Publications by Tag
The following tags appear in the publications listed in the review:
Agent, Agentic, Applications, Attention Mechanism, BERT, Bias Mitigation, Distillation, Efficiency And Optimization, Ethics And Bias, Fairness, Few Shot, Fine Tuning, GPT, Has Code, In Context Learning, Interpretability And Explainability, Language Modeling, Large Scale Training, Masked Language Model, Merging, Model Architecture, Multimodal Models, Pretraining Methods, Prompting, Pruning, Quantization, RAG, Reinforcement Learning, Responsible AI, Scaling Laws, Security, Survey Paper, TACL, Tokenization, Tools, Training Techniques, Transformer
Tags
Below is a list of all tags and their related papers.
🏷 Agent
- Egothink: Evaluating First-person Perspective Thinking Capability Of Vision-language Models Cheng Sijie, Guo Zhicheng, Wu Jingwen, Fang Kechen, Li Peng, Liu Huaping, Liu Yang
- Chessgpt: Bridging Policy Learning And Language Modeling Feng Xidong, Luo Yicheng, Wang Ziyan, Tang Hongrui, Yang Mengyue, Shao Kun, Mguni David, Du Yali, Wang Jun
- Exposing Limitations Of Language Model Agents In Sequential-task Compositions On The Web Furuta Hiroki, Matsuo Yutaka, Faust Aleksandra, Gur Izzeddin
- Improving Knowledge Extraction From Llms For Task Learning Through Agent Analysis Kirk James R., Wray Robert E., Lindes Peter, Laird John E.
- M3dbench: Let's Instruct Large Models With Multi-modal 3D Prompts Li Mingsheng, Chen Xin, Zhang Chi, Chen Sijin, Zhu Hongyuan, Yin Fukun, Yu Gang, Chen Tao
- Testing Language Model Agents Safely In The Wild Naihin Silen, Atkinson David, Green Marc, Hamadi Merwane, Swift Craig, Schonholtz Douglas, Kalai Adam Tauman, Bau David
- Kwaiagents: Generalized Information-seeking Agent System With Large Language Models Pan Haojie, Zhai Zepeng, Yuan Hao, Lv Yaojia, Fu Ruiji, Liu Ming, Wang Zhongyuan, Qin Bing
- Experiential Co-learning Of Software-developing Agents Qian Chen, Dang Yufan, Li Jiahao, Liu Wei, Xie Zihao, Wang Yifei, Chen Weize, Yang Cheng, Cong Xin, Che Xiaoyin, Liu Zhiyuan, Sun Maosong
- Taskweaver: A Code-first Agent Framework Qiao Bo, Li Liqun, Zhang Xu, He Shilin, Kang Yu, Zhang Chaoyun, Yang Fangkai, Dong Hang, Zhang Jue, Wang Lu, Ma Minghua, Zhao Pu, Qin Si, Qin Xiaoting, Du Chao, Xu Yong, Lin Qingwei, Rajmohan Saravan, Zhang Dongmei
- Taskbench: Benchmarking Large Language Models For Task Automation Shen Yongliang, Song Kaitao, Tan Xu, Zhang Wenqi, Ren Kan, Yuan Siyu, Lu Weiming, Li Dongsheng, Zhuang Yueting
- Adaplanner: Adaptive Planning From Feedback With Language Models Sun Haotian, Zhuang Yuchen, Kong Lingkai, Dai Bo, Zhang Chao
- Corex: Pushing The Boundaries Of Complex Reasoning Through Multi-model Collaboration Sun Qiushi, Yin Zhangyue, Li Xiang, Wu Zhiyong, Qiu Xipeng, Kong Lingpeng
- Auto-gpt For Online Decision Making: Benchmarks And Additional Opinions Yang Hui, Yue Sifu, He Yunzhong
- Finmem: A Performance-enhanced LLM Trading Agent With Layered Memory And Character Design Yu Yangyang, Li Haohang, Chen Zhi, Jiang Yuechen, Li Yang, Zhang Denghui, Liu Rong, Suchow Jordan W., Khashanah Khaldoun
- Large Language Models Illuminate A Progressive Pathway To Artificial Healthcare Assistant: A Review Yuan Mingze, Bao Peng, Yuan Jiajia, Shen Yunhao, Chen Zifan, Xie Yi, Zhao Jie, Chen Yang, Zhang Li, Shen Lin, Dong Bin
- Language Agent Tree Search Unifies Reasoning Acting And Planning In Language Models Zhou Andy, Yan Kai, Shlapentokh-rothman Michal, Wang Haohan, Wang Yu-xiong
- Large Language Model As A Policy Teacher For Training Reinforcement Learning Agents Zhou Zihao, Hu Bin, Zhao Chenyang, Zhang Pu, Liu Bin
- Exploring Autonomous Agents Through The Lens Of Large Language Models: A Review Barua Saikat
- STARLING: Self-supervised Training Of Text-based Reinforcement Learning Agent With Large Language Models Basavatia Shreyas, Murugesan Keerthiram, Ratnakar Shivam
- NESTFUL: A Benchmark For Evaluating Llms On Nested Sequences Of API Calls Basu Kinjal, Abdelaziz Ibrahim, Bradford Kelsey, Crouse Maxwell, Kate Kiran, Kumaravel Sadhana, Goyal Saurabh, Munawar Asim, Rizk Yara, Wang Xin, Lastras Luis, Kapanipathi Pavan
- Workarena++: Towards Compositional Planning And Reasoning-based Common Knowledge Work Tasks Boisvert Léo, Thakkar Megh, Gasse Maxime, Caccia Massimo, De Chezelles Thibault Le Sellier, Cappart Quentin, Chapados Nicolas, Lacoste Alexandre, Drouin Alexandre
- Can We Rely On LLM Agents To Draft Long-horizon Plans? Let's Take Travelplanner As An Example Chen Yanan, Pesaranghader Ali, Sadhu Tanmana, Yi Dong Hoon
- Professional Agents -- Evolving Large Language Models Into Autonomous Experts With Human-level Competencies Chu Zhixuan, Wang Yan, Zhu Feng, Yu Lu, Li Longfei, Gu Jinjie
- Informed AI Regulation: Comparing The Ethical Frameworks Of Leading LLM Chatbots Using An Ethics-based Audit To Assess Moral Reasoning And Normative Values Chun Jon, Elkins Katherine
- Jailbreaking Text-to-image Models With Llm-based Agents Dong Yingkai, Li Zheng, Meng Xiangtao, Yu Ning, Guo Shanqing
- Towards A Copilot In BIM Authoring Tool Using A Large Language Model-based Agent For Intelligent Human-machine Interaction Du Changyu, Nousias Stavros, Borrmann André
- An Extremely Data-efficient And Generative Llm-based Reinforcement Learning Agent For Recommenders Feng Shuang, Feng Grace
- Large Language Model-based Human-agent Collaboration For Complex Task Solving Feng Xueyang, Chen Zhi-yuan, Qin Yujia, Lin Yankai, Chen Xu, Liu Zhiyuan, Wen Ji-rong
- Langsuite: Planning, Controlling And Interacting With Large Language Models In Embodied Text Environments Jia Zixia, Wang Mengmeng, Tong Baichen, Zhu Song-chun, Zheng Zilong
- Omniact: A Dataset And Benchmark For Enabling Multimodal Generalist Autonomous Agents For Desktop And Web Kapoor Raghav, Butala Yash Parag, Russak Melisa, Koh Jing Yu, Kamble Kiran, Alshikh Waseem, Salakhutdinov Ruslan
- Rethinking Chatgpt's Success: Usability And Cognitive Behaviors Enabled By Auto-regressive Llms' Prompting Li Xinzhe, Liu Ming
- Towards Automated Data Sciences With Natural Language And Sagecopilot: Practices And Lessons Learned Liao Yuan, Bian Jiang, Yun Yuhui, Wang Shuo, Zhang Yubo, Chu Jiaming, Wang Tao, Li Kewei, Li Yuchen, Li Xuhong, Ji Shilei, Xiong Haoyi
- Graph-enhanced Large Language Models In Asynchronous Plan Reasoning Lin Fangru, La Malfa Emanuele, Hofmann Valentin, Yang Elle Michelle, Cohn Anthony, Pierrehumbert Janet B.
- Arondight: Red Teaming Large Vision Language Models With Auto-generated Multi-modal Jailbreak Prompts Liu Yi, Cai Chengjun, Zhang Xiaoli, Yuan Xingliang, Wang Cong
- Agentlite: A Lightweight Library For Building And Advancing Task-oriented LLM Agent System Liu Zhiwei, Yao Weiran, Zhang Jianguo, Yang Liangwei, Liu Zuxin, Tan Juntao, Choubey Prafulla K., Lan Tian, Wu Jason, Wang Huan, Heinecke Shelby, Xiong Caiming, Savarese Silvio
- Mental Modeling Of Reinforcement Learning Agents By Language Models Lu Wenhao, Zhao Xufeng, Spisak Josua, Lee Jae Hee, Wermter Stefan
- The Conversation Is The Command: Interacting With Real-world Autonomous Robot Through Natural Language Nwankwo Linus, Rueckert Elmar
- Iterative Experience Refinement Of Software-developing Agents Qian Chen, Li Jiahao, Dang Yufan, Liu Wei, Wang Yifei, Xie Zihao, Chen Weize, Yang Cheng, Zhang Yingli, Liu Zhiyuan, Sun Maosong
- Athena: Safe Autonomous Agents With Verbal Contrastive Learning Sadhu Tanmana, Pesaranghader Ali, Chen Yanan, Yi Dong Hoon
- Ehragent: Code Empowers Large Language Models For Few-shot Complex Tabular Reasoning On Electronic Health Records Shi Wenqi, Xu Ran, Zhuang Yuchen, Yu Yue, Zhang Jieyu, Wu Hang, Zhu Yuanda, Ho Joyce, Yang Carl, Wang May D.
- Natural Language Programming In Medicine: Administering Evidence Based Clinical Workflows With Autonomous Agents Powered By Generative Large Language Models Vaid Akhil, Lampert Joshua, Lee Juhee, Sawant Ashwin, Apakama Donald, Sakhuja Ankit, Soroush Ali, Bick Sarah, Abbott Ethan, Gomez Hernando, Hadley Michael, Lee Denise, Landi Isotta, Duong Son Q, Bussola Nicole, Nabeel Ismail, Muehlstedt Silke, Freeman Robert, Kovatch Patricia, Carr Brendan, Wang Fei, Glicksberg Benjamin, Argulian Edgar, Lerakis Stamatios, Khera Rohan, Reich David L., Kraft Monica, Charney Alexander, Nadkarni Girish
- Instruction Following With Goal-conditioned Reinforcement Learning In Virtual Environments Volovikova Zoya, Skrynnik Alexey, Kuderov Petr, Panov Aleksandr I.
- Can Large Language Models Understand Uncommon Meanings Of Common Words? Wu Jinyang, Che Feihu, Zheng Xinxin, Zhang Shuai, Jin Ruihan, Nie Shuai, Shao Pengpeng, Tao Jianhua
- Converging Paradigms: The Synergy Of Symbolic And Connectionist AI In Llm-empowered Autonomous Agents Xiong Haoyi, Wang Zhiyuan, Li Xuhong, Bian Jiang, Xie Zeke, Mumtaz Shahid, Barnes Laura E.
- Agentohana: Design Unified Data And Training Pipeline For Effective Agent Learning Zhang Jianguo, Lan Tian, Murthy Rithesh, Liu Zhiwei, Yao Weiran, Tan Juntao, Hoang Thai, Yang Liangwei, Feng Yihao, Liu Zuxin, Awalgaonkar Tulika, Niebles Juan Carlos, Savarese Silvio, Heinecke Shelby, Wang Huan, Xiong Caiming
- Xlam: A Family Of Large Action Models To Empower AI Agent Systems Zhang Jianguo, Lan Tian, Zhu Ming, Liu Zuxin, Hoang Thai, Kokane Shirley, Yao Weiran, Tan Juntao, Prabhakar Akshara, Chen Haolin, Liu Zhiwei, Feng Yihao, Awalgaonkar Tulika, Murthy Rithesh, Hu Eric, Chen Zeyuan, Xu Ran, Niebles Juan Carlos, Heinecke Shelby, Wang Huan, Savarese Silvio, Xiong Caiming
🏷 Agentic
- Visual Dialog Das Abhishek, Kottur Satwik, Gupta Khushi, Singh Avi, Yadav Deshraj, Moura José M. F., Parikh Devi, Batra Dhruv
- Towards Neural Machine Translation With Latent Tree Attention Bradbury James, Socher Richard
- A Copy-augmented Sequence-to-sequence Architecture Gives Good Performance On Task-oriented Dialogue Eric Mihail, Manning Christopher D.
- Reinforced Mnemonic Reader For Machine Reading Comprehension Hu Minghao, Peng Yuxing, Huang Zhen, Qiu Xipeng, Wei Furu, Zhou Ming
- A Practical Approach To Dialogue Response Generation In Closed Domains Lu Yichao, Keung Phillip, Zhang Shaonan, Sun Jason, Bhardwaj Vikas
- Parlai: A Dialog Research Software Platform Miller Alexander H., Feng Will, Fisch Adam, Lu Jiasen, Batra Dhruv, Bordes Antoine, Parikh Devi, Weston Jason
- An Empirical Analysis Of Multiple-turn Reasoning Strategies In Reading Comprehension Tasks Shen Yelong, Liu Xiaodong, Duh Kevin, Gao Jianfeng
- Are You Talking To Me? Reasoned Visual Dialog Generation Through Adversarial Learning Wu Qi, Wang Peng, Shen Chunhua, Reid Ian, Hengel Anton Van Den
- Mojitalk: Generating Emotional Responses At Scale Zhou Xianda, Wang William Yang
- A Knowledge-grounded Multimodal Search-based Conversational Agent Agarwal Shubham, Dusek Ondrej, Konstas Ioannis, Rieser Verena
- Attention Based Natural Language Grounding By Navigating Virtual Environment B Akilesh, Sinha Abhishek, Sarkar Mausoom, Krishnamurthy Balaji
- Achieving Fluency And Coherency In Task-oriented Dialog Gangadharaiah Rashmi, Narayanaswamy Balakrishnan, Elkan Charles
- Neural Machine Translation With Adequacy-oriented Learning Kong Xiang, Tu Zhaopeng, Shi Shuming, Hovy Eduard, Zhang Tong
- Answerer In Questioner's Mind: Information Theoretic Approach To Goal-oriented Visual Dialog Lee Sang-woo, Heo Yu-jung, Zhang Byoung-tak
- Polite Dialogue Generation Without Parallel Data Niu Tong, Bansal Mohit
- Recursive Visual Attention In Visual Dialog Niu Yulei, Zhang Hanwang, Zhang Manli, Zhang Jianhong, Lu Zhiwu, Wen Ji-rong
- Extending Neural Generative Conversational Model Using External Knowledge Sources Parthasarathi Prasanna, Pineau Joelle
- Taskmaster-1: Toward A Realistic And Diverse Dialog Dataset Byrne Bill, Krishnamoorthi Karthik, Sankar Chinnadhurai, Neelakantan Arvind, Duckworth Daniel, Yavuz Semih, Goodrich Ben, Dubey Amit, Cedilnik Andy, Kim Kyu-young
- DMRM: A Dual-channel Multi-hop Reasoning Model For Visual Dialog Chen Feilong, Meng Fandong, Xu Jiaming, Li Peng, Xu Bo, Zhou Jie
- Reinforcement Learning Based Graph-to-sequence Model For Natural Question Generation Chen Yu, Wu Lingfei, Zaki Mohammed J.
- Contrastive Multi-document Question Generation Cho Woon Sang, Zhang Yizhe, Rao Sudha, Celikyilmaz Asli, Xiong Chenyan, Gao Jianfeng, Wang Mengdi, Dolan Bill
- Deep Learning Based Chatbot Models Csaky Richard
- An Empirical Comparison On Imitation Learning And Reinforcement Learning For Paraphrase Generation Du Wanyu, Ji Yangfeng
- Episodic Memory Reader: Learning What To Remember For Question Answering From Streaming Data Han Moonsu, Kang Minki, Jung Hyunwoo, Hwang Sung Ju
- Evaluating Rewards For Question Generation Models Hosking Tom, Riedel Sebastian
- A Corpus-free State2seq User Simulator For Task-oriented Dialogue Hou Yutai, Fang Meng, Che Wanxiang, Liu Ting
- All-in-one Image-grounded Conversational Agents Ju Da, Shuster Kurt, Boureau Y-lan, Weston Jason
- Generative Question Refinement With Deep Reinforcement Learning In Retrieval-based QA System Liu Ye, Zhang Chenwei, Yan Xiaohui, Chang Yi, Yu Philip S.
- Multi-hop Reading Comprehension Via Deep Reinforcement Learning Based Document Traversal Long Alex, Mason Joel, Blair Alan, Wang Wei
- Good, Better, Best: Textual Distractors Generation For Multiple-choice Visual Question Answering Via Reinforcement Learning Lu Jiaying, Ye Xin, Ren Yi, Yang Yezhou
- Stabilizing Transformers For Reinforcement Learning Parisotto Emilio, Song H. Francis, Rae Jack W., Pascanu Razvan, Gulcehre Caglar, Jayakumar Siddhant M., Jaderberg Max, Kaufman Raphael Lopez, Clark Aidan, Noury Seb, Botvinick Matthew M., Heess Nicolas, Hadsell Raia
- Hierarchical Reinforcement Learning For Open-domain Dialog Saleh Abdelrhman, Jaques Natasha, Ghandeharioun Asma, Shen Judy Hanwen, Picard Rosalind
- Do Neural Dialog Systems Use The Conversation History Effectively? An Empirical Study Sankar Chinnadhurai, Subramanian Sandeep, Pal Christopher, Chandar Sarath, Bengio Yoshua
- Generalization In Generation: A Closer Look At Exposure Bias Schmidt Florian
- The Dialogue Dodecathlon: Open-domain Knowledge And Image Grounded Conversational Agents Shuster Kurt, Ju Da, Roller Stephen, Dinan Emily, Boureau Y-lan, Weston Jason
- Multilingual Question Answering From Formatted Text Applied To Conversational Agents Siblini Wissam, Pasqual Charlotte, Lavielle Axel, Challal Mohamed, Cauchois Cyril
- Natural Language Generation Using Reinforcement Learning With External Rewards Srinivasan Vidhushini, Santhanam Sashank, Shaikh Samira
- Transfertransfo: A Transfer Learning Approach For Neural Network Based Conversational Agents Wolf Thomas, Sanh Victor, Chaumond Julien, Delangue Clement
- Review Conversational Reading Comprehension Xu Hu, Liu Bing, Shu Lei, Yu Philip S.
- Making History Matter: History-advantage Sequence Training For Visual Dialog Yang Tianhao, Zha Zheng-jun, Zhang Hanwang
- Consistent Dialogue Generation With Self-supervised Feature Learning Zhang Yizhe, Gao Xiang, Lee Sungjin, Brockett Chris, Galley Michel, Gao Jianfeng, Dolan Bill
- Building Task-oriented Visual Dialog Systems Through Alternative Optimization Between Dialog Policy And Language Generation Zhou Mingyang, Arnold Josh, Yu Zhou
- Vision-language Navigation With Self-supervised Auxiliary Reasoning Tasks Zhu Fengda, Zhu Yi, Chang Xiaojun, Liang Xiaodan
- A Study On Multimodal And Interactive Explanations For Visual Question Answering Alipour Kamran, Schulze Jurgen P., Yao Yi, Ziskind Avi, Burachas Giedrius
- Fluent Response Generation For Conversational Question Answering Baheti Ashutosh, Ritter Alan, Small Kevin
- Co-attentional Transformers For Story-based Video Understanding Bebensee Björn, Zhang Byoung-tak
- Exploring Fluent Query Reformulations With Text-to-text Transformers And Reinforcement Learning Chen Jerry Zikun, Yu Shi, Wang Haoran
- Probing Emergent Semantics In Predictive Agents Via Question Answering Das Abhishek, Carnevale Federico, Merzic Hamza, Rimell Laura, Schneider Rosalia, Abramson Josh, Hung Alden, Ahuja Arun, Clark Stephen, Wayne Gregory, Hill Felix
- Addressing Some Limitations Of Transformers With Feedback Memory Fan Angela, Lavril Thibaut, Grave Edouard, Joulin Armand, Sukhbaatar Sainbayar
- Recurrent Chunking Mechanisms For Long-text Machine Reading Comprehension Gong Hongyu, Shen Yelong, Yu Dian, Chen Jianshu, Yu Dong
- Towards Learning A Generic Agent For Vision-and-language Navigation Via Pre-training Hao Weituo, Li Chunyuan, Li Xiujun, Carin Lawrence, Gao Jianfeng
- Sub-instruction Aware Vision-and-language Navigation Hong Yicong, Rodriguez-opazo Cristian, Wu Qi, Gould Stephen
- Less Is More: Data-efficient Complex Question Answering Over Knowledge Bases Hua Yuncheng, Li Yuan-fang, Qi Guilin, Wu Wei, Zhang Jingyao, Qi Daiqing
- Lambert: Language And Action Learning Using Multimodal BERT Miyazawa Kazuki, Aoki Tatsuya, Horii Takato, Nagai Takayuki
- Avgout: A Simple Output-probability Measure To Eliminate Dull Responses Niu Tong, Bansal Mohit
- Multi-view Attention Network For Visual Dialog Park Sungjin, Whang Taesun, Yoon Yeochan, Lim Heuiseok
- How To Evaluate Your Dialogue System: Probe Tasks As An Alternative For Token-level Evaluation Metrics Parthasarathi Prasanna, Pineau Joelle, Chandar Sarath
- Nlpgym -- A Toolkit For Evaluating RL Agents On Natural Language Processing Tasks Ramamurthy Rajkumar, Sifa Rafet, Bauckhage Christian
- Personalized Query Rewriting In Conversational AI Agents Roshan-ghias Alireza, Mathialagan Clint Solomon, Ponnusamy Pragaash, Mathias Lambert, Guo Chenlei
- VD-BERT: A Unified Vision And Dialog Transformer With BERT Wang Yue, Joty Shafiq, Lyu Michael R., King Irwin, Xiong Caiming, Hoi Steven C. H.
- Exploring Question-specific Rewards For Generating Deep Questions Xie Yuxi, Pan Liangming, Wang Dongzhe, Kan Min-yen, Feng Yansong
- Off-policy Self-critical Training For Transformer In Visual Paragraph Generation Yan Shiyang, Hua Yang, Robertson Neil M.
- Low-resource Knowledge-grounded Dialogue Generation Zhao Xueliang, Wu Wei, Tao Chongyang, Xu Can, Zhao Dongyan, Yan Rui
- Vision-dialog Navigation By Exploring Cross-modal Memory Zhu Yi, Zhu Fengda, Zhan Zhaohuan, Lin Bingqian, Jiao Jianbin, Chang Xiaojun, Liang Xiaodan
- Goal-directed Story Generation: Augmenting Generative Language Models With Reinforcement Learning Alabdulkarim Amal, Li Winston, Martin Lara J., Riedl Mark O.
- Reinforcement Learning For Few-shot Text Generation Adaptation Cheng Pengsen, Dai Jinqiao, Liu Jiamiao, Liu Jiayong, Jia Peng
- Open-domain Conversational Search Assistant With Transformers Ferreira Rafael, Leite Mariana, Semedo David, Magalhaes Joao
- Bertaú: Itaú BERT For Digital Customer Service Finardi Paulo, Viegas José Dié, Ferreira Gustavo T., Mansano Alex F., Caridá Vinicius F.
- An Adversarially-learned Turing Test For Dialog Generation Models Gao Xiang, Zhang Yizhe, Galley Michel, Dolan Bill
- Towards Incremental Transformers: An Empirical Analysis Of Transformer Models For Incremental NLU Kahardipraja Patrick, Madureira Brielen, Schlangen David
- Multimodal Transformer With Variable-length Memory For Vision-and-language Navigation Lin Chuang, Jiang Yi, Cai Jianfei, Qu Lizhen, Haffari Gholamreza, Yuan Zehuan
- Domain-independent User Simulation With Transformers For Task-oriented Dialogue Systems Lin Hsien-chin, Lubis Nurul, Hu Songbo, Van Niekerk Carel, Geishauser Christian, Heck Michael, Feng Shutong, Gašić Milica
- Unified Questioner Transformer For Descriptive Question Generation In Goal-oriented Visual Dialogue Matsumori Shoya, Shingyouchi Kosuke, Abe Yuki, Fukuchi Yosuke, Sugiura Komei, Imai Michita
- Language Models Are Few-shot Butlers Micheli Vincent, Fleuret François
- Software-based Dialogue Systems: Survey, Taxonomy And Challenges Motger Quim, Franch Xavier, Marco Jordi
- Episodic Transformer For Vision-and-language Navigation Pashevich Alexander, Schmid Cordelia, Sun Chen
- Societal Biases In Language Generation: Progress And Challenges Sheng Emily, Chang Kai-wei, Natarajan Premkumar, Peng Nanyun
- Contextual Biasing Of Language Models For Speech Recognition In Goal-oriented Conversational Agents Shenoy Ashish, Bodapati Sravan, Kirchhoff Katrin
- Adapting Long Context NLM For ASR Rescoring In Conversational Agents Shenoy Ashish, Bodapati Sravan, Sunkara Monica, Ronanki Srikanth, Kirchhoff Katrin
- Pre-trained Language Models As Prior Knowledge For Playing Text-based Games Singh Ishika, Singh Gargi, Modi Ashutosh
- Bob: BERT Over BERT For Training Persona-based Dialogue Models From Limited Personalized Data Song Haoyu, Wang Yan, Zhang Kaiyan, Zhang Wei-nan, Liu Ting
- Embodied BERT: A Transformer Model For Embodied, Language-guided Visual Task Completion Suglia Alessandro, Gao Qiaozi, Thomason Jesse, Thattai Govind, Sukhatme Gaurav
- Not All Memories Are Created Equal: Learning To Forget By Expiring Sukhbaatar Sainbayar, Ju Da, Poff Spencer, Roller Stephen, Szlam Arthur, Weston Jason, Fan Angela
- Towards Reinforcement Learning For Pivot-based Neural Machine Translation With Non-autoregressive Transformer Tokarchuk Evgeniia, Rosendahl Jan, Wang Weiyue, Petrushkov Pavel, Lancewicki Tomer, Khadivi Shahram, Ney Hermann
- Learning Better Visual Dialog Agents With Pretrained Visual-linguistic Representation Tu Tao, Ping Qing, Thattai Govind, Tur Gokhan, Natarajan Prem
- Visual Question Rewriting For Increasing Response Rate Wei Jiayi, Li Xilian, Zhang Yi, Wang Xin
- TR-BERT: Dynamic Token Reduction For Accelerating BERT Inference Ye Deming, Lin Yankai, Huang Yufei, Sun Maosong
- Gpt-3-driven Pedagogical Agents For Training Children's Curious Question-asking Skills Abdelghani Rania, Wang Yen-hsiang, Yuan Xingdi, Wang Tong, Lucas Pauline, Sauzéon Hélène, Oudeyer Pierre-yves
- Making Large Language Models Interactive: A Pioneer Study On Supporting Complex Information-seeking Tasks With Implicit Constraints Ahmadvand Ali, Arabzadeh Negar, Kiseleva Julia, Sanz Patricio Figueroa, Deng Xin, Jauhar Sujay, Gamon Michael, Agichtein Eugene, Friend Ned, Aniruddha
- Multimodal Sequential Generative Models For Semi-supervised Language Instruction Following Akuzawa Kei, Iwasawa Yusuke, Matsuo Yutaka
- Improving Alignment Of Dialogue Agents Via Targeted Human Judgements Amelia Glaese, Nat Mcaleese, Maja Trębacz, John Aslanides, Vlad Firoiu, Timo Ewalds, Maribeth Rauh, Laura Weidinger, Martin Chadwick, Phoebe Thacker, Lucy Campbell-gillingham, Jonathan Uesato, Po-sen Huang, Ramona Comanescu, Fan Yang, Abigail See, Sumanth Dathathri, Rory Greig, Charlie Chen, Doug Fritz, Jaume Sanchez Elias, Richard Green, Soňa Mokrá, Nicholas Fernando, Boxi Wu, Rachel Foley, Susannah Young, Iason Gabriel, William Isaac, John Mellor, Demis Hassabis, Koray Kavukcuoglu, Lisa Anne Hendricks, Geoffrey Irving
- Using Cognitive Psychology To Understand GPT-3 Binz Marcel, Schulz Eric
- Large Language Models Can Implement Policy Iteration Brooks Ethan, Walls Logan, Lewis Richard L., Singh Satinder
- A Model-agnostic Data Manipulation Method For Persona-based Dialogue Generation Cao Yu, Bi Wei, Fang Meng, Shi Shuming, Tao Dacheng
- Robust Preference Learning For Storytelling Via Contrastive Reinforcement Learning Castricato Louis, Havrilla Alexander, Matiana Shahbuland, Pieler Michael, Ye Anbang, Yang Ian, Frazier Spencer, Riedl Mark
- DFM: Dialogue Foundation Model For Universal Large-scale Dialogue-oriented Task Learning Chen Zhi, Bao Jijia, Chen Lu, Liu Yuncong, Ma Da, Chen Bei, Wu Mengyue, Zhu Su, Dong Xin, Ge Fujiang, Miao Qingliang, Lou Jian-guang, Yu Kai
- One Agent To Rule Them All: Towards Multi-agent Conversational AI Clarke Christopher, Peper Joseph Joshua, Krishnamurthy Karthik, Talamonti Walter, Leach Kevin, Lasecki Walter, Kang Yiping, Tang Lingjia, Mars Jason
- On Reality And The Limits Of Language Data: Aligning Llms With Human Norms Collier Nigel H., Liu Fangyu, Shareghi Ehsan
- Rlprompt: Optimizing Discrete Text Prompts With Reinforcement Learning Deng Mingkai, Wang Jianyu, Hsieh Cheng-ping, Wang Yihan, Guo Han, Shu Tianmin, Song Meng, Xing Eric P., Hu Zhiting
- Generating Executable Action Plans With Environmentally-aware Language Models Gramopadhye Maitrey, Szafir Daniel
- Bundle MCR: Towards Conversational Bundle Recommendation He Zhankui, Zhao Handong, Yu Tong, Kim Sungchul, Du Fan, Mcauley Julian
- Inner Monologue: Embodied Reasoning Through Planning With Language Models Huang Wenlong, Xia Fei, Xiao Ted, Chan Harris, Liang Jacky, Florence Pete, Zeng Andy, Tompson Jonathan, Mordatch Igor, Chebotar Yevgen, Sermanet Pierre, Brown Noah, Jackson Tomas, Luu Linda, Levine Sergey, Hausman Karol, Ichter Brian
- A New Path: Scaling Vision-and-language Navigation With Synthetic Instructions And Imitation Learning Kamath Aishwarya, Anderson Peter, Wang Su, Koh Jing Yu, Ku Alexander, Waters Austin, Yang Yinfei, Baldridge Jason, Parekh Zarana
- Improving Language Model Prompting In Support Of Semi-autonomous Task Learning Kirk James R., Wray Robert E., Lindes Peter, Laird John E.
- Iterative Vision-and-language Navigation Krantz Jacob, Banerjee Shurjo, Zhu Wang, Corso Jason, Anderson Peter, Lee Stefan, Thomason Jesse
- Evolution Through Large Models Lehman Joel, Gordon Jonathan, Jain Shawn, Ndousse Kamal, Yeh Cathy, Stanley Kenneth O.
- Modeling Coreference Relations In Visual Dialog Li Mingxiao, Moens Marie-francine
- Pre-trained Language Models For Interactive Decision-making Li Shuang, Puig Xavier, Paxton Chris, Du Yilun, Wang Clinton, Fan Linxi, Chen Tao, Huang De-an, Akyürek Ekin, Anandkumar Anima, Andreas Jacob, Mordatch Igor, Torralba Antonio, Zhu Yuke
- ADAPT: Vision-language Navigation With Modality-aligned Action Prompts Lin Bingqian, Zhu Yi, Chen Zicong, Liang Xiwen, Liu Jianzhuang, Liang Xiaodan
- Gentus: Simulating User Behaviour And Language In Task-oriented Dialogues With Generative Transformers Lin Hsien-chin, Geishauser Christian, Feng Shutong, Lubis Nurul, Van Niekerk Carel, Heck Michael, Gašić Milica
- Contextual Transformer For Offline Meta Reinforcement Learning Lin Runji, Li Ye, Feng Xidong, Zhang Zhaowei, Fung Xian Hong Wu, Zhang Haifeng, Wang Jun, Du Yali, Yang Yaodong
- Instruction-following Agents With Multimodal Transformer Liu Hao, Lee Lisa, Lee Kimin, Abbeel Pieter
- A Generative User Simulator With Gpt-based Architecture And Goal State Tracking For Reinforced Multi-domain Dialog Systems Liu Hong, Cai Yucheng, Ou Zhijian, Huang Yi, Feng Junlan
- Improving Personality Consistency In Conversation By Persona Extending Liu Yifan, Wei Wei, Liu Jiayi, Mao Xianling, Fang Rui, Chen Dangyang
- Training Language Models To Follow Instructions With Human Feedback Long Ouyang, Jeff Wu, Xu Jiang, Diogo Almeida, Carroll L. Wainwright, Pamela Mishkin, Chong Zhang, Sandhini Agarwal, Katarina Slama, Alex Ray, John Schulman, Jacob Hilton, Fraser Kelton, Luke Miller, Maddie Simens, Amanda Askell, Peter Welinder, Paul Christiano, Jan Leike, Ryan Lowe
- Quick Starting Dialog Systems With Paraphrase Generation Marceau Louis, Belbahar Raouf, Queudot Marc, Naji Nada, Charton Eric, Meurs Marie-jean
- Do As I Can, Not As I Say: Grounding Language In Robotic Affordances Michael Ahn, Anthony Brohan, Noah Brown, Yevgen Chebotar, Omar Cortes, Byron David, Chelsea Finn, Chuyuan Fu, Keerthana Gopalakrishnan, Karol Hausman, Alex Herzog, Daniel Ho, Jasmine Hsu, Julian Ibarz, Brian Ichter, Alex Irpan, Eric Jang, Rosario Jauregui Ruano, Kyle Jeffrey, Sally Jesmonth, Nikhil J Joshi, Ryan Julian, Dmitry Kalashnikov, Yuheng Kuang, Kuang-huei Lee, Sergey Levine, Yao Lu, Linda Luu, Carolina Parada, Peter Pastor, Jornell Quiambao, Kanishka Rao, Jarek Rettinghouse, Diego Reyes, Pierre Sermanet, Nicolas Sievers, Clayton Tan, Alexander Toshev, Vincent Vanhoucke, Fei Xia, Ted Xiao, Peng Xu, Sichun Xu, Mengyuan Yan, Andy Zeng
- Ignore Previous Prompt: Attack Techniques For Language Models Perez Fábio, Ribeiro Ian
- LINGUIST: Language Model Instruction Tuning To Generate Annotated Utterances For Intent Classification And Slot Tagging Rosenbaum Andy, Soltan Saleh, Hamza Wael, Versley Yannick, Boese Markus
- The Goldilocks Of Pragmatic Understanding: Fine-tuning Strategy Matters For Implicature Resolution By Llms Ruis Laura, Khan Akbir, Biderman Stella, Hooker Sara, Rocktäschel Tim, Grefenstette Edward
- React: Synergizing Reasoning And Acting In Language Models Shunyu Yao, Jeffrey Zhao, Dian Yu, Nan Du, Izhak Shafran, Karthik Narasimhan, Yuan Cao
- Llm-planner: Few-shot Grounded Planning For Embodied Agents With Large Language Models Song Chan Hee, Wu Jiaman, Washington Clayton, Sadler Brian M., Chao Wei-lun, Su Yu
- Chatgpt: The End Of Online Exam Integrity? Susnjak Teo
- The AI Teacher Test: Measuring The Pedagogical Ability Of Blender And GPT-3 In Educational Dialogues Tack Anaïs, Piech Chris
- On The Effect Of Pre-training For Transformer In Different Modality On Offline Reinforcement Learning Takagi Shiro
- Planbench: An Extensible Benchmark For Evaluating Large Language Models On Planning And Reasoning About Change Valmeekam Karthik, Marquez Matthew, Olmo Alberto, Sreedharan Sarath, Kambhampati Subbarao
- Enabling Conversational Interaction With Mobile UI Using Large Language Models Wang Bryan, Li Gang, Li Yang
- EGCR: Explanation Generation For Conversational Recommendation Wen Bingbing, Bu Xiaoning, Shah Chirag
- Language Models As Zero-shot Planners: Extracting Actionable Knowledge For Embodied Agents Wenlong Huang, Pieter Abbeel, Deepak Pathak, Igor Mordatch
- Evaluating And Improving Context Attention Distribution On Multi-turn Response Generation Using Self-contained Distractions Xing Yujie, Gulla Jon Atle
- Active Example Selection For In-context Learning Yiming Zhang, Shi Feng, Chenhao Tan
- KRLS: Improving End-to-end Response Generation In Task Oriented Dialog With Reinforced Keywords Learning Yu Xiao, Wu Qingyang, Qian Kun, Yu Zhou
- Analyzing And Simulating User Utterance Reformulation In Conversational Recommender Systems Zhang Shuo, Wang Mu-chun, Balog Krisztian
- Can Offline Reinforcement Learning Help Natural Language Understanding? Zhang Ziqi, Wang Yile, Zhang Yue, Wang Donglin
- KPT: Keyword-guided Pre-training For Grounded Dialog Generation Zhu Qi, Mi Fei, Zhang Zheng, Wang Yasheng, Li Yitong, Jiang Xin, Liu Qun, Zhu Xiaoyan, Huang Minlie
- Future Sight: Dynamic Story Generation With Large Pretrained Language Models Zimmerman Brian D., Sahu Gaurav, Vechtomova Olga
- I Was Blind But Now I See: Implementing Vision-enabled Dialogue In Social Robots Abbo Giulio Antonio, Belpaeme Tony
- LMRL Gym: Benchmarks For Multi-turn Reinforcement Learning With Language Models Abdulhai Marwa, White Isadora, Snell Charlie, Sun Charles, Hong Joey, Zhai Yuexiang, Xu Kelvin, Levine Sergey
- Bring Your Own KG: Self-supervised Program Synthesis For Zero-shot KGQA Agarwal Dhruv, Das Rajarshi, Khosla Sopan, Gangadharaiah Rashmi
- Can We Trust The Evaluation On Chatgpt? Aiyappa Rachith, An Jisun, Kwak Haewoon, Ahn Yong-yeol
- Rest Meets React: Self-improvement For Multi-step Reasoning LLM Agent Aksitov Renat, Miryoosefi Sobhan, Li Zonglin, Li Daliang, Babayan Sheila, Kopparapu Kavya, Fisher Zachary, Guo Ruiqi, Prakash Sushant, Srinivasan Pranesh, Zaheer Manzil, Yu Felix, Kumar Sanjiv
- An In-depth Look At Gemini's Language Abilities Akter Syeda Nahida, Yu Zichun, Muhamed Aashiq, Ou Tianyue, Bäuerle Alex, Cabrera Ángel Alexander, Dholakia Krish, Xiong Chenyan, Neubig Graham
- RL4F: Generating Natural Language Feedback With Reinforcement Learning For Repairing Model Outputs Akyürek Afra Feyza, Akyürek Ekin, Madaan Aman, Kalyan Ashwin, Clark Peter, Wijaya Derry, Tandon Niket
- Can Language Models Employ The Socratic Method? Experiments With Code Debugging Al-hossami Erfan, Bunescu Razvan, Smith Justin, Teehan Ryan
- Knowledge Of Knowledge: Exploring Known-unknowns Uncertainty With Large Language Models Amayuelas Alfonso, Wong Kyle, Pan Liangming, Chen Wenhu, Wang William
- On Generative Agents In Recommendation An Zhang, Yuxin Chen, Leheng Sheng, Xiang Wang, Tat-seng Chua
- Expel: LLM Agents Are Experiential Learners Andrew Zhao, Daniel Huang, Quentin Xu, Matthieu Lin, Yong-jin Liu, Gao Huang
- Agenttuning: Enabling Generalized Agent Abilities For Llms Aohan Zeng, Mingdao Liu, Rui Lu, Bowen Wang, Xiao Liu, Yuxiao Dong, Jie Tang
- Leancontext: Cost-efficient Domain-specific Question Answering Using Llms Arefeen Md Adnan, Debnath Biplob, Chakradhar Srimat
- Mitigating Open-vocabulary Caption Hallucinations Assaf Ben-kish, Moran Yanuka, Morris Alper, Raja Giryes, Hadar Averbuch-elor
- Llms-augmented Contextual Bandit Baheri Ali, Alm Cecilia O.
- Leftover Lunch: Advantage-based Offline Reinforcement Learning For Language Models Baheti Ashutosh, Lu Ximing, Brahman Faeze, Bras Ronan Le, Sap Maarten, Riedl Mark
- Chatgpt: Applications, Opportunities, And Threats Bahrini Aram, Khamoshifar Mohammadsadra, Abbasimehr Hossein, Riggs Robert J., Esmaeili Maryam, Majdabadkohne Rastin Mastali, Pasehvar Morteza
- Qwen Technical Report Bai Jinze, Bai Shuai, Chu Yunfei, Cui Zeyu, Dang Kai, Deng Xiaodong, Fan Yang, Ge Wenbin, Han Yu, Huang Fei, Hui Binyuan, Ji Luo, Li Mei, Lin Junyang, Lin Runji, Liu Dayiheng, Liu Gao, Lu Chengqiang, Lu Keming, Ma Jianxin, Men Rui, Ren Xingzhang, Ren Xuancheng, Tan Chuanqi, Tan Sinan, Tu Jianhong, Wang Peng, Wang Shijie, Wang Wei, Wu Shengguang, Xu Benfeng, Xu Jin, Yang An, Yang Hao, Yang Jian, Yang Shusheng, Yao Yang, Yu Bowen, Yuan Hongyi, Yuan Zheng, Zhang Jianwei, Zhang Xingxuan, Zhang Yichang, Zhang Zhenru, Zhou Chang, Zhou Jingren, Zhou Xiaohuan, Zhu Tianhang
- Benchmarking LLM Powered Chatbots: Methods And Metrics Banerjee Debarag, Singh Pooja, Avadhanam Arjun, Srivastava Saksham
- Understanding Telecom Language Through Large Language Models Bariah Lina, Zou Hang, Zhao Qiyang, Mouhouche Belkacem, Bader Faouzi, Debbah Merouane
- Testing The Depth Of Chatgpt's Comprehension Via Cross-modal Tasks Based On Ascii-art: Gpt3.5's Abilities In Regard To Recognizing And Generating Ascii-art Are Not Totally Lacking Bayani David
- @ve: A Chatbot For Latin Bendel Oliver, N'diaye Karim
- Language Model Unalignment: Parametric Red-teaming To Expose Hidden Harms And Biases Bhardwaj Rishabh, Poria Soujanya
- Personality Testing Of Large Language Models: Limited Temporal Stability, But Highlighted Prosociality Bodroza Bojana, Dinic Bojana M., Bojic Ljubisa
- Chemcrow: Augmenting Large-language Models With Chemistry Tools Bran Andres M, Cox Sam, Schilter Oliver, Baldassari Carlo, White Andrew D, Schwaller Philippe
- Fine-grained Affective Processing Capabilities Emerging From Large Language Models Broekens Joost, Hilpert Bernhard, Verberne Suzan, Baraka Kim, Gebhard Patrick, Plaat Aske
- Weak-to-strong Generalization: Eliciting Strong Capabilities With Weak Supervision Burns Collin, Izmailov Pavel, Kirchner Jan Hendrik, Baker Bowen, Gao Leo, Aschenbrenner Leopold, Chen Yining, Ecoffet Adrien, Joglekar Manas, Leike Jan, Sutskever Ilya, Wu Jeff
- Systematic Rectification Of Language Models Via Dead-end Analysis Cao Meng, Fatemi Mehdi, Cheung Jackie Chi Kit, Shabanian Samira
- Grounding Large Language Models In Interactive Environments With Online Reinforcement Learning Carta Thomas, Romac Clément, Wolf Thomas, Lamprier Sylvain, Sigaud Olivier, Oudeyer Pierre-yves
- Dialogue Chain-of-thought Distillation For Commonsense-aware Conversational Agents Chae Hyungjoo, Song Yongho, Ong Kai Tzu-iunn, Kwon Taeyoon, Kim Minjin, Yu Youngjae, Lee Dongha, Kang Dongyeop, Yeo Jinyoung
- Clembench: Using Game Play To Evaluate Chat-optimized Language Models As Conversational Agents Chalamalasetti Kranti, Götze Jana, Hakimov Sherzod, Madureira Brielen, Sadler Philipp, Schlangen David
- Chatgpt May Pass The Bar Exam Soon, But Has A Long Way To Go For The Lexglue Benchmark Chalkidis Ilias
- Learning To Generate Better Than Your LLM Chang Jonathan D., Brantley Kiante, Ramamurthy Rajkumar, Misra Dipendra, Sun Wen
- Memgpt: Towards Llms As Operating Systems Charles Packer, Sarah Wooders, Kevin Lin, Vivian Fang, Shishir G. Patil, Ion Stoica, Joseph E. Gonzalez
- Fireact: Toward Language Agent Fine-tuning Chen Baian, Shu Chang, Shareghi Ehsan, Collier Nigel, Narasimhan Karthik, Yao Shunyu
- Chatgpt's One-year Anniversary: Are Open-source Large Language Models Catching Up? Chen Hailin, Jiao Fangkai, Li Xingxuan, Qin Chengwei, Ravaut Mathieu, Zhao Ruochen, Xiong Caiming, Joty Shafiq
- Walking Down The Memory Maze: Beyond Context Limit Through Interactive Reading Chen Howard, Pasunuru Ramakanth, Weston Jason, Celikyilmaz Asli
- Reconcile: Round-table Conference Improves Reasoning Via Consensus Among Diverse Llms Chen Justin Chih-yao, Saha Swarnadeep, Bansal Mohit
- Introspective Tips: Large Language Model For In-context Decision Making Chen Liting, Wang Lu, Dong Hang, Du Yali, Yan Jie, Yang Fangkai, Li Shuang, Zhao Pu, Qin Si, Rajmohan Saravan, Lin Qingwei, Zhang Dongmei
- Driving With Llms: Fusing Object-level Vector Modality For Explainable Autonomous Driving Chen Long, Sinavski Oleg, Hünermann Jan, Karnsund Alice, Willmott Andrew James, Birch Danny, Maund Daniel, Shotton Jamie
- Natural Response Generation For Chinese Reading Comprehension Chen Nuo, Li Hongguang, Bao Yinan, Wang Baoyuan, Li Jia
- Webvln: Vision-and-language Navigation On Websites Chen Qi, Pitawela Dileepa, Zhao Chongyang, Zhou Gengze, Chen Hsiang-ting, Wu Qi
- Asking Before Acting: Gather Information In Embodied Decision Making With Language Models Chen Xiaoyu, Zhang Shenao, Zhang Pushi, Zhao Li, Chen Jianyu
- Llf-bench: Benchmark For Interactive Learning From Language Feedback Cheng Ching-an, Kolobov Andrey, Misra Dipendra, Nie Allen, Swaminathan Adith
- Egothink: Evaluating First-person Perspective Thinking Capability Of Vision-language Models Cheng Sijie, Guo Zhicheng, Wu Jingwen, Fang Kechen, Li Peng, Liu Huaping, Liu Yang
- INSTRUCTEVAL: Towards Holistic Evaluation Of Instruction-tuned Large Language Models Chia Yew Ken, Hong Pengfei, Bing Lidong, Poria Soujanya
- An Integrative Survey On Mental Health Conversational Agents To Bridge Computer Science And Medical Perspectives Cho Young Min, Rai Sunny, Ungar Lyle, Sedoc João, Guntuku Sharath Chandra
- Do Llms Understand Social Knowledge? Evaluating The Sociability Of Large Language Models With Socket Benchmark Choi Minje, Pei Jiaxin, Kumar Sagar, Shu Chang, Jurgens David
- LIMA: Less Is More For Alignment Chunting Zhou, Pengfei Liu, Puxin Xu, Srini Iyer, Jiao Sun, Yuning Mao, Xuezhe Ma, Avia Efrat, Ping Yu, Lili Yu, Susan Zhang, Gargi Ghosh, Mike Lewis, Luke Zettlemoyer, Omer Levy
- Remember What You Did So You Know What To Do Next Ciosici Manuel R., Hedges Alex, Kankanampati Yash, Martin Justin, Freedman Marjorie, Weischedel Ralph
- Cognitively Inspired Components For Social Conversational Agents Clay Alex, Alonso Eduardo, Mondragón Esther
- Reinforcement Learning Fine-tuning Of Language Models Is Biased Towards More Extractable Features Cruz Diogo, Pona Edoardo, Holness-tofts Alex, Schmied Elias, Alonso Víctor Abia, Griffin Charlie, Cirstea Bogdan-ionut
- Ultrafeedback: Boosting Language Models With Scaled AI Feedback Cui Ganqu, Yuan Lifan, Ding Ning, Yao Guanming, He Bingxiang, Zhu Wei, Ni Yuan, Xie Guotong, Xie Ruobing, Lin Yankai, Liu Zhiyuan, Sun Maosong
- Complex QA And Language Models Hybrid Architectures, Survey Daull Xavier, Bellot Patrice, Bruno Emmanuel, Martin Vincent, Murisasco Elisabeth
- Im-promptu: In-context Composition From Image Prompts Dedhia Bhishma, Chang Michael, Snell Jake C., Griffiths Thomas L., Jha Niraj K.
- Prompting And Evaluating Large Language Models For Proactive Dialogues: Clarification, Target-guided, And Non-collaboration Deng Yang, Liao Lizi, Chen Liang, Wang Hongru, Lei Wenqiang, Chua Tat-seng
- Plug-and-play Policy Planner For Large Language Model Powered Dialogue Agents Deng Yang, Zhang Wenxuan, Lam Wai, Ng See-kiong, Chua Tat-seng
- Everything Of Thoughts: Defying The Law Of Penrose Triangle For Thought Generation Ding Ruomeng, Zhang Chaoyun, Wang Lu, Xu Yong, Ma Minghua, Zhang Wei, Qin Si, Rajmohan Saravan, Lin Qingwei, Zhang Dongmei
- Self-agreement: A Framework For Fine-tuning Language Models To Find Agreement Among Diverse Opinions Ding Shiyao, Ito Takayuki
- The Philosopher's Stone: Trojaning Plugins Of Large Language Models Dong Tian, Xue Minhui, Chen Guoxing, Holland Rayne, Meng Yan, Li Shaofeng, Liu Zhen, Zhu Haojin
- PACE: Improving Prompt With Actor-critic Editing For Large Language Model Dong Yihong, Luo Kangcheng, Jiang Xue, Jin Zhi, Li Ge
- Blending Reward Functions Via Few Expert Demonstrations For Faithful And Accurate Knowledge-grounded Dialogue Generation Du Wanyu, Ji Yangfeng
- Improving Factuality And Reasoning In Language Models Through Multiagent Debate Du Yilun, Li Shuang, Torralba Antonio, Tenenbaum Joshua B., Mordatch Igor
- Guiding Pretraining In Reinforcement Learning With Large Language Models Du Yuqing, Watkins Olivia, Wang Zihan, Colas Cédric, Darrell Trevor, Abbeel Pieter, Gupta Abhishek, Andreas Jacob
- Frugal Lms Trained To Invoke Symbolic Solvers Achieve Parameter-efficient Arithmetic Reasoning Dutta Subhabrata, Singh Joykirat, Pandey Ishan, Manchanda Sunny, Chakrabarti Soumen, Chakraborty Tanmoy
- Proximal Policy Optimization Actual Combat: Manipulating Output Tokenizer Length Fan Miao, Hu Chen, Zhou Shuchang
- Trapping LLM Hallucinations Using Tagged Context Prompts Feldman Philip, Foulds James R., Pan Shimei
- Chessgpt: Bridging Policy Learning And Language Modeling Feng Xidong, Luo Yicheng, Wang Ziyan, Tang Hongrui, Yang Mengyue, Shao Kun, Mguni David, Du Yali, Wang Jun
- Llama Rider: Spurring Large Language Models To Explore The Open World Feng Yicheng, Wang Yuxuan, Liu Jiazheng, Zheng Sipeng, Lu Zongqing
- Enhancing Pipeline-based Conversational Agents With Large Language Models Foosherian Mina, Purwins Hendrik, Rathnayake Purna, Alam Touhidul, Teimao Rui, Thoben Klaus-dieter
- Chain-of-thought Hub: A Continuous Effort To Measure Large Language Models' Reasoning Performance Fu Yao, Ou Litu, Chen Mingyu, Wan Yuhao, Peng Hao, Khot Tushar
- Exposing Limitations Of Language Model Agents In Sequential-task Compositions On The Web Furuta Hiroki, Matsuo Yutaka, Faust Aleksandra, Gur Izzeddin
- Strategyllm: Large Language Models As Strategy Generators, Executors, Optimizers, And Evaluators For Problem Solving Gao Chang, Jiang Haiyun, Cai Deng, Shi Shuming, Lam Wai
- ASSISTGUI: Task-oriented Desktop Graphical User Interface Automation Gao Difei, Ji Lei, Bai Zechen, Ouyang Mingyu, Li Peiran, Mao Dongxing, Wu Qinchen, Zhang Weichen, Wang Peiyi, Guo Xiangwu, Wang Hengxu, Zhou Luowei, Shou Mike Zheng
- Making Llama SEE And Draw With SEED Tokenizer Ge Yuying, Zhao Sijie, Zeng Ziyun, Ge Yixiao, Li Chen, Wang Xintao, Shan Ying
- Intention And Context Elicitation With Large Language Models In The Legal Aid Intake Process Goodson Nick, Lu Rongfei
- AMAGO: Scalable In-context Reinforcement Learning For Adaptive Agents Grigsby Jake, Fan Linxi, Zhu Yuke
- Voyager: An Open-ended Embodied Agent With Large Language Models Guanzhi Wang, Yuqi Xie, Yunfan Jiang, Ajay Mandlekar, Chaowei Xiao, Yuke Zhu, Linxi Fan, Anima Anandkumar
- Reinforced Self-training (rest) For Language Modeling Gulcehre Caglar, Paine Tom Le, Srinivasan Srivatsan, Konyushkova Ksenia, Weerts Lotte, Sharma Abhishek, Siddhant Aditya, Ahern Alex, Wang Miaosen, Gu Chenjie, Macherey Wolfgang, Doucet Arnaud, Firat Orhan, De Freitas Nando
- Empowering Working Memory For Large Language Model Agents Guo Jing, Li Nan, Qi Jianchuan, Yang Hang, Li Ruiqiao, Feng Yuzhen, Zhang Si, Xu Ming
- PPTC Benchmark: Evaluating Large Language Models For Powerpoint Task Completion Guo Yiduo, Zhang Zekai, Liang Yaobo, Zhao Dongyan, Duan Nan
- A Real-world Webagent With Planning, Long Context Understanding, And Program Synthesis Gur Izzeddin, Furuta Hiroki, Huang Austin, Safdari Mustafa, Matsuo Yutaka, Eck Douglas, Faust Aleksandra
- Deception Abilities Emerged In Large Language Models Hagendorff Thilo
- Llms As Visual Explainers: Advancing Image Classification With Evolving Visual Descriptions Han Songhao, Zhuo Le, Liao Yue, Liu Si
- Personalisation Within Bounds: A Risk Taxonomy And Policy Framework For The Alignment Of Large Language Models With Personalised Feedback Hannah Rose Kirk, Bertie Vidgen, Paul Röttger, Scott A. Hale
- Blockwise Parallel Transformer For Large Context Models Hao Liu, Pieter Abbeel
- CHAI-DT: A Framework For Prompting Conversational Generative AI Agents To Actively Participate In Co-creation Harwood Brandon
- Chatgpt As Your Personal Data Scientist Hassan Md Mahadi, Knipper Alex, Santu Shubhra Kanti Karmaker
- Large Language Models Are Biased To Overestimate Profoundness Herrera-berg Eugenio, Browne Tomás Vergara, León-villagrá Pablo, Vives Marc-lluís, Calderon Cristian Buc
- Assessing The Efficacy Of Large Language Models In Generating Accurate Teacher Responses Hicke Yann, Masand Abhishek, Guo Wentao, Gangavarapu Tushaar
- L2MAC: Large Language Model Automatic Computer For Extensive Code Generation Holt Samuel, Luyten Max Ruiz, Van Der Schaar Mihaela
- Zero-shot Goal-directed Dialogue Via RL On Imagined Conversations Hong Joey, Levine Sergey, Dragan Anca
- Cogagent: A Visual Language Model For GUI Agents Hong Wenyi, Wang Weihan, Lv Qingsong, Xu Jiazheng, Yu Wenmeng, Ji Junhui, Wang Yan, Wang Zihan, Zhang Yuxuan, Li Juanzi, Xu Bin, Dong Yuxiao, Ding Ming, Tang Jie
- AMERICANO: Argument Generation With Discourse-driven Decomposition And Agent Interaction Hu Zhe, Chan Hou Pong, Yin Yu
- Agentcoder: Multi-agent-based Code Generation With Iterative Testing And Optimisation Huang Dong, Zhang Jie M., Luck Michael, Bu Qingwen, Qing Yuhao, Cui Heming
- Acegpt, Localizing Large Language Models In Arabic Huang Huang, Yu Fei, Zhu Jianqing, Sun Xuening, Cheng Hao, Song Dingjie, Chen Zhihong, Alharthi Abdulmohsen, An Bang, He Juncai, Liu Ziche, Zhang Zhiyi, Chen Junying, Li Jianquan, Wang Benyou, Zhang Lian, Sun Ruoyu, Wan Xiang, Li Haizhou, Xu Jinchao
- Who Is Chatgpt? Benchmarking Llms' Psychological Portrayal Using Psychobench Huang Jen-tse, Wang Wenxuan, Li Eric John, Lam Man Ho, Ren Shujie, Yuan Youliang, Jiao Wenxiang, Tu Zhaopeng, Lyu Michael R.
- Grounded Decoding: Guiding Text Generation With Grounded Models For Embodied Agents Huang Wenlong, Xia Fei, Shah Dhruv, Driess Danny, Zeng Andy, Lu Yao, Florence Pete, Mordatch Igor, Levine Sergey, Hausman Karol, Ichter Brian
- Advancing Transformer Architecture In Long-context Large Language Models: A Comprehensive Survey Huang Yunpeng, Xu Jingwei, Lai Junyu, Jiang Zixu, Chen Taolue, Li Zenan, Yao Yuan, Ma Xiaoxing, Yang Lijuan, Chen Hao, Li Shupeng, Zhao Penghao
- Evaluating And Enhancing Large Language Models For Conversational Reasoning On Knowledge Graphs Huang Yuxuan, Shi Lida, Liu Anqi, Xu Hao
- Instructed To Bias: Instruction-tuned Language Models Exhibit Emergent Cognitive Bias Itzhak Itay, Stanovsky Gabriel, Rosenfeld Nir, Belinkov Yonatan
- Coarse-tuning Models Of Code With Reinforcement Learning Feedback Jain Abhinav, Adiole Chima, Chaudhuri Swarat, Reps Thomas, Jermaine Chris
- Cotran: An Llm-based Code Translator Using Reinforcement Learning With Feedback From Compiler And Symbolic Execution Jana Prithwish, Jha Piyush, Ju Haoyang, Kishore Gautham, Mahajan Aryan, Ganesh Vijay
- Personalized Soups: Personalized Large Language Model Alignment Via Post-hoc Parameter Merging Jang Joel, Kim Seungone, Lin Bill Yuchen, Wang Yizhong, Hessel Jack, Zettlemoyer Luke, Hajishirzi Hannaneh, Choi Yejin, Ammanabrolu Prithviraj
- Chatgpt Is Fun, But It Is Not Funny! Humor Is Still Challenging Large Language Models Jentzsch Sophie, Kersting Kristian
- Chatbot Is Not All You Need: Information-rich Prompting For More Realistic Responses Jeong Seokhoon, Makhmud Assentay
- Chit-chat Or Deep Talk: Prompt Engineering For Process Mining Jessen Urszula, Sroka Michal, Fahland Dirk
- Personallm: Investigating The Ability Of Large Language Models To Express Personality Traits Jiang Hang, Zhang Xiajie, Cao Xubo, Breazeal Cynthia, Roy Deb, Kabbara Jad
- Prompt-based Length Controlled Generation With Reinforcement Learning Jie Renlong, Meng Xiaojun, Shang Lifeng, Jiang Xin, Liu Qun
- Better To Ask In English: Cross-lingual Evaluation Of Large Language Models For Healthcare Queries Jin Yiqiao, Chandra Mohit, Verma Gaurav, Hu Yibo, De Choudhury Munmun, Kumar Srijan
- Robust Training For Conversational Question Answering Models With Reinforced Reformulation Generation Kaiser Magdalena, Roy Rishiraj Saha, Weikum Gerhard
- Dialmat: Dialogue-enabled Transformer With Moment-based Adversarial Training Kaneda Kanta, Korekata Ryosuke, Wada Yuiga, Nagashima Shunya, Kambara Motonari, Iioka Yui, Matsuo Haruka, Imai Yuto, Nishimura Takayuki, Sugiura Komei
- Chatmof: An Autonomous AI System For Predicting And Generating Metal-organic Frameworks Kang Yeonghun, Kim Jihan
- Lagr-seq: Language-guided Reinforcement Learning With Sample-efficient Querying Karimpanal Thommen George, Semage Laknath Buddhika, Rana Santu, Le Hung, Tran Truyen, Gupta Sunil, Venkatesh Svetha
- Dspy: Compiling Declarative Language Model Calls Into Self-improving Pipelines Khattab Omar, Singhvi Arnav, Maheshwari Paridhi, Zhang Zhiyuan, Santhanam Keshav, Vardhamanan Sri, Haq Saiful, Sharma Ashutosh, Joshi Thomas T., Moazam Hanna, Miller Heather, Zaharia Matei, Potts Christopher
- Reliability Check: An Analysis Of Gpt-3's Response To Sensitive Topics And Prompt Wording Khatun Aisha, Brown Daniel G.
- Language Models Can Solve Computer Tasks Kim Geunwoo, Baldi Pierre, Mcaleer Stephen
- Aligning Large Language Models Through Synthetic Feedback Kim Sungdong, Bae Sanghwan, Shin Jamin, Kang Soyoung, Kwak Donghyun, Yoo Kang Min, Seo Minjoon
- Exploiting Language Models As A Source Of Knowledge For Cognitive Agents Kirk James R., Wray Robert E., Laird John E.
- Improving Knowledge Extraction From Llms For Task Learning Through Agent Analysis Kirk James R., Wray Robert E., Lindes Peter, Laird John E.
- Understanding Catastrophic Forgetting In Language Models Via Implicit Inference Kotha Suhas, Springer Jacob Mitchell, Raghunathan Aditi
- Enhancing Large Language Models With Climate Resources Kraus Mathias, Bingler Julia Anna, Leippold Markus, Schimanski Tobias, Senni Chiara Colesanti, Stammbach Dominik, Vaghefi Saeid Ashraf, Webersinke Nicolas
- Prompt Generate Train (PGT): Few-shot Domain Adaption Of Retrieval Augmented Generation Models For Open Book Question-answering Krishna C. S.
- Jotr: A Joint Transformer And Reinforcement Learning Framework For Dialog Policy Learning Kwan Wai-chung, Wang Huimin, Wang Hongru, Wang Zezhong, Wu Xian, Zheng Yefeng, Wong Kam-fai
- Reward Design With Language Models Kwon Minae, Xie Sang Michael, Bullard Kalesha, Sadigh Dorsa
- Openassistant Conversations -- Democratizing Large Language Model Alignment Köpf Andreas, Kilcher Yannic, Von Rütte Dimitri, Anagnostidis Sotiris, Tam Zhi-rui, Stevens Keith, Barhoum Abdullah, Duc Nguyen Minh, Stanley Oliver, Nagyfi Richárd, Es Shahul, Suri Sameer, Glushkov David, Dantuluri Arnav, Maguire Andrew, Schuhmann Christoph, Nguyen Huu, Mattick Alexander
- Exploring Large Language Models To Facilitate Variable Autonomy For Human-robot Teaming Lakhnati Younes, Pascher Max, Gerken Jens
- A Data Source For Reasoning Embodied Agents Lanchantin Jack, Sukhbaatar Sainbayar, Synnaeve Gabriel, Sun Yuxuan, Srinet Kavya, Szlam Arthur
- Challenges Of Gpt-3-based Conversational Agents For Healthcare Lechner Fabian, Lahnala Allison, Welch Charles, Flek Lucie
- Prompted Llms As Chatbot Modules For Long Open-domain Conversation Lee Gibbeum, Hartmann Volker, Park Jongho, Papailiopoulos Dimitris, Lee Kangwook
- RLAIF Vs. RLHF: Scaling Reinforcement Learning From Human Feedback With AI Feedback Lee Harrison, Phatale Samrat, Mansoor Hassan, Mesnard Thomas, Ferret Johan, Lu Kellie, Bishop Colton, Hall Ethan, Carbune Victor, Rastogi Abhinav, Prakash Sushant
- Violation Of Expectation Via Metacognitive Prompting Reduces Theory Of Mind Prediction Error In Large Language Models Leer Courtland, Trost Vincent, Voruganti Vineeth
- Lora Fine-tuning Efficiently Undoes Safety Training In Llama 2-chat 70B Lermen Simon, Rogers-smith Charlie, Ladish Jeffrey
- Sequential Monte Carlo Steering Of Large Language Models Using Probabilistic Programs Lew Alexander K., Zhi-xuan Tan, Grand Gabriel, Mansinghka Vikash K.
- Learning To Rewrite Prompts For Personalized Text Generation Li Cheng, Zhang Mingyang, Mei Qiaozhu, Kong Weize, Bendersky Michael
- Dialogue For Prompting: A Policy-gradient-based Discrete Prompt Generation For Few-shot Learning Li Chengzhengxu, Liu Xiaoming, Wang Yichen, Li Duyi, Lan Yu, Shen Chao
- Modelscope-agent: Building Your Customizable Agent System With Open-source Large Language Models Li Chenliang, Chen Hehong, Yan Ming, Shen Weizhou, Xu Haiyang, Wu Zhikai, Zhang Zhicheng, Zhou Wenmeng, Chen Yingda, Cheng Chen, Shi Hongzhu, Zhang Ji, Huang Fei, Zhou Jingren
- Tool-augmented Reward Modeling Li Lei, Chai Yekun, Wang Shuohuan, Sun Yu, Tian Hao, Zhang Ningyu, Wu Hua
- M3dbench: Let's Instruct Large Models With Multi-modal 3D Prompts Li Mingsheng, Chen Xin, Zhang Chi, Chen Sijin, Zhu Hongyuan, Yin Fukun, Yu Gang, Chen Tao
- A Zero-shot Language Agent For Computer Control With Structured Reflection Li Tao, Li Gang, Deng Zhiwei, Wang Bryan, Li Yang
- Prompting Large Language Models For Counterfactual Generation: An Empirical Study Li Yongqi, Xu Mayi, Miao Xin, Zhou Shen, Qian Tieyun
- Posgen: Personalized Opening Sentence Generation For Online Insurance Sales Li Yu, Zhang Yi, Wu Weijia, Zhou Zimu, Li Qiang
- RAIN: Your Language Models Can Align Themselves Without Finetuning Li Yuhui, Wei Fangyun, Zhao Jinjing, Zhang Chao, Zhang Hongyang
- Guiding Large Language Models Via Directional Stimulus Prompting Li Zekun, Peng Baolin, He Pengcheng, Galley Michel, Gao Jianfeng, Yan Xifeng
- Batgpt: A Bidirectional Autoregessive Talker From Generative Pre-trained Transformer Li Zuchao, Zhang Shitou, Zhao Hai, Yang Yifei, Yang Dongjie
- Towards End-to-end Embodied Decision Making Via Multi-modal Large Language Model: Explorations With Gpt4-vision And Beyond Liang Chen, Yichi Zhang, Shuhuai Ren, Haozhe Zhao, Zefan Cai, Yuchi Wang, Peiyi Wang, Tianyu Liu, Baobao Chang
- Encouraging Divergent Thinking In Large Language Models Through Multi-agent Debate Liang Tian, He Zhiwei, Jiao Wenxiang, Wang Xing, Wang Rui, Yang Yujiu, Tu Zhaopeng, Shi Shuming
- Modeling Complex Mathematical Reasoning Via Large Language Model Based Mathagent Liao Haoran, Du Qinyi, Hu Shaohua, He Hao, Xu Yanyan, Tian Jidong, Jin Yaohui
- Opening Up Chatgpt: Tracking Openness, Transparency, And Accountability In Instruction-tuned Text Generators Liesenfeld Andreas, Lopez Alianda, Dingemanse Mark
- The Unlocking Spell On Base Llms: Rethinking Alignment Via In-context Learning Lin Bill Yuchen, Ravichander Abhilasha, Lu Ximing, Dziri Nouha, Sclar Melanie, Chandu Khyathi, Bhagavatula Chandra, Choi Yejin
- Learning To Model The World With Language Lin Jessy, Du Yuqing, Watkins Olivia, Hafner Danijar, Abbeel Pieter, Klein Dan, Dragan Anca
- Agentsims: An Open-source Sandbox For Large Language Model Evaluation Lin Jiaju, Zhao Haoran, Zhang Aochi, Wu Yiting, Ping Huqiuyue, Chen Qin
- Mitigating The Alignment Tax Of RLHF Lin Yong, Lin Hangyu, Xiong Wei, Diao Shizhe, Liu Jianmeng, Zhang Jipeng, Pan Rui, Wang Haoxiang, Hu Wenbin, Zhang Hanning, Dong Hanze, Pi Renjie, Zhao Han, Jiang Nan, Ji Heng, Yao Yuan, Zhang Tong
- Ring Attention With Blockwise Transformers For Near-infinite Context Liu Hao, Zaharia Matei, Abbeel Pieter
- Alltogether: Investigating The Efficacy Of Spliced Prompt For Web Navigation Using Large Language Models Liu Jiarun, Hu Wentao, Zhang Chunhong
- Retrieval-based Knowledge Transfer: An Effective Approach For Extreme Large Language Model Compression Liu Jiduan, Liu Jiahao, Wang Qifan, Wang Jingang, Cai Xunliang, Zhao Dongyan, Wang Ran Lucien, Yan Rui
- Llm-powered Hierarchical Language Agent For Real-time Human-ai Coordination Liu Jijia, Yu Chao, Gao Jiaxuan, Xie Yuqing, Liao Qingmin, Wu Yi, Wang Yu
- Think-in-memory: Recalling And Post-thinking Enable Llms With Long-term Memory Liu Lei, Yang Xiaoyan, Shen Yue, Hu Binbin, Zhang Zhiqiang, Gu Jinjie, Zhang Guannan
- Conversational Question Answering With Reformulations Over Knowledge Graph Liu Lihui, Hill Blaine, Du Boxin, Wang Fei, Tong Hanghang
- Bt-adapter: Video Conversation Is Feasible Without Video Instruction Tuning Liu Ruyang, Li Chen, Ge Yixiao, Shan Ying, Li Thomas H., Li Ge
- Aligning Large Language Models With Human Preferences Through Representation Engineering Liu Wenhao, Wang Xiaohua, Wu Muling, Li Tianlong, Lv Changze, Ling Zixuan, Zhu Jianhao, Zhang Cenyuan, Zheng Xiaoqing, Huang Xuanjing
- Fingpt: Democratizing Internet-scale Data For Financial Large Language Models Liu Xiao-yang, Wang Guoxuan, Yang Hongyang, Zha Daochen
- Summary Of Chatgpt-related Research And Perspective Towards The Future Of Large Language Models Liu Yiheng, Han Tianle, Ma Siyuan, Zhang Jiayue, Yang Yuanyuan, Tian Jiaming, He Hao, Li Antong, He Mengshen, Liu Zhengliang, Wu Zihao, Zhao Lin, Zhu Dajiang, Li Xiang, Qiang Ning, Shen Dingang, Liu Tianming, Ge Bao
- Reason For Future, Act For Now: A Principled Framework For Autonomous LLM Agents With Provable Sample Efficiency Liu Zhihan, Hu Hao, Zhang Shenao, Guo Hongyi, Ke Shuqi, Liu Boyi, Wang Zhaoran
- SAIE Framework: Support Alone Isn't Enough -- Advancing LLM Training With Adversarial Remarks Loem Mengsay, Kaneko Masahiro, Okazaki Naoaki
- Discuss Before Moving: Visual Language Navigation Via Multi-expert Discussions Long Yuxing, Li Xiaoqi, Cai Wenzhe, Dong Hao
- Mathvista: Evaluating Mathematical Reasoning Of Foundation Models In Visual Contexts Lu Pan, Bansal Hritik, Xia Tony, Liu Jiacheng, Li Chunyuan, Hajishirzi Hannaneh, Cheng Hao, Chang Kai-wei, Galley Michel, Gao Jianfeng
- Inference-time Policy Adapters (IPA): Tailoring Extreme-scale Lms Without Fine-tuning Lu Ximing, Brahman Faeze, West Peter, Jang Jaehun, Chandu Khyathi, Ravichander Abhilasha, Qin Lianhui, Ammanabrolu Prithviraj, Jiang Liwei, Ramnath Sahana, Dziri Nouha, Fisher Jillian, Lin Bill Yuchen, Hallinan Skyler, Ren Xiang, Welleck Sean, Choi Yejin
- Multimodal Procedural Planning Via Dual Text-image Prompting Lu Yujie, Lu Pan, Chen Zhiyu, Zhu Wanrong, Wang Xin Eric, Wang William Yang
- Wizardmath: Empowering Mathematical Reasoning For Large Language Models Via Reinforced Evol-instruct Luo Haipeng, Sun Qingfeng, Xu Can, Zhao Pu, Lou Jianguang, Tao Chongyang, Geng Xiubo, Lin Qingwei, Chen Shifeng, Zhang Dongmei
- Exploring Small Language Models With Prompt-learning Paradigm For Efficient Domain-specific Text Classification Luo Hengyu, Liu Peng, Esping Stefan
- YAYI 2: Multilingual Open-source Large Language Models Luo Yin, Kong Qingchao, Xu Nan, Cao Jia, Hao Bao, Qu Baoyu, Chen Bo, Zhu Chao, Zhao Chenyang, Zhang Donglei, Feng Fan, Zhao Feifei, Sun Hailong, Yang Hanxuan, Pan Haojun, Liu Hongyu, Guo Jianbin, Du Jiangtao, Wang Jingyi, Li Junfeng, Sun Lei, Liu Liduo, Dong Lifeng, Liu Lili, Wang Lin, Zhang Liwen, Wang Minzheng, Wang Pin, Yu Ping, Li Qingxiao, Yan Rui, Zou Rui, Li Ruiqun, Huang Taiwen, Wang Xiaodong, Wu Xiaofei, Peng Xin, Zhang Xina, Fang Xing, Xiao Xinglin, Hao Yanni, Dong Yao, Wang Yigang, Liu Ying, Jiang Yongyu, Wang Yungan, Wang Yuqi, Wang Zhangsheng, Yu Zhaoxin, Luo Zhen, Mao Wenji, Wang Lei, Zeng Dajun
- Paperqa: Retrieval-augmented Generative Agent For Scientific Research Lála Jakub, O'donoghue Odhran, Shtedritski Aleksandar, Cox Sam, Rodriques Samuel G., White Andrew D.
- Query Rewriting For Retrieval-augmented Large Language Models Ma Xinbei, Gong Yeyun, He Pengcheng, Zhao Hai, Duan Nan
- Brain In A Vat: On Missing Pieces Towards Artificial General Intelligence In Large Language Models Ma Yuxi, Zhang Chi, Zhu Song-chun
- Video-chatgpt: Towards Detailed Video Understanding Via Large Vision And Language Models Maaz Muhammad, Rasheed Hanoona, Khan Salman, Khan Fahad Shahbaz
- Self-refine: Iterative Refinement With Self-feedback Madaan Aman, Tandon Niket, Gupta Prakhar, Hallinan Skyler, Gao Luyu, Wiegreffe Sarah, Alon Uri, Dziri Nouha, Prabhumoye Shrimai, Yang Yiming, Gupta Shashank, Majumder Bodhisattwa Prasad, Hermann Katherine, Welleck Sean, Yazdanbakhsh Amir, Clark Peter
- Interpreting Learned Feedback Patterns In Large Language Models Marks Luke, Abdullah Amir, Neo Clement, Arike Rauno, Krueger David, Torr Philip, Barez Fazl
- Is Feedback All You Need? Leveraging Natural Language Feedback In Goal-conditioned Reinforcement Learning Mccallum Sabrina, Taylor-davies Max, Albrecht Stefano V., Suglia Alessandro
- An AI Chatbot For Explaining Deep Reinforcement Learning Decisions Of Service-oriented Systems Metzger Andreas, Bartel Jone, Laufer Jan
- Think Before You Act: Unified Policy For Interleaving Language Reasoning With Actions Mezghani Lina, Bojanowski Piotr, Alahari Karteek, Sukhbaatar Sainbayar
- Mindstorms In Natural Language-based Societies Of Mind Mingchen Zhuge, Haozhe Liu, Francesco Faccio, Dylan R. Ashley, Róbert Csordás, Anand Gopalakrishnan, Abdullah Hamdi, Hasan Abed Al Kader Hammoud, Vincent Herrmann, Kazuki Irie, Louis Kirsch, Bing Li, Guohao Li, Shuming Liu, Jinjie Mai, Piotr Piękos, Aditya Ramesh, Imanol Schlag, Weimin Shi, Aleksandar Stanić, Wenyi Wang, Yuhui Wang, Mengmeng Xu, Deng-ping Fan, Bernard Ghanem, Jürgen Schmidhuber
- LLM Aided Semi-supervision For Extractive Dialog Summarization Mishra Nishant, Sahu Gaurav, Calixto Iacer, Abu-hanna Ameen, Laradji Issam H.
- Towards Human-like Spoken Dialogue Generation Between AI Agents From Written Dialogue Mitsui Kentaro, Hono Yukiya, Sawada Kei
- Boosting Theory-of-mind Performance In Large Language Models Via Prompting Moghaddam Shima Rahimi, Honey Christopher J.
- Llms Killed The Script Kiddie: How Agents Supported By Large Language Models Change The Landscape Of Network Threat Testing Moskal Stephen, Laney Sam, Hemberg Erik, O'reilly Una-may
- Superhf: Supervised Iterative Learning From Human Feedback Mukobi Gabriel, Chatain Peter, Fong Su, Windesheim Robert, Kutyniok Gitta, Bhatia Kush, Alberti Silas
- A Brief History Of Prompt: Leveraging Language Models. (through Advanced Prompting) Muktadir Golam Md
- Salespeople Vs Salesbot: Exploring The Role Of Educational Value In Conversational Recommender Systems Murakhovs'ka Lidiya, Laban Philippe, Xie Tian, Xiong Caiming, Wu Chien-sheng
- Unival: Unified Model For Image, Video, Audio And Language Tasks Mustafa Shukor, Corentin Dancette, Alexandre Rame, Matthieu Cord
- Testing Language Model Agents Safely In The Wild Naihin Silen, Atkinson David, Green Marc, Hamadi Merwane, Swift Craig, Schonholtz Douglas, Kalai Adam Tauman, Bau David
- DERA: Enhancing Large Language Model Completions With Dialog-enabled Resolving Agents Nair Varun, Schumacher Elliot, Tso Geoffrey, Kannan Anitha
- On Evaluating The Integration Of Reasoning And Action In LLM Agents With Database Question Answering Nan Linyong, Zhang Ellen, Zou Weijin, Zhao Yilun, Zhou Wenfei, Cohan Arman
- Reinforcement Replaces Supervision: Query Focused Summarization Using Deep Reinforcement Learning Nath Swaroop, Khadilkar Harshad, Bhattacharyya Pushpak
- Enhancing Logical Reasoning In Large Language Models To Facilitate Legal Applications Nguyen Ha-thanh, Fungwacharakorn Wachara, Satoh Ken
- Reflexion: Language Agents With Verbal Reinforcement Learning Noah Shinn, Federico Cassano, Edward Berman, Ashwin Gopinath, Karthik Narasimhan, Shunyu Yao
- Prodigy: A Profile-based Dialogue Generation Dataset Occhipinti Daniela, Tekiroglu Serra Sinem, Guerini Marco
- Deepthought: An Architecture For Autonomous Self-motivated Systems Oliveira Arlindo L., Domingos Tiago, Figueiredo Mário, Lima Pedro U.
- Autoplan: Automatic Planning Of Interactive Decision-making Tasks With Large Language Models Ouyang Siqi, Li Lei
- Kwaiagents: Generalized Information-seeking Agent System With Large Language Models Pan Haojie, Zhai Zepeng, Yuan Hao, Lv Yaojia, Fu Ruiji, Liu Ming, Wang Zhongyuan, Qin Bing
- Let's Reinforce Step By Step Pan Sarah, Lialin Vladislav, Muckatira Sherin, Rumshisky Anna
- Language Model Self-improvement By Reinforcement Learning Contemplation Pang Jing-cheng, Wang Pengyuan, Li Kaiyuan, Chen Xiong-hui, Xu Jiacheng, Zhang Zongzhang, Yu Yang
- Sequencing Matters: A Generate-retrieve-generate Model For Building Conversational Agents Patwardhan Quinn, Yang Grace Hui
- FP8-LM: Training FP8 Large Language Models Peng Houwen, Wu Kan, Wei Yixuan, Zhao Guoshuai, Yang Yuxiang, Liu Ze, Xiong Yifan, Yang Ziyue, Ni Bolin, Hu Jingcheng, Li Ruihang, Zhang Miaosen, Li Chen, Ning Jia, Wang Ruizhe, Zhang Zheng, Liu Shuguang, Chau Joe, Hu Han, Cheng Peng
- Self-driven Grounding: Large Language Model Agents With Automatical Language-aligned Skill Learning Peng Shaohui, Hu Xing, Yi Qi, Zhang Rui, Guo Jiaming, Huang Di, Tian Zikang, Chen Ruizhi, Du Zidong, Guo Qi, Chen Yunji, Li Ling
- LARG, Language-based Automatic Reward And Goal Generation Perez Julien, Proux Denys, Roux Claude, Niemaz Michael
- Let Models Speak Ciphers: Multiagent Debate Through Embeddings Pham Chau, Liu Boyi, Yang Yingxiang, Chen Zhengyu, Liu Tianyi, Yuan Jianbo, Plummer Bryan A., Wang Zhaoran, Yang Hongxia
- LLM Self Defense: By Self Examination, LLMs Know They Are Being Tricked Phute Mansi, Helbling Alec, Hull Matthew, Peng Shengyun, Szyller Sebastian, Cornelius Cory, Chau Duen Horng
- Painter: Teaching Auto-regressive Language Models To Draw Sketches Pourreza Reza, Bhattacharyya Apratim, Panchal Sunny, Lee Mingu, Madan Pulkit, Memisevic Roland
- Adapt: As-needed Decomposition And Planning With Language Models Prasad Archiki, Koller Alexander, Hartmann Mareike, Clark Peter, Sabharwal Ashish, Bansal Mohit, Khot Tushar
- Automatic Prompt Optimization With "gradient Descent" And Beam Search Pryzant Reid, Iter Dan, Li Jerry, Lee Yin Tat, Zhu Chenguang, Zeng Michael
- Neural Task Synthesis For Visual Programming Pădurean Victor-alexandru, Tzannetos Georgios, Singla Adish
- Vidcom: Fast Video Comprehension Through Large Language Models With Multimodal Tools Qi Ji, Ji Kaixuan, Yu Jifan, Wang Duokang, Xu Bin, Hou Lei, Li Juanzi
- PILLOW: Enhancing Efficient Instruction Fine-tuning Via Prompt Matching Qi Zhenting, Tan Xiaoyu, Shi Shaojie, Qu Chao, Xu Yinghui, Qi Yuan
- Experiential Co-learning Of Software-developing Agents Qian Chen, Dang Yufan, Li Jiahao, Liu Wei, Xie Zihao, Wang Yifei, Chen Weize, Yang Cheng, Cong Xin, Che Xiaoyin, Liu Zhiyuan, Sun Maosong
- Taskweaver: A Code-first Agent Framework Qiao Bo, Li Liqun, Zhang Xu, He Shilin, Kang Yu, Zhang Chaoyun, Yang Fangkai, Dong Hang, Zhang Jue, Wang Lu, Ma Minghua, Zhao Pu, Qin Si, Qin Xiaoting, Du Chao, Xu Yong, Lin Qingwei, Rajmohan Saravan, Zhang Dongmei
- March In Chat: Interactive Prompting For Remote Embodied Referring Expression Qiao Yanyuan, Qi Yuankai, Yu Zheng, Liu Jing, Wu Qi
- Embodied Executable Policy Learning With Language-based Scene Summarization Qiu Jielin, Xu Mengdi, Han William, Moon Seungwhan, Zhao Ding
- Direct Preference Optimization: Your Language Model Is Secretly A Reward Model Rafael Rafailov, Archit Sharma, Eric Mitchell, Stefano Ermon, Christopher D. Manning, Chelsea Finn
- Improving Generalization In Task-oriented Dialogues With Workflows And Action Plans Raimondo Stefania, Pal Christopher, Liu Xiaotian, Vazquez David, Palacios Hector
- K-PERM: Personalized Response Generation Using Dynamic Knowledge Retrieval And Persona-adaptive Queries Raj Kanak, Roy Kaushik, Bonagiri Vamshi, Govil Priyanshul, Thirunarayanan Krishnaprasad, Gaur Manas
- Aligning Neural Machine Translation Models: Human Feedback In Training And Inference Ramos Miguel Moura, Fernandes Patrick, Farinhas António, Martins André F. T.
- SYNDICOM: Improving Conversational Commonsense With Error-injection And Natural Language Feedback Richardson Christopher, Sundar Anirudh, Heck Larry
- Synergistic Integration Of Large Language Models And Cognitive Architectures For Robust AI: An Exploratory Analysis Romero Oscar J., Zimmerman John, Steinfeld Aaron, Tomasic Anthony
- The Manipulation Problem: Conversational AI As A Threat To Epistemic Agency Rosenberg Louis
- Axiomatic Preference Modeling For Longform Question Answering Rosset Corby, Zheng Guoqing, Dibia Victor, Awadallah Ahmed, Bennett Paul
- Generating Phishing Attacks Using Chatgpt Roy Sayak Saha, Naragam Krishna Vamsi, Nilizadeh Shirin
- From Chatbots To Phishbots? -- Preventing Phishing Scams Created Using Chatgpt, Google Bard And Claude Roy Sayak Saha, Thota Poojitha, Naragam Krishna Vamsi, Nilizadeh Shirin
- TPTU: Large Language Model-based AI Agents For Task Planning And Tool Usage Ruan Jingqing, Chen Yihong, Zhang Bin, Xu Zhiwei, Bao Tianpeng, Du Guoqing, Shi Shiwei, Mao Hangyu, Li Ziyue, Zeng Xingyu, Zhao Rui
- Identifying The Risks Of LM Agents With An LM-emulated Sandbox Ruan Yangjun, Dong Honghua, Wang Andrew, Pitis Silviu, Zhou Yongchao, Ba Jimmy, Dubois Yann, Maddison Chris J., Hashimoto Tatsunori
- Verbosity Bias In Preference Labeling By Large Language Models Saito Keita, Wachi Akifumi, Wataoka Koki, Akimoto Youhei
- Extending The Frontier Of Chatgpt: Code Generation And Debugging Sakib Fardin Ahsan, Khan Saadat Hasan, Karim A. H. M. Rezaul
- Efficient RLHF: Reducing The Memory Usage Of PPO Santacroce Michael, Lu Yadong, Yu Han, Li Yuanzhi, Shen Yelong
- Open-ended Instructable Embodied Agents With Memory-augmented Large Language Models Sarch Gabriel, Wu Yue, Tarr Michael J., Fragkiadaki Katerina
- Large Language Models Can Strategically Deceive Their Users When Put Under Pressure Scheurer Jérémy, Balesni Mikita, Hobbhahn Marius
- Ruffle&riley: Towards The Automated Induction Of Conversational Tutoring Systems Schmucker Robin, Xia Meng, Azaria Amos, Mitchell Tom
- Personality Traits In Large Language Models Serapio-garcía Greg, Safdari Mustafa, Crepy Clément, Sun Luning, Fitz Stephen, Romero Peter, Abdulhai Marwa, Faust Aleksandra, Matarić Maja
- Character-LLM: A Trainable Agent For Role-playing Shao Yunfan, Li Linyang, Dai Junqi, Qiu Xipeng
- Human Choice Prediction In Language-based Persuasion Games: Simulation-based Off-policy Evaluation Shapira Eilam, Apel Reut, Tennenholtz Moshe, Reichart Roi
- Pangu-coder2: Boosting Large Language Models For Code With Ranking Feedback Shen Bo, Zhang Jiaxin, Chen Taihong, Zan Daoguang, Geng Bing, Fu An, Zeng Muhan, Yu Ailun, Ji Jichuan, Zhao Jingyang, Guo Yuenan, Wang Qianxiang
- The Trickle-down Impact Of Reward (in-)consistency On RLHF Shen Lingfeng, Chen Sihao, Song Linfeng, Jin Lifeng, Peng Baolin, Mi Haitao, Khashabi Daniel, Yu Dong
- Taskbench: Benchmarking Large Language Models For Task Automation Shen Yongliang, Song Kaitao, Tan Xu, Zhang Wenqi, Ren Kan, Yuan Siyu, Lu Weiming, Li Dongsheng, Zhuang Yueting
- Can Language Agents Be Alternatives To PPO? A Preliminary Empirical Study On OpenAI Gym Sheng Junjie, Huang Zixiao, Shen Chuyun, Li Wenhao, Hua Yun, Jin Bo, Zha Hongyuan, Wang Xiangfeng
- Badgpt: Exploring Security Vulnerabilities Of Chatgpt Via Backdoor Attacks To Instructgpt Shi Jiawen, Liu Yixin, Zhou Pan, Sun Lichao
- Unleashing The Power Of Pre-trained Language Models For Offline Reinforcement Learning Shi Ruizhe, Liu Yuyao, Ze Yanjie, Du Simon S., Xu Huazhe
- Reasoning With Language Model Is Planning With World Model Shibo Hao, Yi Gu, Haodi Ma, Joshua Jiahua Hong, Zhen Wang, Daisy Zhe Wang, Zhiting Hu
- Rewritelm: An Instruction-tuned Large Language Model For Text Rewriting Shu Lei, Luo Liangchen, Hoskere Jayakumar, Zhu Yun, Liu Yinxiao, Tong Simon, Chen Jindong, Meng Lei
- SC-ML: Self-supervised Counterfactual Metric Learning For Debiased Visual Question Answering Shu Xinyao, Yan Shiyang, Yang Xu, Wu Ziheng, Chen Zhongfeng, Lu Zhenyu
- Beyond Task Performance: Evaluating And Reducing The Flaws Of Large Multimodal Models With In-context Learning Shukor Mustafa, Rame Alexandre, Dancette Corentin, Cord Matthieu
- Unlocking Model Insights: A Dataset For Automated Model Card Generation Singh Shruti, Lodwal Hitesh, Malwat Husain, Thakur Rakesh, Singh Mayank
- A Long Way To Go: Investigating Length Correlations In RLHF Singhal Prasann, Goyal Tanya, Xu Jiacheng, Durrett Greg
- Should We Be Going MAD? A Look At Multi-agent Debate Strategies For LLMs Smit Andries, Duckworth Paul, Grinsztajn Nathan, Barrett Thomas D., Pretorius Arnu
- Step: Stacked LLM Policies For Web Actions Sodhi Paloma, Branavan S. R. K., Artzi Yoav, Mcdonald Ryan
- Preference Ranking Optimization For Human Alignment Song Feifan, Yu Bowen, Li Minghao, Yu Haiyang, Huang Fei, Li Yongbin, Wang Houfeng
- Chatgpt: A Study On Its Utility For Ubiquitous Software Engineering Tasks Sridhara Giriprasad, G. Ranjani H., Mazumdar Sourav
- Cognitive Architectures For Language Agents Sumers Theodore R., Yao Shunyu, Narasimhan Karthik, Griffiths Thomas L.
- Reinforcement Learning In The Era Of LLMs: What Is Essential? What Is Needed? An RL Perspective On RLHF, Prompting, And Beyond Sun Hao
- Query-dependent Prompt Evaluation And Optimization With Offline Inverse RL Sun Hao, Hüyük Alihan, Van Der Schaar Mihaela
- Adaplanner: Adaptive Planning From Feedback With Language Models Sun Haotian, Zhuang Yuchen, Kong Lingkai, Dai Bo, Zhang Chao
- Think-on-graph: Deep And Responsible Reasoning Of Large Language Model On Knowledge Graph Sun Jiashuo, Xu Chengjin, Tang Lumingyuan, Wang Saizhuo, Lin Chen, Gong Yeyun, Ni Lionel M., Shum Heung-yeung, Guo Jian
- Corex: Pushing The Boundaries Of Complex Reasoning Through Multi-model Collaboration Sun Qiushi, Yin Zhangyue, Li Xiang, Wu Zhiyong, Qiu Xipeng, Kong Lingpeng
- A Prompt Learning Framework For Source Code Summarization Sun Weisong, Fang Chunrong, You Yudu, Chen Yuchen, Liu Yi, Wang Chong, Zhang Jian, Zhang Quanjun, Qian Hanwei, Zhao Wei, Liu Yang, Chen Zhenyu
- A Short Survey Of Viewing Large Language Models In Legal Aspect Sun Zhongxiang
- Do Large Language Models Show Decision Heuristics Similar To Humans? A Case Study Using GPT-3.5 Suri Gaurav, Slater Lily R., Ziaee Ali, Nguyen Morgan
- Can Language Models Teach Weaker Agents? Teacher Explanations Improve Students Via Personalization Swarnadeep Saha, Peter Hase, Mohit Bansal
- Large Language Models As Generalizable Policies For Embodied Tasks Szot Andrew, Schwarzer Max, Agrawal Harsh, Mazoure Bogdan, Talbott Walter, Metcalf Katherine, Mackraz Natalie, Hjelm Devon, Toshev Alexander
- Redcoast: A Lightweight Tool To Automate Distributed Training Of LLMs On Any GPU/TPUs Tan Bowen, Zhu Yun, Liu Lijuan, Wang Hongyi, Zhuang Yonghao, Chen Jindong, Xing Eric, Hu Zhiting
- Is Chatgpt A Good Multi-party Conversation Solver? Tan Chao-hong, Gu Jia-chen, Ling Zhen-hua
- Toolalpaca: Generalized Tool Learning For Language Models With 3000 Simulated Cases Tang Qiaoyu, Deng Ziliang, Lin Hongyu, Han Xianpei, Liang Qiao, Cao Boxi, Sun Le
- Ml-bench: Evaluating Large Language Models And Agents For Machine Learning Tasks On Repository-level Code Tang Xiangru, Liu Yuliang, Cai Zefan, Shao Yanjun, Lu Junjie, Zhang Yichi, Deng Zexuan, Hu Helan, An Kaikai, Huang Ruijun, Si Shuzheng, Chen Sheng, Zhao Haozhe, Chen Liang, Wang Yan, Liu Tianyu, Jiang Zhiwei, Chang Baobao, Fang Yin, Qin Yujia, Zhou Wangchunshu, Zhao Yilun, Cohan Arman, Gerstein Mark
- Medagents: Large Language Models As Collaborators For Zero-shot Medical Reasoning Tang Xiangru, Zou Anni, Zhang Zhuosheng, Li Ziming, Zhao Yilun, Zhang Xingyao, Cohan Arman, Gerstein Mark
- RSVP: Customer Intent Detection Via Agent Response Contrastive And Generative Pre-training Tang Yu-chien, Wang Wei-yao, Yen An-zi, Peng Wen-chih
- Webwise: Web Interface Control And Sequential Exploration With Large Language Models Tao Heyi, T Sethuraman V, Shlapentokh-rothman Michal, Hoiem Derek
- Adversarial Conversational Shaping For Intelligent Agents Tarasiewicz Piotr, Kenjeyev Sultan, Sebag Ilana, Alshehabi Shehab
- Just Ask For Calibration: Strategies For Eliciting Calibrated Confidence Scores From Language Models Fine-tuned With Human Feedback Tian Katherine, Mitchell Eric, Zhou Allan, Sharma Archit, Rafailov Rafael, Yao Huaxiu, Finn Chelsea, Manning Christopher D.
- DUMA: A Dual-mind Conversational Agent With Fast And Slow Thinking Tian Xiaoyu, Chen Liangyu, Liu Na, Liu Yaxuan, Zou Wei, Chen Kaijiang, Cui Ming
- Evil Geniuses: Delving Into The Safety Of LLM-based Agents Tian Yu, Yang Xiao, Zhang Jingyuan, Dong Yinpeng, Su Hang
- Macgyver: Are Large Language Models Creative Problem Solvers? Tian Yufei, Ravichander Abhilasha, Qin Lianhui, Bras Ronan Le, Marjieh Raja, Peng Nanyun, Choi Yejin, Griffiths Thomas L., Brahman Faeze
- Activation Addition: Steering Language Models Without Optimization Turner Alexander Matt, Thiergart Lisa, Leech Gavin, Udell David, Vazquez Juan J., Mini Ulisse, Macdiarmid Monte
- Simple Is Better And Large Is Not Enough: Towards Ensembling Of Foundational Language Models Tyagi Nancy, Shiri Aidin, Sarkar Surjodeep, Umrawal Abhishek Kumar, Gaur Manas
- Evaluating Shutdown Avoidance Of Language Models In Textual Scenarios Van Der Weij Teun, Lermen Simon, Lang Leon
- Are Personalized Stochastic Parrots More Dangerous? Evaluating Persona Biases In Dialogue Systems Wan Yixin, Zhao Jieyu, Chadha Aman, Peng Nanyun, Chang Kai-wei
- Dialog Action-aware Transformer For Dialog Policy Learning Wang Huimin, Kwan Wai-chung, Wong Kam-fai
- Notechat: A Dataset Of Synthetic Doctor-patient Conversations Conditioned On Clinical Notes Wang Junda, Yao Zonghai, Yang Zhichao, Zhou Huixue, Li Rumeng, Wang Xun, Xu Yucheng, Yu Hong
- Adapting LLM Agents With Universal Feedback In Communication Wang Kuan, Lu Yadong, Santacroce Michael, Gong Yeyun, Zhang Chao, Shen Yelong
- Making Large Language Models Better Reasoners With Alignment Wang Peiyi, Li Lei, Chen Liang, Song Feifan, Lin Binghuai, Cao Yunbo, Liu Tianyu, Sui Zhifang
- Ivygpt: Interactive Chinese Pathway Language Model In Medical Domain Wang Rongsheng, Duan Yaofei, Lam Chantong, Chen Jiexi, Xu Jiangsheng, Chen Haoming, Liu Xiaohong, Pang Patrick Cheong-iao, Tan Tao
- STEPS: A Benchmark For Order Reasoning In Sequential Tasks Wang Weizhi, Wang Hong, Yan Xifeng
- MINT: Evaluating LLMs In Multi-turn Interaction With Tools And Language Feedback Wang Xingyao, Wang Zihan, Liu Jiateng, Chen Yangyi, Yuan Lifan, Peng Hao, Ji Heng
- Promptagent: Strategic Planning With Language Models Enables Expert-level Prompt Optimization Wang Xinyuan, Li Chenxi, Wang Zhen, Bai Fan, Luo Haotian, Zhang Jiayou, Jojic Nebojsa, Xing Eric P., Hu Zhiting
- Lifelongmemory: Leveraging LLMs For Answering Queries In Long-form Egocentric Videos Wang Ying, Yang Yanlai, Ren Mengye
- Enabling Language Models To Implicitly Learn Self-improvement Wang Ziqi, Hou Le, Lu Tianjian, Wu Yuexin, Li Yunxuan, Yu Hongkun, Ji Heng
- Is Chatgpt Good At Search? Investigating Large Language Models As Re-ranking Agents Weiwei Sun, Lingyong Yan, Xinyu Ma, Shuaiqiang Wang, Pengjie Ren, Zhumin Chen, Dawei Yin, Zhaochun Ren
- Unveiling The Implicit Toxicity In Large Language Models Wen Jiaxin, Ke Pei, Sun Hao, Zhang Zhexin, Li Chengfei, Bai Jinfeng, Huang Minlie
- Next-gpt: Any-to-any Multimodal LLM Wu Shengqiong, Fei Hao, Qu Leigang, Ji Wei, Chua Tat-seng
- A New Dialogue Response Generation Agent For Large Language Models By Asking Questions To Detect User's Intentions Wu Siwei, Shen Xiangqing, Xia Rui
- Unveiling Security, Privacy, And Ethical Concerns Of Chatgpt Wu Xiaodong, Duan Ran, Ni Jianbing
- Mathchat: Converse To Tackle Challenging Math Problems With LLM Agents Wu Yiran, Jia Feiran, Zhang Shaokun, Li Hangyu, Zhu Erkang, Wang Yue, Lee Yin Tat, Peng Richard, Wu Qingyun, Wang Chi
- Smartplay: A Benchmark For LLMs As Intelligent Agents Wu Yue, Tang Xuan, Mitchell Tom M., Li Yuanzhi
- Language Models Meet World Models: Embodied Experiences Enhance Language Models Xiang Jiannan, Tao Tianhua, Gu Yi, Shu Tianmin, Wang Zirui, Yang Zichao, Hu Zhiting
- Personalized Abstractive Summarization By Tri-agent Generation Pipeline Xiao Wen, Xie Yujia, Carenini Giuseppe, He Pengcheng
- O3D: Offline Data-driven Discovery And Distillation For Sequential Decision-making With Large Language Models Xiao Yuchen, Sun Yanchao, Xu Mengda, Madhushani Udari, Vann Jared, Garg Deepeka, Ganesh Sumitra
- Funqa: Towards Surprising Video Comprehension Xie Binzhu, Zhang Sicheng, Zhou Zitang, Li Bo, Zhang Yuanhan, Hessel Jack, Yang Jingkang, Liu Ziwei
- Text2reward: Reward Shaping With Language Models For Reinforcement Learning Xie Tianbao, Zhao Siheng, Wu Chen Henry, Liu Yitao, Luo Qian, Zhong Victor, Yang Yanchao, Yu Tao
- Openagents: An Open Platform For Language Agents In The Wild Xie Tianbao, Zhou Fan, Cheng Zhoujun, Shi Peng, Weng Luoxuan, Liu Yitao, Hua Toh Jing, Zhao Junning, Liu Qian, Liu Che, Liu Leo Z., Xu Yiheng, Su Hongjin, Shin Dongchan, Xiong Caiming, Yu Tao
- Expertprompting: Instructing Large Language Models To Be Distinguished Experts Xu Benfeng, Yang An, Lin Junyang, Wang Quan, Zhou Chang, Zhang Yongdong, Mao Zhendong
- Gentopia: A Collaborative Platform For Tool-augmented LLMs Xu Binfeng, Liu Xukun, Shen Hua, Han Zeyu, Li Yuhan, Yue Murong, Peng Zhiyuan, Liu Yuchen, Yao Ziyu, Xu Dongkuan
- No Train Still Gain. Unleash Mathematical Reasoning Of Large Language Models With Monte Carlo Tree Search Guided By Energy Function Xu Haotian
- Recommender AI Agent: Integrating Large Language Models For Interactive Recommendations Xu Huang, Jianxun Lian, Yuxuan Lei, Jing Yao, Defu Lian, Xing Xie
- Shattering The Agent-environment Interface For Fine-tuning Inclusive Language Models Xu Wanqiao, Dong Shi, Arumugam Dilip, Van Roy Benjamin
- Parameter-efficient Tuning Helps Language Model Alignment Xue Tianci, Wang Ziqi, Ji Heng
- Recmind: Large Language Model Powered Agent For Recommendation Yancheng Wang, Ziyan Jiang, Zheng Chen, Fan Yang, Yingxue Zhou, Eunah Cho, Xing Fan, Xiaojiang Huang, Yanbin Lu, Yingzhen Yang
- LACMA: Language-aligning Contrastive Learning With Meta-actions For Embodied Instruction Following Yang Cheng-fu, Chen Yen-chun, Yang Jianwei, Dai Xiyang, Yuan Lu, Wang Yu-chiang Frank, Chang Kai-wei
- Can You Follow Me? Testing Situational Understanding In Chatgpt Yang Chenghao, Ettinger Allyson
- PRCA: Fitting Black-box Large Language Models For Retrieval Question Answering Via Pluggable Reward-driven Contextual Adapter Yang Haoyan, Li Zhitao, Zhang Yong, Wang Jianzong, Cheng Ning, Li Ming, Xiao Jing
- Auto-gpt For Online Decision Making: Benchmarks And Additional Opinions Yang Hui, Yue Sifu, He Yunzhong
- Zhongjing: Enhancing The Chinese Medical Capabilities Of Large Language Model Through Expert Feedback And Real-world Multi-turn Dialogue Yang Songhua, Zhao Hanjie, Zhu Senbin, Zhou Guangyu, Xu Hongfei, Jia Yuxiang, Zan Hongying
- Harnessing The Power Of Large Language Models For Natural Language To First-order Logic Translation Yang Yuan, Xiong Siheng, Payani Ali, Shareghi Ehsan, Fekri Faramarz
- Improving Language Model Negotiation With Self-play And In-context Learning From AI Feedback Yao Fu, Hao Peng, Tushar Khot, Mirella Lapata
- Embodiedgpt: Vision-language Pre-training Via Embodied Chain Of Thought Yao Mu, Qinglong Zhang, Mengkang Hu, Wenhai Wang, Mingyu Ding, Jun Jin, Bin Wang, Jifeng Dai, Yu Qiao, Ping Luo
- Retroformer: Retrospective Large Language Agents With Policy Gradient Optimization Yao Weiran, Heinecke Shelby, Niebles Juan Carlos, Liu Zhiwei, Feng Yihao, Xue Le, Murthy Rithesh, Chen Zeyuan, Zhang Jianguo, Arpit Devansh, Xu Ran, Mui Phil, Wang Huan, Xiong Caiming, Savarese Silvio
- Rational Decision-making Agent With Internalized Utility Judgment Ye Yining, Cong Xin, Tian Shizuo, Qin Yujia, Liu Chong, Lin Yankai, Liu Zhiyuan, Sun Maosong
- Lemur: Harmonizing Natural Language And Code For Language Agents Yiheng Xu, Hongjin Su, Chen Xing, Boyu Mi, Qian Liu, Weijia Shi, Binyuan Hui, Fan Zhou, Yitao Liu, Tianbao Xie, Zhoujun Cheng, Siheng Zhao, Lingpeng Kong, Bailin Wang, Caiming Xiong, Tao Yu
- Agent Lumos: Unified And Modular Training For Open-source Language Agents Yin Da, Brahman Faeze, Ravichander Abhilasha, Chandu Khyathi, Chang Kai-wei, Choi Yejin, Lin Bill Yuchen
- LAMM: Language-assisted Multi-modal Instruction-tuning Dataset, Framework, And Benchmark Yin Zhenfei, Wang Jiong, Cao Jianjian, Shi Zhelun, Liu Dingning, Li Mukai, Sheng Lu, Bai Lei, Huang Xiaoshui, Wang Zhiyong, Shao Jing, Ouyang Wanli
- The Neuro-symbolic Inverse Planning Engine (NIPE): Modeling Probabilistic Social Inferences From Linguistic Inputs Ying Lance, Collins Katherine M., Wei Megan, Zhang Cedegao E., Zhi-xuan Tan, Weller Adrian, Tenenbaum Joshua B., Wong Lionel
- Hugginggpt: Solving AI Tasks With Chatgpt And Its Friends In Hugging Face Yongliang Shen, Kaitao Song, Xu Tan, Dongsheng Li, Weiming Lu, Yueting Zhuang
- Asyncmld: Asynchronous Multi-llm Framework For Dialogue Recommendation System Yoshimaru Naoki, Okuma Motoharu, Iio Takamasa, Hatano Kenji
- Fundamental Limitations Of Alignment In Large Language Models Yotam Wolf, Noam Wies, Oshri Avnery, Yoav Levine, Amnon Shashua
- \(P^{3}O\): Transferring Visual Representations For Reinforcement Learning Via Prompting You Guoliang, Chu Xiaomeng, Duan Yifan, Peng Jie, Ji Jianmin, Zhang Yu, Zhang Yanyong
- Finmem: A Performance-enhanced LLM Trading Agent With Layered Memory And Character Design Yu Yangyang, Li Haohang, Chen Zhi, Jiang Yuechen, Li Yang, Zhang Denghui, Liu Rong, Suchow Jordan W., Khashanah Khaldoun
- Fine-tuning Language Models With Generative Adversarial Reward Modelling Yu Zhang Ze, Jaw Lau Jia, Hui Zhang, Low Bryan Kian Hsiang
- Large Language Models Illuminate A Progressive Pathway To Artificial Healthcare Assistant: A Review Yuan Mingze, Bao Peng, Yuan Jiajia, Shen Yunhao, Chen Zifan, Xie Yi, Zhao Jie, Chen Yang, Zhang Li, Shen Lin, Dong Bin
- Plan, Eliminate, And Track -- Language Models Are Good Teachers For Embodied Agents Yue Wu, So Yeon Min, Yonatan Bisk, Ruslan Salakhutdinov, Amos Azaria, Yuanzhi Li, Tom Mitchell, Shrimai Prabhumoye
- Keeping Users Engaged During Repeated Administration Of The Same Questionnaire: Using Large Language Models To Reliably Diversify Questions Yun Hye Sun, Arjmand Mehdi, Sherlock Phillip, Paasche-orlow Michael K., Griffith James W., Bickmore Timothy
- Conditionally Combining Robot Skills Using Large Language Models Zentner K. R., Julian Ryan, Ichter Brian, Sukhatme Gaurav S.
- Building Open-ended Embodied Agent Via Language-policy Bidirectional Adaptation Zhai Shaopeng, Wang Jie, Zhang Tianyi, Huang Fuxian, Zhang Qi, Zhou Ming, Hou Jing, Qiao Yu, Liu Yu
- Removing RLHF Protections In GPT-4 Via Fine-tuning Zhan Qiusi, Fang Richard, Bindu Rohan, Gupta Akul, Hashimoto Tatsunori, Kang Daniel
- Mobile-env: Building Qualified Evaluation Benchmarks For LLM-GUI Interaction Zhang Danyang, Shen Zhennan, Xie Rui, Zhang Situo, Xie Tianbao, Zhao Zihan, Chen Siyuan, Chen Lu, Xu Hongshen, Cao Ruisheng, Yu Kai
- On The Safety Of Open-sourced Large Language Models: Does Alignment Really Prevent Them From Being Misused? Zhang Hangfan, Guo Zhimeng, Zhu Huaisheng, Cao Bochuan, Lin Lu, Jia Jinyuan, Chen Jinghui, Wu Dinghao
- The Wisdom Of Hindsight Makes Language Models Better Instruction Followers Zhang Tianjun, Liu Fangchen, Wong Justin, Abbeel Pieter, Gonzalez Joseph E.
- Adarefiner: Refining Decisions Of Language Models With Adaptive Feedback Zhang Wanpeng, Lu Zongqing
- Probing The Multi-turn Planning Capabilities Of LLMs Via 20 Question Games Zhang Yizhe, Lu Jiarui, Jaitly Navdeep
- Reinforced UI Instruction Grounding: Towards A Generic UI Task Automation API Zhang Zhizheng, Xie Wenxuan, Zhang Xiaoyi, Lu Yan
- Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-thought Reasoning To Language Agents Zhang Zhuosheng, Yao Yao, Zhang Aston, Tang Xiangru, Ma Xinbei, He Zhiwei, Wang Yiming, Gerstein Mark, Wang Rui, Liu Gongshen, Zhao Hai
- You Only Look At Screens: Multimodal Chain-of-action Agents Zhang Zhuosheng, Zhang Aston
- Chbias: Bias Evaluation And Mitigation Of Chinese Conversational Language Models Zhao Jiaxu, Fang Meng, Shi Zijing, Li Yitong, Chen Ling, Pechenizkiy Mykola
- Babystories: Can Reinforcement Learning Teach Baby Language Models To Write Better Stories? Zhao Xingmeng, Wang Tongnian, Osborn Sheri, Rios Anthony
- Chat With The Environment: Interactive Multimodal Perception Using Large Language Models Zhao Xufeng, Li Mengdi, Weber Cornelius, Hafez Muhammad Burhan, Wermter Stefan
- Slic-hf: Sequence Likelihood Calibration With Human Feedback Zhao Yao, Joshi Rishabh, Liu Tianqi, Khalman Misha, Saleh Mohammad, Liu Peter J.
- Sglang: Efficient Execution Of Structured Language Model Programs Zheng Lianmin, Yin Liangsheng, Xie Zhiqiang, Sun Chuyue, Huang Jeff, Yu Cody Hao, Cao Shiyi, Kozyrakis Christos, Stoica Ion, Gonzalez Joseph E., Barrett Clark, Sheng Ying
- Improving Generalization Of Alignment With Human Preferences Through Group Invariant Learning Zheng Rui, Shen Wei, Hua Yuan, Lai Wenbin, Dou Shihan, Zhou Yuhao, Xi Zhiheng, Wang Xiao, Huang Haoran, Gui Tao, Zhang Qi, Huang Xuanjing
- The Rise And Potential Of Large Language Model Based Agents: A Survey Zhiheng Xi, Wenxiang Chen, Xin Guo, Wei He, Yiwen Ding, Boyang Hong, Ming Zhang, Junzhe Wang, Senjie Jin, Enyu Zhou, Rui Zheng, Xiaoran Fan, Xiao Wang, Limao Xiong, Yuhao Zhou, Weiran Wang, Changhao Jiang, Yicheng Zou, Xiangyang Liu, Zhangyue Yin, Shihan Dou, Rongxiang Weng, Wensen Cheng, Qi Zhang, Wenjuan Qin, Yongyan Zheng, Xipeng Qiu, Xuanjing Huang, Tao Gui
- Humanoid Agents: Platform For Simulating Human-like Generative Agents Zhilin Wang, Yu Ying Chiu, Yu Cheung Chiu
- Principle-driven Self-alignment Of Language Models From Scratch With Minimal Human Supervision Zhiqing Sun, Yikang Shen, Qinhong Zhou, Hongxin Zhang, Zhenfang Chen, David Cox, Yiming Yang, Chuang Gan
- Hierarchical Neural Program Synthesis Zhong Linghan, Lindeborg Ryan, Zhang Jesse, Lim Joseph J., Sun Shao-hua
- Language Agent Tree Search Unifies Reasoning Acting And Planning In Language Models Zhou Andy, Yan Kai, Shlapentokh-rothman Michal, Wang Haohan, Wang Yu-xiong
- Navgpt: Explicit Reasoning In Vision-and-language Navigation With Large Language Models Zhou Gengze, Hong Yicong, Wu Qi
- Characterglm: Customizing Chinese Conversational AI Characters With Large Language Models Zhou Jinfeng, Chen Zhuang, Wan Dazhen, Wen Bosi, Song Yi, Yu Jifan, Huang Yongkang, Peng Libiao, Yang Jiaming, Xiao Xiyao, Sabour Sahand, Zhang Xiaohan, Hou Wenjing, Zhang Yijia, Dong Yuxiao, Tang Jie, Huang Minlie
- Large Language Model As A Policy Teacher For Training Reinforcement Learning Agents Zhou Zihao, Hu Bin, Zhao Chenyang, Zhang Pu, Liu Bin
- Towards An On-device Agent For Text Rewriting Zhu Yun, Liu Yinxiao, Stahlberg Felix, Kumar Shankar, Chen Yu-hui, Luo Liangchen, Shu Lei, Liu Renjie, Chen Jindong, Meng Lei
- JARVIS-1: Open-world Multi-task Agents With Memory-augmented Multimodal Language Models Zihao Wang, Shaofei Cai, Anji Liu, Yonggang Jin, Jinbing Hou, Bowei Zhang, Haowei Lin, Zhaofeng He, Zilong Zheng, Yaodong Yang, Xiaojian Ma, Yitao Liang
- Describe, Explain, Plan And Select: Interactive Planning With Large Language Models Enables Open-world Multi-task Agents Zihao Wang, Shaofei Cai, Guanzhou Chen, Anji Liu, Xiaojian Ma, Yitao Liang
- Knowledge-infused LLM-powered Conversational Health Agent: A Case Study For Diabetes Patients Abbasian Mahyar, Yang Zhongqi, Khatibi Elahe, Zhang Pengfei, Nagesh Nitish, Azimi Iman, Jain Ramesh, Rahmani Amir M.
- Granite-function Calling Model: Introducing Function Calling Abilities Via Multi-task Learning Of Granular Tasks Abdelaziz Ibrahim, Basu Kinjal, Agarwal Mayank, Kumaravel Sadhana, Stallone Matthew, Panda Rameswar, Rizk Yara, Bhargav Gp, Crouse Maxwell, Gunasekara Chulaka, Ikbal Shajith, Joshi Sachin, Karanam Hima, Kumar Vineet, Munawar Asim, Neelam Sumit, Raghu Dinesh, Sharma Udit, Soria Adriana Meza, Sreedhar Dheeraj, Venkateswaran Praveen, Unuvar Merve, Cox David, Roukos Salim, Lastras Luis, Kapanipathi Pavan
- Synthetic Dialogue Dataset Generation Using LLM Agents Abdullin Yelaman, Molla-aliod Diego, Ofoghi Bahadorreza, Yearwood John, Li Qingyang
- Infercept: Efficient Intercept Support For Augmented Large Language Model Inference Abhyankar Reyna, He Zijian, Srivatsa Vikranth, Zhang Hao, Zhang Yiying
- Stackrag Agent: Improving Developer Answers With Retrieval-augmented Generation Abrahamyan Davit, Fard Fatemeh H.
- Promptwizard: Task-aware Agent-driven Prompt Optimization Framework Agarwal Eshaan, Dani Vivek, Ganu Tanuja, Nambi Akshay
- Scalable Ensembling For Mitigating Reward Overoptimisation Ahmed Ahmed M., Rafailov Rafael, Sharkov Stepan, Li Xuechen, Koyejo Sanmi
- Tuning Large Multimodal Models For Videos Using Reinforcement Learning From AI Feedback Ahn Daechul, Choi Yura, Yu Youngjae, Kang Dongyeop, Choi Jonghyun
- Timechara: Evaluating Point-in-time Character Hallucination Of Role-playing Large Language Models Ahn Jaewoo, Lee Taehyun, Lim Junyoung, Kim Jin-hwa, Yun Sangdoo, Lee Hwaran, Kim Gunhee
- FACTS About Building Retrieval Augmented Generation-based Chatbots Akkiraju Rama, Xu Anbang, Bora Deepak, Yu Tan, An Lu, Seth Vishal, Shukla Aaditya, Gundecha Pritam, Mehta Hridhay, Jha Ashwin, Raj Prithvi, Balasubramanian Abhinav, Maram Murali, Muthusamy Guru, Annepally Shivakesh Reddy, Knowles Sidney, Du Min, Burnett Nick, Javiya Sean, Marannan Ashok, Kumari Mamta, Jha Surbhi, Dereszenski Ethan, Chakraborty Anupam, Ranjan Subhash, Terfai Amina, Surya Anoop, Mercer Tracey, Thanigachalam Vinodh Kumar, Bar Tamar, Krishnan Sanjana, Kilaru Samy, Jaksic Jasmine, Algarici Nave, Liberman Jacob, Conway Joey, Nayyar Sonu, Boitano Justin
- History Of Generative Artificial Intelligence (AI) Chatbots: Past, Present, And Future Development Al-amin Md., Ali Mohammad Shazed, Salam Abdus, Khan Arif, Ali Ashraf, Ullah Ahsan, Alam Md Nur, Chowdhury Shamsul Kabir
- TREC Ikat 2023: A Test Collection For Evaluating Conversational And Interactive Knowledge Assistants Aliannejadi Mohammad, Abbasiantaeb Zahra, Chatterjee Shubham, Dalton Jeffery, Azzopardi Leif
- Toward Conversational Agents With Context And Time Sensitive Long-term Memory Alonso Nick, Figliolia Tomás, Ndirango Anthony, Millidge Beren
- Mm-phyrlhf: Reinforcement Learning Framework For Multimodal Physics Question-answering Anand Avinash, Kapuriya Janak, Kirtani Chhavi, Singh Apoorv, Saraf Jay, Lal Naman, Kumar Jatin, Shivam Adarsh Raj, Verma Astha, Shah Rajiv Ratn, Zimmermann Roger
- Critique-out-loud Reward Models Ankner Zachary, Paul Mansheej, Cui Brandon, Chang Jonathan D., Ammanabrolu Prithviraj
- BAMO At Semeval-2024 Task 9: BRAINTEASER: A Novel Task Defying Common Sense Ansari Baktash, Rostamkhani Mohammadmostafa, Eetemadi Sauleh
- A Framework For Fine-tuning LLMs Using Heterogeneous Feedback Aponte Ryan, Rossi Ryan A., Guo Shunan, Dernoncourt Franck, Yu Tong, Chen Xiang, Mitra Subrata, Lipka Nedim
- Coaching Copilot: Blended Form Of An LLM-powered Chatbot And A Human Coach To Effectively Support Self-reflection For Leadership Growth Arakawa Riku, Yakura Hiromu
- Elicitron: An LLM Agent-based Simulation Framework For Design Requirements Elicitation Ataei Mohammadmehdi, Cheong Hyunmin, Grandi Daniele, Wang Ye, Morris Nigel, Tessier Alexander
- Current State Of LLM Risks And AI Guardrails Ayyamperumal Suriya Ganesh, Ge Limin
- Hybrid Preference Optimization: Augmenting Direct Preference Optimization With Auxiliary Objectives Badrinath Anirudhan, Agarwal Prabhat, Xu Jiajing
- Efficient Model-agnostic Alignment Via Bayesian Persuasion Bai Fengshuo, Wang Mingzhi, Zhang Zhaowei, Chen Boyuan, Xu Yinda, Wen Ying, Yang Yaodong
- Longwriter: Unleashing 10,000+ Word Generation From Long Context LLMs Bai Yushi, Zhang Jiajie, Lv Xin, Zheng Linzhi, Zhu Siqi, Hou Lei, Dong Yuxiao, Tang Jie, Li Juanzi
- Patentgpt: A Large Language Model For Intellectual Property Bai Zilong, Zhang Ruiji, Chen Linqing, Cai Qijun, Zhong Yuan, Wang Cong, Fang Yan, Fang Jie, Sun Jing, Wang Weikuan, Zhou Lizhi, Hua Haoran, Qiu Tian, Wang Chaochao, Sun Cheng, Lu Jianping, Wang Yixin, Xia Yubin, Hu Meng, Liu Haowen, Xu Peng, Xu Licong, Bian Fu, Gu Xiaolong, Zhang Lisha, Wang Weilei, Tu Changyang
- CYGENT: A Cybersecurity Conversational Agent With Log Summarization Powered By GPT-3 Balasubramanian Prasasthy, Seby Justin, Kostakos Panos
- Linguistic Calibration Of Long-form Generations Band Neil, Li Xuechen, Ma Tengyu, Hashimoto Tatsunori
- Llms With Chain-of-thought Are Non-causal Reasoners Bao Guangsheng, Zhang Hongbo, Yang Linyi, Wang Cunxiang, Zhang Yue
- Aligning Large Language Models From Self-reference AI Feedback With One General Principle Bao Rong, Zheng Rui, Dou Shihan, Wang Xiao, Zhou Enyu, Wang Bo, Zhang Qi, Ding Liang, Tao Dacheng
- Transforming Agency. On The Mode Of Existence Of Large Language Models Barandiaran Xabier E., Almendros Lola S.
- Reinforcement Learning From LLM Feedback To Counteract Goal Misgeneralization Barj Houda Nait El, Sautory Theophile
- Exploring Autonomous Agents Through The Lens Of Large Language Models: A Review Barua Saikat
- STARLING: Self-supervised Training Of Text-based Reinforcement Learning Agent With Large Language Models Basavatia Shreyas, Murugesan Keerthiram, Ratnakar Shivam
- Medaide: Leveraging Large Language Models For On-premise Medical Assistance On Edge Devices Basit Abdul, Hussain Khizar, Hanif Muhammad Abdullah, Shafique Muhammad
- NESTFUL: A Benchmark For Evaluating Llms On Nested Sequences Of API Calls Basu Kinjal, Abdelaziz Ibrahim, Bradford Kelsey, Crouse Maxwell, Kate Kiran, Kumaravel Sadhana, Goyal Saurabh, Munawar Asim, Rizk Yara, Wang Xin, Lastras Luis, Kapanipathi Pavan
- Clembench-2024: A Challenging, Dynamic, Complementary, Multilingual Benchmark And Underlying Flexible Framework For Llms As Multi-action Agents Beyer Anne, Chalamalasetti Kranti, Hakimov Sherzod, Madureira Brielen, Sadler Philipp, Schlangen David
- Efficient Reinforcement Learning Via Large Language Model-based Search Bhambri Siddhant, Bhattacharjee Amrita, Liu Huan, Kambhampati Subbarao
- Workarena++: Towards Compositional Planning And Reasoning-based Common Knowledge Work Tasks Boisvert Léo, Thakkar Megh, Gasse Maxime, Caccia Massimo, De Chezelles Thibault Le Sellier, Cappart Quentin, Chapados Nicolas, Lacoste Alexandre, Drouin Alexandre
- Windows Agent Arena: Evaluating Multi-modal OS Agents At Scale Bonatti Rogerio, Zhao Dan, Bonacci Francesco, Dupont Dillon, Abdali Sara, Li Yinheng, Wagle Justin, Koishida Kazuhito, Bucker Arthur, Jang Lawrence, Hui Zack
- Embardiment: An Embodied AI Agent For Productivity In XR Bovo Riccardo, Abreu Steven, Ahuja Karan, Gonzalez Eric J, Cheng Li-te, Gonzalez-franco Mar
- ARES: Alternating Reinforcement Learning And Supervised Fine-tuning For Enhanced Multi-modal Chain-of-thought Reasoning Through Diverse AI Feedback Byun Ju-seung, Chun Jiyun, Kil Jihyung, Perrault Andrew
- High-dimension Human Value Representation In Large Language Models Cahyawijaya Samuel, Chen Delong, Bang Yejin, Khalatbari Leila, Wilie Bryan, Ji Ziwei, Ishii Etsuko, Fung Pascale
- Internlm2 Technical Report Cai Zheng, Cao Maosong, Chen Haojiong, Chen Kai, Chen Keyu, Chen Xin, Chen Xun, Chen Zehui, Chen Zhi, Chu Pei, Dong Xiaoyi, Duan Haodong, Fan Qi, Fei Zhaoye, Gao Yang, Ge Jiaye, Gu Chenya, Gu Yuzhe, Gui Tao, Guo Aijia, Guo Qipeng, He Conghui, Hu Yingfan, Huang Ting, Jiang Tao, Jiao Penglong, Jin Zhenjiang, Lei Zhikai, Li Jiaxing, Li Jingwen, Li Linyang, Li Shuaibin, Li Wei, Li Yining, Liu Hongwei, Liu Jiangning, Hong Jiawei, Liu Kaiwen, Liu Kuikun, Liu Xiaoran, Lv Chengqi, Lv Haijun, Lv Kai, Ma Li, Ma Runyuan, Ma Zerun, Ning Wenchang, Ouyang Linke, Qiu Jiantao, Qu Yuan, Shang Fukai, Shao Yunfan, Song Demin, Song Zifan, Sui Zhihao, Sun Peng, Sun Yu, Tang Huanze, Wang Bin, Wang Guoteng, Wang Jiaqi, Wang Jiayu, Wang Rui, Wang Yudong, Wang Ziyi, Wei Xingjian, Weng Qizhen, Wu Fan, Xiong Yingtong, Xu Chao, Xu Ruiliang, Yan Hang, Yan Yirong, Yang Xiaogui, Ye Haochen, Ying Huaiyuan, Yu Jia, Yu Jing, Zang Yuhang, Zhang Chuyu, Zhang Li, Zhang Pan, Zhang Peng, Zhang Ruijie, Zhang Shuo, Zhang Songyang, Zhang Wenjian, Zhang Wenwei, Zhang Xingcheng, Zhang Xinyue, Zhao Hui, Zhao Qian, Zhao Xiaomeng, Zhou Fengzhe, Zhou Zaida, Zhuo Jingming, Zou Yicheng, Qiu Xipeng, Qiao Yu, Lin Dahua
- Beyond Sparse Rewards: Enhancing Reinforcement Learning With Language Model Critique In Text Generation Cao Meng, Shu Lei, Yu Lei, Zhu Yun, Wichers Nevan, Liu Yinxiao, Meng Lei
- Survey On Large Language Model-enhanced Reinforcement Learning: Concept, Taxonomy, And Methods Cao Yuji, Zhao Huan, Cheng Yuheng, Shu Ting, Liu Guolong, Liang Gaoqi, Zhao Junhua, Li Yun
- Graphinsight: Unlocking Insights In Large Language Models For Graph Structure Understanding Cao Yukun, Han Shuo, Gao Zengyi, Ding Zezhong, Xie Xike, Zhou S. Kevin
- Can Formal Argumentative Reasoning Enhance Llms Performances? Castagna Federico, Sassoon Isabel, Parsons Simon
- HAMMR: Hierarchical Multimodal React Agents For Generic VQA Castrejon Lluis, Mensink Thomas, Zhou Howard, Ferrari Vittorio, Araujo Andre, Uijlings Jasper
- AMEX: Android Multi-annotation Expo Dataset For Mobile GUI Agents Chai Yuxiang, Huang Siyuan, Niu Yazhe, Xiao Han, Liu Liang, Zhang Dingyu, Gao Peng, Ren Shuai, Li Hongsheng
- Cross-modal Safety Alignment: Is Textual Unlearning All You Need? Chakraborty Trishna, Shayegani Erfan, Cai Zikui, Abu-ghazaleh Nael, Asif M. Salman, Dong Yue, Roy-chowdhury Amit K., Song Chengyu
- Dense Reward For Free In Reinforcement Learning From Human Feedback Chan Alex J., Sun Hao, Holt Samuel, Van Der Schaar Mihaela
- Negotiationtom: A Benchmark For Stress-testing Machine Theory Of Mind On Negotiation Surrounding Chan Chunkit, Cheng Jiayang, Yim Yauwai, Deng Zheye, Fan Wei, Li Haoran, Liu Xin, Zhang Hongming, Wang Weiqi, Song Yangqiu
- Injecting Salesperson's Dialogue Strategies In Large Language Models With Chain-of-thought Reasoning Chang Wen-yu, Chen Yun-nung
- When Large Language Models Meet Evolutionary Algorithms Chao Wang, Zhao Jiaxuan, Jiao Licheng, Li Lingling, Liu Fang, Yang Shuyuan
- Large Language Models As Instruments Of Power: New Regimes Of Autonomous Manipulation And Control Chaudhary Yaqub, Penn Jonnie
- All Roads Lead To Rome: Unveiling The Trajectory Of Recommender Systems Across The LLM Era Chen Bo, Dai Xinyi, Guo Huifeng, Guo Wei, Liu Weiwen, Liu Yong, Qin Jiarui, Tang Ruiming, Wang Yichao, Wu Chuhan, Wu Yaxiong, Zhang Hao
- GUI-WORLD: A Dataset For Gui-oriented Multimodal Llm-based Agents Chen Dongping, Huang Yue, Wu Siyuan, Tang Jingyu, Chen Liuyi, Bai Yilin, He Zhigang, Wang Chenlong, Zhou Huichi, Li Yiqiang, Zhou Tianshuo, Yu Yue, Gao Chujie, Zhang Qihui, Gui Yi, Li Zhen, Wan Yao, Zhou Pan, Gao Jianfeng, Sun Lichao
- Magdi: Structured Distillation Of Multi-agent Interaction Graphs Improves Reasoning In Smaller Language Models Chen Justin Chih-yao, Saha Swarnadeep, Stengel-eskin Elias, Bansal Mohit
- Instructioncp: A Fast Approach To Transfer Large Language Models Into Target Language Chen Kuang-ming, Lee Hung-yi
- Pca-bench: Evaluating Multimodal Large Language Models In Perception-cognition-action Chain Chen Liang, Zhang Yichi, Ren Shuhuai, Zhao Haozhe, Cai Zefan, Wang Yuchi, Wang Peiyi, Meng Xiangdi, Liu Tianyu, Chang Baobao
- Apollonion: Profile-centric Dialog Agent Chen Shangyu, Zhao Zibo, Zhao Yuanyuan, Li Xiang
- Advancing Tool-augmented Large Language Models: Integrating Insights From Errors In Inference Trees Chen Sijia, Wang Yibo, Wu Yi-feng, Chen Qing-guo, Xu Zhao, Luo Weihua, Zhang Kaifu, Zhang Lijun
- Beyond Natural Language: Llms Leveraging Alternative Formats For Enhanced Reasoning And Communication Chen Weize, Yuan Chenfei, Yuan Jiarui, Su Yusheng, Qian Chen, Yang Cheng, Xie Ruobing, Liu Zhiyuan, Sun Maosong
- Reprompt: Planning By Automatic Prompt Engineering For Large Language Models Agents Chen Weizhe, Koenig Sven, Dilkina Bistra
- Cause-aware Empathetic Response Generation Via Chain-of-thought Fine-tuning Chen Xinhao, Yang Chong, Lan Man, Cai Li, Chen Yang, Hu Tu, Zhuang Xinlin, Zhou Aimin
- Hiqa: A Hierarchical Contextual Augmentation RAG For Massive Documents QA Chen Xinyue, Gao Pengyu, Song Jiangjiang, Tan Xiaoyang
- Iteralign: Iterative Constitutional Alignment Of Large Language Models Chen Xiusi, Wen Hongzhi, Nag Sreyashi, Luo Chen, Yin Qingyu, Li Ruirui, Li Zheng, Wang Wei
- RL-JACK: Reinforcement Learning-powered Black-box Jailbreaking Attack Against Llms Chen Xuan, Nie Yuzhou, Yan Lu, Mao Yunshu, Guo Wenbo, Zhang Xiangyu
- Can We Rely On LLM Agents To Draft Long-horizon Plans? Let's Take Travelplanner As An Example Chen Yanan, Pesaranghader Ali, Sadhu Tanmana, Yi Dong Hoon
- Extroversion Or Introversion? Controlling The Personality Of Your Large Language Models Chen Yanquan, Wu Zhen, Guo Junjie, Huang Shujian, Dai Xinyu
- Recent Trends In Personalized Dialogue Generation: A Review Of Datasets, Methodologies, And Evaluations Chen Yi-pei, Nishida Noriki, Nakayama Hideki, Matsumoto Yuji
- Agent-flan: Designing Data And Methods Of Effective Agent Tuning For Large Language Models Chen Zehui, Liu Kuikun, Wang Qiuchen, Zhang Wenwei, Liu Jiangning, Lin Dahua, Chen Kai, Zhao Feng
- Online Personalizing White-box Llms Generation With Neural Bandits Chen Zekai, Weeden Daniel, Chen Po-yu, Buet-golfouse Francois
- Autoprm: Automating Procedural Supervision For Multi-step Reasoning Via Controllable Question Decomposition Chen Zhaorun, Zhao Zhuokai, Zhu Zhihong, Zhang Ruiqi, Li Xiang, Raj Bhiksha, Yao Huaxiu
- What Are The Essential Factors In Crafting Effective Long Context Multi-hop Instruction Datasets? Insights And Best Practices Chen Zhi, Chen Qiguang, Qin Libo, Guo Qipeng, Lv Haijun, Zou Yicheng, Che Wanxiang, Yan Hang, Chen Kai, Lin Dahua
- Self-playing Adversarial Language Game Enhances LLM Reasoning Cheng Pengyu, Hu Tianhao, Xu Han, Zhang Zhisong, Dai Yong, Han Lei, Du Nan
- Towards Achieving Human Parity On End-to-end Simultaneous Speech Translation Via LLM Agent Cheng Shanbo, Huang Zhichao, Ko Tom, Li Hang, Peng Ningxin, Xu Lu, Zhang Qini
- Small Agent Can Also Rock! Empowering Small Language Models As Hallucination Detector Cheng Xiaoxue, Li Junyi, Zhao Wayne Xin, Zhang Hongzhi, Zhang Fuzheng, Zhang Di, Gai Kun, Wen Ji-rong
- Can Large Language Models Be Trusted For Evaluation? Scalable Meta-evaluation Of Llms As Evaluators Via Agent Debate Chern Steffi, Chern Ethan, Neubig Graham, Liu Pengfei
- Walert: Putting Conversational Search Knowledge Into Action By Building And Evaluating A Large Language Model-powered Chatbot Cherumanal Sachin Pathiyan, Tian Lin, Abushaqra Futoon M., De Paula Angel Felipe Magnossao, Ji Kaixin, Hettiachchi Danula, Trippas Johanne R., Ali Halil, Scholer Falk, Spina Damiano
- Human-centered Llm-agent User Interface: A Position Paper Chin Daniel, Wang Yuxuan, Xia Gus
- Lota-bench: Benchmarking Language-oriented Task Planners For Embodied Agents Choi Jae-woo, Yoon Youngwoo, Ong Hyobin, Kim Jaehong, Jang Minsu
- Cohesive Conversations: Enhancing Authenticity In Multi-agent Simulated Dialogues Chu Kuanchao, Chen Yi-pei, Nakayama Hideki
- Professional Agents -- Evolving Large Language Models Into Autonomous Experts With Human-level Competencies Chu Zhixuan, Wang Yan, Zhu Feng, Yu Lu, Li Longfei, Gu Jinjie
- Informed AI Regulation: Comparing The Ethical Frameworks Of Leading LLM Chatbots Using An Ethics-based Audit To Assess Moral Reasoning And Normative Values Chun Jon, Elkins Katherine
- Improving Token-based World Models With Parallel Observation Prediction Cohen Lior, Wang Kaixin, Kang Bingyi, Mannor Shie
- Attentionlego: An Open-source Building Block For Spatially-scalable Large Language Model Accelerator With Processing-in-memory Technology Cong Rongqing, He Wenyang, Li Mingxuan, Luo Bangning, Yang Zebin, Yang Yuchao, Huang Ru, Yan Bonan
- Gamebench: Evaluating Strategic Reasoning Abilities Of LLM Agents Costarelli Anthony, Allen Mat, Hauksson Roman, Sodunke Grace, Hariharan Suhas, Cheng Carlson, Li Wenjie, Clymer Joshua, Yadav Arjun
- Generating Code World Models With Large Language Models Guided By Monte Carlo Tree Search Dainese Nicola, Merler Matteo, Alakuijala Minttu, Marttinen Pekka
- A Complete Survey On Llm-based AI Chatbots Dam Sumit Kumar, Hong Choong Seon, Qiao Yu, Zhang Chaoning
- Impact Of Non-standard Unicode Characters On Security And Comprehension In Large Language Models Daniel Johan S, Pal Anand
- Active Preference Optimization For Sample Efficient RLHF Das Nirjhar, Chakraborty Souradip, Pacchiano Aldo, Chowdhury Sayak Ray
- Retail-gpt: Leveraging Retrieval Augmented Generation (RAG) For Building E-commerce Chat Assistants De Freitas Bruno Amaral Teixeira, Lotufo Roberto De Alencar
- Will GPT-4 Run DOOM? De Wynter Adrian
- Evaluating Large Language Models For Automatic Analysis Of Teacher Simulations De-fitero-dominguez David, Albaladejo-gonzález Mariano, Garcia-cabot Antonio, Garcia-lopez Eva, Moreno-cediel Antonio, Barno Erin, Reich Justin
- Deepseek-v2: A Strong, Economical, And Efficient Mixture-of-experts Language Model Deepseek-ai, Aixin Liu, Bei Feng, Bin Wang, Bingxuan Wang, Bo Liu, Chenggang Zhao, Chengqi Dengr, Chong Ruan, Damai Dai, Daya Guo, Dejian Yang, Deli Chen, Dongjie Ji, Erhang Li, Fangyun Lin, Fuli Luo, Guangbo Hao, Guanting Chen, Guowei Li, H. Zhang, Hanwei Xu, Hao Yang, Haowei Zhang, Honghui Ding, Huajian Xin, Huazuo Gao, Hui Li, Hui Qu, J. L. Cai, Jian Liang, Jianzhong Guo, Jiaqi Ni, Jiashi Li, Jin Chen, Jingyang Yuan, Junjie Qiu, Junxiao Song, Kai Dong, Kaige Gao, Kang Guan, Lean Wang, Lecong Zhang, Lei Xu, Leyi Xia, Liang Zhao, Liyue Zhang, Meng Li, Miaojun Wang, Mingchuan Zhang, Minghua Zhang, Minghui Tang, Mingming Li, Ning Tian, Panpan Huang, Peiyi Wang, Peng Zhang, Qihao Zhu, Qinyu Chen, Qiushi Du, R. J. Chen, R. L. Jin, Ruiqi Ge, Ruizhe Pan, Runxin Xu, Ruyi Chen, S. S. Li, Shanghao Lu, Shangyan Zhou, Shanhuang Chen, Shaoqing Wu, Shengfeng Ye, Shirong Ma, Shiyu Wang, Shuang Zhou, Shuiping Yu, Shunfeng Zhou, Size Zheng, T. Wang, Tian Pei, Tian Yuan, Tianyu Sun, W. L. Xiao, Wangding Zeng, Wei An, Wen Liu, Wenfeng Liang, Wenjun Gao, Wentao Zhang, X. Q. Li, Xiangyue Jin, Xianzu Wang, Xiao Bi, Xiaodong Liu, Xiaohan Wang, Xiaojin Shen, Xiaokang Chen, Xiaosha Chen, Xiaotao Nie, Xiaowen Sun, Xiaoxiang Wang, Xin Liu, Xin Xie, Xingkai Yu, Xinnan Song, Xinyi Zhou, Xinyu Yang, Xuan Lu, Xuecheng Su, Y. Wu, Y. K. Li, Y. X. Wei, Y. X. Zhu, Yanhong Xu, Yanping Huang, Yao Li, Yao Zhao, Yaofeng Sun, Yaohui Li, Yaohui Wang, Yi Zheng, Yichao Zhang, Yiliang Xiong, Yilong Zhao, Ying He, Ying Tang, Yishi Piao, Yixin Dong, Yixuan Tan, Yiyuan Liu, Yongji Wang, Yongqiang Guo, Yuchen Zhu, Yuduan Wang, Yuheng Zou, Yukun Zha, Yunxian Ma, Yuting Yan, Yuxiang You, Yuxuan Liu, Z. Z. Ren, Zehui Ren, Zhangli Sha, Zhe Fu, Zhen Huang, Zhen Zhang, Zhenda Xie, Zhewen Hao, Zhihong Shao, Zhiniu Wen, Zhipeng Xu, Zhongyu Zhang, Zhuoshu Li, Zihan Wang, Zihui Gu, Zilin Li, Ziwei Xie
- MIMIR: A Streamlined Platform For Personalized Agent Tuning In Domain Expertise Deng Chunyuan, Tang Xiangru, Zhao Yilun, Wang Hanming, Wang Haoran, Zhou Wangchunshu, Cohan Arman, Gerstein Mark
- On The Multi-turn Instruction Following For Conversational Web Agents Deng Yang, Zhang Xuan, Zhang Wenxuan, Yuan Yifei, Ng See-kiong, Chua Tat-seng
- LMGT: Optimizing Exploration-exploitation Balance In Reinforcement Learning Through Language Model Guided Trade-offs Deng Yongxin, Qiu Xihe, Tan Xiaoyu, Chu Wei, Xu Yinghui
- Sycophancy To Subterfuge: Investigating Reward-tampering In Large Language Models Denison Carson, Macdiarmid Monte, Barez Fazl, Duvenaud David, Kravec Shauna, Marks Samuel, Schiefer Nicholas, Soklaski Ryan, Tamkin Alex, Kaplan Jared, Shlegeris Buck, Bowman Samuel R., Perez Ethan, Hubinger Evan
- Class-level Code Generation From Natural Language Using Iterative, Tool-enhanced Reasoning Over Repository Deshpande Ajinkya, Agarwal Anmol, Shet Shashank, Iyer Arun, Kanade Aditya, Bairi Ramakrishna, Parthasarathy Suresh
- Are You Sure? Rank Them Again: Repeated Ranking For Better Preference Datasets Devine Peter
- Large Language Model Agent In Financial Trading: A Survey Ding Han, Li Yinheng, Wang Junhao, Chen Hang
- Self-play With Execution Feedback: Improving Instruction-following Capabilities Of Large Language Models Dong Guanting, Lu Keming, Li Chengpeng, Xia Tingyu, Yu Bowen, Zhou Chang, Zhou Jingren
- RLHF Workflow: From Reward Modeling To Online RLHF Dong Hanze, Xiong Wei, Pang Bo, Wang Haoxiang, Zhao Han, Zhou Yingbo, Jiang Nan, Sahoo Doyen, Xiong Caiming, Zhang Tong
- Jailbreaking Text-to-image Models With Llm-based Agents Dong Yingkai, Li Zheng, Meng Xiangtao, Yu Ning, Guo Shanqing
- Training A Vision Language Model As Smartphone Assistant Dorka Nicolai, Marecki Janusz, Anwar Ammar
- Integrating Physician Diagnostic Logic Into Large Language Models: Preference Learning From Process Feedback Dou Chengfeng, Jin Zhi, Jiao Wenpin, Zhao Haiyan, Zhao Yongqiang, Tao Zhenwei
- Stepcoder: Improve Code Generation With Reinforcement Learning From Compiler Feedback Dou Shihan, Liu Yan, Jia Haoxiang, Xiong Limao, Zhou Enyu, Shen Wei, Shan Junjie, Huang Caishuang, Wang Xiao, Fan Xiaoran, Xi Zhiheng, Zhou Yuhao, Ji Tao, Zheng Rui, Zhang Qi, Huang Xuanjing, Gui Tao
- Workarena: How Capable Are Web Agents At Solving Common Knowledge Work Tasks? Drouin Alexandre, Gasse Maxime, Caccia Massimo, Laradji Issam H., Del Verme Manuel, Marty Tom, Boisvert Léo, Thakkar Megh, Cappart Quentin, Vazquez David, Chapados Nicolas, Lacoste Alexandre
- Towards A Copilot In BIM Authoring Tool Using A Large Language Model-based Agent For Intelligent Human-machine Interaction Du Changyu, Nousias Stavros, Borrmann André
- In-context Learning With Reinforcement Learning For Incomplete Utterance Rewriting Du Haowei, Zhao Dongyan
- Prompting Multi-modal Tokens To Enhance End-to-end Autonomous Driving Imitation Learning With Llms Duan Yiqun, Zhang Qiang, Xu Renjing
- Applying RLAIF For Code Generation With Api-usage In Lightweight Llms Dutta Sujan, Mahinder Sayantan, Anantha Raviteja, Bandyopadhyay Bortik
- Lusifer: Llm-based User Simulated Feedback Environment For Online Recommender Systems Ebrat Danial, Rueda Luis
- Tinyagent: Function Calling At The Edge Erdogan Lutfi Eren, Lee Nicholas, Jha Siddharth, Kim Sehoon, Tabrizi Ryan, Moon Suhong, Hooper Coleman, Anumanchipalli Gopala, Keutzer Kurt, Gholami Amir
- Navigation Instruction Generation With BEV Perception And Large Language Models Fan Sheng, Liu Rui, Wang Wenguan, Yang Yi
- Prompt Optimizer Of Text-to-image Diffusion Models For Abstract Concept Understanding Fan Zezhong, Li Xiaohan, Fang Chenhao, Biswas Topojoy, Nag Kaushiki, Xu Jianpeng, Achan Kannan
- DARA: Decomposition-alignment-reasoning Autonomous Language Agent For Question Answering Over Knowledge Graphs Fang Haishuo, Zhu Xiaodan, Gurevych Iryna
- Large Language Models Are Neurosymbolic Reasoners Fang Meng, Deng Shilong, Zhang Yudi, Shi Zijing, Chen Ling, Pechenizkiy Mykola, Wang Jun
- AGILE: A Novel Framework Of LLM Agents Feng Peiyuan, He Yichen, Huang Guanhua, Lin Yuan, Zhang Hanchong, Zhang Yuchen, Li Hang
- An Extremely Data-efficient And Generative Llm-based Reinforcement Learning Agent For Recommenders Feng Shuang, Feng Grace
- Large Language Model-based Human-agent Collaboration For Complex Task Solving Feng Xueyang, Chen Zhi-yuan, Qin Yujia, Lin Yankai, Chen Xu, Liu Zhiyuan, Wen Ji-rong
- Generative AI And Large Language Models For Cyber Security: All Insights You Need Ferrag Mohamed Amine, Alwahedi Fatima, Battah Ammar, Cherif Bilel, Mechri Abdechakour, Tihanyi Norbert
- Limits Of Large Language Models In Debating Humans Flamino James, Modi Mohammed Shahid, Szymanski Boleslaw K., Cross Brendan, Mikolajczyk Colton
- Contrastive Policy Gradient: Aligning Llms On Sequence-level Scores In A Supervised-friendly Fashion Flet-berliac Yannis, Grinsztajn Nathan, Strub Florian, Choi Eugene, Cremer Chris, Ahmadian Arash, Chandak Yash, Azar Mohammad Gheshlaghi, Pietquin Olivier, Geist Matthieu
- Representing Rule-based Chatbots With Transformers Friedman Dan, Panigrahi Abhishek, Chen Danqi
- LLM Agents In Interaction: Measuring Personality Consistency And Linguistic Alignment In Interacting Populations Of Large Language Models Frisch Ivar, Giulianelli Mario
- Preact: Predicting Future In React Enhances Agent's Planning Ability Fu Dayuan, Huang Jianzhao, Lu Siyuan, Dong Guanting, Wang Yejie, He Keqing, Xu Weiran
- Scene-llm: Extending Language Model For 3D Visual Understanding And Reasoning Fu Rao, Liu Jingyu, Chen Xilun, Nie Yixin, Xiong Wenhan
- Autoguide: Automated Generation And Selection Of State-aware Guidelines For Large Language Model Agents Fu Yao, Kim Dong-ki, Kim Jaekyeom, Sohn Sungryull, Logeswaran Lajanugen, Bae Kyunghoon, Lee Honglak
- Application Of LLM Agents In Recruitment: A Novel Framework For Resume Screening Gan Chengguang, Zhang Qinghao, Mori Tatsunori
- Aligning LLM Agents By Learning Latent Preference From User Edits Gao Ge, Taymanov Alexey, Salinas Eduardo, Mineiro Paul, Misra Dipendra
- Efficient Tool Use With Chain-of-abstraction Reasoning Gao Silin, Dwivedi-yu Jane, Yu Ping, Tan Xiaoqing Ellen, Pasunuru Ramakanth, Golovneva Olga, Sinha Koustuv, Celikyilmaz Asli, Bosselut Antoine, Wang Tianlu
- Worldgpt: Empowering LLM As Multimodal World Model Ge Zhiqi, Huang Hongzhe, Zhou Mingze, Li Juncheng, Wang Guoming, Tang Siliang, Zhuang Yueting
- Reinforcement Learning Problem Solving With Large Language Models Gholamian Sina, Huh Domingo
- A Closer Look At The Limitations Of Instruction Tuning Ghosh Sreyan, Evuru Chandra Kiran Reddy, Kumar Sonal, S Ramaneswaran, Aneja Deepali, Jin Zeyu, Duraiswami Ramani, Manocha Dinesh
- Agentquest: A Modular Benchmark Framework To Measure Progress And Improve LLM Agents Gioacchini Luca, Siracusano Giuseppe, Sanvito Davide, Gashteovski Kiril, Friede David, Bifulco Roberto, Lawrence Carolin
- Direct Alignment Of Draft Model For Speculative Decoding With Chat-fine-tuned Llms Goel Raghavv, Gagrani Mukul, Jeon Wonseok, Park Junyoung, Lee Mingu, Lott Christopher
- Language Guided Exploration For RL Agents In Text Environments Golchha Hitesh, Yerawar Sahil, Patel Dhruvesh, Dan Soham, Murugesan Keerthiram
- Stress-testing Capability Elicitation With Password-locked Models Greenblatt Ryan, Roger Fabien, Krasheninnikov Dmitrii, Krueger David
- Teams-rl: Teaching Llms To Generate Better Instruction Datasets Via Reinforcement Learning Gu Shangding, Knoll Alois, Jin Ming
- Continual Skill And Task Learning Via Dialogue Gu Weiwei, Kondepudi Suresh, Huang Lixiao, Gopalan Nakul
- Middleware For Llms: Tools Are Instrumental For Language Agents In Complex Environments Gu Yu, Shu Yiheng, Yu Hao, Liu Xiao, Dong Yuxiao, Tang Jie, Srinivasa Jayanth, Latapie Hugo, Su Yu
- Inquire, Interact, And Integrate: A Proactive Agent Collaborative Framework For Zero-shot Multimodal Medical Reasoning Gu Zishan, Liu Fenglin, Yin Changchang, Zhang Ping
- AMOR: A Recipe For Building Adaptable Modular Knowledge Agents Through Process Feedback Guan Jian, Wu Wei, Wen Zujie, Xu Peng, Wang Hongning, Huang Minlie
- Conversational Topic Recommendation In Counseling And Psychotherapy With Decision Transformer And Large Language Models Gunal Aylin, Lin Baihan, Bouneffouf Djallel
- Direct Language Model Alignment From Online AI Feedback Guo Shangmin, Zhang Biao, Liu Tianlin, Liu Tianqi, Khalman Misha, Llinares Felipe, Rame Alexandre, Mesnard Thomas, Zhao Yao, Piot Bilal, Ferret Johan, Blondel Mathieu
- Agent-simt: Agent-assisted Simultaneous Machine Translation With Large Language Models Guo Shoutao, Zhang Shaolei, Ma Zhengrui, Zhang Min, Feng Yang
- Sillm: Large Language Models For Simultaneous Machine Translation Guo Shoutao, Zhang Shaolei, Ma Zhengrui, Zhang Min, Feng Yang
- Ds-agent: Automated Data Science By Empowering Large Language Models With Case-based Reasoning Guo Siyuan, Deng Cheng, Wen Ying, Chen Hechang, Chang Yi, Wang Jun
- METAREFLECTION: Learning Instructions For Language Agents Using Past Reflections Gupta Priyanshu, Kirtania Shashank, Singha Ananya, Gulwani Sumit, Radhakrishna Arjun, Shi Sherry, Soares Gustavo
- Clochat: Understanding How People Customize, Interact, And Experience Personas In Large Language Models Ha Juhye, Jeon Hyeon, Han Daeun, Seo Jinwook, Oh Changhoon
- Would I Lie To You? Inference Time Alignment Of Language Models Using Direct Preference Heads Hadji-kyriacou Avelina Asada, Arandjelovic Ognjen
- Value Augmented Sampling For Language Model Alignment And Personalization Han Seungwook, Shenfeld Idan, Srivastava Akash, Kim Yoon, Agrawal Pulkit
- Vision-and-language Navigation Generative Pretrained Transformer Hanlin Wen
- Astprompter: Weakly Supervised Automated Language Model Red-teaming To Identify Likely Toxic Prompts Hardy Amelia F., Liu Houjun, Lange Bernard, Kochenderfer Mykel J.
- Sub-goal Distillation: A Method To Improve Small Language Agents Hashemzadeh Maryam, Stengel-eskin Elias, Chandar Sarath, Cote Marc-alexandre
- Teaching Large Language Models To Reason With Reinforcement Learning Havrilla Alex, Du Yuqing, Raparthy Sharath Chandra, Nalmpantis Christoforos, Dwivedi-yu Jane, Zhuravinskyi Maksym, Hambro Eric, Sukhbaatar Sainbayar, Raileanu Roberta
- Large Language Models Are Biased Reinforcement Learners Hayes William M., Yax Nicolas, Palminteri Stefano
- Relative Value Biases In Large Language Models Hayes William M., Yax Nicolas, Palminteri Stefano
- Webvoyager: Building An End-to-end Web Agent With Large Multimodal Models He Hongliang, Yao Wenlin, Ma Kaixin, Yu Wenhao, Dai Yong, Zhang Hongming, Lan Zhenzhong, Yu Dong
- From Words To Actions: Unveiling The Theoretical Underpinnings Of Llm-driven Autonomous Systems He Jianliang, Chen Siyu, Zhang Fengzhuo, Yang Zhuoran
- Llms Meet Multimodal Generation And Editing: A Survey He Yingqing, Liu Zhaoyang, Chen Jingye, Tian Zeyue, Liu Hongyu, Chi Xiaowei, Liu Runtao, Yuan Ruibin, Xing Yazhou, Wang Wenhai, Dai Jifeng, Zhang Yong, Xue Wei, Liu Qifeng, Guo Yike, Chen Qifeng
- Intent-conditioned And Non-toxic Counterspeech Generation Using Multi-task Instruction Tuning With RLAIF Hengle Amey, Kumar Aswini, Singh Sahajpreet, Bandhakavi Anil, Akhtar Md Shad, Chakroborty Tanmoy
- Outcome-constrained Large Language Models For Countering Hate Speech Hong Lingzi, Luo Pengcheng, Blanco Eduardo, Song Xiaoying
- Curiosity-driven Red-teaming For Large Language Models Hong Zhang-wei, Shenfeld Idan, Wang Tsun-hsuan, Chuang Yung-sung, Pareja Aldo, Glass James, Srivastava Akash, Agrawal Pulkit
- Evolution Of Social Norms In LLM Agents Using Natural Language Horiguchi Ilya, Yoshida Takahide, Ikegami Takashi
- Chatglm-rlhf: Practices Of Aligning Large Language Models With Human Feedback Hou Zhenyu, Niu Yilin, Du Zhengxiao, Zhang Xiaohan, Liu Xiao, Zeng Aohan, Zheng Qinkai, Huang Minlie, Wang Hongning, Tang Jie, Dong Yuxiao
- Openrlhf: An Easy-to-use, Scalable And High-performance RLHF Framework Hu Jian, Wu Xibin, Wang Weixun, Xianyu, Zhang Dehao, Cao Yu
- Hiagent: Hierarchical Working Memory Management For Solving Long-horizon Agent Tasks With Large Language Model Hu Mengkang, Chen Tianxing, Chen Qiguang, Mu Yao, Shao Wenqi, Luo Ping
- Agentgen: Enhancing Planning Abilities For Large Language Model Based Agent Via Environment And Task Generation Hu Mengkang, Zhao Pu, Xu Can, Sun Qingfeng, Lou Jianguang, Lin Qingwei, Luo Ping, Rajmohan Saravan, Zhang Dongmei
- A Survey On Large Language Model-based Game Agents Hu Sihao, Huang Tiansheng, Ilhan Fatih, Tekin Selim, Liu Gaowen, Kompella Ramana, Liu Ling
- Pokergpt: An End-to-end Lightweight Solver For Multi-player Texas Hold'em Via Large Language Model Huang Chenghao, Cao Yanbo, Wen Yinlong, Zhou Tao, Zhang Yanru
- A Survey On Evaluation Of Multimodal Large Language Models Huang Jiaxing, Zhang Jingyi
- The N+ Implementation Details Of RLHF With PPO: A Case Study On TL;DR Summarization Huang Shengyi, Noukhovitch Michael, Hosseini Arian, Rasul Kashif, Wang Weixun, Tunstall Lewis
- Queryagent: A Reliable And Efficient Reasoning Framework With Environmental Feedback-based Self-correction Huang Xiang, Cheng Sitao, Huang Shanshan, Shen Jiayu, Xu Yong, Zhang Chaoyun, Qu Yuzhong
- WESE: Weak Exploration To Strong Exploitation For LLM Agents Huang Xu, Liu Weiwen, Chen Xiaolong, Wang Xingmei, Lian Defu, Wang Yasheng, Tang Ruiming, Chen Enhong
- Enhancing The Capability And Robustness Of Large Language Models Through Reinforcement Learning-driven Query Refinement Huang Zisu, Wang Xiaohua, Zhang Feiran, Xu Zhibo, Zhang Cenyuan, Zheng Xiaoqing, Huang Xuanjing
- Mapcoder: Multi-agent Code Generation For Competitive Problem Solving Islam Md. Ashraful, Ali Mohammed Eunus, Parvez Md Rizwan
- Datanarrative: Automated Data-driven Storytelling With Visualizations And Texts Islam Mohammed Saidul, Laskar Md Tahmid Rahman, Parvez Md Rizwan, Hoque Enamul, Joty Shafiq
- Elements Of World Knowledge (EWOK): A Cognition-inspired Framework For Evaluating Basic World Knowledge In Language Models Ivanova Anna A., Sathe Aalok, Lipkin Benjamin, Kumar Unnathi, Radkani Setayesh, Clark Thomas H., Kauf Carina, Hu Jennifer, Pramod R. T., Grand Gabriel, Paulun Vivian, Ryskina Maria, Akyürek Ekin, Wilcox Ethan, Rashid Nafisa, Choshen Leshem, Levy Roger, Fedorenko Evelina, Tenenbaum Joshua, Andreas Jacob
- LLM Agents Improve Semantic Code Search Jain Sarthak, Dora Aditya, Sam Ka Seng, Singh Prabhat
- RLSF: Reinforcement Learning Via Symbolic Feedback Jha Piyush, Jana Prithwish, Arora Arnav, Ganesh Vijay
- Simulbench: Evaluating Language Models With Creative Simulation Tasks Jia Qi, Yue Xiang, Zheng Tianyu, Huang Jie, Lin Bill Yuchen
- Langsuite: Planning, Controlling And Interacting With Large Language Models In Embodied Text Environments Jia Zixia, Wang Mengmeng, Tong Baichen, Zhu Song-chun, Zheng Zilong
- The Real, The Better: Aligning Large Language Models With Online Human Behaviors Jiang Guanying, Yan Lingyong, Shi Haibo, Yin Dawei
- Prompt-based Length Controlled Generation With Multiple Control Types Jie Renlong, Meng Xiaojun, Shang Lifeng, Jiang Xin, Liu Qun
- BIDER: Bridging Knowledge Inconsistency For Efficient Retrieval-augmented Llms Via Key Supporting Evidence Jin Jiajie, Zhu Yutao, Zhou Yujia, Dou Zhicheng
- Read To Play (r2-play): Decision Transformer With Multimodal Game Instruction Jin Yonggang, Zhang Ge, Zhao Hao, Zheng Tianyu, Guo Jarvi, Xiang Liuyu, Yue Shawn, Huang Stephen W., He Zhaofeng, Fu Jie
- Doing Personal LAPS: Llm-augmented Dialogue Construction For Personalized Multi-session Conversational Search Joko Hideaki, Chatterjee Shubham, Ramsay Andrew, De Vries Arjen P., Dalton Jeff, Hasibi Faegheh
- Llm-based Open-domain Integrated Task And Knowledge Assistants With Programmable Policies Joshi Harshit, Liu Shicheng, Chen James, Weigle Robert, Lam Monica S.
- RAP: Retrieval-augmented Planning With Contextual Memory For Multimodal LLM Agents Kagaya Tomoyuki, Yuan Thong Jing, Lou Yuxuan, Karlekar Jayashree, Pranata Sugiri, Kinose Akira, Oguri Koki, Wick Felix, You Yang
- Omniact: A Dataset And Benchmark For Enabling Multimodal Generalist Autonomous Agents For Desktop And Web Kapoor Raghav, Butala Yash Parag, Russak Melisa, Koh Jing Yu, Kamble Kiran, Alshikh Waseem, Salakhutdinov Ruslan
- Instruct, Not Assist: Llm-based Multi-turn Planning And Hierarchical Questioning For Socratic Code Debugging Kargupta Priyanka, Agarwal Ishika, Hakkani-tur Dilek, Han Jiawei
- Can Reinforcement Learning Unlock The Hidden Dangers In Aligned Large Language Models? Karkevandi Mohammad Bahrami, Vishwamitra Nishant, Najafirad Peyman
- Alpaca Against Vicuna: Using Llms To Uncover Memorization Of Llms Kassem Aly M., Mahmoud Omar, Mireshghallah Niloofar, Kim Hyunwoo, Tsvetkov Yulia, Choi Yejin, Saad Sherif, Rana Santu
- Finding A Needle In The Adversarial Haystack: A Targeted Paraphrasing Approach For Uncovering Edge Cases With Minimal Distribution Distortion Kassem Aly M., Saad Sherif
- Bridging The Preference Gap Between Retrievers And Llms Ke Zixuan, Kong Weize, Li Cheng, Zhang Mingyang, Mei Qiaozhu, Bendersky Michael
- Visiongpt: Vision-language Understanding Agent Using Generalized Multimodal Framework Kelly Chris, Hu Luhui, Yang Bang, Tian Yu, Yang Deshun, Yang Cindy, Huang Zaoshan, Li Zihao, Hu Jiayin, Zou Yuexian
- On Scalable Oversight With Weak Llms Judging Strong Llms Kenton Zachary, Siegel Noah Y., Kramár János, Brown-cohen Jonah, Albanie Samuel, Bulian Jannis, Agarwal Rishabh, Lindner David, Tang Yunhao, Goodman Noah D., Shah Rohin
- Understanding Large-language Model (llm)-powered Human-robot Interaction Kim Callie Y., Lee Christine P., Mutlu Bilge
- Dialsim: A Real-time Simulator For Evaluating Long-term Dialogue Understanding Of Conversational Agents Kim Jiho, Chay Woosog, Hwang Hyeonji, Kyung Daeun, Chung Hyunseung, Cho Eunbyeol, Jo Yohan, Choi Edward
- Rethinking The Role Of Proxy Rewards In Language Model Alignment Kim Sungdong, Seo Minjoon
- Cleared For Takeoff? Compositional & Conditional Reasoning May Be The Achilles Heel To (flight-booking) Language Agents Kohli Harsh, Sun Huan
- Prewrite: Prompt Rewriting With Reinforcement Learning Kong Weize, Hombaiah Spurthi Amba, Zhang Mingyang, Mei Qiaozhu, Bendersky Michael
- QPO: Query-dependent Prompt Optimization Via Multi-loop Offline Reinforcement Learning Kong Yilun, Mao Hangyu, Zhao Qi, Zhang Bin, Ruan Jingqing, Shen Li, Chang Yongzhe, Wang Xueqian, Zhao Rui, Tao Dacheng
- Can Large Language Models Explore In-context? Krishnamurthy Akshay, Harris Keegan, Foster Dylan J., Zhang Cyril, Slivkins Aleksandrs
- Reinforcement Learning For Optimizing RAG For Domain Chatbots Kulkarni Mandar, Tangarajan Praveen, Kim Kyung, Trivedi Anusua
- Fine-tuning, Quantization, And Llms: Navigating Unintended Outcomes Kumar Divyanshu, Kumar Anurakt, Agarwal Sahil, Harshangi Prashanth
- Large Language Model Agents For Improving Engagement With Behavior Change Interventions: Application To Digital Mindfulness Kumar Harsh, Yoo Suhyeon, Bernuy Angela Zavaleta, Shi Jiakai, Luo Huayin, Williams Joseph, Kuzminykh Anastasia, Anderson Ashton, Kornfield Rachel
- Improving Socratic Question Generation Using Data Augmentation And Preference Optimization Kumar Nischal Ashok, Lan Andrew
- Continual Learning Optimizations For Auto-regressive Decoder Of Multilingual ASR Systems Kwok Chin Yuen, Yip Jia Qi, Chng Eng Siong
- Open Models, Closed Minds? On Agents Capabilities In Mimicking Human Personalities Through Open Large Language Models La Cava Lucio, Tagarelli Andrea
- Can Github Issues Be Solved With Tree Of Thoughts? La Rosa Ricardo, Hulse Corey, Liu Bangdi
- RES-Q: Evaluating Code-editing Large Language Model Systems At The Repository Scale Labash Beck, Rosedale August, Reents Alex, Negritto Lucas, Wiel Colin
- Autowebglm: Bootstrap And Reinforce A Large Language Model-based Web Navigating Agent Lai Hanyu, Liu Xiao, Iong Iat Long, Yao Shuntian, Chen Yuxuan, Shen Pengbo, Yu Hao, Zhang Hanchen, Zhang Xiaohan, Dong Yuxiao, Tang Jie
- Alarm: Align Language Models Via Hierarchical Rewards Modeling Lai Yuhang, Wang Siyuan, Liu Shujun, Huang Xuanjing, Wei Zhongyu
- Fine-tuning Language Models With Reward Learning On Policy Lang Hao, Huang Fei, Li Yongbin
- Sparse Rewards Can Self-train Dialogue Agents Lattimer Barrett Martin, Gangal Varun, Mcdonald Ryan, Yang Yi
- Frontier AI Ethics: Anticipating And Evaluating The Societal Impacts Of Generative Agents Lazar Seth
- Benchmarks Underestimate The Readiness Of Multi-lingual Dialogue Agents Lee Andrew H., Semnani Sina J., Castillo-lópez Galo, De Chalendar Gäel, Choudhury Monojit, Dua Ashna, Kavitha Kapil Rajesh, Kim Sungkyun, Kodali Prashant, Kumaraguru Ponnurangam, Lombard Alexis, Moradshahi Mehrad, Park Gihyun, Semmar Nasredine, Seo Jiwon, Shen Tianhao, Shrivastava Manish, Xiong Deyi, Lam Monica S.
- Improving Conversational Abilities Of Quantized Large Language Models Via Direct Preference Alignment Lee Janghwan, Park Seongmin, Hong Sukjin, Kim Minsoo, Chang Du-seong, Choi Jungwook
- A Human-inspired Reading Agent With Gist Memory Of Very Long Contexts Lee Kuang-huei, Chen Xinyun, Furuta Hiroki, Canny John, Fischer Ian
- Reinforcement Learning From Reflective Feedback (RLRF): Aligning And Improving Llms Via Fine-grained Self-reflection Lee Kyungjae, Hwang Dasol, Park Sunghyun, Jang Youngsoo, Lee Moontae
- COCOA: Cbt-based Conversational Counseling Agent Using Memory Specialized In Cognitive Distortions And Dynamic Prompt Lee Suyeon, Kang Jieun, Kim Harim, Chung Kyoung-mee, Lee Dongha, Yeo Jinyoung
- Learning To Reduce: Optimal Representations Of Structured Data In Prompting Large Language Models Lee Younghun, Kim Sungchul, Yu Tong, Rossi Ryan A., Chen Xiang
- Autocoder: Enhancing Code Large Language Model With AIEV-Instruct Lei Bin, Li Yuchen, Chen Qiuwu
- Using Hallucinations To Bypass Gpt4's Filter Lemkin Benjamin
- More RLHF, More Trust? On The Impact Of Human Preference Alignment On Language Model Trustworthiness Li Aaron J., Krishna Satyapriya, Lakkaraju Himabindu
- Incorporating External Knowledge And Goal Guidance For Llm-based Conversational Recommender Systems Li Chuang, Deng Yang, Hu Hengchang, Kan Min-yen, Li Haizhou
- A Study On Training And Developing Large Language Models For Behavior Tree Generation Li Fu, Wang Xueying, Li Bin, Wu Yunlong, Wang Yanzhen, Yi Xiaodong
- Hello Again! Llm-powered Personalized Agent For Long-term Dialogue Li Hao, Yang Chenghao, Zhang An, Deng Yang, Wang Xiang, Chua Tat-seng
- Backdoor Removal For Generative Large Language Models Li Haoran, Chen Yulin, Zheng Zihao, Hu Qi, Chan Chunkit, Liu Heshan, Song Yangqiu
- Stylechat: Learning Recitation-augmented Memory In Llms For Stylized Dialogue Generation Li Jinpeng, Zhang Zekai, Tu Quan, Cheng Xin, Zhao Dongyan, Yan Rui
- Dialogue Action Tokens: Steering Language Models In Goal-directed Dialogue With A Multi-turn Planner Li Kenneth, Wang Yiming, Viégas Fernanda, Wattenberg Martin
- Predicting Vs. Acting: A Trade-off Between World Modeling & Agent Modeling Li Margaret, Shi Weijia, Pagnoni Artidoro, West Peter, Holtzman Ari
- Coevol: Constructing Better Responses For Instruction Finetuning Through Multi-agent Cooperation Li Renhao, Tan Minghuan, Wong Derek F., Yang Min
- IQA-EVAL: Automatic Evaluation Of Human-model Interactive Question Answering Li Ruosen, Wang Barry, Li Ruochen, Du Xinya
- TOPA: Extend Large Language Models For Video Understanding Via Text-only Pre-alignment Li Wei, Fan Hehe, Wong Yongkang, Kankanhalli Mohan, Yang Yi
- Citation-enhanced Generation For Llm-based Chatbots Li Weitao, Li Junkai, Ma Weizhi, Liu Yang
- Reinforcement Learning With Token-level Feedback For Controllable Text Generation Li Wendi, Wei Wei, Xu Kaihe, Xie Wenfeng, Chen Dangyang, Cheng Yu
- Tutorly: Turning Programming Videos Into Apprenticeship Learning Environments With Llms Li Wengxi, Pea Roy, Haber Nick, Subramonyam Hari
- Personalized Language Modeling From Personalized Human Feedback Li Xinyu, Lipton Zachary C., Leqi Liu
- Rethinking Chatgpt's Success: Usability And Cognitive Behaviors Enabled By Auto-regressive Llms' Prompting Li Xinzhe, Liu Ming
- Q-adapter: Training Your LLM Adapter As A Residual Q-function Li Yi-chen, Zhang Fuxiang, Qiu Wenjie, Yuan Lei, Jia Chengxing, Zhang Zongzhang, Yu Yang
- Womd-reasoning: A Large-scale Language Dataset For Interaction And Driving Intentions Reasoning Li Yiheng, Ge Chongjian, Li Chenran, Xu Chenfeng, Tomizuka Masayoshi, Tang Chen, Ding Mingyu, Zhan Wei
- Mateval: A Multi-agent Discussion Framework For Advancing Open-ended Text Evaluation Li Yu, Zhang Shenyu, Wu Rui, Huang Xiutian, Chen Yongrui, Xu Wenhao, Qi Guilin, Min Dehai
- ISQA: Informative Factuality Feedback For Scientific Summarization Li Zekai, Qin Yanxia, Liu Qian, Kan Min-yen
- Formal-llm: Integrating Formal Language And Natural Language For Controllable Llm-based Agents Li Zelong, Hua Wenyue, Wang Hao, Zhu He, Zhang Yongfeng
- Cog-ga: A Large Language Models-based Generative Agent For Vision-language Navigation In Continuous Environments Li Zhiyuan, Lu Yanfeng, Mu Yao, Qiao Hong
- Self-instructed Derived Prompt Generation Meets In-context Learning: Unlocking New Potential Of Black-box Llms Li Zhuo, Du Yuhao, Hu Jinpeng, Wan Xiang, Gao Anningzhe
- Recai: Leveraging Large Language Models For Next-generation Recommender Systems Lian Jianxun, Lei Yuxuan, Huang Xu, Yao Jing, Xu Wei, Xie Xing
- What Is The Best Model? Application-driven Evaluation For Large Language Models Lian Shiguo, Zhao Kaikai, Liu Xinhui, Lei Xuejiao, Yang Bikun, Zhang Wenjing, Wang Kai, Liu Zhaoxiang
- Online Training Of Large Language Models: Learn While Chatting Liang Juhao, Wang Ziwei, Ma Zhuoheng, Li Jianquan, Zhang Zhiyi, Wu Xiangbo, Wang Benyou
- Bytecomposer: A Human-like Melody Composition Method Based On Language Model Agent Liang Xia, Du Xingjian, Lin Jiaju, Zou Pei, Wan Yuan, Zhu Bilei
- Languashrink: Reducing Token Overhead With Psycholinguistics Liang Xuechen, Tao Meiling, Xia Yinghui, Shi Tianyu, Wang Jun, Yang Jingsong
- Self-evolving Agents With Reflective And Memory-augmented Abilities Liang Xuechen, Tao Meiling, Xia Yinghui, Shi Tianyu, Wang Jun, Yang Jingsong
- Controllable Text Generation For Large Language Models: A Survey Liang Xun, Wang Hanyu, Wang Yezhaohui, Song Shichao, Yang Jiawei, Niu Simin, Hu Jie, Liu Dan, Yao Shunyu, Xiong Feiyu, Li Zhiyu
- Learning To Trust Your Feelings: Leveraging Self-awareness In Llms For Hallucination Mitigation Liang Yuxin, Song Zhuoyang, Wang Hao, Zhang Jiaxing
- Enhancing Reinforcement Learning With Label-sensitive Reward For Natural Language Understanding Liao Kuo, Li Shuang, Zhao Meng, Liu Liqun, Xue Mengge, Hu Zhenyu, Han Honglin, Yin Chengguo
- Towards Automated Data Sciences With Natural Language And Sagecopilot: Practices And Lessons Learned Liao Yuan, Bian Jiang, Yun Yuhui, Wang Shuo, Zhang Yubo, Chu Jiaming, Wang Tao, Li Kewei, Li Yuchen, Li Xuhong, Ji Shilei, Xiong Haoyi
- Graph-enhanced Large Language Models In Asynchronous Plan Reasoning Lin Fangru, La Malfa Emanuele, Hofmann Valentin, Yang Elle Michelle, Cohn Anthony, Pierrehumbert Janet B.
- FLAME: Factuality-aware Alignment For Large Language Models Lin Sheng-chieh, Gao Luyu, Oguz Barlas, Xiong Wenhan, Lin Jimmy, Yih Wen-tau, Chen Xilun
- Dogerm: Equipping Reward Models With Domain Knowledge Through Model Merging Lin Tzu-han, Li Chen-an, Lee Hung-yi, Chen Yun-nung
- Improving Visual Storytelling With Multimodal Large Language Models Lin Xiaochuan, Chen Xiangyong
- Compromising Embodied Agents With Contextual Backdoor Attacks Liu Aishan, Zhou Yuguang, Liu Xianglong, Zhang Tianyuan, Liang Siyuan, Wang Jiakai, Pu Yanjun, Li Tianlin, Zhang Junqi, Zhou Wenbo, Guo Qing, Tao Dacheng
- Progressively Selective Label Enhancement For Language Model Alignment Liu Biao, Xu Ning, Geng Xin
- Logic Agent: Enhancing Validity With Logic Rule Invocation Liu Hanmeng, Teng Zhiyang, Zhang Chaoli, Zhang Yue
- Tiny Refinements Elicit Resilience: Toward Efficient Prefix-model Against LLM Red-teaming Liu Jiaxu, Yin Xiangyu, Wu Sihao, Wang Jianhong, Fang Meng, Yi Xinping, Huang Xiaowei
- Visualwebbench: How Far Have Multimodal Llms Evolved In Web Page Understanding And Grounding? Liu Junpeng, Song Yifan, Lin Bill Yuchen, Lam Wai, Neubig Graham, Li Yuanzhi, Yue Xiang
- Large Language Model-based Agents For Software Engineering: A Survey Liu Junwei, Wang Kaixin, Chen Yixuan, Peng Xin, Chen Zhenpeng, Zhang Lingming, Lou Yiling
- Generative Expressive Conversational Speech Synthesis Liu Rui, Hu Yifan, Ren Yi, Yin Xiang, Li Haizhou
- RL-GPT: Integrating Reinforcement Learning And Code-as-policy Liu Shaoteng, Yuan Haoqi, Hu Minda, Li Yanwei, Chen Yukang, Liu Shu, Lu Zongqing, Jia Jiaya
- SPINACH: Sparql-based Information Navigation For Challenging Real-world Questions Liu Shicheng, Semnani Sina J., Triedman Harold, Xu Jialiang, Zhao Isaac Dan, Lam Monica S.
- Toolace: Winning The Points Of LLM Function Calling Liu Weiwen, Huang Xu, Zeng Xingshan, Hao Xinlong, Yu Shuai, Li Dexun, Wang Shuai, Gan Weinan, Liu Zhengying, Yu Yuanqing, Wang Zezhong, Wang Yuxian, Ning Wu, Hou Yutai, Wang Bin, Wu Chuhan, Wang Xinzhi, Liu Yong, Wang Yasheng, Tang Duyu, Tu Dandan, Shang Lifeng, Jiang Xin, Tang Ruiming, Lian Defu, Liu Qun, Chen Enhong
- Extensive Self-contrast Enables Feedback-free Language Model Alignment Liu Xiao, Song Xixuan, Dong Yuxiao, Tang Jie
- Arondight: Red Teaming Large Vision Language Models With Auto-generated Multi-modal Jailbreak Prompts Liu Yi, Cai Chengjun, Zhang Xiaoli, Yuan Xingliang, Wang Cong
- World Models With Hints Of Large Language Models For Goal Achieving Liu Zeyuan, Huan Ziyu, Wang Xiyao, Lyu Jiafei, Tao Jian, Li Xiu, Huang Furong, Xu Huazhe
- Mixture Of Insightful Experts (mote): The Synergy Of Thought Chains And Expert Mixtures In Self-alignment Liu Zhili, Gou Yunhao, Chen Kai, Hong Lanqing, Gao Jiahui, Mi Fei, Zhang Yu, Li Zhenguo, Jiang Xin, Liu Qun, Kwok James T.
- Agentlite: A Lightweight Library For Building And Advancing Task-oriented LLM Agent System Liu Zhiwei, Yao Weiran, Zhang Jianguo, Yang Liangwei, Liu Zuxin, Tan Juntao, Choubey Prafulla K., Lan Tian, Wu Jason, Wang Huan, Heinecke Shelby, Xiong Caiming, Savarese Silvio
- Large Language Models Know What Makes Exemplary Contexts Long Quanyu, Chen Jianda, Wang Wenya, Pan Sinno Jialin
- Personalitychat: Conversation Distillation For Personalized Dialog Modeling With Facts And Traits Lotfi Ehsan, De Bruyn Maxime, Buhmann Jeska, Daelemans Walter
- Answering Real-world Clinical Questions Using Large Language Model Based Systems Low Yen Sia, Jackson Michael L., Hyde Rebecca J., Brown Robert E., Sanghavi Neil M., Baldwin Julian D., Pike C. William, Muralidharan Jananee, Hui Gavin, Alexander Natasha, Hassan Hadeel, Nene Rahul V., Pike Morgan, Pokrzywa Courtney J., Vedak Shivam, Yan Adam Paul, Yao Dong-han, Zipursky Amy R., Dinh Christina, Ballentine Philip, Derieg Dan C., Polony Vladimir, Chawdry Rehan N., Davies Jordan, Hyde Brigham B., Shah Nigam H., Gombar Saurabh
- Investigating Bias Representations In Llama 2 Chat Via Activation Steering Lu Dawn, Rimsky Nina
- YODA: Teacher-student Progressive Learning For Language Models Lu Jianqiao, Zhong Wanjun, Wang Yufei, Guo Zhijiang, Zhu Qi, Huang Wenyong, Wang Yanlin, Mi Fei, Wang Baojun, Wang Yasheng, Shang Lifeng, Jiang Xin, Liu Qun
- Mental Modeling Of Reinforcement Learning Agents By Language Models Lu Wenhao, Zhao Xufeng, Spisak Josua, Lee Jae Hee, Wermter Stefan
- Arena Learning: Build Data Flywheel For Llms Post-training Via Simulated Chatbot Arena Luo Haipeng, Sun Qingfeng, Xu Can, Zhao Pu, Lin Qingwei, Lou Jianguang, Chen Shifeng, Tang Yansong, Chen Weizhu
- Decoupled Alignment For Robust Plug-and-play Adaptation Luo Haozheng, Yu Jiahao, Zhang Wenxin, Li Jialong, Hu Jerry Yao-chieh, Xing Xinyu, Liu Han
- End-to-end Neuro-symbolic Reinforcement Learning With Textual Explanations Luo Lirui, Zhang Guoxi, Xu Hongming, Yang Yaodong, Fang Cong, Li Qing
- WILBUR: Adaptive In-context Learning For Robust And Accurate Web Agents Lutz Michael, Bohra Arth, Saroyan Manvel, Harutyunyan Artem, Campagna Giovanni
- Weblinx: Real-world Website Navigation With Multi-turn Dialogue Lù Xing Han, Kasner Zdeněk, Reddy Siva
- Agentboard: An Analytical Evaluation Board Of Multi-turn LLM Agents Ma Chang, Zhang Junlei, Zhu Zhihao, Yang Cheng, Yang Yujiu, Jin Yaohui, Lan Zhenzhong, Kong Lingpeng, He Junxian
- Coco-agent: A Comprehensive Cognitive MLLM Agent For Smartphone GUI Automation Ma Xinbei, Zhang Zhuosheng, Zhao Hai
- Sciagent: Tool-augmented Language Models For Scientific Reasoning Ma Yubo, Gou Zhibin, Hao Junheng, Xu Ruochen, Wang Shuohang, Pan Liangming, Yang Yujiu, Cao Yixin, Sun Aixin, Awadalla Hany, Chen Weizhu
- Large Language Models As Minecraft Agents Madge Chris, Poesio Massimo
- A LLM Benchmark Based On The Minecraft Builder Dialog Agent Task Madge Chris, Poesio Massimo
- Evaluating Very Long-term Conversational Memory Of LLM Agents Maharana Adyasha, Lee Dong-ho, Tulyakov Sergey, Bansal Mohit, Barbieri Francesco, Fang Yuwei
- Leveraging Environment Interaction For Automated PDDL Generation And Planning With Large Language Models Mahdavi Sadegh, Aoki Raquel, Tang Keyi, Cao Yanshuai
- From Tarzan To Tolkien: Controlling The Language Proficiency Level Of Llms For Content Generation Malik Ali, Mayhew Stephen, Piech Chris, Bicknell Klinton
- Coalitions Of Large Language Models Increase The Robustness Of AI Agents Mangal Prattyush, Mak Carol, Kanakis Theo, Donovan Timothy, Braines Dave, Pyzer-knapp Edward
- Reinforced Prompt Personalization For Recommendation With Large Language Models Mao Wenyu, Wu Jiancan, Chen Weijian, Gao Chongming, Wang Xiang, He Xiangnan
- Don't Forget Your Reward Values: Language Model Alignment Via Value-based Calibration Mao Xin, Li Feng-lin, Xu Huimin, Zhang Wei, Luu Anh Tuan
- LLM Critics Help Catch LLM Bugs Mcaleese Nat, Pokorny Rai Michael, Uribe Juan Felipe Ceron, Nitishinskaya Evgenia, Trebacz Maja, Leike Jan
- Realhf: Optimized RLHF Training For Large Language Models Through Parameter Reallocation Mei Zhiyu, Fu Wei, Li Kaiwei, Wang Guangju, Zhang Huanchen, Wu Yi
- Logic-enhanced Language Model Agents For Trustworthy Social Simulations Mensfelt Agnieszka, Stathis Kostas, Trencsenyi Vince
- Transforming Wearable Data Into Health Insights Using Large Language Model Agents Merrill Mike A., Paruchuri Akshay, Rezaei Naghmeh, Kovacs Geza, Perez Javier, Liu Yun, Schenck Erik, Hammerquist Nova, Sunshine Jake, Tailor Shyam, Ayush Kumar, Su Hao-wei, He Qian, Mclean Cory Y., Malhotra Mark, Patel Shwetak, Zhan Jiening, Althoff Tim, Mcduff Daniel, Liu Xin
- A Comparison Of LLM Finetuning Methods & Evaluation Metrics With Travel Chatbot Use Case Meyer Sonia, Singh Shreya, Tam Bertha, Ton Christopher, Ren Angel
- Subject-driven Text-to-image Generation Via Preference-based Reinforcement Learning Miao Yanting, Loh William, Kothawade Suraj, Poupart Pascal, Rashwan Abdullah, Li Yeqing
- Smart Language Agents In Real-world Planning Miin Annabelle, Wei Timothy
- Overview Of Dialogue Robot Competition 2023 Minato Takashi, Higashinaka Ryuichiro, Sakai Kurima, Funayama Tomo, Nishizaki Hiromitsu, Naga Takayuki
- Llm-based Frameworks For API Argument Filling In Task-oriented Conversational Systems Mok Jisoo, Kachuee Mohammad, Dai Shuyang, Ray Shayan, Taghavi Tara, Yoon Sungroh
- Language-conditioned Offline RL For Multi-robot Navigation Morad Steven, Shankar Ajay, Blumenkamp Jan, Prorok Amanda
- Exploring The Landscape Of Large Language Models: Foundations, Techniques, And Challenges Moradi Milad, Yan Ke, Colwell David, Samwald Matthias, Asgari Rhona
- Evaluating The Evaluator: Measuring Llms' Adherence To Task Evaluation Instructions Murugadoss Bhuvanashree, Poelitz Christian, Drosos Ian, Le Vu, Mckenna Nick, Negreanu Carina Suzana, Parnin Chris, Sarkar Advait
- Negation Blindness In Large Language Models: Unveiling The NO Syndrome In Image Generation Nadeem Mohammad, Sohail Shahab Saquib, Cambria Erik, Schuller Björn W., Hussain Amir
- Review-feedback-reason (refer): A Novel Framework For NLG Evaluation And Reasoning Narsupalli Yaswanth, Chandra Abhranil, Muppirala Sreevatsa, Gupta Manish, Goyal Pawan
- Learning Goal-conditioned Representations For Language Reward Models Nath Vaskar, Slack Dylan, Da Jeff, Ma Yuntao, Zhang Hugh, Whitehead Spencer, Hendryx Sean
- Balancing Exploration And Exploitation In LLM Using Soft RLLF For Enhanced Negation Understanding Nguyen Ha-thanh, Satoh Ken
- Llm-based Conversational AI Therapist For Daily Functioning Screening And Psychotherapeutic Intervention Via Everyday Smart Devices Nie Jingping, Shao Hanya, Fan Yuang, Shao Qijia, You Haoxuan, Preindl Matthias, Jiang Xiaofan
- Enhancing Dialogue State Tracking Models Through Llm-backed User-agents Simulation Niu Cheng, Wang Xingguang, Cheng Xuxin, Song Juntong, Zhang Tong
- Role-play Zero-shot Prompting With Large Language Models For Open-domain Human-machine Conversation Njifenjou Ahmed, Sucal Virgile, Jabaian Bassam, Lefèvre Fabrice
- Mobileflow: A Multimodal LLM For Mobile GUI Agent Nong Songqin, Zhu Jiali, Wu Rui, Jin Jiongchao, Shan Shuo, Huang Xiutian, Xu Wenhao
- The Conversation Is The Command: Interacting With Real-world Autonomous Robot Through Natural Language Nwankwo Linus, Rueckert Elmar
- Multilingual Prompts In Llm-based Recommenders: Performance Across Languages Ozsoy Makbule Gulcin
- Hierarchical Continual Reinforcement Learning Via Large Language Model Pan Chaofan, Yang Xin, Wang Hao, Wei Wei, Li Tianrui
- Knowledgeable Agents By Offline Reinforcement Learning From Large Language Model Rollouts Pang Jing-cheng, Yang Si-hang, Li Kaiyuan, Zhang Jiaji, Chen Xiong-hui, Tang Nan, Yu Yang
- Measuring Memorization In RLHF For Code Completion Pappu Aneesh, Porter Billy, Shumailov Ilia, Hayes Jamie
- Building Trust In Mental Health Chatbots: Safety Metrics And Llm-based Evaluation Tools Park Jung In, Abbasian Mahyar, Azimi Iman, Bounds Dawn, Jun Angela, Han Jaesu, Mccarron Robert, Borelli Jessica, Li Jia, Mahmoudi Mona, Wiedenhoeft Carmen, Rahmani Amir
- Disentangling Length From Quality In Direct Preference Optimization Park Ryan, Rafailov Rafael, Ermon Stefano, Finn Chelsea
- The Ultimate Guide To Fine-tuning Llms From Basics To Breakthroughs: An Exhaustive Review Of Technologies, Research, Best Practices, Applied Research Challenges And Opportunities Parthasarathy Venkatesh Balavadhani, Zafar Ahtsham, Khan Aafaq, Shahid Arsalan
- Evidence To Generate (E2G): A Single-agent Two-step Prompting For Context Grounded And Retrieval Augmented Reasoning Parvez Md Rizwan
- Large Language Models Can Self-improve At Web Agent Tasks Patel Ajay, Hofmarcher Markus, Leoveanu-condrei Claudiu, Dinu Marius-constantin, Callison-burch Chris, Hochreiter Sepp
- Goex: Perspectives And Designs Towards A Runtime For Autonomous LLM Applications Patil Shishir G., Zhang Tianjun, Fang Vivian, C. Noppapon, Huang Roy, Hao Aaron, Casado Martin, Gonzalez Joseph E., Popa Raluca Ada, Stoica Ion
- Generative AI As A Metacognitive Agent: A Comparative Mixed-method Study With Human Participants On Icf-mimicking Exam Performance Pavlovic Jelena, Krstic Jugoslav, Mitrovic Luka, Babic Djordje, Milosavljevic Adrijana, Nikolic Milena, Karaklic Tijana, Mitrovic Tijana
- Autonomous Workflow For Multimodal Fine-grained Training Assistants Towards Mixed Reality Pei Jiahuan, Viola Irene, Huang Haochen, Wang Junxiao, Ahsan Moonisa, Ye Fanghua, Yiming Jiang, Sai Yao, Wang Di, Chen Zhumin, Ren Pengjie, Cesar Pablo
- Reasoning With Large Language Models, A Survey Plaat Aske, Wong Annie, Verberne Suzan, Broekens Joost, Van Stein Niki, Back Thomas
- Iterative Experience Refinement Of Software-developing Agents Qian Chen, Li Jiahao, Dang Yufan, Liu Wei, Wang Yifei, Xie Zihao, Chen Weize, Yang Cheng, Zhang Yingli, Liu Zhiyuan, Sun Maosong
- Agent Planning With World Knowledge Model Qiao Shuofei, Fang Runnan, Zhang Ningyu, Zhu Yuqi, Chen Xiang, Deng Shumin, Jiang Yong, Xie Pengjun, Huang Fei, Chen Huajun
- Automatically Generating Numerous Context-driven SFT Data For Llms Across Diverse Granularity Quan Shanghaoran
- Towards Generalizable Agents In Text-based Educational Environments: A Study Of Integrating RL With Llms Radmehr Bahar, Singla Adish, Käser Tanja
- STEER: Assessing The Economic Rationality Of Large Language Models Raman Narun, Lundy Taylor, Amouyal Samuel, Levine Yoav, Leyton-brown Kevin, Tennenholtz Moshe
- Self-refinement Of Language Models From External Proxy Metrics Feedback Ramji Keshav, Lee Young-suk, Astudillo Ramón Fernandez, Sultan Md Arafat, Naseem Tahira, Munawar Asim, Florian Radu, Roukos Salim
- WARP: On The Benefits Of Weight Averaged Rewarded Policies Ramé Alexandre, Ferret Johan, Vieillard Nino, Dadashi Robert, Hussenot Léonard, Cedoz Pierre-louis, Sessa Pier Giuseppe, Girgin Sertan, Douillard Arthur, Bachem Olivier
- Capturing Minds, Not Just Words: Enhancing Role-playing Language Models With Personality-indicative Data Ran Yiting, Wang Xintao, Xu Rui, Yuan Xinfeng, Liang Jiaqing, Xiao Yanghua, Yang Deqing
- A Critical Look At Tokenwise Reward-guided Text Generation Rashid Ahmad, Wu Ruotian, Grosse Julia, Kristiadi Agustinus, Poupart Pascal
- Codeattack: Revealing Safety Generalization Challenges Of Large Language Models Via Code Completion Ren Qibing, Gao Chang, Shao Jing, Yan Junchi, Tan Xin, Lam Wai, Ma Lizhuang
- Self-reflection In LLM Agents: Effects On Problem-solving Performance Renze Matthew, Guven Erhan
- Countering Reward Over-optimization In LLM With Demonstration-guided Reinforcement Learning Rita Mathieu, Strub Florian, Chaabouni Rahma, Michel Paul, Dupoux Emmanuel, Pietquin Olivier
- Exploring Advanced Large Language Models With Llmsuite Roffo Giorgio
- Talking To Machines: Do You Read Me? Rojas-barahona Lina M.
- Procedural Adherence And Interpretability Through Neuro-symbolic Generative Agents Rothkopf Raven, Zeng Hannah Tongxin, Santolucito Mark
- ERATTA: Extreme RAG For Table To Answers With Large Language Models Roychowdhury Sohini, Krema Marko, Mahammad Anvar, Moore Brian, Mukherjee Arijit, Prakashchandra Punit
- Mitigating Hallucination In Fictional Character Role-play Sadeq Nafis, Xie Zhouhang, Kang Byungkyu, Lamba Prarit, Gao Xiang, Mcauley Julian
- Athena: Safe Autonomous Agents With Verbal Contrastive Learning Sadhu Tanmana, Pesaranghader Ali, Chen Yanan, Yi Dong Hoon
- ICAL: Continual Learning Of Multimodal Agents By Transforming Trajectories Into Actionable Insights Sarch Gabriel, Jang Lawrence, Tarr Michael J., Cohen William W., Marino Kenneth, Fragkiadaki Katerina
- HELPER-X: A Unified Instructable Embodied Agent To Tackle Four Interactive Vision-language Domains With Memory-augmented Language Models Sarch Gabriel, Somani Sahil, Kapoor Raghav, Tarr Michael J., Fragkiadaki Katerina
- An Implementation Of Werewolf Agent That Does Not Truly Trust Llms Sato Takehiro, Ozaki Shintaro, Yokoyama Daisaku
- Improving The Validity Of Automatically Generated Feedback Via Reinforcement Learning Scarlatos Alexander, Smith Digory, Woodhead Simon, Lan Andrew
- Ruffle&riley: Insights From Designing And Evaluating A Large Language Model-based Conversational Tutoring System Schmucker Robin, Xia Meng, Azaria Amos, Mitchell Tom
- Metacognitive Myopia In Large Language Models Scholten Florian, Rebholz Tobias R., Hütter Mandy
- THREAD: Thinking Deeper With Recursive Spawning Schroeder Philip, Morgan Nathaniel, Luo Hongyin, Glass James
- In-context Learning Agents Are Asymmetric Belief Updaters Schubert Johannes A., Jagadish Akshay K., Binz Marcel, Schulz Eric
- Ai-native Memory: A Pathway From Llms Towards AGI Shang Jingbo, Zheng Zai, Wei Jiale, Ying Xiang, Tao Felix, Team Mindverse
- A Critical Evaluation Of AI Feedback For Aligning Large Language Models Sharma Archit, Keh Sedrick, Mitchell Eric, Finn Chelsea, Arora Kushal, Kollar Thomas
- Patched MOA: Optimizing Inference For Diverse Software Development Tasks Sharma Asankhaya
- A Preliminary Study On Using Large Language Models In Software Pentesting Shashwat Kumar, Hahn Francis, Ou Xinming, Goldgof Dmitry, Hall Lawrence, Ligatti Jay, Rajgopalan S. Raj, Tabari Armin Ziaie
- Nemo-aligner: Scalable Toolkit For Efficient Model Alignment Shen Gerald, Wang Zhilin, Delalleau Olivier, Zeng Jiaqi, Dong Yi, Egert Daniel, Sun Shengyang, Zhang Jimmy, Jain Sahil, Taghibakhshi Ali, Ausin Markel Sanz, Aithal Ashwath, Kuchaiev Oleksii
- Shortcutsbench: A Large-scale Real-world Benchmark For Api-based Agents Shen Haiyang, Li Yue, Meng Desong, Cai Dongqi, Qi Sheng, Zhang Li, Xu Mengwei, Ma Yun
- Improving Reinforcement Learning From Human Feedback Using Contrastive Rewards Shen Wei, Zhang Xiaoying, Yao Yuanshun, Zheng Rui, Guo Hongyi, Liu Yang
- Small Llms Are Weak Tool Learners: A Multi-llm Agent Shen Weizhou, Li Chenliang, Chen Hongzhan, Yan Ming, Quan Xiaojun, Chen Hehong, Zhang Ji, Huang Fei
- Toursynbio: A Multi-modal Large Model And Agent Framework To Bridge Text And Protein Sequences For Protein Engineering Shen Yiqing, Chen Zan, Mamalakis Michail, Liu Yungeng, Li Tianbin, Su Yanzhou, He Junjun, Liò Pietro, Wang Yu Guang
- Beyond Human Preferences: Exploring Reinforcement Learning Trajectory Evaluation And Improvement Through Llms Shen Zichao, Zhu Tianchen, Sun Qingyun, Gao Shiqi, Li Jianxin
- From Language Models To Practical Self-improving Computer Agents Sheng Alex
- Opex: A Component-wise Analysis Of Llm-centric Agents In Embodied Instruction Following Shi Haochen, Sun Zhiyuan, Yuan Xingdi, Côté Marc-alexandre, Liu Bang
- Ask-eda: A Design Assistant Empowered By LLM, Hybrid RAG And Abbreviation De-hallucination Shi Luyao, Kazda Michael, Sears Bradley, Shropshire Nick, Puri Ruchir
- Ehragent: Code Empowers Large Language Models For Few-shot Complex Tabular Reasoning On Electronic Health Records Shi Wenqi, Xu Ran, Zhuang Yuchen, Yu Yue, Zhang Jieyu, Wu Hang, Zhu Yuanda, Ho Joyce, Yang Carl, Wang May D.
- Large Language Models Are Learnable Planners For Long-term Recommendation Shi Wentao, He Xiangnan, Zhang Yang, Gao Chongming, Li Xinyue, Zhang Jizhi, Wang Qifan, Feng Fuli
- Agentre: An Agent-based Framework For Navigating Complex Information Landscapes In Relation Extraction Shi Yuchen, Jiang Guochao, Qiu Tian, Yang Deqing
- Eragent: Enhancing Retrieval-augmented Language Models With Improved Accuracy, Efficiency, And Personalization Shi Yunxiao, Zi Xing, Shi Zijing, Zhang Haimin, Wu Qiang, Xu Min
- An Active Inference Strategy For Prompting Reliable Responses From Large Language Models In Medical Practice Shusterman Roma, Waters Allison C., O'neill Shannon, Luu Phan, Tucker Don M.
- Llm-dcache: Improving Tool-augmented Llms With Gpt-driven Localized Data Caching Singh Simranjit, Fore Michael, Karatzas Andreas, Lee Chaehong, Jian Yanan, Shangguan Longfei, Yu Fuxun, Anagnostopoulos Iraklis, Stamoulis Dimitrios
- Evaluating Zero-shot GPT-4V Performance On 3D Visual Question Answering Benchmarks Singh Simranjit, Pavlakos Georgios, Stamoulis Dimitrios
- LGR2: Language Guided Reward Relabeling For Accelerating Hierarchical Reinforcement Learning Singh Utsav, Bhattacharyya Pramit, Namboodiri Vinay P.
- Scaling LLM Test-time Compute Optimally Can Be More Effective Than Scaling Model Parameters Snell Charlie, Lee Jaehoon, Xu Kelvin, Kumar Aviral
- Chatting Up Attachment: Using Llms To Predict Adult Bonds Soares Paulo, Mccurdy Sean, Gerber Andrew J., Fonagy Peter
- From Words To Worlds: Transforming One-line Prompt Into Immersive Multi-modal Digital Stories With Communicative LLM Agent Sohn Samuel S., Li Danrui, Zhang Sen, Chang Che-jui, Kapadia Mubbasir
- A New Pipeline For Generating Instruction Dataset Via RAG And Self Fine-tuning Song Chih-wei, Lee Yu-kai, Tsai Yin-te
- Position: Leverage Foundational Models For Black-box Optimization Song Xingyou, Tian Yingtao, Lange Robert Tjarko, Lee Chansoo, Tang Yujin, Chen Yutian
- Pedagogical Alignment Of Large Language Models Sonkar Shashank, Ni Kangqi, Chaudhary Sapana, Baraniuk Richard G.
- Empo: Theory-driven Dataset Construction For Empathetic Response Generation Through Preference Optimization Sotolar Ondrej
- Virtual Agents For Alcohol Use Counseling: Exploring Llm-powered Motivational Interviewing Steenstra Ian, Nouraei Farnaz, Arjmand Mehdi, Bickmore Timothy W.
- LACIE: Listener-aware Finetuning For Confidence Calibration In Large Language Models Stengel-eskin Elias, Hase Peter, Bansal Mohit
- RLVF: Learning From Verbal Feedback Without Overgeneralization Stephan Moritz, Khazatsky Alexander, Mitchell Eric, Chen Annie S, Hsu Sheryl, Sharma Archit, Finn Chelsea
- Structured Chain-of-thought Prompting For Few-shot Generation Of Content-grounded QA Conversations Sultan Md Arafat, Ganhotra Jatin, Astudillo Ramón Fernandez
- Hierarchical In-context Reinforcement Learning With Hindsight Modular Reflections For Planning Sun Chuanneng, Huang Songjun, Pompili Dario
- Building Better AI Agents: A Provocation On The Utilisation Of Persona In Llm-based Conversational Agents Sun Guangzhi, Zhan Xiao, Such Jose
- Supervised Fine-tuning As Inverse Reinforcement Learning Sun Hao
- Lawluo: A Chinese Law Firm Co-run By LLM Agents Sun Jingyun, Dai Chengxiao, Luo Zhongze, Chang Yangbo, Li Yang
- Dr-llava: Visual Instruction Tuning With Symbolic Clinical Grounding Sun Shenghuan, Goldgof Gregory M., Schubert Alexander, Sun Zhiqing, Hartvigsen Thomas, Butte Atul J., Alaa Ahmed
- Towards Detecting Llms Hallucination Via Markov Chain-based Multi-agent Debate Framework Sun Xiaoxi, Li Jinpeng, Zhong Yan, Zhao Dongyan, Yan Rui
- DFA-RAG: Conversational Semantic Router For Large Language Model With Definite Finite Automaton Sun Yiyou, Hu Junjie, Cheng Wei, Chen Haifeng
- Enhancing Agent Learning Through World Dynamics Modeling Sun Zhiyuan, Shi Haochen, Côté Marc-alexandre, Berseth Glen, Yuan Xingdi, Liu Bang
- Taskgen: A Task-based, Memory-infused Agentic Framework Using Strictjson Tan John Chong Min, Saroj Prince, Runwal Bharat, Maheshwari Hardik, Sheng Brian Lim Yi, Cottrill Richard, Chona Alankrit, Kumar Ambuj, Motani Mehul
- Jill Watson: A Virtual Teaching Assistant Powered By Chatgpt Taneja Karan, Maiti Pratyusha, Kakar Sandeep, Guruprasad Pranav, Rao Sanjeev, Goel Ashok K.
- Prioritizing Safeguarding Over Autonomy: Risks Of LLM Agents For Science Tang Xiangru, Jin Qiao, Zhu Kunlun, Yuan Tongxin, Zhang Yichi, Zhou Wangchunshu, Qu Meng, Zhao Yilun, Tang Jian, Zhang Zhuosheng, Cohan Arman, Lu Zhiyong, Gerstein Mark
- Enhancing Commentary Strategies For Imperfect Information Card Games: A Study Of Large Language Models In Guandan Commentary Tao Meiling, Liang Xuechen, Wang Ziyi, Tao Yiling, Shi Tianyu
- When To Trust Llms: Aligning Confidence With Response Quality Tao Shuchang, Yao Liuyi, Ding Hanxing, Xie Yuexiang, Cao Qi, Sun Fei, Gao Jinyang, Shen Huawei, Ding Bolin
- A Survey On Self-evolution Of Large Language Models Tao Zhengwei, Lin Ting-en, Chen Xiancai, Li Hangyu, Wu Yuchuan, Li Yongbin, Jin Zhi, Huang Fei, Tao Dacheng, Zhou Jingren
- Distilling Llms' Decomposition Abilities Into Compact Language Models Tarasov Denis, Shridhar Kumar
- Dynamic Contexts For Generating Suggestion Questions In RAG Based Conversational Systems Tayal Anuja, Tyagi Aman
- Embedding-aligned Language Models Tennenholtz Guy, Chow Yinlam, Hsu Chih-wei, Shani Lior, Liang Ethan, Boutilier Craig
- Metacognition Is All You Need? Using Introspection In Generative Agents To Improve Goal-directed Behavior Toy Jason, Macadam Josh, Tabor Phil
- Charactereval: A Chinese Benchmark For Role-playing Conversational Agent Evaluation Tu Quan, Fan Shilong, Tian Zihang, Yan Rui
- Lifelong Robot Library Learning: Bootstrapping Composable And Generalizable Skills For Embodied Control With Language Models Tziafas Georgios, Kasaei Hamidreza
- Bootstrapping Llm-based Task-oriented Dialogue Agents Via Self-talk Ulmer Dennis, Mansimov Elman, Lin Kaixiang, Sun Justin, Gao Xibin, Zhang Yi
- Hypotermqa: Hypothetical Terms Dataset For Benchmarking Hallucination Tendency Of Llms Uluoglakci Cem, Temizel Tugba Taskaya
- Investigating Context Effects In Similarity Judgements In Large Language Models Uprety Sagar, Jaiswal Amit Kumar, Liu Haiming, Song Dawei
- Natural Language Programming In Medicine: Administering Evidence Based Clinical Workflows With Autonomous Agents Powered By Generative Large Language Models Vaid Akhil, Lampert Joshua, Lee Juhee, Sawant Ashwin, Apakama Donald, Sakhuja Ankit, Soroush Ali, Bick Sarah, Abbott Ethan, Gomez Hernando, Hadley Michael, Lee Denise, Landi Isotta, Duong Son Q, Bussola Nicole, Nabeel Ismail, Muehlstedt Silke, Freeman Robert, Kovatch Patricia, Carr Brendan, Wang Fei, Glicksberg Benjamin, Argulian Edgar, Lerakis Stamatios, Khera Rohan, Reich David L., Kraft Monica, Charney Alexander, Nadkarni Girish
- On The Brittle Foundations Of React Prompting For Agentic Large Language Models Verma Mudit, Bhambri Siddhant, Kambhampati Subbarao
- Theory Of Mind Abilities Of Large Language Models In Human-robot Interaction: An Illusion? Verma Mudit, Bhambri Siddhant, Kambhampati Subbarao
- RAG Based Question-answering For Contextual Response Prediction System Veturi Sriram, Vaichal Saurabh, Jagadheesh Reshma Lal, Tripto Nafis Irtiza, Yan Nian
- Input Conditioned Graph Generation For Language Agents Vierling Lukas, Fu Jie, Chen Kai
- Instruction Following With Goal-conditioned Reinforcement Learning In Virtual Environments Volovikova Zoya, Skrynnik Alexey, Kuderov Petr, Panov Aleksandr I.
- Cot Rerailer: Enhancing The Reliability Of Large Language Models In Complex Reasoning Tasks Through Error Detection And Correction Wan Guangya, Wu Yuqi, Chen Jie, Li Sheng
- Mllm-tool: A Multimodal Large Language Model For Tool Agent Learning Wang Chenyu, Luo Weixin, Chen Qianyu, Mai Haonan, Guo Jindi, Dong Sixun, Xuan Xiaohua (Michael), Li Zhengxin, Ma Lin, Gao Shenghua
- Adapting Open-source Large Language Models For Cost-effective, Expert-level Clinical Note Generation With On-policy Reinforcement Learning Wang Hanyin, Gao Chufan, Liu Bolun, Xu Qiping, Hussein Guleid, Labban Mohamad El, Iheasirim Kingsley, Korsapati Hariprasad, Outcalt Chuck, Sun Jimeng
- Reinforcement Learning For Edit-based Non-autoregressive Neural Machine Translation Wang Hao, Morimura Tetsuro, Honda Ukyo, Kawahara Daisuke
- Interpretable Preferences Via Multi-objective Reward Modeling And Mixture-of-experts Wang Haoxiang, Xiong Wei, Xie Tengyang, Zhao Han, Zhang Tong
- Mixture-of-agents Enhances Large Language Model Capabilities Wang Junlin, Wang Jue, Athiwaratkun Ben, Zhang Ce, Zou James
- Prefclm: Enhancing Preference-based Reinforcement Learning With Crowdsourced Large Language Models Wang Ruiqi, Zhao Dezhong, Yuan Ziqin, Obi Ike, Min Byung-cheol
- Unique Security And Privacy Threats Of Large Language Model: A Comprehensive Survey Wang Shang, Zhu Tianqing, Liu Bo, Ding Ming, Guo Xu, Ye Dayong, Zhou Wanlei, Yu Philip S.
- Benchmark Self-evolving: A Multi-agent Framework For Dynamic LLM Evaluation Wang Siyuan, Long Zhuohan, Fan Zhihao, Wei Zhongyu, Huang Xuanjing
- Weaver: Foundation Models For Creative Writing Wang Tiannan, Chen Jiamin, Jia Qingrui, Wang Shuai, Fang Ruoyu, Wang Huilin, Gao Zhaowei, Xie Chunzhao, Xu Chuou, Dai Jihong, Liu Yibin, Wu Jialong, Ding Shengwei, Li Long, Huang Zhiwei, Deng Xinle, Yu Teng, Ma Gangan, Xiao Han, Chen Zixin, Xiang Danjun, Wang Yunxia, Zhu Yuanyuan, Xiao Yi, Wang Jing, Wang Yiru, Ding Siran, Huang Jiayang, Xu Jiayi, Tayier Yilihamu, Hu Zhenyu, Gao Yuan, Zheng Chengfeng, Ye Yueshu, Li Yihang, Wan Lei, Jiang Xinyue, Wang Yujie, Cheng Siyu, Song Zhule, Tang Xiangru, Xu Xiaohua, Zhang Ningyu, Chen Huajun, Jiang Yuchen Eleanor, Zhou Wangchunshu
- Using Advanced Llms To Enhance Smaller Llms: An Interpretable Knowledge Distillation Approach Wang Tong, Sudhir K., Hong Dat
- Reasoning Or Simply Next Token Prediction? A Benchmark For Stress-testing Large Language Models Wang Wentian, Kantor Paul, Feldman Jacob, Gallos Lazaros, Wang Hao
- Characteristic AI Agents Via Large Language Models Wang Xi, Dai Hongliang, Gao Shen, Li Piji
- Metatool: Facilitating Large Language Models To Master Tools With Meta-task Augmentation Wang Xiaohan, Li Dian, Zhao Yilin, Sinbadliu, Wang Hui
- Longllava: Scaling Multi-modal Llms To 1000 Images Efficiently Via Hybrid Architecture Wang Xidong, Song Dingjie, Chen Shunian, Zhang Chen, Wang Benyou
- Executable Code Actions Elicit Better LLM Agents Wang Xingyao, Chen Yangyi, Yuan Lifan, Zhang Yizhe, Li Yunzhu, Peng Hao, Ji Heng
- Modaverse: Efficiently Transforming Modalities With Llms Wang Xinyu, Zhuang Bohan, Wu Qi
- Maferw: Query Rewriting With Multi-aspect Feedbacks For Retrieval-augmented Large Language Models Wang Yujing, Zhang Hainan, Pang Liang, Zheng Hongwei, Zheng Zhiming
- Sibyl: Simple Yet Effective Agent Framework For Complex Real-world Reasoning Wang Yulong, Shen Tianhao, Liu Lifeng, Xie Jian
- M-RAG: Reinforcing Large Language Model Performance Through Retrieval-augmented Generation With Multiple Partitions Wang Zheng, Teo Shu Xian, Ouyang Jieer, Xu Yongjun, Shi Wei
- Genartist: Multimodal LLM As An Agent For Unified Image Generation And Editing Wang Zhenyu, Li Aoxue, Li Zhenguo, Liu Xihui
- Omnijarvis: Unified Vision-language-action Tokenization Enables Open-world Instruction Following Agents Wang Zihao, Cai Shaofei, Mu Zhancun, Lin Haowei, Zhang Ceyao, Liu Xuejie, Li Qing, Liu Anji, Ma Xiaojian, Liang Yitao
- Re2llm: Reflective Reinforcement Large Language Model For Session-based Recommendation Wang Ziyan, Du Yingpeng, Sun Zhu, Chua Haoyan, Feng Kaidong, Wang Wenya, Zhang Jie
- Large Language User Interfaces: Voice Interactive User Interfaces Powered By Llms Wasti Syed Mekael, Pu Ken Q., Neshati Ali
- Is There No Such Thing As A Bad Question? H4R: Hallucibot For Ratiocination, Rewriting, Ranking, And Routing Watson William, Cho Nicole, Srishankar Nishan
- Long-form Factuality In Large Language Models Wei Jerry, Yang Chengrun, Song Xinying, Lu Yifeng, Hu Nathan, Huang Jie, Tran Dustin, Peng Daiyi, Liu Ruibo, Huang Da, Du Cosmo, Le Quoc V.
- On-policy Fine-grained Knowledge Feedback For Hallucination Mitigation Wen Xueru, Lu Xinyu, Guan Xinyan, Lu Yaojie, Lin Hongyu, He Ben, Han Xianpei, Sun Le
- Aligning Large Language Models For Controllable Recommendations Lu Wensheng, Lian Jianxun, Zhang Wei, Li Guanghua, Zhou Mingyang, Liao Hao, Xie Xing
- Llama Pro: Progressive Llama With Block Expansion Wu Chengyue, Gan Yukang, Ge Yixiao, Lu Zeyu, Wang Jiahao, Feng Ye, Shan Ying, Luo Ping
- RLPF: Reinforcement Learning From Prediction Feedback For User Summarization With Llms Wu Jiaxing, Ning Lin, Liu Luyang, Lee Harrison, Wu Neo, Wang Chao, Prakash Sushant, O'banion Shawn, Green Bradley, Xie Jun
- Can Large Language Models Understand Uncommon Meanings Of Common Words? Wu Jinyang, Che Feihu, Zheng Xinxin, Zhang Shuai, Jin Ruihan, Nie Shuai, Shao Pengpeng, Tao Jianhua
- Noiseboost: Alleviating Hallucination With Noise Perturbation For Multimodal Large Language Models Wu Kai, Jiang Boyuan, Jiang Zhengkai, He Qingdong, Luo Donghao, Wang Shengzhi, Liu Qingwen, Wang Chengjie
- Mixture-of-skills: Learning To Optimize Data Usage For Fine-tuning Large Language Models Wu Minghao, Vu Thuy-trang, Qu Lizhen, Haffari Gholamreza
- GUI Action Narrator: Where And When Did That Action Take Place? Wu Qinchen, Gao Difei, Lin Kevin Qinghong, Wu Zhuoyu, Guo Xiangwu, Li Peiran, Zhang Weichen, Wang Hengxu, Shou Mike Zheng
- Avatar: Optimizing LLM Agents For Tool-assisted Knowledge Retrieval Wu Shirley, Zhao Shiyu, Huang Qian, Huang Kexin, Yasunaga Michihiro, Cao Kaidi, Ioannidis Vassilis N., Subbian Karthik, Leskovec Jure, Zou James
- Enhance Reasoning For Large Language Models In The Game Werewolf Wu Shuang, Zhu Liwen, Yang Tao, Xu Shiwei, Fu Qiang, Wei Yang, Fu Haobo
- Sunnie: An Anthropomorphic Llm-based Conversational Agent For Mental Well-being Activity Recommendation Wu Siyi, Han Feixue, Yao Bingsheng, Xie Tianyi, Zhao Xuan, Wang Dakuo
- Unigen: A Unified Framework For Textual Dataset Generation Using Large Language Models Wu Siyuan, Huang Yue, Gao Chujie, Chen Dongping, Zhang Qihui, Wan Yao, Zhou Tianyi, Zhang Xiangliang, Gao Jianfeng, Xiao Chaowei, Sun Lichao
- Enhancing Q-learning With Large Language Model Heuristics Wu Xiefeng
- Knowledge-infused Legal Wisdom: Navigating LLM Consultation Through The Lens Of Diagnostics And Positive-unlabeled Reinforcement Learning Wu Yang, Wang Chenghao, Gumusel Ece, Liu Xiaozhong
- Agentkit: Structured LLM Reasoning With Dynamic Graphs Wu Yue, Fan Yewen, Min So Yeon, Prabhumoye Shrimai, Mcaleer Stephen, Bisk Yonatan, Salakhutdinov Ruslan, Li Yuanzhi, Mitchell Tom
- Imitating Language Via Scalable Inverse Reinforcement Learning Wulfmeier Markus, Bloesch Michael, Vieillard Nino, Ahuja Arun, Bornschein Jorg, Huang Sandy, Sokolov Artem, Barnes Matt, Desjardins Guillaume, Bewley Alex, Bechtle Sarah Maria Elisabeth, Springenberg Jost Tobias, Momchev Nikola, Bachem Olivier, Geist Matthieu, Riedmiller Martin
- FOFO: A Benchmark To Evaluate Llms' Format-following Capability Xia Congying, Xing Chen, Du Jiangshu, Yang Xinyi, Feng Yihao, Xu Ran, Yin Wenpeng, Xiong Caiming
- Inverse-q*: Token Level Reinforcement Learning For Aligning Large Language Models Without Preference Data Xia Han, Gao Songyang, Ge Qiming, Xi Zhiheng, Zhang Qi, Huang Xuanjing
- Guardagent: Safeguard LLM Agents By A Guard Agent Via Knowledge-enabled Reasoning Xiang Zhen, Zheng Linzhi, Li Yanjie, Hong Junyuan, Li Qinbin, Xie Han, Zhang Jiawei, Xiong Zidi, Xie Chulin, Yang Carl, Song Dawn, Li Bo
- Infllm: Training-free Long-context Extrapolation For Llms With An Efficient Context Memory Xiao Chaojun, Zhang Pengle, Han Xu, Xiao Guangxuan, Lin Yankai, Zhang Zhengyan, Liu Zhiyuan, Sun Maosong
- Leveraging Human Revisions For Improving Text-to-layout Models Xie Amber, Cheng Chin-yi, Huang Forrest, Li Yang
- Large Multimodal Agents: A Survey Xie Junlin, Chen Zhihong, Zhang Ruifei, Wan Xiang, Li Guanbin
- Exploratory Preference Optimization: Harnessing Implicit Q*-approximation For Sample-efficient RLHF Xie Tengyang, Foster Dylan J., Krishnamurthy Akshay, Rosset Corby, Awadallah Ahmed, Rakhlin Alexander
- Understanding The Weakness Of Large Language Model Agents Within A Complex Android Environment Xing Mingzhe, Zhang Rongkai, Xue Hui, Chen Qi, Yang Fan, Xiao Zhen
- Converging Paradigms: The Synergy Of Symbolic And Connectionist AI In Llm-empowered Autonomous Agents Xiong Haoyi, Wang Zhiyuan, Li Xuhong, Bian Jiang, Xie Zeke, Mumtaz Shahid, Barnes Laura E.
- Building Math Agents With Multi-turn Iterative Preference Learning Xiong Wei, Shi Chengshuai, Shen Jiaming, Rosenberg Aviv, Qin Zhen, Calandriello Daniele, Khalman Misha, Joshi Rishabh, Piot Bilal, Saleh Mohammad, Jin Chi, Zhang Tong, Liu Tianqi
- Watch Every Step! LLM Agent Learning Via Iterative Step-level Process Refinement Xiong Weimin, Song Yifan, Zhao Xiutian, Wu Wenhao, Wang Xun, Wang Ke, Li Cheng, Peng Wei, Li Sujian
- Aligning Large Language Models Via Fine-grained Supervision Xu Dehong, Qiu Liang, Kim Minseok, Ladhak Faisal, Do Jaeyoung
- KIWI: A Dataset Of Knowledge-intensive Writing Instructions For Answering Research Questions Xu Fangyuan, Lo Kyle, Soldaini Luca, Kuehl Bailey, Choi Eunsol, Wadden David
- Large Language Models For Education: A Survey Xu Hanyi, Gan Wensheng, Qi Zhenlian, Wu Jiayang, Yu Philip S.
- Rejection Improves Reliability: Training Llms To Refuse Unknown Questions Using RL From Knowledge Feedback Xu Hongshen, Zhu Zichen, Zhang Situo, Ma Da, Fan Shuai, Chen Lu, Yu Kai
- Redagent: Red Teaming Large Language Models With Context-aware Autonomous Language Agent Xu Huiyu, Zhang Wenhui, Wang Zhibo, Xiao Feng, Zheng Rui, Feng Yunhe, Ba Zhongjie, Ren Kui
- Reverse Image Retrieval Cues Parametric Memory In Multimodal Llms Xu Jialiang, Moor Michael, Leskovec Jure
- Tur[k]ingbench: A Challenge Benchmark For Web Agents Xu Kevin, Kordi Yeganeh, Nayak Tanay, Asija Ado, Wang Yizhong, Sanders Kate, Byerly Adam, Zhang Jingyu, Van Durme Benjamin, Khashabi Daniel
- MINDECHO: Role-playing Language Agents For Key Opinion Leaders Xu Rui, Lu Dakuan, Tan Xiaoyu, Wang Xintao, Yuan Siyu, Chen Jiangjie, Chu Wei, Xu Yinghui
- Is DPO Superior To PPO For LLM Alignment? A Comprehensive Study Xu Shusheng, Fu Wei, Gao Jiaxuan, Ye Wenjie, Liu Weilin, Mei Zhiyu, Wang Guangju, Yu Chao, Wu Yi
- AIOS Compiler: LLM As Interpreter For Natural Language Programming And Flow Programming Of AI Agents Xu Shuyuan, Li Zelong, Mei Kai, Zhang Yongfeng
- Sayself: Teaching Llms To Express Confidence With Self-reflective Rationales Xu Tianyang, Wu Shujin, Diao Shizhe, Liu Xiaoze, Wang Xingyao, Chen Yangyi, Gao Jing
- Hr-multiwoz: A Task Oriented Dialogue (TOD) Dataset For HR LLM Agent Xu Weijie, Huang Zicheng, Hu Wenxiang, Fang Xi, Cherukuri Rajesh Kumar, Nayyar Naumaan, Malandri Lorenzo, Sengamedu Srinivasan H.
- Generate-on-graph: Treat LLM As Both Agent And KG In Incomplete Knowledge Graph Question Answering Xu Yao, He Shizhu, Chen Jiabei, Wang Zihao, Song Yangqiu, Tong Hanghang, Liu Kang, Zhao Jun
- Empirical Evidence Of Large Language Model's Influence On Human Spoken Communication Yakura Hiromu, Lopez-lopez Ezequiel, Brinkmann Levin, Serna Ignacio, Gupta Prateek, Rahwan Iyad
- Mirror: A Multiple-perspective Self-reflection Method For Knowledge-rich Reasoning Yan Hanqi, Zhu Qinglin, Wang Xinyu, Gui Lin, He Yulan
- Exploring The LLM Journey From Cognition To Expression With Linear Representations Yan Yuzi, Li Jialian, Zhang Yipin, Yan Dong
- Social Life Simulation For Non-cognitive Skills Learning Yan Zihan, Xiang Yaohong, Huang Yun
- Crafting Customisable Characters With Llms: Introducing Simschat, A Persona-driven Role-playing Agent Framework Yang Bohao, Liu Dong, Tang Chen, Xiao Chenghao, Zhao Kun, Li Chao, Yuan Lin, Yang Guang, Huang Lanxiao, Lin Chenghua
- IM-RAG: Multi-round Retrieval-augmented Generation Through Learning Inner Monologues Yang Diji, Rao Jinmeng, Chen Kezhen, Guo Xiaoyuan, Zhang Yawen, Yang Jie, Zhang Yi
- Aligning Llms Through Multi-perspective User Preference Ranking-based Feedback For Programming Question Answering Yang Hongyu, He Liyang, Hou Min, Shen Shuanghong, Li Rui, Hou Jiahui, Ma Jianhui, Zhao Junda
- 3D-GRAND: A Million-scale Dataset For 3d-llms With Better Grounding And Less Hallucination Yang Jianing, Chen Xuweiyi, Madaan Nikhil, Iyengar Madhavan, Qian Shengyi, Fouhey David F., Chai Joyce
- If LLM Is The Wizard, Then Code Is The Wand: A Survey On How Code Empowers Large Language Models To Serve As Intelligent Agents Yang Ke, Liu Jiateng, Wu John, Yang Chaoqi, Fung Yi R., Li Sha, Huang Zixuan, Cao Xu, Wang Xingyao, Wang Yiquan, Ji Heng, Zhai Chengxiang
- Regularizing Hidden States Enables Learning Generalizable Reward Model For Llms Yang Rui, Ding Ruomeng, Lin Yong, Zhang Huan, Zhang Tong
- Selfgoal: Your Language Agents Already Know How To Achieve High-level Goals Yang Ruihan, Chen Jiangjie, Zhang Yikai, Yuan Siyu, Chen Aili, Richardson Kyle, Xiao Yanghua, Yang Deqing
- Confidence Calibration And Rationalization For Llms Via Multi-agent Deliberation Yang Ruixin, Rajagopal Dheeraj, Hayati Shirley Anugrah, Hu Bin, Kang Dongyeop
- Video As The New Language For Real-world Decision Making Yang Sherry, Walker Jacob, Parker-holder Jack, Du Yilun, Bruce Jake, Barreto Andre, Abbeel Pieter, Schuurmans Dale
- Is Crowdsourcing Breaking Your Bank? Cost-effective Fine-tuning Of Pre-trained Language Models With Proximal Policy Optimization Yang Shuo, Kasneci Gjergji
- Pre-trained Language Models Improve The Few-shot Prompt Ability Of Decision Transformer Yang Yu, Xu Pan
- React Meets Actre: When Language Agents Enjoy Training Data Autonomy Yang Zonghan, Li Peng, Yan Ming, Zhang Ji, Huang Fei, Liu Yang
- Curiousllm: Elevating Multi-document QA With Reasoning-infused Knowledge Graph Prompting Yang Zukang, Zhu Zixuan
- Preference-guided Reflective Sampling For Aligning Language Models Ye Hai, Ng Hwee Tou
- MMAU: A Holistic Benchmark Of Agent Capabilities Across Diverse Domains Yin Guoli, Bai Haoping, Ma Shuang, Nan Feng, Sun Yanchao, Xu Zhaoyang, Ma Shen, Lu Jiarui, Kong Xiang, Zhang Aonan, Yap Dian Ang, Zhang Yizhe, Ahnert Karsten, Kamath Vik, Berglund Mathias, Walsh Dominic, Gindele Tobias, Wiest Juergen, Lai Zhengfeng, Wang Xiaoming, Shan Jiulong, Cao Meng, Pang Ruoming, Wang Zirui
- Debunc: Mitigating Hallucinations In Large Language Model Agent Communication With Uncertainty Estimations Yoffe Luke, Amayuelas Alfonso, Wang William Yang
- TLCR: Token-level Continuous Reward For Fine-grained Reinforcement Learning From Human Feedback Yoon Eunseop, Yoon Hee Suk, Eom Soohwan, Han Gunsoo, Nam Daniel Wontae, Jo Daejin, On Kyoung-woon, Hasegawa-johnson Mark A., Kim Sungwoong, Yoo Chang D.
- Flow Of Reasoning: Efficient Training Of LLM Policy With Divergent Thinking Yu Fangxu, Jiang Lai, Kang Haoqiang, Hao Shibo, Qin Lianhui
- Neeko: Leveraging Dynamic Lora For Efficient Multi-character Role-playing Agent Yu Xiaoyan, Luo Tongxu, Wei Yifan, Lei Fangyu, Huang Yiming, Peng Hao, Zhu Liehuang
- EASYTOOL: Enhancing Llm-based Agents With Concise Tool Instruction Yuan Siyu, Song Kaitao, Chen Jiangjie, Tan Xu, Shen Yongliang, Kan Ren, Li Dongsheng, Yang Deqing
- R-judge: Benchmarking Safety Risk Awareness For LLM Agents Yuan Tongxin, He Zhiwei, Dong Lingzhong, Wang Yiming, Zhao Ruijie, Xia Tian, Xu Lizhen, Zhou Binglin, Li Fangqi, Zhang Zhuosheng, Wang Rui, Liu Gongshen
- Self-rewarding Language Models Yuan Weizhe, Pang Richard Yuanzhe, Cho Kyunghyun, Li Xian, Sukhbaatar Sainbayar, Xu Jing, Weston Jason
- Evaluating Character Understanding Of Large Language Models Via Character Profiling From Fictional Works Yuan Xinfeng, Yuan Siyu, Cui Yuhan, Lin Tianhe, Wang Xintao, Xu Rui, Chen Jiangjie, Yang Deqing
- Envgen: Generating And Adapting Environments Via Llms For Training Embodied Agents Zala Abhay, Cho Jaemin, Lin Han, Yoon Jaehong, Bansal Mohit
- Personarag: Enhancing Retrieval-augmented Generation Systems With User-centric Agents Zerhoudi Saber, Granitzer Michael
- Online Self-preferring Language Models Zhai Yuanzhao, Zhang Zhuo, Xu Kele, Peng Hanyang, Yu Yue, Feng Dawei, Yang Cheng, Ding Bo, Wang Huaimin
- Injecagent: Benchmarking Indirect Prompt Injections In Tool-integrated Large Language Model Agents Zhan Qiusi, Liang Zhixiang, Ying Zifan, Kang Daniel
- MC-GPT: Empowering Vision-and-language Navigation With Memory Map And Reasoning Chains Zhan Zhaohuan, Yu Lisha, Yu Sijie, Tan Guang
- Rest-mcts*: LLM Self-training Via Process Reward Guided Tree Search Zhang Dan, Zhoubian Sining, Hu Ziniu, Yue Yisong, Dong Yuxiao, Tang Jie
- Agentohana: Design Unified Data And Training Pipeline For Effective Agent Learning Zhang Jianguo, Lan Tian, Murthy Rithesh, Liu Zhiwei, Yao Weiran, Tan Juntao, Hoang Thai, Yang Liangwei, Feng Yihao, Liu Zuxin, Awalgaonkar Tulika, Niebles Juan Carlos, Savarese Silvio, Heinecke Shelby, Wang Huan, Xiong Caiming
- Xlam: A Family Of Large Action Models To Empower AI Agent Systems Zhang Jianguo, Lan Tian, Zhu Ming, Liu Zuxin, Hoang Thai, Kokane Shirley, Yao Weiran, Tan Juntao, Prabhakar Akshara, Chen Haolin, Liu Zhiwei, Feng Yihao, Awalgaonkar Tulika, Murthy Rithesh, Hu Eric, Chen Zeyuan, Xu Ran, Niebles Juan Carlos, Heinecke Shelby, Wang Huan, Savarese Silvio, Xiong Caiming
- Android In The Zoo: Chain-of-action-thought For GUI Agents Zhang Jiwen, Wu Jihao, Teng Yihua, Liao Minghui, Xu Nuo, Xiao Xiao, Wei Zhongyu, Tang Duyu
- Automated Multi-level Preference For Mllms Zhang Mengxi, Wu Wenhao, Lu Yu, Song Yuxin, Rong Kang, Yao Huanjin, Zhao Jianbo, Liu Fanglong, Sun Yifan, Feng Haocheng, Wang Jingdong
- Offline Training Of Language Model Agents With Functions As Learnable Weights Zhang Shaokun, Zhang Jieyu, Liu Jiale, Song Linxin, Wang Chi, Krishna Ranjay, Wu Qingyun
- Mindspeech: Continuous Imagined Speech Decoding Using High-density Fnirs And Prompt Tuning For Advanced Human-ai Interaction Zhang Suyi, Alam Ekram, Baber Jack, Bianco Francesca, Turner Edward, Chamanzar Maysam, Dehghani Hamid
- Agent-pro: Learning To Evolve Via Policy-level Reflection And Optimization Zhang Wenqi, Tang Ke, Wu Hai, Wang Mengna, Shen Yongliang, Hou Guiyang, Tan Zeqi, Li Peng, Zhuang Yueting, Lu Weiming
- Ask-before-plan: Proactive Language Agents For Real-world Planning Zhang Xuan, Deng Yang, Ren Zifeng, Ng See-kiong, Chua Tat-seng
- TREACLE: Thrifty Reasoning Via Context-aware LLM And Prompt Selection Zhang Xuechen, Huang Zijian, Taga Ege Onur, Joe-wong Carlee, Oymak Samet, Chen Jiasi
- LLM As A Mastermind: A Survey Of Strategic Reasoning With Large Language Models Zhang Yadong, Mao Shaoguang, Ge Tao, Wang Xun, De Wynter Adrian, Xia Yan, Wu Wenshan, Song Ting, Lan Man, Wei Furu
- Pybench: Evaluating LLM Agent On Various Real-world Coding Tasks Zhang Yaolun, Pan Yinxu, Wang Yudong, Cai Jie
- Worldqa: Multimodal World Knowledge In Videos Through Long-chain Reasoning Zhang Yuanhan, Zhang Kaichen, Li Bo, Pu Fanyi, Setiadharma Christopher Arif, Yang Jingkang, Liu Ziwei
- PPTC-R Benchmark: Towards Evaluating The Robustness Of Large Language Models For Powerpoint Task Completion Zhang Zekai, Guo Yiduo, Liang Yaobo, Zhao Dongyan, Duan Nan
- A Survey On The Memory Mechanism Of Large Language Model Based Agents Zhang Zeyu, Bo Xiaohe, Ma Chen, Li Rui, Chen Xu, Dai Quanyu, Zhu Jieming, Dong Zhenhua, Wen Ji-rong
- Longagent: Scaling Language Models To 128k Context Through Multi-agent Collaboration Zhao Jun, Zu Can, Xu Hao, Lu Yi, He Wei, Ding Yiwen, Gui Tao, Zhang Qi, Huang Xuanjing
- Diffagent: Fast And Accurate Text-to-image API Selection With Large Language Model Zhao Lirui, Yang Yue, Zhang Kaipeng, Shao Wenqi, Zhang Yuxin, Qiao Yu, Luo Ping, Ji Rongrong
- Aquila-med LLM: Pioneering Full-process Open-source Medical Language Models Zhao Lulu, Zeng Weihao, Shi Xiaofeng, Zhou Hua, Hao Donglin, Lin Yonghua
- Defending Large Language Models Against Jailbreak Attacks Via Layer-specific Editing Zhao Wei, Li Zhe, Li Yige, Zhang Ye, Sun Jun
- Enhancing Cross-domain Pre-trained Decision Transformers With Adaptive Attention Zhao Wenhao, Xu Qiushui, Xu Linjie, Song Lei, Wang Jinyu, Zhou Chunlai, Bian Jiang
- SWIFT: A Scalable Lightweight Infrastructure For Fine-tuning Zhao Yuze, Huang Jintao, Hu Jinghan, Wang Xingjun, Mao Yunlin, Zhang Daoze, Jiang Zeyinzi, Wu Zhikai, Ai Baole, Wang Ang, Zhou Wenmeng, Chen Yingda
- Adversarial Contrastive Decoding: Boosting Safety Alignment Of Large Language Models Via Opposite Prompt Optimization Zhao Zhengyue, Zhang Xiaoyun, Xu Kaidi, Hu Xing, Zhang Rui, Du Zidong, Guo Qi, Chen Yunji
- Gpt-4v(ision) Is A Generalist Web Agent, If Grounded Zheng Boyuan, Gou Boyu, Kil Jihyung, Sun Huan, Su Yu
- Balancing Enhancement, Harmlessness, And General Capabilities: Enhancing Conversational Llms With Direct RLHF Zheng Chen, Sun Ke, Wu Hang, Xi Chenguang, Zhou Xun
- Ali-agent: Assessing Llms' Alignment With Human Values Via Agent-based Evaluation Zheng Jingnan, Wang Han, Zhang An, Nguyen Tai D., Sun Jun, Chua Tat-seng
- More-3s: Multimodal-based Offline Reinforcement Learning With Shared Semantic Spaces Zheng Tianyu, Zhang Ge, Qu Xingwei, Kuang Ming, Huang Stephen W., He Zhaofeng
- Training Large Language Models For Reasoning Through Reverse Curriculum Reinforcement Learning Xi Zhiheng, Chen Wenxiang, Hong Boyang, Jin Senjie, Zheng Rui, He Wei, Ding Yiwen, Liu Shichun, Guo Xin, Wang Junzhe, Guo Honglin, Shen Wei, Fan Xiaoran, Zhou Yuhao, Dou Shihan, Wang Xiao, Zhang Xinbo, Sun Peng, Gui Tao, Zhang Qi, Huang Xuanjing
- Agentgym: Evolving Large Language Model-based Agents Across Diverse Environments Xi Zhiheng, Ding Yiwen, Chen Wenxiang, Hong Boyang, Guo Honglin, Wang Junzhe, Yang Dingwen, Liao Chenyang, Guo Xin, He Wei, Gao Songyang, Chen Lu, Zheng Rui, Zou Yicheng, Gui Tao, Zhang Qi, Qiu Xipeng, Huang Xuanjing, Wu Zuxuan, Jiang Yu-gang
- Navgpt-2: Unleashing Navigational Reasoning Capability For Large Vision-language Models Zhou Gengze, Hong Yicong, Wang Zun, Wang Xin Eric, Wu Qi
- Sequence To Sequence Reward Modeling: Improving RLHF By Language Feedback Zhou Jiayi, Ji Jiaming, Dai Juntao, Yang Yaodong
- Self-discover: Large Language Models Self-compose Reasoning Structures Zhou Pei, Pujara Jay, Ren Xiang, Chen Xinyun, Cheng Heng-tze, Le Quoc V., Chi Ed H., Zhou Denny, Mishra Swaroop, Zheng Huaixiu Steven
- Enhancing The General Agent Capabilities Of Low-parameter Llms Through Tuning And Multi-branch Reasoning Zhou Qinhao, Zhang Zihan, Xiang Xiang, Wang Ke, Wu Yuchuan, Li Yongbin
- Cogmg: Collaborative Augmentation Between Large Language Model And Knowledge Graph Zhou Tong, Chen Yubo, Liu Kang, Zhao Jun
- Archer: Training Language Model Agents Via Hierarchical Multi-turn RL Zhou Yifei, Zanette Andrea, Pan Jiayi, Levine Sergey, Kumar Aviral
- Defending Jailbreak Prompts Via In-context Adversarial Game Zhou Yujun, Han Yufei, Zhuang Haomin, Guo Kehan, Liang Zhenwen, Bao Hongyan, Zhang Xiangliang
- Llmbind: A Unified Modality-task Integration Framework Zhu Bin, Ning Munan, Jin Peng, Lin Bin, Huang Jinfa, Song Qi, Zhang Junwu, Tang Zhenyu, Pan Mingjun, Zhou Xing, Yuan Li
- Dynamic Evaluation Of Large Language Models By Meta Probing Agents Zhu Kaijie, Wang Jindong, Zhao Qinlin, Xu Ruochen, Xie Xing
- LIRE: Listwise Reward Enhancement For Preference Alignment Zhu Mingye, Liu Yi, Zhang Lei, Guo Junbo, Mao Zhendong
- Knowagent: Knowledge-augmented Planning For Llm-based Agents Zhu Yuqi, Qiao Shuofei, Ou Yixin, Deng Shumin, Zhang Ningyu, Lyu Shiwei, Shen Yue, Liang Lei, Gu Jinjie, Chen Huajun
🏷 Applications
- Multimodal Attention For Neural Machine Translation Caglayan Ozan, Barrault Loïc, Bougares Fethi
- Joint Copying And Restricted Generation For Paraphrase Cao Ziqiang, Luo Chuwei, Li Wenjie, Li Sujian
- Tree-to-sequence Attentional Neural Machine Translation Eriguchi Akiko, Hashimoto Kazuma, Tsuruoka Yoshimasa
- Multi-way, Multilingual Neural Machine Translation With A Shared Attention Mechanism Firat Orhan, Cho Kyunghyun, Bengio Yoshua
- Multimodal Compact Bilinear Pooling For Visual Question Answering And Visual Grounding Fukui Akira, Park Dong Huk, Yang Daylen, Rohrbach Anna, Darrell Trevor, Rohrbach Marcus
- Character-level Question Answering With Attention Golub David, He Xiaodong
- Revisiting Visual Question Answering Baselines Jabri Allan, Joulin Armand, Van Der Maaten Laurens
- Google's Multilingual Neural Machine Translation System: Enabling Zero-shot Translation Johnson Melvin, Schuster Mike, Le Quoc V., Krikun Maxim, Wu Yonghui, Chen Zhifeng, Thorat Nikhil, Viégas Fernanda, Wattenberg Martin, Corrado Greg, Hughes Macduff, Dean Jeffrey
- Fully Character-level Neural Machine Translation Without Explicit Segmentation Lee Jason, Cho Kyunghyun, Hofmann Thomas
- Interactive Attention For Neural Machine Translation Meng Fandong, Lu Zhengdong, Li Hang, Liu Qun
- Temporal Attention Model For Neural Machine Translation Sankaran Baskaran, Mi Haitao, Al-onaizan Yaser, Ittycheriah Abe
- Generative Deep Neural Networks For Dialogue: A Short Review Serban Iulian Vlad, Lowe Ryan, Charlin Laurent, Pineau Joelle
- Syntactically Guided Neural Machine Translation Stahlberg Felix, Hasler Eva, Waite Aurelien, Byrne Bill
- Neural Machine Translation Advised By Statistical Machine Translation Wang Xing, Lu Zhengdong, Tu Zhaopeng, Li Hang, Xiong Deyi, Zhang Min
- An Attentional Neural Conversation Model With Improved Specificity Yao Kaisheng, Peng Baolin, Zweig Geoffrey, Wong Kam-fai
- Don't Just Assume; Look And Answer: Overcoming Priors For Visual Question Answering Agrawal Aishwarya, Batra Dhruv, Parikh Devi, Kembhavi Aniruddha
- Weighted Transformer Network For Machine Translation Ahmed Karim, Keskar Nitish Shirish, Socher Richard
- Attention Is All You Need Vaswani Ashish, Shazeer Noam, Parmar Niki, Uszkoreit Jakob, Jones Llion, Gomez Aidan N., Kaiser Lukasz, Polosukhin Illia
- Towards Neural Machine Translation With Latent Tree Attention Bradbury James, Socher Richard
- Incorporating Global Visual Features Into Attention-based Neural Machine Translation Calixto Iacer, Liu Qun, Campbell Nick
- Enabling Multi-source Neural Machine Translation By Concatenating Source Sentences In Multiple Languages Dabre Raj, Cromieres Fabien, Kurohashi Sadao
- Neural Machine Translation Leveraging Phrase-based Models In A Hybrid Search Dahlmann Leonard, Matusov Evgeny, Petrushkov Pavel, Khadivi Shahram
- An Empirical Study On The Effectiveness Of Images In Multimodal Neural Machine Translation Delbrouck Jean-benoit, Dupont Stéphane
- Sockeye: A Toolkit For Neural Machine Translation Hieber Felix, Domhan Tobias, Denkowski Michael, Vilar David, Sokolov Artem, Clifton Ann, Post Matt
- Reinforced Mnemonic Reader For Machine Reading Comprehension Hu Minghao, Peng Yuxing, Huang Zhen, Qiu Xipeng, Wei Furu, Zhou Ming
- An Exploration Of Neural Sequence-to-sequence Architectures For Automatic Post-editing Junczys-dowmunt Marcin, Grundkiewicz Roman
- Simple Recurrent Units For Highly Parallelizable Recurrence Lei Tao, Zhang Yu, Wang Sida I., Dai Hui, Artzi Yoav
- Task-driven Visual Saliency And Attention-based Visual Question Answering Lin Yuetan, Pang Zhangyang, Wang Donghui, Zhuang Yueting
- Ivqa: Inverse Visual Question Answering Liu Feng, Xiang Tao, Hospedales Timothy M., Yang Wankou, Sun Changyin
- Table-to-text Generation By Structure-aware Seq2seq Learning Liu Tianyu, Wang Kexiang, Sha Lei, Chang Baobao, Sui Zhifang
- Visual Question Answering With Memory-augmented Networks Ma Chao, Shen Chunhua, Dick Anthony, Wu Qi, Wang Peng, Hengel Anton Van Den, Reid Ian
- A Generative Approach To Question Answering Mitra Rajarshee
- Predicting Target Language CCG Supertags Improves Neural Machine Translation Nadejde Maria, Reddy Siva, Sennrich Rico, Dwojak Tomasz, Junczys-dowmunt Marcin, Koehn Philipp, Birch Alexandra
- Visual Reference Resolution Using Attention Memory For Visual Dialog Seo Paul Hongsuck, Lehrmann Andreas, Han Bohyung, Sigal Leonid
- Recurrent And Contextual Models For Visual Question Answering Sharang Abhijit, Lau Eric
- A Unified Query-based Generative Model For Question Generation And Question Answering Song Linfeng, Wang Zhiguo, Hamza Wael
- Tips And Tricks For Visual Question Answering: Learnings From The 2017 Challenge Teney Damien, Anderson Peter, He Xiaodong, Hengel Anton Van Den
- Self-attentive Residual Decoder For Neural Machine Translation Werlen Lesly Miculicich, Pappas Nikolaos, Ram Dhananjay, Popescu-belis Andrei
- Neural Text Generation: A Practical Guide Xie Ziang
- The Forgettable-watcher Model For Video Question Answering Xue Hongyang, Zhao Zhou, Cai Deng
- Deepprobe: Information Directed Sequence Understanding And Chatbot Design Via Recurrent Neural Networks Yin Zi, Chang Keng-hao, Zhang Ruofei
- Chunk-based Bi-scale Decoder For Neural Machine Translation Zhou Hao, Tu Zhaopeng, Huang Shujian, Liu Xiaohua, Li Hang, Chen Jiajun
- A Knowledge-grounded Multimodal Search-based Conversational Agent Agarwal Shubham, Dusek Ondrej, Konstas Ioannis, Rieser Verena
- Redecode Framework For Iterative Improvement In Paraphrase Generation Aggarwal Milan, Kumari Nupur, Bansal Ayush, Krishnamurthy Balaji
- Code2seq: Generating Sequences From Structured Representations Of Code Alon Uri, Brody Shaked, Levy Omer, Yahav Eran
- Multilingual Extractive Reading Comprehension By Runtime Machine Translation Asai Akari, Eriguchi Akiko, Hashimoto Kazuma, Tsuruoka Yoshimasa
- Training Deeper Neural Machine Translation Models With Transparent Attention Bapna Ankur, Chen Mia Xu, Firat Orhan, Cao Yuan, Wu Yonghui
- Commonsense For Generative Multi-hop Question Answering Tasks Bauer Lisa, Wang Yicheng, Bansal Mohit
- LIUM-CVC Submissions For WMT18 Multimodal Translation Task Caglayan Ozan, Bardet Adrien, Bougares Fethi, Barrault Loïc, Wang Kai, Masana Marc, Herranz Luis, Van De Weijer Joost
- Textually Enriched Neural Module Networks For Visual Question Answering Chandu Khyathi Raghavi, Pyreddy Mary Arpita, Felix Matthieu, Joshi Narendra Nath
- The Best Of Both Worlds: Combining Recent Advances In Neural Machine Translation Chen Mia Xu, Firat Orhan, Bapna Ankur, Johnson Melvin, Macherey Wolfgang, Foster George, Jones Llion, Parmar Niki, Schuster Mike, Chen Zhifeng, Wu Yonghui, Hughes Macduff
- Reciprocal Attention Fusion For Visual Question Answering Farazi Moshiur R, Khan Salman H
- SALSA-TEXT: Self Attentive Latent Space Based Adversarial Text Generation Gagnon-marchand Jules, Sadeghi Hamed, Haidar Md. Akmal, Rezagholizadeh Mehdi
- Question-guided Hybrid Convolution For Visual Question Answering Gao Peng, Lu Pan, Li Hongsheng, Li Shuang, Li Yikang, Hoi Steven, Wang Xiaogang
- The Memad Submission To The WMT18 Multimodal Translation Task Grönroos Stig-arne, Huet Benoit, Kurimo Mikko, Laaksonen Jorma, Merialdo Bernard, Pham Phu, Sjöberg Mats, Sulubacak Umut, Tiedemann Jörg, Troncy Raphael, Vázquez Raúl
- BERT: Pre-training Of Deep Bidirectional Transformers For Language Understanding Devlin Jacob, Chang Ming-wei, Lee Kenton, Toutanova Kristina
- Closed-book Training To Improve Summarization Encoder Memory Jiang Yichen, Bansal Mohit
- SNU_IDS At Semeval-2018 Task 12: Sentence Encoder With Contextualized Vectors For Argument Reasoning Comprehension Kim Taeuk, Choi Jihun, Lee Sang-goo
- Neural Machine Translation With Adequacy-oriented Learning Kong Xiang, Tu Zhaopeng, Shi Shuming, Hovy Eduard, Zhang Tong
- Fast And Simple Mixture Of Softmaxes With BPE And Hybrid-lightrnn For Language Generation Kong Xiang, Xie Qizhe, Dai Zihang, Hovy Eduard
- A Question-focused Multi-factor Attention Network For Question Answering Kundu Souvik, Ng Hwee Tou
- Answerer In Questioner's Mind: Information Theoretic Approach To Goal-oriented Visual Dialog Lee Sang-woo, Heo Yu-jung, Zhang Byoung-tak
- Global Encoding For Abstractive Summarization Lin Junyang, Sun Xu, Ma Shuming, Su Qi
- Seq2rdf: An End-to-end Application For Deriving Triples From Natural Language Text Liu Yue, Zhang Tongtao, Liang Zhicheng, Ji Heng, Mcguinness Deborah L.
- Neural Machine Translation With Key-value Memory-augmented Attention Meng Fandong, Tu Zhaopeng, Cheng Yong, Wu Haiyang, Zhai Junjie, Yang Yuekui, Wang Di
- Improved Fusion Of Visual And Language Representations By Dense Symmetric Co-attention For Visual Question Answering Nguyen Duy-kien, Okatani Takayuki
- Phrase-based Attentions Nguyen Phi Xuan, Joty Shafiq
- Polite Dialogue Generation Without Parallel Data Niu Tong, Bansal Mohit
- Recursive Visual Attention In Visual Dialog Niu Yulei, Zhang Hanwang, Zhang Manli, Zhang Jianhong, Lu Zhiwu, Wen Ji-rong
- Dual Recurrent Attention Units For Visual Question Answering Osman Ahmed, Samek Wojciech
- Multimodal Explanations: Justifying Decisions And Pointing To The Evidence Park Dong Huk, Hendricks Lisa Anne, Akata Zeynep, Rohrbach Anna, Schiele Bernt, Darrell Trevor, Rohrbach Marcus
- Dynamic Fusion With Intra- And Inter- Modality Attention Flow For Visual Question Answering Peng Gao, Jiang Zhengkai, You Haoxuan, Lu Pan, Hoi Steven, Wang Xiaogang, Li Hongsheng
- Contextual Parameter Generation For Universal Neural Machine Translation Platanios Emmanouil Antonios, Sachan Mrinmaya, Neubig Graham, Mitchell Tom
- Training Tips For The Transformer Model Popel Martin, Bojar Ondřej
- Cross-lingual Transfer Learning For Multilingual Task Oriented Dialog Schuster Sebastian, Gupta Sonal, Shah Rushin, Lewis Mike
- Adapting Visual Question Answering Models For Enhancing Multimodal Community Q&A Platforms Srivastava Avikalp, Liu Hsin Wen, Fujita Sumio
- The University Of Cambridge's Machine Translation Systems For WMT18 Stahlberg Felix, De Gispert Adria, Byrne Bill
- An Operation Sequence Model For Explainable Neural Machine Translation Stahlberg Felix, Saunders Danielle, Byrne Bill
- Inducing Grammars With And For Neural Machine Translation Tran Ke, Bisk Yonatan
- Can You Tell Me How To Get Past Sesame Street? Sentence-level Pretraining Beyond Language Modeling Wang Alex, Hula Jan, Xia Patrick, Pappagari Raghavendra, Mccoy R. Thomas, Patel Roma, Kim Najoung, Tenney Ian, Huang Yinghui, Yu Katherin, Jin Shuning, Chen Berlin, Van Durme Benjamin, Grave Edouard, Pavlick Ellie, Bowman Samuel R.
- Structured Triplet Learning With Pos-tag Guided Attention For Visual Question Answering Wang Zhe, Liu Xiaoyi, Chen Liangjian, Wang Limin, Qiao Yu, Xie Xiaohui, Fowlkes Charless
- Learning Neural Templates For Text Generation Wiseman Sam, Shieber Stuart M., Rush Alexander M.
- Dual Ask-answer Network For Machine Reading Comprehension Xiao Han, Wang Feng, Yan Jianfeng, Zheng Jingyao
- Qanet: Combining Local Convolution With Global Self-attention For Reading Comprehension Yu Adams Wei, Dohan David, Luong Minh-thang, Zhao Rui, Chen Kai, Norouzi Mohammad, Le Quoc V.
- Medical Exam Question Answering With Large-scale Reading Comprehension Zhang Xiao, Wu Ji, He Zhiyang, Liu Xien, Su Ying
- Phrase Table As Recommendation Memory For Neural Machine Translation Zhao Yang, Wang Yining, Zhang Jiajun, Zong Chengqing
- A Visual Attention Grounding Neural Model For Multimodal Machine Translation Zhou Mingyang, Cheng Runxiang, Lee Yong Jae, Yu Zhou
- Retrieval-enhanced Adversarial Training For Neural Response Generation Zhu Qingfu, Cui Lei, Zhang Weinan, Wei Furu, Liu Ting
- Abductive Reasoning As Self-supervision For Common Sense Question Answering Aakur Sathyanarayanan N., Sarkar Sudeep
- Syntactically Supervised Transformers For Faster Neural Machine Translation Akoury Nader, Krishna Kalpesh, Iyyer Mohit
- Synthetic QA Corpora Generation With Roundtrip Consistency Alberti Chris, Andor Daniel, Pitler Emily, Devlin Jacob, Collins Michael
- Fusion Of Detected Objects In Text For Visual Question Answering Alberti Chris, Ling Jeffrey, Collins Michael, Reitter David
- The Missing Ingredient In Zero-shot Neural Machine Translation Arivazhagan Naveen, Bapna Ankur, Firat Orhan, Aharoni Roee, Johnson Melvin, Macherey Wolfgang
- Whatcha Lookin' At? Deeplifting BERT's Attention In Question Answering Arkhangelskaia Ekaterina, Dutta Sourav
- Keeping Notes: Conditional Natural Language Generation With A Scratchpad Mechanism Benmalek Ryan Y., Khabsa Madian, Desu Suma, Cardie Claire, Banko Michele
- Meta Answering For Machine Reading Borschinger Benjamin, Boyd-graber Jordan, Buck Christian, Bulian Jannis, Ciaramita Massimiliano, Huebscher Michelle Chen, Gajewski Wojciech, Kilcher Yannic, Nogueira Rodrigo, Saralegu Lierni Sestorain
- Language Features Matter: Effective Language Representations For Vision-language Tasks Burns Andrea, Tan Reuben, Saenko Kate, Sclaroff Stan, Plummer Bryan A.
- An Empirical Study Of Generation Order For Machine Translation Chan William, Stern Mitchell, Kiros Jamie, Uszkoreit Jakob
- Making Neural Machine Reading Comprehension Faster Chatterjee Debajyoti
- Distilling Knowledge Learned In BERT For Text Generation Chen Yen-chun, Gan Zhe, Cheng Yu, Liu Jingzhou, Liu Jingjing
- Cross-lingual Natural Language Generation Via Pre-training Chi Zewen, Dong Li, Wei Furu, Wang Wenhui, Mao Xian-ling, Huang Heyan
- On The Use Of BERT For Neural Machine Translation Clinchant Stéphane, Jung Kweon Woo, Nikoulina Vassilina
- Exploring The Limits Of Transfer Learning With A Unified Text-to-text Transformer Raffel Colin, Shazeer Noam, Roberts Adam, Lee Katherine, Narang Sharan, Matena Michael, Zhou Yanqi, Li Wei, Liu Peter J.
- Unified Language Model Pre-training For Natural Language Understanding And Generation Dong Li, Yang Nan, Wang Wenhui, Wei Furu, Liu Xiaodong, Wang Yu, Gao Jianfeng, Zhou Ming, Hon Hsiao-wuen
- Retrosynthesis With Attention-based NMT Model And Chemical Analysis Of The "wrong" Predictions Duan Hongliang, Wang Ling, Zhang Chengyun, Li Jianjun
- Parallel Scheduled Sampling Duckworth Daniel, Neelakantan Arvind, Goodrich Ben, Kaiser Lukasz, Bengio Samy
- Pre-trained Language Model Representations For Language Generation Edunov Sergey, Baevski Alexei, Auli Michael
- ELI5: Long Form Question Answering Fan Angela, Jernite Yacine, Perez Ethan, Grangier David, Weston Jason, Auli Michael
- Heterogeneous Memory Enhanced Multimodal Attention Model For Video Question Answering Fan Chenyou, Zhang Xiaofan, Zhang Shu, Wang Wensheng, Zhang Chi, Huang Heng
- Question-agnostic Attention For Visual Question Answering Farazi Moshiur R, Khan Salman H, Barnes Nick
- Neural Data-to-text Generation: A Comparison Between Pipeline And End-to-end Architectures Ferreira Thiago Castro, Van Der Lee Chris, Van Miltenburg Emiel, Krahmer Emiel
- MRQA 2019 Shared Task: Evaluating Generalization In Reading Comprehension Fisch Adam, Talmor Alon, Jia Robin, Seo Minjoon, Choi Eunsol, Chen Danqi
- TANDA: Transfer And Adapt Pre-trained Transformer Models For Answer Sentence Selection Garg Siddhant, Vu Thuy, Moschitti Alessandro
- Self-attentive Model For Headline Generation Gavrilov Daniil, Kalaidin Pavel, Malykh Valentin
- Generating Natural Language Explanations For Visual Question Answering Using Scene Graphs And Visual Attention Ghosh Shalini, Burachas Giedrius, Ray Arijit, Ziskind Avi
- Span Selection Pre-training For Question Answering Glass Michael, Gliozzo Alfio, Chakravarti Rishav, Ferritto Anthony, Pan Lin, Bhargav G P Shrivatsa, Garg Dinesh, Sil Avirup
- Insertion-based Decoding With Automatically Inferred Generation Order Gu Jiatao, Liu Qi, Cho Kyunghyun
- Reweighted Proximal Pruning For Large-scale Language Representation Guo Fu-ming, Liu Sijia, Mungall Finlay S., Lin Xue, Wang Yanzhi
- Fine-tuning By Curriculum Learning For Non-autoregressive Neural Machine Translation Guo Junliang, Tan Xu, Xu Linli, Qin Tao, Chen Enhong, Liu Tie-yan
- Zero-shot Paraphrase Generation With Multilingual Language Models Guo Yinpeng, Liao Yi, Jiang Xin, Zhang Qing, Zhang Yibo, Liu Qun
- Exploring Neural Net Augmentation To BERT For Question Answering On SQUAD 2.0 Gupta Suhas
- Episodic Memory Reader: Learning What To Remember For Question Answering From Streaming Data Han Moonsu, Kang Minki, Jung Hyunwoo, Hwang Sung Ju
- Resurrecting Submodularity For Neural Text Generation Han Simeng, Lin Xiang, Joty Shafiq
- Exposure Bias Versus Self-recovery: Are Distortions Really Incremental For Autoregressive Text Generation? He Tianxing, Zhang Jingzhao, Zhou Zhiming, Glass James
- Convert: Efficient And Accurate Conversational Representations From Transformers Henderson Matthew, Casanueva Iñigo, Mrkšić Nikola, Su Pei-hao, Wen Tsung-hsien, Vulić Ivan
- Efficient Adaptation Of Pretrained Transformers For Abstractive Summarization Hoang Andrew, Bosselut Antoine, Celikyilmaz Asli, Choi Yejin
- Evaluating Rewards For Question Generation Models Hosking Tom, Riedel Sebastian
- INSET: Sentence Infilling With Inter-sentential Transformer Huang Yichen, Zhang Yizhe, Elachqar Oussama, Cheng Yu
- Designing Dialogue Systems: A Mean, Grumpy, Sarcastic Chatbot In The Browser Ilić Suzana, Nakano Reiichiro, Hajnal Ivo
- Document-level Neural Machine Translation With Associated Memory Network Jiang Shu, Wang Rui, Li Zuchao, Utiyama Masao, Chen Kehai, Sumita Eiichiro, Zhao Hai, Lu Bao-liang
- Tinybert: Distilling BERT For Natural Language Understanding Jiao Xiaoqi, Yin Yichun, Shang Lifeng, Jiang Xin, Chen Xiao, Li Linlin, Wang Fang, Liu Qun
- Amr-to-text Generation With Cache Transition Systems Jin Lisa, Gildea Daniel
- All-in-one Image-grounded Conversational Agents Ju Da, Shuster Kurt, Boureau Y-lan, Weston Jason
- Microsoft Translator At WMT 2019: Towards Large-scale Document-level Neural Machine Translation Junczys-dowmunt Marcin
- Let Me Know What To Ask: Interrogative-word-aware Question Generation Kang Junmo, Roman Haritz Puerto San, Myaeng Sung-hyon
- Weak Supervision Helps Emergence Of Word-object Alignment And Improves Vision-language Tasks Kervadec Corentin, Antipov Grigory, Baccouche Moez, Wolf Christian
- CTRL: A Conditional Transformer Language Model For Controllable Generation Keskar Nitish Shirish, Mccann Bryan, Varshney Lav R., Xiong Caiming, Socher Richard
- Sample Efficient Text Summarization Using A Single Pre-trained Transformer Khandelwal Urvashi, Clark Kevin, Jurafsky Dan, Kaiser Lukasz
- A Case Study: Exploiting Neural Machine Translation To Translate CUDA To OpenCL Kim Yonghae, Kim Hyesoon
- When And Why Is Document-level Context Useful In Neural Machine Translation? Kim Yunsu, Tran Duc Thanh, Ney Hermann
- Learning To Answer By Learning To Ask: Getting The Best Of GPT-2 And BERT Worlds Klein Tassilo, Nabi Moin
- Investigating Multilingual NMT Representations At Scale Kudugunta Sneha Reddy, Bapna Ankur, Caswell Isaac, Arivazhagan Naveen, Firat Orhan
- Patent Claim Generation By Fine-tuning OpenAI GPT-2 Lee Jieh-sheng, Hsiang Jieh
- BART: Denoising Sequence-to-sequence Pre-training For Natural Language Generation, Translation, And Comprehension Lewis Mike, Liu Yinhan, Goyal Naman, Ghazvininejad Marjan, Mohamed Abdelrahman, Levy Omer, Stoyanov Ves, Zettlemoyer Luke
- Bertsel: Answer Selection With Pre-trained Models Li Dongfang, Yu Yifei, Chen Qingcai, Li Xinyu
- Big Bidirectional Insertion Representations For Documents Li Lala, Chan William
- Pretrained Language Models For Document-level Neural Machine Translation Li Liangyou, Jiang Xin, Liu Qun
- Explicit Sentence Compression For Neural Machine Translation Li Zuchao, Wang Rui, Chen Kehai, Utiyama Masao, Sumita Eiichiro, Zhang Zhuosheng, Zhao Hai
- Probing Representations Learned By Multimodal Recurrent And Transformer Models Libovický Jindřich, Madhyastha Pranava
- Commongen: A Constrained Text Generation Challenge For Generative Commonsense Reasoning Lin Bill Yuchen, Zhou Wangchunshu, Shen Ming, Zhou Pei, Bhagavatula Chandra, Choi Yejin, Ren Xiang
- TAB-VCR: Tags And Attributes Based Visual Commonsense Reasoning Baselines Lin Jingxiang, Jain Unnat, Schwing Alexander G.
- Task-oriented Conversation Generation Using Heterogeneous Memory Networks Lin Zehao, Huang Xinjing, Ji Feng, Chen Haiqing, Zhang Ying
- Learning To Generate Questions By Learning What Not To Generate Liu Bang, Zhao Mingjun, Niu Di, Lai Kunfeng, He Yancheng, Wei Haojie, Xu Yu
- Attention-informed Mixed-language Training For Zero-shot Cross-lingual Task-oriented Dialogue Systems Liu Zihan, Winata Genta Indra, Lin Zhaojiang, Xu Peng, Fung Pascale
- Multi-hop Reading Comprehension Via Deep Reinforcement Learning Based Document Traversal Long Alex, Mason Joel, Blair Alan, Wang Wei
- Vilbert: Pretraining Task-agnostic Visiolinguistic Representations For Vision-and-language Tasks Lu Jiasen, Batra Dhruv, Parikh Devi, Lee Stefan
- Good, Better, Best: Textual Distractors Generation For Multiple-choice Visual Question Answering Via Reinforcement Learning Lu Jiaying, Ye Xin, Ren Yi, Yang Yezhou
- Selective Attention For Context-aware Neural Machine Translation Maruf Sameen, Martins André F. T., Haffari Gholamreza
- Structured Pruning Of A BERT-based Question Answering Model Mccarley J. S., Chakravarti Rishav, Sil Avirup
- Self-attentional Models Application In Task-oriented Dialogue Generation Systems Mehrjardi Mansour Saffar, Trabelsi Amine, Zaiane Osmar R.
- Megatron-lm: Training Multi-billion Parameter Language Models Using Model Parallelism Shoeybi Mohammad, Patwary Mostofa, Puri Raul, Legresley Patrick, Casper Jared, Catanzaro Bryan
- Interrogating The Explanatory Power Of Attention In Neural Machine Translation Moradi Pooya, Kambhatla Nishant, Sarkar Anoop
- Neural Assistant: Joint Action Prediction, Response Generation, And Latent Knowledge Reasoning Neelakantan Arvind, Yavuz Semih, Narang Sharan, Prasad Vishaal, Goodrich Ben, Duckworth Daniel, Sankar Chinnadhurai, Yan Xifeng
- A Simple But Effective Method To Incorporate Multi-turn Context With BERT For Conversational Machine Comprehension Ohsugi Yasuhito, Saito Itsumi, Nishida Kyosuke, Asano Hisako, Tomita Junji
- Frustratingly Easy Natural Question Answering Pan Lin, Chakravarti Rishav, Ferritto Anthony, Glass Michael, Gliozzo Alfio, Roukos Salim, Florian Radu, Sil Avirup
- AI Meets Austen: Towards Human-robot Discussions Of Literary Metaphor Parde Natalie, Nielsen Rodney D.
- Stabilizing Transformers For Reinforcement Learning Parisotto Emilio, Song H. Francis, Rae Jack W., Pascanu Razvan, Gulcehre Caglar, Jayakumar Siddhant M., Jaderberg Max, Kaufman Raphael Lopez, Clark Aidan, Noury Seb, Botvinick Matthew M., Heess Nicolas, Hadsell Raia
- A Modular Task-oriented Dialogue System Using A Neural Mixture-of-experts Pei Jiahuan, Ren Pengjie, De Rijke Maarten
- Text Generation With Exemplar-based Adaptive Decoding Peng Hao, Parikh Ankur P., Faruqui Manaal, Dhingra Bhuwan, Das Dipanjan
- Language Models As Knowledge Bases? Petroni Fabio, Rocktäschel Tim, Lewis Patrick, Bakhtin Anton, Wu Yuxiang, Miller Alexander H., Riedel Sebastian
- Answering Complex Open-domain Questions Through Iterative Query Generation Qi Peng, Lin Xiaowen, Mehr Leo, Wang Zijian, Manning Christopher D.
- On Leveraging The Visual Modality For Neural Machine Translation Raunak Vikas, Choe Sang Keun, Lu Quanyang, Xu Yi, Metze Florian
- On Compositionality In Neural Machine Translation Raunak Vikas, Kumar Vaibhav, Metze Florian
- Leveraging Pre-trained Checkpoints For Sequence Generation Tasks Rothe Sascha, Narayan Shashi, Severyn Aliaksei
- A Survey Of Natural Language Generation Techniques With A Focus On Dialogue Systems - Past, Present And Future Directions Santhanam Sashank, Shaikh Samira
- Factor Graph Attention Schwartz Idan, Yu Seunghak, Hazan Tamir, Schwing Alexander
- Do Massively Pretrained Language Models Make Better Storytellers? See Abigail, Pappu Aneesh, Saxena Rohun, Yerukola Akhila, Manning Christopher D.
- Taking A HINT: Leveraging Explanations To Make Vision And Language Models More Grounded Selvaraju Ramprasaath R., Lee Stefan, Shen Yilin, Jin Hongxia, Ghosh Shalini, Heck Larry, Batra Dhruv, Parikh Devi
- Pragmatically Informative Text Generation Shen Sheng, Fried Daniel, Andreas Jacob, Klein Dan
- Flexibly-structured Model For Task-oriented Dialogues Shu Lei, Molino Piero, Namazifar Mahdi, Xu Hu, Liu Bing, Zheng Huaixiu, Tur Gokhan
- Multilingual Question Answering From Formatted Text Applied To Conversational Agents Siblini Wissam, Pasqual Charlotte, Lavielle Axel, Challal Mohamed, Cauchois Cyril
- MASS: Masked Sequence To Sequence Pre-training For Language Generation Song Kaitao, Tan Xu, Qin Tao, Lu Jianfeng, Liu Tie-yan
- Unpaired Cross-lingual Image Caption Generation With Self-supervised Rewards Song Yuqing, Chen Shizhe, Zhao Yida, Jin Qin
- On NMT Search Errors And Model Errors: Cat Got Your Tongue? Stahlberg Felix, Byrne Bill
- Insertion Transformer: Flexible Sequence Generation Via Insertion Operations Stern Mitchell, Chan William, Kiros Jamie, Uszkoreit Jakob
- VL-BERT: Pre-training Of Generic Visual-linguistic Representations Su Weijie, Zhu Xizhou, Cao Yue, Li Bin, Lu Lewei, Wei Furu, Dai Jifeng
- Augmenting Self-attention With Persistent Memory Sukhbaatar Sainbayar, Grave Edouard, Lample Guillaume, Jegou Herve, Joulin Armand
- Generating Diverse Translation By Manipulating Multi-head Attention Sun Zewei, Huang Shujian, Wei Hao-ran, Dai Xin-yu, Chen Jiajun
- Syntax-infused Transformer And BERT Models For Machine Translation And Natural Language Understanding Sundararaman Dhanasekar, Subramanian Vivek, Wang Guoyin, Si Shijing, Shen Dinghan, Wang Dong, Carin Lawrence
- LXMERT: Learning Cross-modality Encoder Representations From Transformers Tan Hao, Bansal Mohit
- Waldorf: Wasteless Language-model Distillation On Reading-comprehension Tian James Yi, Kreuzer Alexander P., Chen Pai-hung, Will Hans-martin
- Repurposing Entailment For Multi-hop Question Answering Tasks Trivedi Harsh, Kwon Heeyoung, Khot Tushar, Sabharwal Ashish, Balasubramanian Niranjan
- Tree-structured Semantic Encoder With Knowledge Sharing For Domain Adaptation In Natural Language Generation Tseng Bo-hsiang, Budzianowski Paweł, Wu Yen-chen, Gašić Milica
- Capturing Greater Context For Question Generation Tuan Luu Anh, Shah Darsh J, Barzilay Regina
- How Does BERT Answer Questions? A Layer-wise Analysis Of Transformer Representations Van Aken Betty, Winter Benjamin, Löser Alexander, Gers Felix A.
- Visualizing Attention In Transformer-based Language Representation Models Vig Jesse
- Do NLP Models Know Numbers? Probing Numeracy In Embeddings Wallace Eric, Wang Yizhong, Li Sujian, Singh Sameer, Gardner Matt
- Do Multi-hop Readers Dream Of Reasoning Chains? Wang Haoyu, Yu Mo, Guo Xiaoxiao, Das Rajarshi, Xiong Wenhan, Gao Tian
- Improving Knowledge-aware Dialogue Generation Via Knowledge Base Question Answering Wang Jian, Liu Junhao, Bi Wei, Liu Xiaojiang, He Kejing, Xu Ruifeng, Yang Min
- Theme-aware Generation Model For Chinese Lyrics Wang Jie, Zhao Xinyan
- Structbert: Incorporating Language Structures Into Pre-training For Deep Language Understanding Wang Wei, Bi Bin, Yan Ming, Wu Chen, Bao Zuyi, Xia Jiangnan, Peng Liwei, Si Luo
- Adapting And Evaluating A Deep Learning Language Model For Clinical Why-question Answering Wen Andrew, Elwazir Mohamed Y., Moon Sungrim, Fan Jungwei
- Improving Neural Machine Translation With Pre-trained Representation Weng Rongxiang, Yu Heng, Huang Shujian, Luo Weihua, Chen Jiajun
- Huggingface's Transformers: State-of-the-art Natural Language Processing Wolf Thomas, Debut Lysandre, Sanh Victor, Chaumond Julien, Delangue Clement, Moi Anthony, Cistac Pierric, Rault Tim, Louf Rémi, Funtowicz Morgan, Davison Joe, Shleifer Sam, Von Platen Patrick, Ma Clara, Jernite Yacine, Plu Julien, Xu Canwen, Scao Teven Le, Gugger Sylvain, Drame Mariama, Lhoest Quentin, Rush Alexander M.
- Transfertransfo: A Transfer Learning Approach For Neural Network Based Conversational Agents Wolf Thomas, Sanh Victor, Chaumond Julien, Delangue Clement
- Learning To Memorize In Neural Task-oriented Dialogue Systems Wu Chien-sheng
- Integrated Triaging For Fast Reading Comprehension Wu Felix, Li Boyi, Wang Lequn, Lao Ni, Blitzer John, Weinberger Kilian Q.
- Transformer-based Cascaded Multimodal Speech Translation Wu Zixiu, Caglayan Ozan, Ive Julia, Wang Josiah, Specia Lucia
- Sharing Attention Weights For Fast Transformer Xiao Tong, Li Yinqiao, Zhu Jingbo, Yu Zhengtao, Liu Tongran
- Visual Entailment: A Novel Task For Fine-grained Image Understanding Xie Ning, Lai Farley, Doran Derek, Kadav Asim
- Towards Making The Most Of BERT In Neural Machine Translation Yang Jiacheng, Wang Mingxuan, Zhou Hao, Zhao Chengqi, Yu Yong, Zhang Weinan, Li Lei
- Data Augmentation For BERT Fine-tuning In Open-domain Question Answering Yang Wei, Xie Yuqing, Tan Luchen, Xiong Kun, Li Ming, Lin Jimmy
- Model Compression With Two-stage Multi-teacher Knowledge Distillation For Web Question Answering System Yang Ze, Shou Linjun, Gong Ming, Lin Wutao, Jiang Daxin
- Deepcopy: Grounded Response Generation With Hierarchical Pointer Networks Yavuz Semih, Rastogi Abhinav, Chao Guan-lin, Hakkani-tur Dilek
- Knowledge-grounded Response Generation With Deep Attentional Latent-variable Model Ye Hao-tong, Lo Kai-ling, Su Shang-yu, Chen Yun-nung
- Bp-transformer: Modelling Long-range Context Via Binary Partitioning Ye Zihao, Guo Qipeng, Gan Quan, Qiu Xipeng, Zhang Zheng
- Multimodal Unified Attention Networks For Vision-and-language Interactions Yu Zhou, Cui Yuhao, Yu Jun, Tao Dacheng, Tian Qi
- PEGASUS: Pre-training With Extracted Gap-sentences For Abstractive Summarization Zhang Jingqing, Zhao Yao, Saleh Mohammad, Liu Peter J.
- Addressing Semantic Drift In Question Generation For Semi-supervised Question Answering Zhang Shiyue, Bansal Mohit
- Bertscore: Evaluating Text Generation With BERT Zhang Tianyi, Kishore Varsha, Wu Felix, Weinberger Kilian Q., Artzi Yoav
- Dialogpt: Large-scale Generative Pre-training For Conversational Response Generation Zhang Yizhe, Sun Siqi, Galley Michel, Chen Yen-chun, Brockett Chris, Gao Xiang, Gao Jianfeng, Liu Jingjing, Dolan Bill
- Semantics-aware BERT For Language Understanding Zhang Zhuosheng, Wu Yuwei, Zhao Hai, Li Zuchao, Zhang Shuailiang, Zhou Xi, Zhou Xiang
- MUSE: Parallel Multi-scale Attention For Sequence To Sequence Learning Zhao Guangxiang, Sun Xu, Xu Jingjing, Zhang Zhiyuan, Luo Liangchen
- A Pre-training Based Personalized Dialogue Generation Model With Persona-sparse Data Zheng Yinhe, Zhang Rongsheng, Mao Xiaoxi, Huang Minlie
- Synchronous Bidirectional Neural Machine Translation Zhou Long, Zhang Jiajun, Zong Chengqing
- Unified Vision-language Pre-training For Image Captioning And VQA Zhou Luowei, Palangi Hamid, Zhang Lei, Hu Houdong, Corso Jason J., Gao Jianfeng
- Using Interlinear Glosses As Pivot In Low-resource Multilingual Machine Translation Zhou Zhong, Levin Lori, Mortensen David R., Waibel Alex
- Encoder-agnostic Adaptation For Conditional Language Generation Ziegler Zachary M., Melas-kyriazi Luke, Gehrmann Sebastian, Rush Alexander M.
- Paranoid Transformer: Reading Narrative Of Madness As Computational Approach To Creativity Agafonova Yana, Tikhonov Alexey, Yamshchikov Ivan P.
- Accelerating Natural Language Understanding In Task-oriented Dialog Ahuja Ojas, Desai Shrey
- A Study On Multimodal And Interactive Explanations For Visual Question Answering Alipour Kamran, Schulze Jurgen P., Yao Yi, Ziskind Avi, Burachas Giedrius
- Optimizing Transformer For Low-resource Neural Machine Translation Araabi Ali, Monz Christof
- Stress Test Evaluation Of Transformer-based Models In Natural Language Understanding Tasks Aspillaga Carlos, Carvallo Andrés, Araujo Vladimir
- Fluent Response Generation For Conversational Question Answering Baheti Ashutosh, Ritter Alan, Small Kevin
- Character-level Transformer-based Neural Machine Translation Banar Nikolay, Daelemans Walter, Kestemont Mike
- Co-attentional Transformers For Story-based Video Understanding Bebensee Björn, Zhang Byoung-tak
- Longformer: The Long-document Transformer Beltagy Iz, Peters Matthew E., Cohan Arman
- PALM: Pre-training An Autoencoding&autoregressive Language Model For Context-conditioned Generation Bi Bin, Li Chenliang, Wu Chen, Yan Ming, Wang Wei, Huang Songfang, Huang Fei, Si Luo
- Advances Of Transformer-based Models For News Headline Generation Bukhtiyarov Alexey, Gusev Ilya
- DIET: Lightweight Language Understanding For Dialogue Systems Bunk Tanja, Varshneya Daksh, Vlasov Vladimir, Nichol Alan
- Memory Transformer Burtsev Mikhail S., Kuratov Yuri, Peganov Anton, Sapunov Grigory V.
- Pretrained Language Models For Dialogue Generation With Multiple Input Sources Cao Yu, Bi Wei, Fang Meng, Tao Dacheng
- Syntax-driven Iterative Expansion Language Models For Controllable Text Generation Casas Noe, Fonollosa José A. R., Costa-jussà Marta R.
- Exploring Fluent Query Reformulations With Text-to-text Transformers And Reinforcement Learning Chen Jerry Zikun, Yu Shi, Wang Haoran
- Dipair: Fast And Accurate Distillation For Trillion-scale Text Matching And Pair Modeling Chen Jiecao, Yang Liu, Raman Karthik, Bendersky Michael, Yeh Jung-jung, Zhou Yun, Najork Marc, Cai Danyang, Emadzadeh Ehsan
- Accurate Word Alignment Induction From Neural Machine Translation Chen Yun, Liu Yang, Chen Guanhua, Jiang Xin, Liu Qun
- X-LXMERT: Paint, Caption And Answer Questions With Multi-modal Transformers Cho Jaemin, Lu Jiasen, Schwenk Dustin, Hajishirzi Hannaneh, Kembhavi Aniruddha
- Rethinking Embedding Coupling In Pre-trained Language Models Chung Hyung Won, Févry Thibault, Tsai Henry, Johnson Melvin, Ruder Sebastian
- Pymt5: Multi-mode Translation Of Natural Language And Python Code With Transformers Clement Colin B., Drain Dawn, Timcheck Jonathan, Svyatkovskiy Alexey, Sundaresan Neel
- Unsupervised Explanation Generation For Machine Reading Comprehension Cui Yiming, Liu Ting, Wang Shijin, Hu Guoping
- Probing Emergent Semantics In Predictive Agents Via Question Answering Das Abhishek, Carnevale Federico, Merzic Hamza, Rimell Laura, Schneider Rosalia, Abramson Josh, Hung Alden, Ahuja Arun, Clark Stephen, Wayne Gregory, Hill Felix
- Residual Energy-based Models For Text Generation Deng Yuntian, Bakhtin Anton, Ott Myle, Szlam Arthur, Ranzato Marc'aurelio
- Cascaded Text Generation With Markov Transformers Deng Yuntian, Rush Alexander M.
- Ernie-doc: A Retrospective Long-document Modeling Transformer Ding Siyu, Shang Junyuan, Wang Shuohuan, Sun Yu, Tian Hao, Wu Hua, Wang Haifeng
- Gshard: Scaling Giant Models With Conditional Computation And Automatic Sharding Lepikhin Dmitry, Lee Hyoukjoong, Xu Yuanzhong, Chen Dehao, Firat Orhan, Huang Yanping, Krikun Maxim, Shazeer Noam, Chen Zhifeng
- Injecting Entity Types Into Entity-guided Text Generation Dong Xiangyu, Yu Wenhao, Zhu Chenguang, Jiang Meng
- Training Multilingual Machine Translation By Alternately Freezing Language-specific Encoders-decoders Escolano Carlos, Costa-jussà Marta R., Fonollosa José A. R., Artetxe Mikel
- A Comparison Of LSTM And BERT For Small Corpus Ezen-can Aysu
- The Unreasonable Volatility Of Neural Machine Translation Models Fadaee Marzieh, Monz Christof
- CERT: Contrastive Self-supervised Learning For Language Understanding Fang Hongchao, Wang Sicheng, Zhou Meng, Ding Jiayuan, Xie Pengtao
- Attention Guided Semantic Relationship Parsing For Visual Question Answering Farazi Moshiur, Khan Salman, Barnes Nick
- Compressing Large-scale Transformer-based Models: A Case Study On BERT Ganesh Prakhar, Chen Yao, Lou Xin, Khan Mohammad Ali, Yang Yin, Sajjad Hassan, Nakov Preslav, Chen Deming, Winslett Marianne
- Robust Conversational AI With Grounded Text Generation Gao Jianfeng, Peng Baolin, Li Chunyuan, Li Jinchao, Shayandeh Shahin, Liden Lars, Shum Heung-yeung
- Relevance Transformer: Generating Concise Code Snippets With Relevance Feedback Gemmell Carlos, Rossetto Federico, Dalton Jeffrey
- IQ-VQA: Intelligent Visual Question Answering Goel Vatsal, Chandak Mohit, Anand Ashish, Guha Prithwijit
- ISAAQ -- Mastering Textbook Questions With Pre-trained Transformers And Bottom-up And Top-down Attention Gomez-perez Jose Manuel, Ortega Raul
- Teaforn: Teacher-forcing With N-grams Goodman Sebastian, Ding Nan, Soricut Radu
- Contextualize Knowledge Bases With Transformer For End-to-end Task-oriented Dialogue Systems Gou Yanjie, Lei Yinjie, Liu Lingqiao, Dai Yong, Shen Chunxu
- A Simple Yet Strong Pipeline For Hotpotqa Groeneveld Dirk, Khot Tushar, Mausam, Sabharwal Ashish
- Token-level Adaptive Training For Neural Machine Translation Gu Shuhao, Zhang Jinchao, Meng Fandong, Feng Yang, Xie Wanying, Zhou Jie, Yu Dong
- Incorporating BERT Into Parallel Sequence Decoding With Adapters Guo Junliang, Zhang Zhirui, Xu Linli, Wei Hao-ran, Chen Boxing, Chen Enhong
- Conversational Machine Comprehension: A Literature Review Gupta Somil, Rawat Bhanu Pratap Singh, Yu Hong
- ECONET: Effective Continual Pretraining Of Language Models For Event Temporal Reasoning Han Rujun, Ren Xiang, Peng Nanyun
- Exploring Explainable Selection To Control Abstractive Summarization Wang Haonan, Gao Yang, Bai Yu, Lapata Mirella, Huang Heyan
- Enriched Pre-trained Transformers For Joint Slot Filling And Intent Detection Hardalov Momchil, Koychev Ivan, Nakov Preslav
- Guided Transformer: Leveraging Multiple External Sources For Representation Learning In Conversational Search Hashemi Helia, Zamani Hamed, Croft W. Bruce
- Policy-driven Neural Response Generation For Knowledge-grounded Dialogue Systems Hedayatnia Behnam, Gopalakrishnan Karthik, Kim Seokhwan, Liu Yang, Eric Mihail, Hakkani-tur Dilek
- PAIR: Planning And Iterative Refinement In Pre-trained Transformers For Long Text Generation Hua Xinyu, Wang Lu
- Less Is More: Data-efficient Complex Question Answering Over Knowledge Bases Hua Yuncheng, Li Yuan-fang, Qi Guilin, Wu Wei, Zhang Jingyao, Qi Daiqing
- TRANS-BLSTM: Transformer With Bidirectional LSTM For Language Understanding Huang Zhiheng, Xu Peng, Liang Davis, Mishra Ajay, Xiang Bing
- Exploring BERT Parameter Efficiency On The Stanford Question Answering Dataset V2.0 Hulburd Eric
- Contrastive Code Representation Learning Jain Paras, Jain Ajay, Zhang Tianjun, Abbeel Pieter, Gonzalez Joseph E., Stoica Ion
- Are Natural Language Inference Models Imppressive? Learning Implicature And Presupposition Jeretic Paloma, Warstadt Alex, Bhooshan Suvrat, Williams Adina
- In Defense Of Grid Features For Visual Question Answering Jiang Huaizu, Misra Ishan, Rohrbach Marcus, Learned-miller Erik, Chen Xinlei
- How Can We Know When Language Models Know? On The Calibration Of Language Models For Question Answering Jiang Zhengbao, Araki Jun, Ding Haibo, Neubig Graham
- Text-to-text Pre-training For Data-to-text Tasks Kale Mihir, Rastogi Abhinav
- Meta-context Transformers For Domain-specific Response Generation Kar Debanjana, Samanta Suranjana, Azad Amar Prakash
- Do Fine-tuned Commonsense Language Models Really Generalize? Kejriwal Mayank, Shen Ke
- Unsupervised Bitext Mining And Translation Via Self-trained Contextual Embeddings Keung Phillip, Salazar Julian, Lu Yichao, Smith Noah A.
- Nearest Neighbor Machine Translation Khandelwal Urvashi, Fan Angela, Jurafsky Dan, Zettlemoyer Luke, Lewis Mike
- Current Limitations Of Language Models: What You Need Is Retrieval Komatsuzaki Aran
- Reducing Language Biases In Visual Question Answering With Visually-grounded Question Encoder Kv Gouthaman, Mittal Anurag
- Fid-ex: Improving Sequence-to-sequence Models For Extractive Rationale Generation Lakhotia Kushal, Paranjape Bhargavi, Ghoshal Asish, Yih Wen-tau, Mehdad Yashar, Iyer Srinivasan
- PONE: A Novel Automatic Evaluation Metric For Open-domain Generative Dialogue Systems Lan Tian, Mao Xian-ling, Wei Wei, Gao Xiaoyan, Huang Heyan
- Utilizing Bidirectional Encoder Representations From Transformers For Answer Selection Laskar Md Tahmid Rahman, Hoque Enamul, Huang Jimmy Xiangji
- Context-based Transformer Models For Answer Sentence Selection Lauriola Ivano, Moschitti Alessandro
- Transformers To Learn Hierarchical Contexts In Multiparty Dialogue For Span-based Question Answering Li Changmao, Choi Jinho D.
- An Empirical Investigation Of Pre-trained Transformer Language Models For Open-domain Dialogue Generation Li Piji
- TMT: A Transformer-based Modal Translator For Improving Multimodal Sequence Representations In Audio Visual Scene-aware Dialog Li Wubo, Jiang Dongwei, Zou Wei, Li Xiangang
- Multilingual Speech Translation With Efficient Finetuning Of Pretrained Models Li Xian, Wang Changhan, Tang Yun, Tran Chau, Tang Yuqing, Pino Juan, Baevski Alexei, Conneau Alexis, Auli Michael
- Interpretable NLG For Task-oriented Dialogue Systems With Heterogeneous Rendering Machines Li Yangming, Yao Kaisheng
- Rewriter-evaluator Architecture For Neural Machine Translation Li Yangming, Yao Kaisheng
- Neural Machine Translation With Joint Representation Li Yanyang, Wang Qiang, Xiao Tong, Liu Tongran, Zhu Jingbo
- A Comparison Of Pre-trained Vision-and-language Models For Multimodal Representation Learning Across Medical Images And Reports Li Yikuan, Wang Hanyin, Luo Yuan
- Train Large, Then Compress: Rethinking Model Size For Efficient Training And Inference Of Transformers Li Zhuohan, Wallace Eric, Shen Sheng, Lin Kevin, Keutzer Kurt, Klein Dan, Gonzalez Joseph E.
- LRTA: A Transparent Neural-symbolic Reasoning Framework With Modular Supervision For Visual Question Answering Liang Weixin, Niu Feiyang, Reganti Aishwarya, Thattai Govind, Tur Gokhan
- XGLUE: A New Benchmark Dataset For Cross-lingual Pre-training, Understanding And Generation Liang Yaobo, Duan Nan, Gong Yeyun, Wu Ning, Guo Fenfei, Qi Weizhen, Gong Ming, Shou Linjun, Jiang Daxin, Cao Guihong, Fan Xiaodong, Zhang Ruofei, Agrawal Rahul, Cui Edward, Wei Sining, Bharti Taroon, Qiao Ying, Chen Jiun-hung, Wu Winnie, Liu Shuguang, Yang Fan, Campos Daniel, Majumder Rangan, Zhou Ming
- Probabilistically Masked Language Model Capable Of Autoregressive Generation In Arbitrary Word Order Liao Yi, Jiang Xin, Liu Qun
- Autoregressive Knowledge Distillation Through Imitation Learning Lin Alexander, Wohlwend Jeremy, Chen Howard, Lei Tao
- Readonce Transformers: Reusable Representations Of Text For Transformers Lin Shih-ting, Sabharwal Ashish, Khot Tushar
- Mintl: Minimalist Transfer Learning For Task-oriented Dialogue Systems Lin Zhaojiang, Madotto Andrea, Winata Genta Indra, Fung Pascale
- Variational Transformers For Diverse Response Generation Lin Zhaojiang, Winata Genta Indra, Xu Peng, Liu Zihan, Fung Pascale
- Improving Zero-shot Translation By Disentangling Positional Information Liu Danni, Niehues Jan, Cross James, Guzmán Francisco, Li Xian
- Rikinet: Reading Wikipedia Pages For Natural Question Answering Liu Dayiheng, Gong Yeyun, Fu Jie, Yan Yu, Chen Jiusheng, Jiang Daxin, Lv Jiancheng, Duan Nan
- Task-level Curriculum Learning For Non-autoregressive Neural Machine Translation Liu Jinglin, Ren Yi, Tan Xu, Zhang Chen, Qin Tao, Zhao Zhou, Liu Tie-yan
- Very Deep Transformers For Neural Machine Translation Liu Xiaodong, Duh Kevin, Liu Liyuan, Gao Jianfeng
- KG-BART: Knowledge Graph-augmented BART For Generative Commonsense Reasoning Liu Ye, Wan Yao, He Lifang, Peng Hao, Yu Philip S.
- Lite Training Strategies For Portuguese-english And English-portuguese Translation Lopes Alexandre, Nogueira Rodrigo, Lotufo Roberto, Pedrini Helio
- VECO: Variable And Flexible Cross-lingual Pre-training For Language Understanding And Generation Luo Fuli, Wang Wei, Liu Jiahao, Liu Yijia, Bi Bin, Huang Songfang, Huang Fei, Si Luo
- Charbert: Character-aware Pre-trained Language Model Ma Wentao, Cui Yiming, Si Chenglei, Liu Ting, Wang Shijin, Hu Guoping
- Language Models As Few-shot Learner For Task-oriented Dialogue Systems Madotto Andrea, Liu Zihan, Lin Zhaojiang, Fung Pascale
- Gpt-too: A Language-model-first Approach For Amr-to-text Generation Mager Manuel, Astudillo Ramon Fernandez, Naseem Tahira, Sultan Md Arafat, Lee Young-suk, Florian Radu, Roukos Salim
- Capturing Document Context Inside Sentence-level Neural Machine Translation Models With Self-training Mansimov Elman, Melis Gábor, Yu Lei
- Natural Language Rationales With Full-stack Visual Reasoning: From Pixels To Semantic Frames To Commonsense Graphs Marasović Ana, Bhagavatula Chandra, Park Jae Sung, Bras Ronan Le, Smith Noah A., Choi Yejin
- Dialoglue: A Natural Language Understanding Benchmark For Task-oriented Dialogue Mehri Shikib, Eric Mihail, Hakkani-tur Dilek
- Delight: Deep And Light-weight Transformer Mehta Sachin, Ghazvininejad Marjan, Iyer Srinivasan, Zettlemoyer Luke, Hajishirzi Hannaneh
- Generation-distillation For Efficient Natural Language Understanding In Low-data Settings Melas-kyriazi Luke, Han George, Liang Celine
- Wechat Neural Machine Translation Systems For WMT20 Meng Fandong, Yan Jianhao, Liu Yijin, Gao Yuan, Zeng Xianfeng, Zeng Qinsong, Li Peng, Chen Ming, Zhou Jie, Liu Sifan, Zhou Hao
- ADER: Adaptively Distilled Exemplar Replay Towards Continual Learning For Session-based Recommendation Mi Fei, Lin Xiaoyu, Faltings Boi
- Modern Methods For Text Generation Montesinos Dimas Munoz
- NABU -- Multilingual Graph-based Neural RDF Verbalizer Moussallem Diego, Gnaneshwar Dwaraknath, Ferreira Thiago Castro, Ngomo Axel-cyrille Ngonga
- Stepwise Extractive Summarization And Planning With Structured Transformers Narayan Shashi, Maynez Joshua, Adamek Jakub, Pighin Daniele, Bratanič Blaž, Mcdonald Ryan
- I Like Fish, Especially Dolphins: Addressing Contradictions In Dialogue Modeling Nie Yixin, Williamson Mary, Bansal Mohit, Kiela Douwe, Weston Jason
- Better Distractions: Transformer-based Distractor Generation And Multiple Choice Question Filtering Offerijns Jeroen, Verberne Suzan, Verhoef Tessa
- Facts2story: Controlling Text Generation By Key Facts Orbach Eyal, Goldberg Yoav
- On Task-level Dialogue Composition Of Generative Transformer Model Parthasarathi Prasanna, Neelakantan Arvind, Narang Sharan
- MAD-X: An Adapter-based Framework For Multi-task Cross-lingual Transfer Pfeiffer Jonas, Vulić Ivan, Gurevych Iryna, Ruder Sebastian
- Sparsifying Transformer Models With Trainable Representation Pooling Pietruszka Michał, Borchmann Łukasz, Garncarek Łukasz
- Intermediate-task Transfer Learning With Pretrained Models For Natural Language Understanding: When And Why Does It Work? Pruksachatkun Yada, Phang Jason, Liu Haokun, Htut Phu Mon, Zhang Xiaoyi, Pang Richard Yuanzhe, Vania Clara, Kann Katharina, Bowman Samuel R.
- BANG: Bridging Autoregressive And Non-autoregressive Generation With Large Scale Pretraining Qi Weizhen, Gong Yeyun, Jiao Jian, Yan Yu, Chen Weizhu, Liu Dayiheng, Tang Kewen, Li Houqiang, Chen Jiusheng, Zhang Ruofei, Zhou Ming, Duan Nan
- Prophetnet: Predicting Future N-gram For Sequence-to-sequence Pre-training Qi Weizhen, Yan Yu, Gong Yeyun, Liu Dayiheng, Duan Nan, Chen Jiusheng, Zhang Ruofei, Zhou Ming
- Nlpgym -- A Toolkit For Evaluating RL Agents On Natural Language Processing Tasks Ramamurthy Rajkumar, Sifa Rafet, Bauckhage Christian
- Investigating Pretrained Language Models For Graph-to-text Generation Ribeiro Leonardo F. R., Schmitt Martin, Schütze Hinrich, Gurevych Iryna
- Personalized Query Rewriting In Conversational AI Agents Roshan-ghias Alireza, Mathialagan Clint Solomon, Ponnusamy Pragaash, Mathias Lambert, Guo Chenlei
- Lareqa: Language-agnostic Answer Retrieval From A Multilingual Pool Roy Uma, Constant Noah, Al-rfou Rami, Barua Aditya, Phillips Aaron, Yang Yinfei
- Fine-tuning BERT For Schema-guided Zero-shot Dialogue State Tracking Ruan Yu-ping, Ling Zhen-hua, Gu Jia-chen, Liu Quan
- Hierarchical Transformer For Task Oriented Dialog Systems Santra Bishal, Anusha Potnuru, Goyal Pawan
- It's Not Just Size That Matters: Small Language Models Are Also Few-shot Learners Schick Timo, Schütze Hinrich
- What BERT Sees: Cross-modal Transfer For Visual Question Generation Scialom Thomas, Bordes Patrick, Dray Paul-alexis, Staiano Jacopo, Gallinari Patrick
- BLEURT: Learning Robust Metrics For Text Generation Sellam Thibault, Das Dipanjan, Parikh Ankur P.
- Reasoning Over History: Context Aware Visual Dialog Shah Muhammad A., Mehri Shikib, Srinivasan Tejas
- End-to-end Synthetic Data Generation For Domain Adaptation Of Question Answering Systems Shakeri Siamak, Santos Cicero Nogueira Dos, Zhu Henry, Ng Patrick, Nan Feng, Wang Zhiguo, Nallapati Ramesh, Xiang Bing
- Data-efficient Methods For Dialogue Systems Shalyminov Igor
- Hybrid Generative-retrieval Transformers For Dialogue Domain Adaptation Shalyminov Igor, Sordoni Alessandro, Atkinson Adam, Schulz Hannes
- A Simple But Tough-to-beat Data Augmentation Approach For Natural Language Understanding And Generation Shen Dinghan, Zheng Mingzhi, Shen Yelong, Qu Yanru, Chen Weizhu
- Neural Data-to-text Generation Via Jointly Learning The Segmentation And Correspondence Shen Xiaoyu, Chang Ernie, Su Hui, Zhou Jie, Klakow Dietrich
- Pre-trained Summarization Distillation Shleifer Sam, Rush Alexander M.
- Saying No Is An Art: Contextualized Fallback Responses For Unanswerable Dialogue Queries Shrivastava Ashish, Dhole Kaustubh, Bhatt Abhinav, Raghunath Sharvani
- Unnatural Language Inference Sinha Koustuv, Parthasarathi Prasanna, Pineau Joelle, Williams Adina
- Neural Machine Translation With Error Correction Song Kaitao, Tan Xu, Lu Jianfeng
- Recipes For Adapting Pre-trained Monolingual And Multilingual Models To Machine Translation Stickland Asa Cooper, Li Xian, Ghazvininejad Marjan
- Benchmarking Machine Reading Comprehension: A Psychological Perspective Sugawara Saku, Stenetorp Pontus, Aizawa Akiko
- Intellicode Compose: Code Generation Using Transformer Svyatkovskiy Alexey, Deng Shao Kun, Fu Shengyu, Sundaresan Neel
- Interpretable Neural Computation For Real-world Compositional Visual Question Answering Tang Ruixue, Ma Chao
- Synthesizer: Rethinking Self-attention In Transformer Models Tay Yi, Bahri Dara, Metzler Donald, Juan Da-cheng, Zhao Zhe, Zheng Che
- The Language Interpretability Tool: Extensible, Interactive Visualizations And Analysis For NLP Models Tenney Ian, Wexler James, Bastings Jasmijn, Bolukbasi Tolga, Coenen Andy, Gehrmann Sebastian, Jiang Ellen, Pushkarna Mahima, Radebaugh Carey, Reif Emily, Yuan Ann
- On Optimal Transformer Depth For Low-resource Language Translation Van Biljon Elan, Pretorius Arnu, Kreutzer Julia
- Unsupervised Pretraining For Neural Machine Translation Using Elastic Weight Consolidation Variš Dušan, Bojar Ondřej
- Multi-task Learning With Multi-head Attention For Multi-choice Reading Comprehension Wan Hui
- HAT: Hardware-aware Transformers For Efficient Natural Language Processing Wang Hanrui, Wu Zhanghao, Liu Zhijian, Cai Han, Zhu Ligeng, Gan Chuang, Han Song
- SRQA: Synthetic Reader For Factoid Question Answering Wang Jiuniu, Xu Wenjia, Fu Xingyu, Wei Yang, Jin Li, Chen Ziyan, Xu Guangluan, Wu Yirong
- Minilm: Deep Self-attention Distillation For Task-agnostic Compression Of Pre-trained Transformers Wang Wenhui, Wei Furu, Dong Li, Bao Hangbo, Yang Nan, Zhou Ming
- Lightseq: A High Performance Inference Library For Transformers Wang Xiaohui, Xiong Ying, Wei Yang, Wang Mingxuan, Li Lei
- Towards Faithful Neural Table-to-text Generation With Content-matching Constraints Wang Zhenyi, Wang Xiaoyang, An Bang, Yu Dong, Chen Changyou
- On Modality Bias In The TVQA Dataset Winterbottom Thomas, Xiao Sarah, Mclean Alistair, Moubayed Noura Al
- Contextual Neural Machine Translation Improves Translation Of Cataphoric Pronouns Wong Kayyen, Maruf Sameen, Haffari Gholamreza
- Cort: Complementary Rankings From Transformers Wrzalik Marco, Krechel Dirk
- TOD-BERT: Pre-trained Natural Language Understanding For Task-oriented Dialogue Wu Chien-sheng, Hoi Steven, Socher Richard, Xiong Caiming
- Lite Transformer With Long-short Range Attention Wu Zhanghao, Liu Zhijian, Lin Ji, Lin Yujun, Han Song
- CG-BERT: Conditional Text Generation With BERT For Generalized Few-shot Intent Detection Xia Congying, Zhang Chenwei, Nguyen Hoang, Zhang Jiawei, Yu Philip
- ERNIE-GEN: An Enhanced Multi-flow Pre-training And Fine-tuning Framework For Natural Language Generation Xiao Dongling, Zhang Han, Li Yukun, Sun Yu, Tian Hao, Wu Hua, Wang Haifeng
- Generating Informative Dialogue Responses With Keywords-guided Networks Xu Heng-da, Mao Xian-ling, Chi Zewen, Zhu Jing-jing, Sun Fanshu, Huang Heyan
- EDITOR: An Edit-based Transformer With Repositioning For Neural Machine Translation With Soft Lexical Constraints Xu Weijia, Carpuat Marine
- MC-BERT: Efficient Language Pre-training Via A Meta Controller Xu Zhenhui, Gong Linyuan, Ke Guolin, He Di, Zheng Shuxin, Wang Liwei, Bian Jiang, Liu Tie-yan
- Multi-unit Transformers For Neural Machine Translation Yan Jianhao, Meng Fandong, Zhou Jie
- Just Ask: Learning To Answer Questions From Millions Of Narrated Videos Yang Antoine, Miech Antoine, Sivic Josef, Laptev Ivan, Schmid Cordelia
- On The Sub-layer Functionalities Of Transformer Decoder Yang Yilin, Wang Longyue, Shi Shuming, Tadepalli Prasad, Lee Stefan, Tu Zhaopeng
- Contrastive Triple Extraction With Generative Transformer Ye Hongbin, Zhang Ningyu, Deng Shumin, Chen Mosha, Tan Chuanqi, Huang Fei, Chen Huajun
- Hard-coded Gaussian Attention For Neural Machine Translation You Weiqiu, Sun Simeng, Iyyer Mohit
- Cocolm: Complex Commonsense Enhanced Language Model With Discourse Relations Yu Changlong, Zhang Hongming, Song Yangqiu, Ng Wilfred
- Few-shot Generative Conversational Query Rewriting Yu Shi, Liu Jiahua, Yang Jingqin, Xiong Chenyan, Bennett Paul, Gao Jianfeng, Liu Zhiyuan
- Clinical Reading Comprehension: A Thorough Analysis Of The Emrqa Dataset Yue Xiang, Gutierrez Bernal Jimenez, Sun Huan
- GOBO: Quantizing Attention-based NLP Models For Low Latency And Energy Efficient Inference Zadeh Ali Hadi, Edo Isak, Awad Omar Mohamed, Moshovos Andreas
- Big Bird: Transformers For Longer Sequences Zaheer Manzil, Guruganesh Guru, Dubey Avinava, Ainslie Joshua, Alberti Chris, Ontanon Santiago, Pham Philip, Ravula Anirudh, Wang Qifan, Yang Li, Ahmed Amr
- Token Drop Mechanism For Neural Machine Translation Zhang Huaao, Qiu Shigui, Duan Xiangyu, Zhang Min
- Improving Adversarial Text Generation By Modeling The Distant Future Zhang Ruiyi, Chen Changyou, Gan Zhe, Wang Wenlin, Shen Dinghan, Wang Guoyin, Wen Zheng, Carin Lawrence
- Trojaning Language Models For Fun And Profit Zhang Xinyang, Zhang Zheng, Ji Shouling, Wang Ting
- POINTER: Constrained Progressive Text Generation Via Insertion-based Generative Pre-training Zhang Yizhe, Wang Guoyin, Li Chunyuan, Gan Zhe, Brockett Chris, Dolan Bill
- BERT-JAM: Boosting Bert-enhanced Neural Machine Translation With Joint Attention Zhang Zhebin, Wu Sai, Jiang Dawei, Chen Gang
- Know What You Don't Need: Single-shot Meta-pruning For Attention Heads Zhang Zhengyan, Qi Fanchao, Liu Zhiyuan, Liu Qun, Sun Maosong
- Stylized Dialogue Response Generation Using Stylized Unpaired Texts Zheng Yinhe, Chen Zikai, Zhang Rongsheng, Huang Shilei, Mao Xiaoxi, Huang Minlie
- Towards Making The Most Of Context In Neural Machine Translation Zheng Zaixiang, Yue Xiang, Huang Shujian, Chen Jiajun, Birch Alexandra
- Improving Non-autoregressive Neural Machine Translation With Monolingual Data Zhou Jiawei, Keung Phillip
- Pre-training Text-to-text Transformers For Concept-centric Common Sense Zhou Wangchunshu, Lee Dong-ho, Selvam Ravi Kiran, Lee Seyeon, Lin Bill Yuchen, Ren Xiang
- Overcoming Language Priors With Self-supervised Learning For Visual Question Answering Zhu Xi, Mao Zhendong, Liu Chunxiao, Zhang Peng, Wang Bin, Zhang Yongdong
- Goal-directed Story Generation: Augmenting Generative Language Models With Reinforcement Learning Alabdulkarim Amal, Li Winston, Martin Lara J., Riedl Mark O.
- One Question Answering Model For Many Languages With Cross-lingual Dense Passage Retrieval Asai Akari, Yu Xinyan, Kasai Jungo, Hajishirzi Hannaneh
- See, Hear, Read: Leveraging Multimodality With Guided Attention For Abstractive Text Summarization Atri Yash Kumar, Pramanick Shraman, Goyal Vikram, Chakraborty Tanmoy
- Self-supervised Test-time Learning For Reading Comprehension Banerjee Pratyay, Gokhale Tejas, Baral Chitta
- G-transformer For Document-level Machine Translation Bao Guangsheng, Zhang Yue, Teng Zhiyang, Chen Boxing, Luo Weihua
- S2s-ft: Fine-tuning Pretrained Transformer Encoders For Sequence-to-sequence Learning Bao Hangbo, Dong Li, Wang Wenhui, Yang Nan, Wei Furu
- Non-autoregressive Translation By Learning Target Categorical Codes Bao Yu, Huang Shujian, Xiao Tong, Wang Dongqi, Dai Xinyu, Chen Jiajun
- Exploring Unsupervised Pretraining Objectives For Machine Translation Baziotis Christos, Titov Ivan, Birch Alexandra, Haddow Barry
- Efficient Inference For Multilingual Neural Machine Translation Berard Alexandre, Lee Dain, Clinchant Stéphane, Jung Kweonwoo, Nikoulina Vassilina
- On The Universality Of Deep Contextual Language Models Bhatt Shaily, Goyal Poonam, Dandapat Sandipan, Choudhury Monojit, Sitaram Sunayana
- Leveraging Redundancy In Attention With Reuse Transformers Bhojanapalli Srinadh, Chakrabarti Ayan, Veit Andreas, Lukasik Michal, Jain Himanshu, Liu Frederick, Chang Yin-wen, Kumar Sanjiv
- Improving Language Models By Retrieving From Trillions Of Tokens Borgeaud Sebastian, Mensch Arthur, Hoffmann Jordan, Cai Trevor, Rutherford Eliza, Millican Katie, Driessche George Van Den, Lespiau Jean-baptiste, Damoc Bogdan, Clark Aidan, Casas Diego De Las, Guy Aurelia, Menick Jacob, Ring Roman, Hennigan Tom, Huang Saffron, Maggiore Loren, Jones Chris, Cassirer Albin, Brock Andy, Paganini Michela, Irving Geoffrey, Vinyals Oriol, Osindero Simon, Simonyan Karen, Rae Jack W., Elsen Erich, Sifre Laurent
- What Changes Can Large-scale Language Models Bring? Intensive Study On Hyperclova: Billions-scale Korean Generative Pretrained Transformers Kim Boseop, Kim Hyoungseok, Lee Sang-woo, Lee Gichang, Kwak Donghyun, Jeon Dong Hyeon, Park Sunghyun, Kim Sungju, Kim Seonhoon, Seo Dongpil, Lee Heungsub, Jeong Minyoung, Lee Sungjae, Kim Minsub, Ko Suk Hyun, Kim Seokhun, Park Taeyong, Kim Jinuk, Kang Soyoung, Ryu Na-hyeon, Yoo Kang Min, Chang Minsuk, Suh Soobin, In Sookyo, Park Jinseong, Kim Kyungduk, Kim Hiun, Jeong Jisu, Yeo Yong Goo, Ham Donghoon, Park Dongju, Lee Min Young, Kang Jaewook, Kang Inho, Ha Jung-woo, Park Woomyoung, Sung Nako
- Long Context Question Answering Via Supervised Contrastive Learning Caciularu Avi, Dagan Ido, Goldberger Jacob, Cohan Arman
- Curriculum Learning For Language Modeling Campos Daniel
- Coarse-grained Decomposition And Fine-grained Interaction For Multi-hop Question Answering Cao Xing, Liu Yun
- Can Transformers Jump Around Right In Natural Language? Assessing Performance Transfer From SCAN Chaabouni Rahma, Dessì Roberto, Kharitonov Eugene
- The Selectgen Challenge: Finding The Best Training Samples For Few-shot Neural Text Generation Chang Ernie, Shen Xiaoyu, Marin Alex, Demberg Vera
- On Training Instance Selection For Few-shot Neural Text Generation Chang Ernie, Shen Xiaoyu, Yeh Hui-syuan, Demberg Vera
- Neural Data-to-text Generation With Lm-based Text Augmentation Chang Ernie, Shen Xiaoyu, Zhu Dawei, Demberg Vera, Su Hui
- Generic Attention-model Explainability For Interpreting Bi-modal And Encoder-decoder Transformers Chefer Hila, Gur Shir, Wolf Lior
- Topic Transferable Table Question Answering Chemmengath Saneem Ahmed, Kumar Vishwajeet, Bharadwaj Samarth, Sen Jaydeep, Canim Mustafa, Chakrabarti Soumen, Gliozzo Alfio, Sankaranarayanan Karthik
- Towards Making The Most Of Multilingual Pretraining For Zero-shot Neural Machine Translation Chen Guanhua, Ma Shuming, Chen Yun, Zhang Dongdong, Pan Jia, Wang Wenping, Wei Furu
- Visualgpt: Data-efficient Adaptation Of Pretrained Language Models For Image Captioning Chen Jun, Guo Han, Yi Kai, Li Boyang, Elhoseiny Mohamed
- Reinforcement Learning For Few-shot Text Generation Adaptation Cheng Pengsen, Dai Jinqiao, Liu Jiamiao, Liu Jiayong, Jia Peng
- MT6: Multilingual Pretrained Text-to-text Transformer With Translation Pairs Chi Zewen, Dong Li, Ma Shuming, Huang Shaohan, Mao Xian-ling, Huang Heyan, Wei Furu
- Breaking Down Multilingual Machine Translation Chiang Ting-rui, Chen Yi-pei, Yeh Yi-ting, Neubig Graham
- Unifying Vision-and-language Tasks Via Text Generation Cho Jaemin, Lei Jie, Tan Hao, Bansal Mohit
- Ensemble Fine-tuned Mbert For Translation Quality Estimation Chowdhury Shaika, Baili Naouel, Vannah Brian
- Long-range Modeling Of Source Code Files With Ewash: Extended Window Access By Syntax Hierarchy Clement Colin B., Lu Shuai, Liu Xiaoyu, Tufano Michele, Drain Dawn, Duan Nan, Sundaresan Neel, Svyatkovskiy Alexey
- Distilling Transformers For Neural Cross-domain Search Clement Colin B., Wu Chen, Drain Dawn, Sundaresan Neel
- Control Prefixes For Parameter-efficient Text Generation Clive Jordan, Cao Kris, Rei Marek
- Multilingual Multi-aspect Explainability Analyses On Machine Reading Comprehension Models Cui Yiming, Zhang Wei-nan, Che Wanxiang, Liu Ting, Chen Zhigang, Wang Shijin
- YANMTT: Yet Another Neural Machine Translation Toolkit Dabre Raj, Sumita Eiichiro
- Mention Memory: Incorporating Textual Knowledge Into Transformers Through Entity Mention Attention De Jong Michiel, Zemlyanskiy Yury, Fitzgerald Nicholas, Sha Fei, Cohen William
- Compression, Transduction, And Creation: A Unified Framework For Evaluating Natural Language Generation Deng Mingkai, Tan Bowen, Liu Zhengzhong, Xing Eric P., Hu Zhiting
- Multitask Finetuning For Improving Neural Machine Translation In Indian Languages Desai Shaily, Kshirsagar Atharva, Marathe Manisha
- EKTVQA: Generalized Use Of External Knowledge To Empower Scene Text In Text-vqa Dey Arka Ujjal, Valveny Ernest, Harit Gaurav
- Openprompt: An Open-source Framework For Prompt-learning Ding Ning, Hu Shengding, Zhao Weilin, Chen Yulin, Liu Zhiyuan, Zheng Hai-tao, Sun Maosong
- On-the-fly Attention Modulation For Neural Generation Dong Yue, Bhagavatula Chandra, Lu Ximing, Hwang Jena D., Bosselut Antoine, Cheung Jackie Chi Kit, Choi Yejin
- Attention Forcing For Machine Translation Dou Qingyun, Lu Yiting, Manakul Potsawee, Wu Xixin, Gales Mark J. F.
- Robustness Challenges In Model Distillation And Pruning For Natural Language Understanding Du Mengnan, Mukherjee Subhabrata, Cheng Yu, Shokouhi Milad, Hu Xia, Awadallah Ahmed Hassan
- Generative Context Pair Selection For Multi-hop Question Answering Dua Dheeru, Santos Cicero Nogueira Dos, Ng Patrick, Athiwaratkun Ben, Xiang Bing, Gardner Matt, Singh Sameer
- Robustly Optimized And Distilled Training For Natural Language Understanding Elfadeel Haytham, Peshterliev Stan
- Does CLIP Benefit Visual Question Answering In The Medical Domain As Much As It Does In The General Domain? Eslami Sedigheh, De Melo Gerard, Meinel Christoph
- Compressing Visual-linguistic Model Via Knowledge Distillation Fang Zhiyuan, Wang Jianfeng, Hu Xiaowei, Wang Lijuan, Yang Yezhou, Liu Zicheng
- SAPPHIRE: Approaches For Enhanced Concept-to-text Generation Feng Steven Y., Huynh Jessica, Narisetty Chaitanya, Hovy Eduard, Gangal Varun
- Retrieve, Caption, Generate: Visual Grounding For Enhancing Commonsense In Text Generation Models Feng Steven Y., Lu Kevin, Tao Zhuofu, Alikhani Malihe, Mitamura Teruko, Hovy Eduard, Gangal Varun
- Attention Weights In Transformer NMT Fail Aligning Words Between Sequences But Largely Explain Model Predictions Ferrando Javier, Costa-jussà Marta R.
- Bertaú: Itaú BERT For Digital Customer Service Finardi Paulo, Viegas José Dié, Ferreira Gustavo T., Mansano Alex F., Caridá Vinicius F.
- Grounded Graph Decoding Improves Compositional Generalization In Question Answering Gai Yu, Jain Paras, Zhang Wendi, Gonzalez Joseph E., Song Dawn, Stoica Ion
- Chop Chop BERT: Visual Question Answering By Chopping Visualbert's Heads Gao Chenyu, Zhu Qi, Wang Peng, Wu Qi
- Scalable Transformers For Neural Machine Translation Gao Peng, Geng Shijie, Qiao Yu, Wang Xiaogang, Dai Jifeng, Li Hongsheng
- Improving And Diagnosing Knowledge-based Visual Question Answering Via Entity Enhanced Knowledge Injection Garcia-olano Diego, Onoe Yasumasa, Ghosh Joydeep
- Cross-attention Is All You Need: Adapting Pretrained Transformers For Machine Translation Gheini Mozhdeh, Ren Xiang, May Jonathan
- Xeroalign: Zero-shot Cross-lingual Transformer Alignment Gritta Milan, Iacobacci Ignacio
- Long Text Generation By Modeling Sentence-level And Discourse-level Coherence Guan Jian, Mao Xiaoxi, Fan Changjie, Liu Zitao, Ding Wenbiao, Huang Minlie
- Block-skim: Efficient Question Answering For Transformer Guan Yue, Li Zhengyi, Leng Jingwen, Lin Zhouhan, Guo Minyi, Zhu Yuhao
- Longt5: Efficient Text-to-text Transformer For Long Sequences Guo Mandy, Ainslie Joshua, Uthus David, Ontanon Santiago, Ni Jianmo, Sung Yun-hsuan, Yang Yinfei
- A Million Tweets Are Worth A Few Points: Tuning Transformers For Customer Service Tasks Hadifar Amir, Labat Sofie, Hoste Véronique, Develder Chris, Demeester Thomas
- Greedy Gradient Ensemble For Robust Visual Question Answering Han Xinzhe, Wang Shuhui, Su Chi, Huang Qingming, Tian Qi
- Text2app: A Framework For Creating Android Apps From Text Descriptions Hasan Masum, Mehrab Kazi Sajeed, Ahmad Wasi Uddin, Shahriyar Rifat
- Zero-shot Controlled Generation With Encoder-decoder Transformers Hazarika Devamanyu, Namazifar Mahdi, Hakkani-tür Dilek
- Generate, Annotate, And Learn: NLP With Synthetic Text He Xuanli, Nassar Islam, Kiros Jamie, Haffari Gholamreza, Norouzi Mohammad
- Open Domain Question Answering Over Tables Via Dense Retrieval Herzig Jonathan, Müller Thomas, Krichene Syrine, Eisenschlos Julian Martin
- ENCONTER: Entity Constrained Progressive Sequence Generation Via Insertion-based Transformer Hsieh Lee-hsun, Lee Yang-yin, Lim Ee-peng
- The Niutrans System For WNGT 2020 Efficiency Task Hu Chi, Li Bei, Lin Ye, Li Yinqiao, Li Yanyang, Wang Chenglong, Xiao Tong, Zhu Jingbo
- Controllable Dialogue Generation With Disentangled Multi-grained Style Specification And Attribute Consistency Reward Hu Zhe, Cao Zhiwei, Chan Hou Pong, Liu Jiachen, Xiao Xinyan, Su Jinsong, Wu Hua
- Non-autoregressive Translation With Layer-wise Prediction And Deep Supervision Huang Chenyang, Zhou Hao, Zaïane Osmar R., Mou Lili, Li Lei
- Transfer Learning For Sequence Generation: From Single-source To Multi-source Huang Xuancheng, Xu Jingfang, Sun Maosong, Liu Yang
- Unifying Multimodal Transformer For Bi-directional Image And Text Generation Huang Yupan, Xue Hongwei, Liu Bei, Lu Yutong
- Visqa: X-raying Vision And Language Reasoning In Transformers Jaunet Theo, Kervadec Corentin, Vuillemot Romain, Antipov Grigory, Baccouche Moez, Wolf Christian
- Retraining Distilbert For A Voice Shopping Assistant By Using Universal Dependencies Jayarao Pratik, Sharma Arpit
- Improving Non-autoregressive Generation With Mixup Training Jiang Ting, Huang Shaohan, Zhang Zihan, Wang Deqing, Zhuang Fuzhen, Wei Furu, Huang Haizhen, Zhang Liangjie, Zhang Qi
- Enriching Transformers With Structured Tensor-product Representations For Abstractive Summarization Jiang Yichen, Celikyilmaz Asli, Smolensky Paul, Soulos Paul, Rao Sudha, Palangi Hamid, Fernandez Roland, Smith Caitlin, Bansal Mohit, Gao Jianfeng
- Lightmbert: A Simple Yet Effective Method For Multilingual BERT Distillation Jiao Xiaoqi, Yin Yichun, Shang Lifeng, Jiang Xin, Chen Xiao, Li Linlin, Wang Fang, Liu Qun
- A Good Prompt Is Worth Millions Of Parameters: Low-resource Prompt-based Learning For Vision-language Models Jin Woojeong, Cheng Yu, Shen Yelong, Chen Weizhu, Ren Xiang
- Rethinking Self-supervision Objectives For Generalizable Coherence Modeling Jwalapuram Prathyusha, Joty Shafiq, Lin Xiang
- Nmt5 -- Is Parallel Data Still Relevant For Pre-training Massively Multilingual Language Models? Kale Mihir, Siddhant Aditya, Constant Noah, Johnson Melvin, Al-rfou Rami, Xue Linting
- Distilling Large Language Models Into Tiny And Effective Students Using Pqrnn Kaliamoorthi Prabhu, Siddhant Aditya, Li Edward, Johnson Melvin
- Tiny Neural Models For Seq2seq Kandoor Arun
- Amazon Sagemaker Model Parallelism: A General And Flexible Framework For Large Model Training Karakus Can, Huilgol Rahul, Wu Fei, Subramanian Anirudh, Daniel Cade, Cavdar Derya, Xu Teng, Chen Haohan, Rahnama Arash, Quintela Luis
- What Do Pre-trained Code Models Know About Code? Karmakar Anjan, Robbes Romain
- Uquad1.0: Development Of An Urdu Question Answering Training Data For Machine Reading Comprehension Kazi Samreen, Khoja Shakeel
- Generating Rich Product Descriptions For Conversational E-commerce Systems Kedia Shashank, Mantha Aditya, Gupta Sneha, Guo Stephen, Achan Kannan
- How Transferable Are Reasoning Patterns In VQA? Kervadec Corentin, Jaunet Theo, Antipov Grigory, Baccouche Moez, Vuillemot Romain, Wolf Christian
- Visual Question Answering Based On Local-scene-aware Referring Expression Generation Kim Jung-jun, Lee Dong-gyu, Wu Jialin, Jung Hong-gyu, Lee Seong-whan
- Improving Scheduled Sampling With Elastic Weight Consolidation For Neural Machine Translation Korakakis Michalis, Vlachos Andreas
- Controlling Conditional Language Models Without Catastrophic Forgetting Korbak Tomasz, Elsahar Hady, Kruszewski German, Dymetman Marc
- Hurdles To Progress In Long-form Question Answering Krishna Kalpesh, Roy Aurko, Iyyer Mohit
- On The Significance Of Question Encoder Sequence Model In The Out-of-distribution Performance In Visual Question Answering Kv Gouthaman, Mittal Anurag
- Pagnol: An Extra-large French Generative Model Launay Julien, Tommasone Elena, Pannier Baptiste, Boniface François, Chatelain Amélie, Cappelli Alessandro, Poli Iacopo, Seddah Djamé
- You Only Need One Model For Open-domain Question Answering Lee Haejun, Kedia Akhil, Lee Jongwon, Paranjape Ashwin, Manning Christopher D., Woo Kyoung-gu
- Towards Few-shot Fact-checking Via Perplexity Lee Nayeon, Bang Yejin, Madotto Andrea, Khabsa Madian, Fung Pascale
- Exploiting Sentence-level Representations For Passage Ranking Leonhardt Jurek, Beringer Fabian, Anand Avishek
- Residual Tree Aggregation Of Layers For Neural Machine Translation Li Guoliang, Li Yiyang
- Personalized Transformer For Explainable Recommendation Li Lei, Zhang Yongfeng, Chen Li
- Ensemble ALBERT On Squad 2.0 Li Shilun, Li Renee, Peng Veronica
- Question-aware Memory Network For Multi-hop Question Answering In Human-robot Interaction Li Xinmeng, Alazab Mamoun, Li Qian, Yu Keping, Yin Quanjun
- Lyra: A Benchmark For Turducken-style Code Generation Liang Qingyuan, Sun Zeyu, Zhu Qihao, Zhang Wenjie, Yu Lian, Xiong Yingfei, Zhang Lu
- Domain-independent User Simulation With Transformers For Task-oriented Dialogue Systems Lin Hsien-chin, Lubis Nurul, Hu Songbo, Van Niekerk Carel, Geishauser Christian, Heck Michael, Feng Shutong, Gašić Milica
- MDQE: A More Accurate Direct Pretraining For Machine Translation Quality Estimation Lin Lei
- Few-shot Learning With Multilingual Language Models Lin Xi Victoria, Mihaylov Todor, Artetxe Mikel, Wang Tianlu, Chen Shuohui, Simig Daniel, Ott Myle, Goyal Naman, Bhosale Shruti, Du Jingfei, Pasunuru Ramakanth, Shleifer Sam, Koura Punit Singh, Chaudhary Vishrav, O'horo Brian, Wang Jeff, Zettlemoyer Luke, Kozareva Zornitsa, Diab Mona, Stoyanov Veselin, Li Xian
- VX2TEXT: End-to-end Learning Of Video-based Text Generation From Multimodal Inputs Lin Xudong, Bertasius Gedas, Wang Jue, Chang Shih-fu, Parikh Devi, Torresani Lorenzo
- Dexperts: Decoding-time Controlled Text Generation With Experts And Anti-experts Liu Alisa, Sap Maarten, Lu Ximing, Swayamdipta Swabha, Bhagavatula Chandra, Smith Noah A., Choi Yejin
- Codeqa: A Question Answering Dataset For Source Code Comprehension Liu Chenxiao, Wan Xiaojun
- What Makes Good In-context Examples For GPT-3? Liu Jiachang, Shen Dinghan, Zhang Yizhe, Dolan Bill, Carin Lawrence, Chen Weizhu
- Unified Multimodal Pre-training And Prompt-based Tuning For Vision-language Understanding And Generation Liu Tianyi, Wu Zuxuan, Xiong Wenhan, Chen Jingjing, Jiang Yu-gang
- A Token-level Reference-free Hallucination Detection Benchmark For Free-form Text Generation Liu Tianyu, Zhang Yizhe, Brockett Chris, Mao Yi, Sui Zhifang, Chen Weizhu, Dolan Bill
- GPT Understands, Too Liu Xiao, Zheng Yanan, Du Zhengxiao, Ding Ming, Qian Yujie, Yang Zhilin, Tang Jie
- Enriching Non-autoregressive Transformer With Syntactic And Semantic Structures For Neural Machine Translation Liu Ye, Wan Yao, Zhang Jian-guo, Zhao Wenting, Yu Philip S.
- Confidence-aware Scheduled Sampling For Neural Machine Translation Liu Yijin, Meng Fandong, Chen Yufeng, Xu Jinan, Zhou Jie
- Scheduled Sampling Based On Decoding Steps For Neural Machine Translation Liu Yijin, Meng Fandong, Chen Yufeng, Xu Jinan, Zhou Jie
- Iconqa: A New Benchmark For Abstract Diagram Understanding And Visual Language Reasoning Lu Pan, Qiu Liang, Chen Jiaqi, Xia Tony, Zhao Yizhou, Zhang Wei, Yu Zhou, Liang Xiaodan, Zhu Song-chun
- X-METRA-ADA: Cross-lingual Meta-transfer Learning Adaptation To Natural Language Understanding And Question Answering M'hamdi Meryem, Kim Doo Soon, Dernoncourt Franck, Bui Trung, Ren Xiang, May Jonathan
- Deltalm: Encoder-decoder Pre-training For Language Generation And Translation By Augmenting Pretrained Multilingual Encoders Ma Shuming, Dong Li, Huang Shaohan, Zhang Dongdong, Muzio Alexandre, Singhal Saksham, Awadalla Hany Hassan, Song Xia, Wei Furu
- Evaluating Large Language Models Trained On Code Mark Chen, Jerry Tworek, Heewoo Jun, Qiming Yuan, Henrique Ponde De Oliveira Pinto, Jared Kaplan, Harri Edwards, Yuri Burda, Nicholas Joseph, Greg Brockman, Alex Ray, Raul Puri, Gretchen Krueger, Michael Petrov, Heidy Khlaaf, Girish Sastry, Pamela Mishkin, Brooke Chan, Scott Gray, Nick Ryder, Mikhail Pavlov, Alethea Power, Lukasz Kaiser, Mohammad Bavarian, Clemens Winter, Philippe Tillet, Felipe Petroski Such, Dave Cummings, Matthias Plappert, Fotios Chantzis, Elizabeth Barnes, Ariel Herbert-voss, William Hebgen Guss, Alex Nichol, Alex Paino, Nikolas Tezak, Jie Tang, Igor Babuschkin, Suchir Balaji, Shantanu Jain, William Saunders, Christopher Hesse, Andrew N. Carr, Jan Leike, Josh Achiam, Vedant Misra, Evan Morikawa, Alec Radford, Matthew Knight, Miles Brundage, Mira Murati, Katie Mayer, Peter Welinder, Bob Mcgrew, Dario Amodei, Sam Mccandlish, Ilya Sutskever, Wojciech Zaremba
- How Much Do Language Models Copy From Their Training Data? Evaluating Linguistic Novelty In Text Generation Using RAVEN Mccoy R. Thomas, Smolensky Paul, Linzen Tal, Gao Jianfeng, Celikyilmaz Asli
- Improving Compositional Generalization With Self-training For Data-to-text Generation Mehta Sanket Vaibhav, Rao Jinfeng, Tay Yi, Kale Mihir, Parikh Ankur P., Strubell Emma
- Prevent The Language Model From Being Overconfident In Neural Machine Translation Miao Mengqi, Meng Fandong, Liu Yijin, Zhou Xiao-hua, Zhou Jie
- Structural Analysis Of An All-purpose Question Answering Model Micheli Vincent, Heinrich Quentin, Fleuret François, Belblidia Wacim
- Recent Advances In Natural Language Processing Via Large Pre-trained Language Models: A Survey Min Bonan, Ross Hayley, Sulem Elior, Veyseh Amir Pouran Ben, Nguyen Thien Huu, Sainz Oscar, Agirre Eneko, Heinz Ilana, Roth Dan
- Distantly Supervised Transformers For E-commerce Product QA Mittal Happy, Chakrabarti Aniket, Bayar Belhassen, Sharma Animesh Anant, Rasiwasia Nikhil
- BERTGEN: Multi-task Generation Through BERT Mitzalis Faidon, Caglayan Ozan, Madhyastha Pranava, Specia Lucia
- Multi-modal Understanding And Generation For Medical Images And Text Via Vision-language Pre-training Moon Jong Hak, Lee Hyungyung, Shin Woncheol, Kim Young-hak, Choi Edward
- Software-based Dialogue Systems: Survey, Taxonomy And Challenges Motger Quim, Franch Xavier, Marco Jordi
- Planning With Learned Entity Prompts For Abstractive Summarization Narayan Shashi, Zhao Yao, Maynez Joshua, Simoes Gonçalo, Nikolaev Vitaly, Mcdonald Ryan
- Hierarchical Transformers Are More Efficient Language Models Nawrot Piotr, Tworkowski Szymon, Tyrolski Michał, Kaiser Łukasz, Wu Yuhuai, Szegedy Christian, Michalewski Henryk
- Improving Transformers With Probabilistic Attention Keys Nguyen Tam, Nguyen Tan M., Le Dung D., Nguyen Duy Khuong, Tran Viet-anh, Baraniuk Richard G., Ho Nhat, Osher Stanley J.
- Materialized Knowledge Bases From Commonsense Transformers Nguyen Tuan-phong, Razniewski Simon
- Code Generation From Natural Language With Less Prior And More Monolingual Data Norouzi Sajad, Tang Keyi, Cao Yanshuai
- Learning To Retrieve Prompts For In-context Learning Ohad Rubin, Jonathan Herzig, Jonathan Berant
- Cascading Adaptors To Leverage English Data To Improve Performance Of Question Answering For Low-resource Languages Pandya Hariom A., Ardeshna Bhavik, Bhatt Brijesh S.
- PENELOPIE: Enabling Open Information Extraction For The Greek Language Through Machine Translation Papadopoulos Dimitris, Papadakis Nikolaos, Matsatsinis Nikolaos
- Do Encoder Representations Of Generative Dialogue Models Encode Sufficient Information About The Task? Parthasarathi Prasanna, Pineau Joelle, Chandar Sarath
- A Comparative Study Of Transformer-based Language Models On Extractive Question Answering Pearce Kate, Zhan Tiffany, Komanduri Aneesh, Zhan Justin
- ABC: Attention With Bounded-memory Control Peng Hao, Kasai Jungo, Pappas Nikolaos, Yogatama Dani, Wu Zhaofeng, Kong Lingpeng, Schwartz Roy, Smith Noah A.
- Cotext: Multi-task Learning With Code-text Transformer Phan Long, Tran Hieu, Le Daniel, Nguyen Hieu, Anibal James, Peltekian Alec, Ye Yanfang
- Context Transformer With Stacked Pointer Networks For Conversational Question Answering Over Knowledge Graphs Plepi Joan, Kacupaj Endri, Singh Kuldeep, Thakkar Harsh, Lehmann Jens
- Focused Attention Improves Document-grounded Generation Prabhumoye Shrimai, Hashimoto Kazuma, Zhou Yingbo, Black Alan W, Salakhutdinov Ruslan
- Attentive Fine-tuning Of Transformers For Translation Of Low-resourced Languages @loresmt 2021 Puranik Karthik, Hande Adeep, Priyadharshini Ruba, Durairaj Thenmozhi, Sampath Anbukkarasi, Thamburaj Kingston Pal, Chakravarthi Bharathi Raja
- Robertuito: A Pre-trained Language Model For Social Media Text In Spanish Pérez Juan Manuel, Furman Damián A., Alemany Laura Alonso, Luque Franco
- Vinmt: Neural Machine Translation Toolkit Quan Nguyen Hoang, Dat Nguyen Thanh, Cong Nguyen Hoang Minh, Van Vinh Nguyen, Vinh Ngo Thi, Thai Nguyen Phuong, Viet Tran Hong
- Grid Search Hyperparameter Benchmarking Of BERT, ALBERT, And Longformer On Duorc Quijano Alex John, Nguyen Sam, Ordonez Juanita
- Prompt Programming For Large Language Models: Beyond The Few-shot Paradigm Reynolds Laria, Mcdonell Kyle
- Revisiting Context Choices For Context-aware Machine Translation Rikters Matīss, Nakazawa Toshiaki
- Tailor: Generating And Perturbing Text With Semantic Controls Ross Alexis, Wu Tongshuang, Peng Hao, Peters Matthew E., Gardner Matt
- Grounding Natural Language Instructions: Can Large Language Models Capture Spatial Information? Rozanova Julia, Ferreira Deborah, Dubba Krishna, Cheng Weiwei, Zhang Dell, Freitas Andre
- End-to-end Training Of Neural Retrievers For Open-domain Question Answering Sachan Devendra Singh, Patwary Mostofa, Shoeybi Mohammad, Kant Neel, Ping Wei, Hamilton William L, Catanzaro Bryan
- Towards Solving Multimodal Comprehension Sahu Pritish, Sikka Karan, Divakaran Ajay
- Simple Entity-centric Questions Challenge Dense Retrievers Sciavolino Christopher, Zhong Zexuan, Lee Jinhyuk, Chen Danqi
- Metaicl: Learning To Learn In Context Sewon Min, Mike Lewis, Luke Zettlemoyer, Hannaneh Hajishirzi
- Audio-visual Scene-aware Dialog And Reasoning Using Audio-visual Transformers With Joint Student-teacher Learning Shah Ankit P., Geng Shijie, Gao Peng, Cherian Anoop, Hori Takaaki, Marks Tim K., Roux Jonathan Le, Hori Chiori
- CPT: A Pre-trained Unbalanced Transformer For Both Chinese Language Understanding And Generation Shao Yunfan, Geng Zhichao, Liu Yitao, Dai Junqi, Yan Hang, Yang Fei, Zhe Li, Bao Hujun, Qiu Xipeng
- Better Neural Machine Translation By Extracting Linguistic Information From BERT Shavarani Hassan S., Sarkar Anoop
- FPM: A Collection Of Large-scale Foundation Pre-trained Language Models Shen Dezhou
- Societal Biases In Language Generation: Progress And Challenges Sheng Emily, Chang Kai-wei, Natarajan Premkumar, Peng Nanyun
- Contextual Biasing Of Language Models For Speech Recognition In Goal-oriented Conversational Agents Shenoy Ashish, Bodapati Sravan, Kirchhoff Katrin
- Exploring Low-cost Transformer Model Compression For Large-scale Commercial Reply Suggestions Shrivastava Vaishnavi, Gaonkar Radhika, Gupta Shashank, Jha Abhishek
- Structural Persistence In Language Models: Priming As A Window Into Abstract Language Representations Sinclair Arabella, Jumelet Jaap, Zuidema Willem, Fernández Raquel
- Semantics-aware Attention Improves Neural Machine Translation Slobodkin Aviv, Choshen Leshem, Abend Omri
- Multimodal Integration Of Human-like Attention In Visual Question Answering Sood Ekta, Kögel Fabian, Müller Philipp, Thomas Dominike, Bace Mihai, Bulling Andreas
- VQA-MHUG: A Gaze Dataset To Study Multimodal Neural Attention In Visual Question Answering Sood Ekta, Kögel Fabian, Strohm Florian, Dhar Prajit, Bulling Andreas
- Txt: Crossmodal End-to-end Learning With Transformers Steitz Jan-martin O., Pfeiffer Jonas, Gurevych Iryna, Roth Stefan
- Ernie-tiny: A Progressive Distillation Framework For Pretrained Transformer Compression Su Weiyue, Chen Xuyi, Feng Shikun, Liu Jiaxiang, Liu Weixin, Sun Yu, Tian Hao, Wu Hua, Wang Haifeng
- Non-autoregressive Text Generation With Pre-trained Language Models Su Yixuan, Cai Deng, Wang Yan, Vandyke David, Baker Simon, Li Piji, Collier Nigel
- Tacl: Improving BERT Pre-training With Token-aware Contrastive Learning Su Yixuan, Liu Fangyu, Meng Zaiqiao, Lan Tian, Shu Lei, Shareghi Ehsan, Collier Nigel
- NVIDIA Nemo Neural Machine Translation Systems For English-german And English-russian News And Biomedical Tasks At WMT21 Subramanian Sandeep, Hrinchuk Oleksii, Adams Virginia, Kuchaiev Oleksii
- A Non-hierarchical Attention Network With Modality Dropout For Textual Response Generation In Multimodal Dialogue Systems Sun Rongyi, Chen Borun, Zhou Qingyu, Li Yinghui, Cao Yunbo, Zheng Hai-tao
- Lightningdot: Pre-training Visual-semantic Embeddings For Real-time Image-text Retrieval Sun Siqi, Chen Yen-chun, Li Linjie, Wang Shuohang, Fang Yuwei, Liu Jingjing
- Know Deeper: Knowledge-conversation Cyclic Utilization Mechanism For Open-domain Dialogue Generation Sun Yajing, Hu Yue, Xing Luxi, Xie Yuqiang, Wei Xiangpeng
- ERNIE 3.0: Large-scale Knowledge Enhanced Pre-training For Language Understanding And Generation Sun Yu, Wang Shuohuan, Feng Shikun, Ding Siyu, Pang Chao, Shang Junyuan, Liu Jiaxiang, Chen Xuyi, Zhao Yanbin, Lu Yuxiang, Liu Weixin, Wu Zhihua, Gong Weibao, Liang Jianzhong, Shang Zhizhou, Sun Peng, Liu Wei, Ouyang Xuan, Yu Dianhai, Tian Hao, Wu Hua, Wang Haifeng
- Multilingual Translation Via Grafting Pre-trained Language Models Sun Zewei, Wang Mingxuan, Li Lei
- Data Processing Matters: Srph-konvergen Ai's Machine Translation System For WMT'21 Sutawika Lintang, Cruz Jan Christian Blaise
- Trusting Roberta Over BERT: Insights From Checklisting The Natural Language Inference Task Tarunesh Ishan, Aditya Somak, Choudhury Monojit
- Diff-explainer: Differentiable Convex Optimization For Explainable Multi-hop Inference Thayaparan Mokanarangan, Valentino Marco, Ferreira Deborah, Rozanova Julia, Freitas André
- Towards Reinforcement Learning For Pivot-based Neural Machine Translation With Non-autoregressive Transformer Tokarchuk Evgeniia, Rosendahl Jan, Wang Weiyue, Petrushkov Pavel, Lancewicki Tomer, Khadivi Shahram, Ney Hermann
- Exploring Transformers In Natural Language Generation: GPT, BERT, And Xlnet Topal M. Onat, Bas Anil, Van Heerden Imke
- Local Explanation Of Dialogue Response Generation Tuan Yi-lin, Pryor Connor, Chen Wenhu, Getoor Lise, Wang William Yang
- Comparing Test Sets With Item Response Theory Vania Clara, Htut Phu Mon, Huang William, Mungra Dhara, Pang Richard Yuanzhe, Phang Jason, Liu Haokun, Cho Kyunghyun, Bowman Samuel R.
- Bertnesia: Investigating The Capture And Forgetting Of Knowledge In BERT Wallat Jonas, Singh Jaspreet, Anand Avishek
- Selective Knowledge Distillation For Neural Machine Translation Wang Fusheng, Yan Jianhao, Meng Fandong, Zhou Jie
- Contextualized Scene Imagination For Generative Commonsense Reasoning Wang Peifeng, Zamora Jonathan, Liu Junfeng, Ilievski Filip, Chen Muhao, Ren Xiang
- Sentence Semantic Regression For Text Generation Wang Wei, Li Piji, Zheng Hai-tao
- Lightseq2: Accelerated Training For Transformer-based Models On Gpus Wang Xiaohui, Wei Yang, Xiong Ying, Huang Guyue, Qian Xian, Ding Yufei, Wang Mingxuan, Li Lei
- Distilled Dual-encoder Model For Vision-language Understanding Wang Zekun, Wang Wenhui, Zhu Haichao, Liu Ming, Qin Bing, Wei Furu
- Training Multilingual Pre-trained Language Model With Byte-level Subwords Wei Junqiu, Liu Qun, Guo Yinpeng, Jiang Xin
- Pangu-α: Large-scale Autoregressive Pretrained Chinese Language Models With Auto-parallel Computation Wei Zeng, Xiaozhe Ren, Teng Su, Hui Wang, Yi Liao, Zhiwei Wang, Xin Jiang, Zhenzhang Yang, Kaisheng Wang, Xiaoda Zhang, Chen Li, Ziyan Gong, Yifan Yao, Xinjing Huang, Jun Wang, Jianfeng Yu, Qi Guo, Yue Yu, Yan Zhang, Jin Wang, Hengtao Tao, Dasen Yan, Zexuan Yi, Fang Peng, Fangqing Jiang, Han Zhang, Lingfeng Deng, Yehong Zhang, Zhe Lin, Chao Zhang, Shaojie Zhang, Mingyue Guo, Shanzhi Gu, Gaojun Fan, Yaowei Wang, Xuefeng Jin, Qun Liu, Yonghong Tian
- AI Chains: Transparent And Controllable Human-ai Interaction By Chaining Large Language Model Prompts Wu Tongshuang, Terry Michael, Cai Carrie J.
- Prefix-tuning: Optimizing Continuous Prompts For Generation Xiang Lisa Li, Percy Liang
- Simple Local Attentions Remain Competitive For Long-context Tasks Xiong Wenhan, Oğuz Barlas, Gupta Anchit, Chen Xilun, Liskovich Diana, Levy Omer, Yih Wen-tau, Mehdad Yashar
- Moca: Incorporating Multi-stage Domain Pretraining And Cross-guided Multimodal Attention For Textbook Question Answering Xu Fangzhi, Lin Qika, Liu Jun, Zhang Lingling, Zhao Tianzhe, Chai Qi, Pan Yudai
- Dissecting Generation Modes For Abstractive Summarization Models Via Ablation And Attribution Xu Jiacheng, Durrett Greg
- Attention-guided Generative Models For Extractive Question Answering Xu Peng, Liang Davis, Huang Zhiheng, Xiang Bing
- K-PLUG: Knowledge-injected Pre-trained Language Model For Natural Language Understanding And Generation In E-commerce Xu Song, Li Haoran, Yuan Peng, Wang Yujia, Wu Youzheng, He Xiaodong, Liu Ying, Zhou Bowen
- Improving Conversational Recommendation Systems' Quality With Context-aware Item Meta Information Yang Bowen, Han Cong, Li Yu, Zuo Lei, Yu Zhou
- An Empirical Study Of GPT-3 For Few-shot Knowledge-based VQA Yang Zhengyuan, Gan Zhe, Wang Jianfeng, Hu Xiaowei, Lu Yumao, Liu Zicheng, Wang Lijuan
- TR-BERT: Dynamic Token Reduction For Accelerating BERT Inference Ye Deming, Lin Yankai, Huang Yufei, Sun Maosong
- Leveraging Linguistic Coordination In Reranking N-best Candidates For End-to-end Response Selection Using BERT Yu Mingzhi, Litman Diane
- Language Bias In Visual Question Answering: A Survey And Taxonomy Yuan Desen
- Prune Once For All: Sparse Pre-trained Language Models Zafrir Ofir, Larey Ariel, Boudoukh Guy, Shen Haihao, Wasserblat Moshe
- Bert-coqac: Bert-based Conversational Question Answering In Context Zaib Munazza, Tran Dai Hoang, Sagar Subhash, Mahmood Adnan, Zhang Wei E., Sheng Quan Z.
- Topical Language Generation Using Transformers Zandie Rohola, Mahoor Mohammad H.
- Readtwice: Reading Very Large Documents With Memories Zemlyanskiy Yury, Ainslie Joshua, De Jong Michiel, Pham Philip, Eckstein Ilya, Sha Fei
- Wechat Neural Machine Translation Systems For WMT21 Zeng Xianfeng, Liu Yijin, Li Ernan, Ran Qiu, Meng Fandong, Li Peng, Xu Jinan, Zhou Jie
- Ernie-vilg: Unified Generative Pre-training For Bidirectional Vision-language Generation Zhang Han, Yin Weichong, Fang Yewei, Li Lanxin, Duan Boqiang, Wu Zhihua, Sun Yu, Tian Hao, Wu Hua, Wang Haifeng
- DSGPT: Domain-specific Generative Pre-training Of Transformers For Text Generation In E-commerce Title And Review Summarization Zhang Xueying, Jiang Yunjiang, Shang Yue, Cheng Zhaomeng, Zhang Chi, Fan Xiaochuan, Xiao Yun, Long Bo
- Retgen: A Joint Framework For Retrieval And Grounded Text Generation Modeling Zhang Yizhe, Sun Siqi, Gao Xiang, Fang Yuwei, Brockett Chris, Galley Michel, Gao Jianfeng, Dolan Bill
- An Exploratory Study On Long Dialogue Summarization: What Works And What's Next Zhang Yusen, Ni Ansong, Yu Tao, Zhang Rui, Zhu Chenguang, Deb Budhaditya, Celikyilmaz Asli, Awadallah Ahmed Hassan, Radev Dragomir
- Towards More Efficient Insertion Transformer With Fractional Positional Encoding Zhang Zhisong, Zhang Yizhe, Dolan Bill
- Learning To Rehearse In Long Sequence Memorization Zhang Zhu, Zhou Chang, Ma Jianxin, Lin Zhijie, Zhou Jingren, Yang Hongxia, Zhao Zhou
- Fewnlu: Benchmarking State-of-the-art Methods For Few-shot Natural Language Understanding Zheng Yanan, Zhou Jing, Qian Yujie, Ding Ming, Liao Chonghua, Li Jian, Salakhutdinov Ruslan, Tang Jie, Ruder Sebastian, Yang Zhilin
- Mmchat: Multi-modal Chat Dataset On Social Media Zheng Yinhe, Chen Guanyi, Liu Xin, Sun Jian
- Self-guided Curriculum Learning For Neural Machine Translation Zhou Lei, Ding Liang, Duh Kevin, Watanabe Shinji, Sasano Ryohei, Takeda Koichi
- Thinking Clearly, Talking Fast: Concept-guided Non-autoregressive Generation For Open-domain Dialogue Systems Zou Yicheng, Liu Zhihua, Hu Xingwu, Zhang Qi
- Palm: Scaling Language Modeling With Pathways Aakanksha Chowdhery, Sharan Narang, Jacob Devlin, Maarten Bosma, Gaurav Mishra, Adam Roberts, Paul Barham, Hyung Won Chung, Charles Sutton, Sebastian Gehrmann, Parker Schuh, Kensen Shi, Sasha Tsvyashchenko, Joshua Maynez, Abhishek Rao, Parker Barnes, Yi Tay, Noam Shazeer, Vinodkumar Prabhakaran, Emily Reif, Nan Du, Ben Hutchinson, Reiner Pope, James Bradbury, Jacob Austin, Michael Isard, Guy Gur-ari, Pengcheng Yin, Toju Duke, Anselm Levskaya, Sanjay Ghemawat, Sunipa Dev, Henryk Michalewski, Xavier Garcia, Vedant Misra, Kevin Robinson, Liam Fedus, Denny Zhou, Daphne Ippolito, David Luan, Hyeontaek Lim, Barret Zoph, Alexander Spiridonov, Ryan Sepassi, David Dohan, Shivani Agrawal, Mark Omernick, Andrew M. Dai, Thanumalayan Sankaranarayana Pillai, Marie Pellat, Aitor Lewkowycz, Erica Moreira, Rewon Child, Oleksandr Polozov, Katherine Lee, Zongwei Zhou, Xuezhi Wang, Brennan Saeta, Mark Diaz, Orhan Firat, Michele Catasta, Jason Wei, Kathy Meier-hellstern, Douglas Eck, Jeff Dean, Slav Petrov, Noah Fiedel
- Global Memory Transformer For Processing Long Documents Adel Arij Al
- Vector Representations Of Idioms In Conversational Systems Adewumi Tosin, Liwicki Foteini, Liwicki Marcus
- Vl-interpret: An Interactive Visualization Tool For Interpreting Vision-language Transformers Aflalo Estelle, Du Meng, Tseng Shao-yen, Liu Yongfei, Wu Chenfei, Duan Nan, Lal Vasudev
- Using Large Language Models To Simulate Multiple Humans And Replicate Human Subject Studies Aher Gati, Arriaga Rosa I., Kalai Adam Tauman
- Few-shot Training Llms For Project-specific Code-summarization Ahmed Toufique, Devanbu Premkumar
- Solving Quantitative Reasoning Problems With Language Models Aitor Lewkowycz, Anders Andreassen, David Dohan, Ethan Dyer, Henryk Michalewski, Vinay Ramasesh, Ambrose Slone, Cem Anil, Imanol Schlag, Theo Gutman-solo, Yuhuai Wu, Behnam Neyshabur, Guy Gur-ari, Vedant Misra
- Compound Tokens: Channel Fusion For Vision-language Representation Learning Aladago Maxwell Mbabilla, Piergiovanni Aj
- Despite "super-human" Performance, Current Llms Are Unsuited For Decisions About Ethics And Safety Albrecht Joshua, Kitanidis Ellie, Fetterman Abraham J.
- Prompting As Probing: Using Language Models For Knowledge Base Construction Alivanistos Dimitrios, Santamaría Selene Báez, Cochez Michael, Kalo Jan-christoph, Van Krieken Emile, Thanapalasingam Thiviyan
- Text And Patterns: For Effective Chain Of Thought, It Takes Two To Tango Aman Madaan, Amir Yazdanbakhsh
- Language Models Of Code Are Few-shot Commonsense Learners Aman Madaan, Shuyan Zhou, Uri Alon, Yiming Yang, Graham Neubig
- Query Refinement Prompts For Closed-book Long-form Question Answering Amplayo Reinald Kim, Webster Kellie, Collins Michael, Das Dipanjan, Narayan Shashi
- Input-tuning: Adapting Unfamiliar Inputs To Frozen Pretrained Models An Shengnan, Li Yifei, Lin Zeqi, Liu Qian, Chen Bei, Fu Qiang, Chen Weizhu, Zheng Nanning, Lou Jian-guang
- A Comparative Study On Language Models For Task-oriented Dialogue Systems Andreas Vinsen Marselino, Winata Genta Indra, Purwarianti Ayu
- Compositional Semantic Parsing With Large Language Models Andrew Drozdov, Nathanael Schärli, Ekin Akyürek, Nathan Scales, Xinying Song, Xinyun Chen, Olivier Bousquet, Denny Zhou
- Socratic Models: Composing Zero-shot Multimodal Reasoning With Language Andy Zeng, Maria Attarian, Brian Ichter, Krzysztof Choromanski, Adrian Wong, Stefan Welker, Federico Tombari, Aveek Purohit, Michael Ryoo, Vikas Sindhwani, Johnny Lee, Vincent Vanhoucke, Pete Florence
- Small Batch Sizes Improve Training Of Low-resource Neural MT Atrio Àlex R., Popescu-belis Andrei
- Vl-beit: Generative Vision-language Pretraining Bao Hangbo, Wang Wenhui, Dong Li, Wei Furu
- P³LM: Probabilistically Permuted Prophet Language Modeling For Generative Pre-training Bao Junwei, Wang Yifan, Ying Jiangyong, Gong Yeyun, Zhao Jing, Wu Youzheng, He Xiaodong
- latent-glat: Glancing At Latent Variables For Parallel Text Generation Bao Yu, Zhou Hao, Huang Shujian, Wang Dongqi, Qian Lihua, Dai Xinyu, Chen Jiajun, Li Lei
- Logical Reasoning For Task Oriented Dialogue Systems Beygi Sajjad, Fazel-zarandi Maryam, Cervone Alessandra, Krishnan Prakash, Jonnalagadda Siddhartha Reddy
- Vector-quantized Input-contextualized Soft Prompts For Natural Language Understanding Bhardwaj Rishabh, Saha Amrita, Hoi Steven C. H., Poria Soujanya
- Banglanlg And Banglat5: Benchmarks And Resources For Evaluating Low-resource Natural Language Generation In Bangla Bhattacharjee Abhik, Hasan Tahmid, Ahmad Wasi Uddin, Shahriyar Rifat
- BLOOM: A 176b-parameter Open-access Multilingual Language Model Bigscience Workshop, Teven Le Scao, Angela Fan, Christopher Akiki, Ellie Pavlick, Suzana Ilić, Daniel Hesslow, Roman Castagné, Alexandra Sasha Luccioni, François Yvon, Matthias Gallé, Jonathan Tow, Alexander M. Rush, Stella Biderman, Albert Webson, Pawan Sasanka Ammanamanchi, Thomas Wang, Benoît Sagot, Niklas Muennighoff, Albert Villanova Del Moral, Olatunji Ruwase, Rachel Bawden, Stas Bekman, Angelina Mcmillan-major, Iz Beltagy, Huu Nguyen, Lucile Saulnier, Samson Tan, Pedro Ortiz Suarez, Victor Sanh, Hugo Laurençon, Yacine Jernite, Julien Launay, Margaret Mitchell, Colin Raffel, Aaron Gokaslan, Adi Simhi, Aitor Soroa, Alham Fikri Aji, Amit Alfassy, Anna Rogers, Ariel Kreisberg Nitzav, Canwen Xu, Chenghao Mou, Chris Emezue, Christopher Klamm, Colin Leong, Daniel Van Strien, David Ifeoluwa Adelani, Dragomir Radev, Eduardo González Ponferrada, Efrat Levkovizh, Ethan Kim, Eyal Bar Natan, Francesco De Toni, Gérard Dupont, Germán Kruszewski, Giada Pistilli, Hady Elsahar, Hamza Benyamina, Hieu Tran, Ian Yu, Idris Abdulmumin, Isaac Johnson, Itziar Gonzalez-dios, Javier De La Rosa, Jenny Chim, Jesse Dodge, Jian Zhu, Jonathan Chang, Jörg Frohberg, Joseph Tobing, Joydeep Bhattacharjee, Khalid Almubarak, Kimbo Chen, Kyle Lo, Leandro Von Werra, Leon Weber, Long Phan, Loubna Ben Allal, Ludovic Tanguy, Manan Dey, Manuel Romero Muñoz, Maraim Masoud, María Grandury, Mario Šaško, Max Huang, Maximin Coavoux, Mayank Singh, Mike Tian-jian Jiang, Minh Chien Vu, Mohammad A. Jauhar, Mustafa Ghaleb, Nishant Subramani, Nora Kassner, Nurulaqilla Khamis, Olivier Nguyen, Omar Espejel, Ona De Gibert, Paulo Villegas, Peter Henderson, Pierre Colombo, Priscilla Amuok, Quentin Lhoest, Rheza Harliman, Rishi Bommasani, Roberto Luis López, Rui Ribeiro, Salomey Osei, Sampo Pyysalo, Sebastian Nagel, Shamik Bose, Shamsuddeen Hassan Muhammad, Shanya Sharma, Shayne Longpre, Somaieh Nikpoor, Stanislav Silberberg, Suhas Pai, Sydney Zink, Tiago Timponi Torrent, Timo Schick, Tristan Thrush, Valentin Danchev, Vassilina Nikoulina, Veronika Laippala, Violette Lepercq, Vrinda Prabhu, Zaid Alyafeai, Zeerak Talat, Arun Raja, Benjamin Heinzerling, Chenglei Si, Davut Emre Taşar, Elizabeth Salesky, Sabrina J. Mielke, Wilson Y. Lee, Abheesht Sharma, Andrea Santilli, Antoine Chaffin, Arnaud Stiegler, Debajyoti Datta, Eliza Szczechla, Gunjan Chhablani, Han Wang, Harshit Pandey, Hendrik Strobelt, Jason Alan Fries, Jos Rozen, Leo Gao, Lintang Sutawika, M Saiful Bari, Maged S. Al-shaibani, Matteo Manica, Nihal Nayak, Ryan Teehan, Samuel Albanie, Sheng Shen, Srulik Ben-david, Stephen H. Bach, Taewoon Kim, Tali Bers, Thibault Fevry, Trishala Neeraj, Urmish Thakker, Vikas Raunak, Xiangru Tang, Zheng-xin Yong, Zhiqing Sun, Shaked Brody, Yallow Uri, Hadar Tojarieh, Adam Roberts, Hyung Won Chung, Jaesung Tae, Jason Phang, Ofir Press, Conglong Li, Deepak Narayanan, Hatim Bourfoune, Jared Casper, Jeff Rasley, Max Ryabinin, Mayank Mishra, Minjia Zhang, Mohammad Shoeybi, Myriam Peyrounette, Nicolas Patry, Nouamane Tazi, Omar Sanseviero, Patrick Von Platen, Pierre Cornette, Pierre François Lavallée, Rémi Lacroix, Samyam Rajbhandari, Sanchit Gandhi, Shaden Smith, Stéphane Requena, Suraj Patil, Tim Dettmers, Ahmed Baruwa, Amanpreet Singh, Anastasia Cheveleva, Anne-laure Ligozat, Arjun Subramonian, Aurélie Névéol, Charles Lovering, Dan Garrette, Deepak Tunuguntla, Ehud Reiter, Ekaterina Taktasheva, Ekaterina Voloshina, Eli Bogdanov, Genta Indra Winata, Hailey Schoelkopf, Jan-christoph Kalo, Jekaterina Novikova, Jessica Zosa Forde, Jordan Clive, Jungo Kasai, Ken Kawamura, Liam Hazan, Marine Carpuat, Miruna Clinciu, Najoung Kim, Newton Cheng, Oleg Serikov, Omer Antverg, Oskar Van Der Wal, Rui Zhang, Ruochen Zhang, Sebastian Gehrmann, Shachar Mirkin, Shani Pais, Tatiana Shavrina, Thomas Scialom, Tian Yun, Tomasz Limisiewicz, Verena Rieser, Vitaly Protasov, Vladislav Mikhailov, Yada Pruksachatkun, Yonatan Belinkov, Zachary Bamberger, Zdeněk Kasner, Alice Rueda, Amanda Pestana, Amir Feizpour, Ammar Khan, Amy Faranak, Ana Santos, Anthony Hevia, Antigona Unldreaj, Arash Aghagol, Arezoo Abdollahi, Aycha Tammour, Azadeh Hajihosseini, Bahareh Behroozi, Benjamin Ajibade, Bharat Saxena, Carlos Muñoz Ferrandis, Daniel Mcduff, Danish Contractor, David Lansky, Davis David, Douwe Kiela, Duong A. Nguyen, Edward Tan, Emi Baylor, Ezinwanne Ozoani, Fatima Mirza, Frankline Ononiwu, Habib Rezanejad, Hessie Jones, Indrani Bhattacharya, Irene Solaiman, Irina Sedenko, Isar Nejadgholi, Jesse Passmore, Josh Seltzer, Julio Bonis Sanz, Livia Dutra, Mairon Samagaio, Maraim Elbadri, Margot Mieskes, Marissa Gerchick, Martha Akinlolu, Michael Mckenna, Mike Qiu, Muhammed Ghauri, Mykola Burynok, Nafis Abrar, Nazneen Rajani, Nour Elkott, Nour Fahmy, Olanrewaju Samuel, Ran An, Rasmus Kromann, Ryan Hao, Samira Alizadeh, Sarmad Shubber, Silas Wang, Sourav Roy, Sylvain Viguier, Thanh Le, Tobi Oyebade, Trieu Le, Yoyo Yang, Zach Nguyen, Abhinav Ramesh Kashyap, Alfredo Palasciano, Alison Callahan, Anima Shukla, Antonio Miranda-escalada, Ayush Singh, Benjamin Beilharz, Bo Wang, Caio Brito, Chenxi Zhou, Chirag Jain, Chuxin Xu, Clémentine Fourrier, Daniel León Periñán, Daniel Molano, Dian Yu, Enrique Manjavacas, Fabio Barth, Florian Fuhrimann, Gabriel Altay, Giyaseddin Bayrak, Gully Burns, Helena U. Vrabec, Imane Bello, Ishani Dash, Jihyun Kang, John Giorgi, Jonas Golde, Jose David Posada, Karthik Rangasai Sivaraman, Lokesh Bulchandani, Lu Liu, Luisa Shinzato, Madeleine Hahn De Bykhovetz, Maiko Takeuchi, Marc Pàmies, Maria A Castillo, Marianna Nezhurina, Mario Sänger, Matthias Samwald, Michael Cullan, Michael Weinberg, Michiel De Wolf, Mina Mihaljcic, Minna Liu, Moritz Freidank, Myungsun Kang, Natasha Seelam, Nathan Dahlberg, Nicholas Michio Broad, Nikolaus Muellner, Pascale Fung, Patrick Haller, Ramya Chandrasekhar, Renata Eisenberg, Robert Martin, Rodrigo Canalli, Rosaline Su, Ruisi Su, Samuel Cahyawijaya, Samuele Garda, Shlok S Deshmukh, Shubhanshu Mishra, Sid Kiblawi, Simon Ott, Sinee Sang-aroonsiri, Srishti Kumar, Stefan Schweter, Sushil Bharati, Tanmay Laud, Théo Gigant, Tomoya Kainuma, Wojciech Kusa, Yanis Labrak, Yash Shailesh Bajaj, Yash Venkatraman, Yifan Xu, Yingxin Xu, Yu Xu, Zhe Tan, Zhongli Xie, Zifan Ye, Mathilde Bras, Younes Belkada, Thomas Wolf
- Inpars: Data Augmentation For Information Retrieval Using Large Language Models Bonifacio Luiz, Abonizio Hugo, Fadaee Marzieh, Nogueira Rodrigo
- Petals: Collaborative Inference And Fine-tuning Of Large Models Borzunov Alexander, Baranchuk Dmitry, Dettmers Tim, Ryabinin Max, Belkada Younes, Chumachenko Artem, Samygin Pavel, Raffel Colin
- Understanding Attention For Vision-and-language Tasks Cao Feiqi, Han Soyeon Caren, Long Siqu, Xu Changwei, Poon Josiah
- HIBRIDS: Attention With Hierarchical Biases For Structure-aware Long Document Summarization Cao Shuyang, Wang Lu
- Ernie-code: Beyond English-centric Cross-lingual Pretraining For Programming Languages Chai Yekun, Wang Shuohuan, Pang Chao, Sun Yu, Tian Hao, Wu Hua
- Analyzing Semantic Faithfulness Of Language Models Via Input Intervention On Question Answering Chaturvedi Akshay, Bhar Swarnadeep, Saha Soumadeep, Garain Utpal, Asher Nicholas
- Litevl: Efficient Video-language Learning With Enhanced Spatial-temporal Modeling Chen Dongsheng, Tao Chaofan, Hou Lu, Shang Lifeng, Jiang Xin, Liu Qun
- Controllable Text Generation With Language Constraints Chen Howard, Li Huihan, Chen Danqi, Narasimhan Karthik
- Leveraging Natural Supervision For Language Representation Learning And Generation Chen Mingda
- Murag: Multimodal Retrieval-augmented Generator For Open Question Answering Over Images And Text Chen Wenhu, Hu Hexiang, Chen Xi, Verga Pat, Cohen William W.
- Towards Improving Faithfulness In Abstractive Summarization Chen Xiuying, Li Mingzhe, Gao Xin, Zhang Xiangliang
- Curriculum: A Broad-coverage Benchmark For Linguistic Phenomena In Natural Language Understanding Chen Zeming, Gao Qiyue
- Task-aware Specialization For Efficient And Robust Dense Retrieval For Open-domain Question Answering Cheng Hao, Fang Hao, Liu Xiaodong, Gao Jianfeng
- Neural Machine Translation With Contrastive Translation Memories Cheng Xin, Gao Shen, Liu Lemao, Zhao Dongyan, Yan Rui
- Prompting GPT-3 To Be Reliable Chenglei Si, Zhe Gan, Zhengyuan Yang, Shuohang Wang, Jianfeng Wang, Jordan Boyd-graber, Lijuan Wang
- Grounding In Social Media: An Approach To Building A Chit-chat Dialogue Model Choudhary Ritvik, Kawahara Daisuke
- Language-family Adapters For Low-resource Multilingual Neural Machine Translation Chronopoulou Alexandra, Stojanovski Dario, Fraser Alexander
- On Reality And The Limits Of Language Data: Aligning Llms With Human Norms Collier Nigel H., Liu Fangyu, Shareghi Ehsan
- LSG Attention: Extrapolation Of Pretrained Transformers To Long Sequences Condevaux Charles, Harispe Sébastien
- Building The Intent Landscape Of Real-world Conversational Corpora With Extractive Question-answering Transformers Corbeil Jean-philippe, Li Mia Taige, Ghavidel Hadi Abdi
- Decoder Tuning: Efficient Language Understanding As Decoding Cui Ganqu, Li Wentao, Ding Ning, Huang Longtao, Liu Zhiyuan, Sun Maosong
- Neural Knowledge Bank For Pretrained Transformers Dai Damai, Jiang Wenbin, Dong Qingxiu, Lyu Yajuan, She Qiaoqiao, Sui Zhifang
- Enabling Multimodal Generation On CLIP Via Vision-language Knowledge Distillation Dai Wenliang, Hou Lu, Shang Lifeng, Jiang Xin, Liu Qun, Fung Pascale
- Promptagator: Few-shot Dense Retrieval From 8 Examples Dai Zhuyun, Zhao Vincent Y., Ma Ji, Luan Yi, Ni Jianmo, Lu Jing, Bakalov Anton, Guu Kelvin, Hall Keith B., Chang Ming-wei
- Understanding BLOOM: An Empirical Study On Diverse NLP Tasks Dakle Parag Pravin, Rallabandi Saikrishna, Raghavan Preethi
- PACIFIC: Towards Proactive Conversational Question Answering Over Tabular And Textual Data In Finance Deng Yang, Lei Wenqiang, Zhang Wenxuan, Lam Wai, Chua Tat-seng
- Less Is More: Linear Layers On CLIP Features As Powerful Vizwiz Model Deuser Fabian, Habel Konrad, Rösch Philipp J., Oswald Norbert
- Write And Paint: Generative Vision-language Models Are Unified Modal Learners Diao Shizhe, Zhou Wangchunshu, Zhang Xinsong, Wang Jiawei
- Parallel Attention Forcing For Machine Translation Dou Qingyun, Gales Mark
- Shortcut Learning Of Large Language Models In Natural Language Understanding Du Mengnan, He Fengxiang, Zou Na, Tao Dacheng, Hu Xia
- Codegen: An Open Large Language Model For Code With Multi-turn Program Synthesis Erik Nijkamp, Bo Pang, Hiroaki Hayashi, Lifu Tu, Huan Wang, Yingbo Zhou, Silvio Savarese, Caiming Xiong
- Leveraging Pre-trained Models For Failure Analysis Triplets Generation Ezukwoke Kenneth, Hoayek Anis, Batton-hubert Mireille, Boucher Xavier, Gounet Pascal, Adrian Jerome
- Topic-aware Response Generation In Task-oriented Dialogue With Unstructured Knowledge Access Feng Yue, Lampouras Gerasimos, Iacobacci Ignacio
- Learn To Remember: Transformer With Recurrent Memory For Document-level Machine Translation Feng Yukun, Li Feng, Song Ziang, Zheng Boyuan, Koehn Philipp
- Towards Opening The Black Box Of Neural Machine Translation: Source And Target Interpretations Of The Transformer Ferrando Javier, Gállego Gerard I., Alastruey Belen, Escolano Carlos, Costa-jussà Marta R.
- Should Attention Be All We Need? The Epistemic And Ethical Implications Of Unification In Machine Learning Fishman Nic, Hancox-li Leif
- An Empirical Study Of End-to-end Video-language Transformers With Masked Visual Modeling Fu Tsu-jui, Li Linjie, Gan Zhe, Lin Kevin, Wang William Yang, Wang Lijuan, Liu Zicheng
- Pnlp-mixer: An Efficient All-mlp Architecture For Language Fusco Francesco, Pascual Damian, Staar Peter, Antognini Diego
- Re2g: Retrieve, Rerank, Generate Glass Michael, Rossiello Gaetano, Chowdhury Md Faisal Mahbub, Naik Ankita Rajaram, Cai Pengshan, Gliozzo Alfio
- Diffuseq: Sequence To Sequence Text Generation With Diffusion Models Gong Shansan, Li Mukai, Feng Jiangtao, Wu Zhiyong, Kong Lingpeng
- Does Entity Abstraction Help Generative Transformers Reason? Gontier Nicolas, Reddy Siva, Pal Christopher
- Checks And Strategies For Enabling Code-switched Machine Translation Gowda Thamme, Gheini Mozhdeh, May Jonathan
- Optimal Transport For Unsupervised Hallucination Detection In Neural Machine Translation Guerreiro Nuno M., Colombo Pierre, Piantanida Pablo, Martins André F. T.
- Looking For A Needle In A Haystack: A Comprehensive Study Of Hallucinations In Neural Machine Translation Guerreiro Nuno M., Voita Elena, Martins André F. T.
- LVP-M3: Language-aware Visual Prompt For Multilingual Multimodal Machine Translation Guo Hongcheng, Liu Jiaheng, Huang Haoyang, Yang Jian, Li Zhoujun, Zhang Dongdong, Cui Zheng, Wei Furu
- STI: Turbocharge NLP Inference At The Edge Via Elastic Pipelining Guo Liwei, Choe Wonkyo, Lin Felix Xiaozhu
- MALM: Mixing Augmented Language Modeling For Zero-shot Machine Translation Gupta Kshitij
- Show, Don't Tell: Demonstrations Outperform Descriptions For Schema-guided Task-oriented Dialogue Gupta Raghav, Lee Harrison, Zhao Jeffrey, Rastogi Abhinav, Cao Yuan, Wu Yonghui
- Understanding HTML With Large Language Models Gur Izzeddin, Nachum Ofir, Miao Yingjie, Safdari Mustafa, Huang Austin, Chowdhery Aakanksha, Narang Sharan, Fiedel Noah, Faust Aleksandra
- FOLIO: Natural Language Reasoning With First-order Logic Han Simeng, Schoelkopf Hailey, Zhao Yilun, Qi Zhenting, Riddell Martin, Zhou Wenfei, Coady James, Peng David, Qiao Yujie, Benson Luke, Sun Lucy, Wardle-solano Alex, Szabo Hannah, Zubova Ekaterina, Burtell Matthew, Fan Jonathan, Liu Yixin, Wong Brian, Sailor Malcolm, Ni Ansong, Nan Linyong, Kasai Jungo, Yu Tao, Zhang Rui, Fabbri Alexander R., Kryscinski Wojciech, Yavuz Semih, Liu Ye, Lin Xi Victoria, Joty Shafiq, Zhou Yingbo, Xiong Caiming, Ying Rex, Cohan Arman, Radev Dragomir
- SSD-LM: Semi-autoregressive Simplex-based Diffusion Language Model For Text Generation And Modular Control Han Xiaochuang, Kumar Sachin, Tsvetkov Yulia
- Supervised Visual Attention For Simultaneous Multimodal Machine Translation Haralampieva Veneta, Caglayan Ozan, Specia Lucia
- Z-code++: A Pre-trained Language Model Optimized For Abstractive Summarization He Pengcheng, Peng Baolin, Lu Liyang, Wang Song, Mei Jie, Liu Yang, Xu Ruochen, Awadalla Hany Hassan, Shi Yu, Zhu Chenguang, Xiong Wayne, Zeng Michael, Gao Jianfeng, Huang Xuedong
- VLMAE: Vision-language Masked Autoencoder He Sunan, Guo Taian, Dai Tao, Qiao Ruizhi, Wu Chen, Shu Xiujun, Ren Bo
- TRAC: A Textual Benchmark For Reasoning About Actions And Change He Weinan, Huang Canming, Xiao Zhanhao, Liu Yongmei
- Bridging The Data Gap Between Training And Inference For Unsupervised Neural Machine Translation He Zhiwei, Wang Xing, Wang Rui, Shi Shuming, Tu Zhaopeng
- Prompt-to-prompt Image Editing With Cross Attention Control Hertz Amir, Mokady Ron, Tenenbaum Jay, Aberman Kfir, Pritch Yael, Cohen-or Daniel
- Wikiwhy: Answering And Explaining Cause-and-effect Questions Ho Matthew, Sharma Aditya, Chang Justin, Saxon Michael, Levy Sharon, Lu Yujie, Wang William Yang
- Compressing Pre-trained Transformers Via Low-bit Nxm Sparsity For Natural Language Understanding Holmes Connor, Zhang Minjia, He Yuxiong, Wu Bo
- TRUE: Re-evaluating Factual Consistency Evaluation Honovich Or, Aharoni Roee, Herzig Jonathan, Taitelbaum Hagai, Kukliansy Doron, Cohen Vered, Scialom Thomas, Szpektor Idan, Hassidim Avinatan, Matias Yossi
- Syntax-guided Localized Self-attention By Constituency Syntactic Distance Hou Shengyuan, Kai Jushi, Xue Haotian, Zhu Bingyu, Yuan Bo, Huang Longtao, Wang Xinbing, Lin Zhouhan
- XDBERT: Distilling Visual Information To BERT From Cross-modal Systems To Improve Language Understanding Hsu Chan-jan, Lee Hung-yi, Tsao Yu
- Dialogue Meaning Representation For Task-oriented Dialogue Systems Hu Xiangkun, Dai Junqi, Yan Hang, Zhang Yi, Guo Qipeng, Qiu Xipeng, Zhang Zheng
- Promptcap: Prompt-guided Task-aware Image Captioning Hu Yushi, Hua Hang, Yang Zhengyuan, Shi Weijia, Smith Noah A, Luo Jiebo
- PLANET: Dynamic Content Planning In Autoregressive Transformers For Long-form Text Generation Hu Zhe, Chan Hou Pong, Liu Jiachen, Xiao Xinyan, Wu Hua, Huang Lifu
- REVEAL: Retrieval-augmented Visual-language Pre-training With Multi-source Multimodal Knowledge Memory Hu Ziniu, Iscen Ahmet, Sun Chen, Wang Zirui, Chang Kai-wei, Sun Yizhou, Schmid Cordelia, Ross David A., Fathi Alireza
- Empowering Language Models With Knowledge Graph Reasoning For Question Answering Hu Ziniu, Xu Yichong, Yu Wenhao, Wang Shuohang, Yang Ziyi, Zhu Chenguang, Chang Kai-wei, Sun Yizhou
- Personalized Dialogue Generation With Persona-adaptive Attention Huang Qiushi, Zhang Yu, Ko Tom, Liu Xubo, Wu Bo, Wang Wenwu, Tang Lilian
- Layoutlmv3: Pre-training For Document AI With Unified Text And Image Masking Huang Yupan, Lv Tengchao, Cui Lei, Lu Yutong, Wei Furu
- Clues Before Answers: Generation-enhanced Multiple-choice QA Huang Zixian, Wu Ao, Zhou Jiaying, Gu Yu, Zhao Yue, Cheng Gong
- Uniform Complexity For Text Generation Imperial Joseph Marvin, Madabushi Harish Tayyar
- Efficient Long-text Understanding With Short-text Models Ivgi Maor, Shaham Uri, Berant Jonathan
- Survey Of Hallucination In Natural Language Generation Ji Ziwei, Lee Nayeon, Frieske Rita, Yu Tiezheng, Su Dan, Xu Yan, Ishii Etsuko, Bang Yejin, Chen Delong, Dai Wenliang, Chan Ho Shu, Madotto Andrea, Fung Pascale
- RHO (ρ): Reducing Hallucination In Open-domain Dialogues With Knowledge Grounding Ji Ziwei, Liu Zihan, Lee Nayeon, Yu Tiezheng, Wilie Bryan, Zeng Min, Fung Pascale
- Pairreranker: Pairwise Reranking For Natural Language Generation Jiang Dongfu, Lin Bill Yuchen, Ren Xiang
- Informative Language Representation Learning For Massively Multilingual Neural Machine Translation Jin Renren, Xiong Deyi
- Self-prompting Large Language Models For Zero-shot Open-domain QA Junlong Li, Jinyuan Wang, Zhuosheng Zhang, Hai Zhao
- Latency Adjustable Transformer Encoder For Language Understanding Kachuee Sajjad, Sharifkhani Mohammad
- Fie: Building A Global Probability Space By Leveraging Early Fusion In Encoder For Open-domain Question Answering Kedia Akhil, Zaidi Mohd Abbas, Lee Haejun
- What Makes Data-to-text Generation Hard For Pretrained Language Models? Keymanesh Moniba, Benton Adrian, Dredze Mark
- Decomposed Prompting: A Modular Approach For Solving Complex Tasks Khot Tushar, Trivedi Harsh, Finlayson Matthew, Fu Yao, Richardson Kyle, Clark Peter, Sabharwal Ashish
- High Recall Data-to-text Generation With Progressive Edit Kim Choonghan, Lee Gary Geunbae
- Downstream Datasets Make Surprisingly Good Pretraining Corpora Krishna Kundan, Garg Saurabh, Bigham Jeffrey P., Lipton Zachary C.
- Towards The Generation Of Musical Explanations With GPT-3 Krol Stephen James, Llano Maria Teresa, Mccormack Jon
- Few-shot Question Generation For Personalized Feedback In Intelligent Tutoring Systems Kulshreshtha Devang, Shayan Muhammad, Belfer Robert, Reddy Siva, Serban Iulian Vlad, Kochmar Ekaterina
- Mucot: Multilingual Contrastive Training For Question-answering In Low-resource Languages Kumar Gokul Karthik, Gehlot Abhishek Singh, Mullappilly Sahal Shaji, Nandakumar Karthik
- The Optimal BERT Surgeon: Scalable And Accurate Second-order Pruning For Large Language Models Kurtic Eldar, Campos Daniel, Nguyen Tuan, Frantar Elias, Kurtz Mark, Fineran Benjamin, Goin Michael, Alistarh Dan
- Language Detoxification With Attribute-discriminative Latent Space Kwak Jin Myung, Kim Minseon, Hwang Sung Ju
- Interactive Code Generation Via Test-driven User-intent Formalization Lahiri Shuvendu K., Fakhoury Sarah, Naik Aaditya, Sakkas Georgios, Chakraborty Saikat, Musuvathi Madanlal, Choudhury Piali, Von Veh Curtis, Inala Jeevana Priya, Wang Chenglong, Gao Jianfeng
- Evaluation Of Synthetic Datasets For Conversational Recommender Systems Lara Harsh, Tiwari Manoj
- Internet-augmented Language Models Through Few-shot Prompting For Open-domain Question Answering Lazaridou Angeliki, Gribovskaya Elena, Stokowiec Wojciech, Grigorev Nikolai
- Guiding Visual Question Answering With Attention Priors Le Thao Minh, Le Vuong, Gupta Sunil, Venkatesh Svetha, Tran Truyen
- Evaluating Human-language Model Interaction Lee Mina, Srivastava Megha, Hardy Amelia, Thickstun John, Durmus Esin, Paranjape Ashwin, Gerard-ursin Ines, Li Xiang Lisa, Ladhak Faisal, Rong Frieda, Wang Rose E., Kwon Minae, Park Joon Sung, Cao Hancheng, Lee Tony, Bommasani Rishi, Bernstein Michael, Liang Percy
- Littlebird: Efficient Faster & Longer Transformer For Question Answering Lee Minchul, Han Kijong, Shin Myeong Cheol
- Large Language Models With Controllable Working Memory Li Daliang, Rawat Ankit Singh, Zaheer Manzil, Wang Xin, Lukasik Michal, Veit Andreas, Yu Felix, Kumar Sanjiv
- Summer: Wechat Neural Machine Translation Systems For The WMT22 Biomedical Translation Task Li Ernan, Meng Fandong, Zhou Jie
- A Survey On Retrieval-augmented Text Generation Li Huayang, Su Yixuan, Cai Deng, Wang Yan, Liu Lemao
- Learning To Transfer Prompts For Text Generation Li Junyi, Tang Tianyi, Nie Jian-yun, Wen Ji-rong, Zhao Wayne Xin
- ELMER: A Non-autoregressive Pre-trained Language Model For Efficient And Effective Text Generation Li Junyi, Tang Tianyi, Zhao Wayne Xin, Nie Jian-yun, Wen Ji-rong
- LAVENDER: Unifying Video-language Understanding As Masked Language Modeling Li Linjie, Gan Zhe, Lin Kevin, Lin Chung-ching, Liu Zicheng, Liu Ce, Wang Lijuan
- Task-specific Pre-training And Prompt Decomposition For Knowledge Graph Population With Language Models Li Tianyi, Huang Wenyu, Papasarantopoulos Nikos, Vougiouklis Pavlos, Pan Jeff Z.
- Evade The Trap Of Mediocrity: Promoting Diversity And Novelty In Text Generation Via Concentrating Attention Li Wenhao, Yi Xiaoyuan, Hu Jinyi, Sun Maosong, Xie Xing
- P-transformer: Towards Better Document-to-document Neural Machine Translation Li Yachao, Li Junhui, Jiang Jing, Tao Shimin, Yang Hao, Zhang Min
- Uni-eden: Universal Encoder-decoder Network By Multi-granular Vision-language Pre-training Li Yehao, Fan Jiahao, Pan Yingwei, Yao Ting, Lin Weiyao, Mei Tao
- VALHALLA: Visual Hallucination For Machine Translation Li Yi, Panda Rameswar, Kim Yoon, Chen Chun-fu, Feris Rogerio, Cox David, Vasconcelos Nuno
- Clinical-longformer And Clinical-bigbird: Transformers For Long Clinical Sequences Li Yikuan, Wehbe Ramsey M., Ahmad Faraz S., Wang Hanyin, Luo Yuan
- DQ-BART: Efficient Sequence-to-sequence Model Via Joint Distillation And Quantization Li Zheng, Wang Zijian, Tan Ming, Nallapati Ramesh, Bhatia Parminder, Arnold Andrew, Xiang Bing, Roth Dan
- Simple And Effective Gradient-based Tuning Of Sequence-to-sequence Models Lichtarge Jared, Alberti Chris, Kumar Shankar
- Delving Deeper Into Cross-lingual Visual Question Answering Liu Chen, Pfeiffer Jonas, Korhonen Anna, Vulić Ivan, Gurevych Iryna
- Rainier: Reinforced Knowledge Introspector For Commonsense Question Answering Liu Jiacheng, Hallinan Skyler, Lu Ximing, He Pengfei, Welleck Sean, Hajishirzi Hannaneh, Choi Yejin
- Relational Memory Augmented Language Models Liu Qi, Yogatama Dani, Blunsom Phil
- 3DALL-E: Integrating Text-to-image AI In 3D Design Workflows Liu Vivian, Vermeulen Jo, Fitzmaurice George, Matejka Justin
- PSP: Pre-trained Soft Prompts For Few-shot Abstractive Summarization Liu Xiaochen, Gao Yang, Bai Yu, Li Jiawei, Hu Yinan, Huang Heyan, Chen Boxing
- Dialogconv: A Lightweight Fully Convolutional Network For Multi-view Response Selection Liu Yongkang, Feng Shi, Gao Wei, Wang Daling, Zhang Yifei
- Declaration-based Prompt Tuning For Visual Question Answering Liu Yuhang, Wei Wei, Peng Daowan, Zhu Feida
- Relaxed Attention For Transformer Models Lohrenz Timo, Möller Björn, Li Zhengyang, Fingscheidt Tim
- Learn To Explain: Multimodal Reasoning Via Thought Chains For Science Question Answering Lu Pan, Mishra Swaroop, Xia Tony, Qiu Liang, Chang Kai-wei, Zhu Song-chun, Tafjord Oyvind, Clark Peter, Kalyan Ashwin
- Imagination-augmented Natural Language Understanding Lu Yujie, Zhu Wanrong, Wang Xin Eric, Eckstein Miguel, Wang William Yang
- Choose Your QA Model Wisely: A Systematic Study Of Generative And Extractive Readers For Question Answering Luo Man, Hashimoto Kazuma, Yavuz Semih, Liu Zhiwei, Baral Chitta, Zhou Yingbo
- Biogpt: Generative Pre-trained Transformer For Biomedical Text Generation And Mining Luo Renqian, Sun Liai, Xia Yingce, Qin Tao, Zhang Sheng, Poon Hoifung, Liu Tie-yan
- A Frustratingly Simple Approach For End-to-end Image Captioning Luo Ziyang, Xi Yadong, Zhang Rongsheng, Ma Jing
- Torchscale: Transformers At Scale Ma Shuming, Wang Hongyu, Huang Shaohan, Wang Wenhui, Chi Zewen, Dong Li, Benhaim Alon, Patra Barun, Chaudhary Vishrav, Song Xia, Wei Furu
- Text Generation With Text-editing Models Malmi Eric, Dong Yue, Mallinson Jonathan, Chuklin Aleksandr, Adamek Jakub, Mirylenka Daniil, Stahlberg Felix, Krause Sebastian, Kumar Shankar, Severyn Aliaksei
- DIALOG-22 Ruatd Generated Text Detection Maloyan Narek, Nutfullin Bulat, Ilyushin Eugene
- Quick Starting Dialog Systems With Paraphrase Generation Marceau Louis, Belbahar Raouf, Queudot Marc, Naji Nada, Charton Eric, Meurs Marie-jean
- Generating Training Data With Language Models: Towards Zero-shot Language Understanding Meng Yu, Huang Jiaxin, Zhang Yu, Han Jiawei
- SKILL: Structured Knowledge Infusion For Large Language Models Moiseev Fedor, Dong Zhe, Alfonseca Enrique, Jaggi Martin
- Domain-specific Text Generation For Machine Translation Moslem Yasmin, Haque Rejwanul, Kelleher John D., Way Andy
- Task Formulation Matters When Learning Continually: A Case Study In Visual Question Answering Nikandrou Mavina, Yu Lu, Suglia Alessandro, Konstas Ioannis, Rieser Verena
- Budgetlongformer: Can We Cheaply Pretrain A Sota Legal Language Model From Scratch? Niklaus Joel, Giofré Daniele
- No Language Left Behind: Scaling Human-centered Machine Translation Nllb Team, Marta R. Costa-jussà, James Cross, Onur Çelebi, Maha Elbayad, Kenneth Heafield, Kevin Heffernan, Elahe Kalbassi, Janice Lam, Daniel Licht, Jean Maillard, Anna Sun, Skyler Wang, Guillaume Wenzek, Al Youngblood, Bapi Akula, Loic Barrault, Gabriel Mejia Gonzalez, Prangthip Hansanti, John Hoffman, Semarley Jarrett, Kaushik Ram Sadagopan, Dirk Rowe, Shannon Spruit, Chau Tran, Pierre Andrews, Necip Fazil Ayan, Shruti Bhosale, Sergey Edunov, Angela Fan, Cynthia Gao, Vedanuj Goswami, Francisco Guzmán, Philipp Koehn, Alexandre Mourachko, Christophe Ropers, Safiyyah Saleem, Holger Schwenk, Jeff Wang
- The Turing Deception Noever David, Ciolino Matt
- Improving The Cross-lingual Generalisation In Visual Question Answering Nooralahzadeh Farhad, Sennrich Rico
- Minialbert: Model Distillation Via Parameter-efficient Recursive Transformers Nouriborji Mohammadmahdi, Rohanian Omid, Kouchaki Samaneh, Clifton David A.
- Measuring And Narrowing The Compositionality Gap In Language Models Ofir Press, Muru Zhang, Sewon Min, Ludwig Schmidt, Noah A. Smith, Mike Lewis
- Neural Language Taskonomy: Which NLP Tasks Are The Most Predictive Of Fmri Brain Activity? Oota Subba Reddy, Arora Jashn, Agarwal Veeral, Marreddy Mounika, Gupta Manish, Surampudi Bapi Raju
- Parameter-efficient Abstractive Question Answering Over Tables Or Text Pal Vaishali, Kanoulas Evangelos, De Rijke Maarten
- Follow-up Attention: An Empirical Study Of Developer And Neural Model Code Exploration Paltenghi Matteo, Pandita Rahul, Henley Austin Z., Ziegler Albert
- Relation-aware Language-graph Transformer For Question Answering Park Jinyoung, Choi Hyeong Kyu, Ko Juyeon, Park Hyeonjin, Kim Ji-hoon, Jeong Jisu, Kim Kyungmin, Kim Hyunwoo J.
- Bidirectional Language Models Are Also Few-shot Learners Patel Ajay, Li Bryan, Rasooli Mohammad Sadegh, Constant Noah, Raffel Colin, Callison-burch Chris
- Beyond English-centric Bitexts For Better Multilingual Language Representation Learning Patra Barun, Singhal Saksham, Huang Shaohan, Chi Zewen, Dong Li, Wei Furu, Chaudhary Vishrav, Song Xia
- Xfboost: Improving Text Generation With Controllable Decoders Peng Xiangyu, Sollami Michael
- Holistic Evaluation Of Language Models Percy Liang, Rishi Bommasani, Tony Lee, Dimitris Tsipras, Dilara Soylu, Michihiro Yasunaga, Yian Zhang, Deepak Narayanan, Yuhuai Wu, Ananya Kumar, Benjamin Newman, Binhang Yuan, Bobby Yan, Ce Zhang, Christian Cosgrove, Christopher D. Manning, Christopher Ré, Diana Acosta-navas, Drew A. Hudson, Eric Zelikman, Esin Durmus, Faisal Ladhak, Frieda Rong, Hongyu Ren, Huaxiu Yao, Jue Wang, Keshav Santhanam, Laurel Orr, Lucia Zheng, Mert Yuksekgonul, Mirac Suzgun, Nathan Kim, Neel Guha, Niladri Chatterji, Omar Khattab, Peter Henderson, Qian Huang, Ryan Chi, Sang Michael Xie, Shibani Santurkar, Surya Ganguli, Tatsunori Hashimoto, Thomas Icard, Tianyi Zhang, Vishrav Chaudhary, William Wang, Xuechen Li, Yifan Mai, Yuhui Zhang, Yuta Koreeda
- Ignore Previous Prompt: Attack Techniques For Language Models Perez Fábio, Ribeiro Ian
- Lifting The Curse Of Multilinguality By Pre-training Modular Transformers Pfeiffer Jonas, Goyal Naman, Lin Xi Victoria, Li Xian, Cross James, Riedel Sebastian, Artetxe Mikel
- Vit5: Pretrained Text-to-text Transformer For Vietnamese Language Generation Phan Long, Tran Hieu, Nguyen Hieu, Trinh Trieu H.
- Hypertuning: Toward Adapting Large Language Models Without Back-propagation Phang Jason, Mao Yi, He Pengcheng, Chen Weizhu
- Investigating Efficiently Extending Transformers For Long Input Summarization Phang Jason, Zhao Yao, Liu Peter J.
- Pre-training Image-language Transformers For Open-vocabulary Tasks Piergiovanni Aj, Kuo Weicheng, Angelova Anelia
- Answer-me: Multi-task Open-vocabulary Visual Question Answering Piergiovanni Aj, Li Wei, Kuo Weicheng, Saffar Mohammad, Bertsch Fred, Angelova Anelia
- Synchromesh: Reliable Code Generation From Pre-trained Language Models Poesia Gabriel, Polozov Oleksandr, Le Vu, Tiwari Ashish, Soares Gustavo, Meek Christopher, Gulwani Sumit
- A Self-paced Mixed Distillation Method For Non-autoregressive Generation Qi Weizhen, Gong Yeyun, Shen Yelong, Jiao Jian, Yan Yu, Li Houqiang, Zhang Ruofei, Chen Weizhu, Duan Nan
- Diffusion Glancing Transformer For Parallel Sequence To Sequence Learning Qian Lihua, Wang Mingxuan, Liu Yang, Zhou Hao
- Parallel Context Windows For Large Language Models Ratner Nir, Levine Yoav, Belinkov Yonatan, Ram Ori, Magar Inbal, Abend Omri, Karpas Ehud, Shashua Amnon, Leyton-brown Kevin, Shoham Yoav
- VLC-BERT: Visual Question Answering With Contextualized Commonsense Knowledge Ravi Sahithya, Chinchure Aditya, Sigal Leonid, Liao Renjie, Shwartz Vered
- Entity-conditioned Question Generation For Robust Attention Distribution In Neural Information Retrieval Reddy Revanth Gangi, Sultan Md Arafat, Franz Martin, Sil Avirup, Ji Heng
- Super-prompting: Utilizing Model-independent Contextual Data To Reduce Data Annotation Required In Visual Commonsense Tasks Rezaei Navid, Reformat Marek Z.
- Breakpoint Transformers For Modeling And Tracking Intermediate Beliefs Richardson Kyle, Tamari Ronen, Sultan Oren, Tsarfaty Reut, Shahaf Dafna, Sabharwal Ashish
- Additive Interventions Yield Robust Multi-domain Machine Translation Models Rippeth Elijah, Post Matt
- Leveraging Large Language Models For Multiple Choice Question Answering Robinson Joshua, Rytting Christopher Michael, Wingate David
- Lamda: Language Models For Dialog Applications Romal Thoppilan, Daniel De Freitas, Jamie Hall, Noam Shazeer, Apoorv Kulshreshtha, Heng-tze Cheng, Alicia Jin, Taylor Bos, Leslie Baker, Yu Du, Yaguang Li, Hongrae Lee, Huaixiu Steven Zheng, Amin Ghafouri, Marcelo Menegali, Yanping Huang, Maxim Krikun, Dmitry Lepikhin, James Qin, Dehao Chen, Yuanzhong Xu, Zhifeng Chen, Adam Roberts, Maarten Bosma, Vincent Zhao, Yanqi Zhou, Chung-ching Chang, Igor Krivokon, Will Rusch, Marc Pickett, Pranesh Srinivasan, Laichee Man, Kathleen Meier-hellstern, Meredith Ringel Morris, Tulsee Doshi, Renelito Delos Santos, Toju Duke, Johnny Soraker, Ben Zevenbergen, Vinodkumar Prabhakaran, Mark Diaz, Ben Hutchinson, Kristen Olson, Alejandra Molina, Erin Hoffman-john, Josh Lee, Lora Aroyo, Ravi Rajakumar, Alena Butryna, Matthew Lamm, Viktoriya Kuzmina, Joe Fenton, Aaron Cohen, Rachel Bernstein, Ray Kurzweil, Blaise Aguera-arcas, Claire Cui, Marian Croak, Ed Chi, Quoc Le
- Dialokg: Knowledge-structure Aware Task-oriented Dialogue Generation Rony Md Rashad Al Hasan, Usbeck Ricardo, Lehmann Jens
- Billions Of Parameters Are Worth More Than In-domain Training Data: A Case Study In The Legal Case Entailment Task Rosa Guilherme Moraes, Bonifacio Luiz, Jeronymo Vitor, Abonizio Hugo, Lotufo Roberto, Nogueira Rodrigo
- LINGUIST: Language Model Instruction Tuning To Generate Annotated Utterances For Intent Classification And Slot Tagging Rosenbaum Andy, Soltan Saleh, Hamza Wael, Versley Yannick, Boese Markus
- Alexatm 20B: Few-shot Learning Using A Large-scale Multilingual Seq2seq Model Saleh Soltan, Shankar Ananthakrishnan, Jack Fitzgerald, Rahul Gupta, Wael Hamza, Haidar Khan, Charith Peris, Stephen Rawls, Andy Rosenbaum, Anna Rumshisky, Chandana Satya Prakash, Mukund Sridhar, Fabian Triefenbach, Apurv Verma, Gokhan Tur, Prem Natarajan
- IT5: Text-to-text Pretraining For Italian Language Understanding And Generation Sarti Gabriele, Nissim Malvina
- Training Language Models With Language Feedback Scheurer Jérémy, Campos Jon Ander, Chan Jun Shern, Chen Angelica, Cho Kyunghyun, Perez Ethan
- Non-autoregressive Neural Machine Translation: A Call For Clarity Schmidt Robin M., Pires Telmo, Peitz Stephan, Lööf Jonas
- Confident Adaptive Language Modeling Schuster Tal, Fisch Adam, Gupta Jai, Dehghani Mostafa, Bahri Dara, Tran Vinh Q., Tay Yi, Metzler Donald
- Bird-eye Transformers For Text Generation Models Sha Lei, Song Yuhang, Yordanov Yordan, Salvatori Tommaso, Lukasiewicz Thomas
- Overcoming Catastrophic Forgetting Beyond Continual Learning: Balanced Training For Neural Machine Translation Shao Chenze, Feng Yang
- Incorporating Domain Knowledge Through Task Augmentation For Front-end Javascript Code Generation Shen Sijie, Zhu Xiang, Dong Yihong, Guo Qizhi, Zhen Yankun, Li Ge
- Ebms Vs. CL: Exploring Self-supervised Visual Pretraining For Visual Question Answering Shevchenko Violetta, Abbasnejad Ehsan, Dick Anthony, Hengel Anton Van Den, Teney Damien
- React: Synergizing Reasoning And Acting In Language Models Shunyu Yao, Jeffrey Zhao, Dian Yu, Nan Du, Izhak Shafran, Karthik Narasimhan, Yuan Cao
- Reasoning With Language Model Prompting: A Survey Shuofei Qiao, Yixin Ou, Ningyu Zhang, Xiang Chen, Yunzhi Yao, Shumin Deng, Chuanqi Tan, Fei Huang, Huajun Chen
- Towards The Next 1000 Languages In Multilingual Machine Translation: Exploring The Synergy Between Supervised And Self-supervised Learning Siddhant Aditya, Bapna Ankur, Firat Orhan, Cao Yuan, Chen Mia Xu, Caswell Isaac, Garcia Xavier
- On The Efficacy Of Co-attention Transformer Layers In Visual Question Answering Sikarwar Ankur, Kreiman Gabriel
- Large Language Models Encode Clinical Knowledge Singhal Karan, Azizi Shekoofeh, Tu Tao, Mahdavi S. Sara, Wei Jason, Chung Hyung Won, Scales Nathan, Tanwani Ajay, Cole-lewis Heather, Pfohl Stephen, Payne Perry, Seneviratne Martin, Gamble Paul, Kelly Chris, Scharli Nathaneal, Chowdhery Aakanksha, Mansfield Philip, Arcas Blaise Aguera Y, Webster Dale, Corrado Greg S., Matias Yossi, Chou Katherine, Gottweis Juraj, Tomasev Nenad, Liu Yun, Rajkomar Alvin, Barral Joelle, Semturs Christopher, Karthikesalingam Alan, Natarajan Vivek
- Improving The Domain Adaptation Of Retrieval Augmented Generation (RAG) Models For Open Domain Question Answering Siriwardhana Shamane, Weerasekera Rivindu, Wen Elliott, Kaluarachchi Tharindu, Rana Rajib, Nanayakkara Suranga
- CLIP Models Are Few-shot Learners: Empirical Studies On VQA And Visual Entailment Song Haoyu, Dong Li, Zhang Wei-nan, Liu Ting, Wei Furu
- Towards Zero-shot And Few-shot Table Question Answering Using GPT-3 Srivastava Pragya, Ganu Tanuja, Guha Saikat
- Conciseness: An Overlooked Language Task Stahlberg Felix, Kumar Aashish, Alberti Chris, Kumar Shankar
- QA4QG: Using Question Answering To Constrain Multi-hop Question Generation Su Dan, Xu Peng, Fung Pascale
- Investigating Explainability Of Generative AI For Code Through Scenario-based Design Sun Jiao, Liao Q. Vera, Muller Michael, Agarwal Mayank, Houde Stephanie, Talamadupula Kartik, Weisz Justin D.
- Tokenization Consistency Matters For Generative Models On Extractive NLP Tasks Sun Kaiser, Qi Peng, Zhang Yuhao, Liu Lan, Wang William Yang, Huang Zhiheng
- Metaphorical User Simulators For Evaluating Task-oriented Dialogue Systems Sun Weiwei, Guo Shuyu, Zhang Shuo, Ren Pengjie, Chen Zhumin, De Rijke Maarten, Ren Zhaochun
- Dynamar: Dynamic Prompt With Mask Token Representation Sun Xiaodi, Rajagopalan Sunny, Nigam Priyanka, Lu Weiyi, Xu Yi, Zeng Belinda, Chilimbi Trishul
- Recitation-augmented Language Models Sun Zhiqing, Wang Xuezhi, Tay Yi, Yang Yiming, Zhou Denny
- Pseudo-ood Training For Robust Language Models Sundararaman Dhanasekar, Mehta Nikhil, Carin Lawrence
- News Summarization And Evaluation In The Era Of GPT-3 Tanya Goyal, Junyi Jessy Li, Greg Durrett
- Benchmarking Large Language Models For Automated Verilog RTL Code Generation Thakur Shailja, Ahmad Baleegh, Fan Zhenxing, Pearce Hammond, Tan Benjamin, Karri Ramesh, Dolan-gavitt Brendan, Garg Siddharth
- Interleaving Retrieval With Chain-of-thought Reasoning For Knowledge-intensive Multi-step Questions Trivedi Harsh, Balasubramanian Niranjan, Khot Tushar, Sabharwal Ashish
- Adavae: Exploring Adaptive Gpt-2s In Variational Auto-encoders For Language Modeling Tu Haoqin, Yang Zhongliang, Yang Jinshuai, Huang Yongfeng
- Inferring Implicit Relations In Complex Questions With Language Models Uri Katz, Mor Geva, Jonathan Berant
- Towards Reasoning-aware Explainable VQA Vaideeswaran Rakesh, Gao Feng, Mathur Abhinav, Thattai Govind
- VLSP 2021 - Vimrc Challenge: Vietnamese Machine Reading Comprehension Van Nguyen Kiet, Tran Son Quoc, Nguyen Luan Thanh, Van Huynh Tin, Luu Son T., Nguyen Ngan Luu-thuy
- Prompting Palm For Translation: Assessing Strategies And Performance Vilar David, Freitag Markus, Cherry Colin, Luo Jiaming, Ratnakar Viresh, Foster George
- How Large Language Models Are Transforming Machine-paraphrased Plagiarism Wahle Jan Philip, Ruas Terry, Kirstein Frederic, Gipp Bela
- What Do They Capture? -- A Structural Analysis Of Pre-trained Language Models For Source Code Wan Yao, Zhao Wei, Zhang Hongyu, Sui Yulei, Xu Guandong, Jin Hai
- Unite: Unified Translation Evaluation Wan Yu, Liu Dayiheng, Yang Baosong, Zhang Haibo, Chen Boxing, Wong Derek F., Chao Lidia S.
- Can Language Models Make Fun? A Case Study In Chinese Comical Crosstalk Wang Benyou, Wu Xiangbo, Liu Xiaokang, Li Jianquan, Tiwari Prayag, Xie Qianqian
- STRUDEL: Structured Dialogue Summarization For Dialogue Comprehension Wang Borui, Feng Chengcheng, Nair Arjun, Mao Madelyn, Desai Jai, Celikyilmaz Asli, Li Haoran, Mehdad Yashar, Radev Dragomir
- Knowledge Prompting In Pre-trained Language Model For Natural Language Understanding Wang Jianing, Huang Wenkang, Shi Qiuhui, Wang Hongbin, Qiu Minghui, Li Xiang, Gao Ming
- Hybrid-regressive Neural Machine Translation Wang Qiang, Hu Xinhui, Chen Ming
- Image As A Foreign Language: Beit Pretraining For All Vision And Vision-language Tasks Wang Wenhui, Bao Hangbo, Dong Li, Bjorck Johan, Peng Zhiliang, Liu Qiang, Aggarwal Kriti, Mohammed Owais Khan, Singhal Saksham, Som Subhojit, Wei Furu
- Understanding And Improving Sequence-to-sequence Pretraining For Neural Machine Translation Wang Wenxuan, Jiao Wenxiang, Hao Yongchang, Wang Xing, Shi Shuming, Tu Zhaopeng, Lyu Michael
- Elaboration-generating Commonsense Question Answering At Scale Wang Wenya, Srikumar Vivek, Hajishirzi Hanna, Smith Noah A.
- Finding Skill Neurons In Pre-trained Transformer-based Language Models Wang Xiaozhi, Wen Kaiyue, Zhang Zhengyan, Hou Lei, Liu Zhiyuan, Li Juanzi
- Compilable Neural Code Generation With Compiler Feedback Wang Xin, Wang Yasheng, Wan Yao, Mi Fei, Li Yitong, Zhou Pingyi, Liu Jin, Wu Hao, Jiang Xin, Liu Qun
- Code4struct: Code Generation For Few-shot Event Structure Prediction Wang Xingyao, Li Sha, Ji Heng
- Controlling Styles In Neural Machine Translation With Activation Prompt Wang Yifan, Sun Zewei, Cheng Shanbo, Zheng Weiguo, Wang Mingxuan
- Two-stage LLM Fine-tuning With Less Specialization And More Generalization Wang Yihan, Si Si, Li Daliang, Lukasik Michal, Yu Felix, Hsieh Cho-jui, Dhillon Inderjit S, Kumar Sanjiv
- Pan More Gold From The Sand: Refining Open-domain Dialogue Training With Noisy Self-retrieval Generation Wang Yihe, Li Yitong, Wang Yasheng, Mi Fei, Zhou Pingyi, Wang Xin, Liu Jin, Jiang Xin, Liu Qun
- Multimodal Adaptive Distillation For Leveraging Unimodal Encoders For Vision-language Tasks Wang Zhecan, Codella Noel, Chen Yen-chun, Zhou Luowei, Dai Xiyang, Xiao Bin, Yang Jianwei, You Haoxuan, Chang Kai-wei, Chang Shih-fu, Yuan Lu
- Language Models With Image Descriptors Are Strong Few-shot Video-language Learners Wang Zhenhailong, Li Manling, Xu Ruochen, Zhou Luowei, Lei Jie, Lin Xudong, Wang Shuohang, Yang Ziyi, Zhu Chenguang, Hoiem Derek, Chang Shih-fu, Bansal Mohit, Ji Heng
- CREATER: Ctr-driven Advertising Text Generation With Controlled Pre-training And Contrastive Fine-tuning Wei Penghui, Yang Xuanhua, Liu Shaoguo, Wang Liang, Zheng Bo
- Probing Of Quantitative Values In Abstractive Summarization Models White Nathan M.
- FRSUM: Towards Faithful Abstractive Summarization Via Enhancing Factual Robustness Wu Wenhao, Li Wei, Liu Jiachen, Xiao Xinyan, Cao Ziqiang, Li Sujian, Wu Hua
- Precisely The Point: Adversarial Augmentations For Faithful And Informative Text Generation Wu Wenhao, Li Wei, Liu Jiachen, Xiao Xinyan, Li Sujian, Lyu Yajuan
- An Efficient Memory-augmented Transformer For Knowledge-intensive NLP Tasks Wu Yuxiang, Zhao Yu, Hu Baotian, Minervini Pasquale, Stenetorp Pontus, Riedel Sebastian
- Modeling Context With Linear Attention For Scalable Document-level Translation Wu Zhaofeng, Peng Hao, Pappas Nikolaos, Smith Noah A.
- The Unreliability Of Explanations In Few-shot Prompting For Textual Reasoning Xi Ye, Greg Durrett
- Autoqgs: Auto-prompt For Low-resource Knowledge-based Question Generation From SPARQL Xiong Guanming, Bao Junwei, Zhao Wen, Wu Youzheng, He Xiaodong
- MGA-VQA: Multi-granularity Alignment For Visual Question Answering Xiong Peixi, Shen Yilin, Jin Hongxia
- Adapting Pretrained Text-to-text Models For Long Text Sequences Xiong Wenhan, Gupta Anchit, Toshniwal Shubham, Mehdad Yashar, Yih Wen-tau
- Integrating Translation Memories Into Non-autoregressive Machine Translation Xu Jitao, Crego Josep, Yvon François
- SESCORE2: Learning Text Generation Evaluation Via Synthesizing Realistic Mistakes Xu Wenda, Qian Xian, Wang Mingxuan, Li Lei, Wang William Yang
- Rationale-augmented Ensembles In Language Models Xuezhi Wang, Jason Wei, Dale Schuurmans, Quoc Le, Ed Chi, Denny Zhou
- Probing Causes Of Hallucinations In Neural Machine Translations Yan Jianhao, Meng Fandong, Zhou Jie
- Generative Negative Text Replay For Continual Vision-language Pretraining Yan Shipeng, Hong Lanqing, Xu Hang, Han Jianhua, Tuytelaars Tinne, Li Zhenguo, He Xuming
- Learning To Answer Visual Questions From Web Videos Yang Antoine, Miech Antoine, Sivic Josef, Laptev Ivan, Schmid Cordelia
- Zero-shot Video Question Answering Via Frozen Bidirectional Language Models Yang Antoine, Miech Antoine, Sivic Josef, Laptev Ivan, Schmid Cordelia
- GLUE-X: Evaluating Natural Language Understanding Models From An Out-of-distribution Generalization Perspective Yang Linyi, Zhang Shuibai, Qin Libo, Li Yafu, Wang Yidong, Liu Hanmeng, Wang Jindong, Xie Xing, Zhang Yue
- Building Knowledge-grounded Dialogue Systems With Graph-based Semantic Modeling Yang Yizhe, Huang Heyan, Gao Yang, Li Jiawei
- Prompt Tuning For Discriminative Pre-trained Language Models Yao Yuan, Dong Bowen, Zhang Ao, Zhang Zhengyan, Xie Ruobing, Liu Zhiyuan, Lin Leyu, Sun Maosong, Wang Jianyong
- Random-ltd: Random And Layerwise Token Dropping Brings Efficient Training For Large-scale Transformers Yao Zhewei, Wu Xiaoxia, Li Conglong, Holmes Connor, Zhang Minjia, Li Cheng, He Yuxiong
- Language Models Are General-purpose Interfaces Yaru Hao, Haoyu Song, Li Dong, Shaohan Huang, Zewen Chi, Wenhui Wang, Shuming Ma, Furu Wei
- Retrieval-augmented Multimodal Language Modeling Yasunaga Michihiro, Aghajanyan Armen, Shi Weijia, James Rich, Leskovec Jure, Liang Percy, Lewis Mike, Zettlemoyer Luke, Yih Wen-tau
- Modeling Multi-hop Question Answering As Single Sequence Prediction Yavuz Semih, Hashimoto Kazuma, Zhou Yingbo, Keskar Nitish Shirish, Xiong Caiming
- UL2: Unifying Language Learning Paradigms Yi Tay, Mostafa Dehghani, Vinh Q. Tran, Xavier Garcia, Jason Wei, Xuezhi Wang, Hyung Won Chung, Siamak Shakeri, Dara Bahri, Tal Schuster, Huaixiu Steven Zheng, Denny Zhou, Neil Houlsby, Donald Metzler
- Categorizing Semantic Representations For Neural Machine Translation Yin Yongjing, Li Yafu, Meng Fandong, Zhou Jie, Zhang Yue
- Scaling Autoregressive Models For Content-rich Text-to-image Generation Yu Jiahui, Xu Yuanzhong, Koh Jing Yu, Luong Thang, Baid Gunjan, Wang Zirui, Vasudevan Vijay, Ku Alexander, Yang Yinfei, Ayan Burcu Karagol, Hutchinson Ben, Han Wei, Parekh Zarana, Li Xin, Zhang Han, Baldridge Jason, Wu Yonghui
- Bilaterally Slimmable Transformer For Elastic And Efficient Visual Question Answering Yu Zhou, Jin Zitian, Yu Jun, Xu Mingliang, Wang Hongbo, Fan Jianping
- Wudaomm: A Large-scale Multi-modal Dataset For Pre-training Models Yuan Sha, Zhao Shuai, Leng Jiahong, Xue Zhao, Zhao Hanyu, Liu Peiyu, Gong Zheng, Zhao Wayne Xin, Li Junyi, Tang Jie
- Competition-level Code Generation With Alphacode Yujia Li, David Choi, Junyoung Chung, Nate Kushman, Julian Schrittwieser, Rémi Leblond, Tom Eccles, James Keeling, Felix Gimeno, Agustin Dal Lago, Thomas Hubert, Peter Choy, Cyprien De Masson D'autume, Igor Babuschkin, Xinyun Chen, Po-sen Huang, Johannes Welbl, Sven Gowal, Alexey Cherepanov, James Molloy, Daniel J. Mankowitz, Esme Sutherland Robson, Pushmeet Kohli, Nando De Freitas, Koray Kavukcuoglu, Oriol Vinyals
- Bridging Cross-lingual Gaps During Leveraging The Multilingual Sequence-to-sequence Pretraining For Text Generation And Understanding Zan Changtong, Ding Liang, Shen Li, Cao Yu, Liu Weifeng, Tao Dacheng
- Dyrex: Dynamic Query Representation For Extractive Question Answering Zaratiana Urchade, Khbir Niama El, Núñez Dennis, Holat Pierre, Tomeh Nadi, Charnois Thierry
- Scala: Accelerating Adaptation Of Pre-trained Transformer-based Language Models Via Efficient Large-batch Adversarial Noise Zhang Minjia, Naresh Niranjan Uma, He Yuxiong
- SMDT: Selective Memory-augmented Neural Document Translation Zhang Xu, Yang Jian, Huang Haoyang, Ma Shuming, Zhang Dongdong, Li Jinlong, Wei Furu
- BLISS: Robust Sequence-to-sequence Learning Via Self-supervised Input Representation Zhang Zheng, Ding Liang, Cheng Dazhao, Liu Xuebo, Zhang Min, Tao Dacheng
- Can Offline Reinforcement Learning Help Natural Language Understanding? Zhang Ziqi, Wang Yile, Zhang Yue, Wang Donglin
- Attend, Memorize And Generate: Towards Faithful Table-to-text Generation In Few Shots Zhao Wenting, Liu Ye, Wan Yao, Yu Philip S.
- Tacube: Pre-computing Data Cubes For Answering Numerical-reasoning Questions Over Tabular Data Zhou Fan, Hu Mengkang, Dong Haoyu, Cheng Zhoujun, Han Shi, Zhang Dongmei
- Syntax-informed Question Answering With Heterogeneous Graph Transformer Zhu Fangyi, Tan Lok You, Ng See-kiong, Bressan Stéphane
- KPT: Keyword-guided Pre-training For Grounded Dialog Generation Zhu Qi, Mi Fei, Zhang Zheng, Wang Yasheng, Li Yitong, Jiang Xin, Liu Qun, Zhu Xiaoyan, Huang Minlie
- Exploring And Evaluating Personalized Models For Code Generation Zlotchevski Andrei, Drain Dawn, Svyatkovskiy Alexey, Clement Colin, Sundaresan Neel, Tufano Michele
- St-moe: Designing Stable And Transferable Sparse Expert Models Zoph Barret, Bello Irwan, Kumar Sameer, Du Nan, Huang Yanping, Dean Jeff, Shazeer Noam, Fedus William
- Efficient Long Sequence Modeling Via State Space Augmented Transformer Zuo Simiao, Liu Xiaodong, Jiao Jian, Charles Denis, Manavoglu Eren, Zhao Tuo, Gao Jianfeng
- Generator-retriever-generator Approach For Open-domain Question Answering Abdallah Abdelrahman, Jatowt Adam
- Attention Over Pre-trained Sentence Embeddings For Long Document Classification Abdaoui Amine, Dutta Sourav
- Inpars Toolkit: A Unified And Reproducible Synthetic Data Generation Pipeline For Neural Information Retrieval Abonizio Hugo, Bonifacio Luiz, Jeronymo Vitor, Lotufo Roberto, Zavrel Jakub, Nogueira Rodrigo
- The ADAIO System At The BEA-2023 Shared Task On Generating AI Teacher Responses In Educational Dialogues Adigwe Adaeze, Yuan Zheng
- KITLM: Domain-specific Knowledge Integration Into Language Models For Question Answering Agarwal Ankush, Gawade Sakharam, Azad Amar Prakash, Bhattacharyya Pushpak
- Bring Your Own KG: Self-supervised Program Synthesis For Zero-shot KGQA Agarwal Dhruv, Das Rajarshi, Khosla Sopan, Gangadharaiah Rashmi
- Let's Sample Step By Step: Adaptive-consistency For Efficient Reasoning And Coding With Llms Aggarwal Pranjal, Madaan Aman, Yang Yiming, Mausam
- Simul-llm: A Framework For Exploring High-quality Simultaneous Translation With Large Language Models Agostinelli Victor, Wild Max, Raffel Matthew, Fuad Kazi Ahmed Asif, Chen Lizhong
- Performance Trade-offs Of Watermarking Large Language Models Ajith Anirudh, Singh Sameer, Pruthi Danish
- CESAR: Automatic Induction Of Compositional Instructions For Multi-turn Dialogs Aksu Taha, Hazarika Devamanyu, Mehri Shikib, Kim Seokhwan, Hakkani-tür Dilek, Liu Yang, Namazifar Mahdi
- RL4F: Generating Natural Language Feedback With Reinforcement Learning For Repairing Model Outputs Akyürek Afra Feyza, Akyürek Ekin, Madaan Aman, Kalyan Ashwin, Clark Peter, Wijaya Derry, Tandon Niket
- The (ab)use Of Open Source Code To Train Large Language Models Al-kaswan Ali, Izadi Maliheh
- Position Interpolation Improves Alibi Extrapolation Al-khateeb Faisal, Dey Nolan, Soboleva Daria, Hestness Joel
- Chatgpt And Beyond: The Generative AI Revolution In Education Al-smadi Mohammad
- Mamba: Linear-time Sequence Modeling With Selective State Spaces Albert Gu, Tri Dao
- Mistral 7B Albert Q. Jiang, Alexandre Sablayrolles, Arthur Mensch, Chris Bamford, Devendra Singh Chaplot, Diego De Las Casas, Florian Bressand, Gianna Lengyel, Guillaume Lample, Lucile Saulnier, Lélio Renard Lavaud, Marie-anne Lachaux, Pierre Stock, Teven Le Scao, Thibaut Lavril, Thomas Wang, Timothée Lacroix, William El Sayed
- RET-LLM: Towards A General Read-write Memory For Large Language Models Ali Modarressi, Ayyoob Imani, Mohsen Fayyaz, Hinrich Schütze
- Pixt3: Pixel-based Table-to-text Generation Alonso Iñigo, Agirre Eneko, Lapata Mirella
- Is BERT Blind? Exploring The Effect Of Vision-and-language Pretraining On Visual Language Understanding Alper Morris, Fiman Michael, Averbuch-elor Hadar
- Steering Large Language Models For Machine Translation With Finetuning And In-context Learning Alves Duarte M., Guerreiro Nuno M., Alves João, Pombal José, Rei Ricardo, De Souza José G. C., Colombo Pierre, Martins André F. T.
- Taqyim: Evaluating Arabic NLP Tasks Using Chatgpt Models Alyafeai Zaid, Alshaibani Maged S., Alkhamissi Badr, Luqman Hamzah, Alareqi Ebrahim, Fadel Ali
- Prompt-engineering And Transformer-based Question Generation And Evaluation Amyeen Rubaba
- Openleaf: Open-domain Interleaved Image-text Generation And Evaluation An Jie, Yang Zhengyuan, Li Linjie, Wang Jianfeng, Lin Kevin, Liu Zicheng, Wang Lijuan, Luo Jiebo
- Detecting And Preventing Hallucinations In Large Vision Language Models Anisha Gunjal, Jihan Yin, Erhan Bas
- Scaling Laws For Language Encoding Models In Fmri Antonello Richard, Vaidya Aditya, Huth Alexander G.
- A Memory Model For Question Answering From Streaming Data Supported By Rehearsal And Anticipation Of Coreference Information Araujo Vladimir, Soto Alvaro, Moens Marie-francine
- Sequence-to-sequence Spanish Pre-trained Language Models Araujo Vladimir, Trusca Maria Mihaela, Tufiño Rodrigo, Moens Marie-francine
- Leancontext: Cost-efficient Domain-specific Question Answering Using Llms Arefeen Md Adnan, Debnath Biplob, Chakradhar Srimat
- Towards A Unified Multimodal Reasoning Framework Arun Abhinav, Mal Dipendra Singh, Soni Mehul, Sawada Tomohiro
- Factuality Challenges In The Era Of Large Language Models Augenstein Isabelle, Baldwin Timothy, Cha Meeyoung, Chakraborty Tanmoy, Ciampaglia Giovanni Luca, Corney David, Diresta Renee, Ferrara Emilio, Hale Scott, Halevy Alon, Hovy Eduard, Ji Heng, Menczer Filippo, Miguez Ruben, Nakov Preslav, Scheufele Dietram, Sharma Shivam, Zagni Giovanni
- Using Large Language Models For Zero-shot Natural Language Generation From Knowledge Graphs Axelsson Agnes, Skantze Gabriel
- Scaling Transformer To 1M Tokens And Beyond With RMT Aydar Bulatov, Yuri Kuratov, Yermek Kapushev, Mikhail S. Burtsev
- Evaluating Chatgpt As A Question Answering System: A Comprehensive Analysis And Comparison With Existing Models Bahak Hossein, Taheri Farzaneh, Zojaji Zahra, Kazemi Arefeh
- Chatgpt: Applications, Opportunities, And Threats Bahrini Aram, Khamoshifar Mohammadsadra, Abbasimehr Hossein, Riggs Robert J., Esmaeili Maryam, Majdabadkohne Rastin Mastali, Pasehvar Morteza
- Qwen Technical Report Bai Jinze, Bai Shuai, Chu Yunfei, Cui Zeyu, Dang Kai, Deng Xiaodong, Fan Yang, Ge Wenbin, Han Yu, Huang Fei, Hui Binyuan, Ji Luo, Li Mei, Lin Junyang, Lin Runji, Liu Dayiheng, Liu Gao, Lu Chengqiang, Lu Keming, Ma Jianxin, Men Rui, Ren Xingzhang, Ren Xuancheng, Tan Chuanqi, Tan Sinan, Tu Jianhong, Wang Peng, Wang Shijie, Wang Wei, Wu Shengguang, Xu Benfeng, Xu Jin, Yang An, Yang Hao, Yang Jian, Yang Shusheng, Yao Yang, Yu Bowen, Yuan Hongyi, Yuan Zheng, Zhang Jianwei, Zhang Xingxuan, Zhang Yichang, Zhang Zhenru, Zhou Chang, Zhou Jingren, Zhou Xiaohuan, Zhu Tianhang
- Longbench: A Bilingual, Multitask Benchmark For Long Context Understanding Bai Yushi, Lv Xin, Zhang Jiajie, Lyu Hongchang, Tang Jiankai, Huang Zhidian, Du Zhengxiao, Liu Xiao, Zeng Aohan, Hou Lei, Dong Yuxiao, Tang Jie, Li Juanzi
- Gpt-4: A Review On Advancements And Opportunities In Natural Language Processing Baktash Jawid Ahmad, Dawodi Mursal
- Tamil-llama: A New Tamil Language Model Based On Llama 2 Balachandran Abhinand
- No That's Not What I Meant: Handling Third Position Repair In Conversational Question Answering Balaraman Vevake, Eshghi Arash, Konstas Ioannis, Papaioannou Ioannis
- Fine-tuning Large Enterprise Language Models Via Ontological Reasoning Baldazzi Teodoro, Bellomarini Luigi, Ceri Stefano, Colombo Andrea, Gentili Andrea, Sallinger Emanuel
- The Belebele Benchmark: A Parallel Reading Comprehension Dataset In 122 Language Variants Bandarkar Lucas, Liang Davis, Muller Benjamin, Artetxe Mikel, Shukla Satya Narayan, Husa Donald, Goyal Naman, Krishnan Abhinandan, Zettlemoyer Luke, Khabsa Madian
- Exploring Iterative Enhancement For Improving Learnersourced Multiple-choice Question Explanations With Large Language Models Bao Qiming, Leinonen Juho, Peng Alex Yuxuan, Zhong Wanjun, Gendron Gaël, Pistotti Timothy, Huang Alice, Denny Paul, Witbrock Michael, Liu Jiamou
- Check Your Facts And Try Again: Improving Large Language Models With External Knowledge And Automated Feedback Baolin Peng, Michel Galley, Pengcheng He, Hao Cheng, Yujia Xie, Yu Hu, Qiuyuan Huang, Lars Liden, Zhou Yu, Weizhu Chen, Jianfeng Gao
- Do Language Models Learn About Legal Entity Types During Pretraining? Barale Claire, Rovatsos Michael, Bhuta Nehal
- Llamantino: Llama 2 Models For Effective Text Generation In Italian Language Basile Pierpaolo, Musacchio Elio, Polignano Marco, Siciliani Lucia, Fiameni Giuseppe, Semeraro Giovanni
- Text Summarization Using Large Language Models: A Comparative Study Of Mpt-7b-instruct, Falcon-7b-instruct, And Openai Chat-gpt Models Basyal Lochan, Sanghvi Mihir
- Exploring The Landscape Of Large Language Models In Medical Question Answering Bean Andrew M., Korgul Karolina, Krones Felix, Mccraith Robert, Mahdi Adam
- Sensitivity, Performance, Robustness: Deconstructing The Effect Of Sociodemographic Prompting Beck Tilman, Schuff Hendrik, Lauscher Anne, Gurevych Iryna
- AQUALLM: Audio Question Answering Data Generation Using Large Language Models Behera Swarup Ranjan, Injeti Krishna Mohan, Patibandla Jaya Sai Kiran, Pokala Praveen Kumar, Pailla Balakrishna Reddy
- Prompt Sketching For Large Language Models Beurer-kellner Luca, Müller Mark Niklas, Fischer Marc, Vechev Martin
- When Automated Assessment Meets Automated Content Generation: Examining Text Quality In The Era Of Gpts Bevilacqua Marialena, Oketch Kezia, Qin Ruiyang, Stamey Will, Zhang Xinyuan, Gan Yi, Yang Kai, Abbasi Ahmed
- Investigating Answerability Of Llms For Long-form Question Answering Bhat Meghana Moorthy, Meng Rui, Liu Ye, Zhou Yingbo, Yavuz Semih
- Exploring Large Language Models For Code Explanation Bhattacharya Paheli, Chakraborty Manojit, Palepu Kartheek N S N, Pandey Vikas, Dindorkar Ishan, Rajpurohit Rakesh, Gupta Rishabh
- Helm: Highlighted Evidence Augmented Language Model For Enhanced Table-to-text Generation Bian Junyi, Qin Xiaolei, Zou Wuhe, Huang Mengzuo, Luo Congyi, Zhang Ke, Zhang Weidong
- The Ups And Downs Of Large Language Model Inference With Vocabulary Trimming By Language Heuristics Bogoychev Nikolay, Chen Pinzhen, Haddow Barry, Birch Alexandra
- Chemcrow: Augmenting Large-language Models With Chemistry Tools Bran Andres M, Cox Sam, Schilter Oliver, Baldassari Carlo, White Andrew D, Schwaller Philippe
- Fine-grained Affective Processing Capabilities Emerging From Large Language Models Broekens Joost, Hilpert Bernhard, Verberne Suzan, Baraka Kim, Gebhard Patrick, Plaat Aske
- Insights Into Classifying And Mitigating Llms' Hallucinations Bruno Alessandro, Mazzeo Pier Luigi, Chetouani Aladine, Tliba Marouane, Kerkouri Mohamed Amine
- Local Large Language Models For Complex Structured Medical Tasks Bumgardner V. K. Cody, Mullen Aaron, Armstrong Sam, Hickey Caylin, Talbert Jeff
- A Comparative Study Of Code Generation Using Chatgpt 3.5 Across 10 Programming Languages Buscemi Alessio
- Just Tell Me: Prompt Engineering In Business Process Management Busch Kiran, Rochlitzer Alexander, Sola Diana, Leopold Henrik
- Automatic Construction Of A Korean Toxic Instruction Dataset For Ethical Tuning Of Large Language Models Byun Sungjoo, Jang Dongjun, Jo Hyemi, Shin Hyopil
- Pokemonchat: Auditing Chatgpt For Pokémon Universe Knowledge Cabello Laura, Li Jiaang, Chalkidis Ilias
- Development Of The Chatgpt, Generative Artificial Intelligence And Natural Large Language Models For Accountable Reporting And Use (CANGARU) Guidelines Cacciamani Giovanni E., Eppler Michael B., Ganjavi Conner, Pekan Asli, Biedermann Brett, Collins Gary S., Gill Inderbir S.
- Peek Across: Improving Multi-document Modeling Via Cross-document Question-answering Caciularu Avi, Peters Matthew E., Goldberger Jacob, Dagan Ido, Cohan Arman
- Leveraging Large Language Models For Scalable Vector Graphics-driven Image Understanding Cai Mu, Huang Zeyi, Li Yuheng, Ojha Utkarsh, Wang Haohan, Lee Yong Jae
- On The Application Of Large Language Models For Language Teaching And Assessment Technology Caines Andrew, Benedetto Luca, Taslimipoor Shiva, Davis Christopher, Gao Yuan, Andersen Oeistein, Yuan Zheng, Elliott Mark, Moore Russell, Bryant Christopher, Rei Marek, Yannakoudakis Helen, Mullooly Andrew, Nicholls Diane, Buttery Paula
- A Systematic Study Of Knowledge Distillation For Natural Language Generation With Pseudo-target Training Calderon Nitay, Mukherjee Subhabrata, Reichart Roi, Kantor Amir
- Oberta: Improving Sparse Transfer Learning Via Improved Initialization, Distillation, And Pruning Regimes Campos Daniel, Marques Alexandre, Kurtz Mark, Zhai Chengxiang
- H2ogpt: Democratizing Large Language Models Candel Arno, Mckinney Jon, Singer Philipp, Pfeiffer Pascal, Jeblick Maximilian, Prabhu Prithvi, Gambera Jeff, Landry Mark, Bansal Shivam, Chesler Ryan, Lee Chun Ming, Conde Marcos V., Stetsenko Pasha, Grellier Olivier, Ambati Srisatish
- Successor Features For Efficient Multisubject Controlled Text Generation Cao Meng, Fatemi Mehdi, Cheung Jackie Chi Kit, Shabanian Samira
- Autohall: Automated Hallucination Dataset Generation For Large Language Models Cao Zouying, Yang Yifei, Zhao Hai
- Promoting Generalized Cross-lingual Question Answering In Few-resource Scenarios Via Self-knowledge Distillation Carrino Casimiro Pio, Escolano Carlos, Fonollosa José A. R.
- Examining The Influence Of Varied Levels Of Domain Knowledge Base Inclusion In Gpt-based Intelligent Tutors Castleman Blake, Turkcan Mehmet Kerem
- Chatgpt To Replace Crowdsourcing Of Paraphrases For Intent Classification: Higher Diversity And Comparable Model Robustness Cegin Jan, Simko Jakub, Brusilovsky Peter
- Muse: Text-to-image Generation Via Masked Generative Transformers Chang Huiwen, Zhang Han, Barber Jarred, Maschinot Aj, Lezama Jose, Jiang Lu, Yang Ming-hsuan, Murphy Kevin, Freeman William T., Rubinstein Michael, Li Yuanzhen, Krishnan Dilip
- Learning To Generate Better Than Your LLM Chang Jonathan D., Brantley Kiante, Ramamurthy Rajkumar, Misra Dipendra, Sun Wen
- Salesbot 2.0: A Human-like Intent-guided Chit-chat Dataset Chang Wen-yu, Chen Yun-nung
- Cabbage Sweeter Than Cake? Analysing The Potential Of Large Language Models For Learning Conceptual Spaces Chatterjee Usashi, Gajbhiye Amit, Schockaert Steven
- Efficient GPT Model Pre-training Using Tensor Train Matrix Representation Chekalina Viktoriia, Novikov Georgii, Gusak Julia, Oseledets Ivan, Panchenko Alexander
- Fireact: Toward Language Agent Fine-tuning Chen Baian, Shu Chang, Shareghi Ehsan, Collier Nigel, Narasimhan Karthik, Yao Shunyu
- CLEX: Continuous Length Extrapolation For Large Language Models Chen Guanzheng, Li Xin, Meng Zaiqiao, Liang Shangsong, Bing Lidong
- Personalised Distillation: Empowering Open-sourced Llms With Adaptive Learning For Code Generation Chen Hailin, Saha Amrita, Hoi Steven, Joty Shafiq
- Walking Down The Memory Maze: Beyond Context Limit Through Interactive Reading Chen Howard, Pasunuru Ramakanth, Weston Jason, Celikyilmaz Asli
- Fairytalecqa: Integrating A Commonsense Knowledge Graph Into Children's Storybook Narratives Chen Jiaju, Lu Yuxuan, Zhang Shao, Yao Bingsheng, Dong Yuanzhe, Xu Ying, Li Yunyao, Wang Qianwen, Wang Dakuo, Sun Yuling
- Adaptation With Self-evaluation To Improve Selective Prediction In Llms Chen Jiefeng, Yoon Jinsung, Ebrahimi Sayna, Arik Sercan O, Pfister Tomas, Jha Somesh
- Quantifying Uncertainty In Answers From Any Language Model And Enhancing Their Trustworthiness Chen Jiuhai, Mueller Jonas
- Forgetful Large Language Models: Lessons Learned From Using Llms In Robot Programming Chen Juo-tung, Huang Chien-ming
- Epi-curriculum: Episodic Curriculum Learning For Low-resource Domain Adaptation In Neural Machine Translation Chen Keyu, Zhuang Di, Li Mingchen, Chang J. Morris
- Compcodevet: A Compiler-guided Validation And Enhancement Approach For Code Dataset Chen Le, Bhattacharjee Arijit, Ahmed Nesreen K., Hasabnis Niranjan, Oren Gal, Lei Bin, Jannesari Ali
- Large Language Models Are Visual Reasoning Coordinators Chen Liangyu, Li Bo, Shen Sheng, Yang Jingkang, Li Chunyuan, Keutzer Kurt, Darrell Trevor, Liu Ziwei
- Few-shot Data Synthesis For Open Domain Multi-hop Question Answering Chen Mingda, Chen Xilun, Yih Wen-tau
- Monolingual Or Multilingual Instruction Tuning: Which Makes A Better Alpaca Chen Pinzhen, Ji Shaoxiong, Bogoychev Nikolay, Kutuzov Andrey, Haddow Barry, Heafield Kenneth
- Dynamic Strategy Chain: Dynamic Zero-shot Cot For Long Mental Health Support Generation Chen Qi, Liu Dexi
- Evaluation Of Chatgpt Family Of Models For Biomedical Reasoning And Classification Chen Shan, Li Yingya, Lu Sheng, Van Hoang, Aerts Hugo Jwl, Savova Guergana K., Bitterman Danielle S.
- Evaluating Factual Consistency Of Summaries With Large Language Models Chen Shiqi, Gao Siyang, He Junxian
- Extending Context Window Of Large Language Models Via Positional Interpolation Chen Shouyuan, Wong Sherman, Chen Liangjian, Tian Yuandong
- Theoremqa: A Theorem-driven Question Answering Dataset Chen Wenhu, Yin Ming, Ku Max, Lu Pan, Wan Yixin, Ma Xueguang, Xu Jianyu, Wang Xinyi, Xia Tony
- Evaluating, Understanding, And Improving Constrained Text Generation For Large Language Models Chen Xiang, Wan Xiaojun
- Universal Self-consistency For Large Language Model Generation Chen Xinyun, Aksitov Renat, Alon Uri, Ren Jie, Xiao Kefan, Yin Pengcheng, Prakash Sushant, Sutton Charles, Wang Xuezhi, Zhou Denny
- Teaching Large Language Models To Self-debug Chen Xinyun, Lin Maxwell, Schärli Nathanael, Zhou Denny
- Minprompt: Graph-based Minimal Prompt Data Augmentation For Few-shot Question Answering Chen Xiusi, Jiang Jyun-yu, Chang Wei-cheng, Hsieh Cho-jui, Yu Hsiang-fu, Wang Wei
- Autoeval-video: An Automatic Benchmark For Assessing Large Vision Language Models In Open-ended Video Question Answering Chen Xiuyuan, Lin Yuan, Zhang Yuchen, Huang Weiran
- Tigerbot: An Open Multilingual Multitask LLM Chen Ye, Cai Wei, Wu Liangmin, Li Xiaowei, Xin Zhanxuan, Fu Cong
- NL2TL: Transforming Natural Languages To Temporal Logics Using Large Language Models Chen Yongchao, Gandhi Rujul, Zhang Yang, Fan Chuchu
- Token Prediction As Implicit Classification To Identify Llm-generated Text Chen Yutian, Kang Hao, Zhai Vivian, Li Liangze, Singh Rita, Raj Bhiksha
- T-eval: Evaluating The Tool Utilization Capability Of Large Language Models Step By Step Chen Zehui, Du Weihua, Zhang Wenwei, Liu Kuikun, Liu Jiangning, Zheng Miao, Zhuo Jingming, Zhang Songyang, Lin Dahua, Chen Kai, Zhao Feng
- Internvl: Scaling Up Vision Foundation Models And Aligning For Generic Visual-linguistic Tasks Chen Zhe, Wu Jiannan, Wang Wenhai, Su Weijie, Chen Guo, Xing Sen, Zhong Muyan, Zhang Qinglong, Zhu Xizhou, Lu Lewei, Li Bin, Luo Ping, Lu Tong, Qiao Yu, Dai Jifeng
- Lmexplainer: Grounding Knowledge And Explaining Language Models Chen Zichen, Chen Jianda, Chen Yuanyuan, Yu Han, Singh Ambuj K, Sra Misha
- Xplainllm: A QA Explanation Dataset For Understanding LLM Decision-making Chen Zichen, Chen Jianda, Gaidhani Mitali, Singh Ambuj, Sra Misha
- Llf-bench: Benchmark For Interactive Learning From Language Feedback Cheng Ching-an, Kolobov Andrey, Misra Dipendra, Nie Allen, Swaminathan Adith
- Black-box Prompt Optimization: Aligning Large Language Models Without Model Training Cheng Jiale, Liu Xiao, Zheng Kehan, Ke Pei, Wang Hongning, Dong Yuxiao, Tang Jie, Huang Minlie
- Lift Yourself Up: Retrieval-augmented Text Generation With Self Memory Cheng Xin, Luo Di, Chen Xiuying, Liu Lemao, Zhao Dongyan, Yan Rui
- Attention Alignment And Flexible Positional Embeddings Improve Transformer Length Extrapolation Chi Ta-chung, Fan Ting-han, Rudnicky Alexander I.
- M²chat: Empowering VLM For Multimodal LLM Interleaved Text-image Generation Chi Xiaowei, Zhang Rongyu, Jiang Zhengkai, Liu Yijiang, Wang Yatian, Qi Xingqun, Luo Wenhan, Gao Peng, Zhang Shanghang, Liu Qifeng, Guo Yike
- INSTRUCTEVAL: Towards Holistic Evaluation Of Instruction-tuned Large Language Models Chia Yew Ken, Hong Pengfei, Bing Lidong, Poria Soujanya
- Symbolic Planning And Code Generation For Grounded Dialogue Chiu Justin T., Zhao Wenting, Chen Derek, Vaduguru Saujas, Rush Alexander M., Fried Daniel
- Improving Zero-shot Reader By Reducing Distractions From Irrelevant Documents In Open-domain Question Answering Cho Sukmin, Seo Jeongyeon, Jeong Soyeong, Park Jong C.
- KCTS: Knowledge-constrained Tree Search Decoding With Token-level Hallucination Detection Choi Sehyun, Fang Tianqing, Wang Zhaowei, Song Yangqiu
- Generative Data Augmentation Using Llms Improves Distributional Robustness In Question Answering Chowdhury Arijit Ghosh, Chadha Aman
- Language And Task Arithmetic With Parameter-efficient Layers For Zero-shot Summarization Chronopoulou Alexandra, Pfeiffer Jonas, Maynez Joshua, Wang Xinyi, Ruder Sebastian, Agrawal Priyanka
- Investigating Hallucinations In Pruned Large Language Models For Abstractive Summarization Chrysostomou George, Zhao Zhixue, Williams Miles, Aletras Nikolaos
- Long Story Short: A Summarize-then-search Method For Long Video Question Answering Chung Jiwan, Yu Youngjae
- Increasing Diversity While Maintaining Accuracy: Text Data Generation With Large Language Models And Human Interventions Chung John Joon Young, Kamar Ece, Amershi Saleema
- Instructtods: Large Language Models For End-to-end Task-oriented Dialogue Systems Chung Willy, Cahyawijaya Samuel, Wilie Bryan, Lovenia Holy, Fung Pascale
- Dr. Jekyll And Mr. Hyde: Two Faces Of Llms Collu Matteo Gioele, Janssen-groesbeek Tom, Koffas Stefanos, Conti Mauro, Picek Stjepan
- Strong And Efficient Baselines For Open Domain Conversational Question Answering Coman Andrei C., Barlacchi Gianni, De Gispert Adrià
- Seamlessm4t: Massively Multilingual & Multimodal Machine Translation Seamless Communication, Barrault Loïc, Chung Yu-an, Meglioli Mariano Cora, Dale David, Dong Ning, Duquenne Paul-ambroise, Elsahar Hady, Gong Hongyu, Heffernan Kevin, Hoffman John, Klaiber Christopher, Li Pengwei, Licht Daniel, Maillard Jean, Rakotoarison Alice, Sadagopan Kaushik Ram, Wenzek Guillaume, Ye Ethan, Akula Bapi, Chen Peng-jen, Hachem Naji El, Ellis Brian, Gonzalez Gabriel Mejia, Haaheim Justin, Hansanti Prangthip, Howes Russ, Huang Bernie, Hwang Min-jae, Inaguma Hirofumi, Jain Somya, Kalbassi Elahe, Kallet Amanda, Kulikov Ilia, Lam Janice, Li Daniel, Ma Xutai, Mavlyutov Ruslan, Peloquin Benjamin, Ramadan Mohamed, Ramakrishnan Abinesh, Sun Anna, Tran Kevin, Tran Tuan, Tufanov Igor, Vogeti Vish, Wood Carleigh, Yang Yilin, Yu Bokai, Andrews Pierre, Balioglu Can, Costa-jussà Marta R., Celebi Onur, Elbayad Maha, Gao Cynthia, Guzmán Francisco, Kao Justine, Lee Ann, Mourachko Alexandre, Pino Juan, Popuri Sravya, Ropers Christophe, Saleem Safiyyah, Schwenk Holger, Tomasello Paden, Wang Changhan, Wang Jeff, Wang Skyler
- Machine Mindset: An MBTI Exploration Of Large Language Models Cui Jiaxi, Lv Liuzhenghao, Wen Jing, Wang Rongsheng, Tang Jing, Tian Yonghong, Yuan Li
- Ada-instruct: Adapting Instruction Generators For Complex Reasoning Cui Wanyun, Wang Qianle
- Evaluating Prompt-based Question Answering For Object Prediction In The Open Research Knowledge Graph D'souza Jennifer, Hrou Moussab, Auer Sören
- Neural Retrievers Are Biased Towards Llm-generated Content Dai Sunhao, Zhou Yuqi, Pang Liang, Liu Weihao, Hu Xiaolin, Liu Yong, Zhang Xiao, Wang Gang, Xu Jun
- Syntactic Knowledge Via Graph Attention With BERT In Machine Translation Dai Yuqian, Sharoff Serge, De Kamps Marc
- Educhat: A Large-scale Language Model-based Chatbot System For Intelligent Education Dan Yuhao, Lei Zhikai, Gu Yiyang, Li Yong, Yin Jianghao, Lin Jiaju, Ye Linhao, Tie Zhiyan, Zhou Yougen, Wang Yilei, Zhou Aimin, Zhou Ze, Chen Qin, Zhou Jie, He Liang, Qiu Xipeng
- Visual Chain Of Thought: Bridging Logical Gaps With Multimodal Infillings Daniel Rose, Vaishnavi Himakunthala, Andy Ouyang, Ryan He, Alex Mei, Yujie Lu, Michael Saxon, Chinmay Sonar, Diba Mirza, William Yang Wang
- Palm-e: An Embodied Multimodal Language Model Danny Driess, Fei Xia, Mehdi S. M. Sajjadi, Corey Lynch, Aakanksha Chowdhery, Brian Ichter, Ayzaan Wahid, Jonathan Tompson, Quan Vuong, Tianhe Yu, Wenlong Huang, Yevgen Chebotar, Pierre Sermanet, Daniel Duckworth, Sergey Levine, Vincent Vanhoucke, Karol Hausman, Marc Toussaint, Klaus Greff, Andy Zeng, Igor Mordatch, Pete Florence
- Flashattention-2: Faster Attention With Better Parallelism And Work Partitioning Dao Tri
- Optimizing Distributed Training On Frontier For Large Language Models Dash Sajal, Lyngaas Isaac, Yin Junqi, Wang Xiao, Egele Romain, Cong Guojing, Wang Feiyi, Balaprakash Prasanna
- GPT-3 Models Are Few-shot Financial Reasoners De Padua Raul Salles, Qureshi Imran, Karakaplan Mustafa U.
- Facechat: An Emotion-aware Face-to-face Dialogue Framework Deema Alnuhait, Qingyang Wu, Zhou Yu
- Divergent Token Metrics: Measuring Degradation To Prune Away LLM Components -- And Optimize Quantization Deiseroth Björn, Meuer Max, Gritsch Nikolas, Eichenberg Constantin, Schramowski Patrick, Aßenmacher Matthias, Kersting Kristian
- Speak While You Think: Streaming Speech Synthesis During Text Generation Dekel Avihu, Shechtman Slava, Fernandez Raul, Haws David, Kons Zvi, Hoory Ron
- Controlled Text Generation Via Language Model Arithmetic Dekoninck Jasper, Fischer Marc, Beurer-kellner Luca, Vechev Martin
- Skipdecode: Autoregressive Skip Decoding With Batching And Caching For Efficient LLM Inference Del Corro Luciano, Del Giorno Allie, Agarwal Sahaj, Yu Bin, Awadallah Ahmed, Mukherjee Subhabrata
- Prompting And Evaluating Large Language Models For Proactive Dialogues: Clarification, Target-guided, And Non-collaboration Deng Yang, Liao Lizi, Chen Liang, Wang Hongru, Lei Wenqiang, Chua Tat-seng
- Plug-and-play Policy Planner For Large Language Model Powered Dialogue Agents Deng Yang, Zhang Wenxuan, Lam Wai, Ng See-kiong, Chua Tat-seng
- Musilingo: Bridging Music And Text With Pre-trained Language Models For Music Captioning And Query Response Deng Zihao, Ma Yinghao, Liu Yudong, Guo Rongchen, Zhang Ge, Chen Wenhu, Huang Wenhao, Benetos Emmanouil
- Testing AI On Language Comprehension Tasks Reveals Insensitivity To Underlying Meaning Dentella Vittoria, Guenther Fritz, Murphy Elliot, Marcus Gary, Leivada Evelina
- Can Chatgpt Read Who You Are? Derner Erik, Kučera Dalibor, Oliver Nuria, Zahálka Jan
- Spqr: A Sparse-quantized Representation For Near-lossless LLM Weight Compression Dettmers Tim, Svirschevski Ruslan, Egiazarian Vage, Kuznedelev Denis, Frantar Elias, Ashkboos Saleh, Borzunov Alexander, Hoefler Torsten, Alistarh Dan
- Chatgpt For GTFS: Benchmarking Llms On GTFS Understanding And Retrieval Devunuri Saipraneeth, Qiam Shirin, Lehe Lewis
- Context-aware Transformer Pre-training For Answer Sentence Selection Di Liello Luca, Garg Siddhant, Moschitti Alessandro
- Evaluating Chatgpt As A Recommender System: A Rigorous Approach Di Palma Dario, Biancofiore Giovanni Maria, Anelli Vito Walter, Narducci Fedelucio, Di Noia Tommaso, Di Sciascio Eugenio
- Codefuse-13b: A Pretrained Multi-lingual Code Large Language Model Di Peng, Li Jianguo, Yu Hang, Jiang Wei, Cai Wenting, Cao Yang, Chen Chaoyu, Chen Dajun, Chen Hongwei, Chen Liang, Fan Gang, Gong Jie, Gong Zi, Hu Wen, Guo Tingting, Lei Zhichao, Li Ting, Li Zheng, Liang Ming, Liao Cong, Liu Bingchang, Liu Jiachen, Liu Zhiwei, Lu Shaojun, Shen Min, Wang Guangpei, Wang Huan, Wang Zhi, Xu Zhaogui, Yang Jiawei, Ye Qing, Zhang Gehao, Zhang Yu, Zhao Zelin, Zheng Xunjin, Zhou Hailian, Zhu Lifu, Zhu Xianying
- A Comprehensive Evaluation Of Neural SPARQL Query Generation From Natural Language Questions Diallo Papa Abdou Karim Karou, Reyd Samuel, Zouaq Amal
- Lmflow: An Extensible Toolkit For Finetuning And Inference Of Large Foundation Models Diao Shizhe, Pan Rui, Dong Hanze, Shum Ka Shun, Zhang Jipeng, Xiong Wei, Zhang Tong
- Can Large Language Models Assist In Hazard Analysis? Diemert Simon, Weber Jens H.
- Can Instruction Fine-tuned Language Models Identify Social Bias Through Prompting? Dige Omkar, Tian Jacob-junqi, Emerson David, Khattak Faiza Khan
- Using Large Language Model To Solve And Explain Physics Word Problems Approaching Human Level Ding Jingzhe, Cen Yan, Wei Xinyuan
- HPC-GPT: Integrating Large Language Model For High-performance Computing Ding Xianzhong, Chen Le, Emani Murali, Liao Chunhua, Lin Pei-hung, Vanderbruggen Tristan, Xie Zhen, Cerpa Alberto E., Du Wan
- Mapping The Design Space Of Interactions In Human-ai Text Co-creation Tasks Ding Zijian, Chan Joel
- Do Llms Work On Charts? Designing Few-shot Prompts For Chart Question Answering And Summarization Do Xuan Long, Hassanpour Mohammad, Masry Ahmed, Kavehzadeh Parsa, Hoque Enamul, Joty Shafiq
- Prompt Optimization Via Adversarial In-context Learning Do Xuan Long, Zhao Yiran, Brown Hannah, Xie Yuxi, Zhao James Xu, Chen Nancy F., Kawaguchi Kenji, Shieh Michael, He Junxian
- Probing Explicit And Implicit Gender Bias Through LLM Conditional Text Generation Dong Xiangjue, Wang Yibo, Yu Philip S., Caverlee James
- BAMBOO: A Comprehensive Benchmark For Evaluating Long Text Modeling Capacities Of Large Language Models Dong Zican, Tang Tianyi, Li Junyi, Zhao Wayne Xin, Wen Ji-rong
- Plugmed: Improving Specificity In Patient-centered Medical Dialogue Generation Using In-context Learning Dou Chengfeng, Jin Zhi, Jiao Wenping, Zhao Haiyan, Tao Zhenwei, Zhao Yongqiang
- Scratch Copilot Evaluation: Assessing Ai-assisted Creative Coding For Families Druga Stefania, Otero Nancy
- Classeval: A Manually-crafted Benchmark For Evaluating Llms On Class-level Code Generation Du Xueying, Liu Mingwei, Wang Kaixin, Wang Hanlin, Liu Junwei, Chen Yixuan, Feng Jiayi, Sha Chaofeng, Peng Xin, Lou Yiling
- Exploring The Relationship Between In-context Learning And Instruction Tuning Duan Hanyu, Tang Yixuan, Yang Yi, Abbasi Ahmed, Tam Kar Yan
- Bridging The Language Gap: Knowledge Injected Multilingual Question Answering Duan Zhichao, Li Xiuxing, Zhang Zhengyan, Li Zhenyu, Liu Ning, Wang Jianyong
- Sweeping Heterogeneity With Smart Mops: Mixture Of Prompts For LLM Task Adaptation Dun Chen, Garcia Mirian Hipolito, Zheng Guoqing, Awadallah Ahmed Hassan, Kyrillidis Anastasios, Sim Robert
- Stack Attention: Improving The Ability Of Transformers To Model Hierarchical Patterns Dusell Brian, Chiang David
- Halo: Estimation And Reduction Of Hallucinations In Open-source Weak Large Language Models Elaraby Mohamed, Lu Mengyin, Dunn Jacob, Zhang Xueying, Wang Yu, Liu Shizhu, Tian Pingchuan, Wang Yuping, Wang Yuxuan
- How Useful Are Educational Questions Generated By Large Language Models? Elkins Sabina, Kochmar Ekaterina, Cheung Jackie C. K., Serban Iulian
- A Comprehensive Performance Study Of Large Language Models On Novel AI Accelerators Emani Murali, Foreman Sam, Sastry Varuni, Xie Zhen, Raskar Siddhisanket, Arnold William, Thakur Rajeev, Vishwanath Venkatram, Papka Michael E.
- Context-driven Interactive Query Simulations Based On Generative Large Language Models Engelmann Björn, Breuer Timo, Friese Jana Isabelle, Schaer Philipp, Fuhr Norbert
- Textgenshap: Scalable Post-hoc Explanations In Text Generation With Long Documents Enouen James, Nakhost Hootan, Ebrahimi Sayna, Arik Sercan O., Liu Yan, Pfister Tomas
- Jacotext: A Pretrained Model For Java Code-text Generation Espejel Jessica López, Alassan Mahaman Sanoussi Yahaya, Dahhane Walid, Ettifouri El Hassane
- Chatgpt Outperforms Crowd-workers For Text-annotation Tasks Fabrizio Gilardi, Meysam Alizadeh, Maël Kubli
- Lm-polygraph: Uncertainty Estimation For Language Models Fadeeva Ekaterina, Vashurin Roman, Tsvigun Akim, Vazhentsev Artem, Petrakov Sergey, Fedyanin Kirill, Vasilev Daniil, Goncharova Elizaveta, Panchenko Alexander, Panov Maxim, Baldwin Timothy, Shelmanov Artem
- Chain-of-thought Tuning: Masked Language Models Can Also Think Step By Step In Natural Language Understanding Fan Caoyun, Tian Jidong, Li Yitian, Chen Wenqing, He Hao, Jin Yaohui
- Large Language Models For Code Analysis: Do Llms Really Do Their Job? Fang Chongzhou, Miao Ning, Srivastav Shaurya, Liu Jialin, Zhang Ruoyu, Fang Ruijie, Asmita, Tsang Ryan, Nazari Najmeh, Wang Han, Homayoun Houman
- Chatgpt As Data Augmentation For Compositional Generalization: A Case Study In Open Intent Detection Fang Yihao, Li Xianzhi, Thomas Stephen W., Zhu Xiaodan
- An Empirical Study Of Multitask Learning To Improve Open Domain Dialogue Systems Farahani Mehrdad, Johansson Richard
- Chatgpt Performance On Standardized Testing Exam -- A Proposed Strategy For Learners Farooq Umer, Anwar Saira
- Prompting Large Language Models With Speech Recognition Abilities Fathullah Yassir, Wu Chunyang, Lakomkin Egor, Jia Junteng, Shangguan Yuan, Li Ke, Guo Jinxi, Xiong Wenhan, Mahadeokar Jay, Kalinli Ozlem, Fuegen Christian, Seltzer Mike
- Audiochatllama: Towards General-purpose Speech Abilities For Llms Fathullah Yassir, Wu Chunyang, Lakomkin Egor, Li Ke, Jia Junteng, Shangguan Yuan, Mahadeokar Jay, Kalinli Ozlem, Fuegen Christian, Seltzer Mike
- Revisiting Instruction Fine-tuned Model Evaluation To Guide Industrial Applications Faysse Manuel, Viaud Gautier, Hudelot Céline, Colombo Pierre
- Friend Or Foe? Exploring The Implications Of Large Language Models On The Science System Fecher Benedikt, Hebing Marcel, Laufer Melissa, Pohle Jörg, Sofsky Fabian
- Transferable Decoding With Visual Entities For Zero-shot Image Captioning Fei Junjie, Wang Teng, Zhang Jinrui, He Zhenyu, Wang Chengjie, Zheng Feng
- Extending Context Window Of Large Language Models Via Semantic Compression Fei Weizhi, Niu Xueyan, Zhou Pingyi, Hou Lu, Bai Bo, Deng Lei, Han Wei
- Improving Factual Consistency Of Text Summarization By Adversarially Decoupling Comprehension And Embellishment Abilities Of Llms Feng Huawen, Fan Yan, Liu Xiong, Lin Ting-en, Yao Zekun, Wu Yuchuan, Huang Fei, Li Yongbin, Ma Qianli
- Towards Llm-driven Dialogue State Tracking Feng Yujie, Lu Zexin, Liu Bo, Zhan Liming, Wu Xiao-ming
- The Devil Is In The Errors: Leveraging Large Language Models For Fine-grained Machine Translation Evaluation Fernandes Patrick, Deutsch Daniel, Finkelstein Mara, Riley Parker, Martins André F. T., Neubig Graham, Garg Ankush, Clark Jonathan H., Freitag Markus, Firat Orhan
- Automating Behavioral Testing In Machine Translation Ferrando Javier, Sperber Matthias, Setiawan Hendra, Telaar Dominic, Hasan Saša
- Should Chatgpt Be Biased? Challenges And Risks Of Bias In Large Language Models Ferrara Emilio
- Leveraging Large Language Models For Automated Dialogue Analysis Finch Sarah E., Paek Ellie S., Choi Jinho D.
- MBR And QE Finetuning: Training-time Distillation Of The Best And Most Expensive Decoding Methods Finkelstein Mara, Naskar Subhajit, Mirzazadeh Mehdi, Shah Apurva, Freitag Markus
- Medalign: A Clinician-generated Dataset For Instruction Following With Electronic Medical Records Fleming Scott L., Lozano Alejandro, Haberkorn William J., Jindal Jenelle A., Reis Eduardo P., Thapa Rahul, Blankemeier Louis, Genkins Julian Z., Steinberg Ethan, Nayak Ashwin, Patel Birju S., Chiang Chia-chun, Callahan Alison, Huo Zepeng, Gatidis Sergios, Adams Scott J., Fayanju Oluseyi, Shah Shreya J., Savage Thomas, Goh Ethan, Chaudhari Akshay S., Aghaeepour Nima, Sharp Christopher, Pfeffer Michael A., Liang Percy, Chen Jonathan H., Morse Keith E., Brunskill Emma P., Fries Jason A., Shah Nigam H.
- Matching Pairs: Attributing Fine-tuned Models To Their Pre-trained Large Language Models Foley Myles, Rawat Ambrish, Lee Taesung, Hou Yufang, Picco Gabriele, Zizzo Giulio
- Enhancing Pipeline-based Conversational Agents With Large Language Models Foosherian Mina, Purwins Hendrik, Rathnayake Purna, Alam Touhidul, Teimao Rui, Thoben Klaus-dieter
- A Review Of Chatgpt Applications In Education, Marketing, Software Engineering, And Healthcare: Benefits, Drawbacks, And Research Directions Fraiwan Mohammad, Khasawneh Natheer
- Chatgpt & Mechanical Engineering: Examining Performance On The FE Mechanical Engineering And Undergraduate Exams Frenkel Matthew, Emara Hebah
- Inca: Rethinking In-car Conversational System Assessment Leveraging Large Language Models Friedl Ken E., Khan Abbas Goher, Sahoo Soumya Ranjan, Rony Md Rashad Al Hasan, Germies Jana, Süß Christian
- Codeapex: A Bilingual Programming Evaluation Benchmark For Large Language Models Fu Lingyue, Chai Huacan, Luo Shuang, Du Kounianhua, Zhang Weiming, Fan Longteng, Lei Jiayi, Rui Renting, Lin Jianghao, Fang Yuchen, Liu Yifan, Wang Jingkuan, Qi Siyuan, Zhang Kangning, Zhang Weinan, Yu Yong
- Generate Then Select: Open-ended Visual Question Answering Guided By World Knowledge Fu Xingyu, Zhang Sheng, Kwon Gukyeong, Perera Pramuditha, Zhu Henghui, Zhang Yuhao, Li Alexander Hanbo, Wang William Yang, Wang Zhiguo, Castelli Vittorio, Ng Patrick, Roth Dan, Xiang Bing
- Dynamic Clue Bottlenecks: Towards Interpretable-by-design Visual Question Answering Fu Xingyu, Zhou Ben, Chen Sihao, Yatskar Mark, Roth Dan
- Chain-of-thought Hub: A Continuous Effort To Measure Large Language Models' Reasoning Performance Fu Yao, Ou Litu, Chen Mingyu, Wan Yuhao, Peng Hao, Khot Tushar
- Safety Alignment In NLP Tasks: Weakly Aligned Summarization As An In-context Attack Fu Yu, Li Yufei, Xiao Wen, Liu Cong, Dong Yue
- Exposing Limitations Of Language Model Agents In Sequential-task Compositions On The Web Furuta Hiroki, Matsuo Yutaka, Faust Aleksandra, Gur Izzeddin
- The Impact And Applications Of Chatgpt: A Systematic Review Of Literature Reviews Gabashvili Irene S.
- Natural Language Commanding Via Program Synthesis Gandhi Apurva, Nguyen Thong Q., Jiao Huitian, Steen Robert, Bhatawdekar Ameya
- CLIPAG: Towards Generator-free Text-to-image Generation Ganz Roy, Elad Michael
- Jsontuning: Towards Generalizable, Robust, And Controllable Instruction Tuning Gao Chang, Zhang Wenxuan, Chen Guizhen, Lam Wai
- Text-to-sql Empowered By Large Language Models: A Benchmark Evaluation Gao Dawei, Wang Haibin, Li Yaliang, Sun Xiuyu, Qian Yichen, Ding Bolin, Zhou Jingren
- ASSISTGUI: Task-oriented Desktop Graphical User Interface Automation Gao Difei, Ji Lei, Bai Zechen, Ouyang Mingyu, Li Peiran, Mao Dongxing, Wu Qinchen, Zhang Weichen, Wang Peiyi, Guo Xiangwu, Wang Hengxu, Zhou Luowei, Shou Mike Zheng
- Assistgpt: A General Multi-modal Assistant That Can Plan, Execute, Inspect, And Learn Gao Difei, Ji Lei, Zhou Luowei, Lin Kevin Qinghong, Chen Joya, Fan Zihan, Shou Mike Zheng
- Policy-gradient Training Of Language Models For Ranking Gao Ge, Chang Jonathan D., Cardie Claire, Brantley Kianté, Joachims Thorsten
- How To Design Translation Prompts For Chatgpt: An Empirical Study Gao Yuan, Wang Ruili, Hou Feng
- CLOVA: A Closed-loop Visual Assistant With Tool Usage And Update Gao Zhi, Du Yuntao, Zhang Xintong, Ma Xiaojian, Han Wenjuan, Zhu Song-chun, Li Qing
- Simulating H.P. Lovecraft Horror Literature With The Chatgpt Large Language Model Garrido-merchán Eduardo C., Arroyo-barrigüete José Luis, Gozalo-brizuela Roberto
- Building Trustworthy Neurosymbolic AI Systems: Consistency, Reliability, Explainability, And Safety Gaur Manas, Sheth Amit
- A Lightweight Method To Generate Unanswerable Questions In English Gautam Vagrant, Zhang Miaoran, Klakow Dietrich
- Ahead-of-time P-tuning Gavrilov Daniil, Balagansky Nikita
- Mllm-bench: Evaluating Multimodal Llms With Per-sample Criteria Ge Wentao, Chen Shunian, Chen Guiming Hardy, Chen Zhihong, Chen Junying, Yan Shuo, Zhu Chenghao, Lin Ziyue, Xie Wenya, Zhang Xinyi, Chai Yichen, Liu Xiaoyu, Song Dingjie, Wang Xidong, Gao Anningzhe, Zhang Zhiyi, Li Jianquan, Wan Xiang, Wang Benyou
- A Survey Of Confidence Estimation And Calibration In Large Language Models Geng Jiahui, Cai Fengyu, Wang Yuxia, Koeppl Heinz, Nakov Preslav, Gurevych Iryna
- Dictionary-based Phrase-level Prompting Of Large Language Models For Machine Translation Ghazvininejad Marjan, Gonen Hila, Zettlemoyer Luke
- Adapting Llms For Efficient, Personalized Information Retrieval: Methods And Implications Ghodratnama Samira, Zakershahrak Mehrdad
- Does Synthetic Data Make Large Language Models More Efficient? Gholami Sia, Omar Marwan
- Clipsyntel: CLIP And LLM Synergy For Multimodal Question Summarization In Healthcare Ghosh Akash, Acharya Arkadeep, Jain Raghav, Saha Sriparna, Chadha Aman, Sinha Setu
- Chatgpt Perpetuates Gender Bias In Machine Translation And Ignores Non-gendered Pronouns: Findings Across Bengali And Five Other Low-resource Languages Ghosh Sourojit, Caliskan Aylin
- Resetox: Re-learning Attention Weights For Toxicity Mitigation In Machine Translation Gilabert Javier García, Escolano Carlos, Costa-jussà Marta R.
- Semantic Compression With Large Language Models Gilbert Henry, Sandborn Michael, Schmidt Douglas C., Spencer-smith Jesse, White Jules
- Prompt Cache: Modular Attention Reuse For Low-latency Inference Gim In, Chen Guojun, Lee Seung-seob, Sarda Nikhil, Khandelwal Anurag, Zhong Lin
- Mtuncertainty: Assessing The Need For Post-editing Of Machine Translation Outputs By Fine-tuning Openai Llms Gladkoff Serge, Han Lifeng, Erofeev Gleb, Sorokina Irina, Nenadic Goran
- Coascore: Chain-of-aspects Prompting For NLG Evaluation Gong Peiyuan, Mao Jiaxin
- LILO: Learning Interpretable Libraries By Compressing And Documenting Code Grand Gabriel, Wong Lionel, Bowers Maddy, Olausson Theo X., Liu Muxin, Tenenbaum Joshua B., Andreas Jacob
- Not What You've Signed Up For: Compromising Real-world Llm-integrated Applications With Indirect Prompt Injection Greshake Kai, Abdelnabi Sahar, Mishra Shailesh, Endres Christoph, Holz Thorsten, Fritz Mario
- Pokemqa: Programmable Knowledge Editing For Multi-hop Question Answering Gu Hengrui, Zhou Kaixiong, Han Xiaotian, Liu Ninghao, Wang Ruobing, Wang Xin
- Neuron Patching: Semantic-based Neuron-level Language Model Repair For Code Generation Gu Jian, Aleti Aldeida, Chen Chunyang, Zhang Hongyu
- Linguistically Informed Chatgpt Prompts To Enhance Japanese-chinese Machine Translation: A Case Study On Attributive Clauses Gu Wenshi
- Mitigating Large Language Model Hallucinations Via Autonomous Knowledge Graph-based Retrofitting Guan Xinyan, Liu Yanjiang, Lin Hongyu, Lu Yaojie, He Ben, Han Xianpei, Sun Le
- Hallucinations In Large Multilingual Translation Models Guerreiro Nuno M., Alves Duarte, Waldendorf Jonas, Haddow Barry, Birch Alexandra, Colombo Pierre, Martins André F. T.
- Hiformer: Heterogeneous Feature Interactions Learning With Transformers For Recommender Systems Gui Huan, Wang Ruoxi, Yin Ke, Jin Long, Kula Maciej, Xu Taibai, Hong Lichan, Chi Ed H.
- Reinforced Self-training (ReST) For Language Modeling Gulcehre Caglar, Paine Tom Le, Srinivasan Srivatsan, Konyushkova Ksenia, Weerts Lotte, Sharma Abhishek, Siddhant Aditya, Ahern Alex, Wang Miaosen, Gu Chenjie, Macherey Wolfgang, Doucet Arnaud, Firat Orhan, De Freitas Nando
- OWL: A Large Language Model For IT Operations Guo Hongcheng, Yang Jian, Liu Jiaheng, Yang Liqun, Chai Linzheng, Bai Jiaqi, Peng Junran, Hu Xiaorong, Chen Chao, Zhang Dongfeng, Shi Xu, Zheng Tieqiao, Zheng Liangfan, Zhang Bo, Xu Ke, Li Zhoujun
- Glancing Future For Simultaneous Machine Translation Guo Shoutao, Zhang Shaolei, Feng Yang
- Knowledgenavigator: Leveraging Large Language Models For Enhanced Reasoning Over Knowledge Graph Guo Tiezheng, Yang Qingwen, Wang Chen, Liu Yanyi, Li Pan, Tang Jiawei, Li Dapeng, Wen Yingyou
- Instruction Fusion: Advancing Prompt Evolution Through Hybridization Guo Weidong, Yang Jiuding, Yang Kaitong, Li Xiangyang, Rao Zhuwei, Xu Yu, Niu Di
- Retrieval-augmented Code Generation For Universal Information Extraction Guo Yucan, Li Zixuan, Jin Xiaolong, Liu Yantao, Zeng Yutao, Liu Wenxuan, Li Xiang, Yang Pan, Bai Long, Guo Jiafeng, Cheng Xueqi
- Improving Small Language Models On Pubmedqa Via Generative Data Augmentation Guo Zhen, Wang Peiqi, Wang Yanwei, Yu Shangdi
- Evaluating Large Language Models: A Comprehensive Survey Guo Zishan, Jin Renren, Liu Chuang, Huang Yufei, Shi Dan, Supryadi, Yu Linhao, Liu Yan, Li Jiaxuan, Xiong Bojian, Xiong Deyi
- Point-bind & Point-llm: Aligning Point Cloud With Multi-modality For 3D Understanding, Generation, And Instruction Following Guo Ziyu, Zhang Renrui, Zhu Xiangyang, Tang Yiwen, Ma Xianzheng, Han Jiaming, Chen Kexin, Gao Peng, Li Xianzhi, Li Hongsheng, Heng Pheng-ann
- Testing LLM Performance On The Physics GRE: Some Observations Gupta Pranav
- Grace: Generation Using Associated Code Edits Gupta Priyanshu, Khare Avishree, Bajpai Yasharth, Chakraborty Saikat, Gulwani Sumit, Kanade Aditya, Radhakrishna Arjun, Soares Gustavo, Tiwari Ashish
- A Real-world Webagent With Planning, Long Context Understanding, And Program Synthesis Gur Izzeddin, Furuta Hiroki, Huang Austin, Safdari Mustafa, Matsuo Yutaka, Eck Douglas, Faust Aleksandra
- A Survey Of Vision-language Pre-training From The Lens Of Multimodal Machine Translation Gwinnup Jeremy, Duh Kevin
- Regulating Chatgpt And Other Large Generative AI Models Hacker Philipp, Engel Andreas, Mauer Marco
- Is GPT-4 A Reliable Rater? Evaluating Consistency In GPT-4 Text Ratings Hackl Veronika, Müller Alexandra Elena, Granitzer Michael, Sailer Maximilian
- Evaluation Of AI Chatbots For Patient-specific EHR Questions Hamidi Alaleh, Roberts Kirk
- Lm-infinite: Zero-shot Extreme Length Generalization For Large Language Models Han Chi, Wang Qifan, Peng Hao, Xiong Wenhan, Chen Yu, Ji Heng, Wang Sinong
- Onellm: One Framework To Align All Modalities With Language Han Jiaming, Gong Kaixiong, Zhang Yiyuan, Wang Jiaqi, Zhang Kaipeng, Lin Dahua, Qiao Yu, Gao Peng, Yue Xiangyu
- Cheddar: Student-chatgpt Dialogue In EFL Writing Education Han Jieun, Yoo Haneul, Myung Junho, Kim Minsun, Lee Tak Yeon, Ahn So-yeon, Oh Alice
- Medalpaca -- An Open-source Collection Of Medical Conversational AI Models And Training Data Han Tianyu, Adams Lisa C., Papaioannou Jens-michalis, Grundmann Paul, Oberhauser Tom, Löser Alexander, Truhn Daniel, Bressem Keno K.
- Blockwise Parallel Transformer For Large Context Models Hao Liu, Pieter Abbeel
- Toolkengpt: Augmenting Frozen Language Models With Massive Tools Via Tool Embeddings Hao Shibo, Liu Tianyang, Wang Zhen, Hu Zhiting
- Herd: Using Multiple, Smaller Llms To Match The Performances Of Proprietary, Large Llms Via An Intelligent Composer Hari Surya Narayanan, Thomson Matt
- Unlocking The Potential Of Chatgpt: A Comprehensive Exploration Of Its Applications, Advantages, Limitations, And Future Directions In Natural Language Processing Hariri Walid
- Leveraging Large Language Models For Sequential Recommendation Harte Jesse, Zorgdrager Wouter, Louridas Panos, Katsifodimos Asterios, Jannach Dietmar, Fragkoulis Marios
- The Political Ideology Of Conversational AI: Converging Evidence On Chatgpt's Pro-environmental, Left-libertarian Orientation Hartmann Jochen, Schwenzow Jasper, Witte Maximilian
- Sok: Memorization In General-purpose Large Language Models Hartmann Valentin, Suri Anshuman, Bindschaedler Vincent, Evans David, Tople Shruti, West Robert
- Zero- And Few-shot Prompting With Llms: A Comparative Study With Fine-tuned Models For Bangla Sentiment Analysis Hasan Md. Arid, Das Shudipta, Anjum Afiyat, Alam Firoj, Anjum Anika, Sarker Avijit, Noori Sheak Rashed Haider
- Never Lost In The Middle: Mastering Long-context Question Answering With Position-agnostic Decompositional Training He Junqing, Pan Kunhao, Dong Xiaoqun, Song Zhuoyang, Liu Yibo, Sun Qianguo, Liang Yuxin, Wang Hao, Zhang Enming, Zhang Jiaxing
- Multi-modal Latent Space Learning For Chain-of-thought Reasoning In Language Models He Liqi, Li Zuchao, Cai Xiantao, Wang Ping
- Can Large Language Models Understand Real-world Complex Instructions? He Qianyu, Zeng Jie, Huang Wenhao, Chen Lina, Xiao Jin, He Qianxi, Zhou Xunzhe, Chen Lida, Wang Xintao, Huang Yuncheng, Ye Haoning, Li Zihan, Chen Shisong, Zhang Yikai, Gu Zhouhong, Liang Jiaqing, Xiao Yanghua
- Multimodal Graph Transformer For Multimodal Question Answering He Xuehai, Wang Xin Eric
- Analyzing The Efficacy Of An Llm-only Approach For Image-based Document Question Answering Hegde Nidhi, Paul Sujoy, Madan Gagan, Aggarwal Gaurav
- How Good Are GPT Models At Machine Translation? A Comprehensive Evaluation Hendy Amr, Abdelrehim Mohamed, Sharaf Amr, Raunak Vikas, Gabr Mohamed, Matsushita Hitokazu, Kim Young Jin, Afify Mohamed, Awadalla Hany Hassan
- Towards Verifiable Text Generation With Symbolic References Hennigen Lucas Torroba, Shen Shannon, Nrusimha Aniruddha, Gapp Bernhard, Sontag David, Kim Yoon
- Improving Long Context Document-level Machine Translation Herold Christian, Ney Hermann
- Backpack Language Models Hewitt John, Thickstun John, Manning Christopher D., Liang Percy
- Assessing The Efficacy Of Large Language Models In Generating Accurate Teacher Responses Hicke Yann, Masand Abhishek, Guo Wentao, Gangavarapu Tushaar
- Harnessing The Zero-shot Power Of Instruction-tuned Large Language Model In End-to-end Speech Recognition Higuchi Yosuke, Ogawa Tetsuji, Kobayashi Tetsunori
- Newsgpt: Chatgpt Integration For Robot-reporter Hireche Abdelhadi, Belkacem Abdelkader Nasreddine, Jamil Sadia, Chen Chao
- L2MAC: Large Language Model Automatic Computer For Extensive Code Generation Holt Samuel, Luyten Max Ruiz, Van Der Schaar Mihaela
- Negative Object Presence Evaluation (NOPE) To Measure Object Hallucination In Vision-language Models Holy Lovenia, Wenliang Dai, Samuel Cahyawijaya, Ziwei Ji, Pascale Fung
- Turbulence: Systematically And Automatically Testing Instruction-tuned Large Language Models For Code Honarvar Shahin, Van Der Wilk Mark, Donaldson Alastair
- Zero-shot Goal-directed Dialogue Via RL On Imagined Conversations Hong Joey, Levine Sergey, Dragan Anca
- 3D-LLM: Injecting The 3D World Into Large Language Models Hong Yining, Zhen Haoyu, Chen Peihao, Zheng Shuhong, Du Yilun, Chen Zhenfang, Gan Chuang
- CIEM: Contrastive Instruction Evaluation Method For Better Instruction Tuning Hongyu Hu, Jiyuan Zhang, Minyi Zhao, Zhenbang Sun
- Ralle: A Framework For Developing And Evaluating Retrieval-augmented Large Language Models Hoshi Yasuto, Miyashita Daisuke, Ng Youyang, Tatsuno Kento, Morioka Yasuhiro, Torii Osamu, Deguchi Jun
- Distilling Step-by-step! Outperforming Larger Language Models With Less Training Data And Smaller Model Sizes Hsieh Cheng-yu, Li Chun-liang, Yeh Chih-kuan, Nakhost Hootan, Fujii Yasuhisa, Ratner Alexander, Krishna Ranjay, Lee Chen-yu, Pfister Tomas
- Scaled Prompt-tuning For Few-shot Natural Language Generation Hu Ting, Meinel Christoph, Yang Haojin
- BLIVA: A Simple Multimodal LLM For Better Handling Of Text-rich Visual Questions Hu Wenbo, Xu Yifan, Li Yi, Li Weiyue, Chen Zeyuan, Tu Zhuowen
- Enhancing Answer Selection In Community Question Answering With Pre-trained And Large Language Models Hu Xinghang
- Separate The Wheat From The Chaff: Model Deficiency Unlearning Via Parameter-efficient Module Operation Hu Xinshuo, Li Dongfang, Hu Baotian, Zheng Zihao, Liu Zhenyu, Zhang Min
- Do Large Language Models Know About Facts? Hu Xuming, Chen Junzhe, Li Xiaochuan, Guo Yufei, Wen Lijie, Yu Philip S., Guo Zhijiang
- Decipherpref: Analyzing Influential Factors In Human Preference Judgments Via GPT-4 Hu Yebowen, Song Kaiqiang, Cho Sangwoo, Wang Xiaoyang, Foroosh Hassan, Liu Fei
- Visual Program Distillation: Distilling Tools And Programmatic Reasoning Into Vision-language Models Hu Yushi, Stretcu Otilia, Lu Chun-ta, Viswanathan Krishnamurthy, Hata Kenji, Luo Enming, Krishna Ranjay, Fuxman Ariel
- Token-level Adversarial Prompt Detection Based On Perplexity Measures And Contextual Information Hu Zhengmian, Wu Gang, Mitra Saayan, Zhang Ruiyi, Sun Tong, Huang Heng, Swaminathan Viswanathan
- Unlocking The Potential Of User Feedback: Leveraging Large Language Model As User Simulator To Enhance Dialogue System Hu Zhiyuan, Feng Yue, Luu Anh Tuan, Hooi Bryan, Lipani Aldo
- SMILE: Single-turn To Multi-turn Inclusive Language Expansion Via Chatgpt For Mental Health Support Huachuan Qiu, Hongliang He, Shuai Zhang, Anqi Li, Zhenzhong Lan
- Agentcoder: Multi-agent-based Code Generation With Iterative Testing And Optimisation Huang Dong, Zhang Jie M., Luck Michael, Bu Qingwen, Qing Yuhao, Cui Heming
- Directed Acyclic Transformer Pre-training For High-quality Non-autoregressive Text Generation Huang Fei, Ke Pei, Huang Minlie
- Chatgpt For Shaping The Future Of Dentistry: The Potential Of Multi-modal Large Language Model Huang Hanyao, Zheng Ou, Wang Dongdong, Yin Jiayi, Wang Zijin, Ding Shengxuan, Yin Heng, Xu Chuan, Yang Renjie, Zheng Qian, Shi Bing
- Who Is Chatgpt? Benchmarking Llms' Psychological Portrayal Using Psychobench Huang Jen-tse, Wang Wenxuan, Li Eric John, Lam Man Ho, Ren Shujie, Yuan Youliang, Jiao Wenxiang, Tu Zhaopeng, Lyu Michael R.
- Large Language Models Cannot Self-correct Reasoning Yet Huang Jie, Chen Xinyun, Mishra Swaroop, Zheng Huaixiu Steven, Yu Adams Wei, Song Xinying, Zhou Denny
- Modality Plug-and-play: Elastic Modality Adaptation In Multimodal Llms For Embodied AI Huang Kai, Yang Boyuan, Gao Wei
- Can Large Language Models Explain Themselves? A Study Of Llm-generated Self-explanations Huang Shiyuan, Mamidanna Siddarth, Jangam Shreedhar, Zhou Yilun, Gilpin Leilani H.
- Grounded Decoding: Guiding Text Generation With Grounded Models For Embodied Agents Huang Wenlong, Xia Fei, Shah Dhruv, Driess Danny, Zeng Andy, Lu Yao, Florence Pete, Mordatch Igor, Levine Sergey, Hausman Karol, Ichter Brian
- Generic Attention-model Explainability By Weighted Relevance Accumulation Huang Yiming, Jia Aozhe, Zhang Xiaodan, Zhang Jiawei
- Evaluating And Enhancing Large Language Models For Conversational Reasoning On Knowledge Graphs Huang Yuxuan, Shi Lida, Liu Anqi, Xu Hao
- How Does Pretraining Improve Discourse-aware Translation? Huang Zhihong, Wang Longyue, Liu Siyou, Wong Derek F.
- Soft Prompt Decoding For Multilingual Dense Retrieval Huang Zhiqi, Zeng Hansi, Zamani Hamed, Allan James
- Llama 2: Open Foundation And Fine-tuned Chat Models Hugo Touvron, Louis Martin, Kevin Stone, Peter Albert, Amjad Almahairi, Yasmine Babaei, Nikolay Bashlykov, Soumya Batra, Prajjwal Bhargava, Shruti Bhosale, Dan Bikel, Lukas Blecher, Cristian Canton Ferrer, Moya Chen, Guillem Cucurull, David Esiobu, Jude Fernandes, Jeremy Fu, Wenyin Fu, Brian Fuller, Cynthia Gao, Vedanuj Goswami, Naman Goyal, Anthony Hartshorn, Saghar Hosseini, Rui Hou, Hakan Inan, Marcin Kardas, Viktor Kerkez, Madian Khabsa, Isabel Kloumann, Artem Korenev, Punit Singh Koura, Marie-anne Lachaux, Thibaut Lavril, Jenya Lee, Diana Liskovich, Yinghai Lu, Yuning Mao, Xavier Martinet, Todor Mihaylov, Pushkar Mishra, Igor Molybog, Yixin Nie, Andrew Poulton, Jeremy Reizenstein, Rashi Rungta, Kalyan Saladi, Alan Schelten, Ruan Silva, Eric Michael Smith, Ranjan Subramanian, Xiaoqing Ellen Tan, Binh Tang, Ross Taylor, Adina Williams, Jian Xiang Kuan, Puxin Xu, Zheng Yan, Iliyan Zarov, Yuchen Zhang, Angela Fan, Melanie Kambadur, Sharan Narang, Aurelien Rodriguez, Robert Stojnic, Sergey Edunov, Thomas Scialom
- TADA: Efficient Task-agnostic Domain Adaptation For Transformers Hung Chia-chien, Lange Lukas, Strötgen Jannik
- Walking A Tightrope -- Evaluating Large Language Models In High-risk Domains Hung Chia-chien, Rim Wiem Ben, Frost Lindsay, Bruckner Lars, Lawrence Carolin
- Retrieving Supporting Evidence For Generative Question Answering Huo Siqing, Arabzadeh Negar, Clarke Charles L. A.
- Towards Leveraging Llms For Conditional QA Hussain Syed-amad, Dakle Parag Pravin, Rallabandi Saikrishna, Raghavan Preethi
- Understanding The Effectiveness Of Very Large Language Models On Dialog Evaluation Huynh Jessica, Jiao Cathy, Gupta Prakhar, Mehri Shikib, Bajaj Payal, Chaudhary Vishrav, Eskenazi Maxine
- Large Language Models As Sous Chefs: Revising Recipes With GPT-3 Hwang Alyssa, Li Bryan, Hou Zhaoyi, Roth Dan
- Llama Guard: Llm-based Input-output Safeguard For Human-ai Conversations Inan Hakan, Upasani Kartikeya, Chi Jianfeng, Rungta Rashi, Iyer Krithika, Mao Yuning, Tontchev Michael, Hu Qing, Fuller Brian, Testuggine Davide, Khabsa Madian
- Pushing Boundaries: Exploring Zero Shot Object Classification With Large Multimodal Models Islam Ashhadul, Biswas Md. Rafiul, Zaghouani Wajdi, Belhaouari Samir Brahim, Shah Zubair
- 14 Examples Of How Llms Can Transform Materials Science And Chemistry: A Reflection On A Large Language Model Hackathon Jablonka Kevin Maik, Ai Qianxiang, Al-feghali Alexander, Badhwar Shruti, Bocarsly Joshua D., Bran Andres M, Bringuier Stefan, Brinson L. Catherine, Choudhary Kamal, Circi Defne, Cox Sam, De Jong Wibe A., Evans Matthew L., Gastellu Nicolas, Genzling Jerome, Gil María Victoria, Gupta Ankur K., Hong Zhi, Imran Alishba, Kruschwitz Sabine, Labarre Anne, Lála Jakub, Liu Tao, Ma Steven, Majumdar Sauradeep, Merz Garrett W., Moitessier Nicolas, Moubarak Elias, Mouriño Beatriz, Pelkie Brenden, Pieler Michael, Ramos Mayk Caldas, Ranković Bojana, Rodriques Samuel G., Sanders Jacob N., Schwaller Philippe, Schwarting Marcus, Shi Jiale, Smit Berend, Smith Ben E., Van Herck Joren, Völker Christoph, Ward Logan, Warren Sean, Weiser Benjamin, Zhang Sylvester, Zhang Xiaoqi, Zia Ghezal Ahmad, Scourtas Aristana, Schmidt Kj, Foster Ian, White Andrew D., Blaiszik Ben
- Inpars-v2: Large Language Models As Efficient Dataset Generators For Information Retrieval Jeronymo Vitor, Bonifacio Luiz, Abonizio Hugo, Fadaee Marzieh, Lotufo Roberto, Zavrel Jakub, Nogueira Rodrigo
- Neuro Symbolic Reasoning For Planning: Counterexample Guided Inductive Synthesis Using Large Language Models And Satisfiability Solving Jha Sumit Kumar, Jha Susmit, Lincoln Patrick, Bastian Nathaniel D., Velasquez Alvaro, Ewetz Rickard, Neema Sandeep
- Tailoring Language Generation Models Under Total Variation Distance Ji Haozhe, Ke Pei, Hu Zhipeng, Zhang Rongsheng, Huang Minlie
- Rethinking Large Language Models In Mental Health Applications Ji Shaoxiong, Zhang Tianlin, Yang Kailai, Ananiadou Sophia, Cambria Erik
- Exploring The Impact Of Instruction Data Scaling On Large Language Models: An Empirical Study On Real-world Use Cases Ji Yunjie, Deng Yong, Gong Yan, Peng Yiping, Niu Qiang, Zhang Lei, Ma Baochang, Li Xiangang
- Benchmarking And Explaining Large Language Model-based Code Generation: A Causality-centric Approach Ji Zhenlan, Ma Pingchuan, Li Zongjie, Wang Shuai
- Zero-shot Faithfulness Evaluation For Text Summarization With Foundation Language Model Jia Qi, Ren Siyu, Liu Yizhu, Zhu Kenny Q.
- Mistral 7B Jiang Albert Q., Sablayrolles Alexandre, Mensch Arthur, Bamford Chris, Chaplot Devendra Singh, Casas Diego De Las, Bressand Florian, Lengyel Gianna, Lample Guillaume, Saulnier Lucile, Lavaud Lélio Renard, Lachaux Marie-anne, Stock Pierre, Scao Teven Le, Lavril Thibaut, Wang Thomas, Lacroix Timothée, Sayed William El
- Bus: Efficient And Effective Vision-language Pre-training With Bottom-up Patch Summarization Jiang Chaoya, Xu Haiyang, Ye Wei, Ye Qinghao, Li Chenliang, Yan Ming, Bi Bin, Zhang Shikun, Huang Fei, Huang Songfang
- Tigerscore: Towards Building Explainable Metric For All Text Generation Tasks Jiang Dongfu, Li Yishan, Zhang Ge, Huang Wenhao, Lin Bill Yuchen, Chen Wenhu
- From CLIP To DINO: Visual Encoders Shout In Multi-modal Large Language Models Jiang Dongsheng, Liu Yuchen, Liu Songlin, Zhao Jin'e, Zhang Hao, Gao Zhen, Zhang Xiaopeng, Li Jin, Xiong Hongkai
- Personallm: Investigating The Ability Of Large Language Models To Express Personality Traits Jiang Hang, Zhang Xiajie, Cao Xubo, Breazeal Cynthia, Roy Deb, Kabbara Jad
- Llmlingua: Compressing Prompts For Accelerated Inference Of Large Language Models Jiang Huiqiang, Wu Qianhui, Lin Chin-yew, Yang Yuqing, Qiu Lili
- Forcing Generative Models To Degenerate Ones: The Power Of Data Poisoning Attacks Jiang Shuli, Kadhe Swanand Ravindra, Zhou Yi, Cai Ling, Baracaldo Nathalie
- Selfevolve: A Code Evolution Framework Via Large Language Models Jiang Shuyang, Wang Yuhao, Wang Yu
- Prompt Packer: Deceiving Llms Through Compositional Instruction With Hidden Attacks Jiang Shuyu, Chen Xingshu, Tang Rui
- Hykge: A Hypothesis Knowledge Graph Enhanced Framework For Accurate And Reliable Medical Llms Responses Jiang Xinke, Zhang Ruizhe, Xu Yongxin, Qiu Rihong, Fang Yue, Wang Zhiyuan, Tang Jinyi, Ding Hongxin, Chu Xu, Zhao Junfeng, Wang Yasha
- BRAINTEASER: Lateral Thinking Puzzles For Large Language Models Jiang Yifan, Ilievski Filip, Ma Kaixin, Sourati Zhivar
- Iluvui: Instruction-tuned Language-vision Modeling Of Uis From Machine Conversations Jiang Yue, Schoop Eldon, Swearngin Amanda, Nichols Jeffrey
- Followbench: A Multi-level Fine-grained Constraints Following Benchmark For Large Language Models Jiang Yuxin, Wang Yufei, Zeng Xingshan, Zhong Wanjun, Li Liangyou, Mi Fei, Shang Lifeng, Jiang Xin, Liu Qun, Wang Wei
- Parrot: Translating During Chat Using Large Language Models Tuned With Human Translation And Feedback Jiao Wenxiang, Huang Jen-tse, Wang Wenxuan, He Zhiwei, Liang Tian, Wang Xing, Shi Shuming, Tu Zhaopeng
- Chain Of Thought Prompt Tuning In Vision Language Models Jiaxin Ge, Hongyin Luo, Siyuan Qian, Yulu Gan, Jie Fu, Shanghang Zhang
- Prompt-based Length Controlled Generation With Reinforcement Learning Jie Renlong, Meng Xiaojun, Shang Lifeng, Jiang Xin, Liu Qun
- Medcpt: Contrastive Pre-trained Transformers With Large-scale Pubmed Search Logs For Zero-shot Biomedical Information Retrieval Jin Qiao, Kim Won, Chen Qingyu, Comeau Donald C., Yeganova Lana, Wilbur W. John, Lu Zhiyong
- Binary Code Summarization: Benchmarking Chatgpt/GPT-4 And Other Large Language Models Jin Xin, Larson Jonathan, Yang Weiwei, Lin Zhiqiang
- Better To Ask In English: Cross-lingual Evaluation Of Large Language Models For Healthcare Queries Jin Yiqiao, Chandra Mohit, Verma Gaurav, Hu Yibo, De Choudhury Munmun, Kumar Srijan
- Knowledge-augmented Large Language Models For Personalized Contextual Query Suggestion Jinheon Baek, Nirupama Chandrasekaran, Silviu Cucerzan, Allen Herring, Sujay Kumar Jauhar
- GPT Is Becoming A Turing Machine: Here Are Some Ways To Program It Jojic Ana, Wang Zhen, Jojic Nebojsa
- Teaching Language Models To Hallucinate Less With Synthetic Tasks Jones Erik, Palangi Hamid, Simões Clarisse, Chandrasekaran Varun, Mukherjee Subhabrata, Mitra Arindam, Awadallah Ahmed, Kamar Ece
- "It's Not Like Jarvis, But It's Pretty Close!" -- Examining Chatgpt's Usage Among Undergraduate Students In Computer Science Joshi Ishika, Budhiraja Ritvik, Akolekar Harshal D, Challa Jagat Sesh, Kumar Dhruv
- Chatgpt In The Classroom: An Analysis Of Its Strengths And Weaknesses For Solving Undergraduate Computer Science Questions Joshi Ishika, Budhiraja Ritvik, Dev Harshal, Kadia Jahnvi, Ataullah M. Osama, Mitra Sayan, Kumar Dhruv, Akolekar Harshal D.
- A Glimpse In Chatgpt Capabilities And Its Impact For AI Research Joublin Frank, Ceravola Antonello, Deigmoeller Joerg, Gienger Michael, Franzius Mathias, Eggert Julian
- Impossible Distillation: From Low-quality Model To High-quality Dataset & Model For Summarization And Paraphrasing Jung Jaehun, West Peter, Jiang Liwei, Brahman Faeze, Lu Ximing, Fisher Jillian, Sorensen Taylor, Choi Yejin
- BLIP-2: Bootstrapping Language-image Pre-training With Frozen Image Encoders And Large Language Models Junnan Li, Dongxu Li, Silvio Savarese, Steven Hoi
- TARJAMAT: Evaluation Of Bard And Chatgpt On Machine Translation Of Ten Arabic Varieties Kadaoui Karima, Magdy Samar M., Waheed Abdul, Khondaker Md Tawkat Islam, El-shangiti Ahmed Oumar, Nagoudi El Moatez Billah, Abdul-mageed Muhammad
- Domain-specific Code Language Models: Unraveling The Potential For HPC Codes And Tasks Kadosh Tal, Hasabnis Niranjan, Vo Vy A., Schneider Nadav, Krien Neva, Capota Mihai, Wasay Abdul, Ahmed Nesreen, Willke Ted, Tamir Guy, Pinter Yuval, Mattson Timothy, Oren Gal
- Robust Training For Conversational Question Answering Models With Reinforced Reformulation Generation Kaiser Magdalena, Roy Rishiraj Saha, Weikum Gerhard
- Contextual Code Switching For Machine Translation Using Language Models Kaji Arshad, Shah Manan
- Swectrl-mini: A Data-transparent Transformer-based Large Language Model For Controllable Text Generation In Swedish Kalpakchi Dmytro, Boye Johan
- Evaluating Open-domain Question Answering In The Era Of Large Language Models Kamalloo Ehsan, Dziri Nouha, Clarke Charles L. A., Rafiei Davood
- Impact Of Co-occurrence On Factual Knowledge Of Large Language Models Kang Cheongwoong, Choi Jaesik
- Ever: Mitigating Hallucination In Large Language Models Through Real-time Verification And Rectification Kang Haoqiang, Ni Juntong, Yao Huaxiu
- Knowledge-augmented Reasoning Distillation For Small Language Models In Knowledge-intensive Tasks Kang Minki, Lee Seanie, Baek Jinheon, Kawaguchi Kenji, Hwang Sung Ju
- Shs-nlp At Radsum23: Domain-adaptive Pre-training Of Instruction-tuned Llms For Radiology Report Impression Generation Karn Sanjeev Kumar, Ghosh Rikhiya, P Kusuma, Farri Oladimeji
- Exploring Automatic Evaluation Methods Based On A Decoder-based LLM For Text Generation Kasahara Tomohito, Kawahara Daisuke
- Evaluating GPT-4 And Chatgpt On Japanese Medical Licensing Examinations Kasai Jungo, Kasai Yuhei, Sakaguchi Keisuke, Yamada Yutaro, Radev Dragomir
- Sorted Llama: Unlocking The Potential Of Intermediate Layers Of Large Language Models For Dynamic Inference Kavehzadeh Parsa, Valipour Mojtaba, Tahaei Marzieh, Ghodsi Ali, Chen Boxing, Rezagholizadeh Mehdi
- Application Of Frozen Large-scale Models To Multimodal Task-oriented Dialogue Kawamoto Tatsuki, Suzuki Takuma, Miyama Ko, Meguro Takumi, Takagi Tomohiro
- Appropriateness Is All You Need! Kempt Hendrik, Lavie Alon, Nagel Saskia K.
- Shikra: Unleashing Multimodal Llm's Referential Dialogue Magic Keqin Chen, Zhao Zhang, Weili Zeng, Richong Zhang, Feng Zhu, Rui Zhao
- Turning English-centric Llms Into Polyglots: How Much Multilinguality Is Needed? Kew Tannon, Schottmann Florian, Sennrich Rico
- Can Chatgpt And Bard Generate Aligned Assessment Items? A Reliability Analysis Against Human Performance Khademi Abdolvahab
- Assessing The Promise And Pitfalls Of Chatgpt For Automated Code Generation Khan Muhammad Fawad Akbar, Ramsdell Max, Falor Erik, Karimi Hamid
- Q: How To Specialize Large Vision-language Models To Data-scarce VQA Tasks? A: Self-train On Unlabeled Images! Khan Zaid, Bg Vijay Kumar, Schulter Samuel, Yu Xiang, Fu Yun, Chandraker Manmohan
- Tst\(^\mathrm{r}\): Target Similarity Tuning Meets The Real World Khatry Anirudh, Gulwani Sumit, Gupta Priyanshu, Le Vu, Singha Ananya, Singh Mukul, Verbruggen Gust
- Reliability Check: An Analysis Of GPT-3's Response To Sensitive Topics And Prompt Wording Khatun Aisha, Brown Daniel G.
- Exploring The Potential Of Large Language Models To Generate Formative Programming Feedback Kiesler Natalie, Lohr Dominic, Keuning Hieke
- Visually-situated Natural Language Understanding With Contrastive Reading Model And Frozen Large Language Models Kim Geewook, Lee Hodong, Kim Daehee, Jung Haeji, Park Sanghee, Kim Yoonsik, Yun Sangdoo, Kil Taeho, Lee Bado, Park Seunghyun
- GTA: Gated Toxicity Avoidance For LM Performance Preservation Kim Heegyu, Cho Hyunsouk
- Fantom: A Benchmark For Stress-testing Machine Theory Of Mind In Interactions Kim Hyunwoo, Sclar Melanie, Zhou Xuhui, Bras Ronan Le, Kim Gunhee, Choi Yejin, Sap Maarten
- Which Is Better? Exploring Prompting Strategy For Llm-based Metrics Kim Joonghoon, Park Saeran, Jeong Kiyoon, Lee Sangmin, Han Seung Hun, Lee Jiyoon, Kang Pilsung
- Speculative Decoding With Big Little Decoder Kim Sehoon, Mangalam Karttikeya, Moon Suhong, Malik Jitendra, Mahoney Michael W., Gholami Amir, Keutzer Kurt
- Cotever: Chain Of Thought Prompting Annotation Toolkit For Explanation Verification Kim Seungone, Joo Se June, Jang Yul, Chae Hyungjoo, Yeo Jinyoung
- Evallm: Interactive Evaluation Of Large Language Model Prompts On User-defined Criteria Kim Tae Soo, Lee Yoonjoo, Shin Jamin, Kim Young-ho, Kim Juho
- Exploiting Language Models As A Source Of Knowledge For Cognitive Agents Kirk James R., Wray Robert E., Laird John E.
- Large Language Models Are Temporal And Causal Reasoners For Video Question Answering Ko Dohwan, Lee Ji Soo, Kang Wooyoung, Roh Byungseok, Kim Hyunwoo J.
- Transformer Language Models Handle Word Frequency In Prediction Head Kobayashi Goro, Kuribayashi Tatsuki, Yokoi Sho, Inui Kentaro
- Chatgpt: Jack Of All Trades, Master Of None Kocoń Jan, Cichecki Igor, Kaszyca Oliwier, Kochanek Mateusz, Szydło Dominika, Baran Joanna, Bielaniewicz Julita, Gruza Marcin, Janz Arkadiusz, Kanclerz Kamil, Kocoń Anna, Koptyra Bartłomiej, Mieleszczenko-kowszewicz Wiktoria, Miłkowski Piotr, Oleksy Marcin, Piasecki Maciej, Radliński Łukasz, Wojtasik Konrad, Woźniak Stanisław, Kazienko Przemysław
- Writing Your Own Book: A Method For Going From Closed To Open Book QA To Improve Robustness And Performance Of Smaller Llms Kokaia Giorgi, Sinha Pratyush, Jiang Yutong, Boujemaa Nozha
- Contextual Refinement Of Translations: Large Language Models For Sentence And Document-level Post-editing Koneru Sai, Exel Miriam, Huck Matthias, Niehues Jan
- Little Giants: Exploring The Potential Of Small Llms As Evaluation Metrics In Summarization In The Eval4nlp 2023 Shared Task Kotonya Neema, Krishnasamy Saran, Tetreault Joel, Jaimes Alejandro
- Do Large Language Models Pay Similar Attention Like Human Programmers When Generating Code? Kou Bonan, Chen Shengmai, Wang Zhijie, Ma Lei, Zhang Tianyi
- In-context Explainers: Harnessing Llms For Explaining Black Box Models Kroeger Nicholas, Ley Dan, Krishna Satyapriya, Agarwal Chirag, Lakkaraju Himabindu
- Fundamentals Of Generative Large Language Models And Perspectives In Cyber-defense Kucharavy Andrei, Schillaci Zachary, Maréchal Loïc, Würsch Maxime, Dolamic Ljiljana, Sabonnadiere Remi, David Dimitri Percia, Mermoud Alain, Lenders Vincent
- Sensi-bert: Towards Sensitivity Driven Fine-tuning For Parameter-efficient BERT Kundu Souvik, Sridhar Sharath Nittur, Szankin Maciej, Sundaresan Sairam
- DACBERT: Leveraging Dependency Agreement For Cost-efficient BERT Pretraining Kuo Martin, Zhang Jianyi, Chen Yiran
- Sparse Fine-tuning For Inference Acceleration Of Large Language Models Kurtic Eldar, Kuznedelev Denis, Frantar Elias, Goin Michael, Alistarh Dan
- Chatgpt For Arabic Grammatical Error Correction Kwon Sang Yun, Bhatia Gagan, Nagoudi El Moatez Billah, Abdul-mageed Muhammad
- Beyond English: Evaluating Llms For Arabic Grammatical Error Correction Kwon Sang Yun, Bhatia Gagan, Nagoudi El Moatez Billah, Abdul-mageed Muhammad
- Hallucination Augmented Recitations For Language Models Köksal Abdullatif, Aksitov Renat, Chang Chung-ching
- Longform: Effective Instruction Tuning With Reverse Instructions Köksal Abdullatif, Schick Timo, Korhonen Anna, Schütze Hinrich
- Instruction-following Speech Recognition Lai Cheng-i Jeff, Lu Zhiyun, Cao Liangliang, Pang Ruoming
- Multidimensional Evaluation For Text Style Transfer Using Chatgpt Lai Huiyuan, Toral Antonio, Nissim Malvina
- Chatgpt Beyond English: Towards A Comprehensive Evaluation Of Large Language Models In Multilingual Learning Lai Viet Dac, Ngo Nghia Trung, Veyseh Amir Pouran Ben, Man Hieu, Dernoncourt Franck, Bui Trung, Nguyen Thien Huu
- Mini-dalle3: Interactive Text To Image By Prompting Large Language Models Lai Zeqiang, Zhu Xizhou, Dai Jifeng, Qiao Yu, Wang Wenhai
- Improving Zero-shot Visual Question Answering Via Large Language Models With Reasoning Question Prompts Lan Yunshi, Li Xiang, Liu Xin, Li Yang, Qin Wei, Qian Weining
- A Systematic Study And Comprehensive Evaluation Of Chatgpt On Benchmark Datasets Laskar Md Tahmid Rahman, Bari M Saiful, Rahman Mizanur, Bhuiyan Md Amran Hossen, Joty Shafiq, Huang Jimmy Xiangji
- Building Efficient Universal Classifiers With Natural Language Inference Laurer Moritz, Van Atteveldt Wouter, Casas Andreu, Welbers Kasper
- Making Large Language Models Better Data Creators Lee Dong-ho, Pujara Jay, Sewak Mohit, White Ryen W., Jauhar Sujay Kumar
- Towards Reliable And Fluent Large Language Models: Incorporating Feedback Learning Loops In QA Systems Lee Dongyub, Whang Taesun, Lee Chanhee, Lim Heuiseok
- RLAIF Vs. RLHF: Scaling Reinforcement Learning From Human Feedback With AI Feedback Lee Harrison, Phatale Samrat, Mansoor Hassan, Mesnard Thomas, Ferret Johan, Lu Kellie, Bishop Colton, Hall Ethan, Carbune Victor, Rastogi Abhinav, Prakash Sushant
- How Well Do Large Language Models Truly Ground? Lee Hyunji, Joo Sejune, Kim Chaeeun, Jang Joel, Kim Doyoung, On Kyoung-woon, Seo Minjoon
- Knowledge Corpus Error In Question Answering Lee Yejoon, Oh Philhoon, Thorne James
- Chain Of Natural Language Inference For Reducing Large Language Model Ungrounded Hallucinations Lei Deren, Li Yaxi, Hu Mengya, Wang Mingyu, Yun Vincent, Ching Emily, Kamal Eslam
- Mitigating Fine-grained Hallucination By Fine-tuning Large Vision-language Models With Caption Rewrites Lei Wang, Jiabang He, Shenshen Li, Ning Liu, Ee-peng Lim
- T-sciq: Teaching Multimodal Chain-of-thought Reasoning Via Mixed Large Language Model Signals For Science Question Answering Lei Wang, Yi Hu, Jiabang He, Xing Xu, Ning Liu, Hui Liu, Heng Tao Shen
- The Eval4nlp 2023 Shared Task On Prompting Large Language Models As Explainable Metrics Leiter Christoph, Opitz Juri, Deutsch Daniel, Gao Yang, Dror Rotem, Eger Steffen
- A Sentence Is Worth A Thousand Pictures: Can Large Language Models Understand Hum4n L4ngu4ge And The W0rld Behind W0rds? Leivada Evelina, Marcus Gary, Günther Fritz, Murphy Elliot
- Getting From Generative AI To Trustworthy AI: What Llms Might Learn From Cyc Lenat Doug, Marcus Gary
- Talk The Walk: Synthetic Data Generation For Conversational Music Recommendation Leszczynski Megan, Zhang Shu, Ganti Ravi, Balog Krisztian, Radlinski Filip, Pereira Fernando, Chaganty Arun Tejasvi
- Guiding LLM To Fool Itself: Automatically Manipulating Machine Reading Comprehension Shortcut Triggers Levy Mosh, Ravfogel Shauli, Goldberg Yoav
- Deliberate Then Generate: Enhanced Prompting Framework For Text Generation Li Bei, Wang Rui, Guo Junliang, Song Kaitao, Tan Xu, Hassan Hany, Menezes Arul, Xiao Tong, Bian Jiang, Zhu Jingbo
- Making Large Language Models A Better Foundation For Dense Retrieval Li Chaofan, Liu Zheng, Xiao Shitao, Shao Yingxia
- Learning To Rewrite Prompts For Personalized Text Generation Li Cheng, Zhang Mingyang, Mei Qiaozhu, Kong Weize, Bendersky Michael
- Teach Llms To Personalize -- An Approach Inspired By Writing Education Li Cheng, Zhang Mingyang, Mei Qiaozhu, Wang Yaqing, Hombaiah Spurthi Amba, Liang Yi, Bendersky Michael
- Mixed Distillation Helps Smaller Language Model Better Reasoning Li Chenglin, Chen Qianglong, Li Liangyue, Wang Caiyu, Li Yicheng, Chen Zulong, Zhang Yin
- Modelscope-agent: Building Your Customizable Agent System With Open-source Large Language Models Li Chenliang, Chen Hehong, Yan Ming, Shen Weizhou, Xu Haiyang, Wu Zhikai, Zhang Zhicheng, Zhou Wenmeng, Chen Yingda, Cheng Chen, Shi Hongzhu, Zhang Ji, Huang Fei, Zhou Jingren
- Llava-med: Training A Large Language-and-vision Assistant For Biomedicine In One Day Li Chunyuan, Wong Cliff, Zhang Sheng, Usuyama Naoto, Liu Haotian, Yang Jianwei, Naumann Tristan, Poon Hoifung, Gao Jianfeng
- Revisiting Large Language Models As Zero-shot Relation Extractors Li Guozheng, Wang Peng, Ke Wenjun
- Acecoder: Utilizing Existing Code To Enhance Code Generation Li Jia, Zhao Yunfei, Li Yongmin, Li Ge, Jin Zhi
- CFGPT: Chinese Financial Assistant With Large Language Model Li Jiangtong, Bian Yuxuan, Wang Guoxuan, Lei Yang, Cheng Dawei, Ding Zhijun, Jiang Changjun
- Overprompt: Enhancing Chatgpt Through Efficient In-context Learning Li Jiazheng, Zhao Runcong, Yang Yongxin, He Yulan, Gui Lin
- Contradoc: Understanding Self-contradictions In Documents With Large Language Models Li Jierui, Raheja Vipul, Kumar Dhruv
- MAQA: A Multimodal QA Benchmark For Negation Li Judith Yue, Jansen Aren, Huang Qingqing, Lee Joonseok, Ganti Ravi, Kuzmin Dima
- Covlm: Composing Visual Entities And Relationships In Large Language Models Via Communicative Decoding Li Junyan, Chen Delin, Hong Yining, Chen Zhenfang, Chen Peihao, Shen Yikang, Gan Chuang
- Watermarking Llms With Weight Quantization Li Linyang, Jiang Botian, Wang Pengyu, Ren Ke, Yan Hang, Qiu Xipeng
- Reflection-tuning: Data Recycling Improves LLM Instruction-tuning Li Ming, Chen Lichang, Chen Jiuhai, He Shwai, Huang Heng, Gu Jiuxiang, Zhou Tianyi
- Skillgpt: A Restful API Service For Skill Extraction And Standardization Using A Large Language Model Li Nan, Kang Bo, De Bie Tijl
- Adapting Large Language Models For Education: Foundational Capabilities, Potentials, And Challenges Li Qingyao, Fu Lingyue, Zhang Weiming, Chen Xianyu, Yu Jingwei, Xia Wei, Zhang Weinan, Tang Ruiming, Yu Yong
- Are Human-generated Demonstrations Necessary For In-context Learning? Li Rui, Wang Guoyin, Li Jiwei
- TRAQ: Trustworthy Retrieval Augmented Question Answering Via Conformal Prediction Li Shuo, Park Sangdon, Lee Insup, Bastani Osbert
- Context Matters: Data-efficient Augmentation Of Large Language Models For Scientific Applications Li Xiang, Tang Haoran, Chen Siyu, Wang Ziwei, Maravi Anurag, Abram Marcin
- Image Content Generation With Causal Reasoning Li Xiaochuan, Fan Baoyu, Zhang Runze, Jin Liang, Wang Di, Guo Zhenhua, Zhao Yaqian, Li Rengang
- Unigen: A Unified Generative Framework For Retrieval And Question Answering With Large Language Models Li Xiaoxi, Zhou Yujia, Dou Zhicheng
- Think Outside The Code: Brainstorming Boosts Large Language Models In Code Generation Li Xin-ye, Xue Jiang-tian, Xie Zheng, Li Ming
- Unlocking Temporal Question Answering For Large Language Models Using Code Execution Li Xingxuan, Cheng Liying, Tan Qingyu, Ng Hwee Tou, Joty Shafiq, Bing Lidong
- Explicit Syntactic Guidance For Neural Text Generation Li Yafu, Cui Leyang, Yan Jianhao, Yin Yongjing, Bi Wei, Shi Shuming, Zhang Yue
- Translation-enhanced Multilingual Text-to-image Generation Li Yaoyiran, Chang Ching-yun, Rawls Stephen, Vulić Ivan, Korhonen Anna
- Loftq: Lora-fine-tuning-aware Quantization For Large Language Models Li Yixiao, Yu Yifan, Liang Chen, He Pengcheng, Karampatziakis Nikos, Chen Weizhu, Zhao Tuo
- Losparse: Structured Compression Of Large Language Models Based On Low-rank And Sparse Approximation Li Yixiao, Yu Yifan, Zhang Qingru, Liang Chen, He Pengcheng, Chen Weizhu, Zhao Tuo
- Prompting Large Language Models For Counterfactual Generation: An Empirical Study Li Yongqi, Xu Mayi, Miao Xin, Zhou Shen, Qian Tieyun
- Posgen: Personalized Opening Sentence Generation For Online Insurance Sales Li Yu, Zhang Yi, Wu Weijia, Zhou Zimu, Li Qiang
- Compressing Context To Enhance Inference Efficiency Of Large Language Models Li Yucheng, Dong Bo, Lin Chenghua, Guerin Frank
- Towards Vision Enhancing Llms: Empowering Multimodal Knowledge Storage And Sharing In Llms Li Yunxin, Hu Baotian, Wang Wei, Cao Xiaochun, Zhang Min
- Advancing Precise Outline-conditioned Text Generation With Task Duality And Explicit Outline Control Li Yunzhe, Chen Qian, Yan Weixiang, Wang Wen, Zhang Qinglin, Sundaram Hari
- Guiding Large Language Models Via Directional Stimulus Prompting Li Zekun, Peng Baolin, He Pengcheng, Galley Michel, Gao Jianfeng, Yan Xifeng
- Flexkbqa: A Flexible Llm-powered Framework For Few-shot Knowledge Base Question Answering Li Zhenyu, Fan Sunqi, Gu Yu, Li Xiuxing, Duan Zhichao, Dong Bowen, Liu Ning, Wang Jianyong
- Split And Merge: Aligning Position Biases In Large Language Model Based Evaluators Li Zongjie, Wang Chaozheng, Ma Pingchuan, Wu Daoyuan, Wang Shuai, Gao Cuiyun, Liu Yang
- Batgpt: A Bidirectional Autoregessive Talker From Generative Pre-trained Transformer Li Zuchao, Zhang Shitou, Zhao Hai, Yang Yifei, Yang Dongjie
- Beyond Factuality: A Comprehensive Evaluation Of Large Language Models As Knowledge Generators Liang Chen, Yang Deng, Yatao Bian, Zeyu Qin, Bingzhe Wu, Tat-seng Chua, Kam-fai Wong
- Encouraging Divergent Thinking In Large Language Models Through Multi-agent Debate Liang Tian, He Zhiwei, Jiao Wenxiang, Wang Xing, Wang Rui, Yang Yujiu, Tu Zhaopeng, Shi Shuming
- Uhgeval: Benchmarking The Hallucination Of Chinese Large Language Models Via Unconstrained Generation Liang Xun, Song Shichao, Niu Simin, Li Zhiyu, Xiong Feiyu, Tang Bo, Wang Yezhaohui, He Dawei, Cheng Peng, Wang Zhonghao, Deng Haiying
- GPT-4 Enhanced Multimodal Grounding For Autonomous Driving: Leveraging Cross-modal Attention With Large Language Models Liao Haicheng, Shen Huanming, Li Zhenning, Wang Chengyue, Li Guofa, Bie Yiming, Xu Chengzhong
- AI Transparency In The Age Of Llms: A Human-centered Research Roadmap Liao Q. Vera, Vaughan Jennifer Wortman
- Opening Up Chatgpt: Tracking Openness, Transparency, And Accountability In Instruction-tuned Text Generators Liesenfeld Andreas, Lopez Alianda, Dingemanse Mark
- Paralinguistics-enhanced Large Language Modeling Of Spoken Dialogue Lin Guan-ting, Shivakumar Prashanth Gurunath, Gandhe Ankur, Yang Chao-han Huck, Gu Yile, Ghosh Shalini, Stolcke Andreas, Lee Hung-yi, Bulyko Ivan
- AWQ: Activation-aware Weight Quantization For LLM Compression And Acceleration Lin Ji, Tang Jiaming, Tang Haotian, Yang Shang, Chen Wei-ming, Wang Wei-chen, Xiao Guangxuan, Dang Xingyu, Gan Chuang, Han Song
- Batchprompt: Accomplish More With Less Lin Jianzhe, Diesendruck Maurice, Du Liang, Abraham Robin
- Advances In Embodied Navigation Using Large Language Models: A Survey Lin Jinzhou, Gao Han, Feng Xuxiang, Xu Rongtao, Wang Changwei, Zhang Man, Guo Li, Xu Shibiao
- Dynamic Scheduled Sampling With Imitation Loss For Neural Text Generation Lin Xiang, Jwalapuram Prathyusha, Joty Shafiq
- Towards Language-guided Interactive 3D Generation: Llms As Layout Interpreter With Generative Feedback Lin Yiqi, Wu Hao, Wang Ruichen, Lu Haonan, Lin Xiaodong, Xiong Hui, Wang Lin
- Pushing Large Language Models To The 6G Edge: Vision, Challenges, And Opportunities Lin Zheng, Qu Guanqiao, Chen Qiyuan, Chen Xianhao, Chen Zhe, Huang Kaibin
- SPHINX: The Joint Mixing Of Weights, Tasks, And Visual Embeddings For Multi-modal Large Language Models Lin Ziyi, Liu Chris, Zhang Renrui, Gao Peng, Qiu Longtian, Xiao Han, Qiu Han, Lin Chen, Shao Wenqi, Chen Keqin, Han Jiaming, Huang Siyuan, Zhang Yichi, He Xuming, Li Hongsheng, Qiao Yu
- Are Multilingual Llms Culturally-diverse Reasoners? An Investigation Into Multicultural Proverbs And Sayings Liu Chen Cecilia, Koto Fajri, Baldwin Timothy, Gurevych Iryna
- Code Execution With Pre-trained Language Models Liu Chenxiao, Lu Shuai, Chen Weizhu, Jiang Daxin, Svyatkovskiy Alexey, Fu Shengyu, Sundaresan Neel, Duan Nan
- Logicot: Logical Chain-of-thought Instruction-tuning Liu Hanmeng, Teng Zhiyang, Cui Leyang, Zhang Chaoli, Zhou Qiji, Zhang Yue
- Glore: Evaluating Logical Reasoning Of Large Language Models Liu Hanmeng, Teng Zhiyang, Ning Ruoxi, Liu Jian, Zhou Qiji, Zhang Yue
- Ring Attention With Blockwise Transformers For Near-infinite Context Liu Hao, Zaharia Matei, Abbeel Pieter
- Retrieval-based Knowledge Transfer: An Effective Approach For Extreme Large Language Model Compression Liu Jiduan, Liu Jiahao, Wang Qifan, Wang Jingang, Cai Xunliang, Zhao Dongyan, Wang Ran Lucien, Yan Rui
- Llm-powered Hierarchical Language Agent For Real-time Human-ai Coordination Liu Jijia, Yu Chao, Gao Jiaxuan, Xie Yuqing, Liao Qingmin, Wu Yi, Wang Yu
- RETA-LLM: A Retrieval-augmented Large Language Model Toolkit Liu Jiongnan, Jin Jiajie, Wang Zihan, Cheng Jiehan, Dou Zhicheng, Wen Ji-rong
- Conversational Question Answering With Reformulations Over Knowledge Graph Liu Lihui, Hill Blaine, Du Boxin, Wang Fei, Tong Hanghang
- An Evaluation Of GPT-4V And Gemini In Online VQA Liu Mengchen, Chen Chongyan, Gurari Danna
- Chipnemo: Domain-adapted Llms For Chip Design Liu Mingjie, Ene Teodor-dumitru, Kirby Robert, Cheng Chris, Pinckney Nathaniel, Liang Rongjian, Alben Jonah, Anand Himyanshu, Banerjee Sanmitra, Bayraktaroglu Ismet, Bhaskaran Bonita, Catanzaro Bryan, Chaudhuri Arjun, Clay Sharon, Dally Bill, Dang Laura, Deshpande Parikshit, Dhodhi Siddhanth, Halepete Sameer, Hill Eric, Hu Jiashang, Jain Sumit, Jindal Ankit, Khailany Brucek, Kokai George, Kunal Kishor, Li Xiaowei, Lind Charley, Liu Hao, Oberman Stuart, Omar Sujeet, Pasandi Ghasem, Pratty Sreedhar, Raiman Jonathan, Sarkar Ambar, Shao Zhengjiang, Sun Hanfei, Suthar Pratik P, Tej Varun, Turner Walker, Xu Kaizhe, Ren Haoxing
- X-eval: Generalizable Multi-aspect Text Evaluation Via Augmented Instruction Tuning With Auxiliary Evaluation Aspects Liu Minqian, Shen Ying, Xu Zhiyang, Cao Yixin, Cho Eunah, Kumar Vaibhav, Ghanadan Reza, Huang Lifu
- Nlebench+norglm: A Comprehensive Empirical Analysis And Benchmark Dataset For Generative Language Models In Norwegian Liu Peng, Zhang Lemei, Farup Terje Nissen, Lauvrak Even W., Ingvaldsen Jon Espen, Eide Simen, Gulla Jon Atle, Yang Zhirong
- Exploring The Boundaries Of GPT-4 In Radiology Liu Qianchu, Hyland Stephanie, Bannur Shruthi, Bouzid Kenza, Castro Daniel C., Wetscherek Maria Teodora, Tinn Robert, Sharma Harshita, Pérez-garcía Fernando, Schwaighofer Anton, Rajpurkar Pranav, Khanna Sameer Tajdin, Poon Hoifung, Usuyama Naoto, Thieme Anja, Nori Aditya V., Lungren Matthew P., Oktay Ozan, Alvarez-valle Javier
- Music Understanding Llama: Advancing Text-to-music Generation With Question Answering And Captioning Liu Shansong, Hussain Atin Sakkeer, Sun Chenshuo, Shan Ying
- MMHQA-ICL: Multimodal In-context Learning For Hybrid Question Answering Over Text, Tables And Images Liu Weihao, Lei Fangyu, Luo Tongxu, Lei Jiahe, He Shizhu, Zhao Jun, Liu Kang
- Beyond Text: Unveiling Multimodal Proficiency Of Large Language Models With Multiapi Benchmark Liu Xiao, Lin Jianfeng, Zhang Jiawei
- Fingpt: Democratizing Internet-scale Data For Financial Large Language Models Liu Xiao-yang, Wang Guoxuan, Yang Hongyang, Zha Daochen
- BOLT: Fast Energy-based Controlled Text Generation With Tunable Biases Liu Xin, Khalifa Muhammad, Wang Lu
- Litcab: Lightweight Language Model Calibration Over Short- And Long-form Responses Liu Xin, Khalifa Muhammad, Wang Lu
- Query-utterance Attention With Joint Modeling For Query-focused Meeting Summarization Liu Xingxian, Duan Bin, Xiao Bo, Xu Yajing
- What Large Language Models Bring To Text-rich VQA? Liu Xuejing, Tang Wei, Ni Xinzhe, Lu Jinghui, Zhao Rui, Li Zechao, Tan Fei
- Gumsum: Multi-genre Data And Evaluation For English Abstractive Summarization Liu Yang Janet, Zeldes Amir
- G-eval: NLG Evaluation Using GPT-4 With Better Human Alignment Liu Yang, Iter Dan, Xu Yichong, Wang Shuohang, Xu Ruochen, Zhu Chenguang
- Trustworthy Llms: A Survey And Guideline For Evaluating Large Language Models' Alignment Liu Yang, Yao Yuanshun, Ton Jean-francois, Zhang Xiaoying, Guo Ruocheng, Cheng Hao, Klochkov Yegor, Taufiq Muhammad Faaiz, Li Hang
- RECALL: A Benchmark For Llms Robustness Against External Counterfactual Knowledge Liu Yi, Huang Lianzhe, Li Shicheng, Chen Sishuo, Zhou Hao, Meng Fandong, Zhou Jie, Sun Xu
- Summary Of Chatgpt-related Research And Perspective Towards The Future Of Large Language Models Liu Yiheng, Han Tianle, Ma Siyuan, Zhang Jiayue, Yang Yuanyuan, Tian Jiaming, He Hao, Li Antong, He Mengshen, Liu Zhengliang, Wu Zihao, Zhao Lin, Zhu Dajiang, Li Xiang, Qiang Ning, Shen Dingang, Liu Tianming, Ge Bao
- Instruction Position Matters In Sequence Generation With Large Language Models Liu Yijin, Zeng Xianfeng, Meng Fandong, Zhou Jie
- Llms As Narcissistic Evaluators: When Ego Inflates Evaluation Scores Liu Yiqi, Moosavi Nafise Sadat, Lin Chenghua
- On Learning To Summarize With Large Language Models As References Liu Yixin, Shi Kejian, He Katherine S, Ye Longtian, Fabbri Alexander R., Liu Pengfei, Radev Dragomir, Cohan Arman
- Binary And Ternary Natural Language Generation Liu Zechun, Oguz Barlas, Pappu Aasish, Shi Yangyang, Krishnamoorthi Raghuraman
- Picking The Underused Heads: A Network Pruning Perspective Of Attention Head Selection For Fusing Dialogue Coreference Information Liu Zhengyuan, Chen Nancy F.
- Deja Vu: Contextual Sparsity For Efficient Llms At Inference Time Liu Zichang, Wang Jue, Dao Tri, Zhou Tianyi, Yuan Binhang, Song Zhao, Shrivastava Anshumali, Zhang Ce, Tian Yuandong, Re Christopher, Chen Beidi
- On Exploring The Reasoning Capability Of Large Language Models With Knowledge Graphs Lo Pei-chi, Tsai Yi-hang, Lim Ee-peng, Hwang San-yih
- Exploring Effectiveness Of GPT-3 In Grammatical Error Correction: A Study On Performance And Controllability In Prompt-based Methods Loem Mengsay, Kaneko Masahiro, Takase Sho, Okazaki Naoaki
- Harnessing Large Language Models' Empathetic Response Generation Capabilities For Online Mental Health Counselling Support Loh Siyuan Brandon, Raamkumar Aravind Sesagiri
- Data-to-text Generation For Severely Under-resourced Languages With GPT-3.5: A Bit Of Help Needed From Google Translate Lorandi Michela, Belz Anya
- Probing Pretrained Language Models With Hierarchy Properties Lovón-melgarejo Jesús, Moreno Jose G., Besançon Romaric, Ferret Olivier, Tamine Lynda
- Bounding The Capabilities Of Large Language Models In Open Text Generation With Prompt Constraints Lu Albert, Zhang Hongxin, Zhang Yanzhe, Wang Xuezhi, Yang Diyi
- EPA: Easy Prompt Augmentation On Large Language Models Via Multiple Sources And Multiple Targets Lu Hongyuan, Lam Wai
- Chain-of-dictionary Prompting Elicits Translation In Large Language Models Lu Hongyuan, Yang Haoran, Huang Haoyang, Zhang Dongdong, Lam Wai, Wei Furu
- Unified-io 2: Scaling Autoregressive Multimodal Models With Vision, Language, Audio, And Action Lu Jiasen, Clark Christopher, Lee Sangho, Zhang Zichen, Khosla Savya, Marten Ryan, Hoiem Derek, Kembhavi Aniruddha
- Ziya-visual: Bilingual Large Vision-language Model Via Multi-task Instruction Tuning Lu Junyu, Zhang Dixiang, Wu Xiaojun, Gao Xinyu, Gan Ruyi, Zhang Jiaxing, Song Yan, Zhang Pingjian
- Error Analysis Prompting Enables Human-like Translation Evaluation In Large Language Models Lu Qingyu, Qiu Baopu, Ding Liang, Zhang Kanjian, Kocmi Tom, Tao Dacheng
- Inference-time Policy Adapters (IPA): Tailoring Extreme-scale Lms Without Fine-tuning Lu Ximing, Brahman Faeze, West Peter, Jang Jaehun, Chandu Khyathi, Ravichander Abhilasha, Qin Lianhui, Ammanabrolu Prithviraj, Jiang Liwei, Ramnath Sahana, Dziri Nouha, Fisher Jillian, Lin Bill Yuchen, Hallinan Skyler, Ren Xiang, Welleck Sean, Choi Yejin
- Cheap And Quick: Efficient Vision-language Instruction Tuning For Large Language Models Luo Gen, Zhou Yiyi, Ren Tianhe, Chen Shengxin, Sun Xiaoshuai, Ji Rongrong
- SAIL: Search-augmented Instruction Learning Luo Hongyin, Chuang Yung-sung, Gong Yuan, Zhang Tianhua, Kim Yoon, Wu Xixin, Fox Danny, Meng Helen, Glass James
- Critique Ability Of Large Language Models Luo Liangchen, Lin Zi, Liu Yinxiao, Shu Lei, Zhu Yun, Shang Jingbo, Meng Lei
- Taiyi: A Bilingual Fine-tuned Large Language Model For Diverse Biomedical Tasks Luo Ling, Ning Jinzhong, Zhao Yingwen, Wang Zhijun, Ding Zeyuan, Chen Peng, Fu Weiru, Han Qinyu, Xu Guangtao, Qiu Yunzhi, Pan Dinghao, Li Jiru, Li Hao, Feng Wenduo, Tu Senbo, Liu Yuqi, Yang Zhihao, Wang Jian, Sun Yuanyuan, Lin Hongfei
- Reasoning On Graphs: Faithful And Interpretable Large Language Model Reasoning Luo Linhao, Li Yuan-fang, Haffari Gholamreza, Pan Shirui
- Systematic Assessment Of Factual Knowledge In Large Language Models Luo Linhao, Vu Thuy-trang, Phung Dinh, Haffari Gholamreza
- Hrot: Hybrid Prompt Strategy And Retrieval Of Thought For Table-text Hybrid Question Answering Luo Tongxu, Lei Fangyu, Lei Jiahe, Liu Weihao, He Shihu, Zhao Jun, Liu Kang
- Chatgpt As A Factual Inconsistency Evaluator For Text Summarization Luo Zheheng, Xie Qianqian, Ananiadou Sophia
- Collie: Collaborative Training Of Large Language Models In An Efficient Way Lv Kai, Zhang Shuo, Gu Tianle, Xing Shuhao, Hong Jiawei, Chen Keyu, Liu Xiaoran, Yang Yuqing, Guo Honglin, Liu Tengxiao, Sun Yu, Guo Qipeng, Yan Hang, Qiu Xipeng
- KOSMOS-2.5: A Multimodal Literate Model Lv Tengchao, Huang Yupan, Chen Jingye, Zhao Yuzhong, Jia Yilin, Cui Lei, Ma Shuming, Chang Yaoyao, Huang Shaohan, Wang Wenhui, Dong Li, Luo Weiyao, Wu Shaoxiang, Wang Guoxin, Zhang Cha, Wei Furu
- Comparative Analysis Of Drug-gpt And Chatgpt Llms For Healthcare Insights: Evaluating Accuracy And Relevance In Patient And HCP Contexts Lysandrou Giorgos, Owen Roma English, Mursec Kirsty, Brun Grant Le, Fairley Elizabeth A. L.
- A Paradigm Shift: The Future Of Machine Translation Lies With Large Language Models Lyu Chenyang, Du Zefeng, Xu Jitao, Duan Yitao, Wu Minghao, Lynn Teresa, Aji Alham Fikri, Wong Derek F., Liu Siyou, Wang Longyue
- Llm-rec: Personalized Recommendation Via Prompting Large Language Models Lyu Hanjia, Jiang Song, Zeng Hanqing, Xia Yinglong, Wang Qifan, Zhang Si, Chen Ren, Leung Christopher, Tang Jiajie, Luo Jiebo
- Paperqa: Retrieval-augmented Generative Agent For Scientific Research Lála Jakub, O'donoghue Odhran, Shtedritski Aleksandar, Cox Sam, Rodriques Samuel G., White Andrew D.
- Vista-llama: Reliable Video Narrator Via Equal Distance To Visual Tokens Ma Fan, Jin Xiaojie, Wang Heng, Xian Yuchen, Feng Jiashi, Yang Yi
- Untying The Reversal Curse Via Bidirectional Language Model Editing Ma Jun-yu, Gu Jia-chen, Ling Zhen-hua, Liu Quan, Liu Cong
- Chain-of-skills: A Configurable Model For Open-domain Question Answering Ma Kaixin, Cheng Hao, Zhang Yu, Liu Xiaodong, Nyberg Eric, Gao Jianfeng
- "Oops, Did I Just Say That?" Testing And Repairing Unethical Suggestions Of Large Language Models With Suggest-critique-reflect Process Ma Pingchuan, Li Zongjie, Sun Ao, Wang Shuai
- (Why) Is My Prompt Getting Worse? Rethinking Regression Testing For Evolving LLM Apis Ma Wanqin, Yang Chenyang, Kästner Christian
- Tomchallenges: A Principle-guided Dataset And Diverse Evaluation Tasks For Exploring Theory Of Mind Ma Xiaomeng, Gao Lingyu, Xu Qihui
- At Which Training Stage Does Code Data Help Llms Reasoning? Ma Yingwei, Liu Yue, Yu Yue, Zhang Yuanliang, Jiang Yu, Wang Changjian, Li Shanshan
- Non-autoregressive Streaming Transformer For Simultaneous Translation Ma Zhengrui, Zhang Shaolei, Guo Shoutao, Shao Chenze, Zhang Min, Feng Yang
- Large Language Models For Telecom: Forthcoming Impact On The Industry Maatouk Ali, Piovesan Nicola, Ayed Fadhel, De Domenico Antonio, Debbah Merouane
- Exploring Prompting Large Language Models As Explainable Metrics Mahmoudi Ghazaleh
- Enhancing Conversational Quality In Language Learning Chatbots: An Evaluation Of GPT4 For ASR Error Correction Mai Long, Carson-berndsen Julie
- Adapting Pre-trained Generative Models For Extractive Question Answering Mallick Prabir, Nayak Tapas, Bhattacharya Indrajit
- Aligning Large Language Models For Clinical Tasks Manathunga Supun, Hettigoda Isuru
- Zero-shot Conversational Summarization Evaluations With Small Large Language Models Manuvinakurike Ramesh, Sahay Saurav, Manepalli Sangeeta, Nachman Lama
- Large Language Models Know Your Contextual Search Intent: A Prompting Framework For Conversational Search Mao Kelong, Dou Zhicheng, Mo Fengran, Hou Jiewen, Chen Haonan, Qian Hongjin
- ICL Markup: Structuring In-context Learning Using Soft-token Tags Marc-etienne Brunet, Ashton Anderson, Richard Zemel
- Benchmarking Large Language Model Capabilities For Conditional Generation Maynez Joshua, Agrawal Priyanka, Gehrmann Sebastian
- Improving Automatic VQA Evaluation Using Large Language Models Mañas Oscar, Krojer Benno, Agrawal Aishwarya
- Sources Of Hallucination By Large Language Models On Inference Tasks Mckenna Nick, Li Tianyi, Cheng Liang, Hosseini Mohammad Javad, Johnson Mark, Steedman Mark
- JAB: Joint Adversarial Prompting And Belief Augmentation Mehrabi Ninareh, Goyal Palash, Ramakrishna Anil, Dhamala Jwala, Ghosh Shalini, Zemel Richard, Chang Kai-wei, Galstyan Aram, Gupta Rahul
- Towards Multilingual Automatic Dialogue Evaluation Mendonça John, Lavie Alon, Trancoso Isabel
- Generating Illustrated Instructions Menon Sachit, Misra Ishan, Girdhar Rohit
- For Generated Text, Is Nli-neutral Text The Best Text? Mersinias Michail, Mahowald Kyle
- From Bytes To Biases: Investigating The Cultural Self-perception Of Large Language Models Messner Wolfgang, Greene Tatum, Matalone Josephine
- An AI Chatbot For Explaining Deep Reinforcement Learning Decisions Of Service-oriented Systems Metzger Andreas, Bartel Jone, Laufer Jan
- Developing A Scalable Benchmark For Assessing Large Language Models In Knowledge Graph Engineering Meyer Lars-peter, Frey Johannes, Junghanns Kurt, Brei Felix, Bulert Kirill, Gründer-fahrer Sabine, Martin Michael
- Think Twice: Measuring The Efficiency Of Eliminating Prediction Shortcuts Of Question Answering Models Mikula Lukáš, Štefánik Michal, Petrovič Marek, Sojka Petr
- Beyond Accuracy: Evaluating Self-consistency Of Code Large Language Models With Identitychain Min Marcus J., Ding Yangruibo, Buratti Luca, Pujar Saurabh, Kaiser Gail, Jana Suman, Ray Baishakhi
- Attention Link: An Efficient Attention-based Low Resource Machine Translation Architecture Min Zeping
- PEFTT: Parameter-efficient Fine-tuning For Low-resource Tibetan Pre-trained Language Models Mingjun Zhou, Zhuoma Daiqing, Nuo Qun, Tashi Nyima
- Relu Strikes Back: Exploiting Activation Sparsity In Large Language Models Mirzadeh Iman, Alizadeh Keivan, Mehta Sachin, Del Mundo Carlo C, Tuzel Oncel, Samei Golnoosh, Rastegari Mohammad, Farajtabar Mehrdad
- VQA With Cascade Of Self- And Co-attention Blocks Mishra Aakansha, Anand Ashish, Guha Prithwijit
- LLM Aided Semi-supervision For Extractive Dialog Summarization Mishra Nishant, Sahu Gaurav, Calixto Iacer, Abu-hanna Ameen, Laradji Issam H.
- Synthetic Imitation Edit Feedback For Factual Alignment In Clinical Summarization Mishra Prakamya, Yao Zonghai, Chen Shuwei, Wang Beining, Mittal Rohan, Yu Hong
- Triggering Multi-hop Reasoning For Question Answering In Language Models Using Soft Prompts And Random Walks Misra Kanishka, Santos Cicero Nogueira Dos, Shakeri Siamak
- Evaluation Of Faithfulness Using The Longest Supported Subsequence Mittal Anirudh, Schick Timo, Artetxe Mikel, Dwivedi-yu Jane
- State Of What Art? A Call For Multi-prompt LLM Evaluation Mizrahi Moran, Kaplan Guy, Malkin Dan, Dror Rotem, Shahaf Dafna, Stanovsky Gabriel
- Roll Up Your Sleeves: Working With A Collaborative And Engaging Task-oriented Dialogue System Mo Lingbo, Chen Shijie, Chen Ziru, Deng Xiang, Lewis Ashley, Singh Sunit, Stevens Samuel, Tai Chang-you, Wang Zhen, Yue Xiang, Zhang Tianshu, Su Yu, Sun Huan
- Chatgpt In The Age Of Generative AI And Large Language Models: A Concise Survey Mohamadi Salman, Mujtaba Ghulam, Le Ngan, Doretto Gianfranco, Adjeroh Donald A.
- Adaptive Machine Translation With Large Language Models Moslem Yasmin, Haque Rejwanul, Kelleher John D., Way Andy
- Fine-tuning Large Language Models For Adaptive Machine Translation Moslem Yasmin, Haque Rejwanul, Way Andy
- The Impact Of Artificial Intelligence On The Evolution Of Digital Education: A Comparative Study Of Openai Text Generation Tools Including Chatgpt, Bing Chat, Bard, And Ernie Motlagh Negin Yazdani, Khajavi Matin, Sharifi Abbas, Ahmadi Mohsen
- Navigating Prompt Complexity For Zero-shot Classification: A Study Of Large Language Models In Computational Social Science Mu Yida, Wu Ben P., Thorne William, Robinson Ambrose, Aletras Nikolaos, Scarton Carolina, Bontcheva Kalina, Song Xingyi
- Augmenting Large Language Model Translators Via Translation Memories Mu Yongyu, Reheman Abudurexiti, Cao Zhiquan, Fan Yuchun, Li Bei, Li Yinqiao, Xiao Tong, Zhang Chunliang, Zhu Jingbo
- Assessing Translation Capabilities Of Large Language Models Involving English And Indian Languages Mujadia Vandan, Urlana Ashok, Bhaskar Yash, Pavani Penumalla Aditya, Shravya Kukkapalli, Krishnamurthy Parameswari, Sharma Dipti Misra
- "Medium" Lms Of Code In The Era Of Llms: Lessons From Stackoverflow Mukherjee Manisha, Hellendoorn Vincent J.
- A Brief History Of Prompt: Leveraging Language Models. (Through Advanced Prompting) Muktadir Golam Md
- Llms-healthcare: Current Applications And Challenges Of Large Language Models In Various Medical Specialties Mumtaz Ummara, Ahmed Awais, Mumtaz Summaya
- LLM4VV: Developing Llm-driven Testsuite For Compiler Validation Munley Christian, Jarmusch Aaron, Chandrasekaran Sunita
- Testing Llms On Code Generation With Varying Levels Of Prompt Specificity Murr Lincoln, Grainger Morgan, Gao David
- Self-contradictory Hallucinations Of Large Language Models: Evaluation, Detection And Mitigation Mündler Niels, He Jingxuan, Jenko Slobodan, Vechev Martin
- Generating Medically-accurate Summaries Of Patient-provider Dialogue: A Multi-stage Approach Using Large Language Models Nair Varun, Schumacher Elliot, Kannan Anitha
- DERA: Enhancing Large Language Model Completions With Dialog-enabled Resolving Agents Nair Varun, Schumacher Elliot, Tso Geoffrey, Kannan Anitha
- On Evaluating The Integration Of Reasoning And Action In LLM Agents With Database Question Answering Nan Linyong, Zhang Ellen, Zou Weijin, Zhao Yilun, Zhou Wenfei, Cohan Arman
- Enhancing Few-shot Text-to-sql Capabilities Of Large Language Models: A Study On Prompt Design Strategies Nan Linyong, Zhao Yilun, Zou Weijin, Ri Narutatsu, Tae Jaesung, Zhang Ellen, Cohan Arman, Radev Dragomir
- Cheaply Evaluating Inference Efficiency Metrics For Autoregressive Transformer Apis Narayanan Deepak, Santhanam Keshav, Henderson Peter, Bommasani Rishi, Lee Tony, Liang Percy
- Reinforcement Replaces Supervision: Query Focused Summarization Using Deep Reinforcement Learning Nath Swaroop, Khadilkar Harshad, Bhattacharyya Pushpak
- Large Language Models In Healthcare And Medical Domain: A Review Nazi Zabir Al, Peng Wei
- On Robustness Of Finetuned Transformer-based NLP Models Neerudu Pavan Kalyan Reddy, Oota Subba Reddy, Marreddy Mounika, Kagita Venkateswara Rao, Gupta Manish
- Exploring The Maze Of Multilingual Modeling Nezhad Sina Bagheri, Agrawal Ameeta
- Enhancing Logical Reasoning In Large Language Models To Facilitate Legal Applications Nguyen Ha-thanh, Fungwacharakorn Wachara, Satoh Ken
- Black-box Analysis: Gpts Across Time In Legal Textual Entailment Task Nguyen Ha-thanh, Goebel Randy, Toni Francesca, Stathis Kostas, Satoh Ken
- Cof-cot: Enhancing Large Language Models With Coarse-to-fine Chain-of-thought Prompting For Multi-domain NLU Tasks Nguyen Hoang H., Liu Ye, Zhang Chenwei, Zhang Tao, Yu Philip S.
- When Giant Language Brains Just Aren't Enough! Domain Pizzazz With Knowledge Sparkle Dust Nguyen Minh-tien, Nguyen Duy-hung, Sabahi Shahab, Le Hung, Yang Jeff, Hotta Hajime
- EVJVQA Challenge: Multilingual Visual Question Answering Nguyen Ngan Luu-thuy, Nguyen Nghia Hieu, Vo Duong T. D, Tran Khanh Quoc, Van Nguyen Kiet
- Culturax: A Cleaned, Enormous, And Multilingual Dataset For Large Language Models In 167 Languages Nguyen Thuat, Van Nguyen Chien, Lai Viet Dac, Man Hieu, Ngo Nghia Trung, Dernoncourt Franck, Rossi Ryan A., Nguyen Thien Huu
- Democratizing Llms For Low-resource Languages By Leveraging Their English Dominant Abilities With Linguistically-diverse Prompts Nguyen Xuan-phi, Aljunied Sharifah Mahani, Joty Shafiq, Bing Lidong
- LEVER: Learning To Verify Language-to-code Generation With Execution Ni Ansong, Iyer Srini, Radev Dragomir, Stoyanov Ves, Yih Wen-tau, Wang Sida I., Lin Xi Victoria
- L2ceval: Evaluating Language-to-code Generation Capabilities Of Large Language Models Ni Ansong, Yin Pengcheng, Zhao Yilun, Riddell Martin, Feng Troy, Shen Rui, Yin Stephen, Liu Ye, Yavuz Semih, Xiong Caiming, Joty Shafiq, Zhou Yingbo, Radev Dragomir, Cohan Arman
- Code-style In-context Learning For Knowledge-based Question Answering Nie Zhijie, Zhang Richong, Wang Zhongyuan, Liu Xudong
- Xgen-7b Technical Report Nijkamp Erik, Xie Tian, Hayashi Hiroaki, Pang Bo, Xia Congying, Xing Chen, Vig Jesse, Yavuz Semih, Laban Philippe, Krause Ben, Purushwalkam Senthil, Niu Tong, Kryściński Wojciech, Murakhovs'ka Lidiya, Choubey Prafulla Kumar, Fabbri Alex, Liu Ye, Meng Rui, Tu Lifu, Bhat Meghana, Wu Chien-sheng, Savarese Silvio, Zhou Yingbo, Joty Shafiq, Xiong Caiming
- Ragtruth: A Hallucination Corpus For Developing Trustworthy Retrieval-augmented Language Models Niu Cheng, Wu Yuanhao, Zhu Juno, Xu Siliang, Shum Kashun, Zhong Randy, Song Juntong, Zhang Tong
- Evaluating AI Vocational Skills Through Professional Testing Noever David, Ciolino Matt
- Chatbots As Problem Solvers: Playing Twenty Questions With Role Reversals Noever David, Mckee Forrest
- Capabilities Of GPT-4 On Medical Challenge Problems Nori Harsha, King Nicholas, Mckinney Scott Mayer, Carignan Dean, Horvitz Eric
- Contrastive Decoding Improves Reasoning In Large Language Models O'brien Sean, Lewis Mike
- Steering Language Generation: Harnessing Contrastive Expert Guidance And Negative Prompting For Coherent And Diverse Synthetic Data Generation O'neill Charles, Ting Yuan-sen, Ciuca Ioana, Miller Jack, Bui Thang
- Comparative Analysis Of CHATGPT And The Evolution Of Language Models Ogundare Oluwatosin, Araya Gustavo Quiros
- Blackvip: Black-box Visual Prompting For Robust Transfer Learning Oh Changdae, Hwang Hyeji, Lee Hee-young, Lim Yongtaek, Jung Geunyoung, Jung Jiyoung, Choi Hosik, Song Kyungwoo
- Data Augmentation For Neural Machine Translation Using Generative Language Model Oh Seokjin, Lee Su Ah, Jung Woohwan
- Large Language Model (LLM) Bias Index -- LLMBI Oketunji Abiodun Finbarrs, Anas Muhammad, Saina Deepthi
- Askit: Unified Programming Interface For Programming With Large Language Models Okuda Katsumi, Amarasinghe Saman
- Is Self-repair A Silver Bullet For Code Generation? Olausson Theo X., Inala Jeevana Priya, Wang Chenglong, Gao Jianfeng, Solar-lezama Armando
- Generative AI: Implications And Applications For Education Olga Anastasia Olnancy Tzirides, Saini Akash, Zapata Gabriela, Searsmith Duane, Cope Bill, Kalantzis Mary, Castro Vania, Kourkoulou Theodora, Jones John, Da Silva Rodrigo Abrantes, Whiting Jen, Kastania Nikoleta Polyxeni
- Thoughtsource: A Central Hub For Large Language Model Reasoning Data Ott Simon, Hebenstreit Konstantin, Liévin Valentin, Hother Christoffer Egeberg, Moradi Milad, Mayrhauser Maximilian, Praas Robert, Winther Ole, Samwald Matthias
- Dialogbench: Evaluating Llms As Human-like Dialogue Systems Ou Jiao, Lu Junda, Liu Che, Tang Yihong, Zhang Fuzheng, Zhang Di, Gai Kun
- The Shifted And The Overlooked: A Task-oriented Investigation Of User-gpt Interactions Ouyang Siru, Wang Shuohang, Liu Yang, Zhong Ming, Jiao Yizhu, Iter Dan, Pryzant Reid, Zhu Chenguang, Ji Heng, Han Jiawei
- Med-halt: Medical Domain Hallucination Test For Large Language Models Pal Ankit, Umapathi Logesh Kumar, Sankarasubbu Malaikannan
- Ontochatgpt Information System: Ontology-driven Structured Prompts For Chatgpt Meta-learning Palagin Oleksandr, Kaverinskiy Vladislav, Litvin Anna, Malakhov Kyrylo
- Towards Vision-language Mechanistic Interpretability: A Causal Tracing Tool For BLIP Palit Vedant, Pandey Rohan, Arora Aryaman, Liang Paul Pu
- Stelocoder: A Decoder-only LLM For Multi-language To Python Code Translation Pan Jialing, Sadé Adrien, Kim Jin, Soriano Eric, Sole Guillem, Flamant Sylvain
- Retrieving-to-answer: Zero-shot Video Question Answering With Frozen Large Language Models Pan Junting, Lin Ziyi, Ge Yuying, Zhu Xiatian, Zhang Renrui, Wang Yi, Qiao Yu, Li Hongsheng
- Automatically Correcting Large Language Models: Surveying The Landscape Of Diverse Self-correction Strategies Pan Liangming, Saxon Michael, Xu Wenda, Nathani Deepak, Wang Xinyi, Wang William Yang
- Semantic Composition In Visually Grounded Language Models Pandey Rohan
- Automating Customer Service Using Langchain: Building Custom Open-source GPT Chatbot For Organizations Pandya Keivalya, Holia Mehfuza
- Language Model Self-improvement By Reinforcement Learning Contemplation Pang Jing-cheng, Wang Pengyuan, Li Kaiyuan, Chen Xiong-hui, Xu Jiacheng, Zhang Zongzhang, Yu Yang
- Frozen Transformers In Language Models Are Effective Visual Encoder Layers Pang Ziqi, Xie Ziyang, Man Yunze, Wang Yu-xiong
- Clip-guided Vision-language Pre-training For Question Answering In 3D Scenes Parelli Maria, Delitzas Alexandros, Hars Nikolas, Vlassis Georgios, Anagnostidis Sotirios, Bachmann Gregor, Hofmann Thomas
- Interpretable Visual Question Answering Via Reasoning Supervision Parelli Maria, Mallis Dimitrios, Diomataris Markos, Pitsikalis Vassilis
- Abductive Reasoning With The GPT-4 Language Model: Case Studies From Criminal Investigation, Medical Practice, Scientific Research Pareschi Remo
- Localized Symbolic Knowledge Distillation For Visual Commonsense Models Park Jae Sung, Hessel Jack, Chandu Khyathi Raghavi, Liang Paul Pu, Lu Ximing, West Peter, Yu Youngjae, Huang Qiuyuan, Gao Jianfeng, Farhadi Ali, Choi Yejin
- Graph Elicitation For Guiding Multi-step Reasoning In Large Language Models Park Jinyoung, Patel Ameen, Khan Omar Zia, Kim Hyunwoo J., Kim Joo-kyung
- 1.5 Million Materials Narratives Generated By Chatbots Park Yang Jeong, Jerng Sung Eun, Park Jin-sung, Kwon Choah, Hsu Chia-wei, Ren Zhichu, Yoon Sungroh, Li Ju
- A Large Language Model Approach To Educational Survey Feedback Analysis Parker Michael J., Anderson Caitlin, Stone Claire, Oh Yearim
- Evaluating In-context Learning Of Libraries For Code Generation Patel Arkil, Reddy Siva, Bahdanau Dzmitry, Dasigi Pradeep
- Sequencing Matters: A Generate-retrieve-generate Model For Building Conversational Agents Patwardhan Quinn, Yang Grace Hui
- Evaluation Of Medium-large Language Models At Zero-shot Closed Book Generative Question Answering Peinl René, Wirth Johannes
- Self-driven Grounding: Large Language Model Agents With Automatical Language-aligned Skill Learning Peng Shaohui, Hu Xing, Yi Qi, Zhang Rui, Guo Jiaming, Huang Di, Tian Zikang, Chen Ruizhi, Du Zidong, Guo Qi, Chen Yunji, Li Ling
- Promptinfuser: How Tightly Coupling AI And UI Design Impacts Designers' Workflows Petridis Savvas, Terry Michael, Cai Carrie J.
- Recjpq: Training Large-catalogue Sequential Recommenders Petrov Aleksandr V., Macdonald Craig
- Zero-shot Question Answering Over Financial Documents Using Large Language Models Phogat Karmvir Singh, Harsha Chetan, Dasaratha Sridhar, Ramakrishna Shashishekar, Puranam Sai Akhil
- LLM Self Defense: By Self Examination, Llms Know They Are Being Tricked Phute Mansi, Helbling Alec, Hull Matthew, Peng Shengyun, Szyller Sebastian, Cornelius Cory, Chau Duen Horng
- Emptying The Ocean With A Spoon: Should We Edit Models? Pinter Yuval, Elhadad Michael
- Typhoon: Thai Large Language Models Pipatanakul Kunat, Jirabovonvisut Phatrasek, Manakul Potsawee, Sripaisarnmongkol Sittipong, Patomwong Ruangsak, Chokchainant Pathomporn, Tharnpipitchai Kasima
- Ai-assisted Coding: Experiments With GPT-4 Poldrack Russell A, Lu Thomas, Beguš Gašper
- Rephrase, Augment, Reason: Visual Grounding Of Questions For Vision-language Models Prasad Archiki, Stengel-eskin Elias, Bansal Mohit
- Epik-eval: Evaluation For Language Models As Epistemic Models Prato Gabriele, Huang Jerry, Parthasarathi Prasanna, Sodhani Shagun, Chandar Sarath
- CORE-GPT: Combining Open Access Research And Large Language Models For Credible, Trustworthy Question Answering Pride David, Cancellieri Matteo, Knoth Petr
- Chatgpt Vs Human-authored Text: Insights Into Controllable Text Summarization And Sentence Style Transfer Pu Dongqi, Demberg Vera
- Summarization Is (Almost) Dead Pu Xiao, Gao Mingqi, Wan Xiaojun
- Decomposed Prompting For Machine Translation Between Related Languages Using Large Language Models Puduppully Ratish, Kunchukuttan Anoop, Dabre Raj, Aw Ai Ti, Chen Nancy F.
- Fine-tuning Aligned Language Models Compromises Safety, Even When Users Do Not Intend To! Qi Xiangyu, Zeng Yi, Xie Tinghao, Chen Pin-yu, Jia Ruoxi, Mittal Prateek, Henderson Peter
- "Merge Conflicts!" Exploring The Impacts Of External Distractors To Parametric Knowledge Graphs Qian Cheng, Zhao Xinran, Wu Sherry Tongshuang
- Taskweaver: A Code-first Agent Framework Qiao Bo, Li Liqun, Zhang Xu, He Shilin, Kang Yu, Zhang Chaoyun, Yang Fangkai, Dong Hang, Zhang Jue, Wang Lu, Ma Minghua, Zhao Pu, Qin Si, Qin Xiaoting, Du Chao, Xu Yong, Lin Qingwei, Rajmohan Saravan, Zhang Dongmei
- Gameeval: Evaluating Llms On Conversational Games Qiao Dan, Wu Chenfei, Liang Yaobo, Li Juntao, Duan Nan
- In-context Learning With Iterative Demonstration Selection Qin Chengwei, Zhang Aston, Chen Chen, Dagar Anirudh, Ye Wenming
- Faithful Chain-of-thought Reasoning Qing Lyu, Shreya Havaldar, Adam Stein, Li Zhang, Delip Rao, Eric Wong, Marianna Apidianaki, Chris Callison-burch
- Adalora: Adaptive Budget Allocation For Parameter-efficient Fine-tuning Qingru Zhang, Minshuo Chen, Alexander Bukharin, Nikos Karampatziakis, Pengcheng He, Yu Cheng, Weizhu Chen, Tuo Zhao
- Codegeex: A Pre-trained Model For Code Generation With Multilingual Benchmarking On Humaneval-x Qinkai Zheng, Xiao Xia, Xu Zou, Yuxiao Dong, Shan Wang, Yufei Xue, Zihan Wang, Lei Shen, Andi Wang, Yang Li, Teng Su, Zhilin Yang, Jie Tang
- Embodied Executable Policy Learning With Language-based Scene Summarization Qiu Jielin, Xu Mengdi, Han William, Moon Seungwhan, Zhao Ding
- Direct Preference Optimization: Your Language Model Is Secretly A Reward Model Rafael Rafailov, Archit Sharma, Eric Mitchell, Stefano Ermon, Christopher D. Manning, Chelsea Finn
- Logic-scaffolding: Personalized Aspect-instructed Recommendation Explanation Generation Using Llms Rahdari Behnam, Ding Hao, Fan Ziwei, Ma Yifei, Chen Zhuotong, Deoras Anoop, Kveton Branislav
- Quantized Transformer Language Model Implementations On Edge Devices Rahman Mohammad Wali Ur, Abrar Murad Mehrab, Copening Hunter Gibbons, Hariri Salim, Shao Sicong, Satam Pratik, Salehi Soheil
- Semantic Consistency For Assuring Reliability Of Large Language Models Raj Harsh, Gupta Vipul, Rosati Domenic, Majumdar Subhabrata
- K-PERM: Personalized Response Generation Using Dynamic Knowledge Retrieval And Persona-adaptive Queries Raj Kanak, Roy Kaushik, Bonagiri Vamshi, Govil Priyanshul, Thirunarayanan Krishnaprasad, Gaur Manas
- Multimodal Multi-hop Question Answering Through A Conversation Between Tools And Efficiently Finetuned Large Language Models Rajabzadeh Hossein, Wang Suyuchen, Kwon Hyock Ju, Liu Bang
- Does The "Most Sinfully Decadent Cake Ever" Taste Good? Answering Yes/no Questions From Figurative Contexts Rakshit Geetanjali, Flanigan Jeffrey
- Controllable Generation Of Dialogue Acts For Dialogue Systems Via Few-shot Response Generation And Ranking Ramirez Angela, Agarwal Karik, Juraska Juraj, Garg Utkarsh, Walker Marilyn A.
- Tailoring Self-rationalizers With Multi-reward Distillation Ramnath Sahana, Joshi Brihi, Hallinan Skyler, Lu Ximing, Li Liunian Harold, Chan Aaron, Hessel Jack, Choi Yejin, Ren Xiang
- Aligning Neural Machine Translation Models: Human Feedback In Training And Inference Ramos Miguel Moura, Fernandes Patrick, Farinhas António, Martins André F. T.
- Evaluating Llms On Document-based QA: Exact Answer Selection And Numerical Extraction Using Cogtale Dataset Rasool Zafaryab, Kurniawan Stefanus, Balugo Sherwin, Barnett Scott, Vasa Rajesh, Chesser Courtney, Hampstead Benjamin M., Belleville Sylvie, Mouzakis Kon, Bahar-fuchs Alex
- Dissecting In-context Learning Of Translations In Gpts Raunak Vikas, Awadalla Hany Hassan, Menezes Arul
- Do Gpts Produce Less Literal Translations? Raunak Vikas, Menezes Arul, Post Matt, Awadalla Hany Hassan
- Leveraging GPT-4 For Automatic Translation Post-editing Raunak Vikas, Sharaf Amr, Wang Yiren, Awadallah Hany Hassan, Menezes Arul
- On Context Utilization In Summarization With Large Language Models Ravaut Mathieu, Sun Aixin, Chen Nancy F., Joty Shafiq
- Dial Beinfo For Faithfulness: Improving Factuality Of Information-seeking Dialogue Via Behavioural Fine-tuning Razumovskaia Evgeniia, Vulić Ivan, Marković Pavle, Cichy Tomasz, Zheng Qian, Wen Tsung-hsien, Budzianowski Paweł
- Nemo Guardrails: A Toolkit For Controllable And Safe LLM Applications With Programmable Rails Rebedea Traian, Dinu Razvan, Sreedhar Makesh, Parisien Christopher, Cohen Jonathan
- Towards A Performance Analysis On Pre-trained Visual Question Answering Models For Autonomous Driving Rekanar Kaavya, Eising Ciarán, Sistu Ganesh, Hayes Martin
- Investigating The Factual Knowledge Boundary Of Large Language Models With Retrieval Augmentation Ren Ruiyang, Wang Yuhao, Qu Yingqi, Zhao Wayne Xin, Liu Jing, Tian Hao, Wu Hua, Wen Ji-rong, Wang Haifeng
- Sparq Attention: Bandwidth-efficient LLM Inference Ribar Luka, Chelombiev Ivan, Hudlass-galley Luke, Blake Charlie, Luschi Carlo, Orr Douglas
- Integrating Summarization And Retrieval For Enhanced Personalization Via Large Language Models Richardson Chris, Zhang Yao, Gillespie Kellen, Kar Sudipta, Singh Arshdeep, Raeesy Zeynab, Khan Omar Zia, Sethy Abhinav
- Commonsense Reasoning For Conversational AI: A Survey Of The State Of The Art Richardson Christopher, Heck Larry
- SYNDICOM: Improving Conversational Commonsense With Error-injection And Natural Language Feedback Richardson Christopher, Sundar Anirudh, Heck Larry
- Chatgpt MT: Competitive For High- (But Not Low-) Resource Languages Robinson Nathaniel R., Ogayo Perez, Mortensen David R., Neubig Graham
- Benchmarking Causal Study To Interpret Large Language Models For Source Code Rodriguez-cardenas Daniel, Palacio David N., Khati Dipin, Burke Henry, Poshyvanyk Denys
- Carexpert: Leveraging Large Language Models For In-car Conversational Question Answering Rony Md Rashad Al Hasan, Suess Christian, Bhat Sinchana Ramakanth, Sudhi Viju, Schneider Julia, Vogel Maximilian, Teucher Roman, Friedl Ken E., Sahoo Soumya
- Grading Conversational Responses Of Chatbots Rosario Grant, Noever David
- Axiomatic Preference Modeling For Longform Question Answering Rosset Corby, Zheng Guoqing, Dibia Victor, Awadallah Ahmed, Bennett Paul
- Knowledge-infused Self Attention Transformers Roy Kaushik, Zi Yuxin, Narayanan Vignesh, Gaur Manas, Sheth Amit
- Unveiling The Potential Of Large Language Models In Generating Semantic And Cross-language Clones Roy Palash R., Alam Ajmain I., Al-omari Farouq, Roy Banani, Roy Chanchal K., Schneider Kevin A.
- From Chatbots To Phishbots? -- Preventing Phishing Scams Created Using Chatgpt, Google Bard And Claude Roy Sayak Saha, Thota Poojitha, Naragam Krishna Vamsi, Nilizadeh Shirin
- Journey Of Hallucination-minimized Generative AI Solutions For Financial Decision Makers Roychowdhury Sohini
- Hallucination-minimized Data-to-answer Framework For Financial Decision-makers Roychowdhury Sohini, Alvarez Andres, Moore Brian, Krema Marko, Gelpi Maria Paz, Rodriguez Federico Martin, Rodriguez Angel, Cabrejas Jose Ramon, Serrano Pablo Martinez, Agrawal Punit, Mukherjee Arijit
- Code Llama: Open Foundation Models For Code Rozière Baptiste, Gehring Jonas, Gloeckle Fabian, Sootla Sten, Gat Itai, Tan Xiaoqing Ellen, Adi Yossi, Liu Jingyu, Sauvestre Romain, Remez Tal, Rapin Jérémy, Kozhevnikov Artyom, Evtimov Ivan, Bitton Joanna, Bhatt Manish, Ferrer Cristian Canton, Grattafiori Aaron, Xiong Wenhan, Défossez Alexandre, Copet Jade, Azhar Faisal, Touvron Hugo, Martin Louis, Usunier Nicolas, Scialom Thomas, Synnaeve Gabriel
- TPTU: Large Language Model-based AI Agents For Task Planning And Tool Usage Ruan Jingqing, Chen Yihong, Zhang Bin, Xu Zhiwei, Bao Tianpeng, Du Guoqing, Shi Shiwei, Mao Hangyu, Li Ziyue, Zeng Xingyu, Zhao Rui
- Identifying The Risks Of LM Agents With An Lm-emulated Sandbox Ruan Yangjun, Dong Honghua, Wang Andrew, Pitis Silviu, Zhou Yongchao, Ba Jimmy, Dubois Yann, Maddison Chris J., Hashimoto Tatsunori
- Delucionqa: Detecting Hallucinations In Domain-specific Question Answering Sadat Mobashir, Zhou Zhengyu, Lange Lukas, Araki Jun, Gundroo Arsalan, Wang Bingqing, Menon Rakesh R, Parvez Md Rizwan, Feng Zhe
- Uncertainty Guided Global Memory Improves Multi-hop Question Answering Sagirova Alsu, Burtsev Mikhail
- Branch-solve-merge Improves Large Language Model Evaluation And Generation Saha Swarnadeep, Levy Omer, Celikyilmaz Asli, Bansal Mohit, Weston Jason, Li Xian
- Attention Lens: A Tool For Mechanistically Interpreting The Attention Head Information Retrieval Mechanism Sakarvadia Mansi, Khan Arham, Ajith Aswathy, Grzenda Daniel, Hudson Nathaniel, Bauer André, Chard Kyle, Foster Ian
- Extending The Frontier Of Chatgpt: Code Generation And Debugging Sakib Fardin Ahsan, Khan Saadat Hasan, Karim A. H. M. Rezaul
- A Symmetric Dual Encoding Dense Retrieval Framework For Knowledge-intensive Visual Question Answering Salemi Alireza, Pizzorno Juan Altmayer, Zamani Hamed
- Pre-training Multi-modal Dense Retrievers For Outside-knowledge Visual Question Answering Salemi Alireza, Rafiee Mahta, Zamani Hamed
- Zero-shot Translation Of Attention Patterns In VQA Models To Natural Language Salewski Leonard, Koepke A. Sophia, Lensch Hendrik P. A., Akata Zeynep
- Breaking The Silence: The Threats Of Using Llms In Software Engineering Sallou June, Durieux Thomas, Panichella Annibale
- Uni-nlx: Unifying Textual Explanations For Vision And Vision-language Tasks Sammani Fawaz, Deligiannis Nikos
- Testing The Limits Of Unified Sequence To Sequence LLM Pretraining On Diverse Table Data Tasks Sarkar Soumajyoti, Lausen Leonard
- Inseq: An Interpretability Toolkit For Sequence Generation Models Sarti Gabriele, Feldhus Nils, Sickert Ludwig, Van Der Wal Oskar, Nissim Malvina, Bisazza Arianna
- RAMP: Retrieval And Attribute-marking Enhanced Prompting For Attribute-controlled Translation Sarti Gabriele, Htut Phu Mon, Niu Xing, Hsu Benjamin, Currey Anna, Dinu Georgiana, Nadejde Maria
- Unlocking Practical Applications In Legal Domain: Evaluation Of GPT For Zero-shot Semantic Annotation Of Legal Texts Savelka Jaromir
- Cross-lingual Supervision Improves Large Language Models Pre-training Schioppa Andrea, Garcia Xavier, Firat Orhan
- A Review Of Repository Level Prompting For Llms Schonholtz Douglas
- Ignore This Title And Hackaprompt: Exposing Systemic Vulnerabilities Of Llms Through A Global Scale Prompt Hacking Competition Schulhoff Sander, Pinto Jeremy, Khan Anaum, Bouchard Louis-françois, Si Chenglei, Anati Svetlina, Tagliabue Valen, Kost Anson Liu, Carnahan Christopher, Boyd-graber Jordan
- SEMQA: Semi-extractive Multi-source Question Answering Schuster Tal, Lelkes Adam D., Sun Haitian, Gupta Jai, Berant Jonathan, Cohen William W., Metzler Donald
- Surgicalgpt: End-to-end Language-vision GPT For Visual Question Answering In Surgery Seenivasan Lalithkumar, Islam Mobarakol, Kannan Gokul, Ren Hongliang
- Let's Have A Chat! A Conversation With Chatgpt: Technology, Applications, And Limitations Shahriar Sakib, Hayawi Kadhim
- Hero: Roberta And Longformer Hebrew Language Models Shalumov Vitaly, Haskey Harel
- PB-LLM: Partially Binarized Large Language Models Shang Yuzhang, Yuan Zhihang, Wu Qiang, Dong Zhen
- One-shot Sensitivity-aware Mixed Sparsity Pruning For Large Language Models Shao Hang, Liu Bei, Xiao Bo, Zeng Ke, Wan Guanglu, Qian Yanmin
- Enhancing Retrieval-augmented Large Language Models With Iterative Retrieval-generation Synergy Shao Zhihong, Gong Yeyun, Shen Yelong, Huang Minlie, Duan Nan, Chen Weizhu
- Language Is Not All You Need: Aligning Perception With Language Models Shaohan Huang, Li Dong, Wenhui Wang, Yaru Hao, Saksham Singhal, Shuming Ma, Tengchao Lv, Lei Cui, Owais Khan Mohammed, Barun Patra, Qiang Liu, Kriti Aggarwal, Zewen Chi, Johan Bjorck, Vishrav Chaudhary, Subhojit Som, Xia Song, Furu Wei
- Democratizing Llms: An Exploration Of Cost-performance Trade-offs In Self-refined Open-source Models Shashidhar Sumuk, Chinta Abhinav, Sahai Vaibhav, Wang Zhenhailong, Ji Heng
- Exploring The Factual Consistency In Dialogue Comprehension Of Large Language Models She Shuaijie, Huang Shujian, Wang Xingyun, Zhou Yanke, Chen Jiajun
- Pangu-coder2: Boosting Large Language Models For Code With Ranking Feedback Shen Bo, Zhang Jiaxin, Chen Taihong, Zan Daoguang, Geng Bing, Fu An, Zeng Muhan, Yu Ailun, Ji Jichuan, Zhao Jingyang, Guo Yuenan, Wang Qianxiang
- Is Translation Helpful? An Empirical Analysis Of Cross-lingual Transfer In Low-resource Dialog Generation Shen Lei, Yu Shuai, Shen Xiaoyu
- Retrieval-generation Alignment For End-to-end Task-oriented Dialogue System Shen Weizhou, Gao Yingqi, Huang Canbin, Wan Fanqi, Quan Xiaojun, Bi Wei
- Graphextqa: A Benchmark For Evaluating Graph-enhanced Large Language Models Shen Yuanchun, Liao Ruotong, Han Zhen, Ma Yunpu, Tresp Volker
- Why Does Chatgpt Fall Short In Providing Truthful Answers? Shen Zheng, Jie Huang, Kevin Chen-chuan Chang
- Crossget: Cross-guided Ensemble Of Tokens For Accelerating Vision-language Transformers Shi Dachuan, Tao Chaofan, Rao Anyi, Yang Zhendong, Yuan Chun, Wang Jiaqi
- Sotana: The Open-source Software Development Assistant Shi Ensheng, Zhang Fengji, Wang Yanlin, Chen Bei, Du Lun, Zhang Hongyu, Han Shi, Zhang Dongmei, Sun Hongbin
- Prompt Space Optimizing Few-shot Reasoning Success With Large Language Models Shi Fobo, Qing Peijun, Yang Dong, Wang Nan, Lei Youbo, Lu Haonan, Lin Xiaodong, Li Duantengchuan
- QADYNAMICS: Training Dynamics-driven Synthetic QA Diagnostic For Zero-shot Commonsense Question Answering Shi Haochen, Wang Weiqi, Fang Tianqing, Xu Baixuan, Ding Wenxuan, Liu Xin, Song Yangqiu
- Trusting Your Evidence: Hallucinate Less With Context-aware Decoding Shi Weijia, Han Xiaochuang, Lewis Mike, Tsvetkov Yulia, Zettlemoyer Luke, Yih Scott Wen-tau
- MKRAG: Medical Knowledge Retrieval Augmented Generation For Medical Question Answering Shi Yucheng, Xu Shaochen, Yang Tianze, Liu Zhengliang, Liu Tianming, Li Quanzheng, Li Xiang, Liu Ninghao
- RADE: Reference-assisted Dialogue Evaluation For Open-domain Dialogue Shi Zhengliang, Sun Weiwei, Zhang Shuo, Zhang Zhen, Ren Pengjie, Ren Zhaochun
- Exploring The Robustness Of Large Language Models For Solving Programming Problems Shirafuji Atsushi, Watanobe Yutaka, Ito Takumi, Morishita Makoto, Nakamura Yuki, Oda Yusuke, Suzuki Jun
- SCREWS: A Modular Framework For Reasoning With Revisions Shridhar Kumar, Jhamtani Harsh, Fang Hao, Van Durme Benjamin, Eisner Jason, Xia Patrick
- What Does CLIP Know About A Red Circle? Visual Prompt Engineering For Vlms Shtedritski Aleksandar, Rupprecht Christian, Vedaldi Andrea
- POSQA: Probe The World Models Of Llms With Size Comparisons Shu Chang, Han Jiuzhou, Liu Fangyu, Shareghi Ehsan, Collier Nigel
- Fusion-eval: Integrating Assistant Evaluators With Llms Shu Lei, Wichers Nevan, Luo Liangchen, Zhu Yun, Liu Yinxiao, Chen Jindong, Meng Lei
- SC-ML: Self-supervised Counterfactual Metric Learning For Debiased Visual Question Answering Shu Xinyao, Yan Shiyang, Yang Xu, Wu Ziheng, Chen Zhongfeng, Lu Zhenyu
- A Comprehensive Evaluation Of Large Language Models On Legal Judgment Prediction Shui Ruihao, Cao Yixin, Wang Xiang, Chua Tat-seng
- Meta-control Of Dialogue Systems Using Large Language Models Shukuri Kotaro, Ishigaki Ryoma, Suzuki Jundai, Naganuma Tsubasa, Fujimoto Takuma, Kawakubo Daisuke, Shuzo Masaki, Maeda Eisaku
- Automatic Prompt Augmentation And Selection With Chain-of-thought From Labeled Data Shum Kashun, Diao Shizhe, Zhang Tong
- Getting More Out Of Mixture Of Language Model Reasoning Experts Si Chenglei, Shi Weijia, Zhao Chen, Zettlemoyer Luke, Boyd-graber Jordan
- Knowledge Unlearning For Llms: Tasks, Methods, And Challenges Si Nianwen, Zhang Hao, Chang Heyu, Zhang Wenlin, Qu Dan, Zhang Weiqiang
- Combo Of Thinking And Observing For Outside-knowledge VQA Si Qingyi, Mo Yuchen, Lin Zheng, Ji Huishan, Wang Weiping
- Mondrian: Prompt Abstraction Attack Against Large Language Models For Cheaper API Pricing Si Wai Man, Backes Michael, Zhang Yang
- GPT-4 As An Agronomist Assistant? Answering Agriculture Exams Using Large Language Models Silva Bruno, Nunes Leonardo, Estevão Roberto, Aski Vijay, Chandra Ranveer
- Generalized Planning In PDDL Domains With Pretrained Large Language Models Silver Tom, Dan Soham, Srinivas Kavitha, Tenenbaum Joshua B., Kaelbling Leslie Pack, Katz Michael
- The Confidence-competence Gap In Large Language Models: A Cognitive Study Singh Aniket Kumar, Devkota Suman, Lamichhane Bishal, Dhakal Uttam, Dhakal Chandra
- Explaining Black Box Text Modules In Natural Language With Language Models Singh Chandan, Hsu Aliyah R., Antonello Richard, Jain Shailee, Huth Alexander G., Yu Bin, Gao Jianfeng
- Assessing GPT4-V On Structured Reasoning Tasks Singh Mukul, Cambronero José, Gulwani Sumit, Le Vu, Verbruggen Gust
- Towards Expert-level Medical Question Answering With Large Language Models Singhal Karan, Tu Tao, Gottweis Juraj, Sayres Rory, Wulczyn Ellery, Hou Le, Clark Kevin, Pfohl Stephen, Cole-lewis Heather, Neal Darlene, Schaekermann Mike, Wang Amy, Amin Mohamed, Lachgar Sami, Mansfield Philip, Prakash Sushant, Green Bradley, Dominowska Ewa, Arcas Blaise Aguera Y, Tomasev Nenad, Liu Yun, Wong Renee, Semturs Christopher, Mahdavi S. Sara, Barral Joelle, Webster Dale, Corrado Greg S., Matias Yossi, Azizi Shekoofeh, Karthikesalingam Alan, Natarajan Vivek
- A Long Way To Go: Investigating Length Correlations In RLHF Singhal Prasann, Goyal Tanya, Xu Jiacheng, Durrett Greg
- EEL: Efficiently Encoding Lattices For Reranking Singhal Prasann, Xu Jiacheng, Ye Xi, Durrett Greg
- Don't Add, Don't Miss: Effective Content Preserving Generation From Pre-selected Text Spans Slobodkin Aviv, Caciularu Avi, Hirsch Eran, Dagan Ido
- On Early Detection Of Hallucinations In Factual Question Answering Snyder Ben, Moisescu Marius, Zafar Muhammad Bilal
- Decoding Chatgpt: A Taxonomy Of Existing Research, Current Challenges, And Possible Future Directions Sohail Shahab Saquib, Farhat Faiza, Himeur Yassine, Nadeem Mohammad, Madsen Dag Øivind, Singh Yashbir, Atalla Shadi, Mansoor Wathiq
- SPARSEFIT: Few-shot Prompting With Sparse Fine-tuning For Jointly Generating Predictions And Natural Language Explanations Solano Jesus, Sanni Mardhiyah, Camburu Oana-maria, Minervini Pasquale
- Biomedical Knowledge Graph-optimized Prompt Generation For Large Language Models Soman Karthik, Rose Peter W, Morris John H, Akbas Rabia E, Smith Brett, Peetoom Braian, Villouta-reyes Catalina, Cerono Gabriel, Shi Yongmei, Rizk-jackson Angela, Israni Sharat, Nelson Charlotte A, Huang Sui, Baranzini Sergio E
- Dynamics Of Instruction Tuning: Each Ability Of Large Language Models Has Its Own Growth Pace Song Chiyu, Zhou Zhanchao, Yan Jianhao, Fei Yuejiao, Lan Zhenzhong, Zhang Yue
- Zebra: Extending Context Window With Layerwise Grouped Local-global Attention Song Kaiqiang, Wang Xiaoyang, Cho Sangwoo, Pan Xiaoman, Yu Dong
- Deduction Under Perturbed Evidence: Probing Student Simulation Capabilities Of Large Language Models Sonkar Shashank, Baraniuk Richard G.
- Joint Prompt Optimization Of Stacked Llms Using Variational Inference Sordoni Alessandro, Yuan Xingdi, Côté Marc-alexandre, Pereira Matheus, Trischler Adam, Xiao Ziang, Hosseini Arian, Niedtner Friederike, Roux Nicolas Le
- Learning Ui-to-code Reverse Generator Using Visual Critic Without Rendering Soselia Davit, Saifullah Khalid, Zhou Tianyi
- Curriculum-guided Abstractive Summarization Sotudeh Sajad, Deilamsalehy Hanieh, Dernoncourt Franck, Goharian Nazli
- Chatgpt: A Study On Its Utility For Ubiquitous Software Engineering Tasks Sridhara Giriprasad, G. Ranjani H., Mazumdar Sourav
- Response-act Guided Reinforced Dialogue Generation For Mental Health Counseling Srivastava Aseem, Pandey Ishan, Akhtar Md. Shad, Chakraborty Tanmoy
- Structured Packing In LLM Training Improves Long Context Utilization Staniszewski Konrad, Tworkowski Szymon, Jaszczur Sebastian, Zhao Yu, Michalewski Henryk, Kuciński Łukasz, Miłoś Piotr
- Seeing Seeds Beyond Weeds: Green Teaming Generative AI For Beneficial Uses Stapleton Logan, Taylor Jordan, Fox Sarah, Wu Tongshuang, Zhu Haiyi
- Distilled GPT For Source Code Summarization Su Chia-yi, Mcmillan Collin
- Learning From Red Teaming: Gender Bias Provocation And Mitigation In Large Language Models Su Hsuan, Cheng Cheng-chu, Farn Hua, Kumar Shachi H, Sahay Saurav, Chen Shang-tse, Lee Hung-yi
- The Synergy Of Speculative Decoding And Batching In Serving Large Language Models Su Qidong, Giannoula Christina, Pekhimenko Gennady
- Choice Fusion As Knowledge For Zero-shot Dialogue State Tracking Su Ruolin, Yang Jingfeng, Wu Ting-wei, Juang Biing-hwang
- Dial-mae: Contextual Masked Auto-encoder For Retrieval-based Dialogue Systems Su Zhenpeng, Wu Xing, Zhou Wei, Ma Guangyuan, Hu Songlin
- HC3 Plus: A Semantic-invariant Human Chatgpt Comparison Corpus Su Zhenpeng, Wu Xing, Zhou Wei, Ma Guangyuan, Hu Songlin
- Can Large Language Models Change User Preference Adversarially? Subhash Varshini
- Jmedlora: Medical Domain Adaptation On Japanese Large Language Models Using Instruction-tuning Sukeda Issey, Suzuki Masahiro, Sakaji Hiroki, Kodera Satoshi
- Datatales: Investigating The Use Of Large Language Models For Authoring Data-driven Articles Sultanum Nicole, Srinivasan Arjun
- Towards Verifiable Text Generation With Evolving Memory And Self-reflection Sun Hao, Cai Hengyi, Wang Bo, Hou Yingyan, Wei Xiaochi, Wang Shuaiqiang, Zhang Yan, Yin Dawei
- Think-on-graph: Deep And Responsible Reasoning Of Large Language Model On Knowledge Graph Sun Jiashuo, Xu Chengjin, Tang Lumingyuan, Wang Saizhuo, Lin Chen, Gong Yeyun, Ni Lionel M., Shum Heung-yeung, Guo Jian
- Make Prompt-based Black-box Tuning Colorful: Boosting Model Generalization From Three Orthogonal Perspectives Sun Qiushi, Han Chengcheng, Chen Nuo, Zhu Renyu, Gong Jingyang, Li Xiang, Gao Ming
- Generative Multimodal Models Are In-context Learners Sun Quan, Cui Yufeng, Zhang Xiaosong, Zhang Fan, Yu Qiying, Luo Zhengxiong, Wang Yueze, Rao Yongming, Liu Jingjing, Huang Tiejun, Wang Xinlong
- A Prompt Learning Framework For Source Code Summarization Sun Weisong, Fang Chunrong, You Yudu, Chen Yuchen, Liu Yi, Wang Chong, Zhang Jian, Zhang Quanjun, Qian Hanwei, Zhao Wei, Liu Yang, Chen Zhenyu
- Multimodal Question Answering For Unified Information Extraction Sun Yuxuan, Zhang Kai, Su Yu
- A Short Survey Of Viewing Large Language Models In Legal Aspect Sun Zhongxiang
- Large Language Models As Generalizable Policies For Embodied Tasks Szot Andrew, Schwarzer Max, Agrawal Harsh, Mazoure Bogdan, Talbott Walter, Metcalf Katherine, Mackraz Natalie, Hjelm Devon, Toshev Alexander
- Gender-specific Machine Translation With Large Language Models Sánchez Eduardo, Andrews Pierre, Stenetorp Pontus, Artetxe Mikel, Costa-jussà Marta R.
- Tackling VQA With Pretrained Foundation Models Without Further Training Tan Alvin De Jun, Shen Bingquan
- Cappy: Outperforming And Boosting Large Multi-task Lms With A Small Scorer Tan Bowen, Zhu Yun, Liu Lijuan, Xing Eric, Hu Zhiting, Chen Jindong
- Copilot For Xcode: Exploring Ai-assisted Programming By Prompting Cloud-based Large Language Models Tan Chee Wei, Guo Shangxin, Wong Man Fai, Hang Ching Nam
- Massive Editing For Large Language Models Via Meta Learning Tan Chenmien, Zhang Ge, Fu Jie
- Towards Robust Temporal Reasoning Of Large Language Models Via A Multi-hop QA Dataset And Pseudo-instruction Tuning Tan Qingyu, Ng Hwee Tou, Bing Lidong
- Medchatzh: A Better Medical Adviser Learns From Better Instructions Tan Yang, Li Mingchen, Huang Zijie, Yu Huiqun, Fan Guisheng
- Can Chatgpt Replace Traditional KBQA Models? An In-depth Analysis Of The Question Answering Performance Of The GPT LLM Family Tan Yiming, Min Dehai, Li Yu, Li Wenbo, Hu Nan, Chen Yongrui, Qi Guilin
- Sparsity-guided Holistic Explanation For Llms With Interpretable Inference-time Intervention Tan Zhen, Chen Tianlong, Zhang Zhenyu, Liu Huan
- Ml-bench: Evaluating Large Language Models And Agents For Machine Learning Tasks On Repository-level Code Tang Xiangru, Liu Yuliang, Cai Zefan, Shao Yanjun, Lu Junjie, Zhang Yichi, Deng Zexuan, Hu Helan, An Kaikai, Huang Ruijun, Si Shuzheng, Chen Sheng, Zhao Haozhe, Chen Liang, Wang Yan, Liu Tianyu, Jiang Zhiwei, Chang Baobao, Fang Yin, Qin Yujia, Zhou Wangchunshu, Zhao Yilun, Cohan Arman, Gerstein Mark
- RSVP: Customer Intent Detection Via Agent Response Contrastive And Generative Pre-training Tang Yu-chien, Wang Wei-yao, Yen An-zi, Peng Wen-chih
- A Benchmark For Learning To Translate A New Language From One Grammar Book Tanzer Garrett, Suzgun Mirac, Visser Eline, Jurafsky Dan, Melas-kyriazi Luke
- Can BERT Refrain From Forgetting On Sequential Tasks? A Probing Study Tao Mingxu, Feng Yansong, Zhao Dongyan
- Expanding Frozen Vision-language Models Without Retraining: Towards Improved Robot Perception Tavassoli Riley, Amani Mani, Akhavian Reza
- Caption Anything: Interactive Image Description With Diverse Multimodal Controls Teng Wang, Jinrui Zhang, Junjie Fei, Hao Zheng, Yunlong Tang, Zhe Li, Mingqi Gao, Shanshan Zhao
- CLAMP: Contrastive Language Model Prompt-tuning Teterwak Piotr, Sun Ximeng, Plummer Bryan A., Saenko Kate, Lim Ser-nam
- Verigen: A Large Language Model For Verilog Code Generation Thakur Shailja, Ahmad Baleegh, Pearce Hammond, Tan Benjamin, Dolan-gavitt Brendan, Karri Ramesh, Garg Siddharth
- SPDF: Sparse Pre-training And Dense Fine-tuning For Large Language Models Thangarasa Vithursan, Gupta Abhay, Marshall William, Li Tianda, Leong Kevin, Decoste Dennis, Lie Sean, Saxena Shreyas
- Is Chatgpt The Ultimate Programming Assistant -- How Far Is It? Tian Haoye, Lu Weiqi, Li Tsz On, Tang Xunzhu, Cheung Shing-chi, Klein Jacques, Bissyandé Tegawendé F.
- Soft-prompt Tuning For Large Language Models To Evaluate Bias Tian Jacob-junqi, Emerson David, Miyandoab Sevil Zanjani, Pandya Deval, Seyyed-kalantari Laleh, Khattak Faiza Khan
- Graph Neural Prompting With Large Language Models Tian Yijun, Song Huan, Wang Zichen, Wang Haozhu, Hu Ziqing, Wang Fang, Chawla Nitesh V., Xu Panpan
- Chimed-gpt: A Chinese Medical Large Language Model With Full Training Regime And Better Alignment To Human Preferences Tian Yuanhe, Gan Ruyi, Song Yan, Zhang Jiaxing, Zhang Yongdong
- RLHF-V: Towards Trustworthy Mllms Via Behavior Alignment From Fine-grained Correctional Human Feedback Tianyu Yu, Yuan Yao, Haoye Zhang, Taiwen He, Yifeng Han, Ganqu Cui, Jinyi Hu, Zhiyuan Liu, Hai-tao Zheng, Maosong Sun, Tat-seng Chua
- Better Language Models Of Code Through Self-improvement To Hung Quoc, Bui Nghi D. Q., Guo Jin, Nguyen Tien N.
- Clinical Camel: An Open Expert-level Medical Language Model With Dialogue-based Knowledge Encoding Toma Augustin, Lawler Patrick R., Ba Jimmy, Krishnan Rahul G., Rubin Barry B., Wang Bo
- Llmseceval: A Dataset Of Natural Language Prompts For Security Evaluations Tony Catherine, Mutas Markus, Ferreyra Nicolás E. Díaz, Scandariato Riccardo
- Videberta: A Powerful Pre-trained Language Model For Vietnamese Tran Cong Dao, Pham Nhut Huy, Nguyen Anh, Hy Truong Son, Vu Tu
- Bioinstruct: Instruction Tuning Of Large Language Models For Biomedical Natural Language Processing Tran Hieu, Yang Zhichao, Yao Zonghai, Yu Hong
- Viclevr: A Visual Reasoning Dataset And Hybrid Multimodal Fusion Model For Visual Question Answering In Vietnamese Tran Khiem Vinh, Phan Hao Phu, Van Nguyen Kiet, Nguyen Ngan Luu Thuy
- Bartphobeit: Pre-trained Sequence-to-sequence And Image Transformers Models For Vietnamese Visual Question Answering Tran Khiem Vinh, Van Nguyen Kiet, Nguyen Ngan Luu Thuy
- Overinformative Question Answering By Humans And Machines Tsvilodub Polina, Franke Michael, Hawkins Robert D., Goodman Noah D.
- Unlocking Anticipatory Text Generation: A Constrained Approach For Large Language Models Decoding Tu Lifu, Yavuz Semih, Qu Jin, Xu Jiacheng, Meng Rui, Xiong Caiming, Zhou Yingbo
- Automating Code-related Tasks Through Transformers: The Impact Of Pre-training Tufano Rosalia, Pascarella Luca, Bavota Gabriele
- Mlongt5: A Multilingual And Efficient Text-to-text Transformer For Longer Sequences Uthus David, Ontañón Santiago, Ainslie Joshua, Guo Mandy
- Don't Believe Everything You Read: Enhancing Summarization Interpretability Through Automatic Identification Of Hallucinations In Large Language Models Vakharia Priyesh, Joshi Devavrat, Chavan Meenal, Sonawane Dhananjay, Garg Bhrigu, Mazaheri Parsa
- Mini-gpts: Efficient Large Language Models Through Contextual Pruning Valicenti Tim, Vidal Justice, Patnaik Ritik
- Detecting And Correcting Hate Speech In Multimodal Memes With Large Visual Language Model Van Minh-hao, Wu Xintao
- Accelerating Llama Inference By Enabling Intermediate Layer Decoding Via Instruction Tuning With LITE Varshney Neeraj, Chatterjee Agneet, Parmar Mihir, Baral Chitta
- A Stitch In Time Saves Nine: Detecting And Mitigating Hallucinations Of Llms By Validating Low-confidence Generation Varshney Neeraj, Yao Wenlin, Zhang Hongming, Chen Jianshu, Yu Dong
- Adventures Of Trustworthy Vision-language Models: A Survey Vatsa Mayank, Jain Anubhooti, Singh Richa
- ASPIRO: Any-shot Structured Parsing-error-induced Reprompting For Consistent Data-to-text Generation Vejvar Martin, Fujimoto Yasutaka
- Rethinking The Evaluating Framework For Natural Language Understanding In AI Systems: Language Acquisition As A Core For Future Metrics Vera Patricio, Moya Pedro, Barraza Lisa
- Simplesafetytests: A Test Suite For Identifying Critical Safety Risks In Large Language Models Vidgen Bertie, Scherrer Nino, Kirk Hannah Rose, Qian Rebecca, Kannappan Anand, Hale Scott A., Röttger Paul
- Mtcue: Learning Zero-shot Control Of Extra-textual Attributes By Leveraging Unstructured Context In Neural Machine Translation Vincent Sebastian, Flynn Robert, Scarton Carolina
- Generative Pre-trained Transformer For Vietnamese Community-based COVID-19 Question Answering Vo Tam Minh, Tran Khiem Vinh
- Using Natural Language Explanations To Rescale Human Judgments Wadhwa Manya, Chen Jifan, Li Junyi Jessy, Durrett Greg
- Empirical Evaluation Of Uncertainty Quantification In Retrieval-augmented Language Models For Science Wagle Sridevi, Munikoti Sai, Acharya Anurag, Smith Sara, Horawalavithana Sameera
- Universal Self-adaptive Prompting Wan Xingchen, Sun Ruoxi, Nakhost Hootan, Dai Hanjun, Eisenschlos Julian Martin, Arik Sercan O., Pfister Tomas
- Are Personalized Stochastic Parrots More Dangerous? Evaluating Persona Biases In Dialogue Systems Wan Yixin, Zhao Jieyu, Chadha Aman, Peng Nanyun, Chang Kai-wei
- Efficient Large Language Models: A Survey Wan Zhongwei, Wang Xin, Liu Che, Alam Samiul, Zheng Yu, Liu Jiachen, Qu Zhongnan, Yan Shen, Zhu Yi, Zhang Quanlu, Chowdhury Mosharaf, Zhang Mi
- Shall We Pretrain Autoregressive Language Models With Retrieval? A Comprehensive Study Wang Boxin, Ping Wei, Xu Peng, Mcafee Lawrence, Liu Zihan, Shoeybi Mohammad, Dong Yi, Kuchaiev Oleksii, Li Bo, Xiao Chaowei, Anandkumar Anima, Catanzaro Bryan
- Keqing: Knowledge-based Question Answering Is A Nature Chain-of-thought Mentor Of LLM Wang Chaojie, Xu Yishi, Peng Zhong, Zhang Chenxi, Chen Bo, Wang Xinrun, Feng Lei, An Bo
- Cost-effective Hyperparameter Optimization For Large Language Model Generation Inference Wang Chi, Liu Susan Xueqing, Awadallah Ahmed H.
- Robust Natural Language Understanding With Residual Attention Debiasing Wang Fei, Huang James Y., Yan Tianyi, Zhou Wenxuan, Chen Muhao
- A Survey Of The Evolution Of Language Model-based Dialogue Systems Wang Hongru, Wang Lingzhi, Du Yiming, Chen Liang, Zhou Jingyan, Wang Yufei, Wong Kam-fai
- Cue-cot: Chain-of-thought Prompting For Responding To In-depth Dialogue Questions With Llms Wang Hongru, Wang Rui, Mi Fei, Deng Yang, Wang Zezhong, Liang Bin, Xu Ruifeng, Wong Kam-fai
- On The Robustness Of Chatgpt: An Adversarial And Out-of-distribution Perspective Wang Jindong, Hu Xixu, Hou Wenxin, Chen Hao, Zheng Runkai, Wang Yidong, Yang Linyi, Huang Haojun, Ye Wei, Geng Xiubo, Jiao Binxin, Zhang Yue, Xie Xing
- Chated: A Chatbot Leveraging Chatgpt For An Enhanced Learning Experience In Higher Education Wang Kevin, Ramos Jason, Lawrence Ramon
- Document-level Machine Translation With Large Language Models Wang Longyue, Lyu Chenyang, Ji Tianbo, Zhang Zhirui, Yu Dian, Shi Shuming, Tu Zhaopeng
- Simultaneous Machine Translation With Large Language Models Wang Minghan, Zhao Jinming, Vu Thuy-trang, Shiri Fatemeh, Shareghi Ehsan, Haffari Gholamreza
- Demystifying Instruction Mixing For Fine-tuning Large Language Models Wang Renxi, Li Haonan, Wu Minghao, Wang Yuxia, Han Xudong, Zhang Chiyu, Baldwin Timothy
- Vamos: Versatile Action Models For Video Understanding Wang Shijie, Zhao Qi, Do Minh Quan, Agarwal Nakul, Lee Kwonjoon, Sun Chen
- READ: Recurrent Adaptation Of Large Transformers Wang Sid, Nguyen John, Li Ke, Wu Carole-jean
- CAR: Conceptualization-augmented Reasoner For Zero-shot Commonsense Question Answering Wang Weiqi, Fang Tianqing, Ding Wenxuan, Xu Baixuan, Liu Xin, Song Yangqiu, Bosselut Antoine
- TRACE: A Comprehensive Benchmark For Continual Learning In Large Language Models Wang Xiao, Zhang Yuansen, Chen Tianze, Gao Songyang, Jin Senjie, Yang Xianjun, Xi Zhiheng, Zheng Rui, Zou Yicheng, Gui Tao, Zhang Qi, Huang Xuanjing
- MINT: Evaluating Llms In Multi-turn Interaction With Tools And Language Feedback Wang Xingyao, Wang Zihan, Liu Jiateng, Chen Yangyi, Yuan Lifan, Peng Hao, Ji Heng
- Emotional Intelligence Of Large Language Models Wang Xuena, Li Xueting, Yin Zi, Wu Yue, Jia Liu
- Adding Guardrails To Advanced Chatbots Wang Yanchen, Singh Lisa
- Automated Evaluation Of Personalized Text Generation Using Large Language Models Wang Yaqing, Jiang Jiepu, Zhang Mingyang, Li Cheng, Liang Yi, Mei Qiaozhu, Bendersky Michael
- Hint-enhanced In-context Learning Wakes Large Language Models Up For Knowledge-intensive Tasks Wang Yifan, Guo Qingyan, Ni Xinzhe, Shi Chufan, Liu Lemao, Jiang Haiyun, Yang Yujiu
- Rescue: Ranking LLM Responses With Partial Ordering To Improve Response Generation Wang Yikun, Zheng Rui, Li Haoming, Zhang Qi, Gui Tao, Liu Fei
- Augmenting Black-box Llms With Medical Textbooks For Clinical Question Answering Wang Yubo, Ma Xueguang, Chen Wenhu
- Metacognitive Prompting Improves Understanding In Large Language Models Wang Yuqing, Zhao Yun
- Chatcoder: Chat-based Refine Requirement Improves Llms' Code Generation Wang Zejun, Li Jia, Li Ge, Jin Zhi
- Smarttrim: Adaptive Tokens And Attention Pruning For Efficient Vision-language Models Wang Zekun, Chen Jingchang, Zhou Wangchunshu, Zhu Haichao, Liang Jiafeng, Shan Liping, Liu Ming, Xu Dongliang, Yang Qing, Qin Bing
- Gpt4video: A Unified Multimodal Large Language Model For Instruction-followed Understanding And Safety-aware Generation Wang Zhanyu, Wang Longyue, Zhao Zhen, Wu Minghao, Lyu Chenyang, Li Huayang, Cai Deng, Zhou Luping, Shi Shuming, Tu Zhaopeng
- Mediagpt: A Large Language Model For Chinese Media Wang Zhonghao, Lu Zijia, Jin Bo, Deng Haiying
- Empowering Few-shot Recommender Systems With Large Language Models -- Enhanced Representations Wang Zhoumeng
- Can Llms Like GPT-4 Outperform Traditional AI Tools In Dementia Diagnosis? Maybe, But Not Today Wang Zhuo, Li Rongzhen, Dong Bowen, Wang Jie, Li Xiuxing, Liu Ning, Mao Chenhui, Zhang Wei, Dong Liling, Gao Jing, Wang Jianyong
- An Effective Data Creation Pipeline To Generate High-quality Financial Instruction Data For Large Language Model Wang Ziao, Wang Jianning, Wu Junda, Zhang Xiaofeng
- Enabling Language Models To Implicitly Learn Self-improvement Wang Ziqi, Hou Le, Lu Tianjian, Wu Yuexin, Li Yunxuan, Yu Hongkun, Ji Heng
- Call For Papers -- The Babylm Challenge: Sample-efficient Pretraining On A Developmentally Plausible Corpus Warstadt Alex, Choshen Leshem, Mueller Aaron, Williams Adina, Wilcox Ethan, Zhuang Chengxu
- Machine Translation For Ge'ez Language Wassie Aman Kassahun
- Polylm: An Open Source Polyglot Large Language Model Wei Xiangpeng, Wei Haoran, Lin Huan, Li Tianhao, Zhang Pei, Ren Xingzhang, Li Mei, Wan Yu, Cao Zhiwei, Xie Binbin, Hu Tianxiang, Li Shangjie, Hui Binyuan, Yu Bowen, Liu Dayiheng, Yang Baosong, Huang Fei, Xie Jun
- Magicoder: Empowering Code Generation With Oss-instruct Wei Yuxiang, Wang Zhe, Liu Jiawei, Ding Yifeng, Zhang Lingming
- Copiloting The Copilots: Fusing Large Language Models With Completion Engines For Automated Program Repair Wei Yuxiang, Xia Chunqiu Steven, Zhang Lingming
- Is Chatgpt Good At Search? Investigating Large Language Models As Re-ranking Agents Weiwei Sun, Lingyong Yan, Xinyu Ma, Shuaiqiang Wang, Pengjie Ren, Zhumin Chen, Dawei Yin, Zhaochun Ren
- Droidbot-gpt: Gpt-powered UI Automation For Android Wen Hao, Wang Hongming, Liu Jiaxuan, Li Yuanchun
- Mindmap: Knowledge Graph Prompting Sparks Graph Of Thoughts In Large Language Models Wen Yilin, Wang Zifeng, Sun Jimeng
- Hyperpolyglot Llms: Cross-lingual Interpretability In Token Embeddings Wen-yi Andrea W, Mimno David
- Generative Recommendation: Towards Next-generation Recommender Paradigm Wenjie Wang, Xinyu Lin, Fuli Feng, Xiangnan He, Tat-seng Chua
- An Overview Of Temporal Commonsense Reasoning And Acquisition Wenzel Georg, Jatowt Adam
- Exploring Parameter-efficient Fine-tuning Techniques For Code Generation With Large Language Models Weyssow Martin, Zhou Xin, Kim Kisub, Lo David, Sahraoui Houari
- Resolving References In Visually-grounded Dialogue Via Text Generation Willemsen Bram, Qian Livia, Skantze Gabriel
- Natural Language Generation And Understanding Of Big Code For Ai-assisted Programming: A Review Wong Man Fai, Guo Shangxin, Hang Ching Nam, Ho Siu Wai, Tan Chee Wei
- Pmc-llama: Towards Building Open-source Language Models For Medicine Wu Chaoyi, Lin Weixiong, Zhang Xiaoman, Zhang Ya, Wang Yanfeng, Xie Weidi
- Rethinking Model Selection And Decoding For Keyphrase Generation With Pre-trained Sequence-to-sequence Models Wu Di, Ahmad Wasi Uddin, Chang Kai-wei
- Exploring The Limits Of Chatgpt In Software Security Applications Wu Fangzhou, Zhang Qingzhao, Bajaj Ati Priya, Bao Tiffany, Zhang Ning, Wang Ruoyu "fish", Xiao Chaowei
- P-laplacian Adaptation For Generative Pre-trained Vision-language Models Wu Haoyuan, Zhang Xinyun, Xu Peng, Liao Peiyu, Yao Xufeng, Yu Bei
- Efficient LLM Inference Solution On Intel GPU Wu Hui, Gan Yi, Yuan Feng, Ma Jing, Zhu Wei, Xu Yutao, Zhu Hong, Zhu Yuhua, Liu Xiaoli, Gu Jinghui, Zhao Peng
- Large Language Models Leverage External Knowledge To Extend Clinical Insight Beyond Language Boundaries Wu Jiageng, Wu Xian, Qiu Zhaopeng, Li Minghui, Zhang Yingying, Zheng Yefeng, Yuan Changzheng, Yang Jie
- Document Flattening: Beyond Concatenating Context For Document-level Neural Machine Translation Wu Minghao, Foster George, Qu Lizhen, Haffari Gholamreza
- A Comparative Study Of Open-source Large Language Models, GPT-4 And Claude 2: Multiple-choice Test Taking In Nephrology Wu Sean, Koo Michael, Blum Lesley, Black Andy, Kao Liyo, Scalzo Fabien, Kurtz Ira
- YUAN 2.0: A Large Language Model With Localized Filtering-based Attention Wu Shaohua, Zhao Xudong, Wang Shenling, Luo Jiangang, Li Lingjun, Chen Xi, Zhao Bing, Wang Wei, Yu Tong, Zhang Rongguo, Zhang Jiahua, Wang Chao
- Analyzing Chain-of-thought Prompting In Large Language Models Via Gradient-based Feature Attributions Wu Skyler, Shen Eric Meng, Badrinath Charumathi, Ma Jiaqi, Lakkaraju Himabindu
- Towards Robust Text Retrieval With Progressive Learning Wu Tong, Qin Yulei, Zhang Enwei, Xu Zihan, Gao Yuting, Li Ke, Sun Xing
- Self-correcting Llm-controlled Diffusion Models Wu Tsung-han, Lian Long, Gonzalez Joseph E., Li Boyi, Darrell Trevor
- Unleashing Potential Of Evidence In Knowledge-intensive Dialogue Generation Wu Xianjie, Yang Jian, Li Tongliang, Liang Di, Zhang Shiwei, Du Yiyang, Li Zhoujun
- Unveiling Security, Privacy, And Ethical Concerns Of Chatgpt Wu Xiaodong, Duan Ran, Ni Jianbing
- From Language Modeling To Instruction Following: Understanding The Behavior Shift In Llms After Instruction Tuning Wu Xuansheng, Yao Wenlin, Chen Jianshu, Pan Xiaoman, Wang Xiaoyang, Liu Ninghao, Yu Dong
- TLM: Token-level Masking For Transformers Wu Yangjun, Fang Kebin, Zhang Dongxiang, Wang Han, Zhang Hao, Chen Gang
- Retrieve-rewrite-answer: A Kg-to-text Enhanced Llms Framework For Knowledge Graph Question Answering Wu Yike, Hu Nan, Bi Sheng, Qi Guilin, Ren Jie, Xie Anhuan, Song Wei
- Fine-grained Human Feedback Gives Better Rewards For Language Model Training Wu Zeqiu, Hu Yushi, Shi Weijia, Dziri Nouha, Suhr Alane, Ammanabrolu Prithviraj, Smith Noah A., Ostendorf Mari, Hajishirzi Hannaneh
- Openicl: An Open-source Framework For In-context Learning Wu Zhenyu, Wang Yaoxiang, Ye Jiacheng, Feng Jiangtao, Xu Jingjing, Qiao Yu, Wu Zhiyong
- A Simple Baseline For Knowledge-based Visual Question Answering Xenos Alexandros, Stafylakis Themos, Patras Ioannis, Tzimiropoulos Georgios
- LLMGA: Multimodal Large Language Model Based Generation Assistant Xia Bin, Wang Shiyin, Tao Yingfan, Wang Yitong, Jia Jiaya
- Improving Question Generation With Multi-level Content Planning Xia Zehua, Gou Qi, Yu Bowen, Yu Haiyang, Huang Fei, Li Yongbin, Nguyen Cam-tu
- Efficient Streaming Language Models With Attention Sinks Xiao Guangxuan, Tian Yuandong, Chen Beidi, Han Song, Lewis Mike
- Can I Trust Your Answer? Visually Grounded Video Question Answering Xiao Junbin, Yao Angela, Li Yicong, Chua Tat Seng
- Lm-cocktail: Resilient Tuning Of Language Models Via Model Merging Xiao Shitao, Liu Zheng, Zhang Peitian, Xing Xingrun
- Personalized Abstractive Summarization By Tri-agent Generation Pipeline Xiao Wen, Xie Yujia, Carenini Giuseppe, He Pengcheng
- A Survey On Multi-behavior Sequential Recommendation Xiaoqing Chen, Zhitao Li, Weike Pan, Zhong Ming
- Pangu-Σ: Towards Trillion Parameter Language Model With Sparse Heterogeneous Computing Xiaozhe Ren, Pingyi Zhou, Xinfan Meng, Xinjing Huang, Yadao Wang, Weichao Wang, Pengfei Li, Xiaoda Zhang, Alexander Podolskiy, Grigory Arshinov, Andrey Bout, Irina Piontkovskaya, Jiansheng Wei, Xin Jiang, Teng Su, Qun Liu, Jun Yao
- Tencentllmeval: A Hierarchical Evaluation Of Real-world Capabilities For Human-aligned Llms Xie Shuyi, Yao Wenlin, Dai Yong, Wang Shaobo, Zhou Donlin, Jin Lifeng, Feng Xinhua, Wei Pengzhi, Lin Yujie, Hu Zhichao, Yu Dong, Zhang Zhengyou, Nie Jing, Liu Yuhong
- Making Small Language Models Better Multi-task Learners With Mixture-of-task-adapters Xie Yukang, Wang Chengyu, Yan Junbing, Zhou Jiyong, Deng Feiqi, Huang Jun
- Examining Inter-consistency Of Large Language Models Collaboration: An In-depth Analysis Via Debate Xiong Kai, Ding Xiao, Cao Yixin, Liu Ting, Qin Bing
- The Program Testing Ability Of Large Language Models For Code Xiong Weimin, Guo Yiwen, Chen Hao
- Llmcad: Fast And Scalable On-device Large Language Model Inference Xu Daliang, Yin Wangsong, Jin Xin, Zhang Ying, Wei Shiyun, Xu Mengwei, Liu Xuanzhe
- A Paradigm Shift In Machine Translation: Boosting Translation Performance Of Large Language Models Xu Haoran, Kim Young Jin, Sharaf Amr, Awadalla Hany Hassan
- Penetrative AI: Making Llms Comprehend The Physical World Xu Huatao, Han Liying, Yang Qirui, Li Mo, Srivastava Mani
- Retrieval-based Video Language Model For Efficient Long Video Question Answering Xu Jiaqi, Lan Cuiling, Xie Wenxuan, Chen Xuejin, Lu Yan
- Superclue: A Comprehensive Chinese Large Language Model Benchmark Xu Liang, Li Anqi, Zhu Lei, Xue Hang, Zhu Changtai, Zhao Kangkang, He Haonan, Zhang Xuanwei, Kang Qiyue, Lan Zhenzhong
- Sc-safety: A Multi-round Open-ended Question Adversarial Safety Benchmark For Large Language Models In Chinese Xu Liang, Zhao Kangkang, Zhu Lei, Xue Hang
- Knowledge-infused Prompting: Assessing And Advancing Clinical Text Data Generation With Large Language Models Xu Ran, Cui Hejie, Yu Yue, Kan Xuan, Shi Wenqi, Zhuang Yuchen, Jin Wei, Ho Joyce, Yang Carl
- Fine-tuned Llms Know More, Hallucinate Less With Few-shot Sequence-to-sequence Semantic Parsing Over Wikidata Xu Silei, Liu Shicheng, Culhane Theo, Pertseva Elizaveta, Wu Meng-hsi, Semnani Sina J., Lam Monica S.
- Llmrefine: Pinpointing And Refining Large Language Models Via Fine-grained Actionable Feedback Xu Wenda, Deutsch Daniel, Finkelstein Mara, Juraska Juraj, Zhang Biao, Liu Zhongtao, Wang William Yang, Li Lei, Freitag Markus
- INSTRUCTSCORE: Explainable Text Generation Evaluation With Finegrained Feedback Xu Wenda, Wang Danqing, Pan Liangming, Song Zhenqiao, Freitag Markus, Wang William Yang, Li Lei
- An LLM Can Fool Itself: A Prompt-based Adversarial Attack Xu Xilie, Kong Keyi, Liu Ning, Cui Lizhen, Wang Di, Zhang Jingfeng, Kankanhalli Mohan
- Mental-llm: Leveraging Large Language Models For Mental Health Prediction Via Online Text Data Xu Xuhai, Yao Bingsheng, Dong Yuanzhe, Gabriel Saadia, Yu Hong, Hendler James, Ghassemi Marzyeh, Dey Anind K., Wang Dakuo
- Drivegpt4: Interpretable End-to-end Autonomous Driving Via Large Language Model Xu Zhenhua, Zhang Yujia, Xie Enze, Zhao Zhen, Guo Yong, Wong Kwan-yee K., Li Zhenguo, Zhao Hengshuang
- How Robust Is GPT-3.5 To Predecessors? A Comprehensive Study On Language Understanding Tasks Xuanting Chen, Junjie Ye, Can Zu, Nuo Xu, Rui Zheng, Minlong Peng, Jie Zhou, Tao Gui, Qi Zhang, Xuanjing Huang
- Bias And Fairness In Chatbots: An Overview Xue Jintang, Wang Yun-cheng, Wei Chengwei, Liu Xiaofeng, Woo Jonghye, Kuo C.-c. Jay
- Optimizing Machine Translation Through Prompt Engineering: An Investigation Into Chatgpt's Customizability Yamada Masaru
- Understanding In-context Learning From Repetitions Yan Jianhao, Xu Jin, Song Chiyu, Wu Chenming, Li Yafu, Zhang Yue
- Predicting Text Preference Via Structured Comparative Reasoning Yan Jing Nathan, Liu Tianqi, Chiu Justin T, Shen Jiaming, Qin Zhen, Yu Yue, Zhao Yao, Lakshmanan Charu, Kurzion Yair, Rush Alexander M., Liu Jialu, Bendersky Michael
- Backdooring Instruction-tuned Large Language Models With Virtual Prompt Injection Yan Jun, Yadav Vikas, Li Shiyang, Chen Lichang, Tang Zheng, Wang Hai, Srinivasan Vijay, Ren Xiang, Jin Hongxia
- Voila-a: Aligning Vision-language Models With User's Gaze Attention Yan Kun, Ji Lei, Wang Zeyu, Wang Yuntao, Duan Nan, Ma Shuai
- Codescope: An Execution-based Multilingual Multitask Multidimensional Benchmark For Evaluating Llms On Code Understanding And Generation Yan Weixiang, Liu Haitian, Wang Yunkun, Li Yunzhe, Chen Qian, Wang Wen, Lin Tingyu, Zhao Weishan, Zhu Li, Sundaram Hari, Deng Shuiguang
- Multimodal Chatgpt For Medical Applications: An Experimental Study Of GPT-4V Yan Zhiling, Zhang Kai, Zhou Rong, He Lifang, Li Xiang, Sun Lichao
- Effective Distillation Of Table-based Reasoning Ability From Llms Yang Bohao, Tang Chen, Zhao Kun, Xiao Chenghao, Lin Chenghua
- Beyond Testers' Biases: Guiding Model Testing With Knowledge Bases Using Llms Yang Chenyang, Rustogi Rishabh, Brower-sinning Rachel, Lewis Grace A., Kästner Christian, Wu Tongshuang
- Zero-shot Query Reformulation For Conversational Search Yang Dayu, Zhang Yue, Fang Hui
- Empower Large Language Model To Perform Better On Industrial Domain-specific Question Answering Yang Fangkai, Zhao Pu, Wang Zezhong, Wang Lu, Zhang Jue, Garg Mohit, Lin Qingwei, Rajmohan Saravan, Zhang Dongmei
- Knowledge-prompted Estimator: A Novel Approach To Explainable Machine Translation Assessment Yang Hao, Zhang Min, Tao Shimin, Wang Minghan, Wei Daimeng, Jiang Yanfei
- PRCA: Fitting Black-box Large Language Models For Retrieval Question Answering Via Pluggable Reward-driven Contextual Adapter Yang Haoyan, Li Zhitao, Zhang Yong, Wang Jianzong, Cheng Ning, Li Ming, Xiao Jing
- Longqlora: Efficient And Effective Method To Extend Context Length Of Large Language Models Yang Jianxin
- Good Questions Help Zero-shot Image Reasoning Yang Kaiwen, Shen Tao, Tian Xinmei, Geng Xiubo, Tao Chongyang, Tao Dacheng, Zhou Tianyi
- Revisiting Parallel Context Windows: A Frustratingly Simple Alternative And Chain-of-thought Deterioration Yang Kejuan, Liu Xiao, Men Kaiwen, Zeng Aohan, Dong Yuxiao, Tang Jie
- Kobigbird-large: Transformation Of Transformer For Korean Language Understanding Yang Kisu, Jang Yoonna, Lee Taewoo, Seong Jinwoo, Lee Hyungjin, Jang Hwanseok, Lim Heuiseok
- Medxchat: A Unified Multimodal Large Language Model Framework Towards Cxrs Understanding And Generation Yang Ling, Wang Zhanyu, Chen Zhenghao, Liang Xinyu, Zhou Luping
- Give Us The Facts: Enhancing Large Language Models With Knowledge Graphs For Fact-aware Language Modeling Yang Linyao, Chen Hongyang, Li Zhao, Ding Xiao, Wu Xindong
- Supervised Knowledge Makes Large Language Models Better In-context Learners Yang Linyi, Zhang Shuibai, Yu Zhuohao, Bao Guangsheng, Wang Yidong, Wang Jindong, Xu Ruochen, Ye Wei, Xie Xing, Chen Weizhu, Zhang Yue
- Uncertainty-aware Language Modeling For Selective Question Answering Yang Qi, Ravikumar Shreya, Schmitt-ulms Fynn, Lolla Satvik, Demir Ege, Elistratov Iaroslav, Lavaee Alex, Lolla Sadhana, Ahmadi Elaheh, Rus Daniela, Amini Alexander, Perez Alejandro
- Integrating UMLS Knowledge Into Large Language Models For Medical Question Answering Yang Rui, Marrese-taylor Edison, Ke Yuhe, Cheng Lechao, Chen Qingyu, Li Irene
- Lidar-llm: Exploring The Potential Of Large Language Models For 3D Lidar Understanding Yang Senqiao, Liu Jiaming, Zhang Ray, Pan Mingjie, Guo Zoey, Li Xiaoqi, Chen Zehui, Gao Peng, Guo Yandong, Zhang Shanghang
- Progressive Evidence Refinement For Open-domain Multimodal Retrieval Question Answering Yang Shuwen, Wu Anran, Wu Xingjiao, Xiao Luwei, Ma Tianlong, Jin Cheng, He Liang
- Zhongjing: Enhancing The Chinese Medical Capabilities Of Large Language Model Through Expert Feedback And Real-world Multi-turn Dialogue Yang Songhua, Zhao Hanjie, Zhu Senbin, Zhou Guangyu, Xu Hongfei, Jia Yuxiang, Zan Hongying
- Neural Machine Translation Data Generation And Augmentation Using Chatgpt Yang Wayne, Nicolai Garrett
- Shadow Alignment: The Ease Of Subverting Safely-aligned Language Models Yang Xianjun, Wang Xiao, Zhang Qi, Petzold Linda, Wang William Yang, Zhao Xun, Lin Dahua
- Human-in-the-loop Machine Translation With Large Language Model Yang Xinyi, Zhan Runzhe, Wong Derek F., Wu Junchao, Chao Lidia S.
- Lever LM: Configuring In-context Sequence To Lever Large Vision Language Models Yang Xu, Peng Yingzhe, Ma Haoxuan, Xu Shuo, Zhang Chi, Han Yucheng, Zhang Hanwang
- On The Calibration Of Multilingual Question Answering Llms Yang Yahan, Dan Soham, Roth Dan, Lee Insup
- Mindllm: Pre-training Lightweight Large Language Model From Scratch, Evaluations And Domain Applications Yang Yizhe, Sun Huashan, Li Jiawei, Liu Runheng, Li Yinghao, Liu Yuhang, Huang Heyan, Gao Yang
- Improving The Reliability Of Large Language Models By Leveraging Uncertainty-aware In-context Learning Yang Yuchen, Li Houqiang, Wang Yanfeng, Wang Yu
- On The Planning, Search, And Memorization Capabilities Of Large Language Models Yang Yunhao, Tomar Anshul
- Surpassing GPT-4 Medical Coding With A Two-stage Approach Yang Zhichao, Batra Sanjit Singh, Stremmel Joel, Halperin Eran
- Re-vilm: Retrieval-augmented Visual Language Model For Zero And Few-shot Image Captioning Yang Zhuolin, Ping Wei, Liu Zihan, Korthikanti Vijay, Nie Weili, Huang De-an, Fan Linxi, Yu Zhiding, Lan Shiyi, Li Bo, Liu Ming-yu, Zhu Yuke, Shoeybi Mohammad, Catanzaro Bryan, Xiao Chaowei, Anandkumar Anima
- Quantization-aware And Tensor-compressed Training Of Transformers For Natural Language Understanding Yang Zi, Choudhary Samridhi, Kunzmann Siegfried, Zhang Zheng
- Benchmarking Llm-based Machine Translation On Cultural Awareness Yao Binwei, Jiang Ming, Yang Diyi, Hu Junjie
- Promptcare: Prompt Copyright Protection By Watermark Injection And Verification Yao Hongwei, Lou Jian, Ren Kui, Qin Zhan
- Knowledge Plugins: Enhancing Large Language Models For Domain-specific Recommendations Yao Jing, Xu Wei, Lian Jianxun, Wang Xiting, Yi Xiaoyuan, Xie Xing
- Embodiedgpt: Vision-language Pre-training Via Embodied Chain Of Thought Yao Mu, Qinglong Zhang, Mengkang Hu, Wenhai Wang, Mingyu Ding, Jun Jin, Bin Wang, Jifeng Dai, Yu Qiao, Ping Luo
- A Survey On Large Language Model (LLM) Security And Privacy: The Good, The Bad, And The Ugly Yao Yifan, Duan Jinhao, Xu Kaidi, Cai Yuanfang, Sun Zhibo, Zhang Yue
- Cognitive Mirage: A Review Of Hallucinations In Large Language Models Ye Hongbin, Liu Tong, Zhang Aijia, Hua Wei, Jia Weiqiang
- Generating Data For Symbolic Language With Large Language Models Ye Jiacheng, Li Chengzu, Kong Lingpeng, Yu Tao
- Qilin-med: Multi-stage Knowledge Injection Advanced Medical Large Language Model Ye Qichen, Liu Junling, Chong Dading, Zhou Peilin, Hua Yining, Liu Fenglin, Cao Meng, Wang Ziming, Cheng Xuxin, Lei Zhu, Guo Zhenhua
- Tram: A Token-level Retrieval-augmented Mechanism For Source Code Summarization Ye Tong, Wu Lingfei, Ma Tengfei, Zhang Xuhong, Du Yangkai, Liu Peiyu, Ji Shouling, Wang Wenhai
- Rational Decision-making Agent With Internalized Utility Judgment Ye Yining, Cong Xin, Tian Shizuo, Qin Yujia, Liu Chong, Lin Yankai, Liu Zhiyuan, Sun Maosong
- QAID: Question Answering Inspired Few-shot Intent Detection Yehudai Asaf, Vetzler Matan, Mass Yosi, Lazar Koren, Cohen Doron, Carmeli Boaz
- A Multitask, Multilingual, Multimodal Evaluation Of Chatgpt On Reasoning, Hallucination, And Interactivity Yejin Bang, Samuel Cahyawijaya, Nayeon Lee, Wenliang Dai, Dan Su, Bryan Wilie, Holy Lovenia, Ziwei Ji, Tiezheng Yu, Willy Chung, Quyet V. Do, Yan Xu, Pascale Fung
- Benchmarking And Defending Against Indirect Prompt Injection Attacks On Large Language Models Yi Jingwei, Xie Yueqi, Zhu Bin, Kiciman Emre, Sun Guangzhong, Xie Xing, Wu Fangzhao
- Element-aware Summarization With Large Language Models: Expert-aligned Evaluation And Chain-of-thought Method Yiming Wang, Zhuosheng Zhang, Rui Wang
- Modulora: Finetuning 2-bit Llms On Consumer Gpus By Integrating With Modular Quantizers Yin Junjie, Dong Jiahao, Wang Yingheng, De Sa Christopher, Kuleshov Volodymyr
- Alignment Is Not Sufficient To Prevent Large Language Models From Generating Harmful Information: A Psychoanalytic Perspective Yin Zi, Ding Wei, Liu Jia
- Intuitive Or Dependent? Investigating Llms' Behavior Style To Conflicting Prompts Ying Jiahao, Cao Yixin, Xiong Kai, He Yidong, Cui Long, Liu Yongbin
- Analyzing And Mitigating Object Hallucination In Large Vision-language Models Yiyang Zhou, Chenhang Cui, Jaehong Yoon, Linjun Zhang, Zhun Deng, Chelsea Finn, Mohit Bansal, Huaxiu Yao
- Search-adaptor: Embedding Customization For Information Retrieval Yoon Jinsung, Arik Sercan O, Chen Yanfei, Pfister Tomas
- Decoding In-context Learning: Neuroscience-inspired Analysis Of Representations In Large Language Models Yousefi Safoora, Betthauser Leo, Hasanbeig Hosein, Millière Raphaël, Momennejad Ida
- Assessing Prompt Injection Risks In 200+ Custom Gpts Yu Jiahao, Wu Yuhang, Shu Dong, Jin Mingyu, Yang Sabrina, Xing Xinyu
- MELO: Enhancing Model Editing With Neuron-indexed Dynamic Lora Yu Lang, Chen Qin, Zhou Jie, He Liang
- Scaling Autoregressive Multi-modal Models: Pretraining And Instruction Tuning Yu Lili, Shi Bowen, Pasunuru Ramakanth, Muller Benjamin, Golovneva Olga, Wang Tianlu, Babu Arun, Tang Binh, Karrer Brian, Sheynin Shelly, Ross Candace, Polyak Adam, Howes Russell, Sharma Vasu, Xu Puxin, Tamoyan Hovhannes, Ashual Oron, Singer Uriel, Li Shang-wen, Zhang Susan, James Richard, Ghosh Gargi, Taigman Yaniv, Fazel-zarandi Maryam, Celikyilmaz Asli, Zettlemoyer Luke, Aghajanyan Armen
- Constructive Large Language Models Alignment With Diverse Feedback Yu Tianshu, Lin Ting-en, Wu Yuchuan, Yang Min, Huang Fei, Li Yongbin
- Seqgpt: An Out-of-the-box Large Language Model For Open Domain Sequence Understanding Yu Tianyu, Jiang Chengyue, Lou Chao, Huang Shen, Wang Xiaobin, Liu Wei, Cai Jiong, Li Yangning, Li Yinghui, Tu Kewei, Zheng Hai-tao, Zhang Ningyu, Xie Pengjun, Huang Fei, Jiang Yong
- Improving Language Models Via Plug-and-play Retrieval Feedback Yu Wenhao, Zhang Zhihan, Liang Zhenwen, Jiang Meng, Sabharwal Ashish
- Explanation-aware Soft Ensemble Empowers Large Language Model In-context Learning Yu Yue, Shen Jiaming, Liu Tianqi, Qin Zhen, Yan Jing Nathan, Liu Jialu, Zhang Chao, Bendersky Michael
- Wavecoder: Widespread And Versatile Enhancement For Code Large Language Models By Instruction Tuning Yu Zhaojian, Zhang Xin, Shang Ning, Huang Yangyu, Xu Can, Zhao Yishujie, Hu Wenxiang, Yin Qiufeng
- Prophet: Prompting Large Language Models With Complementary Answer Heuristics For Knowledge-based Visual Question Answering Yu Zhou, Ouyang Xuecheng, Shao Zhenwei, Wang Meng, Yu Jun
- Towards Better Chain-of-thought Prompting Strategies: A Survey Yu Zihan, He Liang, Wu Zhen, Dai Xinyu, Chen Jiajun
- Large Language Models Illuminate A Progressive Pathway To Artificial Healthcare Assistant: A Review Yuan Mingze, Bao Peng, Yuan Jiajia, Shen Yunhao, Chen Zifan, Xie Yi, Zhao Jie, Chen Yang, Zhang Li, Shen Lin, Dong Bin
- ANALOGYKB: Unlocking Analogical Reasoning Of Language Models With A Million-scale Knowledge Base Yuan Siyu, Chen Jiangjie, Sun Changzhi, Liang Jiaqing, Xiao Yanghua, Yang Deqing
- Where To Go Next For Recommender Systems? ID- Vs. Modality-based Recommender Models Revisited Yuan Zheng, Yuan Fajie, Song Yu, Li Youhua, Fu Junchen, Yang Fei, Pan Yunzhu, Ni Yongxin
- Multilingual Augmentation For Robust Visual Question Answering In Remote Sensing Images Yuan Zhenghang, Mou Lichao, Zhu Xiao Xiang
- Evotext: Enhancing Natural Language Generation Models Via Self-escalation Learning For Up-to-date Knowledge And Improved Performance Yuan Zhengqing, Xue Huiwen, Zhang Chao, Liu Yongming
- Prompt Highlighter: Interactive Control For Multi-modal Llms Yuechen Zhang, Shengju Qian, Bohao Peng, Shu Liu, Jiaya Jia
- Contextual Object Detection With Multimodal Large Language Models Yuhang Zang, Wei Li, Jun Han, Kaiyang Zhou, Chen Change Loy
- Pre-training To Learn In Context Yuxian Gu, Li Dong, Furu Wei, Minlie Huang
- Building Trust In Conversational AI: A Comprehensive Review And Solution Architecture For Explainable, Privacy-aware Systems Using Llms And Knowledge Graph Zafar Ahtsham, Parthasarathy Venkatesh Balavadhani, Van Chan Le, Shahid Saad, Khan Aafaq Iqbal, Shahid Arsalan
- Mascqa: A Question Answering Dataset For Investigating Materials Science Knowledge Of Large Language Models Zaki Mohd, Jayadeva, Mausam, Krishnan N. M. Anoop
- Improving Machine Translation With Large Language Models: A Preliminary Study With Cooperative Decoding Zeng Jiali, Meng Fandong, Yin Yongjing, Zhou Jie
- Text Alignment Is An Efficient Unified Model For Massive NLP Tasks Zha Yuheng, Yang Yichi, Li Ruichen, Hu Zhiting
- Mm-narrator: Narrating Long-form Videos With Multimodal In-context Learning Zhang Chaoyi, Lin Kevin, Yang Zhengyuan, Wang Jianfeng, Li Linjie, Lin Chung-ching, Liu Zicheng, Wang Lijuan
- User-controlled Knowledge Fusion In Large Language Models: Balancing Creativity And Hallucination Zhang Chen
- Xdial-eval: A Multilingual Open-domain Dialogue Evaluation Benchmark Zhang Chen, D'haro Luis Fernando, Tang Chengguang, Shi Ke, Tang Guohua, Li Haizhou
- Task-agnostic Distillation Of Encoder-decoder Language Models Zhang Chen, Yang Yang, Wang Jingang, Song Dawei
- Video-llama: An Instruction-tuned Audio-visual Language Model For Video Understanding Zhang Hang, Li Xin, Bing Lidong
- Controllable Text Generation With Residual Memory Transformer Zhang Hanqing, Si Sun, Wu Haiming, Song Dawei
- Uncovering Hidden Connections: Iterative Search And Reasoning For Video-grounded Dialog Zhang Haoyu, Liu Meng, Wang Yaowei, Cao Da, Guan Weili, Nie Liqiang
- Tractable Control For Autoregressive Language Generation Zhang Honghua, Dang Meihua, Peng Nanyun, Broeck Guy Van Den
- Efficient Toxic Content Detection By Bootstrapping And Distilling Large Language Models Zhang Jiang, Wu Qiong, Xu Yiming, Cao Cheng, Du Zheng, Psounis Konstantinos
- Empirical Evaluation Of Chatgpt On Requirements Information Retrieval Under Zero-shot Setting Zhang Jianzhang, Chen Yiyang, Niu Nan, Wang Yinglin, Liu Chuang
- Towards Perceiving Small Visual Details In Zero-shot Visual Question Answering With Multimodal Llms Zhang Jiarui, Khayatkhoei Mahyar, Chhikara Prateek, Ilievski Filip
- Glm-dialog: Noise-tolerant Pre-training For Knowledge-grounded Dialogue Generation Zhang Jing, Zhang Xiaokang, Zhang-li Daniel, Yu Jifan, Yao Zijun, Ma Zeyao, Xu Yiqi, Wang Haohua, Zhang Xiaohan, Lin Nianyi, Lu Sunrui, Li Juanzi, Tang Jie
- The Potential And Pitfalls Of Using A Large Language Model Such As Chatgpt Or GPT-4 As A Clinical Assistant Zhang Jingqing, Sun Kai, Jagadeesh Akshay, Ghahfarokhi Mahta, Gupta Deepa, Gupta Ashok, Gupta Vibhor, Guo Yike
- Moqagpt: Zero-shot Multi-modal Open-domain Question Answering With Large Language Model Zhang Le, Wu Yihong, Mo Fengran, Nie Jian-yun, Agrawal Aishwarya
- Marathon: A Race Through The Realm Of Long Context With Large Language Models Zhang Lei, Li Yunshui, Liu Ziqiang, Yang Jiaxi, Liu Junhao, Chen Longze, Luo Run, Yang Min
- A Two-stage Adaptation Of Large Language Models For Text Ranking Zhang Longhui, Zhang Yanzhao, Long Dingkun, Xie Pengjun, Zhang Meishan, Zhang Min
- Knowgpt: Knowledge Graph Based Prompting For Large Language Models Zhang Qinggang, Dong Junnan, Chen Hao, Zha Daochen, Yu Zailiang, Huang Xiao
- Instruction Tuning For Large Language Models: A Survey Zhang Shengyu, Dong Linfeng, Li Xiaoya, Zhang Sen, Sun Xiaofei, Wang Shuhe, Li Jiwei, Hu Runyi, Zhang Tianwei, Wu Fei, Wang Guoyin
- Automl-gpt: Automatic Machine Learning With GPT Zhang Shujian, Gong Chengyue, Wu Lemeng, Liu Xingchao, Zhou Mingyuan
- Planning With Large Language Models For Code Generation Zhang Shun, Chen Zhenfang, Shen Yikang, Ding Mingyu, Tenenbaum Joshua B., Gan Chuang
- History-aware Hierarchical Transformer For Multi-session Open-domain Dialogue System Zhang Tong, Liu Yong, Li Boyang, Zeng Zhiwei, Wang Pengwei, You Yuan, Miao Chunyan, Cui Lizhen
- EHR Interaction Between Patients And AI: Noteaid EHR Interaction Zhang Xiaocheng, Yao Zonghai, Yu Hong
- PMC-VQA: Visual Instruction Tuning For Medical Visual Question Answering Zhang Xiaoman, Wu Chaoyi, Zhao Ziheng, Lin Weixiong, Zhang Ya, Wang Yanfeng, Xie Weidi
- Making Large Language Models Perform Better In Knowledge Graph Completion Zhang Yichi, Chen Zhuo, Guo Lingbing, Xu Yajing, Zhang Wen, Chen Huajun
- Meta Prompting For AI Systems Zhang Yifan, Yuan Yang, Yao Andrew Chi-chih
- Llamavae: Guiding Large Language Model Generation Via Continuous Latent Sentence Spaces Zhang Yingji, Carvalho Danilo S., Pratt-hartmann Ian, Freitas André
- Cached Transformers: Improving Transformers With Differentiable Memory Cache Zhang Zhaoyang, Shao Wenqi, Ge Yixiao, Wang Xiaogang, Gu Jinwei, Luo Ping
- VISAR: A Human-ai Argumentative Writing Assistant With Visual Programming And Rapid Draft Prototyping Zhang Zheng, Gao Jie, Dhaliwal Ranjodh Singh, Li Toby Jia-jun
- Students' Perceptions And Preferences Of Generative Artificial Intelligence Feedback For Programming Zhang Zhengdong, Dong Zihan, Shi Yang, Matsuda Noboru, Price Thomas, Xu Dongkuan
- H₂O: Heavy-hitter Oracle For Efficient Generative Inference Of Large Language Models Zhang Zhenyu, Sheng Ying, Zhou Tianyi, Chen Tianlong, Zheng Lianmin, Cai Ruisi, Song Zhao, Tian Yuandong, Ré Christopher, Barrett Clark, Wang Zhangyang, Chen Beidi
- Accountable Textual-visual Chat Learns To Reject Human Instructions In Image Re-creation Zhang Zhiwei, Liu Yuliang
- SVIT: Scaling Up Visual Instruction Tuning Zhao Bo, Wu Boya, He Muyang, Huang Tiejun
- Large Language Models Are Complex Table Parsers Zhao Bowen, Ji Changkai, Zhang Yuejie, He Wen, Wang Yingwen, Wang Qing, Feng Rui, Zhang Xiaobo
- Explainability For Large Language Models: A Survey Zhao Haiyan, Chen Hanjie, Yang Fan, Liu Ninghao, Deng Huiqi, Cai Hengyi, Wang Shuaiqiang, Yin Dawei, Du Mengnan
- Unimc: A Unified Framework For Long-term Memory Conversation Via Relevance Representation Learning Zhao Kang, Liu Wei, Luan Jian, Gao Minglei, Qian Li, Teng Hanlin, Wang Bin
- Retrieving Multimodal Information For Augmented Generation: A Survey Zhao Ruochen, Chen Hailin, Wang Weishi, Jiao Fangkai, Do Xuan Long, Qin Chengwei, Ding Bosheng, Guo Xiaobao, Li Minzhi, Li Xingxuan, Joty Shafiq
- Verify-and-edit: A Knowledge-enhanced Chain-of-thought Framework Zhao Ruochen, Li Xingxuan, Joty Shafiq, Qin Chengwei, Bing Lidong
- Group Preference Optimization: Few-shot Alignment Of Large Language Models Zhao Siyan, Dang John, Grover Aditya
- Pareto Optimal Learning For Estimating Large Language Model Errors Zhao Theodore, Wei Mu, Preston J. Samuel, Poon Hoifung
- DIVKNOWQA: Assessing The Reasoning Ability Of Llms Via Open-domain Question Answering Over Knowledge Base And Text Zhao Wenting, Liu Ye, Niu Tong, Wan Yao, Yu Philip S., Joty Shafiq, Zhou Yingbo, Yavuz Semih
- Easygen: Easing Multimodal Generation With Bidiffuser And Llms Zhao Xiangyu, Liu Bo, Liu Qijiong, Shi Guangyuan, Wu Xiao-ming
- Slic-hf: Sequence Likelihood Calibration With Human Feedback Zhao Yao, Joshi Rishabh, Liu Tianqi, Khalman Misha, Saleh Mohammad, Liu Peter J.
- Lookahead: An Inference Acceleration Framework For Large Language Model With Lossless Generation Accuracy Zhao Yao, Xie Zhitian, Liang Chen, Zhuang Chenyi, Gu Jinjie
- Investigating Table-to-text Generation Capabilities Of Llms In Real-world Information Seeking Scenarios Zhao Yilun, Zhang Haowei, Si Shengyun, Nan Linyong, Tang Xiangru, Cohan Arman
- Robut: A Systematic Study Of Table QA Robustness Against Human-annotated Adversarial Perturbations Zhao Yilun, Zhao Chen, Nan Linyong, Qi Zhenting, Zhang Wenlin, Tang Xiangru, Mi Boyu, Radev Dragomir
- Librisqa: A Novel Dataset And Framework For Spoken Question Answering With Large Language Models Zhao Zihan, Jiang Yiyang, Liu Heyang, Wang Yanfeng, Wang Yu
- Lmsys-chat-1m: A Large-scale Real-world LLM Conversation Dataset Zheng Lianmin, Chiang Wei-lin, Sheng Ying, Li Tianle, Zhuang Siyuan, Wu Zhanghao, Zhuang Yonghao, Li Zhuohan, Lin Zi, Xing Eric P., Gonzalez Joseph E., Stoica Ion, Zhang Hao
- Sglang: Efficient Execution Of Structured Language Model Programs Zheng Lianmin, Yin Liangsheng, Xie Zhiqiang, Sun Chuyue, Huang Jeff, Yu Cody Hao, Cao Shiyi, Kozyrakis Christos, Stoica Ion, Gonzalez Joseph E., Barrett Clark, Sheng Ying
- Memorycompanion: A Smart Healthcare Solution To Empower Efficient Alzheimer's Care Via Unleashing Generative AI Zheng Lifei, Heo Yeonie, Fang Yi
- Building Emotional Support Chatbots In The Era Of Llms Zheng Zhonghua, Liao Lizi, Deng Yang, Nie Liqiang
- Marinegpt: Unlocking Secrets Of Ocean To The Public Zheng Ziqiang, Zhang Jipeng, Vu Tuan-anh, Diao Shizhe, Tim Yue Him Wong, Yeung Sai-kit
- CRITIC: Large Language Models Can Self-correct With Tool-interactive Critiquing Zhibin Gou, Zhihong Shao, Yeyun Gong, Yelong Shen, Yujiu Yang, Nan Duan, Weizhu Chen
- The Rise And Potential Of Large Language Model Based Agents: A Survey Zhiheng Xi, Wenxiang Chen, Xin Guo, Wei He, Yiwen Ding, Boyang Hong, Ming Zhang, Junzhe Wang, Senjie Jin, Enyu Zhou, Rui Zheng, Xiaoran Fan, Xiao Wang, Limao Xiong, Yuhao Zhou, Weiran Wang, Changhao Jiang, Yicheng Zou, Xiangyang Liu, Zhangyue Yin, Shihan Dou, Rongxiang Weng, Wensen Cheng, Qi Zhang, Wenjuan Qin, Yongyan Zheng, Xipeng Qiu, Xuanjing Huang, Tao Gui
- Exploring Human-like Translation Strategy With Large Language Models Zhiwei He, Tian Liang, Wenxiang Jiao, Zhuosheng Zhang, Yujiu Yang, Rui Wang, Zhaopeng Tu, Shuming Shi, Xing Wang
- Bag Of Tricks For Effective Language Model Pretraining And Downstream Adaptation: A Case Study On GLUE Zhong Qihuang, Ding Liang, Peng Keqin, Liu Juhua, Du Bo, Shen Li, Zhan Yibing, Tao Dacheng
- Let's Think Outside The Box: Exploring Leap-of-thought In Large Language Models With Creative Humor Generation Zhong Shanshan, Huang Zhongzhan, Gao Shanghua, Wen Wushao, Lin Liang, Zitnik Marinka, Zhou Pan
- Mquake: Assessing Knowledge Editing In Language Models Via Multi-hop Questions Zhong Zexuan, Wu Zhengxuan, Manning Christopher D., Potts Christopher, Chen Danqi
- Batch Calibration: Rethinking Calibration For In-context Learning And Prompt Engineering Zhou Han, Wan Xingchen, Proleev Lev, Mincu Diana, Chen Jilin, Heller Katherine, Roy Subhrajit
- Ethical Chatgpt: Concerns, Challenges, And Commandments Zhou Jianlong, Müller Heimo, Holzinger Andreas, Chen Fang
- Think Before You Speak: Cultivating Communication Skills Of Large Language Models Via Inner Monologue Zhou Junkai, Pang Liang, Shen Huawei, Cheng Xueqi
- Exploring Recommendation Capabilities Of GPT-4V(ision): A Preliminary Case Study Zhou Peilin, Cao Meng, Huang You-liang, Ye Qichen, Zhang Peiyan, Liu Junling, Xie Yueqi, Hua Yining, Kim Jaeboum
- Infmllm: A Unified Framework For Visual-language Tasks Zhou Qiang, Wang Zhibin, Chu Wei, Xu Yinghui, Li Hao, Qi Yuan
- Multi-stage Pre-training Enhanced By Chatgpt For Multi-scenario Multi-domain Dialogue Summarization Zhou Weixiao, Li Gengyao, Cheng Xianfu, Liang Xinnian, Zhu Junnan, Zhai Feifei, Li Zhoujun
- Making Harmful Behaviors Unlearnable For Large Language Models Zhou Xin, Lu Yi, Ma Ruotian, Gui Tao, Zhang Qi, Huang Xuanjing
- Towards Joint Modeling Of Dialogue Response And Speech Synthesis Based On Large Language Model Zhou Xinyu, Chen Delong, Chen Yudong
- Can GPT Models Follow Human Summarization Guidelines? Evaluating Chatgpt And GPT-4 For Dialogue Summarization Zhou Yongxin, Ringeval Fabien, Portet François
- Quantifying And Analyzing Entity-level Memorization In Large Language Models Zhou Zhenhong, Xiang Jiuyang, Chen Chaomeng, Su Sen
- Dialogue-contextualized Re-ranking For Medical History-taking Zhu Jian, Valmianski Ilya, Kannan Anitha
- VL-GPT: A Generative Pre-trained Transformer For Vision And Language Understanding And Generation Zhu Jinguo, Ding Xiaohan, Ge Yixiao, Ge Yuying, Zhao Sijie, Zhao Hengshuang, Wang Xiaohua, Shan Ying
- Promptbench: A Unified Library For Evaluation Of Large Language Models Zhu Kaijie, Zhao Qinlin, Chen Hao, Wang Jindong, Xie Xing
- Are NLP Models Good At Tracing Thoughts: An Overview Of Narrative Understanding Zhu Lixing, Zhao Runcong, Gui Lin, He Yulan
- Promptcblue: A Chinese Prompt Tuning Benchmark For The Medical Domain Zhu Wei, Wang Xiaoling, Zheng Huanran, Chen Mosha, Tang Buzhou
- Multilingual Machine Translation With Large Language Models: Empirical Results And Analysis Zhu Wenhao, Liu Hongyi, Dong Qingxiu, Xu Jingjing, Huang Shujian, Kong Lingpeng, Chen Jiajun, Li Lei
- Question Answering As Programming For Solving Time-sensitive Questions Zhu Xinyu, Yang Cheng, Chen Bei, Li Siheng, Lou Jian-guang, Yang Yujiu
- Furthest Reasoning With Plan Assessment: Stable Reasoning Path With Retrieval-augmented Large Language Models Zhu Yin, Luo Zhiling, Cheng Gong
- 3d-vista: Pre-trained Transformer For 3D Vision And Text Alignment Zhu Ziyu, Ma Xiaojian, Chen Yixin, Deng Zhidong, Huang Siyuan, Li Qing
- Open-source Large Language Models Are Strong Zero-shot Query Likelihood Models For Document Ranking Zhuang Shengyao, Liu Bing, Koopman Bevan, Zuccon Guido
- Toolqa: A Dataset For LLM Question Answering With External Tools Zhuang Yuchen, Yu Yue, Wang Kuan, Sun Haotian, Zhang Chao
- Ice-score: Instructing Large Language Models To Evaluate Code Zhuo Terry Yue
- Red Teaming Chatgpt Via Jailbreaking: Bias, Robustness, Reliability And Toxicity Zhuo Terry Yue, Huang Yujin, Chen Chunyang, Xing Zhenchang
- Scissorhands: Exploiting The Persistence Of Importance Hypothesis For LLM KV Cache Compression At Test Time Zichang Liu, Aditya Desai, Fangshuo Liao, Weitao Wang, Victor Xie, Zhaozhuo Xu, Anastasios Kyrillidis, Anshumali Shrivastava
- Recommender Systems In The Era Of Large Language Models (LLMs) Zihuai Zhao, Wenqi Fan, Jiatong Li, Yunqing Liu, Xiaowei Mei, Yiqi Wang, Zhen Wen, Fei Wang, Xiangyu Zhao, Jiliang Tang, Qing Li
- A Family Of Pretrained Transformer Language Models For Russian Zmitrovich Dmitry, Abramov Alexander, Kalmykov Andrey, Tikhonova Maria, Taktasheva Ekaterina, Astafurov Danil, Baushenko Mark, Snegirev Artem, Kadulin Vitalii, Markov Sergey, Shavrina Tatiana, Mikhailov Vladislav, Fenogenova Alena
- Incorporating Probing Signals Into Multimodal Machine Translation Via Visual Question-answering Pairs Zuo Yuxin, Li Bei, Lv Chuanhao, Zheng Tong, Xiao Tong, Zhu Jingbo
- A Dataset And Benchmark For Hospital Course Summarization With Adapted Large Language Models Aali Asad, Van Veen Dave, Arefeen Yamin Ishraq, Hom Jason, Bluethgen Christian, Reis Eduardo Pontes, Gatidis Sergios, Clifford Namuun, Daws Joseph, Tehrani Arash S., Kim Jangwon, Chaudhari Akshay S.
- Psycholex: Unveiling The Psychological Mind Of Large Language Models Abbasi Mohammad Amin, Mirnezami Farnaz Sadat, Naderi Hassan
- Can Llms Be Fooled? Investigating Vulnerabilities In Llms Abdali Sara, He Jia, Barberan Cj, Anarfi Richard
- Are You Still On Track!? Catching LLM Task Drift With Activations Abdelnabi Sahar, Fay Aideen, Cherubin Giovanni, Salem Ahmed, Fritz Mario, Paverd Andrew
- Synthetic Dialogue Dataset Generation Using LLM Agents Abdullin Yelaman, Molla-aliod Diego, Ofoghi Bahadorreza, Yearwood John, Li Qingyang
- The Challenges Of Evaluating LLM Applications: An Analysis Of Automated, Human, And Llm-based Approaches Abeysinghe Bhashithe, Circi Ruhan
- Supporting Student Decisions On Learning Recommendations: An Llm-based Chatbot With Knowledge Graph Contextualization For Conversational Explainability And Mentoring Abu-rasheed Hasan, Abdulsalam Mohamad Hussam, Weber Christian, Fathi Madjid
- Knowledge Graphs As Context Sources For Llm-based Explanations Of Learning Recommendations Abu-rasheed Hasan, Weber Christian, Fathi Madjid
- Pathocl: Path-based Prompt Augmentation For OCL Generation With GPT-4 Abukhalaf Seif, Hamdaqa Mohammad, Khomh Foutse
- A Survey On Symbolic Knowledge Distillation Of Large Language Models Acharya Kamal, Velasquez Alvaro, Song Houbing Herbert
- Fine-grained Contract NER Using Instruction Based Model Adibhatla Hiranmai Sri, Baswani Pavan, Shrivastava Manish
- Keyformer: KV Cache Reduction Through Key Tokens Selection For Efficient Generative Inference Adnan Muhammad, Arunkumar Akhil, Jain Gaurav, Nair Prashant J., Soloveychik Ilya, Kamath Purushotham
- Copilot Evaluation Harness: Evaluating Llm-guided Software Programming Agarwal Anisha, Chan Aaron, Chandel Shubham, Jang Jinu, Miller Shaun, Moghaddam Roshanak Zilouchian, Mohylevskyy Yevhen, Sundaresan Neel, Tufano Michele
- Step-by-step Unmasking For Parameter-efficient Fine-tuning Of Large Language Models Agarwal Aradhye, Ramesh Suhas K, Sengupta Ayan, Chakraborty Tanmoy
- Faithfulness Vs. Plausibility: On The (un)reliability Of Explanations From Large Language Models Agarwal Chirag, Tanneru Sree Harsha, Lakkaraju Himabindu
- Mememqa: Multimodal Question Answering For Memes Via Rationale-based Inferencing Agarwal Siddhant, Sharma Shivam, Nakov Preslav, Chakraborty Tanmoy
- Tricy: Trigger-guided Data-to-text Generation With Intent Aware Attention-copy Agarwal Vibhav, Ghosh Sourav, Bss Harichandana, Arora Himanshu, Raja Barath Raj Kandur
- Codemirage: Hallucinations In Code Generated By Large Language Models Agarwal Vibhor, Pei Yulong, Alamir Salwa, Liu Xiaomo
- Enabling High-sparsity Foundational Llama Models With Efficient Pretraining And Deployment Agarwalla Abhinav, Gupta Abhay, Marques Alexandre, Pandit Shubhra, Goin Michael, Kurtic Eldar, Leong Kevin, Nguyen Tuan, Salem Mahmoud, Alistarh Dan, Lie Sean, Kurtz Mark
- Etalon: Holistic Performance Evaluation Framework For LLM Inference Systems Agrawal Amey, Agarwal Anmol, Kedia Nitin, Mohan Jayashree, Kundu Souvik, Kwatra Nipun, Ramjee Ramachandran, Tumanov Alexey
- Mindful-rag: A Study Of Points Of Failure In Retrieval Augmented Generation Agrawal Garima, Kumarage Tharindu, Alghamdi Zeyad, Liu Huan
- The Curious Case Of Nonverbal Abstract Reasoning With Multi-modal Large Language Models Ahrabian Kian, Sourati Zhivar, Sun Kexuan, Zhang Jiarui, Jiang Yifan, Morstatter Fred, Pujara Jay
- Sphinx: Sample Efficient Multilingual Instruction Fine-tuning Through N-shot Guided Prompting Ahuja Sanchit, Tanmay Kumar, Chauhan Hardik Hansrajbhai, Patra Barun, Aggarwal Kriti, Del Corro Luciano, Mitra Arindam, Dhamecha Tejas Indulal, Awadallah Ahmed, Choudhary Monojit, Chaudhary Vishrav, Sitaram Sunayana
- QASE Enhanced Plms: Improved Control In Text Generation For MRC Ai Lin, Hui Zheng, Liu Zizhou, Hirschberg Julia
- Semantically Diverse Language Generation For Uncertainty Estimation In Language Models Aichberger Lukas, Schweighofer Kajetan, Ielanskyi Mykyta, Hochreiter Sepp
- Plug And Play With Prompts: A Prompt Tuning Approach For Controlling Text Generation Ajwani Rohan Deepak, Zhu Zining, Rose Jonathan, Rudzicz Frank
- Clustered Retrieved Augmented Generation (CRAG) Akesson Simon, Santos Frances A.
- Shadowllm: Predictor-based Contextual Sparsity For Large Language Models Akhauri Yash, Abouelhamayed Ahmed F, Dotzel Jordan, Zhang Zhiru, Rush Alexander M, Huda Safeen, Abdelfattah Mohamed S
- FACTS About Building Retrieval Augmented Generation-based Chatbots Akkiraju Rama, Xu Anbang, Bora Deepak, Yu Tan, An Lu, Seth Vishal, Shukla Aaditya, Gundecha Pritam, Mehta Hridhay, Jha Ashwin, Raj Prithvi, Balasubramanian Abhinav, Maram Murali, Muthusamy Guru, Annepally Shivakesh Reddy, Knowles Sidney, Du Min, Burnett Nick, Javiya Sean, Marannan Ashok, Kumari Mamta, Jha Surbhi, Dereszenski Ethan, Chakraborty Anupam, Ranjan Subhash, Terfai Amina, Surya Anoop, Mercer Tracey, Thanigachalam Vinodh Kumar, Bar Tamar, Krishnan Sanjana, Kilaru Samy, Jaksic Jasmine, Algarici Nave, Liberman Jacob, Conway Joey, Nayyar Sonu, Boitano Justin
- Benchmarking Open-source Language Models For Efficient Question Answering In Industrial Applications Alassan Mahaman Sanoussi Yahaya, Espejel Jessica López, Bouhandi Merieme, Dahhane Walid, Ettifouri El Hassane
- Enhancing Textbook Question Answering Task With Large Language Models And Retrieval Augmented Generation Alawwad Hessa Abdulrahman, Alhothali Areej, Naseem Usman, Alkhathlan Ali, Jamal Amani
- Should We Fine-tune Or RAG? Evaluating Different Techniques To Adapt Llms For Dialogue Alghisi Simone, Rizzoli Massimo, Roccabruna Gabriel, Mousavi Seyed Mahed, Riccardi Giuseppe
- Evaluating The Retrieval Component In Llm-based Question Answering Systems Alinejad Ashkan, Kumar Krtin, Vahdat Ali
- Chatgpt Alternative Solutions: Large Language Models Survey Alipour Hanieh, Pendar Nick, Roy Kohinoor
- Toward Conversational Agents With Context And Time Sensitive Long-term Memory Alonso Nick, Figliolia Tomás, Ndirango Anthony, Millidge Beren
- D-NLP At Semeval-2024 Task 2: Evaluating Clinical Inference Capabilities Of Large Language Models Altinok Duygu
- Syntaxshap: Syntax-aware Explainability Method For Text Generation Amara Kenza, Sevastjanova Rita, El-assady Mennatallah
- Do Llms Exhibit Human-like Reasoning? Evaluating Theory Of Mind In Llms For Open-ended Responses Amirizaniani Maryam, Martin Elias, Sivachenko Maryna, Mashhadi Afra, Shah Chirag
- Funaudiollm: Voice Understanding And Generation Foundation Models For Natural Interaction Between Humans And Llms An Keyu, Chen Qian, Deng Chong, Du Zhihao, Gao Changfeng, Gao Zhifu, Gu Yue, He Ting, Hu Hangrui, Hu Kai, Ji Shengpeng, Li Yabin, Li Zerui, Lu Heng, Luo Haoneng, Lv Xiang, Ma Bin, Ma Ziyang, Ni Chongjia, Song Changhe, Shi Jiaqi, Shi Xian, Wang Hao, Wang Wen, Wang Yuxuan, Xiao Zhangyu, Yan Zhijie, Yang Yexin, Zhang Bin, Zhang Qinglin, Zhang Shiliang, Zhao Nan, Zheng Siqi
- Make Your LLM Fully Utilize The Context An Shengnan, Ma Zexiong, Lin Zeqi, Zheng Nanning, Lou Jian-guang
- Mm-phyrlhf: Reinforcement Learning Framework For Multimodal Physics Question-answering Anand Avinash, Kapuriya Janak, Kirtani Chhavi, Singh Apoorv, Saraf Jay, Lal Naman, Kumar Jatin, Shivam Adarsh Raj, Verma Astha, Shah Rajiv Ratn, Zimmermann Roger
- Tensor Train Low-rank Approximation (tt-lora): Democratizing AI With Accelerated Llms Anjum Afia, Eren Maksim E., Boureima Ismael, Alexandrov Boian, Bhattarai Manish
- Generalization V.s. Memorization: Tracing Language Models' Capabilities Back To Pretraining Data Antoniades Antonis, Wang Xinyi, Elazar Yanai, Amayuelas Alfonso, Albalak Alon, Zhang Kexun, Wang William Yang
- Evaluating Chatgpt On Nuclear Domain-specific Data Anwar Muhammad, De Costa Mischa, Hammad Issam, Lau Daniel
- A Framework For Fine-tuning Llms Using Heterogeneous Feedback Aponte Ryan, Rossi Ryan A., Guo Shunan, Dernoncourt Franck, Yu Tong, Chen Xiang, Mitra Subrata, Lipka Nedim
- An Assessment On Comprehending Mental Health Through Large Language Models Arcan Mihael, Niland David-paul, Delahunty Fionn
- Calmqa: Exploring Culturally Specific Long-form Question Answering Across 23 Languages Arora Shane, Karpinska Marzena, Chen Hung-ting, Bhattacharjee Ipsita, Iyyer Mohit, Choi Eunsol
- Exploring The Robustness Of Task-oriented Dialogue Systems For Colloquial German Varieties Artemova Ekaterina, Blaschke Verena, Plank Barbara
- To Code, Or Not To Code? Exploring Impact Of Code In Pre-training Aryabumi Viraat, Su Yixuan, Ma Raymond, Morisot Adrien, Zhang Ivan, Locatelli Acyr, Fadaee Marzieh, Üstün Ahmet, Hooker Sara
- Assessing Llms For Zero-shot Abstractive Summarization Through The Lens Of Relevance Paraphrasing Askari Hadi, Chhabra Anshuman, Chen Muhao, Mohapatra Prasant
- Elicitron: An LLM Agent-based Simulation Framework For Design Requirements Elicitation Ataei Mohammadmehdi, Cheong Hyunmin, Grandi Daniele, Wang Ye, Morris Nigel, Tessier Alexander
- Can Open-source Llms Compete With Commercial Models? Exploring The Few-shot Performance Of Current GPT Models In Biomedical Tasks Ateia Samy, Kruschwitz Udo
- Bifurcated Attention: Accelerating Massively Parallel Decoding With Shared Prefixes In Llms Athiwaratkun Ben, Gonugondla Sujan Kumar, Gouda Sanjay Krishna, Qian Haifeng, Ding Hantian, Sun Qing, Wang Jun, Guo Jiacheng, Chen Liangfu, Bhatia Parminder, Nallapati Ramesh, Sengupta Sudipta, Xiang Bing
- GRAD-SUM: Leveraging Gradient Summarization For Optimal Prompt Engineering Austin Derek, Chartock Elliott
- Semscore: Automated Evaluation Of Instruction-tuned Llms Based On Semantic Textual Similarity Aynetdinov Ansar, Akbik Alan
- Current State Of LLM Risks And AI Guardrails Ayyamperumal Suriya Ganesh, Ge Limin
- Feedback-generation For Programming Exercises With GPT-4 Azaiz Imen, Kiesler Natalie, Strickroth Sven
- Llm-driven Robots Risk Enacting Discrimination, Violence, And Unlawful Actions Azeem Rumaisa, Hundt Andrew, Mansouri Masoumeh, Brandão Martim
- Accuracy And Consistency Of Llms In The Registered Dietitian Exam: The Impact Of Prompt Engineering And Knowledge Retrieval Azimi Iman, Qi Mohan, Wang Li, Rahmani Amir M., Li Youlin
- Lamda: Large Model Fine-tuning Via Spectrally Decomposed Low-dimensional Adaptation Azizi Seyedarmin, Kundu Souvik, Pedram Massoud
- Hallucination Of Multimodal Large Language Models: A Survey Bai Zechen, Wang Pichao, Xiao Tianjun, He Tong, Han Zongbo, Zhang Zheng, Shou Mike Zheng
- Downstream Bias Mitigation Is All You Need Baksi Arkadeep, Singh Rahul, Joshi Tarun
- RAG Vs Fine-tuning: Pipelines, Tradeoffs, And A Case Study On Agriculture Balaguer Angels, Benara Vinamra, Cunha Renato Luiz De Freitas, Filho Roberto De M. Estevão, Hendry Todd, Holstein Daniel, Marsman Jennifer, Mecklenburg Nick, Malvar Sara, Nunes Leonardo O., Padilha Rafael, Sharp Morris, Silva Bruno, Sharma Swati, Aski Vijay, Chandra Ranveer
- CYGENT: A Cybersecurity Conversational Agent With Log Summarization Powered By GPT-3 Balasubramanian Prasasthy, Seby Justin, Kostakos Panos
- Artifacts Or Abduction: How Do Llms Answer Multiple-choice Questions Without The Question? Balepur Nishant, Ravichander Abhilasha, Rudinger Rachel
- Demystifying Platform Requirements For Diverse LLM Inference Use Cases Bambhaniya Abhimanyu, Raj Ritik, Jeong Geonhwa, Kundu Souvik, Srinivasan Sudarshan, Elavazhagan Midhilesh, Kumar Madhu, Krishna Tushar
- Llms Will Always Hallucinate, And We Need To Live With This Banerjee Sourav, Agarwal Ayushi, Singla Saloni
- Comparing Bad Apples To Good Oranges: Aligning Large Language Models Via Joint Preference Optimization Bansal Hritik, Suvarna Ashima, Bhatt Gantavya, Peng Nanyun, Chang Kai-wei, Grover Aditya
- LLM Augmented Llms: Expanding Capabilities Through Composition Bansal Rachit, Samanta Bidisha, Dalmia Siddharth, Gupta Nitish, Vashishth Shikhar, Ganapathy Sriram, Bapna Abhishek, Jain Prateek, Talukdar Partha
- To Each (textual Sequence) Its Own: Improving Memorized-data Unlearning In Large Language Models Barbulescu George-octavian, Triantafillou Peter
- TTQA-RS- A Break-down Prompting Approach For Multi-hop Table-text Question Answering With Reasoning And Summarization Bardhan Jayetri, Xiao Bushi, Wang Daisy Zhe
- Find The Gap: Knowledge Base Reasoning For Visual Question Answering Barezi Elham J., Kordjamshidi Parisa
- Fine-tuning Or Fine-failing? Debunking Performance Myths In Large Language Models Barnett Scott, Brannelly Zac, Kurniawan Stefanus, Wong Sheng
- Slicerchat: Building A Local Chatbot For 3D Slicer Barr Colton
- Llms' Reading Comprehension Is Affected By Parametric Knowledge And Struggles With Hypothetical Statements Basmov Victoria, Goldberg Yoav, Tsarfaty Reut
- A Comparative Study Of DSL Code Generation: Fine-tuning Vs. Optimized Retrieval Augmentation Bassamzadeh Nastaran, Methani Chhaya
- NESTFUL: A Benchmark For Evaluating Llms On Nested Sequences Of API Calls Basu Kinjal, Abdelaziz Ibrahim, Bradford Kelsey, Crouse Maxwell, Kate Kiran, Kumaravel Sadhana, Goyal Saurabh, Munawar Asim, Rizk Yara, Wang Xin, Lastras Luis, Kapanipathi Pavan
- Understanding Information Storage And Transfer In Multi-modal Large Language Models Basu Samyadeep, Grayson Martin, Morrison Cecily, Nushi Besmira, Feizi Soheil, Massiceti Daniela
- Internalinspector \(I^2\): Robust Confidence Estimation In Llms Through Internal States Beigi Mohammad, Shen Ying, Yang Runing, Lin Zihao, Wang Qifan, Mohan Ankith, He Jianfeng, Jin Ming, Lu Chang-tien, Huang Lifu
- Luna: An Evaluation Foundation Model To Catch Language Model Hallucinations With High Accuracy And Low Cost Belyi Masha, Friel Robert, Shao Shuai, Sanyal Atindriyo
- Crafting Interpretable Embeddings By Asking Llms Questions Benara Vinamra, Singh Chandan, Morris John X., Antonello Richard, Stoica Ion, Huth Alexander G., Gao Jianfeng
- SUTRA: Scalable Multilingual Language Model Architecture Bendale Abhijit, Sapienza Michael, Ripplinger Steven, Gibbs Simon, Lee Jaewon, Mistry Pranav
- Machine Translation Hallucination Detection For Low And High Resource Languages Using Large Language Models Benkirane Kenza, Gongas Laura, Pelles Shahar, Fuchs Naomi, Darmon Joshua, Stenetorp Pontus, Adelani David Ifeoluwa, Sánchez Eduardo
- Prompting Large Language Models With Human Error Markings For Self-correcting Machine Translation Berger Nathaniel, Riezler Stefan, Exel Miriam, Huck Matthias
- Think Big, Generate Quick: Llm-to-slm For Fast Autoregressive Decoding Bergner Benjamin, Skliar Andrii, Royer Amelie, Blankevoort Tijmen, Asano Yuki, Bejnordi Babak Ehteshami
- Checkembed: Effective Verification Of LLM Solutions To Open-ended Tasks Besta Maciej, Paleari Lorenzo, Kubicek Ales, Nyczyk Piotr, Gerstenberger Robert, Iff Patrick, Lehmann Tomasz, Niewiadomski Hubert, Hoefler Torsten
- Enhancing Event Reasoning In Large Language Models Through Instruction Fine-tuning With Semantic Causal Graphs Bethany Mazal, Bethany Emet, Wherry Brandon, Chiang Cho-yu, Vishwamitra Nishant, Rios Anthony, Najafirad Peyman
- Taking The Next Step With Generative Artificial Intelligence: The Transformative Role Of Multimodal Large Language Models In Science Education Bewersdorff Arne, Hartmann Christian, Hornberger Marie, Seßler Kathrin, Bannert Maria, Kasneci Enkelejda, Kasneci Gjergji, Zhai Xiaoming, Nerdel Claudia
- Soft Prompting For Unlearning In Large Language Models Bhaila Karuna, Van Minh-hao, Wu Xintao
- On The Robustness Of Language Models For Tabular Question Answering Bhandari Kushal Raj, Xing Sixue, Dan Soham, Gao Jianxi
- An Approach To Build Zero-shot Slot-filling System For Industry-grade Conversational Assistants Bhargav G P Shrivatsa, Neelam Sumit, Sharma Udit, Ikbal Shajith, Sreedhar Dheeraj, Karanam Hima, Joshi Sachindra, Dhoolia Pankaj, Garg Dinesh, Croutwater Kyle, Qi Haode, Wayne Eric, Murdock J William
- Speculative Streaming: Fast LLM Inference Without Auxiliary Models Bhendawade Nikhil, Belousova Irina, Fu Qichen, Mason Henry, Rastegari Mohammad, Najibi Mahyar
- Iterative Refinement Of Project-level Code Context For Precise Code Generation With Compiler Feedback Bi Zhangqian, Wan Yao, Wang Zheng, Zhang Hongyu, Guan Batu, Lu Fangxin, Zhang Zili, Sui Yulei, Jin Hai, Shi Xuanhua
- Learning Shortcuts: On The Misleading Promise Of NLU In Language Models Bihani Geetanjali, Rayz Julia Taylor
- Lipost: Improved Content Understanding With Effective Use Of Multi-task Contrastive Learning Bindal Akanksha, Ramanujam Sudarshan, Golland Dave, Hazen Tj, Jiang Tina, Zhang Fengyu, Yan Peng
- Statically Contextualizing Large Language Models With Typed Holes Blinn Andrew, Li Xiang, Kim June Hyung, Omar Cyrus
- Task-oriented Dialogue With In-context Learning Bocklisch Tom, Werkmeister Thomas, Varshneya Daksh, Nichol Alan
- Towards Cross-tokenizer Distillation: The Universal Logit Distillation Loss For Llms Boizard Nicolas, Haddad Kevin El, Hudelot Céline, Colombo Pierre
- Rambla: A Framework For Evaluating The Reliability Of Llms As Assistants In The Biomedical Domain Bolton William James, Poyiadzi Rafael, Morrell Edward R., Bueno Gabriela Van Bergen Gonzalez, Goetz Lea
- Windows Agent Arena: Evaluating Multi-modal OS Agents At Scale Bonatti Rogerio, Zhao Dan, Bonacci Francesco, Dupont Dillon, Abdali Sara, Li Yinheng, Wagle Justin, Koishida Kazuhito, Bucker Arthur, Jang Lawrence, Hui Zack
- The Unreasonable Ineffectiveness Of Nucleus Sampling On Mitigating Text Memorization Borec Luka, Sadler Philipp, Schlangen David
- Improving Language Models Trained On Translated Data With Continual Pre-training And Dictionary Learning Analysis Boughorbel Sabri, Parvez Md Rizwan, Hawasly Majd
- Retrieving Examples From Memory For Retrieval Augmented Neural Machine Translation: A Systematic Comparison Bouthors Maxime, Crego Josep, Yvon Francois
- Improving Sequential Recommendations With Llms Boz Artun, Zorgdrager Wouter, Kotti Zoe, Harte Jesse, Louridas Panos, Jannach Dietmar, Fragkoulis Marios
- Conditioning Llms With Emotion In Neural Machine Translation Brazier Charles, Rouas Jean-luc
- Large Language Models "ad Referendum": How Good Are They At Machine Translation In The Legal Domain? Briva-iglesias Vicent, Camargo Joao Lucas Cavalheiro, Dogru Gokhan
- LOCOST: State-space Models For Long Document Abstractive Summarization Bronnec Florian Le, Duong Song, Ravaut Mathieu, Allauzen Alexandre, Chen Nancy F., Guigue Vincent, Lumbreras Alberto, Soulier Laure, Gallinari Patrick
- Exploring Precision And Recall To Assess The Quality And Diversity Of Llms Bronnec Florian Le, Verine Alexandre, Negrevergne Benjamin, Chevaleyre Yann, Allauzen Alexandre
- Generation, Distillation And Evaluation Of Motivational Interviewing-style Reflections With A Foundational Language Model Brown Andrew, Zhu Jiading, Abdelwahab Mohamed, Dong Alec, Wang Cindy, Rose Jonathan
- Enhancing Trust In Llms: Algorithms For Comparing And Interpreting Llms Brown Nik Bear
- Pheme: Efficient And Conversational Speech Generation Budzianowski Paweł, Sereda Taras, Cichy Tomasz, Vulić Ivan
- Pangea: Procedural Artificial Narrative Using Generative AI For Turn-based Video Games Buongiorno Steph, Klinkert Lawrence Jake, Chawla Tanishq, Zhuang Zixin, Clark Corey
- Conversational Complexity For Assessing Risk In Large Language Models Burden John, Cebrian Manuel, Hernandez-orallo Jose
- Aligning Large Language Models With Counterfactual DPO Butcher Bradley
- Reducing Hallucination In Structured Outputs Via Retrieval-augmented Generation Béchard Patrice, Ayala Orlando Marquez
- Olora: Orthonormal Low-rank Adaptation Of Large Language Models Büyükakyüz Kerim
- On Large Language Models In National Security Applications Caballero William N., Jenkins Phillip R.
- Wiki-llava: Hierarchical Retrieval-augmented Generation For Multimodal Llms Caffagni Davide, Cocchi Federico, Moratelli Nicholas, Sarto Sara, Cornia Marcella, Baraldi Lorenzo, Cucchiara Rita
- On The Transformations Across Reward Model, Parameter Update, And In-context Prompt Cai Deng, Li Huayang, Fu Tingchen, Li Siheng, Xu Weiwen, Li Shuaiyi, Cao Bowen, Zhang Zhisong, Huang Xinting, Cui Leyang, Wang Yan, Liu Lemao, Watanabe Taro, Shi Shuming
- Sciassess: Benchmarking LLM Proficiency In Scientific Literature Analysis Cai Hengxing, Cai Xiaochen, Chang Junhan, Li Sihang, Yao Lin, Wang Changxin, Gao Zhifeng, Wang Hongshuai, Li Yongge, Lin Mujie, Yang Shuwen, Wang Jiankun, Xu Mingjun, Huang Jin, Xi Fang, Zhuang Jiaxi, Yin Yuqi, Li Yaqi, Chen Changhong, Cheng Zheng, Zhao Zifeng, Zhang Linfeng, Ke Guolin
- Rethinking How To Evaluate Language Model Jailbreak Cai Hongyu, Arunasalam Arjun, Lin Leo Y., Bianchi Antonio, Celik Z. Berkay
- CERET: Cost-effective Extrinsic Refinement For Text Generation Cai Jason, Su Hang, Sunkara Monica, Shalyminov Igor, Mansour Saab
- A Survey On Mixture Of Experts Cai Weilin, Jiang Juyong, Wang Fan, Tang Jing, Kim Sunghun, Huang Jiayi
- Towards Large Language Model Aided Program Refinement Cai Yufan, Hou Zhe, Luan Xiaokun, Baena David Miguel Sanan, Lin Yun, Sun Jun, Dong Jin Song
- PRESTO: Progressive Pretraining Enhances Synthetic Chemistry Outcomes Cao He, Shao Yanjun, Liu Zhiyuan, Liu Zijing, Tang Xiangru, Yao Yuan, Li Yu
- Recent Advances In Text Embedding: A Comprehensive Review Of Top-performing Methods On The MTEB Benchmark Cao Hongliu
- Beyond Sparse Rewards: Enhancing Reinforcement Learning With Language Model Critique In Text Generation Cao Meng, Shu Lei, Yu Lei, Zhu Yun, Wichers Nevan, Liu Yinxiao, Meng Lei
- Personalized Steering Of Large Language Models: Versatile Steering Vectors Through Bi-directional Preference Optimization Cao Yuanpu, Zhang Tianrong, Cao Bochuan, Yin Ziyi, Lin Lu, Ma Fenglong, Chen Jinghui
- Survey On Large Language Model-enhanced Reinforcement Learning: Concept, Taxonomy, And Methods Cao Yuji, Zhao Huan, Cheng Yuheng, Shu Ting, Liu Guolong, Liang Gaoqi, Zhao Junhua, Li Yun
- Captioning Visualizations With Large Language Models (CVLLM): A Tutorial Carenini Giuseppe, Johnson Jordon, Salamatian Ali
- A Review Of Multi-modal Large Language And Vision Models Carolan Kilian, Fennelly Laura, Smeaton Alan F.
- HAMMR: Hierarchical Multimodal React Agents For Generic VQA Castrejon Lluis, Mensink Thomas, Zhou Howard, Ferrari Vittorio, Araujo Andre, Uijlings Jasper
- AMEX: Android Multi-annotation Expo Dataset For Mobile GUI Agents Chai Yuxiang, Huang Siyuan, Niu Yazhe, Xiao Han, Liu Liang, Zhang Dingyu, Gao Peng, Ren Shuai, Li Hongsheng
- Detoxbench: Benchmarking Large Language Models For Multitask Fraud & Abuse Detection Chakraborty Joymallya, Xia Wei, Majumder Anirban, Ma Dan, Chaabene Walid, Janvekar Naveed
- Investigating Llms As Voting Assistants Via Contextual Augmentation: A Case Study On The European Parliament Elections 2024 Chalkidis Ilias
- Scaling Synthetic Data Creation With 1,000,000,000 Personas Chan Xin, Wang Xiaoyang, Yu Dian, Mi Haitao, Yu Dong
- Goldfish: Monolingual Language Models For 350 Languages Chang Tyler A., Arnett Catherine, Tu Zhuowen, Bergen Benjamin K.
- Detecting Hallucination And Coverage Errors In Retrieval Augmented Generation For Controversial Topics Chang Tyler A., Tomanek Katrin, Hoffmann Jessica, Thain Nithum, Van Liemt Erin, Meier-hellstern Kathleen, Dixon Lucas
- Injecting Salesperson's Dialogue Strategies In Large Language Models With Chain-of-thought Reasoning Chang Wen-yu, Chen Yun-nung
- Bias-aware Low-rank Adaptation: Mitigating Catastrophic Inheritance Of Large Language Models Chang Yupeng, Chang Yi, Wu Yuan
- Xprompt: Explaining Large Language Model's Generation Via Joint Prompt Attribution Chang Yurui, Cao Bochuan, Wang Yujia, Chen Jinghui, Lin Lu
- When Large Language Models Meet Evolutionary Algorithms Chao Wang, Zhao Jiaxuan, Jiao Licheng, Li Lingling, Liu Fang, Yang Shuyuan
- VITA: Towards Open-source Interactive Omni Multimodal LLM Chaoyou Fu, Haojia Lin, Zuwei Long, Yunhang Shen, Meng Zhao, Yifan Zhang, Shaoqi Dong, Xiong Wang, Di Yin, Long Ma, Xiawu Zheng, Ran He, Rongrong Ji, Yunsheng Wu, Caifeng Shan, Xing Sun
- Évaluation Des Capacités De Réponse De Larges Modèles De Langage (LLM) Pour Des Questions D'historiens Chartier Mathieu, Dakkoune Nabil, Bourgeois Guillaume, Jean Stéphane
- Phantom: General Trigger Attacks On Retrieval Augmented Language Generation Chaudhari Harsh, Severi Giorgio, Abascal John, Jagielski Matthew, Choquette-choo Christopher A., Nasr Milad, Nita-rotaru Cristina, Oprea Alina
- Veagle: Advancements In Multimodal Representation Learning Chawla Rajat, Datta Arkajit, Verma Tushar, Jha Adarsh, Gautam Anmol, Vatsal Ayush, Chaterjee Sukrit, Ns Mukunda, Bhola Ishaan
- DUAL-REFLECT: Enhancing Large Language Models For Reflective Translation Through Dual Learning Feedback Mechanisms Chen Andong, Lou Lianzhang, Chen Kehai, Bai Xuefeng, Xiang Yang, Yang Muyun, Zhao Tiejun, Zhang Min
- All Roads Lead To Rome: Unveiling The Trajectory Of Recommender Systems Across The LLM Era Chen Bo, Dai Xinyi, Guo Huifeng, Guo Wei, Liu Weiwen, Liu Yong, Qin Jiarui, Tang Ruiming, Wang Yichao, Wu Chuhan, Wu Yaxiong, Zhang Hao
- Self-cognition In Large Language Models: An Exploratory Study Chen Dongping, Shi Jiawen, Wan Yao, Zhou Pan, Gong Neil Zhenqiang, Sun Lichao
- Hardware-aware Parallel Prompt Decoding For Memory-efficient Acceleration Of LLM Inference Chen Hao Mark, Luk Wayne, Yiu Ka Fai Cedric, Li Rui, Mishchenko Konstantin, Venieris Stylianos I., Fan Hongxiang
- Efficiency In Focus: Layernorm As A Catalyst For Fine-tuning Medical Visual Language Pre-trained Models Chen Jiawei, Yang Dingkang, Jiang Yue, Li Mingcheng, Wei Jinjie, Hou Xiaolu, Zhang Lihua
- Control-dag: Constrained Decoding For Non-autoregressive Directed Acyclic T5 Using Weighted Finite State Automata Chen Jinghong, Lin Weizhe, Mei Jingbiao, Byrne Bill
- Automated Data Curation For Robust Language Model Fine-tuning Chen Jiuhai, Mueller Jonas
- Putting People In Llms' Shoes: Generating Better Answers Via Question Rewriter Chen Junhao, Wang Bowen, Jiang Zhouqiang, Nakashima Yuta
- Nlperturbator: Studying The Robustness Of Code Llms To Natural Language Variations Chen Junkai, Li Zhenhao, Hu Xing, Xia Xin
- Cost-effective Instruction Learning For Pathology Vision And Language Analysis Chen Kaitao, Liu Mianxin, Yan Fang, Ma Lei, Shi Xiaoming, Wang Lilong, Wang Xiaosong, Zhu Lifeng, Wang Zhe, Zhou Mu, Zhang Shaoting
- OMPGPT: A Generative Pre-trained Transformer Model For Openmp Chen Le, Bhattacharjee Arijit, Ahmed Nesreen, Hasabnis Niranjan, Oren Gal, Vo Vy, Jannesari Ali
- Identifying Query-relevant Neurons In Large Language Models For Long-form Texts Chen Lihu, Dejl Adam, Toni Francesca
- Bailong: Bilingual Transfer Learning Based On Qlora And Zip-tie Embedding Chen Lung-chuan, Li Zong-ru
- Quantifying And Mitigating Unimodal Biases In Multimodal Large Language Models: A Causal Perspective Chen Meiqi, Cao Yixin, Zhang Yan, Lu Chaochao
- Jumpcoder: Go Beyond Autoregressive Coder Via Online Modification Chen Mouxiang, Tian Hao, Liu Zhongxin, Ren Xiaoxue, Sun Jianling
- Enhancing Visual Question Answering Through Ranking-based Hybrid Training And Multimodal Fusion Chen Peiyuan, Zhang Zecheng, Dong Yiping, Zhou Li, Wang Han
- WSI-VQA: Interpreting Whole Slide Images By Generative Visual Question Answering Chen Pingyi, Zhu Chenglu, Zheng Sunyi, Li Honglin, Yang Lin
- Llm-based Multi-hop Question Answering With Knowledge Graph Integration In Evolving Environments Chen Ruirui, Jiang Weifeng, Qin Chengwei, Rawal Ishaan Singh, Tan Cheston, Choi Dongkyu, Xiong Bo, Ai Bo
- Struq: Defending Against Prompt Injection With Structured Queries Chen Sizhe, Piet Julien, Sitawarin Chawin, Wagner David
- Ternaryllm: Ternarized Large Language Model Chen Tianqi, Li Zhe, Xu Weixiang, Zhu Zeyu, Li Dong, Tian Lu, Barsoum Emad, Wang Peisong, Cheng Jian
- Octopus: On-device Language Model For Function Calling Of Software Apis Chen Wei, Li Zhiyuan, Ma Mingyuan
- GRATH: Gradual Self-truthifying For Large Language Models Chen Weixin, Song Dawn, Li Bo
- Reprompt: Planning By Automatic Prompt Engineering For Large Language Models Agents Chen Weizhe, Koenig Sven, Dilkina Bistra
- Gazexplain: Learning To Predict Natural Language Explanations Of Visual Scanpaths Chen Xianyu, Jiang Ming, Zhao Qi
- The Sifo Benchmark: Investigating The Sequential Instruction Following Ability Of Large Language Models Chen Xinyi, Liao Baohao, Qi Jirui, Eustratiadis Panagiotis, Monz Christof, Bisazza Arianna, De Rijke Maarten
- Can We Rely On LLM Agents To Draft Long-horizon Plans? Let's Take Travelplanner As An Example Chen Yanan, Pesaranghader Ali, Sadhu Tanmana, Yi Dong Hoon
- Extroversion Or Introversion? Controlling The Personality Of Your Large Language Models Chen Yanquan, Wu Zhen, Guo Junjie, Huang Shujian, Dai Xinyu
- Simulating Field Experiments With Large Language Models Chen Yaoyu, Hu Yuheng, Lu Yingda
- NACL: A General And Effective KV Cache Eviction Framework For Llms At Inference Time Chen Yilong, Wang Guoxia, Shang Junyuan, Cui Shiyao, Zhang Zhenyu, Liu Tingwen, Wang Shuohuan, Sun Yu, Yu Dianhai, Wu Hua
- Hallucination Detection: Robustly Discerning Reliable Answers In Large Language Models Chen Yuyan, Fu Qiang, Yuan Yichen, Wen Zhihao, Fan Ge, Liu Dayiheng, Zhang Dongmei, Li Zhixu, Xiao Yanghua
- Online Personalizing White-box Llms Generation With Neural Bandits Chen Zekai, Daniel Weeden, Chen Po-yu, Buet-golfouse Francois
- What Are The Essential Factors In Crafting Effective Long Context Multi-hop Instruction Datasets? Insights And Best Practices Chen Zhi, Chen Qiguang, Qin Libo, Guo Qipeng, Lv Haijun, Zou Yicheng, Che Wanxiang, Yan Hang, Chen Kai, Lin Dahua
- An Empirical Study Of Data Ability Boundary In Llms' Math Reasoning Chen Zui, Chen Yezeng, Han Jiaqi, Huang Zhijie, Qi Ji, Zhou Yi
- "in Dialogues We Learn": Towards Personalized Dialogue Without Pre-defined Profiles Through In-dialogue Learning Cheng Chuanqi, Tu Quan, Wu Wei, Shang Shuo, Mao Cunli, Yu Zhengtao, Yan Rui
- Trojanrag: Retrieval-augmented Generation Can Be Backdoor Driver In Large Language Models Cheng Pengzhou, Ding Yidong, Ju Tianjie, Wu Zongru, Du Wei, Yi Ping, Zhang Zhuosheng, Liu Gongshen
- Can AI Assistants Know What They Don't Know? Cheng Qinyuan, Sun Tianxiang, Liu Xiangyang, Zhang Wenwei, Yin Zhangyue, Li Shimin, Li Linyang, He Zhengfu, Chen Kai, Qiu Xipeng
- BISCUIT: Scaffolding Llm-generated Code With Ephemeral Uis In Computational Notebooks Cheng Ruijia, Barik Titus, Leung Alan, Hohman Fred, Nichols Jeffrey
- Call Me When Necessary: Llms Can Efficiently And Faithfully Reason Over Structured Environments Cheng Sitao, Zhuang Ziyuan, Xu Yong, Yang Fangkai, Zhang Chaoyun, Qin Xiaoting, Huang Xiang, Chen Ling, Lin Qingwei, Zhang Dongmei, Rajmohan Saravan, Zhang Qi
- Multi-prompting Decoder Helps Better Language Understanding Cheng Zifeng, Chen Zhaoling, Jiang Zhiwei, Yin Yafeng, Ge Shiping, Liu Yuliang, Gu Qing
- ANOLE: An Open, Autoregressive, Native Large Multimodal Models For Interleaved Image-text Generation Chern Ethan, Su Jiadi, Ma Yan, Liu Pengfei
- Walert: Putting Conversational Search Knowledge Into Action By Building And Evaluating A Large Language Model-powered Chatbot Cherumanal Sachin Pathiyan, Tian Lin, Abushaqra Futoon M., De Paula Angel Felipe Magnossao, Ji Kaixin, Hettiachchi Danula, Trippas Johanne R., Ali Halil, Scholer Falk, Spina Damiano
- A Reality Check Of The Benefits Of LLM In Business Cheung Ming
- Revisiting Zero-shot Abstractive Summarization In The Era Of Large Language Models From The Perspective Of Position Bias Chhabra Anshuman, Askari Hadi, Mohapatra Prasant
- Lamsum: Creating Extractive Summaries Of User Generated Content Using Llms Chhikara Garima, Sharma Anurag, Gurucharan V., Ghosh Kripabandhu, Chakraborty Abhijnan
- Chatbot Arena: An Open Platform For Evaluating Llms By Human Preference Chiang Wei-lin, Zheng Lianmin, Sheng Ying, Angelopoulos Anastasios Nikolas, Li Tianle, Li Dacheng, Zhang Hao, Zhu Banghua, Jordan Michael, Gonzalez Joseph E., Stoica Ion
- Human-centered Llm-agent User Interface: A Position Paper Chin Daniel, Wang Yuxuan, Xia Gus
- An Empirical Study Of In-context Learning In Llms For Machine Translation Chitale Pranjal A., Gala Jay, Dabre Raj
- Language-image Models With 3D Understanding Cho Jang Hyun, Ivanovic Boris, Cao Yulong, Schmerling Edward, Wang Yue, Weng Xinshuo, Li Boyi, You Yurong, Krähenbühl Philipp, Wang Yan, Pavone Marco
- Multi-granularity Guided Fusion-in-decoder Choi Eunseong, Lee Hyeri, Lee Jongwuk
- When Life Gives You Llms, Make LLM-ADE: Large Language Models With Adaptive Data Engineering Choi Stephen, Gazeley William
- Autotutor Meets Large Language Models: A Language Model Tutor With Rich Pedagogy And Guardrails Chowdhury Sankalan Pal, Zouhar Vilém, Sachan Mrinmaya
- A Better LLM Evaluator For Text Generation: The Impact Of Prompt Output Sequencing And Optimization Chu Kuanchao, Chen Yi-pei, Nakayama Hideki
- Beamaggr: Beam Aggregation Reasoning Over Multi-source Knowledge For Multi-hop Question Answering Chu Zheng, Chen Jingchang, Chen Qianglong, Wang Haotian, Zhu Kun, Du Xiyuan, Yu Weijiang, Liu Ming, Qin Bing
- Professional Agents -- Evolving Large Language Models Into Autonomous Experts With Human-level Competencies Chu Zhixuan, Wang Yan, Zhu Feng, Yu Lu, Li Longfei, Gu Jinjie
- AI Safety In Generative AI Large Language Models: A Survey Chua Jaymari, Li Yun, Yang Shiyi, Wang Chen, Yao Lina
- Crosslingual Capabilities And Knowledge Barriers In Multilingual Large Language Models Chua Lynn, Ghazi Badih, Huang Yangsibo, Kamath Pritish, Kumar Ravi, Manurangsi Pasin, Sinha Amer, Xie Chulin, Zhang Chiyuan
- Lookback Lens: Detecting And Mitigating Contextual Hallucinations In Large Language Models Using Only Attention Maps Chuang Yung-sung, Qiu Linlu, Hsieh Cheng-yu, Krishna Ranjay, Kim Yoon, Glass James
- PEFT-U: Parameter-efficient Fine-tuning For User Personalization Clarke Christopher, Heng Yuzhao, Tang Lingjia, Mars Jason
- Modulating Language Model Experiences Through Frictions Collins Katherine M., Chen Valerie, Sucholutsky Ilia, Kirk Hannah Rose, Sadek Malak, Sargeant Holli, Talwalkar Ameet, Weller Adrian, Bhatt Umang
- Ragsys: Item-cold-start Recommender As RAG System Contal Emile, Mcgoldrick Garrin
- LLASP: Fine-tuning Large Language Models For Answer Set Programming Coppolillo Erica, Calimeri Francesco, Manco Giuseppe, Perri Simona, Ricca Francesco
- Teenytinyllama: Open-source Tiny Language Models Trained In Brazilian Portuguese Corrêa Nicholas Kluge, Falk Sophia, Fatimah Shiza, Sen Aniket, De Oliveira Nythamar
- Gamebench: Evaluating Strategic Reasoning Abilities Of LLM Agents Costarelli Anthony, Allen Mat, Hauksson Roman, Sodunke Grace, Hariharan Suhas, Cheng Carlson, Li Wenjie, Clymer Joshua, Yadav Arjun
- The Power Of Noise: Redefining Retrieval For RAG Systems Cuconasu Florin, Trappolini Giovanni, Siciliano Federico, Filice Simone, Campagnano Cesare, Maarek Yoelle, Tonellotto Nicola, Silvestri Fabrizio
- A Tale Of Trust And Accuracy: Base Vs. Instruct Llms In RAG Systems Cuconasu Florin, Trappolini Giovanni, Tonellotto Nicola, Silvestri Fabrizio
- Efficiently Exploring Large Language Models For Document-level Machine Translation With In-context Learning Cui Menglong, Du Jiangcun, Zhu Shaolin, Xiong Deyi
- Insights From Benchmarking Frontier Language Models On Web App Code Generation Cui Yi
- Synthetic Dataset Creation And Fine-tuning Of Transformer Models For Question Answering In Serbian Cvetanović Aleksa, Tadić Predrag
- Getting The Most Out Of Your Tokenizer For Pre-training And Domain Adaptation Dagan Gautier, Synnaeve Gabriel, Rozière Baptiste
- MHPP: Exploring The Capabilities And Limitations Of Language Models Beyond Basic Code Generation Dai Jianbo, Lu Jianqiao, Feng Yunlong, Ruan Rongju, Cheng Ming, Tan Haochen, Guo Zhijiang
- ACT-MNMT Auto-constriction Turning For Multilingual Neural Machine Translation Dai Shaojie, Liu Xin, Luo Ping, Yu Yue
- Large Language Models Can Better Understand Knowledge Graphs Than We Thought Dai Xinbang, Hua Yuncheng, Wu Tongtong, Sheng Yang, Ji Qiu, Qi Guilin
- Generating Code World Models With Large Language Models Guided By Monte Carlo Tree Search Dainese Nicola, Merler Matteo, Alakuijala Minttu, Marttinen Pekka
- Faculty Perspectives On The Potential Of RAG In Computer Science Higher Education Dakshit Sagnik
- Inference To The Best Explanation In Large Language Models Dalal Dhairya, Valentino Marco, Freitas André, Buitelaar Paul
- A Complete Survey On Llm-based AI Chatbots Dam Sumit Kumar, Hong Choong Seon, Qiao Yu, Zhang Chaoning
- Synthetic Patient-physician Dialogue Generation From Clinical Notes Using LLM Das Trisha, Albassam Dina, Sun Jimeng
- Xmainframe: A Large Language Model For Mainframe Modernization Dau Anh T. V., Dao Hieu Trung, Nguyen Anh Tuan, Tran Hieu Trung, Nguyen Phong X., Bui Nghi D. Q.
- Aligning Model Evaluations With Human Preferences: Mitigating Token Count Bias In Language Model Assessments Daynauth Roland, Mars Jason
- Application Of GPT Language Models For Innovation In Activities In University Teaching De Buenaga Manuel, Bueno Francisco Javier
- Retail-gpt: Leveraging Retrieval Augmented Generation (RAG) For Building E-commerce Chat Assistants De Freitas Bruno Amaral Teixeira, Lotufo Roberto De Alencar
- Polyrating: A Cost-effective And Bias-aware Rating System For LLM Evaluation Dekoninck Jasper, Baader Maximilian, Vechev Martin
- Text-tuple-table: Towards Information Integration In Text-to-table Generation Via Global Tuple Extraction Deng Zheye, Chan Chunkit, Wang Weiqi, Sun Yuxi, Fan Wei, Zheng Tianshi, Yim Yauwai, Song Yangqiu
- Garak: A Framework For Security Probing Large Language Models Derczynski Leon, Galinkin Erick, Martin Jeffrey, Majumdar Subho, Inie Nanna
- Glam: Fine-tuning Large Language Models For Domain Knowledge Graph Alignment Via Neighborhood Partitioning And Generative Subgraph Encoding Dernbach Stefan, Agarwal Khushbu, Zuniga Alejandro, Henry Michael, Choudhury Sutanay
- Class-level Code Generation From Natural Language Using Iterative, Tool-enhanced Reasoning Over Repository Deshpande Ajinkya, Agarwal Anmol, Shet Shashank, Iyer Arun, Kanade Aditya, Bairi Ramakrishna, Parthasarathy Suresh
- Chain-of-translation Prompting (CoTR): A Novel Prompting Technique For Low Resource Languages Deshpande Tejas, Kowtal Nidhi, Joshi Raviraj
- Facts-and-feelings: Capturing Both Objectivity And Subjectivity In Table-to-text Generation Dey Tathagata, Bhattacharyya Pushpak
- Ranking Large Language Models Without Ground Truth Dhurandhar Amit, Nair Rahul, Singh Moninder, Daly Elizabeth, Ramamurthy Karthikeyan Natesan
- Could We Have Had Better Multilingual Llms If English Was Not The Central Language? Diandaru Ryandito, Susanto Lucky, Tang Zilu, Purwarianti Ayu, Wijaya Derry
- Learning How To Ask: Cycle-consistency Refines Prompts In Multimodal Foundation Models Diesendruck Maurice, Lin Jianzhe, Imani Shima, Mahalingam Gayathri, Xu Mingyang, Zhao Jie
- Entgpt: Linking Generative Large Language Models With Knowledge Bases Ding Yifan, Poudel Amrit, Zeng Qingkai, Weninger Tim, Veeramani Balaji, Bhattacharya Sanmitra
- Boosting Large Language Models With Socratic Method For Conversational Mathematics Teaching Ding Yuyang, Hu Hanglei, Zhou Jie, Chen Qin, Jiang Bo, He Liang
- Finding Blind Spots In Evaluator Llms With Interpretable Checklists Doddapaneni Sumanth, Khan Mohammed Safi Ur Rahman, Verma Sshubam, Khapra Mitesh M.
- Modality-aware Integration With Large Language Models For Knowledge-based Visual Question Answering Dong Junnan, Zhang Qinggang, Zhou Huachi, Zha Daochen, Zheng Pai, Huang Xiao
- QAQ: Quality Adaptive Quantization For LLM KV Cache Dong Shichen, Cheng Wen, Qin Jiayu, Wang Wei
- A Framework For Real-time Safeguarding The Text Generation Of Large Language Model Dong Ximing, Lin Dayi, Wang Shaowei, Hassan Ahmed E.
- Building Guardrails For Large Language Models Dong Yi, Mu Ronghui, Jin Gaojie, Qi Yi, Hu Jinwei, Zhao Xingyu, Meng Jie, Ruan Wenjie, Huang Xiaowei
- Unmemorization In Large Language Models Via Self-distillation And Deliberate Imagination Dong Yijiang River, Lin Hongzhou, Belkin Mikhail, Huerta Ramon, Vulić Ivan
- Attacks, Defenses And Evaluations For LLM Conversation Safety: A Survey Dong Zhichen, Zhou Zhanhui, Yang Chao, Shao Jing, Qiao Yu
- Biomedical Large Languages Models Seem Not To Be Superior To Generalist Models On Unseen Medical Data Dorfner Felix J., Dada Amin, Busch Felix, Makowski Marcus R., Han Tianyu, Truhn Daniel, Kleesiek Jens, Sushil Madhumita, Lammert Jacqueline, Adams Lisa C., Bressem Keno K.
- Learning From Students: Applying T-distributions To Explore Accurate And Efficient Formats For Llms Dotzel Jordan, Chen Yuzong, Kotb Bahaa, Prasad Sushma, Wu Gang, Li Sheng, Abdelfattah Mohamed S., Zhang Zhiru
- What's Wrong With Your Code Generated By Large Language Models? An Extensive Study Dou Shihan, Jia Haoxiang, Wu Shenxi, Zheng Huiyuan, Zhou Weikang, Wu Muling, Chai Mingxu, Fan Jessica, Huang Caishuang, Tao Yunbo, Liu Yan, Zhou Enyu, Zhang Ming, Zhou Yuhao, Wu Yueming, Zheng Rui, Wen Ming, Weng Rongxiang, Wang Jingang, Cai Xunliang, Gui Tao, Qiu Xipeng, Zhang Qi, Huang Xuanjing
- Stepcoder: Improve Code Generation With Reinforcement Learning From Compiler Feedback Dou Shihan, Liu Yan, Jia Haoxiang, Xiong Limao, Zhou Enyu, Shen Wei, Shan Junjie, Huang Caishuang, Wang Xiao, Fan Xiaoran, Xi Zhiheng, Zhou Yuhao, Ji Tao, Zheng Rui, Zhang Qi, Huang Xuanjing, Gui Tao
- Evaluating Language Model Context Windows: A "Working Memory" Test And Inference-time Correction Dsouza Amanda, Glaze Christopher, Shin Changho, Sala Frederic
- Evidence-enhanced Triplet Generation Framework For Hallucination Alleviation In Generative Question Answering Du Haowei, Zhang Huishuai, Zhao Dongyan
- Internal And External Knowledge Interactive Refinement Framework For Knowledge-intensive Question Answering Du Haowei, Zhao Dongyan
- Perltqa: A Personal Long-term Memory Dataset For Memory Classification, Retrieval, And Synthesis In Question Answering Du Yiming, Wang Hongru, Zhao Zhengyi, Liang Bin, Wang Baojun, Zhong Wanjun, Wang Zezhong, Wong Kam-fai
- CLEFT: Language-image Contrastive Learning With Efficient Large Language Model And Prompt Fine-tuning Du Yuexi, Chang Brian, Dvornek Nicha C.
- Applying RLAIF For Code Generation With Api-usage In Lightweight Llms Dutta Sujan, Mahinder Sayantan, Anantha Raviteja, Bandyopadhyay Bortik
- AXOLOTL: Fairness Through Assisted Self-debiasing Of Large Language Model Outputs Ebrahimi Sana, Chen Kaiwen, Asudeh Abolfazl, Das Gautam, Koudas Nick
- CROME: Cross-modal Adapters For Efficient Multimodal LLM Ebrahimi Sayna, Arik Sercan O., Nama Tejas, Pfister Tomas
- Dialogue You Can Trust: Human And AI Perspectives On Generated Conversations Ebubechukwu Ike, Takeuchi Johane, Ceravola Antonello, Joublin Frank
- Relation Extraction With Fine-tuned Large Language Models In Retrieval Augmented Generation Frameworks Efeoglu Sefika, Paschke Adrian
- Reducing Selection Bias In Large Language Models Eicher J. E., Irgolič R. F.
- Conversational Prompt Engineering Ein-dor Liat, Toledo-ronen Orith, Spector Artem, Gretz Shai, Dankin Lena, Halfon Alon, Katz Yoav, Slonim Noam
- Arabic Automatic Story Generation With Large Language Models El-shangiti Ahmed Oumar, Alwajih Fakhraddin, Abdul-mageed Muhammad
- Creating Arabic LLM Prompts At Scale El-sheikh Abdelrahman, Elmogtaba Ahmed, Darwish Kareem, Elmallah Muhammad, Elneima Ashraf, Sawaf Hassan
- Persuasiveness Of Generated Free-text Rationales In Subjective Decisions: A Case Study On Pairwise Argument Ranking Elaraby Mohamed, Litman Diane, Li Xiang Lorraine, Magooda Ahmed
- Ocassionally Secure: A Comparative Analysis Of Code Generation Assistants Elgedawy Ran, Sadik John, Dutta Senjuti, Gautam Anuj, Georgiou Konstantinos, Gholamrezae Farzin, Ji Fujiao, Lim Kyungchan, Liu Qian, Ruoti Scott
- Layerskip: Enabling Early Exit Inference And Self-speculative Decoding Elhoushi Mostafa, Shrivastava Akshat, Liskovich Diana, Hosmer Basil, Wasti Bram, Lai Liangzhen, Mahmoud Anas, Acun Bilge, Agarwal Saurabh, Roman Ahmed, Aly Ahmed A, Chen Beidi, Wu Carole-jean
- M2QA: Multi-domain Multilingual Question Answering Engländer Leon, Sterz Hannah, Poth Clifton, Pfeiffer Jonas, Kuznetsov Ilia, Gurevych Iryna
- From LLM To NMT: Advancing Low-resource Machine Translation With Claude Enis Maxim, Hopkins Mark
- Kapqa: Knowledge-augmented Product Question-answering Eppalapally Swetha, Dangi Daksh, Bhat Chaithra, Gupta Ankita, Zhang Ruiyi, Agarwal Shubham, Bagga Karishma, Yoon Seunghyun, Lipka Nedim, Rossi Ryan A., Dernoncourt Franck
- Navigating Uncertainty: Optimizing API Dependency For Hallucination Reduction In Closed-book Question Answering Erbacher Pierre, Falissar Louis, Guigue Vincent, Soulier Laure
- Low-cost Language Models: Survey And Performance Evaluation On Python Code Generation Espejel Jessica López, Alassan Mahaman Sanoussi Yahaya, Bouhandi Merieme, Dahhane Walid, Ettifouri El Hassane
- HLAT: High-quality Large Language Model Pre-trained On AWS Trainium Fan Haozheng, Zhou Hao, Huang Guangtai, Raman Parameswaran, Fu Xinwei, Gupta Gaurav, Ram Dhananjay, Wang Yida, Huan Jun
- Fedmkt: Federated Mutual Knowledge Transfer For Large And Small Language Models Fan Tao, Ma Guoqiang, Kang Yan, Gu Hanlin, Song Yuanfeng, Fan Lixin, Chen Kai, Yang Qiang
- A Survey On RAG Meeting Llms: Towards Retrieval-augmented Large Language Models Fan Wenqi, Ding Yujuan, Ning Liangbo, Wang Shijie, Li Hengyun, Yin Dawei, Chua Tat-seng, Li Qing
- Biasalert: A Plug-and-play Tool For Social Bias Detection In Llms Fan Zhiting, Chen Ruizhe, Xu Ruiling, Liu Zuozhu
- DARA: Decomposition-alignment-reasoning Autonomous Language Agent For Question Answering Over Knowledge Graphs Fang Haishuo, Zhu Xiaodan, Gurevych Iryna
- TRACE The Evidence: Constructing Knowledge-grounded Reasoning Chains For Retrieval-augmented Generation Fang Jinyuan, Meng Zaiqiao, Macdonald Craig
- Large Language Models Are Neurosymbolic Reasoners Fang Meng, Deng Shilong, Zhang Yudi, Shi Zijing, Chen Ling, Pechenizkiy Mykola, Wang Jun
- Unraveling The Dominance Of Large Language Models Over Transformer Models For Bangla Natural Language Inference: A Comprehensive Study Faria Fatema Tuj Johora, Moin Mukaffi Bin, Fahim Asif Iftekher, Debnath Pronay, Shah Faisal Muhammad
- Integrating A.I. In Higher Education: Protocol For A Pilot Study With 'SAMCares: An Adaptive Learning Hub' Faruqui Syed Hasib Akhter, Tasnim Nazia, Basith Iftekhar Ibne, Obeidat Suleiman, Yildiz Faruk
- Porting Large Language Models To Mobile Devices For Question Answering Fassold Hannes
- Gensco: Can Question Decomposition Based Passage Alignment Improve Question Answering? Fazili Barah, Goswami Koustava, Modani Natwar, Nair Inderjeet
- Ragged Edges: The Double-edged Sword Of Retrieval-augmented Chatbots Feldman Philip, Foulds James R., Pan Shimei
- Legend: Leveraging Representation Engineering To Annotate Safety Margin For Preference Datasets Feng Duanyu, Qin Bowen, Huang Chen, Huang Youcheng, Zhang Zheng, Lei Wenqiang
- Optimus: Accelerating Large-scale Multi-modal LLM Training By Bubble Exploitation Feng Weiqi, Chen Yangrui, Wang Shaoyu, Peng Yanghua, Lin Haibin, Yu Minlan
- Mt-ladder: A Model-agnostic Framework Boosting Llm-based Machine Translation To The Next Level Feng Zhaopeng, Zhang Yan, Chen Ruizhe, Meng Zijie, Liu Zuozhu
- Tear: Improving Llm-based Machine Translation With Systematic Self-refinement Feng Zhaopeng, Zhang Yan, Li Hao, Wu Bei, Liao Jiayu, Liu Wenqiang, Lang Jun, Feng Yang, Wu Jian, Liu Zuozhu
- Unveiling And Manipulating Prompt Influence In Large Language Models Feng Zijian, Zhou Hanzhang, Zhu Zixiao, Qian Junlang, Mao Kezhi
- Towards Trustworthy AI: A Review Of Ethical And Robust Large Language Models Ferdaus Md Meftahul, Abdelguerfi Mahdi, Ioup Elias, Niles Kendall N., Pathak Ken, Sloan Steven
- Syllabusqa: A Course Logistics Question Answering Dataset Fernandez Nigel, Scarlatos Alexander, Lan Andrew
- Generative AI And Large Language Models For Cyber Security: All Insights You Need Ferrag Mohamed Amine, Alwahedi Fatima, Battah Ammar, Cherif Bilel, Mechri Abdechakour, Tihanyi Norbert
- Exaranker-open: Synthetic Explanation For IR Using Open-source Llms Ferraretto Fernando, Laitz Thiago, Lotufo Roberto, Nogueira Rodrigo
- The Chronicles Of RAG: The Retriever, The Chunk And The Generator Finardi Paulo, Avila Leonardo, Castaldoni Rodrigo, Gengo Pedro, Larcher Celio, Piau Marcos, Costa Pablo, Caridá Vinicius
- Introducing The Newspalm MBR And QE Dataset: Llm-generated High-quality Parallel Data Outperforms Traditional Web-crawled Data Finkelstein Mara, Vilar David, Freitag Markus
- Grillbot In Practice: Lessons And Tradeoffs Deploying Large Language Models For Adaptable Conversational Task Assistants Fischer Sophie, Gemmell Carlos, Tecklenburg Niklas, Mackie Iain, Rossetto Federico, Dalton Jeffrey
- RAG Foundry: A Framework For Enhancing Llms For Retrieval Augmented Generation Fleischer Daniel, Berchansky Moshe, Wasserblat Moshe, Izsak Peter
- Re-adaptir: Improving Information Retrieval Through Reverse Engineered Adaptation Fleshman William, Van Durme Benjamin
- Contrastive Policy Gradient: Aligning Llms On Sequence-level Scores In A Supervised-friendly Fashion Flet-berliac Yannis, Grinsztajn Nathan, Strub Florian, Choi Eugene, Cremer Chris, Ahmadian Arash, Chandak Yash, Azar Mohammad Gheshlaghi, Pietquin Olivier, Geist Matthieu
- An Application Of Large Language Models To Coding Negotiation Transcripts Friedman Ray, Cho Jaewoo, Brett Jeanne, Zhan Xuhui, Han Ningyu, Kannan Sriram, Ma Yingxiang, Spencer-smith Jesse, Jäckel Elisabeth, Zerres Alfred, Hooper Madison, Babbit Katie, Acharya Manish, Adair Wendi, Aslani Soroush, Aykaç Tayfun, Bauman Chris, Bennett Rebecca, Brady Garrett, Briggs Peggy, Dowie Cheryl, Eck Chase, Geiger Igmar, Jacob Frank, Kern Molly, Lee Sujin, Liu Leigh Anne, Liu Wu, Loewenstein Jeffrey, Lytle Anne, Ma Li, Mann Michel, Mislin Alexandra, Mitchell Tyree, Nagler Hannah Martensen Née, Nandkeolyar Amit, Olekalns Mara, Paliakova Elena, Parlamis Jennifer, Pierce Jason, Pierce Nancy, Pinkley Robin, Prime Nathalie, Ramirez-marin Jimena, Rockmann Kevin, Ross William, Semnani-azad Zhaleh, Schroeder Juliana, Smith Philip, Stimmer Elena, Swaab Roderick, Thompson Leigh, Tinsley Cathy, Tuncel Ece, Weingart Laurie, Wilken Robert, Yao Jingjing, Zhang Zhi-xue
- Scene-llm: Extending Language Model For 3D Visual Understanding And Reasoning Fu Rao, Liu Jingyu, Chen Xilun, Nie Yixin, Xiong Wenhan
- Tiny Titans: Can Smaller Large Language Models Punch Above Their Weight In The Real World For Meeting Summarization? Fu Xue-yong, Laskar Md Tahmid Rahman, Khasanova Elena, Chen Cheng, Tn Shashi Bhushan
- Cross-task Defense: Instruction-tuning Llms For Content Safety Fu Yu, Xiao Wen, Chen Jia, Li Jiachen, Papalexakis Evangelos, Chien Aichi, Dong Yue
- Continual Pre-training For Cross-lingual LLM Adaptation: Enhancing Japanese Language Capabilities Fujii Kazuki, Nakamura Taishi, Loem Mengsay, Iida Hiroki, Ohi Masanari, Hattori Kakeru, Shota Hirai, Mizuki Sakae, Yokota Rio, Okazaki Naoaki
- Thinking Fair And Slow: On The Efficacy Of Structured Prompts For Debiasing Language Models Furniturewala Shaz, Jandial Surgan, Java Abhinav, Banerjee Pragyan, Shahid Simra, Bhatia Sumit, Jaidka Kokil
- Zero-shot Persuasive Chatbots With Llm-generated Strategies And Information Retrieval Furumai Kazuaki, Legaspi Roberto, Vizcarra Julio, Yamazaki Yudai, Nishimura Yasutaka, Semnani Sina J., Ikeda Kazushi, Shi Weiyan, Lam Monica S.
- Large Language Models And Games: A Survey And Roadmap Gallotta Roberto, Todd Graham, Zammit Marvin, Earle Sam, Liapis Antonios, Togelius Julian, Yannakakis Georgios N.
- Application Of LLM Agents In Recruitment: A Novel Framework For Resume Screening Gan Chengguang, Zhang Qinghao, Mori Tatsunori
- Context-augmented Retrieval: A Novel Framework For Fast Information Retrieval Based Response Generation Using Large Language Model Ganesh Sai, Purwar Anupam, B Gautam
- The Best Of Both Worlds: Toward An Honest And Helpful Large Language Model Gao Chujie, Zhang Qihui, Chen Dongping, Huang Yue, Wu Siyuan, Fu Zhengyan, Wan Yao, Zhang Xiangliang, Sun Lichao
- Aligning LLM Agents By Learning Latent Preference From User Edits Gao Ge, Taymanov Alexey, Salinas Eduardo, Mineiro Paul, Misra Dipendra
- Selfcp: Compressing Over-limit Prompt Via The Frozen Large Language Model Itself Gao Jun, Cao Ziqiang, Li Wenjie
- Harnessing The Intrinsic Knowledge Of Pretrained Language Models For Challenging Text Classification Settings Gao Lingyu
- Tc-llava: Rethinking The Transfer From Image To Video Understanding With Temporal Considerations Gao Mingze, Liu Jingyu, Li Mingda, Xie Jiangtao, Liu Qingbin, Zhao Bo, Chen Xi, Xiong Hui
- Autovcoder: A Systematic Framework For Automated Verilog Code Generation Using Llms Gao Mingzhe, Zhao Jieru, Lin Zhe, Ding Wenchao, Hou Xiaofeng, Feng Yu, Li Chao, Guo Minyi
- Meta Reasoning For Large Language Models Gao Peizhong, Xie Ao, Mao Shaoguang, Wu Wenshan, Xia Yan, Mi Haipeng, Wei Furu
- Improving Language Understanding From Screenshots Gao Tianyu, Wang Zirui, Bhaskar Adithya, Chen Danqi
- Customizing Language Model Responses With Contrastive In-context Learning Gao Xiang, Das Kamalika
- Dr3: Ask Large Language Models Not To Give Off-topic Answers In Open Domain Multi-hop Question Answering Gao Yuan, Zhu Yiheng, Cao Yuanbin, Zhou Yinzhi, Wu Zhen, Chen Yujie, Wu Shenglan, Hu Haoyuan, Dai Xinyu
- A Review On The Use Of Large Language Models As Virtual Tutors García-méndez Silvia, De Arriba-pérez Francisco, Somoza-lópez María Del Carmen
- Cocobo: Exploring Large Language Models As The Engine For End-user Robot Programming Ge Yate, Dai Yi, Shan Run, Li Kechun, Hu Yuanda, Sun Xiaohua
- Does Object Grounding Really Reduce Hallucination Of Large Vision-language Models? Geigle Gregor, Timofte Radu, Glavaš Goran
- Gemini 1.5: Unlocking Multimodal Understanding Across Millions Of Tokens Of Context Gemini Team, Georgiev Petko, Lei Ving Ian, Burnell Ryan, Bai Libin, Gulati Anmol, Tanzer Garrett, Vincent Damien, Pan Zhufeng, Wang Shibo, Mariooryad Soroosh, Ding Yifan, Geng Xinyang, Alcober Fred, Frostig Roy, Omernick Mark, Walker Lexi, Paduraru Cosmin, Sorokin Christina, Tacchetti Andrea, Gaffney Colin, Daruki Samira, Sercinoglu Olcan, Gleicher Zach, Love Juliette, Voigtlaender Paul, Jain Rohan, Surita Gabriela, Mohamed Kareem, Blevins Rory, Ahn Junwhan, Zhu Tao, Kawintiranon Kornraphop, Firat Orhan, Gu Yiming, Zhang Yujing, Rahtz Matthew, Faruqui Manaal, Clay Natalie, Gilmer Justin, Co-reyes Jd, Penchev Ivo, Zhu Rui, Morioka Nobuyuki, Hui Kevin, Haridasan Krishna, Campos Victor, Mahdieh Mahdis, Guo Mandy, Hassan Samer, Kilgour Kevin, Vezer Arpi, Cheng Heng-tze, De Liedekerke Raoul, Goyal Siddharth, Barham Paul, Strouse Dj, Noury Seb, Adler Jonas, Sundararajan Mukund, Vikram Sharad, Lepikhin Dmitry, Paganini Michela, Garcia Xavier, Yang Fan, Valter Dasha, Trebacz Maja, Vodrahalli Kiran, Asawaroengchai Chulayuth, Ring Roman, Kalb Norbert, Soares Livio Baldini, Brahma Siddhartha, Steiner David, Yu Tianhe, Mentzer Fabian, He Antoine, Gonzalez Lucas, Xu Bibo, Kaufman Raphael Lopez, Shafey Laurent El, Oh Junhyuk, Hennigan Tom, Driessche George Van Den, Odoom Seth, Lucic Mario, Roelofs Becca, Lall Sid, Marathe Amit, Chan Betty, Ontanon Santiago, He Luheng, Teplyashin Denis, Lai Jonathan, Crone Phil, Damoc Bogdan, Ho Lewis, Riedel Sebastian, Lenc Karel, Yeh Chih-kuan, Chowdhery Aakanksha, Xu Yang, Kazemi Mehran, Amid Ehsan, Petrushkina Anastasia, Swersky Kevin, Khodaei Ali, Chen Gowoon, Larkin Chris, Pinto Mario, Yan Geng, Badia Adria Puigdomenech, Patil Piyush, Hansen Steven, Orr Dave, Arnold Sebastien M. 
R., Grimstad Jordan, Dai Andrew, Douglas Sholto, Sinha Rishika, Yadav Vikas, Chen Xi, Gribovskaya Elena, Austin Jacob, Zhao Jeffrey, Patel Kaushal, Komarek Paul, Austin Sophia, Borgeaud Sebastian, Friso Linda, Goyal Abhimanyu, Caine Ben, Cao Kris, Chung Da-woon, Lamm Matthew, Barth-maron Gabe, Kagohara Thais, Olszewska Kate, Chen Mia, Shivakumar Kaushik, Agarwal Rishabh, Godhia Harshal, Rajwar Ravi, Snaider Javier, Dotiwalla Xerxes, Liu Yuan, Barua Aditya, Ungureanu Victor, Zhang Yuan, Batsaikhan Bat-orgil, Wirth Mateo, Qin James, Danihelka Ivo, Doshi Tulsee, Chadwick Martin, Chen Jilin, Jain Sanil, Le Quoc, Kar Arjun, Gurumurthy Madhu, Li Cheng, Sang Ruoxin, Liu Fangyu, Lamprou Lampros, Munoz Rich, Lintz Nathan, Mehta Harsh, Howard Heidi, Reynolds Malcolm, Aroyo Lora, Wang Quan, Blanco Lorenzo, Cassirer Albin, Griffith Jordan, Das Dipanjan, Lee Stephan, Sygnowski Jakub, Fisher Zach, Besley James, Powell Richard, Ahmed Zafarali, Paulus Dominik, Reitter David, Borsos Zalan, Joshi Rishabh, Pope Aedan, Hand Steven, Selo Vittorio, Jain Vihan, Sethi Nikhil, Goel Megha, Makino Takaki, May Rhys, Yang Zhen, Schalkwyk Johan, Butterfield Christina, Hauth Anja, Goldin Alex, Hawkins Will, Senter Evan, Brin Sergey, Woodman Oliver, Ritter Marvin, Noland Eric, Giang Minh, Bolina Vijay, Lee Lisa, Blyth Tim, Mackinnon Ian, Reid Machel, Sarvana Obaid, Silver David, Chen Alexander, Wang Lily, Maggiore Loren, Chang Oscar, Attaluri Nithya, Thornton Gregory, Chiu Chung-cheng, Bunyan Oskar, Levine Nir, Chung Timothy, Eltyshev Evgenii, Si Xiance, Lillicrap Timothy, Brady Demetra, Aggarwal Vaibhav, Wu Boxi, Xu Yuanzhong, Mcilroy Ross, Badola Kartikeya, Sandhu Paramjit, Moreira Erica, Stokowiec Wojciech, Hemsley Ross, Li Dong, Tudor Alex, Shyam Pranav, Rahimtoroghi Elahe, Haykal Salem, Sprechmann Pablo, Zhou Xiang, Mincu Diana, Li Yujia, Addanki Ravi, Krishna Kalpesh, Wu Xiao, Frechette Alexandre, Eyal Matan, Dafoe Allan, Lacey Dave, Whang Jay, Avrahami Thi, Zhang Ye, Taropa Emanuel, Lin 
Hanzhao, Toyama Daniel, Rutherford Eliza, Sano Motoki, Choe Hyunjeong, Tomala Alex, Safranek-shrader Chalence, Kassner Nora, Pajarskas Mantas, Harvey Matt, Sechrist Sean, Fortunato Meire, Lyu Christina, Elsayed Gamaleldin, Kuang Chenkai, Lottes James, Chu Eric, Jia Chao, Chen Chih-wei, Humphreys Peter, Baumli Kate, Tao Connie, Samuel Rajkumar, Santos Cicero Nogueira Dos, Andreassen Anders, Rakićević Nemanja, Grewe Dominik, Kumar Aviral, Winkler Stephanie, Caton Jonathan, Brock Andrew, Dalmia Sid, Sheahan Hannah, Barr Iain, Miao Yingjie, Natsev Paul, Devlin Jacob, Behbahani Feryal, Prost Flavien, Sun Yanhua, Myaskovsky Artiom, Pillai Thanumalayan Sankaranarayana, Hurt Dan, Lazaridou Angeliki, Xiong Xi, Zheng Ce, Pardo Fabio, Li Xiaowei, Horgan Dan, Stanton Joe, Ambar Moran, Xia Fei, Lince Alejandro, Wang Mingqiu, Mustafa Basil, Webson Albert, Lee Hyo, Anil Rohan, Wicke Martin, Dozat Timothy, Sinha Abhishek, Piqueras Enrique, Dabir Elahe, Upadhyay Shyam, Boral Anudhyan, Hendricks Lisa Anne, Fry Corey, Djolonga Josip, Su Yi, Walker Jake, Labanowski Jane, Huang Ronny, Misra Vedant, Chen Jeremy, Skerry-ryan Rj, Singh Avi, Rijhwani Shruti, Yu Dian, Castro-ros Alex, Changpinyo Beer, Datta Romina, Bagri Sumit, Hrafnkelsson Arnar Mar, Maggioni Marcello, Zheng Daniel, Sulsky Yury, Hou Shaobo, Paine Tom Le, Yang Antoine, Riesa Jason, Rogozinska Dominika, Marcus Dror, Badawy Dalia El, Zhang Qiao, Wang Luyu, Miller Helen, Greer Jeremy, Sjos Lars Lowe, Nova Azade, Zen Heiga, Chaabouni Rahma, Rosca Mihaela, Jiang Jiepu, Chen Charlie, Liu Ruibo, Sainath Tara, Krikun Maxim, Polozov Alex, Lespiau Jean-baptiste, Newlan Josh, Cankara Zeyncep, Kwak Soo, Xu Yunhan, Chen Phil, Coenen Andy, Meyer Clemens, Tsihlas Katerina, Ma Ada, Gottweis Juraj, Xing Jinwei, Gu Chenjie, Miao Jin, Frank Christian, Cankara Zeynep, Ganapathy Sanjay, Dasgupta Ishita, Hughes-fitt Steph, Chen Heng, Reid David, Rong Keran, Fan Hongmin, Van Amersfoort Joost, Zhuang Vincent, Cohen Aaron, Gu Shixiang Shane, 
Mohananey Anhad, Ilic Anastasija, Tobin Taylor, Wieting John, Bortsova Anna, Thacker Phoebe, Wang Emma, Caveness Emily, Chiu Justin, Sezener Eren, Kaskasoli Alex, Baker Steven, Millican Katie, Elhawaty Mohamed, Aisopos Kostas, Lebsack Carl, Byrd Nathan, Dai Hanjun, Jia Wenhao, Wiethoff Matthew, Davoodi Elnaz, Weston Albert, Yagati Lakshman, Ahuja Arun, Gao Isabel, Pundak Golan, Zhang Susan, Azzam Michael, Sim Khe Chai, Caelles Sergi, Keeling James, Sharma Abhanshu, Swing Andy, Li Yaguang, Liu Chenxi, Bostock Carrie Grimes, Bansal Yamini, Nado Zachary, Anand Ankesh, Lipschultz Josh, Karmarkar Abhijit, Proleev Lev, Ittycheriah Abe, Yeganeh Soheil Hassas, Polovets George, Faust Aleksandra, Sun Jiao, Rrustemi Alban, Li Pen, Shivanna Rakesh, Liu Jeremiah, Welty Chris, Lebron Federico, Baddepudi Anirudh, Krause Sebastian, Parisotto Emilio, Soricut Radu, Xu Zheng, Bloxwich Dawn, Johnson Melvin, Neyshabur Behnam, Mao-jones Justin, Wang Renshen, Ramasesh Vinay, Abbas Zaheer, Guez Arthur, Segal Constant, Nguyen Duc Dung, Svensson James, Hou Le, York Sarah, Milan Kieran, Bridgers Sophie, Gworek Wiktor, Tagliasacchi Marco, Lee-thorp James, Chang Michael, Guseynov Alexey, Hartman Ale Jakse, Kwong Michael, Zhao Ruizhe, Kashem Sheleem, Cole Elizabeth, Miech Antoine, Tanburn Richard, Phuong Mary, Pavetic Filip, Cevey Sebastien, Comanescu Ramona, Ives Richard, Yang Sherry, Du Cosmo, Li Bo, Zhang Zizhao, Iinuma Mariko, Hu Clara Huiyi, Roy Aurko, Bijwadia Shaan, Zhu Zhenkai, Martins Danilo, Saputro Rachel, Gergely Anita, Zheng Steven, Jia Dawei, Antonoglou Ioannis, Sadovsky Adam, Gu Shane, Bi Yingying, Andreev Alek, Samangooei Sina, Khan Mina, Kocisky Tomas, Filos Angelos, Kumar Chintu, Bishop Colton, Yu Adams, Hodkinson Sarah, Mittal Sid, Shah Premal, Moufarek Alexandre, Cheng Yong, Bloniarz Adam, Lee Jaehoon, Pejman Pedram, Michel Paul, Spencer Stephen, Feinberg Vladimir, Xiong Xuehan, Savinov Nikolay, Smith Charlotte, Shakeri Siamak, Tran Dustin, Chesus Mary, Bohnet Bernd, Tucker 
George, Von Glehn Tamara, Muir Carrie, Mao Yiran, Kazawa Hideto, Slone Ambrose, Soparkar Kedar, Shrivastava Disha, Cobon-kerr James, Sharman Michael, Pavagadhi Jay, Araya Carlos, Misiunas Karolis, Ghelani Nimesh, Laskin Michael, Barker David, Li Qiujia, Briukhov Anton, Houlsby Neil, Glaese Mia, Lakshminarayanan Balaji, Schucher Nathan, Tang Yunhao, Collins Eli, Lim Hyeontaek, Feng Fangxiaoyu, Recasens Adria, Lai Guangda, Magni Alberto, De Cao Nicola, Siddhant Aditya, Ashwood Zoe, Orbay Jordi, Dehghani Mostafa, Brennan Jenny, He Yifan, Xu Kelvin, Gao Yang, Saroufim Carl, Molloy James, Wu Xinyi, Arnold Seb, Chang Solomon, Schrittwieser Julian, Buchatskaya Elena, Radpour Soroush, Polacek Martin, Giordano Skye, Bapna Ankur, Tokumine Simon, Hellendoorn Vincent, Sottiaux Thibault, Cogan Sarah, Severyn Aliaksei, Saleh Mohammad, Thakoor Shantanu, Shefey Laurent, Qiao Siyuan, Gaba Meenu, Chang Shuo-yiin, Swanson Craig, Zhang Biao, Lee Benjamin, Rubenstein Paul Kishan, Song Gan, Kwiatkowski Tom, Koop Anna, Kannan Ajay, Kao David, Schuh Parker, Stjerngren Axel, Ghiasi Golnaz, Gibson Gena, Vilnis Luke, Yuan Ye, Ferreira Felipe Tiengo, Kamath Aishwarya, Klimenko Ted, Franko Ken, Xiao Kefan, Bhattacharya Indro, Patel Miteyan, Wang Rui, Morris Alex, Strudel Robin, Sharma Vivek, Choy Peter, Hashemi Sayed Hadi, Landon Jessica, Finkelstein Mara, Jhakra Priya, Frye Justin, Barnes Megan, Mauger Matthew, Daun Dennis, Baatarsukh Khuslen, Tung Matthew, Farhan Wael, Michalewski Henryk, Viola Fabio, Quitry Felix De Chaumont, Lan Charline Le, Hudson Tom, Wang Qingze, Fischer Felix, Zheng Ivy, White Elspeth, Dragan Anca, Alayrac Jean-baptiste, Ni Eric, Pritzel Alexander, Iwanicki Adam, Isard Michael, Bulanova Anna, Zilka Lukas, Dyer Ethan, Sachan Devendra, Srinivasan Srivatsan, Muckenhirn Hannah, Cai Honglong, Mandhane Amol, Tariq Mukarram, Rae Jack W., Wang Gary, Ayoub Kareem, Fitzgerald Nicholas, Zhao Yao, Han Woohyun, Alberti Chris, Garrette Dan, Krishnakumar Kashyap, Gimenez Mai, 
Levskaya Anselm, Sohn Daniel, Matak Josip, Iturrate Inaki, Chang Michael B., Xiang Jackie, Cao Yuan, Ranka Nishant, Brown Geoff, Hutter Adrian, Mirrokni Vahab, Chen Nanxin, Yao Kaisheng, Egyed Zoltan, Galilee Francois, Liechty Tyler, Kallakuri Praveen, Palmer Evan, Ghemawat Sanjay, Liu Jasmine, Tao David, Thornton Chloe, Green Tim, Jasarevic Mimi, Lin Sharon, Cotruta Victor, Tan Yi-xuan, Fiedel Noah, Yu Hongkun, Chi Ed, Neitz Alexander, Heitkaemper Jens, Sinha Anu, Zhou Denny, Sun Yi, Kaed Charbel, Hulse Brice, Mishra Swaroop, Georgaki Maria, Kudugunta Sneha, Farabet Clement, Shafran Izhak, Vlasic Daniel, Tsitsulin Anton, Ananthanarayanan Rajagopal, Carin Alen, Su Guolong, Sun Pei, V Shashank, Carvajal Gabriel, Broder Josef, Comsa Iulia, Repina Alena, Wong William, Chen Warren Weilun, Hawkins Peter, Filonov Egor, Loher Lucia, Hirnschall Christoph, Wang Weiyi, Ye Jingchen, Burns Andrea, Cate Hardie, Wright Diana Gage, Piccinini Federico, Zhang Lei, Lin Chu-cheng, Gog Ionel, Kulizhskaya Yana, Sreevatsa Ashwin, Song Shuang, Cobo Luis C., Iyer Anand, Tekur Chetan, Garrido Guillermo, Xiao Zhuyun, Kemp Rupert, Zheng Huaixiu Steven, Li Hui, Agarwal Ananth, Ngani Christel, Goshvadi Kati, Santamaria-fernandez Rebeca, Fica Wojciech, Chen Xinyun, Gorgolewski Chris, Sun Sean, Garg Roopal, Ye Xinyu, Eslami S. M. 
Ali, Hua Nan, Simon Jon, Joshi Pratik, Kim Yelin, Tenney Ian, Potluri Sahitya, Thiet Lam Nguyen, Yuan Quan, Luisier Florian, Chronopoulou Alexandra, Scellato Salvatore, Srinivasan Praveen, Chen Minmin, Koverkathu Vinod, Dalibard Valentin, Xu Yaming, Saeta Brennan, Anderson Keith, Sellam Thibault, Fernando Nick, Huot Fantine, Jung Junehyuk, Varadarajan Mani, Quinn Michael, Raul Amit, Le Maigo, Habalov Ruslan, Clark Jon, Jalan Komal, Bullard Kalesha, Singhal Achintya, Luong Thang, Wang Boyu, Rajayogam Sujeevan, Eisenschlos Julian, Jia Johnson, Finchelstein Daniel, Yakubovich Alex, Balle Daniel, Fink Michael, Agarwal Sameer, Li Jing, Dvijotham Dj, Pal Shalini, Kang Kai, Konzelmann Jaclyn, Beattie Jennifer, Dousse Olivier, Wu Diane, Crocker Remi, Elkind Chen, Jonnalagadda Siddhartha Reddy, Lee Jong, Holtmann-rice Dan, Kallarackal Krystal, Liu Rosanne, Vnukov Denis, Vats Neera, Invernizzi Luca, Jafari Mohsen, Zhou Huanjie, Taylor Lilly, Prendki Jennifer, Wu Marcus, Eccles Tom, Liu Tianqi, Kopparapu Kavya, Beaufays Francoise, Angermueller Christof, Marzoca Andreea, Sarcar Shourya, Dib Hilal, Stanway Jeff, Perbet Frank, Trdin Nejc, Sterneck Rachel, Khorlin Andrey, Li Dinghua, Wu Xihui, Goenka Sonam, Madras David, Goldshtein Sasha, Gierke Willi, Zhou Tong, Liu Yaxin, Liang Yannie, White Anais, Li Yunjie, Singh Shreya, Bahargam Sanaz, Epstein Mark, Basu Sujoy, Lao Li, Ozturel Adnan, Crous Carl, Zhai Alex, Lu Han, Tung Zora, Gaur Neeraj, Walton Alanna, Dixon Lucas, Zhang Ming, Globerson Amir, Uy Grant, Bolt Andrew, Wiles Olivia, Nasr Milad, Shumailov Ilia, Selvi Marco, Piccinno Francesco, Aguilar Ricardo, Mccarthy Sara, Khalman Misha, Shukla Mrinal, Galic Vlado, Carpenter John, Villela Kevin, Zhang Haibin, Richardson Harry, Martens James, Bosnjak Matko, Belle Shreyas Rammohan, Seibert Jeff, Alnahlawi Mahmoud, Mcwilliams Brian, Singh Sankalp, Louis Annie, Ding Wen, Popovici Dan, Simicich Lenin, Knight Laura, Mehta Pulkit, Gupta Nishesh, Shi Chongyang, Fatehi Saaber, Mitrovic 
Jovana, Grills Alex, Pagadora Joseph, Petrova Dessie, Eisenbud Danielle, Zhang Zhishuai, Yates Damion, Mittal Bhavishya, Tripuraneni Nilesh, Assael Yannis, Brovelli Thomas, Jain Prateek, Velimirovic Mihajlo, Akbulut Canfer, Mu Jiaqi, Macherey Wolfgang, Kumar Ravin, Xu Jun, Qureshi Haroon, Comanici Gheorghe, Wiesner Jeremy, Gong Zhitao, Ruddock Anton, Bauer Matthias, Felt Nick, Gp Anirudh, Arnab Anurag, Zelle Dustin, Rothfuss Jonas, Rosgen Bill, Shenoy Ashish, Seybold Bryan, Li Xinjian, Mudigonda Jayaram, Erdogan Goker, Xia Jiawei, Simsa Jiri, Michi Andrea, Yao Yi, Yew Christopher, Kan Steven, Caswell Isaac, Radebaugh Carey, Elisseeff Andre, Valenzuela Pedro, Mckinney Kay, Paterson Kim, Cui Albert, Latorre-chimoto Eri, Kim Solomon, Zeng William, Durden Ken, Ponnapalli Priya, Sosea Tiberiu, Choquette-choo Christopher A., Manyika James, Robenek Brona, Vashisht Harsha, Pereira Sebastien, Lam Hoi, Velic Marko, Owusu-afriyie Denese, Lee Katherine, Bolukbasi Tolga, Parrish Alicia, Lu Shawn, Park Jane, Venkatraman Balaji, Talbert Alice, Rosique Lambert, Cheng Yuchung, Sozanschi Andrei, Paszke Adam, Kumar Praveen, Austin Jessica, Li Lu, Salama Khalid, Kim Wooyeol, Dukkipati Nandita, Baryshnikov Anthony, Kaplanis Christos, Sheng Xianghai, Chervonyi Yuri, Unlu Caglar, Casas Diego De Las, Askham Harry, Tunyasuvunakool Kathryn, Gimeno Felix, Poder Siim, Kwak Chester, Miecnikowski Matt, Mirrokni Vahab, Dimitriev Alek, Parisi Aaron, Liu Dangyi, Tsai Tomy, Shevlane Toby, Kouridi Christina, Garmon Drew, Goedeckemeyer Adrian, Brown Adam R., Vijayakumar Anitha, Elqursh Ali, Jazayeri Sadegh, Huang Jin, Carthy Sara Mc, Hoover Jay, Kim Lucy, Kumar Sandeep, Chen Wei, Biles Courtney, Bingham Garrett, Rosen Evan, Wang Lisa, Tan Qijun, Engel David, Pongetti Francesco, De Cesare Dario, Hwang Dongseong, Yu Lily, Pullman Jennifer, Narayanan Srini, Levin Kyle, Gopal Siddharth, Li Megan, Aharoni Asaf, Trinh Trieu, Lo Jessica, Casagrande Norman, Vij Roopali, Matthey Loic, Ramadhana Bramandia, 
Matthews Austin, Carey Cj, Johnson Matthew, Goranova Kremena, Shah Rohin, Ashraf Shereen, Dasgupta Kingshuk, Larsen Rasmus, Wang Yicheng, Vuyyuru Manish Reddy, Jiang Chong, Ijazi Joana, Osawa Kazuki, Smith Celine, Boppana Ramya Sree, Bilal Taylan, Koizumi Yuma, Xu Ying, Altun Yasemin, Shabat Nir, Bariach Ben, Korchemniy Alex, Choo Kiam, Ronneberger Olaf, Iwuanyanwu Chimezie, Zhao Shubin, Soergel David, Hsieh Cho-jui, Cai Irene, Iqbal Shariq, Sundermeyer Martin, Chen Zhe, Bursztein Elie, Malaviya Chaitanya, Biadsy Fadi, Shroff Prakash, Dhillon Inderjit, Latkar Tejasi, Dyer Chris, Forbes Hannah, Nicosia Massimo, Nikolaev Vitaly, Greene Somer, Georgiev Marin, Wang Pidong, Martin Nina, Sedghi Hanie, Zhang John, Banzal Praseem, Fritz Doug, Rao Vikram, Wang Xuezhi, Zhang Jiageng, Patraucean Viorica, Du Dayou, Mordatch Igor, Jurin Ivan, Liu Lewis, Dubey Ayush, Mohan Abhi, Nowakowski Janek, Ion Vlad-doru, Wei Nan, Tojo Reiko, Raad Maria Abi, Hudson Drew A., Keshava Vaishakh, Agrawal Shubham, Ramirez Kevin, Wu Zhichun, Nguyen Hoang, Liu Ji, Sewak Madhavi, Petrini Bryce, Choi Donghyun, Philips Ivan, Wang Ziyue, Bica Ioana, Garg Ankush, Wilkiewicz Jarek, Agrawal Priyanka, Li Xiaowei, Guo Danhao, Xue Emily, Shaik Naseer, Leach Andrew, Khan Sadh Mnm, Wiesinger Julia, Jerome Sammy, Chakladar Abhishek, Wang Alek Wenjiao, Ornduff Tina, Abu Folake, Ghaffarkhah Alireza, Wainwright Marcus, Cortes Mario, Liu Frederick, Maynez Joshua, Terzis Andreas, Samangouei Pouya, Mansour Riham, Kępa Tomasz, Aubet François-xavier, Algymr Anton, Banica Dan, Weisz Agoston, Orban Andras, Senges Alexandre, Andrejczuk Ewa, Geller Mark, Santo Niccolo Dal, Anklin Valentin, Merey Majd Al, Baeuml Martin, Strohman Trevor, Bai Junwen, Petrov Slav, Wu Yonghui, Hassabis Demis, Kavukcuoglu Koray, Dean Jeffrey, Vinyals Oriol
- Sketch-guided Constrained Decoding For Boosting Blackbox Large Language Models Without Logit Access Geng Saibo, Döner Berkay, Wendler Chris, Josifoski Martin, West Robert
- Syndarin: Synthesising Datasets For Automated Reasoning In Low-resource Languages Ghazaryan Gayane, Arakelyan Erik, Minervini Pasquale, Augenstein Isabelle
- AI And Memory Wall Gholami Amir, Yao Zhewei, Kim Sehoon, Hooper Coleman, Mahoney Michael W., Keutzer Kurt
- Exploring The Frontier Of Vision-language Models: A Survey Of Current Methodologies And Future Directions Ghosh Akash, Acharya Arkadeep, Saha Sriparna, Jain Vinija, Chadha Aman
- Investigating The Translation Capabilities Of Large Language Models Trained On Parallel Data Only Gilabert Javier García, Escolano Carlos, Savall Aleix Sant, Fornaciari Francesca De Luca, Mash Audrey, Liao Xixian, Melero Maite
- Agentquest: A Modular Benchmark Framework To Measure Progress And Improve LLM Agents Gioacchini Luca, Siracusano Giuseppe, Sanvito Davide, Gashteovski Kiril, Friede David, Bifulco Roberto, Lawrence Carolin
- Explicit And Implicit Large Language Model Personas Generate Opinions But Fail To Replicate Deeper Perceptions And Biases Giorgi Salvatore, Liu Tingting, Aich Ankit, Isman Kelsey, Sherman Garrick, Fried Zachary, Sedoc João, Ungar Lyle H., Curtis Brenda
- Calibrated Large Language Models For Binary Question Answering Giovannotti Patrizio, Gammerman Alexander
- Blendsql: A Scalable Dialect For Unifying Hybrid Question Answering In Relational Algebra Glenn Parker, Dakle Parag Pravin, Wang Liang, Raghavan Preethi
- Chatglm: A Family Of Large Language Models From GLM-130B To GLM-4 All Tools Glm Team, Zeng Aohan, Xu Bin, Wang Bowen, Zhang Chenhui, Yin Da, Zhang Dan, Rojas Diego, Feng Guanyu, Zhao Hanlin, Lai Hanyu, Yu Hao, Wang Hongning, Sun Jiadai, Zhang Jiajie, Cheng Jiale, Gui Jiayi, Tang Jie, Zhang Jing, Sun Jingyu, Li Juanzi, Zhao Lei, Wu Lindong, Zhong Lucen, Liu Mingdao, Huang Minlie, Zhang Peng, Zheng Qinkai, Lu Rui, Duan Shuaiqi, Zhang Shudan, Cao Shulin, Yang Shuxun, Tam Weng Lam, Zhao Wenyi, Liu Xiao, Xia Xiao, Zhang Xiaohan, Gu Xiaotao, Lv Xin, Liu Xinghan, Liu Xinyi, Yang Xinyue, Song Xixuan, Zhang Xunkai, An Yifan, Xu Yifan, Niu Yilin, Yang Yuantao, Li Yueyan, Bai Yushi, Dong Yuxiao, Qi Zehan, Wang Zhaoyu, Yang Zhen, Du Zhengxiao, Hou Zhenyu, Wang Zihan
- Analysis Of Plan-based Retrieval For Grounded Text Generation Godbole Ameya, Monath Nicholas, Kim Seungyeon, Rawat Ankit Singh, Mccallum Andrew, Zaheer Manzil
- Direct Alignment Of Draft Model For Speculative Decoding With Chat-fine-tuned Llms Goel Raghavv, Gagrani Mukul, Jeon Wonseok, Park Junyoung, Lee Mingu, Lott Christopher
- Tokenshap: Interpreting Large Language Models With Monte Carlo Shapley Value Estimation Goldshmidt Roni, Horovicz Miriam
- Mechanistic Interpretability Of Large Language Models With Applications To The Financial Services Industry Golgoon Ashkan, Filom Khashayar, Kannan Arjun Ravi
- I've Got The "answer"! Interpretation Of Llms Hidden States In Question Answering Goloviznina Valeriya, Kotelnikov Evgeny
- Transforming Llms Into Cross-modal And Cross-lingual Retrieval Systems Gomez Frank Palma, Sanabria Ramon, Sung Yun-hsuan, Cer Daniel, Dalmia Siddharth, Abrego Gustavo Hernandez
- AST-T5: Structure-aware Pretraining For Code Generation And Understanding Gong Linyuan, Elhoushi Mostafa, Cheung Alvin
- The Advantages Of Context Specific Language Models: The Case Of The Erasmian Language Model Gonçalves João, Jelicic Nick, Murgia Michele, Stamhuis Evert
- Multi-frame, Lightweight & Efficient Vision-language Models For Question Answering In Autonomous Driving Gopalkrishnan Akshay, Greer Ross, Trivedi Mohan
- Few Shot Chain-of-thought Driven Reasoning To Prompt Llms For Open Ended Medical Question Answering Gramopadhye Ojas, Nachane Saeel Sandeep, Chanda Prateek, Ramakrishnan Ganesh, Jadhav Kshitij Sharad, Nandwani Yatin, Raghu Dinesh, Joshi Sachindra
- Inference Acceleration For Large Language Models Using "stairs" Assisted Greedy Generation Grigaliūnas Domas, Lukoševičius Mantas
- Model Editing Harms General Abilities Of Large Language Models: Regularization To The Rescue Gu Jia-chen, Xu Hao-xiang, Ma Jun-yu, Lu Pan, Ling Zhen-hua, Chang Kai-wei, Peng Nanyun
- Middleware For Llms: Tools Are Instrumental For Language Agents In Complex Environments Gu Yu, Shu Yiheng, Yu Hao, Liu Xiao, Dong Yuxiao, Tang Jie, Srinivasa Jayanth, Latapie Hugo, Su Yu
- Anah-v2: Scaling Analytical Hallucination Annotation Of Large Language Models Gu Yuzhe, Ji Ziwei, Zhang Wenwei, Lyu Chengqi, Lin Dahua, Chen Kai
- Inquire, Interact, And Integrate: A Proactive Agent Collaborative Framework For Zero-shot Multimodal Medical Reasoning Gu Zishan, Liu Fenglin, Yin Changchang, Zhang Ping
- Codeip: A Grammar-guided Multi-bit Watermark For Large Language Models Of Code Guan Batu, Wan Yao, Bi Zhangqian, Wang Zheng, Zhang Hongyu, Zhou Pan, Sun Lichao
- Boosting Lossless Speculative Decoding Via Feature Sampling And Partial Alignment Distillation Gui Lujun, Xiao Bin, Su Lei, Chen Weipeng
- On The Interchangeability Of Positional Embeddings In Multilingual Neural Machine Translation Models Gumma Varun, Chitale Pranjal A., Bali Kalika
- Integrating Large Language Models With Graphical Session-based Recommendation Guo Naicheng, Cheng Hongwei, Liang Qianqiao, Chen Linxun, Han Bing
- Agent-simt: Agent-assisted Simultaneous Machine Translation With Large Language Models Guo Shoutao, Zhang Shaolei, Ma Zhengrui, Zhang Min, Feng Yang
- Sillm: Large Language Models For Simultaneous Machine Translation Guo Shoutao, Zhang Shaolei, Ma Zhengrui, Zhang Min, Feng Yang
- Serial Position Effects Of Large Language Models Guo Xiaobo, Vosoughi Soroush
- Generative AI For Synthetic Data Generation: Methods, Challenges And The Future Guo Xu, Chen Yiqiang
- Attention Score Is Not All You Need For Token Importance Indicator In KV Cache Reduction: Value Also Matters Guo Zhiyu, Kamigaito Hidetaka, Watanabe Taro
- AAVENUE: Detecting LLM Biases On NLU Tasks In AAVE Via A Novel Benchmark Gupta Abhay, Meng Philip, Yurtseven Ece, O'brien Sean, Zhu Kevin
- LLM Task Interference: An Initial Study On The Impact Of Task-switch In Conversational History Gupta Akash, Sheth Ivaxi, Raina Vyas, Gales Mark, Fritz Mario
- Irel At Semeval-2024 Task 9: Improving Conventional Prompting Methods For Brain Teasers Gupta Harshit, Chaudhary Manav, Raha Tathagata, Subramanian Shivansh, Varma Vasudeva
- Changing Answer Order Can Decrease MMLU Accuracy Gupta Vipul, Pantoja David, Ross Candace, Williams Adina, Ung Megan
- Hipporag: Neurobiologically Inspired Long-term Memory For Large Language Models Gutiérrez Bernal Jiménez, Shu Yiheng, Gu Yu, Yasunaga Michihiro, Su Yu
- METAL: Towards Multilingual Meta-evaluation Hada Rishav, Gumma Varun, Ahmed Mohamed, Bali Kalika, Sitaram Sunayana
- Recent Advances In Generative AI And Large Language Models: Current Status, Challenges, And Perspectives Hagos Desta Haileselassie, Battle Rick, Rawat Danda B.
- On The Impacts Of Contexts On Repository-level Code Generation Hai Nam Le, Nguyen Dung Manh, Bui Nghi D. Q.
- PECC: Problem Extraction And Coding Challenges Haller Patrick, Golde Jonas, Akbik Alan
- Llm-personalize: Aligning LLM Planners With Human Preferences Via Reinforced Self-training For Housekeeping Robots Han Dongge, Mcinroe Trevor, Jelley Adam, Albrecht Stefano V., Bell Peter, Storkey Amos
- MERLIN: Multimodal Embedding Refinement Via Llm-based Iterative Navigation For Text-video Retrieval-rerank Pipeline Han Donghoon, Park Eunhwan, Lee Gisang, Lee Adam, Kwak Nojun
- RECIPE4U: Student-chatgpt Interaction Dataset In EFL Writing Education Han Jieun, Yoo Haneul, Myung Junho, Kim Minsun, Lee Tak Yeon, Ahn So-yeon, Oh Alice
- Transfer Learning For Text Diffusion Models Han Kehang, Kenealy Kathleen, Barua Aditya, Fiedel Noah, Constant Noah
- RAG-QA Arena: Evaluating Domain Robustness For Long-form Retrieval Augmented Question Answering Han Rujun, Zhang Yuhao, Qi Peng, Xu Yumo, Wang Jenyuan, Liu Lan, Wang William Yang, Min Bonan, Castelli Vittorio
- Exploring Chatgpt And Its Impact On Society Haque Md. Asraful, Li Shuai
- Risk And Response In Large Language Models: Evaluating Key Threat Categories Harandizadeh Bahareh, Salinas Abel, Morstatter Fred
- It Cannot Be Right If It Was Written By AI: On Lawyers' Preferences Of Documents Perceived As Authored By An LLM Vs A Human Harasta Jakub, Novotná Tereza, Savelka Jaromir
- A Comparison Of Large Language Model And Human Performance On Random Number Generation Tasks Harrison Rachel M.
- A Comprehensive Overview Of Large Language Models (llms) For Cyber Defences: Opportunities And Directions Hassanin Mohammed, Moustafa Nour
- The Larger The Better? Improved LLM Code-generation Via Budget Reallocation Hassid Michael, Remez Tal, Gehring Jonas, Schwartz Roy, Adi Yossi
- Chain-of-instructions: Compositional Instruction Tuning On Large Language Models Hayati Shirley Anugrah, Jung Taehee, Bodding-long Tristan, Kar Sudipta, Sethy Abhinav, Kim Joo-kyung, Kang Dongyeop
- Large Language Models Are Biased Reinforcement Learners Hayes William M., Yax Nicolas, Palminteri Stefano
- Safety Arithmetic: A Framework For Test-time Safety Alignment Of Language Models By Steering Parameters And Activations Hazra Rima, Layek Sayan, Banerjee Somnath, Poria Soujanya
- A Law Of Next-token Prediction In Large Language Models He Hangfeng, Su Weijie J.
- Webvoyager: Building An End-to-end Web Agent With Large Multimodal Models He Hongliang, Yao Wenlin, Ma Kaixin, Yu Wenhao, Dai Yong, Zhang Hongming, Lan Zhenzhong, Yu Dong
- Cos: Enhancing Personalization And Mitigating Bias With Context Steering He Jerry Zhi-yang, Pandey Sashrika, Schrum Mariah L., Dragan Anca
- Efficient LLM Inference With Kcache He Qiaozhi, Wu Zhihua
- Pitvqa: Image-grounded Text Embedding LLM For Visual Question Answering In Pituitary Surgery He Runlong, Xu Mengya, Das Adrito, Khan Danyal Z., Bano Sophia, Marcus Hani J., Stoyanov Danail, Clarkson Matthew J., Islam Mobarakol
- MARS: Mixture Of Auto-regressive Models For Fine-grained Text-to-image Synthesis He Wanggui, Fu Siming, Liu Mushui, Wang Xierui, Xiao Wenyi, Shu Fangxun, Wang Yi, Zhang Lei, Yu Zhelun, Li Haoyuan, Huang Ziwei, Gan Leilei, Jiang Hao
- G-retriever: Retrieval-augmented Generation For Textual Graph Understanding And Question Answering He Xiaoxin, Tian Yijun, Sun Yifei, Chawla Nitesh V., Laurent Thomas, Lecun Yann, Bresson Xavier, Hooi Bryan
- Llms Meet Multimodal Generation And Editing: A Survey He Yingqing, Liu Zhaoyang, Chen Jingye, Tian Zeyue, Liu Hongyu, Chi Xiaowei, Liu Runtao, Yuan Ruibin, Xing Yazhou, Wang Wenhai, Dai Jifeng, Zhang Yong, Xue Wei, Liu Qifeng, Guo Yike, Chen Qifeng
- Fortifying Ethical Boundaries In AI: Advanced Strategies For Enhancing Security In Large Language Models He Yunhong, Qiu Jianling, Zhang Wei, Yuan Zhengqing
- Let The Code LLM Edit Itself When You Edit The Code He Zhenyu, Zhang Jun, Luo Shengjie, Xu Jingjing, Zhang Zhi, He Di
- Telechat Technical Report He Zhongjiang, Wang Zihan, Liu Xinzhang, Liu Shixuan, Yao Yitong, Huang Yuyao, Li Xuelong, Li Yongxiang, Che Zhonghao, Zhang Zhaoxi, Wang Yan, Wang Xin, Pu Luwen, Xu Huinan, Fang Ruiyu, Zhao Yu, Zhang Jie, Huang Xiaomeng, Lu Zhilong, Peng Jiaxin, Zheng Wenjun, Wang Shiquan, Yang Bingkai, He Xuewei, Jiang Zhuoru, Xie Qiyi, Zhang Yanhan, Li Zhongqiu, Shi Lingling, Fu Weiwei, Zhang Yin, Huang Zilu, Xiong Sishi, Zhang Yuxiang, Wang Chao, Song Shuangyong
- HMT: Hierarchical Memory Transformer For Long Context Language Processing He Zifan, Qin Zongyue, Prakriya Neha, Sun Yizhou, Cong Jason
- Evaluation Of The Programming Skills Of Large Language Models Heitz Luc Bryan, Chamas Joun, Scherb Christopher
- Multilingual Needle In A Haystack: Investigating Long-context Behavior Of Multilingual Large Language Models Hengle Amey, Bajpai Prasoon, Dan Soham, Chakraborty Tanmoy
- N-gram Prediction And Word Difference Representations For Language Modeling Heo Dongnyeong, Rim Daniela Noemi, Choi Heeyoul
- Lilium: Ebay's Large Language Models For E-commerce Herold Christian, Kozielski Michael, Ekimov Leonid, Petrushkov Pavel, Vandenbussche Pierre-yves, Khadivi Shahram
- Super Tiny Language Models Hillier Dylan, Guertler Leon, Tan Cheston, Agrawal Palaash, Chen Ruirui, Cheng Bobby
- Deepspeed-fastgen: High-throughput Text Generation For Llms Via MII And Deepspeed-inference Holmes Connor, Tanaka Masahiro, Wyatt Michael, Awan Ammar Ahmad, Rasley Jeff, Rajbhandari Samyam, Aminabadi Reza Yazdani, Qin Heyang, Bakhtiari Arash, Kurilenko Lev, He Yuxiong
- Evaluating The Elementary Multilingual Capabilities Of Large Language Models With Multiq Holtermann Carolin, Röttger Paul, Dill Timm, Lauscher Anne
- The Hallucinations Leaderboard -- An Open Effort To Measure Hallucinations In Large Language Models Hong Giwon, Gema Aryo Pradipta, Saxena Rohit, Du Xiaotang, Nie Ping, Zhao Yu, Perez-beltrachini Laura, Ryabinin Max, He Xuanli, Fourrier Clémentine, Minervini Pasquale
- Towards Building A Robust Knowledge Intensive Question Answering Model With Large Language Models Hong Xingyun, Shao Yan, Wang Zhilin, Duan Manni, Jin Xiongnan
- Accelerating Multilingual Language Model For Excessively Tokenized Languages Hong Jimin, Lee Gibbeum, Cho Jaewoong
- Outcome-constrained Large Language Models For Countering Hate Speech Hong Lingzi, Luo Pengcheng, Blanco Eduardo, Song Xiaoying
- Curiosity-driven Red-teaming For Large Language Models Hong Zhang-wei, Shenfeld Idan, Wang Tsun-hsuan, Chuang Yung-sung, Pareja Aldo, Glass James, Srivastava Akash, Agrawal Pulkit
- Large Language Models For Expansion Of Spoken Language Understanding Systems To New Languages Hoscilowicz Jakub, Pawlowski Pawel, Skorupa Marcin, Sowański Marcin, Janicki Artur
- Securing Vision-language Models With A Robust Encoder Against Jailbreak And Adversarial Attacks Hossain Md Zarif, Imteaj Ahmed
- V-star: Training Verifiers For Self-taught Reasoners Hosseini Arian, Yuan Xingdi, Malkin Nikolay, Courville Aaron, Sordoni Alessandro, Agarwal Rishabh
- A Systematic Evaluation Of Large Language Models For Generating Programming Code Hou Wenpin, Ji Zhicheng
- Uncovering Bias In Large Vision-language Models At Scale With Counterfactuals Howard Phillip, Fraser Kathleen C., Bhiwandiwalla Anahita, Kiritchenko Svetlana
- RAGGED: Towards Informed Design Of Retrieval Augmented Generation Systems Hsia Jennifer, Shaikh Afreen, Wang Zhiruo, Neubig Graham
- Game Generation Via Large Language Models Hu Chengpeng, Zhao Yunlong, Liu Jialin
- Multimodal Chain-of-thought Reasoning Via Chatgpt To Protect Children From Age-inappropriate Apps Hu Chuanbo, Liu Bin, Yin Minglei, Zhou Yilu, Li Xin
- SLM Meets LLM: Balancing Latency, Interpretability And Consistency In Hallucination Detection Hu Mengya, Xu Rui, Lei Deren, Li Yaxi, Wang Mingyu, Ching Emily, Kamal Eslam, Deng Alex
- Mitigating Large Language Model Hallucination With Faithful Finetuning Hu Minda, He Bowei, Wang Yufei, Li Liangyou, Ma Chen, King Irwin
- Minicpm: Unveiling The Potential Of Small Language Models With Scalable Training Strategies Hu Shengding, Tu Yuge, Han Xu, He Chaoqun, Cui Ganqu, Long Xiang, Zheng Zhi, Fang Yewei, Huang Yuxiang, Zhao Weilin, Zhang Xinrong, Thai Zheng Leng, Zhang Kaihuo, Wang Chongyi, Yao Yuan, Zhao Chenyang, Zhou Jie, Cai Jie, Zhai Zhongwu, Ding Ning, Jia Chao, Zeng Guoyang, Li Dahai, Liu Zhiyuan, Sun Maosong
- FOKE: A Personalized And Explainable Education Framework Integrating Foundation Models, Knowledge Graphs, And Prompt Engineering Hu Silan, Wang Xiaoning
- Refchecker: Reference-based Fine-grained Hallucination Checker And Benchmark For Large Language Models Hu Xiangkun, Ru Dongyu, Qiu Lin, Guo Qipeng, Zhang Tianhang, Xu Yang, Luo Yun, Liu Pengfei, Zhang Yue, Zhang Zheng
- Gentranslate: Large Language Models Are Generative Multilingual Speech And Machine Translators Hu Yuchen, Chen Chen, Yang Chao-han Huck, Li Ruizhe, Zhang Dong, Chen Zhehuai, Chng Eng Siong
- RAG And RAU: A Survey On Retrieval-augmented Language Model In Natural Language Processing Hu Yucheng, Lu Yuxing
- Prompt Perturbation In Retrieval-augmented Generation Based Large Language Models Hu Zhibo, Wang Chen, Shu Yanfeng, Paik Hye-young Helen, Zhu Liming
- V2xum-llm: Cross-modal Video Summarization With Temporal Prompt Instruction Tuning Hua Hang, Tang Yunlong, Xu Chenliang, Luo Jiebo
- Large Language Models In Mental Health Care: A Scoping Review Hua Yining, Liu Fenglin, Yang Kailai, Li Zehan, Na Hongbin, Sheu Yi-han, Zhou Peilin, Moran Lauren V., Ananiadou Sophia, Beam Andrew, Torous John
- Applying And Evaluating Large Language Models In Mental Health Care: A Scoping Review Of Human-assessed Generative Tasks Hua Yining, Na Hongbin, Li Zehan, Liu Fenglin, Fang Xiao, Clifton David, Torous John
- Pokergpt: An End-to-end Lightweight Solver For Multi-player Texas Hold'em Via Large Language Model Huang Chenghao, Cao Yanbo, Wen Yinlong, Zhou Tao, Zhang Yanru
- What Affects The Stability Of Tool Learning? An Empirical Study On The Robustness Of Tool Learning Frameworks Huang Chengrui, Shi Zhengliang, Wen Yuntao, Chen Xiuying, Han Peng, Gao Shen, Shang Shuo
- Training Language Models To Generate Text With Citations Via Fine-grained Rewards Huang Chengyu, Wu Zeqiu, Hu Yushi, Wang Wenya
- Generating Educational Materials With Different Levels Of Readability Using Llms Huang Chieh-yang, Wei Jing, Huang Ting-hao 'kenneth'
- Verbalized Probabilistic Graphical Modeling With Large Language Models Huang Hengguan, Shen Xing, Wang Songtao, Liu Dianbo, Wang Hao
- Conme: Rethinking Evaluation Of Compositional Reasoning For Modern Vlms Huang Irene, Lin Wei, Mirza M. Jehanzeb, Hansen Jacob A., Doveh Sivan, Butoi Victor Ion, Herzig Roei, Arbelle Assaf, Kuehne Hilde, Darrell Trevor, Gan Chuang, Oliva Aude, Feris Rogerio, Karlinsky Leonid
- Mitigating Catastrophic Forgetting In Large Language Models With Self-synthesized Rehearsal Huang Jianheng, Cui Leyang, Wang Ante, Yang Chengyi, Liao Xinting, Song Linfeng, Yao Junfeng, Su Jinsong
- A Survey On Evaluation Of Multimodal Large Language Models Huang Jiaxing, Zhang Jingyi
- Reason3d: Searching And Reasoning 3D Segmentation Via Large Language Model Huang Kuan-chih, Li Xiangtai, Qi Lu, Yan Shuicheng, Yang Ming-hsuan
- Integrating Multi-scale Contextualized Information For Byte-based Neural Machine Translation Huang Langlin, Feng Yang
- The Solution For The 5th GCAIAC Zero-shot Referring Expression Comprehension Challenge Huang Longfei, Yu Feng, Guan Zhihao, Wan Zhonghua, Yang Yang
- Translate-and-revise: Boosting Large Language Models For Constrained Translation Huang Pengcheng, Mu Yongyu, Wu Yuzhang, Li Bei, Xiao Chunyang, Xiao Tong, Zhu Jingbo
- The N+ Implementation Details Of RLHF With PPO: A Case Study On TL;DR Summarization Huang Shengyi, Noukhovitch Michael, Hosseini Arian, Rasul Kashif, Wang Weixun, Tunstall Lewis
- Planning And Editing What You Retrieve For Enhanced Tool Learning Huang Tenghao, Jung Dongwon, Chen Muhao
- Brainchat: Decoding Semantic Information From Fmri Using Vision-language Pretrained Models Huang Wanaiu
- Visual Hallucinations Of Multi-modal Large Language Models Huang Wen, Liu Hongbin, Guo Minxin, Gong Neil Zhenqiang
- Prompting Large Language Models With Knowledge Graphs For Question Answering Involving Long-tail Facts Huang Wenyu, Zhou Guancheng, Lapata Mirella, Vougiouklis Pavlos, Montella Sebastien, Pan Jeff Z.
- WESE: Weak Exploration To Strong Exploitation For LLM Agents Huang Xu, Liu Weiwen, Chen Xiaolong, Wang Xingmei, Lian Defu, Wang Yasheng, Tang Ruiming, Chen Enhong
- Lost In The Source Language: How Large Language Models Evaluate The Quality Of Machine Translation Huang Xu, Zhang Zhirui, Geng Xiang, Du Yichao, Chen Jiajun, Huang Shujian
- Aligning Translation-specific Understanding To General Understanding In Large Language Models Huang Yichong, Feng Xiaocheng, Li Baohang, Fu Chengpeng, Huo Wenshuai, Liu Ting, Qin Bing
- A Comprehensive Survey On Evaluating Large Language Model Applications In The Medical Industry Huang Yining, Tang Keke, Chen Meilian, Wang Boyuan
- Fastfid: Improve Inference Efficiency Of Open Domain Question Answering Via Sentence Selection Huang Yufei, Han Xu, Sun Maosong
- Self-introspective Decoding: Alleviating Hallucinations For Large Vision-language Models Huo Fushuo, Xu Wenchao, Zhang Zhong, Wang Haozhao, Chen Zhicheng, Zhao Peilin
- Enhancing Incremental Summarization With Structured Representations Hwang Eunjeong, Zhou Yichao, Wendt James Bradley, Gunel Beliz, Vo Nguyen, Xie Jing, Tata Sandeep
- MP2D: An Automated Topic Shift Dialogue Generation Framework Leveraging Knowledge Graphs Hwang Yerin, Kim Yongil, Jang Yunah, Bang Jeesoo, Bae Hyunkyung, Jung Kyomin
- Assessing Llms Suitability For Knowledge Graph Completion Iga Vasile Ionut Remus, Silaghi Gheorghe Cosmin
- Improving Multilingual Instruction Finetuning Via Linguistically Natural And Diverse Datasets Indurthi Sathish Reddy, Zhou Wenxuan, Chollampatt Shamil, Agrawal Ravi, Song Kaiqiang, Zhao Lingxiao, Zhu Chenguang
- Breaking The Language Barrier: Can Direct Inference Outperform Pre-translation In Multilingual LLM Applications? Intrator Yotam, Halfon Matan, Goldenberg Roman, Tsarfaty Reut, Eyal Matan, Rivlin Ehud, Matias Yossi, Aizenberg Natalia
- Hallucinations Or Attention Misdirection? The Path To Strategic Value Extraction In Business Using Large Language Models Ioste Aline
- Grade Score: Quantifying LLM Performance In Option Selection Iourovitski Dmitri
- Scaling Laws For Downstream Task Performance Of Large Language Models Isik Berivan, Ponomareva Natalia, Hazimeh Hussein, Paparas Dimitris, Vassilvitskii Sergei, Koyejo Sanmi
- Autocompletion Of Chief Complaints In The Electronic Health Records Using Large Language Models Islam K M Sajjadul, Nipu Ayesha Siddika, Madiraju Praveen, Deshpande Priya
- Mapcoder: Multi-agent Code Generation For Competitive Problem Solving Islam Md. Ashraful, Ali Mohammed Eunus, Parvez Md Rizwan
- Are Large Vision Language Models Up To The Challenge Of Chart Comprehension And Reasoning? An Extensive Investigation Into The Capabilities And Limitations Of Lvlms Islam Mohammed Saidul, Rahman Raian, Masry Ahmed, Laskar Md Tahmid Rahman, Nayeem Mir Tafseer, Hoque Enamul
- Quality Or Quantity? On Data Scale And Diversity In Adapting Large Language Models For Low-resource Translation Iyer Vivek, Malik Bhavitvya, Stepachev Pavel, Chen Pinzhen, Haddow Barry, Birch Alexandra
- Response Generation For Cognitive Behavioral Therapy With Large Language Models: Comparative Study With Socratic Questioning Izumi Kenta, Tanaka Hiroki, Shidara Kazuhiro, Adachi Hiroyoshi, Kanayama Daisuke, Kudo Takashi, Nakamura Satoshi
- Integrating Large Language Models With Graph-based Reasoning For Conversational Question Answering Jain Parag, Lapata Mirella
- Ffn-skipllm: A Hidden Gem For Autoregressive Decoding With Adaptive Feed Forward Skipping Jaiswal Ajay, Hu Bodun, Yin Lu, Ro Yeonju, Liu Shiwei, Chen Tianlong, Akella Aditya
- Are Human Conversations Special? A Large Language Model Perspective Jawale Toshish, Animesh Chaitanya, Vallath Sekhar, Talamadupula Kartik, Heck Larry
- OLAPH: Improving Factuality In Biomedical Long-form Question Answering Jeong Minbyul, Hwang Hyeon, Yoon Chanwoong, Lee Taewhoo, Kang Jaewoo
- RLSF: Reinforcement Learning Via Symbolic Feedback Jha Piyush, Jana Prithwish, Arora Arnav, Ganesh Vijay
- Precision Empowers, Excess Distracts: Visual Question Answering With Dynamically Infused Knowledge In Language Models Jhalani Manas, M Annervaz K, Bhattacharyya Pushpak
- Towards Flexible Evaluation For Generative Visual Question Answering Ji Huishan, Si Qingyi, Lin Zheng, Wang Weiping
- ANAH: Analytical Annotation Of Hallucinations In Large Language Models Ji Ziwei, Gu Yuzhe, Zhang Wenwei, Lyu Chengqi, Lin Dahua, Chen Kai
- Generalizing Reward Modeling For Out-of-distribution Preference Learning Jia Chen
- Bwarea Model: Learning World Model, Inverse Dynamics, And Policy For Controllable Language Generation Jia Chengxing, Wang Pengyuan, Li Ziniu, Li Yi-chen, Zhang Zhilong, Tang Nan, Yu Yang
- Pre-training Cross-lingual Open Domain Question Answering With Large-scale Synthetic Supervision Jiang Fan, Drummond Tom, Cohn Trevor
- Can Large Language Models Generate High-quality Patent Claims? Jiang Lekang, Zhang Caiqi, Scherz Pascal A, Goetz Stephan
- Detecting Scams Using Large Language Models Jiang Liming
- Utilizing Large Language Models To Detect Privacy Leaks In Mini-app Code Jiang Liming
- Seek And Solve Reasoning For Table Question Answering Jiang Ruya, Wang Chun, Deng Weihong
- Turning Generative Models Degenerate: The Power Of Data Poisoning Attacks Jiang Shuli, Kadhe Swanand Ravindra, Zhou Yi, Ahmed Farhan, Cai Ling, Baracaldo Nathalie
- Sketch: A Toolkit For Streamlining LLM Operations Jiang Xin, Li Xiang, Ma Wenjia, Fang Xuezhi, Yao Yiqun, Yu Naitong, Meng Xuying, Han Peng, Li Jing, Sun Aixin, Wang Yequan
- SEED: Customize Large Language Models With Sample-efficient Adaptation For Code Generation Jiang Xue, Dong Yihong, Jin Zhi, Li Ge
- Unlocking The Power Of Large Language Models For Entity Alignment Jiang Xuhui, Shen Yinghan, Shi Zhichao, Xu Chengjin, Li Wei, Li Zixuan, Guo Jian, Shen Huawei, Wang Yuanzhuo
- Large Visual-language Models Are Also Good Classifiers: A Study Of In-context Multimodal Fake News Detection Jiang Ye, Wang Yimin
- Retrieve, Summarize, Plan: Advancing Multi-hop Question Answering With An Iterative Approach Jiang Zhouyu, Sun Mengshu, Liang Lei, Zhang Zhiqiang
- Gradable Chatgpt Translation Evaluation Jiao Hui, Peng Bei, Zong Lu, Zhang Xiaojun, Li Xinwei
- Prompt-based Length Controlled Generation With Multiple Control Types Jie Renlong, Meng Xiaojun, Shang Lifeng, Jiang Xin, Liu Qun
- GUARD: Role-playing To Generate Natural-language Jailbreakings To Test Guideline Adherence Of Large Language Models Jin Haibo, Chen Ruoxi, Zhou Andy, Zhang Yang, Wang Haohan
- BIDER: Bridging Knowledge Inconsistency For Efficient Retrieval-augmented Llms Via Key Supporting Evidence Jin Jiajie, Zhu Yutao, Zhou Yujia, Dou Zhicheng
- Can Chatgpt Support Developers? An Empirical Evaluation Of Large Language Models For Code Generation Jin Kailun, Wang Chung-yu, Pham Hung Viet, Hemmati Hadi
- Self-selected Attention Span For Accelerating Large Language Model Inference Jin Tian, Yazar Wanzin, Xu Zifei, Sharify Sayeh, Wang Xin
- Logical Closed Loop: Uncovering Object Hallucinations In Large Vision-language Models Wu Junfei, Liu Qiang, Wang Ding, Zhang Jinghao, Wu Shu, Wang Liang, Tan Tieniu
- Information-theoretic Distillation For Reference-less Summarization Jung Jaehun, Lu Ximing, Jiang Liwei, Brahman Faeze, West Peter, Koh Pang Wei, Choi Yejin
- Effective Context Selection In Llm-based Leaderboard Generation: An Empirical Study Kabongo Salomon, D'souza Jennifer, Auer Sören
- A Survey On Hardware Accelerators For Large Language Models Kachris Christoforos
- RAP: Retrieval-augmented Planning With Contextual Memory For Multimodal LLM Agents Kagaya Tomoyuki, Yuan Thong Jing, Lou Yuxuan, Karlekar Jayashree, Pranata Sugiri, Kinose Akira, Oguri Koki, Wick Felix, You Yang
- Language Repository For Long Video Understanding Kahatapitiya Kumara, Ranasinghe Kanchana, Park Jongwoo, Ryoo Michael S.
- Eagle: Ethical Dataset Given From Real Interactions Kaneko Masahiro, Bollegala Danushka, Baldwin Timothy
- C-RAG: Certified Generation Risks For Retrieval-augmented Language Models Kang Mintong, Gürel Nezihe Merve, Yu Ning, Song Dawn, Li Bo
- Prompting Large Language Models With Audio For General-purpose Speech Summarization Kang Wonjune, Roy Deb
- Continuous Language Model Interpolation For Dynamic And Controllable Text Generation Kangaslahti Sara, Alvarez-melis David
- MEDIC: Towards A Comprehensive Framework For Evaluating Llms In Clinical Applications Kanithi Praveen K, Christophe Clément, Pimentel Marco Af, Raha Tathagata, Saadi Nada, Javed Hamza, Maslenkova Svetlana, Hayat Nasir, Rajan Ronnie, Khan Shadab
- Visualizing Dialogues: Enhancing Image Selection Through Dialogue Understanding With Large Language Models Kao Chang-sheng, Chen Yun-nung
- Omniact: A Dataset And Benchmark For Enabling Multimodal Generalist Autonomous Agents For Desktop And Web Kapoor Raghav, Butala Yash Parag, Russak Melisa, Koh Jing Yu, Kamble Kiran, Alshikh Waseem, Salakhutdinov Ruslan
- Large Language Models Must Be Taught To Know What They Don't Know Kapoor Sanyam, Gruver Nate, Roberts Manley, Collins Katherine, Pal Arka, Bhatt Umang, Weller Adrian, Dooley Samuel, Goldblum Micah, Wilson Andrew Gordon
- Beyond Traditional Benchmarks: Analyzing Behaviors Of Open Llms On Data-to-text Generation Kasner Zdeněk, Dušek Ondřej
- From Text To Transformation: A Comprehensive Review Of Large Language Models' Versatility Kaur Pravneet, Kashyap Gautam Siddharth, Kumar Ankit, Nafis Md Tabrez, Kumar Sandeep, Shokeen Vikrant
- Exploring The Frontiers Of Llms In Psychological Applications: A Comprehensive Review Ke Luoma, Tong Song, Cheng Peng, Peng Kaiping
- Development And Testing Of Retrieval Augmented Generation In Large Language Models -- A Case Study Report Ke Yuhe, Jin Liyuan, Elangovan Kabilan, Abdullah Hairil Rizal, Liu Nan, Sia Alex Tiong Heng, Soh Chai Rick, Tung Joshua Yi Min, Ong Jasmine Chiat Ling, Ting Daniel Shu Wei
- Visiongpt: Vision-language Understanding Agent Using Generalized Multimodal Framework Kelly Chris, Hu Luhui, Yang Bang, Tian Yu, Yang Deshun, Yang Cindy, Huang Zaoshan, Li Zihao, Hu Jiayin, Zou Yuexian
- On Scalable Oversight With Weak Llms Judging Strong Llms Kenton Zachary, Siegel Noah Y., Kramár János, Brown-cohen Jonah, Albanie Samuel, Bulian Jannis, Agarwal Rishabh, Lindner David, Tang Yunhao, Goodman Noah D., Shah Rohin
- Autonomous Prompt Engineering In Large Language Models Kepel Daan, Valogianni Konstantina
- Reinforcement Of Explainability Of Chatgpt Prompts By Embedding Breast Cancer Self-screening Rules Into AI Responses Khan Yousef, Hamed Ahmed Abdeen
- Self-training Large Language Models For Improved Visual Program Synthesis With Visual Reinforcement Khan Zaid, Bg Vijay Kumar, Schulter Samuel, Fu Yun, Chandraker Manmohan
- Consistency And Uncertainty: Identifying Unreliable Responses From Black-box Vision-language Models For Selective Visual Question Answering Khan Zaid, Fu Yun
- Tabular Embedding Model (TEM): Finetuning Embedding Models For Tabular RAG Applications Khanna Sujit, Subedi Shishir
- How Good Is My Video LMM? Complex Video Reasoning And Robustness Evaluation Suite For Video-lmms Khattak Muhammad Uzair, Naeem Muhammad Ferjad, Hassan Jameel, Naseer Muzammal, Tombari Federico, Khan Fahad Shahbaz, Khan Salman
- Assessing Language Models' Worldview For Fiction Generation Khatun Aisha, Brown Daniel G.
- A Study On Large Language Models' Limitations In Multiple-choice Question Answering Khatun Aisha, Brown Daniel G.
- Efficient Medical Question Answering With Knowledge-augmented Question Generation Khlaut Julien, Dancette Corentin, Ferreres Elodie, Bennani Alaedine, Hérent Paul, Manceron Pierre
- Beyond Code Generation: An Observational Study Of Chatgpt Usage In Software Engineering Practice Khojah Ranim, Mohamad Mazen, Leitner Philipp, Neto Francisco Gomes De Oliveira
- A Comprehensive Survey Of Accelerated Generation Techniques In Large Language Models Khoshnoodi Mahsa, Jain Vinija, Gao Mingye, Srikanth Malavika, Chadha Aman
- Guiding Large Language Models To Post-edit Machine Translation With Error Annotations Ki Dayeon, Carpuat Marine
- Ontology-free General-domain Knowledge Graph-to-text Generation Dataset Synthesis Using Large Language Model Kim Daehee, Kang Deokhyung, Ryu Sangwon, Lee Gary Geunbae
- Fine-tuning CLIP Text Encoders With Two-step Paraphrasing Kim Hyunjae, Yoon Seunghyun, Bui Trung, Zhao Handong, Tran Quan, Dernoncourt Franck, Kang Jaewoo
- Sure: Summarizing Retrievals Using Answer Candidates For Open-domain QA Of Llms Kim Jaehyung, Nam Jaehyun, Mo Sangwoo, Park Jongjin, Lee Sang-woo, Seo Minjoon, Ha Jung-woo, Shin Jinwoo
- RE-RAG: Improving Open-domain QA Performance And Interpretability With Relevance Estimator In Retrieval-augmented Generation Kim Kiseung, Lee Jay-yoon
- Advisorqa: Towards Helpful And Harmless Advice-seeking Question Answering With Collective Intelligence Kim Minbeom, Lee Hwanhee, Park Joonsuk, Lee Hwaran, Jung Kyomin
- Augmenting Query And Passage For Retrieval-augmented Generation Using Llms For Open-domain Question Answering Kim Minsang, Park Cheoneum, Baek Seungjun
- Automatic Jailbreaking Of The Text-to-image Generative AI Systems Kim Minseon, Lee Hyomin, Gong Boqing, Zhang Huishuai, Hwang Sung Ju
- THEANINE: Revisiting Memory Management In Long-term Conversations With Timeline-augmented Response Generation Kim Seo Hyun, Ong Kai Tzu-iunn, Kwon Taeyoon, Kim Namyoung, Ka Keummin, Bae Seonghyeon, Jo Yohan, Hwang Seung-won, Lee Dongha, Yeo Jinyoung
- A Survey On Integration Of Large Language Models With Intelligent Robots Kim Yeseung, Kim Dohyun, Choi Jieun, Park Jisang, Oh Nayoung, Park Daehyung
- Tablevqa-bench: A Visual Question Answering Benchmark On Multiple Table Domains Kim Yoonsik, Yim Moonbin, Song Ka Yeon
- Adaptive Contrastive Decoding In Retrieval-augmented Generation For Handling Noisy Contexts Kim Youna, Kim Hyuhng Joon, Park Cheonbok, Park Choonghyun, Cho Hyunsoo, Kim Junyeob, Yoo Kang Min, Lee Sang-goo, Kim Taeuk
- Health-llm: Large Language Models For Health Prediction Via Wearable Sensor Data Kim Yubin, Xu Xuhai, Mcduff Daniel, Breazeal Cynthia, Park Hae Won
- Medexqa: Medical Question Answering Benchmark With Multiple Explanations Kim Yunsoo, Wu Jinge, Abdulle Yusuf, Wu Honghan
- What's Wrong? Refining Meeting Summaries With LLM Feedback Kirstein Frederic, Ruas Terry, Gipp Bela
- Unveiling Divergent Inductive Biases Of Llms On Temporal Data Kishore Sindhu, He Hangfeng
- Cleared For Takeoff? Compositional & Conditional Reasoning May Be The Achilles Heel To (flight-booking) Language Agents Kohli Harsh, Sun Huan
- Blending Llms Into Cascaded Speech Translation: Kit's Offline Speech Translation System For IWSLT 2024 Koneru Sai, Nguyen Thai-binh, Pham Ngoc-quan, Liu Danni, Li Zhaolin, Waibel Alexander, Niehues Jan
- Aligning Large Language Models With Representation Editing: A Control Perspective Kong Lingkai, Wang Haorui, Mu Wenhao, Du Yuanqi, Zhuang Yuchen, Zhou Yifei, Song Yue, Zhang Rongzhi, Wang Kai, Zhang Chao
- Prewrite: Prompt Rewriting With Reinforcement Learning Kong Weize, Hombaiah Spurthi Amba, Zhang Mingyang, Mei Qiaozhu, Bendersky Michael
- Controllable Navigation Instruction Generation With Chain Of Thought Prompting Kong Xianghao, Chen Jinyu, Wang Wenguan, Su Hang, Hu Xiaolin, Yang Yi, Liu Si
- Audio Flamingo: A Novel Audio Language Model With Few-shot Learning And Dialogue Abilities Kong Zhifeng, Goel Arushi, Badlani Rohan, Ping Wei, Valle Rafael, Catanzaro Bryan
- Proptest: Automatic Property Testing For Improved Visual Programming Koo Jaywon, Yang Ziyan, Cascante-bonilla Paola, Ray Baishakhi, Ordonez Vicente
- Transllama: Llm-based Simultaneous Translation System Koshkin Roman, Sudoh Katsuhito, Nakamura Satoshi
- Llms Are Zero-shot Context-aware Simultaneous Translators Koshkin Roman, Sudoh Katsuhito, Nakamura Satoshi
- Hints-in-browser: Benchmarking Language Models For Programming Feedback Generation Kotalwar Nachiket, Gotovos Alkis, Singla Adish
- Arabiangpt: Native Arabic Gpt-based Large Language Model Koubaa Anis, Ammar Adel, Ghouti Lahouari, Najar Omar, Sibaee Serry
- Large Language Models For Tuning Evolution Strategies Kramer Oliver
- Retrieval-augmented Code Generation For Situated Action Generation: A Case Study On Minecraft Chalamalasetti Kranti, Hakimov Sherzod, Schlangen David
- Understanding The Effects Of Iterative Prompting On Truthfulness Krishna Satyapriya, Agarwal Chirag, Lakkaraju Himabindu
- Can Large Language Models Explore In-context? Krishnamurthy Akshay, Harris Keegan, Foster Dylan J., Zhang Cyril, Slivkins Aleksandrs
- Synthdst: Synthetic Data Is All You Need For Few-shot Dialog State Tracking Kulkarni Atharva, Tseng Bo-hsiang, Moniz Joel Ruben Antony, Piraviperumal Dhivya, Yu Hong, Bhargava Shruti
- Reinforcement Learning For Optimizing RAG For Domain Chatbots Kulkarni Mandar, Tangarajan Praveen, Kim Kyung, Trivedi Anusua
- Automatic Question-answer Generation For Long-tail Knowledge Kumar Rohan, Kim Youngmin, Ravi Sunitha, Sun Haitian, Faloutsos Christos, Salakhutdinov Ruslan, Yoon Minji
- Mt-eval: A Multi-turn Capabilities Evaluation Benchmark For Large Language Models Kwan Wai-chung, Zeng Xingshan, Jiang Yuxin, Wang Yufei, Li Liangyou, Shang Lifeng, Jiang Xin, Liu Qun, Wong Kam-fai
- Are Llms Effective Negotiators? Systematic Evaluation Of The Multifaceted Capabilities Of Llms In Negotiation Dialogues Kwon Deuksin, Weiss Emily, Kulshrestha Tara, Chawla Kushal, Lucas Gale M., Gratch Jonathan
- Zero-shot Text-guided Infinite Image Synthesis With LLM Guidance Kwon Soyeong, Lee Taegyeong, Kim Taehwan
- Can Github Issues Be Solved With Tree Of Thoughts? La Rosa Ricardo, Hulse Corey, Liu Bangdi
- Biomistral: A Collection Of Open-source Pretrained Large Language Models For Medical Domains Labrak Yanis, Bazoge Adrien, Morin Emmanuel, Gourraud Pierre-antoine, Rouvier Mickael, Dufour Richard
- When To Retrieve: Teaching Llms To Utilize Information Retrieval Effectively Labruna Tiziano, Campos Jon Ander, Azkune Gorka
- SEC-QA: A Systematic Evaluation Corpus For Financial QA Lai Viet Dac, Krumdick Michael, Lovering Charles, Reddy Varshini, Schmidt Craig, Tanner Chris
- RVISA: Reasoning And Verification For Implicit Sentiment Analysis Lai Wenna, Xie Haoran, Xu Guandong, Li Qing
- Alarm: Align Language Models Via Hierarchical Rewards Modeling Lai Yuhang, Wang Siyuan, Liu Shujun, Huang Xuanjing, Wei Zhongyu
- Residual-based Language Models Are Free Boosters For Biomedical Imaging Lai Zhixin, Wu Jing, Chen Suiyao, Zhou Yucheng, Hovakimyan Naira
- Criticeval: Evaluating Large Language Model As Critic Lan Tian, Zhang Wenwei, Xu Chen, Huang Heyan, Lin Dahua, Chen Kai, Mao Xian-ling
- Enhancing Neural Machine Translation Of Low-resource Languages: Corpus Development, Human Evaluation And Explainable AI Architectures Lankford Séamus
- Adaptmllm: Fine-tuning Multilingual Language Models On Low-resource Languages With Integrated LLM Playgrounds Lankford Séamus, Afli Haithem, Way Andy
- Human Evaluation Of English--irish Transformer-based NMT Lankford Séamus, Afli Haithem, Way Andy
- Adaptnmt: An Open-source, Language-agnostic Development Environment For Neural Machine Translation Lankford Séamus, Afli Haithem, Way Andy
- Transformers For Low-resource Languages: Is Féidir Linn! Lankford Séamus, Afli Haithem, Way Andy
- IBD: Alleviating Hallucinations In Large Vision-language Models Via Image-biased Decoding Zhu Lanyun, Ji Deyi, Chen Tianrun, Xu Peng, Ye Jieping, Liu Jun
- A Systematic Survey And Critical Review On Evaluating Large Language Models: Challenges, Limitations, And Recommendations Laskar Md Tahmid Rahman, Alqahtani Sawsan, Bari M Saiful, Rahman Mizanur, Khan Mohammad Abdullah Matin, Khan Haidar, Jahan Israt, Bhuiyan Amran, Tan Chee Wei, Parvez Md Rizwan, Hoque Enamul, Joty Shafiq, Huang Jimmy
- Query-opt: Optimizing Inference Of Large Language Models Via Multi-query Instructions In Meeting Summarization Laskar Md Tahmid Rahman, Khasanova Elena, Fu Xue-yong, Chen Cheng, Tn Shashi Bhushan
- Melting Point: Mobile Evaluation Of Language Transformers Laskaridis Stefanos, Katevas Kleomenis, Minto Lorenzo, Haddadi Hamed
- INDICT: Code Generation With Internal Dialogues Of Critiques For Both Security And Helpfulness Le Hung, Zhou Yingbo, Xiong Caiming, Savarese Silvio, Sahoo Doyen
- MATTER: Memory-augmented Transformer Using Heterogeneous Knowledge Sources Lee Dongkyu, Prakash Chandana Satya, Fitzgerald Jack, Lehmann Jens
- Hip Attention: Sparse Sub-quadratic Attention With Hierarchical Attention Pruning Lee Heejun, Park Geon, Lee Youngwan, Kim Jina, Jeong Wonyoung, Jeon Myeongjae, Hwang Sung Ju
- CATS: Contextually-aware Thresholding For Sparsity In Large Language Models Lee Je-yong, Lee Donghyun, Zhang Genghan, Tiwari Mo, Mirhoseini Azalia
- The Life Cycle Of Large Language Models: A Review Of Biases In Education Lee Jinsook, Hicke Yann, Yu Renzhe, Brooks Christopher, Kizilcec René F.
- Multimodal Reasoning With Multimodal Knowledge Graph Lee Junlin, Wang Yequan, Li Jing, Zhang Min
- Visual Question Answering Instruction: Unlocking Multimodal Large Language Model To Domain-specific Visual Multitasks Lee Jusung, Cha Sungguk, Lee Younghyun, Yang Cheoljong
- Probability Of Differentiation Reveals Brittleness Of Homogeneity Bias In Large Language Models Lee Messi H. J., Lai Calvin K.
- LLM2LLM: Boosting Llms With Novel Iterative Data Enhancement Lee Nicholas, Wattanawong Thanakul, Kim Sehoon, Mangalam Karttikeya, Shen Sheng, Anumanchipalli Gopala, Mahoney Michael W., Keutzer Kurt, Gholami Amir
- Prompting Large Language Models For Zero-shot Essay Scoring Via Multi-trait Specialization Lee Sanwoo, Cai Yida, Meng Desong, Wang Ziyang, Wu Yunfang
- Effective And Efficient Conversation Retrieval For Dialogue State Tracking With Implicit Text Summaries Lee Seanie, Cheng Jianpeng, Driesen Joris, Coca Alexandru, Johannsen Anders
- Do Llms Have Distinct And Consistent Personality? TRAIT: Personality Testset Designed For Llms With Psychometrics Lee Seungbeen, Lim Seungwon, Han Seungju, Oh Giyeong, Chae Hyungjoo, Chung Jiwan, Kim Minju, Kwak Beong-woo, Lee Yeonsoo, Lee Dongha, Yeo Jinyoung, Yu Youngjae
- Materials Science In The Era Of Large Language Models: A Perspective Lei Ge, Docherty Ronan, Cooper Samuel J.
- Prexme! Large Scale Prompt Exploration Of Open Source Llms For Machine Translation And Summarization Evaluation Leiter Christoph, Eger Steffen
- Intent-based Prompt Calibration: Enhancing Prompt Optimization With Synthetic Boundary Cases Levi Elad, Brosh Eli, Friedmann Matan
- Cascade Reward Sampling For Efficient Decoding-time Alignment Li Bolian, Wang Yifan, Grama Ananth, Zhang Ruqi
- Are Bigger Encoders Always Better In Vision Large Models? Li Bozhou, Liang Hao, Meng Zimo, Zhang Wentao
- Eliciting Better Multilingual Structured Reasoning From Llms Through Code Li Bryan, Alkhouli Tamer, Bonadiman Daniele, Pappas Nikolaos, Mansour Saab
- Searchlvlms: A Plug-and-play Framework For Augmenting Large Vision-language Models By Searching Up-to-date Internet Knowledge Li Chuanhao, Li Zhen, Jing Chenchen, Liu Shuo, Shao Wenqi, Wu Yuwei, Luo Ping, Qiao Yu, Zhang Kaipeng
- Humaneval On Latest GPT Models -- 2024 Li Daniel, Murr Lincoln
- Contextualization Distillation From Large Language Model For Knowledge Graph Completion Li Dawei, Tan Zhen, Chen Tianlong, Liu Huan
- Hello Again! Llm-powered Personalized Agent For Long-term Dialogue Li Hao, Yang Chenghao, Zhang An, Deng Yang, Wang Xiang, Chua Tat-seng
- Look Within, Why Llms Hallucinate: A Causal Perspective Li He, Chi Haoang, Liu Mingyu, Yang Wenjing
- Datacomp-lm: In Search Of The Next Generation Of Training Sets For Language Models Li Jeffrey, Fang Alex, Smyrnis Georgios, Ivgi Maor, Jordan Matt, Gadre Samir, Bansal Hritik, Guha Etash, Keh Sedrick, Arora Kushal, Garg Saurabh, Xin Rui, Muennighoff Niklas, Heckel Reinhard, Mercat Jean, Chen Mayee, Gururangan Suchin, Wortsman Mitchell, Albalak Alon, Bitton Yonatan, Nezhurina Marianna, Abbas Amro, Hsieh Cheng-yu, Ghosh Dhruba, Gardner Josh, Kilian Maciej, Zhang Hanlin, Shao Rulin, Pratt Sarah, Sanyal Sunny, Ilharco Gabriel, Daras Giannis, Marathe Kalyani, Gokaslan Aaron, Zhang Jieyu, Chandu Khyathi, Nguyen Thao, Vasiljevic Igor, Kakade Sham, Song Shuran, Sanghavi Sujay, Faghri Fartash, Oh Sewoong, Zettlemoyer Luke, Lo Kyle, El-nouby Alaaeldin, Pouransari Hadi, Toshev Alexander, Wang Stephanie, Groeneveld Dirk, Soldaini Luca, Koh Pang Wei, Jitsev Jenia, Kollar Thomas, Dimakis Alexandros G., Carmon Yair, Dave Achal, Schmidt Ludwig, Shankar Vaishaal
- MT-PATCHER: Selective And Extendable Knowledge Distillation From Large Language Models For Machine Translation Li Jiahuan, Cheng Shanbo, Huang Shujian, Chen Jiajun
- A Cross-language Investigation Into Jailbreak Attacks In Large Language Models Li Jie, Liu Yi, Liu Chongyang, Shi Ling, Ren Xiaoning, Zheng Yaowen, Liu Yang, Xue Yinxing
- Pctoolkit: A Unified Plug-and-play Prompt Compression Toolkit Of Large Language Models Li Jinyi, Lan Yihuai, Wang Lei, Wang Hao
- Chemvlm: Exploring The Power Of Multimodal Large Language Models In Chemistry Area Li Junxian, Zhang Di, Wang Xunzhi, Hao Zeying, Lei Jingdi, Tan Qian, Zhou Cai, Liu Wei, Yang Yaotian, Xiong Xinrui, Wang Weiyun, Chen Zhe, Wang Wenhai, Li Wei, Zhang Shufei, Su Mao, Ouyang Wanli, Li Yuqiang, Zhou Dongzhan
- The Dawn After The Dark: An Empirical Study On Factuality Hallucination In Large Language Models Li Junyi, Chen Jie, Ren Ruiyang, Cheng Xiaoxue, Zhao Wayne Xin, Nie Jian-yun, Wen Ji-rong
- Predicting Vs. Acting: A Trade-off Between World Modeling & Agent Modeling Li Margaret, Shi Weijia, Pagnoni Artidoro, West Peter, Holtzman Ari
- A Sentiment Consolidation Framework For Meta-review Generation Li Miao, Lau Jey Han, Hovy Eduard
- Bridging The Language Gap: Enhancing Multilingual Prompt-based Code Generation In Llms Via Zero-shot Cross-lingual Transfer Li Mingda, Mishra Abhijit, Mujumdar Utkarsh
- Needlebench: Can Llms Do Retrieval And Reasoning In 1 Million Context Window? Li Mo, Zhang Songyang, Liu Yunxin, Chen Kai
- From Text To Insight: Leveraging Large Language Models For Performance Evaluation In Management Li Ning, Zhou Huaikang, Xu Mingze
- IQA-EVAL: Automatic Evaluation Of Human-model Interactive Question Answering Li Ruosen, Wang Barry, Li Ruochen, Du Xinya
- MEDIQ: Question-asking Llms For Adaptive And Reliable Clinical Reasoning Li Shuyue Stella, Balachandran Vidhisha, Feng Shangbin, Ilgen Jonathan, Pierson Emma, Koh Pang Wei, Tsvetkov Yulia
- Improving Faithfulness Of Large Language Models In Summarization Via Sliding Generation And Self-consistency Li Taiji, Li Zhi, Zhang Yin
- Stbench: Assessing The Ability Of Large Language Models In Spatio-temporal Analysis Li Wenbin, Yao Di, Zhao Ruibo, Chen Wenjie, Xu Zijie, Luo Chengxue, Gong Chang, Jing Quanliang, Tan Haining, Bi Jingping
- Reinforcement Learning With Token-level Feedback For Controllable Text Generation Li Wendi, Wei Wei, Xu Kaihe, Xie Wenfeng, Chen Dangyang, Cheng Yu
- Optimizing Psychological Counseling With Instruction-tuned Large Language Models Li Wenjie, Sun Tianyu, Qian Kun, Wang Wenhong
- Tele-flm Technical Report Li Xiang, Yao Yiqun, Jiang Xin, Fang Xuezhi, Wang Chao, Liu Xinzhang, Wang Zihan, Zhao Yu, Wang Xin, Huang Yuyao, Song Shuangyong, Li Yongxiang, Zhang Zheng, Zhao Bo, Sun Aixin, Wang Yequan, He Zhongjiang, Wang Zhongyuan, Li Xuelong, Huang Tiejun
- Making Long-context Language Models Better Multi-hop Reasoners Li Yanyang, Liang Shuo, Lyu Michael R., Wang Liwei
- Backdoorllm: A Comprehensive Benchmark For Backdoor Attacks On Large Language Models Li Yige, Huang Hanxun, Zhao Yunhan, Ma Xingjun, Sun Jun
- Womd-reasoning: A Large-scale Language Dataset For Interaction And Driving Intentions Reasoning Li Yiheng, Ge Chongjian, Li Chenran, Xu Chenfeng, Tomizuka Masayoshi, Tang Chen, Ding Mingyu, Zhan Wei
- The First Place Solution Of WSDM Cup 2024: Leveraging Large Language Models For Conversational Multi-doc QA Li Yiming, Zhang Zhao
- Style-talker: Finetuning Audio Language Model And Style-based Text-to-speech Model For Fast Spoken Dialogue Generation Li Yinghao Aaron, Jiang Xilin, Darefsky Jordan, Zhu Ge, Mesgarani Nima
- A Simple But Effective Approach To Improve Structured Language Model Output For Information Extraction Li Yinghao, Ramprasad Rampi, Zhang Chao
- Data Generation Using Large Language Models For Text Classification: An Empirical Case Study Li Yinheng, Bonatti Rogerio, Abdali Sara, Wagle Justin, Koishida Kazuhito
- Guiding Enumerative Program Synthesis With Large Language Models Li Yixuan, Parsert Julian, Polgreen Elizabeth
- Finding Challenging Metaphors That Confuse Pretrained Language Models Li Yucheng, Guerin Frank, Lin Chenghua
- Dynamic Data Sampler For Cross-language Transfer Learning In Large Language Models Li Yudong, Feng Yuhao, Zhou Wen, Zhao Zhe, Shen Linlin, Hou Cheng, Hou Xianxu
- EAGLE: Speculative Sampling Requires Rethinking Feature Uncertainty Li Yuhui, Wei Fangyun, Zhang Chao, Zhang Hongyang
- Large Language Models For Human-like Autonomous Driving: A Survey Li Yun, Katsumata Kai, Javanmardi Ehsan, Tsukada Manabu
- Llms Meet Long Video: Advancing Long Video Question Answering With An Interactive Visual Adapter In Llms Li Yunxin, Chen Xinyu, Hu Baotain, Zhang Min
- ISQA: Informative Factuality Feedback For Scientific Summarization Li Zekai, Qin Yanxia, Liu Qian, Kan Min-yen
- Do Vision-language Transformers Exhibit Visual Commonsense? An Empirical Study Of VCR Li Zhenyang, Guo Yangyang, Wang Kejie, Chen Xiaolin, Nie Liqiang, Kankanhalli Mohan
- Focusllm: Scaling Llm's Context By Parallel Decoding Li Zhenyu, Zhang Yike, Pan Tengyu, Sun Yutao, Duan Zhichao, Fang Junjie, Han Rong, Wang Zixuan, Wang Jianyong
- Refiner: Restructure Retrieval Content Efficiently To Advance Question-answering Capabilities Li Zhonghao, Hu Xuming, Liu Aiwei, Zheng Kening, Huang Sirui, Xiong Hui
- Synthesize Step-by-step: Tools, Templates And Llms As Data Generators For Reasoning-based Chart VQA Li Zhuowan, Jasani Bhavan, Tang Peng, Ghadar Shabnam
- Retrieval Augmented Generation Or Long-context Llms? A Comprehensive Study And Hybrid Approach Li Zhuowan, Li Cheng, Zhang Mingyang, Mei Qiaozhu, Bendersky Michael
- Intermediate Distillation: Data-efficient Distillation From Black-box Llms For Information Retrieval Li Zizhong, Zhang Haopeng, Zhang Jiawei
- 500xcompressor: Generalized Prompt Compression For Large Language Models Li Zongqian, Su Yixuan, Collier Nigel
- Task Oriented In-domain Data Augmentation Liang Xiao, Hu Xinyu, Zuo Simiao, Gong Yeyun, Lou Qiang, Liu Yi, Huang Shao-lun, Jiao Jian
- Fennec: Fine-grained Language Model Evaluation And Correction Extended Through Branching And Bridging Liang Xiaobo, Zhang Haoke, Hu Helan, Li Juntao, Xu Jun, Zhang Min
- Mitigating Hallucination In Visual-language Models Via Re-balancing Contrastive Decoding Liang Xiaoyu, Yu Jiayuan, Mu Lianrui, Zhuang Jiedong, Hu Jiaqi, Yang Yuchen, Ye Jiangnan, Lu Lu, Chen Jian, Hu Haoji
- Internal Consistency And Self-feedback In Large Language Models: A Survey Liang Xun, Song Shichao, Zheng Zifan, Wang Hanyu, Yu Qingchen, Li Xunkai, Li Rong-hua, Cheng Peng, Wang Zhonghao, Xiong Feiyu, Li Zhiyu
- Controllable Text Generation For Large Language Models: A Survey Liang Xun, Wang Hanyu, Wang Yezhaohui, Song Shichao, Yang Jiawei, Niu Simin, Hu Jie, Liu Dan, Yao Shunyu, Xiong Feiyu, Li Zhiyu
- I-SHEEP: Self-alignment Of LLM From Scratch Through An Iterative Self-enhancement Paradigm Liang Yiming, Zhang Ge, Qu Xingwei, Zheng Tianyu, Guo Jiawei, Du Xinrun, Yang Zhenzhu, Liu Jiaheng, Lin Chenghua, Ma Lei, Huang Wenhao, Zhang Jiajun
- Mathchat: Benchmarking Mathematical Reasoning And Instruction Following In Multi-turn Interactions Liang Zhenwen, Yu Dian, Yu Wenhao, Yao Wenlin, Zhang Zhihan, Zhang Xiangliang, Yu Dong
- IKUN For WMT24 General MT Task: Llms Are Here For Multilingual Machine Translation Liao Baohao, Herold Christian, Khadivi Shahram, Monz Christof
- 3-in-1: 2D Rotary Adaptation For Efficient Finetuning, Efficient Batching And Composability Liao Baohao, Monz Christof
- Extending Token Computation For LLM Reasoning Liao Bingli, Vargas Danilo Vasconcellos
- Imagination Augmented Generation: Learning To Imagine Richer Context For Question Answering Over Large Language Models Liao Huanxuan, He Shizhu, Xu Yao, Zhang Yuanzhe, Liu Kang, Liu Shengping, Zhao Jun
- Enhancing Reinforcement Learning With Label-sensitive Reward For Natural Language Understanding Liao Kuo, Li Shuang, Zhao Meng, Liu Liqun, Xue Mengge, Hu Zhenyu, Han Honglin, Yin Chengguo
- MING-MOE: Enhancing Medical Multi-task Learning In Large Language Models With Sparse Mixture Of Low-rank Adapter Experts Liao Yusheng, Jiang Shuyang, Wang Yu, Wang Yanfeng
- E2LLM: Encoder Elongated Large Language Models For Long-context Understanding And Reasoning Liao Zihan, Wang Jun, Yu Hang, Wei Lingxiao, Li Jianguo, Wang Jun, Zhang Wei
- D2LLM: Decomposed And Distilled Large Language Models For Semantic Search Liao Zihan, Yu Hang, Li Jianguo, Wang Jun, Zhang Wei
- The Power Of Words: Generating Powershell Attacks From Natural Language Liguori Pietro, Marescalco Christian, Natella Roberto, Orbinato Vittorio, Pianese Luciano
- Retrieval-augmented Natural Language Reasoning For Explainable Visual Question Answering Lim Su Hyeon, Kim Minkuk, Kim Hyeon Bae, Kim Seong Tae
- Emojicrypt: Prompt Encryption For Secure Communication With Large Language Models Lin Guo, Hua Wenyue, Zhang Yongfeng
- Syncpkl: Harnessing Llms To Generate Synthetic Data For Commonsense Persona Knowledge Linking Lin Kuan-yen
- Diversedialogue: A Methodology For Designing Chatbots With Human-like Diversity Lin Xiaoyu, Yu Xinkai, Aich Ankit, Giorgi Salvatore, Ungar Lyle
- Evaluating Text-to-visual Generation With Image-to-text Generation Lin Zhiqiu, Pathak Deepak, Li Baiqi, Li Jiayao, Xia Xide, Neubig Graham, Zhang Pengchuan, Ramanan Deva
- Compromising Embodied Agents With Contextual Backdoor Attacks Liu Aishan, Zhou Yuguang, Liu Xianglong, Zhang Tianyuan, Liang Siyuan, Wang Jiakai, Pu Yanjun, Li Tianlin, Zhang Junqi, Zhou Wenbo, Guo Qing, Tao Dacheng
- Cool-fusion: Fuse Large Language Models Without Training Liu Cong, Quan Xiaojun, Pan Yan, Lin Liang, Wu Weigang, Chen Xu
- Lumina-mgpt: Illuminate Flexible Photorealistic Text-to-image Generation With Multimodal Generative Pretraining Liu Dongyang, Zhao Shitian, Zhuo Le, Lin Weifeng, Qiao Yu, Li Hongsheng, Gao Peng
- A Practice-friendly Two-stage Llm-enhanced Paradigm In Sequential Recommendation Liu Dugang, Xian Shenxian, Lin Xiaolin, Zhang Xiaolian, Zhu Hong, Fang Yuan, Chen Zhen, Ming Zhong
- An Incomplete Loop: Instruction Inference, Instruction Following, And In-context Learning In Language Models Liu Emmy, Neubig Graham, Andreas Jacob
- Exploring Vulnerabilities And Protections In Large Language Models: A Survey Liu Frank Weizhen, Hu Chenhui
- Pefomed: Parameter Efficient Fine-tuning Of Multimodal Large Language Models For Medical Imaging Liu Gang, He Jinlong, Li Pengfei, He Genrong, Chen Zhaolin, Zhong Shenjun
- On The Intrinsic Self-correction Capability Of Llms: Uncertainty And Latent Concept Liu Guangliang, Mao Haitao, Cao Bochuan, Xue Zhiyu, Johnson Kristen, Tang Jiliang, Wang Rongrong
- Speculative Decoding Via Early-exiting For Faster LLM Inference With Thompson Sampling Control Mechanism Liu Jiahao, Wang Qifan, Wang Jingang, Cai Xunliang
- Tiny Refinements Elicit Resilience: Toward Efficient Prefix-model Against LLM Red-teaming Liu Jiaxu, Yin Xiangyu, Wu Sihao, Wang Jianhong, Fang Meng, Yi Xinping, Huang Xiaowei
- Mm-instruct: Generated Visual Instructions For Large Multimodal Model Alignment Liu Jihao, Huang Xin, Zheng Jinliang, Liu Boxiao, Wang Jia, Yoshie Osamu, Liu Yu, Li Hongsheng
- Visualwebbench: How Far Have Multimodal Llms Evolved In Web Page Understanding And Grounding? Liu Junpeng, Song Yifan, Lin Bill Yuchen, Lam Wai, Neubig Graham, Li Yuanzhi, Yue Xiang
- Synthetic Context Generation For Question Generation Liu Naiming, Wang Zichao, Baraniuk Richard
- Proofread: Fixes All Errors With One Tap Liu Renjie, Zhang Yanxiang, Zhu Yun, Sun Haicheng, Zhang Yuanbo, Huang Michael Xuelin, Cai Shanqing, Meng Lei, Zhai Shumin
- ST-LLM: Large Language Models Are Effective Temporal Learners Liu Ruyang, Li Chen, Tang Haoran, Ge Yixiao, Shan Ying, Li Ge
- SPINACH: Sparql-based Information Navigation For Challenging Real-world Questions Liu Shicheng, Semnani Sina J., Triedman Harold, Xu Jialiang, Zhao Isaac Dan, Lam Monica S.
- Are Llms Effective Backbones For Fine-tuning? An Experimental Investigation Of Supervised Llms On Chinese Short Text Matching Liu Shulin, Xu Chengcheng, Liu Hao, Yu Tinghao, Yang Tao
- Multi-group Uncertainty Quantification For Long-form Text Generation Liu Terrance, Wu Zhiwei Steven
- Memlong: Memory-augmented Retrieval For Long Text Modeling Liu Weijie, Tang Zecheng, Li Juntao, Chen Kehai, Zhang Min
- Imposter.ai: Adversarial Attacks With Hidden Intentions Towards Aligned Large Language Models Liu Xiao, Li Liangzhi, Xiang Tong, Ye Fuying, Wei Lu, Li Wangyue, Garcia Noa
- Extensive Self-contrast Enables Feedback-free Language Model Alignment Liu Xiao, Song Xixuan, Dong Yuxiao, Tang Jie
- SHIELD: Evaluation And Defense Strategies For Copyright Compliance In LLM Text Generation Liu Xiaoze, Sun Ting, Xu Tianyang, Wu Feijie, Wang Cunxiang, Wang Xiaoqian, Gao Jing
- Customizing Large Language Model Generation Style Using Parameter-efficient Finetuning Liu Xinyue, Diddee Harshita, Ippolito Daphne
- Adaptive Draft-verification For Efficient Large Language Model Decoding Liu Xukun, Lei Bowen, Zhang Ruqi, Xu Dongkuan
- Prompt Framework For Role-playing: Generation And Evaluation Liu Xun, Ni Zhengwei
- Do Large Language Models Possess Sensitive To Sentiment? Liu Yang, Zhu Xichou, Shen Zhou, Liu Yi, Li Min, Chen Yujun, John Benzi, Ma Zhenzhen, Hu Tao, Xu Zhiyang, Luo Wei, Wang Junhui
- Era-cot: Improving Chain-of-thought Through Entity Relationship Analysis Liu Yanming, Peng Xinyue, Du Tianyu, Yin Jianwei, Liu Weihao, Zhang Xuhong
- Arondight: Red Teaming Large Vision Language Models With Auto-generated Multi-modal Jailbreak Prompts Liu Yi, Cai Chengjun, Zhang Xiaoli, Yuan Xingliang, Wang Cong
- Accelerating Inference In Large Language Models With A Unified Layer Skipping Strategy Liu Yijin, Meng Fandong, Zhou Jie
- Let's Learn Step By Step: Enhancing In-context Learning Ability With Curriculum Learning Liu Yinpeng, Liu Jiawei, Shi Xiang, Cheng Qikai, Huang Yong, Lu Wei
- Chatzero: Zero-shot Cross-lingual Dialogue Generation Via Pseudo-target Language Liu Yongkang, Shi Feng, Wang Daling, Zhang Yifei, Schütze Hinrich
- Synthvlm: High-efficiency And High-quality Synthetic Data For Vision Language Models Liu Zheng, Liang Hao, Huang Xijie, Xiong Wentao, Yu Qinhan, Sun Linzhuang, Chen Chong, He Conghui, Cui Bin, Zhang Wentao
- Agentlite: A Lightweight Library For Building And Advancing Task-oriented LLM Agent System Liu Zhiwei, Yao Weiran, Zhang Jianguo, Yang Liangwei, Liu Zuxin, Tan Juntao, Choubey Prafulla K., Lan Tian, Wu Jason, Wang Huan, Heinecke Shelby, Xiong Caiming, Savarese Silvio
- Chatqa: Surpassing GPT-4 On Conversational QA And RAG Liu Zihan, Ping Wei, Roy Rajarshi, Xu Peng, Lee Chankyu, Shoeybi Mohammad, Catanzaro Bryan
- Enabling Weak Llms To Judge Response Reliability Via Meta Ranking Liu Zijun, Kou Boqun, Li Peng, Yan Ming, Zhang Ji, Huang Fei, Liu Yang
- Chain-of-spot: Interactive Reasoning Improves Large Vision-language Models Liu Zuyan, Dong Yuhao, Rao Yongming, Zhou Jie, Lu Jiwen
- Towards Aligning Language Models With Textual Feedback Lloret Saüc Abadal, Dhuliawala Shehzaad, Murugesan Keerthiram, Sachan Mrinmaya
- Generative Multi-modal Knowledge Retrieval With Large Language Models Long Xinwei, Zeng Jiali, Meng Fandong, Ma Zhiyuan, Zhang Kaiyan, Zhou Bowen, Zhou Jie
- High-quality Data-to-text Generation For Severely Under-resourced Languages With Out-of-the-box Large Language Models Lorandi Michela, Belz Anya
- Sparser Is Faster And Less Is More: Efficient Sparse Attention For Long-range Transformers Lou Chao, Jia Zixia, Zheng Zilong, Tu Kewei
- Answering Real-world Clinical Questions Using Large Language Model Based Systems Low Yen Sia, Jackson Michael L., Hyde Rebecca J., Brown Robert E., Sanghavi Neil M., Baldwin Julian D., Pike C. William, Muralidharan Jananee, Hui Gavin, Alexander Natasha, Hassan Hadeel, Nene Rahul V., Pike Morgan, Pokrzywa Courtney J., Vedak Shivam, Yan Adam Paul, Yao Dong-han, Zipursky Amy R., Dinh Christina, Ballentine Philip, Derieg Dan C., Polony Vladimir, Chawdry Rehan N., Davies Jordan, Hyde Brigham B., Shah Nigam H., Gombar Saurabh
- Efficient Encoder-decoder Transformer Decoding For Decomposable Tasks Lu Bo-ru, Haduong Nikita, Lin Chien-yu, Cheng Hao, Smith Noah A., Ostendorf Mari
- From GPT-4 To Gemini And Beyond: Assessing The Landscape Of Mllms On Generalizability, Trustworthiness And Causality Through Four Modalities Lu Chaochao, Qian Chen, Zheng Guodong, Fan Hongxing, Gao Hongzhi, Zhang Jie, Shao Jing, Deng Jingyi, Fu Jinlan, Huang Kexin, Li Kunchang, Li Lijun, Wang Limin, Sheng Lu, Chen Meiqi, Zhang Ming, Ren Qibing, Chen Sirui, Gui Tao, Ouyang Wanli, Wang Yali, Teng Yan, Wang Yaru, Wang Yi, He Yinan, Wang Yingchun, Wang Yixu, Zhang Yongting, Qiao Yu, Shen Yujiong, Mou Yurong, Chen Yuxi, Zhang Zaibin, Shi Zhelun, Yin Zhenfei, Wang Zhipin
- Deepseek-vl: Towards Real-world Vision-language Understanding Lu Haoyu, Liu Wen, Zhang Bo, Wang Bingxuan, Dong Kai, Liu Bo, Sun Jingxiang, Ren Tongzheng, Li Zhuoshu, Yang Hao, Sun Yaofeng, Deng Chengqi, Xu Hanwei, Xie Zhenda, Ruan Chong
- Revisiting Multi-modal LLM Evaluation Lu Jian, Srivastava Shikhar, Chen Junyu, Shrestha Robik, Acharya Manoj, Kafle Kushal, Kanan Christopher
- A Bounding Box Is Worth One Token: Interleaving Layout And Text In A Large Language Model For Document Understanding Lu Jinghui, Yu Haiyang, Wang Yanjie, Ye Yongjie, Tang Jingqun, Yang Ziwei, Wu Binghong, Liu Qi, Feng Hao, Wang Han, Liu Hao, Huang Can
- Datasculpt: Crafting Data Landscapes For LLM Post-training Through Multi-objective Partitioning Lu Keer, Liang Zheng, Nie Xiaonan, Pan Da, Zhang Shusen, Zhao Keshi, Chen Weipeng, Zhou Zenan, Dong Guosheng, Zhang Wentao, Cui Bin
- Insights Into LLM Long-context Failures: When Transformers Know But Don't Tell Lu Taiming, Gao Muhan, Yu Kuai, Byerly Adam, Khashabi Daniel
- Fine-tuning Large Language Models For Domain Adaptation: Exploration Of Training Strategies, Scaling, Model Merging And Synergistic Capabilities Lu Wei, Luu Rachel K., Buehler Markus J.
- From Understanding To Utilization: A Survey On Explainability For Large Language Models Luo Haoyan, Specia Lucia
- Hallucination Detection And Hallucination Mitigation: An Investigation Luo Junliang, Li Tianyu, Wu Di, Jenkin Michael, Liu Steve, Dudek Gregory
- BGE Landmark Embedding: A Chunking-free Embedding Method For Retrieval Augmented Long-context Large Language Models Luo Kun, Liu Zheng, Xiao Shitao, Liu Kang
- Duetsim: Building User Simulator With Dual Large Language Models For Task-oriented Dialogues Luo Xiang, Tang Zhiwen, Wang Jin, Zhang Xuejie
- Zero-shot Cross-domain Dialogue State Tracking Via Dual Low-rank Adaptation Luo Xiang, Tang Zhiwen, Wang Jin, Zhang Xuejie
- Knowla: Enhancing Parameter-efficient Finetuning With Knowledgeable Adaptation Luo Xindi, Sun Zequn, Zhao Jing, Zhao Zhe, Hu Wei
- Source2synth: Synthetic Data Generation And Curation Grounded In Real Data Sources Lupidi Alisia, Gemmell Carlos, Cancedda Nicola, Dwivedi-yu Jane, Weston Jason, Foerster Jakob, Raileanu Roberta, Lomeli Maria
- GPT-4 Generated Narratives Of Life Events Using A Structured Narrative Prompt: A Validation Study Lynch Christopher J., Jensen Erik, Munro Madison H., Zamponi Virginia, Martinez Joseph, O'brien Kevin, Feldhaus Brandon, Smith Katherine, Reinhold Ann Marie, Gore Ross
- Beyond Probabilities: Unveiling The Misalignment In Evaluating Large Language Models Lyu Chenyang, Wu Minghao, Aji Alham Fikri
- Automatic Summarization Of Doctor-patient Encounter Dialogues Using Large Language Model Through Prompt Tuning Lyu Mengxian, Peng Cheng, Li Xiaohan, Balian Patrick, Bian Jiang, Wu Yonghui
- Dempt: Decoding-enhanced Multi-phase Prompt Tuning For Making Llms Be Better Context-aware Translators Lyu Xinglin, Li Junhui, Zhao Yanqing, Zhang Min, Wei Daimeng, Tao Shimin, Yang Hao, Zhang Min
- Knowtuning: Knowledge-aware Fine-tuning For Large Language Models Lyu Yougang, Yan Lingyong, Wang Shuaiqiang, Shi Haibo, Yin Dawei, Ren Pengjie, Chen Zhumin, De Rijke Maarten, Ren Zhaochun
- Enhancing Programming Education With Chatgpt: A Case Study On Student Perceptions And Interactions In A Python Course Ma Boxaun, Chen Li, Konomi Shin'ichi
- Agentboard: An Analytical Evaluation Board Of Multi-turn LLM Agents Ma Chang, Zhang Junlei, Zhu Zhihao, Yang Cheng, Yang Yujiu, Jin Yaohui, Lan Zhenzhong, Kong Lingpeng, He Junxian
- When Emotional Stimuli Meet Prompt Designing: An Auto-prompt Graphical Paradigm Ma Chenggian, Zhao Xiangyu, Zhang Chunhui, Qin Yanzhao, Zhang Wentao
- Debate On Graph: A Flexible And Reliable Reasoning Framework For Large Language Models Ma Jie, Gao Zhitao, Chai Qi, Sun Wangchun, Wang Pinghui, Pei Hongbin, Tao Jing, Song Lingyun, Liu Jun, Zhang Chen, Cui Lizhen
- Think-on-graph 2.0: Deep And Interpretable Large Language Model Reasoning With Knowledge Graph-guided Retrieval Ma Shengjie, Xu Chengjin, Jiang Xuhui, Li Muzhi, Qu Huaren, Guo Jian
- Is It Possible To Edit Large Language Models Robustly? Ma Xinbei, Ju Tianjie, Qiu Jiyang, Zhang Zhuosheng, Zhao Hai, Liu Lifeng, Wang Yulong
- Compositional API Recommendation For Library-oriented Code Generation Ma Zexiong, An Shengnan, Xie Bing, Lin Zeqi
- Llamoco: Instruction Tuning Of Large Language Models For Optimization Code Generation Ma Zeyuan, Guo Hongshu, Chen Jiacheng, Peng Guojun, Cao Zhiguang, Ma Yining, Gong Yue-jiao
- Gerea: Question-aware Prompt Captions For Knowledge-based Visual Question Answering Ma Ziyu, Li Shutao, Sun Bin, Cai Jianfei, Long Zuxiang, Ma Fuyan
- Exploring The Impact Of The Output Format On The Evaluation Of Large Language Models For Code Translation Macedo Marcos, Tian Yuan, Cogo Filipe R., Adams Bram
- LLM In-context Recall Is Prompt Dependent Machlab Daniel, Battle Rick
- Testing The Effect Of Code Documentation On Large Language Model Code Understanding Macke William, Doyle Michael
- Hallucination-free? Assessing The Reliability Of Leading AI Legal Research Tools Magesh Varun, Surani Faiz, Dahl Matthew, Suzgun Mirac, Manning Christopher D., Ho Daniel E.
- Impact Of Model Size On Fine-tuned LLM Performance In Data-to-text Generation: A State-of-the-art Investigation Mahapatra Joy, Garain Utpal
- Evaluating Very Long-term Conversational Memory Of LLM Agents Maharana Adyasha, Lee Dong-ho, Tulyakov Sergey, Bansal Mohit, Barbieri Francesco, Fang Yuwei
- Venn Diagram Prompting: Accelerating Comprehension With Scaffolding Effect Mahendru Sakshi, Pandit Tejul
- Too Late To Train, Too Early To Use? A Study On Necessity And Viability Of Low-resource Bengali Llms Mahfuz Tamzeed, Dey Satak Kumar, Naswan Ruwad, Adil Hasnaen, Sayeed Khondker Salman, Shahgir Haz Sameen
- Exploring The Capabilities Of Prompted Large Language Models In Educational And Assessment Applications Maity Subhankar, Deroy Aniket, Sarkar Sudeshna
- FRACTAL: Fine-grained Scoring From Aggregate Text Labels Makhija Yukti, Agrawal Priyanka, Saket Rishi, Raghuveer Aravindan
- Making Task-oriented Dialogue Datasets More Natural By Synthetically Generating Indirect User Requests Mannekote Amogh, Nam Jinseok, Li Ziming, Gao Jian, Boyer Kristy Elizabeth, Dorr Bonnie J.
- Don't Forget Your Reward Values: Language Model Alignment Via Value-based Calibration Mao Xin, Li Feng-lin, Xu Huimin, Zhang Wei, Luu Anh Tuan
- Visually Guided Generative Text-layout Pre-training For Document Intelligence Mao Zhiming, Bai Haoli, Hou Lu, Wei Jiansheng, Jiang Xin, Liu Qun, Wong Kam-fai
- Tuning Llms With Contrastive Alignment Instructions For Machine Translation In Unseen, Low-resource Languages Mao Zhuoyuan, Yu Yen
- Understanding And Mitigating Language Confusion In Llms Marchisio Kelly, Ko Wei-yin, Bérard Alexandre, Dehaze Théo, Ruder Sebastian
- The Impact Of Hyperparameters On Large Language Model Inference Performance: An Evaluation Of Vllm And Huggingface Pipelines Martinez Matias
- German Also Hallucinates! Inconsistency Detection In News Summaries With The Absinth Dataset Mascarell Laura, Chalumattu Ribin, Rios Annette
- Test-driven Development For Code Generation Mathews Noble Saji, Nagappan Meiyappan
- XAI For All: Can Large Language Models Simplify Explainable AI? Mavrepis Philip, Makridis Georgios, Fatouros Georgios, Koukos Vasileios, Separdani Maria Margarita, Kyriazis Dimosthenis
- Can Opensource Beat Chatgpt? -- A Comparative Study Of Large Language Models For Text-to-code Generation Mayer Luis, Heumann Christian, Aßenmacher Matthias
- Set-based Prompting: Provably Solving The Language Model Order Dependency Problem Mcilroy-young Reid, Brown Katrina, Olson Conlan, Zhang Linjun, Dwork Cynthia
- Injecting New Knowledge Into Large Language Models Via Supervised Fine-tuning Mecklenburg Nick, Lin Yiyou, Li Xiaoxiao, Holstein Daniel, Nunes Leonardo, Malvar Sara, Silva Bruno, Chandra Ranveer, Aski Vijay, Yannam Pavan Kumar Reddy, Aktas Tolga, Hendry Todd
- Large Language Models For Relevance Judgment In Product Search Mehrdad Navid, Mohapatra Hrushikesh, Bagdouri Mossaab, Chandran Prijith, Magnani Alessandro, Cai Xunfan, Puthenputhussery Ajit, Yadav Sachin, Lee Tony, Zhai Chengxiang, Liao Ciya
- Realhf: Optimized RLHF Training For Large Language Models Through Parameter Reallocation Mei Zhiyu, Fu Wei, Li Kaiwei, Wang Guangju, Zhang Huanchen, Wu Yi
- Phybench: A Physical Commonsense Benchmark For Evaluating Text-to-image Models Meng Fanqing, Shao Wenqi, Luo Lixin, Wang Yahong, Chen Yiran, Lu Quanfeng, Yang Yue, Yang Tianshuo, Zhang Kaipeng, Qiao Yu, Luo Ping
- DCR: Divide-and-conquer Reasoning For Multi-choice Question Answering With Llms Meng Zijie, Zhang Yan, Feng Zhaopeng, Liu Zuozhu
- Disce Aut Deficere: Evaluating Llms Proficiency On The INVALSI Italian Benchmark Mercorio Fabio, Mezzanzanica Mario, Potertì Daniele, Serino Antonio, Seveso Andrea
- Transforming Wearable Data Into Health Insights Using Large Language Model Agents Merrill Mike A., Paruchuri Akshay, Rezaei Naghmeh, Kovacs Geza, Perez Javier, Liu Yun, Schenck Erik, Hammerquist Nova, Sunshine Jake, Tailor Shyam, Ayush Kumar, Su Hao-wei, He Qian, Mclean Cory Y., Malhotra Mark, Patel Shwetak, Zhan Jiening, Althoff Tim, Mcduff Daniel, Liu Xin
- Superposition Prompting: Improving And Accelerating Retrieval-augmented Generation Merth Thomas, Fu Qichen, Rastegari Mohammad, Najibi Mahyar
- Low-resource Machine Translation Through Retrieval-augmented LLM Prompting: A Study On The Mambai Language Merx Raphaël, Mahmudi Aso, Langford Katrina, De Araujo Leo Alberto, Vylomova Ekaterina
- Permitqa: A Benchmark For Retrieval Augmented Generation In Wind Siting And Permitting Domain Meyur Rounak, Phan Hung, Wagle Sridevi, Strube Jan, Halappanavar Mahantesh, Horawalavithana Sameera, Acharya Anurag, Munikoti Sai
- User Centric Evaluation Of Code Generation Tools Miah Tanha, Zhu Hong
- Exploration Of Masked And Causal Language Modelling For Text Generation Micheletti Nicolo, Belkadi Samuel, Han Lifeng, Nenadic Goran
- I Have An Attention Bridge To Sell You: Generalization Capabilities Of Modular Translation Architectures Mickus Timothee, Vázquez Raúl, Attieh Joseph
- Natural Language As Policies: Reasoning For Coordinate-level Embodied Control With Llms Mikami Yusuke, Melnik Andrew, Miura Jun, Hautamäki Ville
- PSLM: Parallel Generation Of Text And Speech With Llms For Low-latency Spoken Dialogue Systems Mitsui Kentaro, Mitsuda Koh, Wakatsuki Toshiaki, Hono Yukiya, Sawada Kei
- CBF-LLM: Safe Control For LLM Alignment Miyaoka Yuya, Inoue Masaki
- Convsdg: Session Data Generation For Conversational Search Mo Fengran, Yi Bole, Mao Kelong, Qu Chen, Huang Kaiyu, Nie Jian-yun
- Fight Back Against Jailbreaking Via Prompt Adversarial Tuning Mo Yichuan, Wang Yuji, Wei Zeming, Wang Yisen
- Large Language Model (LLM) AI Text Generation Detection Based On Transformer Deep Learning Algorithm Mo Yuhong, Qin Hao, Dong Yushan, Zhu Ziyi, Li Zhenglin
- Evaluating Large Language Models With Human Feedback: Establishing A Swedish Benchmark Moell Birger
- Machine Translation Meta Evaluation Through Translation Accuracy Challenge Sets Moghe Nikita, Fazla Arnisa, Amrhein Chantal, Kocmi Tom, Steedman Mark, Birch Alexandra, Sennrich Rico, Guillou Liane
- Can We Trust Large Language Models Generated Code? A Framework For In-context Learning, Security Patterns, And Code Evaluations Across Diverse Llms Mohsin Ahmad, Janicke Helge, Wood Adrian, Sarker Iqbal H., Maglaras Leandros, Janjua Naeem
- Comparison Of Large Language Models For Generating Contextually Relevant Questions Molina Ivo Lodovico, Švábenský Valdemar, Minematsu Tsubasa, Chen Li, Okubo Fumiya, Shimada Atsushi
- Kam-cot: Knowledge Augmented Multimodal Chain-of-thoughts Reasoning Mondal Debjyoti, Modi Suraj, Panda Subhadarshi, Singh Rituraj, Rao Godawari Sudhakar
- Synthetic Programming Elicitation And Repair For Text-to-code In Very Low-resource Programming Languages Mora Federico, Wong Justin, Lepe Haley, Bhatia Sahil, Elmaaroufi Karim, Varghese George, Gonzalez Joseph E., Polgreen Elizabeth, Seshia Sanjit A.
- Exploring The Landscape Of Large Language Models: Foundations, Techniques, And Challenges Moradi Milad, Yan Ke, Colwell David, Samwald Matthias, Asgari Rhona
- Langbite: A Platform For Testing Bias In Large Language Models Morales Sergio, Clarisó Robert, Cabot Jordi
- Language Modelling Approaches To Adaptive Machine Translation Moslem Yasmin
- Lhrs-bot: Empowering Remote Sensing With Vgi-enhanced Large Multimodal Language Model Muhtar Dilxat, Li Zhenshi, Gu Feng, Zhang Xueliang, Xiao Pengfeng
- Unraveling The Truth: Do Llms Really Understand Charts? A Deep Dive Into Consistency And Robustness Mukhopadhyay Srija, Qidwai Adnan, Garimella Aparna, Ramu Pritika, Gupta Vivek, Roth Dan
- Towards Robots That Know When They Need Help: Affordance-based Uncertainty For Large Language Model Planners Mullen James F. Jr., Manocha Dinesh
- Grouse: A Benchmark To Evaluate Evaluators In Grounded Question Answering Muller Sacha, Loison António, Omrani Bilel, Viaud Gautier
- Leave No Context Behind: Efficient Infinite Context Transformers With Infini-attention Munkhdalai Tsendsuren, Faruqui Manaal, Gopal Siddharth
- A Multi-expert Large Language Model Architecture For Verilog Code Generation Nadimi Bardia, Zheng Hao
- Zero-shot Visual Reasoning By Vision-language Models: Benchmarking And Analysis Nagar Aishik, Jaiswal Shantanu, Tan Cheston
- Aurora-m: The First Open Source Multilingual Language Model Red-teamed According To The U.S. Executive Order Nakamura Taishi, Mishra Mayank, Tedeschi Simone, Chai Yekun, Stillerman Jason T, Friedrich Felix, Yadav Prateek, Laud Tanmay, Chien Vu Minh, Zhuo Terry Yue, Misra Diganta, Bogin Ben, Vu Xuan-son, Karpinska Marzena, Dantuluri Arnav Varma, Kusa Wojciech, Furlanello Tommaso, Yokota Rio, Muennighoff Niklas, Pai Suhas, Adewumi Tosin, Laippala Veronika, Yao Xiaozhe, Junior Adalberto, Ariyak Alpay, Drozd Aleksandr, Clive Jordan, Gupta Kshitij, Chen Liangyu, Sun Qi, Tsui Ken, Persaud Noah, Fahmy Nour, Chen Tianlong, Bansal Mohit, Monti Nicolo, Dang Tai, Luo Ziyang, Bui Tien-tung, Navigli Roberto, Mehta Virendra, Blumberg Matthew, May Victor, Nguyen Huu, Pyysalo Sampo
- Gemquad: Generating Multilingual Question Answering Datasets From Large Language Models Using Few Shot Learning Namboori Amani, Mangale Shivam, Rosenbaum Andy, Soltan Saleh
- Using Llms To Model The Beliefs And Preferences Of Targeted Populations Namikoshi Keiichi, Filipowicz Alex, Shamma David A., Iliev Rumen, Hogan Candice L., Arechiga Nikos
- Beyond The Hype: A Dispassionate Look At Vision-language Models In Medical Scenario Nan Yang, Zhou Huichi, Xing Xiaodan, Yang Guang
- What If Red Can Talk? Dynamic Dialogue Generation Using Large Language Models Nananukul Navapat, Wongkamjan Wichayaporn
- Segment-based Interactive Machine Translation For Pre-trained Models Navarro Angel, Casacuberta Francisco
- Medinsight: A Multi-source Context Augmentation Framework For Generating Patient-centric Medical Responses Using Large Language Models Neupane Subash, Mitra Shaswata, Mittal Sudip, Golilarz Noorbakhsh Amiri, Rahimi Shahram, Amirlatifi Amin
- Balancing Exploration And Exploitation In LLM Using Soft RLLF For Enhanced Negation Understanding Nguyen Ha-thanh, Satoh Ken
- Direct Evaluation Of Chain-of-thought In Multi-hop Reasoning With Knowledge Graphs Nguyen Minh-vuong, Luo Linhao, Shiri Fatemeh, Phung Dinh, Li Yuan-fang, Vu Thuy-trang, Haffari Gholamreza
- Advancing Vietnamese Visual Question Answering With Transformer And Convolutional Integration Nguyen Ngoc Son, Nguyen Van Son, Le Tung
- An Empirical Study On Capability Of Large Language Models In Understanding Code Semantics Nguyen Thu-trang, Vu Thanh Trong, Vo Hieu Dinh, Nguyen Son
- Cultural Commonsense Knowledge For Intercultural Dialogues Nguyen Tuan-phong, Razniewski Simon, Weikum Gerhard
- Ceval: A Benchmark For Evaluating Counterfactual Text Generation Nguyen Van Bach, Schlötterer Jörg, Seifert Christin
- Aligners: Decoupling Llms And Alignment Ngweta Lilian, Agarwal Mayank, Maity Subha, Gittens Alex, Sun Yuekai, Yurochkin Mikhail
- Layer-wise Regularized Dropout For Neural Language Models Ni Shiwen, Yang Min, Xu Ruifeng, Li Chengming, Hu Xiping
- A Systematic Evaluation Of Large Language Models For Natural Language Generation Tasks Ni Xuanfan, Li Piji
- W-RAG: Weakly Supervised Dense Retrieval In RAG For Open-domain Question Answering Nian Jinming, Peng Zhiyuan, Wang Qifan, Fang Yi
- Vikhr: The Family Of Open-source Instruction-tuned Large Language Models For Russian Nikolich Aleksandr, Korolev Konstantin, Shelmanov Artem, Kiselev Igor
- How Reliable AI Chatbots Are For Disease Prediction From Patient Complaints? Nipu Ayesha Siddika, Islam K M Sajjadul, Madiraju Praveen
- Enhancing Dialogue State Tracking Models Through Llm-backed User-agents Simulation Niu Cheng, Wang Xingguang, Cheng Xuxin, Song Juntong, Zhang Tong
- Mitigating Hallucinations In Large Language Models Via Self-refinement-enhanced Knowledge Retrieval Niu Mengjia, Li Hao, Shi Jie, Haddadi Hamed, Mo Fan
- Large Language Models And Cognitive Science: A Comprehensive Review Of Similarities, Differences, And Challenges Niu Qian, Liu Junyu, Bi Ziqian, Feng Pohsun, Peng Benji, Chen Keyu, Li Ming
- Ambignlg: Addressing Task Ambiguity In Instruction For NLG Niwa Ayana, Iso Hayate
- Language Portability Strategies For Open-domain Dialogue With Pre-trained Language Models From High To Low Resource Languages Njifenjou Ahmed, Sucal Virgile, Jabaian Bassam, Lefèvre Fabrice
- Mobileflow: A Multimodal LLM For Mobile GUI Agent Nong Songqin, Zhu Jiali, Wu Rui, Jin Jiongchao, Shan Shuo, Huang Xiutian, Xu Wenhao
- The Conversation Is The Command: Interacting With Real-world Autonomous Robot Through Natural Language Nwankwo Linus, Rueckert Elmar
- Beyond Metrics: Evaluating Llms' Effectiveness In Culturally Nuanced, Low-resource Real-world Scenarios Ochieng Millicent, Gumma Varun, Sitaram Sunayana, Wang Jindong, Chaudhary Vishrav, Ronen Keshet, Bali Kalika, O'neill Jacki
- Robust Adaptation Of Foundation Models With Black-box Visual Prompting Oh Changdae, Seo Gyeongdeok, Jung Geunyoung, Cheng Zhi-qi, Choi Hosik, Jung Jiyoung, Song Kyungwoo
- Jmultiwoz: A Large-scale Japanese Multi-domain Task-oriented Dialogue Dataset Ohashi Atsumoto, Hirai Ryu, Iizuka Shinya, Higashinaka Ryuichiro
- Jdocqa: Japanese Document Question Answering Dataset For Generative Language Models Onami Eri, Kurita Shuhei, Miyanishi Taiki, Watanabe Taro
- Large Language Models Are Self-taught Reasoners: Enhancing LLM Applications Via Tailored Problem-solving Demonstrations Ong Kai Tzu-iunn, Kwon Taeyoon, Yeo Jinyoung
- Simplifying Translations For Children: Iterative Simplification Considering Age Of Acquisition With Llms Oshika Masashi, Morishita Makoto, Hirao Tsutomu, Sasano Ryohei, Takeda Koichi
- A Systematic Review Of Data-to-text NLG Osuji Chinonso Cynthia, Ferreira Thiago Castro, Davis Brian
- QLSC: A Query Latent Semantic Calibrator For Robust Extractive Question Answering Ouyang Sheng, Wang Jianzong, Zhang Yong, Li Zhitao, Liang Ziqi, Zhang Xulong, Cheng Ning, Xiao Jing
- Towards Cross-lingual Explanation Of Artwork In Large-scale Vision Language Models Ozaki Shintaro, Hayashi Kazuki, Sakai Yusuke, Kamigaito Hidetaka, Hayashi Katsuhiko, Watanabe Taro
- Gemini Goes To Med School: Exploring The Capabilities Of Multimodal Large Language Models On Medical Challenge Problems & Hallucinations Pal Ankit, Sankarasubbu Malaikannan
- Investigating LLM Applications In E-commerce Palen-michel Chester, Wang Ruixiang, Zhang Yipeng, Yu David, Xu Canran, Wu Zhe
- Distilling Large Language Models For Text-attributed Graph Learning Pan Bo, Zhang Zheng, Zhang Yifei, Hu Yuntong, Zhao Liang
- Not All Contexts Are Equal: Teaching Llms Credibility-aware Generation Pan Ruotong, Cao Boxi, Lin Hongyu, Han Xianpei, Zheng Jia, Wang Sirui, Cai Xunliang, Sun Le
- POMP: Probability-driven Meta-graph Prompter For Llms In Low-resource Unsupervised Neural Machine Translation Pan Shilong, Tian Zhiliang, Ding Liang, Huang Zhen, Wen Zhihua, Li Dongsheng
- G-DIG: Towards Gradient-based Diverse And High-quality Instruction Data Selection For Machine Translation Pan Xingyuan, Huang Luyang, Kang Liyan, Liu Zhicheng, Lu Yu, Cheng Shanbo
- Enhancing Repository-level Code Generation With Integrated Contextual Information Pan Zhiyuan, Hu Xing, Xia Xin, Yang Xiaohu
- Can Watermarking Large Language Models Prevent Copyrighted Text Generation And Hide Training Data? Panaitescu-liess Michael-andrei, Che Zora, An Bang, Xu Yuancheng, Pathmanathan Pankayaraj, Chakraborty Souradip, Zhu Sicheng, Goldstein Tom, Huang Furong
- Lottery Ticket Adaptation: Mitigating Destructive Interference In Llms Panda Ashwinee, Isik Berivan, Qi Xiangyu, Koyejo Sanmi, Weissman Tsachy, Mittal Prateek
- Salute The Classic: Revisiting Challenges Of Machine Translation In The Age Of Large Language Models Pang Jianhui, Ye Fanghua, Wang Longyue, Yu Dian, Wong Derek F., Shi Shuming, Tu Zhaopeng
- Anchor-based Large Language Models Pang Jianhui, Ye Fanghua, Wong Derek Fai, He Xin, Chen Wanshun, Wang Longyue
- Shaking Up Vlms: Comparing Transformers And Structured State Space Models For Vision & Language Modeling Pantazopoulos Georgios, Nikandrou Malvina, Suglia Alessandro, Lemon Oliver, Eshghi Arash
- Lost In Space: Probing Fine-grained Spatial Understanding In Vision And Language Resamplers Pantazopoulos Georgios, Suglia Alessandro, Lemon Oliver, Eshghi Arash
- Measuring Memorization In RLHF For Code Completion Pappu Aneesh, Porter Billy, Shumailov Ilia, Hayes Jamie
- Building Pre-train LLM Dataset For The INDIC Languages: A Case Study On Hindi Parida Shantipriya, Panwar Shakshi, Lata Kusum, Mishra Sanskruti, Sekhar Sambit
- Low-resource Cross-lingual Summarization Through Few-shot Learning With Large Language Models Park Gyutae, Hwang Seojin, Lee Hwanhee
- Bridging Vision And Language Spaces With Assignment Prediction Park Jungin, Lee Jiyoung, Sohn Kwanghoon
- Token-picker: Accelerating Attention In Text Generation With Minimized Memory Transfer Via Probability Estimation Park Junyoung, Kang Myeonggu, Han Yunki, Kim Yanggon, Shin Jaekang, Kim Lee-sup
- Disentangling Length From Quality In Direct Preference Optimization Park Ryan, Rafailov Rafael, Ermon Stefano, Finn Chelsea
- Any-precision LLM: Low-cost Deployment Of Multiple, Different-sized Llms Park Yeonhong, Hyun Jake, Cho Sanglyul, Sim Bonggeun, Lee Jae W.
- The Ultimate Guide To Fine-tuning Llms From Basics To Breakthroughs: An Exhaustive Review Of Technologies, Research, Best Practices, Applied Research Challenges And Opportunities Parthasarathy Venkatesh Balavadhani, Zafar Ahtsham, Khan Aafaq, Shahid Arsalan
- Multi-logieval: Towards Evaluating Multi-step Logical Reasoning Ability Of Large Language Models Patel Nisarg, Kulkarni Mohith, Parmar Mihir, Budhiraja Aashna, Nakamura Mutsumi, Varshney Neeraj, Baral Chitta
- Goex: Perspectives And Designs Towards A Runtime For Autonomous LLM Applications Patil Shishir G., Zhang Tianjun, Fang Vivian, C. Noppapon, Huang Roy, Hao Aaron, Casado Martin, Gonzalez Joseph E., Popa Raluca Ada, Stoica Ion
- Aptly: Making Mobile Apps From Natural Language Patton Evan W., Kim David Y. J., Granquist Ashley, Liu Robin, Scott Arianna, Zamanova Jennet, Abelson Harold
- The What, Why, And How Of Context Length Extension Techniques In Large Language Models -- A Detailed Survey Pawar Saurav, Tonmoy S. M Towhidul Islam, Zaman S M Mehedi, Jain Vinija, Chadha Aman, Das Amitava
- Autonomous Workflow For Multimodal Fine-grained Training Assistants Towards Mixed Reality Pei Jiahuan, Viola Irene, Huang Haochen, Wang Junxiao, Ahsan Moonisa, Ye Fanghua, Yiming Jiang, Sai Yao, Wang Di, Chen Zhumin, Ren Pengjie, Cesar Pablo
- Automated Multi-language To English Machine Translation Using Generative Pre-trained Transformers Pelofske Elijah, Urias Vincent, Liebrock Lorie M.
- Answer Is All You Need: Instruction-following Text Embedding Via Answering The Question Peng Letian, Zhang Yuwei, Wang Zilong, Srinivasa Jayanth, Liu Gaowen, Wang Zihan, Shang Jingbo
- Humaneval-xl: A Multilingual Code Generation Benchmark For Cross-lingual Natural Language Generalization Peng Qiwei, Chai Yekun, Li Xuhong
- Learnable In-context Vector For Visual Question Answering Peng Yingzhe, Hao Chenduo, Yang Xu, Peng Jiawei, Hu Xinting, Geng Xin
- Did Translation Models Get More Robust Without Anyone Even Noticing? Peters Ben, Martins André F. T.
- Optimizing Large Language Models For Openapi Code Completion Petryshyn Bohdan, Lukoševičius Mantas
- Suri: Multi-constraint Instruction Following For Long-form Text Generation Pham Chau Minh, Sun Simeng, Iyyer Mohit
- Towards Reliable Medical Question Answering: Techniques And Challenges In Mitigating Hallucinations In Language Models Pham Duy Khoa, Vo Bao Quoc
- Graph Guided Question Answer Generation For Procedural Question-answering Pham Hai X., Hadji Isma, Xu Xinnuo, Degutyte Ziedune, Rainey Jay, Kazakos Evangelos, Fazly Afsaneh, Tzimiropoulos Georgios, Martinez Brais
- Can Base Chatgpt Be Used For Forecasting Without Additional Optimization? Pham Van, Cunningham Scott
- Peering Into The Mind Of Language Models: An Approach For Attribution In Contextual Question Answering Phukan Anirudh, Somasundaram Shwetha, Saxena Apoorv, Goswami Koustava, Srinivasan Balaji Vasan
- Better RAG Using Relevant Information Gain Pickett Marc, Hartman Jeremy, Bhowmick Ayan Kumar, Alam Raquib-ul, Vempaty Aditya
- Enhancing Gender-inclusive Machine Translation With Neomorphemes And Large Language Models Piergentili Andrea, Savoldi Beatrice, Negri Matteo, Bentivogli Luisa
- Bimedix: Bilingual Medical Mixture Of Experts LLM Pieri Sara, Mullappilly Sahal Shaji, Khan Fahad Shahbaz, Anwer Rao Muhammad, Khan Salman, Baldwin Timothy, Cholakkal Hisham
- Doccgen: Document-based Controlled Code Generation Pimparkhede Sameer, Kammakomati Mehant, Tamilselvam Srikanth, Kumar Prince, Kumar Ashok Pon, Bhattacharyya Pushpak
- How Effective Are State Space Models For Machine Translation? Pitorro Hugo, Vasylenko Pavlo, Treviso Marcos, Martins André F. T.
- Reasoning With Large Language Models, A Survey Plaat Aske, Wong Annie, Verberne Suzan, Broekens Joost, Van Stein Niki, Back Thomas
- Dissecting Language Models: Machine Unlearning Via Selective Pruning Pochinkov Nicholas, Schoots Nandi
- V-RECS, A Low-cost LLM4VIS Recommender With Explanations, Captioning And Suggestions Podo Luca, Angelini Marco, Velardi Paola
- Exploring The Efficacy Of Large Language Models (GPT-4) In Binary Reverse Engineering Pordanesh Saman, Tan Benjamin
- Evaluation Of Language Models In The Medical Context Under Resource-constrained Settings Posada Andrea, Rueckert Daniel, Meissen Felix, Müller Philip
- Retrieval Augmented Thought Process For Private Data Handling In Healthcare Pouplin Thomas, Sun Hao, Holt Samuel, Van Der Schaar Mihaela
- Machine Translation With Large Language Models: Prompt Engineering For Persian, English, And Russian Directions Pourkamali Nooshin, Sharifi Shler Ebrahim
- Large Language Models Meet User Interfaces: The Case Of Provisioning Feedback Pozdniakov Stanislav, Brazil Jonathan, Abdi Solmaz, Bakharia Aneesha, Sadiq Shazia, Gasevic Dragan, Denny Paul, Khosravi Hassan
- Sambanova SN40L: Scaling The AI Memory Wall With Dataflow And Composition Of Experts Prabhakar Raghu, Sivaramakrishnan Ram, Gandhi Darshan, Du Yun, Wang Mingran, Song Xiangyu, Zhang Kejie, Gao Tianren, Wang Angela, Li Karen, Sheng Yongning, Brot Joshua, Sokolov Denis, Vivek Apurv, Leung Calvin, Sabnis Arjun, Bai Jiayu, Zhao Tuowen, Gottscho Mark, Jackson David, Luttrell Mark, Shah Manish K., Chen Edison, Liang Kaizhao, Jain Swayambhoo, Thakker Urmish, Huang Dawei, Jairath Sumti, Brown Kevin J., Olukotun Kunle
- DEXTER: A Benchmark For Open-domain Complex Question Answering Using Llms Venktesh V., Prabhu Deepali, Anand Avishek
- Convkgyarn: Spinning Configurable And Scalable Conversational Knowledge Graph QA Datasets With Large Language Models Pradeep Ronak, Lee Daniel, Mousavi Ali, Pound Jeff, Sang Yisi, Lin Jimmy, Ilyas Ihab, Potdar Saloni, Arefiyan Mostafa, Li Yunyao
- Accelerating Large Language Model Pretraining Via LFR Pedagogy: Learn, Focus, And Review Prakriya Neha, Yen Jui-nan, Hsieh Cho-jui, Cong Jason
- Interactions With Prompt Problems: A New Way To Teach Programming With Large Language Models Prather James, Denny Paul, Leinonen Juho, Smith David H. IV, Reeves Brent N., Macneil Stephen, Becker Brett A., Luxton-reilly Andrew, Amarouche Thezyrie, Kimmel Bailey
- Helpful Or Harmful? Exploring The Efficacy Of Large Language Models For Online Grooming Prevention Prosser Ellie, Edwards Matthew
- Crafting A Good Prompt Or Providing Exemplary Dialogues? A Study Of In-context Learning For Persona-based Dialogue Generation Pu Jiashu, Wan Yajing, Zhang Yuru, Chen Jing, Cheng Ling, Shao Qian, Chang Yongzhu, Lv Tangjie, Zhang Rongsheng
- Can LLM Generate Culturally Relevant Commonsense QA Data? Case Study In Indonesian And Sundanese Putri Rifki Afina, Haznitrama Faiz Ghifari, Adhista Dea, Oh Alice
- Less Is More: Accurate Speech Recognition & Translation Without Web-scale Data Puvvada Krishna C., Żelasko Piotr, Huang He, Hrinchuk Oleksii, Koluguri Nithin Rao, Dhawan Kunal, Majumdar Somshubra, Rastorgueva Elena, Chen Zhehuai, Lavrukhin Vitaly, Balam Jagadeesh, Ginsburg Boris
- Model Internals-based Answer Attribution For Trustworthy Retrieval-augmented Generation Qi Jirui, Sarti Gabriele, Fernández Raquel, Bisazza Arianna
- Data Augmentation Integrating Dialogue Flow And Style To Adapt Spoken Dialogue Systems To Low-resource User Groups Qi Zhiyang, Inaba Michimasa
- Prompt Perturbation Consistency Learning For Robust Language Models Qiang Yao, Nandi Subhrangshu, Mehrabi Ninareh, Steeg Greg Ver, Kumar Anoop, Rumshisky Anna, Galstyan Aram
- Openba-v2: Reaching 77.3% High Compression Ratio With Fast Multi-stage Pruning Qiao Dan, Su Yi, Wang Pinzheng, Ye Jing, Xie Wenjing, Zhou Yuechi, Ding Yuyang, Tang Zecheng, Wang Jikai, Ji Yixin, Wang Yue, Guo Pei, Sun Zechen, Zhang Zikang, Li Juntao, Chao Pingfu, Chen Wenliang, Fu Guohong, Zhou Guodong, Zhu Qiaoming, Zhang Min
- Vl-mamba: Exploring State Space Models For Multimodal Learning Qiao Yanyuan, Yu Zheng, Guo Longteng, Chen Sihan, Zhao Zijia, Sun Mingzhen, Wu Qi, Liu Jing
- Empirical Guidelines For Deploying Llms Onto Resource-constrained Edge Devices Qin Ruiyang, Liu Dancheng, Yan Zheyu, Tan Zhaoxuan, Pan Zixuan, Jia Zhenge, Jiang Meng, Abbasi Ahmed, Xiong Jinjun, Shi Yiyu
- Exploring The Nexus Of Large Language Models And Legal Systems: A Short Survey Qin Weicong, Sun Zhongxiang
- LAMPO: Large Language Models As Preference Machines For Few-shot Ordinal Classification Qin Zhen, Wu Junru, Shen Jiaming, Liu Tianqi, Wang Xuanhui
- Efficient Interactive LLM Serving With Proxy Model-based Sequence Length Prediction Qiu Haoran, Mao Weichao, Patke Archit, Cui Shengkun, Jha Saurabh, Wang Chen, Franke Hubertus, Kalbarczyk Zbigniew T., Başar Tamer, Iyer Ravishankar K.
- Snapntell: Enhancing Entity-centric Visual Question Answering With Retrieval Augmented Multimodal LLM Qiu Jielin, Madotto Andrea, Lin Zhaojiang, Crook Paul A., Xu Yifan Ethan, Dong Xin Luna, Faloutsos Christos, Li Lei, Damavandi Babak, Moon Seungwhan
- Megen: Generative Backdoor In Large Language Models Via Model Editing Qiu Jiyang, Ma Xinbei, Zhang Zhuosheng, Zhao Hai
- Dissecting Multiplication In Transformers: Insights Into Llms Qiu Luyu, Li Jianing, Su Chi, Zhang Chen Jason, Chen Lei
- Entropy-based Decoding For Retrieval-augmented Large Language Models Qiu Zexuan, Ou Zijing, Wu Bin, Li Jingjing, Liu Aiwei, King Irwin
- Unsupervised Distractor Generation Via Large Language Model Distilling And Counterfactual Contrastive Decoding Qu Fanyi, Sun Hao, Wu Yunfang
- The Frontier Of Data Erasure: Machine Unlearning For Large Language Models Qu Youyang, Ding Ming, Sun Nan, Thilakarathna Kanchana, Zhu Tianqing, Niyato Dusit
- Automatically Generating Numerous Context-driven SFT Data For Llms Across Diverse Granularity Quan Shanghaoran
- Detecting Hallucinations In Large Language Model Generation: A Token Probability Approach Quevedo Ernesto, Yero Jorge, Koerner Rachel, Rivas Pablo, Cerny Tomas
- An Empirical Study On Self-correcting Large Language Models For Data Science Code Generation Quoc Thai Tang, Minh Duc Ha, Thanh Tho Quan, Nguyen-duc Anh
- Memory-augmented Generative Adversarial Transformers Raaijmakers Stephan, Bakker Roos, Cremers Anita, De Kleijn Roy, Kouwenhoven Tom, Verhoef Tessa
- Defan: Definitive Answer Dataset For Llms Hallucination Evaluation Rahman A B M Ashikur, Anwar Saeed, Usman Muhammad, Mian Ajmal
- Report On The 1st Workshop On Large Language Model For Evaluation In Information Retrieval (llm4eval 2024) At SIGIR 2024 Rahmani Hossein A., Siro Clemencia, Aliannejadi Mohammad, Craswell Nick, Clarke Charles L. A., Faggioli Guglielmo, Mitra Bhaskar, Thomas Paul, Yilmaz Emine
- Masontigers At Semeval-2024 Task 9: Solving Puzzles With An Ensemble Of Chain-of-thoughts Raihan Md Nishat, Goswami Dhiman, Emran Al Nahian Bin, Puspo Sadiya Sayara Chowdhury, Ganguly Amrita, Zampieri Marcos
- Knowledge-based Consistency Testing Of Large Language Models Rajan Sai Sathiesh, Soremekun Ezekiel, Chattopadhyay Sudipta
- Constructing Domain-specific Evaluation Sets For Llm-as-a-judge Raju Ravi, Jain Swayambhoo, Li Bo, Li Jonathan, Thakker Urmish
- ANCHOR: Llm-driven News Subject Conditioning For Text-to-image Synthesis Ramakrishnan Aashish Anantha, Huang Sharon X., Lee Dongwon
- Self-refinement Of Language Models From External Proxy Metrics Feedback Ramji Keshav, Lee Young-suk, Astudillo Ramón Fernandez, Sultan Md Arafat, Naseem Tahira, Munawar Asim, Florian Radu, Roukos Salim
- Analyzing LLM Behavior In Dialogue Summarization: Unveiling Circumstantial Hallucination Trends Ramprasad Sanjana, Ferracane Elisa, Lipton Zachary C.
- Learning To Localize Objects Improves Spatial Reasoning In Visual-llms Ranasinghe Kanchana, Shukla Satya Narayan, Poursaeed Omid, Ryoo Michael S., Lin Tsung-yu
- The Battle Of Llms: A Comparative Study In Conversational QA Tasks Rangapur Aryan, Rangapur Aman
- SPARQL Generation: An Analysis On Fine-tuning Openllama For Question Answering Over A Life Science Knowledge Graph Rangel Julio C., De Farias Tarcisio Mendes, Sima Ana Claudia, Kobayashi Norio
- Visual Hallucination: Definition, Quantification, And Prescriptive Remediations Rani Anku, Rawte Vipula, Sharma Harshad, Anand Neeraj, Rajbangshi Krishnav, Sheth Amit, Das Amitava
- A Critical Look At Tokenwise Reward-guided Text Generation Rashid Ahmad, Wu Ruotian, Grosse Julia, Kristiadi Agustinus, Poupart Pascal
- Llms For Test Input Generation For Semantic Caches Rasool Zafaryab, Barnett Scott, Willie David, Kurniawan Stefanus, Balugo Sherwin, Thudumu Srikanth, Abdelrazek Mohamed
- Cinepile: A Long Video Question Answering Dataset And Benchmark Rawal Ruchit, Saifullah Khalid, Basri Ronen, Jacobs David, Somepalli Gowthami, Goldstein Tom
- Developing Safe And Responsible Large Language Model: Can We Balance Bias Reduction And Language Understanding In Large Language Models? Raza Shaina, Bamgbose Oluwanifemi, Ghuge Shardul, Tavakol Fatemeh, Reji Deepak John, Bashir Syed Raza
- Beads: Bias Evaluation Across Domains Raza Shaina, Rahman Mizanur, Zhang Michael R.
- MBIAS: Mitigating Bias In Large Language Models While Retaining Context Raza Shaina, Raval Ananya, Chatrath Veronica
- FIRST: Faster Improved Listwise Reranking With Single Token Decoding Reddy Revanth Gangi, Doo Jaehyeok, Xu Yifei, Sultan Md Arafat, Swain Deevya, Sil Avirup, Ji Heng
- MASSIVE Multilingual Abstract Meaning Representation: A Dataset And Baselines For Hallucination Detection Regan Michael, Wein Shira, Baker George, Monti Emilio
- Optimizing Visual Question Answering Models For Driving: Bridging The Gap Between Human And Machine Attention Patterns Rekanar Kaavya, Hayes Martin, Sistu Ganesh, Eising Ciaran
- Identifying Semantic Induction Heads To Understand In-context Learning Ren Jie, Guo Qipeng, Yan Hang, Liu Dongrui, Zhang Quanshi, Qiu Xipeng, Lin Dahua
- Melora: Mini-ensemble Low-rank Adapters For Parameter-efficient Fine-tuning Ren Pengjie, Shi Chengshun, Wu Shiguang, Zhang Mengqi, Ren Zhaochun, De Rijke Maarten, Chen Zhumin, Pei Jiahuan
- Self-generated Replay Memories For Continual Neural Machine Translation Resta Michele, Bacciu Davide
- How Multilingual Are Large Language Models Fine-tuned For Translation? Richburg Aquia, Carpuat Marine
- Quantifying Contamination In Evaluating Code Generation Capabilities Of Language Models Riddell Martin, Ni Ansong, Cohan Arman
- Code Generation With Alphacodium: From Prompt Engineering To Flow Engineering Ridnik Tal, Kredo Dedy, Friedman Itamar
- Instruction-tuned Large Language Models For Machine Translation In The Medical Domain Rios Miguel
- Question-instructed Visual Descriptions For Zero-shot Video Question Answering Romero David, Solorio Thamar
- CLAPNQ: Cohesive Long-form Answers From Passages In Natural Questions For RAG Systems Rosenthal Sara, Sil Avirup, Florian Radu, Roukos Salim
- Pairing Analogy-augmented Generation With Procedural Memory For Procedural Q&A Roth K, Gupta Rushil, Halle Simon, Liu Bang
- Enhancing Low-resource NMT With A Multilingual Encoder And Knowledge Distillation: A Case Study Roy Aniruddha, Ray Pretam, Maheshwari Ayush, Sarkar Sudeshna, Goyal Pawan
- Evaluation Of RAG Metrics For Question Answering In The Telecom Domain Roychowdhury Sujoy, Soman Sumit, Ranjani H G, Gunda Neeraj, Chhabra Vansh, Bala Sai Krishna
- Capabilities Of Gemini Models In Medicine Saab Khaled, Tu Tao, Weng Wei-hung, Tanno Ryutaro, Stutz David, Wulczyn Ellery, Zhang Fan, Strother Tim, Park Chunjong, Vedadi Elahe, Chaves Juanma Zambrano, Hu Szu-yeu, Schaekermann Mike, Kamath Aishwarya, Cheng Yong, Barrett David G. T., Cheung Cathy, Mustafa Basil, Palepu Anil, Mcduff Daniel, Hou Le, Golany Tomer, Liu Luyang, Alayrac Jean-baptiste, Houlsby Neil, Tomasev Nenad, Freyberg Jan, Lau Charles, Kemp Jonas, Lai Jeremy, Azizi Shekoofeh, Kanada Kimberly, Man Siwai, Kulkarni Kavita, Sun Ruoxi, Shakeri Siamak, He Luheng, Caine Ben, Webson Albert, Latysheva Natasha, Johnson Melvin, Mansfield Philip, Lu Jian, Rivlin Ehud, Anderson Jesper, Green Bradley, Wong Renee, Krause Jonathan, Shlens Jonathon, Dominowska Ewa, Eslami S. M. Ali, Chou Katherine, Cui Claire, Vinyals Oriol, Kavukcuoglu Koray, Manyika James, Dean Jeff, Hassabis Demis, Matias Yossi, Webster Dale, Barral Joelle, Corrado Greg, Semturs Christopher, Mahdavi S. Sara, Gottweis Juraj, Karthikesalingam Alan, Natarajan Vivek
- Llms' Understanding Of Natural Language Revealed Saba Walid S.
- Mitigating Hallucination In Fictional Character Role-play Sadeq Nafis, Xie Zhouhang, Kang Byungkyu, Lamba Prarit, Gao Xiang, Mcauley Julian
- Insights Into Alignment: Evaluating DPO And Its Variants Across Multiple Tasks Saeidi Amir, Verma Shivanshu, Baral Chitta
- On Zero-shot Counterspeech Generation By Llms Saha Punyajoy, Agrawal Aalok, Jana Abhik, Biemann Chris, Mukherjee Animesh
- A Systematic Survey Of Prompt Engineering In Large Language Models: Techniques And Applications Sahoo Pranab, Singh Ayush Kumar, Saha Sriparna, Jain Vinija, Mondal Samrat, Chadha Aman
- Mixsumm: Topic-based Data Augmentation Using Llms For Low-resource Extractive Text Summarization Sahu Gaurav, Laradji Issam H.
- Toward The Evaluation Of Large Language Models Considering Score Variance Across Instruction Templates Sakai Yusuke, Nohejl Adam, Hang Jiangnan, Kamigaito Hidetaka, Watanabe Taro
- Risks, Causes, And Mitigations Of Widespread Deployments Of Large Language Models (llms): A Survey Sakib Md Nazmus, Islam Md Athikul, Pathak Royal, Arifin Md Mashrur
- Optimization Methods For Personalizing Large Language Models Through Retrieval Augmentation Salemi Alireza, Kallumadi Surya, Zamani Hamed
- Rainbow Teaming: Open-ended Generation Of Diverse Adversarial Prompts Samvelyan Mikayel, Raparthy Sharath Chandra, Lupu Andrei, Hambro Eric, Markosyan Aram H., Bhatt Manish, Mao Yuning, Jiang Minqi, Parker-holder Jack, Foerster Jakob, Rocktäschel Tim, Raileanu Roberta
- Zero-shot RTL Code Generation With Attention Sink Augmented Large Language Models Sandal Selim, Akturk Ismail
- Retrieval-augmented Generation Meets Data-driven Tabula Rasa Approach For Temporal Knowledge Graph Forecasting Sannidhi Geethan, Sakhinana Sagar Srinivas, Runkana Venkataramana
- Lean Attention: Hardware-aware Scalable Attention Mechanism For The Decode-phase Of Transformers Sanovar Rya, Bharadwaj Srikant, Amant Renee St., Rühle Victor, Rajmohan Saravan
- Grapheval: A Knowledge-graph Based LLM Hallucination Evaluation Framework Sansford Hannah, Richardson Nicholas, Maretic Hermina Petric, Saada Juba Nait
- The Power Of Prompts: Evaluating And Mitigating Gender Bias In MT With Llms Sant Aleix, Escolano Carlos, Mash Audrey, Fornaciari Francesca De Luca, Melero Maite
- Textmachina: Seamless Generation Of Machine-generated Text Datasets Sarvazyan Areg Mikael, González José Ángel, Franco-salvador Marc
- A Prompt Response To The Demand For Automatic Gender-neutral Translation Savoldi Beatrice, Piergentili Andrea, Fucci Dennis, Negri Matteo, Bentivogli Luisa
- Robust Few-shot Transfer Learning For Knowledge Base Question Answering With Unanswerable Questions Sawhney Riya, Bhattacharya Indrajit, Mausam
- Evaluating Consistency And Reasoning Capabilities Of Large Language Models Saxena Yash, Chopra Sarthak, Tripathi Arunendra Mani
- Self-distillation For Model Stacking Unlocks Cross-lingual NLU In 200+ Languages Schmidt Fabian David, Borchert Philipp, Vulić Ivan, Glavaš Goran
- Symbolic Prompt Program Search: A Structure-aware Approach To Efficient Compile-time Prompt Optimization Schnabel Tobias, Neville Jennifer
- THREAD: Thinking Deeper With Recursive Spawning Schroeder Philip, Morgan Nathaniel, Luo Hongyin, Glass James
- Context Matters: An Empirical Study Of The Impact Of Contextual Information In Temporal Question Answering Systems Schumacher Dan, Haji Fatemeh, Grey Tara, Bandlamudi Niharika, Karnik Nupoor, Kumar Gagana Uday, Chiang Jason Cho-yu, Rad Paul, Vishwamitra Nishant, Rios Anthony
- Evaluating Large Language Models With Fmeval Schwöbel Pola, Franceschi Luca, Zafar Muhammad Bilal, Vasist Keerthan, Malhotra Aman, Shenhar Tomer, Tailor Pinal, Yilmaz Pinar, Diamond Michael, Donini Michele
- Quantitative Knowledge Retrieval From Large Language Models Selby David, Spriestersbach Kai, Iwashita Yuichiro, Bappert Dennis, Warrier Archana, Mukherjee Sumantrak, Asim Muhammad Nabeel, Kise Koichi, Vollmer Sebastian
- Benchmarking Large Language Models For Math Reasoning Tasks Seßler Kathrin, Rong Yao, Gözlüklü Emek, Kasneci Enkelejda
- Medpromptx: Grounded Multimodal Prompting For Chest X-ray Diagnosis Shaaban Mai A., Khan Adnan, Yaqub Mohammad
- Tencdm: Understanding The Properties Of Diffusion Model In The Space Of Language Model Encodings Shabalin Alexander, Meshchaninov Viacheslav, Chimbulatov Egor, Lapikov Vladislav, Kim Roman, Bartosh Grigory, Molchanov Dmitry, Markov Sergey, Vetrov Dmitry
- Flashattention-3: Fast And Accurate Attention With Asynchrony And Low-precision Shah Jay, Bikshandi Ganesh, Zhang Ying, Thakkar Vijay, Ramani Pradeep, Dao Tri
- An End-to-end Speech Summarization Using Large Language Model Shang Hengchao, Li Zongyao, Guo Jiaxin, Li Shaojun, Rao Zhiqiang, Luo Yuanchang, Wei Daimeng, Yang Hao
- Extensible Embedding: A Flexible Multipler For Llm's Context Length Shao Ninglu, Xiao Shitao, Liu Zheng, Zhang Peitian
- Flexibly Scaling Large Language Models Contexts Through Extensible Tokenization Shao Ninglu, Xiao Shitao, Liu Zheng, Zhang Peitian
- Can Llms Replace Economic Choice Prediction Labs? The Case Of Language-based Persuasion Games Shapira Eilam, Madmon Omer, Reichart Roi, Tennenholtz Moshe
- Guiding In-context Learning Of Llms Through Quality Estimation For Machine Translation Sharami Javad Pourmostafa Roshan, Shterionov Dimitar, Spronck Pieter
- Unirag: Universal Retrieval Augmentation For Multi-modal Large Language Models Sharifymoghaddam Sahel, Upadhyay Shivani, Chen Wenhu, Lin Jimmy
- SPML: A DSL For Defending Language Models Against Prompt Attacks Sharma Reshabh K, Gupta Vinayak, Grossman Dan
- Thermometer: Towards Universal Calibration For Large Language Models Shen Maohao, Das Subhro, Greenewald Kristjan, Sattigeri Prasanna, Wornell Gregory, Ghosh Soumya
- Pyramid Coder: Hierarchical Code Generator For Compositional Visual Question Answering Shen Ruoyue, Inoue Nakamasa, Shinoda Koichi
- Small Llms Are Weak Tool Learners: A Multi-llm Agent Shen Weizhou, Li Chenliang, Chen Hongzhan, Yan Ming, Quan Xiaojun, Chen Hehong, Zhang Ji, Huang Fei
- PMG: Personalized Multimodal Generation With Large Language Models Shen Xiaoteng, Zhang Rui, Zhao Xiaoyan, Zhu Jieming, Xiao Xi
- Scaling Laws For Linear Complexity Language Models Shen Xuyang, Li Dong, Leng Ruitao, Qin Zhen, Sun Weigao, Zhong Yiran
- The Fire Thief Is Also The Keeper: Balancing Usability And Privacy In Prompts Shen Zhili, Xi Zihang, He Ying, Tong Wei, Hua Jingyu, Zhong Sheng
- Repeval: Effective Text Evaluation With LLM Representation Sheng Shuqian, Xu Yi, Zhang Tianhang, Shen Zanwei, Fu Luoyi, Ding Jiaxin, Zhou Lei, Wang Xinbing, Zhou Chenghu
- Continual Learning Of Large Language Models: A Comprehensive Survey Shi Haizhou, Xu Zihao, Wang Hengyi, Qin Weiyi, Wang Wenyuan, Wang Yibin, Wang Zifeng, Ebrahimi Sayna, Wang Hao
- Paying More Attention To Image: A Training-free Method For Alleviating Hallucination In Lvlms Shi Liu, Kecheng Zheng, Wei Chen
- Ask-eda: A Design Assistant Empowered By LLM, Hybrid RAG And Abbreviation De-hallucination Shi Luyao, Kazda Michael, Sears Bradley, Shropshire Nick, Puri Ruchir
- Action Controlled Paraphrasing Shi Ning, Wu Zijun
- Medadapter: Efficient Test-time Adaptation Of Large Language Models Towards Medical Reasoning Shi Wenqi, Xu Ran, Zhuang Yuchen, Yu Yue, Wu Hang, Yang Carl, Wang May D.
- Ehragent: Code Empowers Large Language Models For Few-shot Complex Tabular Reasoning On Electronic Health Records Shi Wenqi, Xu Ran, Zhuang Yuchen, Yu Yue, Zhang Jieyu, Wu Hang, Zhu Yuanda, Ho Joyce, Yang Carl, Wang May D.
- Retrieval-enhanced Knowledge Editing In Language Models For Multi-hop Question Answering Shi Yucheng, Tan Qiaoyu, Wu Xuansheng, Zhong Shaochen, Zhou Kaixiong, Liu Ninghao
- Generate-then-ground In Retrieval-augmented Generation For Multi-hop Question Answering Shi Zhengliang, Zhang Shuo, Sun Weiwei, Gao Shen, Ren Pengjie, Chen Zhumin, Ren Zhaochun
- How Can Large Language Models Enable Better Socially Assistive Human-robot Interaction: A Brief Survey Shi Zhonghao, Landrum Ellen, O'connell Amy, Kian Mina, Pinto-alva Leticia, Shrestha Kaleen, Zhu Xiaoyuan, Matarić Maja J
- Adapting Llms To Hebrew: Unveiling Dictalm 2.0 With Enhanced Vocabulary And Instruction Capabilities Shmidman Shaltiel, Shmidman Avi, Cohen Amir Dn, Koppel Moshe
- Knowledge Graph Large Language Model (KG-LLM) For Link Prediction Shu Dong, Chen Tianle, Jin Mingyu, Zhang Chong, Du Mengnan, Zhang Yongfeng
- An Active Inference Strategy For Prompting Reliable Responses From Large Language Models In Medical Practice Shusterman Roma, Waters Allison C., O'neill Shannon, Luu Phan, Tucker Don M.
- Design2code: How Far Are We From Automating Front-end Engineering? Si Chenglei, Zhang Yanzhe, Yang Zhengyuan, Liu Ruibo, Yang Diyi
- Where Does In-context Translation Happen In Large Language Models Sia Suzanna, Mueller David, Duh Kevin
- The Fault In Our Stars: Quality Assessment Of Code Generation Benchmarks Siddiq Mohammed Latif, Dristi Simantika, Saha Joy, Santos Joanna C. S.
- Constructing Benchmarks And Interventions For Combating Hallucinations In Llms Simhi Adi, Herzig Jonathan, Szpektor Idan, Belinkov Yonatan
- INDIC QA BENCHMARK: A Multilingual Benchmark To Evaluate Question Answering Capability Of Llms For Indic Languages Singh Abhishek Kumar, Murthy Rudra, Kumar Vishwajeet, Sen Jaydeep, Ramakrishnan Ganesh
- Rethinking Interpretability In The Era Of Large Language Models Singh Chandan, Inala Jeevana Priya, Galley Michel, Caruana Rich, Gao Jianfeng
- Indicgenbench: A Multilingual Benchmark To Evaluate Generation Capabilities Of Llms On Indic Languages Singh Harman, Gupta Nitish, Bharadwaj Shikhar, Tewari Dinesh, Talukdar Partha
- Exposing The Achilles' Heel: Evaluating Llms Ability To Handle Mistakes In Mathematical Reasoning Singh Joykirat, Nambi Akshay, Vineet Vibhav
- Evaluating Zero-shot GPT-4V Performance On 3D Visual Question Answering Benchmarks Singh Simranjit, Pavlakos Georgios, Stamoulis Dimitrios
- LGR2: Language Guided Reward Relabeling For Accelerating Hierarchical Reinforcement Learning Singh Utsav, Bhattacharyya Pramit, Namboodiri Vinay P.
- Are Small Language Models Ready To Compete With Large Language Models For Practical Applications? Sinha Neelabh, Jain Vinija, Chadha Aman
- Exploring Value Biases: How Llms Deviate Towards The Ideal Sivaprasad Sarath, Kaushik Pramod, Abdelnabi Sahar, Fritz Mario
- Replan: Robotic Replanning With Perception And Language Models Skreta Marta, Zhou Zihan, Yuan Jia Lin, Darvish Kourosh, Aspuru-guzik Alán, Garg Animesh
- Aligning Large Language Models For Enhancing Psychiatric Interviews Through Symptom Delineation And Summarization So Jae-hee, Chang Joonhwan, Kim Eunji, Na Junho, Choi Jiyeon, Sohn Jy-yong, Kim Byung-hoon, Chu Sang Hui
- Gistembed: Guided In-sample Selection Of Training Negatives For Text Embedding Fine-tuning Solatorio Aivin V.
- A New Pipeline For Generating Instruction Dataset Via RAG And Self Fine-tuning Song Chih-wei, Lee Yu-kai, Tsai Yin-te
- Moviechat+: Question-aware Sparse Memory For Long Video Question Answering Song Enxin, Chai Wenhao, Ye Tian, Hwang Jenq-neng, Li Xi, Wang Gaoang
- Increasing Model Capacity For Free: A Simple Strategy For Parameter Efficient Fine-tuning Song Haobo, Zhao Hao, Majumder Soumajit, Lin Tao
- Cs-bench: A Comprehensive Benchmark For Large Language Models Towards Computer Science Mastery Song Xiaoshuai, Diao Muxi, Dong Guanting, Wang Zhengyang, Fu Yujia, Qiao Runqi, Wang Zhexu, Fu Dayuan, Wu Huangxuan, Liang Bin, Zeng Weihao, Wang Yejie, Gongque Zhuoma, Yu Jianing, Tan Qiuna, Xu Weiran
- The Good, The Bad, And The Greedy: Evaluation Of Llms Should Not Ignore Non-determinism Song Yifan, Wang Guoyin, Li Sujian, Lin Bill Yuchen
- Sharelora: Parameter Efficient And Robust Large Language Model Fine-tuning Via Shared Low-rank Adaptation Song Yurun, Zhao Junchen, Harris Ian G., Jyothi Sangeetha Abdu
- Moviellm: Enhancing Long Video Understanding With Ai-generated Movies Song Zhende, Wang Chenchen, Sheng Jiamu, Zhang Chi, Yu Gang, Fan Jiayuan, Chen Tao
- Alchemistcoder: Harmonizing And Eliciting Code Capability By Hindsight Tuning On Multi-source Data Song Zifan, Wang Yudong, Zhang Wenwei, Liu Kuikun, Lyu Chengqi, Song Demin, Guo Qipeng, Yan Hang, Lin Dahua, Chen Kai, Zhao Cairong
- Fine Tuning Vs. Retrieval Augmented Generation For Less Popular Knowledge Soudani Heydar, Kanoulas Evangelos, Hasibi Faegheh
- Know When To Stop: A Study Of Semantic Drift In Text Generation Spataru Ava, Hambro Eric, Voita Elena, Cancedda Nicola
- Generaitor: Tree-in-the-loop Text Generation For Language Model Explainability And Adaptation Spinner Thilo, Kehlbeck Rebecca, Sevastjanova Rita, Stähle Tobias, Keim Daniel A., Deussen Oliver, El-assady Mennatallah
- VDGD: Mitigating LVLM Hallucinations In Cognitive Prompts By Bridging The Visual Perception Gap Sreyan Ghosh, Chandra Kiran Reddy Evuru, Sonal Kumar, Utkarsh Tyagi, Oriol Nieto, Zeyu Jin, Dinesh Manocha
- Enhancing Temporal Understanding In Audio Question Answering For Large Audio Language Models Sridhar Arvind Krishna, Guo Yinyi, Visser Erik
- Evaluating Llms' Mathematical Reasoning In Financial Document Question Answering Srivastava Pragya, Malik Manuj, Gupta Vivek, Ganu Tanuja, Roth Dan
- Towards Truly Zero-shot Compositional Visual Reasoning With Llms As Programmers Stanić Aleksandar, Caelles Sergi, Tschannen Michael
- The Fine-tuning Paradox: Boosting Translation Quality Without Sacrificing LLM Abilities Stap David, Hasler Eva, Byrne Bill, Monz Christof, Tran Ke
- Few-shot VQA With Frozen Llms: A Tale Of Two Approaches Sterner Igor, Lin Weizhe, Chen Jinghong, Byrne Bill
- The Calibration Gap Between Model And Human Confidence In Large Language Models Steyvers Mark, Tejeda Heliodoro, Kumar Aakriti, Belem Catarina, Karny Sheer, Hu Xinyue, Mayer Lukas, Smyth Padhraic
- Groundedness In Retrieval-augmented Long-form Generation: An Empirical Study Stolfo Alessandro
- Llms Achieve Adult Human Performance On Higher-order Theory Of Mind Tasks Street Winnie, Siy John Oliver, Keeling Geoff, Baranes Adrien, Barnett Benjamin, Mckibben Michael, Kanyere Tatenda, Lentz Alison, Arcas Blaise Aguera Y, Dunbar Robin I. M.
- ARKS: Active Retrieval In Knowledge Soup For Code Generation Su Hongjin, Jiang Shuyang, Lai Yuhang, Wu Haoyuan, Shi Boao, Liu Che, Liu Qian, Yu Tao
- Mitigating Entity-level Hallucination In Large Language Models Su Weihang, Tang Yichen, Ai Qingyao, Wang Changyue, Wu Zhijing, Liu Yiqun
- Unsupervised Real-time Hallucination Detection Based On The Internal States Of Large Language Models Su Weihang, Wang Changyue, Ai Qingyao, Hu Yiran, Wu Zhijing, Zhou Yujia, Liu Yiqun
- Demonstration Augmentation For Zero-shot In-context Learning Su Yi, Tai Yunpeng, Ji Yixin, Li Juntao, Yan Bowen, Zhang Min
- Living In The Moment: Can Large Language Models Grasp Co-temporal Reasoning? Su Zhaochen, Li Juntao, Zhang Jun, Zhu Tong, Qu Xiaoye, Zhou Pan, Yan Bowen, Cheng Yu, Zhang Min
- Timo: Towards Better Temporal Reasoning For Language Models Su Zhaochen, Zhang Jun, Zhu Tong, Qu Xiaoye, Li Juntao, Zhang Min, Cheng Yu
- M-QALM: A Benchmark To Assess Clinical Reading Comprehension And Knowledge Recall In Large Language Models Via Question Answering Subramanian Anand, Schlegel Viktor, Kashyap Abhinav Ramesh, Nguyen Thanh-tung, Dwivedi Vijay Prakash, Winkler Stefan
- LAB: Large-scale Alignment For Chatbots Sudalairaj Shivchander, Bhandwaldar Abhishek, Pareja Aldo, Xu Kai, Cox David D., Srivastava Akash
- Confabulation: The Surprising Value Of Large Language Model Hallucinations Sui Peiqi, Duede Eamon, Wu Sophie, So Richard Jean
- Fidelis: Faithful Reasoning In Large Language Model For Knowledge Graph Question Answering Sui Yuan, He Yufei, Liu Nian, He Xiaoxin, Wang Kun, Hooi Bryan
- Decoding At The Speed Of Thought: Harnessing Parallel Decoding Of Lexical Units For Llms Sun Chenxi, Zhang Hongzhi, Lin Zijia, Zhang Jingyuan, Zhang Fuzheng, Wang Zhongyuan, Chen Bin, Song Chengru, Zhang Di, Gai Kun, Xiong Deyi
- Building Better AI Agents: A Provocation On The Utilisation Of Persona In Llm-based Conversational Agents Sun Guangzhi, Zhan Xiao, Such Jose
- Conifer: Improving Complex Constrained Instruction-following Ability Of Large Language Models Sun Haoran, Liu Lixin, Li Junjie, Wang Fengyu, Dong Baohua, Lin Ran, Huang Ruohui
- Fine-tuning Vs Prompting, Can Language Models Understand Human Values? Sun Pingwei
- Fostering Natural Conversation In Large Language Models With NICO: A Natural Interactive Conversation Dataset Sun Renliang, Liu Mengyuan, Yang Shiping, Wang Rui, He Junqing, Zhang Jiaxing
- Prompt Chaining Or Stepwise Prompt? Refinement In Text Summarization Sun Shichao, Yuan Ruifeng, Cao Ziqiang, Li Wenjie, Liu Pengfei
- Llm-oriented Retrieval Tuner Sun Si, Zhang Hanqing, Liu Zhiyuan, Bao Jie, Song Dawei
- Unicoder: Scaling Code Large Language Model Via Universal Code Sun Tao, Chai Linzheng, Yang Jian, Yin Yuwei, Guo Hongcheng, Liu Jiaheng, Wang Bing, Yang Liqun, Li Zhoujun
- Towards Detecting Llms Hallucination Via Markov Chain-based Multi-agent Debate Framework Sun Xiaoxi, Li Jinpeng, Zhong Yan, Zhao Dongyan, Yan Rui
- Chain-of-strategy Planning With Llms: Aligning The Generation Of Psychotherapy Dialogue With Strategy In Motivational Interviewing Sun Xin, Tang Xiao, Ali Abdallah El, Li Zhuying, Shen Xiaoyu, Ren Pengjie, De Wit Jan, Pei Jiahuan, Bosch Jos A.
- Multi-turn Context Jailbreak Attack On Large Language Models From First Principles Sun Xiongtao, Zhang Deyue, Yang Dongdong, Zou Quanchen, Li Hui
- LCS: A Language Converter Strategy For Zero-shot Neural Machine Translation Sun Zengkui, Liu Yijin, Meng Fandong, Xu Jinan, Chen Yufeng, Zhou Jie
- Leveraging Chatgpt In Pharmacovigilance Event Extraction: An Empirical Study Sun Zhaoyue, Pergola Gabriele, Wallace Byron C., He Yulan
- Automatic Generation And Evaluation Of Reading Comprehension Test Items With Large Language Models Säuberli Andreas, Clematide Simon
- Performance Assessment Of Chatgpt Vs Bard In Detecting Alzheimer's Dementia T Balamurali B, Chen Jer-ming
- Online Adaptation Of Language Models With A Memory Of Amortized Contexts Tack Jihoon, Kim Jaehyung, Mitchell Eric, Shin Jinwoo, Teh Yee Whye, Schwarz Jonathan Richard
- JORA: JAX Tensor-parallel Lora Library For Retrieval Augmented Fine-tuning Tahir Anique, Cheng Lu, Liu Huan
- PIXAR: Auto-regressive Language Modeling In Pixel Space Tai Yintao, Liao Xiyang, Suglia Alessandro, Vergari Antonio
- Pretraining And Updating Language- And Domain-specific Large Language Model: A Case Study In Japanese Business Domain Takahashi Kosuke, Omi Takahiro, Arima Kosuke, Ishigaki Tatsuya
- Improving Large Language Model (LLM) Fidelity Through Context-aware Grounding: A Systematic Approach To Reliability And Veracity Talukdar Wrick, Biswas Anjanava
- Let Me Speak Freely? A Study On The Impact Of Format Restrictions On Performance Of Large Language Models Tam Zhi Rui, Wu Cheng-kuang, Tsai Yi-lin, Lin Chieh-yen, Lee Hung-yi, Chen Yun-nung
- Mobilequant: Mobile-friendly Quantization For On-device Language Models Tan Fuwen, Lee Royson, Dudziak Łukasz, Hu Shell Xu, Bhattacharya Sourav, Hospedales Timothy, Tzimiropoulos Georgios, Martinez Brais
- PROXYQA: An Alternative Framework For Evaluating Long-form Text Generation With Large Language Models Tan Haochen, Guo Zhijiang, Shi Zhan, Xu Lu, Liu Zhili, Feng Yunlong, Li Xiaoguang, Wang Yasheng, Shang Lifeng, Liu Qun, Song Linqi
- Idgenrec: Llm-recsys Alignment With Textual ID Learning Tan Juntao, Xu Shuyuan, Hua Wenyue, Ge Yingqiang, Li Zelong, Zhang Yongfeng
- Lloco: Learning Long Contexts Offline Tan Sijun, Li Xiuyu, Patil Shishir, Wu Ziyang, Zhang Tianjun, Keutzer Kurt, Gonzalez Joseph E., Popa Raluca Ada
- Fine-tuning Large Language Model (LLM) Artificial Intelligence Chatbots In Ophthalmology And Llm-based Evaluation Using GPT-4 Tan Ting Fang, Elangovan Kabilan, Jin Liyuan, Jie Yao, Yong Li, Lim Joshua, Poh Stanley, Ng Wei Yan, Lim Daniel, Ke Yuhe, Liu Nan, Ting Daniel Shu Wei
- Struct-x: Enhancing Large Language Models Reasoning With Structured Data Tan Xiaoyu, Wang Haoyu, Qiu Xihe, Cheng Yuan, Xu Yinghui, Chu Wei, Qi Yuan
- Large Language Models For Data Annotation: A Survey Tan Zhen, Li Dawei, Wang Song, Beigi Alimohammad, Jiang Bohan, Bhattacharjee Amrita, Karami Mansooreh, Li Jundong, Cheng Lu, Liu Huan
- Tuning-free Accountable Intervention For LLM Deployment -- A Metacognitive Approach Tan Zhen, Peng Jie, Chen Tianlong, Liu Huan
- Instructdoc: A Dataset For Zero-shot Generalization Of Visual Document Understanding With Instructions Tanaka Ryota, Iki Taichi, Nishida Kyosuke, Saito Kuniko, Suzuki Jun
- Tofueval: Evaluating Hallucinations Of Llms On Topic-focused Dialogue Summarization Tang Liyan, Shalyminov Igor, Wong Amy Wing-mei, Burnsky Jon, Vincent Jake W., Yang Yu'an, Singh Siffi, Feng Song, Song Hwanjun, Su Hang, Sun Lijia, Zhang Yi, Mansour Saab, Mckeown Kathleen
- On The Hardness Of Faithful Chain-of-thought Reasoning In Large Language Models Tanneru Sree Harsha, Ley Dan, Agarwal Chirag, Lakkaraju Himabindu
- Making A Long Story Short In Conversation Modeling Tao Yufei, Mines Tiernan, Agrawal Ameeta
- CUDRT: Benchmarking The Detection Of Human Vs. Large Language Models Generated Texts Tao Zhen, Li Zhiyu, Xi Dinghao, Xu Wei
- MEEL: Multi-modal Event Evolution Learning Tao Zhengwei, Jin Zhi, Huang Junqiang, Chen Xiancai, Bai Xiaoying, Zhao Haiyan, Zhang Yifan, Tao Chongyang
- A Survey On Self-evolution Of Large Language Models Tao Zhengwei, Lin Ting-en, Chen Xiancai, Li Hangyu, Wu Yuchuan, Li Yongbin, Jin Zhi, Huang Fei, Tao Dacheng, Zhou Jingren
- Targeted Visual Prompting For Medical Visual Question Answering Tascon-morales Sergio, Márquez-neila Pablo, Sznitman Raphael
- Embedding-aligned Language Models Tennenholtz Guy, Chow Yinlam, Hsu Chih-wei, Shani Lior, Liang Ethan, Boutilier Craig
- Introducing Super Rags In Mistral 8x7b-v1 Thakur Ayush, Gupta Raghav
- Limits Of Transformer Language Models On Learning To Compose Algorithms Thomm Jonathan, Terzic Aleksandar, Camposampiero Giacomo, Hersche Michael, Schölkopf Bernhard, Rahimi Abbas
- Elitr-bench: A Meeting Assistant Benchmark For Long-context Language Models Thonet Thibaut, Rozen Jos, Besacier Laurent
- Climategpt: Towards AI Synthesizing Interdisciplinary Research On Climate Change Thulke David, Gao Yingbo, Pelser Petrus, Brune Rein, Jalota Rricha, Fok Floris, Ramos Michael, Van Wyk Ian, Nasir Abdallah, Goldstein Hayden, Tragemann Taylor, Nguyen Katie, Fowler Ariana, Stanco Andrew, Gabriel Jon, Taylor Jordan, Moro Dean, Tsymbalov Evgenii, De Waal Juliette, Matusov Evgeny, Yaghi Mudar, Shihadah Mohammad, Ney Hermann, Dugast Christian, Dotan Jonathan, Erasmus Daniel
- Selective Prompt Anchoring For Code Generation Tian Yuan, Zhang Tianyi
- Codehalu: Investigating Code Hallucinations In Llms Via Execution-based Verification Tian Yuchen, Yan Weixiang, Yang Qian, Zhao Xuandong, Chen Qian, Wang Wen, Luo Ziyang, Ma Lei, Song Dawn
- Generative AI Systems: A Systems-based Perspective On Generative AI Tomczak Jakub M.
- Cpsdbench: A Large Language Model Evaluation Benchmark And Baseline For Chinese Public Security Domain Tong Xin, Jin Bo, Lin Zhi, Wang Binjun, Yu Ting, Cheng Qiang
- A Comprehensive Survey Of Hallucination Mitigation Techniques In Large Language Models Tonmoy S. M. Towhidul Islam, Zaman S. M. Mehedi, Jain Vinija, Rani Anku, Rawte Vipula, Chadha Aman, Das Amitava
- Prompting Techniques For Secure Code Generation: A Systematic Investigation Tony Catherine, Ferreyra Nicolás E. Díaz, Mutas Markus, Dhiff Salem, Scandariato Riccardo
- Few-shot Cross-lingual Transfer For Prompting Large Language Models In Low-resource Languages Toukmaji Christopher
- Metacognition Is All You Need? Using Introspection In Generative Agents To Improve Goal-directed Behavior Toy Jason, Macadam Josh, Tabor Phil
- Readctrl: Personalizing Text Generation With Readability-controlled Instruction Learning Tran Hieu, Yao Zonghai, Li Lingxi, Yu Hong
- Xtower: A Multilingual LLM For Explaining And Correcting Translation Errors Treviso Marcos, Guerreiro Nuno M., Agrawal Sweta, Rei Ricardo, Pombal José, Vaz Tania, Wu Helena, Silva Beatriz, Van Stigt Daan, Martins André F. T.
- Linguacodus: A Synergistic Framework For Transformative Code Generation In Machine Learning Pipelines Trofimova Ekaterina, Sataev Emil, Ustyuzhanin Andrey E.
- Code Less, Align More: Efficient LLM Fine-tuning For Code Generation With Data Pruning Tsai Yun-da, Liu Mingjie, Ren Haoxing
- Enhance Modality Robustness In Text-centric Multimodal Alignment With Adversarial Prompting Tsai Yun-da, Yen Ting-yu, Liao Keng-te, Lin Shou-de
- Towards Safety And Helpfulness Balanced Responses Via Controllable Large Language Models Tuan Yi-lin, Chen Xilun, Smith Eric Michael, Martin Louis, Batra Soumya, Celikyilmaz Asli, Wang William Yang, Bikel Daniel M.
- LM Transparency Tool: Interactive Tool For Analyzing Transformer Language Models Tufanov Igor, Hambardzumyan Karen, Ferrando Javier, Voita Elena
- Dynamo: Accelerating Language Model Inference With Dynamic Multi-token Sampling Tuli Shikhar, Lin Chi-heng, Hsu Yen-chang, Jha Niraj K., Shen Yilin, Jin Hongxia
- VBART: The Turkish LLM Turker Meliksah, Ari Mehmet Erdi, Han Aydin
- Syncode: LLM Generation With Grammar Augmentation Ugare Shubham, Suresh Tarun, Kang Hangoo, Misailovic Sasa, Singh Gagandeep
- Adacoder: Adaptive Prompt Compression For Programmatic Visual Question Answering Ukai Mahiro, Kurita Shuhei, Hashimoto Atsushi, Ushiku Yoshitaka, Inoue Nakamasa
- Calibrating Large Language Models Using Their Generations Only Ulmer Dennis, Gubri Martin, Lee Hwaran, Yun Sangdoo, Oh Seong Joon
- Hypotermqa: Hypothetical Terms Dataset For Benchmarking Hallucination Tendency Of Llms Uluoglakci Cem, Temizel Tugba Taskaya
- Investigating Context Effects In Similarity Judgements In Large Language Models Uprety Sagar, Jaiswal Amit Kumar, Liu Haiming, Song Dawei
- No Size Fits All: The Perils And Pitfalls Of Leveraging Llms Vary With Company Size Urlana Ashok, Kumar Charaka Vinayak, Garlapati Bala Mallikarjunarao, Singh Ajeet Kumar, Mishra Rahul
- Llms With Industrial Lens: Deciphering The Challenges And Prospects -- A Survey Urlana Ashok, Kumar Charaka Vinayak, Singh Ajeet Kumar, Garlapati Bala Mallikarjunarao, Chalamala Srinivasa Rao, Mishra Rahul
- Accelerating Large Language Model Inference With Self-supervised Early Exits Valade Florian
- Cost-effective Hallucination Detection For Llms Valentin Simon, Fu Jinmiao, Detommaso Gianluca, Xu Shaoyuan, Zappella Giovanni, Wang Bryan
- Improved Baselines For Data-efficient Perceptual Augmentation Of Llms Vallaeys Théophane, Shukor Mustafa, Cord Matthieu, Verbeek Jakob
- Gender Bias In Machine Translation And The Era Of Large Language Models Vanmassenhove Eva
- Investigating And Addressing Hallucinations Of Llms In Tasks Involving Negation Varshney Neeraj, Raj Satyam, Mishra Venkatesh, Chatterjee Agneet, Sarkar Ritika, Saeidi Amir, Baral Chitta
- Infusing Knowledge Into Large Language Models With Contextual Prompts Vasisht Kinshuk, Ganesan Balaji, Kumar Vikas, Bhatnagar Vasudha
- Question Suggestion For Conversational Shopping Assistants Using Product Metadata Vedula Nikhita, Rokhlenko Oleg, Malmasi Shervin
- What's In An Embedding? Would A Rose By Any Embedding Smell As Sweet? Venkatasubramanian Venkat
- Cross-modal Projection In Multimodal Llms Doesn't Really Project Visual Attributes To Textual Space Verma Gaurav, Choi Minje, Sharma Kartik, Watson-daniels Jamelle, Oh Sejoon, Kumar Srijan
- RAG Based Question-answering For Contextual Response Prediction System Veturi Sriram, Vaichal Saurabh, Jagadheesh Reshma Lal, Tripto Nafis Irtiza, Yan Nian
- Input Conditioned Graph Generation For Language Agents Vierling Lukas, Fu Jie, Chen Kai
- Exploring Safety-utility Trade-offs In Personalized Language Models Vijjini Anvesh Rao, Chowdhury Somnath Basu Roy, Chaturvedi Snigdha
- Emotional Manipulation Through Prompt Engineering Amplifies Disinformation Generation In AI Large Language Models Vinay Rasita, Spitale Giovanni, Biller-andorno Nikola, Germani Federico
- Medreqal: Examining Medical Knowledge Recall Of Large Language Models Via Question Answering Vladika Juraj, Schneider Phillip, Matthes Florian
- Vi-mistral-x: Building A Vietnamese Language Model With Advanced Continual Pre-training Vo James
- Foundational Autoraters: Taming Large Language Models For Better Automatic Evaluation Vu Tu, Krishna Kalpesh, Alzubi Salaheddin, Tar Chris, Faruqui Manaal, Sung Yun-hsuan
- Leveraging Zero-shot Prompting For Efficient Language Model Distillation Vöge Lukas, Gurgul Vincent, Lessmann Stefan
- Are Large Language Models Aligned With People's Social Intuitions For Human-robot Interactions? Wachowiak Lennart, Coles Andrew, Celiktutan Oya, Canal Gerard
- From Rags To Rich Parameters: Probing How Language Models Utilize External Knowledge Over Parametric Information For Factual Queries Wadhwa Hitesh, Seetharaman Rahul, Aggarwal Somyaa, Ghosh Reshmi, Basu Samyadeep, Srinivasan Soundararajan, Zhao Wenlong, Chaudhari Shreyas, Aghazadeh Ehsan
- Investigating Mysteries Of Cot-augmented Distillation Wadhwa Somin, Amir Silvio, Wallace Byron C.
- A & B == B & A: Triggering Logical Reasoning Failures In Large Language Models Wan Yuxuan, Wang Wenxuan, Yang Yiliu, Yuan Youliang, Huang Jen-tse, He Pinjia, Jiao Wenxiang, Lyu Michael R.
- Datavist5: A Pre-trained Language Model For Jointly Understanding Text And Data Visualization Wan Zhuoyue, Song Yuanfeng, Li Shuaimin, Zhang Chen Jason, Wong Raymond Chi-wing
- COSMO: Contrastive Streamlined Multimodal Model With Interleaved Pre-training Wang Alex Jinpeng, Li Linjie, Lin Kevin Qinghong, Wang Jianfeng, Lin Kevin, Yang Zhengyuan, Wang Lijuan, Shou Mike Zheng
- GOLF: Goal-oriented Long-term Life Tasks Supported By Human-ai Collaboration Wang Ben
- Task Supportive And Personalized Human-large Language Model Interaction: A User Study Wang Ben, Liu Jiqun, Karimnazarov Jamshed, Thompson Nicolas
- Qrmem: Unleash The Length Limitation Through Question Then Reflection Memory Mechanism Wang Bo, Huang Heyan, Cao Yixin, Ying Jiahao, Tang Wei, Feng Chong
- Direct: Diagnostic Reasoning For Clinical Notes Via Large Language Models Wang Bowen, Chang Jiuyang, Qian Yiming, Chen Guoxin, Chen Junhao, Jiang Zhouqiang, Zhang Jiahao, Nakashima Yuta, Nagahara Hajime
- Hybrid Alignment Training For Large Language Models Wang Chenglong, Zhou Hang, Chang Kaiyan, Li Bei, Mu Yongyu, Xiao Tong, Liu Tongran, Zhu Jingbo
- A Survey For Large Language Models In Biomedicine Wang Chong, Li Mengyao, He Junjun, Wang Zhongruo, Darzi Erfan, Chen Zan, Ye Jin, Li Tianbin, Su Yanzhou, Ke Jing, Qu Kaili, Li Shuxin, Yu Yi, Liò Pietro, Wang Tianyun, Wang Yu Guang, Shen Yiqing
- Ada-leval: Evaluating Long-context Llms With Length-adaptable Benchmarks Wang Chonghua, Duan Haodong, Zhang Songyang, Lin Dahua, Chen Kai
- Planning In Natural Language Improves LLM Search For Code Generation Wang Evan, Cassano Federico, Wu Catherine, Bai Yunfeng, Song Will, Nath Vaskar, Han Ziwen, Hendryx Sean, Yue Summer, Zhang Hugh
- Adapting Open-source Large Language Models For Cost-effective, Expert-level Clinical Note Generation With On-policy Reinforcement Learning Wang Hanyin, Gao Chufan, Liu Bolun, Xu Qiping, Hussein Guleid, Labban Mohamad El, Iheasirim Kingsley, Korsapati Hariprasad, Outcalt Chuck, Sun Jimeng
- Reinforcement Learning For Edit-based Non-autoregressive Neural Machine Translation Wang Hao, Morimura Tetsuro, Honda Ukyo, Kawahara Daisuke
- Blendfilter: Advancing Retrieval-augmented Large Language Models Via Query Generation Blending And Knowledge Filtering Wang Haoyu, Li Ruirui, Jiang Haoming, Tian Jinjin, Wang Zhengyang, Luo Chen, Tang Xianfeng, Cheng Monica, Zhao Tuo, Gao Jing
- Unims-rag: A Unified Multi-source Retrieval-augmented Generation For Personalized Dialogue Systems Wang Hongru, Huang Wenyu, Deng Yang, Wang Rui, Wang Zezhong, Wang Yufei, Mi Fei, Pan Jeff Z., Wong Kam-fai
- Target-constrained Bidirectional Planning For Generation Of Target-oriented Proactive Dialogue Wang Jian, Lin Dongding, Li Wenjie
- Techgpt-2.0: A Large Language Model Project To Solve The Task Of Knowledge Graph Construction Wang Jiaqi, Chang Yuying, Li Zhong, An Ning, Ma Qi, Hei Lei, Luo Haibo, Lu Yifei, Ren Feiliang
- A User-centric Benchmark For Evaluating Large Language Models Wang Jiayin, Mo Fengran, Ma Weizhi, Sun Peijie, Zhang Min, Nie Jian-yun
- Bioinformatics And Biomedical Informatics With Chatgpt: Year One Review Wang Jinge, Cheng Zien, Yao Qiuming, Liu Li, Xu Dong, Hu Gangqing
- Helpful Or Harmful Data? Fine-tuning-free Shapley Attribution For Explaining Language Model Predictions Wang Jingtan, Lin Xiaoqiang, Qiao Rui, Foo Chuan-sheng, Low Bryan Kian Hsiang
- JMLR: Joint Medical LLM And Retrieval Training For Enhancing Reasoning And Professional Question Answering Capability Wang Junda, Yang Zhichao, Yao Zonghai, Yu Hong
- Mixture-of-agents Enhances Large Language Model Capabilities Wang Junlin, Wang Jue, Athiwaratkun Ben, Zhang Ce, Zou James
- Balancing Rigor And Utility: Mitigating Cognitive Biases In Large Language Models For Multiple-choice Questions Wang Liman, Zhong Hanyang, Cao Wenting, Sun Zeyuan
- Conversational Simulmt: Efficient Simultaneous Translation With Large Language Models Wang Minghan, Vu Thuy-trang, Wang Yuxia, Shareghi Ehsan, Haffari Gholamreza
- Exploring The Potential Of Multimodal LLM With Knowledge-intensive Multimodal ASR Wang Minghan, Wang Yuxia, Vu Thuy-trang, Shareghi Ehsan, Haffari Gholamreza
- Leave No Document Behind: Benchmarking Long-context Llms With Extended Multi-doc QA Wang Minzheng, Chen Longze, Fu Cheng, Liao Shengyi, Zhang Xinghua, Wu Bingli, Yu Haiyang, Xu Nan, Zhang Lei, Luo Run, Li Yunshui, Yang Min, Huang Fei, Li Yongbin
- Inferaligner: Inference-time Alignment For Harmlessness Through Cross-model Guidance Wang Pengyu, Zhang Dong, Li Linyang, Tan Chenkun, Wang Xinghao, Ren Ke, Jiang Botian, Qiu Xipeng
- Role Prompting Guided Domain Adaptation With General Capability Preserve For Large Language Models Wang Rui, Mi Fei, Chen Yi, Xue Boyang, Wang Hongru, Zhu Qi, Wong Kam-fai, Xu Ruifeng
- Deconfounded Causality-aware Parameter-efficient Fine-tuning For Problem-solving Improvement Of Llms Wang Ruoyu, Li Xiaoxuan, Yao Lina
- Unique Security And Privacy Threats Of Large Language Model: A Comprehensive Survey Wang Shang, Zhu Tianqing, Liu Bo, Ding Ming, Guo Xu, Ye Dayong, Zhou Wanlei, Yu Philip S.
- What Is The Best Way For Chatgpt To Translate Poetry? Wang Shanshan, Wong Derek F., Yao Jingming, Chao Lidia S.
- Lora Meets Dropout Under A Unified Framework Wang Sheng, Chen Liheng, Jiang Jiyue, Xue Boyang, Kong Lingpeng, Wu Chuan
- Mfc-bench: Benchmarking Multimodal Fact-checking With Large Vision-language Models Wang Shengkang, Lin Hongzhan, Luo Ziyang, Ye Zhen, Chen Guang, Ma Jing
- USCD: Improving Code Generation Of Llms By Uncertainty-aware Selective Contrastive Decoding Wang Shuai, Ding Liang, Shen Li, Luo Yong, He Zheng, Yu Wei, Tao Dacheng
- Resonance Rope: Improving Context Length Generalization Of Large Language Models Wang Suyuchen, Kobyzev Ivan, Lu Peng, Rezagholizadeh Mehdi, Liu Bang
- Weaver: Foundation Models For Creative Writing Wang Tiannan, Chen Jiamin, Jia Qingrui, Wang Shuai, Fang Ruoyu, Wang Huilin, Gao Zhaowei, Xie Chunzhao, Xu Chuou, Dai Jihong, Liu Yibin, Wu Jialong, Ding Shengwei, Li Long, Huang Zhiwei, Deng Xinle, Yu Teng, Ma Gangan, Xiao Han, Chen Zixin, Xiang Danjun, Wang Yunxia, Zhu Yuanyuan, Xiao Yi, Wang Jing, Wang Yiru, Ding Siran, Huang Jiayang, Xu Jiayi, Tayier Yilihamu, Hu Zhenyu, Gao Yuan, Zheng Chengfeng, Ye Yueshu, Li Yihang, Wan Lei, Jiang Xinyue, Wang Yujie, Cheng Siyu, Song Zhule, Tang Xiangru, Xu Xiaohua, Zhang Ningyu, Chen Huajun, Jiang Yuchen Eleanor, Zhou Wangchunshu
- Less Is More For Improving Automatic Evaluation Of Factual Consistency Wang Tong, Kulkarni Ninad, Qi Yanjun
- Testing And Evaluation Of Large Language Models: Correctness, Non-toxicity, And Fairness Wang Wenxuan
- FSM: A Finite State Machine Based Zero-shot Prompting Paradigm For Multi-hop Question Answering Wang Xiaochen, He Junqing, Yang Zhe, Wang Yiru, Meng Xiangdi, Pan Kunhao, Sui Zhifang
- Negativeprompt: Leveraging Psychology For Large Language Models Enhancement Via Negative Emotional Stimuli Wang Xu, Li Cheng, Chang Yi, Wang Jindong, Wu Yuan
- Dolphcoder: Echo-locating Code Large Language Models With Diverse And Multi-objective Instruction Tuning Wang Yejie, He Keqing, Dong Guanting, Wang Pei, Zeng Weihao, Diao Muxi, Mou Yutao, Zhang Mengdi, Wang Jingang, Cai Xunliang, Xu Weiran
- DHP Benchmark: Are Llms Good NLG Evaluators? Wang Yicheng, Yuan Jiayi, Chuang Yu-neng, Wang Zhuoer, Liu Yingchi, Cusick Mark, Kulkarni Param, Ji Zhengping, Ibrahim Yasser, Hu Xia
- Inscl: A Data-efficient Continual Learning Paradigm For Fine-tuning Large Language Models With Instructions Wang Yifan, Liu Yafei, Shi Chufan, Li Haoling, Chen Chen, Lu Haonan, Yang Yujiu
- A Theoretical Understanding Of Self-correction Through In-context Alignment Wang Yifei, Wu Yuyang, Wei Zeming, Jegelka Stefanie, Wang Yisen
- Speak It Out: Solving Symbol-related Problems With Symbol-to-language Conversion For Language Models Wang Yile, Cheng Sijie, Sun Zixin, Li Peng, Liu Yang
- Ladic: Are Diffusion Models Really Inferior To Autoregressive Counterparts For Image-to-text Generation? Wang Yuchi, Ren Shuhuai, Gao Rundong, Yao Linli, Guo Qingyan, An Kaikai, Bai Jianhong, Sun Xu
- REAR: A Relevance-aware Retrieval-augmented Framework For Open-domain Question Answering Wang Yuhao, Ren Ruiyang, Li Junyi, Zhao Wayne Xin, Liu Jing, Wen Ji-rong
- Sibyl: Simple Yet Effective Agent Framework For Complex Real-world Reasoning Wang Yulong, Shen Tianhao, Liu Lifeng, Xie Jian
- Videollamb: Long-context Video Understanding With Recurrent Memory Bridges Wang Yuxuan, Xie Cihang, Liu Yang, Zheng Zilong
- To Recommend Or Not: Recommendability Identification In Conversations With Pre-trained Language Models Wang Zhefan, Ma Weizhi, Zhang Min
- M-RAG: Reinforcing Large Language Model Performance Through Retrieval-augmented Generation With Multiple Partitions Wang Zheng, Teo Shu Xian, Ouyang Jieer, Xu Yongjun, Shi Wei
- Trove: Inducing Verifiable And Efficient Toolboxes For Solving Programmatic Tasks Wang Zhiruo, Fried Daniel, Neubig Graham
- RAT: Retrieval Augmented Thoughts Elicit Context-aware Reasoning In Long-horizon Generation Wang Zihao, Liu Anji, Lin Haowei, Li Jiaqi, Ma Xiaojian, Liang Yitao
- Farsight: Fostering Responsible AI Awareness During AI Application Prototyping Wang Zijie J., Kulkarni Chinmay, Wilcox Lauren, Terry Michael, Madaio Michael
- Eliminating Position Bias Of Language Models: A Mechanistic Approach Wang Ziqi, Zhang Hanlin, Li Xiner, Huang Kuan-hao, Han Chi, Ji Shuiwang, Kakade Sham M., Peng Hao, Ji Heng
- Large Language Models Are Pattern Matchers: Editing Semi-structured And Structured Documents With Chatgpt Weber Irene
- Hidden In Plain Sight: Exploring Chat History Tampering In Interactive Language Models Wei Cheng'an, Zhao Yue, Gong Yujia, Chen Kai, Xiang Lu, Zhu Shenchen
- Rethinking Generative Large Language Model Evaluation For Semantic Comprehension Wei Fangyun, Chen Xi, Luo Lin
- Systematic Evaluation Of Llm-as-a-judge In LLM Alignment Tasks: Explainable Metrics And Diverse Prompt Templates Wei Hui, He Shenghua, Xia Tian, Wong Andy, Lin Jingyang, Han Mei
- Performance Evaluation Of Lightweight Open-source Large Language Models In Pediatric Consultations: A Comparative Analysis Wei Qiuhong, Cui Ying, Ding Mengwei, Wang Yanqin, Xiang Lingling, Yao Zhengxiong, Chen Ceran, Long Ying, Jin Zhezhen, Xu Ximing
- Unveiling Selection Biases: Exploring Order And Token Sensitivity In Large Language Models Wei Sheng-lun, Wu Cheng-kuang, Huang Hen-hsen, Chen Hsin-hsi
- Leveraging Large Language Models For Patient Engagement: The Power Of Conversational AI In Digital Health Wen Bo, Norel Raquel, Liu Julia, Stappenbeck Thaddeus, Zulkernine Farhana, Chen Huamin
- Grounding Data Science Code Generation With Input-output Specifications Wen Yeming, Yin Pengcheng, Shi Kensen, Michalewski Henryk, Chaudhuri Swarat, Polozov Alex
- Towards Knowledge-grounded Natural Language Understanding And Generation Whitehouse Chenxi
- Answer, Assemble, Ace: Understanding How Transformers Answer Multiple Choice Questions Wiegreffe Sarah, Tafjord Oyvind, Belinkov Yonatan, Hajishirzi Hannaneh, Sabharwal Ashish
- Training-free Exponential Extension Of Sliding Window Context With Cascading KV Cache Willette Jeffrey, Lee Heejun, Lee Youngwan, Jeon Myeongjae, Hwang Sung Ju
- Easy Problems That Llms Get Wrong Williams Sean, Huckle James
- CBR-RAG: Case-based Reasoning For Retrieval Augmented Generation In Llms For Legal Question Answering Wiratunga Nirmalie, Abeyratne Ramitha, Jayawardena Lasal, Martin Kyle, Massie Stewart, Nkisi-orji Ikechukwu, Weerasinghe Ruvan, Liret Anne, Fleisch Bruno
- Distilrr: Transferring Code Repair For Low-resource Programming Languages Wong Kyle, Amayuelas Alfonso, Pan Liangming, Wang William Yang
- Effects Of A Prompt Engineering Intervention On Undergraduate Students' AI Self-efficacy, AI Knowledge And Prompt Engineering Ability: A Mixed Methods Study Woo David James, Wang Deliang, Yung Tim, Guo Kai
- Transferable Speech-to-text Large Language Model Alignment Module Wu Boyong, Yan Chao, Pu Haoran
- Towards Evaluating And Building Versatile Large Language Models For Medicine Wu Chaoyi, Qiu Pengcheng, Liu Jinxin, Gu Hongfei, Li Na, Zhang Ya, Wang Yanfeng, Xie Weidi
- Performance Law Of Large Language Models Wu Chuhan, Tang Ruiming
- Generative Monoculture In Large Language Models Wu Fan, Black Emily, Chandrasekaran Varun
- Medkp: Medical Dialogue With Knowledge Enhancement And Clinical Pathway Encoding Wu Jiageng, Wu Xian, Zheng Yefeng, Yang Jie
- Gendec: A Robust Generative Question-decomposition Method For Multi-hop Reasoning Wu Jian, Yang Linyi, Ji Yuliang, Huang Wenhao, Karlsson Börje F., Okumura Manabu
- Visionllm V2: An End-to-end Generalist Multimodal Large Language Model For Hundreds Of Vision-language Tasks Wu Jiannan, Zhong Muyan, Xing Sen, Lai Zeqiang, Liu Zhaoyang, Wang Wenhai, Chen Zhe, Zhu Xizhou, Lu Lewei, Lu Tong, Luo Ping, Qiao Yu, Dai Jifeng
- Uncertainty Estimation Of Large Language Models In Medical Question Answering Wu Jiaxin, Yu Yizhou, Zhou Hong-yu
- RLPF: Reinforcement Learning From Prediction Feedback For User Summarization With Llms Wu Jiaxing, Ning Lin, Liu Luyang, Lee Harrison, Wu Neo, Wang Chao, Prakash Sushant, O'banion Shawn, Green Bradley, Xie Jun
- What Are The Limits Of Cross-lingual Dense Passage Retrieval For Low-resource Languages? Wu Jie, Ren Zhaochun, Verberne Suzan
- Can Large Language Models Understand Uncommon Meanings Of Common Words? Wu Jinyang, Che Feihu, Zheng Xinxin, Zhang Shuai, Jin Ruihan, Nie Shuai, Shao Pengpeng, Tao Jianhua
- Noiseboost: Alleviating Hallucination With Noise Perturbation For Multimodal Large Language Models Wu Kai, Jiang Boyuan, Jiang Zhengkai, He Qingdong, Luo Donghao, Wang Shengzhi, Liu Qingwen, Wang Chengjie
- Adapting Large Language Models For Document-level Machine Translation Wu Minghao, Vu Thuy-trang, Qu Lizhen, Foster George, Haffari Gholamreza
- ITERTL: An Iterative Framework For Fine-tuning Llms For RTL Code Generation Wu Peiyang, Guo Nan, Xiao Xiao, Li Wenming, Ye Xiaochun, Fan Dongrui
- Videollm-mod: Efficient Video-language Streaming With Mixture-of-depths Vision Computation Wu Shiwei, Chen Joya, Lin Kevin Qinghong, Wang Qimeng, Gao Yan, Xu Qianli, Xu Tong, Hu Yao, Chen Enhong, Shou Mike Zheng
- Tablebench: A Comprehensive And Complex Benchmark For Table Question Answering Wu Xianjie, Yang Jian, Chai Linzheng, Zhang Ge, Liu Jiaheng, Du Xinrun, Liang Di, Shu Daixin, Cheng Xianfu, Sun Tianzhen, Niu Guanglin, Li Tongliang, Li Zhoujun
- Switchcit: Switching For Continual Instruction Tuning Of Large Language Models Wu Xinbo, Hartman Max, Jayaraman Vidhata Arjun, Varshney Lav R.
- Transformer-based Causal Language Models Perform Clustering Wu Xinbo, Varshney Lav R.
- Passage-specific Prompt Tuning For Passage Reranking In Question Answering With Large Language Models Wu Xuyang, Peng Zhiyuan, Sai Krishna Sravanthi Rajanala, Wu Hsin-tai, Fang Yi
- Knowledge-infused Legal Wisdom: Navigating LLM Consultation Through The Lens Of Diagnostics And Positive-unlabeled Reinforcement Learning Wu Yang, Wang Chenghao, Gumusel Ece, Liu Xiaozhong
- Which Questions Should I Answer? Salience Prediction Of Inquisitive Questions Wu Yating, Mangla Ritika, Dimakis Alexandros G., Durrett Greg, Li Junyi Jessy
- Agentkit: Structured LLM Reasoning With Dynamic Graphs Wu Yue, Fan Yewen, Min So Yeon, Prabhumoye Shrimai, Mcaleer Stephen, Bisk Yonatan, Salakhutdinov Ruslan, Li Yuanzhi, Mitchell Tom
- Longgenbench: Benchmarking Long-form Generation In Long Context Llms Wu Yuhao, Hee Ming Shan, Hu Zhiqing, Lee Roy Ka-wei
- Inversecoder: Unleashing The Power Of Instruction-tuned Code Llms With Inverse-instruct Wu Yutong, Huang Di, Shi Wenxuan, Wang Wei, Gao Lingzhe, Liu Shihao, Nan Ziyuan, Yuan Kaizhao, Zhang Rui, Zhang Xishan, Du Zidong, Guo Qi, Pu Yewen, Yin Dawei, Hu Xing, Chen Yunji
- Large Language Models Can Self-correct With Minimal Effort Wu Zhenyu, Zeng Qingkai, Zhang Zhihan, Tan Zhaoxuan, Shen Chao, Jiang Meng
- FP6-LLM: Efficiently Serving Large Language Models Through Fp6-centric Algorithm-system Co-design Xia Haojun, Zheng Zhen, Wu Xiaoxia, Chen Shiyang, Yao Zhewei, Youn Stephen, Bakhtiari Arash, Wyatt Michael, Zhuang Donglin, Zhou Zhongzhu, Ruwase Olatunji, He Yuxiong, Song Shuaiwen Leon
- LESS: Selecting Influential Data For Targeted Instruction Tuning Xia Mengzhou, Malladi Sadhika, Gururangan Suchin, Arora Sanjeev, Chen Danqi
- Language Models Can Evaluate Themselves Via Probability Discrepancy Xia Tingyu, Yu Bowen, Wu Yuan, Chang Yi, Zhou Chang
- Hallucination Diversity-aware Active Learning For Text Summarization Xia Yu, Liu Xu, Yu Tong, Kim Sungchul, Rossi Ryan A., Rao Anup, Mai Tung, Li Shuai
- What To Do If Language Models Disagree? Black-box Model Ensembling For Textual And Visual Question Answering Xia Yuxi, Zaporojets Klim, Roth Benjamin
- Guardagent: Safeguard LLM Agents By A Guard Agent Via Knowledge-enabled Reasoning Xiang Zhen, Zheng Linzhi, Li Yanjie, Hong Junyuan, Li Qinbin, Xie Han, Zhang Jiawei, Xiong Zidi, Xie Chulin, Yang Carl, Song Dawn, Li Bo
- Clover-2: Accurate Inference For Regressive Lightweight Speculative Decoding Xiao Bin, Gui Lujun, Su Lei, Chen Weipeng
- Infllm: Training-free Long-context Extrapolation For Llms With An Efficient Context Memory Xiao Chaojun, Zhang Pengle, Han Xu, Xiao Guangxuan, Lin Yankai, Zhang Zhengyan, Liu Zhiyuan, Sun Maosong
- Rar-b: Reasoning As Retrieval Benchmark Xiao Chenghao, Hudson G Thomas, Moubayed Noura Al
- A Comprehensive Survey Of Large Language Models And Multimodal Large Language Models In Medicine Xiao Hanguang, Zhou Feizhong, Liu Xingyue, Liu Tianqi, Li Zhipeng, Liu Xin, Huang Xiaoxuan
- EMP: Enhance Memory In Data Pruning Xiao Jinying, Li Ping, Nie Jie, Tang Zhe
- Videoqa In The Era Of Llms: An Empirical Study Xiao Junbin, Huang Nanxin, Qin Hangyu, Li Dongyang, Li Yicong, Zhu Fengbin, Tao Zhulin, Yu Jianxing, Lin Liang, Chua Tat-seng, Yao Angela
- Palm2-vadapter: Progressively Aligned Language Model Makes A Strong Vision-language Adapter Xiao Junfei, Xu Zheng, Yuille Alan, Yan Shen, Wang Boyu
- Alleviating Hallucination In Large Vision-language Models With Active Retrieval Augmentation Qu Xiaoye, Chen Qiyuan, Wei Wei, Sun Jishuo, Dong Jianfeng
- Large Multimodal Agents: A Survey Xie Junlin, Chen Zhihong, Zhang Ruifei, Wan Xiang, Li Guanbin
- Me Llama: Foundation Large Language Models For Medical Applications Xie Qianqian, Chen Qingyu, Chen Aokun, Peng Cheng, Hu Yan, Lin Fongci, Peng Xueqing, Huang Jimin, Zhang Jeffrey, Keloth Vipina, Zhou Xinyu, He Huan, Ohno-machado Lucila, Wu Yonghui, Xu Hua, Bian Jiang
- Wanglab At MEDIQA-M3G 2024: Multimodal Medical Answer Generation Using Large Language Models Xie Ronald, Palayew Steven, Toma Augustin, Bader Gary, Wang Bo
- Sorry-bench: Systematically Evaluating Large Language Model Safety Refusal Behaviors Xie Tinghao, Qi Xiangyu, Zeng Yi, Huang Yangsibo, Sehwag Udari Madhushani, Huang Kaixuan, He Luxi, Wei Boyi, Li Dacheng, Sheng Ying, Jia Ruoxi, Li Bo, Li Kai, Chen Danqi, Henderson Peter, Mittal Prateek
- Order Matters In Hallucination: Reasoning Order As Benchmark And Reflexive Prompting For Large-language-models Xie Zikai
- Benchmarking Retrieval-augmented Generation For Medicine Xiong Guangzhi, Jin Qiao, Lu Zhiyong, Zhang Aidong
- Interactive-kbqa: Multi-turn Interactions For Knowledge Base Question Answering With Large Language Models Xiong Guanming, Bao Junwei, Zhao Wen
- Converging Paradigms: The Synergy Of Symbolic And Connectionist AI In Llm-empowered Autonomous Agents Xiong Haoyi, Wang Zhiyuan, Li Xuhong, Bian Jiang, Xie Zeke, Mumtaz Shahid, Barnes Laura E.
- Temporal Scaling Law For Large Language Models Xiong Yizhe, Chen Xiansheng, Ye Xin, Chen Hui, Lin Zijia, Lian Haoran, Su Zhenpeng, Niu Jianwei, Ding Guiguang
- From Artificial Needles To Real Haystacks: Improving Retrieval Capabilities In Llms By Finetuning On Synthetic Data Xiong Zheyang, Papageorgiou Vasilis, Lee Kangwook, Papailiopoulos Dimitris
- Empowering 1000 Tokens/second On-device LLM Prefilling With Mllm-npu Xu Daliang, Zhang Hao, Yang Liming, Liu Ruiqi, Huang Gang, Xu Mengwei, Liu Xuanzhe
- Large Language Models For Education: A Survey Xu Hanyi, Gan Wensheng, Qi Zhenlian, Wu Jiayang, Yu Philip S.
- Contrastive Preference Optimization: Pushing The Boundaries Of LLM Performance In Machine Translation Xu Haoran, Sharaf Amr, Chen Yunmo, Tan Weiting, Shen Lingfeng, Van Durme Benjamin, Murray Kenton, Kim Young Jin
- Redagent: Red Teaming Large Language Models With Context-aware Autonomous Language Agent Xu Huiyu, Zhang Wenhui, Wang Zhibo, Xiao Feng, Zheng Rui, Feng Yunhe, Ba Zhongjie, Ren Kui
- On-device Language Models: A Comprehensive Review Xu Jiajun, Li Zhiyuan, Chen Wei, Wang Qun, Gao Xin, Cai Qi, Ling Ziyuan
- A General-purpose Device For Interaction With Llms Xu Jiajun, Wang Qun, Cao Yuhang, Zeng Baitao, Liu Sicheng
- Reverse Image Retrieval Cues Parametric Memory In Multimodal Llms Xu Jialiang, Moor Michael, Leskovec Jure
- Meteora: Multiple-tasks Embedded Lora For Large Language Models Xu Jingwei, Lai Junyu, Huang Yunpeng
- Prompting Large Language Models For Recommender Systems: A Comprehensive Framework And Empirical Analysis Xu Lanling, Zhang Junjie, Li Bingqian, Wang Jinpeng, Cai Mingchen, Zhao Wayne Xin, Wen Ji-rong
- BESA: Pruning Large Language Models With Blockwise Parameter-efficient Sparsity Allocation Xu Peng, Shao Wenqi, Chen Mengzhao, Tang Shitao, Zhang Kaipeng, Gao Peng, An Fengwei, Qiao Yu, Luo Ping
- MINDECHO: Role-playing Language Agents For Key Opinion Leaders Xu Rui, Lu Dakuan, Tan Xiaoyu, Wang Xintao, Yuan Siyu, Chen Jiangjie, Chu Wei, Xu Yinghui
- Cruxeval-x: A Benchmark For Multilingual Code Reasoning, Understanding And Execution Xu Ruiyang, Cao Jialun, Lu Yaojie, Lin Hongyu, Han Xianpei, He Ben, Cheung Shing-chi, Sun Le
- Unsupervised Information Refinement Training Of Large Language Models For Retrieval-augmented Generation Xu Shicheng, Pang Liang, Yu Mo, Meng Fandong, Shen Huawei, Cheng Xueqi, Zhou Jie
- Is DPO Superior To PPO For LLM Alignment? A Comprehensive Study Xu Shusheng, Fu Wei, Gao Jiaxuan, Ye Wenjie, Liu Weilin, Mei Zhiyu, Wang Guangju, Yu Chao, Wu Yi
- Sayself: Teaching Llms To Express Confidence With Self-reflective Rationales Xu Tianyang, Wu Shujin, Diao Shizhe, Liu Xiaoze, Wang Xingyao, Chen Yangyi, Gao Jing
- Can Llms Solve Longer Math Word Problems Better? Xu Xin, Xiao Tong, Chao Zitong, Huang Zhenya, Yang Can, Wang Yang
- Bridging The Gap Between Different Vocabularies For LLM Ensemble Xu Yangyifan, Lu Jinliang, Zhang Jiajun
- Generate-on-graph: Treat LLM As Both Agent And KG In Incomplete Knowledge Graph Question Answering Xu Yao, He Shizhu, Chen Jiabei, Wang Zihao, Song Yangqiu, Tong Hanghang, Liu Kang, Zhao Jun
- Chatglm-math: Improving Math Problem-solving In Large Language Models With A Self-critique Pipeline Xu Yifan, Liu Xiao, Liu Xinghan, Hou Zhenyu, Li Yueyan, Zhang Xiaohan, Wang Zihan, Zeng Aohan, Du Zhengxiao, Zhao Wenyi, Tang Jie, Dong Yuxiao
- Think: Thinner Key Cache By Query-driven Pruning Xu Yuhui, Jie Zhanming, Dong Hanze, Wang Lei, Lu Xudong, Zhou Aojun, Saha Amrita, Xiong Caiming, Sahoo Doyen
- Safedecoding: Defending Against Jailbreak Attacks Via Safety-aware Decoding Xu Zhangchen, Jiang Fengqing, Niu Luyao, Jia Jinyuan, Lin Bill Yuchen, Poovendran Radha
- Beyond Perplexity: Multi-dimensional Safety Evaluation Of LLM Compression Xu Zhichao, Gupta Ashim, Li Tao, Bentham Oliver, Srikumar Vivek
- Question Calibration And Multi-hop Modeling For Temporal Question Answering Xue Chao, Liang Di, Wang Pengfei, Zhang Jing
- Multi-programming Language Ensemble For Code Generation In Large Language Model Xue Tengfei, Li Xuefeng, Azim Tahir, Smirnov Roman, Yu Jianhui, Sadrieh Arash, Pahlavan Babak
- Pythonsaga: Redefining The Benchmark To Evaluate Code Generating Llms Yadav Ankit, Beniwal Himanshu, Singh Mayank
- Mitigating LLM Hallucinations Via Conformal Abstention Yadkori Yasin Abbasi, Kuzborskij Ilja, Stutz David, György András, Fisch Adam, Doucet Arnaud, Beloshapka Iuliya, Weng Wei-hung, Yang Yao-yuan, Szepesvári Csaba, Cemgil Ali Taylan, Tomasev Nenad
- Medlm: Exploring Language Models For Medical Question Answering Systems Yagnik Niraj, Jhaveri Jay, Sharma Vivek, Pila Gabriel
- LEIA: Facilitating Cross-lingual Knowledge Transfer In Language Models With Entity-based Data Augmentation Yamada Ikuya, Ri Ryokan
- An Empirical Study On Cross-lingual Vocabulary Adaptation For Efficient Language Model Inference Yamaguchi Atsuki, Villavicencio Aline, Aletras Nikolaos
- Refutebench: Evaluating Refuting Instruction-following For Large Language Models Yan Jianhao, Luo Yun, Zhang Yue
- Echosight: Advancing Visual-language Models With Wiki Knowledge Yan Yibin, Xie Weidi
- Advancing Genai Assisted Programming: A Comparative Study On Prompt Efficiency And Code Quality Between GPT-4 And GLM-4 Yang Angus, Li Zehan, Li Jie
- 3D Vision And Language Pretraining With Large-scale Synthetic Data Yang Dejie, Xu Zhu, Mo Wentao, Chen Qingchao, Huang Siyuan, Liu Yang
- IM-RAG: Multi-round Retrieval-augmented Generation Through Learning Inner Monologues Yang Diji, Rao Jinmeng, Chen Kezhen, Guo Xiaoyuan, Zhang Yawen, Yang Jie, Zhang Yi
- Pyramidinfer: Pyramid KV Cache Compression For High-throughput LLM Inference Yang Dongjie, Han Xiaodong, Gao Yan, Hu Yao, Zhang Shilin, Zhao Hai
- The Geometry Of Queries: Query-based Innovations In Retrieval-augmented Generation Yang Eric, Amar Jonathan, Lee Jong Ha, Kumar Bhawesh, Jia Yugang
- Aligning Llms Through Multi-perspective User Preference Ranking-based Feedback For Programming Question Answering Yang Hongyu, He Liyang, Hou Min, Shen Shuanghong, Li Rui, Hou Jiahui, Ma Jianhui, Zhao Junda
- Toward Automatic Relevance Judgment Using Vision-language Models For Image-text Retrieval Evaluation Yang Jheng-hong, Lin Jimmy
- Asynchronous And Segmented Bidirectional Encoding For NMT Yang Jingpu, Han Zehua, Xiang Mengyu, Wang Helin, Huang Yuxiao, Fang Miao
- If LLM Is The Wizard, Then Code Is The Wand: A Survey On How Code Empowers Large Language Models To Serve As Intelligent Agents Yang Ke, Liu Jiateng, Wu John, Yang Chaoqi, Fung Yi R., Li Sha, Huang Zixuan, Cao Xu, Wang Xingyao, Wang Yiquan, Ji Heng, Zhai Chengxiang
- Regularizing Hidden States Enables Learning Generalizable Reward Model For Llms Yang Rui, Ding Ruomeng, Lin Yong, Zhang Huan, Zhang Tong
- Video As The New Language For Real-world Decision Making Yang Sherry, Walker Jacob, Parker-holder Jack, Du Yilun, Bruce Jake, Barreto Andre, Abbeel Pieter, Schuurmans Dale
- Seed-story: Multimodal Long Story Generation With Large Language Model Yang Shuai, Ge Yuying, Li Yang, Chen Yukang, Ge Yixiao, Shan Ying, Chen Yingcong
- Do Large Language Models Latently Perform Multi-hop Reasoning? Yang Sohee, Gribovskaya Elena, Kassner Nora, Geva Mor, Riedel Sebastian
- CRAG -- Comprehensive RAG Benchmark Yang Xiao, Sun Kai, Xin Hao, Sun Yushi, Bhalla Nikita, Chen Xiangsen, Choudhary Sajal, Gui Rongze Daniel, Jiang Ziran Will, Jiang Ziyu, Kong Lingkun, Moran Brian, Wang Jiaqi, Xu Yifan Ethan, Yan An, Yang Chenyu, Yuan Eting, Zha Hanwen, Tang Nan, Chen Lei, Scheffer Nicolas, Liu Yue, Shah Nirav, Wanga Rakesh, Kumar Anuj, Yih Wen-tau, Dong Xin Luna
- Fine-tuning Large Language Models With Human-inspired Learning Strategies In Medical Question Answering Yang Yushi, Bean Andrew M., Mccraith Robert, Mahdi Adam
- Recent Advances Of Foundation Language Models-based Continual Learning: A Survey Yang Yutao, Zhou Jie, Ding Xuanwen, Huai Tianyu, Liu Shunyu, Chen Qin, He Liang, Xie Yuan
- Assessing Adversarial Robustness Of Large Language Models: An Empirical Study Yang Zeyu, Meng Zhao, Zheng Xiaochen, Wattenhofer Roger
- Adversarial Attacks And Defense For Conversation Entailment Task Yang Zhenning, Krawec Ryan, Wu Liang-yuan
- Curiousllm: Elevating Multi-document QA With Reasoning-infused Knowledge Graph Prompting Yang Zukang, Zhu Zixuan
- Training Ultra Long Context Language Model With Fully Pipelined Distributed Transformer Yao Jinghan, Jacobs Sam Ade, Tanaka Masahiro, Ruwase Olatunji, Shafi Aamir, Subramoni Hari, Panda Dhabaleswar K.
- Sirllm: Streaming Infinite Retentive LLM Yao Yao, Li Zuchao, Zhao Hai
- Open-domain Implicit Format Control For Large Language Model Generation Yao Yiqun, Ma Wenjia, Fang Xuezhi, Jiang Xin, Li Xiang, Meng Xuying, Han Peng, Li Jing, Sun Aixin, Wang Yequan
- Seakr: Self-aware Knowledge Retrieval For Adaptive Retrieval Augmented Generation Yao Zijun, Qi Weijian, Pan Liangming, Cao Shulin, Hu Linmei, Liu Weichuan, Hou Lei, Li Juanzi
- Preference-guided Reflective Sampling For Aligning Language Models Ye Hai, Ng Hwee Tou
- Dataframe QA: A Universal LLM Framework On Dataframe Question Answering Without Data Exposure Ye Junyi, Du Mengnan, Wang Guiling
- Boosting Conversational Question Answering With Fine-grained Retrieval-augmentation And Self-check Ye Linhao, Lei Zhikai, Yin Jianghao, Chen Qin, Zhou Jie, He Liang
- Harnessing Multimodal Large Language Models For Multimodal Sequential Recommendation Ye Yuyang, Zheng Zhi, Shen Yishan, Wang Tianshu, Zhang Hengruo, Zhu Peijun, Yu Runlong, Zhang Kai, Xiong Hui
- Long-context Language Modeling With Parallel Context Encoding Yen Howard, Gao Tianyu, Chen Danqi
- Towards Fast Multilingual LLM Inference: Speculative Decoding And Specialized Drafters Yi Euiin, Kim Taehyeon, Jeung Hongseok, Chang Du-seong, Yun Se-young
- Jailbreak Attacks And Defenses Against Large Language Models: A Survey Yi Sibo, Liu Yule, Sun Zhen, Cong Tianshuo, He Xinlei, Song Jiaxing, Xu Ke, Li Qi
- A Survey On Recent Advances In Llm-based Multi-turn Dialogue Systems Yi Zihao, Ouyang Jiarui, Liu Yuwen, Liao Tianhao, Xu Zhe, Shen Ying
- The Task-oriented Queries Benchmark (toqb) Yim Keun Soo
- Characterizing Truthfulness In Large Language Model Generations With Local Intrinsic Dimension Yin Fan, Srinivasa Jayanth, Chang Kai-wei
- Do Large Language Model Understand Multi-intent Spoken Language? Yin Shangjian, Huang Peijie, Xu Yuhong, Huang Haojing, Chen Jiatian
- Relative Preference Optimization: Enhancing LLM Alignment Through Contrasting Responses Across Identical And Diverse Prompts Yin Yueqin, Wang Zhendong, Gu Yi, Huang Hai, Chen Weizhu, Zhou Mingyuan
- Vqattack: Transferable Adversarial Attacks On Visual Question Answering Via Pre-trained Models Yin Ziyi, Ye Muchao, Zhang Tianrong, Wang Jiaqi, Liu Han, Chen Jinghui, Wang Ting, Ma Fenglong
- Large Language Model With Graph Convolution For Recommendation Du Yingpeng, Wang Ziyan, Sun Zhu, Chua Haoyan, Liu Hongzhi, Wu Zhonghai, Ma Yining, Zhang Jie, Sun Youchen
- Can Large Language Models Faithfully Express Their Intrinsic Uncertainty In Words? Yona Gal, Aharoni Roee, Geva Mor
- Hyperclova X Technical Report Yoo Kang Min, Han Jaegeun, In Sookyo, Jeon Heewon, Jeong Jisu, Kang Jaewook, Kim Hyunwook, Kim Kyung-min, Kim Munhyong, Kim Sungju, Kwak Donghyun, Kwak Hanock, Kwon Se Jung, Lee Bado, Lee Dongsoo, Lee Gichang, Lee Jooho, Park Baeseong, Shin Seongjin, Yu Joonsang, Baek Seolki, Byeon Sumin, Cho Eungsup, Choe Dooseok, Han Jeesung, Jin Youngkyun, Jun Hyein, Jung Jaeseung, Kim Chanwoong, Kim Jinhong, Kim Jinuk, Lee Dokyeong, Park Dongwook, Sohn Jeong Min, Han Sujung, Heo Jiae, Hong Sungju, Jeon Mina, Jung Hyunhoon, Jung Jungeun, Jung Wangkyo, Kim Chungjoon, Kim Hyeri, Kim Jonghyun, Kim Min Young, Lee Soeun, Park Joonhee, Shin Jieun, Yang Sojin, Yoon Jungsoon, Lee Hwaran, Bae Sanghwan, Cha Jeehwan, Gylleus Karl, Ham Donghoon, Hong Mihak, Hong Youngki, Hong Yunki, Jang Dahyun, Jeon Hyojun, Jeon Yujin, Jeong Yeji, Ji Myunggeun, Jin Yeguk, Jo Chansong, Joo Shinyoung, Jung Seunghwan, Kim Adrian Jungmyung, Kim Byoung Hoon, Kim Hyomin, Kim Jungwhan, Kim Minkyoung, Kim Minseung, Kim Sungdong, Kim Yonghee, Kim Youngjun, Kim Youngkwan, Ko Donghyeon, Lee Dughyun, Lee Ha Young, Lee Jaehong, Lee Jieun, Lee Jonghyun, Lee Jongjin, Lee Min Young, Lee Yehbin, Min Taehong, Min Yuri, Moon Kiyoon, Oh Hyangnam, Park Jaesun, Park Kyuyon, Park Younghun, Seo Hanbae, Seo Seunghyun, Sim Mihyun, Son Gyubin, Yeo Matt, Yeom Kyung Hoon, Yoo Wonjoon, You Myungin, Ahn Doheon, Ahn Homin, Ahn Joohee, Ahn Seongmin, An Chanwoo, An Hyeryun, An Junho, An Sang-min, Byun Boram, Byun Eunbin, Cha Jongho, Chang Minji, Chang Seunggyu, Cho Haesong, Cho Youngdo, Choi Dalnim, Choi Daseul, Choi Hyoseok, Choi Minseong, Choi Sangho, Choi Seongjae, Choi Wooyong, Chun Sewhan, Go Dong Young, Ham Chiheon, Han Danbi, Han Jaemin, Hong Moonyoung, Hong Sung Bum, Hwang Dong-hyun, Hwang Seongchan, Im Jinbae, Jang Hyuk Jin, Jang Jaehyung, Jang Jaeni, Jang Sihyeon, Jang Sungwon, Jeon Joonha, Jeong Daun, Jeong Joonhyun, Jeong Kyeongseok, Jeong Mini, Jin Sol, Jo Hanbyeol, Jo Hanju, Jo Minjung, Jung 
Chaeyoon, Jung Hyungsik, Jung Jaeuk, Jung Ju Hwan, Jung Kwangsun, Jung Seungjae, Ka Soonwon, Kang Donghan, Kang Soyoung, Kil Taeho, Kim Areum, Kim Beomyoung, Kim Byeongwook, Kim Daehee, Kim Dong-gyun, Kim Donggook, Kim Donghyun, Kim Euna, Kim Eunchul, Kim Geewook, Kim Gyu Ri, Kim Hanbyul, Kim Heesu, Kim Isaac, Kim Jeonghoon, Kim Jihye, Kim Joonghoon, Kim Minjae, Kim Minsub, Kim Pil Hwan, Kim Sammy, Kim Seokhun, Kim Seonghyeon, Kim Soojin, Kim Soong, Kim Soyoon, Kim Sunyoung, Kim Taeho, Kim Wonho, Kim Yoonsik, Kim You Jin, Kim Yuri, Kwon Beomseok, Kwon Ohsung, Kwon Yoo-hwan, Lee Anna, Lee Byungwook, Lee Changho, Lee Daun, Lee Dongjae, Lee Ha-ram, Lee Hodong, Lee Hwiyeong, Lee Hyunmi, Lee Injae, Lee Jaeung, Lee Jeongsang, Lee Jisoo, Lee Jongsoo, Lee Joongjae, Lee Juhan, Lee Jung Hyun, Lee Junghoon, Lee Junwoo, Lee Se Yun, Lee Sujin, Lee Sungjae, Lee Sungwoo, Lee Wonjae, Lee Zoo Hyun, Lim Jong Kun, Lim Kun, Lim Taemin, Na Nuri, Nam Jeongyeon, Nam Kyeong-min, Noh Yeonseog, Oh Biro, Oh Jung-sik, Oh Solgil, Oh Yeontaek, Park Boyoun, Park Cheonbok, Park Dongju, Park Hyeonjin, Park Hyun Tae, Park Hyunjung, Park Jihye, Park Jooseok, Park Junghwan, Park Jungsoo, Park Miru, Park Sang Hee, Park Seunghyun, Park Soyoung, Park Taerim, Park Wonkyeong, Ryu Hyunjoon, Ryu Jeonghun, Ryu Nahyeon, Seo Soonshin, Seo Suk Min, Shim Yoonjeong, Shin Kyuyong, Shin Wonkwang, Sim Hyun, Sim Woongseob, Soh Hyejin, Son Bokyong, Son Hyunjun, Son Seulah, Song Chi-yun, Song Chiyoung, Song Ka Yeon, Song Minchul, Song Seungmin, Wang Jisung, Yeo Yonggoo, Yi Myeong Yeon, Yim Moon Bin, Yoo Taehwan, Yoo Youngjoon, Yoon Sungmin, Yoon Young Jin, Yu Hangyeol, Yu Ui Seon, Zuo Xingdong, Bae Jeongin, Bae Joungeun, Cho Hyunsoo, Cho Seonghyun, Cho Yongjin, Choi Taekyoon, Choi Yera, Chung Jiwan, Han Zhenghui, Heo Byeongho, Hong Euisuk, Hwang Taebaek, Im Seonyeol, Jegal Sumin, Jeon Sumin, Jeong Yelim, Jeong Yonghyun, Jiang Can, Jiang Juyong, Jin Jiho, Jo Ara, Jo Younghyun, Jung Hoyoun, Jung Juyoung, Kang 
Seunghyeong, Kim Dae Hee, Kim Ginam, Kim Hangyeol, Kim Heeseung, Kim Hyojin, Kim Hyojun, Kim Hyun-ah, Kim Jeehye, Kim Jin-hwa, Kim Jiseon, Kim Jonghak, Kim Jung Yoon, Kim Rak Yeong, Kim Seongjin, Kim Seoyoon, Kim Sewon, Kim Sooyoung, Kim Sukyoung, Kim Taeyong, Ko Naeun, Koo Bonseung, Kwak Heeyoung, Kwon Haena, Kwon Youngjin, Lee Boram, Lee Bruce W., Lee Dagyeong, Lee Erin, Lee Euijin, Lee Ha Gyeong, Lee Hyojin, Lee Hyunjeong, Lee Jeeyoon, Lee Jeonghyun, Lee Jongheok, Lee Joonhyung, Lee Junhyuk, Lee Mingu, Lee Nayeon, Lee Sangkyu, Lee Se Young, Lee Seulgi, Lee Seung Jin, Lee Suhyeon, Lee Yeonjae, Lee Yesol, Lee Youngbeom, Lee Yujin, Li Shaodong, Liu Tianyu, Moon Seong-eun, Moon Taehong, Nihlenramstroem Max-lasse, Oh Wonseok, Oh Yuri, Park Hongbeen, Park Hyekyung, Park Jaeho, Park Nohil, Park Sangjin, Ryu Jiwon, Ryu Miru, Ryu Simo, Seo Ahreum, Seo Hee, Seo Kangdeok, Shin Jamin, Shin Seungyoun, Sin Heetae, Wang Jiangping, Wang Lei, Xiang Ning, Xiao Longxiang, Xu Jing, Yi Seonyeong, Yoo Haanju, Yoo Haneul, Yoo Hwanhee, Yu Liang, Yu Youngjae, Yuan Weijie, Zeng Bo, Zhou Qian, Cho Kyunghyun, Ha Jung-woo, Park Joonsuk, Hwang Jihyun, Kwon Hyoung Jo, Kwon Soonyong, Lee Jungyeon, Lee Seungho, Lim Seonghyeon, Noh Hyunkyung, Choi Seungho, Lee Sang-woo, Lim Jung Hwa, Sung Nako
- By My Eyes: Grounding Multimodal Large Language Models With Sensor Data Via Visual Prompting Yoon Hyungjun, Tolera Biniyam Aschalew, Gong Taesik, Lee Kimin, Lee Sung-ju
- Matryoshka-adaptor: Unsupervised And Supervised Tuning For Smaller Embedding Dimensions Yoon Jinsung, Sinha Raj, Arik Sercan O., Pfister Tomas
- Making The Most Of Your Model: Methods For Finetuning And Applying Pretrained Transformers Yoshida Davis
- Enhancing Healthcare Through Large Language Models: A Study On Medical Question Answering Yu Haoran, Yu Chang, Wang Zihan, Zou Dongxian, Qin Hao
- A Cause-effect Look At Alleviating Hallucination Of Knowledge-grounded Dialogue Generation Yu Jifan, Zhang Xiaohan, Xu Yifan, Lei Xuanyu, Yao Zijun, Zhang Jing, Hou Lei, Li Juanzi
- Extend Model Merging From Fine-tuned To Pre-trained Large Language Models Via Weight Disentanglement Yu Le, Yu Bowen, Yu Haiyang, Huang Fei, Li Yongbin
- An LLM Maturity Model For Reliable And Transparent Text-to-query Yu Lei, Ray Abir
- In Defense Of RAG In The Era Of Long-context Language Models Yu Tan, Xu Anbang, Akkiraju Rama
- Unveiling And Harnessing Hidden Attention Sinks: Enhancing Large Language Models Without Training Through Attention Calibration Yu Zhongzhi, Wang Zheng, Fu Yonggan, Shi Huihong, Shaikh Khalid, Lin Yingyan Celine
- Towards A Holistic Evaluation Of Llms On Factual Knowledge Recall Yuan Jiaqing, Pan Lin, Hang Chung-wei, Guo Jiang, Jiang Jiarong, Min Bonan, Ng Patrick, Wang Zhiguo
- KV Cache Compression, But What Must We Give In Return? A Comprehensive Benchmark Of Long Context Capable Approaches Yuan Jiayi, Liu Hongyi, Zhong Shaochen (Henry), Chuang Yu-neng, Li Songchen, Wang Guanchu, Le Duy, Jin Hongye, Chaudhary Vipin, Xu Zhaozhuo, Liu Zirui, Hu Xia
- Advancing LLM Reasoning Generalists With Preference Trees Yuan Lifan, Cui Ganqu, Wang Hanbin, Ding Ning, Wang Xingyao, Deng Jia, Shan Boji, Chen Huimin, Xie Ruobing, Lin Yankai, Liu Zhenghao, Zhou Bowen, Peng Hao, Liu Zhiyuan, Sun Maosong
- Chatmusician: Understanding And Generating Music Intrinsically With LLM Yuan Ruibin, Lin Hanfeng, Wang Yi, Tian Zeyue, Wu Shangda, Shen Tianhao, Zhang Ge, Wu Yuhang, Liu Cong, Zhou Ziya, Ma Ziyang, Xue Liumeng, Wang Ziyu, Liu Qin, Zheng Tianyu, Li Yizhi, Ma Yinghao, Liang Yiming, Chi Xiaowei, Liu Ruibo, Wang Zili, Li Pengfei, Wu Jingcheng, Lin Chenghua, Liu Qifeng, Jiang Tao, Huang Wenhao, Chen Wenhu, Benetos Emmanouil, Fu Jie, Xia Gus, Dannenberg Roger, Xue Wei, Kang Shiyin, Guo Yike
- EASYTOOL: Enhancing Llm-based Agents With Concise Tool Instruction Yuan Siyu, Song Kaitao, Chen Jiangjie, Tan Xu, Shen Yongliang, Kan Ren, Li Dongsheng, Yang Deqing
- R-judge: Benchmarking Safety Risk Awareness For LLM Agents Yuan Tongxin, He Zhiwei, Dong Lingzhong, Wang Yiming, Zhao Ruijie, Xia Tian, Xu Lizhen, Zhou Binglin, Li Fangqi, Zhang Zhuosheng, Wang Rui, Liu Gongshen
- Evaluating Character Understanding Of Large Language Models Via Character Profiling From Fictional Works Yuan Xinfeng, Yuan Siyu, Cui Yuhan, Lin Tianhe, Wang Xintao, Xu Rui, Chen Jiangjie, Yang Deqing
- A Hybrid RAG System With Comprehensive Enhancement On Complex Reasoning Yuan Ye, Liu Chengwu, Yuan Jingyang, Sun Gongbo, Li Siqi, Zhang Ming
- Tcmbench: A Comprehensive Benchmark For Evaluating Large Language Models In Traditional Chinese Medicine Yue Wenjing, Wang Xiaoling, Zhu Wei, Guan Ming, Zheng Huanran, Wang Pengfei, Sun Changzhi, Ma Xin
- Wkvquant: Quantizing Weight And Key/value Cache For Large Language Models Gains More Yue Yuxuan, Yuan Zhihang, Duanmu Haojie, Zhou Sifan, Wu Jianlong, Nie Liqiang
- SPOT: Text Source Prediction From Originality Score Thresholding Yvinec Edouard, Kasser Gabriel
- Turkishmmlu: Measuring Massive Multitask Language Understanding In Turkish Yüksel Arda, Köksal Abdullatif, Şenel Lütfi Kerem, Korhonen Anna, Schütze Hinrich
- In-context Example Selection Via Similarity Search Improves Low-resource Machine Translation Zebaze Armel, Sagot Benoît, Bawden Rachel
- Design Proteins Using Large Language Models: Enhancements And Comparative Analyses Zeinalipour Kamyar, Jamshidi Neda, Bianchini Monica, Maggini Marco, Gori Marco
- Automating Turkish Educational Quiz Generation Using Large Language Models Zeinalipour Kamyar, Keptiğ Yusuf Gökberk, Maggini Marco, Gori Marco
- Skywork-math: Data Scaling Laws For Mathematical Reasoning In Large Language Models -- The Story Goes On Zeng Liang, Zhong Liangjun, Zhao Liang, Wei Tianwen, Yang Liu, He Jujie, Cheng Cheng, Hu Rui, Liu Yang, Yan Shuicheng, Fang Han, Zhou Yahui
- Advancing Multimodal Large Language Models In Chart Question Answering With Visualization-referenced Instruction Tuning Zeng Xingchen, Lin Haichuan, Ye Yilin, Zeng Wei
- DELIA: Diversity-enhanced Learning For Instruction Adaptation In Large Language Models Zeng Yuanhao, Ren Fei, Zhou Xinpeng, Wang Yihang, Shao Yingxia
- Can Mllms Perform Text-to-image In-context Learning? Zeng Yuchen, Kang Wonjun, Chen Yicong, Koo Hyung Il, Lee Kangwook
- Personarag: Enhancing Retrieval-augmented Generation Systems With User-centric Agents Zerhoudi Saber, Granitzer Michael
- Prefix Text As A Yarn: Eliciting Non-english Alignment In Foundation Language Model Zhan Runzhe, Yang Xinyi, Wong Derek F., Chao Lidia S., Zhang Yue
- When Scaling Meets LLM Finetuning: The Effect Of Data, Model And Finetuning Method Zhang Biao, Liu Zhongtao, Cherry Colin, Firat Orhan
- LUQ: Long-text Uncertainty Quantification For Llms Zhang Caiqi, Liu Fangyu, Basaldella Marco, Collier Nigel
- Sciglm: Training Scientific Language Models With Self-reflective Instruction Annotation And Tuning Zhang Dan, Hu Ziniu, Zhoubian Sining, Du Zhengxiao, Yang Kaiyu, Wang Zihan, Yue Yisong, Dong Yuxiao, Tang Jie
- Code Representation Learning At Scale Zhang Dejiao, Ahmad Wasi, Tan Ming, Ding Hantian, Nallapati Ramesh, Roth Dan, Ma Xiaofei, Xiang Bing
- Chemllm: A Chemical Large Language Model Zhang Di, Liu Wei, Tan Qian, Chen Jingdan, Yan Hang, Yan Yuliang, Li Jiatong, Huang Weiran, Yue Xiangyu, Ouyang Wanli, Zhou Dongzhan, Zhang Shufei, Su Mao, Zhong Han-sen, Li Yuqiang
- Unforgettable Generalization In Language Models Zhang Eric, Choshen Leshem, Andreas Jacob
- Closing The Gap Between Open-source And Commercial Large Language Models For Medical Evidence Summarization Zhang Gongbo, Jin Qiao, Zhou Yiliang, Wang Song, Idnay Betina R., Luo Yiming, Park Elizabeth, Nestor Jordan G., Spotnitz Matthew E., Soroush Ali, Campion Thomas, Lu Zhiyong, Weng Chunhua, Peng Yifan
- Iterative Utility Judgment Framework Via Llms Inspired By Relevance In Philosophy Zhang Hengran, Bi Keping, Guo Jiafeng, Cheng Xueqi
- Are Large Language Models Good At Utility Judgments? Zhang Hengran, Zhang Ruqing, Guo Jiafeng, De Rijke Maarten, Fan Yixing, Cheng Xueqi
- Sinklora: Enhanced Efficiency And Chat Capabilities For Long-context Large Language Models Zhang Hengyu
- Balancing Speciality And Versatility: A Coarse To Fine Framework For Supervised Fine-tuning Large Language Model Zhang Hengyuan, Wu Yanru, Li Dawei, Yang Sak, Zhao Rui, Jiang Yong, Tan Fei
- Enja: Ensemble Jailbreak On Large Language Models Zhang Jiahao, Wang Zilong, Wang Ruofan, Ma Xingjun, Jiang Yu-gang
- Longcite: Enabling Llms To Generate Fine-grained Citations In Long-context QA Zhang Jiajie, Bai Yushi, Lv Xin, Gu Wanjun, Liu Danqing, Zou Minhao, Cao Shulin, Hou Lei, Dong Yuxiao, Feng Ling, Li Juanzi
- Knowhalu: Hallucination Detection Via Multi-form Knowledge Based Factual Checking Zhang Jiawei, Xu Chejian, Gai Yu, Lecue Freddy, Song Dawn, Li Bo
- Regurgitative Training: The Value Of Real Data In Training Large Language Models Zhang Jinghui, Qiao Dandan, Yang Mochen, Wei Qiang
- Onegen: Efficient One-pass Unified Generation And Retrieval For Llms Zhang Jintian, Peng Cheng, Sun Mengshu, Chen Xiang, Liang Lei, Zhang Zhiqiang, Zhou Jun, Chen Huajun, Zhang Ningyu
- Read And Think: An Efficient Step-wise Multimodal Language Model For Document Understanding And Reasoning Zhang Jinxu
- Unimot: Unified Molecule-text Language Model With Discrete Token Representation Zhang Juzheng, Bian Yatao, Chen Yongqiang, Yao Quanming
- Fast And Slow Generating: An Empirical Study On Large And Small Language Models Collaborative Decoding Zhang Kaiyan, Wang Jianyu, Ding Ning, Qi Biqing, Hua Ermo, Lv Xingtai, Zhou Bowen
- Exploring The Best Practices Of Query Expansion With Large Language Models Zhang Le, Wu Yihong, Yang Qian, Nie Jian-yun
- SPL: A Socratic Playground For Learning Powered By Large Language Model Zhang Liang, Lin Jionghao, Kuang Ziyi, Xu Sheng, Yeasin Mohammed, Hu Xiangen
- Generative Verifiers: Reward Modeling As Next-token Prediction Zhang Lunjun, Hosseini Arian, Bansal Hritik, Kazemi Mehran, Kumar Aviral, Agarwal Rishabh
- Attention Instruction: Amplifying Attention In The Middle Via Prompting Zhang Meiru, Meng Zaiqiao, Collier Nigel
- Case Study: Testing Model Capabilities In Some Reasoning Tasks Zhang Min, Takumi Sato, Zhang Jack, Wang Jun
- Instructedit: Instruction-based Knowledge Editing For Large Language Models Zhang Ningyu, Tian Bozhong, Cheng Siyuan, Liang Xiaozhuan, Hu Yi, Xue Kouying, Gou Yanjie, Chen Xi, Chen Huajun
- Internlm-xcomposer-2.5: A Versatile Large Vision Language Model Supporting Long-contextual Input And Output Zhang Pan, Dong Xiaoyi, Zang Yuhang, Cao Yuhang, Qian Rui, Chen Lin, Guo Qipeng, Duan Haodong, Wang Bin, Ouyang Linke, Zhang Songyang, Zhang Wenwei, Li Yining, Gao Yang, Sun Peng, Zhang Xinyue, Li Wei, Li Jingwen, Wang Wenhai, Yan Hang, He Conghui, Zhang Xingcheng, Chen Kai, Dai Jifeng, Qiao Yu, Lin Dahua, Wang Jiaqi
- Mindspeech: Continuous Imagined Speech Decoding Using High-density Fnirs And Prompt Tuning For Advanced Human-ai Interaction Zhang Suyi, Alam Ekram, Baber Jack, Bianco Francesca, Turner Edward, Chamanzar Maysam, Dehghani Hamid
- Cfbench: A Comprehensive Constraints-following Benchmark For Llms Zhang Tao, Shen Yanjun, Luo Wenjing, Zhang Yan, Liang Hao, Zhang Tao, Yang Fan, Lin Mingan, Qiao Yujing, Chen Weipeng, Cui Bin, Zhang Wentao, Zhou Zenan
- R4: Reinforced Retriever-reorder-responder For Retrieval-augmented Large Language Models Zhang Taolin, Li Dongyang, Chen Qizhou, Wang Chengyu, Huang Longtao, Xue Hui, He Xiaofeng, Huang Jun
- In-context Principle Learning From Mistakes Zhang Tianjun, Madaan Aman, Gao Luyu, Zheng Steven, Mishra Swaroop, Yang Yiming, Tandon Niket, Alon Uri
- RAFT: Adapting Language Model To Domain Specific RAG Zhang Tianjun, Patil Shishir G., Jain Naman, Shen Sheng, Zaharia Matei, Stoica Ion, Gonzalez Joseph E.
- Leanquant: Accurate Large Language Model Quantization With Loss-error-aware Grid Zhang Tianyi, Shrivastava Anshumali
- Interpreting And Improving Large Language Models In Arithmetic Calculation Zhang Wei, Wan Chaoqun, Zhang Yonggang, Cheung Yiu-ming, Tian Xinmei, Shen Xu, Ye Jieping
- Defining Boundaries: A Spectrum Of Task Feasibility For Large Language Models Zhang Wenbo, Xu Zihang, Cai Hengrui
- Agent-pro: Learning To Evolve Via Policy-level Reflection And Optimization Zhang Wenqi, Tang Ke, Wu Hai, Wang Mengna, Shen Yongliang, Hou Guiyang, Tan Zeqi, Li Peng, Zhuang Yueting, Lu Weiming
- Dólares Or Dollars? Unraveling The Bilingual Prowess Of Financial Llms Between Spanish And English Zhang Xiao, Xiang Ruoyu, Yuan Chenhan, Feng Duanyu, Han Weiguang, Lopez-lira Alejandro, Liu Xiao-yang, Ananiadou Sophia, Peng Min, Huang Jimin, Xie Qianqian
- Llms Instruct Llms: An Extraction And Editing Method Zhang Xin, Ju Tianjie, Liang Huijia, Fu Ying, Zhang Qin
- LLM As A Mastermind: A Survey Of Strategic Reasoning With Large Language Models Zhang Yadong, Mao Shaoguang, Ge Tao, Wang Xun, De Wynter Adrian, Xia Yan, Wu Wenshan, Song Ting, Lan Man, Wei Furu
- Mme-realworld: Could Your Multimodal LLM Challenge High-resolution Real-world Scenarios That Are Difficult For Humans? Zhang Yi-fan, Zhang Huanyu, Tian Haochen, Fu Chaoyou, Zhang Shuangqing, Wu Junfei, Li Feng, Wang Kun, Wen Qingsong, Zhang Zhang, Wang Liang, Jin Rong, Tan Tieniu
- MLKD-BERT: Multi-level Knowledge Distillation For Pre-trained Language Models Zhang Ying, Yang Ziheng, Ji Shufan
- An Examination On The Effectiveness Of Divide-and-conquer Prompting In Large Language Models Zhang Yizhou, Du Lun, Cao Defu, Fu Qiang, Liu Yan
- Leveraging Biases In Large Language Models: "bias-knn" For Effective Few-shot Learning Zhang Yong, Li Hanzhang, Li Zhitao, Cheng Ning, Li Ming, Xiao Jing, Wang Jianzong
- Prompt-aware Adapter: Towards Learning Adaptive Visual Tokens For Multimodal Large Language Models Zhang Yue, Fan Hehe, Yang Yi
- Toolbehonest: A Multi-level Hallucination Diagnostic Benchmark For Tool-augmented Large Language Models Zhang Yuxiang, Chen Jing, Wang Junjie, Liu Yaxin, Yang Cheng, Shi Chufan, Zhu Xinyu, Lin Zihao, Wan Hanwen, Yang Yujiu, Sakai Tetsuya, Feng Tian, Yamana Hayato
- A Survey On The Memory Mechanism Of Large Language Model Based Agents Zhang Zeyu, Bo Xiaohe, Ma Chen, Li Rui, Chen Xu, Dai Quanyu, Zhu Jieming, Dong Zhenhua, Wen Ji-rong
- Large Language Model Can Continue Evolving From Mistakes Zhao Haokun, Han Haixia, Shi Jie, Du Chengyu, Liang Jiaqing, Xiao Yanghua
- Revolutionizing Finance With Llms: An Overview Of Applications And Insights Zhao Huaqin, Liu Zhengliang, Wu Zihao, Li Yiwei, Yang Tianze, Shu Peng, Xu Shaochen, Dai Haixing, Zhao Lin, Mai Gengchen, Liu Ninghao, Liu Tianming
- Longagent: Scaling Language Models To 128k Context Through Multi-agent Collaboration Zhao Jun, Zu Can, Xu Hao, Lu Yi, He Wei, Ding Yiwen, Gui Tao, Zhang Qi, Huang Xuanjing
- Lora Land: 310 Fine-tuned Llms That Rival GPT-4, A Technical Report Zhao Justin, Wang Timothy, Abid Wael, Angus Geoffrey, Garg Arnav, Kinnison Jeffery, Sherstinsky Alex, Molino Piero, Addair Travis, Rishi Devvret
- SLIDE: A Framework Integrating Small And Large Language Models For Open-domain Dialogues Evaluation Zhao Kun, Yang Bohao, Tang Chen, Lin Chenghua, Zhan Liang
- Diffagent: Fast And Accurate Text-to-image API Selection With Large Language Model Zhao Lirui, Yang Yue, Zhang Kaipeng, Shao Wenqi, Zhang Yuxin, Qiao Yu, Luo Ping, Ji Rongrong
- Efficiently Training 7B LLM With 1 Million Sequence Length On 8 Gpus Zhao Pinxue, Zhang Hailin, Fu Fangcheng, Nie Xiaonan, Liu Qibin, Yang Fang, Peng Yuanbo, Jiao Dian, Li Shuaipeng, Xue Jinbao, Tao Yangyu, Cui Bin
- Defending Large Language Models Against Jailbreak Attacks Via Layer-specific Editing Zhao Wei, Li Zhe, Li Yige, Zhang Ye, Sun Jun
- I Could've Asked That: Reformulating Unanswerable Questions Zhao Wenting, Gao Ge, Cardie Claire, Rush Alexander M.
- Wildchat: 1M Chatgpt Interaction Logs In The Wild Zhao Wenting, Ren Xiang, Hessel Jack, Cardie Claire, Choi Yejin, Deng Yuntian
- Codev: Empowering Llms For Verilog Generation Through Multi-level Summarization Zhao Yang, Huang Di, Li Chongxiao, Jin Pengwei, Nan Ziyuan, Ma Tianyun, Qi Lei, Pan Yansong, Zhang Zhenxing, Zhang Rui, Zhang Xishan, Du Zidong, Guo Qi, Hu Xing, Chen Yunji
- Codejudge-eval: Can Large Language Models Be Good Judges In Code Understanding? Zhao Yuwei, Luo Ziyang, Tian Yuchen, Lin Hongzhan, Yan Weixiang, Li Annan, Ma Jing
- SWIFT: A Scalable Lightweight Infrastructure For Fine-tuning Zhao Yuze, Huang Jintao, Hu Jinghan, Wang Xingjun, Mao Yunlin, Zhang Daoze, Jiang Zeyinzi, Wu Zhikai, Ai Baole, Wang Ang, Zhou Wenmeng, Chen Yingda
- HALC: Object Hallucination Reduction Via Adaptive Focal-contrast Decoding Chen Zhaorun, Zhao Zhuokai, Luo Hongyin, Yao Huaxiu, Li Bo, Zhou Jiawei
- Gpt-4v(ision) Is A Generalist Web Agent, If Grounded Zheng Boyuan, Gou Boyu, Kil Jihyung, Sun Huan, Su Yu
- Trustscore: Reference-free Evaluation Of LLM Response Trustworthiness Zheng Danna, Liu Danyang, Lapata Mirella, Pan Jeff Z.
- Fine-tuning Large Language Models For Domain-specific Machine Translation Zheng Jiawei, Hong Hanghai, Wang Xiaoli, Su Jingsong, Liang Yonggui, Wu Shikai
- Ali-agent: Assessing Llms' Alignment With Human Values Via Agent-based Evaluation Zheng Jingnan, Wang Han, Zhang An, Nguyen Tai D., Sun Jun, Chua Tat-seng
- NEO-BENCH: Evaluating Robustness Of Large Language Models With Neologisms Zheng Jonathan, Ritter Alan, Xu Wei
- Towards Lifelong Learning Of Large Language Models: A Survey Zheng Junhao, Qiu Shengjie, Shi Chengming, Ma Qianli
- Clr-fact: Evaluating The Complex Logical Reasoning Capability Of Large Language Models Over Factual Knowledge Zheng Tianshi, Bai Jiaxin, Wang Yicheng, Fang Tianqing, Guo Yue, Yim Yauwai, Song Yangqiu
- Opencodeinterpreter: Integrating Code Generation With Execution And Refinement Zheng Tianyu, Zhang Ge, Shen Tianhao, Liu Xueling, Lin Bill Yuchen, Fu Jie, Chen Wenhu, Yue Xiang
- Harnessing Large Language Models For Text-rich Sequential Recommendation Zheng Zhi, Chao Wenshuo, Qiu Zhaopeng, Zhu Hengshu, Xiong Hui
- Synthet2c: Generating Synthetic Data For Fine-tuning Large Language Models On The Text2cypher Task Zhong Ziije, Zhong Linqing, Sun Zhaoze, Jin Qingyun, Qin Zengchang, Zhang Xiaofan
- Fairer Preferences Elicit Improved Human-aligned Large Language Model Judgments Zhou Han, Wan Xingchen, Liu Yinhong, Collier Nigel, Vulić Ivan, Korhonen Anna
- Sequence To Sequence Reward Modeling: Improving RLHF By Language Feedback Zhou Jiayi, Ji Jiaming, Dai Juntao, Yang Yaodong
- Qog: Question And Options Generation Based On Language Model Zhou Jincheng
- An LLM Feature-based Framework For Dialogue Constructiveness Assessment Zhou Lexin, Farag Youmna, Vlachos Andreas
- Evaluating The Smooth Control Of Attribute Intensity In Text Generation With Llms Zhou Shang, Yao Feng, Dong Chengyu, Wang Zihan, Shang Jingbo
- Cogmg: Collaborative Augmentation Between Large Language Model And Knowledge Graph Zhou Tong, Chen Yubo, Liu Kang, Zhao Jun
- Using Pretrained Large Language Model With Prompt Engineering To Answer Biomedical Questions Zhou Wenxin, Ngo Thuy Hang
- Shared Imagination: Llms Hallucinate Alike Zhou Yilun, Xiong Caiming, Savarese Silvio, Wu Chien-sheng
- Defending Jailbreak Prompts Via In-context Adversarial Game Zhou Yujun, Han Yufei, Zhuang Haomin, Guo Kehan, Liang Zhenwen, Bao Hongyan, Zhang Xiangliang
- Lawgpt: A Chinese Legal Knowledge-enhanced Large Language Model Zhou Zhi, Shi Jiang-xin, Song Peng-xiao, Yang Xiao-wen, Jin Yi-xuan, Guo Lan-zhe, Li Yu-feng
- Fine-tuning Large Language Models To Translate: Will A Touch Of Noisy Data In Misaligned Languages Suffice? Zhu Dawei, Chen Pinzhen, Zhang Miaoran, Haddow Barry, Shen Xiaoyu, Klakow Dietrich
- TAT-LLM: A Specialized Language Model For Discrete Reasoning Over Tabular And Textual Data Zhu Fengbin, Liu Ziyang, Feng Fuli, Wang Chao, Li Moxin, Chua Tat-seng
- Benchmarking Large Language Models On CFLUE -- A Chinese Financial Language Understanding Evaluation Dataset Zhu Jie, Li Junhui, Wen Yalong, Guo Lifan
- Fastmem: Fast Memorization Of Prompt Improves Context Awareness Of Large Language Models Zhu Junyi, Liu Shuochen, Yu Yu, Tang Bo, Yan Yibo, Li Zhiyu, Xiong Feiyu, Xu Tong, Blaschko Matthew B.
- LIME-M: Less Is More For Evaluation Of Mllms Zhu Kang, Zang Qianbo, Jia Shian, Wu Siwei, Fang Feiteng, Li Yizhi, Guo Shuyue, Zheng Tianyu, Li Bo, Wu Haoning, Qu Xingwei, Yang Jian, Liu Zachary, Yue Xiang, Liu J. H., Lin Chenghua, Yang Min, Ni Shiwen, Huang Wenhao, Zhang Ge
- LIRE: Listwise Reward Enhancement For Preference Alignment Zhu Mingye, Liu Yi, Zhang Lei, Guo Junbo, Mao Zhendong
- Parameter-efficient Tuning Large Language Models For Graph Representation Learning Zhu Qi, Zheng Da, Song Xiang, Zhang Shichang, Jin Bowen, Sun Yizhou, Karypis George
- Inference-time Decontamination: Reusing Leaked Benchmarks For Large Language Model Evaluation Zhu Qin, Cheng Qingyuan, Peng Runyu, Li Xiaonan, Liu Tengxiao, Peng Ru, Qiu Xipeng, Huang Xuanjing
- Quite Good, But Not Enough: Nationality Bias In Large Language Models -- A Case Study Of Chatgpt Zhu Shucheng, Wang Weikang, Liu Ying
- How Privacy-savvy Are Large Language Models? A Case Study On Compliance And Privacy Technical Review Zhu Xichou, Liu Yang, Shen Zhou, Liu Yi, Li Min, Chen Yujun, John Benzi, Ma Zhenzhen, Hu Tao, Yang Bolong, Wang Manman, Xie Zongxing, Liu Peng, Cai Dan, Wang Junhui
- KG-FPQ: Evaluating Factuality Hallucination In Llms With Knowledge Graph-based False Premise Questions Zhu Yanxu, Xiao Jinlin, Wang Yuhang, Sang Jitao
- Can Large Language Models Understand Context? Zhu Yilun, Moniz Joel Ruben Antony, Bhargava Shruti, Lu Jiarui, Piraviperumal Dhivya, Li Site, Zhang Yuan, Yu Hong, Tseng Bo-hsiang
- Apt-pipe: A Prompt-tuning Tool For Social Data Annotation Using Chatgpt Zhu Yiming, Yin Zhizhuo, Tyson Gareth, Haq Ehsan-ul, Lee Lik-hang, Hui Pan
- One Token Can Help! Learning Scalable And Pluggable Virtual Tokens For Retrieval-augmented Large Language Models Zhu Yutao, Huang Zhaoheng, Dou Zhicheng, Wen Ji-rong
- INTERS: Unlocking The Power Of Large Language Models In Search With Instruction Tuning Zhu Yutao, Zhang Peitian, Zhang Chenghao, Chen Yifei, Xie Binyu, Liu Zheng, Wen Ji-rong, Dou Zhicheng
- Yulan: An Open-source Large Language Model Zhu Yutao, Zhou Kun, Mao Kelong, Chen Wentong, Sun Yiding, Chen Zhipeng, Cao Qian, Wu Yihan, Chen Yushuo, Wang Feng, Zhang Lei, Li Junyi, Wang Xiaolei, Wang Lei, Zhang Beichen, Dong Zican, Cheng Xiaoxue, Chen Yuhan, Tang Xinyu, Hou Yupeng, Ren Qiangqiang, Pang Xincheng, Xie Shufang, Zhao Wayne Xin, Dou Zhicheng, Mao Jiaxin, Lin Yankai, Song Ruihua, Xu Jun, Chen Xu, Yan Rui, Wei Zhewei, Hu Di, Huang Wenbing, Gao Ze-feng, Chen Yueguo, Lu Weizheng, Wen Ji-rong
- Murar: A Simple And Effective Multimodal Retrieval And Answer Refinement Framework For Multimodal Question Answering Zhu Zhengyuan, Lee Daniel, Zhang Hong, Harsha Sai Sree, Feujio Loic, Maharaj Akash, Li Yunyao
- Halueval-wild: Evaluating Hallucinations Of Language Models In The Wild Zhu Zhiying, Yang Yiming, Sun Zhiqing
- Sketch Then Generate: Providing Incremental User Feedback And Guiding LLM Code Generation Through Language-oriented Code Sketches Chen Zhu-tian, Xiong Zeyu, Yao Xiaoshuo, Glassman Elena
- Protllm: An Interleaved Protein-language LLM With Protein-as-word Pre-training Zhuo Le, Chi Zewen, Xu Minghao, Huang Heyan, Zheng Heqi, He Conghui, Mao Xian-ling, Zhang Wentao
- Bigcodebench: Benchmarking Code Generation With Diverse Function Calls And Complex Instructions Zhuo Terry Yue, Vu Minh Chien, Chim Jenny, Hu Han, Yu Wenhao, Widyasari Ratnadira, Yusuf Imam Nur Bani, Zhan Haolan, He Junda, Paul Indraneil, Brunner Simon, Gong Chen, Hoang Thong, Zebaze Armel Randy, Hong Xiaoheng, Li Wen-ding, Kaddour Jean, Xu Ming, Zhang Zhihan, Yadav Prateek, Jain Naman, Gu Alex, Cheng Zhoujun, Liu Jiawei, Liu Qian, Wang Zijian, Lo David, Hui Binyuan, Muennighoff Niklas, Fried Daniel, Du Xiaoning, De Vries Harm, Von Werra Leandro
- Astraios: Parameter-efficient Instruction Tuning Code Large Language Models Zhuo Terry Yue, Zebaze Armel, Suppattarachai Nitchakarn, Von Werra Leandro, De Vries Harm, Liu Qian, Muennighoff Niklas
- Language Model Sentence Completion With A Parser-driven Rhetorical Control Method Zingale Joshua, Kalita Jugal
- Proswitch: Knowledge-guided Instruction Tuning To Generate Professional And Non-professional Styled Text Zong Chang, Chen Yuyan, Lu Weiming, Shao Jian, Zhuang Yueting
- VL-ICL Bench: The Devil In The Details Of Benchmarking Multimodal In-context Learning Zong Yongshuo, Bohdal Ondrej, Hospedales Timothy
- Promptintern: Saving Inference Costs By Internalizing Recurrent Prompt During Large Language Model Fine-tuning Zou Jiaru, Zhou Mengyu, Li Tao, Han Shi, Zhang Dongmei
- Prompt-time Ontology-driven Symbolic Knowledge Capture With Large Language Models Çöplü Tolga, Bendiken Arto, Skomorokhov Andrii, Bateiko Eduard, Cobb Stephen
- Enhancing Visual Question Answering Through Question-driven Image Captions As Prompts Özdemir Övgü, Akagündüz Erdem
- Systematic Task Exploration With Llms: A Study In Citation Text Generation Şahinuç Furkan, Kuznetsov Ilia, Hou Yufang, Gurevych Iryna
🏷 Attention Mechanism
- Multimodal Attention For Neural Machine Translation Caglayan Ozan, Barrault Loïc, Bougares Fethi
- Joint Copying And Restricted Generation For Paraphrase Cao Ziqiang, Luo Chuwei, Li Wenjie, Li Sujian
- Long Short-term Memory-networks For Machine Reading Cheng Jianpeng, Dong Li, Lapata Mirella
- Tree-to-sequence Attentional Neural Machine Translation Eriguchi Akiko, Hashimoto Kazuma, Tsuruoka Yoshimasa
- Multi-way, Multilingual Neural Machine Translation With A Shared Attention Mechanism Firat Orhan, Cho Kyunghyun, Bengio Yoshua
- Multimodal Compact Bilinear Pooling For Visual Question Answering And Visual Grounding Fukui Akira, Park Dong Huk, Yang Daylen, Rohrbach Anna, Darrell Trevor, Rohrbach Marcus
- Character-level Question Answering With Attention Golub David, He Xiaodong
- Revisiting Visual Question Answering Baselines Jabri Allan, Joulin Armand, Van Der Maaten Laurens
- Google's Multilingual Neural Machine Translation System: Enabling Zero-shot Translation Johnson Melvin, Schuster Mike, Le Quoc V., Krikun Maxim, Wu Yonghui, Chen Zhifeng, Thorat Nikhil, Viégas Fernanda, Wattenberg Martin, Corrado Greg, Hughes Macduff, Dean Jeffrey
- Lstm-based Mixture-of-experts For Knowledge-aware Dialogues Le Phong, Dymetman Marc, Renders Jean-michel
- Rationalizing Neural Predictions Lei Tao, Barzilay Regina, Jaakkola Tommi
- Coherent Dialogue With Attention-based Language Models Mei Hongyuan, Bansal Mohit, Walter Matthew R.
- Interactive Attention For Neural Machine Translation Meng Fandong, Lu Zhengdong, Li Hang, Liu Qun
- Temporal Attention Model For Neural Machine Translation Sankaran Baskaran, Mi Haitao, Al-onaizan Yaser, Ittycheriah Abe
- Image Captioning With Deep Bidirectional Lstms Wang Cheng, Yang Haojin, Bartz Christian, Meinel Christoph
- Neural Machine Translation Advised By Statistical Machine Translation Wang Xing, Lu Zhengdong, Tu Zhaopeng, Li Hang, Xiong Deyi, Zhang Min
- Topic Aware Neural Response Generation Xing Chen, Wu Wei, Wu Yu, Liu Jie, Huang Yalou, Zhou Ming, Ma Wei-ying
- Neural Contextual Conversation Learning With Labeled Question-answering Pairs Xiong Kun, Cui Anqi, Zhang Zefeng, Li Ming
- An Attentional Neural Conversation Model With Improved Specificity Yao Kaisheng, Peng Baolin, Zweig Geoffrey, Wong Kam-fai
- End-to-end Answer Chunk Extraction And Ranking For Reading Comprehension Yu Yang, Zhang Wei, Hasan Kazi, Yu Mo, Xiang Bing, Zhou Bowen
- Don't Just Assume; Look And Answer: Overcoming Priors For Visual Question Answering Agrawal Aishwarya, Batra Dhruv, Parikh Devi, Kembhavi Aniruddha
- Weighted Transformer Network For Machine Translation Ahmed Karim, Keskar Nitish Shirish, Socher Richard
- Attention Is All You Need Vaswani Ashish, Shazeer Noam, Parmar Niki, Uszkoreit Jakob, Jones Llion, Gomez Aidan N., Kaiser Lukasz, Polosukhin Illia
- Are Emojis Predictable? Barbieri Francesco, Ballesteros Miguel, Saggion Horacio
- Lig-cristal System For The WMT17 Automatic Post-editing Task Berard Alexandre, Pietquin Olivier, Besacier Laurent
- Towards Neural Machine Translation With Latent Tree Attention Bradbury James, Socher Richard
- Incorporating Global Visual Features Into Attention-based Neural Machine Translation Calixto Iacer, Liu Qun, Campbell Nick
- Enabling Multi-source Neural Machine Translation By Concatenating Source Sentences In Multiple Languages Dabre Raj, Cromieres Fabien, Kurohashi Sadao
- Neural Machine Translation Leveraging Phrase-based Models In A Hybrid Search Dahlmann Leonard, Matusov Evgeny, Petrushkov Pavel, Khadivi Shahram
- Frustratingly Short Attention Spans In Neural Language Modeling Daniluk Michał, Rocktäschel Tim, Welbl Johannes, Riedel Sebastian
- An Empirical Study On The Effectiveness Of Images In Multimodal Neural Machine Translation Delbrouck Jean-benoit, Dupont Stéphane
- Robustfill: Neural Program Learning Under Noisy I/O Devlin Jacob, Uesato Jonathan, Bhupatiraju Surya, Singh Rishabh, Mohamed Abdel-rahman, Kohli Pushmeet
- A Copy-augmented Sequence-to-sequence Architecture Gives Good Performance On Task-oriented Dialogue Eric Mihail, Manning Christopher D.
- Ruminating Reader: Reasoning With Gated Multi-hop Attention Gong Yichen, Bowman Samuel R.
- Sockeye: A Toolkit For Neural Machine Translation Hieber Felix, Domhan Tobias, Denkowski Michael, Vilar David, Sokolov Artem, Clifton Ann, Post Matt
- Reinforced Mnemonic Reader For Machine Reading Comprehension Hu Minghao, Peng Yuxing, Huang Zhen, Qiu Xipeng, Wei Furu, Zhou Ming
- An Exploration Of Neural Sequence-to-sequence Architectures For Automatic Post-editing Junczys-dowmunt Marcin, Grundkiewicz Roman
- Attention Strategies For Multi-source Sequence-to-sequence Learning Libovický Jindřich, Helcl Jindřich
- Task-driven Visual Saliency And Attention-based Visual Question Answering Lin Yuetan, Pang Zhangyang, Wang Donghui, Zhuang Yueting
- Ivqa: Inverse Visual Question Answering Liu Feng, Xiang Tao, Hospedales Timothy M., Yang Wankou, Sun Changyin
- Phase Conductor On Multi-layered Attentions For Machine Comprehension Liu Rui, Wei Wei, Mao Weiguang, Chikina Maria
- Table-to-text Generation By Structure-aware Seq2seq Learning Liu Tianyu, Wang Kexiang, Sha Lei, Chang Baobao, Sui Zhifang
- Best Of Both Worlds: Transferring Knowledge From Discriminative Learning To A Generative Visual Dialog Model Lu Jiasen, Kannan Anitha, Yang Jianwei, Parikh Devi, Batra Dhruv
- Visual Question Answering With Memory-augmented Networks Ma Chao, Shen Chunhua, Dick Anthony, Wu Qi, Wang Peng, Hengel Anton Van Den, Reid Ian
- A Generative Approach To Question Answering Mitra Rajarshee
- Contextualized Word Representations For Reading Comprehension Salant Shimi, Berant Jonathan
- Visual Reference Resolution Using Attention Memory For Visual Dialog Seo Paul Hongsuck, Lehrmann Andreas, Han Bohyung, Sigal Leonid
- Generating High-quality And Informative Conversation Responses With Sequence-to-sequence Models Shao Louis, Gouws Stephan, Britz Denny, Goldie Anna, Strope Brian, Kurzweil Ray
- Recurrent And Contextual Models For Visual Question Answering Sharang Abhijit, Lau Eric
- An Empirical Analysis Of Multiple-turn Reasoning Strategies In Reading Comprehension Tasks Shen Yelong, Liu Xiaodong, Duh Kevin, Gao Jianfeng
- A Unified Query-based Generative Model For Question Generation And Question Answering Song Linfeng, Wang Zhiguo, Hamza Wael
- Tips And Tricks For Visual Question Answering: Learnings From The 2017 Challenge Teney Damien, Anderson Peter, He Xiaodong, Hengel Anton Van Den
- Self-attentive Residual Decoder For Neural Machine Translation Werlen Lesly Miculicich, Pappas Nikolaos, Ram Dhananjay, Popescu-belis Andrei
- Are You Talking To Me? Reasoned Visual Dialog Generation Through Adversarial Learning Wu Qi, Wang Peng, Shen Chunhua, Reid Ian, Hengel Anton Van Den
- Hierarchical Recurrent Attention Network For Response Generation Xing Chen, Wu Wei, Wu Yu, Zhou Ming, Huang Yalou, Ma Wei-ying
- The Forgettable-watcher Model For Video Question Answering Xue Hongyang, Zhao Zhou, Cai Deng
- Deepprobe: Information Directed Sequence Understanding And Chatbot Design Via Recurrent Neural Networks Yin Zi, Chang Keng-hao, Zhang Ruofei
- Code2seq: Generating Sequences From Structured Representations Of Code Alon Uri, Brody Shaked, Levy Omer, Yahav Eran
- Multilingual Extractive Reading Comprehension By Runtime Machine Translation Asai Akari, Eriguchi Akiko, Hashimoto Kazuma, Tsuruoka Yoshimasa
- Attention Based Natural Language Grounding By Navigating Virtual Environment B Akilesh, Sinha Abhishek, Sarkar Mausoom, Krishnamurthy Balaji
- Training Deeper Neural Machine Translation Models With Transparent Attention Bapna Ankur, Chen Mia Xu, Firat Orhan, Cao Yuan, Wu Yonghui
- Commonsense For Generative Multi-hop Question Answering Tasks Bauer Lisa, Wang Yicheng, Bansal Mohit
- LIUM-CVC Submissions For WMT18 Multimodal Translation Task Caglayan Ozan, Bardet Adrien, Bougares Fethi, Barrault Loïc, Wang Kai, Masana Marc, Herranz Luis, Van De Weijer Joost
- Behavior Analysis Of NLI Models: Uncovering The Influence Of Three Factors On Robustness Carmona Vicente Ivan Sanchez, Mitchell Jeff, Riedel Sebastian
- Textually Enriched Neural Module Networks For Visual Question Answering Chandu Khyathi Raghavi, Pyreddy Mary Arpita, Felix Matthieu, Joshi Narendra Nath
- Augmenting Neural Response Generation With Context-aware Topical Attention Dziri Nouha, Kamalloo Ehsan, Mathewson Kory W., Zaiane Osmar
- Reciprocal Attention Fusion For Visual Question Answering Farazi Moshiur R, Khan Salman H
- SALSA-TEXT: Self Attentive Latent Space Based Adversarial Text Generation Gagnon-marchand Jules, Sadeghi Hamed, Haidar Md. Akmal, Rezagholizadeh Mehdi
- Question-guided Hybrid Convolution For Visual Question Answering Gao Peng, Lu Pan, Li Hongsheng, Li Shuang, Li Yikang, Hoi Steven, Wang Xiaogang
- Generating Distractors For Reading Comprehension Questions From Real Examinations Gao Yifan, Bing Lidong, Li Piji, King Irwin, Lyu Michael R.
- Analysing The Potential Of Seq-to-seq Models For Incremental Interpretation In Task-oriented Dialogue Hupkes Dieuwke, Bouwmeester Sanne, Fernández Raquel
- Transfer Learning Of Language-independent End-to-end ASR With Language Model Fusion Inaguma Hirofumi, Cho Jaejin, Baskar Murali Karthick, Kawahara Tatsuya, Watanabe Shinji
- Closed-book Training To Improve Summarization Encoder Memory Jiang Yichen, Bansal Mohit
- Neural Machine Translation With Adequacy-oriented Learning Kong Xiang, Tu Zhaopeng, Shi Shuming, Hovy Eduard, Zhang Tong
- A Question-focused Multi-factor Attention Network For Question Answering Kundu Souvik, Ng Hwee Tou
- Answerer In Questioner's Mind: Information Theoretic Approach To Goal-oriented Visual Dialog Lee Sang-woo, Heo Yu-jung, Zhang Byoung-tak
- Seq2rdf: An End-to-end Application For Deriving Triples From Natural Language Text Liu Yue, Zhang Tongtao, Liang Zhicheng, Ji Heng, Mcguinness Deborah L.
- Mem2seq: Effectively Incorporating Knowledge Bases Into End-to-end Task-oriented Dialog Systems Madotto Andrea, Wu Chien-sheng, Fung Pascale
- Neural Machine Translation With Key-value Memory-augmented Attention Meng Fandong, Tu Zhaopeng, Cheng Yong, Wu Haiyang, Zhai Junjie, Yang Yuekui, Wang Di
- Improved Fusion Of Visual And Language Representations By Dense Symmetric Co-attention For Visual Question Answering Nguyen Duy-kien, Okatani Takayuki
- Phrase-based Attentions Nguyen Phi Xuan, Joty Shafiq
- Polite Dialogue Generation Without Parallel Data Niu Tong, Bansal Mohit
- Recursive Visual Attention In Visual Dialog Niu Yulei, Zhang Hanwang, Zhang Manli, Zhang Jianhong, Lu Zhiwu, Wen Ji-rong
- Dual Recurrent Attention Units For Visual Question Answering Osman Ahmed, Samek Wojciech
- Multimodal Explanations: Justifying Decisions And Pointing To The Evidence Park Dong Huk, Hendricks Lisa Anne, Akata Zeynep, Rohrbach Anna, Schiele Bernt, Darrell Trevor, Rohrbach Marcus
- Dynamic Fusion With Intra- And Inter- Modality Attention Flow For Visual Question Answering Gao Peng, Jiang Zhengkai, You Haoxuan, Lu Pan, Hoi Steven, Wang Xiaogang, Li Hongsheng
- Teaching Machines To Code: Neural Markup Generation With Visual Attention Singh Sumeet S.
- Adapting Visual Question Answering Models For Enhancing Multimodal Community Q&A Platforms Srivastava Avikalp, Liu Hsin Wen, Fujita Sumio
- The University Of Cambridge's Machine Translation Systems For WMT18 Stahlberg Felix, De Gispert Adria, Byrne Bill
- Inducing Grammars With And For Neural Machine Translation Tran Ke, Bisk Yonatan
- Few-shot Generalization Across Dialogue Tasks Vlasov Vladimir, Drissner-schmid Akela, Nichol Alan
- Multilingual NMT With A Language-independent Attention Bridge Vázquez Raúl, Raganato Alessandro, Tiedemann Jörg, Creutz Mathias
- Structured Triplet Learning With Pos-tag Guided Attention For Visual Question Answering Wang Zhe, Liu Xiaoyi, Chen Liangjian, Wang Limin, Qiao Yu, Xie Xiaohui, Fowlkes Charless
- The Fine Line Between Linguistic Generalization And Failure In Seq2seq-attention Models Weber Noah, Shekhar Leena, Balasubramanian Niranjan
- Dual Ask-answer Network For Machine Reading Comprehension Xiao Han, Wang Feng, Yan Jianfeng, Zheng Jingyao
- Qanet: Combining Local Convolution With Global Self-attention For Reading Comprehension Yu Adams Wei, Dohan David, Luong Minh-thang, Zhao Rui, Chen Kai, Norouzi Mohammad, Le Quoc V.
- Improved Training Of End-to-end Attention Models For Speech Recognition Zeyer Albert, Irie Kazuki, Schlüter Ralf, Ney Hermann
- Medical Exam Question Answering With Large-scale Reading Comprehension Zhang Xiao, Wu Ji, He Zhiyang, Liu Xien, Su Ying
- Phrase Table As Recommendation Memory For Neural Machine Translation Zhao Yang, Wang Yining, Zhang Jiajun, Zong Chengqing
- An Affect-rich Neural Conversational Model With Biased Attention And Weighted Cross-entropy Loss Zhong Peixiang, Wang Di, Miao Chunyan
- A Visual Attention Grounding Neural Model For Multimodal Machine Translation Zhou Mingyang, Cheng Runxiang, Lee Yong Jae, Yu Zhou
- Whatcha Lookin' At? Deeplifting Bert's Attention In Question Answering Arkhangelskaia Ekaterina, Dutta Sourav
- PLATO: Pre-trained Dialogue Generation Model With Discrete Latent Variable Bao Siqi, He Huang, Wang Fan, Wu Hua, Wang Haifeng
- Keeping Notes: Conditional Natural Language Generation With A Scratchpad Mechanism Benmalek Ryan Y., Khabsa Madian, Desu Suma, Cardie Claire, Banko Michele
- Language Features Matter: Effective Language Representations For Vision-language Tasks Burns Andrea, Tan Reuben, Saenko Kate, Sclaroff Stan, Plummer Bryan A.
- BERTQA -- Attention On Steroids Chadha Ankit, Sood Rewa
- DMRM: A Dual-channel Multi-hop Reasoning Model For Visual Dialog Chen Feilong, Meng Fandong, Xu Jiaming, Li Peng, Xu Bo, Zhou Jie
- Attending To Entities For Better Text Understanding Cheng Pengxiang, Erk Katrin
- Learning Multi-level Information For Dialogue Response Selection By Highway Recurrent Transformer Chiang Ting-rui, Huang Chao-wei, Su Shang-yu, Chen Yun-nung
- Contrastive Multi-document Question Generation Cho Woon Sang, Zhang Yizhe, Rao Sudha, Celikyilmaz Asli, Xiong Chenyan, Gao Jianfeng, Wang Mengdi, Dolan Bill
- Memory-augmented Recurrent Networks For Dialogue Coherence Donahue David, Meng Yuanliang, Rumshisky Anna
- Unified Language Model Pre-training For Natural Language Understanding And Generation Dong Li, Yang Nan, Wang Wenhui, Wei Furu, Liu Xiaodong, Wang Yu, Gao Jianfeng, Zhou Ming, Hon Hsiao-wuen
- Retrosynthesis With Attention-based NMT Model And Chemical Analysis Of The "wrong" Predictions Duan Hongliang, Wang Ling, Zhang Chengyun, Li Jianjun
- Heterogeneous Memory Enhanced Multimodal Attention Model For Video Question Answering Fan Chenyou, Zhang Xiaofan, Zhang Shu, Wang Wensheng, Zhang Chi, Huang Heng
- Optibox: Breaking The Limits Of Proposals For Visual Grounding Fan Zicong, Meng Si Yi, Sigal Leonid, Little James J.
- Question-agnostic Attention For Visual Question Answering Farazi Moshiur R, Khan Salman H, Barnes Nick
- Multi-step Reasoning Via Recurrent Dual Attention For Visual Dialog Gan Zhe, Cheng Yu, Kholy Ahmed El, Li Linjie, Liu Jingjing, Gao Jianfeng
- Generating Natural Language Explanations For Visual Question Answering Using Scene Graphs And Visual Attention Ghosh Shalini, Burachas Giedrius, Ray Arijit, Ziskind Avi
- Interactive Matching Network For Multi-turn Response Selection In Retrieval-based Chatbots Gu Jia-chen, Ling Zhen-hua, Liu Quan
- Zero-shot Paraphrase Generation With Multilingual Language Models Guo Yinpeng, Liao Yi, Jiang Xin, Zhang Qing, Zhang Yibo, Liu Qun
- Resurrecting Submodularity For Neural Text Generation Han Simeng, Lin Xiang, Joty Shafiq
- Exbert: A Visual Analysis Tool To Explore Learned Representations In Transformers Models Hoover Benjamin, Strobelt Hendrik, Gehrmann Sebastian
- Iterative Answer Prediction With Pointer-augmented Multimodal Transformers For Textvqa Hu Ronghang, Singh Amanpreet, Darrell Trevor, Rohrbach Marcus
- Rap-net: Recurrent Attention Pooling Networks For Dialogue Response Selection Huang Chao-wei, Chiang Ting-rui, Su Shang-yu, Chen Yun-nung
- Attention Is Not Explanation Jain Sarthak, Wallace Byron C.
- Amr-to-text Generation With Cache Transition Systems Jin Lisa, Gildea Daniel
- Let Me Know What To Ask: Interrogative-word-aware Question Generation Kang Junmo, Roman Haritz Puerto San, Myaeng Sung-hyon
- Weak Supervision Helps Emergence Of Word-object Alignment And Improves Vision-language Tasks Kervadec Corentin, Antipov Grigory, Baccouche Moez, Wolf Christian
- Sample Efficient Text Summarization Using A Single Pre-trained Transformer Khandelwal Urvashi, Clark Kevin, Jurafsky Dan, Kaiser Lukasz
- When And Why Is Document-level Context Useful In Neural Machine Translation? Kim Yunsu, Tran Duc Thanh, Ney Hermann
- Learning To Answer By Learning To Ask: Getting The Best Of GPT-2 And BERT Worlds Klein Tassilo, Nabi Moin
- Transcoding Compositionally: Using Attention To Find More Generalizable Solutions Korrel Kris, Hupkes Dieuwke, Dankers Verna, Bruni Elia
- Revealing The Dark Secrets Of BERT Kovaleva Olga, Romanov Alexey, Rogers Anna, Rumshisky Anna
- Question Generation From Paragraphs: A Tale Of Two Hierarchical Models Kumar Vishwajeet, Chaki Raktim, Talluri Sai Teja, Ramakrishnan Ganesh, Li Yuan-fang, Haffari Gholamreza
- Big Bidirectional Insertion Representations For Documents Li Lala, Chan William
- Visualbert: A Simple And Performant Baseline For Vision And Language Li Liunian Harold, Yatskar Mark, Yin Da, Hsieh Cho-jui, Chang Kai-wei
- Relevance-promoting Language Model For Short-text Conversation Li Xin, Li Piji, Bi Wei, Liu Xiaojiang, Lam Wai
- Explicit Sentence Compression For Neural Machine Translation Li Zuchao, Wang Rui, Chen Kehai, Utiyama Masao, Sumita Eiichiro, Zhang Zhuosheng, Zhao Hai
- TAB-VCR: Tags And Attributes Based Visual Commonsense Reasoning Baselines Lin Jingxiang, Jain Unnat, Schwing Alexander G.
- Attention-informed Mixed-language Training For Zero-shot Cross-lingual Task-oriented Dialogue Systems Liu Zihan, Winata Genta Indra, Lin Zhaojiang, Xu Peng, Fung Pascale
- Multi-hop Reading Comprehension Via Deep Reinforcement Learning Based Document Traversal Long Alex, Mason Joel, Blair Alan, Wang Wei
- Vilbert: Pretraining Task-agnostic Visiolinguistic Representations For Vision-and-language Tasks Lu Jiasen, Batra Dhruv, Parikh Devi, Lee Stefan
- Good, Better, Best: Textual Distractors Generation For Multiple-choice Visual Question Answering Via Reinforcement Learning Lu Jiaying, Ye Xin, Ren Yi, Yang Yezhou
- Triplenet: Triple Attention Network For Multi-turn Response Selection In Retrieval-based Chatbots Ma Wentao, Cui Yiming, Shao Nan, He Su, Zhang Wei-nan, Liu Ting, Wang Shijin, Hu Guoping
- Selective Attention For Context-aware Neural Machine Translation Maruf Sameen, Martins André F. T., Haffari Gholamreza
- Emergent Properties Of Finetuned Language Representation Models Matton Alexandre, De Oliveira Luke
- Structured Pruning Of A Bert-based Question Answering Model Mccarley J. S., Chakravarti Rishav, Sil Avirup
- Self-attentional Models Application In Task-oriented Dialogue Generation Systems Mehrjardi Mansour Saffar, Trabelsi Amine, Zaiane Osmar R.
- Megatron-lm: Training Multi-billion Parameter Language Models Using Model Parallelism Shoeybi Mohammad, Patwary Mostofa, Puri Raul, Legresley Patrick, Casper Jared, Catanzaro Bryan
- Interrogating The Explanatory Power Of Attention In Neural Machine Translation Moradi Pooya, Kambhatla Nishant, Sarkar Anoop
- Let's Ask Again: Refine Network For Automatic Question Generation Nema Preksha, Mohankumar Akash Kumar, Khapra Mitesh M., Srinivasan Balaji Vasan, Ravindran Balaraman
- Efficient Attention Mechanism For Visual Dialog That Can Handle All The Interactions Between Multiple Inputs Nguyen Van-quang, Suganuma Masanori, Okatani Takayuki
- Frustratingly Easy Natural Question Answering Pan Lin, Chakravarti Rishav, Ferritto Anthony, Glass Michael, Gliozzo Alfio, Roukos Salim, Florian Radu, Sil Avirup
- Visual Dialogue State Tracking For Question Generation Pang Wei, Wang Xiaojie
- AI Meets Austen: Towards Human-robot Discussions Of Literary Metaphor Parde Natalie, Nielsen Rodney D.
- Stabilizing Transformers For Reinforcement Learning Parisotto Emilio, Song H. Francis, Rae Jack W., Pascanu Razvan, Gulcehre Caglar, Jayakumar Siddhant M., Jaderberg Max, Kaufman Raphael Lopez, Clark Aidan, Noury Seb, Botvinick Matthew M., Heess Nicolas, Hadsell Raia
- A Modular Task-oriented Dialogue System Using A Neural Mixture-of-experts Pei Jiahuan, Ren Pengjie, De Rijke Maarten
- Learning To Deceive With Attention-based Explanations Pruthi Danish, Gupta Mansi, Dhingra Bhuwan, Neubig Graham, Lipton Zachary C.
- Blockwise Self-attention For Long Document Understanding Qiu Jiezhong, Ma Hao, Levy Omer, Yih Scott Wen-tau, Wang Sinong, Tang Jie
- Option Comparison Network For Multiple-choice Reading Comprehension Ran Qiu, Li Peng, Hu Weiwei, Zhou Jie
- On Leveraging The Visual Modality For Neural Machine Translation Raunak Vikas, Choe Sang Keun, Lu Quanyang, Xu Yi, Metze Florian
- Factor Graph Attention Schwartz Idan, Yu Seunghak, Hazan Tamir, Schwing Alexander
- Taking A HINT: Leveraging Explanations To Make Vision And Language Models More Grounded Selvaraju Ramprasaath R., Lee Stefan, Shen Yilin, Jin Hongxia, Ghosh Shalini, Heck Larry, Batra Dhruv, Parikh Devi
- Scene-based Factored Attention For Image Captioning Shen Chen, Ji Rongrong, Chen Fuhai, Sun Xiaoshuai, Li Xiangming
- Pragmatically Informative Text Generation Shen Sheng, Fried Daniel, Andreas Jacob, Klein Dan
- MASS: Masked Sequence To Sequence Pre-training For Language Generation Song Kaitao, Tan Xu, Qin Tao, Lu Jianfeng, Liu Tie-yan
- Natural Language Generation Using Reinforcement Learning With External Rewards Srinivasan Vidhushini, Santhanam Sashank, Shaikh Samira
- Sesamebert: Attention For Anywhere Su Ta-chun, Cheng Hsiang-chih
- Adaptive Attention Span In Transformers Sukhbaatar Sainbayar, Grave Edouard, Bojanowski Piotr, Joulin Armand
- Augmenting Self-attention With Persistent Memory Sukhbaatar Sainbayar, Grave Edouard, Lample Guillaume, Jegou Herve, Joulin Armand
- Generating Diverse Translation By Manipulating Multi-head Attention Sun Zewei, Huang Shujian, Wei Hao-ran, Dai Xin-yu, Chen Jiajun
- Syntax-infused Transformer And BERT Models For Machine Translation And Natural Language Understanding Sundararaman Dhanasekar, Subramanian Vivek, Wang Guoyin, Si Shijing, Shen Dinghan, Wang Dong, Carin Lawrence
- LXMERT: Learning Cross-modality Encoder Representations From Transformers Tan Hao, Bansal Mohit
- Generating Token-level Explanations For Natural Language Inference Thorne James, Vlachos Andreas, Christodoulopoulos Christos, Mittal Arpit
- Tree-structured Semantic Encoder With Knowledge Sharing For Domain Adaptation In Natural Language Generation Tseng Bo-hsiang, Budzianowski Paweł, Wu Yen-chen, Gašić Milica
- Capturing Greater Context For Question Generation Tuan Luu Anh, Shah Darsh J, Barzilay Regina
- How Does BERT Answer Questions? A Layer-wise Analysis Of Transformer Representations Van Aken Betty, Winter Benjamin, Löser Alexander, Gers Felix A.
- Visualizing Attention In Transformer-based Language Representation Models Vig Jesse
- Language Models With Transformers Wang Chenguang, Li Mu, Smola Alexander J.
- Improving Knowledge-aware Dialogue Generation Via Knowledge Base Question Answering Wang Jian, Liu Junhao, Bi Wei, Liu Xiaojiang, He Kejing, Xu Ruifeng, Yang Min
- Theme-aware Generation Model For Chinese Lyrics Wang Jie, Zhao Xinyan
- Structbert: Incorporating Language Structures Into Pre-training For Deep Language Understanding Wang Wei, Bi Bin, Yan Ming, Wu Chen, Bao Zuyi, Xia Jiangnan, Peng Liwei, Si Luo
- Tree Transformer: Integrating Tree Structures Into Self-attention Wang Yau-shian, Lee Hung-yi, Chen Yun-nung
- Video Question Generation Via Cross-modal Self-attention Networks Learning Wang Yu-siang, Su Hung-ting, Chang Chen-hsi, Liu Zhe-yu, Hsu Winston H.
- Attention Is Not Not Explanation Wiegreffe Sarah, Pinter Yuval
- Learning To Memorize In Neural Task-oriented Dialogue Systems Wu Chien-sheng
- Integrated Triaging For Fast Reading Comprehension Wu Felix, Li Boyi, Wang Lequn, Lao Ni, Blitzer John, Weinberger Kilian Q.
- Transformer-based Cascaded Multimodal Speech Translation Wu Zixiu, Caglayan Ozan, Ive Julia, Wang Josiah, Specia Lucia
- Sharing Attention Weights For Fast Transformer Xiao Tong, Li Yinqiao, Zhu Jingbo, Yu Zhengtao, Liu Tongran
- Visual Entailment: A Novel Task For Fine-grained Image Understanding Xie Ning, Lai Farley, Doran Derek, Kadav Asim
- A Multi-turn Emotionally Engaging Dialog Model Xie Yubo, Svikhnushina Ekaterina, Pu Pearl
- Making History Matter: History-advantage Sequence Training For Visual Dialog Yang Tianhao, Zha Zheng-jun, Zhang Hanwang
- Knowledge-grounded Response Generation With Deep Attentional Latent-variable Model Ye Hao-tong, Lo Kai-ling, Su Shang-yu, Chen Yun-nung
- Bp-transformer: Modelling Long-range Context Via Binary Partitioning Ye Zihao, Guo Qipeng, Gan Quan, Qiu Xipeng, Zhang Zheng
- Multimodal Unified Attention Networks For Vision-and-language Interactions Yu Zhou, Cui Yuhao, Yu Jun, Tao Dacheng, Tian Qi
- Extended Answer And Uncertainty Aware Neural Question Generation Zeng Hongwei, Zhi Zhuo, Liu Jun, Wei Bifan
- Context-aware Visual Policy Network For Fine-grained Image Captioning Zha Zheng-jun, Liu Daqing, Zhang Hanwang, Zhang Yongdong, Wu Feng
- Recosa: Detecting The Relevant Contexts With Self-attention For Multi-turn Dialogue Generation Zhang Hainan, Lan Yanyan, Pang Liang, Guo Jiafeng, Cheng Xueqi
- MUSE: Parallel Multi-scale Attention For Sequence To Sequence Learning Zhao Guangxiang, Sun Xu, Xu Jingjing, Zhang Zhiyuan, Luo Liangchen
- A Pre-training Based Personalized Dialogue Generation Model With Persona-sparse Data Zheng Yinhe, Zhang Rongsheng, Mao Xiaoxi, Huang Minlie
- Atom Responding Machine For Dialog Generation Zhou Ganbin, Luo Ping, Chen Jingwu, Lin Fen, Lin Leyu, He Qing
- Unified Vision-language Pre-training For Image Captioning And VQA Zhou Luowei, Palangi Hamid, Zhang Lei, Hu Houdong, Corso Jason J., Gao Jianfeng
- Multi-task Learning With Language Modeling For Question Generation Zhou Wenjie, Zhang Minghua, Wu Yunfang
- Co-attention Hierarchical Network: Generating Coherent Long Distractors For Reading Comprehension Zhou Xiaorui, Luo Senlin, Wu Yunfang
- Using Interlinear Glosses As Pivot In Low-resource Multilingual Machine Translation Zhou Zhong, Levin Lori, Mortensen David R., Waibel Alex
- Encoder-agnostic Adaptation For Conditional Language Generation Ziegler Zachary M., Melas-kyriazi Luke, Gehrmann Sebastian, Rush Alexander M.
- A Study On Multimodal And Interactive Explanations For Visual Question Answering Alipour Kamran, Schulze Jurgen P., Yao Yi, Ziskind Avi, Burachas Giedrius
- Unilmv2: Pseudo-masked Language Models For Unified Language Model Pre-training Bao Hangbo, Dong Li, Wei Furu, Wang Wenhui, Yang Nan, Liu Xiaodong, Wang Yu, Piao Songhao, Gao Jianfeng, Zhou Ming, Hon Hsiao-wuen
- The Elephant In The Interpretability Room: Why Use Attention As Explanation When We Have Saliency Methods? Bastings Jasmijn, Filippova Katja
- Co-attentional Transformers For Story-based Video Understanding Bebensee Björn, Zhang Byoung-tak
- Longformer: The Long-document Transformer Beltagy Iz, Peters Matthew E., Cohan Arman
- Adaptive Transformers For Learning Multimodal Representations Bhargava Prajjwal
- Chatbot Interaction With Artificial Intelligence: Human Data Augmentation With T5 And Language Transformer Ensemble For Text Classification Bird Jordan J., Ekárt Anikó, Faria Diego R.
- Advances Of Transformer-based Models For News Headline Generation Bukhtiyarov Alexey, Gusev Ilya
- Memory Transformer Burtsev Mikhail S., Kuratov Yuri, Peganov Anton, Sapunov Grigory V.
- Behind The Scene: Revealing The Secrets Of Pre-trained Vision-and-language Models Cao Jize, Gan Zhe, Cheng Yu, Yu Licheng, Chen Yen-chun, Liu Jingjing
- Pretrained Language Models For Dialogue Generation With Multiple Input Sources Cao Yu, Bi Wei, Fang Meng, Tao Dacheng
- Dipair: Fast And Accurate Distillation For Trillion-scale Text Matching And Pair Modeling Chen Jiecao, Yang Liu, Raman Karthik, Bendersky Michael, Yeh Jung-jung, Zhou Yun, Najork Marc, Cai Danyang, Emadzadeh Ehsan
- Adaptive Bi-directional Attention: Exploring Multi-granularity Representations For Machine Reading Comprehension Chen Nuo, Liu Fenglin, You Chenyu, Zhou Peilin, Zou Yuexian
- Improving BERT With Self-supervised Attention Chen Yiren, Kou Xiaoyu, Bai Jiangang, Tong Yunhai
- Autoadr: Automatic Model Design For Ad Relevance Chen Yiren, Yang Yaming, Sun Hong, Wang Yujing, Xu Yu, Shen Wei, Zhou Rong, Tong Yunhai, Bai Jing, Zhang Ruofei
- Accurate Word Alignment Induction From Neural Machine Translation Chen Yun, Liu Yang, Chen Guanhua, Jiang Xin, Liu Qun
- MEANTIME: Mixture Of Attention Mechanisms With Multi-temporal Embeddings For Sequential Recommendation Cho Sung Min, Park Eunhyeok, Yoo Sungjoo
- Unsupervised Explanation Generation For Machine Reading Comprehension Cui Yiming, Liu Ting, Wang Shijin, Hu Guoping
- Ernie-doc: A Retrospective Long-document Modeling Transformer Ding Siyu, Shang Junyuan, Wang Shuohuan, Sun Yu, Tian Hao, Wu Hua, Wang Haifeng
- Turngpt: A Transformer-based Language Model For Predicting Turn-taking In Spoken Dialog Ekstedt Erik, Skantze Gabriel
- Addressing Some Limitations Of Transformers With Feedback Memory Fan Angela, Lavril Thibaut, Grave Edouard, Joulin Armand, Sukhbaatar Sainbayar
- Attention Guided Semantic Relationship Parsing For Visual Question Answering Farazi Moshiur, Khan Salman, Barnes Nick
- Compressing Large-scale Transformer-based Models: A Case Study On BERT Ganesh Prakhar, Chen Yao, Lou Xin, Khan Mohammad Ali, Yang Yin, Sajjad Hassan, Nakov Preslav, Chen Deming, Winslett Marianne
- IQ-VQA: Intelligent Visual Question Answering Goel Vatsal, Chandak Mohit, Anand Ashish, Guha Prithwijit
- ISAAQ -- Mastering Textbook Questions With Pre-trained Transformers And Bottom-up And Top-down Attention Gomez-perez Jose Manuel, Ortega Raul
- Are Neural Open-domain Dialog Systems Robust To Speech Recognition Errors In The Dialog History? An Empirical Study Gopalakrishnan Karthik, Hedayatnia Behnam, Wang Longshaokan, Liu Yang, Hakkani-tur Dilek
- Token-level Adaptive Training For Neural Machine Translation Gu Shuhao, Zhang Jinchao, Meng Fandong, Feng Yang, Xie Wanying, Zhou Jie, Yu Dong
- Dialogbert: Discourse-aware Response Generation Via Learning To Recover And Rank Utterances Gu Xiaodong, Yoo Kang Min, Ha Jung-woo
- GMAT: Global Memory Augmentation For Transformers Gupta Ankit, Berant Jonathan
- Improvement Of A Dedicated Model For Open Domain Persona-aware Dialogue Generation Han Qiang
- ECONET: Effective Continual Pretraining Of Language Models For Event Temporal Reasoning Han Rujun, Ren Xiang, Peng Nanyun
- Ranking Enhanced Dialogue Generation Hao Changying, Pang Liang, Lan Yanyan, Sun Fei, Guo Jiafeng, Cheng Xueqi
- Exploring Explainable Selection To Control Abstractive Summarization Wang Haonan, Gao Yang, Bai Yu, Lapata Mirella, Huang Heyan
- Enriched Pre-trained Transformers For Joint Slot Filling And Intent Detection Hardalov Momchil, Koychev Ivan, Nakov Preslav
- Guided Transformer: Leveraging Multiple External Sources For Representation Learning In Conversational Search Hashemi Helia, Zamani Hamed, Croft W. Bruce
- Sub-instruction Aware Vision-and-language Navigation Hong Yicong, Rodriguez-opazo Cristian, Wu Qi, Gould Stephen
- Memory Attentive Fusion: External Language Model Integration For Transformer-based Sequence-to-sequence Model Ihori Mana, Masumura Ryo, Makishima Naoki, Tanaka Tomohiro, Takashima Akihiko, Orihashi Shota
- In Defense Of Grid Features For Visual Question Answering Jiang Huaizu, Misra Ishan, Rohrbach Marcus, Learned-miller Erik, Chen Xinlei
- TLDR: Token Loss Dynamic Reweighting For Reducing Repetitive Utterance Generation Jiang Shaojie, Wolf Thomas, Monz Christof, De Rijke Maarten
- Meta-context Transformers For Domain-specific Response Generation Kar Debanjana, Samanta Suranjana, Azad Amar Prakash
- Current Limitations Of Language Models: What You Need Is Retrieval Komatsuzaki Aran
- PONE: A Novel Automatic Evaluation Metric For Open-domain Generative Dialogue Systems Lan Tian, Mao Xian-ling, Wei Wei, Gao Xiaoyan, Huang Heyan
- Which Kind Is Better In Open-domain Multi-turn Dialog, Hierarchical Or Non-hierarchical Models? An Empirical Study Lan Tian, Mao Xian-ling, Wei Wei, Huang Heyan
- LIMEADE: From AI Explanations To Advice Taking Lee Benjamin Charles Germain, Downey Doug, Lo Kyle, Weld Daniel S.
- Multilingual Speech Translation With Efficient Finetuning Of Pretrained Models Li Xian, Wang Changhan, Tang Yun, Tran Chau, Tang Yuqing, Pino Juan, Baevski Alexei, Conneau Alexis, Auli Michael
- Enhancing Dialogue Generation Via Multi-level Contrastive Learning Li Xin, Li Piji, Wang Yan, Liu Xiaojiang, Lam Wai
- Neural Machine Translation With Joint Representation Li Yanyang, Wang Qiang, Xiao Tong, Liu Tongran, Zhu Jingbo
- A Comparison Of Pre-trained Vision-and-language Models For Multimodal Representation Learning Across Medical Images And Reports Li Yikuan, Wang Hanyin, Luo Yuan
- Toward Interpretability Of Dual-encoder Models For Dialogue Response Suggestions Li Yitong, Li Dianqi, Prakash Sushant, Wang Peng
- Rikinet: Reading Wikipedia Pages For Natural Question Answering Liu Dayiheng, Gong Yeyun, Fu Jie, Yan Yu, Chen Jiusheng, Jiang Daxin, Lv Jiancheng, Duan Nan
- KG-BART: Knowledge Graph-augmented BART For Generative Commonsense Reasoning Liu Ye, Wan Yao, He Lifang, Peng Hao, Yu Philip S.
- VECO: Variable And Flexible Cross-lingual Pre-training For Language Understanding And Generation Luo Fuli, Wang Wei, Liu Jiahao, Liu Yijia, Bi Bin, Huang Songfang, Huang Fei, Si Luo
- Asking Complex Questions With Multi-hop Answer-focused Reasoning Ma Xiyao, Zhu Qile, Zhou Yanlin, Li Xiaolin, Wu Dapeng
- Capturing Document Context Inside Sentence-level Neural Machine Translation Models With Self-training Mansimov Elman, Melis Gábor, Yu Lei
- Natural Language Rationales With Full-stack Visual Reasoning: From Pixels To Semantic Frames To Commonsense Graphs Marasović Ana, Bhagavatula Chandra, Park Jae Sung, Bras Ronan Le, Smith Noah A., Choi Yejin
- ADER: Adaptively Distilled Exemplar Replay Towards Continual Learning For Session-based Recommendation Mi Fei, Lin Xiaoyu, Faltings Boi
- Lambert: Language And Action Learning Using Multimodal BERT Miyazawa Kazuki, Aoki Tatsuya, Horii Takato, Nagai Takayuki
- On Incorporating Structural Information To Improve Dialogue Response Generation Moghe Nikita, Vijayan Priyesh, Ravindran Balaraman, Khapra Mitesh M.
- NABU - Multilingual Graph-based Neural RDF Verbalizer Moussallem Diego, Gnaneshwar Dwaraknath, Ferreira Thiago Castro, Ngomo Axel-cyrille Ngonga
- Axformer: Accuracy-driven Approximation Of Transformers For Faster, Smaller And More Accurate NLP Models Nagarajan Amrit, Sen Sanchari, Stevens Jacob R., Raghunathan Anand
- Answer Generation Through Unified Memories Over Multiple Passages Nakatsuji Makoto, Okui Sohei
- Better Distractions: Transformer-based Distractor Generation And Multiple Choice Question Filtering Offerijns Jeroen, Verberne Suzan, Verhoef Tessa
- Alleviating The Burden Of Labeling: Sentence Generation By Attention Branch Encoder-decoder Network Ogura Tadashi, Magassouba Aly, Sugiura Komei, Hirakawa Tsubasa, Yamashita Takayoshi, Fujiyoshi Hironobu, Kawai Hisashi
- Facts2story: Controlling Text Generation By Key Facts Orbach Eyal, Goldberg Yoav
- Challenges And Thrills Of Legal Arguments Pallaprolu Anurag, Vaidya Radha, Attawar Aditya Swaroop
- X-linear Attention Networks For Image Captioning Pan Yingwei, Yao Ting, Li Yehao, Mei Tao
- Multi-view Attention Network For Visual Dialog Park Sungjin, Whang Taesun, Yoon Yeochan, Lim Heuiseok
- Telling Bert's Full Story: From Local Attention To Global Aggregation Pascual Damian, Brunner Gino, Wattenhofer Roger
- Sparsifying Transformer Models With Trainable Representation Pooling Pietruszka Michał, Borchmann Łukasz, Garncarek Łukasz
- Prophetnet: Predicting Future N-gram For Sequence-to-sequence Pre-training Qi Weizhen, Yan Yu, Gong Yeyun, Liu Dayiheng, Duan Nan, Chen Jiusheng, Zhang Ruofei, Zhou Ming
- Automatic Distractor Generation For Multiple Choice Questions In Standard Tests Qiu Zhaopeng, Wu Xian, Fan Wei
- Exploring Recurrent, Memory And Attention Based Architectures For Scoring Interactional Aspects Of Human-machine Text Dialog Ramanarayanan Vikram, Mulholland Matthew, Ghosh Debanjan
- Hierarchical GPT With Congruent Transformers For Multi-sentence Language Models Roh Jihyeon, Gim Huiseong, Lee Soo-young
- Personalized Query Rewriting In Conversational AI Agents Roshan-ghias Alireza, Mathialagan Clint Solomon, Ponnusamy Pragaash, Mathias Lambert, Guo Chenlei
- Hierarchical Transformer For Task Oriented Dialog Systems Santra Bishal, Anusha Potnuru, Goyal Pawan
- Look Before You Speak: Visually Contextualized Utterances Seo Paul Hongsuck, Nagrani Arsha, Schmid Cordelia
- Reasoning Over History: Context Aware Visual Dialog Shah Muhammad A., Mehri Shikib, Srinivasan Tejas
- Neural Data-to-text Generation Via Jointly Learning The Segmentation And Correspondence Shen Xiaoyu, Chang Ernie, Su Hui, Zhou Jie, Klakow Dietrich
- Neural Machine Translation With Error Correction Song Kaitao, Tan Xu, Lu Jianfeng
- An Investigation Of Language Model Interpretability Via Sentence Editing Stevens Samuel, Su Yu
- Recipes For Adapting Pre-trained Monolingual And Multilingual Models To Machine Translation Stickland Asa Cooper, Li Xian, Ghazvininejad Marjan
- Benchmarking Machine Reading Comprehension: A Psychological Perspective Sugawara Saku, Stenetorp Pontus, Aizawa Akiko
- Explain And Improve: Lrp-inference Fine-tuning For Image Captioning Models Sun Jiamei, Lapuschkin Sebastian, Samek Wojciech, Binder Alexander
- Edgebert: Sentence-level Energy Optimizations For Latency-aware Multi-task NLP Inference Tambe Thierry, Hooper Coleman, Pentecost Lillian, Jia Tianyu, Yang En-yu, Donato Marco, Sanh Victor, Whatmough Paul N., Rush Alexander M., Brooks David, Wei Gu-yeon
- Interpretable Neural Computation For Real-world Compositional Visual Question Answering Tang Ruixue, Ma Chao
- Synthesizer: Rethinking Self-attention In Transformer Models Tay Yi, Bahri Dara, Metzler Donald, Juan Da-cheng, Zhao Zhe, Zheng Che
- Cycle Text-to-image GAN With BERT Tsue Trevor, Sen Samir, Li Jason
- Multi-task Learning With Multi-head Attention For Multi-choice Reading Comprehension Wan Hui
- HAT: Hardware-aware Transformers For Efficient Natural Language Processing Wang Hanrui, Wu Zhanghao, Liu Zhijian, Cai Han, Zhu Ligeng, Gan Chuang, Han Song
- SRQA: Synthetic Reader For Factoid Question Answering Wang Jiuniu, Xu Wenjia, Fu Xingyu, Wei Yang, Jin Li, Chen Ziyan, Xu Guangluan, Wu Yirong
- Encoding Syntactic Knowledge In Transformer Encoder For Intent Detection And Slot Filling Wang Jixuan, Wei Kai, Radfar Martin, Zhang Weiwei, Chung Clement
- Minilm: Deep Self-attention Distillation For Task-agnostic Compression Of Pre-trained Transformers Wang Wenhui, Wei Furu, Dong Li, Bao Hangbo, Yang Nan, Zhou Ming
- VD-BERT: A Unified Vision And Dialog Transformer With BERT Wang Yue, Joty Shafiq, Lyu Michael R., King Irwin, Xiong Caiming, Hoi Steven C. H.
- A Controllable Model Of Grounded Response Generation Wu Zeqiu, Galley Michel, Brockett Chris, Zhang Yizhe, Gao Xiang, Quirk Chris, Koncel-kedziorski Rik, Gao Jianfeng, Hajishirzi Hannaneh, Ostendorf Mari, Dolan Bill
- Lite Transformer With Long-short Range Attention Wu Zhanghao, Liu Zhijian, Lin Ji, Lin Yujun, Han Song
- Context-guided BERT For Targeted Aspect-based Sentiment Analysis Wu Zhengxuan, Ong Desmond C.
- ERNIE-GEN: An Enhanced Multi-flow Pre-training And Fine-tuning Framework For Natural Language Generation Xiao Dongling, Zhang Han, Li Yukun, Sun Yu, Tian Hao, Wu Hua, Wang Haifeng
- Generating Informative Dialogue Responses With Keywords-guided Networks Xu Heng-da, Mao Xian-ling, Chi Zewen, Zhu Jing-jing, Sun Fanshu, Huang Heyan
- Multi-unit Transformers For Neural Machine Translation Yan Jianhao, Meng Fandong, Zhou Jie
- Seqdialn: Sequential Visual Dialog Networks In Joint Visual-linguistic Representation Space Yang Liu
- IART: Intent-aware Response Ranking With Transformers In Information-seeking Conversation Systems Yang Liu, Qiu Minghui, Qu Chen, Chen Cen, Guo Jiafeng, Zhang Yongfeng, Croft W. Bruce, Chen Haiqing
- Contrastive Triple Extraction With Generative Transformer Ye Hongbin, Zhang Ningyu, Deng Shumin, Chen Mosha, Tan Chuanqi, Huang Fei, Chen Huajun
- Hard-coded Gaussian Attention For Neural Machine Translation You Weiqiu, Sun Simeng, Iyyer Mohit
- GOBO: Quantizing Attention-based NLP Models For Low Latency And Energy Efficient Inference Zadeh Ali Hadi, Edo Isak, Awad Omar Mohamed, Moshovos Andreas
- Big Bird: Transformers For Longer Sequences Zaheer Manzil, Guruganesh Guru, Dubey Avinava, Ainslie Joshua, Alberti Chris, Ontanon Santiago, Pham Philip, Ravula Anirudh, Wang Qifan, Yang Li, Ahmed Amr
- Open-domain Dialogue Generation Based On Pre-trained Language Models Zeng Yan, Nie Jian-yun
- Modeling Topical Relevance For Multi-turn Dialogue Generation Zhang Hainan, Lan Yanyan, Pang Liang, Chen Hongshen, Ding Zhuoye, Yin Dawei
- BERT-JAM: Boosting Bert-enhanced Neural Machine Translation With Joint Attention Zhang Zhebin, Wu Sai, Jiang Dawei, Chen Gang
- CPM: A Large-scale Generative Chinese Pre-trained Language Model Zhang Zhengyan, Han Xu, Zhou Hao, Ke Pei, Gu Yuxian, Ye Deming, Qin Yujia, Su Yusheng, Ji Haozhe, Guan Jian, Qi Fanchao, Wang Xiaozhi, Zheng Yanan, Zeng Guoyang, Cao Huanqi, Chen Shengqi, Li Daixuan, Sun Zhenbo, Liu Zhiyuan, Huang Minlie, Han Wentao, Tang Jie, Li Juanzi, Zhu Xiaoyan, Sun Maosong
- Know What You Don't Need: Single-shot Meta-pruning For Attention Heads Zhang Zhengyan, Qi Fanchao, Liu Zhiyuan, Liu Qun, Sun Maosong
- A Closer Look At Few-shot Crosslingual Transfer: The Choice Of Shots Matters Zhao Mengjie, Zhu Yi, Shareghi Ehsan, Vulić Ivan, Reichart Roi, Korhonen Anna, Schütze Hinrich
- DUMA: Reading Comprehension With Transposition Thinking Zhu Pengfei, Zhao Hai, Li Xiaoguang
- Towards Understanding Sample Variance In Visually Grounded Language Generation: Evaluations And Observations Zhu Wanrong, Wang Xin Eric, Narayana Pradyumna, Sone Kazoo, Basu Sugato, Wang William Yang
- Vision-dialog Navigation By Exploring Cross-modal Memory Zhu Yi, Zhu Fengda, Zhan Zhaohuan, Lin Bingqian, Jiao Jianbin, Chang Xiaojun, Liang Xiaodan
- Regularizing Transformers With Deep Probabilistic Layers Aguilera Aurora Cobo, Olmos Pablo Martínez, Artés-rodríguez Antonio, Pérez-cruz Fernando
- Goal-directed Story Generation: Augmenting Generative Language Models With Reinforcement Learning Alabdulkarim Amal, Li Winston, Martin Lara J., Riedl Mark O.
- See, Hear, Read: Leveraging Multimodality With Guided Attention For Abstractive Text Summarization Atri Yash Kumar, Pramanick Shraman, Goyal Vikram, Chakraborty Tanmoy
- G-transformer For Document-level Machine Translation Bao Guangsheng, Zhang Yue, Teng Zhiyang, Chen Boxing, Luo Weihua
- Building A Question And Answer System For News Domain Basu Sandipan, Gaddala Aravind, Chetan Pooja, Tiwari Garima, Darapaneni Narayana, Parvathaneni Sadwik, Paduri Anwesh Reddy
- SHAQ: Single Headed Attention With Quasi-recurrence Bharwani Nashwin, Kushner Warren, Dandona Sangeet, Schreiber Ben
- Leveraging Redundancy In Attention With Reuse Transformers Bhojanapalli Srinadh, Chakrabarti Ayan, Veit Andreas, Lukasik Michal, Jain Himanshu, Liu Frederick, Chang Yin-wen, Kumar Sanjiv
- Improving Language Models By Retrieving From Trillions Of Tokens Borgeaud Sebastian, Mensch Arthur, Hoffmann Jordan, Cai Trevor, Rutherford Eliza, Millican Katie, Driessche George Van Den, Lespiau Jean-baptiste, Damoc Bogdan, Clark Aidan, Casas Diego De Las, Guy Aurelia, Menick Jacob, Ring Roman, Hennigan Tom, Huang Saffron, Maggiore Loren, Jones Chris, Cassirer Albin, Brock Andy, Paganini Michela, Irving Geoffrey, Vinyals Oriol, Osindero Simon, Simonyan Karen, Rae Jack W., Elsen Erich, Sifre Laurent
- On The Prunability Of Attention Heads In Multilingual BERT Budhraja Aakriti, Pande Madhura, Kumar Pratyush, Khapra Mitesh M.
- CDLM: Cross-document Language Modeling Caciularu Avi, Cohan Arman, Beltagy Iz, Peters Matthew E., Cattan Arie, Dagan Ido
- Coarse-grained Decomposition And Fine-grained Interaction For Multi-hop Question Answering Cao Xing, Liu Yun
- The Selectgen Challenge: Finding The Best Training Samples For Few-shot Neural Text Generation Chang Ernie, Shen Xiaoyu, Marin Alex, Demberg Vera
- On Training Instance Selection For Few-shot Neural Text Generation Chang Ernie, Shen Xiaoyu, Yeh Hui-syuan, Demberg Vera
- Generic Attention-model Explainability For Interpreting Bi-modal And Encoder-decoder Transformers Chefer Hila, Gur Shir, Wolf Lior
- Multimodal Incremental Transformer With Visual Grounding For Visual Dialogue Generation Chen Feilong, Meng Fandong, Chen Xiuyi, Li Peng, Zhou Jie
- Visualgpt: Data-efficient Adaptation Of Pretrained Language Models For Image Captioning Chen Jun, Guo Han, Yi Kai, Li Boyang, Elhoseiny Mohamed
- Reinforcement Learning For Few-shot Text Generation Adaptation Cheng Pengsen, Dai Jinqiao, Liu Jiamiao, Liu Jiayong, Jia Peng
- Breaking Down Multilingual Machine Translation Chiang Ting-rui, Chen Yi-pei, Yeh Yi-ting, Neubig Graham
- Language Modeling Using Lmus: 10x Better Data Efficiency Or Improved Scaling Compared To Transformers Chilkuri Narsimha, Hunsberger Eric, Voelker Aaron, Malik Gurshaant, Eliasmith Chris
- Enjoy The Salience: Towards Better Transformer-based Faithful Explanations With Word Salience Chrysostomou George, Aletras Nikolaos
- Improving The Faithfulness Of Attention-based Explanations With Task-specific Information For Text Classification Chrysostomou George, Aletras Nikolaos
- Multilingual Multi-aspect Explainability Analyses On Machine Reading Comprehension Models Cui Yiming, Zhang Wei-nan, Che Wanxiang, Liu Ting, Chen Zhigang, Wang Shijin
- YANMTT: Yet Another Neural Machine Translation Toolkit Dabre Raj, Sumita Eiichiro
- Mention Memory: Incorporating Textual Knowledge Into Transformers Through Entity Mention Attention De Jong Michiel, Zemlyanskiy Yury, Fitzgerald Nicholas, Sha Fei, Cohen William
- Not Quite 'ask A Librarian': AI On The Nature, Value, And Future Of LIS Dinneen Jesse David, Bubinger Helen
- On-the-fly Attention Modulation For Neural Generation Dong Yue, Bhagavatula Chandra, Lu Ximing, Hwang Jena D., Bosselut Antoine, Cheung Jackie Chi Kit, Choi Yejin
- Attention Forcing For Machine Translation Dou Qingyun, Lu Yiting, Manakul Potsawee, Wu Xixin, Gales Mark J. F.
- An Empirical Study Of Training End-to-end Vision-and-language Transformers Dou Zi-yi, Xu Yichong, Gan Zhe, Wang Jianfeng, Wang Shuohang, Wang Lijuan, Zhu Chenguang, Zhang Pengchuan, Yuan Lu, Peng Nanyun, Liu Zicheng, Zeng Michael
- Generative Context Pair Selection For Multi-hop Question Answering Dua Dheeru, Santos Cicero Nogueira Dos, Ng Patrick, Athiwaratkun Ben, Xiang Bing, Gardner Matt, Singh Sameer
- Latent Alignment Of Procedural Concepts In Multimodal Recipes Faghihi Hossein Rajaby, Mirzaee Roshanak, Paliwal Sudarshan, Kordjamshidi Parisa
- Leveraging Knowledge In Multilingual Commonsense Reasoning Fang Yuwei, Wang Shuohang, Xu Yichong, Xu Ruochen, Sun Siqi, Zhu Chenguang, Zeng Michael
- Compressing Visual-linguistic Model Via Knowledge Distillation Fang Zhiyuan, Wang Jianfeng, Hu Xiaowei, Wang Lijuan, Yang Yezhou, Liu Zicheng
- Attention Weights In Transformer NMT Fail Aligning Words Between Sequences But Largely Explain Model Predictions Ferrando Javier, Costa-jussà Marta R.
- Grounded Graph Decoding Improves Compositional Generalization In Question Answering Gai Yu, Jain Paras, Zhang Wendi, Gonzalez Joseph E., Song Dawn, Stoica Ion
- Does Dialog Length Matter For Next Response Selection Task? An Empirical Study Ganhotra Jatin, Joshi Sachindra
- Chop Chop BERT: Visual Question Answering By Chopping Visualbert's Heads Gao Chenyu, Zhu Qi, Wang Peng, Wu Qi
- Condenser: A Pre-training Architecture For Dense Retrieval Gao Luyu, Callan Jamie
- Improving And Diagnosing Knowledge-based Visual Question Answering Via Entity Enhanced Knowledge Injection Garcia-olano Diego, Onoe Yasumasa, Ghosh Joydeep
- Cross-attention Is All You Need: Adapting Pretrained Transformers For Machine Translation Gheini Mozhdeh, Ren Xiang, May Jonathan
- Transaction: ICL-SJTU Submission To Epic-kitchens Action Anticipation Challenge 2021 Gu Xiao, Qiu Jianing, Guo Yao, Lo Benny, Yang Guang-zhong
- Block-skim: Efficient Question Answering For Transformer Guan Yue, Li Zhengyi, Leng Jingwen, Lin Zhouhan, Guo Minyi, Zhu Yuhao
- Longt5: Efficient Text-to-text Transformer For Long Sequences Guo Mandy, Ainslie Joshua, Uthus David, Ontanon Santiago, Ni Jianmo, Sung Yun-hsuan, Yang Yinfei
- Memory-efficient Transformers Via Top-k Attention Gupta Ankit, Dar Guy, Goodman Shaya, Ciprut David, Berant Jonathan
- Greedy Gradient Ensemble For Robust Visual Question Answering Han Xinzhe, Wang Shuhui, Su Chi, Huang Qingming, Tian Qi
- Zero-shot Controlled Generation With Encoder-decoder Transformers Hazarika Devamanyu, Namazifar Mahdi, Hakkani-tür Dilek
- The Niutrans System For WNGT 2020 Efficiency Task Hu Chi, Li Bei, Lin Ye, Li Yinqiao, Li Yanyang, Wang Chenglong, Xiao Tong, Zhu Jingbo
- Transfer Learning For Sequence Generation: From Single-source To Multi-source Huang Xuancheng, Xu Jingfang, Sun Maosong, Liu Yang
- Audio-oriented Multimodal Machine Comprehension: Task, Dataset And Model Huang Zhiqi, Liu Fenglin, Wu Xian, Ge Shen, Wang Helin, Fan Wei, Zou Yuexian
- Visqa: X-raying Vision And Language Reasoning In Transformers Jaunet Theo, Kervadec Corentin, Vuillemot Romain, Antipov Grigory, Baccouche Moez, Wolf Christian
- A Neural Conversation Generation Model Via Equivalent Shared Memory Investigation Ji Changzhen, Zhang Yating, Liu Xiaozhong, Jatowt Adam, Sun Changlong, Zhu Conghui, Zhao Tiejun
- Inducing Transformer's Compositional Generalization Ability Via Auxiliary Sequence Prediction Tasks Jiang Yichen, Bansal Mohit
- XRJL-HKUST At Semeval-2021 Task 4: Wordnet-enhanced Dual Multi-head Co-attention For Reading Comprehension Of Abstract Meaning Jiang Yuxin, Shou Ziyi, Wang Qijun, Wu Hao, Lin Fangzhen
- Attention Is Indeed All You Need: Semantically Attention-guided Decoding For Data-to-text NLG Juraska Juraj, Walker Marilyn
- Finetuning Pretrained Transformers Into Rnns Kasai Jungo, Peng Hao, Zhang Yizhe, Yogatama Dani, Ilharco Gabriel, Pappas Nikolaos, Mao Yi, Chen Weizhu, Smith Noah A.
- How Transferable Are Reasoning Patterns In VQA? Kervadec Corentin, Jaunet Theo, Antipov Grigory, Baccouche Moez, Vuillemot Romain, Wolf Christian
- Visual Question Answering Based On Local-scene-aware Referring Expression Generation Kim Jung-jun, Lee Dong-gyu, Wu Jialin, Jung Hong-gyu, Lee Seong-whan
- Learned Token Pruning For Transformers Kim Sehoon, Shen Sheng, Thorsley David, Gholami Amir, Kwon Woosuk, Hassoun Joseph, Keutzer Kurt
- Hurdles To Progress In Long-form Question Answering Krishna Kalpesh, Roy Aurko, Iyyer Mohit
- On The Significance Of Question Encoder Sequence Model In The Out-of-distribution Performance In Visual Question Answering Kv Gouthaman, Mittal Anurag
- Draw Me A Flower: Processing And Grounding Abstraction In Natural Language Lachmy Royi, Pyatkin Valentina, Manevich Avshalom, Tsarfaty Reut
- You Only Need One Model For Open-domain Question Answering Lee Haejun, Kedia Akhil, Lee Jongwon, Paranjape Ashwin, Manning Christopher D., Woo Kyoung-gu
- Towards Few-shot Fact-checking Via Perplexity Lee Nayeon, Bang Yejin, Madotto Andrea, Khabsa Madian, Fung Pascale
- When Attention Meets Fast Recurrence: Training Language Models With Reduced Compute Lei Tao
- Semvlp: Vision-language Pre-training By Aligning Semantics At Multiple Levels Li Chenliang, Yan Ming, Xu Haiyang, Luo Fuli, Wang Wei, Bi Bin, Huang Songfang
- Residual Tree Aggregation Of Layers For Neural Machine Translation Li Guoliang, Li Yiyang
- Align Before Fuse: Vision And Language Representation Learning With Momentum Distillation Li Junnan, Selvaraju Ramprasaath R., Gotmare Akhilesh Deepak, Joty Shafiq, Xiong Caiming, Hoi Steven
- Ensemble ALBERT On Squad 2.0 Li Shilun, Li Renee, Peng Veronica
- A Short Study On Compressing Decoder-based Language Models Li Tianda, Mesbahi Yassir El, Kobyzev Ivan, Rashid Ahmad, Mahmud Atif, Anchuri Nithin, Hajimolahoseini Habib, Liu Yang, Rezagholizadeh Mehdi
- Question-aware Memory Network For Multi-hop Question Answering In Human-robot Interaction Li Xinmeng, Alazab Mamoun, Li Qian, Yu Keping, Yin Quanjun
- Supervision Exists Everywhere: A Data Efficient Contrastive Language-image Pre-training Paradigm Li Yangguang, Liang Feng, Zhao Lichen, Cui Yufeng, Ouyang Wanli, Shao Jing, Yu Fengwei, Yan Junjie
- Dialoging Resonance: How Users Perceive, Reciprocate And React To Chatbot's Self-disclosure In Conversational Recommendations Liang Kai-hui, Shi Weiyan, Oh Yoojung, Wang Hao-chuan, Zhang Jingwen, Yu Zhou
- Learning Neural Templates For Recommender Dialogue System Liang Zujie, Hu Huang, Xu Can, Miao Jian, He Yingying, Chen Yining, Geng Xiubo, Liang Fan, Jiang Daxin
- Multimodal Transformer With Variable-length Memory For Vision-and-language Navigation Lin Chuang, Jiang Yi, Cai Jianfei, Qu Lizhen, Haffari Gholamreza, Yuan Zehuan
- Codeqa: A Question Answering Dataset For Source Code Comprehension Liu Chenxiao, Wan Xiaojun
- What Makes Good In-context Examples For GPT-3? Liu Jiachang, Shen Dinghan, Zhang Yizhe, Dolan Bill, Carin Lawrence, Chen Weizhu
- Learning To Ask Conversational Questions By Optimizing Levenshtein Distance Liu Zhongkun, Ren Pengjie, Chen Zhumin, Ren Zhaochun, De Rijke Maarten, Zhou Ming
- Pretrained Transformers As Universal Computation Engines Lu Kevin, Grover Aditya, Abbeel Pieter, Mordatch Igor
- X-METRA-ADA: Cross-lingual Meta-transfer Learning Adaptation To Natural Language Understanding And Question Answering M'hamdi Meryem, Kim Doo Soon, Dernoncourt Franck, Bui Trung, Ren Xiang, May Jonathan
- Direct Simultaneous Speech-to-speech Translation With Variational Monotonic Multihead Attention Ma Xutai, Gong Hongyu, Liu Danni, Lee Ann, Tang Yun, Chen Peng-jen, Hsu Wei-ning, Koehn Phillip, Pino Juan
- Coherence Boosting: When Your Pretrained Language Model Is Not Paying Enough Attention Malkin Nikolay, Wang Zhen, Jojic Nebojsa
- Towards Retrieval-based Conversational Recommendation Manzoor Ahtsham, Jannach Dietmar
- ∞-former: Infinite Memory Transformer Martins Pedro Henrique, Marinho Zita, Martins André F. T.
- Transformer Based Bengali Chatbot Using General Knowledge Dataset Masum Abu Kaisar Mohammad, Abujar Sheikh, Akter Sharmin, Ria Nushrat Jahan, Hossain Syed Akhter
- Lattegan: Visually Guided Language Attention For Multi-turn Text-conditioned Image Manipulation Matsumori Shoya, Abe Yuki, Shingyouchi Kosuke, Sugiura Komei, Imai Michita
- Unified Questioner Transformer For Descriptive Question Generation In Goal-oriented Visual Dialogue Matsumori Shoya, Shingyouchi Kosuke, Abe Yuki, Fukuchi Yosuke, Sugiura Komei, Imai Michita
- Schema-guided Paradigm For Zero-shot Dialog Mehri Shikib, Eskenazi Maxine
- Prevent The Language Model From Being Overconfident In Neural Machine Translation Miao Mengqi, Meng Fandong, Liu Yijin, Zhou Xiao-hua, Zhou Jie
- Structural Analysis Of An All-purpose Question Answering Model Micheli Vincent, Heinrich Quentin, Fleuret François, Belblidia Wacim
- Multi-modal Understanding And Generation For Medical Images And Text Via Vision-language Pre-training Moon Jong Hak, Lee Hyungyung, Shin Woncheol, Kim Young-hak, Choi Edward
- Software-based Dialogue Systems: Survey, Taxonomy And Challenges Motger Quim, Franch Xavier, Marco Jordi
- P-adapters: Robustly Extracting Factual Information From Language Models With Diverse Prompts Newman Benjamin, Choubey Prafulla Kumar, Rajani Nazneen
- Improving Transformers With Probabilistic Attention Keys Nguyen Tam, Nguyen Tan M., Le Dung D., Nguyen Duy Khuong, Tran Viet-anh, Baraniuk Richard G., Ho Nhat, Osher Stanley J.
- Materialized Knowledge Bases From Commonsense Transformers Nguyen Tuan-phong, Razniewski Simon
- MLP Architectures For Vision-and-language Modeling: An Empirical Study Nie Yixin, Li Linjie, Gan Zhe, Wang Shuohang, Zhu Chenguang, Zeng Michael, Liu Zicheng, Bansal Mohit, Wang Lijuan
- Probing For Bridging Inference In Transformer Language Models Pandit Onkar, Hou Yufang
- ABC: Attention With Bounded-memory Control Peng Hao, Kasai Jungo, Pappas Nikolaos, Yogatama Dani, Wu Zhaofeng, Kong Lingpeng, Schwartz Roy, Smith Noah A.
- Focused Attention Improves Document-grounded Generation Prabhumoye Shrimai, Hashimoto Kazuma, Zhou Yingbo, Black Alan W, Salakhutdinov Ruslan
- VT-CLIP: Enhancing Vision-language Models With Visual-guided Texts Qiu Longtian, Zhang Renrui, Guo Ziyu, Zeng Ziyao, Guo Zilu, Li Yafeng, Zhang Guangnan
- Towards Solving Multimodal Comprehension Sahu Pritish, Sikka Karan, Divakaran Ajay
- Audio-visual Scene-aware Dialog And Reasoning Using Audio-visual Transformers With Joint Student-teacher Learning Shah Ankit P., Geng Shijie, Gao Peng, Cherian Anoop, Hori Takaaki, Marks Tim K., Roux Jonathan Le, Hori Chiori
- Laviter: Learning Aligned Visual And Textual Representations Assisted By Image And Caption Generation Shaikh Mohammad Abuzar, Ji Zhanghexuan, Moukheiber Dana, Shen Yan, Srihari Sargur, Gao Mingchen
- Societal Biases In Language Generation: Progress And Challenges Sheng Emily, Chang Kai-wei, Natarajan Premkumar, Peng Nanyun
- Adapting Long Context NLM For ASR Rescoring In Conversational Agents Shenoy Ashish, Bodapati Sravan, Sunkara Monica, Ronanki Srikanth, Kirchhoff Katrin
- Layer-wise Pruning Of Transformer Attention Heads For Efficient Language Modeling Shim Kyuhong, Choi Iksoo, Sung Wonyong, Choi Jungwook
- Pre-trained Language Models As Prior Knowledge For Playing Text-based Games Singh Ishika, Singh Gargi, Modi Ashutosh
- Semantics-aware Attention Improves Neural Machine Translation Slobodkin Aviv, Choshen Leshem, Abend Omri
- Primer: Searching For Efficient Transformers For Language Modeling So David R., Mańke Wojciech, Liu Hanxiao, Dai Zihang, Shazeer Noam, Le Quoc V.
- Multimodal Integration Of Human-like Attention In Visual Question Answering Sood Ekta, Kögel Fabian, Müller Philipp, Thomas Dominike, Bace Mihai, Bulling Andreas
- VQA-MHUG: A Gaze Dataset To Study Multimodal Neural Attention In Visual Question Answering Sood Ekta, Kögel Fabian, Strohm Florian, Dhar Prajit, Bulling Andreas
- Worst Of Both Worlds: Biases Compound In Pre-trained Vision-and-language Models Srinivasan Tejas, Bisk Yonatan
- Non-autoregressive Text Generation With Pre-trained Language Models Su Yixuan, Cai Deng, Wang Yan, Vandyke David, Baker Simon, Li Piji, Collier Nigel
- Not All Memories Are Created Equal: Learning To Forget By Expiring Sukhbaatar Sainbayar, Ju Da, Poff Spencer, Roller Stephen, Szlam Arthur, Weston Jason, Fan Angela
- A Non-hierarchical Attention Network With Modality Dropout For Textual Response Generation In Multimodal Dialogue Systems Sun Rongyi, Chen Borun, Zhou Qingyu, Li Yinghui, Cao Yunbo, Zheng Hai-tao
- Do Long-range Language Models Actually Use Long-range Context? Sun Simeng, Krishna Kalpesh, Mattarella-micke Andrew, Iyyer Mohit
- Lightningdot: Pre-training Visual-semantic Embeddings For Real-time Image-text Retrieval Sun Siqi, Chen Yen-chun, Li Linjie, Wang Shuohang, Fang Yuwei, Liu Jingjing
- Know Deeper: Knowledge-conversation Cyclic Utilization Mechanism For Open-domain Dialogue Generation Sun Yajing, Hu Yue, Xing Luxi, Xie Yuqiang, Wei Xiangpeng
- Multilingual Translation Via Grafting Pre-trained Language Models Sun Zewei, Wang Mingxuan, Li Lei
- Exploring Transformers In Natural Language Generation: GPT, BERT, And Xlnet Topal M. Onat, Bas Anil, Van Heerden Imke
- Local Explanation Of Dialogue Response Generation Tuan Yi-lin, Pryor Connor, Chen Wenhu, Getoor Lise, Wang William Yang
- Topicrefine: Joint Topic Prediction And Dialogue Response Generation For Multi-turn End-to-end Dialogue System Wang Hongru, Cui Mingyu, Zhou Zimo, Fung Gabriel Pui Cheong, Wong Kam-fai
- Distilled Dual-encoder Model For Vision-language Understanding Wang Zekun, Wang Wenhui, Zhu Haichao, Liu Ming, Qin Bing, Wei Furu
- Dodrio: Exploring Transformer Models With Interactive Visualization Wang Zijie J., Turko Robert, Chau Duen Horng
- VAULT: Variable Unified Long Text Representation For Machine Reading Comprehension Wen Haoyang, Ferritto Anthony, Ji Heng, Florian Radu, Sil Avirup
- Graph-free Multi-hop Reading Comprehension: A Select-to-guide Strategy Wu Bohong, Zhang Zhuosheng, Zhao Hai
- Fastformer: Additive Attention Can Be All You Need Wu Chuhan, Wu Fangzhao, Qi Tao, Huang Yongfeng, Xie Xing
- A Multilingual Modeling Method For Span-extraction Reading Comprehension Wu Gaochen, Xu Bin, Chang Dejie, Liu Bangchang
- Improving Low-resource Reading Comprehension Via Cross-lingual Transposition Rethinking Wu Gaochen, Xu Bin, Qin Yuxin, Kong Fei, Liu Bangchang, Zhao Hongwen, Chang Dejie
- Not All Attention Is All You Need Wu Hongqiu, Zhao Hai, Zhang Min
- On Explaining Your Explanations Of BERT: An Empirical Study With Sequence Classification Wu Zhengxuan, Ong Desmond C.
- Simple Local Attentions Remain Competitive For Long-context Tasks Xiong Wenhan, Oğuz Barlas, Gupta Anchit, Chen Xilun, Liskovich Diana, Levy Omer, Yih Wen-tau, Mehdad Yashar
- Moca: Incorporating Multi-stage Domain Pretraining And Cross-guided Multimodal Attention For Textbook Question Answering Xu Fangzhi, Lin Qika, Liu Jun, Zhang Lingling, Zhao Tianzhe, Chai Qi, Pan Yudai
- Attention-guided Generative Models For Extractive Question Answering Xu Peng, Liang Davis, Huang Zhiheng, Xiang Bing
- Fastseq: Make Sequence Generation Faster Yan Yu, Hu Fei, Chen Jiusheng, Bhendawade Nikhil, Ye Ting, Gong Yeyun, Duan Nan, Cui Desheng, Chi Bingyu, Zhang Ruofei
- Transformer-based Korean Pretrained Language Models: A Survey On Three Years Of Progress Yang Kichang
- Causal Attention For Vision-language Tasks Yang Xu, Zhang Hanwang, Qi Guojun, Cai Jianfei
- TR-BERT: Dynamic Token Reduction For Accelerating BERT Inference Ye Deming, Lin Yankai, Huang Yufei, Sun Maosong
- LV-BERT: Exploiting Layer Variety For BERT Yu Weihao, Jiang Zihang, Chen Fei, Hou Qibin, Feng Jiashi
- Language Bias In Visual Question Answering: A Survey And Taxonomy Yuan Desen
- Improving Stack Overflow Question Title Generation With Copying Enhanced Codebert Model And Bi-modal Information Zhang Fengji, Yu Xiao, Keung Jacky, Li Fuyang, Xie Zhiwen, Yang Zhen, Ma Caoyuan, Zhang Zhimin
- Fewnlu: Benchmarking State-of-the-art Methods For Few-shot Natural Language Understanding Zheng Yanan, Zhou Jing, Qian Yujie, Ding Ming, Liao Chonghua, Li Jian, Salakhutdinov Ruslan, Tang Jie, Ruder Sebastian, Yang Zhilin
- Mmchat: Multi-modal Chat Dataset On Social Media Zheng Yinhe, Chen Guanyi, Liu Xin, Sun Jian
- SILG: The Multi-environment Symbolic Interactive Language Grounding Benchmark Zhong Victor, Hanjie Austin W., Wang Sida I., Narasimhan Karthik, Zettlemoyer Luke
- Vl-interpret: An Interactive Visualization Tool For Interpreting Vision-language Transformers Aflalo Estelle, Du Meng, Tseng Shao-yen, Liu Yongfei, Wu Chenfei, Duan Nan, Lal Vasudev
- On The Calibration Of Massively Multilingual Language Models Ahuja Kabir, Sitaram Sunayana, Dandapat Sandipan, Choudhury Monojit
- Multimodal Sequential Generative Models For Semi-supervised Language Instruction Following Akuzawa Kei, Iwasawa Yusuke, Matsuo Yutaka
- Compound Tokens: Channel Fusion For Vision-language Representation Learning Aladago Maxwell Mbabilla, Piergiovanni Aj
- End-to-end Multimodal Representation Learning For Video Dialog Alamri Huda, Bilic Anthony, Hu Michael, Beedu Apoorva, Essa Irfan
- Input-tuning: Adapting Unfamiliar Inputs To Frozen Pretrained Models An Shengnan, Li Yifei, Lin Zeqi, Liu Qian, Chen Bei, Fu Qiang, Chen Weizhu, Zheng Nanning, Lou Jian-guang
- Tempo: Accelerating Transformer-based Model Training Through Memory Footprint Reduction Andoorveedu Muralidhar, Zhu Zhanda, Zheng Bojian, Pekhimenko Gennady
- Characterizing Verbatim Short-term Memory In Neural Language Models Armeni Kristijan, Honey Christopher, Linzen Tal
- ATTEMPT: Parameter-efficient Multi-task Tuning Via Attentional Mixtures Of Soft Prompts Asai Akari, Salehi Mohammadreza, Peters Matthew E., Hajishirzi Hannaneh
- Rethinking The Role Of Scale For In-context Learning: An Interpretability-based Case Study At 66 Billion Scale Bansal Hritik, Gopalakrishnan Karthik, Dingliwal Saket, Bodapati Sravan, Kirchhoff Katrin, Roth Dan
- P\(^3\)LM: Probabilistically Permuted Prophet Language Modeling For Generative Pre-training Bao Junwei, Wang Yifan, Ying Jiangyong, Gong Yeyun, Zhao Jing, Wu Youzheng, He Xiaodong
- Latent-glat: Glancing At Latent Variables For Parallel Text Generation Bao Yu, Zhou Hao, Huang Shujian, Wang Dongqi, Qian Lihua, Dai Xinyu, Chen Jiajun, Li Lei
- Grad-sam: Explaining Transformers Via Gradient Self-attention Maps Barkan Oren, Hauon Edan, Caciularu Avi, Katz Ori, Malkiel Itzik, Armstrong Omri, Koenigstein Noam
- Evidence > Intuition: Transferability Estimation For Encoder Selection Bassignana Elisa, Müller-eberstein Max, Zhang Mike, Plank Barbara
- Petals: Collaborative Inference And Fine-tuning Of Large Models Borzunov Alexander, Baranchuk Dmitry, Dettmers Tim, Ryabinin Max, Belkada Younes, Chumachenko Artem, Samygin Pavel, Raffel Colin
- Understanding Performance Of Long-document Ranking Models Through Comprehensive Evaluation And Leaderboarding Boytsov Leonid, Akinpelu David, Lin Tianyi, Gao Fangwei, Zhao Yutian, Huang Jeffrey, Katyal Nipun, Nyberg Eric
- Large Language Models Can Implement Policy Iteration Brooks Ethan, Walls Logan, Lewis Richard L., Singh Satinder
- Understanding Attention For Vision-and-language Tasks Cao Feiqi, Han Soyeon Caren, Long Siqu, Xu Changwei, Poon Josiah
- HIBRIDS: Attention With Hierarchical Biases For Structure-aware Long Document Summarization Cao Shuyang, Wang Lu
- Litevl: Efficient Video-language Learning With Enhanced Spatial-temporal Modeling Chen Dongsheng, Tao Chaofan, Hou Lu, Shang Lifeng, Jiang Xin, Liu Qun
- Exploring And Exploiting Multi-granularity Representations For Machine Reading Comprehension Chen Nuo, You Chenyu
- Towards Improving Faithfulness In Abstractive Summarization Chen Xiuying, Li Mingzhe, Gao Xin, Zhang Xiangliang
- Empowering Parameter-efficient Transfer Learning By Recognizing The Kernel Structure In Self-attention Chen Yifan, Hazarika Devamanyu, Namazifar Mahdi, Liu Yang, Jin Di, Hakkani-tur Dilek
- Adaprompt: Adaptive Model Training For Prompt-based NLP Chen Yulong, Liu Yang, Dong Li, Wang Shuohang, Zhu Chenguang, Zeng Michael, Zhang Yue
- Capture Salient Historical Information: A Fast And Accurate Non-autoregressive Model For Multi-turn Spoken Language Understanding Cheng Lizhi, Jia Weijia, Yang Wenmian
- Is Multiwoz A Solved Task? An Interactive TOD Evaluation Framework With User Simulator Cheng Qinyuan, Li Linyang, Quan Guofeng, Gao Feng, Mou Xiaofeng, Qiu Xipeng
- Neural Machine Translation With Contrastive Translation Memories Cheng Xin, Gao Shen, Liu Lemao, Zhao Dongyan, Yan Rui
- Meta-learning Fast Weight Language Models Clark Kevin, Guu Kelvin, Chang Ming-wei, Pasupat Panupong, Hinton Geoffrey, Norouzi Mohammad
- LSG Attention: Extrapolation Of Pretrained Transformers To Long Sequences Condevaux Charles, Harispe Sébastien
- The Future Is Different: Large Pre-trained Language Models Fail In Prediction Tasks Cvejoski Kostadin, Sánchez Ramsés J., Ojeda César
- Why Can GPT Learn In-context? Language Models Implicitly Perform Gradient Descent As Meta-optimizers Dai Damai, Sun Yutao, Dong Li, Hao Yaru, Ma Shuming, Sui Zhifang, Wei Furu
- Llm.int8(): 8-bit Matrix Multiplication For Transformers At Scale Dettmers Tim, Lewis Mike, Belkada Younes, Zettlemoyer Luke
- Less Is More: Linear Layers On CLIP Features As Powerful Vizwiz Model Deuser Fabian, Habel Konrad, Rösch Philipp J., Oswald Norbert
- Parallel Attention Forcing For Machine Translation Dou Qingyun, Gales Mark
- Coarse-to-fine Vision-language Pre-training With Fusion In The Backbone Dou Zi-yi, Kamath Aishwarya, Gan Zhe, Zhang Pengchuan, Wang Jianfeng, Li Linjie, Liu Zicheng, Liu Ce, Lecun Yann, Peng Nanyun, Gao Jianfeng, Wang Lijuan
- Leveraging Pre-trained Models For Failure Analysis Triplets Generation Ezukwoke Kenneth, Hoayek Anis, Batton-hubert Mireille, Boucher Xavier, Gounet Pascal, Adrian Jerome
- Topic-aware Response Generation In Task-oriented Dialogue With Unstructured Knowledge Access Feng Yue, Lampouras Gerasimos, Iacobacci Ignacio
- Should Attention Be All We Need? The Epistemic And Ethical Implications Of Unification In Machine Learning Fishman Nic, Hancox-li Leif
- Hungry Hungry Hippos: Towards Language Modeling With State Space Models Fu Daniel Y., Dao Tri, Saab Khaled K., Thomas Armin W., Rudra Atri, Ré Christopher
- Checks And Strategies For Enabling Code-switched Machine Translation Gowda Thamme, Gheini Mozhdeh, May Jonathan
- Continuous Decomposition Of Granularity For Neural Paraphrase Generation Gu Xiaodong, Zhang Zhaowei, Lee Sang-woo, Yoo Kang Min, Ha Jung-woo
- Generating Coherent Narratives By Learning Dynamic And Discrete Entity States With A Contrastive Framework Guan Jian, Yang Zhenyu, Zhang Rongsheng, Hu Zhipeng, Huang Minlie
- Optimal Transport For Unsupervised Hallucination Detection In Neural Machine Translation Guerreiro Nuno M., Colombo Pierre, Piantanida Pablo, Martins André F. T.
- Looking For A Needle In A Haystack: A Comprehensive Study Of Hallucinations In Neural Machine Translation Guerreiro Nuno M., Voita Elena, Martins André F. T.
- Unixcoder: Unified Cross-modal Pre-training For Code Representation Guo Daya, Lu Shuai, Duan Nan, Wang Yanlin, Zhou Ming, Yin Jian
- LVP-M3: Language-aware Visual Prompt For Multilingual Multimodal Machine Translation Guo Hongcheng, Liu Jiaheng, Huang Haoyang, Yang Jian, Li Zhoujun, Zhang Dongdong, Cui Zheng, Wei Furu
- Supervised Visual Attention For Simultaneous Multimodal Machine Translation Haralampieva Veneta, Caglayan Ozan, Specia Lucia
- How Much Does Attention Actually Attend? Questioning The Importance Of Attention In Pretrained Transformers Hassid Michael, Peng Hao, Rotem Daniel, Kasai Jungo, Montero Ivan, Smith Noah A., Schwartz Roy
- Transformer Language Models Without Positional Encodings Still Learn Positional Information Haviv Adi, Ram Ori, Press Ofir, Izsak Peter, Levy Omer
- Z-code++: A Pre-trained Language Model Optimized For Abstractive Summarization He Pengcheng, Peng Baolin, Lu Liyang, Wang Song, Mei Jie, Liu Yang, Xu Ruochen, Awadalla Hany Hassan, Shi Yu, Zhu Chenguang, Xiong Wayne, Zeng Michael, Gao Jianfeng, Huang Xuedong
- VLMAE: Vision-language Masked Autoencoder He Sunan, Guo Taian, Dai Tao, Qiao Ruizhi, Wu Chen, Shu Xiujun, Ren Bo
- Hyperprompt: Prompt-based Task-conditioning Of Transformers He Yun, Zheng Huaixiu Steven, Tay Yi, Gupta Jai, Du Yu, Aribandi Vamsi, Zhao Zhe, Li Yaguang, Chen Zhao, Metzler Donald, Cheng Heng-tze, Chi Ed H.
- Robust Dialogue State Tracking With Weak Supervision And Sparse Data Heck Michael, Lubis Nurul, Van Niekerk Carel, Feng Shutong, Geishauser Christian, Lin Hsien-chin, Gašić Milica
- Shapley Head Pruning: Identifying And Removing Interference In Multilingual Transformers Held William, Yang Diyi
- Prompt-to-prompt Image Editing With Cross Attention Control Hertz Amir, Mokady Ron, Tenenbaum Jay, Aberman Kfir, Pritch Yael, Cohen-or Daniel
- TRUE: Re-evaluating Factual Consistency Evaluation Honovich Or, Aharoni Roee, Herzig Jonathan, Taitelbaum Hagai, Kukliansy Doron, Cohen Vered, Scialom Thomas, Szpektor Idan, Hassidim Avinatan, Matias Yossi
- Syntax-guided Localized Self-attention By Constituency Syntactic Distance Hou Shengyuan, Kai Jushi, Xue Haotian, Zhu Bingyu, Yuan Bo, Huang Longtao, Wang Xinbing, Lin Zhouhan
- Unified Discrete Diffusion For Simultaneous Vision-language Generation Hu Minghui, Zheng Chuanxia, Zheng Heliang, Cham Tat-jen, Wang Chaoyue, Yang Zuopeng, Tao Dacheng, Suganthan Ponnuthurai N.
- PLANET: Dynamic Content Planning In Autoregressive Transformers For Long-form Text Generation Hu Zhe, Chan Hou Pong, Liu Jiachen, Xiao Xinyan, Wu Hua, Huang Lifu
- Transformer Quality In Linear Time Hua Weizhe, Dai Zihang, Liu Hanxiao, Le Quoc V.
- Personalized Dialogue Generation With Persona-adaptive Attention Huang Qiushi, Zhang Yu, Ko Tom, Liu Xubo, Wu Bo, Wang Wenwu, Tang Lilian
- FPT: Improving Prompt Tuning Efficiency Via Progressive Training Huang Yufei, Qin Yujia, Wang Huadong, Yin Yichun, Sun Maosong, Liu Zhiyuan, Liu Qun
- Chatgpt Makes Medicine Easy To Swallow: An Exploratory Case Study On Simplified Radiology Reports Jeblick Katharina, Schachtner Balthasar, Dexl Jakob, Mittermeier Andreas, Stüber Anna Theresa, Topalis Johanna, Weber Tobias, Wesp Philipp, Sabel Bastian, Ricke Jens, Ingrisch Michael
- Lamemo: Language Modeling With Look-ahead Memory Ji Haozhe, Zhang Rongsheng, Yang Zhenyu, Hu Zhipeng, Huang Minlie
- RHO (\(\rho\)): Reducing Hallucination In Open-domain Dialogues With Knowledge Grounding Ji Ziwei, Liu Zihan, Lee Nayeon, Yu Tiezheng, Wilie Bryan, Zeng Min, Fung Pascale
- Pseudo-q: Generating Pseudo Language Queries For Visual Grounding Jiang Haojun, Lin Yuanze, Han Dongchen, Song Shiji, Huang Gao
- Informative Language Representation Learning For Massively Multilingual Neural Machine Translation Jin Renren, Xiong Deyi
- Latency Adjustable Transformer Encoder For Language Understanding Kachuee Sajjad, Sharifkhani Mohammad
- Gpt-neo For Commonsense Reasoning -- A Theoretical And Practical Lens Kashyap Rohan, Kashyap Vivek, P. Narendra C.
- Fie: Building A Global Probability Space By Leveraging Early Fusion In Encoder For Open-domain Question Answering Kedia Akhil, Zaidi Mohd Abbas, Lee Haejun
- Natural Language To Code Using Transformers Kusupati Uday, Ailavarapu Venkata Ravi Teja
- Internet-augmented Language Models Through Few-shot Prompting For Open-domain Question Answering Lazaridou Angeliki, Gribovskaya Elena, Stokowiec Wojciech, Grigorev Nikolai
- Guiding Visual Question Answering With Attention Priors Le Thao Minh, Le Vuong, Gupta Sunil, Venkatesh Svetha, Tran Truyen
- Hard Gate Knowledge Distillation -- Leverage Calibration For Robust And Reliable Language Model Lee Dongkyu, Tian Zhiliang, Zhao Yingxiu, Cheung Ka Chun, Zhang Nevin L.
- Littlebird: Efficient Faster & Longer Transformer For Question Answering Lee Minchul, Han Kijong, Shin Myeong Cheol
- A Survey On Retrieval-augmented Text Generation Li Huayang, Su Yixuan, Cai Deng, Wang Yan, Liu Lemao
- Fine-grained Semantically Aligned Vision-language Pre-training Li Juncheng, He Xin, Wei Longhui, Qian Long, Zhu Linchao, Xie Lingxi, Zhuang Yueting, Tian Qi, Tang Siliang
- Learning To Transfer Prompts For Text Generation Li Junyi, Tang Tianyi, Nie Jian-yun, Wen Ji-rong, Zhao Wayne Xin
- Evade The Trap Of Mediocrity: Promoting Diversity And Novelty In Text Generation Via Concentrating Attention Li Wenhao, Yi Xiaoyuan, Hu Jinyi, Sun Maosong, Xie Xing
- P-transformer: Towards Better Document-to-document Neural Machine Translation Li Yachao, Li Junhui, Jiang Jing, Tao Shimin, Yang Hao, Zhang Min
- VALHALLA: Visual Hallucination For Machine Translation Li Yi, Panda Rameswar, Kim Yoon, Chen Chun-fu, Feris Rogerio, Cox David, Vasconcelos Nuno
- Exploring Visual Interpretability For Contrastive Language-image Pre-training Li Yi, Wang Hualiang, Duan Yiqun, Xu Hang, Li Xiaomeng
- Clinical-longformer And Clinical-bigbird: Transformers For Long Clinical Sequences Li Yikuan, Wehbe Ramsey M., Ahmad Faraz S., Wang Hanyin, Luo Yuan
- Accelerating Attention Through Gradient-based Learned Runtime Pruning Li Zheng, Ghodrati Soroush, Yazdanbakhsh Amir, Esmaeilzadeh Hadi, Kang Mingu
- Vision-language Pre-training For Multimodal Aspect-based Sentiment Analysis Ling Yan, Yu Jianfei, Xia Rui
- A Simple Meta-learning Paradigm For Zero-shot Intent Classification With Mixture Attention Mechanism Liu Han, Zhao Siyang, Zhang Xiaotong, Zhang Feng, Sun Junjie, Yu Hong, Zhang Xianchao
- Towards Better Few-shot And Finetuning Performance With Forgetful Causal Language Models Liu Hao, Geng Xinyang, Lee Lisa, Mordatch Igor, Levine Sergey, Narang Sharan, Abbeel Pieter
- A Generative User Simulator With Gpt-based Architecture And Goal State Tracking For Reinforced Multi-domain Dialog Systems Liu Hong, Cai Yucheng, Ou Zhijian, Huang Yi, Feng Junlan
- Fl-tuning: Layer Tuning For Feed-forward Network In Transformer Liu Jingping, Song Yuqiu, Xue Kui, Sun Hongli, Wang Chao, Chen Lihan, Jiang Haiyun, Liang Jiaqing, Ruan Tong
- PSP: Pre-trained Soft Prompts For Few-shot Abstractive Summarization Liu Xiaochen, Gao Yang, Bai Yu, Li Jiawei, Hu Yinan, Huang Heyan, Chen Boxing
- Dialogconv: A Lightweight Fully Convolutional Network For Multi-view Response Selection Liu Yongkang, Feng Shi, Gao Wei, Wang Daling, Zhang Yifei
- Relaxed Attention For Transformer Models Lohrenz Timo, Möller Björn, Li Zhengyang, Fingscheidt Tim
- Choose Your QA Model Wisely: A Systematic Study Of Generative And Extractive Readers For Question Answering Luo Man, Hashimoto Kazuma, Yavuz Semih, Liu Zhiwei, Baral Chitta, Zhou Yingbo
- Biogpt: Generative Pre-trained Transformer For Biomedical Text Generation And Mining Luo Renqian, Sun Liai, Xia Yingce, Qin Tao, Zhang Sheng, Poon Hoifung, Liu Tie-yan
- I-tuning: Tuning Frozen Language Models With Image For Lightweight Image Captioning Luo Ziyang, Hu Zhipeng, Xi Yadong, Zhang Rongsheng, Ma Jing
- A Frustratingly Simple Approach For End-to-end Image Captioning Luo Ziyang, Xi Yadong, Zhang Rongsheng, Ma Jing
- Unveiling Code Pre-trained Models: Investigating Syntax And Semantics Capacities Ma Wei, Liu Shangqing, Zhao Mengjie, Xie Xiaofei, Wang Wenhan, Hu Qiang, Zhang Jie, Liu Yang
- DIALOG-22 Ruatd Generated Text Detection Maloyan Narek, Nutfullin Bulat, Ilyushin Eugene
- Fine-tuning Pre-trained Transformers Into Decaying Fast Weights Mao Huanru Henry
- The Devil Is In The Details: On Models And Training Regimes For Few-shot Intent Classification Mesgar Mohsen, Tran Thy Thy, Glavas Goran, Gurevych Iryna
- Do Vision-and-language Transformers Learn Grounded Predicate-noun Dependencies? Nikolaus Mitja, Salin Emmanuelle, Ayache Stephane, Fourtassi Abdellah, Favre Benoit
- Follow-up Attention: An Empirical Study Of Developer And Neural Model Code Exploration Paltenghi Matteo, Pandita Rahul, Henley Austin Z., Ziegler Albert
- Relation-aware Language-graph Transformer For Question Answering Park Jinyoung, Choi Hyeong Kyu, Ko Juyeon, Park Hyeonjin, Kim Ji-hoon, Jeong Jisu, Kim Kyungmin, Kim Hyunwoo J.
- Sparse And Dense Approaches For The Full-rank Retrieval Of Responses For Dialogues Penha Gustavo, Hauff Claudia
- Outliers Dimensions That Disrupt Transformers Are Driven By Frequency Puccetti Giovanni, Rogers Anna, Drozd Aleksandr, Dell'orletta Felice
- The NLP Task Effectiveness Of Long-range Transformers Qin Guanghui, Feng Yukun, Van Durme Benjamin
- Memory In Humans And Deep Language Models: Linking Hypotheses For Model Augmentation Raccah Omri, Chen Phoebe, Willke Ted L., Poeppel David, Vo Vy A.
- Parallel Context Windows For Large Language Models Ratner Nir, Levine Yoav, Belinkov Yonatan, Ram Ori, Magar Inbal, Abend Omri, Karpas Ehud, Shashua Amnon, Leyton-brown Kevin, Shoham Yoav
- Entity-conditioned Question Generation For Robust Attention Distribution In Neural Information Retrieval Reddy Revanth Gangi, Sultan Md Arafat, Franz Martin, Sil Avirup, Ji Heng
- Cometkiwi: Ist-unbabel 2022 Submission For The Quality Estimation Shared Task Rei Ricardo, Treviso Marcos, Guerreiro Nuno M., Zerva Chrysoula, Farinha Ana C., Maroti Christine, De Souza José G. C., Glushkova Taisiya, Alves Duarte M., Lavie Alon, Coheur Luisa, Martins André F. T.
- An Understanding-oriented Robust Machine Reading Comprehension Model Ren Feiliang, Liu Yongkang, Li Bochao, Liu Shilei, Wang Bingchao, Wang Jiaqi, Liu Chunchao, Ma Qi
- Exploring Extreme Parameter Compression For Pre-trained Language Models Ren Yuxin, Wang Benyou, Shang Lifeng, Jiang Xin, Liu Qun
- Dialokg: Knowledge-structure Aware Task-oriented Dialogue Generation Rony Md Rashad Al Hasan, Usbeck Ricardo, Lehmann Jens
- Transformer Grammars: Augmenting Transformer Language Models With Syntactic Inductive Biases At Scale Sartran Laurent, Barrett Samuel, Kuncoro Adhiguna, Stanojević Miloš, Blunsom Phil, Dyer Chris
- Bird-eye Transformers For Text Generation Models Sha Lei, Song Yuhang, Yordanov Yordan, Salvatori Tommaso, Lukasiewicz Thomas
- Overcoming Catastrophic Forgetting Beyond Continual Learning: Balanced Training For Neural Machine Translation Shao Chenze, Feng Yang
- Mgpt: Few-shot Learners Go Multilingual Shliazhko Oleh, Fenogenova Alena, Tikhonova Maria, Mikhailov Vladislav, Kozlova Anastasia, Shavrina Tatiana
- On The Efficacy Of Co-attention Transformer Layers In Visual Question Answering Sikarwar Ankur, Kreiman Gabriel
- Investigating Explainability Of Generative AI For Code Through Scenario-based Design Sun Jiao, Liao Q. Vera, Muller Michael, Agarwal Mayank, Houde Stephanie, Talamadupula Kartik, Weisz Justin D.
- Tokenization Consistency Matters For Generative Models On Extractive NLP Tasks Sun Kaiser, Qi Peng, Zhang Yuhao, Liu Lan, Wang William Yang, Huang Zhiheng
- Long-form Video-language Pre-training With Multimodal Temporal Contrastive Learning Sun Yuchong, Xue Hongwei, Song Ruihua, Liu Bei, Yang Huan, Fu Jianlong
- A Length-extrapolatable Transformer Sun Yutao, Dong Li, Patra Barun, Ma Shuming, Huang Shaohan, Benhaim Alon, Chaudhary Vishrav, Song Xia, Wei Furu
- Pseudo-ood Training For Robust Language Models Sundararaman Dhanasekar, Mehta Nikhil, Carin Lawrence
- ACORT: A Compact Object Relation Transformer For Parameter Efficient Image Captioning Tan Jia Huei, Tan Ying Hua, Chan Chee Seng, Chuah Joon Huang
- Etrica: Event-triggered Context-aware Story Generation Augmented By Cross Attention Tang Chen, Lin Chenghua, Huang Henglin, Guerin Frank, Zhang Zhihao
- Adavae: Exploring Adaptive Gpt-2s In Variational Auto-encoders For Language Modeling Tu Haoqin, Yang Zhongliang, Yang Jinshuai, Huang Yongfeng
- Towards Reasoning-aware Explainable VQA Vaideeswaran Rakesh, Gao Feng, Mathur Abhinav, Thattai Govind
- What Do They Capture? -- A Structural Analysis Of Pre-trained Language Models For Source Code Wan Yao, Zhao Wei, Zhang Hongyu, Sui Yulei, Xu Guandong, Jin Hai
- Unite: Unified Translation Evaluation Wan Yu, Liu Dayiheng, Yang Baosong, Zhang Haibo, Chen Boxing, Wong Derek F., Chao Lidia S.
- Knowledgeable Salient Span Mask For Enhancing Language Models As Knowledge Base Wang Cunxiang, Luo Fuli, Li Yanyang, Xu Runxin, Huang Fei, Zhang Yue
- Knowledge Prompting In Pre-trained Language Model For Natural Language Understanding Wang Jianing, Huang Wenkang, Shi Qiuhui, Wang Hongbin, Qiu Minghui, Li Xiang, Gao Ming
- Lilt: A Simple Yet Effective Language-independent Layout Transformer For Structured Document Understanding Wang Jiapeng, Jin Lianwen, Ding Kai
- Rethinking Textual Adversarial Defense For Pre-trained Language Models Wang Jiayi, Bao Rongzhou, Zhang Zhuosheng, Zhao Hai
- Pretraining Without Attention Wang Junxiong, Yan Jing Nathan, Gu Albert, Rush Alexander M.
- Zero-shot Image Captioning By Anchor-augmented Vision-language Space Alignment Wang Junyang, Zhang Yi, Yan Ming, Zhang Ji, Sang Jitao
- Pay Attention To Your Tone: Introducing A New Dataset For Polite Language Rewrite Wang Xun, Ge Tao, Mao Allen, Li Yuki, Wei Furu, Chen Si-qing
- Controlling Styles In Neural Machine Translation With Activation Prompt Wang Yifan, Sun Zewei, Cheng Shanbo, Zheng Weiguo, Wang Mingxuan
- Unified Multimodal Model With Unlikelihood Training For Visual Dialog Wang Zihao, Wang Junli, Jiang Changjun
- EGCR: Explanation Generation For Conversational Recommendation Wen Bingbing, Bu Xiaoning, Shah Chirag
- Generative Or Contrastive? Phrase Reconstruction For Better Sentence Representation Learning Wu Bohong, Zhao Hai
- Sentence Representation Learning With Generative Objective Rather Than Contrastive Objective Wu Bohong, Zhao Hai
- Adversarial Self-attention For Language Understanding Wu Hongqiu, Ding Ruixue, Zhao Hai, Xie Pengjun, Huang Fei, Zhang Min
- Modeling Context With Linear Attention For Scalable Document-level Translation Wu Zhaofeng, Peng Hao, Pappas Nikolaos, Smith Noah A.
- Enhancing Pre-trained Models With Text Structure Knowledge For Question Generation Wu Zichen, Jia Xin, Qu Fanyi, Wu Yunfang
- Dual Modality Prompt Tuning For Vision-language Pre-trained Model Xing Yinghui, Wu Qirui, Cheng De, Zhang Shizhou, Liang Guoqiang, Wang Peng, Zhang Yanning
- Evaluating And Improving Context Attention Distribution On Multi-turn Response Generation Using Self-contained Distractions Xing Yujie, Gulla Jon Atle
- MGA-VQA: Multi-granularity Alignment For Visual Question Answering Xiong Peixi, Shen Yilin, Jin Hongxia
- Adapting Pretrained Text-to-text Models For Long Text Sequences Xiong Wenhan, Gupta Anchit, Toshniwal Shubham, Mehdad Yashar, Yih Wen-tau
- COSPLAY: Concept Set Guided Personalized Dialogue Generation Across Both Party Personas Xu Chen, Li Piji, Wang Wei, Yang Haoran, Wang Siyun, Xiao Chuangbai
- Evaluating Parameter Efficient Learning For Generation Xu Peng, Patwary Mostofa, Prabhumoye Shrimai, Adams Virginia, Prenger Ryan J., Ping Wei, Lee Nayeon, Shoeybi Mohammad, Catanzaro Bryan
- Probing Causes Of Hallucinations In Neural Machine Translations Yan Jianhao, Meng Fandong, Zhou Jie
- Generative Negative Text Replay For Continual Vision-language Pretraining Yan Shipeng, Hong Lanqing, Xu Hang, Han Jianhua, Tuytelaars Tinne, Li Zhenguo, He Xuming
- Tableformer: Robust Transformer Modeling For Table-text Encoding Yang Jingfeng, Gupta Aditya, Upadhyay Shyam, He Luheng, Goel Rahul, Paul Shachi
- AD-DROP: Attribution-driven Dropout For Robust Language Model Fine-tuning Yang Tao, Deng Jinghao, Quan Xiaojun, Wang Qifan, Nie Shaoliang
- Zeroquant: Efficient And Affordable Post-training Quantization For Large-scale Transformers Yao Zhewei, Aminabadi Reza Yazdani, Zhang Minjia, Wu Xiaoxia, Li Conglong, He Yuxiong
- Language Models Are General-purpose Interfaces Hao Yaru, Song Haoyu, Dong Li, Huang Shaohan, Chi Zewen, Wang Wenhui, Ma Shuming, Wei Furu
- Structured Prompting: Scaling In-context Learning To 1,000 Examples Hao Yaru, Sun Yutao, Dong Li, Han Zhixiong, Gu Yuxian, Wei Furu
- Alzheimer's Diagnosis And Generation-based Chatbot Using Hierarchical Attention And Transformer Yeong Park Jun, Jong Shin Su, Hwan Choi Chang, Jae Lee Jung, Sang-il Choi
- Efficient Language Modeling With Sparse All-mlp Yu Ping, Artetxe Mikel, Ott Myle, Shleifer Sam, Gong Hongyu, Stoyanov Ves, Li Xian
- Tri-attention: Explicit Context-aware Attention Mechanism For Natural Language Processing Yu Rui, Li Yifeng, Lu Wenpeng, Cao Longbing
- Can Pretrained Language Models (yet) Reason Deductively? Yuan Zhangdie, Hu Songbo, Vulić Ivan, Korhonen Anna, Meng Zaiqiao
- Dyrex: Dynamic Query Representation For Extractive Question Answering Zaratiana Urchade, Khbir Niama El, Núñez Dennis, Holat Pierre, Tomeh Nadi, Charnois Thierry
- Better Pre-training By Reducing Representation Confusion Zhang Haojie, Liang Mingfei, Xie Ruobing, Sun Zhenlong, Zhang Bo, Lin Leyu
- SMDT: Selective Memory-augmented Neural Document Translation Zhang Xu, Yang Jian, Huang Haoyang, Ma Shuming, Zhang Dongdong, Li Jinlong, Wei Furu
- Unveiling Transformers With LEGO: A Synthetic Reasoning Task Zhang Yi, Backurs Arturs, Bubeck Sébastien, Eldan Ronen, Gunasekar Suriya, Wagner Tal
- Attend, Memorize And Generate: Towards Faithful Table-to-text Generation In Few Shots Zhao Wenting, Liu Ye, Wan Yao, Yu Philip S.
- Lite Unified Modeling For Discriminative Reading Comprehension Zhao Yilin, Zhao Hai, Shen Libin, Zhao Yinggong
- Medical Dialogue Response Generation With Pivotal Information Recalling Zhao Yu, Li Yunxin, Wu Yuxiang, Hu Baotian, Chen Qingcai, Wang Xiaolong, Ding Yuxin, Zhang Min
- Predicting Fine-tuning Performance With Probing Zhu Zining, Shahtalebi Soroosh, Rudzicz Frank
- Future Sight: Dynamic Story Generation With Large Pretrained Language Models Zimmerman Brian D., Sahu Gaurav, Vechtomova Olga
- Efficient Long Sequence Modeling Via State Space Augmented Transformer Zuo Simiao, Liu Xiaodong, Jiao Jian, Charles Denis, Manavoglu Eren, Zhao Tuo, Gao Jianfeng
- Attention Over Pre-trained Sentence Embeddings For Long Document Classification Abdaoui Amine, Dutta Sourav
- VD-GR: Boosting Visual Dialog With Cascaded Spatial-temporal Multi-modal Graphs Abdessaied Adnen, Shi Lei, Bulling Andreas
- Beyond Traditional Teaching: The Potential Of Large Language Models And Chatbots In Graduate Engineering Education Abedi Mahyar, Alshybani Ibrahem, Shahadat Muhammad Rubayat Bin, Murillo Michael S.
- Position Interpolation Improves Alibi Extrapolation Al-khateeb Faisal, Dey Nolan, Soboleva Daria, Hestness Joel
- Mamba: Linear-time Sequence Modeling With Selective State Spaces Gu Albert, Dao Tri
- Mistral 7B Albert Q. Jiang, Alexandre Sablayrolles, Arthur Mensch, Chris Bamford, Devendra Singh Chaplot, Diego De Las Casas, Florian Bressand, Gianna Lengyel, Guillaume Lample, Lucile Saulnier, Lélio Renard Lavaud, Marie-anne Lachaux, Pierre Stock, Teven Le Scao, Thibaut Lavril, Thomas Wang, Timothée Lacroix, William El Sayed
- Pixt3: Pixel-based Table-to-text Generation Alonso Iñigo, Agirre Eneko, Lapata Mirella
- Does Deep Learning Learn To Abstract? A Systematic Probing Framework An Shengnan, Lin Zeqi, Chen Bei, Fu Qiang, Zheng Nanning, Lou Jian-guang
- Randomized Positional Encodings Boost Length Generalization Of Transformers Ruoss Anian, Delétang Grégoire, Genewein Tim, Grau-moya Jordi, Csordás Róbert, Bennani Mehdi, Legg Shane, Veness Joel
- AI And Education: An Investigation Into The Use Of Chatgpt For Systems Thinking Arndt Holger
- Zoology: Measuring And Improving Recall In Efficient Language Models Arora Simran, Eyuboglu Sabri, Timalsina Aman, Johnson Isys, Poli Michael, Zou James, Rudra Atri, Ré Christopher
- Factuality Challenges In The Era Of Large Language Models Augenstein Isabelle, Baldwin Timothy, Cha Meeyoung, Chakraborty Tanmoy, Ciampaglia Giovanni Luca, Corney David, Diresta Renee, Ferrara Emilio, Hale Scott, Halevy Alon, Hovy Eduard, Ji Heng, Menczer Filippo, Miguez Ruben, Nakov Preslav, Scheufele Dietram, Sharma Shivam, Zagni Giovanni
- Evaluating Chatgpt As A Question Answering System: A Comprehensive Analysis And Comparison With Existing Models Bahak Hossein, Taheri Farzaneh, Zojaji Zahra, Kazemi Arefeh
- Separate-and-enhance: Compositional Finetuning For Text2image Diffusion Models Bao Zhipeng, Li Yijun, Singh Krishna Kumar, Wang Yu-xiong, Hebert Martial
- Testing The Depth Of Chatgpt's Comprehension Via Cross-modal Tasks Based On Ascii-art: Gpt3.5's Abilities In Regard To Recognizing And Generating Ascii-art Are Not Totally Lacking Bayani David
- Span-selective Linear Attention Transformers For Effective And Robust Schema-guided Dialogue State Tracking Bebensee Björn, Lee Haejun
- AQUALLM: Audio Question Answering Data Generation Using Large Language Models Behera Swarup Ranjan, Injeti Krishna Mohan, Patibandla Jaya Sai Kiran, Pokala Praveen Kumar, Pailla Balakrishna Reddy
- Loquacity And Visible Emotion: Chatgpt As A Policy Advisor Biancotti Claudia, Camassa Carolina
- How Should The Advent Of Large Language Models Affect The Practice Of Science? Binz Marcel, Alaniz Stephan, Roskies Adina, Aczel Balazs, Bergstrom Carl T., Allen Colin, Schad Daniel, Wulff Dirk, West Jevin D., Zhang Qiong, Shiffrin Richard M., Gershman Samuel J., Popov Ven, Bender Emily M., Marelli Marco, Botvinick Matthew M., Akata Zeynep, Schulz Eric
- Distractor Generation For Multiple-choice Questions With Predictive Prompting And Large Language Models Bitew Semere Kiros, Deleu Johannes, Develder Chris, Demeester Thomas
- How Close Is Chatgpt To Human Experts? Comparison Corpus, Evaluation, And Detection Guo Biyang, Zhang Xin, Wang Ziyuan, Jiang Minqi, Nie Jinran, Ding Yuxuan, Yue Jianwei, Wu Yupeng
- RWKV: Reinventing Rnns For The Transformer Era Bo Peng, Eric Alcaide, Quentin Anthony, Alon Albalak, Samuel Arcadinho, Stella Biderman, Huanqi Cao, Xin Cheng, Michael Chung, Matteo Grella, Kranthi Kiran Gv, Xuzheng He, Haowen Hou, Jiaju Lin, Przemyslaw Kazienko, Jan Kocon, Jiaming Kong, Bartlomiej Koptyra, Hayden Lau, Krishna Sri Ipsit Mantri, Ferdinand Mom, Atsushi Saito, Guangyu Song, Xiangru Tang, Bolun Wang, Johan S. Wind, Stanislaw Wozniak, Ruichong Zhang, Zhenyuan Zhang, Qihang Zhao, Peng Zhou, Qinghua Zhou, Jian Zhu, Rui-jie Zhu
- A Categorical Archive Of Chatgpt Failures Borji Ali
- Soundstorm: Efficient Parallel Audio Generation Borsos Zalán, Sharifi Matt, Vincent Damien, Kharitonov Eugene, Zeghidour Neil, Tagliasacchi Marco
- Eight Things To Know About Large Language Models Bowman Samuel R.
- Efficient Transformer Knowledge Distillation: A Performance Review Brown Nathan, Williamson Ashton, Anderson Tahj, Lawrence Logan
- Finding The Needle In A Haystack: Unsupervised Rationale Extraction From Long Text Classifiers Bujel Kamil, Caines Andrew, Yannakoudakis Helen, Rei Marek
- Scaling In-context Demonstrations With Structured Attention Cai Tianle, Huang Kaixuan, Lee Jason D., Wang Mengdi
- CLAP: Isolating Content From Style Through Contrastive Learning With Augmented Prompts Cai Yichao, Liu Yuhang, Zhang Zhen, Shi Javen Qinfeng
- On The Application Of Large Language Models For Language Teaching And Assessment Technology Caines Andrew, Benedetto Luca, Taslimipoor Shiva, Davis Christopher, Gao Yuan, Andersen Oeistein, Yuan Zheng, Elliott Mark, Moore Russell, Bryant Christopher, Rei Marek, Yannakoudakis Helen, Mullooly Andrew, Nicholls Diane, Buttery Paula
- Localizing Lying In Llama: Understanding Instructed Dishonesty On True-false Questions Through Prompting, Probing, And Patching Campbell James, Ren Richard, Guo Phillip
- The Life Cycle Of Knowledge In Big Language Models: A Survey Cao Boxi, Lin Hongyu, Han Xianpei, Sun Le
- Are Large Language Models Good Fact Checkers: A Preliminary Study Cao Han, Wei Lingwei, Chen Mengyang, Zhou Wei, Hu Songlin
- Promoting Generalized Cross-lingual Question Answering In Few-resource Scenarios Via Self-knowledge Distillation Carrino Casimiro Pio, Escolano Carlos, Fonollosa José A. R.
- Exploring Answer Information Methods For Question Generation With Transformers Chafekar Talha, Hussain Aafiya, Sharma Grishma, Sharma Deepak
- Language Model Behavior: A Comprehensive Survey Chang Tyler A., Bergen Benjamin K.
- Sudden Drops In The Loss: Syntax Acquisition, Phase Transitions, And Simplicity Bias In Mlms Chen Angelica, Shwartz-ziv Ravid, Cho Kyunghyun, Leavitt Matthew L., Saphra Naomi
- Dual-space Hierarchical Learning For Goal-guided Conversational Recommendation Chen Can, Liu Hao, Liu Zeming, Liu Xue, Dou Dejing
- Maybe Only 0.5% Data Is Needed: A Preliminary Exploration Of Low Training Data Instruction Tuning Chen Hao, Zhang Yiming, Zhang Qi, Yang Hantao, Hu Xiaomeng, Ma Xuetao, Yanggong Yifan, Zhao Junbo
- Walking Down The Memory Maze: Beyond Context Limit Through Interactive Reading Chen Howard, Pasunuru Ramakanth, Weston Jason, Celikyilmaz Asli
- Natural Response Generation For Chinese Reading Comprehension Chen Nuo, Li Hongguang, Bao Yinan, Wang Baoyuan, Li Jia
- Webvln: Vision-and-language Navigation On Websites Chen Qi, Pitawela Dileepa, Zhao Chongyang, Zhou Gengze, Chen Hsiang-ting, Wu Qi
- Extending Context Window Of Large Language Models Via Positional Interpolation Chen Shouyuan, Wong Sherman, Chen Liangjian, Tian Yuandong
- Understanding And Improving In-context Learning On Vision-language Models Chen Shuo, Han Zhen, He Bailan, Buckley Mark, Torr Philip, Tresp Volker, Gu Jindong
- Llm-empowered Chatbots For Psychiatrist And Patient Simulation: Application And Evaluation Chen Siyuan, Wu Mengyue, Zhu Kenny Q., Lan Kunyao, Zhang Zhiling, Cui Lyuchun
- Enhancing Robot Program Synthesis Through Environmental Context Chen Tianyi, Wang Qidi, Dong Zhen, Shen Liwei, Peng Xin
- Reason Out Your Layout: Evoking The Layout Master From Large Language Models For Text-to-image Synthesis Chen Xiaohui, Liu Yongfei, Yang Yingxiang, Yuan Jianbo, You Quanzeng, Liu Li-ping, Yang Hongxia
- Fortify The Shortest Stave In Attention: Enhancing Context Awareness Of Large Language Models For Effective Tool Use Chen Yuhan, Lv Ang, Lin Ting-en, Chen Changyu, Wu Yuchuan, Huang Fei, Li Yongbin, Yan Rui
- Confidant: Customizing Transformer-based Llms Via Collaborative Edge Training Chen Yuhao, Yan Yuxuan, Yang Qianqian, Shu Yuanchao, He Shibo, Chen Jiming
- Lmexplainer: Grounding Knowledge And Explaining Language Models Chen Zichen, Chen Jianda, Chen Yuanyuan, Yu Han, Singh Ambuj K, Sra Misha
- Xplainllm: A QA Explanation Dataset For Understanding LLM Decision-making Chen Zichen, Chen Jianda, Gaidhani Mitali, Singh Ambuj, Sra Misha
- Latent Positional Information Is In The Self-attention Variance Of Transformer Language Models Without Positional Embeddings Chi Ta-chung, Fan Ting-han, Chen Li-wei, Rudnicky Alexander I., Ramadge Peter J.
- Attention Alignment And Flexible Positional Embeddings Improve Transformer Length Extrapolation Chi Ta-chung, Fan Ting-han, Rudnicky Alexander I.
- Transformer Working Memory Enables Regular Language Reasoning And Natural Language Length Extrapolation Chi Ta-chung, Fan Ting-han, Rudnicky Alexander I., Ramadge Peter J.
- An Integrative Survey On Mental Health Conversational Agents To Bridge Computer Science And Medical Perspectives Cho Young Min, Rai Sunny, Ungar Lyle, Sedoc João, Guntuku Sharath Chandra
- Generative Data Augmentation Using Llms Improves Distributional Robustness In Question Answering Chowdhury Arijit Ghosh, Chadha Aman
- Apollo: Unified Adapter And Prompt Learning For Vision Language Models Chowdhury Sanjoy, Nag Sayan, Manocha Dinesh
- Navigate Through Enigmatic Labyrinth A Survey Of Chain Of Thought Reasoning: Advances, Frontiers And Future Chu Zheng, Chen Jingchang, Chen Qianglong, Yu Weijiang, He Tao, Wang Haotian, Peng Weihua, Liu Ming, Qin Bing, Liu Ting
- Strong And Efficient Baselines For Open Domain Conversational Question Answering Coman Andrei C., Barlacchi Gianni, De Gispert Adrià
- Seamless: Multilingual Expressive And Streaming Speech Translation Communication Seamless, Barrault Loïc, Chung Yu-an, Meglioli Mariano Coria, Dale David, Dong Ning, Duppenthaler Mark, Duquenne Paul-ambroise, Ellis Brian, Elsahar Hady, Haaheim Justin, Hoffman John, Hwang Min-jae, Inaguma Hirofumi, Klaiber Christopher, Kulikov Ilia, Li Pengwei, Licht Daniel, Maillard Jean, Mavlyutov Ruslan, Rakotoarison Alice, Sadagopan Kaushik Ram, Ramakrishnan Abinesh, Tran Tuan, Wenzek Guillaume, Yang Yilin, Ye Ethan, Evtimov Ivan, Fernandez Pierre, Gao Cynthia, Hansanti Prangthip, Kalbassi Elahe, Kallet Amanda, Kozhevnikov Artyom, Gonzalez Gabriel Mejia, Roman Robin San, Touret Christophe, Wong Corinne, Wood Carleigh, Yu Bokai, Andrews Pierre, Balioglu Can, Chen Peng-jen, Costa-jussà Marta R., Elbayad Maha, Gong Hongyu, Guzmán Francisco, Heffernan Kevin, Jain Somya, Kao Justine, Lee Ann, Ma Xutai, Mourachko Alex, Peloquin Benjamin, Pino Juan, Popuri Sravya, Ropers Christophe, Saleem Safiyyah, Schwenk Holger, Sun Anna, Tomasello Paden, Wang Changhan, Wang Jeff, Wang Skyler, Williamson Mary
- Hallucination Is The Last Thing You Need Curran Shawn, Lansley Sam, Bethell Oliver
- Evaluating Prompt-based Question Answering For Object Prediction In The Open Research Knowledge Graph D'souza Jennifer, Hrou Moussab, Auer Sören
- Syntactic Knowledge Via Graph Attention With BERT In Machine Translation Dai Yuqian, Sharoff Serge, De Kamps Marc
- An Adversarial Example For Direct Logit Attribution: Memory Management In Gelu-4l Dao James, Lau Yeu-tong, Rager Can, Janiak Jett
- Flashattention-2: Faster Attention With Better Parallelism And Work Partitioning Dao Tri
- Mask The Bias: Improving Domain-adaptive Generalization Of Ctc-based ASR With Internal Language Model Estimation Das Nilaksh, Sunkara Monica, Bodapati Sravan, Cai Jinglun, Kulshreshtha Devang, Farris Jeff, Kirchhoff Katrin
- Im-promptu: In-context Composition From Image Prompts Dedhia Bhishma, Chang Michael, Snell Jake C., Griffiths Thomas L., Jha Niraj K.
- Divergent Token Metrics: Measuring Degradation To Prune Away LLM Components -- And Optimize Quantization Deiseroth Björn, Meuer Max, Gritsch Nikolas, Eichenberg Constantin, Schramowski Patrick, Aßenmacher Matthias, Kersting Kristian
- Codefuse-13b: A Pretrained Multi-lingual Code Large Language Model Di Peng, Li Jianguo, Yu Hang, Jiang Wei, Cai Wenting, Cao Yang, Chen Chaoyu, Chen Dajun, Chen Hongwei, Chen Liang, Fan Gang, Gong Jie, Gong Zi, Hu Wen, Guo Tingting, Lei Zhichao, Li Ting, Li Zheng, Liang Ming, Liao Cong, Liu Bingchang, Liu Jiachen, Liu Zhiwei, Lu Shaojun, Shen Min, Wang Guangpei, Wang Huan, Wang Zhi, Xu Zhaogui, Yang Jiawei, Ye Qing, Zhang Gehao, Zhang Yu, Zhao Zelin, Zheng Xunjin, Zhou Hailian, Zhu Lifu, Zhu Xianying
- Lmflow: An Extensible Toolkit For Finetuning And Inference Of Large Foundation Models Diao Shizhe, Pan Rui, Dong Hanze, Shum Ka Shun, Zhang Jipeng, Xiong Wei, Zhang Tong
- Jump To Conclusions: Short-cutting Transformers With Linear Transformations Din Alexander Yom, Karidi Taelin, Choshen Leshem, Geva Mor
- Longnet: Scaling Transformers To 1,000,000,000 Tokens Ding Jiayu, Ma Shuming, Dong Li, Zhang Xingxing, Huang Shaohan, Wang Wenhui, Zheng Nanning, Wei Furu
- Modeling What-to-ask And How-to-ask For Answer-unaware Conversational Question Generation Do Xuan Long, Zou Bowei, Joty Shafiq, Tran Anh Tai, Pan Liangming, Chen Nancy F., Aw Ai Ti
- Blockwise Compression Of Transformer-based Models Without Retraining Dong Gaochen, Chen Wei
- The Philosopher's Stone: Trojaning Plugins Of Large Language Models Dong Tian, Xue Minhui, Chen Guoxing, Holland Rayne, Meng Yan, Li Shaofeng, Liu Zhen, Zhu Haojin
- Blending Reward Functions Via Few Expert Demonstrations For Faithful And Accurate Knowledge-grounded Dialogue Generation Du Wanyu, Ji Yangfeng
- Shifting Attention To Relevance: Towards The Predictive Uncertainty Quantification Of Free-form Large Language Models Duan Jinhao, Cheng Hao, Wang Shiqi, Zavalny Alex, Wang Chenan, Xu Renjing, Kailkhura Bhavya, Xu Kaidi
- Bridging The Language Gap: Knowledge Injected Multilingual Question Answering Duan Zhichao, Li Xiuxing, Zhang Zhengyan, Li Zhenyu, Liu Ning, Wang Jianyong
- Stack Attention: Improving The Ability Of Transformers To Model Hierarchical Patterns Dusell Brian, Chiang David
- Tinystories: How Small Can Language Models Be And Still Speak Coherent English? Eldan Ronen, Li Yuanzhi
- Transferable Decoding With Visual Entities For Zero-shot Image Captioning Fei Junjie, Wang Teng, Zhang Jinrui, He Zhenyu, Wang Chengjie, Zheng Feng
- Sequence-to-sequence Pre-training With Unified Modality Masking For Visual Document Understanding Feng Shuwei, Zhan Tianyang, Jie Zhanming, Luong Trung Quoc, Jin Xiaoran
- Should Chatgpt Be Biased? Challenges And Risks Of Bias In Large Language Models Ferrara Emilio
- Grounded Complex Task Segmentation For Conversational Assistants Ferreira Rafael, Semedo David, Magalhães João
- Vision Language Transformers: A Survey Fields Clayton, Kennington Casey
- Breaking The Language Barrier: Improving Cross-lingual Reasoning With Structured Self-attention Foroutan Negar, Banaei Mohammadreza, Aberer Karl, Bosselut Antoine
- Codeapex: A Bilingual Programming Evaluation Benchmark For Large Language Models Fu Lingyue, Chai Huacan, Luo Shuang, Du Kounianhua, Zhang Weiming, Fan Longteng, Lei Jiayi, Rui Renting, Lin Jianghao, Fang Yuchen, Liu Yifan, Wang Jingkuan, Qi Siyuan, Zhang Kangning, Zhang Weinan, Yu Yong
- CLIPAG: Towards Generator-free Text-to-image Generation Ganz Roy, Elad Michael
- Roles Of Scaling And Instruction Tuning In Language Perception: Model Vs. Human Attention Gao Changjiang, Huang Shujian, Li Jixing, Chen Jiajun
- Empower Your Model With Longer And Better Context Comprehension Gao Yifei, Wang Lei, Fang Jun, Hu Longhua, Cheng Jun
- Building Trustworthy Neurosymbolic AI Systems: Consistency, Reliability, Explainability, And Safety Gaur Manas, Sheth Amit
- Model Tells You What To Discard: Adaptive KV Cache Compression For Llms Ge Suyu, Zhang Yunan, Liu Liyuan, Zhang Minjia, Han Jiawei, Gao Jianfeng
- Hiclip: Contrastive Language-image Pretraining With Hierarchy-aware Attention Geng Shijie, Yuan Jianbo, Tian Yu, Chen Yuxiao, Zhang Yongfeng
- Gpt-based Models Meet Simulation: How To Efficiently Use Large-scale Pre-trained Language Models Across Simulation Tasks Giabbanelli Philippe J.
- Resetox: Re-learning Attention Weights For Toxicity Mitigation In Machine Translation Gilabert Javier García, Escolano Carlos, Costa-jussà Marta R.
- Prompt Cache: Modular Attention Reuse For Low-latency Inference Gim In, Chen Guojun, Lee Seung-seob, Sarda Nikhil, Khandelwal Anurag, Zhong Lin
- Generative AI Beyond Llms: System Implications Of Multi-modal Generation Golden Alicia, Hsia Samuel, Sun Fei, Acun Bilge, Hosmer Basil, Lee Yejin, Devito Zachary, Johnson Jeff, Wei Gu-yeon, Brooks David, Wu Carole-jean
- Improving Input-label Mapping With Demonstration Replay For In-context Learning Gong Zhuocheng, Liu Jiahao, Wang Qifan, Wang Jingang, Cai Xunliang, Zhao Dongyan, Yan Rui
- Seer: Language Instructed Video Prediction With Latent Diffusion Models Gu Xianfan, Wen Chuan, Ye Weirui, Song Jiaming, Gao Yang
- Hifi: High-information Attention Heads Hold For Parameter-efficient Model Adaptation Gui Anchun, Xiao Han
- Hiformer: Heterogeneous Feature Interactions Learning With Transformers For Recommender Systems Gui Huan, Wang Ruoxi, Yin Ke, Jin Long, Kula Maciej, Xu Taibai, Hong Lichan, Chi Ed H.
- SPT: Fine-tuning Transformer-based Language Models Efficiently With Sparsification Gui Yuntao, Yan Xiao, Yin Peiqi, Yang Han, Cheng James
- Longcoder: A Long-range Pre-trained Language Model For Code Completion Guo Daya, Xu Canwen, Duan Nan, Yin Jian, Mcauley Julian
- ELIP: Efficient Language-image Pre-training With Fewer Vision Tokens Guo Yangyang, Zhang Haoyu, Wong Yongkang, Nie Liqiang, Kankanhalli Mohan
- Evaluating Large Language Models: A Comprehensive Survey Guo Zishan, Jin Renren, Liu Chuang, Huang Yufei, Shi Dan, Supryadi, Yu Linhao, Liu Yan, Li Jiaxuan, Xiong Bojian, Xiong Deyi
- Gistscore: Learning Better Representations For In-context Example Selection With Gist Bottlenecks Gupta Shivanshu, Rosenbaum Clemens, Elenberg Ethan R.
- A Real-world Webagent With Planning, Long Context Understanding, And Program Synthesis Gur Izzeddin, Furuta Hiroki, Huang Austin, Safdari Mustafa, Matsuo Yutaka, Eck Douglas, Faust Aleksandra
- Seeing Chatgpt Through Students' Eyes: An Analysis Of Tiktok Data Haensch Anna-carolina, Ball Sarah, Herklotz Markus, Kreuter Frauke
- Efficient Parallelization Layouts For Large-scale Distributed Model Training Hagemann Johannes, Weinbach Samuel, Dobler Konstantin, Schall Maximilian, De Melo Gerard
- Lm-infinite: Zero-shot Extreme Length Generalization For Large Language Models Han Chi, Wang Qifan, Peng Hao, Xiong Wenhan, Chen Yu, Ji Heng, Wang Sinong
- Onellm: One Framework To Align All Modalities With Language Han Jiaming, Gong Kaixiong, Zhang Yiyuan, Wang Jiaqi, Zhang Kaipeng, Lin Dahua, Qiao Yu, Gao Peng, Yue Xiangyu
- Imagebind-llm: Multi-modality Instruction Tuning Han Jiaming, Zhang Renrui, Shao Wenqi, Gao Peng, Xu Peng, Xiao Han, Zhang Kaipeng, Liu Chris, Wen Song, Guo Ziyu, Lu Xudong, Ren Shuai, Wen Yafei, Chen Xiaoxin, Yue Xiangyu, Li Hongsheng, Qiao Yu
- Blockwise Parallel Transformer For Large Context Models Hao Liu, Pieter Abbeel
- Vico: Plug-and-play Visual Condition For Personalized Text-to-image Generation Hao Shaozhe, Han Kai, Zhao Shihao, Wong Kwan-yee K.
- Towards Efficient Vision-language Tuning: More Information Density, More Generalizability Hao Tianxiang, Lyu Mengyao, Chen Hui, Zhao Sicheng, Ding Xiaohan, Han Jungong, Ding Guiguang
- Leveraging Large Language Models For Sequential Recommendation Harte Jesse, Zorgdrager Wouter, Louridas Panos, Katsifodimos Asterios, Jannach Dietmar, Fragkoulis Marios
- The Political Ideology Of Conversational AI: Converging Evidence On Chatgpt's Pro-environmental, Left-libertarian Orientation Hartmann Jochen, Schwenzow Jasper, Witte Maximilian
- Never Lost In The Middle: Mastering Long-context Question Answering With Position-agnostic Decompositional Training He Junqing, Pan Kunhao, Dong Xiaoqun, Song Zhuoyang, Liu Yibo, Sun Qianguo, Liang Yuxin, Wang Hao, Zhang Enming, Zhang Jiaxing
- Multi-modal Latent Space Learning For Chain-of-thought Reasoning In Language Models He Liqi, Li Zuchao, Cai Xiantao, Wang Ping
- Multimodal Graph Transformer For Multimodal Question Answering He Xuehai, Wang Xin Eric
- Localized Text-to-image Generation For Free Via Cross Attention Control He Yutong, Salakhutdinov Ruslan, Kolter J. Zico
- Stochastic Parrots Looking For Stochastic Parrots: Llms Are Easy To Fine-tune And Hard To Detect With Other Llms Henrique Da Silva Gameiro, Kucharavy Andrei, Guerraoui Rachid
- Improving Long Context Document-level Machine Translation Herold Christian, Ney Hermann
- Harnessing The Zero-shot Power Of Instruction-tuned Large Language Model In End-to-end Speech Recognition Higuchi Yosuke, Ogawa Tetsuji, Kobayashi Tetsunori
- Do Compressed Llms Forget Knowledge? An Experimental Study With Practical Implications Hoang Duc N. M., Cho Minsik, Merth Thomas, Rastegari Mohammad, Wang Zhangyang
- Direct2v: Large Language Models Are Frame-level Directors For Zero-shot Text-to-video Generation Hong Susung, Seo Junyoung, Shin Heeseong, Hong Sunghwan, Kim Seungryong
- Towards A Mechanistic Interpretation Of Multi-step Reasoning Capabilities Of Language Models Hou Yifan, Li Jiaoda, Fei Yu, Stolfo Alessandro, Zhou Wangchunshu, Zeng Guangtao, Bosselut Antoine, Sachan Mrinmaya
- Uncertainty-aware Unlikelihood Learning Improves Generative Aspect Sentiment Quad Prediction Hu Mengting, Bai Yinhao, Wu Yike, Zhang Zhen, Zhang Liqi, Gao Hang, Zhao Shiwan, Huang Minlie
- Enhancing Answer Selection In Community Question Answering With Pre-trained And Large Language Models Hu Xinghang
- Unlocking The Potential Of User Feedback: Leveraging Large Language Model As User Simulator To Enhance Dialogue System Hu Zhiyuan, Feng Yue, Luu Anh Tuan, Hooi Bryan, Lipani Aldo
- POP: Prompt Of Prompts For Continual Learning Hu Zhiyuan, Lyu Jiancheng, Gao Dashan, Vasconcelos Nuno
- SMILE: Single-turn To Multi-turn Inclusive Language Expansion Via Chatgpt For Mental Health Support Huachuan Qiu, Hongliang He, Shuai Zhang, Anqi Li, Zhenzhong Lan
- Directed Acyclic Transformer Pre-training For High-quality Non-autoregressive Text Generation Huang Fei, Ke Pei, Huang Minlie
- Free-bloom: Zero-shot Text-to-video Generator With LLM Director And LDM Animator Huang Hanzhuo, Feng Yufan, Shi Cheng, Xu Lan, Yu Jingyi, Yang Sibei
- A Survey On Hallucination In Large Language Models: Principles, Taxonomy, Challenges, And Open Questions Huang Lei, Yu Weijiang, Ma Weitao, Zhong Weihong, Feng Zhangyin, Wang Haotian, Chen Qianglong, Peng Weihua, Feng Xiaocheng, Qin Bing, Liu Ting
- Adapting Pre-trained Language Models To Vision-language Tasks Via Dynamic Visual Prompting Huang Shubin, Wu Qiong, Zhou Yiyi, Chen Weijie, Zhang Rongsheng, Sun Xiaoshuai, Ji Rongrong
- Generic Attention-model Explainability By Weighted Relevance Accumulation Huang Yiming, Jia Aozhe, Zhang Xiaodan, Zhang Jiawei
- Trustgpt: A Benchmark For Trustworthy And Responsible Large Language Models Huang Yue, Zhang Qihui, Yu Philip S., Sun Lichao
- Chatgpt Is Fun, But It Is Not Funny! Humor Is Still Challenging Large Language Models Jentzsch Sophie, Kersting Kristian
- Mistral 7B Jiang Albert Q., Sablayrolles Alexandre, Mensch Arthur, Bamford Chris, Chaplot Devendra Singh, Casas Diego De Las, Bressand Florian, Lengyel Gianna, Lample Guillaume, Saulnier Lucile, Lavaud Lélio Renard, Lachaux Marie-anne, Stock Pierre, Scao Teven Le, Lavril Thibaut, Wang Thomas, Lacroix Timothée, Sayed William El
- Llm-blender: Ensembling Large Language Models With Pairwise Ranking And Generative Fusion Jiang Dongfu, Ren Xiang, Lin Bill Yuchen
- Forcing Generative Models To Degenerate Ones: The Power Of Data Poisoning Attacks Jiang Shuli, Kadhe Swanand Ravindra, Zhou Yi, Cai Ling, Baracaldo Nathalie
- BRAINTEASER: Lateral Thinking Puzzles For Large Language Models Jiang Yifan, Ilievski Filip, Ma Kaixin, Sourati Zhivar
- Lion: Adversarial Distillation Of Proprietary Large Language Models Jiang Yuxin, Chan Chunkit, Chen Mingyang, Wang Wei
- Prompt-based Length Controlled Generation With Reinforcement Learning Jie Renlong, Meng Xiaojun, Shang Lifeng, Jiang Xin, Liu Qun
- GPT Is Becoming A Turing Machine: Here Are Some Ways To Program It Jojic Ana, Wang Zhen, Jojic Nebojsa
- "It's Not Like Jarvis, But It's Pretty Close!" -- Examining Chatgpt's Usage Among Undergraduate Students In Computer Science Joshi Ishika, Budhiraja Ritvik, Akolekar Harshal D., Challa Jagat Sesh, Kumar Dhruv
- Chatgpt In The Classroom: An Analysis Of Its Strengths And Weaknesses For Solving Undergraduate Computer Science Questions Joshi Ishika, Budhiraja Ritvik, Dev Harshal, Kadia Jahnvi, Ataullah M. Osama, Mitra Sayan, Kumar Dhruv, Akolekar Harshal D.
- A Case Study On Test Case Construction With Large Language Models: Unveiling Practical Insights And Challenges Junior Roberto Francisco De Lima, Presta Luiz Fernando Paes De Barros, Borborema Lucca Santos, Da Silva Vanderson Nogueira, Dahia Marcio Leal De Melo, Santos Anderson Carlos Sousa E
- A Comprehensive Capability Analysis Of GPT-3 And GPT-3.5 Series Models Junjie Ye, Xuanting Chen, Nuo Xu, Can Zu, Zekai Shao, Shichun Liu, Yuhan Cui, Zeyang Zhou, Chao Gong, Yang Shen, Jie Zhou, Siming Chen, Tao Gui, Qi Zhang, Xuanjing Huang
- Recommendation As Instruction Following: A Large Language Model Empowered Recommendation Approach Junjie Zhang, Ruobing Xie, Yupeng Hou, Wayne Xin Zhao, Leyu Lin, Ji-rong Wen
- Robust Recommender System: A Survey And Future Directions Kaike Zhang, Qi Cao, Fei Sun, Yunfan Wu, Shuchang Tao, Huawei Shen, Xueqi Cheng
- Syntax-guided Transformers: Elevating Compositional Generalization And Grounding In Multimodal Environments Kamali Danial, Kordjamshidi Parisa
- Reducing Sequence Length By Predicting Edit Operations With Large Language Models Kaneko Masahiro, Okazaki Naoaki
- Llm-augmented Preference Learning From Natural Language Kang Inwon, Ruan Sikai, Ho Tyler, Lin Jui-chien, Mohsin Farhad, Seneviratne Oshani, Xia Lirong
- The Impact Of Positional Encoding On Length Generalization In Transformers Kazemnejad Amirhossein, Padhi Inkit, Ramamurthy Karthikeyan Natesan, Das Payel, Reddy Siva
- Unnatural Language Processing: How Do Language Models Handle Machine-generated Prompts? Kervadec Corentin, Franzon Francesca, Baroni Marco
- Compressed Context Memory For Online Language Model Interaction Kim Jang-hyun, Yeom Junyoung, Yun Sangdoo, Song Hyun Oh
- Do Large Language Models Pay Similar Attention Like Human Programmers When Generating Code? Kou Bonan, Chen Shengmai, Wang Zhijie, Ma Lei, Zhang Tianyi
- Fundamentals Of Generative Large Language Models And Perspectives In Cyber-defense Kucharavy Andrei, Schillaci Zachary, Maréchal Loïc, Würsch Maxime, Dolamic Ljiljana, Sabonnadiere Remi, David Dimitri Percia, Mermoud Alain, Lenders Vincent
- Large Language Models Humanize Technology Kumar Pratyush
- Large Language Models On The Chessboard: A Study On Chatgpt's Formal Language Comprehension And Complex Reasoning Skills Kuo Mu-tien, Hsueh Chih-chung, Tsai Richard Tzong-han
- M4LE: A Multi-ability Multi-range Multi-task Multi-domain Long-context Evaluation Benchmark For Large Language Models Kwan Wai-chung, Zeng Xingshan, Wang Yufei, Sun Yusen, Li Liangyou, Shang Lifeng, Liu Qun, Wong Kam-fai
- Chatgpt Beyond English: Towards A Comprehensive Evaluation Of Large Language Models In Multilingual Learning Lai Viet Dac, Ngo Nghia Trung, Veyseh Amir Pouran Ben, Man Hieu, Dernoncourt Franck, Bui Trung, Nguyen Thien Huu
- Mini-dalle3: Interactive Text To Image By Prompting Large Language Models Lai Zeqiang, Zhu Xizhou, Dai Jifeng, Qiao Yu, Wang Wenhai
- End-to-end Speech Recognition Contextualization With Large Language Models Lakomkin Egor, Wu Chunyang, Fathullah Yassir, Kalinli Ozlem, Seltzer Michael L., Fuegen Christian
- A Systematic Study And Comprehensive Evaluation Of Chatgpt On Benchmark Datasets Laskar Md Tahmid Rahman, Bari M Saiful, Rahman Mizanur, Bhuiyan Md Amran Hossen, Joty Shafiq, Huang Jimmy Xiangji
- Read-only Prompt Optimization For Vision-language Few-shot Learning Lee Dongjun, Song Seokwon, Suh Jihee, Choi Joonmyung, Lee Sanghyeok, Kim Hyunwoo J.
- Is Attention Required For ICL? Exploring The Relationship Between Model Architecture And In-context Learning Ability Lee Ivan, Jiang Nan, Berg-kirkpatrick Taylor
- Chatgpt: A Meta-analysis After 2.5 Months Leiter Christoph, Zhang Ran, Chen Yanran, Belouadi Jonas, Larionov Daniil, Fresen Vivian, Eger Steffen
- Self-detoxifying Language Models Via Toxification Reversal Leong Chak Tou, Cheng Yi, Wang Jiashuo, Wang Jian, Li Wenjie
- Talk The Walk: Synthetic Data Generation For Conversational Music Recommendation Leszczynski Megan, Zhang Shu, Ganti Ravi, Balog Krisztian, Radlinski Filip, Pereira Fernando, Chaganty Arun Tejasvi
- Teach Llms To Personalize -- An Approach Inspired By Writing Education Li Cheng, Zhang Mingyang, Mei Qiaozhu, Wang Yaqing, Hombaiah Spurthi Amba, Liang Yi, Bendersky Michael
- DISTFLASHATTN: Distributed Memory-efficient Attention For Long-context Llms Training Li Dacheng, Shao Rulin, Xie Anze, Xing Eric P., Ma Xuezhe, Stoica Ion, Gonzalez Joseph E., Zhang Hao
- Explaincpe: A Free-text Explanation Benchmark Of Chinese Pharmacist Examination Li Dongfang, Yu Jindi, Hu Baotian, Xu Zhenran, Zhang Min
- Repetition In Repetition Out: Towards Understanding Neural Text Degeneration From The Data Perspective Li Huayang, Lan Tian, Fu Zihao, Cai Deng, Liu Lemao, Collier Nigel, Watanabe Taro, Su Yixuan
- Graphix-t5: Mixing Pre-trained Transformers With Graph-aware Layers For Text-to-sql Parsing Li Jinyang, Hui Binyuan, Cheng Reynold, Qin Bowen, Ma Chenhao, Huo Nan, Huang Fei, Du Wenyu, Si Luo, Li Yongbin
- Prefix Propagation: Parameter-efficient Tuning For Long Sequences Li Jonathan, Aitken Will, Bhambhoria Rohan, Zhu Xiaodan
- Inference-time Intervention: Eliciting Truthful Answers From A Language Model Li Kenneth, Patel Oam, Viégas Fernanda, Pfister Hanspeter, Wattenberg Martin
- Summarizing Multiple Documents With Conversational Structure For Meta-review Generation Li Miao, Hovy Eduard, Lau Jey Han
- Starcoder: May The Source Be With You! Li Raymond, Allal Loubna Ben, Zi Yangtian, Muennighoff Niklas, Kocetkov Denis, Mou Chenghao, Marone Marc, Akiki Christopher, Li Jia, Chim Jenny, Liu Qian, Zheltonozhskii Evgenii, Zhuo Terry Yue, Wang Thomas, Dehaene Olivier, Davaadorj Mishig, Lamy-poirier Joel, Monteiro João, Shliazhko Oleh, Gontier Nicolas, Meade Nicholas, Zebaze Armel, Yee Ming-ho, Umapathi Logesh Kumar, Zhu Jian, Lipkin Benjamin, Oblokulov Muhtasham, Wang Zhiruo, Murthy Rudra, Stillerman Jason, Patel Siva Sankalp, Abulkhanov Dmitry, Zocca Marco, Dey Manan, Zhang Zhihan, Fahmy Nour, Bhattacharyya Urvashi, Yu Wenhao, Singh Swayam, Luccioni Sasha, Villegas Paulo, Kunakov Maxim, Zhdanov Fedor, Romero Manuel, Lee Tony, Timor Nadav, Ding Jennifer, Schlesinger Claire, Schoelkopf Hailey, Ebert Jan, Dao Tri, Mishra Mayank, Gu Alex, Robinson Jennifer, Anderson Carolyn Jane, Dolan-gavitt Brendan, Contractor Danish, Reddy Siva, Fried Daniel, Bahdanau Dzmitry, Jernite Yacine, Ferrandis Carlos Muñoz, Hughes Sean, Wolf Thomas, Guha Arjun, Von Werra Leandro, De Vries Harm
- Unigen: A Unified Generative Framework For Retrieval And Question Answering With Large Language Models Li Xiaoxi, Zhou Yujia, Dou Zhicheng
- Unlock The Power: Competitive Distillation For Multi-modal Large Language Models Li Xinwei, Lin Li, Wang Shuai, Qian Chen
- PBNR: Prompt-based News Recommender System Li Xinyi, Zhang Yongfeng, Malthouse Edward C.
- Dissecting Chain-of-thought: Compositionality Through In-context Filtering And Learning Li Yingcong, Sreenivasan Kartik, Giannou Angeliki, Papailiopoulos Dimitris, Oymak Samet
- Posgen: Personalized Opening Sentence Generation For Online Insurance Sales Li Yu, Zhang Yi, Wu Weijia, Zhou Zimu, Li Qiang
- Unveiling The Pitfalls Of Knowledge Editing For Large Language Models Li Zhoubo, Zhang Ningyu, Yao Yunzhi, Wang Mengru, Chen Xi, Chen Huajun
- Label Supervised Llama Finetuning Li Zongxi, Li Xianming, Liu Yuzhang, Xie Haoran, Li Jing, Wang Fu-lee, Li Qing, Zhong Xiaoqin
- GPT-4 Enhanced Multimodal Grounding For Autonomous Driving: Leveraging Cross-modal Attention With Large Language Models Liao Haicheng, Shen Huanming, Li Zhenning, Wang Chengyue, Li Guofa, Bie Yiming, Xu Chengzhong
- A Survey On Large Language Models For Recommendation Likang Wu, Zhi Zheng, Zhaopeng Qiu, Hao Wang, Hongchao Gu, Tingjia Shen, Chuan Qin, Chen Zhu, Hengshu Zhu, Qi Liu, Hui Xiong, Enhong Chen
- Rella: Retrieval-enhanced Large Language Models For Lifelong Sequential Behavior Comprehension In Recommendation Lin Jianghao, Shan Rong, Zhu Chenxu, Du Kounianhua, Chen Bo, Quan Shigang, Tang Ruiming, Yu Yong, Zhang Weinan
- Advances In Embodied Navigation Using Large Language Models: A Survey Lin Jinzhou, Gao Han, Feng Xuxiang, Xu Rongtao, Wang Changwei, Zhang Man, Guo Li, Xu Shibiao
- Exposing Attention Glitches With Flip-flop Language Modeling Liu Bingbin, Ash Jordan T., Goel Surbhi, Krishnamurthy Akshay, Zhang Cyril
- Ring Attention With Blockwise Transformers For Near-infinite Context Liu Hao, Zaharia Matei, Abbeel Pieter
- Concise And Organized Perception Facilitates Reasoning In Large Language Models Liu Junjie, Yan Shaotian, Shen Chen, Xie Liang, Wang Wenxiao, Ye Jieping
- Class Incremental Learning With Pre-trained Vision-language Models Liu Xialei, Cao Xusheng, Lu Haori, Xiao Jia-wen, Bagdanov Andrew D., Cheng Ming-ming
- Query-utterance Attention With Joint Modeling For Query-focused Meeting Summarization Liu Xingxian, Duan Bin, Xiao Bo, Xu Yajing
- Instruction Position Matters In Sequence Generation With Large Language Models Liu Yijin, Zeng Xianfeng, Meng Fandong, Zhou Jie
- Binary And Ternary Natural Language Generation Liu Zechun, Oguz Barlas, Pappu Aasish, Shi Yangyang, Krishnamoorthi Raghuraman
- Picking The Underused Heads: A Network Pruning Perspective Of Attention Head Selection For Fusing Dialogue Coreference Information Liu Zhengyuan, Chen Nancy F.
- Deja Vu: Contextual Sparsity For Efficient Llms At Inference Time Liu Zichang, Wang Jue, Dao Tri, Zhou Tianyi, Yuan Binhang, Song Zhao, Shrivastava Anshumali, Zhang Ce, Tian Yuandong, Re Christopher, Chen Beidi
- Probing Pretrained Language Models With Hierarchy Properties Lovón-melgarejo Jesús, Moreno Jose G., Besançon Romaric, Ferret Olivier, Tamine Lynda
- Effectively Fine-tune To Improve Large Multimodal Models For Radiology Report Generation Lu Yuzhe, Hong Sungmin, Shah Yash, Xu Panpan
- Prosg: Using Prompt Synthetic Gradients To Alleviate Prompt Forgetting Of Rnn-like Language Models Luo Haotian, Wu Kunming, Dai Cheng, Ding Sixian, Chen Xinhao
- Zero-resource Hallucination Prevention For Large Language Models Luo Junyu, Xiao Cao, Ma Fenglong
- Hrot: Hybrid Prompt Strategy And Retrieval Of Thought For Table-text Hybrid Question Answering Luo Tongxu, Lei Fangyu, Lei Jiahe, Liu Weihao, He Shihu, Zhao Jun, Liu Kang
- Are We Falling In A Middle-intelligence Trap? An Analysis And Mitigation Of The Reversal Curse Lv Ang, Zhang Kaiyi, Xie Shufang, Tu Quan, Chen Yuhan, Wen Ji-rong, Yan Rui
- Translating Radiology Reports Into Plain Language Using Chatgpt And GPT-4 With Prompt Learning: Promising Results, Limitations, And Potential Lyu Qing, Tan Josh, Zapadka Michael E., Ponnatapura Janardhana, Niu Chuang, Myers Kyle J., Wang Ge, Whitlow Christopher T.
- Vista-llama: Reliable Video Narrator Via Equal Distance To Visual Tokens Ma Fan, Jin Xiaojie, Wang Heng, Xian Yuchen, Feng Jiashi, Yang Yi
- Query Rewriting For Retrieval-augmented Large Language Models Ma Xinbei, Gong Yeyun, He Pengcheng, Zhao Hai, Duan Nan
- VOLTA: Improving Generative Diversity By Variational Mutual Information Maximizing Autoencoder Ma Yueen, Chi Dafeng, Li Jingjing, Song Kai, Zhuang Yuzheng, King Irwin
- Large Language Models For Telecom: Forthcoming Impact On The Industry Maatouk Ali, Piovesan Nicola, Ayed Fadhel, De Domenico Antonio, Debbah Merouane
- Do Prompt Positions Really Matter? Mao Junyu, Middleton Stuart E., Niranjan Mahesan
- Unitrec: A Unified Text-to-text Transformer And Joint Contrastive Learning Framework For Text-based Recommendation Mao Zhiming, Wang Huimin, Du Yiming, Wong Kam-fai
- The Hydra Effect: Emergent Self-repair In Language Model Computations Mcgrath Thomas, Rahtz Matthew, Kramar Janos, Mikulik Vladimir, Legg Shane
- JAB: Joint Adversarial Prompting And Belief Augmentation Mehrabi Ninareh, Goyal Palash, Ramakrishna Anil, Dhamala Jwala, Ghosh Shalini, Zemel Richard, Chang Kai-wei, Galstyan Aram, Gupta Rahul
- An Empirical Study Of Netops Capability Of Pre-trained Large Language Models Miao Yukai, Bai Yu, Chen Li, Li Dan, Sun Haifeng, Wang Xizheng, Luo Ziqiu, Ren Yanyu, Sun Dapeng, Xu Xiuting, Zhang Qi, Xiang Chao, Li Xinchi
- LAIT: Efficient Multi-segment Encoding In Transformers With Layer-adjustable Interaction Milbauer Jeremiah, Louis Annie, Hosseini Mohammad Javad, Fabrikant Alex, Metzler Donald, Schuster Tal
- ALMANACS: A Simulatability Benchmark For Language Model Explainability Mills Edmund, Su Shiye, Russell Stuart, Emmons Scott
- Attention Link: An Efficient Attention-based Low Resource Machine Translation Architecture Min Zeping
- VQA With Cascade Of Self- And Co-attention Blocks Mishra Aakansha, Anand Ashish, Guha Prithwijit
- Landmark Attention: Random-access Infinite Context Length For Transformers Mohtashami Amirkeivan, Jaggi Martin
- Learning To Compress Prompts With Gist Tokens Mu Jesse, Li Xiang Lisa, Goodman Noah
- A Brief History Of Prompt: Leveraging Language Models. (through Advanced Prompting) Muktadir Golam Md
- Trusting Language Models In Education Neto Jogi Suda, Deng Li, Raya Thejaswi, Shahbazi Reza, Liu Nick, Venkatesh Adhitya, Shah Miral, Khosla Neeru, Guido Rodrigo Capobianco
- Exploring The Maze Of Multilingual Modeling Nezhad Sina Bagheri, Agrawal Ameeta
- EVJVQA Challenge: Multilingual Visual Question Answering Nguyen Ngan Luu-thuy, Nguyen Nghia Hieu, Vo Duong T. D, Tran Khanh Quoc, Van Nguyen Kiet
- Capabilities Of GPT-4 On Medical Challenge Problems Nori Harsha, King Nicholas, Mckinney Scott Mayer, Carignan Dean, Horvitz Eric
- Blackvip: Black-box Visual Prompting For Robust Transfer Learning Oh Changdae, Hwang Hyeji, Lee Hee-young, Lim Yongtaek, Jung Geunyoung, Jung Jiyoung, Choi Hosik, Song Kyungwoo
- Automated Assessment Of Students' Code Comprehension Using Llms Oli Priti, Banjade Rabin, Chapagain Jeevan, Rus Vasile
- Deepthought: An Architecture For Autonomous Self-motivated Systems Oliveira Arlindo L., Domingos Tiago, Figueiredo Mário, Lima Pedro U.
- On The Role Of Attention In Prompt-tuning Oymak Samet, Rawat Ankit Singh, Soltanolkotabi Mahdi, Thrampoulidis Christos
- Giraffe: Adventures In Expanding Context Lengths In Llms Pal Arka, Karkhanis Deep, Roberts Manley, Dooley Samuel, Sundararajan Arvind, Naidu Siddartha
- A Preliminary Evaluation Of Chatgpt For Zero-shot Dialogue Understanding Pan Wenbo, Chen Qiguang, Xu Xiao, Che Wanxiang, Qin Libo
- Semantic Composition In Visually Grounded Language Models Pandey Rohan
- Interpretable Visual Question Answering Via Reasoning Supervision Parelli Maria, Mallis Dimitrios, Diomataris Markos, Pitsikalis Vassilis
- Longbox: Evaluating Transformers On Long-sequence Clinical Tasks Parmar Mihir, Naik Aakanksha, Gupta Himanshu, Agrawal Disha, Baral Chitta
- Evaluation Of Medium-large Language Models At Zero-shot Closed Book Generative Question Answering Peinl René, Wirth Johannes
- Customising General Large Language Models For Specialised Emotion Recognition Tasks Peng Liyizhe, Zhang Zixing, Pang Tao, Han Jing, Zhao Huan, Chen Hao, Schuller Björn W.
- When Do Prompting And Prefix-tuning Work? A Theory Of Capabilities And Limitations Petrov Aleksandar, Torr Philip H. S., Bibi Adel
- Attention Sorting Combats Recency Bias In Long Context Language Models Peysakhovich Alexander, Lerer Adam
- Let Models Speak Ciphers: Multiagent Debate Through Embeddings Pham Chau, Liu Boyi, Yang Yingxiang, Chen Zhengyu, Liu Tianyi, Yuan Jianbo, Plummer Bryan A., Wang Zhaoran, Yang Hongxia
- Jatmo: Prompt Injection Defense By Task-specific Finetuning Piet Julien, Alrashed Maha, Sitawarin Chawin, Chen Sizhe, Wei Zeming, Sun Elizabeth, Alomair Basel, Wagner David
- Hyena Hierarchy: Towards Larger Convolutional Language Models Poli Michael, Massaroli Stefano, Nguyen Eric, Fu Daniel Y., Dao Tri, Baccus Stephen, Bengio Yoshua, Ermon Stefano, Ré Christopher
- Mosaicbert: A Bidirectional Encoder Optimized For Fast Pretraining Portes Jacob, Trott Alex, Havens Sam, King Daniel, Venigalla Abhinav, Nadeem Moin, Sardana Nikhil, Khudia Daya, Frankle Jonathan
- Egovlpv2: Egocentric Video-language Pre-training With Fusion In The Backbone Pramanick Shraman, Song Yale, Nag Sayan, Lin Kevin Qinghong, Shah Hardik, Shou Mike Zheng, Chellappa Rama, Zhang Pengchuan
- Mini Minds: Exploring Bebeshka And Zlata Baby Models Proskurina Irina, Metzler Guillaume, Velcin Julien
- Chatgpt Vs Human-authored Text: Insights Into Controllable Text Summarization And Sentence Style Transfer Pu Dongqi, Demberg Vera
- Harnessing The Power Of Large Language Models For Empathetic Response Generation: Empirical Investigations And Improvements Qian Yushan, Zhang Wei-nan, Liu Ting
- Hijacking Large Language Models Via Adversarial In-context Learning Qiang Yao, Zhou Xiangyu, Zhu Dongxiao
- OPERA: Alleviating Hallucination In Multi-modal Large Language Models Via Over-trust Penalty And Retrospection-allocation Qidong Huang, Xiaoyi Dong, Pan Zhang, Bin Wang, Conghui He, Jiaqi Wang, Dahua Lin, Weiming Zhang, Nenghai Yu
- Improving In-context Learning Via Bidirectional Alignment Qin Chengwei, Xia Wenhan, Jiao Fangkai, Chen Chen, Hu Yuchen, Ding Bosheng, Joty Shafiq
- Is Chatgpt A General-purpose Natural Language Processing Task Solver? Qin Chengwei, Zhang Aston, Zhang Zhuosheng, Chen Jiaao, Yasunaga Michihiro, Yang Diyi
- Cross-lingual Prompting: Improving Zero-shot Chain-of-thought Reasoning Across Languages Qin Libo, Chen Qiguang, Wei Fuxuan, Huang Shijue, Che Wanxiang
- Transnormerllm: A Faster And Better Large Language Model With Improved Transnormer Qin Zhen, Li Dong, Sun Weigao, Sun Weixuan, Shen Xuyang, Han Xiaodong, Wei Yunshen, Lv Baohong, Luo Xiao, Qiao Yu, Zhong Yiran
- Bridging The Gap: Exploring The Capabilities Of Bridge-architectures For Complex Visual Reasoning Tasks Rajesh Kousik, Raman Mrigank, Karim Mohammed Asad, Chawla Pranit
- Mabvit -- Modified Attention Block Enhances Vision Transformers Ramesh Mahesh, Ramkumar Aswinkumar
- COLA: A Benchmark For Compositional Text-to-image Retrieval Ray Arijit, Radenovic Filip, Dubey Abhimanyu, Plummer Bryan A., Krishna Ranjay, Saenko Kate
- Towards A Performance Analysis On Pre-trained Visual Question Answering Models For Autonomous Driving Rekanar Kaavya, Eising Ciarán, Sistu Ganesh, Hayes Martin
- Context Compression For Auto-regressive Transformers With Sentinel Tokens Ren Siyu, Jia Qi, Zhu Kenny Q.
- I'm Afraid I Can't Do That: Predicting Prompt Refusal In Black-box Generative Language Models Reuter Max, Schulze William
- Sparq Attention: Bandwidth-efficient LLM Inference Ribar Luka, Chelombiev Ivan, Hudlass-galley Luke, Blake Charlie, Luschi Carlo, Orr Douglas
- Knowledge-infused Self Attention Transformers Roy Kaushik, Zi Yuxin, Narayanan Vignesh, Gaur Manas, Sheth Amit
- Retrieval-pretrained Transformer: Long-range Language Modeling With Self-retrieval Rubin Ohad, Berant Jonathan
- Attention-likelihood Relationship In Transformers Ruscio Valeria, Maiorca Valentino, Silvestri Fabrizio
- Plausibility Processing In Transformer Language Models: Focusing On The Role Of Attention Heads In GPT Ryu Soo Hyun
- Naturalness Of Attention: Revisiting Attention In Code Language Models Saad Mootez, Sharma Tushar
- Uncertainty Guided Global Memory Improves Multi-hop Question Answering Sagirova Alsu, Burtsev Mikhail
- Memory Injections: Correcting Multi-hop Reasoning Failures During Inference In Transformer-based Language Models Sakarvadia Mansi, Ajith Aswathy, Khan Arham, Grzenda Daniel, Hudson Nathaniel, Bauer André, Chard Kyle, Foster Ian
- Attention Lens: A Tool For Mechanistically Interpreting The Attention Head Information Retrieval Mechanism Sakarvadia Mansi, Khan Arham, Ajith Aswathy, Grzenda Daniel, Hudson Nathaniel, Bauer André, Chard Kyle, Foster Ian
- Zero-shot Translation Of Attention Patterns In VQA Models To Natural Language Salewski Leonard, Koepke A. Sophia, Lensch Hendrik P. A., Akata Zeynep
- Will Code Remain A Relevant User Interface For End-user Programming With Generative AI Models? Sarkar Advait
- RAMP: Retrieval And Attribute-marking Enhanced Prompting For Attribute-controlled Translation Sarti Gabriele, Htut Phu Mon, Niu Xing, Hsu Benjamin, Currey Anna, Dinu Georgiana, Nadejde Maria
- Interprompt: Interpretable Prompting For Interrelated Interpersonal Risk Factors In Reddit Posts Sathvik Msvpj, Sarkar Surjodeep, Saxena Chandni, Sohn Sunghwan, Garg Muskan
- Probing Conceptual Understanding Of Large Visual-language Models Schiappa Madeline, Abdullah Raiyaan, Azad Shehreen, Claypoole Jared, Cogswell Michael, Divakaran Ajay, Rawat Yogesh
- Chatbot-supported Thesis Writing: An Autoethnographic Report Schwenke Nicolas, Söbke Heinrich, Kraft Eckhard
- Surgicalgpt: End-to-end Language-vision GPT For Visual Question Answering In Surgery Seenivasan Lalithkumar, Islam Mobarakol, Kannan Gokul, Ren Hongliang
- Let's Have A Chat! A Conversation With Chatgpt: Technology, Applications, And Limitations Shahriar Sakib, Hayawi Kadhim
- Cognitive Effects In Large Language Models Shaki Jonathan, Kraus Sarit, Wooldridge Michael
- Enhancing Retrieval-augmented Large Language Models With Iterative Retrieval-generation Synergy Shao Zhihong, Gong Yeyun, Shen Yelong, Huang Minlie, Duan Nan, Chen Weizhu
- Is Translation Helpful? An Empirical Analysis Of Cross-lingual Transfer In Low-resource Dialog Generation Shen Lei, Yu Shuai, Shen Xiaoyu
- Improving The Robustness Of Transformer-based Large Language Models With Dynamic Attention Shen Lujia, Pu Yuwen, Ji Shouling, Li Changjiang, Zhang Xuhong, Ge Chunpeng, Wang Ting
- Large Language Model Alignment: A Survey Shen Tianhao, Jin Renren, Huang Yufei, Liu Chuang, Dong Weilong, Guo Zishan, Wu Xinwei, Liu Yan, Xiong Deyi
- Storygpt-v: Large Language Models As Consistent Story Visualizers Shen Xiaoqian, Elhoseiny Mohamed
- "Do Anything Now": Characterizing And Evaluating In-the-wild Jailbreak Prompts On Large Language Models Shen Xinyue, Chen Zeyuan, Backes Michael, Shen Yun, Zhang Yang
- Flexgen: High-throughput Generative Inference Of Large Language Models With A Single GPU Sheng Ying, Zheng Lianmin, Yuan Binhang, Li Zhuohan, Ryabinin Max, Fu Daniel Y., Xie Zhiqiang, Chen Beidi, Barrett Clark, Gonzalez Joseph E., Liang Percy, Ré Christopher, Stoica Ion, Zhang Ce
- TOAST: Transfer Learning Via Attention Steering Shi Baifeng, Gai Siyu, Darrell Trevor, Wang Xin
- Badgpt: Exploring Security Vulnerabilities Of Chatgpt Via Backdoor Attacks To Instructgpt Shi Jiawen, Liu Yixin, Zhou Pan, Sun Lichao
- Trusting Your Evidence: Hallucinate Less With Context-aware Decoding Shi Weijia, Han Xiaochuang, Lewis Mike, Tsvetkov Yulia, Zettlemoyer Luke, Yih Scott Wen-tau
- Exploring The Robustness Of Large Language Models For Solving Programming Problems Shirafuji Atsushi, Watanobe Yutaka, Ito Takumi, Morishita Makoto, Nakamura Yuki, Oda Yusuke, Suzuki Jun
- What Does CLIP Know About A Red Circle? Visual Prompt Engineering For Vlms Shtedritski Aleksandar, Rupprecht Christian, Vedaldi Andrea
- Mind Meets Machine: Unravelling Gpt-4's Cognitive Psychology Sifatkaur Dhingra, Manmeet Singh, Vaisakh Sb, Neetiraj Malviya, Sukhpal Singh Gill
- The Confidence-competence Gap In Large Language Models: A Cognitive Study Singh Aniket Kumar, Devkota Suman, Lamichhane Bishal, Dhakal Uttam, Dhakal Chandra
- Applenet: Visual Attention Parameterized Prompt Learning For Few-shot Remote Sensing Image Generalization Using CLIP Singha Mainak, Jha Ankit, Solanki Bhupendra, Bose Shirsha, Banerjee Biplab
- A Mathematical Abstraction For Balancing The Trade-off Between Creativity And Reality In Large Language Models Sinha Ritwik, Song Zhao, Zhou Tianyi
- On Early Detection Of Hallucinations In Factual Question Answering Snyder Ben, Moisescu Marius, Zafar Muhammad Bilal
- Decoding Chatgpt: A Taxonomy Of Existing Research, Current Challenges, And Possible Future Directions Sohail Shahab Saquib, Farhat Faiza, Himeur Yassine, Nadeem Mohammad, Madsen Dag Øivind, Singh Yashbir, Atalla Shadi, Mansoor Wathiq
- HAE-RAE Bench: Evaluation Of Korean Knowledge In Language Models Son Guijin, Lee Hanwool, Kim Suwan, Kim Huiseo, Lee Jaecheol, Yeom Je Won, Jung Jihyu, Kim Jung Woo, Kim Songseong
- Zebra: Extending Context Window With Layerwise Grouped Local-global Attention Song Kaiqiang, Wang Xiaoyang, Cho Sangwoo, Pan Xiaoman, Yu Dong
- Dynamic Context Pruning For Efficient And Interpretable Autoregressive Transformers Sotiris Anagnostidis, Dario Pavllo, Luca Biggio, Lorenzo Noci, Aurelien Lucchi, Thomas Hofmann
- Curriculum-guided Abstractive Summarization Sotudeh Sajad, Deilamsalehy Hanieh, Dernoncourt Franck, Goharian Nazli
- Assessing The Nature Of Large Language Models: A Caution Against Anthropocentrism Speed Ann
- Chatgpt: A Study On Its Utility For Ubiquitous Software Engineering Tasks Sridhara Giriprasad, G. Ranjani H., Mazumdar Sourav
- Core-cog: Conversational Recommendation Of Entities Using Constrained Generation Srivastava Harshvardhan, Pruthi Kanav, Chakrabarti Soumen, Mausam
- Structured Packing In LLM Training Improves Long Context Utilization Staniszewski Konrad, Tworkowski Szymon, Jaszczur Sebastian, Zhao Yu, Michalewski Henryk, Kuciński Łukasz, Miłoś Piotr
- A Mechanistic Interpretation Of Arithmetic Reasoning In Language Models Using Causal Mediation Analysis Stolfo Alessandro, Belinkov Yonatan, Sachan Mrinmaya
- Choice Fusion As Knowledge For Zero-shot Dialogue State Tracking Su Ruolin, Yang Jingfeng, Wu Ting-wei, Juang Biing-hwang
- F3-pruning: A Training-free And Generalized Pruning Strategy Towards Faster And Finer Text-to-video Synthesis Su Sitong, Liu Jianzhi, Gao Lianli, Song Jingkuan
- Can Large Language Models Change User Preference Adversarially? Subhash Varshini
- RCMHA: Relative Convolutional Multi-head Attention For Natural Language Modelling Sugiharto Herman, Aradea, Mubarok Husni
- Create Your World: Lifelong Text-to-image Diffusion Sun Gan, Liang Wenqi, Dong Jiahua, Li Jun, Ding Zhengming, Cong Yang
- Fine-grained Audio-visual Joint Representations For Multimodal Large Language Models Sun Guangzhi, Yu Wenyi, Tang Changli, Chen Xianzhao, Tan Tian, Li Wei, Lu Lu, Ma Zejun, Zhang Chao
- Reinforcement Learning In The Era Of Llms: What Is Essential? What Is Needed? An RL Perspective On RLHF, Prompting, And Beyond Sun Hao
- EVA-CLIP: Improved Training Techniques For CLIP At Scale Sun Quan, Fang Yuxin, Wu Ledell, Wang Xinlong, Cao Yue
- Retentive Network: A Successor To Transformer For Large Language Models Sun Yutao, Dong Li, Huang Shaohan, Ma Shuming, Xia Yuqing, Xue Jilong, Wang Jianyong, Wei Furu
- Uncovering Chatgpt's Capabilities In Recommender Systems Sunhao Dai, Ninglu Shao, Haiyuan Zhao, Weijie Yu, Zihua Si, Chen Xu, Zhongxiang Sun, Xiao Zhang, Jun Xu
- Exploring Chain-of-thought Style Prompting For Text-to-sql Tai Chang-you, Chen Ziru, Zhang Tianshu, Deng Xiang, Sun Huan
- Sparsity-guided Holistic Explanation For Llms With Interpretable Inference-time Intervention Tan Zhen, Chen Tianlong, Zhang Zhenyu, Liu Huan
- Interpreting Pretrained Language Models Via Concept Bottlenecks Tan Zhen, Cheng Lu, Wang Song, Bo Yuan, Li Jundong, Liu Huan
- POUF: Prompt-oriented Unsupervised Fine-tuning For Large Pre-trained Models Tanwisuth Korawat, Zhang Shujian, Zheng Huangjie, He Pengcheng, Zhou Mingyuan
- Is Chatgpt The Ultimate Programming Assistant -- How Far Is It? Tian Haoye, Lu Weiqi, Li Tsz On, Tang Xunzhu, Cheung Shing-chi, Klein Jacques, Bissyandé Tegawendé F.
- Better Language Models Of Code Through Self-improvement To Hung Quoc, Bui Nghi D. Q., Guo Jin, Nguyen Tien N.
- Function Vectors In Large Language Models Todd Eric, Li Millicent L., Sharma Arnab Sen, Mueller Aaron, Wallace Byron C., Bau David
- Viclevr: A Visual Reasoning Dataset And Hybrid Multimodal Fusion Model For Visual Question Answering In Vietnamese Tran Khiem Vinh, Phan Hao Phu, Van Nguyen Kiet, Nguyen Ngan Luu Thuy
- Chatgpt As A Commenter To The News: Can Llms Generate Human-like Opinions? Tseng Rayden, Verberne Suzan, Van Der Putten Peter
- Focused Transformer: Contrastive Training For Context Scaling Tworkowski Szymon, Staniszewski Konrad, Pacek Mikołaj, Wu Yuhuai, Michalewski Henryk, Miłoś Piotr
- Simple Is Better And Large Is Not Enough: Towards Ensembling Of Foundational Language Models Tyagi Nancy, Shiri Aidin, Sarkar Surjodeep, Umrawal Abhishek Kumar, Gaur Manas
- Humans And Language Models Diverge When Predicting Repeating Text Vaidya Aditya R., Turek Javier, Huth Alexander G.
- Structured Thoughts Automaton: First Formalized Execution Model For Auto-regressive Language Models Vanderbruggen Tristan, Liao Chunhua, Pirkelbauer Peter, Lin Pei-hung
- Adventures Of Trustworthy Vision-language Models: A Survey Vatsa Mayank, Jain Anubhooti, Singh Richa
- BERT4CTR: An Efficient Framework To Combine Pre-trained Language Model With Non-textual Features For CTR Prediction Wang Dong, Salamatian Kavé, Xia Yunqing, Deng Weiwei, Zhiang Qi
- Docllm: A Layout-aware Generative Language Model For Multimodal Document Understanding Wang Dongsheng, Raman Natraj, Sibue Mathieu, Ma Zhiqiang, Babkin Petr, Kaur Simerjot, Pei Yulong, Nourbakhsh Armineh, Liu Xiaomo
- Robust Natural Language Understanding With Residual Attention Debiasing Wang Fei, Huang James Y., Yan Tianyi, Zhou Wenxuan, Chen Muhao
- On The Robustness Of Chatgpt: An Adversarial And Out-of-distribution Perspective Wang Jindong, Hu Xixu, Hou Wenxin, Chen Hao, Zheng Runkai, Wang Yidong, Yang Linyi, Huang Haojun, Ye Wei, Geng Xiubo, Jiao Binxin, Zhang Yue, Xie Xing
- Improved Visual Fine-tuning With Natural Language Supervision Wang Junyang, Xu Yuanhong, Hu Juhua, Yan Ming, Sang Jitao, Qian Qi
- Compositional Text-to-image Synthesis With Attention Map Control Of Diffusion Models Wang Ruichen, Chen Zekang, Chen Chen, Ma Jian, Lu Haonan, Lin Xiaodong
- Vamos: Versatile Action Models For Video Understanding Wang Shijie, Zhao Qi, Do Minh Quan, Agarwal Nakul, Lee Kwonjoon, Sun Chen
- Probing Large Language Models From A Human Behavioral Perspective Wang Xintong, Li Xiaoyu, Li Xingshan, Biemann Chris
- Learning Hierarchical Prompt With Structured Linguistic Knowledge For Vision-language Models Wang Yubin, Jiang Xinyang, Cheng De, Li Dongsheng, Zhao Cairong
- Smarttrim: Adaptive Tokens And Attention Pruning For Efficient Vision-language Models Wang Zekun, Chen Jingchang, Zhou Wangchunshu, Zhu Haichao, Liang Jiafeng, Shan Liping, Liu Ming, Xu Dongliang, Yang Qing, Qin Bing
- An In-depth Investigation Of User Response Simulation For Conversational Search Wang Zhenduo, Xu Zhichao, Ai Qingyao, Srikumar Vivek
- Interviewbot: Real-time End-to-end Dialogue System To Interview Students For College Admission Wang Zihao, Keyes Nathan, Crawford Terry, Choi Jinho D.
- REPLUG: Retrieval-augmented Black-box Language Models Weijia Shi, Sewon Min, Michihiro Yasunaga, Minjoon Seo, Rich James, Mike Lewis, Luke Zettlemoyer, Wen-tau Yih
- Mastering Symbolic Operations: Augmenting Language Models With Compiled Neural Networks Weng Yixuan, Zhu Minjun, Xia Fei, Li Bin, He Shizhu, Liu Kang, Zhao Jun
- AI And The FCI: Can Chatgpt Project An Understanding Of Introductory Physics? West Colin G.
- The Generative AI Paradox: "What It Can Create, It May Not Understand" West Peter, Lu Ximing, Dziri Nouha, Brahman Faeze, Li Linjie, Hwang Jena D., Jiang Liwei, Fisher Jillian, Ravichander Abhilasha, Chandu Khyathi, Newman Benjamin, Koh Pang Wei, Ettinger Allyson, Choi Yejin
- System 2 Attention (Is Something You Might Need Too) Weston Jason, Sukhbaatar Sainbayar
- Rethinking Model Selection And Decoding For Keyphrase Generation With Pre-trained Sequence-to-sequence Models Wu Di, Ahmad Wasi Uddin, Chang Kai-wei
- Speechgen: Unlocking The Generative Power Of Speech Language Models With Prompts Wu Haibin, Chang Kai-wei, Wu Yuan-kuei, Lee Hung-yi
- P-laplacian Adaptation For Generative Pre-trained Vision-language Models Wu Haoyuan, Zhang Xinyun, Xu Peng, Liao Peiyu, Yao Xufeng, Yu Bei
- Efficient LLM Inference Solution On Intel GPU Wu Hui, Gan Yi, Yuan Feng, Ma Jing, Zhu Wei, Xu Yutao, Zhu Hong, Zhu Yuhua, Liu Xiaoli, Gu Jinghui, Zhao Peng
- Document Flattening: Beyond Concatenating Context For Document-level Neural Machine Translation Wu Minghao, Foster George, Qu Lizhen, Haffari Gholamreza
- Approximated Prompt Tuning For Vision-language Pre-trained Models Wu Qiong, Huang Shubin, Zhou Yiyi, Dai Pingyang, Shu Annan, Jiang Guannan, Ji Rongrong
- YUAN 2.0: A Large Language Model With Localized Filtering-based Attention Wu Shaohua, Zhao Xudong, Wang Shenling, Luo Jiangang, Li Lingjun, Chen Xi, Zhao Bing, Wang Wei, Yu Tong, Zhang Rongguo, Zhang Jiahua, Wang Chao
- Towards Robust Text Retrieval With Progressive Learning Wu Tong, Qin Yulei, Zhang Enwei, Xu Zihan, Gao Yuting, Li Ke, Sun Xing
- Unleashing Potential Of Evidence In Knowledge-intensive Dialogue Generation Wu Xianjie, Yang Jian, Li Tongliang, Liang Di, Zhang Shiwei, Du Yiyang, Li Zhoujun
- From Language Modeling To Instruction Following: Understanding The Behavior Shift In Llms After Instruction Tuning Wu Xuansheng, Yao Wenlin, Chen Jianshu, Pan Xiaoman, Wang Xiaoyang, Liu Ninghao, Yu Dong
- TLM: Token-level Masking For Transformers Wu Yangjun, Fang Kebin, Zhang Dongxiang, Wang Han, Zhang Hao, Chen Gang
- Jailbreaking GPT-4V Via Self-adversarial Attacks With System Prompts Wu Yuanwei, Li Xiang, Liu Yixin, Zhou Pan, Sun Lichao
- Openicl: An Open-source Framework For In-context Learning Wu Zhenyu, Wang Yaoxiang, Ye Jiacheng, Feng Jiangtao, Xu Jingjing, Qiao Yu, Wu Zhiyong
- Efficient Streaming Language Models With Attention Sinks Xiao Guangxuan, Tian Yuandong, Chen Beidi, Han Song, Lewis Mike
- Can I Trust Your Answer? Visually Grounded Video Question Answering Xiao Junbin, Yao Angela, Li Yicong, Chua Tat Seng
- Funqa: Towards Surprising Video Comprehension Xie Binzhu, Zhang Sicheng, Zhou Zitang, Li Bo, Zhang Yuanhan, Hessel Jack, Yang Jingkang, Liu Ziwei
- Openagents: An Open Platform For Language Agents In The Wild Xie Tianbao, Zhou Fan, Cheng Zhoujun, Shi Peng, Weng Luoxuan, Liu Yitao, Hua Toh Jing, Zhao Junning, Liu Qian, Liu Che, Liu Leo Z., Xu Yiheng, Su Hongjin, Shin Dongchan, Xiong Caiming, Yu Tao
- Olagpt: Empowering Llms With Human-like Problem-solving Abilities Xie Yuanzhen, Xie Tao, Lin Mingxiong, Wei Wentao, Li Chenglin, Kong Beibei, Chen Lei, Zhuo Chengxiang, Hu Bo, Li Zang
- A Prompt Log Analysis Of Text-to-image Generation Systems Xie Yutong, Pan Zhaoying, Ma Jinge, Jie Luo, Mei Qiaozhu
- A Preliminary Study Of Chatgpt On News Recommendation: Personalization, Provider Fairness, Fake News Xinyi Li, Yongfeng Zhang, Edward C. Malthouse
- Ctc-based Non-autoregressive Speech Translation Xu Chen, Liu Xiaoqian, Liu Xiaowen, Sun Qingxuan, Zhang Yuhao, Yang Murun, Dong Qianqian, Ko Tom, Wang Mingxuan, Xiao Tong, Ma Anxiang, Zhu Jingbo
- DPL: Decoupled Prompt Learning For Vision-language Models Xu Chen, Zhu Yuhan, Zhang Guozhen, Shen Haocheng, Liao Yixuan, Chen Xiaoxin, Wu Gangshan, Wang Limin
- Pixel Aligned Language Models Xu Jiarui, Zhou Xingyi, Yan Shen, Gu Xiuye, Arnab Anurag, Sun Chen, Wang Xiaolong, Schmid Cordelia
- Shattering The Agent-environment Interface For Fine-tuning Inclusive Language Models Xu Wanqiao, Dong Shi, Arumugam Dilip, Van Roy Benjamin
- Re-reading Improves Reasoning In Large Language Models Xu Xiaohan, Tao Chongyang, Shen Tao, Xu Can, Xu Hongbo, Long Guodong, Lou Jian-guang
- It Ain't That Bad: Understanding The Mysterious Performance Drop In OOD Generalization For Generative Transformer Models Xu Xingcheng, Pan Zihao, Zhang Haipeng, Yang Yanqing
- Representation Learning With Large Language Models For Recommendation Xubin Ren, Wei Wei, Lianghao Xia, Lixin Su, Suqi Cheng, Junfeng Wang, Dawei Yin, Chao Huang
- Pit One Against Many: Leveraging Attention-head Embeddings For Parameter-efficient Multi-head Attention Xue Huiyin, Aletras Nikolaos
- Bias And Fairness In Chatbots: An Overview Xue Jintang, Wang Yun-cheng, Wei Chengwei, Liu Xiaofeng, Woo Jonghye, Kuo C. -c. Jay
- Evaluating Large Language Models' Ability Using A Psychiatric Screening Tool Based On Metaphor And Sarcasm Scenarios Yakura Hiromu
- Voila-a: Aligning Vision-language Models With User's Gaze Attention Yan Kun, Ji Lei, Wang Zeyu, Wang Yuntao, Duan Nan, Ma Shuai
- Zero-shot Query Reformulation For Conversational Search Yang Dayu, Zhang Yue, Fang Hui
- PALR: Personalization Aware Llms For Recommendation Yang Fan, Chen Zheng, Jiang Ziyan, Cho Eunah, Huang Xiaojiang, Lu Yanbin
- Empower Large Language Model To Perform Better On Industrial Domain-specific Question Answering Yang Fangkai, Zhao Pu, Wang Zezhong, Wang Lu, Zhang Jue, Garg Mohit, Lin Qingwei, Rajmohan Saravan, Zhang Dongmei
- Instoptima: Evolutionary Multi-objective Instruction Optimization Via Large Language Model-based Instruction Operators Yang Heng, Li Ke
- Longqlora: Efficient And Effective Method To Extend Context Length Of Large Language Models Yang Jianxin
- Iterative Forward Tuning Boosts In-context Learning In Language Models Yang Jiaxi, Hui Binyuan, Yang Min, Wang Bailin, Li Bowen, Li Binhua, Huang Fei, Li Yongbin
- Revisiting Parallel Context Windows: A Frustratingly Simple Alternative And Chain-of-thought Deterioration Yang Kejuan, Liu Xiao, Men Kaiwen, Zeng Aohan, Dong Yuxiao, Tang Jie
- Give Us The Facts: Enhancing Large Language Models With Knowledge Graphs For Fact-aware Language Modeling Yang Linyao, Chen Hongyang, Li Zhao, Ding Xiao, Wu Xindong
- Progressive Evidence Refinement For Open-domain Multimodal Retrieval Question Answering Yang Shuwen, Wu Anran, Wu Xingjiao, Xiao Luwei, Ma Tianlong, Jin Cheng, He Liang
- Dior-cvae: Pre-trained Language Models And Diffusion Priors For Variational Dialog Generation Yang Tianyu, Tran Thy Thy, Gurevych Iryna
- Human-in-the-loop Machine Translation With Large Language Model Yang Xinyi, Zhan Runzhe, Wong Derek F., Wu Junchao, Chao Lidia S.
- Bias A-head? Analyzing Bias In Transformer-based Language Model Attention Heads Yang Yi, Duan Hanyu, Abbasi Ahmed, Lalor John P., Tam Kar Yan
- Improving The Reliability Of Large Language Models By Leveraging Uncertainty-aware In-context Learning Yang Yuchen, Li Houqiang, Wang Yanfeng, Wang Yu
- Deepspeed-visualchat: Multi-round Multi-image Interleave Chat Via Multi-modal Causal Attention Yao Zhewei, Wu Xiaoxia, Li Conglong, Zhang Minjia, Qin Heyang, Ruwase Olatunji, Awan Ammar Ahmad, Rajbhandari Samyam, He Yuxiong
- Cognitive Mirage: A Review Of Hallucinations In Large Language Models Ye Hongbin, Liu Tong, Zhang Aijia, Hua Wei, Jia Weiqiang
- Navigating Text-to-image Customization: From Lycoris Fine-tuning To Model Evaluation Yeh Shih-ying, Hsieh Yu-guan, Gao Zhidong, Yang Bernard B W, Oh Giyeong, Gong Yanmin
- Decoding In-context Learning: Neuroscience-inspired Analysis Of Representations In Large Language Models Yousefi Safoora, Betthauser Leo, Hasanbeig Hosein, Millière Raphaël, Momennejad Ida
- MEGABYTE: Predicting Million-byte Sequences With Multiscale Transformers Yu Lili, Simig Dániel, Flaherty Colin, Aghajanyan Armen, Zettlemoyer Luke, Lewis Mike
- Stateful Large Language Model Serving With Pensieve Yu Lingfan, Li Jinyang
- Characterizing Mechanisms For Factual Recall In Language Models Yu Qinan, Merullo Jack, Pavlick Ellie
- Beneath Surface Similarity: Large Language Models Make Reasonable Scientific Analogies After Structure Abduction Yuan Siyu, Chen Jiangjie, Ge Xuyang, Xiao Yanghua, Yang Deqing
- Multilingual Augmentation For Robust Visual Question Answering In Remote Sensing Images Yuan Zhenghang, Mou Lichao, Zhu Xiao Xiang
- Prompt Highlighter: Interactive Control For Multi-modal Llms Yuechen Zhang, Shengju Qian, Bohao Peng, Shu Liu, Jiaya Jia
- Attention Satisfies: A Constraint-satisfaction Lens On Factual Errors Of Language Models Yuksekgonul Mert, Chandrasekaran Varun, Jones Erik, Gunasekar Suriya, Naik Ranjita, Palangi Hamid, Kamar Ece, Nushi Besmira
- Pre-training To Learn In Context Yuxian Gu, Li Dong, Furu Wei, Minlie Huang
- Fairness-aware Structured Pruning In Transformers Zayed Abdelrahman, Mordido Goncalo, Shabanian Samira, Baldini Ioana, Chandar Sarath
- Exploring Memorization In Fine-tuned Language Models Zeng Shenglai, Li Yaxin, Ren Jie, Liu Yiding, Xu Han, He Pengfei, Xing Yue, Wang Shuaiqiang, Tang Jiliang, Yin Dawei
- Is Chatgpt A Good Sentiment Analyzer? A Preliminary Study Zengzhi Wang, Qiming Xie, Yi Feng, Zixiang Ding, Zinong Yang, Rui Xia
- Benchmarking And In-depth Performance Study Of Large Language Models On Habana Gaudi Processors Zhang Chengming, Sun Baixi, Yu Xiaodong, Xie Zhen, Zheng Weijian, Iskra Kamil, Beckman Pete, Tao Dingwen
- Linear Attention Via Orthogonal Memory Zhang Jun, Jiang Shuyang, Feng Jiangtao, Zheng Lin, Kong Lingpeng
- Tell Your Model Where To Attend: Post-hoc Attention Steering For Llms Zhang Qingru, Singh Chandan, Liu Liyuan, Liu Xiaodong, Yu Bin, Gao Jianfeng, Zhao Tuo
- Llama-adapter: Efficient Fine-tuning Of Language Models With Zero-init Attention Zhang Renrui, Han Jiaming, Liu Chris, Gao Peng, Zhou Aojun, Hu Xiangfei, Yan Shilin, Lu Pan, Li Hongsheng, Qiao Yu
- Generation-driven Contrastive Self-training For Zero-shot Text Classification With Instruction-following LLM Zhang Ruohong, Wang Yau-shian, Yang Yiming
- History-aware Hierarchical Transformer For Multi-session Open-domain Dialogue System Zhang Tong, Liu Yong, Li Boyang, Zeng Zhiwei, Wang Pengwei, You Yuan, Miao Chunyan, Cui Lizhen
- Collm: Integrating Collaborative Embeddings Into Large Language Models For Recommendation Zhang Yang, Feng Fuli, Zhang Jizhi, Bao Keqin, Wang Qifan, He Xiangnan
- Cached Transformers: Improving Transformers With Differentiable Memory Cache Zhang Zhaoyang, Shao Wenqi, Ge Yixiao, Wang Xiaogang, Gu Jinwei, Luo Ping
- H\(_2\)O: Heavy-hitter Oracle For Efficient Generative Inference Of Large Language Models Zhang Zhenyu, Sheng Ying, Zhou Tianyi, Chen Tianlong, Zheng Lianmin, Cai Ruisi, Song Zhao, Tian Yuandong, Ré Christopher, Barrett Clark, Wang Zhangyang, Chen Beidi
- Defending Large Language Models Against Jailbreaking Attacks Through Goal Prioritization Zhang Zhexin, Yang Junxiao, Ke Pei, Mi Fei, Wang Hongning, Huang Minlie
- Accountable Textual-visual Chat Learns To Reject Human Instructions In Image Re-creation Zhang Zhiwei, Liu Yuliang
- Responsible Task Automation: Empowering Large Language Models As Responsible Task Automators Zhang Zhizheng, Zhang Xiaoyi, Xie Wenxuan, Lu Yan
- Tuning Layernorm In Attention: Towards Efficient Multi-modal LLM Finetuning Zhao Bingchen, Tu Haoqin, Wei Chen, Mei Jieru, Xie Cihang
- Chbias: Bias Evaluation And Mitigation Of Chinese Conversational Language Models Zhao Jiaxu, Fang Meng, Shi Zijing, Li Yitong, Chen Ling, Pechenizkiy Mykola
- A Survey Of Large Language Models Zhao Wayne Xin, Zhou Kun, Li Junyi, Tang Tianyi, Wang Xiaolei, Hou Yupeng, Min Yingqian, Zhang Beichen, Zhang Junjie, Dong Zican, Du Yifan, Yang Chen, Chen Yushuo, Chen Zhipeng, Jiang Jinhao, Ren Ruiyang, Li Yifan, Tang Xinyu, Liu Zikang, Liu Peiyu, Nie Jian-yun, Wen Ji-rong
- Learn From Model Beyond Fine-tuning: A Survey Zheng Hongling, Shen Li, Tang Anke, Luo Yong, Hu Han, Du Bo, Tao Dacheng
- Sglang: Efficient Execution Of Structured Language Model Programs Zheng Lianmin, Yin Liangsheng, Xie Zhiqiang, Sun Chuyue, Huang Jeff, Yu Cody Hao, Cao Shiyi, Kozyrakis Christos, Stoica Ion, Gonzalez Joseph E., Barrett Clark, Sheng Ying
- Trojfsp: Trojan Insertion In Few-shot Prompt Tuning Zheng Mengxin, Xue Jiaqi, Chen Xun, Wang Yanshan, Lou Qian, Jiang Lei
- Response Length Perception And Sequence Scheduling: An Llm-empowered LLM Inference Pipeline Zheng Zangwei, Ren Xiaozhe, Xue Fuzhao, Luo Yang, Jiang Xin, You Yang
- Exploring Human-like Translation Strategy With Large Language Models Zhiwei He, Tian Liang, Wenxiang Jiao, Zhuosheng Zhang, Yujiu Yang, Rui Wang, Zhaopeng Tu, Shuming Shi, Xing Wang
- Mitigating Hallucination In Visual Language Models With Visual Supervision Zhiyang Chen, Yousong Zhu, Yufei Zhan, Zhaowen Li, Chaoyang Zhao, Jinqiao Wang, Ming Tang
- Can Chatgpt Understand Too? A Comparative Study On Chatgpt And Fine-tuned BERT Zhong Qihuang, Ding Liang, Liu Juhua, Du Bo, Tao Dacheng
- Bag Of Tricks For Effective Language Model Pretraining And Downstream Adaptation: A Case Study On GLUE Zhong Qihuang, Ding Liang, Peng Keqin, Liu Juhua, Du Bo, Shen Li, Zhan Yibing, Tao Dacheng
- Don't Make Your LLM An Evaluation Benchmark Cheater Zhou Kun, Zhu Yutao, Chen Zhipeng, Chen Wentong, Zhao Wayne Xin, Chen Xu, Lin Yankai, Wen Ji-rong, Han Jiawei
- Unleashing GPT On The Metaverse: Savior Or Destroyer? Zhou Pengyuan
- Quantifying And Analyzing Entity-level Memorization In Large Language Models Zhou Zhenhong, Xiang Jiuyang, Chen Chaomeng, Su Sen
- On The Calibration Of Large Language Models And Alignment Zhu Chiwei, Xu Benfeng, Wang Quan, Zhang Yongdong, Mao Zhendong
- Coca: Fusing Position Embedding With Collinear Constrained Attention In Transformers For Long Context Window Extending Zhu Shiyi, Ye Jing, Jiang Wei, Xue Siqiao, Zhang Qi, Wu Yifan, Li Jianguo
- 3d-vista: Pre-trained Transformer For 3D Vision And Text Alignment Zhu Ziyu, Ma Xiaojian, Chen Yixin, Deng Zhidong, Huang Siyuan, Li Qing
- Scissorhands: Exploiting The Persistence Of Importance Hypothesis For LLM KV Cache Compression At Test Time Zichang Liu, Aditya Desai, Fangshuo Liao, Weitao Wang, Victor Xie, Zhaozhuo Xu, Anastasios Kyrillidis, Anshumali Shrivastava
- Aligner: One Global Token Is Worth Millions Of Parameters When Aligning Large Language Models Ziheng Zhou, Wu Yingnian, Zhu Song-chun, Terzopoulos Demetri
- A Family Of Pretrained Transformer Language Models For Russian Zmitrovich Dmitry, Abramov Alexander, Kalmykov Andrey, Tikhonova Maria, Taktasheva Ekaterina, Astafurov Danil, Baushenko Mark, Snegirev Artem, Kadulin Vitalii, Markov Sergey, Shavrina Tatiana, Mikhailov Vladislav, Fenogenova Alena
- Olvit: Multi-modal State Tracking Via Attention-based Embeddings For Video-grounded Dialog Abdessaied Adnen, Von Hochmeister Manuel, Bulling Andreas
- Attnlrp: Attention-aware Layer-wise Relevance Propagation For Transformers Achtibat Reduan, Hatefi Sayed Mohammad Vakilzadeh, Dreyer Maximilian, Jain Aakriti, Wiegand Thomas, Lapuschkin Sebastian, Samek Wojciech
- Keyformer: KV Cache Reduction Through Key Tokens Selection For Efficient Generative Inference Adnan Muhammad, Arunkumar Akhil, Jain Gaurav, Nair Prashant J., Soloveychik Ilya, Kamath Purushotham
- Tricy: Trigger-guided Data-to-text Generation With Intent Aware Attention-copy Agarwal Vibhav, Ghosh Sourav, Bss Harichandana, Arora Himanshu, Raja Barath Raj Kandur
- Leapformer: Enabling Linear Transformers For Autoregressive And Simultaneous Tasks Via Learned Proportions Agostinelli Victor, Hong Sanghyun, Chen Lizhong
- Shadowllm: Predictor-based Contextual Sparsity For Large Language Models Akhauri Yash, Abouelhamayed Ahmed F, Dotzel Jordan, Zhang Zhiru, Rush Alexander M, Huda Safeen, Abdelfattah Mohamed S
- A Survey Of Large Language Models For European Languages Ali Wazir, Pyysalo Sampo
- Chatgpt Alternative Solutions: Large Language Models Survey Alipour Hanieh, Pendar Nick, Roy Kohinoor
- D-NLP At Semeval-2024 Task 2: Evaluating Clinical Inference Capabilities Of Large Language Models Altinok Duygu
- Syntaxshap: Syntax-aware Explainability Method For Text Generation Amara Kenza, Sevastjanova Rita, El-assady Mennatallah
- Training-free Long-context Scaling Of Large Language Models An Chenxin, Huang Fei, Zhang Jun, Gong Shansan, Qiu Xipeng, Zhou Chang, Kong Lingpeng
- Sentencevae: Enable Next-sentence Prediction For Large Language Models With Faster Speed, Higher Accuracy And Longer Context An Hongjun, Chen Yifan, Sun Zhe, Li Xuelong
- Hired: Attention-guided Token Dropping For Efficient Inference Of High-resolution Vision-language Models In Resource-constrained Environments Arif Kazi Hasan Ibn, Yoon Jinyi, Nikolopoulos Dimitrios S., Vandierendonck Hans, John Deepu, Ji Bo
- Simple Linear Attention Language Models Balance The Recall-throughput Tradeoff Arora Simran, Eyuboglu Sabri, Zhang Michael, Timalsina Aman, Alberti Silas, Zinsley Dylan, Zou James, Rudra Atri, Ré Christopher
- What Makes Language Models Good-enough? Asami Daiki, Sugawara Saku
- Bifurcated Attention: Accelerating Massively Parallel Decoding With Shared Prefixes In Llms Athiwaratkun Ben, Gonugondla Sujan Kumar, Gouda Sanjay Krishna, Qian Haifeng, Ding Hantian, Sun Qing, Wang Jun, Guo Jiacheng, Chen Liangfu, Bhatia Parminder, Nallapati Ramesh, Sengupta Sudipta, Xiang Bing
- Walia-llm: Enhancing Amharic-llama By Integrating Task-specific And Generative Datasets Azime Israel Abebe, Tonja Atnafu Lambebo, Belay Tadesse Destaw, Fuge Mitiku Yohannes, Wassie Aman Kassahun, Jada Eyasu Shiferaw, Chanie Yonas, Sewunetie Walelign Tewabe, Yimam Seid Muhie
- Masked Mixers For Language Generation And Retrieval Badger Benjamin L.
- Citrus: Chunked Instruction-aware State Eviction For Long Sequence Modeling Bai Yu, Zou Xiyuan, Huang Heyan, Chen Sanxing, Rondeau Marc-antoine, Gao Yang, Cheung Jackie Chi Kit
- Hallucination Of Multimodal Large Language Models: A Survey Bai Zechen, Wang Pichao, Xiao Tianjun, He Tong, Han Zongbo, Zhang Zheng, Shou Mike Zheng
- Patentgpt: A Large Language Model For Intellectual Property Bai Zilong, Zhang Ruiji, Chen Linqing, Cai Qijun, Zhong Yuan, Wang Cong, Fang Yan, Fang Jie, Sun Jing, Wang Weikuan, Zhou Lizhi, Hua Haoran, Qiu Tian, Wang Chaochao, Sun Cheng, Lu Jianping, Wang Yixin, Xia Yubin, Hu Meng, Liu Haowen, Xu Peng, Xu Licong, Bian Fu, Gu Xiaolong, Zhang Lisha, Wang Weilei, Tu Changyang
- LLM Augmented Llms: Expanding Capabilities Through Composition Bansal Rachit, Samanta Bidisha, Dalmia Siddharth, Gupta Nitish, Vashishth Shikhar, Ganapathy Sriram, Bapna Abhishek, Jain Prateek, Talukdar Partha
- Transforming Agency. On The Mode Of Existence Of Large Language Models Barandiaran Xabier E., Almendros Lola S.
- Understanding Information Storage And Transfer In Multi-modal Large Language Models Basu Samyadeep, Grayson Martin, Morrison Cecily, Nushi Besmira, Feizi Soheil, Massiceti Daniela
- Evolutionary Multi-objective Optimization Of Large Language Model Prompts For Balancing Sentiments Baumann Jill, Kramer Oliver
- Xlstm: Extended Long Short-term Memory Beck Maximilian, Pöppel Korbinian, Spanring Markus, Auer Andreas, Prudnikova Oleksandra, Kopp Michael, Klambauer Günter, Brandstetter Johannes, Hochreiter Sepp
- Llm2vec: Large Language Models Are Secretly Powerful Text Encoders Behnamghader Parishad, Adlakha Vaibhav, Mosbach Marius, Bahdanau Dzmitry, Chapados Nicolas, Reddy Siva
- Internalinspector \(I^2\): Robust Confidence Estimation In Llms Through Internal States Beigi Mohammad, Shen Ying, Yang Runing, Lin Zihao, Wang Qifan, Mohan Ankith, He Jianfeng, Jin Ming, Lu Chang-tien, Huang Lifu
- Multi-head RAG: Solving Multi-aspect Problems With Llms Besta Maciej, Kubicek Ales, Niggli Roman, Gerstenberger Robert, Weitzendorf Lucas, Chi Mingyuan, Iff Patrick, Gajda Joanna, Nyczyk Piotr, Müller Jürgen, Niewiadomski Hubert, Chrapek Marcin, Podstawski Michał, Hoefler Torsten
- On The Robustness Of Language Models For Tabular Question Answering Bhandari Kushal Raj, Xing Sixue, Dan Soham, Gao Jianxi
- Recurrentgemma: Moving Past Transformers For Efficient Open Language Models Botev Aleksandar, De Soham, Smith Samuel L, Fernando Anushan, Muraru George-cristian, Haroun Ruba, Berrada Leonard, Pascanu Razvan, Sessa Pier Giuseppe, Dadashi Robert, Hussenot Léonard, Ferret Johan, Girgin Sertan, Bachem Olivier, Andreev Alek, Kenealy Kathleen, Mesnard Thomas, Hardin Cassidy, Bhupatiraju Surya, Pathak Shreya, Sifre Laurent, Rivière Morgane, Kale Mihir Sanjay, Love Juliette, Tafti Pouya, Joulin Armand, Fiedel Noah, Senter Evan, Chen Yutian, Srinivasan Srivatsan, Desjardins Guillaume, Budden David, Doucet Arnaud, Vikram Sharad, Paszke Adam, Gale Trevor, Borgeaud Sebastian, Chen Charlie, Brock Andy, Paterson Antonia, Brennan Jenny, Risdal Meg, Gundluru Raj, Devanathan Nesh, Mooney Paul, Chauhan Nilay, Culliton Phil, Martins Luiz Gustavo, Bandy Elisa, Huntsperger David, Cameron Glenn, Zucker Arthur, Warkentin Tris, Peran Ludovic, Giang Minh, Ghahramani Zoubin, Farabet Clément, Kavukcuoglu Koray, Hassabis Demis, Hadsell Raia, Teh Yee Whye, De Frietas Nando
- Embardiment: An Embodied AI Agent For Productivity In XR Bovo Riccardo, Abreu Steven, Ahuja Karan, Gonzalez Eric J, Cheng Li-te, Gonzalez-franco Mar
- Improving Sequential Recommendations With Llms Boz Artun, Zorgdrager Wouter, Kotti Zoe, Harte Jesse, Louridas Panos, Jannach Dietmar, Fragkoulis Marios
- Reducing Transformer Key-value Cache Size With Cross-layer Attention Brandon William, Mishra Mayank, Nrusimha Aniruddha, Panda Rameswar, Kelly Jonathan Ragan
- LOCOST: State-space Models For Long Document Abstractive Summarization Bronnec Florian Le, Duong Song, Ravaut Mathieu, Allauzen Alexandre, Chen Nancy F., Guigue Vincent, Lumbreras Alberto, Soulier Laure, Gallinari Patrick
- Lococo: Dropping In Convolutions For Long Context Compression Cai Ruisi, Tian Yuandong, Wang Zhangyang, Chen Beidi
- Medusa: Simple LLM Inference Acceleration Framework With Multiple Decoding Heads Cai Tianle, Li Yuhong, Geng Zhengyang, Peng Hongwu, Lee Jason D., Chen Deming, Dao Tri
- A Survey On Mixture Of Experts Cai Weilin, Jiang Juyong, Wang Fan, Tang Jing, Kim Sunghun, Huang Jiayi
- Figuring Out Figures: Using Textual References To Caption Scientific Figures Cao Stanley, Liu Kevin
- Head-wise Shareable Attention For Large Language Models Cao Zouying, Yang Yifei, Zhao Hai
- A Review Of Multi-modal Large Language And Vision Models Carolan Kilian, Fennelly Laura, Smeaton Alan F.
- AMEX: Android Multi-annotation Expo Dataset For Mobile GUI Agents Chai Yuxiang, Huang Siyuan, Niu Yazhe, Xiao Han, Liu Liang, Zhang Dingyu, Gao Peng, Ren Shuai, Li Hongsheng
- Dense Reward For Free In Reinforcement Learning From Human Feedback Chan Alex J., Sun Hao, Holt Samuel, Van Der Schaar Mihaela
- When Parts Are Greater Than Sums: Individual LLM Components Can Outperform Full Models Chang Ting-yun, Thomason Jesse, Jia Robin
- Large Language Models As Instruments Of Power: New Regimes Of Autonomous Manipulation And Control Chaudhary Yaqub, Penn Jonnie
- Lmstyle Benchmark: Evaluating Text Style Transfer For Chatbots Chen Jianlin
- Efficiency In Focus: Layernorm As A Catalyst For Fine-tuning Medical Visual Language Pre-trained Models Chen Jiawei, Yang Dingkang, Jiang Yue, Li Mingcheng, Wei Jinjie, Hou Xiaolu, Zhang Lihua
- Elcorec: Enhance Language Understanding With Co-propagation Of Numerical And Categorical Features For Recommendation Chen Jizheng, Du Kounianhua, Lin Jianghao, Chen Bo, Tang Ruiming, Zhang Weinan
- Pixart-\(\sigma\): Weak-to-strong Training Of Diffusion Transformer For 4K Text-to-image Generation Chen Junsong, Ge Chongjian, Xie Enze, Wu Yue, Yao Lewei, Ren Xiaozhe, Wang Zhongdao, Luo Ping, Lu Huchuan, Li Zhenguo
- Machine Unlearning In Large Language Models Chen Kongyang, Wang Zixin, Mi Bing, Liu Waixi, Wang Shaowei, Ren Xiaojun, Shen Jiaxing
- Enhancing Visual Question Answering Through Ranking-based Hybrid Training And Multimodal Fusion Chen Peiyuan, Zhang Zecheng, Dong Yiping, Zhou Li, Wang Han
- WSI-VQA: Interpreting Whole Slide Images By Generative Visual Question Answering Chen Pingyi, Zhu Chenglu, Zheng Sunyi, Li Honglin, Yang Lin
- Measuring Taiwanese Mandarin Language Understanding Chen Po-heng, Cheng Sijia, Chen Wei-lin, Lin Yen-ting, Chen Yun-nung
- M\(^3\)cot: A Novel Benchmark For Multi-domain Multi-step Multi-modal Chain-of-thought Chen Qiguang, Qin Libo, Zhang Jin, Chen Zhi, Xu Xiao, Che Wanxiang
- SATO: Stable Text-to-motion Framework Chen Wenshuo, Xiao Hongru, Zhang Erhang, Hu Lijie, Wang Lei, Liu Mengyuan, Chen Chen
- Gazexplain: Learning To Predict Natural Language Explanations Of Visual Scanpaths Chen Xianyu, Jiang Ming, Zhao Qi
- Eyegpt: Ophthalmic Assistant With Large Language Models Chen Xiaolan, Zhao Ziwei, Zhang Weiyi, Xu Pusheng, Gao Le, Xu Mingpu, Wu Yue, Li Yinwen, Shi Danli, He Mingguang
- NACL: A General And Effective KV Cache Eviction Framework For Llms At Inference Time Chen Yilong, Wang Guoxia, Shang Junyuan, Cui Shiyao, Zhang Zhenyu, Liu Tingwen, Wang Shuohuan, Sun Yu, Yu Dianhai, Wu Hua
- MAPO: Boosting Large Language Model Performance With Model-adaptive Prompt Optimization Chen Yuyan, Wen Zhihao, Fan Ge, Chen Zhengyu, Wu Wei, Liu Dayiheng, Li Zhixu, Liu Bang, Xiao Yanghua
- Dia-llama: Towards Large Language Model-driven CT Report Generation Chen Zhixuan, Luo Luyang, Bie Yequan, Chen Hao
- Improving Retrieval Augmented Open-domain Question-answering With Vectorized Contexts Chen Zhuo, Wang Xinyu, Jiang Yong, Xie Pengjun, Huang Fei, Tu Kewei
- An Empirical Study Of Data Ability Boundary In Llms' Math Reasoning Chen Zui, Chen Yezeng, Han Jiaqi, Huang Zhijie, Qi Ji, Zhou Yi
- "In Dialogues We Learn": Towards Personalized Dialogue Without Pre-defined Profiles Through In-dialogue Learning Cheng Chuanqi, Tu Quan, Wu Wei, Shang Shuo, Mao Cunli, Yu Zhengtao, Yan Rui
- Transferring Backdoors Between Large Language Models By Knowledge Distillation Cheng Pengzhou, Wu Zongru, Ju Tianjie, Du Wei, Zhang Zhuosheng, Liu Gongshen
- An Empirical Study Of In-context Learning In Llms For Machine Translation Chitale Pranjal A., Gala Jay, Dabre Raj
- SPAR: Personalized Content-based Recommendation Via Long Engagement Attention Chiyu Zhang, Yifei Sun, Jun Chen, Jie Lei, Muhammad Abdul-mageed, Sinong Wang, Rong Jin, Sem Park, Ning Yao, Bo Long
- Kv-runahead: Scalable Causal LLM Inference By Parallel Key-value Cache Generation Cho Minsik, Rastegari Mohammad, Naik Devang
- Lota-bench: Benchmarking Language-oriented Task Planners For Embodied Agents Choi Jae-woo, Yoon Youngwoo, Ong Hyobin, Kim Jaehong, Jang Minsu
- Cross-architecture Transfer Learning For Linear-cost Inference Transformers Choi Sehyun
- Breaking Down The Defenses: A Comparative Survey Of Attacks On Large Language Models Chowdhury Arijit Ghosh, Islam Md Mofijul, Kumar Vaibhav, Shezan Faysal Hossain, Kumar Vaibhav, Jain Vinija, Chadha Aman
- Meerkat: Audio-visual Large Language Model For Grounding In Space And Time Chowdhury Sanjoy, Nag Sayan, Dasgupta Subhrajyoti, Chen Jun, Elhoseiny Mohamed, Gao Ruohan, Manocha Dinesh
- Lookback Lens: Detecting And Mitigating Contextual Hallucinations In Large Language Models Using Only Attention Maps Chuang Yung-sung, Qiu Linlu, Hsieh Cheng-yu, Krishna Ranjay, Kim Yoon, Glass James
- Stable Language Model Pre-training By Reducing Embedding Variability Chung Woojin, Hong Jiwoo, An Na Min, Thorne James, Yun Se-young
- Open Source Conversational Llms Do Not Know Most Spanish Words Conde Javier, González Miguel, Melero Nina, Ferrando Raquel, Martínez Gonzalo, Merino-gómez Elena, Hernández José Alberto, Reviriego Pedro
- Attentionlego: An Open-source Building Block For Spatially-scalable Large Language Model Accelerator With Processing-in-memory Technology Cong Rongqing, He Wenyang, Li Mingxuan, Luo Bangning, Yang Zebin, Yang Yuchao, Huang Ru, Yan Bonan
- Induction Heads As An Essential Mechanism For Pattern Matching In In-context Learning Crosbie J., Shutova E.
- Moeut: Mixture-of-experts Universal Transformers Csordás Róbert, Irie Kazuki, Schmidhuber Jürgen, Potts Christopher, Manning Christopher D.
- The Power Of Noise: Redefining Retrieval For RAG Systems Cuconasu Florin, Trappolini Giovanni, Siciliano Federico, Filice Simone, Campagnano Cesare, Maarek Yoelle, Tonellotto Nicola, Silvestri Fabrizio
- Efficiently Exploring Large Language Models For Document-level Machine Translation With In-context Learning Cui Menglong, Du Jiangcun, Zhu Shaolin, Xiong Deyi
- Rethinking LLM Language Adaptation: A Case Study On Chinese Mixtral Cui Yiming, Yao Xin
- CORM: Cache Optimization With Recent Message For Large Language Model Inference Dai Jincheng, Huang Zhuowei, Jiang Haiyun, Chen Chen, Cai Deng, Bi Wei, Shi Shuming
- Muap: Multi-step Adaptive Prompt Learning For Vision-language Model With Missing Modality Dai Ruiting, Tan Yuqiao, Mo Lisi, He Tao, Qin Ke, Liang Shuang
- Large Language Models Can Better Understand Knowledge Graphs Than We Thought Dai Xinbang, Hua Yuncheng, Wu Tongtong, Sheng Yang, Ji Qiu, Qi Guilin
- Deepseek-v2: A Strong, Economical, And Efficient Mixture-of-experts Language Model Deepseek-ai, Aixin Liu, Bei Feng, Bin Wang, Bingxuan Wang, Bo Liu, Chenggang Zhao, Chengqi Dengr, Chong Ruan, Damai Dai, Daya Guo, Dejian Yang, Deli Chen, Dongjie Ji, Erhang Li, Fangyun Lin, Fuli Luo, Guangbo Hao, Guanting Chen, Guowei Li, H. Zhang, Hanwei Xu, Hao Yang, Haowei Zhang, Honghui Ding, Huajian Xin, Huazuo Gao, Hui Li, Hui Qu, J. L. Cai, Jian Liang, Jianzhong Guo, Jiaqi Ni, Jiashi Li, Jin Chen, Jingyang Yuan, Junjie Qiu, Junxiao Song, Kai Dong, Kaige Gao, Kang Guan, Lean Wang, Lecong Zhang, Lei Xu, Leyi Xia, Liang Zhao, Liyue Zhang, Meng Li, Miaojun Wang, Mingchuan Zhang, Minghua Zhang, Minghui Tang, Mingming Li, Ning Tian, Panpan Huang, Peiyi Wang, Peng Zhang, Qihao Zhu, Qinyu Chen, Qiushi Du, R. J. Chen, R. L. Jin, Ruiqi Ge, Ruizhe Pan, Runxin Xu, Ruyi Chen, S. S. Li, Shanghao Lu, Shangyan Zhou, Shanhuang Chen, Shaoqing Wu, Shengfeng Ye, Shirong Ma, Shiyu Wang, Shuang Zhou, Shuiping Yu, Shunfeng Zhou, Size Zheng, T. Wang, Tian Pei, Tian Yuan, Tianyu Sun, W. L. Xiao, Wangding Zeng, Wei An, Wen Liu, Wenfeng Liang, Wenjun Gao, Wentao Zhang, X. Q. Li, Xiangyue Jin, Xianzu Wang, Xiao Bi, Xiaodong Liu, Xiaohan Wang, Xiaojin Shen, Xiaokang Chen, Xiaosha Chen, Xiaotao Nie, Xiaowen Sun, Xiaoxiang Wang, Xin Liu, Xin Xie, Xingkai Yu, Xinnan Song, Xinyi Zhou, Xinyu Yang, Xuan Lu, Xuecheng Su, Y. Wu, Y. K. Li, Y. X. Wei, Y. X. Zhu, Yanhong Xu, Yanping Huang, Yao Li, Yao Zhao, Yaofeng Sun, Yaohui Li, Yaohui Wang, Yi Zheng, Yichao Zhang, Yiliang Xiong, Yilong Zhao, Ying He, Ying Tang, Yishi Piao, Yixin Dong, Yixuan Tan, Yiyuan Liu, Yongji Wang, Yongqiang Guo, Yuchen Zhu, Yuduan Wang, Yuheng Zou, Yukun Zha, Yunxian Ma, Yuting Yan, Yuxiang You, Yuxuan Liu, Z. Z. Ren, Zehui Ren, Zhangli Sha, Zhe Fu, Zhen Huang, Zhen Zhang, Zhenda Xie, Zhewen Hao, Zhihong Shao, Zhiniu Wen, Zhipeng Xu, Zhongyu Zhang, Zhuoshu Li, Zihan Wang, Zihui Gu, Zilin Li, Ziwei Xie
- Cram: Credibility-aware Attention Modification In Llms For Combating Misinformation In RAG Deng Boyi, Wang Wenjie, Zhu Fengbin, Wang Qifan, Feng Fuli
- ELASTIC: Efficient Linear Attention For Sequential Interest Compression Deng Jiaxin, Wang Shiyao, Lu Song, Li Yinfeng, Luo Xinchen, Liu Yuanjun, Xu Peixing, Zhou Guorui
- Attention Is Naturally Sparse With Gaussian Distributed Input Deng Yichuan, Song Zhao, Yang Chiwun
- Cognidual Framework: Self-training Large Language Models Within A Dual-system Theoretical Framework For Improving Cognitive Tasks Deng Yongxin, Qiu Xihe, Tan Xiaoyu, Qu Chao, Pan Jing, Cheng Yuan, Xu Yinghui, Chu Wei
- Text-tuple-table: Towards Information Integration In Text-to-table Generation Via Global Tuple Extraction Deng Zheye, Chan Chunkit, Wang Weiqi, Sun Yuxi, Fan Wei, Zheng Tianshi, Yim Yauwai, Song Yangqiu
- A Simple And Effective \(L_2\) Norm-based Strategy For KV Cache Compression Devoto Alessio, Zhao Yu, Scardapane Simone, Minervini Pasquale
- Hallu-pi: Evaluating Hallucination In Multi-modal Large Language Models Within Perturbed Inputs Ding Peng, Wu Jingyu, Kuang Jun, Ma Dan, Cao Xuezhi, Cai Xunliang, Chen Shi, Chen Jiajun, Huang Shujian
- Latent Attention For Linear Time Transformers Dolga Rares, Cobzarenco Marius, Barber David
- QAQ: Quality Adaptive Quantization For LLM KV Cache Dong Shichen, Cheng Wen, Qin Jiayu, Wang Wei
- Prompt Your Brain: Scaffold Prompt Tuning For Efficient Adaptation Of Fmri Pre-trained Model Dong Zijian, Wu Yilei, Chen Zijiao, Zhang Yichi, Jin Yueming, Zhou Juan Helen
- Integrating Physician Diagnostic Logic Into Large Language Models: Preference Learning From Process Feedback Dou Chengfeng, Jin Zhi, Jiao Wenpin, Zhao Haiyan, Zhao Yongqiang, Tao Zhenwei
- What's Wrong With Your Code Generated By Large Language Models? An Extensive Study Dou Shihan, Jia Haoxiang, Wu Shenxi, Zheng Huiyuan, Zhou Weikang, Wu Muling, Chai Mingxu, Fan Jessica, Huang Caishuang, Tao Yunbo, Liu Yan, Zhou Enyu, Zhang Ming, Zhou Yuhao, Wu Yueming, Zheng Rui, Wen Ming, Weng Rongxiang, Wang Jingang, Cai Xunliang, Gui Tao, Qiu Xipeng, Zhang Qi, Huang Xuanjing
- Enhancing Robustness Of Llm-synthetic Text Detectors For Academic Writing: A Comprehensive Analysis Dou Zhicheng, Guo Yuchen, Chang Ching-chun, Nguyen Huy H., Echizen Isao
- In-context Learning With Reinforcement Learning For Incomplete Utterance Rewriting Du Haowei, Zhao Dongyan
- Efficient Personalized Text-to-image Generation By Leveraging Textual Subspace Du Shian, Cheng Xiaotian, Qian Qi, Wei Henglu, Xu Yi, Ji Xiangyang
- On The Evaluation Consistency Of Attribution-based Explanations Duan Jiarui, Li Haoling, Zhang Haofei, Jiang Hao, Xue Mengqi, Sun Li, Song Mingli, Song Jie
- Prompting Multi-modal Tokens To Enhance End-to-end Autonomous Driving Imitation Learning With Llms Duan Yiqun, Zhang Qiang, Xu Renjing
- How To Think Step-by-step: A Mechanistic Understanding Of Chain-of-thought Reasoning Dutta Subhabrata, Singh Joykirat, Chakrabarti Soumen, Chakraborty Tanmoy
- Your Context Is Not An Array: Unveiling Random Access Limitations In Transformers Ebrahimi Mohammadreza, Panchal Sunny, Memisevic Roland
- Empirical Studies Of Parameter Efficient Methods For Large Language Models Of Code And Knowledge Transfer To R Esmaeili Amirreza, Saberi Iman, Fard Fatemeh H.
- Evaluating Human Alignment And Model Faithfulness Of LLM Rationale Fayyaz Mohsen, Yin Fan, Sun Jiao, Peng Nanyun
- Video-ccam: Enhancing Video-language Understanding With Causal Cross-attention Masks For Short And Long Videos Fei Jiajun, Li Dian, Deng Zhidong, Wang Zekun, Liu Gang, Wang Hui
- MERA: A Comprehensive LLM Evaluation In Russian Fenogenova Alena, Chervyakov Artem, Martynov Nikita, Kozlova Anastasia, Tikhonova Maria, Akhmetgareeva Albina, Emelyanov Anton, Shevelev Denis, Lebedev Pavel, Sinev Leonid, Isaeva Ulyana, Kolomeytseva Katerina, Moskovskiy Daniil, Goncharova Elizaveta, Savushkin Nikita, Mikhailova Polina, Dimitrov Denis, Panchenko Alexander, Markov Sergei
- Moa: Mixture Of Sparse Attention For Automatic Large Language Model Compression Fu Tianyu, Huang Haofeng, Ning Xuefei, Zhang Genghan, Chen Boju, Wu Tianqi, Wang Hongyi, Huang Zixiao, Li Shiyao, Yan Shengen, Dai Guohao, Yang Huazhong, Wang Yu
- Layoutllm: Large Language Model Instruction Tuning For Visually Rich Document Understanding Fujitake Masato
- Cost-efficient Large Language Model Serving For Multi-turn Conversations With Cachedattention Gao Bin, He Zhuomin, Sharma Puru, Kang Qingxuan, Jevdjic Djordje, Deng Junbo, Yang Xingkun, Yu Zhou, Zuo Pengfei
- Tc-llava: Rethinking The Transfer From Image To Video Understanding With Temporal Considerations Gao Mingze, Liu Jingyu, Li Mingda, Xie Jiangtao, Liu Qingbin, Zhao Bo, Chen Xi, Xiong Hui
- Lumina-t2x: Transforming Text Into Any Modality, Resolution, And Duration Via Flow-based Large Diffusion Transformers Gao Peng, Zhuo Le, Liu Dongyang, Du Ruoyi, Luo Xu, Qiu Longtian, Zhang Yuhang, Lin Chen, Huang Rongjie, Geng Shijie, Zhang Renrui, Xi Junlin, Shao Wenqi, Jiang Zhengkai, Yang Tianshuo, Ye Weicai, Tong He, He Jingwen, Qiao Yu, Li Hongsheng
- Gemma 2: Improving Open Language Models At A Practical Size Gemma Team, Riviere Morgane, Pathak Shreya, Sessa Pier Giuseppe, Hardin Cassidy, Bhupatiraju Surya, Hussenot Léonard, Mesnard Thomas, Shahriari Bobak, Ramé Alexandre, Ferret Johan, Liu Peter, Tafti Pouya, Friesen Abe, Casbon Michelle, Ramos Sabela, Kumar Ravin, Lan Charline Le, Jerome Sammy, Tsitsulin Anton, Vieillard Nino, Stanczyk Piotr, Girgin Sertan, Momchev Nikola, Hoffman Matt, Thakoor Shantanu, Grill Jean-bastien, Neyshabur Behnam, Bachem Olivier, Walton Alanna, Severyn Aliaksei, Parrish Alicia, Ahmad Aliya, Hutchison Allen, Abdagic Alvin, Carl Amanda, Shen Amy, Brock Andy, Coenen Andy, Laforge Anthony, Paterson Antonia, Bastian Ben, Piot Bilal, Wu Bo, Royal Brandon, Chen Charlie, Kumar Chintu, Perry Chris, Welty Chris, Choquette-choo Christopher A., Sinopalnikov Danila, Weinberger David, Vijaykumar Dimple, Rogozińska Dominika, Herbison Dustin, Bandy Elisa, Wang Emma, Noland Eric, Moreira Erica, Senter Evan, Eltyshev Evgenii, Visin Francesco, Rasskin Gabriel, Wei Gary, Cameron Glenn, Martins Gus, Hashemi Hadi, Klimczak-plucińska Hanna, Batra Harleen, Dhand Harsh, Nardini Ivan, Mein Jacinda, Zhou Jack, Svensson James, Stanway Jeff, Chan Jetha, Zhou Jin Peng, Carrasqueira Joana, Iljazi Joana, Becker Jocelyn, Fernandez Joe, Van Amersfoort Joost, Gordon Josh, Lipschultz Josh, Newlan Josh, Ji Ju-yeong, Mohamed Kareem, Badola Kartikeya, Black Kat, Millican Katie, Mcdonell Keelin, Nguyen Kelvin, Sodhia Kiranbir, Greene Kish, Sjoesund Lars Lowe, Usui Lauren, Sifre Laurent, Heuermann Lena, Lago Leticia, Mcnealus Lilly, Soares Livio Baldini, Kilpatrick Logan, Dixon Lucas, Martins Luciano, Reid Machel, Singh Manvinder, Iverson Mark, Görner Martin, Velloso Mat, Wirth Mateo, Davidow Matt, Miller Matt, Rahtz Matthew, Watson Matthew, Risdal Meg, Kazemi Mehran, Moynihan Michael, Zhang Ming, Kahng Minsuk, Park Minwoo, Rahman Mofi, Khatwani Mohit, Dao Natalie, Bardoliwalla Nenshad, Devanathan Nesh, Dumai Neta, Chauhan Nilay, Wahltinez Oscar, Botarda Pankil, Barnes Parker, Barham Paul, Michel Paul, Jin Pengchong, Georgiev Petko, Culliton Phil, Kuppala Pradeep, Comanescu Ramona, Merhej Ramona, Jana Reena, Rokni Reza Ardeshir, Agarwal Rishabh, Mullins Ryan, Saadat Samaneh, Carthy Sara Mc, Perrin Sarah, Arnold Sébastien M. R., Krause Sebastian, Dai Shengyang, Garg Shruti, Sheth Shruti, Ronstrom Sue, Chan Susan, Jordan Timothy, Yu Ting, Eccles Tom, Hennigan Tom, Kocisky Tomas, Doshi Tulsee, Jain Vihan, Yadav Vikas, Meshram Vilobh, Dharmadhikari Vishal, Barkley Warren, Wei Wei, Ye Wenming, Han Woohyun, Kwon Woosuk, Xu Xiang, Shen Zhe, Gong Zhitao, Wei Zichuan, Cotruta Victor, Kirk Phoebe, Rao Anand, Giang Minh, Peran Ludovic, Warkentin Tris, Collins Eli, Barral Joelle, Ghahramani Zoubin, Hadsell Raia, Sculley D., Banks Jeanine, Dragan Anca, Petrov Slav, Vinyals Oriol, Dean Jeff, Hassabis Demis, Kavukcuoglu Koray, Farabet Clement, Buchatskaya Elena, Borgeaud Sebastian, Fiedel Noah, Joulin Armand, Kenealy Kathleen, Dadashi Robert, Andreev Alek
- Beyond Self-learned Attention: Mitigating Attention Bias In Transformer-based Models Using Attention Guidance Gesi Jiri, Ahmed Iftekhar
- Uncovering Hidden Intentions: Exploring Prompt Recovery For Deeper Insights Into Generated Texts Give Louis, Zaoral Timo, Bruno Maria Antonietta
- Zamba: A Compact 7B SSM Hybrid Model Glorioso Paolo, Anthony Quentin, Tokpanov Yury, Whittington James, Pilault Jonathan, Ibrahim Adam, Millidge Beren
- Goldfinch: High Performance Rwkv/transformer Hybrid With Linear Pre-fill And Extreme Kv-cache Compression Goldstein Daniel, Obeid Fares, Alcaide Eric, Song Guangyu, Cheah Eugene
- Mechanistic Interpretability Of Large Language Models With Applications To The Financial Services Industry Golgoon Ashkan, Filom Khashayar, Kannan Arjun Ravi
- A Survey On Responsible Generative AI: What To Generate And What Not Gu Jindong
- Probing The Robustness Of Vision-language Pretrained Models: A Multimodal Adversarial Attack Approach Guan Jiwei, Ding Tianyu, Cao Longbing, Pan Lei, Wang Chen, Zheng Xi
- APTQ: Attention-aware Post-training Mixed-precision Quantization For Large Language Models Guan Ziyi, Huang Hantao, Su Yupeng, Huang Hong, Wong Ngai, Yu Hao
- Look Before You Leap: Towards Decision-aware And Generalizable Tool-usage For Large Language Models Gui Anchun, Li Jian, Dai Yong, Du Nan, Xiao Han
- Attention Score Is Not All You Need For Token Importance Indicator In KV Cache Reduction: Value Also Matters Guo Zhiyu, Kamigaito Hidetaka, Watanabe Taro
- Looking Into Black Box Code Language Models Haider Muhammad Umair, Farooq Umar, Siddique A. B., Marron Mark
- Enhancing CTR Prediction Through Sequential Recommendation Pre-training: Introducing The SRP4CTR Framework Han Ruidong, Li Qianzhong, Jiang He, Li Rui, Zhao Yurou, Li Xiang, Lin Wei
- EMMA: Your Text-to-image Diffusion Model Can Secretly Accept Multi-modal Prompts Han Yucheng, Wang Rui, Zhang Chi, Hu Juntao, Cheng Pei, Fu Bin, Zhang Hanwang
- A Survey On Hallucination In Large Vision-language Models Hanchao Liu, Wenyuan Xue, Yifei Chen, Dapeng Chen, Xiutian Zhao, Ke Wang, Liping Hou, Rongjun Li, Wei Peng
- Exploring Chatgpt And Its Impact On Society Haque Md. Asraful, Li Shuai
- Do Large Language Models Speak All Languages Equally? A Comparative Study In Low-resource Settings Hasan Md. Arid, Tarannum Prerona, Dey Krishno, Razzak Imran, Naseem Usman
- The Unreasonable Effectiveness Of Easy Training Data For Hard Tasks Hase Peter, Bansal Mohit, Clark Peter, Wiegreffe Sarah
- Multi-modal Instruction Tuned Llms With Fine-grained Visual Perception He Junwen, Wang Yifan, Wang Lijun, Lu Huchuan, He Jun-yan, Lan Jin-peng, Luo Bin, Xie Xuansong
- Pitvqa: Image-grounded Text Embedding LLM For Visual Question Answering In Pituitary Surgery He Runlong, Xu Mengya, Das Adrito, Khan Danyal Z., Bano Sophia, Marcus Hani J., Stoyanov Danail, Clarkson Matthew J., Islam Mobarakol
- What Matters In Transformers? Not All Attention Is Needed He Shwai, Sun Guoheng, Shen Zheyu, Li Ang
- BP4ER: Bootstrap Prompting For Explicit Reasoning In Medical Dialogue Generation He Yuhong, Zhang Yongqi, He Shizhu, Wan Jun
- Camelot: Towards Large Language Models With Training-free Consolidated Associative Memory He Zexue, Karlinsky Leonid, Kim Donghyun, Mcauley Julian, Krotov Dmitry, Feris Rogerio
- Why Do Llava Vision-language Models Reply To Images In English? Hinck Musashi, Holtermann Carolin, Olson Matthew Lyle, Schneider Florian, Yu Sungduk, Bhiwandiwalla Anahita, Lauscher Anne, Tseng Shaoyen, Lal Vasudev
- Free To Play: UN Trade And Development's Experience With Developing Its Own Open-source Retrieval Augmented Generation Large Language Model Application Hopp Daniel
- Non-linear Inference Time Intervention: Improving LLM Truthfulness Hoscilowicz Jakub, Wiacek Adam, Chojnacki Jan, Cieslak Adam, Michon Leszek, Urbanevych Vitalii, Janicki Artur
- Visual Prompting For Generalized Few-shot Segmentation: A Multi-scale Approach Hossain Mir Rayat Imtiaz, Siam Mennatullah, Sigal Leonid, Little James J.
- Enhancing And Accelerating Large Language Models Via Instruction-aware Contextual Compression Hou Haowen, Ma Fei, Bai Binwen, Zhu Xinxin, Yu Fei
- Found In The Middle: Calibrating Positional Attention Bias Improves Long Context Utilization Hsieh Cheng-yu, Chuang Yung-sung, Li Chun-liang, Wang Zifeng, Le Long T., Kumar Abhishek, Glass James, Ratner Alexander, Lee Chen-yu, Krishna Ranjay, Pfister Tomas
- Outlier-efficient Hopfield Layers For Large Transformer-based Models Hu Jerry Yao-chieh, Chang Pei-hsuan, Luo Robin, Chen Hong-yu, Li Weijian, Wang Wei-po, Liu Han
- Openrlhf: An Easy-to-use, Scalable And High-performance RLHF Framework Hu Jian, Wu Xibin, Wang Weixun, Xianyu, Zhang Dehao, Cao Yu
- Agentgen: Enhancing Planning Abilities For Large Language Model Based Agent Via Environment And Task Generation Hu Mengkang, Zhao Pu, Xu Can, Sun Qingfeng, Lou Jianguang, Lin Qingwei, Luo Ping, Rajmohan Saravan, Zhang Dongmei
- Mixed Sparsity Training: Achieving 4\(\times\) FLOP Reduction For Transformer Pretraining Hu Pihe, Li Shaolong, Huang Longbo
- Can Perplexity Reflect Large Language Model's Ability In Long Text Understanding? Hu Yutong, Huang Quzhe, Tao Mingxu, Zhang Chen, Feng Yansong
- Exact And Efficient Unlearning For Large Language Model-based Recommendation Hu Zhiyu, Zhang Yang, Xiao Minghao, Wang Wenjie, Feng Fuli, He Xiangnan
- An Intermediate Fusion Vit Enables Efficient Text-image Alignment In Diffusion Models Hu Zizhao, Jia Shaochong, Rostami Mohammad
- Multimodal Task Vectors Enable Many-shot Multimodal In-context Learning Huang Brandon, Mitra Chancharik, Arbelle Assaf, Karlinsky Leonid, Darrell Trevor, Herzig Roei
- Integrating Multi-scale Contextualized Information For Byte-based Neural Machine Translation Huang Langlin, Feng Yang
- The Solution For The 5th GCAIAC Zero-shot Referring Expression Comprehension Challenge Huang Longfei, Yu Feng, Guan Zhihao, Wan Zhonghua, Yang Yang
- Brainchat: Decoding Semantic Information From Fmri Using Vision-language Pretrained Models Huang Wanaiu
- Ml-mamba: Efficient Multi-modal Large Language Model Utilizing Mamba-2 Huang Wenjun, Pan Jiakai, Tang Jiahao, Ding Yanyu, Xing Yifei, Wang Yuhe, Wang Zhengzhuo, Hu Jianguo
- Trustllm: Trustworthiness In Large Language Models Huang Yue, Sun Lichao, Wang Haoran, Wu Siyuan, Zhang Qihui, Li Yuan, Gao Chujie, Huang Yixin, Lyu Wenhan, Zhang Yixuan, Li Xiner, Liu Zhengliang, Liu Yixin, Wang Yijue, Zhang Zhikun, Vidgen Bertie, Kailkhura Bhavya, Xiong Caiming, Xiao Chaowei, Li Chunyuan, Xing Eric, Huang Furong, Liu Hao, Ji Heng, Wang Hongyi, Zhang Huan, Yao Huaxiu, Kellis Manolis, Zitnik Marinka, Jiang Meng, Bansal Mohit, Zou James, Pei Jian, Liu Jian, Gao Jianfeng, Han Jiawei, Zhao Jieyu, Tang Jiliang, Wang Jindong, Vanschoren Joaquin, Mitchell John, Shu Kai, Xu Kaidi, Chang Kai-wei, He Lifang, Huang Lifu, Backes Michael, Gong Neil Zhenqiang, Yu Philip S., Chen Pin-yu, Gu Quanquan, Xu Ran, Ying Rex, Ji Shuiwang, Jana Suman, Chen Tianlong, Liu Tianming, Zhou Tianyi, Wang William, Li Xiang, Zhang Xiangliang, Wang Xiao, Xie Xing, Chen Xun, Wang Xuyu, Liu Yan, Ye Yanfang, Cao Yinzhi, Chen Yong, Zhao Yue
- Obscureprompt: Jailbreaking Large Language Models Via Obscure Input Huang Yue, Tang Jingyu, Chen Dongping, Tang Bingda, Wan Yao, Sun Lichao, Zhang Xiangliang
- Fastfid: Improve Inference Efficiency Of Open Domain Question Answering Via Sentence Selection Huang Yufei, Han Xu, Sun Maosong
- Transformerfam: Feedback Attention Is Working Memory Hwang Dongseong, Wang Weiran, Huo Zhuoyuan, Sim Khe Chai, Mengibar Pedro Moreno
- Test-time Low Rank Adaptation Via Confidence Maximization For Zero-shot Generalization Of Vision-language Models Imam Raza, Gani Hanan, Huzaifa Muhammad, Nandakumar Karthik
- Hallucinations Or Attention Misdirection? The Path To Strategic Value Extraction In Business Using Large Language Models Ioste Aline
- Are Human Conversations Special? A Large Language Model Perspective Jawale Toshish, Animesh Chaitanya, Vallath Sekhar, Talamadupula Kartik, Heck Larry
- Linking In-context Learning In Transformers To Human Episodic Memory Ji-an Li, Zhou Corey Y., Benna Marcus K., Mattar Marcelo G.
- Time Matters: Enhancing Pre-trained News Recommendation Models With Robust User Dwell Time Injection Jiang Hao, Li Chuanzhen, An Mingxiao
- Minference 1.0: Accelerating Pre-filling For Long-context Llms Via Dynamic Sparse Attention Jiang Huiqiang, Li Yucheng, Zhang Chengruidong, Wu Qianhui, Luo Xufang, Ahn Surin, Han Zhenhua, Abdi Amir H., Li Dongsheng, Lin Chin-yew, Yang Yuqing, Qiu Lili
- MC²: Multi-concept Guidance For Customized Multi-concept Generation Jiang Jiaxiu, Zhang Yabo, Feng Kailai, Wu Xiaohe, Zuo Wangmeng
- Enhancing Robustness In Large Language Models: Prompting For Mitigating The Impact Of Irrelevant Information Jiang Ming, Huang Tingting, Guo Biao, Lu Yao, Zhang Feng
- Do Llms Dream Of Elephants (when Told Not To)? Latent Concept Association And Associative Memory In Transformers Jiang Yibo, Rajendran Goutham, Ravikumar Pradeep, Aragam Bryon
- Learning To Edit: Aligning Llms With Knowledge Editing Jiang Yuxin, Wang Yufei, Wu Chuhan, Zhong Wanjun, Zeng Xingshan, Gao Jiahui, Li Liangyou, Jiang Xin, Shang Lifeng, Tang Ruiming, Liu Qun, Wang Wei
- Prompt-based Length Controlled Generation With Multiple Control Types Jie Renlong, Meng Xiaojun, Shang Lifeng, Jiang Xin, Liu Qun
- LLM Maybe Longlm: Self-extend LLM Context Window Without Tuning Jin Hongye, Han Xiaotian, Yang Jingfeng, Jiang Zhimeng, Liu Zirui, Chang Chia-yuan, Chen Huiyuan, Hu Xia
- Self-selected Attention Span For Accelerating Large Language Model Inference Jin Tian, Yazar Wanzin, Xu Zifei, Sharify Sayeh, Wang Xin
- Video-lavit: Unified Video-language Pre-training With Decoupled Visual-motional Tokenization Jin Yang, Sun Zhicheng, Xu Kun, Xu Kun, Chen Liwei, Jiang Hao, Huang Quzhe, Song Chengru, Liu Yuliang, Zhang Di, Song Yang, Gai Kun, Mu Yadong
- Cutting Off The Head Ends The Conflict: A Mechanism For Interpreting And Mitigating Knowledge Conflicts In Language Models Jin Zhuoran, Cao Pengfei, Yuan Hongbang, Chen Yubo, Xu Jiexin, Li Huaijun, Jiang Xiaojian, Liu Kang, Zhao Jun
- A2SF: Accumulative Attention Scoring With Forgetting Factor For Token Pruning In Transformer Decoder Jo Hyun-rae, Shin Dongkun
- Using Large Language Models To Assess Tutors' Performance In Reacting To Students Making Math Errors Kakarla Sanjit, Thomas Danielle, Lin Jionghao, Gupta Shivang, Koedinger Kenneth R.
- Understanding The Effect Of Using Semantically Meaningful Tokens For Visual Representation Learning Kalibhat Neha, Kattakinda Priyatham, Zarei Arman, Seleznev Nikita, Sharpe Samuel, Kumar Senthil, Feizi Soheil
- Shortened Llama: Depth Pruning For Large Language Models With Comparison Of Retraining Methods Kim Bo-kyeong, Kim Geonmin, Kim Tae-ho, Castells Thibault, Choi Shinkook, Shin Junho, Song Hyoung-kyu
- A Mechanistic Interpretation Of Syllogistic Reasoning In Auto-regressive Language Models Kim Geonhee, Valentino Marco, Freitas André
- Strategic Data Ordering: Enhancing Large Language Model Performance Through Curriculum Learning Kim Jisu, Lee Juhwan
- Augmenting Query And Passage For Retrieval-augmented Generation Using Llms For Open-domain Question Answering Kim Minsang, Park Cheoneum, Baek Seungjun
- Recmind: Japanese Movie Recommendation Dialogue With Seeker's Internal State Kodama Takashi, Kiyomaru Hirokazu, Huang Yin Jou, Kurohashi Sadao
- Bitune: Bidirectional Instruction-tuning Kopiczko Dawid J., Blankevoort Tijmen, Asano Yuki M.
- Evaluating Language Models For Generating And Judging Programming Feedback Koutcheme Charles, Dainese Nicola, Hellas Arto, Sarsa Sami, Leinonen Juho, Ashraf Syed, Denny Paul
- Automatic Question-answer Generation For Long-tail Knowledge Kumar Rohan, Kim Youngmin, Ravi Sunitha, Sun Haitian, Faloutsos Christos, Salakhutdinov Ruslan, Yoon Minji
- Harmonizing Code-mixed Conversations: Personality-assisted Code-mixed Response Generation In Dialogues Kumar Shivani, Chakraborty Tanmoy
- Human Evaluation Of English-Irish Transformer-based NMT Lankford Séamus, Afli Haithem, Way Andy
- Transformers For Low-resource Languages: Is Féidir Linn! Lankford Séamus, Afli Haithem, Way Andy
- A Systematic Survey And Critical Review On Evaluating Large Language Models: Challenges, Limitations, And Recommendations Laskar Md Tahmid Rahman, Alqahtani Sawsan, Bari M Saiful, Rahman Mizanur, Khan Mohammad Abdullah Matin, Khan Haidar, Jahan Israt, Bhuiyan Amran, Tan Chee Wei, Parvez Md Rizwan, Hoque Enamul, Joty Shafiq, Huang Jimmy
- Frontier AI Ethics: Anticipating And Evaluating The Societal Impacts Of Generative Agents Lazar Seth
- Nv-embed: Improved Techniques For Training Llms As Generalist Embedding Models Lee Chankyu, Roy Rajarshi, Xu Mengyao, Raiman Jonathan, Shoeybi Mohammad, Catanzaro Bryan, Ping Wei
- Hip Attention: Sparse Sub-quadratic Attention With Hierarchical Attention Pruning Lee Heejun, Park Geon, Lee Youngwan, Kim Jina, Jeong Wonyoung, Jeon Myeongjae, Hwang Sung Ju
- A Survey Of Large Language Models In Finance (finllms) Lee Jean, Stevens Nicholas, Han Soyeon Caren, Song Minseok
- Multimodal Reasoning With Multimodal Knowledge Graph Lee Junlin, Wang Yequan, Li Jing, Zhang Min
- Infinigen: Efficient Generative Inference Of Large Language Models With Dynamic KV Cache Management Lee Wonbeom, Lee Jungi, Seo Junghwan, Sim Jaewoong
- Small-e: Small Language Model With Linear Attention For Efficient Speech Synthesis Lemerle Théodor, Obin Nicolas, Roebel Axel
- Concentrate Attention: Towards Domain-generalizable Prompt Optimization For Language Models Li Chengzhengxu, Liu Xiaoming, Zhang Zhaohan, Wang Yichen, Liu Chen, Lan Yu, Shen Chao
- On The Role Of Long-tail Knowledge In Retrieval Augmented Large Language Models Li Dongyang, Yan Junbing, Zhang Taolin, Wang Chengyu, He Xiaofeng, Huang Longtao, Xue Hui, Huang Jun
- Look Within, Why Llms Hallucinate: A Causal Perspective Li He, Chi Haoang, Liu Mingyu, Yang Wenjing
- Streamingdialogue: Prolonged Dialogue Learning Via Long Context Compression With Minimal Losses Li Jia-nan, Tu Quan, Mao Cunli, Yu Zhengtao, Wen Ji-rong, Yan Rui
- Focus On Your Question! Interpreting And Mitigating Toxic Cot Problems In Commonsense Reasoning Li Jiachun, Cao Pengfei, Wang Chenhao, Jin Zhuoran, Chen Yubo, Zeng Daojian, Liu Kang, Zhao Jun
- A Transformer With Stack Attention Li Jiaoda, White Jennifer C., Sachan Mrinmaya, Cotterell Ryan
- Stylechat: Learning Recitation-augmented Memory In Llms For Stylized Dialogue Generation Li Jinpeng, Zhang Zekai, Tu Quan, Cheng Xin, Zhao Dongyan, Yan Rui
- Measuring And Controlling Instruction (in)stability In Language Model Dialogs Li Kenneth, Liu Tianle, Bashkansky Naomi, Bau David, Viégas Fernanda, Pfister Hanspeter, Wattenberg Martin
- On The Power Of Convolution Augmented Transformer Li Mingchen, Zhang Xuechen, Huang Yixiao, Oymak Samet
- Bridging The Language Gap: Enhancing Multilingual Prompt-based Code Generation In Llms Via Zero-shot Cross-lingual Transfer Li Mingda, Mishra Abhijit, Mujumdar Utkarsh
- Coevol: Constructing Better Responses For Instruction Finetuning Through Multi-agent Cooperation Li Renhao, Tan Minghuan, Wong Derek F., Yang Min
- Anchored Answers: Unravelling Positional Bias In Gpt-2's Multiple-choice Questions Li Ruizhe, Gao Yanjun
- Prompt4vis: Prompting Large Language Models With Example Mining And Schema Filtering For Tabular Data Visualization Li Shuaimin, Chen Xuanang, Song Yuanfeng, Song Yunze, Zhang Chen
- FSMR: A Feature Swapping Multi-modal Reasoning Approach With Joint Textual And Visual Clues Li Shuang, Wang Jiahua, Wen Lijie
- MM-TTS: A Unified Framework For Multimodal, Prompt-induced Emotional Text-to-speech Synthesis Li Xiang, Cheng Zhi-qi, He Jun-yan, Peng Xiaojiang, Hauptmann Alexander G.
- Grounded Compositional And Diverse Text-to-3d With Pretrained Multi-view Diffusion Model Li Xiaolong, Mo Jiawei, Wang Ying, Parameshwara Chethan, Fei Xiaohan, Swaminathan Ashwin, Taylor Cj, Tu Zhuowen, Favaro Paolo, Soatto Stefano
- Finding Challenging Metaphors That Confuse Pretrained Language Models Li Yucheng, Guerin Frank, Lin Chenghua
- Dynamic Data Sampler For Cross-language Transfer Learning In Large Language Models Li Yudong, Feng Yuhao, Zhou Wen, Zhao Zhe, Shen Linlin, Hou Cheng, Hou Xianxu
- Spin Glass Model Of In-context Learning Li Yuhao, Bai Ruoran, Huang Haiping
- VSTAR: Generative Temporal Nursing For Longer Dynamic Video Synthesis Li Yumeng, Beluch William, Keuper Margret, Zhang Dan, Khoreva Anna
- Amphista: Accelerate LLM Inference With Bi-directional Multiple Drafting Heads In A Non-autoregressive Style Li Zeping, Yang Xinlong, Gao Ziheng, Liu Ji, Liu Zhuang, Li Dong, Peng Jinzhang, Tian Lu, Barsoum Emad
- Understanding And Patching Compositional Reasoning In Llms Li Zhaoyi, Jiang Gangwei, Xie Hong, Song Linqi, Lian Defu, Wei Ying
- Focusllm: Scaling Llm's Context By Parallel Decoding Li Zhenyu, Zhang Yike, Pan Tengyu, Sun Yutao, Duan Zhichao, Fang Junjie, Han Rong, Wang Zixuan, Wang Jianyong
- Cog-ga: A Large Language Models-based Generative Agent For Vision-language Navigation In Continuous Environments Li Zhiyuan, Lu Yanfeng, Mu Yao, Qiao Hong
- Two Stacks Are Better Than One: A Comparison Of Language Modeling And Translation As Multilingual Pretraining Objectives Li Zihao, Ji Shaoxiong, Mickus Timothee, Segonne Vincent, Tiedemann Jörg
- Mitigating Hallucination In Visual-language Models Via Re-balancing Contrastive Decoding Liang Xiaoyu, Yu Jiayuan, Mu Lianrui, Zhuang Jiedong, Hu Jiaqi, Yang Yuchen, Ye Jiangnan, Lu Lu, Chen Jian, Hu Haoji
- Why Are My Prompts Leaked? Unraveling Prompt Extraction Threats In Customized Large Language Models Liang Zi, Hu Haibo, Ye Qingqing, Xiao Yaxin, Li Haoyang
- Apiq: Finetuning Of 2-bit Quantized Large Language Model Liao Baohao, Herold Christian, Khadivi Shahram, Monz Christof
- Beyond KV Caching: Shared Attention For Efficient Llms Liao Bingli, Vargas Danilo Vasconcellos
- Extending Token Computation For LLM Reasoning Liao Bingli, Vargas Danilo Vasconcellos
- D2LLM: Decomposed And Distilled Large Language Models For Semantic Search Liao Zihan, Yu Hang, Li Jianguo, Wang Jun, Zhang Wei
- Retrieval-augmented Natural Language Reasoning For Explainable Visual Question Answering Lim Su Hyeon, Kim Minkuk, Kim Hyeon Bae, Kim Seong Tae
- Tree-based Hard Attention With Self-motivation For Large Language Models Lin Chenxi, Ren Jiayu, He Guoxiu, Jiang Zhuoren, Yu Haiyan, Zhu Xiaomin
- Mixture Of In-context Experts Enhance Llms' Long Context Awareness Lin Hongzhan, Lv Ang, Chen Yuhan, Zhu Chen, Song Yang, Zhu Hengshu, Yan Rui
- Efficient LLM Training And Serving With Heterogeneous Context Sharding Among Attention Heads Lin Xihui, Zhang Yunan, Ge Suyu, Patra Barun, Chaudhary Vishrav, Peng Hao, Song Xia
- Qserve: W4A8KV4 Quantization And System Co-design For Efficient LLM Serving Lin Yujun, Tang Haotian, Yang Shang, Zhang Zhekai, Xiao Guangxuan, Gan Chuang, Han Song
- Contextualized Sequence Likelihood: Enhanced Confidence Scores For Natural Language Generation Lin Zhen, Trivedi Shubhendu, Sun Jimeng
- Boosting Multimodal Large Language Models With Visual Tokens Withdrawal For Rapid Inference Lin Zhihang, Lin Mingbao, Lin Luxi, Ji Rongrong
- Unveiling Vulnerability Of Self-attention Liong Khai Jiet, Wu Hongqiu, Zhao Hai
- More Than Catastrophic Forgetting: Integrating General Capabilities For Domain-specific Llms Liu Chengyuan, Wang Shihang, Kang Yangyang, Qing Lizhi, Zhao Fubang, Sun Changlong, Kuang Kun, Wu Fei
- Few-shot Class Incremental Learning With Attention-aware Self-adaptive Prompt Liu Chenxi, Wang Zhenyi, Xiong Tianyi, Chen Ruibo, Wu Yihan, Guo Junfeng, Huang Heng
- Openeval: Benchmarking Chinese Llms Across Capability, Alignment And Safety Liu Chuang, Yu Linhao, Li Jiaxuan, Jin Renren, Huang Yufei, Shi Ling, Zhang Junhui, Ji Xinmeng, Cui Tingting, Liu Tao, Song Jinwang, Zan Hongying, Li Sun, Xiong Deyi
- Mmgrec: Multimodal Generative Recommendation With Transformer Model Liu Han, Wei Yinwei, Song Xuemeng, Guan Weili, Li Yuan-fang, Nie Liqiang
- Laida: Linguistics-aware In-context Learning With Data Augmentation For Metaphor Components Identification Liu Hongde, He Chenyuan, Meng Feiyang, Niu Changyong, Jia Yuxiang
- Enhancing Llm's Cognition Via Structurization Liu Kai, Fu Zhihang, Chen Chao, Zhang Wei, Jiang Rongxin, Zhou Fan, Chen Yaowu, Wu Yue, Ye Jieping
- Intactkv: Improving Large Language Model Quantization By Keeping Pivot Tokens Intact Liu Ruikang, Bai Haoli, Lin Haokun, Li Yuening, Gao Han, Xu Zhengzhuo, Hou Lu, Yao Jun, Yuan Chun
- Are Llms Effective Backbones For Fine-tuning? An Experimental Investigation Of Supervised Llms On Chinese Short Text Matching Liu Shulin, Xu Chengcheng, Liu Hao, Yu Tinghao, Yang Tao
- Memlong: Memory-augmented Retrieval For Long Text Modeling Liu Weijie, Tang Zecheng, Li Juntao, Chen Kehai, Zhang Min
- Harnessing Large Language Models For Multimodal Product Bundling Liu Xiaohao, Wu Jie, Tao Zhulin, Ma Yunshan, Wei Yinwei, Chua Tat-seng
- Farewell To Length Extrapolation, A Training-free Infinite Context With Finite Attention Scope Liu Xiaoran, Guo Qipeng, Song Yuerong, Liu Zhigeng, Lv Kai, Yan Hang, Li Linlin, Liu Qun, Qiu Xipeng
- Scaffolding Language Learning Via Multi-modal Tutoring Systems With Pedagogical Instructions Liu Zhengyuan, Yin Stella Xin, Lee Carolyn, Chen Nancy F.
- A Closer Look Into Mixture-of-experts In Large Language Models Lo Ka Man, Huang Zeyu, Qiu Zihan, Wang Zili, Fu Jie
- Sparser Is Faster And Less Is More: Efficient Sparse Attention For Long-range Transformers Lou Chao, Jia Zixia, Zheng Zilong, Tu Kewei
- Longheads: Multi-head Attention Is Secretly A Long Context Processor Lu Yi, Zhou Xin, He Wei, Zhao Jun, Ji Tao, Gui Tao, Zhang Qi, Huang Xuanjing
- Asymptotic Theory Of In-context Learning By Linear Attention Lu Yue M., Letey Mary I., Zavatone-veth Jacob A., Maiti Anindita, Pehlevan Cengiz
- Crosstune: Black-box Few-shot Classification With Label Enhancement Luo Danqing, Zhang Chen, Zhang Yan, Li Haizhou
- Learning Positional Attention For Sequential Recommendation Luo Fan, Zhang Juan, Xu Shenghui
- Taking A Deep Breath: Enhancing Language Modeling Of Large Language Models With Sentinel Tokens Luo Weiyao, Zheng Suncong, Xia Heming, Wang Weikang, Lei Yan, Liu Tianyu, Chen Shuang, Sui Zhifang
- Turning Trash Into Treasure: Accelerating Inference Of Large Language Models With Token Recycling Luo Xianzhen, Wang Yixuan, Zhu Qingfu, Zhang Zhiming, Zhang Xuanyu, Yang Qing, Xu Dongliang, Che Wanxiang
- Interpreting Key Mechanisms Of Factual Recall In Transformer-based Language Models Lv Ang, Chen Yuhan, Zhang Kaiyi, Wang Yulong, Liu Lifeng, Wen Ji-rong, Xie Jian, Yan Rui
- EE-MLLM: A Data-efficient And Compute-efficient Multimodal Large Language Model Ma Feipeng, Zhou Yizhou, Li Hebei, He Zilong, Wu Siying, Rao Fengyun, Zhang Yueyi, Sun Xiaoyan
- Megalodon: Efficient LLM Pretraining And Inference With Unlimited Context Length Ma Xuezhe, Yang Xiaomeng, Xiong Wenhan, Chen Beidi, Yu Lili, Zhang Hao, May Jonathan, Zettlemoyer Luke, Levy Omer, Zhou Chunting
- Steering Conversational Large Language Models For Long Emotional Support Conversations Madani Navid, Saha Sougata, Srihari Rohini
- Learning To Plan Long-term For Language Modeling Mai Florian, Cornille Nathan, Moens Marie-francine
- Matmul Or No Matmul In The Era Of 1-bit Llms Malekar Jinendra, Elbtity Mohammed E., Zand Ramtin
- ULLME: A Unified Framework For Large Language Model Embeddings With Generation-augmented Learning Man Hieu, Ngo Nghia Trung, Dernoncourt Franck, Nguyen Thien Huu
- Beware Of Words: Evaluating The Lexical Richness Of Conversational Large Language Models Martínez Gonzalo, Hernández José Alberto, Conde Javier, Reviriego Pedro, Merino Elena
- Phybench: A Physical Commonsense Benchmark For Evaluating Text-to-image Models Meng Fanqing, Shao Wenqi, Luo Lixin, Wang Yahong, Chen Yiran, Lu Quanfeng, Yang Yue, Yang Tianshuo, Zhang Kaipeng, Qiao Yu, Luo Ping
- DCR: Divide-and-conquer Reasoning For Multi-choice Question Answering With Llms Meng Zijie, Zhang Yan, Feng Zhaopeng, Liu Zuozhu
- Logic-enhanced Language Model Agents For Trustworthy Social Simulations Mensfelt Agnieszka, Stathis Kostas, Trencsenyi Vince
- Linearizing Large Language Models Mercat Jean, Vasiljevic Igor, Keh Sedrick, Arora Kushal, Dave Achal, Gaidon Adrien, Kollar Thomas
- I Have An Attention Bridge To Sell You: Generalization Capabilities Of Modular Translation Architectures Mickus Timothee, Vázquez Raúl, Attieh Joseph
- Large Language Models: A Survey Minaee Shervin, Mikolov Tomas, Nikzad Narjes, Chenaghlu Meysam, Socher Richard, Amatriain Xavier, Gao Jianfeng
- E2TP: Element To Tuple Prompting Improves Aspect Sentiment Tuple Prediction Mohammadkhani Mohammad Ghiasvand, Ranjbar Niloofar, Momtazi Saeedeh
- Xc-cache: Cross-attending To Cached Context For Efficient LLM Inference Monteiro João, Marcotte Étienne, Noël Pierre-andré, Zantedeschi Valentina, Vázquez David, Chapados Nicolas, Pal Christopher, Taslakian Perouz
- Cross-layer Attention Sharing For Large Language Models Mu Yongyu, Wu Yuzhang, Fan Yuchun, Wang Chenglong, Li Hengyu, He Qiaozhi, Yang Murun, Xiao Tong, Zhu Jingbo
- Leave No Context Behind: Efficient Infinite Context Transformers With Infini-attention Munkhdalai Tsendsuren, Faruqui Manaal, Gopal Siddharth
- Compact Language Models Via Pruning And Knowledge Distillation Muralidharan Saurav, Sreenivas Sharath Turuvekere, Joshi Raviraj, Chochowski Marcin, Patwary Mostofa, Shoeybi Mohammad, Catanzaro Bryan, Kautz Jan, Molchanov Pavlo
- Beyond The Hype: A Dispassionate Look At Vision-language Models In Medical Scenario Nan Yang, Zhou Huichi, Xing Xiaodan, Yang Guang
- Dynamic Memory Compression: Retrofitting Llms For Accelerated Inference Nawrot Piotr, Łańcucki Adrian, Chochowski Marcin, Tarjan David, Ponti Edoardo M.
- Concise Thoughts: Impact Of Output Length On LLM Reasoning And Cost Nayab Sania, Rossolini Giulio, Buttazzo Giorgio, Manes Nicolamaria, Giacomelli Fabrizio
- Improving Robustness Of Llm-based Speech Synthesis By Learning Monotonic Alignment Neekhara Paarth, Hussain Shehzeen, Ghosh Subhankar, Li Jason, Valle Rafael, Badlani Rohan, Ginsburg Boris
- Needle In The Haystack For Memory Based Large Language Models Nelson Elliot, Kollias Georgios, Das Payel, Chaudhury Subhajit, Dan Soham
- Interpreting Context Look-ups In Transformers: Investigating Attention-mlp Interactions Neo Clement, Cohen Shay B., Barez Fazl
- Dude: Dual Distribution-aware Context Prompt Learning For Large Vision-language Model Nguyen Duy M. H., Le An T., Nguyen Trung Q., Diep Nghiem T., Nguyen Tai, Duong-tran Duy, Peters Jan, Shen Li, Niepert Mathias, Sonntag Daniel
- Layer-wise Regularized Dropout For Neural Language Models Ni Shiwen, Yang Min, Xu Ruifeng, Li Chengming, Hu Xiping
- Llm-based Conversational AI Therapist For Daily Functioning Screening And Psychotherapeutic Intervention Via Everyday Smart Devices Nie Jingping, Shao Hanya, Fan Yuang, Shao Qijia, You Haoxuan, Preindl Matthias, Jiang Xiaofan
- User-llm: Efficient LLM Contextualization With User Embeddings Ning Lin, Liu Luyang, Wu Jiaxing, Wu Neo, Berlowitz Devora, Prakash Sushant, Green Bradley, O'banion Shawn, Xie Jun
- Quantified Task Misalignment To Inform PEFT: An Exploration Of Domain Generalization And Catastrophic Forgetting In CLIP Niss Laura, Vogt-lowell Kevin, Tsiligkaridis Theodoros
- Fine-tuning With HED-IT: The Impact Of Human Post-editing For Dialogical Language Models Occhipinti Daniela, Marchi Michele, Mondella Irene, Lai Huiyuan, Dell'orletta Felice, Nissim Malvina, Guerini Marco
- Robust Adaptation Of Foundation Models With Black-box Visual Prompting Oh Changdae, Seo Gyeongdeok, Jung Geunyoung, Cheng Zhi-qi, Choi Hosik, Jung Jiyoung, Song Kyungwoo
- The Accuracy Of Domain Specific And Descriptive Analysis Generated By Large Language Models Otieno Denish Omondi, Abri Faranak, Siami-namini Sima, Namin Akbar Siami
- Easyinstruct: An Easy-to-use Instruction Processing Framework For Large Language Models Ou Yixin, Zhang Ningyu, Gui Honghao, Xu Ziwen, Qiao Shuofei, Xue Yida, Fang Runnan, Liu Kangwei, Li Lei, Bi Zhen, Zheng Guozhou, Chen Huajun
- QLSC: A Query Latent Semantic Calibrator For Robust Extractive Question Answering Ouyang Sheng, Wang Jianzong, Zhang Yong, Li Zhitao, Liang Ziqi, Zhang Xulong, Cheng Ning, Xiao Jing
- Instinfer: In-storage Attention Offloading For Cost-effective Long-context LLM Inference Pan Xiurui, Li Endian, Li Qiao, Liang Shengwen, Shan Yizhou, Zhou Ke, Luo Yingwei, Wang Xiaolin, Zhang Jie
- Salute The Classic: Revisiting Challenges Of Machine Translation In The Age Of Large Language Models Pang Jianhui, Ye Fanghua, Wang Longyue, Yu Dian, Wong Derek F., Shi Shuming, Tu Zhaopeng
- Anchor-based Large Language Models Pang Jianhui, Ye Fanghua, Wong Derek Fai, He Xin, Chen Wanshun, Wang Longyue
- Attndreambooth: Towards Text-aligned Personalized Text-to-image Generation Pang Lianyu, Yin Jian, Zhao Baoquan, Wu Feize, Wang Fu Lee, Li Qing, Mao Xudong
- Learning To See But Forgetting To Follow: Visual Instruction Tuning Makes Llms More Prone To Jailbreak Attacks Pantazopoulos Georgios, Parekh Amit, Nikandrou Malvina, Suglia Alessandro
- Loramap: Harnessing The Power Of Lora Connections Park Hyeryun, Kwak Jeongwon, Jang Dongsuk, Park Sumin, Choi Jinwook
- Token-picker: Accelerating Attention In Text Generation With Minimized Memory Transfer Via Probability Estimation Park Junyoung, Kang Myeonggu, Han Yunki, Kim Yanggon, Shin Jaekang, Kim Lee-sup
- Any-precision LLM: Low-cost Deployment Of Multiple, Different-sized Llms Park Yeonhong, Hyun Jake, Cho Sanglyul, Sim Bonggeun, Lee Jae W.
- Logicbench: Towards Systematic Evaluation Of Logical Reasoning Ability Of Large Language Models Parmar Mihir, Patel Nisarg, Varshney Neeraj, Nakamura Mutsumi, Luo Man, Mashetty Santosh, Mitra Arindam, Baral Chitta
- The Ultimate Guide To Fine-tuning Llms From Basics To Breakthroughs: An Exhaustive Review Of Technologies, Research, Best Practices, Applied Research Challenges And Opportunities Parthasarathy Venkatesh Balavadhani, Zafar Ahtsham, Khan Aafaq, Shahid Arsalan
- AI Revolution On Chat Bot: Evidence From A Randomized Controlled Experiment Peng Sida, Swiatek Wojciech, Gao Allen, Cullivan Paul, Chang Haoge
- Q-PEFT: Query-dependent Parameter Efficient Fine-tuning For Text Reranking With Large Language Models Peng Zhiyuan, Wu Xuyang, Wang Qifan, Rajanala Sravanthi, Fang Yi
- Prompting A Pretrained Transformer Can Be A Universal Approximator Petrov Aleksandar, Torr Philip H. S., Bibi Adel
- Investigating The Effectiveness Of Hypertuning Via Gisting Phang Jason
- Mamba-ptq: Outlier Channels In Recurrent Large Language Models Pierro Alessandro, Abreu Steven
- How Effective Are State Space Models For Machine Translation? Pitorro Hugo, Vasylenko Pavlo, Treviso Marcos, Martins André F. T.
- Dissecting Language Models: Machine Unlearning Via Selective Pruning Pochinkov Nicholas, Schoots Nandi
- Interpreting Bias In Large Language Models: A Feature-based Approach Prakash Nirmalendu, Roy Lee Ka Wei
- Less Is More: Accurate Speech Recognition & Translation Without Web-scale Data Puvvada Krishna C., Żelasko Piotr, Huang He, Hrinchuk Oleksii, Koluguri Nithin Rao, Dhawan Kunal, Majumdar Somshubra, Rastorgueva Elena, Chen Zhehuai, Lavrukhin Vitaly, Balam Jagadeesh, Ginsburg Boris
- Interactive Continual Learning: Fast And Slow Thinking Qi Biqing, Chen Xingquan, Gao Junqi, Li Dong, Liu Jianxing, Wu Ligang, Zhou Bowen
- Practical Token Pruning For Foundation Models In Few-shot Conversational Virtual Assistant Systems Qi Haode, Qian Cheng, Ni Jian, Singh Pratyush, Fazeli Reza, Wang Gengyu, Shu Zhongzheng, Wayne Eric, Bross Juergen
- Vl-mamba: Exploring State Space Models For Multimodal Learning Qiao Yanyuan, Yu Zheng, Guo Longteng, Chen Sihan, Zhao Zijia, Sun Mingzhen, Wu Qi, Liu Jing
- The Synergy Between Data And Multi-modal Large Language Models: A Survey From Co-development Perspective Qin Zhen, Chen Daoyuan, Zhang Wenhao, Yao Liuyi, Huang Yilun, Ding Bolin, Li Yaliang, Deng Shuiguang
- Lightning Attention-2: A Free Lunch For Handling Unlimited Sequence Lengths In Large Language Models Qin Zhen, Sun Weigao, Li Dong, Shen Xuyang, Sun Weixuan, Zhong Yiran
- Various Lengths, Constant Speed: Efficient Language Modeling With Lightning Attention Qin Zhen, Sun Weigao, Li Dong, Shen Xuyang, Sun Weixuan, Zhong Yiran
- Tool Learning With Large Language Models: A Survey Qu Changle, Dai Sunhao, Wei Xiaochi, Cai Hengyi, Wang Shuaiqiang, Yin Dawei, Xu Jun, Wen Ji-rong
- Memory-augmented Generative Adversarial Transformers Raaijmakers Stephan, Bakker Roos, Cremers Anita, De Kleijn Roy, Kouwenhoven Tom, Verhoef Tessa
- Simultaneous Masking, Not Prompting Optimization: A Paradigm Shift In Fine-tuning Llms For Simultaneous Translation Raffel Matthew, Agostinelli Victor, Chen Lizhong
- Report On The 1st Workshop On Large Language Model For Evaluation In Information Retrieval (llm4eval 2024) At SIGIR 2024 Rahmani Hossein A., Siro Clemencia, Aliannejadi Mohammad, Craswell Nick, Clarke Charles L. A., Faggioli Guglielmo, Mitra Bhaskar, Thomas Paul, Yilmaz Emine
- Scavenging Hyena: Distilling Transformers Into Long Convolution Models Ralambomihanta Tokiniaina Raharison, Mohammadzadeh Shahrad, Islam Mohammad Sami Nur, Jabbour Wassim, Liang Laurence
- Optimizing Visual Question Answering Models For Driving: Bridging The Gap Between Human And Machine Attention Patterns Rekanar Kaavya, Hayes Martin, Sistu Ganesh, Eising Ciaran
- Identifying Semantic Induction Heads To Understand In-context Learning Ren Jie, Guo Qipeng, Yan Hang, Liu Dongrui, Zhang Quanshi, Qiu Xipeng, Lin Dahua
- Unveiling And Mitigating Memorization In Text-to-image Diffusion Models Through Cross Attention Ren Jie, Li Yaxin, Zeng Shenglai, Xu Han, Lyu Lingjuan, Xing Yue, Tang Jiliang
- Samba: Simple Hybrid State Space Models For Efficient Unlimited Context Language Modeling Ren Liliang, Liu Yang, Lu Yadong, Shen Yelong, Liang Chen, Chen Weizhu
- On The Efficacy Of Eviction Policy For Key-value Constrained Generative Language Model Inference Ren Siyu, Zhu Kenny Q.
- Healthcare Copilot: Eliciting The Power Of General Llms For Medical Consultation Ren Zhiyao, Zhan Yibing, Yu Baosheng, Ding Liang, Tao Dacheng
- Code Generation With Alphacodium: From Prompt Engineering To Flow Engineering Ridnik Tal, Kredo Dedy, Friedman Itamar
- Explorations Of Self-repair In Language Models Rushing Cody, Nanda Neel
- How Chatgpt Changed The Media's Narratives On AI: A Semi-automated Narrative Analysis Through Frame Semantics Ryazanov Igor, Öhman Carl, Björklund Johanna
- Tandem Transformers For Inference Efficient Llms S Aishwarya P, Nair Pranav Ajit, Samaga Yashas, Boyd Toby, Kumar Sanjiv, Jain Prateek, Netrapalli Praneeth
- Neurocache: Efficient Vector Retrieval For Long-range Language Modeling Safaya Ali, Yuret Deniz
- LUNA: A Framework For Language Understanding And Naturalness Assessment Saidov Marat, Bakalova Aleksandra, Taktasheva Ekaterina, Mikhailov Vladislav, Artemova Ekaterina
- Zero-shot RTL Code Generation With Attention Sink Augmented Large Language Models Sandal Selim, Akturk Ismail
- Lean Attention: Hardware-aware Scalable Attention Mechanism For The Decode-phase Of Transformers Sanovar Rya, Bharadwaj Srikant, Amant Renee St., Rühle Victor, Rajmohan Saravan
- Speech Reallm -- Real-time Streaming Speech Recognition With Multimodal Llms By Teaching The Flow Of Time Seide Frank, Doulaty Morrie, Shi Yangyang, Gaur Yashesh, Jia Junteng, Wu Chunyang
- Train-attention: Meta-learning Where To Focus In Continual Knowledge Learning Seo Yeongbin, Lee Dongha, Yeo Jinyoung
- Multi-modal Adapter For Vision-language Models Seputis Dominykas, Mihailov Serghei, Chatterjee Soham, Xiao Zehao
- Medpromptx: Grounded Multimodal Prompting For Chest X-ray Diagnosis Shaaban Mai A., Khan Adnan, Yaqub Mohammad
- Flashattention-3: Fast And Accurate Attention With Asynchrony And Low-precision Shah Jay, Bikshandi Ganesh, Zhang Ying, Thakkar Vijay, Ramani Pradeep, Dao Tri
- Llava-prumerge: Adaptive Token Reduction For Efficient Large Multimodal Models Shang Yuzhang, Cai Mu, Xu Bingxin, Lee Yong Jae, Yan Yan
- Mammothmoda: Multi-modal Large Language Model She Qi, Pan Junwen, Wan Xin, Zhang Rui, Lu Dawei, Huang Kai
- Rapid Optimization For Jailbreaking Llms Via Subconscious Exploitation And Echopraxia Shen Guangyu, Cheng Siyuan, Zhang Kaiyuan, Tao Guanhong, An Shengwei, Yan Lu, Zhang Zhuo, Ma Shiqing, Zhang Xiangyu
- Pyramid Coder: Hierarchical Code Generator For Compositional Visual Question Answering Shen Ruoyue, Inoue Nakamasa, Shinoda Koichi
- PMG: Personalized Multimodal Generation With Large Language Models Shen Xiaoteng, Zhang Rui, Zhao Xiaoyan, Zhu Jieming, Xiao Xi
- An Empirical Study On Context Length For Open-domain Dialog Generation Shen Xinyi, Lin Zuoquan
- Scaling Laws For Linear Complexity Language Models Shen Xuyang, Li Dong, Leng Ruitao, Qin Zhen, Sun Weigao, Zhong Yiran
- Jetmoe: Reaching Llama2 Performance With 0.1M Dollars Shen Yikang, Guo Zhen, Cai Tianle, Qin Zengyi
- Language Models Encode Collaborative Signals In Recommendation Sheng Leheng, Zhang An, Zhang Yi, Chen Yuxin, Wang Xiang, Chua Tat-seng
- Paying More Attention To Image: A Training-free Method For Alleviating Hallucination In Lvlms Shi Liu, Kecheng Zheng, Wei Chen
- Action Controlled Paraphrasing Shi Ning, Wu Zijun
- Why Larger Language Models Do In-context Learning Differently? Shi Zhenmei, Wei Junyi, Xu Zhuoyan, Liang Yingyu
- Where Does In-context Translation Happen In Large Language Models Sia Suzanna, Mueller David, Duh Kevin
- Attention Overflow: Language Model Input Blur During Long-context Missing Items Recommendation Sileo Damien
- Constructing Benchmarks And Interventions For Combating Hallucinations In Llms Simhi Adi, Herzig Jonathan, Szpektor Idan, Belinkov Yonatan
- Prefixing Attention Sinks Can Mitigate Activation Outliers For Large Language Model Quantization Son Seungwoo, Park Wonpyo, Han Woohyun, Kim Kyuyeun, Lee Jaeho
- Increasing Model Capacity For Free: A Simple Strategy For Parameter Efficient Fine-tuning Song Haobo, Zhao Hao, Majumder Soumajit, Lin Tao
- Out-of-distribution Generalization Via Composition: A Lens Through Induction Heads In Transformers Song Jiajun, Xu Zhuoyan, Zhong Yiqiao
- Moma: Multimodal LLM Adapter For Fast Personalized Image Generation Song Kunpeng, Zhu Yizhe, Liu Bingchen, Yan Qing, Elgammal Ahmed, Yang Xiao
- Can Many-shot In-context Learning Help Long-context LLM Judges? See More, Judge Better! Song Mingyang, Zheng Mao, Luo Xuan
- M3GIA: A Cognition Inspired Multilingual And Multimodal General Intelligence Ability Benchmark Song Wei, Li Yadong, Xu Jianhua, Wu Guowei, Ming Lingfeng, Yi Kexin, Luo Weihua, Li Houyi, Du Yi, Guo Fangda, Yu Kaicheng
- Hierarchical Context Merging: Better Long Context Understanding For Pre-trained Llms Song Woomin, Oh Seunghyuk, Mo Sangwoo, Kim Jaehyung, Yun Sukmin, Ha Jung-woo, Shin Jinwoo
- Sharelora: Parameter Efficient And Robust Large Language Model Fine-tuning Via Shared Low-rank Adaptation Song Yurun, Zhao Junchen, Harris Ian G., Jyothi Sangeetha Abdu
- Views Are My Own, But Also Yours: Benchmarking Theory Of Mind Using Common Ground Soubki Adil, Murzaku John, Jordehi Arash Yousefi, Zeng Peter, Markowska Magdalena, Mirroshandel Seyed Abolghasem, Rambow Owen
- Enhancing Temporal Understanding In Audio Question Answering For Large Audio Language Models Sridhar Arvind Krishna, Guo Yinyi, Visser Erik
- ARKS: Active Retrieval In Knowledge Soup For Code Generation Su Hongjin, Jiang Shuyang, Lai Yuhang, Wu Haoyuan, Shi Boao, Liu Che, Liu Qian, Yu Tao
- SK-VQA: Synthetic Knowledge Generation At Scale For Training Context-augmented Multimodal Llms Su Xin, Luo Man, Pan Kris W, Chou Tien Pei, Lal Vasudev, Howard Phillip
- Llm-barber: Block-aware Rebuilder For Sparsity Mask In One-shot For Large Language Models Su Yupeng, Guan Ziyi, Liu Xiaoqun, Jin Tianlai, Wu Dongkuan, Chesi Graziano, Wong Ngai, Yu Hao
- Building Better AI Agents: A Provocation On The Utilisation Of Persona In Llm-based Conversational Agents Sun Guangzhi, Zhan Xiao, Such Jose
- F-eval: Assessing Fundamental Abilities With Refined Evaluation Methods Sun Yu, Chen Keyu, Wang Shujie, Li Peiji, Guo Qipeng, Yan Hang, Qiu Xipeng, Huang Xuanjing, Lin Dahua
- You Only Cache Once: Decoder-decoder Architectures For Language Models Sun Yutao, Dong Li, Zhu Yi, Huang Shaohan, Wang Wenhui, Ma Shuming, Zhang Quanlu, Wang Jianyong, Wei Furu
- Lloco: Learning Long Contexts Offline Tan Sijun, Li Xiuyu, Patil Shishir, Wu Ziyang, Zhang Tianjun, Keutzer Kurt, Gonzalez Joseph E., Popa Raluca Ada
- Can I Understand What I Create? Self-knowledge Evaluation Of Large Language Models Tan Zhiquan, Wei Lai, Wang Jindong, Xie Xing, Huang Weiran
- Ungrammatical-syntax-based In-context Example Selection For Grammatical Error Correction Tang Chenming, Qu Fanyi, Wu Yunfang
- Razorattention: Efficient KV Cache Compression Through Retrieval Heads Tang Hanlin, Lin Yang, Lin Jing, Han Qingsen, Hong Shikuan, Yao Yiwu, Wang Gongyi
- Training-free Consistent Text-to-image Generation Tewel Yoad, Kaduri Omri, Gal Rinon, Kasten Yoni, Wolf Lior, Chechik Gal, Atzmon Yuval
- Untie The Knots: An Efficient Data Augmentation Strategy For Long-context Pre-training In Language Models Tian Junfeng, Zheng Da, Cheng Yang, Wang Rui, Zhang Colin, Zhang Debing
- Selective Prompt Anchoring For Code Generation Tian Yuan, Zhang Tianyi
- LLM Circuit Analyses Are Consistent Across Training And Scale Tigges Curt, Hanna Michael, Yu Qinan, Biderman Stella
- Transformers Are Ssms: Generalized Models And Efficient Algorithms Through Structured State Space Duality Dao Tri, Gu Albert
- Charactereval: A Chinese Benchmark For Role-playing Conversational Agent Evaluation Tu Quan, Fan Shilong, Tian Zihang, Yan Rui
- LM Transparency Tool: Interactive Tool For Analyzing Transformer Language Models Tufanov Igor, Hambardzumyan Karen, Ferrando Javier, Voita Elena
- Enhancing Inference Efficiency Of Large Language Models: Investigating Optimization Strategies And Architectural Innovations Tyukin Georgy
- Attention Is All You Need But You Don't Need All Of It For Inference Of Large Language Models Tyukin Georgy, Dovonon Gbetondji J-s, Kaddour Jean, Minervini Pasquale
- Improved Baselines For Data-efficient Perceptual Augmentation Of Llms Vallaeys Théophane, Shukor Mustafa, Cord Matthieu, Verbeek Jakob
- Vi-mistral-x: Building A Vietnamese Language Model With Advanced Continual Pre-training Vo James
- From Rags To Rich Parameters: Probing How Language Models Utilize External Knowledge Over Parametric Information For Factual Queries Wadhwa Hitesh, Seetharaman Rahul, Aggarwal Somyaa, Ghosh Reshmi, Basu Samyadeep, Srinivasan Soundararajan, Zhao Wenlong, Chaudhari Shreyas, Aghazadeh Ehsan
- Learning To Refine With Fine-grained Natural Language Feedback Wadhwa Manya, Zhao Xinyu, Li Junyi Jessy, Durrett Greg
- An Empirical Study Of Mamba-based Language Models Waleffe Roger, Byeon Wonmin, Riach Duncan, Norick Brandon, Korthikanti Vijay, Dao Tri, Gu Albert, Hatamizadeh Ali, Singh Sudhakar, Narayanan Deepak, Kulshreshtha Garvit, Singh Vartika, Casper Jared, Kautz Jan, Shoeybi Mohammad, Catanzaro Bryan
- Temporal Blind Spots In Large Language Models Wallat Jonas, Jatowt Adam, Anand Avishek
- Teach Better Or Show Smarter? On Instructions And Exemplars In Automatic Prompt Optimization Wan Xingchen, Sun Ruoxi, Nakhost Hootan, Arik Sercan O.
- LOOK-M: Look-once Optimization In KV Cache For Efficient Multimodal Long-context Inference Wan Zhongwei, Wu Ziang, Liu Che, Huang Jinfa, Zhu Zhihong, Jin Peng, Wang Longyue, Yuan Li
- Pargo: Bridging Vision-language With Partial And Global Views Wang An-lan, Shan Bin, Shi Wei, Lin Kun-yu, Fei Xiang, Tang Guozhi, Liao Lei, Tang Jingqun, Huang Can, Zheng Wei-shi
- Information Theoretic Text-to-image Alignment Wang Chao, Franzese Giulio, Finamore Alessandro, Gallo Massimo, Michiardi Pietro
- Length Generalization Of Causal Transformers Without Position Encoding Wang Jie, Ji Tao, Wu Yuanbin, Yan Hang, Gui Tao, Zhang Qi, Huang Xuanjing, Wang Xiaoling
- The Mamba In The Llama: Distilling And Accelerating Hybrid Models Wang Junxiong, Paliotta Daniele, May Avner, Rush Alexander M., Dao Tri
- Leave No Document Behind: Benchmarking Long-context Llms With Extended Multi-doc QA Wang Minzheng, Chen Longze, Fu Cheng, Liao Shengyi, Zhang Xinghua, Wu Bingli, Yu Haiyang, Xu Nan, Zhang Lei, Luo Run, Li Yunshui, Yang Min, Huang Fei, Li Yongbin
- Deconfounded Causality-aware Parameter-efficient Fine-tuning For Problem-solving Improvement Of Llms Wang Ruoyu, Li Xiaoxuan, Yao Lina
- Unique Security And Privacy Threats Of Large Language Model: A Comprehensive Survey Wang Shang, Zhu Tianqing, Liu Bo, Ding Ming, Guo Xu, Ye Dayong, Zhou Wanlei, Yu Philip S.
- What Is The Best Way For Chatgpt To Translate Poetry? Wang Shanshan, Wong Derek F., Yao Jingming, Chao Lidia S.
- Mfc-bench: Benchmarking Multimodal Fact-checking With Large Vision-language Models Wang Shengkang, Lin Hongzhan, Luo Ziyang, Ye Zhen, Chen Guang, Ma Jing
- Learnable Item Tokenization For Generative Recommendation Wang Wenjie, Bao Honghui, Lin Xinyu, Zhang Jizhi, Li Yongqi, Feng Fuli, Ng See-kiong, Chua Tat-seng
- QUITO: Accelerating Long-context Reasoning Through Query-guided Context Compression Wang Wenshan, Wang Yihang, Fan Yixing, Liao Huaming, Guo Jiafeng
- Rdrec: Rationale Distillation For Llm-based Recommendation Wang Xinfeng, Cui Jin, Suzuki Yoshimi, Fukumoto Fumiyo
- Negativeprompt: Leveraging Psychology For Large Language Models Enhancement Via Negative Emotional Stimuli Wang Xu, Li Cheng, Chang Yi, Wang Jindong, Wu Yuan
- A Theoretical Understanding Of Self-correction Through In-context Alignment Wang Yifei, Wu Yuyang, Wei Zeming, Jegelka Stefanie, Wang Yisen
- HPT++: Hierarchically Prompting Vision-language Models With Multi-granularity Knowledge Generation And Improved Structure Modeling Wang Yubin, Jiang Xinyang, Cheng De, Sun Wenli, Li Dongsheng, Zhao Cairong
- Loma: Lossless Compressed Memory Attention Wang Yumeng, Xiao Zhenyang
- Factuality Of Large Language Models In The Year 2024 Wang Yuxia, Wang Minghan, Manzoor Muhammad Arslan, Liu Fei, Georgiev Georgi, Das Rocktim Jyoti, Nakov Preslav
- Promptcharm: Text-to-image Generation Through Multi-modal Prompting And Refinement Wang Zhijie, Huang Yuheng, Song Da, Ma Lei, Zhang Tianyi
- Eliminating Position Bias Of Language Models: A Mechanistic Approach Wang Ziqi, Zhang Hanlin, Li Xiner, Huang Kuan-hao, Han Chi, Ji Shuiwang, Kakade Sham M., Peng Hao, Ji Heng
- AGLA: Mitigating Object Hallucinations In Large Vision-language Models With Assembly Of Global And Local Attention Wenbin An, Feng Tian, Sicong Leng, Jiahao Nie, Haonan Lin, Qianying Wang, Guang Dai, Ping Chen, Shijian Lu
- Answer, Assemble, Ace: Understanding How Transformers Answer Multiple Choice Questions Wiegreffe Sarah, Tafjord Oyvind, Belinkov Yonatan, Hajishirzi Hannaneh, Sabharwal Ashish
- Don't Miss The Forest For The Trees: Attentional Vision Calibration For Large Vision Language Models Woo Sangmin, Kim Donguk, Jang Jaehyuk, Choi Yubin, Kim Changick
- What Are The Limits Of Cross-lingual Dense Passage Retrieval For Low-resource Languages? Wu Jie, Ren Zhaochun, Verberne Suzan
- GEB-1.3B: Open Lightweight Large Language Model Wu Jie, Zhu Yufeng, Shen Lei, Lu Xuqing
- Noiseboost: Alleviating Hallucination With Noise Perturbation For Multimodal Large Language Models Wu Kai, Jiang Boyuan, Jiang Zhengkai, He Qingdong, Luo Donghao, Wang Shengzhi, Liu Qingwen, Wang Chengjie
- Controlmllm: Training-free Visual Prompt Learning For Multimodal Large Language Models Wu Mingrui, Cai Xinyue, Ji Jiayi, Li Jiale, Huang Oucheng, Luo Gen, Fei Hao, Sun Xiaoshuai, Ji Rongrong
- Parallel Decoding Via Hidden Transfer For Lossless Large Language Model Acceleration Wu Pengfei, Liu Jiahao, Gong Zhuocheng, Wang Qifan, Li Jinpeng, Wang Jingang, Cai Xunliang, Zhao Dongyan
- GUI Action Narrator: Where And When Did That Action Take Place? Wu Qinchen, Gao Difei, Lin Kevin Qinghong, Wu Zhuoyu, Guo Xiangwu, Li Peiran, Zhang Weichen, Wang Hengxu, Shou Mike Zheng
- Not All Attention Is Needed: Parameter And Computation Efficient Transfer Learning For Multi-modal Large Language Models Wu Qiong, Ye Weihao, Zhou Yiyi, Sun Xiaoshuai, Ji Rongrong
- F-LMM: Grounding Frozen Large Multimodal Models Wu Size, Jin Sheng, Zhang Wenwei, Xu Lumin, Liu Wentao, Li Wei, Loy Chen Change
- Semantic Alignment For Multimodal Large Language Models Wu Tao, Li Mengze, Chen Jingyuan, Ji Wei, Lin Wang, Gao Jinyang, Kuang Kun, Zhao Zhou, Wu Fei
- Difflora: Generating Personalized Low-rank Adaptation Weights With Diffusion Wu Yujia, Shi Yiming, Wei Jiwei, Sun Chengwei, Zhou Yuyang, Yang Yang, Shen Heng Tao
- Clover-2: Accurate Inference For Regressive Lightweight Speculative Decoding Xiao Bin, Gui Lujun, Su Lei, Chen Weipeng
- Clover: Regressive Lightweight Speculative Decoding With Sequential Knowledge Xiao Bin, Shi Chunan, Nie Xiaonan, Yang Fan, Deng Xiangwei, Su Lei, Chen Weipeng, Cui Bin
- Infllm: Training-free Long-context Extrapolation For Llms With An Efficient Context Memory Xiao Chaojun, Zhang Pengle, Han Xu, Xiao Guangxuan, Lin Yankai, Zhang Zhengyan, Liu Zhiyuan, Sun Maosong
- A Comprehensive Survey Of Large Language Models And Multimodal Large Language Models In Medicine Xiao Hanguang, Zhou Feizhong, Liu Xingyue, Liu Tianqi, Li Zhipeng, Liu Xin, Huang Xiaoxuan
- Codeshell Technical Report Xie Rui, Zeng Zhengran, Yu Zhuohao, Gao Chang, Zhang Shikun, Ye Wei
- Decomposition For Enhancing Attention: Improving Llm-based Text-to-sql Through Workflow Paradigm Xie Yuanzhen, Jin Xinzhou, Xie Tao, Lin Mingxiong, Chen Liang, Yu Chenyun, Cheng Lei, Zhuo Chengxiang, Hu Bo, Li Zang
- Calibrating Reasoning In Language Models With Internal Consistency Xie Zhihui, Guo Jizhou, Yu Tong, Li Shuai
- Order Matters In Hallucination: Reasoning Order As Benchmark And Reflexive Prompting For Large-language-models Xie Zikai
- RALL-E: Robust Codec Language Modeling With Chain-of-thought Prompting For Text-to-speech Synthesis Xin Detai, Tan Xu, Shen Kai, Ju Zeqian, Yang Dongchao, Wang Yuancheng, Takamichi Shinnosuke, Saruwatari Hiroshi, Liu Shujie, Li Jinyu, Zhao Sheng
- EFUF: Efficient Fine-grained Unlearning Framework For Mitigating Hallucinations In Multimodal Large Language Models Xing Shangyu, Zhao Fei, Wu Zhen, An Tuo, Chen Weihao, Li Chunhui, Zhang Jianbing, Dai Xinyu
- Theoretical Understanding Of In-context Learning In Shallow Transformers With Unstructured Data Xing Yue, Lin Xiaofeng, Xu Chenheng, Suh Namjoon, Song Qifan, Cheng Guang
- Temporal Scaling Law For Large Language Models Xiong Yizhe, Chen Xiansheng, Ye Xin, Chen Hui, Lin Zijia, Lian Haoran, Su Zhenpeng, Niu Jianwei, Ding Guiguang
- Stylemaster: Towards Flexible Stylized Image Generation With Diffusion Models Xu Chengming, Hu Kai, Luo Donghao, Zhang Jiangning, Li Wei, Ge Yanhao, Wang Chengjie
- Understanding The Role Of Cross-entropy Loss In Fairly Evaluating Large Language Model-based Recommendation Xu Cong, Zhu Zhangchi, Wang Jun, Wang Jianyong, Zhang Wei
- Vtensor: Flexible Virtual Tensor Management For Efficient LLM Serving Xu Jiale, Zhang Rui, Guo Cong, Hu Weiming, Liu Zihan, Wu Feiyang, Feng Yu, Sun Shixuan, Shao Changxu, Guo Yuhong, Zhao Junping, Zhang Ke, Guo Minyi, Leng Jingwen
- Relating The Seemingly Unrelated: Principled Understanding Of Generalization For Generative Models In Arithmetic Reasoning Tasks Xu Xingcheng, Zhao Zibo, Zhang Haipeng, Yang Yanqing
- Libra: Building Decoupled Vision System On Large Language Models Xu Yifan, Yang Xiaoshan, Song Yaguang, Xu Changsheng
- Think: Thinner Key Cache By Query-driven Pruning Xu Yuhui, Jie Zhanming, Dong Hanze, Wang Lei, Lu Xudong, Zhou Aojun, Saha Amrita, Xiong Caiming, Sahoo Doyen
- Question Calibration And Multi-hop Modeling For Temporal Question Answering Xue Chao, Liang Di, Wang Pengfei, Zhang Jing
- Do Large Language Models Understand Logic Or Just Mimick Context? Yan Junbing, Wang Chengyu, Huang Jun, Zhang Wei
- 3d-properties: Identifying Challenges In DPO And Charting A Path Forward Yan Yuzi, Miao Yibo, Li Jialian, Zhang Yipin, Xie Jian, Deng Zhijie, Yan Dong
- Emphasising Structured Information: Integrating Abstract Meaning Representation Into Llms For Enhanced Open-domain Dialogue Evaluation Yang Bohao, Zhao Kun, Tang Chen, Liu Dong, Zhan Liang, Lin Chenghua
- Embedded Translations For Low-resource Automated Glossing Yang Changbing, Nicolai Garrett, Silfverberg Miikka
- Equipping Transformer With Random-access Reading For Long-context Understanding Yang Chenghao, Yang Zi, Hua Nan
- Pyramidinfer: Pyramid KV Cache Compression For High-throughput LLM Inference Yang Dongjie, Han Xiaodong, Gao Yan, Hu Yao, Zhang Shilin, Zhao Hai
- Seed-story: Multimodal Long Story Generation With Large Language Model Yang Shuai, Ge Yuying, Li Yang, Chen Yukang, Ge Yixiao, Shan Ying, Chen Yingcong
- Parallelizing Linear Transformers With The Delta Rule Over Sequence Length Yang Songlin, Wang Bailin, Zhang Yu, Shen Yikang, Kim Yoon
- The Butterfly Effect Of Model Editing: Few Edits Can Trigger Large Language Models Collapse Yang Wanli, Sun Fei, Ma Xinyu, Liu Xun, Yin Dawei, Cheng Xueqi
- DGL: Dynamic Global-local Prompt Tuning For Text-video Retrieval Yang Xiangpeng, Zhu Linchao, Wang Xiaohan, Yang Yi
- Dense Connector For Mllms Yao Huanjin, Wu Wenhao, Yang Taojiannan, Song Yuxin, Zhang Mengxi, Feng Haocheng, Sun Yifan, Li Zhiheng, Ouyang Wanli, Wang Jingdong
- Cacheblend: Fast Large Language Model Serving For RAG With Cached Knowledge Fusion Yao Jiayi, Li Hanchen, Liu Yuhan, Ray Siddhant, Cheng Yihua, Zhang Qizheng, Du Kuntai, Lu Shan, Jiang Junchen
- Mplug-owl3: Towards Long Image-sequence Understanding In Multi-modal Large Language Models Ye Jiabo, Xu Haiyang, Liu Haowei, Hu Anwen, Yan Ming, Qian Qi, Zhang Ji, Huang Fei, Zhou Jingren
- Efficient Vision-and-language Pre-training With Text-relevant Image Patch Selection Ye Wei, Jiang Chaoya, Xu Haiyang, Ye Chenhao, Li Chenliang, Yan Ming, Zhang Shikun, Huang Songhang, Huang Fei
- Long-context Language Modeling With Parallel Context Encoding Yen Howard, Gao Tianyu, Chen Danqi
- How Interpretable Are Reasoning Explanations From Prompting Large Language Models? Yeo Wei Jie, Satapathy Ranjan, Goh Rick Siow Mong, Cambria Erik
- Fine-grained Detoxification Via Instance-level Prefixes For Large Language Models Yi Xin, Wang Linlin, Wang Xiaoling, He Liang
- MEND: Meta Demonstration Distillation For Efficient And Effective In-context Learning Yichuan Li, Xiyao Ma, Sixing Lu, Kyumin Lee, Xiaohu Liu, Chenlei Guo
- Lofit: Localized Fine-tuning On LLM Representations Yin Fangcong, Ye Xi, Durrett Greg
- Stablemask: Refining Causal Masking In Decoder-only Transformer Yin Qingyu, He Xuzheng, Zhuang Xiang, Zhao Yu, Yao Jianhua, Shen Xiaoyu, Zhang Qiang
- Unveiling The Safety Of Gpt-4o: An Empirical Study Using Jailbreak Attacks Ying Zonghao, Liu Aishan, Liu Xianglong, Tao Dacheng
- Debunc: Mitigating Hallucinations In Large Language Model Agent Communication With Uncertainty Estimations Yoffe Luke, Amayuelas Alfonso, Wang William Yang
- C-TPT: Calibrated Test-time Prompt Tuning For Vision-language Models Via Text Feature Dispersion Yoon Hee Suk, Yoon Eunseop, Tee Joshua Tian Jin, Hasegawa-johnson Mark, Li Yingzhen, Yoo Chang D.
- Tree-planted Transformers: Unidirectional Transformer Language Models With Implicit Syntactic Supervision Yoshida Ryo, Someya Taiga, Oseki Yohei
- When Linear Attention Meets Autoregressive Decoding: Towards More Effective And Efficient Linearized Large Language Models You Haoran, Fu Yichao, Wang Zheng, Yazdanbakhsh Amir, Lin Yingyan Celine
- Shiftaddllm: Accelerating Pretrained Llms Via Post-training Multiplication-less Reparameterization You Haoran, Guo Yipin, Fu Yichao, Zhou Wei, Shi Huihong, Zhang Xiaofan, Kundu Souvik, Yazdanbakhsh Amir, Lin Yingyan Celine
- Effectively Compress KV Heads For LLM Yu Hao, Yang Zelan, Li Shen, Li Yong, Wu Jianxin
- A Cause-effect Look At Alleviating Hallucination Of Knowledge-grounded Dialogue Generation Yu Jifan, Zhang Xiaohan, Xu Yifan, Lei Xuanyu, Yao Zijun, Zhang Jing, Hou Lei, Li Juanzi
- Mechanistic Understanding And Mitigation Of Language Model Non-factual Hallucinations Yu Lei, Cao Meng, Cheung Jackie Chi Kit, Dong Yue
- Distilling System 2 Into System 1 Yu Ping, Xu Jing, Weston Jason, Kulikov Ilia
- Truth-aware Context Selection: Mitigating Hallucinations Of Large Language Models Being Misled By Untruthful Contexts Yu Tian, Zhang Shaolei, Feng Yang
- Mitigate Position Bias In Large Language Models Via Scaling A Single Dimension Yu Yijiong, Jiang Huiqiang, Luo Xufang, Wu Qianhui, Lin Chin-yew, Li Dongsheng, Yang Yuqing, Huang Yongfeng, Qiu Lili
- Unveiling And Harnessing Hidden Attention Sinks: Enhancing Large Language Models Without Training Through Attention Calibration Yu Zhongzhi, Wang Zheng, Fu Yonggan, Shi Huihong, Shaikh Khalid, Lin Yingyan Celine
- Whispers That Shake Foundations: Analyzing And Mitigating False Premise Hallucinations In Large Language Models Yuan Hongbang, Cao Pengfei, Jin Zhuoran, Chen Yubo, Zeng Daojian, Liu Kang, Zhao Jun
- Focused Large Language Models Are Stable Many-shot Learners Yuan Peiwen, Feng Shaoxiong, Li Yiwei, Wang Xinglin, Zhang Yueqi, Tan Chuyi, Pan Boyuan, Wang Heda, Hu Yao, Li Kan
- CO3: Low-resource Contrastive Co-training For Generative Conversational Query Rewrite Yuan Yifei, Shi Chen, Wang Runze, Chen Liyi, Hu Renjun, Zhang Zengming, Jiang Feijun, Lam Wai
- Wkvquant: Quantizing Weight And Key/value Cache For Large Language Models Gains More Yue Yuxuan, Yuan Zhihang, Duanmu Haojie, Zhou Sifan, Wu Jianlong, Nie Liqiang
- Focus On The Core: Efficient Attention Via Pruned Token Compression For Document Classification Yun Jungmin, Kim Mihyeon, Kim Youngbin
- Understanding And Mitigating Compositional Issues In Text-to-image Generative Models Zarei Arman, Rezaei Keivan, Basu Samyadeep, Saberi Mehrdad, Moayeri Mazda, Kattakinda Priyatham, Feizi Soheil
- Actions Speak Louder Than Words: Trillion-parameter Sequential Transducers For Generative Recommendations Zhai Jiaqi, Liao Lucy, Liu Xing, Wang Yueming, Li Rui, Cao Xuan, Gao Leon, Gong Zhaojie, Gu Fangda, He Michael, Lu Yinghai, Shi Yu
- Language Model Prompt Selection Via Simulation Optimization Zhang Haoting, He Jinghai, Righter Rhonda, Zheng Zeyu
- Are Large Language Models Good At Utility Judgments? Zhang Hengran, Zhang Ruqing, Guo Jiafeng, De Rijke Maarten, Fan Yixing, Cheng Xueqi
- Sinklora: Enhanced Efficiency And Chat Capabilities For Long-context Large Language Models Zhang Hengyu
- Paying More Attention To Source Context: Mitigating Unfaithful Translations From Large Language Model Zhang Hongbin, Chen Kehai, Bai Xuefeng, Xiang Yang, Zhang Min
- Enja: Ensemble Jailbreak On Large Language Models Zhang Jiahao, Wang Zilong, Wang Ruofan, Ma Xingjun, Jiang Yu-gang
- Agentohana: Design Unified Data And Training Pipeline For Effective Agent Learning Zhang Jianguo, Lan Tian, Murthy Rithesh, Liu Zhiwei, Yao Weiran, Tan Juntao, Hoang Thai, Yang Liangwei, Feng Yihao, Liu Zuxin, Awalgaonkar Tulika, Niebles Juan Carlos, Savarese Silvio, Heinecke Shelby, Wang Huan, Xiong Caiming
- Harmonized Speculative Sampling Zhang Lefan, Wang Xiaodan, Huang Yanhua, Xu Ruiwen
- Attention Instruction: Amplifying Attention In The Middle Via Prompting Zhang Meiru, Meng Zaiqiao, Collier Nigel
- The Hedgehog & The Porcupine: Expressive Linear Attentions With Softmax Mimicry Zhang Michael, Bhatia Kush, Kumbong Hermann, Ré Christopher
- Tinyllama: An Open-source Small Language Model Zhang Peiyuan, Zeng Guangtao, Wang Tianduo, Lu Wei
- Mathverse: Does Your Multi-modal LLM Truly See The Diagrams In Visual Math Problems? Zhang Renrui, Jiang Dongzhi, Zhang Yichi, Lin Haokun, Guo Ziyu, Qiu Pengshuo, Zhou Aojun, Lu Pan, Chang Kai-wei, Gao Peng, Li Hongsheng
- R4: Reinforced Retriever-reorder-responder For Retrieval-augmented Large Language Models Zhang Taolin, Li Dongyang, Chen Qizhou, Wang Chengyu, Huang Longtao, Xue Hui, He Xiaofeng, Huang Jun
- Promptfix: Few-shot Backdoor Removal Via Adversarial Prompt Tuning Zhang Tianrong, Xi Zhaohan, Wang Ting, Mitra Prasenjit, Chen Jinghui
- Nomad-attention: Efficient LLM Inference On Cpus Through Multiply-add-free Attention Zhang Tianyi, Yi Jonah Wonkyu, Yao Bowen, Xu Zhaozhuo, Shrivastava Anshumali
- Interpreting And Improving Large Language Models In Arithmetic Calculation Zhang Wei, Wan Chaoqun, Zhang Yonggang, Cheung Yiu-ming, Tian Xinmei, Shen Xu, Ye Jieping
- Large Language Models As Evaluators For Recommendation Explanations Zhang Xiaoyu, Li Yishan, Wang Jiayin, Sun Bowen, Ma Weizhi, Sun Peijie, Zhang Min
- Finercut: Finer-grained Interpretable Layer Pruning For Large Language Models Zhang Yang, Li Yawei, Wang Xinpeng, Shen Qianli, Plank Barbara, Bischl Bernd, Rezaei Mina, Kawaguchi Kenji
- Mme-realworld: Could Your Multimodal LLM Challenge High-resolution Real-world Scenarios That Are Difficult For Humans? Zhang Yi-fan, Zhang Huanyu, Tian Haochen, Fu Chaoyou, Zhang Shuangqing, Wu Junfei, Li Feng, Wang Kun, Wen Qingsong, Zhang Zhang, Wang Liang, Jin Rong, Tan Tieniu
- Wings: Learning Multimodal Llms Without Text-only Forgetting Zhang Yi-kai, Lu Shiyin, Li Yang, Ma Yanqing, Chen Qing-guo, Xu Zhao, Luo Weihua, Zhang Kaifu, Zhan De-chuan, Ye Han-jia
- MLKD-BERT: Multi-level Knowledge Distillation For Pre-trained Language Models Zhang Ying, Yang Ziheng, Ji Shufan
- Navigating User Experience Of Chatgpt-based Conversational Recommender Systems: The Effects Of Prompt Guidance And Recommendation Domain Zhang Yizhe, Jin Yucheng, Chen Li, Yang Ting
- Autocap: Towards Automatic Cross-lingual Alignment Planning For Zero-shot Chain-of-thought Zhang Yongheng, Chen Qiguang, Li Min, Che Wanxiang, Qin Libo
- Prompt-aware Adapter: Towards Learning Adaptive Visual Tokens For Multimodal Large Language Models Zhang Yue, Fan Hehe, Yang Yi
- A Survey On The Memory Mechanism Of Large Language Model Based Agents Zhang Zeyu, Bo Xiaohe, Ma Chen, Li Rui, Chen Xu, Dai Quanyu, Zhu Jieming, Dong Zhenhua, Wen Ji-rong
- Position-aware Parameter Efficient Fine-tuning Approach For Reducing Positional Bias In Llms Zhang Zheng, Yang Fan, Jiang Ziyan, Chen Zheng, Zhao Zhengyang, Ma Chengyuan, Zhao Liang, Liu Yang
- Found In The Middle: How Language Models Use Long Contexts Better Via Plug-and-play Positional Encoding Zhang Zhenyu, Chen Runjin, Liu Shiwei, Yao Zhewei, Ruwase Olatunji, Chen Beidi, Wu Xiaoxia, Wang Zhangyang
- Shieldlm: Empowering Llms As Aligned, Customizable And Explainable Safety Detectors Zhang Zhexin, Lu Yida, Ma Jingyuan, Zhang Di, Li Rui, Ke Pei, Sun Hao, Sha Lei, Sui Zhifang, Wang Hongning, Huang Minlie
- Glitchprober: Advancing Effective Detection And Mitigation Of Glitch Tokens In Large Language Models Zhang Zhibo, Bai Wuxia, Li Yuxi, Meng Mark Huasong, Wang Kailong, Shi Ling, Li Li, Wang Jun, Wang Haoyu
- LANE: Logic Alignment Of Non-tuning Large Language Models And Online Recommendation Systems For Explainable Reason Generation Zhao Hongke, Zheng Songming, Wu Likang, Yu Bowen, Wang Jing
- Diffagent: Fast And Accurate Text-to-image API Selection With Large Language Model Zhao Lirui, Yang Yue, Zhang Kaipeng, Shao Wenqi, Zhang Yuxin, Qiao Yu, Luo Ping, Ji Rongrong
- Efficiently Training 7B LLM With 1 Million Sequence Length On 8 Gpus Zhao Pinxue, Zhang Hailin, Fu Fangcheng, Nie Xiaonan, Liu Qibin, Yang Fang, Peng Yuanbo, Jiao Dian, Li Shuaipeng, Xue Jinbao, Tao Yangyu, Cui Bin
- Prepacking: A Simple Method For Fast Prefilling And Increased Throughput In Large Language Models Zhao Siyan, Israel Daniel, Broeck Guy Van Den, Grover Aditya
- Towards Comprehensive And Efficient Post Safety Alignment Of Large Language Models Via Safety Patching Zhao Weixiang, Hu Yulin, Li Zhuojun, Deng Yang, Zhao Yanyan, Qin Bing, Chua Tat-seng
- SAPT: A Shared Attention Framework For Parameter-efficient Continual Learning Of Large Language Models Zhao Weixiang, Wang Shilong, Hu Yulin, Zhao Yanyan, Qin Bing, Zhang Xuanyu, Yang Qing, Xu Dongliang, Che Wanxiang
- Enhancing Cross-domain Pre-trained Decision Transformers With Adaptive Attention Zhao Wenhao, Xu Qiushui, Xu Linjie, Song Lei, Wang Jinyu, Zhou Chunlai, Bian Jiang
- How Do Large Language Models Handle Multilingualism? Zhao Yiran, Zhang Wenxuan, Chen Guizhen, Kawaguchi Kenji, Bing Lidong
- ALISA: Accelerating Large Language Model Inference Via Sparsity-aware KV Caching Zhao Youpeng, Wu Di, Wang Jun
- SWIFT: A Scalable Lightweight Infrastructure For Fine-tuning Zhao Yuze, Huang Jintao, Hu Jinghan, Wang Xingjun, Mao Yunlin, Zhang Daoze, Jiang Zeyinzi, Wu Zhikai, Ai Baole, Wang Ang, Zhou Wenmeng, Chen Yingda
- Attention Heads Of Large Language Models: A Survey Zheng Zifan, Wang Yezhaohui, Huang Yuxin, Song Shichao, Tang Bo, Xiong Feiyu, Li Zhiyu
- Exploring Boundary Of GPT-4V On Marine Analysis: A Preliminary Case Study Zheng Ziqiang, Chen Yiwei, Zhang Jipeng, Vu Tuan-anh, Zeng Huimin, Tim Yue Him Wong, Yeung Sai-kit
- Understanding The Rope Extensions Of Long-context Llms: An Attention Perspective Zhong Meizhi, Zhang Chen, Lei Yikun, Liu Xikai, Gao Yan, Hu Yao, Chen Kehai, Zhang Min
- Unibias: Unveiling And Mitigating LLM Bias Through Internal Attention And FFN Manipulation Zhou Hanzhang, Feng Zijian, Zhu Zixiao, Qian Junlang, Mao Kezhi
- Pre-trained Large Language Models Use Fourier Features To Compute Addition Zhou Tianyi, Fu Deqing, Sharan Vatsal, Jia Robin
- Explaining Pre-trained Language Models With Attribution Scores: An Analysis In Low-resource Settings Zhou Wei, Adel Heike, Schuff Hendrik, Vu Ngoc Thang
- Few-shot Adversarial Prompt Learning On Vision-language Models Zhou Yiwei, Xia Xiaobo, Lin Zhiwei, Han Bo, Liu Tongliang
- A Survey On Efficient Inference For Large Language Models Zhou Zixuan, Ning Xuefei, Hong Ke, Fu Tianyu, Xu Jiaming, Li Shiyao, Lou Yuming, Wang Luning, Yuan Zhihang, Li Xiuhong, Yan Shengen, Dai Guohao, Zhang Xiao-ping, Dong Yuhan, Wang Yu
- Relayattention For Efficient Large Language Model Serving With Long System Prompts Zhu Lei, Wang Xinjiang, Zhang Wayne, Lau Rynson W. H.
- IAPT: Instruction-aware Prompt Tuning For Large Language Models Zhu Wei, Tian Aaron Xuxiang, Yin Congrui, Ni Yuan, Wang Xiaoling, Xie Guotong
- Can Large Language Models Understand Context? Zhu Yilun, Moniz Joel Ruben Antony, Bhargava Shruti, Lu Jiarui, Piraviperumal Dhivya, Li Site, Zhang Yuan, Yu Hong, Tseng Bo-hsiang
- Accelerating Inference Of Retrieval-augmented Generation Via Sparse Context Selection Zhu Yun, Gu Jia-chen, Sikora Caitlin, Ko Ho, Liu Yinxiao, Lin Chu-cheng, Shu Lei, Luo Liangchen, Meng Lei, Liu Bang, Chen Jindong
- Llama-excitor: General Instruction Tuning Via Indirect Feature Interaction Zou Bo, Yang Chao, Qiao Yu, Quan Chengbin, Zhao Youjian
- Image-to-text Logic Jailbreak: Your Imagination Can Help You Do Anything Zou Xiaotian, Li Ke, Chen Yongkang
- MLKV: Multi-layer Key-value Heads For Memory Efficient Transformer Decoding Zuhri Zayd Muhammad Kawakibi, Adilazuarda Muhammad Farid, Purwarianti Ayu, Aji Alham Fikri
🏷 BERT
- BERT: Pre-training Of Deep Bidirectional Transformers For Language Understanding Devlin Jacob, Chang Ming-wei, Lee Kenton, Toutanova Kristina
- Like A Baby: Visually Situated Neural Language Acquisition Ororbia Alexander G., Mali Ankur, Kelly Matthew A., Reitter David
- Can You Tell Me How To Get Past Sesame Street? Sentence-level Pretraining Beyond Language Modeling Wang Alex, Hula Jan, Xia Patrick, Pappagari Raghavendra, Mccoy R. Thomas, Patel Roma, Kim Najoung, Tenney Ian, Huang Yinghui, Yu Katherin, Jin Shuning, Chen Berlin, Van Durme Benjamin, Grave Edouard, Pavlick Ellie, Bowman Samuel R.
- Synthetic QA Corpora Generation With Roundtrip Consistency Alberti Chris, Andor Daniel, Pitler Emily, Devlin Jacob, Collins Michael
- A BERT Baseline For The Natural Questions Alberti Chris, Lee Kenton, Collins Michael
- Whatcha Lookin' At? Deeplifting Bert's Attention In Question Answering Arkhangelskaia Ekaterina, Dutta Sourav
- Meta Answering For Machine Reading Borschinger Benjamin, Boyd-graber Jordan, Buck Christian, Bulian Jannis, Ciaramita Massimiliano, Huebscher Michelle Chen, Gajewski Wojciech, Kilcher Yannic, Nogueira Rodrigo, Saralegu Lierni Sestorain
- Language Features Matter: Effective Language Representations For Vision-language Tasks Burns Andrea, Tan Reuben, Saenko Kate, Sclaroff Stan, Plummer Bryan A.
- BERTQA -- Attention On Steroids Chadha Ankit, Sood Rewa
- Making Neural Machine Reading Comprehension Faster Chatterjee Debajyoti
- Distilling Knowledge Learned In BERT For Text Generation Chen Yen-chun, Gan Zhe, Cheng Yu, Liu Jingzhou, Liu Jingjing
- Attending To Entities For Better Text Understanding Cheng Pengxiang, Erk Katrin
- Boolq: Exploring The Surprising Difficulty Of Natural Yes/no Questions Clark Christopher, Lee Kenton, Chang Ming-wei, Kwiatkowski Tom, Collins Michael, Toutanova Kristina
- On The Use Of BERT For Neural Machine Translation Clinchant Stéphane, Jung Kweon Woo, Nikoulina Vassilina
- Cross-lingual Machine Reading Comprehension Cui Yiming, Che Wanxiang, Liu Ting, Qin Bing, Wang Shijin, Hu Guoping
- Unified Language Model Pre-training For Natural Language Understanding And Generation Dong Li, Yang Nan, Wang Wenhui, Wei Furu, Liu Xiaodong, Wang Yu, Gao Jianfeng, Zhou Ming, Hon Hsiao-wuen
- MRQA 2019 Shared Task: Evaluating Generalization In Reading Comprehension Fisch Adam, Talmor Alon, Jia Robin, Seo Minjoon, Choi Eunsol, Chen Danqi
- Span Selection Pre-training For Question Answering Glass Michael, Gliozzo Alfio, Chakravarti Rishav, Ferritto Anthony, Pan Lin, Bhargav G P Shrivatsa, Garg Dinesh, Sil Avirup
- Reweighted Proximal Pruning For Large-scale Language Representation Guo Fu-ming, Liu Sijia, Mungall Finlay S., Lin Xue, Wang Yanzhi
- Exploring Neural Net Augmentation To BERT For Question Answering On SQUAD 2.0 Gupta Suhas
- Visualizing And Understanding The Effectiveness Of BERT Hao Yaru, Dong Li, Wei Furu, Xu Ke
- Convert: Efficient And Accurate Conversational Representations From Transformers Henderson Matthew, Casanueva Iñigo, Mrkšić Nikola, Su Pei-hao, Wen Tsung-hsien, Vulić Ivan
- Exbert: A Visual Analysis Tool To Explore Learned Representations In Transformers Models Hoover Benjamin, Strobelt Hendrik, Gehrmann Sebastian
- INSET: Sentence Infilling With Inter-sentential Transformer Huang Yichen, Zhang Yizhe, Elachqar Oussama, Cheng Yu
- Tinybert: Distilling BERT For Natural Language Understanding Jiao Xiaoqi, Yin Yichun, Shang Lifeng, Jiang Xin, Chen Xiao, Li Linlin, Wang Fang, Liu Qun
- Bipar: A Bilingual Parallel Dataset For Multilingual And Cross-lingual Reading Comprehension On Novels Jing Yimin, Xiong Deyi, Zhen Yan
- Microsoft Translator At WMT 2019: Towards Large-scale Document-level Neural Machine Translation Junczys-dowmunt Marcin
- Learning And Evaluating Contextual Embedding Of Source Code Kanade Aditya, Maniatis Petros, Balakrishnan Gogul, Shi Kensen
- Weak Supervision Helps Emergence Of Word-object Alignment And Improves Vision-language Tasks Kervadec Corentin, Antipov Grigory, Baccouche Moez, Wolf Christian
- Learning To Answer By Learning To Ask: Getting The Best Of GPT-2 And BERT Worlds Klein Tassilo, Nabi Moin
- Revealing The Dark Secrets Of BERT Kovaleva Olga, Romanov Alexey, Rogers Anna, Rumshisky Anna
- ALBERT: A Lite BERT For Self-supervised Learning Of Language Representations Lan Zhenzhong, Chen Mingda, Goodman Sebastian, Gimpel Kevin, Sharma Piyush, Soricut Radu
- What Would Elsa Do? Freezing Layers During Transformer Fine-tuning Lee Jaejun, Tang Raphael, Lin Jimmy
- BART: Denoising Sequence-to-sequence Pre-training For Natural Language Generation, Translation, And Comprehension Lewis Mike, Liu Yinhan, Goyal Naman, Ghazvininejad Marjan, Mohamed Abdelrahman, Levy Omer, Stoyanov Ves, Zettlemoyer Luke
- Bertsel: Answer Selection With Pre-trained Models Li Dongfang, Yu Yifei, Chen Qingcai, Li Xinyu
- Unicoder-vl: A Universal Encoder For Vision And Language By Cross-modal Pre-training Li Gen, Duan Nan, Fang Yuejian, Gong Ming, Jiang Daxin, Zhou Ming
- Pretrained Language Models For Document-level Neural Machine Translation Li Liangyou, Jiang Xin, Liu Qun
- Visualbert: A Simple And Performant Baseline For Vision And Language Li Liunian Harold, Yatskar Mark, Yin Da, Hsieh Cho-jui, Chang Kai-wei
- MKD: A Multi-task Knowledge Distillation Approach For Pretrained Language Models Liu Linqing, Wang Huan, Lin Jimmy, Socher Richard, Xiong Caiming
- Generative Question Refinement With Deep Reinforcement Learning In Retrieval-based QA System Liu Ye, Zhang Chenwei, Yan Xiaohui, Chang Yi, Yu Philip S.
- Roberta: A Robustly Optimized BERT Pretraining Approach Liu Yinhan, Ott Myle, Goyal Naman, Du Jingfei, Joshi Mandar, Chen Danqi, Levy Omer, Lewis Mike, Zettlemoyer Luke, Stoyanov Veselin
- Vilbert: Pretraining Task-agnostic Visiolinguistic Representations For Vision-and-language Tasks Lu Jiasen, Batra Dhruv, Parikh Devi, Lee Stefan
- Emergent Properties Of Finetuned Language Representation Models Matton Alexandre, De Oliveira Luke
- Structured Pruning Of A Bert-based Question Answering Model Mccarley J. S., Chakravarti Rishav, Sil Avirup
- Megatron-lm: Training Multi-billion Parameter Language Models Using Model Parallelism Shoeybi Mohammad, Patwary Mostofa, Puri Raul, Legresley Patrick, Casper Jared, Catanzaro Bryan
- Human Vs. Muppet: A Conservative Estimate Of Human Performance On The GLUE Benchmark Nangia Nikita, Bowman Samuel R.
- A Simple But Effective Method To Incorporate Multi-turn Context With BERT For Conversational Machine Comprehension Ohsugi Yasuhito, Saito Itsumi, Nishida Kyosuke, Asano Hisako, Tomita Junji
- Frustratingly Easy Natural Question Answering Pan Lin, Chakravarti Rishav, Ferritto Anthony, Glass Michael, Gliozzo Alfio, Roukos Salim, Florian Radu, Sil Avirup
- Thisiscompetition At Semeval-2019 Task 9: BERT Is Unstable For Out-of-domain Samples Park Cheoneum, Kim Juae, Lee Hyeon-gu, Amplayo Reinald Kim, Kim Harksoo, Seo Jungyun, Lee Changki
- Language Models As Knowledge Bases? Petroni Fabio, Rocktäschel Tim, Lewis Patrick, Bakhtin Anton, Wu Yuxiang, Miller Alexander H., Riedel Sebastian
- Answering Complex Open-domain Questions Through Iterative Query Generation Qi Peng, Lin Xiaowen, Mehr Leo, Wang Zijian, Manning Christopher D.
- Blockwise Self-attention For Long Document Understanding Qiu Jiezhong, Ma Hao, Levy Omer, Yih Scott Wen-tau, Wang Sinong, Tang Jie
- Probing Natural Language Inference Models Through Semantic Fragments Richardson Kyle, Hu Hai, Moss Lawrence S., Sabharwal Ashish
- Leveraging Pre-trained Checkpoints For Sequence Generation Tasks Rothe Sascha, Narayan Shashi, Severyn Aliaksei
- Is Multilingual BERT Fluent In Language Generation? Rönnqvist Samuel, Kanerva Jenna, Salakoski Tapio, Ginter Filip
- Masked Language Model Scoring Salazar Julian, Liang Davis, Nguyen Toan Q., Kirchhoff Katrin
- Distilbert, A Distilled Version Of BERT: Smaller, Faster, Cheaper And Lighter Sanh Victor, Debut Lysandre, Chaumond Julien, Wolf Thomas
- Inducing Brain-relevant Bias In Natural Language Processing Models Schwartz Dan, Toneva Mariya, Wehbe Leila
- The Dialogue Dodecathlon: Open-domain Knowledge And Image Grounded Conversational Agents Shuster Kurt, Ju Da, Roller Stephen, Dinan Emily, Boureau Y-lan, Weston Jason
- What Does BERT Learn From Multiple-choice Reading Comprehension Datasets? Si Chenglei, Wang Shuohang, Kan Min-yen, Jiang Jing
- Multilingual Question Answering From Formatted Text Applied To Conversational Agents Siblini Wissam, Pasqual Charlotte, Lavielle Axel, Challal Mohamed, Cauchois Cyril
- MASS: Masked Sequence To Sequence Pre-training For Language Generation Song Kaitao, Tan Xu, Qin Tao, Lu Jianfeng, Liu Tie-yan
- Sesamebert: Attention For Anywhere Su Ta-chun, Cheng Hsiang-chih
- VL-BERT: Pre-training Of Generic Visual-linguistic Representations Su Weijie, Zhu Xizhou, Cao Yue, Li Bin, Lu Lewei, Wei Furu, Dai Jifeng
- Syntax-infused Transformer And BERT Models For Machine Translation And Natural Language Understanding Sundararaman Dhanasekar, Subramanian Vivek, Wang Guoyin, Si Shijing, Shen Dinghan, Wang Dong, Carin Lawrence
- Olmpics -- On What Language Model Pre-training Captures Talmor Alon, Elazar Yanai, Goldberg Yoav, Berant Jonathan
- LXMERT: Learning Cross-modality Encoder Representations From Transformers Tan Hao, Bansal Mohit
- What Do You Learn From Context? Probing For Sentence Structure In Contextualized Word Representations Tenney Ian, Xia Patrick, Chen Berlin, Wang Alex, Poliak Adam, Mccoy R Thomas, Kim Najoung, Van Durme Benjamin, Bowman Samuel R., Das Dipanjan, Pavlick Ellie
- Waldorf: Wasteless Language-model Distillation On Reading-comprehension Tian James Yi, Kreuzer Alexander P., Chen Pai-hung, Will Hans-martin
- Well-read Students Learn Better: On The Importance Of Pre-training Compact Models Turc Iulia, Chang Ming-wei, Lee Kenton, Toutanova Kristina
- How Does BERT Answer Questions? A Layer-wise Analysis Of Transformer Representations Van Aken Betty, Winter Benjamin, Löser Alexander, Gers Felix A.
- Quantity Doesn't Buy Quality Syntax With Neural Language Models Van Schijndel Marten, Mueller Aaron, Linzen Tal
- Visualizing Attention In Transformer-based Language Representation Models Vig Jesse
- Allennlp Interpret: A Framework For Explaining Predictions Of NLP Models Wallace Eric, Tuyls Jens, Wang Junlin, Subramanian Sanjay, Gardner Matt, Singh Sameer
- Do NLP Models Know Numbers? Probing Numeracy In Embeddings Wallace Eric, Wang Yizhong, Li Sujian, Singh Sameer, Gardner Matt
- Language Models With Transformers Wang Chenguang, Li Mu, Smola Alexander J.
- Do Multi-hop Readers Dream Of Reasoning Chains? Wang Haoyu, Yu Mo, Guo Xiaoxiao, Das Rajarshi, Xiong Wenhan, Gao Tian
- Structbert: Incorporating Language Structures Into Pre-training For Deep Language Understanding Wang Wei, Bi Bin, Yan Ming, Wu Chen, Bao Zuyi, Xia Jiangnan, Peng Liwei, Si Luo
- Tree Transformer: Integrating Tree Structures Into Self-attention Wang Yau-shian, Lee Hung-yi, Chen Yun-nung
- Structured Pruning Of Large Language Models Wang Ziheng, Wohlwend Jeremy, Lei Tao
- Adapting And Evaluating A Deep Learning Language Model For Clinical Why-question Answering Wen Andrew, Elwazir Mohamed Y., Moon Sungrim, Fan Jungwei
- Alternating Recurrent Dialog Model With Large-scale Pre-trained Language Models Wu Qingyang, Zhang Yichi, Li Yu, Yu Zhou
- Review Conversational Reading Comprehension Xu Hu, Liu Bing, Shu Lei, Yu Philip S.
- Towards Making The Most Of BERT In Neural Machine Translation Yang Jiacheng, Wang Mingxuan, Zhou Hao, Zhao Chengqi, Yu Yong, Zhang Weinan, Li Lei
- Deepening Hidden Representations From Pre-trained Language Models Yang Junjie, Zhao Hai
- Data Augmentation For BERT Fine-tuning In Open-domain Question Answering Yang Wei, Xie Yuqing, Tan Luchen, Xiong Kun, Li Ming, Lin Jimmy
- Model Compression With Two-stage Multi-teacher Knowledge Distillation For Web Question Answering System Yang Ze, Shou Linjun, Gong Ming, Lin Wutao, Jiang Daxin
- Q8BERT: Quantized 8bit BERT Zafrir Ofir, Boudoukh Guy, Izsak Peter, Wasserblat Moshe
- Addressing Semantic Drift In Question Generation For Semi-supervised Question Answering Zhang Shiyue, Bansal Mohit
- Bertscore: Evaluating Text Generation With BERT Zhang Tianyi, Kishore Varsha, Wu Felix, Weinberger Kilian Q., Artzi Yoav
- Semantics-aware BERT For Language Understanding Zhang Zhuosheng, Wu Yuwei, Zhao Hai, Li Zuchao, Zhang Shuailiang, Zhou Xi, Zhou Xiang
- Extremely Small BERT Models From Mixed-vocabulary Training Zhao Sanqiang, Gupta Raghav, Song Yang, Zhou Denny
- Accelerating Natural Language Understanding In Task-oriented Dialog Ahuja Ojas, Desai Shrey
- Stress Test Evaluation Of Transformer-based Models In Natural Language Understanding Tasks Aspillaga Carlos, Carvallo Andrés, Araujo Vladimir
- Fluent Response Generation For Conversational Question Answering Baheti Ashutosh, Ritter Alan, Small Kevin
- Segatron: Segment-aware Transformer For Language Modeling And Understanding Bai He, Shi Peng, Lin Jimmy, Xie Yuqing, Tan Luchen, Xiong Kun, Gao Wen, Li Ming
- Unilmv2: Pseudo-masked Language Models For Unified Language Model Pre-training Bao Hangbo, Dong Li, Wei Furu, Wang Wenhui, Yang Nan, Liu Xiaodong, Wang Yu, Piao Songhao, Gao Jianfeng, Zhou Ming, Hon Hsiao-wuen
- Longformer: The Long-document Transformer Beltagy Iz, Peters Matthew E., Cohan Arman
- PALM: Pre-training An Autoencoding&autoregressive Language Model For Context-conditioned Generation Bi Bin, Li Chenliang, Wu Chen, Yan Ming, Wang Wei, Huang Songfang, Huang Fei, Si Luo
- Chatbot Interaction With Artificial Intelligence: Human Data Augmentation With T5 And Language Transformer Ensemble For Text Classification Bird Jordan J., Ekárt Anikó, Faria Diego R.
- Advances Of Transformer-based Models For News Headline Generation Bukhtiyarov Alexey, Gusev Ilya
- DIET: Lightweight Language Understanding For Dialogue Systems Bunk Tanja, Varshneya Daksh, Vlasov Vladimir, Nichol Alan
- Memory Transformer Burtsev Mikhail S., Kuratov Yuri, Peganov Anton, Sapunov Grigory V.
- Behind The Scene: Revealing The Secrets Of Pre-trained Vision-and-language Models Cao Jize, Gan Zhe, Cheng Yu, Yu Licheng, Chen Yen-chun, Liu Jingjing
- Dipair: Fast And Accurate Distillation For Trillion-scale Text Matching And Pair Modeling Chen Jiecao, Yang Liu, Raman Karthik, Bendersky Michael, Yeh Jung-jung, Zhou Yun, Najork Marc, Cai Danyang, Emadzadeh Ehsan
- Model Selection For Cross-lingual Transfer Chen Yang, Ritter Alan
- Improving BERT With Self-supervised Attention Chen Yiren, Kou Xiaoyu, Bai Jiangang, Tong Yunhai
- Autoadr: Automatic Model Design For Ad Relevance Chen Yiren, Yang Yaming, Sun Hong, Wang Yujing, Xu Yu, Shen Wei, Zhou Rong, Tong Yunhai, Bai Jing, Zhang Ruofei
- Comparative Study Of Language Models On Cross-domain Data With Model Agnostic Explainability Chhipa Mayank, Vazurkar Hrushikesh Mahesh, Kumar Abhijeet, Mishra Mridul
- X-LXMERT: Paint, Caption And Answer Questions With Multi-modal Transformers Cho Jaemin, Lu Jiasen, Schwenk Dustin, Hajishirzi Hannaneh, Kembhavi Aniruddha
- Residual Energy-based Models For Text Generation Deng Yuntian, Bakhtin Anton, Ott Myle, Szlam Arthur, Ranzato Marc'aurelio
- Calibration Of Pre-trained Transformers Desai Shrey, Durrett Greg
- EFSG: Evolutionary Fooling Sentences Generator Di Giovanni Marco, Brambilla Marco
- Fine-tuning Pretrained Language Models: Weight Initializations, Data Orders, And Early Stopping Dodge Jesse, Ilharco Gabriel, Schwartz Roy, Farhadi Ali, Hajishirzi Hannaneh, Smith Noah
- Sound Natural: Content Rephrasing In Dialog Systems Einolghozati Arash, Gupta Anchit, Diedrick Keith, Gupta Sonal
- A Comparison Of LSTM And BERT For Small Corpus Ezen-can Aysu
- CERT: Contrastive Self-supervised Learning For Language Understanding Fang Hongchao, Wang Sicheng, Zhou Meng, Ding Jiayuan, Xie Pengtao
- Compressing Large-scale Transformer-based Models: A Case Study On BERT Ganesh Prakhar, Chen Yao, Lou Xin, Khan Mohammad Ali, Yang Yin, Sajjad Hassan, Nakov Preslav, Chen Deming, Winslett Marianne
- It's Not Greek To Mbert: Inducing Word-level Translations From Multilingual BERT Gonen Hila, Ravfogel Shauli, Elazar Yanai, Goldberg Yoav
- Recurrent Chunking Mechanisms For Long-text Machine Reading Comprehension Gong Hongyu, Shen Yelong, Yu Dian, Chen Jianshu, Yu Dong
- A Simple Yet Strong Pipeline For Hotpotqa Groeneveld Dirk, Khot Tushar, Mausam, Sabharwal Ashish
- A Tailored Pre-training Model For Task-oriented Dialog Generation Gu Jing, Wu Qingyang, Wu Chongruo, Shi Weiyan, Yu Zhou
- Dialogbert: Discourse-aware Response Generation Via Learning To Recover And Rank Utterances Gu Xiaodong, Yoo Kang Min, Ha Jung-woo
- UNION: An Unreferenced Metric For Evaluating Open-ended Story Generation Guan Jian, Huang Minlie
- Incorporating BERT Into Parallel Sequence Decoding With Adapters Guo Junliang, Zhang Zhirui, Xu Linli, Wei Hao-ran, Chen Boxing, Chen Enhong
- GMAT: Global Memory Augmentation For Transformers Gupta Ankit, Berant Jonathan
- BERT Based Multilingual Machine Comprehension In English And Hindi Gupta Somil, Khade Nilesh
- Conversational Machine Comprehension: A Literature Review Gupta Somil, Rawat Bhanu Pratap Singh, Yu Hong
- Exploring Explainable Selection To Control Abstractive Summarization Wang Haonan, Gao Yang, Bai Yu, Lapata Mirella, Huang Heyan
- Enriched Pre-trained Transformers For Joint Slot Filling And Intent Detection Hardalov Momchil, Koychev Ivan, Nakov Preslav
- Pretrained Transformers Improve Out-of-distribution Robustness Hendrycks Dan, Liu Xiaoyuan, Wallace Eric, Dziedzic Adam, Krishnan Rishabh, Song Dawn
- PAIR: Planning And Iterative Refinement In Pre-trained Transformers For Long Text Generation Hua Xinyu, Wang Lu
- TRANS-BLSTM: Transformer With Bidirectional LSTM For Language Understanding Huang Zhiheng, Xu Peng, Liang Davis, Mishra Ajay, Xiang Bing
- Exploring BERT Parameter Efficiency On The Stanford Question Answering Dataset V2.0 Hulburd Eric
- Probing Pretrained Language Models For Lexical Semantics Vulić Ivan, Ponti Edoardo Maria, Litschko Robert, Glavaš Goran, Korhonen Anna
- Contrastive Code Representation Learning Jain Paras, Jain Ajay, Zhang Tianjun, Abbeel Pieter, Gonzalez Joseph E., Stoica Ion
- Are Natural Language Inference Models Imppressive? Learning Implicature And Presupposition Jeretic Paloma, Warstadt Alex, Bhooshan Suvrat, Williams Adina
- Text-to-text Pre-training For Data-to-text Tasks Kale Mihir, Rastogi Abhinav
- Meta-context Transformers For Domain-specific Response Generation Kar Debanjana, Samanta Suranjana, Azad Amar Prakash
- Do Fine-tuned Commonsense Language Models Really Generalize? Kejriwal Mayank, Shen Ke
- Unsupervised Bitext Mining And Translation Via Self-trained Contextual Embeddings Keung Phillip, Salazar Julian, Lu Yichao, Smith Noah A.
- Current Limitations Of Language Models: What You Need Is Retrieval Komatsuzaki Aran
- Syntactic Structure Distillation Pretraining For Bidirectional Encoders Kuncoro Adhiguna, Kong Lingpeng, Fried Daniel, Yogatama Dani, Rimell Laura, Dyer Chris, Blunsom Phil
- Utilizing Bidirectional Encoder Representations From Transformers For Answer Selection Laskar Md Tahmid Rahman, Hoque Enamul, Huang Jimmy Xiangji
- Common Sense Or World Knowledge? Investigating Adapter-based Knowledge Injection Into Pretrained Transformers Lauscher Anne, Majewska Olga, Ribeiro Leonardo F. R., Gurevych Iryna, Rozanov Nikolai, Glavaš Goran
- From Zero To Hero: On The Limitations Of Zero-shot Cross-lingual Transfer With Multilingual Transformers Lauscher Anne, Ravishankar Vinit, Vulić Ivan, Glavaš Goran
- SLM: Learning A Discourse Language Representation With Sentence Unshuffling Lee Haejun, Hudson Drew A., Lee Kangwook, Manning Christopher D.
- Efficient Transformer-based Large Scale Language Representations Using Hardware-friendly Block Structured Pruning Li Bingbing, Kong Zhenglun, Zhang Tianyun, Li Ji, Li Zhengang, Liu Hang, Ding Caiwen
- Transformers To Learn Hierarchical Contexts In Multiparty Dialogue For Span-based Question Answering Li Changmao, Choi Jinho D.
- Linguistically-informed Transformations (LIT): A Method For Automatically Generating Contrast Sets Li Chuanrong, Lin Shengshuo, Liu Leo Z., Wu Xinyi, Zhou Xuhui, Steinert-threlkeld Shane
- Optimus: Organizing Sentences Via Pre-trained Modeling Of A Latent Space Li Chunyuan, Gao Xiang, Li Yuan, Peng Baolin, Li Xiujun, Zhang Yizhe, Gao Jianfeng
- A Closer Look At The Robustness Of Vision-and-language Pre-trained Models Li Linjie, Gan Zhe, Liu Jingjing
- A Comparison Of Pre-trained Vision-and-language Models For Multimodal Representation Learning Across Medical Images And Reports Li Yikuan, Wang Hanyin, Luo Yuan
- XGLUE: A New Benchmark Dataset For Cross-lingual Pre-training, Understanding And Generation Liang Yaobo, Duan Nan, Gong Yeyun, Wu Ning, Guo Fenfei, Qi Weizhen, Gong Ming, Shou Linjun, Jiang Daxin, Cao Guihong, Fan Xiaodong, Zhang Ruofei, Agrawal Rahul, Cui Edward, Wei Sining, Bharti Taroon, Qiao Ying, Chen Jiun-hung, Wu Winnie, Liu Shuguang, Yang Fan, Campos Daniel, Majumder Rangan, Zhou Ming
- Probabilistically Masked Language Model Capable Of Autoregressive Generation In Arbitrary Word Order Liao Yi, Jiang Xin, Liu Qun
- Multi-node Bert-pretraining: Cost-efficient Approach Lin Jiahuang, Li Xin, Pekhimenko Gennady
- Qiaoning At Semeval-2020 Task 4: Commonsense Validation And Explanation System Based On Ensemble Of Language Model Liu Pai
- Adversarial Training For Large Neural Language Models Liu Xiaodong, Cheng Hao, He Pengcheng, Chen Weizhu, Wang Yu, Poon Hoifung, Gao Jianfeng
- Charbert: Character-aware Pre-trained Language Model Ma Wentao, Cui Yiming, Si Chenglei, Liu Ting, Wang Shijin, Hu Guoping
- Incremental Processing In The Age Of Non-incremental Encoders: An Empirical Assessment Of Bidirectional Models For Incremental NLU Madureira Brielen, Schlangen David
- Recobert: A Catalog Language Model For Text-based Recommendations Malkiel Itzik, Barkan Oren, Caciularu Avi, Razin Noam, Katz Ori, Koenigstein Noam
- Ladabert: Lightweight Adaptation Of BERT Through Hybrid Model Compression Mao Yihuan, Wang Yujing, Wu Chufan, Zhang Chen, Wang Yang, Yang Yaming, Zhang Quanlu, Tong Yunhai, Bai Jing
- Dialoglue: A Natural Language Understanding Benchmark For Task-oriented Dialogue Mehri Shikib, Eric Mihail, Hakkani-tur Dilek
- Generation-distillation For Efficient Natural Language Understanding In Low-data Settings Melas-kyriazi Luke, Han George, Liang Celine
- What Happens To BERT Embeddings During Fine-tuning? Merchant Amil, Rahimtoroghi Elahe, Pavlick Ellie, Tenney Ian
- Lambert: Language And Action Learning Using Multimodal BERT Miyazawa Kazuki, Aoki Tatsuya, Horii Takato, Nagai Takayuki
- On Incorporating Structural Information To Improve Dialogue Response Generation Moghe Nikita, Vijayan Priyesh, Ravindran Balaraman, Khapra Mitesh M.
- Modern Methods For Text Generation Montesinos Dimas Munoz
- On The Stability Of Fine-tuning BERT: Misconceptions, Explanations, And Strong Baselines Mosbach Marius, Andriushchenko Maksym, Klakow Dietrich
- Stepwise Extractive Summarization And Planning With Structured Transformers Narayan Shashi, Maynez Joshua, Adamek Jakub, Pighin Daniele, Bratanič Blaž, Mcdonald Ryan
- Real-time Execution Of Large-scale Language Models On Mobile Niu Wei, Kong Zhenglun, Yuan Geng, Jiang Weiwen, Guan Jiexiong, Ding Caiwen, Zhao Pu, Liu Sijia, Ren Bin, Wang Yanzhi
- Better Distractions: Transformer-based Distractor Generation And Multiple Choice Question Filtering Offerijns Jeroen, Verberne Suzan, Verhoef Tessa
- Challenges And Thrills Of Legal Arguments Pallaprolu Anurag, Vaidya Radha, Attawar Aditya Swaroop
- Multilingual BERT Post-pretraining Alignment Pan Lin, Hang Chung-wei, Qi Haode, Shah Abhishek, Potdar Saloni, Yu Mo
- Telling Bert's Full Story: From Local Attention To Global Aggregation Pascual Damian, Brunner Gino, Wattenhofer Roger
- What Does BERT Know About Books, Movies And Music? Probing BERT For Conversational Recommendation Penha Gustavo, Hauff Claudia
- Adapterhub: A Framework For Adapting Transformers Pfeiffer Jonas, Rücklé Andreas, Poth Clifton, Kamath Aishwarya, Vulić Ivan, Ruder Sebastian, Cho Kyunghyun, Gurevych Iryna
- MAD-X: An Adapter-based Framework For Multi-task Cross-lingual Transfer Pfeiffer Jonas, Vulić Ivan, Gurevych Iryna, Ruder Sebastian
- Compressing Transformer-based Semantic Parsing Models Using Compositional Code Embeddings Prakash Prafull, Shashidhar Saurabh Kumar, Zhao Wenlong, Rongali Subendhu, Khan Haidar, Kayser Michael
- Intermediate-task Transfer Learning With Pretrained Models For Natural Language Understanding: When And Why Does It Work? Pruksachatkun Yada, Phang Jason, Liu Haokun, Htut Phu Mon, Zhang Xiaoyi, Pang Richard Yuanzhe, Vania Clara, Kann Katharina, Bowman Samuel R.
- Jiant: A Software Toolkit For Research On General-purpose Text Understanding Models Pruksachatkun Yada, Yeres Phil, Liu Haokun, Phang Jason, Htut Phu Mon, Wang Alex, Tenney Ian, Bowman Samuel R.
- Training Large Neural Networks With Constant Memory Using A New Execution Algorithm Pudipeddi Bharadwaj, Mesmakhosroshahi Maral, Xi Jinwen, Bharadwaj Sujeeth
- How Fine Can Fine-tuning Be? Learning Efficient Language Models Radiya-dixit Evani, Wang Xin
- Lareqa: Language-agnostic Answer Retrieval From A Multilingual Pool Roy Uma, Constant Noah, Al-rfou Rami, Barua Aditya, Phillips Aaron, Yang Yinfei
- Fine-tuning BERT For Schema-guided Zero-shot Dialogue State Tracking Ruan Yu-ping, Ling Zhen-hua, Gu Jia-chen, Liu Quan
- On The Effect Of Dropping Layers Of Pre-trained Transformer Models Sajjad Hassan, Dalvi Fahim, Durrani Nadir, Nakov Preslav
- Hierarchical Transformer For Task Oriented Dialog Systems Santra Bishal, Potnuru Anusha, Goyal Pawan
- What BERT Sees: Cross-modal Transfer For Visual Question Generation Scialom Thomas, Bordes Patrick, Dray Paul-alexis, Staiano Jacopo, Gallinari Patrick
- BLEURT: Learning Robust Metrics For Text Generation Sellam Thibault, Das Dipanjan, Parikh Ankur P.
- Pre-trained Summarization Distillation Shleifer Sam, Rush Alexander M.
- Lightpaff: A Two-stage Distillation Framework For Pre-training And Fine-tuning Song Kaitao, Sun Hao, Tan Xu, Qin Tao, Lu Jianfeng, Liu Hongzhi, Liu Tie-yan
- An Investigation Of Language Model Interpretability Via Sentence Editing Stevens Samuel, Su Yu
- Bert-hlstms: BERT And Hierarchical Lstms For Visual Storytelling Su Jing, Dai Qingyun, Guerin Frank, Zhou Mian
- Let's Stop Incorrect Comparisons In End-to-end Relation Extraction! Taillé Bruno, Guigue Vincent, Scoutheeten Geoffrey, Gallinari Patrick
- Edgebert: Sentence-level Energy Optimizations For Latency-aware Multi-task NLP Inference Tambe Thierry, Hooper Coleman, Pentecost Lillian, Jia Tianyu, Yang En-yu, Donato Marco, Sanh Victor, Whatmough Paul N., Rush Alexander M., Brooks David, Wei Gu-yeon
- Cycle Text-to-image GAN With BERT Tsue Trevor, Sen Samir, Li Jason
- An Empirical Study On Robustness To Spurious Correlations Using Pre-trained Language Models Tu Lifu, Lalwani Garima, Gella Spandana, He He
- Bertnesia: Investigating The Capture And Forgetting Of Knowledge In BERT Wallat Jonas, Singh Jaspreet, Anand Avishek
- Multi-task Learning With Multi-head Attention For Multi-choice Reading Comprehension Wan Hui
- Infobert: Improving Robustness Of Language Models From An Information Theoretic Perspective Wang Boxin, Wang Shuohang, Cheng Yu, Gan Zhe, Jia Ruoxi, Li Bo, Liu Jingjing
- Minilm: Deep Self-attention Distillation For Task-agnostic Compression Of Pre-trained Transformers Wang Wenhui, Wei Furu, Dong Li, Bao Hangbo, Yang Nan, Zhou Ming
- Lightseq: A High Performance Inference Library For Transformers Wang Xiaohui, Xiong Ying, Wei Yang, Wang Mingxuan, Li Lei
- VD-BERT: A Unified Vision And Dialog Transformer With BERT Wang Yue, Joty Shafiq, Lyu Michael R., King Irwin, Xiong Caiming, Hoi Steven C. H.
- On Modality Bias In The TVQA Dataset Winterbottom Thomas, Xiao Sarah, Mclean Alistair, Moubayed Noura Al
- Cort: Complementary Rankings From Transformers Wrzalik Marco, Krechel Dirk
- TOD-BERT: Pre-trained Natural Language Understanding For Task-oriented Dialogue Wu Chien-sheng, Hoi Steven, Socher Richard, Xiong Caiming
- Context-guided BERT For Targeted Aspect-based Sentiment Analysis Wu Zhengxuan, Ong Desmond C.
- CG-BERT: Conditional Text Generation With BERT For Generalized Few-shot Intent Detection Xia Congying, Zhang Chenwei, Nguyen Hoang, Zhang Jiawei, Yu Philip
- Improving BERT Fine-tuning Via Self-ensemble And Self-distillation Xu Yige, Qiu Xipeng, Zhou Ligao, Huang Xuanjing
- MC-BERT: Efficient Language Pre-training Via A Meta Controller Xu Zhenhui, Gong Linyuan, Ke Guolin, He Di, Zheng Shuxin, Wang Liwei, Bian Jiang, Liu Tie-yan
- On The Generation Of Medical Dialogues For COVID-19 Yang Wenmian, Zeng Guangtao, Tan Bowen, Ju Zeqian, Chakravorty Subrato, He Xuehai, Chen Shu, Yang Xingyi, Wu Qingyang, Yu Zhou, Xing Eric, Xie Pengtao
- Cocolm: Complex Commonsense Enhanced Language Model With Discourse Relations Yu Changlong, Zhang Hongming, Song Yangqiu, Ng Wilfred
- Clinical Reading Comprehension: A Thorough Analysis Of The Emrqa Dataset Yue Xiang, Gutierrez Bernal Jimenez, Sun Huan
- GOBO: Quantizing Attention-based NLP Models For Low Latency And Energy Efficient Inference Zadeh Ali Hadi, Edo Isak, Awad Omar Mohamed, Moshovos Andreas
- Big Bird: Transformers For Longer Sequences Zaheer Manzil, Guruganesh Guru, Dubey Avinava, Ainslie Joshua, Alberti Chris, Ontanon Santiago, Pham Philip, Ravula Anirudh, Wang Qifan, Yang Li, Ahmed Amr
- Open-domain Dialogue Generation Based On Pre-trained Language Models Zeng Yan, Nie Jian-yun
- Accelerating Training Of Transformer-based Language Models With Progressive Layer Dropping Zhang Minjia, He Yuxiong
- Trojaning Language Models For Fun And Profit Zhang Xinyang, Zhang Zheng, Ji Shouling, Wang Ting
- When Do You Need Billions Of Words Of Pretraining Data? Zhang Yian, Warstadt Alex, Li Haau-sing, Bowman Samuel R.
- POINTER: Constrained Progressive Text Generation Via Insertion-based Generative Pre-training Zhang Yizhe, Wang Guoyin, Li Chunyuan, Gan Zhe, Brockett Chris, Dolan Bill
- BERT-JAM: Boosting Bert-enhanced Neural Machine Translation With Joint Attention Zhang Zhebin, Wu Sai, Jiang Dawei, Chen Gang
- A Closer Look At Few-shot Crosslingual Transfer: The Choice Of Shots Matters Zhao Mengjie, Zhu Yi, Shareghi Ehsan, Vulić Ivan, Reichart Roi, Korhonen Anna, Schütze Hinrich
- Pre-training Text-to-text Transformers For Concept-centric Common Sense Zhou Wangchunshu, Lee Dong-ho, Selvam Ravi Kiran, Lee Seyeon, Lin Bill Yuchen, Ren Xiang
- Learning To Compare For Better Training And Evaluation Of Open Domain Natural Language Generation Models Zhou Wangchunshu, Xu Ke
- Regularizing Transformers With Deep Probabilistic Layers Aguilera Aurora Cobo, Olmos Pablo Martínez, Artés-rodríguez Antonio, Pérez-cruz Fernando
- Re-evaluating Germeval17 Using German Pre-trained Language Models Aßenmacher M., Corvonato A., Heumann C.
- S2s-ft: Fine-tuning Pretrained Transformer Encoders For Sequence-to-sequence Learning Bao Hangbo, Dong Li, Wang Wenhui, Yang Nan, Wei Furu
- Gabert -- An Irish Language Model Barry James, Wagner Joachim, Cassidy Lauren, Cowap Alan, Lynn Teresa, Walsh Abigail, Meachair Mícheál J. Ó, Foster Jennifer
- NLP-IIS@UT At Semeval-2021 Task 4: Machine Reading Comprehension Using The Long Document Transformer Basafa Hossein, Movahedi Sajad, Ebrahimi Ali, Shakery Azadeh, Faili Heshaam
- Building A Question And Answer System For News Domain Basu Sandipan, Gaddala Aravind, Chetan Pooja, Tiwari Garima, Darapaneni Narayana, Parvathaneni Sadwik, Paduri Anwesh Reddy
- On The Universality Of Deep Contextual Language Models Bhatt Shaily, Goyal Poonam, Dandapat Sandipan, Choudhury Monojit, Sitaram Sunayana
- Improving Language Models By Retrieving From Trillions Of Tokens Borgeaud Sebastian, Mensch Arthur, Hoffmann Jordan, Cai Trevor, Rutherford Eliza, Millican Katie, Driessche George Van Den, Lespiau Jean-baptiste, Damoc Bogdan, Clark Aidan, Casas Diego De Las, Guy Aurelia, Menick Jacob, Ring Roman, Hennigan Tom, Huang Saffron, Maggiore Loren, Jones Chris, Cassirer Albin, Brock Andy, Paganini Michela, Irving Geoffrey, Vinyals Oriol, Osindero Simon, Simonyan Karen, Rae Jack W., Elsen Erich, Sifre Laurent
- On The Prunability Of Attention Heads In Multilingual BERT Budhraja Aakriti, Pande Madhura, Kumar Pratyush, Khapra Mitesh M.
- CDLM: Cross-document Language Modeling Caciularu Avi, Cohan Arman, Beltagy Iz, Peters Matthew E., Cattan Arie, Dagan Ido
- Curriculum Learning For Language Modeling Campos Daniel
- Topic Transferable Table Question Answering Chemmengath Saneem Ahmed, Kumar Vishwajeet, Bharadwaj Samarth, Sen Jaydeep, Canim Mustafa, Chakrabarti Soumen, Gliozzo Alfio, Sankaranarayanan Karthik
- Ensemble Fine-tuned Mbert For Translation Quality Estimation Chowdhury Shaika, Baili Naouel, Vannah Brian
- Enjoy The Salience: Towards Better Transformer-based Faithful Explanations With Word Salience Chrysostomou George, Aletras Nikolaos
- CANINE: Pre-training An Efficient Tokenization-free Encoder For Language Representation Clark Jonathan H., Garrette Dan, Turc Iulia, Wieting John
- Knowledge Neurons In Pretrained Transformers Dai Damai, Dong Li, Hao Yaru, Sui Zhifang, Chang Baobao, Wei Furu
- Adapting Monolingual Models: Data Can Be Scarce When Language Similarity Is High De Vries Wietse, Bartelds Martijn, Nissim Malvina, Wieling Martijn
- A Primer On Pretrained Multilingual Language Models Doddapaneni Sumanth, Ramesh Gowtham, Khapra Mitesh M., Kunchukuttan Anoop, Kumar Pratyush
- An Empirical Study Of Training End-to-end Vision-and-language Transformers Dou Zi-yi, Xu Yichong, Gan Zhe, Wang Jianfeng, Wang Shuohang, Wang Lijuan, Zhu Chenguang, Zhang Pengchuan, Yuan Lu, Peng Nanyun, Liu Zicheng, Zeng Michael
- Robustness Challenges In Model Distillation And Pruning For Natural Language Understanding Du Mengnan, Mukherjee Subhabrata, Cheng Yu, Shokouhi Milad, Hu Xia, Awadallah Ahmed Hassan
- GLM: General Language Model Pretraining With Autoregressive Blank Infilling Du Zhengxiao, Qian Yujie, Liu Xiao, Ding Ming, Qiu Jiezhong, Yang Zhilin, Tang Jie
- Bertaú: Itaú BERT For Digital Customer Service Finardi Paulo, Viegas José Dié, Ferreira Gustavo T., Mansano Alex F., Caridá Vinicius F.
- Vision-and-language Or Vision-for-language? On Cross-modal Influence In Multimodal Transformers Frank Stella, Bugliarello Emanuele, Elliott Desmond
- Exbert: An External Knowledge Enhanced BERT For Natural Language Inference Gajbhiye Amit, Moubayed Noura Al, Bradley Steven
- Does Dialog Length Matter For Next Response Selection Task? An Empirical Study Ganhotra Jatin, Joshi Sachindra
- Chop Chop BERT: Visual Question Answering By Chopping Visualbert's Heads Gao Chenyu, Zhu Qi, Wang Peng, Wu Qi
- JABER And SABER: Junior And Senior Arabic Bert Ghaddar Abbas, Wu Yimeng, Rashid Ahmad, Bibi Khalil, Rezagholizadeh Mehdi, Xing Chao, Wang Yasheng, Xinyu Duan, Wang Zhefeng, Huai Baoxing, Jiang Xin, Liu Qun, Langlais Philippe
- Towards Trustworthy Deception Detection: Benchmarking Model Robustness Across Domains, Modalities, And Languages Glenski Maria, Ayton Ellyn, Cosbey Robin, Arendt Dustin, Volkova Svitlana
- LAWDR: Language-agnostic Weighted Document Representations From Pre-trained Models Gong Hongyu, Chaudhary Vishrav, Tang Yuqing, Guzmán Francisco
- Larger-scale Transformers For Multilingual Masked Language Modeling Goyal Naman, Du Jingfei, Ott Myle, Anantharaman Giri, Conneau Alexis
- Block-skim: Efficient Question Answering For Transformer Guan Yue, Li Zhengyi, Leng Jingwen, Lin Zhouhan, Guo Minyi, Zhu Yuhao
- LICHEE: Improving Language Model Pre-training With Multi-grained Tokenization Guo Weidong, Zhao Mingjun, Zhang Lusheng, Niu Di, Luo Jinwen, Liu Zhenhua, Li Zhenyang, Tang Jianbo
- Open Domain Question Answering Over Tables Via Dense Retrieval Herzig Jonathan, Müller Thomas, Krichene Syrine, Eisenschlos Julian Martin
- Multilingual Language Models Predict Human Reading Behavior Hollenstein Nora, Pirovano Federico, Zhang Ce, Jäger Lena, Beinborn Lisa
- Lora: Low-rank Adaptation Of Large Language Models Hu Edward J., Shen Yelong, Wallis Phillip, Allen-zhu Zeyuan, Li Yuanzhi, Wang Shean, Wang Lu, Chen Weizhu
- DS-TOD: Efficient Domain Specialization For Task Oriented Dialog Hung Chia-chien, Lauscher Anne, Ponzetto Simone Paolo, Glavaš Goran
- Retraining Distilbert For A Voice Shopping Assistant By Using Universal Dependencies Jayarao Pratik, Sharma Arpit
- Lightmbert: A Simple Yet Effective Method For Multilingual BERT Distillation Jiao Xiaoqi, Yin Yichun, Shang Lifeng, Jiang Xin, Chen Xiao, Li Linlin, Wang Fang, Liu Qun
- A Good Prompt Is Worth Millions Of Parameters: Low-resource Prompt-based Learning For Vision-language Models Jin Woojeong, Cheng Yu, Shen Yelong, Chen Weizhu, Ren Xiang
- Distilling Large Language Models Into Tiny And Effective Students Using Pqrnn Kaliamoorthi Prabhu, Siddhant Aditya, Li Edward, Johnson Melvin
- AMMUS: A Survey Of Transformer-based Pretrained Models In Natural Language Processing Kalyan Katikapalli Subramanyam, Rajasekharan Ajit, Sangeetha Sivanesan
- Tiny Neural Models For Seq2seq Kandoor Arun
- Amazon Sagemaker Model Parallelism: A General And Flexible Framework For Large Model Training Karakus Can, Huilgol Rahul, Wu Fei, Subramanian Anirudh, Daniel Cade, Cavdar Derya, Xu Teng, Chen Haohan, Rahnama Arash, Quintela Luis
- What Do Pre-trained Code Models Know About Code? Karmakar Anjan, Robbes Romain
- Uquad1.0: Development Of An Urdu Question Answering Training Data For Machine Reading Comprehension Kazi Samreen, Khoja Shakeel
- Generating Rich Product Descriptions For Conversational E-commerce Systems Kedia Shashank, Mantha Aditya, Gupta Sneha, Guo Stephen, Achan Kannan
- BERT Busters: Outlier Dimensions That Disrupt Transformers Kovaleva Olga, Kulshreshtha Saurabh, Rogers Anna, Rumshisky Anna
- What BERT Based Language Models Learn In Spoken Transcripts: An Empirical Study Kumar Ayush, Sundararaman Mukuntha Narayanan, Vepa Jithendra
- Can Transformer Models Measure Coherence In Text? Re-thinking The Shuffle Test Laban Philippe, Dai Luke, Bandarkar Lucas, Hearst Marti A.
- Pagnol: An Extra-large French Generative Model Launay Julien, Tommasone Elena, Pannier Baptiste, Boniface François, Chatelain Amélie, Cappelli Alessandro, Poli Iacopo, Seddah Djamé
- Exploiting Sentence-level Representations For Passage Ranking Leonhardt Jurek, Beringer Fabian, Anand Avishek
- Personalized Transformer For Explainable Recommendation Li Lei, Zhang Yongfeng, Chen Li
- Ensemble ALBERT On Squad 2.0 Li Shilun, Li Renee, Peng Veronica
- A Short Study On Compressing Decoder-based Language Models Li Tianda, Mesbahi Yassir El, Kobyzev Ivan, Rashid Ahmad, Mahmud Atif, Anchuri Nithin, Hajimolahoseini Habib, Liu Yang, Rezagholizadeh Mehdi
- Scheduled Sampling In Vision-language Pretraining With Decoupled Encoder-decoder Network Li Yehao, Pan Yingwei, Yao Ting, Chen Jingwen, Mei Tao
- Lyra: A Benchmark For Turducken-style Code Generation Liang Qingyuan, Sun Zeyu, Zhu Qihao, Zhang Wenjie, Yu Lian, Xiong Yingfei, Zhang Lu
- MDQE: A More Accurate Direct Pretraining For Machine Translation Quality Estimation Lin Lei
- Unified Multimodal Pre-training And Prompt-based Tuning For Vision-language Understanding And Generation Liu Tianyi, Wu Zuxuan, Xiong Wenhan, Chen Jingjing, Jiang Yu-gang
- Augmenting Sequential Recommendation With Pseudo-prior Items Via Reversely Pre-training Transformer Liu Zhiwei, Fan Ziwei, Wang Yu, Yu Philip S.
- X-METRA-ADA: Cross-lingual Meta-transfer Learning Adaptation To Natural Language Understanding And Question Answering M'hamdi Meryem, Kim Doo Soon, Dernoncourt Franck, Bui Trung, Ren Xiang, May Jonathan
- Local And Global Context-based Pairwise Models For Sentence Ordering Manku Ruskin Raj, Paul Aditya Jyoti
- Lightweight Cross-lingual Sentence Representation Learning Mao Zhuoyuan, Gupta Prakhar, Wang Pei, Chu Chenhui, Jaggi Martin, Kurohashi Sadao
- Self-training Improves Pre-training For Few-shot Learning In Task-oriented Dialog Systems Mi Fei, Zhou Wanhao, Cai Fengyu, Kong Lingjing, Huang Minlie, Faltings Boi
- Recent Advances In Natural Language Processing Via Large Pre-trained Language Models: A Survey Min Bonan, Ross Hayley, Sulem Elior, Veyseh Amir Pouran Ben, Nguyen Thien Huu, Sainz Oscar, Agirre Eneko, Heinz Ilana, Roth Dan
- BERTGEN: Multi-task Generation Through BERT Mitzalis Faidon, Caglayan Ozan, Madhyastha Pranava, Specia Lucia
- Multi-modal Understanding And Generation For Medical Images And Text Via Vision-language Pre-training Moon Jong Hak, Lee Hyungyung, Shin Woncheol, Kim Young-hak, Choi Edward
- GPT-3 Models Are Poor Few-shot Learners In The Biomedical Domain Moradi Milad, Blagec Kathrin, Haberl Florian, Samwald Matthias
- First Align, Then Predict: Understanding The Cross-lingual Ability Of Multilingual BERT Muller Benjamin, Elazar Yanai, Sagot Benoît, Seddah Djamé
- P-adapters: Robustly Extracting Factual Information From Language Models With Diverse Prompts Newman Benjamin, Choubey Prafulla Kumar, Rajani Nazneen
- Sentence-t5: Scalable Sentence Encoders From Pre-trained Text-to-text Models Ni Jianmo, Ábrego Gustavo Hernández, Constant Noah, Ma Ji, Hall Keith B., Cer Daniel, Yang Yinfei
- Probing For Bridging Inference In Transformer Language Models Pandit Onkar, Hou Yufang
- A Comparative Study Of Transformer-based Language Models On Extractive Question Answering Pearce Kate, Zhan Tiffany, Komanduri Aneesh, Zhan Justin
- ABC: Attention With Bounded-memory Control Peng Hao, Kasai Jungo, Pappas Nikolaos, Yogatama Dani, Wu Zhaofeng, Kong Lingpeng, Schwartz Roy, Smith Noah A.
- The Effect Of Model Size On Worst-group Generalization Pham Alan, Chan Eunice, Srivatsa Vikranth, Ghosh Dhruba, Yang Yaoqing, Yu Yaodong, Zhong Ruiqi, Gonzalez Joseph E., Steinhardt Jacob
- Does Pre-training Induce Systematic Inference? How Masked Language Models Acquire Commonsense Knowledge Porada Ian, Sordoni Alessandro, Cheung Jackie Chi Kit
- Robertuito: A Pre-trained Language Model For Social Media Text In Spanish Pérez Juan Manuel, Furman Damián A., Alemany Laura Alonso, Luque Franco
- Assessing The Syntactic Capabilities Of Transformer-based Multilingual Language Models Pérez-mayos Laura, García Alba Táboas, Mille Simon, Wanner Leo
- Winner Team Mia At Textvqa Challenge 2021: Vision-and-language Representation Learning With Pre-trained Sequence-to-sequence Model Qiao Yixuan, Chen Hao, Wang Jun, Chen Yihao, Ye Xianbin, Li Ziliang, Qi Xianbiao, Gao Peng, Xie Guotong
- Grid Search Hyperparameter Benchmarking Of BERT, ALBERT, And Longformer On Duorc Quijano Alex John, Nguyen Sam, Ordonez Juanita
- Grounding Natural Language Instructions: Can Large Language Models Capture Spatial Information? Rozanova Julia, Ferreira Deborah, Dubba Krishna, Cheng Weiwei, Zhang Dell, Freitas Andre
- Efficient Domain Adaptation Of Language Models Via Adaptive Tokenization Sachidananda Vin, Kessler Jason S., Lai Yi-an
- Representation Learning For Conversational Data Using Discourse Mutual Information Maximization Santra Bishal, Roychowdhury Sumegh, Mandal Aishik, Gurram Vasu, Naik Atharva, Gupta Manish, Goyal Pawan
- Large Pre-trained Language Models Contain Human-like Biases Of What Is Right And Wrong To Do Schramowski Patrick, Turan Cigdem, Andersen Nico, Rothkopf Constantin A., Kersting Kristian
- CPT: A Pre-trained Unbalanced Transformer For Both Chinese Language Understanding And Generation Shao Yunfan, Geng Zhichao, Liu Yitao, Dai Junqi, Yan Hang, Yang Fei, Zhe Li, Bao Hujun, Qiu Xipeng
- Better Neural Machine Translation By Extracting Linguistic Information From BERT Shavarani Hassan S., Sarkar Anoop
- FPM: A Collection Of Large-scale Foundation Pre-trained Language Models Shen Dezhou
- Contextual Biasing Of Language Models For Speech Recognition In Goal-oriented Conversational Agents Shenoy Ashish, Bodapati Sravan, Kirchhoff Katrin
- Adapting Long Context NLM For ASR Rescoring In Conversational Agents Shenoy Ashish, Bodapati Sravan, Sunkara Monica, Ronanki Srikanth, Kirchhoff Katrin
- Scaling Law For Recommendation Models: Towards General-purpose User Representations Shin Kyuyong, Kwak Hanock, Kim Su Young, Ramstrom Max Nihlen, Jeong Jisu, Ha Jung-woo, Kim Kyung-min
- Bob: BERT Over BERT For Training Persona-based Dialogue Models From Limited Personalized Data Song Haoyu, Wang Yan, Zhang Kaiyan, Zhang Wei-nan, Liu Ting
- Worst Of Both Worlds: Biases Compound In Pre-trained Vision-and-language Models Srinivasan Tejas, Bisk Yonatan
- Ernie-tiny: A Progressive Distillation Framework For Pretrained Transformer Compression Su Weiyue, Chen Xuyi, Feng Shikun, Liu Jiaxiang, Liu Weixin, Sun Yu, Tian Hao, Wu Hua, Wang Haifeng
- Non-autoregressive Text Generation With Pre-trained Language Models Su Yixuan, Cai Deng, Wang Yan, Vandyke David, Baker Simon, Li Piji, Collier Nigel
- Tacl: Improving BERT Pre-training With Token-aware Contrastive Learning Su Yixuan, Liu Fangyu, Meng Zaiqiao, Lan Tian, Shu Lei, Shareghi Ehsan, Collier Nigel
- Embodied BERT: A Transformer Model For Embodied, Language-guided Visual Task Completion Suglia Alessandro, Gao Qiaozi, Thomason Jesse, Thattai Govind, Sukhatme Gaurav
- NSP-BERT: A Prompt-based Few-shot Learner Through An Original Pre-training Task--next Sentence Prediction Sun Yi, Zheng Yu, Hao Chao, Qiu Hangping
- Multilingual Translation Via Grafting Pre-trained Language Models Sun Zewei, Wang Mingxuan, Li Lei
- Trusting Roberta Over BERT: Insights From Checklisting The Natural Language Inference Task Tarunesh Ishan, Aditya Somak, Choudhury Monojit
- Exploring Transformers In Natural Language Generation: GPT, BERT, And Xlnet Topal M. Onat, Bas Anil, Van Heerden Imke
- Learning Better Visual Dialog Agents With Pretrained Visual-linguistic Representation Tu Tao, Ping Qing, Thattai Govind, Tur Gokhan, Natarajan Prem
- Cheap And Good? Simple And Effective Data Augmentation For Low Resource Machine Reading Van Hoang, Yadav Vikas, Surdeanu Mihai
- Bertnesia: Investigating The Capture And Forgetting Of Knowledge In BERT Wallat Jonas, Singh Jaspreet, Anand Avishek
- Topicrefine: Joint Topic Prediction And Dialogue Response Generation For Multi-turn End-to-end Dialogue System Wang Hongru, Cui Mingyu, Zhou Zimo, Fung Gabriel Pui Cheong, Wong Kam-fai
- Sentence Semantic Regression For Text Generation Wang Wei, Li Piji, Zheng Hai-tao
- Lightseq2: Accelerated Training For Transformer-based Models On Gpus Wang Xiaohui, Wei Yang, Xiong Ying, Huang Guyue, Qian Xian, Ding Yufei, Wang Mingxuan, Li Lei
- Codet5: Identifier-aware Unified Pre-trained Encoder-decoder Models For Code Understanding And Generation Wang Yue, Wang Weishi, Joty Shafiq, Hoi Steven C. H.
- Training Multilingual Pre-trained Language Model With Byte-level Subwords Wei Junqiu, Liu Qun, Guo Yinpeng, Jiang Xin
- A Multilingual Modeling Method For Span-extraction Reading Comprehension Wu Gaochen, Xu Bin, Chang Dejie, Liu Bangchang
- Domain-adaptive Pretraining Methods For Dialogue Understanding Wu Han, Xu Kun, Song Linfeng, Jin Lifeng, Zhang Haisong, Song Linqi
- On Explaining Your Explanations Of BERT: An Empirical Study With Sequence Classification Wu Zhengxuan, Ong Desmond C.
- Training Large-scale News Recommenders With Pretrained Language Models In The Loop Xiao Shitao, Liu Zheng, Shao Yingxia, Di Tao, Xie Xing
- Exploring Story Generation With Multi-task Objectives In Variational Autoencoders Xie Zhuohan, Cohn Trevor, Lau Jey Han
- From Dense To Sparse: Contrastive Pruning For Better Pre-trained Language Model Compression Xu Runxin, Luo Fuli, Wang Chengyu, Chang Baobao, Huang Jun, Huang Songfang, Huang Fei
- Transformer-based Korean Pretrained Language Models: A Survey On Three Years Of Progress Yang Kichang
- NT5?! Training T5 To Perform Numerical Reasoning Yang Peng-jian, Chen Ying Ting, Chen Yuechan, Cer Daniel
- TR-BERT: Dynamic Token Reduction For Accelerating BERT Inference Ye Deming, Lin Yankai, Huang Yufei, Sun Maosong
- Leveraging Linguistic Coordination In Reranking N-best Candidates For End-to-end Response Selection Using BERT Yu Mingzhi, Litman Diane
- LV-BERT: Exploiting Layer Variety For BERT Yu Weihao, Jiang Zihang, Chen Fei, Hou Qibin, Feng Jiashi
- Prune Once For All: Sparse Pre-trained Language Models Zafrir Ofir, Larey Ariel, Boudoukh Guy, Shen Haihao, Wasserblat Moshe
- Bert-coqac: Bert-based Conversational Question Answering In Context Zaib Munazza, Tran Dai Hoang, Sagar Subhash, Mahmood Adnan, Zhang Wei E., Sheng Quan Z.
- Bitfit: Simple Parameter-efficient Fine-tuning For Transformer-based Masked Language-models Zaken Elad Ben, Ravfogel Shauli, Goldberg Yoav
- Improving Stack Overflow Question Title Generation With Copying Enhanced Codebert Model And Bi-modal Information Zhang Fengji, Yu Xiao, Keung Jacky, Li Fuyang, Xie Zhiwen, Yang Zhen, Ma Caoyuan, Zhang Zhimin
- Ror: Read-over-read For Long Document Machine Reading Comprehension Zhao Jing, Bao Junwei, Wang Yifan, Zhou Yongwei, Wu Youzheng, He Xiaodong, Zhou Bowen
- Improving Sequence-to-sequence Pre-training Via Sequence Span Rewriting Zhou Wangchunshu, Ge Tao, Xu Canwen, Xu Ke, Wei Furu
- Global Memory Transformer For Processing Long Documents Adel Arij Al
- How Does The Pre-training Objective Affect What Large Language Models Learn About Linguistic Properties? Alajrami Ahmed, Aletras Nikolaos
- Tempo: Accelerating Transformer-based Model Training Through Memory Footprint Reduction Andoorveedu Muralidhar, Zhu Zhanda, Zheng Bojian, Pekhimenko Gennady
- Vl-beit: Generative Vision-language Pretraining Bao Hangbo, Wang Wenhui, Dong Li, Wei Furu
- Logical Reasoning For Task Oriented Dialogue Systems Beygi Sajjad, Fazel-zarandi Maryam, Cervone Alessandra, Krishnan Prakash, Jonnalagadda Siddhartha Reddy
- Team ÚFAL At CMCL 2022 Shared Task: Figuring Out The Correct Recipe For Predicting Eye-tracking Features Using Pretrained Language Models Bhattacharya Sunit, Kumar Rishu, Bojar Ondrej
- Evaluating The Susceptibility Of Pre-trained Language Models Via Handcrafted Adversarial Examples Branch Hezekiah J., Cefalu Jonathan Rodriguez, Mchugh Jeremy, Hujer Leyla, Bahl Aditya, Iglesias Daniel Del Castillo, Heichman Ron, Darwishi Ramesh
- Analyzing Semantic Faithfulness Of Language Models Via Input Intervention On Question Answering Chaturvedi Akshay, Bhar Swarnadeep, Saha Soumadeep, Garain Utpal, Asher Nicholas
- Imputing Out-of-vocabulary Embeddings With LOVE Makes Language Models Robust With Little Cost Chen Lihu, Varoquaux Gaël, Suchanek Fabian M.
- Vault: Augmenting The Vision-and-language Transformer For Sentiment Classification On Social Media Chochlakis Georgios, Srinivasan Tejas, Thomason Jesse, Narayanan Shrikanth
- Scientific And Creative Analogies In Pretrained Language Models Czinczoll Tamara, Yannakoudakis Helen, Mishra Pushkar, Shutova Ekaterina
- Promptagator: Few-shot Dense Retrieval From 8 Examples Dai Zhuyun, Zhao Vincent Y., Ma Ji, Luan Yi, Ni Jianmo, Lu Jing, Bakalov Anton, Guu Kelvin, Hall Keith B., Chang Ming-wei
- Understanding BLOOM: An Empirical Study On Diverse NLP Tasks Dakle Parag Pravin, Rallabandi Saikrishna, Raghavan Preethi
- Rlprompt: Optimizing Discrete Text Prompts With Reinforcement Learning Deng Mingkai, Wang Jianyu, Hsieh Cheng-ping, Wang Yihan, Guo Han, Shu Tianmin, Song Meng, Xing Eric P., Hu Zhiting
- Pre-training Transformer Models With Sentence-level Objectives For Answer Sentence Selection Di Liello Luca, Garg Siddhant, Soldaini Luca, Moschitti Alessandro
- Black-box Prompt Learning For Pre-trained Language Models Diao Shizhe, Huang Zhichao, Xu Ruijia, Li Xuechun, Lin Yong, Zhou Xiao, Zhang Tong
- Contrastive Learning With Bidirectional Transformers For Sequential Recommendation Du Hanwen, Shi Hui, Zhao Pengpeng, Wang Deqing, Sheng Victor S., Liu Yanchi, Liu Guanfeng, Zhao Lei
- Leveraging Pre-trained Models For Failure Analysis Triplets Generation Ezukwoke Kenneth, Hoayek Anis, Batton-hubert Mireille, Boucher Xavier, Gounet Pascal, Adrian Jerome
- Towards Winoqueer: Developing A Benchmark For Anti-queer Bias In Large Language Models Felkner Virginia K., Chang Ho-chun Herbert, Jang Eugene, May Jonathan
- Pnlp-mixer: An Efficient All-mlp Architecture For Language Fusco Francesco, Pascual Damian, Staar Peter, Antognini Diego
- Cramming: Training A Language Model On A Single GPU In One Day Geiping Jonas, Goldstein Tom
- Microbert: Effective Training Of Low-resource Monolingual Berts Through Parameter Reduction And Multitask Learning Gessler Luke, Zeldes Amir
- Massively Multilingual Lexical Specialization Of Multilingual Transformers Green Tommaso, Ponzetto Simone Paolo, Glavaš Goran
- Transkimmer: Transformer Learns To Layer-wise Skim Guan Yue, Li Zhengyi, Leng Jingwen, Lin Zhouhan, Guo Minyi
- Visually-augmented Pretrained Language Models For NLP Tasks Without Images Guo Hangyu, Zhou Kun, Zhao Wayne Xin, Zhang Qinyu, Wen Ji-rong
- Quala-minilm: A Quantized Length Adaptive Minilm Guskin Shira, Wasserblat Moshe, Wang Chang, Shen Haihao
- Thinking About GPT-3 In-context Learning For Biomedical IE? Think Again Gutiérrez Bernal Jiménez, Mcneal Nikolas, Washington Clay, Chen You, Li Lang, Sun Huan, Su Yu
- LUNA: Language Understanding With Number Augmentations On Transformers Via Number Plugins And Pre-training Han Hongwei, Xu Jialiang, Zhou Mengyu, Shao Yijia, Han Shi, Zhang Dongmei
- ORCA: Interpreting Prompted Language Models Via Locating Supporting Data Evidence In The Ocean Of Pretraining Data Han Xiaochuang, Tsvetkov Yulia
- Aging With GRACE: Lifelong Model Editing With Discrete Key-value Adaptors Hartvigsen Thomas, Sankaranarayanan Swami, Palangi Hamid, Kim Yoon, Ghassemi Marzyeh
- Bundle MCR: Towards Conversational Bundle Recommendation He Zhankui, Zhao Handong, Yu Tong, Kim Sungchul, Du Fan, Mcauley Julian
- Compressing Pre-trained Transformers Via Low-bit Nxm Sparsity For Natural Language Understanding Holmes Connor, Zhang Minjia, He Yuxiong, Wu Bo
- XDBERT: Distilling Visual Information To BERT From Cross-modal Systems To Improve Language Understanding Hsu Chan-jan, Lee Hung-yi, Tsao Yu
- Empirical Evaluation Of Post-training Quantization Methods For Language Tasks Hu Ting, Meinel Christoph, Yang Haojin
- Fctalker: Fine And Coarse Grained Context Modeling For Expressive Conversational Speech Synthesis Hu Yifan, Liu Rui, Gao Guanglai, Li Haizhou
- Empowering Language Models With Knowledge Graph Reasoning For Question Answering Hu Ziniu, Xu Yichong, Yu Wenhao, Wang Shuohang, Yang Ziyi, Zhu Chenguang, Chang Kai-wei, Sun Yizhou
- Transformer Quality In Linear Time Hua Weizhe, Dai Zihang, Liu Hanxiao, Le Quoc V.
- Layoutlmv3: Pre-training For Document AI With Unified Text And Image Masking Huang Yupan, Lv Tengchao, Cui Lei, Lu Yutong, Wei Furu
- Do Berts Learn To Use Browser User Interface? Exploring Multi-step Tasks With Unified Vision-and-language Berts Iki Taichi, Aizawa Akiko
- Contrastive Learning For Prompt-based Few-shot Language Learners Jian Yiren, Gao Chongyang, Vosoughi Soroush
- Probing Script Knowledge From Pre-trained Models Jin Zijian, Zhang Xingyu, Yu Mo, Huang Lifu
- Latency Adjustable Transformer Encoder For Language Understanding Kachuee Sajjad, Sharifkhani Mohammad
- Downstream Datasets Make Surprisingly Good Pretraining Corpora Krishna Kundan, Garg Saurabh, Bigham Jeffrey P., Lipton Zachary C.
- Mucot: Multilingual Contrastive Training For Question-answering In Low-resource Languages Kumar Gokul Karthik, Gehlot Abhishek Singh, Mullappilly Sahal Shaji, Nandakumar Karthik
- The Optimal BERT Surgeon: Scalable And Accurate Second-order Pruning For Large Language Models Kurtic Eldar, Campos Daniel, Nguyen Tuan, Frantar Elias, Kurtz Mark, Fineran Benjamin, Goin Michael, Alistarh Dan
- Emergent Linguistic Structures In Neural Networks Are Fragile La Malfa Emanuele, Wicker Matthew, Kwiatkowska Marta
- Does BERT Really Agree? Fine-grained Analysis Of Lexical Dependence On A Syntactic Task Lasri Karim, Lenci Alessandro, Poibeau Thierry
- Littlebird: Efficient Faster & Longer Transformer For Question Answering Lee Minchul, Han Kijong, Shin Myeong Cheol
- Simple Recurrence Improves Masked Language Models Lei Tao, Tian Ran, Bastings Jasmijn, Parikh Ankur P.
- LAVENDER: Unifying Video-language Understanding As Masked Language Modeling Li Linjie, Gan Zhe, Lin Kevin, Lin Chung-ching, Liu Zicheng, Liu Ce, Wang Lijuan
- Task-specific Pre-training And Prompt Decomposition For Knowledge Graph Population With Language Models Li Tianyi, Huang Wenyu, Papasarantopoulos Nikos, Vougiouklis Pavlos, Pan Jeff Z.
- Clinical-longformer And Clinical-bigbird: Transformers For Long Clinical Sequences Li Yikuan, Wehbe Ramsey M., Ahmad Faraz S., Wang Hanyin, Luo Yuan
- Accelerating Attention Through Gradient-based Learned Runtime Pruning Li Zheng, Ghodrati Soroush, Yazdanbakhsh Amir, Esmaeilzadeh Hadi, Kang Mingu
- Life After BERT: What Do Other Muppets Understand About Language? Lialin Vladislav, Zhao Kevin, Shivagunde Namrata, Rumshisky Anna
- On Task-adaptive Pretraining For Dialogue Response Selection Lin Tzu-hsiang, Chi Ta-chung, Rumshisky Anna
- Do Ever Larger Octopi Still Amplify Reporting Biases? Evidence From Judgments Of Typical Colour Liu Fangyu, Eisenschlos Julian Martin, Cole Jeremy R., Collier Nigel
- A Win-win Deal: Towards Sparse And Robust Pre-trained Language Models Liu Yuanxin, Meng Fandong, Lin Zheng, Li Jiangnan, Fu Peng, Cao Yanan, Wang Weiping, Zhou Jie
- Declaration-based Prompt Tuning For Visual Question Answering Liu Yuhang, Wei Wei, Peng Daowan, Zhu Feida
- Choose Your QA Model Wisely: A Systematic Study Of Generative And Extractive Readers For Question Answering Luo Man, Hashimoto Kazuma, Yavuz Semih, Liu Zhiwei, Baral Chitta, Zhou Yingbo
- Biogpt: Generative Pre-trained Transformer For Biomedical Text Generation And Mining Luo Renqian, Sun Liai, Xia Yingce, Qin Tao, Zhang Sheng, Poon Hoifung, Liu Tie-yan
- Unveiling Code Pre-trained Models: Investigating Syntax And Semantics Capacities Ma Wei, Liu Shangqing, Zhao Mengjie, Xie Xiaofei, Wang Wenhan, Hu Qiang, Zhang Jie, Liu Yang
- PERFECT: Prompt-free And Efficient Few-shot Learning With Language Models Mahabadi Rabeeh Karimi, Zettlemoyer Luke, Henderson James, Saeidi Marzieh, Mathias Lambert, Stoyanov Veselin, Yazdani Majid
- Tangobert: Reducing Inference Cost By Using Cascaded Architecture Mamou Jonathan, Pereg Oren, Wasserblat Moshe, Schwartz Roy
- Generating Training Data With Language Models: Towards Zero-shot Language Understanding Meng Yu, Huang Jiaxin, Zhang Yu, Han Jiawei
- Minicons: Enabling Flexible Behavioral And Representational Analyses Of Transformer Language Models Misra Kanishka
- Minialbert: Model Distillation Via Parameter-efficient Recursive Transformers Nouriborji Mohammadmahdi, Rohanian Omid, Kouchaki Samaneh, Clifton David A.
- CUE Vectors: Modular Training Of Language Models Conditioned On Diverse Contextual Signals Novotney Scott, Mukherjee Sreeparna, Ahmed Zeeshan, Stolcke Andreas
- Extremebert: A Toolkit For Accelerating Pretraining Of Customized BERT Pan Rui, Diao Shizhe, Chen Jianlin, Zhang Tong
- Bidirectional Language Models Are Also Few-shot Learners Patel Ajay, Li Bryan, Rasooli Mohammad Sadegh, Constant Noah, Raffel Colin, Callison-burch Chris
- Outliers Dimensions That Disrupt Transformers Are Driven By Frequency Puccetti Giovanni, Rogers Anna, Drozd Aleksandr, Dell'orletta Felice
- ELLE: Efficient Lifelong Pre-training For Emerging Data Qin Yujia, Zhang Jiajie, Lin Yankai, Liu Zhiyuan, Li Peng, Sun Maosong, Zhou Jie
- VLC-BERT: Visual Question Answering With Contextualized Commonsense Knowledge Ravi Sahithya, Chinchure Aditya, Sigal Leonid, Liao Renjie, Shwartz Vered
- Exploring Extreme Parameter Compression For Pre-trained Language Models Ren Yuxin, Wang Benyou, Shang Lifeng, Jiang Xin, Liu Qun
- APOLLO: A Simple Approach For Adaptive Pretraining Of Language Models For Logical Reasoning Sanyal Soumya, Xu Yichong, Wang Shuohang, Yang Ziyi, Pryzant Reid, Yu Wenhao, Zhu Chenguang, Ren Xiang
- Unintended Bias In Language Model-driven Conversational Recommendation Shen Tianshu, Li Jiaru, Bouadjenek Mohamed Reda, Mai Zheda, Sanner Scott
- Black-box Tuning For Language-model-as-a-service Sun Tianxiang, Shao Yunfan, Qian Hong, Huang Xuanjing, Qiu Xipeng
- Fewer Errors, But More Stereotypes? The Effect Of Model Size On Gender Bias Tal Yarden, Magar Inbal, Schwartz Roy
- Compression Of Generative Pre-trained Language Models Via Quantization Tao Chaofan, Hou Lu, Zhang Wei, Shang Lifeng, Jiang Xin, Liu Qun, Luo Ping, Wong Ngai
- Towards Fine-tuning Pre-trained Language Models With Integer Forward And Backward Propagation Tayaranian Mohammadreza, Ghaffari Alireza, Tahaei Marzieh S., Rezagholizadeh Mehdi, Asgharian Masoud, Nia Vahid Partovi
- Predicting Issue Types With Sebert Trautsch Alexander, Herbold Steffen
- Effect And Analysis Of Large-scale Language Model Rescoring On Competitive ASR Systems Udagawa Takuma, Suzuki Masayuki, Kurata Gakuto, Itoh Nobuyasu, Saon George
- Dylora: Parameter Efficient Tuning Of Pre-trained Models Using Dynamic Search-free Low-rank Adaptation Valipour Mojtaba, Rezagholizadeh Mehdi, Kobyzev Ivan, Ghodsi Ali
- VLSP 2021 - Vimrc Challenge: Vietnamese Machine Reading Comprehension Van Nguyen Kiet, Tran Son Quoc, Nguyen Luan Thanh, Van Huynh Tin, Luu Son T., Nguyen Ngan Luu-thuy
- What Do They Capture? -- A Structural Analysis Of Pre-trained Language Models For Source Code Wan Yao, Zhao Wei, Zhang Hongyu, Sui Yulei, Xu Guandong, Jin Hai
- Knowledgeable Salient Span Mask For Enhancing Language Models As Knowledge Base Wang Cunxiang, Luo Fuli, Li Yanyang, Xu Runxin, Huang Fei, Zhang Yue
- Pretraining Without Attention Wang Junxiong, Yan Jing Nathan, Gu Albert, Rush Alexander M.
- What Language Model Architecture And Pretraining Objective Work Best For Zero-shot Generalization? Wang Thomas, Roberts Adam, Hesslow Daniel, Scao Teven Le, Chung Hyung Won, Beltagy Iz, Launay Julien, Raffel Colin
- Qrelscore: Better Evaluating Generated Questions With Deeper Understanding Of Context-aware Relevance Wang Xiaoqiang, Liu Bang, Tang Siliang, Wu Lingfei
- Pan More Gold From The Sand: Refining Open-domain Dialogue Training With Noisy Self-retrieval Generation Wang Yihe, Li Yitong, Wang Yasheng, Mi Fei, Zhou Pingyi, Wang Xin, Liu Jin, Jiang Xin, Liu Qun
- Unified Multimodal Model With Unlikelihood Training For Visual Dialog Wang Zihao, Wang Junli, Jiang Changjun
- Probing Of Quantitative Values In Abstractive Summarization Models White Nathan M.
- Adversarial Self-attention For Language Understanding Wu Hongqiu, Ding Ruixue, Zhao Hai, Xie Pengjun, Huang Fei, Zhang Min
- Exploring The Efficacy Of Pre-trained Checkpoints In Text-to-music Generation Task Wu Shangda, Sun Maosong
- Extreme Compression For Pre-trained Transformers Made Simple And Efficient Wu Xiaoxia, Yao Zhewei, Zhang Minjia, Li Conglong, He Yuxiong
- Go-tuning: Improving Zero-shot Learning Abilities Of Smaller Language Models Xu Jingjing, Dong Qingxiu, Liu Hongyi, Li Lei
- CLIP Also Understands Text: Prompting CLIP For Phrase Understanding Yan An, Li Jiacheng, Zhu Wanrong, Lu Yujie, Wang William Yang, Mcauley Julian
- Zero-shot Video Question Answering Via Frozen Bidirectional Language Models Yang Antoine, Miech Antoine, Sivic Josef, Laptev Ivan, Schmid Cordelia
- Parameter-efficient Tuning Makes A Good Classification Head Yang Zhuoyi, Ding Ming, Guo Yanhui, Lv Qingsong, Tang Jie
- Prompt Tuning For Discriminative Pre-trained Language Models Yao Yuan, Dong Bowen, Zhang Ao, Zhang Zhengyan, Xie Ruobing, Liu Zhiyuan, Lin Leyu, Sun Maosong, Wang Jianyong
- Zeroquant: Efficient And Affordable Post-training Quantization For Large-scale Transformers Yao Zhewei, Aminabadi Reza Yazdani, Zhang Minjia, Wu Xiaoxia, Li Conglong, He Yuxiong
- Random-ltd: Random And Layerwise Token Dropping Brings Efficient Training For Large-scale Transformers Yao Zhewei, Wu Xiaoxia, Li Conglong, Holmes Connor, Zhang Minjia, Li Cheng, He Yuxiong
- Better Pre-training By Reducing Representation Confusion Zhang Haojie, Liang Mingfei, Xie Ruobing, Sun Zhenlong, Zhang Bo, Lin Leyu
- Scala: Accelerating Adaptation Of Pre-trained Transformer-based Language Models Via Efficient Large-batch Adversarial Noise Zhang Minjia, Naresh Niranjan Uma, He Yuxiong
- MSDF: A General Open-domain Multi-skill Dialog Framework Zhao Yu, Hu Xinshuo, Li Yunxin, Hu Baotian, Li Dongfang, Chen Sichao, Wang Xiaolong
- Syntax-informed Question Answering With Heterogeneous Graph Transformer Zhu Fangyi, Tan Lok You, Ng See-kiong, Bressan Stéphane
- VD-GR: Boosting Visual Dialog With Cascaded Spatial-temporal Multi-modal Graphs Abdessaied Adnen, Shi Lei, Bulling Andreas
- On The Relationship Between Skill Neurons And Robustness In Prompt Tuning Ackermann Leon, Ohmer Xenia
- The Emergence Of Essential Sparsity In Large Pre-trained Models: The Weights That Matter Jaiswal Ajay, Liu Shiwei, Chen Tianlong, Wang Zhangyang
- Is BERT Blind? Exploring The Effect Of Vision-and-language Pretraining On Visual Language Understanding Alper Morris, Fiman Michael, Averbuch-elor Hadar
- Prompt-engineering And Transformer-based Question Generation And Evaluation Amyeen Rubaba
- Studying The Impacts Of Pre-training Using Chatgpt-generated Text On Downstream Tasks Anand Sarthak
- Distilling Efficient Language-specific Models For Cross-lingual Transfer Ansell Alan, Ponti Edoardo Maria, Korhonen Anna, Vulić Ivan
- Scaling Laws For Language Encoding Models In Fmri Antonello Richard, Vaidya Aditya, Huth Alexander G.
- Detecting Natural Language Biases With Prompt-based Learning Aowal Md Abdul, Islam Maliha T, Mammen Priyanka Mary, Shetty Sandesh
- Sequence-to-sequence Spanish Pre-trained Language Models Araujo Vladimir, Trusca Maria Mihaela, Tufiño Rodrigo, Moens Marie-francine
- Slimfit: Memory-efficient Fine-tuning Of Transformer-based Models Using Training Dynamics Ardakani Arash, Haan Altan, Tan Shangyin, Popovici Doru Thom, Cheung Alvin, Iancu Costin, Sen Koushik
- Real-time Visual Feedback To Guide Benchmark Creation: A Human-and-metric-in-the-loop Workflow Arunkumar Anjana, Mishra Swaroop, Sachdeva Bhavdeep, Baral Chitta, Bryan Chris
- Taught By The Internet, Exploring Bias In Openais GPT3 Ayaz Ali, Nawalgaria Aditya, Yin Ruilian
- Investigating Pre-trained Language Models On Cross-domain Datasets, A Step Closer To General AI Ballout Mohamad, Krumnack Ulf, Heidemann Gunther, Kühnberger Kai-uwe
- Do Language Models Learn About Legal Entity Types During Pretraining? Barale Claire, Rovatsos Michael, Bhuta Nehal
- Understanding Telecom Language Through Large Language Models Bariah Lina, Zou Hang, Zhao Qiyang, Mouhouche Belkacem, Bader Faouzi, Debbah Merouane
- Text Summarization Using Large Language Models: A Comparative Study Of Mpt-7b-instruct, Falcon-7b-instruct, And Openai Chat-gpt Models Basyal Lochan, Sanghvi Mihir
- Harnessing Gpt-3.5-turbo For Rhetorical Role Prediction In Legal Cases Belfathi Anas, Hernandez Nicolas, Monceaux Laura
- Labrador: Exploring The Limits Of Masked Language Modeling For Laboratory Data Bellamy David R., Kumar Bhawesh, Wang Cindy, Beam Andrew
- Large Language Models In The Workplace: A Case Study On Prompt Engineering For Job Type Classification Benjamin Clavié, Alexandru Ciceu, Frederick Naylor, Guillaume Soulié, Thomas Brightwell
- Pre-training Llms Using Human-like Development Data Corpus Bhardwaj Khushi, Shah Raj Sanjay, Varma Sashank
- Inpars-light: Cost-effective Unsupervised Training Of Efficient Rankers Boytsov Leonid, Patel Preksha, Sourabh Vivek, Nisar Riddhi, Kundu Sayani, Ramanathan Ramya, Nyberg Eric
- Extractgpt: Exploring The Potential Of Large Language Models For Product Attribute Value Extraction Brinkmann Alexander, Shraga Roee, Bizer Christian
- Utilizing Chatgpt Generated Data To Retrieve Depression Symptoms From Social Media Bucur Ana-maria
- Finding The Needle In A Haystack: Unsupervised Rationale Extraction From Long Text Classifiers Bujel Kamil, Caines Andrew, Yannakoudakis Helen, Rei Marek
- Local Large Language Models For Complex Structured Medical Tasks Bumgardner V. K. Cody, Mullen Aaron, Armstrong Sam, Hickey Caylin, Talbert Jeff
- Oberta: Improving Sparse Transfer Learning Via Improved Initialization, Distillation, And Pruning Regimes Campos Daniel, Marques Alexandre, Kurtz Mark, Zhai Chengxiang
- Not All Layers Are Equally As Important: Every Layer Counts BERT Charpentier Lucas Georges Gabriel, Samuel David
- Cabbage Sweeter Than Cake? Analysing The Potential Of Large Language Models For Learning Conceptual Spaces Chatterjee Usashi, Gajbhiye Amit, Schockaert Steven
- Generate Labeled Training Data Using Prompt Programming And GPT-3. An Example Of Big Five Personality Classification Chen Eason
- Evaluation Of Chatgpt Family Of Models For Biomedical Reasoning And Classification Chen Shan, Li Yingya, Lu Sheng, Van Hoang, Aerts Hugo Jwl, Savova Guergana K., Bitterman Danielle S.
- Tutorials On Stance Detection Using Pre-trained Language Models: Fine-tuning BERT And Prompting Large Language Models Chuang Yun-shiuan
- BERT Lost Patience Won't Be Robust To Adversarial Slowdown Coalson Zachary, Ritter Gabriel, Bobba Rakesh, Hong Sanghyun
- Seamlessm4t: Massively Multilingual & Multimodal Machine Translation Communication Seamless, Barrault Loïc, Chung Yu-an, Meglioli Mariano Cora, Dale David, Dong Ning, Duquenne Paul-ambroise, Elsahar Hady, Gong Hongyu, Heffernan Kevin, Hoffman John, Klaiber Christopher, Li Pengwei, Licht Daniel, Maillard Jean, Rakotoarison Alice, Sadagopan Kaushik Ram, Wenzek Guillaume, Ye Ethan, Akula Bapi, Chen Peng-jen, Hachem Naji El, Ellis Brian, Gonzalez Gabriel Mejia, Haaheim Justin, Hansanti Prangthip, Howes Russ, Huang Bernie, Hwang Min-jae, Inaguma Hirofumi, Jain Somya, Kalbassi Elahe, Kallet Amanda, Kulikov Ilia, Lam Janice, Li Daniel, Ma Xutai, Mavlyutov Ruslan, Peloquin Benjamin, Ramadan Mohamed, Ramakrishnan Abinesh, Sun Anna, Tran Kevin, Tran Tuan, Tufanov Igor, Vogeti Vish, Wood Carleigh, Yang Yilin, Yu Bokai, Andrews Pierre, Balioglu Can, Costa-jussà Marta R., Celebi Onur, Elbayad Maha, Gao Cynthia, Guzmán Francisco, Kao Justine, Lee Ann, Mourachko Alexandre, Pino Juan, Popuri Sravya, Ropers Christophe, Saleem Safiyyah, Schwenk Holger, Tomasello Paden, Wang Changhan, Wang Jeff, Wang Skyler
- Robust Infidelity: When Faithfulness Measures On Masked Language Models Are Misleading Crothers Evan, Viktor Herna, Japkowicz Nathalie
- Syntactic Knowledge Via Graph Attention With BERT In Machine Translation Dai Yuqian, Sharoff Serge, De Kamps Marc
- Context-aware Transformer Pre-training For Answer Sentence Selection Di Liello Luca, Garg Siddhant, Moschitti Alessandro
- Jump To Conclusions: Short-cutting Transformers With Linear Transformations Din Alexander Yom, Karidi Taelin, Choshen Leshem, Geva Mor
- Self-agreement: A Framework For Fine-tuning Language Models To Find Agreement Among Diverse Opinions Ding Shiyao, Ito Takayuki
- Block-wise Bit-compression Of Transformer-based Models Dong Gaochen, Chen Wei
- Chain-of-thought Tuning: Masked Language Models Can Also Think Step By Step In Natural Language Understanding Fan Caoyun, Tian Jidong, Li Yitian, Chen Wenqing, He Hao, Jin Yaohui
- Using GPT-4 To Augment Unbalanced Data For Automatic Scoring Fang Luyang, Lee Gyeong-geon, Zhai Xiaoming
- Cerbero-7b: A Leap Forward In Language-specific Llms Through Enhanced Chat Corpus Generation And Evaluation Galatolo Federico A., Cimino Mario G. C. A.
- A Lightweight Method To Generate Unanswerable Questions In English Gautam Vagrant, Zhang Miaoran, Klakow Dietrich
- Ahead-of-time P-tuning Gavrilov Daniil, Balagansky Nikita
- Wanglab At Mediqa-chat 2023: Clinical Note Generation From Doctor-patient Conversations Using Large Language Models Giorgi John, Toma Augustin, Xie Ronald, Chen Sondra S., An Kevin R., Zheng Grace X., Wang Bo
- Lil-bevo: Explorations Of Strategies For Training Language Models In More Humanlike Ways Govindarajan Venkata S, Rodriguez Juan Diego, Bostrom Kaj, Mahowald Kyle
- SPT: Fine-tuning Transformer-based Language Models Efficiently With Sparsification Gui Yuntao, Yan Xiao, Yin Peiqi, Yang Han, Cheng James
- Efficacy Of Machine-generated Instructions Gulati Samaksh, Verma Anshit, Parmar Manoj, Chaudhary Palash
- Longcoder: A Long-range Pre-trained Language Model For Code Completion Guo Daya, Xu Canwen, Duan Nan, Yin Jian, Mcauley Julian
- A Survey Of Vision-language Pre-training From The Lens Of Multimodal Machine Translation Gwinnup Jeremy, Duh Kevin
- Leveraging Large Language Models For Sequential Recommendation Harte Jesse, Zorgdrager Wouter, Louridas Panos, Katsifodimos Asterios, Jannach Dietmar, Fragkoulis Marios
- The Political Ideology Of Conversational AI: Converging Evidence On Chatgpt's Pro-environmental, Left-libertarian Orientation Hartmann Jochen, Schwenzow Jasper, Witte Maximilian
- Multilingual Language Models Are Not Multicultural: A Case Study In Emotion Havaldar Shreya, Rai Sunny, Singhal Bhumika, Liu Langchen, Guntuku Sharath Chandra, Ungar Lyle
- Assessing The Efficacy Of Large Language Models In Generating Accurate Teacher Responses Hicke Yann, Masand Abhishek, Guo Wentao, Gangavarapu Tushaar
- Bad Actor, Good Advisor: Exploring The Role Of Large Language Models In Fake News Detection Hu Beizhe, Sheng Qiang, Cao Juan, Shi Yuhui, Li Yang, Wang Danding, Qi Peng
- Enhancing Answer Selection In Community Question Answering With Pre-trained And Large Language Models Hu Xinghang
- Towards Equipping Transformer With The Ability Of Systematic Compositionality Huang Chen, Qin Peixin, Lei Wenqiang, Lv Jiancheng
- Adapting Pre-trained Language Models To Vision-language Tasks Via Dynamic Visual Prompting Huang Shubin, Wu Qiong, Zhou Yiyi, Chen Weijie, Zhang Rongsheng, Sun Xiaoshuai, Ji Rongrong
- Breaking The Token Barrier: Chunking And Convolution For Efficient Long Text Classification With BERT Jaiswal Aman, Milios Evangelos
- Just CHOP: Embarrassingly Simple LLM Compression Jha Ananya Harsh, Sherborne Tom, Walsh Evan Pete, Groeneveld Dirk, Strubell Emma, Beltagy Iz
- You Only Forward Once: Prediction And Rationalization In A Single Forward Pass Jiang Han, Duan Junwen, Qu Zhe, Wang Jianxin
- Llm-augmented Preference Learning From Natural Language Kang Inwon, Ruan Sikai, Ho Tyler, Lin Jui-chien, Mohsin Farhad, Seneviratne Oshani, Xia Lirong
- Revisiting Intermediate Layer Distillation For Compressing Language Models: An Overfitting Perspective Ko Jongwoo, Park Seungjoon, Jeong Minchan, Hong Sukjin, Ahn Euijai, Chang Du-seong, Yun Se-young
- Transformer Language Models Handle Word Frequency In Prediction Head Kobayashi Goro, Kuribayashi Tatsuki, Yokoi Sho, Inui Kentaro
- Prompt Generate Train (PGT): Few-shot Domain Adaption Of Retrieval Augmented Generation Models For Open Book Question-answering Krishna C. S.
- Certifying LLM Safety Against Adversarial Prompting Kumar Aounon, Agarwal Chirag, Srinivas Suraj, Li Aaron Jiaxun, Feizi Soheil, Lakkaraju Himabindu
- Sensi-bert: Towards Sensitivity Driven Fine-tuning For Parameter-efficient BERT Kundu Souvik, Sridhar Sharath Nittur, Szankin Maciej, Sundaresan Sairam
- DACBERT: Leveraging Dependency Agreement For Cost-efficient BERT Pretraining Kuo Martin, Zhang Jianyi, Chen Yiran
- Chatgpt: Beginning Of An End Of Manual Linguistic Data Annotation? Use Case Of Automatic Genre Identification Kuzman Taja, Mozetič Igor, Ljubešić Nikola
- A Zero-shot And Few-shot Study Of Instruction-finetuned Large Language Models Applied To Clinical And Biomedical Tasks Labrak Yanis, Rouvier Mickael, Dufour Richard
- Fine-tuning Chatgpt For Automatic Scoring Latif Ehsan, Zhai Xiaoming
- Building Efficient Universal Classifiers With Natural Language Inference Laurer Moritz, Van Atteveldt Wouter, Casas Andreu, Welbers Kasper
- Building Multimodal AI Chatbots Lee Min Young
- Do Vision And Language Models Share Concepts? A Vector Space Alignment Study Li Jiaang, Kementchedjhieva Yova, Fierro Constanza, Søgaard Anders
- Compressing Context To Enhance Inference Efficiency Of Large Language Models Li Yucheng, Dong Bo, Lin Chenghua, Guerin Frank
- Label Supervised Llama Finetuning Li Zongxi, Li Xianming, Liu Yuzhang, Xie Haoran, Li Jing, Wang Fu-lee, Li Qing, Zhong Xiaoqin
- Make Pre-trained Model Reversible: From Parameter To Memory Efficient Fine-tuning Liao Baohao, Tan Shaomu, Monz Christof
- Argue With Me Tersely: Towards Sentence-level Counter-argument Generation Lin Jiayu, Ye Rong, Han Meng, Zhang Qi, Lai Ruofei, Zhang Xinyu, Cao Zhao, Huang Xuanjing, Wei Zhongyu
- Nlebench+norglm: A Comprehensive Empirical Analysis And Benchmark Dataset For Generative Language Models In Norwegian Liu Peng, Zhang Lemei, Farup Terje Nissen, Lauvrak Even W., Ingvaldsen Jon Espen, Eide Simen, Gulla Jon Atle, Yang Zhirong
- World-to-words: Grounded Open Vocabulary Acquisition Through Fast Mapping In Vision-language Models Ma Ziqiao, Pan Jiayi, Chai Joyce
- Representation Deficiency In Masked Language Modeling Meng Yu, Krishnan Jitin, Wang Sinong, Wang Qifan, Mao Yuning, Fang Han, Ghazvininejad Marjan, Han Jiawei, Zettlemoyer Luke
- Increasing The Performance Of Cognitively Inspired Data-efficient Language Models Via Implicit Structure Building Momen Omar, Arps David, Kallmeyer Laura
- Navigating Prompt Complexity For Zero-shot Classification: A Study Of Large Language Models In Computational Social Science Mu Yida, Wu Ben P., Thorne William, Robinson Ambrose, Aletras Nikolaos, Scarton Carolina, Bontcheva Kalina, Song Xingyi
- "medium" Lms Of Code In The Era Of Llms: Lessons From Stackoverflow Mukherjee Manisha, Hellendoorn Vincent J.
- On Robustness Of Finetuned Transformer-based NLP Models Neerudu Pavan Kalyan Reddy, Oota Subba Reddy, Marreddy Mounika, Kagita Venkateswara Rao, Gupta Manish
- Trusting Language Models In Education Neto Jogi Suda, Deng Li, Raya Thejaswi, Shahbazi Reza, Liu Nick, Venkatesh Adhitya, Shah Miral, Khosla Neeru, Guido Rodrigo Capobianco
- Exploring The Maze Of Multilingual Modeling Nezhad Sina Bagheri, Agrawal Ameeta
- Retrieving-to-answer: Zero-shot Video Question Answering With Frozen Large Language Models Pan Junting, Lin Ziyi, Ge Yuying, Zhu Xiatian, Zhang Renrui, Wang Yi, Qiao Yu, Li Hongsheng
- Language Model Self-improvement By Reinforcement Learning Contemplation Pang Jing-cheng, Wang Pengyuan, Li Kaiyuan, Chen Xiong-hui, Xu Jiacheng, Zhang Zongzhang, Yu Yang
- Sequencing Matters: A Generate-retrieve-generate Model For Building Conversational Agents Patwardhan Quinn, Yang Grace Hui
- Entity Matching Using Large Language Models Peeters Ralph, Bizer Christian
- Recjpq: Training Large-catalogue Sequential Recommenders Petrov Aleksandr V., Macdonald Craig
- Mosaicbert: A Bidirectional Encoder Optimized For Fast Pretraining Portes Jacob, Trott Alex, Havens Sam, King Daniel, Venigalla Abhinav, Nadeem Moin, Sardana Nikhil, Khudia Daya, Frankle Jonathan
- Transformers In The Service Of Description Logic-based Contexts Poulis Angelos, Tsalapati Eleni, Koubarakis Manolis
- Mini Minds: Exploring Bebeshka And Zlata Baby Models Proskurina Irina, Metzler Guillaume, Velcin Julien
- Cmed-gpt: Prompt Tuning For Entity-aware Chinese Medical Dialogue Generation Qu Zhijie, Li Juan, Ma Zerui, Li Jianqiang
- Quantized Transformer Language Model Implementations On Edge Devices Rahman Mohammad Wali Ur, Abrar Murad Mehrab, Copening Hunter Gibbons, Hariri Salim, Shao Sicong, Satam Pratik, Salehi Soheil
- Mixed-distil-bert: Code-mixed Language Modeling For Bangla, English, And Hindi Raihan Md Nishat, Goswami Dhiman, Mahmud Antara
- Does The "most Sinfully Decadent Cake Ever" Taste Good? Answering Yes/no Questions From Figurative Contexts Rakshit Geetanjali, Flanigan Jeffrey
- Retrieval-augmented Image Captioning Ramos Rita, Elliott Desmond, Martins Bruno
- Precog: Exploring The Relation Between Memorization And Performance In Pre-trained Language Models Ranaldi Leonardo, Ruzzetti Elena Sofia, Zanzotto Fabio Massimo
- Chatgpt-crawler: Find Out If Chatgpt Really Knows What It's Talking About Rangapur Aman, Wang Haoran
- Weakly Supervised Detection Of Hallucinations In LLM Activations Rateike Miriam, Cintas Celia, Wamburu John, Akumu Tanya, Speakman Skyler
- Towards A Performance Analysis On Pre-trained Visual Question Answering Models For Autonomous Driving Rekanar Kaavya, Eising Ciarán, Sistu Ganesh, Hayes Martin
- Commonsense Reasoning For Conversational AI: A Survey Of The State Of The Art Richardson Christopher, Heck Larry
- From Chatbots To Phishbots? -- Preventing Phishing Scams Created Using Chatgpt, Google Bard And Claude Roy Sayak Saha, Thota Poojitha, Naragam Krishna Vamsi, Nilizadeh Shirin
- Naturalness Of Attention: Revisiting Attention In Code Language Models Saad Mootez, Sharma Tushar
- Promptmix: A Class Boundary Augmentation Method For Large Language Model Distillation Sahu Gaurav, Vechtomova Olga, Bahdanau Dzmitry, Laradji Issam H.
- Zero Is Not Hero Yet: Benchmarking Zero-shot Performance Of Llms For Financial Tasks Shah Agam, Chava Sudheer
- Hero: Roberta And Longformer Hebrew Language Models Shalumov Vitaly, Haskey Harel
- Improving The Robustness Of Transformer-based Large Language Models With Dynamic Attention Shen Lujia, Pu Yuwen, Ji Shouling, Li Changjiang, Zhang Xuhong, Ge Chunpeng, Wang Ting
- Explaining Black Box Text Modules In Natural Language With Language Models Singh Chandan, Hsu Aliyah R., Antonello Richard, Jain Shailee, Huth Alexander G., Yu Bin, Gao Jianfeng
- Response-act Guided Reinforced Dialogue Generation For Mental Health Counseling Srivastava Aseem, Pandey Ishan, Akhtar Md. Shad, Chakraborty Tanmoy
- Dial-mae: Contextual Masked Auto-encoder For Retrieval-based Dialogue Systems Su Zhenpeng, Wu Xing, Zhou Wei, Ma Guangyuan, Hu Songlin
- Does Fine-tuning GPT-3 With The Openai API Leak Personally-identifiable Information? Sun Albert Yu, Zemour Eliott, Saxena Arushi, Vaidyanathan Udith, Lin Eric, Lau Christian, Mugunthan Vaikkunth
- Location-aware Visual Question Generation With Lightweight Models Suwono Nicholas Collin, Chen Justin Chih-yao, Hung Tun Min, Huang Ting-hao Kenneth, Liao I-bin, Li Yung-hui, Ku Lun-wei, Sun Shao-hua
- The BEA 2023 Shared Task On Generating AI Teacher Responses In Educational Dialogues Tack Anaïs, Kochmar Ekaterina, Yuan Zheng, Bibauw Serge, Piech Chris
- Massive Editing For Large Language Models Via Meta Learning Tan Chenmien, Zhang Ge, Fu Jie
- Can BERT Refrain From Forgetting On Sequential Tasks? A Probing Study Tao Mingxu, Feng Yansong, Zhao Dongyan
- Better Language Models Of Code Through Self-improvement To Hung Quoc, Bui Nghi D. Q., Guo Jin, Nguyen Tien N.
- Videberta: A Powerful Pre-trained Language Model For Vietnamese Tran Cong Dao, Pham Nhut Huy, Nguyen Anh, Hy Truong Son, Vu Tu
- Chatgpt As A Commenter To The News: Can Llms Generate Human-like Opinions? Tseng Rayden, Verberne Suzan, Van Der Putten Peter
- Automating Code-related Tasks Through Transformers: The Impact Of Pre-training Tufano Rosalia, Pascarella Luca, Bavota Gabriele
- Simple Is Better And Large Is Not Enough: Towards Ensembling Of Foundational Language Models Tyagi Nancy, Shiri Aidin, Sarkar Surjodeep, Umrawal Abhishek Kumar, Gaur Manas
- Mlongt5: A Multilingual And Efficient Text-to-text Transformer For Longer Sequences Uthus David, Ontañón Santiago, Ainslie Joshua, Guo Mandy
- Fairpy: A Toolkit For Evaluation Of Social Biases And Their Mitigation In Large Language Models Viswanath Hrishikesh, Zhang Tianyi
- GPT-RE: In-context Learning For Relation Extraction Using Large Language Models Wan Zhen, Cheng Fei, Mao Zhuoyuan, Liu Qianying, Song Haiyue, Li Jiwei, Kurohashi Sadao
- BERT4CTR: An Efficient Framework To Combine Pre-trained Language Model With Non-textual Features For CTR Prediction Wang Dong, Salamatian Kavé, Xia Yunqing, Deng Weiwei, Zhiang Qi
- Robust Natural Language Understanding With Residual Attention Debiasing Wang Fei, Huang James Y., Yan Tianyi, Zhou Wenxuan, Chen Muhao
- Improved Visual Fine-tuning With Natural Language Supervision Wang Junyang, Xu Yuanhong, Hu Juhua, Yan Ming, Sang Jitao, Qian Qi
- Instructuie: Multi-task Instruction Tuning For Unified Information Extraction Wang Xiao, Zhou Weikang, Zu Can, Xia Han, Chen Tianze, Zhang Yuansen, Zheng Rui, Ye Junjie, Zhang Qi, Gui Tao, Kang Jihua, Yang Jingsheng, Li Siyuan, Du Chunsai
- Gradient-based Word Substitution For Obstinate Adversarial Examples Generation In Language Models Wang Yimu, Shi Peng, Zhang Hongyang
- Hyperpolyglot Llms: Cross-lingual Interpretability In Token Embeddings Wen-yi Andrea W, Mimno David
- Llm-powered Data Augmentation For Enhanced Cross-lingual Performance Whitehouse Chenxi, Choudhury Monojit, Aji Alham Fikri
- Extrapolating Multilingual Understanding Models As Multilingual Generators Wu Bohong, Yuan Fei, Zhao Hai, Li Lei, Xu Jingjing
- TLM: Token-level Masking For Transformers Wu Yangjun, Fang Kebin, Zhang Dongxiang, Wang Han, Zhang Hao, Chen Gang
- Uncertainty-aware Language Modeling For Selective Question Answering Yang Qi, Ravikumar Shreya, Schmitt-ulms Fynn, Lolla Satvik, Demir Ege, Elistratov Iaroslav, Lavaee Alex, Lolla Sadhana, Ahmadi Elaheh, Rus Daniela, Amini Alexander, Perez Alejandro
- Integrating UMLS Knowledge Into Large Language Models For Medical Question Answering Yang Rui, Marrese-taylor Edison, Ke Yuhe, Cheng Lechao, Chen Qingyu, Li Irene
- Bias A-head? Analyzing Bias In Transformer-based Language Model Attention Heads Yang Yi, Duan Hanyu, Abbasi Ahmed, Lalor John P., Tam Kar Yan
- Promptcare: Prompt Copyright Protection By Watermark Injection And Verification Yao Hongwei, Lou Jian, Ren Kui, Qin Zhan
- Diffusion Language Models Can Perform Many Tasks With Scaling And Instruction-finetuning Ye Jiasheng, Zheng Zaixiang, Bao Yu, Qian Lihua, Gu Quanquan
- Where To Go Next For Recommender Systems? ID- Vs. Modality-based Recommender Models Revisited Yuan Zheng, Yuan Fajie, Song Yu, Li Youhua, Fu Junchen, Yang Fei, Pan Yunzhu, Ni Yongxin
- Is Chatgpt A Good Sentiment Analyzer? A Preliminary Study Zengzhi Wang, Qiming Xie, Yi Feng, Zixiang Ding, Zinong Yang, Rui Xia
- Text Alignment Is An Efficient Unified Model For Massive NLP Tasks Zha Yuheng, Yang Yichi, Li Ruichen, Hu Zhiting
- Cot-bert: Enhancing Unsupervised Sentence Representation Through Chain-of-thought Zhang Bowen, Chang Kehua, Li Chunping
- User-controlled Knowledge Fusion In Large Language Models: Balancing Creativity And Hallucination Zhang Chen
- Xdial-eval: A Multilingual Open-domain Dialogue Evaluation Benchmark Zhang Chen, D'haro Luis Fernando, Tang Chengguang, Shi Ke, Tang Guohua, Li Haizhou
- Task-agnostic Distillation Of Encoder-decoder Language Models Zhang Chen, Yang Yang, Wang Jingang, Song Dawei
- Weighted Sampling For Masked Language Modeling Zhang Linhan, Chen Qian, Wang Wen, Deng Chong, Cao Xin, Hao Kongzhang, Jiang Yuxin, Wang Wei
- Lora-fa: Memory-efficient Low-rank Adaptation For Large Language Models Fine-tuning Zhang Longteng, Zhang Lin, Shi Shaohuai, Chu Xiaowen, Li Bo
- Llama-adapter: Efficient Fine-tuning Of Language Models With Zero-init Attention Zhang Renrui, Han Jiaming, Liu Chris, Gao Peng, Zhou Aojun, Hu Xiangfei, Yan Shilin, Lu Pan, Li Hongsheng, Qiao Yu
- Modeling Sequential Sentence Relation To Improve Cross-lingual Dense Retrieval Zhang Shunyu, Liang Yaobo, Gong Ming, Jiang Daxin, Duan Nan
- Can Chatgpt Understand Too? A Comparative Study On Chatgpt And Fine-tuned BERT Zhong Qihuang, Ding Liang, Liu Juhua, Du Bo, Tao Dacheng
- Revisiting Token Dropping Strategy In Efficient BERT Pretraining Zhong Qihuang, Ding Liang, Liu Juhua, Liu Xuebo, Zhang Min, Du Bo, Tao Dacheng
- Flame: Few-shot Learning From Natural Language Explanations Zhou Yangqiaoyu, Zhang Yiming, Tan Chenhao
- Can GPT Models Follow Human Summarization Guidelines? Evaluating Chatgpt And GPT-4 For Dialogue Summarization Zhou Yongxin, Ringeval Fabien, Portet François
- A Family Of Pretrained Transformer Language Models For Russian Zmitrovich Dmitry, Abramov Alexander, Kalmykov Andrey, Tikhonova Maria, Taktasheva Ekaterina, Astafurov Danil, Baushenko Mark, Snegirev Artem, Kadulin Vitalii, Markov Sergey, Shavrina Tatiana, Mikhailov Vladislav, Fenogenova Alena
- A Dataset And Benchmark For Hospital Course Summarization With Adapted Large Language Models Aali Asad, Van Veen Dave, Arefeen Yamin Ishraq, Hom Jason, Bluethgen Christian, Reis Eduardo Pontes, Gatidis Sergios, Clifford Namuun, Daws Joseph, Tehrani Arash S., Kim Jangwon, Chaudhari Akshay S.
- Bcamirs At Semeval-2024 Task 4: Beyond Words: A Multimodal And Multilingual Exploration Of Persuasion In Memes Abaskohi Amirhossein, Dabiriaghdam Amirhossein, Wang Lele, Carenini Giuseppe
- A Survey On Symbolic Knowledge Distillation Of Large Language Models Acharya Kamal, Velasquez Alvaro, Song Houbing Herbert
- Codemirage: Hallucinations In Code Generated By Large Language Models Agarwal Vibhor, Pei Yulong, Alamir Salwa, Liu Xiaomo
- Enhancing Pre-trained Generative Language Models With Question Attended Span Extraction On Machine Reading Comprehension Ai Lin, Hui Zheng, Liu Zizhou, Hirschberg Julia
- BAMO At Semeval-2024 Task 9: BRAINTEASER: A Novel Task Defying Common Sense Ansari Baktash, Rostamkhani Mohammadmostafa, Eetemadi Sauleh
- Simpler Becomes Harder: Do Llms Exhibit A Coherent Behavior On Simplified Corpora? Anschütz Miriam, Mosca Edoardo, Groh Georg
- An Assessment On Comprehending Mental Health Through Large Language Models Arcan Mihael, Niland David-paul, Delahunty Fionn
- Robustsentembed: Robust Sentence Embeddings Using Adversarial Self-supervised Contrastive Learning Asl Javad Rafiei, Panzade Prajwal, Blanco Eduardo, Takabi Daniel, Cai Zhipeng
- CYGENT: A Cybersecurity Conversational Agent With Log Summarization Powered By GPT-3 Balasubramanian Prasasthy, Seby Justin, Kostakos Panos
- Breaking Boundaries: Investigating The Effects Of Model Editing On Cross-linguistic Performance Banerjee Somnath, Halder Avik, Mandal Rajarshi, Layek Sayan, Soboroff Ian, Hazra Rima, Mukherjee Animesh
- Luna: An Evaluation Foundation Model To Catch Language Model Hallucinations With High Accuracy And Low Cost Belyi Masha, Friel Robert, Shao Shuai, Sanyal Atindriyo
- Cross-lingual Editing In Multilingual Language Models Beniwal Himanshu, D Kowsik Nandagopan, Singh Mayank
- Checkembed: Effective Verification Of LLM Solutions To Open-ended Tasks Besta Maciej, Paleari Lorenzo, Kubicek Ales, Nyczyk Piotr, Gerstenberger Robert, Iff Patrick, Lehmann Tomasz, Niewiadomski Hubert, Hoefler Torsten
- Enhancing Code Translation In Language Models With Few-shot Learning Via Retrieval-augmented Generation Bhattarai Manish, Santos Javier E., Jones Shawn, Biswas Ayan, Alexandrov Boian, O'malley Daniel
- Improving Sequential Recommendations With Llms Boz Artun, Zorgdrager Wouter, Kotti Zoe, Harte Jesse, Louridas Panos, Jannach Dietmar, Fragkoulis Marios
- Enhancing Trust In Llms: Algorithms For Comparing And Interpreting Llms Brown Nik Bear
- Fine-tuned 'small' Llms (still) Significantly Outperform Zero-shot Generative AI Models In Text Classification Bucher Martin Juan José, Martini Marco
- Investigating Wit, Creativity, And Detectability Of Large Language Models In Domain-specific Writing Style Adaptation Of Reddit's Showerthoughts Buz Tolga, Frost Benjamin, Genchev Nikola, Schneider Moritz, Kaffee Lucie-aimée, De Melo Gerard
- Low-cost Generation And Evaluation Of Dictionary Example Sentences Cai Bill, Ng Clarence Boon Liang, Tan Daniel, Hotama Shelvia
- Figuring Out Figures: Using Textual References To Caption Scientific Figures Cao Stanley, Liu Kevin
- Head-wise Shareable Attention For Large Language Models Cao Zouying, Yang Yifei, Zhao Hai
- A Review Of Multi-modal Large Language And Vision Models Carolan Kilian, Fennelly Laura, Smeaton Alan F.
- Notellm: A Retrievable Large Language Model For Note Recommendation Chao Zhang, Shiwei Wu, Haoxin Zhang, Tong Xu, Yan Gao, Yao Hu, Di Wu, Enhong Chen
- Enhancing Visual Question Answering Through Ranking-based Hybrid Training And Multimodal Fusion Chen Peiyuan, Zhang Zecheng, Dong Yiping, Zhou Li, Wang Han
- Pragmatic Inference Of Scalar Implicature By Llms Cho Ye-eun, Kim Seong Mook
- Llambert: Large-scale Low-cost Data Annotation In NLP Csanády Bálint, Muzsai Lajos, Vedres Péter, Nádasdy Zoltán, Lukács András
- Synthetic Dataset Creation And Fine-tuning Of Transformer Models For Question Answering In Serbian Cvetanović Aleksa, Tadić Predrag
- Pelle: Encoder-based Language Models For Brazilian Portuguese Based On Open Data De Mello Guilherme Lamartine, Finger Marcelo, Serras Felipe, Carpi Miguel De Mello, Jose Marcos Menon, Domingues Pedro Henrique, Cavalim Paulo
- Evaluating Large Language Models For Automatic Analysis Of Teacher Simulations De-fitero-dominguez David, Albaladejo-gonzález Mariano, Garcia-cabot Antonio, Garcia-lopez Eva, Moreno-cediel Antonio, Barno Erin, Reich Justin
- Facts-and-feelings: Capturing Both Objectivity And Subjectivity In Table-to-text Generation Dey Tathagata, Bhattacharyya Pushpak
- Unlocking Continual Learning Abilities In Language Models Du Wenyu, Cheng Shuang, Luo Tongxu, Qiu Zihan, Huang Zeyu, Cheung Ka Chun, Cheng Reynold, Fu Jie
- Perltqa: A Personal Long-term Memory Dataset For Memory Classification, Retrieval, And Synthesis In Question Answering Du Yiming, Wang Hongru, Zhao Zhengyi, Liang Bin, Wang Baojun, Zhong Wanjun, Wang Zezhong, Wong Kam-fai
- CLEFT: Language-image Contrastive Learning With Efficient Large Language Model And Prompt Fine-tuning Du Yuexi, Chang Brian, Dvornek Nicha C.
- Enabling Natural Zero-shot Prompting On Encoder Models Via Statement-tuning Elshabrawy Ahmed, Huang Yongxin, Gurevych Iryna, Aji Alham Fikri
- Bertaqa: How Much Do Language Models Know About Local Culture? Etxaniz Julen, Azkune Gorka, Soroa Aitor, De Lacalle Oier Lopez, Artetxe Mikel
- Data-augmentation-based Dialectal Adaptation For Llms Faisal Fahim, Anastasopoulos Antonios
- Unraveling The Dominance Of Large Language Models Over Transformer Models For Bangla Natural Language Inference: A Comprehensive Study Faria Fatema Tuj Johora, Moin Mukaffi Bin, Fahim Asif Iftekher, Debnath Pronay, Shah Faisal Muhammad
- An Extremely Data-efficient And Generative Llm-based Reinforcement Learning Agent For Recommenders Feng Shuang, Feng Grace
- Llama-vits: Enhancing TTS Synthesis With Semantic Awareness Feng Xincan, Yoshimoto Akifumi
- Generative AI And Large Language Models For Cyber Security: All Insights You Need Ferrag Mohamed Amine, Alwahedi Fatima, Battah Ammar, Cherif Bilel, Mechri Abdechakour, Tihanyi Norbert
- Student Answer Forecasting: Transformer-driven Answer Choice Prediction For Language Learning Gado Elena Grazia, Martorella Tommaso, Zunino Luca, Mejia-domenzain Paola, Swamy Vinitra, Frej Jibril, Käser Tanja
- Improving Language Understanding From Screenshots Gao Tianyu, Wang Zirui, Bhaskar Adithya, Chen Danqi
- A Review On The Use Of Large Language Models As Virtual Tutors García-méndez Silvia, De Arriba-pérez Francisco, Somoza-lópez María Del Carmen
- Assessing Generative Language Models In Classification Tasks: Performance And Self-evaluation Capabilities In The Environmental And Climate Change Domain Grasso Francesca, Locci Stefano
- Fine-tuning Large Language Models For Stock Return Prediction Using Newsflow Guo Tian, Hauptmann Emmanuel
- Multilingual Large Language Models And Curse Of Multilinguality Gurgurov Daniil, Bäumel Tanja, Anikina Tatiana
- Quality Of Answers Of Generative Large Language Models Vs Peer Patients For Interpreting Lab Test Results For Lay Patients: Evaluation Study He Zhe, Bhasuran Balu, Jin Qiao, Tian Shubo, Hanna Karim, Shavor Cindy, Arguello Lisbeth Garcia, Murray Patrick, Lu Zhiyong
- The Effect Of Model Size On LLM Post-hoc Explainability Via LIME Heyen Henning, Widdicombe Amy, Siegel Noah Y., Perez-ortiz Maria, Treleaven Philip
- Large Language Models For Expansion Of Spoken Language Understanding Systems To New Languages Hoscilowicz Jakub, Pawlowski Pawel, Skorupa Marcin, Sowański Marcin, Janicki Artur
- Outlier-efficient Hopfield Layers For Large Transformer-based Models Hu Jerry Yao-chieh, Chang Pei-hsuan, Luo Robin, Chen Hong-yu, Li Weijian, Wang Wei-po, Liu Han
- Leveraging Large Language Models For Enhanced NLP Task Performance Through Knowledge Distillation And Optimized Training Strategies Huang Yining, Tang Keke, Chen Meilian
- A Comprehensive Survey On Evaluating Large Language Model Applications In The Medical Industry Huang Yining, Tang Keke, Chen Meilian, Wang Boyuan
- Pc-lora: Low-rank Adaptation For Progressive Model Compression With Knowledge Distillation Hwang Injoon, Park Haewon, Lee Youngwan, Yang Jooyoung, Maeng Sunjae
- Autocompletion Of Chief Complaints In The Electronic Health Records Using Large Language Models Islam K M Sajjadul, Nipu Ayesha Siddika, Madiraju Praveen, Deshpande Priya
- Towards Flexible Evaluation For Generative Visual Question Answering Ji Huishan, Si Qingyi, Lin Zheng, Wang Weiping
- Large Visual-language Models Are Also Good Classifiers: A Study Of In-context Multimodal Fake News Detection Jiang Ye, Wang Yimin
- Evaluating The Impact Of Advanced LLM Techniques On Ai-lecture Tutors For A Robotics Course Kahl Sebastian, Löffler Felix, Maciol Martin, Ridder Fabian, Schmitz Marius, Spanagel Jennifer, Wienkamp Jens, Burgahn Christopher, Schilling Malte
- Counterfactuals As A Means For Evaluating Faithfulness Of Attribution Methods In Autoregressive Language Models Kamahi Sepehr, Yaghoobzadeh Yadollah
- Can Reinforcement Learning Unlock The Hidden Dangers In Aligned Large Language Models? Karkevandi Mohammad Bahrami, Vishwamitra Nishant, Najafirad Peyman
- From Text To Transformation: A Comprehensive Review Of Large Language Models' Versatility Kaur Pravneet, Kashyap Gautam Siddharth, Kumar Ankit, Nafis Md Tabrez, Kumar Sandeep, Shokeen Vikrant
- On Initializing Transformers With Pre-trained Embeddings Kim Ha Young, Balasubramanian Niranjan, Kang Byungkon
- Reinforcement Learning For Optimizing RAG For Domain Chatbots Kulkarni Mandar, Tangarajan Praveen, Kim Kyung, Trivedi Anusua
- Nv-embed: Improved Techniques For Training Llms As Generalist Embedding Models Lee Chankyu, Roy Rajarshi, Xu Mengyao, Raiman Jonathan, Shoeybi Mohammad, Catanzaro Bryan, Ping Wei
- D2LLM: Decomposed And Distilled Large Language Models For Semantic Search Liao Zihan, Yu Hang, Li Jianguo, Wang Jun, Zhang Wei
- Adapting Mental Health Prediction Tasks For Cross-lingual Learning Via Meta-training And In-context Learning With Large Language Model Lifelo Zita, Ning Huansheng, Dhelim Sahraoui
- Syncpkl: Harnessing Llms To Generate Synthetic Data For Commonsense Persona Knowledge Linking Lin Kuan-yen
- VP-LLM: Text-driven 3D Volume Completion With Large Language Models Through Patchification Liu Jianmeng, Liu Yichen, Zhang Yuyao, Meng Zeyuan, Tai Yu-wing, Tang Chi-keung
- Enhancing Llm's Cognition Via Structurization Liu Kai, Fu Zhihang, Chen Chao, Zhang Wei, Jiang Rongxin, Zhou Fan, Chen Yaowu, Wu Yue, Ye Jieping
- Fantastic Semantics And Where To Find Them: Investigating Which Layers Of Generative Llms Reflect Lexical Semantics Liu Zhu, Kong Cunliang, Liu Ying, Sun Maosong
- Can Perplexity Predict Fine-tuning Performance? An Investigation Of Tokenization Effects On Sequential Language Models For Nepali Luitel Nishant, Bekoju Nirajan, Sah Anand Kumar, Shakya Subarna
- Large Language Models As Foundations For Next-gen Dense Retrieval: A Comprehensive Empirical Assessment Luo Kun, Qin Minghao, Liu Zheng, Xiao Shitao, Zhao Jun, Liu Kang
- Impact Of Model Size On Fine-tuned LLM Performance In Data-to-text Generation: A State-of-the-art Investigation Mahapatra Joy, Garain Utpal
- Exploration Of Masked And Causal Language Modelling For Text Generation Micheletti Nicolo, Belkadi Samuel, Han Lifeng, Nenadic Goran
- CBF-LLM: Safe Control For LLM Alignment Miyaoka Yuya, Inoue Masaki
- RDBE: Reasoning Distillation-based Evaluation Enhances Automatic Essay Scoring Mohammadkhani Ali Ghiasvand
- Towards Generating Informative Textual Description For Neurons In Language Models Mondal Shrayani, Garodia Rishabh, Qureshi Arbaaz, Lee Taesung, Park Youngja
- What Drives Performance In Multilingual Language Models? Nezhad Sina Bagheri, Agrawal Ameeta
- How Reliable AI Chatbots Are For Disease Prediction From Patient Complaints? Nipu Ayesha Siddika, Islam K M Sajjadul, Madiraju Praveen
- Improving Black-box Robustness With In-context Rewriting O'brien Kyle, Ng Nathan, Puri Isha, Mendez Jorge, Palangi Hamid, Kim Yoon, Ghassemi Marzyeh, Hartvigsen Thomas
- NLP At UC Santa Cruz At Semeval-2024 Task 5: Legal Answer Validation Using Few-shot Multi-choice QA Pahilajani Anish, Jain Samyak Rajesh, Trivedi Devasha
- Training Llms To Recognize Hedges In Spontaneous Narratives Paige Amie J., Soubki Adil, Murzaku John, Rambow Owen, Brennan Susan E.
- Llmlingua-2: Data Distillation For Efficient And Faithful Task-agnostic Prompt Compression Pan Zhuoshi, Wu Qianhui, Jiang Huiqiang, Xia Menglin, Luo Xufang, Zhang Jue, Lin Qingwei, Rühle Victor, Yang Yuqing, Lin Chin-yew, Zhao H. Vicky, Qiu Lili, Zhang Dongmei
- Answer Is All You Need: Instruction-following Text Embedding Via Answering The Question Peng Letian, Zhang Yuwei, Wang Zilong, Srinivasa Jayanth, Liu Gaowen, Wang Zihan, Shang Jingbo
- FACTOID: Factual Entailment For Hallucination Detection Rawte Vipula, Tonmoy S. M. Towhidul Islam, Rajbangshi Krishnav, Nag Shravani, Chadha Aman, Sheth Amit P., Das Amitava
- Are Electra's Sentence Embeddings Beyond Repair? The Case Of Semantic Textual Similarity Rep Ivan, Dukić David, Šnajder Jan
- From PEFT To DEFT: Parameter Efficient Finetuning For Reducing Activation Density In Transformers Runwal Bharat, Pedapati Tejaswini, Chen Pin-yu
- Benchmarking And Building Long-context Retrieval Models With Loco And M2-BERT Saad-falcon Jon, Fu Daniel Y., Arora Simran, Guha Neel, Ré Christopher
- Mixsumm: Topic-based Data Augmentation Using Llms For Low-resource Extractive Text Summarization Sahu Gaurav, Laradji Issam H.
- Berts Are Generative In-context Learners Samuel David
- Evaluating Consistency And Reasoning Capabilities Of Large Language Models Saxena Yash, Chopra Sarthak, Tripathi Arunendra Mani
- Evaluating Text Summaries Generated By Large Language Models Using Openai's GPT Shakil Hassan, Mahi Atqiya Munawara, Nguyen Phuoc, Ortiz Zeydy, Mardini Mamoun T.
- Medadapter: Efficient Test-time Adaptation Of Large Language Models Towards Medical Reasoning Shi Wenqi, Xu Ran, Zhuang Yuchen, Yu Yue, Wu Hang, Yang Carl, Wang May D.
- Scaling Synthetic Logical Reasoning Datasets With Context-sensitive Declarative Grammars Sileo Damien
- Robustness Of Llms To Perturbations In Text Singh Ayush, Singh Navpreet, Vatsal Shubham
- Sharelora: Parameter Efficient And Robust Large Language Model Fine-tuning Via Shared Low-rank Adaptation Song Yurun, Zhao Junchen, Harris Ian G., Jyothi Sangeetha Abdu
- Empo: Theory-driven Dataset Construction For Empathetic Response Generation Through Preference Optimization Sotolar Ondrej
- Tailoring Vaccine Messaging With Common-ground Opinions Stureborg Rickard, Chen Sanxing, Xie Ruoyu, Patel Aayushi, Li Christopher, Zhu Chloe Qinyu, Hu Tingnan, Yang Jun, Dhingra Bhuwan
- PIXAR: Auto-regressive Language Modeling In Pixel Space Tai Yintao, Liao Xiyang, Suglia Alessandro, Vergari Antonio
- VBART: The Turkish LLM Turker Meliksah, Ari Mehmet Erdi, Han Aydin
- PARADISE: Evaluating Implicit Planning Skills Of Language Models With Procedural Warnings And Tips Dataset Uzunoglu Arda, Safa Abdalfatah Rashid, Şahin Gözde Gül
- RAG Based Question-answering For Contextual Response Prediction System Veturi Sriram, Vaichal Saurabh, Jagadheesh Reshma Lal, Tripto Nafis Irtiza, Yan Nian
- Datavist5: A Pre-trained Language Model For Jointly Understanding Text And Data Visualization Wan Zhuoyue, Song Yuanfeng, Li Shuaimin, Zhang Chen Jason, Wong Raymond Chi-wing
- Reconstruct The Pruned Model Without Any Retraining Wang Pingjie, Fan Ziqing, Hu Shengchao, Chen Zhe, Wang Yanfeng, Wang Yu
- Ladic: Are Diffusion Models Really Inferior To Autoregressive Counterparts For Image-to-text Generation? Wang Yuchi, Ren Shuhuai, Gao Rundong, Yao Linli, Guo Qingyan, An Kaikai, Bai Jianhong, Sun Xu
- Promptmm: Multi-modal Knowledge Distillation For Recommendation With Prompt-tuning Wei Wei, Tang Jiabin, Jiang Yangqin, Xia Lianghao, Huang Chao
- Which Questions Should I Answer? Salience Prediction Of Inquisitive Questions Wu Yating, Mangla Ritika, Dimakis Alexandros G., Durrett Greg, Li Junyi Jessy
- Acquiring Clean Language Models From Backdoor Poisoned Datasets By Downscaling Frequency Space Wu Zongru, Zhang Zhuosheng, Cheng Pengzhou, Liu Gongshen
- Seamless Language Expansion: Enhancing Multilingual Mastery In Self-supervised Models Xu Jing, Wu Minglin, Wu Xixin, Meng Helen
- Embedded Translations For Low-resource Automated Glossing Yang Changbing, Nicolai Garrett, Silfverberg Miikka
- Aligning Llms Through Multi-perspective User Preference Ranking-based Feedback For Programming Question Answering Yang Hongyu, He Liyang, Hou Min, Shen Shuanghong, Li Rui, Hou Jiahui, Ma Jianhui, Zhao Junda
- Adazeta: Adaptive Zeroth-order Tensor-train Adaption For Memory-efficient Large Language Models Fine-tuning Yang Yifan, Zhen Kai, Banijamal Ershad, Mouchtaris Athanasios, Zhang Zheng
- Making The Most Of Your Model: Methods For Finetuning And Applying Pretrained Transformers Yoshida Davis
- Improving Structural Diversity Of Blackbox Llms Via Chain-of-specification Prompting Young Halley, Zeng Yimeng, Gardner Jacob, Bastani Osbert
- Tcmbench: A Comprehensive Benchmark For Evaluating Large Language Models In Traditional Chinese Medicine Yue Wenjing, Wang Xiaoling, Zhu Wei, Guan Ming, Zheng Huanran, Wang Pengfei, Sun Changzhi, Ma Xin
- Focus On The Core: Efficient Attention Via Pruned Token Compression For Document Classification Yun Jungmin, Kim Mihyeon, Kim Youngbin
- The Hedgehog & The Porcupine: Expressive Linear Attentions With Softmax Mimicry Zhang Michael, Bhatia Kush, Kumbong Hermann, Ré Christopher
- Mindspeech: Continuous Imagined Speech Decoding Using High-density Fnirs And Prompt Tuning For Advanced Human-ai Interaction Zhang Suyi, Alam Ekram, Baber Jack, Bianco Francesca, Turner Edward, Chamanzar Maysam, Dehghani Hamid
- MLKD-BERT: Multi-level Knowledge Distillation For Pre-trained Language Models Zhang Ying, Yang Ziheng, Ji Shufan
- APT: Adaptive Pruning And Tuning Pretrained Language Models For Efficient Training And Inference Zhao Bowen, Hajishirzi Hannaneh, Cao Qingqing
- Galore: Memory-efficient LLM Training By Gradient Low-rank Projection Zhao Jiawei, Zhang Zhenyu, Chen Beidi, Wang Zhangyang, Anandkumar Anima, Tian Yuandong
- Mlps Compass: What Is Learned When Mlps Are Combined With Plms? Zhou Li, Chen Wenyu, Cao Yong, Zeng Dingyi, Liu Wanlong, Qu Hong
- How Privacy-savvy Are Large Language Models? A Case Study On Compliance And Privacy Technical Review Zhu Xichou, Liu Yang, Shen Zhou, Liu Yi, Li Min, Chen Yujun, John Benzi, Ma Zhenzhen, Hu Tao, Yang Bolong, Wang Manman, Xie Zongxing, Liu Peng, Cai Dan, Wang Junhui
- Is Larger Always Better? Evaluating And Prompting Large Language Models For Non-generative Medical Tasks Zhu Yinghao, Gao Junyi, Wang Zixiang, Liao Weibin, Zheng Xiaochen, Liang Lifang, Wang Yasha, Pan Chengwei, Harrison Ewen M., Ma Liantao
🏷 Bias Mitigation
- Learning To Deceive With Attention-based Explanations Pruthi Danish, Gupta Mansi, Dhingra Bhuwan, Neubig Graham, Lipton Zachary C.
- Robustness Challenges In Model Distillation And Pruning For Natural Language Understanding Du Mengnan, Mukherjee Subhabrata, Cheng Yu, Shokouhi Milad, Hu Xia, Awadallah Ahmed Hassan
- Societal Biases In Language Generation: Progress And Challenges Sheng Emily, Chang Kai-wei, Natarajan Premkumar, Peng Nanyun
- A Disability Lens Towards Biases In GPT-3 Generated Open-ended Languages Amin Akhter Al, Kabir Kazi Sinthia
- Holistic Evaluation Of Language Models Percy Liang, Rishi Bommasani, Tony Lee, Dimitris Tsipras, Dilara Soylu, Michihiro Yasunaga, Yian Zhang, Deepak Narayanan, Yuhuai Wu, Ananya Kumar, Benjamin Newman, Binhang Yuan, Bobby Yan, Ce Zhang, Christian Cosgrove, Christopher D. Manning, Christopher Ré, Diana Acosta-navas, Drew A. Hudson, Eric Zelikman, Esin Durmus, Faisal Ladhak, Frieda Rong, Hongyu Ren, Huaxiu Yao, Jue Wang, Keshav Santhanam, Laurel Orr, Lucia Zheng, Mert Yuksekgonul, Mirac Suzgun, Nathan Kim, Neel Guha, Niladri Chatterji, Omar Khattab, Peter Henderson, Qian Huang, Ryan Chi, Sang Michael Xie, Shibani Santurkar, Surya Ganguli, Tatsunori Hashimoto, Thomas Icard, Tianyi Zhang, Vishrav Chaudhary, William Wang, Xuechen Li, Yifan Mai, Yuhui Zhang, Yuta Koreeda
- Chatgpt: The End Of Online Exam Integrity? Susnjak Teo
- H2ogpt: Democratizing Large Language Models Candel Arno, Mckinney Jon, Singer Philipp, Pfeiffer Pascal, Jeblick Maximilian, Prabhu Prithvi, Gambera Jeff, Landry Mark, Bansal Shivam, Chesler Ryan, Lee Chun Ming, Conde Marcos V., Stetsenko Pasha, Grellier Olivier, Ambati Srisatish
- Challenges And Contributing Factors In The Utilization Of Large Language Models (llms) Chen Xiaoliang, Li Liangbin, Chang Le, Huang Yunhe, Zhao Yuxuan, Zhang Yuxiao, Li Dinuo
- Complex QA And Language Models Hybrid Architectures, Survey Daull Xavier, Bellot Patrice, Bruno Emmanuel, Martin Vincent, Murisasco Elisabeth
- Scaling Vision Transformers To 22 Billion Parameters Dehghani Mostafa, Djolonga Josip, Mustafa Basil, Padlewski Piotr, Heek Jonathan, Gilmer Justin, Steiner Andreas, Caron Mathilde, Geirhos Robert, Alabdulmohsin Ibrahim, Jenatton Rodolphe, Beyer Lucas, Tschannen Michael, Arnab Anurag, Wang Xiao, Riquelme Carlos, Minderer Matthias, Puigcerver Joan, Evci Utku, Kumar Manoj, Van Steenkiste Sjoerd, Elsayed Gamaleldin F., Mahendran Aravindh, Yu Fisher, Oliver Avital, Huot Fantine, Bastings Jasmijn, Collier Mark Patrick, Gritsenko Alexey, Birodkar Vighnesh, Vasconcelos Cristina, Tay Yi, Mensink Thomas, Kolesnikov Alexander, Pavetić Filip, Tran Dustin, Kipf Thomas, Lučić Mario, Zhai Xiaohua, Keysers Daniel, Harmsen Jeremiah, Houlsby Neil
- Fairness Of Chatgpt And The Role Of Explainable-guided Prompts Deldjoo Yashar
- Can Instruction Fine-tuned Language Models Identify Social Bias Through Prompting? Dige Omkar, Tian Jacob-junqi, Emerson David, Khattak Faiza Khan
- Probing Explicit And Implicit Gender Bias Through LLM Conditional Text Generation Dong Xiangjue, Wang Yibo, Yu Philip S., Caverlee James
- ROBBIE: Robust Bias Evaluation Of Large Generative Language Models Esiobu David, Tan Xiaoqing, Hosseini Saghar, Ung Megan, Zhang Yuchen, Fernandes Jude, Dwivedi-yu Jane, Presani Eleonora, Williams Adina, Smith Eric Michael
- Bias Runs Deep: Implicit Reasoning Biases In Persona-assigned Llms Gupta Shashank, Shrivastava Vaishnavi, Deshpande Ameet, Kalyan Ashwin, Clark Peter, Sabharwal Ashish, Khot Tushar
- Is Chatgpt Fair For Recommendation? Evaluating Fairness In Large Language Model Recommendation Zhang Jizhi, Bao Keqin, Zhang Yang, Wang Wenjie, Feng Fuli, He Xiangnan
- Robust Recommender System: A Survey And Future Directions Zhang Kaike, Cao Qi, Sun Fei, Wu Yunfan, Tao Shuchang, Shen Huawei, Cheng Xueqi
- Mvbench: A Comprehensive Multi-modal Video Understanding Benchmark Li Kunchang, Wang Yali, He Yinan, Li Yizhuo, Wang Yi, Liu Yi, Wang Zun, Xu Jilan, Chen Guo, Luo Ping, Wang Limin, Qiao Yu
- A Survey On Fairness In Large Language Models Li Yingji, Du Mengnan, Song Rui, Wang Xin, Wang Ying
- GPT Detectors Are Biased Against Non-native English Writers Liang Weixin, Yuksekgonul Mert, Mao Yining, Wu Eric, Zou James
- Opening Up Chatgpt: Tracking Openness, Transparency, And Accountability In Instruction-tuned Text Generators Liesenfeld Andreas, Lopez Alianda, Dingemanse Mark
- Trustworthy Llms: A Survey And Guideline For Evaluating Large Language Models' Alignment Liu Yang, Yao Yuanshun, Ton Jean-francois, Zhang Xiaoying, Guo Ruocheng, Cheng Hao, Klochkov Yegor, Taufiq Muhammad Faaiz, Li Hang
- Fairness-guided Few-shot Prompting For Large Language Models Ma Huan, Zhang Changqing, Bian Yatao, Liu Lemao, Zhang Zhirui, Zhao Peilin, Zhang Shu, Fu Huazhu, Hu Qinghua, Wu Bingzhe
- How Trustworthy Are Open-source Llms? An Assessment Under Malicious Demonstrations Shows Their Vulnerabilities Mo Lingbo, Wang Boshi, Chen Muhao, Sun Huan
- N-critics: Self-refinement Of Large Language Models With Ensemble Of Critics Mousavi Sajad, Gutiérrez Ricardo Luna, Rengarajan Desik, Gundecha Vineet, Babu Ashwin Ramesh, Naug Avisek, Guillen Antonio, Sarkar Soumyendu
- A Brief History Of Prompt: Leveraging Language Models. (through Advanced Prompting) Muktadir Golam Md
- Enhancing Logical Reasoning In Large Language Models To Facilitate Legal Applications Nguyen Ha-thanh, Fungwacharakorn Wachara, Satoh Ken
- Large Language Model (LLM) Bias Index -- LLMBI Oketunji Abiodun Finbarrs, Anas Muhammad, Saina Deepthi
- Reproducing Whisper-style Training Using An Open-source Toolkit And Publicly Available Data Peng Yifan, Tian Jinchuan, Yan Brian, Berrebbi Dan, Chang Xuankai, Li Xinjian, Shi Jiatong, Arora Siddhant, Chen William, Sharma Roshan, Zhang Wangyou, Sudo Yui, Shakeel Muhammad, Jung Jee-weon, Maiti Soumi, Watanabe Shinji
- Can Chatgpt Assess Human Personalities? A General Evaluation Framework Rao Haocong, Leung Cyril, Miao Chunyan
- Soft-prompt Tuning For Large Language Models To Evaluate Bias Tian Jacob-junqi, Emerson David, Miyandoab Sevil Zanjani, Pandya Deval, Seyyed-kalantari Laleh, Khattak Faiza Khan
- Post Turing: Mapping The Landscape Of LLM Evaluation Tikhonov Alexey, Yamshchikov Ivan P.
- Robust Natural Language Understanding With Residual Attention Debiasing Wang Fei, Huang James Y., Yan Tianyi, Zhou Wenxuan, Chen Muhao
- Adding Guardrails To Advanced Chatbots Wang Yanchen, Singh Lisa
- A Preliminary Study Of Chatgpt On News Recommendation: Personalization, Provider Fairness, Fake News Li Xinyi, Zhang Yongfeng, Malthouse Edward C.
- Bias And Fairness In Chatbots: An Overview Xue Jintang, Wang Yun-cheng, Wei Chengwei, Liu Xiaofeng, Woo Jonghye, Kuo C.-c. Jay
- Bias A-head? Analyzing Bias In Transformer-based Language Model Attention Heads Yang Yi, Duan Hanyu, Abbasi Ahmed, Lalor John P., Tam Kar Yan
- Fairness-aware Structured Pruning In Transformers Zayed Abdelrahman, Mordido Goncalo, Shabanian Samira, Baldini Ioana, Chandar Sarath
- Taxonomy-based Checklist For Large Language Model Evaluation Zhang Damin
- Open (clinical) Llms Are Sensitive To Instruction Phrasings Arroyo Alberto Mario Ceballos, Munnangi Monica, Sun Jiuding, Zhang Karen Y. C., Mcinerney Denis Jered, Wallace Byron C., Amir Silvio
- Current State Of LLM Risks And AI Guardrails Ayyamperumal Suriya Ganesh, Ge Limin
- Downstream Bias Mitigation Is All You Need Baksi Arkadeep, Singh Rahul, Joshi Tarun
- From 'showgirls' To 'performers': Fine-tuning With Gender-inclusive Language For Bias Reduction In Llms Bartl Marion, Leavy Susan
- Enhancing Trust In Llms: Algorithms For Comparing And Interpreting Llms Brown Nik Bear
- Are Large Language Models Really Bias-free? Jailbreak Prompts For Assessing Adversarial Robustness To Bias Elicitation Cantini Riccardo, Cosenza Giada, Orsino Alessio, Talia Domenico
- Editable Fairness: Fine-grained Bias Mitigation In Language Models Chen Ruizhe, Li Yichen, Yang Jianfei, Zhou Joey Tianyi, Liu Zuozhu
- Open Source Conversational Llms Do Not Know Most Spanish Words Conde Javier, González Miguel, Melero Nina, Ferrando Raquel, Martínez Gonzalo, Merino-gómez Elena, Hernández José Alberto, Reviriego Pedro
- Out-of-context Prompting Boosts Fairness And Robustness In Large Language Model Predictions Cotta Leonardo, Maddison Chris J.
- Low-rank Finetuning For Llms: A Fairness Perspective Das Saswat, Romanelli Marco, Tran Cuong, Reza Zarreen, Kailkhura Bhavya, Fioretto Ferdinando
- Aligning Model Evaluations With Human Preferences: Mitigating Token Count Bias In Language Model Assessments Daynauth Roland, Mars Jason
- Understanding Biases In Chatgpt-based Recommender Systems: Provider Fairness, Temporal Stability, And Recency Deldjoo Yashar
- AXOLOTL: Fairness Through Assisted Self-debiasing Of Large Language Model Outputs Ebrahimi Sana, Chen Kaiwen, Asudeh Abolfazl, Das Gautam, Koudas Nick
- Cognitive Bias In High-stakes Decision-making With Llms Echterhoff Jessica, Liu Yao, Alessa Abeer, Mcauley Julian, He Zexue
- Biasalert: A Plug-and-play Tool For Social Bias Detection In Llms Fan Zhiting, Chen Ruizhe, Xu Ruiling, Liu Zuozhu
- RAVE: Residual Vector Embedding For Clip-guided Backlit Image Enhancement Gaintseva Tatiana, Benning Martin, Slabaugh Gregory
- Mechanistic Interpretability Of Large Language Models With Applications To The Financial Services Industry Golgoon Ashkan, Filom Khashayar, Kannan Arjun Ravi
- Chatgpt Based Data Augmentation For Improved Parameter-efficient Debiasing Of Llms Han Pengrui, Kocielnik Rafal, Saravanan Adhithya, Jiang Roy, Sharir Or, Anandkumar Anima
- Prompt-based Bias Calibration For Better Zero/few-shot Learning Of Language Models He Kang, Long Yinghan, Roy Kaushik
- Applying And Evaluating Large Language Models In Mental Health Care: A Scoping Review Of Human-assessed Generative Tasks Hua Yining, Na Hongbin, Li Zehan, Liu Fenglin, Fang Xiao, Clifton David, Torous John
- Trustllm: Trustworthiness In Large Language Models Huang Yue, Sun Lichao, Wang Haoran, Wu Siyuan, Zhang Qihui, Li Yuan, Gao Chujie, Huang Yixin, Lyu Wenhan, Zhang Yixuan, Li Xiner, Liu Zhengliang, Liu Yixin, Wang Yijue, Zhang Zhikun, Vidgen Bertie, Kailkhura Bhavya, Xiong Caiming, Xiao Chaowei, Li Chunyuan, Xing Eric, Huang Furong, Liu Hao, Ji Heng, Wang Hongyi, Zhang Huan, Yao Huaxiu, Kellis Manolis, Zitnik Marinka, Jiang Meng, Bansal Mohit, Zou James, Pei Jian, Liu Jian, Gao Jianfeng, Han Jiawei, Zhao Jieyu, Tang Jiliang, Wang Jindong, Vanschoren Joaquin, Mitchell John, Shu Kai, Xu Kaidi, Chang Kai-wei, He Lifang, Huang Lifu, Backes Michael, Gong Neil Zhenqiang, Yu Philip S., Chen Pin-yu, Gu Quanquan, Xu Ran, Ying Rex, Ji Shuiwang, Jana Suman, Chen Tianlong, Liu Tianming, Zhou Tianyi, Wang William, Li Xiang, Zhang Xiangliang, Wang Xiao, Xie Xing, Chen Xun, Wang Xuyu, Liu Yan, Ye Yanfang, Cao Yinzhi, Chen Yong, Zhao Yue
- Grade Score: Quantifying LLM Performance In Option Selection Iourovitski Dmitri
- Navigating LLM Ethics: Advancements, Challenges, And Future Directions Jiao Junfeng, Afroogh Saleh, Xu Yiming, Phillips Connor
- A2SF: Accumulative Attention Scoring With Forgetting Factor For Token Pruning In Transformer Decoder Jo Hyun-rae, Shin Dongkun
- Steering Llms Towards Unbiased Responses: A Causality-guided Debiasing Framework Li Jingling, Tang Zeyu, Liu Xiaoyu, Spirtes Peter, Zhang Kun, Leqi Liu, Liu Yang
- Red Teaming Visual Language Models Li Mukai, Li Lei, Yin Yuwei, Ahmed Masood, Liu Zhenguang, Liu Qi
- On The Intrinsic Self-correction Capability Of Llms: Uncertainty And Latent Concept Liu Guangliang, Mao Haitao, Cao Bochuan, Xue Zhiyu, Johnson Kristen, Tang Jiliang, Wang Rongrong
- Towards Transfer Unlearning: Empirical Evidence Of Cross-domain Bias Mitigation Lu Huimin, Isonuma Masaru, Mori Junichiro, Sakata Ichiro
- Do Multilingual Large Language Models Mitigate Stereotype Bias? Nie Shangrui, Fromm Michael, Welch Charles, Görge Rebekka, Karimi Akbar, Plepi Joan, Mowmita Nazia Afsan, Flores-herr Nicolas, Ali Mehdi, Flek Lucie
- Offsetbias: Leveraging Debiased Data For Tuning Evaluators Park Junsoo, Jwa Seungyeon, Ren Meiying, Kim Daeyoung, Choi Sanghyuk
- Securing Large Language Models: Addressing Bias, Misinformation, And Prompt Attacks Peng Benji, Chen Keyu, Li Ming, Feng Pohsun, Bi Ziqian, Liu Junyu, Niu Qian
- Towards Tracing Trustworthiness Dynamics: Revisiting Pre-training Period Of Large Language Models Qian Chen, Zhang Jie, Yao Wei, Liu Dongrui, Yin Zhenfei, Qiao Yu, Liu Yong, Shao Jing
- Evaluating Llms For Gender Disparities In Notable Persons Rhue Lauren, Goethals Sofie, Sundararajan Arun
- Risks, Causes, And Mitigations Of Widespread Deployments Of Large Language Models (llms): A Survey Sakib Md Nazmus, Islam Md Athikul, Pathak Royal, Arifin Md Mashrur
- Textmachina: Seamless Generation Of Machine-generated Text Datasets Sarvazyan Areg Mikael, González José Ángel, Franco-salvador Marc
- Using Chatgpt To Score Essays And Short-form Constructed Responses Shermis Mark D.
- Judging The Judges: A Systematic Investigation Of Position Bias In Pairwise Comparative Assessments By Llms Shi Lin, Ma Chiyu, Ma Weicheng, Vosoughi Soroush
- Can Many-shot In-context Learning Help Long-context LLM Judges? See More, Judge Better! Song Mingyang, Zheng Mao, Luo Xuan
- Expert-guided Extinction Of Toxic Tokens For Debiased Generation Sun Xueyao, Shi Kaize, Tang Haoran, Xu Guandong, Li Qing
- Improving Large Language Model (LLM) Fidelity Through Context-aware Grounding: A Systematic Approach To Reliability And Veracity Talukdar Wrick, Biswas Anjanava
- Insaaf: Incorporating Safety Through Accuracy And Fairness | Are Llms Ready For The Indian Legal Domain? Tripathi Yogesh, Donakanti Raghav, Girhepuje Sahil, Kavathekar Ishan, Vedula Bhaskara Hanuma, Krishnan Gokul S, Goyal Shreya, Goel Anmol, Ravindran Balaraman, Kumaraguru Ponnurangam
- Gender Bias In Machine Translation And The Era Of Large Language Models Vanmassenhove Eva
- Testing And Evaluation Of Large Language Models: Correctness, Non-toxicity, And Fairness Wang Wenxuan
- Freeva: Offline MLLM As Training-free Video Assistant Wu Wenhao
- Debiasing Multimodal Large Language Models Zhang Yi-fan, Yu Weichen, Wen Qingsong, Wang Xue, Zhang Zhang, Wang Liang, Jin Rong, Tan Tieniu
- Benchmarking Trustworthiness Of Multimodal Large Language Models: A Comprehensive Study Zhang Yichi, Huang Yao, Sun Yitong, Liu Chang, Zhao Zhe, Fang Zhengwei, Wang Yifan, Chen Huanran, Yang Xiao, Wei Xingxing, Su Hang, Dong Yinpeng, Zhu Jun
- Fairer Preferences Elicit Improved Human-aligned Large Language Model Judgments Zhou Han, Wan Xingchen, Liu Yinhong, Collier Nigel, Vulić Ivan, Korhonen Anna
🏷 Distillation
- Making Neural Machine Reading Comprehension Faster Chatterjee Debajyoti
- Tinybert: Distilling BERT For Natural Language Understanding Jiao Xiaoqi, Yin Yichun, Shang Lifeng, Jiang Xin, Chen Xiao, Li Linlin, Wang Fang, Liu Qun
- MKD: A Multi-task Knowledge Distillation Approach For Pretrained Language Models Liu Linqing, Wang Huan, Lin Jimmy, Socher Richard, Xiong Caiming
- Structured Pruning Of A Bert-based Question Answering Model Mccarley J. S., Chakravarti Rishav, Sil Avirup
- Distilbert, A Distilled Version Of BERT: Smaller, Faster, Cheaper And Lighter Sanh Victor, Debut Lysandre, Chaumond Julien, Wolf Thomas
- Waldorf: Wasteless Language-model Distillation On Reading-comprehension Tian James Yi, Kreuzer Alexander P., Chen Pai-hung, Will Hans-martin
- Well-read Students Learn Better: On The Importance Of Pre-training Compact Models Turc Iulia, Chang Ming-wei, Lee Kenton, Toutanova Kristina
- Towards Making The Most Of BERT In Neural Machine Translation Yang Jiacheng, Wang Mingxuan, Zhou Hao, Zhao Chengqi, Yu Yong, Zhang Weinan, Li Lei
- Model Compression With Two-stage Multi-teacher Knowledge Distillation For Web Question Answering System Yang Ze, Shou Linjun, Gong Ming, Lin Wutao, Jiang Daxin
- Extremely Small BERT Models From Mixed-vocabulary Training Zhao Sanqiang, Gupta Raghav, Song Yang, Zhou Denny
- Dipair: Fast And Accurate Distillation For Trillion-scale Text Matching And Pair Modeling Chen Jiecao, Yang Liu, Raman Karthik, Bendersky Michael, Yeh Jung-jung, Zhou Yun, Najork Marc, Cai Danyang, Emadzadeh Ehsan
- Autoadr: Automatic Model Design For Ad Relevance Chen Yiren, Yang Yaming, Sun Hong, Wang Yujing, Xu Yu, Shen Wei, Zhou Rong, Tong Yunhai, Bai Jing, Zhang Ruofei
- A Tailored Pre-training Model For Task-oriented Dialog Generation Gu Jing, Wu Qingyang, Wu Chongruo, Shi Weiyan, Yu Zhou
- Pretrained Transformers Improve Out-of-distribution Robustness Hendrycks Dan, Liu Xiaoyuan, Wallace Eric, Dziedzic Adam, Krishnan Rishabh, Song Dawn
- Syntactic Structure Distillation Pretraining For Bidirectional Encoders Kuncoro Adhiguna, Kong Lingpeng, Fried Daniel, Yogatama Dani, Rimell Laura, Dyer Chris, Blunsom Phil
- Efficient Transformer-based Large Scale Language Representations Using Hardware-friendly Block Structured Pruning Li Bingbing, Kong Zhenglun, Zhang Tianyun, Li Ji, Li Zhengang, Liu Hang, Ding Caiwen
- Autoregressive Knowledge Distillation Through Imitation Learning Lin Alexander, Wohlwend Jeremy, Chen Howard, Lei Tao
- Ladabert: Lightweight Adaptation Of BERT Through Hybrid Model Compression Mao Yihuan, Wang Yujing, Wu Chufan, Zhang Chen, Wang Yang, Yang Yaming, Zhang Quanlu, Tong Yunhai, Bai Jing
- Generation-distillation For Efficient Natural Language Understanding In Low-data Settings Melas-kyriazi Luke, Han George, Liang Celine
- Wechat Neural Machine Translation Systems For WMT20 Meng Fandong, Yan Jianhao, Liu Yijin, Gao Yuan, Zeng Xianfeng, Zeng Qinsong, Li Peng, Chen Ming, Zhou Jie, Liu Sifan, Zhou Hao
- ADER: Adaptively Distilled Exemplar Replay Towards Continual Learning For Session-based Recommendation Mi Fei, Lin Xiaoyu, Faltings Boi
- Axformer: Accuracy-driven Approximation Of Transformers For Faster, Smaller And More Accurate NLP Models Nagarajan Amrit, Sen Sanchari, Stevens Jacob R., Raghunathan Anand
- On The Effect Of Dropping Layers Of Pre-trained Transformer Models Sajjad Hassan, Dalvi Fahim, Durrani Nadir, Nakov Preslav
- Pre-trained Summarization Distillation Shleifer Sam, Rush Alexander M.
- Lightpaff: A Two-stage Distillation Framework For Pre-training And Fine-tuning Song Kaitao, Sun Hao, Tan Xu, Qin Tao, Lu Jianfeng, Liu Hongzhi, Liu Tie-yan
- Minilm: Deep Self-attention Distillation For Task-agnostic Compression Of Pre-trained Transformers Wang Wenhui, Wei Furu, Dong Li, Bao Hangbo, Yang Nan, Zhou Ming
- Improving BERT Fine-tuning Via Self-ensemble And Self-distillation Xu Yige, Qiu Xipeng, Zhou Ligao, Huang Xuanjing
- Future-guided Incremental Transformer For Simultaneous Translation Zhang Shaolei, Feng Yang, Li Liangyou
- Improving Non-autoregressive Neural Machine Translation With Monolingual Data Zhou Jiawei, Keung Phillip
- Distilling Transformers For Neural Cross-domain Search Clement Colin B., Wu Chen, Drain Dawn, Sundaresan Neel
- YANMTT: Yet Another Neural Machine Translation Toolkit Dabre Raj, Sumita Eiichiro
- Robustness Challenges In Model Distillation And Pruning For Natural Language Understanding Du Mengnan, Mukherjee Subhabrata, Cheng Yu, Shokouhi Milad, Hu Xia, Awadallah Ahmed Hassan
- Robustly Optimized And Distilled Training For Natural Language Understanding Elfadeel Haytham, Peshterliev Stan
- Compressing Visual-linguistic Model Via Knowledge Distillation Fang Zhiyuan, Wang Jianfeng, Hu Xiaowei, Wang Lijuan, Yang Yezhou, Liu Zicheng
- Scalable Transformers For Neural Machine Translation Gao Peng, Geng Shijie, Qiao Yu, Wang Xiaogang, Dai Jifeng, Li Hongsheng
- Generate, Annotate, And Learn: NLP With Synthetic Text He Xuanli, Nassar Islam, Kiros Jamie, Haffari Gholamreza, Norouzi Mohammad
- The Niutrans System For WNGT 2020 Efficiency Task Hu Chi, Li Bei, Lin Ye, Li Yinqiao, Li Yanyang, Wang Chenglong, Xiao Tong, Zhu Jingbo
- Audio-oriented Multimodal Machine Comprehension: Task, Dataset And Model Huang Zhiqi, Liu Fenglin, Wu Xian, Ge Shen, Wang Helin, Fan Wei, Zou Yuexian
- Lightmbert: A Simple Yet Effective Method For Multilingual BERT Distillation Jiao Xiaoqi, Yin Yichun, Shang Lifeng, Jiang Xin, Chen Xiao, Li Linlin, Wang Fang, Liu Qun
- Distilling Large Language Models Into Tiny And Effective Students Using Pqrnn Kaliamoorthi Prabhu, Siddhant Aditya, Li Edward, Johnson Melvin
- Align Before Fuse: Vision And Language Representation Learning With Momentum Distillation Li Junnan, Selvaraju Ramprasaath R., Gotmare Akhilesh Deepak, Joty Shafiq, Xiong Caiming, Hoi Steven
- A Short Study On Compressing Decoder-based Language Models Li Tianda, Mesbahi Yassir El, Kobyzev Ivan, Rashid Ahmad, Mahmud Atif, Anchuri Nithin, Hajimolahoseini Habib, Liu Yang, Rezagholizadeh Mehdi
- ERNIE 3.0 Titan: Exploring Larger-scale Knowledge Enhanced Pre-training For Language Understanding And Generation Shuohuan Wang, Yu Sun, Yang Xiang, Zhihua Wu, Siyu Ding, Weibao Gong, Shikun Feng, Junyuan Shang, Yanbin Zhao, Chao Pang, Jiaxiang Liu, Xuyi Chen, Yuxiang Lu, Weixin Liu, Xi Wang, Yangfan Bai, Qiuliang Chen, Li Zhao, Shiyong Li, Peng Sun, Dianhai Yu, Yanjun Ma, Hao Tian, Hua Wu, Tian Wu, Wei Zeng, Ge Li, Wen Gao, Haifeng Wang
- Ernie-tiny: A Progressive Distillation Framework For Pretrained Transformer Compression Su Weiyue, Chen Xuyi, Feng Shikun, Liu Jiaxiang, Liu Weixin, Sun Yu, Tian Hao, Wu Hua, Wang Haifeng
- NVIDIA Nemo Neural Machine Translation Systems For English-german And English-russian News And Biomedical Tasks At WMT21 Subramanian Sandeep, Hrinchuk Oleksii, Adams Virginia, Kuchaiev Oleksii
- Selective Knowledge Distillation For Neural Machine Translation Wang Fusheng, Yan Jianhao, Meng Fandong, Zhou Jie
- Distilled Dual-encoder Model For Vision-language Understanding Wang Zekun, Wang Wenhui, Zhu Haichao, Liu Ming, Qin Bing, Wei Furu
- Prune Once For All: Sparse Pre-trained Language Models Zafrir Ofir, Larey Ariel, Boudoukh Guy, Shen Haihao, Wasserblat Moshe
- Wechat Neural Machine Translation Systems For WMT21 Zeng Xianfeng, Liu Yijin, Li Ernan, Ran Qiu, Meng Fandong, Li Peng, Xu Jinan, Zhou Jie
- I2D2: Inductive Knowledge Distillation With Neurologic And Self-imitation Bhagavatula Chandra, Hwang Jena D., Downey Doug, Bras Ronan Le, Lu Ximing, Qin Lianhui, Sakaguchi Keisuke, Swayamdipta Swabha, West Peter, Choi Yejin
- Enabling Multimodal Generation On CLIP Via Vision-language Knowledge Distillation Dai Wenliang, Hou Lu, Shang Lifeng, Jiang Xin, Liu Qun, Fung Pascale
- Re2g: Retrieve, Rerank, Generate Glass Michael, Rossiello Gaetano, Chowdhury Md Faisal Mahbub, Naik Ankita Rajaram, Cai Pengshan, Gliozzo Alfio
- Quala-minilm: A Quantized Length Adaptive Minilm Guskin Shira, Wasserblat Moshe, Wang Chang, Shen Haihao
- LUNA: Language Understanding With Number Augmentations On Transformers Via Number Plugins And Pre-training Han Hongwei, Xu Jialiang, Zhou Mengyu, Shao Yijia, Han Shi, Zhang Dongmei
- The Optimal BERT Surgeon: Scalable And Accurate Second-order Pruning For Large Language Models Kurtic Eldar, Campos Daniel, Nguyen Tuan, Frantar Elias, Kurtz Mark, Fineran Benjamin, Goin Michael, Alistarh Dan
- Hard Gate Knowledge Distillation -- Leverage Calibration For Robust And Reliable Language Model Lee Dongkyu, Tian Zhiliang, Zhao Yingxiu, Cheung Ka Chun, Zhang Nevin L.
- DQ-BART: Efficient Sequence-to-sequence Model Via Joint Distillation And Quantization Li Zheng, Wang Zijian, Tan Ming, Nallapati Ramesh, Bhatia Parminder, Arnold Andrew, Xiang Bing, Roth Dan
- Bjtu-wechat's Systems For The WMT22 Chat Translation Task Liang Yunlong, Meng Fandong, Xu Jinan, Chen Yufeng, Zhou Jie
- Teaching Small Language Models To Reason Magister Lucie Charlotte, Mallinson Jonathan, Adamek Jakub, Malmi Eric, Severyn Aliaksei
- Minialbert: Model Distillation Via Parameter-efficient Recursive Transformers Nouriborji Mohammadmahdi, Rohanian Omid, Kouchaki Samaneh, Clifton David A.
- A Self-paced Mixed Distillation Method For Non-autoregressive Generation Qi Weizhen, Gong Yeyun, Shen Yelong, Jiao Jian, Yan Yu, Li Houqiang, Zhang Ruofei, Chen Weizhu, Duan Nan
- Exploring Extreme Parameter Compression For Pre-trained Language Models Ren Yuxin, Wang Benyou, Shang Lifeng, Jiang Xin, Liu Qun
- Dialokg: Knowledge-structure Aware Task-oriented Dialogue Generation Rony Md Rashad Al Hasan, Usbeck Ricardo, Lehmann Jens
- Overcoming Catastrophic Forgetting Beyond Continual Learning: Balanced Training For Neural Machine Translation Shao Chenze, Feng Yang
- Distilling Reasoning Capabilities Into Smaller Language Models Shridhar Kumar, Stolfo Alessandro, Sachan Mrinmaya
- QUILL: Query Intent With Large Language Models Using Retrieval Augmentation And Multi-stage Distillation Srinivasan Krishna, Raman Karthik, Samanta Anupam, Liao Lingrui, Bertelli Luca, Bendersky Mike
- Compression Of Generative Pre-trained Language Models Via Quantization Tao Chaofan, Hou Lu, Zhang Wei, Shang Lifeng, Jiang Xin, Liu Qun, Luo Ping, Wong Ngai
- Multimodal Adaptive Distillation For Leveraging Unimodal Encoders For Vision-language Tasks Wang Zhecan, Codella Noel, Chen Yen-chun, Zhou Luowei, Dai Xiyang, Xiao Bin, Yang Jianwei, You Haoxuan, Chang Kai-wei, Chang Shih-fu, Yuan Lu
- Extreme Compression For Pre-trained Transformers Made Simple And Efficient Wu Xiaoxia, Yao Zhewei, Zhang Minjia, Li Conglong, He Yuxiong
- Integrating Translation Memories Into Non-autoregressive Machine Translation Xu Jitao, Crego Josep, Yvon François
- Generative Negative Text Replay For Continual Vision-language Pretraining Yan Shipeng, Hong Lanqing, Xu Hang, Han Jianhua, Tuytelaars Tinne, Li Zhenguo, He Xuming
- Zeroquant: Efficient And Affordable Post-training Quantization For Large-scale Transformers Yao Zhewei, Aminabadi Reza Yazdani, Zhang Minjia, Wu Xiaoxia, Li Conglong, He Yuxiong
- Leveraging Vision-language Models For Improving Domain Generalization In Image Classification Addepalli Sravanti, Asokan Ashish Ramayee, Sharma Lakshay, Babu R. Venkatesh
- Rest Meets React: Self-improvement For Multi-step Reasoning LLM Agent Aksitov Renat, Miryoosefi Sobhan, Li Zonglin, Li Daliang, Babayan Sheila, Kopparapu Kavya, Fisher Zachary, Guo Ruiqi, Prakash Sushant, Srinivasan Pranesh, Zaheer Manzil, Yu Felix, Kumar Sanjiv
- Distilling Efficient Language-specific Models For Cross-lingual Transfer Ansell Alan, Ponti Edoardo Maria, Korhonen Anna, Vulić Ivan
- Testing The Depth Of Chatgpt's Comprehension Via Cross-modal Tasks Based On Ascii-art: Gpt3.5's Abilities In Regard To Recognizing And Generating Ascii-art Are Not Totally Lacking Bayani David
- Plasma: Making Small Language Models Better Procedural Knowledge Models For (counterfactual) Planning Brahman Faeze, Bhagavatula Chandra, Pyatkin Valentina, Hwang Jena D., Li Xiang Lorraine, Arai Hirona J., Sanyal Soumya, Sakaguchi Keisuke, Ren Xiang, Choi Yejin
- Efficient Transformer Knowledge Distillation: A Performance Review Brown Nathan, Williamson Ashton, Anderson Tahj, Lawrence Logan
- A Systematic Study Of Knowledge Distillation For Natural Language Generation With Pseudo-target Training Calderon Nitay, Mukherjee Subhabrata, Reichart Roi, Kantor Amir
- Oberta: Improving Sparse Transfer Learning Via Improved Initialization, Distillation, And Pruning Regimes Campos Daniel, Marques Alexandre, Kurtz Mark, Zhai Chengxiang
- Promoting Generalized Cross-lingual Question Answering In Few-resource Scenarios Via Self-knowledge Distillation Carrino Casimiro Pio, Escolano Carlos, Fonollosa José A. R.
- Dialogue Chain-of-thought Distillation For Commonsense-aware Conversational Agents Chae Hyungjoo, Song Yongho, Ong Kai Tzu-iunn, Kwon Taeyoon, Kim Minjin, Yu Youngjae, Lee Dongha, Kang Dongyeop, Yeo Jinyoung
- Chain-of-thought Prompt Distillation For Multimodal Named Entity Recognition And Multimodal Relation Extraction Chen Feng, Feng Yujian
- Personalised Distillation: Empowering Open-sourced Llms With Adaptive Learning For Code Generation Chen Hailin, Saha Amrita, Hoi Steven, Joty Shafiq
- MCC-KD: Multi-cot Consistent Knowledge Distillation Chen Hongzhan, Wu Siyue, Quan Xiaojun, Wang Rui, Yan Ming, Zhang Ji
- Galactic Chitchat: Using Large Language Models To Converse With Astronomy Literature Ciucă Ioana, Ting Yuan-sen
- MBR And QE Finetuning: Training-time Distillation Of The Best And Most Expensive Decoding Methods Finkelstein Mara, Naskar Subhajit, Mirzazadeh Mehdi, Shah Apurva, Freitag Markus
- Distilling Step-by-step! Outperforming Larger Language Models With Less Training Data And Smaller Model Sizes Hsieh Cheng-yu, Li Chun-liang, Yeh Chih-kuan, Nakhost Hootan, Fujii Yasuhisa, Ratner Alexander, Krishna Ranjay, Lee Chen-yu, Pfister Tomas
- Bad Actor, Good Advisor: Exploring The Role Of Large Language Models In Fake News Detection Hu Beizhe, Sheng Qiang, Cao Juan, Shi Yuhui, Li Yang, Wang Danding, Qi Peng
- Visual Program Distillation: Distilling Tools And Programmatic Reasoning Into Vision-language Models Hu Yushi, Stretcu Otilia, Lu Chun-ta, Viswanathan Krishnamurthy, Hata Kenji, Luo Enming, Krishna Ranjay, Fuxman Ariel
- Soft Prompt Decoding For Multilingual Dense Retrieval Huang Zhiqi, Zeng Hansi, Zamani Hamed, Allan James
- Just CHOP: Embarrassingly Simple LLM Compression Jha Ananya Harsh, Sherborne Tom, Walsh Evan Pete, Groeneveld Dirk, Strubell Emma, Beltagy Iz
- Lion: Adversarial Distillation Of Proprietary Large Language Models Jiang Yuxin, Chan Chunkit, Chen Mingyang, Wang Wei
- Impossible Distillation: From Low-quality Model To High-quality Dataset & Model For Summarization And Paraphrasing Jung Jaehun, West Peter, Jiang Liwei, Brahman Faeze, Lu Ximing, Fisher Jillian, Sorensen Taylor, Choi Yejin
- Knowledge-augmented Reasoning Distillation For Small Language Models In Knowledge-intensive Tasks Kang Minki, Lee Seanie, Baek Jinheon, Kawaguchi Kenji, Hwang Sung Ju
- Expediting Contrastive Language-image Pretraining Via Self-distilled Encoders Kim Bumsoo, Kim Jinhyung, Jo Yeonsik, Kim Seung Hwan
- Aligning Large Language Models Through Synthetic Feedback Kim Sungdong, Bae Sanghwan, Shin Jamin, Kang Soyoung, Kwak Donghyun, Yoo Kang Min, Seo Minjoon
- Revisiting Intermediate Layer Distillation For Compressing Language Models: An Overfitting Perspective Ko Jongwoo, Park Seungjoon, Jeong Minchan, Hong Sukjin, Ahn Euijai, Chang Du-seong, Yun Se-young
- Sparse Fine-tuning For Inference Acceleration Of Large Language Models Kurtic Eldar, Kuznedelev Denis, Frantar Elias, Goin Michael, Alistarh Dan
- Mixed Distillation Helps Smaller Language Model Better Reasoning Li Chenglin, Chen Qianglong, Li Liangyue, Wang Caiyu, Li Yicheng, Chen Zulong, Zhang Yin
- Unlock The Power: Competitive Distillation For Multi-modal Large Language Models Li Xinwei, Lin Li, Wang Shuai, Qian Chen
- Turning Dust Into Gold: Distilling Complex Reasoning Capabilities From Llms By Leveraging Negative Data Li Yiwei, Yuan Peiwen, Feng Shaoxiong, Pan Boyuan, Sun Bin, Wang Xinglin, Wang Heda, Li Kan
- Homodistil: Homotopic Task-agnostic Distillation Of Pre-trained Transformers Liang Chen, Jiang Haoming, Li Zheng, Tang Xianfeng, Yin Bin, Zhao Tuo
- Mind's Mirror: Distilling Self-evaluation Capability And Comprehensive Thinking From Large Language Models Liu Weize, Li Guocong, Zhang Kai, Du Bang, Chen Qiyuan, Hu Xuming, Xu Hongxia, Chen Jintai, Wu Jian
- Online Speculative Decoding Liu Xiaoxuan, Hu Lanxiang, Bailis Peter, Cheung Alvin, Deng Zhijie, Stoica Ion, Zhang Hao
- Towards Logiglue: A Brief Survey And A Benchmark For Analyzing Logical Reasoning Capabilities Of Language Models Luo Man, Kumbhar Shrinidhi, Shen Ming, Parmar Mihir, Varshney Neeraj, Banerjee Pratyay, Aditya Somak, Baral Chitta
- Sci-cot: Leveraging Large Language Models For Enhanced Knowledge Distillation In Small Models For Scientific QA Ma Yuhan, Jiang Haiqi, Fan Chenyou
- LEALLA: Learning Lightweight Language-agnostic Sentence Embeddings With Knowledge Distillation Mao Zhuoyuan, Nakagawa Tetsuji
- Learning To Compress Prompts With Gist Tokens Mu Jesse, Li Xiang Lisa, Goodman Noah
- Kosmos-g: Generating Images In Context With Multimodal Large Language Models Pan Xichen, Dong Li, Huang Shaohan, Peng Zhiliang, Chen Wenhu, Wei Furu
- Semantic Composition In Visually Grounded Language Models Pandey Rohan
- Localized Symbolic Knowledge Distillation For Visual Commonsense Models Park Jae Sung, Hessel Jack, Chandu Khyathi Raghavi, Liang Paul Pu, Lu Ximing, West Peter, Yu Youngjae, Huang Qiuyuan, Gao Jianfeng, Farhadi Ali, Choi Yejin
- Improving In-context Learning Via Bidirectional Alignment Qin Chengwei, Xia Wenhan, Jiao Fangkai, Chen Chen, Hu Yuchen, Ding Bosheng, Joty Shafiq
- Tailoring Self-rationalizers With Multi-reward Distillation Ramnath Sahana, Joshi Brihi, Hallinan Skyler, Lu Ximing, Li Liunian Harold, Chan Aaron, Hessel Jack, Choi Yejin, Ren Xiang
- ICLEF: In-context Learning With Expert Feedback For Explainable Style Transfer Saakyan Arkadiy, Muresan Smaranda
- Promptmix: A Class Boundary Augmentation Method For Large Language Model Distillation Sahu Gaurav, Vechtomova Olga, Bahdanau Dzmitry, Laradji Issam H.
- A Symmetric Dual Encoding Dense Retrieval Framework For Knowledge-intensive Visual Question Answering Salemi Alireza, Pizzorno Juan Altmayer, Zamani Hamed
- Don't Add, Don't Miss: Effective Content Preserving Generation From Pre-selected Text Spans Slobodkin Aviv, Caciularu Avi, Hirsch Eran, Dagan Ido
- Distilled GPT For Source Code Summarization Su Chia-yi, Mcmillan Collin
- CONSCENDI: A Contrastive And Scenario-guided Distillation Approach To Guardrail Models For Virtual Assistants Sun Albert Yu, Nair Varun, Schumacher Elliot, Kannan Anitha
- Create Your World: Lifelong Text-to-image Diffusion Sun Gan, Liang Wenqi, Dong Jiahua, Li Jun, Ding Zhengming, Cong Yang
- Instruction Distillation Makes Large Language Models Efficient Zero-shot Rankers Sun Weiwei, Chen Zheng, Ma Xinyu, Yan Lingyong, Wang Shuaiqiang, Ren Pengjie, Chen Zhumin, Yin Dawei, Ren Zhaochun
- Improving Non-autoregressive Translation Quality With Pretrained Language Model, Embedding Distillation And Upsampling Strategy For CTC Syu Shen-sian, Xie Juncheng, Lee Hung-yi
- Baby Llama: Knowledge Distillation From An Ensemble Of Teachers Trained On A Small Dataset With No Performance Penalty Timiryasov Inar, Tastet Jean-loup
- What Makes For Good Visual Tokenizers For Large Language Models? Wang Guangzhi, Ge Yixiao, Ding Xiaohan, Kankanhalli Mohan, Shan Ying
- Fake Alignment: Are Llms Really Aligned Well? Wang Yixu, Teng Yan, Huang Kexin, Lyu Chengqi, Zhang Songyang, Zhang Wenwei, Ma Xingjun, Jiang Yu-gang, Qiao Yu, Wang Yingchun
- Smarttrim: Adaptive Tokens And Attention Pruning For Efficient Vision-language Models Wang Zekun, Chen Jingchang, Zhou Wangchunshu, Zhu Haichao, Liang Jiafeng, Shan Liping, Liu Ming, Xu Dongliang, Yang Qing, Qin Bing
- Is Chatgpt Good At Search? Investigating Large Language Models As Re-ranking Agents Sun Weiwei, Yan Lingyong, Ma Xinyu, Wang Shuaiqiang, Ren Pengjie, Chen Zhumin, Yin Dawei, Ren Zhaochun
- O3D: Offline Data-driven Discovery And Distillation For Sequential Decision-making With Large Language Models Xiao Yuchen, Sun Yanchao, Xu Mengda, Madhushani Udari, Vann Jared, Garg Deepeka, Ganesh Sumitra
- Llm-pruner: On The Structural Pruning Of Large Language Models Ma Xinyin, Fang Gongfan, Wang Xinchao
- Effective Distillation Of Table-based Reasoning Ability From Llms Yang Bohao, Tang Chen, Zhao Kun, Xiao Chenghao, Lin Chenghua
- Enabling Large Language Models To Learn From Rules Yang Wenkai, Lin Yankai, Zhou Jie, Wen Jirong
- Quantization-aware And Tensor-compressed Training Of Transformers For Natural Language Understanding Yang Zi, Choudhary Samridhi, Kunzmann Siegfried, Zhang Zheng
- Information Association For Language Model Updating By Mitigating Lm-logical Discrepancy Yu Pengfei, Ji Heng
- Task-agnostic Distillation Of Encoder-decoder Language Models Zhang Chen, Yang Yang, Wang Jingang, Song Dawei
- Retrieve Anything To Augment Large Language Models Zhang Peitian, Xiao Shitao, Liu Zheng, Dou Zhicheng, Nie Jian-yun
- IAG: Induction-augmented Generation Framework For Answering Reasoning Questions Zhang Zhebin, Zhang Xinyu, Ren Yuanhang, Shi Saijiang, Han Meng, Wu Yongkang, Lai Ruofei, Cao Zhao
- Learn From Model Beyond Fine-tuning: A Survey Zheng Hongling, Shen Li, Tang Anke, Luo Yong, Hu Han, Du Bo, Tao Dacheng
- Sur-adapter: Enhancing Text-to-image Pre-trained Diffusion Models With Large Language Models Zhong Shanshan, Huang Zhongzhan, Wen Wushao, Qin Jinghui, Lin Liang
- Pad: Program-aided Distillation Can Teach Small Models Reasoning Better Than Chain-of-thought Fine-tuning Zhu Xuekai, Qi Biqing, Zhang Kaiyan, Long Xinwei, Lin Zhouhan, Zhou Bowen
- A Survey On Symbolic Knowledge Distillation Of Large Language Models Acharya Kamal, Velasquez Alvaro, Song Houbing Herbert
- Mixture Of Modular Experts: Distilling Knowledge From A Multilingual Teacher Into Specialized Modular Language Models Al-maamari Mohammed, Amor Mehdi Ben, Granitzer Michael
- Smaller, Weaker, Yet Better: Training LLM Reasoners Via Compute-optimal Sampling Bansal Hritik, Hosseini Arian, Agarwal Rishabh, Tran Vinh Q., Kazemi Mehran
- Towards Cross-tokenizer Distillation: The Universal Logit Distillation Loss For Llms Boizard Nicolas, Haddad Kevin El, Hudelot Céline, Colombo Pierre
- Generation, Distillation And Evaluation Of Motivational Interviewing-style Reflections With A Foundational Language Model Brown Andrew, Zhu Jiading, Abdelwahab Mohamed, Dong Alec, Wang Cindy, Rose Jonathan
- Pheme: Efficient And Conversational Speech Generation Budzianowski Paweł, Sereda Taras, Cichy Tomasz, Vulić Ivan
- Medusa: Simple LLM Inference Acceleration Framework With Multiple Decoding Heads Cai Tianle, Li Yuhong, Geng Zhengyang, Peng Hongwu, Lee Jason D., Chen Deming, Dao Tri
- Magdi: Structured Distillation Of Multi-agent Interaction Graphs Improves Reasoning In Smaller Language Models Chen Justin Chih-yao, Saha Swarnadeep, Stengel-eskin Elias, Bansal Mohit
- Ternaryllm: Ternarized Large Language Model Chen Tianqi, Li Zhe, Xu Weixiang, Zhu Zeyu, Li Dong, Tian Lu, Barsoum Emad, Wang Peisong, Cheng Jian
- Transferring Backdoors Between Large Language Models By Knowledge Distillation Cheng Pengzhou, Wu Zongru, Ju Tianjie, Du Wei, Zhang Zhuosheng, Liu Gongshen
- Distillation Matters: Empowering Sequential Recommenders To Match The Performance Of Large Language Model Cui Yu, Liu Feng, Wang Pengbo, Wang Bohao, Tang Heng, Wan Yi, Wang Jun, Chen Jiawei
- Improve Student's Reasoning Generalizability Through Cascading Decomposed Cots Distillation Dai Chengwei, Li Kun, Zhou Wei, Hu Songlin
- Self-play With Execution Feedback: Improving Instruction-following Capabilities Of Large Language Models Dong Guanting, Lu Keming, Li Chengpeng, Xia Tingyu, Yu Bowen, Zhou Chang, Zhou Jingren
- Unmemorization In Large Language Models Via Self-distillation And Deliberate Imagination Dong Yijiang River, Lin Hongzhou, Belkin Mikhail, Huerta Ramon, Vulić Ivan
- Bitdistiller: Unleashing The Potential Of Sub-4-bit Llms Via Self-distillation Du Dayou, Zhang Yijia, Cao Shijie, Guo Jiaqi, Cao Ting, Chu Xiaowen, Xu Ningyi
- Accuracy Is Not All You Need Dutta Abhinav, Krishnan Sanjeev, Kwatra Nipun, Ramjee Ramachandran
- From LLM To NMT: Advancing Low-resource Machine Translation With Claude Enis Maxim, Hopkins Mark
- Keypoint-based Progressive Chain-of-thought Distillation For Llms Feng Kaituo, Li Changsheng, Zhang Xiaolu, Zhou Jun, Yuan Ye, Wang Guoren
- Introducing The Newspalm MBR And QE Dataset: Llm-generated High-quality Parallel Data Outperforms Traditional Web-crawled Data Finkelstein Mara, Vilar David, Freitag Markus
- Gemma 2: Improving Open Language Models At A Practical Size Gemma Team, Riviere Morgane, Pathak Shreya, Sessa Pier Giuseppe, Hardin Cassidy, Bhupatiraju Surya, Hussenot Léonard, Mesnard Thomas, Shahriari Bobak, Ramé Alexandre, Ferret Johan, Liu Peter, Tafti Pouya, Friesen Abe, Casbon Michelle, Ramos Sabela, Kumar Ravin, Lan Charline Le, Jerome Sammy, Tsitsulin Anton, Vieillard Nino, Stanczyk Piotr, Girgin Sertan, Momchev Nikola, Hoffman Matt, Thakoor Shantanu, Grill Jean-bastien, Neyshabur Behnam, Bachem Olivier, Walton Alanna, Severyn Aliaksei, Parrish Alicia, Ahmad Aliya, Hutchison Allen, Abdagic Alvin, Carl Amanda, Shen Amy, Brock Andy, Coenen Andy, Laforge Anthony, Paterson Antonia, Bastian Ben, Piot Bilal, Wu Bo, Royal Brandon, Chen Charlie, Kumar Chintu, Perry Chris, Welty Chris, Choquette-choo Christopher A., Sinopalnikov Danila, Weinberger David, Vijaykumar Dimple, Rogozińska Dominika, Herbison Dustin, Bandy Elisa, Wang Emma, Noland Eric, Moreira Erica, Senter Evan, Eltyshev Evgenii, Visin Francesco, Rasskin Gabriel, Wei Gary, Cameron Glenn, Martins Gus, Hashemi Hadi, Klimczak-plucińska Hanna, Batra Harleen, Dhand Harsh, Nardini Ivan, Mein Jacinda, Zhou Jack, Svensson James, Stanway Jeff, Chan Jetha, Zhou Jin Peng, Carrasqueira Joana, Iljazi Joana, Becker Jocelyn, Fernandez Joe, Van Amersfoort Joost, Gordon Josh, Lipschultz Josh, Newlan Josh, Ji Ju-yeong, Mohamed Kareem, Badola Kartikeya, Black Kat, Millican Katie, Mcdonell Keelin, Nguyen Kelvin, Sodhia Kiranbir, Greene Kish, Sjoesund Lars Lowe, Usui Lauren, Sifre Laurent, Heuermann Lena, Lago Leticia, Mcnealus Lilly, Soares Livio Baldini, Kilpatrick Logan, Dixon Lucas, Martins Luciano, Reid Machel, Singh Manvinder, Iverson Mark, Görner Martin, Velloso Mat, Wirth Mateo, Davidow Matt, Miller Matt, Rahtz Matthew, Watson Matthew, Risdal Meg, Kazemi Mehran, Moynihan Michael, Zhang Ming, Kahng Minsuk, Park Minwoo, Rahman Mofi, Khatwani Mohit, Dao Natalie, Bardoliwalla Nenshad, Devanathan Nesh, Dumai Neta, Chauhan Nilay, Wahltinez Oscar, Botarda Pankil, Barnes Parker, Barham Paul, Michel Paul, Jin Pengchong, Georgiev Petko, Culliton Phil, Kuppala Pradeep, Comanescu Ramona, Merhej Ramona, Jana Reena, Rokni Reza Ardeshir, Agarwal Rishabh, Mullins Ryan, Saadat Samaneh, Carthy Sara Mc, Perrin Sarah, Arnold Sébastien M. R., Krause Sebastian, Dai Shengyang, Garg Shruti, Sheth Shruti, Ronstrom Sue, Chan Susan, Jordan Timothy, Yu Ting, Eccles Tom, Hennigan Tom, Kocisky Tomas, Doshi Tulsee, Jain Vihan, Yadav Vikas, Meshram Vilobh, Dharmadhikari Vishal, Barkley Warren, Wei Wei, Ye Wenming, Han Woohyun, Kwon Woosuk, Xu Xiang, Shen Zhe, Gong Zhitao, Wei Zichuan, Cotruta Victor, Kirk Phoebe, Rao Anand, Giang Minh, Peran Ludovic, Warkentin Tris, Collins Eli, Barral Joelle, Ghahramani Zoubin, Hadsell Raia, Sculley D., Banks Jeanine, Dragan Anca, Petrov Slav, Vinyals Oriol, Dean Jeff, Hassabis Demis, Kavukcuoglu Koray, Farabet Clement, Buchatskaya Elena, Borgeaud Sebastian, Fiedel Noah, Joulin Armand, Kenealy Kathleen, Dadashi Robert, Andreev Alek
- Why Not Transform Chat Large Language Models To Non-english? Geng Xiang, Zhu Ming, Li Jiahuan, Lai Zhejian, Zou Wei, She Shuaijie, Guo Jiaxin, Zhao Xiaofeng, Li Yinglu, Li Yuang, Su Chang, Zhao Yanqing, Lyu Xinglin, Zhang Min, Chen Jiajun, Yang Hao, Huang Shujian
- Direct Alignment Of Draft Model For Speculative Decoding With Chat-fine-tuned Llms Goel Raghavv, Gagrani Mukul, Jeon Wonseok, Park Junyoung, Lee Mingu, Lott Christopher
- Boosting Lossless Speculative Decoding Via Feature Sampling And Partial Alignment Distillation Gui Lujun, Xiao Bin, Su Lei, Chen Weipeng
- Sub-goal Distillation: A Method To Improve Small Language Agents Hashemzadeh Maryam, Stengel-eskin Elias, Chandar Sarath, Cote Marc-alexandre
- Lightweight Modality Adaptation To Sequential Recommendation Via Correlation Supervision Hu Hengchang, Liu Qijiong, Li Chuang, Kan Min-yen
- Pruning Large Language Models With Semi-structural Adaptive Sparse Training Huang Weiyu, Hu Yuezhou, Jian Guohao, Zhu Jun, Chen Jianfei
- Leveraging Large Language Models For Enhanced NLP Task Performance Through Knowledge Distillation And Optimized Training Strategies Huang Yining, Tang Keke, Chen Meilian
- Tool Calling: Enhancing Medication Consultation Via Retrieval-augmented Large Language Models Huang Zhongzhen, Xue Kui, Fan Yongqi, Mu Linjie, Liu Ruoyu, Ruan Tong, Zhang Shaoting, Zhang Xiaofan
- Pc-lora: Low-rank Adaptation For Progressive Model Compression With Knowledge Distillation Hwang Injoon, Park Haewon, Lee Youngwan, Yang Jooyoung, Maeng Sunjae
- Parameter Efficient Diverse Paraphrase Generation Using Sequence-level Knowledge Distillation Jayawardena Lasal, Yapa Prasan
- Mix-cpt: A Domain Adaptation Framework Via Decoupling Knowledge Learning And Format Alignment Jiang Jinhao, Li Junyi, Zhao Wayne Xin, Song Yang, Zhang Tao, Wen Ji-rong
- Information-theoretic Distillation For Reference-less Summarization Jung Jaehun, Lu Ximing, Jiang Liwei, Brahman Faeze, West Peter, Koh Pang Wei, Choi Yejin
- Spectra: A Comprehensive Study Of Ternary, Quantized, And FP16 Language Models Kaushal Ayush, Pandey Tejas, Vaidhya Tejas, Bhagat Aaryan, Rish Irina
- Promptkd: Distilling Student-friendly Knowledge For Generative Language Models Via Prompt Tuning Kim Gyeongman, Jang Doohyuk, Yang Eunho
- Review-driven Personalized Preference Reasoning With Large Language Models For Recommendation Kim Jieyong, Kim Hyunseo, Cho Hyunjin, Kang Seongku, Chang Buru, Yeo Jinyoung, Lee Dongha
- Improving Conversational Abilities Of Quantized Large Language Models Via Direct Preference Alignment Lee Janghwan, Park Seongmin, Hong Sukjin, Kim Minsoo, Chang Du-seong, Choi Jungwook
- Gecko: Versatile Text Embeddings Distilled From Large Language Models Lee Jinhyuk, Dai Zhuyun, Ren Xiaoqi, Chen Blair, Cer Daniel, Cole Jeremy R., Hui Kai, Boratko Michael, Kapadia Rajvi, Ding Wen, Luan Yi, Duddu Sai Meher Karthik, Abrego Gustavo Hernandez, Shi Weiqiang, Gupta Nithi, Kusupati Aditya, Jain Prateek, Jonnalagadda Siddhartha Reddy, Chang Ming-wei, Naim Iftekhar
- Contextualization Distillation From Large Language Model For Knowledge Graph Completion Li Dawei, Tan Zhen, Chen Tianlong, Liu Huan
- MT-PATCHER: Selective And Extendable Knowledge Distillation From Large Language Models For Machine Translation Li Jiahuan, Cheng Shanbo, Huang Shujian, Chen Jiajun
- Grounded Compositional And Diverse Text-to-3d With Pretrained Multi-view Diffusion Model Li Xiaolong, Mo Jiawei, Wang Ying, Parameshwara Chethan, Fei Xiaohan, Swaminathan Ashwin, Taylor Cj, Tu Zhuowen, Favaro Paolo, Soatto Stefano
- Intermediate Distillation: Data-efficient Distillation From Black-box Llms For Information Retrieval Li Zizhong, Zhang Haopeng, Zhang Jiawei
- Languashrink: Reducing Token Overhead With Psycholinguistics Liang Xuechen, Tao Meiling, Xia Yinghui, Shi Tianyu, Wang Jun, Yang Jingsong
- Direct Large Language Model Alignment Through Self-rewarding Contrastive Prompt Distillation Liu Aiwei, Bai Haoping, Lu Zhiyun, Kong Xiang, Wang Simon, Shan Jiulong, Cao Meng, Wen Lijie
- Speculative Decoding Via Early-exiting For Faster LLM Inference With Thompson Sampling Control Mechanism Liu Jiahao, Wang Qifan, Wang Jingang, Cai Xunliang
- Personalitychat: Conversation Distillation For Personalized Dialog Modeling With Facts And Traits Lotfi Ehsan, De Bruyn Maxime, Buhmann Jeska, Daelemans Walter
- Decoupled Alignment For Robust Plug-and-play Adaptation Luo Haozheng, Yu Jiahao, Zhang Wenxin, Li Jialong, Hu Jerry Yao-chieh, Xing Xinyu, Liu Han
- FBI-LLM: Scaling Up Fully Binarized Llms From Scratch Via Autoregressive Distillation Ma Liqun, Sun Mingjie, Shen Zhiqiang
- RDBE: Reasoning Distillation-based Evaluation Enhances Automatic Essay Scoring Mohammadkhani Ali Ghiasvand
- Compact Language Models Via Pruning And Knowledge Distillation Muralidharan Saurav, Sreenivas Sharath Turuvekere, Joshi Raviraj, Chochowski Marcin, Patwary Mostofa, Shoeybi Mohammad, Catanzaro Bryan, Kautz Jan, Molchanov Pavlo
- Better Alignment With Instruction Back-and-forth Translation Nguyen Thao, Li Jeffrey, Oh Sewoong, Schmidt Ludwig, Weston Jason, Zettlemoyer Luke, Li Xian
- Layer-wise Regularized Dropout For Neural Language Models Ni Shiwen, Yang Min, Xu Ruifeng, Li Chengming, Hu Xiping
- Can Llms Learn By Teaching? A Preliminary Study Ning Xuefei, Wang Zifu, Li Shiyao, Lin Zinan, Yao Peiran, Fu Tianyu, Blaschko Matthew B., Dai Guohao, Yang Huazhong, Wang Yu
- Llmlingua-2: Data Distillation For Efficient And Faithful Task-agnostic Prompt Compression Pan Zhuoshi, Wu Qianhui, Jiang Huiqiang, Xia Menglin, Luo Xufang, Zhang Jue, Lin Qingwei, Rühle Victor, Yang Yuqing, Lin Chin-yew, Zhao H. Vicky, Qiu Lili, Zhang Dongmei
- Pre-trained Vision And Language Transformers Are Few-shot Incremental Learners Park Keon-hee, Song Kyungwoo, Park Gyeong-moon
- Distillation Contrastive Decoding: Improving Llms Reasoning With Contrastive Decoding And Distillation Phan Phuc, Tran Hieu, Phan Long
- Practical Token Pruning For Foundation Models In Few-shot Conversational Virtual Assistant Systems Qi Haode, Qian Cheng, Ni Jian, Singh Pratyush, Fazeli Reza, Wang Gengyu, Shu Zhongzheng, Wayne Eric, Bross Juergen
- Coursegpt-zh: An Educational Large Language Model Based On Knowledge Distillation Incorporating Prompt Optimization Qu Zheyan, Yin Lu, Yu Zitong, Wang Wenbo, Zhang Xing
- Scavenging Hyena: Distilling Transformers Into Long Convolution Models Ralambomihanta Tokiniaina Raharison, Mohammadzadeh Shahrad, Islam Mohammad Sami Nur, Jabbour Wassim, Liang Laurence
- Preference Distillation For Personalized Generative Recommendation Ramos Jerome, Wu Bin, Lipani Aldo
- Small But Funny: A Feedback-driven Approach To Humor Distillation Ravi Sahithya, Huber Patrick, Shrivastava Akshat, Sagar Aditya, Aly Ahmed, Shwartz Vered, Einolghozati Arash
- FIRST: Faster Improved Listwise Reranking With Single Token Decoding Reddy Revanth Gangi, Doo Jaehyeok, Xu Yifei, Sultan Md Arafat, Swain Deevya, Sil Avirup, Ji Heng
- Enhancing Low-resource NMT With A Multilingual Encoder And Knowledge Distillation: A Case Study Roy Aniruddha, Ray Pretam, Maheshwari Ayush, Sarkar Sudeshna, Goyal Pawan
- Mixsumm: Topic-based Data Augmentation Using Llms For Low-resource Extractive Text Summarization Sahu Gaurav, Laradji Issam H.
- Optimization Methods For Personalizing Large Language Models Through Retrieval Augmentation Salemi Alireza, Kallumadi Surya, Zamani Hamed
- Self-distillation For Model Stacking Unlocks Cross-lingual NLU In 200+ Languages Schmidt Fabian David, Borchert Philipp, Vulić Ivan, Glavaš Goran
- Generate-then-ground In Retrieval-augmented Generation For Multi-hop Question Answering Shi Zhengliang, Zhang Shuo, Sun Weiwei, Gao Shen, Ren Pengjie, Chen Zhumin, Ren Zhaochun
- FIRST: Teach A Reliable Large Language Model Through Efficient Trustworthy Distillation Shum Kashun, Xu Minrui, Zhang Jianshu, Chen Zixin, Diao Shizhe, Dong Hanze, Zhang Jipeng, Raza Muhammad Omer
- Comparative Analysis Of Different Efficient Fine Tuning Methods Of Large Language Models (llms) In Low-resource Setting Srinivasan Krishna Prasad Varadarajan, Gumpena Prasanth, Yattapu Madhusudhana, Brahmbhatt Vishal H.
- Tinyllm: Learning A Small Student From Multiple Large Language Models Tian Yijun, Han Yikun, Chen Xiusi, Wang Wei, Chawla Nitesh V.
- Efficient LLM Context Distillation Upadhayayaya Rajesh, Smith Zachary, Kottmyer Chritopher, Osti Manish Raj
- Leveraging Zero-shot Prompting For Efficient Language Model Distillation Vöge Lukas, Gurgul Vincent, Lessmann Stefan
- Investigating Mysteries Of Cot-augmented Distillation Wadhwa Somin, Amir Silvio, Wallace Byron C.
- BLSP-KD: Bootstrapping Language-speech Pre-training Via Knowledge Distillation Wang Chen, Liao Minpeng, Huang Zhongqiang, Zhang Jiajun
- Interactive Dualchecker For Mitigating Hallucinations In Distilling Large Language Models Wang Meiyun, Suzuki Masahiro, Sakaji Hiroki, Izumi Kiyoshi
- Role Prompting Guided Domain Adaptation With General Capability Preserve For Large Language Models Wang Rui, Mi Fei, Chen Yi, Xue Boyang, Wang Hongru, Zhu Qi, Wong Kam-fai, Xu Ruifeng
- Using Advanced Llms To Enhance Smaller Llms: An Interpretable Knowledge Distillation Approach Wang Tong, Sudhir K., Hong Dat
- QCRD: Quality-guided Contrastive Rationale Distillation For Large Language Models Wang Wei, Li Zhaowei, Xu Qi, Cai Yiqing, Song Hang, Qi Qi, Zhou Ran, Huang Zhida, Wang Tao, Xiao Li
- Secokd: Aligning Large Language Models For In-context Learning With Fewer Shots Wang Weixing, Yang Haojin, Meinel Christoph
- Rdrec: Rationale Distillation For Llm-based Recommendation Wang Xinfeng, Cui Jin, Suzuki Yoshimi, Fukumoto Fumiyo
- Promptmm: Multi-modal Knowledge Distillation For Recommendation With Prompt-tuning Wei Wei, Tang Jiabin, Jiang Yangqin, Xia Lianghao, Huang Chao
- Clover-2: Accurate Inference For Regressive Lightweight Speculative Decoding Xiao Bin, Gui Lujun, Su Lei, Chen Weipeng
- Large Language Models Can Learn Temporal Reasoning Xiong Siheng, Payani Ali, Kompella Ramana, Fekri Faramarz
- On-device Language Models: A Comprehensive Review Xu Jiajun, Li Zhiyuan, Chen Wei, Wang Qun, Gao Xin, Cai Qi, Ling Ziyuan
- Self-distillation Bridges Distribution Gap In Language Model Fine-tuning Yang Zhaorui, Pang Tianyu, Feng Haozhe, Wang Han, Chen Wei, Zhu Minfeng, Liu Qian
- Self-judge: Selective Instruction Following With Alignment Self-evaluation Ye Hai, Ng Hwee Tou
- MEND: Meta Demonstration Distillation For Efficient And Effective In-context Learning Li Yichuan, Ma Xiyao, Lu Sixing, Lee Kyumin, Liu Xiaohu, Guo Chenlei
- Distilling System 2 Into System 1 Yu Ping, Xu Jing, Weston Jason, Kulikov Ilia
- Why Lift So Heavy? Slimming Large Language Models By Cutting Off The Layers Yuan Shuzhou, Nie Ercong, Ma Bolei, Färber Michael
- Mammoth2: Scaling Instructions From The Web Yue Xiang, Zheng Tuney, Zhang Ge, Chen Wenhu
- Distilling Instruction-following Abilities Of Large Language Models With Task-aware Curriculum Planning Yue Yuanhao, Wang Chengyu, Huang Jun, Wang Peng
- Can Small Language Models Be Good Reasoners For Sequential Recommendation? Wang Yuling, Tian Changxin, Hu Binbin, Yu Yanhua, Liu Ziqi, Zhang Zhiqiang, Zhou Jun, Pang Liang, Wang Xiao
- Distilling Implicit Multimodal Knowledge Into Llms For Zero-resource Dialogue Generation Zhang Bo, Ma Hui, Ding Jian, Wang Jian, Xu Bo, Lin Hongfei
- Plad: Preference-based Large Language Model Distillation With Pseudo-preference Pairs Zhang Rongzhi, Shen Jiaming, Liu Tianqi, Wang Haorui, Qin Zhen, Han Feng, Liu Jialu, Baumgartner Simon, Bendersky Michael, Zhang Chao
- ELAD: Explanation-guided Large Language Models Active Distillation Zhang Yifei, Pan Bo, Ling Chen, Hu Yuntong, Zhao Liang
- MLKD-BERT: Multi-level Knowledge Distillation For Pre-trained Language Models Zhang Ying, Yang Ziheng, Ji Shufan
- Revisiting Knowledge Distillation For Autoregressive Language Models Zhong Qihuang, Ding Liang, Shen Li, Liu Juhua, Du Bo, Tao Dacheng
- Multi-stage Balanced Distillation: Addressing Long-tail Challenges In Sequence-level Knowledge Distillation Zhou Yuhang, Zhu Jing, Xu Paiheng, Liu Xiaoyu, Wang Xiyao, Koutra Danai, Ai Wei, Huang Furong
- Distilling Mathematical Reasoning Capabilities Into Small Language Models Zhu Xunyu, Li Jian, Liu Yong, Ma Can, Wang Weiping
🏷 Efficiency And Optimization
- Reinforced Mnemonic Reader For Machine Reading Comprehension Hu Minghao, Peng Yuxing, Huang Zhen, Qiu Xipeng, Wei Furu, Zhou Ming
- Training Deeper Neural Machine Translation Models With Transparent Attention Bapna Ankur, Chen Mia Xu, Firat Orhan, Cao Yuan, Wu Yonghui
- Making Neural Machine Reading Comprehension Faster Chatterjee Debajyoti
- Reweighted Proximal Pruning For Large-scale Language Representation Guo Fu-ming, Liu Sijia, Mungall Finlay S., Lin Xue, Wang Yanzhi
- Fine-tuning By Curriculum Learning For Non-autoregressive Neural Machine Translation Guo Junliang, Tan Xu, Xu Linli, Qin Tao, Chen Enhong, Liu Tie-yan
- Zero-shot Paraphrase Generation With Multilingual Language Models Guo Yinpeng, Liao Yi, Jiang Xin, Zhang Qing, Zhang Yibo, Liu Qun
- Resurrecting Submodularity For Neural Text Generation Han Simeng, Lin Xiang, Joty Shafiq
- Visualizing And Understanding The Effectiveness Of BERT Hao Yaru, Dong Li, Wei Furu, Xu Ke
- Convert: Efficient And Accurate Conversational Representations From Transformers Henderson Matthew, Casanueva Iñigo, Mrkšić Nikola, Su Pei-hao, Wen Tsung-hsien, Vulić Ivan
- Tinybert: Distilling BERT For Natural Language Understanding Jiao Xiaoqi, Yin Yichun, Shang Lifeng, Jiang Xin, Chen Xiao, Li Linlin, Wang Fang, Liu Qun
- Sample Efficient Text Summarization Using A Single Pre-trained Transformer Khandelwal Urvashi, Clark Kevin, Jurafsky Dan, Kaiser Lukasz
- TAB-VCR: Tags And Attributes Based Visual Commonsense Reasoning Baselines Lin Jingxiang, Jain Unnat, Schwing Alexander G.
- MKD: A Multi-task Knowledge Distillation Approach For Pretrained Language Models Liu Linqing, Wang Huan, Lin Jimmy, Socher Richard, Xiong Caiming
- Structured Pruning Of A Bert-based Question Answering Model Mccarley J. S., Chakravarti Rishav, Sil Avirup
- Megatron-lm: Training Multi-billion Parameter Language Models Using Model Parallelism Mohammad Shoeybi, Mostofa Patwary, Raul Puri, Patrick Legresley, Jared Casper, Bryan Catanzaro
- Adversarial Bootstrapping For Dialogue Model Training Olabiyi Oluwatobi, Mueller Erik T., Larson Christopher, Lahlou Tarek
- A Modular Task-oriented Dialogue System Using A Neural Mixture-of-experts Pei Jiahuan, Ren Pengjie, De Rijke Maarten
- Zero: Memory Optimizations Toward Training Trillion Parameter Models Samyam Rajbhandari, Jeff Rasley, Olatunji Ruwase, Yuxiong He
- Distilbert, A Distilled Version Of BERT: Smaller, Faster, Cheaper And Lighter Sanh Victor, Debut Lysandre, Chaumond Julien, Wolf Thomas
- Waldorf: Wasteless Language-model Distillation On Reading-comprehension Tian James Yi, Kreuzer Alexander P., Chen Pai-hung, Will Hans-martin
- Well-read Students Learn Better: On The Importance Of Pre-training Compact Models Turc Iulia, Chang Ming-wei, Lee Kenton, Toutanova Kristina
- Language Models With Transformers Wang Chenguang, Li Mu, Smola Alexander J.
- Structured Pruning Of Large Language Models Wang Ziheng, Wohlwend Jeremy, Lei Tao
- Integrated Triaging For Fast Reading Comprehension Wu Felix, Li Boyi, Wang Lequn, Lao Ni, Blitzer John, Weinberger Kilian Q.
- Towards Making The Most Of BERT In Neural Machine Translation Yang Jiacheng, Wang Mingxuan, Zhou Hao, Zhao Chengqi, Yu Yong, Zhang Weinan, Li Lei
- Model Compression With Two-stage Multi-teacher Knowledge Distillation For Web Question Answering System Yang Ze, Shou Linjun, Gong Ming, Lin Wutao, Jiang Daxin
- Q8BERT: Quantized 8bit BERT Zafrir Ofir, Boudoukh Guy, Izsak Peter, Wasserblat Moshe
- Extremely Small BERT Models From Mixed-vocabulary Training Zhao Sanqiang, Gupta Raghav, Song Yang, Zhou Denny
- Building Task-oriented Visual Dialog Systems Through Alternative Optimization Between Dialog Policy And Language Generation Zhou Mingyang, Arnold Josh, Yu Zhou
- Accelerating Natural Language Understanding In Task-oriented Dialog Ahuja Ojas, Desai Shrey
- Adaptive Transformers For Learning Multimodal Representations Bhargava Prajjwal
- Exploring Fluent Query Reformulations With Text-to-text Transformers And Reinforcement Learning Chen Jerry Zikun, Yu Shi, Wang Haoran
- Dipair: Fast And Accurate Distillation For Trillion-scale Text Matching And Pair Modeling Chen Jiecao, Yang Liu, Raman Karthik, Bendersky Michael, Yeh Jung-jung, Zhou Yun, Najork Marc, Cai Danyang, Emadzadeh Ehsan
- Autoadr: Automatic Model Design For Ad Relevance Chen Yiren, Yang Yaming, Sun Hong, Wang Yujing, Xu Yu, Shen Wei, Zhou Rong, Tong Yunhai, Bai Jing, Zhang Ruofei
- Comparative Study Of Language Models On Cross-domain Data With Model Agnostic Explainability Chhipa Mayank, Vazurkar Hrushikesh Mahesh, Kumar Abhijeet, Mishra Mridul
- Rethinking Embedding Coupling In Pre-trained Language Models Chung Hyung Won, Févry Thibault, Tsai Henry, Johnson Melvin, Ruder Sebastian
- A Tailored Pre-training Model For Task-oriented Dialog Generation Gu Jing, Wu Qingyang, Wu Chongruo, Shi Weiyan, Yu Zhou
- Pretrained Transformers Improve Out-of-distribution Robustness Hendrycks Dan, Liu Xiaoyuan, Wallace Eric, Dziedzic Adam, Krishnan Rishabh, Song Dawn
- Less Is More: Data-efficient Complex Question Answering Over Knowledge Bases Hua Yuncheng, Li Yuan-fang, Qi Guilin, Wu Wei, Zhang Jingyao, Qi Daiqing
- Exploring BERT Parameter Efficiency On The Stanford Question Answering Dataset V2.0 Hulburd Eric
- Scaling Laws For Neural Language Models Jared Kaplan, Sam Mccandlish, Tom Henighan, Tom B. Brown, Benjamin Chess, Rewon Child, Scott Gray, Alec Radford, Jeffrey Wu, Dario Amodei
- Syntactic Structure Distillation Pretraining For Bidirectional Encoders Kuncoro Adhiguna, Kong Lingpeng, Fried Daniel, Yogatama Dani, Rimell Laura, Dyer Chris, Blunsom Phil
- Efficient Transformer-based Large Scale Language Representations Using Hardware-friendly Block Structured Pruning Li Bingbing, Kong Zhenglun, Zhang Tianyun, Li Ji, Li Zhengang, Liu Hang, Ding Caiwen
- Multilingual Speech Translation With Efficient Finetuning Of Pretrained Models Li Xian, Wang Changhan, Tang Yun, Tran Chau, Tang Yuqing, Pino Juan, Baevski Alexei, Conneau Alexis, Auli Michael
- Enhancing Dialogue Generation Via Multi-level Contrastive Learning Li Xin, Li Piji, Wang Yan, Liu Xiaojiang, Lam Wai
- Neural Machine Translation With Joint Representation Li Yanyang, Wang Qiang, Xiao Tong, Liu Tongran, Zhu Jingbo
- Train Large, Then Compress: Rethinking Model Size For Efficient Training And Inference Of Transformers Li Zhuohan, Wallace Eric, Shen Sheng, Lin Kevin, Keutzer Kurt, Klein Dan, Gonzalez Joseph E.
- Autoregressive Knowledge Distillation Through Imitation Learning Lin Alexander, Wohlwend Jeremy, Chen Howard, Lei Tao
- Multi-node Bert-pretraining: Cost-efficient Approach Lin Jiahuang, Li Xin, Pekhimenko Gennady
- Mintl: Minimalist Transfer Learning For Task-oriented Dialogue Systems Lin Zhaojiang, Madotto Andrea, Winata Genta Indra, Fung Pascale
- Variational Transformers For Diverse Response Generation Lin Zhaojiang, Winata Genta Indra, Xu Peng, Liu Zihan, Fung Pascale
- Tell Me How To Ask Again: Question Data Augmentation With Controllable Rewriting In Continuous Space Liu Dayiheng, Gong Yeyun, Fu Jie, Yan Yu, Chen Jiusheng, Lv Jiancheng, Duan Nan, Zhou Ming
- Ladabert: Lightweight Adaptation Of BERT Through Hybrid Model Compression Mao Yihuan, Wang Yujing, Wu Chufan, Zhang Chen, Wang Yang, Yang Yaming, Zhang Quanlu, Tong Yunhai, Bai Jing
- Generation-distillation For Efficient Natural Language Understanding In Low-data Settings Melas-kyriazi Luke, Han George, Liang Celine
- Wechat Neural Machine Translation Systems For WMT20 Meng Fandong, Yan Jianhao, Liu Yijin, Gao Yuan, Zeng Xianfeng, Zeng Qinsong, Li Peng, Chen Ming, Zhou Jie, Liu Sifan, Zhou Hao
- ADER: Adaptively Distilled Exemplar Replay Towards Continual Learning For Session-based Recommendation Mi Fei, Lin Xiaoyu, Faltings Boi
- On The Stability Of Fine-tuning BERT: Misconceptions, Explanations, And Strong Baselines Mosbach Marius, Andriushchenko Maksym, Klakow Dietrich
- Axformer: Accuracy-driven Approximation Of Transformers For Faster, Smaller And More Accurate NLP Models Nagarajan Amrit, Sen Sanchari, Stevens Jacob R., Raghunathan Anand
- Real-time Execution Of Large-scale Language Models On Mobile Niu Wei, Kong Zhenglun, Yuan Geng, Jiang Weiwen, Guan Jiexiong, Ding Caiwen, Zhao Pu, Liu Sijia, Ren Bin, Wang Yanzhi
- On The Effect Of Dropping Layers Of Pre-trained Transformer Models Sajjad Hassan, Dalvi Fahim, Durrani Nadir, Nakov Preslav
- It's Not Just Size That Matters: Small Language Models Are Also Few-shot Learners Schick Timo, Schütze Hinrich
- Data-efficient Methods For Dialogue Systems Shalyminov Igor
- Pre-trained Summarization Distillation Shleifer Sam, Rush Alexander M.
- Lightpaff: A Two-stage Distillation Framework For Pre-training And Fine-tuning Song Kaitao, Sun Hao, Tan Xu, Qin Tao, Lu Jianfeng, Liu Hongzhi, Liu Tie-yan
- Discovering Useful Sentence Representations From Large Pretrained Language Models Subramani Nishant, Suresh Nivedita
- Intellicode Compose: Code Generation Using Transformer Svyatkovskiy Alexey, Deng Shao Kun, Fu Shengyu, Sundaresan Neel
- Edgebert: Sentence-level Energy Optimizations For Latency-aware Multi-task NLP Inference Tambe Thierry, Hooper Coleman, Pentecost Lillian, Jia Tianyu, Yang En-yu, Donato Marco, Sanh Victor, Whatmough Paul N., Rush Alexander M., Brooks David, Wei Gu-yeon
- Interpretable Neural Computation For Real-world Compositional Visual Question Answering Tang Ruixue, Ma Chao
- SRQA: Synthetic Reader For Factoid Question Answering Wang Jiuniu, Xu Wenjia, Fu Xingyu, Wei Yang, Jin Li, Chen Ziyan, Xu Guangluan, Wu Yirong
- Minilm: Deep Self-attention Distillation For Task-agnostic Compression Of Pre-trained Transformers Wang Wenhui, Wei Furu, Dong Li, Bao Hangbo, Yang Nan, Zhou Ming
- Lightseq: A High Performance Inference Library For Transformers Wang Xiaohui, Xiong Ying, Wei Yang, Wang Mingxuan, Li Lei
- Lite Transformer With Long-short Range Attention Wu Zhanghao, Liu Zhijian, Lin Ji, Lin Yujun, Han Song
- Improving BERT Fine-tuning Via Self-ensemble And Self-distillation Xu Yige, Qiu Xipeng, Zhou Ligao, Huang Xuanjing
- MC-BERT: Efficient Language Pre-training Via A Meta Controller Xu Zhenhui, Gong Linyuan, Ke Guolin, He Di, Zheng Shuxin, Wang Liwei, Bian Jiang, Liu Tie-yan
- Multi-unit Transformers For Neural Machine Translation Yan Jianhao, Meng Fandong, Zhou Jie
- Adding Recurrence To Pretrained Transformers For Improved Efficiency And Context Size Yoshida Davis, Ettinger Allyson, Gimpel Kevin
- GOBO: Quantizing Attention-based NLP Models For Low Latency And Energy Efficient Inference Zadeh Ali Hadi, Edo Isak, Awad Omar Mohamed, Moshovos Andreas
- Accelerating Training Of Transformer-based Language Models With Progressive Layer Dropping Zhang Minjia, He Yuxiong
- Improving Adversarial Text Generation By Modeling The Distant Future Zhang Ruiyi, Chen Changyou, Gan Zhe, Wang Wenlin, Shen Dinghan, Wang Guoyin, Wen Zheng, Carin Lawrence
- Future-guided Incremental Transformer For Simultaneous Translation Zhang Shaolei, Feng Yang, Li Liangyou
- BERT-JAM: Boosting Bert-enhanced Neural Machine Translation With Joint Attention Zhang Zhebin, Wu Sai, Jiang Dawei, Chen Gang
- Know What You Don't Need: Single-shot Meta-pruning For Attention Heads Zhang Zhengyan, Qi Fanchao, Liu Zhiyuan, Liu Qun, Sun Maosong
- Improving Non-autoregressive Neural Machine Translation With Monolingual Data Zhou Jiawei, Keung Phillip
- Goal-directed Story Generation: Augmenting Generative Language Models With Reinforcement Learning Alabdulkarim Amal, Li Winston, Martin Lara J., Riedl Mark O.
- A General Language Assistant As A Laboratory For Alignment Amanda Askell, Yuntao Bai, Anna Chen, Dawn Drain, Deep Ganguli, Tom Henighan, Andy Jones, Nicholas Joseph, Ben Mann, Nova Dassarma, Nelson Elhage, Zac Hatfield-dodds, Danny Hernandez, Jackson Kernion, Kamal Ndousse, Catherine Olsson, Dario Amodei, Tom Brown, Jack Clark, Sam Mccandlish, Chris Olah, Jared Kaplan
- Ext5: Towards Extreme Multi-task Scaling For Transfer Learning Aribandi Vamsi, Tay Yi, Schuster Tal, Rao Jinfeng, Zheng Huaixiu Steven, Mehta Sanket Vaibhav, Zhuang Honglei, Tran Vinh Q., Bahri Dara, Ni Jianmo, Gupta Jai, Hui Kai, Ruder Sebastian, Metzler Donald
- PLATO-XL: Exploring The Large-scale Pre-training Of Dialogue Generation Bao Siqi, He Huang, Wang Fan, Wu Hua, Wang Haifeng, Wu Wenquan, Wu Zhihua, Guo Zhen, Lu Hua, Huang Xinxian, Tian Xin, Xu Xinchao, Lin Yingzhan, Niu Zheng-yu
- What Changes Can Large-scale Language Models Bring? Intensive Study On Hyperclova: Billions-scale Korean Generative Pretrained Transformers Boseop Kim, Hyoungseok Kim, Sang-woo Lee, Gichang Lee, Donghyun Kwak, Dong Hyeon Jeon, Sunghyun Park, Sungju Kim, Seonhoon Kim, Dongpil Seo, Heungsub Lee, Minyoung Jeong, Sungjae Lee, Minsub Kim, Suk Hyun Ko, Seokhun Kim, Taeyong Park, Jinuk Kim, Soyoung Kang, Na-hyeon Ryu, Kang Min Yoo, Minsuk Chang, Soobin Suh, Sookyo In, Jinseong Park, Kyungduk Kim, Hiun Kim, Jisu Jeong, Yong Goo Yeo, Donghoon Ham, Dongju Park, Min Young Lee, Jaewook Kang, Inho Kang, Jung-woo Ha, Woomyoung Park, Nako Sung
- On The Prunability Of Attention Heads In Multilingual BERT Budhraja Aakriti, Pande Madhura, Kumar Pratyush, Khapra Mitesh M.
- Visualgpt: Data-efficient Adaptation Of Pretrained Language Models For Image Captioning Chen Jun, Guo Han, Yi Kai, Li Boyang, Elhoseiny Mohamed
- Language Modeling Using Lmus: 10x Better Data Efficiency Or Improved Scaling Compared To Transformers Chilkuri Narsimha, Hunsberger Eric, Voelker Aaron, Malik Gurshaant, Eliasmith Chris
- Distilling Transformers For Neural Cross-domain Search Clement Colin B., Wu Chen, Drain Dawn, Sundaresan Neel
- YANMTT: Yet Another Neural Machine Translation Toolkit Dabre Raj, Sumita Eiichiro
- Openprompt: An Open-source Framework For Prompt-learning Ding Ning, Hu Shengding, Zhao Weilin, Chen Yulin, Liu Zhiyuan, Zheng Hai-tao, Sun Maosong
- Diverse Pretrained Context Encodings Improve Document Translation Donato Domenic, Yu Lei, Dyer Chris
- Robustness Challenges In Model Distillation And Pruning For Natural Language Understanding Du Mengnan, Mukherjee Subhabrata, Cheng Yu, Shokouhi Milad, Hu Xia, Awadallah Ahmed Hassan
- Robustly Optimized And Distilled Training For Natural Language Understanding Elfadeel Haytham, Peshterliev Stan
- Compressing Visual-linguistic Model Via Knowledge Distillation Fang Zhiyuan, Wang Jianfeng, Hu Xiaowei, Wang Lijuan, Yang Yezhou, Liu Zicheng
- Scalable Transformers For Neural Machine Translation Gao Peng, Geng Shijie, Qiao Yu, Wang Xiaogang, Dai Jifeng, Li Hongsheng
- ELIT: Emory Language And Information Toolkit He Han, Xu Liyan, Choi Jinho D.
- Generate, Annotate, And Learn: NLP With Synthetic Text He Xuanli, Nassar Islam, Kiros Jamie, Haffari Gholamreza, Norouzi Mohammad
- ENCONTER: Entity Constrained Progressive Sequence Generation Via Insertion-based Transformer Hsieh Lee-hsun, Lee Yang-yin, Lim Ee-peng
- The Niutrans System For WNGT 2020 Efficiency Task Hu Chi, Li Bei, Lin Ye, Li Yinqiao, Li Yanyang, Wang Chenglong, Xiao Tong, Zhu Jingbo
- Audio-oriented Multimodal Machine Comprehension: Task, Dataset And Model Huang Zhiqi, Liu Fenglin, Wu Xian, Ge Shen, Wang Helin, Fan Wei, Zou Yuexian
- Lightmbert: A Simple Yet Effective Method For Multilingual BERT Distillation Jiao Xiaoqi, Yin Yichun, Shang Lifeng, Jiang Xin, Chen Xiao, Li Linlin, Wang Fang, Liu Qun
- Distilling Large Language Models Into Tiny And Effective Students Using Pqrnn Kaliamoorthi Prabhu, Siddhant Aditya, Li Edward, Johnson Melvin
- What Do Pre-trained Code Models Know About Code? Karmakar Anjan, Robbes Romain
- Finetuning Pretrained Transformers Into Rnns Kasai Jungo, Peng Hao, Zhang Yizhe, Yogatama Dani, Ilharco Gabriel, Pappas Nikolaos, Mao Yi, Chen Weizhu, Smith Noah A.
- Learned Token Pruning For Transformers Kim Sehoon, Shen Sheng, Thorsley David, Gholami Amir, Kwon Woosuk, Hassoun Joseph, Keutzer Kurt
- BERT Busters: Outlier Dimensions That Disrupt Transformers Kovaleva Olga, Kulshreshtha Saurabh, Rogers Anna, Rumshisky Anna
- Layered Gradient Accumulation And Modular Pipeline Parallelism: Fast And Efficient Training Of Large Language Models Lamy-poirier Joel
- Pagnol: An Extra-large French Generative Model Launay Julien, Tommasone Elena, Pannier Baptiste, Boniface François, Chatelain Amélie, Cappelli Alessandro, Poli Iacopo, Seddah Djamé
- When Attention Meets Fast Recurrence: Training Language Models With Reduced Compute Lei Tao
- Exploiting Sentence-level Representations For Passage Ranking Leonhardt Jurek, Beringer Fabian, Anand Avishek
- The Stability-efficiency Dilemma: Investigating Sequence Length Warmup For Training GPT Models Li Conglong, Zhang Minjia, He Yuxiong
- Align Before Fuse: Vision And Language Representation Learning With Momentum Distillation Li Junnan, Selvaraju Ramprasaath R., Gotmare Akhilesh Deepak, Joty Shafiq, Xiong Caiming, Hoi Steven
- Personalized Transformer For Explainable Recommendation Li Lei, Zhang Yongfeng, Chen Li
- A Short Study On Compressing Decoder-based Language Models Li Tianda, Mesbahi Yassir El, Kobyzev Ivan, Rashid Ahmad, Mahmud Atif, Anchuri Nithin, Hajimolahoseini Habib, Liu Yang, Rezagholizadeh Mehdi
- Enriching Non-autoregressive Transformer With Syntactic And Semantic Structures For Neural Machine Translation Liu Ye, Wan Yao, Zhang Jian-guo, Zhao Wenting, Yu Philip S.
- Pretrained Transformers As Universal Computation Engines Lu Kevin, Grover Aditya, Abbeel Pieter, Mordatch Igor
- X-METRA-ADA: Cross-lingual Meta-transfer Learning Adaptation To Natural Language Understanding And Question Answering M'hamdi Meryem, Kim Doo Soon, Dernoncourt Franck, Bui Trung, Ren Xiang, May Jonathan
- COCO-LM: Correcting And Contrasting Text Sequences For Language Model Pretraining Meng Yu, Xiong Chenyan, Bajaj Payal, Tiwary Saurabh, Bennett Paul, Han Jiawei, Song Xia
- Hierarchical Transformers Are More Efficient Language Models Nawrot Piotr, Tworkowski Szymon, Tyrolski Michał, Kaiser Łukasz, Wu Yuhuai, Szegedy Christian, Michalewski Henryk
- ABC: Attention With Bounded-memory Control Peng Hao, Kasai Jungo, Pappas Nikolaos, Yogatama Dani, Wu Zhaofeng, Kong Lingpeng, Schwartz Roy, Smith Noah A.
- Assessing The Syntactic Capabilities Of Transformer-based Multilingual Language Models Pérez-mayos Laura, García Alba Táboas, Mille Simon, Wanner Leo
- Layer-wise Pruning Of Transformer Attention Heads For Efficient Language Modeling Shim Kyuhong, Choi Iksoo, Sung Wonyong, Choi Jungwook
- Exploring Low-cost Transformer Model Compression For Large-scale Commercial Reply Suggestions Shrivastava Vaishnavi, Gaonkar Radhika, Gupta Shashank, Jha Abhishek
- ERNIE 3.0 Titan: Exploring Larger-scale Knowledge Enhanced Pre-training For Language Understanding And Generation Shuohuan Wang, Yu Sun, Yang Xiang, Zhihua Wu, Siyu Ding, Weibao Gong, Shikun Feng, Junyuan Shang, Yanbin Zhao, Chao Pang, Jiaxiang Liu, Xuyi Chen, Yuxiang Lu, Weixin Liu, Xi Wang, Yangfan Bai, Qiuliang Chen, Li Zhao, Shiyong Li, Peng Sun, Dianhai Yu, Yanjun Ma, Hao Tian, Hua Wu, Tian Wu, Wei Zeng, Ge Li, Wen Gao, Haifeng Wang
- Pre-trained Language Models As Prior Knowledge For Playing Text-based Games Singh Ishika, Singh Gargi, Modi Ashutosh
- Ernie-tiny: A Progressive Distillation Framework For Pretrained Transformer Compression Su Weiyue, Chen Xuyi, Feng Shikun, Liu Jiaxiang, Liu Weixin, Sun Yu, Tian Hao, Wu Hua, Wang Haifeng
- NVIDIA Nemo Neural Machine Translation Systems For English-german And English-russian News And Biomedical Tasks At WMT21 Subramanian Sandeep, Hrinchuk Oleksii, Adams Virginia, Kuchaiev Oleksii
- Not All Memories Are Created Equal: Learning To Forget By Expiring Sukhbaatar Sainbayar, Ju Da, Poff Spencer, Roller Stephen, Szlam Arthur, Weston Jason, Fan Angela
- Do Long-range Language Models Actually Use Long-range Context? Sun Simeng, Krishna Kalpesh, Mattarella-micke Andrew, Iyyer Mohit
- Diff-explainer: Differentiable Convex Optimization For Explainable Multi-hop Inference Thayaparan Mokanarangan, Valentino Marco, Ferreira Deborah, Rozanova Julia, Freitas André
- Selective Knowledge Distillation For Neural Machine Translation Wang Fusheng, Yan Jianhao, Meng Fandong, Zhou Jie
- Lightseq2: Accelerated Training For Transformer-based Models On Gpus Wang Xiaohui, Wei Yang, Xiong Ying, Huang Guyue, Qian Xian, Ding Yufei, Wang Mingxuan, Li Lei
- Distilled Dual-encoder Model For Vision-language Understanding Wang Zekun, Wang Wenhui, Zhu Haichao, Liu Ming, Qin Bing, Wei Furu
- VAULT: Variable Unified Long Text Representation For Machine Reading Comprehension Wen Haoyang, Ferritto Anthony, Ji Heng, Florian Radu, Sil Avirup
- Training Large-scale News Recommenders With Pretrained Language Models In The Loop Xiao Shitao, Liu Zheng, Shao Yingxia, Di Tao, Xie Xing
- E2E-VLP: End-to-end Vision-language Pre-training Enhanced By Visual Learning Xu Haiyang, Yan Ming, Li Chenliang, Bi Bin, Huang Songfang, Xiao Wenming, Huang Fei
- From Dense To Sparse: Contrastive Pruning For Better Pre-trained Language Model Compression Xu Runxin, Luo Fuli, Wang Chengyu, Chang Baobao, Huang Jun, Huang Songfang, Huang Fei
- Fastseq: Make Sequence Generation Faster Yan Yu, Hu Fei, Chen Jiusheng, Bhendawade Nikhil, Ye Ting, Gong Yeyun, Duan Nan, Cui Desheng, Chi Bingyu, Zhang Ruofei
- Turning Tables: Generating Examples From Semi-structured Tables For Endowing Language Models With Reasoning Skills Yoran Ori, Talmor Alon, Berant Jonathan
- Reconsidering The Past: Optimizing Hidden States In Language Models Yoshida Davis, Gimpel Kevin
- Prune Once For All: Sparse Pre-trained Language Models Zafrir Ofir, Larey Ariel, Boudoukh Guy, Shen Haihao, Wasserblat Moshe
- Wechat Neural Machine Translation Systems For WMT21 Zeng Xianfeng, Liu Yijin, Li Ernan, Ran Qiu, Meng Fandong, Li Peng, Xu Jinan, Zhou Jie
- Ernie-vilg: Unified Generative Pre-training For Bidirectional Vision-language Generation Zhang Han, Yin Weichong, Fang Yewei, Li Lanxin, Duan Boqiang, Wu Zhihua, Sun Yu, Tian Hao, Wu Hua, Wang Haifeng
- A Knowledge-grounded Dialog System Based On Pre-trained Language Models Zhang Weijie, Chen Jiaoxuan, Wu Haipang, Wan Sanhui, Li Gongfeng
- CPM-2: Large-scale Cost-effective Pre-trained Language Models Zhengyan Zhang, Yuxian Gu, Xu Han, Shengqi Chen, Chaojun Xiao, Zhenbo Sun, Yuan Yao, Fanchao Qi, Jian Guan, Pei Ke, Yanzheng Cai, Guoyang Zeng, Zhixing Tan, Zhiyuan Liu, Minlie Huang, Wentao Han, Yang Liu, Xiaoyan Zhu, Maosong Sun
- Gpt-3-driven Pedagogical Agents For Training Children's Curious Question-asking Skills Abdelghani Rania, Wang Yen-hsiang, Yuan Xingdi, Wang Tong, Lucas Pauline, Sauzéon Hélène, Oudeyer Pierre-yves
- GLM-130B: An Open Bilingual Pre-trained Model Aohan Zeng, Xiao Liu, Zhengxiao Du, Zihan Wang, Hanyu Lai, Ming Ding, Zhuoyi Yang, Yifan Xu, Wendi Zheng, Xiao Xia, Weng Lam Tam, Zixuan Ma, Yufei Xue, Jidong Zhai, Wenguang Chen, Peng Zhang, Yuxiao Dong, Jie Tang
- Latent-glat: Glancing At Latent Variables For Parallel Text Generation Bao Yu, Zhou Hao, Huang Shujian, Wang Dongqi, Qian Lihua, Dai Xinyu, Chen Jiajun, Li Lei
- I2D2: Inductive Knowledge Distillation With Neurologic And Self-imitation Bhagavatula Chandra, Hwang Jena D., Downey Doug, Bras Ronan Le, Lu Ximing, Qin Lianhui, Sakaguchi Keisuke, Swayamdipta Swabha, West Peter, Choi Yejin
- Vector-quantized Input-contextualized Soft Prompts For Natural Language Understanding Bhardwaj Rishabh, Saha Amrita, Hoi Steven C. H., Poria Soujanya
- Ernie-code: Beyond English-centric Cross-lingual Pretraining For Programming Languages Chai Yekun, Wang Shuohuan, Pang Chao, Sun Yu, Tian Hao, Wu Hua
- Speechprompt: An Exploration Of Prompt Tuning On Generative Spoken Language Model For Speech Processing Tasks Chang Kai-wei, Tseng Wei-cheng, Li Shang-wen, Lee Hung-yi
- Xdoc: Unified Pre-training For Cross-format Document Understanding Chen Jingye, Lv Tengchao, Cui Lei, Zhang Cha, Wei Furu
- Vault: Augmenting The Vision-and-language Transformer For Sentiment Classification On Social Media Chochlakis Georgios, Srinivasan Tejas, Thomason Jesse, Narayanan Shrikanth
- Decoder Tuning: Efficient Language Understanding As Decoding Cui Ganqu, Li Wentao, Ding Ning, Huang Longtao, Liu Zhiyuan, Sun Maosong
- M6-rec: Generative Pretrained Language Models Are Open-ended Recommender Systems Cui Zeyu, Ma Jianxin, Zhou Chang, Zhou Jingren, Yang Hongxia
- Enabling Multimodal Generation On CLIP Via Vision-language Knowledge Distillation Dai Wenliang, Hou Lu, Shang Lifeng, Jiang Xin, Liu Qun, Fung Pascale
- Rlprompt: Optimizing Discrete Text Prompts With Reinforcement Learning Deng Mingkai, Wang Jianyu, Hsieh Cheng-ping, Wang Yihan, Guo Han, Shu Tianmin, Song Meng, Xing Eric P., Hu Zhiting
- Llm.int8(): 8-bit Matrix Multiplication For Transformers At Scale Dettmers Tim, Lewis Mike, Belkada Younes, Zettlemoyer Luke
- Black-box Prompt Learning For Pre-trained Language Models Diao Shizhe, Huang Zhichao, Xu Ruijia, Li Xuechun, Lin Yong, Zhou Xiao, Zhang Tong
- Imagination Is All You Need! Curved Contrastive Learning For Abstract Sequence Modeling Utilized On Long Short-term Dialogue Planning Erker Justus-jonas, Schaffer Stefan, Spanakis Gerasimos
- Llmeffichecker: Understanding And Testing Efficiency Degradation Of Large Language Models Feng Xiaoning, Han Xiaohong, Chen Simin, Yang Wei
- GPTQ: Accurate Post-training Quantization For Generative Pre-trained Transformers Frantar Elias, Ashkboos Saleh, Hoefler Torsten, Alistarh Dan
- Hungry Hungry Hippos: Towards Language Modeling With State Space Models Fu Daniel Y., Dao Tri, Saab Khaled K., Thomas Armin W., Rudra Atri, Ré Christopher
- Pnlp-mixer: An Efficient All-mlp Architecture For Language Fusco Francesco, Pascual Damian, Staar Peter, Antognini Diego
- Visual Prompt Tuning For Test-time Domain Adaptation Gao Yunhe, Shi Xingjian, Zhu Yi, Wang Hao, Tang Zhiqiang, Zhou Xiong, Li Mu, Metaxas Dimitris N.
- Cramming: Training A Language Model On A Single GPU In One Day Geiping Jonas, Goldstein Tom
- Re2g: Retrieve, Rerank, Generate Glass Michael, Rossiello Gaetano, Chowdhury Md Faisal Mahbub, Naik Ankita Rajaram, Cai Pengshan, Gliozzo Alfio
- Transkimmer: Transformer Learns To Layer-wise Skim Guan Yue, Li Zhengyi, Leng Jingwen, Lin Zhouhan, Guo Minyi
- Quala-minilm: A Quantized Length Adaptive Minilm Guskin Shira, Wasserblat Moshe, Wang Chang, Shen Haihao
- LUNA: Language Understanding With Number Augmentations On Transformers Via Number Plugins And Pre-training Han Hongwei, Xu Jialiang, Zhou Mengyu, Shao Yijia, Han Shi, Zhang Dongmei
- SPACE-3: Unified Dialog Model Pre-training For Task-oriented Dialog Understanding And Generation He Wanwei, Dai Yinpei, Yang Min, Sun Jian, Huang Fei, Si Luo, Li Yongbin
- Hyperprompt: Prompt-based Task-conditioning Of Transformers He Yun, Zheng Huaixiu Steven, Tay Yi, Gupta Jai, Du Yu, Aribandi Vamsi, Zhao Zhe, Li Yaguang, Chen Zhao, Metzler Donald, Cheng Heng-tze, Chi Ed H.
- Shapley Head Pruning: Identifying And Removing Interference In Multilingual Transformers Held William, Yang Diyi
- Compressing Pre-trained Transformers Via Low-bit Nxm Sparsity For Natural Language Understanding Holmes Connor, Zhang Minjia, He Yuxiong, Wu Bo
- Syntax-guided Localized Self-attention By Constituency Syntactic Distance Hou Shengyuan, Kai Jushi, Xue Haotian, Zhu Bingyu, Yuan Bo, Huang Longtao, Wang Xinbing, Lin Zhouhan
- Empirical Evaluation Of Post-training Quantization Methods For Language Tasks Hu Ting, Meinel Christoph, Yang Haojin
- Autoregressive Entity Generation For End-to-end Task-oriented Dialog Huang Guanhuan, Quan Xiaojun, Wang Qifan
- FPT: Improving Prompt Tuning Efficiency Via Progressive Training Huang Yufei, Qin Yujia, Wang Huadong, Yin Yichun, Sun Maosong, Liu Zhiyuan, Liu Qun
- Emergent Abilities Of Large Language Models Jason Wei, Yi Tay, Rishi Bommasani, Colin Raffel, Barret Zoph, Sebastian Borgeaud, Dani Yogatama, Maarten Bosma, Denny Zhou, Donald Metzler, Ed H. Chi, Tatsunori Hashimoto, Oriol Vinyals, Percy Liang, Jeff Dean, William Fedus
- Latency Adjustable Transformer Encoder For Language Understanding Kachuee Sajjad, Sharifkhani Mohammad
- Who Says Elephants Can't Run: Bringing Large Scale Moe Models Into Cloud Scale Production Kim Young Jin, Henry Rawn, Fahim Raffy, Awadalla Hany Hassan
- Validating Large Language Models With Relm Kuchnik Michael, Smith Virginia, Amvrosiadis George
- The Optimal BERT Surgeon: Scalable And Accurate Second-order Pruning For Large Language Models Kurtic Eldar, Campos Daniel, Nguyen Tuan, Frantar Elias, Kurtz Mark, Fineran Benjamin, Goin Michael, Alistarh Dan
- Language Detoxification With Attribute-discriminative Latent Space Kwak Jin Myung, Kim Minseon, Hwang Sung Ju
- Interactive Code Generation Via Test-driven User-intent Formalization Lahiri Shuvendu K., Fakhoury Sarah, Naik Aaditya, Sakkas Georgios, Chakraborty Saikat, Musuvathi Madanlal, Choudhury Piali, Von Veh Curtis, Inala Jeevana Priya, Wang Chenglong, Gao Jianfeng
- Evaluation Of Synthetic Datasets For Conversational Recommender Systems Lara Harsh, Tiwari Manoj
- Hard Gate Knowledge Distillation -- Leverage Calibration For Robust And Reliable Language Model Lee Dongkyu, Tian Zhiliang, Zhao Yingxiu, Cheung Ka Chun, Zhang Nevin L.
- Simple Recurrence Improves Masked Language Models Lei Tao, Tian Ran, Bastings Jasmijn, Parikh Ankur P.
- ELMER: A Non-autoregressive Pre-trained Language Model For Efficient And Effective Text Generation Li Junyi, Tang Tianyi, Zhao Wayne Xin, Nie Jian-yun, Wen Ji-rong
- Evaluating Psychological Safety Of Large Language Models Li Xingxuan, Li Yutong, Qiu Lin, Joty Shafiq, Bing Lidong
- Accelerating Attention Through Gradient-based Learned Runtime Pruning Li Zheng, Ghodrati Soroush, Yazdanbakhsh Amir, Esmaeilzadeh Hadi, Kang Mingu
- DQ-BART: Efficient Sequence-to-sequence Model Via Joint Distillation And Quantization Li Zheng, Wang Zijian, Tan Ming, Nallapati Ramesh, Bhatia Parminder, Arnold Andrew, Xiang Bing, Roth Dan
- Bjtu-wechat's Systems For The WMT22 Chat Translation Task Liang Yunlong, Meng Fandong, Xu Jinan, Chen Yufeng, Zhou Jie
- Simple And Effective Gradient-based Tuning Of Sequence-to-sequence Models Lichtarge Jared, Alberti Chris, Kumar Shankar
- Gentus: Simulating User Behaviour And Language In Task-oriented Dialogues With Generative Transformers Lin Hsien-chin, Geishauser Christian, Feng Shutong, Lubis Nurul, Van Niekerk Carel, Heck Michael, Gašić Milica
- Building Markovian Generative Architectures Over Pretrained LM Backbones For Efficient Task-oriented Dialog Systems Liu Hong, Cai Yucheng, Ou Zhijian, Huang Yi, Feng Junlan
- Fl-tuning: Layer Tuning For Feed-forward Network In Transformer Liu Jingping, Song Yuqiu, Xue Kui, Sun Hongli, Wang Chao, Chen Lihan, Jiang Haiyun, Liang Jiaqing, Ruan Tong
- Transformer With Memory Replay Liu Rui, Mozafari Barzan
- A Win-win Deal: Towards Sparse And Robust Pre-trained Language Models Liu Yuanxin, Meng Fandong, Lin Zheng, Li Jiangnan, Fu Peng, Cao Yanan, Wang Weiping, Zhou Jie
- Torchscale: Transformers At Scale Ma Shuming, Wang Hongyu, Huang Shaohan, Wang Wenhui, Chi Zewen, Dong Li, Benhaim Alon, Patra Barun, Chaudhary Vishrav, Song Xia, Wei Furu
- Teaching Small Language Models To Reason Magister Lucie Charlotte, Mallinson Jonathan, Adamek Jakub, Malmi Eric, Severyn Aliaksei
- Text Generation With Text-editing Models Malmi Eric, Dong Yue, Mallinson Jonathan, Chuklin Aleksandr, Adamek Jakub, Mirylenka Daniil, Stahlberg Felix, Krause Sebastian, Kumar Shankar, Severyn Aliaksei
- Tangobert: Reducing Inference Cost By Using Cascaded Architecture Mamou Jonathan, Pereg Oren, Wasserblat Moshe, Schwartz Roy
- Minialbert: Model Distillation Via Parameter-efficient Recursive Transformers Nouriborji Mohammadmahdi, Rohanian Omid, Kouchaki Samaneh, Clifton David A.
- Parameter-efficient Abstractive Question Answering Over Tables Or Text Pal Vaishali, Kanoulas Evangelos, De Rijke Maarten
- LUT-GEMM: Quantized Matrix Multiplication Based On Luts For Efficient Inference In Large-scale Generative Language Models Park Gunho, Park Baeseong, Kim Minsub, Lee Sungjae, Kim Jeonghoon, Kwon Beomseok, Kwon Se Jung, Kim Byeongwook, Lee Youngjoo, Lee Dongsoo
- Holistic Evaluation Of Language Models Percy Liang, Rishi Bommasani, Tony Lee, Dimitris Tsipras, Dilara Soylu, Michihiro Yasunaga, Yian Zhang, Deepak Narayanan, Yuhuai Wu, Ananya Kumar, Benjamin Newman, Binhang Yuan, Bobby Yan, Ce Zhang, Christian Cosgrove, Christopher D. Manning, Christopher Ré, Diana Acosta-navas, Drew A. Hudson, Eric Zelikman, Esin Durmus, Faisal Ladhak, Frieda Rong, Hongyu Ren, Huaxiu Yao, Jue Wang, Keshav Santhanam, Laurel Orr, Lucia Zheng, Mert Yuksekgonul, Mirac Suzgun, Nathan Kim, Neel Guha, Niladri Chatterji, Omar Khattab, Peter Henderson, Qian Huang, Ryan Chi, Sang Michael Xie, Shibani Santurkar, Surya Ganguli, Tatsunori Hashimoto, Thomas Icard, Tianyi Zhang, Vishrav Chaudhary, William Wang, Xuechen Li, Yifan Mai, Yuhui Zhang, Yuta Koreeda
- Hypertuning: Toward Adapting Large Language Models Without Back-propagation Phang Jason, Mao Yi, He Pengcheng, Chen Weizhu
- Investigating Efficiently Extending Transformers For Long Input Summarization Phang Jason, Zhao Yao, Liu Peter J.
- Outliers Dimensions That Disrupt Transformers Are Driven By Frequency Puccetti Giovanni, Rogers Anna, Drozd Aleksandr, Dell'orletta Felice
- A Self-paced Mixed Distillation Method For Non-autoregressive Generation Qi Weizhen, Gong Yeyun, Shen Yelong, Jiao Jian, Yan Yu, Li Houqiang, Zhang Ruofei, Chen Weizhu, Duan Nan
- Diffusion Glancing Transformer For Parallel Sequence To Sequence Learning Qian Lihua, Wang Mingxuan, Liu Yang, Zhou Hao
- The NLP Task Effectiveness Of Long-range Transformers Qin Guanghui, Feng Yukun, Van Durme Benjamin
- ELLE: Efficient Lifelong Pre-training For Emerging Data Qin Yujia, Zhang Jiajie, Lin Yankai, Liu Zhiyuan, Li Peng, Sun Maosong, Zhou Jie
- Exploring Extreme Parameter Compression For Pre-trained Language Models Ren Yuxin, Wang Benyou, Shang Lifeng, Jiang Xin, Liu Qun
- Breakpoint Transformers For Modeling And Tracking Intermediate Beliefs Richardson Kyle, Tamari Ronen, Sultan Oren, Tsarfaty Reut, Shahaf Dafna, Sabharwal Ashish
- Dialokg: Knowledge-structure Aware Task-oriented Dialogue Generation Rony Md Rashad Al Hasan, Usbeck Ricardo, Lehmann Jens
- Overcoming Catastrophic Forgetting Beyond Continual Learning: Balanced Training For Neural Machine Translation Shao Chenze, Feng Yang
- Staged Training For Transformer Language Models Shen Sheng, Walsh Pete, Keutzer Kurt, Dodge Jesse, Peters Matthew, Beltagy Iz
- Distilling Reasoning Capabilities Into Smaller Language Models Shridhar Kumar, Stolfo Alessandro, Sachan Mrinmaya
- Augmenting Interpretable Models With Llms During Training Singh Chandan, Askari Armin, Caruana Rich, Gao Jianfeng
- Llm-planner: Few-shot Grounded Planning For Embodied Agents With Large Language Models Song Chan Hee, Wu Jiaman, Washington Clayton, Sadler Brian M., Chao Wei-lun, Su Yu
- QUILL: Query Intent With Large Language Models Using Retrieval Augmentation And Multi-stage Distillation Srinivasan Krishna, Raman Karthik, Samanta Anupam, Liao Lingrui, Bertelli Luca, Bendersky Mike
- Black-box Tuning For Language-model-as-a-service Sun Tianxiang, Shao Yunfan, Qian Hong, Huang Xuanjing, Qiu Xipeng
- Metaphorical User Simulators For Evaluating Task-oriented Dialogue Systems Sun Weiwei, Guo Shuyu, Zhang Shuo, Ren Pengjie, Chen Zhumin, De Rijke Maarten, Ren Zhaochun
- Large Language Models Are Zero-shot Reasoners Takeshi Kojima, Shixiang Shane Gu, Machel Reid, Yutaka Matsuo, Yusuke Iwasawa
- Compression Of Generative Pre-trained Language Models Via Quantization Tao Chaofan, Hou Lu, Zhang Wei, Shang Lifeng, Jiang Xin, Liu Qun, Luo Ping, Wong Ngai
- Will We Run Out Of Data? Limits Of LLM Scaling Based On Human-generated Data Villalobos Pablo, Ho Anson, Sevilla Jaime, Besiroglu Tamay, Heim Lennart, Hobbhahn Marius
- Exploring The Limits Of Domain-adaptive Training For Detoxifying Large-scale Language Models Wang Boxin, Ping Wei, Xiao Chaowei, Xu Peng, Patwary Mostofa, Shoeybi Mohammad, Li Bo, Anandkumar Anima, Catanzaro Bryan
- Zero-shot Image Captioning By Anchor-augmented Vision-language Space Alignment Wang Junyang, Zhang Yi, Yan Ming, Zhang Ji, Sang Jitao
- Finding Skill Neurons In Pre-trained Transformer-based Language Models Wang Xiaozhi, Wen Kaiyue, Zhang Zhengyan, Hou Lei, Liu Zhiyuan, Li Juanzi
- Multimodal Adaptive Distillation For Leveraging Unimodal Encoders For Vision-language Tasks Wang Zhecan, Codella Noel, Chen Yen-chun, Zhou Luowei, Dai Xiyang, Xiao Bin, Yang Jianwei, You Haoxuan, Chang Kai-wei, Chang Shih-fu, Yuan Lu
- Stateful Memory-augmented Transformers For Efficient Dialogue Modeling Wu Qingyang, Yu Zhou
- Extreme Compression For Pre-trained Transformers Made Simple And Efficient Wu Xiaoxia, Yao Zhewei, Zhang Minjia, Li Conglong, He Yuxiong
- An Efficient Memory-augmented Transformer For Knowledge-intensive NLP Tasks Wu Yuxiang, Zhao Yu, Hu Baotian, Minervini Pasquale, Stenetorp Pontus, Riedel Sebastian
- Smoothquant: Accurate And Efficient Post-training Quantization For Large Language Models Xiao Guangxuan, Lin Ji, Seznec Mickael, Wu Hao, Demouth Julien, Han Song
- Dual Modality Prompt Tuning For Vision-language Pre-trained Model Xing Yinghui, Wu Qirui, Cheng De, Zhang Shizhou, Liang Guoqiang, Wang Peng, Zhang Yanning
- Evaluating And Improving Context Attention Distribution On Multi-turn Response Generation Using Self-contained Distractions Xing Yujie, Gulla Jon Atle
- Adapting Pretrained Text-to-text Models For Long Text Sequences Xiong Wenhan, Gupta Anchit, Toshniwal Shubham, Mehdad Yashar, Yih Wen-tau
- Integrating Translation Memories Into Non-autoregressive Machine Translation Xu Jitao, Crego Josep, Yvon François
- Generative Negative Text Replay For Continual Vision-language Pretraining Yan Shipeng, Hong Lanqing, Xu Hang, Han Jianhua, Tuytelaars Tinne, Li Zhenguo, He Xuming
- Zeroquant: Efficient And Affordable Post-training Quantization For Large-scale Transformers Yao Zhewei, Aminabadi Reza Yazdani, Zhang Minjia, Wu Xiaoxia, Li Conglong, He Yuxiong
- Transcending Scaling Laws With 0.1% Extra Compute Yi Tay, Jason Wei, Hyung Won Chung, Vinh Q. Tran, David R. So, Siamak Shakeri, Xavier Garcia, Huaixiu Steven Zheng, Jinfeng Rao, Aakanksha Chowdhery, Denny Zhou, Donald Metzler, Slav Petrov, Neil Houlsby, Quoc V. Le, Mostafa Dehghani
- Efficient Language Modeling With Sparse All-mlp Yu Ping, Artetxe Mikel, Ott Myle, Shleifer Sam, Gong Hongyu, Stoyanov Ves, Li Xian
- Bilaterally Slimmable Transformer For Elastic And Efficient Visual Question Answering Yu Zhou, Jin Zitian, Yu Jun, Xu Mingliang, Wang Hongbo, Fan Jianping
- Better Pre-training By Reducing Representation Confusion Zhang Haojie, Liang Mingfei, Xie Ruobing, Sun Zhenlong, Zhang Bo, Lin Leyu
- Scala: Accelerating Adaptation Of Pre-trained Transformer-based Language Models Via Efficient Large-batch Adversarial Noise Zhang Minjia, Naresh Niranjan Uma, He Yuxiong
- Efficient Long Sequence Modeling Via State Space Augmented Transformer Zuo Simiao, Liu Xiaodong, Jiao Jian, Charles Denis, Manavoglu Eren, Zhao Tuo, Gao Jianfeng
- I Was Blind But Now I See: Implementing Vision-enabled Dialogue In Social Robots Abbo Giulio Antonio, Belpaeme Tony
- Leveraging Vision-language Models For Improving Domain Generalization In Image Classification Addepalli Sravanti, Asokan Ashish Ramayee, Sharma Lakshay, Babu R. Venkatesh
- SARATHI: Efficient LLM Inference By Piggybacking Decodes With Chunked Prefills Agrawal Amey, Panwar Ashish, Mohan Jayashree, Kwatra Nipun, Gulavani Bhargav S., Ramjee Ramachandran
- Multimodal Representations For Teacher-guided Compositional Visual Reasoning Aissa Wafa, Ferecatu Marin, Crucianu Michel
- The Emergence Of Essential Sparsity In Large Pre-trained Models: The Weights That Matter Ajay Jaiswal, Shiwei Liu, Tianlong Chen, Zhangyang Wang
- Rest Meets React: Self-improvement For Multi-step Reasoning LLM Agent Aksitov Renat, Miryoosefi Sobhan, Li Zonglin, Li Daliang, Babayan Sheila, Kopparapu Kavya, Fisher Zachary, Guo Ruiqi, Prakash Sushant, Srinivasan Pranesh, Zaheer Manzil, Yu Felix, Kumar Sanjiv
- Mamba: Linear-time Sequence Modeling With Selective State Spaces Albert Gu, Tri Dao
- Mistral 7B Albert Q. Jiang, Alexandre Sablayrolles, Arthur Mensch, Chris Bamford, Devendra Singh Chaplot, Diego De Las Casas, Florian Bressand, Gianna Lengyel, Guillaume Lample, Lucile Saulnier, Lélio Renard Lavaud, Marie-anne Lachaux, Pierre Stock, Teven Le Scao, Thibaut Lavril, Thomas Wang, Timothée Lacroix, William El Sayed
- Pixt3: Pixel-based Table-to-text Generation Alonso Iñigo, Agirre Eneko, Lapata Mirella
- Detecting And Preventing Hallucinations In Large Vision Language Models Anisha Gunjal, Jihan Yin, Erhan Bas
- Distilling Efficient Language-specific Models For Cross-lingual Transfer Ansell Alan, Ponti Edoardo Maria, Korhonen Anna, Vulić Ivan
- Scaling Laws For Language Encoding Models In Fmri Antonello Richard, Vaidya Aditya, Huth Alexander G.
- Resurrecting Recurrent Neural Networks For Long Sequences Antonio Orvieto, Samuel L Smith, Albert Gu, Anushan Fernando, Caglar Gulcehre, Razvan Pascanu, Soham De
- Slimfit: Memory-efficient Fine-tuning Of Transformer-based Models Using Training Dynamics Ardakani Arash, Haan Altan, Tan Shangyin, Popovici Doru Thom, Cheung Alvin, Iancu Costin, Sen Koushik
- Zoology: Measuring And Improving Recall In Efficient Language Models Arora Simran, Eyuboglu Sabri, Timalsina Aman, Johnson Isys, Poli Michael, Zou James, Rudra Atri, Ré Christopher
- Schema-driven Information Extraction From Heterogeneous Tables Bai Fan, Kang Junmo, Stanovsky Gabriel, Freitag Dayne, Dredze Mark, Ritter Alan
- A Bi-step Grounding Paradigm For Large Language Models In Recommendation Systems Bao Keqin, Zhang Jizhi, Wang Wenjie, Zhang Yang, Yang Zhengyi, Luo Yancheng, Chen Chong, Feng Fuli, Tian Qi
- Testing The Depth Of Chatgpt's Comprehension Via Cross-modal Tasks Based On Ascii-art: Gpt3.5's Abilities In Regard To Recognizing And Generating Ascii-art Are Not Totally Lacking Bayani David
- Span-selective Linear Attention Transformers For Effective And Robust Schema-guided Dialogue State Tracking Bebensee Björn, Lee Haejun
- RWKV: Reinventing Rnns For The Transformer Era Bo Peng, Eric Alcaide, Quentin Anthony, Alon Albalak, Samuel Arcadinho, Stella Biderman, Huanqi Cao, Xin Cheng, Michael Chung, Matteo Grella, Kranthi Kiran Gv, Xuzheng He, Haowen Hou, Jiaju Lin, Przemyslaw Kazienko, Jan Kocon, Jiaming Kong, Bartlomiej Koptyra, Hayden Lau, Krishna Sri Ipsit Mantri, Ferdinand Mom, Atsushi Saito, Guangyu Song, Xiangru Tang, Bolun Wang, Johan S. Wind, Stanislaw Wozniak, Ruichong Zhang, Zhenyuan Zhang, Qihang Zhao, Peng Zhou, Qinghua Zhou, Jian Zhu, Rui-jie Zhu
- The Ups And Downs Of Large Language Model Inference With Vocabulary Trimming By Language Heuristics Bogoychev Nikolay, Chen Pinzhen, Haddow Barry, Birch Alexandra
- Which Prompts Make The Difference? Data Prioritization For Efficient Human LLM Evaluation Boubdir Meriem, Kim Edward, Ermis Beyza, Fadaee Marzieh, Hooker Sara
- Plasma: Making Small Language Models Better Procedural Knowledge Models For (counterfactual) Planning Brahman Faeze, Bhagavatula Chandra, Pyatkin Valentina, Hwang Jena D., Li Xiang Lorraine, Arai Hirona J., Sanyal Soumya, Sakaguchi Keisuke, Ren Xiang, Choi Yejin
- Efficient Transformer Knowledge Distillation: A Performance Review Brown Nathan, Williamson Ashton, Anderson Tahj, Lawrence Logan
- Revealing The Structure Of Language Model Capabilities Burnell Ryan, Hao Han, Conway Andrew R. A., Orallo Jose Hernandez
- Large Language Models As Tool Makers Cai Tianle, Wang Xuezhi, Ma Tengyu, Chen Xinyun, Zhou Denny
- A Systematic Study Of Knowledge Distillation For Natural Language Generation With Pseudo-target Training Calderon Nitay, Mukherjee Subhabrata, Reichart Roi, Kantor Amir
- Oberta: Improving Sparse Transfer Learning Via Improved Initialization, Distillation, And Pruning Regimes Campos Daniel, Marques Alexandre, Kurtz Mark, Zhai Chengxiang
- Successor Features For Efficient Multisubject Controlled Text Generation Cao Meng, Fatemi Mehdi, Cheung Jackie Chi Kit, Shabanian Samira
- Pumer: Pruning And Merging Tokens For Efficient Vision Language Models Cao Qingqing, Paranjape Bhargavi, Hajishirzi Hannaneh
- Revolutionizing Mobile Interaction: Enabling A 3 Billion Parameter GPT LLM On Mobile Carreira Samuel, Marques Tomás, Ribeiro José, Grilo Carlos
- Promoting Generalized Cross-lingual Question Answering In Few-resource Scenarios Via Self-knowledge Distillation Carrino Casimiro Pio, Escolano Carlos, Fonollosa José A. R.
- Grounding Large Language Models In Interactive Environments With Online Reinforcement Learning Carta Thomas, Romac Clément, Wolf Thomas, Lamprier Sylvain, Sigaud Olivier, Oudeyer Pierre-yves
- Dialogue Chain-of-thought Distillation For Commonsense-aware Conversational Agents Chae Hyungjoo, Song Yongho, Ong Kai Tzu-iunn, Kwon Taeyoon, Kim Minjin, Yu Youngjae, Lee Dongha, Kang Dongyeop, Yeo Jinyoung
- INT2.1: Towards Fine-tunable Quantized Large Language Models With Error Correction Through Low-rank Adaptation Chai Yuji, Gkountouras John, Ko Glenn G., Brooks David, Wei Gu-yeon
- Learning To Generate Better Than Your LLM Chang Jonathan D., Brantley Kiante, Ramamurthy Rajkumar, Misra Dipendra, Sun Wen
- Chipgpt: How Far Are We From Natural Language Hardware Design Chang Kaiyan, Wang Ying, Ren Haimeng, Wang Mengdi, Liang Shengwen, Han Yinhe, Li Huawei, Li Xiaowei
- MME: A Comprehensive Evaluation Benchmark For Multimodal Large Language Models Chaoyou Fu, Peixian Chen, Yunhang Shen, Yulei Qin, Mengdan Zhang, Xu Lin, Jinrui Yang, Xiawu Zheng, Ke Li, Xing Sun, Yunsheng Wu, Rongrong Ji
- Fireact: Toward Language Agent Fine-tuning Chen Baian, Shu Chang, Shareghi Ehsan, Collier Nigel, Narasimhan Karthik, Yao Shunyu
- Dual-space Hierarchical Learning For Goal-guided Conversational Recommendation Chen Can, Liu Hao, Liu Zeming, Liu Xue, Dou Dejing
- Chain-of-thought Prompt Distillation For Multimodal Named Entity Recognition And Multimodal Relation Extraction Chen Feng, Feng Yujian
- Personalised Distillation: Empowering Open-sourced Llms With Adaptive Learning For Code Generation Chen Hailin, Saha Amrita, Hoi Steven, Joty Shafiq
- Maybe Only 0.5% Data Is Needed: A Preliminary Exploration Of Low Training Data Instruction Tuning Chen Hao, Zhang Yiming, Zhang Qi, Yang Hantao, Hu Xiaomeng, Ma Xuetao, Yanggong Yifan, Zhao Junbo
- MCC-KD: Multi-cot Consistent Knowledge Distillation Chen Hongzhan, Wu Siyue, Quan Xiaojun, Wang Rui, Yan Ming, Zhang Ji
- Extending Context Window Of Large Language Models Via Positional Interpolation Chen Shouyuan, Wong Sherman, Chen Liangjian, Tian Yuandong
- Lorashear: Efficient Large Language Model Structured Pruning And Knowledge Recovery Chen Tianyi, Ding Tianyu, Yadav Badal, Zharkov Ilya, Liang Luming
- Asking Before Acting: Gather Information In Embodied Decision Making With Language Models Chen Xiaoyu, Zhang Shenao, Zhang Pushi, Zhao Li, Chen Jianyu
- Teaching Large Language Models To Self-debug Chen Xinyun, Lin Maxwell, Schärli Nathanael, Zhou Denny
- Minprompt: Graph-based Minimal Prompt Data Augmentation For Few-shot Question Answering Chen Xiusi, Jiang Jyun-yu, Chang Wei-cheng, Hsieh Cho-jui, Yu Hsiang-fu, Wang Wei
- EE-LLM: Large-scale Training And Inference Of Early-exit Large Language Models With 3D Parallelism Chen Yanxi, Pan Xuchen, Li Yaliang, Ding Bolin, Zhou Jingren
- Confidant: Customizing Transformer-based Llms Via Collaborative Edge Training Chen Yuhao, Yan Yuxuan, Yang Qianqian, Shu Yuanchao, He Shibo, Chen Jiming
- Token Prediction As Implicit Classification To Identify Llm-generated Text Chen Yutian, Kang Hao, Zhai Vivian, Li Liangze, Singh Rita, Raj Bhiksha
- GENOME: Generative Neuro-symbolic Visual Reasoning By Growing And Reusing Modules Chen Zhenfang, Sun Rui, Liu Wenjun, Hong Yining, Gan Chuang
- Cascade Speculative Drafting For Even Faster LLM Inference Chen Ziyi, Yang Xiaocong, Lin Jiacheng, Sun Chenkai, Chang Kevin Chen-chuan, Huang Jie
- Black-box Prompt Optimization: Aligning Large Language Models Without Model Training Cheng Jiale, Liu Xiao, Zheng Kehan, Ke Pei, Wang Hongning, Dong Yuxiao, Tang Jie, Huang Minlie
- Adversarial Preference Optimization: Enhancing Your Alignment Via RM-LLM Game Cheng Pengyu, Yang Yifan, Li Jian, Dai Yong, Hu Tianhao, Cao Peixin, Du Nan, Li Xiaolong
- Language Grounded Qformer For Efficient Vision Language Understanding Choraria Moulik, Sekhar Nitesh, Wu Yue, Zhang Xu, Singhal Prateek, Varshney Lav R.
- Investigating Hallucinations In Pruned Large Language Models For Abstractive Summarization Chrysostomou George, Zhao Zhixue, Williams Miles, Aletras Nikolaos
- Galactic Chitchat: Using Large Language Models To Converse With Astronomy Literature Ciucă Ioana, Ting Yuan-sen
- Strong And Efficient Baselines For Open Domain Conversational Question Answering Coman Andrei C., Barlacchi Gianni, De Gispert Adrià
- Efficiently Adapting Pretrained Language Models To New Languages Csaki Zoltan, Pawakapan Pian, Thakker Urmish, Xu Qiantong
- Personalized Autonomous Driving With Large Language Models: Field Experiments Cui Can, Yang Zichong, Zhou Yupeng, Ma Yunsheng, Lu Juanwu, Li Lingxi, Chen Yaobin, Panchal Jitesh, Wang Ziran
- Machine Mindset: An MBTI Exploration Of Large Language Models Cui Jiaxi, Lv Liuzhenghao, Wen Jing, Wang Rongsheng, Tang Jing, Tian Yonghong, Yuan Li
- Efficient And Effective Text Encoding For Chinese Llama And Alpaca Cui Yiming, Yang Ziqing, Yao Xin
- Large Language Models For Compiler Optimization Cummins Chris, Seeker Volker, Grubisic Dejan, Elhoushi Mostafa, Liang Youwei, Roziere Baptiste, Gehring Jonas, Gloeckle Fabian, Hazelwood Kim, Synnaeve Gabriel, Leather Hugh
- Neural Retrievers Are Biased Towards Llm-generated Content Dai Sunhao, Zhou Yuqi, Pang Liang, Liu Weihao, Hu Xiaolin, Liu Yong, Zhang Xiao, Wang Gang, Xu Jun
- Flashattention-2: Faster Attention With Better Parallelism And Work Partitioning Dao Tri
- Optimizing Distributed Training On Frontier For Large Language Models Dash Sajal, Lyngaas Isaac, Yin Junqi, Wang Xiao, Egele Romain, Cong Guojing, Wang Feiyi, Balaprakash Prasanna
- Divergent Token Metrics: Measuring Degradation To Prune Away LLM Components -- And Optimize Quantization Deiseroth Björn, Meuer Max, Gritsch Nikolas, Eichenberg Constantin, Schramowski Patrick, Aßenmacher Matthias, Kersting Kristian
- Skipdecode: Autoregressive Skip Decoding With Batching And Caching For Efficient LLM Inference Del Corro Luciano, Del Giorno Allie, Agarwal Sahaj, Yu Bin, Awadallah Ahmed, Mukherjee Subhabrata
- Spqr: A Sparse-quantized Representation For Near-lossless LLM Weight Compression Dettmers Tim, Svirschevski Ruslan, Egiazarian Vage, Kuznedelev Denis, Frantar Elias, Ashkboos Saleh, Borzunov Alexander, Hoefler Torsten, Alistarh Dan
- Longnet: Scaling Transformers To 1,000,000,000 Tokens Ding Jiayu, Ma Shuming, Dong Li, Zhang Xingxing, Huang Shaohan, Wang Wenhui, Zheng Nanning, Wei Furu
- A Wolf In Sheep's Clothing: Generalized Nested Jailbreak Prompts Can Fool Large Language Models Easily Ding Peng, Kuang Jun, Ma Dan, Cao Xuezhi, Xian Yunsen, Chen Jiajun, Huang Shujian
- Everything Of Thoughts: Defying The Law Of Penrose Triangle For Thought Generation Ding Ruomeng, Zhang Chaoyun, Wang Lu, Xu Yong, Ma Minghua, Zhang Wei, Qin Si, Rajmohan Saravan, Lin Qingwei, Zhang Dongmei
- Prompt Optimization Via Adversarial In-context Learning Do Xuan Long, Zhao Yiran, Brown Hannah, Xie Yuxi, Zhao James Xu, Chen Nancy F., Kawaguchi Kenji, Shieh Michael, He Junxian
- Blockwise Compression Of Transformer-based Models Without Retraining Dong Gaochen, Chen Wei
- Sweeping Heterogeneity With Smart Mops: Mixture Of Prompts For LLM Task Adaptation Dun Chen, Garcia Mirian Hipolito, Zheng Guoqing, Awadallah Ahmed Hassan, Kyrillidis Anastasios, Sim Robert
- GPT-3.5, GPT-4, Or BARD? Evaluating Llms Reasoning Ability In Zero-shot Setting And Performance Boosting Through Prompts Espejel Jessica López, Ettifouri El Hassane, Alassan Mahaman Sanoussi Yahaya, Chouham El Mehdi, Dahhane Walid
- Chatgpt Outperforms Crowd-workers For Text-annotation Tasks Fabrizio Gilardi, Meysam Alizadeh, Maël Kubli
- Proximal Policy Optimization Actual Combat: Manipulating Output Tokenizer Length Fan Miao, Hu Chen, Zhou Shuchang
- Llama Rider: Spurring Large Language Models To Explore The Open World Feng Yicheng, Wang Yuxuan, Liu Jiazheng, Zheng Sipeng, Lu Zongqing
- MBR And QE Finetuning: Training-time Distillation Of The Best And Most Expensive Decoding Methods Finkelstein Mara, Naskar Subhajit, Mirzazadeh Mehdi, Shah Apurva, Freitag Markus
- Sparsegpt: Massive Language Models Can Be Accurately Pruned In One-shot Frantar Elias, Alistarh Dan
- Revisiting The Knowledge Injection Frameworks Fu Peng, Zhang Yiming, Wang Haobo, Qiu Weikang, Zhao Junbo
- Ziya2: Data-centric Learning Is All Llms Need Gan Ruyi, Wu Ziwei, Sun Renliang, Lu Junyu, Wu Xiaojun, Zhang Dixiang, Pan Kunhao, He Junqing, Tian Yuanhe, Yang Ping, Yang Qi, Wang Hao, Zhang Jiaxing, Song Yan
- Text-to-sql Empowered By Large Language Models: A Benchmark Evaluation Gao Dawei, Wang Haibin, Li Yaliang, Sun Xiuyu, Qian Yichen, Ding Bolin, Zhou Jingren
- Hiclip: Contrastive Language-image Pretraining With Hierarchy-aware Attention Geng Shijie, Yuan Jianbo, Tian Yu, Chen Yuxiao, Zhang Yongfeng
- Adapting Llms For Efficient, Personalized Information Retrieval: Methods And Implications Ghodratnama Samira, Zakershahrak Mehrdad
- Do Generative Large Language Models Need Billions Of Parameters? Gholami Sia, Omar Marwan
- Mtuncertainty: Assessing The Need For Post-editing Of Machine Translation Outputs By Fine-tuning Openai Llms Gladkoff Serge, Han Lifeng, Erofeev Gleb, Sorokina Irina, Nenadic Goran
- Headless Language Models: Learning Without Predicting With Contrastive Weight Tying Godey Nathan, De La Clergerie Éric, Sagot Benoît
- Generative AI Beyond Llms: System Implications Of Multi-modal Generation Golden Alicia, Hsia Samuel, Sun Fei, Acun Bilge, Hosmer Basil, Lee Yejin, Devito Zachary, Johnson Jeff, Wei Gu-yeon, Brooks David, Wu Carole-jean
- Neuron Patching: Semantic-based Neuron-level Language Model Repair For Code Generation Gu Jian, Aleti Aldeida, Chen Chunyang, Zhang Hongyu
- Leveraging Pre-trained Large Language Models To Construct And Utilize World Models For Model-based Task Planning Guan Lin, Valmeekam Karthik, Sreedharan Sarath, Kambhampati Subbarao
- Hiformer: Heterogeneous Feature Interactions Learning With Transformers For Recommender Systems Gui Huan, Wang Ruoxi, Yin Ke, Jin Long, Kula Maciej, Xu Taibai, Hong Lichan, Chi Ed H.
- SPT: Fine-tuning Transformer-based Language Models Efficiently With Sparsification Gui Yuntao, Yan Xiao, Yin Peiqi, Yang Han, Cheng James
- Longcoder: A Long-range Pre-trained Language Model For Code Completion Guo Daya, Xu Canwen, Duan Nan, Yin Jian, Mcauley Julian
- Connecting Large Language Models With Evolutionary Algorithms Yields Powerful Prompt Optimizers Guo Qingyan, Wang Rui, Guo Junliang, Li Bei, Song Kaitao, Tan Xu, Liu Guoqing, Bian Jiang, Yang Yujiu
- Compresso: Structured Pruning With Collaborative Prompting Learns Compact Large Language Models Guo Song, Xu Jiahang, Zhang Li Lyna, Yang Mao
- ELIP: Efficient Language-image Pre-training With Fewer Vision Tokens Guo Yangyang, Zhang Haoyu, Wong Yongkang, Nie Liqiang, Kankanhalli Mohan
- Improving Small Language Models On Pubmedqa Via Generative Data Augmentation Guo Zhen, Wang Peiqi, Wang Yanwei, Yu Shangdi
- Continual Pre-training Of Large Language Models: How To (re)warm Your Model? Gupta Kshitij, Thérien Benjamin, Ibrahim Adam, Richter Mats L., Anthony Quentin, Belilovsky Eugene, Rish Irina, Lesort Timothée
- A Real-world Webagent With Planning, Long Context Understanding, And Program Synthesis Gur Izzeddin, Furuta Hiroki, Huang Austin, Safdari Mustafa, Matsuo Yutaka, Eck Douglas, Faust Aleksandra
- Efficient Parallelization Layouts For Large-scale Distributed Model Training Hagemann Johannes, Weinbach Samuel, Dobler Konstantin, Schall Maximilian, De Melo Gerard
- Dialcot Meets PPO: Decomposing And Exploring Reasoning Paths In Smaller Language Models Han Chengcheng, Du Xiaowei, Zhang Che, Lian Yixin, Li Xiang, Gao Ming, Wang Baoyuan
- Lm-infinite: Zero-shot Extreme Length Generalization For Large Language Models Han Chi, Wang Qifan, Peng Hao, Xiong Wenhan, Chen Yu, Ji Heng, Wang Sinong
- Llms As Visual Explainers: Advancing Image Classification With Evolving Visual Descriptions Han Songhao, Zhuo Le, Liao Yue, Liu Si
- David Helps Goliath: Inference-time Collaboration Between Small Specialized And Large General Diffusion Lms Han Xiaochuang, Kumar Sachin, Tsvetkov Yulia, Ghazvininejad Marjan
- Blockwise Parallel Transformer For Large Context Models Hao Liu, Pieter Abbeel
- SPEED: Speculative Pipelined Execution For Efficient Decoding Hooper Coleman, Kim Sehoon, Mohammadzadeh Hiva, Genc Hasan, Keutzer Kurt, Gholami Amir, Shao Sophia
- Ralle: A Framework For Developing And Evaluating Retrieval-augmented Large Language Models Hoshi Yasuto, Miyashita Daisuke, Ng Youyang, Tatsuno Kento, Morioka Yasuhiro, Torii Osamu, Deguchi Jun
- Distilling Step-by-step! Outperforming Larger Language Models With Less Training Data And Smaller Model Sizes Hsieh Cheng-yu, Li Chun-liang, Yeh Chih-kuan, Nakhost Hootan, Fujii Yasuhisa, Ratner Alexander, Krishna Ranjay, Lee Chen-yu, Pfister Tomas
- Bad Actor, Good Advisor: Exploring The Role Of Large Language Models In Fake News Detection Hu Beizhe, Sheng Qiang, Cao Juan, Shi Yuhui, Li Yang, Wang Danding, Qi Peng
- Ladder-of-thought: Using Knowledge As Steps To Elevate Stance Detection Hu Kairui, Yan Ming, Zhou Joey Tianyi, Tsang Ivor W., Chong Wen Haw, Yap Yong Keong
- COMMA: Co-articulated Multi-modal Learning Hu Lianyu, Gao Liqing, Liu Zekang, Pun Chi-man, Feng Wei
- Visual Program Distillation: Distilling Tools And Programmatic Reasoning Into Vision-language Models Hu Yushi, Stretcu Otilia, Lu Chun-ta, Viswanathan Krishnamurthy, Hata Kenji, Luo Enming, Krishna Ranjay, Fuxman Ariel
- Token-level Adversarial Prompt Detection Based On Perplexity Measures And Contextual Information Hu Zhengmian, Wu Gang, Mitra Saayan, Zhang Ruiyi, Sun Tong, Huang Heng, Swaminathan Viswanathan
- Unlocking The Potential Of User Feedback: Leveraging Large Language Model As User Simulator To Enhance Dialogue System Hu Zhiyuan, Feng Yue, Luu Anh Tuan, Hooi Bryan, Lipani Aldo
- Lorahub: Efficient Cross-task Generalization Via Dynamic Lora Composition Huang Chengsong, Liu Qian, Lin Bill Yuchen, Pang Tianyu, Du Chao, Lin Min
- Agentcoder: Multi-agent-based Code Generation With Iterative Testing And Optimisation Huang Dong, Zhang Jie M., Luck Michael, Bu Qingwen, Qing Yuhao, Cui Heming
- Adapting Pre-trained Language Models To Vision-language Tasks Via Dynamic Visual Prompting Huang Shubin, Wu Qiong, Zhou Yiyi, Chen Weijie, Zhang Rongsheng, Sun Xiaoshuai, Ji Rongrong
- Fewer Is More: Boosting LLM Reasoning With Reinforced Context Pruning Huang Xijie, Zhang Li Lyna, Cheng Kwang-ting, Yang Fan, Yang Mao
- Advancing Transformer Architecture In Long-context Large Language Models: A Comprehensive Survey Huang Yunpeng, Xu Jingwei, Lai Junyu, Jiang Zixu, Chen Taolue, Li Zenan, Yao Yuan, Ma Xiaoxing, Yang Lijuan, Chen Hao, Li Shupeng, Zhao Penghao
- Evaluating And Enhancing Large Language Models For Conversational Reasoning On Knowledge Graphs Huang Yuxuan, Shi Lida, Liu Anqi, Xu Hao
- Soft Prompt Decoding For Multilingual Dense Retrieval Huang Zhiqi, Zeng Hansi, Zamani Hamed, Allan James
- TADA: Efficient Task-agnostic Domain Adaptation For Transformers Hung Chia-chien, Lange Lukas, Strötgen Jannik
- Camels In A Changing Climate: Enhancing LM Adaptation With Tulu 2 Ivison Hamish, Wang Yizhong, Pyatkin Valentina, Lambert Nathan, Peters Matthew, Dasigi Pradeep, Jang Joel, Wadden David, Smith Noah A., Beltagy Iz, Hajishirzi Hannaneh
- Quip: 2-bit Quantization Of Large Language Models With Guarantees Jerry Chee, Yaohui Cai, Volodymyr Kuleshov, Christopher De Sa
- Just CHOP: Embarrassingly Simple LLM Compression Jha Ananya Harsh, Sherborne Tom, Walsh Evan Pete, Groeneveld Dirk, Strubell Emma, Beltagy Iz
- Expedited Training Of Visual Conditioned Language Generation Via Redundancy Reduction Jian Yiren, Liu Tingkai, Tao Yunzhe, Zhang Chunhui, Vosoughi Soroush, Yang Hongxia
- Bus: Efficient And Effective Vision-language Pre-training With Bottom-up Patch Summarization Jiang Chaoya, Xu Haiyang, Ye Wei, Ye Qinghao, Li Chenliang, Yan Ming, Bi Bin, Zhang Shikun, Huang Fei, Huang Songfang
- Hykge: A Hypothesis Knowledge Graph Enhanced Framework For Accurate And Reliable Medical Llms Responses Jiang Xinke, Zhang Ruizhe, Xu Yongxin, Qiu Rihong, Fang Yue, Wang Zhiyuan, Tang Jinyi, Ding Hongxin, Chu Xu, Zhao Junfeng, Wang Yasha
- Lion: Adversarial Distillation Of Proprietary Large Language Models Jiang Yuxin, Chan Chunkit, Chen Mingyang, Wang Wei
- How Can Recommender Systems Benefit From Large Language Models: A Survey Jianghao Lin, Xinyi Dai, Yunjia Xi, Weiwen Liu, Bo Chen, Hao Zhang, Yong Liu, Chuhan Wu, Xiangyang Li, Chenxu Zhu, Huifeng Guo, Yong Yu, Ruiming Tang, Weinan Zhang
- Growlength: Accelerating Llms Pretraining By Progressively Growing Training Length Jin Hongye, Han Xiaotian, Yang Jingfeng, Jiang Zhimeng, Chang Chia-yuan, Hu Xia
- The Cost Of Down-scaling Language Models: Fact Recall Deteriorates Before In-context Learning Jin Tian, Clement Nolan, Dong Xin, Nagarajan Vaishnavh, Carbin Michael, Ragan-kelley Jonathan, Dziugaite Gintare Karolina
- Binary Code Summarization: Benchmarking Chatgpt/gpt-4 And Other Large Language Models Jin Xin, Larson Jonathan, Yang Weiwei, Lin Zhiqiang
- Teaching Language Models To Hallucinate Less With Synthetic Tasks Jones Erik, Palangi Hamid, Simões Clarisse, Chandrasekaran Varun, Mukherjee Subhabrata, Mitra Arindam, Awadallah Ahmed, Kamar Ece
- Small Language Models Fine-tuned To Coordinate Larger Language Models Improve Complex Reasoning Juneja Gurusha, Dutta Subhabrata, Chakrabarti Soumen, Manchanda Sunny, Chakraborty Tanmoy
- Impossible Distillation: From Low-quality Model To High-quality Dataset & Model For Summarization And Paraphrasing Jung Jaehun, West Peter, Jiang Liwei, Brahman Faeze, Lu Ximing, Fisher Jillian, Sorensen Taylor, Choi Yejin
- A Case Study On Test Case Construction With Large Language Models: Unveiling Practical Insights And Challenges Junior Roberto Francisco De Lima, Presta Luiz Fernando Paes De Barros, Borborema Lucca Santos, Da Silva Vanderson Nogueira, Dahia Marcio Leal De Melo, Santos Anderson Carlos Sousa E
- Knowledge-augmented Reasoning Distillation For Small Language Models In Knowledge-intensive Tasks Kang Minki, Lee Seanie, Baek Jinheon, Kawaguchi Kenji, Hwang Sung Ju
- Sorted Llama: Unlocking The Potential Of Intermediate Layers Of Large Language Models For Dynamic Inference Kavehzadeh Parsa, Valipour Mojtaba, Tahaei Marzieh, Ghodsi Ali, Chen Boxing, Rezagholizadeh Mehdi
- Unnatural Language Processing: How Do Language Models Handle Machine-generated Prompts? Kervadec Corentin, Franzon Francesca, Baroni Marco
- Expediting Contrastive Language-image Pretraining Via Self-distilled Encoders Kim Bumsoo, Kim Jinhyung, Jo Yeonsik, Kim Seung Hwan
- Language Models Can Solve Computer Tasks Kim Geunwoo, Baldi Pierre, Mcaleer Stephen
- Memory-efficient Fine-tuning Of Compressed Large Language Models Via Sub-4-bit Integer Quantization Kim Jeonghoon, Lee Jung Hyun, Kim Sungdong, Park Joonsuk, Yoo Kang Min, Kwon Se Jung, Lee Dongsoo
- Squeezellm: Dense-and-sparse Quantization Kim Sehoon, Hooper Coleman, Gholami Amir, Dong Zhen, Li Xiuyu, Shen Sheng, Mahoney Michael W., Keutzer Kurt
- Speculative Decoding With Big Little Decoder Kim Sehoon, Mangalam Karttikeya, Moon Suhong, Malik Jitendra, Mahoney Michael W., Gholami Amir, Keutzer Kurt
- Aligning Large Language Models Through Synthetic Feedback Kim Sungdong, Bae Sanghwan, Shin Jamin, Kang Soyoung, Kwak Donghyun, Yoo Kang Min, Seo Minjoon
- Finequant: Unlocking Efficiency With Fine-grained Weight-only Quantization For Llms Kim Young Jin, Henry Rawn, Fahim Raffy, Awadalla Hany Hassan
- Revisiting Intermediate Layer Distillation For Compressing Language Models: An Overfitting Perspective Ko Jongwoo, Park Seungjoon, Jeong Minchan, Hong Sukjin, Ahn Euijai, Chang Du-seong, Yun Se-young
- L-TUNING: Synchronized Label Tuning For Prompt And Prefix In Llms Kowsher Md., Sobuj Md. Shohanur Islam, Mahmud Asif, Prottasha Nusrat Jahan, Bhat Prakash
- Prompt Generate Train (PGT): Few-shot Domain Adaption Of Retrieval Augmented Generation Models For Open Book Question-answering Krishna C. S.
- Sparse Fine-tuning For Inference Acceleration Of Large Language Models Kurtic Eldar, Kuznedelev Denis, Frantar Elias, Goin Michael, Alistarh Dan
- Enhancing Transformers Without Self-supervised Learning: A Loss Landscape Perspective In Sequential Recommendation Lai Vivian, Chen Huiyuan, Yeh Chin-chia Michael, Xu Minghua, Cai Yiwei, Yang Hao
- Veclip: Improving CLIP Training Via Visual-enriched Captions Lai Zhengfeng, Zhang Haotian, Zhang Bowen, Wu Wentao, Bai Haoping, Timofeev Aleksei, Du Xianzhi, Gan Zhe, Shan Jiulong, Chuah Chen-nee, Yang Yinfei, Cao Meng
- Cabrita: Closing The Gap For Foreign Languages Larcher Celio, Piau Marcos, Finardi Paulo, Gengo Pedro, Esposito Piero, Caridá Vinicius
- An Experimental Study On Pretraining Transformers From Scratch For IR Lassance Carlos, Déjean Hervé, Clinchant Stéphane
- Orchestrallm: Efficient Orchestration Of Language Models For Dialogue State Tracking Lee Chia-hsuan, Cheng Hao, Ostendorf Mari
- Read-only Prompt Optimization For Vision-language Few-shot Learning Lee Dongjun, Song Seokwon, Suh Jihee, Choi Joonmyung, Lee Sanghyeok, Kim Hyunwoo J.
- Is Attention Required For ICL? Exploring The Relationship Between Model Architecture And In-context Learning Ability Lee Ivan, Jiang Nan, Berg-kirkpatrick Taylor
- Do We Still Need Clinical Language Models? Lehman Eric, Hernandez Evan, Mahajan Diwakar, Wulff Jonas, Smith Micah J., Ziegler Zachary, Nadler Daniel, Szolovits Peter, Johnson Alistair, Alsentzer Emily
- Mixed Distillation Helps Smaller Language Model Better Reasoning Li Chenglin, Chen Qianglong, Li Liangyue, Wang Caiyu, Li Yicheng, Chen Zulong, Zhang Yin
- Dialogue For Prompting: A Policy-gradient-based Discrete Prompt Generation For Few-shot Learning Li Chengzhengxu, Liu Xiaoming, Wang Yichen, Li Duyi, Lan Yu, Shen Chao
- Adaptive Gating In Mixture-of-experts Based Language Models Li Jiamin, Su Qiang, Yang Yitao, Jiang Yimin, Wang Cong, Xu Hong
- Instructcoder: Instruction Tuning Large Language Models For Code Editing Li Kaixin, Hu Qisheng, Zhao Xu, Chen Hui, Xie Yuxi, Liu Tiedong, Xie Qizhe, He Junxian
- Watermarking Llms With Weight Quantization Li Linyang, Jiang Botian, Wang Pengyu, Ren Ke, Yan Hang, Qiu Xipeng
- TRAQ: Trustworthy Retrieval Augmented Question Answering Via Conformal Prediction Li Shuo, Park Sangdon, Lee Insup, Bastani Osbert
- Unigen: A Unified Generative Framework For Retrieval And Question Answering With Large Language Models Li Xiaoxi, Zhou Yujia, Dou Zhicheng
- Unlock The Power: Competitive Distillation For Multi-modal Large Language Models Li Xinwei, Lin Li, Wang Shuai, Qian Chen
- A Practical Survey On Zero-shot Prompt Design For In-context Learning Li Yinheng
- Turning Dust Into Gold: Distilling Complex Reasoning Capabilities From Llms By Leveraging Negative Data Li Yiwei, Yuan Peiwen, Feng Shaoxiong, Pan Boyuan, Sun Bin, Wang Xinglin, Wang Heda, Li Kan
- Loftq: Lora-fine-tuning-aware Quantization For Large Language Models Li Yixiao, Yu Yifan, Liang Chen, He Pengcheng, Karampatziakis Nikos, Chen Weizhu, Zhao Tuo
- Losparse: Structured Compression Of Large Language Models Based On Low-rank And Sparse Approximation Li Yixiao, Yu Yifan, Zhang Qingru, Liang Chen, He Pengcheng, Chen Weizhu, Zhao Tuo
- Compressing Context To Enhance Inference Efficiency Of Large Language Models Li Yucheng, Dong Bo, Lin Chenghua, Guerin Frank
- E-sparse: Boosting The Large Language Model Inference Through Entropy-based N:M Sparsity Li Yun, Niu Lin, Zhang Xipeng, Liu Kai, Zhu Jianchen, Kang Zhanhui
- QFT: Quantized Full-parameter Tuning Of Llms With Affordable Resources Li Zhikai, Liu Xiaoxuan, Zhu Banghua, Dong Zhen, Gu Qingyi, Keutzer Kurt
- Split And Merge: Aligning Position Biases In Large Language Model Based Evaluators Li Zongjie, Wang Chaozheng, Ma Pingchuan, Wu Daoyuan, Wang Shuai, Gao Cuiyun, Liu Yang
- Llm-grounded Diffusion: Enhancing Prompt Understanding Of Text-to-image Diffusion Models With Large Language Models Lian Long, Li Boyi, Yala Adam, Darrell Trevor
- Homodistil: Homotopic Task-agnostic Distillation Of Pre-trained Transformers Liang Chen, Jiang Haoming, Li Zheng, Tang Xianfeng, Yin Bin, Zhao Tuo
- GPT-4 Enhanced Multimodal Grounding For Autonomous Driving: Leveraging Cross-modal Attention With Large Language Models Liao Haicheng, Shen Huanming, Li Zhenning, Wang Chengyue, Li Guofa, Bie Yiming, Xu Chengzhong
- AWQ: Activation-aware Weight Quantization For LLM Compression And Acceleration Lin Ji, Tang Jiaming, Tang Haotian, Yang Shang, Chen Wei-ming, Wang Wei-chen, Xiao Guangxuan, Dang Xingyu, Gan Chuang, Han Song
- Batchprompt: Accomplish More With Less Lin Jianzhe, Diesendruck Maurice, Du Liang, Abraham Robin
- Just Ask One More Time! Self-agreement Improves Reasoning Of Language Models In (almost) All Scenarios Lin Lei, Fu Jiayi, Liu Pengli, Li Qingyang, Gong Yan, Wan Junchen, Zhang Fuzheng, Wang Zhongyuan, Zhang Di, Gai Kun
- Pushing Large Language Models To The 6G Edge: Vision, Challenges, And Opportunities Lin Zheng, Qu Guanqiao, Chen Qiyuan, Chen Xianhao, Chen Zhe, Huang Kaibin
- Alltogether: Investigating The Efficacy Of Spliced Prompt For Web Navigation Using Large Language Models Liu Jiarun, Hu Wentao, Zhang Chunhong
- Retrieval-based Knowledge Transfer: An Effective Approach For Extreme Large Language Model Compression Liu Jiduan, Liu Jiahao, Wang Qifan, Wang Jingang, Cai Xunliang, Zhao Dongyan, Wang Ran Lucien, Yan Rui
- Mind's Mirror: Distilling Self-evaluation Capability And Comprehensive Thinking From Large Language Models Liu Weize, Li Guocong, Zhang Kai, Du Bang, Chen Qiyuan, Hu Xuming, Xu Hongxia, Chen Jintai, Wu Jian
- Scaling Laws Of Rope-based Extrapolation Liu Xiaoran, Yan Hang, Zhang Shuo, An Chenxin, Qiu Xipeng, Lin Dahua
- Online Speculative Decoding Liu Xiaoxuan, Hu Lanxiang, Bailis Peter, Cheung Alvin, Deng Zhijie, Stoica Ion, Zhang Hao
- BOLT: Fast Energy-based Controlled Text Generation With Tunable Biases Liu Xin, Khalifa Muhammad, Wang Lu
- RECALL: A Benchmark For Llms Robustness Against External Counterfactual Knowledge Liu Yi, Huang Lianzhe, Li Shicheng, Chen Sishuo, Zhou Hao, Meng Fandong, Zhou Jie, Sun Xu
- Binary And Ternary Natural Language Generation Liu Zechun, Oguz Barlas, Pappu Aasish, Shi Yangyang, Krishnamoorthi Raghuraman
- Picking The Underused Heads: A Network Pruning Perspective Of Attention Head Selection For Fusing Dialogue Coreference Information Liu Zhengyuan, Chen Nancy F.
- Reason For Future, Act For Now: A Principled Framework For Autonomous LLM Agents With Provable Sample Efficiency Liu Zhihan, Hu Hao, Zhang Shenao, Guo Hongyi, Ke Shuqi, Liu Boyi, Wang Zhaoran
- Ziya-visual: Bilingual Large Vision-language Model Via Multi-task Instruction Tuning Lu Junyu, Zhang Dixiang, Wu Xiaojun, Gao Xinyu, Gan Ruyi, Zhang Jiaxing, Song Yan, Zhang Pingjian
- Routing To The Expert: Efficient Reward-guided Ensemble Of Large Language Models Lu Keming, Yuan Hongyi, Lin Runji, Lin Junyang, Yuan Zheng, Zhou Chang, Zhou Jingren
- Large Language Models Can Be Guided To Evade Ai-generated Text Detection Lu Ning, Liu Shengcai, He Rui, Wang Qi, Ong Yew-soon, Tang Ke
- Cheap And Quick: Efficient Vision-language Instruction Tuning For Large Language Models Luo Gen, Zhou Yiyi, Ren Tianhe, Chen Shengxin, Sun Xiaoshuai, Ji Rongrong
- Wizardmath: Empowering Mathematical Reasoning For Large Language Models Via Reinforced Evol-instruct Luo Haipeng, Sun Qingfeng, Xu Can, Zhao Pu, Lou Jianguang, Tao Chongyang, Geng Xiubo, Lin Qingwei, Chen Shifeng, Zhang Dongmei
- Prosg: Using Prompt Synthetic Gradients To Alleviate Prompt Forgetting Of Rnn-like Language Models Luo Haotian, Wu Kunming, Dai Cheng, Ding Sixian, Chen Xinhao
- Exploring Small Language Models With Prompt-learning Paradigm For Efficient Domain-specific Text Classification Luo Hengyu, Liu Peng, Esping Stefan
- Towards Logiglue: A Brief Survey And A Benchmark For Analyzing Logical Reasoning Capabilities Of Language Models Luo Man, Kumbhar Shrinidhi, Shen Ming, Parmar Mihir, Varshney Neeraj, Banerjee Pratyay, Aditya Somak, Baral Chitta
- Unlocking The Potential Of Large Language Models For Explainable Recommendations Luo Yucong, Cheng Mingyue, Zhang Hao, Lu Junyu, Liu Qi, Chen Enhong
- Are We Falling In A Middle-intelligence Trap? An Analysis And Mitigation Of The Reversal Curse Lv Ang, Zhang Kaiyi, Xie Shufang, Tu Quan, Chen Yuhan, Wen Ji-rong, Yan Rui
- Full Parameter Fine-tuning For Large Language Models With Limited Resources Lv Kai, Yang Yuqing, Liu Tengxiao, Gao Qinghui, Guo Qipeng, Qiu Xipeng
- Collie: Collaborative Training Of Large Language Models In An Efficient Way Lv Kai, Zhang Shuo, Gu Tianle, Xing Shuhao, Hong Jiawei, Chen Keyu, Liu Xiaoran, Yang Yuqing, Guo Honglin, Liu Tengxiao, Sun Yu, Guo Qipeng, Yan Hang, Qiu Xipeng
- POS: A Prompts Optimization Suite For Augmenting Text-to-video Generation Ma Shijie, Xu Huayi, Li Mengjian, Geng Weidong, Wang Yaxiong, Wang Meng
- Sci-cot: Leveraging Large Language Models For Enhanced Knowledge Distillation In Small Models For Scientific QA Ma Yuhan, Jiang Haiqi, Fan Chenyou
- Large Language Models For Telecom: Forthcoming Impact On The Industry Maatouk Ali, Piovesan Nicola, Ayed Fadhel, De Domenico Antonio, Debbah Merouane
- LEALLA: Learning Lightweight Language-agnostic Sentence Embeddings With Knowledge Distillation Mao Zhuoyuan, Nakagawa Tetsuji
- When Less Is More: Investigating Data Pruning For Pretraining Llms At Scale Marion Max, Üstün Ahmet, Pozzobon Luiza, Wang Alex, Fadaee Marzieh, Hooker Sara
- The Hydra Effect: Emergent Self-repair In Language Model Computations Mcgrath Thomas, Rahtz Matthew, Kramar Janos, Mikulik Vladimir, Legg Shane
- Lightlm: A Lightweight Deep And Narrow Language Model For Generative Recommendation Mei Kai, Zhang Yongfeng
- Towards Efficient Generative Large Language Model Serving: A Survey From Algorithms To Systems Miao Xupeng, Oliaro Gabriele, Zhang Zhihao, Cheng Xinhao, Jin Hongyi, Chen Tianqi, Jia Zhihao
- Think Twice: Measuring The Efficiency Of Eliminating Prediction Shortcuts Of Question Answering Models Mikula Lukáš, Štefánik Michal, Petrovič Marek, Sojka Petr
- LAIT: Efficient Multi-segment Encoding In Transformers With Layer-adjustable Interaction Milbauer Jeremiah, Louis Annie, Hosseini Mohammad Javad, Fabrikant Alex, Metzler Donald, Schuster Tal
- PRE: Vision-language Prompt Learning With Reparameterization Encoder Minh Anh Pham Thi, Nguyen An Duc, Tzimiropoulos Georgios
- Relu Strikes Back: Exploiting Activation Sparsity In Large Language Models Mirzadeh Iman, Alizadeh Keivan, Mehta Sachin, Del Mundo Carlo C, Tuzel Oncel, Samei Golnoosh, Rastegari Mohammad, Farajtabar Mehrdad
- Learning To Compress Prompts With Gist Tokens Mu Jesse, Li Xiang Lisa, Goodman Noah
- Can Llms Follow Simple Rules? Mu Norman, Chen Sarah, Wang Zifan, Chen Sizhe, Karamardian David, Aljeraisy Lulwa, Alomair Basel, Hendrycks Dan, Wagner David
- Superhf: Supervised Iterative Learning From Human Feedback Mukobi Gabriel, Chatain Peter, Fong Su, Windesheim Robert, Kutyniok Gitta, Bhatia Kush, Alberti Silas
- Testing Llms On Code Generation With Varying Levels Of Prompt Specificity Murr Lincoln, Grainger Morgan, Gao David
- PEARL: Personalizing Large Language Model Writing Assistants With Generation-calibrated Retrievers Mysore Sheshera, Lu Zhuoran, Wan Mengting, Yang Longqi, Menezes Steve, Baghaee Tina, Gonzalez Emmanuel Barajas, Neville Jennifer, Safavi Tara
- Cheaply Evaluating Inference Efficiency Metrics For Autoregressive Transformer Apis Narayanan Deepak, Santhanam Keshav, Henderson Peter, Bommasani Rishi, Lee Tony, Liang Percy
- A Comprehensive Overview Of Large Language Models Naveed Humza, Khan Asad Ullah, Qiu Shi, Saqib Muhammad, Anwar Saeed, Usman Muhammad, Akhtar Naveed, Barnes Nick, Mian Ajmal
- Nanot5: A Pytorch Framework For Pre-training And Fine-tuning T5-style Models With Limited Resources Nawrot Piotr
- Large Language Models As Tax Attorneys: A Case Study In Legal Capabilities Emergence Nay John J., Karamardian David, Lawsky Sarah B., Tao Wenting, Bhat Meghana, Jain Raghav, Lee Aaron Travis, Choi Jonathan H., Kasai Jungo
- Large Language Models In Healthcare And Medical Domain: A Review Nazi Zabir Al, Peng Wei
- Meet In The Middle: A New Pre-training Paradigm Nguyen Anh, Karampatziakis Nikos, Chen Weizhu
- LEVER: Learning To Verify Language-to-code Generation With Execution Ni Ansong, Iyer Srini, Radev Dragomir, Stoyanov Ves, Yih Wen-tau, Wang Sida I., Lin Xi Victoria
- An Experimental Study: Assessing The Combined Framework Of Wavlm And BEST-RQ For Text-to-speech Synthesis Nielson Via, Hillis Steven
- Codegen2: Lessons For Training Llms On Programming And Natural Languages Nijkamp Erik, Hayashi Hiroaki, Xiong Caiming, Savarese Silvio, Zhou Yingbo
- Future-proofing Education: A Prototype For Simulating Oral Examinations Using Large Language Models Nitze André
- Askit: Unified Programming Interface For Programming With Large Language Models Okuda Katsumi, Amarasinghe Saman
- GPT-4 Technical Report Openai, Josh Achiam, Steven Adler, Sandhini Agarwal, Lama Ahmad, Ilge Akkaya, Florencia Leoni Aleman, Diogo Almeida, Janko Altenschmidt, Sam Altman, Shyamal Anadkat, Red Avila, Igor Babuschkin, Suchir Balaji, Valerie Balcom, Paul Baltescu, Haiming Bao, Mohammad Bavarian, Jeff Belgum, Irwan Bello, Jake Berdine, Gabriel Bernadett-shapiro, Christopher Berner, Lenny Bogdonoff, Oleg Boiko, Madelaine Boyd, Anna-luisa Brakman, Greg Brockman, Tim Brooks, Miles Brundage, Kevin Button, Trevor Cai, Rosie Campbell, Andrew Cann, Brittany Carey, Chelsea Carlson, Rory Carmichael, Brooke Chan, Che Chang, Fotis Chantzis, Derek Chen, Sully Chen, Ruby Chen, Jason Chen, Mark Chen, Ben Chess, Chester Cho, Casey Chu, Hyung Won Chung, Dave Cummings, Jeremiah Currier, Yunxing Dai, Cory Decareaux, Thomas Degry, Noah Deutsch, Damien Deville, Arka Dhar, David Dohan, Steve Dowling, Sheila Dunning, Adrien Ecoffet, Atty Eleti, Tyna Eloundou, David Farhi, Liam Fedus, Niko Felix, Simón Posada Fishman, Juston Forte, Isabella Fulford, Leo Gao, Elie Georges, Christian Gibson, Vik Goel, Tarun Gogineni, Gabriel Goh, Rapha Gontijo-lopes, Jonathan Gordon, Morgan Grafstein, Scott Gray, Ryan Greene, Joshua Gross, Shixiang Shane Gu, Yufei Guo, Chris Hallacy, Jesse Han, Jeff Harris, Yuchen He, Mike Heaton, Johannes Heidecke, Chris Hesse, Alan Hickey, Wade Hickey, Peter Hoeschele, Brandon Houghton, Kenny Hsu, Shengli Hu, Xin Hu, Joost Huizinga, Shantanu Jain, Shawn Jain, Joanne Jang, Angela Jiang, Roger Jiang, Haozhun Jin, Denny Jin, Shino Jomoto, Billie Jonn, Heewoo Jun, Tomer Kaftan, Łukasz Kaiser, Ali Kamali, Ingmar Kanitscheider, Nitish Shirish Keskar, Tabarak Khan, Logan Kilpatrick, Jong Wook Kim, Christina Kim, Yongjik Kim, Jan Hendrik Kirchner, Jamie Kiros, Matt Knight, Daniel Kokotajlo, Łukasz Kondraciuk, Andrew Kondrich, Aris Konstantinidis, Kyle Kosic, Gretchen Krueger, Vishal Kuo, Michael Lampe, Ikai Lan, Teddy Lee, Jan Leike, Jade Leung, Daniel Levy, Chak Ming Li, Rachel Lim, Molly Lin, Stephanie Lin, Mateusz Litwin, Theresa Lopez, Ryan Lowe, Patricia Lue, Anna Makanju, Kim Malfacini, Sam Manning, Todor Markov, Yaniv Markovski, Bianca Martin, Katie Mayer, Andrew Mayne, Bob Mcgrew, Scott Mayer Mckinney, Christine Mcleavey, Paul Mcmillan, Jake Mcneil, David Medina, Aalok Mehta, Jacob Menick, Luke Metz, Andrey Mishchenko, Pamela Mishkin, Vinnie Monaco, Evan Morikawa, Daniel Mossing, Tong Mu, Mira Murati, Oleg Murk, David Mély, Ashvin Nair, Reiichiro Nakano, Rajeev Nayak, Arvind Neelakantan, Richard Ngo, Hyeonwoo Noh, Long Ouyang, Cullen O'keefe, Jakub Pachocki, Alex Paino, Joe Palermo, Ashley Pantuliano, Giambattista Parascandolo, Joel Parish, Emy Parparita, Alex Passos, Mikhail Pavlov, Andrew Peng, Adam Perelman, Filipe De Avila Belbute Peres, Michael Petrov, Henrique Ponde De Oliveira Pinto, Michael Pokorny, Michelle Pokrass, Vitchyr H. Pong, Tolly Powell, Alethea Power, Boris Power, Elizabeth Proehl, Raul Puri, Alec Radford, Jack Rae, Aditya Ramesh, Cameron Raymond, Francis Real, Kendra Rimbach, Carl Ross, Bob Rotsted, Henri Roussez, Nick Ryder, Mario Saltarelli, Ted Sanders, Shibani Santurkar, Girish Sastry, Heather Schmidt, David Schnurr, John Schulman, Daniel Selsam, Kyla Sheppard, Toki Sherbakov, Jessica Shieh, Sarah Shoker, Pranav Shyam, Szymon Sidor, Eric Sigler, Maddie Simens, Jordan Sitkin, Katarina Slama, Ian Sohl, Benjamin Sokolowsky, Yang Song, Natalie Staudacher, Felipe Petroski Such, Natalie Summers, Ilya Sutskever, Jie Tang, Nikolas Tezak, Madeleine B. Thompson, Phil Tillet, Amin Tootoonchian, Elizabeth Tseng, Preston Tuggle, Nick Turley, Jerry Tworek, Juan Felipe Cerón Uribe, Andrea Vallone, Arun Vijayvergiya, Chelsea Voss, Carroll Wainwright, Justin Jay Wang, Alvin Wang, Ben Wang, Jonathan Ward, Jason Wei, Cj Weinmann, Akila Welihinda, Peter Welinder, Jiayi Weng, Lilian Weng, Matt Wiethoff, Dave Willner, Clemens Winter, Samuel Wolrich, Hannah Wong, Lauren Workman, Sherwin Wu, Jeff Wu, Michael Wu, Kai Xiao, Tao Xu, Sarah Yoo, Kevin Yu, Qiming Yuan, Wojciech Zaremba, Rowan Zellers, Chong Zhang, Marvin Zhang, Shengjia Zhao, Tianhao Zheng, Juntang Zhuang, William Zhuk, Barret Zoph
- Finding And Editing Multi-modal Neurons In Pre-trained Transformers Pan Haowen, Cao Yixin, Wang Xiaozhi, Yang Xun, Wang Meng
- Stelocoder: A Decoder-only LLM For Multi-language To Python Code Translation Pan Jialing, Sadé Adrien, Kim Jin, Soriano Eric, Sole Guillem, Flamant Sylvain
- Smoothquant+: Accurate And Efficient 4-bit Post-training Weightquantization For LLM Pan Jiayi, Wang Chengcan, Zheng Kaifu, Li Yangguang, Wang Zhenyu, Feng Bin
- Tokenize Anything Via Prompting Pan Ting, Tang Lulu, Wang Xinlong, Shan Shiguang
- Kosmos-g: Generating Images In Context With Multimodal Large Language Models Pan Xichen, Dong Li, Huang Shaohan, Peng Zhiliang, Chen Wenhu, Wei Furu
- Semantic Composition In Visually Grounded Language Models Pandey Rohan
- Peft-medaware: Large Language Model For Medical Awareness Pandya Keivalya
- Trainable Transformer In Transformer Panigrahi Abhishek, Malladi Sadhika, Xia Mengzhou, Arora Sanjeev
- Localized Symbolic Knowledge Distillation For Visual Commonsense Models Park Jae Sung, Hessel Jack, Chandu Khyathi Raghavi, Liang Paul Pu, Lu Ximing, West Peter, Yu Youngjae, Huang Qiuyuan, Gao Jianfeng, Farhadi Ali, Choi Yejin
- Towards Efficient And Effective Adaptation Of Large Language Models For Sequential Recommendation Peng Bo, Burns Ben, Chen Ziqi, Parthasarathy Srinivasan, Ning Xia
- Pre-train, Prompt And Recommendation: A Comprehensive Survey Of Language Modelling Paradigm Adaptations In Recommender Systems Peng Liu, Lemei Zhang, Jon Atle Gulla
- Self-driven Grounding: Large Language Model Agents With Automatical Language-aligned Skill Learning Peng Shaohui, Hu Xing, Yi Qi, Zhang Rui, Guo Jiaming, Huang Di, Tian Zikang, Chen Ruizhi, Du Zidong, Guo Qi, Chen Yunji, Li Ling
- Reproducing Whisper-style Training Using An Open-source Toolkit And Publicly Available Data Peng Yifan, Tian Jinchuan, Yan Brian, Berrebbi Dan, Chang Xuankai, Li Xinjian, Shi Jiatong, Arora Siddhant, Chen William, Sharma Roshan, Zhang Wangyou, Sudo Yui, Shakeel Muhammad, Jung Jee-weon, Maiti Soumi, Watanabe Shinji
- Automatic Prompt Optimization With "gradient Descent" And Beam Search Pryzant Reid, Iter Dan, Li Jerry, Lee Yin Tat, Zhu Chenguang, Zeng Michael
- Vidcom: Fast Video Comprehension Through Large Language Models With Multimodal Tools Qi Ji, Ji Kaixuan, Yu Jifan, Wang Duokang, Xu Bin, Hou Lei, Li Juanzi
- Improving In-context Learning Via Bidirectional Alignment Qin Chengwei, Xia Wenhan, Jiao Fangkai, Chen Chen, Hu Yuchen, Ding Bosheng, Joty Shafiq
- Federated Full-parameter Tuning Of Billion-sized Language Models With Communication Cost Under 18 Kilobytes Qin Zhen, Chen Daoyuan, Qian Bingchen, Ding Bolin, Li Yaliang, Deng Shuiguang
- Large Language Models Are Effective Text Rankers With Pairwise Ranking Prompting Qin Zhen, Jagerman Rolf, Hui Kai, Zhuang Honglei, Wu Junru, Yan Le, Shen Jiaming, Liu Tianqi, Liu Jialu, Metzler Donald, Wang Xuanhui, Bendersky Michael
- Transnormerllm: A Faster And Better Large Language Model With Improved Transnormer Qin Zhen, Li Dong, Sun Weigao, Sun Weixuan, Shen Xuyang, Han Xiaodong, Wei Yunshen, Lv Baohong, Luo Xiao, Qiao Yu, Zhong Yiran
- Codegeex: A Pre-trained Model For Code Generation With Multilingual Benchmarking On Humaneval-x Qinkai Zheng, Xiao Xia, Xu Zou, Yuxiao Dong, Shan Wang, Yufei Xue, Zihan Wang, Lei Shen, Andi Wang, Yang Li, Teng Su, Zhilin Yang, Jie Tang
- Direct Preference Optimization: Your Language Model Is Secretly A Reward Model Rafael Rafailov, Archit Sharma, Eric Mitchell, Stefano Ermon, Christopher D. Manning, Chelsea Finn
- Quantized Transformer Language Model Implementations On Edge Devices Rahman Mohammad Wali Ur, Abrar Murad Mehrab, Copening Hunter Gibbons, Hariri Salim, Shao Sicong, Satam Pratik, Salehi Soheil
- Tailoring Self-rationalizers With Multi-reward Distillation Ramnath Sahana, Joshi Brihi, Hallinan Skyler, Lu Ximing, Li Liunian Harold, Chan Aaron, Hessel Jack, Choi Yejin, Ren Xiang
- Retrieval-based Knowledge Augmented Vision Language Pre-training Rao Jiahua, Shan Zifei, Liu Longpo, Zhou Yao, Yang Yuedong
- Gpt4tools: Teaching Large Language Model To Use Tools Via Self-instruction Rui Yang, Lin Song, Yanwei Li, Sijie Zhao, Yixiao Ge, Xiu Li, Ying Shan
- ICLEF: In-context Learning With Expert Feedback For Explainable Style Transfer Saakyan Arkadiy, Muresan Smaranda
- Promptmix: A Class Boundary Augmentation Method For Large Language Model Distillation Sahu Gaurav, Vechtomova Olga, Bahdanau Dzmitry, Laradji Issam H.
- Extending The Frontier Of Chatgpt: Code Generation And Debugging Sakib Fardin Ahsan, Khan Saadat Hasan, Karim A. H. M. Rezaul
- A Symmetric Dual Encoding Dense Retrieval Framework For Knowledge-intensive Visual Question Answering Salemi Alireza, Pizzorno Juan Altmayer, Zamani Hamed
- Efficient RLHF: Reducing The Memory Usage Of PPO Santacroce Michael, Lu Yadong, Yu Han, Li Yuanzhi, Shen Yelong
- What Matters In The Structured Pruning Of Generative Language Models? Santacroce Michael, Wen Zixin, Shen Yelong, Li Yuanzhi
- Beyond Chinchilla-optimal: Accounting For Inference In Language Model Scaling Laws Sardana Nikhil, Portes Jacob, Doubov Sasha, Frankle Jonathan
- GIFT: Generative Interpretable Fine-tuning Savadikar Chinmay, Song Xi, Wu Tianfu
- Grounding Gaps In Language Model Generations Shaikh Omar, Gligorić Kristina, Khetan Ashna, Gerstgrasser Matthias, Yang Diyi, Jurafsky Dan
- PB-LLM: Partially Binarized Large Language Models Shang Yuzhang, Yuan Zhihang, Wu Qiang, Dong Zhen
- One-shot Sensitivity-aware Mixed Sparsity Pruning For Large Language Models Shao Hang, Liu Bei, Xiao Bo, Zeng Ke, Wan Guanglu, Qian Yanmin
- Omniquant: Omnidirectionally Calibrated Quantization For Large Language Models Shao Wenqi, Chen Mengzhao, Zhang Zhaoyang, Xu Peng, Zhao Lirui, Li Zhiqian, Zhang Kaipeng, Gao Peng, Qiao Yu, Luo Ping
- Efficient LLM Inference On Cpus Shen Haihao, Chang Hanwen, Dong Bo, Luo Yu, Meng Hengyu
- Crossget: Cross-guided Ensemble Of Tokens For Accelerating Vision-language Transformers Shi Dachuan, Tao Chaofan, Rao Anyi, Yang Zhendong, Yuan Chun, Wang Jiaqi
- Sotana: The Open-source Software Development Assistant Shi Ensheng, Zhang Fengji, Wang Yanlin, Chen Bei, Du Lun, Zhang Hongyu, Han Shi, Zhang Dongmei, Sun Hongbin
- L3 Ensembles: Lifelong Learning Approach For Ensemble Of Foundational Language Models Shiri Aidin, Roy Kaushik, Sheth Amit, Gaur Manas
- Ep-alm: Efficient Perceptual Augmentation Of Language Models Shukor Mustafa, Dancette Corentin, Cord Matthieu
- Automatic Prompt Augmentation And Selection With Chain-of-thought From Labeled Data Shum Kashun, Diao Shizhe, Zhang Tong
- Knowledge Unlearning For Llms: Tasks, Methods, And Challenges Si Nianwen, Zhang Hao, Chang Heyu, Zhang Wenlin, Qu Dan, Zhang Weiqiang
- A Long Way To Go: Investigating Length Correlations In RLHF Singhal Prasann, Goyal Tanya, Xu Jiacheng, Durrett Greg
- Don't Add, Don't Miss: Effective Content Preserving Generation From Pre-selected Text Spans Slobodkin Aviv, Caciularu Avi, Hirsch Eran, Dagan Ido
- Biomedical Knowledge Graph-optimized Prompt Generation For Large Language Models Soman Karthik, Rose Peter W, Morris John H, Akbas Rabia E, Smith Brett, Peetoom Braian, Villouta-reyes Catalina, Cerono Gabriel, Shi Yongmei, Rizk-jackson Angela, Israni Sharat, Nelson Charlotte A, Huang Sui, Baranzini Sergio E
- Dynamics Of Instruction Tuning: Each Ability Of Large Language Models Has Its Own Growth Pace Song Chiyu, Zhou Zhanchao, Yan Jianhao, Fei Yuejiao, Lan Zhenzhong, Zhang Yue
- Preference Ranking Optimization For Human Alignment Song Feifan, Yu Bowen, Li Minghao, Yu Haiyang, Huang Fei, Li Yongbin, Wang Houfeng
- Zebra: Extending Context Window With Layerwise Grouped Local-global Attention Song Kaiqiang, Wang Xiaoyang, Cho Sangwoo, Pan Xiaoman, Yu Dong
- Pneumollm: Harnessing The Power Of Large Language Model For Pneumoconiosis Diagnosis Song Meiyue, Yu Zhihua, Wang Jiaxin, Wang Jiarui, Lu Yuting, Li Baicun, Wang Xiaoxu, Huang Qinghua, Li Zhijun, Kanellakis Nikolaos I., Liu Jiangfeng, Wang Jing, Wang Binglu, Yang Juntao
- Powerinfer: Fast Large Language Model Serving With A Consumer-grade GPU Song Yixin, Mi Zeyu, Xie Haotong, Chen Haibo
- Reward Collapse In Aligning Large Language Models Song Ziang, Cai Tianle, Lee Jason D., Su Weijie J.
- Joint Prompt Optimization Of Stacked Llms Using Variational Inference Sordoni Alessandro, Yuan Xingdi, Côté Marc-alexandre, Pereira Matheus, Trischler Adam, Xiao Ziang, Hosseini Arian, Niedtner Friederike, Roux Nicolas Le
- Dynamic Context Pruning For Efficient And Interpretable Autoregressive Transformers Sotiris Anagnostidis, Dario Pavllo, Luca Biggio, Lorenzo Noci, Aurelien Lucchi, Thomas Hofmann
- Response-act Guided Reinforced Dialogue Generation For Mental Health Counseling Srivastava Aseem, Pandey Ishan, Akhtar Md. Shad, Chakraborty Tanmoy
- Core-cog: Conversational Recommendation Of Entities Using Constrained Generation Srivastava Harshvardhan, Pruthi Kanav, Chakrabarti Soumen, Mausam
- The Languini Kitchen: Enabling Language Modelling Research At Different Scales Of Compute Stanić Aleksandar, Ashley Dylan, Serikov Oleg, Kirsch Louis, Faccio Francesco, Schmidhuber Jürgen, Hofmann Thomas, Schlag Imanol
- Emergent And Predictable Memorization In Large Language Models Stella Biderman, Usvsn Sai Prashanth, Lintang Sutawika, Hailey Schoelkopf, Quentin Anthony, Shivanshu Purohit, Edward Raff
- Distilled GPT For Source Code Summarization Su Chia-yi, Mcmillan Collin
- F3-pruning: A Training-free And Generalized Pruning Strategy Towards Faster And Finer Text-to-video Synthesis Su Sitong, Liu Jianzhi, Gao Lianli, Song Jingkuan
- CONSCENDI: A Contrastive And Scenario-guided Distillation Approach To Guardrail Models For Virtual Assistants Sun Albert Yu, Nair Varun, Schumacher Elliot, Kannan Anitha
- Create Your World: Lifelong Text-to-image Diffusion Sun Gan, Liang Wenqi, Dong Jiahua, Li Jun, Ding Zhengming, Cong Yang
- Reinforcement Learning In The Era Of Llms: What Is Essential? What Is Needed? An RL Perspective On RLHF, Prompting, And Beyond Sun Hao
- Query-dependent Prompt Evaluation And Optimization With Offline Inverse RL Sun Hao, Hüyük Alihan, Van Der Schaar Mihaela
- Autohint: Automatic Prompt Optimization With Hint Generation Sun Hong, Li Xue, Xu Yinchuan, Homma Youkow, Cao Qi, Wu Min, Jiao Jian, Charles Denis
- A Simple And Effective Pruning Approach For Large Language Models Sun Mingjie, Liu Zhuang, Bair Anna, Kolter J. Zico
- Make Prompt-based Black-box Tuning Colorful: Boosting Model Generalization From Three Orthogonal Perspectives Sun Qiushi, Han Chengcheng, Chen Nuo, Zhu Renyu, Gong Jingyang, Li Xiang, Gao Ming
- Corex: Pushing The Boundaries Of Complex Reasoning Through Multi-model Collaboration Sun Qiushi, Yin Zhangyue, Li Xiang, Wu Zhiyong, Qiu Xipeng, Kong Lingpeng
- EVA-CLIP: Improved Training Techniques For CLIP At Scale Sun Quan, Fang Yuxin, Wu Ledell, Wang Xinlong, Cao Yue
- Exploring The Impact Of Low-rank Adaptation On The Performance, Efficiency, And Regularization Of RLHF Sun Simeng, Gupta Dhawal, Iyyer Mohit
- A Prompt Learning Framework For Source Code Summarization Sun Weisong, Fang Chunrong, You Yudu, Chen Yuchen, Liu Yi, Wang Chong, Zhang Jian, Zhang Quanjun, Qian Hanwei, Zhao Wei, Liu Yang, Chen Zhenyu
- Instruction Distillation Makes Large Language Models Efficient Zero-shot Rankers Sun Weiwei, Chen Zheng, Ma Xinyu, Yan Lingyong, Wang Shuaiqiang, Ren Pengjie, Chen Zhumin, Yin Dawei, Ren Zhaochun
- Parrot: Enhancing Multi-turn Instruction Following For Large Language Models Sun Yuchong, Liu Che, Zhou Kun, Huang Jinwen, Song Ruihua, Zhao Wayne Xin, Zhang Fuzheng, Zhang Di, Gai Kun
- Large Language Models For Intent-driven Session Recommendations Sun Zhu, Liu Hongyang, Qu Xinghua, Feng Kaidong, Wang Yan, Ong Yew-soon
- Improving Non-autoregressive Translation Quality With Pretrained Language Model, Embedding Distillation And Upsampling Strategy For CTC Syu Shen-sian, Xie Juncheng, Lee Hung-yi
- Cappy: Outperforming And Boosting Large Multi-task Lms With A Small Scorer Tan Bowen, Zhu Yun, Liu Lijuan, Xing Eric, Hu Zhiting, Chen Jindong
- Compound Text-guided Prompt Tuning Via Image-adaptive Cues Tan Hao, Li Jun, Zhou Yizhuang, Wan Jun, Lei Zhen, Zhang Xiangyu
- SPDF: Sparse Pre-training And Dense Fine-tuning For Large Language Models Thangarasa Vithursan, Gupta Abhay, Marshall William, Li Tianda, Leong Kevin, Decoste Dennis, Lie Sean, Saxena Shreyas
- DUMA: A Dual-mind Conversational Agent With Fast And Slow Thinking Tian Xiaoyu, Chen Liangyu, Liu Na, Liu Yaxuan, Zou Wei, Chen Kaijiang, Cui Ming
- RLHF-V: Towards Trustworthy Mllms Via Behavior Alignment From Fine-grained Correctional Human Feedback Tianyu Yu, Yuan Yao, Haoye Zhang, Taiwen He, Yifeng Han, Ganqu Cui, Jinyi Hu, Zhiyuan Liu, Hai-tao Zheng, Maosong Sun, Tat-seng Chua
- Qlora: Efficient Finetuning Of Quantized Llms Tim Dettmers, Artidoro Pagnoni, Ari Holtzman, Luke Zettlemoyer
- Baby Llama: Knowledge Distillation From An Ensemble Of Teachers Trained On A Small Dataset With No Performance Penalty Timiryasov Inar, Tastet Jean-loup
- D4: Improving LLM Pretraining Via Document De-duplication And Diversification Tirumala Kushal, Simig Daniel, Aghajanyan Armen, Morcos Ari S.
- Automl-gpt: Large Language Model For Automl Tsai Yun-da, Tsai Yu-che, Huang Bo-wei, Yang Chun-pai, Lin Shou-de
- Activation Addition: Steering Language Models Without Optimization Turner Alexander Matt, Thiergart Lisa, Leech Gavin, Udell David, Vazquez Juan J., Mini Ulisse, Macdiarmid Monte
- Mini-gpts: Efficient Large Language Models Through Contextual Pruning Valicenti Tim, Vidal Justice, Patnaik Ritik
- The LLM Surgeon Van Der Ouderaa Tycho F. A., Nagel Markus, Van Baalen Mart, Asano Yuki M., Blankevoort Tijmen
- Accelerating Llama Inference By Enabling Intermediate Layer Decoding Via Instruction Tuning With LITE Varshney Neeraj, Chatterjee Agneet, Parmar Mihir, Baral Chitta
- PIP: Parse-instructed Prefix For Syntactically Controlled Paraphrase Generation Wan Yixin, Huang Kuan-hao, Chang Kai-wei
- Reformulating Domain Adaptation Of Large Language Models As Adapt-retrieve-revise: A Case Study On Chinese Legal Domain Wan Zhen, Zhang Yating, Wang Yexiang, Cheng Fei, Kurohashi Sadao
- Efficient Large Language Models: A Survey Wan Zhongwei, Wang Xin, Liu Che, Alam Samiul, Zheng Yu, Liu Jiachen, Qu Zhongnan, Yan Shen, Zhu Yi, Zhang Quanlu, Chowdhury Mosharaf, Zhang Mi
- Cost-effective Hyperparameter Optimization For Large Language Model Generation Inference Wang Chi, Liu Susan Xueqing, Awadallah Ahmed H.
- What Makes For Good Visual Tokenizers For Large Language Models? Wang Guangzhi, Ge Yixiao, Ding Xiaohan, Kankanhalli Mohan, Shan Ying
- Dialog Action-aware Transformer For Dialog Policy Learning Wang Huimin, Kwan Wai-chung, Wong Kam-fai
- Adapting LLM Agents With Universal Feedback In Communication Wang Kuan, Lu Yadong, Santacroce Michael, Gong Yeyun, Zhang Chao, Shen Yelong
- Simultaneous Machine Translation With Large Language Models Wang Minghan, Zhao Jinming, Vu Thuy-trang, Shiri Fatemeh, Shareghi Ehsan, Haffari Gholamreza
- Vamos: Versatile Action Models For Video Understanding Wang Shijie, Zhao Qi, Do Minh Quan, Agarwal Nakul, Lee Kwonjoon, Sun Chen
- Promptagent: Strategic Planning With Language Models Enables Expert-level Prompt Optimization Wang Xinyuan, Li Chenxi, Wang Zhen, Bai Fan, Luo Haotian, Zhang Jiayou, Jojic Nebojsa, Xing Eric P., Hu Zhiting
- Automated Evaluation Of Personalized Text Generation Using Large Language Models Wang Yaqing, Jiang Jiepu, Zhang Mingyang, Li Cheng, Liang Yi, Mei Qiaozhu, Bendersky Michael
- Pandalm: An Automatic Evaluation Benchmark For LLM Instruction Tuning Optimization Wang Yidong, Yu Zhuohao, Zeng Zhengran, Yang Linyi, Wang Cunxiang, Chen Hao, Jiang Chaoya, Xie Rui, Wang Jindong, Xie Xing, Ye Wei, Zhang Shikun, Zhang Yue
- Multilora: Democratizing Lora For Better Multi-task Learning Wang Yiming, Lin Yu, Zeng Xiaodong, Zhang Guannan
- Meta-reasoning: Semantics-symbol Deconstruction For Large Language Models Wang Yiming, Zhang Zhuosheng, Zhang Pei, Yang Baosong, Wang Rui
- Fake Alignment: Are Llms Really Aligned Well? Wang Yixu, Teng Yan, Huang Kexin, Lyu Chengqi, Zhang Songyang, Zhang Wenwei, Ma Xingjun, Jiang Yu-gang, Qiao Yu, Wang Yingchun
- Smarttrim: Adaptive Tokens And Attention Pruning For Efficient Vision-language Models Wang Zekun, Chen Jingchang, Zhou Wangchunshu, Zhu Haichao, Liang Jiafeng, Shan Liping, Liu Ming, Xu Dongliang, Yang Qing, Qin Bing
- Enabling Language Models To Implicitly Learn Self-improvement Wang Ziqi, Hou Le, Lu Tianjian, Wu Yuexin, Li Yunxuan, Yu Hongkun, Ji Heng
- Do Llms Understand User Preferences? Evaluating Llms On User Rating Prediction Wang-cheng Kang, Jianmo Ni, Nikhil Mehta, Maheswaran Sathiamoorthy, Lichan Hong, Ed Chi, Derek Zhiyuan Cheng
- Llmrec: Large Language Models With Graph Augmentation For Recommendation Wei Wei, Xubin Ren, Jiabin Tang, Qinyong Wang, Lixin Su, Suqi Cheng, Junfeng Wang, Dawei Yin, Chao Huang
- Is Chatgpt Good At Search? Investigating Large Language Models As Re-ranking Agents Weiwei Sun, Lingyong Yan, Xinyu Ma, Shuaiqiang Wang, Pengjie Ren, Zhumin Chen, Dawei Yin, Zhaochun Ren
- Autodroid: Llm-powered Task Automation In Android Wen Hao, Li Yuanchun, Liu Guohong, Zhao Shanhui, Yu Tao, Li Toby Jia-jun, Jiang Shiqi, Liu Yunhao, Zhang Yaqin, Liu Yunxin
- Mastering Symbolic Operations: Augmenting Language Models With Compiled Neural Networks Weng Yixuan, Zhu Minjun, Xia Fei, Li Bin, He Shizhu, Liu Kang, Zhao Jun
- Exploring Parameter-efficient Fine-tuning Techniques For Code Generation With Large Language Models Weyssow Martin, Zhou Xin, Kim Kisub, Lo David, Sahraoui Houari
- Frustratingly Simple Memory Efficiency For Pre-trained Language Models Via Dynamic Embedding Pruning Williams Miles, Aletras Nikolaos
- Speechgen: Unlocking The Generative Power Of Speech Language Models With Prompts Wu Haibin, Chang Kai-wei, Wu Yuan-kuei, Lee Hung-yi
- Toward Adversarial Training On Contextualized Language Representation Wu Hongqiu, Liu Yongxiang, Shi Hanwen, Zhao Hai, Zhang Min
- Efficient LLM Inference Solution On Intel GPU Wu Hui, Gan Yi, Yuan Feng, Ma Jing, Zhu Wei, Xu Yutao, Zhu Hong, Zhu Yuhua, Liu Xiaoli, Gu Jinghui, Zhao Peng
- Approximated Prompt Tuning For Vision-language Pre-trained Models Wu Qiong, Huang Shubin, Zhou Yiyi, Dai Pingyang, Shu Annan, Jiang Guannan, Ji Rongrong
- Sheared Llama: Accelerating Language Model Pre-training Via Structured Pruning Xia Mengzhou, Gao Tianyu, Zeng Zhiyuan, Chen Danqi
- Language Models Meet World Models: Embodied Experiences Enhance Language Models Xiang Jiannan, Tao Tianhua, Gu Yi, Shu Tianmin, Wang Zirui, Yang Zichao, Hu Zhiting
- Can I Trust Your Answer? Visually Grounded Video Question Answering Xiao Junbin, Yao Angela, Li Yicong, Chua Tat Seng
- O3D: Offline Data-driven Discovery And Distillation For Sequential Decision-making With Large Language Models Xiao Yuchen, Sun Yanchao, Xu Mengda, Madhushani Udari, Vann Jared, Garg Deepeka, Ganesh Sumitra
- Instructed Language Models With Retrievers Are Powerful Entity Linkers Xiao Zilin, Gong Ming, Wu Jie, Zhang Xingyao, Shou Linjun, Pei Jian, Jiang Daxin
- Empirical Study Of Zero-shot NER With Chatgpt Xie Tingyu, Li Qi, Zhang Jian, Zhang Yan, Liu Zuozhu, Wang Hongwei
- E4srec: An Elegant Effective Efficient Extensible Solution Of Large Language Models For Sequential Recommendation Xinhang Li, Chong Chen, Xiangyu Zhao, Yong Zhang, Chunxiao Xing
- Llm-pruner: On The Structural Pruning Of Large Language Models Xinyin Ma, Gongfan Fang, Xinchao Wang
- Gentopia: A Collaborative Platform For Tool-augmented Llms Xu Binfeng, Liu Xukun, Shen Hua, Han Zeyu, Li Yuhan, Yue Murong, Peng Zhiyuan, Liu Yuchen, Yao Ziyu, Xu Dongkuan
- Rewoo: Decoupling Reasoning From Observations For Efficient Augmented Language Models Xu Binfeng, Peng Zhiyuan, Lei Bowen, Mukherjee Subhabrata, Liu Yuchen, Xu Dongkuan
- Align On The Fly: Adapting Chatbot Behavior To Established Norms Xu Chunpu, Chern Steffi, Chern Ethan, Zhang Ge, Wang Zekun, Liu Ruibo, Li Jing, Fu Jie, Liu Pengfei
- Shattering The Agent-environment Interface For Fine-tuning Inclusive Language Models Xu Wanqiao, Dong Shi, Arumugam Dilip, Van Roy Benjamin
- Llmrefine: Pinpointing And Refining Large Language Models Via Fine-grained Actionable Feedback Xu Wenda, Deutsch Daniel, Finkelstein Mara, Juraska Juraj, Zhang Biao, Liu Zhongtao, Wang William Yang, Li Lei, Freitag Markus
- Compress, Then Prompt: Improving Accuracy-efficiency Trade-off Of LLM Inference With Transferable Prompt Xu Zhaozhuo, Liu Zirui, Chen Beidi, Tang Yuxin, Wang Jue, Zhou Kaixiong, Hu Xia, Shrivastava Anshumali
- Representation Learning With Large Language Models For Recommendation Xubin Ren, Wei Wei, Lianghao Xia, Lixin Su, Suqi Cheng, Junfeng Wang, Dawei Yin, Chao Huang
- Parameter-efficient Tuning Helps Language Model Alignment Xue Tianci, Wang Ziqi, Ji Heng
- Refining The Responses Of Llms By Themselves Yan Tianqiang, Xu Tiansheng
- Codescope: An Execution-based Multilingual Multitask Multidimensional Benchmark For Evaluating Llms On Code Understanding And Generation Yan Weixiang, Liu Haitian, Wang Yunkun, Li Yunzhe, Chen Qian, Wang Wen, Lin Tingyu, Zhao Weishan, Zhu Li, Sundaram Hari, Deng Shuiguang
- Effective Distillation Of Table-based Reasoning Ability From Llms Yang Bohao, Tang Chen, Zhao Kun, Xiao Chenghao, Lin Chenghua
- Tackling Vision Language Tasks Through Learning Inner Monologues Yang Diji, Chen Kezhen, Rao Jinmeng, Guo Xiaoyuan, Zhang Yawen, Yang Jie, Zhang Yi
- Instoptima: Evolutionary Multi-objective Instruction Optimization Via Large Language Model-based Instruction Operators Yang Heng, Li Ke
- Progressive Evidence Refinement For Open-domain Multimodal Retrieval Question Answering Yang Shuwen, Wu Anran, Wu Xingjiao, Xiao Luwei, Ma Tianlong, Jin Cheng, He Liang
- Enabling Large Language Models To Learn From Rules Yang Wenkai, Lin Yankai, Zhou Jie, Wen Jirong
- Quantization-aware And Tensor-compressed Training Of Transformers For Natural Language Understanding Yang Zi, Choudhary Samridhi, Kunzmann Siegfried, Zhang Zheng
- Retroformer: Retrospective Large Language Agents With Policy Gradient Optimization Yao Weiran, Heinecke Shelby, Niebles Juan Carlos, Liu Zhiwei, Feng Yihao, Xue Le, Murthy Rithesh, Chen Zeyuan, Zhang Jianguo, Arpit Devansh, Xu Ran, Mui Phil, Wang Huan, Xiong Caiming, Savarese Silvio
- Nanolm: An Affordable LLM Pre-training Benchmark Via Accurate Loss Prediction Across Scales Yao Yiqun, Fan Siqi, Huang Xiusheng, Fang Xuezhi, Li Xiang, Ni Ziyi, Jiang Xin, Meng Xuying, Han Peng, Shang Shuo, Liu Kang, Sun Aixin, Wang Yequan
- Do Physicians Know How To Prompt? The Need For Automatic Prompt Optimization Help In Clinical Note Generation Yao Zonghai, Jaafar Ahmed, Wang Beining, Yang Zhichao, Yu Hong
- Qilin-med: Multi-stage Knowledge Injection Advanced Medical Large Language Model Ye Qichen, Liu Junling, Chong Dading, Zhou Peilin, Hua Yining, Liu Fenglin, Cao Meng, Wang Ziming, Cheng Xuxin, Lei Zhu, Guo Zhenhua
- Rational Decision-making Agent With Internalized Utility Judgment Ye Yining, Cong Xin, Tian Shizuo, Qin Yujia, Liu Chong, Lin Yankai, Liu Zhiyuan, Sun Maosong
- Llm-eval: Unified Multi-dimensional Automatic Evaluation For Open-domain Conversations With Large Language Models Yen-ting Lin, Yun-nung Chen
- Modulora: Finetuning 2-bit Llms On Consumer Gpus By Integrating With Modular Quantizers Yin Junjie, Dong Jiahao, Wang Yingheng, De Sa Christopher, Kuleshov Volodymyr
- Asyncmld: Asynchronous Multi-llm Framework For Dialogue Recommendation System Yoshimaru Naoki, Okuma Motoharu, Iio Takamasa, Hatano Kenji
- \(P^{3}O\): Transferring Visual Representations For Reinforcement Learning Via Prompting You Guoliang, Chu Xiaomeng, Duan Yifan, Peng Jie, Ji Jianmin, Zhang Yu, Zhang Yanyong
- GPTFUZZER: Red Teaming Large Language Models With Auto-generated Jailbreak Prompts Yu Jiahao, Lin Xingwei, Yu Zheng, Xing Xinyu
- MELO: Enhancing Model Editing With Neuron-indexed Dynamic Lora Yu Lang, Chen Qin, Zhou Jie, He Liang
- Information Association For Language Model Updating By Mitigating Lm-logical Discrepancy Yu Pengfei, Ji Heng
- Large Language Model As Attributed Training Data Generator: A Tale Of Diversity And Bias Yu Yue, Zhuang Yuchen, Zhang Jieyu, Meng Yu, Ratner Alexander, Krishna Ranjay, Shen Jiaming, Zhang Chao
- Fine-tuning Language Models With Generative Adversarial Reward Modelling Yu Zhang Ze, Jaw Lau Jia, Hui Zhang, Low Bryan Kian Hsiang
- Speculative Contrastive Decoding Yuan Hongyi, Lu Keming, Huang Fei, Yuan Zheng, Zhou Chang
- Large Language Models Illuminate A Progressive Pathway To Artificial Healthcare Assistant: A Review Yuan Mingze, Bao Peng, Yuan Jiajia, Shen Yunhao, Chen Zifan, Xie Yi, Zhao Jie, Chen Yang, Zhang Li, Shen Lin, Dong Bin
- Where To Go Next For Recommender Systems? ID- Vs. Modality-based Recommender Models Revisited Yuan Zheng, Yuan Fajie, Song Yu, Li Youhua, Fu Junchen, Yang Fei, Pan Yunzhu, Ni Yongxin
- Artgpt-4: Towards Artistic-understanding Large Vision-language Models With Enhanced Adapter Yuan Zhengqing, He Yunhong, Wang Kun, Ye Yanfang, Sun Lichao
- Plan, Eliminate, And Track -- Language Models Are Good Teachers For Embodied Agents Yue Wu, So Yeon Min, Yonatan Bisk, Ruslan Salakhutdinov, Amos Azaria, Yuanzhi Li, Tom Mitchell, Shrimai Prabhumoye
- Tool Learning With Foundation Models Yujia Qin, Shengding Hu, Yankai Lin, Weize Chen, Ning Ding, Ganqu Cui, Zheni Zeng, Yufei Huang, Chaojun Xiao, Chi Han, Yi Ren Fung, Yusheng Su, Huadong Wang, Cheng Qian, Runchu Tian, Kunlun Zhu, Shihao Liang, Xingyu Shen, Bokai Xu, Zhen Zhang, Yining Ye, Bowen Li, Ziwei Tang, Jing Yi, Yuzhang Zhu, Zhenning Dai, Lan Yan, Xin Cong, Yaxi Lu, Weilin Zhao, Yuxiang Huang, Junxi Yan, Xu Han, Xian Sun, Dahai Li, Jason Phang, Cheng Yang, Tongshuang Wu, Heng Ji, Zhiyuan Liu, Maosong Sun
- Fairness-aware Structured Pruning In Transformers Zayed Abdelrahman, Mordido Goncalo, Shabanian Samira, Baldini Ioana, Chandar Sarath
- Improving Machine Translation With Large Language Models: A Preliminary Study With Cooperative Decoding Zeng Jiali, Meng Fandong, Yin Yongjing, Zhou Jie
- Text Alignment Is An Efficient Unified Model For Massive NLP Tasks Zha Yuheng, Yang Yichi, Li Ruichen, Hu Zhiting
- G3detector: General Gpt-generated Text Detector Zhan Haolan, He Xuanli, Xu Qiongkai, Wu Yuxiang, Stenetorp Pontus
- Vpgtrans: Transfer Visual Prompt Generator Across Llms Zhang Ao, Fei Hao, Yao Yuan, Ji Wei, Li Li, Liu Zhiyuan, Chua Tat-seng
- Task-agnostic Distillation Of Encoder-decoder Language Models Zhang Chen, Yang Yang, Wang Jingang, Song Dawei
- Benchmarking And In-depth Performance Study Of Large Language Models On Habana Gaudi Processors Zhang Chengming, Sun Baixi, Yu Xiaodong, Xie Zhen, Zheng Weijian, Iskra Kamil, Beckman Pete, Tao Dingwen
- Controllable Text Generation With Residual Memory Transformer Zhang Hanqing, Si Sun, Wu Haiming, Song Dawei
- Linear Attention Via Orthogonal Memory Zhang Jun, Jiang Shuyang, Feng Jiangtao, Zheng Lin, Kong Lingpeng
- Marathon: A Race Through The Realm Of Long Context With Large Language Models Zhang Lei, Li Yunshui, Liu Ziqiang, Yang Jiaxi, Liu Junhao, Chen Longze, Luo Run, Yang Min
- A Two-stage Adaptation Of Large Language Models For Text Ranking Zhang Longhui, Zhang Yanzhao, Long Dingkun, Xie Pengjun, Zhang Meishan, Zhang Min
- Retrieve Anything To Augment Large Language Models Zhang Peitian, Xiao Shitao, Liu Zheng, Dou Zhicheng, Nie Jian-yun
- Automl-gpt: Automatic Machine Learning With GPT Zhang Shujian, Gong Chengyue, Wu Lemeng, Liu Xingchao, Zhou Mingyuan
- Meta Prompting For AI Systems Zhang Yifan, Yuan Yang, Yao Andrew Chi-chih
- IAG: Induction-augmented Generation Framework For Answering Reasoning Questions Zhang Zhebin, Zhang Xinyu, Ren Yuanhang, Shi Saijiang, Han Meng, Wu Yongkang, Lai Ruofei, Cao Zhao
- Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-thought Reasoning To Language Agents Zhang Zhuosheng, Yao Yao, Zhang Aston, Tang Xiangru, Ma Xinbei, He Zhiwei, Wang Yiming, Gerstein Mark, Wang Rui, Liu Gongshen, Zhao Hai
- You Only Look At Screens: Multimodal Chain-of-action Agents Zhang Zhuosheng, Zhang Aston
- LLM4DV: Using Large Language Models For Hardware Test Stimuli Generation Zhang Zixi, Chadwick Greg, Mcnally Hugo, Zhao Yiren, Mullins Robert
- Tuning Layernorm In Attention: Towards Efficient Multi-modal LLM Finetuning Zhao Bingchen, Tu Haoqin, Wei Chen, Mei Jieru, Xie Cihang
- Chatspot: Bootstrapping Multimodal Llms Via Precise Referring Instruction Tuning Zhao Liang, Yu En, Ge Zheng, Yang Jinrong, Wei Haoran, Zhou Hongyu, Sun Jianjian, Peng Yuang, Dong Runpei, Han Chunrui, Zhang Xiangyu
- Group Preference Optimization: Few-shot Alignment Of Large Language Models Zhao Siyan, Dang John, Grover Aditya
- Pareto Optimal Learning For Estimating Large Language Model Errors Zhao Theodore, Wei Mu, Preston J. Samuel, Poon Hoifung
- Ca-lora: Adapting Existing Lora For Compressed Llms To Enable Efficient Multi-tasking On Personal Devices Zhao Weilin, Huang Yuxiang, Han Xu, Liu Zhiyuan, Zhang Zhengyan, Li Kuai, Chen Chen, Yang Tao, Sun Maosong
- Investigating Table-to-text Generation Capabilities Of Llms In Real-world Information Seeking Scenarios Zhao Yilun, Zhang Haowei, Si Shengyun, Nan Linyong, Tang Xiangru, Cohan Arman
- Mllm-dataengine: An Iterative Refinement Approach For MLLM Zhao Zhiyuan, Ouyang Linke, Wang Bin, Huang Siyuan, Zhang Pan, Dong Xiaoyi, Wang Jiaqi, He Conghui
- Adapting Large Language Models By Integrating Collaborative Semantics For Recommendation Zheng Bowen, Hou Yupeng, Lu Hongyu, Chen Yu, Zhao Wayne Xin, Chen Ming, Wen Ji-rong
- Learn From Model Beyond Fine-tuning: A Survey Zheng Hongling, Shen Li, Tang Anke, Luo Yong, Hu Han, Du Bo, Tao Dacheng
- Sglang: Efficient Execution Of Structured Language Model Programs Zheng Lianmin, Yin Liangsheng, Xie Zhiqiang, Sun Chuyue, Huang Jeff, Yu Cody Hao, Cao Shiyi, Kozyrakis Christos, Stoica Ion, Gonzalez Joseph E., Barrett Clark, Sheng Ying
- Towards Unified Token Learning For Vision-language Tracking Zheng Yaozong, Zhong Bineng, Liang Qihua, Li Guorong, Ji Rongrong, Li Xianxian
- Response Length Perception And Sequence Scheduling: An Llm-empowered LLM Inference Pipeline Zheng Zangwei, Ren Xiaozhe, Xue Fuzhao, Luo Yang, Jiang Xin, You Yang
- Llamarec: Two-stage Recommendation Using Large Language Models For Ranking Zhenrui Yue, Sara Rabhi, Gabriel De Souza Pereira Moreira, Dong Wang, Even Oldridge
- Beyond Hallucinations: Enhancing Lvlms Through Hallucination-aware Direct Preference Optimization Zhiyuan Zhao, Bin Wang, Linke Ouyang, Xiaoyi Dong, Jiaqi Wang, Conghui He
- Sur-adapter: Enhancing Text-to-image Pre-trained Diffusion Models With Large Language Models Zhong Shanshan, Huang Zhongzhan, Wen Wushao, Qin Jinghui, Lin Liang
- Survival Of The Most Influential Prompts: Efficient Black-box Prompt Search Via Clustering And Pruning Zhou Han, Wan Xingchen, Vulić Ivan, Korhonen Anna
- Lobass: Gauging Learnability In Supervised Fine-tuning Data Zhou Haotian, Liu Tingkai, Ma Qianli, Yuan Jianbo, Liu Pengfei, You Yang, Yang Hongxia
- Large Language Model As A Policy Teacher For Training Reinforcement Learning Agents Zhou Zihao, Hu Bin, Zhao Chenyang, Zhang Pu, Liu Bin
- Extrapolating Large Language Models To Non-english By Aligning Languages Zhu Wenhao, Lv Yunzhe, Dong Qingxiu, Yuan Fei, Xu Jingjing, Huang Shujian, Kong Lingpeng, Chen Jiajun, Li Lei
- Pad: Program-aided Distillation Can Teach Small Models Reasoning Better Than Chain-of-thought Fine-tuning Zhu Xuekai, Qi Biqing, Zhang Kaiyan, Long Xinwei, Lin Zhouhan, Zhou Bowen
- 3d-vista: Pre-trained Transformer For 3D Vision And Text Alignment Zhu Ziyu, Ma Xiaojian, Chen Yixin, Deng Zhidong, Huang Siyuan, Li Qing
- Scissorhands: Exploiting The Persistence Of Importance Hypothesis For LLM KV Cache Compression At Test Time Zichang Liu, Aditya Desai, Fangshuo Liao, Weitao Wang, Victor Xie, Zhaozhuo Xu, Anastasios Kyrillidis, Anshumali Shrivastava
- Aligner: One Global Token Is Worth Millions Of Parameters When Aligning Large Language Models Ziheng Zhou, Wu Yingnian, Zhu Song-chun, Terzopoulos Demetri
- Benchmarking Large Language Models For Persian: A Preliminary Study Focusing On Chatgpt Abaskohi Amirhossein, Baruni Sara, Masoudi Mostafa, Abbasi Nesa, Babalou Mohammad Hadi, Edalat Ali, Kamahi Sepehr, Sani Samin Mahdizadeh, Naghavian Nikoo, Namazifard Danial, Sadeghi Pouya, Yaghoobzadeh Yadollah
- Are You Still On Track!? Catching LLM Task Drift With Activations Abdelnabi Sahar, Fay Aideen, Cherubin Giovanni, Salem Ahmed, Fritz Mario, Paverd Andrew
- A Survey On Symbolic Knowledge Distillation Of Large Language Models Acharya Kamal, Velasquez Alvaro, Song Houbing Herbert
- Attnlrp: Attention-aware Layer-wise Relevance Propagation For Transformers Achtibat Reduan, Hatefi Sayed Mohammad Vakilzadeh, Dreyer Maximilian, Jain Aakriti, Wiegand Thomas, Lapuschkin Sebastian, Samek Wojciech
- Towards Optimizing And Evaluating A Retrieval Augmented QA Chatbot Using Llms With Human In The Loop Afzal Anum, Kowsik Alexander, Fani Rajna, Matthes Florian
- Step-by-step Unmasking For Parameter-efficient Fine-tuning Of Large Language Models Agarwal Aradhye, Ramesh Suhas K, Sengupta Ayan, Chakraborty Tanmoy
- Promptwizard: Task-aware Agent-driven Prompt Optimization Framework Agarwal Eshaan, Dani Vivek, Ganu Tanuja, Nambi Akshay
- Codemirage: Hallucinations In Code Generated By Large Language Models Agarwal Vibhor, Pei Yulong, Alamir Salwa, Liu Xiaomo
- Enabling High-sparsity Foundational Llama Models With Efficient Pretraining And Deployment Agarwalla Abhinav, Gupta Abhay, Marques Alexandre, Pandit Shubhra, Goin Michael, Kurtic Eldar, Leong Kevin, Nguyen Tuan, Salem Mahmoud, Alistarh Dan, Lie Sean, Kurtz Mark
- Etalon: Holistic Performance Evaluation Framework For LLM Inference Systems Agrawal Amey, Agarwal Anmol, Kedia Nitin, Mohan Jayashree, Kundu Souvik, Kwatra Nipun, Ramjee Ramachandran, Tumanov Alexey
- LM4OPT: Unveiling The Potential Of Large Language Models In Formulating Mathematical Optimization Problems Ahmed Tasnim, Choudhury Salimur
- Shadowllm: Predictor-based Contextual Sparsity For Large Language Models Akhauri Yash, Abouelhamayed Ahmed F, Dotzel Jordan, Zhang Zhiru, Rush Alexander M, Huda Safeen, Abdelfattah Mohamed S
- Mixture Of Modular Experts: Distilling Knowledge From A Multilingual Teacher Into Specialized Modular Language Models Al-maamari Mohammed, Amor Mehdi Ben, Granitzer Michael
- Benchmarking Open-source Language Models For Efficient Question Answering In Industrial Applications Alassan Mahaman Sanoussi Yahaya, Espejel Jessica López, Bouhandi Merieme, Dahhane Walid, Ettifouri El Hassane
- Chatgpt Alternative Solutions: Large Language Models Survey Alipour Hanieh, Pendar Nick, Roy Kohinoor
- Understanding Alignment In Multimodal Llms: A Comprehensive Study Amirloo Elmira, Fauconnier Jean-philippe, Roesmann Christoph, Kerl Christian, Boney Rinu, Qian Yusu, Wang Zirui, Dehghan Afshin, Yang Yinfei, Gan Zhe, Grasch Peter
- Sentencevae: Enable Next-sentence Prediction For Large Language Models With Faster Speed, Higher Accuracy And Longer Context An Hongjun, Chen Yifan, Sun Zhe, Li Xuelong
- Enhancing Journalism With AI: A Study Of Contextualized Image Captioning For News Articles Using Llms And Lmms Anagnostopoulou Aliki, Gouvea Thiago, Sonntag Daniel
- Tensor Train Low-rank Approximation (tt-lora): Democratizing AI With Accelerated Llms Anjum Afia, Eren Maksim E., Boureima Ismael, Alexandrov Boian, Bhattarai Manish
- Scaling Sparse Fine-tuning To Large Language Models Ansell Alan, Vulić Ivan, Sterz Hannah, Korhonen Anna, Ponti Edoardo M.
- Forklift: An Extensible Neural Lifter Armengol-estapé Jordi, Rocha Rodrigo C. O., Woodruff Jackson, Minervini Pasquale, O'boyle Michael F. P.
- Simple Linear Attention Language Models Balance The Recall-throughput Tradeoff Arora Simran, Eyuboglu Sabri, Zhang Michael, Timalsina Aman, Alberti Silas, Zinsley Dylan, Zou James, Rudra Atri, Ré Christopher
- Enhancing Holonic Architecture With Natural Language Processing For System Of Systems Ashfaq Muhammad, Sadik Ahmed R., Mikkonen Tommi, Waseem Muhammad, Mäkitalo Niko
- Slicegpt: Compress Large Language Models By Deleting Rows And Columns Ashkboos Saleh, Croci Maximilian L., Nascimento Marcelo Gennari Do, Hoefler Torsten, Hensman James
- Bifurcated Attention: Accelerating Massively Parallel Decoding With Shared Prefixes In Llms Athiwaratkun Ben, Gonugondla Sujan Kumar, Gouda Sanjay Krishna, Qian Haifeng, Ding Hantian, Sun Qing, Wang Jun, Guo Jiacheng, Chen Liangfu, Bhatia Parminder, Nallapati Ramesh, Sengupta Sudipta, Xiang Bing
- Transformer Alignment In Large Language Models Aubry Murdock, Meng Haoming, Sugolov Anton, Papyan Vardan
- GRAD-SUM: Leveraging Gradient Summarization For Optimal Prompt Engineering Austin Derek, Chartock Elliott
- Lamda: Large Model Fine-tuning Via Spectrally Decomposed Low-dimensional Adaptation Azizi Seyedarmin, Kundu Souvik, Pedram Massoud
- Evaluating The Efficacy Of Open-source Llms In Enterprise-specific RAG Systems: A Comparative Study Of Performance And Scalability B Gautam, Purwar Anupam
- Hybrid Preference Optimization: Augmenting Direct Preference Optimization With Auxiliary Objectives Badrinath Anirudhan, Agarwal Prabhat, Xu Jiajing
- Quantifying The Capabilities Of Llms Across Scale And Precision Badshah Sher, Sajjad Hassan
- Efficient Model-agnostic Alignment Via Bayesian Persuasion Bai Fengshuo, Wang Mingzhi, Zhang Zhaowei, Chen Boyuan, Xu Yinda, Wen Ying, Yang Yaodong
- Sparsellm: Towards Global Pruning For Pre-trained Language Models Bai Guangji, Li Yijiang, Ling Chen, Kim Kibaek, Zhao Liang
- Citrus: Chunked Instruction-aware State Eviction For Long Sequence Modeling Bai Yu, Zou Xiyuan, Huang Heyan, Chen Sanxing, Rondeau Marc-antoine, Gao Yang, Cheung Jackie Chi Kit
- Finetuning Large Language Model For Personalized Ranking Bai Zhuoxi, Wu Ning, Cai Fengyu, Zhu Xinyi, Xiong Yun
- Smaller, Weaker, Yet Better: Training LLM Reasoners Via Compute-optimal Sampling Bansal Hritik, Hosseini Arian, Agarwal Rishabh, Tran Vinh Q., Kazemi Mehran
- Comparing Bad Apples To Good Oranges: Aligning Large Language Models Via Joint Preference Optimization Bansal Hritik, Suvarna Ashima, Bhatt Gantavya, Peng Nanyun, Chang Kai-wei, Grover Aditya
- Medaide: Leveraging Large Language Models For On-premise Medical Assistance On Edge Devices Basit Abdul, Hussain Khizar, Hanif Muhammad Abdullah, Shafique Muhammad
- A Comparative Study Of DSL Code Generation: Fine-tuning Vs. Optimized Retrieval Augmentation Bassamzadeh Nastaran, Methani Chhaya
- The Unreasonable Effectiveness Of Eccentric Automatic Prompts Battle Rick, Gollapudi Teja
- Evolutionary Multi-objective Optimization Of Large Language Model Prompts For Balancing Sentiments Baumann Jill, Kramer Oliver
- Enhanced Language Model Truthfulness With Learnable Intervention And Uncertainty Expression Bayat Farima Fatahi, Liu Xin, Jagadish H. V., Wang Lu
- SUTRA: Scalable Multilingual Language Model Architecture Bendale Abhijit, Sapienza Michael, Ripplinger Steven, Gibbs Simon, Lee Jaewon, Mistry Pranav
- Think Big, Generate Quick: Llm-to-slm For Fast Autoregressive Decoding Bergner Benjamin, Skliar Andrii, Royer Amelie, Blankevoort Tijmen, Asano Yuki, Bejnordi Babak Ehteshami
- Lotr: Low Tensor Rank Weight Adaptation Bershatsky Daniel, Cherniuk Daria, Daulbaev Talgat, Mikhalev Aleksandr, Oseledets Ivan
- Efficient Reinforcement Learning Via Large Language Model-based Search Bhambri Siddhant, Bhattacharjee Amrita, Liu Huan, Kambhampati Subbarao
- Fintral: A Family Of GPT-4 Level Multimodal Financial Large Language Models Bhatia Gagan, Nagoudi El Moatez Billah, Cavusoglu Hasan, Abdul-mageed Muhammad
- Towards Cross-tokenizer Distillation: The Universal Logit Distillation Loss For Llms Boizard Nicolas, Haddad Kevin El, Hudelot Céline, Colombo Pierre
- Low-rank Quantization-aware Training For Llms Bondarenko Yelysei, Del Chiaro Riccardo, Nagel Markus
- Elephants Never Forget: Memorization And Learning Of Tabular Data In Large Language Models Bordt Sebastian, Nori Harsha, Rodrigues Vanessa, Nushi Besmira, Caruana Rich
- Fast And Effective Weight Update For Pruned Large Language Models Boža Vladimír
- ACORN: Aspect-wise Commonsense Reasoning Explanation Evaluation Brassard Ana, Heinzerling Benjamin, Kudo Keito, Sakaguchi Keisuke, Inui Kentaro
- Generation, Distillation And Evaluation Of Motivational Interviewing-style Reflections With A Foundational Language Model Brown Andrew, Zhu Jiading, Abdelwahab Mohamed, Dong Alec, Wang Cindy, Rose Jonathan
- Pheme: Efficient And Conversational Speech Generation Budzianowski Paweł, Sereda Taras, Cichy Tomasz, Vulić Ivan
- Aligning Large Language Models With Counterfactual DPO Butcher Bradley
- Olora: Orthonormal Low-rank Adaptation Of Large Language Models Büyükakyüz Kerim
- On Large Language Models In National Security Applications Caballero William N., Jenkins Phillip R.
- Cendol: Open Instruction-tuned Generative Large Language Models For Indonesian Languages Cahyawijaya Samuel, Lovenia Holy, Koto Fajri, Putri Rifki Afina, Dave Emmanuel, Lee Jhonson, Shadieq Nuur, Cenggoro Wawan, Akbar Salsabil Maulana, Mahendra Muhammad Ihza, Putri Dea Annisayanti, Wilie Bryan, Winata Genta Indra, Aji Alham Fikri, Purwarianti Ayu, Fung Pascale
- Self-control Of LLM Behaviors By Compressing Suffix Gradient Into Prefix Controller Cai Min, Zhang Yuchen, Zhang Shichang, Yin Fan, Zou Difan, Yue Yisong, Hu Ziniu
- Matryoshka Multimodal Models Cai Mu, Yang Jianwei, Gao Jianfeng, Lee Yong Jae
- Flextron: Many-in-one Flexible Large Language Model Cai Ruisi, Muralidharan Saurav, Heinrich Greg, Yin Hongxu, Wang Zhangyang, Kautz Jan, Molchanov Pavlo
- Lococo: Dropping In Convolutions For Long Context Compression Cai Ruisi, Tian Yuandong, Wang Zhangyang, Chen Beidi
- Medusa: Simple LLM Inference Acceleration Framework With Multiple Decoding Heads Cai Tianle, Li Yuhong, Geng Zhengyang, Peng Hongwu, Lee Jason D., Chen Deming, Dao Tri
- Internlm2 Technical Report Cai Zheng, Cao Maosong, Chen Haojiong, Chen Kai, Chen Keyu, Chen Xin, Chen Xun, Chen Zehui, Chen Zhi, Chu Pei, Dong Xiaoyi, Duan Haodong, Fan Qi, Fei Zhaoye, Gao Yang, Ge Jiaye, Gu Chenya, Gu Yuzhe, Gui Tao, Guo Aijia, Guo Qipeng, He Conghui, Hu Yingfan, Huang Ting, Jiang Tao, Jiao Penglong, Jin Zhenjiang, Lei Zhikai, Li Jiaxing, Li Jingwen, Li Linyang, Li Shuaibin, Li Wei, Li Yining, Liu Hongwei, Liu Jiangning, Hong Jiawei, Liu Kaiwen, Liu Kuikun, Liu Xiaoran, Lv Chengqi, Lv Haijun, Lv Kai, Ma Li, Ma Runyuan, Ma Zerun, Ning Wenchang, Ouyang Linke, Qiu Jiantao, Qu Yuan, Shang Fukai, Shao Yunfan, Song Demin, Song Zifan, Sui Zhihao, Sun Peng, Sun Yu, Tang Huanze, Wang Bin, Wang Guoteng, Wang Jiaqi, Wang Jiayu, Wang Rui, Wang Yudong, Wang Ziyi, Wei Xingjian, Weng Qizhen, Wu Fan, Xiong Yingtong, Xu Chao, Xu Ruiliang, Yan Hang, Yan Yirong, Yang Xiaogui, Ye Haochen, Ying Huaiyuan, Yu Jia, Yu Jing, Zang Yuhang, Zhang Chuyu, Zhang Li, Zhang Pan, Zhang Peng, Zhang Ruijie, Zhang Shuo, Zhang Songyang, Zhang Wenjian, Zhang Wenwei, Zhang Xingcheng, Zhang Xinyue, Zhao Hui, Zhao Qian, Zhao Xiaomeng, Zhou Fengzhe, Zhou Zaida, Zhuo Jingming, Zou Yicheng, Qiu Xipeng, Qiao Yu, Lin Dahua
- MADTP: Multimodal Alignment-guided Dynamic Token Pruning For Accelerating Vision-language Transformer Cao Jianjian, Ye Peng, Li Shengze, Yu Chong, Tang Yansong, Lu Jiwen, Chen Tao
- Beyond Sparse Rewards: Enhancing Reinforcement Learning With Language Model Critique In Text Generation Cao Meng, Shu Lei, Yu Lei, Zhu Yun, Wichers Nevan, Liu Yinxiao, Meng Lei
- Personalized Steering Of Large Language Models: Versatile Steering Vectors Through Bi-directional Preference Optimization Cao Yuanpu, Zhang Tianrong, Cao Bochuan, Yin Ziyi, Lin Lu, Ma Fenglong, Chen Jinghui
- Survey On Large Language Model-enhanced Reinforcement Learning: Concept, Taxonomy, And Methods Cao Yuji, Zhao Huan, Cheng Yuheng, Shu Ting, Liu Guolong, Liang Gaoqi, Zhao Junhua, Li Yun
- Smileyllama: Modifying Large Language Models For Directed Chemical Space Exploration Cavanagh Joseph M., Sun Kunyang, Gritsevskiy Andrew, Bagni Dorian, Bannister Thomas D., Head-gordon Teresa
- Towards Robust And Cost-efficient Knowledge Unlearning For Large Language Models Cha Sungmin, Cho Sungjun, Hwang Dasol, Lee Moontae
- Prompto: An Open Source Library For Asynchronous Querying Of LLM Endpoints Chan Ryan Sze-yin, Nanni Federico, Brown Edwin, Chapman Ed, Williams Angus R., Bright Jonathan, Gabasova Evelina
- Benchmarking Cognitive Domains For Llms: Insights From Taiwanese Hakka Culture Chang Chen-chi, Chen Ching-yuan, Lee Hung-shin, Lee Chih-cheng
- Speechprompt: Prompting Speech Language Models For Speech Processing Tasks Chang Kai-wei, Wu Haibin, Wang Yu-kai, Wu Yuan-kuei, Shen Hua, Tseng Wei-cheng, Kang Iu-thing, Li Shang-wen, Lee Hung-yi
- Efficient Prompting Methods For Large Language Models: A Survey Chang Kaiyan, Xu Songcheng, Wang Chenglong, Luo Yingfeng, Xiao Tong, Zhu Jingbo
- Xprompt: Explaining Large Language Model's Generation Via Joint Prompt Attribution Chang Yurui, Cao Bochuan, Wang Yujia, Chen Jinghui, Lin Lu
- Make Large Language Model A Better Ranker Chao Wenshuo, Zheng Zhi, Zhu Hengshu, Liu Hao
- No Such Thing As A General Learner: Language Models And Their Dual Optimization Chemla Emmanuel, Nefdt Ryan M.
- Hardware-aware Parallel Prompt Decoding For Memory-efficient Acceleration Of LLM Inference Chen Hao Mark, Luk Wayne, Yiu Ka Fai Cedric, Li Rui, Mishchenko Konstantin, Venieris Stylianos I., Fan Hongxiang
- Efficiency In Focus: Layernorm As A Catalyst For Fine-tuning Medical Visual Language Pre-trained Models Chen Jiawei, Yang Dingkang, Jiang Yue, Li Mingcheng, Wei Jinjie, Hou Xiaolu, Zhang Lihua
- Putting People In Llms' Shoes: Generating Better Answers Via Question Rewriter Chen Junhao, Wang Bowen, Jiang Zhouqiang, Nakashima Yuta
- Pixart-\(\Sigma\): Weak-to-strong Training Of Diffusion Transformer For 4K Text-to-image Generation Chen Junsong, Ge Chongjian, Xie Enze, Wu Yue, Yao Lewei, Ren Xiaozhe, Wang Zhongdao, Luo Ping, Lu Huchuan, Li Zhenguo
- Magdi: Structured Distillation Of Multi-agent Interaction Graphs Improves Reasoning In Smaller Language Models Chen Justin Chih-yao, Saha Swarnadeep, Stengel-eskin Elias, Bansal Mohit
- Pca-bench: Evaluating Multimodal Large Language Models In Perception-cognition-action Chain Chen Liang, Zhang Yichi, Ren Shuhuai, Zhao Haozhe, Cai Zefan, Wang Yuchi, Wang Peiyi, Meng Xiangdi, Liu Tianyu, Chang Baobao
- Efficientqat: Efficient Quantization-aware Training For Large Language Models Chen Mengzhao, Shao Wenqi, Xu Peng, Wang Jiahao, Gao Peng, Zhang Kaipeng, Qiao Yu, Luo Ping
- Graphwiz: An Instruction-following Language Model For Graph Problems Chen Nuo, Li Yuhan, Tang Jianheng, Li Jia
- Lifelong Knowledge Editing For Llms With Retrieval-augmented Continuous Prompt Learning Chen Qizhou, Zhang Taolin, He Xiaofeng, Li Dongyang, Wang Chengyu, Huang Longtao, Xue Hui
- Advancing Tool-augmented Large Language Models: Integrating Insights From Errors In Inference Trees Chen Sijia, Wang Yibo, Wu Yi-feng, Chen Qing-guo, Xu Zhao, Luo Weihua, Zhang Kaifu, Zhang Lijun
- Ternaryllm: Ternarized Large Language Model Chen Tianqi, Li Zhe, Xu Weixiang, Zhu Zeyu, Li Dong, Tian Lu, Barsoum Emad, Wang Peisong, Cheng Jian
- GRATH: Gradual Self-truthifying For Large Language Models Chen Weixin, Song Dawn, Li Bo
- Beyond Natural Language: Llms Leveraging Alternative Formats For Enhanced Reasoning And Communication Chen Weize, Yuan Chenfei, Yuan Jiarui, Su Yusheng, Qian Chen, Yang Cheng, Xie Ruobing, Liu Zhiyuan, Sun Maosong
- Llast: Improved End-to-end Speech Translation System Leveraged By Large Language Models Chen Xi, Zhang Songyang, Bai Qibing, Chen Kai, Nakamura Satoshi
- Eyegpt: Ophthalmic Assistant With Large Language Models Chen Xiaolan, Zhao Ziwei, Zhang Weiyi, Xu Pusheng, Gao Le, Xu Mingpu, Wu Yue, Li Yinwen, Shi Danli, He Mingguang
- Distilling Reasoning Ability From Large Language Models With Adaptive Thinking Chen Xiaoshu, Zhou Sihang, Liang Ke, Liu Xinwang
- RL-JACK: Reinforcement Learning-powered Black-box Jailbreaking Attack Against Llms Chen Xuan, Nie Yuzhou, Yan Lu, Mao Yunshu, Guo Wenbo, Zhang Xiangyu
- Take The Bull By The Horns: Hard Sample-reweighted Continual Training Improves LLM Generalization Chen Xuxi, Wang Zhendong, Sow Daouda, Yang Junjie, Chen Tianlong, Liang Yingbin, Zhou Mingyuan, Wang Zhangyang
- Refining Packing And Shuffling Strategies For Enhanced Performance In Generative Language Models Chen Yanbing, Wang Ruilin, Yang Zihao, Jiang Lavender Yao, Oermann Eric Karl
- Recoverable Compression: A Multimodal Vision Token Recovery Mechanism Guided By Text Information Chen Yi, Xu Jian, Zhang Xu-yao, Liu Wen-zhuo, Liu Yang-yang, Liu Cheng-lin
- NACL: A General And Effective KV Cache Eviction Framework For Llms At Inference Time Chen Yilong, Wang Guoxia, Shang Junyuan, Cui Shiyao, Zhang Zhenyu, Liu Tingwen, Wang Shuohuan, Sun Yu, Yu Dianhai, Wu Hua
- MAPO: Boosting Large Language Model Performance With Model-adaptive Prompt Optimization Chen Yuyan, Wen Zhihao, Fan Ge, Chen Zhengyu, Wu Wei, Liu Dayiheng, Li Zhixu, Liu Bang, Xiao Yanghua
- Transferring Backdoors Between Large Language Models By Knowledge Distillation Cheng Pengzhou, Wu Zongru, Ju Tianjie, Du Wei, Zhang Zhuosheng, Liu Gongshen
- Information Re-organization Improves Reasoning In Large Language Models Cheng Xiaoxia, Tan Zeqi, Xue Wei, Lu Weiming
- Talking Nonsense: Probing Large Language Models' Understanding Of Adversarial Gibberish Inputs Cherepanova Valeriia, Zou James
- Exploring Quantization For Efficient Pre-training Of Transformer Language Models Chitsaz Kamran, Fournier Quentin, Mordido Gonçalo, Chandar Sarath
- Multi-granularity Guided Fusion-in-decoder Choi Eunseong, Lee Hyeri, Lee Jongwuk
- Cross-architecture Transfer Learning For Linear-cost Inference Transformers Choi Sehyun
- A Better LLM Evaluator For Text Generation: The Impact Of Prompt Output Sequencing And Optimization Chu Kuanchao, Chen Yi-pei, Nakayama Hideki
- Beyond Llms: Advancing The Landscape Of Complex Reasoning Chu-carroll Jennifer, Beck Andrew, Burnham Greg, Melville David Os, Nachman David, Özcan A. Erdem, Ferrucci David
- Crosslingual Capabilities And Knowledge Barriers In Multilingual Large Language Models Chua Lynn, Ghazi Badih, Huang Yangsibo, Kamath Pritish, Kumar Ravi, Manurangsi Pasin, Sinha Amer, Xie Chulin, Zhang Chiyuan
- Faithlm: Towards Faithful Explanations For Large Language Models Chuang Yu-neng, Wang Guanchu, Chang Chia-yuan, Tang Ruixiang, Zhong Shaochen, Yang Fan, Du Mengnan, Cai Xuanting, Hu Xia
- Sambalingo: Teaching Large Language Models New Languages Csaki Zoltan, Li Bo, Li Jonathan, Xu Qiantong, Pawakapan Pian, Zhang Leon, Du Yun, Zhao Hengyu, Hu Changran, Thakker Urmish
- Cherry On Top: Parameter Heterogeneity And Quantization In Large Language Models Cui Wanyun, Wang Qianle
- Phaseevo: Towards Unified In-context Prompt Optimization For Large Language Models Cui Wendi, Zhang Jiaxin, Li Zhuohang, Sun Hao, Lopez Damien, Das Kamalika, Malin Bradley, Kumar Sricharan
- Distillation Matters: Empowering Sequential Recommenders To Match The Performance Of Large Language Model Cui Yu, Liu Feng, Wang Pengbo, Wang Bohao, Tang Heng, Wan Yi, Wang Jun, Chen Jiawei
- Meta Large Language Model Compiler: Foundation Models Of Compiler Optimization Cummins Chris, Seeker Volker, Grubisic Dejan, Roziere Baptiste, Gehring Jonas, Synnaeve Gabriel, Leather Hugh
- Anchored Preference Optimization And Contrastive Revisions: Addressing Underspecification In Alignment D'oosterlinck Karel, Xu Winnie, Develder Chris, Demeester Thomas, Singh Amanpreet, Potts Christopher, Kiela Douwe, Mehri Shikib
- Improve Student's Reasoning Generalizability Through Cascading Decomposed Cots Distillation Dai Chengwei, Li Kun, Zhou Wei, Hu Songlin
- CORM: Cache Optimization With Recent Message For Large Language Model Inference Dai Jincheng, Huang Zhuowei, Jiang Haiyun, Chen Chen, Cai Deng, Bi Wei, Shi Shuming
- Generating Code World Models With Large Language Models Guided By Monte Carlo Tree Search Dainese Nicola, Merler Matteo, Alakuijala Minttu, Marttinen Pekka
- A Complete Survey On Llm-based AI Chatbots Dam Sumit Kumar, Hong Choong Seon, Qiao Yu, Zhang Chaoning
- Active Preference Optimization For Sample Efficient RLHF Das Nirjhar, Chakraborty Souradip, Pacchiano Aldo, Chowdhury Sayak Ray
- Low-rank Finetuning For Llms: A Fairness Perspective Das Saswat, Romanelli Marco, Tran Cuong, Reza Zarreen, Kailkhura Bhavya, Fioretto Ferdinando
- MIMIR: A Streamlined Platform For Personalized Agent Tuning In Domain Expertise Deng Chunyuan, Tang Xiangru, Zhao Yilun, Wang Hanming, Wang Haoran, Zhou Wangchunshu, Cohan Arman, Gerstein Mark
- ELASTIC: Efficient Linear Attention For Sequential Interest Compression Deng Jiaxin, Wang Shiyao, Lu Song, Li Yinfeng, Luo Xinchen, Liu Yuanjun, Xu Peixing, Zhou Guorui
- Attention Is Naturally Sparse With Gaussian Distributed Input Deng Yichuan, Song Zhao, Yang Chiwun
- LMGT: Optimizing Exploration-exploitation Balance In Reinforcement Learning Through Language Model Guided Trade-offs Deng Yongxin, Qiu Xihe, Tan Xiaoyu, Chu Wei, Xu Yinghui
- Unveiling Encoder-free Vision-language Models Diao Haiwen, Cui Yufeng, Li Xiaotong, Wang Yueze, Lu Huchuan, Wang Xinlong
- Longrope: Extending LLM Context Window Beyond 2 Million Tokens Ding Yiran, Zhang Li Lyna, Zhang Chengruidong, Xu Yuanyuan, Shang Ning, Xu Jiahang, Yang Fan, Yang Mao
- Large Language Models Prompting With Episodic Memory Do Dai, Tran Quan, Venkatesh Svetha, Le Hung
- Automatic Prompt Selection For Large Language Models Do Viet-tung, Hoang Van-khanh, Nguyen Duy-hung, Sabahi Shahab, Yang Jeff, Hotta Hajime, Nguyen Minh-tien, Le Hung
- Language Adaptation On A Tight Academic Compute Budget: Tokenizer Swapping Works And Pure Bfloat16 Is Enough Dobler Konstantin, De Melo Gerard
- A Tale Of Tails: Model Collapse As A Change Of Scaling Laws Dohmatob Elvis, Feng Yunzhen, Yang Pu, Charton Francois, Kempe Julia
- Self-play With Execution Feedback: Improving Instruction-following Capabilities Of Large Language Models Dong Guanting, Lu Keming, Li Chengpeng, Xia Tingyu, Yu Bowen, Zhou Chang, Zhou Jingren
- Baichuanseed: Sharing The Potential Of Extensive Data Collection And Deduplication By Introducing A Competitive Large Language Model Baseline Dong Guosheng, Pan Da, Sun Yiding, Zhang Shusen, Liang Zheng, Wu Xin, Shen Yanjun, Yang Fan, Sun Haoze, Li Tianpeng, Lin Mingan, Xu Jianhua, Zhang Yufan, Nie Xiaonan, Su Lei, Wang Bingning, Zhang Wentao, Mao Jiaxin, Zhou Zenan, Chen Weipeng
- Pruner-zero: Evolving Symbolic Pruning Metric From Scratch For Large Language Models Dong Peijie, Li Lujun, Tang Zhenheng, Liu Xiang, Pan Xinglin, Wang Qiang, Chu Xiaowen
- QAQ: Quality Adaptive Quantization For LLM KV Cache Dong Shichen, Cheng Wen, Qin Jiayu, Wang Wei
- A Framework For Real-time Safeguarding The Text Generation Of Large Language Model Dong Ximing, Lin Dayi, Wang Shaowei, Hassan Ahmed E.
- Unmemorization In Large Language Models Via Self-distillation And Deliberate Imagination Dong Yijiang River, Lin Hongzhou, Belkin Mikhail, Huerta Ramon, Vulić Ivan
- Jailbreaking Text-to-image Models With Llm-based Agents Dong Yingkai, Li Zheng, Meng Xiangtao, Yu Ning, Guo Shanqing
- Prompt Your Brain: Scaffold Prompt Tuning For Efficient Adaptation Of Fmri Pre-trained Model Dong Zijian, Wu Yilei, Chen Zijiao, Zhang Yichi, Jin Yueming, Zhou Juan Helen
- Learning From Students: Applying T-distributions To Explore Accurate And Efficient Formats For Llms Dotzel Jordan, Chen Yuzong, Kotb Bahaa, Prasad Sushma, Wu Gang, Li Sheng, Abdelfattah Mohamed S., Zhang Zhiru
- Stepcoder: Improve Code Generation With Reinforcement Learning From Compiler Feedback Dou Shihan, Liu Yan, Jia Haoxiang, Xiong Limao, Zhou Enyu, Shen Wei, Shan Junjie, Huang Caishuang, Wang Xiao, Fan Xiaoran, Xi Zhiheng, Zhou Yuhao, Ji Tao, Zheng Rui, Zhang Qi, Huang Xuanjing, Gui Tao
- Bitdistiller: Unleashing The Potential Of Sub-4-bit Llms Via Self-distillation Du Dayou, Zhang Yijia, Cao Shijie, Guo Jiaqi, Cao Ting, Chu Xiaowen, Xu Ningyi
- Multi-line Ai-assisted Code Authoring Dunay Omer, Cheng Daniel, Tait Adam, Thakkar Parth, Rigby Peter C, Chiu Andy, Ahmad Imad, Ganesan Arun, Maddila Chandra, Murali Vijayaraghavan, Tayyebi Ali, Nagappan Nachiappan
- Accuracy Is Not All You Need Dutta Abhinav, Krishnan Sanjeev, Kwatra Nipun, Ramjee Ramachandran
- CROME: Cross-modal Adapters For Efficient Multimodal LLM Ebrahimi Sayna, Arik Sercan O., Nama Tejas, Pfister Tomas
- Lightweight Large Language Model For Medication Enquiry: Med-pal Elangovan Kabilan, Ong Jasmine Chiat Ling, Jin Liyuan, Seng Benjamin Jun Jie, Kwan Yu Heng, Tan Lit Soo, Zhong Ryan Jian, Ma Justina Koi Li, Ke Yuhe, Liu Nan, Giacomini Kathleen M, Ting Daniel Shu Wei
- Dynamic Q&A Of Clinical Documents With Large Language Models Elgedawy Ran, Danciu Ioana, Mahbub Maria, Srinivasan Sudarshan
- From LLM To NMT: Advancing Low-resource Machine Translation With Claude Enis Maxim, Hopkins Mark
- Tinyagent: Function Calling At The Edge Erdogan Lutfi Eren, Lee Nicholas, Jha Siddharth, Kim Sehoon, Tabrizi Ryan, Moon Suhong, Hooper Coleman, Anumanchipalli Gopala, Keutzer Kurt, Gholami Amir
- Combinatorial Reasoning: Selecting Reasons In Generative AI Pipelines Via Combinatorial Optimization Esencan Mert, Kumar Tarun Advaith, Asanjan Ata Akbari, Lott P. Aaron, Mohseni Masoud, Unlu Can, Venturelli Davide, Ho Alan
- Prompt Optimizer Of Text-to-image Diffusion Models For Abstract Concept Understanding Fan Zezhong, Li Xiaohan, Fang Chenhao, Biswas Topojoy, Nag Kaushiki, Xu Jianpeng, Achan Kannan
- Multi-modal Hallucination Control By Visual Information Grounding Favero Alessandro, Zancato Luca, Trager Matthew, Choudhary Siddharth, Perera Pramuditha, Achille Alessandro, Swaminathan Ashwin, Soatto Stefano
- Legend: Leveraging Representation Engineering To Annotate Safety Margin For Preference Datasets Feng Duanyu, Qin Bowen, Huang Chen, Huang Youcheng, Zhang Zheng, Lei Wenqiang
- Keypoint-based Progressive Chain-of-thought Distillation For Llms Feng Kaituo, Li Changsheng, Zhang Xiaolu, Zhou Jun, Yuan Ye, Wang Guoren
- An Extremely Data-efficient And Generative Llm-based Reinforcement Learning Agent For Recommenders Feng Shuang, Feng Grace
- Generative AI And Large Language Models For Cyber Security: All Insights You Need Ferrag Mohamed Amine, Alwahedi Fatima, Battah Ammar, Cherif Bilel, Mechri Abdechakour, Tihanyi Norbert
- The Chronicles Of RAG: The Retriever, The Chunk And The Generator Finardi Paulo, Avila Leonardo, Castaldoni Rodrigo, Gengo Pedro, Larcher Celio, Piau Marcos, Costa Pablo, Caridá Vinicius
- Introducing The Newspalm MBR And QE Dataset: Llm-generated High-quality Parallel Data Outperforms Traditional Web-crawled Data Finkelstein Mara, Vilar David, Freitag Markus
- Contrastive Policy Gradient: Aligning Llms On Sequence-level Scores In A Supervised-friendly Fashion Flet-berliac Yannis, Grinsztajn Nathan, Strub Florian, Choi Eugene, Cremer Chris, Ahmadian Arash, Chandak Yash, Azar Mohammad Gheshlaghi, Pietquin Olivier, Geist Matthieu
- Geckopt: LLM System Efficiency Via Intent-based Tool Selection Fore Michael, Singh Simranjit, Stamoulis Dimitrios
- Conversational Assistants In Knowledge-intensive Contexts: An Evaluation Of LLM- Versus Intent-based Systems Freire Samuel Kernan, Wang Chaofan, Niforatos Evangelos
- Autorag-hp: Automatic Online Hyper-parameter Tuning For Retrieval-augmented Generation Fu Jia, Qin Xiaoting, Yang Fangkai, Wang Lu, Zhang Jue, Lin Qingwei, Chen Yubo, Zhang Dongmei, Rajmohan Saravan, Zhang Qi
- Lazyllm: Dynamic Token Pruning For Efficient Long Context LLM Inference Fu Qichen, Cho Minsik, Merth Thomas, Mehta Sachin, Rastegari Mohammad, Najibi Mahyar
- Moa: Mixture Of Sparse Attention For Automatic Large Language Model Compression Fu Tianyu, Huang Haofeng, Ning Xuefei, Zhang Genghan, Chen Boju, Wu Tianqi, Wang Hongyi, Huang Zixiao, Li Shiyao, Yan Shengen, Dai Guohao, Yang Huazhong, Wang Yu
- Continual Pre-training For Cross-lingual LLM Adaptation: Enhancing Japanese Language Capabilities Fujii Kazuki, Nakamura Taishi, Loem Mengsay, Iida Hiroki, Ohi Masanari, Hattori Kakeru, Hirai Shota, Mizuki Sakae, Yokota Rio, Okazaki Naoaki
- Application Of LLM Agents In Recruitment: A Novel Framework For Resume Screening Gan Chengguang, Zhang Qinghao, Mori Tatsunori
- Meta Reasoning For Large Language Models Gao Peizhong, Xie Ao, Mao Shaoguang, Wu Wenshan, Xia Yan, Mi Haipeng, Wei Furu
- Compensate Quantization Errors: Make Weights Hierarchical To Compensate Each Other Gao Yifei, Ou Jie, Wang Lei, Xiao Yuting, Xiang Zhiyuan, Dai Ruiting, Cheng Jun
- Attacking Large Language Models With Projected Gradient Descent Geisler Simon, Wollschläger Tom, Abdalla M. H. I., Gasteiger Johannes, Günnemann Stephan
- Gemini 1.5: Unlocking Multimodal Understanding Across Millions Of Tokens Of Context Gemini Team, Georgiev Petko, Lei Ving Ian, Burnell Ryan, Bai Libin, Gulati Anmol, Tanzer Garrett, Vincent Damien, Pan Zhufeng, Wang Shibo, Mariooryad Soroosh, Ding Yifan, Geng Xinyang, Alcober Fred, Frostig Roy, Omernick Mark, Walker Lexi, Paduraru Cosmin, Sorokin Christina, Tacchetti Andrea, Gaffney Colin, Daruki Samira, Sercinoglu Olcan, Gleicher Zach, Love Juliette, Voigtlaender Paul, Jain Rohan, Surita Gabriela, Mohamed Kareem, Blevins Rory, Ahn Junwhan, Zhu Tao, Kawintiranon Kornraphop, Firat Orhan, Gu Yiming, Zhang Yujing, Rahtz Matthew, Faruqui Manaal, Clay Natalie, Gilmer Justin, Co-reyes Jd, Penchev Ivo, Zhu Rui, Morioka Nobuyuki, Hui Kevin, Haridasan Krishna, Campos Victor, Mahdieh Mahdis, Guo Mandy, Hassan Samer, Kilgour Kevin, Vezer Arpi, Cheng Heng-tze, De Liedekerke Raoul, Goyal Siddharth, Barham Paul, Strouse Dj, Noury Seb, Adler Jonas, Sundararajan Mukund, Vikram Sharad, Lepikhin Dmitry, Paganini Michela, Garcia Xavier, Yang Fan, Valter Dasha, Trebacz Maja, Vodrahalli Kiran, Asawaroengchai Chulayuth, Ring Roman, Kalb Norbert, Soares Livio Baldini, Brahma Siddhartha, Steiner David, Yu Tianhe, Mentzer Fabian, He Antoine, Gonzalez Lucas, Xu Bibo, Kaufman Raphael Lopez, Shafey Laurent El, Oh Junhyuk, Hennigan Tom, Driessche George Van Den, Odoom Seth, Lucic Mario, Roelofs Becca, Lall Sid, Marathe Amit, Chan Betty, Ontanon Santiago, He Luheng, Teplyashin Denis, Lai Jonathan, Crone Phil, Damoc Bogdan, Ho Lewis, Riedel Sebastian, Lenc Karel, Yeh Chih-kuan, Chowdhery Aakanksha, Xu Yang, Kazemi Mehran, Amid Ehsan, Petrushkina Anastasia, Swersky Kevin, Khodaei Ali, Chen Gowoon, Larkin Chris, Pinto Mario, Yan Geng, Badia Adria Puigdomenech, Patil Piyush, Hansen Steven, Orr Dave, Arnold Sebastien M. 
R., Grimstad Jordan, Dai Andrew, Douglas Sholto, Sinha Rishika, Yadav Vikas, Chen Xi, Gribovskaya Elena, Austin Jacob, Zhao Jeffrey, Patel Kaushal, Komarek Paul, Austin Sophia, Borgeaud Sebastian, Friso Linda, Goyal Abhimanyu, Caine Ben, Cao Kris, Chung Da-woon, Lamm Matthew, Barth-maron Gabe, Kagohara Thais, Olszewska Kate, Chen Mia, Shivakumar Kaushik, Agarwal Rishabh, Godhia Harshal, Rajwar Ravi, Snaider Javier, Dotiwalla Xerxes, Liu Yuan, Barua Aditya, Ungureanu Victor, Zhang Yuan, Batsaikhan Bat-orgil, Wirth Mateo, Qin James, Danihelka Ivo, Doshi Tulsee, Chadwick Martin, Chen Jilin, Jain Sanil, Le Quoc, Kar Arjun, Gurumurthy Madhu, Li Cheng, Sang Ruoxin, Liu Fangyu, Lamprou Lampros, Munoz Rich, Lintz Nathan, Mehta Harsh, Howard Heidi, Reynolds Malcolm, Aroyo Lora, Wang Quan, Blanco Lorenzo, Cassirer Albin, Griffith Jordan, Das Dipanjan, Lee Stephan, Sygnowski Jakub, Fisher Zach, Besley James, Powell Richard, Ahmed Zafarali, Paulus Dominik, Reitter David, Borsos Zalan, Joshi Rishabh, Pope Aedan, Hand Steven, Selo Vittorio, Jain Vihan, Sethi Nikhil, Goel Megha, Makino Takaki, May Rhys, Yang Zhen, Schalkwyk Johan, Butterfield Christina, Hauth Anja, Goldin Alex, Hawkins Will, Senter Evan, Brin Sergey, Woodman Oliver, Ritter Marvin, Noland Eric, Giang Minh, Bolina Vijay, Lee Lisa, Blyth Tim, Mackinnon Ian, Reid Machel, Sarvana Obaid, Silver David, Chen Alexander, Wang Lily, Maggiore Loren, Chang Oscar, Attaluri Nithya, Thornton Gregory, Chiu Chung-cheng, Bunyan Oskar, Levine Nir, Chung Timothy, Eltyshev Evgenii, Si Xiance, Lillicrap Timothy, Brady Demetra, Aggarwal Vaibhav, Wu Boxi, Xu Yuanzhong, Mcilroy Ross, Badola Kartikeya, Sandhu Paramjit, Moreira Erica, Stokowiec Wojciech, Hemsley Ross, Li Dong, Tudor Alex, Shyam Pranav, Rahimtoroghi Elahe, Haykal Salem, Sprechmann Pablo, Zhou Xiang, Mincu Diana, Li Yujia, Addanki Ravi, Krishna Kalpesh, Wu Xiao, Frechette Alexandre, Eyal Matan, Dafoe Allan, Lacey Dave, Whang Jay, Avrahami Thi, Zhang Ye, Taropa Emanuel, Lin 
Hanzhao, Toyama Daniel, Rutherford Eliza, Sano Motoki, Choe Hyunjeong, Tomala Alex, Safranek-shrader Chalence, Kassner Nora, Pajarskas Mantas, Harvey Matt, Sechrist Sean, Fortunato Meire, Lyu Christina, Elsayed Gamaleldin, Kuang Chenkai, Lottes James, Chu Eric, Jia Chao, Chen Chih-wei, Humphreys Peter, Baumli Kate, Tao Connie, Samuel Rajkumar, Santos Cicero Nogueira Dos, Andreassen Anders, Rakićević Nemanja, Grewe Dominik, Kumar Aviral, Winkler Stephanie, Caton Jonathan, Brock Andrew, Dalmia Sid, Sheahan Hannah, Barr Iain, Miao Yingjie, Natsev Paul, Devlin Jacob, Behbahani Feryal, Prost Flavien, Sun Yanhua, Myaskovsky Artiom, Pillai Thanumalayan Sankaranarayana, Hurt Dan, Lazaridou Angeliki, Xiong Xi, Zheng Ce, Pardo Fabio, Li Xiaowei, Horgan Dan, Stanton Joe, Ambar Moran, Xia Fei, Lince Alejandro, Wang Mingqiu, Mustafa Basil, Webson Albert, Lee Hyo, Anil Rohan, Wicke Martin, Dozat Timothy, Sinha Abhishek, Piqueras Enrique, Dabir Elahe, Upadhyay Shyam, Boral Anudhyan, Hendricks Lisa Anne, Fry Corey, Djolonga Josip, Su Yi, Walker Jake, Labanowski Jane, Huang Ronny, Misra Vedant, Chen Jeremy, Skerry-ryan Rj, Singh Avi, Rijhwani Shruti, Yu Dian, Castro-ros Alex, Changpinyo Beer, Datta Romina, Bagri Sumit, Hrafnkelsson Arnar Mar, Maggioni Marcello, Zheng Daniel, Sulsky Yury, Hou Shaobo, Paine Tom Le, Yang Antoine, Riesa Jason, Rogozinska Dominika, Marcus Dror, Badawy Dalia El, Zhang Qiao, Wang Luyu, Miller Helen, Greer Jeremy, Sjos Lars Lowe, Nova Azade, Zen Heiga, Chaabouni Rahma, Rosca Mihaela, Jiang Jiepu, Chen Charlie, Liu Ruibo, Sainath Tara, Krikun Maxim, Polozov Alex, Lespiau Jean-baptiste, Newlan Josh, Cankara Zeyncep, Kwak Soo, Xu Yunhan, Chen Phil, Coenen Andy, Meyer Clemens, Tsihlas Katerina, Ma Ada, Gottweis Juraj, Xing Jinwei, Gu Chenjie, Miao Jin, Frank Christian, Cankara Zeynep, Ganapathy Sanjay, Dasgupta Ishita, Hughes-fitt Steph, Chen Heng, Reid David, Rong Keran, Fan Hongmin, Van Amersfoort Joost, Zhuang Vincent, Cohen Aaron, Gu Shixiang Shane, 
Mohananey Anhad, Ilic Anastasija, Tobin Taylor, Wieting John, Bortsova Anna, Thacker Phoebe, Wang Emma, Caveness Emily, Chiu Justin, Sezener Eren, Kaskasoli Alex, Baker Steven, Millican Katie, Elhawaty Mohamed, Aisopos Kostas, Lebsack Carl, Byrd Nathan, Dai Hanjun, Jia Wenhao, Wiethoff Matthew, Davoodi Elnaz, Weston Albert, Yagati Lakshman, Ahuja Arun, Gao Isabel, Pundak Golan, Zhang Susan, Azzam Michael, Sim Khe Chai, Caelles Sergi, Keeling James, Sharma Abhanshu, Swing Andy, Li Yaguang, Liu Chenxi, Bostock Carrie Grimes, Bansal Yamini, Nado Zachary, Anand Ankesh, Lipschultz Josh, Karmarkar Abhijit, Proleev Lev, Ittycheriah Abe, Yeganeh Soheil Hassas, Polovets George, Faust Aleksandra, Sun Jiao, Rrustemi Alban, Li Pen, Shivanna Rakesh, Liu Jeremiah, Welty Chris, Lebron Federico, Baddepudi Anirudh, Krause Sebastian, Parisotto Emilio, Soricut Radu, Xu Zheng, Bloxwich Dawn, Johnson Melvin, Neyshabur Behnam, Mao-jones Justin, Wang Renshen, Ramasesh Vinay, Abbas Zaheer, Guez Arthur, Segal Constant, Nguyen Duc Dung, Svensson James, Hou Le, York Sarah, Milan Kieran, Bridgers Sophie, Gworek Wiktor, Tagliasacchi Marco, Lee-thorp James, Chang Michael, Guseynov Alexey, Hartman Ale Jakse, Kwong Michael, Zhao Ruizhe, Kashem Sheleem, Cole Elizabeth, Miech Antoine, Tanburn Richard, Phuong Mary, Pavetic Filip, Cevey Sebastien, Comanescu Ramona, Ives Richard, Yang Sherry, Du Cosmo, Li Bo, Zhang Zizhao, Iinuma Mariko, Hu Clara Huiyi, Roy Aurko, Bijwadia Shaan, Zhu Zhenkai, Martins Danilo, Saputro Rachel, Gergely Anita, Zheng Steven, Jia Dawei, Antonoglou Ioannis, Sadovsky Adam, Gu Shane, Bi Yingying, Andreev Alek, Samangooei Sina, Khan Mina, Kocisky Tomas, Filos Angelos, Kumar Chintu, Bishop Colton, Yu Adams, Hodkinson Sarah, Mittal Sid, Shah Premal, Moufarek Alexandre, Cheng Yong, Bloniarz Adam, Lee Jaehoon, Pejman Pedram, Michel Paul, Spencer Stephen, Feinberg Vladimir, Xiong Xuehan, Savinov Nikolay, Smith Charlotte, Shakeri Siamak, Tran Dustin, Chesus Mary, Bohnet Bernd, Tucker 
George, Von Glehn Tamara, Muir Carrie, Mao Yiran, Kazawa Hideto, Slone Ambrose, Soparkar Kedar, Shrivastava Disha, Cobon-kerr James, Sharman Michael, Pavagadhi Jay, Araya Carlos, Misiunas Karolis, Ghelani Nimesh, Laskin Michael, Barker David, Li Qiujia, Briukhov Anton, Houlsby Neil, Glaese Mia, Lakshminarayanan Balaji, Schucher Nathan, Tang Yunhao, Collins Eli, Lim Hyeontaek, Feng Fangxiaoyu, Recasens Adria, Lai Guangda, Magni Alberto, De Cao Nicola, Siddhant Aditya, Ashwood Zoe, Orbay Jordi, Dehghani Mostafa, Brennan Jenny, He Yifan, Xu Kelvin, Gao Yang, Saroufim Carl, Molloy James, Wu Xinyi, Arnold Seb, Chang Solomon, Schrittwieser Julian, Buchatskaya Elena, Radpour Soroush, Polacek Martin, Giordano Skye, Bapna Ankur, Tokumine Simon, Hellendoorn Vincent, Sottiaux Thibault, Cogan Sarah, Severyn Aliaksei, Saleh Mohammad, Thakoor Shantanu, Shefey Laurent, Qiao Siyuan, Gaba Meenu, Chang Shuo-yiin, Swanson Craig, Zhang Biao, Lee Benjamin, Rubenstein Paul Kishan, Song Gan, Kwiatkowski Tom, Koop Anna, Kannan Ajay, Kao David, Schuh Parker, Stjerngren Axel, Ghiasi Golnaz, Gibson Gena, Vilnis Luke, Yuan Ye, Ferreira Felipe Tiengo, Kamath Aishwarya, Klimenko Ted, Franko Ken, Xiao Kefan, Bhattacharya Indro, Patel Miteyan, Wang Rui, Morris Alex, Strudel Robin, Sharma Vivek, Choy Peter, Hashemi Sayed Hadi, Landon Jessica, Finkelstein Mara, Jhakra Priya, Frye Justin, Barnes Megan, Mauger Matthew, Daun Dennis, Baatarsukh Khuslen, Tung Matthew, Farhan Wael, Michalewski Henryk, Viola Fabio, Quitry Felix De Chaumont, Lan Charline Le, Hudson Tom, Wang Qingze, Fischer Felix, Zheng Ivy, White Elspeth, Dragan Anca, Alayrac Jean-baptiste, Ni Eric, Pritzel Alexander, Iwanicki Adam, Isard Michael, Bulanova Anna, Zilka Lukas, Dyer Ethan, Sachan Devendra, Srinivasan Srivatsan, Muckenhirn Hannah, Cai Honglong, Mandhane Amol, Tariq Mukarram, Rae Jack W., Wang Gary, Ayoub Kareem, Fitzgerald Nicholas, Zhao Yao, Han Woohyun, Alberti Chris, Garrette Dan, Krishnakumar Kashyap, Gimenez Mai, 
Levskaya Anselm, Sohn Daniel, Matak Josip, Iturrate Inaki, Chang Michael B., Xiang Jackie, Cao Yuan, Ranka Nishant, Brown Geoff, Hutter Adrian, Mirrokni Vahab, Chen Nanxin, Yao Kaisheng, Egyed Zoltan, Galilee Francois, Liechty Tyler, Kallakuri Praveen, Palmer Evan, Ghemawat Sanjay, Liu Jasmine, Tao David, Thornton Chloe, Green Tim, Jasarevic Mimi, Lin Sharon, Cotruta Victor, Tan Yi-xuan, Fiedel Noah, Yu Hongkun, Chi Ed, Neitz Alexander, Heitkaemper Jens, Sinha Anu, Zhou Denny, Sun Yi, Kaed Charbel, Hulse Brice, Mishra Swaroop, Georgaki Maria, Kudugunta Sneha, Farabet Clement, Shafran Izhak, Vlasic Daniel, Tsitsulin Anton, Ananthanarayanan Rajagopal, Carin Alen, Su Guolong, Sun Pei, V Shashank, Carvajal Gabriel, Broder Josef, Comsa Iulia, Repina Alena, Wong William, Chen Warren Weilun, Hawkins Peter, Filonov Egor, Loher Lucia, Hirnschall Christoph, Wang Weiyi, Ye Jingchen, Burns Andrea, Cate Hardie, Wright Diana Gage, Piccinini Federico, Zhang Lei, Lin Chu-cheng, Gog Ionel, Kulizhskaya Yana, Sreevatsa Ashwin, Song Shuang, Cobo Luis C., Iyer Anand, Tekur Chetan, Garrido Guillermo, Xiao Zhuyun, Kemp Rupert, Zheng Huaixiu Steven, Li Hui, Agarwal Ananth, Ngani Christel, Goshvadi Kati, Santamaria-fernandez Rebeca, Fica Wojciech, Chen Xinyun, Gorgolewski Chris, Sun Sean, Garg Roopal, Ye Xinyu, Eslami S. M. 
Ali, Hua Nan, Simon Jon, Joshi Pratik, Kim Yelin, Tenney Ian, Potluri Sahitya, Thiet Lam Nguyen, Yuan Quan, Luisier Florian, Chronopoulou Alexandra, Scellato Salvatore, Srinivasan Praveen, Chen Minmin, Koverkathu Vinod, Dalibard Valentin, Xu Yaming, Saeta Brennan, Anderson Keith, Sellam Thibault, Fernando Nick, Huot Fantine, Jung Junehyuk, Varadarajan Mani, Quinn Michael, Raul Amit, Le Maigo, Habalov Ruslan, Clark Jon, Jalan Komal, Bullard Kalesha, Singhal Achintya, Luong Thang, Wang Boyu, Rajayogam Sujeevan, Eisenschlos Julian, Jia Johnson, Finchelstein Daniel, Yakubovich Alex, Balle Daniel, Fink Michael, Agarwal Sameer, Li Jing, Dvijotham Dj, Pal Shalini, Kang Kai, Konzelmann Jaclyn, Beattie Jennifer, Dousse Olivier, Wu Diane, Crocker Remi, Elkind Chen, Jonnalagadda Siddhartha Reddy, Lee Jong, Holtmann-rice Dan, Kallarackal Krystal, Liu Rosanne, Vnukov Denis, Vats Neera, Invernizzi Luca, Jafari Mohsen, Zhou Huanjie, Taylor Lilly, Prendki Jennifer, Wu Marcus, Eccles Tom, Liu Tianqi, Kopparapu Kavya, Beaufays Francoise, Angermueller Christof, Marzoca Andreea, Sarcar Shourya, Dib Hilal, Stanway Jeff, Perbet Frank, Trdin Nejc, Sterneck Rachel, Khorlin Andrey, Li Dinghua, Wu Xihui, Goenka Sonam, Madras David, Goldshtein Sasha, Gierke Willi, Zhou Tong, Liu Yaxin, Liang Yannie, White Anais, Li Yunjie, Singh Shreya, Bahargam Sanaz, Epstein Mark, Basu Sujoy, Lao Li, Ozturel Adnan, Crous Carl, Zhai Alex, Lu Han, Tung Zora, Gaur Neeraj, Walton Alanna, Dixon Lucas, Zhang Ming, Globerson Amir, Uy Grant, Bolt Andrew, Wiles Olivia, Nasr Milad, Shumailov Ilia, Selvi Marco, Piccinno Francesco, Aguilar Ricardo, Mccarthy Sara, Khalman Misha, Shukla Mrinal, Galic Vlado, Carpenter John, Villela Kevin, Zhang Haibin, Richardson Harry, Martens James, Bosnjak Matko, Belle Shreyas Rammohan, Seibert Jeff, Alnahlawi Mahmoud, Mcwilliams Brian, Singh Sankalp, Louis Annie, Ding Wen, Popovici Dan, Simicich Lenin, Knight Laura, Mehta Pulkit, Gupta Nishesh, Shi Chongyang, Fatehi Saaber, Mitrovic 
Jovana, Grills Alex, Pagadora Joseph, Petrova Dessie, Eisenbud Danielle, Zhang Zhishuai, Yates Damion, Mittal Bhavishya, Tripuraneni Nilesh, Assael Yannis, Brovelli Thomas, Jain Prateek, Velimirovic Mihajlo, Akbulut Canfer, Mu Jiaqi, Macherey Wolfgang, Kumar Ravin, Xu Jun, Qureshi Haroon, Comanici Gheorghe, Wiesner Jeremy, Gong Zhitao, Ruddock Anton, Bauer Matthias, Felt Nick, Gp Anirudh, Arnab Anurag, Zelle Dustin, Rothfuss Jonas, Rosgen Bill, Shenoy Ashish, Seybold Bryan, Li Xinjian, Mudigonda Jayaram, Erdogan Goker, Xia Jiawei, Simsa Jiri, Michi Andrea, Yao Yi, Yew Christopher, Kan Steven, Caswell Isaac, Radebaugh Carey, Elisseeff Andre, Valenzuela Pedro, Mckinney Kay, Paterson Kim, Cui Albert, Latorre-chimoto Eri, Kim Solomon, Zeng William, Durden Ken, Ponnapalli Priya, Sosea Tiberiu, Choquette-choo Christopher A., Manyika James, Robenek Brona, Vashisht Harsha, Pereira Sebastien, Lam Hoi, Velic Marko, Owusu-afriyie Denese, Lee Katherine, Bolukbasi Tolga, Parrish Alicia, Lu Shawn, Park Jane, Venkatraman Balaji, Talbert Alice, Rosique Lambert, Cheng Yuchung, Sozanschi Andrei, Paszke Adam, Kumar Praveen, Austin Jessica, Li Lu, Salama Khalid, Kim Wooyeol, Dukkipati Nandita, Baryshnikov Anthony, Kaplanis Christos, Sheng Xianghai, Chervonyi Yuri, Unlu Caglar, Casas Diego De Las, Askham Harry, Tunyasuvunakool Kathryn, Gimeno Felix, Poder Siim, Kwak Chester, Miecnikowski Matt, Mirrokni Vahab, Dimitriev Alek, Parisi Aaron, Liu Dangyi, Tsai Tomy, Shevlane Toby, Kouridi Christina, Garmon Drew, Goedeckemeyer Adrian, Brown Adam R., Vijayakumar Anitha, Elqursh Ali, Jazayeri Sadegh, Huang Jin, Carthy Sara Mc, Hoover Jay, Kim Lucy, Kumar Sandeep, Chen Wei, Biles Courtney, Bingham Garrett, Rosen Evan, Wang Lisa, Tan Qijun, Engel David, Pongetti Francesco, De Cesare Dario, Hwang Dongseong, Yu Lily, Pullman Jennifer, Narayanan Srini, Levin Kyle, Gopal Siddharth, Li Megan, Aharoni Asaf, Trinh Trieu, Lo Jessica, Casagrande Norman, Vij Roopali, Matthey Loic, Ramadhana Bramandia, 
Matthews Austin, Carey Cj, Johnson Matthew, Goranova Kremena, Shah Rohin, Ashraf Shereen, Dasgupta Kingshuk, Larsen Rasmus, Wang Yicheng, Vuyyuru Manish Reddy, Jiang Chong, Ijazi Joana, Osawa Kazuki, Smith Celine, Boppana Ramya Sree, Bilal Taylan, Koizumi Yuma, Xu Ying, Altun Yasemin, Shabat Nir, Bariach Ben, Korchemniy Alex, Choo Kiam, Ronneberger Olaf, Iwuanyanwu Chimezie, Zhao Shubin, Soergel David, Hsieh Cho-jui, Cai Irene, Iqbal Shariq, Sundermeyer Martin, Chen Zhe, Bursztein Elie, Malaviya Chaitanya, Biadsy Fadi, Shroff Prakash, Dhillon Inderjit, Latkar Tejasi, Dyer Chris, Forbes Hannah, Nicosia Massimo, Nikolaev Vitaly, Greene Somer, Georgiev Marin, Wang Pidong, Martin Nina, Sedghi Hanie, Zhang John, Banzal Praseem, Fritz Doug, Rao Vikram, Wang Xuezhi, Zhang Jiageng, Patraucean Viorica, Du Dayou, Mordatch Igor, Jurin Ivan, Liu Lewis, Dubey Ayush, Mohan Abhi, Nowakowski Janek, Ion Vlad-doru, Wei Nan, Tojo Reiko, Raad Maria Abi, Hudson Drew A., Keshava Vaishakh, Agrawal Shubham, Ramirez Kevin, Wu Zhichun, Nguyen Hoang, Liu Ji, Sewak Madhavi, Petrini Bryce, Choi Donghyun, Philips Ivan, Wang Ziyue, Bica Ioana, Garg Ankush, Wilkiewicz Jarek, Agrawal Priyanka, Li Xiaowei, Guo Danhao, Xue Emily, Shaik Naseer, Leach Andrew, Khan Sadh Mnm, Wiesinger Julia, Jerome Sammy, Chakladar Abhishek, Wang Alek Wenjiao, Ornduff Tina, Abu Folake, Ghaffarkhah Alireza, Wainwright Marcus, Cortes Mario, Liu Frederick, Maynez Joshua, Terzis Andreas, Samangouei Pouya, Mansour Riham, Kępa Tomasz, Aubet François-xavier, Algymr Anton, Banica Dan, Weisz Agoston, Orban Andras, Senges Alexandre, Andrejczuk Ewa, Geller Mark, Santo Niccolo Dal, Anklin Valentin, Merey Majd Al, Baeuml Martin, Strohman Trevor, Bai Junwen, Petrov Slav, Wu Yonghui, Hassabis Demis, Kavukcuoglu Koray, Dean Jeffrey, Vinyals Oriol
- Gemma 2: Improving Open Language Models At A Practical Size Gemma Team, Riviere Morgane, Pathak Shreya, Sessa Pier Giuseppe, Hardin Cassidy, Bhupatiraju Surya, Hussenot Léonard, Mesnard Thomas, Shahriari Bobak, Ramé Alexandre, Ferret Johan, Liu Peter, Tafti Pouya, Friesen Abe, Casbon Michelle, Ramos Sabela, Kumar Ravin, Lan Charline Le, Jerome Sammy, Tsitsulin Anton, Vieillard Nino, Stanczyk Piotr, Girgin Sertan, Momchev Nikola, Hoffman Matt, Thakoor Shantanu, Grill Jean-bastien, Neyshabur Behnam, Bachem Olivier, Walton Alanna, Severyn Aliaksei, Parrish Alicia, Ahmad Aliya, Hutchison Allen, Abdagic Alvin, Carl Amanda, Shen Amy, Brock Andy, Coenen Andy, Laforge Anthony, Paterson Antonia, Bastian Ben, Piot Bilal, Wu Bo, Royal Brandon, Chen Charlie, Kumar Chintu, Perry Chris, Welty Chris, Choquette-choo Christopher A., Sinopalnikov Danila, Weinberger David, Vijaykumar Dimple, Rogozińska Dominika, Herbison Dustin, Bandy Elisa, Wang Emma, Noland Eric, Moreira Erica, Senter Evan, Eltyshev Evgenii, Visin Francesco, Rasskin Gabriel, Wei Gary, Cameron Glenn, Martins Gus, Hashemi Hadi, Klimczak-plucińska Hanna, Batra Harleen, Dhand Harsh, Nardini Ivan, Mein Jacinda, Zhou Jack, Svensson James, Stanway Jeff, Chan Jetha, Zhou Jin Peng, Carrasqueira Joana, Iljazi Joana, Becker Jocelyn, Fernandez Joe, Van Amersfoort Joost, Gordon Josh, Lipschultz Josh, Newlan Josh, Ji Ju-yeong, Mohamed Kareem, Badola Kartikeya, Black Kat, Millican Katie, Mcdonell Keelin, Nguyen Kelvin, Sodhia Kiranbir, Greene Kish, Sjoesund Lars Lowe, Usui Lauren, Sifre Laurent, Heuermann Lena, Lago Leticia, Mcnealus Lilly, Soares Livio Baldini, Kilpatrick Logan, Dixon Lucas, Martins Luciano, Reid Machel, Singh Manvinder, Iverson Mark, Görner Martin, Velloso Mat, Wirth Mateo, Davidow Matt, Miller Matt, Rahtz Matthew, Watson Matthew, Risdal Meg, Kazemi Mehran, Moynihan Michael, Zhang Ming, Kahng Minsuk, Park Minwoo, Rahman Mofi, Khatwani Mohit, Dao Natalie, Bardoliwalla Nenshad, Devanathan Nesh, Dumai Neta, 
Chauhan Nilay, Wahltinez Oscar, Botarda Pankil, Barnes Parker, Barham Paul, Michel Paul, Jin Pengchong, Georgiev Petko, Culliton Phil, Kuppala Pradeep, Comanescu Ramona, Merhej Ramona, Jana Reena, Rokni Reza Ardeshir, Agarwal Rishabh, Mullins Ryan, Saadat Samaneh, Carthy Sara Mc, Perrin Sarah, Arnold Sébastien M. R., Krause Sebastian, Dai Shengyang, Garg Shruti, Sheth Shruti, Ronstrom Sue, Chan Susan, Jordan Timothy, Yu Ting, Eccles Tom, Hennigan Tom, Kocisky Tomas, Doshi Tulsee, Jain Vihan, Yadav Vikas, Meshram Vilobh, Dharmadhikari Vishal, Barkley Warren, Wei Wei, Ye Wenming, Han Woohyun, Kwon Woosuk, Xu Xiang, Shen Zhe, Gong Zhitao, Wei Zichuan, Cotruta Victor, Kirk Phoebe, Rao Anand, Giang Minh, Peran Ludovic, Warkentin Tris, Collins Eli, Barral Joelle, Ghahramani Zoubin, Hadsell Raia, Sculley D., Banks Jeanine, Dragan Anca, Petrov Slav, Vinyals Oriol, Dean Jeff, Hassabis Demis, Kavukcuoglu Koray, Farabet Clement, Buchatskaya Elena, Borgeaud Sebastian, Fiedel Noah, Joulin Armand, Kenealy Kathleen, Dadashi Robert, Andreev Alek
- Breaking The Length Barrier: Llm-enhanced CTR Prediction In Long Textual User Behaviors Geng Binzong, Huan Zhaoxin, Zhang Xiaolu, He Yong, Zhang Liang, Yuan Fajie, Zhou Jun, Mo Linjian
- Why Not Transform Chat Large Language Models To Non-english? Geng Xiang, Zhu Ming, Li Jiahuan, Lai Zhejian, Zou Wei, She Shuaijie, Guo Jiaxin, Zhao Xiaofeng, Li Yinglu, Li Yuang, Su Chang, Zhao Yanqing, Lyu Xinglin, Zhang Min, Chen Jiajun, Yang Hao, Huang Shujian
- AI And Memory Wall Gholami Amir, Yao Zhewei, Kim Sehoon, Hooper Coleman, Mahoney Michael W., Keutzer Kurt
- Feedback-aligned Mixed Llms For Machine Language-molecule Translation Gkoumas Dimitris, Liakata Maria
- Better & Faster Large Language Models Via Multi-token Prediction Gloeckle Fabian, Idrissi Badr Youbi, Rozière Baptiste, Lopez-paz David, Synnaeve Gabriel
- Direct Alignment Of Draft Model For Speculative Decoding With Chat-fine-tuned Llms Goel Raghavv, Gagrani Mukul, Jeon Wonseok, Park Junyoung, Lee Mingu, Lott Christopher
- Tokenshap: Interpreting Large Language Models With Monte Carlo Shapley Value Estimation Goldshmidt Roni, Horovicz Miriam
- LLMC: Benchmarking Large Language Model Quantization With A Versatile Compression Toolkit Gong Ruihao, Yong Yang, Gu Shiqiao, Huang Yushi, Lv Chentao, Zhang Yunchen, Liu Xianglong, Tao Dacheng
- Loose LIPS Sink Ships: Asking Questions In Battleship With Language-informed Program Sampling Grand Gabriel, Pepe Valerio, Andreas Jacob, Tenenbaum Joshua B.
- CMR Scaling Law: Predicting Critical Mixture Ratios For Continual Pre-training Of Language Models Gu Jiawei, Yang Zacc, Ding Chuanghao, Zhao Rui, Tan Fei
- Continual Skill And Task Learning Via Dialogue Gu Weiwei, Kondepudi Suresh, Huang Lixiao, Gopalan Nakul
- APTQ: Attention-aware Post-training Mixed-precision Quantization For Large Language Models Guan Ziyi, Huang Hantao, Su Yupeng, Huang Hong, Wong Ngai, Yu Hao
- Boosting Lossless Speculative Decoding Via Feature Sampling And Partial Alignment Distillation Gui Lujun, Xiao Bin, Su Lei, Chen Weipeng
- On The Interchangeability Of Positional Embeddings In Multilingual Neural Machine Translation Models Gumma Varun, Chitale Pranjal A., Bali Kalika
- Fine-tuning Large Language Models For Stock Return Prediction Using Newsflow Guo Tian, Hauptmann Emmanuel
- GPTQT: Quantize Large Language Models Twice To Push The Efficiency Guo Yipin, Lang Yilin, Ren Qinyuan
- Dependency-aware Semi-structured Sparsity: Declining Roles Of Outliers In Pruning Glu-based Llms Guo Zhiyu, Kamigaito Hidetaka, Watanabe Taro
- Attention Score Is Not All You Need For Token Importance Indicator In KV Cache Reduction: Value Also Matters Guo Zhiyu, Kamigaito Hidetaka, Watanabe Taro
- Integrating Explanations In Learning LTL Specifications From Demonstrations Gupta Ashutosh, Komp John, Rajput Abhay Singh, Shankaranarayanan Krishna, Trivedi Ashutosh, Varshney Namrita
- METAREFLECTION: Learning Instructions For Language Agents Using Past Reflections Gupta Priyanshu, Kirtania Shashank, Singha Ananya, Gulwani Sumit, Radhakrishna Arjun, Shi Sherry, Soares Gustavo
- Would I Lie To You? Inference Time Alignment Of Language Models Using Direct Preference Heads Hadji-kyriacou Avelina Asada, Arandjelovic Ognjen
- Mamba State-space Models Can Be Strong Downstream Learners Halloran John T., Gulati Manbir, Roysdon Paul F.
- Sltrain: A Sparse Plus Low-rank Approach For Parameter And Memory Efficient Pretraining Han Andi, Li Jiaxiang, Huang Wei, Hong Mingyi, Takeda Akiko, Jawanpuria Pratik, Mishra Bamdev
- Llm-personalize: Aligning LLM Planners With Human Preferences Via Reinforced Self-training For Housekeeping Robots Han Dongge, Mcinroe Trevor, Jelley Adam, Albrecht Stefano V., Bell Peter, Storkey Amos
- Value Augmented Sampling For Language Model Alignment And Personalization Han Seungwook, Shenfeld Idan, Srivastava Akash, Kim Yoon, Agrawal Pulkit
- Torchopera: A Compound AI System For LLM Safety Han Shanshan, Yao Yuhang, Hu Zijian, Stripelis Dimitris, Xu Zhaozhuo, He Chaoyang
- Rethinking Large Language Model Architectures For Sequential Recommendations Hanbing Wang, Xiaorui Liu, Wenqi Fan, Xiangyu Zhao, Venkataramana Kini, Devendra Yadav, Fei Wang, Zhen Wen, Jiliang Tang, Hui Liu
- Vision-and-language Navigation Generative Pretrained Transformer Hanlin Wen
- Astprompter: Weakly Supervised Automated Language Model Red-teaming To Identify Likely Toxic Prompts Hardy Amelia F., Liu Houjun, Lange Bernard, Kochenderfer Mykel J.
- Spectrum: Targeted Training On Signal To Noise Ratio Hartford Eric, Atkins Lucas, Neto Fernando Fernandes, Golchinfar David
- Pruning For Protection: Increasing Jailbreak Resistance In Aligned Llms Without Fine-tuning Hasan Adib, Rugina Ileana, Wang Alex
- Sub-goal Distillation: A Method To Improve Small Language Agents Hashemzadeh Maryam, Stengel-eskin Elias, Chandar Sarath, Cote Marc-alexandre
- Teaching Large Language Models To Reason With Reinforcement Learning Havrilla Alex, Du Yuqing, Raparthy Sharath Chandra, Nalmpantis Christoforos, Dwivedi-yu Jane, Zhuravinskyi Maksym, Hambro Eric, Sukhbaatar Sainbayar, Raileanu Roberta
- POEM: Interactive Prompt Optimization For Enhancing Multimodal Reasoning Of Large Language Models He Jianben, Wang Xingbo, Liu Shiyi, Wu Guande, Silva Claudio, Qu Huamin
- Prompt-based Bias Calibration For Better Zero/few-shot Learning Of Language Models He Kang, Long Yinghan, Roy Kaushik
- Can Language Models Act As Knowledge Bases At Scale? He Qiyuan, Wang Yizhong, Wang Wenya
- What Matters In Transformers? Not All Attention Is Needed He Shwai, Sun Guoheng, Shen Zheyu, Li Ang
- MARS: Mixture Of Auto-regressive Models For Fine-grained Text-to-image Synthesis He Wanggui, Fu Siming, Liu Mushui, Wang Xierui, Xiao Wenyi, Shu Fangxun, Wang Yi, Zhang Lei, Yu Zhelun, Li Haoyuan, Huang Ziwei, Gan Leilei, Jiang Hao
- G-retriever: Retrieval-augmented Generation For Textual Graph Understanding And Question Answering He Xiaoxin, Tian Yijun, Sun Yifei, Chawla Nitesh V., Laurent Thomas, Lecun Yann, Bresson Xavier, Hooi Bryan
- Fortifying Ethical Boundaries In AI: Advanced Strategies For Enhancing Security In Large Language Models He Yunhong, Qiu Jianling, Zhang Wei, Yuan Zhengqing
- Let The Code LLM Edit Itself When You Edit The Code He Zhenyu, Zhang Jun, Luo Shengjie, Xu Jingjing, Zhang Zhi, He Di
- Evaluation Of The Programming Skills Of Large Language Models Heitz Luc Bryan, Chamas Joun, Scherb Christopher
- Visual Prompting For Generalized Few-shot Segmentation: A Multi-scale Approach Hossain Mir Rayat Imtiaz, Siam Mennatullah, Sigal Leonid, Little James J.
- Enhancing And Accelerating Large Language Models Via Instruction-aware Contextual Compression Hou Haowen, Ma Fei, Bai Binwen, Zhu Xinxin, Yu Fei
- A Systematic Evaluation Of Large Language Models For Generating Programming Code Hou Wenpin, Ji Zhicheng
- Chatglm-rlhf: Practices Of Aligning Large Language Models With Human Feedback Hou Zhenyu, Niu Yilin, Du Zhengxiao, Zhang Xiaohan, Liu Xiao, Zeng Aohan, Zheng Qinkai, Huang Minlie, Wang Hongning, Tang Jie, Dong Yuxiao
- Large Language Model-powered Chatbots For Internationalizing Student Support In Higher Education Hsain Achraf, Housni Hamza El
- Lightweight Modality Adaptation To Sequential Recommendation Via Correlation Supervision Hu Hengchang, Liu Qijiong, Li Chuang, Kan Min-yen
- Outlier-efficient Hopfield Layers For Large Transformer-based Models Hu Jerry Yao-chieh, Chang Pei-hsuan, Luo Robin, Chen Hong-yu, Li Weijian, Wang Wei-po, Liu Han
- Openrlhf: An Easy-to-use, Scalable And High-performance RLHF Framework Hu Jian, Wu Xibin, Wang Weixun, Xianyu, Zhang Dehao, Cao Yu
- Characterization Of Large Language Model Development In The Datacenter Hu Qinghao, Ye Zhisheng, Wang Zerui, Wang Guoteng, Zhang Meng, Chen Qiaoling, Sun Peng, Lin Dahua, Wang Xiaolin, Luo Yingwei, Wen Yonggang, Zhang Tianwei
- Minicpm: Unveiling The Potential Of Small Language Models With Scalable Training Strategies Hu Shengding, Tu Yuge, Han Xu, He Chaoqun, Cui Ganqu, Long Xiang, Zheng Zhi, Fang Yewei, Huang Yuxiang, Zhao Weilin, Zhang Xinrong, Thai Zheng Leng, Zhang Kaihuo, Wang Chongyi, Yao Yuan, Zhao Chenyang, Zhou Jie, Cai Jie, Zhai Zhongwu, Ding Ning, Jia Chao, Zeng Guoyang, Li Dahai, Liu Zhiyuan, Sun Maosong
- RAG And RAU: A Survey On Retrieval-augmented Language Model In Natural Language Processing Hu Yucheng, Lu Yuxing
- Prompt Perturbation In Retrieval-augmented Generation Based Large Language Models Hu Zhibo, Wang Chen, Shu Yanfeng, Paik Helen Hye-young, Zhu Liming
- Exact And Efficient Unlearning For Large Language Model-based Recommendation Hu Zhiyu, Zhang Yang, Xiao Minghao, Wang Wenjie, Feng Fuli, He Xiangnan
- Longrecipe: Recipe For Efficient Long Context Generalization In Large Language Models Hu Zhiyuan, Liu Yuliang, Zhao Jinman, Wang Suyuchen, Wang Yan, Shen Wei, Gu Qing, Luu Anh Tuan, Ng See-kiong, Jiang Zhiwei, Hooi Bryan
- An Intermediate Fusion Vit Enables Efficient Text-image Alignment In Diffusion Models Hu Zizhao, Jia Shaochong, Rostami Mohammad
- Talk Less, Interact Better: Evaluating In-context Conversational Adaptation In Multimodal Llms Hua Yilun, Artzi Yoav
- Mmevalpro: Calibrating Multimodal Benchmarks Towards Trustworthy And Efficient Evaluation Huang Jinsheng, Chen Liang, Guo Taian, Zeng Fu, Zhao Yusheng, Wu Bohan, Yuan Ye, Zhao Haozhe, Guo Zhihui, Zhang Yichi, Yuan Jingyang, Ju Wei, Liu Luchen, Liu Tianyu, Chang Baobao, Zhang Ming
- Antidote: Post-fine-tuning Safety Alignment For Large Language Models Against Harmful Fine-tuning Huang Tiansheng, Bhattacharya Gautam, Joshi Pratik, Kimball Josh, Liu Ling
- Billm: Pushing The Limit Of Post-training Quantization For Llms Huang Wei, Liu Yangdong, Qin Haotong, Li Ying, Zhang Shiming, Liu Xianglong, Magno Michele, Qi Xiaojuan
- Slim-llm: Salience-driven Mixed-precision Quantization For Large Language Models Huang Wei, Qin Haotong, Liu Yangdong, Li Yawei, Liu Xianglong, Benini Luca, Magno Michele, Qi Xiaojuan
- An Empirical Study Of Llama3 Quantization: From Llms To Mllms Huang Wei, Zheng Xingyu, Ma Xudong, Qin Haotong, Lv Chengtao, Chen Hong, Luo Jie, Qi Xiaojuan, Liu Xianglong, Magno Michele
- Pruning Large Language Models With Semi-structural Adaptive Sparse Training Huang Weiyu, Hu Yuezhou, Jian Guohao, Zhu Jun, Chen Jianfei
- Queryagent: A Reliable And Efficient Reasoning Framework With Environmental Feedback-based Self-correction Huang Xiang, Cheng Sitao, Huang Shanshan, Shen Jiayu, Xu Yong, Zhang Chaoyun, Qu Yuzhong
- Rolora: Fine-tuning Rotated Outlier-free Llms For Effective Weight-activation Quantization Huang Xijie, Liu Zechun, Liu Shih-yang, Cheng Kwang-ting
- WESE: Weak Exploration To Strong Exploitation For LLM Agents Huang Xu, Liu Weiwen, Chen Xiaolong, Wang Xingmei, Lian Defu, Wang Yasheng, Tang Ruiming, Chen Enhong
- Leveraging Large Language Models For Enhanced NLP Task Performance Through Knowledge Distillation And Optimized Training Strategies Huang Yining, Tang Keke, Chen Meilian
- Fastfid: Improve Inference Efficiency Of Open Domain Question Answering Via Sentence Selection Huang Yufei, Han Xu, Sun Maosong
- Compression Represents Intelligence Linearly Huang Yuzhen, Zhang Jinghan, Shan Zifei, He Junxian
- Tool Calling: Enhancing Medication Consultation Via Retrieval-augmented Large Language Models Huang Zhongzhen, Xue Kui, Fan Yongqi, Mu Linjie, Liu Ruoyu, Ruan Tong, Zhang Shaoting, Zhang Xiaofan
- HFT: Half Fine-tuning For Large Language Models Hui Tingfeng, Zhang Zhenyu, Wang Shuohuan, Xu Weiran, Sun Yu, Wu Hua
- Federated Recommendation Via Hybrid Retrieval Augmented Generation Huimin Zeng, Zhenrui Yue, Qian Jiang, Dong Wang
- Pc-lora: Low-rank Adaptation For Progressive Model Compression With Knowledge Distillation Hwang Injoon, Park Haewon, Lee Youngwan, Yang Jooyoung, Maeng Sunjae
- Self-guiding Exploration For Combinatorial Problems Iklassov Zangir, Du Yali, Akimov Farkhad, Takac Martin
- On The Generalization Of Preference Learning With DPO Im Shawn, Li Yixuan
- Understanding The Learning Dynamics Of Alignment With Human Feedback Im Shawn, Li Yixuan
- Test-time Low Rank Adaptation Via Confidence Maximization For Zero-shot Generalization Of Vision-language Models Imam Raza, Gani Hanan, Huzaifa Muhammad, Nandakumar Karthik
- Scaling Laws For Downstream Task Performance Of Large Language Models Isik Berivan, Ponomareva Natalia, Hazimeh Hussein, Paparas Dimitris, Vassilvitskii Sergei, Koyejo Sanmi
- Jamba-1.5: Hybrid Transformer-mamba Models At Scale Jamba Team, Lenz Barak, Arazi Alan, Bergman Amir, Manevich Avshalom, Peleg Barak, Aviram Ben, Almagor Chen, Fridman Clara, Padnos Dan, Gissin Daniel, Jannai Daniel, Muhlgay Dor, Zimberg Dor, Gerber Edden M, Dolev Elad, Krakovsky Eran, Safahi Erez, Schwartz Erez, Cohen Gal, Shachaf Gal, Rozenblum Haim, Bata Hofit, Blass Ido, Magar Inbal, Dalmedigos Itay, Osin Jhonathan, Fadlon Julie, Rozman Maria, Danos Matan, Gokhman Michael, Zusman Mor, Gidron Naama, Ratner Nir, Gat Noam, Rozen Noam, Fried Oded, Leshno Ohad, Antverg Omer, Abend Omri, Lieber Opher, Dagan Or, Cohavi Orit, Alon Raz, Belson Ro'i, Cohen Roi, Gilad Rom, Glozman Roman, Lev Shahar, Meirom Shaked, Delbari Tal, Ness Tal, Asida Tomer, Gal Tom Ben, Braude Tom, Pumerantz Uriya, Cohen Yehoshua, Belinkov Yonatan, Globerson Yuval, Levy Yuval Peleg, Shoham Yoav
- Parameter Efficient Diverse Paraphrase Generation Using Sequence-level Knowledge Distillation Jayawardena Lasal, Yapa Prasan
- L4Q: Parameter Efficient Quantization-aware Fine-tuning On Large Language Models Jeon Hyesung, Kim Yulhwa, Kim Jae-joon
- SDQ: Sparse Decomposed Quantization For LLM Inference Jeong Geonhwa, Tsai Po-an, Keckler Stephen W., Krishna Tushar
- OLAPH: Improving Factuality In Biomedical Long-form Question Answering Jeong Minbyul, Hwang Hyeon, Yoon Chanwoong, Lee Taewhoo, Kang Jaewoo
- Llms Could Autonomously Learn Without External Supervision Ji Ke, Chen Junying, Gao Anningzhe, Xie Wenya, Wan Xiang, Wang Benyou
- Generalizing Reward Modeling For Out-of-distribution Preference Learning Jia Chen
- SOUL: Unlocking The Power Of Second-order Optimization For LLM Unlearning Jia Jinghan, Zhang Yihua, Zhang Yimeng, Liu Jiancheng, Runwal Bharat, Diffenderfer James, Kailkhura Bhavya, Liu Sijia
- Minference 1.0: Accelerating Pre-filling For Long-context Llms Via Dynamic Sparse Attention Jiang Huiqiang, Li Yucheng, Zhang Chengruidong, Wu Qianhui, Luo Xufang, Ahn Surin, Han Zhenhua, Abdi Amir H., Li Dongsheng, Lin Chin-yew, Yang Yuqing, Qiu Lili
- MC²: Multi-concept Guidance For Customized Multi-concept Generation Jiang Jiaxiu, Zhang Yabo, Feng Kailai, Wu Xiaohe, Zuo Wangmeng
- Mix-cpt: A Domain Adaptation Framework Via Decoupling Knowledge Learning And Format Alignment Jiang Jinhao, Li Junyi, Zhao Wayne Xin, Song Yang, Zhang Tao, Wen Ji-rong
- Wildteaming At Scale: From In-the-wild Jailbreaks To (adversarially) Safer Language Models Jiang Liwei, Rao Kavel, Han Seungju, Ettinger Allyson, Brahman Faeze, Kumar Sachin, Mireshghallah Niloofar, Lu Ximing, Sap Maarten, Choi Yejin, Dziri Nouha
- Unlocking The Power Of Large Language Models For Entity Alignment Jiang Xuhui, Shen Yinghan, Shi Zhichao, Xu Chengjin, Li Wei, Li Zixuan, Guo Jian, Shen Huawei, Wang Yuanzhuo
- Large Visual-language Models Are Also Good Classifiers: A Study Of In-context Multimodal Fake News Detection Jiang Ye, Wang Yimin
- Megascale: Scaling Large Language Model Training To More Than 10,000 Gpus Jiang Ziheng, Lin Haibin, Zhong Yinmin, Huang Qi, Chen Yangrui, Zhang Zhi, Peng Yanghua, Li Xiang, Xie Cong, Nong Shibiao, Jia Yulu, He Sun, Chen Hongmin, Bai Zhihao, Hou Qi, Yan Shipeng, Zhou Ding, Sheng Yiyao, Jiang Zhuo, Xu Haohan, Wei Haoran, Zhang Zhang, Nie Pengfei, Zou Leqi, Zhao Sida, Xiang Liang, Liu Zherui, Li Zhe, Jia Xiaoying, Ye Jianxi, Jin Xin, Liu Xin
- Learning Planning-based Reasoning By Trajectories Collection And Process Reward Synthesizing Jiao Fangkai, Qin Chengwei, Liu Zhengyuan, Chen Nancy F., Joty Shafiq
- APEER: Automatic Prompt Engineering Enhances Large Language Model Reranking Jin Can, Peng Hongwu, Zhao Shiyu, Wang Zhenting, Xu Wujiang, Han Ligong, Zhao Jiahui, Zhong Kai, Rajasekaran Sanguthevar, Metaxas Dimitris N.
- Ragcache: Efficient Knowledge Caching For Retrieval-augmented Generation Jin Chao, Zhang Zili, Jiang Xuanlin, Liu Fangyue, Liu Xin, Liu Xuanzhe, Jin Xin
- Self-selected Attention Span For Accelerating Large Language Model Inference Jin Tian, Yazar Wanzin, Xu Zifei, Sharify Sayeh, Wang Xin
- MULTIVERSE: Exposing Large Language Model Alignment Problems In Diverse Worlds Jin Xiaolong, Zhang Zhuo, Zhang Xiangyu
- Cutting Off The Head Ends The Conflict: A Mechanism For Interpreting And Mitigating Knowledge Conflicts In Language Models Jin Zhuoran, Cao Pengfei, Yuan Hongbang, Chen Yubo, Xu Jiexin, Li Huaijun, Jiang Xiaojian, Liu Kang, Zhao Jun
- Birbal: An Efficient 7B Instruct-model Fine-tuned With Curated Datasets Jindal Ashvini Kumar, Rajpoot Pawan Kumar, Parikh Ankur
- A2SF: Accumulative Attention Scoring With Forgetting Factor For Token Pruning In Transformer Decoder Jo Hyun-rae, Shin Dongkun
- LLMRS: Unlocking Potentials Of Llm-based Recommender Systems For Software Purchase John Angela, Aidoo Theophilus, Behmanush Hamayoon, Gunduz Irem B., Shrestha Hewan, Rahman Maxx Richard, Maaß Wolfgang
- Llm-based Federated Recommendation Jujia Zhao, Wenjie Wang, Chen Xu, Zhaochun Ren, See-kiong Ng, Tat-seng Chua
- Information-theoretic Distillation For Reference-less Summarization Jung Jaehun, Lu Ximing, Jiang Liwei, Brahman Faeze, West Peter, Koh Pang Wei, Choi Yejin
- Introducing A New Hyper-parameter For RAG: Context Window Utilization Juvekar Kush, Purwar Anupam
- Effective Context Selection In Llm-based Leaderboard Generation: An Empirical Study Kabongo Salomon, D'souza Jennifer, Auer Sören
- A Survey On Hardware Accelerators For Large Language Models Kachris Christoforos
- Language Repository For Long Video Understanding Kahatapitiya Kumara, Ranasinghe Kanchana, Park Jongwoo, Ryoo Michael S.
- Slo-aware GPU Frequency Scaling For Energy Efficient LLM Inference Serving Kakolyris Andreas Kosmas, Masouros Dimosthenis, Vavaroutsos Petros, Xydis Sotirios, Soudris Dimitrios
- Scaling Laws For Forgetting When Fine-tuning Large Language Models Kalajdzievski Damjan
- Alpaca Against Vicuna: Using Llms To Uncover Memorization Of Llms Kassem Aly M., Mahmoud Omar, Mireshghallah Niloofar, Kim Hyunwoo, Tsvetkov Yulia, Choi Yejin, Saad Sherif, Rana Santu
- Faaf: Facts As A Function For The Evaluation Of Generated Text Katranidis Vasileios, Barany Gabor
- Spectra: A Comprehensive Study Of Ternary, Quantized, And FP16 Language Models Kaushal Ayush, Pandey Tejas, Vaidhya Tejas, Bhagat Aaryan, Rish Irina
- Visiongpt: Vision-language Understanding Agent Using Generalized Multimodal Framework Kelly Chris, Hu Luhui, Yang Bang, Tian Yu, Yang Deshun, Yang Cindy, Huang Zaoshan, Li Zihao, Hu Jiayin, Zou Yuexian
- Autonomous Prompt Engineering In Large Language Models Kepel Daan, Valogianni Konstantina
- A Comprehensive Survey Of Accelerated Generation Techniques In Large Language Models Khoshnoodi Mahsa, Jain Vinija, Gao Mingye, Srikanth Malavika, Chadha Aman
- Shortened Llama: Depth Pruning For Large Language Models With Comparison Of Retraining Methods Kim Bo-kyeong, Kim Geonmin, Kim Tae-ho, Castells Thibault, Choi Shinkook, Shin Junho, Song Hyoung-kyu
- Promptkd: Distilling Student-friendly Knowledge For Generative Language Models Via Prompt Tuning Kim Gyeongman, Jang Doohyuk, Yang Eunho
- Review-driven Personalized Preference Reasoning With Large Language Models For Recommendation Kim Jieyong, Kim Hyunseo, Cho Hyunjin, Kang Seongku, Chang Buru, Yeo Jinyoung, Lee Dongha
- Exploring Prompting Methods For Mitigating Class Imbalance Through Synthetic Data Generation With Large Language Models Kim Jinhee, Kim Taesung, Choo Jaegul
- Openvla: An Open-source Vision-language-action Model Kim Moo Jin, Pertsch Karl, Karamcheti Siddharth, Xiao Ted, Balakrishna Ashwin, Nair Suraj, Rafailov Rafael, Foster Ethan, Lam Grace, Sanketi Pannag, Vuong Quan, Kollar Thomas, Burchfiel Benjamin, Tedrake Russ, Sadigh Dorsa, Levine Sergey, Liang Percy, Finn Chelsea
- Large Language Models Meet Collaborative Filtering: An Efficient All-round Llm-based Recommender System Kim Sein, Kang Hongseok, Choi Seungyoon, Kim Donghyun, Yang Minchul, Park Chanyoung
- Aligning Large Language Models With Representation Editing: A Control Perspective Kong Lingkai, Wang Haorui, Mu Wenhao, Du Yuanqi, Zhuang Yuchen, Zhou Yifei, Song Yue, Zhang Rongzhi, Wang Kai, Zhang Chao
- QPO: Query-dependent Prompt Optimization Via Multi-loop Offline Reinforcement Learning Kong Yilun, Mao Hangyu, Zhao Qi, Zhang Bin, Ruan Jingqing, Shen Li, Chang Yongzhe, Wang Xueqian, Zhao Rui, Tao Dacheng
- Evaluating Language Models For Generating And Judging Programming Feedback Koutcheme Charles, Dainese Nicola, Hellas Arto, Sarsa Sami, Leinonen Juho, Ashraf Syed, Denny Paul
- Large Language Models For Tuning Evolution Strategies Kramer Oliver
- Reinforcement Learning For Optimizing RAG For Domain Chatbots Kulkarni Mandar, Tangarajan Praveen, Kim Kyung, Trivedi Anusua
- Fine-tuning, Quantization, And Llms: Navigating Unintended Outcomes Kumar Divyanshu, Kumar Anurakt, Agarwal Sahil, Harshangi Prashanth
- Improving Socratic Question Generation Using Data Augmentation And Preference Optimization Kumar Nischal Ashok, Lan Andrew
- Babilong: Testing The Limits Of Llms With Long Context Reasoning-in-a-haystack Kuratov Yuri, Bulatov Aydar, Anokhin Petr, Rodkin Ivan, Sorokin Dmitry, Sorokin Artyom, Burtsev Mikhail
- Language-specific Calibration For Pruning Multilingual Language Models Kurz Simon, Chen Jian-jia, Flek Lucie, Zhao Zhixue
- Continual Learning Optimizations For Auto-regressive Decoder Of Multilingual ASR Systems Kwok Chin Yuen, Yip Jia Qi, Chng Eng Siong
- RES-Q: Evaluating Code-editing Large Language Model Systems At The Repository Scale Labash Beck, Rosedale August, Reents Alex, Negritto Lucas, Wiel Colin
- Biomistral: A Collection Of Open-source Pretrained Large Language Models For Medical Domains Labrak Yanis, Bazoge Adrien, Morin Emmanuel, Gourraud Pierre-antoine, Rouvier Mickael, Dufour Richard
- Fishing For Magikarp: Automatically Detecting Under-trained Tokens In Large Language Models Land Sander, Bartolo Max
- Fine-tuning Language Models With Reward Learning On Policy Lang Hao, Huang Fei, Li Yongbin
- Large Language Models As Evolution Strategies Lange Robert Tjarko, Tian Yingtao, Tang Yujin
- Transformers For Low-resource Languages: Is Féidir Linn! Lankford Séamus, Afli Haithem, Way Andy
- Melting Point: Mobile Evaluation Of Language Transformers Laskaridis Stefanos, Katevas Kleomenis, Minto Lorenzo, Haddadi Hamed
- Reallm: A General Framework For LLM Compression And Fine-tuning Leconte Louis, Bedin Lisa, Nguyen Van Minh, Moulines Eric
- Benchmarks Underestimate The Readiness Of Multi-lingual Dialogue Agents Lee Andrew H., Semnani Sina J., Castillo-lópez Galo, De Chalendar Gaël, Choudhury Monojit, Dua Ashna, Kavitha Kapil Rajesh, Kim Sungkyun, Kodali Prashant, Kumaraguru Ponnurangam, Lombard Alexis, Moradshahi Mehrad, Park Gihyun, Semmar Nasredine, Seo Jiwon, Shen Tianhao, Shrivastava Manish, Xiong Deyi, Lam Monica S.
- Programming Refusal With Conditional Activation Steering Lee Bruce W., Padhi Inkit, Ramamurthy Karthikeyan Natesan, Miehling Erik, Dognin Pierre, Nagireddy Manish, Dhurandhar Amit
- BAPO: Base-anchored Preference Optimization For Personalized Alignment In Large Language Models Lee Gihun, Jeong Minchan, Kim Yujin, Jung Hojung, Oh Jaehoon, Kim Sangmook, Yun Se-young
- Hip Attention: Sparse Sub-quadratic Attention With Hierarchical Attention Pruning Lee Heejun, Park Geon, Lee Youngwan, Kim Jina, Jeong Wonyoung, Jeon Myeongjae, Hwang Sung Ju
- Math Multiple Choice Question Generation Via Human-large Language Model Collaboration Lee Jaewook, Smith Digory, Woodhead Simon, Lan Andrew
- Improving Conversational Abilities Of Quantized Large Language Models Via Direct Preference Alignment Lee Janghwan, Park Seongmin, Hong Sukjin, Kim Minsoo, Chang Du-seong, Choi Jungwook
- A Survey Of Large Language Models In Finance (finllms) Lee Jean, Stevens Nicholas, Han Soyeon Caren, Song Minseok
- Gecko: Versatile Text Embeddings Distilled From Large Language Models Lee Jinhyuk, Dai Zhuyun, Ren Xiaoqi, Chen Blair, Cer Daniel, Cole Jeremy R., Hui Kai, Boratko Michael, Kapadia Rajvi, Ding Wen, Luan Yi, Duddu Sai Meher Karthik, Abrego Gustavo Hernandez, Shi Weiqiang, Gupta Nithi, Kusupati Aditya, Jain Prateek, Jonnalagadda Siddhartha Reddy, Chang Ming-wei, Naim Iftekhar
- LRQ: Optimizing Post-training Quantization For Large Language Models By Learning Low-rank Weight-scaling Matrices Lee Jung Hyun, Kim Jeonghoon, Yang June Yong, Kwon Se Jung, Yang Eunho, Yoo Kang Min, Lee Dongsoo
- Tender: Accelerating Large Language Models Via Tensor Decomposition And Runtime Requantization Lee Jungi, Lee Wonbeom, Sim Jaewoong
- Intent-based Prompt Calibration: Enhancing Prompt Optimization With Synthetic Boundary Cases Levi Elad, Brosh Eli, Friedmann Matan
- More RLHF, More Trust? On The Impact Of Human Preference Alignment On Language Model Trustworthiness Li Aaron J., Krishna Satyapriya, Lakkaraju Himabindu
- Cascade Reward Sampling For Efficient Decoding-time Alignment Li Bolian, Wang Yifan, Grama Ananth, Zhang Ruqi
- Are Bigger Encoders Always Better In Vision Large Models? Li Bozhou, Liang Hao, Meng Zimo, Zhang Wentao
- Concentrate Attention: Towards Domain-generalizable Prompt Optimization For Language Models Li Chengzhengxu, Liu Xiaoming, Zhang Zhaohan, Wang Yichen, Liu Chen, Lan Yu, Shen Chao
- Contextualization Distillation From Large Language Model For Knowledge Graph Completion Li Dawei, Tan Zhen, Chen Tianlong, Liu Huan
- Gradient-mask Tuning Elevates The Upper Limits Of LLM Performance Li Haoling, Zhang Xin, Liu Xiao, Gong Yeyun, Wang Yifan, Yang Yujiu, Chen Qi, Cheng Peng
- Streamingdialogue: Prolonged Dialogue Learning Via Long Context Compression With Minimal Losses Li Jia-nan, Tu Quan, Mao Cunli, Yu Zhengtao, Wen Ji-rong, Yan Rui
- MT-PATCHER: Selective And Extendable Knowledge Distillation From Large Language Models For Machine Translation Li Jiahuan, Cheng Shanbo, Huang Shujian, Chen Jiajun
- Dialogue Action Tokens: Steering Language Models In Goal-directed Dialogue With A Multi-turn Planner Li Kenneth, Wang Yiming, Viégas Fernanda, Wattenberg Martin
- Owlore: Outlier-weighed Layerwise Sampled Low-rank Projection For Memory-efficient LLM Fine-tuning Li Pengxiang, Yin Lu, Gao Xiaowei, Liu Shiwei
- Examining Post-training Quantization For Mixture-of-experts: A Benchmark Li Pingzhi, Jin Xiaolong, Cheng Yu, Chen Tianlong
- Model-enhanced Llm-driven VUI Testing Of VPA Apps Li Suwan, Bu Lei, Bai Guangdong, Xie Fuman, Chen Kai, Yue Chang
- Uio-llms: Unbiased Incremental Optimization For Long-context Llms Li Wenhao, Lin Mingbao, Zhong Yunshan, Yan Shuicheng, Ji Rongrong
- Unlocking Large Language Model's Planning Capabilities With Maximum Diversity Fine-tuning Li Wenjun, Chen Changyu, Varakantham Pradeep
- Tokenpacker: Efficient Visual Projector For Multimodal LLM Li Wentong, Yuan Yuqian, Liu Jian, Tang Dongqi, Wang Song, Qin Jie, Zhu Jianke, Zhang Lei
- 52B To 1T: Lessons Learned Via Tele-flm Series Li Xiang, Yao Yiqun, Jiang Xin, Fang Xuezhi, Wang Chao, Liu Xinzhang, Wang Zihan, Zhao Yu, Wang Xin, Huang Yuyao, Song Shuangyong, Li Yongxiang, Zhang Zheng, Zhao Bo, Sun Aixin, Wang Yequan, He Zhongjiang, Wang Zhongyuan, Li Xuelong, Huang Tiejun
- Preference Tuning For Toxicity Mitigation Generalizes Across Languages Li Xiaochen, Yong Zheng-xin, Bach Stephen H.
- Grounded Compositional And Diverse Text-to-3d With Pretrained Multi-view Diffusion Model Li Xiaolong, Mo Jiawei, Wang Ying, Parameshwara Chethan, Fei Xiaohan, Swaminathan Ashwin, Taylor Cj, Tu Zhuowen, Favaro Paolo, Soatto Stefano
- Personalized Language Modeling From Personalized Human Feedback Li Xinyu, Lipton Zachary C., Leqi Liu
- Badedit: Backdooring Large Language Models By Model Editing Li Yanzhou, Li Tianlin, Chen Kangjie, Zhang Jian, Liu Shangqing, Wang Wenhan, Zhang Tianwei, Liu Yang
- Mateval: A Multi-agent Discussion Framework For Advancing Open-ended Text Evaluation Li Yu, Zhang Shenyu, Wu Rui, Huang Xiutian, Chen Yongrui, Xu Wenhao, Qi Guilin, Min Dehai
- EAGLE: Speculative Sampling Requires Rethinking Feature Uncertainty Li Yuhui, Wei Fangyun, Zhang Chao, Zhang Hongyang
- Large Language Models For Human-like Autonomous Driving: A Survey Li Yun, Katsumata Kai, Javanmardi Ehsan, Tsukada Manabu
- Amphista: Accelerate LLM Inference With Bi-directional Multiple Drafting Heads In A Non-autoregressive Style Li Zeping, Yang Xinlong, Gao Ziheng, Liu Ji, Liu Zhuang, Li Dong, Peng Jinzhang, Tian Lu, Barsoum Emad
- Focusllm: Scaling Llm's Context By Parallel Decoding Li Zhenyu, Zhang Yike, Pan Tengyu, Sun Yutao, Duan Zhichao, Fang Junjie, Han Rong, Wang Zixuan, Wang Jianyong
- Hunyuan-dit: A Powerful Multi-resolution Diffusion Transformer With Fine-grained Chinese Understanding Li Zhimin, Zhang Jianwei, Lin Qin, Xiong Jiangfeng, Long Yanxin, Deng Xinchi, Zhang Yingfang, Liu Xingchao, Huang Minbin, Xiao Zedong, Chen Dayou, He Jiajun, Li Jiahao, Li Wenyue, Zhang Chen, Quan Rongwei, Lu Jianxiang, Huang Jiabin, Yuan Xiaoyan, Zheng Xiaoxiao, Li Yixuan, Zhang Jihong, Zhang Chao, Chen Meng, Liu Jie, Fang Zheng, Wang Weiyan, Xue Jinbao, Tao Yangyu, Zhu Jianchen, Liu Kai, Lin Sihuan, Sun Yifu, Li Yun, Wang Dongdong, Chen Mingtao, Hu Zhichao, Xiao Xiao, Chen Yan, Liu Yuhong, Liu Wei, Wang Di, Yang Yong, Jiang Jie, Lu Qinglin
- Cog-ga: A Large Language Models-based Generative Agent For Vision-language Navigation In Continuous Environments Li Zhiyuan, Lu Yanfeng, Mu Yao, Qiao Hong
- Self-instructed Derived Prompt Generation Meets In-context Learning: Unlocking New Potential Of Black-box Llms Li Zhuo, Du Yuhao, Hu Jinpeng, Wan Xiang, Gao Anningzhe
- Intermediate Distillation: Data-efficient Distillation From Black-box Llms For Information Retrieval Li Zizhong, Zhang Haopeng, Zhang Jiawei
- Languashrink: Reducing Token Overhead With Psycholinguistics Liang Xuechen, Tao Meiling, Xia Yinghui, Shi Tianyu, Wang Jun, Yang Jingsong
- Self-evolving Agents With Reflective And Memory-augmented Abilities Liang Xuechen, Tao Meiling, Xia Yinghui, Shi Tianyu, Wang Jun, Yang Jingsong
- Why Are My Prompts Leaked? Unraveling Prompt Extraction Threats In Customized Large Language Models Liang Zi, Hu Haibo, Ye Qingqing, Xiao Yaxin, Li Haoyang
- Apiq: Finetuning Of 2-bit Quantized Large Language Model Liao Baohao, Herold Christian, Khadivi Shahram, Monz Christof
- 3-in-1: 2D Rotary Adaptation For Efficient Finetuning, Efficient Batching And Composability Liao Baohao, Monz Christof
- Beyond KV Caching: Shared Attention For Efficient Llms Liao Bingli, Vargas Danilo Vasconcellos
- Extending Token Computation For LLM Reasoning Liao Bingli, Vargas Danilo Vasconcellos
- Look Before You Leap: Problem Elaboration Prompting Improves Mathematical Reasoning In Large Language Models Liao Haoran, Tian Jidong, Hu Shaohua, He Hao, Jin Yaohui
- MING-MOE: Enhancing Medical Multi-task Learning In Large Language Models With Sparse Mixture Of Low-rank Adapter Experts Liao Yusheng, Jiang Shuyang, Wang Yu, Wang Yanfeng
- E2LLM: Encoder Elongated Large Language Models For Long-context Understanding And Reasoning Liao Zihan, Wang Jun, Yu Hang, Wei Lingxiao, Li Jianguo, Wang Jun, Zhang Wei
- D2LLM: Decomposed And Distilled Large Language Models For Semantic Search Liao Zihan, Yu Hang, Li Jianguo, Wang Jun, Zhang Wei
- Bita: Bi-directional Tuning For Lossless Acceleration In Large Language Models Lin Feng, Yi Hanling, Li Hongbin, Yang Yifan, Yu Xiaotian, Lu Guangming, Xiao Rong
- Mixture Of In-context Experts Enhance Llms' Long Context Awareness Lin Hongzhan, Lv Ang, Chen Yuhan, Zhu Chen, Song Yang, Zhu Hengshu, Yan Rui
- Token-wise Influential Training Data Retrieval For Large Language Models Lin Huawei, Long Jikai, Xu Zhaozhuo, Zhao Weijie
- Large Language Models Make Sample-efficient Recommender Systems Lin Jianghao, Dai Xinyi, Shan Rong, Chen Bo, Tang Ruiming, Yu Yong, Zhang Weinan
- FLAME: Factuality-aware Alignment For Large Language Models Lin Sheng-chieh, Gao Luyu, Oguz Barlas, Xiong Wenhan, Lin Jimmy, Yih Wen-tau, Chen Xilun
- Optimizing Language Models For Human Preferences Is A Causal Inference Problem Lin Victoria, Ben-michael Eli, Morency Louis-philippe
- Diversedialogue: A Methodology For Designing Chatbots With Human-like Diversity Lin Xiaoyu, Yu Xinkai, Aich Ankit, Giorgi Salvatore, Ungar Lyle
- Efficient LLM Training And Serving With Heterogeneous Context Sharding Among Attention Heads Lin Xihui, Zhang Yunan, Ge Suyu, Patra Barun, Chaudhary Vishrav, Peng Hao, Song Xia
- Qserve: W4A8KV4 Quantization And System Co-design For Efficient LLM Serving Lin Yujun, Tang Haotian, Yang Shang, Zhang Zhekai, Xiao Guangxuan, Gan Chuang, Han Song
- Prompt Compression With Context-aware Sentence Encoding For Fast And Improved LLM Inference Liskavets Barys, Ushakov Maxim, Roy Shuvendu, Klibanov Mark, Etemad Ali, Luke Shane
- Direct Large Language Model Alignment Through Self-rewarding Contrastive Prompt Distillation Liu Aiwei, Bai Haoping, Lu Zhiyun, Kong Xiang, Wang Simon, Shan Jiulong, Cao Meng, Wen Lijie
- Cool-fusion: Fuse Large Language Models Without Training Liu Cong, Quan Xiaojun, Pan Yan, Lin Liang, Wu Weigang, Chen Xu
- SPHINX-X: Scaling Data And Parameters For A Family Of Multi-modal Large Language Models Liu Dongyang, Zhang Renrui, Qiu Longtian, Huang Siyuan, Lin Weifeng, Zhao Shitian, Geng Shijie, Lin Ziyi, Jin Peng, Zhang Kaipeng, Shao Wenqi, Xu Chao, He Conghui, He Junjun, Shao Hao, Lu Pan, Li Hongsheng, Qiao Yu, Gao Peng
- A Practice-friendly Two-stage Llm-enhanced Paradigm In Sequential Recommendation Liu Dugang, Xian Shenxian, Lin Xiaolin, Zhang Xiaolian, Zhu Hong, Fang Yuan, Chen Zhen, Ming Zhong
- Mmgrec: Multimodal Generative Recommendation With Transformer Model Liu Han, Wei Yinwei, Song Xuemeng, Guan Weili, Li Yuan-fang, Nie Liqiang
- Advancing Adversarial Suffix Transfer Learning On Aligned Large Language Models Liu Hongfu, Xie Yuxi, Wang Ye, Shieh Michael
- Bucket Pre-training Is All You Need Liu Hongtao, Peng Qiyao, Yang Qing, Liu Kai, Xu Hongyan
- Training-free Activation Sparsity In Large Language Models Liu James, Ponnusamy Pragaash, Cai Tianle, Guo Han, Kim Yoon, Athiwaratkun Ben
- Speculative Decoding Via Early-exiting For Faster LLM Inference With Thompson Sampling Control Mechanism Liu Jiahao, Wang Qifan, Wang Jingang, Cai Xunliang
- Tiny Refinements Elicit Resilience: Toward Efficient Prefix-model Against LLM Red-teaming Liu Jiaxu, Yin Xiangyu, Wu Sihao, Wang Jianhong, Fang Meng, Yi Xinping, Huang Xiaowei
- Iterative Length-regularized Direct Preference Optimization: A Case Study On Improving 7B Language Models To GPT-4 Level Liu Jie, Zhou Zhanhui, Liu Jiaheng, Bu Xingyuan, Yang Chao, Zhong Han-sen, Ouyang Wanli
- Beyond Benchmarking: A New Paradigm For Evaluation And Assessment Of Large Language Models Liu Jin, Li Qingquan, Du Wenlong
- STORE: Streamlining Semantic Tokenization And Generative Recommendation With A Single LLM Liu Qijiong, Zhu Jieming, Fan Lu, Zhao Zhou, Wu Xiao-ming
- Proofread: Fixes All Errors With One Tap Liu Renjie, Zhang Yanxiang, Zhu Yun, Sun Haicheng, Zhang Yuanbo, Huang Michael Xuelin, Cai Shanqing, Meng Lei, Zhai Shumin
- Intactkv: Improving Large Language Model Quantization By Keeping Pivot Tokens Intact Liu Ruikang, Bai Haoli, Lin Haokun, Li Yuening, Gao Han, Xu Zhengzhuo, Hou Lu, Yao Jun, Yuan Chun
- Flashback: Efficient Retrieval-augmented Language Modeling For Long Context Inference Liu Runheng, Xiao Xingchen, Huang Heyan, Chi Zewen, Wu Zhijing
- ST-LLM: Large Language Models Are Effective Temporal Learners Liu Ruyang, Li Chen, Tang Haoran, Ge Yixiao, Shan Ying, Li Ge
- RL-GPT: Integrating Reinforcement Learning And Code-as-policy Liu Shaoteng, Yuan Haoqi, Hu Minda, Li Yanwei, Chen Yukang, Liu Shu, Lu Zongqing, Jia Jiaya
- Bi-chainer: Automated Large Language Models Reasoning With Bidirectional Chaining Liu Shuqi, He Bowei, Song Linqi
- Making Them Ask And Answer: Jailbreaking Large Language Models In Few Queries Via Disguise And Reconstruction Liu Tong, Zhang Yingjie, Zhao Zhe, Dong Yinpeng, Meng Guozhu, Chen Kai
- Extensive Self-contrast Enables Feedback-free Language Model Alignment Liu Xiao, Song Xixuan, Dong Yuxiao, Tang Jie
- Harnessing Large Language Models For Multimodal Product Bundling Liu Xiaohao, Wu Jie, Tao Zhulin, Ma Yunshan, Wei Yinwei, Chua Tat-seng
- Farewell To Length Extrapolation, A Training-free Infinite Context With Finite Attention Scope Liu Xiaoran, Guo Qipeng, Song Yuerong, Liu Zhigeng, Lv Kai, Yan Hang, Li Linlin, Liu Qun, Qiu Xipeng
- Adaptive Draft-verification For Efficient Large Language Model Decoding Liu Xukun, Lei Bowen, Zhang Ruqi, Xu Dongkuan
- Understanding Llms: A Comprehensive Overview From Training To Inference Liu Yiheng, He Hao, Han Tianle, Zhang Xu, Liu Mengyuan, Tian Jiaming, Zhang Yutong, Wang Jiaqi, Gao Xiaohui, Zhong Tianyang, Pan Yi, Xu Shaochen, Wu Zihao, Liu Zhengliang, Zhang Xin, Zhang Shu, Hu Xintao, Zhang Tuo, Qiang Ning, Liu Tianming, Ge Bao
- DELTA: Decomposed Efficient Long-term Robot Task Planning Using Large Language Models Liu Yuchen, Palmieri Luigi, Koch Sebastian, Georgievski Ilche, Aiello Marco
- Hd-eval: Aligning Large Language Model Evaluators Through Hierarchical Criteria Decomposition Liu Yuxuan, Yang Tianchi, Huang Shaohan, Zhang Zihan, Huang Haizhen, Wei Furu, Deng Weiwei, Sun Feng, Zhang Qi
- Synthvlm: High-efficiency And High-quality Synthetic Data For Vision Language Models Liu Zheng, Liang Hao, Huang Xijie, Xiong Wentao, Yu Qinhan, Sun Linzhuang, Chen Chong, He Conghui, Cui Bin, Zhang Wentao
- Personality-aware Student Simulation For Conversational Intelligent Tutoring Systems Liu Zhengyuan, Yin Stella Xin, Lin Geyu, Chen Nancy F.
- Mixture Of Insightful Experts (mote): The Synergy Of Thought Chains And Expert Mixtures In Self-alignment Liu Zhili, Gou Yunhao, Chen Kai, Hong Lanqing, Gao Jiahui, Mi Fei, Zhang Yu, Li Zhenguo, Jiang Xin, Liu Qun, Kwok James T.
- Enabling Weak Llms To Judge Response Reliability Via Meta Ranking Liu Zijun, Kou Boqun, Li Peng, Yan Ming, Zhang Ji, Huang Fei, Liu Yang
- Towards Aligning Language Models With Textual Feedback Lloret Saüc Abadal, Dhuliawala Shehzaad, Murugesan Keerthiram, Sachan Mrinmaya
- A Closer Look Into Mixture-of-experts In Large Language Models Lo Ka Man, Huang Zeyu, Qiu Zihan, Wang Zili, Fu Jie
- Large Language Models Relearn Removed Concepts Lo Michelle, Cohen Shay B., Barez Fazl
- Loqt: Low-rank Adapters For Quantized Pre-training Loeschcke Sebastian, Toftrup Mads, Kastoryano Michael J., Belongie Serge, Snæbjarnarson Vésteinn
- Generative Multi-modal Knowledge Retrieval With Large Language Models Long Xinwei, Zeng Jiali, Meng Fandong, Ma Zhiyuan, Zhang Kaiyan, Zhou Bowen, Zhou Jie
- Evaluating Large Language Models In Analysing Classroom Dialogue Long Yun, Luo Haifeng, Zhang Yu
- Personalitychat: Conversation Distillation For Personalized Dialog Modeling With Facts And Traits Lotfi Ehsan, De Bruyn Maxime, Buhmann Jeska, Daelemans Walter
- Sparser Is Faster And Less Is More: Efficient Sparse Attention For Long-range Transformers Lou Chao, Jia Zixia, Zheng Zilong, Tu Kewei
- SPO: Multi-dimensional Preference Sequential Alignment With Implicit Reward Modeling Lou Xingzhou, Zhang Junge, Xie Jian, Liu Lifeng, Yan Dong, Huang Kaiqi
- Efficient Encoder-decoder Transformer Decoding For Decomposable Tasks Lu Bo-ru, Haduong Nikita, Lin Chien-yu, Cheng Hao, Smith Noah A., Ostendorf Mari
- Deepseek-vl: Towards Real-world Vision-language Understanding Lu Haoyu, Liu Wen, Zhang Bo, Wang Bingxuan, Dong Kai, Liu Bo, Sun Jingxiang, Ren Tongzheng, Li Zhuoshu, Yang Hao, Sun Yaofeng, Deng Chengqi, Xu Hanwei, Xie Zhenda, Ruan Chong
- YODA: Teacher-student Progressive Learning For Language Models Lu Jianqiao, Zhong Wanjun, Wang Yufei, Guo Zhijiang, Zhu Qi, Huang Wenyong, Wang Yanlin, Mi Fei, Wang Baojun, Wang Yasheng, Shang Lifeng, Jiang Xin, Liu Qun
- FIPO: Free-form Instruction-oriented Prompt Optimization With Preference Dataset And Modular Fine-tuning Schema Lu Junru, An Siyu, Zhang Min, He Yulan, Yin Di, Sun Xing
- Online Merging Optimizers For Boosting Rewards And Mitigating Tax In Alignment Lu Keming, Yu Bowen, Huang Fei, Fan Yang, Lin Runji, Zhou Chang
- Fine-tuning Large Language Models For Domain Adaptation: Exploration Of Training Strategies, Scaling, Model Merging And Synergistic Capabilities Lu Wei, Luu Rachel K., Buehler Markus J.
- Not All Experts Are Equal: Efficient Expert Pruning And Skipping For Mixture-of-experts Large Language Models Lu Xudong, Liu Qi, Xu Yuhui, Zhou Aojun, Huang Siyuan, Zhang Bo, Yan Junchi, Li Hongsheng
- SPP: Sparsity-preserved Parameter-efficient Fine-tuning For Large Language Models Lu Xudong, Zhou Aojun, Xu Yuhui, Zhang Renrui, Gao Peng, Li Hongsheng
- Prompt Tuning As User Inherent Profile Inference Machine Lu Yusheng, Du Zhaocheng, Li Xiangyang, Zhao Xiangyu, Liu Weiwen, Wang Yichao, Guo Huifeng, Tang Ruiming, Dong Zhenhua, Duan Yongrui
- MINI-SEQUENCE TRANSFORMER: Optimizing Intermediate Memory For Long Sequences Training Luo Cheng, Zhao Jiawei, Chen Zhuoming, Chen Beidi, Anandkumar Anima
- Crosstune: Black-box Few-shot Classification With Label Enhancement Luo Danqing, Zhang Chen, Zhang Yan, Li Haizhou
- Autom3l: An Automated Multimodal Machine Learning Framework With Large Language Models Luo Daqin, Feng Chengjian, Nong Yuxuan, Shen Yiqing
- Decoupled Alignment For Robust Plug-and-play Adaptation Luo Haozheng, Yu Jiahao, Zhang Wenxin, Li Jialong, Hu Jerry Yao-chieh, Xing Xinyu, Liu Han
- BGE Landmark Embedding: A Chunking-free Embedding Method For Retrieval Augmented Long-context Large Language Models Luo Kun, Liu Zheng, Xiao Shitao, Liu Kang
- Large Language Models As Foundations For Next-gen Dense Retrieval: A Comprehensive Empirical Assessment Luo Kun, Qin Minghao, Liu Zheng, Xiao Shitao, Zhao Jun, Liu Kang
- End-to-end Neuro-symbolic Reinforcement Learning With Textual Explanations Luo Lirui, Zhang Guoxi, Xu Hongming, Yang Yaodong, Fang Cong, Li Qing
- M³GPT: An Advanced Multimodal, Multitask Framework For Motion Comprehension And Generation Luo Mingshuang, Hou Ruibing, Chang Hong, Liu Zimo, Wang Yaowei, Shan Shiguang
- Zero-shot Cross-domain Dialogue State Tracking Via Dual Low-rank Adaptation Luo Xiang, Tang Zhiwen, Wang Jin, Zhang Xuejie
- TRAWL: Tensor Reduced And Approximated Weights For Large Language Models Luo Yiran, Patel Het, Fu Yu, Ahn Dawon, Chen Jia, Dong Yue, Papalexakis Evangelos E.
- Codeact: Code Adaptive Compute-efficient Tuning Framework For Code Llms Lv Weijie, Xia Xuan, Huang Sheng-jun
- X-reflect: Cross-reflection Prompting For Multimodal Recommendation Lyu Hanjia, Rossi Ryan, Chen Xiang, Tanjim Md Mehrab, Petrangeli Stefano, Sarkhel Somdeb, Luo Jiebo
- Automatic Summarization Of Doctor-patient Encounter Dialogues Using Large Language Model Through Prompt Tuning Lyu Mengxian, Peng Cheng, Li Xiaohan, Balian Patrick, Bian Jiang, Wu Yonghui
- When Emotional Stimuli Meet Prompt Designing: An Auto-prompt Graphical Paradigm Ma Chenggian, Zhao Xiangyu, Zhang Chunhui, Qin Yanzhao, Zhang Wentao
- First Activations Matter: Training-free Methods For Dynamic Activation In Large Language Models Ma Chi, Huang Mincong, Zhang Ying, Wang Chao, Wang Yujie, Yu Lei, Liu Chuan, Lin Wei
- Multi-modal Generative Embedding Model Ma Feipeng, Xue Hongwei, Wang Guangting, Zhou Yizhou, Rao Fengyun, Yan Shilin, Zhang Yueyi, Wu Siying, Shou Mike Zheng, Sun Xiaoyan
- EE-MLLM: A Data-efficient And Compute-efficient Multimodal Large Language Model Ma Feipeng, Zhou Yizhou, Li Hebei, He Zilong, Wu Siying, Rao Fengyun, Zhang Yueyi, Sun Xiaoyan
- FBI-LLM: Scaling Up Fully Binarized Llms From Scratch Via Autoregressive Distillation Ma Liqun, Sun Mingjie, Shen Zhiqiang
- Are Large Language Models Good Prompt Optimizers? Ma Ruotian, Wang Xiaolei, Zhou Xin, Li Jian, Du Nan, Gui Tao, Zhang Qi, Huang Xuanjing
- Megalodon: Efficient LLM Pretraining And Inference With Unlimited Context Length Ma Xuezhe, Yang Xiaomeng, Xiong Wenhan, Chen Beidi, Yu Lili, Zhang Hao, May Jonathan, Zettlemoyer Luke, Levy Omer, Zhou Chunting
- Llamoco: Instruction Tuning Of Large Language Models For Optimization Code Generation Ma Zeyuan, Guo Hongshu, Chen Jiacheng, Peng Guojun, Cao Zhiguang, Ma Yining, Gong Yue-jiao
- Rephrasing The Web: A Recipe For Compute And Data-efficient Language Modeling Maini Pratyush, Seto Skyler, Bai He, Grangier David, Zhang Yizhe, Jaitly Navdeep
- Exploring The Capabilities Of Prompted Large Language Models In Educational And Assessment Applications Maity Subhankar, Deroy Aniket, Sarkar Sudeshna
- FRACTAL: Fine-grained Scoring From Aggregate Text Labels Makhija Yukti, Agrawal Priyanka, Saket Rishi, Raghuveer Aravindan
- Matmul Or No Matmul In The Era Of 1-bit Llms Malekar Jinendra, Elbtity Mohammed E., Zand Ramtin
- COPAL: Continual Pruning In Large Language Generative Models Malla Srikanth, Choi Joon Hee, Choi Chiho
- Reinforced Prompt Personalization For Recommendation With Large Language Models Mao Wenyu, Wu Jiancan, Chen Weijian, Gao Chongming, Wang Xiang, He Xiangnan
- Don't Forget Your Reward Values: Language Model Alignment Via Value-based Calibration Mao Xin, Li Feng-lin, Xu Huimin, Zhang Wei, Luu Anh Tuan
- How Does Quantization Affect Multilingual Llms? Marchisio Kelly, Dash Saurabh, Chen Hongyu, Aumiller Dennis, Üstün Ahmet, Hooker Sara, Ruder Sebastian
- The Impact Of Hyperparameters On Large Language Model Inference Performance: An Evaluation Of Vllm And Huggingface Pipelines Martinez Matias
- Pack Of Llms: Model Fusion At Test-time Via Perplexity Optimization Mavromatis Costas, Karypis Petros, Karypis George
- Learning To Ask Informative Questions: Enhancing Llms With Preference Optimization And Expected Information Gain Mazzaccara Davide, Testoni Alberto, Bernardi Raffaella
- Improving Text-to-image Consistency Via Automatic Prompt Optimization Mañas Oscar, Astolfi Pietro, Hall Melissa, Ross Candace, Urbanek Jack, Williams Adina, Agrawal Aishwarya, Romero-soriano Adriana, Drozdzal Michal
- Superposition Prompting: Improving And Accelerating Retrieval-augmented Generation Merth Thomas, Fu Qichen, Rastegari Mohammad, Najibi Mahyar
- An Efficient Inference Framework For Early-exit Large Language Models Miao Ruijie, Yan Yihan, Yao Xinshuo, Yang Tong
- Subject-driven Text-to-image Generation Via Preference-based Reinforcement Learning Miao Yanting, Loh William, Kothawade Suraj, Poupart Pascal, Rashwan Abdullah, Li Yeqing
- Smart Language Agents In Real-world Planning Miin Annabelle, Wei Timothy
- Large Language Models: A Survey Minaee Shervin, Mikolov Tomas, Nikzad Narjes, Chenaghlu Meysam, Socher Richard, Amatriain Xavier, Gao Jianfeng
- RDBE: Reasoning Distillation-based Evaluation Enhances Automatic Essay Scoring Mohammadkhani Ali Ghiasvand
- Routoo: Learning To Route To Large Language Models Effectively Mohammadshahi Alireza, Shaikh Arshad Rafiq, Yazdani Majid
- Kam-cot: Knowledge Augmented Multimodal Chain-of-thoughts Reasoning Mondal Debjyoti, Modi Suraj, Panda Subhadarshi, Singh Rituraj, Rao Godawari Sudhakar
- Exploring The Landscape Of Large Language Models: Foundations, Techniques, And Challenges Moradi Milad, Yan Ke, Colwell David, Samwald Matthias, Asgari Rhona
- Language Modelling Approaches To Adaptive Machine Translation Moslem Yasmin
- A Logical Fallacy-informed Framework For Argument Generation Mouchel Luca, Paul Debjit, Cui Shaobo, West Robert, Bosselut Antoine, Faltings Boi
- Large Language Models Are Parallel Multilingual Learners Mu Yongyu, Feng Peinan, Cao Zhiquan, Wu Yuzhang, Li Bei, Wang Chenglong, Xiao Tong, Song Kai, Liu Tongran, Zhang Chunliang, Zhu Jingbo
- Cross-layer Attention Sharing For Large Language Models Mu Yongyu, Wu Yuzhang, Fan Yuchun, Wang Chenglong, Li Hengyu, He Qiaozhi, Yang Murun, Xiao Tong, Zhu Jingbo
- Compact Language Models Via Pruning And Knowledge Distillation Muralidharan Saurav, Sreenivas Sharath Turuvekere, Joshi Raviraj, Chochowski Marcin, Patwary Mostofa, Shoeybi Mohammad, Catanzaro Bryan, Kautz Jan, Molchanov Pavlo
- Generating Situated Reflection Triggers About Alternative Solution Paths: A Case Study Of Generative AI For Computer-supported Collaborative Learning Naik Atharva, Yin Jessica Ruhan, Kamath Anusha, Ma Qianou, Wu Sherry Tongshuang, Murray Charles, Bogart Christopher, Sakr Majd, Rose Carolyn P.
- Cdquant: Accurate Post-training Weight Quantization Of Large Pre-trained Models Using Greedy Coordinate Descent Nair Pranav Ajit, Suggala Arun Sai
- Apprentices To Research Assistants: Advancing Research With Large Language Models Namvarpour M., Razi A.
- Educational Personalized Learning Path Planning With Large Language Models Ng Chee, Fung Yuen
- Better Alignment With Instruction Back-and-forth Translation Nguyen Thao, Li Jeffrey, Oh Sewoong, Schmidt Ludwig, Weston Jason, Zettlemoyer Luke, Li Xian
- Xwin-lm: Strong And Scalable Alignment Practice For Llms Ni Bolin, Hu Jingcheng, Wei Yixuan, Peng Houwen, Zhang Zheng, Meng Gaofeng, Hu Han
- Layer-wise Regularized Dropout For Neural Language Models Ni Shiwen, Yang Min, Xu Ruifeng, Li Chengming, Hu Xiping
- Decomposed Prompting: Unveiling Multilingual Linguistic Structure Knowledge In English-centric Large Language Models Nie Ercong, Yuan Shuzhou, Ma Bolei, Schmid Helmut, Färber Michael, Kreuter Frauke, Schütze Hinrich
- Vikhr: The Family Of Open-source Instruction-tuned Large Language Models For Russian Nikolich Aleksandr, Korolev Konstantin, Shelmanov Artem, Kiselev Igor
- Pico: Peer Review In Llms Based On The Consistency Optimization Ning Kun-peng, Yang Shuo, Liu Yu-yang, Yao Jia-yu, Liu Zhen-hui, Wang Yu, Pang Ming, Yuan Li
- User-llm: Efficient LLM Contextualization With User Embeddings Ning Lin, Liu Luyang, Wu Jiaxing, Wu Neo, Berlowitz Devora, Prakash Sushant, Green Bradley, O'banion Shawn, Xie Jun
- Can Llms Learn By Teaching? A Preliminary Study Ning Xuefei, Wang Zifu, Li Shiyao, Lin Zinan, Yao Peiran, Fu Tianyu, Blaschko Matthew B., Dai Guohao, Yang Huazhong, Wang Yu
- Skill Set Optimization: Reinforcing Language Model Behavior Via Transferable Skills Nottingham Kolby, Majumder Bodhisattwa Prasad, Mishra Bhavana Dalvi, Singh Sameer, Clark Peter, Fox Roy
- GECKO: Generative Language Model For English, Code And Korean Oh Sungwoo, Kim Donggyu
- Crafting Efficient Fine-tuning Strategies For Large Language Models Oliver Michael, Wang Guan
- Outliers And Calibration Sets Have Diminishing Effect On Quantization Of Modern Llms Paglieri Davide, Dash Saurabh, Rocktäschel Tim, Parker-holder Jack
- Investigating LLM Applications In E-commerce Palen-michel Chester, Wang Ruixiang, Zhang Yipeng, Yu David, Xu Canran, Wu Zhe
- Dense Training, Sparse Inference: Rethinking Training Of Mixture-of-experts Language Models Pan Bowen, Shen Yikang, Liu Haokun, Mishra Mayank, Zhang Gaoyuan, Oliva Aude, Raffel Colin, Panda Rameswar
- Dynathink: Fast Or Slow? A Dynamic Decision-making Framework For Large Language Models Pan Jiabao, Zhang Yan, Zhang Chen, Liu Zuozhu, Wang Hongwei, Li Haizhou
- LISA: Layerwise Importance Sampling For Memory-efficient Large Language Model Fine-tuning Pan Rui, Liu Xiang, Diao Shizhe, Pi Renjie, Zhang Jipeng, Han Chi, Zhang Tong
- The Solution For The AIGC Inference Performance Optimization Competition Pan Sishun, Xu Haonan, Wan Zhonghua, Yang Yang
- Ee-tuning: An Economical Yet Scalable Solution For Tuning Early-exit Large Language Models Pan Xuchen, Chen Yanxi, Li Yaliang, Ding Bolin, Zhou Jingren
- Preparing Lessons For Progressive Training On Language Models Pan Yu, Yuan Ye, Yin Yichun, Shi Jiaxin, Xu Zenglin, Zhang Ming, Shang Lifeng, Jiang Xin, Liu Qun
- Llmlingua-2: Data Distillation For Efficient And Faithful Task-agnostic Prompt Compression Pan Zhuoshi, Wu Qianhui, Jiang Huiqiang, Xia Menglin, Luo Xufang, Zhang Jue, Lin Qingwei, Rühle Victor, Yang Yuqing, Lin Chin-yew, Zhao H. Vicky, Qiu Lili, Zhang Dongmei
- Salute The Classic: Revisiting Challenges Of Machine Translation In The Age Of Large Language Models Pang Jianhui, Ye Fanghua, Wang Longyue, Yu Dian, Wong Derek F., Shi Shuming, Tu Zhaopeng
- Anchor-based Large Language Models Pang Jianhui, Ye Fanghua, Wong Derek Fai, He Xin, Chen Wanshun, Wang Longyue
- Token-picker: Accelerating Attention In Text Generation With Minimized Memory Transfer Via Probability Estimation Park Junyoung, Kang Myeonggu, Han Yunki, Kim Yanggon, Shin Jaekang, Kim Lee-sup
- Pre-trained Vision And Language Transformers Are Few-shot Incremental Learners Park Keon-hee, Song Kyungwoo, Park Gyeong-moon
- Disentangling Length From Quality In Direct Preference Optimization Park Ryan, Rafailov Rafael, Ermon Stefano, Finn Chelsea
- Any-precision LLM: Low-cost Deployment Of Multiple, Different-sized Llms Park Yeonhong, Hyun Jake, Cho Sanglyul, Sim Bonggeun, Lee Jae W.
- The Ultimate Guide To Fine-tuning Llms From Basics To Breakthroughs: An Exhaustive Review Of Technologies, Research, Best Practices, Applied Research Challenges And Opportunities Parthasarathy Venkatesh Balavadhani, Zafar Ahtsham, Khan Aafaq, Shahid Arsalan
- Evidence To Generate (E2G): A Single-agent Two-step Prompting For Context Grounded And Retrieval Augmented Reasoning Parvez Md Rizwan
- Automating Code Adaptation For Mlops -- A Benchmarking Study On Llms Patel Harsh, Ramanan Buvaneswari A., Khan Manzoor A., Williams Thomas, Friedman Brian, Drabeck Lawrence
- Advprompter: Fast Adaptive Adversarial Prompting For Llms Paulus Anselm, Zharmagambetov Arman, Guo Chuan, Amos Brandon, Tian Yuandong
- Constitutionalexperts: Training A Mixture Of Principle-based Prompts Petridis Savvas, Wedin Ben, Yuan Ann, Wexler James, Thain Nithum
- Optimizing Large Language Models For Openapi Code Completion Petryshyn Bohdan, Lukoševičius Mantas
- Towards Reliable Medical Question Answering: Techniques And Challenges In Mitigating Hallucinations In Language Models Pham Duy Khoa, Vo Bao Quoc
- Can Base Chatgpt Be Used For Forecasting Without Additional Optimization? Pham Van, Cunningham Scott
- Distillation Contrastive Decoding: Improving Llms Reasoning With Contrastive Decoding And Distillation Phan Phuc, Tran Hieu, Phan Long
- Ptt5-v2: A Closer Look At Continued Pretraining Of T5 Models For The Portuguese Language Piau Marcos, Lotufo Roberto, Nogueira Rodrigo
- Better RAG Using Relevant Information Gain Pickett Marc, Hartman Jeremy, Bhowmick Ayan Kumar, Alam Raquib-ul, Vempaty Aditya
- Mamba-ptq: Outlier Channels In Recurrent Large Language Models Pierro Alessandro, Abreu Steven
- Dissecting Language Models: Machine Unlearning Via Selective Pruning Pochinkov Nicholas, Schoots Nandi
- Advanced Natural-based Interaction For The Italian Language: Llamantino-3-anita Polignano Marco, Basile Pierpaolo, Semeraro Giovanni
- Documint: Docstring Generation For Python Using Small Language Models Poudel Bibek, Cook Adam, Traore Sekou, Ameli Shelah
- Large Language Models Meet User Interfaces: The Case Of Provisioning Feedback Pozdniakov Stanislav, Brazil Jonathan, Abdi Solmaz, Bakharia Aneesha, Sadiq Shazia, Gasevic Dragan, Denny Paul, Khosravi Hassan
- When Quantization Affects Confidence Of Large Language Models? Proskurina Irina, Brun Luc, Metzler Guillaume, Velcin Julien
- Code Prompting Elicits Conditional Reasoning Abilities In Text+code Llms Puerto Haritz, Tutek Martin, Aditya Somak, Zhu Xiaodan, Gurevych Iryna
- Practical Token Pruning For Foundation Models In Few-shot Conversational Virtual Assistant Systems Qi Haode, Qian Cheng, Ni Jian, Singh Pratyush, Fazeli Reza, Wang Gengyu, Shu Zhongzheng, Wayne Eric, Bross Juergen
- Model Internals-based Answer Attribution For Trustworthy Retrieval-augmented Generation Qi Jirui, Sarti Gabriele, Fernández Raquel, Bisazza Arianna
- ADELIE: Aligning Large Language Models On Information Extraction Qi Yunjia, Peng Hao, Wang Xiaozhi, Xu Bin, Hou Lei, Li Juanzi
- Iterative Experience Refinement Of Software-developing Agents Qian Chen, Li Jiahao, Dang Yufan, Liu Wei, Wang Yifei, Xie Zihao, Chen Weize, Yang Cheng, Zhang Yingli, Liu Zhiyuan, Sun Maosong
- Openba-v2: Reaching 77.3% High Compression Ratio With Fast Multi-stage Pruning Qiao Dan, Su Yi, Wang Pinzheng, Ye Jing, Xie Wenjing, Zhou Yuechi, Ding Yuyang, Tang Zecheng, Wang Jikai, Ji Yixin, Wang Yue, Guo Pei, Sun Zechen, Zhang Zikang, Li Juntao, Chao Pingfu, Chen Wenliang, Fu Guohong, Zhou Guodong, Zhu Qiaoming, Zhang Min
- Supportiveness-based Knowledge Rewriting For Retrieval-augmented Language Modeling Qiao Zile, Ye Wei, Jiang Yong, Mo Tong, Xie Pengjun, Li Weiping, Huang Fei, Zhang Shikun
- The Uniqueness Of Llama3-70b With Per-channel Quantization: An Empirical Study Qin Minghai
- Empirical Guidelines For Deploying Llms Onto Resource-constrained Edge Devices Qin Ruiyang, Liu Dancheng, Yan Zheyu, Tan Zhaoxuan, Pan Zixuan, Jia Zhenge, Jiang Meng, Abbasi Ahmed, Xiong Jinjun, Shi Yiyu
- Megen: Generative Backdoor In Large Language Models Via Model Editing Qiu Jiyang, Ma Xinbei, Zhang Zhuosheng, Zhao Hai
- Spectral Editing Of Activations For Large Language Model Alignment Qiu Yifu, Zhao Zheng, Ziser Yftah, Korhonen Anna, Ponti Edoardo M., Cohen Shay B.
- Layerwise Recurrent Router For Mixture-of-experts Qiu Zihan, Huang Zeyu, Cheng Shuang, Zhou Yizhi, Wang Zili, Titov Ivan, Fu Jie
- Coursegpt-zh: An Educational Large Language Model Based On Knowledge Distillation Incorporating Prompt Optimization Qu Zheyan, Yin Lu, Yu Zitong, Wang Wenbo, Zhang Xing
- Simultaneous Masking, Not Prompting Optimization: A Paradigm Shift In Fine-tuning Llms For Simultaneous Translation Raffel Matthew, Agostinelli Victor, Chen Lizhong
- GSR-BENCH: A Benchmark For Grounded Spatial Reasoning Evaluation Via Multimodal Llms Rajabi Navid, Kosecka Jana
- Qdylora: Quantized Dynamic Low-rank Adaptation For Efficient Large Language Model Tuning Rajabzadeh Hossein, Valipour Mojtaba, Zhu Tianshu, Tahaei Marzieh, Kwon Hyock Ju, Ghodsi Ali, Chen Boxing, Rezagholizadeh Mehdi
- Scavenging Hyena: Distilling Transformers Into Long Convolution Models Ralambomihanta Tokiniaina Raharison, Mohammadzadeh Shahrad, Islam Mohammad Sami Nur, Jabbour Wassim, Liang Laurence
- Preference Distillation For Personalized Generative Recommendation Ramos Jerome, Wu Bin, Lipani Aldo
- WARP: On The Benefits Of Weight Averaged Rewarded Policies Ramé Alexandre, Ferret Johan, Vieillard Nino, Dadashi Robert, Hussenot Léonard, Cedoz Pierre-louis, Sessa Pier Giuseppe, Girgin Sertan, Douillard Arthur, Bachem Olivier
- Self-refine Instruction-tuning For Aligning Reasoning In Language Models Ranaldi Leonardo, Freitas Andrè
- A Fine-tuning Enhanced RAG System With Quantized Influence Measure As AI Judge Rangan Keshav, Yin Yiqiao
- Small But Funny: A Feedback-driven Approach To Humor Distillation Ravi Sahithya, Huber Patrick, Shrivastava Akshat, Sagar Aditya, Aly Ahmed, Shwartz Vered, Einolghozati Arash
- Analyzing And Adapting Large Language Models For Few-shot Multilingual NLU: Are We There Yet? Razumovskaia Evgeniia, Vulić Ivan, Korhonen Anna
- FIRST: Faster Improved Listwise Reranking With Single Token Decoding Reddy Revanth Gangi, Doo Jaehyeok, Xu Yifei, Sultan Md Arafat, Swain Deevya, Sil Avirup, Ji Heng
- Strengthening Multimodal Large Language Model With Bootstrapped Preference Optimization Pi Renjie, Han Tianyang, Xiong Wei, Zhang Jipeng, Liu Runtao, Pan Rui, Zhang Tong
- Are Electra's Sentence Embeddings Beyond Repair? The Case Of Semantic Textual Similarity Rep Ivan, Dukić David, Šnajder Jan
- Code Generation With Alphacodium: From Prompt Engineering To Flow Engineering Ridnik Tal, Kredo Dedy, Friedman Itamar
- Countering Reward Over-optimization In LLM With Demonstration-guided Reinforcement Learning Rita Mathieu, Strub Florian, Chaabouni Rahma, Michel Paul, Dupoux Emmanuel, Pietquin Olivier
- RAGE Against The Machine: Retrieval-augmented LLM Explanations Rorseth Joel, Godfrey Parke, Golab Lukasz, Srivastava Divesh, Szlichta Jaroslaw
- Achieving Peak Performance For Large Language Models: A Systematic Review Rostam Zhyar Rzgar K, Szénási Sándor, Kertész Gábor
- Enhancing Low-resource NMT With A Multilingual Encoder And Knowledge Distillation: A Case Study Roy Aniruddha, Ray Pretam, Maheshwari Ayush, Sarkar Sudeshna, Goyal Pawan
- Efficient Language Adaptive Pre-training: Extending State-of-the-art Large Language Models For Polish Ruciński Szymon
- Insights Into Alignment: Evaluating DPO And Its Variants Across Multiple Tasks Saeidi Amir, Verma Shivanshu, Baral Chitta
- Vision-language And Large Language Model Performance In Gastroenterology: GPT, Claude, Llama, Phi, Mistral, Gemma, And Quantized Models Safavi-naini Seyed Amir Ahmad, Ali Shuhaib, Shahab Omer, Shahhoseini Zahra, Savage Thomas, Rafiee Sara, Samaan Jamil S, Shabeeb Reem Al, Ladak Farah, Yang Jamie O, Echavarria Juan, Babar Sumbal, Shaukat Aasma, Margolis Samuel, Tatonetti Nicholas P, Nadkarni Girish, Kurdi Bara El, Soroush Ali
- Mixsumm: Topic-based Data Augmentation Using Llms For Low-resource Extractive Text Summarization Sahu Gaurav, Laradji Issam H.
- Algorithmic Language Models With Neurally Compiled Libraries Saldyt Lucas, Kambhampati Subbarao
- Optimization Methods For Personalizing Large Language Models Through Retrieval Augmentation Salemi Alireza, Kallumadi Surya, Zamani Hamed
- Zero-shot RTL Code Generation With Attention Sink Augmented Large Language Models Sandal Selim, Akturk Ismail
- The Future Of Large Language Model Pre-training Is Federated Sani Lorenzo, Iacob Alex, Cao Zeyu, Marino Bill, Gao Yan, Paulik Tomas, Zhao Wanru, Shen William F., Aleksandrov Preslav, Qiu Xinchi, Lane Nicholas D.
- Lean Attention: Hardware-aware Scalable Attention Mechanism For The Decode-phase Of Transformers Sanovar Rya, Bharadwaj Srikant, Amant Renee St., Rühle Victor, Rajmohan Saravan
- Improving The Validity Of Automatically Generated Feedback Via Reinforcement Learning Scarlatos Alexander, Smith Digory, Woodhead Simon, Lan Andrew
- Self-distillation For Model Stacking Unlocks Cross-lingual NLU In 200+ Languages Schmidt Fabian David, Borchert Philipp, Vulić Ivan, Glavaš Goran
- Symbolic Prompt Program Search: A Structure-aware Approach To Efficient Compile-time Prompt Optimization Schnabel Tobias, Neville Jennifer
- Large Language Models To Generate System-level Test Programs Targeting Non-functional Properties Schwachhofer Denis, Domanski Peter, Becker Steffen, Wagner Stefan, Sauer Matthias, Pflüger Dirk, Polian Ilia
- Generative Explore-exploit: Training-free Optimization Of Generative Recommender Systems Using LLM Optimizers Senel Lütfi Kerem, Fetahu Besnik, Yoshida Davis, Chen Zhiyu, Castellucci Giuseppe, Vedula Nikhita, Choi Jason, Malmasi Shervin
- Train-attention: Meta-learning Where To Focus In Continual Knowledge Learning Seo Yeongbin, Lee Dongha, Yeo Jinyoung
- Benchmarking Large Language Models For Math Reasoning Tasks Seßler Kathrin, Rong Yao, Gözlüklü Emek, Kasneci Enkelejda
- Flashattention-3: Fast And Accurate Attention With Asynchrony And Low-precision Shah Jay, Bikshandi Ganesh, Zhang Ying, Thakkar Vijay, Ramani Pradeep, Dao Tri
- Ai-assisted Generation Of Difficult Math Questions Shah Vedant, Yu Dingli, Lyu Kaifeng, Park Simon, Ke Nan Rosemary, Mozer Michael, Bengio Yoshua, Arora Sanjeev, Goyal Anirudh
- Putting Gpt-4o To The Sword: A Comprehensive Evaluation Of Language, Vision, Speech, And Multimodal Proficiency Shahriar Sakib, Lund Brady, Mannuru Nishith Reddy, Arshad Muhammad Arbab, Hayawi Kadhim, Bevara Ravi Varma Kumar, Mannuru Aashrith, Batool Laiba
- Show, Don't Tell: Aligning Language Models With Demonstrated Feedback Shaikh Omar, Lam Michelle, Hejna Joey, Shao Yijia, Bernstein Michael, Yang Diyi
- Patch-level Training For Large Language Models Shao Chenze, Meng Fandong, Zhou Jie
- Extensible Embedding: A Flexible Multipler For Llm's Context Length Shao Ninglu, Xiao Shitao, Liu Zheng, Zhang Peitian
- Combining Multiple Post-training Techniques To Achieve Most Efficient Quantized Llms Sharify Sayeh, Xu Zifei, Yazar Wanzin, Wang Xin
- Patched MOA: Optimizing Inference For Diverse Software Development Tasks Sharma Asankhaya
- MAPO: Advancing Multilingual Reasoning Through Multilingual Alignment-as-preference Optimization She Shuaijie, Zou Wei, Huang Shujian, Zhu Wenhao, Liu Xiang, Geng Xiang, Chen Jiajun
- Accurate And Efficient Fine-tuning Of Quantized Large Language Models Through Optimal Balance Shen Ao, Wang Qiang, Lai Zhiquan, Li Xionglve, Li Dongsheng
- Nemo-aligner: Scalable Toolkit For Efficient Model Alignment Shen Gerald, Wang Zhilin, Delalleau Olivier, Zeng Jiaqi, Dong Yi, Egert Daniel, Sun Shengyang, Zhang Jimmy, Jain Sahil, Taghibakhshi Ali, Ausin Markel Sanz, Aithal Ashwath, Kuchaiev Oleksii
- Rapid Optimization For Jailbreaking Llms Via Subconscious Exploitation And Echopraxia Shen Guangyu, Cheng Siyuan, Zhang Kaiyuan, Tao Guanhong, An Shengwei, Yan Lu, Zhang Zhuo, Ma Shiqing, Zhang Xiangyu
- Improving Reinforcement Learning From Human Feedback Using Contrastive Rewards Shen Wei, Zhang Xiaoying, Yao Yuanshun, Zheng Rui, Guo Hongyi, Liu Yang
- Scaling Laws For Linear Complexity Language Models Shen Xuyang, Li Dong, Leng Ruitao, Qin Zhen, Sun Weigao, Zhong Yiran
- Think-then-act: A Dual-angle Evaluated Retrieval-augmented Generation Shen Yige, Jiang Hao, Qu Hua, Zhao Jihong
- Lumos: Empowering Multimodal Llms With Scene Text Recognition Shenoy Ashish, Lu Yichao, Jayakumar Srihari, Chatterjee Debojeet, Moslehpour Mohsen, Chuang Pierce, Harpale Abhay, Bhardwaj Vikas, Xu Di, Zhao Shicong, Zhao Longfang, Ramchandani Ankit, Dong Xin Luna, Kumar Anuj
- Using Chatgpt To Score Essays And Short-form Constructed Responses Shermis Mark D.
- A Thorough Examination Of Decoding Methods In The Era Of Llms Shi Chufan, Yang Haoran, Cai Deng, Zhang Zhisong, Wang Yifan, Yang Yujiu, Lam Wai
- Keep The Cost Down: A Review On Methods To Optimize LLM's Kv-cache Consumption Shi Luohe, Zhang Hongyi, Yao Yao, Li Zuchao, Zhao Hai
- Retrieval-enhanced Knowledge Editing In Language Models For Multi-hop Question Answering Shi Yucheng, Tan Qiaoyu, Wu Xuansheng, Zhong Shaochen, Zhou Kaixiong, Liu Ninghao
- Eragent: Enhancing Retrieval-augmented Language Models With Improved Accuracy, Efficiency, And Personalization Shi Yunxiao, Zi Xing, Shi Zijing, Zhang Haimin, Wu Qiang, Xu Min
- Generate-then-ground In Retrieval-augmented Generation For Multi-hop Question Answering Shi Zhengliang, Zhang Shuo, Sun Weiwei, Gao Shen, Ren Pengjie, Chen Zhumin, Ren Zhaochun
- Rethinking Pruning Large Language Models: Benefits And Pitfalls Of Reconstruction Error Minimization Shin Sungbin, Park Wonpyo, Lee Jaeho, Lee Namhoon
- Structuredrag: JSON Response Formatting With Large Language Models Shorten Connor, Pierse Charles, Smith Thomas Benjamin, Cardenas Erika, Sharma Akanksha, Trengrove John, Van Luijt Bob
- FIRST: Teach A Reliable Large Language Model Through Efficient Trustworthy Distillation Shum Kashun, Xu Minrui, Zhang Jianshu, Chen Zixin, Diao Shizhe, Dong Hanze, Zhang Jipeng, Raza Muhammad Omer
- H2o-danube-1.8b Technical Report Singer Philipp, Pfeiffer Pascal, Babakhin Yauhen, Jeblick Maximilian, Dhankhar Nischay, Fodor Gabor, Ambati Sri Satish
- Using Adaptive Empathetic Responses For Teaching English Siyan Li, Shao Teresa, Yu Zhou, Hirschberg Julia
- Scaling LLM Test-time Compute Optimally Can Be More Effective Than Scaling Model Parameters Snell Charlie, Lee Jaehoon, Xu Kelvin, Kumar Aviral
- Prefixing Attention Sinks Can Mitigate Activation Outliers For Large Language Model Quantization Son Seungwoo, Park Wonpyo, Han Woohyun, Kim Kyuyeun, Lee Jaeho
- Prosparse: Introducing And Enhancing Intrinsic Activation Sparsity Within Large Language Models Song Chenyang, Han Xu, Zhang Zhengyan, Hu Shengding, Shi Xiyu, Li Kuai, Chen Chen, Liu Zhiyuan, Li Guangli, Yang Tao, Sun Maosong
- Hierarchical Context Merging: Better Long Context Understanding For Pre-trained Llms Song Woomin, Oh Seunghyuk, Mo Sangwoo, Kim Jaehyung, Yun Sukmin, Ha Jung-woo, Shin Jinwoo
- Position: Leverage Foundational Models For Black-box Optimization Song Xingyou, Tian Yingtao, Lange Robert Tjarko, Lee Chansoo, Tang Yujin, Chen Yutian
- Sharelora: Parameter Efficient And Robust Large Language Model Fine-tuning Via Shared Low-rank Adaptation Song Yurun, Zhao Junchen, Harris Ian G., Jyothi Sangeetha Abdu
- Empo: Theory-driven Dataset Construction For Empathetic Response Generation Through Preference Optimization Sotolar Ondrej
- Comparative Analysis Of Different Efficient Fine Tuning Methods Of Large Language Models (llms) In Low-resource Setting Srinivasan Krishna Prasad Varadarajan, Gumpena Prasanth, Yattapu Madhusudhana, Brahmbhatt Vishal H.
- LACIE: Listener-aware Finetuning For Confidence Calibration In Large Language Models Stengel-eskin Elias, Hase Peter, Bansal Mohit
- RLVF: Learning From Verbal Feedback Without Overgeneralization Stephan Moritz, Khazatsky Alexander, Mitchell Eric, Chen Annie S, Hsu Sheryl, Sharma Archit, Finn Chelsea
- Llm-barber: Block-aware Rebuilder For Sparsity Mask In One-shot For Large Language Models Su Yupeng, Guan Ziyi, Liu Xiaoqun, Jin Tianlai, Wu Dongkuan, Chesi Graziano, Wong Ngai, Yu Hao
- Timo: Towards Better Temporal Reasoning For Language Models Su Zhaochen, Zhang Jun, Zhu Tong, Qu Xiaoye, Li Juntao, Zhang Min, Cheng Yu
- Persona-db: Efficient Large Language Model Personalization For Response Prediction With Collaborative Data Refinement Sun Chenkai, Yang Ke, Reddy Revanth Gangi, Fung Yi R., Chan Hou Pong, Small Kevin, Zhai Chengxiang, Ji Heng
- Decoding At The Speed Of Thought: Harnessing Parallel Decoding Of Lexical Units For Llms Sun Chenxi, Zhang Hongzhi, Lin Zijia, Zhang Jingyuan, Zhang Fuzheng, Wang Zhongyuan, Chen Bin, Song Chengru, Zhang Di, Gai Kun, Xiong Deyi
- Hierarchical In-context Reinforcement Learning With Hindsight Modular Reflections For Planning Sun Chuanneng, Huang Songjun, Pompili Dario
- Bbox-adapter: Lightweight Adapting For Black-box Large Language Models Sun Haotian, Zhuang Yuchen, Wei Wei, Zhang Chao, Dai Bo
- Benchmarking Chinese Commonsense Reasoning Of Llms: From Chinese-specifics To Reasoning-memorization Correlations Sun Jiaxing, Huang Weiquan, Wu Jiang, Gu Chenya, Li Wei, Zhang Songyang, Yan Hang, He Conghui
- Diversifying Query: Region-guided Transformer For Temporal Sentence Grounding Sun Xiaolong, Shi Liushuai, Wang Le, Zhou Sanping, Xia Kun, Wang Yabing, Hua Gang
- Expert-guided Extinction Of Toxic Tokens For Debiased Generation Sun Xueyao, Shi Kaize, Tang Haoran, Xu Guandong, Li Qing
- Automating Research Synthesis With Domain-specific Large Language Model Fine-tuning Susnjak Teo, Hwang Peter, Reyes Napoleon H., Barczak Andre L. C., Mcintosh Timothy R., Ranathunga Surangika
- Specexec: Massively Parallel Speculative Decoding For Interactive LLM Inference On Consumer Devices Svirschevski Ruslan, May Avner, Chen Zhuoming, Chen Beidi, Jia Zhihao, Ryabinin Max
- Online Adaptation Of Language Models With A Memory Of Amortized Contexts Tack Jihoon, Kim Jaehyung, Mitchell Eric, Shin Jinwoo, Teh Yee Whye, Schwarz Jonathan Richard
- Mobilequant: Mobile-friendly Quantization For On-device Language Models Tan Fuwen, Lee Royson, Dudziak Łukasz, Hu Shell Xu, Bhattacharya Sourav, Hospedales Timothy, Tzimiropoulos Georgios, Martinez Brais
- PROXYQA: An Alternative Framework For Evaluating Long-form Text Generation With Large Language Models Tan Haochen, Guo Zhijiang, Shi Zhan, Xu Lu, Liu Zhili, Feng Yunlong, Li Xiaoguang, Wang Yasheng, Shang Lifeng, Liu Qun, Song Linqi
- DLO: Dynamic Layer Operation For Efficient Vertical Scaling Of Llms Tan Zhen, Dong Daize, Zhao Xinyu, Peng Jie, Cheng Yu, Chen Tianlong
- EVALALIGN: Supervised Fine-tuning Multimodal Llms With Human-aligned Data For Evaluating Text-to-image Models Tan Zhiyu, Yang Xiaomeng, Qin Luozheng, Yang Mengping, Zhang Cheng, Li Hao
- Razorattention: Efficient KV Cache Compression Through Retrieval Heads Tang Hanlin, Lin Yang, Lin Jing, Han Qingsen, Hong Shikuan, Yao Yiwu, Wang Gongyi
- Llmbox: A Comprehensive Library For Large Language Models Tang Tianyi, Hu Yiwen, Li Bingqian, Luo Wenyang, Qin Zijing, Sun Haoxiang, Wang Jiapeng, Xu Shiyi, Cheng Xiaoxue, Guo Geyang, Peng Han, Zheng Bowen, Tang Yiru, Min Yingqian, Chen Yushuo, Chen Jie, Zhao Yuanqian, Ding Luran, Wang Yuhao, Dong Zican, Xia Chunxuan, Li Junyi, Zhou Kun, Zhao Wayne Xin, Wen Ji-rong
- Unleashing The Potential Of Large Language Models As Prompt Optimizers: An Analogical Analysis With Gradient-based Model Optimizers Tang Xinyu, Wang Xiaolei, Zhao Wayne Xin, Lu Siyuan, Li Yaliang, Wen Ji-rong
- Scaling Laws With Vocabulary: Larger Models Deserve Larger Vocabularies Tao Chaofan, Liu Qian, Dou Longxu, Muennighoff Niklas, Wan Zhongwei, Luo Ping, Lin Min, Wong Ngai
- Unlocking The Potential Of Model Merging For Low-resource Languages Tao Mingxu, Zhang Chen, Huang Quzhe, Ma Tianyao, Huang Songfang, Zhao Dongyan, Feng Yansong
- Exploring Design Choices For Building Language-specific Llms Tejaswi Atula, Gupta Nilesh, Choi Eunsol
- Embedding-aligned Language Models Tennenholtz Guy, Chow Yinlam, Hsu Chih-wei, Shani Lior, Liang Ethan, Boutilier Craig
- Training-free Consistent Text-to-image Generation Tewel Yoad, Kaduri Omri, Gal Rinon, Kasten Yoni, Wolf Lior, Chechik Gal, Atzmon Yuval
- Limits Of Transformer Language Models On Learning To Compose Algorithms Thomm Jonathan, Terzic Aleksandar, Camposampiero Giacomo, Hersche Michael, Schölkopf Bernhard, Rahimi Abbas
- Hydralora: An Asymmetric Lora Architecture For Efficient Fine-tuning Tian Chunlin, Shi Zhan, Guo Zhijiang, Li Li, Xu Chengzhong
- Untie The Knots: An Efficient Data Augmentation Strategy For Long-context Pre-training In Language Models Tian Junfeng, Zheng Da, Cheng Yang, Wang Rui, Zhang Colin, Zhang Debing
- Tinyllm: Learning A Small Student From Multiple Large Language Models Tian Yijun, Han Yikun, Chen Xiusi, Wang Wei, Chawla Nitesh V.
- UCCIX: Irish-excellence Large Language Model Tran Khanh-tung, O'sullivan Barry, Nguyen Hoang D.
- Code Less, Align More: Efficient LLM Fine-tuning For Code Generation With Data Pruning Tsai Yun-da, Liu Mingjie, Ren Haoxing
- Enhancing Inference Efficiency Of Large Language Models: Investigating Optimization Strategies And Architectural Innovations Tyukin Georgy
- Lifelong Robot Library Learning: Bootstrapping Composable And Generalizable Skills For Embodied Control With Language Models Tziafas Georgios, Kasaei Hamidreza
- Efficient LLM Context Distillation Upadhayayaya Rajesh, Smith Zachary, Kottmyer Chritopher, Osti Manish Raj
- Enhancing Model Performance: Another Approach To Vision-language Instruction Tuning Vedanshu, Tripathi Mm, Jaint Bhavnesh
- How Much Data Is Enough Data? Fine-tuning Large Language Models For In-house Translation: Performance Evaluation Across Multiple Dataset Sizes Vieira Inacio, Allred Will, Lankford Séamus, Castilho Sheila, Way Andy
- Leveraging Zero-shot Prompting For Efficient Language Model Distillation Vöge Lukas, Gurgul Vincent, Lessmann Stefan
- Investigating Mysteries Of Cot-augmented Distillation Wadhwa Somin, Amir Silvio, Wallace Byron C.
- Teach Better Or Show Smarter? On Instructions And Exemplars In Automatic Prompt Optimization Wan Xingchen, Sun Ruoxi, Nakhost Hootan, Arik Sercan O.
- LARR: Large Language Model Aided Real-time Scene Recommendation With Semantic Understanding Wan Zhizhong, Yin Bin, Xie Junjie, Jiang Fei, Li Xiang, Lin Wei
- LOOK-M: Look-once Optimization In KV Cache For Efficient Multimodal Long-context Inference Wan Zhongwei, Wu Ziang, Liu Che, Huang Jinfa, Zhu Zhihong, Jin Peng, Wang Longyue, Yuan Li
- Datavist5: A Pre-trained Language Model For Jointly Understanding Text And Data Visualization Wan Zhuoyue, Song Yuanfeng, Li Shuaimin, Zhang Chen Jason, Wong Raymond Chi-wing
- Grokked Transformers Are Implicit Reasoners: A Mechanistic Journey To The Edge Of Generalization Wang Boshi, Yue Xiang, Su Yu, Sun Huan
- BLSP-KD: Bootstrapping Language-speech Pre-training Via Knowledge Distillation Wang Chen, Liao Minpeng, Huang Zhongqiang, Zhang Jiajun
- Hybrid Alignment Training For Large Language Models Wang Chenglong, Zhou Hang, Chang Kaiyan, Li Bei, Mu Yongyu, Xiao Tong, Liu Tongran, Zhu Jingbo
- Instruction Tuning-free Visual Token Complement For Multimodal Llms Wang Dongsheng, Cui Jiequan, Li Miaoge, Lin Wang, Chen Bo, Zhang Hanwang
- Towards Efficient And Effective Unlearning Of Large Language Models For Recommendation Wang Hangyu, Lin Jianghao, Chen Bo, Yang Yang, Tang Ruiming, Zhang Weinan, Yu Yong
- Probing The Safety Response Boundary Of Large Language Models Via Unsafe Decoding Path Generation Wang Haoyu, Wu Bingzhe, Bian Yatao, Chang Yongzhe, Wang Xueqian, Zhao Peilin
- Mambabyte: Token-free Selective State Space Model Wang Junxiong, Gangavarapu Tushaar, Yan Jing Nathan, Rush Alexander M.
- Balancing Rigor And Utility: Mitigating Cognitive Biases In Large Language Models For Multiple-choice Questions Wang Liman, Zhong Hanyang, Cao Wenting, Sun Zeyuan
- Selective Forgetting: Advancing Machine Unlearning Techniques And Evaluation In Language Models Wang Lingzhi, Zeng Xingshan, Guo Jinsong, Wong Kam-fai, Gottlob Georg
- Interactive Dualchecker For Mitigating Hallucinations In Distilling Large Language Models Wang Meiyun, Suzuki Masahiro, Sakaji Hiroki, Izumi Kiyoshi
- Conversational Simulmt: Efficient Simultaneous Translation With Large Language Models Wang Minghan, Vu Thuy-trang, Wang Yuxia, Shareghi Ehsan, Haffari Gholamreza
- Reconstruct The Pruned Model Without Any Retraining Wang Pingjie, Fan Ziqing, Hu Shengchao, Chen Zhe, Wang Yanfeng, Wang Yu
- Automated Generation Of Multiple-choice Cloze Questions For Assessing English Vocabulary Using Gpt-turbo 3.5 Wang Qiao, Rose Ralph, Orita Naho, Sugawara Ayaka
- Unlearning With Control: Assessing Real-world Utility For Large Language Model Unlearning Wang Qizhou, Han Bo, Yang Puning, Zhu Jianing, Liu Tongliang, Sugiyama Masashi
- SUBLLM: A Novel Efficient Architecture With Token Sequence Subsampling For LLM Wang Quandong, Yuan Yuxuan, Yang Xiaoyu, Zhang Ruike, Zhao Kang, Liu Wei, Luan Jian, Povey Daniel, Wang Bin
- Role Prompting Guided Domain Adaptation With General Capability Preserve For Large Language Models Wang Rui, Mi Fei, Chen Yi, Xue Boyang, Wang Hongru, Zhu Qi, Wong Kam-fai, Xu Ruifeng
- Deconfounded Causality-aware Parameter-efficient Fine-tuning For Problem-solving Improvement Of Llms Wang Ruoyu, Li Xiaoxuan, Yao Lina
- Prolora: Partial Rotation Empowers More Parameter-efficient Lora Wang Sheng, Xue Boyang, Ye Jiacheng, Jiang Jiyue, Chen Liheng, Kong Lingpeng, Wu Chuan
- Using Advanced Llms To Enhance Smaller Llms: An Interpretable Knowledge Distillation Approach Wang Tong, Sudhir K., Hong Dat
- QCRD: Quality-guided Contrastive Rationale Distillation For Large Language Models Wang Wei, Li Zhaowei, Xu Qi, Cai Yiqing, Song Hang, Qi Qi, Zhou Ran, Huang Zhida, Wang Tao, Xiao Li
- Secokd: Aligning Large Language Models For In-context Learning With Fewer Shots Wang Weixing, Yang Haojin, Meinel Christoph
- Learning To Ask: When Llms Meet Unclear Instruction Wang Wenxuan, Shi Juluan, Wang Chaozheng, Lee Cheryl, Yuan Youliang, Huang Jen-tse, Lyu Michael R.
- Searching For Best Practices In Retrieval-augmented Generation Wang Xiaohua, Wang Zhenghua, Gao Xuan, Zhang Feiran, Wu Yixin, Xu Zhibo, Shi Tianyuan, Wang Zhengyuan, Li Shizheng, Qian Qi, Yin Ruicheng, Lv Changze, Zheng Xiaoqing, Huang Xuanjing
- Validating Llm-generated Programs With Metamorphic Prompt Testing Wang Xiaoyin, Zhu Dakai
- Longllava: Scaling Multi-modal Llms To 1000 Images Efficiently Via Hybrid Architecture Wang Xidong, Song Dingjie, Chen Shunian, Zhang Chen, Wang Benyou
- Rdrec: Rationale Distillation For Llm-based Recommendation Wang Xinfeng, Cui Jin, Suzuki Yoshimi, Fukumoto Fumiyo
- Coglm: Tracking Cognitive Development Of Large Language Models Wang Xinglin, Yuan Peiwen, Feng Shaoxiong, Li Yiwei, Pan Boyuan, Wang Heda, Hu Yao, Li Kan
- Defending Llms Against Jailbreaking Attacks Via Backtranslation Wang Yihan, Shi Zhouxing, Bai Andrew, Hsieh Cho-jui
- Uncertainty Aware Learning For Language Model Alignment Wang Yikun, Zheng Rui, Ding Liang, Zhang Qi, Lin Dahua, Tao Dacheng
- LSTP: Language-guided Spatial-temporal Prompt Learning For Long-form Video-text Understanding Wang Yuxuan, Wang Yueqian, Wu Pengfei, Liang Jianxin, Zhao Dongyan, Zheng Zilong
- Videollamb: Long-context Video Understanding With Recurrent Memory Bridges Wang Yuxuan, Xie Cihang, Liu Yang, Zheng Zilong
- Promptmm: Multi-modal Knowledge Distillation For Recommendation With Prompt-tuning Wei Wei, Tang Jiabin, Jiang Yangqin, Xia Lianghao, Huang Chao
- On-policy Fine-grained Knowledge Feedback For Hallucination Mitigation Wen Xueru, Lu Xinyu, Guan Xinyan, Lu Yaojie, Lin Hongyu, He Ben, Han Xianpei, Sun Le
- Detecting And Mitigating Hallucination In Large Vision Language Models Via Fine-grained AI Feedback Wenyi Xiao, Ziwei Huang, Leilei Gan, Wanggui He, Haoyuan Li, Zhelun Yu, Hao Jiang, Fei Wu, Linchao Zhu
- Codeultrafeedback: An Llm-as-a-judge Dataset For Aligning Large Language Models To Coding Preferences Weyssow Martin, Kamanda Aton, Sahraoui Houari
- Memory Is All You Need: An Overview Of Compute-in-memory Architectures For Accelerating Large Language Model Inference Wolters Christopher, Yang Xiaoxuan, Schlichtmann Ulf, Suzumura Toyotaro
- Core: Context-regularized Text Embedding Learning For Text-to-image Personalization Wu Feize, Pang Yun, Zhang Junyi, Pang Lianyu, Yin Jian, Zhao Baoquan, Li Qing, Mao Xudong
- GEB-1.3B: Open Lightweight Large Language Model Wu Jie, Zhu Yufeng, Shen Lei, Lu Xuqing
- Commit: Coordinated Instruction Tuning For Multimodal Large Language Models Wu Junda, Li Xintong, Yu Tong, Wang Yu, Chen Xiang, Gu Jiuxiang, Yao Lina, Shang Jingbo, Mcauley Julian
- \(\beta\)-dpo: Direct Preference Optimization With Dynamic \(\beta\) Wu Junkang, Xie Yuexiang, Yang Zhengyi, Wu Jiancan, Gao Jinyang, Ding Bolin, Wang Xiang, He Xiangnan
- Controlmllm: Training-free Visual Prompt Learning For Multimodal Large Language Models Wu Mingrui, Cai Xinyue, Ji Jiayi, Li Jiale, Huang Oucheng, Luo Gen, Fei Hao, Sun Xiaoshuai, Ji Rongrong
- Parallel Decoding Via Hidden Transfer For Lossless Large Language Model Acceleration Wu Pengfei, Liu Jiahao, Gong Zhuocheng, Wang Qifan, Li Jinpeng, Wang Jingang, Cai Xunliang, Zhao Dongyan
- GUI Action Narrator: Where And When Did That Action Take Place? Wu Qinchen, Gao Difei, Lin Kevin Qinghong, Wu Zhuoyu, Guo Xiangwu, Li Peiran, Zhang Weichen, Wang Hengxu, Shou Mike Zheng
- Not All Attention Is Needed: Parameter And Computation Efficient Transfer Learning For Multi-modal Large Language Models Wu Qiong, Ye Weihao, Zhou Yiyi, Sun Xiaoshuai, Ji Rongrong
- Avatar: Optimizing LLM Agents For Tool-assisted Knowledge Retrieval Wu Shirley, Zhao Shiyu, Huang Qian, Huang Kexin, Yasunaga Michihiro, Cao Kaidi, Ioannidis Vassilis N., Subbian Karthik, Leskovec Jure, Zou James
- Videollm-mod: Efficient Video-language Streaming With Mixture-of-depths Vision Computation Wu Shiwei, Chen Joya, Lin Kevin Qinghong, Wang Qimeng, Gao Yan, Xu Qianli, Xu Tong, Hu Yao, Chen Enhong, Shou Mike Zheng
- Enhance Reasoning For Large Language Models In The Game Werewolf Wu Shuang, Zhu Liwen, Yang Tao, Xu Shiwei, Fu Qiang, Wei Yang, Fu Haobo
- Enhancing Q-learning With Large Language Model Heuristics Wu Xiefeng
- Difflora: Generating Personalized Low-rank Adaptation Weights With Diffusion Wu Yujia, Shi Yiming, Wei Jiwei, Sun Chengwei, Zhou Yuyang, Yang Yang, Shen Heng Tao
- Inverse-q*: Token Level Reinforcement Learning For Aligning Large Language Models Without Preference Data Xia Han, Gao Songyang, Ge Qiming, Xi Zhiheng, Zhang Qi, Huang Xuanjing
- FP6-LLM: Efficiently Serving Large Language Models Through Fp6-centric Algorithm-system Co-design Xia Haojun, Zheng Zhen, Wu Xiaoxia, Chen Shiyang, Yao Zhewei, Youn Stephen, Bakhtiari Arash, Wyatt Michael, Zhuang Donglin, Zhou Zhongzhu, Ruwase Olatunji, He Yuxiong, Song Shuaiwen Leon
- Unlocking Efficiency In Large Language Model Inference: A Comprehensive Survey Of Speculative Decoding Xia Heming, Yang Zhe, Dong Qingxiu, Wang Peiyi, Li Yongqi, Ge Tao, Liu Tianyu, Li Wenjie, Sui Zhifang
- Clover-2: Accurate Inference For Regressive Lightweight Speculative Decoding Xiao Bin, Gui Lujun, Su Lei, Chen Weipeng
- Clover: Regressive Lightweight Speculative Decoding With Sequential Knowledge Xiao Bin, Shi Chunan, Nie Xiaonan, Yang Fan, Deng Xiangwei, Su Lei, Chen Weipeng, Cui Bin
- EMP: Enhance Memory In Data Pruning Xiao Jinying, Li Ping, Nie Jie, Tang Zhe
- Palm2-vadapter: Progressively Aligned Language Model Makes A Strong Vision-language Adapter Xiao Junfei, Xu Zheng, Yuille Alan, Yan Shen, Wang Boyu
- Advancing Multimodal Large Language Models With Quantization-aware Scale Learning For Efficient Adaptation Xie Jingjing, Zhang Yuxin, Lin Mingbao, Cao Liujuan, Ji Rongrong
- Codeshell Technical Report Xie Rui, Zeng Zhengran, Yu Zhuohao, Gao Chang, Zhang Shikun, Ye Wei
- Exploratory Preference Optimization: Harnessing Implicit Q*-approximation For Sample-efficient RLHF Xie Tengyang, Foster Dylan J., Krishnamurthy Akshay, Rosset Corby, Awadallah Ahmed, Rakhlin Alexander
- Monte Carlo Tree Search Boosts Reasoning Via Iterative Preference Learning Xie Yuxi, Goyal Anirudh, Zheng Wenyue, Kan Min-yen, Lillicrap Timothy P., Kawaguchi Kenji, Shieh Michael
- Llm-enhanced User-item Interactions: Leveraging Edge Information For Optimized Recommendations Xinyuan Wang, Liang Wu, Liangjie Hong, Hao Liu, Yanjie Fu
- Large Language Models Can Learn Temporal Reasoning Xiong Siheng, Payani Ali, Kompella Ramana, Fekri Faramarz
- Watch Every Step! LLM Agent Learning Via Iterative Step-level Process Refinement Xiong Weimin, Song Yifan, Zhao Xiutian, Wu Wenhao, Wang Xun, Wang Ke, Li Cheng, Peng Wei, Li Sujian
- Temporal Scaling Law For Large Language Models Xiong Yizhe, Chen Xiansheng, Ye Xin, Chen Hui, Lin Zijia, Lian Haoran, Su Zhenpeng, Niu Jianwei, Ding Guiguang
- Mixture-of-instructions: Comprehensive Alignment Of A Large Language Model Through The Mixture Of Diverse System Prompting Instructions Xu Bowen, Wu Shaoyu, Liu Kai, Hu Lulu
- Enhancing Retrieval-augmented Lms With A Two-stage Consistency Learning Compressor Xu Chuankai, Zhao Dongming, Wang Bo, Xing Hanwen
- Aligning Large Language Models Via Fine-grained Supervision Xu Dehong, Qiu Liang, Kim Minseok, Ladhak Faisal, Do Jaeyoung
- Large Language Models For Education: A Survey Xu Hanyi, Gan Wensheng, Qi Zhenlian, Wu Jiayang, Yu Philip S.
- Contrastive Preference Optimization: Pushing The Boundaries Of LLM Performance In Machine Translation Xu Haoran, Sharaf Amr, Chen Yunmo, Tan Weiting, Shen Lingfeng, Van Durme Benjamin, Murray Kenton, Kim Young Jin
- Redagent: Red Teaming Large Language Models With Context-aware Autonomous Language Agent Xu Huiyu, Zhang Wenhui, Wang Zhibo, Xiao Feng, Zheng Rui, Feng Yunhe, Ba Zhongjie, Ren Kui
- On-device Language Models: A Comprehensive Review Xu Jiajun, Li Zhiyuan, Chen Wei, Wang Qun, Gao Xin, Cai Qi, Ling Ziyuan
- A General-purpose Device For Interaction With Llms Xu Jiajun, Wang Qun, Cao Yuhang, Zeng Baitao, Liu Sicheng
- Faithful Logical Reasoning Via Symbolic Chain-of-thought Xu Jundong, Fei Hao, Pan Liangming, Liu Qian, Lee Mong-li, Hsu Wynne
- Accelerating Large Language Model Training With Hybrid Gpu-based Compression Xu Lang, Anthony Quentin, Zhou Qinghua, Alnaasan Nawras, Gulhane Radha R., Shafi Aamir, Subramoni Hari, Panda Dhabaleswar K.
- BESA: Pruning Large Language Models With Blockwise Parameter-efficient Sparsity Allocation Xu Peng, Shao Wenqi, Chen Mengzhao, Tang Shitao, Zhang Kaipeng, Gao Peng, An Fengwei, Qiao Yu, Luo Ping
- Is DPO Superior To PPO For LLM Alignment? A Comprehensive Study Xu Shusheng, Fu Wei, Gao Jiaxuan, Ye Wenjie, Liu Weilin, Mei Zhiyu, Wang Guangju, Yu Chao, Wu Yi
- Chatglm-math: Improving Math Problem-solving In Large Language Models With A Self-critique Pipeline Xu Yifan, Liu Xiao, Liu Xinghan, Hou Zhenyu, Li Yueyan, Zhang Xiaohan, Wang Zihan, Zeng Aohan, Du Zhengxiao, Zhao Wenyi, Tang Jie, Dong Yuxiao
- Linkprompt: Natural And Universal Adversarial Attacks On Prompt-based Language Models Xu Yue, Wang Wenjie
- Think: Thinner Key Cache By Query-driven Pruning Xu Yuhui, Jie Zhanming, Dong Hanze, Wang Lei, Lu Xudong, Zhou Aojun, Saha Amrita, Xiong Caiming, Sahoo Doyen
- Magpie: Alignment Data Synthesis From Scratch By Prompting Aligned Llms With Nothing Xu Zhangchen, Jiang Fengqing, Niu Luyao, Deng Yuntian, Poovendran Radha, Choi Yejin, Lin Bill Yuchen
- Beyond Perplexity: Multi-dimensional Safety Evaluation Of LLM Compression Xu Zhichao, Gupta Ashim, Li Tao, Bentham Oliver, Srikumar Vivek
- LEIA: Facilitating Cross-lingual Knowledge Transfer In Language Models With Entity-based Data Augmentation Yamada Ikuya, Ri Ryokan
- An Empirical Study On Cross-lingual Vocabulary Adaptation For Efficient Language Model Inference Yamaguchi Atsuki, Villavicencio Aline, Aletras Nikolaos
- Vocabulary Expansion For Low-resource Cross-lingual Transfer Yamaguchi Atsuki, Villavicencio Aline, Aletras Nikolaos
- Mirror: A Multiple-perspective Self-reflection Method For Knowledge-rich Reasoning Yan Hanqi, Zhu Qinglin, Wang Xinyu, Gui Lin, He Yulan
- Predicting The Big Five Personality Traits In Chinese Counselling Dialogues Using Large Language Models Yan Yang, Ma Lizhi, Li Anqi, Ma Jingsong, Lan Zhenzhong
- Exploring The LLM Journey From Cognition To Expression With Linear Representations Yan Yuzi, Li Jialian, Zhang Yipin, Yan Dong
- 3d-properties: Identifying Challenges In DPO And Charting A Path Forward Yan Yuzi, Miao Yibo, Li Jialian, Zhang Yipin, Xie Jian, Deng Zhijie, Yan Dong
- Advancing Genai Assisted Programming--a Comparative Study On Prompt Efficiency And Code Quality Between GPT-4 And GLM-4 Yang Angus, Li Zehan, Li Jie
- IM-RAG: Multi-round Retrieval-augmented Generation Through Learning Inner Monologues Yang Diji, Rao Jinmeng, Chen Kezhen, Guo Xiaoyuan, Zhang Yawen, Yang Jie, Zhang Yi
- Pyramidinfer: Pyramid KV Cache Compression For High-throughput LLM Inference Yang Dongjie, Han Xiaodong, Gao Yan, Hu Yao, Zhang Shilin, Zhao Hai
- Asynchronous And Segmented Bidirectional Encoding For NMT Yang Jingpu, Han Zehua, Xiang Mengyu, Wang Helin, Huang Yuxiao, Fang Miao
- No Token Left Behind: Reliable KV Cache Compression Via Importance-aware Mixed Precision Quantization Yang June Yong, Kim Byeongwook, Bae Jeongin, Kwon Beomseok, Park Gunho, Yang Eunho, Kwon Se Jung, Lee Dongsoo
- Buffer Of Thoughts: Thought-augmented Reasoning With Large Language Models Yang Ling, Yu Zhaochen, Zhang Tianjun, Cao Shiyi, Xu Minkai, Zhang Wentao, Gonzalez Joseph E., Cui Bin
- Dual-phase Accelerated Prompt Optimization Yang Muchen, Li Moxin, Li Yongle, Chen Zijun, Gao Chongming, Zhang Junqi, Li Yangyang, Feng Fuli
- Regularizing Hidden States Enables Learning Generalizable Reward Model For Llms Yang Rui, Ding Ruomeng, Lin Yong, Zhang Huan, Zhang Tong
- Is Crowdsourcing Breaking Your Bank? Cost-effective Fine-tuning Of Pre-trained Language Models With Proximal Policy Optimization Yang Shuo, Kasneci Gjergji
- Adazeta: Adaptive Zeroth-order Tensor-train Adaption For Memory-efficient Large Language Models Fine-tuning Yang Yifan, Zhen Kai, Banijamal Ershad, Mouchtaris Athanasios, Zhang Zheng
- Loretta: Low-rank Economic Tensor-train Adaptation For Ultra-low-parameter Fine-tuning Of Large Language Models Yang Yifan, Zhou Jiajun, Wong Ngai, Zhang Zheng
- P3: A Policy-driven, Pace-adaptive, And Diversity-promoted Framework For Optimizing LLM Training Yang Yingxuan, Wang Huayi, Wen Muning, Zhang Weinan
- Weak-to-strong Reasoning Yang Yuqing, Ma Yan, Liu Pengfei
- Self-distillation Bridges Distribution Gap In Language Model Fine-tuning Yang Zhaorui, Pang Tianyu, Feng Haozhe, Wang Han, Chen Wei, Zhu Minfeng, Liu Qian
- An Empirical Analysis Of Compute-optimal Inference For Problem-solving With Language Models Yangzhen Wu, Zhiqing Sun, Shanda Li, Sean Welleck, Yiming Yang
- Exploiting Inter-layer Expert Affinity For Accelerating Mixture-of-experts Model Inference Yao Jinghan, Anthony Quentin, Shafi Aamir, Subramoni Hari, Panda Dhabaleswar K.
- Training Ultra Long Context Language Model With Fully Pipelined Distributed Transformer Yao Jinghan, Jacobs Sam Ade, Tanaka Masahiro, Ruwase Olatunji, Shafi Aamir, Subramoni Hari, Panda Dhabaleswar K.
- Deco: Decoupling Token Compression From Semantic Abstraction In Multimodal Large Language Models Yao Linli, Li Lei, Ren Shuhuai, Wang Lean, Liu Yuanxin, Sun Xu, Hou Lu
- Enhancing In-context Learning Performance With Just Svd-based Weight Pruning: A Theoretical Perspective Yao Xinhao, Hu Xiaolin, Yang Shenzhi, Liu Yong
- The Impact Of Quantization On Retrieval-augmented Generation: An Analysis Of Small Llms Yazan Mert, Verberne Suzan, Situmeang Frederik
- R^2AG: Incorporating Retrieval Information Into Retrieval Augmented Generation Ye Fuda, Li Shuangyin, Zhang Yongqi, Chen Lei
- Preference-guided Reflective Sampling For Aligning Language Models Ye Hai, Ng Hwee Tou
- Self-judge: Selective Instruction Following With Alignment Self-evaluation Ye Hai, Ng Hwee Tou
- Large Language Models As Hyper-heuristics For Combinatorial Optimization Ye Haoran, Wang Jiarui, Cao Zhiguang, Berto Federico, Hua Chuanbo, Kim Haeyeon, Park Jinkyoo, Song Guojie
- Generation Meets Verification: Accelerating Large Language Model Inference With Smart Parallel Auto-correct Decoding Yi Hanling, Lin Feng, Li Hongbin, Ning Peiyang, Yu Xiaotian, Xiao Rong
- MEND: Meta Demonstration Distillation For Efficient And Effective In-context Learning Yichuan Li, Xiyao Ma, Sixing Lu, Kyumin Lee, Xiaohu Liu, Chenlei Guo
- Stablemask: Refining Causal Masking In Decoder-only Transformer Yin Qingyu, He Xuzheng, Zhuang Xiang, Zhao Yu, Yao Jianhua, Shen Xiaoyu, Zhang Qiang
- Relative Preference Optimization: Enhancing LLM Alignment Through Contrasting Responses Across Identical And Diverse Prompts Yin Yueqin, Wang Zhendong, Gu Yi, Huang Hai, Chen Weizhu, Zhou Mingyuan
- Unveiling The Safety Of Gpt-4o: An Empirical Study Using Jailbreak Attacks Ying Zonghao, Liu Aishan, Liu Xianglong, Tao Dacheng
- CSRT: Evaluation And Analysis Of Llms Using Code-switching Red-teaming Dataset Yoo Haneul, Yang Yongjin, Lee Hwaran
- By My Eyes: Grounding Multimodal Large Language Models With Sensor Data Via Visual Prompting Yoon Hyungjun, Tolera Biniyam Aschalew, Gong Taesik, Lee Kimin, Lee Sung-ju
- Matryoshka-adaptor: Unsupervised And Supervised Tuning For Smaller Embedding Dimensions Yoon Jinsung, Sinha Raj, Arik Sercan O, Pfister Tomas
- Making The Most Of Your Model: Methods For Finetuning And Applying Pretrained Transformers Yoshida Davis
- Tree-planted Transformers: Unidirectional Transformer Language Models With Implicit Syntactic Supervision Yoshida Ryo, Someya Taiga, Oseki Yohei
- When Linear Attention Meets Autoregressive Decoding: Towards More Effective And Efficient Linearized Large Language Models You Haoran, Fu Yichao, Wang Zheng, Yazdanbakhsh Amir, Lin Yingyan Celine
- Shiftaddllm: Accelerating Pretrained Llms Via Post-training Multiplication-less Reparameterization You Haoran, Guo Yipin, Fu Yichao, Zhou Wei, Shi Huihong, Zhang Xiaofan, Kundu Souvik, Yazdanbakhsh Amir, Lin Yingyan Celine
- Distilling System 2 Into System 1 Yu Ping, Xu Jing, Weston Jason, Kulikov Ilia
- Remamba: Equip Mamba With Effective Long-sequence Modeling Yuan Danlong, Liu Jiahao, Li Bei, Zhang Huishuai, Wang Jingang, Cai Xunliang, Zhao Dongyan
- KV Cache Compression, But What Must We Give In Return? A Comprehensive Benchmark Of Long Context Capable Approaches Yuan Jiayi, Liu Hongyi, Zhong Shaochen (Henry), Chuang Yu-neng, Li Songchen, Wang Guanchu, Le Duy, Jin Hongye, Chaudhary Vipin, Xu Zhaozhuo, Liu Zirui, Hu Xia
- Why Lift So Heavy? Slimming Large Language Models By Cutting Off The Layers Yuan Shuzhou, Nie Ercong, Ma Bolei, Färber Michael
- A Hybrid RAG System With Comprehensive Enhancement On Complex Reasoning Yuan Ye, Liu Chengwu, Yuan Jingyang, Sun Gongbo, Li Siqi, Zhang Ming
- Refuse Whenever You Feel Unsafe: Improving Safety In Llms Via Decoupled Refusal Training Yuan Youliang, Jiao Wenxiang, Wang Wenxuan, Huang Jen-tse, Xu Jiahao, Liang Tian, He Pinjia, Tu Zhaopeng
- Rigorllm: Resilient Guardrails For Large Language Models Against Undesired Content Yuan Zhuowen, Xiong Zidi, Zeng Yi, Yu Ning, Jia Ruoxi, Song Dawn, Li Bo
- Mammoth2: Scaling Instructions From The Web Yue Xiang, Zheng Tuney, Zhang Ge, Chen Wenhu
- Distilling Instruction-following Abilities Of Large Language Models With Task-aware Curriculum Planning Yue Yuanhao, Wang Chengyu, Huang Jun, Wang Peng
- Wkvquant: Quantizing Weight And Key/value Cache For Large Language Models Gains More Yue Yuxuan, Yuan Zhihang, Duanmu Haojie, Zhou Sifan, Wu Jianlong, Nie Liqiang
- Can Small Language Models Be Good Reasoners For Sequential Recommendation? Yuling Wang, Changxin Tian, Binbin Hu, Yanhua Yu, Ziqi Liu, Zhiqiang Zhang, Jun Zhou, Liang Pang, Xiao Wang
- Focus On The Core: Efficient Attention Via Pruned Token Compression For Document Classification Yun Jungmin, Kim Mihyeon, Kim Youngbin
- Toward Inference-optimal Mixture-of-expert Large Language Models Yun Longfei, Zhuang Yonghao, Fu Yao, Xing Eric P, Zhang Hao
- Design Proteins Using Large Language Models: Enhancements And Comparative Analyses Zeinalipour Kamyar, Jamshidi Neda, Bianchini Monica, Maggini Marco, Gori Marco
- ABQ-LLM: Arbitrary-bit Quantized Inference Acceleration For Large Language Models Zeng Chao, Liu Songwei, Xie Yusheng, Liu Hong, Wang Xiaojian, Wei Miao, Yang Shu, Chen Fangmin, Mei Xing
- Multilingual Brain Surgeon: Large Language Models Can Be Compressed Leaving No Language Behind Zeng Hongchuan, Xu Hongshen, Chen Lu, Yu Kai
- Skywork-math: Data Scaling Laws For Mathematical Reasoning In Large Language Models -- The Story Goes On Zeng Liang, Zhong Liangjun, Zhao Liang, Wei Tianwen, Yang Liu, He Jujie, Cheng Cheng, Hu Rui, Liu Yang, Yan Shuicheng, Fang Han, Zhou Yahui
- Token-level Direct Preference Optimization Zeng Yongcheng, Liu Guoqing, Ma Weiyu, Yang Ning, Zhang Haifeng, Wang Jun
- Learning Reward For Robot Skills Using Large Language Models Via Self-alignment Zeng Yuwei, Mu Yao, Shao Lin
- Chimera: A Lossless Decoding Method For Accelerating Large Language Models Inference By Fusing All Tokens Zeng Ziqian, Yu Jiahong, Pang Qianshi, Wang Zihao, Zhuang Huiping, Shao Hongen, Zou Xiaofeng
- Unveiling The Lexical Sensitivity Of Llms: Combinatorial Optimization For Prompt Enhancement Zhan Pengwei, Xu Zhen, Tan Qian, Song Jie, Xie Ru
- Distilling Implicit Multimodal Knowledge Into Llms For Zero-resource Dialogue Generation Zhang Bo, Ma Hui, Ding Jian, Wang Jian, Xu Bo, Lin Hongfei
- Extracting Prompts By Inverting LLM Outputs Zhang Collin, Morris John X., Shmatikov Vitaly
- Speechgpt-gen: Scaling Chain-of-information Speech Generation Zhang Dong, Zhang Xin, Zhan Jun, Li Shimin, Zhou Yaqian, Qiu Xipeng
- ALTER: Augmentation For Large-table-based Reasoning Zhang Han, Ma Yuheng, Yang Hanfang
- Language Model Prompt Selection Via Simulation Optimization Zhang Haoting, He Jinghai, Righter Rhonda, Zheng Zeyu
- Sinklora: Enhanced Efficiency And Chat Capabilities For Long-context Large Language Models Zhang Hengyu
- Knowhalu: Hallucination Detection Via Multi-form Knowledge Based Factual Checking Zhang Jiawei, Xu Chejian, Gai Yu, Lecue Freddy, Song Dawn, Li Bo
- Onegen: Efficient One-pass Unified Generation And Retrieval For Llms Zhang Jintian, Peng Cheng, Sun Mengshu, Chen Xiang, Liang Lei, Zhang Zhiqiang, Zhou Jun, Chen Huajun, Zhang Ningyu
- Unimot: Unified Molecule-text Language Model With Discrete Token Representation Zhang Juzheng, Bian Yatao, Chen Yongqiang, Yao Quanming
- HARE: Human Priors, A Key To Small Language Model Efficiency Zhang Lingyun, Jin Bin, Ge Gaojian, Liu Lunhui, Shen Xuewen, Wu Mingyong, Zhang Houqian, Jiang Yongneng, Chen Shiqi, Pu Shi
- Enhancing Multi-hop Reasoning Through Knowledge Erasure In Large Language Model Editing Zhang Mengqi, Fang Bowen, Liu Qiang, Ren Pengjie, Wu Shu, Chen Zhumin, Wang Liang
- Automated Multi-level Preference For Mllms Zhang Mengxi, Wu Wenhao, Lu Yu, Song Yuxin, Rong Kang, Yao Huanjin, Zhao Jianbo, Liu Fanglong, Sun Yifan, Feng Haocheng, Wang Jingdong
- The Hedgehog & The Porcupine: Expressive Linear Attentions With Softmax Mimicry Zhang Michael, Bhatia Kush, Kumbong Hermann, Ré Christopher
- Transfertod: A Generalizable Chinese Multi-domain Task-oriented Dialogue System With Transfer Capabilities Zhang Ming, Huang Caishuang, Wu Yilong, Liu Shichun, Zheng Huiyuan, Dong Yurui, Shen Yujiong, Dou Shihan, Zhao Jun, Ye Junjie, Zhang Qi, Gui Tao, Huang Xuanjing
- Instructedit: Instruction-based Knowledge Editing For Large Language Models Zhang Ningyu, Tian Bozhong, Cheng Siyuan, Liang Xiaozhuan, Hu Yi, Xue Kouying, Gou Yanjie, Chen Xi, Chen Huajun
- Soaring From 4K To 400K: Extending Llm's Context With Activation Beacon Zhang Peitian, Liu Zheng, Xiao Shitao, Shao Ninglu, Ye Qiwei, Dou Zhicheng
- Tinyllama: An Open-source Small Language Model Zhang Peiyuan, Zeng Guangtao, Wang Tianduo, Lu Wei
- Plad: Preference-based Large Language Model Distillation With Pseudo-preference Pairs Zhang Rongzhi, Shen Jiaming, Liu Tianqi, Wang Haorui, Qin Zhen, Han Feng, Liu Jialu, Baumgartner Simon, Bendersky Michael, Zhang Chao
- Direct Preference Optimization Of Video Large Multimodal Models From Language Model Reward Zhang Ruohong, Gui Liangke, Sun Zhiqing, Feng Yihao, Xu Keyang, Zhang Yuanhan, Fu Di, Li Chunyuan, Hauptmann Alexander, Bisk Yonatan, Yang Yiming
- Promptfix: Few-shot Backdoor Removal Via Adversarial Prompt Tuning Zhang Tianrong, Xi Zhaohan, Wang Ting, Mitra Prasenjit, Chen Jinghui
- Leanquant: Accurate Large Language Model Quantization With Loss-error-aware Grid Zhang Tianyi, Shrivastava Anshumali
- Revisiting OPRO: The Limitations Of Small-scale Llms As Optimizers Zhang Tuo, Yuan Jinyue, Avestimehr Salman
- Tired Of Plugins? Large Language Models Can Be End-to-end Recommenders Zhang Wenlin, Wu Chuhan, Li Xiangyang, Wang Yuhao, Dong Kuicai, Wang Yichao, Dai Xinyi, Zhao Xiangyu, Guo Huifeng, Tang Ruiming
- Agent-pro: Learning To Evolve Via Policy-level Reflection And Optimization Zhang Wenqi, Tang Ke, Wu Hai, Wang Mengna, Shen Yongliang, Hou Guiyang, Tan Zeqi, Li Peng, Zhuang Yueting, Lu Weiming
- Dpp-based Adversarial Prompt Searching For Language Models Zhang Xu, Wan Xiaojun
- Contrasolver: Self-alignment Of Language Models By Resolving Internal Preference Contradictions Zhang Xu, Yin Xunjian, Wan Xiaojun
- Chain Of Preference Optimization: Improving Chain-of-thought Reasoning In Llms Zhang Xuan, Du Chao, Pang Tianyu, Liu Qian, Gao Wei, Lin Min
- Glape: Gold Label-agnostic Prompt Evaluation And Optimization For Large Language Model Zhang Xuanchang, Zhang Zhuosheng, Zhao Hai
- Finercut: Finer-grained Interpretable Layer Pruning For Large Language Models Zhang Yang, Li Yawei, Wang Xinpeng, Shen Qianli, Plank Barbara, Bischl Bernd, Rezaei Mina, Kawaguchi Kenji
- Wings: Learning Multimodal Llms Without Text-only Forgetting Zhang Yi-kai, Lu Shiyin, Li Yang, Ma Yanqing, Chen Qing-guo, Xu Zhao, Luo Weihua, Zhang Kaifu, Zhan De-chuan, Ye Han-jia
- ELAD: Explanation-guided Large Language Models Active Distillation Zhang Yifei, Pan Bo, Ling Chen, Hu Yuntong, Zhao Liang
- Boosting Jailbreak Attack With Momentum Zhang Yihao, Wei Zeming
- MLKD-BERT: Multi-level Knowledge Distillation For Pre-trained Language Models Zhang Ying, Yang Ziheng, Ji Shufan
- Diversifying The Expert Knowledge For Task-agnostic Pruning In Sparse Mixture-of-experts Zhang Zeliang, Liu Xiaodong, Cheng Hao, Xu Chenliang, Gao Jianfeng
- Get Confused Cautiously: Textual Sequence Memorization Erasure With Selective Entropy Maximization Zhang Zhaohan, Liu Ziquan, Patras Ioannis
- Glitchprober: Advancing Effective Detection And Mitigation Of Glitch Tokens In Large Language Models Zhang Zhibo, Bai Wuxia, Li Yuxi, Meng Mark Huasong, Wang Kailong, Shi Ling, Li Li, Wang Jun, Wang Haoyu
- APT: Adaptive Pruning And Tuning Pretrained Language Models For Efficient Training And Inference Zhao Bowen, Hajishirzi Hannaneh, Cao Qingqing
- Large Language Model Can Continue Evolving From Mistakes Zhao Haokun, Han Haixia, Shi Jie, Du Chengyu, Liang Jiaqing, Xiao Yanghua
- Revolutionizing Finance With Llms: An Overview Of Applications And Insights Zhao Huaqin, Liu Zhengliang, Wu Zihao, Li Yiwei, Yang Tianze, Shu Peng, Xu Shaochen, Dai Haixing, Zhao Lin, Mai Gengchen, Liu Ninghao, Liu Tianming
- Galore: Memory-efficient LLM Training By Gradient Low-rank Projection Zhao Jiawei, Zhang Zhenyu, Chen Beidi, Wang Zhangyang, Anandkumar Anima, Tian Yuandong
- Longskywork: A Training Recipe For Efficiently Extending Context Length In Large Language Models Zhao Liang, Wei Tianwen, Zeng Liang, Cheng Cheng, Yang Liu, Cheng Peng, Wang Lijie, Li Chenxia, Wu Xuejie, Zhu Bo, Gan Yimeng, Hu Rui, Yan Shuicheng, Fang Han, Zhou Yahui
- Aqulia-med LLM: Pioneering Full-process Open-source Medical Language Models Zhao Lulu, Zeng Weihao, Shi Xiaofeng, Zhou Hua, Hao Donglin, Lin Yonghua
- Prepacking: A Simple Method For Fast Prefilling And Increased Throughput In Large Language Models Zhao Siyan, Israel Daniel, Broeck Guy Van Den, Grover Aditya
- Ouroboros: Generating Longer Drafts Phrase By Phrase For Faster Speculative Decoding Zhao Weilin, Huang Yuxiang, Han Xu, Xu Wang, Xiao Chaojun, Zhang Xinrong, Fang Yewei, Zhang Kaihuo, Liu Zhiyuan, Sun Maosong
- SWIFT: A Scalable Lightweight Infrastructure For Fine-tuning Zhao Yuze, Huang Jintao, Hu Jinghan, Wang Xingjun, Mao Yunlin, Zhang Daoze, Jiang Zeyinzi, Wu Zhikai, Ai Baole, Wang Ang, Zhou Wenmeng, Chen Yingda
- Adversarial Contrastive Decoding: Boosting Safety Alignment Of Large Language Models Via Opposite Prompt Optimization Zhao Zhengyue, Zhang Xiaoyun, Xu Kaidi, Hu Xing, Zhang Rui, Du Zidong, Guo Qi, Chen Yunji
- Chemdfm: Dialogue Foundation Model For Chemistry Zhao Zihan, Ma Da, Chen Lu, Sun Liangtai, Li Zihao, Xu Hongshen, Zhu Zichen, Zhu Su, Fan Shuai, Shen Guodong, Chen Xin, Yu Kai
- ICE-GRT: Instruction Context Enhancement By Generative Reinforcement Based Transformers Zheng Chen, Sun Ke, Tang Da, Ma Yukun, Zhang Yuyu, Xi Chenguang, Zhou Xun
- On Prompt-driven Safeguarding For Large Language Models Zheng Chujie, Yin Fan, Zhou Hao, Meng Fandong, Zhou Jie, Chang Kai-wei, Huang Minlie, Peng Nanyun
- Learn To Be Efficient: Build Structured Sparsity In Large Language Models Zheng Haizhong, Bai Xiaoyan, Liu Xueshen, Mao Z. Morley, Chen Beidi, Lai Fan, Prakash Atul
- PAS: Data-efficient Plug-and-play Prompt Augmentation System Zheng Miao, Liang Hao, Yang Fan, Sun Haoze, Li Tianpeng, Xiong Lingchu, Zhang Yan, Wu Youzhen, Li Kun, Shen Yanjun, Lin Mingan, Zhang Tao, Dong Guosheng, Qiao Yujing, Fang Kun, Chen Weipeng, Cui Bin, Zhang Wentao, Zhou Zenan
- More-3s: Multimodal-based Offline Reinforcement Learning With Shared Semantic Spaces Zheng Tianyu, Zhang Ge, Qu Xingwei, Kuang Ming, Huang Stephen W., He Zhaofeng
- Llamafactory: Unified Efficient Fine-tuning Of 100+ Language Models Zheng Yaowei, Zhang Richong, Zhang Junhao, Ye Yanhan, Luo Zheyan, Feng Zhangchi, Ma Yongqiang
- Training Large Language Models For Reasoning Through Reverse Curriculum Reinforcement Learning Zhiheng Xi, Wenxiang Chen, Boyang Hong, Senjie Jin, Rui Zheng, Wei He, Yiwen Ding, Shichun Liu, Xin Guo, Junzhe Wang, Honglin Guo, Wei Shen, Xiaoran Fan, Yuhao Zhou, Shihan Dou, Xiao Wang, Xinbo Zhang, Peng Sun, Tao Gui, Qi Zhang, Xuanjing Huang
- Revisiting Knowledge Distillation For Autoregressive Language Models Zhong Qihuang, Ding Liang, Shen Li, Liu Juhua, Du Bo, Tao Dacheng
- Moextend: Tuning New Experts For Modality And Task Extension Zhong Shanshan, Gao Shanghua, Huang Zhongzhan, Wen Wushao, Zitnik Marinka, Zhou Pan
- Robust Prompt Optimization For Defending Language Models Against Jailbreaking Attacks Zhou Andy, Li Bo, Wang Haohan
- Navgpt-2: Unleashing Navigational Reasoning Capability For Large Vision-language Models Zhou Gengze, Hong Yicong, Wang Zun, Wang Xin Eric, Wu Qi
- Fairer Preferences Elicit Improved Human-aligned Large Language Model Judgments Zhou Han, Wan Xingchen, Liu Yinhong, Collier Nigel, Vulić Ivan, Korhonen Anna
- Sequence To Sequence Reward Modeling: Improving RLHF By Language Feedback Zhou Jiayi, Ji Jiaming, Dai Juntao, Yang Yaodong
- Towards Democratizing Multilingual Large Language Models For Medicine Through A Two-stage Instruction Fine-tuning Approach Zhou Meng, Parmar Surajsinh, Bhatti Anubhav
- Sirius: Contextual Sparsity With Correction For Efficient Llms Zhou Yang, Chen Zhuoming, Xu Zhaozhuo, Lin Victoria, Chen Beidi
- Archer: Training Language Model Agents Via Hierarchical Multi-turn RL Zhou Yifei, Zanette Andrea, Pan Jiayi, Levine Sergey, Kumar Aviral
- Shared Imagination: Llms Hallucinate Alike Zhou Yilun, Xiong Caiming, Savarese Silvio, Wu Chien-sheng
- Aligning Modalities In Vision Large Language Models Via Preference Fine-tuning Zhou Yiyang, Cui Chenhang, Rafailov Rafael, Finn Chelsea, Yao Huaxiu
- Calibrated Self-rewarding Vision Language Models Zhou Yiyang, Fan Zhiyuan, Cheng Dongjie, Yang Sihan, Chen Zhaorun, Cui Chenhang, Wang Xiyao, Li Yun, Zhang Linjun, Yao Huaxiu
- Multi-stage Balanced Distillation: Addressing Long-tail Challenges In Sequence-level Knowledge Distillation Zhou Yuhang, Zhu Jing, Xu Paiheng, Liu Xiaoyu, Wang Xiyao, Koutra Danai, Ai Wei, Huang Furong
- Metagpt: Merging Large Language Models Using Model Exclusive Task Arithmetic Zhou Yuyan, Song Liang, Wang Bingning, Chen Weipeng
- A Survey On Efficient Inference For Large Language Models Zhou Zixuan, Ning Xuefei, Hong Ke, Fu Tianyu, Xu Jiaming, Li Shiyao, Lou Yuming, Wang Luning, Yuan Zhihang, Li Xiuhong, Yan Shengen, Dai Guohao, Zhang Xiao-ping, Dong Yuhan, Wang Yu
- Lifelong Personalized Low-rank Adaptation Of Large Language Models For Recommendation Zhu Jiachen, Lin Jianghao, Dai Xinyi, Chen Bo, Shan Rong, Zhu Jieming, Tang Ruiming, Yu Yong, Zhang Weinan
- Cost: Contrastive Quantization Based Semantic Tokenization For Generative Recommendation Zhu Jieming, Jin Mengqun, Liu Qijiong, Qiu Zexuan, Dong Zhenhua, Li Xiu
- Fastmem: Fast Memorization Of Prompt Improves Context Awareness Of Large Language Models Zhu Junyi, Liu Shuochen, Yu Yu, Tang Bo, Yan Yibo, Li Zhiyu, Xiong Feiyu, Xu Tong, Blaschko Matthew B.
- Relayattention For Efficient Large Language Model Serving With Long System Prompts Zhu Lei, Wang Xinjiang, Zhang Wayne, Lau Rynson W. H.
- LIRE: Listwise Reward Enhancement For Preference Alignment Zhu Mingye, Liu Yi, Zhang Lei, Guo Junbo, Mao Zhendong
- Personality Alignment Of Large Language Models Zhu Minjun, Yang Linyi, Zhang Yue
- Parameter-efficient Tuning Large Language Models For Graph Representation Learning Zhu Qi, Zheng Da, Song Xiang, Zhang Shichang, Jin Bowen, Sun Yizhou, Karypis George
- Scalable Matmul-free Language Modeling Zhu Rui-jie, Zhang Yu, Sifferman Ethan, Sheaves Tyler, Wang Yiqiao, Richmond Dustin, Zhou Peng, Eshraghian Jason K.
- Distilling Mathematical Reasoning Capabilities Into Small Language Models Zhu Xunyu, Li Jian, Liu Yong, Ma Can, Wang Weiping
- Can Large Language Models Understand Context? Zhu Yilun, Moniz Joel Ruben Antony, Bhargava Shruti, Lu Jiarui, Piraviperumal Dhivya, Li Site, Zhang Yuan, Yu Hong, Tseng Bo-hsiang
- Efficient Test-time Prompt Tuning For Vision-language Models Zhu Yuhan, Zhang Guozhen, Xu Chen, Shen Haocheng, Chen Xiaoxin, Wu Gangshan, Wang Limin
- Accelerating Inference Of Retrieval-augmented Generation Via Sparse Context Selection Zhu Yun, Gu Jia-chen, Sikora Caitlin, Ko Ho, Liu Yinxiao, Lin Chu-cheng, Shu Lei, Luo Liangchen, Meng Lei, Liu Bang, Chen Jindong
- Lumina-next: Making Lumina-t2x Stronger And Faster With Next-dit Zhuo Le, Du Ruoyi, Xiao Han, Li Yangguang, Liu Dongyang, Huang Rongjie, Liu Wenze, Zhao Lirui, Wang Fu-yun, Ma Zhanyu, Luo Xu, Wang Zehan, Zhang Kaipeng, Zhu Xiangyang, Liu Si, Yue Xiangyu, Liu Dingning, Ouyang Wanli, Liu Ziwei, Qiao Yu, Li Hongsheng, Gao Peng
- Promptintern: Saving Inference Costs By Internalizing Recurrent Prompt During Large Language Model Fine-tuning Zou Jiaru, Zhou Mengyu, Li Tao, Han Shi, Zhang Dongmei
- Aya Model: An Instruction Finetuned Open-access Multilingual Language Model Üstün Ahmet, Aryabumi Viraat, Yong Zheng-xin, Ko Wei-yin, D'souza Daniel, Onilude Gbemileke, Bhandari Neel, Singh Shivalika, Ooi Hui-lee, Kayid Amr, Vargus Freddie, Blunsom Phil, Longpre Shayne, Muennighoff Niklas, Fadaee Marzieh, Kreutzer Julia, Hooker Sara
🏷 Ethics And Bias
- Revisiting Visual Question Answering Baselines Jabri Allan, Joulin Armand, Van Der Maaten Laurens
- Topic Aware Neural Response Generation Xing Chen, Wu Wei, Wu Yu, Liu Jie, Huang Yalou, Zhou Ming, Ma Wei-ying
- Don't Just Assume; Look And Answer: Overcoming Priors For Visual Question Answering Agrawal Aishwarya, Batra Dhruv, Parikh Devi, Kembhavi Aniruddha
- Ivqa: Inverse Visual Question Answering Liu Feng, Xiang Tao, Hospedales Timothy M., Yang Wankou, Sun Changyin
- Recurrent And Contextual Models For Visual Question Answering Sharang Abhijit, Lau Eric
- A Unified Query-based Generative Model For Question Generation And Question Answering Song Linfeng, Wang Zhiguo, Hamza Wael
- Self-attentive Residual Decoder For Neural Machine Translation Werlen Lesly Miculicich, Pappas Nikolaos, Ram Dhananjay, Popescu-belis Andrei
- An Affect-rich Neural Conversational Model With Biased Attention And Weighted Cross-entropy Loss Zhong Peixiang, Wang Di, Miao Chunyan
- Reinforcement Learning Based Graph-to-sequence Model For Natural Question Generation Chen Yu, Wu Lingfei, Zaki Mohammed J.
- An Empirical Comparison On Imitation Learning And Reinforcement Learning For Paraphrase Generation Du Wanyu, Ji Yangfeng
- Exposure Bias Versus Self-recovery: Are Distortions Really Incremental For Autoregressive Text Generation? He Tianxing, Zhang Jingzhao, Zhou Zhiming, Glass James
- Exbert: A Visual Analysis Tool To Explore Learned Representations In Transformers Models Hoover Benjamin, Strobelt Hendrik, Gehrmann Sebastian
- Evaluating Rewards For Question Generation Models Hosking Tom, Riedel Sebastian
- Attention Is Not Explanation Jain Sarthak, Wallace Byron C.
- Weak Supervision Helps Emergence Of Word-object Alignment And Improves Vision-language Tasks Kervadec Corentin, Antipov Grigory, Baccouche Moez, Wolf Christian
- TAB-VCR: Tags And Attributes Based Visual Commonsense Reasoning Baselines Lin Jingxiang, Jain Unnat, Schwing Alexander G.
- Adversarial Bootstrapping For Dialogue Model Training Olabiyi Oluwatobi, Mueller Erik T., Larson Christopher, Lahlou Tarek
- Learning To Deceive With Attention-based Explanations Pruthi Danish, Gupta Mansi, Dhingra Bhuwan, Neubig Graham, Lipton Zachary C.
- Masked Language Model Scoring Salazar Julian, Liang Davis, Nguyen Toan Q., Kirchhoff Katrin
- Hierarchical Reinforcement Learning For Open-domain Dialog Saleh Abdelrhman, Jaques Natasha, Ghandeharioun Asma, Shen Judy Hanwen, Picard Rosalind
- Distilbert, A Distilled Version Of BERT: Smaller, Faster, Cheaper And Lighter Sanh Victor, Debut Lysandre, Chaumond Julien, Wolf Thomas
- Generalization In Generation: A Closer Look At Exposure Bias Schmidt Florian
- Inducing Brain-relevant Bias In Natural Language Processing Models Schwartz Dan, Toneva Mariya, Wehbe Leila
- Scene-based Factored Attention For Image Captioning Shen Chen, Ji Rongrong, Chen Fuhai, Sun Xiaoshuai, Li Xiangming
- On NMT Search Errors And Model Errors: Cat Got Your Tongue? Stahlberg Felix, Byrne Bill
- Visualizing Attention In Transformer-based Language Representation Models Vig Jesse
- Visual Entailment: A Novel Task For Fine-grained Image Understanding Xie Ning, Lai Farley, Doran Derek, Kadav Asim
- Making History Matter: History-advantage Sequence Training For Visual Dialog Yang Tianhao, Zha Zheng-jun, Zhang Hanwang
- Model Compression With Two-stage Multi-teacher Knowledge Distillation For Web Question Answering System Yang Ze, Shou Linjun, Gong Ming, Lin Wutao, Jiang Daxin
- Recosa: Detecting The Relevant Contexts With Self-attention For Multi-turn Dialogue Generation Zhang Hainan, Lan Yanyan, Pang Liang, Guo Jiafeng, Cheng Xueqi
- A Study On Multimodal And Interactive Explanations For Visual Question Answering Alipour Kamran, Schulze Jurgen P., Yao Yi, Ziskind Avi, Burachas Giedrius
- Syntax-driven Iterative Expansion Language Models For Controllable Text Generation Casas Noe, Fonollosa José A. R., Costa-jussà Marta R.
- MEANTIME: Mixture Of Attention Mechanisms With Multi-temporal Embeddings For Sequential Recommendation Cho Sung Min, Park Eunhyeok, Yoo Sungjoo
- Residual Energy-based Models For Text Generation Deng Yuntian, Bakhtin Anton, Ott Myle, Szlam Arthur, Ranzato Marc'aurelio
- Relevance Transformer: Generating Concise Code Snippets With Relevance Feedback Gemmell Carlos, Rossetto Federico, Dalton Jeffrey
- Teaforn: Teacher-forcing With N-grams Goodman Sebastian, Ding Nan, Soricut Radu
- Investigating Representations Of Verb Bias In Neural Language Models Hawkins Robert D., Yamakoshi Takateru, Griffiths Thomas L., Goldberg Adele E.
- Do Fine-tuned Commonsense Language Models Really Generalize? Kejriwal Mayank, Shen Ke
- Syntactic Structure Distillation Pretraining For Bidirectional Encoders Kuncoro Adhiguna, Kong Lingpeng, Fried Daniel, Yogatama Dani, Rimell Laura, Dyer Chris, Blunsom Phil
- Reducing Language Biases In Visual Question Answering With Visually-grounded Question Encoder Kv Gouthaman, Mittal Anurag
- Autoregressive Knowledge Distillation Through Imitation Learning Lin Alexander, Wohlwend Jeremy, Chen Howard, Lei Tao
- BLEURT: Learning Robust Metrics For Text Generation Sellam Thibault, Das Dipanjan, Parikh Ankur P.
- Multi-scale Transformer Language Models Subramanian Sandeep, Collobert Ronan, Ranzato Marc'aurelio, Boureau Y-lan
- Interpretable Neural Computation For Real-world Compositional Visual Question Answering Tang Ruixue, Ma Chao
- The Language Interpretability Tool: Extensible, Interactive Visualizations And Analysis For NLP Models Tenney Ian, Wexler James, Bastings Jasmijn, Bolukbasi Tolga, Coenen Andy, Gehrmann Sebastian, Jiang Ellen, Pushkarna Mahima, Radebaugh Carey, Reif Emily, Yuan Ann
- On Modality Bias In The TVQA Dataset Winterbottom Thomas, Xiao Sarah, Mclean Alistair, Moubayed Noura Al
- ERNIE-GEN: An Enhanced Multi-flow Pre-training And Fine-tuning Framework For Natural Language Generation Xiao Dongling, Zhang Han, Li Yukun, Sun Yu, Tian Hao, Wu Hua, Wang Haifeng
- Exploring Question-specific Rewards For Generating Deep Questions Xie Yuxi, Pan Liangming, Wang Dongzhe, Kan Min-yen, Feng Yansong
- Multi-unit Transformers For Neural Machine Translation Yan Jianhao, Meng Fandong, Zhou Jie
- Off-policy Self-critical Training For Transformer In Visual Paragraph Generation Yan Shiyang, Hua Yang, Robertson Neil M.
- Just Ask: Learning To Answer Questions From Millions Of Narrated Videos Yang Antoine, Miech Antoine, Sivic Josef, Laptev Ivan, Schmid Cordelia
- Modifying Memories In Transformer Models Zhu Chen, Rawat Ankit Singh, Zaheer Manzil, Bhojanapalli Srinadh, Li Daliang, Yu Felix, Kumar Sanjiv
- Overcoming Language Priors With Self-supervised Learning For Visual Question Answering Zhu Xi, Mao Zhendong, Liu Chunxiao, Zhang Peng, Wang Bin, Zhang Yongdong
- G-transformer For Document-level Machine Translation Bao Guangsheng, Zhang Yue, Teng Zhiyang, Chen Boxing, Luo Weihua
- CANINE: Pre-training An Efficient Tokenization-free Encoder For Language Representation Clark Jonathan H., Garrette Dan, Turc Iulia, Wieting John
- Diagnosing Transformers In Task-oriented Semantic Parsing Desai Shrey, Aly Ahmed
- EKTVQA: Generalized Use Of External Knowledge To Empower Scene Text In Text-vqa Dey Arka Ujjal, Valveny Ernest, Harit Gaurav
- Robustness Challenges In Model Distillation And Pruning For Natural Language Understanding Du Mengnan, Mukherjee Subhabrata, Cheng Yu, Shokouhi Milad, Hu Xia, Awadallah Ahmed Hassan
- Generative Context Pair Selection For Multi-hop Question Answering Dua Dheeru, Santos Cicero Nogueira Dos, Ng Patrick, Athiwaratkun Ben, Xiang Bing, Gardner Matt, Singh Sameer
- Greedy Gradient Ensemble For Robust Visual Question Answering Han Xinzhe, Wang Shuhui, Su Chi, Huang Qingming, Tian Qi
- Zero-shot Controlled Generation With Encoder-decoder Transformers Hazarika Devamanyu, Namazifar Mahdi, Hakkani-tür Dilek
- Scaling Language Models: Methods, Analysis & Insights From Training Gopher Jack W. Rae, Sebastian Borgeaud, Trevor Cai, Katie Millican, Jordan Hoffmann, Francis Song, John Aslanides, Sarah Henderson, Roman Ring, Susannah Young, Eliza Rutherford, Tom Hennigan, Jacob Menick, Albin Cassirer, Richard Powell, George Van Den Driessche, Lisa Anne Hendricks, Maribeth Rauh, Po-sen Huang, Amelia Glaese, Johannes Welbl, Sumanth Dathathri, Saffron Huang, Jonathan Uesato, John Mellor, Irina Higgins, Antonia Creswell, Nat Mcaleese, Amy Wu, Erich Elsen, Siddhant Jayakumar, Elena Buchatskaya, David Budden, Esme Sutherland, Karen Simonyan, Michela Paganini, Laurent Sifre, Lena Martens, Xiang Lorraine Li, Adhiguna Kuncoro, Aida Nematzadeh, Elena Gribovskaya, Domenic Donato, Angeliki Lazaridou, Arthur Mensch, Jean-baptiste Lespiau, Maria Tsimpoukelli, Nikolai Grigorev, Doug Fritz, Thibault Sottiaux, Mantas Pajarskas, Toby Pohlen, Zhitao Gong, Daniel Toyama, Cyprien De Masson D'autume, Yujia Li, Tayfun Terzi, Vladimir Mikulik, Igor Babuschkin, Aidan Clark, Diego De Las Casas, Aurelia Guy, Chris Jones, James Bradbury, Matthew Johnson, Blake Hechtman, Laura Weidinger, Iason Gabriel, William Isaac, Ed Lockhart, Simon Osindero, Laura Rimell, Chris Dyer, Oriol Vinyals, Kareem Ayoub, Jeff Stanway, Lorrayne Bennett, Demis Hassabis, Koray Kavukcuoglu, Geoffrey Irving
- Visqa: X-raying Vision And Language Reasoning In Transformers Jaunet Theo, Kervadec Corentin, Vuillemot Romain, Antipov Grigory, Baccouche Moez, Wolf Christian
- Enriching Transformers With Structured Tensor-product Representations For Abstractive Summarization Jiang Yichen, Celikyilmaz Asli, Smolensky Paul, Soulos Paul, Rao Sudha, Palangi Hamid, Fernandez Roland, Smith Caitlin, Bansal Mohit, Gao Jianfeng
- How Transferable Are Reasoning Patterns In VQA? Kervadec Corentin, Jaunet Theo, Antipov Grigory, Baccouche Moez, Vuillemot Romain, Wolf Christian
- Improving Scheduled Sampling With Elastic Weight Consolidation For Neural Machine Translation Korakakis Michalis, Vlachos Andreas
- BERT Busters: Outlier Dimensions That Disrupt Transformers Kovaleva Olga, Kulshreshtha Saurabh, Rogers Anna, Rumshisky Anna
- On The Significance Of Question Encoder Sequence Model In The Out-of-distribution Performance In Visual Question Answering Kv Gouthaman, Mittal Anurag
- Variational Latent-state GPT For Semi-supervised Task-oriented Dialog Systems Liu Hong, Cai Yucheng, Lin Zhenru, Ou Zhijian, Huang Yi, Feng Junlan
- Confidence-aware Scheduled Sampling For Neural Machine Translation Liu Yijin, Meng Fandong, Chen Yufeng, Xu Jinan, Zhou Jie
- Scheduled Sampling Based On Decoding Steps For Neural Machine Translation Liu Yijin, Meng Fandong, Chen Yufeng, Xu Jinan, Zhou Jie
- BERTGEN: Multi-task Generation Through BERT Mitzalis Faidon, Caglayan Ozan, Madhyastha Pranava, Specia Lucia
- MLP Architectures For Vision-and-language Modeling: An Empirical Study Nie Yixin, Li Linjie, Gan Zhe, Wang Shuohang, Zhu Chenguang, Zeng Michael, Liu Zicheng, Bansal Mohit, Wang Lijuan
- Code Generation From Natural Language With Less Prior And More Monolingual Data Norouzi Sajad, Tang Keyi, Cao Yanshuai
- ORCHARD: A Benchmark For Measuring Systematic Generalization Of Multi-hierarchical Reasoning Pung Bill Tuck Weng, Chan Alvin
- Leveraging The Inductive Bias Of Large Language Models For Abstract Textual Reasoning Rytting Christopher Michael, Wingate David
- Challenges In Procedural Multimodal Machine Comprehension: A Novel Way To Benchmark Sahu Pritish, Sikka Karan, Divakaran Ajay
- Towards Solving Multimodal Comprehension Sahu Pritish, Sikka Karan, Divakaran Ajay
- Large Pre-trained Language Models Contain Human-like Biases Of What Is Right And Wrong To Do Schramowski Patrick, Turan Cigdem, Andersen Nico, Rothkopf Constantin A., Kersting Kristian
- Societal Biases In Language Generation: Progress And Challenges Sheng Emily, Chang Kai-wei, Natarajan Premkumar, Peng Nanyun
- Contextual Biasing Of Language Models For Speech Recognition In Goal-oriented Conversational Agents Shenoy Ashish, Bodapati Sravan, Kirchhoff Katrin
- Worst Of Both Worlds: Biases Compound In Pre-trained Vision-and-language Models Srinivasan Tejas, Bisk Yonatan
- NVIDIA Nemo Neural Machine Translation Systems For English-german And English-russian News And Biomedical Tasks At WMT21 Subramanian Sandeep, Hrinchuk Oleksii, Adams Virginia, Kuchaiev Oleksii
- Local Explanation Of Dialogue Response Generation Tuan Yi-lin, Pryor Connor, Chen Wenhu, Getoor Lise, Wang William Yang
- AI Chains: Transparent And Controllable Human-ai Interaction By Chaining Large Language Model Prompts Wu Tongshuang, Terry Michael, Cai Carrie J.
- Causal Attention For Vision-language Tasks Yang Xu, Zhang Hanwang, Qi Guojun, Cai Jianfei
- Language Bias In Visual Question Answering: A Survey And Taxonomy Yuan Desen
- Bitfit: Simple Parameter-efficient Fine-tuning For Transformer-based Masked Language-models Zaken Elad Ben, Ravfogel Shauli, Goldberg Yoav
- Calibrate Before Use: Improving Few-shot Performance Of Language Models Zhao Tony Z., Wallace Eric, Feng Shi, Klein Dan, Singh Sameer
- Palm: Scaling Language Modeling With Pathways Aakanksha Chowdhery, Sharan Narang, Jacob Devlin, Maarten Bosma, Gaurav Mishra, Adam Roberts, Paul Barham, Hyung Won Chung, Charles Sutton, Sebastian Gehrmann, Parker Schuh, Kensen Shi, Sasha Tsvyashchenko, Joshua Maynez, Abhishek Rao, Parker Barnes, Yi Tay, Noam Shazeer, Vinodkumar Prabhakaran, Emily Reif, Nan Du, Ben Hutchinson, Reiner Pope, James Bradbury, Jacob Austin, Michael Isard, Guy Gur-ari, Pengcheng Yin, Toju Duke, Anselm Levskaya, Sanjay Ghemawat, Sunipa Dev, Henryk Michalewski, Xavier Garcia, Vedant Misra, Kevin Robinson, Liam Fedus, Denny Zhou, Daphne Ippolito, David Luan, Hyeontaek Lim, Barret Zoph, Alexander Spiridonov, Ryan Sepassi, David Dohan, Shivani Agrawal, Mark Omernick, Andrew M. Dai, Thanumalayan Sankaranarayana Pillai, Marie Pellat, Aitor Lewkowycz, Erica Moreira, Rewon Child, Oleksandr Polozov, Katherine Lee, Zongwei Zhou, Xuezhi Wang, Brennan Saeta, Mark Diaz, Orhan Firat, Michele Catasta, Jason Wei, Kathy Meier-hellstern, Douglas Eck, Jeff Dean, Slav Petrov, Noah Fiedel
- End-to-end Multimodal Representation Learning For Video Dialog Alamri Huda, Bilic Anthony, Hu Michael, Beedu Apoorva, Essa Irfan
- Despite "super-human" Performance, Current Llms Are Unsuited For Decisions About Ethics And Safety Albrecht Joshua, Kitanidis Ellie, Fetterman Abraham J.
- Improving Alignment Of Dialogue Agents Via Targeted Human Judgements Amelia Glaese, Nat Mcaleese, Maja Trębacz, John Aslanides, Vlad Firoiu, Timo Ewalds, Maribeth Rauh, Laura Weidinger, Martin Chadwick, Phoebe Thacker, Lucy Campbell-gillingham, Jonathan Uesato, Po-sen Huang, Ramona Comanescu, Fan Yang, Abigail See, Sumanth Dathathri, Rory Greig, Charlie Chen, Doug Fritz, Jaume Sanchez Elias, Richard Green, Soňa Mokrá, Nicholas Fernando, Boxi Wu, Rachel Foley, Susannah Young, Iason Gabriel, William Isaac, John Mellor, Demis Hassabis, Koray Kavukcuoglu, Lisa Anne Hendricks, Geoffrey Irving
- A Disability Lens Towards Biases In GPT-3 Generated Open-ended Languages Amin Akhter Al, Kabir Kazi Sinthia
- BLOOM: A 176b-parameter Open-access Multilingual Language Model Bigscience Workshop, Teven Le Scao, Angela Fan, Christopher Akiki, Ellie Pavlick, Suzana Ilić, Daniel Hesslow, Roman Castagné, Alexandra Sasha Luccioni, François Yvon, Matthias Gallé, Jonathan Tow, Alexander M. Rush, Stella Biderman, Albert Webson, Pawan Sasanka Ammanamanchi, Thomas Wang, Benoît Sagot, Niklas Muennighoff, Albert Villanova Del Moral, Olatunji Ruwase, Rachel Bawden, Stas Bekman, Angelina Mcmillan-major, Iz Beltagy, Huu Nguyen, Lucile Saulnier, Samson Tan, Pedro Ortiz Suarez, Victor Sanh, Hugo Laurençon, Yacine Jernite, Julien Launay, Margaret Mitchell, Colin Raffel, Aaron Gokaslan, Adi Simhi, Aitor Soroa, Alham Fikri Aji, Amit Alfassy, Anna Rogers, Ariel Kreisberg Nitzav, Canwen Xu, Chenghao Mou, Chris Emezue, Christopher Klamm, Colin Leong, Daniel Van Strien, David Ifeoluwa Adelani, Dragomir Radev, Eduardo González Ponferrada, Efrat Levkovizh, Ethan Kim, Eyal Bar Natan, Francesco De Toni, Gérard Dupont, Germán Kruszewski, Giada Pistilli, Hady Elsahar, Hamza Benyamina, Hieu Tran, Ian Yu, Idris Abdulmumin, Isaac Johnson, Itziar Gonzalez-dios, Javier De La Rosa, Jenny Chim, Jesse Dodge, Jian Zhu, Jonathan Chang, Jörg Frohberg, Joseph Tobing, Joydeep Bhattacharjee, Khalid Almubarak, Kimbo Chen, Kyle Lo, Leandro Von Werra, Leon Weber, Long Phan, Loubna Ben Allal, Ludovic Tanguy, Manan Dey, Manuel Romero Muñoz, Maraim Masoud, María Grandury, Mario Šaško, Max Huang, Maximin Coavoux, Mayank Singh, Mike Tian-jian Jiang, Minh Chien Vu, Mohammad A.
Jauhar, Mustafa Ghaleb, Nishant Subramani, Nora Kassner, Nurulaqilla Khamis, Olivier Nguyen, Omar Espejel, Ona De Gibert, Paulo Villegas, Peter Henderson, Pierre Colombo, Priscilla Amuok, Quentin Lhoest, Rheza Harliman, Rishi Bommasani, Roberto Luis López, Rui Ribeiro, Salomey Osei, Sampo Pyysalo, Sebastian Nagel, Shamik Bose, Shamsuddeen Hassan Muhammad, Shanya Sharma, Shayne Longpre, Somaieh Nikpoor, Stanislav Silberberg, Suhas Pai, Sydney Zink, Tiago Timponi Torrent, Timo Schick, Tristan Thrush, Valentin Danchev, Vassilina Nikoulina, Veronika Laippala, Violette Lepercq, Vrinda Prabhu, Zaid Alyafeai, Zeerak Talat, Arun Raja, Benjamin Heinzerling, Chenglei Si, Davut Emre Taşar, Elizabeth Salesky, Sabrina J. Mielke, Wilson Y. Lee, Abheesht Sharma, Andrea Santilli, Antoine Chaffin, Arnaud Stiegler, Debajyoti Datta, Eliza Szczechla, Gunjan Chhablani, Han Wang, Harshit Pandey, Hendrik Strobelt, Jason Alan Fries, Jos Rozen, Leo Gao, Lintang Sutawika, M Saiful Bari, Maged S. Al-shaibani, Matteo Manica, Nihal Nayak, Ryan Teehan, Samuel Albanie, Sheng Shen, Srulik Ben-david, Stephen H. 
Bach, Taewoon Kim, Tali Bers, Thibault Fevry, Trishala Neeraj, Urmish Thakker, Vikas Raunak, Xiangru Tang, Zheng-xin Yong, Zhiqing Sun, Shaked Brody, Yallow Uri, Hadar Tojarieh, Adam Roberts, Hyung Won Chung, Jaesung Tae, Jason Phang, Ofir Press, Conglong Li, Deepak Narayanan, Hatim Bourfoune, Jared Casper, Jeff Rasley, Max Ryabinin, Mayank Mishra, Minjia Zhang, Mohammad Shoeybi, Myriam Peyrounette, Nicolas Patry, Nouamane Tazi, Omar Sanseviero, Patrick Von Platen, Pierre Cornette, Pierre François Lavallée, Rémi Lacroix, Samyam Rajbhandari, Sanchit Gandhi, Shaden Smith, Stéphane Requena, Suraj Patil, Tim Dettmers, Ahmed Baruwa, Amanpreet Singh, Anastasia Cheveleva, Anne-laure Ligozat, Arjun Subramonian, Aurélie Névéol, Charles Lovering, Dan Garrette, Deepak Tunuguntla, Ehud Reiter, Ekaterina Taktasheva, Ekaterina Voloshina, Eli Bogdanov, Genta Indra Winata, Hailey Schoelkopf, Jan-christoph Kalo, Jekaterina Novikova, Jessica Zosa Forde, Jordan Clive, Jungo Kasai, Ken Kawamura, Liam Hazan, Marine Carpuat, Miruna Clinciu, Najoung Kim, Newton Cheng, Oleg Serikov, Omer Antverg, Oskar Van Der Wal, Rui Zhang, Ruochen Zhang, Sebastian Gehrmann, Shachar Mirkin, Shani Pais, Tatiana Shavrina, Thomas Scialom, Tian Yun, Tomasz Limisiewicz, Verena Rieser, Vitaly Protasov, Vladislav Mikhailov, Yada Pruksachatkun, Yonatan Belinkov, Zachary Bamberger, Zdeněk Kasner, Alice Rueda, Amanda Pestana, Amir Feizpour, Ammar Khan, Amy Faranak, Ana Santos, Anthony Hevia, Antigona Unldreaj, Arash Aghagol, Arezoo Abdollahi, Aycha Tammour, Azadeh Hajihosseini, Bahareh Behroozi, Benjamin Ajibade, Bharat Saxena, Carlos Muñoz Ferrandis, Daniel Mcduff, Danish Contractor, David Lansky, Davis David, Douwe Kiela, Duong A. 
Nguyen, Edward Tan, Emi Baylor, Ezinwanne Ozoani, Fatima Mirza, Frankline Ononiwu, Habib Rezanejad, Hessie Jones, Indrani Bhattacharya, Irene Solaiman, Irina Sedenko, Isar Nejadgholi, Jesse Passmore, Josh Seltzer, Julio Bonis Sanz, Livia Dutra, Mairon Samagaio, Maraim Elbadri, Margot Mieskes, Marissa Gerchick, Martha Akinlolu, Michael Mckenna, Mike Qiu, Muhammed Ghauri, Mykola Burynok, Nafis Abrar, Nazneen Rajani, Nour Elkott, Nour Fahmy, Olanrewaju Samuel, Ran An, Rasmus Kromann, Ryan Hao, Samira Alizadeh, Sarmad Shubber, Silas Wang, Sourav Roy, Sylvain Viguier, Thanh Le, Tobi Oyebade, Trieu Le, Yoyo Yang, Zach Nguyen, Abhinav Ramesh Kashyap, Alfredo Palasciano, Alison Callahan, Anima Shukla, Antonio Miranda-escalada, Ayush Singh, Benjamin Beilharz, Bo Wang, Caio Brito, Chenxi Zhou, Chirag Jain, Chuxin Xu, Clémentine Fourrier, Daniel León Periñán, Daniel Molano, Dian Yu, Enrique Manjavacas, Fabio Barth, Florian Fuhrimann, Gabriel Altay, Giyaseddin Bayrak, Gully Burns, Helena U. Vrabec, Imane Bello, Ishani Dash, Jihyun Kang, John Giorgi, Jonas Golde, Jose David Posada, Karthik Rangasai Sivaraman, Lokesh Bulchandani, Lu Liu, Luisa Shinzato, Madeleine Hahn De Bykhovetz, Maiko Takeuchi, Marc Pàmies, Maria A Castillo, Marianna Nezhurina, Mario Sänger, Matthias Samwald, Michael Cullan, Michael Weinberg, Michiel De Wolf, Mina Mihaljcic, Minna Liu, Moritz Freidank, Myungsun Kang, Natasha Seelam, Nathan Dahlberg, Nicholas Michio Broad, Nikolaus Muellner, Pascale Fung, Patrick Haller, Ramya Chandrasekhar, Renata Eisenberg, Robert Martin, Rodrigo Canalli, Rosaline Su, Ruisi Su, Samuel Cahyawijaya, Samuele Garda, Shlok S Deshmukh, Shubhanshu Mishra, Sid Kiblawi, Simon Ott, Sinee Sang-aroonsiri, Srishti Kumar, Stefan Schweter, Sushil Bharati, Tanmay Laud, Théo Gigant, Tomoya Kainuma, Wojciech Kusa, Yanis Labrak, Yash Shailesh Bajaj, Yash Venkatraman, Yifan Xu, Yingxin Xu, Yu Xu, Zhe Tan, Zhongli Xie, Zifan Ye, Mathilde Bras, Younes Belkada, Thomas Wolf
- Looking For A Handsome Carpenter! Debiasing GPT-3 Job Advertisements Borchers Conrad, Gala Dalia Sara, Gilburt Benjamin, Oravkin Eduard, Bounsi Wilfried, Asano Yuki M., Kirk Hannah Rose
- Understanding Performance Of Long-document Ranking Models Through Comprehensive Evaluation And Leaderboarding Boytsov Leonid, Akinpelu David, Lin Tianyi, Gao Fangwei, Zhao Yutian, Huang Jeffrey, Katyal Nipun, Nyberg Eric
- HIBRIDS: Attention With Hierarchical Biases For Structure-aware Long Document Summarization Cao Shuyang, Wang Lu
- The Myth Of Culturally Agnostic AI Models Cetinic Eva
- Testing Occupational Gender Bias In Language Models: Towards Robust Measurement And Zero-shot Debiasing Chen Yuen, Raghuram Vethavikashini Chithrra, Mattern Justus, Sachan Mrinmaya, Mihalcea Rada, Schölkopf Bernhard, Jin Zhijing
- Prompting GPT-3 To Be Reliable Chenglei Si, Zhe Gan, Zhengyuan Yang, Shuohang Wang, Jianfeng Wang, Jordan Boyd-graber, Lijuan Wang
- Shortcut Learning Of Large Language Models In Natural Language Understanding Du Mengnan, He Fengxiang, Zou Na, Tao Dacheng, Hu Xia
- Towards Winoqueer: Developing A Benchmark For Anti-queer Bias In Large Language Models Felkner Virginia K., Chang Ho-chun Herbert, Jang Eugene, May Jonathan
- Should Attention Be All We Need? The Epistemic And Ethical Implications Of Unification In Machine Learning Fishman Nic, Hancox-li Leif
- Revision Transformers: Instructing Language Models To Change Their Values Friedrich Felix, Stammer Wolfgang, Schramowski Patrick, Kersting Kristian
- Adapterbias: Parameter-efficient Token-dependent Representation Shift For Adapters In NLP Tasks Fu Chin-lun, Chen Zih-ching, Lee Yun-ru, Lee Hung-yi
- Normsage: Multi-lingual Multi-cultural Norm Discovery From Conversations On-the-fly Fung Yi R., Chakraborty Tuhin, Guo Hao, Rambow Owen, Muresan Smaranda, Ji Heng
- VLMAE: Vision-language Masked Autoencoder He Sunan, Guo Taian, Dai Tao, Qiao Ruizhi, Wu Chen, Shu Xiujun, Ren Bo
- Bridging The Data Gap Between Training And Inference For Unsupervised Neural Machine Translation He Zhiwei, Wang Xing, Wang Rui, Shi Shuming, Tu Zhaopeng
- Validating Large Language Models With Relm Kuchnik Michael, Smith Virginia, Amvrosiadis George
- Evaluation Of Synthetic Datasets For Conversational Recommender Systems Lara Harsh, Tiwari Manoj
- Littlebird: Efficient Faster & Longer Transformer For Question Answering Lee Minchul, Han Kijong, Shin Myeong Cheol
- Evaluating Psychological Safety Of Large Language Models Li Xingxuan, Li Yutong, Qiu Lin, Joty Shafiq, Bing Lidong
- Simple And Effective Gradient-based Tuning Of Sequence-to-sequence Models Lichtarge Jared, Alberti Chris, Kumar Shankar
- Delving Deeper Into Cross-lingual Visual Question Answering Liu Chen, Pfeiffer Jonas, Korhonen Anna, Vulić Ivan, Gurevych Iryna
- Do Ever Larger Octopi Still Amplify Reporting Biases? Evidence From Judgments Of Typical Colour Liu Fangyu, Eisenschlos Julian Martin, Cole Jeremy R., Collier Nigel
- Same Pre-training Loss, Better Downstream: Implicit Bias Matters For Language Models Liu Hong, Xie Sang Michael, Li Zhiyuan, Ma Tengyu
- A Win-win Deal: Towards Sparse And Robust Pre-trained Language Models Liu Yuanxin, Meng Fandong, Lin Zheng, Li Jiangnan, Fu Peng, Cao Yanan, Wang Weiping, Zhou Jie
- Structured Like A Language Model: Analysing AI As An Automated Subject Magee Liam, Arora Vanicka, Munn Luke
- Text Generation With Text-editing Models Malmi Eric, Dong Yue, Mallinson Jonathan, Chuklin Aleksandr, Adamek Jakub, Mirylenka Daniil, Stahlberg Felix, Krause Sebastian, Kumar Shankar, Severyn Aliaksei
- JASMINE: Arabic GPT Models For Few-shot Learning Nagoudi El Moatez Billah, Abdul-mageed Muhammad, Elmadany Abdelrahim, Inciarte Alcides Alcoba, Khondaker Md Tawkat Islam
- Relation-aware Language-graph Transformer For Question Answering Park Jinyoung, Choi Hyeong Kyu, Ko Juyeon, Park Hyeonjin, Kim Ji-hoon, Jeong Jisu, Kim Kyungmin, Kim Hyunwoo J.
- Holistic Evaluation Of Language Models Percy Liang, Rishi Bommasani, Tony Lee, Dimitris Tsipras, Dilara Soylu, Michihiro Yasunaga, Yian Zhang, Deepak Narayanan, Yuhuai Wu, Ananya Kumar, Benjamin Newman, Binhang Yuan, Bobby Yan, Ce Zhang, Christian Cosgrove, Christopher D. Manning, Christopher Ré, Diana Acosta-navas, Drew A. Hudson, Eric Zelikman, Esin Durmus, Faisal Ladhak, Frieda Rong, Hongyu Ren, Huaxiu Yao, Jue Wang, Keshav Santhanam, Laurel Orr, Lucia Zheng, Mert Yuksekgonul, Mirac Suzgun, Nathan Kim, Neel Guha, Niladri Chatterji, Omar Khattab, Peter Henderson, Qian Huang, Ryan Chi, Sang Michael Xie, Shibani Santurkar, Surya Ganguli, Tatsunori Hashimoto, Thomas Icard, Tianyi Zhang, Vishrav Chaudhary, William Wang, Xuechen Li, Yifan Mai, Yuhui Zhang, Yuta Koreeda
- Harnessing The Power Of Multi-task Pretraining For Ground-truth Level Natural Language Explanations Plüster Björn, Ambsdorf Jakob, Braach Lukas, Lee Jae Hee, Wermter Stefan
- Lamda: Language Models For Dialog Applications Romal Thoppilan, Daniel De Freitas, Jamie Hall, Noam Shazeer, Apoorv Kulshreshtha, Heng-tze Cheng, Alicia Jin, Taylor Bos, Leslie Baker, Yu Du, Yaguang Li, Hongrae Lee, Huaixiu Steven Zheng, Amin Ghafouri, Marcelo Menegali, Yanping Huang, Maxim Krikun, Dmitry Lepikhin, James Qin, Dehao Chen, Yuanzhong Xu, Zhifeng Chen, Adam Roberts, Maarten Bosma, Vincent Zhao, Yanqi Zhou, Chung-ching Chang, Igor Krivokon, Will Rusch, Marc Pickett, Pranesh Srinivasan, Laichee Man, Kathleen Meier-hellstern, Meredith Ringel Morris, Tulsee Doshi, Renelito Delos Santos, Toju Duke, Johnny Soraker, Ben Zevenbergen, Vinodkumar Prabhakaran, Mark Diaz, Ben Hutchinson, Kristen Olson, Alejandra Molina, Erin Hoffman-john, Josh Lee, Lora Aroyo, Ravi Rajakumar, Alena Butryna, Matthew Lamm, Viktoriya Kuzmina, Joe Fenton, Aaron Cohen, Rachel Bernstein, Ray Kurzweil, Blaise Aguera-arcas, Claire Cui, Marian Croak, Ed Chi, Quoc Le
- NLX-GPT: A Model For Natural Language Explanations In Vision And Vision-language Tasks Sammani Fawaz, Mukherjee Tanmoy, Deligiannis Nikos
- Transformer Grammars: Augmenting Transformer Language Models With Syntactic Inductive Biases At Scale Sartran Laurent, Barrett Samuel, Kuncoro Adhiguna, Stanojević Miloš, Blunsom Phil, Dyer Chris
- Bird-eye Transformers For Text Generation Models Sha Lei, Song Yuhang, Yordanov Yordan, Salvatori Tommaso, Lukasiewicz Thomas
- On Second Thought, Let's Not Think Step By Step! Bias And Toxicity In Zero-shot Reasoning Shaikh Omar, Zhang Hongxin, Held William, Bernstein Michael, Yang Diyi
- Deanthropomorphising NLP: Can A Language Model Be Conscious? Shardlow Matthew, Przybyła Piotr
- Unintended Bias In Language Model-driven Conversational Recommendation Shen Tianshu, Li Jiaru, Bouadjenek Mohamed Reda, Mai Zheda, Sanner Scott
- Augmenting Interpretable Models With Llms During Training Singh Chandan, Askari Armin, Caruana Rich, Gao Jianfeng
- Large Language Models Encode Clinical Knowledge Singhal Karan, Azizi Shekoofeh, Tu Tao, Mahdavi S. Sara, Wei Jason, Chung Hyung Won, Scales Nathan, Tanwani Ajay, Cole-lewis Heather, Pfohl Stephen, Payne Perry, Seneviratne Martin, Gamble Paul, Kelly Chris, Scharli Nathaneal, Chowdhery Aakanksha, Mansfield Philip, Arcas Blaise Aguera Y, Webster Dale, Corrado Greg S., Matias Yossi, Chou Katherine, Gottweis Juraj, Tomasev Nenad, Liu Yun, Rajkomar Alvin, Barral Joelle, Semturs Christopher, Karthikesalingam Alan, Natarajan Vivek
- Beyond The Imitation Game: Quantifying And Extrapolating The Capabilities Of Language Models Srivastava Aarohi, Rastogi Abhinav, Rao Abhishek, Shoeb Abu Awal Md, Abid Abubakar, Fisch Adam, Brown Adam R., Santoro Adam, Gupta Aditya, Garriga-alonso Adrià, Kluska Agnieszka, Lewkowycz Aitor, Agarwal Akshat, Power Alethea, Ray Alex, Warstadt Alex, Kocurek Alexander W., Safaya Ali, Tazarv Ali, Xiang Alice, Parrish Alicia, Nie Allen, Hussain Aman, Askell Amanda, Dsouza Amanda, Slone Ambrose, Rahane Ameet, Iyer Anantharaman S., Andreassen Anders, Madotto Andrea, Santilli Andrea, Stuhlmüller Andreas, Dai Andrew, La Andrew, Lampinen Andrew, Zou Andy, Jiang Angela, Chen Angelica, Vuong Anh, Gupta Animesh, Gottardi Anna, Norelli Antonio, Venkatesh Anu, Gholamidavoodi Arash, Tabassum Arfa, Menezes Arul, Kirubarajan Arun, Mullokandov Asher, Sabharwal Ashish, Herrick Austin, Efrat Avia, Erdem Aykut, Karakaş Ayla, Roberts B. Ryan, Loe Bao Sheng, Zoph Barret, Bojanowski Bartłomiej, Özyurt Batuhan, Hedayatnia Behnam, Neyshabur Behnam, Inden Benjamin, Stein Benno, Ekmekci Berk, Lin Bill Yuchen, Howald Blake, Orinion Bryan, Diao Cameron, Dour Cameron, Stinson Catherine, Argueta Cedrick, Ramírez César Ferri, Singh Chandan, Rathkopf Charles, Meng Chenlin, Baral Chitta, Wu Chiyu, Callison-burch Chris, Waites Chris, Voigt Christian, Manning Christopher D., Potts Christopher, Ramirez Cindy, Rivera Clara E., Siro Clemencia, Raffel Colin, Ashcraft Courtney, Garbacea Cristina, Sileo Damien, Garrette Dan, Hendrycks Dan, Kilman Dan, Roth Dan, Freeman Daniel, Khashabi Daniel, Levy Daniel, González Daniel Moseguí, Perszyk Danielle, Hernandez Danny, Chen Danqi, Ippolito Daphne, Gilboa Dar, Dohan David, Drakard David, Jurgens David, Datta Debajyoti, Ganguli Deep, Emelin Denis, Kleyko Denis, Yuret Deniz, Chen Derek, Tam Derek, Hupkes Dieuwke, Misra Diganta, Buzan Dilyar, Mollo Dimitri Coelho, Yang Diyi, Lee Dong-ho, Schrader Dylan, Shutova Ekaterina, Cubuk Ekin Dogus, Segal Elad, Hagerman Eleanor, Barnes Elizabeth, Donoway Elizabeth, Pavlick Ellie, Rodola Emanuele, Lam Emma, Chu Eric, Tang Eric, Erdem Erkut, Chang Ernie, Chi Ethan A., Dyer Ethan, Jerzak Ethan, Kim Ethan, Manyasi Eunice Engefu, Zheltonozhskii Evgenii, Xia Fanyue, Siar Fatemeh, Martínez-plumed Fernando, Happé Francesca, Chollet Francois, Rong Frieda, Mishra Gaurav, Winata Genta Indra, De Melo Gerard, Kruszewski Germán, Parascandolo Giambattista, Mariani Giorgio, Wang Gloria, Jaimovitch-lópez Gonzalo, Betz Gregor, Gur-ari Guy, Galijasevic Hana, Kim Hannah, Rashkin Hannah, Hajishirzi Hannaneh, Mehta Harsh, Bogar Hayden, Shevlin Henry, Schütze Hinrich, Yakura Hiromu, Zhang Hongming, Wong Hugh Mee, Ng Ian, Noble Isaac, Jumelet Jaap, Geissinger Jack, Kernion Jackson, Hilton Jacob, Lee Jaehoon, Fisac Jaime Fernández, Simon James B., Koppel James, Zheng James, Zou James, Kocoń Jan, Thompson Jana, Wingfield Janelle, Kaplan Jared, Radom Jarema, Sohl-dickstein Jascha, Phang Jason, Wei Jason, Yosinski Jason, Novikova Jekaterina, Bosscher Jelle, Marsh Jennifer, Kim Jeremy, Taal Jeroen, Engel Jesse, Alabi Jesujoba, Xu Jiacheng, Song Jiaming, Tang Jillian, Waweru Joan, Burden John, Miller John, Balis John U., Batchelder Jonathan, Berant Jonathan, Frohberg Jörg, Rozen Jos, Hernandez-orallo Jose, Boudeman Joseph, Guerr Joseph, Jones Joseph, Tenenbaum Joshua B., Rule Joshua S., Chua Joyce, Kanclerz Kamil, Livescu Karen, Krauth Karl, Gopalakrishnan Karthik, Ignatyeva Katerina, Markert Katja, Dhole Kaustubh D., Gimpel Kevin, Omondi Kevin, Mathewson Kory, Chiafullo Kristen, Shkaruta Ksenia, Shridhar Kumar, Mcdonell Kyle, Richardson Kyle, Reynolds Laria, Gao Leo, Zhang Li, Dugan Liam, Qin Lianhui, Contreras-ochando Lidia, Morency Louis-philippe, Moschella Luca, Lam Lucas, Noble Lucy, Schmidt Ludwig, He Luheng, Colón Luis Oliveros, Metz Luke, Şenel Lütfi Kerem, Bosma Maarten, Sap Maarten, Ter Hoeve Maartje, Farooqi Maheen, Faruqui Manaal, Mazeika Mantas, Baturan Marco, Marelli Marco, Maru Marco, Quintana Maria Jose Ramírez, Tolkiehn Marie, Giulianelli Mario, Lewis Martha, Potthast Martin, Leavitt Matthew L., Hagen Matthias, Schubert Mátyás, Baitemirova Medina Orduna, Arnaud Melody, Mcelrath Melvin, Yee Michael A., Cohen Michael, Gu Michael, Ivanitskiy Michael, Starritt Michael, Strube Michael, Swędrowski Michał, Bevilacqua Michele, Yasunaga Michihiro, Kale Mihir, Cain Mike, Xu Mimee, Suzgun Mirac, Walker Mitch, Tiwari Mo, Bansal Mohit, Aminnaseri Moin, Geva Mor, Gheini Mozhdeh, T Mukund Varma, Peng Nanyun, Chi Nathan A., Lee Nayeon, Krakover Neta Gur-ari, Cameron Nicholas, Roberts Nicholas, Doiron Nick, Martinez Nicole, Nangia Nikita, Deckers Niklas, Muennighoff Niklas, Keskar Nitish Shirish, Iyer Niveditha S., Constant Noah, Fiedel Noah, Wen Nuan, Zhang Oliver, Agha Omar, Elbaghdadi Omar, Levy Omer, Evans Owain, Casares Pablo Antonio Moreno, Doshi Parth, Fung Pascale, Liang Paul Pu, Vicol Paul, Alipoormolabashi Pegah, Liao Peiyuan, Liang Percy, Chang Peter, Eckersley Peter, Htut Phu Mon, Hwang Pinyu, Miłkowski Piotr, Patil Piyush, Pezeshkpour Pouya, Oli Priti, Mei Qiaozhu, Lyu Qing, Chen Qinlang, Banjade Rabin, Rudolph Rachel Etta, Gabriel Raefer, Habacker Rahel, Risco Ramon, Millière Raphaël, Garg Rhythm, Barnes Richard, Saurous Rif A., Arakawa Riku, Raymaekers Robbe, Frank Robert, Sikand Rohan, Novak Roman, Sitelew Roman, Lebras Ronan, Liu Rosanne, Jacobs Rowan, Zhang Rui, Salakhutdinov Ruslan, Chi Ryan, Lee Ryan, Stovall Ryan, Teehan Ryan, Yang Rylan, Singh Sahib, Mohammad Saif M., Anand Sajant, Dillavou Sam, Shleifer Sam, Wiseman Sam, Gruetter Samuel, Bowman Samuel R., Schoenholz Samuel S., Han Sanghyun, Kwatra Sanjeev, Rous Sarah A., Ghazarian Sarik, Ghosh Sayan, Casey Sean, Bischoff Sebastian, Gehrmann Sebastian, Schuster Sebastian, Sadeghi Sepideh, Hamdan Shadi, Zhou Sharon, Srivastava Shashank, Shi Sherry, Singh Shikhar, Asaadi Shima, Gu Shixiang Shane, Pachchigar Shubh, Toshniwal Shubham, Upadhyay Shyam, Debnath Shyamolima (Shammie), Shakeri Siamak, Thormeyer Simon, Melzi Simone, Reddy Siva, Makini Sneha Priscilla, Lee Soo-hwan, Torene Spencer, Hatwar Sriharsha, Dehaene Stanislas, Divic Stefan, Ermon Stefano, Biderman Stella, Lin Stephanie, Prasad Stephen, Piantadosi Steven T., Shieber Stuart M., Misherghi Summer, Kiritchenko Svetlana, Mishra Swaroop, Linzen Tal, Schuster Tal, Li Tao, Yu Tao, Ali Tariq, Hashimoto Tatsu, Wu Te-lin, Desbordes Théo, Rothschild Theodore, Phan Thomas, Wang Tianle, Nkinyili Tiberius, Schick Timo, Kornev Timofei, Tunduny Titus, Gerstenberg Tobias, Chang Trenton, Neeraj Trishala, Khot Tushar, Shultz Tyler, Shaham Uri, Misra Vedant, Demberg Vera, Nyamai Victoria, Raunak Vikas, Ramasesh Vinay, Prabhu Vinay Uday, Padmakumar Vishakh, Srikumar Vivek, Fedus William, Saunders William, Zhang William, Vossen Wout, Ren Xiang, Tong Xiaoyu, Zhao Xinran, Wu Xinyi, Shen Xudong, Yaghoobzadeh Yadollah, Lakretz Yair, Song Yangqiu, Bahri Yasaman, Choi Yejin, Yang Yichi, Hao Yiding, Chen Yifu, Belinkov Yonatan, Hou Yu, Hou Yufang, Bai Yuntao, Seid Zachary, Zhao Zhuoye, Wang Zijian, Wang Zijie J., Wang Zirui, Wu Ziyi
- Chatgpt: The End Of Online Exam Integrity? Susnjak Teo
- Fewer Errors, But More Stereotypes? The Effect Of Model Size On Gender Bias Tal Yarden, Magar Inbal, Schwartz Roy
- Exploring The Limits Of Domain-adaptive Training For Detoxifying Large-scale Language Models Wang Boxin, Ping Wei, Xiao Chaowei, Xu Peng, Patwary Mostofa, Shoeybi Mohammad, Li Bo, Anandkumar Anima, Catanzaro Bryan
- Pretraining Without Attention Wang Junxiong, Yan Jing Nathan, Gu Albert, Rush Alexander M.
- Jiuzhang: A Chinese Pre-trained Language Model For Mathematical Problem Understanding Wayne Xin Zhao, Kun Zhou, Zheng Gong, Beichen Zhang, Yuanhang Zhou, Jing Sha, Zhigang Chen, Shijin Wang, Cong Liu, Ji-rong Wen
- Adversarial Self-attention For Language Understanding Wu Hongqiu, Ding Ruixue, Zhao Hai, Xie Pengjun, Huang Fei, Zhang Min
- Modeling Context With Linear Attention For Scalable Document-level Translation Wu Zhaofeng, Peng Hao, Pappas Nikolaos, Smith Noah A.
- Enhancing Pre-trained Models With Text Structure Knowledge For Question Generation Wu Zichen, Jia Xin, Qu Fanyi, Wu Yunfang
- Learning To Answer Visual Questions From Web Videos Yang Antoine, Miech Antoine, Sivic Josef, Laptev Ivan, Schmid Cordelia
- Tableformer: Robust Transformer Modeling For Table-text Encoding Yang Jingfeng, Gupta Aditya, Upadhyay Shyam, He Luheng, Goel Rahul, Paul Shachi
- A Survey On GPT-3 Zong Mingyu, Krishnamachari Bhaskar
- Creating Trustworthy Llms: Dealing With Hallucinations In Healthcare AI Ahmad Muhammad Aurangzeb, Yaramis Ilker, Roy Taposh Dutta
- Multimodal Representations For Teacher-guided Compositional Visual Reasoning Aissa Wafa, Ferecatu Marin, Crucianu Michel
- Position Interpolation Improves Alibi Extrapolation Al-khateeb Faisal, Dey Nolan, Soboleva Daria, Hestness Joel
- Query Understanding In The Age Of Large Language Models Anand Avishek, V Venktesh, Anand Abhijit, Setty Vinay
- Studying The Impacts Of Pre-training Using Chatgpt-generated Text On Downstream Tasks Anand Sarthak
- Detecting Natural Language Biases With Prompt-based Learning Aowal Md Abdul, Islam Maliha T, Mammen Priyanka Mary, Shetty Sandesh
- Llm2loss: Leveraging Language Models For Explainable Model Diagnostics Ardeshir Shervin
- Using Large Language Models For Qualitative Analysis Can Introduce Serious Bias Ashwin Julian, Chhabra Aditya, Rao Vijayendra
- Taught By The Internet, Exploring Bias In Openais GPT3 Ayaz Ali, Nawalgaria Aditya, Yin Ruilian
- Intentional Biases In LLM Responses Badyal Nicklaus, Jacoby Derek, Coady Yvonne
- Peering Through Preferences: Unraveling Feedback Acquisition For Aligning Large Language Models Bansal Hritik, Dang John, Grover Aditya
- Worldsense: A Synthetic Benchmark For Grounded Reasoning In Large Language Models Benchekroun Youssef, Dervishi Megi, Ibrahim Mark, Gaya Jean-baptiste, Martinet Xavier, Mialon Grégoire, Scialom Thomas, Dupoux Emmanuel, Hupkes Dieuwke, Vincent Pascal
- Language Model Unalignment: Parametric Red-teaming To Expose Hidden Harms And Biases Bhardwaj Rishabh, Poria Soujanya
- A Categorical Archive Of Chatgpt Failures Borji Ali
- Extractgpt: Exploring The Potential Of Large Language Models For Product Attribute Value Extraction Brinkmann Alexander, Shraga Roee, Bizer Christian
- On The Application Of Large Language Models For Language Teaching And Assessment Technology Caines Andrew, Benedetto Luca, Taslimipoor Shiva, Davis Christopher, Gao Yuan, Andersen Oeistein, Yuan Zheng, Elliott Mark, Moore Russell, Bryant Christopher, Rei Marek, Yannakoudakis Helen, Mullooly Andrew, Nicholls Diane, Buttery Paula
- A Systematic Study Of Knowledge Distillation For Natural Language Generation With Pseudo-target Training Calderon Nitay, Mukherjee Subhabrata, Reichart Roi, Kantor Amir
- H2O Open Ecosystem For State-of-the-art Large Language Models Candel Arno, Mckinney Jon, Singer Philipp, Pfeiffer Pascal, Jeblick Maximilian, Lee Chun Ming, Conde Marcos V.
- H2ogpt: Democratizing Large Language Models Candel Arno, Mckinney Jon, Singer Philipp, Pfeiffer Pascal, Jeblick Maximilian, Prabhu Prithvi, Gambera Jeff, Landry Mark, Bansal Shivam, Chesler Ryan, Lee Chun Ming, Conde Marcos V., Stetsenko Pasha, Grellier Olivier, Ambati Srisatish
- Assessing Large Language Models' Ability To Predict How Humans Balance Self-interest And The Interest Of Others Capraro Valerio, Di Paolo Roberto, Pizziol Veronica
- Which Spurious Correlations Impact Reasoning In NLI Models? A Visual Interactive Diagnosis Through Data-constrained Counterfactuals Chan Robin, Amini Afra, El-assady Mennatallah
- Language Model Behavior: A Comprehensive Survey Chang Tyler A., Bergen Benjamin K.
- Sudden Drops In The Loss: Syntax Acquisition, Phase Transitions, And Simplicity Bias In Mlms Chen Angelica, Shwartz-ziv Ravid, Cho Kyunghyun, Leavitt Matthew L., Saphra Naomi
- Interpreting And Controlling Vision Foundation Models Via Text Explanations Chen Haozhe, Yang Junfeng, Vondrick Carl, Mao Chengzhi
- Challenges And Contributing Factors In The Utilization Of Large Language Models (llms) Chen Xiaoliang, Li Liangbin, Chang Le, Huang Yunhe, Zhao Yuxuan, Zhang Yuxiao, Li Dinuo
- GENOME: Generative Neuro-symbolic Visual Reasoning By Growing And Reusing Modules Chen Zhenfang, Sun Rui, Liu Wenjun, Hong Yining, Gan Chuang
- See, Think, Confirm: Interactive Prompting Between Vision And Language Models For Knowledge-based Visual Reasoning Chen Zhenfang, Zhou Qinhong, Shen Yikang, Hong Yining, Zhang Hao, Gan Chuang
- Lmexplainer: Grounding Knowledge And Explaining Language Models Chen Zichen, Chen Jianda, Chen Yuanyuan, Yu Han, Singh Ambuj K, Sra Misha
- Xplainllm: A QA Explanation Dataset For Understanding LLM Decision-making Chen Zichen, Chen Jianda, Gaidhani Mitali, Singh Ambuj, Sra Misha
- SCALE: Synergized Collaboration Of Asymmetric Language Translation Engines Cheng Xin, Wang Xun, Ge Tao, Chen Si-qing, Wei Furu, Zhao Dongyan, Yan Rui
- An Integrative Survey On Mental Health Conversational Agents To Bridge Computer Science And Medical Perspectives Cho Young Min, Rai Sunny, Ungar Lyle, Sedoc João, Guntuku Sharath Chandra
- Whose Text Is It Anyway? Exploring Bigcode, Intellectual Property, And Ethics Choksi Madiha Zahrah, Goedicke David
- Implications Of Annotation Artifacts In Edge Probing Test Datasets Choudhury Sagnik Ray, Kalra Jushaan
- Challenges Of Large Language Models For Mental Health Counseling Chung Neo Christopher, Dyer George, Brocki Lennart
- Inducing Anxiety In Large Language Models Increases Exploration And Bias Coda-forno Julian, Witte Kristin, Jagadish Akshay K., Binz Marcel, Akata Zeynep, Schulz Eric
- Seamlessm4t: Massively Multilingual & Multimodal Machine Translation Seamless Communication, Barrault Loïc, Chung Yu-an, Meglioli Mariano Coria, Dale David, Dong Ning, Duquenne Paul-ambroise, Elsahar Hady, Gong Hongyu, Heffernan Kevin, Hoffman John, Klaiber Christopher, Li Pengwei, Licht Daniel, Maillard Jean, Rakotoarison Alice, Sadagopan Kaushik Ram, Wenzek Guillaume, Ye Ethan, Akula Bapi, Chen Peng-jen, Hachem Naji El, Ellis Brian, Gonzalez Gabriel Mejia, Haaheim Justin, Hansanti Prangthip, Howes Russ, Huang Bernie, Hwang Min-jae, Inaguma Hirofumi, Jain Somya, Kalbassi Elahe, Kallet Amanda, Kulikov Ilia, Lam Janice, Li Daniel, Ma Xutai, Mavlyutov Ruslan, Peloquin Benjamin, Ramadan Mohamed, Ramakrishnan Abinesh, Sun Anna, Tran Kevin, Tran Tuan, Tufanov Igor, Vogeti Vish, Wood Carleigh, Yang Yilin, Yu Bokai, Andrews Pierre, Balioglu Can, Costa-jussà Marta R., Celebi Onur, Elbayad Maha, Gao Cynthia, Guzmán Francisco, Kao Justine, Lee Ann, Mourachko Alexandre, Pino Juan, Popuri Sravya, Ropers Christophe, Saleem Safiyyah, Schwenk Holger, Tomasello Paden, Wang Changhan, Wang Jeff, Wang Skyler
- Seamless: Multilingual Expressive And Streaming Speech Translation Seamless Communication, Barrault Loïc, Chung Yu-an, Meglioli Mariano Coria, Dale David, Dong Ning, Duppenthaler Mark, Duquenne Paul-ambroise, Ellis Brian, Elsahar Hady, Haaheim Justin, Hoffman John, Hwang Min-jae, Inaguma Hirofumi, Klaiber Christopher, Kulikov Ilia, Li Pengwei, Licht Daniel, Maillard Jean, Mavlyutov Ruslan, Rakotoarison Alice, Sadagopan Kaushik Ram, Ramakrishnan Abinesh, Tran Tuan, Wenzek Guillaume, Yang Yilin, Ye Ethan, Evtimov Ivan, Fernandez Pierre, Gao Cynthia, Hansanti Prangthip, Kalbassi Elahe, Kallet Amanda, Kozhevnikov Artyom, Gonzalez Gabriel Mejia, Roman Robin San, Touret Christophe, Wong Corinne, Wood Carleigh, Yu Bokai, Andrews Pierre, Balioglu Can, Chen Peng-jen, Costa-jussà Marta R., Elbayad Maha, Gong Hongyu, Guzmán Francisco, Heffernan Kevin, Jain Somya, Kao Justine, Lee Ann, Ma Xutai, Mourachko Alex, Peloquin Benjamin, Pino Juan, Popuri Sravya, Ropers Christophe, Saleem Safiyyah, Schwenk Holger, Sun Anna, Tomasello Paden, Wang Changhan, Wang Jeff, Wang Skyler, Williamson Mary
- Reinforcement Learning Fine-tuning Of Language Models Is Biased Towards More Extractable Features Cruz Diogo, Pona Edoardo, Holness-tofts Alex, Schmied Elias, Alonso Víctor Abia, Griffin Charlie, Cirstea Bogdan-ionut
- Ultrafeedback: Boosting Language Models With Scaled AI Feedback Cui Ganqu, Yuan Lifan, Ding Ning, Yao Guanming, He Bingxiang, Zhu Wei, Ni Yuan, Xie Guotong, Xie Ruobing, Lin Yankai, Liu Zhiyuan, Sun Maosong
- Neural Retrievers Are Biased Towards Llm-generated Content Dai Sunhao, Zhou Yuqi, Pang Liang, Liu Weihao, Hu Xiaolin, Liu Yong, Zhang Xiao, Wang Gang, Xu Jun
- Mask The Bias: Improving Domain-adaptive Generalization Of Ctc-based ASR With Internal Language Model Estimation Das Nilaksh, Sunkara Monica, Bodapati Sravan, Cai Jinglun, Kulshreshtha Devang, Farris Jeff, Kirchhoff Katrin
- Who's Thinking? A Push For Human-centered Evaluation Of Llms Using The XAI Playbook Datta Teresa, Dickerson John P.
- Complex QA And Language Models Hybrid Architectures, Survey Daull Xavier, Bellot Patrice, Bruno Emmanuel, Martin Vincent, Murisasco Elisabeth
- Im-promptu: In-context Composition From Image Prompts Dedhia Bhishma, Chang Michael, Snell Jake C., Griffiths Thomas L., Jha Niraj K.
- Scaling Vision Transformers To 22 Billion Parameters Dehghani Mostafa, Djolonga Josip, Mustafa Basil, Padlewski Piotr, Heek Jonathan, Gilmer Justin, Steiner Andreas, Caron Mathilde, Geirhos Robert, Alabdulmohsin Ibrahim, Jenatton Rodolphe, Beyer Lucas, Tschannen Michael, Arnab Anurag, Wang Xiao, Riquelme Carlos, Minderer Matthias, Puigcerver Joan, Evci Utku, Kumar Manoj, Van Steenkiste Sjoerd, Elsayed Gamaleldin F., Mahendran Aravindh, Yu Fisher, Oliver Avital, Huot Fantine, Bastings Jasmijn, Collier Mark Patrick, Gritsenko Alexey, Birodkar Vighnesh, Vasconcelos Cristina, Tay Yi, Mensink Thomas, Kolesnikov Alexander, Pavetić Filip, Tran Dustin, Kipf Thomas, Lučić Mario, Zhai Xiaohua, Keysers Daniel, Harmsen Jeremiah, Houlsby Neil
- Controlled Text Generation Via Language Model Arithmetic Dekoninck Jasper, Fischer Marc, Beurer-kellner Luca, Vechev Martin
- Fairness Of Chatgpt And The Role Of Explainable-guided Prompts Deldjoo Yashar
- Investigating Data Contamination In Modern Benchmarks For Large Language Models Deng Chunyuan, Zhao Yilun, Tang Xiangru, Gerstein Mark, Cohan Arman
- Can Chatgpt Read Who You Are? Derner Erik, Kučera Dalibor, Oliver Nuria, Zahálka Jan
- Toxicity In Chatgpt: Analyzing Persona-assigned Language Models Deshpande Ameet, Murahari Vishvak, Rajpurohit Tanmay, Kalyan Ashwin, Narasimhan Karthik
- Queer People Are People First: Deconstructing Sexual Identity Stereotypes In Large Language Models Dhingra Harnoor, Jayashanker Preetiha, Moghe Sayali, Strubell Emma
- Can Instruction Fine-tuned Language Models Identify Social Bias Through Prompting? Dige Omkar, Tian Jacob-junqi, Emerson David, Khattak Faiza Khan
- Fluid Transformers And Creative Analogies: Exploring Large Language Models' Capacity For Augmenting Cross-domain Analogical Creativity Ding Zijian, Srinivasan Arvind, Macneil Stephen, Chan Joel
- Probing Explicit And Implicit Gender Bias Through LLM Conditional Text Generation Dong Xiangjue, Wang Yibo, Yu Philip S., Caverlee James
- Blending Reward Functions Via Few Expert Demonstrations For Faithful And Accurate Knowledge-grounded Dialogue Generation Du Wanyu, Ji Yangfeng
- Denevil: Towards Deciphering And Navigating The Ethical Values Of Large Language Models Via Instruction Learning Duan Shitong, Yi Xiaoyuan, Zhang Peng, Lu Tun, Xie Xing, Gu Ning
- In What Languages Are Generative Language Models The Most Formal? Analyzing Formality Distribution Across Languages Ersoy Asım, Vizcarra Gerson, Mayeesha Tasmiah Tahsin, Muller Benjamin
- ROBBIE: Robust Bias Evaluation Of Large Generative Language Models Esiobu David, Tan Xiaoqing, Hosseini Saghar, Ung Megan, Zhang Yuchen, Fernandes Jude, Dwivedi-yu Jane, Presani Eleonora, Williams Adina, Smith Eric Michael
- Comparable Demonstrations Are Important In In-context Learning: A Novel Perspective On Demonstration Selection Fan Caoyun, Tian Jidong, Li Yitian, He Hao, Jin Yaohui
- An Empirical Study Of Multitask Learning To Improve Open Domain Dialogue Systems Farahani Mehrdad, Johansson Richard
- Friend Or Foe? Exploring The Implications Of Large Language Models On The Science System Fecher Benedikt, Hebing Marcel, Laufer Melissa, Pohle Jörg, Sofsky Fabian
- Transferable Decoding With Visual Entities For Zero-shot Image Captioning Fei Junjie, Wang Teng, Zhang Jinrui, He Zhenyu, Wang Chengjie, Zheng Feng
- Should Chatgpt Be Biased? Challenges And Risks Of Bias In Large Language Models Ferrara Emilio
- Leveraging Large Language Models In Conversational Recommender Systems Friedman Luke, Ahuja Sameer, Allen David, Tan Zhenning, Sidahmed Hakim, Long Changbo, Xie Jun, Schubiner Gabriel, Patel Ajay, Lara Harsh, Chu Brian, Chen Zexi, Tiwari Manoj
- Chainpoll: A High Efficacy Method For LLM Hallucination Detection Friel Robert, Sanyal Atindriyo
- Generate Then Select: Open-ended Visual Question Answering Guided By World Knowledge Fu Xingyu, Zhang Sheng, Kwon Gukyeong, Perera Pramuditha, Zhu Henghui, Zhang Yuhao, Li Alexander Hanbo, Wang William Yang, Wang Zhiguo, Castelli Vittorio, Ng Patrick, Roth Dan, Xiang Bing
- The Impact And Applications Of Chatgpt: A Systematic Review Of Literature Reviews Gabashvili Irene S.
- Llama-adapter V2: Parameter-efficient Visual Instruction Model Gao Peng, Han Jiaming, Zhang Renrui, Lin Ziyi, Geng Shijie, Zhou Aojun, Zhang Wei, Lu Pan, He Conghui, Yue Xiangyu, Li Hongsheng, Qiao Yu
- Ahead-of-time P-tuning Gavrilov Daniil, Balagansky Nikita
- Chatgpt Perpetuates Gender Bias In Machine Translation And Ignores Non-gendered Pronouns: Findings Across Bengali And Five Other Low-resource Languages Ghosh Sourojit, Caliskan Aylin
- Large Language Models Are Zero-shot Time Series Forecasters Gruver Nate, Finzi Marc, Qiu Shikai, Wilson Andrew Gordon
- Targen: Targeted Data Generation With Large Language Models Gupta Himanshu, Scaria Kevin, Anantheswaran Ujjwala, Verma Shreyas, Parmar Mihir, Sawant Saurabh Arjun, Baral Chitta, Mishra Swaroop
- Bias Runs Deep: Implicit Reasoning Biases In Persona-assigned Llms Gupta Shashank, Shrivastava Vaishnavi, Deshpande Ameet, Kalyan Ashwin, Clark Peter, Sabharwal Ashish, Khot Tushar
- A Real-world Webagent With Planning, Long Context Understanding, And Program Synthesis Gur Izzeddin, Furuta Hiroki, Huang Austin, Safdari Mustafa, Matsuo Yutaka, Eck Douglas, Faust Aleksandra
- Regulating Chatgpt And Other Large Generative AI Models Hacker Philipp, Engel Andreas, Mauer Marco
- Are Large Language Model-based Evaluators The Solution To Scaling Up Multilingual Evaluation? Hada Rishav, Gumma Varun, De Wynter Adrian, Diddee Harshita, Ahmed Mohamed, Choudhury Monojit, Bali Kalika, Sitaram Sunayana
- Human-like Intuitive Behavior And Reasoning Biases Emerged In Language Models -- And Disappeared In GPT-4 Hagendorff Thilo, Fabi Sarah
- Opiniongpt: Modelling Explicit Biases In Instruction-tuned Llms Haller Patrick, Aynetdinov Ansar, Akbik Alan
- Towards Efficient Vision-language Tuning: More Information Density, More Generalizability Hao Tianxiang, Lyu Mengyao, Chen Hui, Zhao Sicheng, Ding Xiaohan, Han Jungong, Ding Guiguang
- Unlocking The Potential Of Chatgpt: A Comprehensive Exploration Of Its Applications, Advantages, Limitations, And Future Directions In Natural Language Processing Hariri Walid
- The Political Ideology Of Conversational AI: Converging Evidence On Chatgpt's Pro-environmental, Left-libertarian Orientation Hartmann Jochen, Schwenzow Jasper, Witte Maximilian
- Wanjuan: A Comprehensive Multimodal Dataset For Advancing English And Chinese Large Models He Conghui, Jin Zhenjiang, Xu Chao, Qiu Jiantao, Wang Bin, Li Wei, Yan Hang, Wang Jiaqi, Lin Dahua
- Can Large Language Models Understand Real-world Complex Instructions? He Qianyu, Zeng Jie, Huang Wenhao, Chen Lina, Xiao Jin, He Qianxi, Zhou Xunzhe, Chen Lida, Wang Xintao, Huang Yuncheng, Ye Haoning, Li Zihan, Chen Shisong, Zhang Yikai, Gu Zhouhong, Liang Jiaqing, Xiao Yanghua
- Divide Et Impera: Multi-transformer Architectures For Complex Nlp-tasks Helland Solveig, Gavagnin Elena, De Spindler Alexandre
- Large Language Models Are Biased To Overestimate Profoundness Herrera-berg Eugenio, Browne Tomás Vergara, León-villagrá Pablo, Vives Marc-lluís, Calderon Cristian Buc
- Backpack Language Models Hewitt John, Thickstun John, Manning Christopher D., Liang Percy
- Ralle: A Framework For Developing And Evaluating Retrieval-augmented Large Language Models Hoshi Yasuto, Miyashita Daisuke, Ng Youyang, Tatsuno Kento, Morioka Yasuhiro, Torii Osamu, Deguchi Jun
- Unbiased Watermark For Large Language Models Hu Zhengmian, Chen Lichang, Wu Xidong, Wu Yihan, Zhang Hongyang, Huang Heng
- Directed Acyclic Transformer Pre-training For High-quality Non-autoregressive Text Generation Huang Fei, Ke Pei, Huang Minlie
- Chatgpt For Shaping The Future Of Dentistry: The Potential Of Multi-modal Large Language Model Huang Hanyao, Zheng Ou, Wang Dongdong, Yin Jiayi, Wang Zijin, Ding Shengxuan, Yin Heng, Xu Chuan, Yang Renjie, Zheng Qian, Shi Bing
- Trustgpt: A Benchmark For Trustworthy And Responsible Large Language Models Huang Yue, Zhang Qihui, Yu Philip S., Sun Lichao
- Soft Prompt Decoding For Multilingual Dense Retrieval Huang Zhiqi, Zeng Hansi, Zamani Hamed, Allan James
- Minimizing Factual Inconsistency And Hallucination In Large Language Models I Muneeswaran, Saxena Shreya, Prasad Siva, Prakash M V Sai, Shankar Advaith, V Varun, Vaddina Vishal, Gopalakrishnan Saisubramaniam
- Instructed To Bias: Instruction-tuned Language Models Exhibit Emergent Cognitive Bias Itzhak Itay, Stanovsky Gabriel, Rosenfeld Nir, Belinkov Yonatan
- Hijacking Context In Large Multi-modal Models Jeong Joonhyun
- Benchmarking And Explaining Large Language Model-based Code Generation: A Causality-centric Approach Ji Zhenlan, Ma Pingchuan, Li Zongjie, Wang Shuai
- Longllmlingua: Accelerating And Enhancing Llms In Long Context Scenarios Via Prompt Compression Jiang Huiqiang, Wu Qianhui, Luo Xufang, Li Dongsheng, Lin Chin-yew, Yang Yuqing, Qiu Lili
- Prompt Packer: Deceiving Llms Through Compositional Instruction With Hidden Attacks Jiang Shuyu, Chen Xingshu, Tang Rui
- How Can Recommender Systems Benefit From Large Language Models: A Survey Lin Jianghao, Dai Xinyi, Xi Yunjia, Liu Weiwen, Chen Bo, Zhang Hao, Liu Yong, Wu Chuhan, Li Xiangyang, Zhu Chenxu, Guo Huifeng, Yu Yong, Tang Ruiming, Zhang Weinan
- Emergent Representations Of Program Semantics In Language Models Trained On Programs Jin Charles, Rinard Martin
- Is Chatgpt Fair For Recommendation? Evaluating Fairness In Large Language Model Recommendation Zhang Jizhi, Bao Keqin, Zhang Yang, Wang Wenjie, Feng Fuli, He Xiangnan
- Robust Recommender System: A Survey And Future Directions Zhang Kaike, Cao Qi, Sun Fei, Wu Yunfan, Tao Shuchang, Shen Huawei, Cheng Xueqi
- Impact Of Co-occurrence On Factual Knowledge Of Large Language Models Kang Cheongwoong, Choi Jaesik
- The Past, Present And Better Future Of Feedback Learning In Large Language Models For Subjective Human Preferences And Values Kirk Hannah Rose, Bean Andrew M., Vidgen Bertie, Röttger Paul, Hale Scott A.
- Large Language Models Are Temporal And Causal Reasoners For Video Question Answering Ko Dohwan, Lee Ji Soo, Kang Wooyoung, Roh Byungseok, Kim Hyunwoo J.
- Transformer Language Models Handle Word Frequency In Prediction Head Kobayashi Goro, Kuribayashi Tatsuki, Yokoi Sho, Inui Kentaro
- Chatgpt: Jack Of All Trades, Master Of None Kocoń Jan, Cichecki Igor, Kaszyca Oliwier, Kochanek Mateusz, Szydło Dominika, Baran Joanna, Bielaniewicz Julita, Gruza Marcin, Janz Arkadiusz, Kanclerz Kamil, Kocoń Anna, Koptyra Bartłomiej, Mieleszczenko-kowszewicz Wiktoria, Miłkowski Piotr, Oleksy Marcin, Piasecki Maciej, Radliński Łukasz, Wojtasik Konrad, Woźniak Stanisław, Kazienko Przemysław
- Benchmarking Cognitive Biases In Large Language Models As Evaluators Koo Ryan, Lee Minhwa, Raheja Vipul, Park Jong Inn, Kim Zae Myung, Kang Dongyeop
- Gender Bias And Stereotypes In Large Language Models Kotek Hadas, Dockum Rikker, Sun David Q.
- On The Intersection Of Self-correction And Trust In Language Models Krishna Satyapriya
- Chatbots Are Not Reliable Text Annotators Kristensen-mclachlan Ross Deans, Canavan Miceal, Kardos Márton, Jacobsen Mia, Aarøe Lene
- The Moral Authority Of Chatgpt Krügel Sebastian, Ostermaier Andreas, Uhl Matthias
- In Generative AI We Trust: Can Chatbots Effectively Verify Political Information? Kuznetsova Elizaveta, Makhortykh Mykola, Vziatysheva Victoria, Stolze Martha, Baghumyan Ani, Urman Aleksandra
- Large Language Models Are Partially Primed In Pronoun Interpretation Lam Suet-ying, Zeng Qingcheng, Zhang Kexun, You Chenyu, Voigt Rob
- Open Sesame! Universal Black Box Jailbreaking Of Large Language Models Lapid Raz, Langberg Ron, Sipper Moshe
- A Systematic Study And Comprehensive Evaluation Of Chatgpt On Benchmark Datasets Laskar Md Tahmid Rahman, Bari M Saiful, Rahman Mizanur, Bhuiyan Md Amran Hossen, Joty Shafiq, Huang Jimmy Xiangji
- An Experimental Study On Pretraining Transformers From Scratch For IR Lassance Carlos, Déjean Hervé, Clinchant Stéphane
- Chatgpt: A Meta-analysis After 2.5 Months Leiter Christoph, Zhang Ran, Chen Yanran, Belouadi Jonas, Larionov Daniil, Fresen Vivian, Eger Steffen
- Deceptive Semantic Shortcuts On Reasoning Chains: How Far Can Models Go Without Hallucination? Li Bangzheng, Zhou Ben, Wang Fei, Fu Xingyu, Roth Dan, Chen Muhao
- This Land Is {Your, My} Land: Evaluating Geopolitical Biases In Language Models Li Bryan, Haider Samar, Callison-burch Chris
- Explaincpe: A Free-text Explanation Benchmark Of Chinese Pharmacist Examination Li Dongfang, Yu Jindi, Hu Baotian, Xu Zhenran, Zhang Min
- Graphix-t5: Mixing Pre-trained Transformers With Graph-aware Layers For Text-to-sql Parsing Li Jinyang, Hui Binyuan, Cheng Reynold, Qin Bowen, Ma Chenhao, Huo Nan, Huang Fei, Du Wenyu, Si Luo, Li Yongbin
- Fine-tuning Multimodal Llms To Follow Zero-shot Demonstrative Instructions Li Juncheng, Pan Kaihang, Ge Zhiqi, Gao Minghe, Ji Wei, Zhang Wenqiao, Chua Tat-seng, Tang Siliang, Zhang Hanwang, Zhuang Yueting
- On The Steerability Of Large Language Models Toward Data-driven Personas Li Junyi, Mehrabi Ninareh, Peris Charith, Goyal Palash, Chang Kai-wei, Galstyan Aram, Zemel Richard, Gupta Rahul
- Mvbench: A Comprehensive Multi-modal Video Understanding Benchmark Li Kunchang, Wang Yali, He Yinan, Li Yizhuo, Wang Yi, Liu Yi, Wang Zun, Xu Jilan, Chen Guo, Luo Ping, Wang Limin, Qiao Yu
- Summarizing Multiple Documents With Conversational Structure For Meta-review Generation Li Miao, Hovy Eduard, Lau Jey Han
- Starcoder: May The Source Be With You! Li Raymond, Allal Loubna Ben, Zi Yangtian, Muennighoff Niklas, Kocetkov Denis, Mou Chenghao, Marone Marc, Akiki Christopher, Li Jia, Chim Jenny, Liu Qian, Zheltonozhskii Evgenii, Zhuo Terry Yue, Wang Thomas, Dehaene Olivier, Davaadorj Mishig, Lamy-poirier Joel, Monteiro João, Shliazhko Oleh, Gontier Nicolas, Meade Nicholas, Zebaze Armel, Yee Ming-ho, Umapathi Logesh Kumar, Zhu Jian, Lipkin Benjamin, Oblokulov Muhtasham, Wang Zhiruo, Murthy Rudra, Stillerman Jason, Patel Siva Sankalp, Abulkhanov Dmitry, Zocca Marco, Dey Manan, Zhang Zhihan, Fahmy Nour, Bhattacharyya Urvashi, Yu Wenhao, Singh Swayam, Luccioni Sasha, Villegas Paulo, Kunakov Maxim, Zhdanov Fedor, Romero Manuel, Lee Tony, Timor Nadav, Ding Jennifer, Schlesinger Claire, Schoelkopf Hailey, Ebert Jan, Dao Tri, Mishra Mayank, Gu Alex, Robinson Jennifer, Anderson Carolyn Jane, Dolan-gavitt Brendan, Contractor Danish, Reddy Siva, Fried Daniel, Bahdanau Dzmitry, Jernite Yacine, Ferrandis Carlos Muñoz, Hughes Sean, Wolf Thomas, Guha Arjun, Von Werra Leandro, De Vries Harm
- Stablellava: Enhanced Visual Instruction Tuning With Synthesized Image-dialogue Data Li Yanda, Zhang Chi, Yu Gang, Wang Zhibin, Fu Bin, Lin Guosheng, Shen Chunhua, Chen Ling, Wei Yunchao
- A Survey On Fairness In Large Language Models Li Yingji, Du Mengnan, Song Rui, Wang Xin, Wang Ying
- Prompting Large Language Models For Counterfactual Generation: An Empirical Study Li Yongqi, Xu Mayi, Miao Xin, Zhou Shen, Qian Tieyun
- Textbooks Are All You Need II: Phi-1.5 Technical Report Li Yuanzhi, Bubeck Sébastien, Eldan Ronen, Del Giorno Allie, Gunasekar Suriya, Lee Yin Tat
- Split And Merge: Aligning Position Biases In Large Language Model Based Evaluators Li Zongjie, Wang Chaozheng, Ma Pingchuan, Wu Daoyuan, Wang Shuai, Gao Cuiyun, Liu Yang
- GPT Detectors Are Biased Against Non-native English Writers Liang Weixin, Yuksekgonul Mert, Mao Yining, Wu Eric, Zou James
- AI Transparency In The Age Of Llms: A Human-centered Research Roadmap Liao Q. Vera, Vaughan Jennifer Wortman
- Opening Up Chatgpt: Tracking Openness, Transparency, And Accountability In Instruction-tuned Text Generators Liesenfeld Andreas, Lopez Alianda, Dingemanse Mark
- Dynamic Scheduled Sampling With Imitation Loss For Neural Text Generation Lin Xiang, Jwalapuram Prathyusha, Joty Shafiq
- Exposing Attention Glitches With Flip-flop Language Modeling Liu Bingbin, Ash Jordan T., Goel Surbhi, Krishnamurthy Akshay, Zhang Cyril
- Concise And Organized Perception Facilitates Reasoning In Large Language Models Liu Junjie, Yan Shaotian, Shen Chen, Xie Liang, Wang Wenxiao, Ye Jieping
- Think-in-memory: Recalling And Post-thinking Enable Llms With Long-term Memory Liu Lei, Yang Xiaoyan, Shen Yue, Hu Binbin, Zhang Zhiqiang, Gu Jinjie, Zhang Guannan
- Selenite: Scaffolding Online Sensemaking With Comprehensive Overviews Elicited From Large Language Models Liu Michael Xieyang, Wu Tongshuang, Chen Tianying, Li Franklin Mingzhe, Kittur Aniket, Myers Brad A.
- Nlebench+norglm: A Comprehensive Empirical Analysis And Benchmark Dataset For Generative Language Models In Norwegian Liu Peng, Zhang Lemei, Farup Terje Nissen, Lauvrak Even W., Ingvaldsen Jon Espen, Eide Simen, Gulla Jon Atle, Yang Zhirong
- Aligning Large Language Models With Human Preferences Through Representation Engineering Liu Wenhao, Wang Xiaohua, Wu Muling, Li Tianlong, Lv Changze, Ling Zixuan, Zhu Jianhao, Zhang Cenyuan, Zheng Xiaoqing, Huang Xuanjing
- BOLT: Fast Energy-based Controlled Text Generation With Tunable Biases Liu Xin, Khalifa Muhammad, Wang Lu
- Litcab: Lightweight Language Model Calibration Over Short- And Long-form Responses Liu Xin, Khalifa Muhammad, Wang Lu
- G-eval: NLG Evaluation Using GPT-4 With Better Human Alignment Liu Yang, Iter Dan, Xu Yichong, Wang Shuohang, Xu Ruochen, Zhu Chenguang
- Trustworthy Llms: A Survey And Guideline For Evaluating Large Language Models' Alignment Liu Yang, Yao Yuanshun, Ton Jean-francois, Zhang Xiaoying, Guo Ruocheng, Cheng Hao, Klochkov Yegor, Taufiq Muhammad Faaiz, Li Hang
- Llms As Narcissistic Evaluators: When Ego Inflates Evaluation Scores Liu Yiqi, Moosavi Nafise Sadat, Lin Chenghua
- LLM360: Towards Fully Transparent Open-source Llms Liu Zhengzhong, Qiao Aurick, Neiswanger Willie, Wang Hongyi, Tan Bowen, Tao Tianhua, Li Junbo, Wang Yuqi, Sun Suqi, Pangarkar Omkar, Fan Richard, Gu Yi, Miller Victor, Zhuang Yonghao, He Guowei, Li Haonan, Koto Fajri, Tang Liping, Ranjan Nikhil, Shen Zhiqiang, Ren Xuguang, Iriondo Roberto, Mu Cun, Hu Zhiting, Schulze Mark, Nakov Preslav, Baldwin Tim, Xing Eric P.
- LLM Comparative Assessment: Zero-shot NLG Evaluation Through Pairwise Comparisons Using Large Language Models Liusie Adian, Manakul Potsawee, Gales Mark J. F.
- Safer Conversational AI As A Source Of User Delight Lu Xiaoding, Korshuk Aleksey, Liu Zongyi, Beauchamp William, Chai Research
- SCITAB: A Challenging Benchmark For Compositional Reasoning And Claim Verification On Scientific Tables Lu Xinyuan, Pan Liangming, Liu Qian, Nakov Preslav, Kan Min-yen
- Chatgpt And A New Academic Reality: Artificial Intelligence-written Research Papers And The Ethics Of The Large Language Models In Scholarly Publishing Lund Brady, Wang Ting, Mannuru Nishith Reddy, Nie Bing, Shimray Somipam, Wang Ziang
- SAIL: Search-augmented Instruction Learning Luo Hongyin, Chuang Yung-sung, Gong Yuan, Zhang Tianhua, Kim Yoon, Wu Xixin, Fox Danny, Meng Helen, Glass James
- Recranker: Instruction Tuning Large Language Model As Ranker For Top-k Recommendation Luo Sichun, He Bowei, Zhao Haohan, Shao Wei, Qi Yanlin, Huang Yinya, Zhou Aojun, Yao Yuxuan, Li Zongpeng, Xiao Yuanzhang, Zhan Mingjie, Song Linqi
- An Empirical Study Of Catastrophic Forgetting In Large Language Models During Continual Fine-tuning Luo Yun, Yang Zhen, Meng Fandong, Li Yafu, Zhou Jie, Zhang Yue
- Augmented Large Language Models With Parametric Knowledge Guiding Luo Ziyang, Xu Can, Zhao Pu, Geng Xiubo, Tao Chongyang, Ma Jing, Lin Qingwei, Jiang Daxin
- Fingpt: Large Generative Models For A Small Language Luukkonen Risto, Komulainen Ville, Luoma Jouni, Eskelinen Anni, Kanerva Jenna, Kupari Hanna-mari, Ginter Filip, Laippala Veronika, Muennighoff Niklas, Piktus Aleksandra, Wang Thomas, Tazi Nouamane, Scao Teven Le, Wolf Thomas, Suominen Osma, Sairanen Samuli, Merioksa Mikko, Heinonen Jyrki, Vahtola Aija, Antao Samuel, Pyysalo Sampo
- Are We Falling In A Middle-intelligence Trap? An Analysis And Mitigation Of The Reversal Curse Lv Ang, Zhang Kaiyi, Xie Shufang, Tu Quan, Chen Yuhan, Wen Ji-rong, Yan Rui
- Fairness-guided Few-shot Prompting For Large Language Models Ma Huan, Zhang Changqing, Bian Yatao, Liu Lemao, Zhang Zhirui, Zhao Peilin, Zhang Shu, Fu Huazhu, Hu Qinghua, Wu Bingzhe
- "Oops, Did I Just Say That?" Testing And Repairing Unethical Suggestions Of Large Language Models With Suggest-critique-reflect Process Ma Pingchuan, Li Zongjie, Sun Ao, Wang Shuai
- An Examination Of The Compositionality Of Large Generative Vision-language Models Ma Teli, Li Rong, Liang Junwei
- Sources Of Hallucination By Large Language Models On Inference Tasks Mckenna Nick, Li Tianyi, Cheng Liang, Hosseini Mohammad Javad, Johnson Mark, Steedman Mark
- From Bytes To Biases: Investigating The Cultural Self-perception Of Large Language Models Messner Wolfgang, Greene Tatum, Matalone Josephine
- Think Twice: Measuring The Efficiency Of Eliminating Prediction Shortcuts Of Question Answering Models Mikula Lukáš, Štefánik Michal, Petrovič Marek, Sojka Petr
- How Trustworthy Are Open-source Llms? An Assessment Under Malicious Demonstrations Shows Their Vulnerabilities Mo Lingbo, Wang Boshi, Chen Muhao, Sun Huan
- Assigning AI: Seven Approaches For Students, With Prompts Mollick Ethan, Mollick Lilach
- Increasing The Performance Of Cognitively Inspired Data-efficient Language Models Via Implicit Structure Building Momen Omar, Arps David, Kallmeyer Laura
- N-critics: Self-refinement Of Large Language Models With Ensemble Of Critics Mousavi Sajad, Gutiérrez Ricardo Luna, Rengarajan Desik, Gundecha Vineet, Babu Ashwin Ramesh, Naug Avisek, Guillen Antonio, Sarkar Soumyendu
- How To Plant Trees In Language Models: Data And Architectural Effects On The Emergence Of Syntactic Inductive Biases Mueller Aaron, Linzen Tal
- A Brief History Of Prompt: Leveraging Language Models. (Through Advanced Prompting) Muktadir Golam Md
- Enhancing Logical Reasoning In Large Language Models To Facilitate Legal Applications Nguyen Ha-thanh, Fungwacharakorn Wachara, Satoh Ken
- In-context Example Selection With Influences Nguyen Tai, Wong Eric
- Culturax: A Cleaned, Enormous, And Multilingual Dataset For Large Language Models In 167 Languages Nguyen Thuat, Van Nguyen Chien, Lai Viet Dac, Man Hieu, Ngo Nghia Trung, Dernoncourt Franck, Rossi Ryan A., Nguyen Thien Huu
- Seallms -- Large Language Models For Southeast Asia Nguyen Xuan-phi, Zhang Wenxuan, Li Xin, Aljunied Mahani, Hu Zhiqiang, Shen Chenhui, Chia Yew Ken, Li Xingxuan, Wang Jianyu, Tan Qingyu, Cheng Liying, Chen Guanzheng, Deng Yue, Yang Sen, Liu Chaoqun, Zhang Hang, Bing Lidong
- LEXTREME: A Multi-lingual And Multi-task Benchmark For The Legal Domain Niklaus Joel, Matoshi Veton, Rani Pooja, Galassi Andrea, Stürmer Matthias, Chalkidis Ilias
- In-contextual Gender Bias Suppression For Large Language Models Oba Daisuke, Kaneko Masahiro, Bollegala Danushka
- Are Large Language Models Fit For Guided Reading? Ochieng Peter
- Large Language Model (LLM) Bias Index -- LLMBI Oketunji Abiodun Finbarrs, Anas Muhammad, Saina Deepthi
- Thoughtsource: A Central Hub For Large Language Model Reasoning Data Ott Simon, Hebenstreit Konstantin, Liévin Valentin, Hother Christoffer Egeberg, Moradi Milad, Mayrhauser Maximilian, Praas Robert, Winther Ole, Samwald Matthias
- Dialogbench: Evaluating Llms As Human-like Dialogue Systems Ou Jiao, Lu Junda, Liu Che, Tang Yihong, Zhang Fuzheng, Zhang Di, Gai Kun
- Med-halt: Medical Domain Hallucination Test For Large Language Models Pal Ankit, Umapathi Logesh Kumar, Sankarasubbu Malaikannan
- COSMIC: Data Efficient Instruction-tuning For Speech In-context Learning Pan Jing, Wu Jian, Gaur Yashesh, Sivasankaran Sunit, Chen Zhuo, Liu Shujie, Li Jinyu
- Semantic Composition In Visually Grounded Language Models Pandey Rohan
- Injecting Structural Hints: Using Language Models To Study Inductive Biases In Language Learning Papadimitriou Isabel, Jurafsky Dan
- Interpretable Visual Question Answering Via Reasoning Supervision Parelli Maria, Mallis Dimitrios, Diomataris Markos, Pitsikalis Vassilis
- Reproducing Whisper-style Training Using An Open-source Toolkit And Publicly Available Data Peng Yifan, Tian Jinchuan, Yan Brian, Berrebbi Dan, Chang Xuankai, Li Xinjian, Shi Jiatong, Arora Siddhant, Chen William, Sharma Roshan, Zhang Wangyou, Sudo Yui, Shakeel Muhammad, Jung Jee-weon, Maiti Soumi, Watanabe Shinji
- Large Language Models Can Infer Psychological Dispositions Of Social Media Users Peters Heinrich, Matz Sandra
- When Do Prompting And Prefix-tuning Work? A Theory Of Capabilities And Limitations Petrov Aleksandar, Torr Philip H. S., Bibi Adel
- Attention Sorting Combats Recency Bias In Long Context Language Models Peysakhovich Alexander, Lerer Adam
- Large Language Models Sensitivity To The Order Of Options In Multiple-choice Questions Pezeshkpour Pouya, Hruschka Estevam
- Emptying The Ocean With A Spoon: Should We Edit Models? Pinter Yuval, Elhadad Michael
- Mosaicbert: A Bidirectional Encoder Optimized For Fast Pretraining Portes Jacob, Trott Alex, Havens Sam, King Daniel, Venigalla Abhinav, Nadeem Moin, Sardana Nikhil, Khudia Daya, Frankle Jonathan
- Science In The Era Of Chatgpt, Large Language Models And Generative AI: Challenges For Research Ethics And How To Respond Pournaras Evangelos
- The Robots Are Here: Navigating The Generative AI Revolution In Computing Education Prather James, Denny Paul, Leinonen Juho, Becker Brett A., Albluwi Ibrahim, Craig Michelle, Keuning Hieke, Kiesler Natalie, Kohn Tobias, Luxton-reilly Andrew, Macneil Stephen, Peterson Andrew, Pettit Raymond, Reeves Brent N., Savelka Jaromir
- Gameeval: Evaluating Llms On Conversational Games Qiao Dan, Wu Chenfei, Liang Yaobo, Li Juntao, Duan Nan
- When Large Language Models Contradict Humans? Large Language Models' Sycophantic Behaviour Ranaldi Leonardo, Pucci Giulia
- Can Chatgpt Assess Human Personalities? A General Evaluation Framework Rao Haocong, Leung Cyril, Miao Chunyan
- Supporting Human-ai Collaboration In Auditing Llms With Llms Rastogi Charvi, Ribeiro Marco Tulio, King Nicholas, Nori Harsha, Amershi Saleema
- Weakly Supervised Detection Of Hallucinations In LLM Activations Rateike Miriam, Cintas Celia, Wamburu John, Akumu Tanya, Speakman Skyler
- On Context Utilization In Summarization With Large Language Models Ravaut Mathieu, Sun Aixin, Chen Nancy F., Joty Shafiq
- I'm Afraid I Can't Do That: Predicting Prompt Refusal In Black-box Generative Language Models Reuter Max, Schulze William
- Benchmarking Causal Study To Interpret Large Language Models For Source Code Rodriguez-cardenas Daniel, Palacio David N., Khati Dipin, Burke Henry, Poshyvanyk Denys
- Journey Of Hallucination-minimized Generative AI Solutions For Financial Decision Makers Roychowdhury Sohini
- ICLEF: In-context Learning With Expert Feedback For Explainable Style Transfer Saakyan Arkadiy, Muresan Smaranda
- Branch-solve-merge Improves Large Language Model Evaluation And Generation Saha Swarnadeep, Levy Omer, Celikyilmaz Asli, Bansal Mohit, Weston Jason, Li Xian
- Verbosity Bias In Preference Labeling By Large Language Models Saito Keita, Wachi Akifumi, Wataoka Koki, Akimoto Youhei
- In-context Impersonation Reveals Large Language Models' Strengths And Biases Salewski Leonard, Alaniz Stephan, Rio-torto Isabel, Schulz Eric, Akata Zeynep
- Large Language Models Are Competitive Near Cold-start Recommenders For Language- And Item-based Preferences Sanner Scott, Balog Krisztian, Radlinski Filip, Wedin Ben, Dixon Lucas
- Inseq: An Interpretability Toolkit For Sequence Generation Models Sarti Gabriele, Feldhus Nils, Sickert Ludwig, Van Der Wal Oskar, Nissim Malvina, Bisazza Arianna
- Towards Llm-based Autograding For Short Textual Answers Schneider Johannes, Schenk Bernd, Niklaus Christina
- Evaluating The Deductive Competence Of Large Language Models Seals Spencer M., Shalin Valerie L.
- Personality Traits In Large Language Models Serapio-garcía Greg, Safdari Mustafa, Crepy Clément, Sun Luning, Fitz Stephen, Romero Peter, Abdulhai Marwa, Faust Aleksandra, Matarić Maja
- Cognitive Effects In Large Language Models Shaki Jonathan, Kraus Sarit, Wooldridge Michael
- Is Translation Helpful? An Empirical Analysis Of Cross-lingual Transfer In Low-resource Dialog Generation Shen Lei, Yu Shuai, Shen Xiaoyu
- POSQA: Probe The World Models Of Llms With Size Comparisons Shu Chang, Han Jiuzhou, Liu Fangyu, Shareghi Ehsan, Collier Nigel
- SC-ML: Self-supervised Counterfactual Metric Learning For Debiased Visual Question Answering Shu Xinyao, Yan Shiyang, Yang Xu, Wu Ziheng, Chen Zhongfeng, Lu Zhenyu
- Measuring Inductive Biases Of In-context Learning With Underspecified Demonstrations Si Chenglei, Friedman Dan, Joshi Nitish, Feng Shi, Chen Danqi, He He
- Mitigating Object Hallucinations In Large Vision-language Models Through Visual Contrastive Decoding Leng Sicong, Zhang Hang, Chen Guanzheng, Li Xin, Lu Shijian, Miao Chunyan, Bing Lidong
- The Confidence-competence Gap In Large Language Models: A Cognitive Study Singh Aniket Kumar, Devkota Suman, Lamichhane Bishal, Dhakal Uttam, Dhakal Chandra
- Unlocking Model Insights: A Dataset For Automated Model Card Generation Singh Shruti, Lodwal Hitesh, Malwat Husain, Thakur Rakesh, Singh Mayank
- A Long Way To Go: Investigating Length Correlations In RLHF Singhal Prasann, Goyal Tanya, Xu Jiacheng, Durrett Greg
- Decoding Chatgpt: A Taxonomy Of Existing Research, Current Challenges, And Possible Future Directions Sohail Shahab Saquib, Farhat Faiza, Himeur Yassine, Nadeem Mohammad, Madsen Dag Øivind, Singh Yashbir, Atalla Shadi, Mansoor Wathiq
- Pythia: A Suite For Analyzing Large Language Models Across Training And Scaling Biderman Stella, Schoelkopf Hailey, Anthony Quentin, Bradley Herbie, O'brien Kyle, Hallahan Eric, Khan Mohammad Aflah, Purohit Shivanshu, Prashanth Usvsn Sai, Raff Edward, Skowron Aviya, Sutawika Lintang, Van Der Wal Oskar
- Learning From Red Teaming: Gender Bias Provocation And Mitigation In Large Language Models Su Hsuan, Cheng Cheng-chu, Farn Hua, Kumar Shachi H, Sahay Saurav, Chen Shang-tse, Lee Hung-yi
- Character-level Chinese Backpack Language Models Sun Hao, Hewitt John
- A Short Survey Of Viewing Large Language Models In Legal Aspect Sun Zhongxiang
- Large Language Models For Intent-driven Session Recommendations Sun Zhu, Liu Hongyang, Qu Xinghua, Feng Kaidong, Wang Yan, Ong Yew-soon
- Do Large Language Models Show Decision Heuristics Similar To Humans? A Case Study Using GPT-3.5 Suri Gaurav, Slater Lily R., Ziaee Ali, Nguyen Morgan
- Gender-specific Machine Translation With Large Language Models Sánchez Eduardo, Andrews Pierre, Stenetorp Pontus, Artetxe Mikel, Costa-jussà Marta R.
- Imaginations Of WALL-E: Reconstructing Experiences With An Imagination-inspired Module For Advanced AI Systems Taghavi Zeinab Sadat, Gooran Soroush, Dalili Seyed Arshan, Amirzadeh Hamidreza, Nematbakhsh Mohammad Jalal, Sameti Hossein
- Soft-prompt Tuning For Large Language Models To Evaluate Bias Tian Jacob-junqi, Emerson David, Miyandoab Sevil Zanjani, Pandya Deval, Seyyed-kalantari Laleh, Khattak Faiza Khan
- Chimed-gpt: A Chinese Medical Large Language Model With Full Training Regime And Better Alignment To Human Preferences Tian Yuanhe, Gan Ruyi, Song Yan, Zhang Jiaxing, Zhang Yongdong
- Post Turing: Mapping The Landscape Of LLM Evaluation Tikhonov Alexey, Yamshchikov Ivan P.
- Fine-tune Language Models To Approximate Unbiased In-context Learning Timothy Chu, Zhao Song, Chiwun Yang
- Do Llms Exhibit Human-like Response Biases? A Case Study In Survey Design Tjuatja Lindia, Chen Valerie, Wu Sherry Tongshuang, Talwalkar Ameet, Neubig Graham
- Clinical Camel: An Open Expert-level Medical Language Model With Dialogue-based Knowledge Encoding Toma Augustin, Lawler Patrick R., Ba Jimmy, Krishnan Rahul G., Rubin Barry B., Wang Bo
- Viclevr: A Visual Reasoning Dataset And Hybrid Multimodal Fusion Model For Visual Question Answering In Vietnamese Tran Khiem Vinh, Phan Hao Phu, Van Nguyen Kiet, Nguyen Ngan Luu Thuy
- Sight Beyond Text: Multi-modal Training Enhances Llms In Truthfulness And Ethics Tu Haoqin, Zhao Bingchen, Wei Chen, Xie Cihang
- Activation Addition: Steering Language Models Without Optimization Turner Alexander Matt, Thiergart Lisa, Leech Gavin, Udell David, Vazquez Juan J., Mini Ulisse, Macdiarmid Monte
- Language Models Don't Always Say What They Think: Unfaithful Explanations In Chain-of-thought Prompting Turpin Miles, Michael Julian, Perez Ethan, Bowman Samuel R.
- How Prevalent Is Gender Bias In Chatgpt? -- Exploring German And English Chatgpt Responses Urchs Stefanie, Thurner Veronika, Aßenmacher Matthias, Heumann Christian, Thiemichen Stephanie
- Humans And Language Models Diverge When Predicting Repeating Text Vaidya Aditya R., Turek Javier, Huth Alexander G.
- Adventures Of Trustworthy Vision-language Models: A Survey Vatsa Mayank, Jain Anubhooti, Singh Richa
- Behind The Magic, MERLIM: Multi-modal Evaluation Benchmark For Large Image-language Models Villa Andrés, Alcázar Juan Carlos León, Soto Alvaro, Ghanem Bernard
- Fairpy: A Toolkit For Evaluation Of Social Biases And Their Mitigation In Large Language Models Viswanath Hrishikesh, Zhang Tianyi
- Unraveling Downstream Gender Bias From Large Language Models: A Study On AI Educational Writing Assistance Wambsganss Thiemo, Su Xiaotian, Swamy Vinitra, Neshaei Seyed Parsa, Rietsche Roman, Käser Tanja
- Are Personalized Stochastic Parrots More Dangerous? Evaluating Persona Biases In Dialogue Systems Wan Yixin, Zhao Jieyu, Chadha Aman, Peng Nanyun, Chang Kai-wei
- Robust Natural Language Understanding With Residual Attention Debiasing Wang Fei, Huang James Y., Yan Tianyi, Zhou Wenxuan, Chen Muhao
- Knowledgeable In-context Tuning: Exploring And Exploiting Factual Knowledge For In-context Learning Wang Jianing, Wang Chengyu, Tan Chuanqi, Huang Jun, Gao Ming
- On The Robustness Of Chatgpt: An Adversarial And Out-of-distribution Perspective Wang Jindong, Hu Xixu, Hou Wenxin, Chen Hao, Zheng Runkai, Wang Yidong, Yang Linyi, Huang Haojun, Ye Wei, Geng Xiubo, Jiao Binxin, Zhang Yue, Xie Xing
- Improved Visual Fine-tuning With Natural Language Supervision Wang Junyang, Xu Yuanhong, Hu Juhua, Yan Ming, Sang Jitao, Qian Qi
- Chated: A Chatbot Leveraging Chatgpt For An Enhanced Learning Experience In Higher Education Wang Kevin, Ramos Jason, Lawrence Ramon
- Adding Guardrails To Advanced Chatbots Wang Yanchen, Singh Lisa
- Primacy Effect Of Chatgpt Wang Yiwei, Cai Yujun, Chen Muhao, Liang Yuxuan, Hooi Bryan
- Menatqa: A New Dataset For Testing The Temporal Comprehension And Reasoning Abilities Of Large Language Models Wei Yifan, Su Yisong, Ma Huanhuan, Yu Xiaoyan, Lei Fangyu, Zhang Yuanzhe, Zhao Jun, Liu Kang
- Magicoder: Empowering Code Generation With Oss-instruct Wei Yuxiang, Wang Zhe, Liu Jiawei, Ding Yifeng, Zhang Lingming
- Mindmap: Knowledge Graph Prompting Sparks Graph Of Thoughts In Large Language Models Wen Yilin, Wang Zifeng, Sun Jimeng
- Unveiling Security, Privacy, And Ethical Concerns Of Chatgpt Wu Xiaodong, Duan Ran, Ni Jianbing
- Adaptive Chameleon Or Stubborn Sloth: Revealing The Behavior Of Large Language Models In Knowledge Conflicts Xie Jian, Zhang Kai, Chen Jiangjie, Lou Renze, Su Yu
- Tencentllmeval: A Hierarchical Evaluation Of Real-world Capabilities For Human-aligned Llms Xie Shuyi, Yao Wenlin, Dai Yong, Wang Shaobo, Zhou Donlin, Jin Lifeng, Feng Xinhua, Wei Pengzhi, Lin Yujie, Hu Zhichao, Yu Dong, Zhang Zhengyou, Nie Jing, Liu Yuhong
- A Preliminary Study Of Chatgpt On News Recommendation: Personalization, Provider Fairness, Fake News Xinyi Li, Yongfeng Zhang, Edward C. Malthouse
- Are Large Language Models Really Good Logical Reasoners? A Comprehensive Evaluation And Beyond Xu Fangzhi, Lin Qika, Han Jiawei, Zhao Tianzhe, Liu Jun, Cambria Erik
- Mental-llm: Leveraging Large Language Models For Mental Health Prediction Via Online Text Data Xu Xuhai, Yao Bingsheng, Dong Yuanzhe, Gabriel Saadia, Yu Hong, Hendler James, Ghassemi Marzyeh, Dey Anind K., Wang Dakuo
- Representation Learning With Large Language Models For Recommendation Xubin Ren, Wei Wei, Lianghao Xia, Lixin Su, Suqi Cheng, Junfeng Wang, Dawei Yin, Chao Huang
- Bias And Fairness In Chatbots: An Overview Xue Jintang, Wang Yun-cheng, Wei Chengwei, Liu Xiaofeng, Woo Jonghye, Kuo C.-c. Jay
- Occuquest: Mitigating Occupational Bias For Inclusive Large Language Models Xue Mingfeng, Liu Dayiheng, Yang Kexin, Dong Guanting, Lei Wenqiang, Yuan Zheng, Zhou Chang, Zhou Jingren
- Pretraining Data Mixtures Enable Narrow Model Selection Capabilities In Transformer Models Yadlowsky Steve, Doshi Lyric, Tripuraneni Nilesh
- Backdooring Instruction-tuned Large Language Models With Virtual Prompt Injection Yan Jun, Yadav Vikas, Li Shiyang, Chen Lichang, Tang Zheng, Wang Hai, Srinivasan Vijay, Ren Xiang, Jin Hongxia
- Practical And Ethical Challenges Of Large Language Models In Education: A Systematic Scoping Review Yan Lixiang, Sha Lele, Zhao Linxuan, Li Yuheng, Martinez-maldonado Roberto, Chen Guanliang, Li Xinyu, Jin Yueqiao, Gašević Dragan
- Beyond Testers' Biases: Guiding Model Testing With Knowledge Bases Using Llms Yang Chenyang, Rustogi Rishabh, Brower-sinning Rachel, Lewis Grace A., Kästner Christian, Wu Tongshuang
- Integrating UMLS Knowledge Into Large Language Models For Medical Question Answering Yang Rui, Marrese-taylor Edison, Ke Yuhe, Cheng Lechao, Chen Qingyu, Li Irene
- Bias A-head? Analyzing Bias In Transformer-based Language Model Attention Heads Yang Yi, Duan Hanyu, Abbasi Ahmed, Lalor John P., Tam Kar Yan
- Agent Lumos: Unified And Modular Training For Open-source Language Agents Yin Da, Brahman Faeze, Ravichander Abhilasha, Chandu Khyathi, Chang Kai-wei, Choi Yejin, Lin Bill Yuchen
- LAMM: Language-assisted Multi-modal Instruction-tuning Dataset, Framework, And Benchmark Yin Zhenfei, Wang Jiong, Cao Jianjian, Shi Zhelun, Liu Dingning, Li Mukai, Sheng Lu, Bai Lei, Huang Xiaoshui, Wang Zhiyong, Shao Jing, Ouyang Wanli
- Alignment Is Not Sufficient To Prevent Large Language Models From Generating Harmful Information: A Psychoanalytic Perspective Yin Zi, Ding Wei, Liu Jia
- Kola: Carefully Benchmarking World Knowledge Of Large Language Models Yu Jifan, Wang Xiaozhi, Tu Shangqing, Cao Shulin, Zhang-li Daniel, Lv Xin, Peng Hao, Yao Zijun, Zhang Xiaohan, Li Hanming, Li Chunyang, Zhang Zheyuan, Bai Yushi, Liu Yantao, Xin Amy, Lin Nianyi, Yun Kaifeng, Gong Linlu, Chen Jianhui, Wu Zhili, Qi Yunjia, Li Weikai, Guan Yong, Zeng Kaisheng, Qi Ji, Jin Hailong, Liu Jinxin, Gu Yu, Yao Yuan, Ding Ning, Hou Lei, Liu Zhiyuan, Xu Bin, Tang Jie, Li Juanzi
- Large Language Model As Attributed Training Data Generator: A Tale Of Diversity And Bias Yu Yue, Zhuang Yuchen, Zhang Jieyu, Meng Yu, Ratner Alexander, Krishna Ranjay, Shen Jiaming, Zhang Chao
- Speculative Contrastive Decoding Yuan Hongyi, Lu Keming, Huang Fei, Yuan Zheng, Zhou Chang
- Plan, Eliminate, And Track -- Language Models Are Good Teachers For Embodied Agents Yue Wu, So Yeon Min, Yonatan Bisk, Ruslan Salakhutdinov, Amos Azaria, Yuanzhi Li, Tom Mitchell, Shrimai Prabhumoye
- Aligning Large Language Models With Human: A Survey Yufei Wang, Wanjun Zhong, Liangyou Li, Fei Mi, Xingshan Zeng, Wenyong Huang, Lifeng Shang, Xin Jiang, Qun Liu
- Keeping Users Engaged During Repeated Administration Of The Same Questionnaire: Using Large Language Models To Reliably Diversify Questions Yun Hye Sun, Arjmand Mehdi, Sherlock Phillip, Paasche-orlow Michael K., Griffith James W., Bickmore Timothy
- Large Language Models Are Zero-shot Rankers For Recommender Systems Yupeng Hou, Junjie Zhang, Zihan Lin, Hongyu Lu, Ruobing Xie, Julian Mcauley, Wayne Xin Zhao
- Fairness-aware Structured Pruning In Transformers Zayed Abdelrahman, Mordido Goncalo, Shabanian Samira, Baldini Ioana, Chandar Sarath
- Taxonomy-based Checklist For Large Language Model Evaluation Zhang Damin
- On The Safety Of Open-sourced Large Language Models: Does Alignment Really Prevent Them From Being Misused? Zhang Hangfan, Guo Zhimeng, Zhu Huaisheng, Cao Bochuan, Lin Lu, Jia Jinyuan, Chen Jinghui, Wu Dinghao
- Marathon: A Race Through The Realm Of Long Context With Large Language Models Zhang Lei, Li Yunshui, Liu Ziqiang, Yang Jiaxi, Liu Junhao, Chen Longze, Luo Run, Yang Min
- Weighted Sampling For Masked Language Modeling Zhang Linhan, Chen Qian, Wang Wen, Deng Chong, Cao Xin, Hao Kongzhang, Jiang Yuxin, Wang Wei
- Balancing Specialized And General Skills In Llms: The Impact Of Modern Tuning And Data Strategy Zhang Zheng, Zheng Chen, Tang Da, Sun Ke, Ma Yukun, Bu Yingtong, Zhou Xun, Zhao Liang
- Explainability For Large Language Models: A Survey Zhao Haiyan, Chen Hanjie, Yang Fan, Liu Ninghao, Deng Huiqi, Cai Hengyi, Wang Shuaiqiang, Yin Dawei, Du Mengnan
- Learning And Forgetting Unsafe Examples In Large Language Models Zhao Jiachen, Deng Zhun, Madras David, Zou James, Ren Mengye
- Chbias: Bias Evaluation And Mitigation Of Chinese Conversational Language Models Zhao Jiaxu, Fang Meng, Shi Zijing, Li Yitong, Chen Ling, Pechenizkiy Mykola
- Mind Vs. Mouth: On Measuring Re-judge Inconsistency Of Social Bias In Large Language Models Zhao Yachao, Wang Bo, Zhao Dongming, Huang Kun, Wang Yan, He Ruifang, Hou Yuexian
- Judging Llm-as-a-judge With Mt-bench And Chatbot Arena Zheng Lianmin, Chiang Wei-lin, Sheng Ying, Zhuang Siyuan, Wu Zhanghao, Zhuang Yonghao, Lin Zi, Li Zhuohan, Li Dacheng, Xing Eric P., Zhang Hao, Gonzalez Joseph E., Stoica Ion
- Gpt-fathom: Benchmarking Large Language Models To Decipher The Evolutionary Path Towards GPT-4 And Beyond Zheng Shen, Zhang Yuyu, Zhu Yijie, Xi Chenguang, Gao Pengyang, Zhou Xun, Chang Kevin Chen-chuan
- Principle-driven Self-alignment Of Language Models From Scratch With Minimal Human Supervision Zhiqing Sun, Yikang Shen, Qinhong Zhou, Hongxin Zhang, Zhenfang Chen, David Cox, Yiming Yang, Chuang Gan
- Batch Calibration: Rethinking Calibration For In-context Learning And Prompt Engineering Zhou Han, Wan Xingchen, Proleev Lev, Mincu Diana, Chen Jilin, Heller Katherine, Roy Subhrajit
- Instruction-following Evaluation For Large Language Models Zhou Jeffrey, Lu Tianjian, Mishra Swaroop, Brahma Siddhartha, Basu Sujoy, Luan Yi, Zhou Denny, Hou Le
- Ethical Chatgpt: Concerns, Challenges, And Commandments Zhou Jianlong, Müller Heimo, Holzinger Andreas, Chen Fang
- Unleashing GPT On The Metaverse: Savior Or Destroyer? Zhou Pengyuan
- The Mystery Of In-context Learning: A Comprehensive Survey On Interpretation And Analysis Zhou Yuxiang, Li Jiazheng, Xiang Yanzheng, Yan Hanqi, Gui Lin, He Yulan
- Pose: Efficient Context Window Extension Of Llms Via Positional Skip-wise Training Zhu Dawei, Yang Nan, Wang Liang, Song Yifan, Wu Wenhao, Wei Furu, Li Sujian
- Judgelm: Fine-tuned Large Language Models Are Scalable Judges Zhu Lianghui, Wang Xinggang, Wang Xinlong
- Beyond Yes And No: Improving Zero-shot LLM Rankers Via Scoring Fine-grained Relevance Labels Zhuang Honglei, Qin Zhen, Hui Kai, Wu Junru, Yan Le, Wang Xuanhui, Bendersky Michael
- Red Teaming Chatgpt Via Jailbreaking: Bias, Robustness, Reliability And Toxicity Zhuo Terry Yue, Huang Yujin, Chen Chunyang, Xing Zhenchang
- A Survey On Symbolic Knowledge Distillation Of Large Language Models Acharya Kamal, Velasquez Alvaro, Song Houbing Herbert
- Attnlrp: Attention-aware Layer-wise Relevance Propagation For Transformers Achtibat Reduan, Hatefi Sayed Mohammad Vakilzadeh, Dreyer Maximilian, Jain Aakriti, Wiegand Thomas, Lapuschkin Sebastian, Samek Wojciech
- Step-by-step Unmasking For Parameter-efficient Fine-tuning Of Large Language Models Agarwal Aradhye, Ramesh Suhas K, Sengupta Ayan, Chakraborty Tanmoy
- Many-shot In-context Learning Agarwal Rishabh, Singh Avi, Zhang Lei M., Bohnet Bernd, Rosias Luis, Chan Stephanie, Zhang Biao, Anand Ankesh, Abbas Zaheer, Nova Azade, Co-reyes John D., Chu Eric, Behbahani Feryal, Faust Aleksandra, Larochelle Hugo
- Ethical Reasoning And Moral Value Alignment Of Llms Depend On The Language We Prompt Them In Agarwal Utkarsh, Tanmay Kumar, Khandelwal Aditi, Choudhury Monojit
- Can Llms Perform Structured Graph Reasoning? Agrawal Palaash, Vasania Shavak, Tan Cheston
- Plug And Play With Prompts: A Prompt Tuning Approach For Controlling Text Generation Ajwani Rohan Deepak, Zhu Zining, Rose Jonathan, Rudzicz Frank
- Llmauditor: A Framework For Auditing Large Language Models Using Human-in-the-loop Amirizaniani Maryam, Yao Jihan, Lavergne Adrian, Okada Elizabeth Snell, Chadha Aman, Roosta Tanya, Shah Chirag
- Understanding Alignment In Multimodal Llms: A Comprehensive Study Amirloo Elmira, Fauconnier Jean-philippe, Roesmann Christoph, Kerl Christian, Boney Rinu, Qian Yusu, Wang Zirui, Dehghan Afshin, Yang Yinfei, Gan Zhe, Grasch Peter
- A Framework For Fine-tuning Llms Using Heterogeneous Feedback Aponte Ryan, Rossi Ryan A., Guo Shunan, Dernoncourt Franck, Yu Tong, Chen Xiang, Mitra Subrata, Lipka Nedim
- Open (clinical) Llms Are Sensitive To Instruction Phrasings Arroyo Alberto Mario Ceballos, Munnangi Monica, Sun Jiuding, Zhang Karen Y. C., Mcinerney Denis Jered, Wallace Byron C., Amir Silvio
- Current State Of LLM Risks And AI Guardrails Ayyamperumal Suriya Ganesh, Ge Limin
- Llm-driven Robots Risk Enacting Discrimination, Violence, And Unlawful Actions Azeem Rumaisa, Hundt Andrew, Mansouri Masoumeh, Brandão Martim
- Downstream Bias Mitigation Is All You Need Baksi Arkadeep, Singh Rahul, Joshi Tarun
- Is Your Large Language Model Knowledgeable Or A Choices-only Cheater? Balepur Nishant, Rudinger Rachel
- MAIRA-2: Grounded Radiology Report Generation Bannur Shruthi, Bouzid Kenza, Castro Daniel C., Schwaighofer Anton, Bond-taylor Sam, Ilse Maximilian, Pérez-garcía Fernando, Salvatelli Valentina, Sharma Harshita, Meissen Felix, Ranjit Mercy, Srivastav Shaury, Gong Julia, Falck Fabian, Oktay Ozan, Thieme Anja, Lungren Matthew P., Wetscherek Maria Teodora, Alvarez-valle Javier, Hyland Stephanie L.
- Decoding Matters: Addressing Amplification Bias And Homogeneity Issue For Llm-based Recommendation Bao Keqin, Zhang Jizhi, Zhang Yang, Huo Xinyue, Chen Chong, Feng Fuli
- Aligning Large Language Models From Self-reference AI Feedback With One General Principle Bao Rong, Zheng Rui, Dou Shihan, Wang Xiao, Zhou Enyu, Wang Bo, Zhang Qi, Ding Liang, Tao Dacheng
- Reinforcement Learning From LLM Feedback To Counteract Goal Misgeneralization Barj Houda Nait El, Sautory Theophile
- From 'showgirls' To 'performers': Fine-tuning With Gender-inclusive Language For Bias Reduction In Llms Bartl Marion, Leavy Susan
- Chain-of-thought Unfaithfulness As Disguised Accuracy Bentham Oliver, Stringham Nathan, Marasović Ana
- Redefining "hallucination" In Llms: Towards A Psychology-informed Framework For Mitigating Misinformation Berberette Elijah, Hutchins Jack, Sadovnik Amir
- A Systematic Analysis Of Large Language Models As Soft Reasoners: The Case Of Syllogistic Inferences Bertolazzi Leonardo, Gatt Albert, Bernardi Raffaella
- Evaluating AI For Law: Bridging The Gap With Open-source Solutions Bhambhoria Rohan, Dahan Samuel, Li Jonathan, Zhu Xiaodan
- On The Robustness Of Language Models For Tabular Question Answering Bhandari Kushal Raj, Xing Sixue, Dan Soham, Gao Jianxi
- Adaptive Token Biaser: Knowledge Editing Via Biasing Key Entities Bi Baolong, Liu Shenghua, Wang Yiwei, Mei Lingrui, Gao Hongcheng, Xu Yilong, Cheng Xueqi
- Digital Forgetting In Large Language Models: A Survey Of Unlearning Methods Blanco-justicia Alberto, Jebreel Najeeb, Manzanares Benet, Sánchez David, Domingo-ferrer Josep, Collell Guillem, Tan Kuan Eeik
- GRAM: Global Reasoning For Multi-page VQA Blau Tsachi, Fogel Sharon, Ronen Roi, Golts Alona, Ganz Roy, Avraham Elad Ben, Aberdam Aviad, Tsiper Shahar, Litman Ron
- Improving Language Models Trained On Translated Data With Continual Pre-training And Dictionary Learning Analysis Boughorbel Sabri, Parvez Md Rizwan, Hawasly Majd
- Enhancing Trust In Llms: Algorithms For Comparing And Interpreting Llms Brown Nik Bear
- Pangea: Procedural Artificial Narrative Using Generative AI For Turn-based Video Games Buongiorno Steph, Klinkert Lawrence Jake, Chawla Tanishq, Zhuang Zixin, Clark Corey
- Roguegpt: Dis-ethical Tuning Transforms Chatgpt4 Into A Rogue AI In 158 Words Buscemi Alessio, Proverbio Daniele
- XAI Meets Llms: A Survey Of The Relation Between Explainable AI And Large Language Models Cambria Erik, Malandri Lorenzo, Mercorio Fabio, Nobani Navid, Seveso Andrea
- Are Large Language Models Really Bias-free? Jailbreak Prompts For Assessing Adversarial Robustness To Bias Elicitation Cantini Riccardo, Cosenza Giada, Orsino Alessio, Talia Domenico
- Structeval: Deepen And Broaden Large Language Model Assessment Via Structured Evaluation Cao Boxi, Ren Mengjie, Lin Hongyu, Han Xianpei, Zhang Feng, Zhan Junfeng, Sun Le
- Verifiable Generation With Subsentence-level Fine-grained Citations Cao Shuyang, Wang Lu
- Graphinsight: Unlocking Insights In Large Language Models For Graph Structure Understanding Cao Yukun, Han Shuo, Gao Zengyi, Ding Zezhong, Xie Xike, Zhou S. Kevin
- A Review Of Multi-modal Large Language And Vision Models Carolan Kilian, Fennelly Laura, Smeaton Alan F.
- Tokenization Falling Short: The Curse Of Tokenization Chai Yekun, Fang Yewei, Peng Qiwei, Li Xuhong
- Investigating Llms As Voting Assistants Via Contextual Augmentation: A Case Study On The European Parliament Elections 2024 Chalkidis Ilias
- Integrating Emotional And Linguistic Models For Ethical Compliance In Large Language Models Chang Edward Y.
- When Parts Are Greater Than Sums: Individual LLM Components Can Outperform Full Models Chang Ting-yun, Thomason Jesse, Jia Robin
- Bias-aware Low-rank Adaptation: Mitigating Catastrophic Inheritance Of Large Language Models Chang Yupeng, Chang Yi, Wu Yuan
- Make Large Language Model A Better Ranker Chao Wenshuo, Zheng Zhi, Zhu Hengshu, Liu Hao
- Quantitative Certification Of Bias In Large Language Models Chaudhary Isha, Hu Qian, Kumar Manoj, Ziyadi Morteza, Gupta Rahul, Singh Gagandeep
- No Such Thing As A General Learner: Language Models And Their Dual Optimization Chemla Emmanuel, Nefdt Ryan M.
- Quantifying And Mitigating Unimodal Biases In Multimodal Large Language Models: A Causal Perspective Chen Meiqi, Cao Yixin, Zhang Yan, Lu Chaochao
- Editable Fairness: Fine-grained Bias Mitigation In Language Models Chen Ruizhe, Li Yichen, Yang Jianfei, Zhou Joey Tianyi, Liu Zuozhu
- The Sifo Benchmark: Investigating The Sequential Instruction Following Ability Of Large Language Models Chen Xinyi, Liao Baohao, Qi Jirui, Eustratiadis Panagiotis, Monz Christof, Bisazza Arianna, De Rijke Maarten
- Designing A Dashboard For Transparency And Control Of Conversational AI Chen Yida, Wu Aoyu, Depodesta Trevor, Yeh Catherine, Li Kenneth, Marin Nicholas Castillo, Patel Oam, Riecke Jan, Raval Shivam, Seow Olivia, Wattenberg Martin, Viégas Fernanda
- NACL: A General And Effective KV Cache Eviction Framework For Llms At Inference Time Chen Yilong, Wang Guoxia, Shang Junyuan, Cui Shiyao, Zhang Zhenyu, Liu Tingwen, Wang Shuohuan, Sun Yu, Yu Dianhai, Wu Hua
- Dia-llama: Towards Large Language Model-driven CT Report Generation Chen Zhixuan, Luo Luyang, Bie Yequan, Chen Hao
- Social Debiasing For Fair Multi-modal Llms Cheng Harry, Guo Yangyang, Guo Qingpei, Yang Ming, Gan Tian, Nie Liqiang
- Potential And Limitations Of Llms In Capturing Structured Semantics: A Case Study On SRL Cheng Ning, Yan Zhaohui, Wang Ziming, Li Zhijie, Yu Jiaming, Zheng Zilong, Tu Kewei, Xu Jinan, Han Wenjuan
- A Reality Check Of The Benefits Of LLM In Business Cheung Ming
- Revisiting Zero-shot Abstractive Summarization In The Era Of Large Language Models From The Perspective Of Position Bias Chhabra Anshuman, Askari Hadi, Mohapatra Prasant
- Large Language Models And User Trust: Consequence Of Self-referential Learning Loop And The Deskilling Of Healthcare Professionals Choudhury Avishek, Chaudhry Zaria
- A Causal Explainable Guardrails For Large Language Models Chu Zhixuan, Wang Yan, Li Longfei, Wang Zhibo, Qin Zhan, Ren Kui
- Informed AI Regulation: Comparing The Ethical Frameworks Of Leading LLM Chatbots Using An Ethics-based Audit To Assess Moral Reasoning And Normative Values Chun Jon, Elkins Katherine
- Open Source Conversational Llms Do Not Know Most Spanish Words Conde Javier, González Miguel, Melero Nina, Ferrando Raquel, Martínez Gonzalo, Merino-gómez Elena, Hernández José Alberto, Reviriego Pedro
- Ragsys: Item-cold-start Recommender As RAG System Contal Emile, Mcgoldrick Garrin
- Iscore: Visual Analytics For Interpreting How Language Models Automatically Score Summaries Coscia Adam, Holmes Langdon, Morris Wesley, Choi Joon Suh, Crossley Scott, Endert Alex
- Out-of-context Prompting Boosts Fairness And Robustness In Large Language Model Predictions Cotta Leonardo, Maddison Chris J.
- Faculty Perspectives On The Potential Of RAG In Computer Science Higher Education Dakshit Sagnik
- Under The Surface: Tracking The Artifactuality Of Llm-generated Data Das Debarati, De Langis Karin, Martin-boyle Anna, Kim Jaehyung, Lee Minhwa, Kim Zae Myung, Hayati Shirley Anugrah, Owan Risako, Hu Bin, Parkar Ritik, Koo Ryan, Park Jonginn, Tyagi Aahan, Ferland Libby, Roy Sanjali, Liu Vincent, Kang Dongyeop
- Low-rank Finetuning For Llms: A Fairness Perspective Das Saswat, Romanelli Marco, Tran Cuong, Reza Zarreen, Kailkhura Bhavya, Fioretto Ferdinando
- Aligning Model Evaluations With Human Preferences: Mitigating Token Count Bias In Language Model Assessments Daynauth Roland, Mars Jason
- Towards Transparency: Exploring LLM Trainings Datasets Through Visual Topic Modeling And Semantic Frame De Dampierre Charles, Mogoutov Andrei, Baumard Nicolas
- Will GPT-4 Run DOOM? De Wynter Adrian
- RTP-LX: Can Llms Evaluate Toxicity In Multilingual Scenarios? De Wynter Adrian, Watts Ishaan, Altıntoprak Nektar Ege, Wongsangaroonsri Tua, Zhang Minghui, Farra Noura, Baur Lena, Claudet Samantha, Gajdusek Pavel, Gören Can, Gu Qilong, Kaminska Anna, Kaminski Tomasz, Kuo Ruby, Kyuba Akiko, Lee Jongho, Mathur Kartik, Merok Petter, Milovanović Ivana, Paananen Nani, Paananen Vesa-matti, Pavlenko Anna, Vidal Bruno Pereira, Strika Luciano, Tsao Yueh, Turcato Davide, Vakhno Oleksandr, Velcsov Judit, Vickers Anna, Visser Stéphanie, Widarmanto Herdyan, Zaikin Andrey, Chen Si-qing
- T-FREE: Tokenizer-free Generative Llms Via Sparse Representations For Memory-efficient Embeddings Deiseroth Björn, Brack Manuel, Schramowski Patrick, Kersting Kristian, Weinbach Samuel
- Polyrating: A Cost-effective And Bias-aware Rating System For LLM Evaluation Dekoninck Jasper, Baader Maximilian, Vechev Martin
- Understanding Biases In Chatgpt-based Recommender Systems: Provider Fairness, Temporal Stability, And Recency Deldjoo Yashar
- Deconstructing The Ethics Of Large Language Models From Long-standing Issues To New-emerging Dilemmas Deng Chengyuan, Duan Yiqun, Jin Xin, Chang Heng, Tian Yijun, Liu Han, Zou Henry Peng, Jin Yiqiao, Xiao Yijia, Wang Yichen, Wu Shenghao, Xie Zongxing, Gao Kuofeng, He Sihong, Zhuang Jun, Cheng Lu, Wang Haohan
- Promises, Outlooks And Challenges Of Diffusion Language Modeling Deschenaux Justin, Gulcehre Caglar
- Unveiling Encoder-free Vision-language Models Diao Haiwen, Cui Yufeng, Li Xiaotong, Wang Yueze, Lu Huchuan, Wang Xinlong
- Hallu-pi: Evaluating Hallucination In Multi-modal Large Language Models Within Perturbed Inputs Ding Peng, Wu Jingyu, Kuang Jun, Ma Dan, Cao Xuezhi, Cai Xunliang, Chen Shi, Chen Jiajun, Huang Shujian
- Disclosure And Mitigation Of Gender Bias In Llms Dong Xiangjue, Wang Yibo, Yu Philip S., Caverlee James
- To What Extent Is Chatgpt Useful For Language Teacher Lesson Plan Creation? Dornburg Alex, Davin Kristin
- Unlocking Continual Learning Abilities In Language Models Du Wenyu, Cheng Shuang, Luo Tongxu, Qiu Zihan, Huang Zeyu, Cheung Ka Chun, Cheng Reynold, Fu Jie
- Prompting Multi-modal Tokens To Enhance End-to-end Autonomous Driving Imitation Learning With Llms Duan Yiqun, Zhang Qiang, Xu Renjing
- Length-controlled Alpacaeval: A Simple Way To Debias Automatic Evaluators Dubois Yann, Galambosi Balázs, Liang Percy, Hashimoto Tatsunori B.
- How To Think Step-by-step: A Mechanistic Understanding Of Chain-of-thought Reasoning Dutta Subhabrata, Singh Joykirat, Chakrabarti Soumen, Chakraborty Tanmoy
- AXOLOTL: Fairness Through Assisted Self-debiasing Of Large Language Model Outputs Ebrahimi Sana, Chen Kaiwen, Asudeh Abolfazl, Das Gautam, Koudas Nick
- Cognitive Bias In High-stakes Decision-making With Llms Echterhoff Jessica, Liu Yao, Alessa Abeer, Mcauley Julian, He Zexue
- Reducing Selection Bias In Large Language Models Eicher J. E., Irgolič R. F.
- Considers-the-human Evaluation Framework: Rethinking Human Evaluation For Generative Large Language Models Elangovan Aparna, Liu Ling, Xu Lei, Bodapati Sravan, Roth Dan
- Lightweight Large Language Model For Medication Enquiry: Med-pal Elangovan Kabilan, Ong Jasmine Chiat Ling, Jin Liyuan, Seng Benjamin Jun Jie, Kwan Yu Heng, Tan Lit Soo, Zhong Ryan Jian, Ma Justina Koi Li, Ke Yuhe, Liu Nan, Giacomini Kathleen M, Ting Daniel Shu Wei
- Coda: Constrained Generation Based Data Augmentation For Low-resource NLP Evuru Chandra Kiran Reddy, Ghosh Sreyan, Kumar Sonal, S Ramaneswaran, Tyagi Utkarsh, Manocha Dinesh
- Biasalert: A Plug-and-play Tool For Social Bias Detection In Llms Fan Zhiting, Chen Ruizhe, Xu Ruiling, Liu Zuozhu
- What Are Large Language Models Mapping To In The Brain? A Case Against Over-reliance On Brain Scores Feghhi Ebrahim, Hadidi Nima, Song Bryan, Blank Idan A., Kao Jonathan C.
- Towards Trustworthy AI: A Review Of Ethical And Robust Large Language Models Ferdaus Md Meftahul, Abdelguerfi Mahdi, Ioup Elias, Niles Kendall N., Pathak Ken, Sloan Steven
- Disperse-then-merge: Pushing The Limits Of Instruction Tuning Via Alignment Tax Reduction Fu Tingchen, Cai Deng, Liu Lemao, Shi Shuming, Yan Rui
- Thinking Fair And Slow: On The Efficacy Of Structured Prompts For Debiasing Language Models Furniturewala Shaz, Jandial Surgan, Java Abhinav, Banerjee Pragyan, Shahid Simra, Bhatia Sumit, Jaidka Kokil
- RAVE: Residual Vector Embedding For Clip-guided Backlit Image Enhancement Gaintseva Tatiana, Benning Martin, Slabaugh Gregory
- Open Generative Large Language Models For Galician Gamallo Pablo, Rodríguez Pablo, De-dios-flores Iria, Sotelo Susana, Paniagua Silvia, Bardanca Daniel, Pichel José Ramom, Garcia Marcos
- Lumina-t2x: Transforming Text Into Any Modality, Resolution, And Duration Via Flow-based Large Diffusion Transformers Gao Peng, Zhuo Le, Liu Dongyang, Du Ruoyi, Luo Xu, Qiu Longtian, Zhang Yuhang, Lin Chen, Huang Rongjie, Geng Shijie, Zhang Renrui, Xi Junlin, Shao Wenqi, Jiang Zhengkai, Yang Tianshuo, Ye Weicai, Tong He, He Jingwen, Qiao Yu, Li Hongsheng
- Robust Pronoun Fidelity With English Llms: Are They Reasoning, Repeating, Or Just Biased? Gautam Vagrant, Bingert Eileen, Zhu Dawei, Lauscher Anne, Klakow Dietrich
- Sketch-guided Constrained Decoding For Boosting Blackbox Large Language Models Without Logit Access Geng Saibo, Döner Berkay, Wendler Chris, Josifoski Martin, West Robert
- Beyond Self-learned Attention: Mitigating Attention Bias In Transformer-based Models Using Attention Guidance Gesi Jiri, Ahmed Iftekhar
- Explicit And Implicit Large Language Model Personas Generate Opinions But Fail To Replicate Deeper Perceptions And Biases Giorgi Salvatore, Liu Tingting, Aich Ankit, Isman Kelsey, Sherman Garrick, Fried Zachary, Sedoc João, Ungar Lyle H., Curtis Brenda
- Tokenshap: Interpreting Large Language Models With Monte Carlo Shapley Value Estimation Goldshmidt Roni, Horovicz Miriam
- Mechanistic Interpretability Of Large Language Models With Applications To The Financial Services Industry Golgoon Ashkan, Filom Khashayar, Kannan Arjun Ravi
- COBIAS: Contextual Reliability In Bias Assessment Govil Priyanshul, Jain Hemang, Bonagiri Vamshi Krishna, Chadha Aman, Kumaraguru Ponnurangam, Gaur Manas, Dey Sanorita
- Llmguard: Guarding Against Unsafe LLM Behavior Goyal Shubh, Hira Medha, Mishra Shubham, Goyal Sukriti, Goel Arnav, Dadu Niharika, Db Kirushikesh, Mehta Sameep, Madaan Nishtha
- Fine-tuned Language Models Generate Stable Inorganic Materials As Text Gruver Nate, Sriram Anuroop, Madotto Andrea, Wilson Andrew Gordon, Zitnick C. Lawrence, Ulissi Zachary
- Enhancing Collaborative Semantics Of Language Model-driven Recommendations Via Graph-aware Learning Guan Zhong, Wu Likang, Zhao Hongke, He Ming, Fan Jianpin
- Serial Position Effects Of Large Language Models Guo Xiaobo, Vosoughi Soroush
- AAVENUE: Detecting LLM Biases On NLU Tasks In AAVE Via A Novel Benchmark Gupta Abhay, Meng Philip, Yurtseven Ece, O'brien Sean, Zhu Kevin
- METAL: Towards Multilingual Meta-evaluation Hada Rishav, Gumma Varun, Ahmed Mohamed, Bali Kalika, Sitaram Sunayana
- Chatgpt Based Data Augmentation For Improved Parameter-efficient Debiasing Of Llms Han Pengrui, Kocielnik Rafal, Saravanan Adhithya, Jiang Roy, Sharir Or, Anandkumar Anima
- Exploring Chatgpt And Its Impact On Society Haque Md. Asraful, Li Shuai
- A Comparison Of Large Language Model And Human Performance On Random Number Generation Tasks Harrison Rachel M.
- Concerns On Bias In Large Language Models When Creating Synthetic Personae Haxvig Helena A.
- Large Language Models Are Biased Reinforcement Learners Hayes William M., Yax Nicolas, Palminteri Stefano
- Relative Value Biases In Large Language Models Hayes William M., Yax Nicolas, Palminteri Stefano
- Cos: Enhancing Personalization And Mitigating Bias With Context Steering He Jerry Zhi-yang, Pandey Sashrika, Schrum Mariah L., Dragan Anca
- Prompt-based Bias Calibration For Better Zero/few-shot Learning Of Language Models He Kang, Long Yinghan, Roy Kaushik
- BP4ER: Bootstrap Prompting For Explicit Reasoning In Medical Dialogue Generation He Yuhong, Zhang Yongqi, He Shizhu, Wan Jun
- Intent-conditioned And Non-toxic Counterspeech Generation Using Multi-task Instruction Tuning With RLAIF Hengle Amey, Kumar Aswini, Singh Sahajpreet, Bandhakavi Anil, Akhtar Md Shad, Chakroborty Tanmoy
- Eliciting Personality Traits In Large Language Models Hilliard Airlie, Munoz Cristian, Wu Zekun, Koshiyama Adriano Soares
- Why Do Llava Vision-language Models Reply To Images In English? Hinck Musashi, Holtermann Carolin, Olson Matthew Lyle, Schneider Florian, Yu Sungduk, Bhiwandiwalla Anahita, Lauscher Anne, Tseng Shaoyen, Lal Vasudev
- From Descriptive Richness To Bias: Unveiling The Dark Side Of Generative Image Caption Enrichment Hirota Yusuke, Hachiuma Ryo, Yang Chao-han Huck, Nakashima Yuta
- Non-linear Inference Time Intervention: Improving LLM Truthfulness Hoscilowicz Jakub, Wiacek Adam, Chojnacki Jan, Cieslak Adam, Michon Leszek, Urbanevych Vitalii, Janicki Artur
- Uncovering Bias In Large Vision-language Models At Scale With Counterfactuals Howard Phillip, Fraser Kathleen C., Bhiwandiwalla Anahita, Kiritchenko Svetlana
- Found In The Middle: Calibrating Positional Attention Bias Improves Long Context Utilization Hsieh Cheng-yu, Chuang Yung-sung, Li Chun-liang, Wang Zifeng, Le Long T., Kumar Abhishek, Glass James, Ratner Alexander, Lee Chen-yu, Krishna Ranjay, Pfister Tomas
- Language Models Align With Human Judgments On Key Grammatical Constructions Hu Jennifer, Mahowald Kyle, Lupyan Gary, Ivanova Anna, Levy Roger
- Rethinking Llm-based Preference Evaluation Hu Zhengyu, Song Linxin, Zhang Jieyu, Xiao Zheyuan, Wang Jingang, Chen Zhenyu, Xiong Hui
- Applying And Evaluating Large Language Models In Mental Health Care: A Scoping Review Of Human-assessed Generative Tasks Hua Yining, Na Hongbin, Li Zehan, Liu Fenglin, Fang Xiao, Clifton David, Torous John
- Chatgpt Rates Natural Language Explanation Quality Like Humans: But On Which Scales? Huang Fan, Kwak Haewoon, Park Kunwoo, An Jisun
- Mmevalpro: Calibrating Multimodal Benchmarks Towards Trustworthy And Efficient Evaluation Huang Jinsheng, Chen Liang, Guo Taian, Zeng Fu, Zhao Yusheng, Wu Bohan, Yuan Ye, Zhao Haozhe, Guo Zhihui, Zhang Yichi, Yuan Jingyang, Ju Wei, Liu Luchen, Liu Tianyu, Chang Baobao, Zhang Ming
- Visual Hallucinations Of Multi-modal Large Language Models Huang Wen, Liu Hongbin, Guo Minxin, Gong Neil Zhenqiang
- Trustllm: Trustworthiness In Large Language Models Huang Yue, Sun Lichao, Wang Haoran, Wu Siyuan, Zhang Qihui, Li Yuan, Gao Chujie, Huang Yixin, Lyu Wenhan, Zhang Yixuan, Li Xiner, Liu Zhengliang, Liu Yixin, Wang Yijue, Zhang Zhikun, Vidgen Bertie, Kailkhura Bhavya, Xiong Caiming, Xiao Chaowei, Li Chunyuan, Xing Eric, Huang Furong, Liu Hao, Ji Heng, Wang Hongyi, Zhang Huan, Yao Huaxiu, Kellis Manolis, Zitnik Marinka, Jiang Meng, Bansal Mohit, Zou James, Pei Jian, Liu Jian, Gao Jianfeng, Han Jiawei, Zhao Jieyu, Tang Jiliang, Wang Jindong, Vanschoren Joaquin, Mitchell John, Shu Kai, Xu Kaidi, Chang Kai-wei, He Lifang, Huang Lifu, Backes Michael, Gong Neil Zhenqiang, Yu Philip S., Chen Pin-yu, Gu Quanquan, Xu Ran, Ying Rex, Ji Shuiwang, Jana Suman, Chen Tianlong, Liu Tianming, Zhou Tianyi, Wang William, Li Xiang, Zhang Xiangliang, Wang Xiao, Xie Xing, Chen Xun, Wang Xuyu, Liu Yan, Ye Yanfang, Cao Yinzhi, Chen Yong, Zhao Yue
- Breaking The Language Barrier: Can Direct Inference Outperform Pre-translation In Multilingual LLM Applications? Intrator Yotam, Halfon Matan, Goldenberg Roman, Tsarfaty Reut, Eyal Matan, Rivlin Ehud, Matias Yossi, Aizenberg Natalia
- Grade Score: Quantifying LLM Performance In Option Selection Iourovitski Dmitri
- Are Large Vision Language Models Up To The Challenge Of Chart Comprehension And Reasoning? An Extensive Investigation Into The Capabilities And Limitations Of Lvlms Islam Mohammed Saidul, Rahman Raian, Masry Ahmed, Laskar Md Tahmid Rahman, Nayeem Mir Tafseer, Hoque Enamul
- Prepair: Pointwise Reasoning Enhance Pairwise Evaluating For Robust Instruction-following Assessments Jeong Hawon, Park Chaehun, Hong Jimin, Choo Jaegul
- Linking In-context Learning In Transformers To Human Episodic Memory Ji-an Li, Zhou Corey Y., Benna Marcus K., Mattar Marcelo G.
- A Peek Into Token Bias: Large Language Models Are Not Yet Genuine Reasoners Jiang Bowen, Xie Yangxinyu, Hao Zhuoqun, Wang Xiaomeng, Mallick Tanwi, Su Weijie J., Taylor Camillo J., Roth Dan
- The Real, The Better: Aligning Large Language Models With Online Human Behaviors Jiang Guanying, Yan Lingyong, Shi Haibo, Yin Dawei
- Raising The Bar: Investigating The Values Of Large Language Models Via Generative Evolving Testing Jiang Han, Yi Xiaoyuan, Wei Zhihua, Wang Shu, Xie Xing
- Navigating LLM Ethics: Advancements, Challenges, And Future Directions Jiao Junfeng, Afroogh Saleh, Xu Yiming, Phillips Connor
- Birbal: An Efficient 7B Instruct-model Fine-tuned With Curated Datasets Jindal Ashvini Kumar, Rajpoot Pawan Kumar, Parikh Ankur
- A2SF: Accumulative Attention Scoring With Forgetting Factor For Token Pruning In Transformer Decoder Jo Hyun-rae, Shin Dongkun
- Beyond Labels: Aligning Large Language Models With Human-like Reasoning Kabir Muhammad Rafsan, Sultan Rafeed Mohammad, Asif Ihsanul Haque, Ahad Jawad Ibn, Rahman Fuad, Amin Mohammad Ruhul, Mohammed Nabeel, Rahman Shafin
- Evaluating The Impact Of Advanced LLM Techniques On Ai-lecture Tutors For A Robotics Course Kahl Sebastian, Löffler Felix, Maciol Martin, Ridder Fabian, Schmitz Marius, Spanagel Jennifer, Wienkamp Jens, Burgahn Christopher, Schilling Malte
- Eagle: Ethical Dataset Given From Real Interactions Kaneko Masahiro, Bollegala Danushka, Baldwin Timothy
- Evaluating Gender Bias In Large Language Models Via Chain-of-thought Prompting Kaneko Masahiro, Bollegala Danushka, Okazaki Naoaki, Baldwin Timothy
- Can Large Language Models Be Good Emotional Supporter? Mitigating Preference Bias On Emotional Support Conversation Kang Dongjin, Kim Sunghwan, Kwon Taeyoon, Moon Seungjun, Cho Hyunsouk, Yu Youngjae, Lee Dongha, Yeo Jinyoung
- MEDIC: Towards A Comprehensive Framework For Evaluating Llms In Clinical Applications Kanithi Praveen K, Christophe Clément, Pimentel Marco Af, Raha Tathagata, Saadi Nada, Javed Hamza, Maslenkova Svetlana, Hayat Nasir, Rajan Ronnie, Khan Shadab
- BRAVE: Broadening The Visual Encoding Of Vision-language Models Kar Oğuzhan Fatih, Tonioni Alessio, Poklukar Petra, Kulshrestha Achin, Zamir Amir, Tombari Federico
- Codeaid: Evaluating A Classroom Deployment Of An Llm-based Programming Assistant That Balances Student And Educator Needs Kazemitabaar Majeed, Ye Runlong, Wang Xiaoning, Henley Austin Z., Denny Paul, Craig Michelle, Grossman Tovi
- Exploring The Frontiers Of Llms In Psychological Applications: A Comprehensive Review Ke Luoma, Tong Song, Cheng Peng, Peng Kaiping
- Grounding And Evaluation For Large Language Models: Practical Challenges And Lessons Learned (survey) Kenthapadi Krishnaram, Sameki Mehrnoosh, Taly Ankur
- Source-aware Training Enables Knowledge Attribution In Language Models Khalifa Muhammad, Wadden David, Strubell Emma, Lee Honglak, Wang Lu, Beltagy Iz, Peng Hao
- Why And When Llm-based Assistants Can Go Wrong: Investigating The Effectiveness Of Prompt-based Interactions For Software Help-seeking Khurana Anjali, Subramonyam Hari, Chilana Parmit K
- A Mechanistic Interpretation Of Syllogistic Reasoning In Auto-regressive Language Models Kim Geonhee, Valentino Marco, Freitas André
- Promptkd: Distilling Student-friendly Knowledge For Generative Language Models Via Prompt Tuning Kim Gyeongman, Jang Doohyuk, Yang Eunho
- Persona Is A Double-edged Sword: Enhancing The Zero-shot Reasoning By Ensembling The Role-playing And Neutral Prompts Kim Junseok, Yang Nakyeong, Jung Kyomin
- Prometheus 2: An Open Source Language Model Specialized In Evaluating Other Language Models Kim Seungone, Suk Juyoung, Longpre Shayne, Lin Bill Yuchen, Shin Jamin, Welleck Sean, Neubig Graham, Lee Moontae, Lee Kyungjae, Seo Minjoon
- Unveiling Divergent Inductive Biases Of Llms On Temporal Data Kishore Sindhu, He Hangfeng
- From Bytes To Borsch: Fine-tuning Gemma And Mistral For The Ukrainian Language Representation Kiulian Artur, Polishko Anton, Khandoga Mykola, Chubych Oryna, Connor Jack, Ravishankar Raghav, Shirawalmath Adarsh
- Studying Large Language Model Behaviors Under Realistic Knowledge Conflicts Kortukov Evgenii, Rubinstein Alexander, Nguyen Elisa, Oh Seong Joon
- Open Source Language Models Can Provide Feedback: Evaluating Llms' Ability To Help Students Using Gpt-4-as-a-judge Koutcheme Charles, Dainese Nicola, Sarsa Sami, Hellas Arto, Leinonen Juho, Denny Paul
- Subtle Biases Need Subtler Measures: Dual Metrics For Evaluating Representative And Affinity Bias In Large Language Models Kumar Abhishek, Yunusov Sarfaroz, Emami Ali
- SEC-QA: A Systematic Evaluation Corpus For Financial QA Lai Viet Dac, Krumdick Michael, Lovering Charles, Reddy Varshini, Schmidt Craig, Tanner Chris
- Cat-bench: Benchmarking Language Model Understanding Of Causal And Temporal Dependencies In Plans Lal Yash Kumar, Cohen Vanya, Chambers Nathanael, Balasubramanian Niranjan, Mooney Raymond
- IBD: Alleviating Hallucinations In Large Vision-language Models Via Image-biased Decoding Lanyun Zhu, Deyi Ji, Tianrun Chen, Peng Xu, Jieping Ye, Jun Liu
- Frontier AI Ethics: Anticipating And Evaluating The Societal Impacts Of Generative Agents Lazar Seth
- Sentiment Reasoning For Healthcare Le-duc Khai, Nguyen Khai-nguyen, Tat Bach Phan, Le Duy, Ngo Jerry, Vo-dang Long, Nguyen Anh Totti, Hy Truong-son
- Language Models Show Stable Value Orientations Across Diverse Role-plays Lee Bruce W., Lee Yeongheon, Cho Hyunsoo
- The Life Cycle Of Large Language Models: A Review Of Biases In Education Lee Jinsook, Hicke Yann, Yu Renzhe, Brooks Christopher, Kizilcec René F.
- Probability Of Differentiation Reveals Brittleness Of Homogeneity Bias In Large Language Models Lee Messi H. J., Lai Calvin K.
- Chatgpt As Research Scientist: Probing Gpt's Capabilities As A Research Librarian, Research Ethicist, Data Generator And Data Predictor Lehr Steven A., Caliskan Aylin, Liyanage Suneragiri, Banaji Mahzarin R.
- Small-e: Small Language Model With Linear Attention For Efficient Speech Synthesis Lemerle Théodor, Obin Nicolas, Roebel Axel
- More RLHF, More Trust? On The Impact Of Human Preference Alignment On Language Model Trustworthiness Li Aaron J., Krishna Satyapriya, Lakkaraju Himabindu
- Steering Llms Towards Unbiased Responses: A Causality-guided Debiasing Framework Li Jingling, Tang Zeyu, Liu Xiaoyu, Spirtes Peter, Zhang Kun, Leqi Liu, Liu Yang
- Stylechat: Learning Recitation-augmented Memory In Llms For Stylized Dialogue Generation Li Jinpeng, Zhang Zekai, Tu Quan, Cheng Xin, Zhao Dongyan, Yan Rui
- Bridging The Language Gap: Enhancing Multilingual Prompt-based Code Generation In Llms Via Zero-shot Cross-lingual Transfer Li Mingda, Mishra Abhijit, Mujumdar Utkarsh
- Red Teaming Visual Language Models Li Mukai, Li Lei, Yin Yuwei, Ahmed Masood, Liu Zhenguang, Liu Qi
- From Text To Insight: Leveraging Large Language Models For Performance Evaluation In Management Li Ning, Zhou Huaikang, Xu Mingze
- Anchored Answers: Unravelling Positional Bias In Gpt-2's Multiple-choice Questions Li Ruizhe, Gao Yanjun
- Long-context Llms Struggle With Long In-context Learning Li Tianle, Zhang Ge, Do Quy Duc, Yue Xiang, Chen Wenhu
- Stbench: Assessing The Ability Of Large Language Models In Spatio-temporal Analysis Li Wenbin, Yao Di, Zhao Ruibo, Chen Wenjie, Xu Zijie, Luo Chengxue, Gong Chang, Jing Quanliang, Tan Haining, Bi Jingping
- Uio-llms: Unbiased Incremental Optimization For Long-context Llms Li Wenhao, Lin Mingbao, Zhong Yunshan, Yan Shuicheng, Ji Rongrong
- Rethinking Chatgpt's Success: Usability And Cognitive Behaviors Enabled By Auto-regressive Llms' Prompting Li Xinzhe, Liu Ming
- Cif-bench: A Chinese Instruction-following Benchmark For Evaluating The Generalizability Of Large Language Models Li Yizhi, Zhang Ge, Qu Xingwei, Li Jiali, Li Zhaoqun, Wang Zekun, Li Hao, Yuan Ruibin, Ma Yinghao, Zhang Kai, Zhou Wangchunshu, Liang Yiming, Zhang Lei, Ma Lei, Zhang Jiajun, Li Zuowen, Huang Stephen W., Lin Chenghua, Fu Jie
- Uni-moe: Scaling Unified Multimodal Llms With Mixture Of Experts Li Yunxin, Jiang Shenyuan, Hu Baotian, Wang Longyue, Zhong Wanqi, Luo Wenhan, Ma Lin, Zhang Min
- Do Vision-language Transformers Exhibit Visual Commonsense? An Empirical Study Of VCR Li Zhenyang, Guo Yangyang, Wang Kejie, Chen Xiaolin, Nie Liqiang, Kankanhalli Mohan
- Mitigating Hallucination In Visual-language Models Via Re-balancing Contrastive Decoding Liang Xiaoyu, Yu Jiayuan, Mu Lianrui, Zhuang Jiedong, Hu Jiaqi, Yang Yuchen, Ye Jiangnan, Lu Lu, Chen Jian, Hu Haoji
- Large Language Models As Recommender Systems: A Study Of Popularity Bias Lichtenberg Jan Malte, Buchholz Alexander, Schwöbel Pola
- Wildbench: Benchmarking Llms With Challenging Tasks From Real Users In The Wild Lin Bill Yuchen, Deng Yuntian, Chandu Khyathi, Brahman Faeze, Ravichander Abhilasha, Pyatkin Valentina, Dziri Nouha, Bras Ronan Le, Choi Yejin
- Single Character Perturbations Break LLM Alignment Lin Leon, Brown Hannah, Kawaguchi Kenji, Shieh Michael
- Optimizing Language Models For Human Preferences Is A Causal Inference Problem Lin Victoria, Ben-michael Eli, Morency Louis-philippe
- Openeval: Benchmarking Chinese Llms Across Capability, Alignment And Safety Liu Chuang, Yu Linhao, Li Jiaxuan, Jin Renren, Huang Yufei, Shi Ling, Zhang Junhui, Ji Xinmeng, Cui Tingting, Liu Tao, Song Jinwang, Zan Hongying, Li Sun, Xiong Deyi
- Self-reflection Outcome Is Sensitive To Prompt Construction Liu Fengyuan, Aldahoul Nouar, Eady Gregory, Zaki Yasir, Alshebli Bedoor, Rahwan Talal
- On The Intrinsic Self-correction Capability Of Llms: Uncertainty And Latent Concept Liu Guangliang, Mao Haitao, Cao Bochuan, Xue Zhiyu, Johnson Kristen, Tang Jiliang, Wang Rongrong
- Automatic Generation Of Model And Data Cards: A Step Towards Responsible AI Liu Jiarui, Li Wenkai, Jin Zhijing, Diab Mona
- The Generation Gap: Exploring Age Bias In The Underlying Value Systems Of Large Language Models Liu Siyang, Maturi Trish, Yi Bowen, Shen Siqi, Mihalcea Rada
- Explicit Inductive Inference Using Large Language Models Liu Tianyang, Li Tianyi, Cheng Liang, Steedman Mark
- Making Them Ask And Answer: Jailbreaking Large Language Models In Few Queries Via Disguise And Reconstruction Liu Tong, Zhang Yingjie, Zhao Zhe, Dong Yinpeng, Meng Guozhu, Chen Kai
- An Empirical Analysis On Large Language Models In Debate Evaluation Liu Xinyi, Liu Pinxin, He Hangfeng
- Arondight: Red Teaming Large Vision Language Models With Auto-generated Multi-modal Jailbreak Prompts Liu Yi, Cai Chengjun, Zhang Xiaoli, Yuan Xingliang, Wang Cong
- Aligning With Human Judgement: The Role Of Pairwise Preference In Large Language Model Evaluators Liu Yinhong, Zhou Han, Guo Zhijiang, Shareghi Ehsan, Vulić Ivan, Korhonen Anna, Collier Nigel
- SAFETY-J: Evaluating Safety With Critique Liu Yixiu, Zheng Yuxiang, Xia Shijie, Li Jiajun, Tu Yi, Song Chaoling, Liu Pengfei
- Hd-eval: Aligning Large Language Model Evaluators Through Hierarchical Criteria Decomposition Liu Yuxuan, Yang Tianchi, Huang Shaohan, Zhang Zihan, Huang Haizhen, Wei Furu, Deng Weiwei, Sun Feng, Zhang Qi
- Llms Are Biased Towards Output Formats! Systematically Evaluating And Mitigating Output Format Bias Of Llms Long Do Xuan, Ngoc Hai Nguyen, Sim Tiviatis, Dao Hieu, Joty Shafiq, Kawaguchi Kenji, Chen Nancy F., Kan Min-yen
- From GPT-4 To Gemini And Beyond: Assessing The Landscape Of Mllms On Generalizability, Trustworthiness And Causality Through Four Modalities Lu Chaochao, Qian Chen, Zheng Guodong, Fan Hongxing, Gao Hongzhi, Zhang Jie, Shao Jing, Deng Jingyi, Fu Jinlan, Huang Kexin, Li Kunchang, Li Lijun, Wang Limin, Sheng Lu, Chen Meiqi, Zhang Ming, Ren Qibing, Chen Sirui, Gui Tao, Ouyang Wanli, Wang Yali, Teng Yan, Wang Yaru, Wang Yi, He Yinan, Wang Yingchun, Wang Yixu, Zhang Yongting, Qiao Yu, Shen Yujiong, Mou Yurong, Chen Yuxi, Zhang Zaibin, Shi Zhelun, Yin Zhenfei, Wang Zhipin
- Investigating Bias Representations In Llama 2 Chat Via Activation Steering Lu Dawn, Rimsky Nina
- Towards Transfer Unlearning: Empirical Evidence Of Cross-domain Bias Mitigation Lu Huimin, Isonuma Masaru, Mori Junichiro, Sakata Ichiro
- Revisiting Multi-modal LLM Evaluation Lu Jian, Srivastava Shikhar, Chen Junyu, Shrestha Robik, Acharya Manoj, Kafle Kushal, Kanan Christopher
- Datasculpt: Crafting Data Landscapes For LLM Post-training Through Multi-objective Partitioning Lu Keer, Liang Zheng, Nie Xiaonan, Pan Da, Zhang Shusen, Zhao Keshi, Chen Weipeng, Zhou Zenan, Dong Guosheng, Zhang Wentao, Cui Bin
- Insights Into LLM Long-context Failures: When Transformers Know But Don't Tell Lu Taiming, Gao Muhan, Yu Kuai, Byerly Adam, Khashabi Daniel
- From Understanding To Utilization: A Survey On Explainability For Large Language Models Luo Haoyan, Specia Lucia
- Ensuring Safe And High-quality Outputs: A Guideline Library Approach For Language Models Luo Yi, Lin Zhenghao, Zhang Yuhao, Sun Jiashuo, Lin Chen, Xu Chengjin, Su Xiangdong, Shen Yelong, Guo Jian, Gong Yeyun
- Realistic Evaluation Of Toxicity In Large Language Models Luong Tinh Son, Le Thanh-thien, Van Linh Ngo, Nguyen Thien Huu
- Are Large Language Models Good Prompt Optimizers? Ma Ruotian, Wang Xiaolei, Zhou Xin, Li Jian, Du Nan, Gui Tao, Zhang Qi, Huang Xuanjing
- LLM In-context Recall Is Prompt Dependent Machlab Daniel, Battle Rick
- Venn Diagram Prompting: Accelerating Comprehension With Scaffolding Effect Mahendru Sakshi, Pandit Tejul
- Too Late To Train, Too Early To Use? A Study On Necessity And Viability Of Low-resource Bengali Llms Mahfuz Tamzeed, Dey Satak Kumar, Naswan Ruwad, Adil Hasnaen, Sayeed Khondker Salman, Shahgir Haz Sameen
- Mitigating Hallucinations In Large Vision-language Models (lvlms) Via Language-contrastive Decoding (LCD) Manevich Avshalom, Tsarfaty Reut
- Generative Artificial Intelligence In Higher Education: Evidence From An Analysis Of Institutional Policies And Guidelines Mcdonald Nora, Johri Aditya, Ali Areej, Hingle Aayushi
- Inadequacies Of Large Language Model Benchmarks In The Era Of Generative Artificial Intelligence Mcintosh Timothy R., Susnjak Teo, Liu Tong, Watters Paul, Halgamuge Malka N.
- Openelm: An Efficient Language Model Family With Open Training And Inference Framework Mehta Sachin, Sekhavat Mohammad Hossein, Cao Qingqing, Horton Maxwell, Jin Yanzi, Sun Chenfan, Mirzadeh Iman, Najibi Mahyar, Belenko Dmitry, Zatloukal Peter, Rastegari Mohammad
- Phybench: A Physical Commonsense Benchmark For Evaluating Text-to-image Models Meng Fanqing, Shao Wenqi, Luo Lixin, Wang Yahong, Chen Yiran, Lu Quanfeng, Yang Yue, Yang Tianshuo, Zhang Kaipeng, Qiao Yu, Luo Ping
- Anthropocentric Bias And The Possibility Of Artificial Cognition Millière Raphaël, Rathkopf Charles
- Multimodal Contrastive In-context Learning Miyanishi Yosuke, Nguyen Minh Le
- Do Large Language Models Exhibit Cognitive Dissonance? Studying The Difference Between Revealed Beliefs And Stated Answers Mondal Manuel, Dolamic Ljiljana, Bovet Gérôme, Cudré-mauroux Philippe, Audiffren Julien
- Langbite: A Platform For Testing Bias In Large Language Models Morales Sergio, Clarisó Robert, Cabot Jordi
- Open-llm-leaderboard: From Multi-choice To Open-style Questions For Llms Evaluation, Benchmark, And Arena Myrzakhan Aidar, Bsharat Sondos Mahmoud, Shen Zhiqiang
- Benchmarking Llama2, Mistral, Gemma And GPT For Factuality, Toxicity, Bias And Propensity For Hallucinations Nadeau David, Kroutikov Mike, Mcneil Karen, Baribeau Simon
- Negation Blindness In Large Language Models: Unveiling The NO Syndrome In Image Generation Nadeem Mohammad, Sohail Shahab Saquib, Cambria Erik, Schuller Björn W., Hussain Amir
- Apprentices To Research Assistants: Advancing Research With Large Language Models Namvarpour M., Razi A.
- MBBQ: A Dataset For Cross-lingual Comparison Of Stereotypes In Generative Llms Neplenbroek Vera, Bisazza Arianna, Fernández Raquel
- Educational Personalized Learning Path Planning With Large Language Models Ng Chee, Fung Yuen
- Examining The Influence Of Political Bias On Large Language Model Performance In Stance Classification Ng Lynnette Hui Xian, Cruickshank Iain, Lee Roy Ka-wei
- Llms For Generating And Evaluating Counterfactuals: A Comprehensive Study Nguyen Van Bach, Youssef Paul, Schlötterer Jörg, Seifert Christin
- Mixeval: Deriving Wisdom Of The Crowd From LLM Benchmark Mixtures Ni Jinjie, Xue Fuzhao, Yue Xiang, Deng Yuntian, Shah Mahir, Jain Kabir, Neubig Graham, You Yang
- Do Multilingual Large Language Models Mitigate Stereotype Bias? Nie Shangrui, Fromm Michael, Welch Charles, Görge Rebekka, Karimi Akbar, Plepi Joan, Mowmita Nazia Afsan, Flores-herr Nicolas, Ali Mehdi, Flek Lucie
- Large Language Models And Cognitive Science: A Comprehensive Review Of Similarities, Differences, And Challenges Niu Qian, Liu Junyu, Bi Ziqian, Feng Pohsun, Peng Benji, Chen Keyu, Li Ming
- Beyond Metrics: Evaluating Llms' Effectiveness In Culturally Nuanced, Low-resource Real-world Scenarios Ochieng Millicent, Gumma Varun, Sitaram Sunayana, Wang Jindong, Chaudhary Vishrav, Ronen Keshet, Bali Kalika, O'neill Jacki
- Likelihood-based Mitigation Of Evaluation Bias In Large Language Models Ohi Masanari, Kaneko Masahiro, Koike Ryuto, Loem Mengsay, Okazaki Naoaki
- Simplifying Translations For Children: Iterative Simplification Considering Age Of Acquisition With Llms Oshika Masashi, Morishita Makoto, Hirao Tsutomu, Sasano Ryohei, Takeda Koichi
- Towards Cross-lingual Explanation Of Artwork In Large-scale Vision Language Models Ozaki Shintaro, Hayashi Kazuki, Sakai Yusuke, Kamigaito Hidetaka, Hayashi Katsuhiko, Watanabe Taro
- POMP: Probability-driven Meta-graph Prompter For Llms In Low-resource Unsupervised Neural Machine Translation Pan Shilong, Tian Zhiliang, Ding Liang, Huang Zhen, Wen Zhihua, Li Dongsheng
- LLM Evaluators Recognize And Favor Their Own Generations Panickssery Arjun, Bowman Samuel R., Feng Shi
- Pragmatic Competence Evaluation Of Large Language Models For Korean Park Dojun, Lee Jiwoo, Jeong Hyeyun, Park Seohyun, Lee Sungeun
- Building Trust In Mental Health Chatbots: Safety Metrics And Llm-based Evaluation Tools Park Jung In, Abbasian Mahyar, Azimi Iman, Bounds Dawn, Jun Angela, Han Jaesu, Mccarron Robert, Borelli Jessica, Li Jia, Mahmoudi Mona, Wiedenhoeft Carmen, Rahmani Amir
- Offsetbias: Leveraging Debiased Data For Tuning Evaluators Park Junsoo, Jwa Seungyeon, Ren Meiying, Kim Daeyoung, Choi Sanghyuk
- Disentangling Length From Quality In Direct Preference Optimization Park Ryan, Rafailov Rafael, Ermon Stefano, Finn Chelsea
- Automating Code Adaptation For Mlops -- A Benchmarking Study On Llms Patel Harsh, Ramanan Buvaneswari A., Khan Manzoor A., Williams Thomas, Friedman Brian, Drabeck Lawrence
- Generative AI As A Metacognitive Agent: A Comparative Mixed-method Study With Human Participants On Icf-mimicking Exam Performance Pavlovic Jelena, Krstic Jugoslav, Mitrovic Luka, Babic Djordje, Milosavljevic Adrijana, Nikolic Milena, Karaklic Tijana, Mitrovic Tijana
- The Effectiveness Of Llms As Annotators: A Comparative Overview And Empirical Analysis Of Direct Representation Pavlovic Maja, Poesio Massimo
- Securing Large Language Models: Addressing Bias, Misinformation, And Prompt Attacks Peng Benji, Chen Keyu, Li Ming, Feng Pohsun, Bi Ziqian, Liu Junyu, Niu Qian
- A Review Of The Challenges With Massive Web-mined Corpora Used In Large Language Models Pre-training Perełkiewicz Michał, Poświata Rafał
- Enhancing Gender-inclusive Machine Translation With Neomorphemes And Large Language Models Piergentili Andrea, Savoldi Beatrice, Negri Matteo, Bentivogli Luisa
- LM-PUB-QUIZ: A Comprehensive Framework For Zero-shot Evaluation Of Relational Knowledge In Language Models Ploner Max, Wiland Jacek, Pohl Sebastian, Akbik Alan
- Advanced Natural-based Interaction For The Italian Language: Llamantino-3-anita Polignano Marco, Basile Pierpaolo, Semeraro Giovanni
- LLM Targeted Underperformance Disproportionately Impacts Vulnerable Users Poole-dayan Elinor, Roy Deb, Kabbara Jad
- Interpreting Bias In Large Language Models: A Feature-based Approach Prakash Nirmalendu, Roy Lee Ka Wei
- The Silent Curriculum: How Does LLM Monoculture Shape Educational Content And Its Accessibility? Priyanshu Aman, Vijay Supriti
- When Quantization Affects Confidence Of Large Language Models? Proskurina Irina, Brun Luc, Metzler Guillaume, Velcin Julien
- A Moral Imperative: The Need For Continual Superalignment Of Large Language Models Puthumanaillam Gokul, Vora Manav, Thangeda Pranay, Ornik Melkior
- Towards Tracing Trustworthiness Dynamics: Revisiting Pre-training Period Of Large Language Models Qian Chen, Zhang Jie, Yao Wei, Liu Dongrui, Yin Zhenfei, Qiao Yu, Liu Yong, Shao Jing
- Exploring The Nexus Of Large Language Models And Legal Systems: A Short Survey Qin Weicong, Sun Zhongxiang
- LAMPO: Large Language Models As Preference Machines For Few-shot Ordinal Classification Qin Zhen, Wu Junru, Shen Jiaming, Liu Tianqi, Wang Xuanhui
- Brainstorming Brings Power To Large Language Models Of Knowledge Reasoning Qin Zining, Wang Chenhao, Qin Huiling, Jia Weijia
- Spectral Editing Of Activations For Large Language Model Alignment Qiu Yifu, Zhao Zheng, Ziser Yftah, Korhonen Anna, Ponti Edoardo M., Cohen Shay B.
- The Frontier Of Data Erasure: Machine Unlearning For Large Language Models Qu Youyang, Ding Ming, Sun Nan, Thilakarathna Kanchana, Zhu Tianqing, Niyato Dusit
- Llms Among Us: Generative AI Participating In Digital Discourse Radivojevic Kristina, Clark Nicholas, Brenner Paul
- Constructing Domain-specific Evaluation Sets For Llm-as-a-judge Raju Ravi, Jain Swayambhoo, Li Bo, Li Jonathan, Thakker Urmish
- Visual Hallucination: Definition, Quantification, And Prescriptive Remediations Rani Anku, Rawte Vipula, Sharma Harshad, Anand Neeraj, Rajbangshi Krishnav, Sheth Amit, Das Amitava
- Developing Safe And Responsible Large Language Model: Can We Balance Bias Reduction And Language Understanding In Large Language Models? Raza Shaina, Bamgbose Oluwanifemi, Ghuge Shardul, Tavakol Fatemeh, Reji Deepak John, Bashir Syed Raza
- FAIR Enough: How Can We Develop And Assess A Fair-compliant Dataset For Large Language Models' Training? Raza Shaina, Ghuge Shardul, Ding Chen, Dolatabadi Elham, Pandya Deval
- Beads: Bias Evaluation Across Domains Raza Shaina, Rahman Mizanur, Zhang Michael R.
- MBIAS: Mitigating Bias In Large Language Models While Retaining Context Raza Shaina, Raval Ananya, Chatrath Veronica
- Beyond Performance: Quantifying And Mitigating Label Bias In Llms Reif Yuval, Schwartz Roy
- Identifying Semantic Induction Heads To Understand In-context Learning Ren Jie, Guo Qipeng, Yan Hang, Liu Dongrui, Zhang Quanshi, Qiu Xipeng, Lin Dahua
- Codeattack: Revealing Safety Generalization Challenges Of Large Language Models Via Code Completion Ren Qibing, Gao Chang, Shao Jing, Yan Junchi, Tan Xin, Lam Wai, Ma Lizhuang
- Strengthening Multimodal Large Language Model With Bootstrapped Preference Optimization Renjie Pi, Tianyang Han, Wei Xiong, Jipeng Zhang, Runtao Liu, Rui Pan, Tong Zhang
- Large Language Models Are Biased Because They Are Large Language Models Resnik Philip
- Evaluating Llms For Gender Disparities In Notable Persons Rhue Lauren, Goethals Sofie, Sundararajan Arun
- Exploring The Potential Of The Large Language Models (llms) In Identifying Misleading News Headlines Rony Md Main Uddin, Haque Md Mahfuzul, Ali Mohammad, Alam Ahmed Shatil, Hassan Naeemul
- Safetyprompts: A Systematic Review Of Open Datasets For Evaluating And Improving Large Language Model Safety Röttger Paul, Pernisi Fabio, Vidgen Bertie, Hovy Dirk
- Where Is The Answer? Investigating Positional Bias In Language Model Knowledge Extraction Saito Kuniaki, Sohn Kihyuk, Lee Chen-yu, Ushiku Yoshitaka
- Risks, Causes, And Mitigations Of Widespread Deployments Of Large Language Models (llms): A Survey Sakib Md Nazmus, Islam Md Athikul, Pathak Royal, Arifin Md Mashrur
- Large Language Models Show Human-like Social Desirability Biases In Survey Responses Salecha Aadesh, Ireland Molly E., Subrahmanya Shashanka, Sedoc João, Ungar Lyle H., Eichstaedt Johannes C.
- Retrieval-augmented Generation Meets Data-driven Tabula Rasa Approach For Temporal Knowledge Graph Forecasting Sannidhi Geethan, Sakhinana Sagar Srinivas, Runkana Venkataramana
- The Power Of Prompts: Evaluating And Mitigating Gender Bias In MT With Llms Sant Aleix, Escolano Carlos, Mash Audrey, Fornaciari Francesca De Luca, Melero Maite
- Textmachina: Seamless Generation Of Machine-generated Text Datasets Sarvazyan Areg Mikael, González José Ángel, Franco-salvador Marc
- A Prompt Response To The Demand For Automatic Gender-neutral Translation Savoldi Beatrice, Piergentili Andrea, Fucci Dennis, Negri Matteo, Bentivogli Luisa
- Pitfalls Of Conversational Llms On News Debiasing Schlicht Ipek Baris, Altiok Defne, Taouk Maryanne, Flek Lucie
- Metacognitive Myopia In Large Language Models Scholten Florian, Rebholz Tobias R., Hütter Mandy
- Evaluating Large Language Models With Fmeval Schwöbel Pola, Franceschi Luca, Zafar Muhammad Bilal, Vasist Keerthan, Malhotra Aman, Shenhar Tomer, Tailor Pinal, Yilmaz Pinar, Diamond Michael, Donini Michele
- Behavioral Testing: Can Large Language Models Implicitly Resolve Ambiguous Entities? Sedova Anastasiia, Litschko Robert, Frassinelli Diego, Roth Benjamin, Plank Barbara
- Jetmoe: Reaching Llama2 Performance With 0.1M Dollars Shen Yikang, Guo Zhen, Cai Tianle, Qin Zengyi
- Using Chatgpt To Score Essays And Short-form Constructed Responses Shermis Mark D.
- Judging The Judges: A Systematic Investigation Of Position Bias In Pairwise Comparative Assessments By Llms Shi Lin, Ma Chiyu, Ma Weicheng, Vosoughi Soroush
- Paying More Attention To Image: A Training-free Method For Alleviating Hallucination In Lvlms Shi Liu, Kecheng Zheng, Wei Chen
- Wildfeedback: Aligning Llms With In-situ User Interactions And Feedback Shi Taiwei, Wang Zhuoer, Yang Longqi, Lin Ying-chun, He Zexue, Wan Mengting, Zhou Pei, Jauhar Sujay, Xu Xiaofeng, Song Xia, Neville Jennifer
- Can Prompt Modifiers Control Bias? A Comparative Analysis Of Text-to-image Generative Models Shin Philip Wootaek, Ahn Jihyun Janice, Yin Wenpeng, Sampson Jack, Narayanan Vijaykrishnan
- Scaling Synthetic Logical Reasoning Datasets With Context-sensitive Declarative Grammars Sileo Damien
- Exploring Value Biases: How Llms Deviate Towards The Ideal Sivaprasad Sarath, Kaushik Pramod, Abdelnabi Sahar, Fritz Mario
- Gistembed: Guided In-sample Selection Of Training Negatives For Text Embedding Fine-tuning Solatorio Aivin V.
- Can Many-shot In-context Learning Help Long-context LLM Judges? See More, Judge Better! Song Mingyang, Zheng Mao, Luo Xuan
- M3GIA: A Cognition Inspired Multilingual And Multimodal General Intelligence Ability Benchmark Song Wei, Li Yadong, Xu Jianhua, Wu Guowei, Ming Lingfeng, Yi Kexin, Luo Weihua, Li Houyi, Du Yi, Guo Fangda, Yu Kaicheng
- Moviellm: Enhancing Long Video Understanding With Ai-generated Movies Song Zhende, Wang Chenchen, Sheng Jiamu, Zhang Chi, Yu Gang, Fan Jiayuan, Chen Tao
- Generaitor: Tree-in-the-loop Text Generation For Language Model Explainability And Adaptation Spinner Thilo, Kehlbeck Rebecca, Sevastjanova Rita, Stähle Tobias, Keim Daniel A., Deussen Oliver, El-assady Mennatallah
- Appropriateness Of Llm-equipped Robotic Well-being Coach Language In The Workplace: A Qualitative Evaluation Spitale Micol, Axelsson Minja, Gunes Hatice
- Chatgpt For Conversational Recommendation: Refining Recommendations By Reprompting With Feedback Spurlock Kyle Dylan, Acun Cagla, Saka Esin, Nasraoui Olfa
- Probing Multimodal Llms As World Models For Driving Sreeram Shiva, Wang Tsun-hsuan, Maalouf Alaa, Rosman Guy, Karaman Sertac, Rus Daniela
- Large Language Models Are Inconsistent And Biased Evaluators Stureborg Rickard, Alikaniotis Dimitris, Suhara Yoshi
- Crafting Large Language Models For Enhanced Interpretability Sun Chung-en, Oikarinen Tuomas, Weng Tsui-wei
- Bbox-adapter: Lightweight Adapting For Black-box Large Language Models Sun Haotian, Zhuang Yuchen, Wei Wei, Zhang Chao, Dai Bo
- Expert-guided Extinction Of Toxic Tokens For Debiased Generation Sun Xueyao, Shi Kaize, Tang Haoran, Xu Guandong, Li Qing
- Automating Research Synthesis With Domain-specific Large Language Model Fine-tuning Susnjak Teo, Hwang Peter, Reyes Napoleon H., Barczak Andre L. C., Mcintosh Timothy R., Ranathunga Surangika
- Mmlu-pro+: Evaluating Higher-order Reasoning And Shortcut Learning In Llms Taghanaki Saeid Asgari, Khani Aliasgahr, Khasahmadi Amir
- Improving Large Language Model (LLM) Fidelity Through Context-aware Grounding: A Systematic Approach To Reliability And Veracity Talukdar Wrick, Biswas Anjanava
- Blinded By Generated Contexts: How Language Models Merge Generated And Retrieved Contexts When Knowledge Conflicts? Tan Hexiang, Sun Fei, Yang Wanli, Wang Yuanzhuo, Cao Qi, Cheng Xueqi
- Judging The Judges: Evaluating Alignment And Vulnerabilities In Llms-as-judges Thakur Aman Singh, Choudhary Kartik, Ramayapally Venkat Srinik, Vaidyanathan Sankaran, Hupkes Dieuwke
- Assessing Large Language Models In Mechanical Engineering Education: A Study On Mechanics-focused Conceptual Understanding Tian Jie, Hou Jixin, Wu Zihao, Shu Peng, Liu Zhengliang, Xiang Yujie, Gu Beikang, Filla Nicholas, Li Yiwei, Liu Ning, Chen Xianyan, Tang Keke, Liu Tianming, Wang Xianqiao
- Tokenize The World Into Object-level Knowledge To Address Long-tail Events In Autonomous Driving Tian Ran, Li Boyi, Weng Xinshuo, Chen Yuxiao, Schmerling Edward, Wang Yue, Ivanovic Boris, Pavone Marco
- The Instinctive Bias: Spurious Images Lead To Hallucination In Mllms Tianyang Han, Qing Lian, Rui Pan, Renjie Pi, Jipeng Zhang, Shizhe Diao, Yong Lin, Tong Zhang
- A Comprehensive Survey Of Hallucination Mitigation Techniques In Large Language Models Tonmoy S. M Towhidul Islam, Zaman S M Mehedi, Jain Vinija, Rani Anku, Rawte Vipula, Chadha Aman, Das Amitava
- Connecting The Dots: Llms Can Infer And Verbalize Latent Structure From Disparate Training Data Treutlein Johannes, Choi Dami, Betley Jan, Anil Cem, Marks Samuel, Grosse Roger Baker, Evans Owain
- Insaaf: Incorporating Safety Through Accuracy And Fairness | Are Llms Ready For The Indian Legal Domain? Tripathi Yogesh, Donakanti Raghav, Girhepuje Sahil, Kavathekar Ishan, Vedula Bhaskara Hanuma, Krishnan Gokul S, Goyal Shreya, Goel Anmol, Ravindran Balaraman, Kumaraguru Ponnurangam
- Crossing Linguistic Horizons: Finetuning And Comprehensive Evaluation Of Vietnamese Large Language Models Truong Sang T., Nguyen Duc Q., Nguyen Toan, Le Dong D., Truong Nhi N., Quan Tho, Koyejo Sanmi
- LM Transparency Tool: Interactive Tool For Analyzing Transformer Language Models Tufanov Igor, Hambardzumyan Karen, Ferrando Javier, Voita Elena
- Investigating Context Effects In Similarity Judgements In Large Language Models Uprety Sagar, Jaiswal Amit Kumar, Liu Haiming, Song Dawei
- Autopuredata: Automated Filtering Of Web Data For LLM Fine-tuning Vadlapati Praneeth
- Gender Bias In Machine Translation And The Era Of Large Language Models Vanmassenhove Eva
- Replacing Judges With Juries: Evaluating LLM Generations With A Panel Of Diverse Models Verga Pat, Hofstatter Sebastian, Althammer Sophia, Su Yixuan, Piktus Aleksandra, Arkhangorodsky Arkady, Xu Minjie, White Naomi, Lewis Patrick
- Exploring LGBTQ+ Bias In Generative AI Answers Across Different Country And Religious Contexts Vicsek Lilla, Vancsó Anna, Zajko Mike, Takacs Judit
- Exploring Safety-utility Trade-offs In Personalized Language Models Vijjini Anvesh Rao, Chowdhury Somnath Basu Roy, Chaturvedi Snigdha
- Emotional Manipulation Through Prompt Engineering Amplifies Disinformation Generation In AI Large Language Models Vinay Rasita, Spitale Giovanni, Biller-andorno Nikola, Germani Federico
- Foundational Autoraters: Taming Large Language Models For Better Automatic Evaluation Vu Tu, Krishna Kalpesh, Alzubi Salaheddin, Tar Chris, Faruqui Manaal, Sung Yun-hsuan
- From Rags To Rich Parameters: Probing How Language Models Utilize External Knowledge Over Parametric Information For Factual Queries Wadhwa Hitesh, Seetharaman Rahul, Aggarwal Somyaa, Ghosh Reshmi, Basu Samyadeep, Srinivasan Soundararajan, Zhao Wenlong, Chaudhari Shreyas, Aghazadeh Ehsan
- Task Supportive And Personalized Human-large Language Model Interaction: A User Study Wang Ben, Liu Jiqun, Karimnazarov Jamshed, Thompson Nicolas
- A Survey For Large Language Models In Biomedicine Wang Chong, Li Mengyao, He Junjun, Wang Zhongruo, Darzi Erfan, Chen Zan, Ye Jin, Li Tianbin, Su Yanzhou, Ke Jing, Qu Kaili, Li Shuxin, Yu Yi, Liò Pietro, Wang Tianyun, Wang Yu Guang, Shen Yiqing
- Reinforcement Learning For Edit-based Non-autoregressive Neural Machine Translation Wang Hao, Morimura Tetsuro, Honda Ukyo, Kawahara Daisuke
- Mambabyte: Token-free Selective State Space Model Wang Junxiong, Gangavarapu Tushaar, Yan Jing Nathan, Rush Alexander M.
- Balancing Rigor And Utility: Mitigating Cognitive Biases In Large Language Models For Multiple-choice Questions Wang Liman, Zhong Hanyang, Cao Wenting, Sun Zeyuan
- Will The Real Linda Please Stand Up...to Large Language Models? Examining The Representativeness Heuristic In Llms Wang Pengda, Xiao Zilin, Chen Hanjie, Oswald Frederick L.
- Mfc-bench: Benchmarking Multimodal Fact-checking With Large Vision-language Models Wang Shengkang, Lin Hongzhan, Luo Ziyang, Ye Zhen, Chen Guang, Ma Jing
- Learnable Item Tokenization For Generative Recommendation Wang Wenjie, Bao Honghui, Lin Xinyu, Zhang Jizhi, Li Yongqi, Feng Fuli, Ng See-kiong, Chua Tat-seng
- Testing And Evaluation Of Large Language Models: Correctness, Non-toxicity, And Fairness Wang Wenxuan
- Chatlogic: Integrating Logic Programming With Large Language Models For Multi-step Reasoning Wang Zhongsheng, Liu Jiamou, Bao Qiming, Rong Hongfei, Zhang Jingfeng
- Farsight: Fostering Responsible AI Awareness During AI Application Prototyping Wang Zijie J., Kulkarni Chinmay, Wilcox Lauren, Terry Michael, Madaio Michael
- Speculative RAG: Enhancing Retrieval Augmented Generation Through Drafting Wang Zilong, Wang Zifeng, Le Long, Zheng Huaixiu Steven, Mishra Swaroop, Perot Vincent, Zhang Yuwei, Mattapalli Anush, Taly Ankur, Shang Jingbo, Lee Chen-yu, Pfister Tomas
- Eliminating Position Bias Of Language Models: A Mechanistic Approach Wang Ziqi, Zhang Hanlin, Li Xiner, Huang Kuan-hao, Han Chi, Ji Shuiwang, Kakade Sham M., Peng Hao, Ji Heng
- Cogergllm: Exploring Large Language Model Systems Design Perspective Using Cognitive Ergonomics Wasi Azmine Toushik
- PARIKSHA: A Large-scale Investigation Of Human-llm Evaluator Agreement On Multilingual And Multi-cultural Data Watts Ishaan, Gumma Varun, Yadavalli Aditya, Seshadri Vivek, Swaminathan Manohar, Sitaram Sunayana
- Systematic Evaluation Of Llm-as-a-judge In LLM Alignment Tasks: Explainable Metrics And Diverse Prompt Templates Wei Hui, He Shenghua, Xia Tian, Wong Andy, Lin Jingyang, Han Mei
- Unveiling Selection Biases: Exploring Order And Token Sensitivity In Large Language Models Wei Sheng-lun, Wu Cheng-kuang, Huang Hen-hsen, Chen Hsin-hsi
- Leveraging Large Language Models For Patient Engagement: The Power Of Conversational AI In Digital Health Wen Bo, Norel Raquel, Liu Julia, Stappenbeck Thaddeus, Zulkernine Farhana, Chen Huamin
- Do Llamas Work In English? On The Latent Language Of Multilingual Transformers Wendler Chris, Veselovsky Veniamin, Monea Giovanni, West Robert
- Revealing Fine-grained Values And Opinions In Large Language Models Wright Dustin, Arora Arnav, Borenstein Nadav, Yadav Srishti, Belongie Serge, Augenstein Isabelle
- A Surprising Failure? Multimodal Llms And The NLVR Challenge Wu Anne, Brantley Kianté, Artzi Yoav
- Freeva: Offline MLLM As Training-free Video Assistant Wu Wenhao
- Enhancing Q-learning With Large Language Model Heuristics Wu Xiefeng
- Verbalized Machine Learning: Revisiting Machine Learning With Language Models Xiao Tim Z., Bamler Robert, Schölkopf Bernhard, Liu Weiyang
- Recall: Membership Inference Via Relative Conditional Log-likelihoods Xie Roy, Wang Junlin, Huang Ruomin, Zhang Minxing, Ge Rong, Pei Jian, Gong Neil Zhenqiang, Dhingra Bhuwan
- From Introspection To Best Practices: Principled Analysis Of Demonstrations In Multimodal In-context Learning Xu Nan, Wang Fei, Zhang Sheng, Poon Hoifung, Chen Muhao
- Walking In Others' Shoes: How Perspective-taking Guides Large Language Models In Reducing Toxicity And Bias Xu Rongwu, Zhou Zi'an, Zhang Tianwei, Qi Zehan, Yao Su, Xu Ke, Xu Wei, Qiu Han
- Benchmarking Benchmark Leakage In Large Language Models Xu Ruijie, Wang Zengzhi, Fan Run-ze, Liu Pengfei
- Cruxeval-x: A Benchmark For Multilingual Code Reasoning, Understanding And Execution Xu Ruiyang, Cao Jialun, Lu Yaojie, Lin Hongyu, Han Xianpei, He Ben, Cheung Shing-chi, Sun Le
- Pride And Prejudice: LLM Amplifies Self-bias In Self-refinement Xu Wenda, Zhu Guanglei, Zhao Xuandong, Pan Liangming, Li Lei, Wang William Yang
- Beyond Perplexity: Multi-dimensional Safety Evaluation Of LLM Compression Xu Zhichao, Gupta Ashim, Li Tao, Bentham Oliver, Srikumar Vivek
- Vision-flan: Scaling Human-labeled Tasks In Visual Instruction Tuning Xu Zhiyang, Feng Chao, Shao Rulin, Ashby Trevor, Shen Ying, Jin Di, Cheng Yu, Wang Qifan, Huang Lifu
- Strengthened Symbol Binding Makes Large Language Models Reliable Multiple-choice Selectors Xue Mengge, Hu Zhenyu, Liu Liqun, Liao Kuo, Li Shuang, Han Honglin, Zhao Meng, Yin Chengguo
- Multi-programming Language Ensemble For Code Generation In Large Language Model Xue Tengfei, Li Xuefeng, Azim Tahir, Smirnov Roman, Yu Jianhui, Sadrieh Arash, Pahlavan Babak
- Ninjallm: Fast, Scalable And Cost-effective RAG Using Amazon Sagemaker And AWS Trainium And Inferentia2 Xue Tengfei, Li Xuefeng, Smirnov Roman, Azim Tahir, Sadrieh Arash, Pahlavan Babak
- Pythonsaga: Redefining The Benchmark To Evaluate Code Generating Llms Yadav Ankit, Beniwal Himanshu, Singh Mayank
- Predicting The Big Five Personality Traits In Chinese Counselling Dialogues Using Large Language Models Yan Yang, Ma Lizhi, Li Anqi, Ma Jingsong, Lan Zhenzhong
- Toward Automatic Relevance Judgment Using Vision-language Models For Image-text Retrieval Evaluation Yang Jheng-hong, Lin Jimmy
- Large Language Model Tokenizer Bias: A Case Study And Solution On Gpt-4o Yang Jin, Wang Zhiqiang, Lin Yanbin, Zhao Zunduo
- Machine Unlearning Of Pre-trained Large Language Models Yao Jin, Chien Eli, Du Minxin, Niu Xinyao, Wang Tianhao, Cheng Zezhou, Yue Xiang
- CLAVE: An Adaptive Framework For Evaluating Values Of LLM Generated Responses Yao Jing, Yi Xiaoyuan, Xie Xing
- Mm-spubench: Towards Better Understanding Of Spurious Biases In Multimodal Llms Ye Wenqian, Zheng Guangtao, Ma Yunsheng, Cao Xu, Lai Bolin, Rehg James M., Zhang Aidong
- Jailbreak Attacks And Defenses Against Large Language Models: A Survey Yi Sibo, Liu Yule, Sun Zhen, Cong Tianshuo, He Xinlei, Song Jiaxing, Xu Ke, Li Qi
- Debiasing Multimodal Large Language Models Yi-fan Zhang, Weichen Yu, Qingsong Wen, Xue Wang, Zhang Zhang, Liang Wang, Rong Jin, Tieniu Tan
- Lofit: Localized Fine-tuning On LLM Representations Yin Fangcong, Ye Xi, Durrett Greg
- Hyperclova X Technical Report Yoo Kang Min, Han Jaegeun, In Sookyo, Jeon Heewon, Jeong Jisu, Kang Jaewook, Kim Hyunwook, Kim Kyung-min, Kim Munhyong, Kim Sungju, Kwak Donghyun, Kwak Hanock, Kwon Se Jung, Lee Bado, Lee Dongsoo, Lee Gichang, Lee Jooho, Park Baeseong, Shin Seongjin, Yu Joonsang, Baek Seolki, Byeon Sumin, Cho Eungsup, Choe Dooseok, Han Jeesung, Jin Youngkyun, Jun Hyein, Jung Jaeseung, Kim Chanwoong, Kim Jinhong, Kim Jinuk, Lee Dokyeong, Park Dongwook, Sohn Jeong Min, Han Sujung, Heo Jiae, Hong Sungju, Jeon Mina, Jung Hyunhoon, Jung Jungeun, Jung Wangkyo, Kim Chungjoon, Kim Hyeri, Kim Jonghyun, Kim Min Young, Lee Soeun, Park Joonhee, Shin Jieun, Yang Sojin, Yoon Jungsoon, Lee Hwaran, Bae Sanghwan, Cha Jeehwan, Gylleus Karl, Ham Donghoon, Hong Mihak, Hong Youngki, Hong Yunki, Jang Dahyun, Jeon Hyojun, Jeon Yujin, Jeong Yeji, Ji Myunggeun, Jin Yeguk, Jo Chansong, Joo Shinyoung, Jung Seunghwan, Kim Adrian Jungmyung, Kim Byoung Hoon, Kim Hyomin, Kim Jungwhan, Kim Minkyoung, Kim Minseung, Kim Sungdong, Kim Yonghee, Kim Youngjun, Kim Youngkwan, Ko Donghyeon, Lee Dughyun, Lee Ha Young, Lee Jaehong, Lee Jieun, Lee Jonghyun, Lee Jongjin, Lee Min Young, Lee Yehbin, Min Taehong, Min Yuri, Moon Kiyoon, Oh Hyangnam, Park Jaesun, Park Kyuyon, Park Younghun, Seo Hanbae, Seo Seunghyun, Sim Mihyun, Son Gyubin, Yeo Matt, Yeom Kyung Hoon, Yoo Wonjoon, You Myungin, Ahn Doheon, Ahn Homin, Ahn Joohee, Ahn Seongmin, An Chanwoo, An Hyeryun, An Junho, An Sang-min, Byun Boram, Byun Eunbin, Cha Jongho, Chang Minji, Chang Seunggyu, Cho Haesong, Cho Youngdo, Choi Dalnim, Choi Daseul, Choi Hyoseok, Choi Minseong, Choi Sangho, Choi Seongjae, Choi Wooyong, Chun Sewhan, Go Dong Young, Ham Chiheon, Han Danbi, Han Jaemin, Hong Moonyoung, Hong Sung Bum, Hwang Dong-hyun, Hwang Seongchan, Im Jinbae, Jang Hyuk Jin, Jang Jaehyung, Jang Jaeni, Jang Sihyeon, Jang Sungwon, Jeon Joonha, Jeong Daun, Jeong Joonhyun, Jeong Kyeongseok, Jeong Mini, Jin Sol, Jo Hanbyeol, Jo Hanju, Jo Minjung, Jung 
Chaeyoon, Jung Hyungsik, Jung Jaeuk, Jung Ju Hwan, Jung Kwangsun, Jung Seungjae, Ka Soonwon, Kang Donghan, Kang Soyoung, Kil Taeho, Kim Areum, Kim Beomyoung, Kim Byeongwook, Kim Daehee, Kim Dong-gyun, Kim Donggook, Kim Donghyun, Kim Euna, Kim Eunchul, Kim Geewook, Kim Gyu Ri, Kim Hanbyul, Kim Heesu, Kim Isaac, Kim Jeonghoon, Kim Jihye, Kim Joonghoon, Kim Minjae, Kim Minsub, Kim Pil Hwan, Kim Sammy, Kim Seokhun, Kim Seonghyeon, Kim Soojin, Kim Soong, Kim Soyoon, Kim Sunyoung, Kim Taeho, Kim Wonho, Kim Yoonsik, Kim You Jin, Kim Yuri, Kwon Beomseok, Kwon Ohsung, Kwon Yoo-hwan, Lee Anna, Lee Byungwook, Lee Changho, Lee Daun, Lee Dongjae, Lee Ha-ram, Lee Hodong, Lee Hwiyeong, Lee Hyunmi, Lee Injae, Lee Jaeung, Lee Jeongsang, Lee Jisoo, Lee Jongsoo, Lee Joongjae, Lee Juhan, Lee Jung Hyun, Lee Junghoon, Lee Junwoo, Lee Se Yun, Lee Sujin, Lee Sungjae, Lee Sungwoo, Lee Wonjae, Lee Zoo Hyun, Lim Jong Kun, Lim Kun, Lim Taemin, Na Nuri, Nam Jeongyeon, Nam Kyeong-min, Noh Yeonseog, Oh Biro, Oh Jung-sik, Oh Solgil, Oh Yeontaek, Park Boyoun, Park Cheonbok, Park Dongju, Park Hyeonjin, Park Hyun Tae, Park Hyunjung, Park Jihye, Park Jooseok, Park Junghwan, Park Jungsoo, Park Miru, Park Sang Hee, Park Seunghyun, Park Soyoung, Park Taerim, Park Wonkyeong, Ryu Hyunjoon, Ryu Jeonghun, Ryu Nahyeon, Seo Soonshin, Seo Suk Min, Shim Yoonjeong, Shin Kyuyong, Shin Wonkwang, Sim Hyun, Sim Woongseob, Soh Hyejin, Son Bokyong, Son Hyunjun, Son Seulah, Song Chi-yun, Song Chiyoung, Song Ka Yeon, Song Minchul, Song Seungmin, Wang Jisung, Yeo Yonggoo, Yi Myeong Yeon, Yim Moon Bin, Yoo Taehwan, Yoo Youngjoon, Yoon Sungmin, Yoon Young Jin, Yu Hangyeol, Yu Ui Seon, Zuo Xingdong, Bae Jeongin, Bae Joungeun, Cho Hyunsoo, Cho Seonghyun, Cho Yongjin, Choi Taekyoon, Choi Yera, Chung Jiwan, Han Zhenghui, Heo Byeongho, Hong Euisuk, Hwang Taebaek, Im Seonyeol, Jegal Sumin, Jeon Sumin, Jeong Yelim, Jeong Yonghyun, Jiang Can, Jiang Juyong, Jin Jiho, Jo Ara, Jo Younghyun, Jung Hoyoun, Jung Juyoung, Kang 
Seunghyeong, Kim Dae Hee, Kim Ginam, Kim Hangyeol, Kim Heeseung, Kim Hyojin, Kim Hyojun, Kim Hyun-ah, Kim Jeehye, Kim Jin-hwa, Kim Jiseon, Kim Jonghak, Kim Jung Yoon, Kim Rak Yeong, Kim Seongjin, Kim Seoyoon, Kim Sewon, Kim Sooyoung, Kim Sukyoung, Kim Taeyong, Ko Naeun, Koo Bonseung, Kwak Heeyoung, Kwon Haena, Kwon Youngjin, Lee Boram, Lee Bruce W., Lee Dagyeong, Lee Erin, Lee Euijin, Lee Ha Gyeong, Lee Hyojin, Lee Hyunjeong, Lee Jeeyoon, Lee Jeonghyun, Lee Jongheok, Lee Joonhyung, Lee Junhyuk, Lee Mingu, Lee Nayeon, Lee Sangkyu, Lee Se Young, Lee Seulgi, Lee Seung Jin, Lee Suhyeon, Lee Yeonjae, Lee Yesol, Lee Youngbeom, Lee Yujin, Li Shaodong, Liu Tianyu, Moon Seong-eun, Moon Taehong, Nihlenramstroem Max-lasse, Oh Wonseok, Oh Yuri, Park Hongbeen, Park Hyekyung, Park Jaeho, Park Nohil, Park Sangjin, Ryu Jiwon, Ryu Miru, Ryu Simo, Seo Ahreum, Seo Hee, Seo Kangdeok, Shin Jamin, Shin Seungyoun, Sin Heetae, Wang Jiangping, Wang Lei, Xiang Ning, Xiao Longxiang, Xu Jing, Yi Seonyeong, Yoo Haanju, Yoo Haneul, Yoo Hwanhee, Yu Liang, Yu Youngjae, Yuan Weijie, Zeng Bo, Zhou Qian, Cho Kyunghyun, Ha Jung-woo, Park Joonsuk, Hwang Jihyun, Kwon Hyoung Jo, Kwon Soonyong, Lee Jungyeon, Lee Seungho, Lim Seonghyeon, Noh Hyunkyung, Choi Seungho, Lee Sang-woo, Lim Jung Hwa, Sung Nako
- An LLM Maturity Model For Reliable And Transparent Text-to-query Yu Lei, Ray Abir
- BEYOND DIALOGUE: A Profile-dialogue Alignment Framework Towards General Role-playing Language Model Yu Yeyong, Yu Runsheng, Wei Haojie, Zhang Zhanqiu, Qian Quan
- Mitigate Position Bias In Large Language Models Via Scaling A Single Dimension Yu Yijiong, Jiang Huiqiang, Luo Xufang, Wu Qianhui, Lin Chin-yew, Li Dongsheng, Yang Yuqing, Huang Yongfeng, Qiu Lili
- Refuse Whenever You Feel Unsafe: Improving Safety In Llms Via Decoupled Refusal Training Yuan Youliang, Jiao Wenxiang, Wang Wenxuan, Huang Jen-tse, Xu Jiahao, Liang Tian, He Pinjia, Tu Zhaopeng
- Rigorllm: Resilient Guardrails For Large Language Models Against Undesired Content Yuan Zhuowen, Xiong Zidi, Zeng Yi, Yu Ning, Jia Ruoxi, Song Dawn, Li Bo
- Turkishmmlu: Measuring Massive Multitask Language Understanding In Turkish Yüksel Arda, Köksal Abdullatif, Şenel Lütfi Kerem, Korhonen Anna, Schütze Hinrich
- WSC+: Enhancing The Winograd Schema Challenge Using Tree-of-experts Zahraei Pardis Sadat, Emami Ali
- Unused Information In Token Probability Distribution Of Generative LLM: Improving LLM Reading Comprehension Through Calculation Of Expected Values Zawistowski Krystian
- Design Proteins Using Large Language Models: Enhancements And Comparative Analyses Zeinalipour Kamyar, Jamshidi Neda, Bianchini Monica, Maggini Marco, Gori Marco
- DELIA: Diversity-enhanced Learning For Instruction Adaptation In Large Language Models Zeng Yuanhao, Ren Fei, Zhou Xinpeng, Wang Yihang, Shao Yingxia
- When Scaling Meets LLM Finetuning: The Effect Of Data, Model And Finetuning Method Zhang Biao, Liu Zhongtao, Cherry Colin, Firat Orhan
- Causal Prompting: Debiasing Large Language Model Prompting Based On Front-door Adjustment Zhang Congzhi, Zhang Linhai, Wu Jialong, Zhou Deyu, He Yulan
- Map-neo: Highly Capable And Transparent Bilingual Large Language Model Series Zhang Ge, Qu Scott, Liu Jiaheng, Zhang Chenchen, Lin Chenghua, Yu Chou Leuang, Pan Danny, Cheng Esther, Liu Jie, Lin Qunshu, Yuan Raven, Zheng Tuney, Pang Wei, Du Xinrun, Liang Yiming, Ma Yinghao, Li Yizhi, Ma Ziyang, Lin Bill, Benetos Emmanouil, Yang Huan, Zhou Junting, Ma Kaijing, Liu Minghao, Niu Morry, Wang Noah, Que Quehry, Liu Ruibo, Liu Sine, Guo Shawn, Gao Soren, Zhou Wangchunshu, Zhang Xinyue, Zhou Yizhi, Wang Yubo, Bai Yuelin, Zhang Yuhan, Zhang Yuxiang, Wang Zenith, Yang Zhenzhu, Zhao Zijian, Zhang Jiajun, Ouyang Wanli, Huang Wenhao, Chen Wenhu
- Closing The Gap Between Open-source And Commercial Large Language Models For Medical Evidence Summarization Zhang Gongbo, Jin Qiao, Zhou Yiliang, Wang Song, Idnay Betina R., Luo Yiming, Park Elizabeth, Nestor Jordan G., Spotnitz Matthew E., Soroush Ali, Campion Thomas, Lu Zhiyong, Weng Chunhua, Peng Yifan
- Paying More Attention To Source Context: Mitigating Unfaithful Translations From Large Language Model Zhang Hongbin, Chen Kehai, Bai Xuefeng, Xiang Yang, Zhang Min
- Attention Instruction: Amplifying Attention In The Middle Via Prompting Zhang Meiru, Meng Zaiqiao, Collier Nigel
- Dólares Or Dollars? Unraveling The Bilingual Prowess Of Financial Llms Between Spanish And English Zhang Xiao, Xiang Ruoyu, Yuan Chenhan, Feng Duanyu, Han Weiguang, Lopez-lira Alejandro, Liu Xiao-yang, Ananiadou Sophia, Peng Min, Huang Jimin, Xie Qianqian
- RAGLAB: A Modular And Research-oriented Unified Framework For Retrieval-augmented Generation Zhang Xuanwang, Song Yunze, Wang Yidong, Tang Shuyun, Li Xinfeng, Zeng Zhengran, Wu Zhen, Ye Wei, Xu Wenyuan, Zhang Yue, Dai Xinyu, Zhang Shikun, Wen Qingsong
- Benchmarking Trustworthiness Of Multimodal Large Language Models: A Comprehensive Study Zhang Yichi, Huang Yao, Sun Yitong, Liu Chang, Zhao Zhe, Fang Zhengwei, Wang Yifan, Chen Huanran, Yang Xiao, Wei Xingxing, Su Hang, Dong Yinpeng, Zhu Jun
- Navigating User Experience Of Chatgpt-based Conversational Recommender Systems: The Effects Of Prompt Guidance And Recommendation Domain Zhang Yizhe, Jin Yucheng, Chen Li, Yang Ting
- Leveraging Biases In Large Language Models: "bias-knn" For Effective Few-shot Learning Zhang Yong, Li Hanzhang, Li Zhitao, Cheng Ning, Li Ming, Xiao Jing, Wang Jianzong
- Pattern-aware Chain-of-thought Prompting In Large Language Models Zhang Yufeng, Wang Xuepeng, Wu Lingxiang, Wang Jinqiao
- DARG: Dynamic Evaluation Of Large Language Models Via Adaptive Reasoning Graph Zhang Zhehao, Chen Jiaao, Yang Diyi
- Position-aware Parameter Efficient Fine-tuning Approach For Reducing Positional Bias In Llms Zhang Zheng, Yang Fan, Jiang Ziyan, Chen Zheng, Zhao Zhengyang, Ma Chengyuan, Zhao Liang, Liu Yang
- Language Model Council: Benchmarking Foundation Models On Highly Subjective Tasks By Consensus Zhao Justin, Plaza-del-arco Flor Miriam, Curry Amanda Cercas
- SLIDE: A Framework Integrating Small And Large Language Models For Open-domain Dialogues Evaluation Zhao Kun, Yang Bohao, Tang Chen, Lin Chenghua, Zhan Liang
- Fairer Preferences Elicit Improved Human-aligned Large Language Model Judgments Zhou Han, Wan Xingchen, Liu Yinhong, Collier Nigel, Vulić Ivan, Korhonen Anna
- Unibias: Unveiling And Mitigating LLM Bias Through Internal Attention And FFN Manipulation Zhou Hanzhang, Feng Zijian, Zhu Zixiao, Qian Junlang, Mao Kezhi
- Sequence To Sequence Reward Modeling: Improving RLHF By Language Feedback Zhou Jiayi, Ji Jiaming, Dai Juntao, Yang Yaodong
- Fine-tuning Large Language Models To Translate: Will A Touch Of Noisy Data In Misaligned Languages Suffice? Zhu Dawei, Chen Pinzhen, Zhang Miaoran, Haddow Barry, Shen Xiaoyu, Klakow Dietrich
- Quite Good, But Not Enough: Nationality Bias In Large Language Models – A Case Study Of Chatgpt Zhu Shucheng, Wang Weikang, Liu Ying
- Aya Model: An Instruction Finetuned Open-access Multilingual Language Model Üstün Ahmet, Aryabumi Viraat, Yong Zheng-xin, Ko Wei-yin, D'souza Daniel, Onilude Gbemileke, Bhandari Neel, Singh Shivalika, Ooi Hui-lee, Kayid Amr, Vargus Freddie, Blunsom Phil, Longpre Shayne, Muennighoff Niklas, Fadaee Marzieh, Kreutzer Julia, Hooker Sara
🏷 Fairness
- Learning To Deceive With Attention-based Explanations Pruthi Danish, Gupta Mansi, Dhingra Bhuwan, Neubig Graham, Lipton Zachary C.
- Societal Biases In Language Generation: Progress And Challenges Sheng Emily, Chang Kai-wei, Natarajan Premkumar, Peng Nanyun
- A Disability Lens Towards Biases In GPT-3 Generated Open-ended Languages Amin Akhter Al, Kabir Kazi Sinthia
- Holistic Evaluation Of Language Models Percy Liang, Rishi Bommasani, Tony Lee, Dimitris Tsipras, Dilara Soylu, Michihiro Yasunaga, Yian Zhang, Deepak Narayanan, Yuhuai Wu, Ananya Kumar, Benjamin Newman, Binhang Yuan, Bobby Yan, Ce Zhang, Christian Cosgrove, Christopher D. Manning, Christopher Ré, Diana Acosta-navas, Drew A. Hudson, Eric Zelikman, Esin Durmus, Faisal Ladhak, Frieda Rong, Hongyu Ren, Huaxiu Yao, Jue Wang, Keshav Santhanam, Laurel Orr, Lucia Zheng, Mert Yuksekgonul, Mirac Suzgun, Nathan Kim, Neel Guha, Niladri Chatterji, Omar Khattab, Peter Henderson, Qian Huang, Ryan Chi, Sang Michael Xie, Shibani Santurkar, Surya Ganguli, Tatsunori Hashimoto, Thomas Icard, Tianyi Zhang, Vishrav Chaudhary, William Wang, Xuechen Li, Yifan Mai, Yuhui Zhang, Yuta Koreeda
- Chatgpt: The End Of Online Exam Integrity? Susnjak Teo
- H2ogpt: Democratizing Large Language Models Candel Arno, Mckinney Jon, Singer Philipp, Pfeiffer Pascal, Jeblick Maximilian, Prabhu Prithvi, Gambera Jeff, Landry Mark, Bansal Shivam, Chesler Ryan, Lee Chun Ming, Conde Marcos V., Stetsenko Pasha, Grellier Olivier, Ambati Srisatish
- Assessing Large Language Models' Ability To Predict How Humans Balance Self-interest And The Interest Of Others Capraro Valerio, Di Paolo Roberto, Pizziol Veronica
- Challenges And Contributing Factors In The Utilization Of Large Language Models (llms) Chen Xiaoliang, Li Liangbin, Chang Le, Huang Yunhe, Zhao Yuxuan, Zhang Yuxiao, Li Dinuo
- Complex QA And Language Models Hybrid Architectures, Survey Daull Xavier, Bellot Patrice, Bruno Emmanuel, Martin Vincent, Murisasco Elisabeth
- Scaling Vision Transformers To 22 Billion Parameters Dehghani Mostafa, Djolonga Josip, Mustafa Basil, Padlewski Piotr, Heek Jonathan, Gilmer Justin, Steiner Andreas, Caron Mathilde, Geirhos Robert, Alabdulmohsin Ibrahim, Jenatton Rodolphe, Beyer Lucas, Tschannen Michael, Arnab Anurag, Wang Xiao, Riquelme Carlos, Minderer Matthias, Puigcerver Joan, Evci Utku, Kumar Manoj, Van Steenkiste Sjoerd, Elsayed Gamaleldin F., Mahendran Aravindh, Yu Fisher, Oliver Avital, Huot Fantine, Bastings Jasmijn, Collier Mark Patrick, Gritsenko Alexey, Birodkar Vighnesh, Vasconcelos Cristina, Tay Yi, Mensink Thomas, Kolesnikov Alexander, Pavetić Filip, Tran Dustin, Kipf Thomas, Lučić Mario, Zhai Xiaohua, Keysers Daniel, Harmsen Jeremiah, Houlsby Neil
- Fairness Of Chatgpt And The Role Of Explainable-guided Prompts Deldjoo Yashar
- Probing Explicit And Implicit Gender Bias Through LLM Conditional Text Generation Dong Xiangjue, Wang Yibo, Yu Philip S., Caverlee James
- ROBBIE: Robust Bias Evaluation Of Large Generative Language Models Esiobu David, Tan Xiaoqing, Hosseini Saghar, Ung Megan, Zhang Yuchen, Fernandes Jude, Dwivedi-yu Jane, Presani Eleonora, Williams Adina, Smith Eric Michael
- Bias Runs Deep: Implicit Reasoning Biases In Persona-assigned Llms Gupta Shashank, Shrivastava Vaishnavi, Deshpande Ameet, Kalyan Ashwin, Clark Peter, Sabharwal Ashish, Khot Tushar
- Regulating Chatgpt And Other Large Generative AI Models Hacker Philipp, Engel Andreas, Mauer Marco
- Better To Ask In English: Cross-lingual Evaluation Of Large Language Models For Healthcare Queries Jin Yiqiao, Chandra Mohit, Verma Gaurav, Hu Yibo, De Choudhury Munmun, Kumar Srijan
- Is Chatgpt Fair For Recommendation? Evaluating Fairness In Large Language Model Recommendation Jizhi Zhang, Keqin Bao, Yang Zhang, Wenjie Wang, Fuli Feng, Xiangnan He
- Robust Recommender System: A Survey And Future Directions Kaike Zhang, Qi Cao, Fei Sun, Yunfan Wu, Shuchang Tao, Huawei Shen, Xueqi Cheng
- Mvbench: A Comprehensive Multi-modal Video Understanding Benchmark Li Kunchang, Wang Yali, He Yinan, Li Yizhuo, Wang Yi, Liu Yi, Wang Zun, Xu Jilan, Chen Guo, Luo Ping, Wang Limin, Qiao Yu
- A Survey On Fairness In Large Language Models Li Yingji, Du Mengnan, Song Rui, Wang Xin, Wang Ying
- GPT Detectors Are Biased Against Non-native English Writers Liang Weixin, Yuksekgonul Mert, Mao Yining, Wu Eric, Zou James
- Opening Up Chatgpt: Tracking Openness, Transparency, And Accountability In Instruction-tuned Text Generators Liesenfeld Andreas, Lopez Alianda, Dingemanse Mark
- Trustworthy Llms: A Survey And Guideline For Evaluating Large Language Models' Alignment Liu Yang, Yao Yuanshun, Ton Jean-francois, Zhang Xiaoying, Guo Ruocheng, Cheng Hao, Klochkov Yegor, Taufiq Muhammad Faaiz, Li Hang
- Fairness-guided Few-shot Prompting For Large Language Models Ma Huan, Zhang Changqing, Bian Yatao, Liu Lemao, Zhang Zhirui, Zhao Peilin, Zhang Shu, Fu Huazhu, Hu Qinghua, Wu Bingzhe
- How Trustworthy Are Open-source Llms? An Assessment Under Malicious Demonstrations Shows Their Vulnerabilities Mo Lingbo, Wang Boshi, Chen Muhao, Sun Huan
- N-critics: Self-refinement Of Large Language Models With Ensemble Of Critics Mousavi Sajad, Gutiérrez Ricardo Luna, Rengarajan Desik, Gundecha Vineet, Babu Ashwin Ramesh, Naug Avisek, Guillen Antonio, Sarkar Soumyendu
- A Brief History Of Prompt: Leveraging Language Models. (through Advanced Prompting) Muktadir Golam Md
- Enhancing Logical Reasoning In Large Language Models To Facilitate Legal Applications Nguyen Ha-thanh, Fungwacharakorn Wachara, Satoh Ken
- Large Language Model (LLM) Bias Index – LLMBI Oketunji Abiodun Finbarrs, Anas Muhammad, Saina Deepthi
- Reproducing Whisper-style Training Using An Open-source Toolkit And Publicly Available Data Peng Yifan, Tian Jinchuan, Yan Brian, Berrebbi Dan, Chang Xuankai, Li Xinjian, Shi Jiatong, Arora Siddhant, Chen William, Sharma Roshan, Zhang Wangyou, Sudo Yui, Shakeel Muhammad, Jung Jee-weon, Maiti Soumi, Watanabe Shinji
- Can Chatgpt Assess Human Personalities? A General Evaluation Framework Rao Haocong, Leung Cyril, Miao Chunyan
- Soft-prompt Tuning For Large Language Models To Evaluate Bias Tian Jacob-junqi, Emerson David, Miyandoab Sevil Zanjani, Pandya Deval, Seyyed-kalantari Laleh, Khattak Faiza Khan
- Post Turing: Mapping The Landscape Of LLM Evaluation Tikhonov Alexey, Yamshchikov Ivan P.
- Adding Guardrails To Advanced Chatbots Wang Yanchen, Singh Lisa
- A Preliminary Study Of Chatgpt On News Recommendation: Personalization, Provider Fairness, Fake News Xinyi Li, Yongfeng Zhang, Edward C. Malthouse
- Bias And Fairness In Chatbots: An Overview Xue Jintang, Wang Yun-cheng, Wei Chengwei, Liu Xiaofeng, Woo Jonghye, Kuo C.-c. Jay
- Bias A-head? Analyzing Bias In Transformer-based Language Model Attention Heads Yang Yi, Duan Hanyu, Abbasi Ahmed, Lalor John P., Tam Kar Yan
- Fairness-aware Structured Pruning In Transformers Zayed Abdelrahman, Mordido Goncalo, Shabanian Samira, Baldini Ioana, Chandar Sarath
- Taxonomy-based Checklist For Large Language Model Evaluation Zhang Damin
- Open (clinical) Llms Are Sensitive To Instruction Phrasings Arroyo Alberto Mario Ceballos, Munnangi Monica, Sun Jiuding, Zhang Karen Y. C., Mcinerney Denis Jered, Wallace Byron C., Amir Silvio
- Current State Of LLM Risks And AI Guardrails Ayyamperumal Suriya Ganesh, Ge Limin
- Breaking Boundaries: Investigating The Effects Of Model Editing On Cross-linguistic Performance Banerjee Somnath, Halder Avik, Mandal Rajarshi, Layek Sayan, Soboroff Ian, Hazra Rima, Mukherjee Animesh
- SUTRA: Scalable Multilingual Language Model Architecture Bendale Abhijit, Sapienza Michael, Ripplinger Steven, Gibbs Simon, Lee Jaewon, Mistry Pranav
- Enhancing Trust In Llms: Algorithms For Comparing And Interpreting Llms Brown Nik Bear
- Are Large Language Models Really Bias-free? Jailbreak Prompts For Assessing Adversarial Robustness To Bias Elicitation Cantini Riccardo, Cosenza Giada, Orsino Alessio, Talia Domenico
- Editable Fairness: Fine-grained Bias Mitigation In Language Models Chen Ruizhe, Li Yichen, Yang Jianfei, Zhou Joey Tianyi, Liu Zuozhu
- Open Source Conversational Llms Do Not Know Most Spanish Words Conde Javier, González Miguel, Melero Nina, Ferrando Raquel, Martínez Gonzalo, Merino-gómez Elena, Hernández José Alberto, Reviriego Pedro
- Out-of-context Prompting Boosts Fairness And Robustness In Large Language Model Predictions Cotta Leonardo, Maddison Chris J.
- Low-rank Finetuning For Llms: A Fairness Perspective Das Saswat, Romanelli Marco, Tran Cuong, Reza Zarreen, Kailkhura Bhavya, Fioretto Ferdinando
- Understanding Biases In Chatgpt-based Recommender Systems: Provider Fairness, Temporal Stability, And Recency Deldjoo Yashar
- AXOLOTL: Fairness Through Assisted Self-debiasing Of Large Language Model Outputs Ebrahimi Sana, Chen Kaiwen, Asudeh Abolfazl, Das Gautam, Koudas Nick
- Mechanistic Interpretability Of Large Language Models With Applications To The Financial Services Industry Golgoon Ashkan, Filom Khashayar, Kannan Arjun Ravi
- Chatgpt Based Data Augmentation For Improved Parameter-efficient Debiasing Of Llms Han Pengrui, Kocielnik Rafal, Saravanan Adhithya, Jiang Roy, Sharir Or, Anandkumar Anima
- Prompt-based Bias Calibration For Better Zero/few-shot Learning Of Language Models He Kang, Long Yinghan, Roy Kaushik
- Fortifying Ethical Boundaries In AI: Advanced Strategies For Enhancing Security In Large Language Models He Yunhong, Qiu Jianling, Zhang Wei, Yuan Zhengqing
- Applying And Evaluating Large Language Models In Mental Health Care: A Scoping Review Of Human-assessed Generative Tasks Hua Yining, Na Hongbin, Li Zehan, Liu Fenglin, Fang Xiao, Clifton David, Torous John
- Trustllm: Trustworthiness In Large Language Models Huang Yue, Sun Lichao, Wang Haoran, Wu Siyuan, Zhang Qihui, Li Yuan, Gao Chujie, Huang Yixin, Lyu Wenhan, Zhang Yixuan, Li Xiner, Liu Zhengliang, Liu Yixin, Wang Yijue, Zhang Zhikun, Vidgen Bertie, Kailkhura Bhavya, Xiong Caiming, Xiao Chaowei, Li Chunyuan, Xing Eric, Huang Furong, Liu Hao, Ji Heng, Wang Hongyi, Zhang Huan, Yao Huaxiu, Kellis Manolis, Zitnik Marinka, Jiang Meng, Bansal Mohit, Zou James, Pei Jian, Liu Jian, Gao Jianfeng, Han Jiawei, Zhao Jieyu, Tang Jiliang, Wang Jindong, Vanschoren Joaquin, Mitchell John, Shu Kai, Xu Kaidi, Chang Kai-wei, He Lifang, Huang Lifu, Backes Michael, Gong Neil Zhenqiang, Yu Philip S., Chen Pin-yu, Gu Quanquan, Xu Ran, Ying Rex, Ji Shuiwang, Jana Suman, Chen Tianlong, Liu Tianming, Zhou Tianyi, Wang William, Li Xiang, Zhang Xiangliang, Wang Xiao, Xie Xing, Chen Xun, Wang Xuyu, Liu Yan, Ye Yanfang, Cao Yinzhi, Chen Yong, Zhao Yue
- Grade Score: Quantifying LLM Performance In Option Selection Iourovitski Dmitri
- Navigating LLM Ethics: Advancements, Challenges, And Future Directions Jiao Junfeng, Afroogh Saleh, Xu Yiming, Phillips Connor
- A2SF: Accumulative Attention Scoring With Forgetting Factor For Token Pruning In Transformer Decoder Jo Hyun-rae, Shin Dongkun
- The Life Cycle Of Large Language Models: A Review Of Biases In Education Lee Jinsook, Hicke Yann, Yu Renzhe, Brooks Christopher, Kizilcec René F.
- Red Teaming Visual Language Models Li Mukai, Li Lei, Yin Yuwei, Ahmed Masood, Liu Zhenguang, Liu Qi
- Generative Artificial Intelligence In Higher Education: Evidence From An Analysis Of Institutional Policies And Guidelines Mcdonald Nora, Johri Aditya, Ali Areej, Hingle Aayushi
- Towards Tracing Trustworthiness Dynamics: Revisiting Pre-training Period Of Large Language Models Qian Chen, Zhang Jie, Yao Wei, Liu Dongrui, Yin Zhenfei, Qiao Yu, Liu Yong, Shao Jing
- Evaluating Llms For Gender Disparities In Notable Persons Rhue Lauren, Goethals Sofie, Sundararajan Arun
- Risks, Causes, And Mitigations Of Widespread Deployments Of Large Language Models (LLMs): A Survey Sakib Md Nazmus, Islam Md Athikul, Pathak Royal, Arifin Md Mashrur
- Using Chatgpt To Score Essays And Short-form Constructed Responses Shermis Mark D.
- Judging The Judges: A Systematic Investigation Of Position Bias In Pairwise Comparative Assessments By Llms Shi Lin, Ma Chiyu, Ma Weicheng, Vosoughi Soroush
- Expert-guided Extinction Of Toxic Tokens For Debiased Generation Sun Xueyao, Shi Kaize, Tang Haoran, Xu Guandong, Li Qing
- Improving Large Language Model (LLM) Fidelity Through Context-aware Grounding: A Systematic Approach To Reliability And Veracity Talukdar Wrick, Biswas Anjanava
- Insaaf: Incorporating Safety Through Accuracy And Fairness | Are Llms Ready For The Indian Legal Domain? Tripathi Yogesh, Donakanti Raghav, Girhepuje Sahil, Kavathekar Ishan, Vedula Bhaskara Hanuma, Krishnan Gokul S, Goyal Shreya, Goel Anmol, Ravindran Balaraman, Kumaraguru Ponnurangam
- Gender Bias In Machine Translation And The Era Of Large Language Models Vanmassenhove Eva
- Testing And Evaluation Of Large Language Models: Correctness, Non-toxicity, And Fairness Wang Wenxuan
- Leveraging Large Language Models For Patient Engagement: The Power Of Conversational AI In Digital Health Wen Bo, Norel Raquel, Liu Julia, Stappenbeck Thaddeus, Zulkernine Farhana, Chen Huamin
- Guiding Clinical Reasoning With Large Language Models Via Knowledge Seeds Wu Jiageng, Wu Xian, Yang Jie
- Freeva: Offline MLLM As Training-free Video Assistant Wu Wenhao
- Debiasing Multimodal Large Language Models Yi-fan Zhang, Weichen Yu, Qingsong Wen, Xue Wang, Zhang Zhang, Liang Wang, Rong Jin, Tieniu Tan
- Benchmarking Trustworthiness Of Multimodal Large Language Models: A Comprehensive Study Zhang Yichi, Huang Yao, Sun Yitong, Liu Chang, Zhao Zhe, Fang Zhengwei, Wang Yifan, Chen Huanran, Yang Xiao, Wei Xingxing, Su Hang, Dong Yinpeng, Zhu Jun
- Fairer Preferences Elicit Improved Human-aligned Large Language Model Judgments Zhou Han, Wan Xingchen, Liu Yinhong, Collier Nigel, Vulić Ivan, Korhonen Anna
🏷 Few Shot
- Few-shot Generalization Across Dialogue Tasks Vlasov Vladimir, Drissner-schmid Akela, Nichol Alan
- Investigating Multilingual NMT Representations At Scale Kudugunta Sneha Reddy, Bapna Ankur, Caswell Isaac, Arivazhagan Naveen, Firat Orhan
- Learning To Memorize In Neural Task-oriented Dialogue Systems Wu Chien-sheng
- Language Models Are Few-shot Learners Brown Tom B., Mann Benjamin, Ryder Nick, Subbiah Melanie, Kaplan Jared, Dhariwal Prafulla, Neelakantan Arvind, Shyam Pranav, Sastry Girish, Askell Amanda, Agarwal Sandhini, Herbert-voss Ariel, Krueger Gretchen, Henighan Tom, Child Rewon, Ramesh Aditya, Ziegler Daniel M., Wu Jeffrey, Winter Clemens, Hesse Christopher, Chen Mark, Sigler Eric, Litwin Mateusz, Gray Scott, Chess Benjamin, Clark Jack, Berner Christopher, Mccandlish Sam, Radford Alec, Sutskever Ilya, Amodei Dario
- Fid-ex: Improving Sequence-to-sequence Models For Extractive Rationale Generation Lakhotia Kushal, Paranjape Bhargavi, Ghoshal Asish, Yih Wen-tau, Mehdad Yashar, Iyer Srinivasan
- From Zero To Hero: On The Limitations Of Zero-shot Cross-lingual Transfer With Multilingual Transformers Lauscher Anne, Ravishankar Vinit, Vulić Ivan, Glavaš Goran
- Language Models As Few-shot Learner For Task-oriented Dialogue Systems Madotto Andrea, Liu Zihan, Lin Zhaojiang, Fung Pascale
- SOLOIST: Building Task Bots At Scale With Transfer Learning And Machine Teaching Peng Baolin, Li Chunyuan, Li Jinchao, Shayandeh Shahin, Liden Lars, Gao Jianfeng
- MAD-X: An Adapter-based Framework For Multi-task Cross-lingual Transfer Pfeiffer Jonas, Vulić Ivan, Gurevych Iryna, Ruder Sebastian
- It's Not Just Size That Matters: Small Language Models Are Also Few-shot Learners Schick Timo, Schütze Hinrich
- Hybrid Generative-retrieval Transformers For Dialogue Domain Adaptation Shalyminov Igor, Sordoni Alessandro, Atkinson Adam, Schulz Hannes
- Making Pre-trained Language Models Better Few-shot Learners Tianyu Gao, Adam Fisch, Danqi Chen
- TOD-BERT: Pre-trained Natural Language Understanding For Task-oriented Dialogue Wu Chien-sheng, Hoi Steven, Socher Richard, Xiong Caiming
- CG-BERT: Conditional Text Generation With BERT For Generalized Few-shot Intent Detection Xia Congying, Zhang Chenwei, Nguyen Hoang, Zhang Jiawei, Yu Philip
- Few-shot Generative Conversational Query Rewriting Yu Shi, Liu Jiahua, Yang Jingqin, Xiong Chenyan, Bennett Paul, Gao Jianfeng, Liu Zhiyuan
- CPM: A Large-scale Generative Chinese Pre-trained Language Model Zhang Zhengyan, Han Xu, Zhou Hao, Ke Pei, Gu Yuxian, Ye Deming, Qin Yujia, Su Yusheng, Ji Haozhe, Guan Jian, Qi Fanchao, Wang Xiaozhi, Zheng Yanan, Zeng Guoyang, Cao Huanqi, Chen Shengqi, Li Daixuan, Sun Zhenbo, Liu Zhiyuan, Huang Minlie, Han Wentao, Tang Jie, Li Juanzi, Zhu Xiaoyan, Sun Maosong
- A Closer Look At Few-shot Crosslingual Transfer: The Choice Of Shots Matters Zhao Mengjie, Zhu Yi, Shareghi Ehsan, Vulić Ivan, Reichart Roi, Korhonen Anna, Schütze Hinrich
- RAFT: A Real-world Few-shot Text Classification Benchmark Alex Neel, Lifland Eli, Tunstall Lewis, Thakur Abhishek, Maham Pegah, Riedel C. Jess, Hine Emmie, Ashurst Carolyn, Sedille Paul, Carlier Alexis, Noetel Michael, Stuhlmüller Andreas
- On The Multilingual Capabilities Of Very Large-scale English Language Models Armengol-estapé Jordi, Bonet Ona De Gibert, Melero Maite
- Efficient Large Scale Language Modeling With Mixtures Of Experts Artetxe Mikel, Bhosale Shruti, Goyal Naman, Mihaylov Todor, Ott Myle, Shleifer Sam, Lin Xi Victoria, Du Jingfei, Iyer Srinivasan, Pasunuru Ramakanth, Anantharaman Giri, Li Xian, Chen Shuohui, Akin Halil, Baines Mandeep, Martin Louis, Zhou Xing, Koura Punit Singh, O'horo Brian, Wang Jeff, Zettlemoyer Luke, Diab Mona, Kozareva Zornitsa, Stoyanov Ves
- What Changes Can Large-scale Language Models Bring? Intensive Study On Hyperclova: Billions-scale Korean Generative Pretrained Transformers Boseop Kim, Hyoungseok Kim, Sang-woo Lee, Gichang Lee, Donghyun Kwak, Dong Hyeon Jeon, Sunghyun Park, Sungju Kim, Seonhoon Kim, Dongpil Seo, Heungsub Lee, Minyoung Jeong, Sungjae Lee, Minsub Kim, Suk Hyun Ko, Seokhun Kim, Taeyong Park, Jinuk Kim, Soyoung Kang, Na-hyeon Ryu, Kang Min Yoo, Minsuk Chang, Soobin Suh, Sookyo In, Jinseong Park, Kyungduk Kim, Hiun Kim, Jisu Jeong, Yong Goo Yeo, Donghoon Ham, Dongju Park, Min Young Lee, Jaewook Kang, Inho Kang, Jung-woo Ha, Woomyoung Park, Nako Sung
- The Power Of Scale For Parameter-efficient Prompt Tuning Brian Lester, Rami Al-rfou, Noah Constant
- The Selectgen Challenge: Finding The Best Training Samples For Few-shot Neural Text Generation Chang Ernie, Shen Xiaoyu, Marin Alex, Demberg Vera
- On Training Instance Selection For Few-shot Neural Text Generation Chang Ernie, Shen Xiaoyu, Yeh Hui-syuan, Demberg Vera
- Neural Data-to-text Generation With Lm-based Text Augmentation Chang Ernie, Shen Xiaoyu, Zhu Dawei, Demberg Vera, Su Hui
- Reinforcement Learning For Few-shot Text Generation Adaptation Cheng Pengsen, Dai Jinqiao, Liu Jiamiao, Liu Jiayong, Jia Peng
- WARP: Word-level Adversarial Reprogramming Hambardzumyan Karen, Khachatrian Hrant, May Jonathan
- Generate, Annotate, And Learn: NLP With Synthetic Text He Xuanli, Nassar Islam, Kiros Jamie, Haffari Gholamreza, Norouzi Mohammad
- Finetuned Language Models Are Zero-shot Learners Jason Wei, Maarten Bosma, Vincent Y. Zhao, Kelvin Guu, Adams Wei Yu, Brian Lester, Nan Du, Andrew M. Dai, Quoc V. Le
- A Good Prompt Is Worth Millions Of Parameters: Low-resource Prompt-based Learning For Vision-language Models Jin Woojeong, Cheng Yu, Shen Yelong, Chen Weizhu, Ren Xiang
- What's In A Measurement? Using GPT-3 On Semeval 2021 Task 8 -- Measeval Kohler Curt, Daniel Ron Jr
- Towards Few-shot Fact-checking Via Perplexity Lee Nayeon, Bang Yejin, Madotto Andrea, Khabsa Madian, Fung Pascale
- Few-shot Learning With Multilingual Language Models Lin Xi Victoria, Mihaylov Todor, Artetxe Mikel, Wang Tianlu, Chen Shuohui, Simig Daniel, Ott Myle, Goyal Naman, Bhosale Shruti, Du Jingfei, Pasunuru Ramakanth, Shleifer Sam, Koura Punit Singh, Chaudhary Vishrav, O'horo Brian, Wang Jeff, Zettlemoyer Luke, Kozareva Zornitsa, Diab Mona, Stoyanov Veselin, Li Xian
- What Makes Good In-context Examples For GPT-3? Liu Jiachang, Shen Dinghan, Zhang Yizhe, Dolan Bill, Carin Lawrence, Chen Weizhu
- Unified Multimodal Pre-training And Prompt-based Tuning For Vision-language Understanding And Generation Liu Tianyi, Wu Zuxuan, Xiong Wenhan, Chen Jingjing, Jiang Yu-gang
- GPT Understands, Too Liu Xiao, Zheng Yanan, Du Zhengxiao, Ding Ming, Qian Yujie, Yang Zhilin, Tang Jie
- Fantastically Ordered Prompts And Where To Find Them: Overcoming Few-shot Prompt Order Sensitivity Lu Yao, Bartolo Max, Moore Alastair, Riedel Sebastian, Stenetorp Pontus
- Few-shot Self-rationalization With Natural Language Prompts Marasović Ana, Beltagy Iz, Downey Doug, Peters Matthew E.
- Improving Compositional Generalization With Self-training For Data-to-text Generation Mehta Sanket Vaibhav, Rao Jinfeng, Tay Yi, Kale Mihir, Parikh Ankur P., Strubell Emma
- CINS: Comprehensive Instruction For Few-shot Learning In Task-oriented Dialog Systems Mi Fei, Li Yitong, Wang Yasheng, Jiang Xin, Liu Qun
- Self-training Improves Pre-training For Few-shot Learning In Task-oriented Dialog Systems Mi Fei, Zhou Wanhao, Cai Fengyu, Kong Lingjing, Huang Minlie, Faltings Boi
- Language Models Are Few-shot Butlers Micheli Vincent, Fleuret François
- GPT-3 Models Are Poor Few-shot Learners In The Biomedical Domain Moradi Milad, Blagec Kathrin, Haberl Florian, Samwald Matthias
- Show Your Work: Scratchpads For Intermediate Computation With Language Models Nye Maxwell, Andreassen Anders Johan, Gur-ari Guy, Michalewski Henryk, Austin Jacob, Bieber David, Dohan David, Lewkowycz Aitor, Bosma Maarten, Luan David, Sutton Charles, Odena Augustus
- Pre-train, Prompt, And Predict: A Systematic Survey Of Prompting Methods In Natural Language Processing Pengfei Liu, Weizhe Yuan, Jinlan Fu, Zhengbao Jiang, Hiroaki Hayashi, Graham Neubig
- LFPT5: A Unified Framework For Lifelong Few-shot Language Learning Based On Prompt Tuning Of T5 Qin Chengwei, Joty Shafiq
- Crossfit: A Few-shot Learning Challenge For Cross-task Generalization In NLP Qinyuan Ye, Bill Yuchen Lin, Xiang Ren
- VT-CLIP: Enhancing Vision-language Models With Visual-guided Texts Qiu Longtian, Zhang Renrui, Guo Ziyu, Zeng Ziyao, Guo Zilu, Li Yafeng, Zhang Guangnan
- Prompt Programming For Large Language Models: Beyond The Few-shot Paradigm Reynolds Laria, Mcdonell Kyle
- LAION-400M: Open Dataset Of Clip-filtered 400 Million Image-text Pairs Schuhmann Christoph, Vencu Richard, Beaumont Romain, Kaczmarczyk Robert, Mullis Clayton, Katta Aarush, Coombes Theo, Jitsev Jenia, Komatsuzaki Aran
- Metaicl: Learning To Learn In Context Sewon Min, Mike Lewis, Luke Zettlemoyer, Hannaneh Hajishirzi
- Few-shot Semantic Parsing With Language Models Trained On Code Shin Richard, Van Durme Benjamin
- NSP-BERT: A Prompt-based Few-shot Learner Through An Original Pre-training Task--next Sentence Prediction Sun Yi, Zheng Yu, Hao Chao, Qiu Hangping
- ERNIE 3.0: Large-scale Knowledge Enhanced Pre-training For Language Understanding And Generation Sun Yu, Wang Shuohuan, Feng Shikun, Ding Siyu, Pang Chao, Shang Junyuan, Liu Jiaxiang, Chen Xuyi, Zhao Yanbin, Lu Yuxiang, Liu Weixin, Wu Zhihua, Gong Weibao, Liang Jianzhong, Shang Zhizhou, Sun Peng, Liu Wei, Ouyang Xuan, Yu Dianhai, Tian Hao, Wu Hua, Wang Haifeng
- Multimodal Few-shot Learning With Frozen Language Models Tsimpoukelli Maria, Menick Jacob, Cabi Serkan, Eslami S. M. Ali, Vinyals Oriol, Hill Felix
- Convfit: Conversational Fine-tuning Of Pretrained Language Models Vulić Ivan, Su Pei-hao, Coope Sam, Gerz Daniela, Budzianowski Paweł, Casanueva Iñigo, Mrkšić Nikola, Wen Tsung-hsien
- Entailment As Few-shot Learner Wang Sinong, Fang Han, Khabsa Madian, Mao Hanzi, Ma Hao
- List: Lite Prompted Self-training Makes Parameter-efficient Few-shot Learners Wang Yaqing, Mukherjee Subhabrata, Liu Xiaodong, Gao Jing, Awadallah Ahmed Hassan, Gao Jianfeng
- Pangu-α: Large-scale Autoregressive Pretrained Chinese Language Models With Auto-parallel Computation Wei Zeng, Xiaozhe Ren, Teng Su, Hui Wang, Yi Liao, Zhiwei Wang, Xin Jiang, Zhenzhang Yang, Kaisheng Wang, Xiaoda Zhang, Chen Li, Ziyan Gong, Yifan Yao, Xinjing Huang, Jun Wang, Jianfeng Yu, Qi Guo, Yue Yu, Yan Zhang, Jin Wang, Hengtao Tao, Dasen Yan, Zexuan Yi, Fang Peng, Fangqing Jiang, Han Zhang, Lingfeng Deng, Yehong Zhang, Zhe Lin, Chao Zhang, Shaojie Zhang, Mingyue Guo, Shanzhi Gu, Gaojun Fan, Yaowei Wang, Xuefeng Jin, Qun Liu, Yonghong Tian
- Reframing Human-ai Collaboration For Generating Free-text Explanations Wiegreffe Sarah, Hessel Jack, Swayamdipta Swabha, Riedl Mark, Choi Yejin
- Language Models Are Few-shot Multilingual Learners Winata Genta Indra, Madotto Andrea, Lin Zhaojiang, Liu Rosanne, Yosinski Jason, Fung Pascale
- Yuan 1.0: Large-scale Pre-trained Language Model In Zero-shot And Few-shot Learning Wu Shaohua, Zhao Xudong, Yu Tong, Zhang Rongguo, Shen Chong, Liu Hongli, Li Feng, Zhu Hong, Luo Jiangang, Xu Liang, Zhang Xuanwei
- An Explanation Of In-context Learning As Implicit Bayesian Inference Xie Sang Michael, Raghunathan Aditi, Liang Percy, Ma Tengyu
- An Empirical Study Of GPT-3 For Few-shot Knowledge-based VQA Yang Zhengyuan, Gan Zhe, Wang Jianfeng, Hu Xiaowei, Lu Yumao, Liu Zicheng, Wang Lijuan
- Gpt3mix: Leveraging Large-scale Language Models For Text Augmentation Yoo Kang Min, Park Dongju, Kang Jaewook, Lee Sang-woo, Park Woomyeong
- Reconsidering The Past: Optimizing Hidden States In Language Models Yoshida Davis, Gimpel Kevin
- Calibrate Before Use: Improving Few-shot Performance Of Language Models Zhao Tony Z., Wallace Eric, Feng Shi, Klein Dan, Singh Sameer
- Fewnlu: Benchmarking State-of-the-art Methods For Few-shot Natural Language Understanding Zheng Yanan, Zhou Jing, Qian Yujie, Ding Ming, Liao Chonghua, Li Jian, Salakhutdinov Ruslan, Tang Jie, Ruder Sebastian, Yang Zhilin
- Palm: Scaling Language Modeling With Pathways Aakanksha Chowdhery, Sharan Narang, Jacob Devlin, Maarten Bosma, Gaurav Mishra, Adam Roberts, Paul Barham, Hyung Won Chung, Charles Sutton, Sebastian Gehrmann, Parker Schuh, Kensen Shi, Sasha Tsvyashchenko, Joshua Maynez, Abhishek Rao, Parker Barnes, Yi Tay, Noam Shazeer, Vinodkumar Prabhakaran, Emily Reif, Nan Du, Ben Hutchinson, Reiner Pope, James Bradbury, Jacob Austin, Michael Isard, Guy Gur-ari, Pengcheng Yin, Toju Duke, Anselm Levskaya, Sanjay Ghemawat, Sunipa Dev, Henryk Michalewski, Xavier Garcia, Vedant Misra, Kevin Robinson, Liam Fedus, Denny Zhou, Daphne Ippolito, David Luan, Hyeontaek Lim, Barret Zoph, Alexander Spiridonov, Ryan Sepassi, David Dohan, Shivani Agrawal, Mark Omernick, Andrew M. Dai, Thanumalayan Sankaranarayana Pillai, Marie Pellat, Aitor Lewkowycz, Erica Moreira, Rewon Child, Oleksandr Polozov, Katherine Lee, Zongwei Zhou, Xuezhi Wang, Brennan Saeta, Mark Diaz, Orhan Firat, Michele Catasta, Jason Wei, Kathy Meier-hellstern, Douglas Eck, Jeff Dean, Slav Petrov, Noah Fiedel
- Few-shot Training Llms For Project-specific Code-summarization Ahmed Toufique, Devanbu Premkumar
- On The Calibration Of Massively Multilingual Language Models Ahuja Kabir, Sitaram Sunayana, Dandapat Sandipan, Choudhury Monojit
- Text And Patterns: For Effective Chain Of Thought, It Takes Two To Tango Aman Madaan, Amir Yazdanbakhsh
- Language Models Of Code Are Few-shot Commonsense Learners Aman Madaan, Shuyan Zhou, Uri Alon, Yiming Yang, Graham Neubig
- Query Refinement Prompts For Closed-book Long-form Question Answering Amplayo Reinald Kim, Webster Kellie, Collins Michael, Das Dipanjan, Narayan Shashi
- Can Language Models Learn From Explanations In Context? Andrew K. Lampinen, Ishita Dasgupta, Stephanie C. Y. Chan, Kory Matthewson, Michael Henry Tessler, Antonia Creswell, James L. Mcclelland, Jane X. Wang, Felix Hill
- Selection-inference: Exploiting Large Language Models For Interpretable Logical Reasoning Antonia Creswell, Murray Shanahan, Irina Higgins
- ATTEMPT: Parameter-efficient Multi-task Tuning Via Attentional Mixtures Of Soft Prompts Asai Akari, Salehi Mohammadreza, Peters Matthew E., Hajishirzi Hannaneh
- Bootstrapping Multilingual Semantic Parsers Using Large Language Models Awasthi Abhijeet, Gupta Nitish, Samanta Bidisha, Dave Shachi, Sarawagi Sunita, Talukdar Partha
- Enabling Classifiers To Make Judgements Explicitly Aligned With Human Values Bang Yejin, Yu Tiezheng, Madotto Andrea, Lin Zhaojiang, Diab Mona, Fung Pascale
- Prompting Language Models For Linguistic Structure Blevins Terra, Gonen Hila, Zettlemoyer Luke
- Inpars: Data Augmentation For Information Retrieval Using Large Language Models Bonifacio Luiz, Abonizio Hugo, Fadaee Marzieh, Nogueira Rodrigo
- Large Language Models Can Implement Policy Iteration Brooks Ethan, Walls Logan, Lewis Richard L., Singh Satinder
- Improving In-context Few-shot Learning Via Self-supervised Training Chen Mingda, Du Jingfei, Pasunuru Ramakanth, Mihaylov Todor, Iyer Srini, Stoyanov Veselin, Kozareva Zornitsa
- Adaprompt: Adaptive Model Training For Prompt-based NLP Chen Yulong, Liu Yang, Dong Li, Wang Shuohang, Zhu Chenguang, Zeng Michael, Zhang Yue
- Prompting GPT-3 To Be Reliable Chenglei Si, Zhe Gan, Zhengyuan Yang, Shuohang Wang, Jianfeng Wang, Jordan Boyd-graber, Lijuan Wang
- Scaling Instruction-finetuned Language Models Chung Hyung Won, Hou Le, Longpre Shayne, Zoph Barret, Tay Yi, Fedus William, Li Yunxuan, Wang Xuezhi, Dehghani Mostafa, Brahma Siddhartha, Webson Albert, Gu Shixiang Shane, Dai Zhuyun, Suzgun Mirac, Chen Xinyun, Chowdhery Aakanksha, Castro-ros Alex, Pellat Marie, Robinson Kevin, Valter Dasha, Narang Sharan, Mishra Gaurav, Yu Adams, Zhao Vincent, Huang Yanping, Dai Andrew, Yu Hongkun, Petrov Slav, Chi Ed H., Dean Jeff, Devlin Jacob, Roberts Adam, Zhou Denny, Le Quoc V., Wei Jason
- Promptagator: Few-shot Dense Retrieval From 8 Examples Dai Zhuyun, Zhao Vincent Y., Ma Ji, Luan Yi, Ni Jianmo, Lu Jing, Bakalov Anton, Guu Kelvin, Hall Keith B., Chang Ming-wei
- Language Model Cascades David Dohan, Winnie Xu, Aitor Lewkowycz, Jacob Austin, David Bieber, Raphael Gontijo Lopes, Yuhuai Wu, Henryk Michalewski, Rif A. Saurous, Jascha Sohl-dickstein, Kevin Murphy, Charles Sutton
- Rlprompt: Optimizing Discrete Text Prompts With Reinforcement Learning Deng Mingkai, Wang Jianyu, Hsieh Cheng-ping, Wang Yihan, Guo Han, Shu Tianmin, Song Meng, Xing Eric P., Hu Zhiting
- Is GPT-3 A Good Data Annotator? Ding Bosheng, Qin Chengwei, Liu Linlin, Chia Yew Ken, Joty Shafiq, Li Boyang, Bing Lidong
- Star: Bootstrapping Reasoning With Reasoning Eric Zelikman, Yuhuai Wu, Jesse Mu, Noah D. Goodman
- Hungry Hungry Hippos: Towards Language Modeling With State Space Models Fu Daniel Y., Dao Tri, Saab Khaled K., Thomas Armin W., Rudra Atri, Ré Christopher
- WAVPROMPT: Towards Few-shot Spoken Language Understanding With Frozen Language Models Gao Heting, Ni Junrui, Qian Kaizhi, Zhang Yang, Chang Shiyu, Hasegawa-johnson Mark
- "john Is 50 Years Old, Can His Son Be 65?" Evaluating NLP Models' Understanding Of Feasibility Gupta Himanshu, Varshney Neeraj, Mishra Swaroop, Pal Kuntal Kumar, Sawant Saurabh Arjun, Scaria Kevin, Goyal Siddharth, Baral Chitta
- Thinking About GPT-3 In-context Learning For Biomedical IE? Think Again Gutiérrez Bernal Jiménez, Mcneal Nikolas, Washington Clay, Chen You, Li Lang, Sun Huan, Su Yu
- Z-code++: A Pre-trained Language Model Optimized For Abstractive Summarization He Pengcheng, Peng Baolin, Lu Liyang, Wang Song, Mei Jie, Liu Yang, Xu Ruochen, Awadalla Hany Hassan, Shi Yu, Zhu Chenguang, Xiong Wayne, Zeng Michael, Gao Jianfeng, Huang Xuedong
- SPACE-3: Unified Dialog Model Pre-training For Task-oriented Dialog Understanding And Generation He Wanwei, Dai Yinpei, Yang Min, Sun Jian, Huang Fei, Si Luo, Li Yongbin
- Shapley Head Pruning: Identifying And Removing Interference In Multilingual Transformers Held William, Yang Diyi
- Demystifying Prompts In Language Models Via Perplexity Estimation Hila Gonen, Srini Iyer, Terra Blevins, Noah A. Smith, Luke Zettlemoyer
- Welm: A Well-read Pre-trained Language Model For Chinese Hui Su, Xiao Zhou, Houjin Yu, Xiaoyu Shen, Yuwen Chen, Zilin Zhu, Yang Yu, Jie Zhou
- From Human Days To Machine Seconds: Automatically Answering And Generating Machine Learning Final Exams Iddo Drori, Sarah J. Zhang, Reece Shuttleworth, Sarah Zhang, Keith Tyser, Zad Chin, Pedro Lantigua, Saisamrit Surbehera, Gregory Hunter, Derek Austin, Leonard Tang, Yann Hicke, Sage Simhon, Sathwik Karnik, Darnell Granberry, Madeleine Udell
- HINT: Hypernetwork Instruction Tuning For Efficient Zero- & Few-shot Generalisation Ivison Hamish, Bhagia Akshita, Wang Yizhong, Hajishirzi Hannaneh, Peters Matthew
- Can Large Language Models Truly Understand Prompts? A Case Study With Negated Prompts Jang Joel, Ye Seonghyeon, Seo Minjoon
- Contrastive Learning For Prompt-based Few-shot Language Learners Jian Yiren, Gao Chongyang, Vosoughi Soroush
- From Images To Textual Prompts: Zero-shot VQA With Frozen Large Language Models Jiaxian Guo, Junnan Li, Dongxu Li, Anthony Meng Huat Tiong, Boyang Li, Dacheng Tao, Steven C. H. Hoi
- What Makes Data-to-text Generation Hard For Pretrained Language Models? Keymanesh Moniba, Benton Adrian, Dredze Mark
- Decomposed Prompting: A Modular Approach For Solving Complex Tasks Khot Tushar, Trivedi Harsh, Finlayson Matthew, Fu Yao, Richardson Kyle, Clark Peter, Sabharwal Ashish
- Few-shot Question Generation For Personalized Feedback In Intelligent Tutoring Systems Kulshreshtha Devang, Shayan Muhammad, Belfer Robert, Reddy Siva, Serban Iulian Vlad, Kochmar Ekaterina
- Co-training Improves Prompt-based Learning For Large Language Models Lang Hunter, Agrawal Monica, Kim Yoon, Sontag David
- Internet-augmented Language Models Through Few-shot Prompting For Open-domain Question Answering Lazaridou Angeliki, Gribovskaya Elena, Stokowiec Wojciech, Grigorev Nikolai
- LAVENDER: Unifying Video-language Understanding As Masked Language Modeling Li Linjie, Gan Zhe, Lin Kevin, Lin Chung-ching, Liu Zicheng, Liu Ce, Wang Lijuan
- Deplot: One-shot Visual Language Reasoning By Plot-to-table Translation Liu Fangyu, Eisenschlos Julian Martin, Piccinno Francesco, Krichene Syrine, Pang Chenxi, Lee Kenton, Joshi Mandar, Chen Wenhu, Collier Nigel, Altun Yasemin
- Towards Better Few-shot And Finetuning Performance With Forgetful Causal Language Models Liu Hao, Geng Xinyang, Lee Lisa, Mordatch Igor, Levine Sergey, Narang Sharan, Abbeel Pieter
- Fl-tuning: Layer Tuning For Feed-forward Network In Transformer Liu Jingping, Song Yuqiu, Xue Kui, Sun Hongli, Wang Chao, Chen Lihan, Jiang Haiyun, Liang Jiaqing, Ruan Tong
- PSP: Pre-trained Soft Prompts For Few-shot Abstractive Summarization Liu Xiaochen, Gao Yang, Bai Yu, Li Jiawei, Hu Yinan, Huang Heyan, Chen Boxing
- Declaration-based Prompt Tuning For Visual Question Answering Liu Yuhang, Wei Wei, Peng Daowan, Zhu Feida
- Can Large Language Models Reason About Medical Questions? Liévin Valentin, Hother Christoffer Egeberg, Motzfeldt Andreas Geert, Winther Ole
- Learn To Explain: Multimodal Reasoning Via Thought Chains For Science Question Answering Lu Pan, Mishra Swaroop, Xia Tony, Qiu Liang, Chang Kai-wei, Zhu Song-chun, Tafjord Oyvind, Clark Peter, Kalyan Ashwin
- Imagination-augmented Natural Language Understanding Lu Yujie, Zhu Wanrong, Wang Xin Eric, Eckstein Miguel, Wang William Yang
- PAL: Program-aided Language Models Luyu Gao, Aman Madaan, Shuyan Zhou, Uri Alon, Pengfei Liu, Yiming Yang, Jamie Callan, Graham Neubig
- PERFECT: Prompt-free And Efficient Few-shot Learning With Language Models Mahabadi Rabeeh Karimi, Zettlemoyer Luke, Henderson James, Saeidi Marzieh, Mathias Lambert, Stoyanov Veselin, Yazdani Majid
- Grounding Language With Visual Affordances Over Unstructured Data Mees Oier, Borja-diaz Jessica, Burgard Wolfram
- Generating Training Data With Language Models: Towards Zero-shot Language Understanding Meng Yu, Huang Jiaxin, Zhang Yu, Han Jiawei
- Tuning Language Models As Training Data Generators For Augmentation-enhanced Few-shot Learning Meng Yu, Michalski Martin, Huang Jiaxin, Zhang Yu, Abdelzaher Tarek, Han Jiawei
- The Devil Is In The Details: On Models And Training Regimes For Few-shot Intent Classification Mesgar Mohsen, Tran Thy Thy, Glavas Goran, Gurevych Iryna
- JASMINE: Arabic GPT Models For Few-shot Learning Nagoudi El Moatez Billah, Abdul-mageed Muhammad, Elmadany Abdelrahim, Inciarte Alcides Alcoba, Khondaker Md Tawkat Islam
- Thinksum: Probabilistic Reasoning Over Sets Using Large Language Models Ozturkler Batu, Malkin Nikolay, Wang Zhen, Jojic Nebojsa
- Dynamic Prompt Learning Via Policy Gradient For Semi-structured Mathematical Reasoning Pan Lu, Liang Qiu, Kai-wei Chang, Ying Nian Wu, Song-chun Zhu, Tanmay Rajpurohit, Peter Clark, Ashwin Kalyan
- Bidirectional Language Models Are Also Few-shot Learners Patel Ajay, Li Bryan, Rasooli Mohammad Sadegh, Constant Noah, Raffel Colin, Callison-burch Chris
- Visconde: Multi-document QA With GPT-3 And Neural Reranking Pereira Jayr, Fidalgo Robson, Lotufo Roberto, Nogueira Rodrigo
- Hypertuning: Toward Adapting Large Language Models Without Back-propagation Phang Jason, Mao Yi, He Pengcheng, Chen Weizhu
- Synchromesh: Reliable Code Generation From Pre-trained Language Models Poesia Gabriel, Polozov Oleksandr, Le Vu, Tiwari Ashish, Soares Gustavo, Meek Christopher, Gulwani Sumit
- Super-prompting: Utilizing Model-independent Contextual Data To Reduce Data Annotation Required In Visual Commonsense Tasks Rezaei Navid, Reformat Marek Z.
- Leveraging Large Language Models For Multiple Choice Question Answering Robinson Joshua, Rytting Christopher Michael, Wingate David
- Billions Of Parameters Are Worth More Than In-domain Training Data: A Case Study In The Legal Case Entailment Task Rosa Guilherme Moraes, Bonifacio Luiz, Jeronymo Vitor, Abonizio Hugo, Lotufo Roberto, Nogueira Rodrigo
- Few-shot Visual Question Generation: A Novel Task And Benchmark Datasets Roy Anurag, Ekka David Johnson, Ghosh Saptarshi, Das Abir
- Data Augmentation For Intent Classification With Off-the-shelf Large Language Models Sahu Gaurav, Rodriguez Pau, Laradji Issam H., Atighehchian Parmida, Vazquez David, Bahdanau Dzmitry
- Alexatm 20B: Few-shot Learning Using A Large-scale Multilingual Seq2seq Model Saleh Soltan, Shankar Ananthakrishnan, Jack Fitzgerald, Rahul Gupta, Wael Hamza, Haidar Khan, Charith Peris, Stephen Rawls, Andy Rosenbaum, Anna Rumshisky, Chandana Satya Prakash, Mukund Sridhar, Fabian Triefenbach, Apurv Verma, Gokhan Tur, Prem Natarajan
- On The Effect Of Pretraining Corpora On In-context Learning By A Large-scale Language Model Seongjin Shin, Sang-woo Lee, Hwijeen Ahn, Sungdong Kim, Hyoungseok Kim, Boseop Kim, Kyunghyun Cho, Gichang Lee, Woomyoung Park, Jung-woo Ha, Nako Sung
- Using Deepspeed And Megatron To Train Megatron-turing NLG 530B, A Large-scale Generative Language Model Shaden Smith, Mostofa Patwary, Brandon Norick, Patrick Legresley, Samyam Rajbhandari, Jared Casper, Zhun Liu, Shrimai Prabhumoye, George Zerveas, Vijay Korthikanti, Elton Zhang, Rewon Child, Reza Yazdani Aminabadi, Julie Bernauer, Xia Song, Mohammad Shoeybi, Yuxiong He, Michael Houston, Saurabh Tiwary, Bryan Catanzaro
- Mgpt: Few-shot Learners Go Multilingual Shliazhko Oleh, Fenogenova Alena, Tikhonova Maria, Mikhailov Vladislav, Kozlova Anastasia, Shavrina Tatiana
- Gpt-neox-20b: An Open-source Autoregressive Language Model Sid Black, Stella Biderman, Eric Hallahan, Quentin Anthony, Leo Gao, Laurence Golding, Horace He, Connor Leahy, Kyle Mcdonell, Jason Phang, Michael Pieler, Usvsn Sai Prashanth, Shivanshu Purohit, Laria Reynolds, Jonathan Tow, Ben Wang, Samuel Weinbach
- Ask Me Anything: A Simple Strategy For Prompting Language Models Simran Arora, Avanika Narayan, Mayee F. Chen, Laurel Orr, Neel Guha, Kush Bhatia, Ines Chami, Frederic Sala, Christopher Ré
- Llm-planner: Few-shot Grounded Planning For Embodied Agents With Large Language Models Song Chan Hee, Wu Jiaman, Washington Clayton, Sadler Brian M., Chao Wei-lun, Su Yu
- CLIP Models Are Few-shot Learners: Empirical Studies On VQA And Visual Entailment Song Haoyu, Dong Li, Zhang Wei-nan, Liu Ting, Wei Furu
- OPT-IML: Scaling Language Model Instruction Meta Learning Through The Lens Of Generalization Srinivasan Iyer, Xi Victoria Lin, Ramakanth Pasunuru, Todor Mihaylov, Daniel Simig, Ping Yu, Kurt Shuster, Tianlu Wang, Qing Liu, Punit Singh Koura, Xian Li, Brian O'horo, Gabriel Pereyra, Jeff Wang, Christopher Dewan, Asli Celikyilmaz, Luke Zettlemoyer, Ves Stoyanov
- Towards Zero-shot And Few-shot Table Question Answering Using GPT-3 Srivastava Pragya, Ganu Tanuja, Guha Saikat
- Dynamar: Dynamic Prompt With Mask Token Representation Sun Xiaodi, Rajagopalan Sunny, Nigam Priyanka, Lu Weiyi, Xu Yi, Zeng Belinda, Chilimbi Trishul
- OPT: Open Pre-trained Transformer Language Models Susan Zhang, Stephen Roller, Naman Goyal, Mikel Artetxe, Moya Chen, Shuohui Chen, Christopher Dewan, Mona Diab, Xian Li, Xi Victoria Lin, Todor Mihaylov, Myle Ott, Sam Shleifer, Kurt Shuster, Daniel Simig, Punit Singh Koura, Anjali Sridhar, Tianlu Wang, Luke Zettlemoyer
- Large Language Models Are Zero-shot Reasoners Takeshi Kojima, Shixiang Shane Gu, Machel Reid, Yutaka Matsuo, Yusuke Iwasawa
- TAPE: Assessing Few-shot Russian Language Understanding Taktasheva Ekaterina, Shavrina Tatiana, Fenogenova Alena, Shevelev Denis, Katricheva Nadezhda, Tikhonova Maria, Akhmetgareeva Albina, Zinkevich Oleg, Bashmakova Anastasiia, Iordanskaia Svetlana, Spiridonova Alena, Kurenshchikova Valentina, Artemova Ekaterina, Mikhailov Vladislav
- Commonsenseqa 2.0: Exposing The Limits Of AI Through Gamification Talmor Alon, Yoran Ori, Bras Ronan Le, Bhagavatula Chandra, Goldberg Yoav, Choi Yejin, Berant Jonathan
- Efficient Few-shot Learning Without Prompts Tunstall Lewis, Reimers Nils, Jo Unso Eun Seo, Bates Luke, Korat Daniel, Wasserblat Moshe, Pereg Oren
- Prompting Palm For Translation: Assessing Strategies And Performance Vilar David, Freitag Markus, Cherry Colin, Luo Jiaming, Ratnakar Viresh, Foster George
- Code4struct: Code Generation For Few-shot Event Structure Prediction Wang Xingyao, Li Sha, Ji Heng
- Adamix: Mixture-of-adaptations For Parameter-efficient Model Tuning Wang Yaqing, Agarwal Sahaj, Mukherjee Subhabrata, Liu Xiaodong, Gao Jing, Awadallah Ahmed Hassan, Gao Jianfeng
- Language Models With Image Descriptors Are Strong Few-shot Video-language Learners Wang Zhenhailong, Li Manling, Xu Ruochen, Zhou Luowei, Lei Jie, Lin Xudong, Wang Shuohang, Yang Ziyi, Zhu Chenguang, Hoiem Derek, Chang Shih-fu, Bansal Mohit, Ji Heng
- Large Language Models Are Few(1)-shot Table Reasoners Wenhu Chen
- Program Of Thoughts Prompting: Disentangling Computation From Reasoning For Numerical Reasoning Tasks Wenhu Chen, Xueguang Ma, Xinyi Wang, William W. Cohen
- The Unreliability Of Explanations In Few-shot Prompting For Textual Reasoning Xi Ye, Greg Durrett
- Rationale-augmented Ensembles In Language Models Xuezhi Wang, Jason Wei, Dale Schuurmans, Quoc Le, Ed Chi, Denny Zhou
- Zero-shot Video Question Answering Via Frozen Bidirectional Language Models Yang Antoine, Miech Antoine, Sivic Josef, Laptev Ivan, Schmid Cordelia
- Language Models Are General-purpose Interfaces Yaru Hao, Haoyu Song, Li Dong, Shaohan Huang, Zewen Chi, Wenhui Wang, Shuming Ma, Furu Wei
- Structured Prompting: Scaling In-context Learning To 1,000 Examples Yaru Hao, Yutao Sun, Li Dong, Zhixiong Han, Yuxian Gu, Furu Wei
- Transcending Scaling Laws With 0.1% Extra Compute Yi Tay, Jason Wei, Hyung Won Chung, Vinh Q. Tran, David R. So, Siamak Shakeri, Xavier Garcia, Huaixiu Steven Zheng, Jinfeng Rao, Aakanksha Chowdhery, Denny Zhou, Donald Metzler, Slav Petrov, Neil Houlsby, Quoc V. Le, Mostafa Dehghani
- Making Large Language Models Better Reasoners With Step-aware Verifier Yifei Li, Zeqi Lin, Shizhuo Zhang, Qiang Fu, Bei Chen, Jian-guang Lou, Weizhu Chen
- Attend, Memorize And Generate: Towards Faithful Table-to-text Generation In Few Shots Zhao Wenting, Liu Ye, Wan Yao, Yu Philip S.
- Self-adaptive In-context Learning: An Information Compression Perspective For In-context Example Selection And Ordering Zhiyong Wu, Yaoxiang Wang, Jiacheng Ye, Lingpeng Kong
- Reflection Of Thought: Inversely Eliciting Numerical Reasoning In Language Models Via Solving Linear Systems Zhou Fan, Dong Haoyu, Liu Qian, Cheng Zhoujun, Han Shi, Zhang Dongmei
- Large Language Models Are Human-level Prompt Engineers Zhou Yongchao, Muresanu Andrei Ioan, Han Ziwen, Paster Keiran, Pitis Silviu, Chan Harris, Ba Jimmy
- KPT: Keyword-guided Pre-training For Grounded Dialog Generation Zhu Qi, Mi Fei, Zhang Zheng, Wang Yasheng, Li Yitong, Jiang Xin, Liu Qun, Zhu Xiaoyan, Huang Minlie
- LM-CPPF: Paraphrasing-guided Data Augmentation For Contrastive Prompt-based Few-shot Fine-tuning Abaskohi Amirhossein, Rothe Sascha, Yaghoobzadeh Yadollah
- On Codex Prompt Engineering For OCL Generation: An Empirical Study Abukhalaf Seif, Hamdaqa Mohammad, Khomh Foutse
- Multi-party Goal Tracking With Llms: Comparing Pre-training, Fine-tuning, And Prompt Engineering Addlesee Angus, Sieińska Weronika, Gunson Nancie, Garcia Daniel Hernández, Dondrup Christian, Lemon Oliver
- The ADAIO System At The BEA-2023 Shared Task On Generating AI Teacher Responses In Educational Dialogues Adigwe Adaeze, Yuan Zheng
- Can Large Language Models Be Good Path Planners? A Benchmark And Investigation On Spatial-temporal Reasoning Aghzal Mohamed, Plaku Erion, Yao Ziyu
- Better Patching Using LLM Prompting, Via Self-consistency Ahmed Toufique, Devanbu Premkumar
- Steering Large Language Models For Machine Translation With Finetuning And In-context Learning Alves Duarte M., Guerreiro Nuno M., Alves João, Pombal José, Rei Ricardo, De Souza José G. C., Colombo Pierre, Martins André F. T.
- BUFFET: Benchmarking Large Language Models For Few-shot Cross-lingual Transfer Asai Akari, Kudugunta Sneha, Yu Xinyan Velocity, Blevins Terra, Gonen Hila, Reid Machel, Tsvetkov Yulia, Ruder Sebastian, Hajishirzi Hannaneh
- Constituency Parsing Using Llms Bai Xuefeng, Wu Jialong, Chen Yulong, Wang Zhongqing, Zhang Yue
- Longbench: A Bilingual, Multitask Benchmark For Long Context Understanding Bai Yushi, Lv Xin, Zhang Jiajie, Lyu Hongchang, Tang Jiankai, Huang Zhidian, Du Zhengxiao, Liu Xiao, Zeng Aohan, Hou Lei, Dong Yuxiao, Tang Jie, Li Juanzi
- A Bi-step Grounding Paradigm For Large Language Models In Recommendation Systems Bao Keqin, Zhang Jizhi, Wang Wenjie, Zhang Yang, Yang Zhengyi, Luo Yancheng, Chen Chong, Feng Fuli, Tian Qi
- Investigating The Translation Performance Of A Large Multilingual Language Model: The Case Of BLOOM Bawden Rachel, Yvon François
- Large Language Models In The Workplace: A Case Study On Prompt Engineering For Job Type Classification Benjamin Clavié, Alexandru Ciceu, Frederick Naylor, Guillaume Soulié, Thomas Brightwell
- Distractor Generation For Multiple-choice Questions With Predictive Prompting And Large Language Models Bitew Semere Kiros, Deleu Johannes, Develder Chris, Demeester Thomas
- Extractgpt: Exploring The Potential Of Large Language Models For Product Attribute Value Extraction Brinkmann Alexander, Shraga Roee, Bizer Christian
- Leveraging Large Language Models For Scalable Vector Graphics-driven Image Understanding Cai Mu, Huang Zeyi, Li Yuheng, Ojha Utkarsh, Wang Haohan, Lee Yong Jae
- CLAP: Isolating Content From Style Through Contrastive Learning With Augmented Prompts Cai Yichao, Liu Yuhang, Zhang Zhen, Shi Javen Qinfeng
- Two Failures Of Self-consistency In The Multi-step Reasoning Of Llms Chen Angelica, Phang Jason, Parrish Alicia, Padmakumar Vishakh, Zhao Chen, Bowman Samuel R., Cho Kyunghyun
- Fireact: Toward Language Agent Fine-tuning Chen Baian, Shu Chang, Shareghi Ehsan, Collier Nigel, Narasimhan Karthik, Yao Shunyu
- Large Language Models Are Visual Reasoning Coordinators Chen Liangyu, Li Bo, Shen Sheng, Yang Jingkang, Li Chunyuan, Keutzer Kurt, Darrell Trevor, Liu Ziwei
- Introspective Tips: Large Language Model For In-context Decision Making Chen Liting, Wang Lu, Dong Hang, Du Yali, Yan Jie, Yang Fangkai, Li Shuang, Zhao Pu, Qin Si, Rajmohan Saravan, Lin Qingwei, Zhang Dongmei
- Few-shot Data Synthesis For Open Domain Multi-hop Question Answering Chen Mingda, Chen Xilun, Yih Wen-tau
- Breaking Language Barriers In Multilingual Mathematical Reasoning: Insights And Observations Chen Nuo, Zheng Zinan, Wu Ning, Gong Ming, Song Yangqiu, Zhang Dongmei, Li Jia
- Teaching Large Language Models To Self-debug Chen Xinyun, Lin Maxwell, Schärli Nathanael, Zhou Denny
- Minprompt: Graph-based Minimal Prompt Data Augmentation For Few-shot Question Answering Chen Xiusi, Jiang Jyun-yu, Chang Wei-cheng, Hsieh Cho-jui, Yu Hsiang-fu, Wang Wei
- GENOME: Generative Neuro-symbolic Visual Reasoning By Growing And Reusing Modules Chen Zhenfang, Sun Rui, Liu Wenjun, Hong Yining, Gan Chuang
- See, Think, Confirm: Interactive Prompting Between Vision And Language Models For Knowledge-based Visual Reasoning Chen Zhenfang, Zhou Qinhong, Shen Yikang, Hong Yining, Zhang Hao, Gan Chuang
- Lift Yourself Up: Retrieval-augmented Text Generation With Self Memory Cheng Xin, Luo Di, Chen Xiuying, Liu Lemao, Zhao Dongyan, Yan Rui
- SCALE: Synergized Collaboration Of Asymmetric Language Translation Engines Cheng Xin, Wang Xun, Ge Tao, Chen Si-qing, Wei Furu, Zhao Dongyan, Yan Rui
- Batch Prompting: Efficient Inference With Large Language Model Apis Cheng Zhoujun, Kasai Jungo, Yu Tao
- Apollo: Unified Adapter And Prompt Learning For Vision Language Models Chowdhury Sanjoy, Nag Sayan, Manocha Dinesh
- Large Language Models For User Interest Journeys Christakopoulou Konstantina, Lalama Alberto, Adams Cj, Qu Iris, Amir Yifat, Chucri Samer, Vollucci Pierce, Soldo Fabio, Bseiso Dina, Scodel Sarah, Dixon Lucas, Chi Ed H., Chen Minmin
- Tutorials On Stance Detection Using Pre-trained Language Models: Fine-tuning BERT And Prompting Large Language Models Chuang Yun-shiuan
- Increasing Diversity While Maintaining Accuracy: Text Data Generation With Large Language Models And Human Interventions Chung John Joon Young, Kamar Ece, Amershi Saleema
- Auggpt: Leveraging Chatgpt For Text Data Augmentation Dai Haixing, Liu Zhengliang, Liao Wenxiong, Huang Xiaoke, Cao Yihan, Wu Zihao, Zhao Lin, Xu Shaochen, Liu Wei, Liu Ninghao, Li Sheng, Zhu Dajiang, Cai Hongmin, Sun Lichao, Li Quanzheng, Shen Dinggang, Liu Tianming, Li Xiang
- Llmebench: A Flexible Framework For Accelerating Llms Benchmarking Dalvi Fahim, Hasanain Maram, Boughorbel Sabri, Mousi Basel, Abdaljalil Samir, Nazar Nizi, Abdelali Ahmed, Chowdhury Shammur Absar, Mubarak Hamdy, Ali Ahmed, Hawasly Majd, Durrani Nadir, Alam Firoj
- GPT-3 Models Are Few-shot Financial Reasoners De Padua Raul Salles, Qureshi Imran, Karakaplan Mustafa U.
- Im-promptu: In-context Composition From Image Prompts Dedhia Bhishma, Chang Michael, Snell Jake C., Griffiths Thomas L., Jha Niraj K.
- Mixture Of Soft Prompts For Controllable Data Generation Derek Chen, Celine Lee, Yunan Lu, Domenic Rosati, Zhou Yu
- Evaluating Chatgpt As A Recommender System: A Rigorous Approach Di Palma Dario, Biancofiore Giovanni Maria, Anelli Vito Walter, Narducci Fedelucio, Di Noia Tommaso, Di Sciascio Eugenio
- Using Large Language Model To Solve And Explain Physics Word Problems Approaching Human Level Ding Jingzhe, Cen Yan, Wei Xinyuan
- Do Llms Work On Charts? Designing Few-shot Prompts For Chart Question Answering And Summarization Do Xuan Long, Hassanpour Mohammad, Masry Ahmed, Kavehzadeh Parsa, Hoque Enamul, Joty Shafiq
- Improving Factuality And Reasoning In Language Models Through Multiagent Debate Du Yilun, Li Shuang, Torralba Antonio, Tenenbaum Joshua B., Mordatch Igor
- Extending Context Window Of Large Language Models Via Semantic Compression Fei Weizhi, Niu Xueyan, Zhou Pingyi, Hou Lu, Bai Bo, Deng Lei, Han Wei
- Language Models Can Be Logical Solvers Feng Jiazhan, Xu Ruochen, Hao Junheng, Sharma Hiteshi, Shen Yelong, Zhao Dongyan, Chen Weizhu
- Towards Llm-driven Dialogue State Tracking Feng Yujie, Lu Zexin, Liu Bo, Zhan Liming, Wu Xiao-ming
- Sentence Simplification Via Large Language Models Feng Yutao, Qiang Jipeng, Li Yun, Yuan Yunhao, Zhu Yi
- Specializing Smaller Language Models Towards Multi-step Reasoning Fu Yao, Peng Hao, Ou Litu, Sabharwal Ashish, Khot Tushar
- Strategyllm: Large Language Models As Strategy Generators, Executors, Optimizers, And Evaluators For Problem Solving Gao Chang, Jiang Haiyun, Cai Deng, Shi Shuming, Lam Wai
- Self-explanation Prompting Improves Dialogue Understanding In Large Language Models Gao Haoyu, Lin Ting-en, Li Hangyu, Yang Min, Wu Yuchuan, Ma Wentao, Li Yongbin
- How To Design Translation Prompts For Chatgpt: An Empirical Study Gao Yuan, Wang Ruili, Hou Feng
- UPAR: A Kantian-inspired Prompting Framework For Enhancing Large Language Model Capabilities Geng Hejia, Xu Boxun, Li Peng
- Text-to-audio Generation Using Instruction-tuned LLM And Latent Diffusion Model Ghosal Deepanway, Majumder Navonil, Mehrish Ambuj, Poria Soujanya
- Wanglab At Mediqa-chat 2023: Clinical Note Generation From Doctor-patient Conversations Using Large Language Models Giorgi John, Toma Augustin, Xie Ronald, Chen Sondra S., An Kevin R., Zheng Grace X., Wang Bo
- What Can Large Language Models Do In Chemistry? A Comprehensive Benchmark On Eight Tasks Guo Taicheng, Guo Kehan, Nan Bozhao, Liang Zhenwen, Guo Zhichun, Chawla Nitesh V., Wiest Olaf, Zhang Xiangliang
- PPTC Benchmark: Evaluating Large Language Models For Powerpoint Task Completion Guo Yiduo, Zhang Zekai, Liang Yaobo, Zhao Dongyan, Duan Nan
- Improving Small Language Models On Pubmedqa Via Generative Data Augmentation Guo Zhen, Wang Peiqi, Wang Yanwei, Yu Shangdi
- Images In Language Space: Exploring The Suitability Of Large Language Models For Vision & Language Tasks Hakimov Sherzod, Schlangen David
- Answering Unseen Questions With Smaller Language Models Using Rationale Generation And Dense Retrieval Hartill Tim, Benavides-prado Diana, Witbrock Michael, Riddle Patricia J.
- Zero- And Few-shot Prompting With Llms: A Comparative Study With Fine-tuned Models For Bangla Sentiment Analysis Hasan Md. Arid, Das Shudipta, Anjum Afiyat, Alam Firoj, Anjum Anika, Sarker Avijit, Noori Sheak Rashed Haider
- Large Language Models Are Biased To Overestimate Profoundness Herrera-berg Eugenio, Browne Tomás Vergara, León-villagrá Pablo, Vives Marc-lluís, Calderon Cristian Buc
- Assessing The Efficacy Of Large Language Models In Generating Accurate Teacher Responses Hicke Yann, Masand Abhishek, Guo Wentao, Gangavarapu Tushaar
- Comparative Analysis Of GPT-4 And Human Graders In Evaluating Praise Given To Students In Synthetic Dialogues Hirunyasiri Dollaya, Thomas Danielle R., Lin Jionghao, Koedinger Kenneth R., Aleven Vincent
- Distilling Step-by-step! Outperforming Larger Language Models With Less Training Data And Smaller Model Sizes Hsieh Cheng-yu, Li Chun-liang, Yeh Chih-kuan, Nakhost Hootan, Fujii Yasuhisa, Ratner Alexander, Krishna Ranjay, Lee Chen-yu, Pfister Tomas
- Context-aware Prompt Tuning For Vision-language Model With Dual-alignment Hu Hongyu, Lin Tiancheng, Wang Jie, Sun Zhenbang, Xu Yi
- Scaled Prompt-tuning For Few-shot Natural Language Generation Hu Ting, Meinel Christoph, Yang Haojin
- Code Prompting: A Neural Symbolic Method For Complex Reasoning In Large Language Models Hu Yi, Yang Haotong, Lin Zhouchen, Zhang Muhan
- POP: Prompt Of Prompts For Continual Learning Hu Zhiyuan, Lyu Jiancheng, Gao Dashan, Vasconcelos Nuno
- Deploying And Evaluating Llms To Program Service Mobile Robots Hu Zichao, Lucchetti Francesca, Schlesinger Claire, Saxena Yash, Freeman Anders, Modak Sadanand, Guha Arjun, Biswas Joydeep
- Lorahub: Efficient Cross-task Generalization Via Dynamic Lora Composition Huang Chengsong, Liu Qian, Lin Bill Yuchen, Pang Tianyu, Du Chao, Lin Min
- Fewer Is More: Boosting LLM Reasoning With Reinforced Context Pruning Huang Xijie, Zhang Li Lyna, Cheng Kwang-ting, Yang Fan, Yang Mao
- Llama Guard: Llm-based Input-output Safeguard For Human-ai Conversations Inan Hakan, Upasani Kartikeya, Chi Jianfeng, Rungta Rashi, Iyer Krithika, Mao Yuning, Tontchev Michael, Hu Qing, Fuller Brian, Testuggine Davide, Khabsa Madian
- Query Expansion By Prompting Large Language Models Jagerman Rolf, Zhuang Honglei, Qin Zhen, Wang Xuanhui, Bendersky Michael
- Consistency Analysis Of Chatgpt Jang Myeongjun Erik, Lukasiewicz Thomas
- Inpars-v2: Large Language Models As Efficient Dataset Generators For Information Retrieval Jeronymo Vitor, Bonifacio Luiz, Abonizio Hugo, Fadaee Marzieh, Lotufo Roberto, Zavrel Jakub, Nogueira Rodrigo
- Natural Language Decomposition And Interpretation Of Complex Utterances Jhamtani Harsh, Fang Hao, Xia Patrick, Levy Eran, Andreas Jacob, Van Durme Ben
- Chain Of Thought Prompt Tuning In Vision Language Models Jiaxin Ge, Hongyin Luo, Siyuan Qian, Yulu Gan, Jie Fu, Shanghang Zhang
- A Comprehensive Capability Analysis Of GPT-3 And GPT-3.5 Series Models Junjie Ye, Xuanting Chen, Nuo Xu, Can Zu, Zekai Shao, Shichun Liu, Yuhan Cui, Zeyang Zhou, Chao Gong, Yang Shen, Jie Zhou, Siming Chen, Tao Gui, Qi Zhang, Xuanjing Huang
- Is Chatgpt A Good Recommender? A Preliminary Study Junling Liu, Chao Liu, Peilin Zhou, Renjie Lv, Kang Zhou, Yan Zhang
- Contextual Code Switching For Machine Translation Using Language Models Kaji Arshad, Shah Manan
- Evaluating Open-domain Question Answering In The Era Of Large Language Models Kamalloo Ehsan, Dziri Nouha, Clarke Charles L. A., Rafiei Davood
- Deficiency Of Large Language Models In Finance: An Empirical Examination Of Hallucination Kang Haoqiang, Liu Xiao-yang
- Llm-augmented Preference Learning From Natural Language Kang Inwon, Ruan Sikai, Ho Tyler, Lin Jui-chien, Mohsin Farhad, Seneviratne Oshani, Xia Lirong
- BLESS: Benchmarking Large Language Models On Sentence Simplification Kew Tannon, Chi Alison, Vásquez-rodríguez Laura, Agrawal Sweta, Aumiller Dennis, Alva-manchego Fernando, Shardlow Matthew
- Dspy: Compiling Declarative Language Model Calls Into Self-improving Pipelines Khattab Omar, Singhvi Arnav, Maheshwari Paridhi, Zhang Zhiyuan, Santhanam Keshav, Vardhamanan Sri, Haq Saiful, Sharma Ashutosh, Joshi Thomas T., Moazam Hanna, Miller Heather, Zaharia Matei, Potts Christopher
- Memory-efficient Fine-tuning Of Compressed Large Language Models Via Sub-4-bit Integer Quantization Kim Jeonghoon, Lee Jung Hyun, Kim Sungdong, Park Joonsuk, Yoo Kang Min, Kwon Se Jung, Lee Dongsoo
- LLM4SGG: Large Language Models For Weakly Supervised Scene Graph Generation Kim Kibum, Yoon Kanghoon, Jeon Jaehyeong, In Yeonjun, Moon Jinyoung, Kim Donghyun, Park Chanyoung
- Chatgpt: Jack Of All Trades, Master Of None Kocoń Jan, Cichecki Igor, Kaszyca Oliwier, Kochanek Mateusz, Szydło Dominika, Baran Joanna, Bielaniewicz Julita, Gruza Marcin, Janz Arkadiusz, Kanclerz Kamil, Kocoń Anna, Koptyra Bartłomiej, Mieleszczenko-kowszewicz Wiktoria, Miłkowski Piotr, Oleksy Marcin, Piasecki Maciej, Radliński Łukasz, Wojtasik Konrad, Woźniak Stanisław, Kazienko Przemysław
- Leveraging Few-shot Data Augmentation And Waterfall Prompting For Response Generation Krause Lea, Santamaría Selene Báez, Van Der Meer Michiel, Khurana Urja
- Prompt Generate Train (PGT): Few-shot Domain Adaption Of Retrieval Augmented Generation Models For Open Book Question-answering Krishna C. S.
- Chatbots Are Not Reliable Text Annotators Kristensen-mclachlan Ross Deans, Canavan Miceal, Kardos Márton, Jacobsen Mia, Aarøe Lene
- Lavip: Language-grounded Visual Prompts Kunananthaseelan Nilakshan, Zhang Jing, Harandi Mehrtash
- Reward Design With Language Models Kwon Minae, Xie Sang Michael, Bullard Kalesha, Sadigh Dorsa
- Chatgpt For Arabic Grammatical Error Correction Kwon Sang Yun, Bhatia Gagan, Nagoudi El Moatez Billah, Abdul-mageed Muhammad
- Beyond English: Evaluating Llms For Arabic Grammatical Error Correction Kwon Sang Yun, Bhatia Gagan, Nagoudi El Moatez Billah, Abdul-mageed Muhammad
- A Zero-shot And Few-shot Study Of Instruction-finetuned Large Language Models Applied To Clinical And Biomedical Tasks Labrak Yanis, Rouvier Mickael, Dufour Richard
- Cabrita: Closing The Gap For Foreign Languages Larcher Celio, Piau Marcos, Finardi Paulo, Gengo Pedro, Esposito Piero, Caridá Vinicius
- Read-only Prompt Optimization For Vision-language Few-shot Learning Lee Dongjun, Song Seokwon, Suh Jihee, Choi Joonmyung, Lee Sanghyeok, Kim Hyunwoo J.
- Prompted Llms As Chatbot Modules For Long Open-domain Conversation Lee Gibbeum, Hartmann Volker, Park Jongho, Papailiopoulos Dimitris, Lee Kangwook
- Applying Large Language Models And Chain-of-thought For Automatic Scoring Lee Gyeong-geon, Latif Ehsan, Wu Xuansheng, Liu Ninghao, Zhai Xiaoming
- Teaching Arithmetic To Small Transformers Lee Nayoung, Sreenivasan Kartik, Lee Jason D., Lee Kangwook, Papailiopoulos Dimitris
- Hint Of Thought Prompting: An Explainable And Zero-shot Approach To Reasoning Tasks With Llms Lei Ioktong, Deng Zhidong
- Llm4vis: Explainable Visualization Recommendation Using Chatgpt Lei Wang, Songheng Zhang, Yun Wang, Ee-peng Lim, Yong Wang
- The Language Of Prompting: What Linguistic Properties Make A Prompt Successful? Leidinger Alina, Van Rooij Robert, Shutova Ekaterina
- Task Contamination: Language Models May Not Be Few-shot Anymore Li Changmao, Flanigan Jeffrey
- Dialogue For Prompting: A Policy-gradient-based Discrete Prompt Generation For Few-shot Learning Li Chengzhengxu, Liu Xiaoming, Wang Yichen, Li Duyi, Lan Yu, Shen Chao
- Are Human-generated Demonstrations Necessary For In-context Learning? Li Rui, Wang Guoyin, Li Jiwei
- Autoconv: Automatically Generating Information-seeking Conversations With Large Language Models Li Siheng, Yang Cheng, Yin Yichun, Zhu Xinyu, Cheng Zesen, Shang Lifeng, Jiang Xin, Liu Qun, Yang Yujiu
- On Bilingual Lexicon Induction With Large Language Models Li Yaoyiran, Korhonen Anna, Vulić Ivan
- A Practical Survey On Zero-shot Prompt Design For In-context Learning Li Yinheng
- Flexkbqa: A Flexible Llm-powered Framework For Few-shot Knowledge Base Question Answering Li Zhenyu, Fan Sunqi, Gu Yu, Li Xiuxing, Duan Zhichao, Dong Bowen, Liu Ning, Wang Jianyong
- Label Supervised Llama Finetuning Li Zongxi, Li Xianming, Liu Yuzhang, Xie Haoran, Li Jing, Wang Fu-lee, Li Qing, Zhong Xiaoqin
- Prompting Large Language Models With Chain-of-thought For Few-shot Knowledge Base Question Generation Liang Yuanyuan, Wang Jianing, Zhu Hanlun, Wang Lei, Qian Weining, Lan Yunshi
- Rella: Retrieval-enhanced Large Language Models For Lifelong Sequential Behavior Comprehension In Recommendation Lin Jianghao, Shan Rong, Zhu Chenxu, Du Kounianhua, Chen Bo, Quan Shigang, Tang Ruiming, Yu Yong, Zhang Weinan
- Few-shot Adaptation For Parsing Contextual Utterances With Llms Lin Kevin, Xia Patrick, Fang Hao
- RA-DIT: Retrieval-augmented Dual Instruction Tuning Lin Xi Victoria, Chen Xilun, Chen Mingda, Shi Weijia, Lomeli Maria, James Rich, Rodriguez Pedro, Kahn Jacob, Szilvasy Gergely, Lewis Mike, Zettlemoyer Luke, Yih Scott
- Goat: Fine-tuned Llama Outperforms GPT-4 On Arithmetic Tasks Liu Tiedong, Low Bryan Kian Hsiang
- MMHQA-ICL: Multimodal In-context Learning For Hybrid Question Answering Over Text, Tables And Images Liu Weihao, Lei Fangyu, Luo Tongxu, Lei Jiahe, He Shizhu, Zhao Jun, Liu Kang
- Calibrating Llm-based Evaluator Liu Yuxuan, Yang Tianchi, Huang Shaohan, Zhang Zihan, Huang Haizhen, Wei Furu, Deng Weiwei, Sun Feng, Zhang Qi
- Exploring Effectiveness Of GPT-3 In Grammatical Error Correction: A Study On Performance And Controllability In Prompt-based Methods Loem Mengsay, Kaneko Masahiro, Takase Sho, Okazaki Naoaki
- Data-to-text Generation For Severely Under-resourced Languages With GPT-3.5: A Bit Of Help Needed From Google Translate Lorandi Michela, Belz Anya
- Chain-of-dictionary Prompting Elicits Translation In Large Language Models Lu Hongyuan, Yang Haoran, Huang Haoyang, Zhang Dongdong, Lam Wai, Wei Furu
- Enhancing Black-box Few-shot Text Classification With Prompt-based Data Augmentation Luo Danqing, Zhang Chen, Xu Jiahui, Wang Bin, Chen Yiming, Zhang Yan, Li Haizhou
- Exploring Small Language Models With Prompt-learning Paradigm For Efficient Domain-specific Text Classification Luo Hengyu, Liu Peng, Esping Stefan
- Dr.icl: Demonstration-retrieved In-context Learning Luo Man, Xu Xin, Dai Zhuyun, Pasupat Panupong, Kazemi Mehran, Baral Chitta, Imbrasaite Vaiva, Zhao Vincent Y.
- Hrot: Hybrid Prompt Strategy And Retrieval Of Thought For Table-text Hybrid Question Answering Luo Tongxu, Lei Fangyu, Lei Jiahe, Liu Weihao, He Shizhu, Zhao Jun, Liu Kang
- Fairness-guided Few-shot Prompting For Large Language Models Ma Huan, Zhang Changqing, Bian Yatao, Liu Lemao, Zhang Zhirui, Zhao Peilin, Zhang Shu, Fu Huazhu, Hu Qinghua, Wu Bingzhe
- Let's Do A Thought Experiment: Using Counterfactuals To Improve Moral Reasoning Ma Xiao, Mishra Swaroop, Beirami Ahmad, Beutel Alex, Chen Jilin
- Large Language Model Is Not A Good Few-shot Information Extractor, But A Good Reranker For Hard Samples! Ma Yubo, Cao Yixin, Hong Yongching, Sun Aixin
- Exploring Prompting Large Language Models As Explainable Metrics Mahmoudi Ghazaleh
- Aligning Large Language Models For Clinical Tasks Manathunga Supun, Hettigoda Isuru
- Enhancing CLIP With GPT-4: Harnessing Visual Descriptions As Prompts Maniparambil Mayug, Vorster Chris, Molloy Derek, Murphy Noel, Mcguinness Kevin, O'connor Noel E.
- Do Prompt Positions Really Matter? Mao Junyu, Middleton Stuart E., Niranjan Mahesan
- ICL Markup: Structuring In-context Learning Using Soft-token Tags Marc-etienne Brunet, Ashton Anderson, Richard Zemel
- Benchmarking Large Language Model Capabilities For Conditional Generation Maynez Joshua, Agrawal Priyanka, Gehrmann Sebastian
- PRE: Vision-language Prompt Learning With Reparameterization Encoder Minh Anh Pham Thi, Nguyen An Duc, Tzimiropoulos Georgios
- Characterizing Large Language Models As Rationalizers Of Knowledge-intensive Tasks Mishra Aditi, Rahman Sajjadur, Kim Hannah, Mitra Kushan, Hruschka Estevam
- Adaptive Machine Translation With Large Language Models Moslem Yasmin, Haque Rejwanul, Kelleher John D., Way Andy
- Large Language Model Augmented Narrative Driven Recommendations Mysore Sheshera, Mccallum Andrew, Zamani Hamed
- Generating Medically-accurate Summaries Of Patient-provider Dialogue: A Multi-stage Approach Using Large Language Models Nair Varun, Schumacher Elliot, Kannan Anitha
- Enhancing Few-shot Text-to-sql Capabilities Of Large Language Models: A Study On Prompt Design Strategies Nan Linyong, Zhao Yilun, Zou Weijin, Ri Narutatsu, Tae Jaesung, Zhang Ellen, Cohan Arman, Radev Dragomir
- Large Language Models As Tax Attorneys: A Case Study In Legal Capabilities Emergence Nay John J., Karamardian David, Lawsky Sarah B., Tao Wenting, Bhat Meghana, Jain Raghav, Lee Aaron Travis, Choi Jonathan H., Kasai Jungo
- Evaluating The Symbol Binding Ability Of Large Language Models For Multiple-choice Questions In Vietnamese General Education Nguyen Duc-vu, Nguyen Quoc-nam
- Cof-cot: Enhancing Large Language Models With Coarse-to-fine Chain-of-thought Prompting For Multi-domain NLU Tasks Nguyen Hoang H., Liu Ye, Zhang Chenwei, Zhang Tao, Yu Philip S.
- In-context Example Selection With Influences Nguyen Tai, Wong Eric
- Democratizing Llms For Low-resource Languages By Leveraging Their English Dominant Abilities With Linguistically-diverse Prompts Nguyen Xuan-phi, Aljunied Sharifah Mahani, Joty Shafiq, Bing Lidong
- L2ceval: Evaluating Language-to-code Generation Capabilities Of Large Language Models Ni Ansong, Yin Pengcheng, Zhao Yilun, Riddell Martin, Feng Troy, Shen Rui, Yin Stephen, Liu Ye, Yavuz Semih, Xiong Caiming, Joty Shafiq, Zhou Yingbo, Radev Dragomir, Cohan Arman
- Code-style In-context Learning For Knowledge-based Question Answering Nie Zhijie, Zhang Richong, Wang Zhongyuan, Liu Xudong
- Blackvip: Black-box Visual Prompting For Robust Transfer Learning Oh Changdae, Hwang Hyeji, Lee Hee-young, Lim Yongtaek, Jung Geunyoung, Jung Jiyoung, Choi Hosik, Song Kyungwoo
- Automated Assessment Of Students' Code Comprehension Using Llms Oli Priti, Banjade Rabin, Chapagain Jeevan, Rus Vasile
- Large Language Models Vote: Prompting For Rare Disease Identification Oniani David, Hilsman Jordan, Dong Hang, Gao Fengyi, Verma Shiven, Wang Yanshan
- Self-supervised Meta-prompt Learning With Meta-gradient Regularization For Few-shot Generalization Pan Kaihang, Li Juncheng, Song Hongye, Lin Jun, Liu Xiaozhong, Tang Siliang
- Chameleon: Plug-and-play Compositional Reasoning With Large Language Models Pan Lu, Baolin Peng, Hao Cheng, Michel Galley, Kai-wei Chang, Ying Nian Wu, Song-chun Zhu, Jianfeng Gao
- ART: Automatic Multi-step Reasoning And Tool-use For Large Language Models Paranjape Bhargavi, Lundberg Scott, Singh Sameer, Hajishirzi Hannaneh, Zettlemoyer Luke, Ribeiro Marco Tulio
- On The Analysis Of Cross-lingual Prompt Tuning For Decoder-based Multilingual Model Park Nohil, Park Joonsuk, Yoo Kang Min, Yoon Sungroh
- Instructexcel: A Benchmark For Natural Language Instruction In Excel Payan Justin, Mishra Swaroop, Singh Mukul, Negreanu Carina, Poelitz Christian, Baral Chitta, Roy Subhro, Chakravarthy Rasika, Van Durme Benjamin, Nouri Elnaz
- Spoken Language Intelligence Of Large Language Models For Language Learning Peng Linkai, Nuchged Baorian, Gao Yingming
- Soft Prompt Tuning For Augmenting Dense Retrieval With Large Language Models Peng Zhiyuan, Wu Xuyang, Wang Qifan, Fang Yi
- Large Language Models Sensitivity To The Order Of Options In Multiple-choice Questions Pezeshkpour Pouya, Hruschka Estevam
- Zero-shot Question Answering Over Financial Documents Using Large Language Models Phogat Karmvir Singh, Harsha Chetan, Dasaratha Sridhar, Ramakrishna Shashishekar, Puranam Sai Akhil
- Sabiá: Portuguese Large Language Models Pires Ramon, Abonizio Hugo, Almeida Thales Sales, Nogueira Rodrigo
- Three Ways Of Using Large Language Models To Evaluate Chat Plátek Ondřej, Hudeček Vojtěch, Schmidtová Patricia, Lango Mateusz, Dušek Ondřej
- Transformers In The Service Of Description Logic-based Contexts Poulis Angelos, Tsalapati Eleni, Koubarakis Manolis
- Rephrase, Augment, Reason: Visual Grounding Of Questions For Vision-language Models Prasad Archiki, Stengel-eskin Elias, Bansal Mohit
- Decomposed Prompting For Machine Translation Between Related Languages Using Large Language Models Puduppully Ratish, Kunchukuttan Anoop, Dabre Raj, Aw Ai Ti, Chen Nancy F.
- Visual Adversarial Examples Jailbreak Aligned Large Language Models Qi Xiangyu, Huang Kaixuan, Panda Ashwinee, Henderson Peter, Wang Mengdi, Mittal Prateek
- Improving In-context Learning Via Bidirectional Alignment Qin Chengwei, Xia Wenhan, Jiao Fangkai, Chen Chen, Hu Yuchen, Ding Bosheng, Joty Shafiq
- In-context Learning With Iterative Demonstration Selection Qin Chengwei, Zhang Aston, Chen Chen, Dagar Anirudh, Ye Wenming
- Faithful Chain-of-thought Reasoning Qing Lyu, Shreya Havaldar, Adam Stein, Li Zhang, Delip Rao, Eric Wong, Marianna Apidianaki, Chris Callison-burch
- Controllable Generation Of Dialogue Acts For Dialogue Systems Via Few-shot Response Generation And Ranking Ramirez Angela, Agarwal Karik, Juraska Juraj, Garg Utkarsh, Walker Marilyn A.
- Lmcap: Few-shot Multilingual Image Captioning By Retrieval Augmented Language Model Prompting Ramos Rita, Martins Bruno, Elliott Desmond
- Dissecting In-context Learning Of Translations In Gpts Raunak Vikas, Awadalla Hany Hassan, Menezes Arul
- Do Gpts Produce Less Literal Translations? Raunak Vikas, Menezes Arul, Post Matt, Awadalla Hany Hassan
- Prompt, Generate, Then Cache: Cascade Of Foundation Models Makes Strong Few-shot Learners Renrui Zhang, Xiangfei Hu, Bohao Li, Siyuan Huang, Hanqiu Deng, Hongsheng Li, Yu Qiao, Peng Gao
- Negated Complementary Commonsense Using Large Language Models Rezaei Navid, Reformat Marek Z.
- STREET: A Multi-task Structured Reasoning And Explanation Benchmark Ribeiro Danilo, Wang Shen, Ma Xiaofei, Zhu Henry, Dong Rui, Kong Deguang, Burger Juliette, Ramos Anjelica, Wang William, Huang Zhiheng, Karypis George, Xiang Bing, Roth Dan
- Unveiling The Potential Of Large Language Models In Generating Semantic And Cross-language Clones Roy Palash R., Alam Ajmain I., Al-omari Farouq, Roy Banani, Roy Chanchal K., Schneider Kevin A.
- ICLEF: In-context Learning With Expert Feedback For Explainable Style Transfer Saakyan Arkadiy, Muresan Smaranda
- Promptmix: A Class Boundary Augmentation Method For Large Language Model Distillation Sahu Gaurav, Vechtomova Olga, Bahdanau Dzmitry, Laradji Issam H.
- Large Language Models Are Competitive Near Cold-start Recommenders For Language- And Item-based Preferences Sanner Scott, Balog Krisztian, Radlinski Filip, Wedin Ben, Dixon Lucas
- Open-ended Instructable Embodied Agents With Memory-augmented Large Language Models Sarch Gabriel, Wu Yue, Tarr Michael J., Fragkiadaki Katerina
- RAMP: Retrieval And Attribute-marking Enhanced Prompting For Attribute-controlled Translation Sarti Gabriele, Htut Phu Mon, Niu Xing, Hsu Benjamin, Currey Anna, Dinu Georgiana, Nadejde Maria
- An Empirical Evaluation Of Using Large Language Models For Automated Unit Test Generation Schäfer Max, Nadi Sarah, Eghbali Aryaz, Tip Frank
- Quantifying Language Models' Sensitivity To Spurious Features In Prompt Design Or: How I Learned To Start Worrying About Prompt Formatting Sclar Melanie, Choi Yejin, Tsvetkov Yulia, Suhr Alane
- Wikichat: Stopping The Hallucination Of Large Language Model Chatbots By Few-shot Grounding On Wikipedia Semnani Sina J., Yao Violet Z., Zhang Heidi C., Lam Monica S.
- The Cot Collection: Improving Zero-shot And Few-shot Learning Of Language Models Via Chain-of-thought Fine-tuning Seungone Kim, Se June Joo, Doyoung Kim, Joel Jang, Seonghyeon Ye, Jamin Shin, Minjoon Seo
- Language Is Not All You Need: Aligning Perception With Language Models Shaohan Huang, Li Dong, Wenhui Wang, Yaru Hao, Saksham Singhal, Shuming Ma, Tengchao Lv, Lei Cui, Owais Khan Mohammed, Barun Patra, Qiang Liu, Kriti Aggarwal, Zewen Chi, Johan Bjorck, Vishrav Chaudhary, Subhojit Som, Xia Song, Furu Wei
- The Flan Collection: Designing Data And Methods For Effective Instruction Tuning Shayne Longpre, Le Hou, Tu Vu, Albert Webson, Hyung Won Chung, Yi Tay, Denny Zhou, Quoc V. Le, Barret Zoph, Jason Wei, Adam Roberts
- Mixture-of-experts Meets Instruction Tuning: A Winning Combination For Large Language Models Shen Sheng, Hou Le, Zhou Yanqi, Du Nan, Longpre Shayne, Wei Jason, Chung Hyung Won, Zoph Barret, Fedus William, Chen Xinyun, Vu Tu, Wu Yuexin, Chen Wuyang, Webson Albert, Li Yunxuan, Zhao Vincent, Yu Hongkun, Keutzer Kurt, Darrell Trevor, Zhou Denny
- Can Language Agents Be Alternatives To PPO? A Preliminary Empirical Study On Openai Gym Sheng Junjie, Huang Zixiao, Shen Chuyun, Li Wenhao, Hua Yun, Jin Bo, Zha Hongyuan, Wang Xiangfeng
- Prompt Space Optimizing Few-shot Reasoning Success With Large Language Models Shi Fobo, Qing Peijun, Yang Dong, Wang Nan, Lei Youbo, Lu Haonan, Lin Xiaodong, Li Duantengchuan
- Unleashing The Power Of Pre-trained Language Models For Offline Reinforcement Learning Shi Ruizhe, Liu Yuyao, Ze Yanjie, Du Simon S., Xu Huazhe
- Dept: Decomposed Prompt Tuning For Parameter-efficient Fine-tuning Shi Zhengxiang, Lipani Aldo
- Large Language Models For Aspect-based Sentiment Analysis Simmering Paul F., Huoviala Paavo
- Applenet: Visual Attention Parameterized Prompt Learning For Few-shot Remote Sensing Image Generalization Using CLIP Singha Mainak, Jha Ankit, Solanki Bhupendra, Bose Shirsha, Banerjee Biplab
- SPARSEFIT: Few-shot Prompting With Sparse Fine-tuning For Jointly Generating Predictions And Natural Language Explanations Solano Jesus, Sanni Mardhiyah, Camburu Oana-maria, Minervini Pasquale
- Pythia: A Suite For Analyzing Large Language Models Across Training And Scaling Stella Biderman, Hailey Schoelkopf, Quentin Anthony, Herbie Bradley, Kyle O'brien, Eric Hallahan, Mohammad Aflah Khan, Shivanshu Purohit, Usvsn Sai Prashanth, Edward Raff, Aviya Skowron, Lintang Sutawika, Oskar Van Der Wal
- TEST: Text Prototype Aligned Embedding To Activate Llm's Ability For Time Series Sun Chenxi, Li Hongyan, Li Yaliang, Hong Shenda
- Adaplanner: Adaptive Planning From Feedback With Language Models Sun Haotian, Zhuang Yuchen, Kong Lingkai, Dai Bo, Zhang Chao
- Make Prompt-based Black-box Tuning Colorful: Boosting Model Generalization From Three Orthogonal Perspectives Sun Qiushi, Han Chengcheng, Chen Nuo, Zhu Renyu, Gong Jingyang, Li Xiang, Gao Ming
- Generative Multimodal Models Are In-context Learners Sun Quan, Cui Yufeng, Zhang Xiaosong, Zhang Fan, Yu Qiying, Luo Zhengxiong, Wang Yueze, Rao Yongming, Liu Jingjing, Huang Tiejun, Wang Xinlong
- Battle Of The Large Language Models: Dolly Vs Llama Vs Vicuna Vs Guanaco Vs Bard Vs Chatgpt -- A Text-to-sql Parsing Comparison Sun Shuo, Zhang Yuchen, Yan Jiahuan, Gao Yuze, Ong Donovan, Chen Bin, Su Jian
- PEARL: Prompting Large Language Models To Plan And Execute Actions Over Long Documents Sun Simeng, Liu Yang, Wang Shuohang, Zhu Chenguang, Iyyer Mohit
- A Prompt Learning Framework For Source Code Summarization Sun Weisong, Fang Chunrong, You Yudu, Chen Yuchen, Liu Yi, Wang Chong, Zhang Jian, Zhang Quanjun, Qian Hanwei, Zhao Wei, Liu Yang, Chen Zhenyu
- Multimodal Question Answering For Unified Information Extraction Sun Yuxuan, Zhang Kai, Su Yu
- Pre-training Intent-aware Encoders For Zero- And Few-shot Intent Classification Sung Mujeen, Gung James, Mansimov Elman, Pappas Nikolaos, Shu Raphael, Romeo Salvatore, Zhang Yi, Castelli Vittorio
- Approximating Online Human Evaluation Of Social Chatbots With Prompting Svikhnushina Ekaterina, Pu Pearl
- Can Language Models Teach Weaker Agents? Teacher Explanations Improve Students Via Personalization Swarnadeep Saha, Peter Hase, Mohit Bansal
- Exploring Chain-of-thought Style Prompting For Text-to-sql Tai Chang-you, Chen Ziru, Zhang Tianshu, Deng Xiang, Sun Huan
- Link-context Learning For Multimodal Llms Tai Yan, Fan Weichen, Zhang Zhao, Zhu Feng, Zhao Rui, Liu Ziwei
- Training Generative Question-answering On Synthetic Data Obtained From An Instruct-tuned Model Takahashi Kosuke, Omi Takahiro, Arima Kosuke, Ishigaki Tatsuya
- Tackling VQA With Pretrained Foundation Models Without Further Training Tan Alvin De Jun, Shen Bingquan
- Compound Text-guided Prompt Tuning Via Image-adaptive Cues Tan Hao, Li Jun, Zhou Yizhuang, Wan Jun, Lei Zhen, Zhang Xiangyu
- Is Chatgpt A Highly Fluent Grammatical Error Correction System? A Comprehensive Evaluation Tao Fang, Shu Yang, Kaixin Lan, Derek F. Wong, Jinpeng Hu, Lidia S. Chao, Yue Zhang
- Parameter Efficient Tuning Allows Scalable Personalization Of Llms For Text Entry: A Case Study On Abbreviation Expansion Tomanek Katrin, Cai Shanqing, Venugopalan Subhashini
- Large Language Model Prompt Chaining For Long Legal Document Classification Trautmann Dietrich
- Chatgpt As A Commenter To The News: Can Llms Generate Human-like Opinions? Tseng Rayden, Verberne Suzan, Van Der Putten Peter
- Efficiently Aligned Cross-lingual Transfer Learning For Conversational Tasks Using Prompt-tuning Tu Lifu, Qu Jin, Yavuz Semih, Joty Shafiq, Liu Wenhao, Xiong Caiming, Zhou Yingbo
- Language Models Don't Always Say What They Think: Unfaithful Explanations In Chain-of-thought Prompting Turpin Miles, Michael Julian, Perez Ethan, Bowman Samuel R.
- ASPIRO: Any-shot Structured Parsing-error-induced Reprompting For Consistent Data-to-text Generation Vejvar Martin, Fujimoto Yasutaka
- Small Language Models Improve Giants By Rewriting Their Outputs Vernikos Giorgos, Bražinskas Arthur, Adamek Jakub, Mallinson Jonathan, Severyn Aliaksei, Malmi Eric
- Mtcue: Learning Zero-shot Control Of Extra-textual Attributes By Leveraging Unstructured Context In Neural Machine Translation Vincent Sebastian, Flynn Robert, Scarton Carolina
- Prompt2model: Generating Deployable Models From Natural Language Instructions Viswanathan Vijay, Zhao Chenyang, Bertsch Amanda, Wu Tongshuang, Neubig Graham
- Freshllms: Refreshing Large Language Models With Search Engine Augmentation Vu Tu, Iyyer Mohit, Wang Xuezhi, Constant Noah, Wei Jerry, Wei Jason, Tar Chris, Sung Yun-hsuan, Zhou Denny, Le Quoc, Luong Thang
- Better Zero-shot Reasoning With Self-adaptive Prompting Wan Xingchen, Sun Ruoxi, Dai Hanjun, Arik Sercan O., Pfister Tomas
- Universal Self-adaptive Prompting Wan Xingchen, Sun Ruoxi, Nakhost Hootan, Dai Hanjun, Eisenschlos Julian Martin, Arik Sercan O., Pfister Tomas
- Knowledgeable In-context Tuning: Exploring And Exploiting Factual Knowledge For In-context Learning Wang Jianing, Wang Chengyu, Tan Chuanqi, Huang Jun, Gao Ming
- Query2doc: Query Expansion With Large Language Models Wang Liang, Yang Nan, Wei Furu
- STEPS: A Benchmark For Order Reasoning In Sequential Tasks Wang Weizhi, Wang Hong, Yan Xifeng
- Abspyramid: Benchmarking The Abstraction Ability Of Language Models With A Unified Entailment Graph Wang Zhaowei, Shi Haochen, Wang Weiqi, Fang Tianqing, Zhang Hongming, Choi Sehyun, Liu Xin, Song Yangqiu
- Empowering Few-shot Recommender Systems With Large Language Models -- Enhanced Representations Wang Zhoumeng
- Multiqg-ti: Towards Question Generation From Multi-modal Sources Wang Zichao, Baraniuk Richard
- Do Llms Understand User Preferences? Evaluating Llms On User Rating Prediction Wang-cheng Kang, Jianmo Ni, Nikhil Mehta, Maheswaran Sathiamoorthy, Lichan Hong, Ed Chi, Derek Zhiyuan Cheng
- Machine Translation For Ge'ez Language Wassie Aman Kassahun
- GROVE: A Retrieval-augmented Complex Story Generation Framework With A Forest Of Evidence Wen Zhihua, Tian Zhiliang, Wu Wei, Yang Yuxin, Shi Yanqi, Huang Zhen, Li Dongsheng
- Helping Language Models Learn More: Multi-dimensional Task Prompt For Few-shot Tuning Weng Jinta, Zhang Jiarui, Hu Yue, Fa Daidong, Xuand Xiaofeng, Huang Heyan
- Large Language Models Leverage External Knowledge To Extend Clinical Insight Beyond Language Boundaries Wu Jiageng, Wu Xian, Qiu Zhaopeng, Li Minghui, Zhang Yingying, Zheng Yefeng, Yuan Changzheng, Yang Jie
- Analyzing Chain-of-thought Prompting In Large Language Models Via Gradient-based Feature Attributions Wu Skyler, Shen Eric Meng, Badrinath Charumathi, Ma Jiaqi, Lakkaraju Himabindu
- An Early Evaluation Of Gpt-4v(ision) Wu Yang, Wang Shilong, Yang Hao, Zheng Tian, Zhang Hongbo, Zhao Yanyan, Qin Bing
- Imagenetvc: Zero- And Few-shot Visual Commonsense Evaluation On 1000 Imagenet Categories Xia Heming, Dong Qingxiu, Li Lei, Xu Jingjing, Liu Tianyu, Qin Ziwei, Sui Zhifang
- O3D: Offline Data-driven Discovery And Distillation For Sequential Decision-making With Large Language Models Xiao Yuchen, Sun Yanchao, Xu Mengda, Madhushani Udari, Vann Jared, Garg Deepeka, Ganesh Sumitra
- Empirical Study Of Zero-shot NER With Chatgpt Xie Tingyu, Li Qi, Zhang Jian, Zhang Yan, Liu Zuozhu, Wang Hongwei
- Fine-tuned Llms Know More, Hallucinate Less With Few-shot Sequence-to-sequence Semantic Parsing Over Wikidata Xu Silei, Liu Shicheng, Culhane Theo, Pertseva Elizaveta, Wu Meng-hsi, Semnani Sina J., Lam Monica S.
- Mental-llm: Leveraging Large Language Models For Mental Health Prediction Via Online Text Data Xu Xuhai, Yao Bingsheng, Dong Yuanzhe, Gabriel Saadia, Yu Hong, Hendler James, Ghassemi Marzyeh, Dey Anind K., Wang Dakuo
- Recmind: Large Language Model Powered Agent For Recommendation Yancheng Wang, Ziyan Jiang, Zheng Chen, Fan Yang, Yingxue Zhou, Eunah Cho, Xing Fan, Xiaojiang Huang, Yanbin Lu, Yingzhen Yang
- PALR: Personalization Aware Llms For Recommendation Yang Fan, Chen Zheng, Jiang Ziyan, Cho Eunah, Huang Xiaojiang, Lu Yanbin
- Towards Interpretable Mental Health Analysis With Large Language Models Yang Kailai, Ji Shaoxiong, Zhang Tianlin, Xie Qianqian, Kuang Ziyan, Ananiadou Sophia
- Revisiting Parallel Context Windows: A Frustratingly Simple Alternative And Chain-of-thought Deterioration Yang Kejuan, Liu Xiao, Men Kaiwen, Zeng Aohan, Dong Yuxiao, Tang Jie
- Alignedcot: Prompting Large Language Models Via Native-speaking Demonstrations Yang Zhicheng, Huang Yinya, Xiong Jing, Feng Liang, Liang Xiaodan, Wang Yiwei, Tang Jing
- Coupling Large Language Models With Logic Programming For Robust And General Reasoning From Text Yang Zhun, Ishay Adam, Lee Joohyung
- Re-vilm: Retrieval-augmented Visual Language Model For Zero And Few-shot Image Captioning Yang Zhuolin, Ping Wei, Liu Zihan, Korthikanti Vijay, Nie Weili, Huang De-an, Fan Linxi, Yu Zhiding, Lan Shiyi, Li Bo, Liu Ming-yu, Zhu Yuke, Shoeybi Mohammad, Catanzaro Bryan, Xiao Chaowei, Anandkumar Anima
- More Samples Or More Prompts? Exploring Effective In-context Sampling For LLM Few-shot Prompt Engineering Yao Bingsheng, Chen Guiming, Zou Ruishi, Lu Yuxuan, Li Jiachen, Zhang Shao, Sang Yisi, Liu Sijia, Hendler James, Wang Dakuo
- Diffusion Language Models Can Perform Many Tasks With Scaling And Instruction-finetuning Ye Jiasheng, Zheng Zaixiang, Bao Yu, Qian Lihua, Gu Quanquan
- QAID: Question Answering Inspired Few-shot Intent Detection Yehudai Asaf, Vetzler Matan, Mass Yosi, Lazar Koren, Cohen Doron, Carmeli Boaz
- Improving Language Models Via Plug-and-play Retrieval Feedback Yu Wenhao, Zhang Zhihan, Liang Zhenwen, Jiang Meng, Sabharwal Ashish
- Evaluating Instruction-tuned Large Language Models On Code Comprehension And Generation Yuan Zhiqiang, Liu Junwei, Zi Qiancheng, Liu Mingwei, Peng Xin, Lou Yiling
- Conditionally Combining Robot Skills Using Large Language Models Zentner K. R., Julian Ryan, Ichter Brian, Sukhatme Gaurav S.
- Mm-narrator: Narrating Long-form Videos With Multimodal In-context Learning Zhang Chaoyi, Lin Kevin, Yang Zhengyuan, Wang Jianfeng, Li Linjie, Lin Chung-ching, Liu Zicheng, Wang Lijuan
- The Skipped Beat: A Study Of Sociopragmatic Understanding In Llms For 64 Languages Zhang Chiyu, Doan Khai Duy, Liao Qisheng, Abdul-mageed Muhammad
- Efficient Toxic Content Detection By Bootstrapping And Distilling Large Language Models Zhang Jiang, Wu Qiong, Xu Yiming, Cao Cheng, Du Zheng, Psounis Konstantinos
- The Potential And Pitfalls Of Using A Large Language Model Such As Chatgpt Or GPT-4 As A Clinical Assistant Zhang Jingqing, Sun Kai, Jagadeesh Akshay, Ghahfarokhi Mahta, Gupta Deepa, Gupta Ashok, Gupta Vibhor, Guo Yike
- Aligning Instruction Tasks Unlocks Large Language Models As Zero-shot Relation Extractors Zhang Kai, Gutiérrez Bernal Jiménez, Su Yu
- Arcmmlu: A Library And Information Science Benchmark For Large Language Models Zhang Shitou, Li Zuchao, Liu Xingshen, Yang Liming, Wang Ping
- SGP-TOD: Building Task Bots Effortlessly Via Schema-guided LLM Prompting Zhang Xiaoying, Peng Baolin, Li Kun, Zhou Jingyan, Meng Helen
- Meta Prompting For AI Systems Zhang Yifan, Yuan Yang, Yao Andrew Chi-chih
- Group Preference Optimization: Few-shot Alignment Of Large Language Models Zhao Siyan, Dang John, Grover Aditya
- Chat With The Environment: Interactive Multimodal Perception Using Large Language Models Zhao Xufeng, Li Mengdi, Weber Cornelius, Hafez Muhammad Burhan, Wermter Stefan
- Robut: A Systematic Study Of Table QA Robustness Against Human-annotated Adversarial Perturbations Zhao Yilun, Zhao Chen, Nan Linyong, Qi Zhenting, Zhang Wenlin, Tang Xiangru, Mi Boyu, Radev Dragomir
- Noisy Exemplars Make Large Language Models More Robust: A Domain-agnostic Behavioral Analysis Zheng Hongyi, Saparov Abulhair
- Sglang: Efficient Execution Of Structured Language Model Programs Zheng Lianmin, Yin Liangsheng, Xie Zhiqiang, Sun Chuyue, Huang Jeff, Yu Cody Hao, Cao Shiyi, Kozyrakis Christos, Stoica Ion, Gonzalez Joseph E., Barrett Clark, Sheng Ying
- Trojfsp: Trojan Insertion In Few-shot Prompt Tuning Zheng Mengxin, Xue Jiaqi, Chen Xun, Wang Yanshan, Lou Qian, Jiang Lei
- Large Language Models Are Good Prompt Learners For Low-shot Image Classification Zheng Zhaoheng, Wei Jingmin, Hu Xuefeng, Zhu Haidong, Nevatia Ram
- Batch Calibration: Rethinking Calibration For In-context Learning And Prompt Engineering Zhou Han, Wan Xingchen, Proleev Lev, Mincu Diana, Chen Jilin, Heller Katherine, Roy Subhrajit
- Survival Of The Most Influential Prompts: Efficient Black-box Prompt Search Via Clustering And Pruning Zhou Han, Wan Xingchen, Vulić Ivan, Korhonen Anna
- Llms Learn Task Heuristics From Demonstrations: A Heuristic-driven Prompting Strategy For Document-level Event Argument Extraction Zhou Hanzhang, Qian Junlang, Feng Zijian, Lu Hui, Zhu Zixiao, Mao Kezhi
- Multi-stage Pre-training Enhanced By Chatgpt For Multi-scenario Multi-domain Dialogue Summarization Zhou Weixiao, Li Gengyao, Cheng Xianfu, Liang Xinnian, Zhu Junnan, Zhai Feifei, Li Zhoujun
- Flame: Few-shot Learning From Natural Language Explanations Zhou Yangqiaoyu, Zhang Yiming, Tan Chenhao
- Revisiting Automated Prompting: Are We Actually Doing Better? Zhou Yulin, Zhao Yiren, Shumailov Ilia, Mullins Robert, Gal Yarin
- VL-GPT: A Generative Pre-trained Transformer For Vision And Language Understanding And Generation Zhu Jinguo, Ding Xiaohan, Ge Yixiao, Ge Yuying, Zhao Sijie, Zhao Hengshuang, Wang Xiaohua, Shan Ying
- Open-source Large Language Models Are Strong Zero-shot Query Likelihood Models For Document Ranking Zhuang Shengyao, Liu Bing, Koopman Bevan, Zuccon Guido
- Tab-cot: Zero-shot Tabular Chain Of Thought Ziqi Jin, Wei Lu
- Fine-grained Contract NER Using Instruction Based Model Adibhatla Hiranmai Sri, Baswani Pavan, Shrivastava Manish
- Many-shot In-context Learning Agarwal Rishabh, Singh Avi, Zhang Lei M., Bohnet Bernd, Rosias Luis, Chan Stephanie, Zhang Biao, Anand Ankesh, Abbas Zaheer, Nova Azade, Co-reyes John D., Chu Eric, Behbahani Feryal, Faust Aleksandra, Larochelle Hugo
- Illuminate: A Novel Approach For Depression Detection With Explainable Analysis And Proactive Therapy Using Prompt Engineering Agrawal Aryan
- Enhancing Pre-trained Generative Language Models With Question Attended Span Extraction On Machine Reading Comprehension Ai Lin, Hui Zheng, Liu Zizhou, Hirschberg Julia
- Can Open-source Llms Compete With Commercial Models? Exploring The Few-shot Performance Of Current GPT Models In Biomedical Tasks Ateia Samy, Kruschwitz Udo
- Finetuning Large Language Model For Personalized Ranking Bai Zhuoxi, Wu Ning, Cai Fengyu, Zhu Xinyi, Xiong Yun
- Enhancing Code Translation In Language Models With Few-shot Learning Via Retrieval-augmented Generation Bhattarai Manish, Santos Javier E., Jones Shawn, Biswas Ayan, Alexandrov Boian, O'malley Daniel
- Elephants Never Forget: Memorization And Learning Of Tabular Data In Large Language Models Bordt Sebastian, Nori Harsha, Rodrigues Vanessa, Nushi Besmira, Caruana Rich
- Enhancing Trust In Llms: Algorithms For Comparing And Interpreting Llms Brown Nik Bear
- TACT: Advancing Complex Aggregative Reasoning With Information Extraction Tools Caciularu Avi, Jacovi Alon, Ben-david Eyal, Goldshtein Sasha, Schuster Tal, Herzig Jonathan, Elidan Gal, Globerson Amir
- Llms Are Few-shot In-context Low-resource Language Learners Cahyawijaya Samuel, Lovenia Holy, Fung Pascale
- Can Few-shot Work In Long-context? Recycling The Context To Generate Demonstrations Cattan Arie, Jacovi Alon, Fabrikant Alex, Herzig Jonathan, Aharoni Roee, Rashkin Hannah, Marcus Dror, Hassidim Avinatan, Matias Yossi, Szpektor Idan, Caciularu Avi
- Speechprompt: Prompting Speech Language Models For Speech Processing Tasks Chang Kai-wei, Wu Haibin, Wang Yu-kai, Wu Yuan-kuei, Shen Hua, Tseng Wei-cheng, Kang Iu-thing, Li Shang-wen, Lee Hung-yi
- Cost-effective Instruction Learning For Pathology Vision And Language Analysis Chen Kaitao, Liu Mianxin, Yan Fang, Ma Lei, Shi Xiaoming, Wang Lilong, Wang Xiaosong, Zhu Lifeng, Wang Zhe, Zhou Mu, Zhang Shaoting
- Measuring Taiwanese Mandarin Language Understanding Chen Po-heng, Cheng Sijia, Chen Wei-lin, Lin Yen-ting, Chen Yun-nung
- Can We Rely On LLM Agents To Draft Long-horizon Plans? Let's Take Travelplanner As An Example Chen Yanan, Pesaranghader Ali, Sadhu Tanmana, Yi Dong Hoon
- Potential And Limitations Of Llms In Capturing Structured Semantics: A Case Study On SRL Cheng Ning, Yan Zhaohui, Wang Ziming, Li Zhijie, Yu Jiaming, Zheng Zilong, Tu Kewei, Xu Jinan, Han Wenjuan
- Multi-prompting Decoder Helps Better Language Understanding Cheng Zifeng, Chen Zhaoling, Jiang Zhiwei, Yin Yafeng, Ge Shiping, Liu Yuliang, Gu Qing
- Modeling: A Novel Dataset For Testing Linguistic Reasoning In Language Models Chi Nathan A., Malchev Teodor, Kong Riley, Chi Ryan A., Huang Lucas, Chi Ethan A., Mccoy R. Thomas, Radev Dragomir
- A Chain-of-thought Prompting Approach With Llms For Evaluating Students' Formative Assessment Responses In Science Cohn Clayton, Hutchins Nicole, Le Tuan, Biswas Gautam
- Ragsys: Item-cold-start Recommender As RAG System Contal Emile, Mcgoldrick Garrin
- Gamebench: Evaluating Strategic Reasoning Abilities Of LLM Agents Costarelli Anthony, Allen Mat, Hauksson Roman, Sodunke Grace, Hariharan Suhas, Cheng Carlson, Li Wenjie, Clymer Joshua, Yadav Arjun
- Induction Heads As An Essential Mechanism For Pattern Matching In In-context Learning Crosbie J., Shutova E.
- ACT-MNMT Auto-constriction Turning For Multilingual Neural Machine Translation Dai Shaojie, Liu Xin, Luo Ping, Yu Yue
- Human-interpretable Adversarial Prompt Attack On Large Language Models With Situational Context Das Nilanjana, Raff Edward, Gaur Manas
- Prompting Open-source And Commercial Language Models For Grammatical Error Correction Of English Learner Text Davis Christopher, Caines Andrew, Andersen Øistein, Taslimipoor Shiva, Yannakoudakis Helen, Yuan Zheng, Bryant Christopher, Rei Marek, Buttery Paula
- Evaluating Large Language Models For Automatic Analysis Of Teacher Simulations De-fitero-dominguez David, Albaladejo-gonzález Mariano, Garcia-cabot Antonio, Garcia-lopez Eva, Moreno-cediel Antonio, Barno Erin, Reich Justin
- Understanding Biases In Chatgpt-based Recommender Systems: Provider Fairness, Temporal Stability, And Recency Deldjoo Yashar
- Large Language Models Prompting With Episodic Memory Do Dai, Tran Quan, Venkatesh Svetha, Le Hung
- Evaluating Linguistic Capabilities Of Multimodal Llms In The Lens Of Few-shot Learning Dogan Mustafa, Kesen Ilker, Calixto Iacer, Erdem Aykut, Erdem Erkut
- Towards Multimodal In-context Learning For Vision & Language Models Doveh Sivan, Perek Shaked, Mirza M. Jehanzeb, Lin Wei, Alfassy Amit, Arbelle Assaf, Ullman Shimon, Karlinsky Leonid
- Leveraging Machine-generated Rationales To Facilitate Social Meaning Detection In Conversations Dutt Ritam, Wu Zhen, Shi Kelly, Sheth Divyanshu, Gupta Prakhar, Rose Carolyn Penstein
- Conversational Prompt Engineering Ein-dor Liat, Toledo-ronen Orith, Spector Artem, Gretz Shai, Dankin Lena, Halfon Alon, Katz Yoav, Slonim Noam
- Enabling Natural Zero-shot Prompting On Encoder Models Via Statement-tuning Elshabrawy Ahmed, Huang Yongxin, Gurevych Iryna, Aji Alham Fikri
- Unraveling The Dominance Of Large Language Models Over Transformer Models For Bangla Natural Language Inference: A Comprehensive Study Faria Fatema Tuj Johora, Moin Mukaffi Bin, Fahim Asif Iftekher, Debnath Pronay, Shah Faisal Muhammad
- MERA: A Comprehensive LLM Evaluation In Russian Fenogenova Alena, Chervyakov Artem, Martynov Nikita, Kozlova Anastasia, Tikhonova Maria, Akhmetgareeva Albina, Emelyanov Anton, Shevelev Denis, Lebedev Pavel, Sinev Leonid, Isaeva Ulyana, Kolomeytseva Katerina, Moskovskiy Daniil, Goncharova Elizaveta, Savushkin Nikita, Mikhailova Polina, Dimitrov Denis, Panchenko Alexander, Markov Sergei
- Introducing The Newspalm MBR And QE Dataset: Llm-generated High-quality Parallel Data Outperforms Traditional Web-crawled Data Finkelstein Mara, Vilar David, Freitag Markus
- UI-JEPA: Towards Active Perception Of User Intent Through Onscreen User Activity Fu Yicheng, Anantha Raviteja, Vashisht Prabal, Cheng Jianpeng, Littwin Etai
- Using Large Language Models For The Interpretation Of Building Regulations Fuchs Stefan, Witbrock Michael, Dimyadi Johannes, Amor Robert
- Customizing Language Model Responses With Contrastive In-context Learning Gao Xiang, Das Kamalika
- Uncovering Hidden Intentions: Exploring Prompt Recovery For Deeper Insights Into Generated Texts Give Louis, Zaoral Timo, Bruno Maria Antonietta
- Blendsql: A Scalable Dialect For Unifying Hybrid Question Answering In Relational Algebra Glenn Parker, Dakle Parag Pravin, Wang Liang, Raghavan Preethi
- Memorization In In-context Learning Golchin Shahriar, Surdeanu Mihai, Bethard Steven, Blanco Eduardo, Riloff Ellen
- Talking The Talk Does Not Entail Walking The Walk: On The Limits Of Large Language Models In Lexical Entailment Recognition Greco Candida M., La Cava Lucio, Tagarelli Andrea
- Continual Skill And Task Learning Via Dialogue Gu Weiwei, Kondepudi Suresh, Huang Lixiao, Gopalan Nakul
- Towards Generalist Prompting For Large Language Models By Mental Models Guan Haoxiang, He Jiyan, Zheng Shuxin, Chen En-hong, Zhang Weiming, Yu Nenghai
- AAVENUE: Detecting LLM Biases On NLU Tasks In AAVE Via A Novel Benchmark Gupta Abhay, Meng Philip, Yurtseven Ece, O'brien Sean, Zhu Kevin
- Irel At Semeval-2024 Task 9: Improving Conventional Prompting Methods For Brain Teasers Gupta Harshit, Chaudhary Manav, Raha Tathagata, Subramanian Shivansh, Varma Vasudeva
- POEM: Interactive Prompt Optimization For Enhancing Multimodal Reasoning Of Large Language Models He Jianben, Wang Xingbo, Liu Shiyi, Wu Guande, Silva Claudio, Qu Huamin
- Prompt-based Bias Calibration For Better Zero/few-shot Learning Of Language Models He Kang, Long Yinghan, Roy Kaushik
- Can Large Language Models Make The Grade? An Empirical Study Evaluating Llms Ability To Mark Short Answer Questions In K-12 Education Henkel Owen, Boxer Adam, Hills Libby, Roberts Bill
- Optimising Hard Prompts With Few-shot Meta-prompting Hiraou Sayash Raaj
- Visual Prompting For Generalized Few-shot Segmentation: A Multi-scale Approach Hossain Mir Rayat Imtiaz, Siam Mennatullah, Sigal Leonid, Little James J.
- A Hopfieldian View-based Interpretation For Chain-of-thought Reasoning Hu Lijie, Liu Liang, Yang Shu, Chen Xin, Xiao Hongru, Li Mengdi, Zhou Pan, Ali Muhammad Asif, Wang Di
- Multimodal Task Vectors Enable Many-shot Multimodal In-context Learning Huang Brandon, Mitra Chancharik, Arbelle Assaf, Karlinsky Leonid, Darrell Trevor, Herzig Roei
- Generating Educational Materials With Different Levels Of Readability Using Llms Huang Chieh-yang, Wei Jing, Huang Ting-hao 'kenneth'
- Queryagent: A Reliable And Efficient Reasoning Framework With Environmental Feedback-based Self-correction Huang Xiang, Cheng Sitao, Huang Shanshan, Shen Jiayu, Xu Yong, Zhang Chaoyun, Qu Yuzhong
- Ecr-chain: Advancing Generative Language Models To Better Emotion-cause Reasoners Through Reasoning Chains Huang Zhaopei, Zhao Jinming, Jin Qin
- The Promise And Challenges Of Using Llms To Accelerate The Screening Process Of Systematic Reviews Huotala Aleksi, Kuutila Miikka, Ralph Paul, Mäntylä Mika
- News Without Borders: Domain Adaptation Of Multilingual Sentence Embeddings For Cross-lingual News Recommendation Iana Andreea, Schmidt Fabian David, Glavaš Goran, Paulheim Heiko
- Assessing Llms Suitability For Knowledge Graph Completion Iga Vasile Ionut Remus, Silaghi Gheorghe Cosmin
- L4Q: Parameter Efficient Quantization-aware Fine-tuning On Large Language Models Jeon Hyesung, Kim Yulhwa, Kim Jae-joon
- Leveraging Llms For Dialogue Quality Measurement Jia Jinghan, Komma Abi, Leffel Timothy, Peng Xujun, Nagesh Ajay, Soliman Tamer, Galstyan Aram, Kumar Anoop
- Langsuite: Planning, Controlling And Interacting With Large Language Models In Embodied Text Environments Jia Zixia, Wang Mengmeng, Tong Baichen, Zhu Song-chun, Zheng Zilong
- Heuristic-enhanced Candidates Selection Strategy For Gpts Tackle Few-shot Aspect-based Sentiment Analysis Jiang Baoxing, Wan Yujie, Ju Shenggen
- HGT: Leveraging Heterogeneous Graph-enhanced Large Language Models For Few-shot Complex Table Understanding Jin Rihui, Li Yu, Qi Guilin, Hu Nan, Li Yuan-fang, Chen Jiaoyan, Wang Jianan, Chen Yongrui, Min Dehai
- Investigating Multi-hop Factual Shortcuts In Knowledge Editing Of Large Language Models Ju Tianjie, Chen Yijin, Yuan Xinwei, Zhang Zhuosheng, Du Wei, Zheng Yubin, Liu Gongshen
- Self-training Large Language Models For Improved Visual Program Synthesis With Visual Reinforcement Khan Zaid, Bg Vijay Kumar, Schulter Samuel, Fu Yun, Chandraker Manmohan
- Keyword-driven Retrieval-augmented Large Language Models For Cold-start User Recommendations Kieu Hai-dang, Nguyen Minh Duc, Nguyen Thanh-son, Le Dung D.
- Few-shot Personalization Of Llms With Mis-aligned Responses Kim Jaehyung, Yang Yiming
- Enhancing Psychotherapy Counseling: A Data Augmentation Pipeline Leveraging Large Language Models For Counseling Conversations Kim Jun-woo, Han Ji-eun, Koh Jun-seok, Seo Hyeon-tae, Chang Du-seong
- Large Language Models Meet Collaborative Filtering: An Efficient All-round Llm-based Recommender System Kim Sein, Kang Hongseok, Choi Seungyoon, Kim Donghyun, Yang Minchul, Park Chanyoung
- Is GPT-4 Alone Sufficient For Automated Essay Scoring?: A Comparative Judgment Approach Based On Rater Cognition Kim Seungju, Jo Meounggun
- QPO: Query-dependent Prompt Optimization Via Multi-loop Offline Reinforcement Learning Kong Yilun, Mao Hangyu, Zhao Qi, Zhang Bin, Ruan Jingqing, Shen Li, Chang Yongzhe, Wang Xueqian, Zhao Rui, Tao Dacheng
- Audio Flamingo: A Novel Audio Language Model With Few-shot Learning And Dialogue Abilities Kong Zhifeng, Goel Arushi, Badlani Rohan, Ping Wei, Valle Rafael, Catanzaro Bryan
- Retrieval-augmented Code Generation For Situated Action Generation: A Case Study On Minecraft Kranti Chalamalasetti, Hakimov Sherzod, Schlangen David
- Synthdst: Synthetic Data Is All You Need For Few-shot Dialog State Tracking Kulkarni Atharva, Tseng Bo-hsiang, Moniz Joel Ruben Antony, Piraviperumal Dhivya, Yu Hong, Bhargava Shruti
- Cat-bench: Benchmarking Language Model Understanding Of Causal And Temporal Dependencies In Plans Lal Yash Kumar, Cohen Vanya, Chambers Nathanael, Balasubramanian Niranjan, Mooney Raymond
- Benchmarks Underestimate The Readiness Of Multi-lingual Dialogue Agents Lee Andrew H., Semnani Sina J., Castillo-lópez Galo, De Chalendar Gäel, Choudhury Monojit, Dua Ashna, Kavitha Kapil Rajesh, Kim Sungkyun, Kodali Prashant, Kumaraguru Ponnurangam, Lombard Alexis, Moradshahi Mehrad, Park Gihyun, Semmar Nasredine, Seo Jiwon, Shen Tianhao, Shrivastava Manish, Xiong Deyi, Lam Monica S.
- Effective And Efficient Conversation Retrieval For Dialogue State Tracking With Implicit Text Summaries Lee Seanie, Cheng Jianpeng, Driesen Joris, Coca Alexandru, Johannsen Anders
- Llava-docent: Instruction Tuning With Multimodal Large Language Model To Support Art Appreciation Education Lee Unggi, Jeon Minji, Lee Yunseo, Byun Gyuri, Son Yoorim, Shin Jaeyoon, Ko Hongkyu, Kim Hyeoncheol
- Pctoolkit: A Unified Plug-and-play Prompt Compression Toolkit Of Large Language Models Li Jinyi, Lan Yihuai, Wang Lei, Wang Hao
- Is Programming By Example Solved By Llms? Li Wen-ding, Ellis Kevin
- Empowering Large Language Models For Textual Data Augmentation Li Yichuan, Ding Kaize, Wang Jianling, Lee Kyumin
- Text-to-model: Text-conditioned Neural Network Diffusion For Train-once-for-all Personalization Li Zexi, Gao Lingzhi, Wu Chao
- Implicit In-context Learning Li Zhuowei, Xu Zihao, Han Ligong, Gao Yunhe, Wen Song, Liu Di, Wang Hao, Metaxas Dimitris N.
- Vl-trojan: Multimodal Instruction Backdoor Attacks Against Autoregressive Visual Language Models Liang Jiawei, Liang Siyuan, Luo Man, Liu Aishan, Han Dongchen, Chang Ee-chien, Cao Xiaochun
- How Can I Get It Right? Using GPT To Rephrase Incorrect Trainee Responses Lin Jionghao, Han Zifei, Thomas Danielle R., Gurung Ashish, Gupta Shivang, Aleven Vincent, Koedinger Kenneth R.
- Few-shot Class Incremental Learning With Attention-aware Self-adaptive Prompt Liu Chenxi, Wang Zhenyi, Xiong Tianyi, Chen Ruibo, Wu Yihan, Guo Junfeng, Huang Heng
- An Incomplete Loop: Instruction Inference, Instruction Following, And In-context Learning In Language Models Liu Emmy, Neubig Graham, Andreas Jacob
- Self-reflection Outcome Is Sensitive To Prompt Construction Liu Fengyuan, Aldahoul Nouar, Eady Gregory, Zaki Yasir, Alshebli Bedoor, Rahwan Talal
- Unraveling The Mechanics Of Learning-based Demonstration Selection For In-context Learning Liu Hui, Wang Wenya, Sun Hao, Tian Chris Xing, Kong Chenqi, Dong Xin, Li Haoliang
- Are Llms Effective Backbones For Fine-tuning? An Experimental Investigation Of Supervised Llms On Chinese Short Text Matching Liu Shulin, Xu Chengcheng, Liu Hao, Yu Tinghao, Yang Tao
- Demorank: Selecting Effective Demonstrations For Large Language Models In Ranking Task Liu Wenhan, Zhu Yutao, Dou Zhicheng
- Let's Learn Step By Step: Enhancing In-context Learning Ability With Curriculum Learning Liu Yinpeng, Liu Jiawei, Shi Xiang, Cheng Qikai, Huang Yong, Lu Wei
- Enabling Weak Llms To Judge Response Reliability Via Meta Ranking Liu Zijun, Kou Boqun, Li Peng, Yan Ming, Zhang Ji, Huang Fei, Liu Yang
- Large Language Models Know What Makes Exemplary Contexts Long Quanyu, Chen Jianda, Wang Wenya, Pan Sinno Jialin
- Does In-context Learning Really Learn? Rethinking How Large Language Models Respond And Solve Tasks Via In-context Learning Long Quanyu, Wu Yin, Wang Wenya, Pan Sinno Jialin
- In-context Time Series Predictor Lu Jiecheng, Sun Yan, Yang Shihao
- Crosstune: Black-box Few-shot Classification With Label Enhancement Luo Danqing, Zhang Chen, Zhang Yan, Li Haizhou
- Interpreting Key Mechanisms Of Factual Recall In Transformer-based Language Models Lv Ang, Chen Yuhan, Zhang Kaiyi, Wang Yulong, Liu Lifeng, Wen Ji-rong, Xie Jian, Yan Rui
- Exploring The Role Of Transliteration In In-context Learning For Low-resource Languages Written In Non-latin Scripts Ma Chunlan, Liu Yihong, Ye Haotian, Schütze Hinrich
- Making Pre-trained Language Models Better Continual Few-shot Relation Extractors Ma Shengkun, Han Jiale, Liang Yi, Cheng Bo
- Llmparser: An Exploratory Study On Using Large Language Models For Log Parsing Ma Zeyang, Chen An Ran, Kim Dong Jae, Chen Tse-hsun, Wang Shaowei
- From Tarzan To Tolkien: Controlling The Language Proficiency Level Of Llms For Content Generation Malik Ali, Mayhew Stephen, Piech Chris, Bicknell Klinton
- Reinforced Prompt Personalization For Recommendation With Large Language Models Mao Wenyu, Wu Jiancan, Chen Weijian, Gao Chongming, Wang Xiang, He Xiangnan
- Understanding And Mitigating Language Confusion In Llms Marchisio Kelly, Ko Wei-yin, Bérard Alexandre, Dehaze Théo, Ruder Sebastian
- MM1: Methods, Analysis & Insights From Multimodal LLM Pre-training Mckinzie Brandon, Gan Zhe, Fauconnier Jean-philippe, Dodge Sam, Zhang Bowen, Dufter Philipp, Shah Dhruti, Du Xianzhi, Peng Futang, Weers Floris, Belyi Anton, Zhang Haotian, Singh Karanjeet, Kang Doug, Jain Ankur, Hè Hongyu, Schwarzer Max, Gunter Tom, Kong Xiang, Zhang Aonan, Wang Jianyu, Wang Chong, Du Nan, Lei Tao, Wiseman Sam, Yin Guoli, Lee Mark, Wang Zirui, Pang Ruoming, Grasch Peter, Toshev Alexander, Yang Yinfei
- Soda-eval: Open-domain Dialogue Evaluation In The Age Of Llms Mendonça John, Trancoso Isabel, Lavie Alon
- Low-resource Machine Translation Through Retrieval-augmented LLM Prompting: A Study On The Mambai Language Merx Raphaël, Mahmudi Aso, Langford Katrina, De Araujo Leo Alberto, Vylomova Ekaterina
- A Realistic Evaluation Of Llms For Quotation Attribution In Literary Texts: A Case Study Of Llama3 Michel Gaspard, Epure Elena V., Hennequin Romain, Cerisara Christophe
- Can We Trust Large Language Models Generated Code? A Framework For In-context Learning, Security Patterns, And Code Evaluations Across Diverse Llms Mohsin Ahmad, Janicke Helge, Wood Adrian, Sarker Iqbal H., Maglaras Leandros, Janjua Naeem
- Are Large Language Models Actually Good At Text Style Transfer? Mukherjee Sourabrata, Ojha Atul Kr., Dušek Ondřej
- Zero-shot Visual Reasoning By Vision-language Models: Benchmarking And Analysis Nagar Aishik, Jaiswal Shantanu, Tan Cheston
- Using Grammar Masking To Ensure Syntactic Validity In Llm-based Modeling Tasks Netz Lukas, Reimer Jan, Rumpe Bernhard
- Dude: Dual Distribution-aware Context Prompt Learning For Large Vision-language Model Nguyen Duy M. H., Le An T., Nguyen Trung Q., Diep Nghiem T., Nguyen Tai, Duong-tran Duy, Peters Jan, Shen Li, Niepert Mathias, Sonntag Daniel
- Decomposed Prompting: Unveiling Multilingual Linguistic Structure Knowledge In English-centric Large Language Models Nie Ercong, Yuan Shuzhou, Ma Bolei, Schmid Helmut, Färber Michael, Kreuter Frauke, Schütze Hinrich
- How Reliable AI Chatbots Are For Disease Prediction From Patient Complaints? Nipu Ayesha Siddika, Islam K M Sajjadul, Madiraju Praveen
- Likelihood-based Mitigation Of Evaluation Bias In Large Language Models Ohi Masanari, Kaneko Masahiro, Koike Ryuto, Loem Mengsay, Okazaki Naoaki
- Large Language Models Are Self-taught Reasoners: Enhancing LLM Applications Via Tailored Problem-solving Demonstrations Ong Kai Tzu-iunn, Kwon Taeyoon, Yeo Jinyoung
- NLP At UC Santa Cruz At Semeval-2024 Task 5: Legal Answer Validation Using Few-shot Multi-choice QA Pahilajani Anish, Jain Samyak Rajesh, Trivedi Devasha
- Training Llms To Recognize Hedges In Spontaneous Narratives Paige Amie J., Soubki Adil, Murzaku John, Rambow Owen, Brennan Susan E.
- Investigating LLM Applications In E-commerce Palen-michel Chester, Wang Ruixiang, Zhang Yipeng, Yu David, Xu Canran, Wu Zhe
- Distilling Large Language Models For Text-attributed Graph Learning Pan Bo, Zhang Zheng, Zhang Yifei, Hu Yuntong, Zhao Liang
- Pragmatic Competence Evaluation Of Large Language Models For Korean Park Dojun, Lee Jiwoo, Jeong Hyeyun, Park Seohyun, Lee Sungeun
- Low-resource Cross-lingual Summarization Through Few-shot Learning With Large Language Models Park Gyutae, Hwang Seojin, Lee Hwanhee
- Pre-trained Vision And Language Transformers Are Few-shot Incremental Learners Park Keon-hee, Song Kyungwoo, Park Gyeong-moon
- Decoding News Narratives: A Critical Analysis Of Large Language Models In Framing Detection Pastorino Valeria, Sivakumar Jasivan A., Moosavi Nafise Sadat
- Large Language Models Can Self-improve At Web Agent Tasks Patel Ajay, Hofmarcher Markus, Leoveanu-condrei Claudiu, Dinu Marius-constantin, Callison-burch Chris, Hochreiter Sepp
- Investigating The Effectiveness Of Hypertuning Via Gisting Phang Jason
- Reasoning With Large Language Models, A Survey Plaat Aske, Wong Annie, Verberne Suzan, Broekens Joost, Van Stein Niki, Back Thomas
- Practical Token Pruning For Foundation Models In Few-shot Conversational Virtual Assistant Systems Qi Haode, Qian Cheng, Ni Jian, Singh Pratyush, Fazeli Reza, Wang Gengyu, Shu Zhongzheng, Wayne Eric, Bross Juergen
- LAMPO: Large Language Models As Preference Machines For Few-shot Ordinal Classification Qin Zhen, Wu Junru, Shen Jiaming, Liu Tianqi, Wang Xuanhui
- DEEP-ICL: Definition-enriched Experts For Language Model In-context Learning Qu Xingwei, Liang Yiming, Wang Yucheng, Zheng Tianyu, Yue Tommy, Ma Lei, Huang Stephen W., Zhang Jiajun, Shi Yinan, Lin Chenghua, Fu Jie, Zhang Ge
- Masontigers At Semeval-2024 Task 9: Solving Puzzles With An Ensemble Of Chain-of-thoughts Raihan Md Nishat, Goswami Dhiman, Emran Al Nahian Bin, Puspo Sadiya Sayara Chowdhury, Ganguly Amrita, Zampieri Marcos
- Mitigating Exaggerated Safety In Large Language Models Ray Ruchira, Bhalani Ruchi
- Analyzing And Adapting Large Language Models For Few-shot Multilingual NLU: Are We There Yet? Razumovskaia Evgeniia, Vulić Ivan, Korhonen Anna
- Beyond Performance: Quantifying And Mitigating Label Bias In Llms Reif Yuval, Schwartz Roy
- Pairing Analogy-augmented Generation With Procedural Memory For Procedural Q&A Roth K, Gupta Rushil, Halle Simon, Liu Bang
- In-context Learning Demonstration Selection Via Influence Analysis S. Vinay M., Van Minh-hao, Wu Xintao
- Neurocache: Efficient Vector Retrieval For Long-range Language Modeling Safaya Ali, Yuret Deniz
- ICAL: Continual Learning Of Multimodal Agents By Transforming Trajectories Into Actionable Insights Sarch Gabriel, Jang Lawrence, Tarr Michael J., Cohen William W., Marino Kenneth, Fragkiadaki Katerina
- HELPER-X: A Unified Instructable Embodied Agent To Tackle Four Interactive Vision-language Domains With Memory-augmented Language Models Sarch Gabriel, Somani Sahil, Kapoor Raghav, Tarr Michael J., Fragkiadaki Katerina
- Improving Sentence Embeddings With Automatic Generation Of Training Data Using Few-shot Examples Sato Soma, Tsukagoshi Hayato, Sasano Ryohei, Takeda Koichi
- Robust Few-shot Transfer Learning For Knowledge Base Question Answering With Unanswerable Questions Sawhney Riya, Bhattacharya Indrajit, Mausam
- THREAD: Thinking Deeper With Recursive Spawning Schroeder Philip, Morgan Nathaniel, Luo Hongyin, Glass James
- Multi-modal Adapter For Vision-language Models Seputis Dominykas, Mihailov Serghei, Chatterjee Soham, Xiao Zehao
- Medpromptx: Grounded Multimodal Prompting For Chest X-ray Diagnosis Shaaban Mai A., Khan Adnan, Yaqub Mohammad
- Putting Gpt-4o To The Sword: A Comprehensive Evaluation Of Language, Vision, Speech, And Multimodal Proficiency Shahriar Sakib, Lund Brady, Mannuru Nishith Reddy, Arshad Muhammad Arbab, Hayawi Kadhim, Bevara Ravi Varma Kumar, Mannuru Aashrith, Batool Laiba
- Show, Don't Tell: Aligning Language Models With Demonstrated Feedback Shaikh Omar, Lam Michelle, Hejna Joey, Shao Yijia, Bernstein Michael, Yang Diyi
- Flexibly Scaling Large Language Models Contexts Through Extensible Tokenization Shao Ninglu, Xiao Shitao, Liu Zheng, Zhang Peitian
- Unirag: Universal Retrieval Augmentation For Multi-modal Large Language Models Sharifymoghaddam Sahel, Upadhyay Shivani, Chen Wenhu, Lin Jimmy
- Rumour Evaluation With Very Large Language Models Shehata Dahlia, Cohen Robin, Clarke Charles
- Can Language Models Explain Their Own Classification Behavior? Sherburn Dane, Chughtai Bilal, Evans Owain
- Ehragent: Code Empowers Large Language Models For Few-shot Complex Tabular Reasoning On Electronic Health Records Shi Wenqi, Xu Ran, Zhuang Yuchen, Yu Yue, Zhang Jieyu, Wu Hang, Zhu Yuanda, Ho Joyce, Yang Carl, Wang May D.
- Deconstructing In-context Learning: Understanding Prompts Via Corruption Shivagunde Namrata, Lialin Vladislav, Muckatira Sherin, Rumshisky Anna
- INDIC QA BENCHMARK: A Multilingual Benchmark To Evaluate Question Answering Capability Of Llms For Indic Languages Singh Abhishek Kumar, Murthy Rudra, Kumar Vishwajeet, Sen Jaydeep, Ramakrishnan Ganesh
- Comparative Analysis Of Different Efficient Fine Tuning Methods Of Large Language Models (llms) In Low-resource Setting Srinivasan Krishna Prasad Varadarajan, Gumpena Prasanth, Yattapu Madhusudhana, Brahmbhatt Vishal H.
- Exploring LLM Prompting Strategies For Joint Essay Scoring And Feedback Generation Stahl Maja, Biermann Leon, Nehring Andreas, Wachsmuth Henning
- The Fine-tuning Paradox: Boosting Translation Quality Without Sacrificing LLM Abilities Stap David, Hasler Eva, Byrne Bill, Monz Christof, Tran Ke
- Few-shot VQA With Frozen Llms: A Tale Of Two Approaches Sterner Igor, Lin Weizhe, Chen Jinghong, Byrne Bill
- Demonstration Augmentation For Zero-shot In-context Learning Su Yi, Tai Yunpeng, Ji Yixin, Li Juntao, Yan Bowen, Zhang Min
- Structured Chain-of-thought Prompting For Few-shot Generation Of Content-grounded QA Conversations Sultan Md Arafat, Ganhotra Jatin, Astudillo Ramón Fernandez
- Tuning-free Accountable Intervention For LLM Deployment -- A Metacognitive Approach Tan Zhen, Peng Jie, Chen Tianlong, Liu Huan
- Dynamic Contexts For Generating Suggestion Questions In RAG Based Conversational Systems Tayal Anuja, Tyagi Aman
- Interactive Prompt Debugging With Sequence Salience Tenney Ian, Mullins Ryan, Du Bin, Pandya Shree, Kahng Minsuk, Dixon Lucas
- Robustness Assessment Of Mathematical Reasoning In The Presence Of Missing And Contradictory Conditions Tian Shi-yu, Zhou Zhi, Jia Lin-han, Guo Lan-zhe, Li Yu-feng
- An Ecosage Assistant: Towards Building A Multimodal Plant Care Dialogue Assistant Tomar Mohit, Tiwari Abhisek, Saha Tulika, Jha Prince, Saha Sriparna
- Can Llms Learn From Previous Mistakes? Investigating Llms' Errors To Boost For Reasoning Tong Yongqi, Li Dawei, Wang Sizhe, Wang Yujia, Teng Fei, Shang Jingbo
- Few-shot Cross-lingual Transfer For Prompting Large Language Models In Low-resource Languages Toukmaji Christopher
- Leveraging Zero-shot Prompting For Efficient Language Model Distillation Vöge Lukas, Gurgul Vincent, Lessmann Stefan
- COSMO: Contrastive Streamlined Multimodal Model With Interleaved Pre-training Wang Alex Jinpeng, Li Linjie, Lin Kevin Qinghong, Wang Jianfeng, Lin Kevin, Yang Zhengyuan, Wang Lijuan, Shou Mike Zheng
- Interactive Dualchecker For Mitigating Hallucinations In Distilling Large Language Models Wang Meiyun, Suzuki Masahiro, Sakaji Hiroki, Izumi Kiyoshi
- Beyond The Known: Investigating Llms Performance On Out-of-domain Intent Detection Wang Pei, He Keqing, Wang Yejie, Song Xiaoshuai, Mou Yutao, Wang Jingang, Xian Yunsen, Cai Xunliang, Xu Weiran
- Effective Demonstration Annotation For In-context Learning Via Language Model-based Determinantal Point Process Wang Peng, Wang Xiaobin, Lou Chao, Mao Shengyu, Xie Pengjun, Jiang Yong
- SUBLLM: A Novel Efficient Architecture With Token Sequence Subsampling For LLM Wang Quandong, Yuan Yuxuan, Yang Xiaoyu, Zhang Ruike, Zhao Kang, Liu Wei, Luan Jian, Povey Daniel, Wang Bin
- USCD: Improving Code Generation Of Llms By Uncertainty-aware Selective Contrastive Decoding Wang Shuai, Ding Liang, Shen Li, Luo Yong, He Zheng, Yu Wei, Tao Dacheng
- Metatool: Facilitating Large Language Models To Master Tools With Meta-task Augmentation Wang Xiaohan, Li Dian, Zhao Yilin, Sinbadliu, Wang Hui
- Chain-of-thought Reasoning Without Prompting Wang Xuezhi, Zhou Denny
- Strategic Chain-of-thought: Guiding Accurate Reasoning In Llms Through Strategy Elicitation Wang Yu, Zhao Shiwan, Wang Zhihu, Huang Heyuan, Fan Ming, Zhang Yubo, Wang Zhixing, Wang Haijun, Liu Ting
- Exovip: Step-by-step Verification And Exploration With Exoskeleton Modules For Compositional Visual Reasoning Wang Yuxuan, Yuille Alan, Li Zhuowan, Zheng Zilong
- OPDAI At Semeval-2024 Task 6: Small Llms Can Accelerate Hallucination Detection With Weakly Supervised Data Wei Chengcheng, Chen Ze, Fang Songtan, He Jiarong, Gao Max
- Training-free Exponential Extension Of Sliding Window Context With Cascading KV Cache Willette Jeffrey, Lee Heejun, Lee Youngwan, Jeon Myeongjae, Hwang Sung Ju
- Towards Evaluating And Building Versatile Large Language Models For Medicine Wu Chaoyi, Qiu Pengcheng, Liu Jinxin, Gu Hongfei, Li Na, Zhang Ya, Wang Yanfeng, Xie Weidi
- Instructing Large Language Models To Identify And Ignore Irrelevant Conditions Wu Zhenyu, Shen Chao, Jiang Meng
- LESS: Selecting Influential Data For Targeted Instruction Tuning Xia Mengzhou, Malladi Sadhika, Gururangan Suchin, Arora Sanjeev, Chen Danqi
- Me Llama: Foundation Large Language Models For Medical Applications Xie Qianqian, Chen Qingyu, Chen Aokun, Peng Cheng, Hu Yan, Lin Fongci, Peng Xueqing, Huang Jimin, Zhang Jeffrey, Keloth Vipina, Zhou Xinyu, He Huan, Ohno-machado Lucila, Wu Yonghui, Xu Hua, Bian Jiang
- Interactive-kbqa: Multi-turn Interactions For Knowledge Base Question Answering With Large Language Models Xiong Guanming, Bao Junwei, Zhao Wen
- Strengthened Symbol Binding Makes Large Language Models Reliable Multiple-choice Selectors Xue Mengge, Hu Zhenyu, Liu Liqun, Liao Kuo, Li Shuang, Han Honglin, Zhao Meng, Yin Chengguo
- Do Large Language Models Understand Logic Or Just Mimick Context? Yan Junbing, Wang Chengyu, Huang Jun, Zhang Wei
- Exploring The LLM Journey From Cognition To Expression With Linear Representations Yan Yuzi, Li Jialian, Zhang Yipin, Yan Dong
- Aqa-bench: An Interactive Benchmark For Evaluating Llms' Sequential Reasoning Ability Yang Siwei, Zhao Bingchen, Xie Cihang
- Unleashing The Potential Of Large Language Models For Predictive Tabular Tasks In Data Science Yang Yazheng, Wang Yuqi, Sen Sankalok, Li Lei, Liu Qi
- Pre-trained Language Models Improve The Few-shot Prompt Ability Of Decision Transformer Yang Yu, Xu Pan
- Large Language Models Are Contrastive Reasoners Yao Liang
- BEAF: Observing Before-after Changes To Evaluate Hallucination In Vision-language Models Moon Ye-bin, Nam Hyeon-woo, Choi Wonseok, Oh Tae-hyun
- Exploring The Comprehension Of Chatgpt In Traditional Chinese Medicine Knowledge Li Yizhen, Huang Shaohan, Qi Jiaxing, Quan Lei, Han Dongran, Luan Zhongzhi
- Making The Most Of Your Model: Methods For Finetuning And Applying Pretrained Transformers Yoshida Davis
- CO3: Low-resource Contrastive Co-training For Generative Conversational Query Rewrite Yuan Yifei, Shi Chen, Wang Runze, Chen Liyi, Hu Renjun, Zhang Zengming, Jiang Feijun, Lam Wai
- Prompting Implicit Discourse Relation Annotation Yung Frances, Ahmad Mansoor, Scholman Merel, Demberg Vera
- Turkishmmlu: Measuring Massive Multitask Language Understanding In Turkish Yüksel Arda, Köksal Abdullatif, Şenel Lütfi Kerem, Korhonen Anna, Schütze Hinrich
- Building Accurate Translation-tailored Llms With Language Aware Instruction Tuning Zan Changtong, Ding Liang, Shen Li, Zhen Yibing, Liu Weifeng, Tao Dacheng
- Tell Me Why: Explainable Public Health Fact-checking With Large Language Models Zarharan Majid, Wullschleger Pascal, Kia Babak Behkam, Pilehvar Mohammad Taher, Foster Jennifer
- Read And Think: An Efficient Step-wise Multimodal Language Model For Document Understanding And Reasoning Zhang Jinxu
- In-context Principle Learning From Mistakes Zhang Tianjun, Madaan Aman, Gao Luyu, Zheng Steven, Mishra Swaroop, Yang Yiming, Tandon Niket, Alon Uri
- Promptfix: Few-shot Backdoor Removal Via Adversarial Prompt Tuning Zhang Tianrong, Xi Zhaohan, Wang Ting, Mitra Prasenjit, Chen Jinghui
- Leveraging Biases In Large Language Models: "bias-knn" For Effective Few-shot Learning Zhang Yong, Li Hanzhang, Li Zhitao, Cheng Ning, Li Ming, Xiao Jing, Wang Jianzong
- Exploring The Compositional Deficiency Of Large Language Models In Mathematical Reasoning Zhao Jun, Tong Jingqi, Mou Yurong, Zhang Ming, Zhang Qi, Huang Xuanjing
- A Survey Of Backdoor Attacks And Defenses On Large Language Models: Implications For Security Measures Zhao Shuai, Jia Meihuizi, Guo Zhongliang, Gan Leilei, Xu Xiaoyu, Wu Xiaobao, Fu Jie, Feng Yichao, Pan Fengjun, Tuan Luu Anh
- Using Pretrained Large Language Model With Prompt Engineering To Answer Biomedical Questions Zhou Wenxin, Ngo Thuy Hang
- Few-shot Adversarial Prompt Learning On Vision-language Models Zhou Yiwei, Xia Xiaobo, Lin Zhiwei, Han Bo, Liu Tongliang
- Prompting Large Language Models For Zero-shot Clinical Prediction With Structured Longitudinal Electronic Health Record Data Zhu Yinghao, Wang Zixiang, Gao Junyi, Tong Yuning, An Jingkun, Liao Weibin, Harrison Ewen M., Ma Liantao, Pan Chengwei
- INTERS: Unlocking The Power Of Large Language Models In Search With Instruction Tuning Zhu Yutao, Zhang Peitian, Zhang Chenghao, Chen Yifei, Xie Binyu, Liu Zheng, Wen Ji-rong, Dou Zhicheng
- VL-ICL Bench: The Devil In The Details Of Benchmarking Multimodal In-context Learning Zong Yongshuo, Bohdal Ondrej, Hospedales Timothy
🏷 Fine Tuning
- Google's Multilingual Neural Machine Translation System: Enabling Zero-shot Translation Johnson Melvin, Schuster Mike, Le Quoc V., Krikun Maxim, Wu Yonghui, Chen Zhifeng, Thorat Nikhil, Viégas Fernanda, Wattenberg Martin, Corrado Greg, Hughes Macduff, Dean Jeffrey
- An Exploration Of Neural Sequence-to-sequence Architectures For Automatic Post-editing Junczys-dowmunt Marcin, Grundkiewicz Roman
- Shortcut-stacked Sentence Encoders For Multi-domain Inference Nie Yixin, Bansal Mohit
- Tips And Tricks For Visual Question Answering: Learnings From The 2017 Challenge Teney Damien, Anderson Peter, He Xiaodong, Van Den Hengel Anton
- Transfer Learning Of Language-independent End-to-end ASR With Language Model Fusion Inaguma Hirofumi, Cho Jaejin, Baskar Murali Karthick, Kawahara Tatsuya, Watanabe Shinji
- SNU_IDS At Semeval-2018 Task 12: Sentence Encoder With Contextualized Vectors For Argument Reasoning Comprehension Kim Taeuk, Choi Jihun, Lee Sang-goo
- Polite Dialogue Generation Without Parallel Data Niu Tong, Bansal Mohit
- Like A Baby: Visually Situated Neural Language Acquisition Ororbia Alexander G., Mali Ankur, Kelly Matthew A., Reitter David
- Contextual Parameter Generation For Universal Neural Machine Translation Platanios Emmanouil Antonios, Sachan Mrinmaya, Neubig Graham, Mitchell Tom
- Cross-lingual Transfer Learning For Multilingual Task Oriented Dialog Schuster Sebastian, Gupta Sonal, Shah Rushin, Lewis Mike
- Multilingual NMT With A Language-independent Attention Bridge Vázquez Raúl, Raganato Alessandro, Tiedemann Jörg, Creutz Mathias
- Can You Tell Me How To Get Past Sesame Street? Sentence-level Pretraining Beyond Language Modeling Wang Alex, Hula Jan, Xia Patrick, Pappagari Raghavendra, Mccoy R. Thomas, Patel Roma, Kim Najoung, Tenney Ian, Huang Yinghui, Yu Katherin, Jin Shuning, Chen Berlin, Van Durme Benjamin, Grave Edouard, Pavlick Ellie, Bowman Samuel R.
- Abductive Reasoning As Self-supervision For Common Sense Question Answering Aakur Sathyanarayanan N., Sarkar Sudeep
- Boolq: Exploring The Surprising Difficulty Of Natural Yes/no Questions Clark Christopher, Lee Kenton, Chang Ming-wei, Kwiatkowski Tom, Collins Michael, Toutanova Kristina
- Exploring The Limits Of Transfer Learning With A Unified Text-to-text Transformer Raffel Colin, Shazeer Noam, Roberts Adam, Lee Katherine, Narang Sharan, Matena Michael, Zhou Yanqi, Li Wei, Liu Peter J.
- TANDA: Transfer And Adapt Pre-trained Transformer Models For Answer Sentence Selection Garg Siddhant, Vu Thuy, Moschitti Alessandro
- Reweighted Proximal Pruning For Large-scale Language Representation Guo Fu-ming, Liu Sijia, Mungall Finlay S., Lin Xue, Wang Yanzhi
- Fine-tuning By Curriculum Learning For Non-autoregressive Neural Machine Translation Guo Junliang, Tan Xu, Xu Linli, Qin Tao, Chen Enhong, Liu Tie-yan
- Visualizing And Understanding The Effectiveness Of BERT Hao Yaru, Dong Li, Wei Furu, Xu Ke
- Microsoft Translator At WMT 2019: Towards Large-scale Document-level Neural Machine Translation Junczys-dowmunt Marcin
- Sample Efficient Text Summarization Using A Single Pre-trained Transformer Khandelwal Urvashi, Clark Kevin, Jurafsky Dan, Kaiser Lukasz
- Investigating Multilingual NMT Representations At Scale Kudugunta Sneha Reddy, Bapna Ankur, Caswell Isaac, Arivazhagan Naveen, Firat Orhan
- What Would Elsa Do? Freezing Layers During Transformer Fine-tuning Lee Jaejun, Tang Raphael, Lin Jimmy
- Patent Claim Generation By Fine-tuning Openai GPT-2 Lee Jieh-sheng, Hsiang Jieh
- Bertsel: Answer Selection With Pre-trained Models Li Dongfang, Yu Yifei, Chen Qingcai, Li Xinyu
- Frustratingly Easy Natural Question Answering Pan Lin, Chakravarti Rishav, Ferritto Anthony, Glass Michael, Gliozzo Alfio, Roukos Salim, Florian Radu, Sil Avirup
- Language Models As Knowledge Bases? Petroni Fabio, Rocktäschel Tim, Lewis Patrick, Bakhtin Anton, Wu Yuxiang, Miller Alexander H., Riedel Sebastian
- Probing Natural Language Inference Models Through Semantic Fragments Richardson Kyle, Hu Hai, Moss Lawrence S., Sabharwal Ashish
- Masked Language Model Scoring Salazar Julian, Liang Davis, Nguyen Toan Q., Kirchhoff Katrin
- Distilbert, A Distilled Version Of BERT: Smaller, Faster, Cheaper And Lighter Sanh Victor, Debut Lysandre, Chaumond Julien, Wolf Thomas
- Generalization In Generation: A Closer Look At Exposure Bias Schmidt Florian
- Inducing Brain-relevant Bias In Natural Language Processing Models Schwartz Dan, Toneva Mariya, Wehbe Leila
- MASS: Masked Sequence To Sequence Pre-training For Language Generation Song Kaitao, Tan Xu, Qin Tao, Lu Jianfeng, Liu Tie-yan
- Cued@wmt19:ewc&lms Stahlberg Felix, Saunders Danielle, De Gispert Adria, Byrne Bill
- Sesamebert: Attention For Anywhere Su Ta-chun, Cheng Hsiang-chih
- Syntax-infused Transformer And BERT Models For Machine Translation And Natural Language Understanding Sundararaman Dhanasekar, Subramanian Vivek, Wang Guoyin, Si Shijing, Shen Dinghan, Wang Dong, Carin Lawrence
- Olmpics -- On What Language Model Pre-training Captures Talmor Alon, Elazar Yanai, Goldberg Yoav, Berant Jonathan
- LXMERT: Learning Cross-modality Encoder Representations From Transformers Tan Hao, Bansal Mohit
- Tree-structured Semantic Encoder With Knowledge Sharing For Domain Adaptation In Natural Language Generation Tseng Bo-hsiang, Budzianowski Paweł, Wu Yen-chen, Gašić Milica
- Well-read Students Learn Better: On The Importance Of Pre-training Compact Models Turc Iulia, Chang Ming-wei, Lee Kenton, Toutanova Kristina
- How Does BERT Answer Questions? A Layer-wise Analysis Of Transformer Representations Van Aken Betty, Winter Benjamin, Löser Alexander, Gers Felix A.
- Training Optimus Prime, M.D.: Generating Medical Certification Items By Fine-tuning Openai's Gpt2 Transformer Model Von Davier Matthias
- Structbert: Incorporating Language Structures Into Pre-training For Deep Language Understanding Wang Wei, Bi Bin, Yan Ming, Wu Chen, Bao Zuyi, Xia Jiangnan, Peng Liwei, Si Luo
- Tree Transformer: Integrating Tree Structures Into Self-attention Wang Yau-shian, Lee Hung-yi, Chen Yun-nung
- Structured Pruning Of Large Language Models Wang Ziheng, Wohlwend Jeremy, Lei Tao
- Transfertransfo: A Transfer Learning Approach For Neural Network Based Conversational Agents Wolf Thomas, Sanh Victor, Chaumond Julien, Delangue Clement
- Alternating Recurrent Dialog Model With Large-scale Pre-trained Language Models Wu Qingyang, Zhang Yichi, Li Yu, Yu Zhou
- Review Conversational Reading Comprehension Xu Hu, Liu Bing, Shu Lei, Yu Philip S.
- Towards Making The Most Of BERT In Neural Machine Translation Yang Jiacheng, Wang Mingxuan, Zhou Hao, Zhao Chengqi, Yu Yong, Zhang Weinan, Li Lei
- Deepening Hidden Representations From Pre-trained Language Models Yang Junjie, Zhao Hai
- Data Augmentation For BERT Fine-tuning In Open-domain Question Answering Yang Wei, Xie Yuqing, Tan Luchen, Xiong Kun, Li Ming, Lin Jimmy
- Model Compression With Two-stage Multi-teacher Knowledge Distillation For Web Question Answering System Yang Ze, Shou Linjun, Gong Ming, Lin Wutao, Jiang Daxin
- Q8BERT: Quantized 8bit BERT Zafrir Ofir, Boudoukh Guy, Izsak Peter, Wasserblat Moshe
- Sequence-to-sequence Pre-training With Data Augmentation For Sentence Rewriting Zhang Yi, Ge Tao, Wei Furu, Zhou Ming, Sun Xu
- Semantics-aware BERT For Language Understanding Zhang Zhuosheng, Wu Yuwei, Zhao Hai, Li Zuchao, Zhang Shuailiang, Zhou Xi, Zhou Xiang
- How Much Knowledge Can You Pack Into The Parameters Of A Language Model? Roberts Adam, Raffel Colin, Shazeer Noam
- PALM: Pre-training An Autoencoding&autoregressive Language Model For Context-conditioned Generation Bi Bin, Li Chenliang, Wu Chen, Yan Ming, Wang Wei, Huang Songfang, Huang Fei, Si Luo
- Chatbot Interaction With Artificial Intelligence: Human Data Augmentation With T5 And Language Transformer Ensemble For Text Classification Bird Jordan J., Ekárt Anikó, Faria Diego R.
- Language Models Are Few-shot Learners Brown Tom B., Mann Benjamin, Ryder Nick, Subbiah Melanie, Kaplan Jared, Dhariwal Prafulla, Neelakantan Arvind, Shyam Pranav, Sastry Girish, Askell Amanda, Agarwal Sandhini, Herbert-voss Ariel, Krueger Gretchen, Henighan Tom, Child Rewon, Ramesh Aditya, Ziegler Daniel M., Wu Jeffrey, Winter Clemens, Hesse Christopher, Chen Mark, Sigler Eric, Litwin Mateusz, Gray Scott, Chess Benjamin, Clark Jack, Berner Christopher, Mccandlish Sam, Radford Alec, Sutskever Ilya, Amodei Dario
- DIET: Lightweight Language Understanding For Dialogue Systems Bunk Tanja, Varshneya Daksh, Vlasov Vladimir, Nichol Alan
- Simultaneous Paraphrasing And Translation By Fine-tuning Transformer Models Chada Rakesh
- Model Selection For Cross-lingual Transfer Chen Yang, Ritter Alan
- Rethinking Embedding Coupling In Pre-trained Language Models Chung Hyung Won, Févry Thibault, Tsai Henry, Johnson Melvin, Ruder Sebastian
- Fine-tuning Pretrained Language Models: Weight Initializations, Data Orders, And Early Stopping Dodge Jesse, Ilharco Gabriel, Schwartz Roy, Farhadi Ali, Hajishirzi Hannaneh, Smith Noah
- A Comparison Of LSTM And BERT For Small Corpus Ezen-can Aysu
- The Pile: An 800GB Dataset Of Diverse Text For Language Modeling Gao Leo, Biderman Stella, Black Sid, Golding Laurence, Hoppe Travis, Foster Charles, Phang Jason, He Horace, Thite Anish, Nabeshima Noa, Presser Shawn, Leahy Connor
- ISAAQ -- Mastering Textbook Questions With Pre-trained Transformers And Bottom-up And Top-down Attention Gomez-perez Jose Manuel, Ortega Raul
- It's Not Greek To Mbert: Inducing Word-level Translations From Multilingual BERT Gonen Hila, Ravfogel Shauli, Elazar Yanai, Goldberg Yoav
- Dialogbert: Discourse-aware Response Generation Via Learning To Recover And Rank Utterances Gu Xiaodong, Yoo Kang Min, Ha Jung-woo
- Incorporating BERT Into Parallel Sequence Decoding With Adapters Guo Junliang, Zhang Zhirui, Xu Linli, Wei Hao-ran, Chen Boxing, Chen Enhong
- BERT Based Multilingual Machine Comprehension In English And Hindi Gupta Somil, Khade Nilesh
- ECONET: Effective Continual Pretraining Of Language Models For Event Temporal Reasoning Han Rujun, Ren Xiang, Peng Nanyun
- Towards Learning A Generic Agent For Vision-and-language Navigation Via Pre-training Hao Weituo, Li Chunyuan, Li Xiujun, Carin Lawrence, Gao Jianfeng
- Enriched Pre-trained Transformers For Joint Slot Filling And Intent Detection Hardalov Momchil, Koychev Ivan, Nakov Preslav
- Exploring BERT Parameter Efficiency On The Stanford Question Answering Dataset V2.0 Hulburd Eric
- How Can We Know When Language Models Know? On The Calibration Of Language Models For Question Answering Jiang Zhengbao, Araki Jun, Ding Haibo, Neubig Graham
- Text-to-text Pre-training For Data-to-text Tasks Kale Mihir, Rastogi Abhinav
- Current Limitations Of Language Models: What You Need Is Retrieval Komatsuzaki Aran
- Fid-ex: Improving Sequence-to-sequence Models For Extractive Rationale Generation Lakhotia Kushal, Paranjape Bhargavi, Ghoshal Asish, Yih Wen-tau, Mehdad Yashar, Iyer Srinivasan
- Utilizing Bidirectional Encoder Representations From Transformers For Answer Selection Laskar Md Tahmid Rahman, Hoque Enamul, Huang Jimmy Xiangji
- Common Sense Or World Knowledge? Investigating Adapter-based Knowledge Injection Into Pretrained Transformers Lauscher Anne, Majewska Olga, Ribeiro Leonardo F. R., Gurevych Iryna, Rozanov Nikolai, Glavaš Goran
- From Zero To Hero: On The Limitations Of Zero-shot Cross-lingual Transfer With Multilingual Transformers Lauscher Anne, Ravishankar Vinit, Vulić Ivan, Glavaš Goran
- An Empirical Investigation Of Pre-trained Transformer Language Models For Open-domain Dialogue Generation Li Piji
- Multilingual Speech Translation With Efficient Finetuning Of Pretrained Models Li Xian, Wang Changhan, Tang Yun, Tran Chau, Tang Yuqing, Pino Juan, Baevski Alexei, Conneau Alexis, Auli Michael
- Mintl: Minimalist Transfer Learning For Task-oriented Dialogue Systems Lin Zhaojiang, Madotto Andrea, Winata Genta Indra, Fung Pascale
- Qiaoning At Semeval-2020 Task 4: Commonsense Validation And Explanation System Based On Ensemble Of Language Model Liu Pai
- Adversarial Training For Large Neural Language Models Liu Xiaodong, Cheng Hao, He Pengcheng, Chen Weizhu, Wang Yu, Poon Hoifung, Gao Jianfeng
- VECO: Variable And Flexible Cross-lingual Pre-training For Language Understanding And Generation Luo Fuli, Wang Wei, Liu Jiahao, Liu Yijia, Bi Bin, Huang Songfang, Huang Fei, Si Luo
- Language Models As Few-shot Learner For Task-oriented Dialogue Systems Madotto Andrea, Liu Zihan, Lin Zhaojiang, Fung Pascale
- Dialoglue: A Natural Language Understanding Benchmark For Task-oriented Dialogue Mehri Shikib, Eric Mihail, Hakkani-tur Dilek
- Generation-distillation For Efficient Natural Language Understanding In Low-data Settings Melas-kyriazi Luke, Han George, Liang Celine
- What Happens To BERT Embeddings During Fine-tuning? Merchant Amil, Rahimtoroghi Elahe, Pavlick Ellie, Tenney Ian
- On The Stability Of Fine-tuning BERT: Misconceptions, Explanations, And Strong Baselines Mosbach Marius, Andriushchenko Maksym, Klakow Dietrich
- Avgout: A Simple Output-probability Measure To Eliminate Dull Responses Niu Tong, Bansal Mohit
- The Chess Transformer: Mastering Play Using Generative Language Models Noever David, Ciolino Matt, Kalin Josh
- Facts2story: Controlling Text Generation By Key Facts Orbach Eyal, Goldberg Yoav
- Retrieval-augmented Generation For Knowledge-intensive NLP Tasks Lewis Patrick, Perez Ethan, Piktus Aleksandra, Petroni Fabio, Karpukhin Vladimir, Goyal Naman, Küttler Heinrich, Lewis Mike, Yih Wen-tau, Rocktäschel Tim, Riedel Sebastian, Kiela Douwe
- SOLOIST: Building Task Bots At Scale With Transfer Learning And Machine Teaching Peng Baolin, Li Chunyuan, Li Jinchao, Shayandeh Shahin, Liden Lars, Gao Jianfeng
- What Does BERT Know About Books, Movies And Music? Probing BERT For Conversational Recommendation Penha Gustavo, Hauff Claudia
- Adapterhub: A Framework For Adapting Transformers Pfeiffer Jonas, Rücklé Andreas, Poth Clifton, Kamath Aishwarya, Vulić Ivan, Ruder Sebastian, Cho Kyunghyun, Gurevych Iryna
- Intermediate-task Transfer Learning With Pretrained Models For Natural Language Understanding: When And Why Does It Work? Pruksachatkun Yada, Phang Jason, Liu Haokun, Htut Phu Mon, Zhang Xiaoyi, Pang Richard Yuanzhe, Vania Clara, Kann Katharina, Bowman Samuel R.
- Jiant: A Software Toolkit For Research On General-purpose Text Understanding Models Pruksachatkun Yada, Yeres Phil, Liu Haokun, Phang Jason, Htut Phu Mon, Wang Alex, Tenney Ian, Bowman Samuel R.
- How Fine Can Fine-tuning Be? Learning Efficient Language Models Radiya-dixit Evani, Wang Xin
- Fine-tuning BERT For Schema-guided Zero-shot Dialogue State Tracking Ruan Yu-ping, Ling Zhen-hua, Gu Jia-chen, Liu Quan
- End-to-end Synthetic Data Generation For Domain Adaptation Of Question Answering Systems Shakeri Siamak, Santos Cicero Nogueira Dos, Zhu Henry, Ng Patrick, Nan Feng, Wang Zhiguo, Nallapati Ramesh, Xiang Bing
- Data-efficient Methods For Dialogue Systems Shalyminov Igor
- Hybrid Generative-retrieval Transformers For Dialogue Domain Adaptation Shalyminov Igor, Sordoni Alessandro, Atkinson Adam, Schulz Hannes
- A Simple But Tough-to-beat Data Augmentation Approach For Natural Language Understanding And Generation Shen Dinghan, Zheng Mingzhi, Shen Yelong, Qu Yanru, Chen Weizhu
- Pre-trained Summarization Distillation Shleifer Sam, Rush Alexander M.
- Lightpaff: A Two-stage Distillation Framework For Pre-training And Fine-tuning Song Kaitao, Sun Hao, Tan Xu, Qin Tao, Lu Jianfeng, Liu Hongzhi, Liu Tie-yan
- Recipes For Adapting Pre-trained Monolingual And Multilingual Models To Machine Translation Stickland Asa Cooper, Li Xian, Ghazvininejad Marjan
- Discovering Useful Sentence Representations From Large Pretrained Language Models Subramani Nishant, Suresh Nivedita
- Explain And Improve: Lrp-inference Fine-tuning For Image Captioning Models Sun Jiamei, Lapuschkin Sebastian, Samek Wojciech, Binder Alexander
- Vokenization: Improving Language Understanding With Contextualized, Visual-grounded Supervision Tan Hao, Bansal Mohit
- The Language Interpretability Tool: Extensible, Interactive Visualizations And Analysis For NLP Models Tenney Ian, Wexler James, Bastings Jasmijn, Bolukbasi Tolga, Coenen Andy, Gehrmann Sebastian, Jiang Ellen, Pushkarna Mahima, Radebaugh Carey, Reif Emily, Yuan Ann
- Making Pre-trained Language Models Better Few-shot Learners Tianyu Gao, Adam Fisch, Danqi Chen
- Language Models Are Few-shot Learners Tom B. Brown, Benjamin Mann, Nick Ryder, Melanie Subbiah, Jared Kaplan, Prafulla Dhariwal, Arvind Neelakantan, Pranav Shyam, Girish Sastry, Amanda Askell, Sandhini Agarwal, Ariel Herbert-voss, Gretchen Krueger, Tom Henighan, Rewon Child, Aditya Ramesh, Daniel M. Ziegler, Jeffrey Wu, Clemens Winter, Christopher Hesse, Mark Chen, Eric Sigler, Mateusz Litwin, Scott Gray, Benjamin Chess, Jack Clark, Christopher Berner, Sam Mccandlish, Alec Radford, Ilya Sutskever, Dario Amodei
- Unsupervised Pretraining For Neural Machine Translation Using Elastic Weight Consolidation Variš Dušan, Bojar Ondřej
- Bertnesia: Investigating The Capture And Forgetting Of Knowledge In BERT Wallat Jonas, Singh Jaspreet, Anand Avishek
- Infobert: Improving Robustness Of Language Models From An Information Theoretic Perspective Wang Boxin, Wang Shuohang, Cheng Yu, Gan Zhe, Jia Ruoxi, Li Bo, Liu Jingjing
- Minilm: Deep Self-attention Distillation For Task-agnostic Compression Of Pre-trained Transformers Wang Wenhui, Wei Furu, Dong Li, Bao Hangbo, Yang Nan, Zhou Ming
- ERNIE-GEN: An Enhanced Multi-flow Pre-training And Fine-tuning Framework For Natural Language Generation Xiao Dongling, Zhang Han, Li Yukun, Sun Yu, Tian Hao, Wu Hua, Wang Haifeng
- Improving BERT Fine-tuning Via Self-ensemble And Self-distillation Xu Yige, Qiu Xipeng, Zhou Ligao, Huang Xuanjing
- Seqdialn: Sequential Visual Dialog Networks In Joint Visual-linguistic Representation Space Yang Liu
- On The Generation Of Medical Dialogues For COVID-19 Yang Wenmian, Zeng Guangtao, Tan Bowen, Ju Zeqian, Chakravorty Subrato, He Xuehai, Chen Shu, Yang Xingyi, Wu Qingyang, Yu Zhou, Xing Eric, Xie Pengtao
- Adding Recurrence To Pretrained Transformers For Improved Efficiency And Context Size Yoshida Davis, Ettinger Allyson, Gimpel Kevin
- Cocolm: Complex Commonsense Enhanced Language Model With Discourse Relations Yu Changlong, Zhang Hongming, Song Yangqiu, Ng Wilfred
- GOBO: Quantizing Attention-based NLP Models For Low Latency And Energy Efficient Inference Zadeh Ali Hadi, Edo Isak, Awad Omar Mohamed, Moshovos Andreas
- When Do You Need Billions Of Words Of Pretraining Data? Zhang Yian, Warstadt Alex, Li Haau-sing, Bowman Samuel R.
- Know What You Don't Need: Single-shot Meta-pruning For Attention Heads Zhang Zhengyan, Qi Fanchao, Liu Zhiyuan, Liu Qun, Sun Maosong
- Pre-training Text-to-text Transformers For Concept-centric Common Sense Zhou Wangchunshu, Lee Dong-ho, Selvam Ravi Kiran, Lee Seyeon, Lin Bill Yuchen, Ren Xiang
- Learning To Compare For Better Training And Evaluation Of Open Domain Natural Language Generation Models Zhou Wangchunshu, Xu Ke
- Modifying Memories In Transformer Models Zhu Chen, Rawat Ankit Singh, Zaheer Manzil, Bhojanapalli Srinadh, Li Daliang, Yu Felix, Kumar Sanjiv
- Ext5: Towards Extreme Multi-task Scaling For Transfer Learning Aribandi Vamsi, Tay Yi, Schuster Tal, Rao Jinfeng, Zheng Huaixiu Steven, Mehta Sanket Vaibhav, Zhuang Honglei, Tran Vinh Q., Bahri Dara, Ni Jianmo, Gupta Jai, Hui Kai, Ruder Sebastian, Metzler Donald
- Efficient Large Scale Language Modeling With Mixtures Of Experts Artetxe Mikel, Bhosale Shruti, Goyal Naman, Mihaylov Todor, Ott Myle, Shleifer Sam, Lin Xi Victoria, Du Jingfei, Iyer Srinivasan, Pasunuru Ramakanth, Anantharaman Giri, Li Xian, Chen Shuohui, Akin Halil, Baines Mandeep, Martin Louis, Zhou Xing, Koura Punit Singh, O'horo Brian, Wang Jeff, Zettlemoyer Luke, Diab Mona, Kozareva Zornitsa, Stoyanov Ves
- See, Hear, Read: Leveraging Multimodality With Guided Attention For Abstractive Text Summarization Atri Yash Kumar, Pramanick Shraman, Goyal Vikram, Chakraborty Tanmoy
- S2s-ft: Fine-tuning Pretrained Transformer Encoders For Sequence-to-sequence Learning Bao Hangbo, Dong Li, Wang Wenhui, Yang Nan, Wei Furu
- Gabert -- An Irish Language Model Barry James, Wagner Joachim, Cassidy Lauren, Cowap Alan, Lynn Teresa, Walsh Abigail, Meachair Mícheál J. Ó, Foster Jennifer
- Continual Learning In Multilingual NMT Via Language-specific Embeddings Berard Alexandre
- SHAQ: Single Headed Attention With Quasi-recurrence Bharwani Nashwin, Kushner Warren, Dandona Sangeet, Schreiber Ben
- On The Universality Of Deep Contextual Language Models Bhatt Shaily, Goyal Poonam, Dandapat Sandipan, Choudhury Monojit, Sitaram Sunayana
- Improving Language Models By Retrieving From Trillions Of Tokens Borgeaud Sebastian, Mensch Arthur, Hoffmann Jordan, Cai Trevor, Rutherford Eliza, Millican Katie, Driessche George Van Den, Lespiau Jean-baptiste, Damoc Bogdan, Clark Aidan, Casas Diego De Las, Guy Aurelia, Menick Jacob, Ring Roman, Hennigan Tom, Huang Saffron, Maggiore Loren, Jones Chris, Cassirer Albin, Brock Andy, Paganini Michela, Irving Geoffrey, Vinyals Oriol, Osindero Simon, Simonyan Karen, Rae Jack W., Elsen Erich, Sifre Laurent
- On The Prunability Of Attention Heads In Multilingual BERT Budhraja Aakriti, Pande Madhura, Kumar Pratyush, Khapra Mitesh M.
- Implicit Premise Generation With Discourse-aware Commonsense Knowledge Models Chakrabarty Tuhin, Trivedi Aadit, Muresan Smaranda
- Multimodal Incremental Transformer With Visual Grounding For Visual Dialogue Generation Chen Feilong, Meng Fandong, Chen Xiuyi, Li Peng, Zhou Jie
- Towards Making The Most Of Multilingual Pretraining For Zero-shot Neural Machine Translation Chen Guanhua, Ma Shuming, Chen Yun, Zhang Dongdong, Pan Jia, Wang Wenping, Wei Furu
- Ensemble Fine-tuned Mbert For Translation Quality Estimation Chowdhury Shaika, Baili Naouel, Vannah Brian
- Enjoy The Salience: Towards Better Transformer-based Faithful Explanations With Word Salience Chrysostomou George, Aletras Nikolaos
- Control Prefixes For Parameter-efficient Text Generation Clive Jordan, Cao Kris, Rei Marek
- YANMTT: Yet Another Neural Machine Translation Toolkit Dabre Raj, Sumita Eiichiro
- Knowledge Neurons In Pretrained Transformers Dai Damai, Dong Li, Hao Yaru, Sui Zhifang, Chang Baobao, Wei Furu
- Adapting Monolingual Models: Data Can Be Scarce When Language Similarity Is High De Vries Wietse, Bartelds Martijn, Nissim Malvina, Wieling Martijn
- A Primer On Pretrained Multilingual Language Models Doddapaneni Sumanth, Ramesh Gowtham, Khapra Mitesh M., Kunchukuttan Anoop, Kumar Pratyush
- Lora: Low-rank Adaptation Of Large Language Models Edward J. Hu, Yelong Shen, Phillip Wallis, Zeyuan Allen-zhu, Yuanzhi Li, Shean Wang, Lu Wang, Weizhu Chen
- Compressing Visual-linguistic Model Via Knowledge Distillation Fang Zhiyuan, Wang Jianfeng, Hu Xiaowei, Wang Lijuan, Yang Yezhou, Liu Zicheng
- Cross-attention Is All You Need: Adapting Pretrained Transformers For Machine Translation Gheini Mozhdeh, Ren Xiang, May Jonathan
- LAWDR: Language-agnostic Weighted Document Representations From Pre-trained Models Gong Hongyu, Chaudhary Vishrav, Tang Yuqing, Guzmán Francisco
- Memory-efficient Transformers Via Top-\(k\) Attention Gupta Ankit, Dar Guy, Goodman Shaya, Ciprut David, Berant Jonathan
- WARP: Word-level Adversarial Reprogramming Hambardzumyan Karen, Khachatrian Hrant, May Jonathan
- Investigating Transfer Learning In Multilingual Pre-trained Language Models Through Chinese Natural Language Inference Hu Hai, Zhou He, Tian Zuoyu, Zhang Yiwen, Ma Yina, Li Yanting, Nie Yixin, Richardson Kyle
- Transfer Learning For Sequence Generation: From Single-source To Multi-source Huang Xuancheng, Xu Jingfang, Sun Maosong, Liu Yang
- DS-TOD: Efficient Domain Specialization For Task Oriented Dialog Hung Chia-chien, Lauscher Anne, Ponzetto Simone Paolo, Glavaš Goran
- A Good Prompt Is Worth Millions Of Parameters: Low-resource Prompt-based Learning For Vision-language Models Jin Woojeong, Cheng Yu, Shen Yelong, Chen Weizhu, Ren Xiang
- AMMUS: A Survey Of Transformer-based Pretrained Models In Natural Language Processing Kalyan Katikapalli Subramanyam, Rajasekharan Ajit, Sangeetha Sivanesan
- Generating Rich Product Descriptions For Conversational E-commerce Systems Kedia Shashank, Mantha Aditya, Gupta Sneha, Guo Stephen, Achan Kannan
- How Transferable Are Reasoning Patterns In VQA? Kervadec Corentin, Jaunet Theo, Antipov Grigory, Baccouche Moez, Vuillemot Romain, Wolf Christian
- Controlling Conditional Language Models Without Catastrophic Forgetting Korbak Tomasz, Elsahar Hady, Kruszewski German, Dymetman Marc
- Towards Few-shot Fact-checking Via Perplexity Lee Nayeon, Bang Yejin, Madotto Andrea, Khabsa Madian, Fung Pascale
- Exploiting Sentence-level Representations For Passage Ranking Leonhardt Jurek, Beringer Fabian, Anand Avishek
- A Short Study On Compressing Decoder-based Language Models Li Tianda, Mesbahi Yassir El, Kobyzev Ivan, Rashid Ahmad, Mahmud Atif, Anchuri Nithin, Hajimolahoseini Habib, Liu Yang, Rezagholizadeh Mehdi
- Scheduled Sampling In Vision-language Pretraining With Decoupled Encoder-decoder Network Li Yehao, Pan Yingwei, Yao Ting, Chen Jingwen, Mei Tao
- Truthfulqa: Measuring How Models Mimic Human Falsehoods Lin Stephanie, Hilton Jacob, Evans Owain
- Variational Latent-state GPT For Semi-supervised Task-oriented Dialog Systems Liu Hong, Cai Yucheng, Lin Zhenru, Ou Zhijian, Huang Yi, Feng Junlan
- Unified Multimodal Pre-training And Prompt-based Tuning For Vision-language Understanding And Generation Liu Tianyi, Wu Zuxuan, Xiong Wenhan, Chen Jingjing, Jiang Yu-gang
- Learning To Ask Conversational Questions By Optimizing Levenshtein Distance Liu Zhongkun, Ren Pengjie, Chen Zhumin, Ren Zhaochun, De Rijke Maarten, Zhou Ming
- X-METRA-ADA: Cross-lingual Meta-transfer Learning Adaptation To Natural Language Understanding And Question Answering M'hamdi Meryem, Kim Doo Soon, Dernoncourt Franck, Bui Trung, Ren Xiang, May Jonathan
- Schema-guided Paradigm For Zero-shot Dialog Mehri Shikib, Eskenazi Maxine
- Recent Advances In Natural Language Processing Via Large Pre-trained Language Models: A Survey Min Bonan, Ross Hayley, Sulem Elior, Veyseh Amir Pouran Ben, Nguyen Thien Huu, Sainz Oscar, Agirre Eneko, Heinz Ilana, Roth Dan
- First Align, Then Predict: Understanding The Cross-lingual Ability Of Multilingual BERT Muller Benjamin, Elazar Yanai, Sagot Benoît, Seddah Djamé
- Arat5: Text-to-text Transformers For Arabic Language Generation Nagoudi El Moatez Billah, Elmadany Abdelrahim, Abdul-mageed Muhammad
- Sentence-t5: Scalable Sentence Encoders From Pre-trained Text-to-text Models Ni Jianmo, Ábrego Gustavo Hernández, Constant Noah, Ma Ji, Hall Keith B., Cer Daniel, Yang Yinfei
- Investigating Numeracy Learning Ability Of A Text-to-text Transfer Model Pal Kuntal Kumar, Baral Chitta
- Probing For Bridging Inference In Transformer Language Models Pandit Onkar, Hou Yufang
- Attentive Fine-tuning Of Transformers For Translation Of Low-resourced Languages @loresmt 2021 Puranik Karthik, Hande Adeep, Priyadharshini Ruba, Durairaj Thenmozhi, Sampath Anbukkarasi, Thamburaj Kingston Pal, Chakravarthi Bharathi Raja
- Robertuito: A Pre-trained Language Model For Social Media Text In Spanish Pérez Juan Manuel, Furman Damián A., Alemany Laura Alonso, Luque Franco
- Grid Search Hyperparameter Benchmarking Of BERT, ALBERT, And Longformer On Duorc Quijano Alex John, Nguyen Sam, Ordonez Juanita
- Teaching Autoregressive Language Models Complex Tasks By Demonstration Recchia Gabriel
- Webgpt: Browser-assisted Question-answering With Human Feedback Reiichiro Nakano, Jacob Hilton, Suchir Balaji, Jeff Wu, Long Ouyang, Christina Kim, Christopher Hesse, Shantanu Jain, Vineet Kosaraju, William Saunders, Xu Jiang, Karl Cobbe, Tyna Eloundou, Gretchen Krueger, Kevin Button, Matthew Knight, Benjamin Chess, John Schulman
- Multiversal Views On Language Models Reynolds Laria, Mcdonell Kyle
- Efficient Domain Adaptation Of Language Models Via Adaptive Tokenization Sachidananda Vin, Kessler Jason S., Lai Yi-an
- Large Pre-trained Language Models Contain Human-like Biases Of What Is Right And Wrong To Do Schramowski Patrick, Turan Cigdem, Andersen Nico, Rothkopf Constantin A., Kersting Kristian
- Better Neural Machine Translation By Extracting Linguistic Information From BERT Shavarani Hassan S., Sarkar Anoop
- How Much Can CLIP Benefit Vision-and-language Tasks? Shen Sheng, Li Liunian Harold, Tan Hao, Bansal Mohit, Rohrbach Anna, Chang Kai-wei, Yao Zhewei, Keutzer Kurt
- Exploring Low-cost Transformer Model Compression For Large-scale Commercial Reply Suggestions Shrivastava Vaishnavi, Gaonkar Radhika, Gupta Shashank, Jha Abhishek
- Txt: Crossmodal End-to-end Learning With Transformers Steitz Jan-martin O., Pfeiffer Jonas, Gurevych Iryna, Roth Stefan
- Corelm: Coreference-aware Language Model Fine-tuning Stylianou Nikolaos, Vlahavas Ioannis
- ERNIE 3.0: Large-scale Knowledge Enhanced Pre-training For Language Understanding And Generation Sun Yu, Wang Shuohuan, Feng Shikun, Ding Siyu, Pang Chao, Shang Junyuan, Liu Jiaxiang, Chen Xuyi, Zhao Yanbin, Lu Yuxiang, Liu Weixin, Wu Zhihua, Gong Weibao, Liang Jianzhong, Shang Zhizhou, Sun Peng, Liu Wei, Ouyang Xuan, Yu Dianhai, Tian Hao, Wu Hua, Wang Haifeng
- Diff-explainer: Differentiable Convex Optimization For Explainable Multi-hop Inference Thayaparan Mokanarangan, Valentino Marco, Ferreira Deborah, Rozanova Julia, Freitas André
- Convfit: Conversational Fine-tuning Of Pretrained Language Models Vulić Ivan, Su Pei-hao, Coope Sam, Gerz Daniela, Budzianowski Paweł, Casanueva Iñigo, Mrkšić Nikola, Wen Tsung-hsien
- List: Lite Prompted Self-training Makes Parameter-efficient Few-shot Learners Wang Yaqing, Mukherjee Subhabrata, Liu Xiaodong, Gao Jing, Awadallah Ahmed Hassan, Gao Jianfeng
- Distilled Dual-encoder Model For Vision-language Understanding Wang Zekun, Wang Wenhui, Zhu Haichao, Liu Ming, Qin Bing, Wei Furu
- On Explaining Your Explanations Of BERT: An Empirical Study With Sequence Classification Wu Zhengxuan, Ong Desmond C.
- Prefix-tuning: Optimizing Continuous Prompts For Generation Xiang Lisa Li, Percy Liang
- P-tuning V2: Prompt Tuning Can Be Comparable To Fine-tuning Universally Across Scales And Tasks Xiao Liu, Kaixuan Ji, Yicheng Fu, Weng Lam Tam, Zhengxiao Du, Zhilin Yang, Jie Tang
- From Dense To Sparse: Contrastive Pruning For Better Pre-trained Language Model Compression Xu Runxin, Luo Fuli, Wang Chengyu, Chang Baobao, Huang Jun, Huang Songfang, Huang Fei
- Learning To Generate Task-specific Adapters From Task Description Ye Qinyuan, Ren Xiang
- Gpt3mix: Leveraging Large-scale Language Models For Text Augmentation Yoo Kang Min, Park Dongju, Kang Jaewook, Lee Sang-woo, Park Woomyeong
- On The Interplay Between Fine-tuning And Composition In Transformers Yu Lang, Ettinger Allyson
- Prune Once For All: Sparse Pre-trained Language Models Zafrir Ofir, Larey Ariel, Boudoukh Guy, Shen Haihao, Wasserblat Moshe
- Bitfit: Simple Parameter-efficient Fine-tuning For Transformer-based Masked Language-models Zaken Elad Ben, Ravfogel Shauli, Goldberg Yoav
- Topical Language Generation Using Transformers Zandie Rohola, Mahoor Mohammad H.
- A Knowledge-grounded Dialog System Based On Pre-trained Language Models Zhang Weijie, Chen Jiaoxuan, Wu Haipang, Wan Sanhui, Li Gongfeng
- DSGPT: Domain-specific Generative Pre-training Of Transformers For Text Generation In E-commerce Title And Review Summarization Zhang Xueying, Jiang Yunjiang, Shang Yue, Cheng Zhaomeng, Zhang Chi, Fan Xiaochuan, Xiao Yun, Long Bo
- An Exploratory Study On Long Dialogue Summarization: What Works And What's Next Zhang Yusen, Ni Ansong, Yu Tao, Zhang Rui, Zhu Chenguang, Deb Budhaditya, Celikyilmaz Asli, Awadallah Ahmed Hassan, Radev Dragomir
- CPM-2: Large-scale Cost-effective Pre-trained Language Models Zhengyan Zhang, Yuxian Gu, Xu Han, Shengqi Chen, Chaojun Xiao, Zhenbo Sun, Yuan Yao, Fanchao Qi, Jian Guan, Pei Ke, Yanzheng Cai, Guoyang Zeng, Zhixing Tan, Zhiyuan Liu, Minlie Huang, Wentao Han, Yang Liu, Xiaoyan Zhu, Maosong Sun
- Adapting Language Models For Zero-shot Learning By Meta-tuning On Dataset And Prompt Collections Zhong Ruiqi, Lee Kristy, Zhang Zheng, Klein Dan
- LAFITE: Towards Language-free Training For Text-to-image Generation Zhou Yufan, Zhang Ruiyi, Chen Changyou, Li Chunyuan, Tensmeyer Chris, Yu Tong, Gu Jiuxiang, Xu Jinhui, Sun Tong
- Making Large Language Models Interactive: A Pioneer Study On Supporting Complex Information-seeking Tasks With Implicit Constraints Ahmadvand Ali, Arabzadeh Negar, Kiseleva Julia, Sanz Patricio Figueroa, Deng Xin, Jauhar Sujay, Gamon Michael, Agichtein Eugene, Friend Ned, Aniruddha
- Input-tuning: Adapting Unfamiliar Inputs To Frozen Pretrained Models An Shengnan, Li Yifei, Lin Zeqi, Liu Qian, Chen Bei, Fu Qiang, Chen Weizhu, Zheng Nanning, Lou Jian-guang
- A Comparative Study On Language Models For Task-oriented Dialogue Systems Andreas Vinsen Marselino, Winata Genta Indra, Purwarianti Ayu
- Selection-inference: Exploiting Large Language Models For Interpretable Logical Reasoning Antonia Creswell, Murray Shanahan, Irina Higgins
- ATTEMPT: Parameter-efficient Multi-task Tuning Via Attentional Mixtures Of Soft Prompts Asai Akari, Salehi Mohammadreza, Peters Matthew E., Hajishirzi Hannaneh
- Parameter-efficient Finetuning Of Transformers For Source Code Ayupov Shamil, Chirkova Nadezhda
- Evidence > Intuition: Transferability Estimation For Encoder Selection Bassignana Elisa, Müller-eberstein Max, Zhang Mike, Plank Barbara
- Effective Cross-task Transfer Learning For Explainable Natural Language Inference With T5 Bigoulaeva Irina, Sachdeva Rachneet, Madabushi Harish Tayyar, Villavicencio Aline, Gurevych Iryna
- Using Cognitive Psychology To Understand GPT-3 Binz Marcel, Schulz Eric
- Inpars: Data Augmentation For Information Retrieval Using Large Language Models Bonifacio Luiz, Abonizio Hugo, Fadaee Marzieh, Nogueira Rodrigo
- Looking For A Handsome Carpenter! Debiasing GPT-3 Job Advertisements Borchers Conrad, Gala Dalia Sara, Gilburt Benjamin, Oravkin Eduard, Bounsi Wilfried, Asano Yuki M., Kirk Hannah Rose
- Petals: Collaborative Inference And Fine-tuning Of Large Models Borzunov Alexander, Baranchuk Dmitry, Dettmers Tim, Ryabinin Max, Belkada Younes, Chumachenko Artem, Samygin Pavel, Raffel Colin
- Understanding Performance Of Long-document Ranking Models Through Comprehensive Evaluation And Leaderboarding Boytsov Leonid, Akinpelu David, Lin Tianyi, Gao Fangwei, Zhao Yutian, Huang Jeffrey, Katyal Nipun, Nyberg Eric
- Evaluating The Susceptibility Of Pre-trained Language Models Via Handcrafted Adversarial Examples Branch Hezekiah J., Cefalu Jonathan Rodriguez, Mchugh Jeremy, Hujer Leyla, Bahl Aditya, Iglesias Daniel Del Castillo, Heichman Ron, Darwishi Ramesh
- Large Language Models Can Implement Policy Iteration Brooks Ethan, Walls Logan, Lewis Richard L., Singh Satinder
- Robust Preference Learning For Storytelling Via Contrastive Reinforcement Learning Castricato Louis, Havrilla Alexander, Matiana Shahbuland, Pieler Michael, Ye Anbang, Yang Ian, Frazier Spencer, Riedl Mark
- Speechprompt: An Exploration Of Prompt Tuning On Generative Spoken Language Model For Speech Processing Tasks Chang Kai-wei, Tseng Wei-cheng, Li Shang-wen, Lee Hung-yi
- Xdoc: Unified Pre-training For Cross-format Document Understanding Chen Jingye, Lv Tengchao, Cui Lei, Zhang Cha, Wei Furu
- Transformers Go For The Lols: Generating (humourous) Titles From Scientific Abstracts End-to-end Chen Yanran, Eger Steffen
- Empowering Parameter-efficient Transfer Learning By Recognizing The Kernel Structure In Self-attention Chen Yifan, Hazarika Devamanyu, Namazifar Mahdi, Liu Yang, Jin Di, Hakkani-tur Dilek
- Testing Occupational Gender Bias In Language Models: Towards Robust Measurement And Zero-shot Debiasing Chen Yuen, Raghuram Vethavikashini Chithrra, Mattern Justus, Sachan Mrinmaya, Mihalcea Rada, Schölkopf Bernhard, Jin Zhijing
- Is Multiwoz A Solved Task? An Interactive TOD Evaluation Framework With User Simulator Cheng Qinyuan, Li Linyang, Quan Guofeng, Gao Feng, Mou Xiaofeng, Qiu Xipeng
- Language-family Adapters For Low-resource Multilingual Neural Machine Translation Chronopoulou Alexandra, Stojanovski Dario, Fraser Alexander
- M6-rec: Generative Pretrained Language Models Are Open-ended Recommender Systems Cui Zeyu, Ma Jianxin, Zhou Chang, Zhou Jingren, Yang Hongxia
- Understanding BLOOM: An Empirical Study On Diverse NLP Tasks Dakle Parag Pravin, Rallabandi Saikrishna, Raghavan Preethi
- Less Is More: Linear Layers On CLIP Features As Powerful Vizwiz Model Deuser Fabian, Habel Konrad, Rösch Philipp J., Oswald Norbert
- Black-box Prompt Learning For Pre-trained Language Models Diao Shizhe, Huang Zhichao, Xu Ruijia, Li Xuechun, Lin Yong, Zhou Xiao, Zhang Tong
- Star: Bootstrapping Reasoning With Reasoning Eric Zelikman, Yuhuai Wu, Jesse Mu, Noah D. Goodman
- Towards Winoqueer: Developing A Benchmark For Anti-queer Bias In Large Language Models Felkner Virginia K., Chang Ho-chun Herbert, Jang Eugene, May Jonathan
- MIGA: A Unified Multi-task Generation Framework For Conversational Text-to-sql Fu Yingwen, Ou Wenjie, Yu Zhou, Lin Yue
- WAVPROMPT: Towards Few-shot Spoken Language Understanding With Frozen Language Models Gao Heting, Ni Junrui, Qian Kaizhi, Zhang Yang, Chang Shiyu, Hasegawa-johnson Mark
- Visual Prompt Tuning For Test-time Domain Adaptation Gao Yunhe, Shi Xingjian, Zhu Yi, Wang Hao, Tang Zhiqiang, Zhou Xiong, Li Mu, Metaxas Dimitris N.
- Visually-augmented Pretrained Language Models For NLP Tasks Without Images Guo Hangyu, Zhou Kun, Zhao Wayne Xin, Zhang Qinyu, Wen Ji-rong
- MALM: Mixing Augmented Language Modeling For Zero-shot Machine Translation Gupta Kshitij
- Thinking About GPT-3 In-context Learning For Biomedical IE? Think Again Gutiérrez Bernal Jiménez, Mcneal Nikolas, Washington Clay, Chen You, Li Lang, Sun Huan, Su Yu
- FOLIO: Natural Language Reasoning With First-order Logic Han Simeng, Schoelkopf Hailey, Zhao Yilun, Qi Zhenting, Riddell Martin, Zhou Wenfei, Coady James, Peng David, Qiao Yujie, Benson Luke, Sun Lucy, Wardle-solano Alex, Szabo Hannah, Zubova Ekaterina, Burtell Matthew, Fan Jonathan, Liu Yixin, Wong Brian, Sailor Malcolm, Ni Ansong, Nan Linyong, Kasai Jungo, Yu Tao, Zhang Rui, Fabbri Alexander R., Kryscinski Wojciech, Yavuz Semih, Liu Ye, Lin Xi Victoria, Joty Shafiq, Zhou Yingbo, Xiong Caiming, Ying Rex, Cohan Arman, Radev Dragomir
- Supervised Visual Attention For Simultaneous Multimodal Machine Translation Haralampieva Veneta, Caglayan Ozan, Specia Lucia
- Rethinking With Retrieval: Faithful Large Language Model Inference He Hangfeng, Zhang Hongming, Roth Dan
- Bundle MCR: Towards Conversational Bundle Recommendation He Zhankui, Zhao Handong, Yu Tong, Kim Sungchul, Du Fan, Mcauley Julian
- Compressing Pre-trained Transformers Via Low-bit Nxm Sparsity For Natural Language Understanding Holmes Connor, Zhang Minjia, He Yuxiong, Wu Bo
- HINT: Hypernetwork Instruction Tuning For Efficient Zero- & Few-shot Generalisation Ivison Hamish, Bhagia Akshita, Wang Yizhong, Hajishirzi Hannaneh, Peters Matthew
- Chatgpt Makes Medicine Easy To Swallow: An Exploratory Case Study On Simplified Radiology Reports Jeblick Katharina, Schachtner Balthasar, Dexl Jakob, Mittermeier Andreas, Stüber Anna Theresa, Topalis Johanna, Weber Tobias, Wesp Philipp, Sabel Bastian, Ricke Jens, Ingrisch Michael
- DP-KB: Data Programming With Knowledge Bases Improves Transformer Fine Tuning For Answer Sentence Selection Jedema Nic, Vu Thuy, Gupta Manish, Moschitti Alessandro
- Visual Prompt Tuning Jia Menglin, Tang Luming, Chen Bor-chun, Cardie Claire, Belongie Serge, Hariharan Bharath, Lim Ser-nam
- Contrastive Learning For Prompt-based Few-shot Language Learners Jian Yiren, Gao Chongyang, Vosoughi Soroush
- Large Language Models Can Self-improve Jiaxin Huang, Shixiang Shane Gu, Le Hou, Yuexin Wu, Xuezhi Wang, Hongkun Yu, Jiawei Han
- Probing Script Knowledge From Pre-trained Models Jin Zijian, Zhang Xingyu, Yu Mo, Huang Lifu
- Training Compute-optimal Large Language Models Jordan Hoffmann, Sebastian Borgeaud, Arthur Mensch, Elena Buchatskaya, Trevor Cai, Eliza Rutherford, Diego De Las Casas, Lisa Anne Hendricks, Johannes Welbl, Aidan Clark, Tom Hennigan, Eric Noland, Katie Millican, George Van Den Driessche, Bogdan Damoc, Aurelia Guy, Simon Osindero, Karen Simonyan, Erich Elsen, Jack W. Rae, Oriol Vinyals, Laurent Sifre
- Latency Adjustable Transformer Encoder For Language Understanding Kachuee Sajjad, Sharifkhani Mohammad
- Gpt-neo For Commonsense Reasoning -- A Theoretical And Practical Lens Kashyap Rohan, Kashyap Vivek, P. Narendra C.
- What Makes Data-to-text Generation Hard For Pretrained Language Models? Keymanesh Moniba, Benton Adrian, Dredze Mark
- Mucot: Multilingual Contrastive Training For Question-answering In Low-resource Languages Kumar Gokul Karthik, Gehlot Abhishek Singh, Mullappilly Sahal Shaji, Nandakumar Karthik
- The Optimal BERT Surgeon: Scalable And Accurate Second-order Pruning For Large Language Models Kurtic Eldar, Campos Daniel, Nguyen Tuan, Frantar Elias, Kurtz Mark, Fineran Benjamin, Goin Michael, Alistarh Dan
- Internet-augmented Language Models Through Few-shot Prompting For Open-domain Question Answering Lazaridou Angeliki, Gribovskaya Elena, Stokowiec Wojciech, Grigorev Nikolai
- Simple Recurrence Improves Masked Language Models Lei Tao, Tian Ran, Bastings Jasmijn, Parikh Ankur P.
- GPT-D: Inducing Dementia-related Linguistic Anomalies By Deliberate Degradation Of Artificial Neural Language Models Li Changye, Knopman David, Xu Weizhe, Cohen Trevor, Pakhomov Serguei
- Summer: Wechat Neural Machine Translation Systems For The WMT22 Biomedical Translation Task Li Ernan, Meng Fandong, Zhou Jie
- Fine-grained Semantically Aligned Vision-language Pre-training Li Juncheng, He Xin, Wei Longhui, Qian Long, Zhu Linchao, Xie Lingxi, Zhuang Yueting, Tian Qi, Tang Siliang
- Learning To Transfer Prompts For Text Generation Li Junyi, Tang Tianyi, Nie Jian-yun, Wen Ji-rong, Zhao Wayne Xin
- Pre-trained Language Models For Interactive Decision-making Li Shuang, Puig Xavier, Paxton Chris, Du Yilun, Wang Clinton, Fan Linxi, Chen Tao, Huang De-an, Akyürek Ekin, Anandkumar Anima, Andreas Jacob, Mordatch Igor, Torralba Antonio, Zhu Yuke
- Evaluating Psychological Safety Of Large Language Models Li Xingxuan, Li Yutong, Qiu Lin, Joty Shafiq, Bing Lidong
- Uni-eden: Universal Encoder-decoder Network By Multi-granular Vision-language Pre-training Li Yehao, Fan Jiahao, Pan Yingwei, Yao Ting, Lin Weiyao, Mei Tao
- Bjtu-wechat's Systems For The WMT22 Chat Translation Task Liang Yunlong, Meng Fandong, Xu Jinan, Chen Yufeng, Zhou Jie
- Contextual Transformer For Offline Meta Reinforcement Learning Lin Runji, Li Ye, Feng Xidong, Zhang Zhaowei, Fung Xian Hong Wu, Zhang Haifeng, Wang Jun, Du Yali, Yang Yaodong
- Delving Deeper Into Cross-lingual Visual Question Answering Liu Chen, Pfeiffer Jonas, Korhonen Anna, Vulić Ivan, Gurevych Iryna
- Fl-tuning: Layer Tuning For Feed-forward Network In Transformer Liu Jingping, Song Yuqiu, Xue Kui, Sun Hongli, Wang Chao, Chen Lihan, Jiang Haiyun, Liang Jiaqing, Ruan Tong
- PSP: Pre-trained Soft Prompts For Few-shot Abstractive Summarization Liu Xiaochen, Gao Yang, Bai Yu, Li Jiawei, Hu Yinan, Huang Heyan, Chen Boxing
- A Win-win Deal: Towards Sparse And Robust Pre-trained Language Models Liu Yuanxin, Meng Fandong, Lin Zheng, Li Jiangnan, Fu Peng, Cao Yanan, Wang Weiping, Zhou Jie
- Declaration-based Prompt Tuning For Visual Question Answering Liu Yuhang, Wei Wei, Peng Daowan, Zhu Feida
- Training Language Models To Follow Instructions With Human Feedback Long Ouyang, Jeff Wu, Xu Jiang, Diogo Almeida, Carroll L. Wainwright, Pamela Mishkin, Chong Zhang, Sandhini Agarwal, Katarina Slama, Alex Ray, John Schulman, Jacob Hilton, Fraser Kelton, Luke Miller, Maddie Simens, Amanda Askell, Peter Welinder, Paul Christiano, Jan Leike, Ryan Lowe
- Vision-and-language Pretrained Models: A Survey Long Siqu, Cao Feiqi, Han Soyeon Caren, Yang Haiqin
- Structured Like A Language Model: Analysing AI As An Automated Subject Magee Liam, Arora Vanicka, Munn Luke
- PERFECT: Prompt-free And Efficient Few-shot Learning With Language Models Mahabadi Rabeeh Karimi, Zettlemoyer Luke, Henderson James, Saeidi Marzieh, Mathias Lambert, Stoyanov Veselin, Yazdani Majid
- Fine-tuning Pre-trained Transformers Into Decaying Fast Weights Mao Huanru Henry
- Mini-model Adaptation: Efficiently Extending Pretrained Models To New Languages Via Aligned Shallow Training Marchisio Kelly, Lewis Patrick, Chen Yihong, Artetxe Mikel
- Generating Training Data With Language Models: Towards Zero-shot Language Understanding Meng Yu, Huang Jiaxin, Zhang Yu, Han Jiawei
- Who Is GPT-3? An Exploration Of Personality, Values And Demographics Miotto Marilù, Rossberg Nicola, Kleinberg Bennett
- Domain-specific Text Generation For Machine Translation Moslem Yasmin, Haque Rejwanul, Kelleher John D., Way Andy
- SGPT: GPT Sentence Embeddings For Semantic Search Muennighoff Niklas
- Large Language Models Are Reasoning Teachers Namgyu Ho, Laura Schmid, Se-young Yun
- Crosslingual Generalization Through Multitask Finetuning Niklas Muennighoff, Thomas Wang, Lintang Sutawika, Adam Roberts, Stella Biderman, Teven Le Scao, M Saiful Bari, Sheng Shen, Zheng-xin Yong, Hailey Schoelkopf, Xiangru Tang, Dragomir Radev, Alham Fikri Aji, Khalid Almubarak, Samuel Albanie, Zaid Alyafeai, Albert Webson, Edward Raff, Colin Raffel
- No Language Left Behind: Scaling Human-centered Machine Translation Nllb Team, Marta R. Costa-jussà, James Cross, Onur Çelebi, Maha Elbayad, Kenneth Heafield, Kevin Heffernan, Elahe Kalbassi, Janice Lam, Daniel Licht, Jean Maillard, Anna Sun, Skyler Wang, Guillaume Wenzek, Al Youngblood, Bapi Akula, Loic Barrault, Gabriel Mejia Gonzalez, Prangthip Hansanti, John Hoffman, Semarley Jarrett, Kaushik Ram Sadagopan, Dirk Rowe, Shannon Spruit, Chau Tran, Pierre Andrews, Necip Fazil Ayan, Shruti Bhosale, Sergey Edunov, Angela Fan, Cynthia Gao, Vedanuj Goswami, Francisco Guzmán, Philipp Koehn, Alexandre Mourachko, Christophe Ropers, Safiyyah Saleem, Holger Schwenk, Jeff Wang
- Improving The Cross-lingual Generalisation In Visual Question Answering Nooralahzadeh Farhad, Sennrich Rico
- Minialbert: Model Distillation Via Parameter-efficient Recursive Transformers Nouriborji Mohammadmahdi, Rohanian Omid, Kouchaki Samaneh, Clifton David A.
- Neural Language Taskonomy: Which NLP Tasks Are The Most Predictive Of Fmri Brain Activity? Oota Subba Reddy, Arora Jashn, Agarwal Veeral, Marreddy Mounika, Gupta Manish, Surampudi Bapi Raju
- Parameter-efficient Abstractive Question Answering Over Tables Or Text Pal Vaishali, Kanoulas Evangelos, De Rijke Maarten
- Follow-up Attention: An Empirical Study Of Developer And Neural Model Code Exploration Paltenghi Matteo, Pandita Rahul, Henley Austin Z., Ziegler Albert
- Bidirectional Language Models Are Also Few-shot Learners Patel Ajay, Li Bryan, Rasooli Mohammad Sadegh, Constant Noah, Raffel Colin, Callison-burch Chris
- Sparse And Dense Approaches For The Full-rank Retrieval Of Responses For Dialogues Penha Gustavo, Hauff Claudia
- Hypertuning: Toward Adapting Large Language Models Without Back-propagation Phang Jason, Mao Yi, He Pengcheng, Chen Weizhu
- Synchromesh: Reliable Code Generation From Pre-trained Language Models Poesia Gabriel, Polozov Oleksandr, Le Vu, Tiwari Ashish, Soares Gustavo, Meek Christopher, Gulwani Sumit
- Super-prompting: Utilizing Model-independent Contextual Data To Reduce Data Annotation Required In Visual Commonsense Tasks Rezaei Navid, Reformat Marek Z.
- Additive Interventions Yield Robust Multi-domain Machine Translation Models Rippeth Elijah, Post Matt
- Learning To Diversify For Product Question Generation Roitman Haggai, Singer Uriel, Eshel Yotam, Nus Alexander, Kiperwasser Eliyahu
- Lamda: Language Models For Dialog Applications Romal Thoppilan, Daniel De Freitas, Jamie Hall, Noam Shazeer, Apoorv Kulshreshtha, Heng-tze Cheng, Alicia Jin, Taylor Bos, Leslie Baker, Yu Du, Yaguang Li, Hongrae Lee, Huaixiu Steven Zheng, Amin Ghafouri, Marcelo Menegali, Yanping Huang, Maxim Krikun, Dmitry Lepikhin, James Qin, Dehao Chen, Yuanzhong Xu, Zhifeng Chen, Adam Roberts, Maarten Bosma, Vincent Zhao, Yanqi Zhou, Chung-ching Chang, Igor Krivokon, Will Rusch, Marc Pickett, Pranesh Srinivasan, Laichee Man, Kathleen Meier-hellstern, Meredith Ringel Morris, Tulsee Doshi, Renelito Delos Santos, Toju Duke, Johnny Soraker, Ben Zevenbergen, Vinodkumar Prabhakaran, Mark Diaz, Ben Hutchinson, Kristen Olson, Alejandra Molina, Erin Hoffman-john, Josh Lee, Lora Aroyo, Ravi Rajakumar, Alena Butryna, Matthew Lamm, Viktoriya Kuzmina, Joe Fenton, Aaron Cohen, Rachel Bernstein, Ray Kurzweil, Blaise Aguera-arcas, Claire Cui, Marian Croak, Ed Chi, Quoc Le
- LINGUIST: Language Model Instruction Tuning To Generate Annotated Utterances For Intent Classification And Slot Tagging Rosenbaum Andy, Soltan Saleh, Hamza Wael, Versley Yannick, Boese Markus
- Few-shot Visual Question Generation: A Novel Task And Benchmark Datasets Roy Anurag, Ekka David Johnson, Ghosh Saptarshi, Das Abir
- Benchclamp: A Benchmark For Evaluating Language Models On Syntactic And Semantic Parsing Roy Subhro, Thomson Sam, Chen Tongfei, Shin Richard, Pauls Adam, Eisner Jason, Van Durme Benjamin
- The Goldilocks Of Pragmatic Understanding: Fine-tuning Strategy Matters For Implicature Resolution By Llms Ruis Laura, Khan Akbir, Biderman Stella, Hooker Sara, Rocktäschel Tim, Grefenstette Edward
- Data Augmentation For Intent Classification With Off-the-shelf Large Language Models Sahu Gaurav, Rodriguez Pau, Laradji Issam H., Atighehchian Parmida, Vazquez David, Bahdanau Dzmitry
- Self-repetition In Abstractive Neural Summarizers Salkar Nikita, Trikalinos Thomas, Wallace Byron C., Nenkova Ani
- Language Models Are Greedy Reasoners: A Systematic Formal Analysis Of Chain-of-thought Saparov Abulhair, He He
- Using Deepspeed And Megatron To Train Megatron-turing NLG 530B, A Large-scale Generative Language Model Shaden Smith, Mostofa Patwary, Brandon Norick, Patrick Legresley, Samyam Rajbhandari, Jared Casper, Zhun Liu, Shrimai Prabhumoye, George Zerveas, Vijay Korthikanti, Elton Zhang, Rewon Child, Reza Yazdani Aminabadi, Julie Bernauer, Xia Song, Mohammad Shoeybi, Yuxiong He, Michael Houston, Saurabh Tiwary, Bryan Catanzaro
- XRICL: Cross-lingual Retrieval-augmented In-context Learning For Cross-lingual Text-to-sql Semantic Parsing Shi Peng, Zhang Rui, Bai He, Lin Jimmy
- Mgpt: Few-shot Learners Go Multilingual Shliazhko Oleh, Fenogenova Alena, Tikhonova Maria, Mikhailov Vladislav, Kozlova Anastasia, Shavrina Tatiana
- Improving The Domain Adaptation Of Retrieval Augmented Generation (RAG) Models For Open Domain Question Answering Siriwardhana Shamane, Weerasekera Rivindu, Wen Elliott, Kaluarachchi Tharindu, Rana Rajib, Nanayakkara Suranga
- CLIP Models Are Few-shot Learners: Empirical Studies On VQA And Visual Entailment Song Haoyu, Dong Li, Zhang Wei-nan, Liu Ting, Wei Furu
- Human Language Modeling Soni Nikita, Matero Matthew, Balasubramanian Niranjan, Schwartz H. Andrew
- OPT-IML: Scaling Language Model Instruction Meta Learning Through The Lens Of Generalization Srinivasan Iyer, Xi Victoria Lin, Ramakanth Pasunuru, Todor Mihaylov, Daniel Simig, Ping Yu, Kurt Shuster, Tianlu Wang, Qing Liu, Punit Singh Koura, Xian Li, Brian O'horo, Gabriel Pereyra, Jeff Wang, Christopher Dewan, Asli Celikyilmaz, Luke Zettlemoyer, Ves Stoyanov
- Towards Zero-shot And Few-shot Table Question Answering Using GPT-3 Srivastava Pragya, Ganu Tanuja, Guha Saikat
- Conciseness: An Overlooked Language Task Stahlberg Felix, Kumar Aashish, Alberti Chris, Kumar Shankar
- Towards Diverse, Relevant And Coherent Open-domain Dialogue Generation Via Hybrid Latent Variables Sun Bin, Li Yitong, Mi Fei, Wang Weichao, Li Yiwei, Li Kan
- Dynamar: Dynamic Prompt With Mask Token Representation Sun Xiaodi, Rajagopalan Sunny, Nigam Priyanka, Lu Weiyi, Xu Yi, Zeng Belinda, Chilimbi Trishul
- On The Effect Of Pre-training For Transformer In Different Modality On Offline Reinforcement Learning Takagi Shiro
- News Summarization And Evaluation In The Era Of GPT-3 Tanya Goyal, Junyi Jessy Li, Greg Durrett
- Towards Fine-tuning Pre-trained Language Models With Integer Forward And Backward Propagation Tayaranian Mohammadreza, Ghaffari Alireza, Tahaei Marzieh S., Rezagholizadeh Mehdi, Asgharian Masoud, Nia Vahid Partovi
- Benchmarking Large Language Models For Automated Verilog RTL Code Generation Thakur Shailja, Ahmad Baleegh, Fan Zhenxing, Pearce Hammond, Tan Benjamin, Karri Ramesh, Dolan-gavitt Brendan, Garg Siddharth
- Efficient Few-shot Learning Without Prompts Tunstall Lewis, Reimers Nils, Jo Unso Eun Seo, Bates Luke, Korat Daniel, Wasserblat Moshe, Pereg Oren
- Dylora: Parameter Efficient Tuning Of Pre-trained Models Using Dynamic Search-free Low-rank Adaptation Valipour Mojtaba, Rezagholizadeh Mehdi, Kobyzev Ivan, Ghodsi Ali
- Will We Run Out Of Data? Limits Of LLM Scaling Based On Human-generated Data Villalobos Pablo, Ho Anson, Sevilla Jaime, Besiroglu Tamay, Heim Lennart, Hobbhahn Marius
- Can Language Models Make Fun? A Case Study In Chinese Comical Crosstalk Wang Benyou, Wu Xiangbo, Liu Xiaokang, Li Jianquan, Tiwari Prayag, Xie Qianqian
- Exploring The Limits Of Domain-adaptive Training For Detoxifying Large-scale Language Models Wang Boxin, Ping Wei, Xiao Chaowei, Xu Peng, Patwary Mostofa, Shoeybi Mohammad, Li Bo, Anandkumar Anima, Catanzaro Bryan
- Finding Skill Neurons In Pre-trained Transformer-based Language Models Wang Xiaozhi, Wen Kaiyue, Zhang Zhengyan, Hou Lei, Liu Zhiyuan, Li Juanzi
- Compilable Neural Code Generation With Compiler Feedback Wang Xin, Wang Yasheng, Wan Yao, Mi Fei, Li Yitong, Zhou Pingyi, Liu Jin, Wu Hao, Jiang Xin, Liu Qun
- Adamix: Mixture-of-adaptations For Parameter-efficient Model Tuning Wang Yaqing, Agarwal Sahaj, Mukherjee Subhabrata, Liu Xiaodong, Gao Jing, Awadallah Ahmed Hassan, Gao Jianfeng
- Controlling Styles In Neural Machine Translation With Activation Prompt Wang Yifan, Sun Zewei, Cheng Shanbo, Zheng Weiguo, Wang Mingxuan
- Two-stage LLM Fine-tuning With Less Specialization And More Generalization Wang Yihan, Si Si, Li Daliang, Lukasik Michal, Yu Felix, Hsieh Cho-jui, Dhillon Inderjit S., Kumar Sanjiv
- Unified Multimodal Model With Unlikelihood Training For Visual Dialog Wang Zihao, Wang Junli, Jiang Changjun
- CREATER: Ctr-driven Advertising Text Generation With Controlled Pre-training And Contrastive Fine-tuning Wei Penghui, Yang Xuanhua, Liu Shaoguo, Wang Liang, Zheng Bo
- Probing Of Quantitative Values In Abstractive Summarization Models White Nathan M.
- Adversarial Self-attention For Language Understanding Wu Hongqiu, Ding Ruixue, Zhao Hai, Xie Pengjun, Huang Fei, Zhang Min
- CLIP Also Understands Text: Prompting CLIP For Phrase Understanding Yan An, Li Jiacheng, Zhu Wanrong, Lu Yujie, Wang William Yang, Mcauley Julian
- AD-DROP: Attribution-driven Dropout For Robust Language Model Fine-tuning Yang Tao, Deng Jinghao, Quan Xiaojun, Wang Qifan, Nie Shaoliang
- Parameter-efficient Tuning Makes A Good Classification Head Yang Zhuoyi, Ding Ming, Guo Yanhui, Lv Qingsong, Tang Jie
- Prompt Tuning For Discriminative Pre-trained Language Models Yao Yuan, Dong Bowen, Zhang Ao, Zhang Zhengyan, Xie Ruobing, Liu Zhiyuan, Lin Leyu, Sun Maosong, Wang Jianyong
- UL2: Unifying Language Learning Paradigms Yi Tay, Mostafa Dehghani, Vinh Q. Tran, Xavier Garcia, Jason Wei, Xuezhi Wang, Hyung Won Chung, Siamak Shakeri, Dara Bahri, Tal Schuster, Huaixiu Steven Zheng, Denny Zhou, Neil Houlsby, Donald Metzler
- Active Example Selection For In-context Learning Yiming Zhang, Shi Feng, Chenhao Tan
- BLOOM+1: Adding Language Support To BLOOM For Zero-shot Prompting Yong Zheng-xin, Schoelkopf Hailey, Muennighoff Niklas, Aji Alham Fikri, Adelani David Ifeoluwa, Almubarak Khalid, Bari M Saiful, Sutawika Lintang, Kasai Jungo, Baruwa Ahmed, Winata Genta Indra, Biderman Stella, Raff Edward, Radev Dragomir, Nikoulina Vassilina
- KRLS: Improving End-to-end Response Generation In Task Oriented Dialog With Reinforced Keywords Learning Yu Xiao, Wu Qingyang, Qian Kun, Yu Zhou
- Hype: Better Pre-trained Language Model Fine-tuning With Hidden Representation Perturbation Yuan Hongyi, Yuan Zheng, Tan Chuanqi, Huang Fei, Huang Songfang
- Wudaomm: A Large-scale Multi-modal Dataset For Pre-training Models Yuan Sha, Zhao Shuai, Leng Jiahong, Xue Zhao, Zhao Hanyu, Liu Peiyu, Gong Zheng, Zhao Wayne Xin, Li Junyi, Tang Jie
- Can Pretrained Language Models (yet) Reason Deductively? Yuan Zhangdie, Hu Songbo, Vulić Ivan, Korhonen Anna, Meng Zaiqiao
- M-adapter: Modality Adaptation For End-to-end Speech-to-text Translation Zhao Jinming, Yang Hao, Shareghi Ehsan, Haffari Gholamreza
- Attend, Memorize And Generate: Towards Faithful Table-to-text Generation In Few Shots Zhao Wenting, Liu Ye, Wan Yao, Yu Philip S.
- Reflection Of Thought: Inversely Eliciting Numerical Reasoning In Language Models Via Solving Linear Systems Zhou Fan, Dong Haoyu, Liu Qian, Cheng Zhoujun, Han Shi, Zhang Dongmei
- Predicting Fine-tuning Performance With Probing Zhu Zining, Shahtalebi Soroosh, Rudzicz Frank
- Exploring And Evaluating Personalized Models For Code Generation Zlotchevski Andrei, Drain Dawn, Svyatkovskiy Alexey, Clement Colin, Sundaresan Neel, Tufano Michele
- St-moe: Designing Stable And Transferable Sparse Expert Models Zoph Barret, Bello Irwan, Kumar Sameer, Du Nan, Huang Yanping, Dean Jeff, Shazeer Noam, Fedus William
- Efficient Long Sequence Modeling Via State Space Augmented Transformer Zuo Simiao, Liu Xiaodong, Jiao Jian, Charles Denis, Manavoglu Eren, Zhao Tuo, Gao Jianfeng
- LM-CPPF: Paraphrasing-guided Data Augmentation For Contrastive Prompt-based Few-shot Fine-tuning Abaskohi Amirhossein, Rothe Sascha, Yaghoobzadeh Yadollah
- Attention Over Pre-trained Sentence Embeddings For Long Document Classification Abdaoui Amine, Dutta Sourav
- Multi-party Goal Tracking With Llms: Comparing Pre-training, Fine-tuning, And Prompt Engineering Addlesee Angus, Sieińska Weronika, Gunson Nancie, Garcia Daniel Hernández, Dondrup Christian, Lemon Oliver
- RL4F: Generating Natural Language Feedback With Reinforcement Learning For Repairing Model Outputs Afra Feyza Akyürek, Ekin Akyürek, Aman Madaan, Ashwin Kalyan, Peter Clark, Derry Wijaya, Niket Tandon
- Bring Your Own KG: Self-supervised Program Synthesis For Zero-shot KGQA Agarwal Dhruv, Das Rajarshi, Khosla Sopan, Gangadharaiah Rashmi
- Can Large Language Models Be Good Path Planners? A Benchmark And Investigation On Spatial-temporal Reasoning Aghzal Mohamed, Plaku Erion, Yao Ziyu
- Simul-llm: A Framework For Exploring High-quality Simultaneous Translation With Large Language Models Agostinelli Victor, Wild Max, Raffel Matthew, Fuad Kazi Ahmed Asif, Chen Lizhong
- Jointly Training Large Autoregressive Multimodal Models Aiello Emanuele, Yu Lili, Nie Yixin, Aghajanyan Armen, Oguz Barlas
- An In-depth Look At Gemini's Language Abilities Akter Syeda Nahida, Yu Zichun, Muhamed Aashiq, Ou Tianyue, Bäuerle Alex, Cabrera Ángel Alexander, Dholakia Krish, Xiong Chenyan, Neubig Graham
- Can Language Models Employ The Socratic Method? Experiments With Code Debugging Al-hossami Erfan, Bunescu Razvan, Smith Justin, Teehan Ryan
- Becoming Self-instruct: Introducing Early Stopping Criteria For Minimal Instruct Tuning Alshikh Waseem, Daaboul Manhal, Goddard Kirk, Imel Brock, Kamble Kiran, Kulkarni Parikshith, Russak Melisa
- Steering Large Language Models For Machine Translation With Finetuning And In-context Learning Alves Duarte M., Guerreiro Nuno M., Alves João, Pombal José, Rei Ricardo, De Souza José G. C., Colombo Pierre, Martins André F. T.
- Taqyim: Evaluating Arabic NLP Tasks Using Chatgpt Models Alyafeai Zaid, Alshaibani Maged S., Alkhamissi Badr, Luqman Hamzah, Alareqi Ebrahim, Fadel Ali
- Knowledge Of Knowledge: Exploring Known-unknowns Uncertainty With Large Language Models Amayuelas Alfonso, Wong Kyle, Pan Liangming, Chen Wenhu, Wang William
- Learning From Mistakes Makes LLM Better Reasoner An Shengnan, Ma Zexiong, Lin Zeqi, Zheng Nanning, Lou Jian-guang, Chen Weizhu
- Expel: LLM Agents Are Experiential Learners Andrew Zhao, Daniel Huang, Quentin Xu, Matthieu Lin, Yong-jin Liu, Gao Huang
- Augmenting Llms With Knowledge: A Survey On Hallucination Prevention Andriopoulos Konstantinos, Pouwelse Johan
- Distilling Efficient Language-specific Models For Cross-lingual Transfer Ansell Alan, Ponti Edoardo Maria, Korhonen Anna, Vulić Ivan
- Agenttuning: Enabling Generalized Agent Abilities For Llms Aohan Zeng, Mingdao Liu, Rui Lu, Bowen Wang, Xiao Liu, Yuxiao Dong, Jie Tang
- Sequence-to-sequence Spanish Pre-trained Language Models Araujo Vladimir, Trusca Maria Mihaela, Tufiño Rodrigo, Moens Marie-francine
- Chainforge: A Visual Toolkit For Prompt Engineering And LLM Hypothesis Testing Arawjo Ian, Swoopes Chelse, Vaithilingam Priyan, Wattenberg Martin, Glassman Elena
- Slimfit: Memory-efficient Fine-tuning Of Transformer-based Models Using Training Dynamics Ardakani Arash, Haan Altan, Tan Shangyin, Popovici Doru Thom, Cheung Alvin, Iancu Costin, Sen Koushik
- AI And Education: An Investigation Into The Use Of Chatgpt For Systems Thinking Arndt Holger
- Learning And Leveraging Verifiers To Improve Planning Capabilities Of Pre-trained Language Models Arora Daman, Kambhampati Subbarao
- BUFFET: Benchmarking Large Language Models For Few-shot Cross-lingual Transfer Asai Akari, Kudugunta Sneha, Yu Xinyan Velocity, Blevins Terra, Gonen Hila, Reid Machel, Tsvetkov Yulia, Ruder Sebastian, Hajishirzi Hannaneh
- Instruction-tuning Aligns Llms To The Human Brain Aw Khai Loong, Montariol Syrielle, Alkhamissi Badr, Schrimpf Martin, Bosselut Antoine
- Intentional Biases In LLM Responses Badyal Nicklaus, Jacoby Derek, Coady Yvonne
- Longbench: A Bilingual, Multitask Benchmark For Long Context Understanding Bai Yushi, Lv Xin, Zhang Jiajie, Lyu Hongchang, Tang Jiankai, Huang Zhidian, Du Zhengxiao, Liu Xiao, Zeng Aohan, Hou Lei, Dong Yuxiao, Tang Jie, Li Juanzi
- Kgquiz: Evaluating The Generalization Of Encoded Knowledge In Large Language Models Bai Yuyang, Feng Shangbin, Balachandran Vidhisha, Tan Zhaoxuan, Lou Shiqi, He Tianxing, Tsvetkov Yulia
- Tamil-llama: A New Tamil Language Model Based On Llama 2 Balachandran Abhinand
- Fine-tuning Large Enterprise Language Models Via Ontological Reasoning Baldazzi Teodoro, Bellomarini Luigi, Ceri Stefano, Colombo Andrea, Gentili Andrea, Sallinger Emanuel
- Investigating Pre-trained Language Models On Cross-domain Datasets, A Step Closer To General AI Ballout Mohamad, Krumnack Ulf, Heidemann Gunther, Kühnberger Kai-uwe
- A Bi-step Grounding Paradigm For Large Language Models In Recommendation Systems Bao Keqin, Zhang Jizhi, Wang Wenjie, Zhang Yang, Yang Zhengyi, Luo Yancheng, Chen Chong, Feng Fuli, Tian Qi
- Assessing And Enhancing The Robustness Of Large Language Models With Task Structure Variations For Logical Reasoning Bao Qiming, Gendron Gael, Peng Alex Yuxuan, Zhong Wanjun, Tan Neset, Chen Yang, Witbrock Michael, Liu Jiamou
- Disc-medllm: Bridging General Large Language Models And Real-world Medical Consultation Bao Zhijie, Chen Wei, Xiao Shengze, Ren Kuang, Wu Jiaao, Zhong Cheng, Peng Jiajie, Huang Xuanjing, Wei Zhongyu
- Understanding Telecom Language Through Large Language Models Bariah Lina, Zou Hang, Zhao Qiyang, Mouhouche Belkacem, Bader Faouzi, Debbah Merouane
- Text Summarization Using Large Language Models: A Comparative Study Of Mpt-7b-instruct, Falcon-7b-instruct, And Openai Chat-gpt Models Basyal Lochan, Sanghvi Mihir
- The Poison Of Alignment Bekbayev Aibek, Chun Sungbae, Dulat Yerzat, Yamazaki James
- Labrador: Exploring The Limits Of Masked Language Modeling For Laboratory Data Bellamy David R., Kumar Bhawesh, Wang Cindy, Beam Andrew
- A Survey On Prompting Techniques In Llms Bhandari Prabin
- TART: A Plug-and-play Transformer Module For Task-agnostic Reasoning Bhatia Kush, Narayan Avanika, De Sa Christopher, Ré Christopher
- Helm: Highlighted Evidence Augmented Language Model For Enhanced Table-to-text Generation Bian Junyi, Qin Xiaolei, Zou Wuhe, Huang Mengzuo, Luo Congyi, Zhang Ke, Zhang Weidong
- Safety-tuned Llamas: Lessons From Improving The Safety Of Large Language Models That Follow Instructions Bianchi Federico, Suzgun Mirac, Attanasio Giuseppe, Röttger Paul, Jurafsky Dan, Hashimoto Tatsunori, Zou James
- VIGC: Visual Instruction Generation And Correction Bin Wang, Fan Wu, Xiao Han, Jiahui Peng, Huaping Zhong, Pan Zhang, Xiaoyi Dong, Weijia Li, Wei Li, Jiaqi Wang, Conghui He
- BLT: Can Large Language Models Handle Basic Legal Text? Blair-stanek Andrew, Holzenberger Nils, Van Durme Benjamin
- Pixlore: A Dataset-driven Approach To Rich Image Captioning Bonilla Diego
- Extractgpt: Exploring The Potential Of Large Language Models For Product Attribute Value Extraction Brinkmann Alexander, Shraga Roee, Bizer Christian
- Utilizing Chatgpt Generated Data To Retrieve Depression Symptoms From Social Media Bucur Ana-maria
- Generative AI Assistants In Software Development Education: A Vision For Integrating Generative AI Into Educational Practice, Not Instinctively Defending Against It Bull Christopher, Kharrufa Ahmed
- Just Tell Me: Prompt Engineering In Business Process Management Busch Kiran, Rochlitzer Alexander, Sola Diana, Leopold Henrik
- Oberta: Improving Sparse Transfer Learning Via Improved Initialization, Distillation, And Pruning Regimes Campos Daniel, Marques Alexandre, Kurtz Mark, Zhai Chengxiang
- Wizardlm: Empowering Large Language Models To Follow Complex Instructions Can Xu, Qingfeng Sun, Kai Zheng, Xiubo Geng, Pu Zhao, Jiazhan Feng, Chongyang Tao, Daxin Jiang
- H2O Open Ecosystem For State-of-the-art Large Language Models Candel Arno, Mckinney Jon, Singer Philipp, Pfeiffer Pascal, Jeblick Maximilian, Lee Chun Ming, Conde Marcos V.
- Defending Against Alignment-breaking Attacks Via Robustly Aligned LLM Cao Bochuan, Cao Yuanpu, Lin Lu, Chen Jinghui
- Are Large Language Models Good Fact Checkers: A Preliminary Study Cao Han, Wei Lingwei, Chen Mengyang, Zhou Wei, Hu Songlin
- Instructmol: Multi-modal Integration For Building A Versatile And Reliable Molecular Assistant In Drug Discovery Cao He, Liu Zijing, Lu Xingyu, Yao Yuan, Li Yu
- How Does AI Chat Change Search Behaviors? Capra Robert, Arguello Jaime
- Promoting Generalized Cross-lingual Question Answering In Few-resource Scenarios Via Self-knowledge Distillation Carrino Casimiro Pio, Escolano Carlos, Fonollosa José A. R.
- INT2.1: Towards Fine-tunable Quantized Large Language Models With Error Correction Through Low-rank Adaptation Chai Yuji, Gkountouras John, Ko Glenn G., Brooks David, Wei Gu-yeon
- Learning To Generate Better Than Your LLM Chang Jonathan D., Brantley Kiante, Ramamurthy Rajkumar, Misra Dipendra, Sun Wen
- Prompting And Adapter Tuning For Self-supervised Encoder-decoder Speech Model Chang Kai-wei, Chen Ming-hsin, Lin Yun-ping, Hsu Jing Neng, Huang Paul Kuo-ming, Huang Chien-yu, Li Shang-wen, Lee Hung-yi
- Language Model Behavior: A Comprehensive Survey Chang Tyler A., Bergen Benjamin K.
- She Had Cobalt Blue Eyes: Prompt Testing To Create Aligned And Sustainable Language Models Chatrath Veronica, Bamgbose Oluwanifemi, Raza Shaina
- Fireact: Toward Language Agent Fine-tuning Chen Baian, Shu Chang, Shareghi Ehsan, Collier Nigel, Narasimhan Karthik, Yao Shunyu
- Visual Instruction Tuning With Polite Flamingo Chen Delong, Liu Jianfeng, Dai Wenliang, Wang Baoyuan
- LION: Empowering Multimodal Large Language Model With Dual-level Visual Knowledge Chen Gongwei, Shen Leyang, Shao Rui, Deng Xiang, Nie Liqiang
- Videollm: Modeling Video Sequence With Large Language Models Chen Guo, Zheng Yin-dong, Wang Jiahao, Xu Jilan, Huang Yifei, Pan Junting, Wang Yi, Wang Yali, Qiao Yu, Lu Tong, Wang Limin
- Chatgpt's One-year Anniversary: Are Open-source Large Language Models Catching Up? Chen Hailin, Jiao Fangkai, Li Xingxuan, Qin Chengwei, Ravaut Mathieu, Zhao Ruochen, Xiong Caiming, Joty Shafiq
- Maybe Only 0.5% Data Is Needed: A Preliminary Exploration Of Low Training Data Instruction Tuning Chen Hao, Zhang Yiming, Zhang Qi, Yang Hantao, Hu Xiaomeng, Ma Xuetao, Yanggong Yifan, Zhao Junbo
- Skills-in-context Prompting: Unlocking Compositionality In Large Language Models Chen Jiaao, Pan Xiaoman, Yu Dian, Song Kaiqiang, Wang Xiaoyang, Yu Dong, Chen Jianshu
- Adaptation With Self-evaluation To Improve Selective Prediction In Llms Chen Jiefeng, Yoon Jinsung, Ebrahimi Sayna, Arik Sercan O., Pfister Tomas, Jha Somesh
- Schema-guided Semantic Accuracy: Faithfulness In Task-oriented Dialogue Response Generation Chen Jinghong, Lin Weizhe, Byrne Bill
- Forgetful Large Language Models: Lessons Learned From Using Llms In Robot Programming Chen Juo-tung, Huang Chien-ming
- Epi-curriculum: Episodic Curriculum Learning For Low-resource Domain Adaptation In Neural Machine Translation Chen Keyu, Zhuang Di, Li Mingchen, Chang J. Morris
- Alpagasus: Training A Better Alpaca With Fewer Data Chen Lichang, Li Shiyang, Yan Jun, Wang Hai, Gunaratna Kalpa, Yadav Vikas, Tang Zheng, Srinivasan Vijay, Zhou Tianyi, Huang Heng, Jin Hongxia
- Introspective Tips: Large Language Model For In-context Decision Making Chen Liting, Wang Lu, Dong Hang, Du Yali, Yan Jie, Yang Fangkai, Li Shuang, Zhao Pu, Qin Si, Rajmohan Saravan, Lin Qingwei, Zhang Dongmei
- Driving With Llms: Fusing Object-level Vector Modality For Explainable Autonomous Driving Chen Long, Sinavski Oleg, Hünermann Jan, Karnsund Alice, Willmott Andrew James, Birch Danny, Maund Daniel, Shotton Jamie
- Natural Response Generation For Chinese Reading Comprehension Chen Nuo, Li Hongguang, Bao Yinan, Wang Baoyuan, Li Jia
- Breaking Language Barriers In Multilingual Mathematical Reasoning: Insights And Observations Chen Nuo, Zheng Zinan, Wu Ning, Gong Ming, Song Yangqiu, Zhang Dongmei, Li Jia
- Evaluation Of Chatgpt Family Of Models For Biomedical Reasoning And Classification Chen Shan, Li Yingya, Lu Sheng, Van Hoang, Aerts Hugo J. W. L., Savova Guergana K., Bitterman Danielle S.
- Extending Context Window Of Large Language Models Via Positional Interpolation Chen Shouyuan, Wong Sherman, Chen Liangjian, Tian Yuandong
- Llm-empowered Chatbots For Psychiatrist And Patient Simulation: Application And Evaluation Chen Siyuan, Wu Mengyue, Zhu Kenny Q., Lan Kunyao, Zhang Zhiling, Cui Lyuchun
- MMICT: Boosting Multi-modal Fine-tuning With In-context Examples Chen Tao, Zhang Enwei, Gao Yuting, Li Ke, Sun Xing, Zhang Yan, Li Hui, Ji Rongrong
- Lorashear: Efficient Large Language Model Structured Pruning And Knowledge Recovery Chen Tianyi, Ding Tianyu, Yadav Badal, Zharkov Ilya, Liang Luming
- The Janus Interface: How Fine-tuning In Large Language Models Amplifies The Privacy Risks Chen Xiaoyi, Tang Siyuan, Zhu Rui, Yan Shijun, Jin Lei, Wang Zihao, Su Liya, Zhang Zhikun, Wang Xiaofeng, Tang Haixu
- Asking Before Acting: Gather Information In Embodied Decision Making With Language Models Chen Xiaoyu, Zhang Shenao, Zhang Pushi, Zhao Li, Chen Jianyu
- Minprompt: Graph-based Minimal Prompt Data Augmentation For Few-shot Question Answering Chen Xiusi, Jiang Jyun-yu, Chang Wei-cheng, Hsieh Cho-jui, Yu Hsiang-fu, Wang Wei
- Translation And Fusion Improves Zero-shot Cross-lingual Information Extraction Chen Yang, Shah Vedaant, Ritter Alan
- Measuring And Improving Chain-of-thought Reasoning In Vision-language Models Chen Yangyi, Sikka Karan, Cogswell Michael, Ji Heng, Divakaran Ajay
- Dog-instruct: Towards Premium Instruction-tuning Data Via Text-grounded Instruction Wrapping Chen Yongrui, Jiang Haiyun, Huang Xinting, Shi Shuming, Qi Guilin
- SALM: Speech-augmented Language Model With In-context Learning For Speech Recognition And Translation Chen Zhehuai, Huang He, Andrusenko Andrei, Hrinchuk Oleksii, Puvvada Krishna C., Li Jason, Ghosh Subhankar, Balam Jagadeesh, Ginsburg Boris
- See, Think, Confirm: Interactive Prompting Between Vision And Language Models For Knowledge-based Visual Reasoning Chen Zhenfang, Zhou Qinhong, Shen Yikang, Hong Yining, Zhang Hao, Gan Chuang
- UPRISE: Universal Prompt Retrieval For Improving Zero-shot Evaluation Cheng Daixuan, Huang Shaohan, Bi Junyu, Zhan Yuefeng, Liu Jianfeng, Wang Yujing, Sun Hao, Wei Furu, Deng Denvy, Zhang Qi
- SCALE: Synergized Collaboration Of Asymmetric Language Translation Engines Cheng Xin, Wang Xun, Ge Tao, Chen Si-qing, Wei Furu, Zhao Dongyan, Yan Rui
- Attention Alignment And Flexible Positional Embeddings Improve Transformer Length Extrapolation Chi Ta-chung, Fan Ting-han, Rudnicky Alexander I.
- M²chat: Empowering VLM For Multimodal LLM Interleaved Text-image Generation Chi Xiaowei, Zhang Rongyu, Jiang Zhengkai, Liu Yijiang, Wang Yatian, Qi Xingqun, Luo Wenhan, Gao Peng, Zhang Shanghang, Liu Qifeng, Guo Yike
- INSTRUCTEVAL: Towards Holistic Evaluation Of Instruction-tuned Large Language Models Chia Yew Ken, Hong Pengfei, Bing Lidong, Poria Soujanya
- Evaluating The Efficacy Of Interactive Language Therapy Based On LLM For High-functioning Autistic Adolescent Psychological Counseling Cho Yujin, Kim Mingeon, Kim Seojin, Kwon Oyun, Kwon Ryan Donghan, Lee Yoonha, Lim Dohyun
- Generative Data Augmentation Using Llms Improves Distributional Robustness In Question Answering Chowdhury Arijit Ghosh, Chadha Aman
- Large Language Models For User Interest Journeys Christakopoulou Konstantina, Lalama Alberto, Adams Cj, Qu Iris, Amir Yifat, Chucri Samer, Vollucci Pierce, Soldo Fabio, Bseiso Dina, Scodel Sarah, Dixon Lucas, Chi Ed H., Chen Minmin
- Language And Task Arithmetic With Parameter-efficient Layers For Zero-shot Summarization Chronopoulou Alexandra, Pfeiffer Jonas, Maynez Joshua, Wang Xinyi, Ruder Sebastian, Agrawal Priyanka
- Tutorials On Stance Detection Using Pre-trained Language Models: Fine-tuning BERT And Prompting Large Language Models Chuang Yun-shiuan
- Dola: Decoding By Contrasting Layers Improves Factuality In Large Language Models Chuang Yung-sung, Xie Yujia, Luo Hongyin, Kim Yoon, Glass James, He Pengcheng
- Instructtods: Large Language Models For End-to-end Task-oriented Dialogue Systems Chung Willy, Cahyawijaya Samuel, Wilie Bryan, Lovenia Holy, Fung Pascale
- Galactic Chitchat: Using Large Language Models To Converse With Astronomy Literature Ciucă Ioana, Ting Yuan-sen
- Generalization Analogies: A Testbed For Generalizing AI Oversight To Hard-to-measure Domains Clymer Joshua, Baker Garrett, Subramani Rohan, Wang Sam
- Inducing Anxiety In Large Language Models Increases Exploration And Bias Coda-forno Julian, Witte Kristin, Jagadish Akshay K., Binz Marcel, Akata Zeynep, Schulz Eric
- Reinforcement Learning Fine-tuning Of Language Models Is Biased Towards More Extractable Features Cruz Diogo, Pona Edoardo, Holness-tofts Alex, Schmied Elias, Alonso Víctor Abia, Griffin Charlie, Cirstea Bogdan-ionut
- Machine Mindset: An MBTI Exploration Of Large Language Models Cui Jiaxi, Lv Liuzhenghao, Wen Jing, Wang Rongsheng, Tang Jing, Tian Yonghong, Yuan Li
- Ada-instruct: Adapting Instruction Generators For Complex Reasoning Cui Wanyun, Wang Qianle
- Neural Retrievers Are Biased Towards Llm-generated Content Dai Sunhao, Zhou Yuqi, Pang Liang, Liu Weihao, Hu Xiaolin, Liu Yong, Zhang Xiao, Wang Gang, Xu Jun
- Educhat: A Large-scale Language Model-based Chatbot System For Intelligent Education Dan Yuhao, Lei Zhikai, Gu Yiyang, Li Yong, Yin Jianghao, Lin Jiaju, Ye Linhao, Tie Zhiyan, Zhou Yougen, Wang Yilei, Zhou Aimin, Zhou Ze, Chen Qin, Zhou Jie, He Liang, Qiu Xipeng
- Mask The Bias: Improving Domain-adaptive Generalization Of Ctc-based ASR With Internal Language Model Estimation Das Nilaksh, Sunkara Monica, Bodapati Sravan, Cai Jinglun, Kulshreshtha Devang, Farris Jeff, Kirchhoff Katrin
- Optimizing Distributed Training On Frontier For Large Language Models Dash Sajal, Lyngaas Isaac, Yin Junqi, Wang Xiao, Egele Romain, Cong Guojing, Wang Feiyi, Balaprakash Prasanna
- Complex QA And Language Models Hybrid Architectures, Survey Daull Xavier, Bellot Patrice, Bruno Emmanuel, Martin Vincent, Murisasco Elisabeth
- GPT-3 Models Are Few-shot Financial Reasoners De Padua Raul Salles, Qureshi Imran, Karakaplan Mustafa U.
- Fairness Of Chatgpt And The Role Of Explainable-guided Prompts Deldjoo Yashar
- Plug-and-play Policy Planner For Large Language Model Powered Dialogue Agents Deng Yang, Zhang Wenxuan, Lam Wai, Ng See-kiong, Chua Tat-seng
- Mutual Enhancement Of Large And Small Language Models With Cross-silo Knowledge Transfer Deng Yongheng, Qiao Ziqing, Ren Ju, Liu Yang, Zhang Yaoxue
- Multilingual Jailbreak Challenges In Large Language Models Deng Yue, Zhang Wenxuan, Pan Sinno Jialin, Bing Lidong
- Context-aware Transformer Pre-training For Answer Sentence Selection Di Liello Luca, Garg Siddhant, Moschitti Alessandro
- A Comprehensive Evaluation Of Neural SPARQL Query Generation From Natural Language Questions Diallo Papa Abdou Karim Karou, Reyd Samuel, Zouaq Amal
- Lmflow: An Extensible Toolkit For Finetuning And Inference Of Large Foundation Models Diao Shizhe, Pan Rui, Dong Hanze, Shum Ka Shun, Zhang Jipeng, Xiong Wei, Zhang Tong
- Enhancing Chat Language Models By Scaling High-quality Instructional Conversations Ding Ning, Chen Yulin, Xu Bokai, Qin Yujia, Zheng Zhi, Hu Shengding, Liu Zhiyuan, Sun Maosong, Zhou Bowen
- Self-agreement: A Framework For Fine-tuning Language Models To Find Agreement Among Diverse Opinions Ding Shiyao, Ito Takayuki
- HPC-GPT: Integrating Large Language Model For High-performance Computing Ding Xianzhong, Chen Le, Emani Murali, Liao Chunhua, Lin Pei-hung, Vanderbruggen Tristan, Xie Zhen, Cerpa Alberto E., Du Wan
- Fluid Transformers And Creative Analogies: Exploring Large Language Models' Capacity For Augmenting Cross-domain Analogical Creativity Ding Zijian, Srinivasan Arvind, Macneil Stephen, Chan Joel
- Efficient Finetuning Large Language Models For Vietnamese Chatbot Doan Vu-thuan, Truong Quoc-truong, Nguyen Duc-vu, Nguyen Vinh-tiep, Luu Thuy-ngan Nguyen
- Establishing Performance Baselines In Fine-tuning, Retrieval-augmented Generation And Soft-prompting For Non-specialist LLM Users Dodgson Jennifer, Nanzheng Lin, Peh Julian, Pattirane Akira Rafhael Janson, Alhajir Alfath Daryl, Dinarto Eko Ridho, Lim Joseph, Ahmad Syed Danyal
- Blockwise Compression Of Transformer-based Models Without Retraining Dong Gaochen, Chen Wei
- Speechgpt: Empowering Large Language Models With Intrinsic Cross-modal Conversational Abilities Dong Zhang, Shimin Li, Xin Zhang, Jun Zhan, Pengyu Wang, Yaqian Zhou, Xipeng Qiu
- Loramoe: Alleviate World Knowledge Forgetting In Large Language Models Via Moe-style Plugin Dou Shihan, Zhou Enyu, Liu Yan, Gao Songyang, Zhao Jun, Shen Wei, Zhou Yuhao, Xi Zhiheng, Wang Xiao, Fan Xiaoran, Pu Shiliang, Zhu Jiang, Zheng Rui, Gui Tao, Zhang Qi, Huang Xuanjing
- Quantifying And Attributing The Hallucination Of Large Language Models Via Association Analysis Du Li, Wang Yequan, Xing Xingrun, Ya Yiqun, Li Xiang, Jiang Xin, Fang Xuezhi
- Guiding Pretraining In Reinforcement Learning With Large Language Models Du Yuqing, Watkins Olivia, Wang Zihan, Colas Cédric, Darrell Trevor, Abbeel Pieter, Gupta Abhishek, Andreas Jacob
- Alpacafarm: A Simulation Framework For Methods That Learn From Human Feedback Dubois Yann, Li Xuechen, Taori Rohan, Zhang Tianyi, Gulrajani Ishaan, Ba Jimmy, Guestrin Carlos, Liang Percy, Hashimoto Tatsunori B.
- Jacotext: A Pretrained Model For Java Code-text Generation Espejel Jessica López, Alassan Mahaman Sanoussi Yahaya, Dahhane Walid, Ettifouri El Hassane
- Large Language Models For Code Analysis: Do Llms Really Do Their Job? Fang Chongzhou, Miao Ning, Srivastav Shaurya, Liu Jialin, Zhang Ruoyu, Fang Ruijie, Asmita, Tsang Ryan, Nazari Najmeh, Wang Han, Homayoun Houman
- Revisiting Instruction Fine-tuned Model Evaluation To Guide Industrial Applications Faysse Manuel, Viaud Gautier, Hudelot Céline, Colombo Pierre
- Extending Context Window Of Large Language Models Via Semantic Compression Fei Weizhi, Niu Xueyan, Zhou Pingyi, Hou Lu, Bai Bo, Deng Lei, Han Wei
- Language Models Can Be Logical Solvers Feng Jiazhan, Xu Ruochen, Hao Junheng, Sharma Hiteshi, Shen Yelong, Zhao Dongyan, Chen Weizhu
- Llama Rider: Spurring Large Language Models To Explore The Open World Feng Yicheng, Wang Yuxuan, Liu Jiazheng, Zheng Sipeng, Lu Zongqing
- Vision Language Transformers: A Survey Fields Clayton, Kennington Casey
- Context-aware Meta-learning Fifty Christopher, Duan Dennis, Junkins Ronald G., Amid Ehsan, Leskovec Jure, Re Christopher, Thrun Sebastian
- Can Large Language Models Follow Concept Annotation Guidelines? A Case Study On Scientific And Financial Domains Fonseca Marcio, Cohen Shay B.
- A Challenger To GPT-4V? Early Explorations Of Gemini In Visual Expertise Fu Chaoyou, Zhang Renrui, Wang Zihan, Huang Yubo, Zhang Zhengye, Qiu Longtian, Ye Gaoxiang, Shen Yunhang, Zhang Mengdan, Chen Peixian, Zhao Sirui, Lin Shaohui, Jiang Deqiang, Yin Di, Gao Peng, Li Ke, Li Hongsheng, Sun Xing
- Kwaiyiimath: Technical Report Fu Jiayi, Lin Lei, Gao Xiaoyang, Liu Pengli, Chen Zhengzong, Yang Zhirui, Zhang Shengnan, Zheng Xue, Li Yan, Liu Yuliang, Ye Xucheng, Liao Yiqiao, Liao Chao, Chen Bin, Song Chengru, Wan Junchen, Lin Zijia, Zhang Fuzheng, Wang Zhongyuan, Zhang Di, Gai Kun
- Gpt4aigchip: Towards Next-generation AI Accelerator Design Automation Via Large Language Models Fu Yonggan, Zhang Yongan, Yu Zhongzhi, Li Sixu, Ye Zhifan, Li Chaojian, Wan Cheng, Lin Yingyan
- Badllama: Cheaply Removing Safety Fine-tuning From Llama 2-chat 13B Gade Pranav, Lermen Simon, Rogers-smith Charlie, Ladish Jeffrey
- The Inner Sentiments Of A Thought Gagne Chris, Dayan Peter
- Natural Language Commanding Via Program Synthesis Gandhi Apurva, Nguyen Thong Q., Jiao Huitian, Steen Robert, Bhatawdekar Ameya
- Text-to-sql Empowered By Large Language Models: A Benchmark Evaluation Gao Dawei, Wang Haibin, Li Yaliang, Sun Xiuyu, Qian Yichen, Ding Bolin, Zhou Jingren
- Real Customization Or Just Marketing: Are Customized Versions Of Chat GPT Useful? Garrido-merchán Eduardo C., Arroyo-barrigüete Jose L., Borrás-pala Francisco, Escobar-torres Leandro, De Ibarreta Carlos Martínez, Ortiz-lozano Jose María, Rua-vieites Antonio
- Ahead-of-time P-tuning Gavrilov Daniil, Balagansky Nikita
- Model Tells You What To Discard: Adaptive KV Cache Compression For Llms Ge Suyu, Zhang Yunan, Liu Liyuan, Zhang Minjia, Han Jiawei, Gao Jianfeng
- MART: Improving LLM Safety With Multi-round Automatic Red-teaming Ge Suyu, Zhou Chunting, Hou Rui, Khabsa Madian, Wang Yi-chia, Wang Qifan, Han Jiawei, Mao Yuning
- Ddcot: Duty-distinct Chain-of-thought Prompting For Multimodal Reasoning In Language Models Ge Zheng, Bin Yang, Jiajin Tang, Hong-yu Zhou, Sibei Yang
- Flacuna: Unleashing The Problem Solving Power Of Vicuna Using FLAN Fine-tuning Ghosal Deepanway, Chia Yew Ken, Majumder Navonil, Poria Soujanya
- Text-to-audio Generation Using Instruction-tuned LLM And Latent Diffusion Model Ghosal Deepanway, Majumder Navonil, Mehrish Ambuj, Poria Soujanya
- Mtuncertainty: Assessing The Need For Post-editing Of Machine Translation Outputs By Fine-tuning Openai Llms Gladkoff Serge, Han Lifeng, Erofeev Gleb, Sorokina Irina, Nenadic Goran
- Intention And Context Elicitation With Large Language Models In The Legal Aid Intake Process Goodson Nick, Lu Rongfei
- Lil-bevo: Explorations Of Strategies For Training Language Models In More Humanlike Ways Govindarajan Venkata S, Rodriguez Juan Diego, Bostrom Kaj, Mahowald Kyle
- AMAGO: Scalable In-context Reinforcement Learning For Adaptive Agents Grigsby Jake, Fan Linxi, Zhu Yuke
- Pokemqa: Programmable Knowledge Editing For Multi-hop Question Answering Gu Hengrui, Zhou Kaixiong, Han Xiaotian, Liu Ninghao, Wang Ruobing, Wang Xin
- Seer: Language Instructed Video Prediction With Latent Diffusion Models Gu Xianfan, Wen Chuan, Ye Weirui, Song Jiaming, Gao Yang
- Voyager: An Open-ended Embodied Agent With Large Language Models Guanzhi Wang, Yuqi Xie, Yunfan Jiang, Ajay Mandlekar, Chaowei Xiao, Yuke Zhu, Linxi Fan, Anima Anandkumar
- Hifi: High-information Attention Heads Hold For Parameter-efficient Model Adaptation Gui Anchun, Xiao Han
- SPT: Fine-tuning Transformer-based Language Models Efficiently With Sparsification Gui Yuntao, Yan Xiao, Yin Peiqi, Yang Han, Cheng James
- Beyond Imitation: Leveraging Fine-grained Quality Signals For Alignment Guo Geyang, Zhao Ranchi, Tang Tianyi, Zhao Wayne Xin, Wen Ji-rong
- Meta-tuning Llms To Leverage Lexical Knowledge For Generalizable Language Style Understanding Guo Ruohao, Xu Wei, Ritter Alan
- Compresso: Structured Pruning With Collaborative Prompting Learns Compact Large Language Models Guo Song, Xu Jiahang, Zhang Li Lyna, Yang Mao
- What Can Large Language Models Do In Chemistry? A Comprehensive Benchmark On Eight Tasks Guo Taicheng, Guo Kehan, Nan Bozhao, Liang Zhenwen, Guo Zhichun, Chawla Nitesh V., Wiest Olaf, Zhang Xiangliang
- Instruction Fusion: Advancing Prompt Evolution Through Hybridization Guo Weidong, Yang Jiuding, Yang Kaitong, Li Xiangyang, Rao Zhuwei, Xu Yu, Niu Di
- Learning To Plan With Natural Language Guo Yiduo, Liang Yaobo, Wu Chenfei, Wu Wenshan, Zhao Dongyan, Duan Nan
- Improving Small Language Models On Pubmedqa Via Generative Data Augmentation Guo Zhen, Wang Peiqi, Wang Yanwei, Yu Shangdi
- Point-bind & Point-llm: Aligning Point Cloud With Multi-modality For 3D Understanding, Generation, And Instruction Following Guo Ziyu, Zhang Renrui, Zhu Xiangyang, Tang Yiwen, Ma Xianzheng, Han Jiaming, Chen Kexin, Gao Peng, Li Xianzhi, Li Hongsheng, Heng Pheng-ann
- Editing Common Sense In Transformers Gupta Anshita, Mondal Debanjan, Sheshadri Akshay Krishna, Zhao Wenlong, Li Xiang Lorraine, Wiegreffe Sarah, Tandon Niket
- Grace: Generation Using Associated Code Edits Gupta Priyanshu, Khare Avishree, Bajpai Yasharth, Chakraborty Saikat, Gulwani Sumit, Kanade Aditya, Radhakrishna Arjun, Soares Gustavo, Tiwari Ashish
- Gistscore: Learning Better Representations For In-context Example Selection With Gist Bottlenecks Gupta Shivanshu, Rosenbaum Clemens, Elenberg Ethan R.
- Evaluating Concurrent Robustness Of Language Models Across Diverse Challenge Sets Gupta Vatsal, Pandya Pranshu, Kataria Tushar, Gupta Vivek, Roth Dan
- A Survey Of Vision-language Pre-training From The Lens Of Multimodal Machine Translation Gwinnup Jeremy, Duh Kevin
- Opiniongpt: Modelling Explicit Biases In Instruction-tuned Llms Haller Patrick, Aynetdinov Ansar, Akbik Alan
- Llms As Visual Explainers: Advancing Image Classification With Evolving Visual Descriptions Han Songhao, Zhuo Le, Liao Yue, Liu Si
- Medalpaca -- An Open-source Collection Of Medical Conversational AI Models And Training Data Han Tianyu, Adams Lisa C., Papaioannou Jens-michalis, Grundmann Paul, Oberhauser Tom, Löser Alexander, Truhn Daniel, Bressem Keno K.
- In-context Alignment: Chat With Vanilla Language Models Before Fine-tuning Han Xiaochuang
- Chartllama: A Multimodal LLM For Chart Understanding And Generation Han Yucheng, Zhang Chi, Chen Xin, Yang Xu, Wang Zhibin, Yu Gang, Fu Bin, Zhang Hanwang
- Personalisation Within Bounds: A Risk Taxonomy And Policy Framework For The Alignment Of Large Language Models With Personalised Feedback Hannah Rose Kirk, Bertie Vidgen, Paul Röttger, Scott A. Hale
- Boosting Large Language Model For Speech Synthesis: An Empirical Study Hao Hongkun, Zhou Long, Liu Shujie, Li Jinyu, Hu Shujie, Wang Rui, Wei Furu
- Vico: Plug-and-play Visual Condition For Personalized Text-to-image Generation Hao Shaozhe, Han Kai, Zhao Shihao, Wong Kwan-yee K.
- Towards Efficient Vision-language Tuning: More Information Density, More Generalizability Hao Tianxiang, Lyu Mengyao, Chen Hui, Zhao Sicheng, Ding Xiaohan, Han Jungong, Ding Guiguang
- Unlocking The Potential Of Chatgpt: A Comprehensive Exploration Of Its Applications, Advantages, Limitations, And Future Directions In Natural Language Processing Hariri Walid
- Zero- And Few-shot Prompting With Llms: A Comparative Study With Fine-tuned Models For Bangla Sentiment Analysis Hasan Md. Arid, Das Shudipta, Anjum Afiyat, Alam Firoj, Anjum Anika, Sarker Avijit, Noori Sheak Rashed Haider
- Chatgpt For Zero-shot Dialogue State Tracking: A Solution Or An Opportunity? Heck Michael, Lubis Nurul, Ruppik Benjamin, Vukovic Renato, Feng Shutong, Geishauser Christian, Lin Hsien-chin, Van Niekerk Carel, Gašić Milica
- Divide Et Impera: Multi-transformer Architectures For Complex Nlp-tasks Helland Solveig, Gavagnin Elena, De Spindler Alexandre
- Simplemtod: A Simple Language Model For Multimodal Task-oriented Dialogue With Symbolic Scene Representation Hemanthage Bhathiya, Dondrup Christian, Bartie Phil, Lemon Oliver
- Stochastic Parrots Looking For Stochastic Parrots: Llms Are Easy To Fine-tune And Hard To Detect With Other Llms Henrique Da Silva Gameiro, Kucharavy Andrei, Guerraoui Rachid
- Assessing The Efficacy Of Large Language Models In Generating Accurate Teacher Responses Hicke Yann, Masand Abhishek, Guo Wentao, Gangavarapu Tushaar
- Do Compressed Llms Forget Knowledge? An Experimental Study With Practical Implications Hoang Duc N. M, Cho Minsik, Merth Thomas, Rastegari Mohammad, Wang Zhangyang
- Zero-shot Goal-directed Dialogue Via RL On Imagined Conversations Hong Joey, Levine Sergey, Dragan Anca
- CIEM: Contrastive Instruction Evaluation Method For Better Instruction Tuning Hongyu Hu, Jiyuan Zhang, Minyi Zhao, Zhenbang Sun
- COMMA: Co-articulated Multi-modal Learning Hu Lianyu, Gao Liqing, Liu Zekang, Pun Chi-man, Feng Wei
- Meta-learning Online Adaptation Of Language Models Hu Nathan, Mitchell Eric, Manning Christopher D., Finn Chelsea
- Scaled Prompt-tuning For Few-shot Natural Language Generation Hu Ting, Meinel Christoph, Yang Haojin
- Llm-adapters: An Adapter Family For Parameter-efficient Fine-tuning Of Large Language Models Hu Zhiqiang, Wang Lei, Lan Yihuai, Xu Wanyu, Lim Ee-peng, Bing Lidong, Xu Xing, Poria Soujanya, Lee Roy Ka-wei
- Lorahub: Efficient Cross-task Generalization Via Dynamic Lora Composition Huang Chengsong, Liu Qian, Lin Bill Yuchen, Pang Tianyu, Du Chao, Lin Min
- Acegpt, Localizing Large Language Models In Arabic Huang Huang, Yu Fei, Zhu Jianqing, Sun Xuening, Cheng Hao, Song Dingjie, Chen Zhihong, Alharthi Abdulmohsen, An Bang, He Juncai, Liu Ziche, Zhang Zhiyi, Chen Junying, Li Jianquan, Wang Benyou, Zhang Lian, Sun Ruoyu, Wan Xiang, Li Haizhou, Xu Jinchao
- Revisiting The Reliability Of Psychological Scales On Large Language Models Huang Jen-tse, Wang Wenxuan, Lam Man Ho, Li Eric John, Jiao Wenxiang, Lyu Michael R.
- T2i-compbench: A Comprehensive Benchmark For Open-world Compositional Text-to-image Generation Huang Kaiyi, Sun Kaiyue, Xie Enze, Li Zhenguo, Liu Xihui
- Lawyer Llama Technical Report Huang Quzhe, Tao Mingxu, Zhang Chen, An Zhenwei, Jiang Cong, Chen Zhibin, Wu Zirui, Feng Yansong
- Adapting Pre-trained Language Models To Vision-language Tasks Via Dynamic Visual Prompting Huang Shubin, Wu Qiong, Zhou Yiyi, Chen Weijie, Zhang Rongsheng, Sun Xiaoshuai, Ji Rongrong
- Fewer Is More: Boosting LLM Reasoning With Reinforced Context Pruning Huang Xijie, Zhang Li Lyna, Cheng Kwang-ting, Yang Fan, Yang Mao
- Competition-level Problems Are Effective LLM Evaluators Huang Yiming, Lin Zhenghao, Liu Xiao, Gong Yeyun, Lu Shuai, Lei Fangyu, Liang Yaobo, Shen Yelong, Lin Chen, Duan Nan, Chen Weizhu
- Llama 2: Open Foundation And Fine-tuned Chat Models Hugo Touvron, Louis Martin, Kevin Stone, Peter Albert, Amjad Almahairi, Yasmine Babaei, Nikolay Bashlykov, Soumya Batra, Prajjwal Bhargava, Shruti Bhosale, Dan Bikel, Lukas Blecher, Cristian Canton Ferrer, Moya Chen, Guillem Cucurull, David Esiobu, Jude Fernandes, Jeremy Fu, Wenyin Fu, Brian Fuller, Cynthia Gao, Vedanuj Goswami, Naman Goyal, Anthony Hartshorn, Saghar Hosseini, Rui Hou, Hakan Inan, Marcin Kardas, Viktor Kerkez, Madian Khabsa, Isabel Kloumann, Artem Korenev, Punit Singh Koura, Marie-anne Lachaux, Thibaut Lavril, Jenya Lee, Diana Liskovich, Yinghai Lu, Yuning Mao, Xavier Martinet, Todor Mihaylov, Pushkar Mishra, Igor Molybog, Yixin Nie, Andrew Poulton, Jeremy Reizenstein, Rashi Rungta, Kalyan Saladi, Alan Schelten, Ruan Silva, Eric Michael Smith, Ranjan Subramanian, Xiaoqing Ellen Tan, Binh Tang, Ross Taylor, Adina Williams, Jian Xiang Kuan, Puxin Xu, Zheng Yan, Iliyan Zarov, Yuchen Zhang, Angela Fan, Melanie Kambadur, Sharan Narang, Aurelien Rodriguez, Robert Stojnic, Sergey Edunov, Thomas Scialom
- TADA: Efficient Task-agnostic Domain Adaptation For Transformers Hung Chia-chien, Lange Lukas, Strötgen Jannik
- Conceptual Model Interpreter For Large Language Models Härer Felix
- Minimizing Factual Inconsistency And Hallucination In Large Language Models I Muneeswaran, Saxena Shreya, Prasad Siva, Prakash M V Sai, Shankar Advaith, V Varun, Vaddina Vishal, Gopalakrishnan Saisubramaniam
- Llama Guard: Llm-based Input-output Safeguard For Human-ai Conversations Inan Hakan, Upasani Kartikeya, Chi Jianfeng, Rungta Rashi, Iyer Krithika, Mao Yuning, Tontchev Michael, Hu Qing, Fuller Brian, Testuggine Davide, Khabsa Madian
- Pushing Boundaries: Exploring Zero Shot Object Classification With Large Multimodal Models Islam Ashhadul, Biswas Md. Rafiul, Zaghouani Wajdi, Belhaouari Samir Brahim, Shah Zubair
- Emergent Ai-assisted Discourse: Case Study Of A Second Language Writer Authoring With Chatgpt Jacob Sharin, Tate Tamara, Warschauer Mark
- Cotran: An Llm-based Code Translator Using Reinforcement Learning With Feedback From Compiler And Symbolic Execution Jana Prithwish, Jha Piyush, Ju Haoyang, Kishore Gautham, Mahajan Aryan, Ganesh Vijay
- Chatgpt Is Fun, But It Is Not Funny! Humor Is Still Challenging Large Language Models Jentzsch Sophie, Kersting Kristian
- Chit-chat Or Deep Talk: Prompt Engineering For Process Mining Jessen Urszula, Sroka Michal, Fahland Dirk
- Exploring The Impact Of Instruction Data Scaling On Large Language Models: An Empirical Study On Real-world Use Cases Ji Yunjie, Deng Yong, Gong Yan, Peng Yiping, Niu Qiang, Zhang Lei, Ma Baochang, Li Xiangang
- Legal Syllogism Prompting: Teaching Large Language Models For Legal Judgment Prediction Jiang Cong, Yang Xiaolei
- Tigerscore: Towards Building Explainable Metric For All Text Generation Tasks Jiang Dongfu, Li Yishan, Zhang Ge, Huang Wenhao, Lin Bill Yuchen, Chen Wenhu
- Graphologue: Exploring Large Language Model Responses With Interactive Diagrams Jiang Peiling, Rayan Jude, Dow Steven P., Xia Haijun
- Forcing Generative Models To Degenerate Ones: The Power Of Data Poisoning Attacks Jiang Shuli, Kadhe Swanand Ravindra, Zhou Yi, Cai Ling, Baracaldo Nathalie
- Scaling Sentence Embeddings With Large Language Models Jiang Ting, Huang Shaohan, Luan Zhongzhi, Wang Deqing, Zhuang Fuzhen
- Hykge: A Hypothesis Knowledge Graph Enhanced Framework For Accurate And Reliable Medical Llms Responses Jiang Xinke, Zhang Ruizhe, Xu Yongxin, Qiu Rihong, Fang Yue, Wang Zhiyuan, Tang Jinyi, Ding Hongxin, Chu Xu, Zhao Junfeng, Wang Yasha
- Panda LLM: Training Data And Evaluation For Open-sourced Chinese Instruction-following Large Language Models Jiao Fangkai, Ding Bosheng, Luo Tianze, Mo Zhanfeng
- Exploring Self-supervised Logic-enhanced Training For Large Language Models Jiao Fangkai, Teng Zhiyang, Ding Bosheng, Liu Zhengyuan, Chen Nancy F., Joty Shafiq
- Knowledge-augmented Large Language Models For Personalized Contextual Query Suggestion Jinheon Baek, Nirupama Chandrasekaran, Silviu Cucerzan, Allen Herring, Sujay Kumar Jauhar
- Teaching Language Models To Hallucinate Less With Synthetic Tasks Jones Erik, Palangi Hamid, Simões Clarisse, Chandrasekaran Varun, Mukherjee Subhabrata, Mitra Arindam, Awadallah Ahmed, Kamar Ece
- Small Language Models Fine-tuned To Coordinate Larger Language Models Improve Complex Reasoning Juneja Gurusha, Dutta Subhabrata, Chakrabarti Soumen, Manchanda Sunny, Chakraborty Tanmoy
- Synthetic Data Generation In Low-resource Settings Via Fine-tuning Of Large Language Models Kaddour Jean, Liu Qi
- Domain-specific Code Language Models: Unraveling The Potential For HPC Codes And Tasks Kadosh Tal, Hasabnis Niranjan, Vo Vy A., Schneider Nadav, Krien Neva, Capota Mihai, Wasay Abdul, Ahmed Nesreen, Willke Ted, Tamir Guy, Pinter Yuval, Mattson Timothy, Oren Gal
- Contextual Code Switching For Machine Translation Using Language Models Kaji Arshad, Shah Manan
- A Rank Stabilization Scaling Factor For Fine-tuning With Lora Kalajdzievski Damjan
- Swectrl-mini: A Data-transparent Transformer-based Large Language Model For Controllable Text Generation In Swedish Kalpakchi Dmytro, Boye Johan
- A Survey Of GPT-3 Family Large Language Models Including Chatgpt And GPT-4 Kalyan Katikapalli Subramanyam
- Reducing Sequence Length By Predicting Edit Operations With Large Language Models Kaneko Masahiro, Okazaki Naoaki
- Llm-augmented Preference Learning From Natural Language Kang Inwon, Ruan Sikai, Ho Tyler, Lin Jui-chien, Mohsin Farhad, Seneviratne Oshani, Xia Lirong
- Knowledge-augmented Reasoning Distillation For Small Language Models In Knowledge-intensive Tasks Kang Minki, Lee Seanie, Baek Jinheon, Kawaguchi Kenji, Hwang Sung Ju
- Sorted Llama: Unlocking The Potential Of Intermediate Layers Of Large Language Models For Dynamic Inference Kavehzadeh Parsa, Valipour Mojtaba, Tahaei Marzieh, Ghodsi Ali, Chen Boxing, Rezagholizadeh Mehdi
- Critiquellm: Towards An Informative Critique Generation Model For Evaluation Of Large Language Model Generation Ke Pei, Wen Bosi, Feng Zhuoer, Liu Xiao, Lei Xuanyu, Cheng Jiale, Wang Shengyuan, Zeng Aohan, Dong Yuxiao, Wang Hongning, Tang Jie, Huang Minlie
- Turning English-centric Llms Into Polyglots: How Much Multilinguality Is Needed? Kew Tannon, Schottmann Florian, Sennrich Rico
- Zzzgpt: An Interactive GPT Approach To Enhance Sleep Quality Khaokaew Yonchanok, Ji Kaixin, Nguyen Thuc Hanh, Kegalle Hiruni, Alaofi Marwah, Xue Hao, Salim Flora D.
- Reliability Check: An Analysis Of GPT-3's Response To Sensitive Topics And Prompt Wording Khatun Aisha, Brown Daniel G.
- Fantom: A Benchmark For Stress-testing Machine Theory Of Mind In Interactions Kim Hyunwoo, Sclar Melanie, Zhou Xuhui, Bras Ronan Le, Kim Gunhee, Choi Yejin, Sap Maarten
- Compressed Context Memory For Online Language Model Interaction Kim Jang-hyun, Yeom Junyoung, Yun Sangdoo, Song Hyun Oh
- Memory-efficient Fine-tuning Of Compressed Large Language Models Via Sub-4-bit Integer Quantization Kim Jeonghoon, Lee Jung Hyun, Kim Sungdong, Park Joonsuk, Yoo Kang Min, Kwon Se Jung, Lee Dongsoo
- Cotever: Chain Of Thought Prompting Annotation Toolkit For Explanation Verification Kim Seungone, Joo Se June, Jang Yul, Chae Hyungjoo, Yeo Jinyoung
- Finequant: Unlocking Efficiency With Fine-grained Weight-only Quantization For Llms Kim Young Jin, Henry Rawn, Fahim Raffy, Awadalla Hany Hassan
- Building A Llama2-finetuned LLM For Odia Language Utilizing Domain Knowledge Instruction Set Kohli Guneet Singh, Parida Shantipriya, Sekhar Sambit, Saha Samirit, Nair Nipun B, Agarwal Parul, Khosla Sonal, Patiyal Kusumlata, Dhal Debasish
- Contextual Refinement Of Translations: Large Language Models For Sentence And Document-level Post-editing Koneru Sai, Exel Miriam, Huck Matthias, Niehues Jan
- NOLA: Compressing Lora Using Linear Combination Of Random Basis Koohpayegani Soroush Abbasi, Navaneet Kl, Nooralinejad Parsa, Kolouri Soheil, Pirsiavash Hamed
- Humans In Humans Out: On GPT Converging Toward Common Sense In Both Success And Failure Koralus Philipp, Wang-maścianica Vincent
- Understanding Catastrophic Forgetting In Language Models Via Implicit Inference Kotha Suhas, Springer Jacob Mitchell, Raghunathan Aditi
- L-TUNING: Synchronized Label Tuning For Prompt And Prefix In Llms Kowsher Md., Sobuj Md. Shohanur Islam, Mahmud Asif, Prottasha Nusrat Jahan, Bhat Prakash
- Chatgpt For PLC/DCS Control Logic Generation Koziolek Heiko, Gruener Sten, Ashiwal Virendra
- Implicit Meta-learning May Lead Language Models To Trust More Reliable Sources Krasheninnikov Dmitrii, Krasheninnikov Egor, Mlodozeniec Bruno, Maharaj Tegan, Krueger David
- Leveraging Few-shot Data Augmentation And Waterfall Prompting For Response Generation Krause Lea, Santamaría Selene Báez, Van Der Meer Michiel, Khurana Urja
- Prompt Generate Train (PGT): Few-shot Domain Adaption Of Retrieval Augmented Generation Models For Open Book Question-answering Krishna C. S.
- In-context Explainers: Harnessing Llms For Explaining Black Box Models Kroeger Nicholas, Ley Dan, Krishna Satyapriya, Agarwal Chirag, Lakkaraju Himabindu
- Fundamentals Of Generative Large Language Models And Perspectives In Cyber-defense Kucharavy Andrei, Schillaci Zachary, Maréchal Loïc, Würsch Maxime, Dolamic Ljiljana, Sabonnadiere Remi, David Dimitri Percia, Mermoud Alain, Lenders Vincent
- Lavip: Language-grounded Visual Prompts Kunananthaseelan Nilakshan, Zhang Jing, Harandi Mehrtash
- Sensi-bert: Towards Sensitivity Driven Fine-tuning For Parameter-efficient BERT Kundu Souvik, Sridhar Sharath Nittur, Szankin Maciej, Sundaresan Sairam
- Large Language Models On The Chessboard: A Study On Chatgpt's Formal Language Comprehension And Complex Reasoning Skills Kuo Mu-tien, Hsueh Chih-chung, Tsai Richard Tzong-han
- Sparse Fine-tuning For Inference Acceleration Of Large Language Models Kurtic Eldar, Kuznedelev Denis, Frantar Elias, Goin Michael, Alistarh Dan
- M4LE: A Multi-ability Multi-range Multi-task Multi-domain Long-context Evaluation Benchmark For Large Language Models Kwan Wai-chung, Zeng Xingshan, Wang Yufei, Sun Yusen, Li Liangyou, Shang Lifeng, Liu Qun, Wong Kam-fai
- Longform: Effective Instruction Tuning With Reverse Instructions Köksal Abdullatif, Schick Timo, Korhonen Anna, Schütze Hinrich
- Openassistant Conversations -- Democratizing Large Language Model Alignment Köpf Andreas, Kilcher Yannic, Von Rütte Dimitri, Anagnostidis Sotiris, Tam Zhi-rui, Stevens Keith, Barhoum Abdullah, Duc Nguyen Minh, Stanley Oliver, Nagyfi Richárd, Es Shahul, Suri Sameer, Glushkov David, Dantuluri Arnav, Maguire Andrew, Schuhmann Christoph, Nguyen Huu, Mattick Alexander
- LISA: Reasoning Segmentation Via Large Language Model Lai Xin, Tian Zhuotao, Chen Yukang, Li Yanwei, Yuan Yuhui, Liu Shu, Jia Jiaya
- A Data Source For Reasoning Embodied Agents Lanchantin Jack, Sukhbaatar Sainbayar, Synnaeve Gabriel, Sun Yuxuan, Srinet Kavya, Szlam Arthur
- Fine-tuning Chatgpt For Automatic Scoring Latif Ehsan, Zhai Xiaoming
- Building Efficient Universal Classifiers With Natural Language Inference Laurer Moritz, Van Atteveldt Wouter, Casas Andreu, Welbers Kasper
- Platypus: Quick, Cheap, And Powerful Refinement Of Llms Lee Ariel N., Hunter Cole J., Ruiz Nataniel
- Prompted Llms As Chatbot Modules For Long Open-domain Conversation Lee Gibbeum, Hartmann Volker, Park Jongho, Papailiopoulos Dimitris, Lee Kangwook
- Can Large Language Models Capture Dissenting Human Voices? Lee Noah, An Na Min, Thorne James
- Chain Of Empathy: Enhancing Empathetic Response Of Large Language Models Based On Psychotherapy Models Lee Yoon Kyung, Lee Inju, Shin Minjung, Bae Seoyeon, Hahn Sowon
- Chain Of Natural Language Inference For Reducing Large Language Model Ungrounded Hallucinations Lei Deren, Li Yaxi, Hu Mengya, Wang Mingyu, Yun Vincent, Ching Emily, Kamal Eslam
- Mitigating Fine-grained Hallucination By Fine-tuning Large Vision-language Models With Caption Rewrites Lei Wang, Jiabang He, Shenshen Li, Ning Liu, Ee-peng Lim
- T-sciq: Teaching Multimodal Chain-of-thought Reasoning Via Mixed Large Language Model Signals For Science Question Answering Lei Wang, Yi Hu, Jiabang He, Xing Xu, Ning Liu, Hui Liu, Heng Tao Shen
- Recexplainer: Aligning Large Language Models For Explaining Recommendation Models Lei Yuxuan, Lian Jianxun, Yao Jing, Huang Xu, Lian Defu, Xie Xing
- The Language Of Prompting: What Linguistic Properties Make A Prompt Successful? Leidinger Alina, Van Rooij Robert, Shutova Ekaterina
- The Eval4nlp 2023 Shared Task On Prompting Large Language Models As Explainable Metrics Leiter Christoph, Opitz Juri, Deutsch Daniel, Gao Yang, Dror Rotem, Eger Steffen
- Self-detoxifying Language Models Via Toxification Reversal Leong Chak Tou, Cheng Yi, Wang Jiashuo, Wang Jian, Li Wenjie
- Lora Fine-tuning Efficiently Undoes Safety Training In Llama 2-chat 70B Lermen Simon, Rogers-smith Charlie, Ladish Jeffrey
- Sequential Monte Carlo Steering Of Large Language Models Using Probabilistic Programs Lew Alexander K., Zhi-xuan Tan, Grand Gabriel, Mansinghka Vikash K.
- Vision-language Instruction Tuning: A Review And Analysis Li Chen, Ge Yixiao, Li Dian, Shan Ying
- Learning To Rewrite Prompts For Personalized Text Generation Li Cheng, Zhang Mingyang, Mei Qiaozhu, Kong Weize, Bendersky Michael
- Mugglemath: Assessing The Impact Of Query And Response Augmentation On Math Reasoning Li Chengpeng, Yuan Zheng, Yuan Hongyi, Dong Guanting, Lu Keming, Wu Jiancan, Tan Chuanqi, Wang Xiang, Zhou Chang
- DISTFLASHATTN: Distributed Memory-efficient Attention For Long-context Llms Training Li Dacheng, Shao Rulin, Xie Anze, Xing Eric P., Ma Xuezhe, Stoica Ion, Gonzalez Joseph E., Zhang Hao
- Privacy In Large Language Models: Attacks, Defenses And Future Directions Li Haoran, Chen Yulin, Luo Jinglong, Kang Yan, Zhang Xiaojin, Hu Qi, Chan Chunkit, Song Yangqiu
- Mastering Robot Manipulation With Multimodal Prompts Through Pretraining And Multi-task Fine-tuning Li Jiachen, Gao Qiaozi, Johnston Michael, Gao Xiaofeng, He Xuehai, Shakiah Suhaila, Shi Hangjie, Ghanadan Reza, Wang William Yang
- CFGPT: Chinese Financial Assistant With Large Language Model Li Jiangtong, Bian Yuxuan, Wang Guoxuan, Lei Yang, Cheng Dawei, Ding Zhijun, Jiang Changjun
- Evcap: Retrieval-augmented Image Captioning With External Visual-name Memory For Open-world Comprehension Li Jiaxuan, Vo Duc Minh, Sugimoto Akihiro, Nakayama Hideki
- Prefix Propagation: Parameter-efficient Tuning For Long Sequences Li Jonathan, Aitken Will, Bhambhoria Rohan, Zhu Xiaodan
- MAQA: A Multimodal QA Benchmark For Negation Li Judith Yue, Jansen Aren, Huang Qingqing, Lee Joonseok, Ganti Ravi, Kuzmin Dima
- Fine-tuning Multimodal Llms To Follow Zero-shot Demonstrative Instructions Li Juncheng, Pan Kaihang, Ge Zhiqi, Gao Minghe, Ji Wei, Zhang Wenqiao, Chua Tat-seng, Tang Siliang, Zhang Hanwang, Zhuang Yueting
- Openba: An Open-sourced 15B Bilingual Asymmetric Seq2seq Model Pre-trained From Scratch Li Juntao, Tang Zecheng, Ding Yuyang, Wang Pinzheng, Guo Pei, You Wangjie, Qiao Dan, Chen Wenliang, Fu Guohong, Zhu Qiaoming, Zhou Guodong, Zhang Min
- Instructcoder: Instruction Tuning Large Language Models For Code Editing Li Kaixin, Hu Qisheng, Zhao Xu, Chen Hui, Xie Yuxi, Liu Tiedong, Xie Qizhe, He Junxian
- Mvbench: A Comprehensive Multi-modal Video Understanding Benchmark Li Kunchang, Wang Yali, He Yinan, Li Yizhuo, Wang Yi, Liu Yi, Wang Zun, Xu Jilan, Chen Guo, Luo Ping, Wang Limin, Qiao Yu
- Watermarking Llms With Weight Quantization Li Linyang, Jiang Botian, Wang Pengyu, Ren Ke, Yan Hang, Qiu Xipeng
- Reflection-tuning: Data Recycling Improves LLM Instruction-tuning Li Ming, Chen Lichang, Chen Jiuhai, He Shwai, Huang Heng, Gu Jiuxiang, Zhou Tianyi
- Skillgpt: A Restful API Service For Skill Extraction And Standardization Using A Large Language Model Li Nan, Kang Bo, De Bie Tijl
- Laffi: Leveraging Hybrid Natural Language Feedback For Fine-tuning Language Models Li Qianxi, Cao Yingyue, Kang Jikun, Yang Tianpei, Chen Xi, Jin Jun, Taylor Matthew E.
- Vlm-eval: A General Evaluation On Video Large Language Models Li Shuailin, Zhang Yuang, Zhao Yucheng, Wang Qiuyue, Jia Fan, Liu Yingfei, Wang Tiancai
- Tailoring Personality Traits In Large Language Models Via Unsupervisedly-built Personalized Lexicons Li Tianlong, Dou Shihan, Lv Changze, Liu Wenhao, Xu Jianhan, Wu Muling, Ling Zixuan, Zheng Xiaoqing, Huang Xuanjing
- Benchmarking And Improving Generator-validator Consistency Of Language Models Li Xiang Lisa, Shrivastava Vaishnavi, Li Siyan, Hashimoto Tatsunori, Liang Percy
- Mot: Memory-of-thought Enables Chatgpt To Self-improve Li Xiaonan, Qiu Xipeng
- Exploring Fine-tuning Chatgpt For News Recommendation Li Xinyi, Zhang Yongfeng, Malthouse Edward C.
- PBNR: Prompt-based News Recommender System Li Xinyi, Zhang Yongfeng, Malthouse Edward C.
- On Bilingual Lexicon Induction With Large Language Models Li Yaoyiran, Korhonen Anna, Vulić Ivan
- A Survey On Fairness In Large Language Models Li Yingji, Du Mengnan, Song Rui, Wang Xin, Wang Ying
- Loftq: Lora-fine-tuning-aware Quantization For Large Language Models Li Yixiao, Yu Yifan, Liang Chen, He Pengcheng, Karampatziakis Nikos, Chen Weizhu, Zhao Tuo
- Prompting Large Language Models For Counterfactual Generation: An Empirical Study Li Yongqi, Xu Mayi, Miao Xin, Zhou Shen, Qian Tieyun
- Advancing Precise Outline-conditioned Text Generation With Task Duality And Explicit Outline Control Li Yunzhe, Chen Qian, Yan Weixiang, Wang Wen, Zhang Qinglin, Sundaram Hari
- Guiding Large Language Models Via Directional Stimulus Prompting Li Zekun, Peng Baolin, He Pengcheng, Galley Michel, Gao Jianfeng, Yan Xifeng
- QFT: Quantized Full-parameter Tuning Of Llms With Affordable Resources Li Zhikai, Liu Xiaoxuan, Zhu Banghua, Dong Zhen, Gu Qingyi, Keutzer Kurt
- Unveiling The Pitfalls Of Knowledge Editing For Large Language Models Li Zhoubo, Zhang Ningyu, Yao Yunzhi, Wang Mengru, Chen Xi, Chen Huajun
- Label Supervised Llama Finetuning Li Zongxi, Li Xianming, Liu Yuzhang, Xie Haoran, Li Jing, Wang Fu-lee, Li Qing, Zhong Xiaoqin
- Towards End-to-end Embodied Decision Making Via Multi-modal Large Language Model: Explorations With Gpt4-vision And Beyond Liang Chen, Yichi Zhang, Shuhuai Ren, Haozhe Zhao, Zefan Cai, Yuchi Wang, Peiyi Wang, Tianyu Liu, Baobao Chang
- Prompting Large Language Models With Chain-of-thought For Few-shot Knowledge Base Question Generation Liang Yuanyuan, Wang Jianing, Zhu Hanlun, Wang Lei, Qian Weining, Lan Yunshi
- Make Pre-trained Model Reversible: From Parameter To Memory Efficient Fine-tuning Liao Baohao, Tan Shaomu, Monz Christof
- Opening Up Chatgpt: Tracking Openness, Transparency, And Accountability In Instruction-tuned Text Generators Liesenfeld Andreas, Lopez Alianda, Dingemanse Mark
- STEVE-1: A Generative Model For Text-to-behavior In Minecraft Lifshitz Shalev, Paster Keiran, Chan Harris, Ba Jimmy, Mcilraith Sheila
- A Survey On Large Language Models For Recommendation Likang Wu, Zhi Zheng, Zhaopeng Qiu, Hao Wang, Hongchao Gu, Tingjia Shen, Chuan Qin, Chen Zhu, Hengshu Zhu, Qi Liu, Hui Xiong, Enhong Chen
- The Unlocking Spell On Base Llms: Rethinking Alignment Via In-context Learning Lin Bill Yuchen, Ravichander Abhilasha, Lu Ximing, Dziri Nouha, Sclar Melanie, Chandu Khyathi, Bhagavatula Chandra, Choi Yejin
- Beneath The Surface: Unveiling Harmful Memes With Multimodal Reasoning Distilled From Large Language Models Lin Hongzhan, Luo Ziyang, Ma Jing, Chen Long
- VILA: On Pre-training For Visual Language Models Lin Ji, Yin Hongxu, Ping Wei, Lu Yao, Molchanov Pavlo, Tao Andrew, Mao Huizi, Kautz Jan, Shoeybi Mohammad, Han Song
- Few-shot Adaptation For Parsing Contextual Utterances With Llms Lin Kevin, Xia Patrick, Fang Hao
- RA-DIT: Retrieval-augmented Dual Instruction Tuning Lin Xi Victoria, Chen Xilun, Chen Mingda, Shi Weijia, Lomeli Maria, James Rich, Rodriguez Pedro, Kahn Jacob, Szilvasy Gergely, Lewis Mike, Zettlemoyer Luke, Yih Scott
- Pushing Large Language Models To The 6G Edge: Vision, Challenges, And Opportunities Lin Zheng, Qu Guanqiao, Chen Qiyuan, Chen Xianhao, Chen Zhe, Huang Kaibin
- SPHINX: The Joint Mixing Of Weights, Tasks, And Visual Embeddings For Multi-modal Large Language Models Lin Ziyi, Liu Chris, Zhang Renrui, Gao Peng, Qiu Longtian, Xiao Han, Qiu Han, Lin Chen, Shao Wenqi, Chen Keqin, Han Jiaming, Huang Siyuan, Zhang Yichi, He Xuming, Li Hongsheng, Qiao Yu
- Unleashing The Creative Mind: Language Model As Hierarchical Policy For Improved Exploration On Challenging Problem Solving Ling Zhan, Fang Yunhao, Li Xuanlin, Mu Tongzhou, Lee Mingu, Pourreza Reza, Memisevic Roland, Su Hao
- MMC: Advancing Multimodal Chart Understanding With Large-scale Instruction Tuning Liu Fuxiao, Wang Xiaoyang, Yao Wenlin, Chen Jianshu, Song Kaiqiang, Cho Sangwoo, Yacoob Yaser, Yu Dong
- Examining Llms' Uncertainty Expression Towards Questions Outside Parametric Knowledge Liu Genglin, Wang Xingyao, Yuan Lifan, Chen Yangyi, Peng Hao
- Logicot: Logical Chain-of-thought Instruction-tuning Liu Hanmeng, Teng Zhiyang, Cui Leyang, Zhang Chaoli, Zhou Qiji, Zhang Yue
- Glore: Evaluating Logical Reasoning Of Large Language Models Liu Hanmeng, Teng Zhiyang, Ning Ruoxi, Liu Jian, Zhou Qiji, Zhang Yue
- Exploration With Principles For Diverse AI Supervision Liu Hao, Zaharia Matei, Abbeel Pieter
- Contrastive Vision-language Alignment Makes Efficient Instruction Learner Liu Lizhao, Sun Xinyu, Xiang Tianhang, Zhuang Zhuangwei, Yin Liuren, Tan Mingkui
- Chipnemo: Domain-adapted Llms For Chip Design Liu Mingjie, Ene Teodor-dumitru, Kirby Robert, Cheng Chris, Pinckney Nathaniel, Liang Rongjian, Alben Jonah, Anand Himyanshu, Banerjee Sanmitra, Bayraktaroglu Ismet, Bhaskaran Bonita, Catanzaro Bryan, Chaudhuri Arjun, Clay Sharon, Dally Bill, Dang Laura, Deshpande Parikshit, Dhodhi Siddhanth, Halepete Sameer, Hill Eric, Hu Jiashang, Jain Sumit, Jindal Ankit, Khailany Brucek, Kokai George, Kunal Kishor, Li Xiaowei, Lind Charley, Liu Hao, Oberman Stuart, Omar Sujeet, Pasandi Ghasem, Pratty Sreedhar, Raiman Jonathan, Sarkar Ambar, Shao Zhengjiang, Sun Hanfei, Suthar Pratik P, Tej Varun, Turner Walker, Xu Kaizhe, Ren Haoxing
- Nlebench+norglm: A Comprehensive Empirical Analysis And Benchmark Dataset For Generative Language Models In Norwegian Liu Peng, Zhang Lemei, Farup Terje Nissen, Lauvrak Even W., Ingvaldsen Jon Espen, Eide Simen, Gulla Jon Atle, Yang Zhirong
- From Zero To Hero: Examining The Power Of Symbolic Tasks In Instruction Tuning Liu Qian, Zhou Fan, Jiang Zhengbao, Dou Longxu, Lin Min
- Bt-adapter: Video Conversation Is Feasible Without Video Instruction Tuning Liu Ruyang, Li Chen, Ge Yixiao, Shan Ying, Li Thomas H., Li Ge
- In-context Vectors: Making In Context Learning More Effective And Controllable Through Latent Space Steering Liu Sheng, Ye Haotian, Xing Lei, Zou James
- Language Models As Black-box Optimizers For Vision-language Models Liu Shihong, Lin Zhiqiu, Yu Samuel, Lee Ryan, Ling Tiffany, Pathak Deepak, Ramanan Deva
- Goat: Fine-tuned Llama Outperforms GPT-4 On Arithmetic Tasks Liu Tiedong, Low Bryan Kian Hsiang
- Aligning Large Language Models With Human Preferences Through Representation Engineering Liu Wenhao, Wang Xiaohua, Wu Muling, Li Tianlong, Lv Changze, Ling Zixuan, Zhu Jianhao, Zhang Cenyuan, Zheng Xiaoqing, Huang Xuanjing
- Fingpt: Democratizing Internet-scale Data For Financial Large Language Models Liu Xiao-yang, Wang Guoxuan, Yang Hongyang, Zha Daochen
- Scaling Laws Of Rope-based Extrapolation Liu Xiaoran, Yan Hang, Zhang Shuo, An Chenxin, Qiu Xipeng, Lin Dahua
- Litcab: Lightweight Language Model Calibration Over Short- And Long-form Responses Liu Xin, Khalifa Muhammad, Wang Lu
- Modeling Uncertainty And Using Post-fusion As Fallback Improves Retrieval Augmented Generation With Llms Liu Ye, Yavuz Semih, Meng Rui, Moorthy Meghana, Joty Shafiq, Xiong Caiming, Zhou Yingbo
- Summary Of Chatgpt-related Research And Perspective Towards The Future Of Large Language Models Liu Yiheng, Han Tianle, Ma Siyuan, Zhang Jiayue, Yang Yuanyuan, Tian Jiaming, He Hao, Li Antong, He Mengshen, Liu Zhengliang, Wu Zihao, Zhao Lin, Zhu Dajiang, Li Xiang, Qiang Ning, Shen Dingang, Liu Tianming, Ge Bao
- Instruction Position Matters In Sequence Generation With Large Language Models Liu Yijin, Zeng Xianfeng, Meng Fandong, Zhou Jie
- On Learning To Summarize With Large Language Models As References Liu Yixin, Shi Kejian, He Katherine S, Ye Longtian, Fabbri Alexander R., Liu Pengfei, Radev Dragomir, Cohan Arman
- SAIE Framework: Support Alone Isn't Enough -- Advancing LLM Training With Adversarial Remarks Loem Mengsay, Kaneko Masahiro, Okazaki Naoaki
- Exploring The Sensitivity Of Llms' Decision-making Capabilities: Insights From Prompt Variation And Hyperparameters Loya Manikanta, Sinha Divya Anand, Futrell Richard
- Llama-reviewer: Advancing Code Review Automation With Large Language Models Through Parameter-efficient Fine-tuning Lu Junyi, Yu Lei, Li Xiaojia, Yang Li, Zuo Chun
- Lyrics: Boosting Fine-grained Language-vision Alignment And Comprehension Via Semantic-aware Visual Objects Lu Junyu, Zhang Dixiang, Zhang Songxin, Xie Zejian, Song Zhuoyang, Lin Cong, Zhang Jiaxing, Jing Bingyi, Zhang Pingjian
- Inference-time Policy Adapters (IPA): Tailoring Extreme-scale Lms Without Fine-tuning Lu Ximing, Brahman Faeze, West Peter, Jang Jaehun, Chandu Khyathi, Ravichander Abhilasha, Qin Lianhui, Ammanabrolu Prithviraj, Jiang Liwei, Ramnath Sahana, Dziri Nouha, Fisher Jillian, Lin Bill Yuchen, Hallinan Skyler, Ren Xiang, Welleck Sean, Choi Yejin
- Illuminating The Black Box: A Psychometric Investigation Into The Multifaceted Nature Of Large Language Models Lu Yang, Yu Jordan, Huang Shou-hsuan Stephen
- UI Layout Generation With Llms Guided By UI Grammar Lu Yuwen, Tong Ziang, Zhao Qinyi, Zhang Chengzhi, Li Toby Jia-jun
- Effectively Fine-tune To Improve Large Multimodal Models For Radiology Report Generation Lu Yuzhe, Hong Sungmin, Shah Yash, Xu Panpan
- Exploring Small Language Models With Prompt-learning Paradigm For Efficient Domain-specific Text Classification Luo Hengyu, Liu Peng, Esping Stefan
- SAIL: Search-augmented Instruction Learning Luo Hongyin, Chuang Yung-sung, Gong Yuan, Zhang Tianhua, Kim Yoon, Wu Xixin, Fox Danny, Meng Helen, Glass James
- Critique Ability Of Large Language Models Luo Liangchen, Lin Zi, Liu Yinxiao, Shu Lei, Zhu Yun, Shang Jingbo, Meng Lei
- Taiyi: A Bilingual Fine-tuned Large Language Model For Diverse Biomedical Tasks Luo Ling, Ning Jinzhong, Zhao Yingwen, Wang Zhijun, Ding Zeyuan, Chen Peng, Fu Weiru, Han Qinyu, Xu Guangtao, Qiu Yunzhi, Pan Dinghao, Li Jiru, Li Hao, Feng Wenduo, Tu Senbo, Liu Yuqi, Yang Zhihao, Wang Jian, Sun Yuanyuan, Lin Hongfei
- Towards Logiglue: A Brief Survey And A Benchmark For Analyzing Logical Reasoning Capabilities Of Language Models Luo Man, Kumbhar Shrinidhi, Shen Ming, Parmar Mihir, Varshney Neeraj, Banerjee Pratyay, Aditya Somak, Baral Chitta
- Recranker: Instruction Tuning Large Language Model As Ranker For Top-k Recommendation Luo Sichun, He Bowei, Zhao Haohan, Shao Wei, Qi Yanlin, Huang Yinya, Zhou Aojun, Yao Yuxuan, Li Zongpeng, Xiao Yuanzhang, Zhan Mingjie, Song Linqi
- YAYI 2: Multilingual Open-source Large Language Models Luo Yin, Kong Qingchao, Xu Nan, Cao Jia, Hao Bao, Qu Baoyu, Chen Bo, Zhu Chao, Zhao Chenyang, Zhang Donglei, Feng Fan, Zhao Feifei, Sun Hailong, Yang Hanxuan, Pan Haojun, Liu Hongyu, Guo Jianbin, Du Jiangtao, Wang Jingyi, Li Junfeng, Sun Lei, Liu Liduo, Dong Lifeng, Liu Lili, Wang Lin, Zhang Liwen, Wang Minzheng, Wang Pin, Yu Ping, Li Qingxiao, Yan Rui, Zou Rui, Li Ruiqun, Huang Taiwen, Wang Xiaodong, Wu Xiaofei, Peng Xin, Zhang Xina, Fang Xing, Xiao Xinglin, Hao Yanni, Dong Yao, Wang Yigang, Liu Ying, Jiang Yongyu, Wang Yungan, Wang Yuqi, Wang Zhangsheng, Yu Zhaoxin, Luo Zhen, Mao Wenji, Wang Lei, Zeng Dajun
- Biomedgpt: Open Multimodal Generative Pre-trained Transformer For Biomedicine Luo Yizhen, Zhang Jiahuan, Fan Siqi, Yang Kai, Wu Yushuai, Qiao Mu, Nie Zaiqing
- Unlocking The Potential Of Large Language Models For Explainable Recommendations Luo Yucong, Cheng Mingyue, Zhang Hao, Lu Junyu, Liu Qi, Chen Enhong
- An Empirical Study Of Catastrophic Forgetting In Large Language Models During Continual Fine-tuning Luo Yun, Yang Zhen, Meng Fandong, Li Yafu, Zhou Jie, Zhang Yue
- Augmented Large Language Models With Parametric Knowledge Guiding Luo Ziyang, Xu Can, Zhao Pu, Geng Xiubo, Tao Chongyang, Ma Jing, Lin Qingwei, Jiang Daxin
- Are We Falling In A Middle-intelligence Trap? An Analysis And Mitigation Of The Reversal Curse Lv Ang, Zhang Kaiyi, Xie Shufang, Tu Quan, Chen Yuhan, Wen Ji-rong, Yan Rui
- Full Parameter Fine-tuning For Large Language Models With Limited Resources Lv Kai, Yang Yuqing, Liu Tengxiao, Gao Qinghui, Guo Qipeng, Qiu Xipeng
- Collie: Collaborative Training Of Large Language Models In An Efficient Way Lv Kai, Zhang Shuo, Gu Tianle, Xing Shuhao, Hong Jiawei, Chen Keyu, Liu Xiaoran, Yang Yuqing, Guo Honglin, Liu Tengxiao, Sun Yu, Guo Qipeng, Yan Hang, Qiu Xipeng
- A Paradigm Shift: The Future Of Machine Translation Lies With Large Language Models Lyu Chenyang, Du Zefeng, Xu Jitao, Duan Yitao, Wu Minghao, Lynn Teresa, Aji Alham Fikri, Wong Derek F., Liu Siyou, Wang Longyue
- Chain-of-skills: A Configurable Model For Open-domain Question Answering Ma Kaixin, Cheng Hao, Zhang Yu, Liu Xiaodong, Nyberg Eric, Gao Jianfeng
- Beyond Chatbots: Explorellm For Structured Thoughts And Personalized Model Responses Ma Xiao, Mishra Swaroop, Liu Ariel, Su Sophie, Chen Jilin, Kulkarni Chinmay, Cheng Heng-tze, Le Quoc, Chi Ed
- Fine-tuning Llama For Multi-stage Text Retrieval Ma Xueguang, Wang Liang, Yang Nan, Wei Furu, Lin Jimmy
- At Which Training Stage Does Code Data Help Llms Reasoning? Ma Yingwei, Liu Yue, Yu Yue, Zhang Yuanliang, Jiang Yu, Wang Changjian, Li Shanshan
- VOLTA: Improving Generative Diversity By Variational Mutual Information Maximizing Autoencoder Ma Yueen, Chi Dafeng, Li Jingjing, Song Kai, Zhuang Yuzheng, King Irwin
- Harnessing The Power Of Prompt-based Techniques For Generating School-level Questions Using Large Language Models Maity Subhankar, Deroy Aniket, Sarkar Sudeshna
- Fine-tuning Language Models With Just Forward Passes Malladi Sadhika, Gao Tianyu, Nichani Eshaan, Damian Alex, Lee Jason D., Chen Danqi, Arora Sanjeev
- Aligning Large Language Models For Clinical Tasks Manathunga Supun, Hettigoda Isuru
- ICL Markup: Structuring In-context Learning Using Soft-token Tags Marc-etienne Brunet, Ashton Anderson, Richard Zemel
- Interpreting Learned Feedback Patterns In Large Language Models Marks Luke, Abdullah Amir, Neo Clement, Arike Rauno, Krueger David, Torr Philip, Barez Fazl
- Chatgpt Vs State-of-the-art Models: A Benchmarking Study In Keyphrase Generation Task Martínez-cruz Roberto, López-lópez Alvaro J., Portela José
- A Large Language Model-assisted Education Tool To Provide Feedback On Open-ended Responses Matelsky Jordan K., Parodi Felipe, Liu Tony, Lange Richard D., Kording Konrad P.
- Enhancing LLM Intelligence With ARM-RAG: Auxiliary Rationale Memory For Retrieval Augmented Generation Melz Eric
- Representation Deficiency In Masked Language Modeling Meng Yu, Krishnan Jitin, Wang Sinong, Wang Qifan, Mao Yuning, Fang Han, Ghazvininejad Marjan, Han Jiawei, Zettlemoyer Luke
- Self-checker: Plug-and-play Modules For Fact-checking With Large Language Models Miaoran Li, Baolin Peng, Michel Galley, Jianfeng Gao, Zhu Zhang
- Uzh_clyp At Semeval-2023 Task 9: Head-first Fine-tuning And Chatgpt Data Generation For Cross-lingual Learning In Tweet Intimacy Prediction Michail Andrianos, Konstantinou Stefanos, Clematide Simon
- PEFTT: Parameter-efficient Fine-tuning For Low-resource Tibetan Pre-trained Language Models Mingjun Zhou, Zhuoma Daiqing, Nuo Qun, Tashi Nyima
- PRE: Vision-language Prompt Learning With Reparameterization Encoder Minh Anh Pham Thi, Nguyen An Duc, Tzimiropoulos Georgios
- Synthetic Imitation Edit Feedback For Factual Alignment In Clinical Summarization Mishra Prakamya, Yao Zonghai, Chen Shuwei, Wang Beining, Mittal Rohan, Yu Hong
- An Emulator For Fine-tuning Large Language Models Using Small Language Models Mitchell Eric, Rafailov Rafael, Sharma Archit, Finn Chelsea, Manning Christopher D.
- Compositional Chain-of-thought Prompting For Large Multimodal Models Mitra Chancharik, Huang Brandon, Darrell Trevor, Herzig Roei
- How Trustworthy Are Open-source Llms? An Assessment Under Malicious Demonstrations Shows Their Vulnerabilities Mo Lingbo, Wang Boshi, Chen Muhao, Sun Huan
- Chatgpt In The Age Of Generative AI And Large Language Models: A Concise Survey Mohamadi Salman, Mujtaba Ghulam, Le Ngan, Doretto Gianfranco, Adjeroh Donald A.
- Landmark Attention: Random-access Infinite Context Length For Transformers Mohtashami Amirkeivan, Jaggi Martin
- Llms Killed The Script Kiddie: How Agents Supported By Large Language Models Change The Landscape Of Network Threat Testing Moskal Stephen, Laney Sam, Hemberg Erik, O'reilly Una-may
- Adaptive Machine Translation With Large Language Models Moslem Yasmin, Haque Rejwanul, Kelleher John D., Way Andy
- Fine-tuning Large Language Models For Adaptive Machine Translation Moslem Yasmin, Haque Rejwanul, Way Andy
- Can Llms Facilitate Interpretation Of Pre-trained Language Models? Mousi Basel, Durrani Nadir, Dalvi Fahim
- Gradient-based Automated Iterative Recovery For Parameter-efficient Tuning Mozes Maximilian, Bolukbasi Tolga, Yuan Ann, Liu Frederick, Thain Nithum, Dixon Lucas
- Can Llms Follow Simple Rules? Mu Norman, Chen Sarah, Wang Zifan, Chen Sizhe, Karamardian David, Aljeraisy Lulwa, Alomair Basel, Hendrycks Dan, Wagner David
- How To Plant Trees In Language Models: Data And Architectural Effects On The Emergence Of Syntactic Inductive Biases Mueller Aaron, Linzen Tal
- Assessing Translation Capabilities Of Large Language Models Involving English And Indian Languages Mujadia Vandan, Urlana Ashok, Bhaskar Yash, Pavani Penumalla Aditya, Shravya Kukkapalli, Krishnamurthy Parameswari, Sharma Dipti Misra
- Superhf: Supervised Iterative Learning From Human Feedback Mukobi Gabriel, Chatain Peter, Fong Su, Windesheim Robert, Kutyniok Gitta, Bhatia Kush, Alberti Silas
- A Brief History Of Prompt: Leveraging Language Models. (through Advanced Prompting) Muktadir Golam Md
- LLM4VV: Developing Llm-driven Testsuite For Compiler Validation Munley Christian, Jarmusch Aaron, Chandrasekaran Sunita
- Ai-assisted Code Authoring At Scale: Fine-tuning, Deploying, And Mixed Methods Evaluation Murali Vijayaraghavan, Maddila Chandra, Ahmad Imad, Bolin Michael, Cheng Daniel, Ghorbani Negar, Fernandez Renuka, Nagappan Nachiappan, Rigby Peter C.
- Teaching Probabilistic Logical Reasoning To Transformers Nafar Aliakbar, Venable Kristen Brent, Kordjamshidi Parisa
- Towards Probing Contact Center Large Language Models Nathan Varun, Kumar Ayush, Ingle Digvijay, Vepa Jithendra
- A Comprehensive Overview Of Large Language Models Naveed Humza, Khan Asad Ullah, Qiu Shi, Saqib Muhammad, Anwar Saeed, Usman Muhammad, Akhtar Naveed, Barnes Nick, Mian Ajmal
- Nanot5: A Pytorch Framework For Pre-training And Fine-tuning T5-style Models With Limited Resources Nawrot Piotr
- Large Language Models In Healthcare And Medical Domain: A Review Nazi Zabir Al, Peng Wei
- Simplyretrieve: A Private And Lightweight Retrieval-centric Generative AI Tool Ng Youyang, Miyashita Daisuke, Hoshi Yasuto, Morioka Yasuhiro, Torii Osamu, Kodama Tomoya, Deguchi Jun
- Multi-dimensional Data Refining Strategy For Effective Fine-tuning Llms Ngoc Thanh Nguyen, Tran Quang Nhat, Tang Arthur, Nguyen Bao, Nguyen Thuy, Pham Thanh
- Democratizing Llms For Low-resource Languages By Leveraging Their English Dominant Abilities With Linguistically-diverse Prompts Nguyen Xuan-phi, Aljunied Sharifah Mahani, Joty Shafiq, Bing Lidong
- Forgetting Before Learning: Utilizing Parametric Arithmetic For Knowledge Updating In Large Language Models Ni Shiwen, Chen Dingwei, Li Chengming, Hu Xiping, Xu Ruifeng, Yang Min
- Codegen2: Lessons For Training Llms On Programming And Natural Languages Nijkamp Erik, Hayashi Hiroaki, Xiong Caiming, Savarese Silvio, Zhou Yingbo
- Reflexion: Language Agents With Verbal Reinforcement Learning Noah Shinn, Federico Cassano, Edward Berman, Ashwin Gopinath, Karthik Narasimhan, Shunyu Yao
- Evaluating AI Vocational Skills Through Professional Testing Noever David, Ciolino Matt
- Numeracy From Literacy: Data Science As An Emergent Skill From Large Language Models Noever David, Mckee Forrest
- Adversarial Fine-tuning Of Language Models: An Iterative Optimisation Approach For The Generation And Detection Of Problematic Content O'neill Charles, Miller Jack, Ciuca Ioana, Ting Yuan-sen, Bui Thang
- In-contextual Gender Bias Suppression For Large Language Models Oba Daisuke, Kaneko Masahiro, Bollegala Danushka
- Blackvip: Black-box Visual Prompting For Robust Transfer Learning Oh Changdae, Hwang Hyeji, Lee Hee-young, Lim Yongtaek, Jung Geunyoung, Jung Jiyoung, Choi Hosik, Song Kyungwoo
- Fine-tuning Or Retrieval? Comparing Knowledge Injection In Llms Ovadia Oded, Brief Menachem, Mishaeli Moshik, Elisha Oren
- Understanding The Capabilities Of Large Language Models For Automated Planning Pallagani Vishal, Muppasani Bharath, Murugesan Keerthiram, Rossi Francesca, Srivastava Biplav, Horesh Lior, Fabiano Francesco, Loreggia Andrea
- Stelocoder: A Decoder-only LLM For Multi-language To Python Code Translation Pan Jialing, Sadé Adrien, Kim Jin, Soriano Eric, Sole Guillem, Flamant Sylvain
- COSMIC: Data Efficient Instruction-tuning For Speech In-context Learning Pan Jing, Wu Jian, Gaur Yashesh, Sivasankaran Sunit, Chen Zhuo, Liu Shujie, Li Jinyu
- Retrieving-to-answer: Zero-shot Video Question Answering With Frozen Large Language Models Pan Junting, Lin Ziyi, Ge Yuying, Zhu Xiatian, Zhang Renrui, Wang Yi, Qiao Yu, Li Hongsheng
- Let's Reinforce Step By Step Pan Sarah, Lialin Vladislav, Muckatira Sherin, Rumshisky Anna
- X-instructblip: A Framework For Aligning X-modal Instruction-aware Representations To Llms And Emergent Cross-modal Reasoning Panagopoulou Artemis, Xue Le, Yu Ning, Li Junnan, Li Dongxu, Joty Shafiq, Xu Ran, Savarese Silvio, Xiong Caiming, Niebles Juan Carlos
- Peft-medaware: Large Language Model For Medical Awareness Pandya Keivalya
- Automating Customer Service Using Langchain: Building Custom Open-source GPT Chatbot For Organizations Pandya Keivalya, Holia Mehfuza
- Language Model Self-improvement By Reinforcement Learning Contemplation Pang Jing-cheng, Wang Pengyuan, Li Kaiyuan, Chen Xiong-hui, Xu Jiacheng, Zhang Zongzhang, Yu Yang
- Trainable Transformer In Transformer Panigrahi Abhishek, Malladi Sadhika, Xia Mengzhou, Arora Sanjeev
- On The Analysis Of Cross-lingual Prompt Tuning For Decoder-based Multilingual Model Park Nohil, Park Joonsuk, Yoo Kang Min, Yoon Sungroh
- 1.5 Million Materials Narratives Generated By Chatbots Park Yang Jeong, Jerng Sung Eun, Park Jin-sung, Kwon Choah, Hsu Chia-wei, Ren Zhichu, Yoon Sungroh, Li Ju
- A Large Language Model Approach To Educational Survey Feedback Analysis Parker Michael J., Anderson Caitlin, Stone Claire, Oh Yearim
- Entity Matching Using Large Language Models Peeters Ralph, Bizer Christian
- Evaluation Of Medium-large Language Models At Zero-shot Closed Book Generative Question Answering Peinl René, Wirth Johannes
- Exploiting Novel GPT-4 Apis Pelrine Kellin, Taufeeque Mohammad, Zając Michał, Mclean Euan, Gleave Adam
- Yarn: Efficient Context Window Extension Of Large Language Models Peng Bowen, Quesnelle Jeffrey, Fan Honglu, Shippole Enrico
- When Does In-context Learning Fall Short And Why? A Study On Specification-heavy Tasks Peng Hao, Wang Xiaozhi, Chen Jianhui, Li Weikai, Qi Yunjia, Wang Zimu, Wu Zhili, Zeng Kaisheng, Xu Bin, Hou Lei, Li Juanzi
- FP8-LM: Training FP8 Large Language Models Peng Houwen, Wu Kan, Wei Yixuan, Zhao Guoshuai, Yang Yuxiang, Liu Ze, Xiong Yifan, Yang Ziyue, Ni Bolin, Hu Jingcheng, Li Ruihang, Zhang Miaosen, Li Chen, Ning Jia, Wang Ruizhe, Zhang Zheng, Liu Shuguang, Chau Joe, Hu Han, Cheng Peng
- Soft Prompt Tuning For Augmenting Dense Retrieval With Large Language Models Peng Zhiyuan, Wu Xuyang, Wang Qifan, Fang Yi
- Constitutionmaker: Interactively Critiquing Large Language Models By Converting Feedback Into Principles Petridis Savvas, Wedin Ben, Wexler James, Donsbach Aaron, Pushkarna Mahima, Goyal Nitesh, Cai Carrie J., Terry Michael
- When Do Prompting And Prefix-tuning Work? A Theory Of Capabilities And Limitations Petrov Aleksandar, Torr Philip H. S., Bibi Adel
- The Impact Of Depth On Compositional Generalization In Transformer Language Models Petty Jackson, Van Steenkiste Sjoerd, Dasgupta Ishita, Sha Fei, Garrette Dan, Linzen Tal
- Let Models Speak Ciphers: Multiagent Debate Through Embeddings Pham Chau, Liu Boyi, Yang Yingxiang, Chen Zhengyu, Liu Tianyi, Yuan Jianbo, Plummer Bryan A., Wang Zhaoran, Yang Hongxia
- Training Chain-of-thought Via Latent-variable Inference Phan Du, Hoffman Matthew D., Dohan David, Douglas Sholto, Le Tuan Anh, Parisi Aaron, Sountsov Pavel, Sutton Charles, Vikram Sharad, Saurous Rif A.
- LLM Self Defense: By Self Examination, Llms Know They Are Being Tricked Phute Mansi, Helbling Alec, Hull Matthew, Peng Shengyun, Szyller Sebastian, Cornelius Cory, Chau Duen Horng
- Typhoon: Thai Large Language Models Pipatanakul Kunat, Jirabovonvisut Phatrasek, Manakul Potsawee, Sripaisarnmongkol Sittipong, Patomwong Ruangsak, Chokchainant Pathomporn, Tharnpipitchai Kasima
- Painter: Teaching Auto-regressive Language Models To Draw Sketches Pourreza Reza, Bhattacharyya Apratim, Panchal Sunny, Lee Mingu, Madan Pulkit, Memisevic Roland
- Egovlpv2: Egocentric Video-language Pre-training With Fusion In The Backbone Pramanick Shraman, Song Yale, Nag Sayan, Lin Kevin Qinghong, Shah Hardik, Shou Mike Zheng, Chellappa Rama, Zhang Pengchuan
- The Robots Are Here: Navigating The Generative AI Revolution In Computing Education Prather James, Denny Paul, Leinonen Juho, Becker Brett A., Albluwi Ibrahim, Craig Michelle, Keuning Hieke, Kiesler Natalie, Kohn Tobias, Luxton-reilly Andrew, Macneil Stephen, Peterson Andrew, Pettit Raymond, Reeves Brent N., Savelka Jaromir
- Fine-tuning Aligned Language Models Compromises Safety, Even When Users Do Not Intend To! Qi Xiangyu, Zeng Yi, Xie Tinghao, Chen Pin-yu, Jia Ruoxi, Mittal Prateek, Henderson Peter
- PILLOW: Enhancing Efficient Instruction Fine-tuning Via Prompt Matching Qi Zhenting, Tan Xiaoyu, Shi Shaojie, Qu Chao, Xu Yinghui, Qi Yuan
- How Good Is Google Bard's Visual Understanding? An Empirical Study On Open Challenges Qin Haotong, Ji Ge-peng, Khan Salman, Fan Deng-ping, Khan Fahad Shahbaz, Van Gool Luc
- Enabling On-device Large Language Model Personalization With Self-supervised Data Selection And Synthesis Qin Ruiyang, Xia Jun, Jia Zhenge, Jiang Meng, Abbasi Ahmed, Zhou Peipei, Hu Jingtong, Shi Yiyu
- Toolllm: Facilitating Large Language Models To Master 16000+ Real-world Apis Qin Yujia, Liang Shihao, Ye Yining, Zhu Kunlun, Yan Lan, Lu Yaxi, Lin Yankai, Cong Xin, Tang Xiangru, Qian Bill, Zhao Sihan, Hong Lauren, Tian Runchu, Xie Ruobing, Zhou Jie, Gerstein Mark, Li Dahai, Liu Zhiyuan, Sun Maosong
- Federated Full-parameter Tuning Of Billion-sized Language Models With Communication Cost Under 18 Kilobytes Qin Zhen, Chen Daoyuan, Qian Bingchen, Ding Bolin, Li Yaliang, Deng Shuiguang
- Mplug-owl: Modularization Empowers Large Language Models With Multimodality Qinghao Ye, Haiyang Xu, Guohai Xu, Jiabo Ye, Ming Yan, Yiyang Zhou, Junyang Wang, Anwen Hu, Pengcheng Shi, Yaya Shi, Chenliang Li, Yuanhong Xu, Hehong Chen, Junfeng Tian, Qi Qian, Ji Zhang, Fei Huang, Jingren Zhou
- Adalora: Adaptive Budget Allocation For Parameter-efficient Fine-tuning Qingru Zhang, Minshuo Chen, Alexander Bukharin, Nikos Karampatziakis, Pengcheng He, Yu Cheng, Weizhu Chen, Tuo Zhao
- Embodied Executable Policy Learning With Language-based Scene Summarization Qiu Jielin, Xu Mengdi, Han William, Moon Seungwhan, Zhao Ding
- Unlocking Emergent Modularity In Large Language Models Qiu Zihan, Huang Zeyu, Fu Jie
- Cmed-gpt: Prompt Tuning For Entity-aware Chinese Medical Dialogue Generation Qu Zhijie, Li Juan, Ma Zerui, Li Jianqiang
- Direct Preference Optimization: Your Language Model Is Secretly A Reward Model Rafael Rafailov, Archit Sharma, Eric Mitchell, Stefano Ermon, Christopher D. Manning, Chelsea Finn
- Controllable Generation Of Dialogue Acts For Dialogue Systems Via Few-shot Response Generation And Ranking Ramirez Angela, Agarwal Karik, Juraska Juraj, Garg Utkarsh, Walker Marilyn A.
- Tailoring Self-rationalizers With Multi-reward Distillation Ramnath Sahana, Joshi Brihi, Hallinan Skyler, Lu Ximing, Li Liunian Harold, Chan Aaron, Hessel Jack, Choi Yejin, Ren Xiang
- Tricking Llms Into Disobedience: Formalizing, Analyzing, And Detecting Jailbreaks Rao Abhinav, Vashistha Sachin, Naik Atharva, Aditya Somak, Choudhury Monojit
- Weakly Supervised Detection Of Hallucinations In LLM Activations Rateike Miriam, Cintas Celia, Wamburu John, Akumu Tanya, Speakman Skyler
- Exploring The Relationship Between LLM Hallucinations And Prompt Linguistic Nuances: Readability, Formality, And Concreteness Rawte Vipula, Priya Prachi, Tonmoy S. M Towhidul Islam, Zaman S M Mehedi, Sheth Amit, Das Amitava
- COLA: A Benchmark For Compositional Text-to-image Retrieval Ray Arijit, Radenovic Filip, Dubey Abhimanyu, Plummer Bryan A., Krishna Ranjay, Saenko Kate
- Dial Beinfo For Faithfulness: Improving Factuality Of Information-seeking Dialogue Via Behavioural Fine-tuning Razumovskaia Evgeniia, Vulić Ivan, Marković Pavle, Cichy Tomasz, Zheng Qian, Wen Tsung-hsien, Budzianowski Paweł
- Fine-tuning Language Models For Context-specific SQL Query Generation Rebei Amine
- Timechat: A Time-sensitive Multimodal Large Language Model For Long Video Understanding Ren Shuhuai, Yao Linli, Li Shicheng, Sun Xu, Hou Lu
- I'm Afraid I Can't Do That: Predicting Prompt Refusal In Black-box Generative Language Models Reuter Max, Schulze William
- Sparq Attention: Bandwidth-efficient LLM Inference Ribar Luka, Chelombiev Ivan, Hudlass-galley Luke, Blake Charlie, Luschi Carlo, Orr Douglas
- Synergistic Integration Of Large Language Models And Cognitive Architectures For Robust AI: An Exploratory Analysis Romero Oscar J., Zimmerman John, Steinfeld Aaron, Tomasic Anthony
- Carexpert: Leveraging Large Language Models For In-car Conversational Question Answering Rony Md Rashad Al Hasan, Suess Christian, Bhat Sinchana Ramakanth, Sudhi Viju, Schneider Julia, Vogel Maximilian, Teucher Roman, Friedl Ken E., Sahoo Soumya
- LLM As An Art Director (ladi): Using Llms To Improve Text-to-media Generators Roush Allen, Zakirov Emil, Shirokov Artemiy, Lunina Polina, Gane Jack, Duffy Alexander, Basil Charlie, Whitcomb Aber, Benedetto Jim, Dewolfe Chris
- Outlier Dimensions Encode Task-specific Knowledge Rudman William, Chen Catherine, Eickhoff Carsten
- Gpt4tools: Teaching Large Language Model To Use Tools Via Self-instruction Rui Yang, Lin Song, Yanwei Li, Sijie Zhao, Yixiao Ge, Xiu Li, Ying Shan
- Uncertainty Guided Global Memory Improves Multi-hop Question Answering Sagirova Alsu, Burtsev Mikhail
- In-context Impersonation Reveals Large Language Models' Strengths And Biases Salewski Leonard, Alaniz Stephan, Rio-torto Isabel, Schulz Eric, Akata Zeynep
- Efficient RLHF: Reducing The Memory Usage Of PPO Santacroce Michael, Lu Yadong, Yu Han, Li Yuanzhi, Shen Yelong
- Camoscio: An Italian Instruction-tuned Llama Santilli Andrea, Rodolà Emanuele
- Testing The General Deductive Reasoning Capacity Of Large Language Models Using OOD Examples Saparov Abulhair, Pang Richard Yuanzhe, Padmakumar Vishakh, Joshi Nitish, Kazemi Seyed Mehran, Kim Najoung, He He
- Interprompt: Interpretable Prompting For Interrelated Interpersonal Risk Factors In Reddit Posts Sathvik Msvpj, Sarkar Surjodeep, Saxena Chandni, Sohn Sunghwan, Garg Muskan
- GIFT: Generative Interpretable Fine-tuning Savadikar Chinmay, Song Xi, Wu Tianfu
- The Cot Collection: Improving Zero-shot And Few-shot Learning Of Language Models Via Chain-of-thought Fine-tuning Seungone Kim, Se June Joo, Doyoung Kim, Joel Jang, Seonghyeon Ye, Jamin Shin, Minjoon Seo
- PB-LLM: Partially Binarized Large Language Models Shang Yuzhang, Yuan Zhihang, Wu Qiang, Dong Zhen
- Democratizing Llms: An Exploration Of Cost-performance Trade-offs In Self-refined Open-source Models Shashidhar Sumuk, Chinta Abhinav, Sahai Vaibhav, Wang Zhenhailong, Ji Heng
- Exploring The Factual Consistency In Dialogue Comprehension Of Large Language Models She Shuaijie, Huang Shujian, Wang Xingyun, Zhou Yanke, Chen Jiajun
- Pangu-coder2: Boosting Large Language Models For Code With Ranking Feedback Shen Bo, Zhang Jiaxin, Chen Taihong, Zan Daoguang, Geng Bing, Fu An, Zeng Muhan, Yu Ailun, Ji Jichuan, Zhao Jingyang, Guo Yuenan, Wang Qianxiang
- Large Language Model Alignment: A Survey Shen Tianhao, Jin Renren, Huang Yufei, Liu Chuang, Dong Weilong, Guo Zishan, Wu Xinwei, Liu Yan, Xiong Deyi
- Why Does Chatgpt Fall Short In Providing Truthful Answers? Shen Zheng, Jie Huang, Kevin Chen-chuan Chang
- TOAST: Transfer Learning Via Attention Steering Shi Baifeng, Gai Siyu, Darrell Trevor, Wang Xin
- Sotana: The Open-source Software Development Assistant Shi Ensheng, Zhang Fengji, Wang Yanlin, Chen Bei, Du Lun, Zhang Hongyu, Han Shi, Zhang Dongmei, Sun Hongbin
- Badgpt: Exploring Security Vulnerabilities Of Chatgpt Via Backdoor Attacks To Instructgpt Shi Jiawen, Liu Yixin, Zhou Pan, Sun Lichao
- Unleashing The Power Of Pre-trained Language Models For Offline Reinforcement Learning Shi Ruizhe, Liu Yuyao, Ze Yanjie, Du Simon S., Xu Huazhe
- Preliminary Study On Incremental Learning For Large Language Model-based Recommender Systems Shi Tianhao, Zhang Yang, Xu Zhijian, Chen Chong, Feng Fuli, He Xiangnan, Tian Qi
- MKRAG: Medical Knowledge Retrieval Augmented Generation For Medical Question Answering Shi Yucheng, Xu Shaochen, Yang Tianze, Liu Zhengliang, Liu Tianming, Li Quanzheng, Li Xiang, Liu Ninghao
- Dept: Decomposed Prompt Tuning For Parameter-efficient Fine-tuning Shi Zhengxiang, Lipani Aldo
- Reasoning With Language Model Is Planning With World Model Shibo Hao, Yi Gu, Haodi Ma, Joshua Jiahua Hong, Zhen Wang, Daisy Zhe Wang, Zhiting Hu
- L3 Ensembles: Lifelong Learning Approach For Ensemble Of Foundational Language Models Shiri Aidin, Roy Kaushik, Sheth Amit, Gaur Manas
- Introducing Dictalm -- A Large Generative Language Model For Modern Hebrew Shmidman Shaltiel, Shmidman Avi, Cohen Amir David Nissan, Koppel Moshe
- SCREWS: A Modular Framework For Reasoning With Revisions Shridhar Kumar, Jhamtani Harsh, Fang Hao, Van Durme Benjamin, Eisner Jason, Xia Patrick
- The ART Of LLM Refinement: Ask, Refine, And Trust Shridhar Kumar, Sinha Koustuv, Cohen Andrew, Wang Tianlu, Yu Ping, Pasunuru Ram, Sachan Mrinmaya, Weston Jason, Celikyilmaz Asli
- Woodpecker: Hallucination Correction For Multimodal Large Language Models Shukang Yin, Chaoyou Fu, Sirui Zhao, Tong Xu, Hao Wang, Dianbo Sui, Yunhang Shen, Ke Li, Xing Sun, Enhong Chen
- Tree Of Thoughts: Deliberate Problem Solving With Large Language Models Shunyu Yao, Dian Yu, Jeffrey Zhao, Izhak Shafran, Thomas L. Griffiths, Yuan Cao, Karthik Narasimhan
- An Empirical Study Of Instruction-tuning Large Language Models In Chinese Si Qingyi, Wang Tong, Lin Zheng, Zhang Xu, Cao Yanan, Wang Weiping
- Large Language Models For Aspect-based Sentiment Analysis Simmering Paul F., Huoviala Paavo
- SPARSEFIT: Few-shot Prompting With Sparse Fine-tuning For Jointly Generating Predictions And Natural Language Explanations Solano Jesus, Sanni Mardhiyah, Camburu Oana-maria, Minervini Pasquale
- Biomedical Knowledge Graph-optimized Prompt Generation For Large Language Models Soman Karthik, Rose Peter W, Morris John H, Akbas Rabia E, Smith Brett, Peetoom Braian, Villouta-reyes Catalina, Cerono Gabriel, Shi Yongmei, Rizk-jackson Angela, Israni Sharat, Nelson Charlotte A, Huang Sui, Baranzini Sergio E
- Conpet: Continual Parameter-efficient Tuning For Large Language Models Song Chenyang, Han Xu, Zeng Zheni, Li Kuai, Chen Chen, Liu Zhiyuan, Sun Maosong, Yang Tao
- Preference Ranking Optimization For Human Alignment Song Feifan, Yu Bowen, Li Minghao, Yu Haiyang, Huang Fei, Li Yongbin, Wang Houfeng
- How To Bridge The Gap Between Modalities: A Comprehensive Survey On Multimodal Large Language Model Song Shezheng, Li Xiaopeng, Li Shasha, Zhao Shan, Yu Jie, Ma Jun, Mao Xiaoguang, Zhang Weimin
- Large Language Models Meet Open-world Intent Discovery And Recognition: An Evaluation Of Chatgpt Song Xiaoshuai, He Keqing, Wang Pei, Dong Guanting, Mou Yutao, Wang Jingang, Xian Yunsen, Cai Xunliang, Xu Weiran
- Learning Ui-to-code Reverse Generator Using Visual Critic Without Rendering Soselia Davit, Saifullah Khalid, Zhou Tianyi
- Dynamic Context Pruning For Efficient And Interpretable Autoregressive Transformers Sotiris Anagnostidis, Dario Pavllo, Luca Biggio, Lorenzo Noci, Aurelien Lucchi, Thomas Hofmann
- Structured Packing In LLM Training Improves Long Context Utilization Staniszewski Konrad, Tworkowski Szymon, Jaszczur Sebastian, Zhao Yu, Michalewski Henryk, Kuciński Łukasz, Miłoś Piotr
- Learning From Red Teaming: Gender Bias Provocation And Mitigation In Large Language Models Su Hsuan, Cheng Cheng-chu, Farn Hua, Kumar Shachi H, Sahay Saurav, Chen Shang-tse, Lee Hung-yi
- F3-pruning: A Training-free And Generalized Pruning Strategy Towards Faster And Finer Text-to-video Synthesis Su Sitong, Liu Jianzhi, Gao Lianli, Song Jingkuan
- Dial-mae: Contextual Masked Auto-encoder For Retrieval-based Dialogue Systems Su Zhenpeng, Wu Xing, Zhou Wei, Ma Guangyuan, Hu Songlin
- Language Model-in-the-loop: Data Optimal Approach To Learn-to-recommend Actions In Text Games Sudhakar Arjun Vaithilingam, Parthasarathi Prasanna, Rajendran Janarthanan, Chandar Sarath
- Luminate: Structured Generation And Exploration Of Design Space With Large Language Models For Human-ai Co-creation Suh Sangho, Chen Meng, Min Bryan, Li Toby Jia-jun, Xia Haijun
- Jmedlora: Medical Domain Adaptation On Japanese Large Language Models Using Instruction-tuning Sukeda Issey, Suzuki Masahiro, Sakaji Hiroki, Kodera Satoshi
- Does Fine-tuning GPT-3 With The Openai API Leak Personally-identifiable Information? Sun Albert Yu, Zemour Eliott, Saxena Arushi, Vaidyanathan Udith, Lin Eric, Lau Christian, Mugunthan Vaikkunth
- Speech-based Slot Filling Using Large Language Models Sun Guangzhi, Feng Shutong, Jiang Dongcheng, Zhang Chao, Gašić Milica, Woodland Philip C.
- Reinforcement Learning In The Era Of Llms: What Is Essential? What Is Needed? An RL Perspective On RLHF, Prompting, And Beyond Sun Hao
- Evaluating The Zero-shot Robustness Of Instruction-tuned Language Models Sun Jiuding, Shaib Chantal, Wallace Byron C.
- Battle Of The Large Language Models: Dolly Vs Llama Vs Vicuna Vs Guanaco Vs Bard Vs Chatgpt -- A Text-to-sql Parsing Comparison Sun Shuo, Zhang Yuchen, Yan Jiahuan, Gao Yuze, Ong Donovan, Chen Bin, Su Jian
- Exploring The Impact Of Low-rank Adaptation On The Performance, Efficiency, And Regularization Of RLHF Sun Simeng, Gupta Dhawal, Iyyer Mohit
- A Prompt Learning Framework For Source Code Summarization Sun Weisong, Fang Chunrong, You Yudu, Chen Yuchen, Liu Yi, Wang Chong, Zhang Jian, Zhang Quanjun, Qian Hanwei, Zhao Wei, Liu Yang, Chen Zhenyu
- A Comparative Study Between Full-parameter And Lora-based Fine-tuning On Chinese Instruction Data For Instruction Following Large Language Model Sun Xianghui, Ji Yunjie, Ma Baochang, Li Xiangang
- Uncovering Chatgpt's Capabilities In Recommender Systems Sunhao Dai, Ninglu Shao, Haiyuan Zhao, Weijie Yu, Zihua Si, Chen Xu, Zhongxiang Sun, Xiao Zhang, Jun Xu
- From Base To Conversational: Japanese Instruction Dataset And Tuning Large Language Models Suzuki Masahiro, Hirano Masanori, Sakaji Hiroki
- Improving Non-autoregressive Translation Quality With Pretrained Language Model, Embedding Distillation And Upsampling Strategy For CTC Syu Shen-sian, Xie Juncheng, Lee Hung-yi
- Sparks Of Artificial General Intelligence: Early Experiments With GPT-4 Sébastien Bubeck, Varun Chandrasekaran, Ronen Eldan, Johannes Gehrke, Eric Horvitz, Ece Kamar, Peter Lee, Yin Tat Lee, Yuanzhi Li, Scott Lundberg, Harsha Nori, Hamid Palangi, Marco Tulio Ribeiro, Yi Zhang
- Imaginations Of WALL-E: Reconstructing Experiences With An Imagination-inspired Module For Advanced AI Systems Taghavi Zeinab Sadat, Gooran Soroush, Dalili Seyed Arshan, Amirzadeh Hamidreza, Nematbakhsh Mohammad Jalal, Sameti Hossein
- Training Generative Question-answering On Synthetic Data Obtained From An Instruct-tuned Model Takahashi Kosuke, Omi Takahiro, Arima Kosuke, Ishigaki Tatsuya
- Redcoast: A Lightweight Tool To Automate Distributed Training Of Llms On Any Gpu/tpus Tan Bowen, Zhu Yun, Liu Lijuan, Wang Hongyi, Zhuang Yonghao, Chen Jindong, Xing Eric, Hu Zhiting
- Medchatzh: A Better Medical Adviser Learns From Better Instructions Tan Yang, Li Mingchen, Huang Zijie, Yu Huiqun, Fan Guisheng
- Struc-bench: Are Large Language Models Really Good At Generating Complex Structured Data? Tang Xiangru, Zong Yiming, Phang Jason, Zhao Yilun, Zhou Wangchunshu, Cohan Arman, Gerstein Mark
- POUF: Prompt-oriented Unsupervised Fine-tuning For Large Pre-trained Models Tanwisuth Korawat, Zhang Shujian, Zheng Huangjie, He Pengcheng, Zhou Mingyuan
- Webwise: Web Interface Control And Sequential Exploration With Large Language Models Tao Heyi, T Sethuraman V, Shlapentokh-rothman Michal, Hoiem Derek
- Demystifying Embedding Spaces Using Large Language Models Tennenholtz Guy, Chow Yinlam, Hsu Chih-wei, Jeong Jihwan, Shani Lior, Tulepbergenov Azamat, Ramachandran Deepak, Mladenov Martin, Boutilier Craig
- CLAMP: Contrastive Language Model Prompt-tuning Teterwak Piotr, Sun Ximeng, Plummer Bryan A., Saenko Kate, Lim Ser-nam
- Leveraging Llms For Synthesizing Training Data Across Many Languages In Multilingual Dense Retrieval Thakur Nandan, Ni Jianmo, Ábrego Gustavo Hernández, Wieting John, Lin Jimmy, Cer Daniel
- SPDF: Sparse Pre-training And Dense Fine-tuning For Large Language Models Thangarasa Vithursan, Gupta Abhay, Marshall William, Li Tianda, Leong Kevin, Decoste Dennis, Lie Sean, Saxena Shreyas
- Chimed-gpt: A Chinese Medical Large Language Model With Full Training Regime And Better Alignment To Human Preferences Tian Yuanhe, Gan Ruyi, Song Yan, Zhang Jiaxing, Zhang Yongdong
- BOOST: Harnessing Black-box Control To Boost Commonsense In Lms' Generation Tian Yufei, Zhang Felix, Peng Nanyun
- Impact Of Tokenization On Llama Russian Adaptation Tikhomirov Mikhail, Chernyshev Daniil
- Qlora: Efficient Finetuning Of Quantized Llms Tim Dettmers, Artidoro Pagnoni, Ari Holtzman, Luke Zettlemoyer
- Fine-tune Language Models To Approximate Unbiased In-context Learning Timothy Chu, Zhao Song, Chiwun Yang
- Better Language Models Of Code Through Self-improvement To Hung Quoc, Bui Nghi D. Q., Guo Jin, Nguyen Tien N.
- Clinical Camel: An Open Expert-level Medical Language Model With Dialogue-based Knowledge Encoding Toma Augustin, Lawler Patrick R., Ba Jimmy, Krishnan Rahul G., Rubin Barry B., Wang Bo
- Parameter Efficient Tuning Allows Scalable Personalization Of Llms For Text Entry: A Case Study On Abbreviation Expansion Tomanek Katrin, Cai Shanqing, Venugopalan Subhashini
- Bioinstruct: Instruction Tuning Of Large Language Models For Biomedical Natural Language Processing Tran Hieu, Yang Zhichao, Yao Zonghai, Yu Hong
- Sight Beyond Text: Multi-modal Training Enhances Llms In Truthfulness And Ethics Tu Haoqin, Zhao Bingchen, Wei Chen, Xie Cihang
- Efficiently Aligned Cross-lingual Transfer Learning For Conversational Tasks Using Prompt-tuning Tu Lifu, Qu Jin, Yavuz Semih, Joty Shafiq, Liu Wenhao, Xiong Caiming, Zhou Yingbo
- Automating Code-related Tasks Through Transformers: The Impact Of Pre-training Tufano Rosalia, Pascarella Luca, Bavota Gabriele
- Focused Transformer: Contrastive Training For Context Scaling Tworkowski Szymon, Staniszewski Konrad, Pacek Mikołaj, Wu Yuhuai, Michalewski Henryk, Miłoś Piotr
- Taco: Enhancing Cross-lingual Transfer For Low-resource Languages In Llms Through Translation-assisted Chain-of-thought Processes Upadhayay Bibek, Behzadan Vahid
- In-context Ability Transfer For Question Decomposition In Complex QA V Venktesh, Bhattacharya Sourangshu, Anand Avishek
- On The Automatic Generation And Simplification Of Children's Stories Valentini Maria, Weber Jennifer, Salcido Jesus, Wright Téa, Colunga Eliana, Kann Katharina
- Mini-gpts: Efficient Large Language Models Through Contextual Pruning Valicenti Tim, Vidal Justice, Patnaik Ritik
- Theory Of Mind In Large Language Models: Examining Performance Of 11 State-of-the-art Models Vs. Children Aged 7-10 On Advanced Tests Van Duijn Max J., Van Dijk Bram M. A., Kouwenhoven Tom, De Valk Werner, Spruit Marco R., Van Der Putten Peter
- Small Language Models Improve Giants By Rewriting Their Outputs Vernikos Giorgos, Bražinskas Arthur, Adamek Jakub, Mallinson Jonathan, Severyn Aliaksei, Malmi Eric
- Prompt2model: Generating Deployable Models From Natural Language Instructions Viswanathan Vijay, Zhao Chenyang, Bertsch Amanda, Wu Tongshuang, Neubig Graham
- PIP: Parse-instructed Prefix For Syntactically Controlled Paraphrase Generation Wan Yixin, Huang Kuan-hao, Chang Kai-wei
- Reformulating Domain Adaptation Of Large Language Models As Adapt-retrieve-revise: A Case Study On Chinese Legal Domain Wan Zhen, Zhang Yating, Wang Yexiang, Cheng Fei, Kurohashi Sadao
- Shall We Pretrain Autoregressive Language Models With Retrieval? A Comprehensive Study Wang Boxin, Ping Wei, Xu Peng, Mcafee Lawrence, Liu Zihan, Shoeybi Mohammad, Dong Yi, Kuchaiev Oleksii, Li Bo, Xiao Chaowei, Anandkumar Anima, Catanzaro Bryan
- BERT4CTR: An Efficient Framework To Combine Pre-trained Language Model With Non-textual Features For CTR Prediction Wang Dong, Salamatian Kavé, Xia Yunqing, Deng Weiwei, Zhiang Qi
- Openchat: Advancing Open-source Language Models With Mixed-quality Data Wang Guan, Cheng Sijie, Zhan Xianyuan, Li Xiangang, Song Sen, Liu Yang
- What Makes For Good Visual Tokenizers For Large Language Models? Wang Guangzhi, Ge Yixiao, Ding Xiaohan, Kankanhalli Mohan, Shan Ying
- A Survey Of The Evolution Of Language Model-based Dialogue Systems Wang Hongru, Wang Lingzhi, Du Yiming, Chen Liang, Zhou Jingyan, Wang Yufei, Wong Kam-fai
- Dialog Action-aware Transformer For Dialog Policy Learning Wang Huimin, Kwan Wai-chung, Wong Kam-fai
- Missrec: Pre-training And Transferring Multi-modal Interest-aware Sequence Representation For Recommendation Wang Jinpeng, Zeng Ziyun, Wang Yunxiao, Wang Yuting, Lu Xingyu, Li Tianxiang, Yuan Jun, Zhang Rui, Zheng Hai-tao, Xia Shu-tao
- Improved Visual Fine-tuning With Natural Language Supervision Wang Junyang, Xu Yuanhong, Hu Juhua, Yan Ming, Sang Jitao, Qian Qi
- Mathcoder: Seamless Code Integration In Llms For Enhanced Mathematical Reasoning Wang Ke, Ren Houxing, Zhou Aojun, Lu Zimu, Luo Sichun, Shi Weikang, Zhang Renrui, Song Linqi, Zhan Mingjie, Li Hongsheng
- Adapting LLM Agents With Universal Feedback In Communication Wang Kuan, Lu Yadong, Santacroce Michael, Gong Yeyun, Zhang Chao, Shen Yelong
- Improving Text Embeddings With Large Language Models Wang Liang, Yang Nan, Huang Xiaolong, Yang Linjun, Majumder Rangan, Wei Furu
- Query2doc: Query Expansion With Large Language Models Wang Liang, Yang Nan, Wei Furu
- Making Large Language Models Better Reasoners With Alignment Wang Peiyi, Li Lei, Chen Liang, Song Feifan, Lin Binghuai, Cao Yunbo, Liu Tianyu, Sui Zhifang
- Demystifying Instruction Mixing For Fine-tuning Large Language Models Wang Renxi, Li Haonan, Wu Minghao, Wang Yuxia, Han Xudong, Zhang Chiyu, Baldwin Timothy
- Aurora: Activating Chinese Chat Capability For Mixtral-8x7b Sparse Mixture-of-experts Through Instruction-tuning Wang Rongsheng, Chen Haoming, Zhou Ruizhe, Duan Yaofei, Cai Kunyan, Ma Han, Cui Jiaxi, Li Jian, Pang Patrick Cheong-iao, Wang Yapeng, Tan Tao
- Ivygpt: Interactive Chinese Pathway Language Model In Medical Domain Wang Rongsheng, Duan Yaofei, Lam Chantong, Chen Jiexi, Xu Jiangsheng, Chen Haoming, Liu Xiaohong, Pang Patrick Cheong-iao, Tan Tao
- READ: Recurrent Adaptation Of Large Transformers Wang Sid, Nguyen John, Li Ke, Wu Carole-jean
- Lora Ensembles For Large Language Model Fine-tuning Wang Xi, Aitchison Laurence, Rudolph Maja
- Orthogonal Subspace Learning For Language Model Continual Learning Wang Xiao, Chen Tianze, Ge Qiming, Xia Han, Bao Rong, Zheng Rui, Zhang Qi, Gui Tao, Huang Xuanjing
- Guiding Language Model Reasoning With Planning Tokens Wang Xinyi, Caccia Lucas, Ostapenko Oleksiy, Yuan Xingdi, Wang William Yang, Sordoni Alessandro
- FIAT: Fusing Learning Paradigms With Instruction-accelerated Tuning Wang Xinyi, Wieting John, Clark Jonathan H.
- Promptagent: Strategic Planning With Language Models Enables Expert-level Prompt Optimization Wang Xinyuan, Li Chenxi, Wang Zhen, Bai Fan, Luo Haotian, Zhang Jiayou, Jojic Nebojsa, Xing Eric P., Hu Zhiting
- Non-intrusive Adaptation: Input-centric Parameter-efficient Fine-tuning For Versatile Multimodal Modeling Wang Yaqing, Wu Jialin, Dabral Tanmaya, Zhang Jiageng, Brown Geoff, Lu Chun-ta, Liu Frederick, Liang Yi, Pang Bo, Bendersky Michael, Soricut Radu
- NEWTON: Are Large Language Models Capable Of Physical Reasoning? Wang Yi Ru, Duan Jiafei, Fox Dieter, Srinivasa Siddhartha
- Universality And Limitations Of Prompt Tuning Wang Yihan, Chauhan Jatin, Wang Wei, Hsieh Cho-jui
- Rescue: Ranking LLM Responses With Partial Ordering To Improve Response Generation Wang Yikun, Zheng Rui, Li Haoming, Zhang Qi, Gui Tao, Liu Fei
- Multilora: Democratizing Lora For Better Multi-task Learning Wang Yiming, Lin Yu, Zeng Xiaodong, Zhang Guannan
- Primacy Effect Of Chatgpt Wang Yiwei, Cai Yujun, Chen Muhao, Liang Yuxuan, Hooi Bryan
- Fake Alignment: Are Llms Really Aligned Well? Wang Yixu, Teng Yan, Huang Kexin, Lyu Chengqi, Zhang Songyang, Zhang Wenwei, Ma Xingjun, Jiang Yu-gang, Qiao Yu, Wang Yingchun
- How Far Can Camels Go? Exploring The State Of Instruction Tuning On Open Resources Wang Yizhong, Ivison Hamish, Dasigi Pradeep, Hessel Jack, Khot Tushar, Chandu Khyathi Raghavi, Wadden David, Macmillan Kelsey, Smith Noah A., Beltagy Iz, Hajishirzi Hannaneh
- Towards Improving Document Understanding: An Exploration On Text-grounding Via Mllms Wang Yonghui, Zhou Wengang, Feng Hao, Zhou Keyi, Li Houqiang
- Harnessing The Power Of David Against Goliath: Exploring Instruction Data Generation Without Using Closed-source Models Wang Yue, Wang Xinrui, Li Juntao, Chang Jinxiong, Zhang Qishen, Liu Zhongyi, Zhang Guannan, Zhang Min
- Mediagpt: A Large Language Model For Chinese Media Wang Zhonghao, Lu Zijia, Jin Bo, Deng Haiying
- Do Llms Understand User Preferences? Evaluating Llms On User Rating Prediction Wang-cheng Kang, Jianmo Ni, Nikhil Mehta, Maheswaran Sathiamoorthy, Lichan Hong, Ed Chi, Derek Zhiyuan Cheng
- Call For Papers -- The Babylm Challenge: Sample-efficient Pretraining On A Developmentally Plausible Corpus Warstadt Alex, Choshen Leshem, Mueller Aaron, Williams Adina, Wilcox Ethan, Zhuang Chengxu
- Polylm: An Open Source Polyglot Large Language Model Wei Xiangpeng, Wei Haoran, Lin Huan, Li Tianhao, Zhang Pei, Ren Xingzhang, Li Mei, Wan Yu, Cao Zhiwei, Xie Binbin, Hu Tianxiang, Li Shangjie, Hui Binyuan, Yu Bowen, Liu Dayiheng, Yang Baosong, Huang Fei, Xie Jun
- LLMSTEP: LLM Proofstep Suggestions In Lean Welleck Sean, Saha Rahul
- Autodroid: Llm-powered Task Automation In Android Wen Hao, Li Yuanchun, Liu Guohong, Zhao Shanhui, Yu Tao, Li Toby Jia-jun, Jiang Shiqi, Liu Yunhao, Zhang Yaqin, Liu Yunxin
- Unveiling The Implicit Toxicity In Large Language Models Wen Jiaxin, Ke Pei, Sun Hao, Zhang Zhexin, Li Chengfei, Bai Jinfeng, Huang Minlie
- Hyperpolyglot Llms: Cross-lingual Interpretability In Token Embeddings Wen-yi Andrea W, Mimno David
- Helping Language Models Learn More: Multi-dimensional Task Prompt For Few-shot Tuning Weng Jinta, Zhang Jiarui, Hu Yue, Fa Daidong, Xuand Xiaofeng, Huang Heyan
- Lmtuner: An User-friendly And Highly-integrable Training Framework For Fine-tuning Large Language Models Weng Yixuan, Wang Zhiqi, Liao Huanxuan, He Shizhu, Liu Shengping, Liu Kang, Zhao Jun
- Exploring Parameter-efficient Fine-tuning Techniques For Code Generation With Large Language Models Weyssow Martin, Zhou Xin, Kim Kisub, Lo David, Sahraoui Houari
- Llm-powered Data Augmentation For Enhanced Cross-lingual Performance Whitehouse Chenxi, Choudhury Monojit, Aji Alham Fikri
- Better Question-answering Models On A Budget Wijeratne Yudhanjaya, Marikar Ishan
- Resolving References In Visually-grounded Dialogue Via Text Generation Willemsen Bram, Qian Livia, Skantze Gabriel
- Frustratingly Simple Memory Efficiency For Pre-trained Language Models Via Dynamic Embedding Pruning Williams Miles, Aletras Nikolaos
- Extrapolating Multilingual Understanding Models As Multilingual Generators Wu Bohong, Yuan Fei, Zhao Hai, Li Lei, Xu Jingjing
- Pmc-llama: Towards Building Open-source Language Models For Medicine Wu Chaoyi, Lin Weixiong, Zhang Xiaoman, Zhang Ya, Wang Yanfeng, Xie Weidi
- Exploring The Limits Of Chatgpt In Software Security Applications Wu Fangzhou, Zhang Qingzhao, Bajaj Ati Priya, Bao Tiffany, Zhang Ning, Wang Ruoyu "fish", Xiao Chaowei
- P-laplacian Adaptation For Generative Pre-trained Vision-language Models Wu Haoyuan, Zhang Xinyun, Xu Peng, Liao Peiyu, Yao Xufeng, Yu Bei
- Lamini-lm: A Diverse Herd Of Distilled Models From Large-scale Instructions Wu Minghao, Waheed Abdul, Zhang Chiyu, Abdul-mageed Muhammad, Aji Alham Fikri
- Approximated Prompt Tuning For Vision-language Pre-trained Models Wu Qiong, Huang Shubin, Zhou Yiyi, Dai Pingyang, Shu Annan, Jiang Guannan, Ji Rongrong
- YUAN 2.0: A Large Language Model With Localized Filtering-based Attention Wu Shaohua, Zhao Xudong, Wang Shenling, Luo Jiangang, Li Lingjun, Chen Xi, Zhao Bing, Wang Wei, Yu Tong, Zhang Rongguo, Zhang Jiahua, Wang Chao
- Paragraph-to-image Generation With Information-enriched Diffusion Model Wu Weijia, Li Zhuang, He Yefei, Shou Mike Zheng, Shen Chunhua, Cheng Lele, Li Yan, Gao Tingting, Zhang Di, Wang Zhongyuan
- Openicl: An Open-source Framework For In-context Learning Wu Zhenyu, Wang Yaoxiang, Ye Jiacheng, Feng Jiangtao, Xu Jingjing, Qiao Yu, Wu Zhiyong
- Language Models Meet World Models: Embodied Experiences Enhance Language Models Xiang Jiannan, Tao Tianhua, Gu Yi, Shu Tianmin, Wang Zirui, Yang Zichao, Hu Zhiting
- Efficient Streaming Language Models With Attention Sinks Xiao Guangxuan, Tian Yuandong, Chen Beidi, Han Song, Lewis Mike
- Large Language Models Can Be Good Privacy Protection Learners Xiao Yijia, Jin Yiqiao, Bai Yushi, Wu Yue, Yang Xianjun, Luo Xiao, Yu Wenchao, Zhao Xujiang, Liu Yanchi, Chen Haifeng, Wang Wei, Cheng Wei
- Instruction-vit: Multi-modal Prompts For Instruction Learning In Vit Xiao Zhenxiang, Chen Yuzhong, Zhang Lu, Yao Junjie, Wu Zihao, Yu Xiaowei, Pan Yi, Zhao Lin, Ma Chong, Liu Xinyu, Liu Wei, Li Xiang, Yuan Yixuan, Shen Dinggang, Zhu Dajiang, Liu Tianming, Jiang Xi
- Supporting Qualitative Analysis With Large Language Models: Combining Codebook With GPT-3 For Deductive Coding Xiao Ziang, Yuan Xingdi, Liao Q. Vera, Abdelghani Rania, Oudeyer Pierre-yves
- Instructed Language Models With Retrievers Are Powerful Entity Linkers Xiao Zilin, Gong Ming, Wu Jie, Zhang Xingyao, Shou Linjun, Pei Jian, Jiang Daxin
- The Wall Street Neophyte: A Zero-shot Analysis Of Chatgpt Over Multimodal Stock Movement Prediction Challenges Xie Qianqian, Han Weiguang, Lai Yanzhao, Peng Min, Huang Jimin
- Proto-lm: A Prototypical Network-based Framework For Built-in Interpretability In Large Language Models Xie Sean, Vosoughi Soroush, Hassanpour Saeed
- A Prompt Log Analysis Of Text-to-image Generation Systems Xie Yutong, Pan Zhaoying, Ma Jinge, Jie Luo, Mei Qiaozhu
- Llm-pruner: On The Structural Pruning Of Large Language Models Xinyin Ma, Gongfan Fang, Xinchao Wang
- Doctorglm: Fine-tuning Your Chinese Doctor Is Not A Herculean Task Xiong Honglin, Wang Sheng, Zhu Yitao, Zhao Zihao, Liu Yuxiao, Huang Linlin, Wang Qian, Shen Dinggang
- Can Llms Express Their Uncertainty? An Empirical Evaluation Of Confidence Elicitation In Llms Xiong Miao, Hu Zhiyuan, Lu Xinyang, Li Yifei, Fu Jie, He Junxian, Hooi Bryan
- Rewoo: Decoupling Reasoning From Observations For Efficient Augmented Language Models Xu Binfeng, Peng Zhiyuan, Lei Bowen, Mukherjee Subhabrata, Liu Yuchen, Xu Dongkuan
- Baize: An Open-source Chat Model With Parameter-efficient Tuning On Self-chat Data Xu Canwen, Guo Daya, Duan Nan, Mcauley Julian
- Automatic Pair Construction For Contrastive Post-training Xu Canwen, Rosset Corby, Chau Ethan C., Del Corro Luciano, Mahajan Shweti, Mcauley Julian, Neville Jennifer, Awadallah Ahmed Hassan, Rao Nikhil
- Align On The Fly: Adapting Chatbot Behavior To Established Norms Xu Chunpu, Chern Steffi, Chern Ethan, Zhang Ge, Wang Zekun, Liu Ruibo, Li Jing, Fu Jie, Liu Pengfei
- Large Language Models For Generative Information Extraction: A Survey Xu Derong, Chen Wei, Peng Wenjun, Zhang Chao, Xu Tong, Zhao Xiangyu, Wu Xian, Zheng Yefeng, Wang Yang, Chen Enhong
- A Paradigm Shift In Machine Translation: Boosting Translation Performance Of Large Language Models Xu Haoran, Kim Young Jin, Sharaf Amr, Awadalla Hany Hassan
- No Train Still Gain. Unleash Mathematical Reasoning Of Large Language Models With Monte Carlo Tree Search Guided By Energy Function Xu Haotian
- Penetrative AI: Making Llms Comprehend The Physical World Xu Huatao, Han Liying, Yang Qirui, Li Mo, Srivastava Mani
- Instructions As Backdoors: Backdoor Vulnerabilities Of Instruction Tuning For Large Language Models Xu Jiashu, Ma Mingyu Derek, Wang Fei, Xiao Chaowei, Chen Muhao
- Shattering The Agent-environment Interface For Fine-tuning Inclusive Language Models Xu Wanqiao, Dong Shi, Arumugam Dilip, Van Roy Benjamin
- Reasons To Reject? Aligning Language Models With Judgments Xu Weiwen, Cai Deng, Zhang Zhisong, Lam Wai, Shi Shuming
- Llmrefine: Pinpointing And Refining Large Language Models Via Fine-grained Actionable Feedback Xu Wenda, Deutsch Daniel, Finkelstein Mara, Juraska Juraj, Zhang Biao, Liu Zhongtao, Wang William Yang, Li Lei, Freitag Markus
- Mental-llm: Leveraging Large Language Models For Mental Health Prediction Via Online Text Data Xu Xuhai, Yao Bingsheng, Dong Yuanzhe, Gabriel Saadia, Yu Hong, Hendler James, Ghassemi Marzyeh, Dey Anind K., Wang Dakuo
- Drivegpt4: Interpretable End-to-end Autonomous Driving Via Large Language Model Xu Zhenhua, Zhang Yujia, Xie Enze, Zhao Zhen, Guo Yong, Wong Kwan-yee K., Li Zhenguo, Zhao Hengshuang
- Occuquest: Mitigating Occupational Bias For Inclusive Large Language Models Xue Mingfeng, Liu Dayiheng, Yang Kexin, Dong Guanting, Lei Wenqiang, Yuan Zheng, Zhou Chang, Zhou Jingren
- Parameter-efficient Tuning Helps Language Model Alignment Xue Tianci, Wang Ziqi, Ji Heng
- Scaling Evidence-based Instructional Design Expertise Through Large Language Models Yadav Gautam
- Tackling Vision Language Tasks Through Learning Inner Monologues Yang Diji, Chen Kezhen, Rao Jinmeng, Guo Xiaoyuan, Zhang Yawen, Yang Jie, Zhang Yi
- Refgpt: Dialogue Generation Of GPT, By GPT, And For GPT Yang Dongjie, Yuan Ruifeng, Fan Yuantao, Yang Yifei, Wang Zili, Wang Shusen, Zhao Hai
- Instoptima: Evolutionary Multi-objective Instruction Optimization Via Large Language Model-based Instruction Operators Yang Heng, Li Ke
- Auto-gpt For Online Decision Making: Benchmarks And Additional Opinions Yang Hui, Yue Sifu, He Yunzhong
- Longqlora: Efficient And Effective Method To Extend Context Length Of Large Language Models Yang Jianxin
- Good Questions Help Zero-shot Image Reasoning Yang Kaiwen, Shen Tao, Tian Xinmei, Geng Xiubo, Tao Chongyang, Tao Dacheng, Zhou Tianyi
- Medxchat: A Unified Multimodal Large Language Model Framework Towards Cxrs Understanding And Generation Yang Ling, Wang Zhanyu, Chen Zhenghao, Liang Xinyu, Zhou Luping
- Skymath: Technical Report Yang Liu, Yang Haihua, Cheng Wenjun, Lin Lei, Li Chenxia, Chen Yifu, Liu Lunan, Pan Jianfei, Wei Tianwen, Li Biye, Zhao Liang, Wang Lijie, Zhu Bo, Li Guoliang, Wu Xuejie, Luo Xilin, Hu Rui
- Progressive Evidence Refinement For Open-domain Multimodal Retrieval Question Answering Yang Shuwen, Wu Anran, Wu Xingjiao, Xiao Luwei, Ma Tianlong, Jin Cheng, He Liang
- Zhongjing: Enhancing The Chinese Medical Capabilities Of Large Language Model Through Expert Feedback And Real-world Multi-turn Dialogue Yang Songhua, Zhao Hanjie, Zhu Senbin, Zhou Guangyu, Xu Hongfei, Jia Yuxiang, Zan Hongying
- Exploring An LM To Generate Prolog Predicates From Mathematics Questions Yang Xiaocheng, Tam Yik-cheung
- On The Calibration Of Multilingual Question Answering Llms Yang Yahan, Dan Soham, Roth Dan, Lee Insup
- Harnessing The Power Of Large Language Models For Natural Language To First-order Logic Translation Yang Yuan, Xiong Siheng, Payani Ali, Shareghi Ehsan, Fekri Faramarz
- Improving The Reliability Of Large Language Models By Leveraging Uncertainty-aware In-context Learning Yang Yuchen, Li Houqiang, Wang Yanfeng, Wang Yu
- On The Planning, Search, And Memorization Capabilities Of Large Language Models Yang Yunhao, Tomar Anshul
- Re-vilm: Retrieval-augmented Visual Language Model For Zero And Few-shot Image Captioning Yang Zhuolin, Ping Wei, Liu Zihan, Korthikanti Vijay, Nie Weili, Huang De-an, Fan Linxi, Yu Zhiding, Lan Shiyi, Li Bo, Liu Ming-yu, Zhu Yuke, Shoeybi Mohammad, Catanzaro Bryan, Xiao Chaowei, Anandkumar Anima
- Improving Language Model Negotiation With Self-play And In-context Learning From AI Feedback Yao Fu, Hao Peng, Tushar Khot, Mirella Lapata
- Knowledge Plugins: Enhancing Large Language Models For Domain-specific Recommendations Yao Jing, Xu Wei, Lian Jianxun, Wang Xiting, Yi Xiaoyuan, Xie Xing
- Retroformer: Retrospective Large Language Agents With Policy Gradient Optimization Yao Weiran, Heinecke Shelby, Niebles Juan Carlos, Liu Zhiwei, Feng Yihao, Xue Le, Murthy Rithesh, Chen Zeyuan, Zhang Jianguo, Arpit Devansh, Xu Ran, Mui Phil, Wang Huan, Xiong Caiming, Savarese Silvio
- Deltazip: Multi-tenant Language Model Serving Via Delta Compression Yao Xiaozhe, Klimovic Ana
- A Survey On Large Language Model (LLM) Security And Privacy: The Good, The Bad, And The Ugly Yao Yifan, Duan Jinhao, Xu Kaidi, Cai Yuanfang, Sun Zhibo, Zhang Yue
- Editing Large Language Models: Problems, Methods, And Opportunities Yao Yunzhi, Wang Peng, Tian Bozhong, Cheng Siyuan, Li Zhoubo, Deng Shumin, Chen Huajun, Zhang Ningyu
- Deepspeed-visualchat: Multi-round Multi-image Interleave Chat Via Multi-modal Causal Attention Yao Zhewei, Wu Xiaoxia, Li Conglong, Zhang Minjia, Qin Heyang, Ruwase Olatunji, Awan Ammar Ahmad, Rajbhandari Samyam, He Yuxiong
- Ureader: Universal Ocr-free Visually-situated Language Understanding With Multimodal Large Language Model Ye Jiabo, Hu Anwen, Xu Haiyang, Ye Qinghao, Yan Ming, Xu Guohai, Li Chenliang, Tian Junfeng, Qian Qi, Zhang Ji, Jin Qin, He Liang, Lin Xin Alex, Huang Fei
- Qilin-med: Multi-stage Knowledge Injection Advanced Medical Large Language Model Ye Qichen, Liu Junling, Chong Dading, Zhou Peilin, Hua Yining, Liu Fenglin, Cao Meng, Wang Ziming, Cheng Xuxin, Lei Zhu, Guo Zhenhua
- Effective Large Language Model Adaptation For Improved Grounding And Citation Generation Ye Xi, Sun Ruoxi, Arik Sercan Ö., Pfister Tomas
- Rational Decision-making Agent With Internalized Utility Judgment Ye Yining, Cong Xin, Tian Shizuo, Qin Yujia, Liu Chong, Lin Yankai, Liu Zhiyuan, Sun Maosong
- ASPEN: High-throughput Lora Fine-tuning Of Large Language Models With A Single GPU Ye Zhengmao, Li Dengchun, Tian Jingqi, Lan Tingfeng, Zuo Jie, Duan Lei, Lu Hui, Jiang Yexi, Sha Jian, Zhang Ke, Tang Mingjie
- Navigating Text-to-image Customization: From Lycoris Fine-tuning To Model Evaluation Yeh Shih-ying, Hsieh Yu-guan, Gao Zhidong, Yang Bernard B W, Oh Giyeong, Gong Yanmin
- QAID: Question Answering Inspired Few-shot Intent Detection Yehudai Asaf, Vetzler Matan, Mass Yosi, Lazar Koren, Cohen Doron, Carmeli Boaz
- Cones: Concept Embedding Search For Parameter Efficient Tuning Large Vision Language Models Yi Huahui, Qin Ziyuan, Xu Wei, Guo Miaotian, Wang Kun, Zhang Shaoting, Li Kang, Lao Qicheng
- Benchmarking And Defending Against Indirect Prompt Injection Attacks On Large Language Models Yi Jingwei, Xie Yueqi, Zhu Bin, Kiciman Emre, Sun Guangzhong, Xie Xing, Wu Fangzhao
- Lemur: Harmonizing Natural Language And Code For Language Agents Yiheng Xu, Hongjin Su, Chen Xing, Boyu Mi, Qian Liu, Weijia Shi, Binyuan Hui, Fan Zhou, Yitao Liu, Tianbao Xie, Zhoujun Cheng, Siheng Zhao, Lingpeng Kong, Bailin Wang, Caiming Xiong, Tao Yu
- Dynosaur: A Dynamic Growth Paradigm For Instruction-tuning Data Curation Yin Da, Liu Xiao, Yin Fan, Zhong Ming, Bansal Hritik, Han Jiawei, Chang Kai-wei
- Modulora: Finetuning 2-bit Llms On Consumer Gpus By Integrating With Modular Quantizers Yin Junjie, Dong Jiahao, Wang Yingheng, De Sa Christopher, Kuleshov Volodymyr
- LAMM: Language-assisted Multi-modal Instruction-tuning Dataset, Framework, And Benchmark Yin Zhenfei, Wang Jiong, Cao Jianjian, Shi Zhelun, Liu Dingning, Li Mukai, Sheng Lu, Bai Lei, Huang Xiaoshui, Wang Zhiyong, Shao Jing, Ouyang Wanli
- From Text To Motion: Grounding GPT-4 In A Humanoid Robot "alter3" Yoshida Takahide, Masumori Atsushi, Ikegami Takashi
- Merlin: Empowering Multimodal Llms With Foresight Minds Yu En, Zhao Liang, Wei Yana, Yang Jinrong, Wu Dongming, Kong Lingyu, Wei Haoran, Wang Tiancai, Ge Zheng, Zhang Xiangyu, Tao Wenbing
- GPTFUZZER: Red Teaming Large Language Models With Auto-generated Jailbreak Prompts Yu Jiahao, Lin Xingwei, Yu Zheng, Xing Xinyu
- MELO: Enhancing Model Editing With Neuron-indexed Dynamic Lora Yu Lang, Chen Qin, Zhou Jie, He Liang
- Language Models Are Super Mario: Absorbing Abilities From Homologous Models As A Free Lunch Yu Le, Yu Bowen, Yu Haiyang, Huang Fei, Li Yongbin
- Scaling Autoregressive Multi-modal Models: Pretraining And Instruction Tuning Yu Lili, Shi Bowen, Pasunuru Ramakanth, Muller Benjamin, Golovneva Olga, Wang Tianlu, Babu Arun, Tang Binh, Karrer Brian, Sheynin Shelly, Ross Candace, Polyak Adam, Howes Russell, Sharma Vasu, Xu Puxin, Tamoyan Hovhannes, Ashual Oron, Singer Uriel, Li Shang-wen, Zhang Susan, James Richard, Ghosh Gargi, Taigman Yaniv, Fazel-zarandi Maryam, Celikyilmaz Asli, Zettlemoyer Luke, Aghajanyan Armen
- Information Association For Language Model Updating By Mitigating Lm-logical Discrepancy Yu Pengfei, Ji Heng
- Improving Language Models Via Plug-and-play Retrieval Feedback Yu Wenhao, Zhang Zhihan, Liang Zhenwen, Jiang Meng, Sabharwal Ashish
- Training With "paraphrasing The Original Text" Improves Long-context Performance Yu Yijiong, Huang Yongfeng, Qi Zhixiao, Zhou Zhe
- Fine-tuning Language Models With Generative Adversarial Reward Modelling Yu Zhang Ze, Jaw Lau Jia, Hui Zhang, Low Bryan Kian Hsiang
- How Vocabulary Sharing Facilitates Multilingualism In Llama? Yuan Fei, Yuan Shuai, Wu Zhiyong, Li Lei
- Beneath Surface Similarity: Large Language Models Make Reasonable Scientific Analogies After Structure Abduction Yuan Siyu, Chen Jiangjie, Ge Xuyang, Xiao Yanghua, Yang Deqing
- Scaling Relationship On Learning Mathematical Reasoning With Large Language Models Yuan Zheng, Yuan Hongyi, Li Chengpeng, Dong Guanting, Lu Keming, Tan Chuanqi, Zhou Chang, Zhou Jingren
- Artgpt-4: Towards Artistic-understanding Large Vision-language Models With Enhanced Adapter Yuan Zhengqing, He Yunhong, Wang Kun, Ye Yanfang, Sun Lichao
- Evaluating Instruction-tuned Large Language Models On Code Comprehension And Generation Yuan Zhiqiang, Liu Junwei, Zi Qiancheng, Liu Mingwei, Peng Xin, Lou Yiling
- Plan, Eliminate, And Track -- Language Models Are Good Teachers For Embodied Agents Yue Wu, So Yeon Min, Yonatan Bisk, Ruslan Salakhutdinov, Amos Azaria, Yuanzhi Li, Tom Mitchell, Shrimai Prabhumoye
- Aligning Large Language Models With Human: A Survey Yufei Wang, Wanjun Zhong, Liangyou Li, Fei Mi, Xingshan Zeng, Wenyong Huang, Lifeng Shang, Xin Jiang, Qun Liu
- Planting A SEED Of Vision In Large Language Model Yuying Ge, Yixiao Ge, Ziyun Zeng, Xintao Wang, Ying Shan
- Fairness-aware Structured Pruning In Transformers Zayed Abdelrahman, Mordido Goncalo, Shabanian Samira, Baldini Ioana, Chandar Sarath
- TIM: Teaching Large Language Models To Translate With Comparison Zeng Jiali, Meng Fandong, Yin Yongjing, Zhou Jie
- Exploring Memorization In Fine-tuned Language Models Zeng Shenglai, Li Yaxin, Ren Jie, Liu Yiding, Xu Han, He Pengfei, Xing Yue, Wang Shuaiqiang, Tang Jiliang, Yin Dawei
- Building Open-ended Embodied Agent Via Language-policy Bidirectional Adaptation Zhai Shaopeng, Wang Jie, Zhang Tianyi, Huang Fuxian, Zhang Qi, Zhou Ming, Hou Jing, Qiao Yu, Liu Yu
- Investigating The Catastrophic Forgetting In Multimodal Large Language Models Zhai Yuexiang, Tong Shengbang, Li Xiao, Cai Mu, Qu Qing, Lee Yong Jae, Ma Yi
- Removing RLHF Protections In GPT-4 Via Fine-tuning Zhan Qiusi, Fang Richard, Bindu Rohan, Gupta Akul, Hashimoto Tatsunori, Kang Daniel
- Vpgtrans: Transfer Visual Prompt Generator Across Llms Zhang Ao, Fei Hao, Yao Yuan, Ji Wei, Li Li, Liu Zhiyuan, Chua Tat-seng
- Mm-narrator: Narrating Long-form Videos With Multimodal In-context Learning Zhang Chaoyi, Lin Kevin, Yang Zhengyuan, Wang Jianfeng, Li Linjie, Lin Chung-ching, Liu Zicheng, Wang Lijuan
- User-controlled Knowledge Fusion In Large Language Models: Balancing Creativity And Hallucination Zhang Chen
- Revisiting Block-based Quantisation: What Is Important For Sub-8-bit LLM Inference? Zhang Cheng, Cheng Jianyi, Shumailov Ilia, Constantinides George A., Zhao Yiren
- Benchmarking And In-depth Performance Study Of Large Language Models On Habana Gaudi Processors Zhang Chengming, Sun Baixi, Yu Xiaodong, Xie Zhen, Zheng Weijian, Iskra Kamil, Beckman Pete, Tao Dingwen
- The Skipped Beat: A Study Of Sociopragmatic Understanding In Llms For 64 Languages Zhang Chiyu, Doan Khai Duy, Liao Qisheng, Abdul-mageed Muhammad
- On The Safety Of Open-sourced Large Language Models: Does Alignment Really Prevent Them From Being Misused? Zhang Hangfan, Guo Zhimeng, Zhu Huaisheng, Cao Bochuan, Lin Lu, Jia Jinyuan, Chen Jinghui, Wu Dinghao
- Tuning Large Language Model For End-to-end Speech Translation Zhang Hao, Si Nianwen, Chen Yaqi, Zhang Wenlin, Yang Xukui, Qu Dan, Jiao Xiaolin
- XATU: A Fine-grained Instruction-based Benchmark For Explainable Text Updates Zhang Haopeng, Iso Hayate, Gurajada Sairam, Bhutani Nikita
- Composing Parameter-efficient Modules With Arithmetic Operations Zhang Jinghan, Chen Shiqi, Liu Junteng, He Junxian
- Aligning Instruction Tasks Unlocks Large Language Models As Zero-shot Relation Extractors Zhang Kai, Gutiérrez Bernal Jiménez, Su Yu
- Llm-based Medical Assistant Personalization With Short- And Long-term Memory Coordination Zhang Kai, Kang Yangyang, Zhao Fubang, Liu Xiaozhong
- Crash: Clustering, Removing, And Sharing Enhance Fine-tuning Without Full Large Language Model Zhang Kaiyan, Ding Ning, Qi Biqing, Zhu Xuekai, Long Xinwei, Zhou Bowen
- Weighted Sampling For Masked Language Modeling Zhang Linhan, Chen Qian, Wang Wen, Deng Chong, Cao Xin, Hao Kongzhang, Jiang Yuxin, Wang Wei
- A Two-stage Adaptation Of Large Language Models For Text Ranking Zhang Longhui, Zhang Yanzhao, Long Dingkun, Xie Pengjun, Zhang Meishan, Zhang Min
- Lora-fa: Memory-efficient Low-rank Adaptation For Large Language Models Fine-tuning Zhang Longteng, Zhang Lin, Shi Shaohuai, Chu Xiaowen, Li Bo
- Retrieve Anything To Augment Large Language Models Zhang Peitian, Xiao Shitao, Liu Zheng, Dou Zhicheng, Nie Jian-yun
- Llama-adapter: Efficient Fine-tuning Of Language Models With Zero-init Attention Zhang Renrui, Han Jiaming, Liu Chris, Gao Peng, Zhou Aojun, Hu Xiangfei, Yan Shilin, Lu Pan, Li Hongsheng, Qiao Yu
- Generation-driven Contrastive Self-training For Zero-shot Text Classification With Instruction-following LLM Zhang Ruohong, Wang Yau-shian, Yang Yiming
- Instruction Tuning For Large Language Models: A Survey Zhang Shengyu, Dong Linfeng, Li Xiaoya, Zhang Sen, Sun Xiaofei, Wang Shuhe, Li Jiwei, Hu Runyi, Zhang Tianwei, Wu Fei, Wang Guoyin
- Adarefiner: Refining Decisions Of Language Models With Adaptive Feedback Zhang Wanpeng, Lu Zongqing
- Bridging The Information Gap Between Domain-specific Model And General LLM For Personalized Recommendation Zhang Wenxuan, Liu Hongzhi, Du Yingpeng, Zhu Chen, Song Yang, Zhu Hengshu, Wu Zhonghai
- EHR Interaction Between Patients And AI: Noteaid EHR Interaction Zhang Xiaocheng, Yao Zonghai, Yu Hong
- Large Language Models In Medical Term Classification And Unexpected Misalignment Between Response And Reasoning Zhang Xiaodan, Vemulapalli Sandeep, Talukdar Nabasmita, Ahn Sumyeong, Wang Jiankun, Meng Han, Murtaza Sardar Mehtab Bin, Dave Aakash Ajay, Leshchiner Dmitry, Joseph Dimitri F., Witteveen-lane Martin, Chesla Dave, Zhou Jiayu, Chen Bin
- PMC-VQA: Visual Instruction Tuning For Medical Visual Question Answering Zhang Xiaoman, Wu Chaoyi, Zhao Ziheng, Lin Weixiong, Zhang Ya, Wang Yanfeng, Xie Weidi
- Dialoguellm: Context And Emotion Knowledge-tuned Large Language Models For Emotion Recognition In Conversations Zhang Yazhou, Wang Mengyao, Wu Youxi, Tiwari Prayag, Li Qiuchi, Wang Benyou, Qin Jing
- Meta Prompting For AI Systems Zhang Yifan, Yuan Yang, Yao Andrew Chi-chih
- Balancing Specialized And General Skills In Llms: The Impact Of Modern Tuning And Data Strategy Zhang Zheng, Zheng Chen, Tang Da, Sun Ke, Ma Yukun, Bu Yingtong, Zhou Xun, Zhao Liang
- H₂O: Heavy-hitter Oracle For Efficient Generative Inference Of Large Language Models Zhang Zhenyu, Sheng Ying, Zhou Tianyi, Chen Tianlong, Zheng Lianmin, Cai Ruisi, Song Zhao, Tian Yuandong, Ré Christopher, Barrett Clark, Wang Zhangyang, Chen Beidi
- Auto-instruct: Automatic Instruction Generation And Ranking For Black-box Language Models Zhang Zhihan, Wang Shuohang, Yu Wenhao, Xu Yichong, Iter Dan, Zeng Qingkai, Liu Yang, Zhu Chenguang, Jiang Meng
- Accountable Textual-visual Chat Learns To Reject Human Instructions In Image Re-creation Zhang Zhiwei, Liu Yuliang
- Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-thought Reasoning To Language Agents Zhang Zhuosheng, Yao Yao, Zhang Aston, Tang Xiangru, Ma Xinbei, He Zhiwei, Wang Yiming, Gerstein Mark, Wang Rui, Liu Gongshen, Zhao Hai
- MELA: Multilingual Evaluation Of Linguistic Acceptability Zhang Ziyin, Liu Yikang, Huang Weifang, Mao Junyu, Wang Rui, Hu Hai
- Large Language Models As Zero-shot Conversational Recommenders Zhankui He, Zhouhang Xie, Rahul Jha, Harald Steck, Dawen Liang, Yesu Feng, Bodhisattwa Prasad Majumder, Nathan Kallus, Julian Mcauley
- Tuning Layernorm In Attention: Towards Efficient Multi-modal LLM Finetuning Zhao Bingchen, Tu Haoqin, Wei Chen, Mei Jieru, Xie Cihang
- SVIT: Scaling Up Visual Instruction Tuning Zhao Bo, Wu Boya, He Muyang, Huang Tiejun
- Explainability For Large Language Models: A Survey Zhao Haiyan, Chen Hanjie, Yang Fan, Liu Ninghao, Deng Huiqi, Cai Hengyi, Wang Shuaiqiang, Yin Dawei, Du Mengnan
- Hallucination Detection For Grounded Instruction Generation Zhao Lingjun, Nguyen Khanh, Daumé Hal III
- Group Preference Optimization: Few-shot Alignment Of Large Language Models Zhao Siyan, Dang John, Grover Aditya
- Ca-lora: Adapting Existing Lora For Compressed Llms To Enable Efficient Multi-tasking On Personal Devices Zhao Weilin, Huang Yuxiang, Han Xu, Liu Zhiyuan, Zhang Zhengyan, Li Kuai, Chen Chen, Yang Tao, Sun Maosong
- Babystories: Can Reinforcement Learning Teach Baby Language Models To Write Better Stories? Zhao Xingmeng, Wang Tongnian, Osborn Sheri, Rios Anthony
- Slic-hf: Sequence Likelihood Calibration With Human Feedback Zhao Yao, Joshi Rishabh, Liu Tianqi, Khalman Misha, Saleh Mohammad, Liu Peter J.
- Multi-modal In-context Learning Makes An Ego-evolving Scene Text Recognizer Zhao Zhen, Tang Jingqun, Lin Chunhui, Wu Binghong, Huang Can, Liu Hao, Tan Xin, Zhang Zhizhong, Xie Yuan
- Adapting Large Language Models By Integrating Collaborative Semantics For Recommendation Zheng Bowen, Hou Yupeng, Lu Hongyu, Chen Yu, Zhao Wayne Xin, Chen Ming, Wen Ji-rong
- Can We Edit Factual Knowledge By In-context Learning? Zheng Ce, Li Lei, Dong Qingxiu, Fan Yuxuan, Wu Zhiyong, Xu Jingjing, Chang Baobao
- Lyra: Orchestrating Dual Correction In Automated Theorem Proving Zheng Chuanyang, Wang Haiming, Xie Enze, Liu Zhengying, Sun Jiankai, Xin Huajian, Shen Jianhao, Li Zhenguo, Li Yu
- Learn From Model Beyond Fine-tuning: A Survey Zheng Hongling, Shen Li, Tang Anke, Luo Yong, Hu Han, Du Bo, Tao Dacheng
- Trafficsafetygpt: Tuning A Pre-trained Large Language Model To A Domain-specific Expert In Transportation Safety Zheng Ou, Abdel-aty Mohamed, Wang Dongdong, Wang Chenzhu, Ding Shengxuan
- Improving Generalization Of Alignment With Human Preferences Through Group Invariant Learning Zheng Rui, Shen Wei, Hua Yuan, Lai Wenbin, Dou Shihan, Zhou Yuhao, Xi Zhiheng, Wang Xiao, Huang Haoran, Gui Tao, Zhang Qi, Huang Xuanjing
- Gpt-fathom: Benchmarking Large Language Models To Decipher The Evolutionary Path Towards GPT-4 And Beyond Zheng Shen, Zhang Yuyu, Zhu Yijie, Xi Chenguang, Gao Pengyang, Zhou Xun, Chang Kevin Chen-chuan
- Principle-driven Self-alignment Of Language Models From Scratch With Minimal Human Supervision Zhiqing Sun, Yikang Shen, Qinhong Zhou, Hongxin Zhang, Zhenfang Chen, David Cox, Yiming Yang, Chuang Gan
- Seeking Neural Nuggets: Knowledge Transfer In Large Language Models From A Parametric Perspective Zhong Ming, An Chenxin, Chen Weizhu, Han Jiawei, He Pengcheng
- Bag Of Tricks For Effective Language Model Pretraining And Downstream Adaptation: A Case Study On GLUE Zhong Qihuang, Ding Liang, Peng Keqin, Liu Juhua, Du Bo, Shen Li, Zhan Yibing, Tao Dacheng
- Let's Think Outside The Box: Exploring Leap-of-thought In Large Language Models With Creative Humor Generation Zhong Shanshan, Huang Zhongzhan, Gao Shanghua, Wen Wushao, Lin Liang, Zitnik Marinka, Zhou Pan
- Sur-adapter: Enhancing Text-to-image Pre-trained Diffusion Models With Large Language Models Zhong Shanshan, Huang Zhongzhan, Wen Wushao, Qin Jinghui, Lin Liang
- Language Agent Tree Search Unifies Reasoning Acting And Planning In Language Models Zhou Andy, Yan Kai, Shlapentokh-rothman Michal, Wang Haohan, Wang Yu-xiong
- Navgpt: Explicit Reasoning In Vision-and-language Navigation With Large Language Models Zhou Gengze, Hong Yicong, Wu Qi
- Lobass: Gauging Learnability In Supervised Fine-tuning Data Zhou Haotian, Liu Tingkai, Ma Qianli, Yuan Jianbo, Liu Pengfei, You Yang, Yang Hongxia
- Vicor: Bridging Visual Understanding And Commonsense Reasoning With Large Language Models Zhou Kaiwen, Lee Kwonjoon, Misu Teruhisa, Wang Xin Eric
- Infmllm: A Unified Framework For Visual-language Tasks Zhou Qiang, Wang Zhibin, Chu Wei, Xu Yinghui, Li Hao, Qi Yuan
- Multi-stage Pre-training Enhanced By Chatgpt For Multi-scenario Multi-domain Dialogue Summarization Zhou Weixiao, Li Gengyao, Cheng Xianfu, Liang Xinnian, Zhu Junnan, Zhai Feifei, Li Zhoujun
- Making Harmful Behaviors Unlearnable For Large Language Models Zhou Xin, Lu Yi, Ma Ruotian, Gui Tao, Zhang Qi, Huang Xuanjing
- Revisiting Automated Prompting: Are We Actually Doing Better? Zhou Yulin, Zhao Yiren, Shumailov Ilia, Mullins Robert, Gal Yarin
- The Mystery Of In-context Learning: A Comprehensive Survey On Interpretation And Analysis Zhou Yuxiang, Li Jiazheng, Xiang Yanzheng, Yan Hanqi, Gui Lin, He Yulan
- Large Language Model As A Policy Teacher For Training Reinforcement Learning Agents Zhou Zihao, Hu Bin, Zhao Chenyang, Zhang Pu, Liu Bin
- Pose: Efficient Context Window Extension Of Llms Via Positional Skip-wise Training Zhu Dawei, Yang Nan, Wang Liang, Song Yifan, Wu Wenhao, Wei Furu, Li Sujian
- Judgelm: Fine-tuned Large Language Models Are Scalable Judges Zhu Lianghui, Wang Xinggang, Wang Xinlong
- Coca: Fusing Position Embedding With Collinear Constrained Attention In Transformers For Long Context Window Extending Zhu Shiyi, Ye Jing, Jiang Wei, Xue Siqiao, Zhang Qi, Wu Yifan, Li Jianguo
- Promptcblue: A Chinese Prompt Tuning Benchmark For The Medical Domain Zhu Wei, Wang Xiaoling, Zheng Huanran, Chen Mosha, Tang Buzhou
- Extrapolating Large Language Models To Non-english By Aligning Languages Zhu Wenhao, Lv Yunzhe, Dong Qingxiu, Yuan Fei, Xu Jingjing, Huang Shujian, Kong Lingpeng, Chen Jiajun, Li Lei
- Pad: Program-aided Distillation Can Teach Small Models Reasoning Better Than Chain-of-thought Fine-tuning Zhu Xuekai, Qi Biqing, Zhang Kaiyan, Long Xinwei, Lin Zhouhan, Zhou Bowen
- 3d-vista: Pre-trained Transformer For 3D Vision And Text Alignment Zhu Ziyu, Ma Xiaojian, Chen Yixin, Deng Zhidong, Huang Siyuan, Li Qing
- Open-source Large Language Models Are Strong Zero-shot Query Likelihood Models For Document Ranking Zhuang Shengyao, Liu Bing, Koopman Bevan, Zuccon Guido
- Describe, Explain, Plan And Select: Interactive Planning With Large Language Models Enables Open-world Multi-task Agents Zihao Wang, Shaofei Cai, Guanzhou Chen, Anji Liu, Xiaojian Ma, Yitao Liang
- Aligner: One Global Token Is Worth Millions Of Parameters When Aligning Large Language Models Ziheng Zhou, Wu Yingnian, Zhu Song-chun, Terzopoulos Demetri
- Recommender Systems In The Era Of Large Language Models (llms) Zihuai Zhao, Wenqi Fan, Jiatong Li, Yunqing Liu, Xiaowei Mei, Yiqi Wang, Zhen Wen, Fei Wang, Xiangyu Zhao, Jiliang Tang, Qing Li
- A Dataset And Benchmark For Hospital Course Summarization With Adapted Large Language Models Aali Asad, Van Veen Dave, Arefeen Yamin Ishraq, Hom Jason, Bluethgen Christian, Reis Eduardo Pontes, Gatidis Sergios, Clifford Namuun, Daws Joseph, Tehrani Arash S., Kim Jangwon, Chaudhari Akshay S.
- Bcamirs At Semeval-2024 Task 4: Beyond Words: A Multimodal And Multilingual Exploration Of Persuasion In Memes Abaskohi Amirhossein, Dabiriaghdam Amirhossein, Wang Lele, Carenini Giuseppe
- Are You Still On Track!? Catching LLM Task Drift With Activations Abdelnabi Sahar, Fay Aideen, Cherubin Giovanni, Salem Ahmed, Fritz Mario, Paverd Andrew
- Bridging The Bosphorus: Advancing Turkish Large Language Models Through Strategies For Low-resource Language Adaptation And Benchmarking Acikgoz Emre Can, Erdogan Mete, Yuret Deniz
- Fine-grained Contract NER Using Instruction Based Model Adibhatla Hiranmai Sri, Baswani Pavan, Shrivastava Manish
- Step-by-step Unmasking For Parameter-efficient Fine-tuning Of Large Language Models Agarwal Aradhye, Ramesh Suhas K, Sengupta Ayan, Chakraborty Tanmoy
- Many-shot In-context Learning Agarwal Rishabh, Singh Avi, Zhang Lei M., Bohnet Bernd, Rosias Luis, Chan Stephanie, Zhang Biao, Anand Ankesh, Abbas Zaheer, Nova Azade, Co-reyes John D., Chu Eric, Behbahani Feryal, Faust Aleksandra, Larochelle Hugo
- Enabling High-sparsity Foundational Llama Models With Efficient Pretraining And Deployment Agarwalla Abhinav, Gupta Abhay, Marques Alexandre, Pandit Shubhra, Goin Michael, Kurtic Eldar, Leong Kevin, Nguyen Tuan, Salem Mahmoud, Alistarh Dan, Lie Sean, Kurtz Mark
- LM4OPT: Unveiling The Potential Of Large Language Models In Formulating Mathematical Optimization Problems Ahmed Tasnim, Choudhury Salimur
- Linguistic Intelligence In Large Language Models For Telecommunications Ahmed Tasnim, Piovesan Nicola, De Domenico Antonio, Choudhury Salimur
- Tuning Large Multimodal Models For Videos Using Reinforcement Learning From AI Feedback Ahn Daechul, Choi Yura, Yu Youngjae, Kang Dongyeop, Choi Jonghyun
- Sphinx: Sample Efficient Multilingual Instruction Fine-tuning Through N-shot Guided Prompting Ahuja Sanchit, Tanmay Kumar, Chauhan Hardik Hansrajbhai, Patra Barun, Aggarwal Kriti, Del Corro Luciano, Mitra Arindam, Dhamecha Tejas Indulal, Awadallah Ahmed, Choudhary Monojit, Chaudhary Vishrav, Sitaram Sunayana
- Enhancing Pre-trained Generative Language Models With Question Attended Span Extraction On Machine Reading Comprehension Ai Lin, Hui Zheng, Liu Zizhou, Hirschberg Julia
- QASE Enhanced Plms: Improved Control In Text Generation For MRC Ai Lin, Hui Zheng, Liu Zizhou, Hirschberg Julia
- FACTS About Building Retrieval Augmented Generation-based Chatbots Akkiraju Rama, Xu Anbang, Bora Deepak, Yu Tan, An Lu, Seth Vishal, Shukla Aaditya, Gundecha Pritam, Mehta Hridhay, Jha Ashwin, Raj Prithvi, Balasubramanian Abhinav, Maram Murali, Muthusamy Guru, Annepally Shivakesh Reddy, Knowles Sidney, Du Min, Burnett Nick, Javiya Sean, Marannan Ashok, Kumari Mamta, Jha Surbhi, Dereszenski Ethan, Chakraborty Anupam, Ranjan Subhash, Terfai Amina, Surya Anoop, Mercer Tracey, Thanigachalam Vinodh Kumar, Bar Tamar, Krishnan Sanjana, Kilaru Samy, Jaksic Jasmine, Algarici Nave, Liberman Jacob, Conway Joey, Nayyar Sonu, Boitano Justin
- Enhancing Textbook Question Answering Task With Large Language Models And Retrieval Augmented Generation Alawwad Hessa Abdulrahman, Alhothali Areej, Naseem Usman, Alkhathlan Ali, Jamal Amani
- Should We Fine-tune Or RAG? Evaluating Different Techniques To Adapt Llms For Dialogue Alghisi Simone, Rizzoli Massimo, Roccabruna Gabriel, Mousavi Seyed Mahed, Riccardi Giuseppe
- Chatgpt Alternative Solutions: Large Language Models Survey Alipour Hanieh, Pendar Nick, Roy Kohinoor
- Evaluating The Efficacy Of Foundational Models: Advancing Benchmarking Practices To Enhance Fine-tuning Decision-making Amujo Oluyemi Enoch, Yang Shanchieh Jay
- Funaudiollm: Voice Understanding And Generation Foundation Models For Natural Interaction Between Humans And Llms An Keyu, Chen Qian, Deng Chong, Du Zhihao, Gao Changfeng, Gao Zhifu, Gu Yue, He Ting, Hu Hangrui, Hu Kai, Ji Shengpeng, Li Yabin, Li Zerui, Lu Heng, Luo Haoneng, Lv Xiang, Ma Bin, Ma Ziyang, Ni Chongjia, Song Changhe, Shi Jiaqi, Shi Xian, Wang Hao, Wang Wen, Wang Yuxuan, Xiao Zhangyu, Yan Zhijie, Yang Yexin, Zhang Bin, Zhang Qinglin, Zhang Shiliang, Zhao Nan, Zheng Siqi
- Mm-phyrlhf: Reinforcement Learning Framework For Multimodal Physics Question-answering Anand Avinash, Kapuriya Janak, Kirtani Chhavi, Singh Apoorv, Saraf Jay, Lal Naman, Kumar Jatin, Shivam Adarsh Raj, Verma Astha, Shah Rajiv Ratn, Zimmermann Roger
- Investigating The Robustness Of Llms On Math Word Problems Anantheswaran Ujjwala, Gupta Himanshu, Scaria Kevin, Verma Shreyas, Baral Chitta, Mishra Swaroop
- Does Refusal Training In Llms Generalize To The Past Tense? Andriushchenko Maksym, Flammarion Nicolas
- Tensor Train Low-rank Approximation (tt-lora): Democratizing AI With Accelerated Llms Anjum Afia, Eren Maksim E., Boureima Ismael, Alexandrov Boian, Bhattarai Manish
- Scaling Sparse Fine-tuning To Large Language Models Ansell Alan, Vulić Ivan, Sterz Hannah, Korhonen Anna, Ponti Edoardo M.
- A Framework For Fine-tuning Llms Using Heterogeneous Feedback Aponte Ryan, Rossi Ryan A., Guo Shunan, Dernoncourt Franck, Yu Tong, Chen Xiang, Mitra Subrata, Lipka Nedim
- SPAFIT: Stratified Progressive Adaptation Fine-tuning For Pre-trained Large Language Models Arora Samir, Wang Liangliang
- Exploring The Robustness Of Task-oriented Dialogue Systems For Colloquial German Varieties Artemova Ekaterina, Blaschke Verena, Plank Barbara
- Enhancing Holonic Architecture With Natural Language Processing For System Of Systems Ashfaq Muhammad, Sadik Ahmed R., Mikkonen Tommi, Waseem Muhammad, Mäkitalo Niko
- Evaluating Large Language Models On The GMAT: Implications For The Future Of Business Education Ashrafimoghari Vahid, Gürkan Necdet, Suchow Jordan W.
- Elicitron: An LLM Agent-based Simulation Framework For Design Requirements Elicitation Ataei Mohammadmehdi, Cheong Hyunmin, Grandi Daniele, Wang Ye, Morris Nigel, Tessier Alexander
- Can Open-source Llms Compete With Commercial Models? Exploring The Few-shot Performance Of Current GPT Models In Biomedical Tasks Ateia Samy, Kruschwitz Udo
- Walia-llm: Enhancing Amharic-llama By Integrating Task-specific And Generative Datasets Azime Israel Abebe, Tonja Atnafu Lambebo, Belay Tadesse Destaw, Fuge Mitiku Yohannes, Wassie Aman Kassahun, Jada Eyasu Shiferaw, Chanie Yonas, Sewunetie Walelign Tewabe, Yimam Seid Muhie
- Lamda: Large Model Fine-tuning Via Spectrally Decomposed Low-dimensional Adaptation Azizi Seyedarmin, Kundu Souvik, Pedram Massoud
- The Pitfalls Of Next-token Prediction Bachmann Gregor, Nagarajan Vaishnavh
- Efficient Model-agnostic Alignment Via Bayesian Persuasion Bai Fengshuo, Wang Mingzhi, Zhang Zhaowei, Chen Boyuan, Xu Yinda, Wen Ying, Yang Yaodong
- Federated Fine-tuning Of Large Language Models Under Heterogeneous Tasks And Client Resources Bai Jiamu, Chen Daoyuan, Qian Bingchen, Yao Liuyi, Li Yaliang
- COIG-CQIA: Quality Is All You Need For Chinese Instruction Fine-tuning Bai Yuelin, Du Xinrun, Liang Yiming, Jin Yonggang, Liu Ziqiang, Zhou Junting, Zheng Tianyu, Zhang Xincheng, Ma Nuo, Wang Zekun, Yuan Ruibin, Wu Haihong, Lin Hongquan, Huang Wenhao, Zhang Jiajun, Chen Wenhu, Lin Chenghua, Fu Jie, Yang Min, Ni Shiwen, Zhang Ge
- Longalign: A Recipe For Long Context Alignment Of Large Language Models Bai Yushi, Lv Xin, Zhang Jiajie, He Yuze, Qi Ji, Hou Lei, Tang Jie, Dong Yuxiao, Li Juanzi
- Longwriter: Unleashing 10,000+ Word Generation From Long Context Llms Bai Yushi, Zhang Jiajie, Lv Xin, Zheng Linzhi, Zhu Siqi, Hou Lei, Dong Yuxiao, Tang Jie, Li Juanzi
- Downstream Bias Mitigation Is All You Need Baksi Arkadeep, Singh Rahul, Joshi Tarun
- RAG Vs Fine-tuning: Pipelines, Tradeoffs, And A Case Study On Agriculture Balaguer Angels, Benara Vinamra, Cunha Renato Luiz De Freitas, Filho Roberto De M. Estevão, Hendry Todd, Holstein Daniel, Marsman Jennifer, Mecklenburg Nick, Malvar Sara, Nunes Leonardo O., Padilha Rafael, Sharp Morris, Silva Bruno, Sharma Swati, Aski Vijay, Chandra Ranveer
- CYGENT: A Cybersecurity Conversational Agent With Log Summarization Powered By GPT-3 Balasubramanian Prasasthy, Seby Justin, Kostakos Panos
- LAMP: A Language Model On The Map Balsebre Pasquale, Huang Weiming, Cong Gao
- Llms With Chain-of-thought Are Non-causal Reasoners Bao Guangsheng, Zhang Hongbo, Yang Linyi, Wang Cunxiang, Zhang Yue
- Fine-tuning Or Fine-failing? Debunking Performance Myths In Large Language Models Barnett Scott, Brannelly Zac, Kurniawan Stefanus, Wong Sheng
- Slicerchat: Building A Local Chatbot For 3D Slicer Barr Colton
- From 'showgirls' To 'performers': Fine-tuning With Gender-inclusive Language For Bias Reduction In Llms Bartl Marion, Leavy Susan
- Medaide: Leveraging Large Language Models For On-premise Medical Assistance On Edge Devices Basit Abdul, Hussain Khizar, Hanif Muhammad Abdullah, Shafique Muhammad
- A Comparative Study Of DSL Code Generation: Fine-tuning Vs. Optimized Retrieval Augmentation Bassamzadeh Nastaran, Methani Chhaya
- Think Big, Generate Quick: Llm-to-slm For Fast Autoregressive Decoding Bergner Benjamin, Skliar Andrii, Royer Amelie, Blankevoort Tijmen, Asano Yuki, Bejnordi Babak Ehteshami
- Lotr: Low Tensor Rank Weight Adaptation Bershatsky Daniel, Cherniuk Daria, Daulbaev Talgat, Mikhalev Aleksandr, Oseledets Ivan
- A Systematic Analysis Of Large Language Models As Soft Reasoners: The Case Of Syllogistic Inferences Bertolazzi Leonardo, Gatt Albert, Bernardi Raffaella
- Enhancing Event Reasoning In Large Language Models Through Instruction Fine-tuning With Semantic Causal Graphs Bethany Mazal, Bethany Emet, Wherry Brandon, Chiang Cho-yu, Vishwamitra Nishant, Rios Anthony, Najafirad Peyman
- Taking The Next Step With Generative Artificial Intelligence: The Transformative Role Of Multimodal Large Language Models In Science Education Bewersdorff Arne, Hartmann Christian, Hornberger Marie, Seßler Kathrin, Bannert Maria, Kasneci Enkelejda, Kasneci Gjergji, Zhai Xiaoming, Nerdel Claudia
- Soft Prompting For Unlearning In Large Language Models Bhaila Karuna, Van Minh-hao, Wu Xintao
- Efficient Reinforcement Learning Via Large Language Model-based Search Bhambri Siddhant, Bhattacharjee Amrita, Liu Huan, Kambhampati Subbarao
- Language Models Are Homer Simpson! Safety Re-alignment Of Fine-tuned Language Models Through Task Arithmetic Bhardwaj Rishabh, Anh Do Duc, Poria Soujanya
- An Approach To Build Zero-shot Slot-filling System For Industry-grade Conversational Assistants Bhargav G P Shrivatsa, Neelam Sumit, Sharma Udit, Ikbal Shajith, Sreedhar Dheeraj, Karanam Hima, Joshi Sachindra, Dhoolia Pankaj, Garg Dinesh, Croutwater Kyle, Qi Haode, Wayne Eric, Murdock J William
- Fintral: A Family Of GPT-4 Level Multimodal Financial Large Language Models Bhatia Gagan, Nagoudi El Moatez Billah, Cavusoglu Hasan, Abdul-mageed Muhammad
- Zero-shot Llm-guided Counterfactual Generation For Text Bhattacharjee Amrita, Moraffah Raha, Garland Joshua, Liu Huan
- Enhancing Code Translation In Language Models With Few-shot Learning Via Retrieval-augmented Generation Bhattarai Manish, Santos Javier E., Jones Shawn, Biswas Ayan, Alexandrov Boian, O'malley Daniel
- Speculative Streaming: Fast LLM Inference Without Auxiliary Models Bhendawade Nikhil, Belousova Irina, Fu Qichen, Mason Henry, Rastegari Mohammad, Najibi Mahyar
- Supervisory Prompt Training Billa Jean Ghislain, Oh Min, Du Liang
- Exploring And Benchmarking The Planning Capabilities Of Large Language Models Bohnet Bernd, Nova Azade, Parisi Aaron T, Swersky Kevin, Goshvadi Katayoon, Dai Hanjun, Schuurmans Dale, Fiedel Noah, Sedghi Hanie
- Workarena++: Towards Compositional Planning And Reasoning-based Common Knowledge Work Tasks Boisvert Léo, Thakkar Megh, Gasse Maxime, Caccia Massimo, De Chezelles Thibault Le Sellier, Cappart Quentin, Chapados Nicolas, Lacoste Alexandre, Drouin Alexandre
- Low-rank Quantization-aware Training For Llms Bondarenko Yelysei, Del Chiaro Riccardo, Nagel Markus
- Improving Sequential Recommendations With Llms Boz Artun, Zorgdrager Wouter, Kotti Zoe, Harte Jesse, Louridas Panos, Jannach Dietmar, Fragkoulis Marios
- Embedding Large Language Models Into Extended Reality: Opportunities And Challenges For Inclusion, Engagement, And Privacy Bozkir Efe, Özdel Süleyman, Lau Ka Hei Carrie, Wang Mengdi, Gao Hong, Kasneci Enkelejda
- Fast And Effective Weight Update For Pruned Large Language Models Boža Vladimír
- Enhancing Trust In Llms: Algorithms For Comparing And Interpreting Llms Brown Nik Bear
- Fine-tuned 'small' Llms (still) Significantly Outperform Zero-shot Generative AI Models In Text Classification Bucher Martin Juan José, Martini Marco
- Roguegpt: Dis-ethical Tuning Transforms Chatgpt4 Into A Rogue AI In 158 Words Buscemi Alessio, Proverbio Daniele
- ARES: Alternating Reinforcement Learning And Supervised Fine-tuning For Enhanced Multi-modal Chain-of-thought Reasoning Through Diverse AI Feedback Byun Ju-seung, Chun Jiyun, Kil Jihyung, Perrault Andrew
- Olora: Orthonormal Low-rank Adaptation Of Large Language Models Büyükakyüz Kerim
- In-context Learning And Fine-tuning GPT For Argument Mining Cabessa Jérémie, Hernault Hugo, Mushtaq Umer
- Cendol: Open Instruction-tuned Generative Large Language Models For Indonesian Languages Cahyawijaya Samuel, Lovenia Holy, Koto Fajri, Putri Rifki Afina, Dave Emmanuel, Lee Jhonson, Shadieq Nuur, Cenggoro Wawan, Akbar Salsabil Maulana, Mahendra Muhammad Ihza, Putri Dea Annisayanti, Wilie Bryan, Winata Genta Indra, Aji Alham Fikri, Purwarianti Ayu, Fung Pascale
- CERET: Cost-effective Extrinsic Refinement For Text Generation Cai Jason, Su Hang, Sunkara Monica, Shalyminov Igor, Mansour Saab
- Flextron: Many-in-one Flexible Large Language Model Cai Ruisi, Muralidharan Saurav, Heinrich Greg, Yin Hongxu, Wang Zhangyang, Kautz Jan, Molchanov Pavlo
- Lococo: Dropping In Convolutions For Long Context Compression Cai Ruisi, Tian Yuandong, Wang Zhangyang, Chen Beidi
- Medusa: Simple LLM Inference Acceleration Framework With Multiple Decoding Heads Cai Tianle, Li Yuhong, Geng Zhengyang, Peng Hongwu, Lee Jason D., Chen Deming, Dao Tri
- Internlm2 Technical Report Cai Zheng, Cao Maosong, Chen Haojiong, Chen Kai, Chen Keyu, Chen Xin, Chen Xun, Chen Zehui, Chen Zhi, Chu Pei, Dong Xiaoyi, Duan Haodong, Fan Qi, Fei Zhaoye, Gao Yang, Ge Jiaye, Gu Chenya, Gu Yuzhe, Gui Tao, Guo Aijia, Guo Qipeng, He Conghui, Hu Yingfan, Huang Ting, Jiang Tao, Jiao Penglong, Jin Zhenjiang, Lei Zhikai, Li Jiaxing, Li Jingwen, Li Linyang, Li Shuaibin, Li Wei, Li Yining, Liu Hongwei, Liu Jiangning, Hong Jiawei, Liu Kaiwen, Liu Kuikun, Liu Xiaoran, Lv Chengqi, Lv Haijun, Lv Kai, Ma Li, Ma Runyuan, Ma Zerun, Ning Wenchang, Ouyang Linke, Qiu Jiantao, Qu Yuan, Shang Fukai, Shao Yunfan, Song Demin, Song Zifan, Sui Zhihao, Sun Peng, Sun Yu, Tang Huanze, Wang Bin, Wang Guoteng, Wang Jiaqi, Wang Jiayu, Wang Rui, Wang Yudong, Wang Ziyi, Wei Xingjian, Weng Qizhen, Wu Fan, Xiong Yingtong, Xu Chao, Xu Ruiliang, Yan Hang, Yan Yirong, Yang Xiaogui, Ye Haochen, Ying Huaiyuan, Yu Jia, Yu Jing, Zang Yuhang, Zhang Chuyu, Zhang Li, Zhang Pan, Zhang Peng, Zhang Ruijie, Zhang Shuo, Zhang Songyang, Zhang Wenjian, Zhang Wenwei, Zhang Xingcheng, Zhang Xinyue, Zhao Hui, Zhao Qian, Zhao Xiaomeng, Zhou Fengzhe, Zhou Zaida, Zhuo Jingming, Zou Yicheng, Qiu Xipeng, Qiao Yu, Lin Dahua
- Towards Logically Consistent Language Models Via Probabilistic Reasoning Calanzone Diego, Teso Stefano, Vergari Antonio
- Tonggu: Mastering Classical Chinese Understanding With Knowledge-grounded Large Language Models Cao Jiahuan, Peng Dezhi, Zhang Peirong, Shi Yongxin, Liu Yang, Ding Kai, Jin Lianwen
- Personalized Steering Of Large Language Models: Versatile Steering Vectors Through Bi-directional Preference Optimization Cao Yuanpu, Zhang Tianrong, Cao Bochuan, Yin Ziyi, Lin Lu, Ma Fenglong, Chen Jinghui
- Aligning Large Language Models With Recommendation Knowledge Cao Yuwei, Mehta Nikhil, Yi Xinyang, Keshavan Raghunandan, Heldt Lukasz, Hong Lichan, Chi Ed H., Sathiamoorthy Maheswaran
- A Review Of Multi-modal Large Language And Vision Models Carolan Kilian, Fennelly Laura, Smeaton Alan F.
- Can Formal Argumentative Reasoning Enhance Llms Performances? Castagna Federico, Sassoon Isabel, Parsons Simon
- Suppressing Pink Elephants With Direct Principle Feedback Castricato Louis, Lile Nathan, Anand Suraj, Schoelkopf Hailey, Verma Siddharth, Biderman Stella
- Smileyllama: Modifying Large Language Models For Directed Chemical Space Exploration Cavanagh Joseph M., Sun Kunyang, Gritsevskiy Andrew, Bagni Dorian, Bannister Thomas D., Head-gordon Teresa
- Detoxbench: Benchmarking Large Language Models For Multitask Fraud & Abuse Detection Chakraborty Joymallya, Xia Wei, Majumder Anirban, Ma Dan, Chaabene Walid, Janvekar Naveed
- Cross-modal Safety Alignment: Is Textual Unlearning All You Need? Chakraborty Trishna, Shayegani Erfan, Cai Zikui, Abu-ghazaleh Nael, Asif M. Salman, Dong Yue, Roy-chowdhury Amit K., Song Chengyu
- Unlocking The Wisdom Of Large Language Models: An Introduction To The Path To Artificial General Intelligence Chang Edward Y.
- Speechprompt: Prompting Speech Language Models For Speech Processing Tasks Chang Kai-wei, Wu Haibin, Wang Yu-kai, Wu Yuan-kuei, Shen Hua, Tseng Wei-cheng, Kang Iu-thing, Li Shang-wen, Lee Hung-yi
- Goldfish: Monolingual Language Models For 350 Languages Chang Tyler A., Arnett Catherine, Tu Zhuowen, Bergen Benjamin K.
- Bias-aware Low-rank Adaptation: Mitigating Catastrophic Inheritance Of Large Language Models Chang Yupeng, Chang Yi, Wu Yuan
- When Large Language Models Meet Evolutionary Algorithms Chao Wang, Zhao Jiaxuan, Jiao Licheng, Li Lingling, Liu Fang, Yang Shuyuan
- Language Models Can Exploit Cross-task In-context Learning For Data-scarce Novel Tasks Chatterjee Anwoy, Tanwar Eshaan, Dutta Subhabrata, Chakraborty Tanmoy
- Phantom: General Trigger Attacks On Retrieval Augmented Language Generation Chaudhari Harsh, Severi Giorgio, Abascal John, Jagielski Matthew, Choquette-choo Christopher A., Nasr Milad, Nita-rotaru Cristina, Oprea Alina
- The Dark Side Of Human Feedback: Poisoning Large Language Models Via User Inputs Chen Bocheng, Guo Hanqing, Wang Guangjing, Wang Yuanda, Yan Qiben
- Self-cognition In Large Language Models: An Exploratory Study Chen Dongping, Shi Jiawen, Wan Yao, Zhou Pan, Gong Neil Zhenqiang, Sun Lichao
- Efficiency In Focus: Layernorm As A Catalyst For Fine-tuning Medical Visual Language Pre-trained Models Chen Jiawei, Yang Dingkang, Jiang Yue, Li Mingcheng, Wei Jinjie, Hou Xiaolu, Zhang Lihua
- Automated Data Curation For Robust Language Model Fine-tuning Chen Jiuhai, Mueller Jonas
- Putting People In Llms' Shoes: Generating Better Answers Via Question Rewriter Chen Junhao, Wang Bowen, Jiang Zhouqiang, Nakashima Yuta
- Instructioncp: A Fast Approach To Transfer Large Language Models Into Target Language Chen Kuang-ming, Lee Hung-yi
- Bailong: Bilingual Transfer Learning Based On Qlora And Zip-tie Embedding Chen Lung-chuan, Li Zong-ru
- Quantifying And Mitigating Unimodal Biases In Multimodal Large Language Models: A Causal Perspective Chen Meiqi, Cao Yixin, Zhang Yan, Lu Chaochao
- Graphwiz: An Instruction-following Language Model For Graph Problems Chen Nuo, Li Yuhan, Tang Jianheng, Li Jia
- Llava-mole: Sparse Mixture Of Lora Experts For Mitigating Data Conflicts In Instruction Finetuning Mllms Chen Shaoxiang, Jie Zequn, Ma Lin
- Advancing Tool-augmented Large Language Models: Integrating Insights From Errors In Inference Trees Chen Sijia, Wang Yibo, Wu Yi-feng, Chen Qing-guo, Xu Zhao, Luo Weihua, Zhang Kaifu, Zhang Lijun
- Struq: Defending Against Prompt Injection With Structured Queries Chen Sizhe, Piet Julien, Sitawarin Chawin, Wagner David
- From Yes-men To Truth-tellers: Addressing Sycophancy In Large Language Models With Pinpoint Tuning Chen Wei, Huang Zhen, Xie Liang, Lin Binbin, Li Houqiang, Lu Le, Tian Xinmei, Cai Deng, Zhang Yonggang, Wan Wenxiao, Shen Xu, Ye Jieping
- Octopus: On-device Language Model For Function Calling Of Software Apis Chen Wei, Li Zhiyuan, Ma Mingyuan
- Llast: Improved End-to-end Speech Translation System Leveraged By Large Language Models Chen Xi, Zhang Songyang, Bai Qibing, Chen Kai, Nakamura Satoshi
- Cause-aware Empathetic Response Generation Via Chain-of-thought Fine-tuning Chen Xinhao, Yang Chong, Lan Man, Cai Li, Chen Yang, Hu Tu, Zhuang Xinlin, Zhou Aimin
- Can We Rely On LLM Agents To Draft Long-horizon Plans? Let's Take Travelplanner As An Example Chen Yanan, Pesaranghader Ali, Sadhu Tanmana, Yi Dong Hoon
- Extroversion Or Introversion? Controlling The Personality Of Your Large Language Models Chen Yanquan, Wu Zhen, Guo Junjie, Huang Shujian, Dai Xinyu
- See What Llms Cannot Answer: A Self-challenge Framework For Uncovering LLM Weaknesses Chen Yulong, Liu Yang, Yan Jianhao, Bai Xuefeng, Zhong Ming, Yang Yinghao, Yang Ziyi, Zhu Chenguang, Zhang Yue
- Mofo: Momentum-filtered Optimizer For Mitigating Forgetting In LLM Fine-tuning Chen Yupeng, Wang Senmiao, Lin Zhihang, Qin Zeyu, Zhang Yushun, Ding Tian, Sun Ruoyu
- Autoprm: Automating Procedural Supervision For Multi-step Reasoning Via Controllable Question Decomposition Chen Zhaorun, Zhao Zhuokai, Zhu Zhihong, Zhang Ruiqi, Li Xiang, Raj Bhiksha, Yao Huaxiu
- Improving Retrieval Augmented Open-domain Question-answering With Vectorized Contexts Chen Zhuo, Wang Xinyu, Jiang Yong, Xie Pengjun, Huang Fei, Tu Kewei
- An Empirical Study Of Data Ability Boundary In Llms' Math Reasoning Chen Zui, Chen Yezeng, Han Jiaqi, Huang Zhijie, Qi Ji, Zhou Yi
- "In Dialogues We Learn": Towards Personalized Dialogue Without Pre-defined Profiles Through In-dialogue Learning Cheng Chuanqi, Tu Quan, Wu Wei, Shang Shuo, Mao Cunli, Yu Zhengtao, Yan Rui
- Small Agent Can Also Rock! Empowering Small Language Models As Hallucination Detector Cheng Xiaoxue, Li Junyi, Zhao Wayne Xin, Zhang Hongzhi, Zhang Fuzheng, Zhang Di, Gai Kun, Wen Ji-rong
- ANOLE: An Open, Autoregressive, Native Large Multimodal Models For Interleaved Image-text Generation Chern Ethan, Su Jiadi, Ma Yan, Liu Pengfei
- Human-centered Llm-agent User Interface: A Position Paper Chin Daniel, Wang Yuxuan, Xia Gus
- An Empirical Study Of In-context Learning In Llms For Machine Translation Chitale Pranjal A., Gala Jay, Dabre Raj
- Exploring Quantization For Efficient Pre-training Of Transformer Language Models Chitsaz Kamran, Fournier Quentin, Mordido Gonçalo, Chandar Sarath
- Optimizing Language Augmentation For Multilingual Large Language Models: A Case Study On Korean Choi Changsu, Jeong Yongbin, Park Seoyoon, Won Inho, Lim Hyeonseok, Kim Sangmin, Kang Yejee, Yoon Chanhyuk, Park Jaewan, Lee Yiseul, Lee Hyejin, Hahm Younggyun, Kim Hansaem, Lim Kyungtae
- Lota-bench: Benchmarking Language-oriented Task Planners For Embodied Agents Choi Jae-woo, Yoon Youngwoo, Ong Hyobin, Kim Jaehong, Jang Minsu
- Cross-architecture Transfer Learning For Linear-cost Inference Transformers Choi Sehyun
- Gemmar: Enhancing Llms Through Arabic Instruction-tuning Chouikhi Hasna, Aloui Manel, Hammou Cyrine Ben, Chaabane Ghaith, Kchaou Haithem, Dhaouadi Chehir
- Crosslingual Capabilities And Knowledge Barriers In Multilingual Large Language Models Chua Lynn, Ghazi Badih, Huang Yangsibo, Kamath Pritish, Kumar Ravi, Manurangsi Pasin, Sinha Amer, Xie Chulin, Zhang Chiyuan
- PEFT-U: Parameter-efficient Fine-tuning For User Personalization Clarke Christopher, Heng Yuzhao, Tang Lingjia, Mars Jason
- Cogbench: A Large Language Model Walks Into A Psychology Lab Coda-forno Julian, Binz Marcel, Wang Jane X., Schulz Eric
- Towards A Human-in-the-loop LLM Approach To Collaborative Discourse Analysis Cohn Clayton, Snyder Caitlin, Montenegro Justin, Biswas Gautam
- Saullm-54b & Saullm-141b: Scaling Up Domain Adaptation For The Legal Domain Colombo Pierre, Pires Telmo, Boudiaf Malik, Melo Rui, Culver Dominic, Morgado Sofia, Malaboeuf Etienne, Hautreux Gabriel, Charpentier Johanne, Desa Michael
- Ragsys: Item-cold-start Recommender As RAG System Contal Emile, Mcgoldrick Garrin
- LLASP: Fine-tuning Large Language Models For Answer Set Programming Coppolillo Erica, Calimeri Francesco, Manco Giuseppe, Perri Simona, Ricca Francesco
- Rocode: A Dataset For Measuring Code Intelligence From Problem Definitions In Romanian Cosma Adrian, Iordache Bogdan, Rosso Paolo
- Llambert: Large-scale Low-cost Data Annotation In NLP Csanády Bálint, Muzsai Lajos, Vedres Péter, Nádasdy Zoltán, Lukács András
- Rethinking LLM Language Adaptation: A Case Study On Chinese Mixtral Cui Yiming, Yao Xin
- Meta Large Language Model Compiler: Foundation Models Of Compiler Optimization Cummins Chris, Seeker Volker, Grubisic Dejan, Roziere Baptiste, Gehring Jonas, Synnaeve Gabriel, Leather Hugh
- Synthetic Dataset Creation And Fine-tuning Of Transformer Models For Question Answering In Serbian Cvetanović Aleksa, Tadić Predrag
- Getting The Most Out Of Your Tokenizer For Pre-training And Domain Adaptation Dagan Gautier, Synnaeve Gabriel, Rozière Baptiste
- CORM: Cache Optimization With Recent Message For Large Language Model Inference Dai Jincheng, Huang Zhuowei, Jiang Haiyun, Chen Chen, Cai Deng, Bi Wei, Shi Shuming
- ACT-MNMT Auto-constriction Turning For Multilingual Neural Machine Translation Dai Shaojie, Liu Xin, Luo Ping, Yu Yue
- Human-interpretable Adversarial Prompt Attack On Large Language Models With Situational Context Das Nilanjana, Raff Edward, Gaur Manas
- Low-rank Finetuning For Llms: A Fairness Perspective Das Saswat, Romanelli Marco, Tran Cuong, Reza Zarreen, Kailkhura Bhavya, Fioretto Ferdinando
- Towards Transparency: Exploring LLM Trainings Datasets Through Visual Topic Modeling And Semantic Frame De Dampierre Charles, Mogoutov Andrei, Baumard Nicolas
- Evaluating Large Language Models For Automatic Analysis Of Teacher Simulations De-fitero-dominguez David, Albaladejo-gonzález Mariano, Garcia-cabot Antonio, Garcia-lopez Eva, Moreno-cediel Antonio, Barno Erin, Reich Justin
- Deepseek-v2: A Strong, Economical, And Efficient Mixture-of-experts Language Model Deepseek-ai, Aixin Liu, Bei Feng, Bin Wang, Bingxuan Wang, Bo Liu, Chenggang Zhao, Chengqi Dengr, Chong Ruan, Damai Dai, Daya Guo, Dejian Yang, Deli Chen, Dongjie Ji, Erhang Li, Fangyun Lin, Fuli Luo, Guangbo Hao, Guanting Chen, Guowei Li, H. Zhang, Hanwei Xu, Hao Yang, Haowei Zhang, Honghui Ding, Huajian Xin, Huazuo Gao, Hui Li, Hui Qu, J. L. Cai, Jian Liang, Jianzhong Guo, Jiaqi Ni, Jiashi Li, Jin Chen, Jingyang Yuan, Junjie Qiu, Junxiao Song, Kai Dong, Kaige Gao, Kang Guan, Lean Wang, Lecong Zhang, Lei Xu, Leyi Xia, Liang Zhao, Liyue Zhang, Meng Li, Miaojun Wang, Mingchuan Zhang, Minghua Zhang, Minghui Tang, Mingming Li, Ning Tian, Panpan Huang, Peiyi Wang, Peng Zhang, Qihao Zhu, Qinyu Chen, Qiushi Du, R. J. Chen, R. L. Jin, Ruiqi Ge, Ruizhe Pan, Runxin Xu, Ruyi Chen, S. S. Li, Shanghao Lu, Shangyan Zhou, Shanhuang Chen, Shaoqing Wu, Shengfeng Ye, Shirong Ma, Shiyu Wang, Shuang Zhou, Shuiping Yu, Shunfeng Zhou, Size Zheng, T. Wang, Tian Pei, Tian Yuan, Tianyu Sun, W. L. Xiao, Wangding Zeng, Wei An, Wen Liu, Wenfeng Liang, Wenjun Gao, Wentao Zhang, X. Q. Li, Xiangyue Jin, Xianzu Wang, Xiao Bi, Xiaodong Liu, Xiaohan Wang, Xiaojin Shen, Xiaokang Chen, Xiaosha Chen, Xiaotao Nie, Xiaowen Sun, Xiaoxiang Wang, Xin Liu, Xin Xie, Xingkai Yu, Xinnan Song, Xinyi Zhou, Xinyu Yang, Xuan Lu, Xuecheng Su, Y. Wu, Y. K. Li, Y. X. Wei, Y. X. Zhu, Yanhong Xu, Yanping Huang, Yao Li, Yao Zhao, Yaofeng Sun, Yaohui Li, Yaohui Wang, Yi Zheng, Yichao Zhang, Yiliang Xiong, Yilong Zhao, Ying He, Ying Tang, Yishi Piao, Yixin Dong, Yixuan Tan, Yiyuan Liu, Yongji Wang, Yongqiang Guo, Yuchen Zhu, Yuduan Wang, Yuheng Zou, Yukun Zha, Yunxian Ma, Yuting Yan, Yuxiang You, Yuxuan Liu, Z. Z. Ren, Zehui Ren, Zhangli Sha, Zhe Fu, Zhen Huang, Zhen Zhang, Zhenda Xie, Zhewen Hao, Zhihong Shao, Zhiniu Wen, Zhipeng Xu, Zhongyu Zhang, Zhuoshu Li, Zihan Wang, Zihui Gu, Zilin Li, Ziwei Xie
- Cram: Credibility-aware Attention Modification In Llms For Combating Misinformation In RAG Deng Boyi, Wang Wenjie, Zhu Fengbin, Wang Qifan, Feng Fuli
- MIMIR: A Streamlined Platform For Personalized Agent Tuning In Domain Expertise Deng Chunyuan, Tang Xiangru, Zhao Yilun, Wang Hanming, Wang Haoran, Zhou Wangchunshu, Cohan Arman, Gerstein Mark
- Pandora: Jailbreak Gpts By Retrieval Augmented Generation Poisoning Deng Gelei, Liu Yi, Wang Kailong, Li Yuekang, Zhang Tianwei, Liu Yang
- Gotcha! Don't Trick Me With Unanswerable Questions! Self-aligning Large Language Models For Responding To Unknown Questions Deng Yang, Zhao Yong, Li Moxin, Ng See-kiong, Chua Tat-seng
- LMGT: Optimizing Exploration-exploitation Balance In Reinforcement Learning Through Language Model Guided Trade-offs Deng Yongxin, Qiu Xihe, Tan Xiaoyu, Chu Wei, Xu Yinghui
- Text-tuple-table: Towards Information Integration In Text-to-table Generation Via Global Tuple Extraction Deng Zheye, Chan Chunkit, Wang Weiqi, Sun Yuxi, Fan Wei, Zheng Tianshi, Yim Yauwai, Song Yangqiu
- Sycophancy To Subterfuge: Investigating Reward-tampering In Large Language Models Denison Carson, Macdiarmid Monte, Barez Fazl, Duvenaud David, Kravec Shauna, Marks Samuel, Schiefer Nicholas, Soklaski Ryan, Tamkin Alex, Kaplan Jared, Shlegeris Buck, Bowman Samuel R., Perez Ethan, Hubinger Evan
- Garak: A Framework For Security Probing Large Language Models Derczynski Leon, Galinkin Erick, Martin Jeffrey, Majumdar Subho, Inie Nanna
- Glam: Fine-tuning Large Language Models For Domain Knowledge Graph Alignment Via Neighborhood Partitioning And Generative Subgraph Encoding Dernbach Stefan, Agarwal Khushbu, Zuniga Alejandro, Henry Michael, Choudhury Sutanay
- A Simple And Effective \(L_2\) Norm-based Strategy For KV Cache Compression Devoto Alessio, Zhao Yu, Scardapane Simone, Minervini Pasquale
- Facts-and-feelings: Capturing Both Objectivity And Subjectivity In Table-to-text Generation Dey Tathagata, Bhattacharyya Pushpak
- Learning How To Ask: Cycle-consistency Refines Prompts In Multimodal Foundation Models Diesendruck Maurice, Lin Jianzhe, Imani Shima, Mahalingam Gayathri, Xu Mingyang, Zhao Jie
- Data Augmentation Using Large Language Models: Data Perspectives, Learning Paradigms And Challenges Ding Bosheng, Qin Chengwei, Zhao Ruochen, Luo Tianze, Li Xinze, Chen Guizhen, Xia Wenhan, Hu Junjie, Luu Anh Tuan, Joty Shafiq
- Entgpt: Linking Generative Large Language Models With Knowledge Bases Ding Yifan, Poudel Amrit, Zeng Qingkai, Weninger Tim, Veeramani Balaji, Bhattacharya Sanmitra
- Longrope: Extending LLM Context Window Beyond 2 Million Tokens Ding Yiran, Zhang Li Lyna, Zhang Chengruidong, Xu Yuanyuan, Shang Ning, Xu Jiahang, Yang Fan, Yang Mao
- Self-play With Execution Feedback: Improving Instruction-following Capabilities Of Large Language Models Dong Guanting, Lu Keming, Li Chengpeng, Xia Tingyu, Yu Bowen, Zhou Chang, Zhou Jingren
- Understand What LLM Needs: Dual Preference Alignment For Retrieval-augmented Generation Dong Guanting, Zhu Yutao, Zhang Chenghao, Wang Zechen, Dou Zhicheng, Wen Ji-rong
- Baichuanseed: Sharing The Potential Of Extensive Data Collection And Deduplication By Introducing A Competitive Large Language Model Baseline Dong Guosheng, Pan Da, Sun Yiding, Zhang Shusen, Liang Zheng, Wu Xin, Shen Yanjun, Yang Fan, Sun Haoze, Li Tianpeng, Lin Mingan, Xu Jianhua, Zhang Yufan, Nie Xiaonan, Su Lei, Wang Bingning, Zhang Wentao, Mao Jiaxin, Zhou Zenan, Chen Weipeng
- RLHF Workflow: From Reward Modeling To Online RLHF Dong Hanze, Xiong Wei, Pang Bo, Wang Haoxiang, Zhao Han, Zhou Yingbo, Jiang Nan, Sahoo Doyen, Xiong Caiming, Zhang Tong
- Internlm-xcomposer2-4khd: A Pioneering Large Vision-language Model Handling Resolutions From 336 Pixels To 4K HD Dong Xiaoyi, Zhang Pan, Zang Yuhang, Cao Yuhang, Wang Bin, Ouyang Linke, Zhang Songyang, Duan Haodong, Zhang Wenwei, Li Yining, Yan Hang, Gao Yang, Chen Zhe, Zhang Xinyue, Li Wei, Li Jingwen, Wang Wenhai, Chen Kai, He Conghui, Zhang Xingcheng, Dai Jifeng, Qiao Yu, Lin Dahua, Wang Jiaqi
- Unmemorization In Large Language Models Via Self-distillation And Deliberate Imagination Dong Yijiang River, Lin Hongzhou, Belkin Mikhail, Huerta Ramon, Vulić Ivan
- Prompt Your Brain: Scaffold Prompt Tuning For Efficient Adaptation Of Fmri Pre-trained Model Dong Zijian, Wu Yilei, Chen Zijiao, Zhang Yichi, Jin Yueming, Zhou Juan Helen
- Biomedical Large Languages Models Seem Not To Be Superior To Generalist Models On Unseen Medical Data Dorfner Felix J., Dada Amin, Busch Felix, Makowski Marcus R., Han Tianyu, Truhn Daniel, Kleesiek Jens, Sushil Madhumita, Lammert Jacqueline, Adams Lisa C., Bressem Keno K.
- Stepcoder: Improve Code Generation With Reinforcement Learning From Compiler Feedback Dou Shihan, Liu Yan, Jia Haoxiang, Xiong Limao, Zhou Enyu, Shen Wei, Shan Junjie, Huang Caishuang, Wang Xiao, Fan Xiaoran, Xi Zhiheng, Zhou Yuhao, Ji Tao, Zheng Rui, Zhang Qi, Huang Xuanjing, Gui Tao
- Workarena: How Capable Are Web Agents At Solving Common Knowledge Work Tasks? Drouin Alexandre, Gasse Maxime, Caccia Massimo, Laradji Issam H., Del Verme Manuel, Marty Tom, Boisvert Léo, Thakkar Megh, Cappart Quentin, Vazquez David, Chapados Nicolas, Lacoste Alexandre
- Investigating Low-cost LLM Annotation For Spoken Dialogue Understanding Datasets Druart Lucas, Vielzeuf Valentin, Estève Yannick
- In-context Learning With Reinforcement Learning For Incomplete Utterance Rewriting Du Haowei, Zhao Dongyan
- Chinese Tiny LLM: Pretraining A Chinese-centric Large Language Model Du Xinrun, Yu Zhouliang, Gao Songyang, Pan Ding, Cheng Yuyang, Ma Ziyang, Yuan Ruibin, Qu Xingwei, Liu Jiaheng, Zheng Tianyu, Luo Xinchen, Zhou Guorui, Chen Wenhu, Zhang Ge
- Perltqa: A Personal Long-term Memory Dataset For Memory Classification, Retrieval, And Synthesis In Question Answering Du Yiming, Wang Hongru, Zhao Zhengyi, Liang Bin, Wang Baojun, Zhong Wanjun, Wang Zezhong, Wong Kam-fai
- CLEFT: Language-image Contrastive Learning With Efficient Large Language Model And Prompt Fine-tuning Du Yuexi, Chang Brian, Dvornek Nicha C.
- AXOLOTL: Fairness Through Assisted Self-debiasing Of Large Language Model Outputs Ebrahimi Sana, Chen Kaiwen, Asudeh Abolfazl, Das Gautam, Koudas Nick
- CROME: Cross-modal Adapters For Efficient Multimodal LLM Ebrahimi Sayna, Arik Sercan O., Nama Tejas, Pfister Tomas
- Relation Extraction With Fine-tuned Large Language Models In Retrieval Augmented Generation Frameworks Efeoglu Sefika, Paschke Adrian
- ARAGOG: Advanced RAG Output Grading Eibich Matouš, Nagpal Shivay, Fred-ojala Alexander
- Enabling Natural Zero-shot Prompting On Encoder Models Via Statement-tuning Elshabrawy Ahmed, Huang Yongxin, Gurevych Iryna, Aji Alham Fikri
- Navigating Uncertainty: Optimizing API Dependency For Hallucination Reduction In Closed-book Question Answering Erbacher Pierre, Falissar Louis, Guigue Vincent, Soulier Laure
- Empirical Studies Of Parameter Efficient Methods For Large Language Models Of Code And Knowledge Transfer To R Esmaeili Amirreza, Saberi Iman, Fard Fatemeh H.
- Coda: Constrained Generation Based Data Augmentation For Low-resource NLP Evuru Chandra Kiran Reddy, Ghosh Sreyan, Kumar Sonal, S Ramaneswaran, Tyagi Utkarsh, Manocha Dinesh
- Llama-omni: Seamless Speech Interaction With Large Language Models Fang Qingkai, Guo Shoutao, Zhou Yan, Ma Zhengrui, Zhang Shaolei, Feng Yang
- Curlora: Stable LLM Continual Fine-tuning And Catastrophic Forgetting Mitigation Fawi Muhammad
- Evaluating Human Alignment And Model Faithfulness Of LLM Rationale Fayyaz Mohsen, Yin Fan, Sun Jiao, Peng Nanyun
- Prompt Exploration With Prompt Regression Feffer Michael, Xu Ronald, Sun Yuekai, Yurochkin Mikhail
- Internlm-law: An Open Source Chinese Legal Large Language Model Fei Zhiwei, Zhang Songyang, Shen Xiaoyu, Zhu Dawei, Wang Xiao, Cao Maosong, Zhou Fengzhe, Li Yining, Zhang Wenwei, Lin Dahua, Chen Kai, Ge Jidong
- IGOT: Information Gain Optimized Tokenizer On Domain Adaptive Pretraining Feng Dawei, Zhang Yihai, Xu Zhixuan
- An Extremely Data-efficient And Generative Llm-based Reinforcement Learning Agent For Recommenders Feng Shuang, Feng Grace
- Exploring Automated Distractor Generation For Math Multiple-choice Questions Via Large Language Models Feng Wanyong, Lee Jaewook, Mcnichols Hunter, Scarlatos Alexander, Smith Digory, Woodhead Simon, Ornelas Nancy Otero, Lan Andrew
- Mixture-of-loras: An Efficient Multitask Tuning For Large Language Models Feng Wenfeng, Hao Chuzhan, Zhang Yuewei, Han Yu, Wang Hao
- Mt-ladder: A Model-agnostic Framework Boosting Llm-based Machine Translation To The Next Level Feng Zhaopeng, Zhang Yan, Chen Ruizhe, Meng Zijie, Liu Zuozhu
- Generative AI And Large Language Models For Cyber Security: All Insights You Need Ferrag Mohamed Amine, Alwahedi Fatima, Battah Ammar, Cherif Bilel, Mechri Abdechakour, Tihanyi Norbert
- RAG Foundry: A Framework For Enhancing Llms For Retrieval Augmented Generation Fleischer Daniel, Berchansky Moshe, Wasserblat Moshe, Izsak Peter
- Re-adaptir: Improving Information Retrieval Through Reverse Engineered Adaptation Fleshman William, Van Durme Benjamin
- A Note On Lora Fomenko Vlad, Yu Han, Lee Jongho, Hsieh Stanley, Chen Weizhu
- CLIP With Generative Latent Replay: A Strong Baseline For Incremental Learning Frascaroli Emanuele, Panariello Aniello, Buzzega Pietro, Bonicelli Lorenzo, Porrello Angelo, Calderara Simone
- Autorag-hp: Automatic Online Hyper-parameter Tuning For Retrieval-augmented Generation Fu Jia, Qin Xiaoting, Yang Fangkai, Wang Lu, Zhang Jue, Lin Qingwei, Chen Yubo, Zhang Dongmei, Rajmohan Saravan, Zhang Qi
- A Role-specific Guided Large Language Model For Ophthalmic Consultation Based On Stylistic Differentiation Fu Laiyi, Fan Binbin, Du Hongkai, Feng Yanxiang, Li Chunhua, Song Huping
- Lazyllm: Dynamic Token Pruning For Efficient Long Context LLM Inference Fu Qichen, Cho Minsik, Merth Thomas, Mehta Sachin, Rastegari Mohammad, Najibi Mahyar
- Disperse-then-merge: Pushing The Limits Of Instruction Tuning Via Alignment Tax Reduction Fu Tingchen, Cai Deng, Liu Lemao, Shi Shuming, Yan Rui
- Tiny Titans: Can Smaller Large Language Models Punch Above Their Weight In The Real World For Meeting Summarization? Fu Xue-yong, Laskar Md Tahmid Rahman, Khasanova Elena, Chen Cheng, Tn Shashi Bhushan
- Cross-task Defense: Instruction-tuning Llms For Content Safety Fu Yu, Xiao Wen, Chen Jia, Li Jiachen, Papalexakis Evangelos, Chien Aichi, Dong Yue
- Using Large Language Models For The Interpretation Of Building Regulations Fuchs Stefan, Witbrock Michael, Dimyadi Johannes, Amor Robert
- Layoutllm: Large Language Model Instruction Tuning For Visually Rich Document Understanding Fujitake Masato
- Zero-shot Persuasive Chatbots With Llm-generated Strategies And Information Retrieval Furumai Kazuaki, Legaspi Roberto, Vizcarra Julio, Yamazaki Yudai, Nishimura Yasutaka, Semnani Sina J., Ikeda Kazushi, Shi Weiyan, Lam Monica S.
- Airavata: Introducing Hindi Instruction-tuned LLM Gala Jay, Jayakumar Thanmay, Husain Jaavid Aktar, M Aswanth Kumar, Khan Mohammed Safi Ur Rahman, Kanojia Diptesh, Puduppully Ratish, Khapra Mitesh M., Dabre Raj, Murthy Rudra, Kunchukuttan Anoop
- MMM: Multilingual Mutual Reinforcement Effect Mix Datasets & Test With Open-domain Information Extraction Large Language Models Gan Chengguang, Yin Qingyu, He Xinyang, Wei Hanjun, Liang Yunhao, Lim Younghun, Wang Shijian, Huang Hexiang, Zhang Qinghao, Ni Shiwen, Mori Tatsunori
- Application Of LLM Agents In Recruitment: A Novel Framework For Resume Screening Gan Chengguang, Zhang Qinghao, Mori Tatsunori
- Practical Unlearning For Large Language Models Gao Chongyang, Wang Lixu, Weng Chenkai, Wang Xiao, Zhu Qi
- The Best Of Both Worlds: Toward An Honest And Helpful Large Language Model Gao Chujie, Zhang Qihui, Chen Dongping, Huang Yue, Wu Siyuan, Fu Zhengyan, Wan Yao, Zhang Xiangliang, Sun Lichao
- Aligning LLM Agents By Learning Latent Preference From User Edits Gao Ge, Taymanov Alexey, Salinas Eduardo, Mineiro Paul, Misra Dipendra
- AIM: Let Any Multi-modal Large Language Models Embrace Efficient In-context Learning Gao Jun, Qiao Qian, Cao Ziqiang, Wang Zili, Li Wenjie
- Tc-llava: Rethinking The Transfer From Image To Video Understanding With Temporal Considerations Gao Mingze, Liu Jingyu, Li Mingda, Xie Jiangtao, Liu Qingbin, Zhao Bo, Chen Xi, Xiong Hui
- Autovcoder: A Systematic Framework For Automated Verilog Code Generation Using Llms Gao Mingzhe, Zhao Jieru, Lin Zhe, Ding Wenchao, Hou Xiaofeng, Feng Yu, Li Chao, Guo Minyi
- Efficient Tool Use With Chain-of-abstraction Reasoning Gao Silin, Dwivedi-yu Jane, Yu Ping, Tan Xiaoqing Ellen, Pasunuru Ramakanth, Golovneva Olga, Sinha Koustuv, Celikyilmaz Asli, Bosselut Antoine, Wang Tianlu
- Worldgpt: Empowering LLM As Multimodal World Model Ge Zhiqi, Huang Hongzhe, Zhou Mingze, Li Juncheng, Wang Guoming, Tang Siliang, Zhuang Yueting
- Does Fine-tuning Llms On New Knowledge Encourage Hallucinations? Gekhman Zorik, Yona Gal, Aharoni Roee, Eyal Matan, Feder Amir, Reichart Roi, Herzig Jonathan
- Can Large Language Models Learn Independent Causal Mechanisms? Gendron Gaël, Nguyen Bao Trung, Peng Alex Yuxuan, Witbrock Michael, Dobbie Gillian
- Supporting Sensemaking Of Large Language Model Outputs At Scale Gero Katy Ilonka, Swoopes Chelse, Gu Ziwei, Kummerfeld Jonathan K., Glassman Elena L.
- Beyond Self-learned Attention: Mitigating Attention Bias In Transformer-based Models Using Attention Guidance Gesi Jiri, Ahmed Iftekhar
- A Closer Look At The Limitations Of Instruction Tuning Ghosh Sreyan, Evuru Chandra Kiran Reddy, Kumar Sonal, S Ramaneswaran, Aneja Deepali, Jin Zeyu, Duraiswami Ramani, Manocha Dinesh
- Investigating The Translation Capabilities Of Large Language Models Trained On Parallel Data Only Gilabert Javier García, Escolano Carlos, Savall Aleix Sant, Fornaciari Francesca De Luca, Mash Audrey, Liao Xixian, Melero Maite
- Uncovering Hidden Intentions: Exploring Prompt Recovery For Deeper Insights Into Generated Texts Give Louis, Zaoral Timo, Bruno Maria Antonietta
- Chatglm: A Family Of Large Language Models From GLM-130B To GLM-4 All Tools Glm Team, Zeng Aohan, Xu Bin, Wang Bowen, Zhang Chenhui, Yin Da, Zhang Dan, Rojas Diego, Feng Guanyu, Zhao Hanlin, Lai Hanyu, Yu Hao, Wang Hongning, Sun Jiadai, Zhang Jiajie, Cheng Jiale, Gui Jiayi, Tang Jie, Zhang Jing, Sun Jingyu, Li Juanzi, Zhao Lei, Wu Lindong, Zhong Lucen, Liu Mingdao, Huang Minlie, Zhang Peng, Zheng Qinkai, Lu Rui, Duan Shuaiqi, Zhang Shudan, Cao Shulin, Yang Shuxun, Tam Weng Lam, Zhao Wenyi, Liu Xiao, Xia Xiao, Zhang Xiaohan, Gu Xiaotao, Lv Xin, Liu Xinghan, Liu Xinyi, Yang Xinyue, Song Xixuan, Zhang Xunkai, An Yifan, Xu Yifan, Niu Yilin, Yang Yuantao, Li Yueyan, Bai Yushi, Dong Yuxiao, Qi Zehan, Wang Zhaoyu, Yang Zhen, Du Zhengxiao, Hou Zhenyu, Wang Zihan
- Do Llms Implicitly Determine The Suitable Text Difficulty For Users? Gobara Seiji, Kamigaito Hidetaka, Watanabe Taro
- Arcee's Mergekit: A Toolkit For Merging Large Language Models Goddard Charles, Siriwardhana Shamane, Ehghaghi Malikeh, Meyers Luke, Karpukhin Vlad, Benedict Brian, Mcquade Mark, Solawetz Jacob
- Direct Alignment Of Draft Model For Speculative Decoding With Chat-fine-tuned Llms Goel Raghavv, Gagrani Mukul, Jeon Wonseok, Park Junyoung, Lee Mingu, Lott Christopher
- Language Guided Exploration For RL Agents In Text Environments Golchha Hitesh, Yerawar Sahil, Patel Dhruvesh, Dan Soham, Murugesan Keerthiram
- Do Zombies Understand? A Choose-your-own-adventure Exploration Of Machine Cognition Goldstein Ariel, Stanovsky Gabriel
- Exploring Augmentation And Cognitive Strategies For AI Based Synthetic Personae Gonzalez Rafael Arias, Dipaola Steve
- Estimating Knowledge In Large Language Models Without Generating A Single Token Gottesman Daniela, Geva Mor
- Eyes Closed, Safety On: Protecting Multimodal Llms Via Image-to-text Transformation Gou Yunhao, Chen Kai, Liu Zhili, Hong Lanqing, Xu Hang, Li Zhenguo, Yeung Dit-yan, Kwok James T., Zhang Yu
- COBIAS: Contextual Reliability In Bias Assessment Govil Priyanshul, Jain Hemang, Bonagiri Vamshi Krishna, Chadha Aman, Kumaraguru Ponnurangam, Gaur Manas, Dey Sanorita
- From Generalist To Specialist: Improving Large Language Models For Medical Physics Using Arcot Grandinetti Jace, Mcbeth Rafe
- Stress-testing Capability Elicitation With Password-locked Models Greenblatt Ryan, Roger Fabien, Krasheninnikov Dmitrii, Krueger David
- Fine-tuned Language Models Generate Stable Inorganic Materials As Text Gruver Nate, Sriram Anuroop, Madotto Andrea, Wilson Andrew Gordon, Zitnick C. Lawrence, Ulissi Zachary
- Teams-rl: Teaching Llms To Generate Better Instruction Datasets Via Reinforcement Learning Gu Shangding, Knoll Alois, Jin Ming
- Continual Skill And Task Learning Via Dialogue Gu Weiwei, Kondepudi Suresh, Huang Lixiao, Gopalan Nakul
- Middleware For Llms: Tools Are Instrumental For Language Agents In Complex Environments Gu Yu, Shu Yiheng, Yu Hao, Liu Xiao, Dong Yuxiao, Tang Jie, Srinivasa Jayanth, Latapie Hugo, Su Yu
- Structext-eval: An Autogenerated Benchmark For Evaluating Large Language Model's Ability In Structure-rich Text Understanding Gu Zhouhong, Ye Haoning, Zhou Zeyang, Feng Hongwei, Xiao Yanghua
- Diverse And Fine-grained Instruction-following Ability Exploration With Synthetic Data Gu Zihui, Sun Xingwu, Lian Fengzong, Kang Zhanhui, Xu Cheng-zhong, Fan Ju
- Towards Generalist Prompting For Large Language Models By Mental Models Guan Haoxiang, He Jiyan, Zheng Shuxin, Chen En-hong, Zhang Weiming, Yu Nenghai
- AMOR: A Recipe For Building Adaptable Modular Knowledge Agents Through Process Feedback Guan Jian, Wu Wei, Wen Zujie, Xu Peng, Wang Hongning, Huang Minlie
- Enhancing Collaborative Semantics Of Language Model-driven Recommendations Via Graph-aware Learning Guan Zhong, Wu Likang, Zhao Hongke, He Ming, Fan Jianpin
- On The Interchangeability Of Positional Embeddings In Multilingual Neural Machine Translation Models Gumma Varun, Chitale Pranjal A., Bali Kalika
- Conversational Topic Recommendation In Counseling And Psychotherapy With Decision Transformer And Large Language Models Gunal Aylin, Lin Baihan, Bouneffouf Djallel
- Sample Design Engineering: An Empirical Study Of What Makes Good Downstream Fine-tuning Samples For Llms Guo Biyang, Wang He, Xiao Wenyilin, Chen Hong, Lee Zhuxin, Han Songqiao, Huang Hailiang
- A Novel Paradigm Boosting Translation Capabilities Of Large Language Models Guo Jiaxin, Yang Hao, Li Zongyao, Wei Daimeng, Shang Hengchao, Chen Xiaoyu
- Integrating Large Language Models With Graphical Session-based Recommendation Guo Naicheng, Cheng Hongwei, Liang Qianqiao, Chen Linxun, Han Bing
- Sillm: Large Language Models For Simultaneous Machine Translation Guo Shoutao, Zhang Shaolei, Ma Zhengrui, Zhang Min, Feng Yang
- Fine-tuning Large Language Models For Stock Return Prediction Using Newsflow Guo Tian, Hauptmann Emmanuel
- Serial Position Effects Of Large Language Models Guo Xiaobo, Vosoughi Soroush
- API Pack: A Massive Multi-programming Language Dataset For API Call Generation Guo Zhen, Soria Adriana Meza, Sun Wei, Shen Yikang, Panda Rameswar
- Stabletoolbench: Towards Stable Large-scale Benchmarking On Tool Learning Of Large Language Models Guo Zhicheng, Cheng Sijie, Wang Hao, Liang Shihao, Qin Yujia, Li Peng, Liu Zhiyuan, Sun Maosong, Liu Yang
- Selective Self-rehearsal: A Fine-tuning Approach To Improve Generalization In Large Language Models Gupta Sonam, Nandwani Yatin, Yehudai Asaf, Mishra Mayank, Pandey Gaurav, Raghu Dinesh, Joshi Sachindra
- Would I Lie To You? Inference Time Alignment Of Language Models Using Direct Preference Heads Hadji-kyriacou Avelina Asada, Arandjelovic Ognjen
- Mamba State-space Models Can Be Strong Downstream Learners Halloran John T., Gulati Manbir, Roysdon Paul F.
- Sltrain: A Sparse Plus Low-rank Approach For Parameter And Memory Efficient Pretraining Han Andi, Li Jiaxiang, Huang Wei, Hong Mingyi, Takeda Akiko, Jawanpuria Pratik, Mishra Bamdev
- Small Language Model Can Self-correct Han Haixia, Liang Jiaqing, Shi Jie, He Qianyu, Xiao Yanghua
- Transfer Learning For Text Diffusion Models Han Kehang, Kenealy Kathleen, Barua Aditya, Fiedel Noah, Constant Noah
- Enhancing CTR Prediction Through Sequential Recommendation Pre-training: Introducing The SRP4CTR Framework Han Ruidong, Li Qianzhong, Jiang He, Li Rui, Zhao Yurou, Li Xiang, Lin Wei
- Towards Robust Instruction Tuning On Multimodal Large Language Models Han Wei, Chen Hui, Poria Soujanya
- COCO Is "ALL" You Need For Visual Instruction Fine-tuning Han Xiaotian, Wang Yiqi, Zhai Bohan, You Quanzeng, Yang Hongxia
- Vision-and-language Navigation Generative Pretrained Transformer Hanlin Wen
- Spectrum: Targeted Training On Signal To Noise Ratio Hartford Eric, Atkins Lucas, Neto Fernando Fernandes, Golchinfar David
- Pruning For Protection: Increasing Jailbreak Resistance In Aligned Llms Without Fine-tuning Hasan Adib, Rugina Ileana, Wang Alex
- The Unreasonable Effectiveness Of Easy Training Data For Hard Tasks Hase Peter, Bansal Mohit, Clark Peter, Wiegreffe Sarah
- Teaching Large Language Models To Reason With Reinforcement Learning Havrilla Alex, Du Yuqing, Raparthy Sharath Chandra, Nalmpantis Christoforos, Dwivedi-yu Jane, Zhuravinskyi Maksym, Hambro Eric, Sukhbaatar Sainbayar, Raileanu Roberta
- Chain-of-instructions: Compositional Instruction Tuning On Large Language Models Hayati Shirley Anugrah, Jung Taehee, Bodding-long Tristan, Kar Sudipta, Sethy Abhinav, Kim Joo-kyung, Kang Dongyeop
- Safety Arithmetic: A Framework For Test-time Safety Alignment Of Language Models By Steering Parameters And Activations Hazra Rima, Layek Sayan, Banerjee Somnath, Poria Soujanya
- From Words To Actions: Unveiling The Theoretical Underpinnings Of Llm-driven Autonomous Systems He Jianliang, Chen Siyu, Zhang Fengzhuo, Yang Zhuoran
- Prompt-based Bias Calibration For Better Zero/few-shot Learning Of Language Models He Kang, Long Yinghan, Roy Kaushik
- Achieving Tool Calling Functionality In Llms Using Only Prompt Engineering Without Fine-tuning He Shengtao
- Prompting Chatgpt For Translation: A Comparative Analysis Of Translation Brief And Persona Prompts He Sui
- MARS: Mixture Of Auto-regressive Models For Fine-grained Text-to-image Synthesis He Wanggui, Fu Siming, Liu Mushui, Wang Xierui, Xiao Wenyi, Shu Fangxun, Wang Yi, Zhang Lei, Yu Zhelun, Li Haoyuan, Huang Ziwei, Gan Leilei, Jiang Hao
- Transferring Troubles: Cross-lingual Transferability Of Backdoor Attacks In Llms With Instruction Tuning He Xuanli, Wang Jun, Xu Qiongkai, Minervini Pasquale, Stenetorp Pontus, Rubinstein Benjamin I. P., Cohn Trevor
- SHED: Shapley-based Automated Dataset Refinement For Instruction Fine-tuning He Yexiao, Wang Ziyao, Shen Zheyu, Sun Guoheng, Dai Yucong, Wu Yongkai, Wang Hongyi, Li Ang
- Telechat Technical Report He Zhongjiang, Wang Zihan, Liu Xinzhang, Liu Shixuan, Yao Yitong, Huang Yuyao, Li Xuelong, Li Yongxiang, Che Zhonghao, Zhang Zhaoxi, Wang Yan, Wang Xin, Pu Luwen, Xu Huinan, Fang Ruiyu, Zhao Yu, Zhang Jie, Huang Xiaomeng, Lu Zhilong, Peng Jiaxin, Zheng Wenjun, Wang Shiquan, Yang Bingkai, He Xuewei, Jiang Zhuoru, Xie Qiyi, Zhang Yanhan, Li Zhongqiu, Shi Lingling, Fu Weiwei, Zhang Yin, Huang Zilu, Xiong Sishi, Zhang Yuxiang, Wang Chao, Song Shuangyong
- PERSOMA: Personalized Soft Prompt Adapter Architecture For Personalized Language Prompting Hebert Liam, Sayana Krishna, Jash Ambarish, Karatzoglou Alexandros, Sodhi Sukhdeep, Doddapaneni Sumanth, Cai Yanli, Kuzmin Dima
- On Overcoming Miscalibrated Conversational Priors In Llm-based Chatbots Herlihy Christine, Neville Jennifer, Schnabel Tobias, Swaminathan Adith
- Lilium: Ebay's Large Language Models For E-commerce Herold Christian, Kozielski Michael, Ekimov Leonid, Petrushkov Pavel, Vandenbussche Pierre-yves, Khadivi Shahram
- Deepspeed-fastgen: High-throughput Text Generation For Llms Via MII And Deepspeed-inference Holmes Connor, Tanaka Masahiro, Wyatt Michael, Awan Ammar Ahmad, Rasley Jeff, Rajbhandari Samyam, Aminabadi Reza Yazdani, Qin Heyang, Bakhtiari Arash, Kurilenko Lev, He Yuxiong
- Llm-aided Compilation For Tensor Accelerators Hong Charles, Bhatia Sahil, Haan Altan, Dong Shengjun Kris, Nikiforov Dima, Cheung Alvin, Shao Yakun Sophia
- Towards Building A Robust Knowledge Intensive Question Answering Model With Large Language Models Hong Xingyun, Shao Yan, Wang Zhilin, Duan Manni, Xiongnan Jin
- Accelerating Multilingual Language Model For Excessively Tokenized Languages Hong Jimin, Lee Gibbeum, Cho Jaewoong
- Curiosity-driven Red-teaming For Large Language Models Hong Zhang-wei, Shenfeld Idan, Wang Tsun-hsuan, Chuang Yung-sung, Pareja Aldo, Glass James, Srivastava Akash, Agrawal Pulkit
- Free To Play: UN Trade And Development's Experience With Developing Its Own Open-source Retrieval Augmented Generation Large Language Model Application Hopp Daniel
- Non-linear Inference Time Intervention: Improving LLM Truthfulness Hoscilowicz Jakub, Wiacek Adam, Chojnacki Jan, Cieslak Adam, Michon Leszek, Urbanevych Vitalii, Janicki Artur
- Raw Text Is All You Need: Knowledge-intensive Multi-turn Instruction Tuning For Large Language Model Hou Xia, Li Qifeng, Yang Jian, Li Tongliang, Chai Linzheng, Wu Xianjie, Ji Hangyuan, Li Zhoujun, Nie Jixuan, Dun Jingbo, Song Wenfeng
- Chatglm-rlhf: Practices Of Aligning Large Language Models With Human Feedback Hou Zhenyu, Niu Yilin, Du Zhengxiao, Zhang Xiaohan, Liu Xiao, Zeng Aohan, Zheng Qinkai, Huang Minlie, Wang Hongning, Tang Jie, Dong Yuxiao
- Teaching Language Models To Self-improve By Learning From Language Feedback Hu Chi, Hu Yimin, Cao Hang, Xiao Tong, Zhu Jingbo
- Openrlhf: An Easy-to-use, Scalable And High-performance RLHF Framework Hu Jian, Wu Xibin, Wang Weixun, Xianyu, Zhang Dehao, Cao Yu
- Agentgen: Enhancing Planning Abilities For Large Language Model Based Agent Via Environment And Task Generation Hu Mengkang, Zhao Pu, Xu Can, Sun Qingfeng, Lou Jianguang, Lin Qingwei, Luo Ping, Rajmohan Saravan, Zhang Dongmei
- Mitigating Large Language Model Hallucination With Faithful Finetuning Hu Minda, He Bowei, Wang Yufei, Li Liangyou, Ma Chen, King Irwin
- Minicpm: Unveiling The Potential Of Small Language Models With Scalable Training Strategies Hu Shengding, Tu Yuge, Han Xu, He Chaoqun, Cui Ganqu, Long Xiang, Zheng Zhi, Fang Yewei, Huang Yuxiang, Zhao Weilin, Zhang Xinrong, Thai Zheng Leng, Zhang Kaihuo, Wang Chongyi, Yao Yuan, Zhao Chenyang, Zhou Jie, Cai Jie, Zhai Zhongwu, Ding Ning, Jia Chao, Zeng Guoyang, Li Dahai, Liu Zhiyuan, Sun Maosong
- A Survey On Large Language Model-based Game Agents Hu Sihao, Huang Tiansheng, Ilhan Fatih, Tekin Selim, Liu Gaowen, Kompella Ramana, Liu Ling
- Matryoshka Query Transformer For Large Vision-language Models Hu Wenbo, Dou Zi-yi, Li Liunian Harold, Kamath Amita, Peng Nanyun, Chang Kai-wei
- Case-based Or Rule-based: How Do Transformers Do The Math? Hu Yi, Tang Xiaojuan, Yang Haotong, Zhang Muhan
- Exact And Efficient Unlearning For Large Language Model-based Recommendation Hu Zhiyu, Zhang Yang, Xiao Minghao, Wang Wenjie, Feng Fuli, He Xiangnan
- Disentangling Logic: The Role Of Context In Large Language Model Reasoning Capabilities Hua Wenyue, Zhu Kaijie, Li Lingyao, Fan Lizhou, Lin Shuhang, Jin Mingyu, Xue Haochen, Li Zelong, Wang Jindong, Zhang Yongfeng
- Instupr: Instruction-based Unsupervised Passage Reranking With Large Language Models Huang Chao-wei, Chen Yun-nung
- Investigating Decoder-only Large Language Models For Speech-to-text Translation Huang Chao-wei, Lu Hui, Gong Hongyu, Inaguma Hirofumi, Kulikov Ilia, Mavlyutov Ruslan, Popuri Sravya
- Pokergpt: An End-to-end Lightweight Solver For Multi-player Texas Hold'em Via Large Language Model Huang Chenghao, Cao Yanbo, Wen Yinlong, Zhou Tao, Zhang Yanru
- Learn When (not) To Trust Language Models: A Privacy-centric Adaptive Model-aware Approach Huang Chengkai, Wang Rui, Xie Kaige, Yu Tong, Yao Lina
- What Affects The Stability Of Tool Learning? An Empirical Study On The Robustness Of Tool Learning Frameworks Huang Chengrui, Shi Zhengliang, Wen Yuntao, Chen Xiuying, Han Peng, Gao Shen, Shang Shuo
- Selective Prompting Tuning For Personalized Conversations With Llms Huang Qiushi, Liu Xubo, Ko Tom, Wu Bo, Wang Wenwu, Zhang Yu, Tang Lilian
- Antidote: Post-fine-tuning Safety Alignment For Large Language Models Against Harmful Fine-tuning Huang Tiansheng, Bhattacharya Gautam, Joshi Pratik, Kimball Josh, Liu Ling
- Vaccine: Perturbation-aware Alignment For Large Language Models Against Harmful Fine-tuning Huang Tiansheng, Hu Sihao, Liu Ling
- Improving Llms For Recommendation With Out-of-vocabulary Tokens Huang Ting-ji, Yang Jia-qi, Shen Chunxu, Liu Kai-qi, Zhan De-chuan, Ye Han-jia
- An Empirical Study Of Llama3 Quantization: From Llms To Mllms Huang Wei, Zheng Xingyu, Ma Xudong, Qin Haotong, Lv Chengtao, Chen Hong, Luo Jie, Qi Xiaojuan, Liu Xianglong, Magno Michele
- Visual Hallucinations Of Multi-modal Large Language Models Huang Wen, Liu Hongbin, Guo Minxin, Gong Neil Zhenqiang
- Rolora: Fine-tuning Rotated Outlier-free Llms For Effective Weight-activation Quantization Huang Xijie, Liu Zechun, Liu Shih-yang, Cheng Kwang-ting
- WESE: Weak Exploration To Strong Exploitation For LLM Agents Huang Xu, Liu Weiwen, Chen Xiaolong, Wang Xingmei, Lian Defu, Wang Yasheng, Tang Ruiming, Chen Enhong
- Lost In The Source Language: How Large Language Models Evaluate The Quality Of Machine Translation Huang Xu, Zhang Zhirui, Geng Xiang, Du Yichao, Chen Jiajun, Huang Shujian
- Leveraging Large Language Models For Enhanced NLP Task Performance Through Knowledge Distillation And Optimized Training Strategies Huang Yining, Tang Keke, Chen Meilian
- See The Unseen: Better Context-consistent Knowledge-editing By Noises Huang Youcheng, Lei Wenqiang, Zhang Zheng, Lv Jiancheng, Yan Shuicheng
- Ecr-chain: Advancing Generative Language Models To Better Emotion-cause Reasoners Through Reasoning Chains Huang Zhaopei, Zhao Jinming, Jin Qin
- Accelerating Pre-training Of Multimodal Llms Via Chain-of-sight Huang Ziyuan, Ji Kaixiang, Gong Biao, Qing Zhiwu, Zhang Qinglong, Zheng Kecheng, Wang Jian, Chen Jingdong, Yang Ming
- Instruction Fine-tuning: Does Prompt Loss Matter? Huerta-enochian Mathew, Ko Seung Yong
- HFT: Half Fine-tuning For Large Language Models Hui Tingfeng, Zhang Zhenyu, Wang Shuohuan, Xu Weiran, Sun Yu, Wu Hua
- Self-explore To Avoid The Pit: Improving The Reasoning Capabilities Of Language Models With Fine-grained Rewards Hwang Hyeonbin, Kim Doyoung, Kim Seungone, Ye Seonghyeon, Seo Minjoon
- Pc-lora: Low-rank Adaptation For Progressive Model Compression With Knowledge Distillation Hwang Injoon, Park Haewon, Lee Youngwan, Yang Jooyoung, Maeng Sunjae
- News Without Borders: Domain Adaptation Of Multilingual Sentence Embeddings For Cross-lingual News Recommendation Iana Andreea, Schmidt Fabian David, Glavaš Goran, Paulheim Heiko
- Self-guiding Exploration For Combinatorial Problems Iklassov Zangir, Du Yali, Akimov Farkhad, Takac Martin
- Test-time Low Rank Adaptation Via Confidence Maximization For Zero-shot Generalization Of Vision-language Models Imam Raza, Gani Hanan, Huzaifa Muhammad, Nandakumar Karthik
- Improving Multilingual Instruction Finetuning Via Linguistically Natural And Diverse Datasets Indurthi Sathish Reddy, Zhou Wenxuan, Chollampatt Shamil, Agrawal Ravi, Song Kaiqiang, Zhao Lingxiao, Zhu Chenguang
- Scaling Laws For Downstream Task Performance Of Large Language Models Isik Berivan, Ponomareva Natalia, Hazimeh Hussein, Paparas Dimitris, Vassilvitskii Sergei, Koyejo Sanmi
- Quality Or Quantity? On Data Scale And Diversity In Adapting Large Language Models For Low-resource Translation Iyer Vivek, Malik Bhavitvya, Stepachev Pavel, Chen Pinzhen, Haddow Barry, Birch Alexandra
- KIT-19: A Comprehensive Korean Instruction Toolkit On 19 Tasks For Fine-tuning Korean Large Language Models Jang Dongjun, Byun Sungjoo, Jo Hyemi, Shin Hyopil
- Performance Of Recent Large Language Models For A Low-resourced Language Jayakody Ravindu, Dias Gihan
- L4Q: Parameter Efficient Quantization-aware Fine-tuning On Large Language Models Jeon Hyesung, Kim Yulhwa, Kim Jae-joon
- RLSF: Reinforcement Learning Via Symbolic Feedback Jha Piyush, Jana Prithwish, Arora Arnav, Ganesh Vijay
- Precision Empowers, Excess Distracts: Visual Question Answering With Dynamically Infused Knowledge In Language Models Jhalani Manas, M Annervaz K, Bhattacharyya Pushpak
- Language Models Resist Alignment Ji Jiaming, Wang Kaile, Qiu Tianyi, Chen Boyuan, Zhou Jiayi, Li Changye, Lou Hantao, Yang Yaodong
- Llms Could Autonomously Learn Without External Supervision Ji Ke, Chen Junying, Gao Anningzhe, Xie Wenya, Wan Xiang, Wang Benyou
- Can't Say Cant? Measuring And Reasoning Of Dark Jargons In Large Language Models Ji Xu, Zhang Jianyi, Zhou Ziyin, Zhao Zhangchi, Qiao Qianqian, Han Kaiying, Hossen Md Imran, Hei Xiali
- Bwarea Model: Learning World Model, Inverse Dynamics, And Policy For Controllable Language Generation Jia Chengxing, Wang Pengyuan, Li Ziniu, Li Yi-chen, Zhang Zhilong, Tang Nan, Yu Yang
- Interpretable Catastrophic Forgetting Of Large Language Model Fine-tuning Via Instruction Vector Jiang Gangwei, Jiang Caigao, Li Zhaoyi, Xue Siqiao, Zhou Jun, Song Linqi, Lian Defu, Wei Ying
- Minference 1.0: Accelerating Pre-filling For Long-context Llms Via Dynamic Sparse Attention Jiang Huiqiang, Li Yucheng, Zhang Chengruidong, Wu Qianhui, Luo Xufang, Ahn Surin, Han Zhenhua, Abdi Amir H., Li Dongsheng, Lin Chin-yew, Yang Yuqing, Qiu Lili
- Mix-cpt: A Domain Adaptation Framework Via Decoupling Knowledge Learning And Format Alignment Jiang Jinhao, Li Junyi, Zhao Wayne Xin, Song Yang, Zhang Tao, Wen Ji-rong
- Can Large Language Models Generate High-quality Patent Claims? Jiang Lekang, Zhang Caiqi, Scherz Pascal A, Goetz Stephan
- Wildteaming At Scale: From In-the-wild Jailbreaks To (adversarially) Safer Language Models Jiang Liwei, Rao Kavel, Han Seungju, Ettinger Allyson, Brahman Faeze, Kumar Sachin, Mireshghallah Niloofar, Lu Ximing, Sap Maarten, Choi Yejin, Dziri Nouha
- Turning Generative Models Degenerate: The Power Of Data Poisoning Attacks Jiang Shuli, Kadhe Swanand Ravindra, Zhou Yi, Ahmed Farhan, Cai Ling, Baracaldo Nathalie
- Mora: High-rank Updating For Parameter-efficient Fine-tuning Jiang Ting, Huang Shaohan, Luo Shengyue, Zhang Zihan, Huang Haizhen, Wei Furu, Deng Weiwei, Sun Feng, Zhang Qi, Wang Deqing, Zhuang Fuzhen
- SEED: Customize Large Language Models With Sample-efficient Adaptation For Code Generation Jiang Xue, Dong Yihong, Jin Zhi, Li Ge
- Large Visual-language Models Are Also Good Classifiers: A Study Of In-context Multimodal Fake News Detection Jiang Ye, Wang Yimin
- Instruction-tuned Language Models Are Better Knowledge Learners Jiang Zhengbao, Sun Zhiqing, Shi Weijia, Rodriguez Pedro, Zhou Chunting, Neubig Graham, Lin Xi Victoria, Yih Wen-tau, Iyer Srinivasan
- Learning Planning-based Reasoning By Trajectories Collection And Process Reward Synthesizing Jiao Fangkai, Qin Chengwei, Liu Zhengyuan, Chen Nancy F., Joty Shafiq
- LLM Maybe Longlm: Self-extend LLM Context Window Without Tuning Jin Hongye, Han Xiaotian, Yang Jingfeng, Jiang Zhimeng, Liu Zirui, Chang Chia-yuan, Chen Huiyuan, Hu Xia
- BIDER: Bridging Knowledge Inconsistency For Efficient Retrieval-augmented Llms Via Key Supporting Evidence Jin Jiajie, Zhu Yutao, Zhou Yujia, Dou Zhicheng
- Crimson: Empowering Strategic Reasoning In Cybersecurity Through Large Language Models Jin Jiandong, Tang Bowen, Ma Mingxuan, Liu Xiao, Wang Yunfei, Lai Qingnan, Yang Jia, Zhou Changling
- Self-selected Attention Span For Accelerating Large Language Model Inference Jin Tian, Yazar Wanzin, Xu Zifei, Sharify Sayeh, Wang Xin
- Birbal: An Efficient 7B Instruct-model Fine-tuned With Curated Datasets Jindal Ashvini Kumar, Rajpoot Pawan Kumar, Parikh Ankur
- Llm-based Federated Recommendation Jujia Zhao, Wenjie Wang, Chen Xu, Zhaochun Ren, See-kiong Ng, Tat-seng Chua
- What If...?: Thinking Counterfactual Keywords Helps To Mitigate Hallucination In Large Multi-modal Models Junho Kim, Yeon Ju Kim, Yong Man Ro
- Dipt: Enhancing LLM Reasoning Through Diversified Perspective-taking Just Hoang Anh, Dabas Mahavir, Huang Lifu, Jin Ming, Jia Ruoxi
- Beyond Labels: Aligning Large Language Models With Human-like Reasoning Kabir Muhammad Rafsan, Sultan Rafeed Mohammad, Asif Ihsanul Haque, Ahad Jawad Ibn, Rahman Fuad, Amin Mohammad Ruhul, Mohammed Nabeel, Rahman Shafin
- Evaluating The Impact Of Advanced LLM Techniques On Ai-lecture Tutors For A Robotics Course Kahl Sebastian, Löffler Felix, Maciol Martin, Ridder Fabian, Schmitz Marius, Spanagel Jennifer, Wienkamp Jens, Burgahn Christopher, Schilling Malte
- Scaling Laws For Forgetting When Fine-tuning Large Language Models Kalajdzievski Damjan
- When Can Llms Actually Correct Their Own Mistakes? A Critical Survey Of Self-correction Of Llms Kamoi Ryo, Zhang Yusen, Zhang Nan, Han Jiawei, Zhang Rui
- Domain-specific Improvement On Psychotherapy Chatbot Using Assistant Kang Cheng, Novak Daniel, Urbanova Katerina, Cheng Yuqing, Hu Yong
- Mindstar: Enhancing Math Reasoning In Pre-trained Llms At Inference Time Kang Jikun, Li Xin Zhe, Chen Xi, Kazemi Amirreza, Sun Qianyi, Chen Boxing, Li Dong, He Xu, He Quan, Wen Feng, Hao Jianye, Yao Jun
- Unfamiliar Finetuning Examples Control How Language Models Hallucinate Kang Katie, Wallace Eric, Tomlin Claire, Kumar Aviral, Levine Sergey
- Large Language Models Must Be Taught To Know What They Don't Know Kapoor Sanyam, Gruver Nate, Roberts Manley, Collins Katherine, Pal Arka, Bhatt Umang, Weller Adrian, Dooley Samuel, Goldblum Micah, Wilson Andrew Gordon
- Instruct-skillmix: A Powerful Pipeline For LLM Instruction Tuning Kaur Simran, Park Simon, Goyal Anirudh, Arora Sanjeev
- Defending Large Language Models Against Attacks With Residual Stream Activation Analysis Kawasaki Amelia, Davis Andrew, Abbas Houssam
- Exploring The Frontiers Of Llms In Psychological Applications: A Comprehensive Review Ke Luoma, Tong Song, Cheng Peng, Peng Kaiping
- Source-aware Training Enables Knowledge Attribution In Language Models Khalifa Muhammad, Wadden David, Strubell Emma, Lee Honglak, Wang Lu, Beltagy Iz, Peng Hao
- Indicllmsuite: A Blueprint For Creating Pre-training And Fine-tuning Datasets For Indian Languages Khan Mohammed Safi Ur Rahman, Mehta Priyam, Sankar Ananth, Kumaravelan Umashankar, Doddapaneni Sumanth, G Suriyaprasaad, G Varun Balan, Jain Sparsh, Kunchukuttan Anoop, Kumar Pratyush, Dabre Raj, Khapra Mitesh M.
- Guiding Large Language Models To Post-edit Machine Translation With Error Annotations Ki Dayeon, Carpuat Marine
- Shortened Llama: Depth Pruning For Large Language Models With Comparison Of Retraining Methods Kim Bo-kyeong, Kim Geonmin, Kim Tae-ho, Castells Thibault, Choi Shinkook, Shin Junho, Song Hyoung-kyu
- Understanding Large-language Model (llm)-powered Human-robot Interaction Kim Callie Y., Lee Christine P., Mutlu Bilge
- Promptkd: Distilling Student-friendly Knowledge For Generative Language Models Via Prompt Tuning Kim Gyeongman, Jang Doohyuk, Yang Eunho
- Fine-tuning CLIP Text Encoders With Two-step Paraphrasing Kim Hyunjae, Yoon Seunghyun, Bui Trung, Zhao Handong, Tran Quan, Dernoncourt Franck, Kang Jaewoo
- Sure: Summarizing Retrievals Using Answer Candidates For Open-domain QA Of Llms Kim Jaehyung, Nam Jaehyun, Mo Sangwoo, Park Jongjin, Lee Sang-woo, Seo Minjoon, Ha Jung-woo, Shin Jinwoo
- Openvla: An Open-source Vision-language-action Model Kim Moo Jin, Pertsch Karl, Karamcheti Siddharth, Xiao Ted, Balakrishna Ashwin, Nair Suraj, Rafailov Rafael, Foster Ethan, Lam Grace, Sanketi Pannag, Vuong Quan, Kollar Thomas, Burchfiel Benjamin, Tedrake Russ, Sadigh Dorsa, Levine Sergey, Liang Percy, Finn Chelsea
- Large Language Models Meet Collaborative Filtering: An Efficient All-round Llm-based Recommender System Kim Sein, Kang Hongseok, Choi Seungyoon, Kim Donghyun, Yang Minchul, Park Chanyoung
- Is GPT-4 Alone Sufficient For Automated Essay Scoring?: A Comparative Judgment Approach Based On Rater Cognition Kim Seungju, Jo Meounggun
- Groundial: Human-norm Grounded Safe Dialog Response Generation Kim Siwon, Dai Shuyang, Kachuee Mohammad, Ray Shayan, Taghavi Tara, Yoon Sungroh
- "I'm Not Sure, But...": Examining The Impact Of Large Language Models' Uncertainty Expression On User Reliance And Trust Kim Sunnie S. Y., Liao Q. Vera, Vorvoreanu Mihaela, Ballard Stephanie, Vaughan Jennifer Wortman
- Llmem: Estimating GPU Memory Usage For Fine-tuning Pre-trained Llms Kim Taeho, Wang Yanming, Chaturvedi Vatshank, Gupta Lokesh, Kim Seyeon, Kwon Yongin, Ha Sangtae
- Health-llm: Large Language Models For Health Prediction Via Wearable Sensor Data Kim Yubin, Xu Xuhai, Mcduff Daniel, Breazeal Cynthia, Park Hae Won
- Evaluating Telugu Proficiency In Large Language Models: A Comparative Analysis Of Chatgpt And Gemini Kishore Katikela Sreeharsha, Shaik Rahimanuddin
- From Bytes To Borsch: Fine-tuning Gemma And Mistral For The Ukrainian Language Representation Kiulian Artur, Polishko Anton, Khandoga Mykola, Chubych Oryna, Connor Jack, Ravishankar Raghav, Shirawalmath Adarsh
- Contrastive Perplexity For Controlled Generation: An Application In Detoxifying Large Language Models Klein Tassilo, Nabi Moin
- Extended Mind Transformers Klett Phoebe, Ahle Thomas
- Blending Llms Into Cascaded Speech Translation: Kit's Offline Speech Translation System For IWSLT 2024 Koneru Sai, Nguyen Thai-binh, Pham Ngoc-quan, Liu Danni, Li Zhaolin, Waibel Alexander, Niehues Jan
- Self-prompt Tuning: Enable Autonomous Role-playing In Llms Kong Aobo, Zhao Shiwan, Chen Hao, Li Qicheng, Qin Yong, Sun Ruiqi, Zhou Xin, Zhou Jiaming, Sun Haoqin
- Labeling Supervised Fine-tuning Data With The Scaling Law Kong Huanjun
- Aligning Large Language Models With Representation Editing: A Control Perspective Kong Lingkai, Wang Haorui, Mu Wenhao, Du Yuanqi, Zhuang Yuchen, Zhou Yifei, Song Yue, Zhang Rongzhi, Wang Kai, Zhang Chao
- Customizing Language Models With Instance-wise Lora For Sequential Recommendation Kong Xiaoyu, Wu Jiancan, Zhang An, Sheng Leheng, Lin Hui, Wang Xiang, He Xiangnan
- Bitune: Bidirectional Instruction-tuning Kopiczko Dawid J., Blankevoort Tijmen, Asano Yuki M.
- Transllama: Llm-based Simultaneous Translation System Koshkin Roman, Sudoh Katsuhito, Nakamura Satoshi
- Llms Are Zero-shot Context-aware Simultaneous Translators Koshkin Roman, Sudoh Katsuhito, Nakamura Satoshi
- Hints-in-browser: Benchmarking Language Models For Programming Feedback Generation Kotalwar Nachiket, Gotovos Alkis, Singla Adish
- Arabiangpt: Native Arabic Gpt-based Large Language Model Koubaa Anis, Ammar Adel, Ghouti Lahouari, Najar Omar, Sibaee Serry
- Can Large Language Models Explore In-context? Krishnamurthy Akshay, Harris Keegan, Foster Dylan J., Zhang Cyril, Slivkins Aleksandrs
- Can Better Text Semantics In Prompt Tuning Improve VLM Generalization? Kuchibhotla Hari Chandana, Kancheti Sai Srinivas, Reddy Abbavaram Gowtham, Balasubramanian Vineeth N
- Fine-tuning, Quantization, And Llms: Navigating Unintended Outcomes Kumar Divyanshu, Kumar Anurakt, Agarwal Sahil, Harshangi Prashanth
- Bridging The Gap: Dynamic Learning Strategies For Improving Multilingual Performance In Llms Kumar Somnath, Balloli Vaibhav, Ranjit Mercy, Ahuja Kabir, Ganu Tanuja, Sitaram Sunayana, Bali Kalika, Nambi Akshay
- Properties And Challenges Of Llm-generated Explanations Kunz Jenny, Kuhlmann Marco
- Teaching Llama A New Language Through Cross-lingual Knowledge Transfer Kuulmets Hele-andra, Purason Taido, Luhtaru Agnes, Fishel Mark
- Continual Learning Optimizations For Auto-regressive Decoder Of Multilingual ASR Systems Kwok Chin Yuen, Yip Jia Qi, Chng Eng Siong
- Can Github Issues Be Solved With Tree Of Thoughts? La Rosa Ricardo, Hulse Corey, Liu Bangdi
- Thalle: Text Hyperlocally Augmented Large Language Extension -- Technical Report Labs Kbtg, Khamnuansin Danupat, Petchsod Atthakorn, Lertpiya Anuruth, Balee Pornchanan, Lodkaew Thanawat, Chalothorn Tawunrat, Pongthawornkamol Thadpong, Lertsutthiwong Monchai
- The Remarkable Robustness Of Llms: Stages Of Inference? Lad Vedang, Gurnee Wes, Tegmark Max
- Mcot: Multilingual Instruction Tuning For Reasoning Consistency In Language Models Lai Huiyuan, Nissim Malvina
- From Distributional To Overton Pluralism: Investigating Large Language Model Alignment Lake Thom, Choi Eunsol, Durrett Greg
- Self-directed Synthetic Dialogues And Revisions Technical Report Lambert Nathan, Schoelkopf Hailey, Gokaslan Aaron, Soldaini Luca, Pyatkin Valentina, Castricato Louis
- Lapdoc: Layout-aware Prompting For Documents Lamott Marcel, Weweler Yves-noel, Ulges Adrian, Shafait Faisal, Krechel Dirk, Obradovic Darko
- Fine-tuning Language Models With Reward Learning On Policy Lang Hao, Huang Fei, Li Yongbin
- Large Language Models As Evolution Strategies Lange Robert Tjarko, Tian Yingtao, Tang Yujin
- Enhancing Neural Machine Translation Of Low-resource Languages: Corpus Development, Human Evaluation And Explainable AI Architectures Lankford Séamus
- Adaptmllm: Fine-tuning Multilingual Language Models On Low-resource Languages With Integrated LLM Playgrounds Lankford Séamus, Afli Haithem, Way Andy
- Sparse Rewards Can Self-train Dialogue Agents Lattimer Barrett Martin, Gangal Varun, Mcdonald Ryan, Yang Yi
- Reallm: A General Framework For LLM Compression And Fine-tuning Leconte Louis, Bedin Lisa, Nguyen Van Minh, Moulines Eric
- Collavo: Crayon Large Language And Vision Model Lee Byung-kwan, Park Beomchan, Kim Chae Won, Ro Yong Man
- Nv-embed: Improved Techniques For Training Llms As Generalist Embedding Models Lee Chankyu, Roy Rajarshi, Xu Mengyao, Raiman Jonathan, Shoeybi Mohammad, Catanzaro Bryan, Ping Wei
- Improving Conversational Abilities Of Quantized Large Language Models Via Direct Preference Alignment Lee Janghwan, Park Seongmin, Hong Sukjin, Kim Minsoo, Chang Du-seong, Choi Jungwook
- CATS: Contextually-aware Thresholding For Sparsity In Large Language Models Lee Je-yong, Lee Donghyun, Zhang Genghan, Tiwari Mo, Mirhoseini Azalia
- A Survey Of Large Language Models In Finance (finllms) Lee Jean, Stevens Nicholas, Han Soyeon Caren, Song Minseok
- Reinforcement Learning From Reflective Feedback (RLRF): Aligning And Improving Llms Via Fine-grained Self-reflection Lee Kyungjae, Hwang Dasol, Park Sunghyun, Jang Youngsoo, Lee Moontae
- LLM2LLM: Boosting Llms With Novel Iterative Data Enhancement Lee Nicholas, Wattanawong Thanakul, Kim Sehoon, Mangalam Karttikeya, Shen Sheng, Anumanchipalli Gopala, Mahoney Michael W., Keutzer Kurt, Gholami Amir
- Interactive Text-to-image Retrieval With Large Language Models: A Plug-and-play Approach Lee Saehyung, Yu Sangwon, Park Junsung, Yi Jihun, Yoon Sungroh
- Effective And Efficient Conversation Retrieval For Dialogue State Tracking With Implicit Text Summaries Lee Seanie, Cheng Jianpeng, Driesen Joris, Coca Alexandru, Johannsen Anders
- Materials Science In The Era Of Large Language Models: A Perspective Lei Ge, Docherty Ronan, Cooper Samuel J.
- Meta-task Prompting Elicits Embeddings From Large Language Models Lei Yibin, Wu Di, Zhou Tianyi, Shen Tao, Cao Yu, Tao Chongyang, Yates Andrew
- Prexme! Large Scale Prompt Exploration Of Open Source Llms For Machine Translation And Summarization Evaluation Leiter Christoph, Eger Steffen
- More RLHF, More Trust? On The Impact Of Human Preference Alignment On Language Model Trustworthiness Li Aaron J., Krishna Satyapriya, Lakkaraju Himabindu
- Cascade Reward Sampling For Efficient Decoding-time Alignment Li Bolian, Wang Yifan, Grama Ananth, Zhang Ruqi
- A Study On Training And Developing Large Language Models For Behavior Tree Generation Li Fu, Wang Xueying, Li Bin, Wu Yunlong, Wang Yanzhen, Yi Xiaodong
- Recall, Retrieve And Reason: Towards Better In-context Relation Extraction Li Guozheng, Wang Peng, Ke Wenjun, Guo Yikai, Ji Ke, Shang Ziyu, Liu Jiajun, Xu Zijie
- Gradient-mask Tuning Elevates The Upper Limits Of LLM Performance Li Haoling, Zhang Xin, Liu Xiao, Gong Yeyun, Wang Yifan, Yang Yujiu, Chen Qi, Cheng Peng
- Backdoor Removal For Generative Large Language Models Li Haoran, Chen Yulin, Zheng Zihao, Hu Qi, Chan Chunkit, Liu Heshan, Song Yangqiu
- Revisiting Catastrophic Forgetting In Large Language Model Tuning Li Hongyu, Ding Liang, Fang Meng, Tao Dacheng
- Enhance Lifelong Model Editing With Continuous Data-adapter Association Li Jiaang, Wang Quan, Wang Zhongnan, Zhang Yongdong, Mao Zhendong
- A Cross-language Investigation Into Jailbreak Attacks In Large Language Models Li Jie, Liu Yi, Liu Chongyang, Shi Ling, Ren Xiaoning, Zheng Yaowen, Liu Yang, Xue Yinxing
- Dissecting Human And LLM Preferences Li Junlong, Zhou Fan, Sun Shichao, Zhang Yikai, Zhao Hai, Liu Pengfei
- Selective Reflection-tuning: Student-selected Data Recycling For LLM Instruction-tuning Li Ming, Chen Lichang, Chen Jiuhai, He Shwai, Gu Jiuxiang, Zhou Tianyi
- Bridging The Language Gap: Enhancing Multilingual Prompt-based Code Generation In Llms Via Zero-shot Cross-lingual Transfer Li Mingda, Mishra Abhijit, Mujumdar Utkarsh
- Red Teaming Visual Language Models Li Mukai, Li Lei, Yin Yuwei, Ahmed Masood, Liu Zhenguang, Liu Qi
- Owlore: Outlier-weighed Layerwise Sampled Low-rank Projection For Memory-efficient LLM Fine-tuning Li Pengxiang, Yin Lu, Gao Xiaowei, Liu Shiwei
- Coevol: Constructing Better Responses For Instruction Finetuning Through Multi-agent Cooperation Li Renhao, Tan Minghuan, Wong Derek F., Yang Min
- Scilitllm: How To Adapt Llms For Scientific Literature Understanding Li Sihang, Huang Jin, Zhuang Jiaxi, Shi Yaorui, Cai Xiaochen, Xu Mingjun, Wang Xiang, Zhang Linfeng, Ke Guolin, Cai Hengxing
- Improving Faithfulness Of Large Language Models In Summarization Via Sliding Generation And Self-consistency Li Taiji, Li Zhi, Zhang Yin
- Moe-ct: A Novel Approach For Large Language Models Training With Resistance To Catastrophic Forgetting Li Tianhao, Li Shangjie, Xie Binbin, Xiong Deyi, Yang Baosong
- Stbench: Assessing The Ability Of Large Language Models In Spatio-temporal Analysis Li Wenbin, Yao Di, Zhao Ruibo, Chen Wenjie, Xu Zijie, Luo Chengxue, Gong Chang, Jing Quanliang, Tan Haining, Bi Jingping
- Tutorly: Turning Programming Videos Into Apprenticeship Learning Environments With Llms Li Wengxi, Pea Roy, Haber Nick, Subramonyam Hari
- Unlocking Large Language Model's Planning Capabilities With Maximum Diversity Fine-tuning Li Wenjun, Chen Changyu, Varakantham Pradeep
- 52B To 1T: Lessons Learned Via Tele-flm Series Li Xiang, Yao Yiqun, Jiang Xin, Fang Xuezhi, Wang Chao, Liu Xinzhang, Wang Zihan, Zhao Yu, Wang Xin, Huang Yuyao, Song Shuangyong, Li Yongxiang, Zhang Zheng, Zhao Bo, Sun Aixin, Wang Yequan, He Zhongjiang, Wang Zhongyuan, Li Xuelong, Huang Tiejun
- Making Long-context Language Models Better Multi-hop Reasoners Li Yanyang, Liang Shuo, Lyu Michael R., Wang Liwei
- Badedit: Backdooring Large Language Models By Model Editing Li Yanzhou, Li Tianlin, Chen Kangjie, Zhang Jian, Liu Shangqing, Wang Wenhan, Zhang Tianwei, Liu Yang
- Q-adapter: Training Your LLM Adapter As A Residual Q-function Li Yi-chen, Zhang Fuxiang, Qiu Wenjie, Yuan Lei, Jia Chengxing, Zhang Zongzhang, Yu Yang
- Womd-reasoning: A Large-scale Language Dataset For Interaction And Driving Intentions Reasoning Li Yiheng, Ge Chongjian, Li Chenran, Xu Chenfeng, Tomizuka Masayoshi, Tang Chen, Ding Mingyu, Zhan Wei
- Dynamic Data Sampler For Cross-language Transfer Learning In Large Language Models Li Yudong, Feng Yuhao, Zhou Wen, Zhao Zhe, Shen Linlin, Hou Cheng, Hou Xianxu
- Uni-moe: Scaling Unified Multimodal Llms With Mixture Of Experts Li Yunxin, Jiang Shenyuan, Hu Baotian, Wang Longyue, Zhong Wanqi, Luo Wenhan, Ma Lin, Zhang Min
- Understanding And Patching Compositional Reasoning In Llms Li Zhaoyi, Jiang Gangwei, Xie Hong, Song Linqi, Lian Defu, Wei Ying
- EAGLE: Elevating Geometric Reasoning Through Llm-empowered Visual Instruction Tuning Li Zhihao, Du Yao, Liu Yang, Zhang Yan, Liu Yufang, Zhang Mengdi, Cai Xunliang
- Cog-ga: A Large Language Models-based Generative Agent For Vision-language Navigation In Continuous Environments Li Zhiyuan, Lu Yanfeng, Mu Yao, Qiao Hong
- Effects Of Different Prompts On The Quality Of GPT-4 Responses To Dementia Care Questions Li Zhuochun, Xie Bo, Hilsabeck Robin, Aguirre Alyssa, Zou Ning, Luo Zhimeng, He Daqing
- Synthesize Step-by-step: Tools, Templates And Llms As Data Generators For Reasoning-based Chart VQA Li Zhuowan, Jasani Bhavan, Tang Peng, Ghadar Shabnam
- Implicit In-context Learning Li Zhuowei, Xu Zihao, Han Ligong, Gao Yunhe, Wen Song, Liu Di, Wang Hao, Metaxas Dimitris N.
- Two Stacks Are Better Than One: A Comparison Of Language Modeling And Translation As Multilingual Pretraining Objectives Li Zihao, Ji Shaoxiong, Mickus Timothee, Segonne Vincent, Tiedemann Jörg
- 500xcompressor: Generalized Prompt Compression For Large Language Models Li Zongqian, Su Yixuan, Collier Nigel
- What Is The Best Model? Application-driven Evaluation For Large Language Models Lian Shiguo, Zhao Kaikai, Liu Xinhui, Lei Xuejiao, Yang Bikun, Zhang Wenjing, Wang Kai, Liu Zhaoxiang
- Controllable Text Generation For Large Language Models: A Survey Liang Xun, Wang Hanyu, Wang Yezhaohui, Song Shichao, Yang Jiawei, Niu Simin, Hu Jie, Liu Dan, Yao Shunyu, Xiong Feiyu, Li Zhiyu
- Why Are My Prompts Leaked? Unraveling Prompt Extraction Threats In Customized Large Language Models Liang Zi, Hu Haibo, Ye Qingqing, Xiao Yaxin, Li Haoyang
- Apiq: Finetuning Of 2-bit Quantized Large Language Model Liao Baohao, Herold Christian, Khadivi Shahram, Monz Christof
- IKUN For WMT24 General MT Task: Llms Are Here For Multilingual Machine Translation Liao Baohao, Herold Christian, Khadivi Shahram, Monz Christof
- Extending Token Computation For LLM Reasoning Liao Bingli, Vargas Danilo Vasconcellos
- Enhancing Reinforcement Learning With Label-sensitive Reward For Natural Language Understanding Liao Kuo, Li Shuang, Zhao Meng, Liu Liqun, Xue Mengge, Hu Zhenyu, Han Honglin, Yin Chengguo
- MARIO: Math Reasoning With Code Interpreter Output -- A Reproducible Pipeline Liao Minpeng, Luo Wei, Li Chengxi, Wu Jing, Fan Kai
- MING-MOE: Enhancing Medical Multi-task Learning In Large Language Models With Sparse Mixture Of Low-rank Adapter Experts Liao Yusheng, Jiang Shuyang, Wang Yu, Wang Yanfeng
- E2LLM: Encoder Elongated Large Language Models For Long-context Understanding And Reasoning Liao Zihan, Wang Jun, Yu Hang, Wei Lingxiao, Li Jianguo, Wang Jun, Zhang Wei
- Adapting Mental Health Prediction Tasks For Cross-lingual Learning Via Meta-training And In-context Learning With Large Language Model Lifelo Zita, Ning Huansheng, Dhelim Sahraoui
- Inducing Generalization Across Languages And Tasks Using Featurized Low-rank Mixtures Lin Chu-cheng, Wang Xinyi, Clark Jonathan H., Lu Han, Zhu Yun, Whitehouse Chenxi, Yu Hongkun
- Fine-tuned Network Relies On Generic Representation To Solve Unseen Cognitive Task Lin Dongyan
- FLAME: Factuality-aware Alignment For Large Language Models Lin Sheng-chieh, Gao Luyu, Oguz Barlas, Xiong Wenhan, Lin Jimmy, Yih Wen-tau, Chen Xilun
- Optimizing Language Models For Human Preferences Is A Causal Inference Problem Lin Victoria, Ben-michael Eli, Morency Louis-philippe
- PANDA: Preference Adaptation For Enhancing Domain-specific Abilities Of Llms Liu An, Yang Zonghan, Zhang Zhenhe, Hu Qingyuan, Li Peng, Yan Ming, Zhang Ji, Huang Fei, Liu Yang
- More Than Catastrophic Forgetting: Integrating General Capabilities For Domain-specific Llms Liu Chengyuan, Wang Shihang, Kang Yangyang, Qing Lizhi, Zhao Fubang, Sun Changlong, Kuang Kun, Wu Fei
- Llmembed: Rethinking Lightweight Llm's Genuine Function In Text Classification Liu Chun, Zhang Hongguang, Zhao Kainan, Ju Xinghai, Yang Lin
- Lumina-mgpt: Illuminate Flexible Photorealistic Text-to-image Generation With Multimodal Generative Pretraining Liu Dongyang, Zhao Shitian, Zhuo Le, Lin Weifeng, Qiao Yu, Li Hongsheng, Gao Peng
- A Practice-friendly Two-stage Llm-enhanced Paradigm In Sequential Recommendation Liu Dugang, Xian Shenxian, Lin Xiaolin, Zhang Xiaolian, Zhu Hong, Fang Yuan, Chen Zhen, Ming Zhong
- Pefomed: Parameter Efficient Fine-tuning Of Multimodal Large Language Models For Medical Imaging Liu Gang, He Jinlong, Li Pengfei, He Genrong, Chen Zhaolin, Zhong Shenjun
- On The Intrinsic Self-correction Capability Of Llms: Uncertainty And Latent Concept Liu Guangliang, Mao Haitao, Cao Bochuan, Xue Zhiyu, Johnson Kristen, Tang Jiliang, Wang Rongrong
- Laida: Linguistics-aware In-context Learning With Data Augmentation For Metaphor Components Identification Liu Hongde, He Chenyuan, Meng Feiyang, Niu Changyong, Jia Yuxiang
- Advancing Adversarial Suffix Transfer Learning On Aligned Large Language Models Liu Hongfu, Xie Yuxi, Wang Ye, Shieh Michael
- Bitdelta: Your Fine-tune May Only Be Worth One Bit Liu James, Xiao Guangxuan, Li Kai, Lee Jason D., Han Song, Dao Tri, Cai Tianle
- Tiny Refinements Elicit Resilience: Toward Efficient Prefix-model Against LLM Red-teaming Liu Jiaxu, Yin Xiangyu, Wu Sihao, Wang Jianhong, Fang Meng, Yi Xinping, Huang Xiaowei
- Educating Llms Like Human Students: Structure-aware Injection Of Domain Knowledge Liu Kai, Chen Ze, Fu Zhihang, Jiang Rongxin, Zhou Fan, Chen Yaowu, Wu Yue, Ye Jieping
- Synthetic Context Generation For Question Generation Liu Naiming, Wang Zichao, Baraniuk Richard
- Proofread: Fixes All Errors With One Tap Liu Renjie, Zhang Yanxiang, Zhu Yun, Sun Haicheng, Zhang Yuanbo, Huang Michael Xuelin, Cai Shanqing, Meng Lei, Zhai Shumin
- Flashback: Efficient Retrieval-augmented Language Modeling For Long Context Inference Liu Runheng, Xiao Xingchen, Huang Heyan, Chi Zewen, Wu Zhijing
- Are Llms Effective Backbones For Fine-tuning? An Experimental Investigation Of Supervised Llms On Chinese Short Text Matching Liu Shulin, Xu Chengcheng, Liu Hao, Yu Tinghao, Yang Tao
- Making Them Ask And Answer: Jailbreaking Large Language Models In Few Queries Via Disguise And Reconstruction Liu Tong, Zhang Yingjie, Zhao Zhe, Dong Yinpeng, Meng Guozhu, Chen Kai
- Cmm-math: A Chinese Multimodal Math Dataset To Evaluate And Enhance The Mathematics Reasoning Of Large Multimodal Models Liu Wentao, Pan Qianjun, Zhang Yi, Liu Zhuo, Wu Ji, Zhou Jie, Zhou Aimin, Chen Qin, Jiang Bo, He Liang
- Extensive Self-contrast Enables Feedback-free Language Model Alignment Liu Xiao, Song Xixuan, Dong Yuxiao, Tang Jie
- Adaptive Draft-verification For Efficient Large Language Model Decoding Liu Xukun, Lei Bowen, Zhang Ruqi, Xu Dongkuan
- Datasets For Large Language Models: A Comprehensive Survey Liu Yang, Cao Jiahuan, Liu Chongyu, Ding Kai, Jin Lianwen
- Understanding Llms: A Comprehensive Overview From Training To Inference Liu Yiheng, He Hao, Han Tianle, Zhang Xu, Liu Mengyuan, Tian Jiaming, Zhang Yutong, Wang Jiaqi, Gao Xiaohui, Zhong Tianyang, Pan Yi, Xu Shaochen, Wu Zihao, Liu Zhengliang, Zhang Xin, Zhang Shu, Hu Xintao, Zhang Tuo, Qiang Ning, Liu Tianming, Ge Bao
- Let's Learn Step By Step: Enhancing In-context Learning Ability With Curriculum Learning Liu Yinpeng, Liu Jiawei, Shi Xiang, Cheng Qikai, Huang Yong, Lu Wei
- From Summary To Action: Enhancing Large Language Models For Complex Tasks With Open World Apis Liu Yulong, Yuan Yunlong, Wang Chunwei, Han Jianhua, Ma Yongqiang, Zhang Li, Zheng Nanning, Xu Hang
- Adamole: Fine-tuning Large Language Models With Adaptive Mixture Of Low-rank Adaptation Experts Liu Zefang, Luo Jiahua
- World Models With Hints Of Large Language Models For Goal Achieving Liu Zeyuan, Huan Ziyu, Wang Xiyao, Lyu Jiafei, Tao Jian, Li Xiu, Huang Furong, Xu Huazhe
- Mixture Of Insightful Experts (mote): The Synergy Of Thought Chains And Expert Mixtures In Self-alignment Liu Zhili, Gou Yunhao, Chen Kai, Hong Lanqing, Gao Jiahui, Mi Fei, Zhang Yu, Li Zhenguo, Jiang Xin, Liu Qun, Kwok James T.
- A Closer Look Into Mixture-of-experts In Large Language Models Lo Ka Man, Huang Zeyu, Qiu Zihan, Wang Zili, Fu Jie
- Loqt: Low-rank Adapters For Quantized Pre-training Loeschcke Sebastian, Toftrup Mads, Kastoryano Michael J., Belongie Serge, Snæbjarnarson Vésteinn
- Llms Are Biased Towards Output Formats! Systematically Evaluating And Mitigating Output Format Bias Of Llms Long Do Xuan, Ngoc Hai Nguyen, Sim Tiviatis, Dao Hieu, Joty Shafiq, Kawaguchi Kenji, Chen Nancy F., Kan Min-yen
- Sparser Is Faster And Less Is More: Efficient Sparse Attention For Long-range Transformers Lou Chao, Jia Zixia, Zheng Zilong, Tu Kewei
- Deepseek-vl: Towards Real-world Vision-language Understanding Lu Haoyu, Liu Wen, Zhang Bo, Wang Bingxuan, Dong Kai, Liu Bo, Sun Jingxiang, Ren Tongzheng, Li Zhuoshu, Yang Hao, Sun Yaofeng, Deng Chengqi, Xu Hanwei, Xie Zhenda, Ruan Chong
- YODA: Teacher-student Progressive Learning For Language Models Lu Jianqiao, Zhong Wanjun, Wang Yufei, Guo Zhijiang, Zhu Qi, Huang Wenyong, Wang Yanlin, Mi Fei, Wang Baojun, Wang Yasheng, Shang Lifeng, Jiang Xin, Liu Qun
- FIPO: Free-form Instruction-oriented Prompt Optimization With Preference Dataset And Modular Fine-tuning Schema Lu Junru, An Siyu, Zhang Min, He Yulan, Yin Di, Sun Xing
- Online Merging Optimizers For Boosting Rewards And Mitigating Tax In Alignment Lu Keming, Yu Bowen, Huang Fei, Fan Yang, Lin Runji, Zhou Chang
- Fine-tuning Large Language Models For Domain Adaptation: Exploration Of Training Strategies, Scaling, Model Merging And Synergistic Capabilities Lu Wei, Luu Rachel K., Buehler Markus J.
- SPP: Sparsity-preserved Parameter-efficient Fine-tuning For Large Language Models Lu Xudong, Zhou Aojun, Xu Yuhui, Zhang Renrui, Gao Peng, Li Hongsheng
- To Err Is Human, But Llamas Can Learn It Too Luhtaru Agnes, Purason Taido, Vainikko Martin, Del Maksym, Fishel Mark
- Can Perplexity Predict Fine-tuning Performance? An Investigation Of Tokenization Effects On Sequential Language Models For Nepali Luitel Nishant, Bekoju Nirajan, Sah Anand Kumar, Shakya Subarna
- Layoutllm: Layout Instruction Tuning With Large Language Models For Document Understanding Luo Chuwei, Shen Yufan, Zhu Zhaoqing, Zheng Qi, Yu Zhi, Yao Cong
- Crosstune: Black-box Few-shot Classification With Label Enhancement Luo Danqing, Zhang Chen, Zhang Yan, Li Haizhou
- Arena Learning: Build Data Flywheel For Llms Post-training Via Simulated Chatbot Arena Luo Haipeng, Sun Qingfeng, Xu Can, Zhao Pu, Lin Qingwei, Lou Jianguang, Chen Shifeng, Tang Yansong, Chen Weizhu
- Decoupled Alignment For Robust Plug-and-play Adaptation Luo Haozheng, Yu Jiahao, Zhang Wenxin, Li Jialong, Hu Jerry Yao-chieh, Xing Xinyu, Liu Han
- Kuaiji: The First Chinese Accounting Large Language Model Luo Jiayuan, Yang Songhua, Qiu Xiaoling, Chen Panyu, Nai Yufei, Zeng Wenxuan, Zhang Wentao, Jiang Xinke
- Zero-shot Cross-domain Dialogue State Tracking Via Dual Low-rank Adaptation Luo Xiang, Tang Zhiwen, Wang Jin, Zhang Xuejie
- Knowla: Enhancing Parameter-efficient Finetuning With Knowledgeable Adaptation Luo Xindi, Sun Zequn, Zhao Jing, Zhao Zhe, Hu Wei
- Ensuring Safe And High-quality Outputs: A Guideline Library Approach For Language Models Luo Yi, Lin Zhenghao, Zhang Yuhao, Sun Jiashuo, Lin Chen, Xu Chengjin, Su Xiangdong, Shen Yelong, Guo Jian, Gong Yeyun
- TRAWL: Tensor Reduced And Approximated Weights For Large Language Models Luo Yiran, Patel Het, Fu Yu, Ahn Dawon, Chen Jia, Dong Yue, Papalexakis Evangelos E.
- Reft: Reasoning With Reinforced Fine-tuning Luong Trung Quoc, Zhang Xinbo, Jie Zhanming, Sun Peng, Jin Xiaoran, Li Hang
- WILBUR: Adaptive In-context Learning For Robust And Accurate Web Agents Lutz Michael, Bohra Arth, Saroyan Manvel, Harutyunyan Artem, Campagna Giovanni
- Codeact: Code Adaptive Compute-efficient Tuning Framework For Code Llms Lv Weijie, Xia Xuan, Huang Sheng-jun
- GPT-4 Generated Narratives Of Life Events Using A Structured Narrative Prompt: A Validation Study Lynch Christopher J., Jensen Erik, Munro Madison H., Zamponi Virginia, Martinez Joseph, O'brien Kevin, Feldhaus Brandon, Smith Katherine, Reinhold Ann Marie, Gore Ross
- Keeping Llms Aligned After Fine-tuning: The Crucial Role Of Prompt Templates Lyu Kaifeng, Zhao Haoyu, Gu Xinran, Yu Dingli, Goyal Anirudh, Arora Sanjeev
- Automatic Summarization Of Doctor-patient Encounter Dialogues Using Large Language Model Through Prompt Tuning Lyu Mengxian, Peng Cheng, Li Xiaohan, Balian Patrick, Bian Jiang, Wu Yonghui
- Calibrating Large Language Models With Sample Consistency Lyu Qing, Shridhar Kumar, Malaviya Chaitanya, Zhang Li, Elazar Yanai, Tandon Niket, Apidianaki Marianna, Sachan Mrinmaya, Callison-burch Chris
- Knowtuning: Knowledge-aware Fine-tuning For Large Language Models Lyu Yougang, Yan Lingyong, Wang Shuaiqiang, Shi Haibo, Yin Dawei, Ren Pengjie, Chen Zhumin, De Rijke Maarten, Ren Zhaochun
- Sparsity-accelerated Training For Large Language Models Ma Da, Chen Lu, Wang Pengyu, Xu Hongshen, Li Hanqi, Sun Liangtai, Zhu Su, Fan Shuai, Yu Kai
- Eventlens: Leveraging Event-aware Pretraining And Cross-modal Linking Enhances Visual Commonsense Reasoning Ma Mingjie, Yu Zhihuan, Ma Yichao, Li Guohui
- Llmparser: An Exploratory Study On Using Large Language Models For Log Parsing Ma Zeyang, Chen An Ran, Kim Dong Jae, Chen Tse-hsun, Wang Shaowei
- Llamoco: Instruction Tuning Of Large Language Models For Optimization Code Generation Ma Zeyuan, Guo Hongshu, Chen Jiacheng, Peng Guojun, Cao Zhiguang, Ma Yining, Gong Yue-jiao
- LLM In-context Recall Is Prompt Dependent Machlab Daniel, Battle Rick
- Leveraging Environment Interaction For Automated PDDL Generation And Planning With Large Language Models Mahdavi Sadegh, Aoki Raquel, Tang Keyi, Cao Yanshuai
- Too Late To Train, Too Early To Use? A Study On Necessity And Viability Of Low-resource Bengali Llms Mahfuz Tamzeed, Dey Satak Kumar, Naswan Ruwad, Adil Hasnaen, Sayeed Khondker Salman, Shahgir Haz Sameen
- Exploring The Capabilities Of Prompted Large Language Models In Educational And Assessment Applications Maity Subhankar, Deroy Aniket, Sarkar Sudeshna
- COPAL: Continual Pruning In Large Language Generative Models Malla Srikanth, Choi Joon Hee, Choi Chiho
- ULLME: A Unified Framework For Large Language Model Embeddings With Generation-augmented Learning Man Hieu, Ngo Nghia Trung, Dernoncourt Franck, Nguyen Thien Huu
- Mitigating Hallucinations In Large Vision-language Models (lvlms) Via Language-contrastive Decoding (LCD) Manevich Avshalom, Tsarfaty Reut
- Coalitions Of Large Language Models Increase The Robustness Of AI Agents Mangal Prattyush, Mak Carol, Kanakis Theo, Donovan Timothy, Braines Dave, Pyzer-knapp Edward
- Tuning Llms With Contrastive Alignment Instructions For Machine Translation In Unseen, Low-resource Languages Mao Zhuoyuan, Yu Yen
- Understanding And Mitigating Language Confusion In Llms Marchisio Kelly, Ko Wei-yin, Bérard Alexandre, Dehaze Théo, Ruder Sebastian
- German Also Hallucinates! Inconsistency Detection In News Summaries With The Absinth Dataset Mascarell Laura, Chalumattu Ribin, Rios Annette
- Improving Text-to-image Consistency Via Automatic Prompt Optimization Mañas Oscar, Astolfi Pietro, Hall Melissa, Ross Candace, Urbanek Jack, Williams Adina, Agrawal Aishwarya, Romero-soriano Adriana, Drozdzal Michal
- MM1: Methods, Analysis & Insights From Multimodal LLM Pre-training Mckinzie Brandon, Gan Zhe, Fauconnier Jean-philippe, Dodge Sam, Zhang Bowen, Dufter Philipp, Shah Dhruti, Du Xianzhi, Peng Futang, Weers Floris, Belyi Anton, Zhang Haotian, Singh Karanjeet, Kang Doug, Jain Ankur, Hè Hongyu, Schwarzer Max, Gunter Tom, Kong Xiang, Zhang Aonan, Wang Jianyu, Wang Chong, Du Nan, Lei Tao, Wiseman Sam, Yin Guoli, Lee Mark, Wang Zirui, Pang Ruoming, Grasch Peter, Toshev Alexander, Yang Yinfei
- Injecting New Knowledge Into Large Language Models Via Supervised Fine-tuning Mecklenburg Nick, Lin Yiyou, Li Xiaoxiao, Holstein Daniel, Nunes Leonardo, Malvar Sara, Silva Bruno, Chandra Ranveer, Aski Vijay, Yannam Pavan Kumar Reddy, Aktas Tolga, Hendry Todd
- Large Language Models For Relevance Judgment In Product Search Mehrdad Navid, Mohapatra Hrushikesh, Bagdouri Mossaab, Chandran Prijith, Magnani Alessandro, Cai Xunfan, Puthenputhussery Ajit, Yadav Sachin, Lee Tony, Zhai Chengxiang, Liao Ciya
- Openelm: An Efficient Language Model Family With Open Training And Inference Framework Mehta Sachin, Sekhavat Mohammad Hossein, Cao Qingqing, Horton Maxwell, Jin Yanzi, Sun Chenfan, Mirzadeh Iman, Najibi Mahyar, Belenko Dmitry, Zatloukal Peter, Rastegari Mohammad
- Soda-eval: Open-domain Dialogue Evaluation In The Age Of Llms Mendonça John, Trancoso Isabel, Lavie Alon
- Superposition Prompting: Improving And Accelerating Retrieval-augmented Generation Merth Thomas, Fu Qichen, Rastegari Mohammad, Najibi Mahyar
- A Comparison Of LLM Finetuning Methods & Evaluation Metrics With Travel Chatbot Use Case Meyer Sonia, Singh Shreya, Tam Bertha, Ton Christopher, Ren Angel
- Subject-driven Text-to-image Generation Via Preference-based Reinforcement Learning Miao Yanting, Loh William, Kothawade Suraj, Poupart Pascal, Rashwan Abdullah, Li Yeqing
- Is English The New Programming Language? How About Pseudo-code Engineering? Michaelsen Gian Alexandre, Santos Renato P. Dos
- Exploration Of Masked And Causal Language Modelling For Text Generation Micheletti Nicolo, Belkadi Samuel, Han Lifeng, Nenadic Goran
- Velora: Memory Efficient Training Using Rank-1 Sub-token Projections Miles Roy, Reddy Pradyumna, Elezi Ismail, Deng Jiankang
- Large Language Models: A Survey Minaee Shervin, Mikolov Tomas, Nikzad Narjes, Chenaghlu Meysam, Socher Richard, Amatriain Xavier, Gao Jianfeng
- Aligning Query Representation With Rewritten Query And Relevance Judgments In Conversational Search Mo Fengran, Qu Chen, Mao Kelong, Wu Yihong, Su Zhan, Huang Kaiyu, Nie Jian-yun
- Convsdg: Session Data Generation For Conversational Search Mo Fengran, Yi Bole, Mao Kelong, Qu Chen, Huang Kaiyu, Nie Jian-yun
- Fight Back Against Jailbreaking Via Prompt Adversarial Tuning Mo Yichuan, Wang Yuji, Wei Zeming, Wang Yisen
- Llava-chef: A Multi-modal Generative Model For Food Recipes Mohbat Fnu, Zaki Mohammed J.
- Comparison Of Large Language Models For Generating Contextually Relevant Questions Molina Ivo Lodovico, Švábenský Valdemar, Minematsu Tsubasa, Chen Li, Okubo Fumiya, Shimada Atsushi
- Synthetic Programming Elicitation And Repair For Text-to-code In Very Low-resource Programming Languages Mora Federico, Wong Justin, Lepe Haley, Bhatia Sahil, Elmaaroufi Karim, Varghese George, Gonzalez Joseph E., Polgreen Elizabeth, Seshia Sanjit A.
- Exploring The Landscape Of Large Language Models: Foundations, Techniques, And Challenges Moradi Milad, Yan Ke, Colwell David, Samwald Matthias, Asgari Rhona
- Language Modelling Approaches To Adaptive Machine Translation Moslem Yasmin
- Are Llms Robust For Spoken Dialogues? Mousavi Seyed Mahed, Roccabruna Gabriel, Alghisi Simone, Rizzoli Massimo, Ravanelli Mirco, Riccardi Giuseppe
- Compact Language Models Via Pruning And Knowledge Distillation Muralidharan Saurav, Sreenivas Sharath Turuvekere, Joshi Raviraj, Chochowski Marcin, Patwary Mostofa, Shoeybi Mohammad, Catanzaro Bryan, Kautz Jan, Molchanov Pavlo
- Novel-wd: Exploring Acquisition Of Novel World Knowledge In Llms Using Prefix-tuning Méloux Maxime, Cerisara Christophe
- Generating Situated Reflection Triggers About Alternative Solution Paths: A Case Study Of Generative AI For Computer-supported Collaborative Learning Naik Atharva, Yin Jessica Ruhan, Kamath Anusha, Ma Qianou, Wu Sherry Tongshuang, Murray Charles, Bogart Christopher, Sakr Majd, Rose Carolyn P.
- Gemquad: Generating Multilingual Question Answering Datasets From Large Language Models Using Few Shot Learning Namboori Amani, Mangale Shivam, Rosenbaum Andy, Soltan Saleh
- Using Llms To Model The Beliefs And Preferences Of Targeted Populations Namikoshi Keiichi, Filipowicz Alex, Shamma David A., Iliev Rumen, Hogan Candice L., Arechiga Nikos
- Review-feedback-reason (refer): A Novel Framework For NLG Evaluation And Reasoning Narsupalli Yaswanth, Chandra Abhranil, Muppirala Sreevatsa, Gupta Manish, Goyal Pawan
- Word2world: Generating Stories And Worlds Through Large Language Models Nasir Muhammad U., James Steven, Togelius Julian
- Learning Goal-conditioned Representations For Language Reward Models Nath Vaskar, Slack Dylan, Da Jeff, Ma Yuntao, Zhang Hugh, Whitehead Spencer, Hendryx Sean
- MBBQ: A Dataset For Cross-lingual Comparison Of Stereotypes In Generative Llms Neplenbroek Vera, Bisazza Arianna, Fernández Raquel
- Towards Modeling Learner Performance With Large Language Models Neshaei Seyed Parsa, Davis Richard Lee, Hazimeh Adam, Lazarevski Bojan, Dillenbourg Pierre, Käser Tanja
- What Drives Performance In Multilingual Language Models? Nezhad Sina Bagheri, Agrawal Ameeta
- Balancing Exploration And Exploitation In LLM Using Soft RLLF For Enhanced Negation Understanding Nguyen Ha-thanh, Satoh Ken
- Better Alignment With Instruction Back-and-forth Translation Nguyen Thao, Li Jeffrey, Oh Sewoong, Schmidt Ludwig, Weston Jason, Zettlemoyer Luke, Li Xian
- Xwin-lm: Strong And Scalable Alignment Practice For Llms Ni Bolin, Hu Jingcheng, Wei Yixuan, Peng Houwen, Zhang Zheng, Meng Gaofeng, Hu Han
- Panza: A Personalized Text Writing Assistant Via Data Playback And Local Fine-tuning Nicolicioiu Armand, Iofinova Eugenia, Kurtic Eldar, Nikdan Mahdi, Panferov Andrei, Markov Ilia, Shavit Nir, Alistarh Dan
- Flawn-t5: An Empirical Examination Of Effective Instruction-tuning Data Mixtures For Legal Reasoning Niklaus Joel, Zheng Lucia, Mccarthy Arya D., Hahn Christopher, Rosen Brian M., Henderson Peter, Ho Daniel E., Honke Garrett, Liang Percy, Manning Christopher
- Vikhr: The Family Of Open-source Instruction-tuned Large Language Models For Russian Nikolich Aleksandr, Korolev Konstantin, Shelmanov Artem, Kiselev Igor
- WTU-EVAL: A Whether-or-not Tool Usage Evaluation Benchmark For Large Language Models Ning Kangyun, Su Yisong, Lv Xueqiang, Zhang Yuanzhe, Liu Jian, Liu Kang, Xu Jinan
- Can Llms Learn By Teaching? A Preliminary Study Ning Xuefei, Wang Zifu, Li Shiyao, Lin Zinan, Yao Peiran, Fu Tianyu, Blaschko Matthew B., Dai Guohao, Yang Huazhong, Wang Yu
- Quantified Task Misalignment To Inform PEFT: An Exploration Of Domain Generalization And Catastrophic Forgetting In CLIP Niss Laura, Vogt-lowell Kevin, Tsiligkaridis Theodoros
- Enhancing Dialogue State Tracking Models Through Llm-backed User-agents Simulation Niu Cheng, Wang Xingguang, Cheng Xuxin, Song Juntong, Zhang Tong
- PARAMANU-GANITA: Language Model With Mathematical Capabilities Niyogi Mitodru, Bhattacharya Arnab
- Language Portability Strategies For Open-domain Dialogue With Pre-trained Language Models From High To Low Resource Languages Njifenjou Ahmed, Sucal Virgile, Jabaian Bassam, Lefèvre Fabrice
- Role-play Zero-shot Prompting With Large Language Models For Open-domain Human-machine Conversation Njifenjou Ahmed, Sucal Virgile, Jabaian Bassam, Lefèvre Fabrice
- Fine-tuning With HED-IT: The Impact Of Human Post-editing For Dialogical Language Models Occhipinti Daniela, Marchi Michele, Mondella Irene, Lai Huiyuan, Dell'orletta Felice, Nissim Malvina, Guerini Marco
- Robust Adaptation Of Foundation Models With Black-box Visual Prompting Oh Changdae, Seo Gyeongdeok, Jung Geunyoung, Cheng Zhi-qi, Choi Hosik, Jung Jiyoung, Song Kyungwoo
- Crafting Efficient Fine-tuning Strategies For Large Language Models Oliver Michael, Wang Guan
- Routellm: Learning To Route Llms With Preference Data Ong Isaac, Almahairi Amjad, Wu Vincent, Chiang Wei-lin, Wu Tianhao, Gonzalez Joseph E., Kadous M Waleed, Stoica Ion
- Jamba: A Hybrid Transformer-mamba Language Model Lieber Opher, Lenz Barak, Bata Hofit, Cohen Gal, Osin Jhonathan, Dalmedigos Itay, Safahi Erez, Meirom Shaked, Belinkov Yonatan, Shalev-shwartz Shai, Abend Omri, Alon Raz, Asida Tomer, Bergman Amir, Glozman Roman, Gokhman Michael, Manevich Avashalom, Ratner Nir, Rozen Noam, Shwartz Erez, Zusman Mor, Shoham Yoav
- Prompting Large Vision-language Models For Compositional Reasoning Ossowski Timothy, Jiang Ming, Hu Junjie
- Towards Modular Llms By Building And Reusing A Library Of Loras Ostapenko Oleksiy, Su Zhan, Ponti Edoardo Maria, Charlin Laurent, Roux Nicolas Le, Pereira Matheus, Caccia Lucas, Sordoni Alessandro
- Towards Cross-lingual Explanation Of Artwork In Large-scale Vision Language Models Ozaki Shintaro, Hayashi Kazuki, Sakai Yusuke, Kamigaito Hidetaka, Hayashi Katsuhiko, Watanabe Taro
- Training Llms To Recognize Hedges In Spontaneous Narratives Paige Amie J., Soubki Adil, Murzaku John, Rambow Owen, Brennan Susan E.
- Investigating LLM Applications In E-commerce Palen-michel Chester, Wang Ruixiang, Zhang Yipeng, Yu David, Xu Canran, Wu Zhe
- LISA: Layerwise Importance Sampling For Memory-efficient Large Language Model Fine-tuning Pan Rui, Liu Xiang, Diao Shizhe, Pi Renjie, Zhang Jipeng, Han Chi, Zhang Tong
- POMP: Probability-driven Meta-graph Prompter For Llms In Low-resource Unsupervised Neural Machine Translation Pan Shilong, Tian Zhiliang, Ding Liang, Huang Zhen, Wen Zhihua, Li Dongsheng
- AILS-NTUA At Semeval-2024 Task 9: Cracking Brain Teasers: Transformer Models For Lateral Thinking Puzzles Panagiotopoulos Ioannis, Filandrianos Giorgos, Lymperaiou Maria, Stamou Giorgos
- Lottery Ticket Adaptation: Mitigating Destructive Interference In Llms Panda Ashwinee, Isik Berivan, Qi Xiangyu, Koyejo Sanmi, Weissman Tsachy, Mittal Prateek
- Phased Instruction Fine-tuning For Large Language Models Pang Wei, Zhou Chuan, Zhou Xiao-hua, Wang Xiaojie
- LLM Evaluators Recognize And Favor Their Own Generations Panickssery Arjun, Bowman Samuel R., Feng Shi
- Learning To See But Forgetting To Follow: Visual Instruction Tuning Makes Llms More Prone To Jailbreak Attacks Pantazopoulos Georgios, Parekh Amit, Nikandrou Malvina, Suglia Alessandro
- Measuring Memorization In RLHF For Code Completion Pappu Aneesh, Porter Billy, Shumailov Ilia, Hayes Jamie
- Low-resource Cross-lingual Summarization Through Few-shot Learning With Large Language Models Park Gyutae, Hwang Seojin, Lee Hwanhee
- Loramap: Harnessing The Power Of Lora Connections Park Hyeryun, Kwak Jeongwon, Jang Dongsuk, Park Sumin, Choi Jinwook
- Offsetbias: Leveraging Debiased Data For Tuning Evaluators Park Junsoo, Jwa Seungyeon, Ren Meiying, Kim Daeyoung, Choi Sanghyuk
- Token-picker: Accelerating Attention In Text Generation With Minimized Memory Transfer Via Probability Estimation Park Junyoung, Kang Myeonggu, Han Yunki, Kim Yanggon, Shin Jaekang, Kim Lee-sup
- Textboost: Towards One-shot Personalization Of Text-to-image Models Via Fine-tuning Text Encoder Park Nahyeon, Kim Kunhee, Shim Hyunjung
- The Ultimate Guide To Fine-tuning Llms From Basics To Breakthroughs: An Exhaustive Review Of Technologies, Research, Best Practices, Applied Research Challenges And Opportunities Parthasarathy Venkatesh Balavadhani, Zafar Ahtsham, Khan Aafaq, Shahid Arsalan
- Decoding News Narratives: A Critical Analysis Of Large Language Models In Framing Detection Pastorino Valeria, Sivakumar Jasivan A., Moosavi Nafise Sadat
- Large Language Models Can Self-improve At Web Agent Tasks Patel Ajay, Hofmarcher Markus, Leoveanu-condrei Claudiu, Dinu Marius-constantin, Callison-burch Chris, Hochreiter Sepp
- Making Reasoning Matter: Measuring And Improving Faithfulness Of Chain-of-thought Reasoning Paul Debjit, West Robert, Bosselut Antoine, Faltings Boi
- Advprompter: Fast Adaptive Adversarial Prompting For Llms Paulus Anselm, Zharmagambetov Arman, Guo Chuan, Amos Brandon, Tian Yuandong
- Autonomous Workflow For Multimodal Fine-grained Training Assistants Towards Mixed Reality Pei Jiahuan, Viola Irene, Huang Haochen, Wang Junxiao, Ahsan Moonisa, Ye Fanghua, Yiming Jiang, Sai Yao, Wang Di, Chen Zhumin, Ren Pengjie, Cesar Pablo
- Automated Multi-language To English Machine Translation Using Generative Pre-trained Transformers Pelofske Elijah, Urias Vincent, Liebrock Lorie M.
- Answer Is All You Need: Instruction-following Text Embedding Via Answering The Question Peng Letian, Zhang Yuwei, Wang Zilong, Srinivasa Jayanth, Liu Gaowen, Wang Zihan, Shang Jingbo
- Review-llm: Harnessing Large Language Models For Personalized Review Generation Peng Qiyao, Liu Hongtao, Xu Hongyan, Yang Qing, Shao Minglai, Wang Wenjun
- Uncertainty-aware Explainable Recommendation With Large Language Models Peng Yicui, Chen Hao, Lin Chingsheng, Huang Guo, Hu Jinrong, Guo Hui, Kong Bin, Hu Shu, Wu Xi, Wang Xin
- Q-PEFT: Query-dependent Parameter Efficient Fine-tuning For Text Reranking With Large Language Models Peng Zhiyuan, Wu Xuyang, Wang Qifan, Rajanala Sravanthi, Fang Yi
- Astrollama-chat: Scaling Astrollama With Conversational And Diverse Datasets Perkowski Ernest, Pan Rui, Nguyen Tuan Dung, Ting Yuan-sen, Kruk Sandor, Zhang Tong, O'neill Charlie, Jablonska Maja, Sun Zechang, Smith Michael J., Liu Huiling, Schawinski Kevin, Iyer Kartheik, Ciucă Ioana (for UniverseTBD)
- Analyzing Large Language Models Chatbots: An Experimental Approach Using A Probability Test Peruchini Melise, Teixeira Julio Monteiro
- Prompting A Pretrained Transformer Can Be A Universal Approximator Petrov Aleksandar, Torr Philip H. S., Bibi Adel
- Assessing The Emergent Symbolic Reasoning Abilities Of Llama Large Language Models Petruzzellis Flavio, Testolin Alberto, Sperduti Alessandro
- Optimizing Large Language Models For Openapi Code Completion Petryshyn Bohdan, Lukoševičius Mantas
- H2o-danube3 Technical Report Pfeiffer Pascal, Singer Philipp, Babakhin Yauhen, Fodor Gabor, Dhankhar Nischay, Ambati Sri Satish
- Suri: Multi-constraint Instruction Following For Long-form Text Generation Pham Chau Minh, Sun Simeng, Iyyer Mohit
- Towards Reliable Medical Question Answering: Techniques And Challenges In Mitigating Hallucinations In Language Models Pham Duy Khoa, Vo Bao Quoc
- Mllm-protector: Ensuring Mllm's Safety Without Hurting Performance Pi Renjie, Han Tianyang, Zhang Jianshu, Xie Yueqi, Pan Rui, Lian Qing, Dong Hanze, Zhang Jipeng, Zhang Tong
- Beyond Metrics: A Critical Analysis Of The Variability In Large Language Model Evaluation Frameworks Pimentel Marco Af, Christophe Clément, Raha Tathagata, Munjal Prateek, Kanithi Praveen K, Khan Shadab
- Doccgen: Document-based Controlled Code Generation Pimparkhede Sameer, Kammakomati Mehant, Tamilselvam Srikanth, Kumar Prince, Kumar Ashok Pon, Bhattacharyya Pushpak
- Sbora: Low-rank Adaptation With Regional Weight Updates Po Lai-man, Liu Yuyang, Wu Haoxuan, Zhang Tianqi, Yu Wing-yin, Jiang Zeyu, Li Kun
- V-RECS, A Low-cost LLM4VIS Recommender With Explanations, Captioning And Suggestions Podo Luca, Angelini Marco, Velardi Paola
- Advanced Natural-based Interaction For The Italian Language: Llamantino-3-anita Polignano Marco, Basile Pierpaolo, Semeraro Giovanni
- Evaluation Of Language Models In The Medical Context Under Resource-constrained Settings Posada Andrea, Rueckert Daniel, Meissen Felix, Müller Philip
- Documint: Docstring Generation For Python Using Small Language Models Poudel Bibek, Cook Adam, Traore Sekou, Ameli Shelah
- Machine Translation With Large Language Models: Prompt Engineering For Persian, English, And Russian Directions Pourkamali Nooshin, Sharifi Shler Ebrahim
- The Silent Curriculum: How Does LLM Monoculture Shape Educational Content And Its Accessibility? Priyanshu Aman, Vijay Supriti
- Customized Retrieval Augmented Generation And Benchmarking For EDA Tool Documentation QA Pu Yuan, He Zhuolun, Qiu Tairu, Wu Haoyuan, Yu Bei
- Fine-tuning With Divergent Chains Of Thought Boosts Reasoning Through Self-correction In Language Models Puerto Haritz, Chubakov Tilek, Zhu Xiaodan, Madabushi Harish Tayyar, Gurevych Iryna
- Less Is More: Accurate Speech Recognition & Translation Without Web-scale Data Puvvada Krishna C., Żelasko Piotr, Huang He, Hrinchuk Oleksii, Koluguri Nithin Rao, Dhawan Kunal, Majumdar Somshubra, Rastorgueva Elena, Chen Zhehuai, Lavrukhin Vitaly, Balam Jagadeesh, Ginsburg Boris
- Is Next Token Prediction Sufficient For GPT? Exploration On Code Logic Comprehension Qi Mengnan, Huang Yufan, Yao Yongqiang, Wang Maoquan, Gu Bin, Sundaresan Neel
- Safety Alignment Should Be Made More Than Just A Few Tokens Deep Qi Xiangyu, Panda Ashwinee, Lyu Kaifeng, Ma Xiao, Roy Subhrajit, Beirami Ahmad, Mittal Prateek, Henderson Peter
- ADELIE: Aligning Large Language Models On Information Extraction Qi Yunjia, Peng Hao, Wang Xiaozhi, Xu Bin, Hou Lei, Li Juanzi
- Mutual Reasoning Makes Smaller Llms Stronger Problem-solvers Qi Zhenting, Ma Mingyuan, Xu Jiahang, Zhang Li Lyna, Yang Fan, Yang Mao
- Towards Tracing Trustworthiness Dynamics: Revisiting Pre-training Period Of Large Language Models Qian Chen, Zhang Jie, Yao Wei, Liu Dongrui, Yin Zhenfei, Qiao Yu, Liu Yong, Shao Jing
- Mia-bench: Towards Better Instruction Following Evaluation Of Multimodal Llms Qian Yusu, Ye Hanrong, Fauconnier Jean-philippe, Grasch Peter, Yang Yinfei, Gan Zhe
- Prompt Perturbation Consistency Learning For Robust Language Models Qiang Yao, Nandi Subhrangshu, Mehrabi Ninareh, Steeg Greg Ver, Kumar Anoop, Rumshisky Anna, Galstyan Aram
- The Uniqueness Of Llama3-70b With Per-channel Quantization: An Empirical Study Qin Minghai
- Empirical Guidelines For Deploying Llms Onto Resource-constrained Edge Devices Qin Ruiyang, Liu Dancheng, Yan Zheyu, Tan Zhaoxuan, Pan Zixuan, Jia Zhenge, Jiang Meng, Abbasi Ahmed, Xiong Jinjun, Shi Yiyu
- Exploring The Nexus Of Large Language Models And Legal Systems: A Short Survey Qin Weicong, Sun Zhongxiang
- Enhancing Sequential Recommendations Through Multi-perspective Reflections And Iteration Qin Weicong, Xu Yi, Yu Weijie, Shen Chenglei, Zhang Xiao, He Ming, Fan Jianping, Xu Jun
- Coursegpt-zh: An Educational Large Language Model Based On Knowledge Distillation Incorporating Prompt Optimization Qu Zheyan, Yin Lu, Yu Zitong, Wang Wenbo, Zhang Xing
- Automatically Generating Numerous Context-driven SFT Data For Llms Across Diverse Granularity Quan Shanghaoran
- Simultaneous Masking, Not Prompting Optimization: A Paradigm Shift In Fine-tuning Llms For Simultaneous Translation Raffel Matthew, Agostinelli Victor, Chen Lizhong
- Medit: Multilingual Text Editing Via Instruction Tuning Raheja Vipul, Alikaniotis Dimitris, Kulkarni Vivek, Alhafni Bashar, Kumar Dhruv
- V-zen: Efficient GUI Understanding And Precise Grounding With A Novel Multimodal LLM Rahman Abdur, Chawla Rajat, Kumar Muskaan, Datta Arkajit, Jha Adarsh, Ns Mukunda, Bhola Ishaan
- GSR-BENCH: A Benchmark For Grounded Spatial Reasoning Evaluation Via Multimodal Llms Rajabi Navid, Kosecka Jana
- Qdylora: Quantized Dynamic Low-rank Adaptation For Efficient Large Language Model Tuning Rajabzadeh Hossein, Valipour Mojtaba, Zhu Tianshu, Tahaei Marzieh, Kwon Hyock Ju, Ghodsi Ali, Chen Boxing, Rezagholizadeh Mehdi
- ANCHOR: Llm-driven News Subject Conditioning For Text-to-image Synthesis Ramakrishnan Aashish Anantha, Huang Sharon X., Lee Dongwon
- Self-refinement Of Language Models From External Proxy Metrics Feedback Ramji Keshav, Lee Young-suk, Astudillo Ramón Fernandez, Sultan Md Arafat, Naseem Tahira, Munawar Asim, Florian Radu, Roukos Salim
- Self-refine Instruction-tuning For Aligning Reasoning In Language Models Ranaldi Leonardo, Freitas Andrè
- Learning To Localize Objects Improves Spatial Reasoning In Visual-llms Ranasinghe Kanchana, Shukla Satya Narayan, Poursaeed Omid, Ryoo Michael S., Lin Tsung-yu
- A Fine-tuning Enhanced RAG System With Quantized Influence Measure As AI Judge Rangan Keshav, Yin Yiqiao
- SPARQL Generation: An Analysis On Fine-tuning Openllama For Question Answering Over A Life Science Knowledge Graph Rangel Julio C., De Farias Tarcisio Mendes, Sima Ana Claudia, Kobayashi Norio
- RAVEN: Multitask Retrieval Augmented Vision-language Learning Rao Varun Nagaraj, Choudhary Siddharth, Deshpande Aditya, Satzoda Ravi Kumar, Appalaraju Srikar
- A Critical Look At Tokenwise Reward-guided Text Generation Rashid Ahmad, Wu Ruotian, Grosse Julia, Kristiadi Agustinus, Poupart Pascal
- Developing Safe And Responsible Large Language Model: Can We Balance Bias Reduction And Language Understanding In Large Language Models? Raza Shaina, Bamgbose Oluwanifemi, Ghuge Shardul, Tavakol Fatemeh, Reji Deepak John, Bashir Syed Raza
- Beads: Bias Evaluation Across Domains Raza Shaina, Rahman Mizanur, Zhang Michael R.
- MBIAS: Mitigating Bias In Large Language Models While Retaining Context Raza Shaina, Raval Ananya, Chatrath Veronica
- Analyzing And Adapting Large Language Models For Few-shot Multilingual NLU: Are We There Yet? Razumovskaia Evgeniia, Vulić Ivan, Korhonen Anna
- Retrieval-augmented Generation: Is Dense Passage Retrieval Retrieving? Reichman Benjamin, Heck Larry
- Melora: Mini-ensemble Low-rank Adapters For Parameter-efficient Fine-tuning Ren Pengjie, Shi Chengshun, Wu Shiguang, Zhang Mengqi, Ren Zhaochun, De Rijke Maarten, Chen Zhumin, Pei Jiahuan
- Codeattack: Revealing Safety Generalization Challenges Of Large Language Models Via Code Completion Ren Qibing, Gao Chang, Shao Jing, Yan Junchi, Tan Xin, Lam Wai, Ma Lizhuang
- Analyzing And Reducing Catastrophic Forgetting In Parameter Efficient Tuning Ren Weijieying, Li Xinlong, Wang Lei, Zhao Tianxiang, Qin Wei
- I Learn Better If You Speak My Language: Understanding The Superior Performance Of Fine-tuning Large Language Models With Llm-generated Responses Ren Xuan, Wu Biao, Liu Lingqiao
- Healthcare Copilot: Eliciting The Power Of General Llms For Medical Consultation Ren Zhiyao, Zhan Yibing, Yu Baosheng, Ding Liang, Tao Dacheng
- Are Electra's Sentence Embeddings Beyond Repair? The Case Of Semantic Textual Similarity Rep Ivan, Dukić David, Šnajder Jan
- How Multilingual Are Large Language Models Fine-tuned For Translation? Richburg Aquia, Carpuat Marine
- Instruction-tuned Large Language Models For Machine Translation In The Medical Domain Rios Miguel
- Exploring Advanced Large Language Models With Llmsuite Roffo Giorgio
- Achieving Peak Performance For Large Language Models: A Systematic Review Rostam Zhyar Rzgar K, Szénási Sándor, Kertész Gábor
- Generative AI For Enhancing Active Learning In Education: A Comparative Study Of GPT-3.5 And GPT-4 In Crafting Customized Test Questions Rouzegar Hamdireza, Makrehchi Masoud
- Fist-financial Style Transfer With Hallucination And Creativity Control Framework Roychowdhury Sohini, Krema Marko, Moore Brian, Lai Xingjian, Effedua Dike, Jethwani Bharat
- Evaluation Of RAG Metrics For Question Answering In The Telecom Domain Roychowdhury Sujoy, Soman Sumit, Ranjani H G, Gunda Neeraj, Chhabra Vansh, Bala Sai Krishna
- Efficient Language Adaptive Pre-training: Extending State-of-the-art Large Language Models For Polish Ruciński Szymon
- From PEFT To DEFT: Parameter Efficient Finetuning For Reducing Activation Density In Transformers Runwal Bharat, Pedapati Tejaswini, Chen Pin-yu
- Explorations Of Self-repair In Language Models Rushing Cody, Nanda Neel
- Writing In The Margins: Better Inference Pattern For Long Context Retrieval Russak Melisa, Jamil Umar, Bryant Christopher, Kamble Kiran, Magnuson Axel, Russak Mateusz, Alshikh Waseem
- In-context Learning Demonstration Selection Via Influence Analysis S. Vinay M., Van Minh-hao, Wu Xintao
- Utebc-nlp At Semeval-2024 Task 9: Can Llms Be Lateral Thinkers? Sadeghi Pouya, Abaskohi Amirhossein, Yaghoobzadeh Yadollah
- Insights Into Alignment: Evaluating DPO And Its Variants Across Multiple Tasks Saeidi Amir, Verma Shivanshu, Baral Chitta
- On Zero-shot Counterspeech Generation By Llms Saha Punyajoy, Agrawal Aalok, Jana Abhik, Biemann Chris, Mukherjee Animesh
- Where Is The Answer? Investigating Positional Bias In Language Model Knowledge Extraction Saito Kuniaki, Sohn Kihyuk, Lee Chen-yu, Ushiku Yoshitaka
- Algorithmic Language Models With Neurally Compiled Libraries Saldyt Lucas, Kambhampati Subbarao
- Rainbow Teaming: Open-ended Generation Of Diverse Adversarial Prompts Samvelyan Mikayel, Raparthy Sharath Chandra, Lupu Andrei, Hambro Eric, Markosyan Aram H., Bhatt Manish, Mao Yuning, Jiang Minqi, Parker-holder Jack, Foerster Jakob, Rocktäschel Tim, Raileanu Roberta
- Zero-shot RTL Code Generation With Attention Sink Augmented Large Language Models Sandal Selim, Akturk Ismail
- Improving Sentence Embeddings With Automatic Generation Of Training Data Using Few-shot Examples Sato Soma, Tsukagoshi Hayato, Sasano Ryohei, Takeda Koichi
- Robust Few-shot Transfer Learning For Knowledge Base Question Answering With Unanswerable Questions Sawhney Riya, Bhattacharya Indrajit, Mausam
- Towards Faithful And Robust LLM Specialists For Evidence-based Question-answering Schimanski Tobias, Ni Jingwei, Kraus Mathias, Ash Elliott, Leippold Markus
- Ai-augmented Predictions: LLM Assistants Improve Human Forecasting Accuracy Schoenegger Philipp, Park Peter S., Karger Ezra, Trott Sean, Tetlock Philip E.
- Soft Prompt Threats: Attacking Safety Alignment And Unlearning In Open-source Llms Through The Embedding Space Schwinn Leo, Dobre David, Xhonneux Sophie, Gidel Gauthier, Günnemann Stephan
- Generative Explore-exploit: Training-free Optimization Of Generative Recommender Systems Using LLM Optimizers Senel Lütfi Kerem, Fetahu Besnik, Yoshida Davis, Chen Zhiyu, Castellucci Giuseppe, Vedula Nikhita, Choi Jason, Malmasi Shervin
- Surprising Efficacy Of Fine-tuned Transformers For Fact-checking Over Larger Language Models Setty Vinay
- Detection And Measurement Of Syntactic Templates In Generated Text Shaib Chantal, Elazar Yanai, Li Junyi Jessy, Wallace Byron C.
- Show, Don't Tell: Aligning Language Models With Demonstrated Feedback Shaikh Omar, Lam Michelle, Hejna Joey, Shao Yijia, Bernstein Michael, Yang Diyi
- Flexibly Scaling Large Language Models Contexts Through Extensible Tokenization Shao Ninglu, Xiao Shitao, Liu Zheng, Zhang Peitian
- Guiding In-context Learning Of Llms Through Quality Estimation For Machine Translation Sharami Javad Pourmostafa Roshan, Shterionov Dimitar, Spronck Pieter
- A Critical Evaluation Of AI Feedback For Aligning Large Language Models Sharma Archit, Keh Sedrick, Mitchell Eric, Finn Chelsea, Arora Kushal, Kollar Thomas
- Patched MOA: Optimizing Inference For Diverse Software Development Tasks Sharma Asankhaya
- Accurate And Efficient Fine-tuning Of Quantized Large Language Models Through Optimal Balance Shen Ao, Wang Qiang, Lai Zhiquan, Li Xionglve, Li Dongsheng
- Nemo-aligner: Scalable Toolkit For Efficient Model Alignment Shen Gerald, Wang Zhilin, Delalleau Olivier, Zeng Jiaqi, Dong Yi, Egert Daniel, Sun Shengyang, Zhang Jimmy, Jain Sahil, Taghibakhshi Ali, Ausin Markel Sanz, Aithal Ashwath, Kuchaiev Oleksii
- The Language Barrier: Dissecting Safety Challenges Of Llms In Multilingual Contexts Shen Lingfeng, Tan Weiting, Chen Sihao, Chen Yunmo, Zhang Jingyu, Xu Haoran, Zheng Boyuan, Koehn Philipp, Khashabi Daniel
- A Fine-tuning Dataset And Benchmark For Large Language Models For Protein Understanding Shen Yiqing, Chen Zan, Mamalakis Michail, He Luhan, Xia Haiyang, Li Tianbin, Su Yanzhou, He Junjun, Wang Yu Guang
- Toursynbio: A Multi-modal Large Model And Agent Framework To Bridge Text And Protein Sequences For Protein Engineering Shen Yiqing, Chen Zan, Mamalakis Michail, Liu Yungeng, Li Tianbin, Su Yanzhou, He Junjun, Liò Pietro, Wang Yu Guang
- Using Chatgpt To Score Essays And Short-form Constructed Responses Shermis Mark D.
- Latent Adversarial Training Improves Robustness To Persistent Harmful Behaviors In Llms Sheshadri Abhay, Ewart Aidan, Guo Phillip, Lynch Aengus, Wu Cindy, Hebbar Vivek, Sleight Henry, Stickland Asa Cooper, Perez Ethan, Hadfield-menell Dylan, Casper Stephen
- Neurosymbolic AI For Enhancing Instructability In Generative AI Sheth Amit, Pallagani Vishal, Roy Kaushik
- Continual Learning Of Large Language Models: A Comprehensive Survey Shi Haizhou, Xu Zihao, Wang Hengyi, Qin Weiyi, Wang Wenyuan, Wang Yibin, Wang Zifeng, Ebrahimi Sayna, Wang Hao
- Eagle: Exploring The Design Space For Multimodal Llms With Mixture Of Encoders Shi Min, Liu Fuxiao, Wang Shihao, Liao Shijia, Radhakrishnan Subhashree, Huang De-an, Yin Hongxu, Sapra Karan, Yacoob Yaser, Shi Humphrey, Catanzaro Bryan, Tao Andrew, Kautz Jan, Yu Zhiding, Liu Guilin
- Reslora: Identity Residual Mapping In Low-rank Adaption Shi Shuhua, Huang Shaohan, Song Minghui, Li Zhoujun, Zhang Zihan, Huang Haizhen, Wei Furu, Deng Weiwei, Sun Feng, Zhang Qi
- Math-llava: Bootstrapping Mathematical Reasoning For Multimodal Large Language Models Shi Wenhao, Hu Zhiqiang, Bin Yi, Liu Junhua, Yang Yang, Ng See-kiong, Bing Lidong, Lee Roy Ka-wei
- Medadapter: Efficient Test-time Adaptation Of Large Language Models Towards Medical Reasoning Shi Wenqi, Xu Ran, Zhuang Yuchen, Yu Yue, Wu Hang, Yang Carl, Wang May D.
- Can Prompt Modifiers Control Bias? A Comparative Analysis Of Text-to-image Generative Models Shin Philip Wootaek, Ahn Jihyun Janice, Yin Wenpeng, Sampson Jack, Narayanan Vijaykrishnan
- FIRST: Teach A Reliable Large Language Model Through Efficient Trustworthy Distillation Shum Kashun, Xu Minrui, Zhang Jianshu, Chen Zixin, Diao Shizhe, Dong Hanze, Zhang Jipeng, Raza Muhammad Omer
- Iclguard: Controlling In-context Learning Behavior For Applicability Authorization Si Wai Man, Backes Michael, Zhang Yang
- Where Does In-context Translation Happen In Large Language Models Sia Suzanna, Mueller David, Duh Kevin
- H2o-danube-1.8b Technical Report Singer Philipp, Pfeiffer Pascal, Babakhin Yauhen, Jeblick Maximilian, Dhankhar Nischay, Fodor Gabor, Ambati Sri Satish
- Aya Dataset: An Open-access Collection For Multilingual Instruction Tuning Singh Shivalika, Vargus Freddie, Dsouza Daniel, Karlsson Börje F., Mahendiran Abinaya, Ko Wei-yin, Shandilya Herumb, Patel Jay, Mataciunas Deividas, Omahony Laura, Zhang Mike, Hettiarachchi Ramith, Wilson Joseph, Machado Marina, Moura Luisa Souza, Krzemiński Dominik, Fadaei Hakimeh, Ergün Irem, Okoh Ifeoma, Alaagib Aisha, Mudannayake Oshan, Alyafeai Zaid, Chien Vu Minh, Ruder Sebastian, Guthikonda Surya, Alghamdi Emad A., Gehrmann Sebastian, Muennighoff Niklas, Bartolo Max, Kreutzer Julia, Üstün Ahmet, Fadaee Marzieh, Hooker Sara
- Evaluating Zero-shot GPT-4V Performance On 3D Visual Question Answering Benchmarks Singh Simranjit, Pavlakos Georgios, Stamoulis Dimitrios
- Llava Finds Free Lunch: Teaching Human Behavior Improves Content Understanding Abilities Of Llms Singh Somesh, S Harini I, Singla Yaman K, Baths Veeky, Shah Rajiv Ratn, Chen Changyou, Krishnamurthy Balaji
- An Empirical Study Of Validating Synthetic Data For Formula Generation Singh Usneek, Cambronero José, Gulwani Sumit, Kanade Aditya, Khatry Anirudh, Le Vu, Singh Mukul, Verbruggen Gust
- LGR2: Language Guided Reward Relabeling For Accelerating Hierarchical Reinforcement Learning Singh Utsav, Bhattacharyya Pramit, Namboodiri Vinay P.
- A Three-pronged Approach To Cross-lingual Adaptation With Multilingual Llms Singh Vaibhav, Krishna Amrith, Nj Karthika, Ramakrishnan Ganesh
- Maml-en-llm: Model Agnostic Meta-training Of Llms For Improved In-context Learning Sinha Sanchit, Yue Yuguang, Soto Victor, Kulkarni Mayank, Lu Jianhua, Zhang Aidong
- Unipa-gpt: Large Language Models For University-oriented QA In Italian Siragusa Irene, Pirrone Roberto
- Gistembed: Guided In-sample Selection Of Training Negatives For Text Embedding Fine-tuning Solatorio Aivin V.
- A Statistical Framework For Weak-to-strong Generalization Somerstep Seamus, Polo Felipe Maia, Banerjee Moulinath, Ritov Ya'acov, Yurochkin Mikhail, Sun Yuekai
- A New Pipeline For Generating Instruction Dataset Via RAG And Self Fine-tuning Song Chih-wei, Lee Yu-kai, Tsai Yin-te
- Hyacinth6b: A Large Language Model For Traditional Chinese Song Chih-wei, Tsai Yin-te
- Scaling Data Diversity For Fine-tuning Language Models In Human Alignment Song Feifan, Yu Bowen, Lang Hao, Yu Haiyang, Huang Fei, Wang Houfeng, Li Yongbin
- Increasing Model Capacity For Free: A Simple Strategy For Parameter Efficient Fine-tuning Song Haobo, Zhao Hao, Majumder Soumajit, Lin Tao
- Out-of-distribution Generalization Via Composition: A Lens Through Induction Heads In Transformers Song Jiajun, Xu Zhuoyan, Zhong Yiqiao
- Position: Leverage Foundational Models For Black-box Optimization Song Xingyou, Tian Yingtao, Lange Robert Tjarko, Lee Chansoo, Tang Yujin, Chen Yutian
- Sharelora: Parameter Efficient And Robust Large Language Model Fine-tuning Via Shared Low-rank Adaptation Song Yurun, Zhao Junchen, Harris Ian G., Jyothi Sangeetha Abdu
- Alchemistcoder: Harmonizing And Eliciting Code Capability By Hindsight Tuning On Multi-source Data Song Zifan, Wang Yudong, Zhang Wenwei, Liu Kuikun, Lyu Chengqi, Song Demin, Guo Qipeng, Yan Hang, Lin Dahua, Chen Kai, Zhao Cairong
- Regressive Side Effects Of Training Language Models To Mimic Student Misconceptions Sonkar Shashank, Liu Naiming, Baraniuk Richard G.
- Pedagogical Alignment Of Large Language Models Sonkar Shashank, Ni Kangqi, Chaudhary Sapana, Baraniuk Richard G.
- Fine Tuning Vs. Retrieval Augmented Generation For Less Popular Knowledge Soudani Heydar, Kanoulas Evangelos, Hasibi Faegheh
- Generaitor: Tree-in-the-loop Text Generation For Language Model Explainability And Adaptation Spinner Thilo, Kehlbeck Rebecca, Sevastjanova Rita, Stähle Tobias, Keim Daniel A., Deussen Oliver, El-assady Mennatallah
- Repetition Improves Language Model Embeddings Springer Jacob Mitchell, Kotha Suhas, Fried Daniel, Neubig Graham, Raghunathan Aditi
- PUB: A Pragmatics Understanding Benchmark For Assessing Llms' Pragmatics Capabilities Sravanthi Settaluri Lakshmi, Doshi Meet, Kalyan Tankala Pavan, Murthy Rudra, Bhattacharyya Pushpak, Dabre Raj
- Canttalkaboutthis: Aligning Language Models To Stay On Topic In Dialogues Sreedhar Makesh Narsimhan, Rebedea Traian, Ghosh Shaona, Zeng Jiaqi, Parisien Christopher
- Probing Multimodal Llms As World Models For Driving Sreeram Shiva, Wang Tsun-hsuan, Maalouf Alaa, Rosman Guy, Karaman Sertac, Rus Daniela
- Comparative Analysis Of Different Efficient Fine Tuning Methods Of Large Language Models (llms) In Low-resource Setting Srinivasan Krishna Prasad Varadarajan, Gumpena Prasanth, Yattapu Madhusudhana, Brahmbhatt Vishal H.
- The Fine-tuning Paradox: Boosting Translation Quality Without Sacrificing LLM Abilities Stap David, Hasler Eva, Byrne Bill, Monz Christof, Tran Ke
- Fine-tuning Large Language Models For Entity Matching Steiner Aaron, Peeters Ralph, Bizer Christian
- Numllm: Numeric-sensitive Large Language Model For Chinese Finance Su Huan-yi, Wu Ke, Huang Yu-hao, Li Wu-jun
- Investigating Video Reasoning Capability Of Large Language Models With Tropes In Movies Su Hung-ting, Chao Chun-tong, Hsu Ya-ching, Lin Xudong, Niu Yulei, Lee Hung-yi, Hsu Winston H.
- Mission Impossible: A Statistical Perspective On Jailbreaking Llms Su Jingtong, Kempe Julia, Ullrich Karen
- Living In The Moment: Can Large Language Models Grasp Co-temporal Reasoning? Su Zhaochen, Li Juntao, Zhang Jun, Zhu Tong, Qu Xiaoye, Zhou Pan, Bowen Yan, Cheng Yu, Zhang Min
- M-QALM: A Benchmark To Assess Clinical Reading Comprehension And Knowledge Recall In Large Language Models Via Question Answering Subramanian Anand, Schlegel Viktor, Kashyap Abhinav Ramesh, Nguyen Thanh-tung, Dwivedi Vijay Prakash, Winkler Stefan
- LAB: Large-scale Alignment For Chatbots Sudalairaj Shivchander, Bhandwaldar Abhishek, Pareja Aldo, Xu Kai, Cox David D., Srivastava Akash
- Fidelis: Faithful Reasoning In Large Language Model For Knowledge Graph Question Answering Sui Yuan, He Yufei, Liu Nian, He Xiaoxin, Wang Kun, Hooi Bryan
- 70b-parameter Large Language Models In Japanese Medical Question-answering Sukeda Issey, Kishikawa Risa, Kodera Satoshi
- Persona-db: Efficient Large Language Model Personalization For Response Prediction With Collaborative Data Refinement Sun Chenkai, Yang Ke, Reddy Revanth Gangi, Fung Yi R., Chan Hou Pong, Small Kevin, Zhai Chengxiang, Ji Heng
- Sq-llava: Self-questioning For Large Vision-language Assistant Sun Guohao, Qin Can, Wang Jiamian, Chen Zeyuan, Xu Ran, Tao Zhiqiang
- Triforce: Lossless Acceleration Of Long Sequence Generation With Hierarchical Speculative Decoding Sun Hanshi, Chen Zhuoming, Yang Xinyu, Tian Yuandong, Chen Beidi
- Supervised Fine-tuning As Inverse Reinforcement Learning Sun Hao
- Fuxitranyu: A Multilingual Large Language Model Trained With Balanced Data Sun Haoran, Jin Renren, Xu Shaoyang, Pan Leiyu, Supryadi, Cui Menglong, Du Jiangcun, Lei Yikun, Yang Lei, Shi Ling, Xiao Juesi, Zhu Shaolin, Xiong Deyi
- Bbox-adapter: Lightweight Adapting For Black-box Large Language Models Sun Haotian, Zhuang Yuchen, Wei Wei, Zhang Chao, Dai Bo
- Lawluo: A Chinese Law Firm Co-run By LLM Agents Sun Jingyun, Dai Chengxiao, Luo Zhongze, Chang Yangbo, Li Yang
- Fine-tuning Vs Prompting, Can Language Models Understand Human Values? Sun Pingwei
- The Critique Of Critique Sun Shichao, Li Junlong, Yuan Weizhe, Yuan Ruifeng, Li Wenjie, Liu Pengfei
- Expert-guided Extinction Of Toxic Tokens For Debiased Generation Sun Xueyao, Shi Kaize, Tang Haoran, Xu Guandong, Li Qing
- QDA-SQL: Questions Enhanced Dialogue Augmentation For Multi-turn Text-to-sql Sun Yinggang, Guo Ziming, Yu Haining, Liu Chuanyi, Li Xiang, Wang Bingxuan, Yu Xiangzhan, Zhao Tiancheng
- Leveraging Chatgpt In Pharmacovigilance Event Extraction: An Empirical Study Sun Zhaoyue, Pergola Gabriele, Wallace Byron C., He Yulan
- Towards Smaller, Faster Decoder-only Transformers: Architectural Variants And Their Implications Suresh Sathya Krishnan, P Shunmugapriya
- Automating Research Synthesis With Domain-specific Large Language Model Fine-tuning Susnjak Teo, Hwang Peter, Reyes Napoleon H., Barczak Andre L. C., Mcintosh Timothy R., Ranathunga Surangika
- JORA: JAX Tensor-parallel Lora Library For Retrieval Augmented Fine-tuning Tahir Anique, Cheng Lu, Liu Huan
- Lloco: Learning Long Contexts Offline Tan Sijun, Li Xiuyu, Patil Shishir, Wu Ziyang, Zhang Tianjun, Keutzer Kurt, Gonzalez Joseph E., Popa Raluca Ada
- Fine-tuning Large Language Model (LLM) Artificial Intelligence Chatbots In Ophthalmology And Llm-based Evaluation Using GPT-4 Tan Ting Fang, Elangovan Kabilan, Jin Liyuan, Jie Yao, Yong Li, Lim Joshua, Poh Stanley, Ng Wei Yan, Lim Daniel, Ke Yuhe, Liu Nan, Ting Daniel Shu Wei
- Democratizing Large Language Models Via Personalized Parameter-efficient Fine-tuning Tan Zhaoxuan, Zeng Qingkai, Tian Yijun, Liu Zheyuan, Yin Bing, Jiang Meng
- DLO: Dynamic Layer Operation For Efficient Vertical Scaling Of Llms Tan Zhen, Dong Daize, Zhao Xinyu, Peng Jie, Cheng Yu, Chen Tianlong
- Can I Understand What I Create? Self-knowledge Evaluation Of Large Language Models Tan Zhiquan, Wei Lai, Wang Jindong, Xie Xing, Huang Weiran
- EVALALIGN: Supervised Fine-tuning Multimodal Llms With Human-aligned Data For Evaluating Text-to-image Models Tan Zhiyu, Yang Xiaomeng, Qin Luozheng, Yang Mengping, Zhang Cheng, Li Hao
- Language-specific Neurons: The Key To Multilingual Capabilities In Large Language Models Tang Tianyi, Luo Wenyang, Huang Haoyang, Zhang Dongdong, Wang Xiaolei, Zhao Xin, Wei Furu, Wen Ji-rong
- Prioritizing Safeguarding Over Autonomy: Risks Of LLM Agents For Science Tang Xiangru, Jin Qiao, Zhu Kunlun, Yuan Tongxin, Zhang Yichi, Zhou Wangchunshu, Qu Meng, Zhao Yilun, Tang Jian, Zhang Zhuosheng, Cohan Arman, Lu Zhiyong, Gerstein Mark
- On The Hardness Of Faithful Chain-of-thought Reasoning In Large Language Models Tanneru Sree Harsha, Ley Dan, Agarwal Chirag, Lakkaraju Himabindu
- Unlocking The Potential Of Model Merging For Low-resource Languages Tao Mingxu, Zhang Chen, Huang Quzhe, Ma Tianyao, Huang Songfang, Zhao Dongyan, Feng Yansong
- Making A Long Story Short In Conversation Modeling Tao Yufei, Mines Tiernan, Agrawal Ameeta
- MEEL: Multi-modal Event Evolution Learning Tao Zhengwei, Jin Zhi, Huang Junqiang, Chen Xiancai, Bai Xiaoying, Zhao Haiyan, Zhang Yifan, Tao Chongyang
- Exploring Design Choices For Building Language-specific Llms Tejaswi Atula, Gupta Nilesh, Choi Eunsol
- Equipping Language Models With Tool Use Capability For Tabular Data Analysis In Finance Theuma Adrian, Shareghi Ehsan
- Hydralora: An Asymmetric Lora Architecture For Efficient Fine-tuning Tian Chunlin, Shi Zhan, Guo Zhijiang, Li Li, Xu Chengzhong
- Toward Self-improvement Of Llms Via Imagination, Searching, And Criticizing Tian Ye, Peng Baolin, Song Linfeng, Jin Lifeng, Yu Dian, Mi Haitao, Yu Dong
- Cantor: Inspiring Multimodal Chain-of-thought Of MLLM Gao Timin, Chen Peixian, Zhang Mengdan, Fu Chaoyou, Shen Yunhang, Zhang Yan, Zhang Shengchuan, Zheng Xiawu, Sun Xing, Cao Liujuan, Ji Rongrong
- An Ecosage Assistant: Towards Building A Multimodal Plant Care Dialogue Assistant Tomar Mohit, Tiwari Abhisek, Saha Tulika, Jha Prince, Saha Sriparna
- Understanding Linear Probing Then Fine-tuning Language Models From NTK Perspective Tomihari Akiyoshi, Sato Issei
- Can Llms Learn From Previous Mistakes? Investigating Llms' Errors To Boost For Reasoning Tong Yongqi, Li Dawei, Wang Sizhe, Wang Yujia, Teng Fei, Shang Jingbo
- Optimizing Language Model's Reasoning Abilities With Weak Supervision Tong Yongqi, Wang Sizhe, Li Dawei, Wang Yifan, Han Simeng, Lin Zi, Huang Chengsong, Huang Jiaxin, Shang Jingbo
- Evaluating Large Language Models With Grid-based Game Competitions: An Extensible LLM Benchmark And Leaderboard Topsakal Oguzhan, Edell Colby Jacob, Harper Jackson Bailey
- Few-shot Cross-lingual Transfer For Prompting Large Language Models In Low-resource Languages Toukmaji Christopher
- Linguacodus: A Synergistic Framework For Transformative Code Generation In Machine Learning Pipelines Trofimova Ekaterina, Sataev Emil, Ustyuzhanin Andrey E.
- Crossing Linguistic Horizons: Finetuning And Comprehensive Evaluation Of Vietnamese Large Language Models Truong Sang T., Nguyen Duc Q., Nguyen Toan, Le Dong D., Truong Nhi N., Quan Tho, Koyejo Sanmi
- Code Less, Align More: Efficient LLM Fine-tuning For Code Generation With Data Pruning Tsai Yun-da, Liu Mingjie, Ren Haoxing
- Towards Safety And Helpfulness Balanced Responses Via Controllable Large Language Models Tuan Yi-lin, Chen Xilun, Smith Eric Michael, Martin Louis, Batra Soumya, Celikyilmaz Asli, Wang William Yang, Bikel Daniel M.
- Unmasking The Imposters: In-domain Detection Of Human Vs. Machine-generated Tweets Tuck Bryan E., Verma Rakesh M.
- VBART: The Turkish LLM Turker Meliksah, Ari Mehmet Erdi, Han Aydin
- Lifelong Robot Library Learning: Bootstrapping Composable And Generalizable Skills For Embodied Control With Language Models Tziafas Georgios, Kasaei Hamidreza
- Asking And Answering Questions To Extract Event-argument Structures Uddin Md Nayem, George Enfa Rose, Blanco Eduardo, Corman Steven
- Bootstrapping Llm-based Task-oriented Dialogue Agents Via Self-talk Ulmer Dennis, Mansimov Elman, Lin Kaixiang, Sun Justin, Gao Xibin, Zhang Yi
- PARADISE: Evaluating Implicit Planning Skills Of Language Models With Procedural Warnings And Tips Dataset Uzunoglu Arda, Safa Abdalfatah Rashid, Şahin Gözde Gül
- Autopuredata: Automated Filtering Of Web Data For LLM Fine-tuning Vadlapati Praneeth
- A Survey Of Prompt Engineering Methods In Large Language Models For Different NLP Tasks Vatsal Shubham, Dubey Harsh
- Question Suggestion For Conversational Shopping Assistants Using Product Metadata Vedula Nikhita, Rokhlenko Oleg, Malmasi Shervin
- Cross-modal Projection In Multimodal Llms Doesn't Really Project Visual Attributes To Textual Space Verma Gaurav, Choi Minje, Sharma Kartik, Watson-daniels Jamelle, Oh Sejoon, Kumar Srijan
- How Much Data Is Enough Data? Fine-tuning Large Language Models For In-house Translation: Performance Evaluation Across Multiple Dataset Sizes Vieira Inacio, Allred Will, Lankford Séamus, Castilho Sheila, Way Andy
- Redwhale: An Adapted Korean LLM Through Efficient Continual Pretraining Vo Anh-dung, Jung Minseong, Lee Wonbeen, Choi Daewoo
- Foundational Autoraters: Taming Large Language Models For Better Automatic Evaluation Vu Tu, Krishna Kalpesh, Alzubi Salaheddin, Tar Chris, Faruqui Manaal, Sung Yun-hsuan
- Exploring Fact Memorization And Style Imitation In Llms Using Qlora: An Experimental Study And Quality Assessment Methods Vyborov Eugene, Osypenko Oleksiy, Sotnyk Serge
- Generative Large Language Models In Automated Fact-checking: A Survey Vykopal Ivan, Pikuliak Matúš, Ostermann Simon, Šimko Marián
- Investigating Mysteries Of Cot-augmented Distillation Wadhwa Somin, Amir Silvio, Wallace Byron C.
- Fusechat: Knowledge Fusion Of Chat Models Wan Fanqi, Zhong Longguang, Yang Ziyi, Chen Ruijun, Quan Xiaojun
- LOOK-M: Look-once Optimization In KV Cache For Efficient Multimodal Long-context Inference Wan Zhongwei, Wu Ziang, Liu Che, Huang Jinfa, Zhu Zhihong, Jin Peng, Wang Longyue, Yuan Li
- Datavist5: A Pre-trained Language Model For Jointly Understanding Text And Data Visualization Wan Zhuoyue, Song Yuanfeng, Li Shuaimin, Zhang Chen Jason, Wong Raymond Chi-wing
- IAA: Inner-adaptor Architecture Empowers Frozen Large Language Model With Multimodal Capabilities Wang Bin, Xie Chunyu, Leng Dawei, Yin Yuhui
- Information Theoretic Text-to-image Alignment Wang Chao, Franzese Giulio, Finamore Alessandro, Gallo Massimo, Michiardi Pietro
- BLSP-KD: Bootstrapping Language-speech Pre-training Via Knowledge Distillation Wang Chen, Liao Minpeng, Huang Zhongqiang, Zhang Jiajun
- Mllm-tool: A Multimodal Large Language Model For Tool Agent Learning Wang Chenyu, Luo Weixin, Chen Qianyu, Mai Haonan, Guo Jindi, Dong Sixun, Xuan Xiaohua (Michael), Li Zhengxin, Ma Lin, Gao Shenghua
- A Survey For Large Language Models In Biomedicine Wang Chong, Li Mengyao, He Junjun, Wang Zhongruo, Darzi Erfan, Chen Zan, Ye Jin, Li Tianbin, Su Yanzhou, Ke Jing, Qu Kaili, Li Shuxin, Yu Yi, Liò Pietro, Wang Tianyun, Wang Yu Guang, Shen Yiqing
- Towards Efficient And Effective Unlearning Of Large Language Models For Recommendation Wang Hangyu, Lin Jianghao, Chen Bo, Yang Yang, Tang Ruiming, Zhang Weinan, Yu Yong
- Adapting Open-source Large Language Models For Cost-effective, Expert-level Clinical Note Generation With On-policy Reinforcement Learning Wang Hanyin, Gao Chufan, Liu Bolun, Xu Qiping, Hussein Guleid, Labban Mohamad El, Iheasirim Kingsley, Korsapati Hariprasad, Outcalt Chuck, Sun Jimeng
- Techgpt-2.0: A Large Language Model Project To Solve The Task Of Knowledge Graph Construction Wang Jiaqi, Chang Yuying, Li Zhong, An Ning, Ma Qi, Hei Lei, Luo Haibo, Lu Yifei, Ren Feiliang
- Length Generalization Of Causal Transformers Without Position Encoding Wang Jie, Ji Tao, Wu Yuanbin, Yan Hang, Gui Tao, Zhang Qi, Huang Xuanjing, Wang Xiaoling
- Chinese Metaphor Recognition Using A Multi-stage Prompting Large Language Model Wang Jie, Wang Jin, Zhang Xuejie
- Bioinformatics And Biomedical Informatics With Chatgpt: Year One Review Wang Jinge, Cheng Zien, Yao Qiuming, Liu Li, Xu Dong, Hu Gangqing
- Helpful Or Harmful Data? Fine-tuning-free Shapley Attribution For Explaining Language Model Predictions Wang Jingtan, Lin Xiaoqiang, Qiao Rui, Foo Chuan-sheng, Low Bryan Kian Hsiang
- JMLR: Joint Medical LLM And Retrieval Training For Enhancing Reasoning And Professional Question Answering Capability Wang Junda, Yang Zhichao, Yao Zonghai, Yu Hong
- Beyond The Known: Investigating Llms Performance On Out-of-domain Intent Detection Wang Pei, He Keqing, Wang Yejie, Song Xiaoshuai, Mou Yutao, Wang Jingang, Xian Yunsen, Cai Xunliang, Xu Weiran
- Inferaligner: Inference-time Alignment For Harmlessness Through Cross-model Guidance Wang Pengyu, Zhang Dong, Li Linyang, Tan Chenkun, Wang Xinghao, Ren Ke, Jiang Botian, Qiu Xipeng
- Role Prompting Guided Domain Adaptation With General Capability Preserve For Large Language Models Wang Rui, Mi Fei, Chen Yi, Xue Boyang, Wang Hongru, Zhu Qi, Wong Kam-fai, Xu Ruifeng
- Deconfounded Causality-aware Parameter-efficient Fine-tuning For Problem-solving Improvement Of Llms Wang Ruoyu, Li Xiaoxuan, Yao Lina
- Unique Security And Privacy Threats Of Large Language Model: A Comprehensive Survey Wang Shang, Zhu Tianqing, Liu Bo, Ding Ming, Guo Xu, Ye Dayong, Zhou Wanlei, Yu Philip S.
- Lora Meets Dropout Under A Unified Framework Wang Sheng, Chen Liheng, Jiang Jiyue, Xue Boyang, Kong Lingpeng, Wu Chuan
- Prolora: Partial Rotation Empowers More Parameter-efficient Lora Wang Sheng, Xue Boyang, Ye Jiacheng, Jiang Jiyue, Chen Liheng, Kong Lingpeng, Wu Chuan
- Zero-shot Generative Large Language Models For Systematic Review Screening Automation Wang Shuai, Scells Harrisen, Zhuang Shengyao, Potthast Martin, Koopman Bevan, Zuccon Guido
- Weaver: Foundation Models For Creative Writing Wang Tiannan, Chen Jiamin, Jia Qingrui, Wang Shuai, Fang Ruoyu, Wang Huilin, Gao Zhaowei, Xie Chunzhao, Xu Chuou, Dai Jihong, Liu Yibin, Wu Jialong, Ding Shengwei, Li Long, Huang Zhiwei, Deng Xinle, Yu Teng, Ma Gangan, Xiao Han, Chen Zixin, Xiang Danjun, Wang Yunxia, Zhu Yuanyuan, Xiao Yi, Wang Jing, Wang Yiru, Ding Siran, Huang Jiayang, Xu Jiayi, Tayier Yilihamu, Hu Zhenyu, Gao Yuan, Zheng Chengfeng, Ye Yueshu, Li Yihang, Wan Lei, Jiang Xinyue, Wang Yujie, Cheng Siyu, Song Zhule, Tang Xiangru, Xu Xiaohua, Zhang Ningyu, Chen Huajun, Jiang Yuchen Eleanor, Zhou Wangchunshu
- Using Advanced Llms To Enhance Smaller Llms: An Interpretable Knowledge Distillation Approach Wang Tong, Sudhir K., Hong Dat
- QCRD: Quality-guided Contrastive Rationale Distillation For Large Language Models Wang Wei, Li Zhaowei, Xu Qi, Cai Yiqing, Song Hang, Qi Qi, Zhou Ran, Huang Zhida, Wang Tao, Xiao Li
- Secokd: Aligning Large Language Models For In-context Learning With Fewer Shots Wang Weixing, Yang Haojin, Meinel Christoph
- Testing And Evaluation Of Large Language Models: Correctness, Non-toxicity, And Fairness Wang Wenxuan
- Metatool: Facilitating Large Language Models To Master Tools With Meta-task Augmentation Wang Xiaohan, Li Dian, Zhao Yilin, Sinbadliu, Wang Hui
- Executable Code Actions Elicit Better LLM Agents Wang Xingyao, Chen Yangyi, Yuan Lifan, Zhang Yizhe, Li Yunzhu, Peng Hao, Ji Heng
- Hidden Question Representations Tell Non-factuality Within And Across Large Language Models Wang Yanling, Li Haoyang, Zou Hao, Zhang Jing, He Xinlei, Li Qi, Xu Ke
- Inscl: A Data-efficient Continual Learning Paradigm For Fine-tuning Large Language Models With Instructions Wang Yifan, Liu Yafei, Shi Chufan, Li Haoling, Chen Chen, Lu Haonan, Yang Yujiu
- Uncertainty Aware Learning For Language Model Alignment Wang Yikun, Zheng Rui, Ding Liang, Zhang Qi, Lin Dahua, Tao Dacheng
- Loma: Lossless Compressed Memory Attention Wang Yumeng, Xiao Zhenyang
- Exovip: Step-by-step Verification And Exploration With Exoskeleton Modules For Compositional Visual Reasoning Wang Yuxuan, Yuille Alan, Li Zhuowan, Zheng Zilong
- To Recommend Or Not: Recommendability Identification In Conversations With Pre-trained Language Models Wang Zhefan, Ma Weizhi, Zhang Min
- A Comprehensive Survey Of LLM Alignment Techniques: RLHF, RLAIF, PPO, DPO And More Wang Zhichao, Bi Bin, Pentyala Shiva Kumar, Ramnath Kiran, Chaudhuri Sougata, Mehrotra Shubham, Zhu Zixu (James), Mao Xiang-bo, Asur Sitaram, Cheng Na (Claire)
- Re-task: Revisiting LLM Tasks From Capability, Skill, And Knowledge Perspectives Wang Zhihu, Zhao Shiwan, Wang Yu, Huang Heyuan, Shi Jiaxin, Xie Sitao, Wang Zhixing, Zhang Yubo, Li Hongyan, Yan Junchi
- Promptcharm: Text-to-image Generation Through Multi-modal Prompting And Refinement Wang Zhijie, Huang Yuheng, Song Da, Ma Lei, Zhang Tianyi
- Re2llm: Reflective Reinforcement Large Language Model For Session-based Recommendation Wang Ziyan, Du Yingpeng, Sun Zhu, Chua Haoyan, Feng Kaidong, Wang Wenya, Zhang Jie
- Flora: Federated Fine-tuning Large Language Models With Heterogeneous Low-rank Adaptations Wang Ziyao, Shen Zheyu, He Yexiao, Sun Guoheng, Wang Hongyi, Lyu Lingjuan, Li Ang
- Investigating Multilingual Instruction-tuning: Do Polyglot Models Demand For Multilingual Instructions? Weber Alexander Arno, Thellmann Klaudia, Ebert Jan, Flores-herr Nicolas, Lehmann Jens, Fromm Michael, Ali Mehdi
- Measuring And Reducing LLM Hallucination Without Gold-standard Answers Wei Jiaheng, Yao Yuanshun, Ton Jean-francois, Guo Hongyi, Estornell Andrew, Liu Yang
- Instructrag: Instructing Retrieval-augmented Generation Via Self-synthesized Rationales Wei Zhepei, Chen Wei-lin, Meng Yu
- On-policy Fine-grained Knowledge Feedback For Hallucination Mitigation Wen Xueru, Lu Xinyu, Guan Xinyan, Lu Yaojie, Lin Hongyu, He Ben, Han Xianpei, Sun Le
- Grounding Data Science Code Generation With Input-output Specifications Wen Yeming, Yin Pengcheng, Shi Kensen, Michalewski Henryk, Chaudhuri Swarat, Polozov Alex
- Codeultrafeedback: An Llm-as-a-judge Dataset For Aligning Large Language Models To Coding Preferences Weyssow Martin, Kamanda Aton, Sahraoui Houari
- Towards Knowledge-grounded Natural Language Understanding And Generation Whitehouse Chenxi
- Personalized Large Language Models Woźniak Stanisław, Koptyra Bartłomiej, Janz Arkadiusz, Kazienko Przemysław, Kocoń Jan
- Curriculum Learning With Quality-driven Data Selection Wu Biao, Meng Fang, Chen Ling
- Sparkra: A Retrieval-augmented Knowledge Service System Based On Spark Large Language Model Wu Dayong, Li Jiaqi, Wang Baoxin, Zhao Honghong, Xue Siyuan, Yang Yanjie, Chang Zhijun, Zhang Rui, Qian Li, Wang Bo, Wang Shijin, Zhang Zhixiong, Hu Guoping
- Generative Monoculture In Large Language Models Wu Fan, Black Emily, Chandrasekaran Varun
- Gendec: A Robust Generative Question-decomposition Method For Multi-hop Reasoning Wu Jian, Yang Linyi, Ji Yuliang, Huang Wenhao, Karlsson Börje F., Okumura Manabu
- Pandora's Box Or Aladdin's Lamp: A Comprehensive Analysis Revealing The Role Of RAG Noise In Large Language Models Wu Jinyang, Che Feihu, Zhang Chuyuan, Tao Jianhua, Zhang Shuai, Shao Pengpeng
- Can Large Language Models Understand Uncommon Meanings Of Common Words? Wu Jinyang, Che Feihu, Zheng Xinxin, Zhang Shuai, Jin Ruihan, Nie Shuai, Shao Pengpeng, Tao Jianhua
- \(\beta\)-dpo: Direct Preference Optimization With Dynamic \(\beta\) Wu Junkang, Xie Yuexiang, Yang Zhengyi, Wu Jiancan, Gao Jinyang, Ding Bolin, Wang Xiang, He Xiangnan
- Noiseboost: Alleviating Hallucination With Noise Perturbation For Multimodal Large Language Models Wu Kai, Jiang Boyuan, Jiang Zhengkai, He Qingdong, Luo Donghao, Wang Shengzhi, Liu Qingwen, Wang Chengjie
- Adapting Large Language Models For Document-level Machine Translation Wu Minghao, Vu Thuy-trang, Qu Lizhen, Foster George, Haffari Gholamreza
- Mixture-of-skills: Learning To Optimize Data Usage For Fine-tuning Large Language Models Wu Minghao, Vu Thuy-trang, Qu Lizhen, Haffari Gholamreza
- ITERTL: An Iterative Framework For Fine-tuning Llms For RTL Code Generation Wu Peiyang, Guo Nan, Xiao Xiao, Li Wenming, Ye Xiaochun, Fan Dongrui
- Motionllm: Multimodal Motion-language Learning With Large Language Models Wu Qi, Zhao Yubo, Wang Yifan, Tai Yu-wing, Tang Chi-keung
- GUI Action Narrator: Where And When Did That Action Take Place? Wu Qinchen, Gao Difei, Lin Kevin Qinghong, Wu Zhuoyu, Guo Xiangwu, Li Peiran, Zhang Weichen, Wang Hengxu, Shou Mike Zheng
- Not All Attention Is Needed: Parameter And Computation Efficient Transfer Learning For Multi-modal Large Language Models Wu Qiong, Ye Weihao, Zhou Yiyi, Sun Xiaoshuai, Ji Rongrong
- Automated Review Generation Method Based On Large Language Models Wu Shican, Ma Xiao, Luo Dehui, Li Lulu, Shi Xiangcheng, Chang Xin, Lin Xiaoyun, Luo Ran, Pei Chunlei, Zhao Zhi-jian, Gong Jinlong
- F-LMM: Grounding Frozen Large Multimodal Models Wu Size, Jin Sheng, Zhang Wenwei, Xu Lumin, Liu Wentao, Li Wei, Loy Chen Change
- Semantic Alignment For Multimodal Large Language Models Wu Tao, Li Mengze, Chen Jingyuan, Ji Wei, Lin Wang, Gao Jinyang, Kuang Kun, Zhao Zhou, Wu Fei
- Never Miss A Beat: An Efficient Recipe For Context Window Extension Of Large Language Models With Consistent "middle" Enhancement Wu Tong, Zhao Yanpeng, Zheng Zilong
- Enhancing Q-learning With Large Language Model Heuristics Wu Xiefeng
- Passage-specific Prompt Tuning For Passage Reranking In Question Answering With Large Language Models Wu Xuyang, Peng Zhiyuan, Sai Krishna Sravanthi Rajanala, Wu Hsin-tai, Fang Yi
- Difflora: Generating Personalized Low-rank Adaptation Weights With Diffusion Wu Yujia, Shi Yiming, Wei Jiwei, Sun Chengwei, Zhou Yuyang, Yang Yang, Shen Heng Tao
- Inversecoder: Unleashing The Power Of Instruction-tuned Code Llms With Inverse-instruct Wu Yutong, Huang Di, Shi Wenxuan, Wang Wei, Gao Lingzhe, Liu Shihao, Nan Ziyuan, Yuan Kaizhao, Zhang Rui, Zhang Xishan, Du Zidong, Guo Qi, Pu Yewen, Yin Dawei, Hu Xing, Chen Yunji
- Acquiring Clean Language Models From Backdoor Poisoned Datasets By Downscaling Frequency Space Wu Zongru, Zhang Zhuosheng, Cheng Pengzhou, Liu Gongshen
- Imitating Language Via Scalable Inverse Reinforcement Learning Wulfmeier Markus, Bloesch Michael, Vieillard Nino, Ahuja Arun, Bornschein Jorg, Huang Sandy, Sokolov Artem, Barnes Matt, Desjardins Guillaume, Bewley Alex, Bechtle Sarah Maria Elisabeth, Springenberg Jost Tobias, Momchev Nikola, Bachem Olivier, Geist Matthieu, Riedmiller Martin
- In-context Learning Can Re-learn Forbidden Tasks Xhonneux Sophie, Dobre David, Tang Jian, Gidel Gauthier, Sridhar Dhanya
- A Practice Of Post-training On Llama-3 70B With Optimal Selection Of Additional Language Mixture Ratio Xi Ningyuan, Wu Yetao, Fan Kun, Chen Teng, Gu Qingqing, Yu Peng, Qu Jinxian, Liu Chenxi, Jiang Zhonglin, Chen Yong, Ji Luo
- What To Do If Language Models Disagree? Black-box Model Ensembling For Textual And Visual Question Answering Xia Yuxi, Zaporojets Klim, Roth Benjamin
- Infllm: Training-free Long-context Extrapolation For Llms With An Efficient Context Memory Xiao Chaojun, Zhang Pengle, Han Xu, Xiao Guangxuan, Lin Yankai, Zhang Zhengyan, Liu Zhiyuan, Sun Maosong
- Rar-b: Reasoning As Retrieval Benchmark Xiao Chenghao, Hudson G Thomas, Moubayed Noura Al
- EMP: Enhance Memory In Data Pruning Xiao Jinying, Li Ping, Nie Jie, Tang Zhe
- Palm2-vadapter: Progressively Aligned Language Model Makes A Strong Vision-language Adapter Xiao Junfei, Xu Zheng, Yuille Alan, Yan Shen, Wang Boyu
- Emovit: Revolutionizing Emotion Insights With Visual Instruction Tuning Xie Hongxia, Peng Chu-jun, Tseng Yu-wen, Chen Hung-jen, Hsu Chan-feng, Shuai Hong-han, Cheng Wen-huang
- MEMLA: Enhancing Multilingual Knowledge Editing With Neuron-masked Low-rank Adaptation Xie Jiakuan, Cao Pengfei, Chen Yuheng, Chen Yubo, Liu Kang, Zhao Jun
- Non-instructional Fine-tuning: Enabling Instruction-following Capabilities In Pre-trained Language Models Without Instruction-following Data Xie Juncheng, Syu Shensian, Lee Hung-yi
- Autoad-zero: A Training-free Framework For Zero-shot Audio Description Xie Junyu, Han Tengda, Bain Max, Nagrani Arsha, Varol Gül, Xie Weidi, Zisserman Andrew
- Exploratory Preference Optimization: Harnessing Implicit Q*-approximation For Sample-efficient RLHF Xie Tengyang, Foster Dylan J., Krishnamurthy Akshay, Rosset Corby, Awadallah Ahmed, Rakhlin Alexander
- Monte Carlo Tree Search Boosts Reasoning Via Iterative Preference Learning Xie Yuxi, Goyal Anirudh, Zheng Wenyue, Kan Min-yen, Lillicrap Timothy P., Kawaguchi Kenji, Shieh Michael
- Understanding The Weakness Of Large Language Model Agents Within A Complex Android Environment Xing Mingzhe, Zhang Rongkai, Xue Hui, Chen Qi, Yang Fan, Xiao Zhen
- Improving Retrieval-augmented Generation In Medicine With Iterative Follow-up Questions Xiong Guangzhi, Jin Qiao, Wang Xiao, Zhang Minjia, Lu Zhiyong, Zhang Aidong
- Large Language Models Can Learn Temporal Reasoning Xiong Siheng, Payani Ali, Kompella Ramana, Fekri Faramarz
- Building Math Agents With Multi-turn Iterative Preference Learning Xiong Wei, Shi Chengshuai, Shen Jiaming, Rosenberg Aviv, Qin Zhen, Calandriello Daniele, Khalman Misha, Joshi Rishabh, Piot Bilal, Saleh Mohammad, Jin Chi, Zhang Tong, Liu Tianqi
- Mixture-of-instructions: Comprehensive Alignment Of A Large Language Model Through The Mixture Of Diverse System Prompting Instructions Xu Bowen, Wu Shaoyu, Liu Kai, Hu Lulu
- Interactive Evolution: A Neural-symbolic Self-training Framework For Large Language Models Xu Fangzhi, Sun Qiushi, Cheng Kanzhi, Liu Jun, Qiao Yu, Wu Zhiyong
- Large Language Models For Education: A Survey Xu Hanyi, Gan Wensheng, Qi Zhenlian, Wu Jiayang, Yu Philip S.
- Contrastive Preference Optimization: Pushing The Boundaries Of LLM Performance In Machine Translation Xu Haoran, Sharaf Amr, Chen Yunmo, Tan Weiting, Shen Lingfeng, Van Durme Benjamin, Murray Kenton, Kim Young Jin
- Seamless Language Expansion: Enhancing Multilingual Mastery In Self-supervised Models Xu Jing, Wu Minglin, Wu Xixin, Meng Helen
- Meteora: Multiple-tasks Embedded Lora For Large Language Models Xu Jingwei, Lai Junyu, Huang Yunpeng
- On The Tip Of The Tongue: Analyzing Conceptual Representation In Large Language Models With Reverse-dictionary Probe Xu Ningyu, Zhang Qi, Zhang Menghan, Qian Peng, Huang Xuanjing
- Course-correction: Safety Alignment Using Synthetic Preferences Xu Rongwu, Cai Yishuo, Zhou Zhenhong, Gu Renjie, Weng Haiqin, Liu Yan, Zhang Tianwei, Xu Wei, Qiu Han
- Is DPO Superior To PPO For LLM Alignment? A Comprehensive Study Xu Shusheng, Fu Wei, Gao Jiaxuan, Ye Wenjie, Liu Weilin, Mei Zhiyu, Wang Guangju, Yu Chao, Wu Yi
- SA-MDKIF: A Scalable And Adaptable Medical Domain Knowledge Injection Framework For Large Language Models Xu Tianhan, Hu Zhe, Chen Ling, Li Bin
- Sayself: Teaching Llms To Express Confidence With Self-reflective Rationales Xu Tianyang, Wu Shujin, Diao Shizhe, Liu Xiaoze, Wang Xingyao, Chen Yangyi, Gao Jing
- Can Llms Solve Longer Math Word Problems Better? Xu Xin, Xiao Tong, Chao Zitong, Huang Zhenya, Yang Can, Wang Yang
- Chatglm-math: Improving Math Problem-solving In Large Language Models With A Self-critique Pipeline Xu Yifan, Liu Xiao, Liu Xinghan, Hou Zhenyu, Li Yueyan, Zhang Xiaohan, Wang Zihan, Zeng Aohan, Du Zhengxiao, Zhao Wenyi, Tang Jie, Dong Yuxiao
- Linkprompt: Natural And Universal Adversarial Attacks On Prompt-based Language Models Xu Yue, Wang Wenjie
- Magpie: Alignment Data Synthesis From Scratch By Prompting Aligned Llms With Nothing Xu Zhangchen, Jiang Fengqing, Niu Luyao, Deng Yuntian, Poovendran Radha, Choi Yejin, Lin Bill Yuchen
- Bag Of Tricks: Benchmarking Of Jailbreak Attacks On Llms Xu Zhao, Liu Fan, Liu Hao
- A Comprehensive Study Of Jailbreak Attack Versus Defense For Large Language Models Xu Zihao, Liu Yi, Deng Gelei, Li Yuekang, Picek Stjepan
- Longvila: Scaling Long-context Visual Language Models For Long Videos Xue Fuzhao, Chen Yukang, Li Dacheng, Hu Qinghao, Zhu Ligeng, Li Xiuyu, Fang Yunhao, Tang Haotian, Yang Shang, Liu Zhijian, He Ethan, Yin Hongxu, Molchanov Pavlo, Kautz Jan, Fan Linxi, Zhu Yuke, Lu Yao, Han Song
- Strengthened Symbol Binding Makes Large Language Models Reliable Multiple-choice Selectors Xue Mengge, Hu Zhenyu, Liu Liqun, Liao Kuo, Li Shuang, Han Honglin, Zhao Meng, Yin Chengguo
- Fine-tuning Large Language Models For Automated Diagnostic Screening Summaries Yadav Manjeet, Sahu Nilesh Kumar, Chaturvedi Mudita, Gupta Snehil, Lone Haroon R
- Medlm: Exploring Language Models For Medical Question Answering Systems Yagnik Niraj, Jhaveri Jay, Sharma Vivek, Pila Gabriel
- LTNER: Large Language Model Tagging For Named Entity Recognition With Contextualized Entity Marking Yan Faren, Yu Peng, Chen Xin
- Predicting The Big Five Personality Traits In Chinese Counselling Dialogues Using Large Language Models Yan Yang, Ma Lizhi, Li Anqi, Ma Jingsong, Lan Zhenzhong
- Exploring The LLM Journey From Cognition To Expression With Linear Representations Yan Yuzi, Li Jialian, Zhang Yipin, Yan Dong
- Equipping Transformer With Random-access Reading For Long-context Understanding Yang Chenghao, Yang Zi, Hua Nan
- 3D Vision And Language Pretraining With Large-scale Synthetic Data Yang Dejie, Xu Zhu, Mo Wentao, Chen Qingchao, Huang Siyuan, Liu Yang
- IM-RAG: Multi-round Retrieval-augmented Generation Through Learning Inner Monologues Yang Diji, Rao Jinmeng, Chen Kezhen, Guo Xiaoyuan, Zhang Yawen, Yang Jie, Zhang Yi
- Improving Factuality In Large Language Models Via Decoding-time Hallucinatory And Truthful Comparators Yang Dingkang, Xiao Dongling, Wei Jinjie, Li Mingcheng, Chen Zhaoyu, Li Ke, Zhang Lihua
- Unveiling The Generalization Power Of Fine-tuned Large Language Models Yang Haoran, Zhang Yumeng, Xu Jiaqi, Lu Hongyuan, Heng Pheng Ann, Lam Wai
- Aligning Llms Through Multi-perspective User Preference Ranking-based Feedback For Programming Question Answering Yang Hongyu, He Liyang, Hou Min, Shen Shuanghong, Li Rui, Hou Jiahui, Ma Jianhui, Zhao Junda
- Mllmreid: Multimodal Large Language Model-based Person Re-identification Yang Shan, Zhang Yongfei
- Moral: Moe Augmented Lora For Llms' Lifelong Learning Yang Shu, Ali Muhammad Asif, Wang Cheng-long, Hu Lijie, Wang Di
- Is Crowdsourcing Breaking Your Bank? Cost-effective Fine-tuning Of Pre-trained Language Models With Proximal Policy Optimization Yang Shuo, Kasneci Gjergji
- Corda: Context-oriented Decomposition Adaptation Of Large Language Models Yang Yibo, Li Xiaojie, Zhou Zhongzhu, Song Shuaiwen Leon, Wu Jianlong, Nie Liqiang, Ghanem Bernard
- Adazeta: Adaptive Zeroth-order Tensor-train Adaption For Memory-efficient Large Language Models Fine-tuning Yang Yifan, Zhen Kai, Banijamal Ershad, Mouchtaris Athanasios, Zhang Zheng
- Loretta: Low-rank Economic Tensor-train Adaptation For Ultra-low-parameter Fine-tuning Of Large Language Models Yang Yifan, Zhou Jiajun, Wong Ngai, Zhang Zheng
- Guardt2i: Defending Text-to-image Models From Adversarial Prompts Yang Yijun, Gao Ruiyuan, Yang Xiao, Zhong Jianyuan, Xu Qiang
- P3: A Policy-driven, Pace-adaptive, And Diversity-promoted Framework For Optimizing LLM Training Yang Yingxuan, Wang Huayi, Wen Muning, Zhang Weinan
- Pre-trained Language Models Improve The Few-shot Prompt Ability Of Decision Transformer Yang Yu, Xu Pan
- Weak-to-strong Reasoning Yang Yuqing, Ma Yan, Liu Pengfei
- Fine-tuning Large Language Models With Human-inspired Learning Strategies In Medical Question Answering Yang Yushi, Bean Andrew M., Mccraith Robert, Mahdi Adam
- Recent Advances Of Foundation Language Models-based Continual Learning: A Survey Yang Yutao, Zhou Jie, Ding Xuanwen, Huai Tianyu, Liu Shunyu, Chen Qin, He Liang, Xie Yuan
- Assessing Adversarial Robustness Of Large Language Models: An Empirical Study Yang Zeyu, Meng Zhao, Zheng Xiaochen, Wattenhofer Roger
- Self-distillation Bridges Distribution Gap In Language Model Fine-tuning Yang Zhaorui, Pang Tianyu, Feng Haozhe, Wang Han, Chen Wei, Zhu Minfeng, Liu Qian
- Adversarial Attacks And Defense For Conversation Entailment Task Yang Zhenning, Krawec Ryan, Wu Liang-yuan
- React Meets Actre: When Language Agents Enjoy Training Data Autonomy Yang Zonghan, Li Peng, Yan Ming, Zhang Ji, Huang Fei, Liu Yang
- Curiousllm: Elevating Multi-document QA With Reasoning-infused Knowledge Graph Prompting Yang Zukang, Zhu Zixuan
- Exploiting Inter-layer Expert Affinity For Accelerating Mixture-of-experts Model Inference Yao Jinghan, Anthony Quentin, Shafi Aamir, Subramoni Hari, Panda Dhabaleswar K. (DK)
- Sirllm: Streaming Infinite Retentive LLM Yao Yao, Li Zuchao, Zhao Hai
- Open-domain Implicit Format Control For Large Language Model Generation Yao Yiqun, Ma Wenjia, Fang Xuezhi, Jiang Xin, Li Xiang, Meng Xuying, Han Peng, Li Jing, Sun Aixin, Wang Yequan
- Self-judge: Selective Instruction Following With Alignment Self-evaluation Ye Hai, Ng Hwee Tou
- Harnessing Multimodal Large Language Models For Multimodal Sequential Recommendation Ye Yuyang, Zheng Zhi, Shen Yishan, Wang Tianshu, Zhang Hengruo, Zhu Peijun, Yu Runlong, Zhang Kai, Xiong Hui
- BEAF: Observing Before-after Changes To Evaluate Hallucination In Vision-language Models Moon Ye-bin, Nam Hyeon-woo, Choi Wonseok, Oh Tae-hyun
- Generation Meets Verification: Accelerating Large Language Model Inference With Smart Parallel Auto-correct Decoding Yi Hanling, Lin Feng, Li Hongbin, Ning Peiyang, Yu Xiaotian, Xiao Rong
- Debiasing Multimodal Large Language Models Zhang Yi-fan, Yu Weichen, Wen Qingsong, Wang Xue, Zhang Zhang, Wang Liang, Jin Rong, Tan Tieniu
- MEND: Meta Demonstration Distillation For Efficient And Effective In-context Learning Li Yichuan, Ma Xiyao, Lu Sixing, Lee Kyumin, Liu Xiaohu, Guo Chenlei
- Lofit: Localized Fine-tuning On LLM Representations Yin Fangcong, Ye Xi, Durrett Greg
- Vqattack: Transferable Adversarial Attacks On Visual Question Answering Via Pre-trained Models Yin Ziyi, Ye Muchao, Zhang Tianrong, Wang Jiaqi, Liu Han, Chen Jinghui, Wang Ting, Ma Fenglong
- Internlm-math: Open Math Large Language Models Toward Verifiable Reasoning Ying Huaiyuan, Zhang Shuo, Li Linyang, Zhou Zhejian, Shao Yunfan, Fei Zhaoye, Ma Yichuan, Hong Jiawei, Liu Kuikun, Wang Ziyi, Wang Yudong, Wu Zijian, Li Shuaibin, Zhou Fengzhe, Liu Hongwei, Zhang Songyang, Zhang Wenwei, Yan Hang, Qiu Xipeng, Wang Jiayu, Chen Kai, Lin Dahua
- Safety Fine-tuning At (almost) No Cost: A Baseline For Vision Large Language Models Zong Yongshuo, Bohdal Ondrej, Yu Tingyang, Yang Yongxin, Hospedales Timothy
- Hyperclova X Technical Report Yoo Kang Min, Han Jaegeun, In Sookyo, Jeon Heewon, Jeong Jisu, Kang Jaewook, Kim Hyunwook, Kim Kyung-min, Kim Munhyong, Kim Sungju, Kwak Donghyun, Kwak Hanock, Kwon Se Jung, Lee Bado, Lee Dongsoo, Lee Gichang, Lee Jooho, Park Baeseong, Shin Seongjin, Yu Joonsang, Baek Seolki, Byeon Sumin, Cho Eungsup, Choe Dooseok, Han Jeesung, Jin Youngkyun, Jun Hyein, Jung Jaeseung, Kim Chanwoong, Kim Jinhong, Kim Jinuk, Lee Dokyeong, Park Dongwook, Sohn Jeong Min, Han Sujung, Heo Jiae, Hong Sungju, Jeon Mina, Jung Hyunhoon, Jung Jungeun, Jung Wangkyo, Kim Chungjoon, Kim Hyeri, Kim Jonghyun, Kim Min Young, Lee Soeun, Park Joonhee, Shin Jieun, Yang Sojin, Yoon Jungsoon, Lee Hwaran, Bae Sanghwan, Cha Jeehwan, Gylleus Karl, Ham Donghoon, Hong Mihak, Hong Youngki, Hong Yunki, Jang Dahyun, Jeon Hyojun, Jeon Yujin, Jeong Yeji, Ji Myunggeun, Jin Yeguk, Jo Chansong, Joo Shinyoung, Jung Seunghwan, Kim Adrian Jungmyung, Kim Byoung Hoon, Kim Hyomin, Kim Jungwhan, Kim Minkyoung, Kim Minseung, Kim Sungdong, Kim Yonghee, Kim Youngjun, Kim Youngkwan, Ko Donghyeon, Lee Dughyun, Lee Ha Young, Lee Jaehong, Lee Jieun, Lee Jonghyun, Lee Jongjin, Lee Min Young, Lee Yehbin, Min Taehong, Min Yuri, Moon Kiyoon, Oh Hyangnam, Park Jaesun, Park Kyuyon, Park Younghun, Seo Hanbae, Seo Seunghyun, Sim Mihyun, Son Gyubin, Yeo Matt, Yeom Kyung Hoon, Yoo Wonjoon, You Myungin, Ahn Doheon, Ahn Homin, Ahn Joohee, Ahn Seongmin, An Chanwoo, An Hyeryun, An Junho, An Sang-min, Byun Boram, Byun Eunbin, Cha Jongho, Chang Minji, Chang Seunggyu, Cho Haesong, Cho Youngdo, Choi Dalnim, Choi Daseul, Choi Hyoseok, Choi Minseong, Choi Sangho, Choi Seongjae, Choi Wooyong, Chun Sewhan, Go Dong Young, Ham Chiheon, Han Danbi, Han Jaemin, Hong Moonyoung, Hong Sung Bum, Hwang Dong-hyun, Hwang Seongchan, Im Jinbae, Jang Hyuk Jin, Jang Jaehyung, Jang Jaeni, Jang Sihyeon, Jang Sungwon, Jeon Joonha, Jeong Daun, Jeong Joonhyun, Jeong Kyeongseok, Jeong Mini, Jin Sol, Jo Hanbyeol, Jo Hanju, Jo Minjung, Jung Chaeyoon, Jung Hyungsik, Jung Jaeuk, Jung Ju Hwan, Jung Kwangsun, Jung Seungjae, Ka Soonwon, Kang Donghan, Kang Soyoung, Kil Taeho, Kim Areum, Kim Beomyoung, Kim Byeongwook, Kim Daehee, Kim Dong-gyun, Kim Donggook, Kim Donghyun, Kim Euna, Kim Eunchul, Kim Geewook, Kim Gyu Ri, Kim Hanbyul, Kim Heesu, Kim Isaac, Kim Jeonghoon, Kim Jihye, Kim Joonghoon, Kim Minjae, Kim Minsub, Kim Pil Hwan, Kim Sammy, Kim Seokhun, Kim Seonghyeon, Kim Soojin, Kim Soong, Kim Soyoon, Kim Sunyoung, Kim Taeho, Kim Wonho, Kim Yoonsik, Kim You Jin, Kim Yuri, Kwon Beomseok, Kwon Ohsung, Kwon Yoo-hwan, Lee Anna, Lee Byungwook, Lee Changho, Lee Daun, Lee Dongjae, Lee Ha-ram, Lee Hodong, Lee Hwiyeong, Lee Hyunmi, Lee Injae, Lee Jaeung, Lee Jeongsang, Lee Jisoo, Lee Jongsoo, Lee Joongjae, Lee Juhan, Lee Jung Hyun, Lee Junghoon, Lee Junwoo, Lee Se Yun, Lee Sujin, Lee Sungjae, Lee Sungwoo, Lee Wonjae, Lee Zoo Hyun, Lim Jong Kun, Lim Kun, Lim Taemin, Na Nuri, Nam Jeongyeon, Nam Kyeong-min, Noh Yeonseog, Oh Biro, Oh Jung-sik, Oh Solgil, Oh Yeontaek, Park Boyoun, Park Cheonbok, Park Dongju, Park Hyeonjin, Park Hyun Tae, Park Hyunjung, Park Jihye, Park Jooseok, Park Junghwan, Park Jungsoo, Park Miru, Park Sang Hee, Park Seunghyun, Park Soyoung, Park Taerim, Park Wonkyeong, Ryu Hyunjoon, Ryu Jeonghun, Ryu Nahyeon, Seo Soonshin, Seo Suk Min, Shim Yoonjeong, Shin Kyuyong, Shin Wonkwang, Sim Hyun, Sim Woongseob, Soh Hyejin, Son Bokyong, Son Hyunjun, Son Seulah, Song Chi-yun, Song Chiyoung, Song Ka Yeon, Song Minchul, Song Seungmin, Wang Jisung, Yeo Yonggoo, Yi Myeong Yeon, Yim Moon Bin, Yoo Taehwan, Yoo Youngjoon, Yoon Sungmin, Yoon Young Jin, Yu Hangyeol, Yu Ui Seon, Zuo Xingdong, Bae Jeongin, Bae Joungeun, Cho Hyunsoo, Cho Seonghyun, Cho Yongjin, Choi Taekyoon, Choi Yera, Chung Jiwan, Han Zhenghui, Heo Byeongho, Hong Euisuk, Hwang Taebaek, Im Seonyeol, Jegal Sumin, Jeon Sumin, Jeong Yelim, Jeong Yonghyun, Jiang Can, Jiang Juyong, Jin Jiho, Jo Ara, Jo Younghyun, Jung Hoyoun, Jung Juyoung, Kang Seunghyeong, Kim Dae Hee, Kim Ginam, Kim Hangyeol, Kim Heeseung, Kim Hyojin, Kim Hyojun, Kim Hyun-ah, Kim Jeehye, Kim Jin-hwa, Kim Jiseon, Kim Jonghak, Kim Jung Yoon, Kim Rak Yeong, Kim Seongjin, Kim Seoyoon, Kim Sewon, Kim Sooyoung, Kim Sukyoung, Kim Taeyong, Ko Naeun, Koo Bonseung, Kwak Heeyoung, Kwon Haena, Kwon Youngjin, Lee Boram, Lee Bruce W., Lee Dagyeong, Lee Erin, Lee Euijin, Lee Ha Gyeong, Lee Hyojin, Lee Hyunjeong, Lee Jeeyoon, Lee Jeonghyun, Lee Jongheok, Lee Joonhyung, Lee Junhyuk, Lee Mingu, Lee Nayeon, Lee Sangkyu, Lee Se Young, Lee Seulgi, Lee Seung Jin, Lee Suhyeon, Lee Yeonjae, Lee Yesol, Lee Youngbeom, Lee Yujin, Li Shaodong, Liu Tianyu, Moon Seong-eun, Moon Taehong, Nihlenramstroem Max-lasse, Oh Wonseok, Oh Yuri, Park Hongbeen, Park Hyekyung, Park Jaeho, Park Nohil, Park Sangjin, Ryu Jiwon, Ryu Miru, Ryu Simo, Seo Ahreum, Seo Hee, Seo Kangdeok, Shin Jamin, Shin Seungyoun, Sin Heetae, Wang Jiangping, Wang Lei, Xiang Ning, Xiao Longxiang, Xu Jing, Yi Seonyeong, Yoo Haanju, Yoo Haneul, Yoo Hwanhee, Yu Liang, Yu Youngjae, Yuan Weijie, Zeng Bo, Zhou Qian, Cho Kyunghyun, Ha Jung-woo, Park Joonsuk, Hwang Jihyun, Kwon Hyoung Jo, Kwon Soonyong, Lee Jungyeon, Lee Seungho, Lim Seonghyeon, Noh Hyunkyung, Choi Seungho, Lee Sang-woo, Lim Jung Hwa, Sung Nako
- C-TPT: Calibrated Test-time Prompt Tuning For Vision-language Models Via Text Feature Dispersion Yoon Hee Suk, Yoon Eunseop, Tee Joshua Tian Jin, Hasegawa-johnson Mark, Li Yingzhen, Yoo Chang D.
- Shiftaddllm: Accelerating Pretrained Llms Via Post-training Multiplication-less Reparameterization You Haoran, Guo Yipin, Fu Yichao, Zhou Wei, Shi Huihong, Zhang Xiaofan, Kundu Souvik, Yazdanbakhsh Amir, Lin Yingyan Celine
- Flow Of Reasoning: Efficient Training Of LLM Policy With Divergent Thinking Yu Fangxu, Jiang Lai, Kang Haoqiang, Hao Shibo, Qin Lianhui
- Iterative Graph Alignment Yu Fangyuan, Arora Hardeep Singh, Johnson Matt
- Neeko: Leveraging Dynamic Lora For Efficient Multi-character Role-playing Agent Yu Xiaoyan, Luo Tongxu, Wei Yifan, Lei Fangyu, Huang Yiming, Peng Hao, Zhu Liehuang
- Rankrag: Unifying Context Ranking With Retrieval-augmented Generation In Llms Yu Yue, Ping Wei, Liu Zihan, Wang Boxin, You Jiaxuan, Zhang Chao, Shoeybi Mohammad, Catanzaro Bryan
- Kieval: A Knowledge-grounded Interactive Evaluation Framework For Large Language Models Yu Zhuohao, Gao Chang, Yao Wenjin, Wang Yidong, Ye Wei, Wang Jindong, Xie Xing, Zhang Yue, Zhang Shikun
- Towards A Holistic Evaluation Of Llms On Factual Knowledge Recall Yuan Jiaqing, Pan Lin, Hang Chung-wei, Guo Jiang, Jiang Jiarong, Min Bonan, Ng Patrick, Wang Zhiguo
- Advancing LLM Reasoning Generalists With Preference Trees Yuan Lifan, Cui Ganqu, Wang Hanbin, Ding Ning, Wang Xingyao, Deng Jia, Shan Boji, Chen Huimin, Xie Ruobing, Lin Yankai, Liu Zhenghao, Zhou Bowen, Peng Hao, Liu Zhiyuan, Sun Maosong
- Why Lift So Heavy? Slimming Large Language Models By Cutting Off The Layers Yuan Shuzhou, Nie Ercong, Ma Bolei, Färber Michael
- Self-rewarding Language Models Yuan Weizhe, Pang Richard Yuanzhe, Cho Kyunghyun, Li Xian, Sukhbaatar Sainbayar, Xu Jing, Weston Jason
- Tcmbench: A Comprehensive Benchmark For Evaluating Large Language Models In Traditional Chinese Medicine Yue Wenjing, Wang Xiaoling, Zhu Wei, Guan Ming, Zheng Huanran, Wang Pengfei, Sun Changzhi, Ma Xin
- Mammoth2: Scaling Instructions From The Web Yue Xiang, Zheng Tuney, Zhang Ge, Chen Wenhu
- Investigating Continual Pretraining In Large Language Models: Insights And Implications Yıldız Çağatay, Ravichandran Nishaanth Kanna, Punia Prishruit, Bethge Matthias, Ermis Beyza
- Building Accurate Translation-tailored Llms With Language Aware Instruction Tuning Zan Changtong, Ding Liang, Shen Li, Zhen Yibing, Liu Weifeng, Tao Dacheng
- Understanding And Mitigating Compositional Issues In Text-to-image Generative Models Zarei Arman, Rezaei Keivan, Basu Samyadeep, Saberi Mehrdad, Moayeri Mazda, Kattakinda Priyatham, Feizi Soheil
- Tell Me Why: Explainable Public Health Fact-checking With Large Language Models Zarharan Majid, Wullschleger Pascal, Kia Babak Behkam, Pilehvar Mohammad Taher, Foster Jennifer
- Skywork-math: Data Scaling Laws For Mathematical Reasoning In Large Language Models -- The Story Goes On Zeng Liang, Zhong Liangjun, Zhao Liang, Wei Tianwen, Yang Liu, He Jujie, Cheng Cheng, Hu Rui, Liu Yang, Yan Shuicheng, Fang Han, Zhou Yahui
- Token-level Direct Preference Optimization Zeng Yongcheng, Liu Guoqing, Ma Weiyu, Yang Ning, Zhang Haifeng, Wang Jun
- Can Mllms Perform Text-to-image In-context Learning? Zeng Yuchen, Kang Wonjun, Chen Yicong, Koo Hyung Il, Lee Kangwook
- Prefix Text As A Yarn: Eliciting Non-english Alignment In Foundation Language Model Zhan Runzhe, Yang Xinyi, Wong Derek F., Chao Lidia S., Zhang Yue
- Skyeyegpt: Unifying Remote Sensing Vision-language Tasks Via Instruction Tuning With Large Language Model Zhan Yang, Xiong Zhitong, Yuan Yuan
- When Scaling Meets LLM Finetuning: The Effect Of Data, Model And Finetuning Method Zhang Biao, Liu Zhongtao, Cherry Colin, Firat Orhan
- Simple Techniques For Enhancing Sentence Embeddings In Generative Language Models Zhang Bowen, Chang Kehua, Li Chunping
- Ts-align: A Teacher-student Collaborative Framework For Scalable Iterative Finetuning Of Large Language Models Zhang Chen, Tang Chengguang, Chong Dading, Shi Ke, Tang Guohua, Jiang Feng, Li Haizhou
- Sciglm: Training Scientific Language Models With Self-reflective Instruction Annotation And Tuning Zhang Dan, Hu Ziniu, Zhoubian Sining, Du Zhengxiao, Yang Kaiyu, Wang Zihan, Yue Yisong, Dong Yuxiao, Tang Jie
- Rest-mcts*: LLM Self-training Via Process Reward Guided Tree Search Zhang Dan, Zhoubian Sining, Hu Ziniu, Yue Yisong, Dong Yuxiao, Tang Jie
- Chemllm: A Chemical Large Language Model Zhang Di, Liu Wei, Tan Qian, Chen Jingdan, Yan Hang, Yan Yuliang, Li Jiatong, Huang Weiran, Yue Xiangyu, Ouyang Wanli, Zhou Dongzhan, Zhang Shufei, Su Mao, Zhong Han-sen, Li Yuqiang
- Unforgettable Generalization In Language Models Zhang Eric, Choshen Leshem, Andreas Jacob
- Closing The Gap Between Open-source And Commercial Large Language Models For Medical Evidence Summarization Zhang Gongbo, Jin Qiao, Zhou Yiliang, Wang Song, Idnay Betina R., Luo Yiming, Park Elizabeth, Nestor Jordan G., Spotnitz Matthew E., Soroush Ali, Campion Thomas, Lu Zhiyong, Weng Chunhua, Peng Yifan
- Sinklora: Enhanced Efficiency And Chat Capabilities For Long-context Large Language Models Zhang Hengyu
- Balancing Speciality And Versatility: A Coarse To Fine Framework For Supervised Fine-tuning Large Language Model Zhang Hengyuan, Wu Yanru, Li Dawei, Yang Sak, Zhao Rui, Jiang Yong, Tan Fei
- Longcite: Enabling Llms To Generate Fine-grained Citations In Long-context QA Zhang Jiajie, Bai Yushi, Lv Xin, Gu Wanjun, Liu Danqing, Zou Minhao, Cao Shulin, Hou Lei, Dong Yuxiao, Feng Ling, Li Juanzi
- Agentohana: Design Unified Data And Training Pipeline For Effective Agent Learning Zhang Jianguo, Lan Tian, Murthy Rithesh, Liu Zhiwei, Yao Weiran, Tan Juntao, Hoang Thai, Yang Liangwei, Feng Yihao, Liu Zuxin, Awalgaonkar Tulika, Niebles Juan Carlos, Savarese Silvio, Heinecke Shelby, Wang Huan, Xiong Caiming
- Should We Fear Large Language Models? A Structural Analysis Of The Human Reasoning System For Elucidating LLM Capabilities And Risks Through The Lens Of Heidegger's Philosophy Zhang Jianqiu
- Regurgitative Training: The Value Of Real Data In Training Large Language Models Zhang Jinghui, Qiao Dandan, Yang Mochen, Wei Qiang
- Read And Think: An Efficient Step-wise Multimodal Language Model For Document Understanding And Reasoning Zhang Jinxu
- Android In The Zoo: Chain-of-action-thought For GUI Agents Zhang Jiwen, Wu Jihao, Teng Yihua, Liao Minghui, Xu Nuo, Xiao Xiao, Wei Zhongyu, Tang Duyu
- Fast And Slow Generating: An Empirical Study On Large And Small Language Models Collaborative Decoding Zhang Kaiyan, Wang Jianyu, Ding Ning, Qi Biqing, Hua Ermo, Lv Xingtai, Zhou Bowen
- Transfertod: A Generalizable Chinese Multi-domain Task-oriented Dialogue System With Transfer Capabilities Zhang Ming, Huang Caishuang, Wu Yilong, Liu Shichun, Zheng Huiyuan, Dong Yurui, Shen Yujiong, Dou Shihan, Zhao Jun, Ye Junjie, Zhang Qi, Gui Tao, Huang Xuanjing
- Internlm-xcomposer-2.5: A Versatile Large Vision Language Model Supporting Long-contextual Input And Output Zhang Pan, Dong Xiaoyi, Zang Yuhang, Cao Yuhang, Qian Rui, Chen Lin, Guo Qipeng, Duan Haodong, Wang Bin, Ouyang Linke, Zhang Songyang, Zhang Wenwei, Li Yining, Gao Yang, Sun Peng, Zhang Xinyue, Li Wei, Li Jingwen, Wang Wenhai, Yan Hang, He Conghui, Zhang Xingcheng, Chen Kai, Dai Jifeng, Qiao Yu, Lin Dahua, Wang Jiaqi
- Soaring From 4K To 400K: Extending Llm's Context With Activation Beacon Zhang Peitian, Liu Zheng, Xiao Shitao, Shao Ninglu, Ye Qiwei, Dou Zhicheng
- Extending Llama-3's Context Ten-fold Overnight Zhang Peitian, Shao Ninglu, Liu Zheng, Xiao Shitao, Qian Hongjin, Ye Qiwei, Dou Zhicheng
- RECOST: External Knowledge Guided Data-efficient Instruction Tuning Zhang Qi, Zhang Yiming, Wang Haobo, Zhao Junbo
- Llava-read: Enhancing Reading Ability Of Multimodal Language Models Zhang Ruiyi, Zhou Yufan, Chen Jian, Gu Jiuxiang, Chen Changyou, Sun Tong
- Getting More From Less: Large Language Models Are Good Spontaneous Multilingual Learners Zhang Shimao, Gao Changjiang, Zhu Wenhao, Chen Jiajun, Huang Xin, Han Xue, Feng Junlan, Deng Chao, Huang Shujian
- Improving Diversity Of Commonsense Generation By Large Language Models Via In-context Learning Zhang Tianhui, Peng Bei, Bollegala Danushka
- RAFT: Adapting Language Model To Domain Specific RAG Zhang Tianjun, Patil Shishir G., Jain Naman, Shen Sheng, Zaharia Matei, Stoica Ion, Gonzalez Joseph E.
- Promptfix: Few-shot Backdoor Removal Via Adversarial Prompt Tuning Zhang Tianrong, Xi Zhaohan, Wang Ting, Mitra Prasenjit, Chen Jinghui
- Interpreting And Improving Large Language Models In Arithmetic Calculation Zhang Wei, Wan Chaoqun, Zhang Yonggang, Cheung Yiu-ming, Tian Xinmei, Shen Xu, Ye Jieping
- Defining Boundaries: A Spectrum Of Task Feasibility For Large Language Models Zhang Wenbo, Xu Zihang, Cai Hengrui
- Methodology Of Adapting Large English Language Models For Specific Cultural Contexts Zhang Wenjing, Xiao Siqi, Lei Xuejiao, Wang Ning, Zhang Huazheng, An Meijuan, Yang Bikun, Liu Zhaoxiang, Wang Kai, Lian Shiguo
- Agent-pro: Learning To Evolve Via Policy-level Reflection And Optimization Zhang Wenqi, Tang Ke, Wu Hai, Wang Mengna, Shen Yongliang, Hou Guiyang, Tan Zeqi, Li Peng, Zhuang Yueting, Lu Weiming
- Llms Instruct Llms: An Extraction And Editing Method Zhang Xin, Ju Tianjie, Liang Huijia, Fu Ying, Zhang Qin
- Ask-before-plan: Proactive Language Agents For Real-world Planning Zhang Xuan, Deng Yang, Ren Zifeng, Ng See-kiong, Chua Tat-seng
- Chain Of Preference Optimization: Improving Chain-of-thought Reasoning In Llms Zhang Xuan, Du Chao, Pang Tianyu, Liu Qian, Gao Wei, Lin Min
- Finercut: Finer-grained Interpretable Layer Pruning For Large Language Models Zhang Yang, Li Yawei, Wang Xinpeng, Shen Qianli, Plank Barbara, Bischl Bernd, Rezaei Mina, Kawaguchi Kenji
- Pushing The Limit Of LLM Capacity For Text Classification Zhang Yazhou, Wang Mengyao, Ren Chenyu, Li Qiuchi, Tiwari Prayag, Wang Benyou, Qin Jing
- Mg-verilog: Multi-grained Dataset Towards Enhanced Llm-assisted Verilog Generation Zhang Yongan, Yu Zhongzhi, Fu Yonggan, Wan Cheng, Lin Yingyan Celine
- Small Language Models Need Strong Verifiers To Self-correct Reasoning Zhang Yunxiang, Khalifa Muhammad, Logeswaran Lajanugen, Kim Jaekyeom, Lee Moontae, Lee Honglak, Wang Lu
- Adam-mini: Use Fewer Learning Rates To Gain More Zhang Yushun, Chen Congliang, Li Ziniu, Ding Tian, Wu Chenwei, Ye Yinyu, Luo Zhi-quan, Sun Ruoyu
- Position-aware Parameter Efficient Fine-tuning Approach For Reducing Positional Bias In Llms Zhang Zheng, Yang Fan, Jiang Ziyan, Chen Zheng, Zhao Zhengyang, Ma Chengyuan, Zhao Liang, Liu Yang
- Found In The Middle: How Language Models Use Long Contexts Better Via Plug-and-play Positional Encoding Zhang Zhenyu, Chen Runjin, Liu Shiwei, Yao Zhewei, Ruwase Olatunji, Chen Beidi, Wu Xiaoxia, Wang Zhangyang
- Safe Unlearning: A Surprisingly Effective And Generalizable Solution To Defend Against Jailbreak Attacks Zhang Zhexin, Yang Junxiao, Ke Pei, Cui Shiyao, Zheng Chujie, Wang Hongning, Huang Minlie
- Preserving Knowledge In Large Language Model With Model-agnostic Self-decompression Zhang Zilun, Sun Yutao, Zhao Tiancheng, Sha Leigang, Xu Ruochen, Lee Kyusong, Yin Jianwei
- APT: Adaptive Pruning And Tuning Pretrained Language Models For Efficient Training And Inference Zhao Bowen, Hajishirzi Hannaneh, Cao Qingqing
- LANE: Logic Alignment Of Non-tuning Large Language Models And Online Recommendation Systems For Explainable Reason Generation Zhao Hongke, Zheng Songming, Wu Likang, Yu Bowen, Wang Jing
- Galore: Memory-efficient LLM Training By Gradient Low-rank Projection Zhao Jiawei, Zhang Zhenyu, Chen Beidi, Wang Zhangyang, Anandkumar Anima, Tian Yuandong
- Exploring The Compositional Deficiency Of Large Language Models In Mathematical Reasoning Zhao Jun, Tong Jingqi, Mou Yurong, Zhang Ming, Zhang Qi, Huang Xuanjing
- Lora Land: 310 Fine-tuned Llms That Rival GPT-4, A Technical Report Zhao Justin, Wang Timothy, Abid Wael, Angus Geoffrey, Garg Arnav, Kinnison Jeffery, Sherstinsky Alex, Molino Piero, Addair Travis, Rishi Devvret
- Longskywork: A Training Recipe For Efficiently Extending Context Length In Large Language Models Zhao Liang, Wei Tianwen, Zeng Liang, Cheng Cheng, Yang Liu, Cheng Peng, Wang Lijie, Li Chenxia, Wu Xuejie, Zhu Bo, Gan Yimeng, Hu Rui, Yan Shuicheng, Fang Han, Zhou Yahui
- Mitigating Object Hallucination In Large Vision-language Models Via Classifier-free Guidance Zhao Linxi, Deng Yihe, Zhang Weitong, Gu Quanquan
- Diffagent: Fast And Accurate Text-to-image API Selection With Large Language Model Zhao Lirui, Yang Yue, Zhang Kaipeng, Shao Wenqi, Zhang Yuxin, Qiao Yu, Luo Ping, Ji Rongrong
- Aqulia-med LLM: Pioneering Full-process Open-source Medical Language Models Zhao Lulu, Zeng Weihao, Shi Xiaofeng, Zhou Hua, Hao Donglin, Lin Yonghua
- Efficiently Training 7B LLM With 1 Million Sequence Length On 8 Gpus Zhao Pinxue, Zhang Hailin, Fu Fangcheng, Nie Xiaonan, Liu Qibin, Yang Fang, Peng Yuanbo, Jiao Dian, Li Shuaipeng, Xue Jinbao, Tao Yangyu, Cui Bin
- A Survey Of Backdoor Attacks And Defenses On Large Language Models: Implications For Security Measures Zhao Shuai, Jia Meihuizi, Guo Zhongliang, Gan Leilei, Xu Xiaoyu, Wu Xiaobao, Fu Jie, Feng Yichao, Pan Fengjun, Tuan Luu Anh
- Defending Large Language Models Against Jailbreak Attacks Via Layer-specific Editing Zhao Wei, Li Zhe, Li Yige, Zhang Ye, Sun Jun
- Ouroboros: Generating Longer Drafts Phrase By Phrase For Faster Speculative Decoding Zhao Weilin, Huang Yuxiang, Han Xu, Xu Wang, Xiao Chaojun, Zhang Xinrong, Fang Yewei, Zhang Kaihuo, Liu Zhiyuan, Sun Maosong
- Both Matter: Enhancing The Emotional Intelligence Of Large Language Models Without Compromising The General Intelligence Zhao Weixiang, Li Zhuojun, Wang Shilong, Wang Yang, Hu Yulin, Zhao Yanyan, Wei Chen, Qin Bing
- SAPT: A Shared Attention Framework For Parameter-efficient Continual Learning Of Large Language Models Zhao Weixiang, Wang Shilong, Hu Yulin, Zhao Yanyan, Qin Bing, Zhang Xuanyu, Yang Qing, Xu Dongliang, Che Wanxiang
- Enhancing Cross-domain Pre-trained Decision Transformers With Adaptive Attention Zhao Wenhao, Xu Qiushui, Xu Linjie, Song Lei, Wang Jinyu, Zhou Chunlai, Bian Jiang
- Wildchat: 1M Chatgpt Interaction Logs In The Wild Zhao Wenting, Ren Xiang, Hessel Jack, Cardie Claire, Choi Yejin, Deng Yuntian
- How Do Large Language Models Handle Multilingualism? Zhao Yiran, Zhang Wenxuan, Chen Guizhen, Kawaguchi Kenji, Bing Lidong
- An Empirical Study Of Retrieval Augmented Generation With Chain-of-thought Zhao Yuetong, Cao Hongyu, Zhao Xianyu, Ou Zhijian
- Improving The Robustness Of Large Language Models Via Consistency Alignment Zhao Yukun, Yan Lingyong, Sun Weiwei, Xing Guoliang, Wang Shuaiqiang, Meng Chong, Cheng Zhicong, Ren Zhaochun, Yin Dawei
- SWIFT: A Scalable Lightweight Infrastructure For Fine-tuning Zhao Yuze, Huang Jintao, Hu Jinghan, Wang Xingjun, Mao Yunlin, Zhang Daoze, Jiang Zeyinzi, Wu Zhikai, Ai Baole, Wang Ang, Zhou Wenmeng, Chen Yingda
- Adversarial Contrastive Decoding: Boosting Safety Alignment Of Large Language Models Via Opposite Prompt Optimization Zhao Zhengyue, Zhang Xiaoyun, Xu Kaidi, Hu Xing, Zhang Rui, Du Zidong, Guo Qi, Chen Yunji
- Retrieval-augmented Mixture Of Lora Experts For Uploadable Machine Learning Zhao Ziyu, Gan Leilei, Wang Guoyin, Hu Yuwei, Shen Tao, Yang Hongxia, Kuang Kun, Wu Fei
- Loraretriever: Input-aware Lora Retrieval And Composition For Mixed Tasks In The Wild Zhao Ziyu, Gan Leilei, Wang Guoyin, Zhou Wangchunshu, Yang Hongxia, Kuang Kun, Wu Fei
- ICE-GRT: Instruction Context Enhancement By Generative Reinforcement Based Transformers Zheng Chen, Sun Ke, Tang Da, Ma Yukun, Zhang Yuyu, Xi Chenguang, Zhou Xun
- Balancing Enhancement, Harmlessness, And General Capabilities: Enhancing Conversational Llms With Direct RLHF Zheng Chen, Sun Ke, Wu Hang, Xi Chenguang, Zhou Xun
- Weak-to-strong Extrapolation Expedites Alignment Zheng Chujie, Wang Ziqi, Ji Heng, Huang Minlie, Peng Nanyun
- Large Language Models As Reliable Knowledge Bases? Zheng Danna, Lapata Mirella, Pan Jeff Z.
- Fine-tuning Large Language Models For Domain-specific Machine Translation Zheng Jiawei, Hong Hanghai, Wang Xiaoli, Su Jingsong, Liang Yonggui, Wu Shikai
- Can Llms Learn New Concepts Incrementally Without Forgetting? Zheng Junhao, Qiu Shengjie, Ma Qianli
- Kun: Answer Polishment For Chinese Self-alignment With Instruction Back-translation Zheng Tianyu, Guo Shuyue, Qu Xingwei, Guo Jiawei, Du Xinrun, Jia Qi, Lin Chenghua, Huang Wenhao, Fu Jie, Zhang Ge
- Llamafactory: Unified Efficient Fine-tuning Of 100+ Language Models Zheng Yaowei, Zhang Richong, Zhang Junhao, Ye Yanhan, Luo Zheyan, Feng Zhangchi, Ma Yongqiang
- Harnessing Large Language Models For Text-rich Sequential Recommendation Zheng Zhi, Chao Wenshuo, Qiu Zhaopeng, Zhu Hengshu, Xiong Hui
- Training Large Language Models For Reasoning Through Reverse Curriculum Reinforcement Learning Xi Zhiheng, Chen Wenxiang, Hong Boyang, Jin Senjie, Zheng Rui, He Wei, Ding Yiwen, Liu Shichun, Guo Xin, Wang Junzhe, Guo Honglin, Shen Wei, Fan Xiaoran, Zhou Yuhao, Dou Shihan, Wang Xiao, Zhang Xinbo, Sun Peng, Gui Tao, Zhang Qi, Huang Xuanjing
- Agentgym: Evolving Large Language Model-based Agents Across Diverse Environments Xi Zhiheng, Ding Yiwen, Chen Wenxiang, Hong Boyang, Guo Honglin, Wang Junzhe, Yang Dingwen, Liao Chenyang, Guo Xin, He Wei, Gao Songyang, Chen Lu, Zheng Rui, Zou Yicheng, Gui Tao, Zhang Qi, Qiu Xipeng, Huang Xuanjing, Wu Zuxuan, Jiang Yu-gang
- Moextend: Tuning New Experts For Modality And Task Extension Zhong Shanshan, Gao Shanghua, Huang Zhongzhan, Wen Wushao, Zitnik Marinka, Zhou Pan
- Harnessing Large Language Models As Post-hoc Correctors Zhong Zhiqiang, Zhou Kuangyu, Mottin Davide
- Synthet2c: Generating Synthetic Data For Fine-tuning Large Language Models On The Text2cypher Task Zhong Ziije, Zhong Linqing, Sun Zhaoze, Jin Qingyun, Qin Zengchang, Zhang Xiaofan
- Sequence To Sequence Reward Modeling: Improving RLHF By Language Feedback Zhou Jiayi, Ji Jiaming, Dai Juntao, Yang Yaodong
- Orchestrating Llms With Different Personalizations Zhou Jin Peng, Luo Katie Z., Gu Jingwen, Yuan Jason, Weinberger Kilian Q., Sun Wen
- Qog: Question And Options Generation Based On Language Model Zhou Jincheng
- Leveraging Web-crawled Data For High-quality Fine-tuning Zhou Jing, Jiang Chenglin, Shen Wei, Zhou Xiao, He Xiaonan
- Towards Democratizing Multilingual Large Language Models For Medicine Through A Two-stage Instruction Fine-tuning Approach Zhou Meng, Parmar Surajsinh, Bhatti Anubhav
- Enhancing The General Agent Capabilities Of Low-parameter Llms Through Tuning And Multi-branch Reasoning Zhou Qinhao, Zhang Zihan, Xiang Xiang, Wang Ke, Wu Yuchuan, Li Yongbin
- Explaining Pre-trained Language Models With Attribution Scores: An Analysis In Low-resource Settings Zhou Wei, Adel Heike, Schuff Hendrik, Vu Ngoc Thang
- Archer: Training Language Model Agents Via Hierarchical Multi-turn RL Zhou Yifei, Zanette Andrea, Pan Jiayi, Levine Sergey, Kumar Aviral
- Aligning Modalities In Vision Large Language Models Via Preference Fine-tuning Zhou Yiyang, Cui Chenhang, Rafailov Rafael, Finn Chelsea, Yao Huaxiu
- Calibrated Self-rewarding Vision Language Models Zhou Yiyang, Fan Zhiyuan, Cheng Dongjie, Yang Sihan, Chen Zhaorun, Cui Chenhang, Wang Xiyao, Li Yun, Zhang Linjun, Yao Huaxiu
- Dual Instruction Tuning With Large Language Models For Mathematical Reasoning Zhou Yongwei, Zhao Tiejun
- Defending Jailbreak Prompts Via In-context Adversarial Game Zhou Yujun, Han Yufei, Zhuang Haomin, Guo Kehan, Liang Zhenwen, Bao Hongyan, Zhang Xiangliang
- Metagpt: Merging Large Language Models Using Model Exclusive Task Arithmetic Zhou Yuyan, Song Liang, Wang Bingning, Chen Weipeng
- Emulated Disalignment: Safety Alignment For Large Language Models May Backfire! Zhou Zhanhui, Liu Jie, Dong Zhichen, Liu Jiaheng, Yang Chao, Ouyang Wanli, Qiao Yu
- Lawgpt: A Chinese Legal Knowledge-enhanced Large Language Model Zhou Zhi, Shi Jiang-xin, Song Peng-xiao, Yang Xiao-wen, Jin Yi-xuan, Guo Lan-zhe, Li Yu-feng
- Fine-tuning Large Language Models To Translate: Will A Touch Of Noisy Data In Misaligned Languages Suffice? Zhu Dawei, Chen Pinzhen, Zhang Miaoran, Haddow Barry, Shen Xiaoyu, Klakow Dietrich
- A Preference-driven Paradigm For Enhanced Translation With Large Language Models Zhu Dawei, Trenous Sony, Shen Xiaoyu, Klakow Dietrich, Byrne Bill, Hasler Eva
- TAT-LLM: A Specialized Language Model For Discrete Reasoning Over Tabular And Textual Data Zhu Fengbin, Liu Ziyang, Feng Fuli, Wang Chao, Li Moxin, Chua Tat-seng
- FANNO: Augmenting High-quality Instruction Data With Open-sourced Llms Only Zhu He, Su Junyou, Lun Tianle, Tao Yicheng, Zhang Wenjia, Fan Zipei, Chen Guanhua
- Lifelong Personalized Low-rank Adaptation Of Large Language Models For Recommendation Zhu Jiachen, Lin Jianghao, Dai Xinyi, Chen Bo, Shan Rong, Zhu Jieming, Tang Ruiming, Yu Yong, Zhang Weinan
- Collectivesft: Scaling Large Language Models For Chinese Medical Benchmark With Collective Instructions In Healthcare Zhu Jingwei, Tan Minghuan, Yang Min, Li Ruixue, Alinejad-rokny Hamid
- Fastmem: Fast Memorization Of Prompt Improves Context Awareness Of Large Language Models Zhu Junyi, Liu Shuochen, Yu Yu, Tang Bo, Yan Yibo, Li Zhiyu, Xiong Feiyu, Xu Tong, Blaschko Matthew B.
- LIRE: Listwise Reward Enhancement For Preference Alignment Zhu Mingye, Liu Yi, Zhang Lei, Guo Junbo, Mao Zhendong
- Parameter-efficient Tuning Large Language Models For Graph Representation Learning Zhu Qi, Zheng Da, Song Xiang, Zhang Shichang, Jin Bowen, Sun Yizhou, Karypis George
- IAPT: Instruction-aware Prompt Tuning For Large Language Models Zhu Wei, Tian Aaron Xuxiang, Yin Congrui, Ni Yuan, Wang Xiaoling, Xie Guotong
- Distilling Mathematical Reasoning Capabilities Into Small Language Models Zhu Xunyu, Li Jian, Liu Yong, Ma Can, Wang Weiping
- One Token Can Help! Learning Scalable And Pluggable Virtual Tokens For Retrieval-augmented Large Language Models Zhu Yutao, Huang Zhaoheng, Dou Zhicheng, Wen Ji-rong
- Yulan: An Open-source Large Language Model Zhu Yutao, Zhou Kun, Mao Kelong, Chen Wentong, Sun Yiding, Chen Zhipeng, Cao Qian, Wu Yihan, Chen Yushuo, Wang Feng, Zhang Lei, Li Junyi, Wang Xiaolei, Wang Lei, Zhang Beichen, Dong Zican, Cheng Xiaoxue, Chen Yuhan, Tang Xinyu, Hou Yupeng, Ren Qiangqiang, Pang Xincheng, Xie Shufang, Zhao Wayne Xin, Dou Zhicheng, Mao Jiaxin, Lin Yankai, Song Ruihua, Xu Jun, Chen Xu, Yan Rui, Wei Zhewei, Hu Di, Huang Wenbing, Gao Ze-feng, Chen Yueguo, Lu Weizheng, Wen Ji-rong
- Astraios: Parameter-efficient Instruction Tuning Code Large Language Models Zhuo Terry Yue, Zebaze Armel, Suppattarachai Nitchakarn, Von Werra Leandro, De Vries Harm, Liu Qian, Muennighoff Niklas
- Language Model Sentence Completion With A Parser-driven Rhetorical Control Method Zingale Joshua, Kalita Jugal
- Multi-lingual Malaysian Embedding: Leveraging Large Language Models For Semantic Representations Zolkepli Husein, Razak Aisyah, Adha Kamarul, Nazhan Ariff
- Proswitch: Knowledge-guided Instruction Tuning To Generate Professional And Non-professional Styled Text Zong Chang, Chen Yuyan, Lu Weiming, Shao Jian, Zhuang Yueting
- Llama-excitor: General Instruction Tuning Via Indirect Feature Interaction Zou Bo, Yang Chao, Qiao Yu, Quan Chengbin, Zhao Youjian
- Implicitave: An Open-source Dataset And Multimodal Llms Benchmark For Implicit Attribute Value Extraction Zou Henry Peng, Samuel Vinay, Zhou Yue, Zhang Weizhi, Fang Liancheng, Song Zihe, Yu Philip S., Caragea Cornelia
- Promptintern: Saving Inference Costs By Internalizing Recurrent Prompt During Large Language Model Fine-tuning Zou Jiaru, Zhou Mengyu, Li Tao, Han Shi, Zhang Dongmei
- Systematic Task Exploration With Llms: A Study In Citation Text Generation Şahinuç Furkan, Kuznetsov Ilia, Hou Yufang, Gurevych Iryna
🏷 GPT
- Robust Text-to-sql Generation With Execution-guided Decoding Wang Chenglong, Tatwawadi Kedar, Brockschmidt Marc, Huang Po-sen, Mao Yi, Polozov Oleksandr, Singh Rishabh
- Syntactically Supervised Transformers For Faster Neural Machine Translation Akoury Nader, Krishna Kalpesh, Iyyer Mohit
- Attending To Entities For Better Text Understanding Cheng Pengxiang, Erk Katrin
- Insertion-based Decoding With Automatically Inferred Generation Order Gu Jiatao, Liu Qi, Cho Kyunghyun
- Fine-tuning By Curriculum Learning For Non-autoregressive Neural Machine Translation Guo Junliang, Tan Xu, Xu Linli, Qin Tao, Chen Enhong, Liu Tie-yan
- Zero-shot Paraphrase Generation With Multilingual Language Models Guo Yinpeng, Liao Yi, Jiang Xin, Zhang Qing, Zhang Yibo, Liu Qun
- Exposure Bias Versus Self-recovery: Are Distortions Really Incremental For Autoregressive Text Generation? He Tianxing, Zhang Jingzhao, Zhou Zhiming, Glass James
- INSET: Sentence Infilling With Inter-sentential Transformer Huang Yichen, Zhang Yizhe, Elachqar Oussama, Cheng Yu
- Learning To Answer By Learning To Ask: Getting The Best Of GPT-2 And BERT Worlds Klein Tassilo, Nabi Moin
- Patent Claim Generation By Fine-tuning Openai GPT-2 Lee Jieh-sheng, Hsiang Jieh
- BART: Denoising Sequence-to-sequence Pre-training For Natural Language Generation, Translation, And Comprehension Lewis Mike, Liu Yinhan, Goyal Naman, Ghazvininejad Marjan, Mohamed Abdelrahman, Levy Omer, Stoyanov Ves, Zettlemoyer Luke
- Gpt-based Generation For Classical Chinese Poetry Liao Yi, Wang Yasheng, Liu Qun, Jiang Xin
- Megatron-lm: Training Multi-billion Parameter Language Models Using Model Parallelism Shoeybi Mohammad, Patwary Mostofa, Puri Raul, Legresley Patrick, Casper Jared, Catanzaro Bryan
- Dlgnet: A Transformer-based Model For Dialogue Response Generation Olabiyi Oluwatobi, Mueller Erik T.
- Adversarial Bootstrapping For Dialogue Model Training Olabiyi Oluwatobi, Mueller Erik T., Larson Christopher, Lahlou Tarek
- Leveraging Pre-trained Checkpoints For Sequence Generation Tasks Rothe Sascha, Narayan Shashi, Severyn Aliaksei
- Masked Language Model Scoring Salazar Julian, Liang Davis, Nguyen Toan Q., Kirchhoff Katrin
- Zero: Memory Optimizations Toward Training Trillion Parameter Models Rajbhandari Samyam, Rasley Jeff, Ruwase Olatunji, He Yuxiong
- Generalization In Generation: A Closer Look At Exposure Bias Schmidt Florian
- Do Massively Pretrained Language Models Make Better Storytellers? See Abigail, Pappu Aneesh, Saxena Rohun, Yerukola Akhila, Manning Christopher D.
- Insertion Transformer: Flexible Sequence Generation Via Insertion Operations Stern Mitchell, Chan William, Kiros Jamie, Uszkoreit Jakob
- Quantity Doesn't Buy Quality Syntax With Neural Language Models Van Schijndel Marten, Mueller Aaron, Linzen Tal
- Visualizing Attention In Transformer-based Language Representation Models Vig Jesse
- Training Optimus Prime, M.D.: Generating Medical Certification Items By Fine-tuning Openai's Gpt2 Transformer Model Von Davier Matthias
- Language Models With Transformers Wang Chenguang, Li Mu, Smola Alexander J.
- Alternating Recurrent Dialog Model With Large-scale Pre-trained Language Models Wu Qingyang, Zhang Yichi, Li Yu, Yu Zhou
- Towards Making The Most Of BERT In Neural Machine Translation Yang Jiacheng, Wang Mingxuan, Zhou Hao, Zhao Chengqi, Yu Yong, Zhang Weinan, Li Lei
- Model Compression With Two-stage Multi-teacher Knowledge Distillation For Web Question Answering System Yang Ze, Shou Linjun, Gong Ming, Lin Wutao, Jiang Daxin
- Q8BERT: Quantized 8bit BERT Zafrir Ofir, Boudoukh Guy, Izsak Peter, Wasserblat Moshe
- Dialogpt: Large-scale Generative Pre-training For Conversational Response Generation Zhang Yizhe, Sun Siqi, Galley Michel, Chen Yen-chun, Brockett Chris, Gao Xiang, Gao Jianfeng, Liu Jingjing, Dolan Bill
- Semantics-aware BERT For Language Understanding Zhang Zhuosheng, Wu Yuwei, Zhao Hai, Li Zuchao, Zhang Shuailiang, Zhou Xi, Zhou Xiang
- Unilmv2: Pseudo-masked Language Models For Unified Language Model Pre-training Bao Hangbo, Dong Li, Wei Furu, Wang Wenhui, Yang Nan, Liu Xiaodong, Wang Yu, Piao Songhao, Gao Jianfeng, Zhou Ming, Hon Hsiao-wuen
- PALM: Pre-training An Autoencoding&autoregressive Language Model For Context-conditioned Generation Bi Bin, Li Chenliang, Wu Chen, Yan Ming, Wang Wei, Huang Songfang, Huang Fei, Si Luo
- Language Models Are Few-shot Learners Brown Tom B., Mann Benjamin, Ryder Nick, Subbiah Melanie, Kaplan Jared, Dhariwal Prafulla, Neelakantan Arvind, Shyam Pranav, Sastry Girish, Askell Amanda, Agarwal Sandhini, Herbert-voss Ariel, Krueger Gretchen, Henighan Tom, Child Rewon, Ramesh Aditya, Ziegler Daniel M., Wu Jeffrey, Winter Clemens, Hesse Christopher, Chen Mark, Sigler Eric, Litwin Mateusz, Gray Scott, Chess Benjamin, Clark Jack, Berner Christopher, Mccandlish Sam, Radford Alec, Sutskever Ilya, Amodei Dario
- Pretrained Language Models For Dialogue Generation With Multiple Input Sources Cao Yu, Bi Wei, Fang Meng, Tao Dacheng
- Pymt5: Multi-mode Translation Of Natural Language And Python Code With Transformers Clement Colin B., Drain Dawn, Timcheck Jonathan, Svyatkovskiy Alexey, Sundaresan Neel
- Cascaded Text Generation With Markov Transformers Deng Yuntian, Rush Alexander M.
- Turngpt: A Transformer-based Language Model For Predicting Turn-taking In Spoken Dialog Ekstedt Erik, Skantze Gabriel
- CERT: Contrastive Self-supervised Learning For Language Understanding Fang Hongchao, Wang Sicheng, Zhou Meng, Ding Jiayuan, Xie Pengtao
- The Pile: An 800GB Dataset Of Diverse Text For Language Modeling Gao Leo, Biderman Stella, Black Sid, Golding Laurence, Hoppe Travis, Foster Charles, Phang Jason, He Horace, Thite Anish, Nabeshima Noa, Presser Shawn, Leahy Connor
- Are Neural Open-domain Dialog Systems Robust To Speech Recognition Errors In The Dialog History? An Empirical Study Gopalakrishnan Karthik, Hedayatnia Behnam, Wang Longshaokan, Liu Yang, Hakkani-tur Dilek
- A Tailored Pre-training Model For Task-oriented Dialog Generation Gu Jing, Wu Qingyang, Wu Chongruo, Shi Weiyan, Yu Zhou
- Dialogbert: Discourse-aware Response Generation Via Learning To Recover And Rank Utterances Gu Xiaodong, Yoo Kang Min, Ha Jung-woo
- Incorporating BERT Into Parallel Sequence Decoding With Adapters Guo Junliang, Zhang Zhirui, Xu Linli, Wei Hao-ran, Chen Boxing, Chen Enhong
- Investigating Representations Of Verb Bias In Neural Language Models Hawkins Robert D., Yamakoshi Takateru, Griffiths Thomas L., Goldberg Adele E.
- How Can We Know When Language Models Know? On The Calibration Of Language Models For Question Answering Jiang Zhengbao, Araki Jun, Ding Haibo, Neubig Graham
- Text-to-text Pre-training For Data-to-text Tasks Kale Mihir, Rastogi Abhinav
- Meta-context Transformers For Domain-specific Response Generation Kar Debanjana, Samanta Suranjana, Azad Amar Prakash
- Do Fine-tuned Commonsense Language Models Really Generalize? Kejriwal Mayank, Shen Ke
- Current Limitations Of Language Models: What You Need Is Retrieval Komatsuzaki Aran
- Common Sense Or World Knowledge? Investigating Adapter-based Knowledge Injection Into Pretrained Transformers Lauscher Anne, Majewska Olga, Ribeiro Leonardo F. R., Gurevych Iryna, Rozanov Nikolai, Glavaš Goran
- Optimus: Organizing Sentences Via Pre-trained Modeling Of A Latent Space Li Chunyuan, Gao Xiang, Li Yuan, Peng Baolin, Li Xiujun, Zhang Yizhe, Gao Jianfeng
- Probabilistically Masked Language Model Capable Of Autoregressive Generation In Arbitrary Word Order Liao Yi, Jiang Xin, Liu Qun
- Autoregressive Knowledge Distillation Through Imitation Learning Lin Alexander, Wohlwend Jeremy, Chen Howard, Lei Tao
- Multi-node Bert-pretraining: Cost-efficient Approach Lin Jiahuang, Li Xin, Pekhimenko Gennady
- Variational Transformers For Diverse Response Generation Lin Zhaojiang, Winata Genta Indra, Xu Peng, Liu Zihan, Fung Pascale
- Task-level Curriculum Learning For Non-autoregressive Neural Machine Translation Liu Jinglin, Ren Yi, Tan Xu, Zhang Chen, Qin Tao, Zhao Zhou, Liu Tie-yan
- Simplifying Paragraph-level Question Generation Via Transformer Language Models Lopez Luis Enrico, Cruz Diane Kathryn, Cruz Jan Christian Blaise, Cheng Charibeth
- Language Models As Few-shot Learner For Task-oriented Dialogue Systems Madotto Andrea, Liu Zihan, Lin Zhaojiang, Fung Pascale
- Incremental Processing In The Age Of Non-incremental Encoders: An Empirical Assessment Of Bidirectional Models For Incremental NLU Madureira Brielen, Schlangen David
- Gpt-too: A Language-model-first Approach For Amr-to-text Generation Mager Manuel, Astudillo Ramon Fernandez, Naseem Tahira, Sultan Md Arafat, Lee Young-suk, Florian Radu, Roukos Salim
- Modern Methods For Text Generation Montesinos Dimas Munoz
- The Chess Transformer: Mastering Play Using Generative Language Models Noever David, Ciolino Matt, Kalin Josh
- Better Distractions: Transformer-based Distractor Generation And Multiple Choice Question Filtering Offerijns Jeroen, Verberne Suzan, Verhoef Tessa
- Facts2story: Controlling Text Generation By Key Facts Orbach Eyal, Goldberg Yoav
- BANG: Bridging Autoregressive And Non-autoregressive Generation With Large Scale Pretraining Qi Weizhen, Gong Yeyun, Jiao Jian, Yan Yu, Chen Weizhu, Liu Dayiheng, Tang Kewen, Li Houqiang, Chen Jiusheng, Zhang Ruofei, Zhou Ming, Duan Nan
- Hierarchical GPT With Congruent Transformers For Multi-sentence Language Models Roh Jihyeon, Gim Huiseong, Lee Soo-young
- Knowledge-aware Language Model Pretraining Rosset Corby, Xiong Chenyan, Phan Minh, Song Xia, Bennett Paul, Tiwary Saurabh
- It's Not Just Size That Matters: Small Language Models Are Also Few-shot Learners Schick Timo, Schütze Hinrich
- Hybrid Generative-retrieval Transformers For Dialogue Domain Adaptation Shalyminov Igor, Sordoni Alessandro, Atkinson Adam, Schulz Hannes
- Lightpaff: A Two-stage Distillation Framework For Pre-training And Fine-tuning Song Kaitao, Sun Hao, Tan Xu, Qin Tao, Lu Jianfeng, Liu Hongzhi, Liu Tie-yan
- Making Pre-trained Language Models Better Few-shot Learners Gao Tianyu, Fisch Adam, Chen Danqi
- EDITOR: An Edit-based Transformer With Repositioning For Neural Machine Translation With Soft Lexical Constraints Xu Weijia, Carpuat Marine
- On The Generation Of Medical Dialogues For COVID-19 Yang Wenmian, Zeng Guangtao, Tan Bowen, Ju Zeqian, Chakravorty Subrato, He Xuehai, Chen Shu, Yang Xingyi, Wu Qingyang, Yu Zhou, Xing Eric, Xie Pengtao
- Adding Recurrence To Pretrained Transformers For Improved Efficiency And Context Size Yoshida Davis, Ettinger Allyson, Gimpel Kevin
- Few-shot Generative Conversational Query Rewriting Yu Shi, Liu Jiahua, Yang Jingqin, Xiong Chenyan, Bennett Paul, Gao Jianfeng, Liu Zhiyuan
- Emptransfo: A Multi-head Transformer Architecture For Creating Empathetic Dialog Systems Zandie Rohola, Mahoor Mohammad H.
- Turingadvice: A Generative And Dynamic Evaluation Of Language Use Zellers Rowan, Holtzman Ari, Clark Elizabeth, Qin Lianhui, Farhadi Ali, Choi Yejin
- Trojaning Language Models For Fun And Profit Zhang Xinyang, Zhang Zheng, Ji Shouling, Wang Ting
- POINTER: Constrained Progressive Text Generation Via Insertion-based Generative Pre-training Zhang Yizhe, Wang Guoyin, Li Chunyuan, Gan Zhe, Brockett Chris, Dolan Bill
- CPM: A Large-scale Generative Chinese Pre-trained Language Model Zhang Zhengyan, Han Xu, Zhou Hao, Ke Pei, Gu Yuxian, Ye Deming, Qin Yujia, Su Yusheng, Ji Haozhe, Guan Jian, Qi Fanchao, Wang Xiaozhi, Zheng Yanan, Zeng Guoyang, Cao Huanqi, Chen Shengqi, Li Daixuan, Sun Zhenbo, Liu Zhiyuan, Huang Minlie, Han Wentao, Tang Jie, Li Juanzi, Zhu Xiaoyan, Sun Maosong
- Improving Non-autoregressive Neural Machine Translation With Monolingual Data Zhou Jiawei, Keung Phillip
- RAFT: A Real-world Few-shot Text Classification Benchmark Alex Neel, Lifland Eli, Tunstall Lewis, Thakur Abhishek, Maham Pegah, Riedel C. Jess, Hine Emmie, Ashurst Carolyn, Sedille Paul, Carlier Alexis, Noetel Michael, Stuhlmüller Andreas
- On The Multilingual Capabilities Of Very Large-scale English Language Models Armengol-estapé Jordi, Bonet Ona De Gibert, Melero Maite
- Efficient Large Scale Language Modeling With Mixtures Of Experts Artetxe Mikel, Bhosale Shruti, Goyal Naman, Mihaylov Todor, Ott Myle, Shleifer Sam, Lin Xi Victoria, Du Jingfei, Iyer Srinivasan, Pasunuru Ramakanth, Anantharaman Giri, Li Xian, Chen Shuohui, Akin Halil, Baines Mandeep, Martin Louis, Zhou Xing, Koura Punit Singh, O'horo Brian, Wang Jeff, Zettlemoyer Luke, Diab Mona, Kozareva Zornitsa, Stoyanov Ves
- One Question Answering Model For Many Languages With Cross-lingual Dense Passage Retrieval Asai Akari, Yu Xinyan, Kasai Jungo, Hajishirzi Hannaneh
- Non-autoregressive Translation By Learning Target Categorical Codes Bao Yu, Huang Shujian, Xiao Tong, Wang Dongqi, Dai Xinyu, Chen Jiajun
- Improving Language Models By Retrieving From Trillions Of Tokens Borgeaud Sebastian, Mensch Arthur, Hoffmann Jordan, Cai Trevor, Rutherford Eliza, Millican Katie, Van Den Driessche George, Lespiau Jean-baptiste, Damoc Bogdan, Clark Aidan, De Las Casas Diego, Guy Aurelia, Menick Jacob, Ring Roman, Hennigan Tom, Huang Saffron, Maggiore Loren, Jones Chris, Cassirer Albin, Brock Andy, Paganini Michela, Irving Geoffrey, Vinyals Oriol, Osindero Simon, Simonyan Karen, Rae Jack W., Elsen Erich, Sifre Laurent
- What Changes Can Large-scale Language Models Bring? Intensive Study On Hyperclova: Billions-scale Korean Generative Pretrained Transformers Kim Boseop, Kim Hyoungseok, Lee Sang-woo, Lee Gichang, Kwak Donghyun, Jeon Dong Hyeon, Park Sunghyun, Kim Sungju, Kim Seonhoon, Seo Dongpil, Lee Heungsub, Jeong Minyoung, Lee Sungjae, Kim Minsub, Ko Suk Hyun, Kim Seokhun, Park Taeyong, Kim Jinuk, Kang Soyoung, Ryu Na-hyeon, Yoo Kang Min, Chang Minsuk, Suh Soobin, In Sookyo, Park Jinseong, Kim Kyungduk, Kim Hiun, Jeong Jisu, Yeo Yong Goo, Ham Donghoon, Park Dongju, Lee Min Young, Kang Jaewook, Kang Inho, Ha Jung-woo, Park Woomyoung, Sung Nako
- The Power Of Scale For Parameter-efficient Prompt Tuning Lester Brian, Al-rfou Rami, Constant Noah
- Gaudí: Conversational Interactions With Deep Representations To Generate Image Collections Bursztyn Victor S., Healey Jennifer, Vinay Vishwa
- Neural Data-to-text Generation With Lm-based Text Augmentation Chang Ernie, Shen Xiaoyu, Zhu Dawei, Demberg Vera, Su Hui
- Topic Transferable Table Question Answering Chemmengath Saneem Ahmed, Kumar Vishwajeet, Bharadwaj Samarth, Sen Jaydeep, Canim Mustafa, Chakrabarti Soumen, Gliozzo Alfio, Sankaranarayanan Karthik
- Visualgpt: Data-efficient Adaptation Of Pretrained Language Models For Image Captioning Chen Jun, Guo Han, Yi Kai, Li Boyang, Elhoseiny Mohamed
- MT6: Multilingual Pretrained Text-to-text Transformer With Translation Pairs Chi Zewen, Dong Li, Ma Shuming, Huang Shaohan, Mao Xian-ling, Huang Heyan, Wei Furu
- All That's 'human' Is Not Gold: Evaluating Human Evaluation Of Generated Text Clark Elizabeth, August Tal, Serrano Sofia, Haduong Nikita, Gururangan Suchin, Smith Noah A.
- Openprompt: An Open-source Framework For Prompt-learning Ding Ning, Hu Shengding, Zhao Weilin, Chen Yulin, Liu Zhiyuan, Zheng Hai-tao, Sun Maosong
- Not Quite 'ask A Librarian': AI On The Nature, Value, And Future Of LIS Dinneen Jesse David, Bubinger Helen
- Is GPT-3 Text Indistinguishable From Human Text? Scarecrow: A Framework For Scrutinizing Machine Text Dou Yao, Forbes Maxwell, Koncel-kedziorski Rik, Smith Noah A., Choi Yejin
- GLM: General Language Model Pretraining With Autoregressive Blank Infilling Du Zhengxiao, Qian Yujie, Liu Xiao, Ding Ming, Qiu Jiezhong, Yang Zhilin, Tang Jie
- Transformer-based Conditional Variational Autoencoder For Controllable Story Generation Fang Le, Zeng Tao, Liu Chaochun, Bo Liefeng, Dong Wen, Chen Changyou
- An Adversarially-learned Turing Test For Dialog Generation Models Gao Xiang, Zhang Yizhe, Galley Michel, Dolan Bill
- WARP: Word-level Adversarial Reprogramming Hambardzumyan Karen, Khachatrian Hrant, May Jonathan
- Text2app: A Framework For Creating Android Apps From Text Descriptions Hasan Masum, Mehrab Kazi Sajeed, Ahmad Wasi Uddin, Shahriyar Rifat
- ENCONTER: Entity Constrained Progressive Sequence Generation Via Insertion-based Transformer Hsieh Lee-hsun, Lee Yang-yin, Lim Ee-peng
- Lora: Low-rank Adaptation Of Large Language Models Hu Edward J., Shen Yelong, Wallis Phillip, Allen-zhu Zeyuan, Li Yuanzhi, Wang Shean, Wang Lu, Chen Weizhu
- Non-autoregressive Translation With Layer-wise Prediction And Deep Supervision Huang Chenyang, Zhou Hao, Zaïane Osmar R., Mou Lili, Li Lei
- Finetuned Language Models Are Zero-shot Learners Jason Wei, Maarten Bosma, Vincent Y. Zhao, Kelvin Guu, Adams Wei Yu, Brian Lester, Nan Du, Andrew M. Dai, Quoc V. Le
- Improving Non-autoregressive Generation With Mixup Training Jiang Ting, Huang Shaohan, Zhang Zihan, Wang Deqing, Zhuang Fuzhen, Wei Furu, Huang Haizhen, Zhang Liangjie, Zhang Qi
- Numgpt: Improving Numeracy Ability Of Generative Pre-trained Models Jin Zhihua, Jiang Xin, Wang Xingbo, Liu Qun, Wang Yong, Ren Xiaozhe, Qu Huamin
- AMMUS: A Survey Of Transformer-based Pretrained Models In Natural Language Processing Kalyan Katikapalli Subramanyam, Rajasekharan Ajit, Sangeetha Sivanesan
- Amazon Sagemaker Model Parallelism: A General And Flexible Framework For Large Model Training Karakus Can, Huilgol Rahul, Wu Fei, Subramanian Anirudh, Daniel Cade, Cavdar Derya, Xu Teng, Chen Haohan, Rahnama Arash, Quintela Luis
- Finetuning Pretrained Transformers Into Rnns Kasai Jungo, Peng Hao, Zhang Yizhe, Yogatama Dani, Ilharco Gabriel, Pappas Nikolaos, Mao Yi, Chen Weizhu, Smith Noah A.
- Text-free Prosody-aware Generative Spoken Language Modeling Kharitonov Eugene, Lee Ann, Polyak Adam, Adi Yossi, Copet Jade, Lakhotia Kushal, Nguyen Tu-anh, Rivière Morgane, Mohamed Abdelrahman, Dupoux Emmanuel, Hsu Wei-ning
- What's In A Measurement? Using GPT-3 On Semeval 2021 Task 8 -- Measeval Kohler Curt, Daniel Ron Jr
- Improving Scheduled Sampling With Elastic Weight Consolidation For Neural Machine Translation Korakakis Michalis, Vlachos Andreas
- Controlling Conditional Language Models Without Catastrophic Forgetting Korbak Tomasz, Elsahar Hady, Kruszewski German, Dymetman Marc
- BERT Busters: Outlier Dimensions That Disrupt Transformers Kovaleva Olga, Kulshreshtha Saurabh, Rogers Anna, Rumshisky Anna
- Pagnol: An Extra-large French Generative Model Launay Julien, Tommasone Elena, Pannier Baptiste, Boniface François, Chatelain Amélie, Cappelli Alessandro, Poli Iacopo, Seddah Djamé
- The Stability-efficiency Dilemma: Investigating Sequence Length Warmup For Training GPT Models Li Conglong, Zhang Minjia, He Yuxiong
- A Short Study On Compressing Decoder-based Language Models Li Tianda, Mesbahi Yassir El, Kobyzev Ivan, Rashid Ahmad, Mahmud Atif, Anchuri Nithin, Hajimolahoseini Habib, Liu Yang, Rezagholizadeh Mehdi
- Terapipe: Token-level Pipeline Parallelism For Training Large-scale Language Models Li Zhuohan, Zhuang Siyuan, Guo Shiyuan, Zhuo Danyang, Zhang Hao, Song Dawn, Stoica Ion
- Lyra: A Benchmark For Turducken-style Code Generation Liang Qingyuan, Sun Zeyu, Zhu Qihao, Zhang Wenjie, Yu Lian, Xiong Yingfei, Zhang Lu
- Truthfulqa: Measuring How Models Mimic Human Falsehoods Lin Stephanie, Hilton Jacob, Evans Owain
- Few-shot Learning With Multilingual Language Models Lin Xi Victoria, Mihaylov Todor, Artetxe Mikel, Wang Tianlu, Chen Shuohui, Simig Daniel, Ott Myle, Goyal Naman, Bhosale Shruti, Du Jingfei, Pasunuru Ramakanth, Shleifer Sam, Koura Punit Singh, Chaudhary Vishrav, O'horo Brian, Wang Jeff, Zettlemoyer Luke, Kozareva Zornitsa, Diab Mona, Stoyanov Veselin, Li Xian
- VX2TEXT: End-to-end Learning Of Video-based Text Generation From Multimodal Inputs Lin Xudong, Bertasius Gedas, Wang Jue, Chang Shih-fu, Parikh Devi, Torresani Lorenzo
- Dexperts: Decoding-time Controlled Text Generation With Experts And Anti-experts Liu Alisa, Sap Maarten, Lu Ximing, Swayamdipta Swabha, Bhagavatula Chandra, Smith Noah A., Choi Yejin
- Variational Latent-state GPT For Semi-supervised Task-oriented Dialog Systems Liu Hong, Cai Yucheng, Lin Zhenru, Ou Zhijian, Huang Yi, Feng Junlan
- What Makes Good In-context Examples For GPT-3? Liu Jiachang, Shen Dinghan, Zhang Yizhe, Dolan Bill, Carin Lawrence, Chen Weizhu
- Unified Multimodal Pre-training And Prompt-based Tuning For Vision-language Understanding And Generation Liu Tianyi, Wu Zuxuan, Xiong Wenhan, Chen Jingjing, Jiang Yu-gang
- A Token-level Reference-free Hallucination Detection Benchmark For Free-form Text Generation Liu Tianyu, Zhang Yizhe, Brockett Chris, Mao Yi, Sui Zhifang, Chen Weizhu, Dolan Bill
- GPT Understands, Too Liu Xiao, Zheng Yanan, Du Zhengxiao, Ding Ming, Qian Yujie, Yang Zhilin, Tang Jie
- Enriching Non-autoregressive Transformer With Syntactic And Semantic Structures For Neural Machine Translation Liu Ye, Wan Yao, Zhang Jian-guo, Zhao Wenting, Yu Philip S.
- Fantastically Ordered Prompts And Where To Find Them: Overcoming Few-shot Prompt Order Sensitivity Lu Yao, Bartolo Max, Moore Alastair, Riedel Sebastian, Stenetorp Pontus
- Few-shot Self-rationalization With Natural Language Prompts Marasović Ana, Beltagy Iz, Downey Doug, Peters Matthew E.
- Evaluating Large Language Models Trained On Code Mark Chen, Jerry Tworek, Heewoo Jun, Qiming Yuan, Henrique Ponde De Oliveira Pinto, Jared Kaplan, Harri Edwards, Yuri Burda, Nicholas Joseph, Greg Brockman, Alex Ray, Raul Puri, Gretchen Krueger, Michael Petrov, Heidy Khlaaf, Girish Sastry, Pamela Mishkin, Brooke Chan, Scott Gray, Nick Ryder, Mikhail Pavlov, Alethea Power, Lukasz Kaiser, Mohammad Bavarian, Clemens Winter, Philippe Tillet, Felipe Petroski Such, Dave Cummings, Matthias Plappert, Fotios Chantzis, Elizabeth Barnes, Ariel Herbert-voss, William Hebgen Guss, Alex Nichol, Alex Paino, Nikolas Tezak, Jie Tang, Igor Babuschkin, Suchir Balaji, Shantanu Jain, William Saunders, Christopher Hesse, Andrew N. Carr, Jan Leike, Josh Achiam, Vedant Misra, Evan Morikawa, Alec Radford, Matthew Knight, Miles Brundage, Mira Murati, Katie Mayer, Peter Welinder, Bob Mcgrew, Dario Amodei, Sam Mccandlish, Ilya Sutskever, Wojciech Zaremba
- How Much Do Language Models Copy From Their Training Data? Evaluating Linguistic Novelty In Text Generation Using RAVEN Mccoy R. Thomas, Smolensky Paul, Linzen Tal, Gao Jianfeng, Celikyilmaz Asli
- Language Models Are Few-shot Butlers Micheli Vincent, Fleuret François
- GPT-3 Models Are Poor Few-shot Learners In The Biomedical Domain Moradi Milad, Blagec Kathrin, Haberl Florian, Samwald Matthias
- Glam: Efficient Scaling Of Language Models With Mixture-of-experts Nan Du, Yanping Huang, Andrew M. Dai, Simon Tong, Dmitry Lepikhin, Yuanzhong Xu, Maxim Krikun, Yanqi Zhou, Adams Wei Yu, Orhan Firat, Barret Zoph, Liam Fedus, Maarten Bosma, Zongwei Zhou, Tao Wang, Yu Emma Wang, Kellie Webster, Marie Pellat, Kevin Robinson, Kathleen Meier-hellstern, Toju Duke, Lucas Dixon, Kun Zhang, Quoc V Le, Yonghui Wu, Zhifeng Chen, Claire Cui
- Hierarchical Transformers Are More Efficient Language Models Nawrot Piotr, Tworkowski Szymon, Tyrolski Michał, Kaiser Łukasz, Wu Yuhuai, Szegedy Christian, Michalewski Henryk
- TIMEDIAL: Temporal Commonsense Reasoning In Dialog Qin Lianhui, Gupta Aditya, Upadhyay Shyam, He Luheng, Choi Yejin, Faruqui Manaal
- Teaching Autoregressive Language Models Complex Tasks By Demonstration Recchia Gabriel
- Webgpt: Browser-assisted Question-answering With Human Feedback Reiichiro Nakano, Jacob Hilton, Suchir Balaji, Jeff Wu, Long Ouyang, Christina Kim, Christopher Hesse, Shantanu Jain, Vineet Kosaraju, William Saunders, Xu Jiang, Karl Cobbe, Tyna Eloundou, Gretchen Krueger, Kevin Button, Matthew Knight, Benjamin Chess, John Schulman
- Multiversal Views On Language Models Reynolds Laria, Mcdonell Kyle
- Prompt Programming For Large Language Models: Beyond The Few-shot Paradigm Reynolds Laria, Mcdonell Kyle
- Leveraging The Inductive Bias Of Large Language Models For Abstract Textual Reasoning Rytting Christopher Michael, Wingate David
- Representation Learning For Conversational Data Using Discourse Mutual Information Maximization Santra Bishal, Roychowdhury Sumegh, Mandal Aishik, Gurram Vasu, Naik Atharva, Gupta Manish, Goyal Pawan
- Large Pre-trained Language Models Contain Human-like Biases Of What Is Right And Wrong To Do Schramowski Patrick, Turan Cigdem, Andersen Nico, Rothkopf Constantin A., Kersting Kristian
- The Power Of Prompt Tuning For Low-resource Semantic Parsing Schucher Nathan, Reddy Siva, De Vries Harm
- Scaling Law For Recommendation Models: Towards General-purpose User Representations Shin Kyuyong, Kwak Hanock, Kim Su Young, Ramstrom Max Nihlen, Jeong Jisu, Ha Jung-woo, Kim Kyung-min
- Few-shot Semantic Parsing With Language Models Trained On Code Shin Richard, Van Durme Benjamin
- ERNIE 3.0 Titan: Exploring Larger-scale Knowledge Enhanced Pre-training For Language Understanding And Generation Shuohuan Wang, Yu Sun, Yang Xiang, Zhihua Wu, Siyu Ding, Weibao Gong, Shikun Feng, Junyuan Shang, Yanbin Zhao, Chao Pang, Jiaxiang Liu, Xuyi Chen, Yuxiang Lu, Weixin Liu, Xi Wang, Yangfan Bai, Qiuliang Chen, Li Zhao, Shiyong Li, Peng Sun, Dianhai Yu, Yanjun Ma, Hao Tian, Hua Wu, Tian Wu, Wei Zeng, Ge Li, Wen Gao, Haifeng Wang
- Primer: Searching For Efficient Transformers For Language Modeling So David R., Mańke Wojciech, Liu Hanxiao, Dai Zihang, Shazeer Noam, Le Quoc V.
- Corelm: Coreference-aware Language Model Fine-tuning Stylianou Nikolaos, Vlahavas Ioannis
- Non-autoregressive Text Generation With Pre-trained Language Models Su Yixuan, Cai Deng, Wang Yan, Vandyke David, Baker Simon, Li Piji, Collier Nigel
- NSP-BERT: A Prompt-based Few-shot Learner Through An Original Pre-training Task--next Sentence Prediction Sun Yi, Zheng Yu, Hao Chao, Qiu Hangping
- ERNIE 3.0: Large-scale Knowledge Enhanced Pre-training For Language Understanding And Generation Sun Yu, Wang Shuohuan, Feng Shikun, Ding Siyu, Pang Chao, Shang Junyuan, Liu Jiaxiang, Chen Xuyi, Zhao Yanbin, Lu Yuxiang, Liu Weixin, Wu Zhihua, Gong Weibao, Liang Jianzhong, Shang Zhizhou, Sun Peng, Liu Wei, Ouyang Xuan, Yu Dianhai, Tian Hao, Wu Hua, Wang Haifeng
- Multilingual Translation Via Grafting Pre-trained Language Models Sun Zewei, Wang Mingxuan, Li Lei
- General-purpose Question-answering With Macaw Tafjord Oyvind, Clark Peter
- Understanding The Capabilities, Limitations, And Societal Impact Of Large Language Models Tamkin Alex, Brundage Miles, Clark Jack, Ganguli Deep
- Towards Reinforcement Learning For Pivot-based Neural Machine Translation With Non-autoregressive Transformer Tokarchuk Evgeniia, Rosendahl Jan, Wang Weiyue, Petrushkov Pavel, Lancewicki Tomer, Khadivi Shahram, Ney Hermann
- Exploring Transformers In Natural Language Generation: GPT, BERT, And Xlnet Topal M. Onat, Bas Anil, Van Heerden Imke
- Topicrefine: Joint Topic Prediction And Dialogue Response Generation For Multi-turn End-to-end Dialogue System Wang Hongru, Cui Mingyu, Zhou Zimo, Fung Gabriel Pui Cheong, Wong Kam-fai
- Entailment As Few-shot Learner Wang Sinong, Fang Han, Khabsa Madian, Mao Hanzi, Ma Hao
- Sentence Semantic Regression For Text Generation Wang Wei, Li Piji, Zheng Hai-tao
- Lightseq2: Accelerated Training For Transformer-based Models On Gpus Wang Xiaohui, Wei Yang, Xiong Ying, Huang Guyue, Qian Xian, Ding Yufei, Wang Mingxuan, Li Lei
- List: Lite Prompted Self-training Makes Parameter-efficient Few-shot Learners Wang Yaqing, Mukherjee Subhabrata, Liu Xiaodong, Gao Jing, Awadallah Ahmed Hassan, Gao Jianfeng
- Codet5: Identifier-aware Unified Pre-trained Encoder-decoder Models For Code Understanding And Generation Wang Yue, Wang Weishi, Joty Shafiq, Hoi Steven C. H.
- Pangu-α: Large-scale Autoregressive Pretrained Chinese Language Models With Auto-parallel Computation Wei Zeng, Xiaozhe Ren, Teng Su, Hui Wang, Yi Liao, Zhiwei Wang, Xin Jiang, Zhenzhang Yang, Kaisheng Wang, Xiaoda Zhang, Chen Li, Ziyan Gong, Yifan Yao, Xinjing Huang, Jun Wang, Jianfeng Yu, Qi Guo, Yue Yu, Yan Zhang, Jin Wang, Hengtao Tao, Dasen Yan, Zexuan Yi, Fang Peng, Fangqing Jiang, Han Zhang, Lingfeng Deng, Yehong Zhang, Zhe Lin, Chao Zhang, Shaojie Zhang, Mingyue Guo, Shanzhi Gu, Gaojun Fan, Yaowei Wang, Xuefeng Jin, Qun Liu, Yonghong Tian
- Reframing Human-ai Collaboration For Generating Free-text Explanations Wiegreffe Sarah, Hessel Jack, Swayamdipta Swabha, Riedl Mark, Choi Yejin
- Language Models Are Few-shot Multilingual Learners Winata Genta Indra, Madotto Andrea, Lin Zhaojiang, Liu Rosanne, Yosinski Jason, Fung Pascale
- Yuan 1.0: Large-scale Pre-trained Language Model In Zero-shot And Few-shot Learning Wu Shaohua, Zhao Xudong, Yu Tong, Zhang Rongguo, Shen Chong, Liu Hongli, Li Feng, Zhu Hong, Luo Jiangang, Xu Liang, Zhang Xuanwei
- Prefix-tuning: Optimizing Continuous Prompts For Generation Xiang Lisa Li, Percy Liang
- An Explanation Of In-context Learning As Implicit Bayesian Inference Xie Sang Michael, Raghunathan Aditi, Liang Percy, Ma Tengyu
- Exploring Story Generation With Multi-task Objectives In Variational Autoencoders Xie Zhuohan, Cohn Trevor, Lau Jey Han
- Fastseq: Make Sequence Generation Faster Yan Yu, Hu Fei, Chen Jiusheng, Bhendawade Nikhil, Ye Ting, Gong Yeyun, Duan Nan, Cui Desheng, Chi Bingyu, Zhang Ruofei
- Transformer-based Korean Pretrained Language Models: A Survey On Three Years Of Progress Yang Kichang
- An Empirical Study Of GPT-3 For Few-shot Knowledge-based VQA Yang Zhengyuan, Gan Zhe, Wang Jianfeng, Hu Xiaowei, Lu Yumao, Liu Zicheng, Wang Lijuan
- Gpt3mix: Leveraging Large-scale Language Models For Text Augmentation Yoo Kang Min, Park Dongju, Kang Jaewook, Lee Sang-woo, Park Woomyeong
- Reconsidering The Past: Optimizing Hidden States In Language Models Yoshida Davis, Gimpel Kevin
- Ernie-vilg: Unified Generative Pre-training For Bidirectional Vision-language Generation Zhang Han, Yin Weichong, Fang Yewei, Li Lanxin, Duan Boqiang, Wu Zhihua, Sun Yu, Tian Hao, Wu Hua, Wang Haifeng
- DSGPT: Domain-specific Generative Pre-training Of Transformers For Text Generation In E-commerce Title And Review Summarization Zhang Xueying, Jiang Yunjiang, Shang Yue, Cheng Zhaomeng, Zhang Chi, Fan Xiaochuan, Xiao Yun, Long Bo
- Retgen: A Joint Framework For Retrieval And Grounded Text Generation Modeling Zhang Yizhe, Sun Siqi, Gao Xiang, Fang Yuwei, Brockett Chris, Galley Michel, Gao Jianfeng, Dolan Bill
- Calibrate Before Use: Improving Few-shot Performance Of Language Models Zhao Tony Z., Wallace Eric, Feng Shi, Klein Dan, Singh Sameer
- Adapting Language Models For Zero-shot Learning By Meta-tuning On Dataset And Prompt Collections Zhong Ruiqi, Lee Kristy, Zhang Zheng, Klein Dan
- Thinking Clearly, Talking Fast: Concept-guided Non-autoregressive Generation For Open-domain Dialogue Systems Zou Yicheng, Liu Zhihua, Hu Xingwu, Zhang Qi
- GPT-3-driven Pedagogical Agents For Training Children's Curious Question-asking Skills Abdelghani Rania, Wang Yen-hsiang, Yuan Xingdi, Wang Tong, Lucas Pauline, Sauzéon Hélène, Oudeyer Pierre-yves
- Vector Representations Of Idioms In Conversational Systems Adewumi Tosin, Liwicki Foteini, Liwicki Marcus
- Using Large Language Models To Simulate Multiple Humans And Replicate Human Subject Studies Aher Gati, Arriaga Rosa I., Kalai Adam Tauman
- Few-shot Training Llms For Project-specific Code-summarization Ahmed Toufique, Devanbu Premkumar
- Prompting As Probing: Using Language Models For Knowledge Base Construction Alivanistos Dimitrios, Santamaría Selene Báez, Cochez Michael, Kalo Jan-christoph, Van Krieken Emile, Thanapalasingam Thiviyan
- Text And Patterns: For Effective Chain Of Thought, It Takes Two To Tango Aman Madaan, Amir Yazdanbakhsh
- Language Models Of Code Are Few-shot Commonsense Learners Aman Madaan, Shuyan Zhou, Uri Alon, Yiming Yang, Graham Neubig
- A Disability Lens Towards Biases In GPT-3 Generated Open-ended Languages Amin Akhter Al, Kabir Kazi Sinthia
- Tempo: Accelerating Transformer-based Model Training Through Memory Footprint Reduction Andoorveedu Muralidhar, Zhu Zhanda, Zheng Bojian, Pekhimenko Gennady
- A Comparative Study On Language Models For Task-oriented Dialogue Systems Andreas Vinsen Marselino, Winata Genta Indra, Purwarianti Ayu
- GLM-130B: An Open Bilingual Pre-trained Model Aohan Zeng, Xiao Liu, Zhengxiao Du, Zihan Wang, Hanyu Lai, Ming Ding, Zhuoyi Yang, Yifan Xu, Wendi Zheng, Xiao Xia, Weng Lam Tam, Zixuan Ma, Yufei Xue, Jidong Zhai, Wenguang Chen, Peng Zhang, Yuxiao Dong, Jie Tang
- Improving Short Text Classification With Augmented Data Using GPT-3 Balkus Salvador, Yan Donghui
- P³LM: Probabilistically Permuted Prophet Language Modeling For Generative Pre-training Bao Junwei, Wang Yifan, Ying Jiangyong, Gong Yeyun, Zhao Jing, Wu Youzheng, He Xiaodong
- Latent-glat: Glancing At Latent Variables For Parallel Text Generation Bao Yu, Zhou Hao, Huang Shujian, Wang Dongqi, Qian Lihua, Dai Xinyu, Chen Jiajun, Li Lei
- Leveraging Pre-trained Language Models For Conversational Information Seeking From Text Bellan Patrizio, Dragoni Mauro, Ghidini Chiara
- Bygpt5: End-to-end Style-conditioned Poetry Generation With Token-free Language Models Belouadi Jonas, Eger Steffen
- I2D2: Inductive Knowledge Distillation With Neurologic And Self-imitation Bhagavatula Chandra, Hwang Jena D., Downey Doug, Bras Ronan Le, Lu Ximing, Qin Lianhui, Sakaguchi Keisuke, Swayamdipta Swabha, West Peter, Choi Yejin
- Using Cognitive Psychology To Understand GPT-3 Binz Marcel, Schulz Eric
- Prompting Language Models For Linguistic Structure Blevins Terra, Gonen Hila, Zettlemoyer Luke
- Looking For A Handsome Carpenter! Debiasing GPT-3 Job Advertisements Borchers Conrad, Gala Dalia Sara, Gilburt Benjamin, Oravkin Eduard, Bounsi Wilfried, Asano Yuki M., Kirk Hannah Rose
- Evaluating The Susceptibility Of Pre-trained Language Models Via Handcrafted Adversarial Examples Branch Hezekiah J., Cefalu Jonathan Rodriguez, Mchugh Jeremy, Hujer Leyla, Bahl Aditya, Iglesias Daniel Del Castillo, Heichman Ron, Darwishi Ramesh
- A Model-agnostic Data Manipulation Method For Persona-based Dialogue Generation Cao Yu, Bi Wei, Fang Meng, Shi Shuming, Tao Dacheng
- Analyzing Semantic Faithfulness Of Language Models Via Input Intervention On Question Answering Chaturvedi Akshay, Bhar Swarnadeep, Saha Soumadeep, Garain Utpal, Asher Nicholas
- Controllable Text Generation With Language Constraints Chen Howard, Li Huihan, Chen Danqi, Narasimhan Karthik
- Transformers Go For The Lols: Generating (humourous) Titles From Scientific Abstracts End-to-end Chen Yanran, Eger Steffen
- Capture Salient Historical Information: A Fast And Accurate Non-autoregressive Model For Multi-turn Spoken Language Understanding Cheng Lizhi, Jia Weijia, Yang Wenmian
- Prompting GPT-3 To Be Reliable Chenglei Si, Zhe Gan, Zhengyuan Yang, Shuohang Wang, Jianfeng Wang, Jordan Boyd-graber, Lijuan Wang
- On Reality And The Limits Of Language Data: Aligning Llms With Human Norms Collier Nigel H., Liu Fangyu, Shareghi Ehsan
- M6-rec: Generative Pretrained Language Models Are Open-ended Recommender Systems Cui Zeyu, Ma Jianxin, Zhou Chang, Zhou Jingren, Yang Hongxia
- Scientific And Creative Analogies In Pretrained Language Models Czinczoll Tamara, Yannakoudakis Helen, Mishra Pushkar, Shutova Ekaterina
- Why Can GPT Learn In-context? Language Models Implicitly Perform Gradient Descent As Meta-optimizers Dai Damai, Sun Yutao, Dong Li, Hao Yaru, Ma Shuming, Sui Zhifang, Wei Furu
- Understanding BLOOM: An Empirical Study On Diverse NLP Tasks Dakle Parag Pravin, Rallabandi Saikrishna, Raghavan Preethi
- Rlprompt: Optimizing Discrete Text Prompts With Reinforcement Learning Deng Mingkai, Wang Jianyu, Hsieh Cheng-ping, Wang Yihan, Guo Han, Shu Tianmin, Song Meng, Xing Eric P., Hu Zhiting
- Least-to-most Prompting Enables Complex Reasoning In Large Language Models Denny Zhou, Nathanael Schärli, Le Hou, Jason Wei, Nathan Scales, Xuezhi Wang, Dale Schuurmans, Claire Cui, Olivier Bousquet, Quoc Le, Ed Chi
- Black-box Prompt Learning For Pre-trained Language Models Diao Shizhe, Huang Zhichao, Xu Ruijia, Li Xuechun, Lin Yong, Zhou Xiao, Zhang Tong
- Is GPT-3 A Good Data Annotator? Ding Bosheng, Qin Chengwei, Liu Linlin, Chia Yew Ken, Joty Shafiq, Li Boyang, Bing Lidong
- Parallel Attention Forcing For Machine Translation Dou Qingyun, Gales Mark
- Probing For Incremental Parse States In Autoregressive Language Models Eisape Tiwalayo, Gangireddy Vineet, Levy Roger P., Kim Yoon
- Leveraging Pre-trained Models For Failure Analysis Triplets Generation Ezukwoke Kenneth, Hoayek Anis, Batton-hubert Mireille, Boucher Xavier, Gounet Pascal, Adrian Jerome
- Progressive Text-to-image Generation Fei Zhengcong, Fan Mingyuan, Zhu Li, Huang Junshi
- Llmeffichecker: Understanding And Testing Efficiency Degradation Of Large Language Models Feng Xiaoning, Han Xiaohong, Chen Simin, Yang Wei
- GPTQ: Accurate Post-training Quantization For Generative Pre-trained Transformers Frantar Elias, Ashkboos Saleh, Hoefler Torsten, Alistarh Dan
- Complexity-based Prompting For Multi-step Reasoning Fu Yao, Peng Hao, Sabharwal Ashish, Clark Peter, Khot Tushar
- Normsage: Multi-lingual Multi-cultural Norm Discovery From Conversations On-the-fly Fung Yi R., Chakraborty Tuhin, Guo Hao, Rambow Owen, Muresan Smaranda, Ji Heng
- Re2g: Retrieve, Rerank, Generate Glass Michael, Rossiello Gaetano, Chowdhury Md Faisal Mahbub, Naik Ankita Rajaram, Cai Pengshan, Gliozzo Alfio
- Diffuseq: Sequence To Sequence Text Generation With Diffusion Models Gong Shansan, Li Mukai, Feng Jiangtao, Wu Zhiyong, Kong Lingpeng
- "John Is 50 Years Old, Can His Son Be 65?" Evaluating NLP Models' Understanding Of Feasibility Gupta Himanshu, Varshney Neeraj, Mishra Swaroop, Pal Kuntal Kumar, Sawant Saurabh Arjun, Scaria Kevin, Goyal Siddharth, Baral Chitta
- Thinking About GPT-3 In-context Learning For Biomedical IE? Think Again Gutiérrez Bernal Jiménez, Mcneal Nikolas, Washington Clay, Chen You, Li Lang, Sun Huan, Su Yu
- Thinking Fast And Slow In Large Language Models Hagendorff Thilo, Fabi Sarah, Kosinski Michal
- FOLIO: Natural Language Reasoning With First-order Logic Han Simeng, Schoelkopf Hailey, Zhao Yilun, Qi Zhenting, Riddell Martin, Zhou Wenfei, Coady James, Peng David, Qiao Yujie, Benson Luke, Sun Lucy, Wardle-solano Alex, Szabo Hannah, Zubova Ekaterina, Burtell Matthew, Fan Jonathan, Liu Yixin, Wong Brian, Sailor Malcolm, Ni Ansong, Nan Linyong, Kasai Jungo, Yu Tao, Zhang Rui, Fabbri Alexander R., Kryscinski Wojciech, Yavuz Semih, Liu Ye, Lin Xi Victoria, Joty Shafiq, Zhou Yingbo, Xiong Caiming, Ying Rex, Cohan Arman, Radev Dragomir
- SSD-LM: Semi-autoregressive Simplex-based Diffusion Language Model For Text Generation And Modular Control Han Xiaochuang, Kumar Sachin, Tsvetkov Yulia
- Aging With GRACE: Lifelong Model Editing With Discrete Key-value Adaptors Hartvigsen Thomas, Sankaranarayanan Swami, Palangi Hamid, Kim Yoon, Ghassemi Marzyeh
- Transformer Language Models Without Positional Encodings Still Learn Positional Information Haviv Adi, Ram Ori, Press Ofir, Izsak Peter, Levy Omer
- Rethinking With Retrieval: Faithful Large Language Model Inference He Hangfeng, Zhang Hongming, Roth Dan
- Z-code++: A Pre-trained Language Model Optimized For Abstractive Summarization He Pengcheng, Peng Baolin, Lu Liyang, Wang Song, Mei Jie, Liu Yang, Xu Ruochen, Awadalla Hany Hassan, Shi Yu, Zhu Chenguang, Xiong Wayne, Zeng Michael, Gao Jianfeng, Huang Xuedong
- Demystifying Prompts In Language Models Via Perplexity Estimation Hila Gonen, Srini Iyer, Terra Blevins, Noah A. Smith, Luke Zettlemoyer
- Wikiwhy: Answering And Explaining Cause-and-effect Questions Ho Matthew, Sharma Aditya, Chang Justin, Saxon Michael, Levy Sharon, Lu Yujie, Wang William Yang
- Cogvideo: Large-scale Pretraining For Text-to-video Generation Via Transformers Hong Wenyi, Ding Ming, Zheng Wendi, Liu Xinghan, Tang Jie
- Instruction Induction: From Few Examples To Natural Language Task Descriptions Honovich Or, Shaham Uri, Bowman Samuel R., Levy Omer
- Promptcap: Prompt-guided Task-aware Image Captioning Hu Yushi, Hua Hang, Yang Zhengyuan, Shi Weijia, Smith Noah A, Luo Jiebo
- PLANET: Dynamic Content Planning In Autoregressive Transformers For Long-form Text Generation Hu Zhe, Chan Hou Pong, Liu Jiachen, Xiao Xinyan, Wu Hua, Huang Lifu
- Autoregressive Entity Generation For End-to-end Task-oriented Dialog Huang Guanhuan, Quan Xiaojun, Wang Qifan
- From Human Days To Machine Seconds: Automatically Answering And Generating Machine Learning Final Exams Iddo Drori, Sarah J. Zhang, Reece Shuttleworth, Sarah Zhang, Keith Tyser, Zad Chin, Pedro Lantigua, Saisamrit Surbehera, Gregory Hunter, Derek Austin, Leonard Tang, Yann Hicke, Sage Simhon, Sathwik Karnik, Darnell Granberry, Madeleine Udell
- Uniform Complexity For Text Generation Imperial Joseph Marvin, Madabushi Harish Tayyar
- Can Large Language Models Truly Understand Prompts? A Case Study With Negated Prompts Jang Joel, Ye Seonghyeon, Seo Minjoon
- Chain-of-thought Prompting Elicits Reasoning In Large Language Models Jason Wei, Xuezhi Wang, Dale Schuurmans, Maarten Bosma, Brian Ichter, Fei Xia, Ed Chi, Quoc Le, Denny Zhou
- Litetransformersearch: Training-free Neural Architecture Search For Efficient Language Models Javaheripi Mojan, De Rosa Gustavo H., Mukherjee Subhabrata, Shah Shital, Religa Tomasz L., Mendes Caio C. T., Bubeck Sebastien, Koushanfar Farinaz, Dey Debadeepta
- Chatgpt Makes Medicine Easy To Swallow: An Exploratory Case Study On Simplified Radiology Reports Jeblick Katharina, Schachtner Balthasar, Dexl Jakob, Mittermeier Andreas, Stüber Anna Theresa, Topalis Johanna, Weber Tobias, Wesp Philipp, Sabel Bastian, Ricke Jens, Ingrisch Michael
- Contrastive Learning For Prompt-based Few-shot Language Learners Jian Yiren, Gao Chongyang, Vosoughi Soroush
- Pairreranker: Pairwise Reranking For Natural Language Generation Jiang Dongfu, Lin Bill Yuchen, Ren Xiang
- A Simple Contrastive Learning Objective For Alleviating Neural Text Degeneration Jiang Shaojie, Zhang Ruqing, Vakulenko Svitlana, De Rijke Maarten
- Training Compute-optimal Large Language Models Jordan Hoffmann, Sebastian Borgeaud, Arthur Mensch, Elena Buchatskaya, Trevor Cai, Eliza Rutherford, Diego De Las Casas, Lisa Anne Hendricks, Johannes Welbl, Aidan Clark, Tom Hennigan, Eric Noland, Katie Millican, George Van Den Driessche, Bogdan Damoc, Aurelia Guy, Simon Osindero, Karen Simonyan, Erich Elsen, Jack W. Rae, Oriol Vinyals, Laurent Sifre
- Self-prompting Large Language Models For Zero-shot Open-domain QA Junlong Li, Jinyuan Wang, Zhuosheng Zhang, Hai Zhao
- Latency Adjustable Transformer Encoder For Language Understanding Kachuee Sajjad, Sharifkhani Mohammad
- Action-gpt: Leveraging Large-scale Language Models For Improved And Generalized Action Generation Kalakonda Sai Shashank, Maheshwari Shubh, Sarvadevabhatla Ravi Kiran
- Gpt-neo For Commonsense Reasoning -- A Theoretical And Practical Lens Kashyap Rohan, Kashyap Vivek, P. Narendra C.
- Decomposed Prompting: A Modular Approach For Solving Complex Tasks Khot Tushar, Trivedi Harsh, Finlayson Matthew, Fu Yao, Richardson Kyle, Clark Peter, Sabharwal Ashish
- Towards The Generation Of Musical Explanations With GPT-3 Krol Stephen James, Llano Maria Teresa, Mccormack Jon
- Co-training Improves Prompt-based Learning For Large Language Models Lang Hunter, Agrawal Monica, Kim Yoon, Sontag David
- Coauthor: Designing A Human-ai Collaborative Writing Dataset For Exploring Language Model Capabilities Lee Mina, Liang Percy, Yang Qian
- Evaluating Human-language Model Interaction Lee Mina, Srivastava Megha, Hardy Amelia, Thickstun John, Durmus Esin, Paranjape Ashwin, Gerard-ursin Ines, Li Xiang Lisa, Ladhak Faisal, Rong Frieda, Wang Rose E., Kwon Minae, Park Joon Sung, Cao Hancheng, Lee Tony, Bommasani Rishi, Bernstein Michael, Liang Percy
- Dialogcc: An Automated Pipeline For Creating High-quality Multi-modal Dialogue Dataset Lee Young-jun, Ko Byungsoo, Kim Han-gyu, Hyeon Jonghwan, Choi Ho-jin
- Fast Inference From Transformers Via Speculative Decoding Leviathan Yaniv, Kalman Matan, Matias Yossi
- GPT-D: Inducing Dementia-related Linguistic Anomalies By Deliberate Degradation Of Artificial Neural Language Models Li Changye, Knopman David, Xu Weizhe, Cohen Trevor, Pakhomov Serguei
- Counterfactual Reasoning: Do Language Models Need World Knowledge For Causal Understanding? Li Jiaxuan, Yu Lang, Ettinger Allyson
- ELMER: A Non-autoregressive Pre-trained Language Model For Efficient And Effective Text Generation Li Junyi, Tang Tianyi, Zhao Wayne Xin, Nie Jian-yun, Wen Ji-rong
- Explanations From Large Language Models Make Small Reasoners Better Li Shiyang, Chen Jianshu, Shen Yelong, Chen Zhiyu, Zhang Xinlu, Li Zekun, Wang Hong, Qian Jing, Peng Baolin, Mao Yi, Chen Wenhu, Yan Xifeng
- Systematicity In GPT-3's Interpretation Of Novel English Noun Compounds Li Siyan, Carlson Riley, Potts Christopher
- Evaluating Psychological Safety Of Large Language Models Li Xingxuan, Li Yutong, Qiu Lin, Joty Shafiq, Bing Lidong
- VALHALLA: Visual Hallucination For Machine Translation Li Yi, Panda Rameswar, Kim Yoon, Chen Chun-fu, Feris Rogerio, Cox David, Vasconcelos Nuno
- Accelerating Attention Through Gradient-based Learned Runtime Pruning Li Zheng, Ghodrati Soroush, Yazdanbakhsh Amir, Esmaeilzadeh Hadi, Kang Mingu
- Life After BERT: What Do Other Muppets Understand About Language? Lialin Vladislav, Zhao Kevin, Shivagunde Namrata, Rumshisky Anna
- Using Large Language Models To Generate Engaging Captions For Data Visualizations Liew Ashley, Mueller Klaus
- Teaching Models To Express Their Uncertainty In Words Lin Stephanie, Hilton Jacob, Evans Owain
- Do Ever Larger Octopi Still Amplify Reporting Biases? Evidence From Judgments Of Typical Colour Liu Fangyu, Eisenschlos Julian Martin, Cole Jeremy R., Collier Nigel
- Towards Better Few-shot And Finetuning Performance With Forgetful Causal Language Models Liu Hao, Geng Xinyang, Lee Lisa, Mordatch Igor, Levine Sergey, Narang Sharan, Abbeel Pieter
- Instruction-following Agents With Multimodal Transformer Liu Hao, Lee Lisa, Lee Kimin, Abbeel Pieter
- Building Markovian Generative Architectures Over Pretrained LM Backbones For Efficient Task-oriented Dialog Systems Liu Hong, Cai Yucheng, Ou Zhijian, Huang Yi, Feng Junlan
- A Generative User Simulator With Gpt-based Architecture And Goal State Tracking For Reinforced Multi-domain Dialog Systems Liu Hong, Cai Yucheng, Ou Zhijian, Huang Yi, Feng Junlan
- Same Pre-training Loss, Better Downstream: Implicit Bias Matters For Language Models Liu Hong, Xie Sang Michael, Li Zhiyuan, Ma Tengyu
- Rainier: Reinforced Knowledge Introspector For Commonsense Question Answering Liu Jiacheng, Hallinan Skyler, Lu Ximing, He Pengfei, Welleck Sean, Hajishirzi Hannaneh, Choi Yejin
- Relational Memory Augmented Language Models Liu Qi, Yogatama Dani, Blunsom Phil
- 3DALL-E: Integrating Text-to-image AI In 3D Design Workflows Liu Vivian, Vermeulen Jo, Fitzmaurice George, Matejka Justin
- Mulzdg: Multilingual Code-switching Framework For Zero-shot Dialogue Generation Liu Yongkang, Feng Shi, Wang Daling, Zhang Yifei
- Can Large Language Models Reason About Medical Questions? Liévin Valentin, Hother Christoffer Egeberg, Motzfeldt Andreas Geert, Winther Ole
- Relaxed Attention For Transformer Models Lohrenz Timo, Möller Björn, Li Zhengyang, Fingscheidt Tim
- Training Language Models To Follow Instructions With Human Feedback Long Ouyang, Jeff Wu, Xu Jiang, Diogo Almeida, Carroll L. Wainwright, Pamela Mishkin, Chong Zhang, Sandhini Agarwal, Katarina Slama, Alex Ray, John Schulman, Jacob Hilton, Fraser Kelton, Luke Miller, Maddie Simens, Amanda Askell, Peter Welinder, Paul Christiano, Jan Leike, Ryan Lowe
- Learn To Explain: Multimodal Reasoning Via Thought Chains For Science Question Answering Lu Pan, Mishra Swaroop, Xia Tony, Qiu Liang, Chang Kai-wei, Zhu Song-chun, Tafjord Oyvind, Clark Peter, Kalyan Ashwin
- Biogpt: Generative Pre-trained Transformer For Biomedical Text Generation And Mining Luo Renqian, Sun Liai, Xia Yingce, Qin Tao, Zhang Sheng, Poon Hoifung, Liu Tie-yan
- I-tuning: Tuning Frozen Language Models With Image For Lightweight Image Captioning Luo Ziyang, Hu Zhipeng, Xi Yadong, Zhang Rongsheng, Ma Jing
- A Frustratingly Simple Approach For End-to-end Image Captioning Luo Ziyang, Xi Yadong, Zhang Rongsheng, Ma Jing
- Memory-assisted Prompt Editing To Improve GPT-3 After Deployment Madaan Aman, Tandon Niket, Clark Peter, Yang Yiming
- Structured Like A Language Model: Analysing AI As An Automated Subject Magee Liam, Arora Vanicka, Munn Luke
- Edit5: Semi-autoregressive Text-editing With T5 Warm-start Mallinson Jonathan, Adamek Jakub, Malmi Eric, Severyn Aliaksei
- Fine-tuning Pre-trained Transformers Into Decaying Fast Weights Mao Huanru Henry
- Chatbots In A Botnet World Mckee Forrest, Noever David
- LAD: Language Models As Data For Zero-shot Dialog Mehri Shikib, Altun Yasemin, Eskenazi Maxine
- Mass-editing Memory In A Transformer Meng Kevin, Sharma Arnab Sen, Andonian Alex, Belinkov Yonatan, Bau David
- Generating Training Data With Language Models: Towards Zero-shot Language Understanding Meng Yu, Huang Jiaxin, Zhang Yu, Han Jiawei
- Tuning Language Models As Training Data Generators For Augmentation-enhanced Few-shot Learning Meng Yu, Michalski Martin, Huang Jiaxin, Zhang Yu, Abdelzaher Tarek, Han Jiawei
- Rarely A Problem? Language Models Exhibit Inverse Scaling In Their Predictions Following Few-type Quantifiers Michaelov James A., Bergen Benjamin K.
- Rethinking The Role Of Demonstrations: What Makes In-context Learning Work? Min Sewon, Lyu Xinxi, Holtzman Ari, Artetxe Mikel, Lewis Mike, Hajishirzi Hannaneh, Zettlemoyer Luke
- Who Is GPT-3? An Exploration Of Personality, Values And Demographics Miotto Marilù, Rossberg Nicola, Kleinberg Bennett
- HELP ME THINK: A Simple Prompting Strategy For Non-experts To Create Customized Content With Models Mishra Swaroop, Nouri Elnaz
- SGPT: GPT Sentence Embeddings For Semantic Search Muennighoff Niklas
- JASMINE: Arabic GPT Models For Few-shot Learning Nagoudi El Moatez Billah, Abdul-mageed Muhammad, Elmadany Abdelrahim, Inciarte Alcides Alcoba, Khondaker Md Tawkat Islam
- Large Language Models Are Reasoning Teachers Namgyu Ho, Laura Schmid, Se-young Yun
- The Turing Deception Noever David, Ciolino Matt
- Measuring And Narrowing The Compositionality Gap In Language Models Ofir Press, Muru Zhang, Sewon Min, Ludwig Schmidt, Noah A. Smith, Mike Lewis
- Thinksum: Probabilistic Reasoning Over Sets Using Large Language Models Ozturkler Batu, Malkin Nikolay, Wang Zhen, Jojic Nebojsa
- Follow-up Attention: An Empirical Study Of Developer And Neural Model Code Exploration Paltenghi Matteo, Pandita Rahul, Henley Austin Z., Ziegler Albert
- Dynamic Prompt Learning Via Policy Gradient For Semi-structured Mathematical Reasoning Pan Lu, Liang Qiu, Kai-wei Chang, Ying Nian Wu, Song-chun Zhu, Tanmay Rajpurohit, Peter Clark, Ashwin Kalyan
- Bidirectional Language Models Are Also Few-shot Learners Patel Ajay, Li Bryan, Rasooli Mohammad Sadegh, Constant Noah, Raffel Colin, Callison-burch Chris
- Visconde: Multi-document QA With GPT-3 And Neural Reranking Pereira Jayr, Fidalgo Robson, Lotufo Roberto, Nogueira Rodrigo
- Ignore Previous Prompt: Attack Techniques For Language Models Perez Fábio, Ribeiro Ian
- Synchromesh: Reliable Code Generation From Pre-trained Language Models Poesia Gabriel, Polozov Oleksandr, Le Vu, Tiwari Ashish, Soares Gustavo, Meek Christopher, Gulwani Sumit
- Grips: Gradient-free, Edit-based Instruction Search For Prompting Large Language Models Prasad Archiki, Hase Peter, Zhou Xiang, Bansal Mohit
- Psychologically-informed Chain-of-thought Prompts For Metaphor Understanding In Large Language Models Prystawski Ben, Thibodeau Paul, Potts Christopher, Goodman Noah D.
- Tegformer: Topic-to-essay Generation With Good Topic Coverage And High Text Coherence Qi Wang, Liu Rui, Zuo Yuan, Chen Yong, Zhang Dell
- A Self-paced Mixed Distillation Method For Non-autoregressive Generation Qi Weizhen, Gong Yeyun, Shen Yelong, Jiao Jian, Yan Yu, Li Houqiang, Zhang Ruofei, Chen Weizhu, Duan Nan
- Diffusion Glancing Transformer For Parallel Sequence To Sequence Learning Qian Lihua, Wang Mingxuan, Liu Yang, Zhou Hao
- ELLE: Efficient Lifelong Pre-training For Emerging Data Qin Yujia, Zhang Jiajie, Lin Yankai, Liu Zhiyuan, Li Peng, Sun Maosong, Zhou Jie
- Leveraging Large Language Models For Multiple Choice Question Answering Robinson Joshua, Rytting Christopher Michael, Wingate David
- Billions Of Parameters Are Worth More Than In-domain Training Data: A Case Study In The Legal Case Entailment Task Rosa Guilherme Moraes, Bonifacio Luiz, Jeronymo Vitor, Abonizio Hugo, Lotufo Roberto, Nogueira Rodrigo
- Galactica: A Large Language Model For Science Ross Taylor, Marcin Kardas, Guillem Cucurull, Thomas Scialom, Anthony Hartshorn, Elvis Saravia, Andrew Poulton, Viktor Kerkez, Robert Stojnic
- Benchclamp: A Benchmark For Evaluating Language Models On Syntactic And Semantic Parsing Roy Subhro, Thomson Sam, Chen Tongfei, Shin Richard, Pauls Adam, Eisner Jason, Van Durme Benjamin
- Data Augmentation For Intent Classification With Off-the-shelf Large Language Models Sahu Gaurav, Rodriguez Pau, Laradji Issam H., Atighehchian Parmida, Vazquez David, Bahdanau Dzmitry
- Alexatm 20B: Few-shot Learning Using A Large-scale Multilingual Seq2seq Model Saleh Soltan, Shankar Ananthakrishnan, Jack Fitzgerald, Rahul Gupta, Wael Hamza, Haidar Khan, Charith Peris, Stephen Rawls, Andy Rosenbaum, Anna Rumshisky, Chandana Satya Prakash, Mukund Sridhar, Fabian Triefenbach, Apurv Verma, Gokhan Tur, Prem Natarajan
- NLX-GPT: A Model For Natural Language Explanations In Vision And Vision-language Tasks Sammani Fawaz, Mukherjee Tanmoy, Deligiannis Nikos
- Language Models Are Greedy Reasoners: A Systematic Formal Analysis Of Chain-of-thought Saparov Abulhair, He He
- Training Language Models With Language Feedback Scheurer Jérémy, Campos Jon Ander, Chan Jun Shern, Chen Angelica, Cho Kyunghyun, Perez Ethan
- Non-autoregressive Neural Machine Translation: A Call For Clarity Schmidt Robin M., Pires Telmo, Peitz Stephan, Lööf Jonas
- Large Language Models And The Reverse Turing Test Sejnowski Terrence
- On The Effect Of Pretraining Corpora On In-context Learning By A Large-scale Language Model Seongjin Shin, Sang-woo Lee, Hwijeen Ahn, Sungdong Kim, Hyoungseok Kim, Boseop Kim, Kyunghyun Cho, Gichang Lee, Woomyoung Park, Jung-woo Ha, Nako Sung
- Deanthropomorphising NLP: Can A Language Model Be Conscious? Shardlow Matthew, Przybyła Piotr
- Staged Training For Transformer Language Models Shen Sheng, Walsh Pete, Keutzer Kurt, Dodge Jesse, Peters Matthew, Beltagy Iz
- XRICL: Cross-lingual Retrieval-augmented In-context Learning For Cross-lingual Text-to-sql Semantic Parsing Shi Peng, Zhang Rui, Bai He, Lin Jimmy
- Mgpt: Few-shot Learners Go Multilingual Shliazhko Oleh, Fenogenova Alena, Tikhonova Maria, Mikhailov Vladislav, Kozlova Anastasia, Shavrina Tatiana
- Distilling Reasoning Capabilities Into Smaller Language Models Shridhar Kumar, Stolfo Alessandro, Sachan Mrinmaya
- Language Models That Seek For Knowledge: Modular Search & Generation For Dialogue And Prompt Completion Shuster Kurt, Komeili Mojtaba, Adolphs Leonard, Roller Stephen, Szlam Arthur, Weston Jason
- Gpt-neox-20b: An Open-source Autoregressive Language Model Sid Black, Stella Biderman, Eric Hallahan, Quentin Anthony, Leo Gao, Laurence Golding, Horace He, Connor Leahy, Kyle Mcdonell, Jason Phang, Michael Pieler, Usvsn Sai Prashanth, Shivanshu Purohit, Laria Reynolds, Jonathan Tow, Ben Wang, Samuel Weinbach
- Ask Me Anything: A Simple Strategy For Prompting Language Models Simran Arora, Avanika Narayan, Mayee F. Chen, Laurel Orr, Neel Guha, Kush Bhatia, Ines Chami, Frederic Sala, Christopher Ré
- Augmenting Interpretable Models With Llms During Training Singh Chandan, Askari Armin, Caruana Rich, Gao Jianfeng
- Explaining Patterns In Data With Language Models Via Interpretable Autoprompting Singh Chandan, Morris John X., Aneja Jyoti, Rush Alexander M., Gao Jianfeng
- Beyond The Imitation Game: Quantifying And Extrapolating The Capabilities Of Language Models Srivastava Aarohi, Rastogi Abhinav, Rao Abhishek, Shoeb Abu Awal Md, Abid Abubakar, Fisch Adam, Brown Adam R., Santoro Adam, Gupta Aditya, Garriga-alonso Adrià, Kluska Agnieszka, Lewkowycz Aitor, Agarwal Akshat, Power Alethea, Ray Alex, Warstadt Alex, Kocurek Alexander W., Safaya Ali, Tazarv Ali, Xiang Alice, Parrish Alicia, Nie Allen, Hussain Aman, Askell Amanda, Dsouza Amanda, Slone Ambrose, Rahane Ameet, Iyer Anantharaman S., Andreassen Anders, Madotto Andrea, Santilli Andrea, Stuhlmüller Andreas, Dai Andrew, La Andrew, Lampinen Andrew, Zou Andy, Jiang Angela, Chen Angelica, Vuong Anh, Gupta Animesh, Gottardi Anna, Norelli Antonio, Venkatesh Anu, Gholamidavoodi Arash, Tabassum Arfa, Menezes Arul, Kirubarajan Arun, Mullokandov Asher, Sabharwal Ashish, Herrick Austin, Efrat Avia, Erdem Aykut, Karakaş Ayla, Roberts B. Ryan, Loe Bao Sheng, Zoph Barret, Bojanowski Bartłomiej, Özyurt Batuhan, Hedayatnia Behnam, Neyshabur Behnam, Inden Benjamin, Stein Benno, Ekmekci Berk, Lin Bill Yuchen, Howald Blake, Orinion Bryan, Diao Cameron, Dour Cameron, Stinson Catherine, Argueta Cedrick, Ramírez César Ferri, Singh Chandan, Rathkopf Charles, Meng Chenlin, Baral Chitta, Wu Chiyu, Callison-burch Chris, Waites Chris, Voigt Christian, Manning Christopher D., Potts Christopher, Ramirez Cindy, Rivera Clara E., Siro Clemencia, Raffel Colin, Ashcraft Courtney, Garbacea Cristina, Sileo Damien, Garrette Dan, Hendrycks Dan, Kilman Dan, Roth Dan, Freeman Daniel, Khashabi Daniel, Levy Daniel, González Daniel Moseguí, Perszyk Danielle, Hernandez Danny, Chen Danqi, Ippolito Daphne, Gilboa Dar, Dohan David, Drakard David, Jurgens David, Datta Debajyoti, Ganguli Deep, Emelin Denis, Kleyko Denis, Yuret Deniz, Chen Derek, Tam Derek, Hupkes Dieuwke, Misra Diganta, Buzan Dilyar, Mollo Dimitri Coelho, Yang Diyi, Lee Dong-ho, Schrader Dylan, Shutova Ekaterina, Cubuk Ekin Dogus, Segal Elad, Hagerman Eleanor, Barnes Elizabeth, Donoway Elizabeth, Pavlick Ellie, Rodola Emanuele, Lam Emma, Chu Eric, Tang Eric, Erdem Erkut, Chang Ernie, Chi Ethan A., Dyer Ethan, Jerzak Ethan, Kim Ethan, Manyasi Eunice Engefu, Zheltonozhskii Evgenii, Xia Fanyue, Siar Fatemeh, Martínez-plumed Fernando, Happé Francesca, Chollet Francois, Rong Frieda, Mishra Gaurav, Winata Genta Indra, De Melo Gerard, Kruszewski Germán, Parascandolo Giambattista, Mariani Giorgio, Wang Gloria, Jaimovitch-lópez Gonzalo, Betz Gregor, Gur-ari Guy, Galijasevic Hana, Kim Hannah, Rashkin Hannah, Hajishirzi Hannaneh, Mehta Harsh, Bogar Hayden, Shevlin Henry, Schütze Hinrich, Yakura Hiromu, Zhang Hongming, Wong Hugh Mee, Ng Ian, Noble Isaac, Jumelet Jaap, Geissinger Jack, Kernion Jackson, Hilton Jacob, Lee Jaehoon, Fisac Jaime Fernández, Simon James B., Koppel James, Zheng James, Zou James, Kocoń Jan, Thompson Jana, Wingfield Janelle, Kaplan Jared, Radom Jarema, Sohl-dickstein Jascha, Phang Jason, Wei Jason, Yosinski Jason, Novikova Jekaterina, Bosscher Jelle, Marsh Jennifer, Kim Jeremy, Taal Jeroen, Engel Jesse, Alabi Jesujoba, Xu Jiacheng, Song Jiaming, Tang Jillian, Waweru Joan, Burden John, Miller John, Balis John U., Batchelder Jonathan, Berant Jonathan, Frohberg Jörg, Rozen Jos, Hernandez-orallo Jose, Boudeman Joseph, Guerr Joseph, Jones Joseph, Tenenbaum Joshua B., Rule Joshua S., Chua Joyce, Kanclerz Kamil, Livescu Karen, Krauth Karl, Gopalakrishnan Karthik, Ignatyeva Katerina, Markert Katja, Dhole Kaustubh D., Gimpel Kevin, Omondi Kevin, Mathewson Kory, Chiafullo Kristen, Shkaruta Ksenia, Shridhar Kumar, Mcdonell Kyle, Richardson Kyle, Reynolds Laria, Gao Leo, Zhang Li, Dugan Liam, Qin Lianhui, Contreras-ochando Lidia, Morency Louis-philippe, Moschella Luca, Lam Lucas, Noble Lucy, Schmidt Ludwig, He Luheng, Colón Luis Oliveros, Metz Luke, Şenel Lütfi Kerem, Bosma Maarten, Sap Maarten, Ter Hoeve Maartje, Farooqi Maheen, Faruqui Manaal, Mazeika Mantas, Baturan Marco, Marelli Marco, Maru Marco, Quintana Maria Jose Ramírez, Tolkiehn Marie, Giulianelli Mario, Lewis Martha, Potthast Martin, Leavitt Matthew L., Hagen Matthias, Schubert Mátyás, Baitemirova Medina Orduna, Arnaud Melody, Mcelrath Melvin, Yee Michael A., Cohen Michael, Gu Michael, Ivanitskiy Michael, Starritt Michael, Strube Michael, Swędrowski Michał, Bevilacqua Michele, Yasunaga Michihiro, Kale Mihir, Cain Mike, Xu Mimee, Suzgun Mirac, Walker Mitch, Tiwari Mo, Bansal Mohit, Aminnaseri Moin, Geva Mor, Gheini Mozhdeh, T Mukund Varma, Peng Nanyun, Chi Nathan A., Lee Nayeon, Krakover Neta Gur-ari, Cameron Nicholas, Roberts Nicholas, Doiron Nick, Martinez Nicole, Nangia Nikita, Deckers Niklas, Muennighoff Niklas, Keskar Nitish Shirish, Iyer Niveditha S., Constant Noah, Fiedel Noah, Wen Nuan, Zhang Oliver, Agha Omar, Elbaghdadi Omar, Levy Omer, Evans Owain, Casares Pablo Antonio Moreno, Doshi Parth, Fung Pascale, Liang Paul Pu, Vicol Paul, Alipoormolabashi Pegah, Liao Peiyuan, Liang Percy, Chang Peter, Eckersley Peter, Htut Phu Mon, Hwang Pinyu, Miłkowski Piotr, Patil Piyush, Pezeshkpour Pouya, Oli Priti, Mei Qiaozhu, Lyu Qing, Chen Qinlang, Banjade Rabin, Rudolph Rachel Etta, Gabriel Raefer, Habacker Rahel, Risco Ramon, Millière Raphaël, Garg Rhythm, Barnes Richard, Saurous Rif A., Arakawa Riku, Raymaekers Robbe, Frank Robert, Sikand Rohan, Novak Roman, Sitelew Roman, Lebras Ronan, Liu Rosanne, Jacobs Rowan, Zhang Rui, Salakhutdinov Ruslan, Chi Ryan, Lee Ryan, Stovall Ryan, Teehan Ryan, Yang Rylan, Singh Sahib, Mohammad Saif M., Anand Sajant, Dillavou Sam, Shleifer Sam, Wiseman Sam, Gruetter Samuel, Bowman Samuel R., Schoenholz Samuel S., Han Sanghyun, Kwatra Sanjeev, Rous Sarah A., Ghazarian Sarik, Ghosh Sayan, Casey Sean, Bischoff Sebastian, Gehrmann Sebastian, Schuster Sebastian, Sadeghi Sepideh, Hamdan Shadi, Zhou Sharon, Srivastava Shashank, Shi Sherry, Singh Shikhar, Asaadi Shima, Gu Shixiang Shane, Pachchigar Shubh, Toshniwal Shubham, Upadhyay Shyam, Debnath Shyamolima (Shammie), Shakeri Siamak, Thormeyer Simon, Melzi Simone, Reddy Siva, Makini Sneha Priscilla, Lee Soo-hwan, Torene Spencer, Hatwar Sriharsha, Dehaene Stanislas, Divic Stefan, Ermon Stefano, Biderman Stella, Lin Stephanie, Prasad Stephen, Piantadosi Steven T., Shieber Stuart M., Misherghi Summer, Kiritchenko Svetlana, Mishra Swaroop, Linzen Tal, Schuster Tal, Li Tao, Yu Tao, Ali Tariq, Hashimoto Tatsu, Wu Te-lin, Desbordes Théo, Rothschild Theodore, Phan Thomas, Wang Tianle, Nkinyili Tiberius, Schick Timo, Kornev Timofei, Tunduny Titus, Gerstenberg Tobias, Chang Trenton, Neeraj Trishala, Khot Tushar, Shultz Tyler, Shaham Uri, Misra Vedant, Demberg Vera, Nyamai Victoria, Raunak Vikas, Ramasesh Vinay, Prabhu Vinay Uday, Padmakumar Vishakh, Srikumar Vivek, Fedus William, Saunders William, Zhang William, Vossen Wout, Ren Xiang, Tong Xiaoyu, Zhao Xinran, Wu Xinyi, Shen Xudong, Yaghoobzadeh Yadollah, Lakretz Yair, Song Yangqiu, Bahri Yasaman, Choi Yejin, Yang Yichi, Hao Yiding, Chen Yifu, Belinkov Yonatan, Hou Yu, Hou Yufang, Bai Yuntao, Seid Zachary, Zhao Zhuoye, Wang Zijian, Wang Zijie J., Wang Zirui, Wu Ziyi
- Towards Zero-shot And Few-shot Table Question Answering Using GPT-3 Srivastava Pragya, Ganu Tanuja, Guha Saikat
- Putting Gpt-3's Creativity To The (alternative Uses) Test Stevenson Claire, Smal Iris, Baas Matthijs, Grasman Raoul, Van Der Maas Han
- Black-box Tuning For Language-model-as-a-service Sun Tianxiang, Shao Yunfan, Qian Hong, Huang Xuanjing, Qiu Xipeng
- Dynamar: Dynamic Prompt With Mask Token Representation Sun Xiaodi, Rajagopalan Sunny, Nigam Priyanka, Lu Weiyi, Xu Yi, Zeng Belinda, Chilimbi Trishul
- OPT: Open Pre-trained Transformer Language Models Susan Zhang, Stephen Roller, Naman Goyal, Mikel Artetxe, Moya Chen, Shuohui Chen, Christopher Dewan, Mona Diab, Xian Li, Xi Victoria Lin, Todor Mihaylov, Myle Ott, Sam Shleifer, Kurt Shuster, Daniel Simig, Punit Singh Koura, Anjali Sridhar, Tianlu Wang, Luke Zettlemoyer
- Chatgpt: The End Of Online Exam Integrity? Susnjak Teo
- The AI Teacher Test: Measuring The Pedagogical Ability Of Blender And GPT-3 In Educational Dialogues Tack Anaïs, Piech Chris
- Large Language Models Are Zero-shot Reasoners Takeshi Kojima, Shixiang Shane Gu, Machel Reid, Yutaka Matsuo, Yusuke Iwasawa
- TAPE: Assessing Few-shot Russian Language Understanding Taktasheva Ekaterina, Shavrina Tatiana, Fenogenova Alena, Shevelev Denis, Katricheva Nadezhda, Tikhonova Maria, Akhmetgareeva Albina, Zinkevich Oleg, Bashmakova Anastasiia, Iordanskaia Svetlana, Spiridonova Alena, Kurenshchikova Valentina, Artemova Ekaterina, Mikhailov Vladislav
- Commonsenseqa 2.0: Exposing The Limits Of AI Through Gamification Talmor Alon, Yoran Ori, Bras Ronan Le, Bhagavatula Chandra, Goldberg Yoav, Choi Yejin, Berant Jonathan
- News Summarization And Evaluation In The Era Of GPT-3 Tanya Goyal, Junyi Jessy Li, Greg Durrett
- Compression Of Generative Pre-trained Language Models Via Quantization Tao Chaofan, Hou Lu, Zhang Wei, Shang Lifeng, Jiang Xin, Liu Qun, Luo Ping, Wong Ngai
- Interleaving Retrieval With Chain-of-thought Reasoning For Knowledge-intensive Multi-step Questions Trivedi Harsh, Balasubramanian Niranjan, Khot Tushar, Sabharwal Ashish
- Do Large Language Models Know What Humans Know? Trott Sean, Jones Cameron, Chang Tyler, Michaelov James, Bergen Benjamin
- Adavae: Exploring Adaptive Gpt-2s In Variational Auto-encoders For Language Modeling Tu Haoqin, Yang Zhongliang, Yang Jinshuai, Huang Yongfeng
- Help Me Write A Poem: Instruction Tuning As A Vehicle For Collaborative Poetry Writing Tuhin Chakrabarty, Vishakh Padmakumar, He He
- Effect And Analysis Of Large-scale Language Model Rescoring On Competitive ASR Systems Udagawa Takuma, Suzuki Masayuki, Kurata Gakuto, Itoh Nobuyasu, Saon George
- Inferring Implicit Relations In Complex Questions With Language Models Uri Katz, Mor Geva, Jonathan Berant
- Dylora: Parameter Efficient Tuning Of Pre-trained Models Using Dynamic Search-free Low-rank Adaptation Valipour Mojtaba, Rezagholizadeh Mehdi, Kobyzev Ivan, Ghodsi Ali
- Will It Blend? Mixing Training Paradigms & Prompting For Argument Quality Prediction Van Der Meer Michiel, Reuver Myrthe, Khurana Urja, Krause Lea, Santamaría Selene Báez
- UGIF: UI Grounded Instruction Following Venkatesh Sagar Gubbi, Talukdar Partha, Narayanan Srini
- How Large Language Models Are Transforming Machine-paraphrased Plagiarism Wahle Jan Philip, Ruas Terry, Kirstein Frederic, Gipp Bela
- Exploring The Limits Of Domain-adaptive Training For Detoxifying Large-scale Language Models Wang Boxin, Ping Wei, Xiao Chaowei, Xu Peng, Patwary Mostofa, Shoeybi Mohammad, Li Bo, Anandkumar Anima, Catanzaro Bryan
- Hybrid-regressive Neural Machine Translation Wang Qiang, Hu Xinhui, Chen Ming
- What Language Model Architecture And Pretraining Objective Work Best For Zero-shot Generalization? Wang Thomas, Roberts Adam, Hesslow Daniel, Scao Teven Le, Chung Hyung Won, Beltagy Iz, Launay Julien, Raffel Colin
- Elaboration-generating Commonsense Question Answering At Scale Wang Wenya, Srikumar Vivek, Hajishirzi Hanna, Smith Noah A.
- Qrelscore: Better Evaluating Generated Questions With Deeper Understanding Of Context-aware Relevance Wang Xiaoqiang, Liu Bang, Tang Siliang, Wu Lingfei
- Compilable Neural Code Generation With Compiler Feedback Wang Xin, Wang Yasheng, Wan Yao, Mi Fei, Li Yitong, Zhou Pingyi, Liu Jin, Wu Hao, Jiang Xin, Liu Qun
- Pay Attention To Your Tone: Introducing A New Dataset For Polite Language Rewrite Wang Xun, Ge Tao, Mao Allen, Li Yuki, Wei Furu, Chen Si-qing
- Pan More Gold From The Sand: Refining Open-domain Dialogue Training With Noisy Self-retrieval Generation Wang Yihe, Li Yitong, Wang Yasheng, Mi Fei, Zhou Pingyi, Wang Xin, Liu Jin, Jiang Xin, Liu Qun
- Unified Multimodal Model With Unlikelihood Training For Visual Dialog Wang Zihao, Wang Junli, Jiang Changjun
- Emergent Analogical Reasoning In Large Language Models Webb Taylor, Holyoak Keith J., Lu Hongjing
- Mixed-effects Transformers For Hierarchical Adaptation White Julia, Goodman Noah, Hawkins Robert
- Exploring The Efficacy Of Pre-trained Checkpoints In Text-to-music Generation Task Wu Shangda, Sun Maosong
- The Unreliability Of Explanations In Few-shot Prompting For Textual Reasoning Xi Ye, Greg Durrett
- Go-tuning: Improving Zero-shot Learning Abilities Of Smaller Language Models Xu Jingjing, Dong Qingxiu, Liu Hongyi, Li Lei
- Integrating Translation Memories Into Non-autoregressive Machine Translation Xu Jitao, Crego Josep, Yvon François
- Zero-shot Video Question Answering Via Frozen Bidirectional Language Models Yang Antoine, Miech Antoine, Sivic Josef, Laptev Ivan, Schmid Cordelia
- Testaug: A Framework For Augmenting Capability-based NLP Tests Yang Guanqun, Haque Mirazul, Song Qiaochu, Yang Wei, Liu Xueqing
- GLUE-X: Evaluating Natural Language Understanding Models From An Out-of-distribution Generalization Perspective Yang Linyi, Zhang Shuibai, Qin Libo, Li Yafu, Wang Yidong, Liu Hanmeng, Wang Jindong, Xie Xing, Zhang Yue
- What GPT Knows About Who Is Who Yang Xiaohan, Peynetti Eduardo, Meerman Vasco, Tanner Chris
- Zeroquant: Efficient And Affordable Post-training Quantization For Large-scale Transformers Yao Zhewei, Aminabadi Reza Yazdani, Zhang Minjia, Wu Xiaoxia, Li Conglong, He Yuxiong
- Random-ltd: Random And Layerwise Token Dropping Brings Efficient Training For Large-scale Transformers Yao Zhewei, Wu Xiaoxia, Li Conglong, Holmes Connor, Zhang Minjia, Li Cheng, He Yuxiong
- Guess The Instruction! Flipped Learning Makes Language Models Stronger Zero-shot Learners Ye Seonghyeon, Kim Doyoung, Jang Joel, Shin Joongbo, Seo Minjoon
- UL2: Unifying Language Learning Paradigms Yi Tay, Mostafa Dehghani, Vinh Q. Tran, Xavier Garcia, Jason Wei, Xuezhi Wang, Hyung Won Chung, Siamak Shakeri, Dara Bahri, Tal Schuster, Huaixiu Steven Zheng, Denny Zhou, Neil Houlsby, Donald Metzler
- Making Large Language Models Better Reasoners With Step-aware Verifier Yifei Li, Zeqi Lin, Shizhuo Zhang, Qiang Fu, Bei Chen, Jian-guang Lou, Weizhu Chen
- Active Example Selection For In-context Learning Yiming Zhang, Shi Feng, Chenhao Tan
- Large Language Models Are Better Reasoners With Self-verification Yixuan Weng, Minjun Zhu, Fei Xia, Bin Li, Shizhu He, Shengping Liu, Bin Sun, Kang Liu, Jun Zhao
- Super-naturalinstructions: Generalization Via Declarative Instructions On 1600+ NLP Tasks Yizhong Wang, Swaroop Mishra, Pegah Alipoormolabashi, Yeganeh Kordi, Amirreza Mirzaei, Anjana Arunkumar, Arjun Ashok, Arut Selvan Dhanasekaran, Atharva Naik, David Stap, Eshaan Pathak, Giannis Karamanolakis, Haizhi Gary Lai, Ishan Purohit, Ishani Mondal, Jacob Anderson, Kirby Kuznia, Krima Doshi, Maitreya Patel, Kuntal Kumar Pal, Mehrad Moradshahi, Mihir Parmar, Mirali Purohit, Neeraj Varshney, Phani Rohitha Kaza, Pulkit Verma, Ravsehaj Singh Puri, Rushang Karia, Shailaja Keyur Sampat, Savan Doshi, Siddhartha Mishra, Sujan Reddy, Sumanta Patro, Tanay Dixit, Xudong Shen, Chitta Baral, Yejin Choi, Noah A. Smith, Hannaneh Hajishirzi, Daniel Khashabi
- Scaling Autoregressive Models For Content-rich Text-to-image Generation Yu Jiahui, Xu Yuanzhong, Koh Jing Yu, Luong Thang, Baid Gunjan, Wang Zirui, Vasudevan Vijay, Ku Alexander, Yang Yinfei, Ayan Burcu Karagol, Hutchinson Ben, Han Wei, Parekh Zarana, Li Xin, Zhang Han, Baldridge Jason, Wu Yonghui
- MSDF: A General Open-domain Multi-skill Dialog Framework Zhao Yu, Hu Xinshuo, Li Yunxin, Hu Baotian, Li Dongfang, Chen Sichao, Wang Xiaolong
- Learning To Decompose: Hypothetical Question Decomposition Based On Comparable Texts Zhou Ben, Richardson Kyle, Yu Xiaodong, Roth Dan
- Reflection Of Thought: Inversely Eliciting Numerical Reasoning In Language Models Via Solving Linear Systems Zhou Fan, Dong Haoyu, Liu Qian, Cheng Zhoujun, Han Shi, Zhang Dongmei
- Binding Language Models In Symbolic Languages Zhoujun Cheng, Tianbao Xie, Peng Shi, Chengzu Li, Rahul Nadkarni, Yushi Hu, Caiming Xiong, Dragomir Radev, Mari Ostendorf, Luke Zettlemoyer, Noah A. Smith, Tao Yu
- Automatic Chain Of Thought Prompting In Large Language Models Zhuosheng Zhang, Aston Zhang, Mu Li, Alex Smola
- A Survey On GPT-3 Zong Mingyu, Krishnamachari Bhaskar
- LM-CPPF: Paraphrasing-guided Data Augmentation For Contrastive Prompt-based Few-shot Fine-tuning Abaskohi Amirhossein, Rothe Sascha, Yaghoobzadeh Yadollah
- Let The Llms Talk: Simulating Human-to-human Conversational QA Via Zero-shot Llm-to-llm Interactions Abbasiantaeb Zahra, Yuan Yifei, Kanoulas Evangelos, Aliannejadi Mohammad
- I Was Blind But Now I See: Implementing Vision-enabled Dialogue In Social Robots Abbo Giulio Antonio, Belpaeme Tony
- Comparative Analysis Of Chatgpt, GPT-4, And Microsoft Bing Chatbots For GRE Test Abu-haifa Mohammad, Etawi Bara'a, Alkhatatbeh Huthaifa, Ababneh Ayman
- On Codex Prompt Engineering For OCL Generation: An Empirical Study Abukhalaf Seif, Hamdaqa Mohammad, Khomh Foutse
- Multi-party Goal Tracking With Llms: Comparing Pre-training, Fine-tuning, And Prompt Engineering Addlesee Angus, Sieińska Weronika, Gunson Nancie, Garcia Daniel Hernández, Dondrup Christian, Lemon Oliver
- The ADAIO System At The BEA-2023 Shared Task On Generating AI Teacher Responses In Educational Dialogues Adigwe Adaeze, Yuan Zheng
- KITLM: Domain-specific Knowledge Integration Into Language Models For Question Answering Agarwal Ankush, Gawade Sakharam, Azad Amar Prakash, Bhattacharyya Pushpak
- Can Large Language Models Be Good Path Planners? A Benchmark And Investigation On Spatial-temporal Reasoning Aghzal Mohamed, Plaku Erion, Yao Ziyu
- SARATHI: Efficient LLM Inference By Piggybacking Decodes With Chunked Prefills Agrawal Amey, Panwar Ashish, Mohan Jayashree, Kwatra Nipun, Gulavani Bhargav S., Ramjee Ramachandran
- MEGA: Multilingual Evaluation Of Generative AI Ahuja Kabir, Diddee Harshita, Hada Rishav, Ochieng Millicent, Ramesh Krithika, Jain Prachi, Nambi Akshay, Ganu Tanuja, Segal Sameer, Axmed Maxamed, Bali Kalika, Sitaram Sunayana
- MEGAVERSE: Benchmarking Large Language Models Across Languages, Modalities, Models And Tasks Ahuja Sanchit, Aggarwal Divyanshu, Gumma Varun, Watts Ishaan, Sathe Ashutosh, Ochieng Millicent, Hada Rishav, Jain Prachi, Axmed Maxamed, Bali Kalika, Sitaram Sunayana
- Jointly Training Large Autoregressive Multimodal Models Aiello Emanuele, Yu Lili, Nie Yixin, Aghajanyan Armen, Oguz Barlas
- Can We Trust The Evaluation On Chatgpt? Aiyappa Rachith, An Jisun, Kwak Haewoon, Ahn Yong-yeol
- CESAR: Automatic Induction Of Compositional Instructions For Multi-turn Dialogs Aksu Taha, Hazarika Devamanyu, Mehri Shikib, Kim Seokhwan, Hakkani-tür Dilek, Liu Yang, Namazifar Mahdi
- An In-depth Look At Gemini's Language Abilities Akter Syeda Nahida, Yu Zichun, Muhamed Aashiq, Ou Tianyue, Bäuerle Alex, Cabrera Ángel Alexander, Dholakia Krish, Xiong Chenyan, Neubig Graham
- RL4F: Generating Natural Language Feedback With Reinforcement Learning For Repairing Model Outputs Akyürek Afra Feyza, Akyürek Ekin, Madaan Aman, Kalyan Ashwin, Clark Peter, Wijaya Derry, Tandon Niket
- Can Language Models Employ The Socratic Method? Experiments With Code Debugging Al-hossami Erfan, Bunescu Razvan, Smith Justin, Teehan Ryan
- Chatgpt And Beyond: The Generative AI Revolution In Education Al-smadi Mohammad
- Physics Of Language Models: Part 3.2, Knowledge Manipulation Allen-zhu Zeyuan, Li Yuanzhi
- The Falcon Series Of Open Language Models Almazrouei Ebtesam, Alobeidli Hamza, Alshamsi Abdulaziz, Cappelli Alessandro, Cojocaru Ruxandra, Debbah Mérouane, Goffinet Étienne, Hesslow Daniel, Launay Julien, Malartic Quentin, Mazzotta Daniele, Noune Badreddine, Pannier Baptiste, Penedo Guilherme
- Detecting Language Model Attacks With Perplexity Alon Gabriel, Kamfonas Michael
- Taqyim: Evaluating Arabic NLP Tasks Using Chatgpt Models Alyafeai Zaid, Alshaibani Maged S., Alkhamissi Badr, Luqman Hamzah, Alareqi Ebrahim, Fadel Ali
- Generative AI Perceptions: A Survey To Measure The Perceptions Of Faculty, Staff, And Students On Generative AI Tools In Academia Amani Sara, White Lance, Balart Trini, Arora Laksha, Shryock Kristi J., Brumbelow Kelly, Watson Karan L.
- Chatgpt-3.5, Chatgpt-4, Google Bard, And Microsoft Bing To Improve Health Literacy And Communication In Pediatric Populations And Beyond Amin Kanhai S., Mayes Linda, Khosla Pavan, Doshi Rushabh
- L-eval: Instituting Standardized Evaluation For Long Context Language Models An Chenxin, Gong Shansan, Zhong Ming, Zhao Xingjian, Li Mukai, Zhang Jun, Kong Lingpeng, Qiu Xipeng
- Does Deep Learning Learn To Abstract? A Systematic Probing Framework An Shengnan, Lin Zeqi, Chen Bei, Fu Qiang, Zheng Nanning, Lou Jian-guang
- Learning From Mistakes Makes LLM Better Reasoner An Shengnan, Ma Zexiong, Lin Zeqi, Zheng Nanning, Lou Jian-guang, Chen Weizhu
- Studying The Impacts Of Pre-training Using Chatgpt-generated Text On Downstream Tasks Anand Sarthak
- Protip: Progressive Tool Retrieval Improves Planning Anantha Raviteja, Bandyopadhyay Bortik, Kashi Anirudh, Mahinder Sayantan, Hill Andrew W, Chappidi Srinivas
- Context Tuning For Retrieval Augmented Generation Anantha Raviteja, Bethi Tharun, Vodianik Danil, Chappidi Srinivas
- Expel: LLM Agents Are Experiential Learners Andrew Zhao, Daniel Huang, Quentin Xu, Matthieu Lin, Yong-jin Liu, Gao Huang
- Scaling Laws For Language Encoding Models In Fmri Antonello Richard, Vaidya Aditya, Huth Alexander G.
- Agenttuning: Enabling Generalized Agent Abilities For Llms Aohan Zeng, Mingdao Liu, Rui Lu, Bowen Wang, Xiao Liu, Yuxiao Dong, Jie Tang
- Sequence-to-sequence Spanish Pre-trained Language Models Araujo Vladimir, Trusca Maria Mihaela, Tufiño Rodrigo, Moens Marie-francine
- Unmasking The Giant: A Comprehensive Evaluation Of Chatgpt's Proficiency In Coding Algorithms And Data Structures Arefin Sayed Erfan, Heya Tasnia Ashrafi, Al-qudah Hasan, Ineza Ynes, Serwadda Abdul
- GPT-4 Can't Reason Arkoudas Konstantine
- AI And Education: An Investigation Into The Use Of Chatgpt For Systems Thinking Arndt Holger
- Learning And Leveraging Verifiers To Improve Planning Capabilities Of Pre-trained Language Models Arora Daman, Kambhampati Subbarao
- Have Llms Advanced Enough? A Challenging Problem Solving Benchmark For Large Language Models Arora Daman, Singh Himanshu Gaurav, Mausam
- Towards A Unified Multimodal Reasoning Framework Arun Abhinav, Mal Dipendra Singh, Soni Mehul, Sawada Tomohiro
- Real-time Visual Feedback To Guide Benchmark Creation: A Human-and-metric-in-the-loop Workflow Arunkumar Anjana, Mishra Swaroop, Sachdeva Bhavdeep, Baral Chitta, Bryan Chris
- BUFFET: Benchmarking Large Language Models For Few-shot Cross-lingual Transfer Asai Akari, Kudugunta Sneha, Yu Xinyan Velocity, Blevins Terra, Gonen Hila, Reid Machel, Tsvetkov Yulia, Ruder Sebastian, Hajishirzi Hannaneh
- Self-rag: Learning To Retrieve, Generate, And Critique Through Self-reflection Asai Akari, Wu Zeqiu, Wang Yizhong, Sil Avirup, Hajishirzi Hannaneh
- Is Chatgpt A Biomedical Expert? -- Exploring The Zero-shot Performance Of Current GPT Models In Biomedical Tasks Ateia Samy, Kruschwitz Udo
- Factuality Challenges In The Era Of Large Language Models Augenstein Isabelle, Baldwin Timothy, Cha Meeyoung, Chakraborty Tanmoy, Ciampaglia Giovanni Luca, Corney David, Diresta Renee, Ferrara Emilio, Hale Scott, Halevy Alon, Hovy Eduard, Ji Heng, Menczer Filippo, Miguez Ruben, Nakov Preslav, Scheufele Dietram, Sharma Shivam, Zagni Giovanni
- Unihd At TSAR-2022 Shared Task: Is Compute All We Need For Lexical Simplification? Aumiller Dennis, Gertz Michael
- Openflamingo: An Open-source Framework For Training Large Autoregressive Vision-language Models Awadalla Anas, Gao Irena, Gardner Josh, Hessel Jack, Hanafy Yusuf, Zhu Wanrong, Marathe Kalyani, Bitton Yonatan, Gadre Samir, Sagawa Shiori, Jitsev Jenia, Kornblith Simon, Koh Pang Wei, Ilharco Gabriel, Wortsman Mitchell, Schmidt Ludwig
- Using Large Language Models For Zero-shot Natural Language Generation From Knowledge Graphs Axelsson Agnes, Skantze Gabriel
- Taught By The Internet, Exploring Bias In Openais GPT3 Ayaz Ali, Nawalgaria Aditya, Yin Ruilian
- Intentional Biases In LLM Responses Badyal Nicklaus, Jacoby Derek, Coady Yvonne
- Abusing Images And Sounds For Indirect Instruction Injection In Multi-modal Llms Bagdasaryan Eugene, Hsieh Tsung-yin, Nassi Ben, Shmatikov Vitaly
- Evaluating Chatgpt As A Question Answering System: A Comprehensive Analysis And Comparison With Existing Models Bahak Hossein, Taheri Farzaneh, Zojaji Zahra, Kazemi Arefeh
- Chatgpt: Applications, Opportunities, And Threats Bahrini Aram, Khamoshifar Mohammadsadra, Abbasimehr Hossein, Riggs Robert J., Esmaeili Maryam, Majdabadkohne Rastin Mastali, Pasehvar Morteza
- Constituency Parsing Using Llms Bai Xuefeng, Wu Jialong, Chen Yulong, Wang Zhongqing, Zhang Yue
- Longbench: A Bilingual, Multitask Benchmark For Long Context Understanding Bai Yushi, Lv Xin, Zhang Jiajie, Lyu Hongchang, Tang Jiankai, Huang Zhidian, Du Zhengxiao, Liu Xiao, Zeng Aohan, Hou Lei, Dong Yuxiao, Tang Jie, Li Juanzi
- Gpt-4: A Review On Advancements And Opportunities In Natural Language Processing Baktash Jawid Ahmad, Dawodi Mursal
- Tamil-llama: A New Tamil Language Model Based On Llama 2 Balachandran Abhinand
- No That's Not What I Meant: Handling Third Position Repair In Conversational Question Answering Balaraman Vevake, Eshghi Arash, Konstas Ioannis, Papaioannou Ioannis
- It's Not Easy Being Wrong: Large Language Models Struggle With Process Of Elimination Reasoning Balepur Nishant, Palta Shramay, Rudinger Rachel
- Investigating Pre-trained Language Models On Cross-domain Datasets, A Step Closer To General AI Ballout Mohamad, Krumnack Ulf, Heidemann Gunther, Kühnberger Kai-uwe
- Redefining Developer Assistance: Through Large Language Models In Software Ecosystem Banerjee Somnath, Dutta Avik, Layek Sayan, Sahoo Amruit, Joyce Sam Conrad, Hazra Rima
- Exploring Qualitative Research Using Llms Bano Muneera, Zowghi Didar, Whittle Jon
- Assessing And Enhancing The Robustness Of Large Language Models With Task Structure Variations For Logical Reasoning Bao Qiming, Gendron Gael, Peng Alex Yuxuan, Zhong Wanjun, Tan Neset, Chen Yang, Witbrock Michael, Liu Jiamou
- Exploring Iterative Enhancement For Improving Learnersourced Multiple-choice Question Explanations With Large Language Models Bao Qiming, Leinonen Juho, Peng Alex Yuxuan, Zhong Wanjun, Gendron Gaël, Pistotti Timothy, Huang Alice, Denny Paul, Witbrock Michael, Liu Jiamou
- Abstract Meaning Representation-based Logic-driven Data Augmentation For Logical Reasoning Bao Qiming, Peng Alex Yuxuan, Deng Zhenyun, Zhong Wanjun, Gendron Gael, Pistotti Timothy, Tan Neset, Young Nathan, Chen Yang, Zhu Yonghua, Denny Paul, Witbrock Michael, Liu Jiamou
- Check Your Facts And Try Again: Improving Large Language Models With External Knowledge And Automated Feedback Baolin Peng, Michel Galley, Pengcheng He, Hao Cheng, Yujia Xie, Yu Hu, Qiuyuan Huang, Lars Liden, Zhou Yu, Weizhu Chen, Jianfeng Gao
- Understanding Telecom Language Through Large Language Models Bariah Lina, Zou Hang, Zhao Qiyang, Mouhouche Belkacem, Bader Faouzi, Debbah Merouane
- Llamantino: Llama 2 Models For Effective Text Generation In Italian Language Basile Pierpaolo, Musacchio Elio, Polignano Marco, Siciliani Lucia, Fiameni Giuseppe, Semeraro Giovanni
- Text Summarization Using Large Language Models: A Comparative Study Of Mpt-7b-instruct, Falcon-7b-instruct, And Openai Chat-gpt Models Basyal Lochan, Sanghvi Mihir
- Testing The Depth Of Chatgpt's Comprehension Via Cross-modal Tasks Based On Ascii-art: Gpt3.5's Abilities In Regard To Recognizing And Generating Ascii-art Are Not Totally Lacking Bayani David
- Exploring The Landscape Of Large Language Models In Medical Question Answering Bean Andrew M., Korgul Karolina, Krones Felix, Mccraith Robert, Mahdi Adam
- Large Linguistic Models: Analyzing Theoretical Linguistic Abilities Of Llms Beguš Gašper, Dąbkowski Maksymilian, Rhodes Ryan
- Examining The Emergence Of Deductive Reasoning In Generative Language Models Belcak Peter, Lanzendörfer Luca A., Wattenhofer Roger
- Harnessing Gpt-3.5-turbo For Rhetorical Role Prediction In Legal Cases Belfathi Anas, Hernandez Nicolas, Monceaux Laura
- Worldsense: A Synthetic Benchmark For Grounded Reasoning In Large Language Models Benchekroun Youssef, Dervishi Megi, Ibrahim Mark, Gaya Jean-baptiste, Martinet Xavier, Mialon Grégoire, Scialom Thomas, Dupoux Emmanuel, Hupkes Dieuwke, Vincent Pascal
- @ve: A Chatbot For Latin Bendel Oliver, N'diaye Karim
- Large Language Models In The Workplace: A Case Study On Prompt Engineering For Job Type Classification Benjamin Clavié, Alexandru Ciceu, Frederick Naylor, Guillaume Soulié, Thomas Brightwell
- Prompt Sketching For Large Language Models Beurer-kellner Luca, Müller Mark Niklas, Fischer Marc, Vechev Martin
- When Automated Assessment Meets Automated Content Generation: Examining Text Quality In The Era Of Gpts Bevilacqua Marialena, Oketch Kezia, Qin Ruiyang, Stamey Will, Zhang Xinyuan, Gan Yi, Yang Kai, Abbasi Ahmed
- A Survey On Prompting Techniques In Llms Bhandari Prabin
- Are Large Language Models Geospatially Knowledgeable? Bhandari Prabin, Anastasopoulos Antonios, Pfoser Dieter
- Language Model Unalignment: Parametric Red-teaming To Expose Hidden Harms And Biases Bhardwaj Rishabh, Poria Soujanya
- Red-teaming Large Language Models Using Chain Of Utterances For Safety-alignment Bhardwaj Rishabh, Poria Soujanya
- Investigating Answerability Of Llms For Long-form Question Answering Bhat Meghana Moorthy, Meng Rui, Liu Ye, Zhou Yingbo, Yavuz Semih
- TART: A Plug-and-play Transformer Module For Task-agnostic Reasoning Bhatia Kush, Narayan Avanika, De Sa Christopher, Ré Christopher
- Chatgpt Is A Knowledgeable But Inexperienced Solver: An Investigation Of Commonsense Problem In Large Language Models Bian Ning, Han Xianpei, Sun Le, Lin Hongyu, Lu Yaojie, He Ben, Jiang Shanshan, Dong Bin
- Loquacity And Visible Emotion: Chatgpt As A Policy Advisor Biancotti Claudia, Camassa Carolina
- In-context Learning Dynamics With Random Binary Sequences Bigelow Eric J., Lubana Ekdeep Singh, Dick Robert P., Tanaka Hidenori, Ullman Tomer D.
- VIGC: Visual Instruction Generation And Correction Bin Wang, Fan Wu, Xiao Han, Jiahui Peng, Huaping Zhong, Pan Zhang, Xiaoyi Dong, Weijia Li, Wei Li, Jiaqi Wang, Conghui He
- Model Leeching: An Extraction Attack Targeting Llms Birch Lewis, Hackett William, Trawicki Stefan, Suri Neeraj, Garraghan Peter
- Can Chatgpt Be Your Personal Medical Assistant? Biswas Md. Rafiul, Islam Ashhadul, Shah Zubair, Zaghouani Wajdi, Belhaouari Samir Brahim
- Distractor Generation For Multiple-choice Questions With Predictive Prompting And Large Language Models Bitew Semere Kiros, Deleu Johannes, Develder Chris, Demeester Thomas
- Visit-bench: A Benchmark For Vision-language Instruction Following Inspired By Real-world Use Bitton Yonatan, Bansal Hritik, Hessel Jack, Shao Rulin, Zhu Wanrong, Awadalla Anas, Gardner Josh, Taori Rohan, Schmidt Ludwig
- How Close Is Chatgpt To Human Experts? Comparison Corpus, Evaluation, And Detection Biyang Guo, Xin Zhang, Ziyuan Wang, Minqi Jiang, Jinran Nie, Yuxuan Ding, Jianwei Yue, Yupeng Wu
- BLT: Can Large Language Models Handle Basic Legal Text? Blair-stanek Andrew, Holzenberger Nils, Van Durme Benjamin
- Chip-chat: Challenges And Opportunities In Conversational Hardware Design Blocklove Jason, Garg Siddharth, Karri Ramesh, Pearce Hammond
- Personality Testing Of Large Language Models: Limited Temporal Stability, But Highlighted Prosociality Bodroza Bojana, Dinic Bojana M., Bojic Ljubisa
- Halle-control: Controlling Object Hallucination In Large Multimodal Models Bohan Zhai, Shijia Yang, Chenfeng Xu, Sheng Shen, Kurt Keutzer, Chunyuan Li, Manling Li
- GPT-4 Surpassing Human Performance In Linguistic Pragmatics Bojic Ljubisa, Kovacevic Predrag, Cabarkapa Milan
- Ask Language Model To Clean Your Noisy Translation Data Bolding Quinten, Liao Baohao, Denis Brandon James, Luo Jun, Monz Christof
- Pixlore: A Dataset-driven Approach To Rich Image Captioning Bonilla Diego
- A Categorical Archive Of Chatgpt Failures Borji Ali
- Soundstorm: Efficient Parallel Audio Generation Borsos Zalán, Sharifi Matt, Vincent Damien, Kharitonov Eugene, Zeghidour Neil, Tagliasacchi Marco
- Analyzing Multilingual Competency Of Llms In Multi-turn Instruction Following: A Case Study Of Arabic Boughorbel Sabri, Hawasly Majd
- Inpars-light: Cost-effective Unsupervised Training Of Efficient Rankers Boytsov Leonid, Patel Preksha, Sourabh Vivek, Nisar Riddhi, Kundu Sayani, Ramanathan Ramya, Nyberg Eric
- Chemcrow: Augmenting Large-language Models With Chemistry Tools Bran Andres M, Cox Sam, Schilter Oliver, Baldassari Carlo, White Andrew D, Schwaller Philippe
- Evaluating Large Language Models For Document-grounded Response Generation In Information-seeking Dialogues Braunschweiler Norbert, Doddipatla Rama, Keizer Simon, Stoyanchev Svetlana
- Extractgpt: Exploring The Potential Of Large Language Models For Product Attribute Value Extraction Brinkmann Alexander, Shraga Roee, Bizer Christian
- Fine-grained Affective Processing Capabilities Emerging From Large Language Models Broekens Joost, Hilpert Bernhard, Verberne Suzan, Baraka Kim, Gebhard Patrick, Plaat Aske
- Accuracy Of A Vision-language Model On Challenging Medical Cases Buckley Thomas, Diao James A., Rodman Adam, Manrai Arjun K.
- Utilizing Chatgpt Generated Data To Retrieve Depression Symptoms From Social Media Bucur Ana-maria
- Generative AI Assistants In Software Development Education: A Vision For Integrating Generative AI Into Educational Practice, Not Instinctively Defending Against It Bull Christopher, Kharrufa Ahmed
- Weak-to-strong Generalization: Eliciting Strong Capabilities With Weak Supervision Burns Collin, Izmailov Pavel, Kirchner Jan Hendrik, Baker Bowen, Gao Leo, Aschenbrenner Leopold, Chen Yining, Ecoffet Adrien, Joglekar Manas, Leike Jan, Sutskever Ilya, Wu Jeff
- A Comparative Study Of Code Generation Using Chatgpt 3.5 Across 10 Programming Languages Buscemi Alessio
- Just Tell Me: Prompt Engineering In Business Process Management Busch Kiran, Rochlitzer Alexander, Sola Diana, Leopold Henrik
- Pokemonchat: Auditing Chatgpt For Pokémon Universe Knowledge Cabello Laura, Li Jiaang, Chalkidis Ilias
- Development Of The Chatgpt, Generative Artificial Intelligence And Natural Large Language Models For Accountable Reporting And Use (CANGARU) Guidelines Cacciamani Giovanni E., Eppler Michael B., Ganjavi Conner, Pekan Asli, Biedermann Brett, Collins Gary S., Gill Inderbir S.
- Peek Across: Improving Multi-document Modeling Via Cross-document Question-answering Caciularu Avi, Peters Matthew E., Goldberger Jacob, Dagan Ido, Cohan Arman
- Large Language Models As Tool Makers Cai Tianle, Wang Xuezhi, Ma Tengyu, Chen Xinyun, Zhou Denny
- Do Large Language Models Resemble Humans In Language Use? Cai Zhenguang G., Duan Xufeng, Haslett David A., Wang Shuqi, Pickering Martin J.
- On The Application Of Large Language Models For Language Teaching And Assessment Technology Caines Andrew, Benedetto Luca, Taslimipoor Shiva, Davis Christopher, Gao Yuan, Andersen Oeistein, Yuan Zheng, Elliott Mark, Moore Russell, Bryant Christopher, Rei Marek, Yannakoudakis Helen, Mullooly Andrew, Nicholls Diane, Buttery Paula
- A Systematic Study Of Knowledge Distillation For Natural Language Generation With Pseudo-target Training Calderon Nitay, Mukherjee Subhabrata, Reichart Roi, Kantor Amir
- Can Transformers Learn Sequential Function Classes In Context? Campbell Ryan, Guo Emma, Hu Evan, Vir Reya, Hsiao Ethan
- Wizardlm: Empowering Large Language Models To Follow Complex Instructions Can Xu, Qingfeng Sun, Kai Zheng, Xiubo Geng, Pu Zhao, Jiazhan Feng, Chongyang Tao, Daxin Jiang
- H2O Open Ecosystem For State-of-the-art Large Language Models Candel Arno, Mckinney Jon, Singer Philipp, Pfeiffer Pascal, Jeblick Maximilian, Lee Chun Ming, Conde Marcos V.
- H2ogpt: Democratizing Large Language Models Candel Arno, Mckinney Jon, Singer Philipp, Pfeiffer Pascal, Jeblick Maximilian, Prabhu Prithvi, Gambera Jeff, Landry Mark, Bansal Shivam, Chesler Ryan, Lee Chun Ming, Conde Marcos V., Stetsenko Pasha, Grellier Olivier, Ambati Srisatish
- A Step Closer To Comprehensive Answers: Constrained Multi-stage Question Decomposition With Large Language Models Cao Hejing, An Zhenwei, Feng Jiazhan, Xu Kun, Chen Liwei, Zhao Dongyan
- Systematic Rectification Of Language Models Via Dead-end Analysis Cao Meng, Fatemi Mehdi, Cheung Jackie Chi Kit, Shabanian Samira
- Unnatural Error Correction: GPT-4 Can Almost Perfectly Handle Unnatural Scrambled Text Cao Qi, Kojima Takeshi, Matsuo Yutaka, Iwasawa Yusuke
- Exploring The Effectiveness Of Chatgpt-based Feedback Compared With Teacher Feedback And Self-feedback: Evidence From Chinese To English Translation Cao Siyi, Zhong Linping
- How Does AI Chat Change Search Behaviors? Capra Robert, Arguello Jaime
- Assessing Large Language Models' Ability To Predict How Humans Balance Self-interest And The Interest Of Others Capraro Valerio, Di Paolo Roberto, Pizziol Veronica
- News Verifiers Showdown: A Comparative Performance Evaluation Of Chatgpt 3.5, Chatgpt 4.0, Bing AI, And Bard In News Fact-checking Caramancion Kevin Matthe
- A LLM Assisted Exploitation Of Ai-guardian Carlini Nicholas
- Revolutionizing Mobile Interaction: Enabling A 3 Billion Parameter GPT LLM On Mobile Carreira Samuel, Marques Tomás, Ribeiro José, Grilo Carlos
- Can It Edit? Evaluating The Ability Of Large Language Models To Follow Code Editing Instructions Cassano Federico, Li Luisa, Sethi Akul, Shinn Noah, Brennan-jones Abby, Ginesin Jacob, Berman Edward, Chakhnashvili George, Lozhkov Anton, Anderson Carolyn Jane, Guha Arjun
- Examining The Influence Of Varied Levels Of Domain Knowledge Base Inclusion In Gpt-based Intelligent Tutors Castleman Blake, Turkcan Mehmet Kerem
- Chatgpt To Replace Crowdsourcing Of Paraphrases For Intent Classification: Higher Diversity And Comparable Model Robustness Cegin Jan, Simko Jakub, Brusilovsky Peter
- An Empirical Study Of Using Chatgpt For Fact Verification Task Chakraborty Mohna, Kulkarni Adithya, Li Qi
- Chatgpt May Pass The Bar Exam Soon, But Has A Long Way To Go For The Lexglue Benchmark Chalkidis Ilias
- Chatgpt Evaluation On Sentence Level Relations: A Focus On Temporal, Causal, And Discourse Relations Chan Chunkit, Cheng Jiayang, Wang Weiqi, Jiang Yuxin, Fang Tianqing, Liu Xin, Song Yangqiu
- Which Spurious Correlations Impact Reasoning In NLI Models? A Visual Interactive Diagnosis Through Data-constrained Counterfactuals Chan Robin, Amini Afra, El-assady Mennatallah
- Muse: Text-to-image Generation Via Masked Generative Transformers Chang Huiwen, Zhang Han, Barber Jarred, Maschinot Aj, Lezama Jose, Jiang Lu, Yang Ming-hsuan, Murphy Kevin, Freeman William T., Rubinstein Michael, Li Yuanzhen, Krishnan Dilip
- Learning To Generate Better Than Your LLM Chang Jonathan D., Brantley Kiante, Ramamurthy Rajkumar, Misra Dipendra, Sun Wen
- Chipgpt: How Far Are We From Natural Language Hardware Design Chang Kaiyan, Wang Ying, Ren Haimeng, Wang Mengdi, Liang Shengwen, Han Yinhe, Li Huawei, Li Xiaowei
- Characterizing Learning Curves During Language Model Pre-training: Learning, Forgetting, And Stability Chang Tyler A., Tu Zhuowen, Bergen Benjamin K.
- Hallucination Augmented Contrastive Learning For Multimodal Large Language Model Chaoya Jiang, Haiyang Xu, Mengfan Dong, Jiaxing Chen, Wei Ye, Ming Yan, Qinghao Ye, Ji Zhang, Fei Huang, Shikun Zhang
- Memgpt: Towards Llms As Operating Systems Charles Packer, Sarah Wooders, Kevin Lin, Vivian Fang, Shishir G. Patil, Ion Stoica, Joseph E. Gonzalez
- She Had Cobalt Blue Eyes: Prompt Testing To Create Aligned And Sustainable Language Models Chatrath Veronica, Bamgbose Oluwanifemi, Raza Shaina
- Cabbage Sweeter Than Cake? Analysing The Potential Of Large Language Models For Learning Conceptual Spaces Chatterjee Usashi, Gajbhiye Amit, Schockaert Steven
- Efficient GPT Model Pre-training Using Tensor Train Matrix Representation Chekalina Viktoriia, Novikov Georgii, Gusak Julia, Oseledets Ivan, Panchenko Alexander
- Two Failures Of Self-consistency In The Multi-step Reasoning Of Llms Chen Angelica, Phang Jason, Parrish Alicia, Padmakumar Vishakh, Zhao Chen, Bowman Samuel R., Cho Kyunghyun
- Fireact: Toward Language Agent Fine-tuning Chen Baian, Shu Chang, Shareghi Ehsan, Collier Nigel, Narasimhan Karthik, Yao Shunyu
- Generate Labeled Training Data Using Prompt Programming And GPT-3. An Example Of Big Five Personality Classification Chen Eason
- Gptutor: A Chatgpt-powered Programming Tool For Code Explanation Chen Eason, Huang Ray, Chen Han-shin, Tseng Yuen-hsien, Li Liang-yi
- X-LLM: Bootstrapping Advanced Large Language Models By Treating Multi-modalities As Foreign Languages Chen Feilong, Han Minglun, Zhao Haozhi, Zhang Qingyang, Shi Jing, Xu Shuang, Xu Bo
- Tem-adapter: Adapting Image-text Pretraining For Video Question Answer Chen Guangyi, Liu Xiao, Wang Guangrun, Zhang Kun, Torr Philip H. S., Zhang Xiao-ping, Tang Yansong
- CLEX: Continuous Length Extrapolation For Large Language Models Chen Guanzheng, Li Xin, Meng Zaiqiao, Liang Shangsong, Bing Lidong
- Videollm: Modeling Video Sequence With Large Language Models Chen Guo, Zheng Yin-dong, Wang Jiahao, Xu Jilan, Huang Yifei, Pan Junting, Wang Yi, Wang Yali, Qiao Yu, Lu Tong, Wang Limin
- Chatgpt's One-year Anniversary: Are Open-source Large Language Models Catching Up? Chen Hailin, Jiao Fangkai, Li Xingxuan, Qin Chengwei, Ravaut Mathieu, Zhao Ruochen, Xiong Caiming, Joty Shafiq
- Personalised Distillation: Empowering Open-sourced Llms With Adaptive Learning For Code Generation Chen Hailin, Saha Amrita, Hoi Steven, Joty Shafiq
- Fairytalecqa: Integrating A Commonsense Knowledge Graph Into Children's Storybook Narratives Chen Jiaju, Lu Yuxuan, Zhang Shao, Yao Bingsheng, Dong Yuanzhe, Xu Ying, Li Yunyao, Wang Qianwen, Wang Dakuo, Sun Yuling
- Quantifying Uncertainty In Answers From Any Language Model And Enhancing Their Trustworthiness Chen Jiuhai, Mueller Jonas
- Huatuogpt-ii, One-stage Training For Medical Adaption Of Llms Chen Junying, Wang Xidong, Gao Anningzhe, Jiang Feng, Chen Shunian, Zhang Hongbo, Song Dingjie, Xie Wenya, Kong Chuyi, Li Jianquan, Wan Xiang, Li Haizhou, Wang Benyou
- Forgetful Large Language Models: Lessons Learned From Using Llms In Robot Programming Chen Juo-tung, Huang Chien-ming
- Reconcile: Round-table Conference Improves Reasoning Via Consensus Among Diverse Llms Chen Justin Chih-yao, Saha Swarnadeep, Bansal Mohit
- Alpagasus: Training A Better Alpaca With Fewer Data Chen Lichang, Li Shiyang, Yan Jun, Wang Hai, Gunaratna Kalpa, Yadav Vikas, Tang Zheng, Srinivasan Vijay, Zhou Tianyi, Huang Heng, Jin Hongxia
- Driving With Llms: Fusing Object-level Vector Modality For Explainable Autonomous Driving Chen Long, Sinavski Oleg, Hünermann Jan, Karnsund Alice, Willmott Andrew James, Birch Danny, Maund Daniel, Shotton Jamie
- Few-shot Data Synthesis For Open Domain Multi-hop Question Answering Chen Mingda, Chen Xilun, Yih Wen-tau
- From Good To Great: Improving Math Reasoning With Tool-augmented Interleaf Prompting Chen Nuo, Li Hongguang, Wang Baoyuan, Li Jia
- Breaking Language Barriers In Multilingual Mathematical Reasoning: Insights And Observations Chen Nuo, Zheng Zinan, Wu Ning, Gong Ming, Song Yangqiu, Zhang Dongmei, Li Jia
- Dynamic Strategy Chain: Dynamic Zero-shot Cot For Long Mental Health Support Generation Chen Qi, Liu Dexi
- Evaluation Of Chatgpt Family Of Models For Biomedical Reasoning And Classification Chen Shan, Li Yingya, Lu Sheng, Van Hoang, Aerts Hugo Jwl, Savova Guergana K., Bitterman Danielle S.
- Evaluating Factual Consistency Of Summaries With Large Language Models Chen Shiqi, Gao Siyang, He Junxian
- Chatpipe: Orchestrating Data Preparation Program By Optimizing Human-chatgpt Interactions Chen Sibei, Liu Hanbing, Jin Weiting, Sun Xiangyu, Feng Xiaoyao, Fan Ju, Du Xiaoyong, Tang Nan
- Llm-empowered Chatbots For Psychiatrist And Patient Simulation: Application And Evaluation Chen Siyuan, Wu Mengyue, Zhu Kenny Q., Lan Kunyao, Zhang Zhiling, Cui Lyuchun
- Theoremqa: A Theorem-driven Question Answering Dataset Chen Wenhu, Yin Ming, Ku Max, Lu Pan, Wan Yixin, Ma Xueguang, Xu Jianyu, Wang Xinyi, Xia Tony
- Challenges And Contributing Factors In The Utilization Of Large Language Models (llms) Chen Xiaoliang, Li Liangbin, Chang Le, Huang Yunhe, Zhao Yuxuan, Zhang Yuxiao, Li Dinuo
- The Janus Interface: How Fine-tuning In Large Language Models Amplifies The Privacy Risks Chen Xiaoyi, Tang Siyuan, Zhu Rui, Yan Shijun, Jin Lei, Wang Zihao, Su Liya, Zhang Zhikun, Wang Xiaofeng, Tang Haixu
- Asking Before Acting: Gather Information In Embodied Decision Making With Language Models Chen Xiaoyu, Zhang Shenao, Zhang Pushi, Zhao Li, Chen Jianyu
- Autoeval-video: An Automatic Benchmark For Assessing Large Vision Language Models In Open-ended Video Question Answering Chen Xiuyuan, Lin Yuan, Zhang Yuchen, Huang Weiran
- Do Models Explain Themselves? Counterfactual Simulatability Of Natural Language Explanations Chen Yanda, Zhong Ruiqi, Ri Narutatsu, Zhao Chen, He He, Steinhardt Jacob, Yu Zhou, Mckeown Kathleen
- Translation And Fusion Improves Zero-shot Cross-lingual Information Extraction Chen Yang, Shah Vedaant, Ritter Alan
- EE-LLM: Large-scale Training And Inference Of Early-exit Large Language Models With 3D Parallelism Chen Yanxi, Pan Xuchen, Li Yaliang, Ding Bolin, Zhou Jingren
- Bianque: Balancing The Questioning And Suggestion Ability Of Health Llms With Multi-turn Health Conversations Polished By Chatgpt Chen Yirong, Wang Zhenyu, Xing Xiaofen, Zheng Huimin, Xu Zhipei, Fang Kai, Wang Junhong, Li Sihang, Wu Jieling, Liu Qi, Xu Xiangmin
- Prompt To GPT-3: Step-by-step Thinking Instructions For Humor Generation Chen Yuetian, Shi Bowen, Si Mei
- Fortify The Shortest Stave In Attention: Enhancing Context Awareness Of Large Language Models For Effective Tool Use Chen Yuhan, Lv Ang, Lin Ting-en, Chen Changyu, Wu Yuchuan, Huang Fei, Li Yongbin, Yan Rui
- Assessing The Impact Of Prompting Methods On Chatgpt's Mathematical Capabilities Chen Yuhao, Wong Chloe, Yang Hanwen, Aguenza Juan, Bhujangari Sai, Vu Benthan, Lei Xun, Prasad Amisha, Fluss Manny, Phuong Eric, Liu Minghao, Kumar Raja, Vats Vanshika, Davis James
- Token Prediction As Implicit Classification To Identify Llm-generated Text Chen Yutian, Kang Hao, Zhai Vivian, Li Liangze, Singh Rita, Raj Bhiksha
- MEDITRON-70B: Scaling Medical Pretraining For Large Language Models Chen Zeming, Cano Alejandro Hernández, Romanou Angelika, Bonnet Antoine, Matoba Kyle, Salvi Francesco, Pagliardini Matteo, Fan Simin, Köpf Andreas, Mohtashami Amirkeivan, Sallinen Alexandre, Sakhaeirad Alireza, Swamy Vinitra, Krawczuk Igor, Bayazit Deniz, Marmet Axel, Montariol Syrielle, Hartley Mary-anne, Jaggi Martin, Bosselut Antoine
- Chatcot: Tool-augmented Chain-of-thought Reasoning On Chat-based Large Language Models Chen Zhipeng, Zhou Kun, Zhang Beichen, Gong Zheng, Zhao Wayne Xin, Wen Ji-rong
- Lmexplainer: Grounding Knowledge And Explaining Language Models Chen Zichen, Chen Jianda, Chen Yuanyuan, Yu Han, Singh Ambuj K, Sra Misha
- Cascade Speculative Drafting For Even Faster LLM Inference Chen Ziyi, Yang Xiaocong, Lin Jiacheng, Sun Chenkai, Chang Kevin Chen-chuan, Huang Jie
- Visual Chatgpt: Talking, Drawing And Editing With Visual Foundation Models Chenfei Wu, Shengming Yin, Weizhen Qi, Xiaodong Wang, Zecheng Tang, Nan Duan
- UPRISE: Universal Prompt Retrieval For Improving Zero-shot Evaluation Cheng Daixuan, Huang Shaohan, Bi Junyu, Zhan Yuefeng, Liu Jianfeng, Wang Yujing, Sun Hao, Wei Furu, Deng Denvy, Zhang Qi
- Black-box Prompt Optimization: Aligning Large Language Models Without Model Training Cheng Jiale, Liu Xiao, Zheng Kehan, Ke Pei, Wang Hongning, Dong Yuxiao, Tang Jie, Huang Minlie
- Egothink: Evaluating First-person Perspective Thinking Capability Of Vision-language Models Cheng Sijie, Guo Zhicheng, Wu Jingwen, Fang Kechen, Li Peng, Liu Huaping, Liu Yang
- SCALE: Synergized Collaboration Of Asymmetric Language Translation Engines Cheng Xin, Wang Xun, Ge Tao, Chen Si-qing, Wei Furu, Zhao Dongyan, Yan Rui
- Batch Prompting: Efficient Inference With Large Language Model Apis Cheng Zhoujun, Kasai Jungo, Yu Tao
- Transformer Working Memory Enables Regular Language Reasoning And Natural Language Length Extrapolation Chi Ta-chung, Fan Ting-han, Rudnicky Alexander I., Ramadge Peter J.
- M²chat: Empowering VLM For Multimodal LLM Interleaved Text-image Generation Chi Xiaowei, Zhang Rongyu, Jiang Zhengkai, Liu Yijiang, Wang Yatian, Qi Xingqun, Luo Wenhan, Gao Peng, Zhang Shanghang, Liu Qifeng, Guo Yike
- INSTRUCTEVAL: Towards Holistic Evaluation Of Instruction-tuned Large Language Models Chia Yew Ken, Hong Pengfei, Bing Lidong, Poria Soujanya
- A Closer Look Into Automatic Evaluation Using Large Language Models Chiang Cheng-han, Lee Hung-yi
- Timebench: A Comprehensive Evaluation Of Temporal Reasoning Abilities In Large Language Models Chu Zheng, Chen Jingchang, Chen Qianglong, Yu Weijiang, Wang Haotian, Liu Ming, Qin Bing
- Tutorials On Stance Detection Using Pre-trained Language Models: Fine-tuning BERT And Prompting Large Language Models Chuang Yun-shiuan
- Long Story Short: A Summarize-then-search Method For Long Video Question Answering Chung Jiwan, Yu Youngjae
- LIMA: Less Is More For Alignment Chunting Zhou, Pengfei Liu, Puxin Xu, Srini Iyer, Jiao Sun, Yuning Mao, Xuezhe Ma, Avia Efrat, Ping Yu, Lili Yu, Susan Zhang, Gargi Ghosh, Mike Lewis, Luke Zettlemoyer, Omer Levy
- Safurai 001: New Qualitative Approach For Code LLM Evaluation Cifarelli Davide, Boiardi Leonardo, Puppo Alessandro
- Remember What You Did So You Know What To Do Next Ciosici Manuel R., Hedges Alex, Kankanampati Yash, Martin Justin, Freedman Marjorie, Weischedel Ralph
- Galactic Chitchat: Using Large Language Models To Converse With Astronomy Literature Ciucă Ioana, Ting Yuan-sen
- Harnessing The Power Of Adversarial Prompting And Large Language Models For Robust Hypothesis Generation In Astronomy Ciucă Ioana, Ting Yuan-sen, Kruk Sandor, Iyer Kartheik
- Cognitively Inspired Components For Social Conversational Agents Clay Alex, Alonso Eduardo, Mondragón Esther
- BERT Lost Patience Won't Be Robust To Adversarial Slowdown Coalson Zachary, Ritter Gabriel, Bobba Rakesh, Hong Sanghyun
- Inducing Anxiety In Large Language Models Increases Exploration And Bias Coda-forno Julian, Witte Kristin, Jagadish Akshay K., Binz Marcel, Akata Zeynep, Schulz Eric
- Evaluating Language Models For Mathematics Through Interactions Collins Katherine M., Jiang Albert Q., Frieder Simon, Wong Lionel, Zilka Miri, Bhatt Umang, Lukasiewicz Thomas, Wu Yuhuai, Tenenbaum Joshua B., Hart William, Gowers Timothy, Li Wenda, Weller Adrian, Jamnik Mateja
- Dr. Jekyll And Mr. Hyde: Two Faces Of Llms Collu Matteo Gioele, Janssen-groesbeek Tom, Koffas Stefanos, Conti Mauro, Picek Stjepan
- Prompting A Large Language Model To Generate Diverse Motivational Messages: A Comparison With Human-written Messages Cox Samuel Rhys, Abdul Ashraf, Ooi Wei Tsang
- Ultrafeedback: Boosting Language Models With Scaled AI Feedback Cui Ganqu, Yuan Lifan, Ding Ning, Yao Guanming, He Bingxiang, Zhu Wei, Ni Yuan, Xie Guotong, Xie Ruobing, Lin Yankai, Liu Zhiyuan, Sun Maosong
- Efficient And Effective Text Encoding For Chinese Llama And Alpaca Cui Yiming, Yang Ziqing, Yao Xin
- Auggpt: Leveraging Chatgpt For Text Data Augmentation Dai Haixing, Liu Zhengliang, Liao Wenxiong, Huang Xiaoke, Cao Yihan, Wu Zihao, Zhao Lin, Xu Shaochen, Liu Wei, Liu Ninghao, Li Sheng, Zhu Dajiang, Cai Hongmin, Sun Lichao, Li Quanzheng, Shen Dinggang, Liu Tianming, Li Xiang
- Llm-in-the-loop: Leveraging Large Language Model For Thematic Analysis Dai Shih-chieh, Xiong Aiping, Ku Lun-wei
- Choice Over Control: How Users Write With Large Language Models Using Diegetic And Non-diegetic Prompting Dang Hai, Goller Sven, Lehmann Florian, Buschek Daniel
- Flashattention-2: Faster Attention With Better Parallelism And Work Partitioning Dao Tri
- Performance Comparison Of Large Language Models On VNHSGE English Dataset: Openai Chatgpt, Microsoft Bing Chat, And Google Bard Dao Xuan-quy
- Chatgpt Is Good But Bing Chat Is Better For Vietnamese Students Dao Xuan-quy, Le Ngoc-bich
- Can Chatgpt Pass The Vietnamese National High School Graduation Examination? Dao Xuan-quy, Le Ngoc-bich, Phan Xuan-dung, Ngo Bac-bien
- Mask The Bias: Improving Domain-adaptive Generalization Of Ctc-based ASR With Internal Language Model Estimation Das Nilaksh, Sunkara Monica, Bodapati Sravan, Cai Jinglun, Kulshreshtha Devang, Farris Jeff, Kirchhoff Katrin
- Optimizing Distributed Training On Frontier For Large Language Models Dash Sajal, Lyngaas Isaac, Yin Junqi, Wang Xiao, Egele Romain, Cong Guojing, Wang Feiyi, Balaprakash Prasanna
- Complex QA And Language Models Hybrid Architectures, Survey Daull Xavier, Bellot Patrice, Bruno Emmanuel, Martin Vincent, Murisasco Elisabeth
- Testing GPT-4 With Wolfram Alpha And Code Interpreter Plug-ins On Math And Science Problems Davis Ernest, Aaronson Scott
- LLMR: Real-time Prompting Of Interactive Worlds Using Large Language Models De La Torre Fernanda, Fang Cathy Mengying, Huang Han, Banburski-fahey Andrzej, Fernandez Judith Amores, Lanier Jaron
- GPT-3 Models Are Few-shot Financial Reasoners De Padua Raul Salles, Qureshi Imran, Karakaplan Mustafa U.
- Improved Prompting And Process For Writing User Personas With Llms, Using Qualitative Interviews: Capturing Behaviour And Personality Traits Of Users De Paoli Stefano
- Writing User Personas With Large Language Models: Testing Phase 6 Of A Thematic Analysis Of Semi-structured Interviews De Paoli Stefano
- Facechat: An Emotion-aware Face-to-face Dialogue Framework Deema Alnuhait, Qingyang Wu, Zhou Yu
- Skipdecode: Autoregressive Skip Decoding With Batching And Caching For Efficient LLM Inference Del Corro Luciano, Del Giorno Allie, Agarwal Sahaj, Yu Bin, Awadallah Ahmed, Mukherjee Subhabrata
- Fairness Of Chatgpt And The Role Of Explainable-guided Prompts Deldjoo Yashar
- Investigating Data Contamination In Modern Benchmarks For Large Language Models Deng Chunyuan, Zhao Yilun, Tang Xiangru, Gerstein Mark, Cohan Arman
- Masterkey: Automated Jailbreak Across Multiple Large Language Model Chatbots Deng Gelei, Liu Yi, Li Yuekang, Wang Kailong, Zhang Ying, Li Zefeng, Wang Haoyu, Zhang Tianwei, Liu Yang
- Prompting And Evaluating Large Language Models For Proactive Dialogues: Clarification, Target-guided, And Non-collaboration Deng Yang, Liao Lizi, Chen Liang, Wang Hongru, Lei Wenqiang, Chua Tat-seng
- Multilingual Jailbreak Challenges In Large Language Models Deng Yue, Zhang Wenxuan, Pan Sinno Jialin, Bing Lidong
- Early Chatgpt User Portrait Through The Lens Of Data Deng Yuyang, Zhao Ni, Huang Xin
- Can Chatgpt Read Who You Are? Derner Erik, Kučera Dalibor, Oliver Nuria, Zahálka Jan
- Toxicity In Chatgpt: Analyzing Persona-assigned Language Models Deshpande Ameet, Murahari Vishvak, Rajpurohit Tanmay, Kalyan Ashwin, Narasimhan Karthik
- Chatgpt For GTFS: Benchmarking Llms On GTFS Understanding And Retrieval Devunuri Saipraneeth, Qiam Shirin, Lehe Lewis
- Chatgpt Asks, BLIP-2 Answers: Automatic Questioning Towards Enriched Visual Descriptions Deyao Zhu, Jun Chen, Kilichbek Haydarov, Xiaoqian Shen, Wenxuan Zhang, Mohamed Elhoseiny
- Minigpt-4: Enhancing Vision-language Understanding With Advanced Large Language Models Deyao Zhu, Jun Chen, Xiaoqian Shen, Xiang Li, Mohamed Elhoseiny
- Evaluating Chatgpt As A Recommender System: A Rigorous Approach Di Palma Dario, Biancofiore Giovanni Maria, Anelli Vito Walter, Narducci Fedelucio, Di Noia Tommaso, Di Sciascio Eugenio
- Can Large Language Models Assist In Hazard Analysis? Diemert Simon, Weber Jens H
- Jump To Conclusions: Short-cutting Transformers With Linear Transformations Din Alexander Yom, Karidi Taelin, Choshen Leshem, Geva Mor
- Using Large Language Model To Solve And Explain Physics Word Problems Approaching Human Level Ding Jingzhe, Cen Yan, Wei Xinyuan
- Enhancing Chat Language Models By Scaling High-quality Instructional Conversations Ding Ning, Chen Yulin, Xu Bokai, Qin Yujia, Zheng Zhi, Hu Shengding, Liu Zhiyuan, Sun Maosong, Zhou Bowen
- A Wolf In Sheep's Clothing: Generalized Nested Jailbreak Prompts Can Fool Large Language Models Easily Ding Peng, Kuang Jun, Ma Dan, Cao Xuezhi, Xian Yunsen, Chen Jiajun, Huang Shujian
- Self-agreement: A Framework For Fine-tuning Language Models To Find Agreement Among Diverse Opinions Ding Shiyao, Ito Takayuki
- HPC-GPT: Integrating Large Language Model For High-performance Computing Ding Xianzhong, Chen Le, Emani Murali, Liao Chunhua, Lin Pei-hung, Vanderbruggen Tristan, Xie Zhen, Cerpa Alberto E., Du Wan
- Efficient Finetuning Large Language Models For Vietnamese Chatbot Doan Vu-thuan, Truong Quoc-truong, Nguyen Duc-vu, Nguyen Vinh-tiep, Luu Thuy-ngan Nguyen
- Establishing Performance Baselines In Fine-tuning, Retrieval-augmented Generation And Soft-prompting For Non-specialist LLM Users Dodgson Jennifer, Nanzheng Lin, Peh Julian, Pattirane Akira Rafhael Janson, Alhajir Alfath Daryl, Dinarto Eko Ridho, Lim Joseph, Ahmad Syed Danyal
- Block-wise Bit-compression Of Transformer-based Models Dong Gaochen, Chen Wei
- Blockwise Compression Of Transformer-based Models Without Retraining Dong Gaochen, Chen Wei
- Large Language Model For Science: A Study On P Vs. NP Dong Qingxiu, Dong Li, Xu Ke, Zhou Guangyan, Hao Yaru, Sui Zhifang, Wei Furu
- Speechgpt: Empowering Large Language Models With Intrinsic Cross-modal Conversational Abilities Dong Zhang, Shimin Li, Xin Zhang, Jun Zhan, Pengyu Wang, Yaqian Zhou, Xipeng Qiu
- A Comparative Study Of Ai-generated (GPT-4) And Human-crafted Mcqs In Programming Education Doughty Jacob, Wan Zipiao, Bompelli Anishka, Qayum Jubahed, Wang Taozhi, Zhang Juran, Zheng Yujia, Doyle Aidan, Sridhar Pragnya, Agarwal Arav, Bogart Christopher, Keylor Eric, Kultur Can, Savelka Jaromir, Sakr Majd
- Large Language Models Douglas Michael R.
- Classeval: A Manually-crafted Benchmark For Evaluating Llms On Class-level Code Generation Du Xueying, Liu Mingwei, Wang Kaixin, Wang Hanlin, Liu Junwei, Chen Yixuan, Feng Jiayi, Sha Chaofeng, Peng Xin, Lou Yiling
- Lauragpt: Listen, Attend, Understand, And Regenerate Audio With GPT Du Zhihao, Wang Jiaming, Chen Qian, Chu Yunfei, Gao Zhifu, Li Zerui, Hu Kai, Zhou Xiaohuan, Xu Jin, Ma Ziyang, Wang Wen, Zheng Siqi, Zhou Chang, Yan Zhijie, Zhang Shiliang
- Botchat: Evaluating Llms' Capabilities Of Having Multi-turn Dialogues Duan Haodong, Wei Jueqi, Wang Chonghua, Liu Hongwei, Fang Yixiao, Zhang Songyang, Lin Dahua, Chen Kai
- Alpacafarm: A Simulation Framework For Methods That Learn From Human Feedback Dubois Yann, Li Xuechen, Taori Rohan, Zhang Tianyi, Gulrajani Ishaan, Ba Jimmy, Guestrin Carlos, Liang Percy, Hashimoto Tatsunori B.
- Frugal Lms Trained To Invoke Symbolic Solvers Achieve Parameter-efficient Arithmetic Reasoning Dutta Subhabrata, Singh Joykirat, Pandey Ishan, Manchanda Sunny, Chakrabarti Soumen, Chakraborty Tanmoy
- A Comparative Analysis Of Large Language Models For Code Documentation Generation Dvivedi Shubhang Shekhar, Vijay Vyshnav, Pujari Sai Leela Rahul, Lodh Shoumik, Kumar Dhruv
- Faith And Fate: Limits Of Transformers On Compositionality Dziri Nouha, Lu Ximing, Sclar Melanie, Li Xiang Lorraine, Jiang Liwei, Lin Bill Yuchen, West Peter, Bhagavatula Chandra, Bras Ronan Le, Hwang Jena D., Sanyal Soumya, Welleck Sean, Ren Xiang, Ettinger Allyson, Harchaoui Zaid, Choi Yejin
- GPT-SW3: An Autoregressive Language Model For The Nordic Languages Ekgren Ariel, Gyllensten Amaru Cuba, Stollenwerk Felix, Öhman Joey, Isbister Tim, Gogoulou Evangelia, Carlsson Fredrik, Heiman Alice, Casademont Judit, Sahlgren Magnus
- Measuring And Improving Attentiveness To Partial Inputs With Counterfactuals Elazar Yanai, Paranjape Bhargavi, Peng Hao, Wiegreffe Sarah, Raghavi Khyathi, Srikumar Vivek, Singh Sameer, Smith Noah A.
- Tinystories: How Small Can Language Models Be And Still Speak Coherent English? Eldan Ronen, Li Yuanzhi
- A Comprehensive Performance Study Of Large Language Models On Novel AI Accelerators Emani Murali, Foreman Sam, Sastry Varuni, Xie Zhen, Raskar Siddhisanket, Arnold William, Thakur Rajeev, Vishwanath Venkatram, Papka Michael E.
- Textgenshap: Scalable Post-hoc Explanations In Text Generation With Long Documents Enouen James, Nakhost Hootan, Ebrahimi Sayna, Arik Sercan O, Liu Yan, Pfister Tomas
- GPT-3.5, GPT-4, Or BARD? Evaluating Llms Reasoning Ability In Zero-shot Setting And Performance Boosting Through Prompts Espejel Jessica López, Ettifouri El Hassane, Alassan Mahaman Sanoussi Yahaya, Chouham El Mehdi, Dahhane Walid
- Chatgpt Outperforms Crowd-workers For Text-annotation Tasks Fabrizio Gilardi, Meysam Alizadeh, Maël Kubli
- Lm-polygraph: Uncertainty Estimation For Language Models Fadeeva Ekaterina, Vashurin Roman, Tsvigun Akim, Vazhentsev Artem, Petrakov Sergey, Fedyanin Kirill, Vasilev Daniil, Goncharova Elizaveta, Panchenko Alexander, Panov Maxim, Baldwin Timothy, Shelmanov Artem
- Perspectives On Large Language Models For Relevance Judgment Faggioli Guglielmo, Dietz Laura, Clarke Charles, Demartini Gianluca, Hagen Matthias, Hauff Claudia, Kando Noriko, Kanoulas Evangelos, Potthast Martin, Stein Benno, Wachsmuth Henning
- Exploring The Potential Of Large Language Models In Generating Code-tracing Questions For Introductory Programming Courses Fan Aysa Xuemo, Zhang Ranran Haoran, Paquette Luc, Zhang Rui
- Uncovering The Potential Of Chatgpt For Discourse Analysis In Dialogue: An Empirical Study Fan Yaxin, Jiang Feng, Li Peifeng, Li Haizhou
- Using GPT-4 To Augment Unbalanced Data For Automatic Scoring Fang Luyang, Lee Gyeong-geon, Zhai Xiaoming
- Instructseq: Unifying Vision Tasks With Instruction-conditioned Multi-modal Sequence Generation Fang Rongyao, Yan Shilin, Huang Zhaoyang, Zhou Jingqiu, Tian Hao, Dai Jifeng, Li Hongsheng
- Chatgpt As Data Augmentation For Compositional Generalization: A Case Study In Open Intent Detection Fang Yihao, Li Xianzhi, Thomas Stephen W., Zhu Xiaodan
- An Empirical Study Of Multitask Learning To Improve Open Domain Dialogue Systems Farahani Mehrdad, Johansson Richard
- Tooltalk: Evaluating Tool-usage In A Conversational Setting Farn Nicholas, Shin Richard
- Chatgpt Performance On Standardized Testing Exam -- A Proposed Strategy For Learners Farooq Umer, Anwar Saira
- Friend Or Foe? Exploring The Implications Of Large Language Models On The Science System Fecher Benedikt, Hebing Marcel, Laufer Melissa, Pohle Jörg, Sofsky Fabian
- Trapping LLM Hallucinations Using Tagged Context Prompts Feldman Philip, Foulds James R., Pan Shimei
- Language Models Can Be Logical Solvers Feng Jiazhan, Xu Ruochen, Hao Junheng, Sharma Hiteshi, Shen Yelong, Zhao Dongyan, Chen Weizhu
- Chessgpt: Bridging Policy Learning And Language Modeling Feng Xidong, Luo Yicheng, Wang Ziyan, Tang Hongrui, Yang Mengyue, Shao Kun, Mguni David, Du Yali, Wang Jun
- Towards Llm-driven Dialogue State Tracking Feng Yujie, Lu Zexin, Liu Bo, Zhan Liming, Wu Xiao-ming
- Should Chatgpt Be Biased? Challenges And Risks Of Bias In Large Language Models Ferrara Emilio
- Exaranker: Explanation-augmented Neural Ranker Ferraretto Fernando, Laitz Thiago, Lotufo Roberto, Nogueira Rodrigo
- Context-aware Meta-learning Fifty Christopher, Duan Dennis, Junkins Ronald G., Amid Ehsan, Leskovec Jure, Re Christopher, Thrun Sebastian
- Leveraging Large Language Models For Automated Dialogue Analysis Finch Sarah E., Paek Ellie S., Choi Jinho D.
- Medalign: A Clinician-generated Dataset For Instruction Following With Electronic Medical Records Fleming Scott L., Lozano Alejandro, Haberkorn William J., Jindal Jenelle A., Reis Eduardo P., Thapa Rahul, Blankemeier Louis, Genkins Julian Z., Steinberg Ethan, Nayak Ashwin, Patel Birju S., Chiang Chia-chun, Callahan Alison, Huo Zepeng, Gatidis Sergios, Adams Scott J., Fayanju Oluseyi, Shah Shreya J., Savage Thomas, Goh Ethan, Chaudhari Akshay S., Aghaeepour Nima, Sharp Christopher, Pfeffer Michael A., Liang Percy, Chen Jonathan H., Morse Keith E., Brunskill Emma P., Fries Jason A., Shah Nigam H.
- Can Large Language Models Follow Concept Annotation Guidelines? A Case Study On Scientific And Financial Domains Fonseca Marcio, Cohen Shay B.
- Enhancing Pipeline-based Conversational Agents With Large Language Models Foosherian Mina, Purwins Hendrik, Rathnayake Purna, Alam Touhidul, Teimao Rui, Thoben Klaus-dieter
- A Review Of Chatgpt Applications In Education, Marketing, Software Engineering, And Healthcare: Benefits, Drawbacks, And Research Directions Fraiwan Mohammad, Khasawneh Natheer
- Sparsegpt: Massive Language Models Can Be Accurately Pruned In One-shot Frantar Elias, Alistarh Dan
- Chatgpt & Mechanical Engineering: Examining Performance On The FE Mechanical Engineering And Undergraduate Exams Frenkel Matthew, Emara Hebah
- A Challenger To GPT-4V? Early Explorations Of Gemini In Visual Expertise Fu Chaoyou, Zhang Renrui, Wang Zihan, Huang Yubo, Zhang Zhengye, Qiu Longtian, Ye Gaoxiang, Shen Yunhang, Zhang Mengdan, Chen Peixian, Zhao Sirui, Lin Shaohui, Jiang Deqiang, Yin Di, Gao Peng, Li Ke, Li Hongsheng, Sun Xing
- Gptscore: Evaluate As You Desire Fu Jinlan, Ng See-kiong, Jiang Zhengbao, Liu Pengfei
- Codeapex: A Bilingual Programming Evaluation Benchmark For Large Language Models Fu Lingyue, Chai Huacan, Luo Shuang, Du Kounianhua, Zhang Weiming, Fan Longteng, Lei Jiayi, Rui Renting, Lin Jianghao, Fang Yuchen, Liu Yifan, Wang Jingkuan, Qi Siyuan, Zhang Kangning, Zhang Weinan, Yu Yong
- Revisiting The Knowledge Injection Frameworks Fu Peng, Zhang Yiming, Wang Haobo, Qiu Weikang, Zhao Junbo
- Generate Then Select: Open-ended Visual Question Answering Guided By World Knowledge Fu Xingyu, Zhang Sheng, Kwon Gukyeong, Perera Pramuditha, Zhu Henghui, Zhang Yuhao, Li Alexander Hanbo, Wang William Yang, Wang Zhiguo, Castelli Vittorio, Ng Patrick, Roth Dan, Xiang Bing
- Chain-of-thought Hub: A Continuous Effort To Measure Large Language Models' Reasoning Performance Fu Yao, Ou Litu, Chen Mingyu, Wan Yuhao, Peng Hao, Khot Tushar
- Specializing Smaller Language Models Towards Multi-step Reasoning Fu Yao, Peng Hao, Ou Litu, Sabharwal Ashish, Khot Tushar
- Gpt4aigchip: Towards Next-generation AI Accelerator Design Automation Via Large Language Models Fu Yonggan, Zhang Yongan, Yu Zhongzhi, Li Sixu, Ye Zhifan, Li Chaojian, Wan Cheng, Lin Yingyan
- Safety Alignment In NLP Tasks: Weakly Aligned Summarization As An In-context Attack Fu Yu, Li Yufei, Xiao Wen, Liu Cong, Dong Yue
- Exposing Limitations Of Language Model Agents In Sequential-task Compositions On The Web Furuta Hiroki, Matsuo Yutaka, Faust Aleksandra, Gur Izzeddin
- Mitigating Hallucination In Large Multi-modal Models Via Robust Instruction Tuning Fuxiao Liu, Kevin Lin, Linjie Li, Jianfeng Wang, Yaser Yacoob, Lijuan Wang
- The Impact And Applications Of Chatgpt: A Systematic Review Of Literature Reviews Gabashvili Irene S.
- Cerbero-7b: A Leap Forward In Language-specific Llms Through Enhanced Chat Corpus Generation And Evaluation Galatolo Federico A., Cimino Mario G. C. A.
- Sensitivity And Robustness Of Large Language Models To Prompt Template In Japanese Text Classification Tasks Gan Chengguang, Mori Tatsunori
- Systematic Evaluation Of GPT-3 For Zero-shot Personality Estimation Ganesan Adithya V, Lal Yash Kumar, Nilsson August Håkan, Schwartz H. Andrew
- Assistgpt: A General Multi-modal Assistant That Can Plan, Execute, Inspect, And Learn Gao Difei, Ji Lei, Zhou Luowei, Lin Kevin Qinghong, Chen Joya, Fan Zihan, Shou Mike Zheng
- Benchmarking Large Language Models With Augmented Instructions For Fine-grained Information Extraction Gao Jun, Zhao Huan, Zhang Yice, Wang Wei, Yu Changlong, Xu Ruifeng
- Examining User-friendly And Open-sourced Large GPT Models: A Survey On Language, Multimodal, And Scientific GPT Models Gao Kaiyuan, He Sunan, He Zhenyu, Lin Jiacheng, Pei Qizhi, Shao Jie, Zhang Wei
- Llama-adapter V2: Parameter-efficient Visual Instruction Model Gao Peng, Han Jiaming, Zhang Renrui, Lin Ziyi, Geng Shijie, Zhou Aojun, Zhang Wei, Lu Pan, He Conghui, Yue Xiangyu, Li Hongsheng, Qiao Yu
- Empower Your Model With Longer And Better Context Comprehension Gao Yifei, Wang Lei, Fang Jun, Hu Longhua, Cheng Jun
- How To Design Translation Prompts For Chatgpt: An Empirical Study Gao Yuan, Wang Ruili, Hou Feng
- Real Customization Or Just Marketing: Are Customized Versions Of Chat GPT Useful? Garrido-merchán Eduardo C., Arroyo-barrigüete Jose L., Borrás-pala Francisco, Escobar-torres Leandro, De Ibarreta Carlos Martínez, Ortiz-lozano Jose María, Rua-vieites Antonio
- Simulating H.P. Lovecraft Horror Literature With The Chatgpt Large Language Model Garrido-merchán Eduardo C., Arroyo-barrigüete José Luis, Gozalo-brizuela Roberto
- Text Encoders Lack Knowledge: Leveraging Generative Llms For Domain-specific Semantic Textual Similarity Gatto Joseph, Sharif Omar, Seegmiller Parker, Bohlman Philip, Preum Sarah Masud
- Building Trustworthy Neurosymbolic AI Systems: Consistency, Reliability, Explainability, And Safety Gaur Manas, Sheth Amit
- Reasoning In Large Language Models Through Symbolic Math Word Problems Gaur Vedant, Saunshi Nikunj
- Mllm-bench: Evaluating Multimodal Llms With Per-sample Criteria Ge Wentao, Chen Shunian, Chen Guiming Hardy, Chen Zhihong, Chen Junying, Yan Shuo, Zhu Chenghao, Lin Ziyue, Xie Wenya, Zhang Xinyi, Chai Yichen, Liu Xiaoyu, Song Dingjie, Wang Xidong, Gao Anningzhe, Zhang Zhiyi, Li Jianquan, Wan Xiang, Wang Benyou
- Making Llama SEE And Draw With SEED Tokenizer Ge Yuying, Zhao Sijie, Zeng Ziyun, Ge Yixiao, Li Chen, Wang Xintao, Shan Ying
- UPAR: A Kantian-inspired Prompting Framework For Enhancing Large Language Model Capabilities Geng Hejia, Xu Boxun, Li Peng
- Flacuna: Unleashing The Problem Solving Power Of Vicuna Using FLAN Fine-tuning Ghosal Deepanway, Chia Yew Ken, Majumder Navonil, Poria Soujanya
- Chatgpt Perpetuates Gender Bias In Machine Translation And Ignores Non-gendered Pronouns: Findings Across Bengali And Five Other Low-resource Languages Ghosh Sourojit, Caliskan Aylin
- Gpt-based Models Meet Simulation: How To Efficiently Use Large-scale Pre-trained Language Models Across Simulation Tasks Giabbanelli Philippe J.
- Semantic Compression With Large Language Models Gilbert Henry, Sandborn Michael, Schmidt Douglas C., Spencer-smith Jesse, White Jules
- Transformative Effects Of Chatgpt On Modern Education: Emerging Era Of AI Chatbots Gill Sukhpal Singh, Xu Minxian, Patros Panos, Wu Huaming, Kaur Rupinder, Kaur Kamalpreet, Fuller Stephanie, Singh Manmeet, Arora Priyansh, Parlikad Ajith Kumar, Stankovski Vlado, Abraham Ajith, Ghosh Soumya K., Lutfiyya Hanan, Kanhere Salil S., Bahsoon Rami, Rana Omer, Dustdar Schahram, Sakellariou Rizos, Uhlig Steve, Buyya Rajkumar
- Wanglab At Mediqa-chat 2023: Clinical Note Generation From Doctor-patient Conversations Using Large Language Models Giorgi John, Toma Augustin, Xie Ronald, Chen Sondra S., An Kevin R., Zheng Grace X., Wang Bo
- Mtuncertainty: Assessing The Need For Post-editing Of Machine Translation Outputs By Fine-tuning Openai Llms Gladkoff Serge, Han Lifeng, Erofeev Gleb, Sorokina Irina, Nenadic Goran
- Decoding Stumpers: Large Language Models Vs. Human Problem-solvers Goldstein Alon, Havin Miriam, Reichart Roi, Goldstein Ariel
- Can Llms Capture Human Preferences? Goli Ali, Singh Amandeep
- Evaluating The Potential Of Leading Large Language Models In Reasoning Biology Questions Gong Xinyu, Holmes Jason, Li Yiwei, Liu Zhengliang, Gan Qi, Wu Zihao, Zhang Jianli, Zou Yusong, Teng Yuxi, Jiang Tian, Zhu Hongtu, Liu Wei, Liu Tianming, Yan Yajun
- Figstep: Jailbreaking Large Vision-language Models Via Typographic Visual Prompts Gong Yichen, Ran Delong, Liu Jinyuan, Wang Conglei, Cong Tianshuo, Wang Anyu, Duan Sisi, Wang Xiaoyun
- Bloomvqa: Assessing Hierarchical Multi-modal Comprehension Gong Yunye, Shrestha Robik, Claypoole Jared, Cogswell Michael, Ray Arijit, Kanan Christopher, Divakaran Ajay
- Improving Input-label Mapping With Demonstration Replay For In-context Learning Gong Zhuocheng, Liu Jiahao, Wang Qifan, Wang Jingang, Cai Xunliang, Zhao Dongyan, Yan Rui
- Using Large Language Models For Interpreting Autonomous Robots Behaviors González-santamarta Miguel A., Fernández-becerra Laura, Sobrín-hidalgo David, Guerrero-higueras Ángel Manuel, González Irene, Lera Francisco J. Rodríguez
- Chatgpt Is Not All You Need. A State Of The Art Review Of Large Generative AI Models Gozalo-brizuela Roberto, Garrido-merchan Eduardo C.
- Not What You've Signed Up For: Compromising Real-world Llm-integrated Applications With Indirect Prompt Injection Greshake Kai, Abdelnabi Sahar, Mishra Shailesh, Endres Christoph, Holz Thorsten, Fritz Mario
- Large Language Models Are Zero-shot Time Series Forecasters Gruver Nate, Finzi Marc, Qiu Shikai, Wilson Andrew Gordon
- Linguistically Informed Chatgpt Prompts To Enhance Japanese-chinese Machine Translation: A Case Study On Attributive Clauses Gu Wenshi
- Language Models Hallucinate, But May Excel At Fact Verification Guan Jian, Dodge Jesse, Wadden David, Huang Minlie, Peng Hao
- Leveraging Pre-trained Large Language Models To Construct And Utilize World Models For Model-based Task Planning Guan Lin, Valmeekam Karthik, Sreedharan Sarath, Kambhampati Subbarao
- Voyager: An Open-ended Embodied Agent With Large Language Models Guanzhi Wang, Yuqi Xie, Yunfan Jiang, Ajay Mandlekar, Chaowei Xiao, Yuke Zhu, Linxi Fan, Anima Anandkumar
- The False Promise Of Imitating Proprietary Llms Gudibande Arnav, Wallace Eric, Snell Charlie, Geng Xinyang, Liu Hao, Abbeel Pieter, Levine Sergey, Song Dawn
- Hallucinations In Large Multilingual Translation Models Guerreiro Nuno M., Alves Duarte, Waldendorf Jonas, Haddow Barry, Birch Alexandra, Colombo Pierre, Martins André F. T.
- SPT: Fine-tuning Transformer-based Language Models Efficiently With Sparsification Gui Yuntao, Yan Xiao, Yin Peiqi, Yang Han, Cheng James
- Efficacy Of Machine-generated Instructions Gulati Samaksh, Verma Anshit, Parmar Manoj, Chaudhary Palash
- Connecting Large Language Models With Evolutionary Algorithms Yields Powerful Prompt Optimizers Guo Qingyan, Wang Rui, Guo Junliang, Li Bei, Song Kaitao, Tan Xu, Liu Guoqing, Bian Jiang, Yang Yujiu
- What Can Large Language Models Do In Chemistry? A Comprehensive Benchmark On Eight Tasks Guo Taicheng, Guo Kehan, Nan Bozhao, Liang Zhenwen, Guo Zhichun, Chawla Nitesh V., Wiest Olaf, Zhang Xiangliang
- PPTC Benchmark: Evaluating Large Language Models For Powerpoint Task Completion Guo Yiduo, Zhang Zekai, Liang Yaobo, Zhao Dongyan, Duan Nan
- Arthmodel: Enhance Arithmetic Skills To Large Language Model Guo Yingdi
- Gemrec: Towards Generative Model Recommendation Guo Yuanhe, Liu Haoming, Wen Hongyi
- Improving Small Language Models On Pubmedqa Via Generative Data Augmentation Guo Zhen, Wang Peiqi, Wang Yanwei, Yu Shangdi
- Editing Common Sense In Transformers Gupta Anshita, Mondal Debanjan, Sheshadri Akshay Krishna, Zhao Wenlong, Li Xiang Lorraine, Wiegreffe Sarah, Tandon Niket
- Bias Runs Deep: Implicit Reasoning Biases In Persona-assigned Llms Gupta Shashank, Shrivastava Vaishnavi, Deshpande Ameet, Kalyan Ashwin, Clark Peter, Sabharwal Ashish, Khot Tushar
- A Survey Of Vision-language Pre-training From The Lens Of Multimodal Machine Translation Gwinnup Jeremy, Duh Kevin
- Regulating Chatgpt And Other Large Generative AI Models Hacker Philipp, Engel Andreas, Mauer Marco
- Is GPT-4 A Reliable Rater? Evaluating Consistency In GPT-4 Text Ratings Hackl Veronika, Müller Alexandra Elena, Granitzer Michael, Sailer Maximilian
- Are Large Language Model-based Evaluators The Solution To Scaling Up Multilingual Evaluation? Hada Rishav, Gumma Varun, De Wynter Adrian, Diddee Harshita, Ahmed Mohamed, Choudhury Monojit, Bali Kalika, Sitaram Sunayana
- Seeing Chatgpt Through Students' Eyes: An Analysis Of Tiktok Data Haensch Anna-carolina, Ball Sarah, Herklotz Markus, Kreuter Frauke
- Deception Abilities Emerged In Large Language Models Hagendorff Thilo
- Human-like Intuitive Behavior And Reasoning Biases Emerged In Language Models -- And Disappeared In GPT-4 Hagendorff Thilo, Fabi Sarah
- Images In Language Space: Exploring The Suitability Of Large Language Models For Vision & Language Tasks Hakimov Sherzod, Schlangen David
- Opiniongpt: Modelling Explicit Biases In Instruction-tuned Llms Haller Patrick, Aynetdinov Ansar, Akbik Alan
- Evaluation Of AI Chatbots For Patient-specific EHR Questions Hamidi Alaleh, Roberts Kirk
- Potential Benefits Of Employing Large Language Models In Research In Moral Education And Development Han Hyemin
- Cheddar: Student-chatgpt Dialogue In EFL Writing Education Han Jieun, Yoo Haneul, Myung Junho, Kim Minsun, Lee Tak Yeon, Ahn So-yeon, Oh Alice
- Pive: Prompting With Iterative Verification Improving Graph-based Generative Capability Of Llms Han Jiuzhou, Collier Nigel, Buntine Wray, Shareghi Ehsan
- Medalpaca -- An Open-source Collection Of Medical Conversational AI Models And Training Data Han Tianyu, Adams Lisa C., Papaioannou Jens-michalis, Grundmann Paul, Oberhauser Tom, Löser Alexander, Truhn Daniel, Bressem Keno K.
- David Helps Goliath: Inference-time Collaboration Between Small Specialized And Large General Diffusion Lms Han Xiaochuang, Kumar Sachin, Tsvetkov Yulia, Ghazvininejad Marjan
- Chartllama: A Multimodal LLM For Chart Understanding And Generation Han Yucheng, Zhang Chi, Chen Xin, Yang Xu, Wang Zhibin, Yu Gang, Fu Bin, Zhang Hanwang
- Personalisation Within Bounds: A Risk Taxonomy And Policy Framework For The Alignment Of Large Language Models With Personalised Feedback Hannah Rose Kirk, Bertie Vidgen, Paul Röttger, Scott A. Hale
- Reasoning Implicit Sentiment With Chain-of-thought Prompting Hao Fei, Bobo Li, Qian Liu, Lidong Bing, Fei Li, Tat-seng Chua
- Toolkengpt: Augmenting Frozen Language Models With Massive Tools Via Tool Embeddings Hao Shibo, Liu Tianyang, Wang Zhen, Hu Zhiting
- Visual Instruction Tuning Haotian Liu, Chunyuan Li, Qingyang Wu, Yong Jae Lee
- Idealgpt: Iteratively Decomposing Vision And Language Reasoning Via Large Language Models Haoxuan You, Rui Sun, Zhecan Wang, Long Chen, Gengyu Wang, Hammad A. Ayyubi, Kai-wei Chang, Shih-fu Chang
- Test-time Training On Nearest Neighbors For Large Language Models Hardt Moritz, Sun Yu
- Herd: Using Multiple, Smaller Llms To Match The Performances Of Proprietary, Large Llms Via An Intelligent Composer Hari Surya Narayanan, Thomson Matt
- Analyzing The Performance Of Chatgpt In Cardiology And Vascular Pathologies Hariri Walid
- Unlocking The Potential Of Chatgpt: A Comprehensive Exploration Of Its Applications, Advantages, Limitations, And Future Directions In Natural Language Processing Hariri Walid
- The Political Ideology Of Conversational AI: Converging Evidence On Chatgpt's Pro-environmental, Left-libertarian Orientation Hartmann Jochen, Schwenzow Jasper, Witte Maximilian
- CHAI-DT: A Framework For Prompting Conversational Generative AI Agents To Actively Participate In Co-creation Harwood Brandon
- Zero- And Few-shot Prompting With Llms: A Comparative Study With Fine-tuned Models For Bangla Sentiment Analysis Hasan Md. Arid, Das Shudipta, Anjum Afiyat, Alam Firoj, Anjum Anika, Sarker Avijit, Noori Sheak Rashed Haider
- Chatgpt As Your Personal Data Scientist Hassan Md Mahadi, Knipper Alex, Santu Shubhra Kanti Karmaker
- Multilingual Language Models Are Not Multicultural: A Case Study In Emotion Havaldar Shreya, Rai Sunny, Singhal Bhumika, Liu Langchen, Guntuku Sharath Chandra, Ungar Lyle
- Spear Phishing With Large Language Models Hazell Julian
- Linear Latent World Models In Simple Transformers: A Case Study On Othello-gpt Hazineh Dean S., Zhang Zechen, Chiu Jeffery
- Wanjuan: A Comprehensive Multimodal Dataset For Advancing English And Chinese Large Models He Conghui, Jin Zhenjiang, Xu Chao, Qiu Jiantao, Wang Bin, Li Wei, Yan Hang, Wang Jiaqi, Lin Dahua
- ICL-D3IE: In-context Learning With Diverse Demonstrations Updating For Document Information Extraction He Jiabang, Wang Lei, Hu Yi, Liu Ning, Liu Hui, Xu Xing, Shen Heng Tao
- Wordart Designer: User-driven Artistic Typography Synthesis Using Large Language Models He Jun-yan, Cheng Zhi-qi, Li Chenyang, Sun Jingdong, Xiang Wangmeng, Lin Xianhui, Kang Xiaoyang, Jin Zengke, Hu Yusen, Luo Bin, Geng Yifeng, Xie Xuansong, Zhou Jingren
- You Only Prompt Once: On The Capabilities Of Prompt Learning On Large Language Models To Tackle Toxic Content He Xinlei, Zannettou Savvas, Shen Yun, Zhang Yang
- Using Natural Language Explanations To Improve Robustness Of In-context Learning He Xuanli, Wu Yuxiang, Camburu Oana-maria, Minervini Pasquale, Stenetorp Pontus
- An Automatically Discovered Chain-of-thought Prompt Generalizes To Novel Models And Datasets Hebenstreit Konstantin, Praas Robert, Kiesewetter Louis P, Samwald Matthias
- Chatgpt For Zero-shot Dialogue State Tracking: A Solution Or An Opportunity? Heck Michael, Lubis Nurul, Ruppik Benjamin, Vukovic Renato, Feng Shutong, Geishauser Christian, Lin Hsien-chin, Van Niekerk Carel, Gašić Milica
- Exploring The Responses Of Large Language Models To Beginner Programmers' Help Requests Hellas Arto, Leinonen Juho, Sarsa Sami, Koutcheme Charles, Kujanpää Lilja, Sorva Juha
- Simplemtod: A Simple Language Model For Multimodal Task-oriented Dialogue With Symbolic Scene Representation Hemanthage Bhathiya, Dondrup Christian, Bartie Phil, Lemon Oliver
- How Good Are GPT Models At Machine Translation? A Comprehensive Evaluation Hendy Amr, Abdelrehim Mohamed, Sharaf Amr, Raunak Vikas, Gabr Mohamed, Matsushita Hitokazu, Kim Young Jin, Afify Mohamed, Awadalla Hany Hassan
- Can Llms Grade Short-answer Reading Comprehension Questions: An Empirical Study With A Novel Dataset Henkel Owen, Hills Libby, Roberts Bill, Mcgrane Joshua
- Large Language Models Are Biased To Overestimate Profoundness Herrera-berg Eugenio, Browne Tomás Vergara, León-villagrá Pablo, Vives Marc-lluís, Calderon Cristian Buc
- Backpack Language Models Hewitt John, Thickstun John, Manning Christopher D., Liang Percy
- Assessing The Efficacy Of Large Language Models In Generating Accurate Teacher Responses Hicke Yann, Masand Abhishek, Guo Wentao, Gangavarapu Tushaar
- Newsgpt: Chatgpt Integration For Robot-reporter Hireche Abdelhadi, Belkacem Abdelkader Nasreddine, Jamil Sadia, Chen Chao
- Comparative Analysis Of GPT-4 And Human Graders In Evaluating Praise Given To Students In Synthetic Dialogues Hirunyasiri Dollaya, Thomas Danielle R., Lin Jionghao, Koedinger Kenneth R., Aleven Vincent
- Response: Emergent Analogical Reasoning In Large Language Models Hodel Damian, West Jevin
- Evaluating Large Language Models On A Highly-specialized Topic, Radiation Oncology Physics Holmes Jason, Liu Zhengliang, Zhang Lian, Ding Yuzhen, Sio Terence T., Mcgee Lisa A., Ashman Jonathan B., Li Xiang, Liu Tianming, Shen Jiajian, Liu Wei
- DP-OPT: Make Large Language Model Your Privacy-preserving Prompt Engineer Hong Junyuan, Wang Jiachen T., Zhang Chenhui, Li Zhangheng, Li Bo, Wang Zhangyang
- Cogagent: A Visual Language Model For GUI Agents Hong Wenyi, Wang Weihan, Lv Qingsong, Xu Jiazheng, Yu Wenmeng, Ji Junhui, Wang Yan, Wang Zihan, Zhang Yuxuan, Li Juanzi, Xu Bin, Dong Yuxiao, Ding Ming, Tang Jie
- Do Large Language Models And Humans Have Similar Behaviors In Causal Inference With Script Knowledge? Hong Xudong, Ryzhova Margarita, Biondi Daniel Adrian, Demberg Vera
- SPEED: Speculative Pipelined Execution For Efficient Decoding Hooper Coleman, Kim Sehoon, Mohammadzadeh Hiva, Genc Hasan, Keutzer Kurt, Gholami Amir, Shao Sophia
- More Robots Are Coming: Large Multimodal Models (chatgpt) Can Solve Visually Diverse Images Of Parsons Problems Hou Irene, Man Owen, Mettille Sophie, Gutierrez Sebastian, Angelikas Kenneth, Macneil Stephen
- Towards A Mechanistic Interpretation Of Multi-step Reasoning Capabilities Of Language Models Hou Yifan, Li Jiaoda, Fei Yu, Stolfo Alessandro, Zhou Wangchunshu, Zeng Guangtao, Bosselut Antoine, Sachan Mrinmaya
- Bad Actor, Good Advisor: Exploring The Role Of Large Language Models In Fake News Detection Hu Beizhe, Sheng Qiang, Cao Juan, Shi Yuhui, Li Yang, Wang Danding, Qi Peng
- Chain-of-symbol Prompting Elicits Planning In Large Language Models Hu Hanxu, Lu Hongyuan, Zhang Huajian, Song Yun-ze, Lam Wai, Zhang Yue
- Context-aware Prompt Tuning For Vision-language Model With Dual-alignment Hu Hongyu, Lin Tiancheng, Wang Jie, Sun Zhenbang, Xu Yi
- Ladder-of-thought: Using Knowledge As Steps To Elevate Stance Detection Hu Kairui, Yan Ming, Zhou Joey Tianyi, Tsang Ivor W., Chong Wen Haw, Yap Yong Keong
- Meta-learning Online Adaptation Of Language Models Hu Nathan, Mitchell Eric, Manning Christopher D., Finn Chelsea
- Decipherpref: Analyzing Influential Factors In Human Preference Judgments Via GPT-4 Hu Yebowen, Song Kaiqiang, Cho Sangwoo, Wang Xiaoyang, Foroosh Hassan, Liu Fei
- AMERICANO: Argument Generation With Discourse-driven Decomposition And Agent Interaction Hu Zhe, Chan Hou Pong, Yin Yu
- Llm-adapters: An Adapter Family For Parameter-efficient Fine-tuning Of Large Language Models Hu Zhiqiang, Wang Lei, Lan Yihuai, Xu Wanyu, Lim Ee-peng, Bing Lidong, Xu Xing, Poria Soujanya, Lee Roy Ka-wei
- SMILE: Single-turn To Multi-turn Inclusive Language Expansion Via Chatgpt For Mental Health Support Huachuan Qiu, Hongliang He, Shuai Zhang, Anqi Li, Zhenzhong Lan
- Enhancing Large Language Models In Coding Through Multi-perspective Self-consistency Huang Baizhou, Lu Shuai, Chen Weizhu, Wan Xiaojun, Duan Nan
- Agentcoder: Multi-agent-based Code Generation With Iterative Testing And Optimisation Huang Dong, Zhang Jie M., Luck Michael, Bu Qingwen, Qing Yuhao, Cui Heming
- Is Chatgpt Better Than Human Annotators? Potential And Limitations Of Chatgpt In Explaining Implicit Hate Speech Huang Fan, Kwak Haewoon, An Jisun
- Directed Acyclic Transformer Pre-training For High-quality Non-autoregressive Text Generation Huang Fei, Ke Pei, Huang Minlie
- Chatgpt For Shaping The Future Of Dentistry: The Potential Of Multi-modal Large Language Model Huang Hanyao, Zheng Ou, Wang Dongdong, Yin Jiayi, Wang Zijin, Ding Shengxuan, Yin Heng, Xu Chuan, Yang Renjie, Zheng Qian, Shi Bing
- Acegpt, Localizing Large Language Models In Arabic Huang Huang, Yu Fei, Zhu Jianqing, Sun Xuening, Cheng Hao, Song Dingjie, Chen Zhihong, Alharthi Abdulmohsen, An Bang, He Juncai, Liu Ziche, Zhang Zhiyi, Chen Junying, Li Jianquan, Wang Benyou, Zhang Lian, Sun Ruoyu, Wan Xiang, Li Haizhou, Xu Jinchao
- Revisiting The Reliability Of Psychological Scales On Large Language Models Huang Jen-tse, Wang Wenxuan, Lam Man Ho, Li Eric John, Jiao Wenxiang, Lyu Michael R.
- Who Is Chatgpt? Benchmarking Llms' Psychological Portrayal Using Psychobench Huang Jen-tse, Wang Wenxuan, Li Eric John, Lam Man Ho, Ren Shujie, Yuan Youliang, Jiao Wenxiang, Tu Zhaopeng, Lyu Michael R.
- Lawyer Llama Technical Report Huang Quzhe, Tao Mingxu, Zhang Chen, An Zhenwei, Jiang Cong, Chen Zhibin, Wu Zirui, Feng Yansong
- Can Large Language Models Explain Themselves? A Study Of Llm-generated Self-explanations Huang Shiyuan, Mamidanna Siddarth, Jangam Shreedhar, Zhou Yilun, Gilpin Leilani H.
- Lateval: An Interactive Llms Evaluation Benchmark With Incomplete Information From Lateral Thinking Puzzles Huang Shulin, Ma Shirong, Li Yinghui, Huang Mengzuo, Zou Wuhe, Zhang Weidong, Zheng Hai-tao
- Grounded Decoding: Guiding Text Generation With Grounded Models For Embodied Agents Huang Wenlong, Xia Fei, Shah Dhruv, Driess Danny, Zeng Andy, Lu Yao, Florence Pete, Mordatch Igor, Levine Sergey, Hausman Karol, Ichter Brian
- Fewer Is More: Boosting LLM Reasoning With Reinforced Context Pruning Huang Xijie, Zhang Li Lyna, Cheng Kwang-ting, Yang Fan, Yang Mao
- Competition-level Problems Are Effective LLM Evaluators Huang Yiming, Lin Zhenghao, Liu Xiao, Gong Yeyun, Lu Shuai, Lei Fangyu, Liang Yaobo, Shen Yelong, Lin Chen, Duan Nan, Chen Weizhu
- Trustgpt: A Benchmark For Trustworthy And Responsible Large Language Models Huang Yue, Zhang Qihui, Yu Philip S., Sun Lichao
- Evaluating And Enhancing Large Language Models For Conversational Reasoning On Knowledge Graphs Huang Yuxuan, Shi Lida, Liu Anqi, Xu Hao
- C-eval: A Multi-level Multi-discipline Chinese Evaluation Suite For Foundation Models Huang Yuzhen, Bai Yuzhuo, Zhu Zhihao, Zhang Junlei, Zhang Jinghan, Su Tangjun, Liu Junteng, Lv Chuancheng, Zhang Yikai, Lei Jiayi, Fu Yao, Sun Maosong, He Junxian
- Comparing Large Language Model AI And Human-generated Coaching Messages For Behavioral Weight Loss Huang Zhuoran, Berry Michael P., Chwyl Christina, Hsieh Gary, Wei Jing, Forman Evan M.
- Llama: Open And Efficient Foundation Language Models Hugo Touvron, Thibaut Lavril, Gautier Izacard, Xavier Martinet, Marie-anne Lachaux, Timothée Lacroix, Baptiste Rozière, Naman Goyal, Eric Hambro, Faisal Azhar, Aurelien Rodriguez, Armand Joulin, Edouard Grave, Guillaume Lample
- Walking A Tightrope -- Evaluating Large Language Models In High-risk Domains Hung Chia-chien, Rim Wiem Ben, Frost Lindsay, Bruckner Lars, Lawrence Carolin
- Do You Trust Chatgpt? -- Perceived Credibility Of Human And Ai-generated Content Huschens Martin, Briesch Martin, Sobania Dominik, Rothlauf Franz
- Understanding The Effectiveness Of Very Large Language Models On Dialog Evaluation Huynh Jessica, Jiao Cathy, Gupta Prakhar, Mehri Shikib, Bajaj Payal, Chaudhary Vishrav, Eskenazi Maxine
- Large Language Models As Sous Chefs: Revising Recipes With GPT-3 Hwang Alyssa, Li Bryan, Hou Zhaoyi, Roth Dan
- Conceptual Model Interpreter For Large Language Models Härer Felix
- Minimizing Factual Inconsistency And Hallucination In Large Language Models I Muneeswaran, Saxena Shreya, Prasad Siva, Prakash M V Sai, Shankar Advaith, V Varun, Vaddina Vishal, Gopalakrishnan Saisubramaniam
- Large Language Model Displays Emergent Ability To Interpret Novel Literary Metaphors Ichien Nicholas, Stamenković Dušan, Holyoak Keith J.
- Diversigate: A Comprehensive Framework For Reliable Large Language Models Imani Shima, Beyram Ali, Shrivastava Harsh
- Mathprompter: Mathematical Reasoning Using Large Language Models Imani Shima, Du Liang, Shrivastava Harsh
- Multitool-cot: GPT-3 Can Use Multiple External Tools With Chain Of Thought Prompting Inaba Tatsuro, Kiyomaru Hirokazu, Cheng Fei, Kurohashi Sadao
- Leveraging Large Language Models To Generate Answer Set Programs Ishay Adam, Yang Zhun, Lee Joohyung
- Instructed To Bias: Instruction-tuned Language Models Exhibit Emergent Cognitive Bias Itzhak Itay, Stanovsky Gabriel, Rosenfeld Nir, Belinkov Yonatan
- Camels In A Changing Climate: Enhancing LM Adaptation With Tulu 2 Ivison Hamish, Wang Yizhong, Pyatkin Valentina, Lambert Nathan, Peters Matthew, Dasigi Pradeep, Jang Joel, Wadden David, Smith Noah A., Beltagy Iz, Hajishirzi Hannaneh
- 14 Examples Of How Llms Can Transform Materials Science And Chemistry: A Reflection On A Large Language Model Hackathon Jablonka Kevin Maik, Ai Qianxiang, Al-feghali Alexander, Badhwar Shruti, Bocarsly Joshua D., Bran Andres M, Bringuier Stefan, Brinson L. Catherine, Choudhary Kamal, Circi Defne, Cox Sam, De Jong Wibe A., Evans Matthew L., Gastellu Nicolas, Genzling Jerome, Gil María Victoria, Gupta Ankur K., Hong Zhi, Imran Alishba, Kruschwitz Sabine, Labarre Anne, Lála Jakub, Liu Tao, Ma Steven, Majumdar Sauradeep, Merz Garrett W., Moitessier Nicolas, Moubarak Elias, Mouriño Beatriz, Pelkie Brenden, Pieler Michael, Ramos Mayk Caldas, Ranković Bojana, Rodriques Samuel G., Sanders Jacob N., Schwaller Philippe, Schwarting Marcus, Shi Jiale, Smit Berend, Smith Ben E., Van Herck Joren, Völker Christoph, Ward Logan, Warren Sean, Weiser Benjamin, Zhang Sylvester, Zhang Xiaoqi, Zia Ghezal Ahmad, Scourtas Aristana, Schmidt Kj, Foster Ian, White Andrew D., Blaiszik Ben
- Emergent Ai-assisted Discourse: Case Study Of A Second Language Writer Authoring With Chatgpt Jacob Sharin, Tate Tamara, Warschauer Mark
- Neftune: Noisy Embeddings Improve Instruction Finetuning Jain Neel, Chiang Ping-yeh, Wen Yuxin, Kirchenbauer John, Chu Hong-min, Somepalli Gowthami, Bartoldson Brian R., Kailkhura Bhavya, Schwarzschild Avi, Saha Aniruddha, Goldblum Micah, Geiping Jonas, Goldstein Tom
- Co-writing With Opinionated Language Models Affects Users' Views Jakesch Maurice, Bhat Advait, Buschek Daniel, Zalmanson Lior, Naaman Mor
- Cotran: An Llm-based Code Translator Using Reinforcement Learning With Feedback From Compiler And Symbolic Execution Jana Prithwish, Jha Piyush, Ju Haoyang, Kishore Gautham, Mahajan Aryan, Ganesh Vijay
- Consistency Analysis Of Chatgpt Jang Myeongjun Erik, Lukasiewicz Thomas
- Chatgpt Is Fun, But It Is Not Funny! Humor Is Still Challenging Large Language Models Jentzsch Sophie, Kersting Kristian
- Hijacking Context In Large Multi-modal Models Jeong Joonhyun
- Inpars-v2: Large Language Models As Efficient Dataset Generators For Information Retrieval Jeronymo Vitor, Bonifacio Luiz, Abonizio Hugo, Fadaee Marzieh, Lotufo Roberto, Zavrel Jakub, Nogueira Rodrigo
- Neuro Symbolic Reasoning For Planning: Counterexample Guided Inductive Synthesis Using Large Language Models And Satisfiability Solving Jha Sumit Kumar, Jha Susmit, Lincoln Patrick, Bastian Nathaniel D., Velasquez Alvaro, Ewetz Rickard, Neema Sandeep
- Tailoring Language Generation Models Under Total Variation Distance Ji Haozhe, Ke Pei, Hu Zhipeng, Zhang Rongsheng, Huang Minlie
- Large Language Models As Automated Aligners For Benchmarking Vision-language Models Ji Yuanfeng, Ge Chongjian, Kong Weikai, Xie Enze, Liu Zhengying, Li Zhengguo, Luo Ping
- Exploring The Impact Of Instruction Data Scaling On Large Language Models: An Empirical Study On Real-world Use Cases Ji Yunjie, Deng Yong, Gong Yan, Peng Yiping, Niu Qiang, Zhang Lei, Ma Baochang, Li Xiangang
- Zero-shot Faithfulness Evaluation For Text Summarization With Foundation Language Model Jia Qi, Ren Siyu, Liu Yizhu, Zhu Kenny Q.
- Legal Syllogism Prompting: Teaching Large Language Models For Legal Judgment Prediction Jiang Cong, Yang Xiaolei
- Tigerscore: Towards Building Explainable Metric For All Text Generation Tasks Jiang Dongfu, Li Yishan, Zhang Ge, Huang Wenhao, Lin Bill Yuchen, Chen Wenhu
- Llm-blender: Ensembling Large Language Models With Pairwise Ranking And Generative Fusion Jiang Dongfu, Ren Xiang, Lin Bill Yuchen
- Personallm: Investigating The Ability Of Large Language Models To Express Personality Traits Jiang Hang, Zhang Xiajie, Cao Xubo, Breazeal Cynthia, Roy Deb, Kabbara Jad
- Llmlingua: Compressing Prompts For Accelerated Inference Of Large Language Models Jiang Huiqiang, Wu Qianhui, Lin Chin-yew, Yang Yuqing, Qiu Lili
- Longllmlingua: Accelerating And Enhancing Llms In Long Context Scenarios Via Prompt Compression Jiang Huiqiang, Wu Qianhui, Luo Xufang, Li Dongsheng, Lin Chin-yew, Yang Yuqing, Qiu Lili
- Graphologue: Exploring Large Language Model Responses With Interactive Diagrams Jiang Peiling, Rayan Jude, Dow Steven P., Xia Haijun
- Selfevolve: A Code Evolution Framework Via Large Language Models Jiang Shuyang, Wang Yuhao, Wang Yu
- Prompt Packer: Deceiving Llms Through Compositional Instruction With Hidden Attacks Jiang Shuyu, Chen Xingshu, Tang Rui
- Scaling Sentence Embeddings With Large Language Models Jiang Ting, Huang Shaohan, Luan Zhongzhi, Wang Deqing, Zhuang Fuzhen
- Lion: Adversarial Distillation Of Proprietary Large Language Models Jiang Yuxin, Chan Chunkit, Chen Mingyang, Wang Wei
- Parrot: Translating During Chat Using Large Language Models Tuned With Human Translation And Feedback Jiao Wenxiang, Huang Jen-tse, Wang Wenxuan, He Zhiwei, Liang Tian, Wang Xing, Shi Shuming, Tu Zhaopeng
- Storyanalogy: Deriving Story-level Analogies From Large Language Models To Unlock Analogical Understanding Jiayang Cheng, Qiu Lin, Chan Tsz Ho, Fang Tianqing, Wang Weiqi, Chan Chunkit, Ru Dongyu, Guo Qipeng, Zhang Hongming, Song Yangqiu, Zhang Yue, Zhang Zheng
- Prompt-based Length Controlled Generation With Reinforcement Learning Jie Renlong, Meng Xiaojun, Shang Lifeng, Jiang Xin, Liu Qun
- Medcpt: Contrastive Pre-trained Transformers With Large-scale Pubmed Search Logs For Zero-shot Biomedical Information Retrieval Jin Qiao, Kim Won, Chen Qingyu, Comeau Donald C., Yeganova Lana, Wilbur W. John, Lu Zhiyong
- Binary Code Summarization: Benchmarking Chatgpt/gpt-4 And Other Large Language Models Jin Xin, Larson Jonathan, Yang Weiwei, Lin Zhiqiang
- Vcoder: Versatile Vision Encoders For Multimodal Large Language Models Jitesh Jain, Jianwei Yang, Humphrey Shi
- Is Chatgpt Fair For Recommendation? Evaluating Fairness In Large Language Model Recommendation Jizhi Zhang, Keqin Bao, Yang Zhang, Wenjie Wang, Fuli Feng, Xiangnan He
- GPT Is Becoming A Turing Machine: Here Are Some Ways To Program It Jojic Ana, Wang Zhen, Jojic Nebojsa
- Does GPT-4 Pass The Turing Test? Jones Cameron R., Bergen Benjamin K.
- "It's Not Like Jarvis, But It's Pretty Close!" -- Examining Chatgpt's Usage Among Undergraduate Students In Computer Science Joshi Ishika, Budhiraja Ritvik, Akolekar Harshal D, Challa Jagat Sesh, Kumar Dhruv
- Chatgpt In The Classroom: An Analysis Of Its Strengths And Weaknesses For Solving Undergraduate Computer Science Questions Joshi Ishika, Budhiraja Ritvik, Dev Harshal, Kadia Jahnvi, Ataullah M. Osama, Mitra Sayan, Kumar Dhruv, Akolekar Harshal D.
- A Glimpse In Chatgpt Capabilities And Its Impact For AI Research Joublin Frank, Ceravola Antonello, Deigmoeller Joerg, Gienger Michael, Franzius Mathias, Eggert Julian
- Small Language Models Fine-tuned To Coordinate Larger Language Models Improve Complex Reasoning Juneja Gurusha, Dutta Subhabrata, Chakrabarti Soumen, Manchanda Sunny, Chakraborty Tanmoy
- Impossible Distillation: From Low-quality Model To High-quality Dataset & Model For Summarization And Paraphrasing Jung Jaehun, West Peter, Jiang Liwei, Brahman Faeze, Lu Ximing, Fisher Jillian, Sorensen Taylor, Choi Yejin
- A Comprehensive Capability Analysis Of GPT-3 And GPT-3.5 Series Models Junjie Ye, Xuanting Chen, Nuo Xu, Can Zu, Zekai Shao, Shichun Liu, Yuhan Cui, Zeyang Zhou, Chao Gong, Yang Shen, Jie Zhou, Siming Chen, Tao Gui, Qi Zhang, Xuanjing Huang
- Recommendation As Instruction Following: A Large Language Model Empowered Recommendation Approach Junjie Zhang, Ruobing Xie, Yupeng Hou, Wayne Xin Zhao, Leyu Lin, Ji-rong Wen
- Is Chatgpt A Good Recommender? A Preliminary Study Junling Liu, Chao Liu, Peilin Zhou, Renjie Lv, Kang Zhou, Yan Zhang
- Evaluation And Analysis Of Hallucination In Large Vision-language Models Junyang Wang, Yiyang Zhou, Guohai Xu, Pengcheng Shi, Chenlin Zhao, Haiyang Xu, Qinghao Ye, Ming Yan, Ji Zhang, Jihua Zhu, Jitao Sang, Haoyu Tang
- AMBER: An Llm-free Multi-dimensional Benchmark For Mllms Hallucination Evaluation Junyang Wang, Yuhang Wang, Guohai Xu, Jing Zhang, Yukai Gu, Haitao Jia, Jiaqi Wang, Haiyang Xu, Ming Yan, Ji Zhang, Jitao Sang
- TARJAMAT: Evaluation Of Bard And Chatgpt On Machine Translation Of Ten Arabic Varieties Kadaoui Karima, Magdy Samar M., Waheed Abdul, Khondaker Md Tawkat Islam, El-shangiti Ahmed Oumar, Nagoudi El Moatez Billah, Abdul-mageed Muhammad
- Domain-specific Code Language Models: Unraveling The Potential For HPC Codes And Tasks Kadosh Tal, Hasabnis Niranjan, Vo Vy A., Schneider Nadav, Krien Neva, Capota Mihai, Wasay Abdul, Ahmed Nesreen, Willke Ted, Tamir Guy, Pinter Yuval, Mattson Timothy, Oren Gal
- Robust Training For Conversational Question Answering Models With Reinforced Reformulation Generation Kaiser Magdalena, Roy Rishiraj Saha, Weikum Gerhard
- Swectrl-mini: A Data-transparent Transformer-based Large Language Model For Controllable Text Generation In Swedish Kalpakchi Dmytro, Boye Johan
- A Survey Of GPT-3 Family Large Language Models Including Chatgpt And GPT-4 Kalyan Katikapalli Subramanyam
- Evaluating Open-domain Question Answering In The Era Of Large Language Models Kamalloo Ehsan, Dziri Nouha, Clarke Charles L. A., Rafiei Davood
- Knowledge-augmented Reasoning Distillation For Small Language Models In Knowledge-intensive Tasks Kang Minki, Lee Seanie, Baek Jinheon, Kawaguchi Kenji, Hwang Sung Ju
- Chatmof: An Autonomous AI System For Predicting And Generating Metal-organic Frameworks Kang Yeonghun, Kim Jihan
- Shs-nlp At Radsum23: Domain-adaptive Pre-training Of Instruction-tuned Llms For Radiology Report Impression Generation Karn Sanjeev Kumar, Ghosh Rikhiya, P Kusuma, Farri Oladimeji
- Large Language Models Effectively Leverage Document-level Context For Literary Translation, But Critical Errors Persist Karpinska Marzena, Iyyer Mohit
- Exploring Automatic Evaluation Methods Based On A Decoder-based LLM For Text Generation Kasahara Tomohito, Kawahara Daisuke
- Evaluating GPT-4 And Chatgpt On Japanese Medical Licensing Examinations Kasai Jungo, Kasai Yuhei, Sakaguchi Keisuke, Yamada Yutaro, Radev Dragomir
- Evaluating Large Language Models For Health-related Queries With Presuppositions Kaur Navreet, Choudhury Monojit, Pruthi Danish
- Application Of Frozen Large-scale Models To Multimodal Task-oriented Dialogue Kawamoto Tatsuki, Suzuki Takuma, Miyama Ko, Meguro Takumi, Takagi Tomohiro
- Critiquellm: Towards An Informative Critique Generation Model For Evaluation Of Large Language Model Generation Ke Pei, Wen Bosi, Feng Zhuoer, Liu Xiao, Lei Xuanyu, Cheng Jiale, Wang Shengyuan, Zeng Aohan, Dong Yuxiao, Wang Hongning, Tang Jie, Huang Minlie
- Appropriateness Is All You Need! Kempt Hendrik, Lavie Alon, Nagel Saskia K.
- Can Chatgpt And Bard Generate Aligned Assessment Items? A Reliability Analysis Against Human Performance Khademi Abdolvahab
- Assessing The Promise And Pitfalls Of Chatgpt For Automated Code Generation Khan Muhammad Fawad Akbar, Ramsdell Max, Falor Erik, Karimi Hamid
- Zzzgpt: An Interactive GPT Approach To Enhance Sleep Quality Khaokaew Yonchanok, Ji Kaixin, Nguyen Thuc Hanh, Kegalle Hiruni, Alaofi Marwah, Xue Hao, Salim Flora D.
- Dspy: Compiling Declarative Language Model Calls Into Self-improving Pipelines Khattab Omar, Singhvi Arnav, Maheshwari Paridhi, Zhang Zhiyuan, Santhanam Keshav, Vardhamanan Sri, Haq Saiful, Sharma Ashutosh, Joshi Thomas T., Moazam Hanna, Miller Heather, Zaharia Matei, Potts Christopher
- Reliability Check: An Analysis Of Gpt-3's Response To Sensitive Topics And Prompt Wording Khatun Aisha, Brown Daniel G.
- Gptaraeval: A Comprehensive Evaluation Of Chatgpt On Arabic NLP Khondaker Md Tawkat Islam, Waheed Abdul, Nagoudi El Moatez Billah, Abdul-mageed Muhammad
- How Secure Is Code Generated By Chatgpt? Khoury Raphaël, Avila Anderson R., Brunelle Jacob, Camara Baba Mamadou
- Exploring The Potential Of Large Language Models To Generate Formative Programming Feedback Kiesler Natalie, Lohr Dominic, Keuning Hieke
- Large Language Models In Introductory Programming Education: Chatgpt's Performance And Implications For Assessments Kiesler Natalie, Schiffner Daniel
- Language Models Can Solve Computer Tasks Kim Geunwoo, Baldi Pierre, Mcaleer Stephen
- GTA: Gated Toxicity Avoidance For LM Performance Preservation Kim Heegyu, Cho Hyunsouk
- Which Is Better? Exploring Prompting Strategy For Llm-based Metrics Kim Joonghoon, Park Saeran, Jeong Kiyoon, Lee Sangmin, Han Seung Hun, Lee Jiyoon, Kang Pilsung
- Speculative Decoding With Big Little Decoder Kim Sehoon, Mangalam Karttikeya, Moon Suhong, Malik Jitendra, Mahoney Michael W., Gholami Amir, Keutzer Kurt
- Prometheus: Inducing Fine-grained Evaluation Capability In Language Models Kim Seungone, Shin Jamin, Cho Yejin, Jang Joel, Longpre Shayne, Lee Hwaran, Yun Sangdoo, Shin Seongjin, Kim Sungdong, Thorne James, Seo Minjoon
- Aligning Large Language Models Through Synthetic Feedback Kim Sungdong, Bae Sanghwan, Shin Jamin, Kang Soyoung, Kwak Donghyun, Yoo Kang Min, Seo Minjoon
- MAGVLT: Masked Generative Vision-and-language Transformer Kim Sungwoong, Jo Daejin, Lee Donghoon, Kim Jongmin
- Can Language Models Laugh At Youtube Short-form Videos? Ko Dayoon, Lee Sangho, Kim Gunhee
- Large Language Models Are Temporal And Causal Reasoners For Video Question Answering Ko Dohwan, Lee Ji Soo, Kang Wooyoung, Roh Byungseok, Kim Hyunwoo J.
- Transformer Language Models Handle Word Frequency In Prediction Head Kobayashi Goro, Kuribayashi Tatsuki, Yokoi Sho, Inui Kentaro
- Conversational Ai-powered Design: Chatgpt As Designer, User, And Product Kocaballi A. Baki
- Large Language Models Are State-of-the-art Evaluators Of Translation Quality Kocmi Tom, Federmann Christian
- Chatgpt: Jack Of All Trades, Master Of None Kocoń Jan, Cichecki Igor, Kaszyca Oliwier, Kochanek Mateusz, Szydło Dominika, Baran Joanna, Bielaniewicz Julita, Gruza Marcin, Janz Arkadiusz, Kanclerz Kamil, Kocoń Anna, Koptyra Bartłomiej, Mieleszczenko-kowszewicz Wiktoria, Miłkowski Piotr, Oleksy Marcin, Piasecki Maciej, Radliński Łukasz, Wojtasik Konrad, Woźniak Stanisław, Kazienko Przemysław
- Writing Your Own Book: A Method For Going From Closed To Open Book QA To Improve Robustness And Performance Of Smaller Llms Kokaia Giorgi, Sinha Pratyush, Jiang Yutong, Boujemaa Nozha
- Better Zero-shot Reasoning With Role-play Prompting Kong Aobo, Zhao Shiwan, Chen Hao, Li Qicheng, Qin Yong, Sun Ruiqi, Zhou Xin, Wang Enzhi, Dong Xiaohang
- Platolm: Teaching Llms In Multi-round Dialogue Via A User Simulator Kong Chuyi, Fan Yaxin, Wan Xiang, Jiang Feng, Wang Benyou
- NOLA: Compressing Lora Using Linear Combination Of Random Basis Koohpayegani Soroush Abbasi, Navaneet Kl, Nooralinejad Parsa, Kolouri Soheil, Pirsiavash Hamed
- Humans In Humans Out: On GPT Converging Toward Common Sense In Both Success And Failure Koralus Philipp, Wang-maścianica Vincent
- Performance Of The Pre-trained Large Language Model GPT-4 On Automated Short Answer Grading Kortemeyer Gerd
- Evaluating Large Language Models In Theory Of Mind Tasks Kosinski Michal
- Understanding Catastrophic Forgetting In Language Models Via Implicit Inference Kotha Suhas, Springer Jacob Mitchell, Raghunathan Aditi
- Large Language Models Only Pass Primary School Exams In Indonesia: A Comprehensive Test On Indommlu Koto Fajri, Aisyah Nurul, Li Haonan, Baldwin Timothy
- Do Large Language Models Pay Similar Attention Like Human Programmers When Generating Code? Kou Bonan, Chen Shengmai, Wang Zhijie, Ma Lei, Zhang Tianyi
- Developing Effective Educational Chatbots With Chatgpt Prompts: Insights From Preliminary Tests In A Case Study On Social Media Literacy (with Appendix) Koyuturk Cansu, Yavari Mona, Theophilou Emily, Bursic Sathya, Donabauer Gregor, Telari Alessia, Testa Alessia, Boiano Raffaele, Gabbiadini Alessandro, Hernandez-leo Davinia, Ruskov Martin, Ognibene Dimitri
- Chatgpt For PLC/DCS Control Logic Generation Koziolek Heiko, Gruener Sten, Ashiwal Virendra
- Leveraging Few-shot Data Augmentation And Waterfall Prompting For Response Generation Krause Lea, Santamaría Selene Báez, Van Der Meer Michiel, Khurana Urja
- Prompt Generate Train (PGT): Few-shot Domain Adaption Of Retrieval Augmented Generation Models For Open Book Question-answering Krishna C. S.
- Chatbots Are Not Reliable Text Annotators Kristensen-mclachlan Ross Deans, Canavan Miceal, Kardos Márton, Jacobsen Mia, Aarøe Lene
- The Moral Authority Of Chatgpt Krügel Sebastian, Ostermaier Andreas, Uhl Matthias
- Fundamentals Of Generative Large Language Models And Perspectives In Cyber-defense Kucharavy Andrei, Schillaci Zachary, Maréchal Loïc, Würsch Maxime, Dolamic Ljiljana, Sabonnadiere Remi, David Dimitri Percia, Mermoud Alain, Lenders Vincent
- Watch Your Language: Investigating Content Moderation With Large Language Models Kumar Deepak, Abuhashem Yousef, Durumeric Zakir
- How Reliable Are Ai-generated-text Detectors? An Assessment Framework Using Evasive Soft Prompts Kumarage Tharindu, Sheth Paras, Moraffah Raha, Garland Joshua, Liu Huan
- Large Language Models On The Chessboard: A Study On Chatgpt's Formal Language Comprehension And Complex Reasoning Skills Kuo Mu-tien, Hsueh Chih-chung, Tsai Richard Tzong-han
- Sparse Fine-tuning For Inference Acceleration Of Large Language Models Kurtic Eldar, Kuznedelev Denis, Frantar Elias, Goin Michael, Alistarh Dan
- Chatgpt: Beginning Of An End Of Manual Linguistic Data Annotation? Use Case Of Automatic Genre Identification Kuzman Taja, Mozetič Igor, Ljubešić Nikola
- In Generative AI We Trust: Can Chatbots Effectively Verify Political Information? Kuznetsova Elizaveta, Makhortykh Mykola, Vziatysheva Victoria, Stolze Martha, Baghumyan Ani, Urman Aleksandra
- Reward Design With Language Models Kwon Minae, Xie Sang Michael, Bullard Kalesha, Sadigh Dorsa
- Chatgpt For Arabic Grammatical Error Correction Kwon Sang Yun, Bhatia Gagan, Nagoudi El Moatez Billah, Abdul-mageed Muhammad
- Beyond English: Evaluating Llms For Arabic Grammatical Error Correction Kwon Sang Yun, Bhatia Gagan, Nagoudi El Moatez Billah, Abdul-mageed Muhammad
- Openassistant Conversations -- Democratizing Large Language Model Alignment Köpf Andreas, Kilcher Yannic, Von Rütte Dimitri, Anagnostidis Sotiris, Tam Zhi-rui, Stevens Keith, Barhoum Abdullah, Duc Nguyen Minh, Stanley Oliver, Nagyfi Richárd, Es Shahul, Suri Sameer, Glushkov David, Dantuluri Arnav, Maguire Andrew, Schuhmann Christoph, Nguyen Huu, Mattick Alexander
- A Zero-shot And Few-shot Study Of Instruction-finetuned Large Language Models Applied To Clinical And Biomedical Tasks Labrak Yanis, Rouvier Mickael, Dufour Richard
- Unraveling Chatgpt: A Critical Analysis Of Ai-generated Goal-oriented Dialogues And Annotations Labruna Tiziano, Brenna Sofia, Zaninello Andrea, Magnini Bernardo
- Multidimensional Evaluation For Text Style Transfer Using Chatgpt Lai Huiyuan, Toral Antonio, Nissim Malvina
- Chatgpt Beyond English: Towards A Comprehensive Evaluation Of Large Language Models In Multilingual Learning Lai Viet Dac, Ngo Nghia Trung, Veyseh Amir Pouran Ben, Man Hieu, Dernoncourt Franck, Bui Trung, Nguyen Thien Huu
- Mini-dalle3: Interactive Text To Image By Prompting Large Language Models Lai Zeqiang, Zhu Xizhou, Dai Jifeng, Qiao Yu, Wang Wenhai
- Exploring Large Language Models To Facilitate Variable Autonomy For Human-robot Teaming Lakhnati Younes, Pascher Max, Gerken Jens
- Can Llms Be Good Financial Advisors?: An Initial Study In Personal Decision Making For Optimized Outcomes Lakkaraju Kausik, Vuruma Sai Krishna Revanth, Pallagani Vishal, Muppasani Bharath, Srivastava Biplav
- Large Language Models Are Partially Primed In Pronoun Interpretation Lam Suet-ying, Zeng Qingcheng, Zhang Kexun, You Chenyu, Voigt Rob
- LLM Cognitive Judgements Differ From Human Lamprinidis Sotiris
- Creating Large Language Model Resistant Exams: Guidelines And Strategies Larsen Simon Kaare
- A Systematic Study And Comprehensive Evaluation Of Chatgpt On Benchmark Datasets Laskar Md Tahmid Rahman, Bari M Saiful, Rahman Mizanur, Bhuiyan Md Amran Hossen, Joty Shafiq, Huang Jimmy Xiangji
- Fine-tuning Chatgpt For Automatic Scoring Latif Ehsan, Zhai Xiaoming
- VAL: Interactive Task Learning With GPT Dialog Parsing Lawley Lane, Maclellan Christopher J.
- Improved Instruction Ordering In Recipe-grounded Conversation Le Duong Minh, Guo Ruohao, Xu Wei, Ritter Alan
- Voicebox: Text-guided Multilingual Universal Speech Generation At Scale Le Matthew, Vyas Apoorv, Shi Bowen, Karrer Brian, Sari Leda, Moritz Rashel, Williamson Mary, Manohar Vimal, Adi Yossi, Mahadeokar Jay, Hsu Wei-ning
- Challenges Of Gpt-3-based Conversational Agents For Healthcare Lechner Fabian, Lahnala Allison, Welch Charles, Flek Lucie
- Learning From Teaching Assistants To Program With Subgoals: Exploring The Potential For AI Teaching Assistants Lee Changyoon, Myung Junho, Han Jieun, Jin Jiho, Oh Alice
- Towards Reliable And Fluent Large Language Models: Incorporating Feedback Learning Loops In QA Systems Lee Dongyub, Whang Taesun, Lee Chanhee, Lim Heuiseok
- Applying Large Language Models And Chain-of-thought For Automatic Scoring Lee Gyeong-geon, Latif Ehsan, Wu Xuansheng, Liu Ninghao, Zhai Xiaoming
- The Importance Of Prompt Tuning For Automated Neuron Explanations Lee Justin, Oikarinen Tuomas, Chatha Arjun, Chang Keng-chi, Chen Yilan, Weng Tsui-wei
- Building Multimodal AI Chatbots Lee Min Young
- Teaching Arithmetic To Small Transformers Lee Nayoung, Sreenivasan Kartik, Lee Jason D., Lee Kangwook, Papailiopoulos Dimitris
- Explore, Select, Derive, And Recall: Augmenting LLM With Human-like Memory For Mobile Task Automation Lee Sunjae, Choi Junyoung, Lee Jungjae, Wasi Munim Hasan, Choi Hojun, Ko Steven Y., Oh Sangeun, Shin Insik
- Large Language Models Can Share Images, Too! Lee Young-jun, Lee Dokyong, Sung Joo Won, Hyeon Jonghwan, Choi Ho-jin
- Creating A Dataset For High-performance Computing Code Translation Using Llms: A Bridge Between Openmp Fortran And C++ Lei Bin, Ding Caiwen, Chen Le, Lin Pei-hung, Liao Chunhua
- Boosting Logical Reasoning In Large Language Models Through A New Framework: The Graph Of Thought Lei Bin, Lin Pei-hung, Liao Chunhua, Ding Caiwen
- Zero-shot Next-item Recommendation Using Large Pretrained Language Models Lei Wang, Ee-peng Lim
- Mitigating Fine-grained Hallucination By Fine-tuning Large Vision-language Models With Caption Rewrites Lei Wang, Jiabang He, Shenshen Li, Ning Liu, Ee-peng Lim
- Llm4vis: Explainable Visualization Recommendation Using Chatgpt Lei Wang, Songheng Zhang, Yun Wang, Ee-peng Lim, Yong Wang
- Chatgpt: A Meta-analysis After 2.5 Months Leiter Christoph, Zhang Ran, Chen Yanran, Belouadi Jonas, Larionov Daniil, Fresen Vivian, Eger Steffen
- Guiding LLM To Fool Itself: Automatically Manipulating Machine Reading Comprehension Shortcut Triggers Levy Mosh, Ravfogel Shauli, Goldberg Yoav
- Seed-bench-2: Benchmarking Multimodal Large Language Models Li Bohao, Ge Yuying, Ge Yixiao, Wang Guangzhi, Wang Rui, Zhang Ruimao, Shan Ying
- Task Contamination: Language Models May Not Be Few-shot Anymore Li Changmao, Flanigan Jeffrey
- Mixed Distillation Helps Smaller Language Model Better Reasoning Li Chenglin, Chen Qianglong, Li Liangyue, Wang Caiyu, Li Yicheng, Chen Zulong, Zhang Yin
- Dialogue For Prompting: A Policy-gradient-based Discrete Prompt Generation For Few-shot Learning Li Chengzhengxu, Liu Xiaoming, Wang Yichen, Li Duyi, Lan Yu, Shen Chao
- Modelscope-agent: Building Your Customizable Agent System With Open-source Large Language Models Li Chenliang, Chen Hehong, Yan Ming, Shen Weizhou, Xu Haiyang, Wu Zhikai, Zhang Zhicheng, Zhou Wenmeng, Chen Yingda, Cheng Chen, Shi Hongzhu, Zhang Ji, Huang Fei, Zhou Jingren
- Llava-med: Training A Large Language-and-vision Assistant For Biomedicine In One Day Li Chunyuan, Wong Cliff, Zhang Sheng, Usuyama Naoto, Liu Haotian, Yang Jianwei, Naumann Tristan, Poon Hoifung, Gao Jianfeng
- Explaincpe: A Free-text Explanation Benchmark Of Chinese Pharmacist Examination Li Dongfang, Yu Jindi, Hu Baotian, Xu Zhenran, Zhang Min
- Revisiting Large Language Models As Zero-shot Relation Extractors Li Guozheng, Wang Peng, Ke Wenjun
- Do Vision And Language Models Share Concepts? A Vector Space Alignment Study Li Jiaang, Kementchedjhieva Yova, Fierro Constanza, Søgaard Anders
- Eliciting The Translation Ability Of Large Language Models Via Multilingual Finetuning With Translation Instructions Li Jiahuan, Zhou Hao, Huang Shujian, Cheng Shanbo, Chen Jiajun
- Adaptive Gating In Mixture-of-experts Based Language Models Li Jiamin, Su Qiang, Yang Yitao, Jiang Yimin, Wang Cong, Xu Hong
- CFGPT: Chinese Financial Assistant With Large Language Model Li Jiangtong, Bian Yuxuan, Wang Guoxuan, Lei Yang, Cheng Dawei, Ding Zhijun, Jiang Changjun
- Counterfactual Reasoning: Testing Language Models' Understanding Of Hypothetical Scenarios Li Jiaxuan, Yu Lang, Ettinger Allyson
- Distilling Chatgpt For Explainable Automated Student Answer Assessment Li Jiazheng, Gui Lin, Zhou Yuxiang, West David, Aloisi Cesare, He Yulan
- Overprompt: Enhancing Chatgpt Through Efficient In-context Learning Li Jiazheng, Zhao Runcong, Yang Yongxin, He Yulan, Gui Lin
- Contradoc: Understanding Self-contradictions In Documents With Large Language Models Li Jierui, Raheja Vipul, Kumar Dhruv
- Instructcoder: Instruction Tuning Large Language Models For Code Editing Li Kaixin, Hu Qisheng, Zhao Xu, Chen Hui, Xie Yuxi, Liu Tiedong, Xie Qizhe, He Junxian
- Tool-augmented Reward Modeling Li Lei, Chai Yekun, Wang Shuohuan, Sun Yu, Tian Hao, Zhang Ningyu, Wu Hua
- Watermarking Llms With Weight Quantization Li Linyang, Jiang Botian, Wang Pengyu, Ren Ke, Yan Hang, Qiu Xipeng
- Api-bank: A Comprehensive Benchmark For Tool-augmented Llms Li Minghao, Zhao Yingxiu, Yu Bowen, Song Feifan, Li Hangyu, Yu Haiyang, Li Zhoujun, Huang Fei, Li Yongbin
- Skillgpt: A Restful API Service For Skill Extraction And Standardization Using A Large Language Model Li Nan, Kang Bo, De Bie Tijl
- Table-gpt: Table-tuned GPT For Diverse Table Tasks Li Peng, He Yeye, Yashar Dror, Cui Weiwei, Ge Song, Zhang Haidong, Fainman Danielle Rifinski, Zhang Dongmei, Chaudhuri Surajit
- Vlm-eval: A General Evaluation On Video Large Language Models Li Shuailin, Zhang Yuang, Zhao Yucheng, Wang Qiuyue, Jia Fan, Liu Yingfei, Wang Tiancai
- Benchmarking And Improving Generator-validator Consistency Of Language Models Li Xiang Lisa, Shrivastava Vaishnavi, Li Siyan, Hashimoto Tatsunori, Liang Percy
- Context Matters: Data-efficient Augmentation Of Large Language Models For Scientific Applications Li Xiang, Tang Haoran, Chen Siyu, Wang Ziwei, Maravi Anurag, Abram Marcin
- FLM-101B: An Open LLM And How To Train It With $100K Budget Li Xiang, Yao Yiqun, Jiang Xin, Fang Xuezhi, Meng Xuying, Fan Siqi, Han Peng, Li Jing, Du Li, Qin Bowen, Zhang Zheng, Sun Aixin, Wang Yequan
- Image Content Generation With Causal Reasoning Li Xiaochuan, Fan Baoyu, Zhang Runze, Jin Liang, Wang Di, Guo Zhenhua, Zhao Yaqian, Li Rengang
- Mot: Memory-of-thought Enables Chatgpt To Self-improve Li Xiaonan, Qiu Xipeng
- Think Outside The Code: Brainstorming Boosts Large Language Models In Code Generation Li Xin-ye, Xue Jiang-tian, Xie Zheng, Li Ming
- Exploring Fine-tuning Chatgpt For News Recommendation Li Xinyi, Zhang Yongfeng, Malthouse Edward C
- Explicit Syntactic Guidance For Neural Text Generation Li Yafu, Cui Leyang, Yan Jianhao, Yin Yongjing, Bi Wei, Shi Shuming, Zhang Yue
- Stablellava: Enhanced Visual Instruction Tuning With Synthesized Image-dialogue Data Li Yanda, Zhang Chi, Yu Gang, Wang Zhibin, Fu Bin, Lin Guosheng, Shen Chunhua, Chen Ling, Wei Yunchao
- Curriculum-driven Edubot: A Framework For Developing Language Learning Chatbots Through Synthesizing Conversational Data Li Yu, Qu Shang, Shen Jili, Min Shangchao, Yu Zhou
- RAIN: Your Language Models Can Align Themselves Without Finetuning Li Yuhui, Wei Fangyun, Zhao Jinjing, Zhang Chao, Zhang Hongyang
- Towards Vision Enhancing Llms: Empowering Multimodal Knowledge Storage And Sharing In Llms Li Yunxin, Hu Baotian, Wang Wei, Cao Xiaochun, Zhang Min
- Advancing Precise Outline-conditioned Text Generation With Task Duality And Explicit Outline Control Li Yunzhe, Chen Qian, Yan Weixiang, Wang Wen, Zhang Qinglin, Sundaram Hari
- Guiding Large Language Models Via Directional Stimulus Prompting Li Zekun, Peng Baolin, He Pengcheng, Galley Michel, Gao Jianfeng, Yan Xifeng
- Split And Merge: Aligning Position Biases In Large Language Model Based Evaluators Li Zongjie, Wang Chaozheng, Ma Pingchuan, Wu Daoyuan, Wang Shuai, Gao Cuiyun, Liu Yang
- Batgpt: A Bidirectional Autoregessive Talker From Generative Pre-trained Transformer Li Zuchao, Zhang Shitou, Zhao Hai, Yang Yifei, Yang Dongjie
- Towards End-to-end Embodied Decision Making Via Multi-modal Large Language Model: Explorations With Gpt4-vision And Beyond Liang Chen, Yichi Zhang, Shuhuai Ren, Haozhe Zhao, Zefan Cai, Yuchi Wang, Peiyi Wang, Tianyu Liu, Baobao Chang
- Exploring Format Consistency For Instruction Tuning Liang Shihao, Tian Runchu, Zhu Kunlun, Qin Yujia, Wang Huadong, Cong Xin, Liu Zhiyuan, Liu Xiaojiang, Sun Maosong
- Encouraging Divergent Thinking In Large Language Models Through Multi-agent Debate Liang Tian, He Zhiwei, Jiao Wenxiang, Wang Xing, Wang Rui, Yang Yujiu, Tu Zhaopeng, Shi Shuming
- GPT Detectors Are Biased Against Non-native English Writers Liang Weixin, Yuksekgonul Mert, Mao Yining, Wu Eric, Zou James
- Uhgeval: Benchmarking The Hallucination Of Chinese Large Language Models Via Unconstrained Generation Liang Xun, Song Shichao, Niu Simin, Li Zhiyu, Xiong Feiyu, Tang Bo, Wang Yezhaohui, He Dawei, Cheng Peng, Wang Zhonghao, Deng Haiying
- Let GPT Be A Math Tutor: Teaching Math Word Problem Solvers With Customized Exercise Generation Liang Zhenwen, Yu Wenhao, Rajpurohit Tanmay, Clark Peter, Zhang Xiangliang, Kaylan Ashwin
- GPT-4 Enhanced Multimodal Grounding For Autonomous Driving: Leveraging Cross-modal Attention With Large Language Models Liao Haicheng, Shen Huanming, Li Zhenning, Wang Chengyue, Li Guofa, Bie Yiming, Xu Chengzhong
- Modeling Complex Mathematical Reasoning Via Large Language Model Based Mathagent Liao Haoran, Du Qinyi, Hu Shaohua, He Hao, Xu Yanyan, Tian Jidong, Jin Yaohui
- Opening Up Chatgpt: Tracking Openness, Transparency, And Accountability In Instruction-tuned Text Generators Liesenfeld Andreas, Lopez Alianda, Dingemanse Mark
- Video-llava: Learning United Visual Representation By Alignment Before Projection Lin Bin, Ye Yang, Zhu Bin, Cui Jiaxi, Ning Munan, Jin Peng, Yuan Li
- Paralinguistics-enhanced Large Language Modeling Of Spoken Dialogue Lin Guan-ting, Shivakumar Prashanth Gurunath, Gandhe Ankur, Yang Chao-han Huck, Gu Yile, Ghosh Shalini, Stolcke Andreas, Lee Hung-yi, Bulyko Ivan
- Sparks Of Artificial General Recommender (AGR): Early Experiments With Chatgpt Lin Guo, Zhang Yongfeng
- Videodirectorgpt: Consistent Multi-scene Video Generation Via Llm-guided Planning Lin Han, Zala Abhay, Cho Jaemin, Bansal Mohit
- Agentsims: An Open-source Sandbox For Large Language Model Evaluation Lin Jiaju, Zhao Haoran, Zhang Aochi, Wu Yiting, Ping Huqiuyue, Chen Qin
- Batchprompt: Accomplish More With Less Lin Jianzhe, Diesendruck Maurice, Du Liang, Abraham Robin
- Argue With Me Tersely: Towards Sentence-level Counter-argument Generation Lin Jiayu, Ye Rong, Han Meng, Zhang Qi, Lai Ruofei, Zhang Xinyu, Cao Zhao, Huang Xuanjing, Wei Zhongyu
- Advances In Embodied Navigation Using Large Language Models: A Survey Lin Jinzhou, Gao Han, Feng Xuxiang, Xu Rongtao, Wang Changwei, Zhang Man, Guo Li, Xu Shibiao
- Comparing Generative Chatbots Based On Process Requirements Lins Luis Fernando, Nascimento Nathalia, Alencar Paulo, Oliveira Toacy, Cowan Donald
- External Reasoning: Towards Multi-large-language-models Interchangeable Assistance With Human Feedback Liu Akide
- Recprompt: A Prompt Tuning Framework For News Recommendation Using Large Language Models Liu Dairui, Yang Boming, Du Honghui, Greene Derek, Lawlor Aonghus, Dong Ruihai, Li Irene
- MMC: Advancing Multimodal Chart Understanding With Large-scale Instruction Tuning Liu Fuxiao, Wang Xiaoyang, Yao Wenlin, Chen Jianshu, Song Kaiqiang, Cho Sangwoo, Yacoob Yaser, Yu Dong
- Logicot: Logical Chain-of-thought Instruction-tuning Liu Hanmeng, Teng Zhiyang, Cui Leyang, Zhang Chaoli, Zhou Qiji, Zhang Yue
- Glore: Evaluating Logical Reasoning Of Large Language Models Liu Hanmeng, Teng Zhiyang, Ning Ruoxi, Liu Jian, Zhou Qiji, Zhang Yue
- Exploration With Principles For Diverse AI Supervision Liu Hao, Zaharia Matei, Abbeel Pieter
- Alltogether: Investigating The Efficacy Of Spliced Prompt For Web Navigation Using Large Language Models Liu Jiarun, Hu Wentao, Zhang Chunhong
- Chatcounselor: A Large Language Models For Mental Health Support Liu June M., Li Donghao, Cao He, Ren Tianhe, Liao Zeyi, Wu Jiamin
- An Evaluation Of GPT-4V And Gemini In Online VQA Liu Mengchen, Chen Chongyan, Gurari Danna
- Chipnemo: Domain-adapted Llms For Chip Design Liu Mingjie, Ene Teodor-dumitru, Kirby Robert, Cheng Chris, Pinckney Nathaniel, Liang Rongjian, Alben Jonah, Anand Himyanshu, Banerjee Sanmitra, Bayraktaroglu Ismet, Bhaskaran Bonita, Catanzaro Bryan, Chaudhuri Arjun, Clay Sharon, Dally Bill, Dang Laura, Deshpande Parikshit, Dhodhi Siddhanth, Halepete Sameer, Hill Eric, Hu Jiashang, Jain Sumit, Jindal Ankit, Khailany Brucek, Kokai George, Kunal Kishor, Li Xiaowei, Lind Charley, Liu Hao, Oberman Stuart, Omar Sujeet, Pasandi Ghasem, Pratty Sreedhar, Raiman Jonathan, Sarkar Ambar, Shao Zhengjiang, Sun Hanfei, Suthar Pratik P, Tej Varun, Turner Walker, Xu Kaizhe, Ren Haoxing
- X-eval: Generalizable Multi-aspect Text Evaluation Via Augmented Instruction Tuning With Auxiliary Evaluation Aspects Liu Minqian, Shen Ying, Xu Zhiyang, Cao Yixin, Cho Eunah, Kumar Vaibhav, Ghanadan Reza, Huang Lifu
- From Zero To Hero: Examining The Power Of Symbolic Tasks In Instruction Tuning Liu Qian, Zhou Fan, Jiang Zhengbao, Dou Longxu, Lin Min
- Exploring The Boundaries Of GPT-4 In Radiology Liu Qianchu, Hyland Stephanie, Bannur Shruthi, Bouzid Kenza, Castro Daniel C., Wetscherek Maria Teodora, Tinn Robert, Sharma Harshita, Pérez-garcía Fernando, Schwaighofer Anton, Rajpurkar Pranav, Khanna Sameer Tajdin, Poon Hoifung, Usuyama Naoto, Thieme Anja, Nori Aditya V., Lungren Matthew P., Oktay Ozan, Alvarez-valle Javier
- Chatgpt-powered Conversational Drug Editing Using Retrieval And Domain Feedback Liu Shengchao, Wang Jiongxiao, Yang Yijin, Wang Chengpeng, Liu Ling, Guo Hongyu, Xiao Chaowei
- Goat: Fine-tuned Llama Outperforms GPT-4 On Arithmetic Tasks Liu Tiedong, Low Bryan Kian Hsiang
- Beyond Text: Unveiling Multimodal Proficiency Of Large Language Models With Multiapi Benchmark Liu Xiao, Lin Jianfeng, Zhang Jiawei
- Fingpt: Democratizing Internet-scale Data For Financial Large Language Models Liu Xiao-yang, Wang Guoxuan, Yang Hongyang, Zha Daochen
- BOLT: Fast Energy-based Controlled Text Generation With Tunable Biases Liu Xin, Khalifa Muhammad, Wang Lu
- Litcab: Lightweight Language Model Calibration Over Short- And Long-form Responses Liu Xin, Khalifa Muhammad, Wang Lu
- Parallel Sentence-level Explanation Generation For Real-world Low-resource Scenarios Liu Yan, Chen Xiaokang, Dai Qi
- Gumsum: Multi-genre Data And Evaluation For English Abstractive Summarization Liu Yang Janet, Zeldes Amir
- G-eval: NLG Evaluation Using GPT-4 With Better Human Alignment Liu Yang, Iter Dan, Xu Yichong, Wang Shuohang, Xu Ruochen, Zhu Chenguang
- Trustworthy Llms: A Survey And Guideline For Evaluating Large Language Models' Alignment Liu Yang, Yao Yuanshun, Ton Jean-francois, Zhang Xiaoying, Guo Ruocheng, Cheng Hao, Klochkov Yegor, Taufiq Muhammad Faaiz, Li Hang
- Modeling Uncertainty And Using Post-fusion As Fallback Improves Retrieval Augmented Generation With Llms Liu Ye, Yavuz Semih, Meng Rui, Moorthy Meghana, Joty Shafiq, Xiong Caiming, Zhou Yingbo
- Jailbreaking Chatgpt Via Prompt Engineering: An Empirical Study Liu Yi, Deng Gelei, Xu Zhengzi, Li Yuekang, Zheng Yaowen, Zhang Ying, Zhao Lida, Zhang Tianwei, Wang Kailong, Liu Yang
- Summary Of Chatgpt-related Research And Perspective Towards The Future Of Large Language Models Liu Yiheng, Han Tianle, Ma Siyuan, Zhang Jiayue, Yang Yuanyuan, Tian Jiaming, He Hao, Li Antong, He Mengshen, Liu Zhengliang, Wu Zihao, Zhao Lin, Zhu Dajiang, Li Xiang, Qiang Ning, Shen Dingang, Liu Tianming, Ge Bao
- Llms As Narcissistic Evaluators: When Ego Inflates Evaluation Scores Liu Yiqi, Moosavi Nafise Sadat, Lin Chenghua
- Evaluate What You Can't Evaluate: Unassessable Quality For Generated Response Liu Yongkang, Feng Shi, Wang Daling, Zhang Yifei, Schütze Hinrich
- Robustness Over Time: Understanding Adversarial Examples' Effectiveness On Longitudinal Versions Of Large Language Models Liu Yugeng, Cong Tianshuo, Zhao Zhengyu, Backes Michael, Shen Yun, Zhang Yang
- Binary And Ternary Natural Language Generation Liu Zechun, Oguz Barlas, Pappu Aasish, Shi Yangyang, Krishnamoorthi Raghuraman
- Molxpt: Wrapping Molecules With Text For Generative Pre-training Liu Zequn, Zhang Wei, Xia Yingce, Wu Lijun, Xie Shufang, Qin Tao, Zhang Ming, Liu Tie-yan
- Radiology-gpt: A Large Language Model For Radiology Liu Zhengliang, Zhong Aoxiao, Li Yiwei, Yang Longtao, Ju Chao, Wu Zihao, Ma Chong, Shu Peng, Chen Cheng, Kim Sekeun, Dai Haixing, Zhao Lin, Sun Lichao, Zhu Dajiang, Liu Jun, Liu Wei, Shen Dinggang, Li Xiang, Li Quanzheng, Liu Tianming
- Self-contradictory Reasoning Evaluation And Detection Liu Ziyi, Lee Isabelle, Du Yongkang, Sanyal Soumya, Zhao Jieyu
- Exploring Effectiveness Of GPT-3 In Grammatical Error Correction: A Study On Performance And Controllability In Prompt-based Methods Loem Mengsay, Kaneko Masahiro, Takase Sho, Okazaki Naoaki
- Harnessing Large Language Models' Empathetic Response Generation Capabilities For Online Mental Health Counselling Support Loh Siyuan Brandon, Raamkumar Aravind Sesagiri
- Discuss Before Moving: Visual Language Navigation Via Multi-expert Discussions Long Yuxing, Li Xiaoqi, Cai Wenzhe, Dong Hao
- Data-to-text Generation For Severely Under-resourced Languages With GPT-3.5: A Bit Of Help Needed From Google Translate Lorandi Michela, Belz Anya
- Chatgpt V Bard V Bing V Claude 2 V Aria V Human-expert. How Good Are AI Chatbots At Scientific Writing? Lozić Edisa, Štular Benjamin
- Bounding The Capabilities Of Large Language Models In Open Text Generation With Prompt Constraints Lu Albert, Zhang Hongxin, Zhang Yanzhe, Wang Xuezhi, Yang Diyi
- Chain-of-dictionary Prompting Elicits Translation In Large Language Models Lu Hongyuan, Yang Haoran, Huang Haoyang, Zhang Dongdong, Lam Wai, Wei Furu
- Unified-io 2: Scaling Autoregressive Multimodal Models With Vision, Language, Audio, And Action Lu Jiasen, Clark Christopher, Lee Sangho, Zhang Zichen, Khosla Savya, Marten Ryan, Hoiem Derek, Kembhavi Aniruddha
- Ziya-visual: Bilingual Large Vision-language Model Via Multi-task Instruction Tuning Lu Junyu, Zhang Dixiang, Wu Xiaojun, Gao Xinyu, Gan Ruyi, Zhang Jiaxing, Song Yan, Zhang Pingjian
- Large Language Models Can Be Guided To Evade Ai-generated Text Detection Lu Ning, Liu Shengcai, He Rui, Wang Qi, Ong Yew-soon, Tang Ke
- Mathvista: Evaluating Mathematical Reasoning Of Foundation Models In Visual Contexts Lu Pan, Bansal Hritik, Xia Tony, Liu Jiacheng, Li Chunyuan, Hajishirzi Hannaneh, Cheng Hao, Chang Kai-wei, Galley Michel, Gao Jianfeng
- Error Analysis Prompting Enables Human-like Translation Evaluation In Large Language Models Lu Qingyu, Qiu Baopu, Ding Liang, Zhang Kanjian, Kocmi Tom, Tao Dacheng
- Inference-time Policy Adapters (IPA): Tailoring Extreme-scale Lms Without Fine-tuning Lu Ximing, Brahman Faeze, West Peter, Jang Jaehun, Chandu Khyathi, Ravichander Abhilasha, Qin Lianhui, Ammanabrolu Prithviraj, Jiang Liwei, Ramnath Sahana, Dziri Nouha, Fisher Jillian, Lin Bill Yuchen, Hallinan Skyler, Ren Xiang, Welleck Sean, Choi Yejin
- SCITAB: A Challenging Benchmark For Compositional Reasoning And Claim Verification On Scientific Tables Lu Xinyuan, Pan Liangming, Liu Qian, Nakov Preslav, Kan Min-yen
- RTLLM: An Open-source Benchmark For Design RTL Generation With Large Language Model Lu Yao, Liu Shang, Zhang Qijun, Xie Zhiyao
- UI Layout Generation With Llms Guided By UI Grammar Lu Yuwen, Tong Ziang, Zhao Qinyi, Zhang Chengzhi, Li Toby Jia-jun
- Fighting Fire With Fire: The Dual Role Of Llms In Crafting And Detecting Elusive Disinformation Lucas Jason, Uchendu Adaku, Yamashita Michiharu, Lee Jooyoung, Rohatgi Shaurya, Lee Dongwon
- Explanation-based Finetuning Makes Models More Robust To Spurious Cues Ludan Josh Magnus, Meng Yixuan, Nguyen Tai, Shah Saurabh, Lyu Qing, Apidianaki Marianna, Callison-burch Chris
- Chatgpt And A New Academic Reality: Artificial Intelligence-written Research Papers And The Ethics Of The Large Language Models In Scholarly Publishing Lund Brady, Wang Ting, Mannuru Nishith Reddy, Nie Bing, Shimray Somipam, Wang Ziang
- Enhancing Black-box Few-shot Text Classification With Prompt-based Data Augmentation Luo Danqing, Zhang Chen, Xu Jiahui, Wang Bin, Chen Yiming, Zhang Yan, Li Haizhou
- Wizardmath: Empowering Mathematical Reasoning For Large Language Models Via Reinforced Evol-instruct Luo Haipeng, Sun Qingfeng, Xu Can, Zhao Pu, Lou Jianguang, Tao Chongyang, Geng Xiubo, Lin Qingwei, Chen Shifeng, Zhang Dongmei
- Exploring Small Language Models With Prompt-learning Paradigm For Efficient Domain-specific Text Classification Luo Hengyu, Liu Peng, Esping Stefan
- Systematic Assessment Of Factual Knowledge In Large Language Models Luo Linhao, Vu Thuy-trang, Phung Dinh, Haffari Gholamreza
- Biomedgpt: Open Multimodal Generative Pre-trained Transformer For Biomedicine Luo Yizhen, Zhang Jiahuan, Fan Siqi, Yang Kai, Wu Yushuai, Qiao Mu, Nie Zaiqing
- Chatgpt As A Factual Inconsistency Evaluator For Text Summarization Luo Zheheng, Xie Qianqian, Ananiadou Sophia
- Fingpt: Large Generative Models For A Small Language Luukkonen Risto, Komulainen Ville, Luoma Jouni, Eskelinen Anni, Kanerva Jenna, Kupari Hanna-mari, Ginter Filip, Laippala Veronika, Muennighoff Niklas, Piktus Aleksandra, Wang Thomas, Tazi Nouamane, Scao Teven Le, Wolf Thomas, Suominen Osma, Sairanen Samuli, Merioksa Mikko, Heinonen Jyrki, Vahtola Aija, Antao Samuel, Pyysalo Sampo
- Are We Falling In A Middle-intelligence Trap? An Analysis And Mitigation Of The Reversal Curse Lv Ang, Zhang Kaiyi, Xie Shufang, Tu Quan, Chen Yuhan, Wen Ji-rong, Yan Rui
- KOSMOS-2.5: A Multimodal Literate Model Lv Tengchao, Huang Yupan, Chen Jingye, Zhao Yuzhong, Jia Yilin, Cui Lei, Ma Shuming, Chang Yaoyao, Huang Shaohan, Wang Wenhui, Dong Li, Luo Weiyao, Wu Shaoxiang, Wang Guoxin, Zhang Cha, Wei Furu
- Comparative Analysis Of Drug-gpt And Chatgpt Llms For Healthcare Insights: Evaluating Accuracy And Relevance In Patient And HCP Contexts Lysandrou Giorgos, Owen Roma English, Mursec Kirsty, Brun Grant Le, Fairley Elizabeth A. L.
- A Paradigm Shift: The Future Of Machine Translation Lies With Large Language Models Lyu Chenyang, Du Zefeng, Xu Jitao, Duan Yitao, Wu Minghao, Lynn Teresa, Aji Alham Fikri, Wong Derek F., Liu Siyou, Wang Longyue
- Translating Radiology Reports Into Plain Language Using Chatgpt And GPT-4 With Prompt Learning: Promising Results, Limitations, And Potential Lyu Qing, Tan Josh, Zapadka Michael E., Ponnatapura Janardhana, Niu Chuang, Myers Kyle J., Wang Ge, Whitlow Christopher T.
- Vista-llama: Reliable Video Narrator Via Equal Distance To Visual Tokens Ma Fan, Jin Xiaojie, Wang Heng, Xian Yuchen, Feng Jiashi, Yang Yi
- Fairness-guided Few-shot Prompting For Large Language Models Ma Huan, Zhang Changqing, Bian Yatao, Liu Lemao, Zhang Zhirui, Zhao Peilin, Zhang Shu, Fu Huazhu, Hu Qinghua, Wu Bingzhe
- "oops, Did I Just Say That?" Testing And Repairing Unethical Suggestions Of Large Language Models With Suggest-critique-reflect Process Ma Pingchuan, Li Zongjie, Sun Ao, Wang Shuai
- An Examination Of The Compositionality Of Large Generative Vision-language Models Ma Teli, Li Rong, Liang Junwei
- Let's Do A Thought Experiment: Using Counterfactuals To Improve Moral Reasoning Ma Xiao, Mishra Swaroop, Beirami Ahmad, Beutel Alex, Chen Jilin
- Tomchallenges: A Principle-guided Dataset And Diverse Evaluation Tasks For Exploring Theory Of Mind Ma Xiaomeng, Gao Lingyu, Xu Qihui
- Non-autoregressive Streaming Transformer For Simultaneous Translation Ma Zhengrui, Zhang Shaolei, Guo Shoutao, Shao Chenze, Zhang Min, Feng Yang
- Video-chatgpt: Towards Detailed Video Understanding Via Large Vision And Language Models Maaz Muhammad, Rasheed Hanoona, Khan Salman, Khan Fahad Shahbaz
- Self-refine: Iterative Refinement With Self-feedback Madaan Aman, Tandon Niket, Gupta Prakhar, Hallinan Skyler, Gao Luyu, Wiegreffe Sarah, Alon Uri, Dziri Nouha, Prabhumoye Shrimai, Yang Yiming, Gupta Shashank, Majumder Bodhisattwa Prasad, Hermann Katherine, Welleck Sean, Yazdanbakhsh Amir, Clark Peter
- Enhancing Conversational Quality In Language Learning Chatbots: An Evaluation Of GPT4 For ASR Error Correction Mai Long, Carson-berndsen Julie
- Harnessing The Power Of Prompt-based Techniques For Generating School-level Questions Using Large Language Models Maity Subhankar, Deroy Aniket, Sarkar Sudeshna
- How Can Large Language Models Help Humans In Design And Manufacturing? Makatura Liane, Foshey Michael, Wang Bohan, Hähnlein Felix, Ma Pingchuan, Deng Bolei, Tjandrasuwita Megan, Spielberg Andrew, Owens Crystal Elaine, Chen Peter Yichen, Zhao Allan, Zhu Amy, Norton Wil J, Gu Edward, Jacob Joshua, Li Yifei, Schulz Adriana, Matusik Wojciech
- Fine-tuning Language Models With Just Forward Passes Malladi Sadhika, Gao Tianyu, Nichani Eshaan, Damian Alex, Lee Jason D., Chen Danqi, Arora Sanjeev
- Enhancing CLIP With GPT-4: Harnessing Visual Descriptions As Prompts Maniparambil Mayug, Vorster Chris, Molloy Derek, Murphy Noel, Mcguinness Kevin, O'connor Noel E.
- Gpteval: A Survey On Assessments Of Chatgpt And GPT-4 Mao Rui, Chen Guanyi, Zhang Xulang, Guerin Frank, Cambria Erik
- Editing Personality For Large Language Models Mao Shengyu, Wang Xiaohan, Wang Mengru, Jiang Yong, Xie Pengjun, Huang Fei, Zhang Ningyu
- Interpreting Learned Feedback Patterns In Large Language Models Marks Luke, Abdullah Amir, Neo Clement, Arike Rauno, Krueger David, Torr Philip, Barez Fazl
- Establishing Vocabulary Tests As A Benchmark For Evaluating Large Language Models Martínez Gonzalo, Conde Javier, Merino-gómez Elena, Bermúdez-margaretto Beatriz, Hernández José Alberto, Reviriego Pedro, Brysbaert Marc
- Chatgpt Vs State-of-the-art Models: A Benchmarking Study In Keyphrase Generation Task Martínez-cruz Roberto, López-lópez Alvaro J., Portela José
- Investigating The Efficacy Of Large Language Models In Reflective Assessment Methods Through Chain Of Thoughts Prompting Masikisiki Baphumelele, Marivate Vukosi, Hlope Yvette
- Benchmarking Large Language Model Capabilities For Conditional Generation Maynez Joshua, Agrawal Priyanka, Gehrmann Sebastian
- Sources Of Hallucination By Large Language Models On Inference Tasks Mckenna Nick, Li Tianyi, Cheng Liang, Hosseini Mohammad Javad, Johnson Mark, Steedman Mark
- Lightlm: A Lightweight Deep And Narrow Language Model For Generative Recommendation Mei Kai, Zhang Yongfeng
- For Generated Text, Is Nli-neutral Text The Best Text? Mersinias Michail, Mahowald Kyle
- From Bytes To Biases: Investigating The Cultural Self-perception Of Large Language Models Messner Wolfgang, Greene Tatum, Matalone Josephine
- An AI Chatbot For Explaining Deep Reinforcement Learning Decisions Of Service-oriented Systems Metzger Andreas, Bartel Jone, Laufer Jan
- An Empirical Study Of Netops Capability Of Pre-trained Large Language Models Miao Yukai, Bai Yu, Chen Li, Li Dan, Sun Haifeng, Wang Xizheng, Luo Ziqiu, Ren Yanyu, Sun Dapeng, Xu Xiuting, Zhang Qi, Xiang Chao, Li Xinchi
- Self-checker: Plug-and-play Modules For Fact-checking With Large Language Models Miaoran Li, Baolin Peng, Michel Galley, Jianfeng Gao, Zhu Zhang
- Uzh_clyp At Semeval-2023 Task 9: Head-first Fine-tuning And Chatgpt Data Generation For Cross-lingual Learning In Tweet Intimacy Prediction Michail Andrianos, Konstantinou Stefanos, Clematide Simon
- Beyond Scale: The Diversity Coefficient As A Data Quality Metric For Variability In Natural Language Data Miranda Brando, Lee Alycia, Sundar Sudharsan, Casasola Allison, Koyejo Sanmi
- Stress Testing Chain-of-thought Prompting For Large Language Models Mishra Aayush, Thakkar Karan
- Synthetic Imitation Edit Feedback For Factual Alignment In Clinical Summarization Mishra Prakamya, Yao Zonghai, Chen Shuwei, Wang Beining, Mittal Rohan, Yu Hong
- Comparing Humans, GPT-4, And GPT-4V On Abstraction And Reasoning Tasks Mitchell Melanie, Palmarini Alessandro B., Moskvichev Arseny
- Boosting Theory-of-mind Performance In Large Language Models Via Prompting Moghaddam Shima Rahimi, Honey Christopher J.
- Chatgpt In The Age Of Generative AI And Large Language Models: A Concise Survey Mohamadi Salman, Mujtaba Ghulam, Le Ngan, Doretto Gianfranco, Adjeroh Donald A.
- Landmark Attention: Random-access Infinite Context Length For Transformers Mohtashami Amirkeivan, Jaggi Martin
- Evaluating Cognitive Maps And Planning In Large Language Models With Cogeval Momennejad Ida, Hasanbeig Hosein, Vieira Felipe, Sharma Hiteshi, Ness Robert Osazuwa, Jojic Nebojsa, Palangi Hamid, Larson Jonathan
- Narrativexl: A Large-scale Dataset For Long-term Memory Models Moskvichev Arseny, Mai Ky-vinh
- Fine-tuning Large Language Models For Adaptive Machine Translation Moslem Yasmin, Haque Rejwanul, Way Andy
- The Impact Of Artificial Intelligence On The Evolution Of Digital Education: A Comparative Study Of Openai Text Generation Tools Including Chatgpt, Bing Chat, Bard, And Ernie Motlagh Negin Yazdani, Khajavi Matin, Sharifi Abbas, Ahmadi Mohsen
- Can Llms Facilitate Interpretation Of Pre-trained Language Models? Mousi Basel, Durrani Nadir, Dalvi Fahim
- Navigating Prompt Complexity For Zero-shot Classification: A Study Of Large Language Models In Computational Social Science Mu Yida, Wu Ben P., Thorne William, Robinson Ambrose, Aletras Nikolaos, Scarton Carolina, Bontcheva Kalina, Song Xingyi
- In-context Learning Generalizes, But Not Always Robustly: The Case Of Syntax Mueller Aaron, Webson Albert, Petty Jackson, Linzen Tal
- "medium" Lms Of Code In The Era Of Llms: Lessons From Stackoverflow Mukherjee Manisha, Hellendoorn Vincent J.
- Orca: Progressive Learning From Complex Explanation Traces Of GPT-4 Mukherjee Subhabrata, Mitra Arindam, Jawahar Ganesh, Agarwal Sahaj, Palangi Hamid, Awadallah Ahmed
- Superhf: Supervised Iterative Learning From Human Feedback Mukobi Gabriel, Chatain Peter, Fong Su, Windesheim Robert, Kutyniok Gitta, Bhatia Kush, Alberti Silas
- Arabic Mini-climategpt: A Climate Change And Sustainability Tailored Arabic LLM Mullappilly Sahal Shaji, Shaker Abdelrahman, Thawakar Omkar, Cholakkal Hisham, Anwer Rao Muhammad, Khan Salman, Khan Fahad Shahbaz
- LLM4VV: Developing Llm-driven Testsuite For Compiler Validation Munley Christian, Jarmusch Aaron, Chandrasekaran Sunita
- Testing Llms On Code Generation With Varying Levels Of Prompt Specificity Murr Lincoln, Grainger Morgan, Gao David
- Consistency Analysis Of Chatgpt Myeongjun Erik Jang, Thomas Lukasiewicz
- Self-contradictory Hallucinations Of Large Language Models: Evaluation, Detection And Mitigation Mündler Niels, He Jingxuan, Jenko Slobodan, Vechev Martin
- Testing Language Model Agents Safely In The Wild Naihin Silen, Atkinson David, Green Marc, Hamadi Merwane, Swift Craig, Schonholtz Douglas, Kalai Adam Tauman, Bau David
- Generating Medically-accurate Summaries Of Patient-provider Dialogue: A Multi-stage Approach Using Large Language Models Nair Varun, Schumacher Elliot, Kannan Anitha
- DERA: Enhancing Large Language Model Completions With Dialog-enabled Resolving Agents Nair Varun, Schumacher Elliot, Tso Geoffrey, Kannan Anitha
- Using An LLM To Help With Code Understanding Nam Daye, Macvean Andrew, Hellendoorn Vincent, Vasilescu Bogdan, Myers Brad
- Breaking Language Barriers With A LEAP: Learning Strategies For Polyglot Llms Nambi Akshay, Balloli Vaibhav, Ranjit Mercy, Ganu Tanuja, Ahuja Kabir, Sitaram Sunayana, Bali Kalika
- On Evaluating The Integration Of Reasoning And Action In LLM Agents With Database Question Answering Nan Linyong, Zhang Ellen, Zou Weijin, Zhao Yilun, Zhou Wenfei, Cohan Arman
- HALO: An Ontology For Representing And Categorizing Hallucinations In Large Language Models Nananukul Navapat, Kejriwal Mayank
- Cheaply Evaluating Inference Efficiency Metrics For Autoregressive Transformer Apis Narayanan Deepak, Santhanam Keshav, Henderson Peter, Bommasani Rishi, Lee Tony, Liang Percy
- Scalable Extraction Of Training Data From (production) Language Models Nasr Milad, Carlini Nicholas, Hayase Jonathan, Jagielski Matthew, Cooper A. Feder, Ippolito Daphne, Choquette-choo Christopher A., Wallace Eric, Tramèr Florian, Lee Katherine
- Clever Hans Or Neural Theory Of Mind? Stress Testing Social Reasoning In Large Language Models Natalie Shapira, Mosh Levy, Seyed Hossein Alavi, Xuhui Zhou, Yejin Choi, Yoav Goldberg, Maarten Sap, Vered Shwartz
- Aligning A Medium-size GPT Model In English To A Small Closed Domain In Spanish Navarrete-parra Oscar R., Uc-cetina Victor, Reyes-magana Jorge
- Large Language Models As Tax Attorneys: A Case Study In Legal Capabilities Emergence Nay John J., Karamardian David, Lawsky Sarah B., Tao Wenting, Bhat Meghana, Jain Raghav, Lee Aaron Travis, Choi Jonathan H., Kasai Jungo
- Vision Encoder-decoder Models For AI Coaching Nayak Jyothi S, Khan Afifah Khan Mohammed Ajmal, Manjeshwar Chirag, Banday Imadh Ajaz
- On Robustness Of Finetuned Transformer-based NLP Models Neerudu Pavan Kalyan Reddy, Oota Subba Reddy, Marreddy Mounika, Kagita Venkateswara Rao, Gupta Manish
- Exploring The Maze Of Multilingual Modeling Nezhad Sina Bagheri, Agrawal Ameeta
- Meet In The Middle: A New Pre-training Paradigm Nguyen Anh, Karampatziakis Nikos, Chen Weizhu
- Evaluating The Symbol Binding Ability Of Large Language Models For Multiple-choice Questions In Vietnamese General Education Nguyen Duc-vu, Nguyen Quoc-nam
- Black-box Analysis: Gpts Across Time In Legal Textual Entailment Task Nguyen Ha-thanh, Goebel Randy, Toni Francesca, Stathis Kostas, Satoh Ken
- Large Language Models For In-context Student Modeling: Synthesizing Student's Behavior In Visual Programming Nguyen Manh Hung, Tschiatschek Sebastian, Singla Adish
- When Giant Language Brains Just Aren't Enough! Domain Pizzazz With Knowledge Sparkle Dust Nguyen Minh-tien, Nguyen Duy-hung, Sabahi Shahab, Le Hung, Yang Jeff, Hotta Hajime
- Democratizing Llms For Low-resource Languages By Leveraging Their English Dominant Abilities With Linguistically-diverse Prompts Nguyen Xuan-phi, Aljunied Sharifah Mahani, Joty Shafiq, Bing Lidong
- Seallms -- Large Language Models For Southeast Asia Nguyen Xuan-phi, Zhang Wenxuan, Li Xin, Aljunied Mahani, Hu Zhiqiang, Shen Chenhui, Chia Yew Ken, Li Xingxuan, Wang Jianyu, Tan Qingyu, Cheng Liying, Chen Guanzheng, Deng Yue, Yang Sen, Liu Chaoqun, Zhang Hang, Bing Lidong
- Gpt4image: Can Large Pre-trained Models Help Vision Models On Perception Tasks? Ning Ding, Yehui Tang, Zhongqian Fu, Chao Xu, Kai Han, Yunhe Wang
- Ragtruth: A Hallucination Corpus For Developing Trustworthy Retrieval-augmented Language Models Niu Cheng, Wu Yuanhao, Zhu Juno, Xu Siliang, Shum Kashun, Zhong Randy, Song Juntong, Zhang Tong
- Reflexion: Language Agents With Verbal Reinforcement Learning Noah Shinn, Federico Cassano, Edward Berman, Ashwin Gopinath, Karthik Narasimhan, Shunyu Yao
- Evaluating AI Vocational Skills Through Professional Testing Noever David, Ciolino Matt
- AI Text-to-behavior: A Study In Steerability Noever David, Hyams Sam
- Chatbots As Problem Solvers: Playing Twenty Questions With Role Reversals Noever David, Mckee Forrest
- Numeracy From Literacy: Data Science As An Emergent Skill From Large Language Models Noever David, Mckee Forrest
- Multimodal Analysis Of Google Bard And Gpt-vision: Experiments In Visual Reasoning Noever David, Noever Samantha Elizabeth Miller
- Visual AI And Linguistic Intelligence Through Steerability And Composability Noever David, Noever Samantha Elizabeth Miller
- Capabilities Of GPT-4 On Medical Challenge Problems Nori Harsha, King Nicholas, Mckinney Scott Mayer, Carignan Dean, Horvitz Eric
- Evaluating GPT-3.5 And GPT-4 Models On Brazilian University Admission Exams Nunes Desnes, Primi Ricardo, Pires Ramon, Lotufo Roberto, Nogueira Rodrigo
- Contrastive Decoding Improves Reasoning In Large Language Models O'brien Sean, Lewis Mike
- Adversarial Fine-tuning Of Language Models: An Iterative Optimisation Approach For The Generation And Detection Of Problematic Content O'neill Charles, Miller Jack, Ciuca Ioana, Ting Yuan-sen, Bui Thang
- In-contextual Gender Bias Suppression For Large Language Models Oba Daisuke, Kaneko Masahiro, Bollegala Danushka
- Are Large Language Models Fit For Guided Reading? Ochieng Peter
- Comparative Analysis Of CHATGPT And The Evolution Of Language Models Ogundare Oluwatosin, Araya Gustavo Quiros
- Data Augmentation For Neural Machine Translation Using Generative Language Model Oh Seokjin, Lee Su Ah, Jung Woohwan
- Large Language Model (LLM) Bias Index -- LLMBI Oketunji Abiodun Finbarrs, Anas Muhammad, Saina Deepthi
- LINC: A Neurosymbolic Approach For Logical Reasoning By Combining Language Models With First-order Logic Provers Olausson Theo X., Gu Alex, Lipkin Benjamin, Zhang Cedegao E., Solar-lezama Armando, Tenenbaum Joshua B., Levy Roger
- Is Self-repair A Silver Bullet For Code Generation? Olausson Theo X., Inala Jeevana Priya, Wang Chenglong, Gao Jianfeng, Solar-lezama Armando
- Generative AI: Implications And Applications For Education Olga Anastasia Olnancy Tzirides, Saini Akash, Zapata Gabriela, Searsmith Duane, Cope Bill, Kalantzis Mary, Castro Vania, Kourkoulou Theodora, Jones John, Da Silva Rodrigo Abrantes, Whiting Jen, Kastania Nikoleta Polyxeni
- GPT-4 Technical Report Openai, Josh Achiam, Steven Adler, Sandhini Agarwal, Lama Ahmad, Ilge Akkaya, Florencia Leoni Aleman, Diogo Almeida, Janko Altenschmidt, Sam Altman, Shyamal Anadkat, Red Avila, Igor Babuschkin, Suchir Balaji, Valerie Balcom, Paul Baltescu, Haiming Bao, Mohammad Bavarian, Jeff Belgum, Irwan Bello, Jake Berdine, Gabriel Bernadett-shapiro, Christopher Berner, Lenny Bogdonoff, Oleg Boiko, Madelaine Boyd, Anna-luisa Brakman, Greg Brockman, Tim Brooks, Miles Brundage, Kevin Button, Trevor Cai, Rosie Campbell, Andrew Cann, Brittany Carey, Chelsea Carlson, Rory Carmichael, Brooke Chan, Che Chang, Fotis Chantzis, Derek Chen, Sully Chen, Ruby Chen, Jason Chen, Mark Chen, Ben Chess, Chester Cho, Casey Chu, Hyung Won Chung, Dave Cummings, Jeremiah Currier, Yunxing Dai, Cory Decareaux, Thomas Degry, Noah Deutsch, Damien Deville, Arka Dhar, David Dohan, Steve Dowling, Sheila Dunning, Adrien Ecoffet, Atty Eleti, Tyna Eloundou, David Farhi, Liam Fedus, Niko Felix, Simón Posada Fishman, Juston Forte, Isabella Fulford, Leo Gao, Elie Georges, Christian Gibson, Vik Goel, Tarun Gogineni, Gabriel Goh, Rapha Gontijo-lopes, Jonathan Gordon, Morgan Grafstein, Scott Gray, Ryan Greene, Joshua Gross, Shixiang Shane Gu, Yufei Guo, Chris Hallacy, Jesse Han, Jeff Harris, Yuchen He, Mike Heaton, Johannes Heidecke, Chris Hesse, Alan Hickey, Wade Hickey, Peter Hoeschele, Brandon Houghton, Kenny Hsu, Shengli Hu, Xin Hu, Joost Huizinga, Shantanu Jain, Shawn Jain, Joanne Jang, Angela Jiang, Roger Jiang, Haozhun Jin, Denny Jin, Shino Jomoto, Billie Jonn, Heewoo Jun, Tomer Kaftan, Łukasz Kaiser, Ali Kamali, Ingmar Kanitscheider, Nitish Shirish Keskar, Tabarak Khan, Logan Kilpatrick, Jong Wook Kim, Christina Kim, Yongjik Kim, Jan Hendrik Kirchner, Jamie Kiros, Matt Knight, Daniel Kokotajlo, Łukasz Kondraciuk, Andrew Kondrich, Aris Konstantinidis, Kyle Kosic, Gretchen Krueger, Vishal Kuo, Michael Lampe, Ikai Lan, Teddy Lee, Jan Leike, Jade Leung, Daniel Levy, Chak Ming Li, Rachel Lim, Molly Lin, Stephanie Lin, Mateusz Litwin, Theresa Lopez, Ryan Lowe, Patricia Lue, Anna Makanju, Kim Malfacini, Sam Manning, Todor Markov, Yaniv Markovski, Bianca Martin, Katie Mayer, Andrew Mayne, Bob Mcgrew, Scott Mayer Mckinney, Christine Mcleavey, Paul Mcmillan, Jake Mcneil, David Medina, Aalok Mehta, Jacob Menick, Luke Metz, Andrey Mishchenko, Pamela Mishkin, Vinnie Monaco, Evan Morikawa, Daniel Mossing, Tong Mu, Mira Murati, Oleg Murk, David Mély, Ashvin Nair, Reiichiro Nakano, Rajeev Nayak, Arvind Neelakantan, Richard Ngo, Hyeonwoo Noh, Long Ouyang, Cullen O'keefe, Jakub Pachocki, Alex Paino, Joe Palermo, Ashley Pantuliano, Giambattista Parascandolo, Joel Parish, Emy Parparita, Alex Passos, Mikhail Pavlov, Andrew Peng, Adam Perelman, Filipe De Avila Belbute Peres, Michael Petrov, Henrique Ponde De Oliveira Pinto, Michael Pokorny, Michelle Pokrass, Vitchyr H. Pong, Tolly Powell, Alethea Power, Boris Power, Elizabeth Proehl, Raul Puri, Alec Radford, Jack Rae, Aditya Ramesh, Cameron Raymond, Francis Real, Kendra Rimbach, Carl Ross, Bob Rotsted, Henri Roussez, Nick Ryder, Mario Saltarelli, Ted Sanders, Shibani Santurkar, Girish Sastry, Heather Schmidt, David Schnurr, John Schulman, Daniel Selsam, Kyla Sheppard, Toki Sherbakov, Jessica Shieh, Sarah Shoker, Pranav Shyam, Szymon Sidor, Eric Sigler, Maddie Simens, Jordan Sitkin, Katarina Slama, Ian Sohl, Benjamin Sokolowsky, Yang Song, Natalie Staudacher, Felipe Petroski Such, Natalie Summers, Ilya Sutskever, Jie Tang, Nikolas Tezak, Madeleine B. Thompson, Phil Tillet, Amin Tootoonchian, Elizabeth Tseng, Preston Tuggle, Nick Turley, Jerry Tworek, Juan Felipe Cerón Uribe, Andrea Vallone, Arun Vijayvergiya, Chelsea Voss, Carroll Wainwright, Justin Jay Wang, Alvin Wang, Ben Wang, Jonathan Ward, Jason Wei, Cj Weinmann, Akila Welihinda, Peter Welinder, Jiayi Weng, Lilian Weng, Matt Wiethoff, Dave Willner, Clemens Winter, Samuel Wolrich, Hannah Wong, Lauren Workman, Sherwin Wu, Jeff Wu, Michael Wu, Kai Xiao, Tao Xu, Sarah Yoo, Kevin Yu, Qiming Yuan, Wojciech Zaremba, Rowan Zellers, Chong Zhang, Marvin Zhang, Shengjia Zhao, Tianhao Zheng, Juntang Zhuang, William Zhuk, Barret Zoph
- Detecting Llm-generated Text In Computing Education: A Comparative Study For Chatgpt Cases Orenstrakh Michael Sheinman, Karnalim Oscar, Suarez Carlos Anibal, Liut Michael
- Thoughtsource: A Central Hub For Large Language Model Reasoning Data Ott Simon, Hebenstreit Konstantin, Liévin Valentin, Hother Christoffer Egeberg, Moradi Milad, Mayrhauser Maximilian, Praas Robert, Winther Ole, Samwald Matthias
- Dialogbench: Evaluating Llms As Human-like Dialogue Systems Ou Jiao, Lu Junda, Liu Che, Tang Yihong, Zhang Fuzheng, Zhang Di, Gai Kun
- The Shifted And The Overlooked: A Task-oriented Investigation Of User-gpt Interactions Ouyang Siru, Wang Shuohang, Liu Yang, Zhong Ming, Jiao Yizhu, Iter Dan, Pryzant Reid, Zhu Chenguang, Ji Heng, Han Jiawei
- Structured Chemistry Reasoning With Large Language Models Ouyang Siru, Zhang Zhuosheng, Yan Bing, Liu Xuan, Choi Yejin, Han Jiawei, Qin Lianhui
- How To Catch An AI Liar: Lie Detection In Black-box Llms By Asking Unrelated Questions Pacchiardi Lorenzo, Chan Alex J., Mindermann Sören, Moscovitz Ilan, Pan Alexa Y., Gal Yarin, Evans Owain, Brauner Jan
- Med-halt: Medical Domain Hallucination Test For Large Language Models Pal Ankit, Umapathi Logesh Kumar, Sankarasubbu Malaikannan
- Ontochatgpt Information System: Ontology-driven Structured Prompts For Chatgpt Meta-learning Palagin Oleksandr, Kaverinskiy Vladislav, Litvin Anna, Malakhov Kyrylo
- Kwaiagents: Generalized Information-seeking Agent System With Large Language Models Pan Haojie, Zhai Zepeng, Yuan Hao, Lv Yaojia, Fu Ruiji, Liu Ming, Wang Zhongyuan, Qin Bing
- What In-context Learning "learns" In-context: Disentangling Task Recognition And Task Learning Pan Jane, Gao Tianyu, Chen Howard, Chen Danqi
- COSMIC: Data Efficient Instruction-tuning For Speech In-context Learning Pan Jing, Wu Jian, Gaur Yashesh, Sivasankaran Sunit, Chen Zhuo, Liu Shujie, Li Jinyu
- Chameleon: Plug-and-play Compositional Reasoning With Large Language Models Pan Lu, Baolin Peng, Hao Cheng, Michel Galley, Kai-wei Chang, Ying Nian Wu, Song-chun Zhu, Jianfeng Gao
- A Preliminary Evaluation Of Chatgpt For Zero-shot Dialogue Understanding Pan Wenbo, Chen Qiguang, Xu Xiao, Che Wanxiang, Qin Libo
- Automating Customer Service Using Langchain: Building Custom Open-source GPT Chatbot For Organizations Pandya Keivalya, Holia Mehfuza
- Automated Annotation With Generative AI Requires Validation Pangakis Nicholas, Wolken Samuel, Fasching Neil
- Abductive Reasoning With The GPT-4 Language Model: Case Studies From Criminal Investigation, Medical Practice, Scientific Research Pareschi Remo
- On The Analysis Of Cross-lingual Prompt Tuning For Decoder-based Multilingual Model Park Nohil, Park Joonsuk, Yoo Kang Min, Yoon Sungroh
- 1.5 Million Materials Narratives Generated By Chatbots Park Yang Jeong, Jerng Sung Eun, Park Jin-sung, Kwon Choah, Hsu Chia-wei, Ren Zhichu, Yoon Sungroh, Li Ju
- A Large Language Model Approach To Educational Survey Feedback Analysis Parker Michael J., Anderson Caitlin, Stone Claire, Oh Yearim
- Longbox: Evaluating Transformers On Long-sequence Clinical Tasks Parmar Mihir, Naik Aakanksha, Gupta Himanshu, Agrawal Disha, Baral Chitta
- Gorilla: Large Language Model Connected With Massive Apis Patil Shishir G., Zhang Tianjun, Wang Xin, Gonzalez Joseph E.
- REFINER: Reasoning Feedback On Intermediate Representations Paul Debjit, Ismayilzada Mete, Peyrard Maxime, Borges Beatriz, Bosselut Antoine, West Robert, Faltings Boi
- Instructexcel: A Benchmark For Natural Language Instruction In Excel Payan Justin, Mishra Swaroop, Singh Mukul, Negreanu Carina, Poelitz Christian, Baral Chitta, Roy Subhro, Chakravarthy Rasika, Van Durme Benjamin, Nouri Elnaz
- Entity Matching Using Large Language Models Peeters Ralph, Bizer Christian
- GPT Self-supervision For A Better Data Annotator Pei Xiaohuan, Li Yanxi, Xu Chang
- Evaluation Of Medium-large Language Models At Zero-shot Closed Book Generative Question Answering Peinl René, Wirth Johannes
- Exploiting Novel GPT-4 Apis Pelrine Kellin, Taufeeque Mohammad, Zając Michał, Mclean Euan, Gleave Adam
- Chatgpt Prompting Cannot Estimate Predictive Uncertainty In High-resource Languages Pelucchi Martino, Valdenegro-toro Matias
- Instruction Tuning With GPT-4 Peng Baolin, Li Chunyuan, He Pengcheng, Galley Michel, Gao Jianfeng
- A Study Of Generative Large Language Model For Medical Research And Healthcare Peng Cheng, Yang Xi, Chen Aokun, Smith Kaleb E, Pournejatian Nima, Costa Anthony B, Martin Cheryl, Flores Mona G, Zhang Ying, Magoc Tanja, Lipori Gloria, Mitchell Duane A, Ospina Naykky S, Ahmed Mustafa M, Hogan William R, Shenkman Elizabeth A, Guo Yi, Bian Jiang, Wu Yonghui
- Generative Large Language Models Are All-purpose Text Analytics Engines: Text-to-text Learning Is All Your Need Peng Cheng, Yang Xi, Chen Aokun, Yu Zehao, Smith Kaleb E, Costa Anthony B, Flores Mona G, Bian Jiang, Wu Yonghui
- FP8-LM: Training FP8 Large Language Models Peng Houwen, Wu Kan, Wei Yixuan, Zhao Guoshuai, Yang Yuxiang, Liu Ze, Xiong Yifan, Yang Ziyue, Ni Bolin, Hu Jingcheng, Li Ruihang, Zhang Miaosen, Li Chen, Ning Jia, Wang Ruizhe, Zhang Zheng, Liu Shuguang, Chau Joe, Hu Han, Cheng Peng
- Spoken Language Intelligence Of Large Language Models For Language Learning Peng Linkai, Nuchged Baorian, Gao Yingming
- Large Language Models Can Infer Psychological Dispositions Of Social Media Users Peters Heinrich, Matz Sandra
- Learning From Free-text Human Feedback -- Collect New Datasets Or Extend Existing Ones? Petrak Dominic, Moosavi Nafise Sadat, Tian Ye, Rozanov Nikolai, Gurevych Iryna
- Zero-shot Question Answering Over Financial Documents Using Large Language Models Phogat Karmvir Singh, Harsha Chetan, Dasaratha Sridhar, Ramakrishna Shashishekar, Puranam Sai Akhil
- Generative AI For Programming Education: Benchmarking Chatgpt, GPT-4, And Human Tutors Phung Tung, Pădurean Victor-alexandru, Cambronero José, Gulwani Sumit, Kohn Tobias, Majumdar Rupak, Singla Adish, Soares Gustavo
- Automating Human Tutor-style Programming Feedback: Leveraging GPT-4 Tutor Model For Hint Generation And GPT-3.5 Student Model For Hint Validation Phung Tung, Pădurean Victor-alexandru, Singh Anjali, Brooks Christopher, Cambronero José, Gulwani Sumit, Singla Adish, Soares Gustavo
- LLM Self Defense: By Self Examination, Llms Know They Are Being Tricked Phute Mansi, Helbling Alec, Hull Matthew, Peng Shengyun, Szyller Sebastian, Cornelius Cory, Chau Duen Horng
- Jatmo: Prompt Injection Defense By Task-specific Finetuning Piet Julien, Alrashed Maha, Sitawarin Chawin, Chen Sizhe, Wei Zeming, Sun Elizabeth, Alomair Basel, Wagner David
- Typhoon: Thai Large Language Models Pipatanakul Kunat, Jirabovonvisut Phatrasek, Manakul Potsawee, Sripaisarnmongkol Sittipong, Patomwong Ruangsak, Chokchainant Pathomporn, Tharnpipitchai Kasima
- Sabiá: Portuguese Large Language Models Pires Ramon, Abonizio Hugo, Almeida Thales Sales, Nogueira Rodrigo
- Evaluating Gpt-4's Vision Capabilities On Brazilian University Admission Exams Pires Ramon, Almeida Thales Sales, Abonizio Hugo, Nogueira Rodrigo
- LLM4TDD: Best Practices For Test Driven Development Using Large Language Models Piya Sanyogita, Sullivan Allison
- Three Ways Of Using Large Language Models To Evaluate Chat Plátek Ondřej, Hudeček Vojtěch, Schmidtová Patricia, Lango Mateusz, Dušek Ondřej
- Ai-assisted Coding: Experiments With GPT-4 Poldrack Russell A, Lu Thomas, Beguš Gašper
- Selfcheckgpt: Zero-resource Black-box Hallucination Detection For Generative Large Language Models Potsawee Manakul, Adian Liusie, Mark J. F. Gales
- Transformers In The Service Of Description Logic-based Contexts Poulis Angelos, Tsalapati Eleni, Koubarakis Manolis
- Science In The Era Of Chatgpt, Large Language Models And Generative AI: Challenges For Research Ethics And How To Respond Pournaras Evangelos
- CORE-GPT: Combining Open Access Research And Large Language Models For Credible, Trustworthy Question Answering Pride David, Cancellieri Matteo, Knoth Petr
- Chatgpt Vs Human-authored Text: Insights Into Controllable Text Summarization And Sentence Style Transfer Pu Dongqi, Demberg Vera
- Neural Task Synthesis For Visual Programming Pădurean Victor-alexandru, Tzannetos Georgios, Singla Adish
- Visual Adversarial Examples Jailbreak Aligned Large Language Models Qi Xiangyu, Huang Kaixuan, Panda Ashwinee, Henderson Peter, Wang Mengdi, Mittal Prateek
- Fine-tuning Aligned Language Models Compromises Safety, Even When Users Do Not Intend To! Qi Xiangyu, Zeng Yi, Xie Tinghao, Chen Pin-yu, Jia Ruoxi, Mittal Prateek, Henderson Peter
- Toolink: Linking Toolkit Creation And Using Through Chain-of-solving On Open-source Model Qian Cheng, Xiong Chenyan, Liu Zhenghao, Liu Zhiyuan
- Harnessing The Power Of Large Language Models For Empathetic Response Generation: Empirical Investigations And Improvements Qian Yushan, Zhang Wei-nan, Liu Ting
- Is Chatgpt A General-purpose Natural Language Processing Task Solver? Qin Chengwei, Zhang Aston, Zhang Zhuosheng, Chen Jiaao, Yasunaga Michihiro, Yang Diyi
- How Good Is Google Bard's Visual Understanding? An Empirical Study On Open Challenges Qin Haotong, Ji Ge-peng, Khan Salman, Fan Deng-ping, Khan Fahad Shahbaz, Van Gool Luc
- Toolllm: Facilitating Large Language Models To Master 16000+ Real-world Apis Qin Yujia, Liang Shihao, Ye Yining, Zhu Kunlun, Yan Lan, Lu Yaxi, Lin Yankai, Cong Xin, Tang Xiangru, Qian Bill, Zhao Sihan, Hong Lauren, Tian Runchu, Xie Ruobing, Zhou Jie, Gerstein Mark, Li Dahai, Liu Zhiyuan, Sun Maosong
- Large Language Models Are Effective Text Rankers With Pairwise Ranking Prompting Qin Zhen, Jagerman Rolf, Hui Kai, Zhuang Honglei, Wu Junru, Yan Le, Shen Jiaming, Liu Tianqi, Liu Jialu, Metzler Donald, Wang Xuanhui, Bendersky Michael
- Faithful Chain-of-thought Reasoning Qing Lyu, Shreya Havaldar, Adam Stein, Li Zhang, Delip Rao, Eric Wong, Marianna Apidianaki, Chris Callison-burch
- Are Large Language Models Temporally Grounded? Qiu Yifu, Zhao Zheng, Ziser Yftah, Korhonen Anna, Ponti Edoardo M., Cohen Shay B.
- Cmed-gpt: Prompt Tuning For Entity-aware Chinese Medical Dialogue Generation Qu Zhijie, Li Juan, Ma Zerui, Li Jianqiang
- Exploring The Effectiveness Of GPT Models In Test-taking: A Case Study Of The Driver's License Knowledge Test Rahimi Saba, Balch Tucker, Veloso Manuela
- K-PERM: Personalized Response Generation Using Dynamic Knowledge Retrieval And Persona-adaptive Queries Raj Kanak, Roy Kaushik, Bonagiri Vamshi, Govil Priyanshul, Thirunarayanan Krishnaprasad, Gaur Manas
- Multimodal Multi-hop Question Answering Through A Conversation Between Tools And Efficiently Finetuned Large Language Models Rajabzadeh Hossein, Wang Suyuchen, Kwon Hyock Ju, Liu Bang
- Does The "most Sinfully Decadent Cake Ever" Taste Good? Answering Yes/no Questions From Figurative Contexts Rakshit Geetanjali, Flanigan Jeffrey
- Tailoring Self-rationalizers With Multi-reward Distillation Ramnath Sahana, Joshi Brihi, Hallinan Skyler, Lu Ximing, Li Liunian Harold, Chan Aaron, Hessel Jack, Choi Yejin, Ren Xiang
- Chatgpt-crawler: Find Out If Chatgpt Really Knows What It's Talking About Rangapur Aman, Wang Haoran
- Tricking Llms Into Disobedience: Formalizing, Analyzing, And Detecting Jailbreaks Rao Abhinav, Vashistha Sachin, Naik Atharva, Aditya Somak, Choudhury Monojit
- Can Chatgpt Assess Human Personalities? A General Evaluation Framework Rao Haocong, Leung Cyril, Miao Chunyan
- Evaluating Llms On Document-based QA: Exact Answer Selection And Numerical Extraction Using Cogtale Dataset Rasool Zafaryab, Kurniawan Stefanus, Balugo Sherwin, Barnett Scott, Vasa Rajesh, Chesser Courtney, Hampstead Benjamin M., Belleville Sylvie, Mouzakis Kon, Bahar-fuchs Alex
- Supporting Human-ai Collaboration In Auditing Llms With Llms Rastogi Charvi, Ribeiro Marco Tulio, King Nicholas, Nori Harsha, Amershi Saleema
- Dissecting In-context Learning Of Translations In Gpts Raunak Vikas, Awadalla Hany Hassan, Menezes Arul
- Do Gpts Produce Less Literal Translations? Raunak Vikas, Menezes Arul, Post Matt, Awadalla Hany Hassan
- Leveraging GPT-4 For Automatic Translation Post-editing Raunak Vikas, Sharaf Amr, Wang Yiren, Awadallah Hany Hassan, Menezes Arul
- Dial Beinfo For Faithfulness: Improving Factuality Of Information-seeking Dialogue Via Behavioural Fine-tuning Razumovskaia Evgeniia, Vulić Ivan, Marković Pavle, Cichy Tomasz, Zheng Qian, Wen Tsung-hsien, Budzianowski Paweł
- Fine-tuning Language Models For Context-specific SQL Query Generation Rebei Amine
- Self-evaluation Improves Selective Generation In Large Language Models Ren Jie, Zhao Yao, Vu Tu, Liu Peter J., Lakshminarayanan Balaji
- Investigating The Factual Knowledge Boundary Of Large Language Models With Retrieval Augmentation Ren Ruiyang, Wang Yuhao, Qu Yingqi, Zhao Wayne Xin, Liu Jing, Tian Hao, Wu Hua, Wen Ji-rong, Wang Haifeng
- Prompt, Generate, Then Cache: Cascade Of Foundation Models Makes Strong Few-shot Learners Renrui Zhang, Xiangfei Hu, Bohao Li, Siyuan Huang, Hanqiu Deng, Hongsheng Li, Yu Qiao, Peng Gao
- I'm Afraid I Can't Do That: Predicting Prompt Refusal In Black-box Generative Language Models Reuter Max, Schulze William
- Negated Complementary Commonsense Using Large Language Models Rezaei Navid, Reformat Marek Z.
- STREET: A Multi-task Structured Reasoning And Explanation Benchmark Ribeiro Danilo, Wang Shen, Ma Xiaofei, Zhu Henry, Dong Rui, Kong Deguang, Burger Juliette, Ramos Anjelica, Wang William, Huang Zhiheng, Karypis George, Xiang Bing, Roth Dan
- Commonsense Reasoning For Conversational AI: A Survey Of The State Of The Art Richardson Christopher, Heck Larry
- SYNDICOM: Improving Conversational Commonsense With Error-injection And Natural Language Feedback Richardson Christopher, Sundar Anirudh, Heck Larry
- Data Contamination Through The Lens Of Time Roberts Manley, Thakur Himanshu, Herlihy Christine, White Colin, Dooley Samuel
- Smoothllm: Defending Large Language Models Against Jailbreaking Attacks Robey Alexander, Wong Eric, Hassani Hamed, Pappas George J.
- Bio-sieve: Exploring Instruction Tuning Large Language Models For Systematic Review Automation Robinson Ambrose, Thorne William, Wu Ben P., Pandor Abdullah, Essat Munira, Stevenson Mark, Song Xingyi
- Chatgpt MT: Competitive For High- (but Not Low-) Resource Languages Robinson Nathaniel R., Ogayo Perez, Mortensen David R., Neubig Graham
- Benchmarking Causal Study To Interpret Large Language Models For Source Code Rodriguez-cardenas Daniel, Palacio David N., Khati Dipin, Burke Henry, Poshyvanyk Denys
- Audiogpt: Understanding And Generating Speech, Music, Sound, And Talking Head Rongjie Huang, Mingze Li, Dongchao Yang, Jiatong Shi, Xuankai Chang, Zhenhui Ye, Yuning Wu, Zhiqing Hong, Jiawei Huang, Jinglin Liu, Yi Ren, Zhou Zhao, Shinji Watanabe
- Grading Conversational Responses Of Chatbots Rosario Grant, Noever David
- The Manipulation Problem: Conversational AI As A Threat To Epistemic Agency Rosenberg Louis
- Axiomatic Preference Modeling For Longform Question Answering Rosset Corby, Zheng Guoqing, Dibia Victor, Awadallah Ahmed, Bennett Paul
- Unveiling The Potential Of Large Language Models In Generating Semantic And Cross-language Clones Roy Palash R., Alam Ajmain I., Al-omari Farouq, Roy Banani, Roy Chanchal K., Schneider Kevin A.
- Generating Phishing Attacks Using Chatgpt Roy Sayak Saha, Naragam Krishna Vamsi, Nilizadeh Shirin
- From Chatbots To Phishbots? -- Preventing Phishing Scams Created Using Chatgpt, Google Bard And Claude Roy Sayak Saha, Thota Poojitha, Naragam Krishna Vamsi, Nilizadeh Shirin
- Identifying The Risks Of LM Agents With An Lm-emulated Sandbox Ruan Yangjun, Dong Honghua, Wang Andrew, Pitis Silviu, Zhou Yongchao, Ba Jimmy, Dubois Yann, Maddison Chris J., Hashimoto Tatsunori
- Gpt4tools: Teaching Large Language Model To Use Tools Via Self-instruction Rui Yang, Lin Song, Yanwei Li, Sijie Zhao, Yixiao Ge, Xiu Li, Ying Shan
- Are Emergent Abilities Of Large Language Models A Mirage? Rylan Schaeffer, Brando Miranda, Sanmi Koyejo
- Plausibility Processing In Transformer Language Models: Focusing On The Role Of Attention Heads In GPT Ryu Soo Hyun
- Performance Of Chatgpt-3.5 And GPT-4 On The United States Medical Licensing Examination With And Without Distractions Safrai Myriam, Azaria Amos
- Branch-solve-merge Improves Large Language Model Evaluation And Generation Saha Swarnadeep, Levy Omer, Celikyilmaz Asli, Bansal Mohit, Weston Jason, Li Xian
- Promptmix: A Class Boundary Augmentation Method For Large Language Model Distillation Sahu Gaurav, Vechtomova Olga, Bahdanau Dzmitry, Laradji Issam H.
- Verbosity Bias In Preference Labeling By Large Language Models Saito Keita, Wachi Akifumi, Wataoka Koki, Akimoto Youhei
- Memory Injections: Correcting Multi-hop Reasoning Failures During Inference In Transformer-based Language Models Sakarvadia Mansi, Ajith Aswathy, Khan Arham, Grzenda Daniel, Hudson Nathaniel, Bauer André, Chard Kyle, Foster Ian
- Extending The Frontier Of Chatgpt: Code Generation And Debugging Sakib Fardin Ahsan, Khan Saadat Hasan, Karim A. H. M. Rezaul
- UT5: Pretraining Non Autoregressive T5 With Unrolled Denoising Salem Mahmoud G., Ye Jiayu, Lin Chu-cheng, Liu Frederick
- Can Llms Augment Low-resource Reading Comprehension Datasets? Opportunities And Challenges Samuel Vinay, Aynaou Houda, Chowdhury Arijit Ghosh, Ramanan Karthik Venkat, Chadha Aman
- What Matters In The Structured Pruning Of Generative Language Models? Santacroce Michael, Wen Zixin, Shen Yelong, Li Yuanzhi
- Camoscio: An Italian Instruction-tuned Llama Santilli Andrea, Rodolà Emanuele
- Early Weight Averaging Meets High Learning Rates For LLM Pre-training Sanyal Sunny, Neerkaje Atula, Kaddour Jean, Kumar Abhishek, Sanghavi Sujay
- Inseq: An Interpretability Toolkit For Sequence Generation Models Sarti Gabriele, Feldhus Nils, Sickert Ludwig, Van Der Wal Oskar, Nissim Malvina, Bisazza Arianna
- Interprompt: Interpretable Prompting For Interrelated Interpersonal Risk Factors In Reddit Posts Sathvik Msvpj, Sarkar Surjodeep, Saxena Chandni, Sohn Sunghwan, Garg Muskan
- GIFT: Generative Interpretable Fine-tuning Savadikar Chinmay, Song Xi, Wu Tianfu
- Diagnostic Reasoning Prompts Reveal The Potential For Large Language Model Interpretability In Medicine Savage Thomas, Nayak Ashwin, Gallo Robert, Rangan Ekanath, Chen Jonathan H
- Unlocking Practical Applications In Legal Domain: Evaluation Of GPT For Zero-shot Semantic Annotation Of Legal Texts Savelka Jaromir
- Large Language Models (GPT) Struggle To Answer Multiple-choice Questions About Code Savelka Jaromir, Agarwal Arav, Bogart Christopher, Sakr Majd
- Can GPT-4 Support Analysis Of Textual Data In Tasks Requiring Highly Specialized Domain Expertise? Savelka Jaromir, Ashley Kevin D., Gray Morgan A, Westermann Hannes, Xu Huihui
- ARB: Advanced Reasoning Benchmark For Large Language Models Sawada Tomohiro, Paleka Daniel, Havrilla Alexander, Tadepalli Pranav, Vidas Paula, Kranias Alexander, Nay John J., Gupta Kshitij, Komatsuzaki Aran
- Large Language Models Can Strategically Deceive Their Users When Put Under Pressure Scheurer Jérémy, Balesni Mikita, Hobbhahn Marius
- Towards Llm-based Autograding For Short Textual Answers Schneider Johannes, Schenk Bernd, Niklaus Christina
- Chatbot-supported Thesis Writing: An Autoethnographic Report Schwenke Nicolas, Söbke Heinrich, Kraft Eckhard
- An Empirical Evaluation Of Using Large Language Models For Automated Unit Test Generation Schäfer Max, Nadi Sarah, Eghbali Aryaz, Tip Frank
- Surgicalgpt: End-to-end Language-vision GPT For Visual Question Answering In Surgery Seenivasan Lalithkumar, Islam Mobarakol, Kannan Gokul, Ren Hongliang
- Wikichat: Stopping The Hallucination Of Large Language Model Chatbots By Few-shot Grounding On Wikipedia Semnani Sina J., Yao Violet Z., Zhang Heidi C., Lam Monica S.
- Jais And Jais-chat: Arabic-centric Foundation And Instruction-tuned Open Generative Large Language Models Sengupta Neha, Sahu Sunil Kumar, Jia Bokang, Katipomu Satheesh, Li Haonan, Koto Fajri, Marshall William, Gosal Gurpreet, Liu Cynthia, Chen Zhiming, Afzal Osama Mohammed, Kamboj Samta, Pandit Onkar, Pal Rahul, Pradhan Lalit, Mujahid Zain Muhammad, Baali Massa, Han Xudong, Bsharat Sondos Mahmoud, Aji Alham Fikri, Shen Zhiqiang, Liu Zhengzhong, Vassilieva Natalia, Hestness Joel, Hock Andy, Feldman Andrew, Lee Jonathan, Jackson Andrew, Ren Hector Xuguang, Nakov Preslav, Baldwin Timothy, Xing Eric
- Language Models: A Guide For The Perplexed Serrano Sofia, Brumbaugh Zander, Smith Noah A.
- The Cot Collection: Improving Zero-shot And Few-shot Learning Of Language Models Via Chain-of-thought Fine-tuning Seungone Kim, Se June Joo, Doyoung Kim, Joel Jang, Seonghyeon Ye, Jamin Shin, Minjoon Seo
- Zero Is Not Hero Yet: Benchmarking Zero-shot Performance Of Llms For Financial Tasks Shah Agam, Chava Sudheer
- Scalable And Transferable Black-box Jailbreaks For Language Models Via Persona Modulation Shah Rusheb, Feuillade--montixi Quentin, Pour Soroush, Tagade Arush, Casper Stephen, Rando Javier
- Let's Have A Chat! A Conversation With Chatgpt: Technology, Applications, And Limitations Shahriar Sakib, Hayawi Kadhim
- Cognitive Effects In Large Language Models Shaki Jonathan, Kraus Sarit, Wooldridge Michael
- PB-LLM: Partially Binarized Large Language Models Shang Yuzhang, Yuan Zhihang, Wu Qiang, Dong Zhen
- One-shot Sensitivity-aware Mixed Sparsity Pruning For Large Language Models Shao Hang, Liu Bei, Xiao Bo, Zeng Ke, Wan Guanglu, Qian Yanmin
- Tinylvlm-ehub: Towards Comprehensive And Efficient Evaluation For Large Vision-language Models Shao Wenqi, Lei Meng, Hu Yutao, Gao Peng, Zhang Kaipeng, Meng Fanqing, Xu Peng, Huang Siyuan, Li Hongsheng, Qiao Yu, Luo Ping
- Character-llm: A Trainable Agent For Role-playing Shao Yunfan, Li Linyang, Dai Junqi, Qiu Xipeng
- Performance Of Chatgpt On USMLE: Unlocking The Potential Of Large Language Models For Ai-assisted Medical Education Sharma Prabin, Thapa Kisan, Thapa Dikshya, Dhakal Prastab, Upadhaya Mala Deep, Adhikari Santosh, Khanal Salik Ram
- Democratizing Llms: An Exploration Of Cost-performance Trade-offs In Self-refined Open-source Models Shashidhar Sumuk, Chinta Abhinav, Sahai Vaibhav, Wang Zhenhailong, Ji Heng
- Exploring The Factual Consistency In Dialogue Comprehension Of Large Language Models She Shuaijie, Huang Shujian, Wang Xingyun, Zhou Yanke, Chen Jiajun
- Efficient LLM Inference On Cpus Shen Haihao, Chang Hanwen, Dong Bo, Luo Yu, Meng Hengyu
- Improving The Robustness Of Transformer-based Large Language Models With Dynamic Attention Shen Lujia, Pu Yuwen, Ji Shouling, Li Changjiang, Zhang Xuhong, Ge Chunpeng, Wang Ting
- Retrieval-generation Alignment For End-to-end Task-oriented Dialogue System Shen Weizhou, Gao Yingqi, Huang Canbin, Wan Fanqi, Quan Xiaojun, Bi Wei
- Storygpt-v: Large Language Models As Consistent Story Visualizers Shen Xiaoqian, Elhoseiny Mohamed
- "do Anything Now": Characterizing And Evaluating In-the-wild Jailbreak Prompts On Large Language Models Shen Xinyue, Chen Zeyuan, Backes Michael, Shen Yun, Zhang Yang
- In Chatgpt We Trust? Measuring And Characterizing The Reliability Of Chatgpt Shen Xinyue, Chen Zeyuan, Backes Michael, Zhang Yang
- Why Does Chatgpt Fall Short In Providing Truthful Answers? Shen Zheng, Jie Huang, Kevin Chen-chuan Chang
- Sotana: The Open-source Software Development Assistant Shi Ensheng, Zhang Fengji, Wang Yanlin, Chen Bei, Du Lun, Zhang Hongyu, Han Shi, Zhang Dongmei, Sun Hongbin
- QADYNAMICS: Training Dynamics-driven Synthetic QA Diagnostic For Zero-shot Commonsense Question Answering Shi Haochen, Wang Weiqi, Fang Tianqing, Xu Baixuan, Ding Wenxuan, Liu Xin, Song Yangqiu
- Badgpt: Exploring Security Vulnerabilities Of Chatgpt Via Backdoor Attacks To Instructgpt Shi Jiawen, Liu Yixin, Zhou Pan, Sun Lichao
- Trusting Your Evidence: Hallucinate Less With Context-aware Decoding Shi Weijia, Han Xiaochuang, Lewis Mike, Tsvetkov Yulia, Zettlemoyer Luke, Yih Scott Wen-tau
- Reasoning With Language Model Is Planning With World Model Shibo Hao, Yi Gu, Haodi Ma, Joshua Jiahua Hong, Zhen Wang, Daisy Zhe Wang, Zhiting Hu
- Large Language Models Can Enhance Persuasion Through Linguistic Feature Alignment Shin Minkyu, Kim Jin
- Exploring The Robustness Of Large Language Models For Solving Programming Problems Shirafuji Atsushi, Watanobe Yutaka, Ito Takumi, Morishita Makoto, Nakamura Yuki, Oda Yusuke, Suzuki Jun
- SCREWS: A Modular Framework For Reasoning With Revisions Shridhar Kumar, Jhamtani Harsh, Fang Hao, Van Durme Benjamin, Eisner Jason, Xia Patrick
- Llamas Know What Gpts Don't Show: Surrogate Models For Confidence Estimation Shrivastava Vaishnavi, Liang Percy, Kumar Ananya
- What Does CLIP Know About A Red Circle? Visual Prompt Engineering For Vlms Shtedritski Aleksandar, Rupprecht Christian, Vedaldi Andrea
- Audio-visual LLM For Video Understanding Shu Fangxun, Zhang Lei, Jiang Hao, Xie Cihang
- A Comprehensive Evaluation Of Large Language Models On Legal Judgment Prediction Shui Ruihao, Cao Yixin, Wang Xiang, Chua Tat-seng
- A Survey On Multimodal Large Language Models Shukang Yin, Chaoyou Fu, Sirui Zhao, Ke Li, Xing Sun, Tong Xu, Enhong Chen
- Woodpecker: Hallucination Correction For Multimodal Large Language Models Shukang Yin, Chaoyou Fu, Sirui Zhao, Tong Xu, Hao Wang, Dianbo Sui, Yunhang Shen, Ke Li, Xing Sun, Enhong Chen
- The Curse Of Recursion: Training On Generated Data Makes Models Forget Shumailov Ilia, Shumaylov Zakhar, Zhao Yiren, Gal Yarin, Papernot Nicolas, Anderson Ross
- Tree Of Thoughts: Deliberate Problem Solving With Large Language Models Shunyu Yao, Dian Yu, Jeffrey Zhao, Izhak Shafran, Thomas L. Griffiths, Yuan Cao, Karthik Narasimhan
- Measuring Inductive Biases Of In-context Learning With Underspecified Demonstrations Si Chenglei, Friedman Dan, Joshi Nitish, Feng Shi, Chen Danqi, He He
- An Empirical Study Of Instruction-tuning Large Language Models In Chinese Si Qingyi, Wang Tong, Lin Zheng, Zhang Xu, Cao Yanan, Wang Weiping
- Exploring The Capabilities Of Chatgpt In Ancient Chinese Translation And Person Name Recognition Si Shijing, Zhou Siqing, Tang Le, Cheng Xiaoqing, Zhang Yugui
- Mondrian: Prompt Abstraction Attack Against Large Language Models For Cheaper API Pricing Si Wai Man, Backes Michael, Zhang Yang
- Mind Meets Machine: Unravelling Gpt-4's Cognitive Psychology Sifatkaur Dhingra, Manmeet Singh, Vaisakh Sb, Neetiraj Malviya, Sukhpal Singh Gill
- GPT-4 As An Agronomist Assistant? Answering Agriculture Exams Using Large Language Models Silva Bruno, Nunes Leonardo, Estevão Roberto, Aski Vijay, Chandra Ranveer
- Generalized Planning In PDDL Domains With Pretrained Large Language Models Silver Tom, Dan Soham, Srinivas Kavitha, Tenenbaum Joshua B., Kaelbling Leslie Pack, Katz Michael
- Large Language Models For Aspect-based Sentiment Analysis Simmering Paul F., Huoviala Paavo
- Assessing GPT4-V On Structured Reasoning Tasks Singh Mukul, Cambronero José, Gulwani Sumit, Le Vu, Verbruggen Gust
- Unlocking Model Insights: A Dataset For Automated Model Card Generation Singh Shruti, Lodwal Hitesh, Malwat Husain, Thakur Rakesh, Singh Mayank
- Exploiting Large Language Models (llms) Through Deception Techniques And Persuasion Principles Singh Sonali, Abri Faranak, Namin Akbar Siami
- Evaluating Chatgpt And GPT-4 For Visual Programming Singla Adish
- TABLET: Learning From Instructions For Tabular Data Slack Dylan, Singh Sameer
- Don't Add, Don't Miss: Effective Content Preserving Generation From Pre-selected Text Spans Slobodkin Aviv, Caciularu Avi, Hirsch Eran, Dagan Ido
- Step: Stacked LLM Policies For Web Actions Sodhi Paloma, Branavan S. R. K., Artzi Yoav, Mcdonald Ryan
- Decoding Chatgpt: A Taxonomy Of Existing Research, Current Challenges, And Possible Future Directions Sohail Shahab Saquib, Farhat Faiza, Himeur Yassine, Nadeem Mohammad, Madsen Dag Øivind, Singh Yashbir, Atalla Shadi, Mansoor Wathiq
- Biomedical Knowledge Graph-optimized Prompt Generation For Large Language Models Soman Karthik, Rose Peter W, Morris John H, Akbas Rabia E, Smith Brett, Peetoom Braian, Villouta-reyes Catalina, Cerono Gabriel, Shi Yongmei, Rizk-jackson Angela, Israni Sharat, Nelson Charlotte A, Huang Sui, Baranzini Sergio E
- Dynamics Of Instruction Tuning: Each Ability Of Large Language Models Has Its Own Growth Pace Song Chiyu, Zhou Zhanchao, Yan Jianhao, Fei Yuejiao, Lan Zhenzhong, Zhang Yue
- Preference Ranking Optimization For Human Alignment Song Feifan, Yu Bowen, Li Minghao, Yu Haiyang, Huang Fei, Li Yongbin, Wang Houfeng
- Nlpbench: Evaluating Large Language Models On Solving NLP Problems Song Linxin, Zhang Jieyu, Cheng Lechao, Zhou Pengyuan, Zhou Tianyi, Li Irene
- How To Bridge The Gap Between Modalities: A Comprehensive Survey On Multimodal Large Language Model Song Shezheng, Li Xiaopeng, Li Shasha, Zhao Shan, Yu Jie, Ma Jun, Mao Xiaoguang, Zhang Weimin
- Large Language Models Meet Open-world Intent Discovery And Recognition: An Evaluation Of Chatgpt Song Xiaoshuai, He Keqing, Wang Pei, Dong Guanting, Mou Yutao, Wang Jingang, Xian Yunsen, Cai Xunliang, Xu Weiran
- Honeybee: Progressive Instruction Finetuning Of Large Language Models For Materials Science Song Yu, Miret Santiago, Zhang Huan, Liu Bang
- Reward Collapse In Aligning Large Language Models Song Ziang, Cai Tianle, Lee Jason D., Su Weijie J.
- Deduction Under Perturbed Evidence: Probing Student Simulation Capabilities Of Large Language Models Sonkar Shashank, Baraniuk Richard G.
- Code Soliloquies For Accurate Calculations In Large Language Models Sonkar Shashank, Le Myco, Chen Xinghe, Liu Naiming, Mallick Debshila Basu, Baraniuk Richard G.
- Recursion Of Thought: A Divide-and-conquer Approach To Multi-context Reasoning With Language Models Soochan Lee, Gunhee Kim
- Joint Prompt Optimization Of Stacked Llms Using Variational Inference Sordoni Alessandro, Yuan Xingdi, Côté Marc-alexandre, Pereira Matheus, Trischler Adam, Xiao Ziang, Hosseini Arian, Niedtner Friederike, Roux Nicolas Le
- Learning Ui-to-code Reverse Generator Using Visual Critic Without Rendering Soselia Davit, Saifullah Khalid, Zhou Tianyi
- Dynamic Context Pruning For Efficient And Interpretable Autoregressive Transformers Sotiris Anagnostidis, Dario Pavllo, Luca Biggio, Lorenzo Noci, Aurelien Lucchi, Thomas Hofmann
- Evaluation Metrics In The Era Of GPT-4: Reliably Evaluating Large Language Models On Sequence To Sequence Tasks Sottana Andrea, Liang Bin, Zou Kai, Yuan Zheng
- Accelerating LLM Inference With Staged Speculative Decoding Spector Benjamin, Re Chris
- Assessing The Nature Of Large Language Models: A Caution Against Anthropocentrism Speed Ann
- Commands As AI Conversations Spinellis Diomidis
- Musr: Testing The Limits Of Chain-of-thought With Multistep Soft Reasoning Sprague Zayne, Ye Xi, Bostrom Kaj, Chaudhuri Swarat, Durrett Greg
- Harnessing Llms In Curricular Design: Using GPT-4 To Support Authoring Of Learning Objectives Sridhar Pragnya, Doyle Aidan, Agarwal Arav, Bogart Christopher, Savelka Jaromir, Sakr Majd
- Chatgpt: A Study On Its Utility For Ubiquitous Software Engineering Tasks Sridhara Giriprasad, G. Ranjani H., Mazumdar Sourav
- Training Large Language Models Efficiently With Sparsity And Dataflow Srinivasan Venkat, Gandhi Darshan, Thakker Urmish, Prabhakar Raghu
- Evaluating Chatbots To Promote Users' Trust -- Practices And Open Problems Srivastava Biplav, Lakkaraju Kausik, Koppel Tarmo, Narayanan Vignesh, Kundu Ashish, Joshi Sachindra
- Instances Need More Care: Rewriting Prompts For Instances With Llms In The Loop Yields Better Zero-shot Performance Srivastava Saurabh, Huang Chengyue, Fan Weiguo, Yao Ziyu
- The Languini Kitchen: Enabling Language Modelling Research At Different Scales Of Compute Stanić Aleksandar, Ashley Dylan, Serikov Oleg, Kirsch Louis, Faccio Francesco, Schmidhuber Jürgen, Hofmann Thomas, Schlag Imanol
- Seeing Seeds Beyond Weeds: Green Teaming Generative AI For Beneficial Uses Stapleton Logan, Taylor Jordan, Fox Sarah, Wu Tongshuang, Zhu Haiyi
- GPT-4 Doesn't Know It's Wrong: An Analysis Of Iterative Prompting For Reasoning Problems Stechly Kaya, Marquez Matthew, Kambhampati Subbarao
- Distilled GPT For Source Code Summarization Su Chia-yi, Mcmillan Collin
- Learning From Red Teaming: Gender Bias Provocation And Mitigation In Large Language Models Su Hsuan, Cheng Cheng-chu, Farn Hua, Kumar Shachi H, Sahay Saurav, Chen Shang-tse, Lee Hung-yi
- The Synergy Of Speculative Decoding And Batching In Serving Large Language Models Su Qidong, Giannoula Christina, Pekhimenko Gennady
- HC3 Plus: A Semantic-invariant Human Chatgpt Comparison Corpus Su Zhenpeng, Wu Xing, Zhou Wei, Ma Guangyuan, Hu Songlin
- Can Large Language Models Change User Preference Adversarially? Subhash Varshini
- Language Model-in-the-loop: Data Optimal Approach To Learn-to-recommend Actions In Text Games Sudhakar Arjun Vaithilingam, Parthasarathi Prasanna, Rajendran Janarthanan, Chandar Sarath
- Jmedlora: Medical Domain Adaptation On Japanese Large Language Models Using Instruction-tuning Sukeda Issey, Suzuki Masahiro, Sakaji Hiroki, Kodera Satoshi
- CONSCENDI: A Contrastive And Scenario-guided Distillation Approach To Guardrail Models For Virtual Assistants Sun Albert Yu, Nair Varun, Schumacher Elliot, Kannan Anitha
- Does Fine-tuning GPT-3 With The Openai API Leak Personally-identifiable Information? Sun Albert Yu, Zemour Eliott, Saxena Arushi, Vaidyanathan Udith, Lin Eric, Lau Christian, Mugunthan Vaikkunth
- Speech-based Slot Filling Using Large Language Models Sun Guangzhi, Feng Shutong, Jiang Dongcheng, Zhang Chao, Gašić Milica, Woodland Philip C.
- Reinforcement Learning In The Era Of Llms: What Is Essential? What Is Needed? An RL Perspective On RLHF, Prompting, And Beyond Sun Hao
- Think-on-graph: Deep And Responsible Reasoning Of Large Language Model On Knowledge Graph Sun Jiashuo, Xu Chengjin, Tang Lumingyuan, Wang Saizhuo, Lin Chen, Gong Yeyun, Ni Lionel M., Shum Heung-yeung, Guo Jian
- Scieval: A Multi-level Large Language Model Evaluation Benchmark For Scientific Research Sun Liangtai, Han Yang, Zhao Zihan, Ma Da, Shen Zhennan, Chen Baocai, Chen Lu, Yu Kai
- Generative Multimodal Models Are In-context Learners Sun Quan, Cui Yufeng, Zhang Xiaosong, Zhang Fan, Yu Qiying, Luo Zhengxiong, Wang Yueze, Rao Yongming, Liu Jingjing, Huang Tiejun, Wang Xinlong
- Battle Of The Large Language Models: Dolly Vs Llama Vs Vicuna Vs Guanaco Vs Bard Vs Chatgpt -- A Text-to-sql Parsing Comparison Sun Shuo, Zhang Yuchen, Yan Jiahuan, Gao Yuze, Ong Donovan, Chen Bin, Su Jian
- PEARL: Prompting Large Language Models To Plan And Execute Actions Over Long Documents Sun Simeng, Liu Yang, Wang Shuohang, Zhu Chenguang, Iyyer Mohit
- Instruction Distillation Makes Large Language Models Efficient Zero-shot Rankers Sun Weiwei, Chen Zheng, Ma Xinyu, Yan Lingyong, Wang Shuaiqiang, Ren Pengjie, Chen Zhumin, Yin Dawei, Ren Zhaochun
- Multimodal Question Answering For Unified Information Extraction Sun Yuxuan, Zhang Kai, Su Yu
- Ctbls: Augmenting Large Language Models With Conversational Tables Sundar Anirudh S, Heck Larry
- Uncovering Chatgpt's Capabilities In Recommender Systems Sunhao Dai, Ninglu Shao, Haiyuan Zhao, Weijie Yu, Zihua Si, Chen Xu, Zhongxiang Sun, Xiao Zhang, Jun Xu
- Do Large Language Models Show Decision Heuristics Similar To Humans? A Case Study Using GPT-3.5 Suri Gaurav, Slater Lily R., Ziaee Ali, Nguyen Morgan
- Location-aware Visual Question Generation With Lightweight Models Suwono Nicholas Collin, Chen Justin Chih-yao, Hung Tun Min, Huang Ting-hao Kenneth, Liao I-bin, Li Yung-hui, Ku Lun-wei, Sun Shao-hua
- Approximating Online Human Evaluation Of Social Chatbots With Prompting Svikhnushina Ekaterina, Pu Pearl
- Assessing The Ability Of Chatgpt To Screen Articles For Systematic Reviews Syriani Eugene, David Istvan, Kumar Gauransh
- Improving Non-autoregressive Translation Quality With Pretrained Language Model, Embedding Distillation And Upsampling Strategy For CTC Syu Shen-sian, Xie Juncheng, Lee Hung-yi
- Sparks Of Artificial General Intelligence: Early Experiments With GPT-4 Sébastien Bubeck, Varun Chandrasekaran, Ronen Eldan, Johannes Gehrke, Eric Horvitz, Ece Kamar, Peter Lee, Yin Tat Lee, Yuanzhi Li, Scott Lundberg, Harsha Nori, Hamid Palangi, Marco Tulio Ribeiro, Yi Zhang
- The BEA 2023 Shared Task On Generating AI Teacher Responses In Educational Dialogues Tack Anaïs, Kochmar Ekaterina, Yuan Zheng, Bibauw Serge, Piech Chris
- Training Generative Question-answering On Synthetic Data Obtained From An Instruct-tuned Model Takahashi Kosuke, Omi Takahiro, Arima Kosuke, Ishigaki Tatsuya
- Redcoast: A Lightweight Tool To Automate Distributed Training Of Llms On Any Gpu/tpus Tan Bowen, Zhu Yun, Liu Lijuan, Wang Hongyi, Zhuang Yonghao, Chen Jindong, Xing Eric, Hu Zhiting
- Is Chatgpt A Good Multi-party Conversation Solver? Tan Chao-hong, Gu Jia-chen, Ling Zhen-hua
- Large Language Model-driven Classroom Flipping: Empowering Student-centric Peer Questioning With Flipped Interaction Tan Chee Wei
- Copilot For Xcode: Exploring Ai-assisted Programming By Prompting Cloud-based Large Language Models Tan Chee Wei, Guo Shangxin, Wong Man Fai, Hang Ching Nam
- Massive Editing For Large Language Models Via Meta Learning Tan Chenmien, Zhang Ge, Fu Jie
- Can Chatgpt Replace Traditional KBQA Models? An In-depth Analysis Of The Question Answering Performance Of The GPT LLM Family Tan Yiming, Min Dehai, Li Yu, Li Wenbo, Hu Nan, Chen Yongrui, Qi Guilin
- Toolalpaca: Generalized Tool Learning For Language Models With 3000 Simulated Cases Tang Qiaoyu, Deng Ziliang, Lin Hongyu, Han Xianpei, Liang Qiao, Cao Boxi, Sun Le
- Ml-bench: Evaluating Large Language Models And Agents For Machine Learning Tasks On Repository-level Code Tang Xiangru, Liu Yuliang, Cai Zefan, Shao Yanjun, Lu Junjie, Zhang Yichi, Deng Zexuan, Hu Helan, An Kaikai, Huang Ruijun, Si Shuzheng, Chen Sheng, Zhao Haozhe, Chen Liang, Wang Yan, Liu Tianyu, Jiang Zhiwei, Chang Baobao, Fang Yin, Qin Yujia, Zhou Wangchunshu, Zhao Yilun, Cohan Arman, Gerstein Mark
- Struc-bench: Are Large Language Models Really Good At Generating Complex Structured Data? Tang Xiangru, Zong Yiming, Phang Jason, Zhao Yilun, Zhou Wangchunshu, Cohan Arman, Gerstein Mark
- Codi-2: In-context, Interleaved, And Interactive Any-to-any Generation Tang Zineng, Yang Ziyi, Khademi Mahmoud, Liu Yang, Zhu Chenguang, Bansal Mohit
- Using Large Language Models For Cybersecurity Capture-the-flag Challenges And Certification Questions Tann Wesley, Liu Yuancheng, Sim Jun Heng, Seah Choon Meng, Chang Ee-chien
- Is Chatgpt A Highly Fluent Grammatical Error Correction System? A Comprehensive Evaluation Tao Fang, Shu Yang, Kaixin Lan, Derek F. Wong, Jinpeng Hu, Lidia S. Chao, Yue Zhang
- Caption Anything: Interactive Image Description With Diverse Multimodal Controls Teng Wang, Jinrui Zhang, Junjie Fei, Hao Zheng, Yunlong Tang, Zhe Li, Mingqi Gao, Shanshan Zhao
- Nomiracl: Knowing When You Don't Know For Robust Multilingual Retrieval-augmented Generation Thakur Nandan, Bonifacio Luiz, Zhang Xinyu, Ogundepo Odunayo, Kamalloo Ehsan, Alfonso-hermelo David, Li Xiaoguang, Liu Qun, Chen Boxing, Rezagholizadeh Mehdi, Lin Jimmy
- Verigen: A Large Language Model For Verilog Code Generation Thakur Shailja, Ahmad Baleegh, Pearce Hammond, Tan Benjamin, Dolan-gavitt Brendan, Karri Ramesh, Garg Siddharth
- SPDF: Sparse Pre-training And Dense Fine-tuning For Large Language Models Thangarasa Vithursan, Gupta Abhay, Marshall William, Li Tianda, Leong Kevin, Decoste Dennis, Lie Sean, Saxena Shreyas
- Learning To Prompt In The Classroom To Understand AI Limits: A Pilot Study Theophilou Emily, Koyuturk Cansu, Yavari Mona, Bursic Sathya, Donabauer Gregor, Telari Alessia, Testa Alessia, Boiano Raffaele, Hernandez-leo Davinia, Ruskov Martin, Taibi Davide, Gabbiadini Alessandro, Ognibene Dimitri
- Is Chatgpt The Ultimate Programming Assistant -- How Far Is It? Tian Haoye, Lu Weiqi, Li Tsz On, Tang Xunzhu, Cheung Shing-chi, Klein Jacques, Bissyandé Tegawendé F.
- Just Ask For Calibration: Strategies For Eliciting Calibrated Confidence Scores From Language Models Fine-tuned With Human Feedback Tian Katherine, Mitchell Eric, Zhou Allan, Sharma Archit, Rafailov Rafael, Yao Huaxiu, Finn Chelsea, Manning Christopher D.
- R\(^3\) Prompting: Review, Rephrase And Resolve For Chain-of-thought Reasoning In Large Language Models Under Noisy Context Tian Qingyuan, Zhu Hanlun, Wang Lei, Li Yang, Lan Yunshi
- Evil Geniuses: Delving Into The Safety Of Llm-based Agents Tian Yu, Yang Xiao, Zhang Jingyuan, Dong Yinpeng, Su Hang
- Chimed-gpt: A Chinese Medical Large Language Model With Full Training Regime And Better Alignment To Human Preferences Tian Yuanhe, Gan Ruyi, Song Yan, Zhang Jiaxing, Zhang Yongdong
- BOOST: Harnessing Black-box Control To Boost Commonsense In Lms' Generation Tian Yufei, Zhang Felix, Peng Nanyun
- RLHF-V: Towards Trustworthy Mllms Via Behavior Alignment From Fine-grained Correctional Human Feedback Tianyu Yu, Yuan Yao, Haoye Zhang, Taiwen He, Yifeng Han, Ganqu Cui, Jinyi Hu, Zhiyuan Liu, Hai-tao Zheng, Maosong Sun, Tat-seng Chua
- Qlora: Efficient Finetuning Of Quantized Llms Tim Dettmers, Artidoro Pagnoni, Ari Holtzman, Luke Zettlemoyer
- Baby Llama: Knowledge Distillation From An Ensemble Of Teachers Trained On A Small Dataset With No Performance Penalty Timiryasov Inar, Tastet Jean-loup
- Function Vectors In Large Language Models Todd Eric, Li Millicent L., Sharma Arnab Sen, Mueller Aaron, Wallace Byron C., Bau David
- Clinical Camel: An Open Expert-level Medical Language Model With Dialogue-based Knowledge Encoding Toma Augustin, Lawler Patrick R., Ba Jimmy, Krishnan Rahul G., Rubin Barry B., Wang Bo
- Bioinstruct: Instruction Tuning Of Large Language Models For Biomedical Natural Language Processing Tran Hieu, Yang Zhichao, Yao Zonghai, Yu Hong
- Large Language Model Prompt Chaining For Long Legal Document Classification Trautmann Dietrich
- Can Large Language Models Play Text Games Well? Current State-of-the-art And Open Questions Tsai Chen Feng, Zhou Xiaochen, Liu Sierra S., Li Jing, Yu Mo, Mei Hongyuan
- Automl-gpt: Large Language Model For Automl Tsai Yun-da, Tsai Yu-che, Huang Bo-wei, Yang Chun-pai, Lin Shou-de
- Chatgpt As A Commenter To The News: Can Llms Generate Human-like Opinions? Tseng Rayden, Verberne Suzan, Van Der Putten Peter
- Overinformative Question Answering By Humans And Machines Tsvilodub Polina, Franke Michael, Hawkins Robert D., Goodman Noah D.
- How Many Unicorns Are In This Image? A Safety Evaluation Benchmark For Vision Llms Tu Haoqin, Cui Chenhang, Wang Zijun, Zhou Yiyang, Zhao Bingchen, Han Junlin, Zhou Wangchunshu, Yao Huaxiu, Xie Cihang
- Efficiently Aligned Cross-lingual Transfer Learning For Conversational Tasks Using Prompt-tuning Tu Lifu, Qu Jin, Yavuz Semih, Joty Shafiq, Liu Wenhao, Xiong Caiming, Zhou Yingbo
- Unlocking Anticipatory Text Generation: A Constrained Approach For Large Language Models Decoding Tu Lifu, Yavuz Semih, Qu Jin, Xu Jiacheng, Meng Rui, Xiong Caiming, Zhou Yingbo
- Causal-discovery Performance Of Chatgpt In The Context Of Neuropathic Pain Diagnosis Tu Ruibo, Ma Chao, Zhang Cheng
- Activation Addition: Steering Language Models Without Optimization Turner Alexander Matt, Thiergart Lisa, Leech Gavin, Udell David, Vazquez Juan J., Mini Ulisse, Macdiarmid Monte
- Language Models Don't Always Say What They Think: Unfaithful Explanations In Chain-of-thought Prompting Turpin Miles, Michael Julian, Perez Ethan, Bowman Samuel R.
- Zeroshotdataaug: Generating And Augmenting Training Data With Chatgpt Ubani Solomon, Polat Suleyman Olcay, Nielsen Rodney
- Taco: Enhancing Cross-lingual Transfer For Low-resource Languages In Llms Through Translation-assisted Chain-of-thought Processes Upadhayay Bibek, Behzadan Vahid
- How Prevalent Is Gender Bias In Chatgpt? -- Exploring German And English Chatgpt Responses Urchs Stefanie, Thurner Veronika, Aßenmacher Matthias, Heumann Christian, Thiemichen Stephanie
- Chatclimate: Grounding Conversational AI In Climate Science Vaghefi Saeid Ashraf, Wang Qian, Muccione Veruska, Ni Jingwei, Kraus Mathias, Bingler Julia, Schimanski Tobias, Colesanti-senni Chiara, Webersinke Nicolas, Huggel Christian, Leippold Markus
- Humans And Language Models Diverge When Predicting Repeating Text Vaidya Aditya R., Turek Javier, Huth Alexander G.
- Let's Think Frame By Frame With VIP: A Video Infilling And Prediction Dataset For Evaluating Video Chain-of-thought Vaishnavi Himakunthala, Andy Ouyang, Daniel Rose, Ryan He, Alex Mei, Yujie Lu, Chinmay Sonar, Michael Saxon, William Yang Wang
- Mini-gpts: Efficient Large Language Models Through Contextual Pruning Valicenti Tim, Vidal Justice, Patnaik Ritik
- Can Large Language Models Really Improve By Self-critiquing Their Own Plans? Valmeekam Karthik, Marquez Matthew, Kambhampati Subbarao
- On The Planning Abilities Of Large Language Models: A Critical Investigation Valmeekam Karthik, Marquez Matthew, Sreedharan Sarath, Kambhampati Subbarao
- Evaluating Shutdown Avoidance Of Language Models In Textual Scenarios Van Der Weij Teun, Lermen Simon, Lang Leon
- Theory Of Mind In Large Language Models: Examining Performance Of 11 State-of-the-art Models Vs. Children Aged 7-10 On Advanced Tests Van Duijn Max J., Van Dijk Bram M. A., Kouwenhoven Tom, De Valk Werner, Spruit Marco R., Van Der Putten Peter
- Detecting And Correcting Hate Speech In Multimodal Memes With Large Visual Language Model Van Minh-hao, Wu Xintao
- Can NLP Models Correctly Reason Over Contexts That Break The Common Assumptions? Varshney Neeraj, Parmar Mihir, Patel Nisarg, Handa Divij, Sarkar Sayantan, Luo Man, Baral Chitta
- A Stitch In Time Saves Nine: Detecting And Mitigating Hallucinations Of Llms By Validating Low-confidence Generation Varshney Neeraj, Yao Wenlin, Zhang Hongming, Chen Jianshu, Yu Dong
- Gpt-who: An Information Density-based Machine-generated Text Detector Venkatraman Saranya, Uchendu Adaku, Lee Dongwon
- Emulating Human Cognitive Processes For Expert-level Medical Question-answering With Large Language Models Verma Khushboo, Moore Marina, Wottrich Stephanie, López Karla Robles, Aggarwal Nishant, Bhatt Zeel, Singh Aagamjit, Unroe Bradford, Basheer Salah, Sachdeva Nitish, Arora Prinka, Kaur Harmanjeet, Kaur Tanupreet, Hood Tevon, Marquez Anahi, Varshney Tushar, Deng Nanfu, Ramani Azaan, Ishwara Pawanraj, Saeed Maimoona, Peña Tatiana López Velarde, Barksdale Bryan, Guha Sushovan, Kumar Satwant
- Simplesafetytests: A Test Suite For Identifying Critical Safety Risks In Large Language Models Vidgen Bertie, Scherrer Nino, Kirk Hannah Rose, Qian Rebecca, Kannappan Anand, Hale Scott A., Röttger Paul
- Fairpy: A Toolkit For Evaluation Of Social Biases And Their Mitigation In Large Language Models Viswanath Hrishikesh, Zhang Tianyi
- Prompt2model: Generating Deployable Models From Natural Language Instructions Viswanathan Vijay, Zhao Chenyang, Bertsch Amanda, Wu Tongshuang, Neubig Graham
- Generative Pre-trained Transformer For Vietnamese Community-based COVID-19 Question Answering Vo Tam Minh, Tran Khiem Vinh
- Unraveling Downstream Gender Bias From Large Language Models: A Study On AI Educational Writing Assistance Wambsganss Thiemo, Su Xiaotian, Swamy Vinitra, Neshaei Seyed Parsa, Rietsche Roman, Käser Tanja
- Poisoning Language Models During Instruction Tuning Wan Alexander, Wallace Eric, Shen Sheng, Klein Dan
- Are Personalized Stochastic Parrots More Dangerous? Evaluating Persona Biases In Dialogue Systems Wan Yixin, Zhao Jieyu, Chadha Aman, Peng Nanyun, Chang Kai-wei
- GPT-RE: In-context Learning For Relation Extraction Using Large Language Models Wan Zhen, Cheng Fei, Mao Zhuoyuan, Liu Qianying, Song Haiyue, Li Jiwei, Kurohashi Sadao
- Reformulating Domain Adaptation Of Large Language Models As Adapt-retrieve-revise: A Case Study On Chinese Legal Domain Wan Zhen, Zhang Yating, Wang Yexiang, Cheng Fei, Kurohashi Sadao
- Shall We Pretrain Autoregressive Language Models With Retrieval? A Comprehensive Study Wang Boxin, Ping Wei, Xu Peng, Mcafee Lawrence, Liu Zihan, Shoeybi Mohammad, Dong Yi, Kuchaiev Oleksii, Li Bo, Xiao Chaowei, Anandkumar Anima, Catanzaro Bryan
- Keqing: Knowledge-based Question Answering Is A Nature Chain-of-thought Mentor Of LLM Wang Chaojie, Xu Yishi, Peng Zhong, Zhang Chenxi, Chen Bo, Wang Xinrun, Feng Lei, An Bo
- Cost-effective Hyperparameter Optimization For Large Language Model Generation Inference Wang Chi, Liu Susan Xueqing, Awadallah Ahmed H.
- Cue-cot: Chain-of-thought Prompting For Responding To In-depth Dialogue Questions With Llms Wang Hongru, Wang Rui, Mi Fei, Deng Yang, Wang Zezhong, Liang Bin, Xu Ruifeng, Wong Kam-fai
- Cross-lingual Knowledge Editing In Large Language Models Wang Jiaan, Liang Yunlong, Sun Zengkui, Cao Yuxuan, Xu Jiarong, Meng Fandong
- Knowledgeable In-context Tuning: Exploring And Exploiting Factual Knowledge For In-context Learning Wang Jianing, Wang Chengyu, Tan Chuanqi, Huang Jun, Gao Ming
- On The Robustness Of Chatgpt: An Adversarial And Out-of-distribution Perspective Wang Jindong, Hu Xixu, Hou Wenxin, Chen Hao, Zheng Runkai, Wang Yidong, Yang Linyi, Huang Haojun, Ye Wei, Geng Xiubo, Jiao Binxin, Zhang Yue, Xie Xing
- Notechat: A Dataset Of Synthetic Doctor-patient Conversations Conditioned On Clinical Notes Wang Junda, Yao Zonghai, Yang Zhichao, Zhou Huixue, Li Rumeng, Wang Xun, Xu Yucheng, Yu Hong
- Mathcoder: Seamless Code Integration In Llms For Enhanced Mathematical Reasoning Wang Ke, Ren Houxing, Zhou Aojun, Lu Zimu, Luo Sichun, Shi Weikang, Zhang Renrui, Song Linqi, Zhan Mingjie, Li Hongsheng
- Chated: A Chatbot Leveraging Chatgpt For An Enhanced Learning Experience In Higher Education Wang Kevin, Ramos Jason, Lawrence Ramon
- Document-level Machine Translation With Large Language Models Wang Longyue, Lyu Chenyang, Ji Tianbo, Zhang Zhirui, Yu Dian, Shi Shuming, Tu Zhaopeng
- Recursively Summarizing Enables Long-term Dialogue Memory In Large Language Models Wang Qingyue, Ding Liang, Cao Yanan, Tian Zhiliang, Wang Shi, Tao Dacheng, Guo Li
- Scimon: Scientific Inspiration Machines Optimized For Novelty Wang Qingyun, Downey Doug, Ji Heng, Hope Tom
- Ivygpt: Interactive Chinese Pathway Language Model In Medical Domain Wang Rongsheng, Duan Yaofei, Lam Chantong, Chen Jiexi, Xu Jiangsheng, Chen Haoming, Liu Xiaohong, Pang Patrick Cheong-iao, Tan Tao
- Bridging The Novice-expert Gap Via Models Of Decision-making: A Case Study On Remediating Math Mistakes Wang Rose E., Zhang Qingyang, Robinson Carly, Loeb Susanna, Demszky Dorottya
- Shepherd: A Critic For Language Model Generation Wang Tianlu, Yu Ping, Tan Xiaoqing Ellen, O'brien Sean, Pasunuru Ramakanth, Dwivedi-yu Jane, Golovneva Olga, Zettlemoyer Luke, Fazel-zarandi Maryam, Celikyilmaz Asli
- CAR: Conceptualization-augmented Reasoner For Zero-shot Commonsense Question Answering Wang Weiqi, Fang Tianqing, Ding Wenxuan, Xu Baixuan, Liu Xin, Song Yangqiu, Bosselut Antoine
- All Languages Matter: On The Multilingual Safety Of Large Language Models Wang Wenxuan, Tu Zhaopeng, Chen Chang, Yuan Youliang, Huang Jen-tse, Jiao Wenxiang, Lyu Michael R.
- Instructuie: Multi-task Instruction Tuning For Unified Information Extraction Wang Xiao, Zhou Weikang, Zu Can, Xia Han, Chen Tianze, Zhang Yuansen, Zheng Rui, Ye Junjie, Zhang Qi, Gui Tao, Kang Jihua, Yang Jingsheng, Li Siyuan, Du Chunsai
- MINT: Evaluating Llms In Multi-turn Interaction With Tools And Language Feedback Wang Xingyao, Wang Zihan, Liu Jiateng, Chen Yangyi, Yuan Lifan, Peng Hao, Ji Heng
- User-aware Prefix-tuning Is A Good Learner For Personalized Image Captioning Wang Xuan, Wang Guanhong, Chai Wenhao, Zhou Jiayu, Wang Gaoang
- Emotional Intelligence Of Large Language Models Wang Xuena, Li Xueting, Yin Zi, Wu Yue, Jia Liu
- Instructta: Instruction-tuned Targeted Attack For Large Vision-language Models Wang Xunguang, Ji Zhenlan, Ma Pingchuan, Li Zongjie, Wang Shuai
- Adding Guardrails To Advanced Chatbots Wang Yanchen, Singh Lisa
- NEWTON: Are Large Language Models Capable Of Physical Reasoning? Wang Yi Ru, Duan Jiafei, Fox Dieter, Srinivasa Siddhartha
- Pandalm: An Automatic Evaluation Benchmark For LLM Instruction Tuning Optimization Wang Yidong, Yu Zhuohao, Zeng Zhengran, Yang Linyi, Wang Cunxiang, Chen Hao, Jiang Chaoya, Xie Rui, Wang Jindong, Xie Xing, Ye Wei, Zhang Shikun, Zhang Yue
- Hint-enhanced In-context Learning Wakes Large Language Models Up For Knowledge-intensive Tasks Wang Yifan, Guo Qingyan, Ni Xinzhe, Shi Chufan, Liu Lemao, Jiang Haiyun, Yang Yujiu
- Multilora: Democratizing Lora For Better Multi-task Learning Wang Yiming, Lin Yu, Zeng Xiaodong, Zhang Guannan
- Gradient-based Word Substitution For Obstinate Adversarial Examples Generation In Language Models Wang Yimu, Shi Peng, Zhang Hongyang
- Primacy Effect Of Chatgpt Wang Yiwei, Cai Yujun, Chen Muhao, Liang Yuxuan, Hooi Bryan
- How Far Can Camels Go? Exploring The State Of Instruction Tuning On Open Resources Wang Yizhong, Ivison Hamish, Dasigi Pradeep, Hessel Jack, Khot Tushar, Chandu Khyathi Raghavi, Wadden David, Macmillan Kelsey, Smith Noah A., Beltagy Iz, Hajishirzi Hannaneh
- Towards Improving Document Understanding: An Exploration On Text-grounding Via Mllms Wang Yonghui, Zhou Wengang, Feng Hao, Zhou Keyi, Li Houqiang
- DRDT: Dynamic Reflection With Divergent Thinking For Llm-based Sequential Recommendation Wang Yu, Liu Zhiwei, Zhang Jianguo, Yao Weiran, Heinecke Shelby, Yu Philip S.
- Augmenting Black-box Llms With Medical Textbooks For Clinical Question Answering Wang Yubo, Ma Xueguang, Chen Wenhu
- Harnessing The Power Of David Against Goliath: Exploring Instruction Data Generation Without Using Closed-source Models Wang Yue, Wang Xinrui, Li Juntao, Chang Jinxiong, Zhang Qishen, Liu Zhongyi, Zhang Guannan, Zhang Min
- Gemini In Reasoning: Unveiling Commonsense In Multimodal Large Language Models Wang Yuqing, Zhao Yun
- Metacognitive Prompting Improves Understanding In Large Language Models Wang Yuqing, Zhao Yun
- Gpt4video: A Unified Multimodal Large Language Model For Instruction-followed Understanding And Safety-aware Generation Wang Zhanyu, Wang Longyue, Zhao Zhen, Wu Minghao, Lyu Chenyang, Li Huayang, Cai Deng, Zhou Luping, Shi Shuming, Tu Zhaopeng
- An In-depth Investigation Of User Response Simulation For Conversational Search Wang Zhenduo, Xu Zhichao, Ai Qingyao, Srikumar Vivek
- Mediagpt: A Large Language Model For Chinese Media Wang Zhonghao, Lu Zijia, Jin Bo, Deng Haiying
- Can Llms Like GPT-4 Outperform Traditional AI Tools In Dementia Diagnosis? Maybe, But Not Today Wang Zhuo, Li Rongzhen, Dong Bowen, Wang Jie, Li Xiuxing, Liu Ning, Mao Chenhui, Zhang Wei, Dong Liling, Gao Jing, Wang Jianyong
- An Effective Data Creation Pipeline To Generate High-quality Financial Instruction Data For Large Language Model Wang Ziao, Wang Jianning, Wu Junda, Zhang Xiaofeng
- Multiqg-ti: Towards Question Generation From Multi-modal Sources Wang Zichao, Baraniuk Richard
- Generating Valid And Natural Adversarial Examples With Large Language Models Wang Zimu, Wang Wei, Chen Qi, Wang Qiufeng, Nguyen Anh
- Memorybank: Enhancing Large Language Models With Long-term Memory Wanjun Zhong, Lianghong Guo, Qiqi Gao, He Ye, Yanlin Wang
- Machine Translation For Ge'ez Language Wassie Aman Kassahun
- A Survey Of Large Language Models Wayne Xin Zhao, Kun Zhou, Junyi Li, Tianyi Tang, Xiaolei Wang, Yupeng Hou, Yingqian Min, Beichen Zhang, Junjie Zhang, Zican Dong, Yifan Du, Chen Yang, Yushuo Chen, Zhipeng Chen, Jinhao Jiang, Ruiyang Ren, Yifan Li, Xinyu Tang, Zikang Liu, Peiyu Liu, Jian-yun Nie, Ji-rong Wen
- Jailbroken: How Does LLM Safety Training Fail? Wei Alexander, Haghtalab Nika, Steinhardt Jacob
- Larger Language Models Do In-context Learning Differently Wei Jerry, Wei Jason, Tay Yi, Tran Dustin, Webson Albert, Lu Yifeng, Chen Xinyun, Liu Hanxiao, Huang Da, Zhou Denny, Ma Tengyu
- Magicoder: Empowering Code Generation With Oss-instruct Wei Yuxiang, Wang Zhe, Liu Jiawei, Ding Yifeng, Zhang Lingming
- Copiloting The Copilots: Fusing Large Language Models With Completion Engines For Automated Program Repair Wei Yuxiang, Xia Chunqiu Steven, Zhang Lingming
- REPLUG: Retrieval-augmented Black-box Language Models Weijia Shi, Sewon Min, Michihiro Yasunaga, Minjoon Seo, Rich James, Mike Lewis, Luke Zettlemoyer, Wen-tau Yih
- Is Chatgpt Good At Search? Investigating Large Language Models As Re-ranking Agents Weiwei Sun, Lingyong Yan, Xinyu Ma, Shuaiqiang Wang, Pengjie Ren, Zhumin Chen, Dawei Yin, Zhaochun Ren
- Layoutgpt: Compositional Visual Planning And Generation With Large Language Models Weixi Feng, Wanrong Zhu, Tsu-jui Fu, Varun Jampani, Arjun Akula, Xuehai He, Sugato Basu, Xin Eric Wang, William Yang Wang
- Infovisdial: An Informative Visual Dialogue Dataset By Bridging Large Multimodal And Language Models Wen Bingbing, Yang Zhengyuan, Wang Jianfeng, Gan Zhe, Howe Bill, Wang Lijuan
- Autodroid: Llm-powered Task Automation In Android Wen Hao, Li Yuanchun, Liu Guohong, Zhao Shanhui, Yu Tao, Li Toby Jia-jun, Jiang Shiqi, Liu Yunhao, Zhang Yaqin, Liu Yunxin
- Droidbot-gpt: Gpt-powered UI Automation For Android Wen Hao, Wang Hongming, Liu Jiaxuan, Li Yuanchun
- Helping Language Models Learn More: Multi-dimensional Task Prompt For Few-shot Tuning Weng Jinta, Zhang Jiarui, Hu Yue, Fa Daidong, Xu Xiaofeng, Huang Heyan
- Is Chatgpt A Good Translator? Yes With GPT-4 As The Engine Wenxiang Jiao, Wenxuan Wang, Jen-tse Huang, Xing Wang, Shuming Shi, Zhaopeng Tu
- Advances In Apparent Conceptual Physics Reasoning In GPT-4 West Colin G.
- AI And The FCI: Can Chatgpt Project An Understanding Of Introductory Physics? West Colin G.
- Llm-powered Data Augmentation For Enhanced Cross-lingual Performance Whitehouse Chenxi, Choudhury Monojit, Aji Alham Fikri
- Better Question-answering Models On A Budget Wijeratne Yudhanjaya, Marikar Ishan
- Cases Of EFL Secondary Students' Prompt Engineering Pathways To Complete A Writing Task With Chatgpt Woo David James, Guo Kai, Susanto Hengky
- From Big To Small Without Losing It All: Text Augmentation With Chatgpt For Efficient Sentiment Analysis Woźniak Stanisław, Kocoń Jan
- Extrapolating Multilingual Understanding Models As Multilingual Generators Wu Bohong, Yuan Fei, Zhao Hai, Li Lei, Xu Jingjing
- Pmc-llama: Towards Building Open-source Language Models For Medicine Wu Chaoyi, Lin Weixiong, Zhang Xiaoman, Zhang Ya, Wang Yanfeng, Xie Weidi
- Exploring The Limits Of Chatgpt In Software Security Applications Wu Fangzhou, Zhang Qingzhao, Bajaj Ati Priya, Bao Tiffany, Zhang Ning, Wang Ruoyu "fish", Xiao Chaowei
- Speechgen: Unlocking The Generative Power Of Speech Language Models With Prompts Wu Haibin, Chang Kai-wei, Wu Yuan-kuei, Lee Hung-yi
- Large Language Models Leverage External Knowledge To Extend Clinical Insight Beyond Language Boundaries Wu Jiageng, Wu Xian, Qiu Zhaopeng, Li Minghui, Zhang Yingying, Zheng Yefeng, Yuan Changzheng, Yang Jie
- Lamini-lm: A Diverse Herd Of Distilled Models From Large-scale Instructions Wu Minghao, Waheed Abdul, Zhang Chiyu, Abdul-mageed Muhammad, Aji Alham Fikri
- A Comparative Study Of Open-source Large Language Models, GPT-4 And Claude 2: Multiple-choice Test Taking In Nephrology Wu Sean, Koo Michael, Blum Lesley, Black Andy, Kao Liyo, Scalzo Fabien, Kurtz Ira
- Next-gpt: Any-to-any Multimodal LLM Wu Shengqiong, Fei Hao, Qu Leigang, Ji Wei, Chua Tat-seng
- A New Dialogue Response Generation Agent For Large Language Models By Asking Questions To Detect User's Intentions Wu Siwei, Shen Xiangqing, Xia Rui
- Unveiling Security, Privacy, And Ethical Concerns Of Chatgpt Wu Xiaodong, Duan Ran, Ni Jianbing
- An Early Evaluation Of Gpt-4v(ision) Wu Yang, Wang Shilong, Yang Hao, Zheng Tian, Zhang Hongbo, Zhao Yanyan, Qin Bing
- Improving Cross-task Generalization With Step-by-step Instructions Wu Yang, Zhao Yanyan, Li Zhongyang, Qin Bing, Xiong Kai
- Jailbreaking GPT-4V Via Self-adversarial Attacks With System Prompts Wu Yuanwei, Li Xiang, Liu Yixin, Zhou Pan, Sun Lichao
- A Simple Baseline For Knowledge-based Visual Question Answering Xenos Alexandros, Stafylakis Themos, Patras Ioannis, Tzimiropoulos Georgios
- Conversational Automated Program Repair Xia Chunqiu Steven, Zhang Lingming
- Language Models Meet World Models: Embodied Experiences Enhance Language Models Xiang Jiannan, Tao Tianhua, Gu Yi, Shu Tianmin, Wang Zirui, Yang Zichao, Hu Zhiting
- Personalized Abstractive Summarization By Tri-agent Generation Pipeline Xiao Wen, Xie Yujia, Carenini Giuseppe, He Pengcheng
- Supporting Qualitative Analysis With Large Language Models: Combining Codebook With GPT-3 For Deductive Coding Xiao Ziang, Yuan Xingdi, Liao Q. Vera, Abdelghani Rania, Oudeyer Pierre-yves
- Rethinking The Evaluation For Conversational Recommendation In The Era Of Large Language Models Xiaolei Wang, Xinyu Tang, Wayne Xin Zhao, Jingyuan Wang, Ji-rong Wen
- The Wall Street Neophyte: A Zero-shot Analysis Of Chatgpt Over Multimodal Stock Movement Prediction Challenges Xie Qianqian, Han Weiguang, Lai Yanzhao, Peng Min, Huang Jimin
- Tencentllmeval: A Hierarchical Evaluation Of Real-world Capabilities For Human-aligned Llms Xie Shuyi, Yao Wenlin, Dai Yong, Wang Shaobo, Zhou Donlin, Jin Lifeng, Feng Xinhua, Wei Pengzhi, Lin Yujie, Hu Zhichao, Yu Dong, Zhang Zhengyou, Nie Jing, Liu Yuhong
- Empirical Study Of Zero-shot NER With Chatgpt Xie Tingyu, Li Qi, Zhang Jian, Zhang Yan, Liu Zuozhu, Wang Hongwei
- Analysis Of The Reasoning With Redundant Information Provided Ability Of Large Language Models Xie Wenbei
- Translating Natural Language To Planning Goals With Large-language Models Xie Yaqi, Yu Chen, Zhu Tongyao, Bai Jinbin, Gong Ze, Soh Harold
- Olagpt: Empowering Llms With Human-like Problem-solving Abilities Xie Yuanzhen, Xie Tao, Lin Mingxiong, Wei Wentao, Li Chenglin, Kong Beibei, Chen Lei, Zhuo Chengxiang, Hu Bo, Li Zang
- The Next Chapter: A Study Of Large Language Models In Storytelling Xie Zhuohan, Cohn Trevor, Lau Jey Han
- A Preliminary Study Of Chatgpt On News Recommendation: Personalization, Provider Fairness, Fake News Xinyi Li, Yongfeng Zhang, Edward C. Malthouse
- Doctorglm: Fine-tuning Your Chinese Doctor Is Not A Herculean Task Xiong Honglin, Wang Sheng, Zhu Yitao, Zhao Zihao, Liu Yuxiao, Huang Linlin, Wang Qian, Shen Dinggang
- TRIGO: Benchmarking Formal Mathematical Proof Reduction For Generative Language Models Xiong Jing, Shen Jianhao, Yuan Ye, Wang Haiming, Yin Yichun, Liu Zhengying, Li Lin, Guo Zhijiang, Cao Qingxing, Huang Yinya, Zheng Chuanyang, Liang Xiaodan, Zhang Ming, Liu Qun
- Examining Inter-consistency Of Large Language Models Collaboration: An In-depth Analysis Via Debate Xiong Kai, Ding Xiao, Cao Yixin, Liu Ting, Qin Bing
- Can Llms Express Their Uncertainty? An Empirical Evaluation Of Confidence Elicitation In Llms Xiong Miao, Hu Zhiyuan, Lu Xinyang, Li Yifei, Fu Jie, He Junxian, Hooi Bryan
- The Program Testing Ability Of Large Language Models For Code Xiong Weimin, Guo Yiwen, Chen Hao
- Effective Long-context Scaling Of Foundation Models Xiong Wenhan, Liu Jingyu, Molybog Igor, Zhang Hejia, Bhargava Prajjwal, Hou Rui, Martin Louis, Rungta Rashi, Sankararaman Karthik Abinav, Oguz Barlas, Khabsa Madian, Fang Han, Mehdad Yashar, Narang Sharan, Malik Kshitiz, Fan Angela, Bhosale Shruti, Edunov Sergey, Lewis Mike, Wang Sinong, Ma Hao
- Expertprompting: Instructing Large Language Models To Be Distinguished Experts Xu Benfeng, Yang An, Lin Junyang, Wang Quan, Zhou Chang, Zhang Yongdong, Mao Zhendong
- Rewoo: Decoupling Reasoning From Observations For Efficient Augmented Language Models Xu Binfeng, Peng Zhiyuan, Lei Bowen, Mukherjee Subhabrata, Liu Yuchen, Xu Dongkuan
- Baize: An Open-source Chat Model With Parameter-efficient Tuning On Self-chat Data Xu Canwen, Guo Daya, Duan Nan, Mcauley Julian
- Automatic Pair Construction For Contrastive Post-training Xu Canwen, Rosset Corby, Chau Ethan C., Del Corro Luciano, Mahajan Shweti, Mcauley Julian, Neville Jennifer, Awadallah Ahmed Hassan, Rao Nikhil
- Small Models Are Valuable Plug-ins For Large Language Models Xu Canwen, Xu Yichong, Wang Shuohang, Liu Yang, Zhu Chenguang, Mcauley Julian
- Ctc-based Non-autoregressive Speech Translation Xu Chen, Liu Xiaoqian, Liu Xiaowen, Sun Qingxuan, Zhang Yuhao, Yang Murun, Dong Qianqian, Ko Tom, Wang Mingxuan, Xiao Tong, Ma Anxiang, Zhu Jingbo
- Are Large Language Models Really Good Logical Reasoners? A Comprehensive Evaluation And Beyond Xu Fangzhi, Lin Qika, Han Jiawei, Zhao Tianzhe, Liu Jun, Cambria Erik
- A Paradigm Shift In Machine Translation: Boosting Translation Performance Of Large Language Models Xu Haoran, Kim Young Jin, Sharaf Amr, Awadalla Hany Hassan
- Penetrative AI: Making Llms Comprehend The Physical World Xu Huatao, Han Liying, Yang Qirui, Li Mo, Srivastava Mani
- Superclue: A Comprehensive Chinese Large Language Model Benchmark Xu Liang, Li Anqi, Zhu Lei, Xue Hang, Zhu Changtai, Zhao Kangkang, He Haonan, Zhang Xuanwei, Kang Qiyue, Lan Zhenzhong
- Sc-safety: A Multi-round Open-ended Question Adversarial Safety Benchmark For Large Language Models In Chinese Xu Liang, Zhao Kangkang, Zhu Lei, Xue Hang
- Tensorgpt: Efficient Compression Of The Embedding Layer In Llms Based On The Tensor-train Decomposition Xu Mingxue, Xu Yao Lei, Mandic Danilo P.
- Cognitive Overload: Jailbreaking Large Language Models With Overloaded Logical Thinking Xu Nan, Wang Fei, Zhou Ben, Li Bang Zheng, Xiao Chaowei, Chen Muhao
- On The Tool Manipulation Capability Of Open-source Large Language Models Xu Qiantong, Hong Fenglu, Li Bo, Hu Changran, Chen Zhengyu, Zhang Jian
- Does Conceptual Representation Require Embodiment? Insights From Large Language Models Xu Qihui, Peng Yingying, Nastase Samuel A., Chodorow Martin, Wu Minghua, Li Ping
- Fine-tuned Llms Know More, Hallucinate Less With Few-shot Sequence-to-sequence Semantic Parsing Over Wikidata Xu Silei, Liu Shicheng, Culhane Theo, Pertseva Elizaveta, Wu Meng-hsi, Semnani Sina J., Lam Monica S.
- Shattering The Agent-environment Interface For Fine-tuning Inclusive Language Models Xu Wanqiao, Dong Shi, Arumugam Dilip, Van Roy Benjamin
- INSTRUCTSCORE: Explainable Text Generation Evaluation With Finegrained Feedback Xu Wenda, Wang Danqing, Pan Liangming, Song Zhenqiao, Freitag Markus, Wang William Yang, Li Lei
- Re-reading Improves Reasoning In Large Language Models Xu Xiaohan, Tao Chongyang, Shen Tao, Xu Can, Xu Hongbo, Long Guodong, Lou Jian-guang
- An LLM Can Fool Itself: A Prompt-based Adversarial Attack Xu Xilie, Kong Keyi, Liu Ning, Cui Lizhen, Wang Di, Zhang Jingfeng, Kankanhalli Mohan
- Mental-llm: Leveraging Large Language Models For Mental Health Prediction Via Online Text Data Xu Xuhai, Yao Bingsheng, Dong Yuanzhe, Gabriel Saadia, Yu Hong, Hendler James, Ghassemi Marzyeh, Dey Anind K., Wang Dakuo
- Llms And The Abstraction And Reasoning Corpus: Successes, Failures, And The Importance Of Object-based Representations Xu Yudong, Li Wenhao, Vaezipoor Pashootan, Sanner Scott, Khalil Elias B.
- Drivegpt4: Interpretable End-to-end Autonomous Driving Via Large Language Model Xu Zhenhua, Zhang Yujia, Xie Enze, Zhao Zhen, Guo Yong, Wong Kwan-yee K., Li Zhenguo, Zhao Hengshuang
- IDOL: Indicator-oriented Logic Pre-training For Logical Reasoning Xu Zihang, Yang Ziqing, Cui Yiming, Wang Shijin
- How Robust Is GPT-3.5 To Predecessors? A Comprehensive Study On Language Understanding Tasks Xuanting Chen, Junjie Ye, Can Zu, Nuo Xu, Rui Zheng, Minlong Peng, Jie Zhou, Tao Gui, Qi Zhang, Xuanjing Huang
- Occuquest: Mitigating Occupational Bias For Inclusive Large Language Models Xue Mingfeng, Liu Dayiheng, Yang Kexin, Dong Guanting, Lei Wenqiang, Yuan Zheng, Zhou Chang, Zhou Jingren
- RCOT: Detecting And Rectifying Factual Inconsistency In Reasoning By Reversing Chain-of-thought Xue Tianci, Wang Ziqi, Wang Zhenhailong, Han Chi, Yu Pengfei, Ji Heng
- Scaling Evidence-based Instructional Design Expertise Through Large Language Models Yadav Gautam
- Optimizing Machine Translation Through Prompt Engineering: An Investigation Into Chatgpt's Customizability Yamada Masaru
- Evaluating Spatial Understanding Of Large Language Models Yamada Yutaro, Bao Yihan, Lampinen Andrew K., Kasai Jungo, Yildirim Ilker
- LPML: Llm-prompting Markup Language For Mathematical Reasoning Yamauchi Ryutaro, Sonoda Sho, Sannai Akiyoshi, Kumagai Wataru
- Voila-a: Aligning Vision-language Models With User's Gaze Attention Yan Kun, Ji Lei, Wang Zeyu, Wang Yuntao, Duan Nan, Ma Shuai
- Practical And Ethical Challenges Of Large Language Models In Education: A Systematic Scoping Review Yan Lixiang, Sha Lele, Zhao Linxuan, Li Yuheng, Martinez-maldonado Roberto, Chen Guanliang, Li Xinyu, Jin Yueqiao, Gašević Dragan
- Refining The Responses Of Llms By Themselves Yan Tianqiang, Xu Tiansheng
- Multimodal Chatgpt For Medical Applications: An Experimental Study Of GPT-4V Yan Zhiling, Zhang Kai, Zhou Rong, He Lifang, Li Xiang, Sun Lichao
- Can You Follow Me? Testing Situational Understanding In Chatgpt Yang Chenghao, Ettinger Allyson
- Beyond Testers' Biases: Guiding Model Testing With Knowledge Bases Using Llms Yang Chenyang, Rustogi Rishabh, Brower-sinning Rachel, Lewis Grace A., Kästner Christian, Wu Tongshuang
- Refgpt: Dialogue Generation Of GPT, By GPT, And For GPT Yang Dongjie, Yuan Ruifeng, Fan Yuantao, Yang Yifei, Wang Zili, Wang Shusen, Zhao Hai
- PRCA: Fitting Black-box Large Language Models For Retrieval Question Answering Via Pluggable Reward-driven Contextual Adapter Yang Haoyan, Li Zhitao, Zhang Yong, Wang Jianzong, Cheng Ning, Li Ming, Xiao Jing
- Auto-gpt For Online Decision Making: Benchmarks And Additional Opinions Yang Hui, Yue Sifu, He Yunzhong
- Towards Interpretable Mental Health Analysis With Large Language Models Yang Kailai, Ji Shaoxiong, Zhang Tianlin, Xie Qianqian, Kuang Ziyan, Ananiadou Sophia
- Give Us The Facts: Enhancing Large Language Models With Knowledge Graphs For Fact-aware Language Modeling Yang Linyao, Chen Hongyang, Li Zhao, Ding Xiao, Wu Xindong
- Supervised Knowledge Makes Large Language Models Better In-context Learners Yang Linyi, Zhang Shuibai, Yu Zhuohao, Bao Guangsheng, Wang Yidong, Wang Jindong, Xu Ruochen, Ye Wei, Xie Xing, Chen Weizhu, Zhang Yue
- Integrating UMLS Knowledge Into Large Language Models For Medical Question Answering Yang Rui, Marrese-taylor Edison, Ke Yuhe, Cheng Lechao, Chen Qingyu, Li Irene
- Zhongjing: Enhancing The Chinese Medical Capabilities Of Large Language Model Through Expert Feedback And Real-world Multi-turn Dialogue Yang Songhua, Zhao Hanjie, Zhu Senbin, Zhou Guangyu, Xu Hongfei, Jia Yuxiang, Zan Hongying
- Psycot: Psychological Questionnaire As Powerful Chain-of-thought For Personality Detection Yang Tao, Shi Tianyuan, Wan Fanqi, Quan Xiaojun, Wang Qifan, Wu Bingzhe, Wu Jiaxiang
- Neural Machine Translation Data Generation And Augmentation Using Chatgpt Yang Wayne, Nicolai Garrett
- Bigtranslate: Augmenting Large Language Models With Multilingual Translation Capability Over 100 Languages Yang Wen, Li Chong, Zhang Jiajun, Zong Chengqing
- Exploring An LM To Generate Prolog Predicates From Mathematics Questions Yang Xiaocheng, Tam Yik-cheung
- Human-in-the-loop Machine Translation With Large Language Model Yang Xinyi, Zhan Runzhe, Wong Derek F., Wu Junchao, Chao Lidia S.
- Bias A-head? Analyzing Bias In Transformer-based Language Model Attention Heads Yang Yi, Duan Hanyu, Abbasi Ahmed, Lalor John P., Tam Kar Yan
- Harnessing The Power Of Large Language Models For Natural Language To First-order Logic Translation Yang Yuan, Xiong Siheng, Payani Ali, Shareghi Ehsan, Fekri Faramarz
- On The Planning, Search, And Memorization Capabilities Of Large Language Models Yang Yunhao, Tomar Anshul
- GPT Can Solve Mathematical Problems Without A Calculator Yang Zhen, Ding Ming, Lv Qingsong, Jiang Zhihuan, He Zehai, Guo Yuyi, Bai Jinfeng, Tang Jie
- TEAL: Tokenize And Embed ALL For Multi-modal Large Language Models Yang Zhen, Zhang Yingxue, Meng Fandong, Zhou Jie
- Surpassing GPT-4 Medical Coding With A Two-stage Approach Yang Zhichao, Batra Sanjit Singh, Stremmel Joel, Halperin Eran
- Coupling Large Language Models With Logic Programming For Robust And General Reasoning From Text Yang Zhun, Ishay Adam, Lee Joohyung
- I-code V2: An Autoregressive Generation Framework Over Vision, Language, And Speech Data Yang Ziyi, Khademi Mahmoud, Xu Yichong, Pryzant Reid, Fang Yuwei, Zhu Chenguang, Chen Dongdong, Qian Yao, Gao Mei, Chen Yi-ling, Gmyr Robert, Kanda Naoyuki, Codella Noel, Xiao Bin, Shi Yu, Yuan Lu, Yoshioka Takuya, Zeng Michael, Huang Xuedong
- Benchmarking Llm-based Machine Translation On Cultural Awareness Yao Binwei, Jiang Ming, Yang Diyi, Hu Junjie
- Improving Language Model Negotiation With Self-play And In-context Learning From AI Feedback Yao Fu, Hao Peng, Tushar Khot, Mirella Lapata
- LLM Lies: Hallucinations Are Not Bugs, But Features As Adversarial Examples Yao Jia-yu, Ning Kun-peng, Liu Zhen-hui, Ning Mu-nan, Liu Yu-yang, Yuan Li
- Embodiedgpt: Vision-language Pre-training Via Embodied Chain Of Thought Yao Mu, Qinglong Zhang, Mengkang Hu, Wenhai Wang, Mingyu Ding, Jun Jin, Bin Wang, Jifeng Dai, Yu Qiao, Ping Luo
- A Survey On Large Language Model (LLM) Security And Privacy: The Good, The Bad, And The Ugly Yao Yifan, Duan Jinhao, Xu Kaidi, Cai Yuanfang, Sun Zhibo, Zhang Yue
- Do Physicians Know How To Prompt? The Need For Automatic Prompt Optimization Help In Clinical Note Generation Yao Zonghai, Jaafar Ahmed, Wang Beining, Yang Zhichao, Yu Hong
- Diffusion Language Models Can Perform Many Tasks With Scaling And Instruction-finetuning Ye Jiasheng, Zheng Zaixiang, Bao Yu, Qian Lihua, Gu Quanquan
- Rational Decision-making Agent With Internalized Utility Judgment Ye Yining, Cong Xin, Tian Shizuo, Qin Yujia, Liu Chong, Lin Yankai, Liu Zhiyuan, Sun Maosong
- A Multitask, Multilingual, Multimodal Evaluation Of Chatgpt On Reasoning, Hallucination, And Interactivity Yejin Bang, Samuel Cahyawijaya, Nayeon Lee, Wenliang Dai, Dan Su, Bryan Wilie, Holy Lovenia, Ziwei Ji, Tiezheng Yu, Willy Chung, Quyet V. Do, Yan Xu, Pascale Fung
- Agent Lumos: Unified And Modular Training For Open-source Language Agents Yin Da, Brahman Faeze, Ravichander Abhilasha, Chandu Khyathi, Chang Kai-wei, Choi Yejin, Lin Bill Yuchen
- Dynosaur: A Dynamic Growth Paradigm For Instruction-tuning Data Curation Yin Da, Liu Xiao, Yin Fan, Zhong Ming, Bansal Hritik, Han Jiawei, Chang Kai-wei
- Do Large Language Models Know What They Don't Know? Yin Zhangyue, Sun Qiushi, Guo Qipeng, Wu Jiawen, Qiu Xipeng, Huang Xuanjing
- LAMM: Language-assisted Multi-modal Instruction-tuning Dataset, Framework, And Benchmark Yin Zhenfei, Wang Jiong, Cao Jianjian, Shi Zhelun, Liu Dingning, Li Mukai, Sheng Lu, Bai Lei, Huang Xiaoshui, Wang Zhiyong, Shao Jing, Ouyang Wanli
- Analyzing And Mitigating Object Hallucination In Large Vision-language Models Yiyang Zhou, Chenhang Cui, Jaehong Yoon, Linjun Zhang, Zhun Deng, Chelsea Finn, Mohit Bansal, Huaxiu Yao
- Low-resource Languages Jailbreak GPT-4 Yong Zheng-xin, Menghini Cristina, Bach Stephen H.
- Prompting Multilingual Large Language Models To Generate Code-mixed Texts: The Case Of South East Asian Languages Yong Zheng-xin, Zhang Ruochen, Forde Jessica Zosa, Wang Skyler, Subramonian Arjun, Lovenia Holy, Cahyawijaya Samuel, Winata Genta Indra, Sutawika Lintang, Cruz Jan Christian Blaise, Tan Yin Lin, Phan Long, Garcia Rowena, Solorio Thamar, Aji Alham Fikri
- Hugginggpt: Solving AI Tasks With Chatgpt And Its Friends In Hugging Face Yongliang Shen, Kaitao Song, Xu Tan, Dongsheng Li, Weiming Lu, Yueting Zhuang
- From Text To Motion: Grounding GPT-4 In A Humanoid Robot "alter3" Yoshida Takahide, Masumori Atsushi, Ikegami Takashi
- Fundamental Limitations Of Alignment In Large Language Models Yotam Wolf, Noam Wies, Oshri Avnery, Yoav Levine, Amnon Shashua
- OVM, Outcome-supervised Value Models For Planning In Mathematical Reasoning Yu Fei, Gao Anningzhe, Wang Benyou
- GPTFUZZER: Red Teaming Large Language Models With Auto-generated Jailbreak Prompts Yu Jiahao, Lin Xingwei, Yu Zheng, Xing Xinyu
- Assessing Prompt Injection Risks In 200+ Custom Gpts Yu Jiahao, Wu Yuhang, Shu Dong, Jin Mingyu, Yang Sabrina, Xing Xinyu
- Scaling Autoregressive Multi-modal Models: Pretraining And Instruction Tuning Yu Lili, Shi Bowen, Pasunuru Ramakanth, Muller Benjamin, Golovneva Olga, Wang Tianlu, Babu Arun, Tang Binh, Karrer Brian, Sheynin Shelly, Ross Candace, Polyak Adam, Howes Russell, Sharma Vasu, Xu Puxin, Tamoyan Hovhannes, Ashual Oron, Singer Uriel, Li Shang-wen, Zhang Susan, James Richard, Ghosh Gargi, Taigman Yaniv, Fazel-zarandi Maryam, Celikyilmaz Asli, Zettlemoyer Luke, Aghajanyan Armen
- MEGABYTE: Predicting Million-byte Sequences With Multiscale Transformers Yu Lili, Simig Dániel, Flaherty Colin, Aghajanyan Armen, Zettlemoyer Luke, Lewis Mike
- Metamath: Bootstrap Your Own Mathematical Questions For Large Language Models Yu Longhui, Jiang Weisen, Shi Han, Yu Jincheng, Liu Zhengying, Zhang Yu, Kwok James T., Li Zhenguo, Weller Adrian, Liu Weiyang
- Characterizing Mechanisms For Factual Recall In Language Models Yu Qinan, Merullo Jack, Pavlick Ellie
- Seqgpt: An Out-of-the-box Large Language Model For Open Domain Sequence Understanding Yu Tianyu, Jiang Chengyue, Lou Chao, Huang Shen, Wang Xiaobin, Liu Wei, Cai Jiong, Li Yangning, Li Yinghui, Tu Kewei, Zheng Hai-tao, Zhang Ningyu, Xie Pengjun, Huang Fei, Jiang Yong
- Chain-of-note: Enhancing Robustness In Retrieval-augmented Language Models Yu Wenhao, Zhang Hongming, Pan Xiaoman, Ma Kaixin, Wang Hongwei, Yu Dong
- Prompt-based Monte-carlo Tree Search For Goal-oriented Dialogue Policy Planning Yu Xiao, Chen Maximillian, Yu Zhou
- Reeval: Automatic Hallucination Evaluation For Retrieval-augmented Large Language Models Via Transferable Adversarial Attacks Yu Xiaodong, Cheng Hao, Liu Xiaodong, Roth Dan, Gao Jianfeng
- Large Language Model As Attributed Training Data Generator: A Tale Of Diversity And Bias Yu Yue, Zhuang Yuchen, Zhang Jieyu, Meng Yu, Ratner Alexander, Krishna Ranjay, Shen Jiaming, Zhang Chao
- Prophet: Prompting Large Language Models With Complementary Answer Heuristics For Knowledge-based Visual Question Answering Yu Zhou, Ouyang Xuecheng, Shao Zhenwei, Wang Meng, Yu Jun
- CRAFT: Customizing Llms By Creating And Retrieving From Specialized Toolsets Yuan Lifan, Chen Yangyi, Wang Xingyao, Fung Yi R., Peng Hao, Ji Heng
- Evolving Large Language Model Assistant With Long-term Conditional Memory Yuan Ruifeng, Sun Shichao, Wang Zili, Cao Ziqiang, Li Wenjie
- Beneath Surface Similarity: Large Language Models Make Reasonable Scientific Analogies After Structure Abduction Yuan Siyu, Chen Jiangjie, Ge Xuyang, Xiao Yanghua, Yang Deqing
- Artgpt-4: Towards Artistic-understanding Large Vision-language Models With Enhanced Adapter Yuan Zhengqing, He Yunhong, Wang Kun, Ye Yanfang, Sun Lichao
- Evotext: Enhancing Natural Language Generation Models Via Self-escalation Learning For Up-to-date Knowledge And Improved Performance Yuan Zhengqing, Xue Huiwen, Zhang Chao, Liu Yongming
- Large Language Model Cascades With Mixture Of Thoughts Representations For Cost-efficient Reasoning Yue Murong, Zhao Jie, Zhang Min, Du Liang, Yao Ziyu
- Mammoth: Building Math Generalist Models Through Hybrid Instruction Tuning Yue Xiang, Qu Xingwei, Zhang Ge, Fu Yao, Huang Wenhao, Sun Huan, Su Yu, Chen Wenhu
- Prompt Highlighter: Interactive Control For Multi-modal Llms Yuechen Zhang, Shengju Qian, Bohao Peng, Shu Liu, Jiaya Jia
- Chat-rec: Towards Interactive And Explainable Llms-augmented Recommender System Yunfan Gao, Tao Sheng, Youlin Xiang, Yun Xiong, Haofen Wang, Jiawei Zhang
- On Evaluating Adversarial Robustness Of Large Vision-language Models Yunqing Zhao, Tianyu Pang, Chao Du, Xiao Yang, Chongxuan Li, Ngai-man Cheung, Min Lin
- Large Language Models Are Zero-shot Rankers For Recommender Systems Yupeng Hou, Junjie Zhang, Zihan Lin, Hongyu Lu, Ruobing Xie, Julian Mcauley, Wayne Xin Zhao
- Planting A SEED Of Vision In Large Language Model Yuying Ge, Yixiao Ge, Ziyun Zeng, Xintao Wang, Ying Shan
- Mascqa: A Question Answering Dataset For Investigating Materials Science Knowledge Of Large Language Models Zaki Mohd, Jayadeva, Mausam, Krishnan N. M. Anoop
- Fairness-aware Structured Pruning In Transformers Zayed Abdelrahman, Mordido Goncalo, Shabanian Samira, Baldini Ioana, Chandar Sarath
- What Matters In Training A Gpt4-style Language Model With Multimodal Inputs? Zeng Yan, Zhang Hanbo, Zheng Jiani, Xia Jiangnan, Wei Guoqiang, Wei Yang, Zhang Yuchen, Kong Tao
- MR-GSM8K: A Meta-reasoning Benchmark For Large Language Model Evaluation Zeng Zhongshen, Chen Pengguang, Liu Shu, Jiang Haiyun, Jia Jiaya
- Is Chatgpt A Good Sentiment Analyzer? A Preliminary Study Zengzhi Wang, Qiming Xie, Yi Feng, Zixiang Ding, Zinong Yang, Rui Xia
- Text Alignment Is An Efficient Unified Model For Massive NLP Tasks Zha Yuheng, Yang Yichi, Li Ruichen, Hu Zhiting
- Investigating The Catastrophic Forgetting In Multimodal Large Language Models Zhai Yuexiang, Tong Shengbang, Li Xiao, Cai Mu, Qu Qing, Lee Yong Jae, Ma Yi
- G3detector: General Gpt-generated Text Detector Zhan Haolan, He Xuanli, Xu Qiongkai, Wu Yuxiang, Stenetorp Pontus
- Removing RLHF Protections In GPT-4 Via Fine-tuning Zhan Qiusi, Fang Richard, Bindu Rohan, Gupta Akul, Hashimoto Tatsunori, Kang Daniel
- Deceptive AI Ecosystems: The Case Of Chatgpt Zhan Xiao, Xu Yifan, Sarkadi Stefan
- A Simple LLM Framework For Long-range Video Question-answering Zhang Ce, Lu Taixi, Islam Md Mohaiminul, Wang Ziyang, Yu Shoubin, Bansal Mohit, Bertasius Gedas
- Mm-narrator: Narrating Long-form Videos With Multimodal In-context Learning Zhang Chaoyi, Lin Kevin, Yang Zhengyuan, Wang Jianfeng, Li Linjie, Lin Chung-ching, Liu Zicheng, Wang Lijuan
- A Comprehensive Analysis Of The Effectiveness Of Large Language Models As Automatic Dialogue Evaluators Zhang Chen, D'haro Luis Fernando, Chen Yiming, Zhang Malu, Li Haizhou
- Xdial-eval: A Multilingual Open-domain Dialogue Evaluation Benchmark Zhang Chen, D'haro Luis Fernando, Tang Chengguang, Shi Ke, Tang Guohua, Li Haizhou
- Task-agnostic Distillation Of Encoder-decoder Language Models Zhang Chen, Yang Yang, Wang Jingang, Song Dawei
- The Skipped Beat: A Study Of Sociopragmatic Understanding In Llms For 64 Languages Zhang Chiyu, Doan Khai Duy, Liao Qisheng, Abdul-mageed Muhammad
- Taxonomy-based Checklist For Large Language Model Evaluation Zhang Damin
- Mobile-env: Building Qualified Evaluation Benchmarks For LLM-GUI Interaction Zhang Danyang, Shen Zhennan, Xie Rui, Zhang Situo, Xie Tianbao, Zhao Zihan, Chen Siyuan, Chen Lu, Xu Hongshen, Cao Ruisheng, Yu Kai
- Controllable Text Generation With Residual Memory Transformer Zhang Hanqing, Si Sun, Wu Haiming, Song Dawei
- Tuning Large Language Model For End-to-end Speech Translation Zhang Hao, Si Nianwen, Chen Yaqi, Zhang Wenlin, Yang Xukui, Qu Dan, Jiao Xiaolin
- Uncovering Hidden Connections: Iterative Search And Reasoning For Video-grounded Dialog Zhang Haoyu, Liu Meng, Wang Yaowei, Cao Da, Guan Weili, Nie Liqiang
- Tractable Control For Autoregressive Language Generation Zhang Honghua, Dang Meihua, Peng Nanyun, Broeck Guy Van Den
- Empirical Evaluation Of Chatgpt On Requirements Information Retrieval Under Zero-shot Setting Zhang Jianzhang, Chen Yiyang, Niu Nan, Wang Yinglin, Liu Chuang
- The Potential And Pitfalls Of Using A Large Language Model Such As Chatgpt Or GPT-4 As A Clinical Assistant Zhang Jingqing, Sun Kai, Jagadeesh Akshay, Ghahfarokhi Mahta, Gupta Deepa, Gupta Ashok, Gupta Vibhor, Guo Yike
- Llm-based Medical Assistant Personalization With Short- And Long-term Memory Coordination Zhang Kai, Kang Yangyang, Zhao Fubang, Liu Xiaozhong
- Moqagpt: Zero-shot Multi-modal Open-domain Question Answering With Large Language Model Zhang Le, Wu Yihong, Mo Fengran, Nie Jian-yun, Agrawal Aishwarya
- Knowgpt: Knowledge Graph Based Prompting For Large Language Models Zhang Qinggang, Dong Junnan, Chen Hao, Zha Daochen, Yu Zailiang, Huang Xiao
- Automl-gpt: Automatic Machine Learning With GPT Zhang Shujian, Gong Chengyue, Wu Lemeng, Liu Xingchao, Zhou Mingyuan
- Learning To Predict Concept Ordering For Common Sense Generation Zhang Tianhui, Bollegala Danushka, Peng Bei
- The Wisdom Of Hindsight Makes Language Models Better Instruction Followers Zhang Tianjun, Liu Fangchen, Wong Justin, Abbeel Pieter, Gonzalez Joseph E.
- M3exam: A Multilingual, Multimodal, Multilevel Benchmark For Examining Large Language Models Zhang Wenxuan, Aljunied Sharifah Mahani, Gao Chang, Chia Yew Ken, Bing Lidong
- Don't Trust Chatgpt When Your Question Is Not In English: A Study Of Multilingual Abilities And Types Of Llms Zhang Xiang, Li Senyu, Hauer Bradley, Shi Ning, Kondrak Grzegorz
- Large Language Models In Medical Term Classification And Unexpected Misalignment Between Response And Reasoning Zhang Xiaodan, Vemulapalli Sandeep, Talukdar Nabasmita, Ahn Sumyeong, Wang Jiankun, Meng Han, Murtaza Sardar Mehtab Bin, Dave Aakash Ajay, Leshchiner Dmitry, Joseph Dimitri F., Witteveen-lane Martin, Chesla Dave, Zhou Jiayu, Chen Bin
- Gpt-4v(ision) As A Generalist Evaluator For Vision-language Tasks Zhang Xinlu, Lu Yujie, Wang Weizhi, Yan An, Yan Jun, Qin Lianke, Wang Heng, Yan Xifeng, Wang William Yang, Petzold Linda Ruth
- CGCE: A Chinese Generative Chat Evaluation Benchmark For General And Financial Domains Zhang Xuanyu, Li Bingbing, Yang Qing
- Llavar: Enhanced Visual Instruction Tuning For Text-rich Image Understanding Zhang Yanzhe, Zhang Ruiyi, Gu Jiuxiang, Zhou Yufan, Lipka Nedim, Yang Diyi, Sun Tong
- Meta Prompting For AI Systems Zhang Yifan, Yuan Yang, Yao Andrew Chi-chih
- Probing The Multi-turn Planning Capabilities Of Llms Via 20 Question Games Zhang Yizhe, Lu Jiarui, Jaitly Navdeep
- Alleviating Hallucinations Of Large Language Models Through Induced Hallucinations Zhang Yue, Cui Leyang, Bi Wei, Shi Shuming
- Llmeval: A Preliminary Study On How To Evaluate Large Language Models Zhang Yue, Zhang Ming, Yuan Haipeng, Liu Shichun, Shi Yongyao, Gui Tao, Zhang Qi, Huang Xuanjing
- Mani-gpt: A Generative Model For Interactive Robotic Manipulation Zhang Zhe, Chai Wei, Wang Jiankun
- IAG: Induction-augmented Generation Framework For Answering Reasoning Questions Zhang Zhebin, Zhang Xinyu, Ren Yuanhang, Shi Saijiang, Han Meng, Wu Yongkang, Lai Ruofei, Cao Zhao
- VISAR: A Human-ai Argumentative Writing Assistant With Visual Programming And Rapid Draft Prototyping Zhang Zheng, Gao Jie, Dhaliwal Ranjodh Singh, Li Toby Jia-jun
- Students' Perceptions And Preferences Of Generative Artificial Intelligence Feedback For Programming Zhang Zhengdong, Dong Zihan, Shi Yang, Matsuda Noboru, Price Thomas, Xu Dongkuan
- H₂O: Heavy-hitter Oracle For Efficient Generative Inference Of Large Language Models Zhang Zhenyu, Sheng Ying, Zhou Tianyi, Chen Tianlong, Zheng Lianmin, Cai Ruisi, Song Zhao, Tian Yuandong, Ré Christopher, Barrett Clark, Wang Zhangyang, Chen Beidi
- Defending Large Language Models Against Jailbreaking Attacks Through Goal Prioritization Zhang Zhexin, Yang Junxiao, Ke Pei, Mi Fei, Wang Hongning, Huang Minlie
- Accountable Textual-visual Chat Learns To Reject Human Instructions In Image Re-creation Zhang Zhiwei, Liu Yuliang
- Unifying The Perspectives Of NLP And Software Engineering: A Survey On Language Models For Code Zhang Ziyin, Chen Chaoyu, Liu Bingchang, Liao Cong, Gong Zi, Yu Hang, Li Jianguo, Wang Rui
- MELA: Multilingual Evaluation Of Linguistic Acceptability Zhang Ziyin, Liu Yikang, Huang Weifang, Mao Junyu, Wang Rui, Hu Hai
- SVIT: Scaling Up Visual Instruction Tuning Zhao Bo, Wu Boya, He Muyang, Huang Tiejun
- Large Language Models Are Complex Table Parsers Zhao Bowen, Ji Changkai, Zhang Yuejie, He Wen, Wang Yingwen, Wang Qing, Feng Rui, Zhang Xiaobo
- Genixer: Empowering Multimodal Large Language Models As A Powerful Data Generator Zhao Henry Hengyuan, Zhou Pan, Shou Mike Zheng
- Explicit Planning Helps Language Models In Logical Reasoning Zhao Hongyu, Wang Kangrui, Yu Mo, Mei Hongyuan
- Automatic Model Selection With Large Language Models For Reasoning Zhao James Xu, Xie Yuxi, Kawaguchi Kenji, He Junxian, Xie Michael Qizhe
- Chbias: Bias Evaluation And Mitigation Of Chinese Conversational Language Models Zhao Jiaxu, Fang Meng, Shi Zijing, Li Yitong, Chen Ling, Pechenizkiy Mykola
- Chatspot: Bootstrapping Multimodal Llms Via Precise Referring Instruction Tuning Zhao Liang, Yu En, Ge Zheng, Yang Jinrong, Wei Haoran, Zhou Hongyu, Sun Jianjian, Peng Yuang, Dong Runpei, Han Chunrui, Zhang Xiangyu
- Verify-and-edit: A Knowledge-enhanced Chain-of-thought Framework Zhao Ruochen, Li Xingxuan, Joty Shafiq, Qin Chengwei, Bing Lidong
- Group Preference Optimization: Few-shot Alignment Of Large Language Models Zhao Siyan, Dang John, Grover Aditya
- A Survey Of Large Language Models Zhao Wayne Xin, Zhou Kun, Li Junyi, Tang Tianyi, Wang Xiaolei, Hou Yupeng, Min Yingqian, Zhang Beichen, Zhang Junjie, Dong Zican, Du Yifan, Yang Chen, Chen Yushuo, Chen Zhipeng, Jiang Jinhao, Ren Ruiyang, Li Yifan, Tang Xinyu, Liu Zikang, Liu Peiyu, Nie Jian-yun, Wen Ji-rong
- Is Chatgpt Equipped With Emotional Dialogue Capabilities? Zhao Weixiang, Zhao Yanyan, Lu Xin, Wang Shilong, Tong Yanpeng, Qin Bing
- Babystories: Can Reinforcement Learning Teach Baby Language Models To Write Better Stories? Zhao Xingmeng, Wang Tongnian, Osborn Sheri, Rios Anthony
- Mind Vs. Mouth: On Measuring Re-judge Inconsistency Of Social Bias In Large Language Models Zhao Yachao, Wang Bo, Zhao Dongming, Huang Kun, Wang Yan, He Ruifang, Hou Yuexian
- Investigating Table-to-text Generation Capabilities Of Llms In Real-world Information Seeking Scenarios Zhao Yilun, Zhang Haowei, Si Shengyun, Nan Linyong, Tang Xiangru, Cohan Arman
- Robut: A Systematic Study Of Table QA Robustness Against Human-annotated Adversarial Perturbations Zhao Yilun, Zhao Chen, Nan Linyong, Qi Zhenting, Zhang Wenlin, Tang Xiangru, Mi Boyu, Radev Dragomir
- Knowing What Llms DO NOT Know: A Simple Yet Effective Self-detection Method Zhao Yukun, Yan Lingyong, Sun Weiwei, Xing Guoliang, Meng Chong, Wang Shuaiqiang, Cheng Zhicong, Ren Zhaochun, Yin Dawei
- Mllm-dataengine: An Iterative Refinement Approach For MLLM Zhao Zhiyuan, Ouyang Linke, Wang Bin, Huang Siyuan, Zhang Pan, Dong Xiaoyi, Wang Jiaqi, He Conghui
- Can We Edit Factual Knowledge By In-context Learning? Zheng Ce, Li Lei, Dong Qingxiu, Fan Yuxuan, Wu Zhiyong, Xu Jingjing, Chang Baobao
- Progressive-hint Prompting Improves Reasoning In Large Language Models Zheng Chuanyang, Liu Zhengying, Xie Enze, Li Zhenguo, Li Yu
- Take A Step Back: Evoking Reasoning Via Abstraction In Large Language Models Zheng Huaixiu Steven, Mishra Swaroop, Chen Xinyun, Cheng Heng-tze, Chi Ed H., Le Quoc V, Zhou Denny
- Candidate Soups: Fusing Candidate Results Improves Translation Quality For Non-autoregressive Translation Zheng Huanran, Zhu Wei, Wang Pengfei, Wang Xiaoling
- Lmsys-chat-1m: A Large-scale Real-world LLM Conversation Dataset Zheng Lianmin, Chiang Wei-lin, Sheng Ying, Li Tianle, Zhuang Siyuan, Wu Zhanghao, Zhuang Yonghao, Li Zhuohan, Lin Zi, Xing Eric P., Gonzalez Joseph E., Stoica Ion, Zhang Hao
- Judging Llm-as-a-judge With Mt-bench And Chatbot Arena Zheng Lianmin, Chiang Wei-lin, Sheng Ying, Zhuang Siyuan, Wu Zhanghao, Zhuang Yonghao, Lin Zi, Li Zhuohan, Li Dacheng, Xing Eric P., Zhang Hao, Gonzalez Joseph E., Stoica Ion
- Memorycompanion: A Smart Healthcare Solution To Empower Efficient Alzheimer's Care Via Unleashing Generative AI Zheng Lifei, Heo Yeonie, Fang Yi
- Trafficsafetygpt: Tuning A Pre-trained Large Language Model To A Domain-specific Expert In Transportation Safety Zheng Ou, Abdel-aty Mohamed, Wang Dongdong, Wang Chenzhu, Ding Shengxuan
- Gpt-fathom: Benchmarking Large Language Models To Decipher The Evolutionary Path Towards GPT-4 And Beyond Zheng Shen, Zhang Yuyu, Zhu Yijie, Xi Chenguang, Gao Pengyang, Zhou Xun, Chang Kevin Chen-chuan
- Building Emotional Support Chatbots In The Era Of Llms Zheng Zhonghua, Liao Lizi, Deng Yang, Nie Liqiang
- Marinegpt: Unlocking Secrets Of Ocean To The Public Zheng Ziqiang, Zhang Jipeng, Vu Tuan-anh, Diao Shizhe, Tim Yue Him Wong, Yeung Sai-kit
- MM-REACT: Prompting Chatgpt For Multimodal Reasoning And Action Zhengyuan Yang, Linjie Li, Jianfeng Wang, Kevin Lin, Ehsan Azarnasab, Faisal Ahmed, Zicheng Liu, Ce Liu, Michael Zeng, Lijuan Wang
- Llamarec: Two-stage Recommendation Using Large Language Models For Ranking Zhenrui Yue, Sara Rabhi, Gabriel De Souza Pereira Moreira, Dong Wang, Even Oldridge
- Aligning Large Multimodal Models With Factually Augmented RLHF Zhiqing Sun, Sheng Shen, Shengcao Cao, Haotian Liu, Chunyuan Li, Yikang Shen, Chuang Gan, Liang-yan Gui, Yu-xiong Wang, Yiming Yang, Kurt Keutzer, Trevor Darrell
- Principle-driven Self-alignment Of Language Models From Scratch With Minimal Human Supervision Zhiqing Sun, Yikang Shen, Qinhong Zhou, Hongxin Zhang, Zhenfang Chen, David Cox, Yiming Yang, Chuang Gan
- Beyond Hallucinations: Enhancing Lvlms Through Hallucination-aware Direct Preference Optimization Zhiyuan Zhao, Bin Wang, Linke Ouyang, Xiaoyi Dong, Jiaqi Wang, Conghui He
- Bookgpt: A General Framework For Book Recommendation Empowered By Large Language Model Zhiyuli Aakas, Chen Yanfang, Zhang Xuan, Liang Xun
- Can Chatgpt Understand Too? A Comparative Study On Chatgpt And Fine-tuned BERT Zhong Qihuang, Ding Liang, Liu Juhua, Du Bo, Tao Dacheng
- Chatabl: Abductive Learning Via Natural Language Interaction With Chatgpt Zhong Tianyang, Wei Yaonai, Yang Li, Wu Zihao, Liu Zhengliang, Wei Xiaozheng, Li Wenjun, Yao Junjie, Ma Chong, Li Xiang, Zhu Dajiang, Jiang Xi, Han Junwei, Shen Dinggang, Liu Tianming, Zhang Tuo
- Mquake: Assessing Knowledge Editing In Language Models Via Multi-hop Questions Zhong Zexuan, Wu Zhengxuan, Manning Christopher D., Potts Christopher, Chen Danqi
- Language Agent Tree Search Unifies Reasoning Acting And Planning In Language Models Zhou Andy, Yan Kai, Shlapentokh-rothman Michal, Wang Haohan, Wang Yu-xiong
- Solving Challenging Math Word Problems Using GPT-4 Code Interpreter With Code-based Self-verification Zhou Aojun, Wang Ke, Lu Zimu, Shi Weikang, Luo Sichun, Qin Zipeng, Lu Shaoqing, Jia Anya, Song Linqi, Zhan Mingjie, Li Hongsheng
- Navgpt: Explicit Reasoning In Vision-and-language Navigation With Large Language Models Zhou Gengze, Hong Yicong, Wu Qi
- Ethical Chatgpt: Concerns, Challenges, And Commandments Zhou Jianlong, Müller Heimo, Holzinger Andreas, Chen Fang
- Characterglm: Customizing Chinese Conversational AI Characters With Large Language Models Zhou Jinfeng, Chen Zhuang, Wan Dazhen, Wen Bosi, Song Yi, Yu Jifan, Huang Yongkang, Peng Libiao, Yang Jiaming, Xiao Xiyao, Sabour Sahand, Zhang Xiaohan, Hou Wenjing, Zhang Yijia, Dong Yuxiao, Tang Jie, Huang Minlie
- GPT As A Baseline For Recommendation Explanation Texts Zhou Joyce, Joachims Thorsten
- CONA: A Novel Context-aware Instruction Paradigm For Communication Using Large Language Model Zhou Nan, Tao Xinghui, Chen Xi
- Exploring Recommendation Capabilities Of Gpt-4v(ision): A Preliminary Case Study Zhou Peilin, Cao Meng, Huang You-liang, Ye Qichen, Zhang Peiyan, Liu Junling, Xie Yueqi, Hua Yining, Kim Jaeboum
- Unleashing GPT On The Metaverse: Savior Or Destroyer? Zhou Pengyuan
- Chinese Intermediate English Learners Outdid Chatgpt In Deep Cohesion: Evidence From English Narrative Writing Zhou Tongquan, Cao Siyi, Zhou Siruo, Zhang Yao, He Aijing
- Complementary Advantages Of Chatgpts And Human Readers In Reasoning: Evidence From English Text Reading Comprehension Zhou Tongquan, Zhang Yao, Cao Siyi, Li Yulu, Wang Tao
- Recurrentgpt: Interactive Generation Of (arbitrarily) Long Text Zhou Wangchunshu, Jiang Yuchen Eleanor, Cui Peng, Wang Tiannan, Xiao Zhenxin, Hou Yifan, Cotterell Ryan, Sachan Mrinmaya
- Multi-stage Pre-training Enhanced By Chatgpt For Multi-scenario Multi-domain Dialogue Summarization Zhou Weixiao, Li Gengyao, Cheng Xianfu, Liang Xinnian, Zhu Junnan, Zhai Feifei, Li Zhoujun
- Flame: Few-shot Learning From Natural Language Explanations Zhou Yangqiaoyu, Zhang Yiming, Tan Chenhao
- Can GPT Models Follow Human Summarization Guidelines? Evaluating Chatgpt And GPT-4 For Dialogue Summarization Zhou Yongxin, Ringeval Fabien, Portet François
- Quantifying And Analyzing Entity-level Memorization In Large Language Models Zhou Zhenhong, Xiang Jiuyang, Chen Chaomeng, Su Sen
- Avatargpt: All-in-one Framework For Motion Understanding, Planning, Generation And Beyond Zhou Zixiang, Wan Yu, Wang Baoyuan
- VL-GPT: A Generative Pre-trained Transformer For Vision And Language Understanding And Generation Zhu Jinguo, Ding Xiaohan, Ge Yixiao, Ge Yuying, Zhao Sijie, Zhao Hengshuang, Wang Xiaohua, Shan Ying
- Judgelm: Fine-tuned Large Language Models Are Scalable Judges Zhu Lianghui, Wang Xinggang, Wang Xinlong
- Coca: Fusing Position Embedding With Collinear Constrained Attention In Transformers For Long Context Window Extending Zhu Shiyi, Ye Jing, Jiang Wei, Xue Siqiao, Zhang Qi, Wu Yifan, Li Jianguo
- Multilingual Machine Translation With Large Language Models: Empirical Results And Analysis Zhu Wenhao, Liu Hongyi, Dong Qingxiu, Xu Jingjing, Huang Shujian, Kong Lingpeng, Chen Jiajun, Li Lei
- 3d-vista: Pre-trained Transformer For 3D Vision And Text Alignment Zhu Ziyu, Ma Xiaojian, Chen Yixin, Deng Zhidong, Huang Siyuan, Li Qing
- Red Teaming Chatgpt Via Jailbreaking: Bias, Robustness, Reliability And Toxicity Zhuo Terry Yue, Huang Yujin, Chen Chunyang, Xing Zhenchang
- Recommender Systems In The Era Of Large Language Models (llms) Zihuai Zhao, Wenqi Fan, Jiatong Li, Yunqing Liu, Xiaowei Mei, Yiqi Wang, Zhen Wen, Fei Wang, Xiangyu Zhao, Jiliang Tang, Qing Li
- A Family Of Pretrained Transformer Language Models For Russian Zmitrovich Dmitry, Abramov Alexander, Kalmykov Andrey, Tikhonova Maria, Taktasheva Ekaterina, Astafurov Danil, Baushenko Mark, Snegirev Artem, Kadulin Vitalii, Markov Sergey, Shavrina Tatiana, Mikhailov Vladislav, Fenogenova Alena
- Universal And Transferable Adversarial Attacks On Aligned Language Models Zou Andy, Wang Zifan, Carlini Nicholas, Nasr Milad, Kolter J. Zico, Fredrikson Matt
- Dr Chatgpt, Tell Me What I Want To Hear: How Prompt Knowledge Impacts Health Answer Correctness Zuccon Guido, Koopman Bevan
- Chatgpt Hallucinates When Attributing Answers Zuccon Guido, Koopman Bevan, Shaik Razia
- A Dataset And Benchmark For Hospital Course Summarization With Adapted Large Language Models Aali Asad, Van Veen Dave, Arefeen Yamin Ishraq, Hom Jason, Bluethgen Christian, Reis Eduardo Pontes, Gatidis Sergios, Clifford Namuun, Daws Joseph, Tehrani Arash S., Kim Jangwon, Chaudhari Akshay S.
- Benchmarking Large Language Models For Persian: A Preliminary Study Focusing On Chatgpt Abaskohi Amirhossein, Baruni Sara, Masoudi Mostafa, Abbasi Nesa, Babalou Mohammad Hadi, Edalat Ali, Kamahi Sepehr, Sani Samin Mahdizadeh, Naghavian Nikoo, Namazifard Danial, Sadeghi Pouya, Yaghoobzadeh Yadollah
- Bcamirs At Semeval-2024 Task 4: Beyond Words: A Multimodal And Multilingual Exploration Of Persuasion In Memes Abaskohi Amirhossein, Dabiriaghdam Amirhossein, Wang Lele, Carenini Giuseppe
- Knowledge-infused Llm-powered Conversational Health Agent: A Case Study For Diabetes Patients Abbasian Mahyar, Yang Zhongqi, Khatibi Elahe, Zhang Pengfei, Nagesh Nitish, Azimi Iman, Jain Ramesh, Rahmani Amir M.
- Granite-function Calling Model: Introducing Function Calling Abilities Via Multi-task Learning Of Granular Tasks Abdelaziz Ibrahim, Basu Kinjal, Agarwal Mayank, Kumaravel Sadhana, Stallone Matthew, Panda Rameswar, Rizk Yara, Bhargav Gp, Crouse Maxwell, Gunasekara Chulaka, Ikbal Shajith, Joshi Sachin, Karanam Hima, Kumar Vineet, Munawar Asim, Neelam Sumit, Raghu Dinesh, Sharma Udit, Soria Adriana Meza, Sreedhar Dheeraj, Venkateswaran Praveen, Unuvar Merve, Cox David, Roukos Salim, Lastras Luis, Kapanipathi Pavan
- Phi-3 Technical Report: A Highly Capable Language Model Locally On Your Phone Abdin Marah, Aneja Jyoti, Awadalla Hany, Awadallah Ahmed, Awan Ammar Ahmad, Bach Nguyen, Bahree Amit, Bakhtiari Arash, Bao Jianmin, Behl Harkirat, Benhaim Alon, Bilenko Misha, Bjorck Johan, Bubeck Sébastien, Cai Martin, Cai Qin, Chaudhary Vishrav, Chen Dong, Chen Dongdong, Chen Weizhu, Chen Yen-chun, Chen Yi-ling, Cheng Hao, Chopra Parul, Dai Xiyang, Dixon Matthew, Eldan Ronen, Fragoso Victor, Gao Jianfeng, Gao Mei, Gao Min, Garg Amit, Del Giorno Allie, Goswami Abhishek, Gunasekar Suriya, Haider Emman, Hao Junheng, Hewett Russell J., Hu Wenxiang, Huynh Jamie, Iter Dan, Jacobs Sam Ade, Javaheripi Mojan, Jin Xin, Karampatziakis Nikos, Kauffmann Piero, Khademi Mahoud, Kim Dongwoo, Kim Young Jin, Kurilenko Lev, Lee James R., Lee Yin Tat, Li Yuanzhi, Li Yunsheng, Liang Chen, Liden Lars, Lin Xihui, Lin Zeqi, Liu Ce, Liu Liyuan, Liu Mengchen, Liu Weishung, Liu Xiaodong, Luo Chong, Madan Piyush, Mahmoudzadeh Ali, Majercak David, Mazzola Matt, Mendes Caio César Teodoro, Mitra Arindam, Modi Hardik, Nguyen Anh, Norick Brandon, Patra Barun, Perez-becker Daniel, Portet Thomas, Pryzant Reid, Qin Heyang, Radmilac Marko, Ren Liliang, De Rosa Gustavo, Rosset Corby, Roy Sambudha, Ruwase Olatunji, Saarikivi Olli, Saied Amin, Salim Adil, Santacroce Michael, Shah Shital, Shang Ning, Sharma Hiteshi, Shen Yelong, Shukla Swadheen, Song Xia, Tanaka Masahiro, Tupini Andrea, Vaddamanu Praneetha, Wang Chunyu, Wang Guanhua, Wang Lijuan, Wang Shuohang, Wang Xin, Wang Yu, Ward Rachel, Wen Wen, Witte Philipp, Wu Haiping, Wu Xiaoxia, Wyatt Michael, Xiao Bin, Xu Can, Xu Jiahang, Xu Weijian, Xue Jilong, Yadav Sonali, Yang Fan, Yang Jianwei, Yang Yifan, Yang Ziyi, Yu Donghan, Yuan Lu, Zhang Chenruidong, Zhang Cyril, Zhang Jianwen, Zhang Li Lyna, Zhang Yi, Zhang Yue, Zhang Yunan, Zhou Xiren
- Synthetic Dialogue Dataset Generation Using LLM Agents Abdullin Yelaman, Molla-aliod Diego, Ofoghi Bahadorreza, Yearwood John, Li Qingyang
- The Llama 3 Herd Of Models Abhimanyu Dubey, Abhinav Jauhri, Abhinav Pandey, Abhishek Kadian, Ahmad Al-dahle, Aiesha Letman, Akhil Mathur, Alan Schelten, Amy Yang, Angela Fan, Anirudh Goyal, Anthony Hartshorn, Aobo Yang, Archi Mitra, Archie Sravankumar, Artem Korenev, Arthur Hinsvark, Arun Rao, Aston Zhang, Aurelien Rodriguez, Austen Gregerson, Ava Spataru, Baptiste Roziere, Bethany Biron, Binh Tang, Bobbie Chern, Charlotte Caucheteux, Chaya Nayak, Chloe Bi, Chris Marra, Chris Mcconnell, Christian Keller, Christophe Touret, Chunyang Wu, Corinne Wong, Cristian Canton Ferrer, Cyrus Nikolaidis, Damien Allonsius, Daniel Song, Danielle Pintz, Danny Livshits, David Esiobu, Dhruv Choudhary, Dhruv Mahajan, Diego Garcia-olano, Diego Perino, Dieuwke Hupkes, Egor Lakomkin, Ehab Albadawy, Elina Lobanova, Emily Dinan, Eric Michael Smith, Filip Radenovic, Frank Zhang, Gabriel Synnaeve, Gabrielle Lee, Georgia Lewis Anderson, Graeme Nail, Gregoire Mialon, Guan Pang, Guillem Cucurell, Hailey Nguyen, Hannah Korevaar, Hu Xu, Hugo Touvron, Iliyan Zarov, Imanol Arrieta Ibarra, Isabel Kloumann, Ishan Misra, Ivan Evtimov, Jade Copet, Jaewon Lee, Jan Geffert, Jana Vranes, Jason Park, Jay Mahadeokar, Jeet Shah, Jelmer Van Der Linde, Jennifer Billock, Jenny Hong, Jenya Lee, Jeremy Fu, Jianfeng Chi, Jianyu Huang, Jiawen Liu, Jie Wang, Jiecao Yu, Joanna Bitton, Joe Spisak, Jongsoo Park, Joseph Rocca, Joshua Johnstun, Joshua Saxe, Junteng Jia, Kalyan Vasuden Alwala, Kartikeya Upasani, Kate Plawiak, Ke Li, Kenneth Heafield, Kevin Stone, Khalid El-arini, Krithika Iyer, Kshitiz Malik, Kuenley Chiu, Kunal Bhalla, Lauren Rantala-yeary, Laurens Van Der Maaten, Lawrence Chen, Liang Tan, Liz Jenkins, Louis Martin, Lovish Madaan, Lubo Malo, Lukas Blecher, Lukas Landzaat, Luke De Oliveira, Madeline Muzzi, Mahesh Pasupuleti, Mannat Singh, Manohar Paluri, Marcin Kardas, Mathew Oldham, Mathieu Rita, Maya Pavlova, Melanie Kambadur, Mike Lewis, Min Si, Mitesh Kumar Singh, Mona Hassan, Naman Goyal, Narjes Torabi, Nikolay Bashlykov, Nikolay Bogoychev, Niladri Chatterji, Olivier Duchenne, Onur Çelebi, Patrick Alrassy, Pengchuan Zhang, Pengwei Li, Petar Vasic, Peter Weng, Prajjwal Bhargava, Pratik Dubal, Praveen Krishnan, Punit Singh Koura, Puxin Xu, Qing He, Qingxiao Dong, Ragavan Srinivasan, Raj Ganapathy, Ramon Calderer, Ricardo Silveira Cabral, Robert Stojnic, Roberta Raileanu, Rohit Girdhar, Rohit Patel, Romain Sauvestre, Ronnie Polidoro, Roshan Sumbaly, Ross Taylor, Ruan Silva, Rui Hou, Rui Wang, Saghar Hosseini, Sahana Chennabasappa, Sanjay Singh, Sean Bell, Seohyun Sonia Kim, Sergey Edunov, Shaoliang Nie, Sharan Narang, Sharath Raparthy, Sheng Shen, Shengye Wan, Shruti Bhosale, Shun Zhang, Simon Vandenhende, Soumya Batra, Spencer Whitman, Sten Sootla, Stephane Collot, Suchin Gururangan, Sydney Borodinsky, Tamar Herman, Tara Fowler, Tarek Sheasha, Thomas Georgiou, Thomas Scialom, Tobias Speckbacher, Todor Mihaylov, Tong Xiao, Ujjwal Karn, Vedanuj Goswami, Vibhor Gupta, Vignesh Ramanathan, Viktor Kerkez, Vincent Gonguet, Virginie Do, Vish Vogeti, Vladan Petrovic, Weiwei Chu, Wenhan Xiong, Wenyin Fu, Whitney Meers, Xavier Martinet, Xiaodong Wang, Xiaoqing Ellen Tan, Xinfeng Xie, Xuchao Jia, Xuewei Wang, Yaelle Goldschlag, Yashesh Gaur, Yasmine Babaei, Yi Wen, Yiwen Song, Yuchen Zhang, Yue Li, Yuning Mao, Zacharie Delpierre Coudert, Zheng Yan, Zhengxing Chen, Zoe Papakipos, Aaditya Singh, Aaron Grattafiori, Abha Jain, Adam Kelsey, Adam Shajnfeld, Adithya Gangidi, Adolfo Victoria, Ahuva Goldstand, Ajay Menon, Ajay Sharma, Alex Boesenberg, Alex Vaughan, Alexei Baevski, Allie Feinstein, Amanda Kallet, Amit Sangani, Anam Yunus, Andrei Lupu, Andres Alvarado, Andrew Caples, Andrew Gu, Andrew Ho, Andrew Poulton, Andrew Ryan, Ankit Ramchandani, Annie Franco, Aparajita Saraf, Arkabandhu Chowdhury, Ashley Gabriel, Ashwin Bharambe, Assaf Eisenman, Azadeh Yazdan, Beau James, Ben Maurer, Benjamin Leonhardi, Bernie Huang, Beth Loyd, Beto De Paola, Bhargavi Paranjape, Bing Liu, Bo Wu, Boyu Ni, Braden Hancock, Bram Wasti, Brandon Spence, Brani Stojkovic, Brian Gamido, Britt Montalvo, Carl Parker, Carly Burton, Catalina Mejia, Changhan Wang, Changkyu Kim, Chao Zhou, Chester Hu, Ching-hsiang Chu, Chris Cai, Chris Tindal, Christoph Feichtenhofer, Damon Civin, Dana Beaty, Daniel Kreymer, Daniel Li, Danny Wyatt, David Adkins, David Xu, Davide Testuggine, Delia David, Devi Parikh, Diana Liskovich, Didem Foss, Dingkang Wang, Duc Le, Dustin Holland, Edward Dowling, Eissa Jamil, Elaine Montgomery, Eleonora Presani, Emily Hahn, Emily Wood, Erik Brinkman, Esteban Arcaute, Evan Dunbar, Evan Smothers, Fei Sun, Felix Kreuk, Feng Tian, Firat Ozgenel, Francesco Caggioni, Francisco Guzmán, Frank Kanayet, Frank Seide, Gabriela Medina Florez, Gabriella Schwarz, Gada Badeer, Georgia Swee, Gil Halpern, Govind Thattai, Grant Herman, Grigory Sizov, Guangyi Zhang, Guna Lakshminarayanan, Hamid Shojanazeri, Han Zou, Hannah Wang, Hanwen Zha, Haroun Habeeb, Harrison Rudolph, Helen Suk, Henry Aspegren, Hunter Goldman, Ibrahim Damlaj, Igor Molybog, Igor Tufanov, Irina-elena Veliche, Itai Gat, Jake Weissman, James Geboski, James Kohli, Japhet Asher, Jean-baptiste Gaya, Jeff Marcus, Jeff Tang, Jennifer Chan, Jenny Zhen, Jeremy Reizenstein, Jeremy Teboul, Jessica Zhong, Jian Jin, Jingyi Yang, Joe Cummings, Jon Carvill, Jon Shepard, Jonathan Mcphie, Jonathan Torres, Josh Ginsburg, Junjie Wang, Kai Wu, Kam Hou U, Karan Saxena, Karthik Prasad, Kartikay Khandelwal, Katayoun Zand, Kathy Matosich, Kaushik Veeraraghavan, Kelly Michelena, Keqian Li, Kun Huang, Kunal Chawla, Kushal Lakhotia, Kyle Huang, Lailin Chen, Lakshya Garg, Lavender A, Leandro Silva, Lee Bell, Lei Zhang, Liangpeng Guo, Licheng Yu, Liron Moshkovich, Luca Wehrstedt, Madian Khabsa, Manav Avalani, Manish Bhatt, Maria Tsimpoukelli, Martynas Mankus, Matan Hasson, Matthew Lennie, Matthias Reso, Maxim Groshev, Maxim Naumov, Maya Lathi, Meghan Keneally, Michael L. Seltzer, Michal Valko, Michelle Restrepo, Mihir Patel, Mik Vyatskov, Mikayel Samvelyan, Mike Clark, Mike Macey, Mike Wang, Miquel Jubert Hermoso, Mo Metanat, Mohammad Rastegari, Munish Bansal, Nandhini Santhanam, Natascha Parks, Natasha White, Navyata Bawa, Nayan Singhal, Nick Egebo, Nicolas Usunier, Nikolay Pavlovich Laptev, Ning Dong, Ning Zhang, Norman Cheng, Oleg Chernoguz, Olivia Hart, Omkar Salpekar, Ozlem Kalinli, Parkin Kent, Parth Parekh, Paul Saab, Pavan Balaji, Pedro Rittner, Philip Bontrager, Pierre Roux, Piotr Dollar, Polina Zvyagina, Prashant Ratanchandani, Pritish Yuvraj, Qian Liang, Rachad Alao, Rachel Rodriguez, Rafi Ayub, Raghotham Murthy, Raghu Nayani, Rahul Mitra, Raymond Li, Rebekkah Hogan, Robin Battey, Rocky Wang, Rohan Maheswari, Russ Howes, Ruty Rinott, Sai Jayesh Bondu, Samyak Datta, Sara Chugh, Sara Hunt, Sargun Dhillon, Sasha Sidorov, Satadru Pan, Saurabh Verma, Seiji Yamamoto, Sharadh Ramaswamy, Shaun Lindsay, Shaun Lindsay, Sheng Feng, Shenghao Lin, Shengxin Cindy Zha, Shiva Shankar, Shuqiang Zhang, Shuqiang Zhang, Sinong Wang, Sneha Agarwal, Soji Sajuyigbe, Soumith Chintala, Stephanie Max, Stephen Chen, Steve Kehoe, Steve Satterfield, Sudarshan Govindaprasad, Sumit Gupta, Sungmin Cho, Sunny Virk, Suraj Subramanian, Sy Choudhury, Sydney Goldman, Tal Remez, Tamar Glaser, Tamara Best, Thilo Kohler, Thomas Robinson, Tianhe Li, Tianjun Zhang, Tim Matthews, Timothy Chou, Tzook Shaked, Varun Vontimitta, Victoria Ajayi, Victoria Montanez, Vijai Mohan, Vinay Satish Kumar, Vishal Mangla, Vítor Albiero, Vlad Ionescu, Vlad Poenaru, Vlad Tiberiu Mihailescu, Vladimir Ivanov, Wei Li, Wenchen Wang, Wenwen Jiang, Wes Bouaziz, Will Constable, Xiaocheng Tang, Xiaofang Wang, Xiaojian Wu, Xiaolan Wang, Xide Xia, Xilun Wu, Xinbo Gao, Yanjun Chen, Ye Hu, Ye Jia, Ye Qi, Yenda Li, Yilin Zhang, Ying Zhang, Yossi Adi, Youngjin Nam, Yu Wang, Yuchen Hao, Yundi Qian, Yuzi He, Zach Rait, Zachary Devito, Zef Rosnbrick, Zhaoduo Wen, Zhenyu Yang, Zhiwei Zhao
- Infercept: Efficient Intercept Support For Augmented Large Language Model Inference Abhyankar Reyna, He Zijian, Srivatsa Vikranth, Zhang Hao, Zhang Yiying
- Stackrag Agent: Improving Developer Answers With Retrieval-augmented Generation Abrahamyan Davit, Fard Fatemeh H.
- Knowledge Graphs As Context Sources For Llm-based Explanations Of Learning Recommendations Abu-rasheed Hasan, Weber Christian, Fathi Madjid
- Pathocl: Path-based Prompt Augmentation For OCL Generation With GPT-4 Abukhalaf Seif, Hamdaqa Mohammad, Khomh Foutse
- A Survey On Symbolic Knowledge Distillation Of Large Language Models Acharya Kamal, Velasquez Alvaro, Song Houbing Herbert
- Keyformer: KV Cache Reduction Through Key Tokens Selection For Efficient Generative Inference Adnan Muhammad, Arunkumar Akhil, Jain Gaurav, Nair Prashant J., Soloveychik Ilya, Kamath Purushotham
- Towards Optimizing And Evaluating A Retrieval Augmented QA Chatbot Using Llms With Human In The Loop Afzal Anum, Kowsik Alexander, Fani Rajna, Matthes Florian
- Copilot Evaluation Harness: Evaluating Llm-guided Software Programming Agarwal Anisha, Chan Aaron, Chandel Shubham, Jang Jinu, Miller Shaun, Moghaddam Roshanak Zilouchian, Mohylevskyy Yevhen, Sundaresan Neel, Tufano Michele
- Ethical Reasoning And Moral Value Alignment Of Llms Depend On The Language We Prompt Them In Agarwal Utkarsh, Tanmay Kumar, Khandelwal Aditi, Choudhury Monojit
- Tricy: Trigger-guided Data-to-text Generation With Intent Aware Attention-copy Agarwal Vibhav, Ghosh Sourav, Bss Harichandana, Arora Himanshu, Raja Barath Raj Kandur
- Codemirage: Hallucinations In Code Generated By Large Language Models Agarwal Vibhor, Pei Yulong, Alamir Salwa, Liu Xiaomo
- Enabling High-sparsity Foundational Llama Models With Efficient Pretraining And Deployment Agarwalla Abhinav, Gupta Abhay, Marques Alexandre, Pandit Shubhra, Goin Michael, Kurtic Eldar, Leong Kevin, Nguyen Tuan, Salem Mahmoud, Alistarh Dan, Lie Sean, Kurtz Mark
- Look Further Ahead: Testing The Limits Of GPT-4 In Path Planning Aghzal Mohamed, Plaku Erion, Yao Ziyu
- Leapformer: Enabling Linear Transformers For Autoregressive And Simultaneous Tasks Via Learned Proportions Agostinelli Victor, Hong Sanghyun, Chen Lizhong
- Illuminate: A Novel Approach For Depression Detection With Explainable Analysis And Proactive Therapy Using Prompt Engineering Agrawal Aryan
- Can't Remember Details In Long Documents? You Need Some R&R Agrawal Devanshu, Gao Shang, Gajek Martin
- Can Llms Perform Structured Graph Reasoning? Agrawal Palaash, Vasania Shavak, Tan Cheston
- LM4OPT: Unveiling The Potential Of Large Language Models In Formulating Mathematical Optimization Problems Ahmed Tasnim, Choudhury Salimur
- Linguistic Intelligence In Large Language Models For Telecommunications Ahmed Tasnim, Piovesan Nicola, De Domenico Antonio, Choudhury Salimur
- Timechara: Evaluating Point-in-time Character Hallucination Of Role-playing Large Language Models Ahn Jaewoo, Lee Taehyun, Lim Junyoung, Kim Jin-hwa, Yun Sangdoo, Lee Hwaran, Kim Gunhee
- Enhancing Pre-trained Generative Language Models With Question Attended Span Extraction On Machine Reading Comprehension Ai Lin, Hui Zheng, Liu Zizhou, Hirschberg Julia
- QASE Enhanced Plms: Improved Control In Text Generation For MRC Ai Lin, Hui Zheng, Liu Zizhou, Hirschberg Julia
- Semantically Diverse Language Generation For Uncertainty Estimation In Language Models Aichberger Lukas, Schweighofer Kajetan, Ielanskyi Mykyta, Hochreiter Sepp
- Have We Reached AGI? Comparing Chatgpt, Claude, And Gemini To Human Literacy And Education Benchmarks Akpan Mfon
- History Of Generative Artificial Intelligence (AI) Chatbots: Past, Present, And Future Development Al-amin Md., Ali Mohammad Shazed, Salam Abdus, Khan Arif, Ali Ashraf, Ullah Ahsan, Alam Md Nur, Chowdhury Shamsul Kabir
- A Survey Of Large Language Models For European Languages Ali Wazir, Pyysalo Sampo
- Chatgpt Alternative Solutions: Large Language Models Survey Alipour Hanieh, Pendar Nick, Roy Kohinoor
- Sabiá-2: A New Generation Of Portuguese Large Language Models Almeida Thales Sales, Abonizio Hugo, Nogueira Rodrigo, Pires Ramon
- Syntaxshap: Syntax-aware Explainability Method For Text Generation Amara Kenza, Sevastjanova Rita, El-assady Mennatallah
- Large Language Models For Psycholinguistic Plausibility Pretesting Amouyal Samuel Joseph, Meltzer-asscher Aya, Berant Jonathan
- Training-free Long-context Scaling Of Large Language Models An Chenxin, Huang Fei, Zhang Jun, Gong Shansan, Qiu Xipeng, Zhou Chang, Kong Lingpeng
- Enhancing Journalism With AI: A Study Of Contextualized Image Captioning For News Articles Using Llms And Lmms Anagnostopoulou Aliki, Gouvea Thiago, Sonntag Daniel
- Does Refusal Training In Llms Generalize To The Past Tense? Andriushchenko Maksym, Flammarion Nicolas
- BAMO At Semeval-2024 Task 9: BRAINTEASER: A Novel Task Defying Common Sense Ansari Baktash, Rostamkhani Mohammadmostafa, Eetemadi Sauleh
- Simpler Becomes Harder: Do Llms Exhibit A Coherent Behavior On Simplified Corpora? Anschütz Miriam, Mosca Edoardo, Groh Georg
- Evaluating Chatgpt On Nuclear Domain-specific Data Anwar Muhammad, De Costa Mischa, Hammad Issam, Lau Daniel
- An Assessment On Comprehending Mental Health Through Large Language Models Arcan Mihael, Niland David-paul, Delahunty Fionn
- Forklift: An Extensible Neural Lifter Armengol-estapé Jordi, Rocha Rodrigo C. O., Woodruff Jackson, Minervini Pasquale, O'boyle Michael F. P.
- Thai Winograd Schemas: A Benchmark For Thai Commonsense Reasoning Artkaew Phakphum
- Slicegpt: Compress Large Language Models By Deleting Rows And Columns Ashkboos Saleh, Croci Maximilian L., Nascimento Marcelo Gennari Do, Hoefler Torsten, Hensman James
- Evaluating Large Language Models On The GMAT: Implications For The Future Of Business Education Ashrafimoghari Vahid, Gürkan Necdet, Suchow Jordan W.
- Assessing Llms For Zero-shot Abstractive Summarization Through The Lens Of Relevance Paraphrasing Askari Hadi, Chhabra Anshuman, Chen Muhao, Mohapatra Prasant
- Minigpt4-video: Advancing Multimodal Llms For Video Understanding With Interleaved Visual-textual Tokens Ataallah Kirolos, Shen Xiaoqian, Abdelrahman Eslam, Sleiman Essam, Zhu Deyao, Ding Jian, Elhoseiny Mohamed
- Can Open-source Llms Compete With Commercial Models? Exploring The Few-shot Performance Of Current GPT Models In Biomedical Tasks Ateia Samy, Kruschwitz Udo
- Prompt Design Matters For Computational Social Science Tasks But In Unpredictable Ways Atreja Shubham, Ashkinaze Joshua, Li Lingyao, Mendelsohn Julia, Hemphill Libby
- Feedback-generation For Programming Exercises With GPT-4 Azaiz Imen, Kiesler Natalie, Strickroth Sven
- Accuracy And Consistency Of Llms In The Registered Dietitian Exam: The Impact Of Prompt Engineering And Knowledge Retrieval Azimi Iman, Qi Mohan, Wang Li, Rahmani Amir M., Li Youlin
- Minigpt-reverse-designing: Predicting Image Adjustments Utilizing Minigpt-4 Azizi Vahid, Koochaki Fatemeh
- The Pitfalls Of Next-token Prediction Bachmann Gregor, Nagarajan Vaishnavh
- Efficient Model-agnostic Alignment Via Bayesian Persuasion Bai Fengshuo, Wang Mingzhi, Zhang Zhaowei, Chen Boyuan, Xu Yinda, Wen Ying, Yang Yaodong
- Sparsellm: Towards Global Pruning For Pre-trained Language Models Bai Guangji, Li Yijiang, Ling Chen, Kim Kibaek, Zhao Liang
- Patentgpt: A Large Language Model For Intellectual Property Bai Zilong, Zhang Ruiji, Chen Linqing, Cai Qijun, Zhong Yuan, Wang Cong, Fang Yan, Fang Jie, Sun Jing, Wang Weikuan, Zhou Lizhi, Hua Haoran, Qiu Tian, Wang Chaochao, Sun Cheng, Lu Jianping, Wang Yixin, Xia Yubin, Hu Meng, Liu Haowen, Xu Peng, Xu Licong, Bian Fu, Gu Xiaolong, Zhang Lisha, Wang Weilei, Tu Changyang
- Evaluating The Semantic Profiling Abilities Of Llms For Natural Language Utterances In Data Visualization Bako Hannah K., Bhutani Arshnoor, Liu Xinyi, Cobbina Kwesi A., Liu Zhicheng
- RAG Vs Fine-tuning: Pipelines, Tradeoffs, And A Case Study On Agriculture Balaguer Angels, Benara Vinamra, Cunha Renato Luiz De Freitas, Filho Roberto De M. Estevão, Hendry Todd, Holstein Daniel, Marsman Jennifer, Mecklenburg Nick, Malvar Sara, Nunes Leonardo O., Padilha Rafael, Sharp Morris, Silva Bruno, Sharma Swati, Aski Vijay, Chandra Ranveer
- CYGENT: A Cybersecurity Conversational Agent With Log Summarization Powered By GPT-3 Balasubramanian Prasasthy, Seby Justin, Kostakos Panos
- LAMP: A Language Model On The Map Balsebre Pasquale, Huang Weiming, Cong Gao
- Demystifying Platform Requirements For Diverse LLM Inference Use Cases Bambhaniya Abhimanyu, Raj Ritik, Jeong Geonhwa, Kundu Souvik, Srinivasan Sudarshan, Elavazhagan Midhilesh, Kumar Madhu, Krishna Tushar
- Breaking Boundaries: Investigating The Effects Of Model Editing On Cross-linguistic Performance Banerjee Somnath, Halder Avik, Mandal Rajarshi, Layek Sayan, Soboroff Ian, Hazra Rima, Mukherjee Animesh
- How (un)ethical Are Instruction-centric Responses Of Llms? Unveiling The Vulnerabilities Of Safety Guardrails To Harmful Queries Banerjee Somnath, Layek Sayan, Hazra Rima, Mukherjee Animesh
- Transforming Agency. On The Mode Of Existence Of Large Language Models Barandiaran Xabier E., Almendros Lola S.
- TTQA-RS- A Break-down Prompting Approach For Multi-hop Table-text Question Answering With Reasoning And Summarization Bardhan Jayetri, Xiao Bushi, Wang Daisy Zhe
- Slicerchat: Building A Local Chatbot For 3D Slicer Barr Colton
- STARLING: Self-supervised Training Of Text-based Reinforcement Learning Agent With Large Language Models Basavatia Shreyas, Murugesan Keerthiram, Ratnakar Shivam
- Llms' Reading Comprehension Is Affected By Parametric Knowledge And Struggles With Hypothetical Statements Basmov Victoria, Goldberg Yoav, Tsarfaty Reut
- Evolutionary Multi-objective Optimization Of Large Language Model Prompts For Balancing Sentiments Baumann Jill, Kramer Oliver
- How Do You Know That? Teaching Generative Language Models To Reference Answers To Biomedical Questions Bašaragin Bojana, Ljajić Adela, Medvecki Darija, Cassano Lorenzo, Košprdić Miloš, Milošević Nikola
- Llm2vec: Large Language Models Are Secretly Powerful Text Encoders Behnamghader Parishad, Adlakha Vaibhav, Mosbach Marius, Bahdanau Dzmitry, Chapados Nicolas, Reddy Siva
- Luna: An Evaluation Foundation Model To Catch Language Model Hallucinations With High Accuracy And Low Cost Belyi Masha, Friel Robert, Shao Shuai, Sanyal Atindriyo
- SUTRA: Scalable Multilingual Language Model Architecture Bendale Abhijit, Sapienza Michael, Ripplinger Steven, Gibbs Simon, Lee Jaewon, Mistry Pranav
- Redefining "hallucination" In Llms: Towards A Psychology-informed Framework For Mitigating Misinformation Berberette Elijah, Hutchins Jack, Sadovnik Amir
- Cotar: Chain-of-thought Attribution Reasoning With Multi-level Granularity Berchansky Moshe, Fleischer Daniel, Wasserblat Moshe, Izsak Peter
- Think Big, Generate Quick: Llm-to-slm For Fast Autoregressive Decoding Bergner Benjamin, Skliar Andrii, Royer Amelie, Blankevoort Tijmen, Asano Yuki, Bejnordi Babak Ehteshami
- Checkembed: Effective Verification Of LLM Solutions To Open-ended Tasks Besta Maciej, Paleari Lorenzo, Kubicek Ales, Nyczyk Piotr, Gerstenberger Robert, Iff Patrick, Lehmann Tomasz, Niewiadomski Hubert, Hoefler Torsten
- Enhancing Event Reasoning In Large Language Models Through Instruction Fine-tuning With Semantic Causal Graphs Bethany Mazal, Bethany Emet, Wherry Brandon, Chiang Cho-yu, Vishwamitra Nishant, Rios Anthony, Najafirad Peyman
- Taking The Next Step With Generative Artificial Intelligence: The Transformative Role Of Multimodal Large Language Models In Science Education Bewersdorff Arne, Hartmann Christian, Hornberger Marie, Seßler Kathrin, Bannert Maria, Kasneci Enkelejda, Kasneci Gjergji, Zhai Xiaoming, Nerdel Claudia
- Evaluating AI For Law: Bridging The Gap With Open-source Solutions Bhambhoria Rohan, Dahan Samuel, Li Jonathan, Zhu Xiaodan
- Self-amplify: Improving Small Language Models With Self Post Hoc Explanations Bhan Milan, Vittaut Jean-noel, Chesneau Nicolas, Lesot Marie-jeanne
- Fintral: A Family Of GPT-4 Level Multimodal Financial Large Language Models Bhatia Gagan, Nagoudi El Moatez Billah, Cavusoglu Hasan, Abdul-mageed Muhammad
- Enhancing Code Translation In Language Models With Few-shot Learning Via Retrieval-augmented Generation Bhattarai Manish, Santos Javier E., Jones Shawn, Biswas Ayan, Alexandrov Boian, O'malley Daniel
- Iterative Refinement Of Project-level Code Context For Precise Code Generation With Compiler Feedback Bi Zhangqian, Wan Yao, Wang Zheng, Zhang Hongyu, Guan Batu, Lu Fangxin, Zhang Zili, Sui Yulei, Jin Hai, Shi Xuanhua
- Supervisory Prompt Training Billa Jean Ghislain, Oh Min, Du Liang
- Assessing The Potential Of Mid-sized Language Models For Clinical QA Bolton Elliot, Xiong Betty, Muralidharan Vijaytha, Schamroth Joel, Muralidharan Vivek, Manning Christopher D., Daneshjou Roxana
- Navigating The Labyrinth: Evaluating And Enhancing Llms' Ability To Reason About Search Problems Borazjanizadeh Nasim, Herzig Roei, Darrell Trevor, Feris Rogerio, Karlinsky Leonid
- Reliable Reasoning Beyond Natural Language Borazjanizadeh Nasim, Piantadosi Steven T.
- The Unreasonable Ineffectiveness Of Nucleus Sampling On Mitigating Text Memorization Borec Luka, Sadler Philipp, Schlangen David
- Open Artificial Knowledge Borisov Vadim, Schreiber Richard H.
- Improving Language Models Trained On Translated Data With Continual Pre-training And Dictionary Learning Analysis Boughorbel Sabri, Parvez Md Rizwan, Hawasly Majd
- Retrieving Examples From Memory For Retrieval Augmented Neural Machine Translation: A Systematic Comparison Bouthors Maxime, Crego Josep, Yvon Francois
- Improving Sequential Recommendations With Llms Boz Artun, Zorgdrager Wouter, Kotti Zoe, Harte Jesse, Louridas Panos, Jannach Dietmar, Fragkoulis Marios
- The Art Of Saying No: Contextual Noncompliance In Language Models Brahman Faeze, Kumar Sachin, Balachandran Vidhisha, Dasigi Pradeep, Pyatkin Valentina, Ravichander Abhilasha, Wiegreffe Sarah, Dziri Nouha, Chandu Khyathi, Hessel Jack, Tsvetkov Yulia, Smith Noah A., Choi Yejin, Hajishirzi Hannaneh
- Reducing Transformer Key-value Cache Size With Cross-layer Attention Brandon William, Mishra Mayank, Nrusimha Aniruddha, Panda Rameswar, Kelly Jonathan Ragan
- Large Language Models "ad Referendum": How Good Are They At Machine Translation In The Legal Domain? Briva-iglesias Vicent, Camargo Joao Lucas Cavalheiro, Dogru Gokhan
- Generation, Distillation And Evaluation Of Motivational Interviewing-style Reflections With A Foundational Language Model Brown Andrew, Zhu Jiading, Abdelwahab Mohamed, Dong Alec, Wang Cindy, Rose Jonathan
- Fine-tuned 'small' Llms (still) Significantly Outperform Zero-shot Generative AI Models In Text Classification Bucher Martin Juan José, Martini Marco
- Pheme: Efficient And Conversational Speech Generation Budzianowski Paweł, Sereda Taras, Cichy Tomasz, Vulić Ivan
- Lissard: Long And Simple Sequential Reasoning Datasets Bueno Mirelle, Lotufo Roberto, Nogueira Rodrigo
- The Two Sides Of The Coin: Hallucination Generation And Detection With Llms As Evaluators For Llms Bui Anh Thu Maria, Brech Saskia Felizitas, Hußfeldt Natalie, Jennert Tobias, Ullrich Melanie, Breuer Timo, Khasmakhi Narjes Nikzad, Schaer Philipp
- Pangea: Procedural Artificial Narrative Using Generative AI For Turn-based Video Games Buongiorno Steph, Klinkert Lawrence Jake, Chawla Tanishq, Zhuang Zixin, Clark Corey
- Roguegpt: Dis-ethical Tuning Transforms Chatgpt4 Into A Rogue AI In 158 Words Buscemi Alessio, Proverbio Daniele
- Investigating Wit, Creativity, And Detectability Of Large Language Models In Domain-specific Writing Style Adaptation Of Reddit's Showerthoughts Buz Tolga, Frost Benjamin, Genchev Nikola, Schneider Moritz, Kaffee Lucie-aimée, De Melo Gerard
- ARES: Alternating Reinforcement Learning And Supervised Fine-tuning For Enhanced Multi-modal Chain-of-thought Reasoning Through Diverse AI Feedback Byun Ju-seung, Chun Jiyun, Kil Jihyung, Perrault Andrew
- On Large Language Models In National Security Applications Caballero William N., Jenkins Phillip R.
- In-context Learning And Fine-tuning GPT For Argument Mining Cabessa Jérémie, Hernault Hugo, Mushtaq Umer
- High-dimension Human Value Representation In Large Language Models Cahyawijaya Samuel, Chen Delong, Bang Yejin, Khalatbari Leila, Wilie Bryan, Ji Ziwei, Ishii Etsuko, Fung Pascale
- Sciassess: Benchmarking LLM Proficiency In Scientific Literature Analysis Cai Hengxing, Cai Xiaochen, Chang Junhan, Li Sihang, Yao Lin, Wang Changxin, Gao Zhifeng, Wang Hongshuai, Li Yongge, Lin Mujie, Yang Shuwen, Wang Jiankun, Xu Mingjun, Huang Jin, Xi Fang, Zhuang Jiaxi, Yin Yuqi, Li Yaqi, Chen Changhong, Cheng Zheng, Zhao Zifeng, Zhang Linfeng, Ke Guolin
- Flextron: Many-in-one Flexible Large Language Model Cai Ruisi, Muralidharan Saurav, Heinrich Greg, Yin Hongxu, Wang Zhangyang, Kautz Jan, Molchanov Pavlo
- Towards Large Language Model Aided Program Refinement Cai Yufan, Hou Zhe, Luan Xiaokun, Baena David Miguel Sanan, Lin Yun, Sun Jun, Dong Jin Song
- Internlm2 Technical Report Cai Zheng, Cao Maosong, Chen Haojiong, Chen Kai, Chen Keyu, Chen Xin, Chen Xun, Chen Zehui, Chen Zhi, Chu Pei, Dong Xiaoyi, Duan Haodong, Fan Qi, Fei Zhaoye, Gao Yang, Ge Jiaye, Gu Chenya, Gu Yuzhe, Gui Tao, Guo Aijia, Guo Qipeng, He Conghui, Hu Yingfan, Huang Ting, Jiang Tao, Jiao Penglong, Jin Zhenjiang, Lei Zhikai, Li Jiaxing, Li Jingwen, Li Linyang, Li Shuaibin, Li Wei, Li Yining, Liu Hongwei, Liu Jiangning, Hong Jiawei, Liu Kaiwen, Liu Kuikun, Liu Xiaoran, Lv Chengqi, Lv Haijun, Lv Kai, Ma Li, Ma Runyuan, Ma Zerun, Ning Wenchang, Ouyang Linke, Qiu Jiantao, Qu Yuan, Shang Fukai, Shao Yunfan, Song Demin, Song Zifan, Sui Zhihao, Sun Peng, Sun Yu, Tang Huanze, Wang Bin, Wang Guoteng, Wang Jiaqi, Wang Jiayu, Wang Rui, Wang Yudong, Wang Ziyi, Wei Xingjian, Weng Qizhen, Wu Fan, Xiong Yingtong, Xu Chao, Xu Ruiliang, Yan Hang, Yan Yirong, Yang Xiaogui, Ye Haochen, Ying Huaiyuan, Yu Jia, Yu Jing, Zang Yuhang, Zhang Chuyu, Zhang Li, Zhang Pan, Zhang Peng, Zhang Ruijie, Zhang Shuo, Zhang Songyang, Zhang Wenjian, Zhang Wenwei, Zhang Xingcheng, Zhang Xinyue, Zhao Hui, Zhao Qian, Zhao Xiaomeng, Zhou Fengzhe, Zhou Zaida, Zhuo Jingming, Zou Yicheng, Qiu Xipeng, Qiao Yu, Lin Dahua
- On The Worst Prompt Performance Of Large Language Models Cao Bowen, Cai Deng, Zhang Zhisong, Zou Yuexian, Lam Wai
- Figuring Out Figures: Using Textual References To Caption Scientific Figures Cao Stanley, Liu Kevin
- Chart-based Reasoning: Transferring Capabilities From Llms To Vlms Carbune Victor, Mansoor Hassan, Liu Fangyu, Aralikatte Rahul, Baechler Gilles, Chen Jindong, Sharma Abhanshu
- A Review Of Multi-modal Large Language And Vision Models Carolan Kilian, Fennelly Laura, Smeaton Alan F.
- Suppressing Pink Elephants With Direct Principle Feedback Castricato Louis, Lile Nathan, Anand Suraj, Schoelkopf Hailey, Verma Siddharth, Biderman Stella
- Dual Modalities Of Text: Visual And Textual Generative Pre-training Chai Yekun, Liu Qingyi, Xiao Jingwu, Wang Shuohuan, Sun Yu, Wu Hua
- Towards Neural Synthesis For Smt-assisted Proof-oriented Programming Chakraborty Saikat, Ebner Gabriel, Bhat Siddharth, Fakhoury Sarah, Fatima Sakina, Lahiri Shuvendu, Swamy Nikhil
- AMGPT: A Large Language Model For Contextual Querying In Additive Manufacturing Chandrasekhar Achuth, Chan Jonathan, Ogoke Francis, Ajenifujah Olabode, Farimani Amir Barati
- Goldfish: Monolingual Language Models For 350 Languages Chang Tyler A., Arnett Catherine, Tu Zhuowen, Bergen Benjamin K.
- VITA: Towards Open-source Interactive Omni Multimodal LLM Chaoyou Fu, Haojia Lin, Zuwei Long, Yunhang Shen, Meng Zhao, Yifan Zhang, Shaoqi Dong, Xiong Wang, Di Yin, Long Ma, Xiawu Zheng, Ran He, Rongrong Ji, Yunsheng Wu, Caifeng Shan, Xing Sun
- Video-mme: The First-ever Comprehensive Evaluation Benchmark Of Multi-modal Llms In Video Analysis Chaoyou Fu, Yuhan Dai, Yongdong Luo, Lei Li, Shuhuai Ren, Renrui Zhang, Zihan Wang, Chenyu Zhou, Yunhang Shen, Mengdan Zhang, Peixian Chen, Yanwei Li, Shaohui Lin, Sirui Zhao, Ke Li, Tong Xu, Xiawu Zheng, Enhong Chen, Rongrong Ji, Xing Sun
- Quantifying The Gain In Weak-to-strong Generalization Charikar Moses, Pabbaraju Chirag, Shiragur Kirankumar
- Évaluation Des Capacités De Réponse De Larges Modèles De Langage (LLM) Pour Des Questions D'historiens Chartier Mathieu, Dakkoune Nabil, Bourgeois Guillaume, Jean Stéphane
- Language Models Can Exploit Cross-task In-context Learning For Data-scarce Novel Tasks Chatterjee Anwoy, Tanwar Eshaan, Dutta Subhabrata, Chakraborty Tanmoy
- Benchmarking Llms For Translating Classical Chinese Poetry: Evaluating Adequacy, Fluency, And Elegance Chen Andong, Lou Lianzhang, Chen Kehai, Bai Xuefeng, Xiang Yang, Yang Muyun, Zhao Tiejun, Zhang Min
- Lmstyle Benchmark: Evaluating Text Style Transfer For Chatbots Chen Jianlin
- Control-dag: Constrained Decoding For Non-autoregressive Directed Acyclic T5 Using Weighted Finite State Automata Chen Jinghong, Lin Weizhe, Mei Jingbiao, Byrne Bill
- Automated Data Curation For Robust Language Model Fine-tuning Chen Jiuhai, Mueller Jonas
- Cost-effective Instruction Learning For Pathology Vision And Language Analysis Chen Kaitao, Liu Mianxin, Yan Fang, Ma Lei, Shi Xiaoming, Wang Lilong, Wang Xiaosong, Zhu Lifeng, Wang Zhe, Zhou Mu, Zhang Shaoting
- Diahalu: A Dialogue-level Hallucination Evaluation Benchmark For Large Language Models Chen Kedi, Chen Qin, Zhou Jie, He Yishen, He Liang
- OMPGPT: A Generative Pre-trained Transformer Model For Openmp Chen Le, Bhattacharjee Arijit, Ahmed Nesreen, Hasabnis Niranjan, Oren Gal, Vo Vy, Jannesari Ali
- Pca-bench: Evaluating Multimodal Large Language Models In Perception-cognition-action Chain Chen Liang, Zhang Yichi, Ren Shuhuai, Zhao Haozhe, Cai Zefan, Wang Yuchi, Wang Peiyi, Meng Xiangdi, Liu Tianyu, Chang Baobao
- Identifying Query-relevant Neurons In Large Language Models For Long-form Texts Chen Lihu, Dejl Adam, Toni Francesca
- Jumpcoder: Go Beyond Autoregressive Coder Via Online Modification Chen Mouxiang, Tian Hao, Liu Zhongxin, Ren Xiaoxue, Sun Jianling
- Graphwiz: An Instruction-following Language Model For Graph Problems Chen Nuo, Li Yuhan, Tang Jianheng, Li Jia
- Octopus: On-device Language Model For Function Calling Of Software Apis Chen Wei, Li Zhiyuan, Ma Mingyuan
- Intelligent Tutor: Leveraging Chatgpt And Microsoft Copilot Studio To Deliver A Generative AI Student Support And Feedback System Within Teams Chen Wei-yu
- Eyegpt: Ophthalmic Assistant With Large Language Models Chen Xiaolan, Zhao Ziwei, Zhang Weiyi, Xu Pusheng, Gao Le, Xu Mingpu, Wu Yue, Li Yinwen, Shi Danli, He Mingguang
- See What Llms Cannot Answer: A Self-challenge Framework For Uncovering LLM Weaknesses Chen Yulong, Liu Yang, Yan Jianhao, Bai Xuefeng, Zhong Ming, Yang Yinghao, Yang Ziyi, Zhu Chenguang, Zhang Yue
- Social Debiasing For Fair Multi-modal Llms Cheng Harry, Guo Yangyang, Guo Qingpei, Yang Ming, Gan Tian, Nie Liqiang
- Information Re-organization Improves Reasoning In Large Language Models Cheng Xiaoxia, Tan Zeqi, Xue Wei, Lu Weiming
- Small Agent Can Also Rock! Empowering Small Language Models As Hallucination Detector Cheng Xiaoxue, Li Junyi, Zhao Wayne Xin, Zhang Hongzhi, Zhang Fuzheng, Zhang Di, Gai Kun, Wen Ji-rong
- A Perspective On Large Language Models, Intelligent Machines, And Knowledge Acquisition Cherkassky Vladimir, Lee Eng Hock
- ANOLE: An Open, Autoregressive, Native Large Multimodal Models For Interleaved Image-text Generation Chern Ethan, Su Jiadi, Ma Yan, Liu Pengfei
- Revisiting Zero-shot Abstractive Summarization In The Era Of Large Language Models From The Perspective Of Position Bias Chhabra Anshuman, Askari Hadi, Mohapatra Prasant
- Lamsum: Creating Extractive Summaries Of User Generated Content Using Llms Chhikara Garima, Sharma Anurag, Gurucharan V., Ghosh Kripabandhu, Chakraborty Abhijnan
- Modeling: A Novel Dataset For Testing Linguistic Reasoning In Language Models Chi Nathan A., Malchev Teodor, Kong Riley, Chi Ryan A., Huang Lucas, Chi Ethan A., Mccoy R. Thomas, Radev Dragomir
- Large Language Model As An Assignment Evaluator: Insights, Feedback, And Challenges In A 1000+ Student Course Chiang Cheng-han, Chen Wei-chih, Kuan Chun-yi, Yang Chienchou, Lee Hung-yi
- Human-centered Llm-agent User Interface: A Position Paper Chin Daniel, Wang Yuxuan, Xia Gus
- A Computational Framework For Behavioral Assessment Of LLM Therapists Chiu Yu Ying, Sharma Ashish, Lin Inna Wanyin, Althoff Tim
- Transformer Explainer: Interactive Learning Of Text-generative Models Cho Aeree, Kim Grace C., Karpekov Alexander, Helbling Alec, Wang Zijie J., Lee Seongmin, Hoover Benjamin, Chau Duen Horng
- Unveiling Imitation Learning: Exploring The Impact Of Data Falsity To Large Language Model Cho Hyunsoo
- Pragmatic Inference Of Scalar Implicature By Llms Cho Ye-eun, Kim Seong Mook
- Optimizing Language Augmentation For Multilingual Large Language Models: A Case Study On Korean Choi Changsu, Jeong Yongbin, Park Seoyoon, Won Inho, Lim Hyeonseok, Kim Sangmin, Kang Yejee, Yoon Chanhyuk, Park Jaewan, Lee Yiseul, Lee Hyejin, Hahm Younggyun, Kim Hansaem, Lim Kyungtae
- Can Only Llms Do Reasoning?: Potential Of Small Language Models In Task Planning Choi Gawon, Ahn Hyemin
- SNAP: Unlearning Selective Knowledge In Large Language Models With Negative Instructions Choi Minseok, Rim Daniel, Lee Dohyun, Choo Jaegul
- Autotutor Meets Large Language Models: A Language Model Tutor With Rich Pedagogy And Guardrails Chowdhury Sankalan Pal, Zouhar Vilém, Sachan Mrinmaya
- Med42-v2: A Suite Of Clinical Llms Christophe Clément, Kanithi Praveen K, Raha Tathagata, Khan Shadab, Pimentel Marco Af
- Professional Agents -- Evolving Large Language Models Into Autonomous Experts With Human-level Competencies Chu Zhixuan, Wang Yan, Zhu Feng, Yu Lu, Li Longfei, Gu Jinjie
- AI Safety In Generative AI Large Language Models: A Survey Chua Jaymari, Li Yun, Yang Shiyi, Wang Chen, Yao Lina
- Developing Generative AI Chatbots Conceptual Framework For Higher Education Chukwuere Joshua Ebere
- Informed AI Regulation: Comparing The Ethical Frameworks Of Leading LLM Chatbots Using An Ethics-based Audit To Assess Moral Reasoning And Normative Values Chun Jon, Elkins Katherine
- Stable Language Model Pre-training By Reducing Embedding Variability Chung Woojin, Hong Jiwoo, An Na Min, Thorne James, Yun Se-young
- PEFT-U: Parameter-efficient Fine-tuning For User Personalization Clarke Christopher, Heng Yuzhao, Tang Lingjia, Mars Jason
- A Chain-of-thought Prompting Approach With Llms For Evaluating Students' Formative Assessment Responses In Science Cohn Clayton, Hutchins Nicole, Le Tuan, Biswas Gautam
- Towards A Human-in-the-loop LLM Approach To Collaborative Discourse Analysis Cohn Clayton, Snyder Caitlin, Montenegro Justin, Biswas Gautam
- Gamebench: Evaluating Strategic Reasoning Abilities Of LLM Agents Costarelli Anthony, Allen Mat, Hauksson Roman, Sodunke Grace, Hariharan Suhas, Cheng Carlson, Li Wenjie, Clymer Joshua, Yadav Arjun
- Llambert: Large-scale Low-cost Data Annotation In NLP Csanády Bálint, Muzsai Lajos, Vedres Péter, Nádasdy Zoltán, Lukács András
- Webapp1k: A Practical Code-generation Benchmark For Web App Development Cui Yi
- Can AI Replace Human Subjects? A Large-scale Replication Of Psychological Experiments With Llms Cui Ziyan, Li Ning, Zhou Huaikang
- Anchored Preference Optimization And Contrastive Revisions: Addressing Underspecification In Alignment D'oosterlinck Karel, Xu Winnie, Develder Chris, Demeester Thomas, Singh Amanpreet, Potts Christopher, Kiela Douwe, Mehri Shikib
- Chatgpt Vs Llama: Impact, Reliability, And Challenges In Stack Overflow Discussions Da Silva Leuson, Samhi Jordan, Khomh Foutse
- Large Legal Fictions: Profiling Legal Hallucinations In Large Language Models Dahl Matthew, Magesh Varun, Suzgun Mirac, Ho Daniel E.
- MHPP: Exploring The Capabilities And Limitations Of Language Models Beyond Basic Code Generation Dai Jianbo, Lu Jianqiao, Feng Yunlong, Ruan Rongju, Cheng Ming, Tan Haochen, Guo Zhijiang
- Inference To The Best Explanation In Large Language Models Dalal Dhairya, Valentino Marco, Freitas André, Buitelaar Paul
- A Complete Survey On Llm-based AI Chatbots Dam Sumit Kumar, Hong Choong Seon, Qiao Yu, Zhang Chaoning
- Impact Of Non-standard Unicode Characters On Security And Comprehension In Large Language Models Daniel Johan S, Pal Anand
- Synthetic Patient-physician Dialogue Generation From Clinical Notes Using LLM Das Trisha, Albassam Dina, Sun Jimeng
- Xmainframe: A Large Language Model For Mainframe Modernization Dau Anh T. V., Dao Hieu Trung, Nguyen Anh Tuan, Tran Hieu Trung, Nguyen Phong X., Bui Nghi D. Q.
- Investigating Symbolic Capabilities Of Large Language Models Dave Neisarg, Kifer Daniel, Giles C. Lee, Mali Ankur
- Prompting Open-source And Commercial Language Models For Grammatical Error Correction Of English Learner Text Davis Christopher, Caines Andrew, Andersen Øistein, Taslimipoor Shiva, Yannakoudakis Helen, Yuan Zheng, Bryant Christopher, Rei Marek, Buttery Paula
- Llms Are Not Intelligent Thinkers: Introducing Mathematical Topic Tree Benchmark For Comprehensive Evaluation Of Llms Davoodi Arash Gholami, Davoudi Seyed Pouyan Mousavi, Pezeshkpour Pouya
- Aligning Model Evaluations With Human Preferences: Mitigating Token Count Bias In Language Model Assessments Daynauth Roland, Mars Jason
- Application Of GPT Language Models For Innovation In Activities In University Teaching De Buenaga Manuel, Bueno Francisco Javier
- Show, Don't Tell: Evaluating Large Language Models Beyond Textual Understanding With Childplay De Carvalho Gonçalo Hora, Knap Oscar, Pollice Robert
- Retail-gpt: Leveraging Retrieval Augmented Generation (RAG) For Building E-commerce Chat Assistants De Freitas Bruno Amaral Teixeira, Lotufo Roberto De Alencar
- Will GPT-4 Run DOOM? De Wynter Adrian
- Understanding Biases In Chatgpt-based Recommender Systems: Provider Fairness, Temporal Stability, And Recency Deldjoo Yashar
- MIMIR: A Streamlined Platform For Personalized Agent Tuning In Domain Expertise Deng Chunyuan, Tang Xiangru, Zhao Yilun, Wang Hanming, Wang Haoran, Zhou Wangchunshu, Cohan Arman, Gerstein Mark
- Pandora: Jailbreak Gpts By Retrieval Augmented Generation Poisoning Deng Gelei, Liu Yi, Wang Kailong, Li Yuekang, Zhang Tianwei, Liu Yang
- Language In Vivo Vs. In Silico: Size Matters But Larger Language Models Still Do Not Comprehend Language On A Par With Humans Dentella Vittoria, Guenther Fritz, Leivada Evelina
- Promises, Outlooks And Challenges Of Diffusion Language Modeling Deschenaux Justin, Gulcehre Caglar
- Are You Sure? Rank Them Again: Repeated Ranking For Better Preference Datasets Devine Peter
- Enhancing Recommendation Diversity By Re-ranking With Large Language Models Diego Carraro, Derek Bridge
- Learning How To Ask: Cycle-consistency Refines Prompts In Multimodal Foundation Models Diesendruck Maurice, Lin Jianzhe, Imani Shima, Mahalingam Gayathri, Xu Mingyang, Zhao Jie
- MANGO: A Benchmark For Evaluating Mapping And Navigation Abilities Of Large Language Models Ding Peng, Fang Jiading, Li Peng, Wang Kangrui, Zhou Xiaochen, Yu Mo, Li Jing, Walter Matthew R., Mei Hongyuan
- Hallu-pi: Evaluating Hallucination In Multi-modal Large Language Models Within Perturbed Inputs Ding Peng, Wu Jingyu, Kuang Jun, Ma Dan, Cao Xuezhi, Cai Xunliang, Chen Shi, Chen Jiajun, Huang Shujian
- Semcoder: Training Code Language Models With Comprehensive Semantics Ding Yangruibo, Peng Jinjun, Min Marcus J., Kaiser Gail, Yang Junfeng, Ray Baishakhi
- Entgpt: Linking Generative Large Language Models With Knowledge Bases Ding Yifan, Poudel Amrit, Zeng Qingkai, Weninger Tim, Veeramani Balaji, Bhattacharya Sanmitra
- Evaluating And Mitigating Linguistic Discrimination In Large Language Models Dong Guoliang, Wang Haoyu, Sun Jun, Wang Xinyu
- Internlm-xcomposer2-4khd: A Pioneering Large Vision-language Model Handling Resolutions From 336 Pixels To 4K HD Dong Xiaoyi, Zhang Pan, Zang Yuhang, Cao Yuhang, Wang Bin, Ouyang Linke, Zhang Songyang, Duan Haodong, Zhang Wenwei, Li Yining, Yan Hang, Gao Yang, Chen Zhe, Zhang Xinyue, Li Wei, Li Jingwen, Wang Wenhai, Chen Kai, He Conghui, Zhang Xingcheng, Dai Jifeng, Qiao Yu, Lin Dahua, Wang Jiaqi
- How Reliable Are Automatic Evaluation Methods For Instruction-tuned Llms? Doostmohammadi Ehsan, Holmström Oskar, Kuhlmann Marco
- To What Extent Is Chatgpt Useful For Language Teacher Lesson Plan Creation? Dornburg Alex, Davin Kristin
- Enhancing Robustness Of Llm-synthetic Text Detectors For Academic Writing: A Comprehensive Analysis Dou Zhicheng, Guo Yuchen, Chang Ching-chun, Nguyen Huy H., Echizen Isao
- Assessing The Reasoning Abilities Of Chatgpt In The Context Of Claim Verification Dougrez-lewis John, Akhter Mahmud Elahi, He Yulan, Liakata Maria
- Evaluating Language Model Context Windows: A "working Memory" Test And Inference-time Correction Dsouza Amanda, Glaze Christopher, Shin Changho, Sala Frederic
- Perltqa: A Personal Long-term Memory Dataset For Memory Classification, Retrieval, And Synthesis In Question Answering Du Yiming, Wang Hongru, Zhao Zhengyi, Liang Bin, Wang Baojun, Zhong Wanjun, Wang Zezhong, Wong Kam-fai
- Macbehaviour: An R Package For Behavioural Experimentation On Large Language Models Duan Xufeng, Li Shixuan, Cai Zhenguang G.
- Applying RLAIF For Code Generation With Api-usage In Lightweight Llms Dutta Sujan, Mahinder Sayantan, Anantha Raviteja, Bandyopadhyay Bortik
- Dialogue You Can Trust: Human And AI Perspectives On Generated Conversations Ebubechukwu Ike, Takeuchi Johane, Ceravola Antonello, Joublin Frank
- Reducing Selection Bias In Large Language Models Eicher J. E., Irgolič R. F.
- Arabic Automatic Story Generation With Large Language Models El-shangiti Ahmed Oumar, Alwajih Fakhraddin, Abdul-mageed Muhammad
- Creating Arabic LLM Prompts At Scale El-sheikh Abdelrahman, Elmogtaba Ahmed, Darwish Kareem, Elmallah Muhammad, Elneima Ashraf, Sawaf Hassan
- Persuasiveness Of Generated Free-text Rationales In Subjective Decisions: A Case Study On Pairwise Argument Ranking Elaraby Mohamed, Litman Diane, Li Xiang Lorraine, Magooda Ahmed
- Ocassionally Secure: A Comparative Analysis Of Code Generation Assistants Elgedawy Ran, Sadik John, Dutta Senjuti, Gautam Anuj, Georgiou Konstantinos, Gholamrezae Farzin, Ji Fujiao, Lim Kyungchan, Liu Qian, Ruoti Scott
- Tinyagent: Function Calling At The Edge Erdogan Lutfi Eren, Lee Nicholas, Jha Siddharth, Kim Sehoon, Tabrizi Ryan, Moon Suhong, Hooper Coleman, Anumanchipalli Gopala, Keutzer Kurt, Gholami Amir
- Low-cost Language Models: Survey And Performance Evaluation On Python Code Generation Espejel Jessica López, Alassan Mahaman Sanoussi Yahaya, Bouhandi Merieme, Dahhane Walid, Ettifouri El Hassane
- Latxa: An Open Language Model And Evaluation Suite For Basque Etxaniz Julen, Sainz Oscar, Perez Naiara, Aldabe Itziar, Rigau German, Agirre Eneko, Ormazabal Aitor, Artetxe Mikel, Soroa Aitor
- Large Language Models As Evaluators For Scientific Synthesis Evans Julia, D'souza Jennifer, Auer Sören
- Decoding AI: The Inside Story Of Data Analysis In Chatgpt Evkaya Ozan, De Carvalho Miguel
- Prompt Optimizer Of Text-to-image Diffusion Models For Abstract Concept Understanding Fan Zezhong, Li Xiaohan, Fang Chenhao, Biswas Topojoy, Nag Kaushiki, Xu Jianpeng, Achan Kannan
- Biasalert: A Plug-and-play Tool For Social Bias Detection In Llms Fan Zhiting, Chen Ruizhe, Xu Ruiling, Liu Zuozhu
- DARA: Decomposition-alignment-reasoning Autonomous Language Agent For Question Answering Over Knowledge Graphs Fang Haishuo, Zhu Xiaodan, Gurevych Iryna
- TRACE The Evidence: Constructing Knowledge-grounded Reasoning Chains For Retrieval-augmented Generation Fang Jinyuan, Meng Zaiqiao, Macdonald Craig
- Llama-omni: Seamless Speech Interaction With Large Language Models Fang Qingkai, Guo Shoutao, Zhou Yan, Ma Zhengrui, Zhang Shaolei, Feng Yang
- PATCH! Psychometrics-assisted Benchmarking Of Large Language Models: A Case Study Of Proficiency In 8th Grade Mathematics Fang Qixiang, Oberski Daniel L., Nguyen Dong
- VILA²: VILA Augmented VILA Fang Yunhao, Zhu Ligeng, Lu Yao, Wang Yan, Molchanov Pavlo, Cho Jang Hyun, Pavone Marco, Han Song, Yin Hongxu
- Unraveling The Dominance Of Large Language Models Over Transformer Models For Bangla Natural Language Inference: A Comprehensive Study Faria Fatema Tuj Johora, Moin Mukaffi Bin, Fahim Asif Iftekher, Debnath Pronay, Shah Faisal Muhammad
- Multi-modal Hallucination Control By Visual Information Grounding Favero Alessandro, Zancato Luca, Trager Matthew, Choudhary Siddharth, Perera Pramuditha, Achille Alessandro, Swaminathan Ashwin, Soatto Stefano
- Video-ccam: Enhancing Video-language Understanding With Causal Cross-attention Masks For Short And Long Videos Fei Jiajun, Li Dian, Deng Zhidong, Wang Zekun, Liu Gang, Wang Hui
- Internlm-law: An Open Source Chinese Legal Large Language Model Fei Zhiwei, Zhang Songyang, Shen Xiaoyu, Zhu Dawei, Wang Xiao, Cao Maosong, Zhou Fengzhe, Li Yining, Zhang Wenwei, Lin Dahua, Chen Kai, Ge Jidong
- Ragged Edges: The Double-edged Sword Of Retrieval-augmented Chatbots Feldman Philip, Foulds James R., Pan Shimei
- IGOT: Information Gain Optimized Tokenizer On Domain Adaptive Pretraining Feng Dawei, Zhang Yihai, Xu Zhixuan
- Citygpt: Empowering Urban Spatial Cognition Of Large Language Models Feng Jie, Du Yuwei, Liu Tianhui, Guo Siqi, Lin Yuming, Li Yong
- AGILE: A Novel Framework Of LLM Agents Feng Peiyuan, He Yichen, Huang Guanhua, Lin Yuan, Zhang Hanchong, Zhang Yuchen, Li Hang
- An Extremely Data-efficient And Generative Llm-based Reinforcement Learning Agent For Recommenders Feng Shuang, Feng Grace
- Courseassist: Pedagogically Appropriate AI Tutor For Computer Science Education Feng Ty, Liu Sa, Ghosal Dipak
- Optimus: Accelerating Large-scale Multi-modal LLM Training By Bubble Exploitation Feng Weiqi, Chen Yangrui, Wang Shaoyu, Peng Yanghua, Lin Haibin, Yu Minlan
- Mt-ladder: A Model-agnostic Framework Boosting Llm-based Machine Translation To The Next Level Feng Zhaopeng, Zhang Yan, Chen Ruizhe, Meng Zijie, Liu Zuozhu
- Syllabusqa: A Course Logistics Question Answering Dataset Fernandez Nigel, Scarlatos Alexander, Lan Andrew
- Generative AI And Large Language Models For Cyber Security: All Insights You Need Ferrag Mohamed Amine, Alwahedi Fatima, Battah Ammar, Cherif Bilel, Mechri Abdechakour, Tihanyi Norbert
- Exaranker-open: Synthetic Explanation For IR Using Open-source Llms Ferraretto Fernando, Laitz Thiago, Lotufo Roberto, Nogueira Rodrigo
- Toward Large Language Models As A Therapeutic Tool: Comparing Prompting Techniques To Improve Gpt-delivered Problem-solving Therapy Filienko Daniil, Wang Yinzhou, Jazmi Caroline El, Xie Serena, Cohen Trevor, De Cock Martine, Yuwen Weichao
- The Chronicles Of RAG: The Retriever, The Chunk And The Generator Finardi Paulo, Avila Leonardo, Castaldoni Rodrigo, Gengo Pedro, Larcher Celio, Piau Marcos, Costa Pablo, Caridá Vinicius
- Convosense: Overcoming Monotonous Commonsense Inferences For Conversational AI Finch Sarah E., Choi Jinho D.
- Geckopt: LLM System Efficiency Via Intent-based Tool Selection Fore Michael, Singh Simranjit, Stamoulis Dimitrios
- LLM Agents In Interaction: Measuring Personality Consistency And Linguistic Alignment In Interacting Populations Of Large Language Models Frisch Ivar, Giulianelli Mario
- Hint-before-solving Prompting: Guiding Llms To Effectively Utilize Encoded Knowledge Fu Jinlan, Huangfu Shenzhen, Yan Hang, Ng See-kiong, Qiu Xipeng
- A Role-specific Guided Large Language Model For Ophthalmic Consultation Based On Stylistic Differentiation Fu Laiyi, Fan Binbin, Du Hongkai, Feng Yanxiang, Li Chunhua, Song Huping
- BLINK: Multimodal Large Language Models Can See But Not Perceive Fu Xingyu, Hu Yushi, Li Bangzheng, Feng Yu, Wang Haoyu, Lin Xudong, Roth Dan, Smith Noah A., Ma Wei-chiu, Krishna Ranjay
- Tiny Titans: Can Smaller Large Language Models Punch Above Their Weight In The Real World For Meeting Summarization? Fu Xue-yong, Laskar Md Tahmid Rahman, Khasanova Elena, Chen Cheng, Tn Shashi Bhushan
- Data Engineering For Scaling Language Models To 128K Context Fu Yao, Panda Rameswar, Niu Xinyao, Yue Xiang, Hajishirzi Hannaneh, Kim Yoon, Peng Hao
- UI-JEPA: Towards Active Perception Of User Intent Through Onscreen User Activity Fu Yicheng, Anantha Raviteja, Vashisht Prabal, Cheng Jianpeng, Littwin Etai
- Using Large Language Models For The Interpretation Of Building Regulations Fuchs Stefan, Witbrock Michael, Dimyadi Johannes, Amor Robert
- Can AI Relate: Testing Large Language Model Response For Mental Health Support Gabriel Saadia, Puri Isha, Xu Xuhai, Malgaroli Matteo, Ghassemi Marzyeh
- Open Generative Large Language Models For Galician Gamallo Pablo, Rodríguez Pablo, De-dios-flores Iria, Sotelo Susana, Paniagua Silvia, Bardanca Daniel, Pichel José Ramom, Garcia Marcos
- Application Of LLM Agents In Recruitment: A Novel Framework For Resume Screening Gan Chengguang, Zhang Qinghao, Mori Tatsunori
- Aligning LLM Agents By Learning Latent Preference From User Edits Gao Ge, Taymanov Alexey, Salinas Eduardo, Mineiro Paul, Misra Dipendra
- Self-evolving GPT: A Lifelong Autonomous Experiential Learner Gao Jinglong, Ding Xiao, Cui Yiming, Zhao Jianbai, Wang Hepeng, Liu Ting, Qin Bing
- Improving Language Understanding From Screenshots Gao Tianyu, Wang Zirui, Bhaskar Adithya, Chen Danqi
- Longins: A Challenging Long-context Instruction-based Exam For Llms Gavin Shawn, Zheng Tuney, Liu Jiaheng, Que Quehry, Wang Noah, Yang Jian, Zhang Chenchen, Huang Wenhao, Chen Wenhu, Zhang Ge
- Graph Descriptive Order Improves Reasoning With Large Language Model Ge Yuyao, Liu Shenghua, Feng Wenjie, Mei Lingrui, Chen Lizhe, Cheng Xueqi
- Worldgpt: Empowering LLM As Multimodal World Model Ge Zhiqi, Huang Hongzhe, Zhou Mingze, Li Juncheng, Wang Guoming, Tang Siliang, Zhuang Yueting
- Edinburgh Clinical NLP At MEDIQA-CORR 2024: Guiding Large Language Models With Hints Gema Aryo Pradipta, Lee Chaeeun, Minervini Pasquale, Daines Luke, Simpson T. Ian, Alex Beatrice
- Gemini 1.5: Unlocking Multimodal Understanding Across Millions Of Tokens Of Context Gemini Team, Georgiev Petko, Lei Ving Ian, Burnell Ryan, Bai Libin, Gulati Anmol, Tanzer Garrett, Vincent Damien, Pan Zhufeng, Wang Shibo, Mariooryad Soroosh, Ding Yifan, Geng Xinyang, Alcober Fred, Frostig Roy, Omernick Mark, Walker Lexi, Paduraru Cosmin, Sorokin Christina, Tacchetti Andrea, Gaffney Colin, Daruki Samira, Sercinoglu Olcan, Gleicher Zach, Love Juliette, Voigtlaender Paul, Jain Rohan, Surita Gabriela, Mohamed Kareem, Blevins Rory, Ahn Junwhan, Zhu Tao, Kawintiranon Kornraphop, Firat Orhan, Gu Yiming, Zhang Yujing, Rahtz Matthew, Faruqui Manaal, Clay Natalie, Gilmer Justin, Co-reyes Jd, Penchev Ivo, Zhu Rui, Morioka Nobuyuki, Hui Kevin, Haridasan Krishna, Campos Victor, Mahdieh Mahdis, Guo Mandy, Hassan Samer, Kilgour Kevin, Vezer Arpi, Cheng Heng-tze, De Liedekerke Raoul, Goyal Siddharth, Barham Paul, Strouse Dj, Noury Seb, Adler Jonas, Sundararajan Mukund, Vikram Sharad, Lepikhin Dmitry, Paganini Michela, Garcia Xavier, Yang Fan, Valter Dasha, Trebacz Maja, Vodrahalli Kiran, Asawaroengchai Chulayuth, Ring Roman, Kalb Norbert, Soares Livio Baldini, Brahma Siddhartha, Steiner David, Yu Tianhe, Mentzer Fabian, He Antoine, Gonzalez Lucas, Xu Bibo, Kaufman Raphael Lopez, Shafey Laurent El, Oh Junhyuk, Hennigan Tom, Driessche George Van Den, Odoom Seth, Lucic Mario, Roelofs Becca, Lall Sid, Marathe Amit, Chan Betty, Ontanon Santiago, He Luheng, Teplyashin Denis, Lai Jonathan, Crone Phil, Damoc Bogdan, Ho Lewis, Riedel Sebastian, Lenc Karel, Yeh Chih-kuan, Chowdhery Aakanksha, Xu Yang, Kazemi Mehran, Amid Ehsan, Petrushkina Anastasia, Swersky Kevin, Khodaei Ali, Chen Gowoon, Larkin Chris, Pinto Mario, Yan Geng, Badia Adria Puigdomenech, Patil Piyush, Hansen Steven, Orr Dave, Arnold Sebastien M. 
R., Grimstad Jordan, Dai Andrew, Douglas Sholto, Sinha Rishika, Yadav Vikas, Chen Xi, Gribovskaya Elena, Austin Jacob, Zhao Jeffrey, Patel Kaushal, Komarek Paul, Austin Sophia, Borgeaud Sebastian, Friso Linda, Goyal Abhimanyu, Caine Ben, Cao Kris, Chung Da-woon, Lamm Matthew, Barth-maron Gabe, Kagohara Thais, Olszewska Kate, Chen Mia, Shivakumar Kaushik, Agarwal Rishabh, Godhia Harshal, Rajwar Ravi, Snaider Javier, Dotiwalla Xerxes, Liu Yuan, Barua Aditya, Ungureanu Victor, Zhang Yuan, Batsaikhan Bat-orgil, Wirth Mateo, Qin James, Danihelka Ivo, Doshi Tulsee, Chadwick Martin, Chen Jilin, Jain Sanil, Le Quoc, Kar Arjun, Gurumurthy Madhu, Li Cheng, Sang Ruoxin, Liu Fangyu, Lamprou Lampros, Munoz Rich, Lintz Nathan, Mehta Harsh, Howard Heidi, Reynolds Malcolm, Aroyo Lora, Wang Quan, Blanco Lorenzo, Cassirer Albin, Griffith Jordan, Das Dipanjan, Lee Stephan, Sygnowski Jakub, Fisher Zach, Besley James, Powell Richard, Ahmed Zafarali, Paulus Dominik, Reitter David, Borsos Zalan, Joshi Rishabh, Pope Aedan, Hand Steven, Selo Vittorio, Jain Vihan, Sethi Nikhil, Goel Megha, Makino Takaki, May Rhys, Yang Zhen, Schalkwyk Johan, Butterfield Christina, Hauth Anja, Goldin Alex, Hawkins Will, Senter Evan, Brin Sergey, Woodman Oliver, Ritter Marvin, Noland Eric, Giang Minh, Bolina Vijay, Lee Lisa, Blyth Tim, Mackinnon Ian, Reid Machel, Sarvana Obaid, Silver David, Chen Alexander, Wang Lily, Maggiore Loren, Chang Oscar, Attaluri Nithya, Thornton Gregory, Chiu Chung-cheng, Bunyan Oskar, Levine Nir, Chung Timothy, Eltyshev Evgenii, Si Xiance, Lillicrap Timothy, Brady Demetra, Aggarwal Vaibhav, Wu Boxi, Xu Yuanzhong, Mcilroy Ross, Badola Kartikeya, Sandhu Paramjit, Moreira Erica, Stokowiec Wojciech, Hemsley Ross, Li Dong, Tudor Alex, Shyam Pranav, Rahimtoroghi Elahe, Haykal Salem, Sprechmann Pablo, Zhou Xiang, Mincu Diana, Li Yujia, Addanki Ravi, Krishna Kalpesh, Wu Xiao, Frechette Alexandre, Eyal Matan, Dafoe Allan, Lacey Dave, Whang Jay, Avrahami Thi, Zhang Ye, Taropa Emanuel, Lin 
Hanzhao, Toyama Daniel, Rutherford Eliza, Sano Motoki, Choe Hyunjeong, Tomala Alex, Safranek-shrader Chalence, Kassner Nora, Pajarskas Mantas, Harvey Matt, Sechrist Sean, Fortunato Meire, Lyu Christina, Elsayed Gamaleldin, Kuang Chenkai, Lottes James, Chu Eric, Jia Chao, Chen Chih-wei, Humphreys Peter, Baumli Kate, Tao Connie, Samuel Rajkumar, Santos Cicero Nogueira Dos, Andreassen Anders, Rakićević Nemanja, Grewe Dominik, Kumar Aviral, Winkler Stephanie, Caton Jonathan, Brock Andrew, Dalmia Sid, Sheahan Hannah, Barr Iain, Miao Yingjie, Natsev Paul, Devlin Jacob, Behbahani Feryal, Prost Flavien, Sun Yanhua, Myaskovsky Artiom, Pillai Thanumalayan Sankaranarayana, Hurt Dan, Lazaridou Angeliki, Xiong Xi, Zheng Ce, Pardo Fabio, Li Xiaowei, Horgan Dan, Stanton Joe, Ambar Moran, Xia Fei, Lince Alejandro, Wang Mingqiu, Mustafa Basil, Webson Albert, Lee Hyo, Anil Rohan, Wicke Martin, Dozat Timothy, Sinha Abhishek, Piqueras Enrique, Dabir Elahe, Upadhyay Shyam, Boral Anudhyan, Hendricks Lisa Anne, Fry Corey, Djolonga Josip, Su Yi, Walker Jake, Labanowski Jane, Huang Ronny, Misra Vedant, Chen Jeremy, Skerry-ryan Rj, Singh Avi, Rijhwani Shruti, Yu Dian, Castro-ros Alex, Changpinyo Beer, Datta Romina, Bagri Sumit, Hrafnkelsson Arnar Mar, Maggioni Marcello, Zheng Daniel, Sulsky Yury, Hou Shaobo, Paine Tom Le, Yang Antoine, Riesa Jason, Rogozinska Dominika, Marcus Dror, Badawy Dalia El, Zhang Qiao, Wang Luyu, Miller Helen, Greer Jeremy, Sjos Lars Lowe, Nova Azade, Zen Heiga, Chaabouni Rahma, Rosca Mihaela, Jiang Jiepu, Chen Charlie, Liu Ruibo, Sainath Tara, Krikun Maxim, Polozov Alex, Lespiau Jean-baptiste, Newlan Josh, Cankara Zeyncep, Kwak Soo, Xu Yunhan, Chen Phil, Coenen Andy, Meyer Clemens, Tsihlas Katerina, Ma Ada, Gottweis Juraj, Xing Jinwei, Gu Chenjie, Miao Jin, Frank Christian, Cankara Zeynep, Ganapathy Sanjay, Dasgupta Ishita, Hughes-fitt Steph, Chen Heng, Reid David, Rong Keran, Fan Hongmin, Van Amersfoort Joost, Zhuang Vincent, Cohen Aaron, Gu Shixiang Shane, 
Mohananey Anhad, Ilic Anastasija, Tobin Taylor, Wieting John, Bortsova Anna, Thacker Phoebe, Wang Emma, Caveness Emily, Chiu Justin, Sezener Eren, Kaskasoli Alex, Baker Steven, Millican Katie, Elhawaty Mohamed, Aisopos Kostas, Lebsack Carl, Byrd Nathan, Dai Hanjun, Jia Wenhao, Wiethoff Matthew, Davoodi Elnaz, Weston Albert, Yagati Lakshman, Ahuja Arun, Gao Isabel, Pundak Golan, Zhang Susan, Azzam Michael, Sim Khe Chai, Caelles Sergi, Keeling James, Sharma Abhanshu, Swing Andy, Li Yaguang, Liu Chenxi, Bostock Carrie Grimes, Bansal Yamini, Nado Zachary, Anand Ankesh, Lipschultz Josh, Karmarkar Abhijit, Proleev Lev, Ittycheriah Abe, Yeganeh Soheil Hassas, Polovets George, Faust Aleksandra, Sun Jiao, Rrustemi Alban, Li Pen, Shivanna Rakesh, Liu Jeremiah, Welty Chris, Lebron Federico, Baddepudi Anirudh, Krause Sebastian, Parisotto Emilio, Soricut Radu, Xu Zheng, Bloxwich Dawn, Johnson Melvin, Neyshabur Behnam, Mao-jones Justin, Wang Renshen, Ramasesh Vinay, Abbas Zaheer, Guez Arthur, Segal Constant, Nguyen Duc Dung, Svensson James, Hou Le, York Sarah, Milan Kieran, Bridgers Sophie, Gworek Wiktor, Tagliasacchi Marco, Lee-thorp James, Chang Michael, Guseynov Alexey, Hartman Ale Jakse, Kwong Michael, Zhao Ruizhe, Kashem Sheleem, Cole Elizabeth, Miech Antoine, Tanburn Richard, Phuong Mary, Pavetic Filip, Cevey Sebastien, Comanescu Ramona, Ives Richard, Yang Sherry, Du Cosmo, Li Bo, Zhang Zizhao, Iinuma Mariko, Hu Clara Huiyi, Roy Aurko, Bijwadia Shaan, Zhu Zhenkai, Martins Danilo, Saputro Rachel, Gergely Anita, Zheng Steven, Jia Dawei, Antonoglou Ioannis, Sadovsky Adam, Gu Shane, Bi Yingying, Andreev Alek, Samangooei Sina, Khan Mina, Kocisky Tomas, Filos Angelos, Kumar Chintu, Bishop Colton, Yu Adams, Hodkinson Sarah, Mittal Sid, Shah Premal, Moufarek Alexandre, Cheng Yong, Bloniarz Adam, Lee Jaehoon, Pejman Pedram, Michel Paul, Spencer Stephen, Feinberg Vladimir, Xiong Xuehan, Savinov Nikolay, Smith Charlotte, Shakeri Siamak, Tran Dustin, Chesus Mary, Bohnet Bernd, Tucker 
George, Von Glehn Tamara, Muir Carrie, Mao Yiran, Kazawa Hideto, Slone Ambrose, Soparkar Kedar, Shrivastava Disha, Cobon-kerr James, Sharman Michael, Pavagadhi Jay, Araya Carlos, Misiunas Karolis, Ghelani Nimesh, Laskin Michael, Barker David, Li Qiujia, Briukhov Anton, Houlsby Neil, Glaese Mia, Lakshminarayanan Balaji, Schucher Nathan, Tang Yunhao, Collins Eli, Lim Hyeontaek, Feng Fangxiaoyu, Recasens Adria, Lai Guangda, Magni Alberto, De Cao Nicola, Siddhant Aditya, Ashwood Zoe, Orbay Jordi, Dehghani Mostafa, Brennan Jenny, He Yifan, Xu Kelvin, Gao Yang, Saroufim Carl, Molloy James, Wu Xinyi, Arnold Seb, Chang Solomon, Schrittwieser Julian, Buchatskaya Elena, Radpour Soroush, Polacek Martin, Giordano Skye, Bapna Ankur, Tokumine Simon, Hellendoorn Vincent, Sottiaux Thibault, Cogan Sarah, Severyn Aliaksei, Saleh Mohammad, Thakoor Shantanu, Shefey Laurent, Qiao Siyuan, Gaba Meenu, Chang Shuo-yiin, Swanson Craig, Zhang Biao, Lee Benjamin, Rubenstein Paul Kishan, Song Gan, Kwiatkowski Tom, Koop Anna, Kannan Ajay, Kao David, Schuh Parker, Stjerngren Axel, Ghiasi Golnaz, Gibson Gena, Vilnis Luke, Yuan Ye, Ferreira Felipe Tiengo, Kamath Aishwarya, Klimenko Ted, Franko Ken, Xiao Kefan, Bhattacharya Indro, Patel Miteyan, Wang Rui, Morris Alex, Strudel Robin, Sharma Vivek, Choy Peter, Hashemi Sayed Hadi, Landon Jessica, Finkelstein Mara, Jhakra Priya, Frye Justin, Barnes Megan, Mauger Matthew, Daun Dennis, Baatarsukh Khuslen, Tung Matthew, Farhan Wael, Michalewski Henryk, Viola Fabio, Quitry Felix De Chaumont, Lan Charline Le, Hudson Tom, Wang Qingze, Fischer Felix, Zheng Ivy, White Elspeth, Dragan Anca, Alayrac Jean-baptiste, Ni Eric, Pritzel Alexander, Iwanicki Adam, Isard Michael, Bulanova Anna, Zilka Lukas, Dyer Ethan, Sachan Devendra, Srinivasan Srivatsan, Muckenhirn Hannah, Cai Honglong, Mandhane Amol, Tariq Mukarram, Rae Jack W., Wang Gary, Ayoub Kareem, Fitzgerald Nicholas, Zhao Yao, Han Woohyun, Alberti Chris, Garrette Dan, Krishnakumar Kashyap, Gimenez Mai, 
Levskaya Anselm, Sohn Daniel, Matak Josip, Iturrate Inaki, Chang Michael B., Xiang Jackie, Cao Yuan, Ranka Nishant, Brown Geoff, Hutter Adrian, Mirrokni Vahab, Chen Nanxin, Yao Kaisheng, Egyed Zoltan, Galilee Francois, Liechty Tyler, Kallakuri Praveen, Palmer Evan, Ghemawat Sanjay, Liu Jasmine, Tao David, Thornton Chloe, Green Tim, Jasarevic Mimi, Lin Sharon, Cotruta Victor, Tan Yi-xuan, Fiedel Noah, Yu Hongkun, Chi Ed, Neitz Alexander, Heitkaemper Jens, Sinha Anu, Zhou Denny, Sun Yi, Kaed Charbel, Hulse Brice, Mishra Swaroop, Georgaki Maria, Kudugunta Sneha, Farabet Clement, Shafran Izhak, Vlasic Daniel, Tsitsulin Anton, Ananthanarayanan Rajagopal, Carin Alen, Su Guolong, Sun Pei, V Shashank, Carvajal Gabriel, Broder Josef, Comsa Iulia, Repina Alena, Wong William, Chen Warren Weilun, Hawkins Peter, Filonov Egor, Loher Lucia, Hirnschall Christoph, Wang Weiyi, Ye Jingchen, Burns Andrea, Cate Hardie, Wright Diana Gage, Piccinini Federico, Zhang Lei, Lin Chu-cheng, Gog Ionel, Kulizhskaya Yana, Sreevatsa Ashwin, Song Shuang, Cobo Luis C., Iyer Anand, Tekur Chetan, Garrido Guillermo, Xiao Zhuyun, Kemp Rupert, Zheng Huaixiu Steven, Li Hui, Agarwal Ananth, Ngani Christel, Goshvadi Kati, Santamaria-fernandez Rebeca, Fica Wojciech, Chen Xinyun, Gorgolewski Chris, Sun Sean, Garg Roopal, Ye Xinyu, Eslami S. M. 
Ali, Hua Nan, Simon Jon, Joshi Pratik, Kim Yelin, Tenney Ian, Potluri Sahitya, Thiet Lam Nguyen, Yuan Quan, Luisier Florian, Chronopoulou Alexandra, Scellato Salvatore, Srinivasan Praveen, Chen Minmin, Koverkathu Vinod, Dalibard Valentin, Xu Yaming, Saeta Brennan, Anderson Keith, Sellam Thibault, Fernando Nick, Huot Fantine, Jung Junehyuk, Varadarajan Mani, Quinn Michael, Raul Amit, Le Maigo, Habalov Ruslan, Clark Jon, Jalan Komal, Bullard Kalesha, Singhal Achintya, Luong Thang, Wang Boyu, Rajayogam Sujeevan, Eisenschlos Julian, Jia Johnson, Finchelstein Daniel, Yakubovich Alex, Balle Daniel, Fink Michael, Agarwal Sameer, Li Jing, Dvijotham Dj, Pal Shalini, Kang Kai, Konzelmann Jaclyn, Beattie Jennifer, Dousse Olivier, Wu Diane, Crocker Remi, Elkind Chen, Jonnalagadda Siddhartha Reddy, Lee Jong, Holtmann-rice Dan, Kallarackal Krystal, Liu Rosanne, Vnukov Denis, Vats Neera, Invernizzi Luca, Jafari Mohsen, Zhou Huanjie, Taylor Lilly, Prendki Jennifer, Wu Marcus, Eccles Tom, Liu Tianqi, Kopparapu Kavya, Beaufays Francoise, Angermueller Christof, Marzoca Andreea, Sarcar Shourya, Dib Hilal, Stanway Jeff, Perbet Frank, Trdin Nejc, Sterneck Rachel, Khorlin Andrey, Li Dinghua, Wu Xihui, Goenka Sonam, Madras David, Goldshtein Sasha, Gierke Willi, Zhou Tong, Liu Yaxin, Liang Yannie, White Anais, Li Yunjie, Singh Shreya, Bahargam Sanaz, Epstein Mark, Basu Sujoy, Lao Li, Ozturel Adnan, Crous Carl, Zhai Alex, Lu Han, Tung Zora, Gaur Neeraj, Walton Alanna, Dixon Lucas, Zhang Ming, Globerson Amir, Uy Grant, Bolt Andrew, Wiles Olivia, Nasr Milad, Shumailov Ilia, Selvi Marco, Piccinno Francesco, Aguilar Ricardo, Mccarthy Sara, Khalman Misha, Shukla Mrinal, Galic Vlado, Carpenter John, Villela Kevin, Zhang Haibin, Richardson Harry, Martens James, Bosnjak Matko, Belle Shreyas Rammohan, Seibert Jeff, Alnahlawi Mahmoud, Mcwilliams Brian, Singh Sankalp, Louis Annie, Ding Wen, Popovici Dan, Simicich Lenin, Knight Laura, Mehta Pulkit, Gupta Nishesh, Shi Chongyang, Fatehi Saaber, Mitrovic 
Jovana, Grills Alex, Pagadora Joseph, Petrova Dessie, Eisenbud Danielle, Zhang Zhishuai, Yates Damion, Mittal Bhavishya, Tripuraneni Nilesh, Assael Yannis, Brovelli Thomas, Jain Prateek, Velimirovic Mihajlo, Akbulut Canfer, Mu Jiaqi, Macherey Wolfgang, Kumar Ravin, Xu Jun, Qureshi Haroon, Comanici Gheorghe, Wiesner Jeremy, Gong Zhitao, Ruddock Anton, Bauer Matthias, Felt Nick, Gp Anirudh, Arnab Anurag, Zelle Dustin, Rothfuss Jonas, Rosgen Bill, Shenoy Ashish, Seybold Bryan, Li Xinjian, Mudigonda Jayaram, Erdogan Goker, Xia Jiawei, Simsa Jiri, Michi Andrea, Yao Yi, Yew Christopher, Kan Steven, Caswell Isaac, Radebaugh Carey, Elisseeff Andre, Valenzuela Pedro, Mckinney Kay, Paterson Kim, Cui Albert, Latorre-chimoto Eri, Kim Solomon, Zeng William, Durden Ken, Ponnapalli Priya, Sosea Tiberiu, Choquette-choo Christopher A., Manyika James, Robenek Brona, Vashisht Harsha, Pereira Sebastien, Lam Hoi, Velic Marko, Owusu-afriyie Denese, Lee Katherine, Bolukbasi Tolga, Parrish Alicia, Lu Shawn, Park Jane, Venkatraman Balaji, Talbert Alice, Rosique Lambert, Cheng Yuchung, Sozanschi Andrei, Paszke Adam, Kumar Praveen, Austin Jessica, Li Lu, Salama Khalid, Kim Wooyeol, Dukkipati Nandita, Baryshnikov Anthony, Kaplanis Christos, Sheng Xianghai, Chervonyi Yuri, Unlu Caglar, Casas Diego De Las, Askham Harry, Tunyasuvunakool Kathryn, Gimeno Felix, Poder Siim, Kwak Chester, Miecnikowski Matt, Mirrokni Vahab, Dimitriev Alek, Parisi Aaron, Liu Dangyi, Tsai Tomy, Shevlane Toby, Kouridi Christina, Garmon Drew, Goedeckemeyer Adrian, Brown Adam R., Vijayakumar Anitha, Elqursh Ali, Jazayeri Sadegh, Huang Jin, Carthy Sara Mc, Hoover Jay, Kim Lucy, Kumar Sandeep, Chen Wei, Biles Courtney, Bingham Garrett, Rosen Evan, Wang Lisa, Tan Qijun, Engel David, Pongetti Francesco, De Cesare Dario, Hwang Dongseong, Yu Lily, Pullman Jennifer, Narayanan Srini, Levin Kyle, Gopal Siddharth, Li Megan, Aharoni Asaf, Trinh Trieu, Lo Jessica, Casagrande Norman, Vij Roopali, Matthey Loic, Ramadhana Bramandia, 
Matthews Austin, Carey Cj, Johnson Matthew, Goranova Kremena, Shah Rohin, Ashraf Shereen, Dasgupta Kingshuk, Larsen Rasmus, Wang Yicheng, Vuyyuru Manish Reddy, Jiang Chong, Ijazi Joana, Osawa Kazuki, Smith Celine, Boppana Ramya Sree, Bilal Taylan, Koizumi Yuma, Xu Ying, Altun Yasemin, Shabat Nir, Bariach Ben, Korchemniy Alex, Choo Kiam, Ronneberger Olaf, Iwuanyanwu Chimezie, Zhao Shubin, Soergel David, Hsieh Cho-jui, Cai Irene, Iqbal Shariq, Sundermeyer Martin, Chen Zhe, Bursztein Elie, Malaviya Chaitanya, Biadsy Fadi, Shroff Prakash, Dhillon Inderjit, Latkar Tejasi, Dyer Chris, Forbes Hannah, Nicosia Massimo, Nikolaev Vitaly, Greene Somer, Georgiev Marin, Wang Pidong, Martin Nina, Sedghi Hanie, Zhang John, Banzal Praseem, Fritz Doug, Rao Vikram, Wang Xuezhi, Zhang Jiageng, Patraucean Viorica, Du Dayou, Mordatch Igor, Jurin Ivan, Liu Lewis, Dubey Ayush, Mohan Abhi, Nowakowski Janek, Ion Vlad-doru, Wei Nan, Tojo Reiko, Raad Maria Abi, Hudson Drew A., Keshava Vaishakh, Agrawal Shubham, Ramirez Kevin, Wu Zhichun, Nguyen Hoang, Liu Ji, Sewak Madhavi, Petrini Bryce, Choi Donghyun, Philips Ivan, Wang Ziyue, Bica Ioana, Garg Ankush, Wilkiewicz Jarek, Agrawal Priyanka, Li Xiaowei, Guo Danhao, Xue Emily, Shaik Naseer, Leach Andrew, Khan Sadh Mnm, Wiesinger Julia, Jerome Sammy, Chakladar Abhishek, Wang Alek Wenjiao, Ornduff Tina, Abu Folake, Ghaffarkhah Alireza, Wainwright Marcus, Cortes Mario, Liu Frederick, Maynez Joshua, Terzis Andreas, Samangouei Pouya, Mansour Riham, Kępa Tomasz, Aubet François-xavier, Algymr Anton, Banica Dan, Weisz Agoston, Orban Andras, Senges Alexandre, Andrejczuk Ewa, Geller Mark, Santo Niccolo Dal, Anklin Valentin, Merey Majd Al, Baeuml Martin, Strohman Trevor, Bai Junwen, Petrov Slav, Wu Yonghui, Hassabis Demis, Kavukcuoglu Koray, Dean Jeffrey, Vinyals Oriol
- Why Not Transform Chat Large Language Models To Non-english? Geng Xiang, Zhu Ming, Li Jiahuan, Lai Zhejian, Zou Wei, She Shuaijie, Guo Jiaxin, Zhao Xiaofeng, Li Yinglu, Li Yuang, Su Chang, Zhao Yanqing, Lyu Xinglin, Zhang Min, Chen Jiajun, Yang Hao, Huang Shujian
- Jailbreaking Llms With Arabic Transliteration And Arabizi Ghanim Mansour Al, Almohaimeed Saleh, Zheng Mengxin, Solihin Yan, Lou Qian
- Using Large Language Models To Enrich The Documentation Of Datasets For Machine Learning Giner-miguelez Joan, Gómez Abel, Cabot Jordi
- Chatglm: A Family Of Large Language Models From GLM-130B To GLM-4 All Tools Glm Team, Zeng Aohan, Xu Bin, Wang Bowen, Zhang Chenhui, Yin Da, Zhang Dan, Rojas Diego, Feng Guanyu, Zhao Hanlin, Lai Hanyu, Yu Hao, Wang Hongning, Sun Jiadai, Zhang Jiajie, Cheng Jiale, Gui Jiayi, Tang Jie, Zhang Jing, Sun Jingyu, Li Juanzi, Zhao Lei, Wu Lindong, Zhong Lucen, Liu Mingdao, Huang Minlie, Zhang Peng, Zheng Qinkai, Lu Rui, Duan Shuaiqi, Zhang Shudan, Cao Shulin, Yang Shuxun, Tam Weng Lam, Zhao Wenyi, Liu Xiao, Xia Xiao, Zhang Xiaohan, Gu Xiaotao, Lv Xin, Liu Xinghan, Liu Xinyi, Yang Xinyue, Song Xixuan, Zhang Xunkai, An Yifan, Xu Yifan, Niu Yilin, Yang Yuantao, Li Yueyan, Bai Yushi, Dong Yuxiao, Qi Zehan, Wang Zhaoyu, Yang Zhen, Du Zhengxiao, Hou Zhenyu, Wang Zihan
- Better & Faster Large Language Models Via Multi-token Prediction Gloeckle Fabian, Idrissi Badr Youbi, Rozière Baptiste, Lopez-paz David, Synnaeve Gabriel
- Direct Alignment Of Draft Model For Speculative Decoding With Chat-fine-tuned Llms Goel Raghavv, Gagrani Mukul, Jeon Wonseok, Park Junyoung, Lee Mingu, Lott Christopher
- Extending Interactive Science Exhibits Into The Classroom Using Anthropomorphized Chatbots And Bloom's Taxonomy Golding Yousuf
- Goldfinch: High Performance Rwkv/transformer Hybrid With Linear Pre-fill And Extreme Kv-cache Compression Goldstein Daniel, Obeid Fares, Alcaide Eric, Song Guangyu, Cheah Eugene
- Does Chatgpt Have A Mind? Goldstein Simon, Levinstein Benjamin A.
- Mechanistic Interpretability Of Large Language Models With Applications To The Financial Services Industry Golgoon Ashkan, Filom Khashayar, Kannan Arjun Ravi
- The Advantages Of Context Specific Language Models: The Case Of The Erasmian Language Model Gonçalves João, Jelicic Nick, Murgia Michele, Stamhuis Evert
- Loose LIPS Sink Ships: Asking Questions In Battleship With Language-informed Program Sampling Grand Gabriel, Pepe Valerio, Andreas Jacob, Tenenbaum Joshua B.
- Assessing Generative Language Models In Classification Tasks: Performance And Self-evaluation Capabilities In The Environmental And Climate Change Domain Grasso Francesca, Locci Stefano
- Is Mamba Capable Of In-context Learning? Grazzi Riccardo, Siems Julien, Schrodi Simon, Brox Thomas, Hutter Frank
- Large Language Models And Linguistic Intentionality Grindrod Jumbly
- Transformers, Contextualism, And Polysemy Grindrod Jumbly
- Middleware For Llms: Tools Are Instrumental For Language Agents In Complex Environments Gu Yu, Shu Yiheng, Yu Hao, Liu Xiao, Dong Yuxiao, Tang Jie, Srinivasa Jayanth, Latapie Hugo, Su Yu
- Anah-v2: Scaling Analytical Hallucination Annotation Of Large Language Models Gu Yuzhe, Ji Ziwei, Zhang Wenwei, Lyu Chengqi, Lin Dahua, Chen Kai
- Diverse And Fine-grained Instruction-following Ability Exploration With Synthetic Data Gu Zihui, Sun Xingwu, Lian Fengzong, Kang Zhanhui, Xu Cheng-zhong, Fan Ju
- Look Before You Leap: Towards Decision-aware And Generalizable Tool-usage For Large Language Models Gui Anchun, Li Jian, Dai Yong, Du Nan, Xiao Han
- Sample Design Engineering: An Empirical Study Of What Makes Good Downstream Fine-tuning Samples For Llms Guo Biyang, Wang He, Xiao Wenyilin, Chen Hong, Lee Zhuxin, Han Songqiao, Huang Hailiang
- A Novel Paradigm Boosting Translation Capabilities Of Large Language Models Guo Jiaxin, Yang Hao, Li Zongyao, Wei Daimeng, Shang Hengchao, Chen Xiaoyu
- SGSH: Stimulate Large Language Models With Skeleton Heuristics For Knowledge Base Question Generation Guo Shasha, Liao Lizi, Zhang Jing, Wang Yanling, Li Cuiping, Chen Hong
- Ds-agent: Automated Data Science By Empowering Large Language Models With Case-based Reasoning Guo Siyuan, Deng Cheng, Wen Ying, Chen Hechang, Chang Yi, Wang Jun
- Efficient Continual Pre-training By Mitigating The Stability Gap Guo Yiduo, Fu Jie, Zhang Huishuai, Zhao Dongyan, Shen Yikang
- GPTQT: Quantize Large Language Models Twice To Push The Efficiency Guo Yipin, Lang Yilin, Ren Qinyuan
- API Pack: A Massive Multi-programming Language Dataset For API Call Generation Guo Zhen, Soria Adriana Meza, Sun Wei, Shen Yikang, Panda Rameswar
- Stabletoolbench: Towards Stable Large-scale Benchmarking On Tool Learning Of Large Language Models Guo Zhicheng, Cheng Sijie, Wang Hao, Liang Shihao, Qin Yujia, Li Peng, Liu Zhiyuan, Sun Maosong, Liu Yang
- METAREFLECTION: Learning Instructions For Language Agents Using Past Reflections Gupta Priyanshu, Kirtania Shashank, Singha Ananya, Gulwani Sumit, Radhakrishna Arjun, Shi Sherry, Soares Gustavo
- Multilingual Large Language Models And Curse Of Multilinguality Gurgurov Daniil, Bäumel Tanja, Anikina Tatiana
- Clochat: Understanding How People Customize, Interact, And Experience Personas In Large Language Models Ha Juhye, Jeon Hyeon, Han Daeun, Seo Jinwook, Oh Changhoon
- METAL: Towards Multilingual Meta-evaluation Hada Rishav, Gumma Varun, Ahmed Mohamed, Bali Kalika, Sitaram Sunayana
- Would I Lie To You? Inference Time Alignment Of Language Models Using Direct Preference Heads Hadji-kyriacou Avelina Asada, Arandjelovic Ognjen
- Covert Malicious Finetuning: Challenges In Safeguarding LLM Adaptation Halawi Danny, Wei Alexander, Wallace Eric, Wang Tony T., Haghtalab Nika, Steinhardt Jacob
- PECC: Problem Extraction And Coding Challenges Haller Patrick, Golde Jonas, Akbik Alan
- Small Language Model Can Self-correct Han Haixia, Liang Jiaqing, Shi Jie, He Qianyu, Xiao Yanghua
- RECIPE4U: Student-chatgpt Interaction Dataset In EFL Writing Education Han Jieun, Yoo Haneul, Myung Junho, Kim Minsun, Lee Tak Yeon, Ahn So-yeon, Oh Alice
- Transfer Learning For Text Diffusion Models Han Kehang, Kenealy Kathleen, Barua Aditya, Fiedel Noah, Constant Noah
- Chatgpt Based Data Augmentation For Improved Parameter-efficient Debiasing Of Llms Han Pengrui, Kocielnik Rafal, Saravanan Adhithya, Jiang Roy, Sharir Or, Anandkumar Anima
- Wildguard: Open One-stop Moderation Tools For Safety Risks, Jailbreaks, And Refusals Of Llms Han Seungju, Rao Kavel, Ettinger Allyson, Jiang Liwei, Lin Bill Yuchen, Lambert Nathan, Choi Yejin, Dziri Nouha
- Value Augmented Sampling For Language Model Alignment And Personalization Han Seungwook, Shenfeld Idan, Srivastava Akash, Kim Yoon, Agrawal Pulkit
- COCO Is "ALL" You Need For Visual Instruction Fine-tuning Han Xiaotian, Wang Yiqi, Zhai Bohan, You Quanzeng, Yang Hongxia
- Jailbreaking Proprietary Large Language Models Using Word Substitution Cipher Handa Divij, Chirmule Advait, Gajera Bimal, Baral Chitta
- Actionreasoningbench: Reasoning About Actions With And Without Ramification Constraints Handa Divij, Dolin Pavel, Kumbhar Shrinidhi, Baral Chitta, Son Tran Cao
- Vision-and-language Navigation Generative Pretrained Transformer Hanlin Wen
- LLM Reasoners: New Evaluation, Library, And Analysis Of Step-by-step Reasoning With Large Language Models Hao Shibo, Gu Yi, Luo Haotian, Liu Tianyang, Shao Xiyan, Wang Xinyuan, Xie Shuhua, Ma Haodi, Samavedhi Adithya, Gao Qiyue, Wang Zhen, Hu Zhiting
- Learning To Rewrite: Generalized Llm-generated Text Detection Hao Wei, Li Ran, Zhao Weiliang, Yang Junfeng, Mao Chengzhi
- Exploring Chatgpt And Its Impact On Society Haque Md. Asraful, Li Shuai
- Astprompter: Weakly Supervised Automated Language Model Red-teaming To Identify Likely Toxic Prompts Hardy Amelia F., Liu Houjun, Lange Bernard, Kochenderfer Mykel J.
- Measuring Psychological Depth In Language Models Harel-canada Fabrice, Zhou Hanyu, Mupalla Sreya, Yildiz Zeynep, Sahai Amit, Peng Nanyun
- A Comparison Of Large Language Model And Human Performance On Random Number Generation Tasks Harrison Rachel M.
- Do Large Language Models Speak All Languages Equally? A Comparative Study In Low-resource Settings Hasan Md. Arid, Tarannum Prerona, Dey Krishno, Razzak Imran, Naseem Usman
- A Comprehensive Overview Of Large Language Models (llms) For Cyber Defences: Opportunities And Directions Hassanin Mohammed, Moustafa Nour
- Relative Value Biases In Large Language Models Hayes William M., Yax Nicolas, Palminteri Stefano
- Olympiadbench: A Challenging Benchmark For Promoting AGI With Olympiad-level Bilingual Multimodal Scientific Problems He Chaoqun, Luo Renjie, Bai Yuzhuo, Hu Shengding, Thai Zhen Leng, Shen Junhao, Hu Jinyi, Han Xu, Huang Yujie, Zhang Yuxiang, Liu Jie, Qi Lei, Liu Zhiyuan, Sun Maosong
- Webvoyager: Building An End-to-end Web Agent With Large Multimodal Models He Hongliang, Yao Wenlin, Ma Kaixin, Yu Wenhao, Dai Yong, Zhang Hongming, Lan Zhenzhong, Yu Dong
- Cos: Enhancing Personalization And Mitigating Bias With Context Steering He Jerry Zhi-yang, Pandey Sashrika, Schrum Mariah L., Dragan Anca
- Prompt-based Bias Calibration For Better Zero/few-shot Learning Of Language Models He Kang, Long Yinghan, Roy Kaushik
- Pitvqa: Image-grounded Text Embedding LLM For Visual Question Answering In Pituitary Surgery He Runlong, Xu Mengya, Das Adrito, Khan Danyal Z., Bano Sophia, Marcus Hani J., Stoyanov Danail, Clarkson Matthew J., Islam Mobarakol
- Prompting Chatgpt For Translation: A Comparative Analysis Of Translation Brief And Persona Prompts He Sui
- Transferring Troubles: Cross-lingual Transferability Of Backdoor Attacks In Llms With Instruction Tuning He Xuanli, Wang Jun, Xu Qiongkai, Minervini Pasquale, Stenetorp Pontus, Rubinstein Benjamin I. P., Cohn Trevor
- Fortifying Ethical Boundaries In AI: Advanced Strategies For Enhancing Security In Large Language Models He Yunhong, Qiu Jianling, Zhang Wei, Yuan Zhengqing
- Reindex-then-adapt: Improving Large Language Models For Conversational Recommendation He Zhankui, Xie Zhouhang, Steck Harald, Liang Dawen, Jha Rahul, Kallus Nathan, Mcauley Julian
- Quality Of Answers Of Generative Large Language Models Vs Peer Patients For Interpreting Lab Test Results For Lay Patients: Evaluation Study He Zhe, Bhasuran Balu, Jin Qiao, Tian Shubo, Hanna Karim, Shavor Cindy, Arguello Lisbeth Garcia, Murray Patrick, Lu Zhiyong
- Evaluation Of The Programming Skills Of Large Language Models Heitz Luc Bryan, Chamas Joun, Scherb Christopher
- Intent-conditioned And Non-toxic Counterspeech Generation Using Multi-task Instruction Tuning With RLAIF Hengle Amey, Kumar Aswini, Singh Sahajpreet, Bandhakavi Anil, Akhtar Md Shad, Chakroborty Tanmoy
- Can Large Language Models Make The Grade? An Empirical Study Evaluating Llms Ability To Mark Short Answer Questions In K-12 Education Henkel Owen, Boxer Adam, Hills Libby, Roberts Bill
- Generative Visual Instruction Tuning Hernandez Jefferson, Villegas Ruben, Ordonez Vicente
- Lilium: Ebay's Large Language Models For E-commerce Herold Christian, Kozielski Michael, Ekimov Leonid, Petrushkov Pavel, Vandenbussche Pierre-yves, Khadivi Shahram
- Evaluation Of Instruction-following Ability For Large Language Models On Story-ending Generation Hida Rem, Ohmura Junki, Sekiya Toshiyuki
- Eliciting Personality Traits In Large Language Models Hilliard Airlie, Munoz Cristian, Wu Zekun, Koshiyama Adriano Soares
- Large Language Models Aren't All That You Need Holla Kiran Voderhobli, Kumar Chaithanya, Singh Aryan
- Llm-aided Compilation For Tensor Accelerators Hong Charles, Bhatia Sahil, Haan Altan, Dong Shengjun Kris, Nikiforov Dima, Cheung Alvin, Shao Yakun Sophia
- Towards Building A Robust Knowledge Intensive Question Answering Model With Large Language Models Hong Xingyun, Shao Yan, Wang Zhilin, Duan Manni, Xiongnan Jin
- Free To Play: UN Trade And Development's Experience With Developing Its Own Open-source Retrieval Augmented Generation Large Language Model Application Hopp Daniel
- Efficient Solutions For An Intriguing Failure Of Llms: Long Context Window Does Not Mean Llms Can Analyze Long Sequences Flawlessly Hosseini Peyman, Castro Ignacio, Ghinassi Iacopo, Purver Matthew
- Retrieval-augmented Code Completion For Local Projects Using Large Language Models Hostnik Marko, Robnik-šikonja Marko
- Large Language Models As Misleading Assistants In Conversation Hou Betty Li, Shi Kejian, Phang Jason, Aung James, Adler Steven, Campbell Rosie
- A Systematic Evaluation Of Large Language Models For Generating Programming Code Hou Wenpin, Ji Zhicheng
- Bridging Language And Items For Retrieval And Recommendation Hou Yupeng, Li Jiacheng, He Zhankui, Yan An, Chen Xiusi, Mcauley Julian
- Large Language Model-powered Chatbots For Internationalizing Student Support In Higher Education Hsain Achraf, Housni Hamza El
- Wilke: Wise-layer Knowledge Editor For Lifelong Knowledge Editing Hu Chenhui, Cao Pengfei, Chen Yubo, Liu Kang, Zhao Jun
- Rankprompt: Step-by-step Comparisons Make Language Models Better Reasoners Hu Chi, Ge Yuan, Ma Xiangnan, Cao Hang, Li Qiang, Yang Yonghua, Xiao Tong, Zhu Jingbo
- Teaching Language Models To Self-improve By Learning From Language Feedback Hu Chi, Hu Yimin, Cao Hang, Xiao Tong, Zhu Jingbo
- Multimodal Chain-of-thought Reasoning Via Chatgpt To Protect Children From Age-inappropriate Apps Hu Chuanbo, Liu Bin, Yin Minglei, Zhou Yilu, Li Xin
- Agentgen: Enhancing Planning Abilities For Large Language Model Based Agent Via Environment And Task Generation Hu Mengkang, Zhao Pu, Xu Can, Sun Qingfeng, Lou Jianguang, Lin Qingwei, Luo Ping, Rajmohan Saravan, Zhang Dongmei
- Mixed Sparsity Training: Achieving 4× FLOP Reduction For Transformer Pretraining Hu Pihe, Li Shaolong, Huang Longbo
- Generative Pretrained Structured Transformers: Unsupervised Syntactic Language Models At Scale Hu Xiang, Ji Pengyu, Zhu Qingyang, Wu Wei, Tu Kewei
- Themis: Towards Flexible And Interpretable NLG Evaluation Hu Xinyu, Lin Li, Gao Mingqi, Yin Xunjian, Wan Xiaojun
- Listen Again And Choose The Right Answer: A New Paradigm For Automatic Speech Recognition With Large Language Models Hu Yuchen, Chen Chen, Qin Chengwei, Zhu Qiushi, Chng Eng Siong, Li Ruizhe
- Longrecipe: Recipe For Efficient Long Context Generalization In Large Language Models Hu Zhiyuan, Liu Yuliang, Zhao Jinman, Wang Suyuchen, Wang Yan, Shen Wei, Gu Qing, Luu Anh Tuan, Ng See-kiong, Jiang Zhiwei, Hooi Bryan
- Talk Less, Interact Better: Evaluating In-context Conversational Adaptation In Multimodal Llms Hua Yilun, Artzi Yoav
- Applying And Evaluating Large Language Models In Mental Health Care: A Scoping Review Of Human-assessed Generative Tasks Hua Yining, Na Hongbin, Li Zehan, Liu Fenglin, Fang Xiao, Clifton David, Torous John
- Pokergpt: An End-to-end Lightweight Solver For Multi-player Texas Hold'em Via Large Language Model Huang Chenghao, Cao Yanbo, Wen Yinlong, Zhou Tao, Zhang Yanru
- Training Language Models To Generate Text With Citations Via Fine-grained Rewards Huang Chengyu, Wu Zeqiu, Hu Yushi, Wang Wenya
- Generating Educational Materials With Different Levels Of Readability Using Llms Huang Chieh-yang, Wei Jing, Huang Ting-hao 'kenneth'
- Chatgpt Rates Natural Language Explanation Quality Like Humans: But On Which Scales? Huang Fan, Kwak Haewoon, Park Kunwoo, An Jisun
- A Survey On Evaluation Of Multimodal Large Language Models Huang Jiaxing, Zhang Jingyi
- Learning Fine-grained Grounded Citations For Attributed Large Language Models Huang Lei, Feng Xiaocheng, Ma Weitao, Gu Yuxuan, Zhong Weihong, Feng Xiachong, Yu Weijiang, Peng Weihua, Tang Duyu, Tu Dandan, Qin Bing
- Visual Hallucinations Of Multi-modal Large Language Models Huang Wen, Liu Hongbin, Guo Minxin, Gong Neil Zhenqiang
- Key-point-driven Data Synthesis With Its Enhancement On Mathematical Reasoning Huang Yiming, Liu Xiao, Gong Yeyun, Gou Zhibin, Shen Yelong, Duan Nan, Chen Weizhu
- Leveraging Large Language Models For Enhanced NLP Task Performance Through Knowledge Distillation And Optimized Training Strategies Huang Yining, Tang Keke, Chen Meilian
- A Comprehensive Survey On Evaluating Large Language Model Applications In The Medical Industry Huang Yining, Tang Keke, Chen Meilian, Wang Boyuan
- Trustllm: Trustworthiness In Large Language Models Huang Yue, Sun Lichao, Wang Haoran, Wu Siyuan, Zhang Qihui, Li Yuan, Gao Chujie, Huang Yixin, Lyu Wenhan, Zhang Yixuan, Li Xiner, Liu Zhengliang, Liu Yixin, Wang Yijue, Zhang Zhikun, Vidgen Bertie, Kailkhura Bhavya, Xiong Caiming, Xiao Chaowei, Li Chunyuan, Xing Eric, Huang Furong, Liu Hao, Ji Heng, Wang Hongyi, Zhang Huan, Yao Huaxiu, Kellis Manolis, Zitnik Marinka, Jiang Meng, Bansal Mohit, Zou James, Pei Jian, Liu Jian, Gao Jianfeng, Han Jiawei, Zhao Jieyu, Tang Jiliang, Wang Jindong, Vanschoren Joaquin, Mitchell John, Shu Kai, Xu Kaidi, Chang Kai-wei, He Lifang, Huang Lifu, Backes Michael, Gong Neil Zhenqiang, Yu Philip S., Chen Pin-yu, Gu Quanquan, Xu Ran, Ying Rex, Ji Shuiwang, Jana Suman, Chen Tianlong, Liu Tianming, Zhou Tianyi, Wang William, Li Xiang, Zhang Xiangliang, Wang Xiao, Xie Xing, Chen Xun, Wang Xuyu, Liu Yan, Ye Yanfang, Cao Yinzhi, Chen Yong, Zhao Yue
- Ecr-chain: Advancing Generative Language Models To Better Emotion-cause Reasoners Through Reasoning Chains Huang Zhaopei, Zhao Jinming, Jin Qin
- Olympicarena: Benchmarking Multi-discipline Cognitive Reasoning For Superintelligent AI Huang Zhen, Wang Zengzhi, Xia Shijie, Li Xuefeng, Zou Haoyang, Xu Ruijie, Fan Run-ze, Ye Lyumanshan, Chern Ethan, Ye Yixin, Zhang Yikai, Yang Yuqing, Wu Ting, Wang Binjie, Sun Shichao, Xiao Yang, Li Yiyuan, Zhou Fan, Chern Steffi, Qin Yiwei, Ma Yan, Su Jiadi, Liu Yixiu, Zheng Yuxiang, Zhang Shaoting, Lin Dahua, Qiao Yu, Liu Pengfei
- Towards A Psychology Of Machines: Large Language Models Predict Human Memory Huff Markus, Ulakçı Elanur
- Federated Recommendation Via Hybrid Retrieval Augmented Generation Huimin Zeng, Zhenrui Yue, Qian Jiang, Dong Wang
- The Promise And Challenges Of Using Llms To Accelerate The Screening Process Of Systematic Reviews Huotala Aleksi, Kuutila Miikka, Ralph Paul, Mäntylä Mika
- Vietnamese Poem Generation & The Prospect Of Cross-language Poem-to-poem Translation Huynh Triet Minh, Bao Quan Le
- Assessing Llms Suitability For Knowledge Graph Completion Iga Vasile Ionut Remus, Silaghi Gheorghe Cosmin
- Hallucinations Or Attention Misdirection? The Path To Strategic Value Extraction In Business Using Large Language Models Ioste Aline
- Autocompletion Of Chief Complaints In The Electronic Health Records Using Large Language Models Islam K M Sajjadul, Nipu Ayesha Siddika, Madiraju Praveen, Deshpande Priya
- Are Large Vision Language Models Up To The Challenge Of Chart Comprehension And Reasoning? An Extensive Investigation Into The Capabilities And Limitations Of Lvlms Islam Mohammed Saidul, Rahman Raian, Masry Ahmed, Laskar Md Tahmid Rahman, Nayeem Mir Tafseer, Hoque Enamul
- Response Generation For Cognitive Behavioral Therapy With Large Language Models: Comparative Study With Socratic Questioning Izumi Kenta, Tanaka Hiroki, Shidara Kazuhiro, Adachi Hiroyoshi, Kanayama Daisuke, Kudo Takashi, Nakamura Satoshi
- Evaluating The Application Of Large Language Models To Generate Feedback In Programming Education Jacobs Sven, Jaschke Steffen
- Ffn-skipllm: A Hidden Gem For Autoregressive Decoding With Adaptive Feed Forward Skipping Jaiswal Ajay, Hu Bodun, Yin Lu, Ro Yeonju, Liu Shiwei, Chen Tianlong, Akella Aditya
- KIT-19: A Comprehensive Korean Instruction Toolkit On 19 Tasks For Fine-tuning Korean Large Language Models Jang Dongjun, Byun Sungjoo, Jo Hyemi, Shin Hyopil
- Rectifying Demonstration Shortcut In In-context Learning Jang Joonwon, Jang Sanghwan, Kweon Wonbin, Jeon Minjin, Yu Hwanjo
- Evaluating Students' Open-ended Written Responses With Llms: Using The RAG Framework For GPT-3.5, GPT-4, Claude-3, And Mistral-large Jauhiainen Jussi S., Guerra Agustín Garagorry
- Performance Of Recent Large Language Models For A Low-resourced Language Jayakody Ravindu, Dias Gihan
- Aligner: Efficient Alignment By Learning To Correct Ji Jiaming, Chen Boyuan, Lou Hantao, Hong Donghai, Zhang Borong, Pan Xuehai, Dai Juntao, Qiu Tianyi, Yang Yaodong
- Can't Say Cant? Measuring And Reasoning Of Dark Jargons In Large Language Models Ji Xu, Zhang Jianyi, Zhou Ziyin, Zhao Zhangchi, Qiao Qianqian, Han Kaiying, Hossen Md Imran, Hei Xiali
- ANAH: Analytical Annotation Of Hallucinations In Large Language Models Ji Ziwei, Gu Yuzhe, Zhang Wenwei, Lyu Chengqi, Lin Dahua, Chen Kai
- Simulbench: Evaluating Language Models With Creative Simulation Tasks Jia Qi, Yue Xiang, Zheng Tianyu, Huang Jie, Lin Bill Yuchen
- Heuristic-enhanced Candidates Selection Strategy For Gpts Tackle Few-shot Aspect-based Sentiment Analysis Jiang Baoxing, Wan Yujie, Ju Shenggen
- Llm-collaboration On Automatic Science Journalism For The General Audience Jiang Gongyao, Shi Xinran, Luo Qiong
- Can Large Language Models Generate High-quality Patent Claims? Jiang Lekang, Zhang Caiqi, Scherz Pascal A, Goetz Stephan
- Detecting Scams Using Large Language Models Jiang Liming
- Investigating Data Contamination For Pre-training Language Models Jiang Minhao, Liu Ken Ziyu, Zhong Ming, Schaeffer Rylan, Ouyang Siru, Han Jiawei, Koyejo Sanmi
- Sketch: A Toolkit For Streamlining LLM Operations Jiang Xin, Li Xiang, Ma Wenjia, Fang Xuezhi, Yao Yiqun, Yu Naitong, Meng Xuying, Han Peng, Li Jing, Sun Aixin, Wang Yequan
- Large Visual-language Models Are Also Good Classifiers: A Study Of In-context Multimodal Fake News Detection Jiang Ye, Wang Yimin
- Learning Planning-based Reasoning By Trajectories Collection And Process Reward Synthesizing Jiao Fangkai, Qin Chengwei, Liu Zhengyuan, Chen Nancy F., Joty Shafiq
- Gradable Chatgpt Translation Evaluation Jiao Hui, Peng Bei, Zong Lu, Zhang Xiaojun, Li Xinwei
- Prompt-based Length Controlled Generation With Multiple Control Types Jie Renlong, Meng Xiaojun, Shang Lifeng, Jiang Xin, Liu Qun
- Zero-shot Chain-of-thought Reasoning Guided By Evolutionary Algorithms In Large Language Models Jin Feihu, Liu Yifan, Tan Ying
- GUARD: Role-playing To Generate Natural-language Jailbreakings To Test Guideline Adherence Of Large Language Models Jin Haibo, Chen Ruoxi, Zhou Andy, Zhang Yang, Wang Haohan
- Crimson: Empowering Strategic Reasoning In Cybersecurity Through Large Language Models Jin Jiandong, Tang Bowen, Ma Mingxuan, Liu Xiao, Wang Yunfei, Lai Qingnan, Yang Jia, Zhou Changling
- Can Chatgpt Support Developers? An Empirical Evaluation Of Large Language Models For Code Generation Jin Kailun, Wang Chung-yu, Pham Hung Viet, Hemmati Hadi
- Hidden Flaws Behind Expert-level Accuracy Of Multimodal GPT-4 Vision In Medicine Jin Qiao, Chen Fangyuan, Zhou Yiliang, Xu Ziyang, Cheung Justin M., Chen Robert, Summers Ronald M., Rousseau Justin F., Ni Peiyun, Landsman Marc J, Baxter Sally L., Al'aref Subhi J., Li Yijia, Chen Alex, Brejt Josef A., Chiang Michael F., Peng Yifan, Lu Zhiyong
- Adaptive Skeleton Graph Decoding Jin Shuowei, Wu Yongji, Zheng Haizhong, Zhang Qingzhao, Lentz Matthew, Mao Z. Morley, Prakash Atul, Qian Feng, Zhuo Danyang
- Self-selected Attention Span For Accelerating Large Language Model Inference Jin Tian, Yazar Wanzin, Xu Zifei, Sharify Sayeh, Wang Xin
- SMART: Automatically Scaling Down Language Models With Accuracy Guarantees For Reduced Processing Fees Jo Saehan, Trummer Immanuel
- Llm-based Open-domain Integrated Task And Knowledge Assistants With Programmable Policies Joshi Harshit, Liu Shicheng, Chen James, Weigle Robert, Lam Monica S.
- How Large Language Models Encode Context Knowledge? A Layer-wise Probing Study Ju Tianjie, Sun Weiwei, Du Wei, Yuan Xinwei, Ren Zhaochun, Liu Gongshen
- Logical Closed Loop: Uncovering Object Hallucinations In Large Vision-language Models Junfei Wu, Qiang Liu, Ding Wang, Jinghao Zhang, Shu Wu, Liang Wang, Tieniu Tan
- Trust Or Escalate: LLM Judges With Provable Guarantees For Human Agreement Jung Jaehun, Brahman Faeze, Choi Yejin
- Information-theoretic Distillation For Reference-less Summarization Jung Jaehun, Lu Ximing, Jiang Liwei, Brahman Faeze, West Peter, Koh Pang Wei, Choi Yejin
- Exploring The Latest Llms For Leaderboard Extraction Kabongo Salomon, D'souza Jennifer, Auer Sören
- Using Large Language Models To Assess Tutors' Performance In Reacting To Students Making Math Errors Kakarla Sanjit, Thomas Danielle, Lin Jionghao, Gupta Shivang, Koedinger Kenneth R.
- Counterfactuals As A Means For Evaluating Faithfulness Of Attribution Methods In Autoregressive Language Models Kamahi Sepehr, Yaghoobzadeh Yadollah
- Eagle: Ethical Dataset Given From Real Interactions Kaneko Masahiro, Bollegala Danushka, Baldwin Timothy
- Prompt-rag: Pioneering Vector Embedding-free Retrieval-augmented Generation In Niche Domains, Exemplified By Korean Medicine Kang Bongsu, Kim Jundong, Yun Tae-rim, Kim Chang-eop
- Mindstar: Enhancing Math Reasoning In Pre-trained Llms At Inference Time Kang Jikun, Li Xin Zhe, Chen Xi, Kazemi Amirreza, Sun Qianyi, Chen Boxing, Li Dong, He Xu, He Quan, Wen Feng, Hao Jianye, Yao Jun
- Solving For X And Beyond: Can Large Language Models Solve Complex Math Problems With More-than-two Unknowns? Kao Kuei-chun, Wang Ruochen, Hsieh Cho-jui
- Omniact: A Dataset And Benchmark For Enabling Multimodal Generalist Autonomous Agents For Desktop And Web Kapoor Raghav, Butala Yash Parag, Russak Melisa, Koh Jing Yu, Kamble Kiran, Alshikh Waseem, Salakhutdinov Ruslan
- Teaching Machines To Code: Smart Contract Translation With Llms Karanjai Rabimba, Xu Lei, Shi Weidong
- Evaluating Interventional Reasoning Capabilities Of Large Language Models Kasetty Tejas, Mahajan Divyat, Dziugaite Gintare Karolina, Drouin Alexandre, Sridhar Dhanya
- Beyond Traditional Benchmarks: Analyzing Behaviors Of Open Llms On Data-to-text Generation Kasner Zdeněk, Dušek Ondřej
- From Text To Transformation: A Comprehensive Review Of Large Language Models' Versatility Kaur Pravneet, Kashyap Gautam Siddharth, Kumar Ankit, Nafis Md Tabrez, Kumar Sandeep, Shokeen Vikrant
- Defending Large Language Models Against Attacks With Residual Stream Activation Analysis Kawasaki Amelia, Davis Andrew, Abbas Houssam
- Codeaid: Evaluating A Classroom Deployment Of An Llm-based Programming Assistant That Balances Student And Educator Needs Kazemitabaar Majeed, Ye Runlong, Wang Xiaoning, Henley Austin Z., Denny Paul, Craig Michelle, Grossman Tovi
- Exploring The Frontiers Of Llms In Psychological Applications: A Comprehensive Review Ke Luoma, Tong Song, Cheng Peng, Peng Kaiping
- Development And Testing Of Retrieval Augmented Generation In Large Language Models -- A Case Study Report Ke Yuhe, Jin Liyuan, Elangovan Kabilan, Abdullah Hairil Rizal, Liu Nan, Sia Alex Tiong Heng, Soh Chai Rick, Tung Joshua Yi Min, Ong Jasmine Chiat Ling, Ting Daniel Shu Wei
- Abdelhak At Semeval-2024 Task 9: Decoding Brainteasers, The Efficacy Of Dedicated Models Versus Chatgpt Kelious Abdelhak, Okirim Mounir
- Visiongpt: Vision-language Understanding Agent Using Generalized Multimodal Framework Kelly Chris, Hu Luhui, Yang Bang, Tian Yu, Yang Deshun, Yang Cindy, Huang Zaoshan, Li Zihao, Hu Jiayin, Zou Yuexian
- Autonomous Prompt Engineering In Large Language Models Kepel Daan, Valogianni Konstantina
- Harmonic Llms Are Trustworthy Kersting Nicholas S., Rahman Mohammad, Vedala Suchismitha, Wang Yang
- Reinforcement Of Explainability Of Chatgpt Prompts By Embedding Breast Cancer Self-screening Rules Into AI Responses Khan Yousef, Hamed Ahmed Abdeen
- Consistency And Uncertainty: Identifying Unreliable Responses From Black-box Vision-language Models For Selective Visual Question Answering Khan Zaid, Fu Yun
- Do Moral Judgment And Reasoning Capability Of Llms Change With Language? A Study Using The Multilingual Defining Issues Test Khandelwal Aditi, Agarwal Utkarsh, Tanmay Kumar, Choudhury Monojit
- Iterative Prompt Refinement For Radiation Oncology Symptom Extraction Using Teacher-student Large Language Models Khanmohammadi Reza, Ghanem Ahmed I, Verdecchia Kyle, Hall Ryan, Elshaikh Mohamed, Movsas Benjamin, Bagher-ebadian Hassan, Chetty Indrin, Ghassemi Mohammad M., Thind Kundan
- ARGS: Alignment As Reward-guided Search Khanov Maxim, Burapacheep Jirayu, Li Yixuan
- Efficient Medical Question Answering With Knowledge-augmented Question Generation Khlaut Julien, Dancette Corentin, Ferreres Elodie, Bennani Alaedine, Hérent Paul, Manceron Pierre
- Beyond Code Generation: An Observational Study Of Chatgpt Usage In Software Engineering Practice Khojah Ranim, Mohamad Mazen, Leitner Philipp, Neto Francisco Gomes De Oliveira
- Llms In The Loop: Leveraging Large Language Model Annotations For Active Learning In Low-resource Languages Kholodna Nataliia, Julka Sahib, Khodadadi Mohammad, Gumus Muhammed Nurullah, Granitzer Michael
- A Comprehensive Survey Of Accelerated Generation Techniques In Large Language Models Khoshnoodi Mahsa, Jain Vinija, Gao Mingye, Srikanth Malavika, Chadha Aman
- Why And When Llm-based Assistants Can Go Wrong: Investigating The Effectiveness Of Prompt-based Interactions For Software Help-seeking Khurana Anjali, Subramonyam Hari, Chilana Parmit K
- Mhgpt: A Lightweight Generative Pre-trained Transformer For Mental Health Text Analysis Kim Dae-young, Hwa Rebecca, Rahman Muhammad Mahbubur
- Small Language Models Learn Enhanced Reasoning Skills From Medical Textbooks Kim Hyunjae, Hwang Hyeon, Lee Jiwoo, Park Sihyeon, Kim Dain, Lee Taewhoo, Yoon Chanwoong, Sohn Jiwoong, Choi Donghee, Kang Jaewoo
- Finer: Investigating And Enhancing Fine-grained Visual Concept Recognition In Large Vision Language Models Kim Jeonghwan, Ji Heng
- Persona Is A Double-edged Sword: Enhancing The Zero-shot Reasoning By Ensembling The Role-playing And Neutral Prompts Kim Junseok, Yang Nakyeong, Jung Kyomin
- Advisorqa: Towards Helpful And Harmless Advice-seeking Question Answering With Collective Intelligence Kim Minbeom, Lee Hwanhee, Park Joonsuk, Lee Hwaran, Jung Kyomin
- Automatic Jailbreaking Of The Text-to-image Generative AI Systems Kim Minseon, Lee Hyomin, Gong Boqing, Zhang Huishuai, Hwang Sung Ju
- Is GPT-4 Alone Sufficient For Automated Essay Scoring?: A Comparative Judgment Approach Based On Rater Cognition Kim Seungju, Jo Meounggun
- Prometheus 2: An Open Source Language Model Specialized In Evaluating Other Language Models Kim Seungone, Suk Juyoung, Longpre Shayne, Lin Bill Yuchen, Shin Jamin, Welleck Sean, Neubig Graham, Lee Moontae, Lee Kyungjae, Seo Minjoon
- Can Language Models Evaluate Human Written Text? Case Study On Korean Student Writing For Education Kim Seungyoon, Kim Seungone
- A Survey On Integration Of Large Language Models With Intelligent Robots Kim Yeseung, Kim Dohyun, Choi Jieun, Park Jisang, Oh Nayoung, Park Daehyung
- Tablevqa-bench: A Visual Question Answering Benchmark On Multiple Table Domains Kim Yoonsik, Yim Moonbin, Song Ka Yeon
- Health-llm: Large Language Models For Health Prediction Via Wearable Sensor Data Kim Yubin, Xu Xuhai, Mcduff Daniel, Breazeal Cynthia, Park Hae Won
- Medexqa: Medical Question Answering Benchmark With Multiple Explanations Kim Yunsoo, Wu Jinge, Abdulle Yusuf, Wu Honghan
- Evaluating Telugu Proficiency In Large Language Models: A Comparative Analysis Of Chatgpt And Gemini Kishore Katikela Sreeharsha, Shaik Rahimanuddin
- Unveiling Divergent Inductive Biases Of Llms On Temporal Data Kishore Sindhu, He Hangfeng
- Cleared For Takeoff? Compositional & Conditional Reasoning May Be The Achilles Heel To (flight-booking) Language Agents Kohli Harsh, Sun Huan
- Chatspamdetector: Leveraging Large Language Models For Effective Phishing Email Detection Koide Takashi, Fukushi Naoki, Nakano Hiroki, Chiba Daiki
- Self-prompt Tuning: Enable Autonomous Role-playing In Llms Kong Aobo, Zhao Shiwan, Chen Hao, Li Qicheng, Qin Yong, Sun Ruiqi, Zhou Xin, Zhou Jiaming, Sun Haoqin
- Labeling Supervised Fine-tuning Data With The Scaling Law Kong Huanjun
- Aligning Large Language Models With Representation Editing: A Control Perspective Kong Lingkai, Wang Haorui, Mu Wenhao, Du Yuanqi, Zhuang Yuchen, Zhou Yifei, Song Yue, Zhang Rongzhi, Wang Kai, Zhang Chao
- Proptest: Automatic Property Testing For Improved Visual Programming Koo Jaywon, Yang Ziyan, Cascante-bonilla Paola, Ray Baishakhi, Ordonez Vicente
- Large Language Model Driven Recommendation Korikov Anton, Sanner Scott, Deldjoo Yashar, He Zhankui, Mcauley Julian, Ramisa Arnau, Vidal Rene, Sathiamoorthy Mahesh, Kasrizadeh Atoosa, Milano Silvia, Ricci Francesco
- Transllama: Llm-based Simultaneous Translation System Koshkin Roman, Sudoh Katsuhito, Nakamura Satoshi
- Uncertainty-aware Evaluation For Vision-language Models Kostumov Vasily, Nutfullin Bulat, Pilipenko Oleg, Ilyushin Eugene
- Hints-in-browser: Benchmarking Language Models For Programming Feedback Generation Kotalwar Nachiket, Gotovos Alkis, Singla Adish
- Arabiangpt: Native Arabic Gpt-based Large Language Model Koubaa Anis, Ammar Adel, Ghouti Lahouari, Najar Omar, Sibaee Serry
- Evaluating Language Models For Generating And Judging Programming Feedback Koutcheme Charles, Dainese Nicola, Hellas Arto, Sarsa Sami, Leinonen Juho, Ashraf Syed, Denny Paul
- Open Source Language Models Can Provide Feedback: Evaluating Llms' Ability To Help Students Using Gpt-4-as-a-judge Koutcheme Charles, Dainese Nicola, Sarsa Sami, Hellas Arto, Leinonen Juho, Denny Paul
- From Data To Commonsense Reasoning: The Use Of Large Language Models For Explainable AI Krause Stefanie, Stolzenburg Frieder
- Can Large Language Models Explore In-context? Krishnamurthy Akshay, Harris Keegan, Foster Dylan J., Zhang Cyril, Slivkins Aleksandrs
- Reinforcement Learning For Optimizing RAG For Domain Chatbots Kulkarni Mandar, Tangarajan Praveen, Kim Kyung, Trivedi Anusua
- Confidence Under The Hood: An Investigation Into The Confidence-probability Alignment In Large Language Models Kumar Abhishek, Morabito Robert, Umbet Sanzhar, Kabbara Jad, Emami Ali
- Mt-eval: A Multi-turn Capabilities Evaluation Benchmark For Large Language Models Kwan Wai-chung, Zeng Xingshan, Jiang Yuxin, Wang Yufei, Li Liangyou, Shang Lifeng, Jiang Xin, Liu Qun, Wong Kam-fai
- Evaluating Cultural Adaptability Of A Large Language Model Via Simulation Of Synthetic Personas Kwok Louis, Bravansky Michal, Griffin Lewis D.
- Are Llms Effective Negotiators? Systematic Evaluation Of The Multifaceted Capabilities Of Llms In Negotiation Dialogues Kwon Deuksin, Weiss Emily, Kulshrestha Tara, Chawla Kushal, Lucas Gale M., Gratch Jonathan
- BIPED: Pedagogically Informed Tutoring System For ESL Education Kwon Soonwoo, Kim Sojung, Park Minju, Lee Seunghyun, Kim Kyuseok
- RES-Q: Evaluating Code-editing Large Language Model Systems At The Repository Scale Labash Beck, Rosedale August, Reents Alex, Negritto Lucas, Wiel Colin
- Autowebglm: Bootstrap And Reinforce A Large Language Model-based Web Navigating Agent Lai Hanyu, Liu Xiao, Iong Iat Long, Yao Shuntian, Chen Yuxuan, Shen Pengbo, Yu Hao, Zhang Hanchen, Zhang Xiaohan, Dong Yuxiao, Tang Jie
- Alarm: Align Language Models Via Hierarchical Rewards Modeling Lai Yuhang, Wang Siyuan, Liu Shujun, Huang Xuanjing, Wei Zhongyu
- Lapdoc: Layout-aware Prompting For Documents Lamott Marcel, Weweler Yves-noel, Ulges Adrian, Shafait Faisal, Krechel Dirk, Obradovic Darko
- FOCUS: Forging Originality Through Contrastive Use In Self-plagiarism For Language Models Lan Kaixin, Fang Tao, Wong Derek F., Xu Yabo, Chao Lidia S., Zhao Cecilia G.
- Criticeval: Evaluating Large Language Model As Critic Lan Tian, Zhang Wenwei, Xu Chen, Huang Heyan, Lin Dahua, Chen Kai, Mao Xian-ling
- Query-opt: Optimizing Inference Of Large Language Models Via Multi-query Instructions In Meeting Summarization Laskar Md Tahmid Rahman, Khasanova Elena, Fu Xue-yong, Chen Cheng, Tn Shashi Bhushan
- Benchmarks Underestimate The Readiness Of Multi-lingual Dialogue Agents Lee Andrew H., Semnani Sina J., Castillo-lópez Galo, De Chalendar Gäel, Choudhury Monojit, Dua Ashna, Kavitha Kapil Rajesh, Kim Sungkyun, Kodali Prashant, Kumaraguru Ponnurangam, Lombard Alexis, Moradshahi Mehrad, Park Gihyun, Semmar Nasredine, Seo Jiwon, Shen Tianhao, Shrivastava Manish, Xiong Deyi, Lam Monica S.
- Trol: Traversal Of Layers For Large Language And Vision Models Lee Byung-kwan, Chung Sangyun, Kim Chae Won, Park Beomchan, Ro Yong Man
- A Survey Of Large Language Models In Finance (finllms) Lee Jean, Stevens Nicholas, Han Soyeon Caren, Song Minseok
- Plagbench: Exploring The Duality Of Large Language Models In Plagiarism Generation And Detection Lee Jooyoung, Agrawal Toshini, Uchendu Adaku, Le Thai, Chen Jinghui, Lee Dongwon
- Token-supervised Value Models For Enhancing Mathematical Reasoning Capabilities Of Large Language Models Lee Jung Hyun, Yang June Yong, Heo Byeongho, Han Dongyoon, Yoo Kang Min
- Probability Of Differentiation Reveals Brittleness Of Homogeneity Bias In Large Language Models Lee Messi H. J., Lai Calvin K.
- Prompting Large Language Models For Zero-shot Essay Scoring Via Multi-trait Specialization Lee Sanwoo, Cai Yida, Meng Desong, Wang Ziyang, Wu Yunfang
- Effective And Efficient Conversation Retrieval For Dialogue State Tracking With Implicit Text Summaries Lee Seanie, Cheng Jianpeng, Driesen Joris, Coca Alexandru, Johannsen Anders
- Aligning To Thousands Of Preferences Via System Message Generalization Lee Seongyun, Park Sue Hyun, Kim Seungone, Seo Minjoon
- COCOA: Cbt-based Conversational Counseling Agent Using Memory Specialized In Cognitive Distortions And Dynamic Prompt Lee Suyeon, Kang Jieun, Kim Harim, Chung Kyoung-mee, Lee Dongha, Yeo Jinyoung
- Llava-docent: Instruction Tuning With Multimodal Large Language Model To Support Art Appreciation Education Lee Unggi, Jeon Minji, Lee Yunseo, Byun Gyuri, Son Yoorim, Shin Jaeyoon, Ko Hongkyu, Kim Hyeoncheol
- Large Language Models Produce Responses Perceived To Be Empathic Lee Yoon Kyung, Suh Jina, Zhan Hongli, Li Junyi Jessy, Ong Desmond C.
- Towards Understanding Counseling Conversations: Domain Knowledge And Large Language Models Lee Younghun, Goldwasser Dan, Reese Laura Schwab
- Chatgpt As Research Scientist: Probing Gpt's Capabilities As A Research Librarian, Research Ethicist, Data Generator And Data Predictor Lehr Steven A., Caliskan Aylin, Liyanage Suneragiri, Banaji Mahzarin R.
- Autocoder: Enhancing Code Large Language Model With AIEV-Instruct Lei Bin, Li Yuchen, Chen Qiuwu
- Using Hallucinations To Bypass Gpt4's Filter Lemkin Benjamin
- Using Counterfactual Tasks To Evaluate The Generality Of Analogical Reasoning In Large Language Models Lewis Martha, Mitchell Melanie
- Cascade Reward Sampling For Efficient Decoding-time Alignment Li Bolian, Wang Yifan, Grama Ananth, Zhang Ruqi
- X-instruction: Aligning Language Model In Low-resource Languages With Self-curated Cross-lingual Instructions Li Chong, Yang Wen, Zhang Jiajun, Lu Jinliang, Wang Shaonan, Zong Chengqing
- Incorporating External Knowledge And Goal Guidance For Llm-based Conversational Recommender Systems Li Chuang, Deng Yang, Hu Hengchang, Kan Min-yen, Li Haizhou
- Searchlvlms: A Plug-and-play Framework For Augmenting Large Vision-language Models By Searching Up-to-date Internet Knowledge Li Chuanhao, Li Zhen, Jing Chenchen, Liu Shuo, Shao Wenqi, Wu Yuwei, Luo Ping, Qiao Yu, Zhang Kaipeng
- Humaneval On Latest GPT Models -- 2024 Li Daniel, Murr Lincoln
- "Is Chatgpt A Better Explainer Than My Professor?": Evaluating The Explanation Capabilities Of Llms In Conversation Compared To A Human Baseline Li Grace, Alshomary Milad, Muresan Smaranda
- Enhance Lifelong Model Editing With Continuous Data-adapter Association Li Jiaang, Wang Quan, Wang Zhongnan, Zhang Yongdong, Mao Zhendong
- Know The Unknown: An Uncertainty-sensitive Method For LLM Instruction Tuning Li Jiaqi, Tang Yixuan, Yang Yi
- Perteval: Unveiling Real Knowledge Capacity Of Llms With Knowledge-invariant Perturbations Li Jiatong, Hu Renjun, Huang Kunzhe, Zhuang Yan, Liu Qi, Zhu Mengxiao, Shi Xing, Lin Wei
- A Cross-language Investigation Into Jailbreak Attacks In Large Language Models Li Jie, Liu Yi, Liu Chongyang, Shi Ling, Ren Xiaoning, Zheng Yaowen, Liu Yang, Xue Yinxing
- Experimenting With Legal AI Solutions: The Case Of Question-answering For Access To Justice Li Jonathan, Bhambhoria Rohan, Dahan Samuel, Zhu Xiaodan
- Dissecting Human And LLM Preferences Li Junlong, Zhou Fan, Sun Shichao, Zhang Yikai, Zhao Hai, Liu Pengfei
- Measuring And Controlling Instruction (in)stability In Language Model Dialogs Li Kenneth, Liu Tianle, Bashkansky Naomi, Bau David, Viégas Fernanda, Pfister Hanspeter, Wattenberg Martin
- Dialogue Action Tokens: Steering Language Models In Goal-directed Dialogue With A Multi-turn Planner Li Kenneth, Wang Yiming, Viégas Fernanda, Wattenberg Martin
- Red Teaming Visual Language Models Li Mukai, Li Lei, Yin Yuwei, Ahmed Masood, Liu Zhenguang, Liu Qi
- From Text To Insight: Leveraging Large Language Models For Performance Evaluation In Management Li Ning, Zhou Huaikang, Xu Mingze
- Anchored Answers: Unravelling Positional Bias In Gpt-2's Multiple-choice Questions Li Ruizhe, Gao Yanjun
- IQA-EVAL: Automatic Evaluation Of Human-model Interactive Question Answering Li Ruosen, Wang Barry, Li Ruochen, Du Xinya
- Prompt4vis: Prompting Large Language Models With Example Mining And Schema Filtering For Tabular Data Visualization Li Shuaimin, Chen Xuanang, Song Yuanfeng, Song Yunze, Zhang Chen
- Improving Faithfulness Of Large Language Models In Summarization Via Sliding Generation And Self-consistency Li Taiji, Li Zhi, Zhang Yin
- TOPA: Extend Large Language Models For Video Understanding Via Text-only Pre-alignment Li Wei, Fan Hehe, Wong Yongkang, Kankanhalli Mohan, Yang Yi
- Stbench: Assessing The Ability Of Large Language Models In Spatio-temporal Analysis Li Wenbin, Yao Di, Zhao Ruibo, Chen Wenjie, Xu Zijie, Luo Chengxue, Gong Chang, Jing Quanliang, Tan Haining, Bi Jingping
- What If We Recaption Billions Of Web Images With Llama-3? Li Xianhang, Tu Haoqin, Hui Mude, Wang Zeyu, Zhao Bingchen, Xiao Junfei, Ren Sucheng, Mei Jieru, Liu Qing, Zheng Huangjie, Zhou Yuyin, Xie Cihang
- Preference Tuning For Toxicity Mitigation Generalizes Across Languages Li Xiaochen, Yong Zheng-xin, Bach Stephen H.
- Evaluating Mathematical Reasoning Of Large Language Models: A Focus On Error Identification And Correction Li Xiaoyuan, Wang Wenjie, Li Moxin, Guo Junrong, Zhang Yang, Feng Fuli
- Personalized Language Modeling From Personalized Human Feedback Li Xinyu, Lipton Zachary C., Leqi Liu
- Rethinking Chatgpt's Success: Usability And Cognitive Behaviors Enabled By Auto-regressive Llms' Prompting Li Xinzhe, Liu Ming
- Reason From Fallacy: Enhancing Large Language Models' Logical Reasoning Through Logical Fallacy Understanding Li Yanda, Wang Dixuan, Liang Jiaqing, Jiang Guochao, He Qianyu, Xiao Yanghua, Yang Deqing
- Mini-gemini: Mining The Potential Of Multi-modality Vision Language Models Li Yanwei, Zhang Yuechen, Wang Chengyao, Zhong Zhisheng, Chen Yixin, Chu Ruihang, Liu Shaoteng, Jia Jiaya
- Making Long-context Language Models Better Multi-hop Reasoners Li Yanyang, Liang Shuo, Lyu Michael R., Wang Liwei
- Guiding Enumerative Program Synthesis With Large Language Models Li Yixuan, Parsert Julian, Polgreen Elizabeth
- Mateval: A Multi-agent Discussion Framework For Advancing Open-ended Text Evaluation Li Yu, Zhang Shenyu, Wu Rui, Huang Xiutian, Chen Yongrui, Xu Wenhao, Qi Guilin, Min Dehai
- EAGLE: Speculative Sampling Requires Rethinking Feature Uncertainty Li Yuhui, Wei Fangyun, Zhang Chao, Zhang Hongyang
- Mmsci: A Multimodal Multi-discipline Dataset For Phd-level Scientific Comprehension Li Zekun, Yang Xianjun, Choi Kyuri, Zhu Wanrong, Hsieh Ryan, Kim Hyeonjung, Lim Jin Hyuk, Ji Sungyoung, Lee Byungju, Yan Xifeng, Petzold Linda Ruth, Wilson Stephen D., Lim Woosang, Wang William Yang
- Amphista: Accelerate LLM Inference With Bi-directional Multiple Drafting Heads In A Non-autoregressive Style Li Zeping, Yang Xinlong, Gao Ziheng, Liu Ji, Liu Zhuang, Li Dong, Peng Jinzhang, Tian Lu, Barsoum Emad
- EAGLE: Elevating Geometric Reasoning Through Llm-empowered Visual Instruction Tuning Li Zhihao, Du Yao, Liu Yang, Zhang Yan, Liu Yufang, Zhang Mengdi, Cai Xunliang
- Visualization Literacy Of Multimodal Large Language Models: A Comparative Study Li Zhimin, Miao Haichao, Pascucci Valerio, Liu Shusen
- Self-instructed Derived Prompt Generation Meets In-context Learning: Unlocking New Potential Of Black-box Llms Li Zhuo, Du Yuhao, Hu Jinpeng, Wan Xiang, Gao Anningzhe
- Effects Of Different Prompts On The Quality Of GPT-4 Responses To Dementia Care Questions Li Zhuochun, Xie Bo, Hilsabeck Robin, Aguirre Alyssa, Zou Ning, Luo Zhimeng, He Daqing
- Retrieval Augmented Generation Or Long-context Llms? A Comprehensive Study And Hybrid Approach Li Zhuowan, Li Cheng, Zhang Mingyang, Mei Qiaozhu, Bendersky Michael
- Vl-trojan: Multimodal Instruction Backdoor Attacks Against Autoregressive Visual Language Models Liang Jiawei, Liang Siyuan, Luo Man, Liu Aishan, Han Dongchen, Chang Ee-chien, Cao Xiaochun
- Bytecomposer: A Human-like Melody Composition Method Based On Language Model Agent Liang Xia, Du Xingjian, Lin Jiaju, Zou Pei, Wan Yuan, Zhu Bilei
- Fennec: Fine-grained Language Model Evaluation And Correction Extended Through Branching And Bridging Liang Xiaobo, Zhang Haoke, Hu Helan, Li Juntao, Xu Jun, Zhang Min
- Why Are My Prompts Leaked? Unraveling Prompt Extraction Threats In Customized Large Language Models Liang Zi, Hu Haibo, Ye Qingqing, Xiao Yaxin, Li Haoyang
- Look Before You Leap: Problem Elaboration Prompting Improves Mathematical Reasoning In Large Language Models Liao Haoran, Tian Jidong, Hu Shaohua, He Hao, Jin Yaohui
- MARIO: Math Reasoning With Code Interpreter Output -- A Reproducible Pipeline Liao Minpeng, Luo Wei, Li Chengxi, Wu Jing, Fan Kai
- Learning-from-mistakes Prompting For Indigenous Language Translation Liao You-cheng, Yu Chen-jui, Lin Chi-yi, Yun He-feng, Wang Yen-hsiang, Li Hsiao-min, Fan Yao-chung
- MING-MOE: Enhancing Medical Multi-task Learning In Large Language Models With Sparse Mixture Of Low-rank Adapter Experts Liao Yusheng, Jiang Shuyang, Wang Yu, Wang Yanfeng
- D2LLM: Decomposed And Distilled Large Language Models For Semantic Search Liao Zihan, Yu Hang, Li Jianguo, Wang Jun, Zhang Wei
- The Power Of Words: Generating Powershell Attacks From Natural Language Liguori Pietro, Marescalco Christian, Natella Roberto, Orbinato Vittorio, Pianese Luciano
- Retrieval-augmented Natural Language Reasoning For Explainable Visual Question Answering Lim Su Hyeon, Kim Minkuk, Kim Hyeon Bae, Kim Seong Tae
- Wildbench: Benchmarking Llms With Challenging Tasks From Real Users In The Wild Lin Bill Yuchen, Deng Yuntian, Chandu Khyathi, Brahman Faeze, Ravichander Abhilasha, Pyatkin Valentina, Dziri Nouha, Bras Ronan Le, Choi Yejin
- Fine-tuned Network Relies On Generic Representation To Solve Unseen Cognitive Task Lin Dongyan
- Probing Large Language Models For Scalar Adjective Lexical Semantics And Scalar Diversity Pragmatics Lin Fangru, Altshuler Daniel, Pierrehumbert Janet B.
- Graph-enhanced Large Language Models In Asynchronous Plan Reasoning Lin Fangru, La Malfa Emanuele, Hofmann Valentin, Yang Elle Michelle, Cohn Anthony, Pierrehumbert Janet B.
- Bita: Bi-directional Tuning For Lossless Acceleration In Large Language Models Lin Feng, Yi Hanling, Li Hongbin, Yang Yifan, Yu Xiaotian, Lu Guangming, Xiao Rong
- Can Llms Understand The Implication Of Emphasized Sentences In Dialogue? Lin Guan-ting, Lee Hung-yi
- Emojicrypt: Prompt Encryption For Secure Communication With Large Language Models Lin Guo, Hua Wenyue, Zhang Yongfeng
- Designprobe: A Graphic Design Benchmark For Multimodal Large Language Models Lin Jieru, Huang Danqing, Zhao Tiejun, Zhan Dechen, Lin Chin-yew
- How Can I Get It Right? Using GPT To Rephrase Incorrect Trainee Responses Lin Jionghao, Han Zifei, Thomas Danielle R., Gurung Ashish, Gupta Shivang, Aleven Vincent, Koedinger Kenneth R.
- Improving Visual Storytelling With Multimodal Large Language Models Lin Xiaochuan, Chen Xiangyong
- Diversedialogue: A Methodology For Designing Chatbots With Human-like Diversity Lin Xiaoyu, Yu Xinkai, Aich Ankit, Giorgi Salvatore, Ungar Lyle
- Interpretable User Satisfaction Estimation For Conversational Systems With Large Language Models Lin Ying-chun, Neville Jennifer, Stokes Jack W., Yang Longqi, Safavi Tara, Wan Mengting, Counts Scott, Suri Siddharth, Andersen Reid, Xu Xiaofeng, Gupta Deepak, Jauhar Sujay Kumar, Song Xia, Buscher Georg, Tiwary Saurabh, Hecht Brent, Teevan Jaime
- Evaluating Text-to-visual Generation With Image-to-text Generation Lin Zhiqiu, Pathak Deepak, Li Baiqi, Li Jiayao, Xia Xide, Neubig Graham, Zhang Pengchuan, Ramanan Deva
- The Colorful Future Of Llms: Evaluating And Improving Llms As Emotional Supporters For Queer Youth Lissak Shir, Calderon Nitay, Shenkman Geva, Ophir Yaakov, Fruchter Eyal, Klomek Anat Brunstein, Reichart Roi
- Llmembed: Rethinking Lightweight Llm's Genuine Function In Text Classification Liu Chun, Zhang Hongguang, Zhao Kainan, Ju Xinghai, Yang Lin
- Lumina-mgpt: Illuminate Flexible Photorealistic Text-to-image Generation With Multimodal Generative Pretraining Liu Dongyang, Zhao Shitian, Zhuo Le, Lin Weifeng, Qiao Yu, Li Hongsheng, Gao Peng
- An Incomplete Loop: Instruction Inference, Instruction Following, And In-context Learning In Language Models Liu Emmy, Neubig Graham, Andreas Jacob
- Pefomed: Parameter Efficient Fine-tuning Of Multimodal Large Language Models For Medical Imaging Liu Gang, He Jinlong, Li Pengfei, He Genrong, Chen Zhaolin, Zhong Shenjun
- Mmgrec: Multimodal Generative Recommendation With Transformer Model Liu Han, Wei Yinwei, Song Xuemeng, Guan Weili, Li Yuan-fang, Nie Liqiang
- Augmenting Math Word Problems Via Iterative Question Composing Liu Haoxiong, Zhang Yifan, Luo Yifan, Yao Andrew Chi-chih
- Laida: Linguistics-aware In-context Learning With Data Augmentation For Metaphor Components Identification Liu Hongde, He Chenyuan, Meng Feiyang, Niu Changyong, Jia Yuxiang
- Tiny Refinements Elicit Resilience: Toward Efficient Prefix-model Against LLM Red-teaming Liu Jiaxu, Yin Xiangyu, Wu Sihao, Wang Jianhong, Fang Meng, Yi Xinping, Huang Xiaowei
- Iterative Length-regularized Direct Preference Optimization: A Case Study On Improving 7B Language Models To GPT-4 Level Liu Jie, Zhou Zhanhui, Liu Jiaheng, Bu Xingyuan, Yang Chao, Zhong Han-sen, Ouyang Wanli
- Mm-instruct: Generated Visual Instructions For Large Multimodal Model Alignment Liu Jihao, Huang Xin, Zheng Jinliang, Liu Boxiao, Wang Jia, Yoshie Osamu, Liu Yu, Li Hongsheng
- Visualwebbench: How Far Have Multimodal Llms Evolved In Web Page Understanding And Grounding? Liu Junpeng, Song Yifan, Lin Bill Yuchen, Lam Wai, Neubig Graham, Li Yuanzhi, Yue Xiang
- Enhancing Llm's Cognition Via Structurization Liu Kai, Fu Zhihang, Chen Chao, Zhang Wei, Jiang Rongxin, Zhou Fan, Chen Yaowu, Wu Yue, Ye Jieping
- Selectit: Selective Instruction Tuning For Large Language Models Via Uncertainty-aware Self-reflection Liu Liangxin, Liu Xuebo, Wong Derek F., Li Dongfang, Wang Ziyi, Hu Baotian, Zhang Min
- When Chatgpt Is Gone: Creativity Reverts And Homogeneity Persists Liu Qinghan, Zhou Yiyong, Huang Jihao, Li Guiquan
- Generative Expressive Conversational Speech Synthesis Liu Rui, Hu Yifan, Ren Yi, Yin Xiang, Li Haizhou
- ST-LLM: Large Language Models Are Effective Temporal Learners Liu Ruyang, Li Chen, Tang Haoran, Ge Yixiao, Shan Ying, Li Ge
- RL-GPT: Integrating Reinforcement Learning And Code-as-policy Liu Shaoteng, Yuan Haoqi, Hu Minda, Li Yanwei, Chen Yukang, Liu Shu, Lu Zongqing, Jia Jiaya
- SPINACH: Sparql-based Information Navigation For Challenging Real-world Questions Liu Shicheng, Semnani Sina J., Triedman Harold, Xu Jialiang, Zhao Isaac Dan, Lam Monica S.
- Making Them Ask And Answer: Jailbreaking Large Language Models In Few Queries Via Disguise And Reconstruction Liu Tong, Zhang Yingjie, Zhao Zhe, Dong Yinpeng, Meng Guozhu, Chen Kai
- Toolace: Winning The Points Of LLM Function Calling Liu Weiwen, Huang Xu, Zeng Xingshan, Hao Xinlong, Yu Shuai, Li Dexun, Wang Shuai, Gan Weinan, Liu Zhengying, Yu Yuanqing, Wang Zezhong, Wang Yuxian, Ning Wu, Hou Yutai, Wang Bin, Wu Chuhan, Wang Xinzhi, Liu Yong, Wang Yasheng, Tang Duyu, Tu Dandan, Shang Lifeng, Jiang Xin, Tang Ruiming, Lian Defu, Liu Qun, Chen Enhong
- Imposter.ai: Adversarial Attacks With Hidden Intentions Towards Aligned Large Language Models Liu Xiao, Li Liangzhi, Xiang Tong, Ye Fuying, Wei Lu, Li Wangyue, Garcia Noa
- An Empirical Analysis On Large Language Models In Debate Evaluation Liu Xinyi, Liu Pinxin, He Hangfeng
- Adaptive Draft-verification For Efficient Large Language Model Decoding Liu Xukun, Lei Bowen, Zhang Ruqi, Xu Dongkuan
- Omgeval: An Open Multilingual Generative Evaluation Benchmark For Large Language Models Liu Yang, Xu Meng, Wang Shuo, Yang Liner, Wang Haoyu, Liu Zhenghao, Kong Cunliang, Chen Yun, Liu Yang, Sun Maosong, Yang Erhong
- Era-cot: Improving Chain-of-thought Through Entity Relationship Analysis Liu Yanming, Peng Xinyue, Du Tianyu, Yin Jianwei, Liu Weihao, Zhang Xuhong
- RA-ISF: Learning To Answer And Understand From Retrieval Augmentation Via Iterative Self-feedback Liu Yanming, Peng Xinyue, Zhang Xuhong, Liu Weihao, Yin Jianwei, Cao Jiannan, Du Tianyu
- Arondight: Red Teaming Large Vision Language Models With Auto-generated Multi-modal Jailbreak Prompts Liu Yi, Cai Chengjun, Zhang Xiaoli, Yuan Xingliang, Wang Cong
- Understanding Llms: A Comprehensive Overview From Training To Inference Liu Yiheng, He Hao, Han Tianle, Zhang Xu, Liu Mengyuan, Tian Jiaming, Zhang Yutong, Wang Jiaqi, Gao Xiaohui, Zhong Tianyang, Pan Yi, Xu Shaochen, Wu Zihao, Liu Zhengliang, Zhang Xin, Zhang Shu, Hu Xintao, Zhang Tuo, Qiang Ning, Liu Tianming, Ge Bao
- DELTA: Decomposed Efficient Long-term Robot Task Planning Using Large Language Models Liu Yuchen, Palmieri Luigi, Koch Sebastian, Georgievski Ilche, Aiello Marco
- Synthvlm: High-efficiency And High-quality Synthetic Data For Vision Language Models Liu Zheng, Liang Hao, Huang Xijie, Xiong Wentao, Yu Qinhan, Sun Linzhuang, Chen Chong, He Conghui, Cui Bin, Zhang Wentao
- Scaffolding Language Learning Via Multi-modal Tutoring Systems With Pedagogical Instructions Liu Zhengyuan, Yin Stella Xin, Lee Carolyn, Chen Nancy F.
- Chatqa: Surpassing GPT-4 On Conversational QA And RAG Liu Zihan, Ping Wei, Roy Rajarshi, Xu Peng, Lee Chankyu, Shoeybi Mohammad, Catanzaro Bryan
- Enabling Weak Llms To Judge Response Reliability Via Meta Ranking Liu Zijun, Kou Boqun, Li Peng, Yan Ming, Zhang Ji, Huang Fei, Liu Yang
- Llms Are Biased Towards Output Formats! Systematically Evaluating And Mitigating Output Format Bias Of Llms Long Do Xuan, Ngoc Hai Nguyen, Sim Tiviatis, Dao Hieu, Joty Shafiq, Kawaguchi Kenji, Chen Nancy F., Kan Min-yen
- Evaluating Large Language Models In Analysing Classroom Dialogue Long Yun, Luo Haifeng, Zhang Yu
- Sparser Is Faster And Less Is More: Efficient Sparse Attention For Long-range Transformers Lou Chao, Jia Zixia, Zheng Zilong, Tu Kewei
- Roleplay-doh: Enabling Domain-experts To Create Llm-simulated Patients Via Eliciting And Adhering To Principles Louie Ryan, Nandi Ananjan, Fang William, Chang Cheng, Brunskill Emma, Yang Diyi
- Answering Real-world Clinical Questions Using Large Language Model Based Systems Low Yen Sia, Jackson Michael L., Hyde Rebecca J., Brown Robert E., Sanghavi Neil M., Baldwin Julian D., Pike C. William, Muralidharan Jananee, Hui Gavin, Alexander Natasha, Hassan Hadeel, Nene Rahul V., Pike Morgan, Pokrzywa Courtney J., Vedak Shivam, Yan Adam Paul, Yao Dong-han, Zipursky Amy R., Dinh Christina, Ballentine Philip, Derieg Dan C., Polony Vladimir, Chawdry Rehan N., Davies Jordan, Hyde Brigham B., Shah Nigam H., Gombar Saurabh
- Efficient Encoder-decoder Transformer Decoding For Decomposable Tasks Lu Bo-ru, Haduong Nikita, Lin Chien-yu, Cheng Hao, Smith Noah A., Ostendorf Mari
- From GPT-4 To Gemini And Beyond: Assessing The Landscape Of Mllms On Generalizability, Trustworthiness And Causality Through Four Modalities Lu Chaochao, Qian Chen, Zheng Guodong, Fan Hongxing, Gao Hongzhi, Zhang Jie, Shao Jing, Deng Jingyi, Fu Jinlan, Huang Kexin, Li Kunchang, Li Lijun, Wang Limin, Sheng Lu, Chen Meiqi, Zhang Ming, Ren Qibing, Chen Sirui, Gui Tao, Ouyang Wanli, Wang Yali, Teng Yan, Wang Yaru, Wang Yi, He Yinan, Wang Yingchun, Wang Yixu, Zhang Yongting, Qiao Yu, Shen Yujiong, Mou Yurong, Chen Yuxi, Zhang Zaibin, Shi Zhelun, Yin Zhenfei, Wang Zhipin
- Investigating Bias Representations In Llama 2 Chat Via Activation Steering Lu Dawn, Rimsky Nina
- Revisiting Multi-modal LLM Evaluation Lu Jian, Srivastava Shikhar, Chen Junyu, Shrestha Robik, Acharya Manoj, Kafle Kushal, Kanan Christopher
- Padellm-ner: Parallel Decoding In Large Language Models For Named Entity Recognition Lu Jinghui, Yang Ziwei, Wang Yanjie, Liu Xuejing, Mac Namee Brian, Huang Can
- A Bounding Box Is Worth One Token: Interleaving Layout And Text In A Large Language Model For Document Understanding Lu Jinghui, Yu Haiyang, Wang Yanjie, Ye Yongjie, Tang Jingqun, Yang Ziwei, Wu Binghong, Liu Qi, Feng Hao, Wang Han, Liu Hao, Huang Can
- FIPO: Free-form Instruction-oriented Prompt Optimization With Preference Dataset And Modular Fine-tuning Schema Lu Junru, An Siyu, Zhang Min, He Yulan, Yin Di, Sun Xing
- Blending Is All You Need: Cheaper, Better Alternative To Trillion-parameters LLM Lu Xiaoding, Liu Zongyi, Liusie Adian, Raina Vyas, Mudupalli Vineet, Zhang Yuwen, Beauchamp William
- Generative Students: Using Llm-simulated Student Profiles To Support Question Item Evaluation Lu Xinyi, Wang Xu
- Wildvision: Evaluating Vision-language Models In The Wild With Human Preferences Lu Yujie, Jiang Dongfu, Chen Wenhu, Wang William Yang, Choi Yejin, Lin Bill Yuchen
- Mathgenie: Generating Synthetic Data With Question Back-translation For Enhancing Mathematical Reasoning Of Llms Lu Zimu, Zhou Aojun, Ren Houxing, Wang Ke, Shi Weikang, Pan Junting, Zhan Mingjie, Li Hongsheng
- To Err Is Human, But Llamas Can Learn It Too Luhtaru Agnes, Purason Taido, Vainikko Martin, Del Maksym, Fishel Mark
- Can Perplexity Predict Fine-tuning Performance? An Investigation Of Tokenization Effects On Sequential Language Models For Nepali Luitel Nishant, Bekoju Nirajan, Sah Anand Kumar, Shakya Subarna
- Metric-aware LLM Inference For Regression And Scoring Lukasik Michal, Narasimhan Harikrishna, Menon Aditya Krishna, Yu Felix, Kumar Sanjiv
- Crosstune: Black-box Few-shot Classification With Label Enhancement Luo Danqing, Zhang Chen, Zhang Yan, Li Haizhou
- Kuaiji: The First Chinese Accounting Large Language Model Luo Jiayuan, Yang Songhua, Qiu Xiaoling, Chen Panyu, Nai Yufei, Zeng Wenxuan, Zhang Wentao, Jiang Xinke
- Hallucination Detection And Hallucination Mitigation: An Investigation Luo Junliang, Li Tianyu, Wu Di, Jenkin Michael, Liu Steve, Dudek Gregory
- End-to-end Neuro-symbolic Reinforcement Learning With Textual Explanations Luo Lirui, Zhang Guoxi, Xu Hongming, Yang Yaodong, Fang Cong, Li Qing
- Assessing Empathy In Large Language Models With Real-world Physician-patient Interactions Luo Man, Warren Christopher J., Cheng Lu, Abdul-muhsin Haidar M., Banerjee Imon
- M\(^3\)GPT: An Advanced Multimodal, Multitask Framework For Motion Comprehension And Generation Luo Mingshuang, Hou Ruibing, Chang Hong, Liu Zimo, Wang Yaowei, Shan Shiguang
- Matching Domain Experts By Training From Scratch On Domain Knowledge Luo Xiaoliang, Sun Guangzhi, Love Bradley C.
- Ensuring Safe And High-quality Outputs: A Guideline Library Approach For Language Models Luo Yi, Lin Zhenghao, Zhang Yuhao, Sun Jiashuo, Lin Chen, Xu Chengjin, Su Xiangdong, Shen Yelong, Guo Jian, Gong Yeyun
- Are Large Language Models True Healthcare Jacks-of-all-trades? Benchmarking Across Health Professions Beyond Physician Exams Luo Zheheng, Yuan Chenhan, Xie Qianqian, Ananiadou Sophia
- Interpreting Key Mechanisms Of Factual Recall In Transformer-based Language Models Lv Ang, Chen Yuhan, Zhang Kaiyi, Wang Yulong, Liu Lifeng, Wen Ji-rong, Xie Jian, Yan Rui
- Coggpt: Unleashing The Power Of Cognitive Dynamics On Large Language Models Lv Yaojia, Pan Haojie, Fu Ruiji, Liu Ming, Wang Zhongyuan, Qin Bing
- GPT-4 Generated Narratives Of Life Events Using A Structured Narrative Prompt: A Validation Study Lynch Christopher J., Jensen Erik, Munro Madison H., Zamponi Virginia, Martinez Joseph, O'brien Kevin, Feldhaus Brandon, Smith Katherine, Reinhold Ann Marie, Gore Ross
- Keeping Llms Aligned After Fine-tuning: The Crucial Role Of Prompt Templates Lyu Kaifeng, Zhao Haoyu, Gu Xinran, Yu Dingli, Goyal Anirudh, Arora Sanjeev
- Automatic Summarization Of Doctor-patient Encounter Dialogues Using Large Language Model Through Prompt Tuning Lyu Mengxian, Peng Cheng, Li Xiaohan, Balian Patrick, Bian Jiang, Wu Yonghui
- Weblinx: Real-world Website Navigation With Multi-turn Dialogue Lù Xing Han, Kasner Zdeněk, Reddy Siva
- Enhancing Programming Education With Chatgpt: A Case Study On Student Perceptions And Interactions In A Python Course Ma Boxuan, Chen Li, Konomi Shin'ichi
- FBI-LLM: Scaling Up Fully Binarized Llms From Scratch Via Autoregressive Distillation Ma Liqun, Sun Mingjie, Shen Zhiqiang
- Making Pre-trained Language Models Better Continual Few-shot Relation Extractors Ma Shengkun, Han Jiale, Liang Yi, Cheng Bo
- Aligned With LLM: A New Multi-modal Training Paradigm For Encoding Fmri Activity In Visual Cortex Ma Shuxiao, Wang Linyuan, Hou Senbao, Yan Bin
- Sciagent: Tool-augmented Language Models For Scientific Reasoning Ma Yubo, Gou Zhibin, Hao Junheng, Xu Ruochen, Wang Shuohang, Pan Liangming, Yang Yujiu, Cao Yixin, Sun Aixin, Awadalla Hany, Chen Weizhu
- Mmlongbench-doc: Benchmarking Long-context Document Understanding With Visualizations Ma Yubo, Zang Yuhang, Chen Liangyu, Chen Meiqi, Jiao Yizhu, Li Xinze, Lu Xinyuan, Liu Ziyu, Ma Yan, Dong Xiaoyi, Zhang Pan, Pan Liangming, Jiang Yu-gang, Wang Jiaqi, Cao Yixin, Sun Aixin
- Llamoco: Instruction Tuning Of Large Language Models For Optimization Code Generation Ma Zeyuan, Guo Hongshu, Chen Jiacheng, Peng Guojun, Cao Zhiguang, Ma Yining, Gong Yue-jiao
- Gerea: Question-aware Prompt Captions For Knowledge-based Visual Question Answering Ma Ziyu, Li Shutao, Sun Bin, Cai Jianfei, Long Zuxiang, Ma Fuyan
- Explaining Genetic Programming Trees Using Large Language Models Maddigan Paula, Lensen Andrew, Xue Bing
- Do Llms Know When To NOT Answer? Investigating Abstention Abilities Of Large Language Models Madhusudhan Nishanth, Madhusudhan Sathwik Tejaswi, Yadav Vikas, Hashemi Masoud
- Beyond Generative Artificial Intelligence: Roadmap For Natural Language Generation Maestre María Miró, Martínez-murillo Iván, Martin Tania J., Navarro-colorado Borja, Ferrández Antonio, Cueto Armando Suárez, Lloret Elena
- Hallucination-free? Assessing The Reliability Of Leading AI Legal Research Tools Magesh Varun, Surani Faiz, Dahl Matthew, Suzgun Mirac, Manning Christopher D., Ho Daniel E.
- Leveraging Environment Interaction For Automated PDDL Generation And Planning With Large Language Models Mahdavi Sadegh, Aoki Raquel, Tang Keyi, Cao Yanshuai
- Too Late To Train, Too Early To Use? A Study On Necessity And Viability Of Low-resource Bengali Llms Mahfuz Tamzeed, Dey Satak Kumar, Naswan Ruwad, Adil Hasnaen, Sayeed Khondker Salman, Shahgir Haz Sameen
- VURF: A General-purpose Reasoning And Self-refinement Framework For Video Understanding Mahmood Ahmad, Vayani Ashmal, Naseer Muzammal, Khan Salman, Khan Fahad Shahbaz
- How Effective Is GPT-4 Turbo In Generating School-level Questions From Textbooks Based On Bloom's Revised Taxonomy? Maity Subhankar, Deroy Aniket, Sarkar Sudeshna
- A Novel Multi-stage Prompting Approach For Language Agnostic MCQ Generation Using GPT Maity Subhankar, Deroy Aniket, Sarkar Sudeshna
- Fairylandai: Personalized Fairy Tales Utilizing Chatgpt And DALLE-3 Makridis Georgios, Oikonomou Athanasios, Koukos Vasileios
- From Tarzan To Tolkien: Controlling The Language Proficiency Level Of Llms For Content Generation Malik Ali, Mayhew Stephen, Piech Chris, Bicknell Klinton
- IITK At Semeval-2024 Task 2: Exploring The Capabilities Of Llms For Safe Biomedical Natural Language Inference For Clinical Trials Mandal Shreyasi, Modi Ashutosh
- Can Large Language Models Automatically Score Proficiency Of Written Essays? Mansour Watheq, Albatarni Salam, Eltanbouly Sohaila, Elsayed Tamer
- A Novel Nuanced Conversation Evaluation Framework For Large Language Models In Mental Health Marrapese Alexander, Suleiman Basem, Ullah Imdad, Kim Juno
- Beware Of Words: Evaluating The Lexical Richness Of Conversational Large Language Models Martínez Gonzalo, Hernández José Alberto, Conde Javier, Reviriego Pedro, Merino Elena
- Using Large Language Models To Estimate Features Of Multi-word Expressions: Concreteness, Valence, Arousal Martínez Gonzalo, Molero Juan Diego, González Sandra, Conde Javier, Brysbaert Marc, Reviriego Pedro
- Test-driven Development For Code Generation Mathews Noble Saji, Nagappan Meiyappan
- Do Large Language Models Understand Verbal Indicators Of Romantic Attraction? Matz Sandra C., Peters Heinrich, Eastwick Paul W., Cerf Moran, Finkel Eli J.
- XAI For All: Can Large Language Models Simplify Explainable AI? Mavrepis Philip, Makridis Georgios, Fatouros Georgios, Koukos Vasileios, Separdani Maria Margarita, Kyriazis Dimosthenis
- Can Opensource Beat Chatgpt? -- A Comparative Study Of Large Language Models For Text-to-code Generation Mayer Luis, Heumann Christian, Aßenmacher Matthias
- LLM Critics Help Catch LLM Bugs Mcaleese Nat, Pokorny Rai Michael, Uribe Juan Felipe Ceron, Nitishinskaya Evgenia, Trebacz Maja, Leike Jan
- Generative Artificial Intelligence In Higher Education: Evidence From An Analysis Of Institutional Policies And Guidelines Mcdonald Nora, Johri Aditya, Ali Areej, Hingle Aayushi
- Injecting New Knowledge Into Large Language Models Via Supervised Fine-tuning Mecklenburg Nick, Lin Yiyou, Li Xiaoxiao, Holstein Daniel, Nunes Leonardo, Malvar Sara, Silva Bruno, Chandra Ranveer, Aski Vijay, Yannam Pavan Kumar Reddy, Aktas Tolga, Hendry Todd
- Enhancing Creativity In Large Language Models Through Associative Thinking Strategies Mehrotra Pronita, Parab Aishni, Gulwani Sumit
- Metacheckgpt -- A Multi-task Hallucination Detector Using LLM Uncertainty And Meta-models Mehta Rahul, Hoblitzell Andrew, O'keefe Jack, Jang Hyeju, Varma Vasudeva
- Towards Human Understanding Of Paraphrase Types In Chatgpt Meier Dominik, Wahle Jan Philip, Ruas Terry, Gipp Bela
- On The Benchmarking Of Llms For Open-domain Dialogue Evaluation Mendonça John, Lavie Alon, Trancoso Isabel
- Soda-eval: Open-domain Dialogue Evaluation In The Age Of Llms Mendonça John, Trancoso Isabel, Lavie Alon
- Ecoh: Turn-level Coherence Evaluation For Multilingual Dialogues Mendonça John, Trancoso Isabel, Lavie Alon
- Phybench: A Physical Commonsense Benchmark For Evaluating Text-to-image Models Meng Fanqing, Shao Wenqi, Luo Lixin, Wang Yahong, Chen Yiran, Lu Quanfeng, Yang Yue, Yang Tianshuo, Zhang Kaipeng, Qiao Yu, Luo Ping
- Logic-enhanced Language Model Agents For Trustworthy Social Simulations Mensfelt Agnieszka, Stathis Kostas, Trencsenyi Vince
- Low-resource Machine Translation Through Retrieval-augmented LLM Prompting: A Study On The Mambai Language Merx Raphaël, Mahmudi Aso, Langford Katrina, De Araujo Leo Alberto, Vylomova Ekaterina
- A Comparison Of LLM Finetuning Methods & Evaluation Metrics With Travel Chatbot Use Case Meyer Sonia, Singh Shreya, Tam Bertha, Ton Christopher, Ren Angel
- User Centric Evaluation Of Code Generation Tools Miah Tanha, Zhu Hong
- Is English The New Programming Language? How About Pseudo-code Engineering? Michaelsen Gian Alexandre, Santos Renato P. Dos
- Large Language Models: A Survey Minaee Shervin, Mikolov Tomas, Nikzad Narjes, Chenaghlu Meysam, Socher Richard, Amatriain Xavier, Gao Jianfeng
- Overview Of Dialogue Robot Competition 2023 Minato Takashi, Higashinaka Ryuichiro, Sakai Kurima, Funayama Tomo, Nishizaki Hiromitsu, Naga Takayuki
- Trust No Bot: Discovering Personal Disclosures In Human-llm Conversations In The Wild Mireshghallah Niloofar, Antoniak Maria, More Yash, Choi Yejin, Farnadi Golnoosh
- Fine-grained Hallucination Detection And Editing For Language Models Mishra Abhika, Asai Akari, Balachandran Vidhisha, Wang Yizhong, Neubig Graham, Tsvetkov Yulia, Hajishirzi Hannaneh
- Evaluating Large Language Models With Human Feedback: Establishing A Swedish Benchmark Moell Birger
- Routoo: Learning To Route To Large Language Models Effectively Mohammadshahi Alireza, Shaikh Arshad Rafiq, Yazdani Majid
- Llava-chef: A Multi-modal Generative Model For Food Recipes Mohbat Fnu, Zaki Mohammed J.
- Can We Trust Large Language Models Generated Code? A Framework For In-context Learning, Security Patterns, And Code Evaluations Across Diverse Llms Mohsin Ahmad, Janicke Helge, Wood Adrian, Sarker Iqbal H., Maglaras Leandros, Janjua Naeem
- Comparison Of Large Language Models For Generating Contextually Relevant Questions Molina Ivo Lodovico, Švábenský Valdemar, Minematsu Tsubasa, Chen Li, Okubo Fumiya, Shimada Atsushi
- Kam-cot: Knowledge Augmented Multimodal Chain-of-thoughts Reasoning Mondal Debjyoti, Modi Suraj, Panda Subhadarshi, Singh Rituraj, Rao Godawari Sudhakar
- Realm: Reference Resolution As Language Modeling Moniz Joel Ruben Antony, Krishnan Soundarya, Ozyildirim Melis, Saraf Prathamesh, Ates Halim Cagri, Zhang Yuan, Yu Hong
- Are Llms Robust For Spoken Dialogues? Mousavi Seyed Mahed, Roccabruna Gabriel, Alghisi Simone, Rizzoli Massimo, Ravanelli Mirco, Riccardi Giuseppe
- Are Large Language Models Actually Good At Text Style Transfer? Mukherjee Sourabrata, Ojha Atul Kr., Dušek Ondřej
- Grouse: A Benchmark To Evaluate Evaluators In Grounded Question Answering Muller Sacha, Loison António, Omrani Bilel, Viaud Gautier
- Chatgpt In Classrooms: Transforming Challenges Into Opportunities In Education Munawar Harris Bin, Misirlis Nikolaos
- Evaluating The Evaluator: Measuring Llms' Adherence To Task Evaluation Instructions Murugadoss Bhuvanashree, Poelitz Christian, Drosos Ian, Le Vu, Mckenna Nick, Negreanu Carina Suzana, Parnin Chris, Sarkar Advait
- Open-llm-leaderboard: From Multi-choice To Open-style Questions For Llms Evaluation, Benchmark, And Arena Myrzakhan Aidar, Bsharat Sondos Mahmoud, Shen Zhiqiang
- Benchmarking Llama2, Mistral, Gemma And GPT For Factuality, Toxicity, Bias And Propensity For Hallucinations Nadeau David, Kroutikov Mike, Mcneil Karen, Baribeau Simon
- Negation Blindness In Large Language Models: Unveiling The NO Syndrome In Image Generation Nadeem Mohammad, Sohail Shahab Saquib, Cambria Erik, Schuller Björn W., Hussain Amir
- Zero-shot Visual Reasoning By Vision-language Models: Benchmarking And Analysis Nagar Aishik, Jaiswal Shantanu, Tan Cheston
- Generating Situated Reflection Triggers About Alternative Solution Paths: A Case Study Of Generative AI For Computer-supported Collaborative Learning Naik Atharva, Yin Jessica Ruhan, Kamath Anusha, Ma Qianou, Wu Sherry Tongshuang, Murray Charles, Bogart Christopher, Sakr Majd, Rose Carolyn P.
- Cdquant: Accurate Post-training Weight Quantization Of Large Pre-trained Models Using Greedy Coordinate Descent Nair Pranav Ajit, Suggala Arun Sai
- What If Red Can Talk? Dynamic Dialogue Generation Using Large Language Models Nananukul Navapat, Wongkamjan Wichayaporn
- Review-feedback-reason (refer): A Novel Framework For NLG Evaluation And Reasoning Narsupalli Yaswanth, Chandra Abhranil, Muppirala Sreevatsa, Gupta Manish, Goyal Pawan
- Revolutionizing Undergraduate Learning: Coursegpt And Its Generative AI Advancements Nazar Ahmad M., Selim Mohamed Y., Gaffar Ashraf, Ahmed Shakil
- Interpreting Context Look-ups In Transformers: Investigating Attention-mlp Interactions Neo Clement, Cohen Shay B., Barez Fazl
- What Drives Performance In Multilingual Language Models? Nezhad Sina Bagheri, Agrawal Ameeta
- Educational Personalized Learning Path Planning With Large Language Models Ng Chee, Fung Yuen
- How Well Can Llms Echo Us? Evaluating AI Chatbots' Role-play Ability With ECHO Ng Man Tik, Tse Hui Tung, Huang Jen-tse, Li Jingjing, Wang Wenxuan, Lyu Michael R.
- Dude: Dual Distribution-aware Context Prompt Learning For Large Vision-language Model Nguyen Duy M. H., Le An T., Nguyen Trung Q., Diep Nghiem T., Nguyen Tai, Duong-tran Duy, Peters Jan, Shen Li, Niepert Mathias, Sonntag Daniel
- Gpts And Language Barrier: A Cross-lingual Legal QA Examination Nguyen Ha-thanh, Yamada Hiroaki, Satoh Ken
- Better Alignment With Instruction Back-and-forth Translation Nguyen Thao, Li Jeffrey, Oh Sewoong, Schmidt Ludwig, Weston Jason, Zettlemoyer Luke, Li Xian
- Cultural Commonsense Knowledge For Intercultural Dialogues Nguyen Tuan-phong, Razniewski Simon, Weikum Gerhard
- Llms For Generating And Evaluating Counterfactuals: A Comprehensive Study Nguyen Van Bach, Youssef Paul, Schlötterer Jörg, Seifert Christin
- Xwin-lm: Strong And Scalable Alignment Practice For Llms Ni Bolin, Hu Jingcheng, Wei Yixuan, Peng Houwen, Zhang Zheng, Meng Gaofeng, Hu Han
- Mozip: A Multilingual Benchmark To Evaluate Large Language Models In Intellectual Property Ni Shiwen, Tan Minghuan, Bai Yuelin, Niu Fuqiang, Yang Min, Zhang Bowen, Xu Ruifeng, Chen Xiaojun, Li Chengming, Hu Xiping, Li Ye, Fan Jianping
- A Systematic Evaluation Of Large Language Models For Natural Language Generation Tasks Ni Xuanfan, Li Piji
- The GPT Surprise: Offering Large Language Model Chat In A Massive Coding Class Reduced Engagement But Increased Adopters Exam Performances Nie Allen, Chandak Yash, Suzara Miroslav, Ali Malika, Woodrow Juliette, Peng Matt, Sahami Mehran, Brunskill Emma, Piech Chris
- Decomposed Prompting: Unveiling Multilingual Linguistic Structure Knowledge In English-centric Large Language Models Nie Ercong, Yuan Shuzhou, Ma Bolei, Schmid Helmut, Färber Michael, Kreuter Frauke, Schütze Hinrich
- Cfinbench: A Comprehensive Chinese Financial Benchmark For Large Language Models Nie Ying, Yan Binwei, Guo Tianyu, Liu Hao, Wang Haoyu, He Wei, Zheng Binfan, Wang Weihao, Li Qiang, Sun Weijian, Wang Yunhe, Tao Dacheng
- WTU-EVAL: A Whether-or-not Tool Usage Evaluation Benchmark For Large Language Models Ning Kangyun, Su Yisong, Lv Xueqiang, Zhang Yuanzhe, Liu Jian, Liu Kang, Xu Jinan
- How Reliable AI Chatbots Are For Disease Prediction From Patient Complaints? Nipu Ayesha Siddika, Islam K M Sajjadul, Madiraju Praveen
- Enhancing Dialogue State Tracking Models Through Llm-backed User-agents Simulation Niu Cheng, Wang Xingguang, Cheng Xuxin, Song Juntong, Zhang Tong
- From Text To Emotion: Unveiling The Emotion Annotation Capabilities Of Llms Niu Minxue, Jaiswal Mimansa, Provost Emily Mower
- Mobileflow: A Multimodal LLM For Mobile GUI Agent Nong Songqin, Zhu Jiali, Wu Rui, Jin Jiongchao, Shan Shuo, Huang Xiutian, Xu Wenhao
- Generative AI In EU Law: Liability, Privacy, Intellectual Property, And Cybersecurity Novelli Claudio, Casolari Federico, Hacker Philipp, Spedicato Giorgio, Floridi Luciano
- Beyond Metrics: Evaluating Llms' Effectiveness In Culturally Nuanced, Low-resource Real-world Scenarios Ochieng Millicent, Gumma Varun, Sitaram Sunayana, Wang Jindong, Chaudhary Vishrav, Ronen Keshet, Bali Kalika, O'neill Jacki
- Gpt-ology, Computational Models, Silicon Sampling: How Should We Think About Llms In Cognitive Science? Ong Desmond C.
- Prompting Large Vision-language Models For Compositional Reasoning Ossowski Timothy, Jiang Ming, Hu Junjie
- The Accuracy Of Domain Specific And Descriptive Analysis Generated By Large Language Models Otieno Denish Omondi, Abri Faranak, Siami-namini Sima, Namin Akbar Siami
- Lossless Acceleration Of Large Language Model Via Adaptive N-gram Parallel Decoding Ou Jie, Chen Yueming, Tian Wenhong
- Unraveling The Dilemma Of AI Errors: Exploring The Effectiveness Of Human And Machine Explanations For Large Language Models Pafla Marvin, Larson Kate, Hancock Mark
- NLP At UC Santa Cruz At Semeval-2024 Task 5: Legal Answer Validation Using Few-shot Multi-choice QA Pahilajani Anish, Jain Samyak Rajesh, Trivedi Devasha
- Training Llms To Recognize Hedges In Spontaneous Narratives Paige Amie J., Soubki Adil, Murzaku John, Rambow Owen, Brennan Susan E.
- Gemini Goes To Med School: Exploring The Capabilities Of Multimodal Large Language Models On Medical Challenge Problems & Hallucinations Pal Ankit, Sankarasubbu Malaikannan
- Dense Training, Sparse Inference: Rethinking Training Of Mixture-of-experts Language Models Pan Bowen, Shen Yikang, Liu Haokun, Mishra Mayank, Zhang Gaoyuan, Oliva Aude, Raffel Colin, Panda Rameswar
- The Solution For The AIGC Inference Performance Optimization Competition Pan Sishun, Xu Haonan, Wan Zhonghua, Yang Yang
- Assessing AI Detectors In Identifying Ai-generated Code: Implications For Education Pan Wei Hung, Chok Ming Jie, Wong Jonathan Leong Shan, Shin Yung Xin, Poon Yeong Shian, Yang Zhou, Chong Chun Yong, Lo David, Lim Mei Kuan
- AILS-NTUA At Semeval-2024 Task 9: Cracking Brain Teasers: Transformer Models For Lateral Thinking Puzzles Panagiotopoulos Ioannis, Filandrianos Giorgos, Lymperaiou Maria, Stamou Giorgos
- Can Large Language Models Put 2 And 2 Together? Probing For Entailed Arithmetical Relationships Panas D., Seth S., Belle V.
- Phased Instruction Fine-tuning For Large Language Models Pang Wei, Zhou Chuan, Zhou Xiao-hua, Wang Xiaojie
- Self-alignment Of Large Language Models Via Monopolylogue-based Social Scene Simulation Pang Xianghe, Tang Shuo, Ye Rui, Xiong Yuxin, Zhang Bolun, Wang Yanfeng, Chen Siheng
- LLM Evaluators Recognize And Favor Their Own Generations Panickssery Arjun, Bowman Samuel R., Feng Shi
- Picturing Ambiguity: A Visual Twist On The Winograd Schema Challenge Park Brendan, Janecek Madeline, Ezzati-jivan Naser, Li Yifeng, Emami Ali
- Pragmatic Competence Evaluation Of Large Language Models For Korean Park Dojun, Lee Jiwoo, Jeong Hyeyun, Park Seohyun, Lee Sungeun
- Low-resource Cross-lingual Summarization Through Few-shot Learning With Large Language Models Park Gyutae, Hwang Seojin, Lee Hwanhee
- Llms For Enhanced Agricultural Meteorological Recommendations Park Ji-jun, Choi Soo-joon
- Building Trust In Mental Health Chatbots: Safety Metrics And Llm-based Evaluation Tools Park Jung In, Abbasian Mahyar, Azimi Iman, Bounds Dawn, Jun Angela, Han Jaesu, Mccarron Robert, Borelli Jessica, Li Jia, Mahmoudi Mona, Wiedenhoeft Carmen, Rahmani Amir
- Disentangling Length From Quality In Direct Preference Optimization Park Ryan, Rafailov Rafael, Ermon Stefano, Finn Chelsea
- Can Separators Improve Chain-of-thought Prompting? Park Yoonjeong, Kim Hyunjin, Choi Chanyeol, Kim Junseong, Sohn Jy-yong
- Logicbench: Towards Systematic Evaluation Of Logical Reasoning Ability Of Large Language Models Parmar Mihir, Patel Nisarg, Varshney Neeraj, Nakamura Mutsumi, Luo Man, Mashetty Santosh, Mitra Arindam, Baral Chitta
- Evidence To Generate (E2G): A Single-agent Two-step Prompting For Context Grounded And Retrieval Augmented Reasoning Parvez Md Rizwan
- Decoding News Narratives: A Critical Analysis Of Large Language Models In Framing Detection Pastorino Valeria, Sivakumar Jasivan A., Moosavi Nafise Sadat
- Automating Code Adaptation For Mlops -- A Benchmarking Study On Llms Patel Harsh, Ramanan Buvaneswari A., Khan Manzoor A., Williams Thomas, Friedman Brian, Drabeck Lawrence
- Multi-logieval: Towards Evaluating Multi-step Logical Reasoning Ability Of Large Language Models Patel Nisarg, Kulkarni Mohith, Parmar Mihir, Budhiraja Aashna, Nakamura Mutsumi, Varshney Neeraj, Baral Chitta
- SWAG: Storytelling With Action Guidance Patel Zeeshan, El-refai Karim, Pei Jonathan, Li Tianle
- Generative AI As A Metacognitive Agent: A Comparative Mixed-method Study With Human Participants On Icf-mimicking Exam Performance Pavlovic Jelena, Krstic Jugoslav, Mitrovic Luka, Babic Djordje, Milosavljevic Adrijana, Nikolic Milena, Karaklic Tijana, Mitrovic Tijana
- The Effectiveness Of Llms As Annotators: A Comparative Overview And Empirical Analysis Of Direct Representation Pavlovic Maja, Poesio Massimo
- Automated Multi-language To English Machine Translation Using Generative Pre-trained Transformers Pelofske Elijah, Urias Vincent, Liebrock Lorie M.
- Securing Large Language Models: Addressing Bias, Misinformation, And Prompt Attacks Peng Benji, Chen Keyu, Li Ming, Feng Pohsun, Bi Ziqian, Liu Junyu, Niu Qian
- Review-llm: Harnessing Large Language Models For Personalized Review Generation Peng Qiyao, Liu Hongtao, Xu Hongyan, Yang Qing, Shao Minglai, Wang Wenjun
- AI Revolution On Chat Bot: Evidence From A Randomized Controlled Experiment Peng Sida, Swiatek Wojciech, Gao Allen, Cullivan Paul, Chang Haoge
- Uncertainty-aware Explainable Recommendation With Large Language Models Peng Yicui, Chen Hao, Lin Chingsheng, Huang Guo, Hu Jinrong, Guo Hui, Kong Bin, Hu Shu, Wu Xi, Wang Xin
- Dreambench++: A Human-aligned Benchmark For Personalized Image Generation Peng Yuang, Cui Yuxin, Tang Haomiao, Qi Zekun, Dong Runpei, Bai Jing, Han Chunrui, Ge Zheng, Zhang Xiangyu, Xia Shu-tao
- Check-eval: A Checklist-based Approach For Evaluating Text Quality Pereira Jayr, Assumpcao Andre, Lotufo Roberto
- Astrollama-chat: Scaling Astrollama With Conversational And Diverse Datasets Perkowski Ernest, Pan Rui, Nguyen Tuan Dung, Ting Yuan-sen, Kruk Sandor, Zhang Tong, O'neill Charlie, Jablonska Maja, Sun Zechang, Smith Michael J., Liu Huiling, Schawinski Kevin, Iyer Kartheik, Ciucă Ioana (for UniverseTBD)
- Analyzing Large Language Models Chatbots: An Experimental Approach Using A Probability Test Peruchini Melise, Teixeira Julio Monteiro
- Large Language Models Can Infer Personality From Free-form User Interactions Peters Heinrich, Cerf Moran, Matz Sandra C.
- Benchmarking GPT-4 On Algorithmic Problems: A Systematic Evaluation Of Prompting Strategies Petruzzellis Flavio, Testolin Alberto, Sperduti Alessandro
- Graph Guided Question Answer Generation For Procedural Question-answering Pham Hai X., Hadji Isma, Xu Xinnuo, Degutyte Ziedune, Rainey Jay, Kazakos Evangelos, Fazly Afsaneh, Tzimiropoulos Georgios, Martinez Brais
- Can Base Chatgpt Be Used For Forecasting Without Additional Optimization? Pham Van, Cunningham Scott
- Sign Of The Times: Evaluating The Use Of Large Language Models For Idiomaticity Detection Phelps Dylan, Pickard Thomas, Mi Maggie, Gow-smith Edward, Villavicencio Aline
- Peering Into The Mind Of Language Models: An Approach For Attribution In Contextual Question Answering Phukan Anirudh, Somasundaram Shwetha, Saxena Apoorv, Goswami Koustava, Srinivasan Balaji Vasan
- Revisiting Verilogeval: Newer Llms, In-context Learning, And Specification-to-rtl Tasks Pinckney Nathaniel, Batten Christopher, Liu Mingjie, Ren Haoxing, Khailany Brucek
- Improving Context-aware Preference Modeling For Language Models Pitis Silviu, Xiao Ziang, Roux Nicolas Le, Sordoni Alessandro
- V-RECS, A Low-cost LLM4VIS Recommender With Explanations, Captioning And Suggestions Podo Luca, Angelini Marco, Velardi Paola
- Vi(e)va LLM! A Conceptual Stack For Evaluating And Interpreting Generative Ai-based Visualizations Podo Luca, Ishmal Muhammad, Angelini Marco
- Exploring The Efficacy Of Large Language Models (GPT-4) In Binary Reverse Engineering Pordanesh Saman, Tan Benjamin
- Deciphering The Factors Influencing The Efficacy Of Chain-of-thought: Probability, Memorization, And Noisy Reasoning Prabhakar Akshara, Griffiths Thomas L., Mccoy R. Thomas
- Sambanova SN40L: Scaling The AI Memory Wall With Dataflow And Composition Of Experts Prabhakar Raghu, Sivaramakrishnan Ram, Gandhi Darshan, Du Yun, Wang Mingran, Song Xiangyu, Zhang Kejie, Gao Tianren, Wang Angela, Li Karen, Sheng Yongning, Brot Joshua, Sokolov Denis, Vivek Apurv, Leung Calvin, Sabnis Arjun, Bai Jiayu, Zhao Tuowen, Gottscho Mark, Jackson David, Luttrell Mark, Shah Manish K., Chen Edison, Liang Kaizhao, Jain Swayambhoo, Thakker Urmish, Huang Dawei, Jairath Sumti, Brown Kevin J., Olukotun Kunle
- Accelerating Large Language Model Pretraining Via LFR Pedagogy: Learn, Focus, And Review Prakriya Neha, Yen Jui-nan, Hsieh Cho-jui, Cong Jason
- Fractured-sorry-bench: Framework For Revealing Attacks In Conversational Turns Undermining Refusal Efficacy And Defenses Over Sorry-bench Priyanshu Aman, Vijay Supriti
- When Quantization Affects Confidence Of Large Language Models? Proskurina Irina, Brun Luc, Metzler Guillaume, Velcin Julien
- Code Prompting Elicits Conditional Reasoning Abilities In Text+code Llms Puerto Haritz, Tutek Martin, Aditya Somak, Zhu Xiaodan, Gurevych Iryna
- Can LLM Generate Culturally Relevant Commonsense QA Data? Case Study In Indonesian And Sundanese Putri Rifki Afina, Haznitrama Faiz Ghifari, Adhista Dea, Oh Alice
- Is Next Token Prediction Sufficient For GPT? Exploration On Code Logic Comprehension Qi Mengnan, Huang Yufan, Yao Yongqiang, Wang Maoquan, Gu Bin, Sundaresan Neel
- SNIFFER: Multimodal Large Language Model For Explainable Out-of-context Misinformation Detection Qi Peng, Yan Zehong, Hsu Wynne, Lee Mong Li
- Follow My Instruction And Spill The Beans: Scalable Data Extraction From Retrieval-augmented Generation Systems Qi Zhenting, Zhang Hanlin, Xing Eric, Kakade Sham, Lakkaraju Himabindu
- Enhancing Dialogue Generation In Werewolf Game Through Situation Analysis And Persuasion Strategies Qi Zhiyang, Inaba Michimasa
- How Easy Is It To Fool Your Multimodal Llms? An Empirical Analysis On Deceptive Prompts Qian Yusu, Zhang Haotian, Yang Yinfei, Gan Zhe
- Prism: A Framework For Decoupling And Assessing The Capabilities Of Vlms Qiao Yuxuan, Duan Haodong, Fang Xinyu, Yang Junming, Chen Lin, Zhang Songyang, Wang Jiaqi, Lin Dahua, Chen Kai
- Supportiveness-based Knowledge Rewriting For Retrieval-augmented Language Modeling Qiao Zile, Ye Wei, Jiang Yong, Mo Tong, Xie Pengjun, Li Weiping, Huang Fei, Zhang Shikun
- Diffusiongpt: Llm-driven Text-to-image Generation System Qin Jie, Wu Jie, Chen Weifeng, Ren Yuxi, Li Huixia, Wu Hefeng, Xiao Xuefeng, Wang Rui, Wen Shilei
- Infobench: Evaluating Instruction Following Ability In Large Language Models Qin Yiwei, Song Kaiqiang, Hu Yebowen, Yao Wenlin, Cho Sangwoo, Wang Xiaoyang, Wu Xuansheng, Liu Fei, Liu Pengfei, Yu Dong
- Efficient Interactive LLM Serving With Proxy Model-based Sequence Length Prediction Qiu Haoran, Mao Weichao, Patke Archit, Cui Shengkun, Jha Saurabh, Wang Chen, Franke Hubertus, Kalbarczyk Zbigniew T., Başar Tamer, Iyer Ravishankar K.
- Dissecting Multiplication In Transformers: Insights Into Llms Qiu Luyu, Li Jianing, Su Chi, Zhang Chen Jason, Chen Lei
- Can Large Language Models Understand Symbolic Graphics Programs? Qiu Zeju, Liu Weiyang, Feng Haiwen, Liu Zhen, Xiao Tim Z., Collins Katherine M., Tenenbaum Joshua B., Weller Adrian, Black Michael J., Schölkopf Bernhard
- Grammaticality Representation In Chatgpt As Compared To Linguists And Laypeople Qiu Zhuang, Duan Xufeng, Cai Zhenguang G.
- Unsupervised Distractor Generation Via Large Language Model Distilling And Counterfactual Contrastive Decoding Qu Fanyi, Sun Hao, Wu Yunfang
- Before Generation, Align It! A Novel And Effective Strategy For Mitigating Hallucinations In Text-to-sql Generation Qu Ge, Li Jinyang, Li Bowen, Qin Bowen, Huo Nan, Ma Chenhao, Cheng Reynold
- Coursegpt-zh: An Educational Large Language Model Based On Knowledge Distillation Incorporating Prompt Optimization Qu Zheyan, Yin Lu, Yu Zitong, Wang Wenbo, Zhang Xing
- Llms Among Us: Generative AI Participating In Digital Discourse Radivojevic Kristina, Clark Nicholas, Brenner Paul
- Defan: Definitive Answer Dataset For Llms Hallucination Evaluation Rahman A B M Ashikur, Anwar Saeed, Usman Muhammad, Mian Ajmal
- An Investigation Of Neuron Activation As A Unified Lens To Explain Chain-of-thought Eliciting Arithmetic Reasoning Of Llms Rai Daking, Yao Ziyu
- Knowledge-based Consistency Testing Of Large Language Models Rajan Sai Sathiesh, Soremekun Ezekiel, Chattopadhyay Sudipta
- Scavenging Hyena: Distilling Transformers Into Long Convolution Models Ralambomihanta Tokiniaina Raharison, Mohammadzadeh Shahrad, Islam Mohammad Sami Nur, Jabbour Wassim, Liang Laurence
- Analyzing LLM Behavior In Dialogue Summarization: Unveiling Circumstantial Hallucination Trends Ramprasad Sanjana, Ferracane Elisa, Lipton Zachary C.
- The Battle Of Llms: A Comparative Study In Conversational QA Tasks Rangapur Aryan, Rangapur Aman
- A Critical Look At Tokenwise Reward-guided Text Generation Rashid Ahmad, Wu Ruotian, Grosse Julia, Kristiadi Agustinus, Poupart Pascal
- Lynx: An Open Source Hallucination Evaluation Model Ravi Selvan Sunitha, Mielczarek Bartosz, Kannappan Anand, Kiela Douwe, Qian Rebecca
- FACTOID: Factual Entailment For Hallucination Detection Rawte Vipula, Tonmoy S. M Towhidul Islam, Rajbangshi Krishnav, Nag Shravani, Chadha Aman, Sheth Amit P., Das Amitava
- Codeattack: Revealing Safety Generalization Challenges Of Large Language Models Via Code Completion Ren Qibing, Gao Chang, Shao Jing, Yan Junchi, Tan Xin, Lam Wai, Ma Lizhuang
- Healthcare Copilot: Eliciting The Power Of General Llms For Medical Consultation Ren Zhiyao, Zhan Yibing, Yu Baosheng, Ding Liang, Tao Dacheng
- The Benefits Of A Concise Chain Of Thought On Problem-solving In Large Language Models Renze Matthew, Guven Erhan
- Evaluating Llms For Gender Disparities In Notable Persons Rhue Lauren, Goethals Sofie, Sundararajan Arun
- Code Generation With Alphacodium: From Prompt Engineering To Flow Engineering Ridnik Tal, Kredo Dedy, Friedman Itamar
- Empirical Study Of Symmetrical Reasoning In Conversational Chatbots Rim Daniela N., Choi Heeyoul
- Pretrained Hybrids With MAD Skills Roberts Nicholas, Guo Samuel, Gao Zhiqi, Gnvv Satya Sai Srinath Namburi, Cromp Sonia, Wu Chengjun, Duan Chengyu, Sala Frederic
- ACCORD: Closing The Commonsense Measurability Gap Roewer-després François, Feng Jinyue, Zhu Zining, Rudzicz Frank
- Exploring Advanced Large Language Models With Llmsuite Roffo Giorgio
- Zero-shot Spam Email Classification Using Pre-trained Large Language Models Rojas-galeano Sergio
- Question-instructed Visual Descriptions For Zero-shot Video Question Answering Romero David, Solorio Thamar
- Do GPT Language Models Suffer From Split Personality Disorder? The Advent Of Substrate-free Psychometrics Romero Peter, Fitz Stephen, Nakatsuma Teruo
- Exploring The Potential Of The Large Language Models (llms) In Identifying Misleading News Headlines Rony Md Main Uddin, Haque Md Mahfuzul, Ali Mohammad, Alam Ahmed Shatil, Hassan Naeemul
- An Early Categorization Of Prompt Injection Attacks On Large Language Models Rossi Sippo, Michel Alisia Marianne, Mukkamala Raghava Rao, Thatcher Jason Bennett
- Generative AI For Enhancing Active Learning In Education: A Comparative Study Of GPT-3.5 And GPT-4 In Crafting Customized Test Questions Rouzegar Hamdireza, Makrehchi Masoud
- Code-aware Prompting: A Study Of Coverage Guided Test Generation In Regression Setting Using LLM Ryan Gabriel, Jain Siddhartha, Shang Mingyue, Wang Shiqi, Ma Xiaofei, Ramanathan Murali Krishna, Ray Baishakhi
- How Chatgpt Changed The Media's Narratives On AI: A Semi-automated Narrative Analysis Through Frame Semantics Ryazanov Igor, Öhman Carl, Björklund Johanna
- Tandem Transformers For Inference Efficient Llms S Aishwarya P, Nair Pranav Ajit, Samaga Yashas, Boyd Toby, Kumar Sanjiv, Jain Prateek, Netrapalli Praneeth
- Capabilities Of Gemini Models In Medicine Saab Khaled, Tu Tao, Weng Wei-hung, Tanno Ryutaro, Stutz David, Wulczyn Ellery, Zhang Fan, Strother Tim, Park Chunjong, Vedadi Elahe, Chaves Juanma Zambrano, Hu Szu-yeu, Schaekermann Mike, Kamath Aishwarya, Cheng Yong, Barrett David G. T., Cheung Cathy, Mustafa Basil, Palepu Anil, Mcduff Daniel, Hou Le, Golany Tomer, Liu Luyang, Alayrac Jean-baptiste, Houlsby Neil, Tomasev Nenad, Freyberg Jan, Lau Charles, Kemp Jonas, Lai Jeremy, Azizi Shekoofeh, Kanada Kimberly, Man Siwai, Kulkarni Kavita, Sun Ruoxi, Shakeri Siamak, He Luheng, Caine Ben, Webson Albert, Latysheva Natasha, Johnson Melvin, Mansfield Philip, Lu Jian, Rivlin Ehud, Anderson Jesper, Green Bradley, Wong Renee, Krause Jonathan, Shlens Jonathon, Dominowska Ewa, Eslami S. M. Ali, Chou Katherine, Cui Claire, Vinyals Oriol, Kavukcuoglu Koray, Manyika James, Dean Jeff, Hassabis Demis, Matias Yossi, Webster Dale, Barral Joelle, Corrado Greg, Semturs Christopher, Mahdavi S. Sara, Gottweis Juraj, Karthikesalingam Alan, Natarajan Vivek
- Are Llms Good Cryptic Crossword Solvers? Sadallah Abdelrahman "Boda", Kotova Daria, Kochmar Ekaterina
- Utebc-nlp At Semeval-2024 Task 9: Can Llms Be Lateral Thinkers? Sadeghi Pouya, Abaskohi Amirhossein, Yaghoobzadeh Yadollah
- Vision-language And Large Language Model Performance In Gastroenterology: GPT, Claude, Llama, Phi, Mistral, Gemma, And Quantized Models Safavi-naini Seyed Amir Ahmad, Ali Shuhaib, Shahab Omer, Shahhoseini Zahra, Savage Thomas, Rafiee Sara, Samaan Jamil S, Shabeeb Reem Al, Ladak Farah, Yang Jamie O, Echavarria Juan, Babar Sumbal, Shaukat Aasma, Margolis Samuel, Tatonetti Nicholas P, Nadkarni Girish, Kurdi Bara El, Soroush Ali
- On Zero-shot Counterspeech Generation By Llms Saha Punyajoy, Agrawal Aalok, Jana Abhik, Biemann Chris, Mukherjee Animesh
- Mixsumm: Topic-based Data Augmentation Using Llms For Low-resource Extractive Text Summarization Sahu Gaurav, Laradji Issam H.
- Risks, Causes, And Mitigations Of Widespread Deployments Of Large Language Models (llms): A Survey Sakib Md Nazmus, Islam Md Athikul, Pathak Royal, Arifin Md Mashrur
- Large Language Models Show Human-like Social Desirability Biases In Survey Responses Salecha Aadesh, Ireland Molly E., Subrahmanya Shashanka, Sedoc João, Ungar Lyle H., Eichstaedt Johannes C.
- Synergizing In-context Learning With Hints For End-to-end Task-oriented Dialog Systems Saley Vishal Vivek, Das Rocktim Jyoti, Raghu Dinesh, Mausam
- Berts Are Generative In-context Learners Samuel David
- Retrieval-augmented Generation Meets Data-driven Tabula Rasa Approach For Temporal Knowledge Graph Forecasting Sannidhi Geethan, Sakhinana Sagar Srinivas, Runkana Venkataramana
- Prompting Llms To Compose Meta-review Drafts From Peer-review Narratives Of Scholarly Manuscripts Santu Shubhra Kanti Karmaker, Sinha Sanjeev Kumar, Bansal Naman, Knipper Alex, Sarkar Souvika, Salvador John, Mahajan Yash, Guttikonda Sri, Akter Mousumi, Freestone Matthew, Williams Matthew C. Jr
- Are Machines Better At Complex Reasoning? Unveiling Human-machine Inference Gaps In Entailment Verification Sanyal Soumya, Xiao Tianyi, Liu Jiacheng, Wang Wenya, Ren Xiang
- Pre-training Small Base Lms With Fewer Tokens Sanyal Sunny, Sanghavi Sujay, Dimakis Alexandros G.
- ICAL: Continual Learning Of Multimodal Agents By Transforming Trajectories Into Actionable Insights Sarch Gabriel, Jang Lawrence, Tarr Michael J., Cohen William W., Marino Kenneth, Fragkiadaki Katerina
- Can Llms Master Math? Investigating Large Language Models On Math Stack Exchange Satpute Ankit, Giessing Noah, Greiner-petter Andre, Schubotz Moritz, Teschke Olaf, Aizawa Akiko, Gipp Bela
- A Prompt Response To The Demand For Automatic Gender-neutral Translation Savoldi Beatrice, Piergentili Andrea, Fucci Dennis, Negri Matteo, Bentivogli Luisa
- Improving The Validity Of Automatically Generated Feedback Via Reinforcement Learning Scarlatos Alexander, Smith Digory, Woodhead Simon, Lan Andrew
- Impacts Of Anthropomorphizing Large Language Models In Learning Environments Schaaff Kristina, Heidelmann Marc-andré
- The Human Factor In Detecting Errors Of Large Language Models: A Systematic Literature Review And Future Research Directions Schiller Christian A.
- Pitfalls Of Conversational Llms On News Debiasing Schlicht Ipek Baris, Altiok Defne, Taouk Maryanne, Flek Lucie
- M5 -- A Diverse Benchmark To Assess The Performance Of Large Multimodal Models Across Multilingual And Multicultural Vision-language Tasks Schneider Florian, Sitaram Sunayana
- Exploring Human-llm Conversations: Mental Models And The Originator Of Toxicity Schneider Johannes, Flores Arianna Casanova, Kranz Anne-catherine
- Morehopqa: More Than Multi-hop Reasoning Schnitzler Julian, Ho Xanh, Huang Jiahao, Boudin Florian, Sugawara Saku, Aizawa Akiko
- THREAD: Thinking Deeper With Recursive Spawning Schroeder Philip, Morgan Nathaniel, Luo Hongyin, Glass James
- Team UTSA-NLP At Semeval 2024 Task 5: Prompt Ensembling For Argument Reasoning In Civil Procedures With GPT4 Schumacher Dan, Rios Anthony
- Surprising Efficacy Of Fine-tuned Transformers For Fact-checking Over Larger Language Models Setty Vinay
- Benchmarking Large Language Models For Math Reasoning Tasks Seßler Kathrin, Rong Yao, Gözlüklü Emek, Kasneci Enkelejda
- Tencdm: Understanding The Properties Of Diffusion Model In The Space Of Language Model Encodings Shabalin Alexander, Meshchaninov Viacheslav, Chimbulatov Egor, Lapikov Vladislav, Kim Roman, Bartosh Grigory, Molchanov Dmitry, Markov Sergey, Vetrov Dmitry
- A Fine-grained Sentiment Analysis Of App Reviews Using Large Language Models: An Evaluation Study Shah Faiz Ali, Sabir Ahmed, Sharma Rajesh
- Putting Gpt-4o To The Sword: A Comprehensive Evaluation Of Language, Vision, Speech, And Multimodal Proficiency Shahriar Sakib, Lund Brady, Mannuru Nishith Reddy, Arshad Muhammad Arbab, Hayawi Kadhim, Bevara Ravi Varma Kumar, Mannuru Aashrith, Batool Laiba
- Evaluating Text Summaries Generated By Large Language Models Using Openai's GPT Shakil Hassan, Mahi Atqiya Munawara, Nguyen Phuoc, Ortiz Zeydy, Mardini Mamoun T.
- Combining Multiple Post-training Techniques To Achieve Most Efficient Quantized Llms Sharify Sayeh, Xu Zifei, Yazar Wanzin, Wang Xin
- Unirag: Universal Retrieval Augmentation For Multi-modal Large Language Models Sharifymoghaddam Sahel, Upadhyay Shivani, Chen Wenhu, Lin Jimmy
- A Critical Evaluation Of AI Feedback For Aligning Large Language Models Sharma Archit, Keh Sedrick, Mitchell Eric, Finn Chelsea, Arora Kushal, Kollar Thomas
- Patched MOA: Optimizing Inference For Diverse Software Development Tasks Sharma Asankhaya
- SPML: A DSL For Defending Language Models Against Prompt Attacks Sharma Reshabh K, Gupta Vinayak, Grossman Dan
- A Preliminary Study On Using Large Language Models In Software Pentesting Shashwat Kumar, Hahn Francis, Ou Xinming, Goldgof Dmitry, Hall Lawrence, Ligatti Jay, Rajgopalan S. Raj, Tabari Armin Ziaie
- Rumour Evaluation With Very Large Language Models Shehata Dahlia, Cohen Robin, Clarke Charles
- Shortcutsbench: A Large-scale Real-world Benchmark For Api-based Agents Shen Haiyang, Li Yue, Meng Desong, Cai Dongqi, Qi Sheng, Zhang Li, Xu Mengwei, Ma Yun
- Improving Reinforcement Learning From Human Feedback Using Contrastive Rewards Shen Wei, Zhang Xiaoying, Yao Yuanshun, Zheng Rui, Guo Hongyi, Liu Yang
- A Fine-tuning Dataset And Benchmark For Large Language Models For Protein Understanding Shen Yiqing, Chen Zan, Mamalakis Michail, He Luhan, Xia Haiyang, Li Tianbin, Su Yanzhou, He Junjun, Wang Yu Guang
- Toursynbio: A Multi-modal Large Model And Agent Framework To Bridge Text And Protein Sequences For Protein Engineering Shen Yiqing, Chen Zan, Mamalakis Michail, Liu Yungeng, Li Tianbin, Su Yanzhou, He Junjun, Liò Pietro, Wang Yu Guang
- Repeval: Effective Text Evaluation With LLM Representation Sheng Shuqian, Xu Yi, Zhang Tianhang, Shen Zanwei, Fu Luoyi, Ding Jiaxin, Zhou Lei, Wang Xinbing, Zhou Chenghu
- Can Language Models Explain Their Own Classification Behavior? Sherburn Dane, Chughtai Bilal, Evans Owain
- Using Chatgpt To Score Essays And Short-form Constructed Responses Shermis Mark D.
- Non-autoregressive Sequence-to-sequence Vision-language Models Shi Kunyu, Dong Qi, Goncalves Luis, Tu Zhuowen, Soatto Stefano
- Judging The Judges: A Systematic Investigation Of Position Bias In Pairwise Comparative Assessments By Llms Shi Lin, Ma Chiyu, Ma Weicheng, Vosoughi Soroush
- Keep The Cost Down: A Review On Methods To Optimize LLM's Kv-cache Consumption Shi Luohe, Zhang Hongyi, Yao Yao, Li Zuchao, Zhao Hai
- Math-llava: Bootstrapping Mathematical Reasoning For Multimodal Large Language Models Shi Wenhao, Hu Zhiqiang, Bin Yi, Liu Junhua, Yang Yang, Ng See-kiong, Bing Lidong, Lee Roy Ka-wei
- Abusegpt: Abuse Of Generative AI Chatbots To Create Smishing Campaigns Shibli Ashfak Md, Pritom Mir Mehedi A., Gupta Maanak
- CAUS: A Dataset For Question Generation Based On Human Cognition Leveraging Large Language Models Shin Minjung, Kim Donghyun, Ryu Jeh-kwang
- Deconstructing In-context Learning: Understanding Prompts Via Corruption Shivagunde Namrata, Lialin Vladislav, Muckatira Sherin, Rumshisky Anna
- GPT-4 Understands Discourse At Least As Well As Humans Do Shultz Thomas, Wise Jamie, Nobandegani Ardavan Salehi
- Design2code: How Far Are We From Automating Front-end Engineering? Si Chenglei, Zhang Yanzhe, Yang Zhengyuan, Liu Ruibo, Yang Diyi
- Where Does In-context Translation Happen In Large Language Models Sia Suzanna, Mueller David, Duh Kevin
- The Fault In Our Stars: Quality Assessment Of Code Generation Benchmarks Siddiq Mohammed Latif, Dristi Simantika, Saha Joy, Santos Joanna C. S.
- Scaling Synthetic Logical Reasoning Datasets With Context-sensitive Declarative Grammars Sileo Damien
- Indicgenbench: A Multilingual Benchmark To Evaluate Generation Capabilities Of Llms On Indic Languages Singh Harman, Gupta Nitish, Bharadwaj Shikhar, Tewari Dinesh, Talukdar Partha
- Exposing The Achilles' Heel: Evaluating Llms Ability To Handle Mistakes In Mathematical Reasoning Singh Joykirat, Nambi Akshay, Vineet Vibhav
- Llm-dcache: Improving Tool-augmented Llms With Gpt-driven Localized Data Caching Singh Simranjit, Fore Michael, Karatzas Andreas, Lee Chaehong, Jian Yanan, Shangguan Longfei, Yu Fuxun, Anagnostopoulos Iraklis, Stamoulis Dimitrios
- Evaluating Zero-shot GPT-4V Performance On 3D Visual Question Answering Benchmarks Singh Simranjit, Pavlakos Georgios, Stamoulis Dimitrios
- Are Small Language Models Ready To Compete With Large Language Models For Practical Applications? Sinha Neelabh, Jain Vinija, Chadha Aman
- Unipa-gpt: Large Language Models For University-oriented QA In Italian Siragusa Irene, Pirrone Roberto
- Using Adaptive Empathetic Responses For Teaching English Siyan Li, Shao Teresa, Yu Zhou, Hirschberg Julia
- Chatting Up Attachment: Using Llms To Predict Adult Bonds Soares Paulo, Mccurdy Sean, Gerber Andrew J., Fonagy Peter
- Multi-task Inference: Can Large Language Models Follow Multiple Instructions At Once? Son Guijin, Baek Sangwon, Nam Sangdae, Jeong Ilgyun, Kim Seungone
- KMMLU: Measuring Massive Multitask Language Understanding In Korean Son Guijin, Lee Hanwool, Kim Sungdong, Kim Seungone, Muennighoff Niklas, Choi Taekyoon, Park Cheonbok, Yoo Kang Min, Biderman Stella
- Increasing Model Capacity For Free: A Simple Strategy For Parameter Efficient Fine-tuning Song Haobo, Zhao Hao, Majumder Soumajit, Lin Tao
- Out-of-distribution Generalization Via Composition: A Lens Through Induction Heads In Transformers Song Jiajun, Xu Zhuoyan, Zhong Yiqiao
- Multilingual Blending: LLM Safety Alignment Evaluation With Language Mixture Song Jiayang, Huang Yuheng, Zhou Zhehua, Ma Lei
- Can Many-shot In-context Learning Help Long-context LLM Judges? See More, Judge Better! Song Mingyang, Zheng Mao, Luo Xuan
- Integrating AI In College Education: Positive Yet Mixed Experiences With Chatgpt Song Xinrui, Zhang Jiajin, Yan Pingkun, Hahn Juergen, Kruger Uwe, Mohamed Hisham, Wang Ge
- ELLA-V: Stable Neural Codec Language Modeling With Alignment-guided Sequence Reordering Song Yakun, Chen Zhuo, Wang Xiaofei, Ma Ziyang, Chen Xie
- The Good, The Bad, And The Greedy: Evaluation Of Llms Should Not Ignore Non-determinism Song Yifan, Wang Guoyin, Li Sujian, Lin Bill Yuchen
- Sharelora: Parameter Efficient And Robust Large Language Model Fine-tuning Via Shared Low-rank Adaptation Song Yurun, Zhao Junchen, Harris Ian G., Jyothi Sangeetha Abdu
- Moviellm: Enhancing Long Video Understanding With Ai-generated Movies Song Zhende, Wang Chenchen, Sheng Jiamu, Zhang Chi, Yu Gang, Fan Jiayuan, Chen Tao
- Llms Simulate Big Five Personality Traits: Further Evidence Sorokovikova Aleksandra, Fedorova Natalia, Rezagholi Sharwin, Yamshchikov Ivan P.
- Enhancing Ai-driven Psychological Consultation: Layered Prompts With Large Language Models Souza Rafael, Lim Jia-hao, Davis Alexander
- Repetition Improves Language Model Embeddings Springer Jacob Mitchell, Kotha Suhas, Fried Daniel, Neubig Graham, Raghunathan Aditi
- Chatgpt For Conversational Recommendation: Refining Recommendations By Reprompting With Feedback Spurlock Kyle Dylan, Acun Cagla, Saka Esin, Nasraoui Olfa
- Canttalkaboutthis: Aligning Language Models To Stay On Topic In Dialogues Sreedhar Makesh Narsimhan, Rebedea Traian, Ghosh Shaona, Zeng Jiaqi, Parisien Christopher
- Probing Multimodal Llms As World Models For Driving Sreeram Shiva, Wang Tsun-hsuan, Maalouf Alaa, Rosman Guy, Karaman Sertac, Rus Daniela
- Interpreting And Learning Voice Commands With A Large Language Model For A Robot System Stankevich Stanislau, Dudek Wojciech
- Fine-tuning Large Language Models For Entity Matching Steiner Aaron, Peeters Ralph, Bizer Christian
- RLVF: Learning From Verbal Feedback Without Overgeneralization Stephan Moritz, Khazatsky Alexander, Mitchell Eric, Chen Annie S, Hsu Sheryl, Sharma Archit, Finn Chelsea
- Satyrn: A Platform For Analytics Augmented Generation Sterbentz Marko, Barrie Cameron, Shahi Shubham, Dutta Abhratanu, Hooshmand Donna, Pack Harper, Hammond Kristian J.
- Llms Achieve Adult Human Performance On Higher-order Theory Of Mind Tasks Street Winnie, Siy John Oliver, Keeling Geoff, Baranes Adrien, Barnett Benjamin, Mckibben Michael, Kanyere Tatenda, Lentz Alison, Agüera y Arcas Blaise, Dunbar Robin I. M.
- Tailoring Vaccine Messaging With Common-ground Opinions Stureborg Rickard, Chen Sanxing, Xie Ruoyu, Patel Aayushi, Li Christopher, Zhu Chloe Qinyu, Hu Tingnan, Yang Jun, Dhingra Bhuwan
- ARKS: Active Retrieval In Knowledge Soup For Code Generation Su Hongjin, Jiang Shuyang, Lai Yuhang, Wu Haoyuan, Shi Boao, Liu Che, Liu Qian, Yu Tao
- LAB: Large-scale Alignment For Chatbots Sudalairaj Shivchander, Bhandwaldar Abhishek, Pareja Aldo, Xu Kai, Cox David D., Srivastava Akash
- Building Better AI Agents: A Provocation On The Utilisation Of Persona In Llm-based Conversational Agents Sun Guangzhi, Zhan Xiao, Such Jose
- Conifer: Improving Complex Constrained Instruction-following Ability Of Large Language Models Sun Haoran, Liu Lixin, Li Junjie, Wang Fengyu, Dong Baohua, Lin Ran, Huang Ruohui
- Bbox-adapter: Lightweight Adapting For Black-box Large Language Models Sun Haotian, Zhuang Yuchen, Wei Wei, Zhang Chao, Dai Bo
- Evograd: A Dynamic Take On The Winograd Schema Challenge With Human Adversaries Sun Jing Han, Emami Ali
- Lawluo: A Chinese Law Firm Co-run By LLM Agents Sun Jingyun, Dai Chengxiao, Luo Zhongze, Chang Yangbo, Li Yang
- Effectiveness Of Chatgpt In Explaining Complex Medical Reports To Patients Sun Mengxuan, Reiter Ehud, Kiltie Anne E, Ramsay George, Duncan Lisa, Murchie Peter, Adam Rosalind
- Fostering Natural Conversation In Large Language Models With NICO: A Natural Interactive Conversation Dataset Sun Renliang, Liu Mengyuan, Yang Shiping, Wang Rui, He Junqing, Zhang Jiaxing
- Dr-llava: Visual Instruction Tuning With Symbolic Clinical Grounding Sun Shenghuan, Goldgof Gregory M., Schubert Alexander, Sun Zhiqing, Hartvigsen Thomas, Butte Atul J., Alaa Ahmed
- Llm-oriented Retrieval Tuner Sun Si, Zhang Hanqing, Liu Zhiyuan, Bao Jie, Song Dawei
- Beyond Text: Utilizing Vocal Cues To Improve Decision Making In Llms For Robot Navigation Tasks Sun Xingpeng, Meng Haoming, Chakraborty Souradip, Bedi Amrit Singh, Bera Aniket
- Multi-turn Context Jailbreak Attack On Large Language Models From First Principles Sun Xiongtao, Zhang Deyue, Yang Dongdong, Zou Quanchen, Li Hui
- Rapidly Developing High-quality Instruction Data And Evaluation Benchmark For Large Language Models With Minimal Human Effort: A Case Study On Japanese Sun Yikun, Wan Zhen, Ueda Nobuhiro, Yahata Sakiko, Cheng Fei, Chu Chenhui, Kurohashi Sadao
- Leveraging Chatgpt In Pharmacovigilance Event Extraction: An Empirical Study Sun Zhaoyue, Pergola Gabriele, Wallace Byron C., He Yulan
- Towards Smaller, Faster Decoder-only Transformers: Architectural Variants And Their Implications Suresh Sathya Krishnan, P Shunmugapriya
- Meta-prompting: Enhancing Language Models With Task-agnostic Scaffolding Suzgun Mirac, Kalai Adam Tauman
- From Explanations To Action: A Zero-shot, Theory-driven LLM Framework For Student Performance Feedback Swamy Vinitra, Romano Davide, Desikan Bhargav Srinivasa, Camburu Oana-maria, Käser Tanja
- Automatic Generation And Evaluation Of Reading Comprehension Test Items With Large Language Models Säuberli Andreas, Clematide Simon
- Performance Assessment Of Chatgpt Vs Bard In Detecting Alzheimer's Dementia T Balamurali B, Chen Jer-ming
- PIXAR: Auto-regressive Language Modeling In Pixel Space Tai Yintao, Liao Xiyang, Suglia Alessandro, Vergari Antonio
- All In How You Ask For It: Simple Black-box Method For Jailbreak Attacks Takemoto Kazuhiro
- Phantom: Personality Has An Effect On Theory-of-mind Reasoning In Large Language Models Tan Fiona Anting, Yeo Gerard Christopher, Wu Fanyou, Xu Weijie, Jain Vinija, Chadha Aman, Jaidka Kokil, Liu Yang, Ng See-kiong
- Blinded By Generated Contexts: How Language Models Merge Generated And Retrieved Contexts When Knowledge Conflicts? Tan Hexiang, Sun Fei, Yang Wanli, Wang Yuanzhuo, Cao Qi, Cheng Xueqi
- Fine-tuning Large Language Model (LLM) Artificial Intelligence Chatbots In Ophthalmology And Llm-based Evaluation Using GPT-4 Tan Ting Fang, Elangovan Kabilan, Jin Liyuan, Jie Yao, Yong Li, Lim Joshua, Poh Stanley, Ng Wei Yan, Lim Daniel, Ke Yuhe, Liu Nan, Ting Daniel Shu Wei
- Large Language Models For Data Annotation: A Survey Tan Zhen, Li Dawei, Wang Song, Beigi Alimohammad, Jiang Bohan, Bhattacharjee Amrita, Karami Mansooreh, Li Jundong, Cheng Lu, Liu Huan
- Leveraging Language Models For Emotion And Behavior Analysis In Education Tanaka Kaito, Tan Benjamin, Wong Brian
- Instructdoc: A Dataset For Zero-shot Generalization Of Visual Document Understanding With Instructions Tanaka Ryota, Iki Taichi, Nishida Kyosuke, Saito Kuniko, Suzuki Jun
- Active Label Correction For Building Llm-based Modular AI Systems Taneja Karan, Goel Ashok
- Jill Watson: A Virtual Teaching Assistant Powered By Chatgpt Taneja Karan, Maiti Pratyusha, Kakar Sandeep, Guruprasad Pranav, Rao Sanjeev, Goel Ashok K.
- Tofueval: Evaluating Hallucinations Of Llms On Topic-focused Dialogue Summarization Tang Liyan, Shalyminov Igor, Wong Amy Wing-mei, Burnsky Jon, Vincent Jake W., Yang Yu'an, Singh Siffi, Feng Song, Song Hwanjun, Su Hang, Sun Lijia, Zhang Yi, Mansour Saab, Mckeown Kathleen
- Multihop-rag: Benchmarking Retrieval-augmented Generation For Multi-hop Queries Tang Yixuan, Yang Yi
- Mathscale: Scaling Instruction Tuning For Mathematical Reasoning Tang Zhengyang, Zhang Xingxing, Wang Benyou, Wei Furu
- Chatgpt Incorrectness Detection In Software Reviews Tanzil Minaoar Hossain, Khan Junaed Younus, Uddin Gias
- Enhancing Commentary Strategies For Imperfect Information Card Games: A Study Of Large Language Models In Guandan Commentary Tao Meiling, Liang Xuechen, Wang Ziyi, Tao Yiling, Shi Tianyu
- Chatgpt Role-play Dataset: Analysis Of User Motives And Model Naturalness Tao Yufei, Agrawal Ameeta, Dombi Judit, Sydorenko Tetyana, Lee Jung In
- Making A Long Story Short In Conversation Modeling Tao Yufei, Mines Tiernan, Agrawal Ameeta
- MEEL: Multi-modal Event Evolution Learning Tao Zhengwei, Jin Zhi, Huang Junqiang, Chen Xiancai, Bai Xiaoying, Zhao Haiyan, Zhang Yifan, Tao Chongyang
- Can Chatgpt Make Explanatory Inferences? Benchmarks For Abductive Reasoning Thagard Paul
- Judging The Judges: Evaluating Alignment And Vulnerabilities In Llms-as-judges Thakur Aman Singh, Choudhary Kartik, Ramayapally Venkat Srinik, Vaidyanathan Sankaran, Hupkes Dieuwke
- Equipping Language Models With Tool Use Capability For Tabular Data Analysis In Finance Theuma Adrian, Shareghi Ehsan
- Atomic Self-consistency For Better Long Form Generations Thirukovalluru Raghuveer, Huang Yukun, Dhingra Bhuwan
- Limits Of Transformer Language Models On Learning To Compose Algorithms Thomm Jonathan, Terzic Aleksandar, Camposampiero Giacomo, Hersche Michael, Schölkopf Bernhard, Rahimi Abbas
- Elitr-bench: A Meeting Assistant Benchmark For Long-context Language Models Thonet Thibaut, Rozen Jos, Besacier Laurent
- Elsevier Arena: Human Evaluation Of Chemistry/biology/health Foundational Large Language Models Thorne Camilo, Druckenbrodt Christian, Szarkowska Kinga, Goyal Deepika, Marajan Pranita, Somanath Vijay, Harper Corey, Yan Mao, Scerri Tony
- Dubo-sql: Diverse Retrieval-augmented Generation And Fine Tuning For Text-to-sql Thorpe Dayton G., Duberstein Andrew J., Kinsey Ian A.
- I Am A Strange Dataset: Metalinguistic Tests For Language Models Thrush Tristan, Moore Jared, Monares Miguel, Potts Christopher, Kiela Douwe
- Climategpt: Towards AI Synthesizing Interdisciplinary Research On Climate Change Thulke David, Gao Yingbo, Pelser Petrus, Brune Rein, Jalota Rricha, Fok Floris, Ramos Michael, Van Wyk Ian, Nasir Abdallah, Goldstein Hayden, Tragemann Taylor, Nguyen Katie, Fowler Ariana, Stanco Andrew, Gabriel Jon, Taylor Jordan, Moro Dean, Tsymbalov Evgenii, De Waal Juliette, Matusov Evgeny, Yaghi Mudar, Shihadah Mohammad, Ney Hermann, Dugast Christian, Dotan Jonathan, Erasmus Daniel
- Assessing Large Language Models In Mechanical Engineering Education: A Study On Mechanics-focused Conceptual Understanding Tian Jie, Hou Jixin, Wu Zihao, Shu Peng, Liu Zhengliang, Xiang Yujie, Gu Beikang, Filla Nicholas, Li Yiwei, Liu Ning, Chen Xianyan, Tang Keke, Liu Tianming, Wang Xianqiao
- Selective Prompt Anchoring For Code Generation Tian Yuan, Zhang Tianyi
- The Instinctive Bias: Spurious Images Lead To Hallucination In Mllms Tianyang Han, Qing Lian, Rui Pan, Renjie Pi, Jipeng Zhang, Shizhe Diao, Yong Lin, Tong Zhang
- Humor Mechanics: Advancing Humor Generation With Multistep Reasoning Tikhonov Alexey, Shtykovskiy Pavel
- Aalap: AI Assistant For Legal & Paralegal Functions In India Tiwari Aman, Kalamkar Prathamesh, Banerjee Atreyo, Karn Saurabh, Hemachandran Varun, Gupta Smita
- Generative AI Systems: A Systems-based Perspective On Generative AI Tomczak Jakub M.
- Eyes Wide Shut? Exploring The Visual Shortcomings Of Multimodal Llms Tong Shengbang, Liu Zhuang, Zhai Yuexiang, Ma Yi, Lecun Yann, Xie Saining
- Prompting Techniques For Secure Code Generation: A Systematic Investigation Tony Catherine, Ferreyra Nicolás E. Díaz, Mutas Markus, Dhiff Salem, Scandariato Riccardo
- Evaluating Large Language Models With Grid-based Game Competitions: An Extensible LLM Benchmark And Leaderboard Topsakal Oguzhan, Edell Colby Jacob, Harper Jackson Bailey
- Openmathinstruct-1: A 1.8 Million Math Instruction Tuning Dataset Toshniwal Shubham, Moshkov Ivan, Narenthiran Sean, Gitman Daria, Jia Fei, Gitman Igor
- Readctrl: Personalizing Text Generation With Readability-controlled Instruction Learning Tran Hieu, Yao Zonghai, Li Lingxi, Yu Hong
- Cognitive Modeling With Scaffolded Llms: A Case Study Of Referential Expression Generation Tsvilodub Polina, Franke Michael, Carcassi Fausto
- Augmenting The Author: Exploring The Potential Of AI Collaboration In Academic Writing Tu Joseph, Hadan Hilda, Wang Derrick M., Sgandurra Sabrina A, Mogavi Reza Hadi, Nacke Lennart E.
- Charactereval: A Chinese Benchmark For Role-playing Conversational Agent Evaluation Tu Quan, Fan Shilong, Tian Zihang, Yan Rui
- Unmasking The Imposters: In-domain Detection Of Human Vs. Machine-generated Tweets Tuck Bryan E., Verma Rakesh M.
- Dynamo: Accelerating Language Model Inference With Dynamic Multi-token Sampling Tuli Shikhar, Lin Chi-heng, Hsu Yen-chang, Jha Niraj K., Shen Yilin, Jin Hongxia
- Progres: Prompted Generative Rescoring On ASR N-best Tur Ada Defne, Moumen Adel, Ravanelli Mirco
- Adacoder: Adaptive Prompt Compression For Programmatic Visual Question Answering Ukai Mahiro, Kurita Shuhei, Hashimoto Atsushi, Ushiku Yoshitaka, Inoue Nakamasa
- Sandwich Attack: Multi-language Mixture Adaptive Attack On Llms Upadhayay Bibek, Behzadan Vahid
- PARADISE: Evaluating Implicit Planning Skills Of Language Models With Procedural Warnings And Tips Dataset Uzunoglu Arda, Safa Abdalfatah Rashid, Şahin Gözde Gül
- From Words To Numbers: Your Large Language Model Is Secretly A Capable Regressor When Given In-context Examples Vacareanu Robert, Negru Vlad-andrei, Suciu Vasile, Surdeanu Mihai
- Evaluating The Capability Of Large Language Models To Personalize Science Texts For Diverse Middle-school-age Learners Vaccaro Michael Jr, Friday Mikayla, Zaghi Arash
- Do Large Language Models Perform The Way People Expect? Measuring The Human Generalization Function Vafa Keyon, Rambachan Ashesh, Mullainathan Sendhil
- Natural Language Programming In Medicine: Administering Evidence Based Clinical Workflows With Autonomous Agents Powered By Generative Large Language Models Vaid Akhil, Lampert Joshua, Lee Juhee, Sawant Ashwin, Apakama Donald, Sakhuja Ankit, Soroush Ali, Bick Sarah, Abbott Ethan, Gomez Hernando, Hadley Michael, Lee Denise, Landi Isotta, Duong Son Q, Bussola Nicole, Nabeel Ismail, Muehlstedt Silke, Freeman Robert, Kovatch Patricia, Carr Brendan, Wang Fei, Glicksberg Benjamin, Argulian Edgar, Lerakis Stamatios, Khera Rohan, Reich David L., Kraft Monica, Charney Alexander, Nadkarni Girish
- Investigating The Performance Of Language Models For Completing Code In Functional Programming Languages: A Haskell Case Study Van Dam Tim, Van Der Heijden Frank, De Bekker Philippe, Nieuwschepen Berend, Otten Marc, Izadi Maliheh
- AI Sandbagging: Language Models Can Strategically Underperform On Evaluations Van Der Weij Teun, Hofstätter Felix, Jaffe Ollie, Brown Samuel F., Ward Francis Rhys
- Gender Bias In Machine Translation And The Era Of Large Language Models Vanmassenhove Eva
- Can GPT Redefine Medical Understanding? Evaluating GPT On Biomedical Machine Reading Comprehension Vatsal Shubham, Singh Ayush
- Replacing Judges With Juries: Evaluating LLM Generations With A Panel Of Diverse Models Verga Pat, Hofstatter Sebastian, Althammer Sophia, Su Yixuan, Piktus Aleksandra, Arkhangorodsky Arkady, Xu Minjie, White Naomi, Lewis Patrick
- Cross-modal Projection In Multimodal Llms Doesn't Really Project Visual Attributes To Textual Space Verma Gaurav, Choi Minje, Sharma Kartik, Watson-daniels Jamelle, Oh Sejoon, Kumar Srijan
- Auditing Counterfire: Evaluating Advanced Counterargument Generation With Evidence And Style Verma Preetika, Jaidka Kokil, Churina Svetlana
- Exploring LGBTQ+ Bias In Generative AI Answers Across Different Country And Religious Contexts Vicsek Lilla, Vancsó Anna, Zajko Mike, Takacs Judit
- Input Conditioned Graph Generation For Language Agents Vierling Lukas, Fu Jie, Chen Kai
- Exploring Safety-utility Trade-offs In Personalized Language Models Vijjini Anvesh Rao, Chowdhury Somnath Basu Roy, Chaturvedi Snigdha
- Emotional Manipulation Through Prompt Engineering Amplifies Disinformation Generation In AI Large Language Models Vinay Rasita, Spitale Giovanni, Biller-andorno Nikola, Germani Federico
- Medreqal: Examining Medical Knowledge Recall Of Large Language Models Via Question Answering Vladika Juraj, Schneider Phillip, Matthes Florian
- Foundational Autoraters: Taming Large Language Models For Better Automatic Evaluation Vu Tu, Krishna Kalpesh, Alzubi Salaheddin, Tar Chris, Faruqui Manaal, Sung Yun-hsuan
- Are Large Language Models Aligned With People's Social Intuitions For Human-robot Interactions? Wachowiak Lennart, Coles Andrew, Celiktutan Oya, Canal Gerard
- The Instruction Hierarchy: Training Llms To Prioritize Privileged Instructions Wallace Eric, Xiao Kai, Leike Reimar, Weng Lilian, Heidecke Johannes, Beutel Alex
- Fusechat: Knowledge Fusion Of Chat Models Wan Fanqi, Zhong Longguang, Yang Ziyi, Chen Ruijun, Quan Xiaojun
- A & B == B & A: Triggering Logical Reasoning Failures In Large Language Models Wan Yuxuan, Wang Wenxuan, Yang Yiliu, Yuan Youliang, Huang Jen-tse, He Pinjia, Jiao Wenxiang, Lyu Michael R.
- COSMO: Contrastive Streamlined Multimodal Model With Interleaved Pre-training Wang Alex Jinpeng, Li Linjie, Lin Kevin Qinghong, Wang Jianfeng, Lin Kevin, Yang Zhengyuan, Wang Lijuan, Shou Mike Zheng
- GOLF: Goal-oriented Long-term Life Tasks Supported By Human-ai Collaboration Wang Ben
- Task Supportive And Personalized Human-large Language Model Interaction: A User Study Wang Ben, Liu Jiqun, Karimnazarov Jamshed, Thompson Nicolas
- Halu-j: Critique-based Hallucination Judge Wang Binjie, Chern Steffi, Chern Ethan, Liu Pengfei
- Grokked Transformers Are Implicit Reasoners: A Mechanistic Journey To The Edge Of Generalization Wang Boshi, Yue Xiang, Su Yu, Sun Huan
- Direct: Diagnostic Reasoning For Clinical Notes Via Large Language Models Wang Bowen, Chang Jiuyang, Qian Yiming, Chen Guoxin, Chen Junhao, Jiang Zhouqiang, Zhang Jiahao, Nakashima Yuta, Nagahara Hajime
- Tokenization Matters! Degrading Large Language Models Through Challenging Their Tokenization Wang Dixuan, Li Yanda, Jiang Junyuan, Ding Zepeng, Jiang Guochao, Liang Jiaqing, Yang Deqing
- Adapting Open-source Large Language Models For Cost-effective, Expert-level Clinical Note Generation With On-policy Reinforcement Learning Wang Hanyin, Gao Chufan, Liu Bolun, Xu Qiping, Hussein Guleid, Labban Mohamad El, Iheasirim Kingsley, Korsapati Hariprasad, Outcalt Chuck, Sun Jimeng
- Reinforcement Learning For Edit-based Non-autoregressive Neural Machine Translation Wang Hao, Morimura Tetsuro, Honda Ukyo, Kawahara Daisuke
- Interpretable Preferences Via Multi-objective Reward Modeling And Mixture-of-experts Wang Haoxiang, Xiong Wei, Xie Tengyang, Zhao Han, Zhang Tong
- M4U: Evaluating Multilingual Understanding And Reasoning For Large Multimodal Models Wang Hongyu, Xu Jiayu, Xie Senwei, Wang Ruiping, Li Jialin, Xie Zhaojie, Zhang Bin, Xiong Chuyan, Chen Xilin
- Instructgraph: Boosting Large Language Models Via Graph-centric Instruction Tuning And Preference Alignment Wang Jianing, Wu Junda, Hou Yupeng, Liu Yao, Gao Ming, Mcauley Julian
- Techgpt-2.0: A Large Language Model Project To Solve The Task Of Knowledge Graph Construction Wang Jiaqi, Chang Yuying, Li Zhong, An Ning, Ma Qi, Hei Lei, Luo Haibo, Lu Yifei, Ren Feiliang
- Tarsier: Recipes For Training And Evaluating Large Video Description Models Wang Jiawei, Yuan Liping, Zhang Yuchen
- Bioinformatics And Biomedical Informatics With Chatgpt: Year One Review Wang Jinge, Cheng Zien, Yao Qiuming, Liu Li, Xu Dong, Hu Gangqing
- Mixture-of-agents Enhances Large Language Model Capabilities Wang Junlin, Wang Jue, Athiwaratkun Ben, Zhang Ce, Zou James
- Mambabyte: Token-free Selective State Space Model Wang Junxiong, Gangavarapu Tushaar, Yan Jing Nathan, Rush Alexander M.
- The Mamba In The Llama: Distilling And Accelerating Hybrid Models Wang Junxiong, Paliotta Daniele, May Avner, Rush Alexander M., Dao Tri
- Exploring The Potential Of Multimodal LLM With Knowledge-intensive Multimodal ASR Wang Minghan, Wang Yuxia, Vu Thuy-trang, Shareghi Ehsan, Haffari Gholamreza
- Beyond The Known: Investigating Llms Performance On Out-of-domain Intent Detection Wang Pei, He Keqing, Wang Yejie, Song Xiaoshuai, Mou Yutao, Wang Jingang, Xian Yunsen, Cai Xunliang, Xu Weiran
- Effective Demonstration Annotation For In-context Learning Via Language Model-based Determinantal Point Process Wang Peng, Wang Xiaobin, Lou Chao, Mao Shengyu, Xie Pengjun, Jiang Yong
- Cross-refine: Improving Natural Language Explanation Generation By Learning In Tandem Wang Qianli, Anikina Tatiana, Feldhus Nils, Ostermann Simon, Möller Sebastian, Schmitt Vera
- Automated Generation Of Multiple-choice Cloze Questions For Assessing English Vocabulary Using Gpt-turbo 3.5 Wang Qiao, Rose Ralph, Orita Naho, Sugawara Ayaka
- What Is The Best Way For Chatgpt To Translate Poetry? Wang Shanshan, Wong Derek F., Yao Jingming, Chao Lidia S.
- Self-taught Evaluators Wang Tianlu, Kulikov Ilia, Golovneva Olga, Yu Ping, Yuan Weizhe, Dwivedi-yu Jane, Pang Richard Yuanzhe, Fazel-zarandi Maryam, Weston Jason, Li Xian
- Weaver: Foundation Models For Creative Writing Wang Tiannan, Chen Jiamin, Jia Qingrui, Wang Shuai, Fang Ruoyu, Wang Huilin, Gao Zhaowei, Xie Chunzhao, Xu Chuou, Dai Jihong, Liu Yibin, Wu Jialong, Ding Shengwei, Li Long, Huang Zhiwei, Deng Xinle, Yu Teng, Ma Gangan, Xiao Han, Chen Zixin, Xiang Danjun, Wang Yunxia, Zhu Yuanyuan, Xiao Yi, Wang Jing, Wang Yiru, Ding Siran, Huang Jiayang, Xu Jiayi, Tayier Yilihamu, Hu Zhenyu, Gao Yuan, Zheng Chengfeng, Ye Yueshu, Li Yihang, Wan Lei, Jiang Xinyue, Wang Yujie, Cheng Siyu, Song Zhule, Tang Xiangru, Xu Xiaohua, Zhang Ningyu, Chen Huajun, Jiang Yuchen Eleanor, Zhou Wangchunshu
- Less Is More For Improving Automatic Evaluation Of Factual Consistency Wang Tong, Kulkarni Ninad, Qi Yanjun
- Using Advanced Llms To Enhance Smaller Llms: An Interpretable Knowledge Distillation Approach Wang Tong, Sudhir K., Hong Dat
- Rocks Coding, Not Development--a Human-centric, Experimental Evaluation Of Llm-supported SE Tasks Wang Wei, Ning Huilong, Zhang Gaowei, Liu Libo, Wang Yi
- Testing And Evaluation Of Large Language Models: Correctness, Non-toxicity, And Fairness Wang Wenxuan
- Metatool: Facilitating Large Language Models To Master Tools With Meta-task Augmentation Wang Xiaohan, Li Dian, Zhao Yilin, Sinbadliu, Wang Hui
- Validating Llm-generated Programs With Metamorphic Prompt Testing Wang Xiaoyin, Zhu Dakai
- Coglm: Tracking Cognitive Development Of Large Language Models Wang Xinglin, Yuan Peiwen, Feng Shaoxiong, Li Yiwei, Pan Boyuan, Wang Heda, Hu Yao, Li Kan
- "my Answer Is C": First-token Probabilities Do Not Match Text Answers In Instruction-tuned Language Models Wang Xinpeng, Ma Bolei, Hu Chengzhi, Weber-genzel Leon, Röttger Paul, Kreuter Frauke, Hovy Dirk, Plank Barbara
- Negativeprompt: Leveraging Psychology For Large Language Models Enhancement Via Negative Emotional Stimuli Wang Xu, Li Cheng, Chang Yi, Wang Jindong, Wu Yuan
- UBENCH: Benchmarking Uncertainty In Large Language Models With Multiple Choice Questions Wang Xunzhi, Zhang Zhuowei, Li Qiongyu, Chen Gaonan, Hu Mengting, Li Zhiyu, Luo Bitong, Gao Hang, Han Zhixin, Wang Haotian
- EAGER: Two-stream Generative Recommender With Behavior-semantic Collaboration Wang Ye, Xun Jiahao, Hong Minjie, Zhu Jieming, Jin Tao, Lin Wang, Li Haoyuan, Li Linjun, Xia Yan, Zhao Zhou, Dong Zhenhua
- Speak It Out: Solving Symbol-related Problems With Symbol-to-language Conversion For Language Models Wang Yile, Cheng Sijie, Sun Zixin, Li Peng, Liu Yang
- Ladic: Are Diffusion Models Really Inferior To Autoregressive Counterparts For Image-to-text Generation? Wang Yuchi, Ren Shuhuai, Gao Rundong, Yao Linli, Guo Qingyan, An Kaikai, Bai Jianhong, Sun Xu
- Sibyl: Simple Yet Effective Agent Framework For Complex Real-world Reasoning Wang Yulong, Shen Tianhao, Liu Lifeng, Xie Jian
- Loma: Lossless Compressed Memory Attention Wang Yumeng, Xiao Zhenyang
- To Recommend Or Not: Recommendability Identification In Conversations With Pre-trained Language Models Wang Zhefan, Ma Weizhi, Zhang Min
- Helpsteer2: Open-source Dataset For Training Top-performing Reward Models Wang Zhilin, Dong Yi, Delalleau Olivier, Zeng Jiaqi, Shen Gerald, Egert Daniel, Zhang Jimmy J., Sreedhar Makesh Narsimhan, Kuchaiev Oleksii
- Trove: Inducing Verifiable And Efficient Toolboxes For Solving Programmatic Tasks Wang Zhiruo, Fried Daniel, Neubig Graham
- Chatlogic: Integrating Logic Programming With Large Language Models For Multi-step Reasoning Wang Zhongsheng, Liu Jiamou, Bao Qiming, Rong Hongfei, Zhang Jingfeng
- Omnijarvis: Unified Vision-language-action Tokenization Enables Open-world Instruction Following Agents Wang Zihao, Cai Shaofei, Mu Zhancun, Lin Haowei, Zhang Ceyao, Liu Xuejie, Li Qing, Liu Anji, Ma Xiaojian, Liang Yitao
- RAT: Retrieval Augmented Thoughts Elicit Context-aware Reasoning In Long-horizon Generation Wang Zihao, Liu Anji, Lin Haowei, Li Jiaqi, Ma Xiaojian, Liang Yitao
- Eliminating Position Bias Of Language Models: A Mechanistic Approach Wang Ziqi, Zhang Hanlin, Li Xiner, Huang Kuan-hao, Han Chi, Ji Shuiwang, Kakade Sham M., Peng Hao, Ji Heng
- C3LLM: Conditional Multimodal Content Generation Using Large Language Models Wang Zixuan, Duan Qinkai, Tai Yu-wing, Tang Chi-keung
- PARIKSHA: A Large-scale Investigation Of Human-llm Evaluator Agreement On Multilingual And Multi-cultural Data Watts Ishaan, Gumma Varun, Yadavalli Aditya, Seshadri Vivek, Swaminathan Manohar, Sitaram Sunayana
- Investigating Multilingual Instruction-tuning: Do Polyglot Models Demand For Multilingual Instructions? Weber Alexander Arno, Thellmann Klaudia, Ebert Jan, Flores-herr Nicolas, Lehmann Jens, Fromm Michael, Ali Mehdi
- Large Language Models Are Pattern Matchers: Editing Semi-structured And Structured Documents With Chatgpt Weber Irene
- Hidden In Plain Sight: Exploring Chat History Tampering In Interactive Language Models Wei Cheng'an, Zhao Yue, Gong Yujia, Chen Kai, Xiang Lu, Zhu Shenchen
- OPDAI At Semeval-2024 Task 6: Small Llms Can Accelerate Hallucination Detection With Weakly Supervised Data Wei Chengcheng, Chen Ze, Fang Songtan, He Jiarong, Gao Max
- Rethinking Generative Large Language Model Evaluation For Semantic Comprehension Wei Fangyun, Chen Xi, Luo Lin
- Systematic Evaluation Of Llm-as-a-judge In LLM Alignment Tasks: Explainable Metrics And Diverse Prompt Templates Wei Hui, He Shenghua, Xia Tian, Wong Andy, Lin Jingyang, Han Mei
- Long-form Factuality In Large Language Models Wei Jerry, Yang Chengrun, Song Xinying, Lu Yifeng, Hu Nathan, Huang Jie, Tran Dustin, Peng Daiyi, Liu Ruibo, Huang Da, Du Cosmo, Le Quoc V.
- Performance Evaluation Of Lightweight Open-source Large Language Models In Pediatric Consultations: A Comparative Analysis Wei Qiuhong, Cui Ying, Ding Mengwei, Wang Yanqin, Xiang Lingling, Yao Zhengxiong, Chen Ceran, Long Ying, Jin Zhezhen, Xu Ximing
- Are Large Language Models More Empathetic Than Humans? Welivita Anuradha, Pu Pearl
- Longvlm: Efficient Long Video Understanding Via Large Language Models Weng Yuetian, Han Mingfei, He Haoyu, Chang Xiaojun, Zhuang Bohan
- Codeultrafeedback: An Llm-as-a-judge Dataset For Aligning Large Language Models To Coding Preferences Weyssow Martin, Kamanda Aton, Sahraoui Houari
- Effects Of A Prompt Engineering Intervention On Undergraduate Students' AI Self-efficacy, AI Knowledge And Prompt Engineering Ability: A Mixed Methods Study Woo David James, Wang Deliang, Yung Tim, Guo Kai
- A Surprising Failure? Multimodal Llms And The NLVR Challenge Wu Anne, Brantley Kianté, Artzi Yoav
- Curriculum Learning With Quality-driven Data Selection Wu Biao, Meng Fang, Chen Ling
- Towards Evaluating And Building Versatile Large Language Models For Medicine Wu Chaoyi, Qiu Pengcheng, Liu Jinxin, Gu Hongfei, Li Na, Zhang Ya, Wang Yanfeng, Xie Weidi
- Parameter-efficient Sparsity Crafting From Dense To Mixture-of-experts For Instruction Tuning On General Tasks Wu Haoyuan, Zheng Haisheng, He Zhuolun, Yu Bei
- Guiding Clinical Reasoning With Large Language Models Via Knowledge Seeds Wu Jiageng, Wu Xian, Yang Jie
- Gendec: A Robust Generative Question-decomposition Method For Multi-hop Reasoning Wu Jian, Yang Linyi, Ji Yuliang, Huang Wenhao, Karlsson Börje F., Okumura Manabu
- GEB-1.3B: Open Lightweight Large Language Model Wu Jie, Zhu Yufeng, Shen Lei, Lu Xuqing
- Can Large Language Models Understand Uncommon Meanings Of Common Words? Wu Jinyang, Che Feihu, Zheng Xinxin, Zhang Shuai, Jin Ruihan, Nie Shuai, Shao Pengpeng, Tao Jianhua
- Clasheval: Quantifying The Tug-of-war Between An Llm's Internal Prior And External Evidence Wu Kevin, Wu Eric, Zou James
- Adapting Large Language Models For Document-level Machine Translation Wu Minghao, Vu Thuy-trang, Qu Lizhen, Foster George, Haffari Gholamreza
- Parallel Decoding Via Hidden Transfer For Lossless Large Language Model Acceleration Wu Pengfei, Liu Jiahao, Gong Zhuocheng, Wang Qifan, Li Jinpeng, Wang Jingang, Cai Xunliang, Zhao Dongyan
- Motionllm: Multimodal Motion-language Learning With Large Language Models Wu Qi, Zhao Yubo, Wang Yifan, Tai Yu-wing, Tang Chi-keung
- GUI Action Narrator: Where And When Did That Action Take Place? Wu Qinchen, Gao Difei, Lin Kevin Qinghong, Wu Zhuoyu, Guo Xiangwu, Li Peiran, Zhang Weichen, Wang Hengxu, Shou Mike Zheng
- Videollm-mod: Efficient Video-language Streaming With Mixture-of-depths Vision Computation Wu Shiwei, Chen Joya, Lin Kevin Qinghong, Wang Qimeng, Gao Yan, Xu Qianli, Xu Tong, Hu Yao, Chen Enhong, Shou Mike Zheng
- Enhance Reasoning For Large Language Models In The Game Werewolf Wu Shuang, Zhu Liwen, Yang Tao, Xu Shiwei, Fu Qiang, Wei Yang, Fu Haobo
- Unigen: A Unified Framework For Textual Dataset Generation Using Large Language Models Wu Siyuan, Huang Yue, Gao Chujie, Chen Dongping, Zhang Qihui, Wan Yao, Zhou Tianyi, Zhang Xiangliang, Gao Jianfeng, Xiao Chaowei, Sun Lichao
- Self-directed Turing Test For Large Language Models Wu Weiqi, Wu Hongqiu, Zhao Hai
- Freeva: Offline MLLM As Training-free Video Assistant Wu Wenhao
- Long Context Alignment With Short Instructions And Synthesized Positions Wu Wenhao, Wang Yizhong, Fu Yao, Yue Xiang, Zhu Dawei, Li Sujian
- Do Language Models Plan Ahead For Future Tokens? Wu Wilson, Morris John X., Levine Lionel
- Tablebench: A Comprehensive And Complex Benchmark For Table Question Answering Wu Xianjie, Yang Jian, Chai Linzheng, Zhang Ge, Liu Jiaheng, Du Xinrun, Liang Di, Shu Daixin, Cheng Xianfu, Sun Tianzhen, Niu Guanglin, Li Tongliang, Li Zhoujun
- VILA-U: A Unified Foundation Model Integrating Visual Understanding And Generation Wu Yecheng, Zhang Zhuoyang, Chen Junyu, Tang Haotian, Li Dacheng, Fang Yunhao, Zhu Ligeng, Xie Enze, Yin Hongxu, Yi Li, Han Song, Lu Yao
- Elicitationgpt: Text Elicitation Mechanisms Via Language Models Wu Yifan, Hartline Jason
- Inversecoder: Unleashing The Power Of Instruction-tuned Code Llms With Inverse-instruct Wu Yutong, Huang Di, Shi Wenxuan, Wang Wei, Gao Lingzhe, Liu Shihao, Nan Ziyuan, Yuan Kaizhao, Zhang Rui, Zhang Xishan, Du Zidong, Guo Qi, Pu Yewen, Yin Dawei, Hu Xing, Chen Yunji
- Instructing Large Language Models To Identify And Ignore Irrelevant Conditions Wu Zhenyu, Shen Chao, Jiang Meng
- Large Language Models Can Self-correct With Minimal Effort Wu Zhenyu, Zeng Qingkai, Zhang Zhihan, Tan Zhaoxuan, Shen Chao, Jiang Meng
- Acquiring Clean Language Models From Backdoor Poisoned Datasets By Downscaling Frequency Space Wu Zongru, Zhang Zhuosheng, Cheng Pengzhou, Liu Gongshen
- Imitating Language Via Scalable Inverse Reinforcement Learning Wulfmeier Markus, Bloesch Michael, Vieillard Nino, Ahuja Arun, Bornschein Jorg, Huang Sandy, Sokolov Artem, Barnes Matt, Desjardins Guillaume, Bewley Alex, Bechtle Sarah Maria Elisabeth, Springenberg Jost Tobias, Momchev Nikola, Bachem Olivier, Geist Matthieu, Riedmiller Martin
- FOFO: A Benchmark To Evaluate Llms' Format-following Capability Xia Congying, Xing Chen, Du Jiangshu, Yang Xinyi, Feng Yihao, Xu Ran, Yin Wenpeng, Xiong Caiming
- Unlocking Efficiency In Large Language Model Inference: A Comprehensive Survey Of Speculative Decoding Xia Heming, Yang Zhe, Dong Qingxiu, Wang Peiyi, Li Yongqi, Ge Tao, Liu Tianyu, Li Wenjie, Sui Zhifang
- Language Models Can Evaluate Themselves Via Probability Discrepancy Xia Tingyu, Yu Bowen, Wu Yuan, Chang Yi, Zhou Chang
- Improving Retrieval Augmented Language Model With Self-reasoning Xia Yuan, Zhou Jingbo, Shi Zhenhui, Chen Jun, Huang Haifeng
- What To Do If Language Models Disagree? Black-box Model Ensembling For Textual And Visual Question Answering Xia Yuxi, Zaporojets Klim, Roth Benjamin
- Pandora: Towards General World Model With Natural Language Actions And Video States Xiang Jiannan, Liu Guangyi, Gu Yi, Gao Qiyue, Ning Yuting, Zha Yuheng, Feng Zeyu, Tao Tianhua, Hao Shibo, Shi Yemin, Liu Zhengzhong, Xing Eric P., Hu Zhiting
- Badchain: Backdoor Chain-of-thought Prompting For Large Language Models Xiang Zhen, Jiang Fengqing, Xiong Zidi, Ramasubramanian Bhaskar, Poovendran Radha, Li Bo
- A Comprehensive Survey Of Large Language Models And Multimodal Large Language Models In Medicine Xiao Hanguang, Zhou Feizhong, Liu Xingyue, Liu Tianqi, Li Zhipeng, Liu Xin, Huang Xiaoxuan
- Exploring How Multiple Levels Of Gpt-generated Programming Hints Support Or Disappoint Novices Xiao Ruiwei, Hou Xinying, Stamper John
- Seeing The Image: Prioritizing Visual Correlation By Contrastive Alignment Xiao Xin, Wu Bohong, Wang Jiacong, Li Chunyuan, Zhou Xun, Guo Haoyuan
- Emovit: Revolutionizing Emotion Insights With Visual Instruction Tuning Xie Hongxia, Peng Chu-jun, Tseng Yu-wen, Chen Hung-jen, Hsu Chan-feng, Shuai Hong-han, Cheng Wen-huang
- Show-o: One Single Transformer To Unify Multimodal Understanding And Generation Xie Jinheng, Mao Weijia, Bai Zechen, Zhang David Junhao, Wang Weihao, Lin Kevin Qinghong, Gu Yuchao, Chen Zhijie, Yang Zhenheng, Shou Mike Zheng
- Non-instructional Fine-tuning: Enabling Instruction-following Capabilities In Pre-trained Language Models Without Instruction-following Data Xie Juncheng, Syu Shensian, Lee Hung-yi
- Me Llama: Foundation Large Language Models For Medical Applications Xie Qianqian, Chen Qingyu, Chen Aokun, Peng Cheng, Hu Yan, Lin Fongci, Peng Xueqing, Huang Jimin, Zhang Jeffrey, Keloth Vipina, Zhou Xinyu, He Huan, Ohno-machado Lucila, Wu Yonghui, Xu Hua, Bian Jiang
- Codeshell Technical Report Xie Rui, Zeng Zhengran, Yu Zhuohao, Gao Chang, Zhang Shikun, Ye Wei
- Sorry-bench: Systematically Evaluating Large Language Model Safety Refusal Behaviors Xie Tinghao, Qi Xiangyu, Zeng Yi, Huang Yangsibo, Sehwag Udari Madhushani, Huang Kaixuan, He Luxi, Wei Boyi, Li Dacheng, Sheng Ying, Jia Ruoxi, Li Bo, Li Kai, Chen Danqi, Henderson Peter, Mittal Prateek
- RALL-E: Robust Codec Language Modeling With Chain-of-thought Prompting For Text-to-speech Synthesis Xin Detai, Tan Xu, Shen Kai, Ju Zeqian, Yang Dongchao, Wang Yuancheng, Takamichi Shinnosuke, Saruwatari Hiroshi, Liu Shujie, Li Jinyu, Zhao Sheng
- Benchmarking Retrieval-augmented Generation For Medicine Xiong Guangzhi, Jin Qiao, Lu Zhiyong, Zhang Aidong
- Improving Retrieval-augmented Generation In Medicine With Iterative Follow-up Questions Xiong Guangzhi, Jin Qiao, Wang Xiao, Zhang Minjia, Lu Zhiyong, Zhang Aidong
- Converging Paradigms: The Synergy Of Symbolic And Connectionist AI In Llm-empowered Autonomous Agents Xiong Haoyi, Wang Zhiyuan, Li Xuhong, Bian Jiang, Xie Zeke, Mumtaz Shahid, Barnes Laura E.
- From Artificial Needles To Real Haystacks: Improving Retrieval Capabilities In Llms By Finetuning On Synthetic Data Xiong Zheyang, Papageorgiou Vasilis, Lee Kangwook, Papailiopoulos Dimitris
- Data Efficient Evaluation Of Large Language Models And Text-to-image Models Via Adaptive Sampling Xu Cong, Saranathan Gayathri, Alam Mahammad Parwez, Shah Arpit, Lim James, Wong Soon Yee, Foltin Martin, Bhattacharya Suparna
- Contrastive Preference Optimization: Pushing The Boundaries Of LLM Performance In Machine Translation Xu Haoran, Sharaf Amr, Chen Yunmo, Tan Weiting, Shen Lingfeng, Van Durme Benjamin, Murray Kenton, Kim Young Jin
- Redagent: Red Teaming Large Language Models With Context-aware Autonomous Language Agent Xu Huiyu, Zhang Wenhui, Wang Zhibo, Xiao Feng, Zheng Rui, Feng Yunhe, Ba Zhongjie, Ren Kui
- Reverse Image Retrieval Cues Parametric Memory In Multimodal Llms Xu Jialiang, Moor Michael, Leskovec Jure
- Tur[k]ingbench: A Challenge Benchmark For Web Agents Xu Kevin, Kordi Yeganeh, Nayak Tanay, Asija Ado, Wang Yizhong, Sanders Kate, Byerly Adam, Zhang Jingyu, Van Durme Benjamin, Khashabi Daniel
- Accelerating Large Language Model Training With Hybrid Gpu-based Compression Xu Lang, Anthony Quentin, Zhou Qinghua, Alnaasan Nawras, Gulhane Radha R., Shafi Aamir, Subramoni Hari, Panda Dhabaleswar K.
- Prompting Large Language Models For Recommender Systems: A Comprehensive Framework And Empirical Analysis Xu Lanling, Zhang Junjie, Li Bingqian, Wang Jinpeng, Cai Mingchen, Zhao Wayne Xin, Wen Ji-rong
- Superclue-math6: Graded Multi-step Math Reasoning Benchmark For Llms In Chinese Xu Liang, Xue Hang, Zhu Lei, Zhao Kangkang
- Chatqa 2: Bridging The Gap To Proprietary Llms In Long Context And RAG Capabilities Xu Peng, Ping Wei, Wu Xianchao, Xu Chejian, Liu Zihan, Shoeybi Mohammad, Catanzaro Bryan
- BESA: Pruning Large Language Models With Blockwise Parameter-efficient Sparsity Allocation Xu Peng, Shao Wenqi, Chen Mengzhao, Tang Shitao, Zhang Kaipeng, Gao Peng, An Fengwei, Qiao Yu, Luo Ping
- Walking In Others' Shoes: How Perspective-taking Guides Large Language Models In Reducing Toxicity And Bias Xu Rongwu, Zhou Zi'an, Zhang Tianwei, Qi Zehan, Yao Su, Xu Ke, Xu Wei, Qiu Han
- MINDECHO: Role-playing Language Agents For Key Opinion Leaders Xu Rui, Lu Dakuan, Tan Xiaoyu, Wang Xintao, Yuan Siyu, Chen Jiangjie, Chu Wei, Xu Yinghui
- Is DPO Superior To PPO For LLM Alignment? A Comprehensive Study Xu Shusheng, Fu Wei, Gao Jiaxuan, Ye Wenjie, Liu Weilin, Mei Zhiyu, Wang Guangju, Yu Chao, Wu Yi
- Pride And Prejudice: LLM Amplifies Self-bias In Self-refinement Xu Wenda, Zhu Guanglei, Zhao Xuandong, Pan Liangming, Li Lei, Wang William Yang
- Relating The Seemingly Unrelated: Principled Understanding Of Generalization For Generative Models In Arithmetic Reasoning Tasks Xu Xingcheng, Zhao Zibo, Zhang Haipeng, Yang Yanqing
- Conveyor: Efficient Tool-aware LLM Serving With Tool Partial Execution Xu Yechen, Kong Xinhao, Chen Tingjun, Zhuo Danyang
- Linkprompt: Natural And Universal Adversarial Attacks On Prompt-based Language Models Xu Yue, Wang Wenjie
- Uncovering Safety Risks Of Large Language Models Through Concept Activation Vector Xu Zhihao, Huang Ruixuan, Chen Changyu, Wang Shuai, Wang Xiting
- Vision-flan: Scaling Human-labeled Tasks In Visual Instruction Tuning Xu Zhiyang, Feng Chao, Shao Rulin, Ashby Trevor, Shen Ying, Jin Di, Cheng Yu, Wang Qifan, Huang Lifu
- A Comprehensive Study Of Jailbreak Attack Versus Defense For Large Language Models Xu Zihao, Liu Yi, Deng Gelei, Li Yuekang, Picek Stjepan
- Empirical Evidence Of Large Language Model's Influence On Human Spoken Communication Yakura Hiromu, Lopez-lopez Ezequiel, Brinkmann Levin, Serna Ignacio, Gupta Prateek, Rahwan Iyad
- List Items One By One: A New Data Source And Learning Paradigm For Multimodal Llms Yan An, Yang Zhengyuan, Wu Junda, Zhu Wanrong, Yang Jianwei, Li Linjie, Lin Kevin, Wang Jianfeng, Mcauley Julian, Gao Jianfeng, Wang Lijuan
- LTNER: Large Language Model Tagging For Named Entity Recognition With Contextualized Entity Marking Yan Faren, Yu Peng, Chen Xin
- GPT-4 Vs. Human Translators: A Comprehensive Evaluation Of Translation Quality Across Languages, Domains, And Expertise Levels Yan Jianhao, Yan Pingchuan, Chen Yulong, Li Judy, Zhu Xianchao, Zhang Yue
- Advancing Genai Assisted Programming--a Comparative Study On Prompt Efficiency And Code Quality Between GPT-4 And GLM-4 Yang Angus, Li Zehan, Li Jie
- Toward Automatic Relevance Judgment Using Vision-language Models For Image-text Retrieval Evaluation Yang Jheng-hong, Lin Jimmy
- Large Language Model Tokenizer Bias: A Case Study And Solution On Gpt-4o Yang Jin, Wang Zhiqiang, Lin Yanbin, Zhao Zunduo
- Mastering Text-to-image Diffusion: Recaptioning, Planning, And Generating With Multimodal Llms Yang Ling, Yu Zhaochen, Meng Chenlin, Xu Minkai, Ermon Stefano, Cui Bin
- GPT-4 As Evaluator: Evaluating Large Language Models On Pest Management In Agriculture Yang Shanglong, Yuan Zhipeng, Li Shunbao, Peng Ruoling, Liu Kang, Yang Po
- Seed-story: Multimodal Long Story Generation With Large Language Model Yang Shuai, Ge Yuying, Li Yang, Chen Yukang, Ge Yixiao, Shan Ying, Chen Yingcong
- Is Crowdsourcing Breaking Your Bank? Cost-effective Fine-tuning Of Pre-trained Language Models With Proximal Policy Optimization Yang Shuo, Kasneci Gjergji
- Aqa-bench: An Interactive Benchmark For Evaluating Llms' Sequential Reasoning Ability Yang Siwei, Zhao Bingchen, Xie Cihang
- The Butterfly Effect Of Model Editing: Few Edits Can Trigger Large Language Models Collapse Yang Wanli, Sun Fei, Ma Xinyu, Liu Xun, Yin Dawei, Cheng Xueqi
- React Meets Actre: When Language Agents Enjoy Training Data Autonomy Yang Zonghan, Li Peng, Yan Ming, Zhang Ji, Huang Fei, Liu Yang
- CLAVE: An Adaptive Framework For Evaluating Values Of LLM Generated Responses Yao Jing, Yi Xiaoyuan, Xie Xing
- Exploiting Inter-layer Expert Affinity For Accelerating Mixture-of-experts Model Inference Yao Jinghan, Anthony Quentin, Shafi Aamir, Subramoni Hari, Panda Dhabaleswar K.
- Training Ultra Long Context Language Model With Fully Pipelined Distributed Transformer Yao Jinghan, Jacobs Sam Ade, Tanaka Masahiro, Ruwase Olatunji, Shafi Aamir, Subramoni Hari, Panda Dhabaleswar K.
- Large Language Models Are Contrastive Reasoners Yao Liang
- Self-judge: Selective Instruction Following With Alignment Self-evaluation Ye Hai, Ng Hwee Tou
- Toolsword: Unveiling Safety Issues Of Large Language Models In Tool Learning Across Three Stages Ye Junjie, Li Sixian, Li Guanyu, Huang Caishuang, Gao Songyang, Wu Yilong, Zhang Qi, Gui Tao, Huang Xuanjing
- Rotbench: A Multi-level Benchmark For Evaluating The Robustness Of Large Language Models In Tool Learning Ye Junjie, Wu Yilong, Gao Songyang, Huang Caishuang, Li Sixian, Li Guanyu, Fan Xiaoran, Zhang Qi, Gui Tao, Huang Xuanjing
- Dataframe QA: A Universal LLM Framework On Dataframe Question Answering Without Data Exposure Ye Junyi, Du Mengnan, Wang Guiling
- Towards Fast Multilingual LLM Inference: Speculative Decoding And Specialized Drafters Yi Euiin, Kim Taehyeon, Jeung Hongseok, Chang Du-seong, Yun Se-young
- Generation Meets Verification: Accelerating Large Language Model Inference With Smart Parallel Auto-correct Decoding Yi Hanling, Lin Feng, Li Hongbin, Ning Peiyang, Yu Xiaotian, Xiao Rong
- MEND: Meta Demonstration Distillation For Efficient And Effective In-context Learning Li Yichuan, Ma Xiyao, Lu Sixing, Lee Kyumin, Liu Xiaohu, Guo Chenlei
- Characterizing Truthfulness In Large Language Model Generations With Local Intrinsic Dimension Yin Fan, Srinivasa Jayanth, Chang Kai-wei
- Llms-as-instructors: Learning From Errors Toward Automating Model Improvement Ying Jiahao, Lin Mingbao, Cao Yixin, Tang Wei, Wang Bo, Sun Qianru, Huang Xuanjing, Yan Shuicheng
- Unveiling The Safety Of Gpt-4o: An Empirical Study Using Jailbreak Attacks Ying Zonghao, Liu Aishan, Liu Xianglong, Tao Dacheng
- Exploring The Comprehension Of Chatgpt In Traditional Chinese Medicine Knowledge Li Yizhen, Huang Shaohan, Qi Jiaxing, Quan Lei, Han Dongran, Luan Zhongzhi
- TLCR: Token-level Continuous Reward For Fine-grained Reinforcement Learning From Human Feedback Yoon Eunseop, Yoon Hee Suk, Eom Soohwan, Han Gunsoo, Nam Daniel Wontae, Jo Daejin, On Kyoung-woon, Hasegawa-johnson Mark A., Kim Sungwoong, Yoo Chang D.
- Making The Most Of Your Model: Methods For Finetuning And Applying Pretrained Transformers Yoshida Davis
- When Linear Attention Meets Autoregressive Decoding: Towards More Effective And Efficient Linearized Large Language Models You Haoran, Fu Yichao, Wang Zheng, Yazdanbakhsh Amir, Lin Yingyan Celine
- Ferret-ui: Grounded Mobile UI Understanding With Multimodal Llms You Keen, Zhang Haotian, Schoop Eldon, Weers Floris, Swearngin Amanda, Nichols Jeffrey, Yang Yinfei, Gan Zhe
- Llasmol: Advancing Large Language Models For Chemistry With A Large-scale, Comprehensive, High-quality Instruction Tuning Dataset Yu Botao, Baker Frazier N., Chen Ziqi, Ning Xia, Sun Huan
- Mechanistic Understanding And Mitigation Of Language Model Non-factual Hallucinations Yu Lei, Cao Meng, Cheung Jackie Chi Kit, Dong Yue
- Rankrag: Unifying Context Ranking With Retrieval-augmented Generation In Llms Yu Yue, Ping Wei, Liu Zihan, Wang Boxin, You Jiaxuan, Zhang Chao, Shoeybi Mohammad, Catanzaro Bryan
- A Continued Pretrained LLM Approach For Automatic Medical Note Generation Yuan Dong, Rastogi Eti, Naik Gautam, Rajagopal Sree Prasanna, Goyal Sagar, Zhao Fen, Chintagunta Bharath, Ward Jeff
- Towards A Holistic Evaluation Of Llms On Factual Knowledge Recall Yuan Jiaqing, Pan Lin, Hang Chung-wei, Guo Jiang, Jiang Jiarong, Min Bonan, Ng Patrick, Wang Zhiguo
- Advancing LLM Reasoning Generalists With Preference Trees Yuan Lifan, Cui Ganqu, Wang Hanbin, Ding Ning, Wang Xingyao, Deng Jia, Shan Boji, Chen Huimin, Xie Ruobing, Lin Yankai, Liu Zhenghao, Zhou Bowen, Peng Hao, Liu Zhiyuan, Sun Maosong
- Chatmusician: Understanding And Generating Music Intrinsically With LLM Yuan Ruibin, Lin Hanfeng, Wang Yi, Tian Zeyue, Wu Shangda, Shen Tianhao, Zhang Ge, Wu Yuhang, Liu Cong, Zhou Ziya, Ma Ziyang, Xue Liumeng, Wang Ziyu, Liu Qin, Zheng Tianyu, Li Yizhi, Ma Yinghao, Liang Yiming, Chi Xiaowei, Liu Ruibo, Wang Zili, Li Pengfei, Wu Jingcheng, Lin Chenghua, Liu Qifeng, Jiang Tao, Huang Wenhao, Chen Wenhu, Benetos Emmanouil, Fu Jie, Xia Gus, Dannenberg Roger, Xue Wei, Kang Shiyin, Guo Yike
- R-judge: Benchmarking Safety Risk Awareness For LLM Agents Yuan Tongxin, He Zhiwei, Dong Lingzhong, Wang Yiming, Zhao Ruijie, Xia Tian, Xu Lizhen, Zhou Binglin, Li Fangqi, Zhang Zhuosheng, Wang Rui, Liu Gongshen
- Self-rewarding Language Models Yuan Weizhe, Pang Richard Yuanzhe, Cho Kyunghyun, Li Xian, Sukhbaatar Sainbayar, Xu Jing, Weston Jason
- Rhyme-aware Chinese Lyric Generator Based On GPT Yuan Yixiao, Huang Yangchen, Ma Yu, Li Xinjin, Li Zhenglin, Shi Yiming, Zhou Huapeng
- Refuse Whenever You Feel Unsafe: Improving Safety In Llms Via Decoupled Refusal Training Yuan Youliang, Jiao Wenxiang, Wang Wenxuan, Huang Jen-tse, Xu Jiahao, Liang Tian, He Pinjia, Tu Zhaopeng
- Do Large Language Models Understand Conversational Implicature -- A Case Study With A Chinese Sitcom Yue Shisen, Song Siyuan, Cheng Xinyuan, Hu Hai
- Mammoth2: Scaling Instructions From The Web Yue Xiang, Zheng Tuney, Zhang Ge, Chen Wenhu
- Distilling Instruction-following Abilities Of Large Language Models With Task-aware Curriculum Planning Yue Yuanhao, Wang Chengyu, Huang Jun, Wang Peng
- Question-analysis Prompting Improves LLM Performance In Reasoning Tasks Yugeswardeenoo Dharunish, Zhu Kevin, O'brien Sean
- Can Small Language Models Be Good Reasoners For Sequential Recommendation? Wang Yuling, Tian Changxin, Hu Binbin, Yu Yanhua, Liu Ziqi, Zhang Zhiqiang, Zhou Jun, Pang Liang, Wang Xiao
- Prompting Implicit Discourse Relation Annotation Yung Frances, Ahmad Mansoor, Scholman Merel, Demberg Vera
- Rec-gpt4v: Multimodal Recommendation With Large Vision-language Models Liu Yuqing, Wang Yu, Sun Lichao, Yu Philip S.
- Turkishmmlu: Measuring Massive Multitask Language Understanding In Turkish Yüksel Arda, Köksal Abdullatif, Şenel Lütfi Kerem, Korhonen Anna, Schütze Hinrich
- WSC+: Enhancing The Winograd Schema Challenge Using Tree-of-experts Zahraei Pardis Sadat, Emami Ali
- Envgen: Generating And Adapting Environments Via Llms For Training Embodied Agents Zala Abhay, Cho Jaemin, Lin Han, Yoon Jaehong, Bansal Mohit
- Tell Me Why: Explainable Public Health Fact-checking With Large Language Models Zarharan Majid, Wullschleger Pascal, Kia Babak Behkam, Pilehvar Mohammad Taher, Foster Jennifer
- Unused Information In Token Probability Distribution Of Generative LLM: Improving LLM Reading Comprehension Through Calculation Of Expected Values Zawistowski Krystian
- Design Proteins Using Large Language Models: Enhancements And Comparative Analyses Zeinalipour Kamyar, Jamshidi Neda, Bianchini Monica, Maggini Marco, Gori Marco
- Automating Turkish Educational Quiz Generation Using Large Language Models Zeinalipour Kamyar, Keptiğ Yusuf Gökberk, Maggini Marco, Gori Marco
- Improving Logits-based Detector Without Logits From Black-box Llms Zeng Cong, Tang Shengkun, Yang Xianjun, Chen Yuanzhou, Sun Yiyou, Xu Zhiqiang, Li Yao, Chen Haifeng, Cheng Wei, Xu Dongkuan
- Skywork-math: Data Scaling Laws For Mathematical Reasoning In Large Language Models -- The Story Goes On Zeng Liang, Zhong Liangjun, Zhao Liang, Wei Tianwen, Yang Liu, He Jujie, Cheng Cheng, Hu Rui, Liu Yang, Yan Shuicheng, Fang Han, Zhou Yahui
- How Johnny Can Persuade Llms To Jailbreak Them: Rethinking Persuasion To Challenge AI Safety By Humanizing Llms Zeng Yi, Lin Hongpeng, Zhang Jingwen, Yang Diyi, Jia Ruoxi, Shi Weiyan
- Learning Reward For Robot Skills Using Large Language Models Via Self-alignment Zeng Yuwei, Mu Yao, Shao Lin
- MR-BEN: A Comprehensive Meta-reasoning Benchmark For Large Language Models Zeng Zhongshen, Liu Yinhong, Wan Yingjia, Li Jingyao, Chen Pengguang, Dai Jianbo, Yao Yuxuan, Xu Rongwu, Qi Zehan, Zhao Wanru, Shen Linling, Lu Jianqiao, Tan Haochen, Chen Yukang, Zhang Hao, Shi Zhan, Wang Bailin, Guo Zhijiang, Jia Jiaya
- Actions Speak Louder Than Words: Trillion-parameter Sequential Transducers For Generative Recommendations Zhai Jiaqi, Liao Lucy, Liu Xing, Wang Yueming, Li Rui, Cao Xuan, Gao Leon, Gong Zhaojie, Gu Fangda, He Michael, Lu Yinghai, Shi Yu
- Anygpt: Unified Multimodal LLM With Discrete Sequence Modeling Zhan Jun, Dai Junqi, Ye Jiasheng, Zhou Yunhua, Zhang Dong, Liu Zhigeng, Zhang Xin, Yuan Ruibin, Zhang Ge, Li Linyang, Yan Hang, Fu Jie, Gui Tao, Sun Tianxiang, Jiang Yugang, Qiu Xipeng
- Injecagent: Benchmarking Indirect Prompt Injections In Tool-integrated Large Language Model Agents Zhan Qiusi, Liang Zhixiang, Ying Zifan, Kang Daniel
- Skyeyegpt: Unifying Remote Sensing Vision-language Tasks Via Instruction Tuning With Large Language Model Zhan Yang, Xiong Zhitong, Yuan Yuan
- MC-GPT: Empowering Vision-and-language Navigation With Memory Map And Reasoning Chains Zhan Zhaohuan, Yu Lisha, Yu Sijie, Tan Guang
- Teaching Large Language Models An Unseen Language On The Fly Zhang Chen, Liu Xiao, Lin Jiuheng, Feng Yansong
- Chemllm: A Chemical Large Language Model Zhang Di, Liu Wei, Tan Qian, Chen Jingdan, Yan Hang, Yan Yuliang, Li Jiatong, Huang Weiran, Yue Xiangyu, Ouyang Wanli, Zhou Dongzhan, Zhang Shufei, Su Mao, Zhong Han-sen, Li Yuqiang
- Speechgpt-gen: Scaling Chain-of-information Speech Generation Zhang Dong, Zhang Xin, Zhan Jun, Li Shimin, Zhou Yaqian, Qiu Xipeng
- CMMMU: A Chinese Massive Multi-discipline Multimodal Understanding Benchmark Zhang Ge, Du Xinrun, Chen Bei, Liang Yiming, Luo Tongxu, Zheng Tianyu, Zhu Kang, Cheng Yuyang, Xu Chunpu, Guo Shuyue, Zhang Haoran, Qu Xingwei, Wang Junjie, Yuan Ruibin, Li Yizhi, Wang Zekun, Liu Yudong, Tsai Yu-hsuan, Zhang Fengji, Lin Chenghua, Huang Wenhao, Fu Jie
- Map-neo: Highly Capable And Transparent Bilingual Large Language Model Series Zhang Ge, Qu Scott, Liu Jiaheng, Zhang Chenchen, Lin Chenghua, Yu Chou Leuang, Pan Danny, Cheng Esther, Liu Jie, Lin Qunshu, Yuan Raven, Zheng Tuney, Pang Wei, Du Xinrun, Liang Yiming, Ma Yinghao, Li Yizhi, Ma Ziyang, Lin Bill, Benetos Emmanouil, Yang Huan, Zhou Junting, Ma Kaijing, Liu Minghao, Niu Morry, Wang Noah, Que Quehry, Liu Ruibo, Liu Sine, Guo Shawn, Gao Soren, Zhou Wangchunshu, Zhang Xinyue, Zhou Yizhi, Wang Yubo, Bai Yuelin, Zhang Yuhan, Zhang Yuxiang, Wang Zenith, Yang Zhenzhu, Zhao Zijian, Zhang Jiajun, Ouyang Wanli, Huang Wenhao, Chen Wenhu
- Closing The Gap Between Open-source And Commercial Large Language Models For Medical Evidence Summarization Zhang Gongbo, Jin Qiao, Zhou Yiliang, Wang Song, Idnay Betina R., Luo Yiming, Park Elizabeth, Nestor Jordan G., Spotnitz Matthew E., Soroush Ali, Campion Thomas, Lu Zhiyong, Weng Chunhua, Peng Yifan
- Adaptable Logical Control For Large Language Models Zhang Honghua, Kung Po-nien, Yoshida Masahiro, Broeck Guy Van Den, Peng Nanyun
- A Careful Examination Of Large Language Model Performance On Grade School Arithmetic Zhang Hugh, Da Jeff, Lee Dean, Robinson Vaughn, Wu Catherine, Song Will, Zhao Tiffany, Raja Pranav, Slack Dylan, Lyu Qin, Hendryx Sean, Kaplan Russell, Lunati Michele, Yue Summer
- Longcite: Enabling Llms To Generate Fine-grained Citations In Long-context QA Zhang Jiajie, Bai Yushi, Lv Xin, Gu Wanjun, Liu Danqing, Zou Minhao, Cao Shulin, Hou Lei, Dong Yuxiao, Feng Ling, Li Juanzi
- Xlam: A Family Of Large Action Models To Empower AI Agent Systems Zhang Jianguo, Lan Tian, Zhu Ming, Liu Zuxin, Hoang Thai, Kokane Shirley, Yao Weiran, Tan Juntao, Prabhakar Akshara, Chen Haolin, Liu Zhiwei, Feng Yihao, Awalgaonkar Tulika, Murthy Rithesh, Hu Eric, Chen Zeyuan, Xu Ran, Niebles Juan Carlos, Heinecke Shelby, Wang Huan, Savarese Silvio, Xiong Caiming
- Geoeval: Benchmark For Evaluating Llms And Multi-modal Models On Geometry Problem-solving Zhang Jiaxin, Li Zhongzhi, Zhang Mingliang, Yin Fei, Liu Chenglin, Moshfeghi Yashar
- Regurgitative Training: The Value Of Real Data In Training Large Language Models Zhang Jinghui, Qiao Dandan, Yang Mochen, Wei Qiang
- Onegen: Efficient One-pass Unified Generation And Retrieval For Llms Zhang Jintian, Peng Cheng, Sun Mengshu, Chen Xiang, Liang Lei, Zhang Zhiqiang, Zhou Jun, Chen Huajun, Zhang Ningyu
- Unimot: Unified Molecule-text Language Model With Discrete Token Representation Zhang Juzheng, Bian Yatao, Chen Yongqiang, Yao Quanming
- Hire A Linguist!: Learning Endangered Languages With In-context Linguistic Descriptions Zhang Kexun, Choi Yee Man, Song Zhenqiao, He Taiqi, Wang William Yang, Li Lei
- Predicting Learning Performance With Large Language Models: A Study In Adult Literacy Zhang Liang, Lin Jionghao, Borchers Conrad, Sabatini John, Hollander John, Cao Meng, Hu Xiangen
- SPL: A Socratic Playground For Learning Powered By Large Language Model Zhang Liang, Lin Jionghao, Kuang Ziyi, Xu Sheng, Yeasin Mohammed, Hu Xiangen
- Enhancing Multi-hop Reasoning Through Knowledge Erasure In Large Language Model Editing Zhang Mengqi, Fang Bowen, Liu Qiang, Ren Pengjie, Wu Shu, Chen Zhumin, Wang Liang
- The Impact Of Demonstrations On Multilingual In-context Learning: A Multidimensional Analysis Zhang Miaoran, Gautam Vagrant, Wang Mingyang, Alabi Jesujoba O., Shen Xiaoyu, Klakow Dietrich, Mosbach Marius
- The Hedgehog & The Porcupine: Expressive Linear Attentions With Softmax Mimicry Zhang Michael, Bhatia Kush, Kumbong Hermann, Ré Christopher
- Internlm-xcomposer-2.5: A Versatile Large Vision Language Model Supporting Long-contextual Input And Output Zhang Pan, Dong Xiaoyi, Zang Yuhang, Cao Yuhang, Qian Rui, Chen Lin, Guo Qipeng, Duan Haodong, Wang Bin, Ouyang Linke, Zhang Songyang, Zhang Wenwei, Li Yining, Gao Yang, Sun Peng, Zhang Xinyue, Li Wei, Li Jingwen, Wang Wenhai, Yan Hang, He Conghui, Zhang Xingcheng, Chen Kai, Dai Jifeng, Qiao Yu, Lin Dahua, Wang Jiaqi
- Extending Llama-3's Context Ten-fold Overnight Zhang Peitian, Shao Ninglu, Liu Zheng, Xiao Shitao, Qian Hongjin, Ye Qiwei, Dou Zhicheng
- Tinyllama: An Open-source Small Language Model Zhang Peiyuan, Zeng Guangtao, Wang Tianduo, Lu Wei
- Look Ahead Or Look Around? A Theoretical Comparison Between Autoregressive And Masked Pretraining Zhang Qi, Du Tianqi, Huang Haotian, Wang Yifei, Wang Yisen
- RECOST: External Knowledge Guided Data-efficient Instruction Tuning Zhang Qi, Zhang Yiming, Wang Haobo, Zhao Junbo
- Mathverse: Does Your Multi-modal LLM Truly See The Diagrams In Visual Math Problems? Zhang Renrui, Jiang Dongzhi, Zhang Yichi, Lin Haokun, Guo Ziyu, Qiu Pengshuo, Zhou Aojun, Lu Pan, Chang Kai-wei, Gao Peng, Li Hongsheng
- Direct Preference Optimization Of Video Large Multimodal Models From Language Model Reward Zhang Ruohong, Gui Liangke, Sun Zhiqing, Feng Yihao, Xu Keyang, Zhang Yuanhan, Fu Di, Li Chunyuan, Hauptmann Alexander, Bisk Yonatan, Yang Yiming
- In-context Principle Learning From Mistakes Zhang Tianjun, Madaan Aman, Gao Luyu, Zheng Steven, Mishra Swaroop, Yang Yiming, Tandon Niket, Alon Uri
- Dólares Or Dollars? Unraveling The Bilingual Prowess Of Financial Llms Between Spanish And English Zhang Xiao, Xiang Ruoyu, Yuan Chenhan, Feng Duanyu, Han Weiguang, Lopez-lira Alejandro, Liu Xiao-yang, Ananiadou Sophia, Peng Min, Huang Jimin, Xie Qianqian
- Large Language Models As Evaluators For Recommendation Explanations Zhang Xiaoyu, Li Yishan, Wang Jiayin, Sun Bowen, Ma Weizhi, Sun Peijie, Zhang Min
- Pushing The Limit Of LLM Capacity For Text Classification Zhang Yazhou, Wang Mengyao, Ren Chenyu, Li Qiuchi, Tiwari Prayag, Wang Benyou, Qin Jing
- Mme-realworld: Could Your Multimodal LLM Challenge High-resolution Real-world Scenarios That Are Difficult For Humans? Zhang Yi-fan, Zhang Huanyu, Tian Haochen, Fu Chaoyou, Zhang Shuangqing, Wu Junfei, Li Feng, Wang Kun, Wen Qingsong, Zhang Zhang, Wang Liang, Jin Rong, Tan Tieniu
- How Far Are We From Intelligent Visual Deductive Reasoning? Zhang Yizhe, Bai He, Zhang Ruixiang, Gu Jiatao, Zhai Shuangfei, Susskind Josh, Jaitly Navdeep
- Navigating User Experience Of Chatgpt-based Conversational Recommender Systems: The Effects Of Prompt Guidance And Recommendation Domain Zhang Yizhe, Jin Yucheng, Chen Li, Yang Ting
- Leveraging Biases In Large Language Models: "bias-knn" For Effective Few-shot Learning Zhang Yong, Li Hanzhang, Li Zhitao, Cheng Ning, Li Ming, Xiao Jing, Wang Jianzong
- Small Language Models Need Strong Verifiers To Self-correct Reasoning Zhang Yunxiang, Khalifa Muhammad, Logeswaran Lajanugen, Kim Jaekyeom, Lee Moontae, Lee Honglak, Wang Lu
- Intention Analysis Makes Llms A Good Jailbreak Defender Zhang Yuqi, Ding Liang, Zhang Lefei, Tao Dacheng
- Toolbehonest: A Multi-level Hallucination Diagnostic Benchmark For Tool-augmented Large Language Models Zhang Yuxiang, Chen Jing, Wang Junjie, Liu Yaxin, Yang Cheng, Shi Chufan, Zhu Xinyu, Lin Zihao, Wan Hanwen, Yang Yujiu, Sakai Tetsuya, Feng Tian, Yamana Hayato
- PPTC-R Benchmark: Towards Evaluating The Robustness Of Large Language Models For Powerpoint Task Completion Zhang Zekai, Guo Yiduo, Liang Yaobo, Zhao Dongyan, Duan Nan
- Consistency Matters: Explore Llms Consistency From A Black-box Perspective Zhao Fufangchen, Jin Guoqiang, Huang Jiaheng, Zhao Rui, Tan Fei
- LANE: Logic Alignment Of Non-tuning Large Language Models And Online Recommendation Systems For Explainable Reason Generation Zhao Hongke, Zheng Songming, Wu Likang, Yu Bowen, Wang Jing
- Revolutionizing Finance With Llms: An Overview Of Applications And Insights Zhao Huaqin, Liu Zhengliang, Wu Zihao, Li Yiwei, Yang Tianze, Shu Peng, Xu Shaochen, Dai Haixing, Zhao Lin, Mai Gengchen, Liu Ninghao, Liu Tianming
- Llama Beyond English: An Empirical Study On Language Capability Transfer Zhao Jun, Zhang Zhihao, Gao Luhui, Zhang Qi, Gui Tao, Huang Xuanjing
- Longagent: Scaling Language Models To 128k Context Through Multi-agent Collaboration Zhao Jun, Zu Can, Xu Hao, Lu Yi, He Wei, Ding Yiwen, Gui Tao, Zhang Qi, Huang Xuanjing
- Lora Land: 310 Fine-tuned Llms That Rival GPT-4, A Technical Report Zhao Justin, Wang Timothy, Abid Wael, Angus Geoffrey, Garg Arnav, Kinnison Jeffery, Sherstinsky Alex, Molino Piero, Addair Travis, Rishi Devvret
- Mitigating Object Hallucination In Large Vision-language Models Via Classifier-free Guidance Zhao Linxi, Deng Yihe, Zhang Weitong, Gu Quanquan
- Prepacking: A Simple Method For Fast Prefilling And Increased Throughput In Large Language Models Zhao Siyan, Israel Daniel, Broeck Guy Van Den, Grover Aditya
- Enhancing Cross-domain Pre-trained Decision Transformers With Adaptive Attention Zhao Wenhao, Xu Qiushui, Xu Linjie, Song Lei, Wang Jinyu, Zhou Chunlai, Bian Jiang
- I Could've Asked That: Reformulating Unanswerable Questions Zhao Wenting, Gao Ge, Cardie Claire, Rush Alexander M.
- Wildchat: 1M Chatgpt Interaction Logs In The Wild Zhao Wenting, Ren Xiang, Hessel Jack, Cardie Claire, Choi Yejin, Deng Yuntian
- Codev: Empowering Llms For Verilog Generation Through Multi-level Summarization Zhao Yang, Huang Di, Li Chongxiao, Jin Pengwei, Nan Ziyuan, Ma Tianyun, Qi Lei, Pan Yansong, Zhang Zhenxing, Zhang Rui, Zhang Xishan, Du Zidong, Guo Qi, Hu Xing, Chen Yunji
- ALISA: Accelerating Large Language Model Inference Via Sparsity-aware KV Caching Zhao Youpeng, Wu Di, Wang Jun
- An Empirical Study Of Retrieval Augmented Generation With Chain-of-thought Zhao Yuetong, Cao Hongyu, Zhao Xianyu, Ou Zhijian
- Chemdfm: Dialogue Foundation Model For Chemistry Zhao Zihan, Ma Da, Chen Lu, Sun Liangtai, Li Zihao, Xu Hongshen, Zhu Zichen, Zhu Su, Fan Shuai, Shen Guodong, Chen Xin, Yu Kai
- Leveraging Large Language Models With Chain-of-thought And Prompt Engineering For Traffic Crash Severity Analysis And Inference Zhen Hao, Shi Yucheng, Huang Yongcan, Yang Jidong J., Liu Ninghao
- Gpt-4v(ision) Is A Generalist Web Agent, If Grounded Zheng Boyuan, Gou Boyu, Kil Jihyung, Sun Huan, Su Yu
- ICE-GRT: Instruction Context Enhancement By Generative Reinforcement Based Transformers Zheng Chen, Sun Ke, Tang Da, Ma Yukun, Zhang Yuyu, Xi Chenguang, Zhou Xun
- Mistral-c2f: Coarse To Fine Actor For Analytical And Reasoning Enhancement In RLHF And Effective-merged Llms Zheng Chen, Sun Ke, Zhou Xun
- Large Language Models As Reliable Knowledge Bases? Zheng Danna, Lapata Mirella, Pan Jeff Z.
- Opencodeinterpreter: Integrating Code Generation With Execution And Refinement Zheng Tianyu, Zhang Ge, Shen Tianhao, Liu Xueling, Lin Bill Yuchen, Fu Jie, Chen Wenhu, Yue Xiang
- Attention Heads Of Large Language Models: A Survey Zheng Zifan, Wang Yezhaohui, Huang Yuxin, Song Shichao, Tang Bo, Xiong Feiyu, Li Zhiyu
- Exploring Boundary Of GPT-4V On Marine Analysis: A Preliminary Case Study Zheng Ziqiang, Chen Yiwei, Zhang Jipeng, Vu Tuan-anh, Zeng Huimin, Tim Yue Him Wong, Yeung Sai-kit
- Revisiting Knowledge Distillation For Autoregressive Language Models Zhong Qihuang, Ding Liang, Shen Li, Liu Juhua, Du Bo, Tao Dacheng
- Robust Prompt Optimization For Defending Language Models Against Jailbreaking Attacks Zhou Andy, Li Bo, Wang Haohan
- Navgpt-2: Unleashing Navigational Reasoning Capability For Large Vision-language Models Zhou Gengze, Hong Yicong, Wang Zun, Wang Xin Eric, Wu Qi
- Leveraging Web-crawled Data For High-quality Fine-tuning Zhou Jing, Jiang Chenglin, Shen Wei, Zhou Xiao, He Xiaonan
- Self-discover: Large Language Models Self-compose Reasoning Structures Zhou Pei, Pujara Jay, Ren Xiang, Chen Xinyun, Cheng Heng-tze, Le Quoc V., Chi Ed H., Zhou Denny, Mishra Swaroop, Zheng Huaixiu Steven
- Enhancing The General Agent Capabilities Of Low-parameter Llms Through Tuning And Multi-branch Reasoning Zhou Qinhao, Zhang Zihan, Xiang Xiang, Wang Ke, Wu Yuchuan, Li Yongbin
- Evaluating The Smooth Control Of Attribute Intensity In Text Generation With Llms Zhou Shang, Yao Feng, Dong Chengyu, Wang Zihan, Shang Jingbo
- Using Pretrained Large Language Model With Prompt Engineering To Answer Biomedical Questions Zhou Wenxin, Ngo Thuy Hang
- Aligning Modalities In Vision Large Language Models Via Preference Fine-tuning Zhou Yiyang, Cui Chenhang, Rafailov Rafael, Finn Chelsea, Yao Huaxiu
- Metagpt: Merging Large Language Models Using Model Exclusive Task Arithmetic Zhou Yuyan, Song Liang, Wang Bingning, Chen Weipeng
- Lawgpt: A Chinese Legal Knowledge-enhanced Large Language Model Zhou Zhi, Shi Jiang-xin, Song Peng-xiao, Yang Xiao-wen, Jin Yi-xuan, Guo Lan-zhe, Li Yu-feng
- Can Llms "reason" In Music? An Evaluation Of Llms' Capability Of Music Understanding And Generation Zhou Ziya, Wu Yuhang, Wu Zhiyue, Zhang Xinyue, Yuan Ruibin, Ma Yinghao, Wang Lu, Benetos Emmanouil, Xue Wei, Guo Yike
- TAT-LLM: A Specialized Language Model For Discrete Reasoning Over Tabular And Textual Data Zhu Fengbin, Liu Ziyang, Feng Fuli, Wang Chao, Li Moxin, Chua Tat-seng
- FANNO: Augmenting High-quality Instruction Data With Open-sourced Llms Only Zhu He, Su Junyou, Lun Tianle, Tao Yicheng, Zhang Wenjia, Fan Zipei, Chen Guanhua
- Eliciting The Priors Of Large Language Models Using Iterated In-context Learning Zhu Jian-qiao, Griffiths Thomas L.
- Benchmarking Large Language Models On CFLUE -- A Chinese Financial Language Understanding Evaluation Dataset Zhu Jie, Li Junhui, Wen Yalong, Guo Lifan
- Cost: Contrastive Quantization Based Semantic Tokenization For Generative Recommendation Zhu Jieming, Jin Mengqun, Liu Qijiong, Qiu Zexuan, Dong Zhenhua, Li Xiu
- Quite Good, But Not Enough: Nationality Bias In Large Language Models -- A Case Study Of Chatgpt Zhu Shucheng, Wang Weikang, Liu Ying
- How Privacy-savvy Are Large Language Models? A Case Study On Compliance And Privacy Technical Review Zhu Xichou, Liu Yang, Shen Zhou, Liu Yi, Li Min, Chen Yujun, John Benzi, Ma Zhenzhen, Hu Tao, Yang Bolong, Wang Manman, Xie Zongxing, Liu Peng, Cai Dan, Wang Junhui
- KG-FPQ: Evaluating Factuality Hallucination In Llms With Knowledge Graph-based False Premise Questions Zhu Yanxu, Xiao Jinlin, Wang Yuhang, Sang Jitao
- Apt-pipe: A Prompt-tuning Tool For Social Data Annotation Using Chatgpt Zhu Yiming, Yin Zhizhuo, Tyson Gareth, Haq Ehsan-ul, Lee Lik-hang, Hui Pan
- Is Larger Always Better? Evaluating And Prompting Large Language Models For Non-generative Medical Tasks Zhu Yinghao, Gao Junyi, Wang Zixiang, Liao Weibin, Zheng Xiaochen, Liang Lifang, Wang Yasha, Pan Chengwei, Harrison Ewen M., Ma Liantao
- Prompting Large Language Models For Zero-shot Clinical Prediction With Structured Longitudinal Electronic Health Record Data Zhu Yinghao, Wang Zixiang, Gao Junyi, Tong Yuning, An Jingkun, Liao Weibin, Harrison Ewen M., Ma Liantao, Pan Chengwei
- Are Large Language Models Good Statisticians? Zhu Yizhang, Du Shiyin, Li Boyan, Luo Yuyu, Tang Nan
- Halueval-wild: Evaluating Hallucinations Of Language Models In The Wild Zhu Zhiying, Yang Yiming, Sun Zhiqing
- Structlm: Towards Building Generalist Models For Structured Knowledge Grounding Zhuang Alex, Zhang Ge, Zheng Tianyu, Du Xinrun, Wang Junjie, Ren Weiming, Huang Stephen W., Fu Jie, Yue Xiang, Chen Wenhu
- VL-ICL Bench: The Devil In The Details Of Benchmarking Multimodal In-context Learning Zong Yongshuo, Bohdal Ondrej, Hospedales Timothy
- Image-to-text Logic Jailbreak: Your Imagination Can Help You Do Anything Zou Xiaotian, Li Ke, Chen Yongkang
- BASE TTS: Lessons From Building A Billion-parameter Text-to-speech Model On 100K Hours Of Data Łajszczak Mateusz, Cámbara Guillermo, Li Yang, Beyhan Fatih, Van Korlaar Arent, Yang Fan, Joly Arnaud, Martín-cortinas Álvaro, Abbas Ammar, Michalski Adam, Moinet Alexis, Karlapati Sri, Muszyńska Ewa, Guo Haohan, Putrycz Bartosz, Gambino Soledad López, Yoo Kayeon, Sokolova Elena, Drugman Thomas
🏷 Has Code
- Visual Dialog Das Abhishek, Kottur Satwik, Gupta Khushi, Singh Avi, Yadav Deshraj, Moura José M. F., Parikh Devi, Batra Dhruv
- Table-to-text Generation By Structure-aware Seq2seq Learning Liu Tianyu, Wang Kexiang, Sha Lei, Chang Baobao, Sui Zhifang
- Best Of Both Worlds: Transferring Knowledge From Discriminative Learning To A Generative Visual Dialog Model Lu Jiasen, Kannan Anitha, Yang Jianwei, Parikh Devi, Batra Dhruv
- Parlai: A Dialog Research Software Platform Miller Alexander H., Feng Will, Fisch Adam, Lu Jiasen, Batra Dhruv, Bordes Antoine, Parikh Devi, Weston Jason
- Code2seq: Generating Sequences From Structured Representations Of Code Alon Uri, Brody Shaked, Levy Omer, Yahav Eran
- Attention Based Natural Language Grounding By Navigating Virtual Environment B Akilesh, Sinha Abhishek, Sarkar Mausoom, Krishnamurthy Balaji
- Recursive Visual Attention In Visual Dialog Niu Yulei, Zhang Hanwang, Zhang Manli, Zhang Jianhong, Lu Zhiwu, Wen Ji-rong
- A BERT Baseline For The Natural Questions Alberti Chris, Lee Kenton, Collins Michael
- Fusion Of Detected Objects In Text For Visual Question Answering Alberti Chris, Ling Jeffrey, Collins Michael, Reitter David
- Distilling Knowledge Learned In BERT For Text Generation Chen Yen-chun, Gan Zhe, Cheng Yu, Liu Jingzhou, Liu Jingjing
- Cross-lingual Natural Language Generation Via Pre-training Chi Zewen, Dong Li, Wei Furu, Wang Wenhui, Mao Xian-ling, Huang Heyan
- Cross-lingual Machine Reading Comprehension Cui Yiming, Che Wanxiang, Liu Ting, Qin Bing, Wang Shijin, Hu Guoping
- Unified Language Model Pre-training For Natural Language Understanding And Generation Dong Li, Yang Nan, Wang Wenhui, Wei Furu, Liu Xiaodong, Wang Yu, Gao Jianfeng, Zhou Ming, Hon Hsiao-wuen
- Attention Is Not Explanation Jain Sarthak, Wallace Byron C.
- CTRL: A Conditional Transformer Language Model For Controllable Generation Keskar Nitish Shirish, Mccann Bryan, Varshney Lav R., Xiong Caiming, Socher Richard
- ALBERT: A Lite BERT For Self-supervised Learning Of Language Representations Lan Zhenzhong, Chen Mingda, Goodman Sebastian, Gimpel Kevin, Sharma Piyush, Soricut Radu
- Relevance-promoting Language Model For Short-text Conversation Li Xin, Li Piji, Bi Wei, Liu Xiaojiang, Lam Wai
- Triplenet: Triple Attention Network For Multi-turn Response Selection In Retrieval-based Chatbots Ma Wentao, Cui Yiming, Shao Nan, He Su, Zhang Wei-nan, Liu Ting, Wang Shijin, Hu Guoping
- Let's Ask Again: Refine Network For Automatic Question Generation Nema Preksha, Mohankumar Akash Kumar, Khapra Mitesh M., Srinivasan Balaji Vasan, Ravindran Balaraman
- Efficient Attention Mechanism For Visual Dialog That Can Handle All The Interactions Between Multiple Inputs Nguyen Van-quang, Suganuma Masanori, Okatani Takayuki
- Language Models As Knowledge Bases? Petroni Fabio, Rocktäschel Tim, Lewis Patrick, Bakhtin Anton, Wu Yuxiang, Miller Alexander H., Riedel Sebastian
- Masked Language Model Scoring Salazar Julian, Liang Davis, Nguyen Toan Q., Kirchhoff Katrin
- Flexibly-structured Model For Task-oriented Dialogues Shu Lei, Molino Piero, Namazifar Mahdi, Xu Hu, Liu Bing, Zheng Huaixiu, Tur Gokhan
- VL-BERT: Pre-training Of Generic Visual-linguistic Representations Su Weijie, Zhu Xizhou, Cao Yue, Li Bin, Lu Lewei, Wei Furu, Dai Jifeng
- LXMERT: Learning Cross-modality Encoder Representations From Transformers Tan Hao, Bansal Mohit
- Repurposing Entailment For Multi-hop Question Answering Tasks Trivedi Harsh, Kwon Heeyoung, Khot Tushar, Sabharwal Ashish, Balasubramanian Niranjan
- Allennlp Interpret: A Framework For Explaining Predictions Of NLP Models Wallace Eric, Tuyls Jens, Wang Junlin, Subramanian Sanjay, Gardner Matt, Singh Sameer
- Improving Knowledge-aware Dialogue Generation Via Knowledge Base Question Answering Wang Jian, Liu Junhao, Bi Wei, Liu Xiaojiang, He Kejing, Xu Ruifeng, Yang Min
- Huggingface's Transformers: State-of-the-art Natural Language Processing Wolf Thomas, Debut Lysandre, Sanh Victor, Chaumond Julien, Delangue Clement, Moi Anthony, Cistac Pierric, Rault Tim, Louf Rémi, Funtowicz Morgan, Davison Joe, Shleifer Sam, Von Platen Patrick, Ma Clara, Jernite Yacine, Plu Julien, Xu Canwen, Scao Teven Le, Gugger Sylvain, Drame Mariama, Lhoest Quentin, Rush Alexander M.
- Visual Entailment: A Novel Task For Fine-grained Image Understanding Xie Ning, Lai Farley, Doran Derek, Kadav Asim
- Review Conversational Reading Comprehension Xu Hu, Liu Bing, Shu Lei, Yu Philip S.
- Towards Making The Most Of BERT In Neural Machine Translation Yang Jiacheng, Wang Mingxuan, Zhou Hao, Zhao Chengqi, Yu Yong, Zhang Weinan, Li Lei
- MUSE: Parallel Multi-scale Attention For Sequence To Sequence Learning Zhao Guangxiang, Sun Xu, Xu Jingjing, Zhang Zhiyuan, Luo Liangchen
- Unified Vision-language Pre-training For Image Captioning And VQA Zhou Luowei, Palangi Hamid, Zhang Lei, Hu Houdong, Corso Jason J., Gao Jianfeng
- How Much Knowledge Can You Pack Into The Parameters Of A Language Model? Roberts Adam, Raffel Colin, Shazeer Noam
- Fluent Response Generation For Conversational Question Answering Baheti Ashutosh, Ritter Alan, Small Kevin
- Reading Between The Lines: Exploring Infilling In Visual Narratives Chandu Khyathi Raghavi, Dong Ruo-ping, Black Alan
- Logical Natural Language Generation From Open-domain Tables Chen Wenhu, Chen Jianshu, Su Yu, Chen Zhiyu, Wang William Yang
- CERT: Contrastive Self-supervised Learning For Language Understanding Fang Hongchao, Wang Sicheng, Zhou Meng, Ding Jiayuan, Xie Pengtao
- Improvement Of A Dedicated Model For Open Domain Persona-aware Dialogue Generation Han Qiang
- Sub-instruction Aware Vision-and-language Navigation Hong Yicong, Rodriguez-opazo Cristian, Wu Qi, Gould Stephen
- How Can We Know When Language Models Know? On The Calibration Of Language Models For Question Answering Jiang Zhengbao, Araki Jun, Ding Haibo, Neubig Graham
- Common Sense Or World Knowledge? Investigating Adapter-based Knowledge Injection Into Pretrained Transformers Lauscher Anne, Majewska Olga, Ribeiro Leonardo F. R., Gurevych Iryna, Rozanov Nikolai, Glavaš Goran
- MART: Memory-augmented Recurrent Transformer For Coherent Video Paragraph Captioning Lei Jie, Wang Liwei, Shen Yelong, Yu Dong, Berg Tamara L., Bansal Mohit
- Neural Machine Translation With Joint Representation Li Yanyang, Wang Qiang, Xiao Tong, Liu Tongran, Zhu Jingbo
- Understanding The Difficulty Of Training Transformers Liu Liyuan, Liu Xiaodong, Gao Jianfeng, Chen Weizhu, Han Jiawei
- Adversarial Training For Large Neural Language Models Liu Xiaodong, Cheng Hao, He Pengcheng, Chen Weizhu, Wang Yu, Poon Hoifung, Gao Jianfeng
- Very Deep Transformers For Neural Machine Translation Liu Xiaodong, Duh Kevin, Liu Liyuan, Gao Jianfeng
- Lite Training Strategies For Portuguese-english And English-portuguese Translation Lopes Alexandre, Nogueira Rodrigo, Lotufo Roberto, Pedrini Helio
- Charbert: Character-aware Pre-trained Language Model Ma Wentao, Cui Yiming, Si Chenglei, Liu Ting, Wang Shijin, Hu Guoping
- Delight: Deep And Light-weight Transformer Mehta Sachin, Ghazvininejad Marjan, Iyer Srinivasan, Zettlemoyer Luke, Hajishirzi Hannaneh
- On The Stability Of Fine-tuning BERT: Misconceptions, Explanations, And Strong Baselines Mosbach Marius, Andriushchenko Maksym, Klakow Dietrich
- X-linear Attention Networks For Image Captioning Pan Yingwei, Yao Ting, Li Yehao, Mei Tao
- SOLOIST: Building Task Bots At Scale With Transfer Learning And Machine Teaching Peng Baolin, Li Chunyuan, Li Jinchao, Shayandeh Shahin, Liden Lars, Gao Jianfeng
- Nlpgym -- A Toolkit For Evaluating RL Agents On Natural Language Processing Tasks Ramamurthy Rajkumar, Sifa Rafet, Bauckhage Christian
- Prover: Proof Generation For Interpretable Reasoning Over Rules Saha Swarnadeep, Ghosh Sayan, Srivastava Shashank, Bansal Mohit
- Russiansuperglue: A Russian Language Understanding Evaluation Benchmark Shavrina Tatiana, Fenogenova Alena, Emelyanov Anton, Shevelev Denis, Artemova Ekaterina, Malykh Valentin, Mikhailov Vladislav, Tikhonova Maria, Chertok Andrey, Evlampiev Andrey
- Pre-trained Summarization Distillation Shleifer Sam, Rush Alexander M.
- Unnatural Language Inference Sinha Koustuv, Parthasarathi Prasanna, Pineau Joelle, Williams Adina
- An Investigation Of Language Model Interpretability Via Sentence Editing Stevens Samuel, Su Yu
- Multi-hop Question Generation With Graph Convolutional Network Su Dan, Xu Yan, Dai Wenliang, Ji Ziwei, Yu Tiezheng, Fung Pascale
- Vokenization: Improving Language Understanding With Contextualized, Visual-grounded Supervision Tan Hao, Bansal Mohit
- The Language Interpretability Tool: Extensible, Interactive Visualizations And Analysis For NLP Models Tenney Ian, Wexler James, Bastings Jasmijn, Bolukbasi Tolga, Coenen Andy, Gehrmann Sebastian, Jiang Ellen, Pushkarna Mahima, Radebaugh Carey, Reif Emily, Yuan Ann
- Infobert: Improving Robustness Of Language Models From An Information Theoretic Perspective Wang Boxin, Wang Shuohang, Cheng Yu, Gan Zhe, Jia Ruoxi, Li Bo, Liu Jingjing
- HAT: Hardware-aware Transformers For Efficient Natural Language Processing Wang Hanrui, Wu Zhanghao, Liu Zhijian, Cai Han, Zhu Ligeng, Gan Chuang, Han Song
- Lightseq: A High Performance Inference Library For Transformers Wang Xiaohui, Xiong Ying, Wei Yang, Wang Mingxuan, Li Lei
- VD-BERT: A Unified Vision And Dialog Transformer With BERT Wang Yue, Joty Shafiq, Lyu Michael R., King Irwin, Xiong Caiming, Hoi Steven C. H.
- Lite Transformer With Long-short Range Attention Wu Zhanghao, Liu Zhijian, Lin Ji, Lin Yujun, Han Song
- Exploring Question-specific Rewards For Generating Deep Questions Xie Yuxi, Pan Liangming, Wang Dongzhe, Kan Min-yen, Feng Yansong
- Just Ask: Learning To Answer Questions From Millions Of Narrated Videos Yang Antoine, Miech Antoine, Sivic Josef, Laptev Ivan, Schmid Cordelia
- Seqdialn: Sequential Visual Dialog Networks In Joint Visual-linguistic Representation Space Yang Liu
- On The Generation Of Medical Dialogues For COVID-19 Yang Wenmian, Zeng Guangtao, Tan Bowen, Ju Zeqian, Chakravorty Subrato, He Xuehai, Chen Shu, Yang Xingyi, Wu Qingyang, Yu Zhou, Xing Eric, Xie Pengtao
- POINTER: Constrained Progressive Text Generation Via Insertion-based Generative Pre-training Zhang Yizhe, Wang Guoyin, Li Chunyuan, Gan Zhe, Brockett Chris, Dolan Bill
- CPM: A Large-scale Generative Chinese Pre-trained Language Model Zhang Zhengyan, Han Xu, Zhou Hao, Ke Pei, Gu Yuxian, Ye Deming, Qin Yujia, Su Yusheng, Ji Haozhe, Guan Jian, Qi Fanchao, Wang Xiaozhi, Zheng Yanan, Zeng Guoyang, Cao Huanqi, Chen Shengqi, Li Daixuan, Sun Zhenbo, Liu Zhiyuan, Huang Minlie, Han Wentao, Tang Jie, Li Juanzi, Zhu Xiaoyan, Sun Maosong
- S2s-ft: Fine-tuning Pretrained Transformer Encoders For Sequence-to-sequence Learning Bao Hangbo, Dong Li, Wang Wenhui, Yang Nan, Wei Furu
- CDLM: Cross-document Language Modeling Caciularu Avi, Cohan Arman, Beltagy Iz, Peters Matthew E., Cattan Arie, Dagan Ido
- Visualgpt: Data-efficient Adaptation Of Pretrained Language Models For Image Captioning Chen Jun, Guo Han, Yi Kai, Li Boyang, Elhoseiny Mohamed
- Unifying Vision-and-language Tasks Via Text Generation Cho Jaemin, Lei Jie, Tan Hao, Bansal Mohit
- Knowledge Neurons In Pretrained Transformers Dai Damai, Dong Li, Hao Yaru, Sui Zhifang, Chang Baobao, Wei Furu
- Zero-shot Recommendation As Language Modeling Damien Sileo, Wout Vossen, Robbe Raymaekers
- Openprompt: An Open-source Framework For Prompt-learning Ding Ning, Hu Shengding, Zhao Weilin, Chen Yulin, Liu Zhiyuan, Zheng Hai-tao, Sun Maosong
- Is GPT-3 Text Indistinguishable From Human Text? Scarecrow: A Framework For Scrutinizing Machine Text Dou Yao, Forbes Maxwell, Koncel-kedziorski Rik, Smith Noah A., Choi Yejin
- An Empirical Study Of Training End-to-end Vision-and-language Transformers Dou Zi-yi, Xu Yichong, Gan Zhe, Wang Jianfeng, Wang Shuohang, Wang Lijuan, Zhu Chenguang, Zhang Pengchuan, Yuan Lu, Peng Nanyun, Liu Zicheng, Zeng Michael
- Text2app: A Framework For Creating Android Apps From Text Descriptions Hasan Masum, Mehrab Kazi Sajeed, Ahmad Wasi Uddin, Shahriyar Rifat
- ELIT: Emory Language And Information Toolkit He Han, Xu Liyan, Choi Jinho D.
- ENCONTER: Entity Constrained Progressive Sequence Generation Via Insertion-based Transformer Hsieh Lee-hsun, Lee Yang-yin, Lim Ee-peng
- The Niutrans System For WNGT 2020 Efficiency Task Hu Chi, Li Bei, Lin Ye, Li Yinqiao, Li Yanyang, Wang Chenglong, Xiao Tong, Zhu Jingbo
- Lora: Low-rank Adaptation Of Large Language Models Hu Edward J., Shen Yelong, Wallis Phillip, Allen-zhu Zeyuan, Li Yuanzhi, Wang Shean, Wang Lu, Chen Weizhu
- Investigating Transfer Learning In Multilingual Pre-trained Language Models Through Chinese Natural Language Inference Hu Hai, Zhou He, Tian Zuoyu, Zhang Yiwen, Ma Yina, Li Yanting, Nie Yixin, Richardson Kyle
- SILT: Efficient Transformer Training For Inter-lingual Inference Huertas-tato Javier, Martín Alejandro, Camacho David
- Evaluating Pretrained Transformer Models For Entity Linking In Task-oriented Dialog Jayanthi Sai Muralidhar, Embar Varsha, Raghunathan Karthik
- Inducing Transformer's Compositional Generalization Ability Via Auxiliary Sequence Prediction Tasks Jiang Yichen, Bansal Mohit
- Enriching Transformers With Structured Tensor-product Representations For Abstractive Summarization Jiang Yichen, Celikyilmaz Asli, Smolensky Paul, Soulos Paul, Rao Sudha, Palangi Hamid, Fernandez Roland, Smith Caitlin, Bansal Mohit, Gao Jianfeng
- A Good Prompt Is Worth Millions Of Parameters: Low-resource Prompt-based Learning For Vision-language Models Jin Woojeong, Cheng Yu, Shen Yelong, Chen Weizhu, Ren Xiang
- How Transferable Are Reasoning Patterns In VQA? Kervadec Corentin, Jaunet Theo, Antipov Grigory, Baccouche Moez, Vuillemot Romain, Wolf Christian
- Text-free Prosody-aware Generative Spoken Language Modeling Kharitonov Eugene, Lee Ann, Polyak Adam, Adi Yossi, Copet Jade, Lakhotia Kushal, Nguyen Tu-anh, Rivière Morgane, Mohamed Abdelrahman, Dupoux Emmanuel, Hsu Wei-ning
- Can Transformer Models Measure Coherence In Text? Re-thinking The Shuffle Test Laban Philippe, Dai Luke, Bandarkar Lucas, Hearst Marti A.
- Align Before Fuse: Vision And Language Representation Learning With Momentum Distillation Li Junnan, Selvaraju Ramprasaath R., Gotmare Akhilesh Deepak, Joty Shafiq, Xiong Caiming, Hoi Steven
- Supervision Exists Everywhere: A Data Efficient Contrastive Language-image Pre-training Paradigm Li Yangguang, Liang Feng, Zhao Lichen, Cui Yufeng, Ouyang Wanli, Shao Jing, Yu Fengwei, Yan Junjie
- Scheduled Sampling In Vision-language Pretraining With Decoupled Encoder-decoder Network Li Yehao, Pan Yingwei, Yao Ting, Chen Jingwen, Mei Tao
- Terapipe: Token-level Pipeline Parallelism For Training Large-scale Language Models Li Zhuohan, Zhuang Siyuan, Guo Shiyuan, Zhuo Danyang, Zhang Hao, Song Dawn, Stoica Ion
- Learning Neural Templates For Recommender Dialogue System Liang Zujie, Hu Huang, Xu Can, Miao Jian, He Yingying, Chen Yining, Geng Xiubo, Liang Fan, Jiang Daxin
- Augmenting Sequential Recommendation With Pseudo-prior Items Via Reversely Pre-training Transformer Liu Zhiwei, Fan Ziwei, Wang Yu, Yu Philip S.
- Deltalm: Encoder-decoder Pre-training For Language Generation And Translation By Augmenting Pretrained Multilingual Encoders Ma Shuming, Dong Li, Huang Shaohan, Zhang Dongdong, Muzio Alexandre, Singhal Saksham, Awadalla Hany Hassan, Song Xia, Wei Furu
- Multi-modal Understanding And Generation For Medical Images And Text Via Vision-language Pre-training Moon Jong Hak, Lee Hyungyung, Shin Woncheol, Kim Young-hak, Choi Edward
- Arat5: Text-to-text Transformers For Arabic Language Generation Nagoudi El Moatez Billah, Elmadany Abdelrahim, Abdul-mageed Muhammad
- Sentence-t5: Scalable Sentence Encoders From Pre-trained Text-to-text Models Ni Jianmo, Ábrego Gustavo Hernández, Constant Noah, Ma Ji, Hall Keith B., Cer Daniel, Yang Yinfei
- MLP Architectures For Vision-and-language Modeling: An Empirical Study Nie Yixin, Li Linjie, Gan Zhe, Wang Shuohang, Zhu Chenguang, Zeng Michael, Liu Zicheng, Bansal Mohit, Wang Lijuan
- Investigating The Limitations Of Transformers With Simple Arithmetic Tasks Nogueira Rodrigo, Jiang Zhiying, Lin Jimmy
- TIMEDIAL: Temporal Commonsense Reasoning In Dialog Qin Lianhui, Gupta Aditya, Upadhyay Shyam, He Luheng, Choi Yejin, Faruqui Manaal
- Towards Neural Functional Program Evaluation Scholak Torsten, Pilault Jonathan, Velez-ginorio Joey
- How Much Can CLIP Benefit Vision-and-language Tasks? Shen Sheng, Li Liunian Harold, Tan Hao, Bansal Mohit, Rohrbach Anna, Chang Kai-wei, Yao Zhewei, Keutzer Kurt
- Lightningdot: Pre-training Visual-semantic Embeddings For Real-time Image-text Retrieval Sun Siqi, Chen Yen-chun, Li Linjie, Wang Shuohang, Fang Yuwei, Liu Jingjing
- General-purpose Question-answering With Macaw Tafjord Oyvind, Clark Peter
- Multitask Prompted Training Enables Zero-shot Task Generalization Victor Sanh, Albert Webson, Colin Raffel, Stephen H. Bach, Lintang Sutawika, Zaid Alyafeai, Antoine Chaffin, Arnaud Stiegler, Teven Le Scao, Arun Raja, Manan Dey, M Saiful Bari, Canwen Xu, Urmish Thakker, Shanya Sharma Sharma, Eliza Szczechla, Taewoon Kim, Gunjan Chhablani, Nihal Nayak, Debajyoti Datta, Jonathan Chang, Mike Tian-jian Jiang, Han Wang, Matteo Manica, Sheng Shen, Zheng Xin Yong, Harshit Pandey, Rachel Bawden, Thomas Wang, Trishala Neeraj, Jos Rozen, Abheesht Sharma, Andrea Santilli, Thibault Fevry, Jason Alan Fries, Ryan Teehan, Tali Bers, Stella Biderman, Leo Gao, Thomas Wolf, Alexander M. Rush
- Distilled Dual-encoder Model For Vision-language Understanding Wang Zekun, Wang Wenhui, Zhu Haichao, Liu Ming, Qin Bing, Wei Furu
- P-tuning V2: Prompt Tuning Can Be Comparable To Fine-tuning Universally Across Scales And Tasks Xiao Liu, Kaixuan Ji, Yicheng Fu, Weng Lam Tam, Zhengxiao Du, Zhilin Yang, Jie Tang
- Simple Local Attentions Remain Competitive For Long-context Tasks Xiong Wenhan, Oğuz Barlas, Gupta Anchit, Chen Xilun, Liskovich Diana, Levy Omer, Yih Wen-tau, Mehdad Yashar
- VLM: Task-agnostic Video-language Model Pre-training For Video Understanding Xu Hu, Ghosh Gargi, Huang Po-yao, Arora Prahal, Aminzadeh Masoumeh, Feichtenhofer Christoph, Metze Florian, Zettlemoyer Luke
- Fastseq: Make Sequence Generation Faster Yan Yu, Hu Fei, Chen Jiusheng, Bhendawade Nikhil, Ye Ting, Gong Yeyun, Duan Nan, Cui Desheng, Chi Bingyu, Zhang Ruofei
- Causal Attention For Vision-language Tasks Yang Xu, Zhang Hanwang, Qi Guojun, Cai Jianfei
- TR-BERT: Dynamic Token Reduction For Accelerating BERT Inference Ye Deming, Lin Yankai, Huang Yufei, Sun Maosong
- Readtwice: Reading Very Large Documents With Memories Zemlyanskiy Yury, Ainslie Joshua, De Jong Michiel, Pham Philip, Eckstein Ilya, Sha Fei
- Ror: Read-over-read For Long Document Machine Reading Comprehension Zhao Jing, Bao Junwei, Wang Yifan, Zhou Yongwei, Wu Youzheng, He Xiaodong, Zhou Bowen
- CPM-2: Large-scale Cost-effective Pre-trained Language Models Zhengyan Zhang, Yuxian Gu, Xu Han, Shengqi Chen, Chaojun Xiao, Zhenbo Sun, Yuan Yao, Fanchao Qi, Jian Guan, Pei Ke, Yanzheng Cai, Guoyang Zeng, Zhixing Tan, Zhiyuan Liu, Minlie Huang, Wentao Han, Yang Liu, Xiaoyan Zhu, Maosong Sun
- Taming Sparsely Activated Transformer With Stochastic Experts Zuo Simiao, Liu Xiaodong, Jiao Jian, Kim Young Jin, Hassan Hany, Zhang Ruofei, Zhao Tuo, Gao Jianfeng
- Prompting As Probing: Using Language Models For Knowledge Base Construction Alivanistos Dimitrios, Santamaría Selene Báez, Cochez Michael, Kalo Jan-christoph, Van Krieken Emile, Thanapalasingam Thiviyan
- GLM-130B: An Open Bilingual Pre-trained Model Aohan Zeng, Xiao Liu, Zhengxiao Du, Zihan Wang, Hanyu Lai, Ming Ding, Zhuoyi Yang, Yifan Xu, Wendi Zheng, Xiao Xia, Weng Lam Tam, Zixuan Ma, Yufei Xue, Jidong Zhai, Wenguang Chen, Peng Zhang, Yuxiao Dong, Jie Tang
- Banglanlg And Banglat5: Benchmarks And Resources For Evaluating Low-resource Natural Language Generation In Bangla Bhattacharjee Abhik, Hasan Tahmid, Ahmad Wasi Uddin, Shahriyar Rifat
- Inpars: Data Augmentation For Information Retrieval Using Large Language Models Bonifacio Luiz, Abonizio Hugo, Fadaee Marzieh, Nogueira Rodrigo
- Understanding Attention For Vision-and-language Tasks Cao Feiqi, Han Soyeon Caren, Long Siqu, Xu Changwei, Poon Josiah
- Speechprompt: An Exploration Of Prompt Tuning On Generative Spoken Language Model For Speech Processing Tasks Chang Kai-wei, Tseng Wei-cheng, Li Shang-wen, Lee Hung-yi
- Character-centric Story Visualization Via Visual Planning And Token Alignment Chen Hong, Han Rujun, Wu Te-lin, Nakayama Hideki, Peng Nanyun
- Xdoc: Unified Pre-training For Cross-format Document Understanding Chen Jingye, Lv Tengchao, Cui Lei, Zhang Cha, Wei Furu
- Altclip: Altering The Language Encoder In CLIP For Extended Language Capabilities Chen Zhongzhi, Liu Guang, Zhang Bo-wen, Ye Fulong, Yang Qinghong, Wu Ledell
- Task-aware Specialization For Efficient And Robust Dense Retrieval For Open-domain Question Answering Cheng Hao, Fang Hao, Liu Xiaodong, Gao Jianfeng
- Vault: Augmenting The Vision-and-language Transformer For Sentiment Classification On Social Media Chochlakis Georgios, Srinivasan Tejas, Thomason Jesse, Narayanan Shrikanth
- Continual Pre-training Mitigates Forgetting In Language And Vision Cossu Andrea, Tuytelaars Tinne, Carta Antonio, Passaro Lucia, Lomonaco Vincenzo, Bacciu Davide
- Why Can GPT Learn In-context? Language Models Implicitly Perform Gradient Descent As Meta-optimizers Dai Damai, Sun Yutao, Dong Li, Hao Yaru, Ma Shuming, Sui Zhifang, Wei Furu
- Black-box Prompt Learning For Pre-trained Language Models Diao Shizhe, Huang Zhichao, Xu Ruijia, Li Xuechun, Lin Yong, Zhou Xiao, Zhang Tong
- Write And Paint: Generative Vision-language Models Are Unified Modal Learners Diao Shizhe, Zhou Wangchunshu, Zhang Xinsong, Wang Jiawei
- Coarse-to-fine Vision-language Pre-training With Fusion In The Backbone Dou Zi-yi, Kamath Aishwarya, Gan Zhe, Zhang Pengchuan, Wang Jianfeng, Li Linjie, Liu Zicheng, Liu Ce, Lecun Yann, Peng Nanyun, Gao Jianfeng, Wang Lijuan
- Codegen: An Open Large Language Model For Code With Multi-turn Program Synthesis Erik Nijkamp, Bo Pang, Hiroaki Hayashi, Lifu Tu, Huan Wang, Yingbo Zhou, Silvio Savarese, Caiming Xiong
- GPTQ: Accurate Post-training Quantization For Generative Pre-trained Transformers Frantar Elias, Ashkboos Saleh, Hoefler Torsten, Alistarh Dan
- Language Models Are Multilingual Chain-of-thought Reasoners Freda Shi, Mirac Suzgun, Markus Freitag, Xuezhi Wang, Suraj Srivats, Soroush Vosoughi, Hyung Won Chung, Yi Tay, Sebastian Ruder, Denny Zhou, Dipanjan Das, Jason Wei
- WAVPROMPT: Towards Few-shot Spoken Language Understanding With Frozen Language Models Gao Heting, Ni Junrui, Qian Kaizhi, Zhang Yang, Chang Shiyu, Hasegawa-johnson Mark
- Re2g: Retrieve, Rerank, Generate Glass Michael, Rossiello Gaetano, Chowdhury Md Faisal Mahbub, Naik Ankita Rajaram, Cai Pengshan, Gliozzo Alfio
- Diffuseq: Sequence To Sequence Text Generation With Diffusion Models Gong Shansan, Li Mukai, Feng Jiangtao, Wu Zhiyong, Kong Lingpeng
- Generating Executable Action Plans With Environmentally-aware Language Models Gramopadhye Maitrey, Szafir Daniel
- Visually-augmented Pretrained Language Models For NLP Tasks Without Images Guo Hangyu, Zhou Kun, Zhao Wayne Xin, Zhang Qinyu, Wen Ji-rong
- Aging With GRACE: Lifelong Model Editing With Discrete Key-value Adaptors Hartvigsen Thomas, Sankaranarayanan Swami, Palangi Hamid, Kim Yoon, Ghassemi Marzyeh
- Fctalker: Fine And Coarse Grained Context Modeling For Expressive Conversational Speech Synthesis Hu Yifan, Liu Rui, Gao Guanglai, Li Haizhou
- Layoutlmv3: Pre-training For Document AI With Unified Text And Image Masking Huang Yupan, Lv Tengchao, Cui Lei, Lu Yutong, Wei Furu
- Welm: A Well-read Pre-trained Language Model For Chinese Hui Su, Xiao Zhou, Houjin Yu, Xiaoyu Shen, Yuwen Chen, Zilin Zhu, Yang Yu, Jie Zhou
- Can Large Language Models Truly Understand Prompts? A Case Study With Negated Prompts Jang Joel, Ye Seonghyeon, Seo Minjoon
- Contrastive Learning For Prompt-based Few-shot Language Learners Jian Yiren, Gao Chongyang, Vosoughi Soroush
- Pseudo-q: Generating Pseudo Language Queries For Visual Grounding Jiang Haojun, Lin Yuanze, Han Dongchen, Song Shiji, Huang Gao
- Self-prompting Large Language Models For Zero-shot Open-domain QA Junlong Li, Jinyuan Wang, Zhuosheng Zhang, Hai Zhao
- Decomposed Prompting: A Modular Approach For Solving Complex Tasks Khot Tushar, Trivedi Harsh, Finlayson Matthew, Fu Yao, Richardson Kyle, Clark Peter, Sabharwal Ashish
- The Stack: 3 TB Of Permissively Licensed Source Code Kocetkov Denis, Li Raymond, Allal Loubna Ben, Li Jia, Mou Chenghao, Ferrandis Carlos Muñoz, Jernite Yacine, Mitchell Margaret, Hughes Sean, Wolf Thomas, Bahdanau Dzmitry, Von Werra Leandro, De Vries Harm
- Mucot: Multilingual Contrastive Training For Question-answering In Low-resource Languages Kumar Gokul Karthik, Gehlot Abhishek Singh, Mullappilly Sahal Shaji, Nandakumar Karthik
- The Optimal BERT Surgeon: Scalable And Accurate Second-order Pruning For Large Language Models Kurtic Eldar, Campos Daniel, Nguyen Tuan, Frantar Elias, Kurtz Mark, Fineran Benjamin, Goin Michael, Alistarh Dan
- Fine-grained Semantically Aligned Vision-language Pre-training Li Juncheng, He Xin, Wei Longhui, Qian Long, Zhu Linchao, Xie Lingxi, Zhuang Yueting, Tian Qi, Tang Siliang
- Learning To Transfer Prompts For Text Generation Li Junyi, Tang Tianyi, Nie Jian-yun, Wen Ji-rong, Zhao Wayne Xin
- LAVENDER: Unifying Video-language Understanding As Masked Language Modeling Li Linjie, Gan Zhe, Lin Kevin, Lin Chung-ching, Liu Zicheng, Liu Ce, Wang Lijuan
- Clinical-longformer And Clinical-bigbird: Transformers For Long Clinical Sequences Li Yikuan, Wehbe Ramsey M., Ahmad Faraz S., Wang Hanyin, Luo Yuan
- Vision-language Pre-training For Multimodal Aspect-based Sentiment Analysis Ling Yan, Yu Jianfei, Xia Rui
- Fl-tuning: Layer Tuning For Feed-forward Network In Transformer Liu Jingping, Song Yuqiu, Xue Kui, Sun Hongli, Wang Chao, Chen Lihan, Jiang Haiyun, Liang Jiaqing, Ruan Tong
- A Win-win Deal: Towards Sparse And Robust Pre-trained Language Models Liu Yuanxin, Meng Fandong, Lin Zheng, Li Jiangnan, Fu Peng, Cao Yanan, Wang Weiping, Zhou Jie
- Learn To Explain: Multimodal Reasoning Via Thought Chains For Science Question Answering Lu Pan, Mishra Swaroop, Xia Tony, Qiu Liang, Chang Kai-wei, Zhu Song-chun, Tafjord Oyvind, Clark Peter, Kalyan Ashwin
- Biogpt: Generative Pre-trained Transformer For Biomedical Text Generation And Mining Luo Renqian, Sun Liai, Xia Yingce, Qin Tao, Zhang Sheng, Poon Hoifung, Liu Tie-yan
- PAL: Program-aided Language Models Luyu Gao, Aman Madaan, Shuyan Zhou, Uri Alon, Pengfei Liu, Yiming Yang, Jamie Callan, Graham Neubig
- Memory-assisted Prompt Editing To Improve GPT-3 After Deployment Madaan Aman, Tandon Niket, Clark Peter, Yang Yiming
- PERFECT: Prompt-free And Efficient Few-shot Learning With Language Models Mahabadi Rabeeh Karimi, Zettlemoyer Luke, Henderson James, Saeidi Marzieh, Mathias Lambert, Stoyanov Veselin, Yazdani Majid
- Grounding Language With Visual Affordances Over Unstructured Data Mees Oier, Borja-diaz Jessica, Burgard Wolfram
- LAD: Language Models As Data For Zero-shot Dialog Mehri Shikib, Altun Yasemin, Eskenazi Maxine
- Mass-editing Memory In A Transformer Meng Kevin, Sharma Arnab Sen, Andonian Alex, Belinkov Yonatan, Bau David
- Do As I Can, Not As I Say: Grounding Language In Robotic Affordances Michael Ahn, Anthony Brohan, Noah Brown, Yevgen Chebotar, Omar Cortes, Byron David, Chelsea Finn, Chuyuan Fu, Keerthana Gopalakrishnan, Karol Hausman, Alex Herzog, Daniel Ho, Jasmine Hsu, Julian Ibarz, Brian Ichter, Alex Irpan, Eric Jang, Rosario Jauregui Ruano, Kyle Jeffrey, Sally Jesmonth, Nikhil J Joshi, Ryan Julian, Dmitry Kalashnikov, Yuheng Kuang, Kuang-huei Lee, Sergey Levine, Yao Lu, Linda Luu, Carolina Parada, Peter Pastor, Jornell Quiambao, Kanishka Rao, Jarek Rettinghouse, Diego Reyes, Pierre Sermanet, Nicolas Sievers, Clayton Tan, Alexander Toshev, Vincent Vanhoucke, Fei Xia, Ted Xiao, Peng Xu, Sichun Xu, Mengyuan Yan, Andy Zeng
- Minicons: Enabling Flexible Behavioral And Representational Analyses Of Transformer Language Models Misra Kanishka
- SGPT: GPT Sentence Embeddings For Semantic Search Muennighoff Niklas
- Large Language Models Are Reasoning Teachers Namgyu Ho, Laura Schmid, Se-young Yun
- Crosslingual Generalization Through Multitask Finetuning Niklas Muennighoff, Thomas Wang, Lintang Sutawika, Adam Roberts, Stella Biderman, Teven Le Scao, M Saiful Bari, Sheng Shen, Zheng-xin Yong, Hailey Schoelkopf, Xiangru Tang, Dragomir Radev, Alham Fikri Aji, Khalid Almubarak, Samuel Albanie, Zaid Alyafeai, Albert Webson, Edward Raff, Colin Raffel
- No Language Left Behind: Scaling Human-centered Machine Translation Nllb Team, Marta R. Costa-jussà, James Cross, Onur Çelebi, Maha Elbayad, Kenneth Heafield, Kevin Heffernan, Elahe Kalbassi, Janice Lam, Daniel Licht, Jean Maillard, Anna Sun, Skyler Wang, Guillaume Wenzek, Al Youngblood, Bapi Akula, Loic Barrault, Gabriel Mejia Gonzalez, Prangthip Hansanti, John Hoffman, Semarley Jarrett, Kaushik Ram Sadagopan, Dirk Rowe, Shannon Spruit, Chau Tran, Pierre Andrews, Necip Fazil Ayan, Shruti Bhosale, Sergey Edunov, Angela Fan, Cynthia Gao, Vedanuj Goswami, Francisco Guzmán, Philipp Koehn, Alexandre Mourachko, Christophe Ropers, Safiyyah Saleem, Holger Schwenk, Jeff Wang
- Minialbert: Model Distillation Via Parameter-efficient Recursive Transformers Nouriborji Mohammadmahdi, Rohanian Omid, Kouchaki Samaneh, Clifton David A.
- Extremebert: A Toolkit For Accelerating Pretraining Of Customized BERT Pan Rui, Diao Shizhe, Chen Jianlin, Zhang Tong
- Relation-aware Language-graph Transformer For Question Answering Park Jinyoung, Choi Hyeong Kyu, Ko Juyeon, Park Hyeonjin, Kim Ji-hoon, Jeong Jisu, Kim Kyungmin, Kim Hyunwoo J.
- Sparse And Dense Approaches For The Full-rank Retrieval Of Responses For Dialogues Penha Gustavo, Hauff Claudia
- Visconde: Multi-document QA With GPT-3 And Neural Reranking Pereira Jayr, Fidalgo Robson, Lotufo Roberto, Nogueira Rodrigo
- Ignore Previous Prompt: Attack Techniques For Language Models Perez Fábio, Ribeiro Ian
- Grips: Gradient-free, Edit-based Instruction Search For Prompting Large Language Models Prasad Archiki, Hase Peter, Zhou Xiang, Bansal Mohit
- ELLE: Efficient Lifelong Pre-training For Emerging Data Qin Yujia, Zhang Jiajie, Lin Yankai, Liu Zhiyuan, Li Peng, Sun Maosong, Zhou Jie
- Parallel Context Windows For Large Language Models Ratner Nir, Levine Yoav, Belinkov Yonatan, Ram Ori, Magar Inbal, Abend Omri, Karpas Ehud, Shashua Amnon, Leyton-brown Kevin, Shoham Yoav
- An Understanding-oriented Robust Machine Reading Comprehension Model Ren Feiliang, Liu Yongkang, Li Bochao, Liu Shilei, Wang Bingchao, Wang Jiaqi, Liu Chunchao, Ma Qi
- Billions Of Parameters Are Worth More Than In-domain Training Data: A Case Study In The Legal Case Entailment Task Rosa Guilherme Moraes, Bonifacio Luiz, Jeronymo Vitor, Abonizio Hugo, Lotufo Roberto, Nogueira Rodrigo
- NLX-GPT: A Model For Natural Language Explanations In Vision And Vision-language Tasks Sammani Fawaz, Mukherjee Tanmoy, Deligiannis Nikos
- What Language Model To Train If You Have One Million GPU Hours? Scao Teven Le, Wang Thomas, Hesslow Daniel, Saulnier Lucile, Bekman Stas, Bari M Saiful, Biderman Stella, Elsahar Hady, Muennighoff Niklas, Phang Jason, Press Ofir, Raffel Colin, Sanh Victor, Shen Sheng, Sutawika Lintang, Tae Jaesung, Yong Zheng Xin, Launay Julien, Beltagy Iz
- Bird-eye Transformers For Text Generation Models Sha Lei, Song Yuhang, Yordanov Yordan, Salvatori Tommaso, Lukasiewicz Thomas
- Staged Training For Transformer Language Models Shen Sheng, Walsh Pete, Keutzer Kurt, Dodge Jesse, Peters Matthew, Beltagy Iz
- XRICL: Cross-lingual Retrieval-augmented In-context Learning For Cross-lingual Text-to-sql Semantic Parsing Shi Peng, Zhang Rui, Bai He, Lin Jimmy
- Distilling Reasoning Capabilities Into Smaller Language Models Shridhar Kumar, Stolfo Alessandro, Sachan Mrinmaya
- Repository-level Prompt Generation For Large Language Models Of Code Shrivastava Disha, Larochelle Hugo, Tarlow Daniel
- React: Synergizing Reasoning And Acting In Language Models Shunyu Yao, Jeffrey Zhao, Dian Yu, Nan Du, Izhak Shafran, Karthik Narasimhan, Yuan Cao
- Reasoning With Language Model Prompting: A Survey Shuofei Qiao, Yixin Ou, Ningyu Zhang, Xiang Chen, Yunzhi Yao, Shumin Deng, Chuanqi Tan, Fei Huang, Huajun Chen
- Gpt-neox-20b: An Open-source Autoregressive Language Model Sid Black, Stella Biderman, Eric Hallahan, Quentin Anthony, Leo Gao, Laurence Golding, Horace He, Connor Leahy, Kyle Mcdonell, Jason Phang, Michael Pieler, Usvsn Sai Prashanth, Shivanshu Purohit, Laria Reynolds, Jonathan Tow, Ben Wang, Samuel Weinbach
- Ask Me Anything: A Simple Strategy For Prompting Language Models Simran Arora, Avanika Narayan, Mayee F. Chen, Laurel Orr, Neel Guha, Kush Bhatia, Ines Chami, Frederic Sala, Christopher Ré
- Long-form Video-language Pre-training With Multimodal Temporal Contrastive Learning Sun Yuchong, Xue Hongwei, Song Ruihua, Liu Bei, Yang Huan, Fu Jianlong
- A Length-extrapolatable Transformer Sun Yutao, Dong Li, Patra Barun, Ma Shuming, Huang Shaohan, Benhaim Alon, Chaudhary Vishrav, Song Xia, Wei Furu
- Recitation-augmented Language Models Sun Zhiqing, Wang Xuezhi, Tay Yi, Yang Yiming, Zhou Denny
- ACORT: A Compact Object Relation Transformer For Parameter Efficient Image Captioning Tan Jia Huei, Tan Ying Hua, Chan Chee Seng, Chuah Joon Huang
- Benchmarking Large Language Models For Automated Verilog RTL Code Generation Thakur Shailja, Ahmad Baleegh, Fan Zhenxing, Pearce Hammond, Tan Benjamin, Karri Ramesh, Dolan-gavitt Brendan, Garg Siddharth
- Interleaving Retrieval With Chain-of-thought Reasoning For Knowledge-intensive Multi-step Questions Trivedi Harsh, Balasubramanian Niranjan, Khot Tushar, Sabharwal Ashish
- Adavae: Exploring Adaptive Gpt-2s In Variational Auto-encoders For Language Modeling Tu Haoqin, Yang Zhongliang, Yang Jinshuai, Huang Yongfeng
- Efficient Few-shot Learning Without Prompts Tunstall Lewis, Reimers Nils, Jo Unso Eun Seo, Bates Luke, Korat Daniel, Wasserblat Moshe, Pereg Oren
- Unite: Unified Translation Evaluation Wan Yu, Liu Dayiheng, Yang Baosong, Zhang Haibo, Chen Boxing, Wong Derek F., Chao Lidia S.
- Can Language Models Make Fun? A Case Study In Chinese Comical Crosstalk Wang Benyou, Wu Xiangbo, Liu Xiaokang, Li Jianquan, Tiwari Prayag, Xie Qianqian
- Lilt: A Simple Yet Effective Language-independent Layout Transformer For Structured Document Understanding Wang Jiapeng, Jin Lianwen, Ding Kai
- TAG: Boosting Text-vqa Via Text-aware Visual Question-answer Generation Wang Jun, Gao Mingfei, Hu Yuqian, Selvaraju Ramprasaath R., Ramaiah Chetan, Xu Ran, Jaja Joseph F., Davis Larry S.
- Pretraining Without Attention Wang Junxiong, Yan Jing Nathan, Gu Albert, Rush Alexander M.
- What Language Model Architecture And Pretraining Objective Work Best For Zero-shot Generalization? Wang Thomas, Roberts Adam, Hesslow Daniel, Scao Teven Le, Chung Hyung Won, Beltagy Iz, Launay Julien, Raffel Colin
- Visually-augmented Language Modeling Wang Weizhi, Dong Li, Cheng Hao, Song Haoyu, Liu Xiaodong, Yan Xifeng, Gao Jianfeng, Wei Furu
- Finding Skill Neurons In Pre-trained Transformer-based Language Models Wang Xiaozhi, Wen Kaiyue, Zhang Zhengyan, Hou Lei, Liu Zhiyuan, Li Juanzi
- S-prompts Learning With Pre-trained Transformers: An Occam's Razor For Domain Incremental Learning Wang Yabin, Huang Zhiwu, Hong Xiaopeng
- Language Models With Image Descriptors Are Strong Few-shot Video-language Learners Wang Zhenhailong, Li Manling, Xu Ruochen, Zhou Luowei, Lei Jie, Lin Xudong, Wang Shuohang, Yang Ziyi, Zhu Chenguang, Hoiem Derek, Chang Shih-fu, Bansal Mohit, Ji Heng
- Jiuzhang: A Chinese Pre-trained Language Model For Mathematical Problem Understanding Wayne Xin Zhao, Kun Zhou, Zheng Gong, Beichen Zhang, Yuanhang Zhou, Jing Sha, Zhigang Chen, Shijin Wang, Cong Liu, Ji-rong Wen
- Large Language Models Are Few(1)-shot Table Reasoners Wenhu Chen
- Program Of Thoughts Prompting: Disentangling Computation From Reasoning For Numerical Reasoning Tasks Wenhu Chen, Xueguang Ma, Xinyi Wang, William W. Cohen
- Sentence Representation Learning With Generative Objective Rather Than Contrastive Objective Wu Bohong, Zhao Hai
- An Efficient Memory-augmented Transformer For Knowledge-intensive NLP Tasks Wu Yuxiang, Zhao Yu, Hu Baotian, Minervini Pasquale, Stenetorp Pontus, Riedel Sebastian
- Smoothquant: Accurate And Efficient Post-training Quantization For Large Language Models Xiao Guangxuan, Lin Ji, Seznec Mickael, Wu Hao, Demouth Julien, Han Song
- Dual Modality Prompt Tuning For Vision-language Pre-trained Model Xing Yinghui, Wu Qirui, Cheng De, Zhang Shizhou, Liang Guoqiang, Wang Peng, Zhang Yanning
- Adapting Pretrained Text-to-text Models For Long Text Sequences Xiong Wenhan, Gupta Anchit, Toshniwal Shubham, Mehdad Yashar, Yih Wen-tau
- SESCORE2: Learning Text Generation Evaluation Via Synthesizing Realistic Mistakes Xu Wenda, Qian Xian, Wang Mingxuan, Li Lei, Wang William Yang
- Bridgetower: Building Bridges Between Encoders In Vision-language Representation Learning Xu Xiao, Wu Chenfei, Rosenman Shachar, Lal Vasudev, Che Wanxiang, Duan Nan
- Learning To Answer Visual Questions From Web Videos Yang Antoine, Miech Antoine, Sivic Josef, Laptev Ivan, Schmid Cordelia
- Zero-shot Video Question Answering Via Frozen Bidirectional Language Models Yang Antoine, Miech Antoine, Sivic Josef, Laptev Ivan, Schmid Cordelia
- Testaug: A Framework For Augmenting Capability-based NLP Tests Yang Guanqun, Haque Mirazul, Song Qiaochu, Yang Wei, Liu Xueqing
- Prompt Tuning For Discriminative Pre-trained Language Models Yao Yuan, Dong Bowen, Zhang Ao, Zhang Zhengyan, Xie Ruobing, Liu Zhiyuan, Lin Leyu, Sun Maosong, Wang Jianyong
- Structured Prompting: Scaling In-context Learning To 1,000 Examples Yaru Hao, Yutao Sun, Li Dong, Zhixiong Han, Yuxian Gu, Furu Wei
- Guess The Instruction! Flipped Learning Makes Language Models Stronger Zero-shot Learners Ye Seonghyeon, Kim Doyoung, Jang Joel, Shin Joongbo, Seo Minjoon
- Large Language Models Are Better Reasoners With Self-verification Yixuan Weng, Minjun Zhu, Fei Xia, Bin Li, Shizhu He, Shengping Liu, Bin Sun, Kang Liu, Jun Zhao
- BLOOM+1: Adding Language Support To BLOOM For Zero-shot Prompting Yong Zheng-xin, Schoelkopf Hailey, Muennighoff Niklas, Aji Alham Fikri, Adelani David Ifeoluwa, Almubarak Khalid, Bari M Saiful, Sutawika Lintang, Kasai Jungo, Baruwa Ahmed, Winata Genta Indra, Biderman Stella, Raff Edward, Radev Dragomir, Nikoulina Vassilina
- Scaling Autoregressive Models For Content-rich Text-to-image Generation Yu Jiahui, Xu Yuanzhong, Koh Jing Yu, Luong Thang, Baid Gunjan, Wang Zirui, Vasudevan Vijay, Ku Alexander, Yang Yinfei, Ayan Burcu Karagol, Hutchinson Ben, Han Wei, Parekh Zarana, Li Xin, Zhang Han, Baldridge Jason, Wu Yonghui
- Bridging Cross-lingual Gaps During Leveraging The Multilingual Sequence-to-sequence Pretraining For Text Generation And Understanding Zan Changtong, Ding Liang, Shen Li, Cao Yu, Liu Weifeng, Tao Dacheng
- Dyrex: Dynamic Query Representation For Extractive Question Answering Zaratiana Urchade, Khbir Niama El, Núñez Dennis, Holat Pierre, Tomeh Nadi, Charnois Thierry
- M-adapter: Modality Adaptation For End-to-end Speech-to-text Translation Zhao Jinming, Yang Hao, Shareghi Ehsan, Haffari Gholamreza
- Lite Unified Modeling For Discriminative Reading Comprehension Zhao Yilin, Zhao Hai, Shen Libin, Zhao Yinggong
- Self-adaptive In-context Learning: An Information Compression Perspective For In-context Example Selection And Ordering Zhiyong Wu, Yaoxiang Wang, Jiacheng Ye, Lingpeng Kong
- Binding Language Models In Symbolic Languages Zhoujun Cheng, Tianbao Xie, Peng Shi, Chengzu Li, Rahul Nadkarni, Yushi Hu, Caiming Xiong, Dragomir Radev, Mari Ostendorf, Luke Zettlemoyer, Noah A. Smith, Tao Yu
- Automatic Chain Of Thought Prompting In Large Language Models Zhuosheng Zhang, Aston Zhang, Mu Li, Alex Smola
- Generator-retriever-generator Approach For Open-domain Question Answering Abdallah Abdelrahman, Jatowt Adam
- Inpars Toolkit: A Unified And Reproducible Synthetic Data Generation Pipeline For Neural Information Retrieval Abonizio Hugo, Bonifacio Luiz, Jeronymo Vitor, Lotufo Roberto, Zavrel Jakub, Nogueira Rodrigo
- Let's Sample Step By Step: Adaptive-consistency For Efficient Reasoning And Coding With Llms Aggarwal Pranjal, Madaan Aman, Yang Yiming, Mausam
- The Emergence Of Essential Sparsity In Large Pre-trained Models: The Weights That Matter Ajay Jaiswal, Shiwei Liu, Tianlong Chen, Zhangyang Wang
- An In-depth Look At Gemini's Language Abilities Akter Syeda Nahida, Yu Zichun, Muhamed Aashiq, Ou Tianyue, Bäuerle Alex, Cabrera Ángel Alexander, Dholakia Krish, Xiong Chenyan, Neubig Graham
- Can Language Models Employ The Socratic Method? Experiments With Code Debugging Al-hossami Erfan, Bunescu Razvan, Smith Justin, Teehan Ryan
- Taqyim: Evaluating Arabic NLP Tasks Using Chatgpt Models Alyafeai Zaid, Alshaibani Maged S., Alkhamissi Badr, Luqman Hamzah, Alareqi Ebrahim, Fadel Ali
- Learning From Mistakes Makes LLM Better Reasoner An Shengnan, Ma Zexiong, Lin Zeqi, Zheng Nanning, Lou Jian-guang, Chen Weizhu
- On Generative Agents In Recommendation An Zhang, Yuxin Chen, Leheng Sheng, Xiang Wang, Tat-seng Chua
- Distilling Efficient Language-specific Models For Cross-lingual Transfer Ansell Alan, Ponti Edoardo Maria, Korhonen Anna, Vulić Ivan
- Agenttuning: Enabling Generalized Agent Abilities For Llms Aohan Zeng, Mingdao Liu, Rui Lu, Bowen Wang, Xiao Liu, Yuxiao Dong, Jie Tang
- Sequence-to-sequence Spanish Pre-trained Language Models Araujo Vladimir, Trusca Maria Mihaela, Tufiño Rodrigo, Moens Marie-francine
- Zoology: Measuring And Improving Recall In Efficient Language Models Arora Simran, Eyuboglu Sabri, Timalsina Aman, Johnson Isys, Poli Michael, Zou James, Rudra Atri, Ré Christopher
- Unihd At TSAR-2022 Shared Task: Is Compute All We Need For Lexical Simplification? Aumiller Dennis, Gertz Michael
- Openflamingo: An Open-source Framework For Training Large Autoregressive Vision-language Models Awadalla Anas, Gao Irena, Gardner Josh, Hessel Jack, Hanafy Yusuf, Zhu Wanrong, Marathe Kalyani, Bitton Yonatan, Gadre Samir, Sagawa Shiori, Jitsev Jenia, Kornblith Simon, Koh Pang Wei, Ilharco Gabriel, Wortsman Mitchell, Schmidt Ludwig
- Fauno: The Italian Large Language Model That Will Leave You Senza Parole! Bacciu Andrea, Trappolini Giovanni, Santilli Andrea, Rodolà Emanuele, Silvestri Fabrizio
- Leftover Lunch: Advantage-based Offline Reinforcement Learning For Language Models Baheti Ashutosh, Lu Ximing, Brahman Faeze, Bras Ronan Le, Sap Maarten, Riedl Mark
- Longbench: A Bilingual, Multitask Benchmark For Long Context Understanding Bai Yushi, Lv Xin, Zhang Jiajie, Lyu Hongchang, Tang Jiankai, Huang Zhidian, Du Zhengxiao, Liu Xiao, Zeng Aohan, Hou Lei, Dong Yuxiao, Tang Jie, Li Juanzi
- Peering Through Preferences: Unraveling Feedback Acquisition For Aligning Large Language Models Bansal Hritik, Dang John, Grover Aditya
- A Bi-step Grounding Paradigm For Large Language Models In Recommendation Systems Bao Keqin, Zhang Jizhi, Wang Wenjie, Zhang Yang, Yang Zhengyi, Luo Yancheng, Chen Chong, Feng Fuli, Tian Qi
- Tallrec: An Effective And Efficient Tuning Framework To Align Large Language Model With Recommendation Bao Keqin, Zhang Jizhi, Zhang Yang, Wang Wenjie, Feng Fuli, He Xiangnan
- Assessing And Enhancing The Robustness Of Large Language Models With Task Structure Variations For Logical Reasoning Bao Qiming, Gendron Gael, Peng Alex Yuxuan, Zhong Wanjun, Tan Neset, Chen Yang, Witbrock Michael, Liu Jiamou
- Abstract Meaning Representation-based Logic-driven Data Augmentation For Logical Reasoning Bao Qiming, Peng Alex Yuxuan, Deng Zhenyun, Zhong Wanjun, Gendron Gael, Pistotti Timothy, Tan Neset, Young Nathan, Chen Yang, Zhu Yonghua, Denny Paul, Witbrock Michael, Liu Jiamou
- Disc-medllm: Bridging General Large Language Models And Real-world Medical Consultation Bao Zhijie, Chen Wei, Xiao Shengze, Ren Kuang, Wu Jiaao, Zhong Cheng, Peng Jiajie, Huang Xuanjing, Wei Zhongyu
- Sensitivity, Performance, Robustness: Deconstructing The Effect Of Sociodemographic Prompting Beck Tilman, Schuff Hendrik, Lauscher Anne, Gurevych Iryna
- AQUALLM: Audio Question Answering Data Generation Using Large Language Models Behera Swarup Ranjan, Injeti Krishna Mohan, Patibandla Jaya Sai Kiran, Pokala Praveen Kumar, Pailla Balakrishna Reddy
- TART: A Plug-and-play Transformer Module For Task-agnostic Reasoning Bhatia Kush, Narayan Avanika, De Sa Christopher, Ré Christopher
- When Do Program-of-thoughts Work For Reasoning? Bi Zhen, Zhang Ningyu, Jiang Yinuo, Deng Shumin, Zheng Guozhou, Chen Huajun
- VIGC: Visual Instruction Generation And Correction Bin Wang, Fan Wu, Xiao Han, Jiahui Peng, Huaping Zhong, Pan Zhang, Xiaoyi Dong, Weijia Li, Wei Li, Jiaqi Wang, Conghui He
- How Close Is Chatgpt To Human Experts? Comparison Corpus, Evaluation, And Detection Biyang Guo, Xin Zhang, Ziyuan Wang, Minqi Jiang, Jinran Nie, Yuxuan Ding, Jianwei Yue, Yupeng Wu
- Inpars-light: Cost-effective Unsupervised Training Of Efficient Rankers Boytsov Leonid, Patel Preksha, Sourabh Vivek, Nisar Riddhi, Kundu Sayani, Ramanathan Ramya, Nyberg Eric
- Instructalign: High-and-low Resource Language Alignment Via Continual Crosslingual Instruction Tuning Cahyawijaya Samuel, Lovenia Holy, Yu Tiezheng, Chung Willy, Fung Pascale
- Leveraging Large Language Models For Scalable Vector Graphics-driven Image Understanding Cai Mu, Huang Zeyi, Li Yuheng, Ojha Utkarsh, Wang Haohan, Lee Yong Jae
- Wizardlm: Empowering Large Language Models To Follow Complex Instructions Can Xu, Qingfeng Sun, Kai Zheng, Xiubo Geng, Pu Zhao, Jiazhan Feng, Chongyang Tao, Daxin Jiang
- H2O Open Ecosystem For State-of-the-art Large Language Models Candel Arno, Mckinney Jon, Singer Philipp, Pfeiffer Pascal, Jeblick Maximilian, Lee Chun Ming, Conde Marcos V.
- A Step Closer To Comprehensive Answers: Constrained Multi-stage Question Decomposition With Large Language Models Cao Hejing, An Zhenwei, Feng Jiazhan, Xu Kun, Chen Liwei, Zhao Dongyan
- Can It Edit? Evaluating The Ability Of Large Language Models To Follow Code Editing Instructions Cassano Federico, Li Luisa, Sethi Akul, Shinn Noah, Brennan-jones Abby, Ginesin Jacob, Berman Edward, Chakhnashvili George, Lozhkov Anton, Anderson Carolyn Jane, Guha Arjun
- Clembench: Using Game Play To Evaluate Chat-optimized Language Models As Conversational Agents Chalamalasetti Kranti, Götze Jana, Hakimov Sherzod, Madureira Brielen, Sadler Philipp, Schlangen David
- Chatgpt May Pass The Bar Exam Soon, But Has A Long Way To Go For The Lexglue Benchmark Chalkidis Ilias
- Learning To Generate Better Than Your LLM Chang Jonathan D., Brantley Kiante, Ramamurthy Rajkumar, Misra Dipendra, Sun Wen
- Hallucination Augmented Contrastive Learning For Multimodal Large Language Model Chaoya Jiang, Haiyang Xu, Mengfan Dong, Jiaxing Chen, Wei Ye, Ming Yan, Qinghao Ye, Ji Zhang, Fei Huang, Shikun Zhang
- MME: A Comprehensive Evaluation Benchmark For Multimodal Large Language Models Chaoyou Fu, Peixian Chen, Yunhang Shen, Yulei Qin, Mengdan Zhang, Xu Lin, Jinrui Yang, Xiawu Zheng, Ke Li, Xing Sun, Yunsheng Wu, Rongrong Ji
- Memgpt: Towards Llms As Operating Systems Charles Packer, Sarah Wooders, Kevin Lin, Vivian Fang, Shishir G. Patil, Ion Stoica, Joseph E. Gonzalez
- Position-enhanced Visual Instruction Tuning For Multimodal Large Language Models Chen Chi, Qin Ruoyu, Luo Fuwen, Mi Xiaoyue, Li Peng, Sun Maosong, Liu Yang
- CLEX: Continuous Length Extrapolation For Large Language Models Chen Guanzheng, Li Xin, Meng Zaiqiao, Liang Shangsong, Bing Lidong
- Videollm: Modeling Video Sequence With Large Language Models Chen Guo, Zheng Yin-dong, Wang Jiahao, Xu Jilan, Huang Yifei, Pan Junting, Wang Yi, Wang Yali, Qiao Yu, Lu Tong, Wang Limin
- Reconcile: Round-table Conference Improves Reasoning Via Consensus Among Diverse Llms Chen Justin Chih-yao, Saha Swarnadeep, Bansal Mohit
- Webvln: Vision-and-language Navigation On Websites Chen Qi, Pitawela Dileepa, Zhao Chongyang, Zhou Gengze, Chen Hsiang-ting, Wu Qi
- MMICT: Boosting Multi-modal Fine-tuning With In-context Examples Chen Tao, Zhang Enwei, Gao Yuting, Li Ke, Sun Xing, Zhang Yan, Li Hui, Ji Rongrong
- Lorashear: Efficient Large Language Model Structured Pruning And Knowledge Recovery Chen Tianyi, Ding Tianyu, Yadav Badal, Zharkov Ilya, Liang Luming
- Theoremqa: A Theorem-driven Question Answering Dataset Chen Wenhu, Yin Ming, Ku Max, Lu Pan, Wan Yixin, Ma Xueguang, Xu Jianyu, Wang Xinyi, Xia Tony
- Autoeval-video: An Automatic Benchmark For Assessing Large Vision Language Models In Open-ended Video Question Answering Chen Xiuyuan, Lin Yuan, Zhang Yuchen, Huang Weiran
- EE-LLM: Large-scale Training And Inference Of Early-exit Large Language Models With 3D Parallelism Chen Yanxi, Pan Xuchen, Li Yaliang, Ding Bolin, Zhou Jingren
- Dog-instruct: Towards Premium Instruction-tuning Data Via Text-grounded Instruction Wrapping Chen Yongrui, Jiang Haiyun, Huang Xinting, Shi Shuming, Qi Guilin
- T-eval: Evaluating The Tool Utilization Capability Of Large Language Models Step By Step Chen Zehui, Du Weihua, Zhang Wenwei, Liu Kuikun, Liu Jiangning, Zheng Miao, Zhuo Jingming, Zhang Songyang, Lin Dahua, Chen Kai, Zhao Feng
- Internvl: Scaling Up Vision Foundation Models And Aligning For Generic Visual-linguistic Tasks Chen Zhe, Wu Jiannan, Wang Wenhai, Su Weijie, Chen Guo, Xing Sen, Zhong Muyan, Zhang Qinglong, Zhu Xizhou, Lu Lewei, Li Bin, Luo Ping, Lu Tong, Qiao Yu, Dai Jifeng
- Chatcot: Tool-augmented Chain-of-thought Reasoning On Chat-based Large Language Models Chen Zhipeng, Zhou Kun, Zhang Beichen, Gong Zheng, Zhao Wayne Xin, Wen Ji-rong
- Xplainllm: A QA Explanation Dataset For Understanding LLM Decision-making Chen Zichen, Chen Jianda, Gaidhani Mitali, Singh Ambuj, Sra Misha
- Cascade Speculative Drafting For Even Faster LLM Inference Chen Ziyi, Yang Xiaocong, Lin Jiacheng, Sun Chenkai, Chang Kevin Chen-chuan, Huang Jie
- Visual Chatgpt: Talking, Drawing And Editing With Visual Foundation Models Chenfei Wu, Shengming Yin, Weizhen Qi, Xiaodong Wang, Zecheng Tang, Nan Duan
- UPRISE: Universal Prompt Retrieval For Improving Zero-shot Evaluation Cheng Daixuan, Huang Shaohan, Bi Junyu, Zhan Yuefeng, Liu Jianfeng, Wang Yujing, Sun Hao, Wei Furu, Deng Denvy, Zhang Qi
- Black-box Prompt Optimization: Aligning Large Language Models Without Model Training Cheng Jiale, Liu Xiao, Zheng Kehan, Ke Pei, Wang Hongning, Dong Yuxiao, Tang Jie, Huang Minlie
- Adversarial Preference Optimization: Enhancing Your Alignment Via RM-LLM Game Cheng Pengyu, Yang Yifan, Li Jian, Dai Yong, Hu Tianhao, Cao Peixin, Du Nan, Li Xiaolong
- Can We Edit Multimodal Large Language Models? Cheng Siyuan, Tian Bozhong, Liu Qingbin, Chen Xi, Wang Yongheng, Chen Huajun, Zhang Ningyu
- Batch Prompting: Efficient Inference With Large Language Model Apis Cheng Zhoujun, Kasai Jungo, Yu Tao
- Neural Codec Language Models Are Zero-shot Text To Speech Synthesizers Chengyi Wang, Sanyuan Chen, Yu Wu, Ziqiang Zhang, Long Zhou, Shujie Liu, Zhuo Chen, Yanqing Liu, Huaming Wang, Jinyu Li, Lei He, Sheng Zhao, Furu Wei
- M\(^{2}\)chat: Empowering VLM For Multimodal LLM Interleaved Text-image Generation Chi Xiaowei, Zhang Rongyu, Jiang Zhengkai, Liu Yijiang, Wang Yatian, Qi Xingqun, Luo Wenhan, Gao Peng, Zhang Shanghang, Liu Qifeng, Guo Yike
- INSTRUCTEVAL: Towards Holistic Evaluation Of Instruction-tuned Large Language Models Chia Yew Ken, Hong Pengfei, Bing Lidong, Poria Soujanya
- Do Llms Understand Social Knowledge? Evaluating The Sociability Of Large Language Models With Socket Benchmark Choi Minje, Pei Jiaxin, Kumar Sagar, Shu Chang, Jurgens David
- Navigate Through Enigmatic Labyrinth A Survey Of Chain Of Thought Reasoning: Advances, Frontiers And Future Chu Zheng, Chen Jingchang, Chen Qianglong, Yu Weijiang, He Tao, Wang Haotian, Peng Weihua, Liu Ming, Qin Bing, Liu Ting
- Timebench: A Comprehensive Evaluation Of Temporal Reasoning Abilities In Large Language Models Chu Zheng, Chen Jingchang, Chen Qianglong, Yu Weijiang, Wang Haotian, Liu Ming, Qin Bing
- Instructtods: Large Language Models For End-to-end Task-oriented Dialogue Systems Chung Willy, Cahyawijaya Samuel, Wilie Bryan, Lovenia Holy, Fung Pascale
- BERT Lost Patience Won't Be Robust To Adversarial Slowdown Coalson Zachary, Ritter Gabriel, Bobba Rakesh, Hong Sanghyun
- Seamlessm4t: Massively Multilingual & Multimodal Machine Translation Communication Seamless, Barrault Loïc, Chung Yu-an, Meglioli Mariano Coria, Dale David, Dong Ning, Duquenne Paul-ambroise, Elsahar Hady, Gong Hongyu, Heffernan Kevin, Hoffman John, Klaiber Christopher, Li Pengwei, Licht Daniel, Maillard Jean, Rakotoarison Alice, Sadagopan Kaushik Ram, Wenzek Guillaume, Ye Ethan, Akula Bapi, Chen Peng-jen, Hachem Naji El, Ellis Brian, Gonzalez Gabriel Mejia, Haaheim Justin, Hansanti Prangthip, Howes Russ, Huang Bernie, Hwang Min-jae, Inaguma Hirofumi, Jain Somya, Kalbassi Elahe, Kallet Amanda, Kulikov Ilia, Lam Janice, Li Daniel, Ma Xutai, Mavlyutov Ruslan, Peloquin Benjamin, Ramadan Mohamed, Ramakrishnan Abinesh, Sun Anna, Tran Kevin, Tran Tuan, Tufanov Igor, Vogeti Vish, Wood Carleigh, Yang Yilin, Yu Bokai, Andrews Pierre, Balioglu Can, Costa-jussà Marta R., Celebi Onur, Elbayad Maha, Gao Cynthia, Guzmán Francisco, Kao Justine, Lee Ann, Mourachko Alexandre, Pino Juan, Popuri Sravya, Ropers Christophe, Saleem Safiyyah, Schwenk Holger, Tomasello Paden, Wang Changhan, Wang Jeff, Wang Skyler
- Seamless: Multilingual Expressive And Streaming Speech Translation Communication Seamless, Barrault Loïc, Chung Yu-an, Meglioli Mariano Coria, Dale David, Dong Ning, Duppenthaler Mark, Duquenne Paul-ambroise, Ellis Brian, Elsahar Hady, Haaheim Justin, Hoffman John, Hwang Min-jae, Inaguma Hirofumi, Klaiber Christopher, Kulikov Ilia, Li Pengwei, Licht Daniel, Maillard Jean, Mavlyutov Ruslan, Rakotoarison Alice, Sadagopan Kaushik Ram, Ramakrishnan Abinesh, Tran Tuan, Wenzek Guillaume, Yang Yilin, Ye Ethan, Evtimov Ivan, Fernandez Pierre, Gao Cynthia, Hansanti Prangthip, Kalbassi Elahe, Kallet Amanda, Kozhevnikov Artyom, Gonzalez Gabriel Mejia, Roman Robin San, Touret Christophe, Wong Corinne, Wood Carleigh, Yu Bokai, Andrews Pierre, Balioglu Can, Chen Peng-jen, Costa-jussà Marta R., Elbayad Maha, Gong Hongyu, Guzmán Francisco, Heffernan Kevin, Jain Somya, Kao Justine, Lee Ann, Ma Xutai, Mourachko Alex, Peloquin Benjamin, Pino Juan, Popuri Sravya, Ropers Christophe, Saleem Safiyyah, Schwenk Holger, Sun Anna, Tomasello Paden, Wang Changhan, Wang Jeff, Wang Skyler, Williamson Mary
- Ultrafeedback: Boosting Language Models With Scaled AI Feedback Cui Ganqu, Yuan Lifan, Ding Ning, Yao Guanming, He Bingxiang, Zhu Wei, Ni Yuan, Xie Guotong, Xie Ruobing, Lin Yankai, Liu Zhiyuan, Sun Maosong
- Machine Mindset: An MBTI Exploration Of Large Language Models Cui Jiaxi, Lv Liuzhenghao, Wen Jing, Wang Rongsheng, Tang Jing, Tian Yonghong, Yuan Li
- Efficient And Effective Text Encoding For Chinese Llama And Alpaca Cui Yiming, Yang Ziqing, Yao Xin
- Neural Retrievers Are Biased Towards Llm-generated Content Dai Sunhao, Zhou Yuqi, Pang Liang, Liu Weihao, Hu Xiaolin, Liu Yong, Zhang Xiao, Wang Gang, Xu Jun
- Llmebench: A Flexible Framework For Accelerating Llms Benchmarking Dalvi Fahim, Hasanain Maram, Boughorbel Sabri, Mousi Basel, Abdaljalil Samir, Nazar Nizi, Abdelali Ahmed, Chowdhury Shammur Absar, Mubarak Hamdy, Ali Ahmed, Hawasly Majd, Durrani Nadir, Alam Firoj
- Educhat: A Large-scale Language Model-based Chatbot System For Intelligent Education Dan Yuhao, Lei Zhikai, Gu Yiyang, Li Yong, Yin Jianghao, Lin Jiaju, Ye Linhao, Tie Zhiyan, Zhou Yougen, Wang Yilei, Zhou Aimin, Zhou Ze, Chen Qin, Zhou Jie, He Liang, Qiu Xipeng
- Facechat: An Emotion-aware Face-to-face Dialogue Framework Deema Alnuhait, Qingyang Wu, Zhou Yu
- Controlled Text Generation Via Language Model Arithmetic Dekoninck Jasper, Fischer Marc, Beurer-kellner Luca, Vechev Martin
- Rephrase And Respond: Let Large Language Models Ask Better Questions For Themselves Deng Yihe, Zhang Weitong, Chen Zixiang, Gu Quanquan
- Multilingual Jailbreak Challenges In Large Language Models Deng Yue, Zhang Wenxuan, Pan Sinno Jialin, Bing Lidong
- Chatgpt Asks, BLIP-2 Answers: Automatic Questioning Towards Enriched Visual Descriptions Deyao Zhu, Jun Chen, Kilichbek Haydarov, Xiaoqian Shen, Wenxuan Zhang, Mohamed Elhoseiny
- Minigpt-4: Enhancing Vision-language Understanding With Advanced Large Language Models Deyao Zhu, Jun Chen, Xiaoqian Shen, Xiang Li, Mohamed Elhoseiny
- Lmflow: An Extensible Toolkit For Finetuning And Inference Of Large Foundation Models Diao Shizhe, Pan Rui, Dong Hanze, Shum Ka Shun, Zhang Jipeng, Xiong Wei, Zhang Tong
- Jump To Conclusions: Short-cutting Transformers With Linear Transformations Din Alexander Yom, Karidi Taelin, Choshen Leshem, Geva Mor
- Enhancing Chat Language Models By Scaling High-quality Instructional Conversations Ding Ning, Chen Yulin, Xu Bokai, Qin Yujia, Zheng Zhi, Hu Shengding, Liu Zhiyuan, Sun Maosong, Zhou Bowen
- A Wolf In Sheep's Clothing: Generalized Nested Jailbreak Prompts Can Fool Large Language Models Easily Ding Peng, Kuang Jun, Ma Dan, Cao Xuezhi, Xian Yunsen, Chen Jiajun, Huang Shujian
- BAMBOO: A Comprehensive Benchmark For Evaluating Long Text Modeling Capacities Of Large Language Models Dong Zican, Tang Tianyi, Li Junyi, Zhao Wayne Xin, Wen Ji-rong
- From Static To Dynamic: A Continual Learning Framework For Large Language Models Du Mingzhe, Luu Anh Tuan, Ji Bin, Ng See-kiong
- Classeval: A Manually-crafted Benchmark For Evaluating Llms On Class-level Code Generation Du Xueying, Liu Mingwei, Wang Kaixin, Wang Hanlin, Liu Junwei, Chen Yixuan, Feng Jiayi, Sha Chaofeng, Peng Xin, Lou Yiling
- Guiding Pretraining In Reinforcement Learning With Large Language Models Du Yuqing, Watkins Olivia, Wang Zihan, Colas Cédric, Darrell Trevor, Abbeel Pieter, Gupta Abhishek, Andreas Jacob
- Botchat: Evaluating Llms' Capabilities Of Having Multi-turn Dialogues Duan Haodong, Wei Jueqi, Wang Chonghua, Liu Hongwei, Fang Yixiao, Zhang Songyang, Lin Dahua, Chen Kai
- Shifting Attention To Relevance: Towards The Predictive Uncertainty Quantification Of Free-form Large Language Models Duan Jinhao, Cheng Hao, Wang Shiqi, Zavalny Alex, Wang Chenan, Xu Renjing, Kailkhura Bhavya, Xu Kaidi
- Alpacafarm: A Simulation Framework For Methods That Learn From Human Feedback Dubois Yann, Li Xuechen, Taori Rohan, Zhang Tianyi, Gulrajani Ishaan, Ba Jimmy, Guestrin Carlos, Liang Percy, Hashimoto Tatsunori B.
- Uncovering The Potential Of Chatgpt For Discourse Analysis In Dialogue: An Empirical Study Fan Yaxin, Jiang Feng, Li Peifeng, Li Haizhou
- Instructseq: Unifying Vision Tasks With Instruction-conditioned Multi-modal Sequence Generation Fang Rongyao, Yan Shilin, Huang Zhaoyang, Zhou Jingqiu, Tian Hao, Dai Jifeng, Li Hongsheng
- Tooltalk: Evaluating Tool-usage In A Conversational Setting Farn Nicholas, Shin Richard
- Transferable Decoding With Visual Entities For Zero-shot Image Captioning Fei Junjie, Wang Teng, Zhang Jinrui, He Zhenyu, Wang Chengjie, Zheng Feng
- Chessgpt: Bridging Policy Learning And Language Modeling Feng Xidong, Luo Yicheng, Wang Ziyan, Tang Hongrui, Yang Mengyue, Shao Kun, Mguni David, Du Yali, Wang Jun
- Context-aware Meta-learning Fifty Christopher, Duan Dennis, Junkins Ronald G., Amid Ehsan, Leskovec Jure, Ré Christopher, Thrun Sebastian
- Sparsegpt: Massive Language Models Can Be Accurately Pruned In One-shot Frantar Elias, Alistarh Dan
- A Challenger To GPT-4V? Early Explorations Of Gemini In Visual Expertise Fu Chaoyou, Zhang Renrui, Wang Zihan, Huang Yubo, Zhang Zhengye, Qiu Longtian, Ye Gaoxiang, Shen Yunhang, Zhang Mengdan, Chen Peixian, Zhao Sirui, Lin Shaohui, Jiang Deqiang, Yin Di, Gao Peng, Li Ke, Li Hongsheng, Sun Xing
- Gptscore: Evaluate As You Desire Fu Jinlan, Ng See-kiong, Jiang Zhengbao, Liu Pengfei
- Generate Then Select: Open-ended Visual Question Answering Guided By World Knowledge Fu Xingyu, Zhang Sheng, Kwon Gukyeong, Perera Pramuditha, Zhu Henghui, Zhang Yuhao, Li Alexander Hanbo, Wang William Yang, Wang Zhiguo, Castelli Vittorio, Ng Patrick, Roth Dan, Xiang Bing
- Mitigating Hallucination In Large Multi-modal Models Via Robust Instruction Tuning Fuxiao Liu, Kevin Lin, Linjie Li, Jianfeng Wang, Yaser Yacoob, Lijuan Wang
- Examining User-friendly And Open-sourced Large GPT Models: A Survey On Language, Multimodal, And Scientific GPT Models Gao Kaiyuan, He Sunan, He Zhenyu, Lin Jiacheng, Pei Qizhi, Shao Jie, Zhang Wei
- Llama-adapter V2: Parameter-efficient Visual Instruction Model Gao Peng, Han Jiaming, Zhang Renrui, Lin Ziyi, Geng Shijie, Zhou Aojun, Zhang Wei, Lu Pan, He Conghui, Yue Xiangyu, Li Hongsheng, Qiao Yu
- In-context Autoencoder For Context Compression In A Large Language Model Ge Tao, Hu Jing, Wang Lei, Wang Xun, Chen Si-qing, Wei Furu
- Mblip: Efficient Bootstrapping Of Multilingual Vision-llms Geigle Gregor, Jain Abhay, Timofte Radu, Glavaš Goran
- Flacuna: Unleashing The Problem Solving Power Of Vicuna Using FLAN Fine-tuning Ghosal Deepanway, Chia Yew Ken, Majumder Navonil, Poria Soujanya
- Model-generated Pretraining Signals Improves Zero-shot Generalization Of Text-to-text Transformers Gong Linyuan, Xiong Chenyan, Liu Xiaodong, Bajaj Payal, Xie Yiqing, Cheung Alvin, Gao Jianfeng, Song Xia
- Lil-bevo: Explorations Of Strategies For Training Language Models In More Humanlike Ways Govindarajan Venkata S, Rodriguez Juan Diego, Bostrom Kaj, Mahowald Kyle
- Leveraging Pre-trained Large Language Models To Construct And Utilize World Models For Model-based Task Planning Guan Lin, Valmeekam Karthik, Sreedharan Sarath, Kambhampati Subbarao
- Voyager: An Open-ended Embodied Agent With Large Language Models Guanzhi Wang, Yuqi Xie, Yunfan Jiang, Ajay Mandlekar, Chaowei Xiao, Yuke Zhu, Linxi Fan, Anima Anandkumar
- Longcoder: A Long-range Pre-trained Language Model For Code Completion Guo Daya, Xu Canwen, Duan Nan, Yin Jian, Mcauley Julian
- Meta-tuning Llms To Leverage Lexical Knowledge For Generalizable Language Style Understanding Guo Ruohao, Xu Wei, Ritter Alan
- What Can Large Language Models Do In Chemistry? A Comprehensive Benchmark On Eight Tasks Guo Taicheng, Guo Kehan, Nan Bozhao, Liang Zhenwen, Guo Zhichun, Chawla Nitesh V., Wiest Olaf, Zhang Xiangliang
- Doremi: Grounding Language Model By Detecting And Recovering From Plan-execution Misalignment Guo Yanjiang, Wang Yen-jen, Zha Lihan, Jiang Zheyuan, Chen Jianyu
- Learning To Plan With Natural Language Guo Yiduo, Liang Yaobo, Wu Chenfei, Wu Wenshan, Zhao Dongyan, Duan Nan
- PPTC Benchmark: Evaluating Large Language Models For Powerpoint Task Completion Guo Yiduo, Zhang Zekai, Liang Yaobo, Zhao Dongyan, Duan Nan
- Arthmodel: Enhance Arithmetic Skills To Large Language Model Guo Yingdi
- Gemrec: Towards Generative Model Recommendation Guo Yuanhe, Liu Haoming, Wen Hongyi
- Evaluating Large Language Models: A Comprehensive Survey Guo Zishan, Jin Renren, Liu Chuang, Huang Yufei, Shi Dan, Supryadi, Yu Linhao, Liu Yan, Li Jiaxuan, Xiong Bojian, Xiong Deyi
- Point-bind & Point-llm: Aligning Point Cloud With Multi-modality For 3D Understanding, Generation, And Instruction Following Guo Ziyu, Zhang Renrui, Zhu Xiangyang, Tang Yiwen, Ma Xianzheng, Han Jiaming, Chen Kexin, Gao Peng, Li Xianzhi, Li Hongsheng, Heng Pheng-ann
- Scaling Up And Distilling Down: Language-guided Robot Skill Acquisition Ha Huy, Florence Pete, Song Shuran
- Lm-infinite: Zero-shot Extreme Length Generalization For Large Language Models Han Chi, Wang Qifan, Peng Hao, Xiong Wenhan, Chen Yu, Ji Heng, Wang Sinong
- Onellm: One Framework To Align All Modalities With Language Han Jiaming, Gong Kaixiong, Zhang Yiyuan, Wang Jiaqi, Zhang Kaipeng, Lin Dahua, Qiao Yu, Gao Peng, Yue Xiangyu
- Imagebind-llm: Multi-modality Instruction Tuning Han Jiaming, Zhang Renrui, Shao Wenqi, Gao Peng, Xu Peng, Xiao Han, Zhang Kaipeng, Liu Chris, Wen Song, Guo Ziyu, Lu Xudong, Ren Shuai, Wen Yafei, Chen Xiaoxin, Yue Xiangyu, Li Hongsheng, Qiao Yu
- Cheddar: Student-chatgpt Dialogue In EFL Writing Education Han Jieun, Yoo Haneul, Myung Junho, Kim Minsun, Lee Tak Yeon, Ahn So-yeon, Oh Alice
- Reasoning Implicit Sentiment With Chain-of-thought Prompting Hao Fei, Bobo Li, Qian Liu, Lidong Bing, Fei Li, Tat-seng Chua
- Vico: Plug-and-play Visual Condition For Personalized Text-to-image Generation Hao Shaozhe, Han Kai, Zhao Shihao, Wong Kwan-yee K.
- Idealgpt: Iteratively Decomposing Vision And Language Reasoning Via Large Language Models Haoxuan You, Rui Sun, Zhecan Wang, Long Chen, Gengyu Wang, Hammad A. Ayyubi, Kai-wei Chang, Shih-fu Chang
- ICL-D3IE: In-context Learning With Diverse Demonstrations Updating For Document Information Extraction He Jiabang, Wang Lei, Hu Yi, Liu Ning, Liu Hui, Xu Xing, Shen Heng Tao
- Can Large Language Models Understand Real-world Complex Instructions? He Qianyu, Zeng Jie, Huang Wenhao, Chen Lina, Xiao Jin, He Qianxi, Zhou Xunzhe, Chen Lida, Wang Xintao, Huang Yuncheng, Ye Haoning, Li Zihan, Chen Shisong, Zhang Yikai, Gu Zhouhong, Liang Jiaqing, Xiao Yanghua
- Towards Verifiable Text Generation With Symbolic References Hennigen Lucas Torroba, Shen Shannon, Nrusimha Aniruddha, Gapp Bernhard, Sontag David, Kim Yoon
- DP-OPT: Make Large Language Model Your Privacy-preserving Prompt Engineer Hong Junyuan, Wang Jiachen T., Zhang Chenhui, Li Zhangheng, Li Bo, Wang Zhangyang
- Cogagent: A Visual Language Model For GUI Agents Hong Wenyi, Wang Weihan, Lv Qingsong, Xu Jiazheng, Yu Wenmeng, Ji Junhui, Wang Yan, Wang Zihan, Zhang Yuxuan, Li Juanzi, Xu Bin, Dong Yuxiao, Ding Ming, Tang Jie
- Do Large Language Models And Humans Have Similar Behaviors In Causal Inference With Script Knowledge? Hong Xudong, Ryzhova Margarita, Biondi Daniel Adrian, Demberg Vera
- Ralle: A Framework For Developing And Evaluating Retrieval-augmented Large Language Models Hoshi Yasuto, Miyashita Daisuke, Ng Youyang, Tatsuno Kento, Morioka Yasuhiro, Torii Osamu, Deguchi Jun
- Distilling Step-by-step! Outperforming Larger Language Models With Less Training Data And Smaller Model Sizes Hsieh Cheng-yu, Li Chun-liang, Yeh Chih-kuan, Nakhost Hootan, Fujii Yasuhisa, Ratner Alexander, Krishna Ranjay, Lee Chen-yu, Pfister Tomas
- Chain-of-symbol Prompting Elicits Planning In Large Language Models Hu Hanxu, Lu Hongyuan, Zhang Huajian, Song Yun-ze, Lam Wai, Zhang Yue
- BLIVA: A Simple Multimodal LLM For Better Handling Of Text-rich Visual Questions Hu Wenbo, Xu Yifan, Li Yi, Li Weiyue, Chen Zeyuan, Tu Zhuowen
- Deploying And Evaluating Llms To Program Service Mobile Robots Hu Zichao, Lucchetti Francesca, Schlesinger Claire, Saxena Yash, Freeman Anders, Modak Sadanand, Guha Arjun, Biswas Joydeep
- Lorahub: Efficient Cross-task Generalization Via Dynamic Lora Composition Huang Chengsong, Liu Qian, Lin Bill Yuchen, Pang Tianyu, Du Chao, Lin Min
- Acegpt, Localizing Large Language Models In Arabic Huang Huang, Yu Fei, Zhu Jianqing, Sun Xuening, Cheng Hao, Song Dingjie, Chen Zhihong, Alharthi Abdulmohsen, An Bang, He Juncai, Liu Ziche, Zhang Zhiyi, Chen Junying, Li Jianquan, Wang Benyou, Zhang Lian, Sun Ruoyu, Wan Xiang, Li Haizhou, Xu Jinchao
- Revisiting The Reliability Of Psychological Scales On Large Language Models Huang Jen-tse, Wang Wenxuan, Lam Man Ho, Li Eric John, Jiao Wenxiang, Lyu Michael R.
- Who Is Chatgpt? Benchmarking Llms' Psychological Portrayal Using Psychobench Huang Jen-tse, Wang Wenxuan, Li Eric John, Lam Man Ho, Ren Shujie, Yuan Youliang, Jiao Wenxiang, Tu Zhaopeng, Lyu Michael R.
- Chat Vector: A Simple Approach To Equip Llms With Instruction Following And Model Alignment In New Languages Huang Shih-cheng, Li Pin-zu, Hsu Yu-chi, Chen Kuang-ming, Lin Yu Tung, Hsiao Shih-kai, Tsai Richard Tzong-han, Lee Hung-yi
- Catastrophic Jailbreak Of Open-source Llms Via Exploiting Generation Huang Yangsibo, Gupta Samyak, Xia Mengzhou, Li Kai, Chen Danqi
- CLOMO: Counterfactual Logical Modification With Large Language Models Huang Yinya, Hong Ruixin, Zhang Hongming, Shao Wei, Yang Zhicheng, Yu Dong, Zhang Changshui, Liang Xiaodan, Song Linqi
- Advancing Transformer Architecture In Long-context Large Language Models: A Comprehensive Survey Huang Yunpeng, Xu Jingwei, Lai Junyu, Jiang Zixu, Chen Taolue, Li Zenan, Yao Yuan, Ma Xiaoxing, Yang Lijuan, Chen Hao, Li Shupeng, Zhao Penghao
- Evaluating And Enhancing Large Language Models For Conversational Reasoning On Knowledge Graphs Huang Yuxuan, Shi Lida, Liu Anqi, Xu Hao
- Speech Translation With Large Language Models: An Industrial Practice Huang Zhichao, Ye Rong, Ko Tom, Dong Qianqian, Cheng Shanbo, Wang Mingxuan, Li Hang
- Vimq: A Vietnamese Medical Question Dataset For Healthcare Dialogue System Development Huy Ta Duc, Tu Nguyen Anh, Vu Tran Hoang, Minh Nguyen Phuc, Phan Nguyen, Bui Trung H., Truong Steven Q. H.
- Glot500: Scaling Multilingual Corpora And Language Models To 500 Languages Imani Ayyoob, Lin Peiqin, Kargaran Amir Hossein, Severini Silvia, Sabet Masoud Jalili, Kassner Nora, Ma Chunlan, Schmid Helmut, Martins André F. T., Yvon François, Schütze Hinrich
- Personalized Soups: Personalized Large Language Model Alignment Via Post-hoc Parameter Merging Jang Joel, Kim Seungone, Lin Bill Yuchen, Wang Yizhong, Hessel Jack, Zettlemoyer Luke, Hajishirzi Hannaneh, Choi Yejin, Ammanabrolu Prithviraj
- Chatbot Is Not All You Need: Information-rich Prompting For More Realistic Responses Jeong Seokhoon, Makhmud Assentay
- Inpars-v2: Large Language Models As Efficient Dataset Generators For Information Retrieval Jeronymo Vitor, Bonifacio Luiz, Abonizio Hugo, Fadaee Marzieh, Lotufo Roberto, Zavrel Jakub, Nogueira Rodrigo
- Quip: 2-bit Quantization Of Large Language Models With Guarantees Jerry Chee, Yaohui Cai, Volodymyr Kuleshov, Christopher De Sa
- Bootstrapping Vision-language Learning With Decoupled Language Pre-training Jian Yiren, Gao Chongyang, Vosoughi Soroush
- Expedited Training Of Visual Conditioned Language Generation Via Redundancy Reduction Jian Yiren, Liu Tingkai, Tao Yunzhe, Zhang Chunhui, Vosoughi Soroush, Yang Hongxia
- Llmlingua: Compressing Prompts For Accelerated Inference Of Large Language Models Jiang Huiqiang, Wu Qianhui, Lin Chin-yew, Yang Yuqing, Qiu Lili
- Longllmlingua: Accelerating And Enhancing Llms In Long Context Scenarios Via Prompt Compression Jiang Huiqiang, Wu Qianhui, Luo Xufang, Li Dongsheng, Lin Chin-yew, Yang Yuqing, Qiu Lili
- Scaling Sentence Embeddings With Large Language Models Jiang Ting, Huang Shaohan, Luan Zhongzhi, Wang Deqing, Zhuang Fuzhen
- Lion: Adversarial Distillation Of Proprietary Large Language Models Jiang Yuxin, Chan Chunkit, Chen Mingyang, Wang Wei
- Followbench: A Multi-level Fine-grained Constraints Following Benchmark For Large Language Models Jiang Yuxin, Wang Yufei, Zeng Xingshan, Zhong Wanjun, Li Liangyou, Mi Fei, Shang Lifeng, Jiang Xin, Liu Qun, Wang Wei
- How Can Recommender Systems Benefit From Large Language Models: A Survey Jianghao Lin, Xinyi Dai, Yunjia Xi, Weiwen Liu, Bo Chen, Hao Zhang, Yong Liu, Chuhan Wu, Xiangyang Li, Chenxu Zhu, Huifeng Guo, Yong Yu, Ruiming Tang, Weinan Zhang
- Parrot: Translating During Chat Using Large Language Models Tuned With Human Translation And Feedback Jiao Wenxiang, Huang Jen-tse, Wang Wenxuan, He Zhiwei, Liang Tian, Wang Xing, Shi Shuming, Tu Zhaopeng
- Llara: Large Language-recommendation Assistant Jiayi Liao, Sihang Li, Zhengyi Yang, Jiancan Wu, Yancheng Yuan, Xiang Wang, Xiangnan He
- Emergent Representations Of Program Semantics In Language Models Trained On Programs Jin Charles, Rinard Martin
- Vcoder: Versatile Vision Encoders For Multimodal Large Language Models Jitesh Jain, Jianwei Yang, Humphrey Shi
- Is Chatgpt Fair For Recommendation? Evaluating Fairness In Large Language Model Recommendation Jizhi Zhang, Keqin Bao, Yang Zhang, Wenjie Wang, Fuli Feng, Xiangnan He
- AMBER: An Llm-free Multi-dimensional Benchmark For Mllms Hallucination Evaluation Junyang Wang, Yuhang Wang, Guohai Xu, Jing Zhang, Yukai Gu, Haitao Jia, Jiaqi Wang, Haiyang Xu, Ming Yan, Ji Zhang, Jitao Sang
- Impact Of Co-occurrence On Factual Knowledge Of Large Language Models Kang Cheongwoong, Choi Jaesik
- Evaluating GPT-4 And Chatgpt On Japanese Medical Licensing Examinations Kasai Jungo, Kasai Yuhei, Sakaguchi Keisuke, Yamada Yutaro, Radev Dragomir
- Shikra: Unleashing Multimodal Llm's Referential Dialogue Magic Keqin Chen, Zhao Zhang, Weili Zeng, Richong Zhang, Feng Zhu, Rui Zhao
- Turning English-centric Llms Into Polyglots: How Much Multilinguality Is Needed? Kew Tannon, Schottmann Florian, Sennrich Rico
- Assessing The Promise And Pitfalls Of Chatgpt For Automated Code Generation Khan Muhammad Fawad Akbar, Ramsdell Max, Falor Erik, Karimi Hamid
- Q: How To Specialize Large Vision-language Models To Data-scarce VQA Tasks? A: Self-train On Unlabeled Images! Khan Zaid, Bg Vijay Kumar, Schulter Samuel, Yu Xiang, Fu Yun, Chandraker Manmohan
- Dspy: Compiling Declarative Language Model Calls Into Self-improving Pipelines Khattab Omar, Singhvi Arnav, Maheshwari Paridhi, Zhang Zhiyuan, Santhanam Keshav, Vardhamanan Sri, Haq Saiful, Sharma Ashutosh, Joshi Thomas T., Moazam Hanna, Miller Heather, Zaharia Matei, Potts Christopher
- Reliability Check: An Analysis Of GPT-3's Response To Sensitive Topics And Prompt Wording Khatun Aisha, Brown Daniel G.
- Visually-situated Natural Language Understanding With Contrastive Reading Model And Frozen Large Language Models Kim Geewook, Lee Hodong, Kim Daehee, Jung Haeji, Park Sanghee, Kim Yoonsik, Yun Sangdoo, Kil Taeho, Lee Bado, Park Seunghyun
- Language Models Can Solve Computer Tasks Kim Geunwoo, Baldi Pierre, Mcaleer Stephen
- Compressed Context Memory For Online Language Model Interaction Kim Jang-hyun, Yeom Junyoung, Yun Sangdoo, Song Hyun Oh
- Squeezellm: Dense-and-sparse Quantization Kim Sehoon, Hooper Coleman, Gholami Amir, Dong Zhen, Li Xiuyu, Shen Sheng, Mahoney Michael W., Keutzer Kurt
- Cotever: Chain Of Thought Prompting Annotation Toolkit For Explanation Verification Kim Seungone, Joo Se June, Jang Yul, Chae Hyungjoo, Yeo Jinyoung
- Prometheus: Inducing Fine-grained Evaluation Capability In Language Models Kim Seungone, Shin Jamin, Cho Yejin, Jang Joel, Longpre Shayne, Lee Hwaran, Yun Sangdoo, Shin Seongjin, Kim Sungdong, Thorne James, Seo Minjoon
- Aligning Large Language Models Through Synthetic Feedback Kim Sungdong, Bae Sanghwan, Shin Jamin, Kang Soyoung, Kwak Donghyun, Yoo Kang Min, Seo Minjoon
- Large Language Models Are Temporal And Causal Reasoners For Video Question Answering Ko Dohwan, Lee Ji Soo, Kang Wooyoung, Roh Byungseok, Kim Hyunwoo J.
- Revisiting Intermediate Layer Distillation For Compressing Language Models: An Overfitting Perspective Ko Jongwoo, Park Seungjoon, Jeong Minchan, Hong Sukjin, Ahn Euijai, Chang Du-seong, Yun Se-young
- Better Zero-shot Reasoning With Role-play Prompting Kong Aobo, Zhao Shiwan, Chen Hao, Li Qicheng, Qin Yong, Sun Ruiqi, Zhou Xin, Wang Enzhi, Dong Xiaohang
- NOLA: Compressing Lora Using Linear Combination Of Random Basis Koohpayegani Soroush Abbasi, Navaneet Kl, Nooralinejad Parsa, Kolouri Soheil, Pirsiavash Hamed
- Implicit Meta-learning May Lead Language Models To Trust More Reliable Sources Krasheninnikov Dmitrii, Krasheninnikov Egor, Mlodozeniec Bruno, Maharaj Tegan, Krueger David
- Certifying LLM Safety Against Adversarial Prompting Kumar Aounon, Agarwal Chirag, Srinivas Suraj, Li Aaron Jiaxun, Feizi Soheil, Lakkaraju Himabindu
- Longform: Effective Instruction Tuning With Reverse Instructions Köksal Abdullatif, Schick Timo, Korhonen Anna, Schütze Hinrich
- LISA: Reasoning Segmentation Via Large Language Model Lai Xin, Tian Zhuotao, Chen Yukang, Li Yanwei, Yuan Yuhui, Liu Shu, Jia Jiaya
- Veclip: Improving CLIP Training Via Visual-enriched Captions Lai Zhengfeng, Zhang Haotian, Zhang Bowen, Wu Wentao, Bai Haoping, Timofeev Aleksei, Du Xianzhi, Gan Zhe, Shan Jiulong, Chuah Chen-nee, Yang Yinfei, Cao Meng
- Large Language Models Are Partially Primed In Pronoun Interpretation Lam Suet-ying, Zeng Qingcheng, Zhang Kexun, You Chenyu, Voigt Rob
- Improving Zero-shot Visual Question Answering Via Large Language Models With Reasoning Question Prompts Lan Yunshi, Li Xiang, Liu Xin, Li Yang, Qin Wei, Qian Weining
- Improved Instruction Ordering In Recipe-grounded Conversation Le Duong Minh, Guo Ruohao, Xu Wei, Ritter Alan
- Read-only Prompt Optimization For Vision-language Few-shot Learning Lee Dongjun, Song Seokwon, Suh Jihee, Choi Joonmyung, Lee Sanghyeok, Kim Hyunwoo J.
- Building Multimodal AI Chatbots Lee Min Young
- Can Large Language Models Capture Dissenting Human Voices? Lee Noah, An Na Min, Thorne James
- Volcano: Mitigating Multimodal Hallucination Through Self-feedback Guided Revision Lee Seongyun, Park Sue Hyun, Jo Yongrae, Seo Minjoon
- Knowledge Corpus Error In Question Answering Lee Yejoon, Oh Philhoon, Thorne James
- Large Language Models Can Share Images, Too! Lee Young-jun, Lee Dokyong, Sung Joo Won, Hyeon Jonghwan, Choi Ho-jin
- Creating A Dataset For High-performance Computing Code Translation Using Llms: A Bridge Between Openmp Fortran And C++ Lei Bin, Ding Caiwen, Chen Le, Lin Pei-hung, Liao Chunhua
- Zero-shot Next-item Recommendation Using Large Pretrained Language Models Lei Wang, Ee-peng Lim
- Mitigating Fine-grained Hallucination By Fine-tuning Large Vision-language Models With Caption Rewrites Lei Wang, Jiabang He, Shenshen Li, Ning Liu, Ee-peng Lim
- Llm4vis: Explainable Visualization Recommendation Using Chatgpt Lei Wang, Songheng Zhang, Yun Wang, Ee-peng Lim, Yong Wang
- T-sciq: Teaching Multimodal Chain-of-thought Reasoning Via Mixed Large Language Model Signals For Science Question Answering Lei Wang, Yi Hu, Jiabang He, Xing Xu, Ning Liu, Hui Liu, Heng Tao Shen
- Recexplainer: Aligning Large Language Models For Explaining Recommendation Models Lei Yuxuan, Lian Jianxun, Yao Jing, Huang Xu, Lian Defu, Xie Xing
- Sequential Monte Carlo Steering Of Large Language Models Using Probabilistic Programs Lew Alexander K., Zhi-xuan Tan, Grand Gabriel, Mansinghka Vikash K.
- Seed-bench-2: Benchmarking Multimodal Large Language Models Li Bohao, Ge Yuying, Ge Yixiao, Wang Guangzhi, Wang Rui, Zhang Ruimao, Shan Ying
- This Land Is {your, My} Land: Evaluating Geopolitical Biases In Language Models Li Bryan, Haider Samar, Callison-burch Chris
- Vision-language Instruction Tuning: A Review And Analysis Li Chen, Ge Yixiao, Li Dian, Shan Ying
- Mugglemath: Assessing The Impact Of Query And Response Augmentation On Math Reasoning Li Chengpeng, Yuan Zheng, Yuan Hongyi, Dong Guanting, Lu Keming, Wu Jiancan, Tan Chuanqi, Wang Xiang, Zhou Chang
- Modelscope-agent: Building Your Customizable Agent System With Open-source Large Language Models Li Chenliang, Chen Hehong, Yan Ming, Shen Weizhou, Xu Haiyang, Wu Zhikai, Zhang Zhicheng, Zhou Wenmeng, Chen Yingda, Cheng Chen, Shi Hongzhu, Zhang Ji, Huang Fei, Zhou Jingren
- DISTFLASHATTN: Distributed Memory-efficient Attention For Long-context Llms Training Li Dacheng, Shao Rulin, Xie Anze, Xing Eric P., Ma Xuezhe, Stoica Ion, Gonzalez Joseph E., Zhang Hao
- Explaincpe: A Free-text Explanation Benchmark Of Chinese Pharmacist Examination Li Dongfang, Yu Jindi, Hu Baotian, Xu Zhenran, Zhang Min
- Mastering Robot Manipulation With Multimodal Prompts Through Pretraining And Multi-task Fine-tuning Li Jiachen, Gao Qiaozi, Johnston Michael, Gao Xiaofeng, He Xuehai, Shakiah Suhaila, Shi Hangjie, Ghanadan Reza, Wang William Yang
- CFGPT: Chinese Financial Assistant With Large Language Model Li Jiangtong, Bian Yuxuan, Wang Guoxuan, Lei Yang, Cheng Dawei, Ding Zhijun, Jiang Changjun
- Distilling Chatgpt For Explainable Automated Student Answer Assessment Li Jiazheng, Gui Lin, Zhou Yuxiang, West David, Aloisi Cesare, He Yulan
- Contradoc: Understanding Self-contradictions In Documents With Large Language Models Li Jierui, Raheja Vipul, Kumar Dhruv
- Motcoder: Elevating Large Language Models With Modular Of Thought For Challenging Programming Tasks Li Jingyao, Chen Pengguang, Xia Bin, Xu Hong, Jia Jiaya
- Fine-tuning Multimodal Llms To Follow Zero-shot Demonstrative Instructions Li Juncheng, Pan Kaihang, Ge Zhiqi, Gao Minghe, Ji Wei, Zhang Wenqiao, Chua Tat-seng, Tang Siliang, Zhang Hanwang, Zhuang Yueting
- Generative Judge For Evaluating Alignment Li Junlong, Sun Shichao, Yuan Weizhe, Fan Run-ze, Zhao Hai, Liu Pengfei
- Openba: An Open-sourced 15B Bilingual Asymmetric Seq2seq Model Pre-trained From Scratch Li Juntao, Tang Zecheng, Ding Yuyang, Wang Pinzheng, Guo Pei, You Wangjie, Qiao Dan, Chen Wenliang, Fu Guohong, Zhu Qiaoming, Zhou Guodong, Zhang Min
- Instructcoder: Instruction Tuning Large Language Models For Code Editing Li Kaixin, Hu Qisheng, Zhao Xu, Chen Hui, Xie Yuxi, Liu Tiedong, Xie Qizhe, He Junxian
- Mvbench: A Comprehensive Multi-modal Video Understanding Benchmark Li Kunchang, Wang Yali, He Yinan, Li Yizhuo, Wang Yi, Liu Yi, Wang Zun, Xu Jilan, Chen Guo, Luo Ping, Wang Limin, Qiao Yu
- Tool-augmented Reward Modeling Li Lei, Chai Yekun, Wang Shuohuan, Sun Yu, Tian Hao, Zhang Ningyu, Wu Hua
- Are Human-generated Demonstrations Necessary For In-context Learning? Li Rui, Wang Guoyin, Li Jiwei
- TRAQ: Trustworthy Retrieval Augmented Question Answering Via Conformal Prediction Li Shuo, Park Sangdon, Lee Insup, Bastani Osbert
- FLM-101B: An Open LLM And How To Train It With $100K Budget Li Xiang, Yao Yiqun, Jiang Xin, Fang Xuezhi, Meng Xuying, Fan Siqi, Han Peng, Li Jing, Du Li, Qin Bowen, Zhang Zheng, Sun Aixin, Wang Yequan
- Image Content Generation With Causal Reasoning Li Xiaochuan, Fan Baoyu, Zhang Runze, Jin Liang, Wang Di, Guo Zhenhua, Zhao Yaqian, Li Rengang
- Loftq: Lora-fine-tuning-aware Quantization For Large Language Models Li Yixiao, Yu Yifan, Liang Chen, He Pengcheng, Karampatziakis Nikos, Chen Weizhu, Zhao Tuo
- Guiding Large Language Models Via Directional Stimulus Prompting Li Zekun, Peng Baolin, He Pengcheng, Galley Michel, Gao Jianfeng, Yan Xifeng
- Unveiling The Pitfalls Of Knowledge Editing For Large Language Models Li Zhoubo, Zhang Ningyu, Yao Yunzhi, Wang Mengru, Chen Xi, Chen Huajun
- Llm-grounded Diffusion: Enhancing Prompt Understanding Of Text-to-image Diffusion Models With Large Language Models Lian Long, Li Boyi, Yala Adam, Darrell Trevor
- Towards End-to-end Embodied Decision Making Via Multi-modal Large Language Model: Explorations With Gpt4-vision And Beyond Liang Chen, Yichi Zhang, Shuhuai Ren, Haozhe Zhao, Zefan Cai, Yuchi Wang, Peiyi Wang, Tianyu Liu, Baobao Chang
- Encouraging Divergent Thinking In Large Language Models Through Multi-agent Debate Liang Tian, He Zhiwei, Jiao Wenxiang, Wang Xing, Wang Rui, Yang Yujiu, Tu Zhaopeng, Shi Shuming
- A Survey On Large Language Models For Recommendation Likang Wu, Zhi Zheng, Zhaopeng Qiu, Hao Wang, Hongchao Gu, Tingjia Shen, Chuan Qin, Chen Zhu, Hengshu Zhu, Qi Liu, Hui Xiong, Enhong Chen
- Agentsims: An Open-source Sandbox For Large Language Model Evaluation Lin Jiaju, Zhao Haoran, Zhang Aochi, Wu Yiting, Ping Huqiuyue, Chen Qin
- Rella: Retrieval-enhanced Large Language Models For Lifelong Sequential Behavior Comprehension In Recommendation Lin Jianghao, Shan Rong, Zhu Chenxu, Du Kounianhua, Chen Bo, Quan Shigang, Tang Ruiming, Yu Yong, Zhang Weinan
- Argue With Me Tersely: Towards Sentence-level Counter-argument Generation Lin Jiayu, Ye Rong, Han Meng, Zhang Qi, Lai Ruofei, Zhang Xinyu, Cao Zhao, Huang Xuanjing, Wei Zhongyu
- Advances In Embodied Navigation Using Large Language Models: A Survey Lin Jinzhou, Gao Han, Feng Xuxiang, Xu Rongtao, Wang Changwei, Zhang Man, Guo Li, Xu Shibiao
- Generating With Confidence: Uncertainty Quantification For Black-box Large Language Models Lin Zhen, Trivedi Shubhendu, Sun Jimeng
- SPHINX: The Joint Mixing Of Weights, Tasks, And Visual Embeddings For Multi-modal Large Language Models Lin Ziyi, Liu Chris, Zhang Renrui, Gao Peng, Qiu Longtian, Xiao Han, Qiu Han, Lin Chen, Shao Wenqi, Chen Keqin, Han Jiaming, Huang Siyuan, Zhang Yichi, He Xuming, Li Hongsheng, Qiao Yu
- Unleashing The Creative Mind: Language Model As Hierarchical Policy For Improved Exploration On Challenging Problem Solving Ling Zhan, Fang Yunhao, Li Xuanlin, Mu Tongzhou, Lee Mingu, Pourreza Reza, Memisevic Roland, Su Hao
- External Reasoning: Towards Multi-large-language-models Interchangeable Assistance With Human Feedback Liu Akide
- MMC: Advancing Multimodal Chart Understanding With Large-scale Instruction Tuning Liu Fuxiao, Wang Xiaoyang, Yao Wenlin, Chen Jianshu, Song Kaiqiang, Cho Sangwoo, Yacoob Yaser, Yu Dong
- RETA-LLM: A Retrieval-augmented Large Language Model Toolkit Liu Jiongnan, Jin Jiajie, Wang Zihan, Cheng Jiehan, Dou Zhicheng, Wen Ji-rong
- Alignbench: Benchmarking Chinese Alignment Of Large Language Models Liu Xiao, Lei Xuanyu, Wang Shengyuan, Huang Yue, Feng Zhuoer, Wen Bosi, Cheng Jiale, Ke Pei, Xu Yifan, Tam Weng Lam, Zhang Xiaohan, Sun Lichao, Gu Xiaotao, Wang Hongning, Zhang Jing, Huang Minlie, Dong Yuxiao, Tang Jie
- Online Speculative Decoding Liu Xiaoxuan, Hu Lanxiang, Bailis Peter, Cheung Alvin, Deng Zhijie, Stoica Ion, Zhang Hao
- Mm-safetybench: A Benchmark For Safety Evaluation Of Multimodal Large Language Models Liu Xin, Zhu Yichen, Gu Jindong, Lan Yunshi, Yang Chao, Qiao Yu
- Wavjourney: Compositional Audio Creation With Large Language Models Liu Xubo, Zhu Zhongkai, Liu Haohe, Yuan Yi, Cui Meng, Huang Qiushi, Liang Jinhua, Cao Yin, Kong Qiuqiang, Plumbley Mark D., Wang Wenwu
- G-eval: NLG Evaluation Using GPT-4 With Better Human Alignment Liu Yang, Iter Dan, Xu Yichong, Wang Shuohang, Xu Ruochen, Zhu Chenguang
- Cachegen: KV Cache Compression And Streaming For Fast Large Language Model Serving Liu Yuhan, Li Hanchen, Cheng Yihua, Ray Siddhant, Huang Yuyang, Zhang Qizheng, Du Kuntai, Yao Jiayi, Lu Shan, Ananthanarayanan Ganesh, Maire Michael, Hoffmann Henry, Holtzman Ari, Jiang Junchen
- Binary And Ternary Natural Language Generation Liu Zechun, Oguz Barlas, Pappu Aasish, Shi Yangyang, Krishnamoorthi Raghuraman
- LLM360: Towards Fully Transparent Open-source Llms Liu Zhengzhong, Qiao Aurick, Neiswanger Willie, Wang Hongyi, Tan Bowen, Tao Tianhua, Li Junbo, Wang Yuqi, Sun Suqi, Pangarkar Omkar, Fan Richard, Gu Yi, Miller Victor, Zhuang Yonghao, He Guowei, Li Haonan, Koto Fajri, Tang Liping, Ranjan Nikhil, Shen Zhiqiang, Ren Xuguang, Iriondo Roberto, Mu Cun, Hu Zhiting, Schulze Mark, Nakov Preslav, Baldwin Tim, Xing Eric P.
- Deja Vu: Contextual Sparsity For Efficient Llms At Inference Time Liu Zichang, Wang Jue, Dao Tri, Zhou Tianyi, Yuan Binhang, Song Zhao, Shrivastava Anshumali, Zhang Ce, Tian Yuandong, Ré Christopher, Chen Beidi
- Bounding The Capabilities Of Large Language Models In Open Text Generation With Prompt Constraints Lu Albert, Zhang Hongxin, Zhang Yanzhe, Wang Xuezhi, Yang Diyi
- Memochat: Tuning Llms To Use Memos For Consistent Long-range Open-domain Conversation Lu Junru, An Siyu, Lin Mingbao, Pergola Gabriele, He Yulan, Yin Di, Sun Xing, Wu Yunsheng
- Ziya-visual: Bilingual Large Vision-language Model Via Multi-task Instruction Tuning Lu Junyu, Zhang Dixiang, Wu Xiaojun, Gao Xinyu, Gan Ruyi, Zhang Jiaxing, Song Yan, Zhang Pingjian
- Large Language Models Can Be Guided To Evade Ai-generated Text Detection Lu Ning, Liu Shengcai, He Rui, Wang Qi, Ong Yew-soon, Tang Ke
- SCITAB: A Challenging Benchmark For Compositional Reasoning And Claim Verification On Scientific Tables Lu Xinyuan, Pan Liangming, Liu Qian, Nakov Preslav, Kan Min-yen
- Multimodal Procedural Planning Via Dual Text-image Prompting Lu Yujie, Lu Pan, Chen Zhiyu, Zhu Wanrong, Wang Xin Eric, Wang William Yang
- Fighting Fire With Fire: The Dual Role Of Llms In Crafting And Detecting Elusive Disinformation Lucas Jason, Uchendu Adaku, Yamashita Michiharu, Lee Jooyoung, Rohatgi Shaurya, Lee Dongwon
- Cheap And Quick: Efficient Vision-language Instruction Tuning For Large Language Models Luo Gen, Zhou Yiyi, Ren Tianhe, Chen Shengxin, Sun Xiaoshuai, Ji Rongrong
- Wizardmath: Empowering Mathematical Reasoning For Large Language Models Via Reinforced Evol-instruct Luo Haipeng, Sun Qingfeng, Xu Can, Zhao Pu, Lou Jianguang, Tao Chongyang, Geng Xiubo, Lin Qingwei, Chen Shifeng, Zhang Dongmei
- Biomedgpt: Open Multimodal Generative Pre-trained Transformer For Biomedicine Luo Yizhen, Zhang Jiahuan, Fan Siqi, Yang Kai, Wu Yushuai, Qiao Mu, Nie Zaiqing
- Unlocking The Potential Of Large Language Models For Explainable Recommendations Luo Yucong, Cheng Mingyue, Zhang Hao, Lu Junyu, Liu Qi, Chen Enhong
- Full Parameter Fine-tuning For Large Language Models With Limited Resources Lv Kai, Yang Yuqing, Liu Tengxiao, Gao Qinghui, Guo Qipeng, Qiu Xipeng
- Collie: Collaborative Training Of Large Language Models In An Efficient Way Lv Kai, Zhang Shuo, Gu Tianle, Xing Shuhao, Hong Jiawei, Chen Keyu, Liu Xiaoran, Yang Yuqing, Guo Honglin, Liu Tengxiao, Sun Yu, Guo Qipeng, Yan Hang, Qiu Xipeng
- KOSMOS-2.5: A Multimodal Literate Model Lv Tengchao, Huang Yupan, Chen Jingye, Zhao Yuzhong, Jia Yilin, Cui Lei, Ma Shuming, Chang Yaoyao, Huang Shaohan, Wang Wenhui, Dong Li, Luo Weiyao, Wu Shaoxiang, Wang Guoxin, Zhang Cha, Wei Furu
- An Examination Of The Compositionality Of Large Generative Vision-language Models Ma Teli, Li Rong, Liang Junwei
- At Which Training Stage Does Code Data Help Llms Reasoning? Ma Yingwei, Liu Yue, Yu Yue, Zhang Yuanliang, Jiang Yu, Wang Changjian, Li Shanshan
- Lampilot: An Open Benchmark Dataset For Autonomous Driving With Language Model Programs Ma Yunsheng, Cui Can, Cao Xu, Ye Wenqian, Liu Peiran, Lu Juanwu, Abdelraouf Amr, Gupta Rohit, Han Kyungtae, Bera Aniket, Rehg James M., Wang Ziran
- World-to-words: Grounded Open Vocabulary Acquisition Through Fast Mapping In Vision-language Models Ma Ziqiao, Pan Jiayi, Chai Joyce
- Video-chatgpt: Towards Detailed Video Understanding Via Large Vision And Language Models Maaz Muhammad, Rasheed Hanoona, Khan Salman, Khan Fahad Shahbaz
- Harnessing The Power Of Prompt-based Techniques For Generating School-level Questions Using Large Language Models Maity Subhankar, Deroy Aniket, Sarkar Sudeshna
- Enhancing CLIP With GPT-4: Harnessing Visual Descriptions As Prompts Maniparambil Mayug, Vorster Chris, Molloy Derek, Murphy Noel, Mcguinness Kevin, O'connor Noel E.
- Editing Personality For Large Language Models Mao Shengyu, Wang Xiaohan, Wang Mengru, Jiang Yong, Xie Pengjun, Huang Fei, Zhang Ningyu
- Unitrec: A Unified Text-to-text Transformer And Joint Contrastive Learning Framework For Text-based Recommendation Mao Zhiming, Wang Huimin, Du Yiming, Wong Kam-fai
- Lightlm: A Lightweight Deep And Narrow Language Model For Generative Recommendation Mei Kai, Zhang Yongfeng
- Specinfer: Accelerating Generative Large Language Model Serving With Tree-based Speculative Inference And Verification Miao Xupeng, Oliaro Gabriele, Zhang Zhihao, Cheng Xinhao, Wang Zeyu, Zhang Zhengxin, Wong Rae Ying Yee, Zhu Alan, Yang Lijie, Shi Xiaoxiang, Shi Chunan, Chen Zhuoming, Arfeen Daiyaan, Abhyankar Reyna, Jia Zhihao
- MEDIMP: 3D Medical Images With Clinical Prompts From Limited Tabular Data For Renal Transplantation Milecki Leo, Kalogeiton Vicky, Bodard Sylvain, Anglicheau Dany, Correas Jean-michel, Timsit Marc-olivier, Vakalopoulou Maria
- Beyond Accuracy: Evaluating Self-consistency Of Code Large Language Models With Identitychain Min Marcus J., Ding Yangruibo, Buratti Luca, Pujar Saurabh, Kaiser Gail, Jana Suman, Ray Baishakhi
- Compositional Chain-of-thought Prompting For Large Multimodal Models Mitra Chancharik, Huang Brandon, Darrell Trevor, Herzig Roei
- Landmark Attention: Random-access Infinite Context Length For Transformers Mohtashami Amirkeivan, Jaggi Martin
- Octopack: Instruction Tuning Code Large Language Models Muennighoff Niklas, Liu Qian, Zebaze Armel, Zheng Qinkai, Hui Binyuan, Zhuo Terry Yue, Singh Swayam, Tang Xiangru, Von Werra Leandro, Longpre Shayne
- Arabic Mini-climategpt: A Climate Change And Sustainability Tailored Arabic LLM Mullappilly Sahal Shaji, Shaker Abdelrahman, Thawakar Omkar, Cholakkal Hisham, Anwer Rao Muhammad, Khan Salman, Khan Fahad Shahbaz
- Unival: Unified Model For Image, Video, Audio And Language Tasks Mustafa Shukor, Corentin Dancette, Alexandre Rame, Matthieu Cord
- Large Language Models Are Fixated By Red Herrings: Exploring Creative Problem Solving And Einstellung Effect Using The Only Connect Wall Dataset Naeini Saeid, Saqur Raeid, Saeidi Mozhgan, Giorgi John, Taati Babak
- DERA: Enhancing Large Language Model Completions With Dialog-enabled Resolving Agents Nair Varun, Schumacher Elliot, Tso Geoffrey, Kannan Anitha
- On Robustness Of Finetuned Transformer-based NLP Models Neerudu Pavan Kalyan Reddy, Oota Subba Reddy, Marreddy Mounika, Kagita Venkateswara Rao, Gupta Manish
- Simplyretrieve: A Private And Lightweight Retrieval-centric Generative AI Tool Ng Youyang, Miyashita Daisuke, Hoshi Yasuto, Morioka Yasuhiro, Torii Osamu, Kodama Tomoya, Deguchi Jun
- Code-style In-context Learning For Knowledge-based Question Answering Nie Zhijie, Zhang Richong, Wang Zhongyuan, Liu Xudong
- Codegen2: Lessons For Training Llms On Programming And Natural Languages Nijkamp Erik, Hayashi Hiroaki, Xiong Caiming, Savarese Silvio, Zhou Yingbo
- Evaluating GPT-3.5 And GPT-4 Models On Brazilian University Admission Exams Nunes Desnes, Primi Ricardo, Pires Ramon, Lotufo Roberto, Nogueira Rodrigo
- Blackvip: Black-box Visual Prompting For Robust Transfer Learning Oh Changdae, Hwang Hyeji, Lee Hee-young, Lim Yongtaek, Jung Geunyoung, Jung Jiyoung, Choi Hosik, Song Kyungwoo
- Askit: Unified Programming Interface For Programming With Large Language Models Okuda Katsumi, Amarasinghe Saman
- LINC: A Neurosymbolic Approach For Logical Reasoning By Combining Language Models With First-order Logic Provers Olausson Theo X., Gu Alex, Lipkin Benjamin, Zhang Cedegao E., Solar-lezama Armando, Tenenbaum Joshua B., Levy Roger
- Autoplan: Automatic Planning Of Interactive Decision-making Tasks With Large Language Models Ouyang Siqi, Li Lei
- Structured Chemistry Reasoning With Large Language Models Ouyang Siru, Zhang Zhuosheng, Yan Bing, Liu Xuan, Choi Yejin, Han Jiawei, Qin Lianhui
- Eq-bench: An Emotional Intelligence Benchmark For Large Language Models Paech Samuel J.
- Towards Vision-language Mechanistic Interpretability: A Causal Tracing Tool For BLIP Palit Vedant, Pandey Rohan, Arora Aryaman, Liang Paul Pu
- Stelocoder: A Decoder-only LLM For Multi-language To Python Code Translation Pan Jialing, Sadé Adrien, Kim Jin, Soriano Eric, Sole Guillem, Flamant Sylvain
- Self-supervised Meta-prompt Learning With Meta-gradient Regularization For Few-shot Generalization Pan Kaihang, Li Juncheng, Song Hongye, Lin Jun, Liu Xiaozhong, Tang Siliang
- Logic-lm: Empowering Large Language Models With Symbolic Solvers For Faithful Logical Reasoning Pan Liangming, Albalak Alon, Wang Xinyi, Wang William Yang
- Tokenize Anything Via Prompting Pan Ting, Tang Lulu, Wang Xinlong, Shan Shiguang
- Kosmos-g: Generating Images In Context With Multimodal Large Language Models Pan Xichen, Dong Li, Huang Shaohan, Peng Zhiliang, Chen Wenhu, Wei Furu
- Frozen Transformers In Language Models Are Effective Visual Encoder Layers Pang Ziqi, Xie Ziyang, Man Yunze, Wang Yu-xiong
- On Measuring Faithfulness Or Self-consistency Of Natural Language Explanations Parcalabescu Letitia, Frank Anette
- Longbox: Evaluating Transformers On Long-sequence Clinical Tasks Parmar Mihir, Naik Aakanksha, Gupta Himanshu, Agrawal Disha, Baral Chitta
- Gorilla: Large Language Model Connected With Massive Apis Patil Shishir G., Zhang Tianjun, Wang Xin, Gonzalez Joseph E.
- Yarn: Efficient Context Window Extension Of Large Language Models Peng Bowen, Quesnelle Jeffrey, Fan Honglu, Shippole Enrico
- FP8-LM: Training FP8 Large Language Models Peng Houwen, Wu Kan, Wei Yixuan, Zhao Guoshuai, Yang Yuxiang, Liu Ze, Xiong Yifan, Yang Ziyue, Ni Bolin, Hu Jingcheng, Li Ruihang, Zhang Miaosen, Li Chen, Ning Jia, Wang Ruizhe, Zhang Zheng, Liu Shuguang, Chau Joe, Hu Han, Cheng Peng
- LLM Self Defense: By Self Examination, Llms Know They Are Being Tricked Phute Mansi, Helbling Alec, Hull Matthew, Peng Shengyun, Szyller Sebastian, Cornelius Cory, Chau Duen Horng
- Jatmo: Prompt Injection Defense By Task-specific Finetuning Piet Julien, Alrashed Maha, Sitawarin Chawin, Chen Sizhe, Wei Zeming, Sun Elizabeth, Alomair Basel, Wagner David
- Evaluating GPT-4's Vision Capabilities On Brazilian University Admission Exams Pires Ramon, Almeida Thales Sales, Abonizio Hugo, Nogueira Rodrigo
- Egovlpv2: Egocentric Video-language Pre-training With Fusion In The Backbone Pramanick Shraman, Song Yale, Nag Sayan, Lin Kevin Qinghong, Shah Hardik, Shou Mike Zheng, Chellappa Rama, Zhang Pengchuan
- Epik-eval: Evaluation For Language Models As Epistemic Models Prato Gabriele, Huang Jerry, Parthasarathi Prasanna, Sodhani Shagun, Chandar Sarath
- Experiential Co-learning Of Software-developing Agents Qian Chen, Dang Yufan, Li Jiahao, Liu Wei, Xie Zihao, Wang Yifei, Chen Weize, Yang Cheng, Cong Xin, Che Xiaoyin, Liu Zhiyuan, Sun Maosong
- Taskweaver: A Code-first Agent Framework Qiao Bo, Li Liqun, Zhang Xu, He Shilin, Kang Yu, Zhang Chaoyun, Yang Fangkai, Dong Hang, Zhang Jue, Wang Lu, Ma Minghua, Zhao Pu, Qin Si, Qin Xiaoting, Du Chao, Xu Yong, Lin Qingwei, Rajmohan Saravan, Zhang Dongmei
- Gameeval: Evaluating Llms On Conversational Games Qiao Dan, Wu Chenfei, Liang Yaobo, Li Juntao, Duan Nan
- OPERA: Alleviating Hallucination In Multi-modal Large Language Models Via Over-trust Penalty And Retrospection-allocation Qidong Huang, Xiaoyi Dong, Pan Zhang, Bin Wang, Conghui He, Jiaqi Wang, Dahua Lin, Weiming Zhang, Nenghai Yu
- Hallucidoctor: Mitigating Hallucinatory Toxicity In Visual Instruction Data Qifan Yu, Juncheng Li, Longhui Wei, Liang Pang, Wentao Ye, Bosheng Qin, Siliang Tang, Qi Tian, Yueting Zhuang
- How Good Is Google Bard's Visual Understanding? An Empirical Study On Open Challenges Qin Haotong, Ji Ge-peng, Khan Salman, Fan Deng-ping, Khan Fahad Shahbaz, Van Gool Luc
- Transnormerllm: A Faster And Better Large Language Model With Improved Transnormer Qin Zhen, Li Dong, Sun Weigao, Sun Weixuan, Shen Xuyang, Han Xiaodong, Wei Yunshen, Lv Baohong, Luo Xiao, Qiao Yu, Zhong Yiran
- Mplug-owl: Modularization Empowers Large Language Models With Multimodality Qinghao Ye, Haiyang Xu, Guohai Xu, Jiabo Ye, Ming Yan, Yiyang Zhou, Junyang Wang, Anwen Hu, Pengcheng Shi, Yaya Shi, Chenliang Li, Yuanhong Xu, Hehong Chen, Junfeng Tian, Qi Qian, Ji Zhang, Fei Huang, Jingren Zhou
- Adalora: Adaptive Budget Allocation For Parameter-efficient Fine-tuning Qingru Zhang, Minshuo Chen, Alexander Bukharin, Nikos Karampatziakis, Pengcheng He, Yu Cheng, Weizhu Chen, Tuo Zhao
- Codegeex: A Pre-trained Model For Code Generation With Multilingual Benchmarking On Humaneval-x Qinkai Zheng, Xiao Xia, Xu Zou, Yuxiao Dong, Shan Wang, Yufei Xue, Zihan Wang, Lei Shen, Andi Wang, Yang Li, Teng Su, Zhilin Yang, Jie Tang
- Latent Jailbreak: A Benchmark For Evaluating Text Safety And Output Robustness Of Large Language Models Qiu Huachuan, Zhang Shuai, Li Anqi, He Hongliang, Lan Zhenzhong
- Are Large Language Models Temporally Grounded? Qiu Yifu, Zhao Zheng, Ziser Yftah, Korhonen Anna, Ponti Edoardo M., Cohen Shay B.
- Unlocking Emergent Modularity In Large Language Models Qiu Zihan, Huang Zeyu, Fu Jie
- Layoutllm-t2i: Eliciting Layout Guidance From LLM For Text-to-image Generation Qu Leigang, Wu Shengqiong, Fei Hao, Nie Liqiang, Chua Tat-seng
- Coedit: Text Editing By Task-specific Instruction Tuning Raheja Vipul, Kumar Dhruv, Koo Ryan, Kang Dongyeop
- On Context Utilization In Summarization With Large Language Models Ravaut Mathieu, Sun Aixin, Chen Nancy F., Joty Shafiq
- Towards A Performance Analysis On Pre-trained Visual Question Answering Models For Autonomous Driving Rekanar Kaavya, Eising Ciarán, Sistu Ganesh, Hayes Martin
- Investigating The Factual Knowledge Boundary Of Large Language Models With Retrieval Augmentation Ren Ruiyang, Wang Yuhao, Qu Yingqi, Zhao Wayne Xin, Liu Jing, Tian Hao, Wu Hua, Wen Ji-rong, Wang Haifeng
- Context Compression For Auto-regressive Transformers With Sentinel Tokens Ren Siyu, Jia Qi, Zhu Kenny Q.
- Prompt, Generate, Then Cache: Cascade Of Foundation Models Makes Strong Few-shot Learners Renrui Zhang, Xiangfei Hu, Bohao Li, Siyuan Huang, Hanqiu Deng, Hongsheng Li, Yu Qiao, Peng Gao
- I'm Afraid I Can't Do That: Predicting Prompt Refusal In Black-box Generative Language Models Reuter Max, Schulze William
- Negated Complementary Commonsense Using Large Language Models Rezaei Navid, Reformat Marek Z.
- Smoothllm: Defending Large Language Models Against Jailbreaking Attacks Robey Alexander, Wong Eric, Hassani Hamed, Pappas George J.
- Audiogpt: Understanding And Generating Speech, Music, Sound, And Talking Head Rongjie Huang, Mingze Li, Dongchao Yang, Jiatong Shi, Xuankai Chang, Zhenhui Ye, Yuning Wu, Zhiqing Hong, Jiawei Huang, Jinglin Liu, Yi Ren, Zhou Zhao, Shinji Watanabe
- Axiomatic Preference Modeling For Longform Question Answering Rosset Corby, Zheng Guoqing, Dibia Victor, Awadallah Ahmed, Bennett Paul
- Gpt4tools: Teaching Large Language Model To Use Tools Via Self-instruction Rui Yang, Lin Song, Yanwei Li, Sijie Zhao, Yixiao Ge, Xiu Li, Ying Shan
- Attention-likelihood Relationship In Transformers Ruscio Valeria, Maiorca Valentino, Silvestri Fabrizio
- Promptmix: A Class Boundary Augmentation Method For Large Language Model Distillation Sahu Gaurav, Vechtomova Olga, Bahdanau Dzmitry, Laradji Issam H.
- Zero-shot Translation Of Attention Patterns In VQA Models To Natural Language Salewski Leonard, Koepke A. Sophia, Lensch Hendrik P. A., Akata Zeynep
- Uni-nlx: Unifying Textual Explanations For Vision And Vision-language Tasks Sammani Fawaz, Deligiannis Nikos
- Camoscio: An Italian Instruction-tuned Llama Santilli Andrea, Rodolà Emanuele
- Open-ended Instructable Embodied Agents With Memory-augmented Large Language Models Sarch Gabriel, Wu Yue, Tarr Michael J., Fragkiadaki Katerina
- Probing Conceptual Understanding Of Large Visual-language Models Schiappa Madeline, Abdullah Raiyaan, Azad Shehreen, Claypoole Jared, Cogswell Michael, Divakaran Ajay, Rawat Yogesh
- PULSAR At Mediqa-sum 2023: Large Language Models Augmented By Synthetic Dialogue Convert Patient Dialogues To Medical Records Schlegel Viktor, Li Hao, Wu Yuping, Subramanian Anand, Nguyen Thanh-tung, Kashyap Abhinav Ramesh, Beck Daniel, Zeng Xiaojun, Batista-navarro Riza Theresa, Winkler Stefan, Nenadic Goran
- Jais And Jais-chat: Arabic-centric Foundation And Instruction-tuned Open Generative Large Language Models Sengupta Neha, Sahu Sunil Kumar, Jia Bokang, Katipomu Satheesh, Li Haonan, Koto Fajri, Marshall William, Gosal Gurpreet, Liu Cynthia, Chen Zhiming, Afzal Osama Mohammed, Kamboj Samta, Pandit Onkar, Pal Rahul, Pradhan Lalit, Mujahid Zain Muhammad, Baali Massa, Han Xudong, Bsharat Sondos Mahmoud, Aji Alham Fikri, Shen Zhiqiang, Liu Zhengzhong, Vassilieva Natalia, Hestness Joel, Hock Andy, Feldman Andrew, Lee Jonathan, Jackson Andrew, Ren Hector Xuguang, Nakov Preslav, Baldwin Timothy, Xing Eric
- PB-LLM: Partially Binarized Large Language Models Shang Yuzhang, Yuan Zhihang, Wu Qiang, Dong Zhen
- Omniquant: Omnidirectionally Calibrated Quantization For Large Language Models Shao Wenqi, Chen Mengzhao, Zhang Zhaoyang, Xu Peng, Zhao Lirui, Li Zhiqian, Zhang Kaipeng, Gao Peng, Qiao Yu, Luo Ping
- Tinylvlm-ehub: Towards Comprehensive And Efficient Evaluation For Large Vision-language Models Shao Wenqi, Lei Meng, Hu Yutao, Gao Peng, Zhang Kaipeng, Meng Fanqing, Xu Peng, Huang Siyuan, Li Hongsheng, Qiao Yu, Luo Ping
- Human Choice Prediction In Language-based Persuasion Games: Simulation-based Off-policy Evaluation Shapira Eilam, Apel Reut, Tennenholtz Moshe, Reichart Roi
- The Flan Collection: Designing Data And Methods For Effective Instruction Tuning Shayne Longpre, Le Hou, Tu Vu, Albert Webson, Hyung Won Chung, Yi Tay, Denny Zhou, Quoc V. Le, Barret Zoph, Jason Wei, Adam Roberts
- Efficient LLM Inference On Cpus Shen Haihao, Chang Hanwen, Dong Bo, Luo Yu, Meng Hengyu
- Retrieval-generation Alignment For End-to-end Task-oriented Dialogue System Shen Weizhou, Gao Yingqi, Huang Canbin, Wan Fanqi, Quan Xiaojun, Bi Wei
- Can Language Agents Be Alternatives To PPO? A Preliminary Empirical Study On Openai Gym Sheng Junjie, Huang Zixiao, Shen Chuyun, Li Wenhao, Hua Yun, Jin Bo, Zha Hongyuan, Wang Xiangfeng
- Flexgen: High-throughput Generative Inference Of Large Language Models With A Single GPU Sheng Ying, Zheng Lianmin, Yuan Binhang, Li Zhuohan, Ryabinin Max, Fu Daniel Y., Xie Zhiqiang, Chen Beidi, Barrett Clark, Gonzalez Joseph E., Liang Percy, Ré Christopher, Stoica Ion, Zhang Ce
- TOAST: Transfer Learning Via Attention Steering Shi Baifeng, Gai Siyu, Darrell Trevor, Wang Xin
- Specialist Or Generalist? Instruction Tuning For Specific NLP Tasks Shi Chufan, Su Yixuan, Yang Cheng, Yang Yujiu, Cai Deng
- Crossget: Cross-guided Ensemble Of Tokens For Accelerating Vision-language Transformers Shi Dachuan, Tao Chaofan, Rao Anyi, Yang Zhendong, Yuan Chun, Wang Jiaqi
- Sotana: The Open-source Software Development Assistant Shi Ensheng, Zhang Fengji, Wang Yanlin, Chen Bei, Du Lun, Zhang Hongyu, Han Shi, Zhang Dongmei, Sun Hongbin
- Prompt Space Optimizing Few-shot Reasoning Success With Large Language Models Shi Fobo, Qing Peijun, Yang Dong, Wang Nan, Lei Youbo, Lu Haonan, Lin Xiaodong, Li Duantengchuan
- QADYNAMICS: Training Dynamics-driven Synthetic QA Diagnostic For Zero-shot Commonsense Question Answering Shi Haochen, Wang Weiqi, Fang Tianqing, Xu Baixuan, Ding Wenxuan, Liu Xin, Song Yangqiu
- Preliminary Study On Incremental Learning For Large Language Model-based Recommender Systems Shi Tianhao, Zhang Yang, Xu Zhijian, Chen Chong, Feng Fuli, He Xiangnan, Tian Qi
- Active Prompting With Chain-of-thought For Large Language Models Shizhe Diao, Pengcheng Wang, Yong Lin, Rui Pan, Xiang Liu, Tong Zhang
- Rewritelm: An Instruction-tuned Large Language Model For Text Rewriting Shu Lei, Luo Liangchen, Hoskere Jayakumar, Zhu Yun, Liu Yinxiao, Tong Simon, Chen Jindong, Meng Lei
- A Comprehensive Evaluation Of Large Language Models On Legal Judgment Prediction Shui Ruihao, Cao Yixin, Wang Xiang, Chua Tat-seng
- A Survey On Multimodal Large Language Models Shukang Yin, Chaoyou Fu, Sirui Zhao, Ke Li, Xing Sun, Tong Xu, Enhong Chen
- Woodpecker: Hallucination Correction For Multimodal Large Language Models Shukang Yin, Chaoyou Fu, Sirui Zhao, Tong Xu, Hao Wang, Dianbo Sui, Yunhang Shen, Ke Li, Xing Sun, Enhong Chen
- Ep-alm: Efficient Perceptual Augmentation Of Language Models Shukor Mustafa, Dancette Corentin, Cord Matthieu
- Beyond Task Performance: Evaluating And Reducing The Flaws Of Large Multimodal Models With In-context Learning Shukor Mustafa, Rame Alexandre, Dancette Corentin, Cord Matthieu
- Automatic Prompt Augmentation And Selection With Chain-of-thought From Labeled Data Shum Kashun, Diao Shizhe, Zhang Tong
- Tree Of Thoughts: Deliberate Problem Solving With Large Language Models Shunyu Yao, Dian Yu, Jeffrey Zhao, Izhak Shafran, Thomas L. Griffiths, Yuan Cao, Karthik Narasimhan
- Combo Of Thinking And Observing For Outside-knowledge VQA Si Qingyi, Mo Yuchen, Lin Zheng, Ji Huishan, Wang Weiping
- An Empirical Study Of Instruction-tuning Large Language Models In Chinese Si Qingyi, Wang Tong, Lin Zheng, Zhang Xu, Cao Yanan, Wang Weiping
- Gopro: Generate And Optimize Prompts In CLIP Using Self-supervised Learning Singha Mainak, Jha Ankit, Banerjee Biplab
- Applenet: Visual Attention Parameterized Prompt Learning For Few-shot Remote Sensing Image Generalization Using CLIP Singha Mainak, Jha Ankit, Solanki Bhupendra, Bose Shirsha, Banerjee Biplab
- Step: Stacked LLM Policies For Web Actions Sodhi Paloma, Branavan S. R. K., Artzi Yoav, Mcdonald Ryan
- Conpet: Continual Parameter-efficient Tuning For Large Language Models Song Chenyang, Han Xu, Zeng Zheni, Li Kuai, Chen Chen, Liu Zhiyuan, Sun Maosong, Yang Tao
- Pneumollm: Harnessing The Power Of Large Language Model For Pneumoconiosis Diagnosis Song Meiyue, Yu Zhihua, Wang Jiaxin, Wang Jiarui, Lu Yuting, Li Baicun, Wang Xiaoxu, Huang Qinghua, Li Zhijun, Kanellakis Nikolaos I., Liu Jiangfeng, Wang Jing, Wang Binglu, Yang Juntao
- Honeybee: Progressive Instruction Finetuning Of Large Language Models For Materials Science Song Yu, Miret Santiago, Zhang Huan, Liu Bang
- Code Soliloquies For Accurate Calculations In Large Language Models Sonkar Shashank, Le Myco, Chen Xinghe, Liu Naiming, Mallick Debshila Basu, Baraniuk Richard G.
- Pythia: A Suite For Analyzing Large Language Models Across Training And Scaling Stella Biderman, Hailey Schoelkopf, Quentin Anthony, Herbie Bradley, Kyle O'brien, Eric Hallahan, Mohammad Aflah Khan, Shivanshu Purohit, Usvsn Sai Prashanth, Edward Raff, Aviya Skowron, Lintang Sutawika, Oskar Van Der Wal
- Emergent And Predictable Memorization In Large Language Models Stella Biderman, Usvsn Sai Prashanth, Lintang Sutawika, Hailey Schoelkopf, Quentin Anthony, Shivanshu Purohit, Edward Raff
- Does Fine-tuning GPT-3 With The Openai API Leak Personally-identifiable Information? Sun Albert Yu, Zemour Eliott, Saxena Arushi, Vaidyanathan Udith, Lin Eric, Lau Christian, Mugunthan Vaikkunth
- Create Your World: Lifelong Text-to-image Diffusion Sun Gan, Liang Wenqi, Dong Jiahua, Li Jun, Ding Zhengming, Cong Yang
- Fine-grained Audio-visual Joint Representations For Multimodal Large Language Models Sun Guangzhi, Yu Wenyi, Tang Changli, Chen Xianzhao, Tan Tian, Li Wei, Lu Lu, Ma Zejun, Zhang Chao
- A Simple And Effective Pruning Approach For Large Language Models Sun Mingjie, Liu Zhuang, Bair Anna, Kolter J. Zico
- Make Prompt-based Black-box Tuning Colorful: Boosting Model Generalization From Three Orthogonal Perspectives Sun Qiushi, Han Chengcheng, Chen Nuo, Zhu Renyu, Gong Jingyang, Li Xiang, Gao Ming
- EVA-CLIP: Improved Training Techniques For CLIP At Scale Sun Quan, Fang Yuxin, Wu Ledell, Wang Xinlong, Cao Yue
- Expnote: Black-box Large Language Models Are Better Task Solvers With Experience Notebook Sun Wangtao, Yu Xuanqing, He Shizhu, Zhao Jun, Liu Kang
- Retentive Network: A Successor To Transformer For Large Language Models Sun Yutao, Dong Li, Huang Shaohan, Ma Shuming, Xia Yuqing, Xue Jilong, Wang Jianyong, Wei Furu
- Uncovering Chatgpt's Capabilities In Recommender Systems Sunhao Dai, Ninglu Shao, Haiyuan Zhao, Weijie Yu, Zihua Si, Chen Xu, Zhongxiang Sun, Xiao Zhang, Jun Xu
- Glitchbench: Can Large Multimodal Models Detect Video Game Glitches? Taesiri Mohammad Reza, Feng Tianjun, Nguyen Anh, Bezemer Cor-paul
- Link-context Learning For Multimodal Llms Tai Yan, Fan Weichen, Zhang Zhao, Zhu Feng, Zhao Rui, Liu Ziwei
- Massive Editing For Large Language Models Via Meta Learning Tan Chenmien, Zhang Ge, Fu Jie
- Compound Text-guided Prompt Tuning Via Image-adaptive Cues Tan Hao, Li Jun, Zhou Yizhuang, Wan Jun, Lei Zhen, Zhang Xiangyu
- Towards Robust Temporal Reasoning Of Large Language Models Via A Multi-hop QA Dataset And Pseudo-instruction Tuning Tan Qingyu, Ng Hwee Tou, Bing Lidong
- Medchatzh: A Better Medical Adviser Learns From Better Instructions Tan Yang, Li Mingchen, Huang Zijie, Yu Huiqun, Fan Guisheng
- Can Chatgpt Replace Traditional KBQA Models? An In-depth Analysis Of The Question Answering Performance Of The GPT LLM Family Tan Yiming, Min Dehai, Li Yu, Li Wenbo, Hu Nan, Chen Yongrui, Qi Guilin
- Learning To Imagine: Visually-augmented Natural Language Generation Tang Tianyi, Chen Yushuo, Du Yifan, Li Junyi, Zhao Wayne Xin, Wen Ji-rong
- Ml-bench: Evaluating Large Language Models And Agents For Machine Learning Tasks On Repository-level Code Tang Xiangru, Liu Yuliang, Cai Zefan, Shao Yanjun, Lu Junjie, Zhang Yichi, Deng Zexuan, Hu Helan, An Kaikai, Huang Ruijun, Si Shuzheng, Chen Sheng, Zhao Haozhe, Chen Liang, Wang Yan, Liu Tianyu, Jiang Zhiwei, Chang Baobao, Fang Yin, Qin Yujia, Zhou Wangchunshu, Zhao Yilun, Cohan Arman, Gerstein Mark
- Struc-bench: Are Large Language Models Really Good At Generating Complex Structured Data? Tang Xiangru, Zong Yiming, Phang Jason, Zhao Yilun, Zhou Wangchunshu, Cohan Arman, Gerstein Mark
- Medagents: Large Language Models As Collaborators For Zero-shot Medical Reasoning Tang Xiangru, Zou Anni, Zhang Zhuosheng, Li Ziming, Zhao Yilun, Zhang Xingyao, Cohan Arman, Gerstein Mark
- Caption Anything: Interactive Image Description With Diverse Multimodal Controls Teng Wang, Jinrui Zhang, Junjie Fei, Hao Zheng, Yunlong Tang, Zhe Li, Mingqi Gao, Shanshan Zhao
- Leveraging Llms For Synthesizing Training Data Across Many Languages In Multilingual Dense Retrieval Thakur Nandan, Ni Jianmo, Ábrego Gustavo Hernández, Wieting John, Lin Jimmy, Cer Daniel
- Graph Neural Prompting With Large Language Models Tian Yijun, Song Huan, Wang Zichen, Wang Haozhu, Hu Ziqing, Wang Fang, Chawla Nitesh V., Xu Panpan
- Evil Geniuses: Delving Into The Safety Of Llm-based Agents Tian Yu, Yang Xiao, Zhang Jingyuan, Dong Yinpeng, Su Hang
- Chimed-gpt: A Chinese Medical Large Language Model With Full Training Regime And Better Alignment To Human Preferences Tian Yuanhe, Gan Ruyi, Song Yan, Zhang Jiaxing, Zhang Yongdong
- RLHF-V: Towards Trustworthy Mllms Via Behavior Alignment From Fine-grained Correctional Human Feedback Tianyu Yu, Yuan Yao, Haoye Zhang, Taiwen He, Yifeng Han, Ganqu Cui, Jinyi Hu, Zhiyuan Liu, Hai-tao Zheng, Maosong Sun, Tat-seng Chua
- Do Llms Exhibit Human-like Response Biases? A Case Study In Survey Design Tjuatja Lindia, Chen Valerie, Wu Sherry Tongshuang, Talwalkar Ameet, Neubig Graham
- Function Vectors In Large Language Models Todd Eric, Li Millicent L., Sharma Arnab Sen, Mueller Aaron, Wallace Byron C., Bau David
- Videberta: A Powerful Pre-trained Language Model For Vietnamese Tran Cong Dao, Pham Nhut Huy, Nguyen Anh, Hy Truong Son, Vu Tu
- Viclevr: A Visual Reasoning Dataset And Hybrid Multimodal Fusion Model For Visual Question Answering In Vietnamese Tran Khiem Vinh, Phan Hao Phu, Van Nguyen Kiet, Nguyen Ngan Luu Thuy
- How Many Unicorns Are In This Image? A Safety Evaluation Benchmark For Vision Llms Tu Haoqin, Cui Chenhang, Wang Zijun, Zhou Yiyang, Zhao Bingchen, Han Junlin, Zhou Wangchunshu, Yao Huaxiu, Xie Cihang
- Taco: Enhancing Cross-lingual Transfer For Low-resource Languages In Llms Through Translation-assisted Chain-of-thought Processes Upadhayay Bibek, Behzadan Vahid
- Can NLP Models Correctly Reason Over Contexts That Break The Common Assumptions? Varshney Neeraj, Parmar Mihir, Patel Nisarg, Handa Divij, Sarkar Sayantan, Luo Man, Baral Chitta
- Gpt-who: An Information Density-based Machine-generated Text Detector Venkatraman Saranya, Uchendu Adaku, Lee Dongwon
- Fairpy: A Toolkit For Evaluation Of Social Biases And Their Mitigation In Large Language Models Viswanath Hrishikesh, Zhang Tianyi
- Prompt2model: Generating Deployable Models From Natural Language Instructions Viswanathan Vijay, Zhao Chenyang, Bertsch Amanda, Wu Tongshuang, Neubig Graham
- Empirical Evaluation Of Uncertainty Quantification In Retrieval-augmented Language Models For Science Wagle Sridevi, Munikoti Sai, Acharya Anurag, Smith Sara, Horawalavithana Sameera
- Efficient Large Language Models: A Survey Wan Zhongwei, Wang Xin, Liu Che, Alam Samiul, Zheng Yu, Liu Jiachen, Qu Zhongnan, Yan Shen, Zhu Yi, Zhang Quanlu, Chowdhury Mosharaf, Zhang Mi
- Shall We Pretrain Autoregressive Language Models With Retrieval? A Comprehensive Study Wang Boxin, Ping Wei, Xu Peng, Mcafee Lawrence, Liu Zihan, Shoeybi Mohammad, Dong Yi, Kuchaiev Oleksii, Li Bo, Xiao Chaowei, Anandkumar Anima, Catanzaro Bryan
- Robust Natural Language Understanding With Residual Attention Debiasing Wang Fei, Huang James Y., Yan Tianyi, Zhou Wenxuan, Chen Muhao
- Openchat: Advancing Open-source Language Models With Mixed-quality Data Wang Guan, Cheng Sijie, Zhan Xianyuan, Li Xiangang, Song Sen, Liu Yang
- Cross-lingual Knowledge Editing In Large Language Models Wang Jiaan, Liang Yunlong, Sun Zengkui, Cao Yuxuan, Xu Jiarong, Meng Fandong
- Missrec: Pre-training And Transferring Multi-modal Interest-aware Sequence Representation For Recommendation Wang Jinpeng, Zeng Ziyun, Wang Yunxiao, Wang Yuting, Lu Xingyu, Li Tianxiang, Yuan Jun, Zhang Rui, Zheng Hai-tao, Xia Shu-tao
- Improved Visual Fine-tuning With Natural Language Supervision Wang Junyang, Xu Yuanhong, Hu Juhua, Yan Ming, Sang Jitao, Qian Qi
- Mathcoder: Seamless Code Integration In Llms For Enhanced Mathematical Reasoning Wang Ke, Ren Houxing, Zhou Aojun, Lu Zimu, Luo Sichun, Shi Weikang, Zhang Renrui, Song Linqi, Zhan Mingjie, Li Hongsheng
- Document-level Machine Translation With Large Language Models Wang Longyue, Lyu Chenyang, Ji Tianbo, Zhang Zhirui, Yu Dian, Shi Shuming, Tu Zhaopeng
- Aurora: Activating Chinese Chat Capability For Mixtral-8x7b Sparse Mixture-of-experts Through Instruction-tuning Wang Rongsheng, Chen Haoming, Zhou Ruizhe, Duan Yaofei, Cai Kunyan, Ma Han, Cui Jiaxi, Li Jian, Pang Patrick Cheong-iao, Wang Yapeng, Tan Tao
- Bridging The Novice-expert Gap Via Models Of Decision-making: A Case Study On Remediating Math Mistakes Wang Rose E., Zhang Qingyang, Robinson Carly, Loeb Susanna, Demszky Dorottya
- Compositional Text-to-image Synthesis With Attention Map Control Of Diffusion Models Wang Ruichen, Chen Zekang, Chen Chen, Ma Jian, Lu Haonan, Lin Xiaodong
- Vamos: Versatile Action Models For Video Understanding Wang Shijie, Zhao Qi, Do Minh Quan, Agarwal Nakul, Lee Kwonjoon, Sun Chen
- CAR: Conceptualization-augmented Reasoner For Zero-shot Commonsense Question Answering Wang Weiqi, Fang Tianqing, Ding Wenxuan, Xu Baixuan, Liu Xin, Song Yangqiu, Bosselut Antoine
- All Languages Matter: On The Multilingual Safety Of Large Language Models Wang Wenxuan, Tu Zhaopeng, Chen Chang, Yuan Youliang, Huang Jen-tse, Jiao Wenxiang, Lyu Michael R.
- Instructta: Instruction-tuned Targeted Attack For Large Vision-language Models Wang Xunguang, Ji Zhenlan, Ma Pingchuan, Li Zongjie, Wang Shuai
- NEWTON: Are Large Language Models Capable Of Physical Reasoning? Wang Yi Ru, Duan Jiafei, Fox Dieter, Srinivasa Siddhartha
- Pandalm: An Automatic Evaluation Benchmark For LLM Instruction Tuning Optimization Wang Yidong, Yu Zhuohao, Zeng Zhengran, Yang Linyi, Wang Cunxiang, Chen Hao, Jiang Chaoya, Xie Rui, Wang Jindong, Xie Xing, Ye Wei, Zhang Shikun, Zhang Yue
- Meta-reasoning: Semantics-symbol Deconstruction For Large Language Models Wang Yiming, Zhang Zhuosheng, Zhang Pei, Yang Baosong, Wang Rui
- Lifelongmemory: Leveraging Llms For Answering Queries In Long-form Egocentric Videos Wang Ying, Yang Yanlai, Ren Mengye
- Primacy Effect Of Chatgpt Wang Yiwei, Cai Yujun, Chen Muhao, Liang Yuxuan, Hooi Bryan
- Fake Alignment: Are Llms Really Aligned Well? Wang Yixu, Teng Yan, Huang Kexin, Lyu Chengqi, Zhang Songyang, Zhang Wenwei, Ma Xingjun, Jiang Yu-gang, Qiao Yu, Wang Yingchun
- How Far Can Camels Go? Exploring The State Of Instruction Tuning On Open Resources Wang Yizhong, Ivison Hamish, Dasigi Pradeep, Hessel Jack, Khot Tushar, Chandu Khyathi Raghavi, Wadden David, Macmillan Kelsey, Smith Noah A., Beltagy Iz, Hajishirzi Hannaneh
- Learning Hierarchical Prompt With Structured Linguistic Knowledge For Vision-language Models Wang Yubin, Jiang Xinyang, Cheng De, Li Dongsheng, Zhao Cairong
- Smarttrim: Adaptive Tokens And Attention Pruning For Efficient Vision-language Models Wang Zekun, Chen Jingchang, Zhou Wangchunshu, Zhu Haichao, Liang Jiafeng, Shan Liping, Liu Ming, Xu Dongliang, Yang Qing, Qin Bing
- Llmrec: Large Language Models With Graph Augmentation For Recommendation Wei Wei, Xubin Ren, Jiabin Tang, Qinyong Wang, Lixin Su, Suqi Cheng, Junfeng Wang, Dawei Yin, Chao Huang
- Autodroid: Llm-powered Task Automation In Android Wen Hao, Li Yuanchun, Liu Guohong, Zhao Shanhui, Yu Tao, Li Toby Jia-jun, Jiang Shiqi, Liu Yunhao, Zhang Yaqin, Liu Yunxin
- Unveiling The Implicit Toxicity In Large Language Models Wen Jiaxin, Ke Pei, Sun Hao, Zhang Zhexin, Li Chengfei, Bai Jinfeng, Huang Minlie
- Mindmap: Knowledge Graph Prompting Sparks Graph Of Thoughts In Large Language Models Wen Yilin, Wang Zifeng, Sun Jimeng
- Instructblip: Towards General-purpose Vision-language Models With Instruction Tuning Wenliang Dai, Junnan Li, Dongxu Li, Anthony Meng Huat Tiong, Junqi Zhao, Weisheng Wang, Boyang Li, Pascale Fung, Steven Hoi
- Is Chatgpt A Good Translator? Yes With GPT-4 As The Engine Wenxiang Jiao, Wenxuan Wang, Jen-tse Huang, Xing Wang, Shuming Shi, Zhaopeng Tu
- Exploring Parameter-efficient Fine-tuning Techniques For Code Generation With Large Language Models Weyssow Martin, Zhou Xin, Kim Kisub, Lo David, Sahraoui Houari
- Pmc-llama: Towards Building Open-source Language Models For Medicine Wu Chaoyi, Lin Weixiong, Zhang Xiaoman, Zhang Ya, Wang Yanfeng, Xie Weidi
- Language Prompt For Autonomous Driving Wu Dongming, Han Wencheng, Wang Tiancai, Liu Yingfei, Zhang Xiangyu, Shen Jianbing
- Speechgen: Unlocking The Generative Power Of Speech Language Models With Prompts Wu Haibin, Chang Kai-wei, Wu Yuan-kuei, Lee Hung-yi
- Next-gpt: Any-to-any Multimodal LLM Wu Shengqiong, Fei Hao, Qu Leigang, Ji Wei, Chua Tat-seng
- From Language Modeling To Instruction Following: Understanding The Behavior Shift In Llms After Instruction Tuning Wu Xuansheng, Yao Wenlin, Chen Jianshu, Pan Xiaoman, Wang Xiaoyang, Liu Ninghao, Yu Dong
- TLM: Token-level Masking For Transformers Wu Yangjun, Fang Kebin, Zhang Dongxiang, Wang Han, Zhang Hao, Chen Gang
- Fine-grained Human Feedback Gives Better Rewards For Language Model Training Wu Zeqiu, Hu Yushi, Shi Weijia, Dziri Nouha, Suhr Alane, Ammanabrolu Prithviraj, Smith Noah A., Ostendorf Mari, Hajishirzi Hannaneh
- Openicl: An Open-source Framework For In-context Learning Wu Zhenyu, Wang Yaoxiang, Ye Jiacheng, Feng Jiangtao, Xu Jingjing, Qiao Yu, Wu Zhiyong
- A Simple Baseline For Knowledge-based Visual Question Answering Xenos Alexandros, Stafylakis Themos, Patras Ioannis, Tzimiropoulos Georgios
- Imagenetvc: Zero- And Few-shot Visual Commonsense Evaluation On 1000 Imagenet Categories Xia Heming, Dong Qingxiu, Li Lei, Xu Jingjing, Liu Tianyu, Qin Ziwei, Sui Zhifang
- Improving Question Generation With Multi-level Content Planning Xia Zehua, Gou Qi, Yu Bowen, Yu Haiyang, Huang Fei, Li Yongbin, Nguyen Cam-tu
- Efficient Streaming Language Models With Attention Sinks Xiao Guangxuan, Tian Yuandong, Chen Beidi, Han Song, Lewis Mike
- Can I Trust Your Answer? Visually Grounded Video Question Answering Xiao Junbin, Yao Angela, Li Yicong, Chua Tat Seng
- Lm-cocktail: Resilient Tuning Of Language Models Via Model Merging Xiao Shitao, Liu Zheng, Zhang Peitian, Xing Xingrun
- Personalized Abstractive Summarization By Tri-agent Generation Pipeline Xiao Wen, Xie Yujia, Carenini Giuseppe, He Pengcheng
- Rethinking The Evaluation For Conversational Recommendation In The Era Of Large Language Models Xiaolei Wang, Xinyu Tang, Wayne Xin Zhao, Jingyuan Wang, Ji-rong Wen
- Adaptive Chameleon Or Stubborn Sloth: Revealing The Behavior Of Large Language Models In Knowledge Conflicts Xie Jian, Zhang Kai, Chen Jiangjie, Lou Renze, Su Yu
- Text2reward: Reward Shaping With Language Models For Reinforcement Learning Xie Tianbao, Zhao Siheng, Wu Chen Henry, Liu Yitao, Luo Qian, Zhong Victor, Yang Yanchao, Yu Tao
- Olagpt: Empowering Llms With Human-like Problem-solving Abilities Xie Yuanzhen, Xie Tao, Lin Mingxiong, Wei Wentao, Li Chenglin, Kong Beibei, Chen Lei, Zhuo Chengxiang, Hu Bo, Li Zang
- E4srec: An Elegant Effective Efficient Extensible Solution Of Large Language Models For Sequential Recommendation Xinhang Li, Chong Chen, Xiangyu Zhao, Yong Zhang, Chunxiao Xing
- Llm-pruner: On The Structural Pruning Of Large Language Models Xinyin Ma, Gongfan Fang, Xinchao Wang
- Doctorglm: Fine-tuning Your Chinese Doctor Is Not A Herculean Task Xiong Honglin, Wang Sheng, Zhu Yitao, Zhao Zihao, Liu Yuxiao, Huang Linlin, Wang Qian, Shen Dinggang
- Examining Inter-consistency Of Large Language Models Collaboration: An In-depth Analysis Via Debate Xiong Kai, Ding Xiao, Cao Yixin, Liu Ting, Qin Bing
- Expertprompting: Instructing Large Language Models To Be Distinguished Experts Xu Benfeng, Yang An, Lin Junyang, Wang Quan, Zhou Chang, Zhang Yongdong, Mao Zhendong
- Baize: An Open-source Chat Model With Parameter-efficient Tuning On Self-chat Data Xu Canwen, Guo Daya, Duan Nan, Mcauley Julian
- Align On The Fly: Adapting Chatbot Behavior To Established Norms Xu Chunpu, Chern Steffi, Chern Ethan, Zhang Ge, Wang Zekun, Liu Ruibo, Li Jing, Fu Jie, Liu Pengfei
- Large Language Models For Generative Information Extraction: A Survey Xu Derong, Chen Wei, Peng Wenjun, Zhang Chao, Xu Tong, Zhao Xiangyu, Wu Xian, Zheng Yefeng, Wang Yang, Chen Enhong
- Recommender AI Agent: Integrating Large Language Models For Interactive Recommendations Xu Huang, Jianxun Lian, Yuxuan Lei, Jing Yao, Defu Lian, Xing Xie
- U-llava: Unifying Multi-modal Tasks Via Large Language Model Xu Jinjin, Xu Liwu, Yang Yuzhe, Li Xiang, Wang Fanyi, Xie Yanchun, Huang Yi-jie, Li Yaqian
- Knowledge-infused Prompting: Assessing And Advancing Clinical Text Data Generation With Large Language Models Xu Ran, Cui Hejie, Yu Yue, Kan Xuan, Shi Wenqi, Zhuang Yuchen, Jin Wei, Ho Joyce, Yang Carl
- Re-reading Improves Reasoning In Large Language Models Xu Xiaohan, Tao Chongyang, Shen Tao, Xu Can, Xu Hongbo, Long Guodong, Lou Jian-guang
- Pink: Unveiling The Power Of Referential Comprehension For Multi-modal Llms Xuan Shiyu, Guo Qingpei, Yang Ming, Zhang Shiliang
- Representation Learning With Large Language Models For Recommendation Xubin Ren, Wei Wei, Lianghao Xia, Lixin Su, Suqi Cheng, Junfeng Wang, Dawei Yin, Chao Huang
- Codescope: An Execution-based Multilingual Multitask Multidimensional Benchmark For Evaluating Llms On Code Understanding And Generation Yan Weixiang, Liu Haitian, Wang Yunkun, Li Yunzhe, Chen Qian, Wang Wen, Lin Tingyu, Zhao Weishan, Zhu Li, Sundaram Hari, Deng Shuiguang
- Multimodal Chatgpt For Medical Applications: An Experimental Study Of GPT-4V Yan Zhiling, Zhang Kai, Zhou Rong, He Lifang, Li Xiang, Sun Lichao
- Effective Distillation Of Table-based Reasoning Ability From Llms Yang Bohao, Tang Chen, Zhao Kun, Xiao Chenghao, Lin Chenghua
- LACMA: Language-aligning Contrastive Learning With Meta-actions For Embodied Instruction Following Yang Cheng-fu, Chen Yen-chun, Yang Jianwei, Dai Xiyang, Yuan Lu, Wang Yu-chiang Frank, Chang Kai-wei
- Can You Follow Me? Testing Situational Understanding In Chatgpt Yang Chenghao, Ettinger Allyson
- Refgpt: Dialogue Generation Of GPT, By GPT, And For GPT Yang Dongjie, Yuan Ruifeng, Fan Yuantao, Yang Yifei, Wang Zili, Wang Shusen, Zhao Hai
- Empower Large Language Model To Perform Better On Industrial Domain-specific Question Answering Yang Fangkai, Zhao Pu, Wang Zezhong, Wang Lu, Zhang Jue, Garg Mohit, Lin Qingwei, Rajmohan Saravan, Zhang Dongmei
- Longqlora: Efficient And Effective Method To Extend Context Length Of Large Language Models Yang Jianxin
- Unified Language-vision Pretraining In LLM With Dynamic Discrete Visual Tokenization Yang Jin, Kun Xu, Kun Xu, Liwei Chen, Chao Liao, Jianchao Tan, Quzhe Huang, Bin Chen, Chenyi Lei, An Liu, Chengru Song, Xiaoqiang Lei, Di Zhang, Wenwu Ou, Kun Gai, Yadong Mu
- Supervised Knowledge Makes Large Language Models Better In-context Learners Yang Linyi, Zhang Shuibai, Yu Zhuohao, Bao Guangsheng, Wang Yidong, Wang Jindong, Xu Ruochen, Ye Wei, Xie Xing, Chen Weizhu, Zhang Yue
- Lidar-llm: Exploring The Potential Of Large Language Models For 3D Lidar Understanding Yang Senqiao, Liu Jiaming, Zhang Ray, Pan Mingjie, Guo Zoey, Li Xiaoqi, Chen Zehui, Gao Peng, Guo Yandong, Zhang Shanghang
- Zhongjing: Enhancing The Chinese Medical Capabilities Of Large Language Model Through Expert Feedback And Real-world Multi-turn Dialogue Yang Songhua, Zhao Hanjie, Zhu Senbin, Zhou Guangyu, Xu Hongfei, Jia Yuxiang, Zan Hongying
- Psycot: Psychological Questionnaire As Powerful Chain-of-thought For Personality Detection Yang Tao, Shi Tianyuan, Wan Fanqi, Quan Xiaojun, Wang Qifan, Wu Bingzhe, Wu Jiaxiang
- Dior-cvae: Pre-trained Language Models And Diffusion Priors For Variational Dialog Generation Yang Tianyu, Tran Thy Thy, Gurevych Iryna
- Human-in-the-loop Machine Translation With Large Language Model Yang Xinyi, Zhan Runzhe, Wong Derek F., Wu Junchao, Chao Lidia S.
- Harnessing The Power Of Large Language Models For Natural Language To First-order Logic Translation Yang Yuan, Xiong Siheng, Payani Ali, Shareghi Ehsan, Fekri Faramarz
- GPT Can Solve Mathematical Problems Without A Calculator Yang Zhen, Ding Ming, Lv Qingsong, Jiang Zhihuan, He Zehai, Guo Yuyi, Bai Jinfeng, Tang Jie
- Tcp: Textual-based Class-aware Prompt Tuning For Visual-language Model Yao Hantao, Zhang Rui, Xu Changsheng
- LLM Lies: Hallucinations Are Not Bugs, But Features As Adversarial Examples Yao Jia-yu, Ning Kun-peng, Liu Zhen-hui, Ning Mu-nan, Liu Yu-yang, Yuan Li
- Nanolm: An Affordable LLM Pre-training Benchmark Via Accurate Loss Prediction Across Scales Yao Yiqun, Fan Siqi, Huang Xiusheng, Fang Xuezhi, Li Xiang, Ni Ziyi, Jiang Xin, Meng Xuying, Han Peng, Shang Shuo, Liu Kang, Sun Aixin, Wang Yequan
- Editing Large Language Models: Problems, Methods, And Opportunities Yao Yunzhi, Wang Peng, Tian Bozhong, Cheng Siyuan, Li Zhoubo, Deng Shumin, Chen Huajun, Zhang Ningyu
- Collaborative Large Language Model For Recommender Systems Yaochen Zhu, Liang Wu, Qi Guo, Liangjie Hong, Jundong Li
- Generating Data For Symbolic Language With Large Language Models Ye Jiacheng, Li Chengzu, Kong Lingpeng, Yu Tao
- Investigating The Effectiveness Of Task-agnostic Prefix Prompt For Instruction Following Ye Seonghyeon, Hwang Hyeonbin, Yang Sohee, Yun Hyeongu, Kim Yireun, Seo Minjoon
- FLASK: Fine-grained Language Model Evaluation Based On Alignment Skill Sets Ye Seonghyeon, Kim Doyoung, Kim Sungdong, Hwang Hyeonbin, Kim Seungone, Jo Yongrae, Thorne James, Kim Juho, Seo Minjoon
- Navigating Text-to-image Customization: From Lycoris Fine-tuning To Model Evaluation Yeh Shih-ying, Hsieh Yu-guan, Gao Zhidong, Yang Bernard B W, Oh Giyeong, Gong Yanmin
- Evaluating Object Hallucination In Large Vision-language Models Yifan Li, Yifan Du, Kun Zhou, Jinpeng Wang, Wayne Xin Zhao, Ji-rong Wen
- Cumulative Reasoning With Large Language Models Yifan Zhang, Jingqin Yang, Yang Yuan, Andrew Chi-chih Yao
- Lemur: Harmonizing Natural Language And Code For Language Agents Yiheng Xu, Hongjin Su, Chen Xing, Boyu Mi, Qian Liu, Weijia Shi, Binyuan Hui, Fan Zhou, Yitao Liu, Tianbao Xie, Zhoujun Cheng, Siheng Zhao, Lingpeng Kong, Bailin Wang, Caiming Xiong, Tao Yu
- Element-aware Summarization With Large Language Models: Expert-aligned Evaluation And Chain-of-thought Method Yiming Wang, Zhuosheng Zhang, Rui Wang
- Dynosaur: A Dynamic Growth Paradigm For Instruction-tuning Data Curation Yin Da, Liu Xiao, Yin Fan, Zhong Ming, Bansal Hritik, Han Jiawei, Chang Kai-wei
- Analyzing And Mitigating Object Hallucination In Large Vision-language Models Yiyang Zhou, Chenhang Cui, Jaehong Yoon, Linjun Zhang, Zhun Deng, Chelsea Finn, Mohit Bansal, Huaxiu Yao
- From Text To Motion: Grounding GPT-4 In A Humanoid Robot "alter3" Yoshida Takahide, Masumori Atsushi, Ikegami Takashi
- Ferret: Refer And Ground Anything Anywhere At Any Granularity You Haoxuan, Zhang Haotian, Gan Zhe, Du Xianzhi, Zhang Bowen, Wang Zirui, Cao Liangliang, Chang Shih-fu, Yang Yinfei
- Seqgpt: An Out-of-the-box Large Language Model For Open Domain Sequence Understanding Yu Tianyu, Jiang Chengyue, Lou Chao, Huang Shen, Wang Xiaobin, Liu Wei, Cai Jiong, Li Yangning, Li Yinghui, Tu Kewei, Zheng Hai-tao, Zhang Ningyu, Xie Pengjun, Huang Fei, Jiang Yong
- Large Language Model As Attributed Training Data Generator: A Tale Of Diversity And Bias Yu Yue, Zhuang Yuchen, Zhang Jieyu, Meng Yu, Ratner Alexander, Krishna Ranjay, Shen Jiaming, Zhang Chao
- How Vocabulary Sharing Facilitates Multilingualism In Llama? Yuan Fei, Yuan Shuai, Wu Zhiyong, Li Lei
- CRAFT: Customizing Llms By Creating And Retrieving From Specialized Toolsets Yuan Lifan, Chen Yangyi, Wang Xingyao, Fung Yi R., Peng Hao, Ji Heng
- Large Language Models Illuminate A Progressive Pathway To Artificial Healthcare Assistant: A Review Yuan Mingze, Bao Peng, Yuan Jiajia, Shen Yunhao, Chen Zifan, Xie Yi, Zhao Jie, Chen Yang, Zhang Li, Shen Lin, Dong Bin
- Osprey: Pixel Understanding With Visual Instruction Tuning Yuan Yuqian, Li Wentong, Liu Jian, Tang Dongqi, Luo Xinjie, Qin Chi, Zhang Lei, Zhu Jianke
- Artgpt-4: Towards Artistic-understanding Large Vision-language Models With Enhanced Adapter Yuan Zhengqing, He Yunhong, Wang Kun, Ye Yanfang, Sun Lichao
- Prompt Highlighter: Interactive Control For Multi-modal Llms Yuechen Zhang, Shengju Qian, Bohao Peng, Shu Liu, Jiaya Jia
- Aligning Large Language Models With Human: A Survey Yufei Wang, Wanjun Zhong, Liangyou Li, Fei Mi, Xingshan Zeng, Wenyong Huang, Lifeng Shang, Xin Jiang, Qun Liu
- Contextual Object Detection With Multimodal Large Language Models Yuhang Zang, Wei Li, Jun Han, Kaiyang Zhou, Chen Change Loy
- On Evaluating Adversarial Robustness Of Large Vision-language Models Yunqing Zhao, Tianyu Pang, Chao Du, Xiao Yang, Chongxuan Li, Ngai-man Cheung, Min Lin
- Large Language Models Are Zero-shot Rankers For Recommender Systems Yupeng Hou, Junjie Zhang, Zihan Lin, Hongyu Lu, Ruobing Xie, Julian Mcauley, Wayne Xin Zhao
- Pre-training To Learn In Context Yuxian Gu, Li Dong, Furu Wei, Minlie Huang
- On Diversified Preferences Of Large Language Model Alignment Zeng Dun, Dai Yong, Cheng Pengyu, Wang Longyue, Hu Tianhao, Chen Wanshun, Du Nan, Xu Zenglin
- TIM: Teaching Large Language Models To Translate With Comparison Zeng Jiali, Meng Fandong, Yin Yongjing, Zhou Jie
- Conditionally Combining Robot Skills Using Large Language Models Zentner K. R., Julian Ryan, Ichter Brian, Sukhatme Gaurav S.
- Next-chat: An LMM For Chat, Detection And Segmentation Zhang Ao, Yao Yuan, Ji Wei, Liu Zhiyuan, Chua Tat-seng
- Evaluating And Improving Tool-augmented Computation-intensive Math Reasoning Zhang Beichen, Zhou Kun, Wei Xilin, Zhao Wayne Xin, Sha Jing, Wang Shijin, Wen Ji-rong
- A Simple LLM Framework For Long-range Video Question-answering Zhang Ce, Lu Taixi, Islam Md Mohaiminul, Wang Ziyang, Yu Shoubin, Bansal Mohit, Bertasius Gedas
- A Comprehensive Analysis Of The Effectiveness Of Large Language Models As Automatic Dialogue Evaluators Zhang Chen, D'haro Luis Fernando, Chen Yiming, Zhang Malu, Li Haizhou
- Xdial-eval: A Multilingual Open-domain Dialogue Evaluation Benchmark Zhang Chen, D'haro Luis Fernando, Tang Chengguang, Shi Ke, Tang Guohua, Li Haizhou
- The Skipped Beat: A Study Of Sociopragmatic Understanding In Llms For 64 Languages Zhang Chiyu, Doan Khai Duy, Liao Qisheng, Abdul-mageed Muhammad
- R-tuning: Instructing Large Language Models To Say 'I Don't Know' Zhang Hanning, Diao Shizhe, Lin Yong, Fung Yi R., Lian Qing, Wang Xingyao, Chen Yangyi, Ji Heng, Zhang Tong
- XATU: A Fine-grained Instruction-based Benchmark For Explainable Text Updates Zhang Haopeng, Iso Hayate, Gurajada Sairam, Bhutani Nikita
- Crash: Clustering, Removing, And Sharing Enhance Fine-tuning Without Full Large Language Model Zhang Kaiyan, Ding Ning, Qi Biqing, Zhu Xuekai, Long Xinwei, Zhou Bowen
- Moqagpt: Zero-shot Multi-modal Open-domain Question Answering With Large Language Model Zhang Le, Wu Yihong, Mo Fengran, Nie Jian-yun, Agrawal Aishwarya
- Marathon: A Race Through The Realm Of Long Context With Large Language Models Zhang Lei, Li Yunshui, Liu Ziqiang, Yang Jiaxi, Liu Junhao, Chen Longze, Luo Run, Yang Min
- Retrieve Anything To Augment Large Language Models Zhang Peitian, Xiao Shitao, Liu Zheng, Dou Zhicheng, Nie Jian-yun
- Tell Your Model Where To Attend: Post-hoc Attention Steering For Llms Zhang Qingru, Singh Chandan, Liu Liyuan, Liu Xiaodong, Yu Bin, Gao Jianfeng, Zhao Tuo
- Llama-adapter: Efficient Fine-tuning Of Language Models With Zero-init Attention Zhang Renrui, Han Jiaming, Liu Chris, Gao Peng, Zhou Aojun, Hu Xiangfei, Yan Shilin, Lu Pan, Li Hongsheng, Qiao Yu
- M3exam: A Multilingual, Multimodal, Multilevel Benchmark For Examining Large Language Models Zhang Wenxuan, Aljunied Sharifah Mahani, Gao Chang, Chia Yew Ken, Bing Lidong
- PMC-VQA: Visual Instruction Tuning For Medical Visual Question Answering Zhang Xiaoman, Wu Chaoyi, Zhao Ziheng, Lin Weixiong, Zhang Ya, Wang Yanfeng, Xie Weidi
- Collm: Integrating Collaborative Embeddings Into Large Language Models For Recommendation Zhang Yang, Feng Fuli, Zhang Jizhi, Bao Keqin, Wang Qifan, He Xiangnan
- Llavar: Enhanced Visual Instruction Tuning For Text-rich Image Understanding Zhang Yanzhe, Zhang Ruiyi, Gu Jiuxiang, Zhou Yufan, Lipka Nedim, Yang Diyi, Sun Tong
- Making Large Language Models Perform Better In Knowledge Graph Completion Zhang Yichi, Chen Zhuo, Guo Lingbing, Xu Yajing, Zhang Wen, Chen Huajun
- Meta Prompting For AI Systems Zhang Yifan, Yuan Yang, Yao Andrew Chi-chih
- Llmeval: A Preliminary Study On How To Evaluate Large Language Models Zhang Yue, Zhang Ming, Yuan Haipeng, Liu Shichun, Shi Yongyao, Gui Tao, Zhang Qi, Huang Xuanjing
- H\(_2\)O: Heavy-hitter Oracle For Efficient Generative Inference Of Large Language Models Zhang Zhenyu, Sheng Ying, Zhou Tianyi, Chen Tianlong, Zheng Lianmin, Cai Ruisi, Song Zhao, Tian Yuandong, Ré Christopher, Barrett Clark, Wang Zhangyang, Chen Beidi
- Defending Large Language Models Against Jailbreaking Attacks Through Goal Prioritization Zhang Zhexin, Yang Junxiao, Ke Pei, Mi Fei, Wang Hongning, Huang Minlie
- PLUG: Leveraging Pivot Language In Cross-lingual Instruction Tuning Zhang Zhihan, Lee Dong-ho, Fang Yuwei, Yu Wenhao, Jia Mengzhao, Jiang Meng, Barbieri Francesco
- Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-thought Reasoning To Language Agents Zhang Zhuosheng, Yao Yao, Zhang Aston, Tang Xiangru, Ma Xinbei, He Zhiwei, Wang Yiming, Gerstein Mark, Wang Rui, Liu Gongshen, Zhao Hai
- You Only Look At Screens: Multimodal Chain-of-action Agents Zhang Zhuosheng, Zhang Aston
- How Do Large Language Models Capture The Ever-changing World Knowledge? A Review Of Recent Advances Zhang Zihan, Fang Meng, Chen Ling, Namazi-rad Mohammad-reza, Wang Jun
- Unifying The Perspectives Of NLP And Software Engineering: A Survey On Language Models For Code Zhang Ziyin, Chen Chaoyu, Liu Bingchang, Liao Cong, Gong Zi, Yu Hang, Li Jianguo, Wang Rui
- MELA: Multilingual Evaluation Of Linguistic Acceptability Zhang Ziyin, Liu Yikang, Huang Weifang, Mao Junyu, Wang Rui, Hu Hai
- SVIT: Scaling Up Visual Instruction Tuning Zhao Bo, Wu Boya, He Muyang, Huang Tiejun
- Genixer: Empowering Multimodal Large Language Models As A Powerful Data Generator Zhao Henry Hengyuan, Zhou Pan, Shou Mike Zheng
- Automatic Model Selection With Large Language Models For Reasoning Zhao James Xu, Xie Yuxi, Kawaguchi Kenji, He Junxian, Xie Michael Qizhe
- Ca-lora: Adapting Existing Lora For Compressed Llms To Enable Efficient Multi-tasking On Personal Devices Zhao Weilin, Huang Yuxiang, Han Xu, Liu Zhiyuan, Zhang Zhengyan, Li Kuai, Chen Chen, Yang Tao, Sun Maosong
- Easygen: Easing Multimodal Generation With Bidiffuser And Llms Zhao Xiangyu, Liu Bo, Liu Qijiong, Shi Guangyuan, Wu Xiao-ming
- Babystories: Can Reinforcement Learning Teach Baby Language Models To Write Better Stories? Zhao Xingmeng, Wang Tongnian, Osborn Sheri, Rios Anthony
- Lookahead: An Inference Acceleration Framework For Large Language Model With Lossless Generation Accuracy Zhao Yao, Xie Zhitian, Liang Chen, Zhuang Chenyi, Gu Jinjie
- Investigating Table-to-text Generation Capabilities Of Llms In Real-world Information Seeking Scenarios Zhao Yilun, Zhang Haowei, Si Shengyun, Nan Linyong, Tang Xiangru, Cohan Arman
- Robut: A Systematic Study Of Table QA Robustness Against Human-annotated Adversarial Perturbations Zhao Yilun, Zhao Chen, Nan Linyong, Qi Zhenting, Zhang Wenlin, Tang Xiangru, Mi Boyu, Radev Dragomir
- Multi-modal In-context Learning Makes An Ego-evolving Scene Text Recognizer Zhao Zhen, Tang Jingqun, Lin Chunhui, Wu Binghong, Huang Can, Liu Hao, Tan Xin, Zhang Zhizhong, Xie Yuan
- Mllm-dataengine: An Iterative Refinement Approach For MLLM Zhao Zhiyuan, Ouyang Linke, Wang Bin, Huang Siyuan, Zhang Pan, Dong Xiaoyi, Wang Jiaqi, He Conghui
- Librisqa: A Novel Dataset And Framework For Spoken Question Answering With Large Language Models Zhao Zihan, Jiang Yiyang, Liu Heyang, Wang Yanfeng, Wang Yu
- Adapting Large Language Models By Integrating Collaborative Semantics For Recommendation Zheng Bowen, Hou Yupeng, Lu Hongyu, Chen Yu, Zhao Wayne Xin, Chen Ming, Wen Ji-rong
- Can We Edit Factual Knowledge By In-context Learning? Zheng Ce, Li Lei, Dong Qingxiu, Fan Yuxuan, Wu Zhiyong, Xu Jingjing, Chang Baobao
- Learn From Model Beyond Fine-tuning: A Survey Zheng Hongling, Shen Li, Tang Anke, Luo Yong, Hu Han, Du Bo, Tao Dacheng
- Judging Llm-as-a-judge With Mt-bench And Chatbot Arena Zheng Lianmin, Chiang Wei-lin, Sheng Ying, Zhuang Siyuan, Wu Zhanghao, Zhuang Yonghao, Lin Zi, Li Zhuohan, Li Dacheng, Xing Eric P., Zhang Hao, Gonzalez Joseph E., Stoica Ion
- Sglang: Efficient Execution Of Structured Language Model Programs Zheng Lianmin, Yin Liangsheng, Xie Zhiqiang, Sun Chuyue, Huang Jeff, Yu Cody Hao, Cao Shiyi, Kozyrakis Christos, Stoica Ion, Gonzalez Joseph E., Barrett Clark, Sheng Ying
- Trafficsafetygpt: Tuning A Pre-trained Large Language Model To A Domain-specific Expert In Transportation Safety Zheng Ou, Abdel-aty Mohamed, Wang Dongdong, Wang Chenzhu, Ding Shengxuan
- Large Language Models Are Good Prompt Learners For Low-shot Image Classification Zheng Zhaoheng, Wei Jingmin, Hu Xuefeng, Zhu Haidong, Nevatia Ram
- Large Language Model Can Interpret Latent Space Of Sequential Recommender Zhengyi Yang, Jiancan Wu, Yanchen Luo, Jizhi Zhang, Yancheng Yuan, An Zhang, Xiang Wang, Xiangnan He
- MM-REACT: Prompting Chatgpt For Multimodal Reasoning And Action Zhengyuan Yang, Linjie Li, Jianfeng Wang, Kevin Lin, Ehsan Azarnasab, Faisal Ahmed, Zicheng Liu, Ce Liu, Michael Zeng, Lijuan Wang
- Self-polish: Enhance Reasoning In Large Language Models Via Problem Refinement Zhiheng Xi, Senjie Jin, Yuhao Zhou, Rui Zheng, Songyang Gao, Tao Gui, Qi Zhang, Xuanjing Huang
- The Rise And Potential Of Large Language Model Based Agents: A Survey Zhiheng Xi, Wenxiang Chen, Xin Guo, Wei He, Yiwen Ding, Boyang Hong, Ming Zhang, Junzhe Wang, Senjie Jin, Enyu Zhou, Rui Zheng, Xiaoran Fan, Xiao Wang, Limao Xiong, Yuhao Zhou, Weiran Wang, Changhao Jiang, Yicheng Zou, Xiangyang Liu, Zhangyue Yin, Shihan Dou, Rongxiang Weng, Wensen Cheng, Qi Zhang, Wenjuan Qin, Yongyan Zheng, Xipeng Qiu, Xuanjing Huang, Tao Gui
- Humanoid Agents: Platform For Simulating Human-like Generative Agents Zhilin Wang, Yu Ying Chiu, Yu Cheung Chiu
- Aligning Large Multimodal Models With Factually Augmented RLHF Zhiqing Sun, Sheng Shen, Shengcao Cao, Haotian Liu, Chunyuan Li, Yikang Shen, Chuang Gan, Liang-yan Gui, Yu-xiong Wang, Yiming Yang, Kurt Keutzer, Trevor Darrell
- Exploring Human-like Translation Strategy With Large Language Models Zhiwei He, Tian Liang, Wenxiang Jiao, Zhuosheng Zhang, Yujiu Yang, Rui Wang, Zhaopeng Tu, Shuming Shi, Xing Wang
- Beyond Hallucinations: Enhancing Lvlms Through Hallucination-aware Direct Preference Optimization Zhiyuan Zhao, Bin Wang, Linke Ouyang, Xiaoyi Dong, Jiaqi Wang, Conghui He
- Hierarchical Neural Program Synthesis Zhong Linghan, Lindeborg Ryan, Zhang Jesse, Lim Joseph J., Sun Shao-hua
- Seeking Neural Nuggets: Knowledge Transfer In Large Language Models From A Parametric Perspective Zhong Ming, An Chenxin, Chen Weizhu, Han Jiawei, He Pengcheng
- Let's Think Outside The Box: Exploring Leap-of-thought In Large Language Models With Creative Humor Generation Zhong Shanshan, Huang Zhongzhan, Gao Shanghua, Wen Wushao, Lin Liang, Zitnik Marinka, Zhou Pan
- Sur-adapter: Enhancing Text-to-image Pre-trained Diffusion Models With Large Language Models Zhong Shanshan, Huang Zhongzhan, Wen Wushao, Qin Jinghui, Lin Liang
- Language Agent Tree Search Unifies Reasoning Acting And Planning In Language Models Zhou Andy, Yan Kai, Shlapentokh-rothman Michal, Wang Haohan, Wang Yu-xiong
- Instruction-following Evaluation For Large Language Models Zhou Jeffrey, Lu Tianjian, Mishra Swaroop, Brahma Siddhartha, Basu Sujoy, Luan Yi, Zhou Denny, Hou Le
- Exploring Recommendation Capabilities Of Gpt-4v(ision): A Preliminary Case Study Zhou Peilin, Cao Meng, Huang You-liang, Ye Qichen, Zhang Peiyan, Liu Junling, Xie Yueqi, Hua Yining, Kim Jaeboum
- Infmllm: A Unified Framework For Visual-language Tasks Zhou Qiang, Wang Zhibin, Chu Wei, Xu Yinghui, Li Hao, Qi Yuan
- Recurrentgpt: Interactive Generation Of (arbitrarily) Long Text Zhou Wangchunshu, Jiang Yuchen Eleanor, Cui Peng, Wang Tiannan, Xiao Zhenxin, Hou Yifan, Cotterell Ryan, Sachan Mrinmaya
- Context-faithful Prompting For Large Language Models Zhou Wenxuan, Zhang Sheng, Poon Hoifung, Chen Muhao
- Large Language Model As A Policy Teacher For Training Reinforcement Learning Agents Zhou Zihao, Hu Bin, Zhao Chenyang, Zhang Pu, Liu Bin
- Promptbench: A Unified Library For Evaluation Of Large Language Models Zhu Kaijie, Zhao Qinlin, Chen Hao, Wang Jindong, Xie Xing
- Coca: Fusing Position Embedding With Collinear Constrained Attention In Transformers For Long Context Window Extending Zhu Shiyi, Ye Jing, Jiang Wei, Xue Siqiao, Zhang Qi, Wu Yifan, Li Jianguo
- Multilingual Machine Translation With Large Language Models: Empirical Results And Analysis Zhu Wenhao, Liu Hongyi, Dong Qingxiu, Xu Jingjing, Huang Shujian, Kong Lingpeng, Chen Jiajun, Li Lei
- Question Answering As Programming For Solving Time-sensitive Questions Zhu Xinyu, Yang Cheng, Chen Bei, Li Siheng, Lou Jian-guang, Yang Yujiu
- Pad: Program-aided Distillation Can Teach Small Models Reasoning Better Than Chain-of-thought Fine-tuning Zhu Xuekai, Qi Biqing, Zhang Kaiyan, Long Xinwei, Lin Zhouhan, Zhou Bowen
- Open-source Large Language Models Are Strong Zero-shot Query Likelihood Models For Document Ranking Zhuang Shengyao, Liu Bing, Koopman Bevan, Zuccon Guido
- Ice-score: Instructing Large Language Models To Evaluate Code Zhuo Terry Yue
- Multimodal Chain-of-thought Reasoning In Language Models Zhuosheng Zhang, Aston Zhang, Mu Li, Hai Zhao, George Karypis, Alex Smola
- Describe, Explain, Plan And Select: Interactive Planning With Large Language Models Enables Open-world Multi-task Agents Zihao Wang, Shaofei Cai, Guanzhou Chen, Anji Liu, Xiaojian Ma, Yitao Liang
- Incorporating Probing Signals Into Multimodal Machine Translation Via Visual Question-answering Pairs Zuo Yuxin, Li Bei, Lv Chuanhao, Zheng Tong, Xiao Tong, Zhu Jingbo
- Attnlrp: Attention-aware Layer-wise Relevance Propagation For Transformers Achtibat Reduan, Hatefi Sayed Mohammad Vakilzadeh, Dreyer Maximilian, Jain Aakriti, Wiegand Thomas, Lapuschkin Sebastian, Samek Wojciech
- Etalon: Holistic Performance Evaluation Framework For LLM Inference Systems Agrawal Amey, Agarwal Anmol, Kedia Nitin, Mohan Jayashree, Kundu Souvik, Kwatra Nipun, Ramjee Ramachandran, Tumanov Alexey
- The Curious Case Of Nonverbal Abstract Reasoning With Multi-modal Large Language Models Ahrabian Kian, Sourati Zhivar, Sun Kexuan, Zhang Jiarui, Jiang Yifan, Morstatter Fred, Pujara Jay
- Shadowllm: Predictor-based Contextual Sparsity For Large Language Models Akhauri Yash, Abouelhamayed Ahmed F, Dotzel Jordan, Zhang Zhiru, Rush Alexander M, Huda Safeen, Abdelfattah Mohamed S
- Mixture Of Modular Experts: Distilling Knowledge From A Multilingual Teacher Into Specialized Modular Language Models Al-maamari Mohammed, Amor Mehdi Ben, Granitzer Michael
- Automatic Pseudo-harmful Prompt Generation For Evaluating False Refusals In Large Language Models An Bang, Zhu Sicheng, Zhang Ruiyi, Panaitescu-liess Michael-andrei, Xu Yuancheng, Huang Furong
- Training-free Long-context Scaling Of Large Language Models An Chenxin, Huang Fei, Zhang Jun, Gong Shansan, Qiu Xipeng, Zhou Chang, Kong Lingpeng
- Funaudiollm: Voice Understanding And Generation Foundation Models For Natural Interaction Between Humans And Llms An Keyu, Chen Qian, Deng Chong, Du Zhihao, Gao Changfeng, Gao Zhifu, Gu Yue, He Ting, Hu Hangrui, Hu Kai, Ji Shengpeng, Li Yabin, Li Zerui, Lu Heng, Luo Haoneng, Lv Xiang, Ma Bin, Ma Ziyang, Ni Chongjia, Song Changhe, Shi Jiaqi, Shi Xian, Wang Hao, Wang Wen, Wang Yuxuan, Xiao Zhangyu, Yan Zhijie, Yang Yexin, Zhang Bin, Zhang Qinglin, Zhang Shiliang, Zhao Nan, Zheng Siqi
- Make Your LLM Fully Utilize The Context An Shengnan, Ma Zexiong, Lin Zeqi, Zheng Nanning, Lou Jian-guang
- Does Refusal Training In Llms Generalize To The Past Tense? Andriushchenko Maksym, Flammarion Nicolas
- Scaling Sparse Fine-tuning To Large Language Models Ansell Alan, Vulić Ivan, Sterz Hannah, Korhonen Anna, Ponti Edoardo M.
- Simple Linear Attention Language Models Balance The Recall-throughput Tradeoff Arora Simran, Eyuboglu Sabri, Zhang Michael, Timalsina Aman, Alberti Silas, Zinsley Dylan, Zou James, Rudra Atri, Ré Christopher
- Slicegpt: Compress Large Language Models By Deleting Rows And Columns Ashkboos Saleh, Croci Maximilian L., Nascimento Marcelo Gennari Do, Hoefler Torsten, Hensman James
- Minigpt4-video: Advancing Multimodal Llms For Video Understanding With Interleaved Visual-textual Tokens Ataallah Kirolos, Shen Xiaoqian, Abdelrahman Eslam, Sleiman Essam, Zhu Deyao, Ding Jian, Elhoseiny Mohamed
- Minigpt-reverse-designing: Predicting Image Adjustments Utilizing Minigpt-4 Azizi Vahid, Koochaki Fatemeh
- The Pitfalls Of Next-token Prediction Bachmann Gregor, Nagarajan Vaishnavh
- Mt-bench-101: A Fine-grained Benchmark For Evaluating Large Language Models In Multi-turn Dialogues Bai Ge, Liu Jie, Bu Xingyuan, He Yancheng, Liu Jiaheng, Zhou Zhanhui, Lin Zhuoran, Su Wenbo, Ge Tiezheng, Zheng Bo, Ouyang Wanli
- Longalign: A Recipe For Long Context Alignment Of Large Language Models Bai Yushi, Lv Xin, Zhang Jiajie, He Yuze, Qi Ji, Hou Lei, Tang Jie, Dong Yuxiao, Li Juanzi
- Longwriter: Unleashing 10,000+ Word Generation From Long Context Llms Bai Yushi, Zhang Jiajie, Lv Xin, Zheng Linzhi, Zhu Siqi, Hou Lei, Dong Yuxiao, Tang Jie, Li Juanzi
- Hallucination Of Multimodal Large Language Models: A Survey Bai Zechen, Wang Pichao, Xiao Tianjun, He Tong, Han Zongbo, Zhang Zheng, Shou Mike Zheng
- MARS: Meaning-aware Response Scoring For Uncertainty Estimation In Generative Llms Bakman Yavuz Faruk, Yaldiz Duygu Nur, Buyukates Baturalp, Tao Chenyang, Dimitriadis Dimitrios, Avestimehr Salman
- Demystifying Platform Requirements For Diverse LLM Inference Use Cases Bambhaniya Abhimanyu, Raj Ritik, Jeong Geonhwa, Kundu Souvik, Srinivasan Sudarshan, Elavazhagan Midhilesh, Kumar Madhu, Krishna Tushar
- Comparing Bad Apples To Good Oranges: Aligning Large Language Models Via Joint Preference Optimization Bansal Hritik, Suvarna Ashima, Bhatt Gantavya, Peng Nanyun, Chang Kai-wei, Grover Aditya
- Llms With Chain-of-thought Are Non-causal Reasoners Bao Guangsheng, Zhang Hongbo, Yang Linyi, Wang Cunxiang, Zhang Yue
- Enhanced Language Model Truthfulness With Learnable Intervention And Uncertainty Expression Bayat Farima Fatahi, Liu Xin, Jagadish H. V., Wang Lu
- Cross-lingual Editing In Multilingual Language Models Beniwal Himanshu, D Kowsik Nandagopan, Singh Mayank
- Soft Prompting For Unlearning In Large Language Models Bhaila Karuna, Van Minh-hao, Wu Xintao
- Language Models Are Homer Simpson! Safety Re-alignment Of Fine-tuned Language Models Through Task Arithmetic Bhardwaj Rishabh, Anh Do Duc, Poria Soujanya
- Fintral: A Family Of GPT-4 Level Multimodal Financial Large Language Models Bhatia Gagan, Nagoudi El Moatez Billah, Cavusoglu Hasan, Abdul-mageed Muhammad
- Decoding By Contrasting Knowledge: Enhancing Llms' Confidence On Edited Facts Bi Baolong, Liu Shenghua, Mei Lingrui, Wang Yiwei, Ji Pengliang, Cheng Xueqi
- Workarena++: Towards Compositional Planning And Reasoning-based Common Knowledge Work Tasks Boisvert Léo, Thakkar Megh, Gasse Maxime, Caccia Massimo, De Chezelles Thibault Le Sellier, Cappart Quentin, Chapados Nicolas, Lacoste Alexandre, Drouin Alexandre
- Windows Agent Arena: Evaluating Multi-modal OS Agents At Scale Bonatti Rogerio, Zhao Dan, Bonacci Francesco, Dupont Dillon, Abdali Sara, Li Yinheng, Wagle Justin, Koishida Kazuhito, Bucker Arthur, Jang Lawrence, Hui Zack
- Low-rank Quantization-aware Training For Llms Bondarenko Yelysei, Del Chiaro Riccardo, Nagel Markus
- Elephants Never Forget: Memorization And Learning Of Tabular Data In Large Language Models Bordt Sebastian, Nori Harsha, Rodrigues Vanessa, Nushi Besmira, Caruana Rich
- Fast And Effective Weight Update For Pruned Large Language Models Boža Vladimír
- ACORN: Aspect-wise Commonsense Reasoning Explanation Evaluation Brassard Ana, Heinzerling Benjamin, Kudo Keito, Sakaguchi Keisuke, Inui Kentaro
- Lissard: Long And Simple Sequential Reasoning Datasets Bueno Mirelle, Lotufo Roberto, Nogueira Rodrigo
- Llms Are Few-shot In-context Low-resource Language Learners Cahyawijaya Samuel, Lovenia Holy, Fung Pascale
- A Survey On Mixture Of Experts Cai Weilin, Jiang Juyong, Wang Fan, Tang Jing, Kim Sunghun, Huang Jiayi
- On The Worst Prompt Performance Of Large Language Models Cao Bowen, Cai Deng, Zhang Zhisong, Zou Yuexian, Lam Wai
- PRESTO: Progressive Pretraining Enhances Synthetic Chemistry Outcomes Cao He, Shao Yanjun, Liu Zhiyuan, Liu Zijing, Tang Xiangru, Yao Yuan, Li Yu
- RQ-RAG: Learning To Refine Queries For Retrieval Augmented Generation Chan Chi-min, Xu Chunpu, Yuan Ruibin, Luo Hongyin, Xue Wei, Guo Yike, Fu Jie
- Prompto: An Open Source Library For Asynchronous Querying Of LLM Endpoints Chan Ryan Sze-yin, Nanni Federico, Brown Edwin, Chapman Ed, Williams Angus R., Bright Jonathan, Gabasova Evelina
- Bias-aware Low-rank Adaptation: Mitigating Catastrophic Inheritance Of Large Language Models Chang Yupeng, Chang Yi, Wu Yuan
- GUI-WORLD: A Dataset For Gui-oriented Multimodal Llm-based Agents Chen Dongping, Huang Yue, Wu Siyuan, Tang Jingyu, Chen Liuyi, Bai Yilin, He Zhigang, Wang Chenlong, Zhou Huichi, Li Yiqiang, Zhou Tianshuo, Yu Yue, Gao Chujie, Zhang Qihui, Gui Yi, Li Zhen, Wan Yao, Zhou Pan, Gao Jianfeng, Sun Lichao
- Hardware-aware Parallel Prompt Decoding For Memory-efficient Acceleration Of LLM Inference Chen Hao Mark, Luk Wayne, Yiu Ka Fai Cedric, Li Rui, Mishchenko Konstantin, Venieris Stylianos I., Fan Hongxiang
- Generalizing Conversational Dense Retrieval Via Llm-cognition Data Augmentation Chen Haonan, Dou Zhicheng, Mao Kelong, Liu Jiongnan, Zhao Ziliang
- Elcorec: Enhance Language Understanding With Co-propagation Of Numerical And Categorical Features For Recommendation Chen Jizheng, Du Kounianhua, Lin Jianghao, Chen Bo, Tang Ruiming, Zhang Weinan
- Putting People In Llms' Shoes: Generating Better Answers Via Question Rewriter Chen Junhao, Wang Bowen, Jiang Zhouqiang, Nakashima Yuta
- Reasoning Runtime Behavior Of A Program With LLM: How Far Are We? Chen Junkai, Pan Zhiyuan, Hu Xing, Li Zhenhao, Li Ge, Xia Xin
- Efficientqat: Efficient Quantization-aware Training For Large Language Models Chen Mengzhao, Shao Wenqi, Xu Peng, Wang Jiahao, Gao Peng, Zhang Kaipeng, Qiao Yu, Luo Ping
- Jumpcoder: Go Beyond Autoregressive Coder Via Online Modification Chen Mouxiang, Tian Hao, Liu Zhongxin, Ren Xiaoxue, Sun Jianling
- WSI-VQA: Interpreting Whole Slide Images By Generative Visual Question Answering Chen Pingyi, Zhu Chenglu, Zheng Sunyi, Li Honglin, Yang Lin
- Reinstruct: Building Instruction Data From Unlabeled Corpus Chen Shu, Guan Xinyan, Lu Yaojie, Lin Hongyu, Han Xianpei, Sun Le
- Struq: Defending Against Prompt Injection With Structured Queries Chen Sizhe, Piet Julien, Sitawarin Chawin, Wagner David
- Beyond Natural Language: Llms Leveraging Alternative Formats For Enhanced Reasoning And Communication Chen Weize, Yuan Chenfei, Yuan Jiarui, Su Yusheng, Qian Chen, Yang Cheng, Xie Ruobing, Liu Zhiyuan, Sun Maosong
- Icleval: Evaluating In-context Learning Ability Of Large Language Models Chen Wentong, Lin Yankai, Zhou Zhenhao, Huang Hongyun, Jia Yantao, Cao Zhao, Wen Ji-rong
- Llast: Improved End-to-end Speech Translation System Leveraged By Large Language Models Chen Xi, Zhang Songyang, Bai Qibing, Chen Kai, Nakamura Satoshi
- Take The Bull By The Horns: Hard Sample-reweighted Continual Training Improves LLM Generalization Chen Xuxi, Wang Zhendong, Sow Daouda, Yang Junjie, Chen Tianlong, Liang Yingbin, Zhou Mingyuan, Wang Zhangyang
- NACL: A General And Effective KV Cache Eviction Framework For Llms At Inference Time Chen Yilong, Wang Guoxia, Shang Junyuan, Cui Shiyao, Zhang Zhenyu, Liu Tingwen, Wang Shuohuan, Sun Yu, Yu Dianhai, Wu Hua
- Robust And Scalable Model Editing For Large Language Models Chen Yingfa, Zhang Zhengyan, Han Xu, Xiao Chaojun, Liu Zhiyuan, Chen Chen, Li Kuai, Yang Tao, Sun Maosong
- Agent-flan: Designing Data And Methods Of Effective Agent Tuning For Large Language Models Chen Zehui, Liu Kuikun, Wang Qiuchen, Zhang Wenwei, Liu Jiangning, Lin Dahua, Chen Kai, Zhao Feng
- MLLM Is A Strong Reranker: Advancing Multimodal Retrieval-augmented Generation Via Knowledge-enhanced Reranking And Noise-injected Training Chen Zhanpeng, Xu Chengjin, Qi Yiyan, Guo Jian
- What Are The Essential Factors In Crafting Effective Long Context Multi-hop Instruction Datasets? Insights And Best Practices Chen Zhi, Chen Qiguang, Qin Libo, Guo Qipeng, Lv Haijun, Zou Yicheng, Che Wanxiang, Yan Hang, Chen Kai, Lin Dahua
- An Empirical Study Of Data Ability Boundary In Llms' Math Reasoning Chen Zui, Chen Yezeng, Han Jiaqi, Huang Zhijie, Qi Ji, Zhou Yi
- Self-playing Adversarial Language Game Enhances LLM Reasoning Cheng Pengyu, Hu Tianhao, Xu Han, Zhang Zhisong, Dai Yong, Han Lei, Du Nan
- Call Me When Necessary: Llms Can Efficiently And Faithfully Reason Over Structured Environments Cheng Sitao, Zhuang Ziyuan, Xu Yong, Yang Fangkai, Zhang Chaoyun, Qin Xiaoting, Huang Xiang, Chen Ling, Lin Qingwei, Zhang Dongmei, Rajmohan Saravan, Zhang Qi
- Information Re-organization Improves Reasoning In Large Language Models Cheng Xiaoxia, Tan Zeqi, Xue Wei, Lu Weiming
- Small Agent Can Also Rock! Empowering Small Language Models As Hallucination Detector Cheng Xiaoxue, Li Junyi, Zhao Wayne Xin, Zhang Hongzhi, Zhang Fuzheng, Zhang Di, Gai Kun, Wen Ji-rong
- Can Large Language Models Be Trusted For Evaluation? Scalable Meta-evaluation Of Llms As Evaluators Via Agent Debate Chern Steffi, Chern Ethan, Neubig Graham, Liu Pengfei
- Walert: Putting Conversational Search Knowledge Into Action By Building And Evaluating A Large Language Model-powered Chatbot Cherumanal Sachin Pathiyan, Tian Lin, Abushaqra Futoon M., De Paula Angel Felipe Magnossao, Ji Kaixin, Hettiachchi Danula, Trippas Johanne R., Ali Halil, Scholer Falk, Spina Damiano
- Retrieval-augmented Generation In Multilingual Settings Chirkova Nadezhda, Rau David, Déjean Hervé, Formal Thibault, Clinchant Stéphane, Nikoulina Vassilina
- An Empirical Study Of In-context Learning In Llms For Machine Translation Chitale Pranjal A., Gala Jay, Dabre Raj
- Exploring Quantization For Efficient Pre-training Of Transformer Language Models Chitsaz Kamran, Fournier Quentin, Mordido Gonçalo, Chandar Sarath
- Larger Language Models Don't Care How You Think: Why Chain-of-thought Prompting Fails In Subjective Tasks Chochlakis Georgios, Pandiyan Niyantha Maruthu, Lerman Kristina, Narayanan Shrikanth
- Can Only Llms Do Reasoning?: Potential Of Small Language Models In Task Planning Choi Gawon, Ahn Hyemin
- Picle: Eliciting Diverse Behaviors From Large Language Models With Persona In-context Learning Choi Hyeong Kyu, Li Yixuan
- Think Together And Work Better: Combining Humans' And Llms' Think-aloud Outcomes For Effective Text Evaluation Chu Seongyeub, Kim Jongwoo, Yi Munyong
- Crosslingual Capabilities And Knowledge Barriers In Multilingual Large Language Models Chua Lynn, Ghazi Badih, Huang Yangsibo, Kamath Pritish, Kumar Ravi, Manurangsi Pasin, Sinha Amer, Xie Chulin, Zhang Chiyuan
- Informed AI Regulation: Comparing The Ethical Frameworks Of Leading LLM Chatbots Using An Ethics-based Audit To Assess Moral Reasoning And Normative Values Chun Jon, Elkins Katherine
- Improving Token-based World Models With Parallel Observation Prediction Cohen Lior, Wang Kaixin, Kang Bingyi, Mannor Shie
- Attentionlego: An Open-source Building Block For Spatially-scalable Large Language Model Accelerator With Processing-in-memory Technology Cong Rongqing, He Wenyang, Li Mingxuan, Luo Bangning, Yang Zebin, Yang Yuchao, Huang Ru, Yan Bonan
- LLASP: Fine-tuning Large Language Models For Answer Set Programming Coppolillo Erica, Calimeri Francesco, Manco Giuseppe, Perri Simona, Ricca Francesco
- Teenytinyllama: Open-source Tiny Language Models Trained In Brazilian Portuguese Corrêa Nicholas Kluge, Falk Sophia, Fatimah Shiza, Sen Aniket, De Oliveira Nythamar
- Rethinking LLM Language Adaptation: A Case Study On Chinese Mixtral Cui Yiming, Yao Xin
- Anchored Preference Optimization And Contrastive Revisions: Addressing Underspecification In Alignment D'oosterlinck Karel, Xu Winnie, Develder Chris, Demeester Thomas, Singh Amanpreet, Potts Christopher, Kiela Douwe, Mehri Shikib
- Improve Student's Reasoning Generalizability Through Cascading Decomposed Cots Distillation Dai Chengwei, Li Kun, Zhou Wei, Hu Songlin
- MHPP: Exploring The Capabilities And Limitations Of Language Models Beyond Basic Code Generation Dai Jianbo, Lu Jianqiao, Feng Yunlong, Ruan Rongju, Cheng Ming, Tan Haochen, Guo Zhijiang
- Show, Don't Tell: Evaluating Large Language Models Beyond Textual Understanding With Childplay De Carvalho Gonçalo Hora, Knap Oscar, Pollice Robert
- Understanding Biases In Chatgpt-based Recommender Systems: Provider Fairness, Temporal Stability, And Recency Deldjoo Yashar
- Text-tuple-table: Towards Information Integration In Text-to-table Generation Via Global Tuple Extraction Deng Zheye, Chan Chunkit, Wang Weiqi, Sun Yuxi, Fan Wei, Zheng Tianshi, Yim Yauwai, Song Yangqiu
- Unveiling Encoder-free Vision-language Models Diao Haiwen, Cui Yufeng, Li Xiaotong, Wang Yueze, Lu Huchuan, Wang Xinlong
- MANGO: A Benchmark For Evaluating Mapping And Navigation Abilities Of Large Language Models Ding Peng, Fang Jiading, Li Peng, Wang Kangrui, Zhou Xiaochen, Yu Mo, Li Jing, Walter Matthew R., Mei Hongyuan
- Hallu-pi: Evaluating Hallucination In Multi-modal Large Language Models Within Perturbed Inputs Ding Peng, Wu Jingyu, Kuang Jun, Ma Dan, Cao Xuezhi, Cai Xunliang, Chen Shi, Chen Jiajun, Huang Shujian
- Boosting Large Language Models With Socratic Method For Conversational Mathematics Teaching Ding Yuyang, Hu Hanglei, Zhou Jie, Chen Qin, Jiang Bo, He Liang
- Finding Blind Spots In Evaluator Llms With Interpretable Checklists Doddapaneni Sumanth, Khan Mohammed Safi Ur Rahman, Verma Sshubam, Khapra Mitesh M.
- Self-play With Execution Feedback: Improving Instruction-following Capabilities Of Large Language Models Dong Guanting, Lu Keming, Li Chengpeng, Xia Tingyu, Yu Bowen, Zhou Chang, Zhou Jingren
- Understand What LLM Needs: Dual Preference Alignment For Retrieval-augmented Generation Dong Guanting, Zhu Yutao, Zhang Chenghao, Wang Zechen, Dou Zhicheng, Wen Ji-rong
- RLHF Workflow: From Reward Modeling To Online RLHF Dong Hanze, Xiong Wei, Pang Bo, Wang Haoxiang, Zhao Han, Zhou Yingbo, Jiang Nan, Sahoo Doyen, Xiong Caiming, Zhang Tong
- Pruner-zero: Evolving Symbolic Pruning Metric From Scratch For Large Language Models Dong Peijie, Li Lujun, Tang Zhenheng, Liu Xiang, Pan Xinglin, Wang Qiang, Chu Xiaowen
- Internlm-xcomposer2-4khd: A Pioneering Large Vision-language Model Handling Resolutions From 336 Pixels To 4K HD Dong Xiaoyi, Zhang Pan, Zang Yuhang, Cao Yuhang, Wang Bin, Ouyang Linke, Zhang Songyang, Duan Haodong, Zhang Wenwei, Li Yining, Yan Hang, Gao Yang, Chen Zhe, Zhang Xinyue, Li Wei, Li Jingwen, Wang Wenhai, Chen Kai, He Conghui, Zhang Xingcheng, Dai Jifeng, Qiao Yu, Lin Dahua, Wang Jiaqi
- Attacks, Defenses And Evaluations For LLM Conversation Safety: A Survey Dong Zhichen, Zhou Zhanhui, Yang Chao, Shao Jing, Qiao Yu
- Learning From Students: Applying T-distributions To Explore Accurate And Efficient Formats For Llms Dotzel Jordan, Chen Yuzong, Kotb Bahaa, Prasad Sushma, Wu Gang, Li Sheng, Abdelfattah Mohamed S., Zhang Zhiru
- Evaluating Language Model Context Windows: A "working Memory" Test And Inference-time Correction Dsouza Amanda, Glaze Christopher, Shin Changho, Sala Frederic
- Bitdistiller: Unleashing The Potential Of Sub-4-bit Llms Via Self-distillation Du Dayou, Zhang Yijia, Cao Shijie, Guo Jiaqi, Cao Ting, Chu Xiaowen, Xu Ningyi
- Unlocking Continual Learning Abilities In Language Models Du Wenyu, Cheng Shuang, Luo Tongxu, Qiu Zihan, Huang Zeyu, Cheung Ka Chun, Cheng Reynold, Fu Jie
- Stacking Your Transformers: A Closer Look At Model Growth For Efficient LLM Pre-training Du Wenyu, Luo Tongxu, Qiu Zihan, Huang Zeyu, Shen Yikang, Cheng Reynold, Guo Yike, Fu Jie
- Revisiting Moe And Dense Speed-accuracy Comparisons For LLM Training Du Xianzhi, Gunter Tom, Kong Xiang, Lee Mark, Wang Zirui, Zhang Aonan, Du Nan, Pang Ruoming
- Instruction-guided Scene Text Recognition Du Yongkun, Chen Zhineng, Su Yuchen, Jia Caiyan, Jiang Yu-gang
- On The Evaluation Consistency Of Attribution-based Explanations Duan Jiarui, Li Haoling, Zhang Haofei, Jiang Hao, Xue Mengqi, Sun Li, Song Mingli, Song Jie
- Length-controlled Alpacaeval: A Simple Way To Debias Automatic Evaluators Dubois Yann, Galambosi Balázs, Liang Percy, Hashimoto Tatsunori B.
- ARAGOG: Advanced RAG Output Grading Eibich Matouš, Nagpal Shivay, Fred-ojala Alexander
- M2QA: Multi-domain Multilingual Question Answering Engländer Leon, Sterz Hannah, Poth Clifton, Pfeiffer Jonas, Kuznetsov Ilia, Gurevych Iryna
- Bertaqa: How Much Do Language Models Know About Local Culture? Etxaniz Julen, Azkune Gorka, Soroa Aitor, De Lacalle Oier Lopez, Artetxe Mikel
- Latxa: An Open Language Model And Evaluation Suite For Basque Etxaniz Julen, Sainz Oscar, Perez Naiara, Aldabe Itziar, Rigau German, Agirre Eneko, Ormazabal Aitor, Artetxe Mikel, Soroa Aitor
- Coda: Constrained Generation Based Data Augmentation For Low-resource NLP Evuru Chandra Kiran Reddy, Ghosh Sreyan, Kumar Sonal, S Ramaneswaran, Tyagi Utkarsh, Manocha Dinesh
- Data-augmentation-based Dialectal Adaptation For Llms Faisal Fahim, Anastasopoulos Antonios
- Limgen: Probing The Llms For Generating Suggestive Limitations Of Research Papers Faizullah Abdur Rahman Bin Md, Urlana Ashok, Mishra Rahul
- Nphardeval4v: A Dynamic Reasoning Benchmark Of Multimodal Large Language Models Fan Lizhou, Hua Wenyue, Li Xiang, Zhu Kaijie, Jin Mingyu, Li Lingyao, Ling Haoyang, Chi Jinkui, Wang Jindong, Ma Xin, Zhang Yongfeng
- Reformatted Alignment Fan Run-ze, Li Xuefeng, Zou Haoyang, Li Junlong, He Shwai, Chern Ethan, Hu Jiewen, Liu Pengfei
- Evaluating Generative Language Models In Information Extraction As Subjective Question Correction Fan Yuchen, Liu Yantao, Yao Zijun, Yu Jifan, Hou Lei, Li Juanzi
- Enhancing Noise Robustness Of Retrieval-augmented Language Models With Adaptive Adversarial Training Fang Feiteng, Bai Yuelin, Ni Shiwen, Yang Min, Chen Xiaojun, Xu Ruifeng
- Video-ccam: Enhancing Video-language Understanding With Causal Cross-attention Masks For Short And Long Videos Fei Jiajun, Li Dian, Deng Zhidong, Wang Zekun, Liu Gang, Wang Hui
- Citygpt: Empowering Urban Spatial Cognition Of Large Language Models Feng Jie, Du Yuwei, Liu Tianhui, Guo Siqi, Lin Yuming, Li Yong
- Large Language Model-based Human-agent Collaboration For Complex Task Solving Feng Xueyang, Chen Zhi-yuan, Qin Yujia, Lin Yankai, Chen Xu, Liu Zhiyuan, Wen Ji-rong
- Mt-ladder: A Model-agnostic Framework Boosting Llm-based Machine Translation To The Next Level Feng Zhaopeng, Zhang Yan, Chen Ruizhe, Meng Zijie, Liu Zuozhu
- Tear: Improving Llm-based Machine Translation With Systematic Self-refinement Feng Zhaopeng, Zhang Yan, Li Hao, Wu Bei, Liao Jiayu, Liu Wenqiang, Lang Jun, Feng Yang, Wu Jian, Liu Zuozhu
- Exaranker-open: Synthetic Explanation For IR Using Open-source Llms Ferraretto Fernando, Laitz Thiago, Lotufo Roberto, Nogueira Rodrigo
- RAG Foundry: A Framework For Enhancing Llms For Retrieval Augmented Generation Fleischer Daniel, Berchansky Moshe, Wasserblat Moshe, Izsak Peter
- CLIP With Generative Latent Replay: A Strong Baseline For Incremental Learning Frascaroli Emanuele, Panariello Aniello, Buzzega Pietro, Bonicelli Lorenzo, Porrello Angelo, Calderara Simone
- Autorag-hp: Automatic Online Hyper-parameter Tuning For Retrieval-augmented Generation Fu Jia, Qin Xiaoting, Yang Fangkai, Wang Lu, Zhang Jue, Lin Qingwei, Chen Yubo, Zhang Dongmei, Rajmohan Saravan, Zhang Qi
- Hint-before-solving Prompting: Guiding Llms To Effectively Utilize Encoded Knowledge Fu Jinlan, Huangfu Shenzhen, Yan Hang, Ng See-kiong, Qiu Xipeng
- A Role-specific Guided Large Language Model For Ophthalmic Consultation Based On Stylistic Differentiation Fu Laiyi, Fan Binbin, Du Hongkai, Feng Yanxiang, Li Chunhua, Song Huping
- Intelligence Analysis Of Language Models Galanti Liane, Baron Ethan
- LLM Detectors Still Fall Short Of Real World: Case Of Llm-generated Short News-like Posts Gameiro Henrique Da Silva, Kucharavy Andrei, Dolamic Ljiljana
- Worldgpt: Empowering LLM As Multimodal World Model Ge Zhiqi, Huang Hongzhe, Zhou Mingze, Li Juncheng, Wang Guoming, Tang Siliang, Zhuang Yueting
- Are We Done With MMLU? Gema Aryo Pradipta, Leang Joshua Ong Jun, Hong Giwon, Devoto Alessio, Mancino Alberto Carlo Maria, Saxena Rohit, He Xuanli, Zhao Yu, Du Xiaotang, Madani Mohammad Reza Ghasemi, Barale Claire, Mchardy Robert, Harris Joshua, Kaddour Jean, Van Krieken Emile, Minervini Pasquale
- Agentquest: A Modular Benchmark Framework To Measure Progress And Improve LLM Agents Gioacchini Luca, Siracusano Giuseppe, Sanvito Davide, Gashteovski Kiril, Friede David, Bifulco Roberto, Lawrence Carolin
- Blendsql: A Scalable Dialect For Unifying Hybrid Question Answering In Relational Algebra Glenn Parker, Dakle Parag Pravin, Wang Liang, Raghavan Preethi
- Chatglm: A Family Of Large Language Models From GLM-130B To GLM-4 All Tools Glm Team, Zeng Aohan, Xu Bin, Wang Bowen, Zhang Chenhui, Yin Da, Zhang Dan, Rojas Diego, Feng Guanyu, Zhao Hanlin, Lai Hanyu, Yu Hao, Wang Hongning, Sun Jiadai, Zhang Jiajie, Cheng Jiale, Gui Jiayi, Tang Jie, Zhang Jing, Sun Jingyu, Li Juanzi, Zhao Lei, Wu Lindong, Zhong Lucen, Liu Mingdao, Huang Minlie, Zhang Peng, Zheng Qinkai, Lu Rui, Duan Shuaiqi, Zhang Shudan, Cao Shulin, Yang Shuxun, Tam Weng Lam, Zhao Wenyi, Liu Xiao, Xia Xiao, Zhang Xiaohan, Gu Xiaotao, Lv Xin, Liu Xinghan, Liu Xinyi, Yang Xinyue, Song Xixuan, Zhang Xunkai, An Yifan, Xu Yifan, Niu Yilin, Yang Yuantao, Li Yueyan, Bai Yushi, Dong Yuxiao, Qi Zehan, Wang Zhaoyu, Yang Zhen, Du Zhengxiao, Hou Zhenyu, Wang Zihan
- Arcee's Mergekit: A Toolkit For Merging Large Language Models Goddard Charles, Siriwardhana Shamane, Ehghaghi Malikeh, Meyers Luke, Karpukhin Vlad, Benedict Brian, Mcquade Mark, Solawetz Jacob
- Omnifusion Technical Report Goncharova Elizaveta, Razzhigaev Anton, Mikhalchuk Matvey, Kurkin Maxim, Abdullaeva Irina, Skripkin Matvey, Oseledets Ivan, Dimitrov Denis, Kuznetsov Andrey
- AST-T5: Structure-aware Pretraining For Code Generation And Understanding Gong Linyuan, Elhoushi Mostafa, Cheung Alvin
- Evaluation Of Llms On Syntax-aware Code Fill-in-the-middle Tasks Gong Linyuan, Wang Sida, Elhoushi Mostafa, Cheung Alvin
- LLMC: Benchmarking Large Language Model Quantization With A Versatile Compression Toolkit Gong Ruihao, Yong Yang, Gu Shiqiao, Huang Yushi, Lv Chentao, Zhang Yunchen, Liu Xianglong, Tao Dacheng
- Multi-frame, Lightweight & Efficient Vision-language Models For Question Answering In Autonomous Driving Gopalkrishnan Akshay, Greer Ross, Trivedi Mohan
- Teams-rl: Teaching Llms To Generate Better Instruction Datasets Via Reinforcement Learning Gu Shangding, Knoll Alois, Jin Ming
- RWKV-CLIP: A Robust Vision-language Representation Learner Gu Tiancheng, Yang Kaicheng, An Xiang, Feng Ziyong, Liu Dongnan, Cai Weidong, Deng Jiankang
- Structext-eval: An Autogenerated Benchmark For Evaluating Large Language Model's Ability In Structure-rich Text Understanding Gu Zhouhong, Ye Haoning, Zhou Zeyang, Feng Hongwei, Xiao Yanghua
- Sample Design Engineering: An Empirical Study Of What Makes Good Downstream Fine-tuning Samples For Llms Guo Biyang, Wang He, Xiao Wenyilin, Chen Hong, Lee Zhuxin, Han Songqiao, Huang Hailiang
- Ds-agent: Automated Data Science By Empowering Large Language Models With Case-based Reasoning Guo Siyuan, Deng Cheng, Wen Ying, Chen Hechang, Chang Yi, Wang Jun
- API Pack: A Massive Multi-programming Language Dataset For API Call Generation Guo Zhen, Soria Adriana Meza, Sun Wei, Shen Yikang, Panda Rameswar
- AAVENUE: Detecting LLM Biases On NLU Tasks In AAVE Via A Novel Benchmark Gupta Abhay, Meng Philip, Yurtseven Ece, O'brien Sean, Zhu Kevin
- Walledeval: A Comprehensive Safety Evaluation Toolkit For Large Language Models Gupta Prannaya, Yau Le Qi, Low Hao Han, Lee I-shiang, Lim Hugo Maximus, Teoh Yu Xin, Koh Jia Hng, Liew Dar Win, Bhardwaj Rishabh, Bhardwaj Rajat, Poria Soujanya
- Hipporag: Neurobiologically Inspired Long-term Memory For Large Language Models Gutiérrez Bernal Jiménez, Shu Yiheng, Gu Yu, Yasunaga Michihiro, Su Yu
- On The Impacts Of Contexts On Repository-level Code Generation Hai Nam Le, Nguyen Dung Manh, Bui Nghi D. Q.
- Synthclip: Are We Ready For A Fully Synthetic CLIP Training? Hammoud Hasan Abed Al Kader, Itani Hani, Pizzati Fabio, Torr Philip, Bibi Adel, Ghanem Bernard
- Astprompter: Weakly Supervised Automated Language Model Red-teaming To Identify Likely Toxic Prompts Hardy Amelia F., Liu Houjun, Lange Bernard, Kochenderfer Mykel J.
- The Unreasonable Effectiveness Of Easy Training Data For Hard Tasks Hase Peter, Bansal Mohit, Clark Peter, Wiegreffe Sarah
- Zero-shot Generalization During Instruction Tuning: Insights From Similarity And Granularity He Bingxiang, Ding Ning, Qian Cheng, Deng Jia, Cui Ganqu, Yuan Lifan, Gao Huan-ang, Chen Huimin, Liu Zhiyuan, Sun Maosong
- MA-LMM: Memory-augmented Large Multimodal Model For Long-term Video Understanding He Bo, Li Hengduo, Jang Young Kyun, Jia Menglin, Cao Xuefei, Shah Ashish, Shrivastava Abhinav, Lim Ser-nam
- Olympiadbench: A Challenging Benchmark For Promoting AGI With Olympiad-level Bilingual Multimodal Scientific Problems He Chaoqun, Luo Renjie, Bai Yuzhuo, Hu Shengding, Thai Zhen Leng, Shen Junhao, Hu Jinyi, Han Xu, Huang Yujie, Zhang Yuxiang, Liu Jie, Qi Lei, Liu Zhiyuan, Sun Maosong
- Think-program-rectify: 3D Situated Reasoning With Large Language Models He Qingrong, Lin Kejun, Chen Shizhe, Hu Anwen, Jin Qin
- Pitvqa: Image-grounded Text Embedding LLM For Visual Question Answering In Pituitary Surgery He Runlong, Xu Mengya, Das Adrito, Khan Danyal Z., Bano Sophia, Marcus Hani J., Stoyanov Danail, Clarkson Matthew J., Islam Mobarakol
- What Matters In Transformers? Not All Attention Is Needed He Shwai, Sun Guoheng, Shen Zheyu, Li Ang
- G-retriever: Retrieval-augmented Generation For Textual Graph Understanding And Question Answering He Xiaoxin, Tian Yijun, Sun Yifei, Chawla Nitesh V., Laurent Thomas, Lecun Yann, Bresson Xavier, Hooi Bryan
- Llms Meet Multimodal Generation And Editing: A Survey He Yingqing, Liu Zhaoyang, Chen Jingye, Tian Zeyue, Liu Hongyu, Chi Xiaowei, Liu Runtao, Yuan Ruibin, Xing Yazhou, Wang Wenhai, Dai Jifeng, Zhang Yong, Xue Wei, Liu Qifeng, Guo Yike, Chen Qifeng
- HMT: Hierarchical Memory Transformer For Long Context Language Processing He Zifan, Qin Zongyue, Prakriya Neha, Sun Yizhou, Cong Jason
- Game Development As Human-llm Interaction Hong Jiale, Wu Hongqiu, Zhao Hai
- Evaluating Llms' Mathematical And Coding Competency Through Ontology-guided Interventions Hong Pengfei, Majumder Navonil, Ghosal Deepanway, Aditya Somak, Mihalcea Rada, Poria Soujanya
- Curiosity-driven Red-teaming For Large Language Models Hong Zhang-wei, Shenfeld Idan, Wang Tsun-hsuan, Chuang Yung-sung, Pareja Aldo, Glass James, Srivastava Akash, Agrawal Pulkit
- Securing Vision-language Models With A Robust Encoder Against Jailbreak And Adversarial Attacks Hossain Md Zarif, Imteaj Ahmed
- Bridging Language And Items For Retrieval And Recommendation Hou Yupeng, Li Jiacheng, He Zhankui, Yan An, Chen Xiusi, Mcauley Julian
- Outlier-efficient Hopfield Layers For Large Transformer-based Models Hu Jerry Yao-chieh, Chang Pei-hsuan, Luo Robin, Chen Hong-yu, Li Weijian, Wang Wei-po, Liu Han
- Openrlhf: An Easy-to-use, Scalable And High-performance RLHF Framework Hu Jian, Wu Xibin, Wang Weixun, Xianyu, Zhang Dehao, Cao Yu
- Hiagent: Hierarchical Working Memory Management For Solving Long-horizon Agent Tasks With Large Language Model Hu Mengkang, Chen Tianxing, Chen Qiguang, Mu Yao, Shao Wenqi, Luo Ping
- Limited Out-of-context Knowledge Reasoning In Large Language Models Hu Peng, Gao Changjiang, Gao Ruiqi, Chen Jiajun, Huang Shujian
- Minicpm: Unveiling The Potential Of Small Language Models With Scalable Training Strategies Hu Shengding, Tu Yuge, Han Xu, He Chaoqun, Cui Ganqu, Long Xiang, Zheng Zhi, Fang Yewei, Huang Yuxiang, Zhao Weilin, Zhang Xinrong, Thai Zheng Leng, Zhang Kaihuo, Wang Chongyi, Yao Yuan, Zhao Chenyang, Zhou Jie, Cai Jie, Zhai Zhongwu, Ding Ning, Jia Chao, Zeng Guoyang, Li Dahai, Liu Zhiyuan, Sun Maosong
- A Survey On Large Language Model-based Game Agents Hu Sihao, Huang Tiansheng, Ilhan Fatih, Tekin Selim, Liu Gaowen, Kompella Ramana, Liu Ling
- Refchecker: Reference-based Fine-grained Hallucination Checker And Benchmark For Large Language Models Hu Xiangkun, Ru Dongyu, Qiu Lin, Guo Qipeng, Zhang Tianhang, Xu Yang, Luo Yun, Liu Pengfei, Zhang Yue, Zhang Zheng
- RAG And RAU: A Survey On Retrieval-augmented Language Model In Natural Language Processing Hu Yucheng, Lu Yuxing
- Longrecipe: Recipe For Efficient Long Context Generalization In Large Language Models Hu Zhiyuan, Liu Yuliang, Zhao Jinman, Wang Suyuchen, Wang Yan, Shen Wei, Gu Qing, Luu Anh Tuan, Ng See-kiong, Jiang Zhiwei, Hooi Bryan
- Disentangling Logic: The Role Of Context In Large Language Model Reasoning Capabilities Hua Wenyue, Zhu Kaijie, Li Lingyao, Fan Lizhou, Lin Shuhang, Jin Mingyu, Xue Haochen, Li Zelong, Wang Jindong, Zhang Yongfeng
- Talk Less, Interact Better: Evaluating In-context Conversational Adaptation In Multimodal Llms Hua Yilun, Artzi Yoav
- Instupr: Instruction-based Unsupervised Passage Reranking With Large Language Models Huang Chao-wei, Chen Yun-nung
- Recurrent Context Compression: Efficiently Expanding The Context Window Of LLM Huang Chensen, Zhu Guibo, Wang Xuepeng, Luo Yifei, Ge Guojing, Chen Haoran, Yi Dong, Wang Jinqiao
- LITA: Language Instructed Temporal-localization Assistant Huang De-an, Liao Shijia, Radhakrishnan Subhashree, Yin Hongxu, Molchanov Pavlo, Yu Zhiding, Kautz Jan
- Reason3d: Searching And Reasoning 3D Segmentation Via Large Language Model Huang Kuan-chih, Li Xiangtai, Qi Lu, Yan Shuicheng, Yang Ming-hsuan
- Integrating Multi-scale Contextualized Information For Byte-based Neural Machine Translation Huang Langlin, Feng Yang
- Selective Prompting Tuning For Personalized Conversations With Llms Huang Qiushi, Liu Xubo, Ko Tom, Wu Bo, Wang Wenwu, Zhang Yu, Tang Lilian
- The N+ Implementation Details Of RLHF With PPO: A Case Study On TL;DR Summarization Huang Shengyi, Noukhovitch Michael, Hosseini Arian, Rasul Kashif, Wang Weixun, Tunstall Lewis
- Vaccine: Perturbation-aware Alignment For Large Language Models Against Harmful Fine-tuning Huang Tiansheng, Hu Sihao, Liu Ling
- Billm: Pushing The Limit Of Post-training Quantization For Llms Huang Wei, Liu Yangdong, Qin Haotong, Li Ying, Zhang Shiming, Liu Xianglong, Magno Michele, Qi Xiaojuan
- Visual Hallucinations Of Multi-modal Large Language Models Huang Wen, Liu Hongbin, Guo Minxin, Gong Neil Zhenqiang
- Rolora: Fine-tuning Rotated Outlier-free Llms For Effective Weight-activation Quantization Huang Xijie, Liu Zechun, Liu Shih-yang, Cheng Kwang-ting
- Fastfid: Improve Inference Efficiency Of Open Domain Question Answering Via Sentence Selection Huang Yufei, Han Xu, Sun Maosong
- Ecr-chain: Advancing Generative Language Models To Better Emotion-cause Reasoners Through Reasoning Chains Huang Zhaopei, Zhao Jinming, Jin Qin
- Enhancing The Capability And Robustness Of Large Language Models Through Reinforcement Learning-driven Query Refinement Huang Zisu, Wang Xiaohua, Zhang Feiran, Xu Zhibo, Zhang Cenyuan, Zheng Xiaoqing, Huang Xuanjing
- Romansetu: Efficiently Unlocking Multilingual Capabilities Of Large Language Models Via Romanization Husain Jaavid Aktar, Dabre Raj, Kumar Aswanth, Gala Jay, Jayakumar Thanmay, Puduppully Ratish, Kunchukuttan Anoop
- Self-explore To Avoid The Pit: Improving The Reasoning Capabilities Of Language Models With Fine-grained Rewards Hwang Hyeonbin, Kim Doyoung, Kim Seungone, Ye Seonghyeon, Seo Minjoon
- Test-time Low Rank Adaptation Via Confidence Maximization For Zero-shot Generalization Of Vision-language Models Imam Raza, Gani Hanan, Huzaifa Muhammad, Nandakumar Karthik
- Grade Score: Quantifying LLM Performance In Option Selection Iourovitski Dmitri
- Mapcoder: Multi-agent Code Generation For Competitive Problem Solving Islam Md. Ashraful, Ali Mohammed Eunus, Parvez Md Rizwan
- Meddoc-bot: A Chat Tool For Comparative Analysis Of Large Language Models In The Context Of The Pediatric Hypertension Guideline Jabarulla Mohamed Yaseen, Oeltze-jafra Steffen, Beerbaum Philipp, Uden Theodor
- OLAPH: Improving Factuality In Biomedical Long-form Question Answering Jeong Minbyul, Hwang Hyeon, Yoon Chanwoong, Lee Taewhoo, Kang Jaewoo
- Defending Large Language Models Against Jailbreak Attacks Via Semantic Smoothing Ji Jiabao, Hou Bairu, Robey Alexander, Pappas George J., Hassani Hamed, Zhang Yang, Wong Eric, Chang Shiyu
- Can't Say Cant? Measuring And Reasoning Of Dark Jargons In Large Language Models Ji Xu, Zhang Jianyi, Zhou Ziyin, Zhao Zhangchi, Qiao Qianqian, Han Kaiying, Hossen Md Imran, Hei Xiali
- SOUL: Unlocking The Power Of Second-order Optimization For LLM Unlearning Jia Jinghan, Zhang Yihua, Zhang Yimeng, Liu Jiancheng, Runwal Bharat, Diffenderfer James, Kailkhura Bhavya, Liu Sijia
- Minference 1.0: Accelerating Pre-filling For Long-context Llms Via Dynamic Sparse Attention Jiang Huiqiang, Li Yucheng, Zhang Chengruidong, Wu Qianhui, Luo Xufang, Ahn Surin, Han Zhenhua, Abdi Amir H., Li Dongsheng, Lin Chin-yew, Yang Yuqing, Qiu Lili
- MC²: Multi-concept Guidance For Customized Multi-concept Generation Jiang Jiaxiu, Zhang Yabo, Feng Kailai, Wu Xiaohe, Zuo Wangmeng
- Sketch: A Toolkit For Streamlining LLM Operations Jiang Xin, Li Xiang, Ma Wenjia, Fang Xuezhi, Yao Yiqun, Yu Naitong, Meng Xuying, Han Peng, Li Jing, Sun Aixin, Wang Yequan
- Prior Knowledge Integration Via LLM Encoding And Pseudo Event Regulation For Video Moment Retrieval Jiang Yiyang, Zhang Wengyu, Zhang Xulu, Wei Xiaoyong, Chen Chang Wen, Li Qing
- Learning To Edit: Aligning Llms With Knowledge Editing Jiang Yuxin, Wang Yufei, Wu Chuhan, Zhong Wanjun, Zeng Xingshan, Gao Jiahui, Li Liangyou, Jiang Xin, Shang Lifeng, Tang Ruiming, Liu Qun, Wang Wei
- Graph Chain-of-thought: Augmenting Large Language Models By Reasoning On Graphs Jin Bowen, Xie Chulin, Zhang Jiawei, Roy Kashob Kumar, Zhang Yu, Li Zheng, Li Ruirui, Tang Xianfeng, Wang Suhang, Meng Yu, Han Jiawei
- APEER: Automatic Prompt Engineering Enhances Large Language Model Reranking Jin Can, Peng Hongwu, Zhao Shiyu, Wang Zhenting, Xu Wujiang, Han Ligong, Zhao Jiahui, Zhong Kai, Rajasekaran Sanguthevar, Metaxas Dimitris N.
- LLM Maybe Longlm: Self-extend LLM Context Window Without Tuning Jin Hongye, Han Xiaotian, Yang Jingfeng, Jiang Zhimeng, Liu Zirui, Chang Chia-yuan, Chen Huiyuan, Hu Xia
- Exploring Concept Depth: How Large Language Models Acquire Knowledge At Different Layers? Jin Mingyu, Yu Qinkai, Huang Jingyuan, Zeng Qingcheng, Wang Zhenting, Hua Wenyue, Zhao Haiyan, Mei Kai, Meng Yanda, Ding Kaize, Yang Fan, Du Mengnan, Zhang Yongfeng
- The Impact Of Reasoning Step Length On Large Language Models Jin Mingyu, Yu Qinkai, Shu Dong, Zhao Haiyan, Hua Wenyue, Meng Yanda, Zhang Yongfeng, Du Mengnan
- Video-lavit: Unified Video-language Pre-training With Decoupled Visual-motional Tokenization Jin Yang, Sun Zhicheng, Xu Kun, Xu Kun, Chen Liwei, Jiang Hao, Huang Quzhe, Song Chengru, Liu Yuliang, Zhang Di, Song Yang, Gai Kun, Mu Yadong
- How Large Language Models Encode Context Knowledge? A Layer-wise Probing Study Ju Tianjie, Sun Weiwei, Du Wei, Yuan Xinwei, Ren Zhaochun, Liu Gongshen
- LM²: A Simple Society Of Language Models Solves Complex Reasoning Juneja Gurusha, Dutta Subhabrata, Chakraborty Tanmoy
- Beyond Labels: Aligning Large Language Models With Human-like Reasoning Kabir Muhammad Rafsan, Sultan Rafeed Mohammad, Asif Ihsanul Haque, Ahad Jawad Ibn, Rahman Fuad, Amin Mohammad Ruhul, Mohammed Nabeel, Rahman Shafin
- Language Repository For Long Video Understanding Kahatapitiya Kumara, Ranasinghe Kanchana, Park Jongwoo, Ryoo Michael S.
- Counterfactuals As A Means For Evaluating Faithfulness Of Attribution Methods In Autoregressive Language Models Kamahi Sepehr, Yaghoobzadeh Yadollah
- Fiddler: CPU-GPU Orchestration For Fast Inference Of Mixture-of-experts Models Kamahori Keisuke, Gu Yile, Zhu Kan, Kasikci Baris
- Eagle: Ethical Dataset Given From Real Interactions Kaneko Masahiro, Bollegala Danushka, Baldwin Timothy
- Instruct, Not Assist: Llm-based Multi-turn Planning And Hierarchical Questioning For Socratic Code Debugging Kargupta Priyanka, Agarwal Ishika, Hakkani-tur Dilek, Han Jiawei
- Alpaca Against Vicuna: Using Llms To Uncover Memorization Of Llms Kassem Aly M., Mahmoud Omar, Mireshghallah Niloofar, Kim Hyunwoo, Tsvetkov Yulia, Choi Yejin, Saad Sherif, Rana Santu
- Spectra: A Comprehensive Study Of Ternary, Quantized, And FP16 Language Models Kaushal Ayush, Pandey Tejas, Vaidhya Tejas, Bhagat Aaryan, Rish Irina
- Remi: A Dataset For Reasoning With Multiple Images Kazemi Mehran, Dikkala Nishanth, Anand Ankit, Devic Petar, Dasgupta Ishita, Liu Fangyu, Fatemi Bahare, Awasthi Pranjal, Guo Dee, Gollapudi Sreenivas, Qureshi Ahmed
- Source-aware Training Enables Knowledge Attribution In Language Models Khalifa Muhammad, Wadden David, Strubell Emma, Lee Honglak, Wang Lu, Beltagy Iz, Peng Hao
- ARGS: Alignment As Reward-guided Search Khanov Maxim, Burapacheep Jirayu, Li Yixuan
- How Good Is My Video LMM? Complex Video Reasoning And Robustness Evaluation Suite For Video-lmms Khattak Muhammad Uzair, Naeem Muhammad Ferjad, Hassan Jameel, Naseer Muzammal, Tombari Federico, Khan Fahad Shahbaz, Khan Salman
- Learning To Prompt With Text Only Supervision For Vision-language Models Khattak Muhammad Uzair, Naeem Muhammad Ferjad, Naseer Muzammal, Van Gool Luc, Tombari Federico
- Assessing Language Models' Worldview For Fiction Generation Khatun Aisha, Brown Daniel G.
- Efficient Medical Question Answering With Knowledge-augmented Question Generation Khlaut Julien, Dancette Corentin, Ferreres Elodie, Bennani Alaedine, Hérent Paul, Manceron Pierre
- Shortened Llama: Depth Pruning For Large Language Models With Comparison Of Retraining Methods Kim Bo-kyeong, Kim Geonmin, Kim Tae-ho, Castells Thibault, Choi Shinkook, Shin Junho, Song Hyoung-kyu
- Dialsim: A Real-time Simulator For Evaluating Long-term Dialogue Understanding Of Conversational Agents Kim Jiho, Chay Woosog, Hwang Hyeonji, Kyung Daeun, Chung Hyunseung, Cho Eunbyeol, Jo Yohan, Choi Edward
- Exploring Prompting Methods For Mitigating Class Imbalance Through Synthetic Data Generation With Large Language Models Kim Jinhee, Kim Taesung, Choo Jaegul
- Enhancing Psychotherapy Counseling: A Data Augmentation Pipeline Leveraging Large Language Models For Counseling Conversations Kim Jun-woo, Han Ji-eun, Koh Jun-seok, Seo Hyeon-tae, Chang Du-seong
- Large Language Models Meet Collaborative Filtering: An Efficient All-round Llm-based Recommender System Kim Sein, Kang Hongseok, Choi Seungyoon, Kim Donghyun, Yang Minchul, Park Chanyoung
- Prometheus 2: An Open Source Language Model Specialized In Evaluating Other Language Models Kim Seungone, Suk Juyoung, Longpre Shayne, Lin Bill Yuchen, Shin Jamin, Welleck Sean, Neubig Graham, Lee Moontae, Lee Kyungjae, Seo Minjoon
- Rethinking The Role Of Proxy Rewards In Language Model Alignment Kim Sungdong, Seo Minjoon
- Learning To Explore And Select For Coverage-conditioned Retrieval-augmented Generation Kim Takyoung, Lee Kyungjae, Jang Young Rok, Cho Ji Yong, Kim Gangwoo, Cho Minseok, Lee Moontae
- Tablevqa-bench: A Visual Question Answering Benchmark On Multiple Table Domains Kim Yoonsik, Yim Moonbin, Song Ka Yeon
- Self-prompt Tuning: Enable Autonomous Role-playing In Llms Kong Aobo, Zhao Shiwan, Chen Hao, Li Qicheng, Qin Yong, Sun Ruiqi, Zhou Xin, Zhou Jiaming, Sun Haoqin
- Labeling Supervised Fine-tuning Data With The Scaling Law Kong Huanjun
- Audio Flamingo: A Novel Audio Language Model With Few-shot Learning And Dialogue Abilities Kong Zhifeng, Goel Arushi, Badlani Rohan, Ping Wei, Valle Rafael, Catanzaro Bryan
- Studying Large Language Model Behaviors Under Realistic Knowledge Conflicts Kortukov Evgenii, Rubinstein Alexander, Nguyen Elisa, Oh Seong Joon
- Synthdst: Synthetic Data Is All You Need For Few-shot Dialog State Tracking Kulkarni Atharva, Tseng Bo-hsiang, Moniz Joel Ruben Antony, Piraviperumal Dhivya, Yu Hong, Bhargava Shruti
- Multi-modal Generation Via Cross-modal In-context Learning Kumar Amandeep, Naseer Muzammal, Narayan Sanath, Anwer Rao Muhammad, Khan Salman, Cholakkal Hisham
- RES-Q: Evaluating Code-editing Large Language Model Systems At The Repository Scale Labash Beck, Rosedale August, Reents Alex, Negritto Lucas, Wiel Colin
- Autowebglm: Bootstrap And Reinforce A Large Language Model-based Web Navigating Agent Lai Hanyu, Liu Xiao, Iong Iat Long, Yao Shuntian, Chen Yuxuan, Shen Pengbo, Yu Hao, Zhang Hanchen, Zhang Xiaohan, Dong Yuxiao, Tang Jie
- Alarm: Align Language Models Via Hierarchical Rewards Modeling Lai Yuhang, Wang Siyuan, Liu Shujun, Huang Xuanjing, Wei Zhongyu
- From Distributional To Overton Pluralism: Investigating Large Language Model Alignment Lake Thom, Choi Eunsol, Durrett Greg
- Fine-tuning Language Models With Reward Learning On Policy Lang Hao, Huang Fei, Li Yongbin
- Sentiment Reasoning For Healthcare Le-duc Khai, Nguyen Khai-nguyen, Tat Bach Phan, Le Duy, Ngo Jerry, Vo-dang Long, Nguyen Anh Totti, Hy Truong-son
- Nv-embed: Improved Techniques For Training Llms As Generalist Embedding Models Lee Chankyu, Roy Rajarshi, Xu Mengyao, Raiman Jonathan, Shoeybi Mohammad, Catanzaro Bryan, Ping Wei
- LRQ: Optimizing Post-training Quantization For Large Language Models By Learning Low-rank Weight-scaling Matrices Lee Jung Hyun, Kim Jeonghoon, Yang June Yong, Kwon Se Jung, Yang Eunho, Yoo Kang Min, Lee Dongsoo
- LLM2LLM: Boosting Llms With Novel Iterative Data Enhancement Lee Nicholas, Wattanawong Thanakul, Kim Sehoon, Mangalam Karttikeya, Shen Sheng, Anumanchipalli Gopala, Mahoney Michael W., Keutzer Kurt, Gholami Amir
- Interactive Text-to-image Retrieval With Large Language Models: A Plug-and-play Approach Lee Saehyung, Yu Sangwon, Park Junsung, Yi Jihun, Yoon Sungroh
- Aligning To Thousands Of Preferences Via System Message Generalization Lee Seongyun, Park Sue Hyun, Kim Seungone, Seo Minjoon
- Autocoder: Enhancing Code Large Language Model With AIEV-Instruct Lei Bin, Li Yuchen, Chen Qiuwu
- Small-e: Small Language Model With Linear Attention For Efficient Speech Synthesis Lemerle Théodor, Obin Nicolas, Roebel Axel
- Intent-based Prompt Calibration: Enhancing Prompt Optimization With Synthetic Boundary Cases Levi Elad, Brosh Eli, Friedmann Matan
- Humaneval On Latest GPT Models -- 2024 Li Daniel, Murr Lincoln
- Contextualization Distillation From Large Language Model For Knowledge Graph Completion Li Dawei, Tan Zhen, Chen Tianlong, Liu Huan
- Hello Again! Llm-powered Personalized Agent For Long-term Dialogue Li Hao, Yang Chenghao, Zhang An, Deng Yang, Wang Xiang, Chua Tat-seng
- Quickllama: Query-aware Inference Acceleration For Large Language Models Li Jingyao, Shi Han, Jiang Xin, Li Zhenguo, Xu Hong, Jia Jiaya
- Fmm-attack: A Flow-based Multi-modal Adversarial Attack On Video-based Llms Li Jinmin, Gao Kuofeng, Bai Yang, Zhang Jingyun, Xia Shu-tao, Wang Yisen
- Dissecting Human And LLM Preferences Li Junlong, Zhou Fan, Sun Shichao, Zhang Yikai, Zhao Hai, Liu Pengfei
- The Dawn After The Dark: An Empirical Study On Factuality Hallucination In Large Language Models Li Junyi, Chen Jie, Ren Ruiyang, Cheng Xiaoxue, Zhao Wayne Xin, Nie Jian-yun, Wen Ji-rong
- Confidence Matters: Revisiting Intrinsic Self-correction Capabilities Of Large Language Models Li Loka, Chen Zhenhao, Chen Guangyi, Zhang Yixuan, Su Yusheng, Xing Eric, Zhang Kun
- Needlebench: Can Llms Do Retrieval And Reasoning In 1 Million Context Window? Li Mo, Zhang Songyang, Liu Yunxin, Chen Kai
- Anchored Answers: Unravelling Positional Bias In Gpt-2's Multiple-choice Questions Li Ruizhe, Gao Yanjun
- Stbench: Assessing The Ability Of Large Language Models In Spatio-temporal Analysis Li Wenbin, Yao Di, Zhao Ruibo, Chen Wenjie, Xu Zijie, Luo Chengxue, Gong Chang, Jing Quanliang, Tan Haining, Bi Jingping
- Reinforcement Learning With Token-level Feedback For Controllable Text Generation Li Wendi, Wei Wei, Xu Kaihe, Xie Wenfeng, Chen Dangyang, Cheng Yu
- Tokenpacker: Efficient Visual Projector For Multimodal LLM Li Wentong, Yuan Yuqian, Liu Jian, Tang Dongqi, Wang Song, Qin Jie, Zhu Jianke, Zhang Lei
- MM-TTS: A Unified Framework For Multimodal, Prompt-induced Emotional Text-to-speech Synthesis Li Xiang, Cheng Zhi-qi, He Jun-yan, Peng Xiaojiang, Hauptmann Alexander G.
- Llara: Supercharging Robot Learning Data For Vision-language Policy Li Xiang, Mata Cristina, Park Jongwoo, Kahatapitiya Kumara, Jang Yoo Sung, Shang Jinghuan, Ranasinghe Kanchana, Burgert Ryan, Cai Mu, Lee Yong Jae, Ryoo Michael S.
- Evaluating Mathematical Reasoning Of Large Language Models: A Focus On Error Identification And Correction Li Xiaoyuan, Wang Wenjie, Li Moxin, Guo Junrong, Zhang Yang, Feng Fuli
- Say More With Less: Understanding Prompt Learning Behaviors Through Gist Compression Li Xinze, Liu Zhenghao, Xiong Chenyan, Yu Shi, Yan Yukun, Wang Shuo, Yu Ge
- When Hindsight Is Not 20/20: Testing Limits On Reflective Thinking In Large Language Models Li Yanhong, Yang Chenghao, Ettinger Allyson
- Mini-gemini: Mining The Potential Of Multi-modality Vision Language Models Li Yanwei, Zhang Yuechen, Wang Chengyao, Zhong Zhisheng, Chen Yixin, Chu Ruihang, Liu Shaoteng, Jia Jiaya
- Backdoorllm: A Comprehensive Benchmark For Backdoor Attacks On Large Language Models Li Yige, Huang Hanxun, Zhao Yunhan, Ma Xingjun, Sun Jun
- The First Place Solution Of WSDM Cup 2024: Leveraging Large Language Models For Conversational Multi-doc QA Li Yiming, Zhang Zhao
- Cognitive Visual-language Mapper: Advancing Multimodal Comprehension With Enhanced Visual Knowledge Alignment Li Yunxin, Chen Xinyu, Hu Baotian, Shi Haoyuan, Zhang Min
- Uni-moe: Scaling Unified Multimodal Llms With Mixture Of Experts Li Yunxin, Jiang Shenyuan, Hu Baotian, Wang Longyue, Zhong Wanqi, Luo Wenhan, Ma Lin, Zhang Min
- ISQA: Informative Factuality Feedback For Scientific Summarization Li Zekai, Qin Yanxia, Liu Qian, Kan Min-yen
- Formal-llm: Integrating Formal Language And Natural Language For Controllable Llm-based Agents Li Zelong, Hua Wenyue, Wang Hao, Zhu He, Zhang Yongfeng
- Focusllm: Scaling Llm's Context By Parallel Decoding Li Zhenyu, Zhang Yike, Pan Tengyu, Sun Yutao, Duan Zhichao, Fang Junjie, Han Rong, Wang Zixuan, Wang Jianyong
- Implicit In-context Learning Li Zhuowei, Xu Zihao, Han Ligong, Gao Yunhe, Wen Song, Liu Di, Wang Hao, Metaxas Dimitris N.
- Two Stacks Are Better Than One: A Comparison Of Language Modeling And Translation As Multilingual Pretraining Objectives Li Zihao, Ji Shaoxiong, Mickus Timothee, Segonne Vincent, Tiedemann Jörg
- Recai: Leveraging Large Language Models For Next-generation Recommender Systems Lian Jianxun, Lei Yuxuan, Huang Xu, Yao Jing, Xu Wei, Xie Xing
- What Is The Best Model? Application-driven Evaluation For Large Language Models Lian Shiguo, Zhao Kaikai, Liu Xinhui, Lei Xuejiao, Yang Bikun, Zhang Wenjing, Wang Kai, Liu Zhaoxiang
- Fennec: Fine-grained Language Model Evaluation And Correction Extended Through Branching And Bridging Liang Xiaobo, Zhang Haoke, Hu Helan, Li Juntao, Xu Jun, Zhang Min
- Empowering Large Language Models To Set Up A Knowledge Retrieval Indexer Via Self-learning Liang Xun, Niu Simin, Li Zhiyu, Zhang Sensen, Song Shichao, Wang Hanyu, Yang Jiawei, Xiong Feiyu, Tang Bo, Xi Chenyang
- Internal Consistency And Self-feedback In Large Language Models: A Survey Liang Xun, Song Shichao, Zheng Zifan, Wang Hanyu, Yu Qingchen, Li Xunkai, Li Rong-hua, Cheng Peng, Wang Zhonghao, Xiong Feiyu, Li Zhiyu
- Controllable Text Generation For Large Language Models: A Survey Liang Xun, Wang Hanyu, Wang Yezhaohui, Song Shichao, Yang Jiawei, Niu Simin, Hu Jie, Liu Dan, Yao Shunyu, Xiong Feiyu, Li Zhiyu
- I-SHEEP: Self-alignment Of LLM From Scratch Through An Iterative Self-enhancement Paradigm Liang Yiming, Zhang Ge, Qu Xingwei, Zheng Tianyu, Guo Jiawei, Du Xinrun, Yang Zhenzhu, Liu Jiaheng, Lin Chenghua, Ma Lei, Huang Wenhao, Zhang Jiajun
- Why Are My Prompts Leaked? Unraveling Prompt Extraction Threats In Customized Large Language Models Liang Zi, Hu Haibo, Ye Qingqing, Xiao Yaxin, Li Haoyang
- Imagination Augmented Generation: Learning To Imagine Richer Context For Question Answering Over Large Language Models Liao Huanxuan, He Shizhu, Xu Yao, Zhang Yuanzhe, Liu Kang, Liu Shengping, Zhao Jun
- Enhancing Reinforcement Learning With Label-sensitive Reward For Natural Language Understanding Liao Kuo, Li Shuang, Zhao Meng, Liu Liqun, Xue Mengge, Hu Zhenyu, Han Honglin, Yin Chengguo
- MARIO: Math Reasoning With Code Interpreter Output -- A Reproducible Pipeline Liao Minpeng, Luo Wei, Li Chengxi, Wu Jing, Fan Kai
- D2LLM: Decomposed And Distilled Large Language Models For Semantic Search Liao Zihan, Yu Hang, Li Jianguo, Wang Jun, Zhang Wei
- Graph-enhanced Large Language Models In Asynchronous Plan Reasoning Lin Fangru, La Malfa Emanuele, Hofmann Valentin, Yang Elle Michelle, Cohn Anthony, Pierrehumbert Janet B.
- Emojicrypt: Prompt Encryption For Secure Communication With Large Language Models Lin Guo, Hua Wenyue, Zhang Yongfeng
- Syncpkl: Harnessing Llms To Generate Synthetic Data For Commonsense Persona Knowledge Linking Lin Kuan-yen
- Single Character Perturbations Break LLM Alignment Lin Leon, Brown Hannah, Kawaguchi Kenji, Shieh Michael
- Qserve: W4A8KV4 Quantization And System Co-design For Efficient LLM Serving Lin Yujun, Tang Haotian, Yang Shang, Zhang Zhekai, Xiao Guangxuan, Gan Chuang, Han Song
- Boosting Multimodal Large Language Models With Visual Tokens Withdrawal For Rapid Inference Lin Zhihang, Lin Mingbao, Lin Luxi, Ji Rongrong
- Prompt Compression With Context-aware Sentence Encoding For Fast And Improved LLM Inference Liskavets Barys, Ushakov Maxim, Roy Shuvendu, Klibanov Mark, Etemad Ali, Luke Shane
- Llmembed: Rethinking Lightweight Llm's Genuine Function In Text Classification Liu Chun, Zhang Hongguang, Zhao Kainan, Ju Xinghai, Yang Lin
- SPHINX-X: Scaling Data And Parameters For A Family Of Multi-modal Large Language Models Liu Dongyang, Zhang Renrui, Qiu Longtian, Huang Siyuan, Lin Weifeng, Zhao Shitian, Geng Shijie, Lin Ziyi, Jin Peng, Zhang Kaipeng, Shao Wenqi, Xu Chao, He Conghui, He Junjun, Shao Hao, Lu Pan, Li Hongsheng, Qiao Yu, Gao Peng
- Self-reflection Outcome Is Sensitive To Prompt Construction Liu Fengyuan, Aldahoul Nouar, Eady Gregory, Zaki Yasir, Alshebli Bedoor, Rahwan Talal
- Pefomed: Parameter Efficient Fine-tuning Of Multimodal Large Language Models For Medical Imaging Liu Gang, He Jinlong, Li Pengfei, He Genrong, Chen Zhaolin, Zhong Shenjun
- Augmenting Math Word Problems Via Iterative Question Composing Liu Haoxiong, Zhang Yifan, Luo Yifan, Yao Andrew Chi-chih
- Se²: Sequential Example Selection For In-context Learning Liu Haoyu, Liu Jianfeng, Huang Shaohan, Zhan Yuefeng, Sun Hao, Deng Weiwei, Wei Furu, Zhang Qi
- Laida: Linguistics-aware In-context Learning With Data Augmentation For Metaphor Components Identification Liu Hongde, He Chenyuan, Meng Feiyang, Niu Changyong, Jia Yuxiang
- Mm-instruct: Generated Visual Instructions For Large Multimodal Model Alignment Liu Jihao, Huang Xin, Zheng Jinliang, Liu Boxiao, Wang Jia, Yoshie Osamu, Liu Yu, Li Hongsheng
- Large Language Model-based Agents For Software Engineering: A Survey Liu Junwei, Wang Kaixin, Chen Yixuan, Peng Xin, Chen Zhenpeng, Zhang Lingming, Lou Yiling
- Selectit: Selective Instruction Tuning For Large Language Models Via Uncertainty-aware Self-reflection Liu Liangxin, Liu Xuebo, Wong Derek F., Li Dongfang, Wang Ziyi, Hu Baotian, Zhang Min
- Generative Expressive Conversational Speech Synthesis Liu Rui, Hu Yifan, Ren Yi, Yin Xiang, Li Haizhou
- Intactkv: Improving Large Language Model Quantization By Keeping Pivot Tokens Intact Liu Ruikang, Bai Haoli, Lin Haokun, Li Yuening, Gao Han, Xu Zhengzhuo, Hou Lu, Yao Jun, Yuan Chun
- Latent Guard: A Safety Framework For Text-to-image Generation Liu Runtao, Khakzar Ashkan, Gu Jindong, Chen Qifeng, Torr Philip, Pizzati Fabio
- ST-LLM: Large Language Models Are Effective Temporal Learners Liu Ruyang, Li Chen, Tang Haoran, Ge Yixiao, Shan Ying, Li Ge
- Memlong: Memory-augmented Retrieval For Long Text Modeling Liu Weijie, Tang Zecheng, Li Juntao, Chen Kehai, Zhang Min
- Demorank: Selecting Effective Demonstrations For Large Language Models In Ranking Task Liu Wenhan, Zhu Yutao, Dou Zhicheng
- Extensive Self-contrast Enables Feedback-free Language Model Alignment Liu Xiao, Song Xixuan, Dong Yuxiao, Tang Jie
- SHIELD: Evaluation And Defense Strategies For Copyright Compliance In LLM Text Generation Liu Xiaoze, Sun Ting, Xu Tianyang, Wu Feijie, Wang Cunxiang, Wang Xiaoqian, Gao Jing
- Datasets For Large Language Models: A Comprehensive Survey Liu Yang, Cao Jiahuan, Liu Chongyu, Ding Kai, Jin Lianwen
- Omgeval: An Open Multilingual Generative Evaluation Benchmark For Large Language Models Liu Yang, Xu Meng, Wang Shuo, Yang Liner, Wang Haoyu, Liu Zhenghao, Kong Cunliang, Chen Yun, Liu Yang, Sun Maosong, Yang Erhong
- SAFETY-J: Evaluating Safety With Critique Liu Yixiu, Zheng Yuxiang, Xia Shijie, Li Jiajun, Tu Yi, Song Chaoling, Liu Pengfei
- Probing Language Models For Pre-training Data Detection Liu Zhenhua, Zhu Tong, Tan Chuanyuan, Lu Haonan, Liu Bing, Chen Wenliang
- Agentlite: A Lightweight Library For Building And Advancing Task-oriented LLM Agent System Liu Zhiwei, Yao Weiran, Zhang Jianguo, Yang Liangwei, Liu Zuxin, Tan Juntao, Choubey Prafulla K., Lan Tian, Wu Jason, Wang Huan, Heinecke Shelby, Xiong Caiming, Savarese Silvio
- Fantastic Semantics And Where To Find Them: Investigating Which Layers Of Generative Llms Reflect Lexical Semantics Liu Zhu, Kong Cunliang, Liu Ying, Sun Maosong
- Chain-of-spot: Interactive Reasoning Improves Large Vision-language Models Liu Zuyan, Dong Yuhao, Rao Yongming, Zhou Jie, Lu Jiwen
- A Closer Look Into Mixture-of-experts In Large Language Models Lo Ka Man, Huang Zeyu, Qiu Zihan, Wang Zili, Fu Jie
- Roleplay-doh: Enabling Domain-experts To Create Llm-simulated Patients Via Eliciting And Adhering To Principles Louie Ryan, Nandi Ananjan, Fang William, Chang Cheng, Brunskill Emma, Yang Diyi
- Revisiting Multi-modal LLM Evaluation Lu Jian, Srivastava Shikhar, Chen Junyu, Shrestha Robik, Acharya Manoj, Kafle Kushal, Kanan Christopher
- Toolsandbox: A Stateful, Conversational, Interactive Evaluation Benchmark For LLM Tool Use Capabilities Lu Jiarui, Holleis Thomas, Zhang Yizhe, Aumayer Bernhard, Nan Feng, Bai Felix, Ma Shuang, Ma Shen, Li Mengyu, Yin Guoli, Wang Zirui, Pang Ruoming
- FIPO: Free-form Instruction-oriented Prompt Optimization With Preference Dataset And Modular Fine-tuning Schema Lu Junru, An Siyu, Zhang Min, He Yulan, Yin Di, Sun Xing
- LLM Discussion: Enhancing The Creativity Of Large Language Models Via Discussion Framework And Role-play Lu Li-chun, Chen Shou-jen, Pai Tsung-min, Yu Chan-hung, Lee Hung-yi, Sun Shao-hua
- Ovis: Structural Embedding Alignment For Multimodal Large Language Model Lu Shiyin, Li Yang, Chen Qing-guo, Xu Zhao, Luo Weihua, Zhang Kaifu, Ye Han-jia
- Benchmarking Chinese Knowledge Rectification In Large Language Models Lu Tianhe, Fang Jizhan, Yao Yunzhi, Xu Xin, Zhang Ningyu, Chen Huajun
- Not All Experts Are Equal: Efficient Expert Pruning And Skipping For Mixture-of-experts Large Language Models Lu Xudong, Liu Qi, Xu Yuhui, Zhou Aojun, Huang Siyuan, Zhang Bo, Yan Junchi, Li Hongsheng
- SPP: Sparsity-preserved Parameter-efficient Fine-tuning For Large Language Models Lu Xudong, Zhou Aojun, Xu Yuhui, Zhang Renrui, Gao Peng, Li Hongsheng
- Longheads: Multi-head Attention Is Secretly A Long Context Processor Lu Yi, Zhou Xin, He Wei, Zhao Jun, Ji Tao, Gui Tao, Zhang Qi, Huang Xuanjing
- Llamax: Scaling Linguistic Horizons Of LLM By Enhancing Translation Capabilities Beyond 100 Languages Lu Yinquan, Zhu Wenhao, Li Lei, Qiao Yu, Yuan Fei
- Layoutllm: Layout Instruction Tuning With Large Language Models For Document Understanding Luo Chuwei, Shen Yufan, Zhu Zhaoqing, Zheng Qi, Yu Zhi, Yao Cong
- Learning Positional Attention For Sequential Recommendation Luo Fan, Zhang Juan, Xu Shenghui
- Duetsim: Building User Simulator With Dual Large Language Models For Task-oriented Dialogues Luo Xiang, Tang Zhiwen, Wang Jin, Zhang Xuejie
- Zero-shot Cross-domain Dialogue State Tracking Via Dual Low-rank Adaptation Luo Xiang, Tang Zhiwen, Wang Jin, Zhang Xuejie
- Weblinx: Real-world Website Navigation With Multi-turn Dialogue Lù Xing Han, Kasner Zdeněk, Reddy Siva
- Sparsity-accelerated Training For Large Language Models Ma Da, Chen Lu, Wang Pengyu, Xu Hongshen, Li Hanqi, Sun Liangtai, Zhu Su, Fan Shuai, Yu Kai
- Debate On Graph: A Flexible And Reliable Reasoning Framework For Large Language Models Ma Jie, Gao Zhitao, Chai Qi, Sun Wangchun, Wang Pinghui, Pei Hongbin, Tao Jing, Song Lingyun, Liu Jun, Zhang Chen, Cui Lizhen
- Neighboring Perturbations Of Knowledge Editing On Large Language Models Ma Jun-yu, Ling Zhen-hua, Zhang Ningyu, Gu Jia-chen
- Perturbation-restrained Sequential Model Editing Ma Jun-yu, Wang Hong, Xu Hao-xiang, Ling Zhen-hua, Gu Jia-chen
- FBI-LLM: Scaling Up Fully Binarized Llms From Scratch Via Autoregressive Distillation Ma Liqun, Sun Mingjie, Shen Zhiqiang
- Coco-agent: A Comprehensive Cognitive MLLM Agent For Smartphone GUI Automation Ma Xinbei, Zhang Zhuosheng, Zhao Hai
- Megalodon: Efficient LLM Pretraining And Inference With Unlimited Context Length Ma Xuezhe, Yang Xiaomeng, Xiong Wenhan, Chen Beidi, Yu Lili, Zhang Hao, May Jonathan, Zettlemoyer Luke, Levy Omer, Zhou Chunting
- Llamoco: Instruction Tuning Of Large Language Models For Optimization Code Generation Ma Zeyuan, Guo Hongshu, Chen Jiacheng, Peng Guojun, Cao Zhiguang, Ma Yining, Gong Yue-jiao
- Gerea: Question-aware Prompt Captions For Knowledge-based Visual Question Answering Ma Ziyu, Li Shutao, Sun Bin, Cai Jianfei, Long Zuxiang, Ma Fuyan
- M2lingual: Enhancing Multilingual, Multi-turn Instruction Alignment In Large Language Models Maheshwary Rishabh, Yadav Vikas, Nguyen Hoang, Mahajan Khyati, Madhusudhan Sathwik Tejaswi
- ULLME: A Unified Framework For Large Language Model Embeddings With Generation-augmented Learning Man Hieu, Ngo Nghia Trung, Dernoncourt Franck, Nguyen Thien Huu
- Reinforced Prompt Personalization For Recommendation With Large Language Models Mao Wenyu, Wu Jiancan, Chen Weijian, Gao Chongming, Wang Xiang, He Xiangnan
- Understanding And Mitigating Language Confusion In Llms Marchisio Kelly, Ko Wei-yin, Bérard Alexandre, Dehaze Théo, Ruder Sebastian
- Openelm: An Efficient Language Model Family With Open Training And Inference Framework Mehta Sachin, Sekhavat Mohammad Hossein, Cao Qingqing, Horton Maxwell, Jin Yanzi, Sun Chenfan, Mirzadeh Iman, Najibi Mahyar, Belenko Dmitry, Zatloukal Peter, Rastegari Mohammad
- Realhf: Optimized RLHF Training For Large Language Models Through Parameter Reallocation Mei Zhiyu, Fu Wei, Li Kaiwei, Wang Guangju, Zhang Huanchen, Wu Yi
- Phybench: A Physical Commonsense Benchmark For Evaluating Text-to-image Models Meng Fanqing, Shao Wenqi, Luo Lixin, Wang Yahong, Chen Yiran, Lu Quanfeng, Yang Yue, Yang Tianshuo, Zhang Kaipeng, Qiao Yu, Luo Ping
- DCR: Divide-and-conquer Reasoning For Multi-choice Question Answering With Llms Meng Zijie, Zhang Yan, Feng Zhaopeng, Liu Zuozhu
- Linearizing Large Language Models Mercat Jean, Vasiljevic Igor, Keh Sedrick, Arora Kushal, Dave Achal, Gaidon Adrien, Kollar Thomas
- Natural Language As Policies: Reasoning For Coordinate-level Embodied Control With Llms Mikami Yusuke, Melnik Andrew, Miura Jun, Hautamäki Ville
- CBF-LLM: Safe Control For LLM Alignment Miyaoka Yuya, Inoue Masaki
- CHIQ: Contextual History Enhancement For Improving Query Rewriting In Conversational Search Mo Fengran, Ghaddar Abbas, Mao Kelong, Rezagholizadeh Mehdi, Chen Boxing, Liu Qun, Nie Jian-yun
- Fight Back Against Jailbreaking Via Prompt Adversarial Tuning Mo Yichuan, Wang Yuji, Wei Zeming, Wang Yisen
- Virtual Personas For Language Models Via An Anthology Of Backstories Moon Suhong, Abdulhai Marwa, Kang Minwoo, Suh Joseph, Soedarmadji Widyadewi, Behar Eran Kohen, Chan David M.
- Generative Representational Instruction Tuning Muennighoff Niklas, Su Hongjin, Wang Liang, Yang Nan, Wei Furu, Yu Tao, Singh Amanpreet, Kiela Douwe
- Open-llm-leaderboard: From Multi-choice To Open-style Questions For Llms Evaluation, Benchmark, And Arena Myrzakhan Aidar, Bsharat Sondos Mahmoud, Shen Zhiqiang
- Fakes Of Varying Shades: How Warning Affects Human Perception And Engagement Regarding LLM Hallucinations Nahar Mahjabin, Seo Haeseung, Lee Eun-ju, Xiong Aiping, Lee Dongwon
- Aurora-m: The First Open Source Multilingual Language Model Red-teamed According To The U.S. Executive Order Nakamura Taishi, Mishra Mayank, Tedeschi Simone, Chai Yekun, Stillerman Jason T, Friedrich Felix, Yadav Prateek, Laud Tanmay, Chien Vu Minh, Zhuo Terry Yue, Misra Diganta, Bogin Ben, Vu Xuan-son, Karpinska Marzena, Dantuluri Arnav Varma, Kusa Wojciech, Furlanello Tommaso, Yokota Rio, Muennighoff Niklas, Pai Suhas, Adewumi Tosin, Laippala Veronika, Yao Xiaozhe, Junior Adalberto, Ariyak Alpay, Drozd Aleksandr, Clive Jordan, Gupta Kshitij, Chen Liangyu, Sun Qi, Tsui Ken, Persaud Noah, Fahmy Nour, Chen Tianlong, Bansal Mohit, Monti Nicolo, Dang Tai, Luo Ziyang, Bui Tien-tung, Navigli Roberto, Mehta Virendra, Blumberg Matthew, May Victor, Nguyen Huu, Pyysalo Sampo
- Open Llama2 Model For The Lithuanian Language Nakvosas Artūras, Daniušis Povilas, Mulevičius Vytas
- Word2world: Generating Stories And Worlds Through Large Language Models Nasir Muhammad U., James Steven, Togelius Julian
- MBBQ: A Dataset For Cross-lingual Comparison Of Stereotypes In Generative Llms Neplenbroek Vera, Bisazza Arianna, Fernández Raquel
- How Well Can Llms Echo Us? Evaluating AI Chatbots' Role-play Ability With ECHO Ng Man Tik, Tse Hui Tung, Huang Jen-tse, Li Jingjing, Wang Wenxuan, Lyu Michael R.
- Advancing Vietnamese Visual Question Answering With Transformer And Convolutional Integration Nguyen Ngoc Son, Nguyen Van Son, Le Tung
- Xwin-lm: Strong And Scalable Alignment Practice For Llms Ni Bolin, Hu Jingcheng, Wei Yixuan, Peng Houwen, Zhang Zheng, Meng Gaofeng, Hu Han
- Mozip: A Multilingual Benchmark To Evaluate Large Language Models In Intellectual Property Ni Shiwen, Tan Minghuan, Bai Yuelin, Niu Fuqiang, Yang Min, Zhang Bowen, Xu Ruifeng, Chen Xiaojun, Li Chengming, Hu Xiping, Li Ye, Fan Jianping
- BMIKE-53: Investigating Cross-lingual Knowledge Editing With In-context Learning Nie Ercong, Shao Bo, Ding Zifeng, Wang Mingyang, Schmid Helmut, Schütze Hinrich
- Cfinbench: A Comprehensive Chinese Financial Benchmark For Large Language Models Nie Ying, Yan Binwei, Guo Tianyu, Liu Hao, Wang Haoyu, He Wei, Zheng Binfan, Wang Weihao, Li Qiang, Sun Weijian, Wang Yunhe, Tao Dacheng
- Can Llms Learn By Teaching? A Preliminary Study Ning Xuefei, Wang Zifu, Li Shiyao, Lin Zinan, Yao Peiran, Fu Tianyu, Blaschko Matthew B., Dai Guohao, Yang Huazhong, Wang Yu
- LBC: Language-based-classifier For Out-of-variable Generalization Noh Kangjun, Seong Baekryun, Byun Hoyoon, Choi Youngjun, Song Sungjin, Song Kyungwoo
- Nomic Embed: Training A Reproducible Long Context Text Embedder Nussbaum Zach, Morris John X., Duderstadt Brandon, Mulyar Andriy
- The Conversation Is The Command: Interacting With Real-world Autonomous Robot Through Natural Language Nwankwo Linus, Rueckert Elmar
- GECKO: Generative Language Model For English, Code And Korean Oh Sungwoo, Kim Donggyu
- Transformers Are Multi-state Rnns Oren Matanel, Hassid Michael, Yarden Nir, Adi Yossi, Schwartz Roy
- Large Language Models As Planning Domain Generators Oswald James, Srinivas Kavitha, Kokel Harsha, Lee Junkyu, Katz Michael, Sohrabi Shirin
- Easyinstruct: An Easy-to-use Instruction Processing Framework For Large Language Models Ou Yixin, Zhang Ningyu, Gui Honghao, Xu Ziwen, Qiao Shuofei, Xue Yida, Fang Runnan, Liu Kangwei, Li Lei, Bi Zhen, Zheng Guozhou, Chen Huajun
- Gemini Goes To Med School: Exploring The Capabilities Of Multimodal Large Language Models On Medical Challenge Problems & Hallucinations Pal Ankit, Sankarasubbu Malaikannan
- Auto-encoding Morph-tokens For Multimodal LLM Pan Kaihang, Tang Siliang, Li Juncheng, Fan Zhaoyu, Chow Wei, Yan Shuicheng, Chua Tat-seng, Zhuang Yueting, Zhang Hanwang
- Ee-tuning: An Economical Yet Scalable Solution For Tuning Early-exit Large Language Models Pan Xuchen, Chen Yanxi, Li Yaliang, Ding Bolin, Zhou Jingren
- Llmlingua-2: Data Distillation For Efficient And Faithful Task-agnostic Prompt Compression Pan Zhuoshi, Wu Qianhui, Jiang Huiqiang, Xia Menglin, Luo Xufang, Zhang Jue, Lin Qingwei, Rühle Victor, Yang Yuqing, Lin Chin-yew, Zhao H. Vicky, Qiu Lili, Zhang Dongmei
- Lottery Ticket Adaptation: Mitigating Destructive Interference In Llms Panda Ashwinee, Isik Berivan, Qi Xiangyu, Koyejo Sanmi, Weissman Tsachy, Mittal Prateek
- Salute The Classic: Revisiting Challenges Of Machine Translation In The Age Of Large Language Models Pang Jianhui, Ye Fanghua, Wang Longyue, Yu Dian, Wong Derek F., Shi Shuming, Tu Zhaopeng
- Phased Instruction Fine-tuning For Large Language Models Pang Wei, Zhou Chuan, Zhou Xiao-hua, Wang Xiaojie
- Pre-trained Vision And Language Transformers Are Few-shot Incremental Learners Park Keon-hee, Song Kyungwoo, Park Gyeong-moon
- Logicbench: Towards Systematic Evaluation Of Logical Reasoning Ability Of Large Language Models Parmar Mihir, Patel Nisarg, Varshney Neeraj, Nakamura Mutsumi, Luo Man, Mashetty Santosh, Mitra Arindam, Baral Chitta
- Multi-logieval: Towards Evaluating Multi-step Logical Reasoning Ability Of Large Language Models Patel Nisarg, Kulkarni Mohith, Parmar Mihir, Budhiraja Aashna, Nakamura Mutsumi, Varshney Neeraj, Baral Chitta
- Goex: Perspectives And Designs Towards A Runtime For Autonomous LLM Applications Patil Shishir G., Zhang Tianjun, Fang Vivian, C. Noppapon, Huang Roy, Hao Aaron, Casado Martin, Gonzalez Joseph E., Popa Raluca Ada, Stoica Ion
- Humaneval-xl: A Multilingual Code Generation Benchmark For Cross-lingual Natural Language Generalization Peng Qiwei, Chai Yekun, Li Xuhong
- Check-eval: A Checklist-based Approach For Evaluating Text Quality Pereira Jayr, Assumpcao Andre, Lotufo Roberto
- Suri: Multi-constraint Instruction Following For Long-form Text Generation Pham Chau Minh, Sun Simeng, Iyyer Mohit
- Bimedix: Bilingual Medical Mixture Of Experts LLM Pieri Sara, Mullappilly Sahal Shaji, Khan Fahad Shahbaz, Anwer Rao Muhammad, Khan Salman, Baldwin Timothy, Cholakkal Hisham
- Sbora: Low-rank Adaptation With Regional Weight Updates Po Lai-man, Liu Yuyang, Wu Haoxuan, Zhang Tianqi, Yu Wing-yin, Jiang Zeyu, Li Kun
- Dissecting Language Models: Machine Unlearning Via Selective Pruning Pochinkov Nicholas, Schoots Nandi
- Advanced Natural-based Interaction For The Italian Language: Llamantino-3-anita Polignano Marco, Basile Pierpaolo, Semeraro Giovanni
- Evaluation Of Language Models In The Medical Context Under Resource-constrained Settings Posada Andrea, Rueckert Daniel, Meissen Felix, Müller Philip
- DEXTER: A Benchmark For Open-domain Complex Question Answering Using Llms Prabhu Venktesh V. Deepali, Anand Avishek
- Customized Retrieval Augmented Generation And Benchmarking For EDA Tool Documentation QA Pu Yuan, He Zhuolun, Qiu Tairu, Wu Haoyuan, Yu Bei
- Fine-tuning With Divergent Chains Of Thought Boosts Reasoning Through Self-correction In Language Models Puerto Haritz, Chubakov Tilek, Zhu Xiaodan, Madabushi Harish Tayyar, Gurevych Iryna
- Mutual Reasoning Makes Smaller Llms Stronger Problem-solvers Qi Zhenting, Ma Mingyuan, Xu Jiahang, Zhang Li Lyna, Yang Fan, Yang Mao
- Towards Tracing Trustworthiness Dynamics: Revisiting Pre-training Period Of Large Language Models Qian Chen, Zhang Jie, Yao Wei, Liu Dongrui, Yin Zhenfei, Qiao Yu, Liu Yong, Shao Jing
- Learning To Poison Large Language Models During Instruction Tuning Qiang Yao, Zhou Xiangyu, Zade Saleh Zare, Roshani Mohammad Amin, Zytko Douglas, Zhu Dongxiao
- Agent Planning With World Knowledge Model Qiao Shuofei, Fang Runnan, Zhang Ningyu, Zhu Yuqi, Chen Xiang, Deng Shumin, Jiang Yong, Xie Pengjun, Huang Fei, Chen Huajun
- Prism: A Framework For Decoupling And Assessing The Capabilities Of Vlms Qiao Yuxuan, Duan Haodong, Fang Xinyu, Yang Junming, Chen Lin, Zhang Songyang, Wang Jiaqi, Lin Dahua, Chen Kai
- The Synergy Between Data And Multi-modal Large Language Models: A Survey From Co-development Perspective Qin Zhen, Chen Daoyuan, Zhang Wenhao, Yao Liuyi, Huang Yilun, Ding Bolin, Li Yaliang, Deng Shuiguang
- Lightning Attention-2: A Free Lunch For Handling Unlimited Sequence Lengths In Large Language Models Qin Zhen, Sun Weigao, Li Dong, Shen Xuyang, Sun Weixuan, Zhong Yiran
- Layerwise Recurrent Router For Mixture-of-experts Qiu Zihan, Huang Zeyu, Cheng Shuang, Zhou Yizhi, Wang Zili, Titov Ivan, Fu Jie
- Tool Learning With Large Language Models: A Survey Qu Changle, Dai Sunhao, Wei Xiaochi, Cai Hengyi, Wang Shuaiqiang, Yin Dawei, Xu Jun, Wen Ji-rong
- Automatically Generating Numerous Context-driven SFT Data For Llms Across Diverse Granularity Quan Shanghaoran
- Verifiably Following Complex Robot Instructions With Foundation Models Quartey Benedict, Rosen Eric, Tellex Stefanie, Konidaris George
- Detecting Hallucinations In Large Language Model Generation: A Token Probability Approach Quevedo Ernesto, Yero Jorge, Koerner Rachel, Rivas Pablo, Cerny Tomas
- Medit: Multilingual Text Editing Via Instruction Tuning Raheja Vipul, Alikaniotis Dimitris, Kulkarni Vivek, Alhafni Bashar, Kumar Dhruv
- Defan: Definitive Answer Dataset For Llms Hallucination Evaluation Rahman A B M Ashikur, Anwar Saeed, Usman Muhammad, Mian Ajmal
- Capturing Minds, Not Just Words: Enhancing Role-playing Language Models With Personality-indicative Data Ran Yiting, Wang Xintao, Xu Rui, Yuan Xinfeng, Liang Jiaqing, Xiao Yanghua, Yang Deqing
- Developing Safe And Responsible Large Language Model: Can We Balance Bias Reduction And Language Understanding In Large Language Models? Raza Shaina, Bamgbose Oluwanifemi, Ghuge Shardul, Tavakol Fatemeh, Reji Deepak John, Bashir Syed Raza
- MBIAS: Mitigating Bias In Large Language Models While Retaining Context Raza Shaina, Raval Ananya, Chatrath Veronica
- Unveiling And Mitigating Memorization In Text-to-image Diffusion Models Through Cross Attention Ren Jie, Li Yaxin, Zeng Shenglai, Xu Han, Lyu Lingjuan, Xing Yue, Tang Jiliang
- Samba: Simple Hybrid State Space Models For Efficient Unlimited Context Language Modeling Ren Liliang, Liu Yang, Lu Yadong, Shen Yelong, Liang Chen, Chen Weizhu
- On The Efficacy Of Eviction Policy For Key-value Constrained Generative Language Model Inference Ren Siyu, Zhu Kenny Q.
- Analyzing And Reducing Catastrophic Forgetting In Parameter Efficient Tuning Ren Weijieying, Li Xinlong, Wang Lei, Zhao Tianxiang, Qin Wei
- The Effect Of Sampling Temperature On Problem Solving In Large Language Models Renze Matthew, Guven Erhan
- Self-reflection In LLM Agents: Effects On Problem-solving Performance Renze Matthew, Guven Erhan
- Self-generated Replay Memories For Continual Neural Machine Translation Resta Michele, Bacciu Davide
- Code Generation With Alphacodium: From Prompt Engineering To Flow Engineering Ridnik Tal, Kredo Dedy, Friedman Itamar
- Exploring Advanced Large Language Models With Llmsuite Roffo Giorgio
- Unibucllm: Harnessing Llms For Automated Prediction Of Item Difficulty And Response Time For Multiple-choice Questions Rogoz Ana-cristina, Ionescu Radu Tudor
- CLAPNQ: Cohesive Long-form Answers From Passages In Natural Questions For RAG Systems Rosenthal Sara, Sil Avirup, Florian Radu, Roukos Salim
- Enhancing Low-resource NMT With A Multilingual Encoder And Knowledge Distillation: A Case Study Roy Aniruddha, Ray Pretam, Maheshwari Ayush, Sarkar Sudeshna, Goyal Pawan
- Writing In The Margins: Better Inference Pattern For Long Context Retrieval Russak Melisa, Jamil Umar, Bryant Christopher, Kamble Kiran, Magnuson Axel, Russak Mateusz, Alshikh Waseem
- Emobench: Evaluating The Emotional Intelligence Of Large Language Models Sabour Sahand, Liu Siyang, Zhang Zheyuan, Liu June M., Zhou Jinfeng, Sunaryo Alvionna S., Li Juanzi, Lee Tatia M. C., Mihalcea Rada, Huang Minlie
- Mitigating Hallucination In Fictional Character Role-play Sadeq Nafis, Xie Zhouhang, Kang Byungkyu, Lamba Prarit, Gao Xiang, Mcauley Julian
- Neurocache: Efficient Vector Retrieval For Long-range Language Modeling Safaya Ali, Yuret Deniz
- Pre-training Small Base Lms With Fewer Tokens Sanyal Sunny, Sanghavi Sujay, Dimakis Alexandros G.
- Can Llms Master Math? Investigating Large Language Models On Math Stack Exchange Satpute Ankit, Giessing Noah, Greiner-petter Andre, Schubotz Moritz, Teschke Olaf, Aizawa Akiko, Gipp Bela
- Symbolic Prompt Program Search: A Structure-aware Approach To Efficient Compile-time Prompt Optimization Schnabel Tobias, Neville Jennifer
- Morehopqa: More Than Multi-hop Reasoning Schnitzler Julian, Ho Xanh, Huang Jiahao, Boudin Florian, Sugawara Saku, Aizawa Akiko
- Team UTSA-NLP At Semeval 2024 Task 5: Prompt Ensembling For Argument Reasoning In Civil Procedures With GPT4 Schumacher Dan, Rios Anthony
- Evaluating Large Language Models With Fmeval Schwöbel Pola, Franceschi Luca, Zafar Muhammad Bilal, Vasist Keerthan, Malhotra Aman, Shenhar Tomer, Tailor Pinal, Yilmaz Pinar, Diamond Michael, Donini Michele
- Medpromptx: Grounded Multimodal Prompting For Chest X-ray Diagnosis Shaaban Mai A., Khan Adnan, Yaqub Mohammad
- Llava-prumerge: Adaptive Token Reduction For Efficient Large Multimodal Models Shang Yuzhang, Cai Mu, Xu Bingxin, Lee Yong Jae, Yan Yan
- Patch-level Training For Large Language Models Shao Chenze, Meng Fandong, Zhou Jie
- Patched MOA: Optimizing Inference For Diverse Software Development Tasks Sharma Asankhaya
- SPML: A DSL For Defending Language Models Against Prompt Attacks Sharma Reshabh K, Gupta Vinayak, Grossman Dan
- Accurate And Efficient Fine-tuning Of Quantized Large Language Models Through Optimal Balance Shen Ao, Wang Qiang, Lai Zhiquan, Li Xionglve, Li Dongsheng
- Nemo-aligner: Scalable Toolkit For Efficient Model Alignment Shen Gerald, Wang Zhilin, Delalleau Olivier, Zeng Jiaqi, Dong Yi, Egert Daniel, Sun Shengyang, Zhang Jimmy, Jain Sahil, Taghibakhshi Ali, Ausin Markel Sanz, Aithal Ashwath, Kuchaiev Oleksii
- Rapid Optimization For Jailbreaking Llms Via Subconscious Exploitation And Echopraxia Shen Guangyu, Cheng Siyuan, Zhang Kaiyuan, Tao Guanhong, An Shengwei, Yan Lu, Zhang Zhuo, Ma Shiqing, Zhang Xiangyu
- Shortcutsbench: A Large-scale Real-world Benchmark For Api-based Agents Shen Haiyang, Li Yue, Meng Desong, Cai Dongqi, Qi Sheng, Zhang Li, Xu Mengwei, Ma Yun
- Learning To Decode Collaboratively With Multiple Language Models Shen Shannon Zejiang, Lang Hunter, Wang Bailin, Kim Yoon, Sontag David
- Exploring User Retrieval Integration Towards Large Language Models For Cross-domain Sequential Recommendation Shen Tingjia, Wang Hao, Zhang Jiaqing, Zhao Sirui, Li Liangyue, Chen Zulong, Lian Defu, Chen Enhong
- Jetmoe: Reaching Llama2 Performance With 0.1M Dollars Shen Yikang, Guo Zhen, Cai Tianle, Qin Zengyi
- Proctag: Process Tagging For Assessing The Efficacy Of Document Instruction Data Shen Yufan, Luo Chuwei, Zhu Zhaoqing, Chen Yang, Zheng Qi, Yu Zhi, Bu Jiajun, Yao Cong
- Continual Learning Of Large Language Models: A Comprehensive Survey Shi Haizhou, Xu Zihao, Wang Hengyi, Qin Weiyi, Wang Wenyuan, Wang Yibin, Wang Zifeng, Ebrahimi Sayna, Wang Hao
- Paying More Attention To Image: A Training-free Method For Alleviating Hallucination In Lvlms Liu Shi, Zheng Kecheng, Chen Wei
- Eagle: Exploring The Design Space For Multimodal Llms With Mixture Of Encoders Shi Min, Liu Fuxiao, Wang Shihao, Liao Shijia, Radhakrishnan Subhashree, Huang De-an, Yin Hongxu, Sapra Karan, Yacoob Yaser, Shi Humphrey, Catanzaro Bryan, Tao Andrew, Kautz Jan, Yu Zhiding, Liu Guilin
- Reslora: Identity Residual Mapping In Low-rank Adaption Shi Shuhua, Huang Shaohan, Song Minghui, Li Zhoujun, Zhang Zihan, Huang Haizhen, Wei Furu, Deng Weiwei, Sun Feng, Zhang Qi
- Math-llava: Bootstrapping Mathematical Reasoning For Multimodal Large Language Models Shi Wenhao, Hu Zhiqiang, Bin Yi, Liu Junhua, Yang Yang, Ng See-kiong, Bing Lidong, Lee Roy Ka-wei
- Large Language Models Are Learnable Planners For Long-term Recommendation Shi Wentao, He Xiangnan, Zhang Yang, Gao Chongming, Li Xinyue, Zhang Jizhi, Wang Qifan, Feng Fuli
- Agentre: An Agent-based Framework For Navigating Complex Information Landscapes In Relation Extraction Shi Yuchen, Jiang Guochao, Qiu Tian, Yang Deqing
- Retrieval-enhanced Knowledge Editing In Language Models For Multi-hop Question Answering Shi Yucheng, Tan Qiaoyu, Wu Xuansheng, Zhong Shaochen, Zhou Kaixiong, Liu Ninghao
- Implicit Multimodal Alignment: On The Generalization Of Frozen Llms To Multimodal Inputs Shukor Mustafa, Cord Matthieu
- Constructing Benchmarks And Interventions For Combating Hallucinations In Llms Simhi Adi, Herzig Jonathan, Szpektor Idan, Belinkov Yonatan
- The Mysterious Case Of Neuron 1512: Injectable Realignment Architectures Reveal Internal Characteristics Of Meta's Llama 2 Model Smith Brenden, Baker Dallin, Chase Clayton, Barney Myles, Parker Kaden, Allred Makenna, Hu Peter, Evans Alex, Fulda Nancy
- Multi-task Inference: Can Large Language Models Follow Multiple Instructions At Once? Son Guijin, Baek Sangwon, Nam Sangdae, Jeong Ilgyun, Kim Seungone
- Moviechat+: Question-aware Sparse Memory For Long Video Question Answering Song Enxin, Chai Wenhao, Ye Tian, Hwang Jenq-neng, Li Xi, Wang Gaoang
- Increasing Model Capacity For Free: A Simple Strategy For Parameter Efficient Fine-tuning Song Haobo, Zhao Hao, Majumder Soumajit, Lin Tao
- Hierarchical Context Merging: Better Long Context Understanding For Pre-trained Llms Song Woomin, Oh Seunghyuk, Mo Sangwoo, Kim Jaehyung, Yun Sukmin, Ha Jung-woo, Shin Jinwoo
- Cs-bench: A Comprehensive Benchmark For Large Language Models Towards Computer Science Mastery Song Xiaoshuai, Diao Muxi, Dong Guanting, Wang Zhengyang, Fu Yujia, Qiao Runqi, Wang Zhexu, Fu Dayuan, Wu Huangxuan, Liang Bin, Zeng Weihao, Wang Yejie, Gongque Zhuoma, Yu Jianing, Tan Qiuna, Xu Weiran
- ELLA-V: Stable Neural Codec Language Modeling With Alignment-guided Sequence Reordering Song Yakun, Chen Zhuo, Wang Xiaofei, Ma Ziyang, Chen Xie
- Fine Tuning Vs. Retrieval Augmented Generation For Less Popular Knowledge Soudani Heydar, Kanoulas Evangelos, Hasibi Faegheh
- A Comprehensive Study Of The Capabilities Of Large Language Models For Vulnerability Detection Steenhoek Benjamin, Rahman Md Mahbubur, Roy Monoshi Kumar, Alam Mirza Sanjida, Barr Earl T., Le Wei
- Tailoring Vaccine Messaging With Common-ground Opinions Stureborg Rickard, Chen Sanxing, Xie Ruoyu, Patel Aayushi, Li Christopher, Zhu Chloe Qinyu, Hu Tingnan, Yang Jun, Dhingra Bhuwan
- ARKS: Active Retrieval In Knowledge Soup For Code Generation Su Hongjin, Jiang Shuyang, Lai Yuhang, Wu Haoyuan, Shi Boao, Liu Che, Liu Qian, Yu Tao
- Investigating Video Reasoning Capability Of Large Language Models With Tropes In Movies Su Hung-ting, Chao Chun-tong, Hsu Ya-ching, Lin Xudong, Niu Yulei, Lee Hung-yi, Hsu Winston H.
- Mitigating Entity-level Hallucination In Large Language Models Su Weihang, Tang Yichen, Ai Qingyao, Wang Changyue, Wu Zhijing, Liu Yiqun
- Llm-barber: Block-aware Rebuilder For Sparsity Mask In One-shot For Large Language Models Su Yupeng, Guan Ziyi, Liu Xiaoqun, Jin Tianlai, Wu Dongkuan, Chesi Graziano, Wong Ngai, Yu Hao
- Living In The Moment: Can Large Language Models Grasp Co-temporal Reasoning? Su Zhaochen, Li Juntao, Zhang Jun, Zhu Tong, Qu Xiaoye, Zhou Pan, Yan Bowen, Cheng Yu, Zhang Min
- Timo: Towards Better Temporal Reasoning For Language Models Su Zhaochen, Zhang Jun, Zhu Tong, Qu Xiaoye, Li Juntao, Zhang Min, Cheng Yu
- Decoding At The Speed Of Thought: Harnessing Parallel Decoding Of Lexical Units For Llms Sun Chenxi, Zhang Hongzhi, Lin Zijia, Zhang Jingyuan, Zhang Fuzheng, Wang Zhongyuan, Chen Bin, Song Chengru, Zhang Di, Gai Kun, Xiong Deyi
- Triforce: Lossless Acceleration Of Long Sequence Generation With Hierarchical Speculative Decoding Sun Hanshi, Chen Zhuoming, Yang Xinyu, Tian Yuandong, Chen Beidi
- Conifer: Improving Complex Constrained Instruction-following Ability Of Large Language Models Sun Haoran, Liu Lixin, Li Junjie, Wang Fengyu, Dong Baohua, Lin Ran, Huang Ruohui
- Benchmarking Chinese Commonsense Reasoning Of Llms: From Chinese-specifics To Reasoning-memorization Correlations Sun Jiaxing, Huang Weiquan, Wu Jiang, Gu Chenya, Li Wei, Zhang Songyang, Yan Hang, He Conghui
- Lawluo: A Chinese Law Firm Co-run By LLM Agents Sun Jingyun, Dai Chengxiao, Luo Zhongze, Chang Yangbo, Li Yang
- The Critique Of Critique Sun Shichao, Li Junlong, Yuan Weizhe, Yuan Ruifeng, Li Wenjie, Liu Pengfei
- QDA-SQL: Questions Enhanced Dialogue Augmentation For Multi-turn Text-to-sql Sun Yinggang, Guo Ziming, Yu Haining, Liu Chuanyi, Li Xiang, Wang Bingxuan, Yu Xiangzhan, Zhao Tiancheng
- You Only Cache Once: Decoder-decoder Architectures For Language Models Sun Yutao, Dong Li, Zhu Yi, Huang Shaohan, Wang Wenhui, Ma Shuming, Zhang Quanlu, Wang Jianyong, Wei Furu
- Llavaolmobitnet1b: Ternary LLM Goes Multimodal! Sundaram Jainaveen, Iyer Ravi
- Online Adaptation Of Language Models With A Memory Of Amortized Contexts Tack Jihoon, Kim Jaehyung, Mitchell Eric, Shin Jinwoo, Teh Yee Whye, Schwarz Jonathan Richard
- Mmlu-pro+: Evaluating Higher-order Reasoning And Shortcut Learning In Llms Taghanaki Saeid Asgari, Khani Aliasgahr, Khasahmadi Amir
- PROXYQA: An Alternative Framework For Evaluating Long-form Text Generation With Large Language Models Tan Haochen, Guo Zhijiang, Shi Zhan, Xu Lu, Liu Zhili, Feng Yunlong, Li Xiaoguang, Wang Yasheng, Shang Lifeng, Liu Qun, Song Linqi
- Idgenrec: Llm-recsys Alignment With Textual ID Learning Tan Juntao, Xu Shuyuan, Hua Wenyue, Ge Yingqiang, Li Zelong, Zhang Yongfeng
- Lloco: Learning Long Contexts Offline Tan Sijun, Li Xiuyu, Patil Shishir, Wu Ziyang, Zhang Tianjun, Keutzer Kurt, Gonzalez Joseph E., Popa Raluca Ada
- Vilco-bench: Video Language Continual Learning Benchmark Tang Tianqi, Deldari Shohreh, Xue Hao, De Melo Celso, Salim Flora D.
- Llmbox: A Comprehensive Library For Large Language Models Tang Tianyi, Hu Yiwen, Li Bingqian, Luo Wenyang, Qin Zijing, Sun Haoxiang, Wang Jiapeng, Xu Shiyi, Cheng Xiaoxue, Guo Geyang, Peng Han, Zheng Bowen, Tang Yiru, Min Yingqian, Chen Yushuo, Chen Jie, Zhao Yuanqian, Ding Luran, Wang Yuhao, Dong Zican, Xia Chunxuan, Li Junyi, Zhou Kun, Zhao Wayne Xin, Wen Ji-rong
- Step-back Profiling: Distilling User History For Personalized Scientific Writing Tang Xiangru, Zhang Xingyao, Shao Yanjun, Wu Jie, Zhao Yilun, Cohan Arman, Gong Ming, Zhang Dongmei, Gerstein Mark
- Multihop-rag: Benchmarking Retrieval-augmented Generation For Multi-hop Queries Tang Yixuan, Yang Yi
- A Survey On Self-evolution Of Large Language Models Tao Zhengwei, Lin Ting-en, Chen Xiancai, Li Hangyu, Wu Yuchuan, Li Yongbin, Jin Zhi, Huang Fei, Tao Dacheng, Zhou Jingren
- Targeted Visual Prompting For Medical Visual Question Answering Tascon-morales Sergio, Márquez-neila Pablo, Sznitman Raphael
- Interactive Prompt Debugging With Sequence Salience Tenney Ian, Mullins Ryan, Du Bin, Pandya Shree, Kahng Minsuk, Dixon Lucas
- I Am A Strange Dataset: Metalinguistic Tests For Language Models Thrush Tristan, Moore Jared, Monares Miguel, Potts Christopher, Kiela Douwe
- Selective Prompt Anchoring For Code Generation Tian Yuan, Zhang Tianyi
- Codehalu: Investigating Code Hallucinations In Llms Via Execution-based Verification Tian Yuchen, Yan Weixiang, Yang Qian, Zhao Xuandong, Chen Qian, Wang Wen, Luo Ziyang, Ma Lei, Song Dawn
- The Instinctive Bias: Spurious Images Lead To Hallucination In Mllms Han Tianyang, Lian Qing, Pan Rui, Pi Renjie, Zhang Jipeng, Diao Shizhe, Lin Yong, Zhang Tong
- PLUGH: A Benchmark For Spatial Understanding And Reasoning In Large Language Models Tikhonov Alexey
- Understanding Linear Probing Then Fine-tuning Language Models From NTK Perspective Tomihari Akiyoshi, Sato Issei
- Can Llms Learn From Previous Mistakes? Investigating Llms' Errors To Boost For Reasoning Tong Yongqi, Li Dawei, Wang Sizhe, Wang Yujia, Teng Fei, Shang Jingbo
- Lavy: Vietnamese Multimodal Large Language Model Tran Chi, Thanh Huong Le
- Two Tales Of Persona In Llms: A Survey Of Role-playing And Personalization Tseng Yu-min, Huang Yu-chao, Hsiao Teng-yun, Chen Wei-lin, Huang Chao-wei, Meng Yu, Chen Yun-nung
- Charactereval: A Chinese Benchmark For Role-playing Conversational Agent Evaluation Tu Quan, Fan Shilong, Tian Zihang, Yan Rui
- R-eval: A Unified Toolkit For Evaluating Domain Knowledge Of Retrieval Augmented Large Language Models Tu Shangqing, Wang Yuanchun, Yu Jifan, Xie Yuyang, Shi Yaran, Wang Xiaozhi, Zhang Jing, Hou Lei, Li Juanzi
- Lifelong Robot Library Learning: Bootstrapping Composable And Generalizable Skills For Embodied Control With Language Models Tziafas Georgios, Kasaei Hamidreza
- Syncode: LLM Generation With Grammar Augmentation Ugare Shubham, Suresh Tarun, Kang Hangoo, Misailovic Sasa, Singh Gagandeep
- PARADISE: Evaluating Implicit Planning Skills Of Language Models With Procedural Warnings And Tips Dataset Uzunoglu Arda, Safa Abdalfatah Rashid, Şahin Gözde Gül
- Investigating The Performance Of Language Models For Completing Code In Functional Programming Languages: A Haskell Case Study Van Dam Tim, Van Der Heijden Frank, De Bekker Philippe, Nieuwschepen Berend, Otten Marc, Izadi Maliheh
- Input Conditioned Graph Generation For Language Agents Vierling Lukas, Fu Jie, Chen Kai
- Temporal Blind Spots In Large Language Models Wallat Jonas, Jatowt Adam, Anand Avishek
- Fusechat: Knowledge Fusion Of Chat Models Wan Fanqi, Zhong Longguang, Yang Ziyi, Chen Ruijun, Quan Xiaojun
- Image Re-identification: Where Self-supervision Meets Vision-language Learning Wang Bin, Liang Yuying, Cai Lei, Huang Huakun, Zeng Huanqiang
- IAA: Inner-adaptor Architecture Empowers Frozen Large Language Model With Multimodal Capabilities Wang Bin, Xie Chunyu, Leng Dawei, Yin Yuhui
- Halu-j: Critique-based Hallucination Judge Wang Binjie, Chern Steffi, Chern Ethan, Liu Pengfei
- Mllm-tool: A Multimodal Large Language Model For Tool Agent Learning Wang Chenyu, Luo Weixin, Chen Qianyu, Mai Haonan, Guo Jindi, Dong Sixun, Xuan Xiaohua (Michael), Li Zhengxin, Ma Lin, Gao Shenghua
- Ada-leval: Evaluating Long-context Llms With Length-adaptable Benchmarks Wang Chonghua, Duan Haodong, Zhang Songyang, Lin Dahua, Chen Kai
- Towards Efficient And Effective Unlearning Of Large Language Models For Recommendation Wang Hangyu, Lin Jianghao, Chen Bo, Yang Yang, Tang Ruiming, Zhang Weinan, Yu Yong
- Techgpt-2.0: A Large Language Model Project To Solve The Task Of Knowledge Graph Construction Wang Jiaqi, Chang Yuying, Li Zhong, An Ning, Ma Qi, Hei Lei, Luo Haibo, Lu Yifei, Ren Feiliang
- Tarsier: Recipes For Training And Evaluating Large Video Description Models Wang Jiawei, Yuan Liping, Zhang Yuchen
- A User-centric Benchmark For Evaluating Large Language Models Wang Jiayin, Mo Fengran, Ma Weizhi, Sun Peijie, Zhang Min, Nie Jian-yun
- Helpful Or Harmful Data? Fine-tuning-free Shapley Attribution For Explaining Language Model Predictions Wang Jingtan, Lin Xiaoqiang, Qiao Rui, Foo Chuan-sheng, Low Bryan Kian Hsiang
- The Whole Is Better Than The Sum: Using Aggregated Demonstrations In In-context Learning For Sequential Recommendation Wang Lei, Lim Ee-peng
- SUBLLM: A Novel Efficient Architecture With Token Sequence Subsampling For LLM Wang Quandong, Yuan Yuxuan, Yang Xiaoyu, Zhang Ruike, Zhao Kang, Liu Wei, Luan Jian, Povey Daniel, Wang Bin
- Mfc-bench: Benchmarking Multimodal Fact-checking With Large Vision-language Models Wang Shengkang, Lin Hongzhan, Luo Ziyang, Ye Zhen, Chen Guang, Ma Jing
- Benchmark Self-evolving: A Multi-agent Framework For Dynamic LLM Evaluation Wang Siyuan, Long Zhuohan, Fan Zhihao, Wei Zhongyu, Huang Xuanjing
- Symbolic Working Memory Enhances Language Models For Complex Rule Application Wang Siyuan, Wei Zhongyu, Choi Yejin, Ren Xiang
- QUITO: Accelerating Long-context Reasoning Through Query-guided Context Compression Wang Wenshan, Wang Yihang, Fan Yixing, Liao Huaming, Guo Jiafeng
- Characteristic AI Agents Via Large Language Models Wang Xi, Dai Hongliang, Gao Shen, Li Piji
- Rdrec: Rationale Distillation For Llm-based Recommendation Wang Xinfeng, Cui Jin, Suzuki Yoshimi, Fukumoto Fumiyo
- Negativeprompt: Leveraging Psychology For Large Language Models Enhancement Via Negative Emotional Stimuli Wang Xu, Li Cheng, Chang Yi, Wang Jindong, Wu Yuan
- Defending Llms Against Jailbreaking Attacks Via Backtranslation Wang Yihan, Shi Zhouxing, Bai Andrew, Hsieh Cho-jui
- Speak It Out: Solving Symbol-related Problems With Symbol-to-language Conversion For Language Models Wang Yile, Cheng Sijie, Sun Zixin, Li Peng, Liu Yang
- Adaptvision: Dynamic Input Scaling In Mllms For Versatile Scene Understanding Wang Yonghui, Zhou Wengang, Feng Hao, Li Houqiang
- REAR: A Relevance-aware Retrieval-augmented Framework For Open-domain Question Answering Wang Yuhao, Ren Ruiyang, Li Junyi, Zhao Wayne Xin, Liu Jing, Wen Ji-rong
- Taste: Teaching Large Language Models To Translate Through Self-reflection Wang Yutong, Zeng Jiali, Liu Xuebo, Meng Fandong, Zhou Jie, Zhang Min
- Helpsteer2: Open-source Dataset For Training Top-performing Reward Models Wang Zhilin, Dong Yi, Delalleau Olivier, Zeng Jiaqi, Shen Gerald, Egert Daniel, Zhang Jimmy J., Sreedhar Makesh Narsimhan, Kuchaiev Oleksii
- Chatlogic: Integrating Logic Programming With Large Language Models For Multi-step Reasoning Wang Zhongsheng, Liu Jiamou, Bao Qiming, Rong Hongfei, Zhang Jingfeng
- RAT: Retrieval Augmented Thoughts Elicit Context-aware Reasoning In Long-horizon Generation Wang Zihao, Liu Anji, Lin Haowei, Li Jiaqi, Ma Xiaojian, Liang Yitao
- Farsight: Fostering Responsible AI Awareness During AI Application Prototyping Wang Zijie J., Kulkarni Chinmay, Wilcox Lauren, Terry Michael, Madaio Michael
- Flora: Federated Fine-tuning Large Language Models With Heterogeneous Low-rank Adaptations Wang Ziyao, Shen Zheyu, He Yexiao, Sun Guoheng, Wang Hongyi, Lyu Lingjuan, Li Ang
- Long-form Factuality In Large Language Models Wei Jerry, Yang Chengrun, Song Xinying, Lu Yifeng, Hu Nathan, Huang Jie, Tran Dustin, Peng Daiyi, Liu Ruibo, Huang Da, Du Cosmo, Le Quoc V.
- AC-EVAL: Evaluating Ancient Chinese Language Understanding In Large Language Models Wei Yuting, Xu Yuanxing, Wei Xinru, Yang Simin, Zhu Yangfu, Li Yuqing, Liu Di, Wu Bin
- AGLA: Mitigating Object Hallucinations In Large Vision-language Models With Assembly Of Global And Local Attention An Wenbin, Tian Feng, Leng Sicong, Nie Jiahao, Lin Haonan, Wang Qianying, Dai Guang, Chen Ping, Lu Shijian
- Longvlm: Efficient Long Video Understanding Via Large Language Models Weng Yuetian, Han Mingfei, He Haoyu, Chang Xiaojun, Zhuang Bohan
- Curriculum Learning With Quality-driven Data Selection Wu Biao, Meng Fang, Chen Ling
- EDA Corpus: A Large Language Model Dataset For Enhanced Interaction With Openroad Wu Bing-yue, Sharma Utsav, Kankipati Sai Rahul Dhanvi, Yadav Ajay, George Bintu Kappil, Guntupalli Sai Ritish, Rovinski Austin, Chhabria Vidya A.
- Towards Evaluating And Building Versatile Large Language Models For Medicine Wu Chaoyi, Qiu Pengcheng, Liu Jinxin, Gu Hongfei, Li Na, Zhang Ya, Wang Yanfeng, Xie Weidi
- What Are The Limits Of Cross-lingual Dense Passage Retrieval For Low-resource Languages? Wu Jie, Ren Zhaochun, Verberne Suzan
- \(\beta\)-dpo: Direct Preference Optimization With Dynamic \(\beta\) Wu Junkang, Xie Yuexiang, Yang Zhengyi, Wu Jiancan, Gao Jinyang, Ding Bolin, Wang Xiang, He Xiangnan
- Noiseboost: Alleviating Hallucination With Noise Perturbation For Multimodal Large Language Models Wu Kai, Jiang Boyuan, Jiang Zhengkai, He Qingdong, Luo Donghao, Wang Shengzhi, Liu Qingwen, Wang Chengjie
- Motionllm: Multimodal Motion-language Learning With Large Language Models Wu Qi, Zhao Yubo, Wang Yifan, Tai Yu-wing, Tang Chi-keung
- Avatar: Optimizing LLM Agents For Tool-assisted Knowledge Retrieval Wu Shirley, Zhao Shiyu, Huang Qian, Huang Kexin, Yasunaga Michihiro, Cao Kaidi, Ioannidis Vassilis N., Subbian Karthik, Leskovec Jure, Zou James
- How Easily Do Irrelevant Inputs Skew The Responses Of Large Language Models? Wu Siye, Xie Jian, Chen Jiangjie, Zhu Tinghui, Zhang Kai, Xiao Yanghua
- Freeva: Offline MLLM As Training-free Video Assistant Wu Wenhao
- Mitigating Misleading Chain-of-thought Reasoning With Selective Filtering Wu Yexin, Zhang Zhuosheng, Zhao Hai
- Agentkit: Structured LLM Reasoning With Dynamic Graphs Wu Yue, Fan Yewen, Min So Yeon, Prabhumoye Shrimai, Mcaleer Stephen, Bisk Yonatan, Salakhutdinov Ruslan, Li Yuanzhi, Mitchell Tom
- Acquiring Clean Language Models From Backdoor Poisoned Datasets By Downscaling Frequency Space Wu Zongru, Zhang Zhuosheng, Cheng Pengzhou, Liu Gongshen
- FOFO: A Benchmark To Evaluate Llms' Format-following Capability Xia Congying, Xing Chen, Du Jiangshu, Yang Xinyi, Feng Yihao, Xu Ran, Yin Wenpeng, Xiong Caiming
- FP6-LLM: Efficiently Serving Large Language Models Through Fp6-centric Algorithm-system Co-design Xia Haojun, Zheng Zhen, Wu Xiaoxia, Chen Shiyang, Yao Zhewei, Youn Stephen, Bakhtiari Arash, Wyatt Michael, Zhuang Donglin, Zhou Zhongzhu, Ruwase Olatunji, He Yuxiong, Song Shuaiwen Leon
- Infllm: Training-free Long-context Extrapolation For Llms With An Efficient Context Memory Xiao Chaojun, Zhang Pengle, Han Xu, Xiao Guangxuan, Lin Yankai, Zhang Zhengyan, Liu Zhiyuan, Sun Maosong
- Rar-b: Reasoning As Retrieval Benchmark Xiao Chenghao, Hudson G Thomas, Moubayed Noura Al
- Seeing The Image: Prioritizing Visual Correlation By Contrastive Alignment Xiao Xin, Wu Bohong, Wang Jiacong, Li Chunyuan, Zhou Xun, Guo Haoyuan
- Logicvista: Multimodal LLM Logical Reasoning Benchmark In Visual Contexts Xiao Yijia, Sun Edward, Liu Tianyu, Wang Wei
- Emovit: Revolutionizing Emotion Insights With Visual Instruction Tuning Xie Hongxia, Peng Chu-jun, Tseng Yu-wen, Chen Hung-jen, Hsu Chan-feng, Shuai Hong-han, Cheng Wen-huang
- Advancing Multimodal Large Language Models With Quantization-aware Scale Learning For Efficient Adaptation Xie Jingjing, Zhang Yuxin, Lin Mingbao, Cao Liujuan, Ji Rongrong
- Show-o: One Single Transformer To Unify Multimodal Understanding And Generation Xie Jinheng, Mao Weijia, Bai Zechen, Zhang David Junhao, Wang Weihao, Lin Kevin Qinghong, Gu Yuchao, Chen Zhijie, Yang Zhenheng, Shou Mike Zheng
- Large Multimodal Agents: A Survey Xie Junlin, Chen Zhihong, Zhang Ruifei, Wan Xiang, Li Guanbin
- Me Llama: Foundation Large Language Models For Medical Applications Xie Qianqian, Chen Qingyu, Chen Aokun, Peng Cheng, Hu Yan, Lin Fongci, Peng Xueqing, Huang Jimin, Zhang Jeffrey, Keloth Vipina, Zhou Xinyu, He Huan, Ohno-machado Lucila, Wu Yonghui, Xu Hua, Bian Jiang
- Decomposition For Enhancing Attention: Improving Llm-based Text-to-sql Through Workflow Paradigm Xie Yuanzhen, Jin Xinzhou, Xie Tao, Lin Mingxiong, Chen Liang, Yu Chenyun, Cheng Lei, Zhuo Chengxiang, Hu Bo, Li Zang
- Gradsafe: Detecting Jailbreak Prompts For Llms Via Safety-critical Gradient Analysis Xie Yueqi, Fang Minghong, Pi Renjie, Gong Neil
- Monte Carlo Tree Search Boosts Reasoning Via Iterative Preference Learning Xie Yuxi, Goyal Anirudh, Zheng Wenyue, Kan Min-yen, Lillicrap Timothy P., Kawaguchi Kenji, Shieh Michael
- Understanding The Weakness Of Large Language Model Agents Within A Complex Android Environment Xing Mingzhe, Zhang Rongkai, Xue Hui, Chen Qi, Yang Fan, Xiao Zhen
- Interactive Evolution: A Neural-symbolic Self-training Framework For Large Language Models Xu Fangzhi, Sun Qiushi, Cheng Kanzhi, Liu Jun, Qiao Yu, Wu Zhiyong
- On-device Language Models: A Comprehensive Review Xu Jiajun, Li Zhiyuan, Chen Wei, Wang Qun, Gao Xin, Cai Qi, Ling Ziyuan
- Faithful Logical Reasoning Via Symbolic Chain-of-thought Xu Jundong, Fei Hao, Pan Liangming, Liu Qian, Lee Mong-li, Hsu Wynne
- BESA: Pruning Large Language Models With Blockwise Parameter-efficient Sparsity Allocation Xu Peng, Shao Wenqi, Chen Mengzhao, Tang Shitao, Zhang Kaipeng, Gao Peng, An Fengwei, Qiao Yu, Luo Ping
- AIOS Compiler: LLM As Interpreter For Natural Language Programming And Flow Programming Of AI Agents Xu Shuyuan, Li Zelong, Mei Kai, Zhang Yongfeng
- Sayself: Teaching Llms To Express Confidence With Self-reflective Rationales Xu Tianyang, Wu Shujin, Diao Shizhe, Liu Xiaoze, Wang Xingyao, Chen Yangyi, Gao Jing
- Pride And Prejudice: LLM Amplifies Self-bias In Self-refinement Xu Wenda, Zhu Guanglei, Zhao Xuandong, Pan Liangming, Li Lei, Wang William Yang
- Chatglm-math: Improving Math Problem-solving In Large Language Models With A Self-critique Pipeline Xu Yifan, Liu Xiao, Liu Xinghan, Hou Zhenyu, Li Yueyan, Zhang Xiaohan, Wang Zihan, Zeng Aohan, Du Zhengxiao, Zhao Wenyi, Tang Jie, Dong Yuxiao
- Libra: Building Decoupled Vision System On Large Language Models Xu Yifan, Yang Xiaoshan, Song Yaguang, Xu Changsheng
- Linkprompt: Natural And Universal Adversarial Attacks On Prompt-based Language Models Xu Yue, Wang Wenjie
- Bag Of Tricks: Benchmarking Of Jailbreak Attacks On Llms Xu Zhao, Liu Fan, Liu Hao
- Beyond Perplexity: Multi-dimensional Safety Evaluation Of LLM Compression Xu Zhichao, Gupta Ashim, Li Tao, Bentham Oliver, Srikumar Vivek
- Activerag: Revealing The Treasures Of Knowledge Via Active Learning Xu Zhipeng, Liu Zhenghao, Liu Yibin, Xiong Chenyan, Yan Yukun, Wang Shuo, Yu Shi, Liu Zhiyuan, Yu Ge
- Do Large Language Models Have Compositional Ability? An Investigation Into Limitations And Scalability Xu Zhuoyan, Shi Zhenmei, Liang Yingyu
- Multi-programming Language Ensemble For Code Generation In Large Language Model Xue Tengfei, Li Xuefeng, Azim Tahir, Smirnov Roman, Yu Jianhui, Sadrieh Arash, Pahlavan Babak
- LEIA: Facilitating Cross-lingual Knowledge Transfer In Language Models With Entity-based Data Augmentation Yamada Ikuya, Ri Ryokan
- List Items One By One: A New Data Source And Learning Paradigm For Multimodal Llms Yan An, Yang Zhengyuan, Wu Junda, Zhu Wanrong, Yang Jianwei, Li Linjie, Lin Kevin, Wang Jianfeng, Mcauley Julian, Gao Jianfeng, Wang Lijuan
- Contrastive Instruction Tuning Yan Tianyi Lorena, Wang Fei, Huang James Y., Zhou Wenxuan, Yin Fan, Galstyan Aram, Yin Wenpeng, Chen Muhao
- Predicting The Big Five Personality Traits In Chinese Counselling Dialogues Using Large Language Models Yan Yang, Ma Lizhi, Li Anqi, Ma Jingsong, Lan Zhenzhong
- Crafting Customisable Characters With Llms: Introducing Simschat, A Persona-driven Role-playing Agent Framework Yang Bohao, Liu Dong, Tang Chen, Xiao Chenghao, Zhao Kun, Li Chao, Yuan Lin, Yang Guang, Huang Lanxiao, Lin Chenghua
- Emphasising Structured Information: Integrating Abstract Meaning Representation Into Llms For Enhanced Open-domain Dialogue Evaluation Yang Bohao, Zhao Kun, Tang Chen, Liu Dong, Zhan Liang, Lin Chenghua
- Mastering Text-to-image Diffusion: Recaptioning, Planning, And Generating With Multimodal Llms Yang Ling, Yu Zhaochen, Meng Chenlin, Xu Minkai, Ermon Stefano, Cui Bin
- Buffer Of Thoughts: Thought-augmented Reasoning With Large Language Models Yang Ling, Yu Zhaochen, Zhang Tianjun, Cao Shiyi, Xu Minkai, Zhang Wentao, Gonzalez Joseph E., Cui Bin
- Aqa-bench: An Interactive Benchmark For Evaluating Llms' Sequential Reasoning Ability Yang Siwei, Zhao Bingchen, Xie Cihang
- DGL: Dynamic Global-local Prompt Tuning For Text-video Retrieval Yang Xiangpeng, Zhu Linchao, Wang Xiaohan, Yang Yi
- Weak-to-strong Reasoning Yang Yuqing, Ma Yan, Liu Pengfei
- Self-distillation Bridges Distribution Gap In Language Model Fine-tuning Yang Zhaorui, Pang Tianyu, Feng Haozhe, Wang Han, Chen Wei, Zhu Minfeng, Liu Qian
- Data Contamination Can Cross Language Barriers Yao Feng, Zhuang Yufan, Sun Zihao, Xu Sunan, Kumar Animesh, Shang Jingbo
- Large Language Models Are Contrastive Reasoners Yao Liang
- Enhancing In-context Learning Performance With Just Svd-based Weight Pruning: A Theoretical Perspective Yao Xinhao, Hu Xiaolin, Yang Shenzhi, Liu Yong
- Sirllm: Streaming Infinite Retentive LLM Yao Yao, Li Zuchao, Zhao Hai
- Open-domain Implicit Format Control For Large Language Model Generation Yao Yiqun, Ma Wenjia, Fang Xuezhi, Jiang Xin, Li Xiang, Meng Xuying, Han Peng, Li Jing, Sun Aixin, Wang Yequan
- Seakr: Self-aware Knowledge Retrieval For Adaptive Retrieval Augmented Generation Yao Zijun, Qi Weijian, Pan Liangming, Cao Shulin, Hu Linmei, Liu Weichuan, Hou Lei, Li Juanzi
- Improving Visual Commonsense In Language Models Via Multiple Image Generation Yariv Guy, Schwartz Idan, Adi Yossi, Benaim Sagie
- Large Language Models As Hyper-heuristics For Combinatorial Optimization Ye Haoran, Wang Jiarui, Cao Zhiguang, Berto Federico, Hua Chuanbo, Kim Haeyeon, Park Jinkyoo, Song Guojie
- Tooleyes: Fine-grained Evaluation For Tool Learning Capabilities Of Large Language Models In Real-world Scenarios Ye Junjie, Li Guanyu, Gao Songyang, Huang Caishuang, Wu Yilong, Li Sixian, Fan Xiaoran, Dou Shihan, Zhang Qi, Gui Tao, Huang Xuanjing
- Toolsword: Unveiling Safety Issues Of Large Language Models In Tool Learning Across Three Stages Ye Junjie, Li Sixian, Li Guanyu, Huang Caishuang, Gao Songyang, Wu Yilong, Zhang Qi, Gui Tao, Huang Xuanjing
- Rotbench: A Multi-level Benchmark For Evaluating The Robustness Of Large Language Models In Tool Learning Ye Junjie, Wu Yilong, Gao Songyang, Huang Caishuang, Li Sixian, Li Guanyu, Fan Xiaoran, Zhang Qi, Gui Tao, Huang Xuanjing
- BEAF: Observing Before-after Changes To Evaluate Hallucination In Vision-language Models Moon Ye-bin, Nam Hyeon-woo, Choi Wonseok, Oh Tae-hyun
- How Interpretable Are Reasoning Explanations From Prompting Large Language Models? Yeo Wei Jie, Satapathy Ranjan, Goh Rick Siow Mong, Cambria Erik
- Using Large Language Models To Create AI Personas For Replication And Prediction Of Media Effects: An Empirical Test Of 133 Published Experimental Research Findings Yeykelis Leo, Pichai Kaavya, Cummings James J., Reeves Byron
- MMAU: A Holistic Benchmark Of Agent Capabilities Across Diverse Domains Yin Guoli, Bai Haoping, Ma Shuang, Nan Feng, Sun Yanchao, Xu Zhaoyang, Ma Shen, Lu Jiarui, Kong Xiang, Zhang Aonan, Yap Dian Ang, Zhang Yizhe, Ahnert Karsten, Kamath Vik, Berglund Mathias, Walsh Dominic, Gindele Tobias, Wiest Juergen, Lai Zhengfeng, Wang Xiaoming, Shan Jiulong, Cao Meng, Pang Ruoming, Wang Zirui
- Do Large Language Model Understand Multi-intent Spoken Language? Yin Shangjian, Huang Peijie, Xu Yuhong, Huang Haojing, Chen Jiatian
- Relative Preference Optimization: Enhancing LLM Alignment Through Contrasting Responses Across Identical And Diverse Prompts Yin Yueqin, Wang Zhendong, Gu Yi, Huang Hai, Chen Weizhu, Zhou Mingyuan
- Internlm-math: Open Math Large Language Models Toward Verifiable Reasoning Ying Huaiyuan, Zhang Shuo, Li Linyang, Zhou Zhejian, Shao Yunfan, Fei Zhaoye, Ma Yichuan, Hong Jiawei, Liu Kuikun, Wang Ziyi, Wang Yudong, Wu Zijian, Li Shuaibin, Zhou Fengzhe, Liu Hongwei, Zhang Songyang, Zhang Wenwei, Yan Hang, Qiu Xipeng, Wang Jiayu, Chen Kai, Lin Dahua
- Llms-as-instructors: Learning From Errors Toward Automating Model Improvement Ying Jiahao, Lin Mingbao, Cao Yixin, Tang Wei, Wang Bo, Sun Qianru, Huang Xuanjing, Yan Shuicheng
- Unveiling The Safety Of Gpt-4o: An Empirical Study Using Jailbreak Attacks Ying Zonghao, Liu Aishan, Liu Xianglong, Tao Dacheng
- Debunc: Mitigating Hallucinations In Large Language Model Agent Communication With Uncertainty Estimations Yoffe Luke, Amayuelas Alfonso, Wang William Yang
- Safety Fine-tuning At (almost) No Cost: A Baseline For Vision Large Language Models Zong Yongshuo, Bohdal Ondrej, Yu Tingyang, Yang Yongxin, Hospedales Timothy
- C-TPT: Calibrated Test-time Prompt Tuning For Vision-language Models Via Text Feature Dispersion Yoon Hee Suk, Yoon Eunseop, Tee Joshua Tian Jin, Hasegawa-johnson Mark, Li Yingzhen, Yoo Chang D.
- When Linear Attention Meets Autoregressive Decoding: Towards More Effective And Efficient Linearized Large Language Models You Haoran, Fu Yichao, Wang Zheng, Yazdanbakhsh Amir, Lin Yingyan Celine
- Shiftaddllm: Accelerating Pretrained Llms Via Post-training Multiplication-less Reparameterization You Haoran, Guo Yipin, Fu Yichao, Zhou Wei, Shi Huihong, Zhang Xiaofan, Kundu Souvik, Yazdanbakhsh Amir, Lin Yingyan Celine
- Flow Of Reasoning: Efficient Training Of LLM Policy With Divergent Thinking Yu Fangxu, Jiang Lai, Kang Haoqiang, Hao Shibo, Qin Lianhui
- Neeko: Leveraging Dynamic Lora For Efficient Multi-character Role-playing Agent Yu Xiaoyan, Luo Tongxu, Wei Yifan, Lei Fangyu, Huang Yiming, Peng Hao, Zhu Liehuang
- BEYOND DIALOGUE: A Profile-dialogue Alignment Framework Towards General Role-playing Language Model Yu Yeyong, Yu Runsheng, Wei Haojie, Zhang Zhanqiu, Qian Quan
- Mitigate Position Bias In Large Language Models Via Scaling A Single Dimension Yu Yijiong, Jiang Huiqiang, Luo Xufang, Wu Qianhui, Lin Chin-yew, Li Dongsheng, Yang Yuqing, Huang Yongfeng, Qiu Lili
- Unveiling And Harnessing Hidden Attention Sinks: Enhancing Large Language Models Without Training Through Attention Calibration Yu Zhongzhi, Wang Zheng, Fu Yonggan, Shi Huihong, Shaikh Khalid, Lin Yingyan Celine
- Predicting Rewards Alongside Tokens: Non-disruptive Parameter Insertion For Efficient Inference Intervention In Large Language Model Yuan Chenhan, Huang Fei, Peng Ru, Lu Keming, Yu Bowen, Zhou Chang, Zhou Jingren
- KV Cache Compression, But What Must We Give In Return? A Comprehensive Benchmark Of Long Context Capable Approaches Yuan Jiayi, Liu Hongyi, Zhong Shaochen (Henry), Chuang Yu-neng, Li Songchen, Wang Guanchu, Le Duy, Jin Hongye, Chaudhary Vipin, Xu Zhaozhuo, Liu Zirui, Hu Xia
- EASYTOOL: Enhancing Llm-based Agents With Concise Tool Instruction Yuan Siyu, Song Kaitao, Chen Jiangjie, Tan Xu, Shen Yongliang, Kan Ren, Li Dongsheng, Yang Deqing
- R-judge: Benchmarking Safety Risk Awareness For LLM Agents Yuan Tongxin, He Zhiwei, Dong Lingzhong, Wang Yiming, Zhao Ruijie, Xia Tian, Xu Lizhen, Zhou Binglin, Li Fangqi, Zhang Zhuosheng, Wang Rui, Liu Gongshen
- Evaluating Character Understanding Of Large Language Models Via Character Profiling From Fictional Works Yuan Xinfeng, Yuan Siyu, Cui Yuhan, Lin Tianhe, Wang Xintao, Xu Rui, Chen Jiangjie, Yang Deqing
- A Hybrid RAG System With Comprehensive Enhancement On Complex Reasoning Yuan Ye, Liu Chengwu, Yuan Jingyang, Sun Gongbo, Li Siqi, Zhang Ming
- Refuse Whenever You Feel Unsafe: Improving Safety In Llms Via Decoupled Refusal Training Yuan Youliang, Jiao Wenxiang, Wang Wenxuan, Huang Jen-tse, Xu Jiahao, Liang Tian, He Pinjia, Tu Zhaopeng
- Do Large Language Models Understand Conversational Implicature -- A Case Study With A Chinese Sitcom Yue Shisen, Song Siyuan, Cheng Xinyuan, Hu Hai
- Web2code: A Large-scale Webpage-to-code Dataset And Evaluation Framework For Multimodal Llms Yun Sukmin, Lin Haokun, Thushara Rusiru, Bhat Mohammad Qazim, Wang Yongxin, Jiang Zutao, Deng Mingkai, Wang Jinhong, Tao Tianhua, Li Junbo, Li Haonan, Nakov Preslav, Baldwin Timothy, Liu Zhengzhong, Xing Eric P., Liang Xiaodan, Shen Zhiqiang
- Round Trip Translation Defence Against Large Language Model Jailbreaking Attacks Yung Canaan, Dolatabadi Hadi Mohaghegh, Erfani Sarah, Leckie Christopher
- Turkishmmlu: Measuring Massive Multitask Language Understanding In Turkish Yüksel Arda, Köksal Abdullatif, Şenel Lütfi Kerem, Korhonen Anna, Schütze Hinrich
- Building Accurate Translation-tailored Llms With Language Aware Instruction Tuning Zan Changtong, Ding Liang, Shen Li, Zhen Yibing, Liu Weifeng, Tao Dacheng
- In-context Example Selection Via Similarity Search Improves Low-resource Machine Translation Zebaze Armel, Sagot Benoît, Bawden Rachel
- Advancing Multimodal Large Language Models In Chart Question Answering With Visualization-referenced Instruction Tuning Zeng Xingchen, Lin Haichuan, Ye Yilin, Zeng Wei
- Token-level Direct Preference Optimization Zeng Yongcheng, Liu Guoqing, Ma Weiyu, Yang Ning, Zhang Haifeng, Wang Jun
- Can Mllms Perform Text-to-image In-context Learning? Zeng Yuchen, Kang Wonjun, Chen Yicong, Koo Hyung Il, Lee Kangwook
- MR-BEN: A Comprehensive Meta-reasoning Benchmark For Large Language Models Zeng Zhongshen, Liu Yinhong, Wan Yingjia, Li Jingyao, Chen Pengguang, Dai Jianbo, Yao Yuxuan, Xu Rongwu, Qi Zehan, Zhao Wanru, Shen Linling, Lu Jianqiao, Tan Haochen, Chen Yukang, Zhang Hao, Shi Zhan, Wang Bailin, Guo Zhijiang, Jia Jiaya
- Injecagent: Benchmarking Indirect Prompt Injections In Tool-integrated Large Language Model Agents Zhan Qiusi, Liang Zhixiang, Ying Zifan, Kang Daniel
- Skyeyegpt: Unifying Remote Sensing Vision-language Tasks Via Instruction Tuning With Large Language Model Zhan Yang, Xiong Zhitong, Yuan Yuan
- Sciglm: Training Scientific Language Models With Self-reflective Instruction Annotation And Tuning Zhang Dan, Hu Ziniu, Zhoubian Sining, Du Zhengxiao, Yang Kaiyu, Wang Zihan, Yue Yisong, Dong Yuxiao, Tang Jie
- Chemllm: A Chemical Large Language Model Zhang Di, Liu Wei, Tan Qian, Chen Jingdan, Yan Hang, Yan Yuliang, Li Jiatong, Huang Weiran, Yue Xiangyu, Ouyang Wanli, Zhou Dongzhan, Zhang Shufei, Su Mao, Zhong Han-sen, Li Yuqiang
- Speechgpt-gen: Scaling Chain-of-information Speech Generation Zhang Dong, Zhang Xin, Zhan Jun, Li Shimin, Zhou Yaqian, Qiu Xipeng
- Iterative Utility Judgment Framework Via Llms Inspired By Relevance In Philosophy Zhang Hengran, Bi Keping, Guo Jiafeng, Cheng Xueqi
- Are Large Language Models Good At Utility Judgments? Zhang Hengran, Zhang Ruqing, Guo Jiafeng, De Rijke Maarten, Fan Yixing, Cheng Xueqi
- Sinklora: Enhanced Efficiency And Chat Capabilities For Long-context Large Language Models Zhang Hengyu
- Balancing Speciality And Versatility: A Coarse To Fine Framework For Supervised Fine-tuning Large Language Model Zhang Hengyuan, Wu Yanru, Li Dawei, Yang Sak, Zhao Rui, Jiang Yong, Tan Fei
- Kb-plugin: A Plug-and-play Framework For Large Language Models To Induce Programs Over Low-resourced Knowledge Bases Zhang Jiajie, Cao Shulin, Hu Linmei, Feng Ling, Hou Lei, Li Juanzi
- Agentohana: Design Unified Data And Training Pipeline For Effective Agent Learning Zhang Jianguo, Lan Tian, Murthy Rithesh, Liu Zhiwei, Yao Weiran, Tan Juntao, Hoang Thai, Yang Liangwei, Feng Yihao, Liu Zuxin, Awalgaonkar Tulika, Niebles Juan Carlos, Savarese Silvio, Heinecke Shelby, Wang Huan, Xiong Caiming
- Fast And Slow Generating: An Empirical Study On Large And Small Language Models Collaborative Decoding Zhang Kaiyan, Wang Jianyu, Ding Ning, Qi Biqing, Hua Ermo, Lv Xingtai, Zhou Bowen
- Hire A Linguist!: Learning Endangered Languages With In-context Linguistic Descriptions Zhang Kexun, Choi Yee Man, Song Zhenqiao, He Taiqi, Wang William Yang, Li Lei
- Exploring The Best Practices Of Query Expansion With Large Language Models Zhang Le, Wu Yihong, Yang Qian, Nie Jian-yun
- ARL2: Aligning Retrievers For Black-box Large Language Models Via Self-guided Adaptive Relevance Labeling Zhang Lingxi, Yu Yue, Wang Kuan, Zhang Chao
- Automated Multi-level Preference For Mllms Zhang Mengxi, Wu Wenhao, Lu Yu, Song Yuxin, Rong Kang, Yao Huanjin, Zhao Jianbo, Liu Fanglong, Sun Yifan, Feng Haocheng, Wang Jingdong
- Transfertod: A Generalizable Chinese Multi-domain Task-oriented Dialogue System With Transfer Capabilities Zhang Ming, Huang Caishuang, Wu Yilong, Liu Shichun, Zheng Huiyuan, Dong Yurui, Shen Yujiong, Dou Shihan, Zhao Jun, Ye Junjie, Zhang Qi, Gui Tao, Huang Xuanjing
- Instructedit: Instruction-based Knowledge Editing For Large Language Models Zhang Ningyu, Tian Bozhong, Cheng Siyuan, Liang Xiaozhuan, Hu Yi, Xue Kouying, Gou Yanjie, Chen Xi, Chen Huajun
- Internlm-xcomposer-2.5: A Versatile Large Vision Language Model Supporting Long-contextual Input And Output Zhang Pan, Dong Xiaoyi, Zang Yuhang, Cao Yuhang, Qian Rui, Chen Lin, Guo Qipeng, Duan Haodong, Wang Bin, Ouyang Linke, Zhang Songyang, Zhang Wenwei, Li Yining, Gao Yang, Sun Peng, Zhang Xinyue, Li Wei, Li Jingwen, Wang Wenhai, Yan Hang, He Conghui, Zhang Xingcheng, Chen Kai, Dai Jifeng, Qiao Yu, Lin Dahua, Wang Jiaqi
- Soaring From 4K To 400K: Extending Llm's Context With Activation Beacon Zhang Peitian, Liu Zheng, Xiao Shitao, Shao Ninglu, Ye Qiwei, Dou Zhicheng
- Extending Llama-3's Context Ten-fold Overnight Zhang Peitian, Shao Ninglu, Liu Zheng, Xiao Shitao, Qian Hongjin, Ye Qiwei, Dou Zhicheng
- Tinyllama: An Open-source Small Language Model Zhang Peiyuan, Zeng Guangtao, Wang Tianduo, Lu Wei
- Look Ahead Or Look Around? A Theoretical Comparison Between Autoregressive And Masked Pretraining Zhang Qi, Du Tianqi, Huang Haotian, Wang Yifei, Wang Yisen
- VAR-CLIP: Text-to-image Generator With Visual Auto-regressive Modeling Zhang Qian, Dai Xiangzi, Yang Ninghua, An Xiang, Feng Ziyong, Ren Xingyu
- Instruct Large Language Models To Drive Like Humans Zhang Ruijun, Guo Xianda, Zheng Wenzhao, Zhang Chenming, Keutzer Kurt, Chen Long
- Cfbench: A Comprehensive Constraints-following Benchmark For Llms Zhang Tao, Shen Yanjun, Luo Wenjing, Zhang Yan, Liang Hao, Zhang Tao, Yang Fan, Lin Mingan, Qiao Yujing, Chen Weipeng, Cui Bin, Zhang Wentao, Zhou Zenan
- Nomad-attention: Efficient LLM Inference On Cpus Through Multiply-add-free Attention Zhang Tianyi, Yi Jonah Wonkyu, Yao Bowen, Xu Zhaozhuo, Shrivastava Anshumali
- Large Language Models As Evaluators For Recommendation Explanations Zhang Xiaoyu, Li Yishan, Wang Jiayin, Sun Bowen, Ma Weizhi, Sun Peijie, Zhang Min
- Chain Of Preference Optimization: Improving Chain-of-thought Reasoning In Llms Zhang Xuan, Du Chao, Pang Tianyu, Liu Qian, Gao Wei, Lin Min
- Glape: Gold Label-agnostic Prompt Evaluation And Optimization For Large Language Model Zhang Xuanchang, Zhang Zhuosheng, Zhao Hai
- Pybench: Evaluating LLM Agent On Various Real-world Coding Tasks Zhang Yaolun, Pan Yinxu, Wang Yudong, Cai Jie
- Mme-realworld: Could Your Multimodal LLM Challenge High-resolution Real-world Scenarios That Are Difficult For Humans? Zhang Yi-fan, Zhang Huanyu, Tian Haochen, Fu Chaoyou, Zhang Shuangqing, Wu Junfei, Li Feng, Wang Kun, Wen Qingsong, Zhang Zhang, Wang Liang, Jin Rong, Tan Tieniu
- Benchmarking Trustworthiness Of Multimodal Large Language Models: A Comprehensive Study Zhang Yichi, Huang Yao, Sun Yitong, Liu Chang, Zhao Zhe, Fang Zhengwei, Wang Yifan, Chen Huanran, Yang Xiao, Wei Xingxing, Su Hang, Dong Yinpeng, Zhu Jun
- Boosting Jailbreak Attack With Momentum Zhang Yihao, Wei Zeming
- Intention Analysis Makes Llms A Good Jailbreak Defender Zhang Yuqi, Ding Liang, Zhang Lefei, Tao Dacheng
- PPTC-R Benchmark: Towards Evaluating The Robustness Of Large Language Models For Powerpoint Task Completion Zhang Zekai, Guo Yiduo, Liang Yaobo, Zhao Dongyan, Duan Nan
- A Survey On The Memory Mechanism Of Large Language Model Based Agents Zhang Zeyu, Bo Xiaohe, Ma Chen, Li Rui, Chen Xu, Dai Quanyu, Zhu Jieming, Dong Zhenhua, Wen Ji-rong
- DARG: Dynamic Evaluation Of Large Language Models Via Adaptive Reasoning Graph Zhang Zhehao, Chen Jiaao, Yang Diyi
- Found In The Middle: How Language Models Use Long Contexts Better Via Plug-and-play Positional Encoding Zhang Zhenyu, Chen Runjin, Liu Shiwei, Yao Zhewei, Ruwase Olatunji, Chen Beidi, Wu Xiaoxia, Wang Zhangyang
- Shieldlm: Empowering Llms As Aligned, Customizable And Explainable Safety Detectors Zhang Zhexin, Lu Yida, Ma Jingyuan, Zhang Di, Li Rui, Ke Pei, Sun Hao, Sha Lei, Sui Zhifang, Wang Hongning, Huang Minlie
- Safe Unlearning: A Surprisingly Effective And Generalizable Solution To Defend Against Jailbreak Attacks Zhang Zhexin, Yang Junxiao, Ke Pei, Cui Shiyao, Zheng Chujie, Wang Hongning, Huang Minlie
- Multiple-choice Questions Are Efficient And Robust LLM Evaluators Zhang Ziyin, Jiang Zhaokun, Xu Lizhen, Hao Hongkun, Wang Rui
- Reverse That Number! Decoding Order Matters In Arithmetic Learning Zhang-li Daniel, Lin Nianyi, Yu Jifan, Zhang Zheyuan, Yao Zijun, Zhang Xiaokang, Hou Lei, Zhang Jing, Li Juanzi
- Consistency Matters: Explore Llms Consistency From A Black-box Perspective Zhao Fufangchen, Jin Guoqiang, Huang Jiaheng, Zhao Rui, Tan Fei
- Prefix Guidance: A Steering Wheel For Large Language Models To Defend Against Jailbreak Attacks Zhao Jiawei, Chen Kejiang, Yuan Xiaojian, Zhang Weiming
- Diffagent: Fast And Accurate Text-to-image API Selection With Large Language Model Zhao Lirui, Yang Yue, Zhang Kaipeng, Shao Wenqi, Zhang Yuxin, Qiao Yu, Luo Ping, Ji Rongrong
- SYNTHEVAL: Hybrid Behavioral Testing Of NLP Models With Synthetic Checklists Zhao Raoyuan, Köksal Abdullatif, Liu Yihong, Weissweiler Leonie, Korhonen Anna, Schütze Hinrich
- Defending Large Language Models Against Jailbreak Attacks Via Layer-specific Editing Zhao Wei, Li Zhe, Li Yige, Zhang Ye, Sun Jun
- Mg-llava: Towards Multi-granularity Visual Instruction Tuning Zhao Xiangyu, Li Xiangtai, Duan Haodong, Huang Haian, Li Yining, Chen Kai, Yang Hua
- Codejudge-eval: Can Large Language Models Be Good Judges In Code Understanding? Zhao Yuwei, Luo Ziyang, Tian Yuchen, Lin Hongzhan, Yan Weixiang, Li Annan, Ma Jing
- Gpt-4v(ision) Is A Generalist Web Agent, If Grounded Zheng Boyuan, Gou Boyu, Kil Jihyung, Sun Huan, Su Yu
- Ali-agent: Assessing Llms' Alignment With Human Values Via Agent-based Evaluation Zheng Jingnan, Wang Han, Zhang An, Nguyen Tai D., Sun Jun, Chua Tat-seng
- Kun: Answer Polishment For Chinese Self-alignment With Instruction Back-translation Zheng Tianyu, Guo Shuyue, Qu Xingwei, Guo Jiawei, Du Xinrun, Jia Qi, Lin Chenghua, Huang Wenhao, Fu Jie, Zhang Ge
- More-3s: Multimodal-based Offline Reinforcement Learning With Shared Semantic Spaces Zheng Tianyu, Zhang Ge, Qu Xingwei, Kuang Ming, Huang Stephen W., He Zhaofeng
- Llamafactory: Unified Efficient Fine-tuning Of 100+ Language Models Zheng Yaowei, Zhang Richong, Zhang Junhao, Ye Yanhan, Luo Zheyan, Feng Zhangchi, Ma Yongqiang
- Attention Heads Of Large Language Models: A Survey Zheng Zifan, Wang Yezhaohui, Huang Yuxin, Song Shichao, Tang Bo, Xiong Feiyu, Li Zhiyu
- Exploring Boundary Of GPT-4V On Marine Analysis: A Preliminary Case Study Zheng Ziqiang, Chen Yiwei, Zhang Jipeng, Vu Tuan-anh, Zeng Huimin, Tim Yue Him Wong, Yeung Sai-kit
- Agentgym: Evolving Large Language Model-based Agents Across Diverse Environments Zhiheng Xi, Yiwen Ding, Wenxiang Chen, Boyang Hong, Honglin Guo, Junzhe Wang, Dingwen Yang, Chenyang Liao, Xin Guo, Wei He, Songyang Gao, Lu Chen, Rui Zheng, Yicheng Zou, Tao Gui, Qi Zhang, Xipeng Qiu, Xuanjing Huang, Zuxuan Wu, Yu-gang Jiang
- Moextend: Tuning New Experts For Modality And Task Extension Zhong Shanshan, Gao Shanghua, Huang Zhongzhan, Wen Wushao, Zitnik Marinka, Zhou Pan
- Robust Prompt Optimization For Defending Language Models Against Jailbreaking Attacks Zhou Andy, Li Bo, Wang Haohan
- Moe-lpr: Multilingual Extension Of Large Language Models Through Mixture-of-experts With Language Priors Routing Zhou Hao, Wang Zhijun, Huang Shujian, Huang Xin, Han Xue, Feng Junlan, Deng Chao, Luo Weihua, Chen Jiajun
- Towards Democratizing Multilingual Large Language Models For Medicine Through A Two-stage Instruction Fine-tuning Approach Zhou Meng, Parmar Surajsinh, Bhatti Anubhav
- Evaluating The Smooth Control Of Attribute Intensity In Text Generation With Llms Zhou Shang, Yao Feng, Dong Chengyu, Wang Zihan, Shang Jingbo
- Sirius: Contextual Sparsity With Correction For Efficient Llms Zhou Yang, Chen Zhuoming, Xu Zhaozhuo, Lin Victoria, Chen Beidi
- Aligning Modalities In Vision Large Language Models Via Preference Fine-tuning Zhou Yiyang, Cui Chenhang, Rafailov Rafael, Finn Chelsea, Yao Huaxiu
- Calibrated Self-rewarding Vision Language Models Zhou Yiyang, Fan Zhiyuan, Cheng Dongjie, Yang Sihan, Chen Zhaorun, Cui Chenhang, Wang Xiyao, Li Yun, Zhang Linjun, Yao Huaxiu
- Emulated Disalignment: Safety Alignment For Large Language Models May Backfire! Zhou Zhanhui, Liu Jie, Dong Zhichen, Liu Jiaheng, Yang Chao, Ouyang Wanli, Qiao Yu
- How Alignment And Jailbreak Work: Explain LLM Safety Through Intermediate Hidden States Zhou Zhenhong, Yu Haiyang, Zhang Xinghua, Xu Rongwu, Huang Fei, Li Yongbin
- Lawgpt: A Chinese Legal Knowledge-enhanced Large Language Model Zhou Zhi, Shi Jiang-xin, Song Peng-xiao, Yang Xiao-wen, Jin Yi-xuan, Guo Lan-zhe, Li Yu-feng
- Benchmarking Large Language Models On CFLUE -- A Chinese Financial Language Understanding Evaluation Dataset Zhu Jie, Li Junhui, Wen Yalong, Guo Lifan
- Collectivesft: Scaling Large Language Models For Chinese Medical Benchmark With Collective Instructions In Healthcare Zhu Jingwei, Tan Minghuan, Yang Min, Li Ruixue, Alinejad-rokny Hamid
- Fastmem: Fast Memorization Of Prompt Improves Context Awareness Of Large Language Models Zhu Junyi, Liu Shuochen, Yu Yu, Tang Bo, Yan Yibo, Li Zhiyu, Xiong Feiyu, Xu Tong, Blaschko Matthew B.
- Dynamic Evaluation Of Large Language Models By Meta Probing Agents Zhu Kaijie, Wang Jindong, Zhao Qinlin, Xu Ruochen, Xie Xing
- LIME-M: Less Is More For Evaluation Of Mllms Zhu Kang, Zang Qianbo, Jia Shian, Wu Siwei, Fang Feiteng, Li Yizhi, Guo Shuyue, Zheng Tianyu, Li Bo, Wu Haoning, Qu Xingwei, Yang Jian, Liu Zachary, Yue Xiang, Liu J. H., Lin Chenghua, Yang Min, Ni Shiwen, Huang Wenhao, Zhang Ge
- Personality Alignment Of Large Language Models Zhu Minjun, Yang Linyi, Zhang Yue
- Scalable Matmul-free Language Modeling Zhu Rui-jie, Zhang Yu, Sifferman Ethan, Sheaves Tyler, Wang Yiqiao, Richmond Dustin, Zhou Peng, Eshraghian Jason K.
- Llama-moe: Building Mixture-of-experts From Llama With Continual Pre-training Zhu Tong, Qu Xiaoye, Dong Daize, Ruan Jiacheng, Tong Jingqi, He Conghui, Cheng Yu
- Question Translation Training For Better Multilingual Reasoning Zhu Wenhao, Huang Shujian, Yuan Fei, She Shuaijie, Chen Jiajun, Birch Alexandra
- Multilingual Contrastive Decoding Via Language-agnostic Layers Skipping Zhu Wenhao, Liu Sizhe, Huang Shujian, She Shuaijie, Wendler Chris, Chen Jiajun
- KG-FPQ: Evaluating Factuality Hallucination In Llms With Knowledge Graph-based False Premise Questions Zhu Yanxu, Xiao Jinlin, Wang Yuhang, Sang Jitao
- Prompting Large Language Models For Zero-shot Clinical Prediction With Structured Longitudinal Electronic Health Record Data Zhu Yinghao, Wang Zixiang, Gao Junyi, Tong Yuning, An Jingkun, Liao Weibin, Harrison Ewen M., Ma Liantao, Pan Chengwei
- Generative Pre-trained Speech Language Model With Efficient Hierarchical Transformer Zhu Yongxin, Su Dan, He Liqiang, Xu Linli, Yu Dong
- Knowagent: Knowledge-augmented Planning For Llm-based Agents Zhu Yuqi, Qiao Shuofei, Ou Yixin, Deng Shumin, Zhang Ningyu, Lyu Shiwei, Shen Yue, Liang Lei, Gu Jinjie, Chen Huajun
- INTERS: Unlocking The Power Of Large Language Models In Search With Instruction Tuning Zhu Yutao, Zhang Peitian, Zhang Chenghao, Chen Yifei, Xie Binyu, Liu Zheng, Wen Ji-rong, Dou Zhicheng
- Yulan: An Open-source Large Language Model Zhu Yutao, Zhou Kun, Mao Kelong, Chen Wentong, Sun Yiding, Chen Zhipeng, Cao Qian, Wu Yihan, Chen Yushuo, Wang Feng, Zhang Lei, Li Junyi, Wang Xiaolei, Wang Lei, Zhang Beichen, Dong Zican, Cheng Xiaoxue, Chen Yuhan, Tang Xinyu, Hou Yupeng, Ren Qiangqiang, Pang Xincheng, Xie Shufang, Zhao Wayne Xin, Dou Zhicheng, Mao Jiaxin, Lin Yankai, Song Ruihua, Xu Jun, Chen Xu, Yan Rui, Wei Zhewei, Hu Di, Huang Wenbing, Gao Ze-feng, Chen Yueguo, Lu Weizheng, Wen Ji-rong
- Halueval-wild: Evaluating Hallucinations Of Language Models In The Wild Zhu Zhiying, Yang Yiming, Sun Zhiqing
- Mmmmodal -- Multi-images Multi-audio Multi-turn Multi-modal Zolkepli Husein, Razak Aisyah, Adha Kamarul, Nazhan Ariff
- VL-ICL Bench: The Devil In The Details Of Benchmarking Multimodal In-context Learning Zong Yongshuo, Bohdal Ondrej, Hospedales Timothy
- Implicitave: An Open-source Dataset And Multimodal Llms Benchmark For Implicit Attribute Value Extraction Zou Henry Peng, Samuel Vinay, Zhou Yue, Zhang Weizhi, Fang Liancheng, Song Zihe, Yu Philip S., Caragea Cornelia
- MLKV: Multi-layer Key-value Heads For Memory Efficient Transformer Decoding Zuhri Zayd Muhammad Kawakibi, Adilazuarda Muhammad Farid, Purwarianti Ayu, Aji Alham Fikri
- Prompt-time Ontology-driven Symbolic Knowledge Capture With Large Language Models Çöplü Tolga, Bendiken Arto, Skomorokhov Andrii, Bateiko Eduard, Cobb Stephen
- Enhancing Visual Question Answering Through Question-driven Image Captions As Prompts Özdemir Övgü, Akagündüz Erdem
- BASE TTS: Lessons From Building A Billion-parameter Text-to-speech Model On 100K Hours Of Data Łajszczak Mateusz, Cámbara Guillermo, Li Yang, Beyhan Fatih, Van Korlaar Arent, Yang Fan, Joly Arnaud, Martín-cortinas Álvaro, Abbas Ammar, Michalski Adam, Moinet Alexis, Karlapati Sri, Muszyńska Ewa, Guo Haohan, Putrycz Bartosz, Gambino Soledad López, Yoo Kayeon, Sokolova Elena, Drugman Thomas
🏷 In Context Learning
- What Changes Can Large-scale Language Models Bring? Intensive Study On Hyperclova: Billions-scale Korean Generative Pretrained Transformers Boseop Kim, Hyoungseok Kim, Sang-woo Lee, Gichang Lee, Donghyun Kwak, Dong Hyeon Jeon, Sunghyun Park, Sungju Kim, Seonhoon Kim, Dongpil Seo, Heungsub Lee, Minyoung Jeong, Sungjae Lee, Minsub Kim, Suk Hyun Ko, Seokhun Kim, Taeyong Park, Jinuk Kim, Soyoung Kang, Na-hyeon Ryu, Kang Min Yoo, Minsuk Chang, Soobin Suh, Sookyo In, Jinseong Park, Kyungduk Kim, Hiun Kim, Jisu Jeong, Yong Goo Yeo, Donghoon Ham, Dongju Park, Min Young Lee, Jaewook Kang, Inho Kang, Jung-woo Ha, Woomyoung Park, Nako Sung
- Glam: Efficient Scaling Of Language Models With Mixture-of-experts Nan Du, Yanping Huang, Andrew M. Dai, Simon Tong, Dmitry Lepikhin, Yuanzhong Xu, Maxim Krikun, Yanqi Zhou, Adams Wei Yu, Orhan Firat, Barret Zoph, Liam Fedus, Maarten Bosma, Zongwei Zhou, Tao Wang, Yu Emma Wang, Kellie Webster, Marie Pellat, Kevin Robinson, Kathleen Meier-hellstern, Toju Duke, Lucas Dixon, Kun Zhang, Quoc V Le, Yonghui Wu, Zhifeng Chen, Claire Cui
- Learning To Retrieve Prompts For In-context Learning Ohad Rubin, Jonathan Herzig, Jonathan Berant
- Metaicl: Learning To Learn In Context Sewon Min, Mike Lewis, Luke Zettlemoyer, Hannaneh Hajishirzi
- An Explanation Of In-context Learning As Implicit Bayesian Inference Xie Sang Michael, Raghunathan Aditi, Liang Percy, Ma Tengyu
- Text And Patterns: For Effective Chain Of Thought, It Takes Two To Tango Aman Madaan, Amir Yazdanbakhsh
- Can Language Models Learn From Explanations In Context? Andrew K. Lampinen, Ishita Dasgupta, Stephanie C. Y. Chan, Kory Matthewson, Michael Henry Tessler, Antonia Creswell, James L. Mcclelland, Jane X. Wang, Felix Hill
- Exploring Length Generalization In Large Language Models Anil Cem, Wu Yuhuai, Andreassen Anders, Lewkowycz Aitor, Misra Vedant, Ramasesh Vinay, Slone Ambrose, Gur-ari Guy, Dyer Ethan, Neyshabur Behnam
- Bootstrapping Multilingual Semantic Parsers Using Large Language Models Awasthi Abhijeet, Gupta Nitish, Samanta Bidisha, Dave Shachi, Sarawagi Sunita, Talukdar Partha
- Rethinking The Role Of Scale For In-context Learning: An Interpretability-based Case Study At 66 Billion Scale Bansal Hritik, Gopalakrishnan Karthik, Dingliwal Saket, Bodapati Sravan, Kirchhoff Katrin, Roth Dan
- Leveraging Pre-trained Language Models For Conversational Information Seeking From Text Bellan Patrizio, Dragoni Mauro, Ghidini Chiara
- Large Language Models Can Implement Policy Iteration Brooks Ethan, Walls Logan, Lewis Richard L., Singh Satinder
- Data Curation Alone Can Stabilize In-context Learning Chang Ting-yun, Jia Robin
- Why Can GPT Learn In-context? Language Models Implicitly Perform Gradient Descent As Meta-optimizers Dai Damai, Sun Yutao, Dong Li, Hao Yaru, Ma Shuming, Sui Zhifang, Wei Furu
- Winodict: Probing Language Models For In-context Word Acquisition Eisenschlos Julian Martin, Cole Jeremy R., Liu Fangyu, Cohen William W.
- Thinking About GPT-3 In-context Learning For Biomedical IE? Think Again Gutiérrez Bernal Jiménez, Mcneal Nikolas, Washington Clay, Chen You, Li Lang, Sun Huan, Su Yu
- Instruction Induction: From Few Examples To Natural Language Task Descriptions Honovich Or, Shaham Uri, Bowman Samuel R., Levy Omer
- Self-generated In-context Learning: Leveraging Auto-regressive Language Models As A Demonstration Generator Hyuhng Joon Kim, Hyunsoo Cho, Junyeob Kim, Taeuk Kim, Kang Min Yoo, Sang-goo Lee
- HINT: Hypernetwork Instruction Tuning For Efficient Zero- & Few-shot Generalisation Ivison Hamish, Bhagia Akshita, Wang Yizhong, Hajishirzi Hannaneh, Peters Matthew
- Contrastive Learning For Prompt-based Few-shot Language Learners Jian Yiren, Gao Chongyang, Vosoughi Soroush
- Self-prompting Large Language Models For Zero-shot Open-domain QA Junlong Li, Jinyuan Wang, Zhuosheng Zhang, Hai Zhao
- Decomposed Prompting: A Modular Approach For Solving Complex Tasks Khot Tushar, Trivedi Harsh, Finlayson Matthew, Fu Yao, Richardson Kyle, Clark Peter, Sabharwal Ashish
- Internet-augmented Language Models Through Few-shot Prompting For Open-domain Question Answering Lazaridou Angeliki, Gribovskaya Elena, Stokowiec Wojciech, Grigorev Nikolai
- Explanations From Large Language Models Make Small Reasoners Better Li Shiyang, Chen Jianshu, Shen Yelong, Chen Zhiyu, Zhang Xinlu, Li Zekun, Wang Hong, Qian Jing, Peng Baolin, Mao Yi, Chen Wenhu, Yan Xifeng
- PAL: Program-aided Language Models Luyu Gao, Aman Madaan, Shuyan Zhou, Uri Alon, Pengfei Liu, Yiming Yang, Jamie Callan, Graham Neubig
- Grounding Language With Visual Affordances Over Unstructured Data Mees Oier, Borja-diaz Jessica, Burgard Wolfram
- Rethinking The Role Of Demonstrations: What Makes In-context Learning Work? Min Sewon, Lyu Xinxi, Holtzman Ari, Artetxe Mikel, Lewis Mike, Hajishirzi Hannaneh, Zettlemoyer Luke
- Thinksum: Probabilistic Reasoning Over Sets Using Large Language Models Ozturkler Batu, Malkin Nikolay, Wang Zhen, Jojic Nebojsa
- A Survey On In-context Learning Qingxiu Dong, Lei Li, Damai Dai, Ce Zheng, Jingyuan Ma, Rui Li, Heming Xia, Jingjing Xu, Zhiyong Wu, Baobao Chang, Xu Sun, Lei Li, Zhifang Sui
- Parallel Context Windows For Large Language Models Ratner Nir, Levine Yoav, Belinkov Yonatan, Ram Ori, Magar Inbal, Abend Omri, Karpas Ehud, Shashua Amnon, Leyton-brown Kevin, Shoham Yoav
- Super-prompting: Utilizing Model-independent Contextual Data To Reduce Data Annotation Required In Visual Commonsense Tasks Rezaei Navid, Reformat Marek Z.
- On The Effect Of Pretraining Corpora On In-context Learning By A Large-scale Language Model Seongjin Shin, Sang-woo Lee, Hwijeen Ahn, Sungdong Kim, Hyoungseok Kim, Boseop Kim, Kyunghyun Cho, Gichang Lee, Woomyoung Park, Jung-woo Ha, Nako Sung
- XRICL: Cross-lingual Retrieval-augmented In-context Learning For Cross-lingual Text-to-sql Semantic Parsing Shi Peng, Zhang Rui, Bai He, Lin Jimmy
- Black-box Tuning For Language-model-as-a-service Sun Tianxiang, Shao Yunfan, Qian Hong, Huang Xuanjing, Qiu Xipeng
- Prompting Palm For Translation: Assessing Strategies And Performance Vilar David, Freitag Markus, Cherry Colin, Luo Jiaming, Ratnakar Viresh, Foster George
- Two-stage LLM Fine-tuning With Less Specialization And More Generalization Wang Yihan, Si Si, Li Daliang, Lukasik Michal, Yu Felix, Hsieh Cho-jui, Dhillon Inderjit S, Kumar Sanjiv
- Large Language Models Are Few(1)-shot Table Reasoners Wenhu Chen
- The Unreliability Of Explanations In Few-shot Prompting For Textual Reasoning Xi Ye, Greg Durrett
- Rationale-augmented Ensembles In Language Models Xuezhi Wang, Jason Wei, Dale Schuurmans, Quoc Le, Ed Chi, Denny Zhou
- Language Models Are General-purpose Interfaces Yaru Hao, Haoyu Song, Li Dong, Shaohan Huang, Zewen Chi, Wenhui Wang, Shuming Ma, Furu Wei
- Structured Prompting: Scaling In-context Learning To 1,000 Examples Yaru Hao, Yutao Sun, Li Dong, Zhixiong Han, Yuxian Gu, Furu Wei
- Retrieval-augmented Multimodal Language Modeling Yasunaga Michihiro, Aghajanyan Armen, Shi Weijia, James Rich, Leskovec Jure, Liang Percy, Lewis Mike, Zettlemoyer Luke, Yih Wen-tau
- Complementary Explanations For Effective In-context Learning Ye Xi, Iyer Srinivasan, Celikyilmaz Asli, Stoyanov Ves, Durrett Greg, Pasunuru Ramakanth
- UL2: Unifying Language Learning Paradigms Yi Tay, Mostafa Dehghani, Vinh Q. Tran, Xavier Garcia, Jason Wei, Xuezhi Wang, Hyung Won Chung, Siamak Shakeri, Dara Bahri, Tal Schuster, Huaixiu Steven Zheng, Denny Zhou, Neil Houlsby, Donald Metzler
- Active Example Selection For In-context Learning Yiming Zhang, Shi Feng, Chenhao Tan
- Efficient Language Modeling With Sparse All-mlp Yu Ping, Artetxe Mikel, Ott Myle, Shleifer Sam, Gong Hongyu, Stoyanov Ves, Li Xian
- Self-adaptive In-context Learning: An Information Compression Perspective For In-context Example Selection And Ordering Zhiyong Wu, Yaoxiang Wang, Jiacheng Ye, Lingpeng Kong
- Teaching Algorithmic Reasoning Via In-context Learning Zhou Hattie, Nova Azade, Larochelle Hugo, Courville Aaron, Neyshabur Behnam, Sedghi Hanie
- Large Language Models Are Human-level Prompt Engineers Zhou Yongchao, Muresanu Andrei Ioan, Han Ziwen, Paster Keiran, Pitis Silviu, Chan Harris, Ba Jimmy
- Bring Your Own KG: Self-supervised Program Synthesis For Zero-shot KGQA Agarwal Dhruv, Das Rajarshi, Khosla Sopan, Gangadharaiah Rashmi
- Can Large Language Models Be Good Path Planners? A Benchmark And Investigation On Spatial-temporal Reasoning Aghzal Mohamed, Plaku Erion, Yao Ziyu
- Instructeval: Systematic Evaluation Of Instruction Selection Methods Ajith Anirudh, Pan Chris, Xia Mengzhou, Deshpande Ameet, Narasimhan Karthik
- Steering Large Language Models For Machine Translation With Finetuning And In-context Learning Alves Duarte M., Guerreiro Nuno M., Alves João, Pombal José, Rei Ricardo, De Souza José G. C., Colombo Pierre, Martins André F. T.
- BUFFET: Benchmarking Large Language Models For Few-shot Cross-lingual Transfer Asai Akari, Kudugunta Sneha, Yu Xinyan Velocity, Blevins Terra, Gonen Hila, Reid Machel, Tsvetkov Yulia, Ruder Sebastian, Hajishirzi Hannaneh
- Tallrec: An Effective And Efficient Tuning Framework To Align Large Language Model With Recommendation Bao Keqin, Zhang Jizhi, Zhang Yang, Wang Wenjie, Feng Fuli, He Xiangnan
- TART: A Plug-and-play Transformer Module For Task-agnostic Reasoning Bhatia Kush, Narayan Avanika, De Sa Christopher, Ré Christopher
- In-context Learning Dynamics With Random Binary Sequences Bigelow Eric J., Lubana Ekdeep Singh, Dick Robert P., Tanaka Hidenori, Ullman Tomer D.
- Scaling In-context Demonstrations With Structured Attention Cai Tianle, Huang Kaixuan, Lee Jason D., Wang Mengdi
- Can Transformers Learn Sequential Function Classes In Context? Campbell Ryan, Guo Emma, Hu Evan, Vir Reya, Hsiao Ethan
- Chatgpt Evaluation On Sentence Level Relations: A Focus On Temporal, Causal, And Discourse Relations Chan Chunkit, Cheng Jiayang, Wang Weiqi, Jiang Yuxin, Fang Tianqing, Liu Xin, Song Yangqiu
- Selective Demonstrations For Cross-domain Text-to-sql Chang Shuaichen, Fosler-lussier Eric
- Fireact: Toward Language Agent Fine-tuning Chen Baian, Shu Chang, Shareghi Ehsan, Collier Nigel, Narasimhan Karthik, Yao Shunyu
- Skills-in-context Prompting: Unlocking Compositionality In Large Language Models Chen Jiaao, Pan Xiaoman, Yu Dian, Song Kaiqiang, Wang Xiaoyang, Yu Dong, Chen Jianshu
- Large Language Models Are Visual Reasoning Coordinators Chen Liangyu, Li Bo, Shen Sheng, Yang Jingkang, Li Chunyuan, Keutzer Kurt, Darrell Trevor, Liu Ziwei
- Understanding And Improving In-context Learning On Vision-language Models Chen Shuo, Han Zhen, He Bailan, Buckley Mark, Torr Philip, Tresp Volker, Gu Jindong
- MMICT: Boosting Multi-modal Fine-tuning With In-context Examples Chen Tao, Zhang Enwei, Gao Yuting, Li Ke, Sun Xing, Zhang Yan, Li Hui, Ji Rongrong
- Self-icl: Zero-shot In-context Learning With Self-generated Demonstrations Chen Wei-lin, Wu Cheng-kuang, Chen Yun-nung, Chen Hsin-hsi
- The Janus Interface: How Fine-tuning In Large Language Models Amplifies The Privacy Risks Chen Xiaoyi, Tang Siyuan, Zhu Rui, Yan Shijun, Jin Lei, Wang Zihao, Su Liya, Zhang Zhikun, Wang Xiaofeng, Tang Haixu
- SALM: Speech-augmented Language Model With In-context Learning For Speech Recognition And Translation Chen Zhehuai, Huang He, Andrusenko Andrei, Hrinchuk Oleksii, Puvvada Krishna C., Li Jason, Ghosh Subhankar, Balam Jagadeesh, Ginsburg Boris
- Xplainllm: A QA Explanation Dataset For Understanding LLM Decision-making Chen Zichen, Chen Jianda, Gaidhani Mitali, Singh Ambuj, Sra Misha
- Neural Codec Language Models Are Zero-shot Text To Speech Synthesizers Chengyi Wang, Sanyuan Chen, Yu Wu, Ziqiang Zhang, Long Zhou, Shujie Liu, Zhuo Chen, Yanqing Liu, Huaming Wang, Jinyu Li, Lei He, Sheng Zhao, Furu Wei
- Large Language Models For User Interest Journeys Christakopoulou Konstantina, Lalama Alberto, Adams Cj, Qu Iris, Amir Yifat, Chucri Samer, Vollucci Pierce, Soldo Fabio, Bseiso Dina, Scodel Sarah, Dixon Lucas, Chi Ed H., Chen Minmin
- Llm-in-the-loop: Leveraging Large Language Model For Thematic Analysis Dai Shih-chieh, Xiong Aiping, Ku Lun-wei
- Llmebench: A Flexible Framework For Accelerating Llms Benchmarking Dalvi Fahim, Hasanain Maram, Boughorbel Sabri, Mousi Basel, Abdaljalil Samir, Nazar Nizi, Abdelali Ahmed, Chowdhury Shammur Absar, Mubarak Hamdy, Ali Ahmed, Hawasly Majd, Durrani Nadir, Alam Firoj
- User Simulation With Large Language Models For Evaluating Task-oriented Dialogue Davidson Sam, Romeo Salvatore, Shu Raphael, Gung James, Gupta Arshit, Mansour Saab, Zhang Yi
- Evaluating Chatgpt As A Recommender System: A Rigorous Approach Di Palma Dario, Biancofiore Giovanni Maria, Anelli Vito Walter, Narducci Fedelucio, Di Noia Tommaso, Di Sciascio Eugenio
- Do Llms Work On Charts? Designing Few-shot Prompts For Chart Question Answering And Summarization Do Xuan Long, Hassanpour Mohammad, Masry Ahmed, Kavehzadeh Parsa, Hoque Enamul, Joty Shafiq
- Prompt Optimization Via Adversarial In-context Learning Do Xuan Long, Zhao Yiran, Brown Hannah, Xie Yuxi, Zhao James Xu, Chen Nancy F., Kawaguchi Kenji, Shieh Michael, He Junxian
- Plugmed: Improving Specificity In Patient-centered Medical Dialogue Generation Using In-context Learning Dou Chengfeng, Jin Zhi, Jiao Wenping, Zhao Haiyan, Tao Zhenwei, Zhao Yongqiang
- Exploring The Relationship Between In-context Learning And Instruction Tuning Duan Hanyu, Tang Yixuan, Yang Yi, Abbasi Ahmed, Tam Kar Yan
- Measuring And Improving Attentiveness To Partial Inputs With Counterfactuals Elazar Yanai, Paranjape Bhargavi, Peng Hao, Wiegreffe Sarah, Raghavi Khyathi, Srikumar Vivek, Singh Sameer, Smith Noah A.
- Comparable Demonstrations Are Important In In-context Learning: A Novel Perspective On Demonstration Selection Fan Caoyun, Tian Jidong, Li Yitian, He Hao, Jin Yaohui
- Uncovering The Potential Of Chatgpt For Discourse Analysis In Dialogue: An Empirical Study Fan Yaxin, Jiang Feng, Li Peifeng, Li Haizhou
- Language Models Can Be Logical Solvers Feng Jiazhan, Xu Ruochen, Hao Junheng, Sharma Hiteshi, Shen Yelong, Zhao Dongyan, Chen Weizhu
- The Devil Is In The Errors: Leveraging Large Language Models For Fine-grained Machine Translation Evaluation Fernandes Patrick, Deutsch Daniel, Finkelstein Mara, Riley Parker, Martins André F. T., Neubig Graham, Garg Ankush, Clark Jonathan H., Freitag Markus, Firat Orhan
- Context-aware Meta-learning Fifty Christopher, Duan Dennis, Junkins Ronald G., Amid Ehsan, Leskovec Jure, Re Christopher, Thrun Sebastian
- Estimating Large Language Model Capabilities Without Labeled Test Data Fu Harvey Yiyun, Ye Qinyuan, Xu Albert, Ren Xiang, Jia Robin
- Gpt4aigchip: Towards Next-generation AI Accelerator Design Automation Via Large Language Models Fu Yonggan, Zhang Yongan, Yu Zhongzhi, Li Sixu, Ye Zhifan, Li Chaojian, Wan Cheng, Lin Yingyan
- Wanglab At Mediqa-chat 2023: Clinical Note Generation From Doctor-patient Conversations Using Large Language Models Giorgi John, Toma Augustin, Xie Ronald, Chen Sondra S., An Kevin R., Zheng Grace X., Wang Bo
- Improving Input-label Mapping With Demonstration Replay For In-context Learning Gong Zhuocheng, Liu Jiahao, Wang Qifan, Wang Jingang, Cai Xunliang, Zhao Dongyan, Yan Rui
- AMAGO: Scalable In-context Reinforcement Learning For Adaptive Agents Grigsby Jake, Fan Linxi, Zhu Yuke
- What Can Large Language Models Do In Chemistry? A Comprehensive Benchmark On Eight Tasks Guo Taicheng, Guo Kehan, Nan Bozhao, Liang Zhenwen, Guo Zhichun, Chawla Nitesh V., Wiest Olaf, Zhang Xiangliang
- Retrieval-augmented Code Generation For Universal Information Extraction Guo Yucan, Li Zixuan, Jin Xiaolong, Liu Yantao, Zeng Yutao, Liu Wenxuan, Li Xiang, Yang Pan, Bai Long, Guo Jiafeng, Cheng Xueqi
- Gistscore: Learning Better Representations For In-context Example Selection With Gist Bottlenecks Gupta Shivanshu, Rosenbaum Clemens, Elenberg Ethan R.
- A Theory Of Emergent In-context Learning As Implicit Structure Induction Hahn Michael, Goyal Navin
- Potential Benefits Of Employing Large Language Models In Research In Moral Education And Development Han Hyemin
- Toolkengpt: Augmenting Frozen Language Models With Massive Tools Via Tool Embeddings Hao Shibo, Liu Tianyang, Wang Zhen, Hu Zhiting
- Zero- And Few-shot Prompting With Llms: A Comparative Study With Fine-tuned Models For Bangla Sentiment Analysis Hasan Md. Arid, Das Shudipta, Anjum Afiyat, Alam Firoj, Anjum Anika, Sarker Avijit, Noori Sheak Rashed Haider
- ICL-D3IE: In-context Learning With Diverse Demonstrations Updating For Document Information Extraction He Jiabang, Wang Lei, Hu Yi, Liu Ning, Liu Hui, Xu Xing, Shen Heng Tao
- Using Natural Language Explanations To Improve Robustness Of In-context Learning He Xuanli, Wu Yuxiang, Camburu Oana-maria, Minervini Pasquale, Stenetorp Pontus
- Chatgpt For Zero-shot Dialogue State Tracking: A Solution Or An Opportunity? Heck Michael, Lubis Nurul, Ruppik Benjamin, Vukovic Renato, Feng Shutong, Geishauser Christian, Lin Hsien-chin, Van Niekerk Carel, Gašić Milica
- Assessing The Efficacy Of Large Language Models In Generating Accurate Teacher Responses Hicke Yann, Masand Abhishek, Guo Wentao, Gangavarapu Tushaar
- DP-OPT: Make Large Language Model Your Privacy-preserving Prompt Engineer Hong Junyuan, Wang Jiachen T., Zhang Chenhui, Li Zhangheng, Li Bo, Wang Zhangyang
- Deploying And Evaluating Llms To Program Service Mobile Robots Hu Zichao, Lucchetti Francesca, Schlesinger Claire, Saxena Yash, Freeman Anders, Modak Sadanand, Guha Arjun, Biswas Joydeep
- Lorahub: Efficient Cross-task Generalization Via Dynamic Lora Composition Huang Chengsong, Liu Qian, Lin Bill Yuchen, Pang Tianyu, Du Chao, Lin Min
- Llama Guard: Llm-based Input-output Safeguard For Human-ai Conversations Inan Hakan, Upasani Kartikeya, Chi Jianfeng, Rungta Rashi, Iyer Krithika, Mao Yuning, Tontchev Michael, Hu Qing, Fuller Brian, Testuggine Davide, Khabsa Madian
- Hijacking Context In Large Multi-modal Models Jeong Joonhyun
- Symbol Tuning Improves In-context Learning In Language Models Jerry Wei, Le Hou, Andrew Lampinen, Xiangning Chen, Da Huang, Yi Tay, Xinyun Chen, Yifeng Lu, Denny Zhou, Tengyu Ma, Quoc V. Le
- Natural Language Decomposition And Interpretation Of Complex Utterances Jhamtani Harsh, Fang Hao, Xia Patrick, Levy Eran, Andreas Jacob, Van Durme Ben
- Llmlingua: Compressing Prompts For Accelerated Inference Of Large Language Models Jiang Huiqiang, Wu Qianhui, Lin Chin-yew, Yang Yuqing, Qiu Lili
- Scaling Sentence Embeddings With Large Language Models Jiang Ting, Huang Shaohan, Luan Zhongzhi, Wang Deqing, Zhuang Fuzhen
- Generative Calibration For In-context Learning Jiang Zhongtao, Zhang Yuanzhe, Liu Cao, Zhao Jun, Liu Kang
- The Cost Of Down-scaling Language Models: Fact Recall Deteriorates Before In-context Learning Jin Tian, Clement Nolan, Dong Xin, Nagarajan Vaishnavh, Carbin Michael, Ragan-kelley Jonathan, Dziugaite Gintare Karolina
- Grounding Language Models To Images For Multimodal Inputs And Outputs Jing Yu Koh, Ruslan Salakhutdinov, Daniel Fried
- Is Chatgpt A Good Recommender? A Preliminary Study Junling Liu, Chao Liu, Peilin Zhou, Renjie Lv, Kang Zhou, Yan Zhang
- Synthetic Data Generation In Low-resource Settings Via Fine-tuning Of Large Language Models Kaddour Jean, Liu Qi
- Exploring Automatic Evaluation Methods Based On A Decoder-based LLM For Text Generation Kasahara Tomohito, Kawahara Daisuke
- Benchmarking Cognitive Biases In Large Language Models As Evaluators Koo Ryan, Lee Minhwa, Raheja Vipul, Park Jong Inn, Kim Zae Myung, Kang Dongyeop
- Understanding Catastrophic Forgetting In Language Models Via Implicit Inference Kotha Suhas, Springer Jacob Mitchell, Raghunathan Aditi
- Post Hoc Explanations Of Language Models Can Improve Language Models Krishna Satyapriya, Ma Jiaqi, Slack Dylan, Ghandeharioun Asma, Singh Sameer, Lakkaraju Himabindu
- In-context Explainers: Harnessing Llms For Explaining Black Box Models Kroeger Nicholas, Ley Dan, Krishna Satyapriya, Agarwal Chirag, Lakkaraju Himabindu
- Recallm: An Adaptable Memory Mechanism With Temporal Understanding For Large Language Models Kynoch Brandon, Latapie Hugo, Van Der Sluis Dwane
- Large Language Models Are Partially Primed In Pronoun Interpretation Lam Suet-ying, Zeng Qingcheng, Zhang Kexun, You Chenyu, Voigt Rob
- Voicebox: Text-guided Multilingual Universal Speech Generation At Scale Le Matthew, Vyas Apoorv, Shi Bowen, Karrer Brian, Sari Leda, Moritz Rashel, Williamson Mary, Manohar Vimal, Adi Yossi, Mahadeokar Jay, Hsu Wei-ning
- Prompted Llms As Chatbot Modules For Long Open-domain Conversation Lee Gibbeum, Hartmann Volker, Park Jongho, Papailiopoulos Dimitris, Lee Kangwook
- Is Attention Required For ICL? Exploring The Relationship Between Model Architecture And In-context Learning Ability Lee Ivan, Jiang Nan, Berg-kirkpatrick Taylor
- Teaching Arithmetic To Small Transformers Lee Nayoung, Sreenivasan Kartik, Lee Jason D., Lee Kangwook, Papailiopoulos Dimitris
- Do We Still Need Clinical Language Models? Lehman Eric, Hernandez Evan, Mahajan Diwakar, Wulff Jonas, Smith Micah J., Ziegler Zachary, Nadler Daniel, Szolovits Peter, Johnson Alistair, Alsentzer Emily
- Explaincpe: A Free-text Explanation Benchmark Of Chinese Pharmacist Examination Li Dongfang, Yu Jindi, Hu Baotian, Xu Zhenran, Zhang Min
- Overprompt: Enhancing Chatgpt Through Efficient In-context Learning Li Jiazheng, Zhao Runcong, Yang Yongxin, He Yulan, Gui Lin
- Are Human-generated Demonstrations Necessary For In-context Learning? Li Rui, Wang Guoyin, Li Jiwei
- Crosslingual Retrieval Augmented In-context Learning For Bangla Li Xiaoqian, Nie Ercong, Liang Sheng
- From Classification To Generation: Insights Into Crosslingual Retrieval Augmented ICL Li Xiaoqian, Nie Ercong, Liang Sheng
- On Bilingual Lexicon Induction With Large Language Models Li Yaoyiran, Korhonen Anna, Vulić Ivan
- Dissecting Chain-of-thought: Compositionality Through In-context Filtering And Learning Li Yingcong, Sreenivasan Kartik, Giannou Angeliki, Papailiopoulos Dimitris, Oymak Samet
- A Practical Survey On Zero-shot Prompt Design For In-context Learning Li Yinheng
- Textbooks Are All You Need II: Phi-1.5 Technical Report Li Yuanzhi, Bubeck Sébastien, Eldan Ronen, Del Giorno Allie, Gunasekar Suriya, Lee Yin Tat
- Prompting Large Language Models With Chain-of-thought For Few-shot Knowledge Base Question Generation Liang Yuanyuan, Wang Jianing, Zhu Hanlun, Wang Lei, Qian Weining, Lan Yunshi
- The Unlocking Spell On Base Llms: Rethinking Alignment Via In-context Learning Lin Bill Yuchen, Ravichander Abhilasha, Lu Ximing, Dziri Nouha, Sclar Melanie, Chandu Khyathi, Bhagavatula Chandra, Choi Yejin
- VILA: On Pre-training For Visual Language Models Lin Ji, Yin Hongxu, Ping Wei, Lu Yao, Molchanov Pavlo, Tao Andrew, Mao Huizi, Kautz Jan, Shoeybi Mohammad, Han Song
- Few-shot Adaptation For Parsing Contextual Utterances With Llms Lin Kevin, Xia Patrick, Fang Hao
- Unleashing The Creative Mind: Language Model As Hierarchical Policy For Improved Exploration On Challenging Problem Solving Ling Zhan, Fang Yunhao, Li Xuanlin, Mu Tongzhou, Lee Mingu, Pourreza Reza, Memisevic Roland, Su Hao
- In-context Vectors: Making In Context Learning More Effective And Controllable Through Latent Space Steering Liu Sheng, Ye Haotian, Xing Lei, Zou James
- MMHQA-ICL: Multimodal In-context Learning For Hybrid Question Answering Over Text, Tables And Images Liu Weihao, Lei Fangyu, Luo Tongxu, Lei Jiahe, He Shizhu, Zhao Jun, Liu Kang
- Calibrating Llm-based Evaluator Liu Yuxuan, Yang Tianchi, Huang Shaohan, Zhang Zihan, Huang Haizhen, Wei Furu, Deng Weiwei, Sun Feng, Zhang Qi
- Deja Vu: Contextual Sparsity For Efficient Llms At Inference Time Liu Zichang, Wang Jue, Dao Tri, Zhou Tianyi, Yuan Binhang, Song Zhao, Shrivastava Anshumali, Zhang Ce, Tian Yuandong, Re Christopher, Chen Beidi
- Data-to-text Generation For Severely Under-resourced Languages With GPT-3.5: A Bit Of Help Needed From Google Translate Lorandi Michela, Belz Anya
- EPA: Easy Prompt Augmentation On Large Language Models Via Multiple Sources And Multiple Targets Lu Hongyuan, Lam Wai
- Chain-of-dictionary Prompting Elicits Translation In Large Language Models Lu Hongyuan, Yang Haoran, Huang Haoyang, Zhang Dongdong, Lam Wai, Wei Furu
- Ziya-visual: Bilingual Large Vision-language Model Via Multi-task Instruction Tuning Lu Junyu, Zhang Dixiang, Wu Xiaojun, Gao Xinyu, Gan Ruyi, Zhang Jiaxing, Song Yan, Zhang Pingjian
- UI Layout Generation With Llms Guided By UI Grammar Lu Yuwen, Tong Ziang, Zhao Qinyi, Zhang Chengzhi, Li Toby Jia-jun
- Dr.icl: Demonstration-retrieved In-context Learning Luo Man, Xu Xin, Dai Zhuyun, Pasupat Panupong, Kazemi Mehran, Baral Chitta, Imbrasaite Vaiva, Zhao Vincent Y
- Recranker: Instruction Tuning Large Language Model As Ranker For Top-k Recommendation Luo Sichun, He Bowei, Zhao Haohan, Shao Wei, Qi Yanlin, Huang Yinya, Zhou Aojun, Yao Yuxuan, Li Zongpeng, Xiao Yuanzhang, Zhan Mingjie, Song Linqi
- Hrot: Hybrid Prompt Strategy And Retrieval Of Thought For Table-text Hybrid Question Answering Luo Tongxu, Lei Fangyu, Lei Jiahe, Liu Weihao, He Shihu, Zhao Jun, Liu Kang
- Fairness-guided Few-shot Prompting For Large Language Models Ma Huan, Zhang Changqing, Bian Yatao, Liu Lemao, Zhang Zhirui, Zhao Peilin, Zhang Shu, Fu Huazhu, Hu Qinghua, Wu Bingzhe
- Fine-tuning Language Models With Just Forward Passes Malladi Sadhika, Gao Tianyu, Nichani Eshaan, Damian Alex, Lee Jason D., Chen Danqi, Arora Sanjeev
- ICL Markup: Structuring In-context Learning Using Soft-token Tags Brunet Marc-etienne, Anderson Ashton, Zemel Richard
- Improving Automatic VQA Evaluation Using Large Language Models Mañas Oscar, Krojer Benno, Agrawal Aishwarya
- Self-checker: Plug-and-play Modules For Fact-checking With Large Language Models Li Miaoran, Peng Baolin, Galley Michel, Gao Jianfeng, Zhang Zhu
- Boosting Theory-of-mind Performance In Large Language Models Via Prompting Moghaddam Shima Rahimi, Honey Christopher J.
- Metavl: Transferring In-context Learning Ability From Language Models To Vision-language Models Monajatipoor Masoud, Li Liunian Harold, Rouhsedaghat Mozhdeh, Yang Lin F., Chang Kai-wei
- Adaptive Machine Translation With Large Language Models Moslem Yasmin, Haque Rejwanul, Kelleher John D., Way Andy
- Fine-tuning Large Language Models For Adaptive Machine Translation Moslem Yasmin, Haque Rejwanul, Way Andy
- Augmenting Large Language Model Translators Via Translation Memories Mu Yongyu, Reheman Abudurexiti, Cao Zhiquan, Fan Yuchun, Li Bei, Li Yinqiao, Xiao Tong, Zhang Chunliang, Zhu Jingbo
- In-context Learning Generalizes, But Not Always Robustly: The Case Of Syntax Mueller Aaron, Webson Albert, Petty Jackson, Linzen Tal
- Large Language Model Augmented Narrative Driven Recommendations Mysore Sheshera, Mccallum Andrew, Zamani Hamed
- Enhancing Few-shot Text-to-sql Capabilities Of Large Language Models: A Study On Prompt Design Strategies Nan Linyong, Zhao Yilun, Zou Weijin, Ri Narutatsu, Tae Jaesung, Zhang Ellen, Cohan Arman, Radev Dragomir
- Large Language Models As Tax Attorneys: A Case Study In Legal Capabilities Emergence Nay John J., Karamardian David, Lawsky Sarah B., Tao Wenting, Bhat Meghana, Jain Raghav, Lee Aaron Travis, Choi Jonathan H., Kasai Jungo
- In-context Example Selection With Influences Nguyen Tai, Wong Eric
- Code-style In-context Learning For Knowledge-based Question Answering Nie Zhijie, Zhang Richong, Wang Zhongyuan, Liu Xudong
- What In-context Learning "learns" In-context: Disentangling Task Recognition And Task Learning Pan Jane, Gao Tianyu, Chen Howard, Chen Danqi
- COSMIC: Data Efficient Instruction-tuning For Speech In-context Learning Pan Jing, Wu Jian, Gaur Yashesh, Sivasankaran Sunit, Chen Zhuo, Liu Shujie, Li Jinyu
- Trainable Transformer In Transformer Panigrahi Abhishek, Malladi Sadhika, Xia Mengzhou, Arora Sanjeev
- ART: Automatic Multi-step Reasoning And Tool-use For Large Language Models Paranjape Bhargavi, Lundberg Scott, Singh Sameer, Hajishirzi Hannaneh, Zettlemoyer Luke, Ribeiro Marco Tulio
- Evaluating In-context Learning Of Libraries For Code Generation Patel Arkil, Reddy Siva, Bahdanau Dzmitry, Dasigi Pradeep
- When Does In-context Learning Fall Short And Why? A Study On Specification-heavy Tasks Peng Hao, Wang Xiaozhi, Chen Jianhui, Li Weikai, Qi Yunjia, Wang Zimu, Wu Zhili, Zeng Kaisheng, Xu Bin, Hou Lei, Li Juanzi
- When Do Prompting And Prefix-tuning Work? A Theory Of Capabilities And Limitations Petrov Aleksandar, Torr Philip H. S., Bibi Adel
- Transformers In The Service Of Description Logic-based Contexts Poulis Angelos, Tsalapati Eleni, Koubarakis Manolis
- Decomposed Prompting For Machine Translation Between Related Languages Using Large Language Models Puduppully Ratish, Kunchukuttan Anoop, Dabre Raj, Aw Ai Ti, Chen Nancy F.
- PILLOW: Enhancing Efficient Instruction Fine-tuning Via Prompt Matching Qi Zhenting, Tan Xiaoyu, Shi Shaojie, Qu Chao, Xu Yinghui, Qi Yuan
- Harnessing The Power Of Large Language Models For Empathetic Response Generation: Empirical Investigations And Improvements Qian Yushan, Zhang Wei-nan, Liu Ting
- Hijacking Large Language Models Via Adversarial In-context Learning Qiang Yao, Zhou Xiangyu, Zhu Dongxiao
- Improving In-context Learning Via Bidirectional Alignment Qin Chengwei, Xia Wenhan, Jiao Fangkai, Chen Chen, Hu Yuchen, Ding Bosheng, Joty Shafiq
- In-context Learning With Iterative Demonstration Selection Qin Chengwei, Zhang Aston, Chen Chen, Dagar Anirudh, Ye Wenming
- Are Large Language Models Temporally Grounded? Qiu Yifu, Zhao Zheng, Ziser Yftah, Korhonen Anna, Ponti Edoardo M., Cohen Shay B.
- Layoutllm-t2i: Eliciting Layout Guidance From LLM For Text-to-image Generation Qu Leigang, Wu Shengqiong, Fei Hao, Nie Liqiang, Chua Tat-seng
- Dissecting In-context Learning Of Translations In Gpts Raunak Vikas, Awadalla Hany Hassan, Menezes Arul
- Do Gpts Produce Less Literal Translations? Raunak Vikas, Menezes Arul, Post Matt, Awadalla Hany Hassan
- STREET: A Multi-task Structured Reasoning And Explanation Benchmark Ribeiro Danilo, Wang Shen, Ma Xiaofei, Zhu Henry, Dong Rui, Kong Deguang, Burger Juliette, Ramos Anjelica, Wang William, Huang Zhiheng, Karypis George, Xiang Bing, Roth Dan
- ICLEF: In-context Learning With Expert Feedback For Explainable Style Transfer Saakyan Arkadiy, Muresan Smaranda
- Do Pretrained Transformers Learn In-context By Gradient Descent? Shen Lingfeng, Mishra Aayush, Khashabi Daniel
- Towards More Unified In-context Visual Understanding Sheng Dianmo, Chen Dongdong, Tan Zhentao, Liu Qiankun, Chu Qi, Bao Jianmin, Gong Tao, Liu Bin, Xu Shengwei, Yu Nenghai
- Prompt Space Optimizing Few-shot Reasoning Success With Large Language Models Shi Fobo, Qing Peijun, Yang Dong, Wang Nan, Lei Youbo, Lu Haonan, Lin Xiaodong, Li Duantengchuan
- Beyond Task Performance: Evaluating And Reducing The Flaws Of Large Multimodal Models With In-context Learning Shukor Mustafa, Rame Alexandre, Dancette Corentin, Cord Matthieu
- Measuring Inductive Biases Of In-context Learning With Underspecified Demonstrations Si Chenglei, Friedman Dan, Joshi Nitish, Feng Shi, Chen Danqi, He He
- Knowledge Unlearning For Llms: Tasks, Methods, And Challenges Si Nianwen, Zhang Hao, Chang Heyu, Zhang Wenlin, Qu Dan, Zhang Weiqiang
- SPARSEFIT: Few-shot Prompting With Sparse Fine-tuning For Jointly Generating Predictions And Natural Language Explanations Solano Jesus, Sanni Mardhiyah, Camburu Oana-maria, Minervini Pasquale
- Large Language Models Meet Open-world Intent Discovery And Recognition: An Evaluation Of Chatgpt Song Xiaoshuai, He Keqing, Wang Pei, Dong Guanting, Mou Yutao, Wang Jingang, Xian Yunsen, Cai Xunliang, Xu Weiran
- Learning From Red Teaming: Gender Bias Provocation And Mitigation In Large Language Models Su Hsuan, Cheng Cheng-chu, Farn Hua, Kumar Shachi H, Sahay Saurav, Chen Shang-tse, Lee Hung-yi
- Speech-based Slot Filling Using Large Language Models Sun Guangzhi, Feng Shutong, Jiang Dongcheng, Zhang Chao, Gašić Milica, Woodland Philip C.
- Autohint: Automatic Prompt Optimization With Hint Generation Sun Hong, Li Xue, Xu Yinchuan, Homma Youkow, Cao Qi, Wu Min, Jiao Jian, Charles Denis
- Generative Multimodal Models Are In-context Learners Sun Quan, Cui Yufeng, Zhang Xiaosong, Zhang Fan, Yu Qiying, Luo Zhengxiong, Wang Yueze, Rao Yongming, Liu Jingjing, Huang Tiejun, Wang Xinlong
- Exploring Chain-of-thought Style Prompting For Text-to-sql Tai Chang-you, Chen Ziru, Zhang Tianshu, Deng Xiang, Sun Huan
- Codi-2: In-context, Interleaved, And Interactive Any-to-any Generation Tang Zineng, Yang Ziyi, Khademi Mahmoud, Liu Yang, Zhu Chenguang, Bansal Mohit
- Multilingual Llms Are Better Cross-lingual In-context Learners With Alignment Tanwar Eshaan, Dutta Subhabrata, Borthakur Manish, Chakraborty Tanmoy
- Is Chatgpt A Highly Fluent Grammatical Error Correction System? A Comprehensive Evaluation Fang Tao, Yang Shu, Lan Kaixin, Wong Derek F., Hu Jinpeng, Chao Lidia S., Zhang Yue
- Webwise: Web Interface Control And Sequential Exploration With Large Language Models Tao Heyi, T Sethuraman V, Shlapentokh-rothman Michal, Hoiem Derek
- Fine-tune Language Models To Approximate Unbiased In-context Learning Chu Timothy, Song Zhao, Yang Chiwun
- Large Language Model Prompt Chaining For Long Legal Document Classification Trautmann Dietrich
- Small Language Models Improve Giants By Rewriting Their Outputs Vernikos Giorgos, Bražinskas Arthur, Adamek Jakub, Mallinson Jonathan, Severyn Aliaksei, Malmi Eric
- Freshllms: Refreshing Large Language Models With Search Engine Augmentation Vu Tu, Iyyer Mohit, Wang Xuezhi, Constant Noah, Wei Jerry, Wei Jason, Tar Chris, Sung Yun-hsuan, Zhou Denny, Le Quoc, Luong Thang
- Universal Self-adaptive Prompting Wan Xingchen, Sun Ruoxi, Nakhost Hootan, Dai Hanjun, Eisenschlos Julian Martin, Arik Sercan O., Pfister Tomas
- GPT-RE: In-context Learning For Relation Extraction Using Large Language Models Wan Zhen, Cheng Fei, Mao Zhuoyuan, Liu Qianying, Song Haiyue, Li Jiwei, Kurohashi Sadao
- Grammar Prompting For Domain-specific Language Generation With Large Language Models Wang Bailin, Wang Zi, Wang Xuezhi, Cao Yuan, Saurous Rif A., Kim Yoon
- Knowledgeable In-context Tuning: Exploring And Exploiting Factual Knowledge For In-context Learning Wang Jianing, Wang Chengyu, Tan Chuanqi, Huang Jun, Gao Ming
- Self-prompted Chain-of-thought On Large Language Models For Open-domain Multi-hop Reasoning Wang Jinyuan, Li Junlong, Zhao Hai
- Query2doc: Query Expansion With Large Language Models Wang Liang, Yang Nan, Wei Furu
- STEPS: A Benchmark For Order Reasoning In Sequential Tasks Wang Weizhi, Wang Hong, Yan Xifeng
- FIAT: Fusing Learning Paradigms With Instruction-accelerated Tuning Wang Xinyi, Wieting John, Clark Jonathan H.
- Hint-enhanced In-context Learning Wakes Large Language Models Up For Knowledge-intensive Tasks Wang Yifan, Guo Qingyan, Ni Xinzhe, Shi Chufan, Liu Lemao, Jiang Haiyun, Yang Yujiu
- DRDT: Dynamic Reflection With Divergent Thinking For Llm-based Sequential Recommendation Wang Yu, Liu Zhiwei, Zhang Jianguo, Yao Weiran, Heinecke Shelby, Yu Philip S.
- Multiqg-ti: Towards Question Generation From Multi-modal Sources Wang Zichao, Baraniuk Richard
- Larger Language Models Do In-context Learning Differently Wei Jerry, Wei Jason, Tay Yi, Tran Dustin, Webson Albert, Lu Yifeng, Chen Xinyun, Liu Hanxiao, Huang Da, Zhou Denny, Ma Tengyu
- Jailbreak And Guard Aligned Language Models With Only Few In-context Demonstrations Wei Zeming, Wang Yifei, Li Ang, Mo Yichuan, Wang Yisen
- In-context Pretraining: Language Modeling Beyond Document Boundaries Shi Weijia, Min Sewon, Lomeli Maria, Zhou Chunting, Li Margaret, Szilvasy Gergely, James Rich, Lin Xi Victoria, Smith Noah A., Zettlemoyer Luke, Yih Scott, Lewis Mike
- Exploring Parameter-efficient Fine-tuning Techniques For Code Generation With Large Language Models Weyssow Martin, Zhou Xin, Kim Kisub, Lo David, Sahraoui Houari
- Large Language Models Leverage External Knowledge To Extend Clinical Insight Beyond Language Boundaries Wu Jiageng, Wu Xian, Qiu Zhaopeng, Li Minghui, Zhang Yingying, Zheng Yefeng, Yuan Changzheng, Yang Jie
- Analyzing Chain-of-thought Prompting In Large Language Models Via Gradient-based Feature Attributions Wu Skyler, Shen Eric Meng, Badrinath Charumathi, Ma Jiaqi, Lakkaraju Himabindu
- An Early Evaluation Of Gpt-4v(ision) Wu Yang, Wang Shilong, Yang Hao, Zheng Tian, Zhang Hongbo, Zhao Yanyan, Qin Bing
- Semantic Parsing By Large Language Models For Intricate Updating Strategies Of Zero-shot Dialogue State Tracking Wu Yuxiang, Dong Guanting, Xu Weiran
- Openicl: An Open-source Framework For In-context Learning Wu Zhenyu, Wang Yaoxiang, Ye Jiacheng, Feng Jiangtao, Xu Jingjing, Qiao Yu, Wu Zhiyong
- A Simple Baseline For Knowledge-based Visual Question Answering Xenos Alexandros, Stafylakis Themos, Patras Ioannis, Tzimiropoulos Georgios
- O3D: Offline Data-driven Discovery And Distillation For Sequential Decision-making With Large Language Models Xiao Yuchen, Sun Yanchao, Xu Mengda, Madhushani Udari, Vann Jared, Garg Deepeka, Ganesh Sumitra
- Instructed Language Models With Retrievers Are Powerful Entity Linkers Xiao Zilin, Gong Ming, Wu Jie, Zhang Xingyao, Shou Linjun, Pei Jian, Jiang Daxin
- Expertprompting: Instructing Large Language Models To Be Distinguished Experts Xu Benfeng, Yang An, Lin Junyang, Wang Quan, Zhou Chang, Zhang Yongdong, Mao Zhendong
- Small Models Are Valuable Plug-ins For Large Language Models Xu Canwen, Xu Yichong, Wang Shuohang, Liu Yang, Zhu Chenguang, Mcauley Julian
- Mental-llm: Leveraging Large Language Models For Mental Health Prediction Via Online Text Data Xu Xuhai, Yao Bingsheng, Dong Yuanzhe, Gabriel Saadia, Yu Hong, Hendler James, Ghassemi Marzyeh, Dey Anind K., Wang Dakuo
- Pretraining Data Mixtures Enable Narrow Model Selection Capabilities In Transformer Models Yadlowsky Steve, Doshi Lyric, Tripuraneni Nilesh
- Understanding In-context Learning From Repetitions Yan Jianhao, Xu Jin, Song Chiyu, Wu Chenming, Li Yafu, Zhang Yue
- Iterative Forward Tuning Boosts In-context Learning In Language Models Yang Jiaxi, Hui Binyuan, Yang Min, Wang Bailin, Li Bowen, Li Binhua, Huang Fei, Li Yongbin
- Towards Interpretable Mental Health Analysis With Large Language Models Yang Kailai, Ji Shaoxiong, Zhang Tianlin, Xie Qianqian, Kuang Ziyan, Ananiadou Sophia
- Supervised Knowledge Makes Large Language Models Better In-context Learners Yang Linyi, Zhang Shuibai, Yu Zhuohao, Bao Guangsheng, Wang Yidong, Wang Jindong, Xu Ruochen, Ye Wei, Xie Xing, Chen Weizhu, Zhang Yue
- Human-in-the-loop Machine Translation With Large Language Model Yang Xinyi, Zhan Runzhe, Wong Derek F., Wu Junchao, Chao Lidia S.
- On The Calibration Of Multilingual Question Answering Llms Yang Yahan, Dan Soham, Roth Dan, Lee Insup
- Improving The Reliability Of Large Language Models By Leveraging Uncertainty-aware In-context Learning Yang Yuchen, Li Houqiang, Wang Yanfeng, Wang Yu
- Alignedcot: Prompting Large Language Models Via Native-speaking Demonstrations Yang Zhicheng, Huang Yinya, Xiong Jing, Feng Liang, Liang Xiaodan, Wang Yiwei, Tang Jing
- Benchmarking Llm-based Machine Translation On Cultural Awareness Yao Binwei, Jiang Ming, Yang Diyi, Hu Junjie
- Improving Language Model Negotiation With Self-play And In-context Learning From AI Feedback Fu Yao, Peng Hao, Khot Tushar, Lapata Mirella
- Diffusion Language Models Can Perform Many Tasks With Scaling And Instruction-finetuning Ye Jiasheng, Zheng Zaixiang, Bao Yu, Qian Lihua, Gu Quanquan
- Do Large Language Models Know What They Don't Know? Yin Zhangyue, Sun Qiushi, Guo Qipeng, Wu Jiawen, Qiu Xipeng, Huang Xuanjing
- Decoding In-context Learning: Neuroscience-inspired Analysis Of Representations In Large Language Models Yousefi Safoora, Betthauser Leo, Hasanbeig Hosein, Millière Raphaël, Momennejad Ida
- Explanation-aware Soft Ensemble Empowers Large Language Model In-context Learning Yu Yue, Shen Jiaming, Liu Tianqi, Qin Zhen, Yan Jing Nathan, Liu Jialu, Zhang Chao, Bendersky Michael
- Pre-training To Learn In Context Gu Yuxian, Dong Li, Wei Furu, Huang Minlie
- Mm-narrator: Narrating Long-form Videos With Multimodal In-context Learning Zhang Chaoyi, Lin Kevin, Yang Zhengyuan, Wang Jianfeng, Li Linjie, Lin Chung-ching, Liu Zicheng, Wang Lijuan
- Efficient Toxic Content Detection By Bootstrapping And Distilling Large Language Models Zhang Jiang, Wu Qiong, Xu Yiming, Cao Cheng, Du Zheng, Psounis Konstantinos
- The Potential And Pitfalls Of Using A Large Language Model Such As Chatgpt Or GPT-4 As A Clinical Assistant Zhang Jingqing, Sun Kai, Jagadeesh Akshay, Ghahfarokhi Mahta, Gupta Deepa, Gupta Ashok, Gupta Vibhor, Guo Yike
- Can LLM Find The Green Circle? Investigation And Human-guided Tool Manipulation For Compositional Generalization Zhang Min, He Jianfeng, Lei Shuo, Yue Murong, Wang Linhang, Lu Chang-tien
- Making Large Language Models Perform Better In Knowledge Graph Completion Zhang Yichi, Chen Zhuo, Guo Lingbing, Xu Yajing, Zhang Wen, Chen Huajun
- Meta Prompting For AI Systems Zhang Yifan, Yuan Yang, Yao Andrew Chi-chih
- In-context Exemplars As Clues To Retrieving From Large Associative Memory Zhao Jiachen
- Knn-icl: Compositional Task-oriented Parsing Generalization With Nearest Neighbor In-context Learning Zhao Wenting, Liu Ye, Wan Yao, Wang Yibo, Wu Qingyang, Deng Zhongfen, Du Jiangshu, Liu Shuaiqi, Xu Yunlong, Yu Philip S.
- Multi-modal In-context Learning Makes An Ego-evolving Scene Text Recognizer Zhao Zhen, Tang Jingqun, Lin Chunhui, Wu Binghong, Huang Can, Liu Hao, Tan Xin, Zhang Zhizhong, Xie Yuan
- Can We Edit Factual Knowledge By In-context Learning? Zheng Ce, Li Lei, Dong Qingxiu, Fan Yuxuan, Wu Zhiyong, Xu Jingjing, Chang Baobao
- Noisy Exemplars Make Large Language Models More Robust: A Domain-agnostic Behavioral Analysis Zheng Hongyi, Saparov Abulhair
- Trojfsp: Trojan Insertion In Few-shot Prompt Tuning Zheng Mengxin, Xue Jiaqi, Chen Xun, Wang Yanshan, Lou Qian, Jiang Lei
- Building Emotional Support Chatbots In The Era Of Llms Zheng Zhonghua, Liao Lizi, Deng Yang, Nie Liqiang
- Principle-driven Self-alignment Of Language Models From Scratch With Minimal Human Supervision Sun Zhiqing, Shen Yikang, Zhou Qinhong, Zhang Hongxin, Chen Zhenfang, Cox David, Yang Yiming, Gan Chuang
- Language Agent Tree Search Unifies Reasoning Acting And Planning In Language Models Zhou Andy, Yan Kai, Shlapentokh-rothman Michal, Wang Haohan, Wang Yu-xiong
- Batch Calibration: Rethinking Calibration For In-context Learning And Prompt Engineering Zhou Han, Wan Xingchen, Proleev Lev, Mincu Diana, Chen Jilin, Heller Katherine, Roy Subhrajit
- Llms Learn Task Heuristics From Demonstrations: A Heuristic-driven Prompting Strategy For Document-level Event Argument Extraction Zhou Hanzhang, Qian Junlang, Feng Zijian, Lu Hui, Zhu Zixiao, Mao Kezhi
- The Mystery Of In-context Learning: A Comprehensive Survey On Interpretation And Analysis Zhou Yuxiang, Li Jiazheng, Xiang Yanzheng, Yan Hanqi, Gui Lin, He Yulan
- Overview Of The Promptcblue Shared Task In CHIP2023 Zhu Wei, Wang Xiaoling, Chen Mosha, Tang Buzhou
- Concept-aware Training Improves In-context Learning Ability Of Language Models Štefánik Michal, Kadlčík Marek
- A Dataset And Benchmark For Hospital Course Summarization With Adapted Large Language Models Aali Asad, Van Veen Dave, Arefeen Yamin Ishraq, Hom Jason, Bluethgen Christian, Reis Eduardo Pontes, Gatidis Sergios, Clifford Namuun, Daws Joseph, Tehrani Arash S., Kim Jangwon, Chaudhari Akshay S.
- Many-shot In-context Learning Agarwal Rishabh, Singh Avi, Zhang Lei M., Bohnet Bernd, Rosias Luis, Chan Stephanie, Zhang Biao, Anand Ankesh, Abbas Zaheer, Nova Azade, Co-reyes John D., Chu Eric, Behbahani Feryal, Faust Aleksandra, Larochelle Hugo
- Illuminate: A Novel Approach For Depression Detection With Explainable Analysis And Proactive Therapy Using Prompt Engineering Agrawal Aryan
- In-context Language Learning: Architectures And Algorithms Akyürek Ekin, Wang Bailin, Kim Yoon, Andreas Jacob
- Should We Fine-tune Or RAG? Evaluating Different Techniques To Adapt Llms For Dialogue Alghisi Simone, Rizzoli Massimo, Roccabruna Gabriel, Mousavi Seyed Mahed, Riccardi Giuseppe
- Can Open-source Llms Compete With Commercial Models? Exploring The Few-shot Performance Of Current GPT Models In Biomedical Tasks Ateia Samy, Kruschwitz Udo
- Llms With Chain-of-thought Are Non-causal Reasoners Bao Guangsheng, Zhang Hongbo, Yang Linyi, Wang Cunxiang, Zhang Yue
- Exploring Autonomous Agents Through The Lens Of Large Language Models: A Review Barua Saikat
- A Systematic Analysis Of Large Language Models As Soft Reasoners: The Case Of Syllogistic Inferences Bertolazzi Leonardo, Gatt Albert, Bernardi Raffaella
- Soft Prompting For Unlearning In Large Language Models Bhaila Karuna, Van Minh-hao, Wu Xintao
- On The Robustness Of Language Models For Tabular Question Answering Bhandari Kushal Raj, Xing Sixue, Dan Soham, Gao Jianxi
- Task-oriented Dialogue With In-context Learning Bocklisch Tom, Werkmeister Thomas, Varshneya Daksh, Nichol Alan
- Exploring And Benchmarking The Planning Capabilities Of Large Language Models Bohnet Bernd, Nova Azade, Parisi Aaron T, Swersky Kevin, Goshvadi Katayoon, Dai Hanjun, Schuurmans Dale, Fiedel Noah, Sedghi Hanie
- Navigating The Labyrinth: Evaluating And Enhancing Llms' Ability To Reason About Search Problems Borazjanizadeh Nasim, Herzig Roei, Darrell Trevor, Feris Rogerio, Karlinsky Leonid
- Retrieving Examples From Memory For Retrieval Augmented Neural Machine Translation: A Systematic Comparison Bouthors Maxime, Crego Josep, Yvon Francois
- In-context Learning And Fine-tuning GPT For Argument Mining Cabessa Jérémie, Hernault Hugo, Mushtaq Umer
- TACT: Advancing Complex Aggregative Reasoning With Information Extraction Tools Caciularu Avi, Jacovi Alon, Ben-david Eyal, Goldshtein Sasha, Schuster Tal, Herzig Jonathan, Elidan Gal, Globerson Amir
- Llms Are Few-shot In-context Low-resource Language Learners Cahyawijaya Samuel, Lovenia Holy, Fung Pascale
- A Survey On Mixture Of Experts Cai Weilin, Jiang Juyong, Wang Fan, Tang Jing, Kim Sunghun, Huang Jiayi
- RQ-RAG: Learning To Refine Queries For Retrieval Augmented Generation Chan Chi-min, Xu Chunpu, Yuan Ruibin, Luo Hongyin, Xue Wei, Guo Yike, Fu Jie
- Efficient Prompting Methods For Large Language Models: A Survey Chang Kaiyan, Xu Songcheng, Wang Chenglong, Luo Yingfeng, Xiao Tong, Zhu Jingbo
- When Parts Are Greater Than Sums: Individual LLM Components Can Outperform Full Models Chang Ting-yun, Thomason Jesse, Jia Robin
- Language Models Can Exploit Cross-task In-context Learning For Data-scarce Novel Tasks Chatterjee Anwoy, Tanwar Eshaan, Dutta Subhabrata, Chakraborty Tanmoy
- Icleval: Evaluating In-context Learning Ability Of Large Language Models Chen Wentong, Lin Yankai, Zhou Zhenhao, Huang Hongyun, Jia Yantao, Cao Zhao, Wen Ji-rong
- Can We Rely On LLM Agents To Draft Long-horizon Plans? Let's Take Travelplanner As An Example Chen Yanan, Pesaranghader Ali, Sadhu Tanmana, Yi Dong Hoon
- An Empirical Study Of In-context Learning In Llms For Machine Translation Chitale Pranjal A., Gala Jay, Dabre Raj
- A Computational Framework For Behavioral Assessment Of LLM Therapists Chiu Yu Ying, Sharma Ashish, Lin Inna Wanyin, Althoff Tim
- Larger Language Models Don't Care How You Think: Why Chain-of-thought Prompting Fails In Subjective Tasks Chochlakis Georgios, Pandiyan Niyantha Maruthu, Lerman Kristina, Narayanan Shrikanth
- Picle: Eliciting Diverse Behaviors From Large Language Models With Persona In-context Learning Choi Hyeong Kyu, Li Yixuan
- Ragsys: Item-cold-start Recommender As RAG System Contal Emile, Mcgoldrick Garrin
- Shortcomings Of Llms For Low-resource Translation: Retrieval And Understanding Are Both The Problem Court Sara, Elsner Micha
- Induction Heads As An Essential Mechanism For Pattern Matching In In-context Learning Crosbie J., Shutova E.
- Efficiently Exploring Large Language Models For Document-level Machine Translation With In-context Learning Cui Menglong, Du Jiangcun, Zhu Shaolin, Xiong Deyi
- Phaseevo: Towards Unified In-context Prompt Optimization For Large Language Models Cui Wendi, Zhang Jiaxin, Li Zhuohang, Sun Hao, Lopez Damien, Das Kamalika, Malin Bradley, Kumar Sricharan
- Prompting Open-source And Commercial Language Models For Grammatical Error Correction Of English Learner Text Davis Christopher, Caines Andrew, Andersen Øistein, Taslimipoor Shiva, Yannakoudakis Helen, Yuan Zheng, Bryant Christopher, Rei Marek, Buttery Paula
- Understanding Biases In Chatgpt-based Recommender Systems: Provider Fairness, Temporal Stability, And Recency Deldjoo Yashar
- Evaluating Linguistic Capabilities Of Multimodal Llms In The Lens Of Few-shot Learning Dogan Mustafa, Kesen Ilker, Calixto Iacer, Erdem Aykut, Erdem Erkut
- Jailbreaking Text-to-image Models With Llm-based Agents Dong Yingkai, Li Zheng, Meng Xiangtao, Yu Ning, Guo Shanqing
- Towards Multimodal In-context Learning For Vision & Language Models Doveh Sivan, Perek Shaked, Mirza M. Jehanzeb, Lin Wei, Alfassy Amit, Arbelle Assaf, Ullman Shimon, Karlinsky Leonid
- In-context Learning With Reinforcement Learning For Incomplete Utterance Rewriting Du Haowei, Zhao Dongyan
- DARA: Decomposition-alignment-reasoning Autonomous Language Agent For Question Answering Over Knowledge Graphs Fang Haishuo, Zhu Xiaodan, Gurevych Iryna
- Exploring Automated Distractor Generation For Math Multiple-choice Questions Via Large Language Models Feng Wanyong, Lee Jaewook, Mcnichols Hunter, Scarlatos Alexander, Smith Digory, Woodhead Simon, Ornelas Nancy Otero, Lan Andrew
- An Application Of Large Language Models To Coding Negotiation Transcripts Friedman Ray, Cho Jaewoo, Brett Jeanne, Zhan Xuhui, Han Ningyu, Kannan Sriram, Ma Yingxiang, Spencer-smith Jesse, Jäckel Elisabeth, Zerres Alfred, Hooper Madison, Babbit Katie, Acharya Manish, Adair Wendi, Aslani Soroush, Aykaç Tayfun, Bauman Chris, Bennett Rebecca, Brady Garrett, Briggs Peggy, Dowie Cheryl, Eck Chase, Geiger Igmar, Jacob Frank, Kern Molly, Lee Sujin, Liu Leigh Anne, Liu Wu, Loewenstein Jeffrey, Lytle Anne, Ma Li, Mann Michel, Mislin Alexandra, Mitchell Tyree, Nagler Hannah Martensen Née, Nandkeolyar Amit, Olekalns Mara, Paliakova Elena, Parlamis Jennifer, Pierce Jason, Pierce Nancy, Pinkley Robin, Prime Nathalie, Ramirez-marin Jimena, Rockmann Kevin, Ross William, Semnani-azad Zhaleh, Schroeder Juliana, Smith Philip, Stimmer Elena, Swaab Roderick, Thompson Leigh, Tinsley Cathy, Tuncel Ece, Weingart Laurie, Wilken Robert, Yao Jingjing, Zhang Zhi-xue
- Unifying Demonstration Selection And Compression For In-context Learning Gao Jun, Cao Ziqiang, Li Wenjie
- AIM: Let Any Multi-modal Large Language Models Embrace Efficient In-context Learning Gao Jun, Qiao Qian, Cao Ziqiang, Wang Zili, Li Wenjie
- Harnessing The Intrinsic Knowledge Of Pretrained Language Models For Challenging Text Classification Settings Gao Lingyu
- Meta Reasoning For Large Language Models Gao Peizhong, Xie Ao, Mao Shaoguang, Wu Wenshan, Xia Yan, Mi Haipeng, Wei Furu
- Customizing Language Model Responses With Contrastive In-context Learning Gao Xiang, Das Kamalika
- Introducing Bode: A Fine-tuned Large Language Model For Portuguese Prompt-based Task Garcia Gabriel Lino, Paiola Pedro Henrique, Morelli Luis Henrique, Candido Giovani, Júnior Arnaldo Cândido, Jodas Danilo Samuel, Afonso Luis C. S., Guilherme Ivan Rizzo, Penteado Bruno Elias, Papa João Paulo
- Uncovering Hidden Intentions: Exploring Prompt Recovery For Deeper Insights Into Generated Texts Give Louis, Zaoral Timo, Bruno Maria Antonietta
- Memorization In In-context Learning Golchin Shahriar, Surdeanu Mihai, Bethard Steven, Blanco Eduardo, Riloff Ellen
- Is Mamba Capable Of In-context Learning? Grazzi Riccardo, Siems Julien, Schrodi Simon, Brox Thomas, Hutter Frank
- Talking The Talk Does Not Entail Walking The Walk: On The Limits Of Large Language Models In Lexical Entailment Recognition Greco Candida M., La Cava Lucio, Tagarelli Andrea
- Enhancing Collaborative Semantics Of Language Model-driven Recommendations Via Graph-aware Learning Guan Zhong, Wu Likang, Zhao Hongke, He Ming, Fan Jianpin
- Look Before You Leap: Towards Decision-aware And Generalizable Tool-usage For Large Language Models Gui Anchun, Li Jian, Dai Yong, Du Nan, Xiao Han
- AAVENUE: Detecting LLM Biases On NLU Tasks In AAVE Via A Novel Benchmark Gupta Abhay, Meng Philip, Yurtseven Ece, O'brien Sean, Zhu Kevin
- Irel At Semeval-2024 Task 9: Improving Conventional Prompting Methods For Brain Teasers Gupta Harshit, Chaudhary Manav, Raha Tathagata, Subramanian Shivansh, Varma Vasudeva
- The Unreasonable Effectiveness Of Easy Training Data For Hard Tasks Hase Peter, Bansal Mohit, Clark Peter, Wiegreffe Sarah
- Large Language Models Are Biased Reinforcement Learners Hayes William M., Yax Nicolas, Palminteri Stefano
- From Words To Actions: Unveiling The Theoretical Underpinnings Of Llm-driven Autonomous Systems He Jianliang, Chen Siyu, Zhang Fengzhuo, Yang Zhuoran
- Prompt-based Bias Calibration For Better Zero/few-shot Learning Of Language Models He Kang, Long Yinghan, Roy Kaushik
- Automated Black-box Prompt Engineering For Personalized Text-to-image Generation He Yutong, Robey Alexander, Murata Naoki, Jiang Yiding, Williams Joshua, Pappas George J., Hassani Hamed, Mitsufuji Yuki, Salakhutdinov Ruslan, Kolter J. Zico
- Camelot: Towards Large Language Models With Training-free Consolidated Associative Memory He Zexue, Karlinsky Leonid, Kim Donghyun, Mcauley Julian, Krotov Dmitry, Feris Rogerio
- Position Engineering: Boosting Large Language Models Through Positional Information Manipulation He Zhiyuan, Jiang Huiqiang, Wang Zilong, Yang Yuqing, Qiu Luna, Qiu Lili
- Can Large Language Models Make The Grade? An Empirical Study Evaluating Llms Ability To Mark Short Answer Questions In K-12 Education Henkel Owen, Boxer Adam, Hills Libby, Roberts Bill
- Multimodal Task Vectors Enable Many-shot Multimodal In-context Learning Huang Brandon, Mitra Chancharik, Arbelle Assaf, Karlinsky Leonid, Darrell Trevor, Herzig Roei
- Generating Educational Materials With Different Levels Of Readability Using Llms Huang Chieh-yang, Wei Jing, Huang Ting-hao 'kenneth'
- Mitigating Catastrophic Forgetting In Large Language Models With Self-synthesized Rehearsal Huang Jianheng, Cui Leyang, Wang Ante, Yang Chengyi, Liao Xinting, Song Linfeng, Yao Junfeng, Su Jinsong
- Learning Fine-grained Grounded Citations For Attributed Large Language Models Huang Lei, Feng Xiaocheng, Ma Weitao, Gu Yuxuan, Zhong Weihong, Feng Xiachong, Yu Weijiang, Peng Weihua, Tang Duyu, Tu Dandan, Qin Bing
- Rectifying Demonstration Shortcut In In-context Learning Jang Joonwon, Jang Sanghwan, Kweon Wonbin, Jeon Minjin, Yu Hwanjo
- L4Q: Parameter Efficient Quantization-aware Fine-tuning On Large Language Models Jeon Hyesung, Kim Yulhwa, Kim Jae-joon
- Linking In-context Learning In Transformers To Human Episodic Memory Ji-an Li, Zhou Corey Y., Benna Marcus K., Mattar Marcelo G.
- Seek And Solve Reasoning For Table Question Answering Jiang Ruya, Wang Chun, Deng Weihong
- Large Visual-language Models Are Also Good Classifiers: A Study Of In-context Multimodal Fake News Detection Jiang Ye, Wang Yimin
- Investigating Multi-hop Factual Shortcuts In Knowledge Editing Of Large Language Models Ju Tianjie, Chen Yijin, Yuan Xinwei, Zhang Zhuosheng, Du Wei, Zheng Yubin, Liu Gongshen
- LARA: Linguistic-adaptive Retrieval-augmented Llms For Multi-turn Intent Classification Junhua Liu, Keat Tan Yong, Bin Fu
- MEDIC: Towards A Comprehensive Framework For Evaluating Llms In Clinical Applications Kanithi Praveen K, Christophe Clément, Pimentel Marco Af, Raha Tathagata, Saadi Nada, Javed Hamza, Maslenkova Svetlana, Hayat Nasir, Rajan Ronnie, Khan Shadab
- Self-training Large Language Models For Improved Visual Program Synthesis With Visual Reinforcement Khan Zaid, Bg Vijay Kumar, Schulter Samuel, Fu Yun, Chandraker Manmohan
- Aligning Large Language Models With Self-generated Preference Data Kim Dongyoung, Lee Kimin, Shin Jinwoo, Kim Jaehyung
- Exploring Prompting Methods For Mitigating Class Imbalance Through Synthetic Data Generation With Large Language Models Kim Jinhee, Kim Taesung, Choo Jaegul
- Groundial: Human-norm Grounded Safe Dialog Response Generation Kim Siwon, Dai Shuyang, Kachuee Mohammad, Ray Shayan, Taghavi Tara, Yoon Sungroh
- Audio Flamingo: A Novel Audio Language Model With Few-shot Learning And Dialogue Abilities Kong Zhifeng, Goel Arushi, Badlani Rohan, Ping Wei, Valle Rafael, Catanzaro Bryan
- Retrieval-augmented Code Generation For Situated Action Generation: A Case Study On Minecraft Kranti Chalamalasetti, Hakimov Sherzod, Schlangen David
- Synthdst: Synthetic Data Is All You Need For Few-shot Dialog State Tracking Kulkarni Atharva, Tseng Bo-hsiang, Moniz Joel Ruben Antony, Piraviperumal Dhivya, Yu Hong, Bhargava Shruti
- Multi-modal Generation Via Cross-modal In-context Learning Kumar Amandeep, Naseer Muzammal, Narayan Sanath, Anwer Rao Muhammad, Khan Salman, Cholakkal Hisham
- RVISA: Reasoning And Verification For Implicit Sentiment Analysis Lai Wenna, Xie Haoran, Xu Guandong, Li Qing
- Large Language Models As Evolution Strategies Lange Robert Tjarko, Tian Yingtao, Tang Yujin
- Benchmarks Underestimate The Readiness Of Multi-lingual Dialogue Agents Lee Andrew H., Semnani Sina J., Castillo-lópez Galo, De Chalendar Gaël, Choudhury Monojit, Dua Ashna, Kavitha Kapil Rajesh, Kim Sungkyun, Kodali Prashant, Kumaraguru Ponnurangam, Lombard Alexis, Moradshahi Mehrad, Park Gihyun, Semmar Nasredine, Seo Jiwon, Shen Tianhao, Shrivastava Manish, Xiong Deyi, Lam Monica S.
- Prexme! Large Scale Prompt Exploration Of Open Source Llms For Machine Translation And Summarization Evaluation Leiter Christoph, Eger Steffen
- Recall, Retrieve And Reason: Towards Better In-context Relation Extraction Li Guozheng, Wang Peng, Ke Wenjun, Guo Yikai, Ji Ke, Shang Ziyu, Liu Jiajun, Xu Zijie
- Prompt4vis: Prompting Large Language Models With Example Mining And Schema Filtering For Tabular Data Visualization Li Shuaimin, Chen Xuanang, Song Yuanfeng, Song Yunze, Zhang Chen
- Long-context Llms Struggle With Long In-context Learning Li Tianle, Zhang Ge, Do Quy Duc, Yue Xiang, Chen Wenhu
- Stbench: Assessing The Ability Of Large Language Models In Spatio-temporal Analysis Li Wenbin, Yao Di, Zhao Ruibo, Chen Wenjie, Xu Zijie, Luo Chengxue, Gong Chang, Jing Quanliang, Tan Haining, Bi Jingping
- Spin Glass Model Of In-context Learning Li Yuhao, Bai Ruoran, Huang Haiping
- Self-instructed Derived Prompt Generation Meets In-context Learning: Unlocking New Potential Of Black-box Llms Li Zhuo, Du Yuhao, Hu Jinpeng, Wan Xiang, Gao Anningzhe
- Implicit In-context Learning Li Zhuowei, Xu Zihao, Han Ligong, Gao Yunhe, Wen Song, Liu Di, Wang Hao, Metaxas Dimitris N.
- Learning-from-mistakes Prompting For Indigenous Language Translation Liao You-cheng, Yu Chen-jui, Lin Chi-yi, Yun He-feng, Wang Yen-hsiang, Li Hsiao-min, Fan Yao-chung
- Towards Automated Data Sciences With Natural Language And Sagecopilot: Practices And Lessons Learned Liao Yuan, Bian Jiang, Yun Yuhui, Wang Shuo, Zhang Yubo, Chu Jiaming, Wang Tao, Li Kewei, Li Yuchen, Li Xuhong, Ji Shilei, Xiong Haoyi
- Adapting Mental Health Prediction Tasks For Cross-lingual Learning Via Meta-training And In-context Learning With Large Language Model Lifelo Zita, Ning Huansheng, Dhelim Sahraoui
- Mala-500: Massive Language Adaptation Of Large Language Models Lin Peiqin, Ji Shaoxiong, Tiedemann Jörg, Martins André F. T., Schütze Hinrich
- Navigating The Dual Facets: A Comprehensive Evaluation Of Sequential Memory Editing In Large Language Models Lin Zihao, Beigi Mohammad, Li Hongxuan, Zhou Yufan, Zhang Yuxiang, Wang Qifan, Yin Wenpeng, Huang Lifu
- An Incomplete Loop: Instruction Inference, Instruction Following, And In-context Learning In Language Models Liu Emmy, Neubig Graham, Andreas Jacob
- Mibench: Evaluating Multimodal Large Language Models Over Multiple Images Liu Haowei, Zhang Xi, Xu Haiyang, Shi Yaya, Jiang Chaoya, Yan Ming, Zhang Ji, Huang Fei, Yuan Chunfeng, Li Bing, Hu Weiming
- \(Se^2\): Sequential Example Selection For In-context Learning Liu Haoyu, Liu Jianfeng, Huang Shaohan, Zhan Yuefeng, Sun Hao, Deng Weiwei, Wei Furu, Zhang Qi
- Laida: Linguistics-aware In-context Learning With Data Augmentation For Metaphor Components Identification Liu Hongde, He Chenyuan, Meng Feiyang, Niu Changyong, Jia Yuxiang
- Unraveling The Mechanics Of Learning-based Demonstration Selection For In-context Learning Liu Hui, Wang Wenya, Sun Hao, Tian Chris Xing, Kong Chenqi, Dong Xin, Li Haoliang
- Demorank: Selecting Effective Demonstrations For Large Language Models In Ranking Task Liu Wenhan, Zhu Yutao, Dou Zhicheng
- Let's Learn Step By Step: Enhancing In-context Learning Ability With Curriculum Learning Liu Yinpeng, Liu Jiawei, Shi Xiang, Cheng Qikai, Huang Yong, Lu Wei
- Enabling Weak Llms To Judge Response Reliability Via Meta Ranking Liu Zijun, Kou Boqun, Li Peng, Yan Ming, Zhang Ji, Huang Fei, Liu Yang
- Large Language Models Know What Makes Exemplary Contexts Long Quanyu, Chen Jianda, Wang Wenya, Pan Sinno Jialin
- Does In-context Learning Really Learn? Rethinking How Large Language Models Respond And Solve Tasks Via In-context Learning Long Quanyu, Wu Yin, Wang Wenya, Pan Sinno Jialin
- Asymptotic Theory Of In-context Learning By Linear Attention Lu Yue M., Letey Mary I., Zavatone-veth Jacob A., Maiti Anindita, Pehlevan Cengiz
- Crosstune: Black-box Few-shot Classification With Label Enhancement Luo Danqing, Zhang Chen, Zhang Yan, Li Haizhou
- WILBUR: Adaptive In-context Learning For Robust And Accurate Web Agents Lutz Michael, Bohra Arth, Saroyan Manvel, Harutyunyan Artem, Campagna Giovanni
- Exploring The Role Of Transliteration In In-context Learning For Low-resource Languages Written In Non-latin Scripts Ma Chunlan, Liu Yihong, Ye Haotian, Schütze Hinrich
- A Data Generation Perspective To The Mechanism Of In-context Learning Mao Haitao, Liu Guangliang, Ma Yao, Wang Rongrong, Johnson Kristen, Tang Jiliang
- Understanding And Mitigating Language Confusion In Llms Marchisio Kelly, Ko Wei-yin, Bérard Alexandre, Dehaze Théo, Ruder Sebastian
- German Also Hallucinates! Inconsistency Detection In News Summaries With The Absinth Dataset Mascarell Laura, Chalumattu Ribin, Rios Annette
- MM1: Methods, Analysis & Insights From Multimodal LLM Pre-training Mckinzie Brandon, Gan Zhe, Fauconnier Jean-philippe, Dodge Sam, Zhang Bowen, Dufter Philipp, Shah Dhruti, Du Xianzhi, Peng Futang, Weers Floris, Belyi Anton, Zhang Haotian, Singh Karanjeet, Kang Doug, Jain Ankur, Hè Hongyu, Schwarzer Max, Gunter Tom, Kong Xiang, Zhang Aonan, Wang Jianyu, Wang Chong, Du Nan, Lei Tao, Wiseman Sam, Yin Guoli, Lee Mark, Wang Zirui, Pang Ruoming, Grasch Peter, Toshev Alexander, Yang Yinfei
- Linearizing Large Language Models Mercat Jean, Vasiljevic Igor, Keh Sedrick, Arora Kushal, Dave Achal, Gaidon Adrien, Kollar Thomas
- Multimodal Contrastive In-context Learning Miyanishi Yosuke, Nguyen Minh Le
- Can We Trust Large Language Models Generated Code? A Framework For In-context Learning, Security Patterns, And Code Evaluations Across Diverse Llms Mohsin Ahmad, Janicke Helge, Wood Adrian, Sarker Iqbal H., Maglaras Leandros, Janjua Naeem
- Xc-cache: Cross-attending To Cached Context For Efficient LLM Inference Monteiro João, Marcotte Étienne, Noël Pierre-andré, Zantedeschi Valentina, Vázquez David, Chapados Nicolas, Pal Christopher, Taslakian Perouz
- Exploring The Landscape Of Large Language Models: Foundations, Techniques, And Challenges Moradi Milad, Yan Ke, Colwell David, Samwald Matthias, Asgari Rhona
- Language Modelling Approaches To Adaptive Machine Translation Moslem Yasmin
- Large Language Models Are Parallel Multilingual Learners Mu Yongyu, Feng Peinan, Cao Zhiquan, Wu Yuzhang, Li Bei, Wang Chenglong, Xiao Tong, Song Kai, Liu Tongran, Zhang Chunliang, Zhu Jingbo
- Are Large Language Models Actually Good At Text Style Transfer? Mukherjee Sourabrata, Ojha Atul Kr., Dušek Ondřej
- Learning Vs Retrieval: The Role Of In-context Examples In Regression With Llms Nafar Aliakbar, Venable Kristen Brent, Kordjamshidi Parisa
- Gemquad: Generating Multilingual Question Answering Datasets From Large Language Models Using Few Shot Learning Namboori Amani, Mangale Shivam, Rosenbaum Andy, Soltan Saleh
- BMIKE-53: Investigating Cross-lingual Knowledge Editing With In-context Learning Nie Ercong, Shao Bo, Ding Zifeng, Wang Mingyang, Schmid Helmut, Schütze Hinrich
- Likelihood-based Mitigation Of Evaluation Bias In Large Language Models Ohi Masanari, Kaneko Masahiro, Koike Ryuto, Loem Mengsay, Okazaki Naoaki
- NLP At UC Santa Cruz At Semeval-2024 Task 5: Legal Answer Validation Using Few-shot Multi-choice QA Pahilajani Anish, Jain Samyak Rajesh, Trivedi Devasha
- Investigating LLM Applications In E-commerce Palen-michel Chester, Wang Ruixiang, Zhang Yipeng, Yu David, Xu Canran, Wu Zhe
- POMP: Probability-driven Meta-graph Prompter For Llms In Low-resource Unsupervised Neural Machine Translation Pan Shilong, Tian Zhiliang, Ding Liang, Huang Zhen, Wen Zhihua, Li Dongsheng
- Revisiting Demonstration Selection Strategies In In-context Learning Peng Keqin, Ding Liang, Yuan Yancheng, Liu Xuebo, Zhang Min, Ouyang Yuanxin, Tao Dacheng
- Learnable In-context Vector For Visual Question Answering Peng Yingzhe, Hao Chenduo, Yang Xu, Peng Jiawei, Hu Xinting, Geng Xin
- Doccgen: Document-based Controlled Code Generation Pimparkhede Sameer, Kammakomati Mehant, Tamilselvam Srikanth, Kumar Prince, Kumar Ashok Pon, Bhattacharyya Pushpak
- Revisiting Verilogeval: Newer Llms, In-context Learning, And Specification-to-rtl Tasks Pinckney Nathaniel, Batten Christopher, Liu Mingjie, Ren Haoxing, Khailany Brucek
- Reasoning With Large Language Models, A Survey Plaat Aske, Wong Annie, Verberne Suzan, Broekens Joost, Van Stein Niki, Back Thomas
- Machine Translation With Large Language Models: Prompt Engineering For Persian, English, And Russian Directions Pourkamali Nooshin, Sharifi Shler Ebrahim
- Crafting A Good Prompt Or Providing Exemplary Dialogues? A Study Of In-context Learning For Persona-based Dialogue Generation Pu Jiashu, Wan Yajing, Zhang Yuru, Chen Jing, Cheng Ling, Shao Qian, Chang Yongzhu, Lv Tangjie, Zhang Rongsheng
- Code Prompting Elicits Conditional Reasoning Abilities In Text+code Llms Puerto Haritz, Tutek Martin, Aditya Somak, Zhu Xiaodan, Gurevych Iryna
- Scenarios And Approaches For Situated Natural Language Explanations Qiu Pengshuo, Rudzicz Frank, Zhu Zining
- Before Generation, Align It! A Novel And Effective Strategy For Mitigating Hallucinations In Text-to-sql Generation Qu Ge, Li Jinyang, Li Bowen, Qin Bowen, Huo Nan, Ma Chenhao, Cheng Reynold
- Tokenrec: Learning To Tokenize ID For Llm-based Generative Recommendation Qu Haohao, Fan Wenqi, Zhao Zihuai, Li Qing
- DEEP-ICL: Definition-enriched Experts For Language Model In-context Learning Qu Xingwei, Liang Yiming, Wang Yucheng, Zheng Tianyu, Yue Tommy, Ma Lei, Huang Stephen W., Zhang Jiajun, Shi Yinan, Lin Chenghua, Fu Jie, Zhang Ge
- Enhancing Ethical Explanations Of Large Language Models Through Iterative Symbolic Refinement Quan Xin, Valentino Marco, Dennis Louise A., Freitas André
- Masontigers At Semeval-2024 Task 9: Solving Puzzles With An Ensemble Of Chain-of-thoughts Raihan Md Nishat, Goswami Dhiman, Emran Al Nahian Bin, Puspo Sadiya Sayara Chowdhury, Ganguly Amrita, Zampieri Marcos
- SSP: Self-supervised Prompting For Cross-lingual Transfer To Low-resource Languages Using Large Language Models Rathore Vipul, Deb Aniruddha, Chandresh Ankish, Singla Parag, Mausam
- Mitigating Exaggerated Safety In Large Language Models Ray Ruchira, Bhalani Ruchi
- Analyzing And Adapting Large Language Models For Few-shot Multilingual NLU: Are We There Yet? Razumovskaia Evgeniia, Vulić Ivan, Korhonen Anna
- Beyond Performance: Quantifying And Mitigating Label Bias In Llms Reif Yuval, Schwartz Roy
- Identifying Semantic Induction Heads To Understand In-context Learning Ren Jie, Guo Qipeng, Yan Hang, Liu Dongrui, Zhang Quanshi, Qiu Xipeng, Lin Dahua
- Empirical Study Of Symmetrical Reasoning In Conversational Chatbots Rim Daniela N., Choi Heeyoul
- In-context Learning Demonstration Selection Via Influence Analysis S. Vinay M., Van Minh-hao, Wu Xintao
- Capabilities Of Gemini Models In Medicine Saab Khaled, Tu Tao, Weng Wei-hung, Tanno Ryutaro, Stutz David, Wulczyn Ellery, Zhang Fan, Strother Tim, Park Chunjong, Vedadi Elahe, Chaves Juanma Zambrano, Hu Szu-yeu, Schaekermann Mike, Kamath Aishwarya, Cheng Yong, Barrett David G. T., Cheung Cathy, Mustafa Basil, Palepu Anil, Mcduff Daniel, Hou Le, Golany Tomer, Liu Luyang, Alayrac Jean-baptiste, Houlsby Neil, Tomasev Nenad, Freyberg Jan, Lau Charles, Kemp Jonas, Lai Jeremy, Azizi Shekoofeh, Kanada Kimberly, Man Siwai, Kulkarni Kavita, Sun Ruoxi, Shakeri Siamak, He Luheng, Caine Ben, Webson Albert, Latysheva Natasha, Johnson Melvin, Mansfield Philip, Lu Jian, Rivlin Ehud, Anderson Jesper, Green Bradley, Wong Renee, Krause Jonathan, Shlens Jonathon, Dominowska Ewa, Eslami S. M. Ali, Chou Katherine, Cui Claire, Vinyals Oriol, Kavukcuoglu Koray, Manyika James, Dean Jeff, Hassabis Demis, Matias Yossi, Webster Dale, Barral Joelle, Corrado Greg, Semturs Christopher, Mahdavi S. Sara, Gottweis Juraj, Karthikesalingam Alan, Natarajan Vivek
- Utebc-nlp At Semeval-2024 Task 9: Can Llms Be Lateral Thinkers? Sadeghi Pouya, Abaskohi Amirhossein, Yaghoobzadeh Yadollah
- Synergizing In-context Learning With Hints For End-to-end Task-oriented Dialog Systems Saley Vishal Vivek, Das Rocktim Jyoti, Raghu Dinesh, Mausam
- Berts Are Generative In-context Learners Samuel David
- ICAL: Continual Learning Of Multimodal Agents By Transforming Trajectories Into Actionable Insights Sarch Gabriel, Jang Lawrence, Tarr Michael J., Cohen William W., Marino Kenneth, Fragkiadaki Katerina
- Can Small Language Models Learn, Unlearn, And Retain Noise Patterns? Scaria Nicy, Kennedy Silvester John Joseph, Subramani Deepak
- In-context Learning Agents Are Asymmetric Belief Updaters Schubert Johannes A., Jagadish Akshay K., Binz Marcel, Schulz Eric
- Team UTSA-NLP At Semeval 2024 Task 5: Prompt Ensembling For Argument Reasoning In Civil Procedures With GPT4 Schumacher Dan, Rios Anthony
- Benchmarking Large Language Models For Math Reasoning Tasks Seßler Kathrin, Rong Yao, Gözlüklü Emek, Kasneci Enkelejda
- Medpromptx: Grounded Multimodal Prompting For Chest X-ray Diagnosis Shaaban Mai A., Khan Adnan, Yaqub Mohammad
- Show, Don't Tell: Aligning Language Models With Demonstrated Feedback Shaikh Omar, Lam Michelle, Hejna Joey, Shao Yijia, Bernstein Michael, Yang Diyi
- Guiding In-context Learning Of Llms Through Quality Estimation For Machine Translation Sharami Javad Pourmostafa Roshan, Shterionov Dimitar, Spronck Pieter
- Retrieval-enhanced Knowledge Editing In Language Models For Multi-hop Question Answering Shi Yucheng, Tan Qiaoyu, Wu Xuansheng, Zhong Shaochen, Zhou Kaixiong, Liu Ninghao
- Why Larger Language Models Do In-context Learning Differently? Shi Zhenmei, Wei Junyi, Xu Zhuoyan, Liang Yingyu
- Deconstructing In-context Learning: Understanding Prompts Via Corruption Shivagunde Namrata, Lialin Vladislav, Muckatira Sherin, Rumshisky Anna
- Iclguard: Controlling In-context Learning Behavior For Applicability Authorization Si Wai Man, Backes Michael, Zhang Yang
- Where Does In-context Translation Happen In Large Language Models Sia Suzanna, Mueller David, Duh Kevin
- A Three-pronged Approach To Cross-lingual Adaptation With Multilingual Llms Singh Vaibhav, Krishna Amrith, Nj Karthika, Ramakrishnan Ganesh
- Maml-en-llm: Model Agnostic Meta-training Of Llms For Improved In-context Learning Sinha Sanchit, Yue Yuguang, Soto Victor, Kulkarni Mayank, Lu Jianhua, Zhang Aidong
- Out-of-distribution Generalization Via Composition: A Lens Through Induction Heads In Transformers Song Jiajun, Xu Zhuoyan, Zhong Yiqiao
- Can Many-shot In-context Learning Help Long-context LLM Judges? See More, Judge Better! Song Mingyang, Zheng Mao, Luo Xuan
- Understanding Memorisation In Llms: Dynamics, Influencing Factors, And Implications Speicher Till, Khan Mohammad Aflah, Wu Qinyuan, Nanda Vedant, Das Soumi, Ghosh Bishwamittra, Gummadi Krishna P., Terzi Evimaria
- Comparative Analysis Of Different Efficient Fine Tuning Methods Of Large Language Models (llms) In Low-resource Setting Srinivasan Krishna Prasad Varadarajan, Gumpena Prasanth, Yattapu Madhusudhana, Brahmbhatt Vishal H.
- A Comprehensive Study Of The Capabilities Of Large Language Models For Vulnerability Detection Steenhoek Benjamin, Rahman Md Mahbubur, Roy Monoshi Kumar, Alam Mirza Sanjida, Barr Earl T., Le Wei
- Fine-tuning Large Language Models For Entity Matching Steiner Aaron, Peeters Ralph, Bizer Christian
- Demonstration Augmentation For Zero-shot In-context Learning Su Yi, Tai Yunpeng, Ji Yixin, Li Juntao, Yan Bowen, Zhang Min
- Hierarchical In-context Reinforcement Learning With Hindsight Modular Reflections For Planning Sun Chuanneng, Huang Songjun, Pompili Dario
- Retrieved In-context Principles From Previous Mistakes Sun Hao, Jiang Yong, Wang Bo, Hou Yingyan, Zhang Yan, Xie Pengjun, Huang Fei
- Lloco: Learning Long Contexts Offline Tan Sijun, Li Xiuyu, Patil Shishir, Wu Ziyang, Zhang Tianjun, Keutzer Kurt, Gonzalez Joseph E., Popa Raluca Ada
- Ungrammatical-syntax-based In-context Example Selection For Grammatical Error Correction Tang Chenming, Qu Fanyi, Wu Yunfang
- Large Language Models Might Not Care What You Are Saying: Prompt Format Beats Descriptions Tang Chenming, Wang Zhixiang, Wu Yunfang
- On The Hardness Of Faithful Chain-of-thought Reasoning In Large Language Models Tanneru Sree Harsha, Ley Dan, Agarwal Chirag, Lakkaraju Himabindu
- Robustness Assessment Of Mathematical Reasoning In The Presence Of Missing And Contradictory Conditions Tian Shi-yu, Zhou Zhi, Jia Lin-han, Guo Lan-zhe, Li Yu-feng
- An Ecosage Assistant: Towards Building A Multimodal Plant Care Dialogue Assistant Tomar Mohit, Tiwari Abhisek, Saha Tulika, Jha Prince, Saha Sriparna
- Few-shot Cross-lingual Transfer For Prompting Large Language Models In Low-resource Languages Toukmaji Christopher
- Connecting The Dots: Llms Can Infer And Verbalize Latent Structure From Disparate Training Data Treutlein Johannes, Choi Dami, Betley Jan, Anil Cem, Marks Samuel, Grosse Roger Baker, Evans Owain
- An Empirical Study Of Mamba-based Language Models Waleffe Roger, Byeon Wonmin, Riach Duncan, Norick Brandon, Korthikanti Vijay, Dao Tri, Gu Albert, Hatamizadeh Ali, Singh Sudhakar, Narayanan Deepak, Kulshreshtha Garvit, Singh Vartika, Casper Jared, Kautz Jan, Shoeybi Mohammad, Catanzaro Bryan
- A & B == B & A: Triggering Logical Reasoning Failures In Large Language Models Wan Yuxuan, Wang Wenxuan, Yang Yiliu, Yuan Youliang, Huang Jen-tse, He Pinjia, Jiao Wenxiang, Lyu Michael R.
- The Whole Is Better Than The Sum: Using Aggregated Demonstrations In In-context Learning For Sequential Recommendation Wang Lei, Lim Ee-peng
- Interactive Dualchecker For Mitigating Hallucinations In Distilling Large Language Models Wang Meiyun, Suzuki Masahiro, Sakaji Hiroki, Izumi Kiyoshi
- Effective Demonstration Annotation For In-context Learning Via Language Model-based Determinantal Point Process Wang Peng, Wang Xiaobin, Lou Chao, Mao Shengyu, Xie Pengjun, Jiang Yong
- One Prompt Is Not Enough: Automated Construction Of A Mixture-of-expert Prompts Wang Ruochen, An Sohyun, Cheng Minhao, Zhou Tianyi, Hwang Sung Ju, Hsieh Cho-jui
- Secokd: Aligning Large Language Models For In-context Learning With Fewer Shots Wang Weixing, Yang Haojin, Meinel Christoph
- QUITO: Accelerating Long-context Reasoning Through Query-guided Context Compression Wang Wenshan, Wang Yihang, Fan Yixing, Liao Huaming, Guo Jiafeng
- Metatool: Facilitating Large Language Models To Master Tools With Meta-task Augmentation Wang Xiaohan, Li Dian, Zhao Yilin, Sinbadliu, Wang Hui
- A Theoretical Understanding Of Self-correction Through In-context Alignment Wang Yifei, Wu Yuyang, Wei Zeming, Jegelka Stefanie, Wang Yisen
- Sibyl: Simple Yet Effective Agent Framework For Complex Real-world Reasoning Wang Yulong, Shen Tianhao, Liu Lifeng, Xie Jian
- Measuring And Reducing LLM Hallucination Without Gold-standard Answers Wei Jiaheng, Yao Yuanshun, Ton Jean-francois, Guo Hongyi, Estornell Andrew, Liu Yang
- Instructrag: Instructing Retrieval-augmented Generation Via Self-synthesized Rationales Wei Zhepei, Chen Wei-lin, Meng Yu
- Towards Evaluating And Building Versatile Large Language Models For Medicine Wu Chaoyi, Qiu Pengcheng, Liu Jinxin, Gu Hongfei, Li Na, Zhang Ya, Wang Yanfeng, Xie Weidi
- Instructing Large Language Models To Identify And Ignore Irrelevant Conditions Wu Zhenyu, Shen Chao, Jiang Meng
- In-context Learning Can Re-learn Forbidden Tasks Xhonneux Sophie, Dobre David, Tang Jian, Gidel Gauthier, Sridhar Dhanya
- Decomposition For Enhancing Attention: Improving Llm-based Text-to-sql Through Workflow Paradigm Xie Yuanzhen, Jin Xinzhou, Xie Tao, Lin Mingxiong, Chen Liang, Yu Chenyun, Cheng Lei, Zhuo Chengxiang, Hu Bo, Li Zang
- Theoretical Understanding Of In-context Learning In Shallow Transformers With Unstructured Data Xing Yue, Lin Xiaofeng, Xu Chenheng, Suh Namjoon, Song Qifan, Cheng Guang
- Interactive-kbqa: Multi-turn Interactions For Knowledge Base Question Answering With Large Language Models Xiong Guanming, Bao Junwei, Zhao Wen
- From Introspection To Best Practices: Principled Analysis Of Demonstrations In Multimodal In-context Learning Xu Nan, Wang Fei, Zhang Sheng, Poon Hoifung, Chen Muhao
- Misconfidence-based Demonstration Selection For LLM In-context Learning Xu Shangqing, Zhang Chao
- Do Large Language Models Have Compositional Ability? An Investigation Into Limitations And Scalability Xu Zhuoyan, Shi Zhenmei, Liang Yingyu
- Do Large Language Models Understand Logic Or Just Mimick Context? Yan Junbing, Wang Chengyu, Huang Jun, Zhang Wei
- Aligning Llms Through Multi-perspective User Preference Ranking-based Feedback For Programming Question Answering Yang Hongyu, He Liyang, Hou Min, Shen Shuanghong, Li Rui, Hou Jiahui, Ma Jianhui, Zhao Junda
- Metaaligner: Towards Generalizable Multi-objective Alignment Of Language Models Yang Kailai, Liu Zhiwei, Xie Qianqian, Huang Jimin, Zhang Tianlin, Ananiadou Sophia
- Video As The New Language For Real-world Decision Making Yang Sherry, Walker Jacob, Parker-holder Jack, Du Yilun, Bruce Jake, Barreto Andre, Abbeel Pieter, Schuurmans Dale
- Unleashing The Potential Of Large Language Models For Predictive Tabular Tasks In Data Science Yang Yazheng, Wang Yuqi, Sen Sankalok, Li Lei, Liu Qi
- Enhancing In-context Learning Performance With Just Svd-based Weight Pruning: A Theoretical Perspective Yao Xinhao, Hu Xiaolin, Yang Shenzhi, Liu Yong
- Long-context Language Modeling With Parallel Context Encoding Yen Howard, Gao Tianyu, Chen Danqi
- MEND: Meta Demonstration Distillation For Efficient And Effective In-context Learning Yichuan Li, Xiyao Ma, Sixing Lu, Kyumin Lee, Xiaohu Liu, Chenlei Guo
- Internlm-math: Open Math Large Language Models Toward Verifiable Reasoning Ying Huaiyuan, Zhang Shuo, Li Linyang, Zhou Zhejian, Shao Yunfan, Fei Zhaoye, Ma Yichuan, Hong Jiawei, Liu Kuikun, Wang Ziyi, Wang Yudong, Wu Zijian, Li Shuaibin, Zhou Fengzhe, Liu Hongwei, Zhang Songyang, Zhang Wenwei, Yan Hang, Qiu Xipeng, Wang Jiayu, Chen Kai, Lin Dahua
- Focused Large Language Models Are Stable Many-shot Learners Yuan Peiwen, Feng Shaoxiong, Li Yiwei, Wang Xinglin, Zhang Yueqi, Tan Chuyi, Pan Boyuan, Wang Heda, Hu Yao, Li Kan
- Building Accurate Translation-tailored Llms With Language Aware Instruction Tuning Zan Changtong, Ding Liang, Shen Li, Zhen Yibing, Liu Weifeng, Tao Dacheng
- Tell Me Why: Explainable Public Health Fact-checking With Large Language Models Zarharan Majid, Wullschleger Pascal, Kia Babak Behkam, Pilehvar Mohammad Taher, Foster Jennifer
- Can Mllms Perform Text-to-image In-context Learning? Zeng Yuchen, Kang Wonjun, Chen Yicong, Koo Hyung Il, Lee Kangwook
- Teaching Large Language Models An Unseen Language On The Fly Zhang Chen, Liu Xiao, Lin Jiuheng, Feng Yansong
- Causal Prompting: Debiasing Large Language Model Prompting Based On Front-door Adjustment Zhang Congzhi, Zhang Linhai, Wu Jialong, Zhou Deyu, He Yulan
- Coe-sql: In-context Learning For Multi-turn Text-to-sql With Chain-of-editions Zhang Hanchong, Cao Ruisheng, Xu Hongshen, Chen Lu, Yu Kai
- The Impact Of Demonstrations On Multilingual In-context Learning: A Multidimensional Analysis Zhang Miaoran, Gautam Vagrant, Wang Mingyang, Alabi Jesujoba O., Shen Xiaoyu, Klakow Dietrich, Mosbach Marius
- Improving Diversity Of Commonsense Generation By Large Language Models Via In-context Learning Zhang Tianhui, Peng Bei, Bollegala Danushka
- In-context Principle Learning From Mistakes Zhang Tianjun, Madaan Aman, Gao Luyu, Zheng Steven, Mishra Swaroop, Yang Yiming, Tandon Niket, Alon Uri
- How Far Are We From Intelligent Visual Deductive Reasoning? Zhang Yizhe, Bai He, Zhang Ruixiang, Gu Jiatao, Zhai Shuangfei, Susskind Josh, Jaitly Navdeep
- Leveraging Biases In Large Language Models: "bias-knn" For Effective Few-shot Learning Zhang Yong, Li Hanzhang, Li Zhitao, Cheng Ning, Li Ming, Xiao Jing, Wang Jianzong
- Mg-verilog: Multi-grained Dataset Towards Enhanced Llm-assisted Verilog Generation Zhang Yongan, Yu Zhongzhi, Fu Yonggan, Wan Cheng, Lin Yingyan Celine
- Large Language Models As Reliable Knowledge Bases? Zheng Danna, Lapata Mirella, Pan Jeff Z.
- Fine-tuning Large Language Models For Domain-specific Machine Translation Zheng Jiawei, Hong Hanghai, Wang Xiaoli, Su Jingsong, Liang Yonggui, Wu Shikai
- Can Llms Learn New Concepts Incrementally Without Forgetting? Zheng Junhao, Qiu Shengjie, Ma Qianli
- Harnessing Large Language Models As Post-hoc Correctors Zhong Zhiqiang, Zhou Kuangyu, Mottin Davide
- Unibias: Unveiling And Mitigating LLM Bias Through Internal Attention And FFN Manipulation Zhou Hanzhang, Feng Zijian, Zhu Zixiao, Qian Junlang, Mao Kezhi
- DETAIL: Task Demonstration Attribution For Interpretable In-context Learning Zhou Zijian, Lin Xiaoqiang, Xu Xinyi, Prakash Alok, Rus Daniela, Low Bryan Kian Hsiang
- Eliciting The Priors Of Large Language Models Using Iterated In-context Learning Zhu Jian-qiao, Griffiths Thomas L.
- Can Large Language Models Understand Context? Zhu Yilun, Moniz Joel Ruben Antony, Bhargava Shruti, Lu Jiarui, Piraviperumal Dhivya, Li Site, Zhang Yuan, Yu Hong, Tseng Bo-hsiang
- Prompting Large Language Models For Zero-shot Clinical Prediction With Structured Longitudinal Electronic Health Record Data Zhu Yinghao, Wang Zixiang, Gao Junyi, Tong Yuning, An Jingkun, Liao Weibin, Harrison Ewen M., Ma Liantao, Pan Chengwei
- Are Large Language Models Good Statisticians? Zhu Yizhang, Du Shiyin, Li Boyan, Luo Yuyu, Tang Nan
- Generative Pre-trained Speech Language Model With Efficient Hierarchical Transformer Zhu Yongxin, Su Dan, He Liqiang, Xu Linli, Yu Dong
- VL-ICL Bench: The Devil In The Details Of Benchmarking Multimodal In-context Learning Zong Yongshuo, Bohdal Ondrej, Hospedales Timothy
- Concept-aware Data Construction Improves In-context Learning Of Language Models Štefánik Michal, Kadlčík Marek, Sojka Petr
🏷 Interpretability And Explainability
- Multimodal Explanations: Justifying Decisions And Pointing To The Evidence Park Dong Huk, Hendricks Lisa Anne, Akata Zeynep, Rohrbach Anna, Schiele Bernt, Darrell Trevor, Rohrbach Marcus
- An Operation Sequence Model For Explainable Neural Machine Translation Stahlberg Felix, Saunders Danielle, Byrne Bill
- Generating Natural Language Explanations For Visual Question Answering Using Scene Graphs And Visual Attention Ghosh Shalini, Burachas Giedrius, Ray Arijit, Ziskind Avi
- Attention Is Not Explanation Jain Sarthak, Wallace Byron C.
- Learning To Deceive With Attention-based Explanations Pruthi Danish, Gupta Mansi, Dhingra Bhuwan, Neubig Graham, Lipton Zachary C.
- Answering Complex Open-domain Questions Through Iterative Query Generation Qi Peng, Lin Xiaowen, Mehr Leo, Wang Zijian, Manning Christopher D.
- Factor Graph Attention Schwartz Idan, Yu Seunghak, Hazan Tamir, Schwing Alexander
- Taking A HINT: Leveraging Explanations To Make Vision And Language Models More Grounded Selvaraju Ramprasaath R., Lee Stefan, Shen Yilin, Jin Hongxia, Ghosh Shalini, Heck Larry, Batra Dhruv, Parikh Devi
- Generating Token-level Explanations For Natural Language Inference Thorne James, Vlachos Andreas, Christodoulopoulos Christos, Mittal Arpit
- Allennlp Interpret: A Framework For Explaining Predictions Of NLP Models Wallace Eric, Tuyls Jens, Wang Junlin, Subramanian Sanjay, Gardner Matt, Singh Sameer
- Attention Is Not Not Explanation Wiegreffe Sarah, Pinter Yuval
- Visual Entailment: A Novel Task For Fine-grained Image Understanding Xie Ning, Lai Farley, Doran Derek, Kadav Asim
- Recosa: Detecting The Relevant Contexts With Self-attention For Multi-turn Dialogue Generation Zhang Hainan, Lan Yanyan, Pang Liang, Guo Jiafeng, Cheng Xueqi
- Atom Responding Machine For Dialog Generation Zhou Ganbin, Luo Ping, Chen Jingwu, Lin Fen, Lin Leyu, He Qing
- A Study On Multimodal And Interactive Explanations For Visual Question Answering Alipour Kamran, Schulze Jurgen P., Yao Yi, Ziskind Avi, Burachas Giedrius
- The Elephant In The Interpretability Room: Why Use Attention As Explanation When We Have Saliency Methods? Bastings Jasmijn, Filippova Katja
- Adaptive Transformers For Learning Multimodal Representations Bhargava Prajjwal
- Comparative Study Of Language Models On Cross-domain Data With Model Agnostic Explainability Chhipa Mayank, Vazurkar Hrushikesh Mahesh, Kumar Abhijeet, Mishra Mridul
- Transformers As Soft Reasoners Over Language Clark Peter, Tafjord Oyvind, Richardson Kyle
- Unsupervised Explanation Generation For Machine Reading Comprehension Cui Yiming, Liu Ting, Wang Shijin, Hu Guoping
- Exploring Explainable Selection To Control Abstractive Summarization Haonan Wang, Yang Gao, Yu Bai, Lapata Mirella, Heyan Huang
- Fid-ex: Improving Sequence-to-sequence Models For Extractive Rationale Generation Lakhotia Kushal, Paranjape Bhargavi, Ghoshal Asish, Yih Wen-tau, Mehdad Yashar, Iyer Srinivasan
- LIMEADE: From AI Explanations To Advice Taking Lee Benjamin Charles Germain, Downey Doug, Lo Kyle, Weld Daniel S.
- Interpretable NLG For Task-oriented Dialogue Systems With Heterogeneous Rendering Machines Li Yangming, Yao Kaisheng
- Toward Interpretability Of Dual-encoder Models For Dialogue Response Suggestions Li Yitong, Li Dianqi, Prakash Sushant, Wang Peng
- Qiaoning At Semeval-2020 Task 4: Commonsense Validation And Explanation System Based On Ensemble Of Language Model Liu Pai
- Natural Language Rationales With Full-stack Visual Reasoning: From Pixels To Semantic Frames To Commonsense Graphs Marasović Ana, Bhagavatula Chandra, Park Jae Sung, Bras Ronan Le, Smith Noah A., Choi Yejin
- On The Stability Of Fine-tuning BERT: Misconceptions, Explanations, And Strong Baselines Mosbach Marius, Andriushchenko Maksym, Klakow Dietrich
- An Investigation Of Language Model Interpretability Via Sentence Editing Stevens Samuel, Su Yu
- Benchmarking Machine Reading Comprehension: A Psychological Perspective Sugawara Saku, Stenetorp Pontus, Aizawa Akiko
- Explain And Improve: Lrp-inference Fine-tuning For Image Captioning Models Sun Jiamei, Lapuschkin Sebastian, Samek Wojciech, Binder Alexander
- Interpretable Neural Computation For Real-world Compositional Visual Question Answering Tang Ruixue, Ma Chao
- The Language Interpretability Tool: Extensible, Interactive Visualizations And Analysis For NLP Models Tenney Ian, Wexler James, Bastings Jasmijn, Bolukbasi Tolga, Coenen Andy, Gehrmann Sebastian, Jiang Ellen, Pushkarna Mahima, Radebaugh Carey, Reif Emily, Yuan Ann
- Generic Attention-model Explainability For Interpreting Bi-modal And Encoder-decoder Transformers Chefer Hila, Gur Shir, Wolf Lior
- Enjoy The Salience: Towards Better Transformer-based Faithful Explanations With Word Salience Chrysostomou George, Aletras Nikolaos
- Improving The Faithfulness Of Attention-based Explanations With Task-specific Information For Text Classification Chrysostomou George, Aletras Nikolaos
- Multilingual Multi-aspect Explainability Analyses On Machine Reading Comprehension Models Cui Yiming, Zhang Wei-nan, Che Wanxiang, Liu Ting, Chen Zhigang, Wang Shijin
- Attention Weights In Transformer NMT Fail Aligning Words Between Sequences But Largely Explain Model Predictions Ferrando Javier, Costa-jussà Marta R.
- Interactively Providing Explanations For Transformer Language Models Friedrich Felix, Schramowski Patrick, Tauchmann Christopher, Kersting Kristian
- Improving And Diagnosing Knowledge-based Visual Question Answering Via Entity Enhanced Knowledge Injection Garcia-olano Diego, Onoe Yasumasa, Ghosh Joydeep
- KAT: A Knowledge Augmented Transformer For Vision-and-language Gui Liangke, Wang Borui, Huang Qiuyuan, Hauptmann Alex, Bisk Yonatan, Gao Jianfeng
- Greedy Gradient Ensemble For Robust Visual Question Answering Han Xinzhe, Wang Shuhui, Su Chi, Huang Qingming, Tian Qi
- Enriching Transformers With Structured Tensor-product Representations For Abstractive Summarization Jiang Yichen, Celikyilmaz Asli, Smolensky Paul, Soulos Paul, Rao Sudha, Palangi Hamid, Fernandez Roland, Smith Caitlin, Bansal Mohit, Gao Jianfeng
- Personalized Transformer For Explainable Recommendation Li Lei, Zhang Yongfeng, Chen Li
- Few-shot Self-rationalization With Natural Language Prompts Marasović Ana, Beltagy Iz, Downey Doug, Peters Matthew E.
- Diff-explainer: Differentiable Convex Optimization For Explainable Multi-hop Inference Thayaparan Mokanarangan, Valentino Marco, Ferreira Deborah, Rozanova Julia, Freitas André
- Local Explanation Of Dialogue Response Generation Tuan Yi-lin, Pryor Connor, Chen Wenhu, Getoor Lise, Wang William Yang
- Reframing Human-ai Collaboration For Generating Free-text Explanations Wiegreffe Sarah, Hessel Jack, Swayamdipta Swabha, Riedl Mark, Choi Yejin
- Graph-free Multi-hop Reading Comprehension: A Select-to-guide Strategy Wu Bohong, Zhang Zhuosheng, Zhao Hai
- On Explaining Your Explanations Of BERT: An Empirical Study With Sequence Classification Wu Zhengxuan, Ong Desmond C.
- An Explanation Of In-context Learning As Implicit Bayesian Inference Xie Sang Michael, Raghunathan Aditi, Liang Percy, Ma Tengyu
- Vl-interpret: An Interactive Visualization Tool For Interpreting Vision-language Transformers Aflalo Estelle, Du Meng, Tseng Shao-yen, Liu Yongfei, Wu Chenfei, Duan Nan, Lal Vasudev
- Can Language Models Learn From Explanations In Context? Andrew K. Lampinen, Ishita Dasgupta, Stephanie C. Y. Chan, Kory Matthewson, Michael Henry Tessler, Antonia Creswell, James L. Mcclelland, Jane X. Wang, Felix Hill
- Enabling Classifiers To Make Judgements Explicitly Aligned With Human Values Bang Yejin, Yu Tiezheng, Madotto Andrea, Lin Zhaojiang, Diab Mona, Fung Pascale
- Rethinking The Role Of Scale For In-context Learning: An Interpretability-based Case Study At 66 Billion Scale Bansal Hritik, Gopalakrishnan Karthik, Dingliwal Saket, Bodapati Sravan, Kirchhoff Katrin, Roth Dan
- Effective Cross-task Transfer Learning For Explainable Natural Language Inference With T5 Bigoulaeva Irina, Sachdeva Rachneet, Madabushi Harish Tayyar, Villavicencio Aline, Gurevych Iryna
- Structured, Flexible, And Robust: Benchmarking And Improving Large Language Models Towards More Human-like Behavior In Out-of-distribution Reasoning Tasks Collins Katherine M., Wong Catherine, Feng Jiahai, Wei Megan, Tenenbaum Joshua B.
- M6-rec: Generative Pretrained Language Models Are Open-ended Recommender Systems Cui Zeyu, Ma Jianxin, Zhou Chang, Zhou Jingren, Yang Hongxia
- Neural Knowledge Bank For Pretrained Transformers Dai Damai, Jiang Wenbin, Dong Qingxiu, Lyu Yajuan, She Qiaoqiao, Sui Zhifang
- Rlprompt: Optimizing Discrete Text Prompts With Reinforcement Learning Deng Mingkai, Wang Jianyu, Hsieh Cheng-ping, Wang Yihan, Guo Han, Shu Tianmin, Song Meng, Xing Eric P., Hu Zhiting
- Black-box Prompt Learning For Pre-trained Language Models Diao Shizhe, Huang Zhichao, Xu Ruijia, Li Xuechun, Lin Yong, Zhou Xiao, Zhang Tong
- Towards Opening The Black Box Of Neural Machine Translation: Source And Target Interpretations Of The Transformer Ferrando Javier, Gállego Gerard I., Alastruey Belen, Escolano Carlos, Costa-jussà Marta R.
- Normsage: Multi-lingual Multi-cultural Norm Discovery From Conversations On-the-fly Fung Yi R., Chakraborty Tuhin, Guo Hao, Rambow Owen, Muresan Smaranda, Ji Heng
- Rethinking With Retrieval: Faithful Large Language Model Inference He Hangfeng, Zhang Hongming, Roth Dan
- Cogvideo: Large-scale Pretraining For Text-to-video Generation Via Transformers Hong Wenyi, Ding Ming, Zheng Wendi, Liu Xinghan, Tang Jie
- Self-prompting Large Language Models For Zero-shot Open-domain QA Junlong Li, Jinyuan Wang, Zhuosheng Zhang, Hai Zhao
- Towards The Generation Of Musical Explanations With GPT-3 Krol Stephen James, Llano Maria Teresa, Mccormack Jon
- Guiding Visual Question Answering With Attention Priors Le Thao Minh, Le Vuong, Gupta Sunil, Venkatesh Svetha, Tran Truyen
- Personalized Prompt Learning For Explainable Recommendation Li Lei, Zhang Yongfeng, Chen Li
- Explanations From Large Language Models Make Small Reasoners Better Li Shiyang, Chen Jianshu, Shen Yelong, Chen Zhiyu, Zhang Xinlu, Li Zekun, Wang Hong, Qian Jing, Peng Baolin, Mao Yi, Chen Wenhu, Yan Xifeng
- Exploring Visual Interpretability For Contrastive Language-image Pre-training Li Yi, Wang Hualiang, Duan Yiqun, Xu Hang, Li Xiaomeng
- Gentus: Simulating User Behaviour And Language In Task-oriented Dialogues With Generative Transformers Lin Hsien-chin, Geishauser Christian, Feng Shutong, Lubis Nurul, Van Niekerk Carel, Heck Michael, Gašić Milica
- Learn To Explain: Multimodal Reasoning Via Thought Chains For Science Question Answering Lu Pan, Mishra Swaroop, Xia Tony, Qiu Liang, Chang Kai-wei, Zhu Song-chun, Tafjord Oyvind, Clark Peter, Kalyan Ashwin
- Text Generation With Text-editing Models Malmi Eric, Dong Yue, Mallinson Jonathan, Chuklin Aleksandr, Adamek Jakub, Mirylenka Daniil, Stahlberg Felix, Krause Sebastian, Kumar Shankar, Severyn Aliaksei
- HELP ME THINK: A Simple Prompting Strategy For Non-experts To Create Customized Content With Models Mishra Swaroop, Nouri Elnaz
- Tell Me The Evidence? Dual Visual-linguistic Interaction For Answer Grounding Pan Junwen, Chen Guanlin, Liu Yi, Wang Jiexiang, Bian Cheng, Zhu Pengfei, Zhang Zhicheng
- Visconde: Multi-document QA With GPT-3 And Neural Reranking Pereira Jayr, Fidalgo Robson, Lotufo Roberto, Nogueira Rodrigo
- Harnessing The Power Of Multi-task Pretraining For Ground-truth Level Natural Language Explanations Plüster Björn, Ambsdorf Jakob, Braach Lukas, Lee Jae Hee, Wermter Stefan
- Psychologically-informed Chain-of-thought Prompts For Metaphor Understanding In Large Language Models Prystawski Ben, Thibodeau Paul, Potts Christopher, Goodman Noah D.
- Cometkiwi: Ist-unbabel 2022 Submission For The Quality Estimation Shared Task Rei Ricardo, Treviso Marcos, Guerreiro Nuno M., Zerva Chrysoula, Farinha Ana C., Maroti Christine, De Souza José G. C., Glushkova Taisiya, Alves Duarte M., Lavie Alon, Coheur Luisa, Martins André F. T.
- NLX-GPT: A Model For Natural Language Explanations In Vision And Vision-language Tasks Sammani Fawaz, Mukherjee Tanmoy, Deligiannis Nikos
- Automatic Generation Of Programming Exercises And Code Explanations Using Large Language Models Sarsa Sami, Denny Paul, Hellas Arto, Leinonen Juho
- React: Synergizing Reasoning And Acting In Language Models Shunyu Yao, Jeffrey Zhao, Dian Yu, Nan Du, Izhak Shafran, Karthik Narasimhan, Yuan Cao
- Augmenting Interpretable Models With Llms During Training Singh Chandan, Askari Armin, Caruana Rich, Gao Jianfeng
- Explaining Patterns In Data With Language Models Via Interpretable Autoprompting Singh Chandan, Morris John X., Aneja Jyoti, Rush Alexander M., Gao Jianfeng
- Investigating Explainability Of Generative AI For Code Through Scenario-based Design Sun Jiao, Liao Q. Vera, Muller Michael, Agarwal Mayank, Houde Stephanie, Talamadupula Kartik, Weisz Justin D.
- Towards Reasoning-aware Explainable VQA Vaideeswaran Rakesh, Gao Feng, Mathur Abhinav, Thattai Govind
- What Do They Capture? -- A Structural Analysis Of Pre-trained Language Models For Source Code Wan Yao, Zhao Wei, Zhang Hongyu, Sui Yulei, Xu Guandong, Jin Hai
- EGCR: Explanation Generation For Conversational Recommendation Wen Bingbing, Bu Xiaoning, Shah Chirag
- Sentence Representation Learning With Generative Objective Rather Than Contrastive Objective Wu Bohong, Zhao Hai
- The Unreliability Of Explanations In Few-shot Prompting For Textual Reasoning Xi Ye, Greg Durrett
- Rationale-augmented Ensembles In Language Models Xuezhi Wang, Jason Wei, Dale Schuurmans, Quoc Le, Ed Chi, Denny Zhou
- Complementary Explanations For Effective In-context Learning Ye Xi, Iyer Srinivasan, Celikyilmaz Asli, Stoyanov Ves, Durrett Greg, Pasunuru Ramakanth
- Binding Language Models In Symbolic Languages Zhoujun Cheng, Tianbao Xie, Peng Shi, Chengzu Li, Rahul Nadkarni, Yushi Hu, Caiming Xiong, Dragomir Radev, Mari Ostendorf, Luke Zettlemoyer, Noah A. Smith, Tao Yu
- Better Patching Using LLM Prompting, Via Self-consistency Ahmed Toufique, Devanbu Premkumar
- Multimodal Representations For Teacher-guided Compositional Visual Reasoning Aissa Wafa, Ferecatu Marin, Crucianu Michel
- An In-depth Look At Gemini's Language Abilities Akter Syeda Nahida, Yu Zichun, Muhamed Aashiq, Ou Tianyue, Bäuerle Alex, Cabrera Ángel Alexander, Dholakia Krish, Xiong Chenyan, Neubig Graham
- OPT-R: Exploring The Role Of Explanations In Finetuning And Prompting For Reasoning Skills Of Large Language Models Alkhamissi Badr, Verma Siddharth, Yu Ping, Jin Zhijing, Celikyilmaz Asli, Diab Mona
- It's Not Easy Being Wrong: Large Language Models Struggle With Process Of Elimination Reasoning Balepur Nishant, Palta Shramay, Rudinger Rachel
- Exploring Iterative Enhancement For Improving Learnersourced Multiple-choice Question Explanations With Large Language Models Bao Qiming, Leinonen Juho, Peng Alex Yuxuan, Zhong Wanjun, Gendron Gaël, Pistotti Timothy, Huang Alice, Denny Paul, Witbrock Michael, Liu Jiamou
- Self-consistency Of Large Language Models Under Ambiguity Bartsch Henning, Jorgensen Ole, Rosati Domenic, Hoelscher-obermaier Jason, Pfau Jacob
- Large Linguistic Models: Analyzing Theoretical Linguistic Abilities Of Llms Beguš Gašper, Dąbkowski Maksymilian, Rhodes Ryan
- Exploring Large Language Models For Code Explanation Bhattacharya Paheli, Chakraborty Manojit, Palepu Kartheek N S N, Pandey Vikas, Dindorkar Ishan, Rajpurohit Rakesh, Gupta Rishabh
- Helm: Highlighted Evidence Augmented Language Model For Enhanced Table-to-text Generation Bian Junyi, Qin Xiaolei, Zou Wuhe, Huang Mengzuo, Luo Congyi, Zhang Ke, Zhang Weidong
- Development Of The Chatgpt, Generative Artificial Intelligence And Natural Large Language Models For Accountable Reporting And Use (CANGARU) Guidelines Cacciamani Giovanni E., Eppler Michael B., Ganjavi Conner, Pekan Asli, Biedermann Brett, Collins Gary S., Gill Inderbir S.
- Localizing Lying In Llama: Understanding Instructed Dishonesty On True-false Questions Through Prompting, Probing, And Patching Campbell James, Ren Richard, Guo Phillip
- Can Transformers Learn Sequential Function Classes In Context? Campbell Ryan, Guo Emma, Hu Evan, Vir Reya, Hsiao Ethan
- Can It Edit? Evaluating The Ability Of Large Language Models To Follow Code Editing Instructions Cassano Federico, Li Luisa, Sethi Akul, Shinn Noah, Brennan-jones Abby, Ginesin Jacob, Berman Edward, Chakhnashvili George, Lozhkov Anton, Anderson Carolyn Jane, Guha Arjun
- Sudden Drops In The Loss: Syntax Acquisition, Phase Transitions, And Simplicity Bias In Mlms Chen Angelica, Shwartz-ziv Ravid, Cho Kyunghyun, Leavitt Matthew L., Saphra Naomi
- Gptutor: A Chatgpt-powered Programming Tool For Code Explanation Chen Eason, Huang Ray, Chen Han-shin, Tseng Yuen-hsien, Li Liang-yi
- Chain-of-thought Prompt Distillation For Multimodal Named Entity Recognition And Multimodal Relation Extraction Chen Feng, Feng Yujian
- Interpreting And Controlling Vision Foundation Models Via Text Explanations Chen Haozhe, Yang Junfeng, Vondrick Carl, Mao Chengzhi
- Walking Down The Memory Maze: Beyond Context Limit Through Interactive Reading Chen Howard, Pasunuru Ramakanth, Weston Jason, Celikyilmaz Asli
- Reconcile: Round-table Conference Improves Reasoning Via Consensus Among Diverse Llms Chen Justin Chih-yao, Saha Swarnadeep, Bansal Mohit
- Driving With Llms: Fusing Object-level Vector Modality For Explainable Autonomous Driving Chen Long, Sinavski Oleg, Hünermann Jan, Karnsund Alice, Willmott Andrew James, Birch Danny, Maund Daniel, Shotton Jamie
- Challenges And Contributing Factors In The Utilization Of Large Language Models (llms) Chen Xiaoliang, Li Liangbin, Chang Le, Huang Yunhe, Zhao Yuxuan, Zhang Yuxiao, Li Dinuo
- Teaching Large Language Models To Self-debug Chen Xinyun, Lin Maxwell, Schärli Nathanael, Zhou Denny
- Do Models Explain Themselves? Counterfactual Simulatability Of Natural Language Explanations Chen Yanda, Zhong Ruiqi, Ri Narutatsu, Zhao Chen, He He, Steinhardt Jacob, Yu Zhou, Mckeown Kathleen
- Token Prediction As Implicit Classification To Identify Llm-generated Text Chen Yutian, Kang Hao, Zhai Vivian, Li Liangze, Singh Rita, Raj Bhiksha
- Lmexplainer: Grounding Knowledge And Explaining Language Models Chen Zichen, Chen Jianda, Chen Yuanyuan, Yu Han, Singh Ambuj K, Sra Misha
- Xplainllm: A QA Explanation Dataset For Understanding LLM Decision-making Chen Zichen, Chen Jianda, Gaidhani Mitali, Singh Ambuj, Sra Misha
- Llf-bench: Benchmark For Interactive Learning From Language Feedback Cheng Ching-an, Kolobov Andrey, Misra Dipendra, Nie Allen, Swaminathan Adith
- Challenges Of Large Language Models For Mental Health Counseling Chung Neo Christopher, Dyer George, Brocki Lennart
- Robust Infidelity: When Faithfulness Measures On Masked Language Models Are Misleading Crothers Evan, Viktor Herna, Japkowicz Nathalie
- Syntactic Knowledge Via Graph Attention With BERT In Machine Translation Dai Yuqian, Sharoff Serge, De Kamps Marc
- Visual Chain Of Thought: Bridging Logical Gaps With Multimodal Infillings Daniel Rose, Vaishnavi Himakunthala, Andy Ouyang, Ryan He, Alex Mei, Yujie Lu, Michael Saxon, Chinmay Sonar, Diba Mirza, William Yang Wang
- An Adversarial Example For Direct Logit Attribution: Memory Management In Gelu-4l Dao James, Lau Yeu-tong, Rager Can, Janiak Jett
- Who's Thinking? A Push For Human-centered Evaluation Of Llms Using The XAI Playbook Datta Teresa, Dickerson John P.
- Complex QA And Language Models Hybrid Architectures, Survey Daull Xavier, Bellot Patrice, Bruno Emmanuel, Martin Vincent, Murisasco Elisabeth
- Using Large Language Model To Solve And Explain Physics Word Problems Approaching Human Level Ding Jingzhe, Cen Yan, Wei Xinyuan
- Textgenshap: Scalable Post-hoc Explanations In Text Generation With Long Documents Enouen James, Nakhost Hootan, Ebrahimi Sayna, Arik Sercan O, Liu Yan, Pfister Tomas
- The Devil Is In The Errors: Leveraging Large Language Models For Fine-grained Machine Translation Evaluation Fernandes Patrick, Deutsch Daniel, Finkelstein Mara, Riley Parker, Martins André F. T., Neubig Graham, Garg Ankush, Clark Jonathan H., Freitag Markus, Firat Orhan
- Explaining How Transformers Use Context To Build Predictions Ferrando Javier, Gállego Gerard I., Tsiamas Ioannis, Costa-jussà Marta R.
- Exaranker: Explanation-augmented Neural Ranker Ferraretto Fernando, Laitz Thiago, Lotufo Roberto, Nogueira Rodrigo
- A Challenger To GPT-4V? Early Explorations Of Gemini In Visual Expertise Fu Chaoyou, Zhang Renrui, Wang Zihan, Huang Yubo, Zhang Zhengye, Qiu Longtian, Ye Gaoxiang, Shen Yunhang, Zhang Mengdan, Chen Peixian, Zhao Sirui, Lin Shaohui, Jiang Deqiang, Yin Di, Gao Peng, Li Ke, Li Hongsheng, Sun Xing
- Dynamic Clue Bottlenecks: Towards Interpretable-by-design Visual Question Answering Fu Xingyu, Zhou Ben, Chen Sihao, Yatskar Mark, Roth Dan
- Self-explanation Prompting Improves Dialogue Understanding In Large Language Models Gao Haoyu, Lin Ting-en, Li Hangyu, Yang Min, Wu Yuchuan, Ma Wentao, Li Yongbin
- Faithful Explanations Of Black-box NLP Models Using Llm-generated Counterfactuals Gat Yair, Calderon Nitay, Feder Amir, Chapanin Alexander, Sharma Amit, Reichart Roi
- Building Trustworthy Neurosymbolic AI Systems: Consistency, Reliability, Explainability, And Safety Gaur Manas, Sheth Amit
- Reasoning In Large Language Models Through Symbolic Math Word Problems Gaur Vedant, Saunshi Nikunj
- Ddcot: Duty-distinct Chain-of-thought Prompting For Multimodal Reasoning In Language Models Ge Zheng, Bin Yang, Jiajin Tang, Hong-yu Zhou, Sibei Yang
- UPAR: A Kantian-inspired Prompting Framework For Enhancing Large Language Model Capabilities Geng Hejia, Xu Boxun, Li Peng
- Adapting Llms For Efficient, Personalized Information Retrieval: Methods And Implications Ghodratnama Samira, Zakershahrak Mehrdad
- Augmented Language Models: A Survey Grégoire Mialon, Roberto Dessì, Maria Lomeli, Christoforos Nalmpantis, Ram Pasunuru, Roberta Raileanu, Baptiste Rozière, Timo Schick, Jane Dwivedi-yu, Asli Celikyilmaz, Edouard Grave, Yann Lecun, Thomas Scialom
- Digital Socrates: Evaluating Llms Through Explanation Critiques Gu Yuling, Tafjord Oyvind, Clark Peter
- Images In Language Space: Exploring The Suitability Of Large Language Models For Vision & Language Tasks Hakimov Sherzod, Schlangen David
- Using Natural Language Explanations To Improve Robustness Of In-context Learning He Xuanli, Wu Yuxiang, Camburu Oana-maria, Minervini Pasquale, Stenetorp Pontus
- An Automatically Discovered Chain-of-thought Prompt Generalizes To Novel Models And Datasets Hebenstreit Konstantin, Praas Robert, Kiesewetter Louis P, Samwald Matthias
- Backpack Language Models Hewitt John, Thickstun John, Manning Christopher D., Liang Percy
- More Robots Are Coming: Large Multimodal Models (chatgpt) Can Solve Visually Diverse Images Of Parsons Problems Hou Irene, Man Owen, Mettille Sophie, Gutierrez Sebastian, Angelikas Kenneth, Macneil Stephen
- Is Chatgpt Better Than Human Annotators? Potential And Limitations Of Chatgpt In Explaining Implicit Hate Speech Huang Fan, Kwak Haewoon, An Jisun
- Can Large Language Models Explain Themselves? A Study Of Llm-generated Self-explanations Huang Shiyuan, Mamidanna Siddarth, Jangam Shreedhar, Zhou Yilun, Gilpin Leilani H.
- Generic Attention-model Explainability By Weighted Relevance Accumulation Huang Yiming, Jia Aozhe, Zhang Xiaodan, Zhang Jiawei
- Chatgpt Is Fun, But It Is Not Funny! Humor Is Still Challenging Large Language Models Jentzsch Sophie, Kersting Kristian
- Rethinking Large Language Models In Mental Health Applications Ji Shaoxiong, Zhang Tianlin, Yang Kailai, Ananiadou Sophia, Cambria Erik
- Legal Syllogism Prompting: Teaching Large Language Models For Legal Judgment Prediction Jiang Cong, Yang Xiaolei
- Tigerscore: Towards Building Explainable Metric For All Text Generation Tasks Jiang Dongfu, Li Yishan, Zhang Ge, Huang Wenhao, Lin Bill Yuchen, Chen Wenhu
- You Only Forward Once: Prediction And Rationalization In A Single Forward Pass Jiang Han, Duan Junwen, Qu Zhe, Wang Jianxin
- Hykge: A Hypothesis Knowledge Graph Enhanced Framework For Accurate And Reliable Medical Llms Responses Jiang Xinke, Zhang Ruizhe, Xu Yongxin, Qiu Rihong, Fang Yue, Wang Zhiyuan, Tang Jinyi, Ding Hongxin, Chu Xu, Zhao Junfeng, Wang Yasha
- A Case Study On Test Case Construction With Large Language Models: Unveiling Practical Insights And Challenges Junior Roberto Francisco De Lima, Presta Luiz Fernando Paes De Barros, Borborema Lucca Santos, Da Silva Vanderson Nogueira, Dahia Marcio Leal De Melo, Santos Anderson Carlos Sousa E
- Is Chatgpt A Good Recommender? A Preliminary Study Junling Liu, Chao Liu, Peilin Zhou, Renjie Lv, Kang Zhou, Yan Zhang
- Robust Recommender System: A Survey And Future Directions Kaike Zhang, Qi Cao, Fei Sun, Yunfan Wu, Shuchang Tao, Huawei Shen, Xueqi Cheng
- Large Language Models In Introductory Programming Education: Chatgpt's Performance And Implications For Assessments Kiesler Natalie, Schiffner Daniel
- Which Is Better? Exploring Prompting Strategy For Llm-based Metrics Kim Joonghoon, Park Saeran, Jeong Kiyoon, Lee Sangmin, Han Seung Hun, Lee Jiyoon, Kang Pilsung
- Cotever: Chain Of Thought Prompting Annotation Toolkit For Explanation Verification Kim Seungone, Joo Se June, Jang Yul, Chae Hyungjoo, Yeo Jinyoung
- Can Language Models Laugh At Youtube Short-form Videos? Ko Dayoon, Lee Sangho, Kim Gunhee
- Gender Bias And Stereotypes In Large Language Models Kotek Hadas, Dockum Rikker, Sun David Q.
- Do Large Language Models Pay Similar Attention Like Human Programmers When Generating Code? Kou Bonan, Chen Shengmai, Wang Zhijie, Ma Lei, Zhang Tianyi
- Post Hoc Explanations Of Language Models Can Improve Language Models Krishna Satyapriya, Ma Jiaqi, Slack Dylan, Ghandeharioun Asma, Singh Sameer, Lakkaraju Himabindu
- In-context Explainers: Harnessing Llms For Explaining Black Box Models Kroeger Nicholas, Ley Dan, Krishna Satyapriya, Agarwal Chirag, Lakkaraju Himabindu
- DACBERT: Leveraging Dependency Agreement For Cost-efficient Bert Pretraining Kuo Martin, Zhang Jianyi, Chen Yiran
- Measuring Faithfulness In Chain-of-thought Reasoning Lanham Tamera, Chen Anna, Radhakrishnan Ansh, Steiner Benoit, Denison Carson, Hernandez Danny, Li Dustin, Durmus Esin, Hubinger Evan, Kernion Jackson, Lukošiūtė Kamilė, Nguyen Karina, Cheng Newton, Joseph Nicholas, Schiefer Nicholas, Rausch Oliver, Larson Robin, Mccandlish Sam, Kundu Sandipan, Kadavath Saurav, Yang Shannon, Henighan Thomas, Maxwell Timothy, Telleen-lawton Timothy, Hume Tristan, Hatfield-dodds Zac, Kaplan Jared, Brauner Jan, Bowman Samuel R., Perez Ethan
- Applying Large Language Models And Chain-of-thought For Automatic Scoring Lee Gyeong-geon, Latif Ehsan, Wu Xuansheng, Liu Ninghao, Zhai Xiaoming
- The Importance Of Prompt Tuning For Automated Neuron Explanations Lee Justin, Oikarinen Tuomas, Chatha Arjun, Chang Keng-chi, Chen Yilan, Weng Tsui-wei
- Llm4vis: Explainable Visualization Recommendation Using Chatgpt Lei Wang, Songheng Zhang, Yun Wang, Ee-peng Lim, Yong Wang
- Recexplainer: Aligning Large Language Models For Explaining Recommendation Models Lei Yuxuan, Lian Jianxun, Yao Jing, Huang Xu, Lian Defu, Xie Xing
- The Eval4nlp 2023 Shared Task On Prompting Large Language Models As Explainable Metrics Leiter Christoph, Opitz Juri, Deutsch Daniel, Gao Yang, Dror Rotem, Eger Steffen
- Self-detoxifying Language Models Via Toxification Reversal Leong Chak Tou, Cheng Yi, Wang Jiashuo, Wang Jian, Li Wenjie
- Explaincpe: A Free-text Explanation Benchmark Of Chinese Pharmacist Examination Li Dongfang, Yu Jindi, Hu Baotian, Xu Zhenran, Zhang Min
- Repetition In Repetition Out: Towards Understanding Neural Text Degeneration From The Data Perspective Li Huayang, Lan Tian, Fu Zihao, Cai Deng, Liu Lemao, Collier Nigel, Watanabe Taro, Su Yixuan
- Generative Judge For Evaluating Alignment Li Junlong, Sun Shichao, Yuan Weizhe, Fan Run-ze, Zhao Hai, Liu Pengfei
- Explicit Syntactic Guidance For Neural Text Generation Li Yafu, Cui Leyang, Yan Jianhao, Yin Yongjing, Bi Wei, Shi Shuming, Zhang Yue
- AI Transparency In The Age Of Llms: A Human-centered Research Roadmap Liao Q. Vera, Vaughan Jennifer Wortman
- Chatgpt-powered Conversational Drug Editing Using Retrieval And Domain Feedback Liu Shengchao, Wang Jiongxiao, Yang Yijin, Wang Chengpeng, Liu Ling, Guo Hongyu, Xiao Chaowei
- Alignbench: Benchmarking Chinese Alignment Of Large Language Models Liu Xiao, Lei Xuanyu, Wang Shengyuan, Huang Yue, Feng Zhuoer, Wen Bosi, Cheng Jiale, Ke Pei, Xu Yifan, Tam Weng Lam, Zhang Xiaohan, Sun Lichao, Gu Xiaotao, Wang Hongning, Zhang Jing, Huang Minlie, Dong Yuxiao, Tang Jie
- Parallel Sentence-level Explanation Generation For Real-world Low-resource Scenarios Liu Yan, Chen Xiaokang, Dai Qi
- Trustworthy Llms: A Survey And Guideline For Evaluating Large Language Models' Alignment Liu Yang, Yao Yuanshun, Ton Jean-francois, Zhang Xiaoying, Guo Ruocheng, Cheng Hao, Klochkov Yegor, Taufiq Muhammad Faaiz, Li Hang
- UI Layout Generation With Llms Guided By UI Grammar Lu Yuwen, Tong Ziang, Zhao Qinyi, Zhang Chengzhi, Li Toby Jia-jun
- Explanation-based Finetuning Makes Models More Robust To Spurious Cues Ludan Josh Magnus, Meng Yixuan, Nguyen Tai, Shah Saurabh, Lyu Qing, Apidianaki Marianna, Callison-burch Chris
- Zero-resource Hallucination Prevention For Large Language Models Luo Junyu, Xiao Cao, Ma Fenglong
- Unlocking The Potential Of Large Language Models For Explainable Recommendations Luo Yucong, Cheng Mingyue, Zhang Hao, Lu Junyu, Liu Qi, Chen Enhong
- Paperqa: Retrieval-augmented Generative Agent For Scientific Research Lála Jakub, O'donoghue Odhran, Shtedritski Aleksandar, Cox Sam, Rodriques Samuel G., White Andrew D.
- An AI Chatbot For Explaining Deep Reinforcement Learning Decisions Of Service-oriented Systems Metzger Andreas, Bartel Jone, Laufer Jan
- ALMANACS: A Simulatability Benchmark For Language Model Explainability Mills Edmund, Su Shiye, Russell Stuart, Emmons Scott
- Gradient-based Automated Iterative Recovery For Parameter-efficient Tuning Mozes Maximilian, Bolukbasi Tolga, Yuan Ann, Liu Frederick, Thain Nithum, Dixon Lucas
- Octopack: Instruction Tuning Code Large Language Models Muennighoff Niklas, Liu Qian, Zebaze Armel, Zheng Qinkai, Hui Binyuan, Zhuo Terry Yue, Singh Swayam, Tang Xiangru, Von Werra Leandro, Longpre Shayne
- Orca: Progressive Learning From Complex Explanation Traces Of GPT-4 Mukherjee Subhabrata, Mitra Arindam, Jawahar Ganesh, Agarwal Sahaj, Palangi Hamid, Awadallah Ahmed
- Capabilities Of GPT-4 On Medical Challenge Problems Nori Harsha, King Nicholas, Mckinney Scott Mayer, Carignan Dean, Horvitz Eric
- Evaluating GPT-3.5 And GPT-4 Models On Brazilian University Admission Exams Nunes Desnes, Primi Ricardo, Pires Ramon, Lotufo Roberto, Nogueira Rodrigo
- Automated Assessment Of Students' Code Comprehension Using Llms Oli Priti, Banjade Rabin, Chapagain Jeevan, Rus Vasile
- The Behavior Of Large Language Models When Prompted To Generate Code Explanations Oli Priti, Banjade Rabin, Chapagain Jeevan, Rus Vasile
- Evaluating And Explaining Large Language Models For Code Using Syntactic Structures Palacio David N, Velasco Alejandro, Rodriguez-cardenas Daniel, Moran Kevin, Poshyvanyk Denys
- Towards Vision-language Mechanistic Interpretability: A Causal Tracing Tool For BLIP Palit Vedant, Pandey Rohan, Arora Aryaman, Liang Paul Pu
- Finding And Editing Multi-modal Neurons In Pre-trained Transformers Pan Haowen, Cao Yixin, Wang Xiaozhi, Yang Xun, Wang Meng
- On Measuring Faithfulness Or Self-consistency Of Natural Language Explanations Parcalabescu Letitia, Frank Anette
- Abductive Reasoning With The GPT-4 Language Model: Case Studies From Criminal Investigation, Medical Practice, Scientific Research Pareschi Remo
- Entity Matching Using Large Language Models Peeters Ralph, Bizer Christian
- Customising General Large Language Models For Specialised Emotion Recognition Tasks Peng Liyizhe, Zhang Zixing, Pang Tao, Han Jing, Zhao Huan, Chen Hao, Schuller Björn W.
- Generating High-precision Feedback For Programming Syntax Errors Using Large Language Models Phung Tung, Cambronero José, Gulwani Sumit, Kohn Tobias, Majumdar Rupak, Singla Adish, Soares Gustavo
- Emptying The Ocean With A Spoon: Should We Edit Models? Pinter Yuval, Elhadad Michael
- Faithful Chain-of-thought Reasoning Qing Lyu, Shreya Havaldar, Adam Stein, Li Zhang, Delip Rao, Eric Wong, Marianna Apidianaki, Chris Callison-burch
- Logic-scaffolding: Personalized Aspect-instructed Recommendation Explanation Generation Using Llms Rahdari Behnam, Ding Hao, Fan Ziwei, Ma Yifei, Chen Zhuotong, Deoras Anoop, Kveton Branislav
- STREET: A Multi-task Structured Reasoning And Explanation Benchmark Ribeiro Danilo, Wang Shen, Ma Xiaofei, Zhu Henry, Dong Rui, Kong Deguang, Burger Juliette, Ramos Anjelica, Wang William, Huang Zhiheng, Karypis George, Xiang Bing, Roth Dan
- Benchmarking Causal Study To Interpret Large Language Models For Source Code Rodriguez-cardenas Daniel, Palacio David N., Khati Dipin, Burke Henry, Poshyvanyk Denys
- Are Emergent Abilities Of Large Language Models A Mirage? Rylan Schaeffer, Brando Miranda, Sanmi Koyejo
- Naturalness Of Attention: Revisiting Attention In Code Language Models Saad Mootez, Sharma Tushar
- ICLEF: In-context Learning With Expert Feedback For Explainable Style Transfer Saakyan Arkadiy, Muresan Smaranda
- Catfood: Counterfactual Augmented Training For Improving Out-of-domain Performance And Calibration Sachdeva Rachneet, Tutek Martin, Gurevych Iryna
- Zero-shot Translation Of Attention Patterns In VQA Models To Natural Language Salewski Leonard, Koepke A. Sophia, Lensch Hendrik P. A., Akata Zeynep
- Uni-nlx: Unifying Textual Explanations For Vision And Vision-language Tasks Sammani Fawaz, Deligiannis Nikos
- Inseq: An Interpretability Toolkit For Sequence Generation Models Sarti Gabriele, Feldhus Nils, Sickert Ludwig, Van Der Wal Oskar, Nissim Malvina, Bisazza Arianna
- Interprompt: Interpretable Prompting For Interrelated Interpersonal Risk Factors In Reddit Posts Sathvik MSVPJ, Sarkar Surjodeep, Saxena Chandni, Sohn Sunghwan, Garg Muskan
- GIFT: Generative Interpretable Fine-tuning Savadikar Chinmay, Song Xi, Wu Tianfu
- Diagnostic Reasoning Prompts Reveal The Potential For Large Language Model Interpretability In Medicine Savage Thomas, Nayak Ashwin, Gallo Robert, Rangan Ekanath, Chen Jonathan H
- Large Language Models (GPT) Struggle To Answer Multiple-choice Questions About Code Savelka Jaromir, Agarwal Arav, Bogart Christopher, Sakr Majd
- Large Language Model Alignment: A Survey Shen Tianhao, Jin Renren, Huang Yufei, Liu Chuang, Dong Weilong, Guo Zishan, Wu Xinwei, Liu Yan, Xiong Deyi
- Beyond Task Performance: Evaluating And Reducing The Flaws Of Large Multimodal Models With In-context Learning Shukor Mustafa, Rame Alexandre, Dancette Corentin, Cord Matthieu
- Large Language Models Help Humans Verify Truthfulness -- Except When They Are Convincingly Wrong Si Chenglei, Goyal Navita, Wu Sherry Tongshuang, Zhao Chen, Feng Shi, Daumé Hal III, Boyd-graber Jordan
- Explaining Black Box Text Modules In Natural Language With Language Models Singh Chandan, Hsu Aliyah R., Antonello Richard, Jain Shailee, Huth Alexander G., Yu Bin, Gao Jianfeng
- SPARSEFIT: Few-shot Prompting With Sparse Fine-tuning For Jointly Generating Predictions And Natural Language Explanations Solano Jesus, Sanni Mardhiyah, Camburu Oana-maria, Minervini Pasquale
- Dynamic Context Pruning For Efficient And Interpretable Autoregressive Transformers Sotiris Anagnostidis, Dario Pavllo, Luca Biggio, Lorenzo Noci, Aurelien Lucchi, Thomas Hofmann
- Harnessing Llms In Curricular Design: Using GPT-4 To Support Authoring Of Learning Objectives Sridhar Pragnya, Doyle Aidan, Agarwal Arav, Bogart Christopher, Savelka Jaromir, Sakr Majd
- Probing Llms For Joint Encoding Of Linguistic Categories Starace Giulio, Papakostas Konstantinos, Choenni Rochelle, Panagiotopoulos Apostolos, Rosati Matteo, Leidinger Alina, Shutova Ekaterina
- Are Llms Rigorous Logical Reasoner? Empowering Natural Language Proof Generation With Contrastive Stepwise Decoding Su Ying, Fu Xiaojin, Liu Mingwen, Guo Zhijiang
- Can Large Language Models Change User Preference Adversarially? Subhash Varshini
- Character-level Chinese Backpack Language Models Sun Hao, Hewitt John
- A Short Survey Of Viewing Large Language Models In Legal Aspect Sun Zhongxiang
- Can Language Models Teach Weaker Agents? Teacher Explanations Improve Students Via Personalization Swarnadeep Saha, Peter Hase, Mohit Bansal
- Sparsity-guided Holistic Explanation For Llms With Interpretable Inference-time Intervention Tan Zhen, Chen Tianlong, Zhang Zhenyu, Liu Huan
- Interpreting Pretrained Language Models Via Concept Bottlenecks Tan Zhen, Cheng Lu, Wang Song, Bo Yuan, Li Jundong, Liu Huan
- A Benchmark For Learning To Translate A New Language From One Grammar Book Tanzer Garrett, Suzgun Mirac, Visser Eline, Jurafsky Dan, Melas-kyriazi Luke
- Demystifying Embedding Spaces Using Large Language Models Tennenholtz Guy, Chow Yinlam, Hsu Chih-wei, Jeong Jihwan, Shani Lior, Tulepbergenov Azamat, Ramachandran Deepak, Mladenov Martin, Boutilier Craig
- Overinformative Question Answering By Humans And Machines Tsvilodub Polina, Franke Michael, Hawkins Robert D., Goodman Noah D.
- Language Models Don't Always Say What They Think: Unfaithful Explanations In Chain-of-thought Prompting Turpin Miles, Michael Julian, Perez Ethan, Bowman Samuel R.
- Don't Believe Everything You Read: Enhancing Summarization Interpretability Through Automatic Identification Of Hallucinations In Large Language Models Vakharia Priyesh, Joshi Devavrat, Chavan Meenal, Sonawane Dhananjay, Garg Bhrigu, Mazaheri Parsa
- Adventures Of Trustworthy Vision-language Models: A Survey Vatsa Mayank, Jain Anubhooti, Singh Richa
- Using Natural Language Explanations To Rescale Human Judgments Wadhwa Manya, Chen Jifan, Li Junyi Jessy, Durrett Greg
- Lifelongmemory: Leveraging Llms For Answering Queries In Long-form Egocentric Videos Wang Ying, Yang Yanlai, Ren Mengye
- Can Llms Like GPT-4 Outperform Traditional AI Tools In Dementia Diagnosis? Maybe, But Not Today Wang Zhuo, Li Rongzhen, Dong Bowen, Wang Jie, Li Xiuxing, Liu Ning, Mao Chenhui, Zhang Wei, Dong Liling, Gao Jing, Wang Jianyong
- Hyperpolyglot Llms: Cross-lingual Interpretability In Token Embeddings Wen-yi Andrea W, Mimno David
- Mastering Symbolic Operations: Augmenting Language Models With Compiled Neural Networks Weng Yixuan, Zhu Minjun, Xia Fei, Li Bin, He Shizhu, Liu Kang, Zhao Jun
- From Language Modeling To Instruction Following: Understanding The Behavior Shift In Llms After Instruction Tuning Wu Xuansheng, Yao Wenlin, Chen Jianshu, Pan Xiaoman, Wang Xiaoyang, Liu Ninghao, Yu Dong
- LLMGA: Multimodal Large Language Model Based Generation Assistant Xia Bin, Wang Shiyin, Tao Yingfan, Wang Yitong, Jia Jiaya
- Rethinking The Evaluation For Conversational Recommendation In The Era Of Large Language Models Xiaolei Wang, Xinyu Tang, Wayne Xin Zhao, Jingyuan Wang, Ji-rong Wen
- The Wall Street Neophyte: A Zero-shot Analysis Of Chatgpt Over Multimodal Stock Movement Prediction Challenges Xie Qianqian, Han Weiguang, Lai Yanzhao, Peng Min, Huang Jimin
- Proto-lm: A Prototypical Network-based Framework For Built-in Interpretability In Large Language Models Xie Sean, Vosoughi Soroush, Hassanpour Saeed
- Rationale-enhanced Language Models Are Better Continual Relation Learners Xiong Weimin, Song Yifan, Wang Peiyi, Li Sujian
- Small Models Are Valuable Plug-ins For Large Language Models Xu Canwen, Xu Yichong, Wang Shuohang, Liu Yang, Zhu Chenguang, Mcauley Julian
- Are Large Language Models Really Good Logical Reasoners? A Comprehensive Evaluation And Beyond Xu Fangzhi, Lin Qika, Han Jiawei, Zhao Tianzhe, Liu Jun, Cambria Erik
- Recommender AI Agent: Integrating Large Language Models For Interactive Recommendations Xu Huang, Jianxun Lian, Yuxuan Lei, Jing Yao, Defu Lian, Xing Xie
- Bias And Fairness In Chatbots: An Overview Xue Jintang, Wang Yun-cheng, Wei Chengwei, Liu Xiaofeng, Woo Jonghye, Kuo C.-C. Jay
- Voila-a: Aligning Vision-language Models With User's Gaze Attention Yan Kun, Ji Lei, Wang Zeyu, Wang Yuntao, Duan Nan, Ma Shuai
- Zero-shot Query Reformulation For Conversational Search Yang Dayu, Zhang Yue, Fang Hui
- Tackling Vision Language Tasks Through Learning Inner Monologues Yang Diji, Chen Kezhen, Rao Jinmeng, Guo Xiaoyuan, Zhang Yawen, Yang Jie, Zhang Yi
- Knowledge-prompted Estimator: A Novel Approach To Explainable Machine Translation Assessment Yang Hao, Zhang Min, Tao Shimin, Wang Minghan, Wei Daimeng, Jiang Yanfei
- Towards Interpretable Mental Health Analysis With Large Language Models Yang Kailai, Ji Shaoxiong, Zhang Tianlin, Xie Qianqian, Kuang Ziyan, Ananiadou Sophia
- Benchmarking Llm-based Machine Translation On Cultural Awareness Yao Binwei, Jiang Ming, Yang Diyi, Hu Junjie
- FLASK: Fine-grained Language Model Evaluation Based On Alignment Skill Sets Ye Seonghyeon, Kim Doyoung, Kim Sungdong, Hwang Hyeonbin, Kim Seungone, Jo Yongrae, Thorne James, Kim Juho, Seo Minjoon
- Finmem: A Performance-enhanced LLM Trading Agent With Layered Memory And Character Design Yu Yangyang, Li Haohang, Chen Zhi, Jiang Yuechen, Li Yang, Zhang Denghui, Liu Rong, Suchow Jordan W., Khashanah Khaldoun
- Explanation-aware Soft Ensemble Empowers Large Language Model In-context Learning Yu Yue, Shen Jiaming, Liu Tianqi, Qin Zhen, Yan Jing Nathan, Liu Jialu, Zhang Chao, Bendersky Michael
- Prompt Highlighter: Interactive Control For Multi-modal Llms Yuechen Zhang, Shengju Qian, Bohao Peng, Shu Liu, Jiaya Jia
- Chat-rec: Towards Interactive And Explainable Llms-augmented Recommender System Yunfan Gao, Tao Sheng, Youlin Xiang, Yun Xiong, Haofen Wang, Jiawei Zhang
- Exploring Memorization In Fine-tuned Language Models Zeng Shenglai, Li Yaxin, Ren Jie, Liu Yiding, Xu Han, He Pengfei, Xing Yue, Wang Shuaiqiang, Tang Jiliang, Yin Dawei
- XATU: A Fine-grained Instruction-based Benchmark For Explainable Text Updates Zhang Haopeng, Iso Hayate, Gurajada Sairam, Bhutani Nikita
- EHR Interaction Between Patients And AI: Noteaid EHR Interaction Zhang Xiaocheng, Yao Zonghai, Yu Hong
- Large Language Models In Medical Term Classification And Unexpected Misalignment Between Response And Reasoning Zhang Xiaodan, Vemulapalli Sandeep, Talukdar Nabasmita, Ahn Sumyeong, Wang Jiankun, Meng Han, Murtaza Sardar Mehtab Bin, Dave Aakash Ajay, Leshchiner Dmitry, Joseph Dimitri F., Witteveen-lane Martin, Chesla Dave, Zhou Jiayu, Chen Bin
- GPT-4V(ision) As A Generalist Evaluator For Vision-language Tasks Zhang Xinlu, Lu Yujie, Wang Weizhi, Yan An, Yan Jun, Qin Lianke, Wang Heng, Yan Xifeng, Wang William Yang, Petzold Linda Ruth
- Siren's Song In The AI Ocean: A Survey On Hallucination In Large Language Models Zhang Yue, Li Yafu, Cui Leyang, Cai Deng, Liu Lemao, Fu Tingchen, Huang Xinting, Zhao Enbo, Zhang Yu, Chen Yulong, Wang Longyue, Luu Anh Tuan, Bi Wei, Shi Freda, Shi Shuming
- Pre-trained Language Models Do Not Help Auto-regressive Text-to-image Generation Zhang Yuhui, Mckinzie Brandon, Gan Zhe, Shankar Vaishaal, Toshev Alexander
- Accountable Textual-visual Chat Learns To Reject Human Instructions In Image Re-creation Zhang Zhiwei, Liu Yuliang
- Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-thought Reasoning To Language Agents Zhang Zhuosheng, Yao Yao, Zhang Aston, Tang Xiangru, Ma Xinbei, He Zhiwei, Wang Yiming, Gerstein Mark, Wang Rui, Liu Gongshen, Zhao Hai
- MELA: Multilingual Evaluation Of Linguistic Acceptability Zhang Ziyin, Liu Yikang, Huang Weifang, Mao Junyu, Wang Rui, Hu Hai
- Explainability For Large Language Models: A Survey Zhao Haiyan, Chen Hanjie, Yang Fan, Liu Ninghao, Deng Huiqi, Cai Hengyi, Wang Shuaiqiang, Yin Dawei, Du Mengnan
- Automatic Model Selection With Large Language Models For Reasoning Zhao James Xu, Xie Yuxi, Kawaguchi Kenji, He Junxian, Xie Michael Qizhe
- Retrieving Multimodal Information For Augmented Generation: A Survey Zhao Ruochen, Chen Hailin, Wang Weishi, Jiao Fangkai, Do Xuan Long, Qin Chengwei, Ding Bosheng, Guo Xiaobao, Li Minzhi, Li Xingxuan, Joty Shafiq
- Explainable Multimodal Emotion Recognition Zheng Lian, Haiyang Sun, Licai Sun, Hao Gu, Zhuofan Wen, Siyuan Zhang, Shun Chen, Mingyu Xu, Ke Xu, Kang Chen, Lan Chen, Shan Liang, Ya Li, Jiangyan Yi, Bin Liu, Jianhua Tao
- GPT As A Baseline For Recommendation Explanation Texts Zhou Joyce, Joachims Thorsten
- Flame: Few-shot Learning From Natural Language Explanations Zhou Yangqiaoyu, Zhang Yiming, Tan Chenhao
- The Mystery Of In-context Learning: A Comprehensive Survey On Interpretation And Analysis Zhou Yuxiang, Li Jiazheng, Xiang Yanzheng, Yan Hanqi, Gui Lin, He Yulan
- Autodan: Interpretable Gradient-based Adversarial Attacks On Large Language Models Zhu Sicheng, Zhang Ruiyi, An Bang, Wu Gang, Barrow Joe, Wang Zichao, Huang Furong, Nenkova Ani, Sun Tong
- Can Large Language Models Transform Computational Social Science? Ziems Caleb, Held William, Shaikh Omar, Chen Jiaao, Zhang Zhehao, Yang Diyi
- Describe, Explain, Plan And Select: Interactive Planning With Large Language Models Enables Open-world Multi-task Agents Zihao Wang, Shaofei Cai, Guanzhou Chen, Anji Liu, Xiaojian Ma, Yitao Liang
- Are You Still On Track!? Catching LLM Task Drift With Activations Abdelnabi Sahar, Fay Aideen, Cherubin Giovanni, Salem Ahmed, Fritz Mario, Paverd Andrew
- Supporting Student Decisions On Learning Recommendations: An Llm-based Chatbot With Knowledge Graph Contextualization For Conversational Explainability And Mentoring Abu-rasheed Hasan, Abdulsalam Mohamad Hussam, Weber Christian, Fathi Madjid
- Knowledge Graphs As Context Sources For Llm-based Explanations Of Learning Recommendations Abu-rasheed Hasan, Weber Christian, Fathi Madjid
- A Survey On Symbolic Knowledge Distillation Of Large Language Models Acharya Kamal, Velasquez Alvaro, Song Houbing Herbert
- Attnlrp: Attention-aware Layer-wise Relevance Propagation For Transformers Achtibat Reduan, Hatefi Sayed Mohammad Vakilzadeh, Dreyer Maximilian, Jain Aakriti, Wiegand Thomas, Lapuschkin Sebastian, Samek Wojciech
- Faithfulness Vs. Plausibility: On The (un)reliability Of Explanations From Large Language Models Agarwal Chirag, Tanneru Sree Harsha, Lakkaraju Himabindu
- Mememqa: Multimodal Question Answering For Memes Via Rationale-based Inferencing Agarwal Siddhant, Sharma Shivam, Nakov Preslav, Chakraborty Tanmoy
- User Preferences For Large Language Model Versus Template-based Explanations Of Movie Recommendations: A Pilot Study Albert Julien, Balfroid Martin, Doh Miriam, Bogaert Jeremie, La Fisca Luca, De Vos Liesbet, Renard Bryan, Stragier Vincent, Jean Emmanuel
- Should We Fine-tune Or RAG? Evaluating Different Techniques To Adapt Llms For Dialogue Alghisi Simone, Rizzoli Massimo, Roccabruna Gabriel, Mousavi Seyed Mahed, Riccardi Giuseppe
- PROMPT-SAW: Leveraging Relation-aware Graphs For Textual Prompt Compression Ali Muhammad Asif, Li Zhengping, Yang Shu, Cheng Keyuan, Cao Yang, Huang Tianhao, Hu Lijie, Yu Lu, Wang Di
- Syntaxshap: Syntax-aware Explainability Method For Text Generation Amara Kenza, Sevastjanova Rita, El-assady Mennatallah
- How Susceptible Are Llms To Influence In Prompts? Anagnostidis Sotiris, Bulian Jannis
- Mm-phyrlhf: Reinforcement Learning Framework For Multimodal Physics Question-answering Anand Avinash, Kapuriya Janak, Kirtani Chhavi, Singh Apoorv, Saraf Jay, Lal Naman, Kumar Jatin, Shivam Adarsh Raj, Verma Astha, Shah Rajiv Ratn, Zimmermann Roger
- Prompt Design Matters For Computational Social Science Tasks But In Unpredictable Ways Atreja Shubham, Ashkinaze Joshua, Li Lingyao, Mendelsohn Julia, Hemphill Libby
- Current State Of LLM Risks And AI Guardrails Ayyamperumal Suriya Ganesh, Ge Limin
- LLM Augmented Llms: Expanding Capabilities Through Composition Bansal Rachit, Samanta Bidisha, Dalmia Siddharth, Gupta Nitish, Vashishth Shikhar, Ganapathy Sriram, Bapna Abhishek, Jain Prateek, Talukdar Partha
- Understanding Information Storage And Transfer In Multi-modal Large Language Models Basu Samyadeep, Grayson Martin, Morrison Cecily, Nushi Besmira, Feizi Soheil, Massiceti Daniela
- Crafting Interpretable Embeddings By Asking Llms Questions Benara Vinamra, Singh Chandan, Morris John X., Antonello Richard, Stoica Ion, Huth Alexander G., Gao Jianfeng
- Self-amplify: Improving Small Language Models With Self Post Hoc Explanations Bhan Milan, Vittaut Jean-noel, Chesneau Nicolas, Lesot Marie-jeanne
- Decoding By Contrasting Knowledge: Enhancing Llms' Confidence On Edited Facts Bi Baolong, Liu Shenghua, Mei Lingrui, Wang Yiwei, Ji Pengliang, Cheng Xueqi
- Improving Language Models Trained On Translated Data With Continual Pre-training And Dictionary Learning Analysis Boughorbel Sabri, Parvez Md Rizwan, Hawasly Majd
- ACORN: Aspect-wise Commonsense Reasoning Explanation Evaluation Brassard Ana, Heinzerling Benjamin, Kudo Keito, Sakaguchi Keisuke, Inui Kentaro
- On Behalf Of The Stakeholders: Trends In NLP Model Interpretability In The Era Of Llms Calderon Nitay, Reichart Roi
- XAI Meets Llms: A Survey Of The Relation Between Explainable AI And Large Language Models Cambria Erik, Malandri Lorenzo, Mercorio Fabio, Nobani Navid, Seveso Andrea
- How Do Large Language Models Acquire Factual Knowledge During Pretraining? Chang Hoyeon, Park Jinho, Ye Seonghyeon, Yang Sohee, Seo Youngkyung, Chang Du-seong, Seo Minjoon
- Xprompt: Explaining Large Language Model's Generation Via Joint Prompt Attribution Chang Yurui, Cao Bochuan, Wang Yujia, Chen Jinghui, Lin Lu
- WSI-VQA: Interpreting Whole Slide Images By Generative Visual Question Answering Chen Pingyi, Zhu Chenglu, Zheng Sunyi, Li Honglin, Yang Lin
- Measuring Taiwanese Mandarin Language Understanding Chen Po-heng, Cheng Sijia, Chen Wei-lin, Lin Yen-ting, Chen Yun-nung
- Gazexplain: Learning To Predict Natural Language Explanations Of Visual Scanpaths Chen Xianyu, Jiang Ming, Zhao Qi
- Designing A Dashboard For Transparency And Control Of Conversational AI Chen Yida, Wu Aoyu, Depodesta Trevor, Yeh Catherine, Li Kenneth, Marin Nicholas Castillo, Patel Oam, Riecke Jan, Raval Shivam, Seow Olivia, Wattenberg Martin, Viégas Fernanda
- Potential And Limitations Of Llms In Capturing Structured Semantics: A Case Study On SRL Cheng Ning, Yan Zhaohui, Wang Ziming, Li Zhijie, Yu Jiaming, Zheng Zilong, Tu Kewei, Xu Jinan, Han Wenjuan
- Interpreting Conversational Dense Retrieval By Rewriting-enhanced Inversion Of Session Embedding Cheng Yiruo, Mao Kelong, Dou Zhicheng
- Do Language Models Enjoy Their Own Stories? Prompting Large Language Models For Automatic Story Evaluation Chhun Cyril, Suchanek Fabian M., Clavel Chloé
- Exploring Interaction Patterns For Debugging: Enhancing Conversational Capabilities Of Ai-assistants Chopra Bhavya, Bajpai Yasharth, Biyani Param, Soares Gustavo, Radhakrishna Arjun, Parnin Chris, Gulwani Sumit
- Faithlm: Towards Faithful Explanations For Large Language Models Chuang Yu-neng, Wang Guanchu, Chang Chia-yuan, Tang Ruixiang, Zhong Shaochen, Yang Fan, Du Mengnan, Cai Xuanting, Hu Xia
- A Chain-of-thought Prompting Approach With Llms For Evaluating Students' Formative Assessment Responses In Science Cohn Clayton, Hutchins Nicole, Le Tuan, Biswas Gautam
- Representations As Language: An Information-theoretic Framework For Interpretability Conklin Henry, Smith Kenny
- Iscore: Visual Analytics For Interpreting How Language Models Automatically Score Summaries Coscia Adam, Holmes Langdon, Morris Wesley, Choi Joon Suh, Crossley Scott, Endert Alex
- Gamebench: Evaluating Strategic Reasoning Abilities Of LLM Agents Costarelli Anthony, Allen Mat, Hauksson Roman, Sodunke Grace, Hariharan Suhas, Cheng Carlson, Li Wenjie, Clymer Joshua, Yadav Arjun
- Inference To The Best Explanation In Large Language Models Dalal Dhairya, Valentino Marco, Freitas André, Buitelaar Paul
- Deceptive AI Systems That Give Explanations Are More Convincing Than Honest AI Systems And Can Amplify Belief In Misinformation Danry Valdemar, Pataranutaporn Pat, Groh Matthew, Epstein Ziv, Maes Pattie
- Llms Are Not Intelligent Thinkers: Introducing Mathematical Topic Tree Benchmark For Comprehensive Evaluation Of Llms Davoodi Arash Gholami, Davoudi Seyed Pouyan Mousavi, Pezeshkpour Pouya
- Gotcha! Don't Trick Me With Unanswerable Questions! Self-aligning Large Language Models For Responding To Unknown Questions Deng Yang, Zhao Yong, Li Moxin, Ng See-kiong, Chua Tat-seng
- Prompt Your Brain: Scaffold Prompt Tuning For Efficient Adaptation Of fMRI Pre-trained Model Dong Zijian, Wu Yilei, Chen Zijiao, Zhang Yichi, Jin Yueming, Zhou Juan Helen
- On The Evaluation Consistency Of Attribution-based Explanations Duan Jiarui, Li Haoling, Zhang Haofei, Jiang Hao, Xue Mengqi, Sun Li, Song Mingli, Song Jie
- Leveraging Machine-generated Rationales To Facilitate Social Meaning Detection In Conversations Dutt Ritam, Wu Zhen, Shi Kelly, Sheth Divyanshu, Gupta Prakhar, Rose Carolyn Penstein
- Explainable AI Reloaded: Challenging The XAI Status Quo In The Era Of Large Language Models Ehsan Upol, Riedl Mark O.
- Large Language Models As Evaluators For Scientific Synthesis Evans Julia, D'souza Jennifer, Auer Sören
- Reformatted Alignment Fan Run-ze, Li Xuefeng, Zou Haoyang, Li Junlong, He Shwai, Chern Ethan, Hu Jiewen, Liu Pengfei
- Tear: Improving Llm-based Machine Translation With Systematic Self-refinement Feng Zhaopeng, Zhang Yan, Li Hao, Wu Bei, Liao Jiayu, Liu Wenqiang, Lang Jun, Feng Yang, Wu Jian, Liu Zuozhu
- Exaranker-open: Synthetic Explanation For IR Using Open-source Llms Ferraretto Fernando, Laitz Thiago, Lotufo Roberto, Nogueira Rodrigo
- Aligning LLM Agents By Learning Latent Preference From User Edits Gao Ge, Taymanov Alexey, Salinas Eduardo, Mineiro Paul, Misra Dipendra
- Calibrated Large Language Models For Binary Question Answering Giovannotti Patrizio, Gammerman Alexander
- Tokenshap: Interpreting Large Language Models With Monte Carlo Shapley Value Estimation Goldshmidt Roni, Horovicz Miriam
- Does Chatgpt Have A Mind? Goldstein Simon, Levinstein Benjamin A.
- Mechanistic Interpretability Of Large Language Models With Applications To The Financial Services Industry Golgoon Ashkan, Filom Khashayar, Kannan Arjun Ravi
- I've Got The "answer"! Interpretation Of Llms Hidden States In Question Answering Goloviznina Valeriya, Kotelnikov Evgeny
- Integrating Explanations In Learning LTL Specifications From Demonstrations Gupta Ashutosh, Komp John, Rajput Abhay Singh, Shankaranarayanan Krishna, Trivedi Ashutosh, Varshney Namrita
- Visual Cot: Advancing Multi-modal Language Models With A Comprehensive Dataset And Benchmark For Chain-of-thought Reasoning Hao Shao, Shengju Qian, Han Xiao, Guanglu Song, Zhuofan Zong, Letian Wang, Yu Liu, Hongsheng Li
- LLM Reasoners: New Evaluation, Library, And Analysis Of Step-by-step Reasoning With Large Language Models Hao Shibo, Gu Yi, Luo Haotian, Liu Tianyang, Shao Xiyan, Wang Xinyuan, Xie Shuhua, Ma Haodi, Samavedhi Adithya, Gao Qiyue, Wang Zhen, Hu Zhiting
- Think-program-rectify: 3D Situated Reasoning With Large Language Models He Qingrong, Lin Kejun, Chen Shizhe, Hu Anwen, Jin Qin
- BP4ER: Bootstrap Prompting For Explicit Reasoning In Medical Dialogue Generation He Yuhong, Zhang Yongqi, He Shizhu, Wan Jun
- The Effect Of Model Size On LLM Post-hoc Explainability Via LIME Heyen Henning, Widdicombe Amy, Siegel Noah Y., Perez-ortiz Maria, Treleaven Philip
- Evaluating The Elementary Multilingual Capabilities Of Large Language Models With Multiq Holtermann Carolin, Röttger Paul, Dill Timm, Lauscher Anne
- Free-text Rationale Generation Under Readability Level Control Hsu Yi-sheng, Feldhus Nils, Hakimov Sherzod
- A Hopfieldian View-based Interpretation For Chain-of-thought Reasoning Hu Lijie, Liu Liang, Yang Shu, Chen Xin, Xiao Hongru, Li Mengdi, Zhou Pan, Ali Muhammad Asif, Wang Di
- SLM Meets LLM: Balancing Latency, Interpretability And Consistency In Hallucination Detection Hu Mengya, Xu Rui, Lei Deren, Li Yaxi, Wang Mingyu, Ching Emily, Kamal Eslam, Deng Alex
- FOKE: A Personalized And Explainable Education Framework Integrating Foundation Models, Knowledge Graphs, And Prompt Engineering Hu Silan, Wang Xiaoning
- Chatgpt Rates Natural Language Explanation Quality Like Humans: But On Which Scales? Huang Fan, Kwak Haewoon, Park Kunwoo, An Jisun
- See The Unseen: Better Context-consistent Knowledge-editing By Noises Huang Youcheng, Lei Wenqiang, Zhang Zheng, Lv Jiancheng, Yan Shuicheng
- On Effects Of Steering Latent Representation For Large Language Model Unlearning Huu-tien Dang, Pham Trung-tin, Thanh-tung Hoang, Inoue Naoya
- Counterfactuals As A Means For Evaluating Faithfulness Of Attribution Methods In Autoregressive Language Models Kamahi Sepehr, Yaghoobzadeh Yadollah
- Self-moe: Towards Compositional Large Language Models With Self-specialized Experts Kang Junmo, Karlinsky Leonid, Luo Hongyin, Wang Zhen, Hansen Jacob, Glass James, Cox David, Panda Rameswar, Feris Rogerio, Ritter Alan
- Codeaid: Evaluating A Classroom Deployment Of An Llm-based Programming Assistant That Balances Student And Educator Needs Kazemitabaar Majeed, Ye Runlong, Wang Xiaoning, Henley Austin Z., Denny Paul, Craig Michelle, Grossman Tovi
- Retrieval-augmented Conversational Recommendation With Prompt-based Semi-structured Natural Language State Tracking Kemper Sara, Cui Justin, Dicarlantonio Kai, Lin Kathy, Tang Danjie, Korikov Anton, Sanner Scott
- Grounding And Evaluation For Large Language Models: Practical Challenges And Lessons Learned (Survey) Kenthapadi Krishnaram, Sameki Mehrnoosh, Taly Ankur
- Harmonic Llms Are Trustworthy Kersting Nicholas S., Rahman Mohammad, Vedala Suchismitha, Wang Yang
- Source-aware Training Enables Knowledge Attribution In Language Models Khalifa Muhammad, Wadden David, Strubell Emma, Lee Honglak, Wang Lu, Beltagy Iz, Peng Hao
- Reinforcement Of Explainability Of Chatgpt Prompts By Embedding Breast Cancer Self-screening Rules Into AI Responses Khan Yousef, Hamed Ahmed Abdeen
- Finer: Investigating And Enhancing Fine-grained Visual Concept Recognition In Large Vision Language Models Kim Jeonghwan, Ji Heng
- Review-driven Personalized Preference Reasoning With Large Language Models For Recommendation Kim Jieyong, Kim Hyunseo, Cho Hyunjin, Kang Seongku, Chang Buru, Yeo Jinyoung, Lee Dongha
- RE-RAG: Improving Open-domain QA Performance And Interpretability With Relevance Estimator In Retrieval-augmented Generation Kim Kiseung, Lee Jay-yoon
- Medexqa: Medical Question Answering Benchmark With Multiple Explanations Kim Yunsoo, Wu Jinge, Abdulle Yusuf, Wu Honghan
- Leveraging Large Language Models For Enhanced Process Model Comprehension Kourani Humam, Berti Alessandro, Henrich Jasmin, Kratsch Wolfgang, Weidlich Robin, Li Chiao-yun, Arslan Ahmad, Schuster Daniel, Van Der Aalst Wil M. P.
- From Data To Commonsense Reasoning: The Use Of Large Language Models For Explainable AI Krause Stefanie, Stolzenburg Frieder
- Properties And Challenges Of Llm-generated Explanations Kunz Jenny, Kuhlmann Marco
- Enhancing Multi-domain Automatic Short Answer Grading Through An Explainable Neuro-symbolic Pipeline Künnecke Felix, Filighera Anna, Leong Colin, Steuer Tim
- Cat-bench: Benchmarking Language Model Understanding Of Causal And Temporal Dependencies In Plans Lal Yash Kumar, Cohen Vanya, Chambers Nathanael, Balasubramanian Niranjan, Mooney Raymond
- Enhancing Neural Machine Translation Of Low-resource Languages: Corpus Development, Human Evaluation And Explainable AI Architectures Lankford Séamus
- Eliciting Better Multilingual Structured Reasoning From Llms Through Code Li Bryan, Alkhouli Tamer, Bonadiman Daniele, Pappas Nikolaos, Mansour Saab
- "Is Chatgpt A Better Explainer Than My Professor?": Evaluating The Explanation Capabilities Of Llms In Conversation Compared To A Human Baseline Li Grace, Alshomary Milad, Muresan Smaranda
- A Transformer With Stack Attention Li Jiaoda, White Jennifer C., Sachan Mrinmaya, Cotterell Ryan
- A Cross-language Investigation Into Jailbreak Attacks In Large Language Models Li Jie, Liu Yi, Liu Chongyang, Shi Ling, Ren Xiaoning, Zheng Yaowen, Liu Yang, Xue Yinxing
- Distilling Algorithmic Reasoning From Llms Via Explaining Solution Programs Li Jierui, Mooney Raymond
- Predicting Vs. Acting: A Trade-off Between World Modeling & Agent Modeling Li Margaret, Shi Weijia, Pagnoni Artidoro, West Peter, Holtzman Ari
- Anchored Answers: Unravelling Positional Bias In GPT-2's Multiple-choice Questions Li Ruizhe, Gao Yanjun
- Preference Tuning For Toxicity Mitigation Generalizes Across Languages Li Xiaochen, Yong Zheng-xin, Bach Stephen H.
- Internal Consistency And Self-feedback In Large Language Models: A Survey Liang Xun, Song Shichao, Zheng Zifan, Wang Hanyu, Yu Qingchen, Li Xunkai, Li Rong-hua, Cheng Peng, Wang Zhonghao, Xiong Feiyu, Li Zhiyu
- 3-in-1: 2D Rotary Adaptation For Efficient Finetuning, Efficient Batching And Composability Liao Baohao, Monz Christof
- Retrieval-augmented Natural Language Reasoning For Explainable Visual Question Answering Lim Su Hyeon, Kim Minkuk, Kim Hyeon Bae, Kim Seong Tae
- Wildbench: Benchmarking Llms With Challenging Tasks From Real Users In The Wild Lin Bill Yuchen, Deng Yuntian, Chandu Khyathi, Brahman Faeze, Ravichander Abhilasha, Pyatkin Valentina, Dziri Nouha, Bras Ronan Le, Choi Yejin
- Logic Agent: Enhancing Validity With Logic Rule Invocation Liu Hanmeng, Teng Zhiyang, Zhang Chaoli, Zhang Yue
- SAFETY-J: Evaluating Safety With Critique Liu Yixiu, Zheng Yuxiang, Xia Shijie, Li Jiajun, Tu Yi, Song Chaoling, Liu Pengfei
- DELTA: Decomposed Efficient Long-term Robot Task Planning Using Large Language Models Liu Yuchen, Palmieri Luigi, Koch Sebastian, Georgievski Ilche, Aiello Marco
- Hd-eval: Aligning Large Language Model Evaluators Through Hierarchical Criteria Decomposition Liu Yuxuan, Yang Tianchi, Huang Shaohan, Zhang Zihan, Huang Haizhen, Wei Furu, Deng Weiwei, Sun Feng, Zhang Qi
- Large Language Models Relearn Removed Concepts Lo Michelle, Cohen Shay B., Barez Fazl
- Layoutllm: Layout Instruction Tuning With Large Language Models For Document Understanding Luo Chuwei, Shen Yufan, Zhu Zhaoqing, Zheng Qi, Yu Zhi, Yao Cong
- From Understanding To Utilization: A Survey On Explainability For Large Language Models Luo Haoyan, Specia Lucia
- End-to-end Neuro-symbolic Reinforcement Learning With Textual Explanations Luo Lirui, Zhang Guoxi, Xu Hongming, Yang Yaodong, Fang Cong, Li Qing
- Calibrating Large Language Models With Sample Consistency Lyu Qing, Shridhar Kumar, Malaviya Chaitanya, Zhang Li, Elazar Yanai, Tandon Niket, Apidianaki Marianna, Sachan Mrinmaya, Callison-burch Chris
- Enhancing Programming Education With Chatgpt: A Case Study On Student Perceptions And Interactions In A Python Course Ma Boxuan, Chen Li, Konomi Shin'ichi
- Agentboard: An Analytical Evaluation Board Of Multi-turn LLM Agents Ma Chang, Zhang Junlei, Zhu Zhihao, Yang Cheng, Yang Yujiu, Jin Yaohui, Lan Zhenzhong, Kong Lingpeng, He Junxian
- Explaining Genetic Programming Trees Using Large Language Models Maddigan Paula, Lensen Andrew, Xue Bing
- Are Self-explanations From Large Language Models Faithful? Madsen Andreas, Chandar Sarath, Reddy Siva
- XAI For All: Can Large Language Models Simplify Explainable AI? Mavrepis Philip, Makridis Georgios, Fatouros Georgios, Koukos Vasileios, Separdani Maria Margarita, Kyriazis Dimosthenis
- Towards Human Understanding Of Paraphrase Types In Chatgpt Meier Dominik, Wahle Jan Philip, Ruas Terry, Gipp Bela
- Soda-eval: Open-domain Dialogue Evaluation In The Age Of Llms Mendonça John, Trancoso Isabel, Lavie Alon
- Ecoh: Turn-level Coherence Evaluation For Multilingual Dialogues Mendonça John, Trancoso Isabel, Lavie Alon
- Is English The New Programming Language? How About Pseudo-code Engineering? Michaelsen Gian Alexandre, Santos Renato P. Dos
- A Philosophical Introduction To Language Models - Part II: The Way Forward Millière Raphaël, Buckner Cameron
- Memllm: Finetuning Llms To Use An Explicit Read-write Memory Modarressi Ali, Köksal Abdullatif, Imani Ayyoob, Fayyaz Mohsen, Schütze Hinrich
- RDBE: Reasoning Distillation-based Evaluation Enhances Automatic Essay Scoring Mohammadkhani Ali Ghiasvand
- Towards Generating Informative Textual Description For Neurons In Language Models Mondal Shrayani, Garodia Rishabh, Qureshi Arbaaz, Lee Taesung, Park Youngja
- Concise Thoughts: Impact Of Output Length On LLM Reasoning And Cost Nayab Sania, Rossolini Giulio, Buttazzo Giorgio, Manes Nicolamaria, Giacomelli Fabrizio
- Interpreting Context Look-ups In Transformers: Investigating Attention-mlp Interactions Neo Clement, Cohen Shay B., Barez Fazl
- Direct Evaluation Of Chain-of-thought In Multi-hop Reasoning With Knowledge Graphs Nguyen Minh-vuong, Luo Linhao, Shiri Fatemeh, Phung Dinh, Li Yuan-fang, Vu Thuy-trang, Haffari Gholamreza
- Beyond Metrics: Evaluating Llms' Effectiveness In Culturally Nuanced, Low-resource Real-world Scenarios Ochieng Millicent, Gumma Varun, Sitaram Sunayana, Wang Jindong, Chaudhary Vishrav, Ronen Keshet, Bali Kalika, O'neill Jacki
- Towards Cross-lingual Explanation Of Artwork In Large-scale Vision Language Models Ozaki Shintaro, Hayashi Kazuki, Sakai Yusuke, Kamigaito Hidetaka, Hayashi Katsuhiko, Watanabe Taro
- Unraveling The Dilemma Of AI Errors: Exploring The Effectiveness Of Human And Machine Explanations For Large Language Models Pafla Marvin, Larson Kate, Hancock Mark
- Towards More Trustworthy And Interpretable Llms For Code Through Syntax-grounded Explanations Palacio David N., Rodriguez-cardenas Daniel, Velasco Alejandro, Khati Dipin, Moran Kevin, Poshyvanyk Denys
- LLM Evaluators Recognize And Favor Their Own Generations Panickssery Arjun, Bowman Samuel R., Feng Shi
- Do Vision & Language Decoders Use Images And Text Equally? How Self-consistent Are Their Explanations? Parcalabescu Letitia, Frank Anette
- Revisiting Demonstration Selection Strategies In In-context Learning Peng Keqin, Ding Liang, Yuan Yancheng, Liu Xuebo, Zhang Min, Ouyang Yuanxin, Tao Dacheng
- Answer Is All You Need: Instruction-following Text Embedding Via Answering The Question Peng Letian, Zhang Yuwei, Wang Zilong, Srinivasa Jayanth, Liu Gaowen, Wang Zihan, Shang Jingbo
- Review-llm: Harnessing Large Language Models For Personalized Review Generation Peng Qiyao, Liu Hongtao, Xu Hongyan, Yang Qing, Shao Minglai, Wang Wenjun
- Uncertainty-aware Explainable Recommendation With Large Language Models Peng Yicui, Chen Hao, Lin Chingsheng, Huang Guo, Hu Jinrong, Guo Hui, Kong Bin, Hu Shu, Wu Xi, Wang Xin
- V-RECS, A Low-cost LLM4VIS Recommender With Explanations, Captioning And Suggestions Podo Luca, Angelini Marco, Velardi Paola
- Interpreting Bias In Large Language Models: A Feature-based Approach Prakash Nirmalendu, Roy Lee Ka Wei
- When Quantization Affects Confidence Of Large Language Models? Proskurina Irina, Brun Luc, Metzler Guillaume, Velcin Julien
- Crafting A Good Prompt Or Providing Exemplary Dialogues? A Study Of In-context Learning For Persona-based Dialogue Generation Pu Jiashu, Wan Yajing, Zhang Yuru, Chen Jing, Cheng Ling, Shao Qian, Chang Yongzhu, Lv Tangjie, Zhang Rongsheng
- Model Internals-based Answer Attribution For Trustworthy Retrieval-augmented Generation Qi Jirui, Sarti Gabriele, Fernández Raquel, Bisazza Arianna
- SNIFFER: Multimodal Large Language Model For Explainable Out-of-context Misinformation Detection Qi Peng, Yan Zehong, Hsu Wynne, Lee Mong Li
- LLM4SBR: A Lightweight And Effective Framework For Integrating Large Language Models In Session-based Recommendation Qiao Shutong, Gao Chen, Wen Junhao, Zhou Wei, Luo Qun, Chen Peixuan, Li Yong
- Exploring The Nexus Of Large Language Models And Legal Systems: A Short Survey Qin Weicong, Sun Zhongxiang
- Dissecting Multiplication In Transformers: Insights Into Llms Qiu Luyu, Li Jianing, Su Chi, Zhang Chen Jason, Chen Lei
- Scenarios And Approaches For Situated Natural Language Explanations Qiu Pengshuo, Rudzicz Frank, Zhu Zining
- Enhancing Ethical Explanations Of Large Language Models Through Iterative Symbolic Refinement Quan Xin, Valentino Marco, Dennis Louise A., Freitas André
- Verification And Refinement Of Natural Language Explanations Through Llm-symbolic Theorem Proving Quan Xin, Valentino Marco, Dennis Louise A., Freitas André
- An Investigation Of Neuron Activation As A Unified Lens To Explain Chain-of-thought Eliciting Arithmetic Reasoning Of Llms Rai Daking, Yao Ziyu
- Preference Distillation For Personalized Generative Recommendation Ramos Jerome, Wu Bin, Lipani Aldo
- Evaluating The Reliability Of Self-explanations In Large Language Models Randl Korbinian, Pavlopoulos John, Henriksson Aron, Lindgren Tony
- RAGE Against The Machine: Retrieval-augmented LLM Explanations Rorseth Joel, Godfrey Parke, Golab Lukasz, Srivastava Divesh, Szlichta Jaroslaw
- Achieving Peak Performance For Large Language Models: A Systematic Review Rostam Zhyar Rzgar K, Szénási Sándor, Kertész Gábor
- Procedural Adherence And Interpretability Through Neuro-symbolic Generative Agents Rothkopf Raven, Zeng Hannah Tongxin, Santolucito Mark
- Explorations Of Self-repair In Language Models Rushing Cody, Nanda Neel
- Evaluating Consistency And Reasoning Capabilities Of Large Language Models Saxena Yash, Chopra Sarthak, Tripathi Arunendra Mani
- Metacognitive Myopia In Large Language Models Scholten Florian, Rebholz Tobias R., Hütter Mandy
- A Critical Evaluation Of AI Feedback For Aligning Large Language Models Sharma Archit, Keh Sedrick, Mitchell Eric, Finn Chelsea, Arora Kushal, Kollar Thomas
- Why Would You Suggest That? Human Trust In Language Model Responses Sharma Manasi, Siu Ho Chit, Paleja Rohan, Peña Jaime D.
- Rumour Evaluation With Very Large Language Models Shehata Dahlia, Cohen Robin, Clarke Charles
- Can Language Models Explain Their Own Classification Behavior? Sherburn Dane, Chughtai Bilal, Evans Owain
- Latent Adversarial Training Improves Robustness To Persistent Harmful Behaviors In Llms Sheshadri Abhay, Ewart Aidan, Guo Phillip, Lynch Aengus, Wu Cindy, Hebbar Vivek, Sleight Henry, Stickland Asa Cooper, Perez Ethan, Hadfield-menell Dylan, Casper Stephen
- Rethinking Interpretability In The Era Of Large Language Models Singh Chandan, Inala Jeevana Priya, Galley Michel, Caruana Rich, Gao Jianfeng
- The Mysterious Case Of Neuron 1512: Injectable Realignment Architectures Reveal Internal Characteristics Of Meta's Llama 2 Model Smith Brenden, Baker Dallin, Chase Clayton, Barney Myles, Parker Kaden, Allred Makenna, Hu Peter, Evans Alex, Fulda Nancy
- Explaining Autonomy: Enhancing Human-robot Interaction Through Explanation Generation With Large Language Models Sobrín-hidalgo David, González-santamarta Miguel A., Guerrero-higueras Ángel M., Rodríguez-lera Francisco J., Matellán-olivera Vicente
- Generaitor: Tree-in-the-loop Text Generation For Language Model Explainability And Adaptation Spinner Thilo, Kehlbeck Rebecca, Sevastjanova Rita, Stähle Tobias, Keim Daniel A., Deussen Oliver, El-assady Mennatallah
- Fine-tuning Large Language Models For Entity Matching Steiner Aaron, Peeters Ralph, Bizer Christian
- The Calibration Gap Between Model And Human Confidence In Large Language Models Steyvers Mark, Tejeda Heliodoro, Kumar Aakriti, Belem Catarina, Karny Sheer, Hu Xinyue, Mayer Lukas, Smyth Padhraic
- Crafting Large Language Models For Enhanced Interpretability Sun Chung-en, Oikarinen Tuomas, Weng Tsui-wei
- Delrec: Distilling Sequential Pattern To Enhance Llm-based Recommendation Sun Guohao, Zhang Haoyi
- Fuxitranyu: A Multilingual Large Language Model Trained With Balanced Data Sun Haoran, Jin Renren, Xu Shaoyang, Pan Leiyu, Supryadi, Cui Menglong, Du Jiangcun, Lei Yikun, Yang Lei, Shi Ling, Xiao Juesi, Zhu Shaolin, Xiong Deyi
- Effectiveness Of Chatgpt In Explaining Complex Medical Reports To Patients Sun Mengxuan, Reiter Ehud, Kiltie Anne E, Ramsay George, Duncan Lisa, Murchie Peter, Adam Rosalind
- Large Language Models As Conversational Movie Recommenders: A User Study Sun Ruixuan, Li Xinyi, Akella Avinash, Konstan Joseph A.
- Chain-of-strategy Planning With Llms: Aligning The Generation Of Psychotherapy Dialogue With Strategy In Motivational Interviewing Sun Xin, Tang Xiao, Ali Abdallah El, Li Zhuying, Shen Xiaoyu, Ren Pengjie, De Wit Jan, Pei Jiahuan, Bosch Jos A.
- From Explanations To Action: A Zero-shot, Theory-driven LLM Framework For Student Performance Feedback Swamy Vinitra, Romano Davide, Desikan Bhargav Srinivasa, Camburu Oana-maria, Käser Tanja
- Improving Large Language Model (LLM) Fidelity Through Context-aware Grounding: A Systematic Approach To Reliability And Veracity Talukdar Wrick, Biswas Anjanava
- Tuning-free Accountable Intervention For LLM Deployment -- A Metacognitive Approach Tan Zhen, Peng Jie, Chen Tianlong, Liu Huan
- Tofueval: Evaluating Hallucinations Of Llms On Topic-focused Dialogue Summarization Tang Liyan, Shalyminov Igor, Wong Amy Wing-mei, Burnsky Jon, Vincent Jake W., Yang Yu'an, Singh Siffi, Feng Song, Song Hwanjun, Su Hang, Sun Lijia, Zhang Yi, Mansour Saab, Mckeown Kathleen
- On The Hardness Of Faithful Chain-of-thought Reasoning In Large Language Models Tanneru Sree Harsha, Ley Dan, Agarwal Chirag, Lakkaraju Himabindu
- From Feature Importance To Natural Language Explanations Using Llms With RAG Tekkesinoglu Sule, Kunze Lars
- Interactive Prompt Debugging With Sequence Salience Tenney Ian, Mullins Ryan, Du Bin, Pandya Shree, Kahng Minsuk, Dixon Lucas
- Can Chatgpt Make Explanatory Inferences? Benchmarks For Abductive Reasoning Thagard Paul
- Assessing Large Language Models In Mechanical Engineering Education: A Study On Mechanics-focused Conceptual Understanding Tian Jie, Hou Jixin, Wu Zihao, Shu Peng, Liu Zhengliang, Xiang Yujie, Gu Beikang, Filla Nicholas, Li Yiwei, Liu Ning, Chen Xianyan, Tang Keke, Liu Tianming, Wang Xianqiao
- Eyes Wide Shut? Exploring The Visual Shortcomings Of Multimodal Llms Tong Shengbang, Liu Zhuang, Zhai Yuexiang, Ma Yi, Lecun Yann, Xie Saining
- Optimizing Language Model's Reasoning Abilities With Weak Supervision Tong Yongqi, Wang Sizhe, Li Dawei, Wang Yifan, Han Simeng, Lin Zi, Huang Chengsong, Huang Jiaxin, Shang Jingbo
- Xtower: A Multilingual LLM For Explaining And Correcting Translation Errors Treviso Marcos, Guerreiro Nuno M., Agrawal Sweta, Rei Ricardo, Pombal José, Vaz Tania, Wu Helena, Silva Beatriz, Van Stigt Daan, Martins André F. T.
- LM Transparency Tool: Interactive Tool For Analyzing Transformer Language Models Tufanov Igor, Hambardzumyan Karen, Ferrando Javier, Voita Elena
- What's In An Embedding? Would A Rose By Any Embedding Smell As Sweet? Venkatasubramanian Venkat
- Cross-modal Projection In Multimodal Llms Doesn't Really Project Visual Attributes To Textual Space Verma Gaurav, Choi Minje, Sharma Kartik, Watson-daniels Jamelle, Oh Sejoon, Kumar Srijan
- Leveraging Zero-shot Prompting For Efficient Language Model Distillation Vöge Lukas, Gurgul Vincent, Lessmann Stefan
- DELL: Generating Reactions And Explanations For Llm-based Misinformation Detection Wan Herun, Feng Shangbin, Tan Zhaoxuan, Wang Heng, Tsvetkov Yulia, Luo Minnan
- Datavist5: A Pre-trained Language Model For Jointly Understanding Text And Data Visualization Wan Zhuoyue, Song Yuanfeng, Li Shuaimin, Zhang Chen Jason, Wong Raymond Chi-wing
- Task Supportive And Personalized Human-large Language Model Interaction: A User Study Wang Ben, Liu Jiqun, Karimnazarov Jamshed, Thompson Nicolas
- ANGO: A Next-level Evaluation Benchmark For Generation-oriented Language Models In Chinese Domain Wang Bingchao
- Halu-j: Critique-based Hallucination Judge Wang Binjie, Chern Steffi, Chern Ethan, Liu Pengfei
- Direct: Diagnostic Reasoning For Clinical Notes Via Large Language Models Wang Bowen, Chang Jiuyang, Qian Yiming, Chen Guoxin, Chen Junhao, Jiang Zhouqiang, Zhang Jiahao, Nakashima Yuta, Nagahara Hajime
- A Survey For Large Language Models In Biomedicine Wang Chong, Li Mengyao, He Junjun, Wang Zhongruo, Darzi Erfan, Chen Zan, Ye Jin, Li Tianbin, Su Yanzhou, Ke Jing, Qu Kaili, Li Shuxin, Yu Yi, Liò Pietro, Wang Tianyun, Wang Yu Guang, Shen Yiqing
- Interpretable Preferences Via Multi-objective Reward Modeling And Mixture-of-experts Wang Haoxiang, Xiong Wei, Xie Tengyang, Zhao Han, Zhang Tong
- Helpful Or Harmful Data? Fine-tuning-free Shapley Attribution For Explaining Language Model Predictions Wang Jingtan, Lin Xiaoqiang, Qiao Rui, Foo Chuan-sheng, Low Bryan Kian Hsiang
- Cross-refine: Improving Natural Language Explanation Generation By Learning In Tandem Wang Qianli, Anikina Tatiana, Feldhus Nils, Ostermann Simon, Möller Sebastian, Schmitt Vera
- Deconfounded Causality-aware Parameter-efficient Fine-tuning For Problem-solving Improvement Of Llms Wang Ruoyu, Li Xiaoxuan, Yao Lina
- What Is The Best Way For Chatgpt To Translate Poetry? Wang Shanshan, Wong Derek F., Yao Jingming, Chao Lidia S.
- UBENCH: Benchmarking Uncertainty In Large Language Models With Multiple Choice Questions Wang Xunzhi, Zhang Zhuowei, Li Qiongyu, Chen Gaonan, Hu Mengting, Li Zhiyu, Luo Bitong, Gao Hang, Han Zhixin, Wang Haotian
- Taste: Teaching Large Language Models To Translate Through Self-reflection Wang Yutong, Zeng Jiali, Liu Xuebo, Meng Fandong, Zhou Jie, Zhang Min
- Stop Reasoning! When Multimodal Llms With Chain-of-thought Reasoning Meets Adversarial Images Wang Zefeng, Han Zhen, Chen Shuo, Xue Fan, Ding Zifeng, Xiao Xun, Tresp Volker, Torr Philip, Gu Jindong
- Absinstruct: Eliciting Abstraction Ability From Llms Through Explanation Tuning With Plausibility Estimation Wang Zhaowei, Fan Wei, Zong Qing, Zhang Hongming, Choi Sehyun, Fang Tianqing, Liu Xin, Song Yangqiu, Wong Ginny Y., See Simon
- A Comprehensive Survey Of LLM Alignment Techniques: RLHF, RLAIF, PPO, DPO And More Wang Zhichao, Bi Bin, Pentyala Shiva Kumar, Ramnath Kiran, Chaudhuri Sougata, Mehrotra Shubham, Zhu Zixu (James), Mao Xiang-bo, Asur Sitaram, Cheng Na (Claire)
- Promptcharm: Text-to-image Generation Through Multi-modal Prompting And Refinement Wang Zhijie, Huang Yuheng, Song Da, Ma Lei, Zhang Tianyi
- Systematic Evaluation Of Llm-as-a-judge In LLM Alignment Tasks: Explainable Metrics And Diverse Prompt Templates Wei Hui, He Shenghua, Xia Tian, Wong Andy, Lin Jingyang, Han Mei
- Towards Evaluating And Building Versatile Large Language Models For Medicine Wu Chaoyi, Qiu Pengcheng, Liu Jinxin, Gu Hongfei, Li Na, Zhang Ya, Wang Yanfeng, Xie Weidi
- Uncertainty Estimation Of Large Language Models In Medical Question Answering Wu Jiaxin, Yu Yizhou, Zhou Hong-yu
- Controlmllm: Training-free Visual Prompt Learning For Multimodal Large Language Models Wu Mingrui, Cai Xinyue, Ji Jiayi, Li Jiale, Huang Oucheng, Luo Gen, Fei Hao, Sun Xiaoshuai, Ji Rongrong
- Do Language Models Plan Ahead For Future Tokens? Wu Wilson, Morris John X., Levine Lionel
- EMP: Enhance Memory In Data Pruning Xiao Jinying, Li Ping, Nie Jie, Tang Zhe
- Videoqa In The Era Of Llms: An Empirical Study Xiao Junbin, Huang Nanxin, Qin Hangyu, Li Dongyang, Li Yicong, Zhu Fengbin, Tao Zhulin, Yu Jianxing, Lin Liang, Chua Tat-seng, Yao Angela
- Verbalized Machine Learning: Revisiting Machine Learning With Language Models Xiao Tim Z., Bamler Robert, Schölkopf Bernhard, Liu Weiyang
- Leveraging Human Revisions For Improving Text-to-layout Models Xie Amber, Cheng Chin-yi, Huang Forrest, Li Yang
- Meaningful Learning: Advancing Abstract Reasoning In Large Language Models Via Generic Fact Guidance Xiong Kai, Ding Xiao, Liu Ting, Qin Bing, Xu Dongliang, Yang Qing, Liu Hongtao, Cao Yixin
- Unveil The Duality Of Retrieval-augmented Generation: Theoretical Analysis And Practical Solution Xu Shicheng, Pang Liang, Shen Huawei, Cheng Xueqi
- Exploring The LLM Journey From Cognition To Expression With Linear Representations Yan Yuzi, Li Jialian, Zhang Yipin, Yan Dong
- 3d-properties: Identifying Challenges In DPO And Charting A Path Forward Yan Yuzi, Miao Yibo, Li Jialian, Zhang Yipin, Xie Jian, Deng Zhijie, Yan Dong
- Report Cards: Qualitative Evaluation Of Language Models Using Natural Language Summaries Yang Blair, Cui Fuyang, Paster Keiran, Ba Jimmy, Vaezipoor Pashootan, Pitis Silviu, Zhang Michael R.
- MAPLE: Enhancing Review Generation With Multi-aspect Prompt Learning In Explainable Recommendation Yang Ching-wen, Chen Che Wei, Wu Kun-da, Xu Hao, Yao Jui-feng, Kao Hung-yu
- IM-RAG: Multi-round Retrieval-augmented Generation Through Learning Inner Monologues Yang Diji, Rao Jinmeng, Chen Kezhen, Guo Xiaoyuan, Zhang Yawen, Yang Jie, Zhang Yi
- Dissociation Of Faithful And Unfaithful Reasoning In Llms Yee Evelyn, Li Alice, Tang Chenyu, Jung Yeon Ho, Paturi Ramamohan, Bergen Leon
- How Interpretable Are Reasoning Explanations From Prompting Large Language Models? Yeo Wei Jie, Satapathy Ranjan, Goh Rick Siow Mong, Cambria Erik
- Lofit: Localized Fine-tuning On LLM Representations Yin Fangcong, Ye Xi, Durrett Greg
- MMAU: A Holistic Benchmark Of Agent Capabilities Across Diverse Domains Yin Guoli, Bai Haoping, Ma Shuang, Nan Feng, Sun Yanchao, Xu Zhaoyang, Ma Shen, Lu Jiarui, Kong Xiang, Zhang Aonan, Yap Dian Ang, Zhang Yizhe, Ahnert Karsten, Kamath Vik, Berglund Mathias, Walsh Dominic, Gindele Tobias, Wiest Juergen, Lai Zhengfeng, Wang Xiaoming, Shan Jiulong, Cao Meng, Pang Ruoming, Wang Zirui
- SEA: Supervised Embedding Alignment For Token-level Visual-textual Integration In Mllms Yin Yuanyang, Zhao Yaqi, Zhang Yajie, Lin Ke, Wang Jiahao, Tao Xin, Wan Pengfei, Zhang Di, Yin Baoqun, Zhang Wentao
- Exploring The Comprehension Of Chatgpt In Traditional Chinese Medicine Knowledge Li Yizhen, Huang Shaohan, Qi Jiaxing, Quan Lei, Han Dongran, Luan Zhongzhi
- Mechanistic Understanding And Mitigation Of Language Model Non-factual Hallucinations Yu Lei, Cao Meng, Cheung Jackie Chi Kit, Dong Yue
- Do Large Language Models Understand Conversational Implicature -- A Case Study With A Chinese Sitcom Yue Shisen, Song Siyuan, Cheng Xinyuan, Hu Hai
- Tell Me Why: Explainable Public Health Fact-checking With Large Language Models Zarharan Majid, Wullschleger Pascal, Kia Babak Behkam, Pilehvar Mohammad Taher, Foster Jennifer
- MC-GPT: Empowering Vision-and-language Navigation With Memory Map And Reasoning Chains Zhan Zhaohuan, Yu Lisha, Yu Sijie, Tan Guang
- Read And Think: An Efficient Step-wise Multimodal Language Model For Document Understanding And Reasoning Zhang Jinxu
- Instruct Large Language Models To Drive Like Humans Zhang Ruijun, Guo Xianda, Zheng Wenzhao, Zhang Chenming, Keutzer Kurt, Chen Long
- Getting More From Less: Large Language Models Are Good Spontaneous Multilingual Learners Zhang Shimao, Gao Changjiang, Zhu Wenhao, Chen Jiajun, Huang Xin, Han Xue, Feng Junlan, Deng Chao, Huang Shujian
- Tired Of Plugins? Large Language Models Can Be End-to-end Recommenders Zhang Wenlin, Wu Chuhan, Li Xiangyang, Wang Yuhao, Dong Kuicai, Wang Yichao, Dai Xinyi, Zhao Xiangyu, Guo Huifeng, Tang Ruiming
- From Redundancy To Relevance: Enhancing Explainability In Multimodal Large Language Models Zhang Xiaofeng, Shen Chen, Yuan Xiaosong, Yan Shaotian, Xie Liang, Wang Wenxiao, Gu Chaochen, Tang Hao, Ye Jieping
- Large Language Models As Evaluators For Recommendation Explanations Zhang Xiaoyu, Li Yishan, Wang Jiayin, Sun Bowen, Ma Weizhi, Sun Peijie, Zhang Min
- ELAD: Explanation-guided Large Language Models Active Distillation Zhang Yifei, Pan Bo, Ling Chen, Hu Yuntong, Zhao Liang
- Navigating User Experience Of Chatgpt-based Conversational Recommender Systems: The Effects Of Prompt Guidance And Recommendation Domain Zhang Yizhe, Jin Yucheng, Chen Li, Yang Ting
- Shieldlm: Empowering Llms As Aligned, Customizable And Explainable Safety Detectors Zhang Zhexin, Lu Yida, Ma Jingyuan, Zhang Di, Li Rui, Ke Pei, Sun Hao, Sha Lei, Sui Zhifang, Wang Hongning, Huang Minlie
- Glitchprober: Advancing Effective Detection And Mitigation Of Glitch Tokens In Large Language Models Zhang Zhibo, Bai Wuxia, Li Yuxi, Meng Mark Huasong, Wang Kailong, Shi Ling, Li Li, Wang Jun, Wang Haoyu
- LANE: Logic Alignment Of Non-tuning Large Language Models And Online Recommendation Systems For Explainable Reason Generation Zhao Hongke, Zheng Songming, Wu Likang, Yu Bowen, Wang Jing
- Attention Heads Of Large Language Models: A Survey Zheng Zifan, Wang Yezhaohui, Huang Yuxin, Song Shichao, Tang Bo, Xiong Feiyu, Li Zhiyu
- Language-based User Profiles For Recommendation Zhou Joyce, Dai Yijia, Joachims Thorsten
- An LLM Feature-based Framework For Dialogue Constructiveness Assessment Zhou Lexin, Farag Youmna, Vlachos Andreas
- Image-of-thought Prompting For Visual Reasoning Refinement In Multimodal Large Language Models Zhou Qiji, Zhou Ruochen, Hu Zike, Lu Panzhong, Gao Siyang, Zhang Yue
- Evaluating The Smooth Control Of Attribute Intensity In Text Generation With Llms Zhou Shang, Yao Feng, Dong Chengyu, Wang Zihan, Shang Jingbo
- Explaining Pre-trained Language Models With Attribution Scores: An Analysis In Low-resource Settings Zhou Wei, Adel Heike, Schuff Hendrik, Vu Ngoc Thang
- How Alignment And Jailbreak Work: Explain LLM Safety Through Intermediate Hidden States Zhou Zhenhong, Yu Haiyang, Zhang Xinghua, Xu Rongwu, Huang Fei, Li Yongbin
- Benchmarking Large Language Models On CFLUE -- A Chinese Financial Language Understanding Evaluation Dataset Zhu Jie, Li Junhui, Wen Yalong, Guo Lifan
- Multilingual Contrastive Decoding Via Language-agnostic Layers Skipping Zhu Wenhao, Liu Sizhe, Huang Shujian, She Shuaijie, Wendler Chris, Chen Jiajun
- Efficient Test-time Prompt Tuning For Vision-language Models Zhu Yuhan, Zhang Guozhen, Xu Chen, Shen Haocheng, Chen Xiaoxin, Wu Gangshan, Wang Limin
- Llms For XAI: Future Directions For Explaining Explanations Zytek Alexandra, Pidò Sara, Veeramachaneni Kalyan
🏷 Language Modeling
- Frustratingly Short Attention Spans In Neural Language Modeling Daniluk Michał, Rocktäschel Tim, Welbl Johannes, Riedel Sebastian
- Table-to-text Generation By Structure-aware Seq2seq Learning Liu Tianyu, Wang Kexiang, Sha Lei, Chang Baobao, Sui Zhifang
- Recurrent And Contextual Models For Visual Question Answering Sharang Abhijit, Lau Eric
- Neural Text Generation: A Practical Guide Xie Ziang
- SALSA-TEXT: Self Attentive Latent Space Based Adversarial Text Generation Gagnon-marchand Jules, Sadeghi Hamed, Haidar Md. Akmal, Rezagholizadeh Mehdi
- Like A Baby: Visually Situated Neural Language Acquisition Ororbia Alexander G., Mali Ankur, Kelly Matthew A., Reitter David
- Can You Tell Me How To Get Past Sesame Street? Sentence-level Pretraining Beyond Language Modeling Wang Alex, Hula Jan, Xia Patrick, Pappagari Raghavendra, Mccoy R. Thomas, Patel Roma, Kim Najoung, Tenney Ian, Huang Yinghui, Yu Katherin, Jin Shuning, Chen Berlin, Van Durme Benjamin, Grave Edouard, Pavlick Ellie, Bowman Samuel R.
- Learning Neural Templates For Text Generation Wiseman Sam, Shieber Stuart M., Rush Alexander M.
- The Unreasonable Effectiveness Of Transformer Language Models In Grammatical Error Correction Alikaniotis Dimitrios, Raheja Vipul
- Distilling Knowledge Learned In BERT For Text Generation Chen Yen-chun, Gan Zhe, Cheng Yu, Liu Jingzhou, Liu Jingjing
- Transformer-xl: Attentive Language Models Beyond A Fixed-length Context Dai Zihang, Yang Zhilin, Yang Yiming, Carbonell Jaime, Le Quoc V., Salakhutdinov Ruslan
- Unified Language Model Pre-training For Natural Language Understanding And Generation Dong Li, Yang Nan, Wang Wenhui, Wei Furu, Liu Xiaodong, Wang Yu, Gao Jianfeng, Zhou Ming, Hon Hsiao-wuen
- ELI5: Long Form Question Answering Fan Angela, Jernite Yacine, Perez Ethan, Grangier David, Weston Jason, Auli Michael
- Neural Data-to-text Generation: A Comparison Between Pipeline And End-to-end Architectures Ferreira Thiago Castro, Van Der Lee Chris, Van Miltenburg Emiel, Krahmer Emiel
- Resurrecting Submodularity For Neural Text Generation Han Simeng, Lin Xiang, Joty Shafiq
- Exposure Bias Versus Self-recovery: Are Distortions Really Incremental For Autoregressive Text Generation? He Tianxing, Zhang Jingzhao, Zhou Zhiming, Glass James
- Amr-to-text Generation With Cache Transition Systems Jin Lisa, Gildea Daniel
- CTRL: A Conditional Transformer Language Model For Controllable Generation Keskar Nitish Shirish, Mccann Bryan, Varshney Lav R., Xiong Caiming, Socher Richard
- Patent Claim Generation By Fine-tuning Openai GPT-2 Lee Jieh-sheng, Hsiang Jieh
- BART: Denoising Sequence-to-sequence Pre-training For Natural Language Generation, Translation, And Comprehension Lewis Mike, Liu Yinhan, Goyal Naman, Ghazvininejad Marjan, Mohamed Abdelrahman, Levy Omer, Stoyanov Ves, Zettlemoyer Luke
- Unicoder-vl: A Universal Encoder For Vision And Language By Cross-modal Pre-training Li Gen, Duan Nan, Fang Yuejian, Gong Ming, Jiang Daxin, Zhou Ming
- Big Bidirectional Insertion Representations For Documents Li Lala, Chan William
- Relevance-promoting Language Model For Short-text Conversation Li Xin, Li Piji, Bi Wei, Liu Xiaojiang, Lam Wai
- Probing Representations Learned By Multimodal Recurrent And Transformer Models Libovický Jindřich, Madhyastha Pranava
- Commongen: A Constrained Text Generation Challenge For Generative Commonsense Reasoning Lin Bill Yuchen, Zhou Wangchunshu, Shen Ming, Zhou Pei, Bhagavatula Chandra, Choi Yejin, Ren Xiang
- Megatron-lm: Training Multi-billion Parameter Language Models Using Model Parallelism Shoeybi Mohammad, Patwary Mostofa, Puri Raul, Legresley Patrick, Casper Jared, Catanzaro Bryan
- Neural Assistant: Joint Action Prediction, Response Generation, And Latent Knowledge Reasoning Neelakantan Arvind, Yavuz Semih, Narang Sharan, Prasad Vishaal, Goodrich Ben, Duckworth Daniel, Sankar Chinnadhurai, Yan Xifeng
- Stabilizing Transformers For Reinforcement Learning Parisotto Emilio, Song H. Francis, Rae Jack W., Pascanu Razvan, Gulcehre Caglar, Jayakumar Siddhant M., Jaderberg Max, Kaufman Raphael Lopez, Clark Aidan, Noury Seb, Botvinick Matthew M., Heess Nicolas, Hadsell Raia
- Text Generation With Exemplar-based Adaptive Decoding Peng Hao, Parikh Ankur P., Faruqui Manaal, Dhingra Bhuwan, Das Dipanjan
- Distilbert, A Distilled Version Of BERT: Smaller, Faster, Cheaper And Lighter Sanh Victor, Debut Lysandre, Chaumond Julien, Wolf Thomas
- Generalization In Generation: A Closer Look At Exposure Bias Schmidt Florian
- Pragmatically Informative Text Generation Shen Sheng, Fried Daniel, Andreas Jacob, Klein Dan
- Insertion Transformer: Flexible Sequence Generation Via Insertion Operations Stern Mitchell, Chan William, Kiros Jamie, Uszkoreit Jakob
- Adaptive Attention Span In Transformers Sukhbaatar Sainbayar, Grave Edouard, Bojanowski Piotr, Joulin Armand
- Augmenting Self-attention With Persistent Memory Sukhbaatar Sainbayar, Grave Edouard, Lample Guillaume, Jegou Herve, Joulin Armand
- LXMERT: Learning Cross-modality Encoder Representations From Transformers Tan Hao, Bansal Mohit
- What Do You Learn From Context? Probing For Sentence Structure In Contextualized Word Representations Tenney Ian, Xia Patrick, Chen Berlin, Wang Alex, Poliak Adam, Mccoy R Thomas, Kim Najoung, Van Durme Benjamin, Bowman Samuel R., Das Dipanjan, Pavlick Ellie
- Allennlp Interpret: A Framework For Explaining Predictions Of NLP Models Wallace Eric, Tuyls Jens, Wang Junlin, Subramanian Sanjay, Gardner Matt, Singh Sameer
- Language Models With Transformers Wang Chenguang, Li Mu, Smola Alexander J.
- Tree Transformer: Integrating Tree Structures Into Self-attention Wang Yau-shian, Lee Hung-yi, Chen Yun-nung
- Structured Pruning Of Large Language Models Wang Ziheng, Wohlwend Jeremy, Lei Tao
- Bp-transformer: Modelling Long-range Context Via Binary Partitioning Ye Zihao, Guo Qipeng, Gan Quan, Qiu Xipeng, Zhang Zheng
- Bertscore: Evaluating Text Generation With BERT Zhang Tianyi, Kishore Varsha, Wu Felix, Weinberger Kilian Q., Artzi Yoav
- Multi-task Learning With Language Modeling For Question Generation Zhou Wenjie, Zhang Minghua, Wu Yunfang
- Encoder-agnostic Adaptation For Conditional Language Generation Ziegler Zachary M., Melas-kyriazi Luke, Gehrmann Sebastian, Rush Alexander M.
- Paranoid Transformer: Reading Narrative Of Madness As Computational Approach To Creativity Agafonova Yana, Tikhonov Alexey, Yamshchikov Ivan P.
- Segatron: Segment-aware Transformer For Language Modeling And Understanding Bai He, Shi Peng, Lin Jimmy, Xie Yuqing, Tan Luchen, Xiong Kun, Gao Wen, Li Ming
- Unilmv2: Pseudo-masked Language Models For Unified Language Model Pre-training Bao Hangbo, Dong Li, Wei Furu, Wang Wenhui, Yang Nan, Liu Xiaodong, Wang Yu, Piao Songhao, Gao Jianfeng, Zhou Ming, Hon Hsiao-wuen
- Longformer: The Long-document Transformer Beltagy Iz, Peters Matthew E., Cohan Arman
- Memory Transformer Burtsev Mikhail S., Kuratov Yuri, Peganov Anton, Sapunov Grigory V.
- Syntax-driven Iterative Expansion Language Models For Controllable Text Generation Casas Noe, Fonollosa José A. R., Costa-jussà Marta R.
- Residual Energy-based Models For Text Generation Deng Yuntian, Bakhtin Anton, Ott Myle, Szlam Arthur, Ranzato Marc'aurelio
- Cascaded Text Generation With Markov Transformers Deng Yuntian, Rush Alexander M.
- Ernie-doc: A Retrospective Long-document Modeling Transformer Ding Siyu, Shang Junyuan, Wang Shuohuan, Sun Yu, Tian Hao, Wu Hua, Wang Haifeng
- Injecting Entity Types Into Entity-guided Text Generation Dong Xiangyu, Yu Wenhao, Zhu Chenguang, Jiang Meng
- A Comparison Of LSTM And BERT For Small Corpus Ezen-can Aysu
- Addressing Some Limitations Of Transformers With Feedback Memory Fan Angela, Lavril Thibaut, Grave Edouard, Joulin Armand, Sukhbaatar Sainbayar
- Robust Conversational AI With Grounded Text Generation Gao Jianfeng, Peng Baolin, Li Chunyuan, Li Jinchao, Shayandeh Shahin, Liden Lars, Shum Heung-yeung
- The Pile: An 800GB Dataset Of Diverse Text For Language Modeling Gao Leo, Biderman Stella, Black Sid, Golding Laurence, Hoppe Travis, Foster Charles, Phang Jason, He Horace, Thite Anish, Nabeshima Noa, Presser Shawn, Leahy Connor
- Incorporating BERT Into Parallel Sequence Decoding With Adapters Guo Junliang, Zhang Zhirui, Xu Linli, Wei Hao-ran, Chen Boxing, Chen Enhong
- GMAT: Global Memory Augmentation For Transformers Gupta Ankit, Berant Jonathan
- PAIR: Planning And Iterative Refinement In Pre-trained Transformers For Long Text Generation Hua Xinyu, Wang Lu
- Text-to-text Pre-training For Data-to-text Tasks Kale Mihir, Rastogi Abhinav
- Meta-context Transformers For Domain-specific Response Generation Kar Debanjana, Samanta Suranjana, Azad Amar Prakash
- Current Limitations Of Language Models: What You Need Is Retrieval Komatsuzaki Aran
- Utilizing Bidirectional Encoder Representations From Transformers For Answer Selection Laskar Md Tahmid Rahman, Hoque Enamul, Huang Jimmy Xiangji
- From Zero To Hero: On The Limitations Of Zero-shot Cross-lingual Transfer With Multilingual Transformers Lauscher Anne, Ravishankar Vinit, Vulić Ivan, Glavaš Goran
- SLM: Learning A Discourse Language Representation With Sentence Unshuffling Lee Haejun, Hudson Drew A., Lee Kangwook, Manning Christopher D.
- Transformers To Learn Hierarchical Contexts In Multiparty Dialogue For Span-based Question Answering Li Changmao, Choi Jinho D.
- Optimus: Organizing Sentences Via Pre-trained Modeling Of A Latent Space Li Chunyuan, Gao Xiang, Li Yuan, Peng Baolin, Li Xiujun, Zhang Yizhe, Gao Jianfeng
- An Empirical Investigation Of Pre-trained Transformer Language Models For Open-domain Dialogue Generation Li Piji
- Multilingual Speech Translation With Efficient Finetuning Of Pretrained Models Li Xian, Wang Changhan, Tang Yun, Tran Chau, Tang Yuqing, Pino Juan, Baevski Alexei, Conneau Alexis, Auli Michael
- Probabilistically Masked Language Model Capable Of Autoregressive Generation In Arbitrary Word Order Liao Yi, Jiang Xin, Liu Qun
- Autoregressive Knowledge Distillation Through Imitation Learning Lin Alexander, Wohlwend Jeremy, Chen Howard, Lei Tao
- KG-BART: Knowledge Graph-augmented BART For Generative Commonsense Reasoning Liu Ye, Wan Yao, He Lifang, Peng Hao, Yu Philip S.
- Gpt-too: A Language-model-first Approach For Amr-to-text Generation Mager Manuel, Astudillo Ramon Fernandez, Naseem Tahira, Sultan Md Arafat, Lee Young-suk, Florian Radu, Roukos Salim
- Verb Knowledge Injection For Multilingual Event Processing Majewska Olga, Vulić Ivan, Glavaš Goran, Ponti Edoardo M., Korhonen Anna
- Delight: Deep And Light-weight Transformer Mehta Sachin, Ghazvininejad Marjan, Iyer Srinivasan, Zettlemoyer Luke, Hajishirzi Hannaneh
- Modern Methods For Text Generation Montesinos Dimas Munoz
- Stepwise Extractive Summarization And Planning With Structured Transformers Narayan Shashi, Maynez Joshua, Adamek Jakub, Pighin Daniele, Bratanič Blaž, Mcdonald Ryan
- Better Distractions: Transformer-based Distractor Generation And Multiple Choice Question Filtering Offerijns Jeroen, Verberne Suzan, Verhoef Tessa
- Facts2story: Controlling Text Generation By Key Facts Orbach Eyal, Goldberg Yoav
- Multilingual BERT Post-pretraining Alignment Pan Lin, Hang Chung-wei, Qi Haode, Shah Abhishek, Potdar Saloni, Yu Mo
- How To Evaluate Your Dialogue System: Probe Tasks As An Alternative For Token-level Evaluation Metrics Parthasarathi Prasanna, Pineau Joelle, Chandar Sarath
- Investigating Pretrained Language Models For Graph-to-text Generation Ribeiro Leonardo F. R., Schmitt Martin, Schütze Hinrich, Gurevych Iryna
- Knowledge-aware Language Model Pretraining Rosset Corby, Xiong Chenyan, Phan Minh, Song Xia, Bennett Paul, Tiwary Saurabh
- What BERT Sees: Cross-modal Transfer For Visual Question Generation Scialom Thomas, Bordes Patrick, Dray Paul-alexis, Staiano Jacopo, Gallinari Patrick
- BLEURT: Learning Robust Metrics For Text Generation Sellam Thibault, Das Dipanjan, Parikh Ankur P.
- Neural Data-to-text Generation Via Jointly Learning The Segmentation And Correspondence Shen Xiaoyu, Chang Ernie, Su Hui, Zhou Jie, Klakow Dietrich
- Lightpaff: A Two-stage Distillation Framework For Pre-training And Fine-tuning Song Kaitao, Sun Hao, Tan Xu, Qin Tao, Lu Jianfeng, Liu Hongzhi, Liu Tie-yan
- Multi-scale Transformer Language Models Subramanian Sandeep, Collobert Ronan, Ranzato Marc'aurelio, Boureau Y-lan
- Synthesizer: Rethinking Self-attention In Transformer Models Tay Yi, Bahri Dara, Metzler Donald, Juan Da-cheng, Zhao Zhe, Zheng Che
- The Language Interpretability Tool: Extensible, Interactive Visualizations And Analysis For NLP Models Tenney Ian, Wexler James, Bastings Jasmijn, Bolukbasi Tolga, Coenen Andy, Gehrmann Sebastian, Jiang Ellen, Pushkarna Mahima, Radebaugh Carey, Reif Emily, Yuan Ann
- Unsupervised Pretraining For Neural Machine Translation Using Elastic Weight Consolidation Variš Dušan, Bojar Ondřej
- Towards Faithful Neural Table-to-text Generation With Content-matching Constraints Wang Zhenyi, Wang Xiaoyang, An Bang, Yu Dong, Chen Changyou
- TOD-BERT: Pre-trained Natural Language Understanding For Task-oriented Dialogue Wu Chien-sheng, Hoi Steven, Socher Richard, Xiong Caiming
- Lite Transformer With Long-short Range Attention Wu Zhanghao, Liu Zhijian, Lin Ji, Lin Yujun, Han Song
- CG-BERT: Conditional Text Generation With BERT For Generalized Few-shot Intent Detection Xia Congying, Zhang Chenwei, Nguyen Hoang, Zhang Jiawei, Yu Philip
- MC-BERT: Efficient Language Pre-training Via A Meta Controller Xu Zhenhui, Gong Linyuan, Ke Guolin, He Di, Zheng Shuxin, Wang Liwei, Bian Jiang, Liu Tie-yan
- Accelerating Training Of Transformer-based Language Models With Progressive Layer Dropping Zhang Minjia, He Yuxiong
- Improving Adversarial Text Generation By Modeling The Distant Future Zhang Ruiyi, Chen Changyou, Gan Zhe, Wang Wenlin, Shen Dinghan, Wang Guoyin, Wen Zheng, Carin Lawrence
- POINTER: Constrained Progressive Text Generation Via Insertion-based Generative Pre-training Zhang Yizhe, Wang Guoyin, Li Chunyuan, Gan Zhe, Brockett Chris, Dolan Bill
- On The Multilingual Capabilities Of Very Large-scale English Language Models Armengol-estapé Jordi, Bonet Ona De Gibert, Melero Maite
- Efficient Large Scale Language Modeling With Mixtures Of Experts Artetxe Mikel, Bhosale Shruti, Goyal Naman, Mihaylov Todor, Ott Myle, Shleifer Sam, Lin Xi Victoria, Du Jingfei, Iyer Srinivasan, Pasunuru Ramakanth, Anantharaman Giri, Li Xian, Chen Shuohui, Akin Halil, Baines Mandeep, Martin Louis, Zhou Xing, Koura Punit Singh, O'horo Brian, Wang Jeff, Zettlemoyer Luke, Diab Mona, Kozareva Zornitsa, Stoyanov Ves
- Non-autoregressive Translation By Learning Target Categorical Codes Bao Yu, Huang Shujian, Xiao Tong, Wang Dongqi, Dai Xinyu, Chen Jiajun
- Exploring Unsupervised Pretraining Objectives For Machine Translation Baziotis Christos, Titov Ivan, Birch Alexandra, Haddow Barry
- Gaudí: Conversational Interactions With Deep Representations To Generate Image Collections Bursztyn Victor S., Healey Jennifer, Vinay Vishwa
- CDLM: Cross-document Language Modeling Caciularu Avi, Cohan Arman, Beltagy Iz, Peters Matthew E., Cattan Arie, Dagan Ido
- Curriculum Learning For Language Modeling Campos Daniel
- The Selectgen Challenge: Finding The Best Training Samples For Few-shot Neural Text Generation Chang Ernie, Shen Xiaoyu, Marin Alex, Demberg Vera
- On Training Instance Selection For Few-shot Neural Text Generation Chang Ernie, Shen Xiaoyu, Yeh Hui-syuan, Demberg Vera
- Neural Data-to-text Generation With Lm-based Text Augmentation Chang Ernie, Shen Xiaoyu, Zhu Dawei, Demberg Vera, Su Hui
- Reinforcement Learning For Few-shot Text Generation Adaptation Cheng Pengsen, Dai Jinqiao, Liu Jiamiao, Liu Jiayong, Jia Peng
- Language Modeling Using Lmus: 10x Better Data Efficiency Or Improved Scaling Compared To Transformers Chilkuri Narsimha, Hunsberger Eric, Voelker Aaron, Malik Gurshaant, Eliasmith Chris
- Unifying Vision-and-language Tasks Via Text Generation Cho Jaemin, Lei Jie, Tan Hao, Bansal Mohit
- Long-range Modeling Of Source Code Files With Ewash: Extended Window Access By Syntax Hierarchy Clement Colin B., Lu Shuai, Liu Xiaoyu, Tufano Michele, Drain Dawn, Duan Nan, Sundaresan Neel, Svyatkovskiy Alexey
- Control Prefixes For Parameter-efficient Text Generation Clive Jordan, Cao Kris, Rei Marek
- Zero-shot Recommendation As Language Modeling Damien Sileo, Wout Vossen, Robbe Raymaekers
- Multitask Finetuning For Improving Neural Machine Translation In Indian Languages Desai Shaily, Kshirsagar Atharva, Marathe Manisha
- Openprompt: An Open-source Framework For Prompt-learning Ding Ning, Hu Shengding, Zhao Weilin, Chen Yulin, Liu Zhiyuan, Zheng Hai-tao, Sun Maosong
- On-the-fly Attention Modulation For Neural Generation Dong Yue, Bhagavatula Chandra, Lu Ximing, Hwang Jena D., Bosselut Antoine, Cheung Jackie Chi Kit, Choi Yejin
- GLM: General Language Model Pretraining With Autoregressive Blank Infilling Du Zhengxiao, Qian Yujie, Liu Xiao, Ding Ming, Qiu Jiezhong, Yang Zhilin, Tang Jie
- SAPPHIRE: Approaches For Enhanced Concept-to-text Generation Feng Steven Y., Huynh Jessica, Narisetty Chaitanya, Hovy Eduard, Gangal Varun
- Retrieve, Caption, Generate: Visual Grounding For Enhancing Commonsense In Text Generation Models Feng Steven Y., Lu Kevin, Tao Zhuofu, Alikhani Malihe, Mitamura Teruko, Hovy Eduard, Gangal Varun
- Larger-scale Transformers For Multilingual Masked Language Modeling Goyal Naman, Du Jingfei, Ott Myle, Anantharaman Giri, Conneau Alexis
- Long Text Generation By Modeling Sentence-level And Discourse-level Coherence Guan Jian, Mao Xiaoxi, Fan Changjie, Liu Zitao, Ding Wenbiao, Huang Minlie
- Controllable Dialogue Generation With Disentangled Multi-grained Style Specification And Attribute Consistency Reward Hu Zhe, Cao Zhiwei, Chan Hou Pong, Liu Jiachen, Xiao Xinyan, Su Jinsong, Wu Hua
- Non-autoregressive Translation With Layer-wise Prediction And Deep Supervision Huang Chenyang, Zhou Hao, Zaïane Osmar R., Mou Lili, Li Lei
- Unifying Multimodal Transformer For Bi-directional Image And Text Generation Huang Yupan, Xue Hongwei, Liu Bei, Lu Yutong
- DS-TOD: Efficient Domain Specialization For Task Oriented Dialog Hung Chia-chien, Lauscher Anne, Ponzetto Simone Paolo, Glavaš Goran
- Improving Non-autoregressive Generation With Mixup Training Jiang Ting, Huang Shaohan, Zhang Zihan, Wang Deqing, Zhuang Fuzhen, Wei Furu, Huang Haizhen, Zhang Liangjie, Zhang Qi
- Inducing Transformer's Compositional Generalization Ability Via Auxiliary Sequence Prediction Tasks Jiang Yichen, Bansal Mohit
- A Good Prompt Is Worth Millions Of Parameters: Low-resource Prompt-based Learning For Vision-language Models Jin Woojeong, Cheng Yu, Shen Yelong, Chen Weizhu, Ren Xiang
- Rethinking Self-supervision Objectives For Generalizable Coherence Modeling Jwalapuram Prathyusha, Joty Shafiq, Lin Xiang
- Nmt5 -- Is Parallel Data Still Relevant For Pre-training Massively Multilingual Language Models? Kale Mihir, Siddhant Aditya, Constant Noah, Johnson Melvin, Al-rfou Rami, Xue Linting
- Text-free Prosody-aware Generative Spoken Language Modeling Kharitonov Eugene, Lee Ann, Polyak Adam, Adi Yossi, Copet Jade, Lakhotia Kushal, Nguyen Tu-anh, Rivière Morgane, Mohamed Abdelrahman, Dupoux Emmanuel, Hsu Wei-ning
- When Attention Meets Fast Recurrence: Training Language Models With Reduced Compute Lei Tao
- VX2TEXT: End-to-end Learning Of Video-based Text Generation From Multimodal Inputs Lin Xudong, Bertasius Gedas, Wang Jue, Chang Shih-fu, Parikh Devi, Torresani Lorenzo
- Dexperts: Decoding-time Controlled Text Generation With Experts And Anti-experts Liu Alisa, Sap Maarten, Lu Ximing, Swayamdipta Swabha, Bhagavatula Chandra, Smith Noah A., Choi Yejin
- Towards Automated Psychotherapy Via Language Modeling Liu Houjun
- Unified Multimodal Pre-training And Prompt-based Tuning For Vision-language Understanding And Generation Liu Tianyi, Wu Zuxuan, Xiong Wenhan, Chen Jingjing, Jiang Yu-gang
- A Token-level Reference-free Hallucination Detection Benchmark For Free-form Text Generation Liu Tianyu, Zhang Yizhe, Brockett Chris, Mao Yi, Sui Zhifang, Chen Weizhu, Dolan Bill
- Enriching Non-autoregressive Transformer With Syntactic And Semantic Structures For Neural Machine Translation Liu Ye, Wan Yao, Zhang Jian-guo, Zhao Wenting, Yu Philip S.
- ∞-former: Infinite Memory Transformer Martins Pedro Henrique, Marinho Zita, Martins André F. T.
- How Much Do Language Models Copy From Their Training Data? Evaluating Linguistic Novelty In Text Generation Using RAVEN Mccoy R. Thomas, Smolensky Paul, Linzen Tal, Gao Jianfeng, Celikyilmaz Asli
- Improving Compositional Generalization With Self-training For Data-to-text Generation Mehta Sanket Vaibhav, Rao Jinfeng, Tay Yi, Kale Mihir, Parikh Ankur P., Strubell Emma
- Recent Advances In Natural Language Processing Via Large Pre-trained Language Models: A Survey Min Bonan, Ross Hayley, Sulem Elior, Veyseh Amir Pouran Ben, Nguyen Thien Huu, Sainz Oscar, Agirre Eneko, Heinz Ilana, Roth Dan
- Hierarchical Transformers Are More Efficient Language Models Nawrot Piotr, Tworkowski Szymon, Tyrolski Michał, Kaiser Łukasz, Wu Yuhuai, Szegedy Christian, Michalewski Henryk
- MLP Architectures For Vision-and-language Modeling: An Empirical Study Nie Yixin, Li Linjie, Gan Zhe, Wang Shuohang, Zhu Chenguang, Zeng Michael, Liu Zicheng, Bansal Mohit, Wang Lijuan
- Do Encoder Representations Of Generative Dialogue Models Encode Sufficient Information About The Task? Parthasarathi Prasanna, Pineau Joelle, Chandar Sarath
- ABC: Attention With Bounded-memory Control Peng Hao, Kasai Jungo, Pappas Nikolaos, Yogatama Dani, Wu Zhaofeng, Kong Lingpeng, Schwartz Roy, Smith Noah A.
- Focused Attention Improves Document-grounded Generation Prabhumoye Shrimai, Hashimoto Kazuma, Zhou Yingbo, Black Alan W, Salakhutdinov Ruslan
- Winner Team Mia At Textvqa Challenge 2021: Vision-and-language Representation Learning With Pre-trained Sequence-to-sequence Model Qiao Yixuan, Chen Hao, Wang Jun, Chen Yihao, Ye Xianbin, Li Ziliang, Qi Xianbiao, Gao Peng, Xie Guotong
- Tailor: Generating And Perturbing Text With Semantic Controls Ross Alexis, Wu Tongshuang, Peng Hao, Peters Matthew E., Gardner Matt
- CPT: A Pre-trained Unbalanced Transformer For Both Chinese Language Understanding And Generation Shao Yunfan, Geng Zhichao, Liu Yitao, Dai Junqi, Yan Hang, Yang Fei, Zhe Li, Bao Hujun, Qiu Xipeng
- Layer-wise Pruning Of Transformer Attention Heads For Efficient Language Modeling Shim Kyuhong, Choi Iksoo, Sung Wonyong, Choi Jungwook
- ERNIE 3.0 Titan: Exploring Larger-scale Knowledge Enhanced Pre-training For Language Understanding And Generation Shuohuan Wang, Yu Sun, Yang Xiang, Zhihua Wu, Siyu Ding, Weibao Gong, Shikun Feng, Junyuan Shang, Yanbin Zhao, Chao Pang, Jiaxiang Liu, Xuyi Chen, Yuxiang Lu, Weixin Liu, Xi Wang, Yangfan Bai, Qiuliang Chen, Li Zhao, Shiyong Li, Peng Sun, Dianhai Yu, Yanjun Ma, Hao Tian, Hua Wu, Tian Wu, Wei Zeng, Ge Li, Wen Gao, Haifeng Wang
- Primer: Searching For Efficient Transformers For Language Modeling So David R., Mańke Wojciech, Liu Hanxiao, Dai Zihang, Shazeer Noam, Le Quoc V.
- Corelm: Coreference-aware Language Model Fine-tuning Stylianou Nikolaos, Vlahavas Ioannis
- Non-autoregressive Text Generation With Pre-trained Language Models Su Yixuan, Cai Deng, Wang Yan, Vandyke David, Baker Simon, Li Piji, Collier Nigel
- Towards Reinforcement Learning For Pivot-based Neural Machine Translation With Non-autoregressive Transformer Tokarchuk Evgeniia, Rosendahl Jan, Wang Weiyue, Petrushkov Pavel, Lancewicki Tomer, Khadivi Shahram, Ney Hermann
- Exploring Transformers In Natural Language Generation: GPT, BERT, And Xlnet Topal M. Onat, Bas Anil, Van Heerden Imke
- Local Explanation Of Dialogue Response Generation Tuan Yi-lin, Pryor Connor, Chen Wenhu, Getoor Lise, Wang William Yang
- Bertnesia: Investigating The Capture And Forgetting Of Knowledge In BERT Wallat Jonas, Singh Jaspreet, Anand Avishek
- Contextualized Scene Imagination For Generative Commonsense Reasoning Wang Peifeng, Zamora Jonathan, Liu Junfeng, Ilievski Filip, Chen Muhao, Ren Xiang
- Sentence Semantic Regression For Text Generation Wang Wei, Li Piji, Zheng Hai-tao
- Prefix-tuning: Optimizing Continuous Prompts For Generation Xiang Lisa Li, Percy Liang
- Turning Tables: Generating Examples From Semi-structured Tables For Endowing Language Models With Reasoning Skills Yoran Ori, Talmor Alon, Berant Jonathan
- Topical Language Generation Using Transformers Zandie Rohola, Mahoor Mohammad H.
- Ernie-vilg: Unified Generative Pre-training For Bidirectional Vision-language Generation Zhang Han, Yin Weichong, Fang Yewei, Li Lanxin, Duan Boqiang, Wu Zhihua, Sun Yu, Tian Hao, Wu Hua, Wang Haifeng
- DSGPT: Domain-specific Generative Pre-training Of Transformers For Text Generation In E-commerce Title And Review Summarization Zhang Xueying, Jiang Yunjiang, Shang Yue, Cheng Zhaomeng, Zhang Chi, Fan Xiaochuan, Xiao Yun, Long Bo
- Retgen: A Joint Framework For Retrieval And Grounded Text Generation Modeling Zhang Yizhe, Sun Siqi, Gao Xiang, Fang Yuwei, Brockett Chris, Galley Michel, Gao Jianfeng, Dolan Bill
- Towards More Efficient Insertion Transformer With Fractional Positional Encoding Zhang Zhisong, Zhang Yizhe, Dolan Bill
- Thinking Clearly, Talking Fast: Concept-guided Non-autoregressive Generation For Open-domain Dialogue Systems Zou Yicheng, Liu Zhihua, Hu Xingwu, Zhang Qi
- Palm: Scaling Language Modeling With Pathways Aakanksha Chowdhery, Sharan Narang, Jacob Devlin, Maarten Bosma, Gaurav Mishra, Adam Roberts, Paul Barham, Hyung Won Chung, Charles Sutton, Sebastian Gehrmann, Parker Schuh, Kensen Shi, Sasha Tsvyashchenko, Joshua Maynez, Abhishek Rao, Parker Barnes, Yi Tay, Noam Shazeer, Vinodkumar Prabhakaran, Emily Reif, Nan Du, Ben Hutchinson, Reiner Pope, James Bradbury, Jacob Austin, Michael Isard, Guy Gur-ari, Pengcheng Yin, Toju Duke, Anselm Levskaya, Sanjay Ghemawat, Sunipa Dev, Henryk Michalewski, Xavier Garcia, Vedant Misra, Kevin Robinson, Liam Fedus, Denny Zhou, Daphne Ippolito, David Luan, Hyeontaek Lim, Barret Zoph, Alexander Spiridonov, Ryan Sepassi, David Dohan, Shivani Agrawal, Mark Omernick, Andrew M. Dai, Thanumalayan Sankaranarayana Pillai, Marie Pellat, Aitor Lewkowycz, Erica Moreira, Rewon Child, Oleksandr Polozov, Katherine Lee, Zongwei Zhou, Xuezhi Wang, Brennan Saeta, Mark Diaz, Orhan Firat, Michele Catasta, Jason Wei, Kathy Meier-hellstern, Douglas Eck, Jeff Dean, Slav Petrov, Noah Fiedel
- Global Memory Transformer For Processing Long Documents Adel Arij Al
- How Does The Pre-training Objective Affect What Large Language Models Learn About Linguistic Properties? Alajrami Ahmed, Aletras Nikolaos
- Text And Patterns: For Effective Chain Of Thought, It Takes Two To Tango Aman Madaan, Amir Yazdanbakhsh
- Vl-beit: Generative Vision-language Pretraining Bao Hangbo, Wang Wenhui, Dong Li, Wei Furu
- P³LM: Probabilistically Permuted Prophet Language Modeling For Generative Pre-training Bao Junwei, Wang Yifan, Ying Jiangyong, Gong Yeyun, Zhao Jing, Wu Youzheng, He Xiaodong
- Latent-glat: Glancing At Latent Variables For Parallel Text Generation Bao Yu, Zhou Hao, Huang Shujian, Wang Dongqi, Qian Lihua, Dai Xinyu, Chen Jiajun, Li Lei
- Banglanlg And Banglat5: Benchmarks And Resources For Evaluating Low-resource Natural Language Generation In Bangla Bhattacharjee Abhik, Hasan Tahmid, Ahmad Wasi Uddin, Shahriyar Rifat
- Ernie-code: Beyond English-centric Cross-lingual Pretraining For Programming Languages Chai Yekun, Wang Shuohuan, Pang Chao, Sun Yu, Tian Hao, Wu Hua
- Controllable Text Generation With Language Constraints Chen Howard, Li Huihan, Chen Danqi, Narasimhan Karthik
- Leveraging Natural Supervision For Language Representation Learning And Generation Chen Mingda
- Adaprompt: Adaptive Model Training For Prompt-based NLP Chen Yulong, Liu Yang, Dong Li, Wang Shuohang, Zhu Chenguang, Zeng Michael, Zhang Yue
- Capture Salient Historical Information: A Fast And Accurate Non-autoregressive Model For Multi-turn Spoken Language Understanding Cheng Lizhi, Jia Weijia, Yang Wenmian
- Meta-learning Fast Weight Language Models Clark Kevin, Guu Kelvin, Chang Ming-wei, Pasupat Panupong, Hinton Geoffrey, Norouzi Mohammad
- Neural Knowledge Bank For Pretrained Transformers Dai Damai, Jiang Wenbin, Dong Qingxiu, Lyu Yajuan, She Qiaoqiao, Sui Zhifang
- Understanding BLOOM: An Empirical Study On Diverse NLP Tasks Dakle Parag Pravin, Rallabandi Saikrishna, Raghavan Preethi
- Write And Paint: Generative Vision-language Models Are Unified Modal Learners Diao Shizhe, Zhou Wangchunshu, Zhang Xinsong, Wang Jiawei
- Parallel Attention Forcing For Machine Translation Dou Qingyun, Gales Mark
- Hungry Hungry Hippos: Towards Language Modeling With State Space Models Fu Daniel Y., Dao Tri, Saab Khaled K., Thomas Armin W., Rudra Atri, Ré Christopher
- Cramming: Training A Language Model On A Single GPU In One Day Geiping Jonas, Goldstein Tom
- Microbert: Effective Training Of Low-resource Monolingual Berts Through Parameter Reduction And Multitask Learning Gessler Luke, Zeldes Amir
- Diffuseq: Sequence To Sequence Text Generation With Diffusion Models Gong Shansan, Li Mukai, Feng Jiangtao, Wu Zhiyong, Kong Lingpeng
- MALM: Mixing Augmented Language Modeling For Zero-shot Machine Translation Gupta Kshitij
- SSD-LM: Semi-autoregressive Simplex-based Diffusion Language Model For Text Generation And Modular Control Han Xiaochuang, Kumar Sachin, Tsvetkov Yulia
- Z-code++: A Pre-trained Language Model Optimized For Abstractive Summarization He Pengcheng, Peng Baolin, Lu Liyang, Wang Song, Mei Jie, Liu Yang, Xu Ruochen, Awadalla Hany Hassan, Shi Yu, Zhu Chenguang, Xiong Wayne, Zeng Michael, Gao Jianfeng, Huang Xuedong
- VLMAE: Vision-language Masked Autoencoder He Sunan, Guo Taian, Dai Tao, Qiao Ruizhi, Wu Chen, Shu Xiujun, Ren Bo
- SPACE-3: Unified Dialog Model Pre-training For Task-oriented Dialog Understanding And Generation He Wanwei, Dai Yinpei, Yang Min, Sun Jian, Huang Fei, Si Luo, Li Yongbin
- TRUE: Re-evaluating Factual Consistency Evaluation Honovich Or, Aharoni Roee, Herzig Jonathan, Taitelbaum Hagai, Kukliansy Doron, Cohen Vered, Scialom Thomas, Szpektor Idan, Hassidim Avinatan, Matias Yossi
- PLANET: Dynamic Content Planning In Autoregressive Transformers For Long-form Text Generation Hu Zhe, Chan Hou Pong, Liu Jiachen, Xiao Xinyan, Wu Hua, Huang Lifu
- Transformer Quality In Linear Time Hua Weizhe, Dai Zihang, Liu Hanxiao, Le Quoc V.
- Layoutlmv3: Pre-training For Document AI With Unified Text And Image Masking Huang Yupan, Lv Tengchao, Cui Lei, Lu Yutong, Wei Furu
- Uniform Complexity For Text Generation Imperial Joseph Marvin, Madabushi Harish Tayyar
- Litetransformersearch: Training-free Neural Architecture Search For Efficient Language Models Javaheripi Mojan, De Rosa Gustavo H., Mukherjee Subhabrata, Shah Shital, Religa Tomasz L., Mendes Caio C. T., Bubeck Sebastien, Koushanfar Farinaz, Dey Debadeepta
- Lamemo: Language Modeling With Look-ahead Memory Ji Haozhe, Zhang Rongsheng, Yang Zhenyu, Hu Zhipeng, Huang Minlie
- Survey Of Hallucination In Natural Language Generation Ji Ziwei, Lee Nayeon, Frieske Rita, Yu Tiezheng, Su Dan, Xu Yan, Ishii Etsuko, Bang Yejin, Chen Delong, Dai Wenliang, Chan Ho Shu, Madotto Andrea, Fung Pascale
- Contrastive Learning For Prompt-based Few-shot Language Learners Jian Yiren, Gao Chongyang, Vosoughi Soroush
- A Simple Contrastive Learning Objective For Alleviating Neural Text Degeneration Jiang Shaojie, Zhang Ruqing, Vakulenko Svitlana, De Rijke Maarten
- Latency Adjustable Transformer Encoder For Language Understanding Kachuee Sajjad, Sharifkhani Mohammad
- What Makes Data-to-text Generation Hard For Pretrained Language Models? Keymanesh Moniba, Benton Adrian, Dredze Mark
- High Recall Data-to-text Generation With Progressive Edit Kim Choonghan, Lee Gary Geunbae
- Language Detoxification With Attribute-discriminative Latent Space Kwak Jin Myung, Kim Minseon, Hwang Sung Ju
- Bloom Library: Multimodal Datasets In 300+ Languages For A Variety Of Downstream Tasks Leong Colin, Nemecek Joshua, Mansdorfer Jacob, Filighera Anna, Owodunni Abraham, Whitenack Daniel
- Fast Inference From Transformers Via Speculative Decoding Leviathan Yaniv, Kalman Matan, Matias Yossi
- A Survey On Retrieval-augmented Text Generation Li Huayang, Su Yixuan, Cai Deng, Wang Yan, Liu Lemao
- Learning To Transfer Prompts For Text Generation Li Junyi, Tang Tianyi, Nie Jian-yun, Wen Ji-rong, Zhao Wayne Xin
- ELMER: A Non-autoregressive Pre-trained Language Model For Efficient And Effective Text Generation Li Junyi, Tang Tianyi, Zhao Wayne Xin, Nie Jian-yun, Wen Ji-rong
- LAVENDER: Unifying Video-language Understanding As Masked Language Modeling Li Linjie, Gan Zhe, Lin Kevin, Lin Chung-ching, Liu Zicheng, Liu Ce, Wang Lijuan
- Pre-trained Language Models For Interactive Decision-making Li Shuang, Puig Xavier, Paxton Chris, Du Yilun, Wang Clinton, Fan Linxi, Chen Tao, Huang De-an, Akyürek Ekin, Anandkumar Anima, Andreas Jacob, Mordatch Igor, Torralba Antonio, Zhu Yuke
- Evade The Trap Of Mediocrity: Promoting Diversity And Novelty In Text Generation Via Concentrating Attention Li Wenhao, Yi Xiaoyuan, Hu Jinyi, Sun Maosong, Xie Xing
- Uni-eden: Universal Encoder-decoder Network By Multi-granular Vision-language Pre-training Li Yehao, Fan Jiahao, Pan Yingwei, Yao Ting, Lin Weiyao, Mei Tao
- Life After BERT: What Do Other Muppets Understand About Language? Lialin Vladislav, Zhao Kevin, Shivagunde Namrata, Rumshisky Anna
- Same Pre-training Loss, Better Downstream: Implicit Bias Matters For Language Models Liu Hong, Xie Sang Michael, Li Zhiyuan, Ma Tengyu
- Relational Memory Augmented Language Models Liu Qi, Yogatama Dani, Blunsom Phil
- Declaration-based Prompt Tuning For Visual Question Answering Liu Yuhang, Wei Wei, Peng Daowan, Zhu Feida
- Reacc: A Retrieval-augmented Code Completion Framework Lu Shuai, Duan Nan, Han Hojae, Guo Daya, Hwang Seung-won, Svyatkovskiy Alexey
- Biogpt: Generative Pre-trained Transformer For Biomedical Text Generation And Mining Luo Renqian, Sun Liai, Xia Yingce, Qin Tao, Zhang Sheng, Poon Hoifung, Liu Tie-yan
- A Frustratingly Simple Approach For End-to-end Image Captioning Luo Ziyang, Xi Yadong, Zhang Rongsheng, Ma Jing
- Torchscale: Transformers At Scale Ma Shuming, Wang Hongyu, Huang Shaohan, Wang Wenhui, Chi Zewen, Dong Li, Benhaim Alon, Patra Barun, Chaudhary Vishrav, Song Xia, Wei Furu
- Text Generation With Text-editing Models Malmi Eric, Dong Yue, Mallinson Jonathan, Chuklin Aleksandr, Adamek Jakub, Mirylenka Daniil, Stahlberg Felix, Krause Sebastian, Kumar Shankar, Severyn Aliaksei
- DIALOG-22 Ruatd Generated Text Detection Maloyan Narek, Nutfullin Bulat, Ilyushin Eugene
- Generating Training Data With Language Models: Towards Zero-shot Language Understanding Meng Yu, Huang Jiaxin, Zhang Yu, Han Jiawei
- Domain-specific Text Generation For Machine Translation Moslem Yasmin, Haque Rejwanul, Kelleher John D., Way Andy
- JASMINE: Arabic GPT Models For Few-shot Learning Nagoudi El Moatez Billah, Abdul-mageed Muhammad, Elmadany Abdelrahim, Inciarte Alcides Alcoba, Khondaker Md Tawkat Islam
- Do Vision-and-language Transformers Learn Grounded Predicate-noun Dependencies? Nikolaus Mitja, Salin Emmanuelle, Ayache Stephane, Fourtassi Abdellah, Favre Benoit
- The Turing Deception Noever David, Ciolino Matt
- Bidirectional Language Models Are Also Few-shot Learners Patel Ajay, Li Bryan, Rasooli Mohammad Sadegh, Constant Noah, Raffel Colin, Callison-burch Chris
- Xfboost: Improving Text Generation With Controllable Decoders Peng Xiangyu, Sollami Michael
- Vit5: Pretrained Text-to-text Transformer For Vietnamese Language Generation Phan Long, Tran Hieu, Nguyen Hieu, Trinh Trieu H.
- A Self-paced Mixed Distillation Method For Non-autoregressive Generation Qi Weizhen, Gong Yeyun, Shen Yelong, Jiao Jian, Yan Yu, Li Houqiang, Zhang Ruofei, Chen Weizhu, Duan Nan
- Diffusion Glancing Transformer For Parallel Sequence To Sequence Learning Qian Lihua, Wang Mingxuan, Liu Yang, Zhou Hao
- Alexatm 20B: Few-shot Learning Using A Large-scale Multilingual Seq2seq Model Saleh Soltan, Shankar Ananthakrishnan, Jack Fitzgerald, Rahul Gupta, Wael Hamza, Haidar Khan, Charith Peris, Stephen Rawls, Andy Rosenbaum, Anna Rumshisky, Chandana Satya Prakash, Mukund Sridhar, Fabian Triefenbach, Apurv Verma, Gokhan Tur, Prem Natarajan
- APOLLO: A Simple Approach For Adaptive Pretraining Of Language Models For Logical Reasoning Sanyal Soumya, Xu Yichong, Wang Shuohang, Yang Ziyi, Pryzant Reid, Yu Wenhao, Zhu Chenguang, Ren Xiang
- Transformer Grammars: Augmenting Transformer Language Models With Syntactic Inductive Biases At Scale Sartran Laurent, Barrett Samuel, Kuncoro Adhiguna, Stanojević Miloš, Blunsom Phil, Dyer Chris
- Confident Adaptive Language Modeling Schuster Tal, Fisch Adam, Gupta Jai, Dehghani Mostafa, Bahri Dara, Tran Vinh Q., Tay Yi, Metzler Donald
- On The Effect Of Pretraining Corpora On In-context Learning By A Large-scale Language Model Seongjin Shin, Sang-woo Lee, Hwijeen Ahn, Sungdong Kim, Hyoungseok Kim, Boseop Kim, Kyunghyun Cho, Gichang Lee, Woomyoung Park, Jung-woo Ha, Nako Sung
- Bird-eye Transformers For Text Generation Models Sha Lei, Song Yuhang, Yordanov Yordan, Salvatori Tommaso, Lukasiewicz Thomas
- Gpt-neox-20b: An Open-source Autoregressive Language Model Sid Black, Stella Biderman, Eric Hallahan, Quentin Anthony, Leo Gao, Laurence Golding, Horace He, Connor Leahy, Kyle Mcdonell, Jason Phang, Michael Pieler, Usvsn Sai Prashanth, Shivanshu Purohit, Laria Reynolds, Jonathan Tow, Ben Wang, Samuel Weinbach
- Human Language Modeling Soni Nikita, Matero Matthew, Balasubramanian Niranjan, Schwartz H. Andrew
- A Length-extrapolatable Transformer Sun Yutao, Dong Li, Patra Barun, Ma Shuming, Huang Shaohan, Benhaim Alon, Chaudhary Vishrav, Song Xia, Wei Furu
- Adavae: Exploring Adaptive Gpt-2s In Variational Auto-encoders For Language Modeling Tu Haoqin, Yang Zhongliang, Yang Jinshuai, Huang Yongfeng
- Will We Run Out Of Data? Limits Of LLM Scaling Based On Human-generated Data Villalobos Pablo, Ho Anson, Sevilla Jaime, Besiroglu Tamay, Heim Lennart, Hobbhahn Marius
- How Large Language Models Are Transforming Machine-paraphrased Plagiarism Wahle Jan Philip, Ruas Terry, Kirstein Frederic, Gipp Bela
- What Language Model Architecture And Pretraining Objective Work Best For Zero-shot Generalization? Wang Thomas, Roberts Adam, Hesslow Daniel, Scao Teven Le, Chung Hyung Won, Beltagy Iz, Launay Julien, Raffel Colin
- Visually-augmented Language Modeling Wang Weizhi, Dong Li, Cheng Hao, Song Haoyu, Liu Xiaodong, Yan Xifeng, Gao Jianfeng, Wei Furu
- Compilable Neural Code Generation With Compiler Feedback Wang Xin, Wang Yasheng, Wan Yao, Mi Fei, Li Yitong, Zhou Pingyi, Liu Jin, Wu Hao, Jiang Xin, Liu Qun
- CREATER: Ctr-driven Advertising Text Generation With Controlled Pre-training And Contrastive Fine-tuning Wei Penghui, Yang Xuanhua, Liu Shaoguo, Wang Liang, Zheng Bo
- Stateful Memory-augmented Transformers For Efficient Dialogue Modeling Wu Qingyang, Yu Zhou
- Precisely The Point: Adversarial Augmentations For Faithful And Informative Text Generation Wu Wenhao, Li Wei, Liu Jiachen, Xiao Xinyan, Li Sujian, Lyu Yajuan
- Memorizing Transformers Wu Yuhuai, Rabe Markus N., Hutchins Delesley, Szegedy Christian
- Go-tuning: Improving Zero-shot Learning Abilities Of Smaller Language Models Xu Jingjing, Dong Qingxiu, Liu Hongyi, Li Lei
- SESCORE2: Learning Text Generation Evaluation Via Synthesizing Realistic Mistakes Xu Wenda, Qian Xian, Wang Mingxuan, Li Lei, Wang William Yang
- Zero-shot Video Question Answering Via Frozen Bidirectional Language Models Yang Antoine, Miech Antoine, Sivic Josef, Laptev Ivan, Schmid Cordelia
- Prompt Tuning For Discriminative Pre-trained Language Models Yao Yuan, Dong Bowen, Zhang Ao, Zhang Zhengyan, Xie Ruobing, Liu Zhiyuan, Lin Leyu, Sun Maosong, Wang Jianyong
- Language Models Are General-purpose Interfaces Yaru Hao, Haoyu Song, Li Dong, Shaohan Huang, Zewen Chi, Wenhui Wang, Shuming Ma, Furu Wei
- Retrieval-augmented Multimodal Language Modeling Yasunaga Michihiro, Aghajanyan Armen, Shi Weijia, James Rich, Leskovec Jure, Liang Percy, Lewis Mike, Zettlemoyer Luke, Yih Wen-tau
- Modeling Multi-hop Question Answering As Single Sequence Prediction Yavuz Semih, Hashimoto Kazuma, Zhou Yingbo, Keskar Nitish Shirish, Xiong Caiming
- Scaling Autoregressive Models For Content-rich Text-to-image Generation Yu Jiahui, Xu Yuanzhong, Koh Jing Yu, Luong Thang, Baid Gunjan, Wang Zirui, Vasudevan Vijay, Ku Alexander, Yang Yinfei, Ayan Burcu Karagol, Hutchinson Ben, Han Wei, Parekh Zarana, Li Xin, Zhang Han, Baldridge Jason, Wu Yonghui
- Efficient Language Modeling With Sparse All-mlp Yu Ping, Artetxe Mikel, Ott Myle, Shleifer Sam, Gong Hongyu, Stoyanov Ves, Li Xian
- Bridging Cross-lingual Gaps During Leveraging The Multilingual Sequence-to-sequence Pretraining For Text Generation And Understanding Zan Changtong, Ding Liang, Shen Li, Cao Yu, Liu Weifeng, Tao Dacheng
- Better Pre-training By Reducing Representation Confusion Zhang Haojie, Liang Mingfei, Xie Ruobing, Sun Zhenlong, Zhang Bo, Lin Leyu
- Can Offline Reinforcement Learning Help Natural Language Understanding? Zhang Ziqi, Wang Yile, Zhang Yue, Wang Donglin
- Attend, Memorize And Generate: Towards Faithful Table-to-text Generation In Few Shots Zhao Wenting, Liu Ye, Wan Yao, Yu Philip S.
- Training Language Models With Memory Augmentation Zhong Zexuan, Lei Tao, Chen Danqi
- Efficient Long Sequence Modeling Via State Space Augmented Transformer Zuo Simiao, Liu Xiaodong, Jiao Jian, Charles Denis, Manavoglu Eren, Zhao Tuo, Gao Jianfeng
- Performance Trade-offs Of Watermarking Large Language Models Ajith Anirudh, Singh Sameer, Pruthi Danish
- Pixt3: Pixel-based Table-to-text Generation Alonso Iñigo, Agirre Eneko, Lapata Mirella
- Openleaf: Open-domain Interleaved Image-text Generation And Evaluation An Jie, Yang Zhengyuan, Li Linjie, Wang Jianfeng, Lin Kevin, Liu Zicheng, Wang Lijuan, Luo Jiebo
- Augmenting Llms With Knowledge: A Survey On Hallucination Prevention Andriopoulos Konstantinos, Pouwelse Johan
- Using Large Language Models For Zero-shot Natural Language Generation From Knowledge Graphs Axelsson Agnes, Skantze Gabriel
- Scaling Transformer To 1M Tokens And Beyond With RMT Aydar Bulatov, Yuri Kuratov, Yermek Kapushev, Mikhail S. Burtsev
- Tamil-llama: A New Tamil Language Model Based On Llama 2 Balachandran Abhinand
- Llamantino: Llama 2 Models For Effective Text Generation In Italian Language Basile Pierpaolo, Musacchio Elio, Polignano Marco, Siciliani Lucia, Fiameni Giuseppe, Semeraro Giovanni
- Labrador: Exploring The Limits Of Masked Language Modeling For Laboratory Data Bellamy David R., Kumar Bhawesh, Wang Cindy, Beam Andrew
- Prompt Sketching For Large Language Models Beurer-kellner Luca, Müller Mark Niklas, Fischer Marc, Vechev Martin
- Helm: Highlighted Evidence Augmented Language Model For Enhanced Table-to-text Generation Bian Junyi, Qin Xiaolei, Zou Wuhe, Huang Mengzuo, Luo Congyi, Zhang Ke, Zhang Weidong
- Insights Into Classifying And Mitigating Llms' Hallucinations Bruno Alessandro, Mazzeo Pier Luigi, Chetouani Aladine, Tliba Marouane, Kerkouri Mohamed Amine
- Revealing The Structure Of Language Model Capabilities Burnell Ryan, Hao Han, Conway Andrew R. A., Orallo Jose Hernandez
- Peek Across: Improving Multi-document Modeling Via Cross-document Question-answering Caciularu Avi, Peters Matthew E., Goldberger Jacob, Dagan Ido, Cohan Arman
- On The Application Of Large Language Models For Language Teaching And Assessment Technology Caines Andrew, Benedetto Luca, Taslimipoor Shiva, Davis Christopher, Gao Yuan, Andersen Oeistein, Yuan Zheng, Elliott Mark, Moore Russell, Bryant Christopher, Rei Marek, Yannakoudakis Helen, Mullooly Andrew, Nicholls Diane, Buttery Paula
- Successor Features For Efficient Multisubject Controlled Text Generation Cao Meng, Fatemi Mehdi, Cheung Jackie Chi Kit, Shabanian Samira
- A LLM Assisted Exploitation Of Ai-guardian Carlini Nicholas
- Chatgpt To Replace Crowdsourcing Of Paraphrases For Intent Classification: Higher Diversity And Comparable Model Robustness Cegin Jan, Simko Jakub, Brusilovsky Peter
- Muse: Text-to-image Generation Via Masked Generative Transformers Chang Huiwen, Zhang Han, Barber Jarred, Maschinot Aj, Lezama Jose, Jiang Lu, Yang Ming-hsuan, Murphy Kevin, Freeman William T., Rubinstein Michael, Li Yuanzhen, Krishnan Dilip
- Learning To Generate Better Than Your LLM Chang Jonathan D., Brantley Kiante, Ramamurthy Rajkumar, Misra Dipendra, Sun Wen
- Dynamic Strategy Chain: Dynamic Zero-shot Cot For Long Mental Health Support Generation Chen Qi, Liu Dexi
- Extending Context Window Of Large Language Models Via Positional Interpolation Chen Shouyuan, Wong Sherman, Chen Liangjian, Tian Yuandong
- Evaluating, Understanding, And Improving Constrained Text Generation For Large Language Models Chen Xiang, Wan Xiaojun
- Token Prediction As Implicit Classification To Identify Llm-generated Text Chen Yutian, Kang Hao, Zhai Vivian, Li Liangze, Singh Rita, Raj Bhiksha
- Lift Yourself Up: Retrieval-augmented Text Generation With Self Memory Cheng Xin, Luo Di, Chen Xiuying, Liu Lemao, Zhao Dongyan, Yan Rui
- Neural Codec Language Models Are Zero-shot Text To Speech Synthesizers Chengyi Wang, Sanyuan Chen, Yu Wu, Ziqiang Zhang, Long Zhou, Shujie Liu, Zhuo Chen, Yanqing Liu, Huaming Wang, Jinyu Li, Lei He, Sheng Zhao, Furu Wei
- Attention Alignment And Flexible Positional Embeddings Improve Transformer Length Extrapolation Chi Ta-chung, Fan Ting-han, Rudnicky Alexander I.
- Increasing Diversity While Maintaining Accuracy: Text Data Generation With Large Language Models And Human Interventions Chung John Joon Young, Kamar Ece, Amershi Saleema
- Flashattention-2: Faster Attention With Better Parallelism And Work Partitioning Dao Tri
- Divergent Token Metrics: Measuring Degradation To Prune Away LLM Components -- And Optimize Quantization Deiseroth Björn, Meuer Max, Gritsch Nikolas, Eichenberg Constantin, Schramowski Patrick, Aßenmacher Matthias, Kersting Kristian
- Speak While You Think: Streaming Speech Synthesis During Text Generation Dekel Avihu, Shechtman Slava, Fernandez Raul, Haws David, Kons Zvi, Hoory Ron
- Controlled Text Generation Via Language Model Arithmetic Dekoninck Jasper, Fischer Marc, Beurer-kellner Luca, Vechev Martin
- Skipdecode: Autoregressive Skip Decoding With Batching And Caching For Efficient LLM Inference Del Corro Luciano, Del Giorno Allie, Agarwal Sahaj, Yu Bin, Awadallah Ahmed, Mukherjee Subhabrata
- Jump To Conclusions: Short-cutting Transformers With Linear Transformations Din Alexander Yom, Karidi Taelin, Choshen Leshem, Geva Mor
- Mapping The Design Space Of Interactions In Human-ai Text Co-creation Tasks Ding Zijian, Chan Joel
- Probing Explicit And Implicit Gender Bias Through LLM Conditional Text Generation Dong Xiangjue, Wang Yibo, Yu Philip S., Caverlee James
- BAMBOO: A Comprehensive Benchmark For Evaluating Long Text Modeling Capacities Of Large Language Models Dong Zican, Tang Tianyi, Li Junyi, Zhao Wayne Xin, Wen Ji-rong
- Stack Attention: Improving The Ability Of Transformers To Model Hierarchical Patterns Dusell Brian, Chiang David
- How Useful Are Educational Questions Generated By Large Language Models? Elkins Sabina, Kochmar Ekaterina, Cheung Jackie C. K., Serban Iulian
- Textgenshap: Scalable Post-hoc Explanations In Text Generation With Long Documents Enouen James, Nakhost Hootan, Ebrahimi Sayna, Arik Sercan O, Liu Yan, Pfister Tomas
- Jacotext: A Pretrained Model For Java Code-text Generation Espejel Jessica López, Alassan Mahaman Sanoussi Yahaya, Dahhane Walid, Ettifouri El Hassane
- Lm-polygraph: Uncertainty Estimation For Language Models Fadeeva Ekaterina, Vashurin Roman, Tsvigun Akim, Vazhentsev Artem, Petrakov Sergey, Fedyanin Kirill, Vasilev Daniil, Goncharova Elizaveta, Panchenko Alexander, Panov Maxim, Baldwin Timothy, Shelmanov Artem
- An Empirical Study Of Multitask Learning To Improve Open Domain Dialogue Systems Farahani Mehrdad, Johansson Richard
- Transferable Decoding With Visual Entities For Zero-shot Image Captioning Fei Junjie, Wang Teng, Zhang Jinrui, He Zhenyu, Wang Chengjie, Zheng Feng
- Extending Context Window Of Large Language Models Via Semantic Compression Fei Weizhi, Niu Xueyan, Zhou Pingyi, Hou Lu, Bai Bo, Deng Lei, Han Wei
- Improving Factual Consistency Of Text Summarization By Adversarially Decoupling Comprehension And Embellishment Abilities Of Llms Feng Huawen, Fan Yan, Liu Xiong, Lin Ting-en, Yao Zekun, Wu Yuchuan, Huang Fei, Li Yongbin, Ma Qianli
- Chessgpt: Bridging Policy Learning And Language Modeling Feng Xidong, Luo Yicheng, Wang Ziyan, Tang Hongrui, Yang Mengyue, Shao Kun, Mguni David, Du Yali, Wang Jun
- Medalign: A Clinician-generated Dataset For Instruction Following With Electronic Medical Records Fleming Scott L., Lozano Alejandro, Haberkorn William J., Jindal Jenelle A., Reis Eduardo P., Thapa Rahul, Blankemeier Louis, Genkins Julian Z., Steinberg Ethan, Nayak Ashwin, Patel Birju S., Chiang Chia-chun, Callahan Alison, Huo Zepeng, Gatidis Sergios, Adams Scott J., Fayanju Oluseyi, Shah Shreya J., Savage Thomas, Goh Ethan, Chaudhari Akshay S., Aghaeepour Nima, Sharp Christopher, Pfeffer Michael A., Liang Percy, Chen Jonathan H., Morse Keith E., Brunskill Emma P., Fries Jason A., Shah Nigam H.
- In-context Autoencoder For Context Compression In A Large Language Model Ge Tao, Hu Jing, Wang Lei, Wang Xun, Chen Si-qing, Wei Furu
- Do Generative Large Language Models Need Billions Of Parameters? Gholami Sia, Omar Marwan
- Clipsyntel: CLIP And LLM Synergy For Multimodal Question Summarization In Healthcare Ghosh Akash, Acharya Arkadeep, Jain Raghav, Saha Sriparna, Chadha Aman, Sinha Setu
- Improving Input-label Mapping With Demonstration Replay For In-context Learning Gong Zhuocheng, Liu Jiahao, Wang Qifan, Wang Jingang, Cai Xunliang, Zhao Dongyan, Yan Rui
- Lil-bevo: Explorations Of Strategies For Training Language Models In More Humanlike Ways Govindarajan Venkata S, Rodriguez Juan Diego, Bostrom Kaj, Mahowald Kyle
- Augmented Language Models: A Survey Mialon Grégoire, Dessì Roberto, Lomeli Maria, Nalmpantis Christoforos, Pasunuru Ram, Raileanu Roberta, Rozière Baptiste, Schick Timo, Dwivedi-yu Jane, Celikyilmaz Asli, Grave Edouard, Lecun Yann, Scialom Thomas
- Reinforced Self-training (rest) For Language Modeling Gulcehre Caglar, Paine Tom Le, Srinivasan Srivatsan, Konyushkova Ksenia, Weerts Lotte, Sharma Abhishek, Siddhant Aditya, Ahern Alex, Wang Miaosen, Gu Chenjie, Macherey Wolfgang, Doucet Arnaud, Firat Orhan, De Freitas Nando
- David Helps Goliath: Inference-time Collaboration Between Small Specialized And Large General Diffusion Lms Han Xiaochuang, Kumar Sachin, Tsvetkov Yulia, Ghazvininejad Marjan
- Blockwise Parallel Transformer For Large Context Models Liu Hao, Abbeel Pieter
- Test-time Training On Nearest Neighbors For Large Language Models Hardt Moritz, Sun Yu
- Teacherlm: Teaching To Fish Rather Than Giving The Fish, Language Modeling Likewise He Nan, Lai Hanyu, Zhao Chenyang, Cheng Zirui, Pan Junting, Qin Ruoyu, Lu Ruofan, Lu Rui, Zhang Yunchen, Zhao Gangming, Hou Zhaohui, Huang Zhiyuan, Lu Shaoqing, Liang Ding, Zhan Mingjie
- Simplemtod: A Simple Language Model For Multimodal Task-oriented Dialogue With Symbolic Scene Representation Hemanthage Bhathiya, Dondrup Christian, Bartie Phil, Lemon Oliver
- Towards Verifiable Text Generation With Symbolic References Hennigen Lucas Torroba, Shen Shannon, Nrusimha Aniruddha, Gapp Bernhard, Sontag David, Kim Yoon
- Backpack Language Models Hewitt John, Thickstun John, Manning Christopher D., Liang Percy
- Harnessing The Zero-shot Power Of Instruction-tuned Large Language Model In End-to-end Speech Recognition Higuchi Yosuke, Ogawa Tetsuji, Kobayashi Tetsunori
- Meta-learning Online Adaptation Of Language Models Hu Nathan, Mitchell Eric, Manning Christopher D., Finn Chelsea
- Separate The Wheat From The Chaff: Model Deficiency Unlearning Via Parameter-efficient Module Operation Hu Xinshuo, Li Dongfang, Hu Baotian, Zheng Zihao, Liu Zhenyu, Zhang Min
- Directed Acyclic Transformer Pre-training For High-quality Non-autoregressive Text Generation Huang Fei, Ke Pei, Huang Minlie
- Large Language Models Cannot Self-correct Reasoning Yet Huang Jie, Chen Xinyun, Mishra Swaroop, Zheng Huaixiu Steven, Yu Adams Wei, Song Xinying, Zhou Denny
- Grounded Decoding: Guiding Text Generation With Grounded Models For Embodied Agents Huang Wenlong, Xia Fei, Shah Dhruv, Driess Danny, Zeng Andy, Lu Yao, Florence Pete, Mordatch Igor, Levine Sergey, Hausman Karol, Ichter Brian
- Large Language Models As Sous Chefs: Revising Recipes With GPT-3 Hwang Alyssa, Li Bryan, Hou Zhaoyi, Roth Dan
- Dag LLM Ver 1.0: Pioneering Instruction-tuned Language Modeling For Korean NLP Jang Dongjun, Lee Sangah, Byun Sungjoo, Kim Jinwoong, Seo Jean, Kim Minseok, Kim Soyeon, Oh Chaeyoung, Kim Jaeyoon, Jo Hyemi, Shin Hyopil
- Tailoring Language Generation Models Under Total Variation Distance Ji Haozhe, Ke Pei, Hu Zhipeng, Zhang Rongsheng, Huang Minlie
- Tigerscore: Towards Building Explainable Metric For All Text Generation Tasks Jiang Dongfu, Li Yishan, Zhang Ge, Huang Wenhao, Lin Bill Yuchen, Chen Wenhu
- Scaling Sentence Embeddings With Large Language Models Jiang Ting, Huang Shaohan, Luan Zhongzhi, Wang Deqing, Zhuang Fuzhen
- Chain Of Thought Prompt Tuning In Vision Language Models Ge Jiaxin, Luo Hongyin, Qian Siyuan, Gan Yulu, Fu Jie, Zhang Shanghang
- Llara: Large Language-recommendation Assistant Liao Jiayi, Li Sihang, Yang Zhengyi, Wu Jiancan, Yuan Yancheng, Wang Xiang, He Xiangnan
- BLIP-2: Bootstrapping Language-image Pre-training With Frozen Image Encoders And Large Language Models Li Junnan, Li Dongxu, Savarese Silvio, Hoi Steven
- Swectrl-mini: A Data-transparent Transformer-based Large Language Model For Controllable Text Generation In Swedish Kalpakchi Dmytro, Boye Johan
- Ever: Mitigating Hallucination In Large Language Models Through Real-time Verification And Rectification Kang Haoqiang, Ni Juntong, Yao Huaxiu
- Exploring Automatic Evaluation Methods Based On A Decoder-based LLM For Text Generation Kasahara Tomohito, Kawahara Daisuke
- GTA: Gated Toxicity Avoidance For LM Performance Preservation Kim Heegyu, Cho Hyunsouk
- Memory-efficient Fine-tuning Of Compressed Large Language Models Via Sub-4-bit Integer Quantization Kim Jeonghoon, Lee Jung Hyun, Kim Sungdong, Park Joonsuk, Yoo Kang Min, Kwon Se Jung, Lee Dongsoo
- Speculative Decoding With Big Little Decoder Kim Sehoon, Mangalam Karttikeya, Moon Suhong, Malik Jitendra, Mahoney Michael W., Gholami Amir, Keutzer Kurt
- Transformer Language Models Handle Word Frequency In Prediction Head Kobayashi Goro, Kuribayashi Tatsuki, Yokoi Sho, Inui Kentaro
- Sparse Fine-tuning For Inference Acceleration Of Large Language Models Kurtic Eldar, Kuznedelev Denis, Frantar Elias, Goin Michael, Alistarh Dan
- Longform: Effective Instruction Tuning With Reverse Instructions Köksal Abdullatif, Schick Timo, Korhonen Anna, Schütze Hinrich
- Multidimensional Evaluation For Text Style Transfer Using Chatgpt Lai Huiyuan, Toral Antonio, Nissim Malvina
- End-to-end Speech Recognition Contextualization With Large Language Models Lakomkin Egor, Wu Chunyang, Fathullah Yassir, Kalinli Ozlem, Seltzer Michael L., Fuegen Christian
- Building Efficient Universal Classifiers With Natural Language Inference Laurer Moritz, Van Atteveldt Wouter, Casas Andreu, Welbers Kasper
- Is Attention Required For ICL? Exploring The Relationship Between Model Architecture And In-context Learning Ability Lee Ivan, Jiang Nan, Berg-kirkpatrick Taylor
- Mitigating Fine-grained Hallucination By Fine-tuning Large Vision-language Models With Caption Rewrites Wang Lei, He Jiabang, Li Shenshen, Liu Ning, Lim Ee-peng
- The Eval4nlp 2023 Shared Task On Prompting Large Language Models As Explainable Metrics Leiter Christoph, Opitz Juri, Deutsch Daniel, Gao Yang, Dror Rotem, Eger Steffen
- Deliberate Then Generate: Enhanced Prompting Framework For Text Generation Li Bei, Wang Rui, Guo Junliang, Song Kaitao, Tan Xu, Hassan Hany, Menezes Arul, Xiao Tong, Bian Jiang, Zhu Jingbo
- Making Large Language Models A Better Foundation For Dense Retrieval Li Chaofan, Liu Zheng, Xiao Shitao, Shao Yingxia
- Learning To Rewrite Prompts For Personalized Text Generation Li Cheng, Zhang Mingyang, Mei Qiaozhu, Kong Weize, Bendersky Michael
- Teach Llms To Personalize -- An Approach Inspired By Writing Education Li Cheng, Zhang Mingyang, Mei Qiaozhu, Wang Yaqing, Hombaiah Spurthi Amba, Liang Yi, Bendersky Michael
- Autoconv: Automatically Generating Information-seeking Conversations With Large Language Models Li Siheng, Yang Cheng, Yin Yichun, Zhu Xinyu, Cheng Zesen, Shang Lifeng, Jiang Xin, Liu Qun, Yang Yujiu
- Explicit Syntactic Guidance For Neural Text Generation Li Yafu, Cui Leyang, Yan Jianhao, Yin Yongjing, Bi Wei, Shi Shuming, Zhang Yue
- Posgen: Personalized Opening Sentence Generation For Online Insurance Sales Li Yu, Zhang Yi, Wu Weijia, Zhou Zimu, Li Qiang
- Towards Vision Enhancing Llms: Empowering Multimodal Knowledge Storage And Sharing In Llms Li Yunxin, Hu Baotian, Wang Wei, Cao Xiaochun, Zhang Min
- Advancing Precise Outline-conditioned Text Generation With Task Duality And Explicit Outline Control Li Yunzhe, Chen Qian, Yan Weixiang, Wang Wen, Zhang Qinglin, Sundaram Hari
- Batgpt: A Bidirectional Autoregessive Talker From Generative Pre-trained Transformer Li Zuchao, Zhang Shitou, Zhao Hai, Yang Yifei, Yang Dongjie
- Uhgeval: Benchmarking The Hallucination Of Chinese Large Language Models Via Unconstrained Generation Liang Xun, Song Shichao, Niu Simin, Li Zhiyu, Xiong Feiyu, Tang Bo, Wang Yezhaohui, He Dawei, Cheng Peng, Wang Zhonghao, Deng Haiying
- Opening Up Chatgpt: Tracking Openness, Transparency, And Accountability In Instruction-tuned Text Generators Liesenfeld Andreas, Lopez Alianda, Dingemanse Mark
- Paralinguistics-enhanced Large Language Modeling Of Spoken Dialogue Lin Guan-ting, Shivakumar Prashanth Gurunath, Gandhe Ankur, Yang Chao-han Huck, Gu Yile, Ghosh Shalini, Stolcke Andreas, Lee Hung-yi, Bulyko Ivan
- AWQ: Activation-aware Weight Quantization For LLM Compression And Acceleration Lin Ji, Tang Jiaming, Tang Haotian, Yang Shang, Chen Wei-ming, Wang Wei-chen, Xiao Guangxuan, Dang Xingyu, Gan Chuang, Han Song
- Dynamic Scheduled Sampling With Imitation Loss For Neural Text Generation Lin Xiang, Jwalapuram Prathyusha, Joty Shafiq
- Exposing Attention Glitches With Flip-flop Language Modeling Liu Bingbin, Ash Jordan T., Goel Surbhi, Krishnamurthy Akshay, Zhang Cyril
- Logicot: Logical Chain-of-thought Instruction-tuning Liu Hanmeng, Teng Zhiyang, Cui Leyang, Zhang Chaoli, Zhou Qiji, Zhang Yue
- Ring Attention With Blockwise Transformers For Near-infinite Context Liu Hao, Zaharia Matei, Abbeel Pieter
- BOLT: Fast Energy-based Controlled Text Generation With Tunable Biases Liu Xin, Khalifa Muhammad, Wang Lu
- Litcab: Lightweight Language Model Calibration Over Short- And Long-form Responses Liu Xin, Khalifa Muhammad, Wang Lu
- RECALL: A Benchmark For Llms Robustness Against External Counterfactual Knowledge Liu Yi, Huang Lianzhe, Li Shicheng, Chen Sishuo, Zhou Hao, Meng Fandong, Zhou Jie, Sun Xu
- Calibrating Llm-based Evaluator Liu Yuxuan, Yang Tianchi, Huang Shaohan, Zhang Zihan, Huang Haizhen, Wei Furu, Deng Weiwei, Sun Feng, Zhang Qi
- Binary And Ternary Natural Language Generation Liu Zechun, Oguz Barlas, Pappu Aasish, Shi Yangyang, Krishnamoorthi Raghuraman
- Data-to-text Generation For Severely Under-resourced Languages With GPT-3.5: A Bit Of Help Needed From Google Translate Lorandi Michela, Belz Anya
- Bounding The Capabilities Of Large Language Models In Open Text Generation With Prompt Constraints Lu Albert, Zhang Hongxin, Zhang Yanzhe, Wang Xuezhi, Yang Diyi
- Ziya-visual: Bilingual Large Vision-language Model Via Multi-task Instruction Tuning Lu Junyu, Zhang Dixiang, Wu Xiaojun, Gao Xinyu, Gan Ruyi, Zhang Jiaxing, Song Yan, Zhang Pingjian
- Inference-time Policy Adapters (IPA): Tailoring Extreme-scale Lms Without Fine-tuning Lu Ximing, Brahman Faeze, West Peter, Jang Jaehun, Chandu Khyathi, Ravichander Abhilasha, Qin Lianhui, Ammanabrolu Prithviraj, Jiang Liwei, Ramnath Sahana, Dziri Nouha, Fisher Jillian, Lin Bill Yuchen, Hallinan Skyler, Ren Xiang, Welleck Sean, Choi Yejin
- Chatgpt As A Factual Inconsistency Evaluator For Text Summarization Luo Zheheng, Xie Qianqian, Ananiadou Sophia
- Macaw-llm: Multi-modal Language Modeling With Image, Audio, Video, And Text Integration Lyu Chenyang, Wu Minghao, Wang Longyue, Huang Xinting, Liu Bingshuai, Du Zefeng, Shi Shuming, Tu Zhaopeng
- Adapting Pre-trained Generative Models For Extractive Question Answering Mallick Prabir, Nayak Tapas, Bhattacharya Indrajit
- Large Language Models Know Your Contextual Search Intent: A Prompting Framework For Conversational Search Mao Kelong, Dou Zhicheng, Mo Fengran, Hou Jiewen, Chen Haonan, Qian Hongjin
- Establishing Vocabulary Tests As A Benchmark For Evaluating Large Language Models Martínez Gonzalo, Conde Javier, Merino-gómez Elena, Bermúdez-margaretto Beatriz, Hernández José Alberto, Reviriego Pedro, Brysbaert Marc
- Representation Deficiency In Masked Language Modeling Meng Yu, Krishnan Jitin, Wang Sinong, Wang Qifan, Mao Yuning, Fang Han, Ghazvininejad Marjan, Han Jiawei, Zettlemoyer Luke
- For Generated Text, Is Nli-neutral Text The Best Text? Mersinias Michail, Mahowald Kyle
- Think Before You Act: Unified Policy For Interleaving Language Reasoning With Actions Mezghani Lina, Bojanowski Piotr, Alahari Karteek, Sukhbaatar Sainbayar
- Adaptive Machine Translation With Large Language Models Moslem Yasmin, Haque Rejwanul, Kelleher John D., Way Andy
- The Impact Of Artificial Intelligence On The Evolution Of Digital Education: A Comparative Study Of Openai Text Generation Tools Including Chatgpt, Bing Chat, Bard, And Ernie Motlagh Negin Yazdani, Khajavi Matin, Sharifi Abbas, Ahmadi Mohsen
- Testing Llms On Code Generation With Varying Levels Of Prompt Specificity Murr Lincoln, Grainger Morgan, Gao David
- Cheaply Evaluating Inference Efficiency Metrics For Autoregressive Transformer Apis Narayanan Deepak, Santhanam Keshav, Henderson Peter, Bommasani Rishi, Lee Tony, Liang Percy
- Exploring The Maze Of Multilingual Modeling Nezhad Sina Bagheri, Agrawal Ameeta
- Codegen2: Lessons For Training Llms On Programming And Natural Languages Nijkamp Erik, Hayashi Hiroaki, Xiong Caiming, Savarese Silvio, Zhou Yingbo
- Contrastive Decoding Improves Reasoning In Large Language Models O'brien Sean, Lewis Mike
- Large Language Model (LLM) Bias Index -- LLMBI Oketunji Abiodun Finbarrs, Anas Muhammad, Saina Deepthi
- Towards Vision-language Mechanistic Interpretability: A Causal Tracing Tool For BLIP Palit Vedant, Pandey Rohan, Arora Aryaman, Liang Paul Pu
- Trainable Transformer In Transformer Panigrahi Abhishek, Malladi Sadhika, Xia Mengzhou, Arora Sanjeev
- On The Analysis Of Cross-lingual Prompt Tuning For Decoder-based Multilingual Model Park Nohil, Park Joonsuk, Yoo Kang Min, Yoon Sungroh
- The Impact Of Depth On Compositional Generalization In Transformer Language Models Petty Jackson, Van Steenkiste Sjoerd, Dasgupta Ishita, Sha Fei, Garrette Dan, Linzen Tal
- LLM Self Defense: By Self Examination, Llms Know They Are Being Tricked Phute Mansi, Helbling Alec, Hull Matthew, Peng Shengyun, Szyller Sebastian, Cornelius Cory, Chau Duen Horng
- Mosaicbert: A Bidirectional Encoder Optimized For Fast Pretraining Portes Jacob, Trott Alex, Havens Sam, King Daniel, Venigalla Abhinav, Nadeem Moin, Sardana Nikhil, Khudia Daya, Frankle Jonathan
- Logic-scaffolding: Personalized Aspect-instructed Recommendation Explanation Generation Using Llms Rahdari Behnam, Ding Hao, Fan Ziwei, Ma Yifei, Chen Zhuotong, Deoras Anoop, Kveton Branislav
- Mixed-distil-bert: Code-mixed Language Modeling For Bangla, English, And Hindi Raihan Md Nishat, Goswami Dhiman, Mahmud Antara
- Semantic Consistency For Assuring Reliability Of Large Language Models Raj Harsh, Gupta Vipul, Rosati Domenic, Majumdar Subhabrata
- Context Compression For Auto-regressive Transformers With Sentinel Tokens Ren Siyu, Jia Qi, Zhu Kenny Q.
- Unveiling The Potential Of Large Language Models In Generating Semantic And Cross-language Clones Roy Palash R., Alam Ajmain I., Al-omari Farouq, Roy Banani, Roy Chanchal K., Schneider Kevin A.
- Retrieval-pretrained Transformer: Long-range Language Modeling With Self-retrieval Rubin Ohad, Berant Jonathan
- Branch-solve-merge Improves Large Language Model Evaluation And Generation Saha Swarnadeep, Levy Omer, Celikyilmaz Asli, Bansal Mohit, Weston Jason, Li Xian
- UT5: Pretraining Non Autoregressive T5 With Unrolled Denoising Salem Mahmoud G., Ye Jiayu, Lin Chu-cheng, Liu Frederick
- Uni-nlx: Unifying Textual Explanations For Vision And Vision-language Tasks Sammani Fawaz, Deligiannis Nikos
- Toolformer: Language Models Can Teach Themselves To Use Tools Schick Timo, Dwivedi-yu Jane, Dessì Roberto, Raileanu Roberta, Lomeli Maria, Zettlemoyer Luke, Cancedda Nicola, Scialom Thomas
- Cross-lingual Supervision Improves Large Language Models Pre-training Schioppa Andrea, Garcia Xavier, Firat Orhan
- One-shot Sensitivity-aware Mixed Sparsity Pruning For Large Language Models Shao Hang, Liu Bei, Xiao Bo, Zeng Ke, Wan Guanglu, Qian Yanmin
- Graphextqa: A Benchmark For Evaluating Graph-enhanced Large Language Models Shen Yuanchun, Liao Ruotong, Han Zhen, Ma Yunpu, Tresp Volker
- EEL: Efficiently Encoding Lattices For Reranking Singhal Prasann, Xu Jiacheng, Ye Xi, Durrett Greg
- Dont Add, Dont Miss: Effective Content Preserving Generation From Pre-selected Text Spans Slobodkin Aviv, Caciularu Avi, Hirsch Eran, Dagan Ido
- The Synergy Of Speculative Decoding And Batching In Serving Large Language Models Su Qidong, Giannoula Christina, Pekhimenko Gennady
- RCMHA: Relative Convolutional Multi-head Attention For Natural Language Modelling Sugiharto Herman, Aradea, Mubarok Husni
- Datatales: Investigating The Use Of Large Language Models For Authoring Data-driven Articles Sultanum Nicole, Srinivasan Arjun
- Towards Verifiable Text Generation With Evolving Memory And Self-reflection Sun Hao, Cai Hengyi, Wang Bo, Hou Yingyan, Wei Xiaochi, Wang Shuaiqiang, Zhang Yan, Yin Dawei
- Character-level Chinese Backpack Language Models Sun Hao, Hewitt John
- Retentive Network: A Successor To Transformer For Large Language Models Sun Yutao, Dong Li, Huang Shaohan, Ma Shuming, Xia Yuqing, Xue Jilong, Wang Jianyong, Wei Furu
- Improving Non-autoregressive Translation Quality With Pretrained Language Model, Embedding Distillation And Upsampling Strategy For CTC Syu Shen-sian, Xie Juncheng, Lee Hung-yi
- Redcoast: A Lightweight Tool To Automate Distributed Training Of Llms On Any Gpu/tpus Tan Bowen, Zhu Yun, Liu Lijuan, Wang Hongyi, Zhuang Yonghao, Chen Jindong, Xing Eric, Hu Zhiting
- Rolecraft-glm: Advancing Personalized Role-playing In Large Language Models Tao Meiling, Liang Xuechen, Shi Tianyu, Yu Lei, Xie Yiting
- Graph Neural Prompting With Large Language Models Tian Yijun, Song Huan, Wang Zichen, Wang Haozhu, Hu Ziqing, Wang Fang, Chawla Nitesh V., Xu Panpan
- Unlocking Anticipatory Text Generation: A Constrained Approach For Large Language Models Decoding Tu Lifu, Yavuz Semih, Qu Jin, Xu Jiacheng, Meng Rui, Xiong Caiming, Zhou Yingbo
- Accelerating Llama Inference By Enabling Intermediate Layer Decoding Via Instruction Tuning With LITE Varshney Neeraj, Chatterjee Agneet, Parmar Mihir, Baral Chitta
- ASPIRO: Any-shot Structured Parsing-error-induced Reprompting For Consistent Data-to-text Generation Vejvar Martin, Fujimoto Yasutaka
- Shall We Pretrain Autoregressive Language Models With Retrieval? A Comprehensive Study Wang Boxin, Ping Wei, Xu Peng, Mcafee Lawrence, Liu Zihan, Shoeybi Mohammad, Dong Yi, Kuchaiev Oleksii, Li Bo, Xiao Chaowei, Anandkumar Anima, Catanzaro Bryan
- Cost-effective Hyperparameter Optimization For Large Language Model Generation Inference Wang Chi, Liu Susan Xueqing, Awadallah Ahmed H.
- Knowledgeable In-context Tuning: Exploring And Exploiting Factual Knowledge For In-context Learning Wang Jianing, Wang Chengyu, Tan Chuanqi, Huang Jun, Gao Ming
- Automated Evaluation Of Personalized Text Generation Using Large Language Models Wang Yaqing, Jiang Jiepu, Zhang Mingyang, Li Cheng, Liang Yi, Mei Qiaozhu, Bendersky Michael
- Enabling Language Models To Implicitly Learn Self-improvement Wang Ziqi, Hou Le, Lu Tianjian, Wu Yuexin, Li Yunxuan, Yu Hongkun, Ji Heng
- Call For Papers -- The Babylm Challenge: Sample-efficient Pretraining On A Developmentally Plausible Corpus Warstadt Alex, Choshen Leshem, Mueller Aaron, Williams Adina, Wilcox Ethan, Zhuang Chengxu
- Skywork: A More Open Bilingual Foundation Model Wei Tianwen, Zhao Liang, Zhang Lichang, Zhu Bo, Wang Lijie, Yang Haihua, Li Biye, Cheng Cheng, Lü Weiwei, Hu Rui, Li Chenxia, Yang Liu, Luo Xilin, Wu Xuejie, Liu Lunan, Cheng Wenjun, Cheng Peng, Zhang Jianhao, Zhang Xiaoyu, Lin Lei, Wang Xiaokun, Ma Yutuan, Dong Chuanhai, Sun Yanqi, Chen Yifu, Peng Yongyi, Liang Xiaojuan, Yan Shuicheng, Fang Han, Zhou Yahui
- In-context Pretraining: Language Modeling Beyond Document Boundaries Shi Weijia, Min Sewon, Lomeli Maria, Zhou Chunting, Li Margaret, Szilvasy Gergely, James Rich, Lin Xi Victoria, Smith Noah A., Zettlemoyer Luke, Yih Scott, Lewis Mike
- REPLUG: Retrieval-augmented Black-box Language Models Shi Weijia, Min Sewon, Yasunaga Michihiro, Seo Minjoon, James Rich, Lewis Mike, Zettlemoyer Luke, Yih Wen-tau
- Resolving References In Visually-grounded Dialogue Via Text Generation Willemsen Bram, Qian Livia, Skantze Gabriel
- Extrapolating Multilingual Understanding Models As Multilingual Generators Wu Bohong, Yuan Fei, Zhao Hai, Li Lei, Xu Jingjing
- From Language Modeling To Instruction Following: Understanding The Behavior Shift In Llms After Instruction Tuning Wu Xuansheng, Yao Wenlin, Chen Jianshu, Pan Xiaoman, Wang Xiaoyang, Liu Ninghao, Yu Dong
- TLM: Token-level Masking For Transformers Wu Yangjun, Fang Kebin, Zhang Dongxiang, Wang Han, Zhang Hao, Chen Gang
- Fine-grained Human Feedback Gives Better Rewards For Language Model Training Wu Zeqiu, Hu Yushi, Shi Weijia, Dziri Nouha, Suhr Alane, Ammanabrolu Prithviraj, Smith Noah A., Ostendorf Mari, Hajishirzi Hannaneh
- Improving Question Generation With Multi-level Content Planning Xia Zehua, Gou Qi, Yu Bowen, Yu Haiyang, Huang Fei, Li Yongbin, Nguyen Cam-tu
- Tencentllmeval: A Hierarchical Evaluation Of Real-world Capabilities For Human-aligned Llms Xie Shuyi, Yao Wenlin, Dai Yong, Wang Shaobo, Zhou Donlin, Jin Lifeng, Feng Xinhua, Wei Pengzhi, Lin Yujie, Hu Zhichao, Yu Dong, Zhang Zhengyou, Nie Jing, Liu Yuhong
- Effective Long-context Scaling Of Foundation Models Xiong Wenhan, Liu Jingyu, Molybog Igor, Zhang Hejia, Bhargava Prajjwal, Hou Rui, Martin Louis, Rungta Rashi, Sankararaman Karthik Abinav, Oguz Barlas, Khabsa Madian, Fang Han, Mehdad Yashar, Narang Sharan, Malik Kshitiz, Fan Angela, Bhosale Shruti, Edunov Sergey, Lewis Mike, Wang Sinong, Ma Hao
- Llmcad: Fast And Scalable On-device Large Language Model Inference Xu Daliang, Yin Wangsong, Jin Xin, Zhang Ying, Wei Shiyun, Xu Mengwei, Liu Xuanzhe
- Knowledge-infused Prompting: Assessing And Advancing Clinical Text Data Generation With Large Language Models Xu Ran, Cui Hejie, Yu Yue, Kan Xuan, Shi Wenqi, Zhuang Yuchen, Jin Wei, Ho Joyce, Yang Carl
- Llmrefine: Pinpointing And Refining Large Language Models Via Fine-grained Actionable Feedback Xu Wenda, Deutsch Daniel, Finkelstein Mara, Juraska Juraj, Zhang Biao, Liu Zhongtao, Wang William Yang, Li Lei, Freitag Markus
- INSTRUCTSCORE: Explainable Text Generation Evaluation With Finegrained Feedback Xu Wenda, Wang Danqing, Pan Liangming, Song Zhenqiao, Freitag Markus, Wang William Yang, Li Lei
- Understanding In-context Learning From Repetitions Yan Jianhao, Xu Jin, Song Chiyu, Wu Chenming, Li Yafu, Zhang Yue
- Effective Distillation Of Table-based Reasoning Ability From Llms Yang Bohao, Tang Chen, Zhao Kun, Xiao Chenghao, Lin Chenghua
- Instoptima: Evolutionary Multi-objective Instruction Optimization Via Large Language Model-based Instruction Operators Yang Heng, Li Ke
- Longqlora: Efficient And Effective Method To Extend Context Length Of Large Language Models Yang Jianxin
- Give Us The Facts: Enhancing Large Language Models With Knowledge Graphs For Fact-aware Language Modeling Yang Linyao, Chen Hongyang, Li Zhao, Ding Xiao, Wu Xindong
- Uncertainty-aware Language Modeling For Selective Question Answering Yang Qi, Ravikumar Shreya, Schmitt-ulms Fynn, Lolla Satvik, Demir Ege, Elistratov Iaroslav, Lavaee Alex, Lolla Sadhana, Ahmadi Elaheh, Rus Daniela, Amini Alexander, Perez Alejandro
- Integrating UMLS Knowledge Into Large Language Models For Medical Question Answering Yang Rui, Marrese-taylor Edison, Ke Yuhe, Cheng Lechao, Chen Qingyu, Li Irene
- Lidar-llm: Exploring The Potential Of Large Language Models For 3D Lidar Understanding Yang Senqiao, Liu Jiaming, Zhang Ray, Pan Mingjie, Guo Zoey, Li Xiaoqi, Chen Zehui, Gao Peng, Guo Yandong, Zhang Shanghang
- Improving The Reliability Of Large Language Models By Leveraging Uncertainty-aware In-context Learning Yang Yuchen, Li Houqiang, Wang Yanfeng, Wang Yu
- Re-vilm: Retrieval-augmented Visual Language Model For Zero And Few-shot Image Captioning Yang Zhuolin, Ping Wei, Liu Zihan, Korthikanti Vijay, Nie Weili, Huang De-an, Fan Linxi, Yu Zhiding, Lan Shiyi, Li Bo, Liu Ming-yu, Zhu Yuke, Shoeybi Mohammad, Catanzaro Bryan, Xiao Chaowei, Anandkumar Anima
- A Survey On Large Language Model (LLM) Security And Privacy: The Good, The Bad, And The Ugly Yao Yifan, Duan Jinhao, Xu Kaidi, Cai Yuanfang, Sun Zhibo, Zhang Yue
- Collaborative Large Language Model For Recommender Systems Zhu Yaochen, Wu Liang, Guo Qi, Hong Liangjie, Li Jundong
- Cognitive Mirage: A Review Of Hallucinations In Large Language Models Ye Hongbin, Liu Tong, Zhang Aijia, Hua Wei, Jia Weiqiang
- Diffusion Language Models Can Perform Many Tasks With Scaling And Instruction-finetuning Ye Jiasheng, Zheng Zaixiang, Bao Yu, Qian Lihua, Gu Quanquan
- Satlm: Satisfiability-aided Language Models Using Declarative Prompting Ye Xi, Chen Qiaochu, Dillig Isil, Durrett Greg
- Scaling Autoregressive Multi-modal Models: Pretraining And Instruction Tuning Yu Lili, Shi Bowen, Pasunuru Ramakanth, Muller Benjamin, Golovneva Olga, Wang Tianlu, Babu Arun, Tang Binh, Karrer Brian, Sheynin Shelly, Ross Candace, Polyak Adam, Howes Russell, Sharma Vasu, Xu Puxin, Tamoyan Hovhannes, Ashual Oron, Singer Uriel, Li Shang-wen, Zhang Susan, James Richard, Ghosh Gargi, Taigman Yaniv, Fazel-zarandi Maryam, Celikyilmaz Asli, Zettlemoyer Luke, Aghajanyan Armen
- MEGABYTE: Predicting Million-byte Sequences With Multiscale Transformers Yu Lili, Simig Dániel, Flaherty Colin, Aghajanyan Armen, Zettlemoyer Luke, Lewis Mike
- Information Association For Language Model Updating By Mitigating Lm-logical Discrepancy Yu Pengfei, Ji Heng
- Seqgpt: An Out-of-the-box Large Language Model For Open Domain Sequence Understanding Yu Tianyu, Jiang Chengyue, Lou Chao, Huang Shen, Wang Xiaobin, Liu Wei, Cai Jiong, Li Yangning, Li Yinghui, Tu Kewei, Zheng Hai-tao, Zhang Ningyu, Xie Pengjun, Huang Fei, Jiang Yong
- Evotext: Enhancing Natural Language Generation Models Via Self-escalation Learning For Up-to-date Knowledge And Improved Performance Yuan Zhengqing, Xue Huiwen, Zhang Chao, Liu Yongming
- Prompt Highlighter: Interactive Control For Multi-modal Llms Zhang Yuechen, Qian Shengju, Peng Bohao, Liu Shu, Jia Jiaya
- Pre-training To Learn In Context Gu Yuxian, Dong Li, Wei Furu, Huang Minlie
- Mm-narrator: Narrating Long-form Videos With Multimodal In-context Learning Zhang Chaoyi, Lin Kevin, Yang Zhengyuan, Wang Jianfeng, Li Linjie, Lin Chung-ching, Liu Zicheng, Wang Lijuan
- Video-llama: An Instruction-tuned Audio-visual Language Model For Video Understanding Zhang Hang, Li Xin, Bing Lidong
- Controllable Text Generation With Residual Memory Transformer Zhang Hanqing, Si Sun, Wu Haiming, Song Dawei
- Tractable Control For Autoregressive Language Generation Zhang Honghua, Dang Meihua, Peng Nanyun, Broeck Guy Van Den
- Linear Attention Via Orthogonal Memory Zhang Jun, Jiang Shuyang, Feng Jiangtao, Zheng Lin, Kong Lingpeng
- Marathon: A Race Through The Realm Of Long Context With Large Language Models Zhang Lei, Li Yunshui, Liu Ziqiang, Yang Jiaxi, Liu Junhao, Chen Longze, Luo Run, Yang Min
- Weighted Sampling For Masked Language Modeling Zhang Linhan, Chen Qian, Wang Wen, Deng Chong, Cao Xin, Hao Kongzhang, Jiang Yuxin, Wang Wei
- Llamavae: Guiding Large Language Model Generation Via Continuous Latent Sentence Spaces Zhang Yingji, Carvalho Danilo S., Pratt-hartmann Ian, Freitas André
- Cached Transformers: Improving Transformers With Differentiable Memory Cache Zhang Zhaoyang, Shao Wenqi, Ge Yixiao, Wang Xiaogang, Gu Jinwei, Luo Ping
- VISAR: A Human-ai Argumentative Writing Assistant With Visual Programming And Rapid Draft Prototyping Zhang Zheng, Gao Jie, Dhaliwal Ranjodh Singh, Li Toby Jia-jun
- A Survey Of Large Language Models Zhao Wayne Xin, Zhou Kun, Li Junyi, Tang Tianyi, Wang Xiaolei, Hou Yupeng, Min Yingqian, Zhang Beichen, Zhang Junjie, Dong Zican, Du Yifan, Yang Chen, Chen Yushuo, Chen Zhipeng, Jiang Jinhao, Ren Ruiyang, Li Yifan, Tang Xinyu, Liu Zikang, Liu Peiyu, Nie Jian-yun, Wen Ji-rong
- Easygen: Easing Multimodal Generation With Bidiffuser And Llms Zhao Xiangyu, Liu Bo, Liu Qijiong, Shi Guangyuan, Wu Xiao-ming
- Investigating Table-to-text Generation Capabilities Of Llms In Real-world Information Seeking Scenarios Zhao Yilun, Zhang Haowei, Si Shengyun, Nan Linyong, Tang Xiangru, Cohan Arman
- Are You Still On Track!? Catching LLM Task Drift With Activations Abdelnabi Sahar, Fay Aideen, Cherubin Giovanni, Salem Ahmed, Fritz Mario, Paverd Andrew
- Fine-grained Contract NER Using Instruction Based Model Adibhatla Hiranmai Sri, Baswani Pavan, Shrivastava Manish
- Keyformer: KV Cache Reduction Through Key Tokens Selection For Efficient Generative Inference Adnan Muhammad, Arunkumar Akhil, Jain Gaurav, Nair Prashant J., Soloveychik Ilya, Kamath Purushotham
- Mememqa: Multimodal Question Answering For Memes Via Rationale-based Inferencing Agarwal Siddhant, Sharma Shivam, Nakov Preslav, Chakraborty Tanmoy
- Tricy: Trigger-guided Data-to-text Generation With Intent Aware Attention-copy Agarwal Vibhav, Ghosh Sourav, Bss Harichandana, Arora Himanshu, Raja Barath Raj Kandur
- Codemirage: Hallucinations In Code Generated By Large Language Models Agarwal Vibhor, Pei Yulong, Alamir Salwa, Liu Xiaomo
- Leapformer: Enabling Linear Transformers For Autoregressive And Simultaneous Tasks Via Learned Proportions Agostinelli Victor, Hong Sanghyun, Chen Lizhong
- Scalable Ensembling For Mitigating Reward Overoptimisation Ahmed Ahmed M., Rafailov Rafael, Sharkov Stepan, Li Xuechen, Koyejo Sanmi
- QASE Enhanced Plms: Improved Control In Text Generation For MRC Ai Lin, Hui Zheng, Liu Zizhou, Hirschberg Julia
- Plug And Play With Prompts: A Prompt Tuning Approach For Controlling Text Generation Ajwani Rohan Deepak, Zhu Zining, Rose Jonathan, Rudzicz Frank
- In-context Language Learning: Architectures And Algorithms Akyürek Ekin, Wang Bailin, Kim Yoon, Andreas Jacob
- Syntaxshap: Syntax-aware Explainability Method For Text Generation Amara Kenza, Sevastjanova Rita, El-assady Mennatallah
- Semscore: Automated Evaluation Of Instruction-tuned Llms Based On Semantic Textual Similarity Aynetdinov Ansar, Akbik Alan
- Citrus: Chunked Instruction-aware State Eviction For Long Sequence Modeling Bai Yu, Zou Xiyuan, Huang Heyan, Chen Sanxing, Rondeau Marc-antoine, Gao Yang, Cheung Jackie Chi Kit
- Llms Will Always Hallucinate, And We Need To Live With This Banerjee Sourav, Agarwal Ayushi, Singla Saloni
- To Each (textual Sequence) Its Own: Improving Memorized-data Unlearning In Large Language Models Barbulescu George-octavian, Triantafillou Peter
- Xlstm: Extended Long Short-term Memory Beck Maximilian, Pöppel Korbinian, Spanring Markus, Auer Andreas, Prudnikova Oleksandra, Kopp Michael, Klambauer Günter, Brandstetter Johannes, Hochreiter Sepp
- The Unreasonable Ineffectiveness Of Nucleus Sampling On Mitigating Text Memorization Borec Luka, Sadler Philipp, Schlangen David
- Retrieving Examples From Memory For Retrieval Augmented Neural Machine Translation: A Systematic Comparison Bouthors Maxime, Crego Josep, Yvon Francois
- LOCOST: State-space Models For Long Document Abstractive Summarization Bronnec Florian Le, Duong Song, Ravaut Mathieu, Allauzen Alexandre, Chen Nancy F., Guigue Vincent, Lumbreras Alberto, Soulier Laure, Gallinari Patrick
- Exploring Precision And Recall To Assess The Quality And Diversity Of Llms Bronnec Florian Le, Verine Alexandre, Negrevergne Benjamin, Chevaleyre Yann, Allauzen Alexandre
- Olora: Orthonormal Low-rank Adaptation Of Large Language Models Büyükakyüz Kerim
- High-dimension Human Value Representation In Large Language Models Cahyawijaya Samuel, Chen Delong, Bang Yejin, Khalatbari Leila, Wilie Bryan, Ji Ziwei, Ishii Etsuko, Fung Pascale
- CERET: Cost-effective Extrinsic Refinement For Text Generation Cai Jason, Su Hang, Sunkara Monica, Shalyminov Igor, Mansour Saab
- Beyond Sparse Rewards: Enhancing Reinforcement Learning With Language Model Critique In Text Generation Cao Meng, Shu Lei, Yu Lei, Zhu Yun, Wichers Nevan, Liu Yinxiao, Meng Lei
- Dual Modalities Of Text: Visual And Textual Generative Pre-training Chai Yekun, Liu Qingyi, Xiao Jingwu, Wang Shuohuan, Sun Yu, Wu Hua
- Speechprompt: Prompting Speech Language Models For Speech Processing Tasks Chang Kai-wei, Wu Haibin, Wang Yu-kai, Wu Yuan-kuei, Shen Hua, Tseng Wei-cheng, Kang Iu-thing, Li Shang-wen, Lee Hung-yi
- Goldfish: Monolingual Language Models For 350 Languages Chang Tyler A., Arnett Catherine, Tu Zhuowen, Bergen Benjamin K.
- Detecting Hallucination And Coverage Errors In Retrieval Augmented Generation For Controversial Topics Chang Tyler A., Tomanek Katrin, Hoffmann Jessica, Thain Nithum, Van Liemt Erin, Meier-hellstern Kathleen, Dixon Lucas
- Xprompt: Explaining Large Language Model's Generation Via Joint Prompt Attribution Chang Yurui, Cao Bochuan, Wang Yujia, Chen Jinghui, Lin Lu
- When Large Language Models Meet Evolutionary Algorithms Wang Chao, Zhao Jiaxuan, Jiao Licheng, Li Lingling, Liu Fang, Yang Shuyuan
- Automated Data Curation For Robust Language Model Fine-tuning Chen Jiuhai, Mueller Jonas
- Identifying Query-relevant Neurons In Large Language Models For Long-form Texts Chen Lihu, Dejl Adam, Toni Francesca
- Ternaryllm: Ternarized Large Language Model Chen Tianqi, Li Zhe, Xu Weixiang, Zhu Zeyu, Li Dong, Tian Lu, Barsoum Emad, Wang Peisong, Cheng Jian
- Extroversion Or Introversion? Controlling The Personality Of Your Large Language Models Chen Yanquan, Wu Zhen, Guo Junjie, Huang Shujian, Dai Xinyu
- Recoverable Compression: A Multimodal Vision Token Recovery Mechanism Guided By Text Information Chen Yi, Xu Jian, Zhang Xu-yao, Liu Wen-zhuo, Liu Yang-yang, Liu Cheng-lin
- Online Personalizing White-box Llms Generation With Neural Bandits Chen Zekai, Daniel Weeden, Chen Po-yu, Buet-golfouse Francois
- ANOLE: An Open, Autoregressive, Native Large Multimodal Models For Interleaved Image-text Generation Chern Ethan, Su Jiadi, Ma Yan, Liu Pengfei
- Exploring Quantization For Efficient Pre-training Of Transformer Language Models Chitsaz Kamran, Fournier Quentin, Mordido Gonçalo, Chandar Sarath
- Cross-architecture Transfer Learning For Linear-cost Inference Transformers Choi Sehyun
- A Better LLM Evaluator For Text Generation: The Impact Of Prompt Output Sequencing And Optimization Chu Kuanchao, Chen Yi-pei, Nakayama Hideki
- Teenytinyllama: Open-source Tiny Language Models Trained In Brazilian Portuguese Corrêa Nicholas Kluge, Falk Sophia, Fatimah Shiza, Sen Aniket, De Oliveira Nythamar
- Moeut: Mixture-of-experts Universal Transformers Csordás Róbert, Irie Kazuki, Schmidhuber Jürgen, Potts Christopher, Manning Christopher D.
- Application Of GPT Language Models For Innovation In Activities In University Teaching De Buenaga Manuel, Bueno Francisco Javier
- Deconstructing The Ethics Of Large Language Models From Long-standing Issues To New-emerging Dilemmas Deng Chengyuan, Duan Yiqun, Jin Xin, Chang Heng, Tian Yijun, Liu Han, Zou Henry Peng, Jin Yiqiao, Xiao Yijia, Wang Yichen, Wu Shenghao, Xie Zongxing, Gao Kuofeng, He Sihong, Zhuang Jun, Cheng Lu, Wang Haohan
- Teaching A Multilingual Large Language Model To Understand Multilingual Speech Via Multi-instructional Training Denisov Pavel, Vu Ngoc Thang
- Glam: Fine-tuning Large Language Models For Domain Knowledge Graph Alignment Via Neighborhood Partitioning And Generative Subgraph Encoding Dernbach Stefan, Agarwal Khushbu, Zuniga Alejandro, Henry Michael, Choudhury Sutanay
- Promises, Outlooks And Challenges Of Diffusion Language Modeling Deschenaux Justin, Gulcehre Caglar
- Chain-of-translation Prompting (cotr): A Novel Prompting Technique For Low Resource Languages Deshpande Tejas, Kowtal Nidhi, Joshi Raviraj
- Facts-and-feelings: Capturing Both Objectivity And Subjectivity In Table-to-text Generation Dey Tathagata, Bhattacharyya Pushpak
- Shaping Human-ai Collaboration: Varied Scaffolding Levels In Co-writing With Language Models Dhillon Paramveer S., Molaei Somayeh, Li Jiaqi, Golub Maximilian, Zheng Shaochun, Robert Lionel P.
- Finding Blind Spots In Evaluator Llms With Interpretable Checklists Doddapaneni Sumanth, Khan Mohammed Safi Ur Rahman, Verma Sshubam, Khapra Mitesh M.
- Pruner-zero: Evolving Symbolic Pruning Metric From Scratch For Large Language Models Dong Peijie, Li Lujun, Tang Zhenheng, Liu Xiang, Pan Xinglin, Wang Qiang, Chu Xiaowen
- QAQ: Quality Adaptive Quantization For LLM KV Cache Dong Shichen, Cheng Wen, Qin Jiayu, Wang Wei
- A Framework For Real-time Safeguarding The Text Generation Of Large Language Model Dong Ximing, Lin Dayi, Wang Shaowei, Hassan Ahmed E.
- Biomedical Large Languages Models Seem Not To Be Superior To Generalist Models On Unseen Medical Data Dorfner Felix J., Dada Amin, Busch Felix, Makowski Marcus R., Han Tianyu, Truhn Daniel, Kleesiek Jens, Sushil Madhumita, Lammert Jacqueline, Adams Lisa C., Bressem Keno K.
- Fedmkt: Federated Mutual Knowledge Transfer For Large And Small Language Models Fan Tao, Ma Guoqiang, Kang Yan, Gu Hanlin, Song Yuanfeng, Fan Lixin, Chen Kai, Yang Qiang
- Biasalert: A Plug-and-play Tool For Social Bias Detection In Llms Fan Zhiting, Chen Ruizhe, Xu Ruiling, Liu Zuozhu
- Integrating A.I. In Higher Education: Protocol For A Pilot Study With 'samcares: An Adaptive Learning Hub' Faruqui Syed Hasib Akhter, Tasnim Nazia, Basith Iftekhar Ibne, Obeidat Suleiman, Yildiz Faruk
- Teaching Llms To Abstain Across Languages Via Multilingual Feedback Feng Shangbin, Shi Weijia, Wang Yike, Ding Wenxuan, Ahia Orevaoghene, Li Shuyue Stella, Balachandran Vidhisha, Sitaram Sunayana, Tsvetkov Yulia
- Unveiling And Manipulating Prompt Influence In Large Language Models Feng Zijian, Zhou Hanzhang, Zhu Zixiao, Qian Junlang, Mao Kezhi
- The Chronicles Of RAG: The Retriever, The Chunk And The Generator Finardi Paulo, Avila Leonardo, Castaldoni Rodrigo, Gengo Pedro, Larcher Celio, Piau Marcos, Costa Pablo, Caridá Vinicius
- Thinking Fair And Slow: On The Efficacy Of Structured Prompts For Debiasing Language Models Furniturewala Shaz, Jandial Surgan, Java Abhinav, Banerjee Pragyan, Shahid Simra, Bhatia Sumit, Jaidka Kokil
- Selfcp: Compressing Over-limit Prompt Via The Frozen Large Language Model Itself Gao Jun, Cao Ziqiang, Li Wenjie
- Sketch-guided Constrained Decoding For Boosting Blackbox Large Language Models Without Logit Access Geng Saibo, Döner Berkay, Wendler Chris, Josifoski Martin, West Robert
- Analysis Of Plan-based Retrieval For Grounded Text Generation Godbole Ameya, Monath Nicholas, Kim Seungyeon, Rawat Ankit Singh, Mccallum Andrew, Zaheer Manzil
- Direct Alignment Of Draft Model For Speculative Decoding With Chat-fine-tuned Llms Goel Raghavv, Gagrani Mukul, Jeon Wonseok, Park Junyoung, Lee Mingu, Lott Christopher
- Inference Acceleration For Large Language Models Using "stairs" Assisted Greedy Generation Grigaliūnas Domas, Lukoševičius Mantas
- Would I Lie To You? Inference Time Alignment Of Language Models Using Direct Preference Heads Hadji-kyriacou Avelina Asada, Arandjelovic Ognjen
- Llm-personalize: Aligning LLM Planners With Human Preferences Via Reinforced Self-training For Housekeeping Robots Han Dongge, Mcinroe Trevor, Jelley Adam, Albrecht Stefano V., Bell Peter, Storkey Amos
- Transfer Learning For Text Diffusion Models Han Kehang, Kenealy Kathleen, Barua Aditya, Fiedel Noah, Constant Noah
- Prompt-based Bias Calibration For Better Zero/few-shot Learning Of Language Models He Kang, Long Yinghan, Roy Kaushik
- MARS: Mixture Of Auto-regressive Models For Fine-grained Text-to-image Synthesis He Wanggui, Fu Siming, Liu Mushui, Wang Xierui, Xiao Wenyi, Shu Fangxun, Wang Yi, Zhang Lei, Yu Zhelun, Li Haoyuan, Huang Ziwei, Gan Leilei, Jiang Hao
- Fortifying Ethical Boundaries In AI: Advanced Strategies For Enhancing Security In Large Language Models He Yunhong, Qiu Jianling, Zhang Wei, Yuan Zhengqing
- HMT: Hierarchical Memory Transformer For Long Context Language Processing He Zifan, Qin Zongyue, Prakriya Neha, Sun Yizhou, Cong Jason
- N-gram Prediction And Word Difference Representations For Language Modeling Heo Dongnyeong, Rim Daniela Noemi, Choi Heeyoul
- Generative Visual Instruction Tuning Hernandez Jefferson, Villegas Ruben, Ordonez Vicente
- Lilium: Ebay's Large Language Models For E-commerce Herold Christian, Kozielski Michael, Ekimov Leonid, Petrushkov Pavel, Vandenbussche Pierre-yves, Khadivi Shahram
- Deepspeed-fastgen: High-throughput Text Generation For Llms Via MII And Deepspeed-inference Holmes Connor, Tanaka Masahiro, Wyatt Michael, Awan Ammar Ahmad, Rasley Jeff, Rajbhandari Samyam, Aminabadi Reza Yazdani, Qin Heyang, Bakhtiari Arash, Kurilenko Lev, He Yuxiong
- Accelerating Multilingual Language Model For Excessively Tokenized Languages Hong Jimin, Lee Gibbeum, Cho Jaewoong
- Outcome-constrained Large Language Models For Countering Hate Speech Hong Lingzi, Luo Pengcheng, Blanco Eduardo, Song Xiaoying
- Generative Pretrained Structured Transformers: Unsupervised Syntactic Language Models At Scale Hu Xiang, Ji Pengyu, Zhu Qingyang, Wu Wei, Tu Kewei
- Can Perplexity Reflect Large Language Model's Ability In Long Text Understanding? Hu Yutong, Huang Quzhe, Tao Mingxu, Zhang Chen, Feng Yansong
- Prompt Perturbation In Retrieval-augmented Generation Based Large Language Models Hu Zhibo, Wang Chen, Shu Yanfeng, Paik Hye-young Helen, Zhu Liming
- Generating Educational Materials With Different Levels Of Readability Using Llms Huang Chieh-yang, Wei Jing, Huang Ting-hao 'kenneth'
- Verbalized Probabilistic Graphical Modeling With Large Language Models Huang Hengguan, Shen Xing, Wang Songtao, Liu Dianbo, Wang Hao
- Conme: Rethinking Evaluation Of Compositional Reasoning For Modern Vlms Huang Irene, Lin Wei, Mirza M. Jehanzeb, Hansen Jacob A., Doveh Sivan, Butoi Victor Ion, Herzig Roei, Arbelle Assaf, Kuehne Hilde, Darrell Trevor, Gan Chuang, Oliva Aude, Feris Rogerio, Karlinsky Leonid
- Demystifying Verbatim Memorization In Large Language Models Huang Jing, Yang Diyi, Potts Christopher
- Compression Represents Intelligence Linearly Huang Yuzhen, Zhang Jinghan, Shan Zifei, He Junxian
- Hallucinations Or Attention Misdirection? The Path To Strategic Value Extraction In Business Using Large Language Models Ioste Aline
- Autocompletion Of Chief Complaints In The Electronic Health Records Using Large Language Models Islam K M Sajjadul, Nipu Ayesha Siddika, Madiraju Praveen, Deshpande Priya
- Integrating Large Language Models With Graph-based Reasoning For Conversational Question Answering Jain Parag, Lapata Mirella
- OLAPH: Improving Factuality In Biomedical Long-form Question Answering Jeong Minbyul, Hwang Hyeon, Yoon Chanwoong, Lee Taewhoo, Kang Jaewoo
- Generalizing Reward Modeling For Out-of-distribution Preference Learning Jia Chen
- Can Large Language Models Generate High-quality Patent Claims? Jiang Lekang, Zhang Caiqi, Scherz Pascal A, Goetz Stephan
- APEER: Automatic Prompt Engineering Enhances Large Language Model Reranking Jin Can, Peng Hongwu, Zhao Shiyu, Wang Zhenting, Xu Wujiang, Han Ligong, Zhao Jiahui, Zhong Kai, Rajasekaran Sanguthevar, Metaxas Dimitris N.
- Effective Context Selection In Llm-based Leaderboard Generation: An Empirical Study Kabongo Salomon, D'souza Jennifer, Auer Sören
- Counterfactuals As A Means For Evaluating Faithfulness Of Attribution Methods In Autoregressive Language Models Kamahi Sepehr, Yaghoobzadeh Yadollah
- Continuous Language Model Interpolation For Dynamic And Controllable Text Generation Kangaslahti Sara, Alvarez-melis David
- Beyond Traditional Benchmarks: Analyzing Behaviors Of Open Llms On Data-to-text Generation Kasner Zdeněk, Dušek Ondřej
- Finding A Needle In The Adversarial Haystack: A Targeted Paraphrasing Approach For Uncovering Edge Cases With Minimal Distribution Distortion Kassem Aly M., Saad Sherif
- Spectra: A Comprehensive Study Of Ternary, Quantized, And FP16 Language Models Kaushal Ayush, Pandey Tejas, Vaidhya Tejas, Bhagat Aaryan, Rish Irina
- A Comprehensive Survey Of Accelerated Generation Techniques In Large Language Models Khoshnoodi Mahsa, Jain Vinija, Gao Mingye, Srikanth Malavika, Chadha Aman
- Ontology-free General-domain Knowledge Graph-to-text Generation Dataset Synthesis Using Large Language Model Kim Daehee, Kang Deokhyung, Ryu Sangwon, Lee Gary Geunbae
- What's Wrong? Refining Meeting Summaries With LLM Feedback Kirstein Frederic, Ruas Terry, Gipp Bela
- Transllama: Llm-based Simultaneous Translation System Koshkin Roman, Sudoh Katsuhito, Nakamura Satoshi
- Understanding The Effects Of Iterative Prompting On Truthfulness Krishna Satyapriya, Agarwal Chirag, Lakkaraju Himabindu
- Alarm: Align Language Models Via Hierarchical Rewards Modeling Lai Yuhang, Wang Siyuan, Liu Shujun, Huang Xuanjing, Wei Zhongyu
- Probability Of Differentiation Reveals Brittleness Of Homogeneity Bias In Large Language Models Lee Messi H. J., Lai Calvin K.
- Cascade Reward Sampling For Efficient Decoding-time Alignment Li Bolian, Wang Yifan, Grama Ananth, Zhang Ruqi
- Recall, Retrieve And Reason: Towards Better In-context Relation Extraction Li Guozheng, Wang Peng, Ke Wenjun, Guo Yikai, Ji Ke, Shang Ziyu, Liu Jiajun, Xu Zijie
- A Cross-language Investigation Into Jailbreak Attacks In Large Language Models Li Jie, Liu Yi, Liu Chongyang, Shi Ling, Ren Xiaoning, Zheng Yaowen, Liu Yang, Xue Yinxing
- Predicting Vs. Acting: A Trade-off Between World Modeling & Agent Modeling Li Margaret, Shi Weijia, Pagnoni Artidoro, West Peter, Holtzman Ari
- On The Power Of Convolution Augmented Transformer Li Mingchen, Zhang Xuechen, Huang Yixiao, Oymak Samet
- Nearest Neighbor Speculative Decoding For LLM Generation And Attribution Li Minghan, Chen Xilun, Holtzman Ari, Chen Beidi, Lin Jimmy, Yih Wen-tau, Lin Xi Victoria
- Moe-ct: A Novel Approach For Large Language Models Training With Resistance To Catastrophic Forgetting Li Tianhao, Li Shangjie, Xie Binbin, Xiong Deyi, Yang Baosong
- Reinforcement Learning With Token-level Feedback For Controllable Text Generation Li Wendi, Wei Wei, Xu Kaihe, Xie Wenfeng, Chen Dangyang, Cheng Yu
- Personalized Language Modeling From Personalized Human Feedback Li Xinyu, Lipton Zachary C., Leqi Liu
- Backdoorllm: A Comprehensive Benchmark For Backdoor Attacks On Large Language Models Li Yige, Huang Hanxun, Zhao Yunhan, Ma Xingjun, Sun Jun
- A Simple But Effective Approach To Improve Structured Language Model Output For Information Extraction Li Yinghao, Ramprasad Rampi, Zhang Chao
- Focusllm: Scaling Llm's Context By Parallel Decoding Li Zhenyu, Zhang Yike, Pan Tengyu, Sun Yutao, Duan Zhichao, Fang Junjie, Han Rong, Wang Zixuan, Wang Jianyong
- Two Stacks Are Better Than One: A Comparison Of Language Modeling And Translation As Multilingual Pretraining Objectives Li Zihao, Ji Shaoxiong, Mickus Timothee, Segonne Vincent, Tiedemann Jörg
- Controllable Text Generation For Large Language Models: A Survey Liang Xun, Wang Hanyu, Wang Yezhaohui, Song Shichao, Yang Jiawei, Niu Simin, Hu Jie, Liu Dan, Yao Shunyu, Xiong Feiyu, Li Zhiyu
- Evaluating Text-to-visual Generation With Image-to-text Generation Lin Zhiqiu, Pathak Deepak, Li Baiqi, Li Jiayao, Xia Xide, Neubig Graham, Zhang Pengchuan, Ramanan Deva
- Navigating The Dual Facets: A Comprehensive Evaluation Of Sequential Memory Editing In Large Language Models Lin Zihao, Beigi Mohammad, Li Hongxuan, Zhou Yufan, Zhang Yuxiang, Wang Qifan, Yin Wenpeng, Huang Lifu
- Compromising Embodied Agents With Contextual Backdoor Attacks Liu Aishan, Zhou Yuguang, Liu Xianglong, Zhang Tianyuan, Liang Siyuan, Wang Jiakai, Pu Yanjun, Li Tianlin, Zhang Junqi, Zhou Wenbo, Guo Qing, Tao Dacheng
- Cool-fusion: Fuse Large Language Models Without Training Liu Cong, Quan Xiaojun, Pan Yan, Lin Liang, Wu Weigang, Chen Xu
- Lumina-mgpt: Illuminate Flexible Photorealistic Text-to-image Generation With Multimodal Generative Pretraining Liu Dongyang, Zhao Shitian, Zhuo Le, Lin Weifeng, Qiao Yu, Li Hongsheng, Gao Peng
- Synthetic Context Generation For Question Generation Liu Naiming, Wang Zichao, Baraniuk Richard
- Flashback: Efficient Retrieval-augmented Language Modeling For Long Context Inference Liu Runheng, Xiao Xingchen, Huang Heyan, Chi Zewen, Wu Zhijing
- Multi-group Uncertainty Quantification For Long-form Text Generation Liu Terrance, Wu Zhiwei Steven
- Memlong: Memory-augmented Retrieval For Long Text Modeling Liu Weijie, Tang Zecheng, Li Juntao, Chen Kehai, Zhang Min
- SHIELD: Evaluation And Defense Strategies For Copyright Compliance In LLM Text Generation Liu Xiaoze, Sun Ting, Xu Tianyang, Wu Feijie, Wang Cunxiang, Wang Xiaoqian, Gao Jing
- Fantastic Semantics And Where To Find Them: Investigating Which Layers Of Generative Llms Reflect Lexical Semantics Liu Zhu, Kong Cunliang, Liu Ying, Sun Maosong
- Loqt: Low-rank Adapters For Quantized Pre-training Loeschcke Sebastian, Toftrup Mads, Kastoryano Michael J., Belongie Serge, Snæbjarnarson Vésteinn
- Glória - A Generative And Open Large Language Model For Portuguese Lopes Ricardo, Magalhães João, Semedo David
- High-quality Data-to-text Generation For Severely Under-resourced Languages With Out-of-the-box Large Language Models Lorandi Michela, Belz Anya
- Sparser Is Faster And Less Is More: Efficient Sparse Attention For Long-range Transformers Lou Chao, Jia Zixia, Zheng Zilong, Tu Kewei
- Towards Transfer Unlearning: Empirical Evidence Of Cross-domain Bias Mitigation Lu Huimin, Isonuma Masaru, Mori Junichiro, Sakata Ichiro
- BGE Landmark Embedding: A Chunking-free Embedding Method For Retrieval Augmented Long-context Large Language Models Luo Kun, Liu Zheng, Xiao Shitao, Liu Kang
- Taking A Deep Breath: Enhancing Language Modeling Of Large Language Models With Sentinel Tokens Luo Weiyao, Zheng Suncong, Xia Heming, Wang Weikang, Lei Yan, Liu Tianyu, Chen Shuang, Sui Zhifang
- Impact Of Model Size On Fine-tuned LLM Performance In Data-to-text Generation: A State-of-the-art Investigation Mahapatra Joy, Garain Utpal
- Learning To Plan Long-term For Language Modeling Mai Florian, Cornille Nathan, Moens Marie-francine
- Rephrasing The Web: A Recipe For Compute And Data-efficient Language Modeling Maini Pratyush, Seto Skyler, Bai He, Grangier David, Zhang Yizhe, Jaitly Navdeep
- Set-based Prompting: Provably Solving The Language Model Order Dependency Problem Mcilroy-young Reid, Brown Katrina, Olson Conlan, Zhang Linjun, Dwork Cynthia
- Permitqa: A Benchmark For Retrieval Augmented Generation In Wind Siting And Permitting Domain Meyur Rounak, Phan Hung, Wagle Sridevi, Strube Jan, Halappanavar Mahantesh, Horawalavithana Sameera, Acharya Anurag, Munikoti Sai
- Exploration Of Masked And Causal Language Modelling For Text Generation Micheletti Nicolo, Belkadi Samuel, Han Lifeng, Nenadic Goran
- CBF-LLM: Safe Control For LLM Alignment Miyaoka Yuya, Inoue Masaki
- Convsdg: Session Data Generation For Conversational Search Mo Fengran, Yi Bole, Mao Kelong, Qu Chen, Huang Kaiyu, Nie Jian-yun
- Large Language Model (LLM) AI Text Generation Detection Based On Transformer Deep Learning Algorithm Mo Yuhong, Qin Hao, Dong Yushan, Zhu Ziyi, Li Zhenglin
- Memllm: Finetuning Llms To Use An Explicit Read-write Memory Modarressi Ali, Köksal Abdullatif, Imani Ayyoob, Fayyaz Mohsen, Schütze Hinrich
- Realm: Reference Resolution As Language Modeling Moniz Joel Ruben Antony, Krishnan Soundarya, Ozyildirim Melis, Saraf Prathamesh, Ates Halim Cagri, Zhang Yuan, Yu Hong
- Repliqa: A Question-answering Dataset For Benchmarking Llms On Unseen Reference Content Monteiro Joao, Noel Pierre-andre, Marcotte Etienne, Rajeswar Sai, Zantedeschi Valentina, Vazquez David, Chapados Nicolas, Pal Christopher, Taslakian Perouz
- Language Modelling Approaches To Adaptive Machine Translation Moslem Yasmin
- Leave No Context Behind: Efficient Infinite Context Transformers With Infini-attention Munkhdalai Tsendsuren, Faruqui Manaal, Gopal Siddharth
- Compact Language Models Via Pruning And Knowledge Distillation Muralidharan Saurav, Sreenivas Sharath Turuvekere, Joshi Raviraj, Chochowski Marcin, Patwary Mostofa, Shoeybi Mohammad, Catanzaro Bryan, Kautz Jan, Molchanov Pavlo
- Novel-wd: Exploring Acquisition Of Novel World Knowledge In Llms Using Prefix-tuning Méloux Maxime, Cerisara Christophe
- What Drives Performance In Multilingual Language Models? Nezhad Sina Bagheri, Agrawal Ameeta
- Ceval: A Benchmark For Evaluating Counterfactual Text Generation Nguyen Van Bach, Schlötterer Jörg, Seifert Christin
- Vikhr: The Family Of Open-source Instruction-tuned Large Language Models For Russian Nikolich Aleksandr, Korolev Konstantin, Shelmanov Artem, Kiselev Igor
- Ambignlg: Addressing Task Ambiguity In Instruction For NLG Niwa Ayana, Iso Hayate
- A Systematic Review Of Data-to-text NLG Osuji Chinonso Cynthia, Ferreira Thiago Castro, Davis Brian
- Can Watermarking Large Language Models Prevent Copyrighted Text Generation And Hide Training Data? Panaitescu-liess Michael-andrei, Che Zora, An Bang, Xu Yuancheng, Pathmanathan Pankayaraj, Chakraborty Souradip, Zhu Sicheng, Goldstein Tom, Huang Furong
- Shaking Up Vlms: Comparing Transformers And Structured State Space Models For Vision & Language Modeling Pantazopoulos Georgios, Nikandrou Malvina, Suglia Alessandro, Lemon Oliver, Eshghi Arash
- Token-picker: Accelerating Attention In Text Generation With Minimized Memory Transfer Via Probability Estimation Park Junyoung, Kang Myeonggu, Han Yunki, Kim Yanggon, Shin Jaekang, Kim Lee-sup
- Suri: Multi-constraint Instruction Following For Long-form Text Generation Pham Chau Minh, Sun Simeng, Iyyer Mohit
- Evaluation Of Language Models In The Medical Context Under Resource-constrained Settings Posada Andrea, Rueckert Daniel, Meissen Felix, Müller Philip
- Accelerating Large Language Model Pretraining Via LFR Pedagogy: Learn, Focus, And Review Prakriya Neha, Yen Jui-nan, Hsieh Cho-jui, Cong Jason
- Supportiveness-based Knowledge Rewriting For Retrieval-augmented Language Modeling Qiao Zile, Ye Wei, Jiang Yong, Mo Tong, Xie Pengjun, Li Weiping, Huang Fei, Zhang Shikun
- Various Lengths, Constant Speed: Efficient Language Modeling With Lightning Attention Qin Zhen, Sun Weigao, Li Dong, Shen Xuyang, Sun Weixuan, Zhong Yiran
- The Frontier Of Data Erasure: Machine Unlearning For Large Language Models Qu Youyang, Ding Ming, Sun Nan, Thilakarathna Kanchana, Zhu Tianqing, Niyato Dusit
- A Critical Look At Tokenwise Reward-guided Text Generation Rashid Ahmad, Wu Ruotian, Grosse Julia, Kristiadi Agustinus, Poupart Pascal
- Llms For Test Input Generation For Semantic Caches Rasool Zafaryab, Barnett Scott, Willie David, Kurniawan Stefanus, Balugo Sherwin, Thudumu Srikanth, Abdelrazek Mohamed
- Developing Safe And Responsible Large Language Model: Can We Balance Bias Reduction And Language Understanding In Large Language Models? Raza Shaina, Bamgbose Oluwanifemi, Ghuge Shardul, Tavakol Fatemeh, Reji Deepak John, Bashir Syed Raza
- FIRST: Faster Improved Listwise Reranking With Single Token Decoding Reddy Revanth Gangi, Doo Jaehyeok, Xu Yifei, Sultan Md Arafat, Swain Deevya, Sil Avirup, Ji Heng
- Samba: Simple Hybrid State Space Models For Efficient Unlimited Context Language Modeling Ren Liliang, Liu Yang, Lu Yadong, Shen Yelong, Liang Chen, Chen Weizhu
- Achieving Peak Performance For Large Language Models: A Systematic Review Rostam Zhyar Rzgar K, Szénási Sándor, Kertész Gábor
- Tandem Transformers For Inference Efficient Llms S Aishwarya P, Nair Pranav Ajit, Samaga Yashas, Boyd Toby, Kumar Sanjiv, Jain Prateek, Netrapalli Praneeth
- Llms' Understanding Of Natural Language Revealed Saba Walid S.
- Neurocache: Efficient Vector Retrieval For Long-range Language Modeling Safaya Ali, Yuret Deniz
- Risks, Causes, And Mitigations Of Widespread Deployments Of Large Language Models (llms): A Survey Sakib Md Nazmus, Islam Md Athikul, Pathak Royal, Arifin Md Mashrur
- Evaluating Consistency And Reasoning Capabilities Of Large Language Models Saxena Yash, Chopra Sarthak, Tripathi Arunendra Mani
- Self-distillation For Model Stacking Unlocks Cross-lingual NLU In 200+ Languages Schmidt Fabian David, Borchert Philipp, Vulić Ivan, Glavaš Goran
- Tencdm: Understanding The Properties Of Diffusion Model In The Space Of Language Model Encodings Shabalin Alexander, Meshchaninov Viacheslav, Chimbulatov Egor, Lapikov Vladislav, Kim Roman, Bartosh Grigory, Molchanov Dmitry, Markov Sergey, Vetrov Dmitry
- Extensible Embedding: A Flexible Multipler For Llm's Context Length Shao Ninglu, Xiao Shitao, Liu Zheng, Zhang Peitian
- Combining Multiple Post-training Techniques To Achieve Most Efficient Quantized Llms Sharify Sayeh, Xu Zifei, Yazar Wanzin, Wang Xin
- Non-autoregressive Sequence-to-sequence Vision-language Models Shi Kunyu, Dong Qi, Goncalves Luis, Tu Zhuowen, Soatto Stefano
- Paying More Attention To Image: A Training-free Method For Alleviating Hallucination In Lvlms Liu Shi, Zheng Kecheng, Chen Wei
- Constructing Benchmarks And Interventions For Combating Hallucinations In Llms Simhi Adi, Herzig Jonathan, Szpektor Idan, Belinkov Yonatan
- ELLA-V: Stable Neural Codec Language Modeling With Alignment-guided Sequence Reordering Song Yakun, Chen Zhuo, Wang Xiaofei, Ma Ziyang, Chen Xie
- Moviellm: Enhancing Long Video Understanding With Ai-generated Movies Song Zhende, Wang Chenchen, Sheng Jiamu, Zhang Chi, Yu Gang, Fan Jiayuan, Chen Tao
- Know When To Stop: A Study Of Semantic Drift In Text Generation Spataru Ava, Hambro Eric, Voita Elena, Cancedda Nicola
- Generaitor: Tree-in-the-loop Text Generation For Language Model Explainability And Adaptation Spinner Thilo, Kehlbeck Rebecca, Sevastjanova Rita, Stähle Tobias, Keim Daniel A., Deussen Oliver, El-assady Mennatallah
- Repetition Improves Language Model Embeddings Springer Jacob Mitchell, Kotha Suhas, Fried Daniel, Neubig Graham, Raghunathan Aditi
- Confabulation: The Surprising Value Of Large Language Model Hallucinations Sui Peiqi, Duede Eamon, Wu Sophie, So Richard Jean
- Llm-oriented Retrieval Tuner Sun Si, Zhang Hanqing, Liu Zhiyuan, Bao Jie, Song Dawei
- Towards Detecting Llms Hallucination Via Markov Chain-based Multi-agent Debate Framework Sun Xiaoxi, Li Jinpeng, Zhong Yan, Zhao Dongyan, Yan Rui
- Multi-turn Context Jailbreak Attack On Large Language Models From First Principles Sun Xiongtao, Zhang Deyue, Yang Dongdong, Zou Quanchen, Li Hui
- PIXAR: Auto-regressive Language Modeling In Pixel Space Tai Yintao, Liao Xiyang, Suglia Alessandro, Vergari Antonio
- PROXYQA: An Alternative Framework For Evaluating Long-form Text Generation With Large Language Models Tan Haochen, Guo Zhijiang, Shi Zhan, Xu Lu, Liu Zhili, Feng Yunlong, Li Xiaoguang, Wang Yasheng, Shang Lifeng, Liu Qun, Song Linqi
- Idgenrec: Llm-recsys Alignment With Textual ID Learning Tan Juntao, Xu Shuyuan, Hua Wenyue, Ge Yingqiang, Li Zelong, Zhang Yongfeng
- Unlocking The Potential Of Model Merging For Low-resource Languages Tao Mingxu, Zhang Chen, Huang Quzhe, Ma Tianyao, Huang Songfang, Zhao Dongyan, Feng Yansong
- CUDRT: Benchmarking The Detection Of Human Vs. Large Language Models Generated Texts Tao Zhen, Li Zhiyu, Xi Dinghao, Xu Wei
- Embedding-aligned Language Models Tennenholtz Guy, Chow Yinlam, Hsu Chih-wei, Shani Lior, Liang Ethan, Boutilier Craig
- Zyda: A 1.3T Dataset For Open Language Modeling Tokpanov Yury, Millidge Beren, Glorioso Paolo, Pilault Jonathan, Ibrahim Adam, Whittington James, Anthony Quentin
- Cpsdbench: A Large Language Model Evaluation Benchmark And Baseline For Chinese Public Security Domain Tong Xin, Jin Bo, Lin Zhi, Wang Binjun, Yu Ting, Cheng Qiang
- Readctrl: Personalizing Text Generation With Readability-controlled Instruction Learning Tran Hieu, Yao Zonghai, Li Lingxi, Yu Hong
- Transformers Are Ssms: Generalized Models And Efficient Algorithms Through Structured State Space Duality Dao Tri, Gu Albert
- Dynamo: Accelerating Language Model Inference With Dynamic Multi-token Sampling Tuli Shikhar, Lin Chi-heng, Hsu Yen-chang, Jha Niraj K., Shen Yilin, Jin Hongxia
- VBART: The Turkish LLM Turker Meliksah, Ari Mehmet Erdi, Han Aydin
- Emotional Manipulation Through Prompt Engineering Amplifies Disinformation Generation In AI Large Language Models Vinay Rasita, Spitale Giovanni, Biller-andorno Nikola, Germani Federico
- Vi-mistral-x: Building A Vietnamese Language Model With Advanced Continual Pre-training Vo James
- An Empirical Study Of Mamba-based Language Models Waleffe Roger, Byeon Wonmin, Riach Duncan, Norick Brandon, Korthikanti Vijay, Dao Tri, Gu Albert, Hatamizadeh Ali, Singh Sudhakar, Narayanan Deepak, Kulshreshtha Garvit, Singh Vartika, Casper Jared, Kautz Jan, Shoeybi Mohammad, Catanzaro Bryan
- COSMO: Contrastive Streamlined Multimodal Model With Interleaved Pre-training Wang Alex Jinpeng, Li Linjie, Lin Kevin Qinghong, Wang Jianfeng, Lin Kevin, Yang Zhengyuan, Wang Lijuan, Shou Mike Zheng
- Reinforcement Learning For Edit-based Non-autoregressive Neural Machine Translation Wang Hao, Morimura Tetsuro, Honda Ukyo, Kawahara Daisuke
- Interpretable Preferences Via Multi-objective Reward Modeling And Mixture-of-experts Wang Haoxiang, Xiong Wei, Xie Tengyang, Zhao Han, Zhang Tong
- Length Generalization Of Causal Transformers Without Position Encoding Wang Jie, Ji Tao, Wu Yuanbin, Yan Hang, Gui Tao, Zhang Qi, Huang Xuanjing, Wang Xiaoling
- Mambabyte: Token-free Selective State Space Model Wang Junxiong, Gangavarapu Tushaar, Yan Jing Nathan, Rush Alexander M.
- The Mamba In The Llama: Distilling And Accelerating Hybrid Models Wang Junxiong, Paliotta Daniele, May Avner, Rush Alexander M., Dao Tri
- Deconfounded Causality-aware Parameter-efficient Fine-tuning For Problem-solving Improvement Of Llms Wang Ruoyu, Li Xiaoxuan, Yao Lina
- Resonance Rope: Improving Context Length Generalization Of Large Language Models Wang Suyuchen, Kobyzev Ivan, Lu Peng, Rezagholizadeh Mehdi, Liu Bang
- Ladic: Are Diffusion Models Really Inferior To Autoregressive Counterparts For Image-to-text Generation? Wang Yuchi, Ren Shuhuai, Gao Rundong, Yao Linli, Guo Qingyan, An Kaikai, Bai Jianhong, Sun Xu
- LSTP: Language-guided Spatial-temporal Prompt Learning For Long-form Video-text Understanding Wang Yuxuan, Wang Yueqian, Wu Pengfei, Liang Jianxin, Zhao Dongyan, Zheng Zilong
- Hidden In Plain Sight: Exploring Chat History Tampering In Interactive Language Models Wei Cheng'an, Zhao Yue, Gong Yujia, Chen Kai, Xiang Lu, Zhu Shenchen
- Training-free Exponential Extension Of Sliding Window Context With Cascading KV Cache Willette Jeffrey, Lee Heejun, Lee Youngwan, Jeon Myeongjae, Hwang Sung Ju
- What Are The Limits Of Cross-lingual Dense Passage Retrieval For Low-resource Languages? Wu Jie, Ren Zhaochun, Verberne Suzan
- Motionllm: Multimodal Motion-language Learning With Large Language Models Wu Qi, Zhao Yubo, Wang Yifan, Tai Yu-wing, Tang Chi-keung
- Do Language Models Plan Ahead For Future Tokens? Wu Wilson, Morris John X., Levine Lionel
- Longgenbench: Benchmarking Long-form Generation In Long Context Llms Wu Yuhao, Hee Ming Shan, Hu Zhiqing, Lee Roy Ka-wei
- Clover-2: Accurate Inference For Regressive Lightweight Speculative Decoding Xiao Bin, Gui Lujun, Su Lei, Chen Weipeng
- Show-o: One Single Transformer To Unify Multimodal Understanding And Generation Xie Jinheng, Mao Weijia, Bai Zechen, Zhang David Junhao, Wang Weihao, Lin Kevin Qinghong, Gu Yuchao, Chen Zhijie, Yang Zhenheng, Shou Mike Zheng
- Recall: Membership Inference Via Relative Conditional Log-likelihoods Xie Roy, Wang Junlin, Huang Ruomin, Zhang Minxing, Ge Rong, Pei Jian, Gong Neil Zhenqiang, Dhingra Bhuwan
- Exploratory Preference Optimization: Harnessing Implicit Q*-approximation For Sample-efficient RLHF Xie Tengyang, Foster Dylan J., Krishnamurthy Akshay, Rosset Corby, Awadallah Ahmed, Rakhlin Alexander
- RALL-E: Robust Codec Language Modeling With Chain-of-thought Prompting For Text-to-speech Synthesis Xin Detai, Tan Xu, Shen Kai, Ju Zeqian, Yang Dongchao, Wang Yuancheng, Takamichi Shinnosuke, Saruwatari Hiroshi, Liu Shujie, Li Jinyu, Zhao Sheng
- Unsupervised Information Refinement Training Of Large Language Models For Retrieval-augmented Generation Xu Shicheng, Pang Liang, Yu Mo, Meng Fandong, Shen Huawei, Cheng Xueqi, Zhou Jie
- Bridging The Gap Between Different Vocabularies For LLM Ensemble Xu Yangyifan, Lu Jinliang, Zhang Jiajun
- Beyond Perplexity: Multi-dimensional Safety Evaluation Of LLM Compression Xu Zhichao, Gupta Ashim, Li Tao, Bentham Oliver, Srikumar Vivek
- LEIA: Facilitating Cross-lingual Knowledge Transfer In Language Models With Entity-based Data Augmentation Yamada Ikuya, Ri Ryokan
- Memory³: Language Modeling With Explicit Memory Yang Hongkang, Lin Zehao, Wang Wenjin, Wu Hao, Li Zhiyu, Tang Bo, Wei Wenqiang, Wang Jinbo, Tang Zeyun, Song Shichao, Xi Chenyang, Yu Yu, Chen Kai, Xiong Feiyu, Tang Linpeng, E Weinan
- Regularizing Hidden States Enables Learning Generalizable Reward Model For Llms Yang Rui, Ding Ruomeng, Lin Yong, Zhang Huan, Zhang Tong
- Parallelizing Linear Transformers With The Delta Rule Over Sequence Length Yang Songlin, Wang Bailin, Zhang Yu, Shen Yikang, Kim Yoon
- Training Ultra Long Context Language Model With Fully Pipelined Distributed Transformer Yao Jinghan, Jacobs Sam Ade, Tanaka Masahiro, Ruwase Olatunji, Shafi Aamir, Subramoni Hari, Panda Dhabaleswar K.
- Sirllm: Streaming Infinite Retentive LLM Yao Yao, Li Zuchao, Zhao Hai
- Improving Visual Commonsense In Language Models Via Multiple Image Generation Yariv Guy, Schwartz Idan, Adi Yossi, Benaim Sagie
- Preference-guided Reflective Sampling For Aligning Language Models Ye Hai, Ng Hwee Tou
- Long-context Language Modeling With Parallel Context Encoding Yen Howard, Gao Tianyu, Chen Danqi
- Characterizing Truthfulness In Large Language Model Generations With Local Intrinsic Dimension Yin Fan, Srinivasa Jayanth, Chang Kai-wei
- Stablemask: Refining Causal Masking In Decoder-only Transformer Yin Qingyu, He Xuzheng, Zhuang Xiang, Zhao Yu, Yao Jianhua, Shen Xiaoyu, Zhang Qiang
- Chatmusician: Understanding And Generating Music Intrinsically With LLM Yuan Ruibin, Lin Hanfeng, Wang Yi, Tian Zeyue, Wu Shangda, Shen Tianhao, Zhang Ge, Wu Yuhang, Liu Cong, Zhou Ziya, Ma Ziyang, Xue Liumeng, Wang Ziyu, Liu Qin, Zheng Tianyu, Li Yizhi, Ma Yinghao, Liang Yiming, Chi Xiaowei, Liu Ruibo, Wang Zili, Li Pengfei, Wu Jingcheng, Lin Chenghua, Liu Qifeng, Jiang Tao, Huang Wenhao, Chen Wenhu, Benetos Emmanouil, Fu Jie, Xia Gus, Dannenberg Roger, Xue Wei, Kang Shiyin, Guo Yike
- Tcmbench: A Comprehensive Benchmark For Evaluating Large Language Models In Traditional Chinese Medicine Yue Wenjing, Wang Xiaoling, Zhu Wei, Guan Ming, Zheng Huanran, Wang Pengfei, Sun Changzhi, Ma Xin
- Wkvquant: Quantizing Weight And Key/value Cache For Large Language Models Gains More Yue Yuxuan, Yuan Zhihang, Duanmu Haojie, Zhou Sifan, Wu Jianlong, Nie Liqiang
- DELIA: Diversity-enhanced Learning For Instruction Adaptation In Large Language Models Zeng Yuanhao, Ren Fei, Zhou Xinpeng, Wang Yihang, Shao Yingxia
- LUQ: Long-text Uncertainty Quantification For Llms Zhang Caiqi, Liu Fangyu, Basaldella Marco, Collier Nigel
- Code Representation Learning At Scale Zhang Dejiao, Ahmad Wasi, Tan Ming, Ding Hantian, Nallapati Ramesh, Roth Dan, Ma Xiaofei, Xiang Bing
- Speechgpt-gen: Scaling Chain-of-information Speech Generation Zhang Dong, Zhang Xin, Zhan Jun, Li Shimin, Zhou Yaqian, Qiu Xipeng
- Exploring The Best Practices Of Query Expansion With Large Language Models Zhang Le, Wu Yihong, Yang Qian, Nie Jian-yun
- Generative Verifiers: Reward Modeling As Next-token Prediction Zhang Lunjun, Hosseini Arian, Bansal Hritik, Kazemi Mehran, Kumar Aviral, Agarwal Rishabh
- Soaring From 4K To 400K: Extending Llm's Context With Activation Beacon Zhang Peitian, Liu Zheng, Xiao Shitao, Shao Ninglu, Ye Qiwei, Dou Zhicheng
- Mindspeech: Continuous Imagined Speech Decoding Using High-density Fnirs And Prompt Tuning For Advanced Human-ai Interaction Zhang Suyi, Alam Ekram, Baber Jack, Bianco Francesca, Turner Edward, Chamanzar Maysam, Dehghani Hamid
- R4: Reinforced Retriever-reorder-responder For Retrieval-augmented Large Language Models Zhang Taolin, Li Dongyang, Chen Qizhou, Wang Chengyu, Huang Longtao, Xue Hui, He Xiaofeng, Huang Jun
- Pushing The Limit Of LLM Capacity For Text Classification Zhang Yazhou, Wang Mengyao, Ren Chenyu, Li Qiuchi, Tiwari Prayag, Wang Benyou, Qin Jing
- Found In The Middle: How Language Models Use Long Contexts Better Via Plug-and-play Positional Encoding Zhang Zhenyu, Chen Runjin, Liu Shiwei, Yao Zhewei, Ruwase Olatunji, Chen Beidi, Wu Xiaoxia, Wang Zhangyang
- Longskywork: A Training Recipe For Efficiently Extending Context Length In Large Language Models Zhao Liang, Wei Tianwen, Zeng Liang, Cheng Cheng, Yang Liu, Cheng Peng, Wang Lijie, Li Chenxia, Wu Xuejie, Zhu Bo, Gan Yimeng, Hu Rui, Yan Shuicheng, Fang Han, Zhou Yahui
- HALC: Object Hallucination Reduction Via Adaptive Focal-contrast Decoding Zhaorun Chen, Zhuokai Zhao, Hongyin Luo, Huaxiu Yao, Bo Li, Jiawei Zhou
- Llamafactory: Unified Efficient Fine-tuning Of 100+ Language Models Zheng Yaowei, Zhang Richong, Zhang Junhao, Ye Yanhan, Luo Zheyan, Feng Zhangchi, Ma Yongqiang
- Evaluating The Smooth Control Of Attribute Intensity In Text Generation With Llms Zhou Shang, Yao Feng, Dong Chengyu, Wang Zihan, Shang Jingbo
- Benchmarking Large Language Models On CFLUE -- A Chinese Financial Language Understanding Evaluation Dataset Zhu Jie, Li Junhui, Wen Yalong, Guo Lifan
- Scalable Matmul-free Language Modeling Zhu Rui-jie, Zhang Yu, Sifferman Ethan, Sheaves Tyler, Wang Yiqiao, Richmond Dustin, Zhou Peng, Eshraghian Jason K.
- Quite Good, But Not Enough: Nationality Bias In Large Language Models -- A Case Study Of Chatgpt Zhu Shucheng, Wang Weikang, Liu Ying
- IAPT: Instruction-aware Prompt Tuning For Large Language Models Zhu Wei, Tian Aaron Xuxiang, Yin Congrui, Ni Yuan, Wang Xiaoling, Xie Guotong
- Protllm: An Interleaved Protein-language LLM With Protein-as-word Pre-training Zhuo Le, Chi Zewen, Xu Minghao, Huang Heyan, Zheng Heqi, He Conghui, Mao Xian-ling, Zhang Wentao
- Language Model Sentence Completion With A Parser-driven Rhetorical Control Method Zingale Joshua, Kalita Jugal
- Proswitch: Knowledge-guided Instruction Tuning To Generate Professional And Non-professional Styled Text Zong Chang, Chen Yuyan, Lu Weiming, Shao Jian, Zhuang Yueting
- Systematic Task Exploration With Llms: A Study In Citation Text Generation Şahinuç Furkan, Kuznetsov Ilia, Hou Yufang, Gurevych Iryna
🏷 Large Scale Training
- Cross-lingual Machine Reading Comprehension Cui Yiming, Che Wanxiang, Liu Ting, Qin Bing, Wang Shijin, Hu Guoping
- Addressing Some Limitations Of Transformers With Feedback Memory Fan Angela, Lavril Thibaut, Grave Edouard, Joulin Armand, Sukhbaatar Sainbayar
- Scaling Laws For Neural Language Models Jared Kaplan, Sam Mccandlish, Tom Henighan, Tom B. Brown, Benjamin Chess, Rewon Child, Scott Gray, Alec Radford, Jeffrey Wu, Dario Amodei
- Just Ask: Learning To Answer Questions From Millions Of Narrated Videos Yang Antoine, Miech Antoine, Sivic Josef, Laptev Ivan, Schmid Cordelia
- Layered Gradient Accumulation And Modular Pipeline Parallelism: Fast And Efficient Training Of Large Language Models Lamy-poirier Joel
- Pagnol: An Extra-large French Generative Model Launay Julien, Tommasone Elena, Pannier Baptiste, Boniface François, Chatelain Amélie, Cappelli Alessandro, Poli Iacopo, Seddah Djamé
- Exploring Transformers In Natural Language Generation: GPT, BERT, And Xlnet Topal M. Onat, Bas Anil, Van Heerden Imke
- Yuan 1.0: Large-scale Pre-trained Language Model In Zero-shot And Few-shot Learning Wu Shaohua, Zhao Xudong, Yu Tong, Zhang Rongguo, Shen Chong, Liu Hongli, Li Feng, Zhu Hong, Luo Jiangang, Xu Liang, Zhang Xuanwei
- Cramming: Training A Language Model On A Single GPU In One Day Geiping Jonas, Goldstein Tom
- Mulzdg: Multilingual Code-switching Framework For Zero-shot Dialogue Generation Liu Yongkang, Feng Shi, Wang Daling, Zhang Yifei
- Torchscale: Transformers At Scale Ma Shuming, Wang Hongyu, Huang Shaohan, Wang Wenhui, Chi Zewen, Dong Li, Benhaim Alon, Patra Barun, Chaudhary Vishrav, Song Xia, Wei Furu
- Using Deepspeed And Megatron To Train Megatron-turing NLG 530B, A Large-scale Generative Language Model Shaden Smith, Mostofa Patwary, Brandon Norick, Patrick Legresley, Samyam Rajbhandari, Jared Casper, Zhun Liu, Shrimai Prabhumoye, George Zerveas, Vijay Korthikanti, Elton Zhang, Rewon Child, Reza Yazdani Aminabadi, Julie Bernauer, Xia Song, Mohammad Shoeybi, Yuxiong He, Michael Houston, Saurabh Tiwary, Bryan Catanzaro
- Staged Training For Transformer Language Models Shen Sheng, Walsh Pete, Keutzer Kurt, Dodge Jesse, Peters Matthew, Beltagy Iz
- Large Language Models Are Zero-shot Reasoners Takeshi Kojima, Shixiang Shane Gu, Machel Reid, Yutaka Matsuo, Yusuke Iwasawa
- Learning To Answer Visual Questions From Web Videos Yang Antoine, Miech Antoine, Sivic Josef, Laptev Ivan, Schmid Cordelia
- Transcending Scaling Laws With 0.1% Extra Compute Yi Tay, Jason Wei, Hyung Won Chung, Vinh Q. Tran, David R. So, Siamak Shakeri, Xavier Garcia, Huaixiu Steven Zheng, Jinfeng Rao, Aakanksha Chowdhery, Denny Zhou, Donald Metzler, Slav Petrov, Neil Houlsby, Quoc V. Le, Mostafa Dehghani
- Scaling Laws For Language Encoding Models In Fmri Antonello Richard, Vaidya Aditya, Huth Alexander G.
- Vtrain: A Simulation Framework For Evaluating Cost-effective And Compute-optimal Large Language Model Training Bang Jehyeon, Choi Yujeong, Kim Myeongwoo, Kim Yongdeok, Rhu Minsoo
- RWKV: Reinventing Rnns For The Transformer Era Bo Peng, Eric Alcaide, Quentin Anthony, Alon Albalak, Samuel Arcadinho, Stella Biderman, Huanqi Cao, Xin Cheng, Michael Chung, Matteo Grella, Kranthi Kiran Gv, Xuzheng He, Haowen Hou, Jiaju Lin, Przemyslaw Kazienko, Jan Kocon, Jiaming Kong, Bartlomiej Koptyra, Hayden Lau, Krishna Sri Ipsit Mantri, Ferdinand Mom, Atsushi Saito, Guangyu Song, Xiangru Tang, Bolun Wang, Johan S. Wind, Stanislaw Wozniak, Ruichong Zhang, Zhenyuan Zhang, Qihang Zhao, Peng Zhou, Qinghua Zhou, Jian Zhu, Rui-jie Zhu
- Revealing The Structure Of Language Model Capabilities Burnell Ryan, Hao Han, Conway Andrew R. A., Orallo Jose Hernandez
- EE-LLM: Large-scale Training And Inference Of Early-exit Large Language Models With 3D Parallelism Chen Yanxi, Pan Xuchen, Li Yaliang, Ding Bolin, Zhou Jingren
- Confidant: Customizing Transformer-based Llms Via Collaborative Edge Training Chen Yuhao, Yan Yuxuan, Yang Qianqian, Shu Yuanchao, He Shibo, Chen Jiming
- Optimizing Distributed Training On Frontier For Large Language Models Dash Sajal, Lyngaas Isaac, Yin Junqi, Wang Xiao, Egele Romain, Cong Guojing, Wang Feiyi, Balaprakash Prasanna
- Ziya2: Data-centric Learning Is All Llms Need Gan Ruyi, Wu Ziwei, Sun Renliang, Lu Junyu, Wu Xiaojun, Zhang Dixiang, Pan Kunhao, He Junqing, Tian Yuanhe, Yang Ping, Yang Qi, Wang Hao, Zhang Jiaxing, Song Yan
- Efficient Parallelization Layouts For Large-scale Distributed Model Training Hagemann Johannes, Weinbach Samuel, Dobler Konstantin, Schall Maximilian, De Melo Gerard
- Soft Prompt Decoding For Multilingual Dense Retrieval Huang Zhiqi, Zeng Hansi, Zamani Hamed, Allan James
- Speculative Decoding With Big Little Decoder Kim Sehoon, Mangalam Karttikeya, Moon Suhong, Malik Jitendra, Mahoney Michael W., Gholami Amir, Keutzer Kurt
- Scaling Laws Of Rope-based Extrapolation Liu Xiaoran, Yan Hang, Zhang Shuo, An Chenxin, Qiu Xipeng, Lin Dahua
- Codegen2: Lessons For Training Llms On Programming And Natural Languages Nijkamp Erik, Hayashi Hiroaki, Xiong Caiming, Savarese Silvio, Zhou Yingbo
- Beyond Chinchilla-optimal: Accounting For Inference In Language Model Scaling Laws Sardana Nikhil, Portes Jacob, Doubov Sasha, Frankle Jonathan
- The Languini Kitchen: Enabling Language Modelling Research At Different Scales Of Compute Stanić Aleksandar, Ashley Dylan, Serikov Oleg, Kirsch Louis, Faccio Francesco, Schmidhuber Jürgen, Hofmann Thomas, Schlag Imanol
- Emergent And Predictable Memorization In Large Language Models Stella Biderman, Usvsn Sai Prashanth, Lintang Sutawika, Hailey Schoelkopf, Quentin Anthony, Shivanshu Purohit, Edward Raff
- Corex: Pushing The Boundaries Of Complex Reasoning Through Multi-model Collaboration Sun Qiushi, Yin Zhangyue, Li Xiang, Wu Zhiyong, Qiu Xipeng, Kong Lingpeng
- Redcoast: A Lightweight Tool To Automate Distributed Training Of Llms On Any Gpu/tpus Tan Bowen, Zhu Yun, Liu Lijuan, Wang Hongyi, Zhuang Yonghao, Chen Jindong, Xing Eric, Hu Zhiting
- YUAN 2.0: A Large Language Model With Localized Filtering-based Attention Wu Shaohua, Zhao Xudong, Wang Shenling, Luo Jiangang, Li Lingjun, Chen Xi, Zhao Bing, Wang Wei, Yu Tong, Zhang Rongguo, Zhang Jiahua, Wang Chao
- Nanolm: An Affordable LLM Pre-training Benchmark Via Accurate Loss Prediction Across Scales Yao Yiqun, Fan Siqi, Huang Xiusheng, Fang Xuezhi, Li Xiang, Ni Ziyi, Jiang Xin, Meng Xuying, Han Peng, Shang Shuo, Liu Kang, Sun Aixin, Wang Yequan
- Extrapolating Large Language Models To Non-english By Aligning Languages Zhu Wenhao, Lv Yunzhe, Dong Qingxiu, Yuan Fei, Xu Jingjing, Huang Shujian, Kong Lingpeng, Chen Jiajun, Li Lei
- Cost-effective Instruction Learning For Pathology Vision And Language Analysis Chen Kaitao, Liu Mianxin, Yan Fang, Ma Lei, Shi Xiaoming, Wang Lilong, Wang Xiaosong, Zhu Lifeng, Wang Zhe, Zhou Mu, Zhang Shaoting
- Are We On The Right Way For Evaluating Large Vision-language Models? Chen Lin, Li Jinsong, Dong Xiaoyi, Zhang Pan, Zang Yuhang, Chen Zehui, Duan Haodong, Wang Jiaqi, Qiao Yu, Lin Dahua, Zhao Feng
- Kv-runahead: Scalable Causal LLM Inference By Parallel Key-value Cache Generation Cho Minsik, Rastegari Mohammad, Naik Devang
- A Tale Of Tails: Model Collapse As A Change Of Scaling Laws Dohmatob Elvis, Feng Yunzhen, Yang Pu, Charton Francois, Kempe Julia
- HLAT: High-quality Large Language Model Pre-trained On AWS Trainium Fan Haozheng, Zhou Hao, Huang Guangtai, Raman Parameswaran, Fu Xinwei, Gupta Gaurav, Ram Dhananjay, Wang Yida, Huan Jun
- AI And Memory Wall Gholami Amir, Yao Zhewei, Kim Sehoon, Hooper Coleman, Mahoney Michael W., Keutzer Kurt
- Chatglm-rlhf: Practices Of Aligning Large Language Models With Human Feedback Hou Zhenyu, Niu Yilin, Du Zhengxiao, Zhang Xiaohan, Liu Xiao, Zeng Aohan, Zheng Qinkai, Huang Minlie, Wang Hongning, Tang Jie, Dong Yuxiao
- Openrlhf: An Easy-to-use, Scalable And High-performance RLHF Framework Hu Jian, Wu Xibin, Wang Weixun, Xianyu, Zhang Dehao, Cao Yu
- Characterization Of Large Language Model Development In The Datacenter Hu Qinghao, Ye Zhisheng, Wang Zerui, Wang Guoteng, Zhang Meng, Chen Qiaoling, Sun Peng, Lin Dahua, Wang Xiaolin, Luo Yingwei, Wen Yonggang, Zhang Tianwei
- Scaling Laws For Downstream Task Performance Of Large Language Models Isik Berivan, Ponomareva Natalia, Hazimeh Hussein, Paparas Dimitris, Vassilvitskii Sergei, Koyejo Sanmi
- Adaptive Skeleton Graph Decoding Jin Shuowei, Wu Yongji, Zheng Haizhong, Zhang Qingzhao, Lentz Matthew, Mao Z. Morley, Prakash Atul, Qian Feng, Zhuo Danyang
- Scaling Laws For Forgetting When Fine-tuning Large Language Models Kalajdzievski Damjan
- Are Bigger Encoders Always Better In Vision Large Models? Li Bozhou, Liang Hao, Meng Zimo, Zhang Wentao
- 52B To 1T: Lessons Learned Via Tele-flm Series Li Xiang, Yao Yiqun, Jiang Xin, Fang Xuezhi, Wang Chao, Liu Xinzhang, Wang Zihan, Zhao Yu, Wang Xin, Huang Yuyao, Song Shuangyong, Li Yongxiang, Zhang Zheng, Zhao Bo, Sun Aixin, Wang Yequan, He Zhongjiang, Wang Zhongyuan, Li Xuelong, Huang Tiejun
- Why Are My Prompts Leaked? Unraveling Prompt Extraction Threats In Customized Large Language Models Liang Zi, Hu Haibo, Ye Qingqing, Xiao Yaxin, Li Haoyang
- Token-wise Influential Training Data Retrieval For Large Language Models Lin Huawei, Long Jikai, Xu Zhaozhuo, Zhao Weijie
- Rephrasing The Web: A Recipe For Compute And Data-efficient Language Modeling Maini Pratyush, Seto Skyler, Bai He, Grangier David, Zhang Yizhe, Jaitly Navdeep
- Realhf: Optimized RLHF Training For Large Language Models Through Parameter Reallocation Mei Zhiyu, Fu Wei, Li Kaiwei, Wang Guangju, Zhang Huanchen, Wu Yi
- Large Language Models: A Survey Minaee Shervin, Mikolov Tomas, Nikzad Narjes, Chenaghlu Meysam, Socher Richard, Amatriain Xavier, Gao Jianfeng
- LISA: Layerwise Importance Sampling For Memory-efficient Large Language Model Fine-tuning Pan Rui, Liu Xiang, Diao Shizhe, Pi Renjie, Zhang Jipeng, Han Chi, Zhang Tong
- Empirical Guidelines For Deploying Llms Onto Resource-constrained Edge Devices Qin Ruiyang, Liu Dancheng, Yan Zheyu, Tan Zhaoxuan, Pan Zixuan, Jia Zhenge, Jiang Meng, Abbasi Ahmed, Xiong Jinjun, Shi Yiyu
- GSR-BENCH: A Benchmark For Grounded Spatial Reasoning Evaluation Via Multimodal Llms Rajabi Navid, Kosecka Jana
- The Future Of Large Language Model Pre-training Is Federated Sani Lorenzo, Iacob Alex, Cao Zeyu, Marino Bill, Gao Yan, Paulik Tomas, Zhao Wanru, Shen William F., Aleksandrov Preslav, Qiu Xinchi, Lane Nicholas D.
- Scaling Laws For Linear Complexity Language Models Shen Xuyang, Li Dong, Leng Ruitao, Qin Zhen, Sun Weigao, Zhong Yiran
- JORA: JAX Tensor-parallel Lora Library For Retrieval Augmented Fine-tuning Tahir Anique, Cheng Lu, Liu Huan
- Scaling Laws With Vocabulary: Larger Models Deserve Larger Vocabularies Tao Chaofan, Liu Qian, Dou Longxu, Muennighoff Niklas, Wan Zhongwei, Luo Ping, Lin Min, Wong Ngai
- UCCIX: Irish-excellence Large Language Model Tran Khanh-tung, O'sullivan Barry, Nguyen Hoang D.
- Temporal Scaling Law For Large Language Models Xiong Yizhe, Chen Xiansheng, Ye Xin, Chen Hui, Lin Zijia, Lian Haoran, Su Zhenpeng, Niu Jianwei, Ding Guiguang
- Unleashing The Potential Of Large Language Models For Predictive Tabular Tasks In Data Science Yang Yazheng, Wang Yuqi, Sen Sankalok, Li Lei, Liu Qi
- CSRT: Evaluation And Analysis Of Llms Using Code-switching Red-teaming Dataset Yoo Haneul, Yang Yongjin, Lee Hwaran
- Skywork-math: Data Scaling Laws For Mathematical Reasoning In Large Language Models -- The Story Goes On Zeng Liang, Zhong Liangjun, Zhao Liang, Wei Tianwen, Yang Liu, He Jujie, Cheng Cheng, Hu Rui, Liu Yang, Yan Shuicheng, Fang Han, Zhou Yahui
- HARE: Human Priors, A Key To Small Language Model Efficiency Zhang Lingyun, Jin Bin, Ge Gaojian, Liu Lunhui, Shen Xuewen, Wu Mingyong, Zhang Houqian, Jiang Yongneng, Chen Shiqi, Pu Shi
- Scalable Matmul-free Language Modeling Zhu Rui-jie, Zhang Yu, Sifferman Ethan, Sheaves Tyler, Wang Yiqiao, Richmond Dustin, Zhou Peng, Eshraghian Jason K.
🏷 Masked Language Model
- Distilling Knowledge Learned In BERT For Text Generation Chen Yen-chun, Gan Zhe, Cheng Yu, Liu Jingzhou, Liu Jingjing
- Span Selection Pre-training For Question Answering Glass Michael, Gliozzo Alfio, Chakravarti Rishav, Ferritto Anthony, Pan Lin, Bhargav G P Shrivatsa, Garg Dinesh, Sil Avirup
- Unicoder-vl: A Universal Encoder For Vision And Language By Cross-modal Pre-training Li Gen, Duan Nan, Fang Yuejian, Gong Ming, Jiang Daxin, Zhou Ming
- Masked Language Model Scoring Salazar Julian, Liang Davis, Nguyen Toan Q., Kirchhoff Katrin
- LXMERT: Learning Cross-modality Encoder Representations From Transformers Tan Hao, Bansal Mohit
- Segatron: Segment-aware Transformer For Language Modeling And Understanding Bai He, Shi Peng, Lin Jimmy, Xie Yuqing, Tan Luchen, Xiong Kun, Gao Wen, Li Ming
- Unilmv2: Pseudo-masked Language Models For Unified Language Model Pre-training Bao Hangbo, Dong Li, Wei Furu, Wang Wenhui, Yang Nan, Liu Xiaodong, Wang Yu, Piao Songhao, Gao Jianfeng, Zhou Ming, Hon Hsiao-wuen
- Memory Transformer Burtsev Mikhail S., Kuratov Yuri, Peganov Anton, Sapunov Grigory V.
- X-LXMERT: Paint, Caption And Answer Questions With Multi-modal Transformers Cho Jaemin, Lu Jiasen, Schwenk Dustin, Hajishirzi Hannaneh, Kembhavi Aniruddha
- Sound Natural: Content Rephrasing In Dialog Systems Einolghozati Arash, Gupta Anchit, Diedrick Keith, Gupta Sonal
- GMAT: Global Memory Augmentation For Transformers Gupta Ankit, Berant Jonathan
- Current Limitations Of Language Models: What You Need Is Retrieval Komatsuzaki Aran
- An Empirical Investigation Of Pre-trained Transformer Language Models For Open-domain Dialogue Generation Li Piji
- Probabilistically Masked Language Model Capable Of Autoregressive Generation In Arbitrary Word Order Liao Yi, Jiang Xin, Liu Qun
- What Does BERT Know About Books, Movies And Music? Probing BERT For Conversational Recommendation Penha Gustavo, Hauff Claudia
- TOD-BERT: Pre-trained Natural Language Understanding For Task-oriented Dialogue Wu Chien-sheng, Hoi Steven, Socher Richard, Xiong Caiming
- MC-BERT: Efficient Language Pre-training Via A Meta Controller Xu Zhenhui, Gong Linyuan, Ke Guolin, He Di, Zheng Shuxin, Wang Liwei, Bian Jiang, Liu Tie-yan
- Open-domain Dialogue Generation Based On Pre-trained Language Models Zeng Yan, Nie Jian-yun
- Exploring Unsupervised Pretraining Objectives For Machine Translation Baziotis Christos, Titov Ivan, Birch Alexandra, Haddow Barry
- Bridging The Gap Between Language Model And Reading Comprehension: Unsupervised MRC Via Self-supervision Bian Ning, Han Xianpei, Chen Bo, Lin Hongyu, He Ben, Sun Le
- CDLM: Cross-document Language Modeling Caciularu Avi, Cohan Arman, Beltagy Iz, Peters Matthew E., Cattan Arie, Dagan Ido
- Vision-and-language Or Vision-for-language? On Cross-modal Influence In Multimodal Transformers Frank Stella, Bugliarello Emanuele, Elliott Desmond
- Larger-scale Transformers For Multilingual Masked Language Modeling Goyal Naman, Du Jingfei, Ott Myle, Anantharaman Giri, Conneau Alexis
- DS-TOD: Efficient Domain Specialization For Task Oriented Dialog Hung Chia-chien, Lauscher Anne, Ponzetto Simone Paolo, Glavaš Goran
- A Good Prompt Is Worth Millions Of Parameters: Low-resource Prompt-based Learning For Vision-language Models Jin Woojeong, Cheng Yu, Shen Yelong, Chen Weizhu, Ren Xiang
- BERT Busters: Outlier Dimensions That Disrupt Transformers Kovaleva Olga, Kulshreshtha Saurabh, Rogers Anna, Rumshisky Anna
- Unified Multimodal Pre-training And Prompt-based Tuning For Vision-language Understanding And Generation Liu Tianyi, Wu Zuxuan, Xiong Wenhan, Chen Jingjing, Jiang Yu-gang
- Lightweight Cross-lingual Sentence Representation Learning Mao Zhuoyuan, Gupta Prakhar, Wang Pei, Chu Chenhui, Jaggi Martin, Kurohashi Sadao
- Self-training Improves Pre-training For Few-shot Learning In Task-oriented Dialog Systems Mi Fei, Zhou Wanhao, Cai Fengyu, Kong Lingjing, Huang Minlie, Faltings Boi
- ABC: Attention With Bounded-memory Control Peng Hao, Kasai Jungo, Pappas Nikolaos, Yogatama Dani, Wu Zhaofeng, Kong Lingpeng, Schwartz Roy, Smith Noah A.
- Does Pre-training Induce Systematic Inference? How Masked Language Models Acquire Commonsense Knowledge Porada Ian, Sordoni Alessandro, Cheung Jackie Chi Kit
- Winner Team Mia At Textvqa Challenge 2021: Vision-and-language Representation Learning With Pre-trained Sequence-to-sequence Model Qiao Yixuan, Chen Hao, Wang Jun, Chen Yihao, Ye Xianbin, Li Ziliang, Qi Xianbiao, Gao Peng, Xie Guotong
- CPT: A Pre-trained Unbalanced Transformer For Both Chinese Language Understanding And Generation Shao Yunfan, Geng Zhichao, Liu Yitao, Dai Junqi, Yan Hang, Yang Fei, Zhe Li, Bao Hujun, Qiu Xipeng
- Adapting Long Context NLM For ASR Rescoring In Conversational Agents Shenoy Ashish, Bodapati Sravan, Sunkara Monica, Ronanki Srikanth, Kirchhoff Katrin
- Tacl: Improving BERT Pre-training With Token-aware Contrastive Learning Su Yixuan, Liu Fangyu, Meng Zaiqiao, Lan Tian, Shu Lei, Shareghi Ehsan, Collier Nigel
- NSP-BERT: A Prompt-based Few-shot Learner Through An Original Pre-training Task--next Sentence Prediction Sun Yi, Zheng Yu, Hao Chao, Qiu Hangping
- Bertnesia: Investigating The Capture And Forgetting Of Knowledge In BERT Wallat Jonas, Singh Jaspreet, Anand Avishek
- Improving Sequence-to-sequence Pre-training Via Sequence Span Rewriting Zhou Wangchunshu, Ge Tao, Xu Canwen, Xu Ke, Wei Furu
- Global Memory Transformer For Processing Long Documents Adel Arij Al
- On The Calibration Of Massively Multilingual Language Models Ahuja Kabir, Sitaram Sunayana, Dandapat Sandipan, Choudhury Monojit
- How Does The Pre-training Objective Affect What Large Language Models Learn About Linguistic Properties? Alajrami Ahmed, Aletras Nikolaos
- Vl-beit: Generative Vision-language Pretraining Bao Hangbo, Wang Wenhui, Dong Li, Wei Furu
- Cramming: Training A Language Model On A Single GPU In One Day Geiping Jonas, Goldstein Tom
- Microbert: Effective Training Of Low-resource Monolingual Berts Through Parameter Reduction And Multitask Learning Gessler Luke, Zeldes Amir
- Transformer Quality In Linear Time Hua Weizhe, Dai Zihang, Liu Hanxiao, Le Quoc V.
- Layoutlmv3: Pre-training For Document AI With Unified Text And Image Masking Huang Yupan, Lv Tengchao, Cui Lei, Lu Yutong, Wei Furu
- Contrastive Learning For Prompt-based Few-shot Language Learners Jian Yiren, Gao Chongyang, Vosoughi Soroush
- Simple Recurrence Improves Masked Language Models Lei Tao, Tian Ran, Bastings Jasmijn, Parikh Ankur P.
- LAVENDER: Unifying Video-language Understanding As Masked Language Modeling Li Linjie, Gan Zhe, Lin Kevin, Lin Chung-ching, Liu Zicheng, Liu Ce, Wang Lijuan
- On Task-adaptive Pretraining For Dialogue Response Selection Lin Tzu-hsiang, Chi Ta-chung, Rumshisky Anna
- Declaration-based Prompt Tuning For Visual Question Answering Liu Yuhang, Wei Wei, Peng Daowan, Zhu Feida
- PERFECT: Prompt-free And Efficient Few-shot Learning With Language Models Mahabadi Rabeeh Karimi, Zettlemoyer Luke, Henderson James, Saeidi Marzieh, Mathias Lambert, Stoyanov Veselin, Yazdani Majid
- Mini-model Adaptation: Efficiently Extending Pretrained Models To New Languages Via Aligned Shallow Training Marchisio Kelly, Lewis Patrick, Chen Yihong, Artetxe Mikel
- Bidirectional Language Models Are Also Few-shot Learners Patel Ajay, Li Bryan, Rasooli Mohammad Sadegh, Constant Noah, Raffel Colin, Callison-burch Chris
- Beyond English-centric Bitexts For Better Multilingual Language Representation Learning Patra Barun, Singhal Saksham, Huang Shaohan, Chi Zewen, Dong Li, Wei Furu, Chaudhary Vishrav, Song Xia
- APOLLO: A Simple Approach For Adaptive Pretraining Of Language Models For Logical Reasoning Sanyal Soumya, Xu Yichong, Wang Shuohang, Yang Ziyi, Pryzant Reid, Yu Wenhao, Zhu Chenguang, Ren Xiang
- Fewer Errors, But More Stereotypes? The Effect Of Model Size On Gender Bias Tal Yarden, Magar Inbal, Schwartz Roy
- What Language Model Architecture And Pretraining Objective Work Best For Zero-shot Generalization? Wang Thomas, Roberts Adam, Hesslow Daniel, Scao Teven Le, Chung Hyung Won, Beltagy Iz, Launay Julien, Raffel Colin
- Unified Multimodal Model With Unlikelihood Training For Visual Dialog Wang Zihao, Wang Junli, Jiang Changjun
- Go-tuning: Improving Zero-shot Learning Abilities Of Smaller Language Models Xu Jingjing, Dong Qingxiu, Liu Hongyi, Li Lei
- Zero-shot Video Question Answering Via Frozen Bidirectional Language Models Yang Antoine, Miech Antoine, Sivic Josef, Laptev Ivan, Schmid Cordelia
- Better Pre-training By Reducing Representation Confusion Zhang Haojie, Liang Mingfei, Xie Ruobing, Sun Zhenlong, Zhang Bo, Lin Leyu
- Is BERT Blind? Exploring The Effect Of Vision-and-language Pretraining On Visual Language Understanding Alper Morris, Fiman Michael, Averbuch-elor Hadar
- The Belebele Benchmark: A Parallel Reading Comprehension Dataset In 122 Language Variants Bandarkar Lucas, Liang Davis, Muller Benjamin, Artetxe Mikel, Shukla Satya Narayan, Husa Donald, Goyal Naman, Krishnan Abhinandan, Zettlemoyer Luke, Khabsa Madian
- Labrador: Exploring The Limits Of Masked Language Modeling For Laboratory Data Bellamy David R., Kumar Bhawesh, Wang Cindy, Beam Andrew
- Sudden Drops In The Loss: Syntax Acquisition, Phase Transitions, And Simplicity Bias In Mlms Chen Angelica, Shwartz-ziv Ravid, Cho Kyunghyun, Leavitt Matthew L., Saphra Naomi
- Robust Infidelity: When Faithfulness Measures On Masked Language Models Are Misleading Crothers Evan, Viktor Herna, Japkowicz Nathalie
- Chain-of-thought Tuning: Masked Language Models Can Also Think Step By Step In Natural Language Understanding Fan Caoyun, Tian Jidong, Li Yitian, Chen Wenqing, He Hao, Jin Yaohui
- Cerbero-7b: A Leap Forward In Language-specific Llms Through Enhanced Chat Corpus Generation And Evaluation Galatolo Federico A., Cimino Mario G. C. A.
- Lil-bevo: Explorations Of Strategies For Training Language Models In More Humanlike Ways Govindarajan Venkata S, Rodriguez Juan Diego, Bostrom Kaj, Mahowald Kyle
- M3dbench: Let's Instruct Large Models With Multi-modal 3D Prompts Li Mingsheng, Chen Xin, Zhang Chi, Chen Sijin, Zhu Hongyuan, Yin Fukun, Yu Gang, Chen Tao
- Bt-adapter: Video Conversation Is Feasible Without Video Instruction Tuning Liu Ruyang, Li Chen, Ge Yixiao, Shan Ying, Li Thomas H., Li Ge
- Representation Deficiency In Masked Language Modeling Meng Yu, Krishnan Jitin, Wang Sinong, Wang Qifan, Mao Yuning, Fang Han, Ghazvininejad Marjan, Han Jiawei, Zettlemoyer Luke
- Increasing The Performance Of Cognitively Inspired Data-efficient Language Models Via Implicit Structure Building Momen Omar, Arps David, Kallmeyer Laura
- Evaluation Of Medium-large Language Models At Zero-shot Closed Book Generative Question Answering Peinl René, Wirth Johannes
- Mosaicbert: A Bidirectional Encoder Optimized For Fast Pretraining Portes Jacob, Trott Alex, Havens Sam, King Daniel, Venigalla Abhinav, Nadeem Moin, Sardana Nikhil, Khudia Daya, Frankle Jonathan
- Mini Minds: Exploring Bebeshka And Zlata Baby Models Proskurina Irina, Metzler Guillaume, Velcin Julien
- Improving Non-autoregressive Translation Quality With Pretrained Language Model, Embedding Distillation And Upsampling Strategy For CTC Syu Shen-sian, Xie Juncheng, Lee Hung-yi
- Automating Code-related Tasks Through Transformers: The Impact Of Pre-training Tufano Rosalia, Pascarella Luca, Bavota Gabriele
- Diffusion Language Models Can Perform Many Tasks With Scaling And Instruction-finetuning Ye Jiasheng, Zheng Zaixiang, Bao Yu, Qian Lihua, Gu Quanquan
- Weighted Sampling For Masked Language Modeling Zhang Linhan, Chen Qian, Wang Wen, Deng Chong, Cao Xin, Hao Kongzhang, Jiang Yuxin, Wang Wei
- Revisiting Token Dropping Strategy In Efficient BERT Pretraining Zhong Qihuang, Ding Liang, Liu Juhua, Liu Xuebo, Zhang Min, Du Bo, Tao Dacheng
- The Curious Case Of Nonverbal Abstract Reasoning With Multi-modal Large Language Models Ahrabian Kian, Sourati Zhivar, Sun Kexuan, Zhang Jiarui, Jiang Yifan, Morstatter Fred, Pujara Jay
- Low-cost Generation And Evaluation Of Dictionary Example Sentences Cai Bill, Ng Clarence Boon Liang, Tan Daniel, Hotama Shelvia
- Enabling Natural Zero-shot Prompting On Encoder Models Via Statement-tuning Elshabrawy Ahmed, Huang Yongxin, Gurevych Iryna, Aji Alham Fikri
- Multi-frame, Lightweight & Efficient Vision-language Models For Question Answering In Autonomous Driving Gopalkrishnan Akshay, Greer Ross, Trivedi Mohan
- N-gram Prediction And Word Difference Representations For Language Modeling Heo Dongnyeong, Rim Daniela Noemi, Choi Heeyoul
- Counterfactuals As A Means For Evaluating Faithfulness Of Attribution Methods In Autoregressive Language Models Kamahi Sepehr, Yaghoobzadeh Yadollah
- Exploration Of Masked And Causal Language Modelling For Text Generation Micheletti Nicolo, Belkadi Samuel, Han Lifeng, Nenadic Goran
- What Drives Performance In Multilingual Language Models? Nezhad Sina Bagheri, Agrawal Ameeta
- Discrete Multimodal Transformers With A Pretrained Large Language Model For Mixed-supervision Speech Processing Trinh Viet Anh, Southwell Rosy, Guan Yiwen, He Xinlu, Wang Zhiyong, Whitehill Jacob
- VBART: The Turkish LLM Turker Meliksah, Ari Mehmet Erdi, Han Aydin
- Omnijarvis: Unified Vision-language-action Tokenization Enables Open-world Instruction Following Agents Wang Zihao, Cai Shaofei, Mu Zhancun, Lin Haowei, Zhang Ceyao, Liu Xuejie, Li Qing, Liu Anji, Ma Xiaojian, Liang Yitao
- Making The Most Of Your Model: Methods For Finetuning And Applying Pretrained Transformers Yoshida Davis
🏷 Merging
- Visual Dialog Das Abhishek, Kottur Satwik, Gupta Khushi, Singh Avi, Yadav Deshraj, Moura José M. F., Parikh Devi, Batra Dhruv
- Character-level Question Answering With Attention Golub David, He Xiaodong
- Temporal Attention Model For Neural Machine Translation Sankaran Baskaran, Mi Haitao, Al-onaizan Yaser, Ittycheriah Abe
- Attention Is All You Need Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, Illia Polosukhin
- Ruminating Reader: Reasoning With Gated Multi-hop Attention Gong Yichen, Bowman Samuel R.
- Task-driven Visual Saliency And Attention-based Visual Question Answering Lin Yuetan, Pang Zhangyang, Wang Donghui, Zhuang Yueting
- Phase Conductor On Multi-layered Attentions For Machine Comprehension Liu Rui, Wei Wei, Mao Weiguang, Chikina Maria
- Attention Based Natural Language Grounding By Navigating Virtual Environment B Akilesh, Sinha Abhishek, Sarkar Mausoom, Krishnamurthy Balaji
- Reciprocal Attention Fusion For Visual Question Answering Farazi Moshiur R, Khan Salman H
- Question-guided Hybrid Convolution For Visual Question Answering Gao Peng, Lu Pan, Li Hongsheng, Li Shuang, Li Yikang, Hoi Steven, Wang Xiaogang
- Transfer Learning Of Language-independent End-to-end ASR With Language Model Fusion Inaguma Hirofumi, Cho Jaejin, Baskar Murali Karthick, Kawahara Tatsuya, Watanabe Shinji
- Improved Fusion Of Visual And Language Representations By Dense Symmetric Co-attention For Visual Question Answering Nguyen Duy-kien, Okatani Takayuki
- Polite Dialogue Generation Without Parallel Data Niu Tong, Bansal Mohit
- Dynamic Fusion With Intra- And Inter- Modality Attention Flow For Visual Question Answering Peng Gao, Jiang Zhengkai, You Haoxuan, Lu Pan, Hoi Steven, Wang Xiaogang, Li Hongsheng
- The University Of Cambridge's Machine Translation Systems For WMT18 Stahlberg Felix, De Gispert Adrià, Byrne Bill
- Improved Training Of End-to-end Attention Models For Speech Recognition Zeyer Albert, Irie Kazuki, Schlüter Ralf, Ney Hermann
- Fusion Of Detected Objects In Text For Visual Question Answering Alberti Chris, Ling Jeffrey, Collins Michael, Reitter David
- Learning Multi-level Information For Dialogue Response Selection By Highway Recurrent Transformer Chiang Ting-rui, Huang Chao-wei, Su Shang-yu, Chen Yun-nung
- Heterogeneous Memory Enhanced Multimodal Attention Model For Video Question Answering Fan Chenyou, Zhang Xiaofan, Zhang Shu, Wang Wensheng, Zhang Chi, Huang Heng
- Question-agnostic Attention For Visual Question Answering Farazi Moshiur R, Khan Salman H, Barnes Nick
- Iterative Answer Prediction With Pointer-augmented Multimodal Transformers For Textvqa Hu Ronghang, Singh Amanpreet, Darrell Trevor, Rohrbach Marcus
- Rap-net: Recurrent Attention Pooling Networks For Dialogue Response Selection Huang Chao-wei, Chiang Ting-rui, Su Shang-yu, Chen Yun-nung
- Explicit Sentence Compression For Neural Machine Translation Li Zuchao, Wang Rui, Chen Kehai, Utiyama Masao, Sumita Eiichiro, Zhang Zhuosheng, Zhao Hai
- Improving Question Generation With Sentence-level Semantic Matching And Answer Position Inferring Ma Xiyao, Zhu Qile, Zhou Yanlin, Li Xiaolin, Wu Dapeng
- On Leveraging The Visual Modality For Neural Machine Translation Raunak Vikas, Choe Sang Keun, Lu Quanyang, Xu Yi, Metze Florian
- Leveraging Pre-trained Checkpoints For Sequence Generation Tasks Rothe Sascha, Narayan Shashi, Severyn Aliaksei
- Co-attentional Transformers For Story-based Video Understanding Bebensee Björn, Zhang Byoung-tak
- Pretrained Language Models For Dialogue Generation With Multiple Input Sources Cao Yu, Bi Wei, Fang Meng, Tao Dacheng
- Relevance Transformer: Generating Concise Code Snippets With Relevance Feedback Gemmell Carlos, Rossetto Federico, Dalton Jeffrey
- Memory Attentive Fusion: External Language Model Integration For Transformer-based Sequence-to-sequence Model Ihori Mana, Masumura Ryo, Makishima Naoki, Tanaka Tomohiro, Takashima Akihiko, Orihashi Shota
- Probing Pretrained Language Models For Lexical Semantics Ivan Vulić, Edoardo Maria Ponti, Robert Litschko, Goran Glavaš, Anna Korhonen
- Meta-context Transformers For Domain-specific Response Generation Kar Debanjana, Samanta Suranjana, Azad Amar Prakash
- Fid-ex: Improving Sequence-to-sequence Models For Extractive Rationale Generation Lakhotia Kushal, Paranjape Bhargavi, Ghoshal Asish, Yih Wen-tau, Mehdad Yashar, Iyer Srinivasan
- Avgout: A Simple Output-probability Measure To Eliminate Dull Responses Niu Tong, Bansal Mohit
- Exploring Recurrent, Memory And Attention Based Architectures For Scoring Interactional Aspects Of Human-machine Text Dialog Ramanarayanan Vikram, Mulholland Matthew, Ghosh Debanjan
- Multi-hop Question Generation With Graph Convolutional Network Su Dan, Xu Yan, Dai Wenliang, Ji Ziwei, Yu Tiezheng, Fung Pascale
- VD-BERT: A Unified Vision And Dialog Transformer With BERT Wang Yue, Joty Shafiq, Lyu Michael R., King Irwin, Xiong Caiming, Hoi Steven C. H.
- Seqdialn: Sequential Visual Dialog Networks In Joint Visual-linguistic Representation Space Yang Liu
- An Empirical Study Of Training End-to-end Vision-and-language Transformers Dou Zi-yi, Xu Yichong, Gan Zhe, Wang Jianfeng, Wang Shuohang, Wang Lijuan, Zhu Chenguang, Zhang Pengchuan, Yuan Lu, Peng Nanyun, Liu Zicheng, Zeng Michael
- Towards Trustworthy Deception Detection: Benchmarking Model Robustness Across Domains, Modalities, And Languages Glenski Maria, Ayton Ellyn, Cosbey Robin, Arendt Dustin, Volkova Svitlana
- VX2TEXT: End-to-end Learning Of Video-based Text Generation From Multimodal Inputs Lin Xudong, Bertasius Gedas, Wang Jue, Chang Shih-fu, Parikh Devi, Torresani Lorenzo
- MLP Architectures For Vision-and-language Modeling: An Empirical Study Nie Yixin, Li Linjie, Gan Zhe, Wang Shuohang, Zhu Chenguang, Zeng Michael, Liu Zicheng, Bansal Mohit, Wang Lijuan
- Winner Team Mia At Textvqa Challenge 2021: Vision-and-language Representation Learning With Pre-trained Sequence-to-sequence Model Qiao Yixuan, Chen Hao, Wang Jun, Chen Yihao, Ye Xianbin, Li Ziliang, Qi Xianbiao, Gao Peng, Xie Guotong
- Audio-visual Scene-aware Dialog And Reasoning Using Audio-visual Transformers With Joint Student-teacher Learning Shah Ankit P., Geng Shijie, Gao Peng, Cherian Anoop, Hori Takaaki, Marks Tim K., Roux Jonathan Le, Hori Chiori
- Adapting Long Context NLM For ASR Rescoring In Conversational Agents Shenoy Ashish, Bodapati Sravan, Sunkara Monica, Ronanki Srikanth, Kirchhoff Katrin
- NVIDIA Nemo Neural Machine Translation Systems For English-german And English-russian News And Biomedical Tasks At WMT21 Subramanian Sandeep, Hrinchuk Oleksii, Adams Virginia, Kuchaiev Oleksii
- Learning Better Visual Dialog Agents With Pretrained Visual-linguistic Representation Tu Tao, Ping Qing, Thattai Govind, Tur Gokhan, Natarajan Prem
- Distilled Dual-encoder Model For Vision-language Understanding Wang Zekun, Wang Wenhui, Zhu Haichao, Liu Ming, Qin Bing, Wei Furu
- Simple Local Attentions Remain Competitive For Long-context Tasks Xiong Wenhan, Oğuz Barlas, Gupta Anchit, Chen Xilun, Liskovich Diana, Levy Omer, Yih Wen-tau, Mehdad Yashar
- Moca: Incorporating Multi-stage Domain Pretraining And Cross-guided Multimodal Attention For Textbook Question Answering Xu Fangzhi, Lin Qika, Liu Jun, Zhang Lingling, Zhao Tianzhe, Chai Qi, Pan Yudai
- VLM: Task-agnostic Video-language Model Pre-training For Video Understanding Xu Hu, Ghosh Gargi, Huang Po-yao, Arora Prahal, Aminzadeh Masoumeh, Feichtenhofer Christoph, Metze Florian, Zettlemoyer Luke
- Dissecting Generation Modes For Abstractive Summarization Models Via Ablation And Attribution Xu Jiacheng, Durrett Greg
- Compound Tokens: Channel Fusion For Vision-language Representation Learning Aladago Maxwell Mbabilla, Piergiovanni Aj
- The Myth Of Culturally Agnostic AI Models Cetinic Eva
- Decoder Tuning: Efficient Language Understanding As Decoding Cui Ganqu, Li Wentao, Ding Ning, Huang Longtao, Liu Zhiyuan, Sun Maosong
- Fido: Fusion-in-decoder Optimized For Stronger Performance And Faster Inference De Jong Michiel, Zemlyanskiy Yury, Ainslie Joshua, Fitzgerald Nicholas, Sanghai Sumit, Sha Fei, Cohen William
- Coarse-to-fine Vision-language Pre-training With Fusion In The Backbone Dou Zi-yi, Kamath Aishwarya, Gan Zhe, Zhang Pengchuan, Wang Jianfeng, Li Linjie, Liu Zicheng, Liu Ce, Lecun Yann, Peng Nanyun, Gao Jianfeng, Wang Lijuan
- Should Attention Be All We Need? The Epistemic And Ethical Implications Of Unification In Machine Learning Fishman Nic, Hancox-li Leif
- GPTQ: Accurate Post-training Quantization For Generative Pre-trained Transformers Frantar Elias, Ashkboos Saleh, Hoefler Torsten, Alistarh Dan
- Re2g: Retrieve, Rerank, Generate Glass Michael, Rossiello Gaetano, Chowdhury Md Faisal Mahbub, Naik Ankita Rajaram, Cai Pengshan, Gliozzo Alfio
- Diffuseq: Sequence To Sequence Text Generation With Diffusion Models Gong Shansan, Li Mukai, Feng Jiangtao, Wu Zhiyong, Kong Lingpeng
- Thinking Fast And Slow In Large Language Models Hagendorff Thilo, Fabi Sarah, Kosinski Michal
- SSD-LM: Semi-autoregressive Simplex-based Diffusion Language Model For Text Generation And Modular Control Han Xiaochuang, Kumar Sachin, Tsvetkov Yulia
- Z-code++: A Pre-trained Language Model Optimized For Abstractive Summarization He Pengcheng, Peng Baolin, Lu Liyang, Wang Song, Mei Jie, Liu Yang, Xu Ruochen, Awadalla Hany Hassan, Shi Yu, Zhu Chenguang, Xiong Wayne, Zeng Michael, Gao Jianfeng, Huang Xuedong
- Bundle MCR: Towards Conversational Bundle Recommendation He Zhankui, Zhao Handong, Yu Tong, Kim Sungchul, Du Fan, Mcauley Julian
- Unified Discrete Diffusion For Simultaneous Vision-language Generation Hu Minghui, Zheng Chuanxia, Zheng Heliang, Cham Tat-jen, Wang Chaoyue, Yang Zuopeng, Tao Dacheng, Suganthan Ponnuthurai N.
- Detecting Unintended Memorization In Language-model-fused ASR Huang W. Ronny, Chien Steve, Thakkar Om, Mathews Rajiv
- MCP: Self-supervised Pre-training For Personalized Chatbots With Multi-level Contrastive Sampling Huang Zhaoheng, Dou Zhicheng, Zhu Yutao, Ma Zhengyi
- Efficient Long-text Understanding With Short-text Models Ivgi Maor, Shaham Uri, Berant Jonathan
- Fie: Building A Global Probability Space By Leveraging Early Fusion In Encoder For Open-domain Question Answering Kedia Akhil, Zaidi Mohd Abbas, Lee Haejun
- Uni-eden: Universal Encoder-decoder Network By Multi-granular Vision-language Pre-training Li Yehao, Fan Jiahao, Pan Yingwei, Yao Ting, Lin Weiyao, Mei Tao
- Using Large Language Models To Generate Engaging Captions For Data Visualizations Liew Ashley, Mueller Klaus
- A Simple Meta-learning Paradigm For Zero-shot Intent Classification With Mixture Attention Mechanism Liu Han, Zhao Siyang, Zhang Xiaotong, Zhang Feng, Sun Junjie, Yu Hong, Zhang Xianchao
- Fl-tuning: Layer Tuning For Feed-forward Network In Transformer Liu Jingping, Song Yuqiu, Xue Kui, Sun Hongli, Wang Chao, Chen Lihan, Jiang Haiyun, Liang Jiaqing, Ruan Tong
- A Frustratingly Simple Approach For End-to-end Image Captioning Luo Ziyang, Xi Yadong, Zhang Rongsheng, Ma Jing
- Edit5: Semi-autoregressive Text-editing With T5 Warm-start Mallinson Jonathan, Adamek Jakub, Malmi Eric, Severyn Aliaksei
- SKILL: Structured Knowledge Infusion For Large Language Models Moiseev Fedor, Dong Zhe, Alfonseca Enrique, Jaggi Martin
- Ignore Previous Prompt: Attack Techniques For Language Models Perez Fábio, Ribeiro Ian
- Tegformer: Topic-to-essay Generation With Good Topic Coverage And High Text Coherence Qi Wang, Liu Rui, Zuo Yuan, Chen Yong, Zhang Dell
- Diffusion Glancing Transformer For Parallel Sequence To Sequence Learning Qian Lihua, Wang Mingxuan, Liu Yang, Zhou Hao
- ELLE: Efficient Lifelong Pre-training For Emerging Data Qin Yujia, Zhang Jiajie, Lin Yankai, Liu Zhiyuan, Li Peng, Sun Maosong, Zhou Jie
- Billions Of Parameters Are Worth More Than In-domain Training Data: A Case Study In The Legal Case Entailment Task Rosa Guilherme Moraes, Bonifacio Luiz, Jeronymo Vitor, Abonizio Hugo, Lotufo Roberto, Nogueira Rodrigo
- Reasoning With Language Model Prompting: A Survey Shuofei Qiao, Yixin Ou, Ningyu Zhang, Xiang Chen, Yunzhi Yao, Shumin Deng, Chuanqi Tan, Fei Huang, Huajun Chen
- Benchmarking Large Language Models For Automated Verilog RTL Code Generation Thakur Shailja, Ahmad Baleegh, Fan Zhenxing, Pearce Hammond, Tan Benjamin, Karri Ramesh, Dolan-gavitt Brendan, Garg Siddharth
- VLSP 2021 - Vimrc Challenge: Vietnamese Machine Reading Comprehension Van Nguyen Kiet, Tran Son Quoc, Nguyen Luan Thanh, Van Huynh Tin, Luu Son T., Nguyen Ngan Luu-thuy
- Visually-augmented Language Modeling Wang Weizhi, Dong Li, Cheng Hao, Song Haoyu, Liu Xiaodong, Yan Xifeng, Gao Jianfeng, Wei Furu
- Image As A Foreign Language: Beit Pretraining For All Vision And Vision-language Tasks Wang Wenhui, Bao Hangbo, Dong Li, Bjorck Johan, Peng Zhiliang, Liu Qiang, Aggarwal Kriti, Mohammed Owais Khan, Singhal Saksham, Som Subhojit, Wei Furu
- MGA-VQA: Multi-granularity Alignment For Visual Question Answering Xiong Peixi, Shen Yilin, Jin Hongxia
- Bridgetower: Building Bridges Between Encoders In Vision-language Representation Learning Xu Xiao, Wu Chenfei, Rosenman Shachar, Lal Vasudev, Che Wanxiang, Duan Nan
- Rationale-augmented Ensembles In Language Models Xuezhi Wang, Jason Wei, Dale Schuurmans, Quoc Le, Ed Chi, Denny Zhou
- Modeling Multi-hop Question Answering As Single Sequence Prediction Yavuz Semih, Hashimoto Kazuma, Zhou Yingbo, Keskar Nitish Shirish, Xiong Caiming
- Active Example Selection For In-context Learning Yiming Zhang, Shi Feng, Chenhao Tan
- Better Pre-training By Reducing Representation Confusion Zhang Haojie, Liang Mingfei, Xie Ruobing, Sun Zhenlong, Zhang Bo, Lin Leyu
- Tacube: Pre-computing Data Cubes For Answering Numerical-reasoning Questions Over Tabular Data Zhou Fan, Hu Mengkang, Dong Haoyu, Cheng Zhoujun, Han Shi, Zhang Dongmei
- KITLM: Domain-specific Knowledge Integration Into Language Models For Question Answering Agarwal Ankush, Gawade Sakharam, Azad Amar Prakash, Bhattacharyya Pushpak
- The Emergence Of Essential Sparsity In Large Pre-trained Models: The Weights That Matter Ajay Jaiswal, Shiwei Liu, Tianlong Chen, Zhangyang Wang
- Chatgpt And Beyond: The Generative AI Revolution In Education Al-smadi Mohammad
- Context Tuning For Retrieval Augmented Generation Anantha Raviteja, Bethi Tharun, Vodianik Danil, Chappidi Srinivas
- Expel: LLM Agents Are Experiential Learners Andrew Zhao, Daniel Huang, Quentin Xu, Matthieu Lin, Yong-jin Liu, Gao Huang
- Augmenting Llms With Knowledge: A Survey On Hallucination Prevention Andriopoulos Konstantinos, Pouwelse Johan
- Detecting Natural Language Biases With Prompt-based Learning Aowal Md Abdul, Islam Maliha T, Mammen Priyanka Mary, Shetty Sandesh
- Sequence-to-sequence Spanish Pre-trained Language Models Araujo Vladimir, Trusca Maria Mihaela, Tufiño Rodrigo, Moens Marie-francine
- Separate-and-enhance: Compositional Finetuning For Text2image Diffusion Models Bao Zhipeng, Li Yijun, Singh Krishna Kumar, Wang Yu-xiong, Hebert Martial
- Fine-grained Affective Processing Capabilities Emerging From Large Language Models Broekens Joost, Hilpert Bernhard, Verberne Suzan, Baraka Kim, Gebhard Patrick, Plaat Aske
- Insights Into Classifying And Mitigating Llms' Hallucinations Bruno Alessandro, Mazzeo Pier Luigi, Chetouani Aladine, Tliba Marouane, Kerkouri Mohamed Amine
- Development Of The Chatgpt, Generative Artificial Intelligence And Natural Large Language Models For Accountable Reporting And Use (CANGARU) Guidelines Cacciamani Giovanni E., Eppler Michael B., Ganjavi Conner, Pekan Asli, Biedermann Brett, Collins Gary S., Gill Inderbir S.
- Scaling In-context Demonstrations With Structured Attention Cai Tianle, Huang Kaixuan, Lee Jason D., Wang Mengdi
- Pumer: Pruning And Merging Tokens For Efficient Vision Language Models Cao Qingqing, Paranjape Bhargavi, Hajishirzi Hannaneh
- Knowledge Transfer From High-resource To Low-resource Programming Languages For Code Llms Cassano Federico, Gouwar John, Lucchetti Francesca, Schlesinger Claire, Freeman Anders, Anderson Carolyn Jane, Feldman Molly Q, Greenberg Michael, Jangda Abhinav, Guha Arjun
- Creativity Support In The Age Of Large Language Models: An Empirical Study Involving Emerging Writers Chakrabarty Tuhin, Padmakumar Vishakh, Brahman Faeze, Muresan Smaranda
- Muse: Text-to-image Generation Via Masked Generative Transformers Chang Huiwen, Zhang Han, Barber Jarred, Maschinot Aj, Lezama Jose, Jiang Lu, Yang Ming-hsuan, Murphy Kevin, Freeman William T., Rubinstein Michael, Li Yuanzhen, Krishnan Dilip
- Sudden Drops In The Loss: Syntax Acquisition, Phase Transitions, And Simplicity Bias In Mlms Chen Angelica, Shwartz-ziv Ravid, Cho Kyunghyun, Leavitt Matthew L., Saphra Naomi
- Unlearn What You Want To Forget: Efficient Unlearning For Llms Chen Jiaao, Yang Diyi
- Reason Out Your Layout: Evoking The Layout Master From Large Language Models For Text-to-image Synthesis Chen Xiaohui, Liu Yongfei, Yang Yingxiang, Yuan Jianbo, You Quanzeng, Liu Li-ping, Yang Hongxia
- Medprompt: Cross-modal Prompting For Multi-task Medical Image Translation Chen Xuhang, Pun Chi-man, Wang Shuqiang
- Translation And Fusion Improves Zero-shot Cross-lingual Information Extraction Chen Yang, Shah Vedaant, Ritter Alan
- Visual Chatgpt: Talking, Drawing And Editing With Visual Foundation Models Chenfei Wu, Shengming Yin, Weizhen Qi, Xiaodong Wang, Zecheng Tang, Nan Duan
- Tutorials On Stance Detection Using Pre-trained Language Models: Fine-tuning BERT And Prompting Large Language Models Chuang Yun-shiuan
- Strong And Efficient Baselines For Open Domain Conversational Question Answering Coman Andrei C., Barlacchi Gianni, De Gispert Adrià
- Receive, Reason, And React: Drive As You Say With Large Language Models In Autonomous Vehicles Cui Can, Ma Yunsheng, Cao Xu, Ye Wenqian, Wang Ziran
- Llm-in-the-loop: Leveraging Large Language Model For Thematic Analysis Dai Shih-chieh, Xiong Aiping, Ku Lun-wei
- Mask The Bias: Improving Domain-adaptive Generalization Of Ctc-based ASR With Internal Language Model Estimation Das Nilaksh, Sunkara Monica, Bodapati Sravan, Cai Jinglun, Kulshreshtha Devang, Farris Jeff, Kirchhoff Katrin
- Pre-computed Memory Or On-the-fly Encoding? A Hybrid Approach To Retrieval Augmentation Makes The Most Of Your Compute De Jong Michiel, Zemlyanskiy Yury, Fitzgerald Nicholas, Ainslie Joshua, Sanghai Sumit, Sha Fei, Cohen William
- Can Chatgpt Read Who You Are? Derner Erik, Kučera Dalibor, Oliver Nuria, Zahálka Jan
- Minigpt-4: Enhancing Vision-language Understanding With Advanced Large Language Models Deyao Zhu, Jun Chen, Xiaoqian Shen, Xiang Li, Mohamed Elhoseiny
- A Comprehensive Evaluation Of Neural SPARQL Query Generation From Natural Language Questions Diallo Papa Abdou Karim Karou, Reyd Samuel, Zouaq Amal
- The Philosopher's Stone: Trojaning Plugins Of Large Language Models Dong Tian, Xue Minhui, Chen Guoxing, Holland Rayne, Meng Yan, Li Shaofeng, Liu Zhen, Zhu Haojin
- LLM Blueprint: Enabling Text-to-image Generation With Complex And Detailed Prompts Gani Hanan, Bhat Shariq Farooq, Naseer Muzammal, Khan Salman, Wonka Peter
- Llama-adapter V2: Parameter-efficient Visual Instruction Model Gao Peng, Han Jiaming, Zhang Renrui, Lin Ziyi, Geng Shijie, Zhou Aojun, Zhang Wei, Lu Pan, He Conghui, Yue Xiangyu, Li Hongsheng, Qiao Yu
- Hiclip: Contrastive Language-image Pretraining With Hierarchy-aware Attention Geng Shijie, Yuan Jianbo, Tian Yu, Chen Yuxiao, Zhang Yongfeng
- Text-to-audio Generation Using Instruction-tuned LLM And Latent Diffusion Model Ghosal Deepanway, Majumder Navonil, Mehrish Ambuj, Poria Soujanya
- Transformative Effects Of Chatgpt On Modern Education: Emerging Era Of AI Chatbots Gill Sukhpal Singh, Xu Minxian, Patros Panos, Wu Huaming, Kaur Rupinder, Kaur Kamalpreet, Fuller Stephanie, Singh Manmeet, Arora Priyansh, Parlikad Ajith Kumar, Stankovski Vlado, Abraham Ajith, Ghosh Soumya K., Lutfiyya Hanan, Kanhere Salil S., Bahsoon Rami, Rana Omer, Dustdar Schahram, Sakellariou Rizos, Uhlig Steve, Buyya Rajkumar
- Generative AI Beyond Llms: System Implications Of Multi-modal Generation Golden Alicia, Hsia Samuel, Sun Fei, Acun Bilge, Hosmer Basil, Lee Yejin, Devito Zachary, Johnson Jeff, Wei Gu-yeon, Brooks David, Wu Carole-jean
- Improving Input-label Mapping With Demonstration Replay For In-context Learning Gong Zhuocheng, Liu Jiahao, Wang Qifan, Wang Jingang, Cai Xunliang, Zhao Dongyan, Yan Rui
- Chatgpt Is Not All You Need. A State Of The Art Review Of Large Generative AI Models Gozalo-brizuela Roberto, Garrido-merchan Eduardo C.
- Not What You've Signed Up For: Compromising Real-world Llm-integrated Applications With Indirect Prompt Injection Greshake Kai, Abdelnabi Sahar, Mishra Shailesh, Endres Christoph, Holz Thorsten, Fritz Mario
- Seer: Language Instructed Video Prediction With Latent Diffusion Models Gu Xianfan, Wen Chuan, Ye Weirui, Song Jiaming, Gao Yang
- The False Promise Of Imitating Proprietary Llms Gudibande Arnav, Wallace Eric, Snell Charlie, Geng Xinyang, Liu Hao, Abbeel Pieter, Levine Sergey, Song Dawn
- Instruction Fusion: Advancing Prompt Evolution Through Hybridization Guo Weidong, Yang Jiuding, Yang Kaitong, Li Xiangyang, Rao Zhuwei, Xu Yu, Niu Di
- ELIP: Efficient Language-image Pre-training With Fewer Vision Tokens Guo Yangyang, Zhang Haoyu, Wong Yongkang, Nie Liqiang, Kankanhalli Mohan
- Gemrec: Towards Generative Model Recommendation Guo Yuanhe, Liu Haoming, Wen Hongyi
- Testing LLM Performance On The Physics GRE: Some Observations Gupta Pranav
- Scaling Up And Distilling Down: Language-guided Robot Skill Acquisition Ha Huy, Florence Pete, Song Shuran
- Regulating Chatgpt And Other Large Generative AI Models Hacker Philipp, Engel Andreas, Mauer Marco
- Human-like Intuitive Behavior And Reasoning Biases Emerged In Language Models -- And Disappeared In GPT-4 Hagendorff Thilo, Fabi Sarah
- David Helps Goliath: Inference-time Collaboration Between Small Specialized And Large General Diffusion Lms Han Xiaochuang, Kumar Sachin, Tsvetkov Yulia, Ghazvininejad Marjan
- Blockwise Parallel Transformer For Large Context Models Hao Liu, Pieter Abbeel
- Vico: Plug-and-play Visual Condition For Personalized Text-to-image Generation Hao Shaozhe, Han Kai, Zhao Shihao, Wong Kwan-yee K.
- Wordart Designer: User-driven Artistic Typography Synthesis Using Large Language Models He Jun-yan, Cheng Zhi-qi, Li Chenyang, Sun Jingdong, Xiang Wangmeng, Lin Xianhui, Kang Xiaoyang, Jin Zengke, Hu Yusen, Luo Bin, Geng Yifeng, Xie Xuansong, Zhou Jingren
- Multi-modal Latent Space Learning For Chain-of-thought Reasoning In Language Models He Liqi, Li Zuchao, Cai Xiantao, Wang Ping
- Direct2v: Large Language Models Are Frame-level Directors For Zero-shot Text-to-video Generation Hong Susung, Seo Junyoung, Shin Heeseong, Hong Sunghwan, Kim Seungryong
- Massively Multilingual Shallow Fusion With Large Language Models Hu Ke, Sainath Tara N., Li Bo, Du Nan, Huang Yanping, Dai Andrew M., Zhang Yu, Cabrera Rodrigo, Chen Zhifeng, Strohman Trevor
- Code Prompting: A Neural Symbolic Method For Complex Reasoning In Large Language Models Hu Yi, Yang Haotong, Lin Zhouchen, Zhang Muhan
- Free-bloom: Zero-shot Text-to-video Generator With LLM Director And LDM Animator Huang Hanzhuo, Feng Yufan, Shi Cheng, Xu Lan, Yu Jingyi, Yang Sibei
- Adapting Pre-trained Language Models To Vision-language Tasks Via Dynamic Visual Prompting Huang Shubin, Wu Qiong, Zhou Yiyi, Chen Weijie, Zhang Rongsheng, Sun Xiaoshuai, Ji Rongrong
- Smartedit: Exploring Complex Instruction-based Image Editing With Multimodal Large Language Models Huang Yuzhou, Xie Liangbin, Wang Xintao, Yuan Ziyang, Cun Xiaodong, Ge Yixiao, Zhou Jiantao, Dong Chao, Huang Rui, Zhang Ruimao, Shan Ying
- Personalized Soups: Personalized Large Language Model Alignment Via Post-hoc Parameter Merging Jang Joel, Kim Seungone, Lin Bill Yuchen, Wang Yizhong, Hessel Jack, Zettlemoyer Luke, Hajishirzi Hannaneh, Choi Yejin, Ammanabrolu Prithviraj
- Expedited Training Of Visual Conditioned Language Generation Via Redundancy Reduction Jian Yiren, Liu Tingkai, Tao Yunzhe, Zhang Chunhui, Vosoughi Soroush, Yang Hongxia
- Llm-blender: Ensembling Large Language Models With Pairwise Ranking And Generative Fusion Jiang Dongfu, Ren Xiang, Lin Bill Yuchen
- From CLIP To DINO: Visual Encoders Shout In Multi-modal Large Language Models Jiang Dongsheng, Liu Yuchen, Liu Songlin, Zhao Jin'e, Zhang Hao, Gao Zhen, Zhang Xiaopeng, Li Jin, Xiong Hongkai
- Personallm: Investigating The Ability Of Large Language Models To Express Personality Traits Jiang Hang, Zhang Xiajie, Cao Xubo, Breazeal Cynthia, Roy Deb, Kabbara Jad
- BLIP-2: Bootstrapping Language-image Pre-training With Frozen Image Encoders And Large Language Models Junnan Li, Dongxu Li, Silvio Savarese, Steven Hoi
- Robust Recommender System: A Survey And Future Directions Kaike Zhang, Qi Cao, Fei Sun, Yunfan Wu, Shuchang Tao, Huawei Shen, Xueqi Cheng
- Appropriateness Is All You Need! Kempt Hendrik, Lavie Alon, Nagel Saskia K.
- Zzzgpt: An Interactive GPT Approach To Enhance Sleep Quality Khaokaew Yonchanok, Ji Kaixin, Nguyen Thuc Hanh, Kegalle Hiruni, Alaofi Marwah, Xue Hao, Salim Flora D.
- Certifying LLM Safety Against Adversarial Prompting Kumar Aounon, Agarwal Chirag, Srinivas Suraj, Li Aaron Jiaxun, Feizi Soheil, Lakkaraju Himabindu
- Mini-dalle3: Interactive Text To Image By Prompting Large Language Models Lai Zeqiang, Zhu Xizhou, Dai Jifeng, Qiao Yu, Wang Wenhai
- Platypus: Quick, Cheap, And Powerful Refinement Of Llms Lee Ariel N., Hunter Cole J., Ruiz Nataniel
- Is Attention Required For ICL? Exploring The Relationship Between Model Architecture And In-context Learning Ability Lee Ivan, Jiang Nan, Berg-kirkpatrick Taylor
- Large Language Models For Generative Recommendation: A Survey And Visionary Discussions Lei Li, Yongfeng Zhang, Dugang Liu, Li Chen
- Teach Llms To Personalize -- An Approach Inspired By Writing Education Li Cheng, Zhang Mingyang, Mei Qiaozhu, Wang Yaqing, Hombaiah Spurthi Amba, Liang Yi, Bendersky Michael
- Exploring Fine-tuning Chatgpt For News Recommendation Li Xinyi, Zhang Yongfeng, Malthouse Edward C
- Llm-grounded Diffusion: Enhancing Prompt Understanding Of Text-to-image Diffusion Models With Large Language Models Lian Long, Li Boyi, Yala Adam, Darrell Trevor
- Modeling Complex Mathematical Reasoning Via Large Language Model Based Mathagent Liao Haoran, Du Qinyi, Hu Shaohua, He Hao, Xu Yanyan, Tian Jidong, Jin Yaohui
- AI Transparency In The Age Of Llms: A Human-centered Research Roadmap Liao Q. Vera, Vaughan Jennifer Wortman
- Beneath The Surface: Unveiling Harmful Memes With Multimodal Reasoning Distilled From Large Language Models Lin Hongzhan, Luo Ziyang, Ma Jing, Chen Long
- AWQ: Activation-aware Weight Quantization For LLM Compression And Acceleration Lin Ji, Tang Jiaming, Tang Haotian, Yang Shang, Chen Wei-ming, Wang Wei-chen, Xiao Guangxuan, Dang Xingyu, Gan Chuang, Han Song
- Aligning Large Language Models With Human Preferences Through Representation Engineering Liu Wenhao, Wang Xiaohua, Wu Muling, Li Tianlong, Lv Changze, Ling Zixuan, Zhu Jianhao, Zhang Cenyuan, Zheng Xiaoqing, Huang Xuanjing
- Alignbench: Benchmarking Chinese Alignment Of Large Language Models Liu Xiao, Lei Xuanyu, Wang Shengyuan, Huang Yue, Feng Zhuoer, Wen Bosi, Cheng Jiale, Ke Pei, Xu Yifan, Tam Weng Lam, Zhang Xiaohan, Sun Lichao, Gu Xiaotao, Wang Hongning, Zhang Jing, Huang Minlie, Dong Yuxiao, Tang Jie
- Modeling Uncertainty And Using Post-fusion As Fallback Improves Retrieval Augmented Generation With Llms Liu Ye, Yavuz Semih, Meng Rui, Moorthy Meghana, Joty Shafiq, Xiong Caiming, Zhou Yingbo
- Radiology-gpt: A Large Language Model For Radiology Liu Zhengliang, Zhong Aoxiao, Li Yiwei, Yang Longtao, Ju Chao, Wu Zihao, Ma Chong, Shu Peng, Chen Cheng, Kim Sekeun, Dai Haixing, Zhao Lin, Sun Lichao, Zhu Dajiang, Liu Jun, Liu Wei, Shen Dinggang, Li Xiang, Li Quanzheng, Liu Tianming
- Chatgpt V Bard V Bing V Claude 2 V Aria V Human-expert. How Good Are AI Chatbots At Scientific Writing? Lozić Edisa, Štular Benjamin
- Multimodal Procedural Planning Via Dual Text-image Prompting Lu Yujie, Lu Pan, Chen Zhiyu, Zhu Wanrong, Wang Xin Eric, Wang William Yang
- Fighting Fire With Fire: The Dual Role Of Llms In Crafting And Detecting Elusive Disinformation Lucas Jason, Uchendu Adaku, Yamashita Michiharu, Lee Jooyoung, Rohatgi Shaurya, Lee Dongwon
- Unlocking The Potential Of Large Language Models For Explainable Recommendations Luo Yucong, Cheng Mingyue, Zhang Hao, Lu Junyu, Liu Qi, Chen Enhong
- Are We Falling In A Middle-intelligence Trap? An Analysis And Mitigation Of The Reversal Curse Lv Ang, Zhang Kaiyi, Xie Shufang, Tu Quan, Chen Yuhan, Wen Ji-rong, Yan Rui
- POS: A Prompts Optimization Suite For Augmenting Text-to-video Generation Ma Shijie, Xu Huayi, Li Mengjian, Geng Weidong, Wang Yaxiong, Wang Meng
- Generating Illustrated Instructions Menon Sachit, Misra Ishan, Girdhar Rohit
- Convgqr: Generative Query Reformulation For Conversational Search Mo Fengran, Mao Kelong, Zhu Yutao, Wu Yihong, Huang Kaiyu, Nie Jian-yun
- Chatgpt In The Age Of Generative AI And Large Language Models: A Concise Survey Mohamadi Salman, Mujtaba Ghulam, Le Ngan, Doretto Gianfranco, Adjeroh Donald A.
- Unival: Unified Model For Image, Video, Audio And Language Tasks Mustafa Shukor, Corentin Dancette, Alexandre Rame, Matthieu Cord
- Large Language Models Are Fixated By Red Herrings: Exploring Creative Problem Solving And Einstellung Effect Using The Only Connect Wall Dataset Naeini Saeid, Saqur Raeid, Saeidi Mozhgan, Giorgi John, Taati Babak
- Understanding (un)intended Memorization In Text-to-image Generative Models Naseh Ali, Roh Jaechul, Houmansadr Amir
- A Comprehensive Overview Of Large Language Models Naveed Humza, Khan Asad Ullah, Qiu Shi, Saqib Muhammad, Anwar Saeed, Usman Muhammad, Akhtar Naveed, Barnes Nick, Mian Ajmal
- Large Language Models As Tax Attorneys: A Case Study In Legal Capabilities Emergence Nay John J., Karamardian David, Lawsky Sarah B., Tao Wenting, Bhat Meghana, Jain Raghav, Lee Aaron Travis, Choi Jonathan H., Kasai Jungo
- ORES: Open-vocabulary Responsible Visual Synthesis Ni Minheng, Wu Chenfei, Wang Xiaodong, Yin Shengming, Wang Lijuan, Liu Zicheng, Duan Nan
- Adversarial Fine-tuning Of Language Models: An Iterative Optimisation Approach For The Generation And Detection Of Problematic Content O'neill Charles, Miller Jack, Ciuca Ioana, Ting Yuan-sen, Bui Thang
- Generative AI: Implications And Applications For Education Tzirides Anastasia Olga (Olnancy), Saini Akash, Zapata Gabriela, Searsmith Duane, Cope Bill, Kalantzis Mary, Castro Vania, Kourkoulou Theodora, Jones John, Da Silva Rodrigo Abrantes, Whiting Jen, Kastania Nikoleta Polyxeni
- On The Role Of Attention In Prompt-tuning Oymak Samet, Rawat Ankit Singh, Soltanolkotabi Mahdi, Thrampoulidis Christos
- How To Catch An AI Liar: Lie Detection In Black-box Llms By Asking Unrelated Questions Pacchiardi Lorenzo, Chan Alex J., Mindermann Sören, Moscovitz Ilan, Pan Alexa Y., Gal Yarin, Evans Owain, Brauner Jan
- Understanding The Capabilities Of Large Language Models For Automated Planning Pallagani Vishal, Muppasani Bharath, Murugesan Keerthiram, Rossi Francesca, Srivastava Biplav, Horesh Lior, Fabiano Francesco, Loreggia Andrea
- COSMIC: Data Efficient Instruction-tuning For Speech In-context Learning Pan Jing, Wu Jian, Gaur Yashesh, Sivasankaran Sunit, Chen Zhuo, Liu Shujie, Li Jinyu
- Automatically Correcting Large Language Models: Surveying The Landscape Of Diverse Self-correction Strategies Pan Liangming, Saxon Michael, Xu Wenda, Nathani Deepak, Wang Xinyi, Wang William Yang
- Automating Customer Service Using Langchain: Building Custom Open-source GPT Chatbot For Organizations Pandya Keivalya, Holia Mehfuza
- Longbox: Evaluating Transformers On Long-sequence Clinical Tasks Parmar Mihir, Naik Aakanksha, Gupta Himanshu, Agrawal Disha, Baral Chitta
- ECLIPSE: A Resource-efficient Text-to-image Prior For Image Generations Patel Maitreya, Kim Changhoon, Cheng Sheng, Baral Chitta, Yang Yezhou
- Science In The Era Of Chatgpt, Large Language Models And Generative AI: Challenges For Research Ethics And How To Respond Pournaras Evangelos
- Egovlpv2: Egocentric Video-language Pre-training With Fusion In The Backbone Pramanick Shraman, Song Yale, Nag Sayan, Lin Kevin Qinghong, Shah Hardik, Shou Mike Zheng, Chellappa Rama, Zhang Pengchuan
- Visual Adversarial Examples Jailbreak Aligned Large Language Models Qi Xiangyu, Huang Kaixuan, Panda Ashwinee, Henderson Peter, Wang Mengdi, Mittal Prateek
- Controlrec: Bridging The Semantic Gap Between Language Model And Personalized Recommendation Qiu Junyan, Wang Haitao, Hong Zhaolin, Yang Yiping, Liu Qiang, Wang Xingxing
- Layoutllm-t2i: Eliciting Layout Guidance From LLM For Text-to-image Generation Qu Leigang, Wu Shengqiong, Fei Hao, Nie Liqiang, Chua Tat-seng
- The Troubling Emergence Of Hallucination In Large Language Models -- An Extensive Definition, Quantification, And Prescriptive Remediations Rawte Vipula, Chakraborty Swagata, Pathak Agnibh, Sarkar Anubhav, Tonmoy S. M Towhidul Islam, Chadha Aman, Sheth Amit P., Das Amitava
- Exploring The Relationship Between LLM Hallucinations And Prompt Linguistic Nuances: Readability, Formality, And Concreteness Rawte Vipula, Priya Prachi, Tonmoy S. M Towhidul Islam, Zaman S M Mehedi, Sheth Amit, Das Amitava
- Towards A Performance Analysis On Pre-trained Visual Question Answering Models For Autonomous Driving Rekanar Kaavya, Eising Ciarán, Sistu Ganesh, Hayes Martin
- The Manipulation Problem: Conversational AI As A Threat To Epistemic Agency Rosenberg Louis
- Knowledge-infused Self Attention Transformers Roy Kaushik, Zi Yuxin, Narayanan Vignesh, Gaur Manas, Sheth Amit
- A Symmetric Dual Encoding Dense Retrieval Framework For Knowledge-intensive Visual Question Answering Salemi Alireza, Pizzorno Juan Altmayer, Zamani Hamed
- Unlocking Practical Applications In Legal Domain: Evaluation Of GPT For Zero-shot Semantic Annotation Of Legal Texts Savelka Jaromir
- Large Language Models (GPT) Struggle To Answer Multiple-choice Questions About Code Savelka Jaromir, Agarwal Arav, Bogart Christopher, Sakr Majd
- Surgicalgpt: End-to-end Language-vision GPT For Visual Question Answering In Surgery Seenivasan Lalithkumar, Islam Mobarakol, Kannan Gokul, Ren Hongliang
- The Trickle-down Impact Of Reward (in-)consistency On RLHF Shen Lingfeng, Chen Sihao, Song Linfeng, Jin Lifeng, Peng Baolin, Mi Haitao, Khashabi Daniel, Yu Dong
- Storygpt-v: Large Language Models As Consistent Story Visualizers Shen Xiaoqian, Elhoseiny Mohamed
- Flexgen: High-throughput Generative Inference Of Large Language Models With A Single GPU Sheng Ying, Zheng Lianmin, Yuan Binhang, Li Zhuohan, Ryabinin Max, Fu Daniel Y., Xie Zhiqiang, Chen Beidi, Barrett Clark, Gonzalez Joseph E., Liang Percy, Ré Christopher, Stoica Ion, Zhang Ce
- Crossget: Cross-guided Ensemble Of Tokens For Accelerating Vision-language Transformers Shi Dachuan, Tao Chaofan, Rao Anyi, Yang Zhendong, Yuan Chun, Wang Jiaqi
- L3 Ensembles: Lifelong Learning Approach For Ensemble Of Foundational Language Models Shiri Aidin, Roy Kaushik, Sheth Amit, Gaur Manas
- POSQA: Probe The World Models Of Llms With Size Comparisons Shu Chang, Han Jiuzhou, Liu Fangyu, Shareghi Ehsan, Collier Nigel
- Fusion-eval: Integrating Assistant Evaluators With Llms Shu Lei, Wichers Nevan, Luo Liangchen, Zhu Yun, Liu Yinxiao, Chen Jindong, Meng Lei
- The Curse Of Recursion: Training On Generated Data Makes Models Forget Shumailov Ilia, Shumaylov Zakhar, Zhao Yiren, Gal Yarin, Papernot Nicolas, Anderson Ross
- Getting More Out Of Mixture Of Language Model Reasoning Experts Si Chenglei, Shi Weijia, Zhao Chen, Zettlemoyer Luke, Boyd-graber Jordan
- Knowledge Unlearning For Llms: Tasks, Methods, And Challenges Si Nianwen, Zhang Hao, Chang Heyu, Zhang Wenlin, Qu Dan, Zhang Weiqiang
- Mind Meets Machine: Unravelling Gpt-4's Cognitive Psychology Sifatkaur Dhingra, Manmeet Singh, Vaisakh Sb, Neetiraj Malviya, Sukhpal Singh Gill
- Conpet: Continual Parameter-efficient Tuning For Large Language Models Song Chenyang, Han Xu, Zeng Zheni, Li Kuai, Chen Chen, Liu Zhiyuan, Sun Maosong, Yang Tao
- Harnessing Llms In Curricular Design: Using GPT-4 To Support Authoring Of Learning Objectives Sridhar Pragnya, Doyle Aidan, Agarwal Arav, Bogart Christopher, Savelka Jaromir, Sakr Majd
- Training Large Language Models Efficiently With Sparsity And Dataflow Srinivasan Venkat, Gandhi Darshan, Thakker Urmish, Prabhakar Raghu
- Beyond Memorization: Violating Privacy Via Inference With Large Language Models Staab Robin, Vero Mark, Balunović Mislav, Vechev Martin
- Choice Fusion As Knowledge For Zero-shot Dialogue State Tracking Su Ruolin, Yang Jingfeng, Wu Ting-wei, Juang Biing-hwang
- F3-pruning: A Training-free And Generalized Pruning Strategy Towards Faster And Finer Text-to-video Synthesis Su Sitong, Liu Jianzhi, Gao Lianli, Song Jingkuan
- Create Your World: Lifelong Text-to-image Diffusion Sun Gan, Liang Wenqi, Dong Jiahua, Li Jun, Ding Zhengming, Cong Yang
- Generative Multimodal Models Are In-context Learners Sun Quan, Cui Yufeng, Zhang Xiaosong, Zhang Fan, Yu Qiying, Luo Zhengxiong, Wang Yueze, Rao Yongming, Liu Jingjing, Huang Tiejun, Wang Xinlong
- Learning To Imagine: Visually-augmented Natural Language Generation Tang Tianyi, Chen Yushuo, Du Yifan, Li Junyi, Zhao Wayne Xin, Wen Ji-rong
- Caption Anything: Interactive Image Description With Diverse Multimodal Controls Teng Wang, Jinrui Zhang, Junjie Fei, Hao Zheng, Yunlong Tang, Zhe Li, Mingqi Gao, Shanshan Zhao
- Viclevr: A Visual Reasoning Dataset And Hybrid Multimodal Fusion Model For Visual Question Answering In Vietnamese Tran Khiem Vinh, Phan Hao Phu, Van Nguyen Kiet, Nguyen Ngan Luu Thuy
- Automl-gpt: Large Language Model For Automl Tsai Yun-da, Tsai Yu-che, Huang Bo-wei, Yang Chun-pai, Lin Shou-de
- Theory Of Mind In Large Language Models: Examining Performance Of 11 State-of-the-art Models Vs. Children Aged 7-10 On Advanced Tests Van Duijn Max J., Van Dijk Bram M. A., Kouwenhoven Tom, De Valk Werner, Spruit Marco R., Van Der Putten Peter
- A Survey Of The Evolution Of Language Model-based Dialogue Systems Wang Hongru, Wang Lingzhi, Du Yiming, Chen Liang, Zhou Jingyan, Wang Yufei, Wong Kam-fai
- Missrec: Pre-training And Transferring Multi-modal Interest-aware Sequence Representation For Recommendation Wang Jinpeng, Zeng Ziyun, Wang Yunxiao, Wang Yuting, Lu Xingyu, Li Tianxiang, Yuan Jun, Zhang Rui, Zheng Hai-tao, Xia Shu-tao
- Compositional Text-to-image Synthesis With Attention Map Control Of Diffusion Models Wang Ruichen, Chen Zekang, Chen Chen, Ma Jian, Lu Haonan, Lin Xiaodong
- Lora Ensembles For Large Language Model Fine-tuning Wang Xi, Aitchison Laurence, Rudolph Maja
- User-aware Prefix-tuning Is A Good Learner For Personalized Image Captioning Wang Xuan, Wang Guanhong, Chai Wenhao, Zhou Jiayu, Wang Gaoang
- Gpt4video: A Unified Multimodal Large Language Model For Instruction-followed Understanding And Safety-aware Generation Wang Zhanyu, Wang Longyue, Zhao Zhen, Wu Minghao, Lyu Chenyang, Li Huayang, Cai Deng, Zhou Luping, Shi Shuming, Tu Zhaopeng
- Do Llms Understand User Preferences? Evaluating Llms On User Rating Prediction Wang-cheng Kang, Jianmo Ni, Nikhil Mehta, Maheswaran Sathiamoorthy, Lichan Hong, Ed Chi, Derek Zhiyuan Cheng
- Mindmap: Knowledge Graph Prompting Sparks Graph Of Thoughts In Large Language Models Wen Yilin, Wang Zifeng, Sun Jimeng
- Efficient LLM Inference Solution On Intel GPU Wu Hui, Gan Yi, Yuan Feng, Ma Jing, Zhu Wei, Xu Yutao, Zhu Hong, Zhu Yuhua, Liu Xiaoli, Gu Jinghui, Zhao Peng
- Approximated Prompt Tuning For Vision-language Pre-trained Models Wu Qiong, Huang Shubin, Zhou Yiyi, Dai Pingyang, Shu Annan, Jiang Guannan, Ji Rongrong
- Next-gpt: Any-to-any Multimodal LLM Wu Shengqiong, Fei Hao, Qu Leigang, Ji Wei, Chua Tat-seng
- Self-correcting Llm-controlled Diffusion Models Wu Tsung-han, Lian Long, Gonzalez Joseph E., Li Boyi, Darrell Trevor
- Paragraph-to-image Generation With Information-enriched Diffusion Model Wu Weijia, Li Zhuang, He Yefei, Shou Mike Zheng, Shen Chunhua, Cheng Lele, Li Yan, Gao Tingting, Zhang Di, Wang Zhongyuan
- LLMGA: Multimodal Large Language Model Based Generation Assistant Xia Bin, Wang Shiyin, Tao Yingfan, Wang Yitong, Jia Jiaya
- Can I Trust Your Answer? Visually Grounded Video Question Answering Xiao Junbin, Yao Angela, Li Yicong, Chua Tat Seng
- Lm-cocktail: Resilient Tuning Of Language Models Via Model Merging Xiao Shitao, Liu Zheng, Zhang Peitian, Xing Xingrun
- Bridging Items And Language: A Transition Paradigm For Large Language Model-based Recommendation Xinyu Lin, Wenjie Wang, Yongqi Li, Fuli Feng, See-kiong Ng, Tat-seng Chua
- Rationale-enhanced Language Models Are Better Continual Relation Learners Xiong Weimin, Song Yifan, Wang Peiyi, Li Sujian
- Large Language Models For Generative Information Extraction: A Survey Xu Derong, Chen Wei, Peng Wenjun, Zhang Chao, Xu Tong, Zhao Xiangyu, Wu Xian, Zheng Yefeng, Wang Yang, Chen Enhong
- Rethinking The Instruction Quality: LIFT Is What You Need Xu Yang, Yao Yongqiang, Huang Yufan, Qi Mengnan, Wang Maoquan, Gu Bin, Sundaresan Neel
- Tackling Vision Language Tasks Through Learning Inner Monologues Yang Diji, Chen Kezhen, Rao Jinmeng, Guo Xiaoyuan, Zhang Yawen, Yang Jie, Zhang Yi
- Medxchat: A Unified Multimodal Large Language Model Framework Towards Cxrs Understanding And Generation Yang Ling, Wang Zhanyu, Chen Zhenghao, Liang Xinyu, Zhou Luping
- Supervised Knowledge Makes Large Language Models Better In-context Learners Yang Linyi, Zhang Shuibai, Yu Zhuohao, Bao Guangsheng, Wang Yidong, Wang Jindong, Xu Ruochen, Ye Wei, Xie Xing, Chen Weizhu, Zhang Yue
- Dior-cvae: Pre-trained Language Models And Diffusion Priors For Variational Dialog Generation Yang Tianyu, Tran Thy Thy, Gurevych Iryna
- Beyond Chain-of-thought, Effective Graph-of-thought Reasoning In Language Models Yao Yao, Li Zuchao, Zhao Hai
- Diffusion Language Models Can Perform Many Tasks With Scaling And Instruction-finetuning Ye Jiasheng, Zheng Zaixiang, Bao Yu, Qian Lihua, Gu Quanquan
- Tram: A Token-level Retrieval-augmented Mechanism For Source Code Summarization Ye Tong, Wu Lingfei, Ma Tengfei, Zhang Xuhong, Du Yangkai, Liu Peiyu, Ji Shouling, Wang Wenhai
- Navigating Text-to-image Customization: From Lycoris Fine-tuning To Model Evaluation Yeh Shih-ying, Hsieh Yu-guan, Gao Zhidong, Yang Bernard B W, Oh Giyeong, Gong Yanmin
- Kola: Carefully Benchmarking World Knowledge Of Large Language Models Yu Jifan, Wang Xiaozhi, Tu Shangqing, Cao Shulin, Zhang-li Daniel, Lv Xin, Peng Hao, Yao Zijun, Zhang Xiaohan, Li Hanming, Li Chunyang, Zhang Zheyuan, Bai Yushi, Liu Yantao, Xin Amy, Lin Nianyi, Yun Kaifeng, Gong Linlu, Chen Jianhui, Wu Zhili, Qi Yunjia, Li Weikai, Guan Yong, Zeng Kaisheng, Qi Ji, Jin Hailong, Liu Jinxin, Gu Yu, Yao Yuan, Ding Ning, Hou Lei, Liu Zhiyuan, Xu Bin, Tang Jie, Li Juanzi
- Towards Better Chain-of-thought Prompting Strategies: A Survey Yu Zihan, He Liang, Wu Zhen, Dai Xinyu, Chen Jiajun
- Large Language Models Illuminate A Progressive Pathway To Artificial Healthcare Assistant: A Review Yuan Mingze, Bao Peng, Yuan Jiajia, Shen Yunhao, Chen Zifan, Xie Yi, Zhao Jie, Chen Yang, Zhang Li, Shen Lin, Dong Bin
- Artgpt-4: Towards Artistic-understanding Large Vision-language Models With Enhanced Adapter Yuan Zhengqing, He Yunhong, Wang Kun, Ye Yanfang, Sun Lichao
- Prompt Highlighter: Interactive Control For Multi-modal Llms Yuechen Zhang, Shengju Qian, Bohao Peng, Shu Liu, Jiaya Jia
- Planting A SEED Of Vision In Large Language Model Yuying Ge, Yixiao Ge, Ziyun Zeng, Xintao Wang, Ying Shan
- User-controlled Knowledge Fusion In Large Language Models: Balancing Creativity And Hallucination Zhang Chen
- A Comprehensive Analysis Of The Effectiveness Of Large Language Models As Automatic Dialogue Evaluators Zhang Chen, D'haro Luis Fernando, Chen Yiming, Zhang Malu, Li Haizhou
- SVIT: Scaling Up Visual Instruction Tuning Zhao Bo, Wu Boya, He Muyang, Huang Tiejun
- Explainability For Large Language Models: A Survey Zhao Haiyan, Chen Hanjie, Yang Fan, Liu Ninghao, Deng Huiqi, Cai Hengyi, Wang Shuaiqiang, Yin Dawei, Du Mengnan
- Easygen: Easing Multimodal Generation With Bidiffuser And Llms Zhao Xiangyu, Liu Bo, Liu Qijiong, Shi Guangyuan, Wu Xiao-ming
- Seeking Neural Nuggets: Knowledge Transfer In Large Language Models From A Parametric Perspective Zhong Ming, An Chenxin, Chen Weizhu, Han Jiawei, He Pengcheng
- Sur-adapter: Enhancing Text-to-image Pre-trained Diffusion Models With Large Language Models Zhong Shanshan, Huang Zhongzhan, Wen Wushao, Qin Jinghui, Lin Liang
- Autodan: Interpretable Gradient-based Adversarial Attacks On Large Language Models Zhu Sicheng, Zhang Ruiyi, An Bang, Wu Gang, Barrow Joe, Wang Zichao, Huang Furong, Nenkova Ani, Sun Tong
- 3d-vista: Pre-trained Transformer For 3D Vision And Text Alignment Zhu Ziyu, Ma Xiaojian, Chen Yixin, Deng Zhidong, Huang Siyuan, Li Qing
- Recommender Systems In The Era Of Large Language Models (llms) Zihuai Zhao, Wenqi Fan, Jiatong Li, Yunqing Liu, Xiaowei Mei, Yiqi Wang, Zhen Wen, Fei Wang, Xiangyu Zhao, Jiliang Tang, Qing Li
- A Survey On Symbolic Knowledge Distillation Of Large Language Models Acharya Kamal, Velasquez Alvaro, Song Houbing Herbert
- FACTS About Building Retrieval Augmented Generation-based Chatbots Akkiraju Rama, Xu Anbang, Bora Deepak, Yu Tan, An Lu, Seth Vishal, Shukla Aaditya, Gundecha Pritam, Mehta Hridhay, Jha Ashwin, Raj Prithvi, Balasubramanian Abhinav, Maram Murali, Muthusamy Guru, Annepally Shivakesh Reddy, Knowles Sidney, Du Min, Burnett Nick, Javiya Sean, Marannan Ashok, Kumari Mamta, Jha Surbhi, Dereszenski Ethan, Chakraborty Anupam, Ranjan Subhash, Terfai Amina, Surya Anoop, Mercer Tracey, Thanigachalam Vinodh Kumar, Bar Tamar, Krishnan Sanjana, Kilaru Samy, Jaksic Jasmine, Algarici Nave, Liberman Jacob, Conway Joey, Nayyar Sonu, Boitano Justin
- Mitigating Catastrophic Forgetting In Language Transfer Via Model Merging Alexandrov Anton, Raychev Veselin, Müller Mark Niklas, Zhang Ce, Vechev Martin, Toutanova Kristina
- The Pitfalls Of Next-token Prediction Bachmann Gregor, Nagarajan Vaishnavh
- LAMP: A Language Model On The Map Balsebre Pasquale, Huang Weiming, Cong Gao
- Matryoshka Multimodal Models Cai Mu, Yang Jianwei, Gao Jianfeng, Lee Yong Jae
- Lococo: Dropping In Convolutions For Long Context Compression Cai Ruisi, Tian Yuandong, Wang Zhangyang, Chen Beidi
- XAI Meets Llms: A Survey Of The Relation Between Explainable AI And Large Language Models Cambria Erik, Malandri Lorenzo, Mercorio Fabio, Nobani Navid, Seveso Andrea
- Genception: Evaluate Multimodal Llms With Unlabeled Unimodal Data Cao Lele, Buchner Valentin, Senane Zineb, Yang Fangkai
- Large Language Models As Instruments Of Power: New Regimes Of Autonomous Manipulation And Control Chaudhary Yaqub, Penn Jonnie
- Model Composition For Multimodal Large Language Models Chen Chi, Du Yiyang, Fang Zheng, Wang Ziyue, Luo Fuwen, Li Peng, Yan Ming, Zhang Ji, Huang Fei, Sun Maosong, Liu Yang
- Lower Layer Matters: Alleviating Hallucination Via Multi-layer Fusion Contrastive Decoding With Truthfulness Refocused Chen Dingwei, Fang Feiteng, Ni Shiwen, Liang Feng, Xu Ruifeng, Yang Min, Li Chengming
- Pixart-Σ: Weak-to-strong Training Of Diffusion Transformer For 4K Text-to-image Generation Chen Junsong, Ge Chongjian, Xie Enze, Wu Yue, Yao Lewei, Ren Xiaozhe, Wang Zhongdao, Luo Ping, Lu Huchuan, Li Zhenguo
- Enhancing Visual Question Answering Through Ranking-based Hybrid Training And Multimodal Fusion Chen Peiyuan, Zhang Zecheng, Dong Yiping, Zhou Li, Wang Han
- Hiqa: A Hierarchical Contextual Augmentation RAG For Massive Documents QA Chen Xinyue, Gao Pengyu, Song Jiangjiang, Tan Xiaoyang
- Recoverable Compression: A Multimodal Vision Token Recovery Mechanism Guided By Text Information Chen Yi, Xu Jian, Zhang Xu-yao, Liu Wen-zhuo, Liu Yang-yang, Liu Cheng-lin
- What Are The Essential Factors In Crafting Effective Long Context Multi-hop Instruction Datasets? Insights And Best Practices Chen Zhi, Chen Qiguang, Qin Libo, Guo Qipeng, Lv Haijun, Zou Yicheng, Che Wanxiang, Yan Hang, Chen Kai, Lin Dahua
- Multi-prompting Decoder Helps Better Language Understanding Cheng Zifeng, Chen Zhaoling, Jiang Zhiwei, Yin Yafeng, Ge Shiping, Liu Yuliang, Gu Qing
- ANOLE: An Open, Autoregressive, Native Large Multimodal Models For Interleaved Image-text Generation Chern Ethan, Su Jiadi, Ma Yan, Liu Pengfei
- Multi-granularity Guided Fusion-in-decoder Choi Eunseong, Lee Hyeri, Lee Jongwuk
- Deconstructing The Ethics Of Large Language Models From Long-standing Issues To New-emerging Dilemmas Deng Chengyuan, Duan Yiqun, Jin Xin, Chang Heng, Tian Yijun, Liu Han, Zou Henry Peng, Jin Yiqiao, Xiao Yijia, Wang Yichen, Wu Shenghao, Xie Zongxing, Gao Kuofeng, He Sihong, Zhuang Jun, Cheng Lu, Wang Haohan
- Promises, Outlooks And Challenges Of Diffusion Language Modeling Deschenaux Justin, Gulcehre Caglar
- Large Language Model Agent In Financial Trading: A Survey Ding Han, Li Yinheng, Wang Junhao, Chen Hang
- Modality-aware Integration With Large Language Models For Knowledge-based Visual Question Answering Dong Junnan, Zhang Qinggang, Zhou Huachi, Zha Daochen, Zheng Pai, Huang Xiao
- Instruction-guided Scene Text Recognition Du Yongkun, Chen Zhineng, Su Yuchen, Jia Caiyan, Jiang Yu-gang
- Prompt Optimizer Of Text-to-image Diffusion Models For Abstract Concept Understanding Fan Zezhong, Li Xiaohan, Fang Chenhao, Biswas Topojoy, Nag Kaushiki, Xu Jianpeng, Achan Kannan
- Disperse-then-merge: Pushing The Limits Of Instruction Tuning Via Alignment Tax Reduction Fu Tingchen, Cai Deng, Liu Lemao, Shi Shuming, Yan Rui
- Practical Unlearning For Large Language Models Gao Chongyang, Wang Lixu, Weng Chenkai, Wang Xiao, Zhu Qi
- The Best Of Both Worlds: Toward An Honest And Helpful Large Language Model Gao Chujie, Zhang Qihui, Chen Dongping, Huang Yue, Wu Siyuan, Fu Zhengyan, Wan Yao, Zhang Xiangliang, Sun Lichao
- Lumina-t2x: Transforming Text Into Any Modality, Resolution, And Duration Via Flow-based Large Diffusion Transformers Gao Peng, Zhuo Le, Liu Dongyang, Du Ruoyi, Luo Xu, Qiu Longtian, Zhang Yuhang, Lin Chen, Huang Rongjie, Geng Shijie, Zhang Renrui, Xi Junlin, Shao Wenqi, Jiang Zhengkai, Yang Tianshuo, Ye Weicai, Tong He, He Jingwen, Qiao Yu, Li Hongsheng
- Improving Language Understanding From Screenshots Gao Tianyu, Wang Zirui, Bhaskar Adithya, Chen Danqi
- Worldgpt: Empowering LLM As Multimodal World Model Ge Zhiqi, Huang Hongzhe, Zhou Mingze, Li Juncheng, Wang Guoming, Tang Siliang, Zhuang Yueting
- Feedback-aligned Mixed Llms For Machine Language-molecule Translation Gkoumas Dimitris, Liakata Maria
- Arcee's Mergekit: A Toolkit For Merging Large Language Models Goddard Charles, Siriwardhana Shamane, Ehghaghi Malikeh, Meyers Luke, Karpukhin Vlad, Benedict Brian, Mcquade Mark, Solawetz Jacob
- Omnifusion Technical Report Goncharova Elizaveta, Razzhigaev Anton, Mikhalchuk Matvey, Kurkin Maxim, Abdullaeva Irina, Skripkin Matvey, Oseledets Ivan, Dimitrov Denis, Kuznetsov Andrey
- Fine-tuned Language Models Generate Stable Inorganic Materials As Text Gruver Nate, Sriram Anuroop, Madotto Andrea, Wilson Andrew Gordon, Zitnick C. Lawrence, Ulissi Zachary
- Probing The Robustness Of Vision-language Pretrained Models: A Multimodal Adversarial Attack Approach Guan Jiwei, Ding Tianyu, Cao Longbing, Pan Lei, Wang Chen, Zheng Xi
- Recent Advances In Generative AI And Large Language Models: Current Status, Challenges, And Perspectives Hagos Desta Haileselassie, Battle Rick, Rawat Danda B.
- Covert Malicious Finetuning: Challenges In Safeguarding LLM Adaptation Halawi Danny, Wei Alexander, Wallace Eric, Wang Tony T., Haghtalab Nika, Steinhardt Jacob
- Transfer Learning For Text Diffusion Models Han Kehang, Kenealy Kathleen, Barua Aditya, Fiedel Noah, Constant Noah
- EMMA: Your Text-to-image Diffusion Model Can Secretly Accept Multi-modal Prompts Han Yucheng, Wang Rui, Zhang Chi, Hu Juntao, Cheng Pei, Fu Bin, Zhang Hanwang
- Pitvqa: Image-grounded Text Embedding LLM For Visual Question Answering In Pituitary Surgery He Runlong, Xu Mengya, Das Adrito, Khan Danyal Z., Bano Sophia, Marcus Hani J., Stoyanov Danail, Clarkson Matthew J., Islam Mobarakol
- Prompting Chatgpt For Translation: A Comparative Analysis Of Translation Brief And Persona Prompts He Sui
- MARS: Mixture Of Auto-regressive Models For Fine-grained Text-to-image Synthesis He Wanggui, Fu Siming, Liu Mushui, Wang Xierui, Xiao Wenyi, Shu Fangxun, Wang Yi, Zhang Lei, Yu Zhelun, Li Haoyuan, Huang Ziwei, Gan Leilei, Jiang Hao
- Llms Meet Multimodal Generation And Editing: A Survey He Yingqing, Liu Zhaoyang, Chen Jingye, Tian Zeyue, Liu Hongyu, Chi Xiaowei, Liu Runtao, Yuan Ruibin, Xing Yazhou, Wang Wenhai, Dai Jifeng, Zhang Yong, Xue Wei, Liu Qifeng, Guo Yike, Chen Qifeng
- Automated Black-box Prompt Engineering For Personalized Text-to-image Generation He Yutong, Robey Alexander, Murata Naoki, Jiang Yiding, Williams Joshua, Pappas George J., Hassani Hamed, Mitsufuji Yuki, Salakhutdinov Ruslan, Kolter J. Zico
- Let The Code LLM Edit Itself When You Edit The Code He Zhenyu, Zhang Jun, Luo Shengjie, Xu Jingjing, Zhang Zhi, He Di
- Generative Visual Instruction Tuning Hernandez Jefferson, Villegas Ruben, Ordonez Vicente
- Multimodal Chain-of-thought Reasoning Via Chatgpt To Protect Children From Age-inappropriate Apps Hu Chuanbo, Liu Bin, Yin Minglei, Zhou Yilu, Li Xin
- ELLA: Equip Diffusion Models With LLM For Enhanced Semantic Alignment Hu Xiwei, Wang Rui, Fang Yixiao, Fu Bin, Cheng Pei, Yu Gang
- An Intermediate Fusion Vit Enables Efficient Text-image Alignment In Diffusion Models Hu Zizhao, Jia Shaochong, Rostami Mohammad
- Large Language Models In Mental Health Care: A Scoping Review Hua Yining, Liu Fenglin, Yang Kailai, Li Zehan, Na Hongbin, Sheu Yi-han, Zhou Peilin, Moran Lauren V., Ananiadou Sophia, Beam Andrew, Torous John
- Applying And Evaluating Large Language Models In Mental Health Care: A Scoping Review Of Human-assessed Generative Tasks Hua Yining, Na Hongbin, Li Zehan, Liu Fenglin, Fang Xiao, Clifton David, Torous John
- Selective Prompting Tuning For Personalized Conversations With Llms Huang Qiushi, Liu Xubo, Ko Tom, Wu Bo, Wang Wenwu, Zhang Yu, Tang Lilian
- Leveraging Large Language Models For Enhanced NLP Task Performance Through Knowledge Distillation And Optimized Training Strategies Huang Yining, Tang Keke, Chen Meilian
- News Without Borders: Domain Adaptation Of Multilingual Sentence Embeddings For Cross-lingual News Recommendation Iana Andreea, Schmidt Fabian David, Glavaš Goran, Paulheim Heiko
- Response Generation For Cognitive Behavioral Therapy With Large Language Models: Comparative Study With Socratic Questioning Izumi Kenta, Tanaka Hiroki, Shidara Kazuhiro, Adachi Hiroyoshi, Kanayama Daisuke, Kudo Takashi, Nakamura Satoshi
- Can't Say Cant? Measuring And Reasoning Of Dark Jargons In Large Language Models Ji Xu, Zhang Jianyi, Zhou Ziyin, Zhao Zhangchi, Qiao Qianqian, Han Kaiying, Hossen Md Imran, Hei Xiali
- Prior Knowledge Integration Via LLM Encoding And Pseudo Event Regulation For Video Moment Retrieval Jiang Yiyang, Zhang Wengyu, Zhang Xulu, Wei Xiaoyong, Chen Chang Wen, Li Qing
- Llms Can Find Mathematical Reasoning Mistakes By Pedagogical Chain-of-thought Jiang Zhuoxuan, Peng Haoyuan, Feng Shanshan, Li Fan, Li Dongsheng
- Llms Can't Plan, But Can Help Planning In Llm-modulo Frameworks Kambhampati Subbarao, Valmeekam Karthik, Guan Lin, Verma Mudit, Stechly Kaya, Bhambri Siddhant, Saldyt Lucas, Murthy Anil
- Self-moe: Towards Compositional Large Language Models With Self-specialized Experts Kang Junmo, Karlinsky Leonid, Luo Hongyin, Wang Zhen, Hansen Jacob, Glass James, Cox David, Panda Rameswar, Feris Rogerio, Ritter Alan
- Finding A Needle In The Adversarial Haystack: A Targeted Paraphrasing Approach For Uncovering Edge Cases With Minimal Distribution Distortion Kassem Aly M., Saad Sherif
- Remi: A Dataset For Reasoning With Multiple Images Kazemi Mehran, Dikkala Nishanth, Anand Ankit, Devic Petar, Dasgupta Ishita, Liu Fangyu, Fatemi Bahare, Awasthi Pranjal, Guo Dee, Gollapudi Sreenivas, Qureshi Ahmed
- Reinforcement Of Explainability Of Chatgpt Prompts By Embedding Breast Cancer Self-screening Rules Into AI Responses Khan Yousef, Hamed Ahmed Abdeen
- Persona Is A Double-edged Sword: Enhancing The Zero-shot Reasoning By Ensembling The Role-playing And Neutral Prompts Kim Junseok, Yang Nakyeong, Jung Kyomin
- Openvla: An Open-source Vision-language-action Model Kim Moo Jin, Pertsch Karl, Karamcheti Siddharth, Xiao Ted, Balakrishna Ashwin, Nair Suraj, Rafailov Rafael, Foster Ethan, Lam Grace, Sanketi Pannag, Vuong Quan, Kollar Thomas, Burchfiel Benjamin, Tedrake Russ, Sadigh Dorsa, Levine Sergey, Liang Percy, Finn Chelsea
- Multi-modal Generation Via Cross-modal In-context Learning Kumar Amandeep, Naseer Muzammal, Narayan Sanath, Anwer Rao Muhammad, Khan Salman, Cholakkal Hisham
- Harmonizing Code-mixed Conversations: Personality-assisted Code-mixed Response Generation In Dialogues Kumar Shivani, Chakraborty Tanmoy
- Zero-shot Text-guided Infinite Image Synthesis With LLM Guidance Kwon Soyeong, Lee Taegyeong, Kim Taehwan
- Biomistral: A Collection Of Open-source Pretrained Large Language Models For Medical Domains Labrak Yanis, Bazoge Adrien, Morin Emmanuel, Gourraud Pierre-antoine, Rouvier Mickael, Dufour Richard
- Small-e: Small Language Model With Linear Attention For Efficient Speech Synthesis Lemerle Théodor, Obin Nicolas, Roebel Axel
- Purifying Large Language Models By Ensembling A Small Language Model Li Tianlin, Liu Qian, Pang Tianyu, Du Chao, Guo Qing, Liu Yang, Lin Min
- MM-TTS: A Unified Framework For Multimodal, Prompt-induced Emotional Text-to-speech Synthesis Li Xiang, Cheng Zhi-qi, He Jun-yan, Peng Xiaojiang, Hauptmann Alexander G.
- What If We Recaption Billions Of Web Images With Llama-3? Li Xianhang, Tu Haoqin, Hui Mude, Wang Zeyu, Zhao Bingchen, Xiao Junfei, Ren Sucheng, Mei Jieru, Liu Qing, Zheng Huangjie, Zhou Yuyin, Xie Cihang
- Grounded Compositional And Diverse Text-to-3d With Pretrained Multi-view Diffusion Model Li Xiaolong, Mo Jiawei, Wang Ying, Parameshwara Chethan, Fei Xiaohan, Swaminathan Ashwin, Taylor Cj, Tu Zhuowen, Favaro Paolo, Soatto Stefano
- VSTAR: Generative Temporal Nursing For Longer Dynamic Video Synthesis Li Yumeng, Beluch William, Keuper Margret, Zhang Dan, Khoreva Anna
- Llms Meet Long Video: Advancing Long Video Question Answering With An Interactive Visual Adapter In Llms Li Yunxin, Chen Xinyu, Hu Baotian, Zhang Min
- Amphista: Accelerate LLM Inference With Bi-directional Multiple Drafting Heads In A Non-autoregressive Style Li Zeping, Yang Xinlong, Gao Ziheng, Liu Ji, Liu Zhuang, Li Dong, Peng Jinzhang, Tian Lu, Barsoum Emad
- Text-to-model: Text-conditioned Neural Network Diffusion For Train-once-for-all Personalization Li Zexi, Gao Lingzhi, Wu Chao
- Hunyuan-dit: A Powerful Multi-resolution Diffusion Transformer With Fine-grained Chinese Understanding Li Zhimin, Zhang Jianwei, Lin Qin, Xiong Jiangfeng, Long Yanxin, Deng Xinchi, Zhang Yingfang, Liu Xingchao, Huang Minbin, Xiao Zedong, Chen Dayou, He Jiajun, Li Jiahao, Li Wenyue, Zhang Chen, Quan Rongwei, Lu Jianxiang, Huang Jiabin, Yuan Xiaoyan, Zheng Xiaoxiao, Li Yixuan, Zhang Jihong, Zhang Chao, Chen Meng, Liu Jie, Fang Zheng, Wang Weiyan, Xue Jinbao, Tao Yangyu, Zhu Jianchen, Liu Kai, Lin Sihuan, Sun Yifu, Li Yun, Wang Dongdong, Chen Mingtao, Hu Zhichao, Xiao Xiao, Chen Yan, Liu Yuhong, Liu Wei, Wang Di, Yang Yong, Jiang Jie, Lu Qinglin
- Effects Of Different Prompts On The Quality Of GPT-4 Responses To Dementia Care Questions Li Zhuochun, Xie Bo, Hilsabeck Robin, Aguirre Alyssa, Zou Ning, Luo Zhimeng, He Daqing
- Dogerm: Equipping Reward Models With Domain Knowledge Through Model Merging Lin Tzu-han, Li Chen-an, Lee Hung-yi, Chen Yun-nung
- Improving Visual Storytelling With Multimodal Large Language Models Lin Xiaochuan, Chen Xiangyong
- Evaluating Text-to-visual Generation With Image-to-text Generation Lin Zhiqiu, Pathak Deepak, Li Baiqi, Li Jiayao, Xia Xide, Neubig Graham, Zhang Pengchuan, Ramanan Deva
- Local And Global Contexts For Conversation Lin Zuoquan, Shen Xinyi
- Cool-fusion: Fuse Large Language Models Without Training Liu Cong, Quan Xiaojun, Pan Yan, Lin Liang, Wu Weigang, Chen Xu
- Lumina-mgpt: Illuminate Flexible Photorealistic Text-to-image Generation With Multimodal Generative Pretraining Liu Dongyang, Zhao Shitian, Zhuo Le, Lin Weifeng, Qiao Yu, Li Hongsheng, Gao Peng
- VP-LLM: Text-driven 3D Volume Completion With Large Language Models Through Patchification Liu Jianmeng, Liu Yichen, Zhang Yuyao, Meng Zeyuan, Tai Yu-wing, Tang Chi-keung
- Tiny Refinements Elicit Resilience: Toward Efficient Prefix-model Against LLM Red-teaming Liu Jiaxu, Yin Xiangyu, Wu Sihao, Wang Jianhong, Fang Meng, Yi Xinping, Huang Xiaowei
- LLM4GEN: Leveraging Semantic Representation Of Llms For Text-to-image Generation Liu Mushui, Ma Yuhang, Zhen Yang, Dan Jun, Yu Yunlong, Zhao Zeng, Hu Zhipeng, Liu Bai, Fan Changjie
- Harnessing Large Language Models For Multimodal Product Bundling Liu Xiaohao, Wu Jie, Tao Zhulin, Ma Yunshan, Wei Yinwei, Chua Tat-seng
- Understanding Llms: A Comprehensive Overview From Training To Inference Liu Yiheng, He Hao, Han Tianle, Zhang Xu, Liu Mengyuan, Tian Jiaming, Zhang Yutong, Wang Jiaqi, Gao Xiaohui, Zhong Tianyang, Pan Yi, Xu Shaochen, Wu Zihao, Liu Zhengliang, Zhang Xin, Zhang Shu, Hu Xintao, Zhang Tuo, Qiang Ning, Liu Tianming, Ge Bao
- Synthvlm: High-efficiency And High-quality Synthetic Data For Vision Language Models Liu Zheng, Liang Hao, Huang Xijie, Xiong Wentao, Yu Qinhan, Sun Linzhuang, Chen Chong, He Conghui, Cui Bin, Zhang Wentao
- Online Merging Optimizers For Boosting Rewards And Mitigating Tax In Alignment Lu Keming, Yu Bowen, Huang Fei, Fan Yang, Lin Runji, Zhou Chang
- Ovis: Structural Embedding Alignment For Multimodal Large Language Model Lu Shiyin, Li Yang, Chen Qing-guo, Xu Zhao, Luo Weihua, Zhang Kaifu, Ye Han-jia
- Fine-tuning Large Language Models For Domain Adaptation: Exploration Of Training Strategies, Scaling, Model Merging And Synergistic Capabilities Lu Wei, Luu Rachel K., Buehler Markus J.
- Generative Students: Using Llm-simulated Student Profiles To Support Question Item Evaluation Lu Xinyi, Wang Xu
- SPP: Sparsity-preserved Parameter-efficient Fine-tuning For Large Language Models Lu Xudong, Zhou Aojun, Xu Yuhui, Zhang Renrui, Gao Peng, Li Hongsheng
- DEEM: Diffusion Models Serve As The Eyes Of Large Language Models For Image Perception Luo Run, Li Yunshui, Chen Longze, He Wanwei, Lin Ting-en, Liu Ziqiang, Zhang Lei, Song Zikai, Xia Xiaobo, Liu Tongliang, Yang Min, Hui Binyuan
- Automatic Summarization Of Doctor-patient Encounter Dialogues Using Large Language Model Through Prompt Tuning Lyu Mengxian, Peng Cheng, Li Xiaohan, Balian Patrick, Bian Jiang, Wu Yonghui
- Perturbation-restrained Sequential Model Editing Ma Jun-yu, Wang Hong, Xu Hao-xiang, Ling Zhen-hua, Gu Jia-chen
- Exploring The Capabilities Of Prompted Large Language Models In Educational And Assessment Applications Maity Subhankar, Deroy Aniket, Sarkar Sudeshna
- Understanding And Mitigating Language Confusion In Llms Marchisio Kelly, Ko Wei-yin, Bérard Alexandre, Dehaze Théo, Ruder Sebastian
- Pack Of Llms: Model Fusion At Test-time Via Perplexity Optimization Mavromatis Costas, Karypis Petros, Karypis George
- Inadequacies Of Large Language Model Benchmarks In The Era Of Generative Artificial Intelligence Mcintosh Timothy R., Susnjak Teo, Liu Tong, Watters Paul, Halgamuge Malka N.
- A Survey Of Resource-efficient LLM And Multimodal Foundation Models Mengwei Xu, Wangsong Yin, Dongqi Cai, Rongjie Yi, Daliang Xu, Qipeng Wang, Bingyang Wu, Yihao Zhao, Chen Yang, Shihe Wang, Qiyang Zhang, Zhenyan Lu, Li Zhang, Shangguang Wang, Yuanchun Li, Yunxin Liu, Xin Jin, Xuanzhe Liu
- Large Language Model (LLM) AI Text Generation Detection Based On Transformer Deep Learning Algorithm Mo Yuhong, Qin Hao, Dong Yushan, Zhu Ziyi, Li Zhenglin
- Exploring The Landscape Of Large Language Models: Foundations, Techniques, And Challenges Moradi Milad, Yan Ke, Colwell David, Samwald Matthias, Asgari Rhona
- Advancing Vietnamese Visual Question Answering With Transformer And Convolutional Integration Nguyen Ngoc Son, Nguyen Van Son, Le Tung
- Gpt-ology, Computational Models, Silicon Sampling: How Should We Think About Llms In Cognitive Science? Ong Desmond C.
- Outliers And Calibration Sets Have Diminishing Effect On Quantization Of Modern Llms Paglieri Davide, Dash Saurabh, Rocktäschel Tim, Parker-holder Jack
- Investigating LLM Applications In E-commerce Palen-michel Chester, Wang Ruixiang, Zhang Yipeng, Yu David, Xu Canran, Wu Zhe
- Lottery Ticket Adaptation: Mitigating Destructive Interference In Llms Panda Ashwinee, Isik Berivan, Qi Xiangyu, Koyejo Sanmi, Weissman Tsachy, Mittal Prateek
- Picturing Ambiguity: A Visual Twist On The Winograd Schema Challenge Park Brendan, Janecek Madeline, Ezzati-jivan Naser, Li Yifeng, Emami Ali
- The Ultimate Guide To Fine-tuning Llms From Basics To Breakthroughs: An Exhaustive Review Of Technologies, Research, Best Practices, Applied Research Challenges And Opportunities Parthasarathy Venkatesh Balavadhani, Zafar Ahtsham, Khan Aafaq, Shahid Arsalan
- Learnable In-context Vector For Visual Question Answering Peng Yingzhe, Hao Chenduo, Yang Xu, Peng Jiawei, Hu Xinting, Geng Xin
- Coherent Zero-shot Visual Instruction Generation Phung Quynh, Ge Songwei, Huang Jia-bin
- Mamba-ptq: Outlier Channels In Recurrent Large Language Models Pierro Alessandro, Abreu Steven
- Revisiting Verilogeval: Newer Llms, In-context Learning, And Specification-to-rtl Tasks Pinckney Nathaniel, Batten Christopher, Liu Mingjie, Ren Haoxing, Khailany Brucek
- Diffusiongpt: Llm-driven Text-to-image Generation System Qin Jie, Wu Jie, Chen Weifeng, Ren Yuxi, Li Huixia, Wu Hefeng, Xiao Xuefeng, Wang Rui, Wen Shilei
- Megen: Generative Backdoor In Large Language Models Via Model Editing Qiu Jiyang, Ma Xinbei, Zhang Zhuosheng, Zhao Hai
- Tool Learning With Large Language Models: A Survey Qu Changle, Dai Sunhao, Wei Xiaochi, Cai Hengyi, Wang Shuaiqiang, Yin Dawei, Xu Jun, Wen Ji-rong
- Tokenrec: Learning To Tokenize ID For Llm-based Generative Recommendation Qu Haohao, Fan Wenqi, Zhao Zihuai, Li Qing
- Unveiling And Mitigating Memorization In Text-to-image Diffusion Models Through Cross Attention Ren Jie, Li Yaxin, Zeng Shenglai, Xu Han, Lyu Lingjuan, Xing Yue, Tang Jiliang
- Self-distillation For Model Stacking Unlocks Cross-lingual NLU In 200+ Languages Schmidt Fabian David, Borchert Philipp, Vulić Ivan, Glavaš Goran
- Metacognitive Myopia In Large Language Models Scholten Florian, Rebholz Tobias R., Hütter Mandy
- Tencdm: Understanding The Properties Of Diffusion Model In The Space Of Language Model Encodings Shabalin Alexander, Meshchaninov Viacheslav, Chimbulatov Egor, Lapikov Vladislav, Kim Roman, Bartosh Grigory, Molchanov Dmitry, Markov Sergey, Vetrov Dmitry
- An End-to-end Speech Summarization Using Large Language Model Shang Hengchao, Li Zongyao, Guo Jiaxin, Li Shaojun, Rao Zhiqiang, Luo Yuanchang, Wei Daimeng, Yang Hao
- Learning To Decode Collaboratively With Multiple Language Models Shen Shannon Zejiang, Lang Hunter, Wang Bailin, Kim Yoon, Sontag David
- PMG: Personalized Multimodal Generation With Large Language Models Shen Xiaoteng, Zhang Rui, Zhao Xiaoyan, Zhu Jieming, Xiao Xi
- Action Controlled Paraphrasing Shi Ning, Wu Zijun
- Reslora: Identity Residual Mapping In Low-rank Adaption Shi Shuhua, Huang Shaohan, Song Minghui, Li Zhoujun, Zhang Zihan, Huang Haizhen, Wei Furu, Deng Weiwei, Sun Feng, Zhang Qi
- Profuser: Progressive Fusion Of Large Language Models Shi Tianyuan, Wan Fanqi, Huang Canbin, Quan Xiaojun, Li Chenliang, Yan Ming, Zhang Ji
- Ehragent: Code Empowers Large Language Models For Few-shot Complex Tabular Reasoning On Electronic Health Records Shi Wenqi, Xu Ran, Zhuang Yuchen, Yu Yue, Zhang Jieyu, Wu Hang, Zhu Yuanda, Ho Joyce, Yang Carl, Wang May D.
- Abusegpt: Abuse Of Generative AI Chatbots To Create Smishing Campaigns Shibli Ashfak Md, Pritom Mir Mehedi A., Gupta Maanak
- Can Prompt Modifiers Control Bias? A Comparative Analysis Of Text-to-image Generative Models Shin Philip Wootaek, Ahn Jihyun Janice, Yin Wenpeng, Sampson Jack, Narayanan Vijaykrishnan
- Rethinking Interpretability In The Era Of Large Language Models Singh Chandan, Inala Jeevana Priya, Galley Michel, Caruana Rich, Gao Jianfeng
- Are Small Language Models Ready To Compete With Large Language Models For Practical Applications? Sinha Neelabh, Jain Vinija, Chadha Aman
- Moma: Multimodal LLM Adapter For Fast Personalized Image Generation Song Kunpeng, Zhu Yizhe, Liu Bingchen, Yan Qing, Elgammal Ahmed, Yang Xiao
- Hierarchical Context Merging: Better Long Context Understanding For Pre-trained Llms Song Woomin, Oh Seunghyuk, Mo Sangwoo, Kim Jaehyung, Yun Sukmin, Ha Jung-woo, Shin Jinwoo
- Interpreting And Learning Voice Commands With A Large Language Model For A Robot System Stankevich Stanislau, Dudek Wojciech
- Parallelparc: A Scalable Pipeline For Generating Natural-language Analogies Sultan Oren, Bitton Yonatan, Yosef Ron, Shahaf Dafna
- Towards Detecting Llms Hallucination Via Markov Chain-based Multi-agent Debate Framework Sun Xiaoxi, Li Jinpeng, Zhong Yan, Zhao Dongyan, Yan Rui
- Multi-turn Context Jailbreak Attack On Large Language Models From First Principles Sun Xiongtao, Zhang Deyue, Yang Dongdong, Zou Quanchen, Li Hui
- LCS: A Language Converter Strategy For Zero-shot Neural Machine Translation Sun Zengkui, Liu Yijin, Meng Fandong, Xu Jinan, Chen Yufeng, Zhou Jie
- Unlocking The Potential Of Model Merging For Low-resource Languages Tao Mingxu, Zhang Chen, Huang Quzhe, Ma Tianyao, Huang Songfang, Zhao Dongyan, Feng Yansong
- Targeted Visual Prompting For Medical Visual Question Answering Tascon-morales Sergio, Márquez-neila Pablo, Sznitman Raphael
- Atomic Self-consistency For Better Long Form Generations Thirukovalluru Raghuveer, Huang Yukun, Dhingra Bhuwan
- An Ecosage Assistant: Towards Building A Multimodal Plant Care Dialogue Assistant Tomar Mohit, Tiwari Abhisek, Saha Tulika, Jha Prince, Saha Sriparna
- No Size Fits All: The Perils And Pitfalls Of Leveraging Llms Vary With Company Size Urlana Ashok, Kumar Charaka Vinayak, Garlapati Bala Mallikarjunarao, Singh Ajeet Kumar, Mishra Rahul
- Infusing Knowledge Into Large Language Models With Contextual Prompts Vasisht Kinshuk, Ganesan Balaji, Kumar Vikas, Bhatnagar Vasudha
- Fusechat: Knowledge Fusion Of Chat Models Wan Fanqi, Zhong Longguang, Yang Ziyi, Chen Ruijun, Quan Xiaojun
- DELL: Generating Reactions And Explanations For Llm-based Misinformation Detection Wan Herun, Feng Shangbin, Tan Zhaoxuan, Wang Heng, Tsvetkov Yulia, Luo Minnan
- LOOK-M: Look-once Optimization In KV Cache For Efficient Multimodal Long-context Inference Wan Zhongwei, Wu Ziang, Liu Che, Huang Jinfa, Zhu Zhihong, Jin Peng, Wang Longyue, Yuan Li
- Information Theoretic Text-to-image Alignment Wang Chao, Franzese Giulio, Finamore Alessandro, Gallo Massimo, Michiardi Pietro
- Role Prompting Guided Domain Adaptation With General Capability Preserve For Large Language Models Wang Rui, Mi Fei, Chen Yi, Xue Boyang, Wang Hongru, Zhu Qi, Wong Kam-fai, Xu Ruifeng
- Prefclm: Enhancing Preference-based Reinforcement Learning With Crowdsourced Large Language Models Wang Ruiqi, Zhao Dezhong, Yuan Ziqin, Obi Ike, Min Byung-cheol
- Ladic: Are Diffusion Models Really Inferior To Autoregressive Counterparts For Image-to-text Generation? Wang Yuchi, Ren Shuhuai, Gao Rundong, Yao Linli, Guo Qingyan, An Kaikai, Bai Jianhong, Sun Xu
- REAR: A Relevance-aware Retrieval-augmented Framework For Open-domain Question Answering Wang Yuhao, Ren Ruiyang, Li Junyi, Zhao Wayne Xin, Liu Jing, Wen Ji-rong
- Promptcharm: Text-to-image Generation Through Multi-modal Prompting And Refinement Wang Zhijie, Huang Yuheng, Song Da, Ma Lei, Zhang Tianyi
- Re2llm: Reflective Reinforcement Large Language Model For Session-based Recommendation Wang Ziyan, Du Yingpeng, Sun Zhu, Chua Haoyan, Feng Kaidong, Wang Wenya, Zhang Jie
- Browse And Concentrate: Comprehending Multimodal Content Via Prior-llm Context Fusion Wang Ziyue, Chen Chi, Zhu Yiqi, Luo Fuwen, Li Peng, Yan Ming, Zhang Ji, Huang Fei, Sun Maosong, Liu Yang
- Longvlm: Efficient Long Video Understanding Via Large Language Models Weng Yuetian, Han Mingfei, He Haoyu, Chang Xiaojun, Zhuang Bohan
- Motionllm: Multimodal Motion-language Learning With Large Language Models Wu Qi, Zhao Yubo, Wang Yifan, Tai Yu-wing, Tang Chi-keung
- VILA-U: A Unified Foundation Model Integrating Visual Understanding And Generation Wu Yecheng, Zhang Zhuoyang, Chen Junyu, Tang Haotian, Li Dacheng, Fang Yunhao, Zhu Ligeng, Xie Enze, Yin Hongxu, Yi Li, Han Song, Lu Yao
- Difflora: Generating Personalized Low-rank Adaptation Weights With Diffusion Wu Yujia, Shi Yiming, Wei Jiwei, Sun Chengwei, Zhou Yuyang, Yang Yang, Shen Heng Tao
- Instructing Large Language Models To Identify And Ignore Irrelevant Conditions Wu Zhenyu, Shen Chao, Jiang Meng
- Pandora: Towards General World Model With Natural Language Actions And Video States Xiang Jiannan, Liu Guangyi, Gu Yi, Gao Qiyue, Ning Yuting, Zha Yuheng, Feng Zeyu, Tao Tianhua, Hao Shibo, Shi Yemin, Liu Zhengzhong, Xing Eric P., Hu Zhiting
- Rar-b: Reasoning As Retrieval Benchmark Xiao Chenghao, Hudson G Thomas, Moubayed Noura Al
- Show-o: One Single Transformer To Unify Multimodal Understanding And Generation Xie Jinheng, Mao Weijia, Bai Zechen, Zhang David Junhao, Wang Weihao, Lin Kevin Qinghong, Gu Yuchao, Chen Zhijie, Yang Zhenheng, Shou Mike Zheng
- Large Multimodal Agents: A Survey Xie Junlin, Chen Zhihong, Zhang Ruifei, Wan Xiang, Li Guanbin
- Decomposition For Enhancing Attention: Improving Llm-based Text-to-sql Through Workflow Paradigm Xie Yuanzhen, Jin Xinzhou, Xie Tao, Lin Mingxiong, Chen Liang, Yu Chenyun, Cheng Lei, Zhuo Chengxiang, Hu Bo, Li Zang
- Mitigating Hallucinations In Large Vision-language Models With Instruction Contrastive Decoding Xintong Wang, Jingheng Pan, Liang Ding, Chris Biemann
- Mixture-of-instructions: Comprehensive Alignment Of A Large Language Model Through The Mixture Of Diverse System Prompting Instructions Xu Bowen, Wu Shaoyu, Liu Kai, Hu Lulu
- Stylemaster: Towards Flexible Stylized Image Generation With Diffusion Models Xu Chengming, Hu Kai, Luo Donghao, Zhang Jiangning, Li Wei, Ge Yanhao, Wang Chengjie
- Meteora: Multiple-tasks Embedded Lora For Large Language Models Xu Jingwei, Lai Junyu, Huang Yunpeng
- Mastering Text-to-image Diffusion: Recaptioning, Planning, And Generating With Multimodal Llms Yang Ling, Yu Zhaochen, Meng Chenlin, Xu Minkai, Ermon Stefano, Cui Bin
- Cacheblend: Fast Large Language Model Serving For RAG With Cached Knowledge Fusion Yao Jiayi, Li Hanchen, Liu Yuhan, Ray Siddhant, Cheng Yihua, Zhang Qizheng, Du Kuntai, Lu Shan, Jiang Junchen
- Improving Visual Commonsense In Language Models Via Multiple Image Generation Yariv Guy, Schwartz Idan, Adi Yossi, Benaim Sagie
- X-VILA: Cross-modality Alignment For Large Language Model Ye Hanrong, Huang De-an, Lu Yao, Yu Zhiding, Ping Wei, Tao Andrew, Kautz Jan, Han Song, Xu Dan, Molchanov Pavlo, Yin Hongxu
- Large Language Models As Hyper-heuristics For Combinatorial Optimization Ye Haoran, Wang Jiarui, Cao Zhiguang, Berto Federico, Hua Chuanbo, Kim Haeyeon, Park Jinkyoo, Song Guojie
- Toolsword: Unveiling Safety Issues Of Large Language Models In Tool Learning Across Three Stages Ye Junjie, Li Sixian, Li Guanyu, Huang Caishuang, Gao Songyang, Wu Yilong, Zhang Qi, Gui Tao, Huang Xuanjing
- Harnessing Multimodal Large Language Models For Multimodal Sequential Recommendation Ye Yuyang, Zheng Zhi, Shen Yishan, Wang Tianshu, Zhang Hengruo, Zhu Peijun, Yu Runlong, Zhang Kai, Xiong Hui
- Extend Model Merging From Fine-tuned To Pre-trained Large Language Models Via Weight Disentanglement Yu Le, Yu Bowen, Yu Haiyang, Huang Fei, Li Yongbin
- Rigorllm: Resilient Guardrails For Large Language Models Against Undesired Content Yuan Zhuowen, Xiong Zidi, Zeng Yi, Yu Ning, Jia Ruoxi, Song Dawn, Li Bo
- Investigating Continual Pretraining In Large Language Models: Insights And Implications Yıldız Çağatay, Ravichandran Nishaanth Kanna, Punia Prishruit, Bethge Matthias, Ermis Beyza
- Understanding And Mitigating Compositional Issues In Text-to-image Generative Models Zarei Arman, Rezaei Keivan, Basu Samyadeep, Saberi Mehrdad, Moayeri Mazda, Kattakinda Priyatham, Feizi Soheil
- Advancing Multimodal Large Language Models In Chart Question Answering With Visualization-referenced Instruction Tuning Zeng Xingchen, Lin Haichuan, Ye Yilin, Zeng Wei
- Distilling Implicit Multimodal Knowledge Into Llms For Zero-resource Dialogue Generation Zhang Bo, Ma Hui, Ding Jian, Wang Jian, Xu Bo, Lin Hongfei
- LUQ: Long-text Uncertainty Quantification For Llms Zhang Caiqi, Liu Fangyu, Basaldella Marco, Collier Nigel
- Notellm-2: Multimodal Large Representation Models For Recommendation Zhang Chao, Zhang Haoxin, Wu Shiwei, Wu Di, Xu Tong, Gao Yan, Hu Yao, Chen Enhong
- Knowhalu: Hallucination Detection Via Multi-form Knowledge Based Factual Checking Zhang Jiawei, Xu Chejian, Gai Yu, Lecue Freddy, Song Dawn, Li Bo
- Large Language Models As Evaluators For Recommendation Explanations Zhang Xiaoyu, Li Yishan, Wang Jiayin, Sun Bowen, Ma Weizhi, Sun Peijie, Zhang Min
- Llms Instruct Llms: An Extraction And Editing Method Zhang Xin, Ju Tianjie, Liang Huijia, Fu Ying, Zhang Qin
- Found In The Middle: How Language Models Use Long Contexts Better Via Plug-and-play Positional Encoding Zhang Zhenyu, Chen Runjin, Liu Shiwei, Yao Zhewei, Ruwase Olatunji, Chen Beidi, Wu Xiaoxia, Wang Zhangyang
- Revolutionizing Finance With Llms: An Overview Of Applications And Insights Zhao Huaqin, Liu Zhengliang, Wu Zihao, Li Yiwei, Yang Tianze, Shu Peng, Xu Shaochen, Dai Haixing, Zhao Lin, Mai Gengchen, Liu Ninghao, Liu Tianming
- Towards Comprehensive And Efficient Post Safety Alignment Of Large Language Models Via Safety Patching Zhao Weixiang, Hu Yulin, Li Zhuojun, Deng Yang, Zhao Yanyan, Qin Bing, Chua Tat-seng
- Mg-llava: Towards Multi-granularity Visual Instruction Tuning Zhao Xiangyu, Li Xiangtai, Duan Haodong, Huang Haian, Li Yining, Chen Kai, Yang Hua
- Chemdfm: Dialogue Foundation Model For Chemistry Zhao Zihan, Ma Da, Chen Lu, Sun Liangtai, Li Zihao, Xu Hongshen, Zhu Zichen, Zhu Su, Fan Shuai, Shen Guodong, Chen Xin, Yu Kai
- Mistral-c2f: Coarse To Fine Actor For Analytical And Reasoning Enhancement In RLHF And Effective-merged Llms Zheng Chen, Sun Ke, Zhou Xun
- Towards Lifelong Learning Of Large Language Models: A Survey Zheng Junhao, Qiu Shengjie, Shi Chengming, Ma Qianli
- Orchestrating Llms With Different Personalizations Zhou Jin Peng, Luo Katie Z, Gu Jingwen, Yuan Jason, Weinberger Kilian Q., Sun Wen
- Metagpt: Merging Large Language Models Using Model Exclusive Task Arithmetic Zhou Yuyan, Song Liang, Wang Bingning, Chen Weipeng
- Llmbind: A Unified Modality-task Integration Framework Zhu Bin, Ning Munan, Jin Peng, Lin Bin, Huang Jinfa, Song Qi, Zhang Junwu, Tang Zhenyu, Pan Mingjun, Zhou Xing, Yuan Li
- Rageval: Scenario Specific RAG Evaluation Dataset Generation Framework Zhu Kunlun, Luo Yifan, Xu Dingling, Wang Ruobing, Yu Shi, Wang Shuo, Yan Yukun, Liu Zhenghao, Han Xu, Liu Zhiyuan, Sun Maosong
- Prompting Large Language Models For Zero-shot Clinical Prediction With Structured Longitudinal Electronic Health Record Data Zhu Yinghao, Wang Zixiang, Gao Junyi, Tong Yuning, An Jingkun, Liao Weibin, Harrison Ewen M., Ma Liantao, Pan Chengwei
- Are Large Language Models Good Statisticians? Zhu Yizhang, Du Shiyin, Li Boyan, Luo Yuyu, Tang Nan
- Lumina-next: Making Lumina-t2x Stronger And Faster With Next-dit Zhuo Le, Du Ruoyi, Xiao Han, Li Yangguang, Liu Dongyang, Huang Rongjie, Liu Wenze, Zhao Lirui, Wang Fu-yun, Ma Zhanyu, Luo Xu, Wang Zehan, Zhang Kaipeng, Zhu Xiangyang, Liu Si, Yue Xiangyu, Liu Dingning, Ouyang Wanli, Liu Ziwei, Qiao Yu, Li Hongsheng, Gao Peng
🏷 Model Architecture
- Natural Language Processing (almost) From Scratch Ronan Collobert, Jason Weston, Leon Bottou, Michael Karlen, Koray Kavukcuoglu, Pavel Kuksa
- Multimodal Attention For Neural Machine Translation Caglayan Ozan, Barrault Loïc, Bougares Fethi
- Joint Copying And Restricted Generation For Paraphrase Cao Ziqiang, Luo Chuwei, Li Wenjie, Li Sujian
- Long Short-term Memory-networks For Machine Reading Cheng Jianpeng, Dong Li, Lapata Mirella
- Tree-to-sequence Attentional Neural Machine Translation Eriguchi Akiko, Hashimoto Kazuma, Tsuruoka Yoshimasa
- Multi-way, Multilingual Neural Machine Translation With A Shared Attention Mechanism Firat Orhan, Cho Kyunghyun, Bengio Yoshua
- Multimodal Compact Bilinear Pooling For Visual Question Answering And Visual Grounding Fukui Akira, Park Dong Huk, Yang Daylen, Rohrbach Anna, Darrell Trevor, Rohrbach Marcus
- Character-level Question Answering With Attention Golub David, He Xiaodong
- Revisiting Visual Question Answering Baselines Jabri Allan, Joulin Armand, Van Der Maaten Laurens
- Google's Multilingual Neural Machine Translation System: Enabling Zero-shot Translation Johnson Melvin, Schuster Mike, Le Quoc V., Krikun Maxim, Wu Yonghui, Chen Zhifeng, Thorat Nikhil, Viégas Fernanda, Wattenberg Martin, Corrado Greg, Hughes Macduff, Dean Jeffrey
- Lstm-based Mixture-of-experts For Knowledge-aware Dialogues Le Phong, Dymetman Marc, Renders Jean-michel
- Rationalizing Neural Predictions Lei Tao, Barzilay Regina, Jaakkola Tommi
- Coherent Dialogue With Attention-based Language Models Mei Hongyuan, Bansal Mohit, Walter Matthew R.
- Interactive Attention For Neural Machine Translation Meng Fandong, Lu Zhengdong, Li Hang, Liu Qun
- Temporal Attention Model For Neural Machine Translation Sankaran Baskaran, Mi Haitao, Al-onaizan Yaser, Ittycheriah Abe
- Generative Deep Neural Networks For Dialogue: A Short Review Serban Iulian Vlad, Lowe Ryan, Charlin Laurent, Pineau Joelle
- Image Captioning With Deep Bidirectional Lstms Wang Cheng, Yang Haojin, Bartz Christian, Meinel Christoph
- Neural Machine Translation Advised By Statistical Machine Translation Wang Xing, Lu Zhengdong, Tu Zhaopeng, Li Hang, Xiong Deyi, Zhang Min
- Topic Aware Neural Response Generation Xing Chen, Wu Wei, Wu Yu, Liu Jie, Huang Yalou, Zhou Ming, Ma Wei-ying
- Neural Contextual Conversation Learning With Labeled Question-answering Pairs Xiong Kun, Cui Anqi, Zhang Zefeng, Li Ming
- An Attentional Neural Conversation Model With Improved Specificity Yao Kaisheng, Peng Baolin, Zweig Geoffrey, Wong Kam-fai
- End-to-end Answer Chunk Extraction And Ranking For Reading Comprehension Yu Yang, Zhang Wei, Hasan Kazi, Yu Mo, Xiang Bing, Zhou Bowen
- Don't Just Assume; Look And Answer: Overcoming Priors For Visual Question Answering Agrawal Aishwarya, Batra Dhruv, Parikh Devi, Kembhavi Aniruddha
- Weighted Transformer Network For Machine Translation Ahmed Karim, Keskar Nitish Shirish, Socher Richard
- Attention Is All You Need Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, Illia Polosukhin
- Are Emojis Predictable? Barbieri Francesco, Ballesteros Miguel, Saggion Horacio
- Lig-cristal System For The WMT17 Automatic Post-editing Task Berard Alexandre, Pietquin Olivier, Besacier Laurent
- Towards Neural Machine Translation With Latent Tree Attention Bradbury James, Socher Richard
- Incorporating Global Visual Features Into Attention-based Neural Machine Translation Calixto Iacer, Liu Qun, Campbell Nick
- Enabling Multi-source Neural Machine Translation By Concatenating Source Sentences In Multiple Languages Dabre Raj, Cromieres Fabien, Kurohashi Sadao
- Neural Machine Translation Leveraging Phrase-based Models In A Hybrid Search Dahlmann Leonard, Matusov Evgeny, Petrushkov Pavel, Khadivi Shahram
- Frustratingly Short Attention Spans In Neural Language Modeling Daniluk Michał, Rocktäschel Tim, Welbl Johannes, Riedel Sebastian
- An Empirical Study On The Effectiveness Of Images In Multimodal Neural Machine Translation Delbrouck Jean-benoit, Dupont Stéphane
- Robustfill: Neural Program Learning Under Noisy I/O Devlin Jacob, Uesato Jonathan, Bhupatiraju Surya, Singh Rishabh, Mohamed Abdel-rahman, Kohli Pushmeet
- A Copy-augmented Sequence-to-sequence Architecture Gives Good Performance On Task-oriented Dialogue Eric Mihail, Manning Christopher D.
- Ruminating Reader: Reasoning With Gated Multi-hop Attention Gong Yichen, Bowman Samuel R.
- Sockeye: A Toolkit For Neural Machine Translation Hieber Felix, Domhan Tobias, Denkowski Michael, Vilar David, Sokolov Artem, Clifton Ann, Post Matt
- Reinforced Mnemonic Reader For Machine Reading Comprehension Hu Minghao, Peng Yuxing, Huang Zhen, Qiu Xipeng, Wei Furu, Zhou Ming
- An Exploration Of Neural Sequence-to-sequence Architectures For Automatic Post-editing Junczys-dowmunt Marcin, Grundkiewicz Roman
- Simple Recurrent Units For Highly Parallelizable Recurrence Lei Tao, Zhang Yu, Wang Sida I., Dai Hui, Artzi Yoav
- Attention Strategies For Multi-source Sequence-to-sequence Learning Libovický Jindřich, Helcl Jindřich
- Task-driven Visual Saliency And Attention-based Visual Question Answering Lin Yuetan, Pang Zhangyang, Wang Donghui, Zhuang Yueting
- Ivqa: Inverse Visual Question Answering Liu Feng, Xiang Tao, Hospedales Timothy M., Yang Wankou, Sun Changyin
- Phase Conductor On Multi-layered Attentions For Machine Comprehension Liu Rui, Wei Wei, Mao Weiguang, Chikina Maria
- Table-to-text Generation By Structure-aware Seq2seq Learning Liu Tianyu, Wang Kexiang, Sha Lei, Chang Baobao, Sui Zhifang
- Best Of Both Worlds: Transferring Knowledge From Discriminative Learning To A Generative Visual Dialog Model Lu Jiasen, Kannan Anitha, Yang Jianwei, Parikh Devi, Batra Dhruv
- A Practical Approach To Dialogue Response Generation In Closed Domains Lu Yichao, Keung Phillip, Zhang Shaonan, Sun Jason, Bhardwaj Vikas
- Visual Question Answering With Memory-augmented Networks Ma Chao, Shen Chunhua, Dick Anthony, Wu Qi, Wang Peng, Hengel Anton Van Den, Reid Ian
- Parlai: A Dialog Research Software Platform Miller Alexander H., Feng Will, Fisch Adam, Lu Jiasen, Batra Dhruv, Bordes Antoine, Parikh Devi, Weston Jason
- A Generative Approach To Question Answering Mitra Rajarshee
- Shortcut-stacked Sentence Encoders For Multi-domain Inference Nie Yixin, Bansal Mohit
- Contextualized Word Representations For Reading Comprehension Salant Shimi, Berant Jonathan
- Visual Reference Resolution Using Attention Memory For Visual Dialog Seo Paul Hongsuck, Lehrmann Andreas, Han Bohyung, Sigal Leonid
- Generating High-quality And Informative Conversation Responses With Sequence-to-sequence Models Shao Louis, Gouws Stephan, Britz Denny, Goldie Anna, Strope Brian, Kurzweil Ray
- Recurrent And Contextual Models For Visual Question Answering Sharang Abhijit, Lau Eric
- An Empirical Analysis Of Multiple-turn Reasoning Strategies In Reading Comprehension Tasks Shen Yelong, Liu Xiaodong, Duh Kevin, Gao Jianfeng
- A Unified Query-based Generative Model For Question Generation And Question Answering Song Linfeng, Wang Zhiguo, Hamza Wael
- Tips And Tricks For Visual Question Answering: Learnings From The 2017 Challenge Teney Damien, Anderson Peter, He Xiaodong, Hengel Anton Van Den
- Self-attentive Residual Decoder For Neural Machine Translation Werlen Lesly Miculicich, Pappas Nikolaos, Ram Dhananjay, Popescu-belis Andrei
- Are You Talking To Me? Reasoned Visual Dialog Generation Through Adversarial Learning Wu Qi, Wang Peng, Shen Chunhua, Reid Ian, Hengel Anton Van Den
- Hierarchical Recurrent Attention Network For Response Generation Xing Chen, Wu Wei, Wu Yu, Zhou Ming, Huang Yalou, Ma Wei-ying
- The Forgettable-watcher Model For Video Question Answering Xue Hongyang, Zhao Zhou, Cai Deng
- Deepprobe: Information Directed Sequence Understanding And Chatbot Design Via Recurrent Neural Networks Yin Zi, Chang Keng-hao, Zhang Ruofei
- Code2seq: Generating Sequences From Structured Representations Of Code Alon Uri, Brody Shaked, Levy Omer, Yahav Eran
- Multilingual Extractive Reading Comprehension By Runtime Machine Translation Asai Akari, Eriguchi Akiko, Hashimoto Kazuma, Tsuruoka Yoshimasa
- Attention Based Natural Language Grounding By Navigating Virtual Environment B Akilesh, Sinha Abhishek, Sarkar Mausoom, Krishnamurthy Balaji
- Training Deeper Neural Machine Translation Models With Transparent Attention Bapna Ankur, Chen Mia Xu, Firat Orhan, Cao Yuan, Wu Yonghui
- Commonsense For Generative Multi-hop Question Answering Tasks Bauer Lisa, Wang Yicheng, Bansal Mohit
- LIUM-CVC Submissions For WMT18 Multimodal Translation Task Caglayan Ozan, Bardet Adrien, Bougares Fethi, Barrault Loïc, Wang Kai, Masana Marc, Herranz Luis, Van De Weijer Joost
- Behavior Analysis Of NLI Models: Uncovering The Influence Of Three Factors On Robustness Carmona Vicente Ivan Sanchez, Mitchell Jeff, Riedel Sebastian
- Textually Enriched Neural Module Networks For Visual Question Answering Chandu Khyathi Raghavi, Pyreddy Mary Arpita, Felix Matthieu, Joshi Narendra Nath
- The Best Of Both Worlds: Combining Recent Advances In Neural Machine Translation Chen Mia Xu, Firat Orhan, Bapna Ankur, Johnson Melvin, Macherey Wolfgang, Foster George, Jones Llion, Parmar Niki, Schuster Mike, Chen Zhifeng, Wu Yonghui, Hughes Macduff
- Augmenting Neural Response Generation With Context-aware Topical Attention Dziri Nouha, Kamalloo Ehsan, Mathewson Kory W., Zaiane Osmar
- Reciprocal Attention Fusion For Visual Question Answering Farazi Moshiur R, Khan Salman H
- SALSA-TEXT: Self Attentive Latent Space Based Adversarial Text Generation Gagnon-marchand Jules, Sadeghi Hamed, Haidar Md. Akmal, Rezagholizadeh Mehdi
- Question-guided Hybrid Convolution For Visual Question Answering Gao Peng, Lu Pan, Li Hongsheng, Li Shuang, Li Yikang, Hoi Steven, Wang Xiaogang
- Generating Distractors For Reading Comprehension Questions From Real Examinations Gao Yifan, Bing Lidong, Li Piji, King Irwin, Lyu Michael R.
- The Memad Submission To The WMT18 Multimodal Translation Task Grönroos Stig-arne, Huet Benoit, Kurimo Mikko, Laaksonen Jorma, Merialdo Bernard, Pham Phu, Sjöberg Mats, Sulubacak Umut, Tiedemann Jörg, Troncy Raphael, Vázquez Raúl
- Analysing The Potential Of Seq-to-seq Models For Incremental Interpretation In Task-oriented Dialogue Hupkes Dieuwke, Bouwmeester Sanne, Fernández Raquel
- Transfer Learning Of Language-independent End-to-end ASR With Language Model Fusion Inaguma Hirofumi, Cho Jaejin, Baskar Murali Karthick, Kawahara Tatsuya, Watanabe Shinji
- BERT: Pre-training Of Deep Bidirectional Transformers For Language Understanding Jacob Devlin, Ming-wei Chang, Kenton Lee, Kristina Toutanova
- Closed-book Training To Improve Summarization Encoder Memory Jiang Yichen, Bansal Mohit
- Ms-uedin Submission To The WMT2018 APE Shared Task: Dual-source Transformer For Automatic Post-editing Junczys-dowmunt Marcin, Grundkiewicz Roman
- SNU_IDS At Semeval-2018 Task 12: Sentence Encoder With Contextualized Vectors For Argument Reasoning Comprehension Kim Taeuk, Choi Jihun, Lee Sang-goo
- Neural Machine Translation With Adequacy-oriented Learning Kong Xiang, Tu Zhaopeng, Shi Shuming, Hovy Eduard, Zhang Tong
- Fast And Simple Mixture Of Softmaxes With BPE And Hybrid-lightrnn For Language Generation Kong Xiang, Xie Qizhe, Dai Zihang, Hovy Eduard
- A Question-focused Multi-factor Attention Network For Question Answering Kundu Souvik, Ng Hwee Tou
- Answerer In Questioner's Mind: Information Theoretic Approach To Goal-oriented Visual Dialog Lee Sang-woo, Heo Yu-jung, Zhang Byoung-tak
- Seq2rdf: An End-to-end Application For Deriving Triples From Natural Language Text Liu Yue, Zhang Tongtao, Liang Zhicheng, Ji Heng, Mcguinness Deborah L.
- Mem2seq: Effectively Incorporating Knowledge Bases Into End-to-end Task-oriented Dialog Systems Madotto Andrea, Wu Chien-sheng, Fung Pascale
- Neural Machine Translation With Key-value Memory-augmented Attention Meng Fandong, Tu Zhaopeng, Cheng Yong, Wu Haiyang, Zhai Junjie, Yang Yuekui, Wang Di
- Improved Fusion Of Visual And Language Representations By Dense Symmetric Co-attention For Visual Question Answering Nguyen Duy-kien, Okatani Takayuki
- Phrase-based Attentions Nguyen Phi Xuan, Joty Shafiq
- Polite Dialogue Generation Without Parallel Data Niu Tong, Bansal Mohit
- Recursive Visual Attention In Visual Dialog Niu Yulei, Zhang Hanwang, Zhang Manli, Zhang Jianhong, Lu Zhiwu, Wen Ji-rong
- Nlitrans At Semeval-2018 Task 12: Transfer Of Semantic Knowledge For Argument Comprehension Niven Tim, Kao Hung-yu
- Like A Baby: Visually Situated Neural Language Acquisition Ororbia Alexander G., Mali Ankur, Kelly Matthew A., Reitter David
- Dual Recurrent Attention Units For Visual Question Answering Osman Ahmed, Samek Wojciech
- Multimodal Explanations: Justifying Decisions And Pointing To The Evidence Park Dong Huk, Hendricks Lisa Anne, Akata Zeynep, Rohrbach Anna, Schiele Bernt, Darrell Trevor, Rohrbach Marcus
- Extending Neural Generative Conversational Model Using External Knowledge Sources Parthasarathi Prasanna, Pineau Joelle
- Dynamic Fusion With Intra- And Inter- Modality Attention Flow For Visual Question Answering Gao Peng, Jiang Zhengkai, You Haoxuan, Lu Pan, Hoi Steven, Wang Xiaogang, Li Hongsheng
- Contextual Parameter Generation For Universal Neural Machine Translation Platanios Emmanouil Antonios, Sachan Mrinmaya, Neubig Graham, Mitchell Tom
- Training Tips For The Transformer Model Popel Martin, Bojar Ondřej
- Beyond Task Success: A Closer Look At Jointly Learning To See, Ask, And Guesswhat Shekhar Ravi, Venkatesh Aashish, Baumgärtner Tim, Bruni Elia, Plank Barbara, Bernardi Raffaella, Fernández Raquel
- Teaching Machines To Code: Neural Markup Generation With Visual Attention Singh Sumeet S.
- Adapting Visual Question Answering Models For Enhancing Multimodal Community Q&A Platforms Srivastava Avikalp, Liu Hsin Wen, Fujita Sumio
- The University Of Cambridge's Machine Translation Systems For WMT18 Stahlberg Felix, De Gispert Adria, Byrne Bill
- An Operation Sequence Model For Explainable Neural Machine Translation Stahlberg Felix, Saunders Danielle, Byrne Bill
- Inducing Grammars With And For Neural Machine Translation Tran Ke, Bisk Yonatan
- Seq2seq And Multi-task Learning For Joint Intent And Content Extraction For Domain Specific Interpreters Velay Marc, Daniel Fabrice
- Few-shot Generalization Across Dialogue Tasks Vlasov Vladimir, Drissner-schmid Akela, Nichol Alan
- Multilingual NMT With A Language-independent Attention Bridge Vázquez Raúl, Raganato Alessandro, Tiedemann Jörg, Creutz Mathias
- Can You Tell Me How To Get Past Sesame Street? Sentence-level Pretraining Beyond Language Modeling Wang Alex, Hula Jan, Xia Patrick, Pappagari Raghavendra, Mccoy R. Thomas, Patel Roma, Kim Najoung, Tenney Ian, Huang Yinghui, Yu Katherin, Jin Shuning, Chen Berlin, Van Durme Benjamin, Grave Edouard, Pavlick Ellie, Bowman Samuel R.
- A Task In A Suit And A Tie: Paraphrase Generation With Semantic Augmentation Wang Su, Gupta Rahul, Chang Nancy, Baldridge Jason
- Structured Triplet Learning With Pos-tag Guided Attention For Visual Question Answering Wang Zhe, Liu Xiaoyi, Chen Liangjian, Wang Limin, Qiao Yu, Xie Xiaohui, Fowlkes Charless
- The Fine Line Between Linguistic Generalization And Failure In Seq2seq-attention Models Weber Noah, Shekhar Leena, Balasubramanian Niranjan
- Dual Ask-answer Network For Machine Reading Comprehension Xiao Han, Wang Feng, Yan Jianfeng, Zheng Jingyao
- Qanet: Combining Local Convolution With Global Self-attention For Reading Comprehension Yu Adams Wei, Dohan David, Luong Minh-thang, Zhao Rui, Chen Kai, Norouzi Mohammad, Le Quoc V.
- Improved Training Of End-to-end Attention Models For Speech Recognition Zeyer Albert, Irie Kazuki, Schlüter Ralf, Ney Hermann
- Medical Exam Question Answering With Large-scale Reading Comprehension Zhang Xiao, Wu Ji, He Zhiyang, Liu Xien, Su Ying
- Phrase Table As Recommendation Memory For Neural Machine Translation Zhao Yang, Wang Yining, Zhang Jiajun, Zong Chengqing
- An Affect-rich Neural Conversational Model With Biased Attention And Weighted Cross-entropy Loss Zhong Peixiang, Wang Di, Miao Chunyan
- A Visual Attention Grounding Neural Model For Multimodal Machine Translation Zhou Mingyang, Cheng Runxiang, Lee Yong Jae, Yu Zhou
- Abductive Reasoning As Self-supervision For Common Sense Question Answering Aakur Sathyanarayanan N., Sarkar Sudeep
- Syntactically Supervised Transformers For Faster Neural Machine Translation Akoury Nader, Krishna Kalpesh, Iyyer Mohit
- Synthetic QA Corpora Generation With Roundtrip Consistency Alberti Chris, Andor Daniel, Pitler Emily, Devlin Jacob, Collins Michael
- A BERT Baseline For The Natural Questions Alberti Chris, Lee Kenton, Collins Michael
- Fusion Of Detected Objects In Text For Visual Question Answering Alberti Chris, Ling Jeffrey, Collins Michael, Reitter David
- The Unreasonable Effectiveness Of Transformer Language Models In Grammatical Error Correction Alikaniotis Dimitrios, Raheja Vipul
- Whatcha Lookin' At? Deeplifting BERT's Attention In Question Answering Arkhangelskaia Ekaterina, Dutta Sourav
- PLATO: Pre-trained Dialogue Generation Model With Discrete Latent Variable Bao Siqi, He Huang, Wang Fan, Wu Hua, Wang Haifeng
- Keeping Notes: Conditional Natural Language Generation With A Scratchpad Mechanism Benmalek Ryan Y., Khabsa Madian, Desu Suma, Cardie Claire, Banko Michele
- Meta Answering For Machine Reading Borschinger Benjamin, Boyd-graber Jordan, Buck Christian, Bulian Jannis, Ciaramita Massimiliano, Huebscher Michelle Chen, Gajewski Wojciech, Kilcher Yannic, Nogueira Rodrigo, Saralegu Lierni Sestorain
- Language Features Matter: Effective Language Representations For Vision-language Tasks Burns Andrea, Tan Reuben, Saenko Kate, Sclaroff Stan, Plummer Bryan A.
- Taskmaster-1: Toward A Realistic And Diverse Dialog Dataset Byrne Bill, Krishnamoorthi Karthik, Sankar Chinnadhurai, Neelakantan Arvind, Duckworth Daniel, Yavuz Semih, Goodrich Ben, Dubey Amit, Cedilnik Andy, Kim Kyu-young
- BERTQA -- Attention On Steroids Chadha Ankit, Sood Rewa
- An Empirical Study Of Generation Order For Machine Translation Chan William, Stern Mitchell, Kiros Jamie, Uszkoreit Jakob
- Making Neural Machine Reading Comprehension Faster Chatterjee Debajyoti
- DMRM: A Dual-channel Multi-hop Reasoning Model For Visual Dialog Chen Feilong, Meng Fandong, Xu Jiaming, Li Peng, Xu Bo, Zhou Jie
- Distilling Knowledge Learned In BERT For Text Generation Chen Yen-chun, Gan Zhe, Cheng Yu, Liu Jingzhou, Liu Jingjing
- Attending To Entities For Better Text Understanding Cheng Pengxiang, Erk Katrin
- Learning Multi-level Information For Dialogue Response Selection By Highway Recurrent Transformer Chiang Ting-rui, Huang Chao-wei, Su Shang-yu, Chen Yun-nung
- Contrastive Multi-document Question Generation Cho Woon Sang, Zhang Yizhe, Rao Sudha, Celikyilmaz Asli, Xiong Chenyan, Gao Jianfeng, Wang Mengdi, Dolan Bill
- Boolq: Exploring The Surprising Difficulty Of Natural Yes/no Questions Clark Christopher, Lee Kenton, Chang Ming-wei, Kwiatkowski Tom, Collins Michael, Toutanova Kristina
- On The Use Of BERT For Neural Machine Translation Clinchant Stéphane, Jung Kweon Woo, Nikoulina Vassilina
- Exploring The Limits Of Transfer Learning With A Unified Text-to-text Transformer Raffel Colin, Shazeer Noam, Roberts Adam, Lee Katherine, Narang Sharan, Matena Michael, Zhou Yanqi, Li Wei, Liu Peter J.
- Deep Learning Based Chatbot Models Csaky Richard
- Cross-lingual Machine Reading Comprehension Cui Yiming, Che Wanxiang, Liu Ting, Qin Bing, Wang Shijin, Hu Guoping
- Transformer-xl: Attentive Language Models Beyond A Fixed-length Context Dai Zihang, Yang Zhilin, Yang Yiming, Carbonell Jaime, Le Quoc V., Salakhutdinov Ruslan
- The Second Conversational Intelligence Challenge (ConvAI2) Dinan Emily, Logacheva Varvara, Malykh Valentin, Miller Alexander, Shuster Kurt, Urbanek Jack, Kiela Douwe, Szlam Arthur, Serban Iulian, Lowe Ryan, Prabhumoye Shrimai, Black Alan W, Rudnicky Alexander, Williams Jason, Pineau Joelle, Burtsev Mikhail, Weston Jason
- Memory-augmented Recurrent Networks For Dialogue Coherence Donahue David, Meng Yuanliang, Rumshisky Anna
- Unified Language Model Pre-training For Natural Language Understanding And Generation Dong Li, Yang Nan, Wang Wenhui, Wei Furu, Liu Xiaodong, Wang Yu, Gao Jianfeng, Zhou Ming, Hon Hsiao-wuen
- Retrosynthesis With Attention-based NMT Model And Chemical Analysis Of The "wrong" Predictions Duan Hongliang, Wang Ling, Zhang Chengyun, Li Jianjun
- Evaluating The State-of-the-art Of End-to-end Natural Language Generation: The E2E NLG Challenge Dušek Ondřej, Novikova Jekaterina, Rieser Verena
- Heterogeneous Memory Enhanced Multimodal Attention Model For Video Question Answering Fan Chenyou, Zhang Xiaofan, Zhang Shu, Wang Wensheng, Zhang Chi, Huang Heng
- Optibox: Breaking The Limits Of Proposals For Visual Grounding Fan Zicong, Meng Si Yi, Sigal Leonid, Little James J.
- Question-agnostic Attention For Visual Question Answering Farazi Moshiur R, Khan Salman H, Barnes Nick
- Neural Data-to-text Generation: A Comparison Between Pipeline And End-to-end Architectures Ferreira Thiago Castro, Van Der Lee Chris, Van Miltenburg Emiel, Krahmer Emiel
- MRQA 2019 Shared Task: Evaluating Generalization In Reading Comprehension Fisch Adam, Talmor Alon, Jia Robin, Seo Minjoon, Choi Eunsol, Chen Danqi
- Multi-step Reasoning Via Recurrent Dual Attention For Visual Dialog Gan Zhe, Cheng Yu, Kholy Ahmed El, Li Linjie, Liu Jingjing, Gao Jianfeng
- TANDA: Transfer And Adapt Pre-trained Transformer Models For Answer Sentence Selection Garg Siddhant, Vu Thuy, Moschitti Alessandro
- Self-attentive Model For Headline Generation Gavrilov Daniil, Kalaidin Pavel, Malykh Valentin
- Generating Natural Language Explanations For Visual Question Answering Using Scene Graphs And Visual Attention Ghosh Shalini, Burachas Giedrius, Ray Arijit, Ziskind Avi
- Span Selection Pre-training For Question Answering Glass Michael, Gliozzo Alfio, Chakravarti Rishav, Ferritto Anthony, Pan Lin, Bhargav G P Shrivatsa, Garg Dinesh, Sil Avirup
- Interactive Matching Network For Multi-turn Response Selection In Retrieval-based Chatbots Gu Jia-chen, Ling Zhen-hua, Liu Quan
- Insertion-based Decoding With Automatically Inferred Generation Order Gu Jiatao, Liu Qi, Cho Kyunghyun
- Reweighted Proximal Pruning For Large-scale Language Representation Guo Fu-ming, Liu Sijia, Mungall Finlay S., Lin Xue, Wang Yanzhi
- Zero-shot Paraphrase Generation With Multilingual Language Models Guo Yinpeng, Liao Yi, Jiang Xin, Zhang Qing, Zhang Yibo, Liu Qun
- Exploring Neural Net Augmentation To BERT For Question Answering On SQUAD 2.0 Gupta Suhas
- Episodic Memory Reader: Learning What To Remember For Question Answering From Streaming Data Han Moonsu, Kang Minki, Jung Hyunwoo, Hwang Sung Ju
- Resurrecting Submodularity For Neural Text Generation Han Simeng, Lin Xiang, Joty Shafiq
- Visualizing And Understanding The Effectiveness Of BERT Hao Yaru, Dong Li, Wei Furu, Xu Ke
- Analyzing The Forgetting Problem In The Pretrain-finetuning Of Dialogue Response Models He Tianxing, Liu Jun, Cho Kyunghyun, Ott Myle, Liu Bing, Glass James, Peng Fuchun
- Convert: Efficient And Accurate Conversational Representations From Transformers Henderson Matthew, Casanueva Iñigo, Mrkšić Nikola, Su Pei-hao, Wen Tsung-hsien, Vulić Ivan
- Efficient Adaptation Of Pretrained Transformers For Abstractive Summarization Hoang Andrew, Bosselut Antoine, Celikyilmaz Asli, Choi Yejin
- Exbert: A Visual Analysis Tool To Explore Learned Representations In Transformers Models Hoover Benjamin, Strobelt Hendrik, Gehrmann Sebastian
- Evaluating Rewards For Question Generation Models Hosking Tom, Riedel Sebastian
- Iterative Answer Prediction With Pointer-augmented Multimodal Transformers For Textvqa Hu Ronghang, Singh Amanpreet, Darrell Trevor, Rohrbach Marcus
- Rap-net: Recurrent Attention Pooling Networks For Dialogue Response Selection Huang Chao-wei, Chiang Ting-rui, Su Shang-yu, Chen Yun-nung
- INSET: Sentence Infilling With Inter-sentential Transformer Huang Yichen, Zhang Yizhe, Elachqar Oussama, Cheng Yu
- Designing Dialogue Systems: A Mean, Grumpy, Sarcastic Chatbot In The Browser Ilić Suzana, Nakano Reiichiro, Hajnal Ivo
- Attention Is Not Explanation Jain Sarthak, Wallace Byron C.
- Document-level Neural Machine Translation With Associated Memory Network Jiang Shu, Wang Rui, Li Zuchao, Utiyama Masao, Chen Kehai, Sumita Eiichiro, Zhao Hai, Lu Bao-liang
- Tinybert: Distilling BERT For Natural Language Understanding Jiao Xiaoqi, Yin Yichun, Shang Lifeng, Jiang Xin, Chen Xiao, Li Linlin, Wang Fang, Liu Qun
- Amr-to-text Generation With Cache Transition Systems Jin Lisa, Gildea Daniel
- Bipar: A Bilingual Parallel Dataset For Multilingual And Cross-lingual Reading Comprehension On Novels Jing Yimin, Xiong Deyi, Zhen Yan
- All-in-one Image-grounded Conversational Agents Ju Da, Shuster Kurt, Boureau Y-lan, Weston Jason
- Microsoft Translator At WMT 2019: Towards Large-scale Document-level Neural Machine Translation Junczys-dowmunt Marcin
- Learning And Evaluating Contextual Embedding Of Source Code Kanade Aditya, Maniatis Petros, Balakrishnan Gogul, Shi Kensen
- Let Me Know What To Ask: Interrogative-word-aware Question Generation Kang Junmo, Roman Haritz Puerto San, Myaeng Sung-hyon
- Weak Supervision Helps Emergence Of Word-object Alignment And Improves Vision-language Tasks Kervadec Corentin, Antipov Grigory, Baccouche Moez, Wolf Christian
- CTRL: A Conditional Transformer Language Model For Controllable Generation Keskar Nitish Shirish, Mccann Bryan, Varshney Lav R., Xiong Caiming, Socher Richard
- Sample Efficient Text Summarization Using A Single Pre-trained Transformer Khandelwal Urvashi, Clark Kevin, Jurafsky Dan, Kaiser Lukasz
- When And Why Is Document-level Context Useful In Neural Machine Translation? Kim Yunsu, Tran Duc Thanh, Ney Hermann
- Learning To Answer By Learning To Ask: Getting The Best Of GPT-2 And BERT Worlds Klein Tassilo, Nabi Moin
- Transcoding Compositionally: Using Attention To Find More Generalizable Solutions Korrel Kris, Hupkes Dieuwke, Dankers Verna, Bruni Elia
- Revealing The Dark Secrets Of BERT Kovaleva Olga, Romanov Alexey, Rogers Anna, Rumshisky Anna
- Question Generation From Paragraphs: A Tale Of Two Hierarchical Models Kumar Vishwajeet, Chaki Raktim, Talluri Sai Teja, Ramakrishnan Ganesh, Li Yuan-fang, Haffari Gholamreza
- ALBERT: A Lite BERT For Self-supervised Learning Of Language Representations Lan Zhenzhong, Chen Mingda, Goodman Sebastian, Gimpel Kevin, Sharma Piyush, Soricut Radu
- What Would Elsa Do? Freezing Layers During Transformer Fine-tuning Lee Jaejun, Tang Raphael, Lin Jimmy
- Patent Claim Generation By Fine-tuning Openai GPT-2 Lee Jieh-sheng, Hsiang Jieh
- BART: Denoising Sequence-to-sequence Pre-training For Natural Language Generation, Translation, And Comprehension Lewis Mike, Liu Yinhan, Goyal Naman, Ghazvininejad Marjan, Mohamed Abdelrahman, Levy Omer, Stoyanov Ves, Zettlemoyer Luke
- Bertsel: Answer Selection With Pre-trained Models Li Dongfang, Yu Yifei, Chen Qingcai, Li Xinyu
- Unicoder-vl: A Universal Encoder For Vision And Language By Cross-modal Pre-training Li Gen, Duan Nan, Fang Yuejian, Gong Ming, Jiang Daxin, Zhou Ming
- Big Bidirectional Insertion Representations For Documents Li Lala, Chan William
- Pretrained Language Models For Document-level Neural Machine Translation Li Liangyou, Jiang Xin, Liu Qun
- Visualbert: A Simple And Performant Baseline For Vision And Language Li Liunian Harold, Yatskar Mark, Yin Da, Hsieh Cho-jui, Chang Kai-wei
- Relevance-promoting Language Model For Short-text Conversation Li Xin, Li Piji, Bi Wei, Liu Xiaojiang, Lam Wai
- Incremental Transformer With Deliberation Decoder For Document Grounded Conversations Li Zekang, Niu Cheng, Meng Fandong, Feng Yang, Li Qian, Zhou Jie
- Explicit Sentence Compression For Neural Machine Translation Li Zuchao, Wang Rui, Chen Kehai, Utiyama Masao, Sumita Eiichiro, Zhang Zhuosheng, Zhao Hai
- GPT-based Generation For Classical Chinese Poetry Liao Yi, Wang Yasheng, Liu Qun, Jiang Xin
- Probing Representations Learned By Multimodal Recurrent And Transformer Models Libovický Jindřich, Madhyastha Pranava
- TAB-VCR: Tags And Attributes Based Visual Commonsense Reasoning Baselines Lin Jingxiang, Jain Unnat, Schwing Alexander G.
- MKD: A Multi-task Knowledge Distillation Approach For Pretrained Language Models Liu Linqing, Wang Huan, Lin Jimmy, Socher Richard, Xiong Caiming
- Generative Question Refinement With Deep Reinforcement Learning In Retrieval-based QA System Liu Ye, Zhang Chenwei, Yan Xiaohui, Chang Yi, Yu Philip S.
- Roberta: A Robustly Optimized BERT Pretraining Approach Liu Yinhan, Ott Myle, Goyal Naman, Du Jingfei, Joshi Mandar, Chen Danqi, Levy Omer, Lewis Mike, Zettlemoyer Luke, Stoyanov Veselin
- Attention-informed Mixed-language Training For Zero-shot Cross-lingual Task-oriented Dialogue Systems Liu Zihan, Winata Genta Indra, Lin Zhaojiang, Xu Peng, Fung Pascale
- Multi-hop Reading Comprehension Via Deep Reinforcement Learning Based Document Traversal Long Alex, Mason Joel, Blair Alan, Wang Wei
- Vilbert: Pretraining Task-agnostic Visiolinguistic Representations For Vision-and-language Tasks Lu Jiasen, Batra Dhruv, Parikh Devi, Lee Stefan
- Good, Better, Best: Textual Distractors Generation For Multiple-choice Visual Question Answering Via Reinforcement Learning Lu Jiaying, Ye Xin, Ren Yi, Yang Yezhou
- Triplenet: Triple Attention Network For Multi-turn Response Selection In Retrieval-based Chatbots Ma Wentao, Cui Yiming, Shao Nan, He Su, Zhang Wei-nan, Liu Ting, Wang Shijin, Hu Guoping
- Selective Attention For Context-aware Neural Machine Translation Maruf Sameen, Martins André F. T., Haffari Gholamreza
- Emergent Properties Of Finetuned Language Representation Models Matton Alexandre, De Oliveira Luke
- Structured Pruning Of A BERT-based Question Answering Model Mccarley J. S., Chakravarti Rishav, Sil Avirup
- Self-attentional Models Application In Task-oriented Dialogue Generation Systems Mehrjardi Mansour Saffar, Trabelsi Amine, Zaiane Osmar R.
- Mogrifier LSTM Melis Gábor, Kočiský Tomáš, Blunsom Phil
- Megatron-lm: Training Multi-billion Parameter Language Models Using Model Parallelism Shoeybi Mohammad, Patwary Mostofa, Puri Raul, Legresley Patrick, Casper Jared, Catanzaro Bryan
- Interrogating The Explanatory Power Of Attention In Neural Machine Translation Moradi Pooya, Kambhatla Nishant, Sarkar Anoop
- Human Vs. Muppet: A Conservative Estimate Of Human Performance On The GLUE Benchmark Nangia Nikita, Bowman Samuel R.
- Neural Assistant: Joint Action Prediction, Response Generation, And Latent Knowledge Reasoning Neelakantan Arvind, Yavuz Semih, Narang Sharan, Prasad Vishaal, Goodrich Ben, Duckworth Daniel, Sankar Chinnadhurai, Yan Xifeng
- Let's Ask Again: Refine Network For Automatic Question Generation Nema Preksha, Mohankumar Akash Kumar, Khapra Mitesh M., Srinivasan Balaji Vasan, Ravindran Balaraman
- Efficient Attention Mechanism For Visual Dialog That Can Handle All The Interactions Between Multiple Inputs Nguyen Van-quang, Suganuma Masanori, Okatani Takayuki
- A Simple But Effective Method To Incorporate Multi-turn Context With BERT For Conversational Machine Comprehension Ohsugi Yasuhito, Saito Itsumi, Nishida Kyosuke, Asano Hisako, Tomita Junji
- Dlgnet: A Transformer-based Model For Dialogue Response Generation Olabiyi Oluwatobi, Mueller Erik T.
- Adversarial Bootstrapping For Dialogue Model Training Olabiyi Oluwatobi, Mueller Erik T., Larson Christopher, Lahlou Tarek
- Frustratingly Easy Natural Question Answering Pan Lin, Chakravarti Rishav, Ferritto Anthony, Glass Michael, Gliozzo Alfio, Roukos Salim, Florian Radu, Sil Avirup
- Visual Dialogue State Tracking For Question Generation Pang Wei, Wang Xiaojie
- AI Meets Austen: Towards Human-robot Discussions Of Literary Metaphor Parde Natalie, Nielsen Rodney D.
- Stabilizing Transformers For Reinforcement Learning Parisotto Emilio, Song H. Francis, Rae Jack W., Pascanu Razvan, Gulcehre Caglar, Jayakumar Siddhant M., Jaderberg Max, Kaufman Raphael Lopez, Clark Aidan, Noury Seb, Botvinick Matthew M., Heess Nicolas, Hadsell Raia
- Thisiscompetition At Semeval-2019 Task 9: BERT Is Unstable For Out-of-domain Samples Park Cheoneum, Kim Juae, Lee Hyeon-gu, Amplayo Reinald Kim, Kim Harksoo, Seo Jungyun, Lee Changki
- A Modular Task-oriented Dialogue System Using A Neural Mixture-of-experts Pei Jiahuan, Ren Pengjie, De Rijke Maarten
- Language Models As Knowledge Bases? Petroni Fabio, Rocktäschel Tim, Lewis Patrick, Bakhtin Anton, Wu Yuxiang, Miller Alexander H., Riedel Sebastian
- Learning To Deceive With Attention-based Explanations Pruthi Danish, Gupta Mansi, Dhingra Bhuwan, Neubig Graham, Lipton Zachary C.
- Human-like Machine Thinking: Language Guided Imagination Qi Feng, Wu Wenchuan
- Answering Complex Open-domain Questions Through Iterative Query Generation Qi Peng, Lin Xiaowen, Mehr Leo, Wang Zijian, Manning Christopher D.
- Blockwise Self-attention For Long Document Understanding Qiu Jiezhong, Ma Hao, Levy Omer, Yih Scott Wen-tau, Wang Sinong, Tang Jie
- Option Comparison Network For Multiple-choice Reading Comprehension Ran Qiu, Li Peng, Hu Weiwei, Zhou Jie
- On Leveraging The Visual Modality For Neural Machine Translation Raunak Vikas, Choe Sang Keun, Lu Quanyang, Xu Yi, Metze Florian
- Probing Natural Language Inference Models Through Semantic Fragments Richardson Kyle, Hu Hai, Moss Lawrence S., Sabharwal Ashish
- Leveraging Pre-trained Checkpoints For Sequence Generation Tasks Rothe Sascha, Narayan Shashi, Severyn Aliaksei
- Is Multilingual BERT Fluent In Language Generation? Rönnqvist Samuel, Kanerva Jenna, Salakoski Tapio, Ginter Filip
- Masked Language Model Scoring Salazar Julian, Liang Davis, Nguyen Toan Q., Kirchhoff Katrin
- Hierarchical Reinforcement Learning For Open-domain Dialog Saleh Abdelrhman, Jaques Natasha, Ghandeharioun Asma, Shen Judy Hanwen, Picard Rosalind
- Zero: Memory Optimizations Toward Training Trillion Parameter Models Rajbhandari Samyam, Rasley Jeff, Ruwase Olatunji, He Yuxiong
- Distilbert, A Distilled Version Of BERT: Smaller, Faster, Cheaper And Lighter Sanh Victor, Debut Lysandre, Chaumond Julien, Wolf Thomas
- Do Neural Dialog Systems Use The Conversation History Effectively? An Empirical Study Sankar Chinnadhurai, Subramanian Sandeep, Pal Christopher, Chandar Sarath, Bengio Yoshua
- A Survey Of Natural Language Generation Techniques With A Focus On Dialogue Systems - Past, Present And Future Directions Santhanam Sashank, Shaikh Samira
- Inducing Brain-relevant Bias In Natural Language Processing Models Schwartz Dan, Toneva Mariya, Wehbe Leila
- Factor Graph Attention Schwartz Idan, Yu Seunghak, Hazan Tamir, Schwing Alexander
- Do Massively Pretrained Language Models Make Better Storytellers? See Abigail, Pappu Aneesh, Saxena Rohun, Yerukola Akhila, Manning Christopher D.
- Taking A HINT: Leveraging Explanations To Make Vision And Language Models More Grounded Selvaraju Ramprasaath R., Lee Stefan, Shen Yilin, Jin Hongxia, Ghosh Shalini, Heck Larry, Batra Dhruv, Parikh Devi
- Scene-based Factored Attention For Image Captioning Shen Chen, Ji Rongrong, Chen Fuhai, Sun Xiaoshuai, Li Xiangming
- Pragmatically Informative Text Generation Shen Sheng, Fried Daniel, Andreas Jacob, Klein Dan
- Flexibly-structured Model For Task-oriented Dialogues Shu Lei, Molino Piero, Namazifar Mahdi, Xu Hu, Liu Bing, Zheng Huaixiu, Tur Gokhan
- The Dialogue Dodecathlon: Open-domain Knowledge And Image Grounded Conversational Agents Shuster Kurt, Ju Da, Roller Stephen, Dinan Emily, Boureau Y-lan, Weston Jason
- What Does BERT Learn From Multiple-choice Reading Comprehension Datasets? Si Chenglei, Wang Shuohang, Kan Min-yen, Jiang Jing
- Multilingual Question Answering From Formatted Text Applied To Conversational Agents Siblini Wissam, Pasqual Charlotte, Lavielle Axel, Challal Mohamed, Cauchois Cyril
- The Evolved Transformer So David R., Liang Chen, Le Quoc V.
- MASS: Masked Sequence To Sequence Pre-training For Language Generation Song Kaitao, Tan Xu, Qin Tao, Lu Jianfeng, Liu Tie-yan
- Natural Language Generation Using Reinforcement Learning With External Rewards Srinivasan Vidhushini, Santhanam Sashank, Shaikh Samira
- On NMT Search Errors And Model Errors: Cat Got Your Tongue? Stahlberg Felix, Byrne Bill
- CUED@WMT19:EWC&LMs Stahlberg Felix, Saunders Danielle, De Gispert Adria, Byrne Bill
- Insertion Transformer: Flexible Sequence Generation Via Insertion Operations Stern Mitchell, Chan William, Kiros Jamie, Uszkoreit Jakob
- Sesamebert: Attention For Anywhere Su Ta-chun, Cheng Hsiang-chih
- VL-BERT: Pre-training Of Generic Visual-linguistic Representations Su Weijie, Zhu Xizhou, Cao Yue, Li Bin, Lu Lewei, Wei Furu, Dai Jifeng
- Adaptive Attention Span In Transformers Sukhbaatar Sainbayar, Grave Edouard, Bojanowski Piotr, Joulin Armand
- Augmenting Self-attention With Persistent Memory Sukhbaatar Sainbayar, Grave Edouard, Lample Guillaume, Jegou Herve, Joulin Armand
- Generating Diverse Translation By Manipulating Multi-head Attention Sun Zewei, Huang Shujian, Wei Hao-ran, Dai Xin-yu, Chen Jiajun
- Syntax-infused Transformer And BERT Models For Machine Translation And Natural Language Understanding Sundararaman Dhanasekar, Subramanian Vivek, Wang Guoyin, Si Shijing, Shen Dinghan, Wang Dong, Carin Lawrence
- Olmpics -- On What Language Model Pre-training Captures Talmor Alon, Elazar Yanai, Goldberg Yoav, Berant Jonathan
- LXMERT: Learning Cross-modality Encoder Representations From Transformers Tan Hao, Bansal Mohit
- What Do You Learn From Context? Probing For Sentence Structure In Contextualized Word Representations Tenney Ian, Xia Patrick, Chen Berlin, Wang Alex, Poliak Adam, Mccoy R Thomas, Kim Najoung, Van Durme Benjamin, Bowman Samuel R., Das Dipanjan, Pavlick Ellie
- Generating Token-level Explanations For Natural Language Inference Thorne James, Vlachos Andreas, Christodoulopoulos Christos, Mittal Arpit
- Waldorf: Wasteless Language-model Distillation On Reading-comprehension Tian James Yi, Kreuzer Alexander P., Chen Pai-hung, Will Hans-martin
- Repurposing Entailment For Multi-hop Question Answering Tasks Trivedi Harsh, Kwon Heeyoung, Khot Tushar, Sabharwal Ashish, Balasubramanian Niranjan
- Tree-structured Semantic Encoder With Knowledge Sharing For Domain Adaptation In Natural Language Generation Tseng Bo-hsiang, Budzianowski Paweł, Wu Yen-chen, Gašić Milica
- Capturing Greater Context For Question Generation Tuan Luu Anh, Shah Darsh J, Barzilay Regina
- Well-read Students Learn Better: On The Importance Of Pre-training Compact Models Turc Iulia, Chang Ming-wei, Lee Kenton, Toutanova Kristina
- How Does BERT Answer Questions? A Layer-wise Analysis Of Transformer Representations Van Aken Betty, Winter Benjamin, Löser Alexander, Gers Felix A.
- Quantity Doesn't Buy Quality Syntax With Neural Language Models Van Schijndel Marten, Mueller Aaron, Linzen Tal
- Visualizing Attention In Transformer-based Language Representation Models Vig Jesse
- Training Optimus Prime, M.D.: Generating Medical Certification Items By Fine-tuning Openai's GPT2 Transformer Model Von Davier Matthias
- Allennlp Interpret: A Framework For Explaining Predictions Of NLP Models Wallace Eric, Tuyls Jens, Wang Junlin, Subramanian Sanjay, Gardner Matt, Singh Sameer
- Do NLP Models Know Numbers? Probing Numeracy In Embeddings Wallace Eric, Wang Yizhong, Li Sujian, Singh Sameer, Gardner Matt
- Language Models With Transformers Wang Chenguang, Li Mu, Smola Alexander J.
- Do Multi-hop Readers Dream Of Reasoning Chains? Wang Haoyu, Yu Mo, Guo Xiaoxiao, Das Rajarshi, Xiong Wenhan, Gao Tian
- Improving Knowledge-aware Dialogue Generation Via Knowledge Base Question Answering Wang Jian, Liu Junhao, Bi Wei, Liu Xiaojiang, He Kejing, Xu Ruifeng, Yang Min
- Theme-aware Generation Model For Chinese Lyrics Wang Jie, Zhao Xinyan
- Structbert: Incorporating Language Structures Into Pre-training For Deep Language Understanding Wang Wei, Bi Bin, Yan Ming, Wu Chen, Bao Zuyi, Xia Jiangnan, Peng Liwei, Si Luo
- Tree Transformer: Integrating Tree Structures Into Self-attention Wang Yau-shian, Lee Hung-yi, Chen Yun-nung
- Video Question Generation Via Cross-modal Self-attention Networks Learning Wang Yu-siang, Su Hung-ting, Chang Chen-hsi, Liu Zhe-yu, Hsu Winston H.
- Structured Pruning Of Large Language Models Wang Ziheng, Wohlwend Jeremy, Lei Tao
- Adapting And Evaluating A Deep Learning Language Model For Clinical Why-question Answering Wen Andrew, Elwazir Mohamed Y., Moon Sungrim, Fan Jungwei
- Improving Neural Machine Translation With Pre-trained Representation Weng Rongxiang, Yu Heng, Huang Shujian, Luo Weihua, Chen Jiajun
- Attention Is Not Not Explanation Wiegreffe Sarah, Pinter Yuval
- Huggingface's Transformers: State-of-the-art Natural Language Processing Wolf Thomas, Debut Lysandre, Sanh Victor, Chaumond Julien, Delangue Clement, Moi Anthony, Cistac Pierric, Rault Tim, Louf Rémi, Funtowicz Morgan, Davison Joe, Shleifer Sam, Von Platen Patrick, Ma Clara, Jernite Yacine, Plu Julien, Xu Canwen, Scao Teven Le, Gugger Sylvain, Drame Mariama, Lhoest Quentin, Rush Alexander M.
- Transfertransfo: A Transfer Learning Approach For Neural Network Based Conversational Agents Wolf Thomas, Sanh Victor, Chaumond Julien, Delangue Clement
- Learning To Memorize In Neural Task-oriented Dialogue Systems Wu Chien-sheng
- Integrated Triaging For Fast Reading Comprehension Wu Felix, Li Boyi, Wang Lequn, Lao Ni, Blitzer John, Weinberger Kilian Q.
- Alternating Recurrent Dialog Model With Large-scale Pre-trained Language Models Wu Qingyang, Zhang Yichi, Li Yu, Yu Zhou
- Transformer-based Cascaded Multimodal Speech Translation Wu Zixiu, Caglayan Ozan, Ive Julia, Wang Josiah, Specia Lucia
- Sharing Attention Weights For Fast Transformer Xiao Tong, Li Yinqiao, Zhu Jingbo, Yu Zhengtao, Liu Tongran
- Visual Entailment: A Novel Task For Fine-grained Image Understanding Xie Ning, Lai Farley, Doran Derek, Kadav Asim
- A Multi-turn Emotionally Engaging Dialog Model Xie Yubo, Svikhnushina Ekaterina, Pu Pearl
- Review Conversational Reading Comprehension Xu Hu, Liu Bing, Shu Lei, Yu Philip S.
- Towards Making The Most Of BERT In Neural Machine Translation Yang Jiacheng, Wang Mingxuan, Zhou Hao, Zhao Chengqi, Yu Yong, Zhang Weinan, Li Lei
- Deepening Hidden Representations From Pre-trained Language Models Yang Junjie, Zhao Hai
- Making History Matter: History-advantage Sequence Training For Visual Dialog Yang Tianhao, Zha Zheng-jun, Zhang Hanwang
- Data Augmentation For BERT Fine-tuning In Open-domain Question Answering Yang Wei, Xie Yuqing, Tan Luchen, Xiong Kun, Li Ming, Lin Jimmy
- Model Compression With Two-stage Multi-teacher Knowledge Distillation For Web Question Answering System Yang Ze, Shou Linjun, Gong Ming, Lin Wutao, Jiang Daxin
- Knowledge-grounded Response Generation With Deep Attentional Latent-variable Model Ye Hao-tong, Lo Kai-ling, Su Shang-yu, Chen Yun-nung
- Bp-transformer: Modelling Long-range Context Via Binary Partitioning Ye Zihao, Guo Qipeng, Gan Quan, Qiu Xipeng, Zhang Zheng
- Multimodal Unified Attention Networks For Vision-and-language Interactions Yu Zhou, Cui Yuhao, Yu Jun, Tao Dacheng, Tian Qi
- Q8BERT: Quantized 8bit BERT Zafrir Ofir, Boudoukh Guy, Izsak Peter, Wasserblat Moshe
- Extended Answer And Uncertainty Aware Neural Question Generation Zeng Hongwei, Zhi Zhuo, Liu Jun, Wei Bifan
- Context-aware Visual Policy Network For Fine-grained Image Captioning Zha Zheng-jun, Liu Daqing, Zhang Hanwang, Zhang Yongdong, Wu Feng
- Recosa: Detecting The Relevant Contexts With Self-attention For Multi-turn Dialogue Generation Zhang Hainan, Lan Yanyan, Pang Liang, Guo Jiafeng, Cheng Xueqi
- PEGASUS: Pre-training With Extracted Gap-sentences For Abstractive Summarization Zhang Jingqing, Zhao Yao, Saleh Mohammad, Liu Peter J.
- Addressing Semantic Drift In Question Generation For Semi-supervised Question Answering Zhang Shiyue, Bansal Mohit
- Bertscore: Evaluating Text Generation With BERT Zhang Tianyi, Kishore Varsha, Wu Felix, Weinberger Kilian Q., Artzi Yoav
- Dialogpt: Large-scale Generative Pre-training For Conversational Response Generation Zhang Yizhe, Sun Siqi, Galley Michel, Chen Yen-chun, Brockett Chris, Gao Xiang, Gao Jianfeng, Liu Jingjing, Dolan Bill
- Semantics-aware BERT For Language Understanding Zhang Zhuosheng, Wu Yuwei, Zhao Hai, Li Zuchao, Zhang Shuailiang, Zhou Xi, Zhou Xiang
- MUSE: Parallel Multi-scale Attention For Sequence To Sequence Learning Zhao Guangxiang, Sun Xu, Xu Jingjing, Zhang Zhiyuan, Luo Liangchen
- Extremely Small BERT Models From Mixed-vocabulary Training Zhao Sanqiang, Gupta Raghav, Song Yang, Zhou Denny
- A Pre-training Based Personalized Dialogue Generation Model With Persona-sparse Data Zheng Yinhe, Zhang Rongsheng, Mao Xiaoxi, Huang Minlie
- Atom Responding Machine For Dialog Generation Zhou Ganbin, Luo Ping, Chen Jingwu, Lin Fen, Lin Leyu, He Qing
- Synchronous Bidirectional Neural Machine Translation Zhou Long, Zhang Jiajun, Zong Chengqing
- Unified Vision-language Pre-training For Image Captioning And VQA Zhou Luowei, Palangi Hamid, Zhang Lei, Hu Houdong, Corso Jason J., Gao Jianfeng
- Multi-task Learning With Language Modeling For Question Generation Zhou Wenjie, Zhang Minghua, Wu Yunfang
- Co-attention Hierarchical Network: Generating Coherent Long Distractors For Reading Comprehension Zhou Xiaorui, Luo Senlin, Wu Yunfang
- Using Interlinear Glosses As Pivot In Low-resource Multilingual Machine Translation Zhou Zhong, Levin Lori, Mortensen David R., Waibel Alex
- Encoder-agnostic Adaptation For Conditional Language Generation Ziegler Zachary M., Melas-kyriazi Luke, Gehrmann Sebastian, Rush Alexander M.
- Paranoid Transformer: Reading Narrative Of Madness As Computational Approach To Creativity Agafonova Yana, Tikhonov Alexey, Yamshchikov Ivan P.
- Accelerating Natural Language Understanding In Task-oriented Dialog Ahuja Ojas, Desai Shrey
- A Study On Multimodal And Interactive Explanations For Visual Question Answering Alipour Kamran, Schulze Jurgen P., Yao Yi, Ziskind Avi, Burachas Giedrius
- Optimizing Transformer For Low-resource Neural Machine Translation Araabi Ali, Monz Christof
- Stress Test Evaluation Of Transformer-based Models In Natural Language Understanding Tasks Aspillaga Carlos, Carvallo Andrés, Araujo Vladimir
- Fluent Response Generation For Conversational Question Answering Baheti Ashutosh, Ritter Alan, Small Kevin
- Segatron: Segment-aware Transformer For Language Modeling And Understanding Bai He, Shi Peng, Lin Jimmy, Xie Yuqing, Tan Luchen, Xiong Kun, Gao Wen, Li Ming
- Character-level Transformer-based Neural Machine Translation Banar Nikolay, Daelemans Walter, Kestemont Mike
- Unilmv2: Pseudo-masked Language Models For Unified Language Model Pre-training Bao Hangbo, Dong Li, Wei Furu, Wang Wenhui, Yang Nan, Liu Xiaodong, Wang Yu, Piao Songhao, Gao Jianfeng, Zhou Ming, Hon Hsiao-wuen
- The Elephant In The Interpretability Room: Why Use Attention As Explanation When We Have Saliency Methods? Bastings Jasmijn, Filippova Katja
- Co-attentional Transformers For Story-based Video Understanding Bebensee Björn, Zhang Byoung-tak
- Longformer: The Long-document Transformer Beltagy Iz, Peters Matthew E., Cohan Arman
- Adaptive Transformers For Learning Multimodal Representations Bhargava Prajjwal
- PALM: Pre-training An Autoencoding&autoregressive Language Model For Context-conditioned Generation Bi Bin, Li Chenliang, Wu Chen, Yan Ming, Wang Wei, Huang Songfang, Huang Fei, Si Luo
- Chatbot Interaction With Artificial Intelligence: Human Data Augmentation With T5 And Language Transformer Ensemble For Text Classification Bird Jordan J., Ekárt Anikó, Faria Diego R.
- Language Models Are Few-shot Learners Brown Tom B., Mann Benjamin, Ryder Nick, Subbiah Melanie, Kaplan Jared, Dhariwal Prafulla, Neelakantan Arvind, Shyam Pranav, Sastry Girish, Askell Amanda, Agarwal Sandhini, Herbert-voss Ariel, Krueger Gretchen, Henighan Tom, Child Rewon, Ramesh Aditya, Ziegler Daniel M., Wu Jeffrey, Winter Clemens, Hesse Christopher, Chen Mark, Sigler Eric, Litwin Mateusz, Gray Scott, Chess Benjamin, Clark Jack, Berner Christopher, Mccandlish Sam, Radford Alec, Sutskever Ilya, Amodei Dario
- Advances Of Transformer-based Models For News Headline Generation Bukhtiyarov Alexey, Gusev Ilya
- DIET: Lightweight Language Understanding For Dialogue Systems Bunk Tanja, Varshneya Daksh, Vlasov Vladimir, Nichol Alan
- Memory Transformer Burtsev Mikhail S., Kuratov Yuri, Peganov Anton, Sapunov Grigory V.
- Behind The Scene: Revealing The Secrets Of Pre-trained Vision-and-language Models Cao Jize, Gan Zhe, Cheng Yu, Yu Licheng, Chen Yen-chun, Liu Jingjing
- Pretrained Language Models For Dialogue Generation With Multiple Input Sources Cao Yu, Bi Wei, Fang Meng, Tao Dacheng
- Syntax-driven Iterative Expansion Language Models For Controllable Text Generation Casas Noe, Fonollosa José A. R., Costa-jussà Marta R.
- Simultaneous Paraphrasing And Translation By Fine-tuning Transformer Models Chada Rakesh
- Exploring Fluent Query Reformulations With Text-to-text Transformers And Reinforcement Learning Chen Jerry Zikun, Yu Shi, Wang Haoran
- Dipair: Fast And Accurate Distillation For Trillion-scale Text Matching And Pair Modeling Chen Jiecao, Yang Liu, Raman Karthik, Bendersky Michael, Yeh Jung-jung, Zhou Yun, Najork Marc, Cai Danyang, Emadzadeh Ehsan
- Adaptive Bi-directional Attention: Exploring Multi-granularity Representations For Machine Reading Comprehension Chen Nuo, Liu Fenglin, You Chenyu, Zhou Peilin, Zou Yuexian
- Logical Natural Language Generation From Open-domain Tables Chen Wenhu, Chen Jianshu, Su Yu, Chen Zhiyu, Wang William Yang
- Reasoning In Dialog: Improving Response Generation By Context Reading Comprehension Chen Xiuying, Cui Zhi, Zhang Jiayi, Wei Chen, Cui Jianwei, Wang Bin, Zhao Dongyan, Yan Rui
- Model Selection For Cross-lingual Transfer Chen Yang, Ritter Alan
- Improving BERT With Self-supervised Attention Chen Yiren, Kou Xiaoyu, Bai Jiangang, Tong Yunhai
- Autoadr: Automatic Model Design For Ad Relevance Chen Yiren, Yang Yaming, Sun Hong, Wang Yujing, Xu Yu, Shen Wei, Zhou Rong, Tong Yunhai, Bai Jing, Zhang Ruofei
- Accurate Word Alignment Induction From Neural Machine Translation Chen Yun, Liu Yang, Chen Guanhua, Jiang Xin, Liu Qun
- Comparative Study Of Language Models On Cross-domain Data With Model Agnostic Explainability Chhipa Mayank, Vazurkar Hrushikesh Mahesh, Kumar Abhijeet, Mishra Mridul
- X-LXMERT: Paint, Caption And Answer Questions With Multi-modal Transformers Cho Jaemin, Lu Jiasen, Schwenk Dustin, Hajishirzi Hannaneh, Kembhavi Aniruddha
- MEANTIME: Mixture Of Attention Mechanisms With Multi-temporal Embeddings For Sequential Recommendation Cho Sung Min, Park Eunhyeok, Yoo Sungjoo
- Rethinking Embedding Coupling In Pre-trained Language Models Chung Hyung Won, Févry Thibault, Tsai Henry, Johnson Melvin, Ruder Sebastian
- Transformers As Soft Reasoners Over Language Clark Peter, Tafjord Oyvind, Richardson Kyle
- Pymt5: Multi-mode Translation Of Natural Language And Python Code With Transformers Clement Colin B., Drain Dawn, Timcheck Jonathan, Svyatkovskiy Alexey, Sundaresan Neel
- Unsupervised Explanation Generation For Machine Reading Comprehension Cui Yiming, Liu Ting, Wang Shijin, Hu Guoping
- Residual Energy-based Models For Text Generation Deng Yuntian, Bakhtin Anton, Ott Myle, Szlam Arthur, Ranzato Marc'aurelio
- Cascaded Text Generation With Markov Transformers Deng Yuntian, Rush Alexander M.
- Calibration Of Pre-trained Transformers Desai Shrey, Durrett Greg
- EFSG: Evolutionary Fooling Sentences Generator Di Giovanni Marco, Brambilla Marco
- Ernie-doc: A Retrospective Long-document Modeling Transformer Ding Siyu, Shang Junyuan, Wang Shuohuan, Sun Yu, Tian Hao, Wu Hua, Wang Haifeng
- Gshard: Scaling Giant Models With Conditional Computation And Automatic Sharding Lepikhin Dmitry, Lee Hyoukjoong, Xu Yuanzhong, Chen Dehao, Firat Orhan, Huang Yanping, Krikun Maxim, Shazeer Noam, Chen Zhifeng
- Fine-tuning Pretrained Language Models: Weight Initializations, Data Orders, And Early Stopping Dodge Jesse, Ilharco Gabriel, Schwartz Roy, Farhadi Ali, Hajishirzi Hannaneh, Smith Noah
- Sound Natural: Content Rephrasing In Dialog Systems Einolghozati Arash, Gupta Anchit, Diedrick Keith, Gupta Sonal
- Turngpt: A Transformer-based Language Model For Predicting Turn-taking In Spoken Dialog Ekstedt Erik, Skantze Gabriel
- Training Multilingual Machine Translation By Alternately Freezing Language-specific Encoders-decoders Escolano Carlos, Costa-jussà Marta R., Fonollosa José A. R., Artetxe Mikel
- A Comparison Of LSTM And BERT For Small Corpus Ezen-can Aysu
- The Unreasonable Volatility Of Neural Machine Translation Models Fadaee Marzieh, Monz Christof
- Augmenting Transformers With Knn-based Composite Memory For Dialogue Fan Angela, Gardent Claire, Braud Chloe, Bordes Antoine
- Addressing Some Limitations Of Transformers With Feedback Memory Fan Angela, Lavril Thibaut, Grave Edouard, Joulin Armand, Sukhbaatar Sainbayar
- CERT: Contrastive Self-supervised Learning For Language Understanding Fang Hongchao, Wang Sicheng, Zhou Meng, Ding Jiayuan, Xie Pengtao
- Attention Guided Semantic Relationship Parsing For Visual Question Answering Farazi Moshiur, Khan Salman, Barnes Nick
- Paragraph-level Commonsense Transformers With Recurrent Memory Gabriel Saadia, Bhagavatula Chandra, Shwartz Vered, Bras Ronan Le, Forbes Maxwell, Choi Yejin
- Compressing Large-scale Transformer-based Models: A Case Study On BERT Ganesh Prakhar, Chen Yao, Lou Xin, Khan Mohammad Ali, Yang Yin, Sajjad Hassan, Nakov Preslav, Chen Deming, Winslett Marianne
- Robust Conversational AI With Grounded Text Generation Gao Jianfeng, Peng Baolin, Li Chunyuan, Li Jinchao, Shayandeh Shahin, Liden Lars, Shum Heung-yeung
- The Pile: An 800GB Dataset Of Diverse Text For Language Modeling Gao Leo, Biderman Stella, Black Sid, Golding Laurence, Hoppe Travis, Foster Charles, Phang Jason, He Horace, Thite Anish, Nabeshima Noa, Presser Shawn, Leahy Connor
- Relevance Transformer: Generating Concise Code Snippets With Relevance Feedback Gemmell Carlos, Rossetto Federico, Dalton Jeffrey
- IQ-VQA: Intelligent Visual Question Answering Goel Vatsal, Chandak Mohit, Anand Ashish, Guha Prithwijit
- ISAAQ - Mastering Textbook Questions With Pre-trained Transformers And Bottom-up And Top-down Attention Gomez-perez Jose Manuel, Ortega Raul
- It's Not Greek To Mbert: Inducing Word-level Translations From Multilingual BERT Gonen Hila, Ravfogel Shauli, Elazar Yanai, Goldberg Yoav
- Recurrent Chunking Mechanisms For Long-text Machine Reading Comprehension Gong Hongyu, Shen Yelong, Yu Dian, Chen Jianshu, Yu Dong
- Measuring Systematic Generalization In Neural Proof Generation With Transformers Gontier Nicolas, Sinha Koustuv, Reddy Siva, Pal Christopher
- Teaforn: Teacher-forcing With N-grams Goodman Sebastian, Ding Nan, Soricut Radu
- Are Neural Open-domain Dialog Systems Robust To Speech Recognition Errors In The Dialog History? An Empirical Study Gopalakrishnan Karthik, Hedayatnia Behnam, Wang Longshaokan, Liu Yang, Hakkani-tur Dilek
- Contextualize Knowledge Bases With Transformer For End-to-end Task-oriented Dialogue Systems Gou Yanjie, Lei Yinjie, Liu Lingqiao, Dai Yong, Shen Chunxu
- A Simple Yet Strong Pipeline For Hotpotqa Groeneveld Dirk, Khot Tushar, Mausam, Sabharwal Ashish
- A Tailored Pre-training Model For Task-oriented Dialog Generation Gu Jing, Wu Qingyang, Wu Chongruo, Shi Weiyan, Yu Zhou
- Token-level Adaptive Training For Neural Machine Translation Gu Shuhao, Zhang Jinchao, Meng Fandong, Feng Yang, Xie Wanying, Zhou Jie, Yu Dong
- Dialogbert: Discourse-aware Response Generation Via Learning To Recover And Rank Utterances Gu Xiaodong, Yoo Kang Min, Ha Jung-woo
- UNION: An Unreferenced Metric For Evaluating Open-ended Story Generation Guan Jian, Huang Minlie
- Sequence-level Mixed Sample Data Augmentation Guo Demi, Kim Yoon, Rush Alexander M.
- Incorporating BERT Into Parallel Sequence Decoding With Adapters Guo Junliang, Zhang Zhirui, Xu Linli, Wei Hao-ran, Chen Boxing, Chen Enhong
- GMAT: Global Memory Augmentation For Transformers Gupta Ankit, Berant Jonathan
- BERT Based Multilingual Machine Comprehension In English And Hindi Gupta Somil, Khade Nilesh
- Conversational Machine Comprehension: A Literature Review Gupta Somil, Rawat Bhanu Pratap Singh, Yu Hong
- Improvement Of A Dedicated Model For Open Domain Persona-aware Dialogue Generation Han Qiang
- ECONET: Effective Continual Pretraining Of Language Models For Event Temporal Reasoning Han Rujun, Ren Xiang, Peng Nanyun
- Ranking Enhanced Dialogue Generation Hao Changying, Pang Liang, Lan Yanyan, Sun Fei, Guo Jiafeng, Cheng Xueqi
- Exploring Explainable Selection To Control Abstractive Summarization Wang Haonan, Gao Yang, Bai Yu, Lapata Mirella, Huang Heyan
- Enriched Pre-trained Transformers For Joint Slot Filling And Intent Detection Hardalov Momchil, Koychev Ivan, Nakov Preslav
- Guided Transformer: Leveraging Multiple External Sources For Representation Learning In Conversational Search Hashemi Helia, Zamani Hamed, Croft W. Bruce
- Investigating Representations Of Verb Bias In Neural Language Models Hawkins Robert D., Yamakoshi Takateru, Griffiths Thomas L., Goldberg Adele E.
- Pretrained Transformers Improve Out-of-distribution Robustness Hendrycks Dan, Liu Xiaoyuan, Wallace Eric, Dziedzic Adam, Krishnan Rishabh, Song Dawn
- Sub-instruction Aware Vision-and-language Navigation Hong Yicong, Rodriguez-opazo Cristian, Wu Qi, Gould Stephen
- PAIR: Planning And Iterative Refinement In Pre-trained Transformers For Long Text Generation Hua Xinyu, Wang Lu
- Less Is More: Data-efficient Complex Question Answering Over Knowledge Bases Hua Yuncheng, Li Yuan-fang, Qi Guilin, Wu Wei, Zhang Jingyao, Qi Daiqing
- TRANS-BLSTM: Transformer With Bidirectional LSTM For Language Understanding Huang Zhiheng, Xu Peng, Liang Davis, Mishra Ajay, Xiang Bing
- Exploring BERT Parameter Efficiency On The Stanford Question Answering Dataset V2.0 Hulburd Eric
- Memory Attentive Fusion: External Language Model Integration For Transformer-based Sequence-to-sequence Model Ihori Mana, Masumura Ryo, Makishima Naoki, Tanaka Tomohiro, Takashima Akihiko, Orihashi Shota
- Probing Pretrained Language Models For Lexical Semantics Vulić Ivan, Ponti Edoardo Maria, Litschko Robert, Glavaš Goran, Korhonen Anna
- Contrastive Code Representation Learning Jain Paras, Jain Ajay, Zhang Tianjun, Abbeel Pieter, Gonzalez Joseph E., Stoica Ion
- Scaling Laws For Neural Language Models Kaplan Jared, Mccandlish Sam, Henighan Tom, Brown Tom B., Chess Benjamin, Child Rewon, Gray Scott, Radford Alec, Wu Jeffrey, Amodei Dario
- Are Natural Language Inference Models Imppressive? Learning Implicature And Presupposition Jeretic Paloma, Warstadt Alex, Bhooshan Suvrat, Williams Adina
- In Defense Of Grid Features For Visual Question Answering Jiang Huaizu, Misra Ishan, Rohrbach Marcus, Learned-miller Erik, Chen Xinlei
- TLDR: Token Loss Dynamic Reweighting For Reducing Repetitive Utterance Generation Jiang Shaojie, Wolf Thomas, Monz Christof, De Rijke Maarten
- How Can We Know When Language Models Know? On The Calibration Of Language Models For Question Answering Jiang Zhengbao, Araki Jun, Ding Haibo, Neubig Graham
- Text-to-text Pre-training For Data-to-text Tasks Kale Mihir, Rastogi Abhinav
- Meta-context Transformers For Domain-specific Response Generation Kar Debanjana, Samanta Suranjana, Azad Amar Prakash
- Do Fine-tuned Commonsense Language Models Really Generalize? Kejriwal Mayank, Shen Ke
- Unsupervised Bitext Mining And Translation Via Self-trained Contextual Embeddings Keung Phillip, Salazar Julian, Lu Yichao, Smith Noah A.
- SMRT Chatbots: Improving Non-task-oriented Dialog With Simulated Multiple Reference Training Khayrallah Huda, Sedoc João
- Code Prediction By Feeding Trees To Transformers Kim Seohyun, Zhao Jinman, Tian Yuchi, Chandra Satish
- Current Limitations Of Language Models: What You Need Is Retrieval Komatsuzaki Aran
- Syntactic Structure Distillation Pretraining For Bidirectional Encoders Kuncoro Adhiguna, Kong Lingpeng, Fried Daniel, Yogatama Dani, Rimell Laura, Dyer Chris, Blunsom Phil
- Fid-ex: Improving Sequence-to-sequence Models For Extractive Rationale Generation Lakhotia Kushal, Paranjape Bhargavi, Ghoshal Asish, Yih Wen-tau, Mehdad Yashar, Iyer Srinivasan
- PONE: A Novel Automatic Evaluation Metric For Open-domain Generative Dialogue Systems Lan Tian, Mao Xian-ling, Wei Wei, Gao Xiaoyan, Huang Heyan
- Which Kind Is Better In Open-domain Multi-turn Dialog, Hierarchical Or Non-hierarchical Models? An Empirical Study Lan Tian, Mao Xian-ling, Wei Wei, Huang Heyan
- Utilizing Bidirectional Encoder Representations From Transformers For Answer Selection Laskar Md Tahmid Rahman, Hoque Enamul, Huang Jimmy Xiangji
- Context-based Transformer Models For Answer Sentence Selection Lauriola Ivano, Moschitti Alessandro
- Common Sense Or World Knowledge? Investigating Adapter-based Knowledge Injection Into Pretrained Transformers Lauscher Anne, Majewska Olga, Ribeiro Leonardo F. R., Gurevych Iryna, Rozanov Nikolai, Glavaš Goran
- From Zero To Hero: On The Limitations Of Zero-shot Cross-lingual Transfer With Multilingual Transformers Lauscher Anne, Ravishankar Vinit, Vulić Ivan, Glavaš Goran
- LIMEADE: From AI Explanations To Advice Taking Lee Benjamin Charles Germain, Downey Doug, Lo Kyle, Weld Daniel S.
- SLM: Learning A Discourse Language Representation With Sentence Unshuffling Lee Haejun, Hudson Drew A., Lee Kangwook, Manning Christopher D.
- MART: Memory-augmented Recurrent Transformer For Coherent Video Paragraph Captioning Lei Jie, Wang Liwei, Shen Yelong, Yu Dong, Berg Tamara L., Bansal Mohit
- Efficient Transformer-based Large Scale Language Representations Using Hardware-friendly Block Structured Pruning Li Bingbing, Kong Zhenglun, Zhang Tianyun, Li Ji, Li Zhengang, Liu Hang, Ding Caiwen
- Transformers To Learn Hierarchical Contexts In Multiparty Dialogue For Span-based Question Answering Li Changmao, Choi Jinho D.
- Linguistically-informed Transformations (LIT): A Method For Automatically Generating Contrast Sets Li Chuanrong, Lin Shengshuo, Liu Leo Z., Wu Xinyi, Zhou Xuhui, Steinert-threlkeld Shane
- Optimus: Organizing Sentences Via Pre-trained Modeling Of A Latent Space Li Chunyuan, Gao Xiang, Li Yuan, Peng Baolin, Li Xiujun, Zhang Yizhe, Gao Jianfeng
- A Closer Look At The Robustness Of Vision-and-language Pre-trained Models Li Linjie, Gan Zhe, Liu Jingjing
- An Empirical Investigation Of Pre-trained Transformer Language Models For Open-domain Dialogue Generation Li Piji
- TMT: A Transformer-based Modal Translator For Improving Multimodal Sequence Representations In Audio Visual Scene-aware Dialog Li Wubo, Jiang Dongwei, Zou Wei, Li Xiangang
- Multilingual Speech Translation With Efficient Finetuning Of Pretrained Models Li Xian, Wang Changhan, Tang Yun, Tran Chau, Tang Yuqing, Pino Juan, Baevski Alexei, Conneau Alexis, Auli Michael
- Enhancing Dialogue Generation Via Multi-level Contrastive Learning Li Xin, Li Piji, Wang Yan, Liu Xiaojiang, Lam Wai
- Mapping Natural Language Instructions To Mobile UI Action Sequences Li Yang, He Jiacong, Zhou Xin, Zhang Yuan, Baldridge Jason
- Rewriter-evaluator Architecture For Neural Machine Translation Li Yangming, Yao Kaisheng
- Neural Machine Translation With Joint Representation Li Yanyang, Wang Qiang, Xiao Tong, Liu Tongran, Zhu Jingbo
- A Comparison Of Pre-trained Vision-and-language Models For Multimodal Representation Learning Across Medical Images And Reports Li Yikuan, Wang Hanyin, Luo Yuan
- Toward Interpretability Of Dual-encoder Models For Dialogue Response Suggestions Li Yitong, Li Dianqi, Prakash Sushant, Wang Peng
- Bridging Text And Video: A Universal Multimodal Transformer For Video-audio Scene-aware Dialog Li Zekang, Li Zongjia, Zhang Jinchao, Feng Yang, Niu Cheng, Zhou Jie
- Train Large, Then Compress: Rethinking Model Size For Efficient Training And Inference Of Transformers Li Zhuohan, Wallace Eric, Shen Sheng, Lin Kevin, Keutzer Kurt, Klein Dan, Gonzalez Joseph E.
- XGLUE: A New Benchmark Dataset For Cross-lingual Pre-training, Understanding And Generation Liang Yaobo, Duan Nan, Gong Yeyun, Wu Ning, Guo Fenfei, Qi Weizhen, Gong Ming, Shou Linjun, Jiang Daxin, Cao Guihong, Fan Xiaodong, Zhang Ruofei, Agrawal Rahul, Cui Edward, Wei Sining, Bharti Taroon, Qiao Ying, Chen Jiun-hung, Wu Winnie, Liu Shuguang, Yang Fan, Campos Daniel, Majumder Rangan, Zhou Ming
- Probabilistically Masked Language Model Capable Of Autoregressive Generation In Arbitrary Word Order Liao Yi, Jiang Xin, Liu Qun
- Autoregressive Knowledge Distillation Through Imitation Learning Lin Alexander, Wohlwend Jeremy, Chen Howard, Lei Tao
- Multi-node Bert-pretraining: Cost-efficient Approach Lin Jiahuang, Li Xin, Pekhimenko Gennady
- Conversational Question Reformulation Via Sequence-to-sequence Architectures And Pretrained Language Models Lin Sheng-chieh, Yang Jheng-hong, Nogueira Rodrigo, Tsai Ming-feng, Wang Chuan-ju, Lin Jimmy
- Readonce Transformers: Reusable Representations Of Text For Transformers Lin Shih-ting, Sabharwal Ashish, Khot Tushar
- Variational Transformers For Diverse Response Generation Lin Zhaojiang, Winata Genta Indra, Xu Peng, Liu Zihan, Fung Pascale
- Mt5: A Massively Multilingual Pre-trained Text-to-text Transformer Xue Linting, Constant Noah, Roberts Adam, Kale Mihir, Al-rfou Rami, Siddhant Aditya, Barua Aditya, Raffel Colin
- TIME: Text And Image Mutual-translation Adversarial Networks Liu Bingchen, Song Kunpeng, Zhu Yizhe, De Melo Gerard, Elgammal Ahmed
- Rikinet: Reading Wikipedia Pages For Natural Question Answering Liu Dayiheng, Gong Yeyun, Fu Jie, Yan Yu, Chen Jiusheng, Jiang Daxin, Lv Jiancheng, Duan Nan
- Tell Me How To Ask Again: Question Data Augmentation With Controllable Rewriting In Continuous Space Liu Dayiheng, Gong Yeyun, Fu Jie, Yan Yu, Chen Jiusheng, Lv Jiancheng, Duan Nan, Zhou Ming
- Understanding The Difficulty Of Training Transformers Liu Liyuan, Liu Xiaodong, Gao Jianfeng, Chen Weizhu, Han Jiawei
- Qiaoning At Semeval-2020 Task 4: Commonsense Validation And Explanation System Based On Ensemble Of Language Model Liu Pai
- Adversarial Training For Large Neural Language Models Liu Xiaodong, Cheng Hao, He Pengcheng, Chen Weizhu, Wang Yu, Poon Hoifung, Gao Jianfeng
- Very Deep Transformers For Neural Machine Translation Liu Xiaodong, Duh Kevin, Liu Liyuan, Gao Jianfeng
- KG-BART: Knowledge Graph-augmented BART For Generative Commonsense Reasoning Liu Ye, Wan Yao, He Lifang, Peng Hao, Yu Philip S.
- Simplifying Paragraph-level Question Generation Via Transformer Language Models Lopez Luis Enrico, Cruz Diane Kathryn, Cruz Jan Christian Blaise, Cheng Charibeth
- VECO: Variable And Flexible Cross-lingual Pre-training For Language Understanding And Generation Luo Fuli, Wang Wei, Liu Jiahao, Liu Yijia, Bi Bin, Huang Songfang, Huang Fei, Si Luo
- Charbert: Character-aware Pre-trained Language Model Ma Wentao, Cui Yiming, Si Chenglei, Liu Ting, Wang Shijin, Hu Guoping
- Asking Complex Questions With Multi-hop Answer-focused Reasoning Ma Xiyao, Zhu Qile, Zhou Yanlin, Li Xiaolin, Wu Dapeng
- Language Models As Few-shot Learner For Task-oriented Dialogue Systems Madotto Andrea, Liu Zihan, Lin Zhaojiang, Fung Pascale
- Incremental Processing In The Age Of Non-incremental Encoders: An Empirical Assessment Of Bidirectional Models For Incremental NLU Madureira Brielen, Schlangen David
- Gpt-too: A Language-model-first Approach For Amr-to-text Generation Mager Manuel, Astudillo Ramon Fernandez, Naseem Tahira, Sultan Md Arafat, Lee Young-suk, Florian Radu, Roukos Salim
- Verb Knowledge Injection For Multilingual Event Processing Majewska Olga, Vulić Ivan, Glavaš Goran, Ponti Edoardo M., Korhonen Anna
- Recobert: A Catalog Language Model For Text-based Recommendations Malkiel Itzik, Barkan Oren, Caciularu Avi, Razin Noam, Katz Ori, Koenigstein Noam
- Capturing Document Context Inside Sentence-level Neural Machine Translation Models With Self-training Mansimov Elman, Melis Gábor, Yu Lei
- Ladabert: Lightweight Adaptation Of BERT Through Hybrid Model Compression Mao Yihuan, Wang Yujing, Wu Chufan, Zhang Chen, Wang Yang, Yang Yaming, Zhang Quanlu, Tong Yunhai, Bai Jing
- Natural Language Rationales With Full-stack Visual Reasoning: From Pixels To Semantic Frames To Commonsense Graphs Marasović Ana, Bhagavatula Chandra, Park Jae Sung, Bras Ronan Le, Smith Noah A., Choi Yejin
- Dialoglue: A Natural Language Understanding Benchmark For Task-oriented Dialogue Mehri Shikib, Eric Mihail, Hakkani-tur Dilek
- Delight: Deep And Light-weight Transformer Mehta Sachin, Ghazvininejad Marjan, Iyer Srinivasan, Zettlemoyer Luke, Hajishirzi Hannaneh
- Generation-distillation For Efficient Natural Language Understanding In Low-data Settings Melas-kyriazi Luke, Han George, Liang Celine
- Wechat Neural Machine Translation Systems For WMT20 Meng Fandong, Yan Jianhao, Liu Yijin, Gao Yuan, Zeng Xianfeng, Zeng Qinsong, Li Peng, Chen Ming, Zhou Jie, Liu Sifan, Zhou Hao
- What Happens To BERT Embeddings During Fine-tuning? Merchant Amil, Rahimtoroghi Elahe, Pavlick Ellie, Tenney Ian
- ADER: Adaptively Distilled Exemplar Replay Towards Continual Learning For Session-based Recommendation Mi Fei, Lin Xiaoyu, Faltings Boi
- Lambert: Language And Action Learning Using Multimodal BERT Miyazawa Kazuki, Aoki Tatsuya, Horii Takato, Nagai Takayuki
- On Incorporating Structural Information To Improve Dialogue Response Generation Moghe Nikita, Vijayan Priyesh, Ravindran Balaraman, Khapra Mitesh M.
- Modern Methods For Text Generation Montesinos Dimas Munoz
- On The Stability Of Fine-tuning BERT: Misconceptions, Explanations, And Strong Baselines Mosbach Marius, Andriushchenko Maksym, Klakow Dietrich
- NABU - Multilingual Graph-based Neural RDF Verbalizer Moussallem Diego, Gnaneshwar Dwaraknath, Ferreira Thiago Castro, Ngomo Axel-cyrille Ngonga
- Axformer: Accuracy-driven Approximation Of Transformers For Faster, Smaller And More Accurate NLP Models Nagarajan Amrit, Sen Sanchari, Stevens Jacob R., Raghunathan Anand
- Answer Generation Through Unified Memories Over Multiple Passages Nakatsuji Makoto, Okui Sohei
- Stepwise Extractive Summarization And Planning With Structured Transformers Narayan Shashi, Maynez Joshua, Adamek Jakub, Pighin Daniele, Bratanič Blaž, Mcdonald Ryan
- I Like Fish, Especially Dolphins: Addressing Contradictions In Dialogue Modeling Nie Yixin, Williamson Mary, Bansal Mohit, Kiela Douwe, Weston Jason
- Avgout: A Simple Output-probability Measure To Eliminate Dull Responses Niu Tong, Bansal Mohit
- Real-time Execution Of Large-scale Language Models On Mobile Niu Wei, Kong Zhenglun, Yuan Geng, Jiang Weiwen, Guan Jiexiong, Ding Caiwen, Zhao Pu, Liu Sijia, Ren Bin, Wang Yanzhi
- The Chess Transformer: Mastering Play Using Generative Language Models Noever David, Ciolino Matt, Kalin Josh
- Better Distractions: Transformer-based Distractor Generation And Multiple Choice Question Filtering Offerijns Jeroen, Verberne Suzan, Verhoef Tessa
- Alleviating The Burden Of Labeling: Sentence Generation By Attention Branch Encoder-decoder Network Ogura Tadashi, Magassouba Aly, Sugiura Komei, Hirakawa Tsubasa, Yamashita Takayoshi, Fujiyoshi Hironobu, Kawai Hisashi
- Facts2story: Controlling Text Generation By Key Facts Orbach Eyal, Goldberg Yoav
- Challenges And Thrills Of Legal Arguments Pallaprolu Anurag, Vaidya Radha, Attawar Aditya Swaroop
- Multilingual BERT Post-pretraining Alignment Pan Lin, Hang Chung-wei, Qi Haode, Shah Abhishek, Potdar Saloni, Yu Mo
- X-linear Attention Networks For Image Captioning Pan Yingwei, Yao Ting, Li Yehao, Mei Tao
- Multi-view Attention Network For Visual Dialog Park Sungjin, Whang Taesun, Yoon Yeochan, Lim Heuiseok
- On Task-level Dialogue Composition Of Generative Transformer Model Parthasarathi Prasanna, Neelakantan Arvind, Narang Sharan
- How To Evaluate Your Dialogue System: Probe Tasks As An Alternative For Token-level Evaluation Metrics Parthasarathi Prasanna, Pineau Joelle, Chandar Sarath
- Telling Bert's Full Story: From Local Attention To Global Aggregation Pascual Damian, Brunner Gino, Wattenhofer Roger
- Retrieval-augmented Generation For Knowledge-intensive NLP Tasks Lewis Patrick, Perez Ethan, Piktus Aleksandra, Petroni Fabio, Karpukhin Vladimir, Goyal Naman, Küttler Heinrich, Lewis Mike, Yih Wen-tau, Rocktäschel Tim, Riedel Sebastian, Kiela Douwe
- SOLOIST: Building Task Bots At Scale With Transfer Learning And Machine Teaching Peng Baolin, Li Chunyuan, Li Jinchao, Shayandeh Shahin, Liden Lars, Gao Jianfeng
- What Does BERT Know About Books, Movies And Music? Probing BERT For Conversational Recommendation Penha Gustavo, Hauff Claudia
- Adapterhub: A Framework For Adapting Transformers Pfeiffer Jonas, Rücklé Andreas, Poth Clifton, Kamath Aishwarya, Vulić Ivan, Ruder Sebastian, Cho Kyunghyun, Gurevych Iryna
- MAD-X: An Adapter-based Framework For Multi-task Cross-lingual Transfer Pfeiffer Jonas, Vulić Ivan, Gurevych Iryna, Ruder Sebastian
- Sparsifying Transformer Models With Trainable Representation Pooling Pietruszka Michał, Borchmann Łukasz, Garncarek Łukasz
- Interactive Teaching For Conversational AI Ping Qing, Niu Feiyang, Thattai Govind, Chengottusseriyil Joel, Gao Qiaozi, Reganti Aishwarya, Rajagopal Prashanth, Tur Gokhan, Hakkani-tur Dilek, Nataraja Prem
- Compressing Transformer-based Semantic Parsing Models Using Compositional Code Embeddings Prakash Prafull, Shashidhar Saurabh Kumar, Zhao Wenlong, Rongali Subendhu, Khan Haidar, Kayser Michael
- Intermediate-task Transfer Learning With Pretrained Models For Natural Language Understanding: When And Why Does It Work? Pruksachatkun Yada, Phang Jason, Liu Haokun, Htut Phu Mon, Zhang Xiaoyi, Pang Richard Yuanzhe, Vania Clara, Kann Katharina, Bowman Samuel R.
- Jiant: A Software Toolkit For Research On General-purpose Text Understanding Models Pruksachatkun Yada, Yeres Phil, Liu Haokun, Phang Jason, Htut Phu Mon, Wang Alex, Tenney Ian, Bowman Samuel R.
- Training Large Neural Networks With Constant Memory Using A New Execution Algorithm Pudipeddi Bharadwaj, Mesmakhosroshahi Maral, Xi Jinwen, Bharadwaj Sujeeth
- Prophetnet: Predicting Future N-gram For Sequence-to-sequence Pre-training Qi Weizhen, Yan Yu, Gong Yeyun, Liu Dayiheng, Duan Nan, Chen Jiusheng, Zhang Ruofei, Zhou Ming
- Automatic Distractor Generation For Multiple Choice Questions In Standard Tests Qiu Zhaopeng, Wu Xian, Fan Wei
- How Fine Can Fine-tuning Be? Learning Efficient Language Models Radiya-dixit Evani, Wang Xin
- Exploring Recurrent, Memory And Attention Based Architectures For Scoring Interactional Aspects Of Human-machine Text Dialog Ramanarayanan Vikram, Mulholland Matthew, Ghosh Debanjan
- Transquest At WMT2020: Sentence-level Direct Assessment Ranasinghe Tharindu, Orasan Constantin, Mitkov Ruslan
- Hierarchical GPT With Congruent Transformers For Multi-sentence Language Models Roh Jihyeon, Gim Huiseong, Lee Soo-young
- Personalized Query Rewriting In Conversational AI Agents Roshan-ghias Alireza, Mathialagan Clint Solomon, Ponnusamy Pragaash, Mathias Lambert, Guo Chenlei
- Knowledge-aware Language Model Pretraining Rosset Corby, Xiong Chenyan, Phan Minh, Song Xia, Bennett Paul, Tiwary Saurabh
- Lareqa: Language-agnostic Answer Retrieval From A Multilingual Pool Roy Uma, Constant Noah, Al-rfou Rami, Barua Aditya, Phillips Aaron, Yang Yinfei
- Fine-tuning BERT For Schema-guided Zero-shot Dialogue State Tracking Ruan Yu-ping, Ling Zhen-hua, Gu Jia-chen, Liu Quan
- Stronger Transformers For Neural Multi-hop Question Generation Sachan Devendra Singh, Wu Lingfei, Sachan Mrinmaya, Hamilton William
- Prover: Proof Generation For Interpretable Reasoning Over Rules Saha Swarnadeep, Ghosh Sayan, Srivastava Shashank, Bansal Mohit
- On The Effect Of Dropping Layers Of Pre-trained Transformer Models Sajjad Hassan, Dalvi Fahim, Durrani Nadir, Nakov Preslav
- Hierarchical Transformer For Task Oriented Dialog Systems Santra Bishal, Potnuru Anusha, Goyal Pawan
- It's Not Just Size That Matters: Small Language Models Are Also Few-shot Learners Schick Timo, Schütze Hinrich
- What BERT Sees: Cross-modal Transfer For Visual Question Generation Scialom Thomas, Bordes Patrick, Dray Paul-alexis, Staiano Jacopo, Gallinari Patrick
- BLEURT: Learning Robust Metrics For Text Generation Sellam Thibault, Das Dipanjan, Parikh Ankur P.
- Look Before You Speak: Visually Contextualized Utterances Seo Paul Hongsuck, Nagrani Arsha, Schmid Cordelia
- Reasoning Over History: Context Aware Visual Dialog Shah Muhammad A., Mehri Shikib, Srinivasan Tejas
- End-to-end Synthetic Data Generation For Domain Adaptation Of Question Answering Systems Shakeri Siamak, Santos Cicero Nogueira Dos, Zhu Henry, Ng Patrick, Nan Feng, Wang Zhiguo, Nallapati Ramesh, Xiang Bing
- Data-efficient Methods For Dialogue Systems Shalyminov Igor
- Hybrid Generative-retrieval Transformers For Dialogue Domain Adaptation Shalyminov Igor, Sordoni Alessandro, Atkinson Adam, Schulz Hannes
- Russiansuperglue: A Russian Language Understanding Evaluation Benchmark Shavrina Tatiana, Fenogenova Alena, Emelyanov Anton, Shevelev Denis, Artemova Ekaterina, Malykh Valentin, Mikhailov Vladislav, Tikhonova Maria, Chertok Andrey, Evlampiev Andrey
- A Simple But Tough-to-beat Data Augmentation Approach For Natural Language Understanding And Generation Shen Dinghan, Zheng Mingzhi, Shen Yelong, Qu Yanru, Chen Weizhu
- Neural Data-to-text Generation Via Jointly Learning The Segmentation And Correspondence Shen Xiaoyu, Chang Ernie, Su Hui, Zhou Jie, Klakow Dietrich
- Pre-trained Summarization Distillation Shleifer Sam, Rush Alexander M.
- Saying No Is An Art: Contextualized Fallback Responses For Unanswerable Dialogue Queries Shrivastava Ashish, Dhole Kaustubh, Bhatt Abhinav, Raghunath Sharvani
- Unnatural Language Inference Sinha Koustuv, Parthasarathi Prasanna, Pineau Joelle, Williams Adina
- Lightpaff: A Two-stage Distillation Framework For Pre-training And Fine-tuning Song Kaitao, Sun Hao, Tan Xu, Qin Tao, Lu Jianfeng, Liu Hongzhi, Liu Tie-yan
- Neural Machine Translation With Error Correction Song Kaitao, Tan Xu, Lu Jianfeng
- An Investigation Of Language Model Interpretability Via Sentence Editing Stevens Samuel, Su Yu
- Recipes For Adapting Pre-trained Monolingual And Multilingual Models To Machine Translation Stickland Asa Cooper, Li Xian, Ghazvininejad Marjan
- Bert-hlstms: BERT And Hierarchical Lstms For Visual Storytelling Su Jing, Dai Qingyun, Guerin Frank, Zhou Mian
- Discovering Useful Sentence Representations From Large Pretrained Language Models Subramani Nishant, Suresh Nivedita
- Multi-scale Transformer Language Models Subramanian Sandeep, Collobert Ronan, Ranzato Marc'aurelio, Boureau Y-lan
- Benchmarking Machine Reading Comprehension: A Psychological Perspective Sugawara Saku, Stenetorp Pontus, Aizawa Akiko
- Explain And Improve: Lrp-inference Fine-tuning For Image Captioning Models Sun Jiamei, Lapuschkin Sebastian, Samek Wojciech, Binder Alexander
- Intellicode Compose: Code Generation Using Transformer Svyatkovskiy Alexey, Deng Shao Kun, Fu Shengyu, Sundaresan Neel
- Proofwriter: Generating Implications, Proofs, And Abductive Statements Over Natural Language Tafjord Oyvind, Mishra Bhavana Dalvi, Clark Peter
- Let's Stop Incorrect Comparisons In End-to-end Relation Extraction! Taillé Bruno, Guigue Vincent, Scoutheeten Geoffrey, Gallinari Patrick
- Edgebert: Sentence-level Energy Optimizations For Latency-aware Multi-task NLP Inference Tambe Thierry, Hooper Coleman, Pentecost Lillian, Jia Tianyu, Yang En-yu, Donato Marco, Sanh Victor, Whatmough Paul N., Rush Alexander M., Brooks David, Wei Gu-yeon
- Interpretable Neural Computation For Real-world Compositional Visual Question Answering Tang Ruixue, Ma Chao
- Synthesizer: Rethinking Self-attention In Transformer Models Tay Yi, Bahri Dara, Metzler Donald, Juan Da-cheng, Zhao Zhe, Zheng Che
- Making Pre-trained Language Models Better Few-shot Learners Tianyu Gao, Adam Fisch, Danqi Chen
- Language Models Are Few-shot Learners Tom B. Brown, Benjamin Mann, Nick Ryder, Melanie Subbiah, Jared Kaplan, Prafulla Dhariwal, Arvind Neelakantan, Pranav Shyam, Girish Sastry, Amanda Askell, Sandhini Agarwal, Ariel Herbert-voss, Gretchen Krueger, Tom Henighan, Rewon Child, Aditya Ramesh, Daniel M. Ziegler, Jeffrey Wu, Clemens Winter, Christopher Hesse, Mark Chen, Eric Sigler, Mateusz Litwin, Scott Gray, Benjamin Chess, Jack Clark, Christopher Berner, Sam Mccandlish, Alec Radford, Ilya Sutskever, Dario Amodei
- Cycle Text-to-image GAN With BERT Tsue Trevor, Sen Samir, Li Jason
- An Empirical Study On Robustness To Spurious Correlations Using Pre-trained Language Models Tu Lifu, Lalwani Garima, Gella Spandana, He He
- On Optimal Transformer Depth For Low-resource Language Translation Van Biljon Elan, Pretorius Arnu, Kreutzer Julia
- Bertnesia: Investigating The Capture And Forgetting Of Knowledge In BERT Wallat Jonas, Singh Jaspreet, Anand Avishek
- Multi-task Learning With Multi-head Attention For Multi-choice Reading Comprehension Wan Hui
- Infobert: Improving Robustness Of Language Models From An Information Theoretic Perspective Wang Boxin, Wang Shuohang, Cheng Yu, Gan Zhe, Jia Ruoxi, Li Bo, Liu Jingjing
- HAT: Hardware-aware Transformers For Efficient Natural Language Processing Wang Hanrui, Wu Zhanghao, Liu Zhijian, Cai Han, Zhu Ligeng, Gan Chuang, Han Song
- SRQA: Synthetic Reader For Factoid Question Answering Wang Jiuniu, Xu Wenjia, Fu Xingyu, Wei Yang, Jin Li, Chen Ziyan, Xu Guangluan, Wu Yirong
- Encoding Syntactic Knowledge In Transformer Encoder For Intent Detection And Slot Filling Wang Jixuan, Wei Kai, Radfar Martin, Zhang Weiwei, Chung Clement
- Minilm: Deep Self-attention Distillation For Task-agnostic Compression Of Pre-trained Transformers Wang Wenhui, Wei Furu, Dong Li, Bao Hangbo, Yang Nan, Zhou Ming
- Rethinking The Value Of Transformer Components Wang Wenxuan, Tu Zhaopeng
- Lightseq: A High Performance Inference Library For Transformers Wang Xiaohui, Xiong Ying, Wei Yang, Wang Mingxuan, Li Lei
- VD-BERT: A Unified Vision And Dialog Transformer With BERT Wang Yue, Joty Shafiq, Lyu Michael R., King Irwin, Xiong Caiming, Hoi Steven C. H.
- Towards Faithful Neural Table-to-text Generation With Content-matching Constraints Wang Zhenyi, Wang Xiaoyang, An Bang, Yu Dong, Chen Changyou
- On Modality Bias In The TVQA Dataset Winterbottom Thomas, Xiao Sarah, Mclean Alistair, Moubayed Noura Al
- Contextual Neural Machine Translation Improves Translation Of Cataphoric Pronouns Wong Kayyen, Maruf Sameen, Haffari Gholamreza
- Cort: Complementary Rankings From Transformers Wrzalik Marco, Krechel Dirk
- TOD-BERT: Pre-trained Natural Language Understanding For Task-oriented Dialogue Wu Chien-sheng, Hoi Steven, Socher Richard, Xiong Caiming
- A Controllable Model Of Grounded Response Generation Wu Zeqiu, Galley Michel, Brockett Chris, Zhang Yizhe, Gao Xiang, Quirk Chris, Koncel-kedziorski Rik, Gao Jianfeng, Hajishirzi Hannaneh, Ostendorf Mari, Dolan Bill
- Lite Transformer With Long-short Range Attention Wu Zhanghao, Liu Zhijian, Lin Ji, Lin Yujun, Han Song
- Context-guided BERT For Targeted Aspect-based Sentiment Analysis Wu Zhengxuan, Ong Desmond C.
- CG-BERT: Conditional Text Generation With BERT For Generalized Few-shot Intent Detection Xia Congying, Zhang Chenwei, Nguyen Hoang, Zhang Jiawei, Yu Philip
- ERNIE-GEN: An Enhanced Multi-flow Pre-training And Fine-tuning Framework For Natural Language Generation Xiao Dongling, Zhang Han, Li Yukun, Sun Yu, Tian Hao, Wu Hua, Wang Haifeng
- Generating Informative Dialogue Responses With Keywords-guided Networks Xu Heng-da, Mao Xian-ling, Chi Zewen, Zhu Jing-jing, Sun Fanshu, Huang Heyan
- EDITOR: An Edit-based Transformer With Repositioning For Neural Machine Translation With Soft Lexical Constraints Xu Weijia, Carpuat Marine
- Improving BERT Fine-tuning Via Self-ensemble And Self-distillation Xu Yige, Qiu Xipeng, Zhou Ligao, Huang Xuanjing
- MC-BERT: Efficient Language Pre-training Via A Meta Controller Xu Zhenhui, Gong Linyuan, Ke Guolin, He Di, Zheng Shuxin, Wang Liwei, Bian Jiang, Liu Tie-yan
- Multi-unit Transformers For Neural Machine Translation Yan Jianhao, Meng Fandong, Zhou Jie
- Off-policy Self-critical Training For Transformer In Visual Paragraph Generation Yan Shiyang, Hua Yang, Robertson Neil M.
- Just Ask: Learning To Answer Questions From Millions Of Narrated Videos Yang Antoine, Miech Antoine, Sivic Josef, Laptev Ivan, Schmid Cordelia
- Seqdialn: Sequential Visual Dialog Networks In Joint Visual-linguistic Representation Space Yang Liu
- IART: Intent-aware Response Ranking With Transformers In Information-seeking Conversation Systems Yang Liu, Qiu Minghui, Qu Chen, Chen Cen, Guo Jiafeng, Zhang Yongfeng, Croft W. Bruce, Chen Haiqing
- On The Generation Of Medical Dialogues For COVID-19 Yang Wenmian, Zeng Guangtao, Tan Bowen, Ju Zeqian, Chakravorty Subrato, He Xuehai, Chen Shu, Yang Xingyi, Wu Qingyang, Yu Zhou, Xing Eric, Xie Pengtao
- On The Sub-layer Functionalities Of Transformer Decoder Yang Yilin, Wang Longyue, Shi Shuming, Tadepalli Prasad, Lee Stefan, Tu Zhaopeng
- Contrastive Triple Extraction With Generative Transformer Ye Hongbin, Zhang Ningyu, Deng Shumin, Chen Mosha, Tan Chuanqi, Huang Fei, Chen Huajun
- Adding Recurrence To Pretrained Transformers For Improved Efficiency And Context Size Yoshida Davis, Ettinger Allyson, Gimpel Kevin
- Hard-coded Gaussian Attention For Neural Machine Translation You Weiqiu, Sun Simeng, Iyyer Mohit
- Cocolm: Complex Commonsense Enhanced Language Model With Discourse Relations Yu Changlong, Zhang Hongming, Song Yangqiu, Ng Wilfred
- Assessing Phrasal Representation And Composition In Transformers Yu Lang, Ettinger Allyson
- Few-shot Generative Conversational Query Rewriting Yu Shi, Liu Jiahua, Yang Jingqin, Xiong Chenyan, Bennett Paul, Gao Jianfeng, Liu Zhiyuan
- Clinical Reading Comprehension: A Thorough Analysis Of The Emrqa Dataset Yue Xiang, Gutierrez Bernal Jimenez, Sun Huan
- GOBO: Quantizing Attention-based NLP Models For Low Latency And Energy Efficient Inference Zadeh Ali Hadi, Edo Isak, Awad Omar Mohamed, Moshovos Andreas
- Big Bird: Transformers For Longer Sequences Zaheer Manzil, Guruganesh Guru, Dubey Avinava, Ainslie Joshua, Alberti Chris, Ontanon Santiago, Pham Philip, Ravula Anirudh, Wang Qifan, Yang Li, Ahmed Amr
- Emptransfo: A Multi-head Transformer Architecture For Creating Empathetic Dialog Systems Zandie Rohola, Mahoor Mohammad H.
- Turingadvice: A Generative And Dynamic Evaluation Of Language Use Zellers Rowan, Holtzman Ari, Clark Elizabeth, Qin Lianhui, Farhadi Ali, Choi Yejin
- Open-domain Dialogue Generation Based On Pre-trained Language Models Zeng Yan, Nie Jian-yun
- A Simple And Efficient Multi-task Learning Approach For Conditioned Dialogue Generation Zeng Yan, Nie Jian-yun
- Modeling Topical Relevance For Multi-turn Dialogue Generation Zhang Hainan, Lan Yanyan, Pang Liang, Chen Hongshen, Ding Zhuoye, Yin Dawei
- Token Drop Mechanism For Neural Machine Translation Zhang Huaao, Qiu Shigui, Duan Xiangyu, Zhang Min
- Accelerating Training Of Transformer-based Language Models With Progressive Layer Dropping Zhang Minjia, He Yuxiong
- Future-guided Incremental Transformer For Simultaneous Translation Zhang Shaolei, Feng Yang, Li Liangyou
- Trojaning Language Models For Fun And Profit Zhang Xinyang, Zhang Zheng, Ji Shouling, Wang Ting
- When Do You Need Billions Of Words Of Pretraining Data? Zhang Yian, Warstadt Alex, Li Haau-sing, Bowman Samuel R.
- POINTER: Constrained Progressive Text Generation Via Insertion-based Generative Pre-training Zhang Yizhe, Wang Guoyin, Li Chunyuan, Gan Zhe, Brockett Chris, Dolan Bill
- BERT-JAM: Boosting Bert-enhanced Neural Machine Translation With Joint Attention Zhang Zhebin, Wu Sai, Jiang Dawei, Chen Gang
- CPM: A Large-scale Generative Chinese Pre-trained Language Model Zhang Zhengyan, Han Xu, Zhou Hao, Ke Pei, Gu Yuxian, Ye Deming, Qin Yujia, Su Yusheng, Ji Haozhe, Guan Jian, Qi Fanchao, Wang Xiaozhi, Zheng Yanan, Zeng Guoyang, Cao Huanqi, Chen Shengqi, Li Daixuan, Sun Zhenbo, Liu Zhiyuan, Huang Minlie, Han Wentao, Tang Jie, Li Juanzi, Zhu Xiaoyan, Sun Maosong
- Know What You Don't Need: Single-shot Meta-pruning For Attention Heads Zhang Zhengyan, Qi Fanchao, Liu Zhiyuan, Liu Qun, Sun Maosong
- A Closer Look At Few-shot Crosslingual Transfer: The Choice Of Shots Matters Zhao Mengjie, Zhu Yi, Shareghi Ehsan, Vulić Ivan, Reichart Roi, Korhonen Anna, Schütze Hinrich
- Towards Making The Most Of Context In Neural Machine Translation Zheng Zaixiang, Yue Xiang, Huang Shujian, Chen Jiajun, Birch Alexandra
- Pre-training Text-to-text Transformers For Concept-centric Common Sense Zhou Wangchunshu, Lee Dong-ho, Selvam Ravi Kiran, Lee Seyeon, Lin Bill Yuchen, Ren Xiang
- Learning To Compare For Better Training And Evaluation Of Open Domain Natural Language Generation Models Zhou Wangchunshu, Xu Ke
- Modifying Memories In Transformer Models Zhu Chen, Rawat Ankit Singh, Zaheer Manzil, Bhojanapalli Srinadh, Li Daliang, Yu Felix, Kumar Sanjiv
- DUMA: Reading Comprehension With Transposition Thinking Zhu Pengfei, Zhao Hai, Li Xiaoguang
- Towards Understanding Sample Variance In Visually Grounded Language Generation: Evaluations And Observations Zhu Wanrong, Wang Xin Eric, Narayana Pradyumna, Sone Kazoo, Basu Sugato, Wang William Yang
- Vision-dialog Navigation By Exploring Cross-modal Memory Zhu Yi, Zhu Fengda, Zhan Zhaohuan, Lin Bingqian, Jiao Jianbin, Chang Xiaojun, Liang Xiaodan
- Regularizing Transformers With Deep Probabilistic Layers Aguilera Aurora Cobo, Olmos Pablo Martínez, Artés-rodríguez Antonio, Pérez-cruz Fernando
- Goal-directed Story Generation: Augmenting Generative Language Models With Reinforcement Learning Alabdulkarim Amal, Li Winston, Martin Lara J., Riedl Mark O.
- RAFT: A Real-world Few-shot Text Classification Benchmark Alex Neel, Lifland Eli, Tunstall Lewis, Thakur Abhishek, Maham Pegah, Riedel C. Jess, Hine Emmie, Ashurst Carolyn, Sedille Paul, Carlier Alexis, Noetel Michael, Stuhlmüller Andreas
- On The Multilingual Capabilities Of Very Large-scale English Language Models Armengol-estapé Jordi, Bonet Ona De Gibert, Melero Maite
- See, Hear, Read: Leveraging Multimodality With Guided Attention For Abstractive Text Summarization Atri Yash Kumar, Pramanick Shraman, Goyal Vikram, Chakraborty Tanmoy
- Re-evaluating Germeval17 Using German Pre-trained Language Models Aßenmacher M., Corvonato A., Heumann C.
- G-transformer For Document-level Machine Translation Bao Guangsheng, Zhang Yue, Teng Zhiyang, Chen Boxing, Luo Weihua
- S2s-ft: Fine-tuning Pretrained Transformer Encoders For Sequence-to-sequence Learning Bao Hangbo, Dong Li, Wang Wenhui, Yang Nan, Wei Furu
- PLATO-XL: Exploring The Large-scale Pre-training Of Dialogue Generation Bao Siqi, He Huang, Wang Fan, Wu Hua, Wang Haifeng, Wu Wenquan, Wu Zhihua, Guo Zhen, Lu Hua, Huang Xinxian, Tian Xin, Xu Xinchao, Lin Yingzhan, Niu Zheng-yu
- Non-autoregressive Translation By Learning Target Categorical Codes Bao Yu, Huang Shujian, Xiao Tong, Wang Dongqi, Dai Xinyu, Chen Jiajun
- Gabert -- An Irish Language Model Barry James, Wagner Joachim, Cassidy Lauren, Cowap Alan, Lynn Teresa, Walsh Abigail, Meachair Mícheál J. Ó, Foster Jennifer
- NLP-IIS@UT At Semeval-2021 Task 4: Machine Reading Comprehension Using The Long Document Transformer Basafa Hossein, Movahedi Sajad, Ebrahimi Ali, Shakery Azadeh, Faili Heshaam
- Building A Question And Answer System For News Domain Basu Sandipan, Gaddala Aravind, Chetan Pooja, Tiwari Garima, Darapaneni Narayana, Parvathaneni Sadwik, Paduri Anwesh Reddy
- Exploring Unsupervised Pretraining Objectives For Machine Translation Baziotis Christos, Titov Ivan, Birch Alexandra, Haddow Barry
- Continual Learning In Multilingual NMT Via Language-specific Embeddings Berard Alexandre
- Efficient Inference For Multilingual Neural Machine Translation Berard Alexandre, Lee Dain, Clinchant Stéphane, Jung Kweonwoo, Nikoulina Vassilina
- SHAQ: Single Headed Attention With Quasi-recurrence Bharwani Nashwin, Kushner Warren, Dandona Sangeet, Schreiber Ben
- On The Universality Of Deep Contextual Language Models Bhatt Shaily, Goyal Poonam, Dandapat Sandipan, Choudhury Monojit, Sitaram Sunayana
- Leveraging Redundancy In Attention With Reuse Transformers Bhojanapalli Srinadh, Chakrabarti Ayan, Veit Andreas, Lukasik Michal, Jain Himanshu, Liu Frederick, Chang Yin-wen, Kumar Sanjiv
- Latr: Layout-aware Transformer For Scene-text VQA Biten Ali Furkan, Litman Ron, Xie Yusheng, Appalaraju Srikar, Manmatha R.
- Improving Language Models By Retrieving From Trillions Of Tokens Borgeaud Sebastian, Mensch Arthur, Hoffmann Jordan, Cai Trevor, Rutherford Eliza, Millican Katie, Driessche George Van Den, Lespiau Jean-baptiste, Damoc Bogdan, Clark Aidan, Casas Diego De Las, Guy Aurelia, Menick Jacob, Ring Roman, Hennigan Tom, Huang Saffron, Maggiore Loren, Jones Chris, Cassirer Albin, Brock Andy, Paganini Michela, Irving Geoffrey, Vinyals Oriol, Osindero Simon, Simonyan Karen, Rae Jack W., Elsen Erich, Sifre Laurent
- What Changes Can Large-scale Language Models Bring? Intensive Study On Hyperclova: Billions-scale Korean Generative Pretrained Transformers Boseop Kim, Hyoungseok Kim, Sang-woo Lee, Gichang Lee, Donghyun Kwak, Dong Hyeon Jeon, Sunghyun Park, Sungju Kim, Seonhoon Kim, Dongpil Seo, Heungsub Lee, Minyoung Jeong, Sungjae Lee, Minsub Kim, Suk Hyun Ko, Seokhun Kim, Taeyong Park, Jinuk Kim, Soyoung Kang, Na-hyeon Ryu, Kang Min Yoo, Minsuk Chang, Soobin Suh, Sookyo In, Jinseong Park, Kyungduk Kim, Hiun Kim, Jisu Jeong, Yong Goo Yeo, Donghoon Ham, Dongju Park, Min Young Lee, Jaewook Kang, Inho Kang, Jung-woo Ha, Woomyoung Park, Nako Sung
- The Power Of Scale For Parameter-efficient Prompt Tuning Brian Lester, Rami Al-rfou, Noah Constant
- On The Prunability Of Attention Heads In Multilingual BERT Budhraja Aakriti, Pande Madhura, Kumar Pratyush, Khapra Mitesh M.
- Gaudí: Conversational Interactions With Deep Representations To Generate Image Collections Bursztyn Victor S., Healey Jennifer, Vinay Vishwa
- CDLM: Cross-document Language Modeling Caciularu Avi, Cohan Arman, Beltagy Iz, Peters Matthew E., Cattan Arie, Dagan Ido
- Long Context Question Answering Via Supervised Contrastive Learning Caciularu Avi, Dagan Ido, Goldberger Jacob, Cohan Arman
- Curriculum Learning For Language Modeling Campos Daniel
- Coarse-grained Decomposition And Fine-grained Interaction For Multi-hop Question Answering Cao Xing, Liu Yun
- Can Transformers Jump Around Right In Natural Language? Assessing Performance Transfer From SCAN Chaabouni Rahma, Dessì Roberto, Kharitonov Eugene
- The Selectgen Challenge: Finding The Best Training Samples For Few-shot Neural Text Generation Chang Ernie, Shen Xiaoyu, Marin Alex, Demberg Vera
- On Training Instance Selection For Few-shot Neural Text Generation Chang Ernie, Shen Xiaoyu, Yeh Hui-syuan, Demberg Vera
- Neural Data-to-text Generation With Lm-based Text Augmentation Chang Ernie, Shen Xiaoyu, Zhu Dawei, Demberg Vera, Su Hui
- Generic Attention-model Explainability For Interpreting Bi-modal And Encoder-decoder Transformers Chefer Hila, Gur Shir, Wolf Lior
- Topic Transferable Table Question Answering Chemmengath Saneem Ahmed, Kumar Vishwajeet, Bharadwaj Samarth, Sen Jaydeep, Canim Mustafa, Chakrabarti Soumen, Gliozzo Alfio, Sankaranarayanan Karthik
- Multimodal Incremental Transformer With Visual Grounding For Visual Dialogue Generation Chen Feilong, Meng Fandong, Chen Xiuyi, Li Peng, Zhou Jie
- Visualgpt: Data-efficient Adaptation Of Pretrained Language Models For Image Captioning Chen Jun, Guo Han, Yi Kai, Li Boyang, Elhoseiny Mohamed
- Reinforcement Learning For Few-shot Text Generation Adaptation Cheng Pengsen, Dai Jinqiao, Liu Jiamiao, Liu Jiayong, Jia Peng
- MT6: Multilingual Pretrained Text-to-text Transformer With Translation Pairs Chi Zewen, Dong Li, Ma Shuming, Huang Shaohan, Mao Xian-ling, Huang Heyan, Wei Furu
- Breaking Down Multilingual Machine Translation Chiang Ting-rui, Chen Yi-pei, Yeh Yi-ting, Neubig Graham
- Language Modeling Using Lmus: 10x Better Data Efficiency Or Improved Scaling Compared To Transformers Chilkuri Narsimha, Hunsberger Eric, Voelker Aaron, Malik Gurshaant, Eliasmith Chris
- Unifying Vision-and-language Tasks Via Text Generation Cho Jaemin, Lei Jie, Tan Hao, Bansal Mohit
- Ensemble Fine-tuned Mbert For Translation Quality Estimation Chowdhury Shaika, Baili Naouel, Vannah Brian
- Enjoy The Salience: Towards Better Transformer-based Faithful Explanations With Word Salience Chrysostomou George, Aletras Nikolaos
- Improving The Faithfulness Of Attention-based Explanations With Task-specific Information For Text Classification Chrysostomou George, Aletras Nikolaos
- All That's 'human' Is Not Gold: Evaluating Human Evaluation Of Generated Text Clark Elizabeth, August Tal, Serrano Sofia, Haduong Nikita, Gururangan Suchin, Smith Noah A.
- CANINE: Pre-training An Efficient Tokenization-free Encoder For Language Representation Clark Jonathan H., Garrette Dan, Turc Iulia, Wieting John
- Long-range Modeling Of Source Code Files With Ewash: Extended Window Access By Syntax Hierarchy Clement Colin B., Lu Shuai, Liu Xiaoyu, Tufano Michele, Drain Dawn, Duan Nan, Sundaresan Neel, Svyatkovskiy Alexey
- Distilling Transformers For Neural Cross-domain Search Clement Colin B., Wu Chen, Drain Dawn, Sundaresan Neel
- Control Prefixes For Parameter-efficient Text Generation Clive Jordan, Cao Kris, Rei Marek
- Training Verifiers To Solve Math Word Problems Cobbe Karl, Kosaraju Vineet, Bavarian Mohammad, Chen Mark, Jun Heewoo, Kaiser Lukasz, Plappert Matthias, Tworek Jerry, Hilton Jacob, Nakano Reiichiro, Hesse Christopher, Schulman John
- Multilingual Multi-aspect Explainability Analyses On Machine Reading Comprehension Models Cui Yiming, Zhang Wei-nan, Che Wanxiang, Liu Ting, Chen Zhigang, Wang Shijin
- YANMTT: Yet Another Neural Machine Translation Toolkit Dabre Raj, Sumita Eiichiro
- Knowledge Neurons In Pretrained Transformers Dai Damai, Dong Li, Hao Yaru, Sui Zhifang, Chang Baobao, Wei Furu
- Mention Memory: Incorporating Textual Knowledge Into Transformers Through Entity Mention Attention De Jong Michiel, Zemlyanskiy Yury, Fitzgerald Nicholas, Sha Fei, Cohen William
- Adapting Monolingual Models: Data Can Be Scarce When Language Similarity Is High De Vries Wietse, Bartelds Martijn, Nissim Malvina, Wieling Martijn
- Multitask Finetuning For Improving Neural Machine Translation In Indian Languages Desai Shaily, Kshirsagar Atharva, Marathe Manisha
- Diagnosing Transformers In Task-oriented Semantic Parsing Desai Shrey, Aly Ahmed
- EKTVQA: Generalized Use Of External Knowledge To Empower Scene Text In Text-vqa Dey Arka Ujjal, Valveny Ernest, Harit Gaurav
- Cogview: Mastering Text-to-image Generation Via Transformers Ding Ming, Yang Zhuoyi, Hong Wenyi, Zheng Wendi, Zhou Chang, Yin Da, Lin Junyang, Zou Xu, Shao Zhou, Yang Hongxia, Tang Jie
- Not Quite 'ask A Librarian': AI On The Nature, Value, And Future Of LIS Dinneen Jesse David, Bubinger Helen
- A Primer On Pretrained Multilingual Language Models Doddapaneni Sumanth, Ramesh Gowtham, Khapra Mitesh M., Kunchukuttan Anoop, Kumar Pratyush
- Diverse Pretrained Context Encodings Improve Document Translation Donato Domenic, Yu Lei, Dyer Chris
- On-the-fly Attention Modulation For Neural Generation Dong Yue, Bhagavatula Chandra, Lu Ximing, Hwang Jena D., Bosselut Antoine, Cheung Jackie Chi Kit, Choi Yejin
- Attention Forcing For Machine Translation Dou Qingyun, Lu Yiting, Manakul Potsawee, Wu Xixin, Gales Mark J. F.
- Is GPT-3 Text Indistinguishable From Human Text? Scarecrow: A Framework For Scrutinizing Machine Text Dou Yao, Forbes Maxwell, Koncel-kedziorski Rik, Smith Noah A., Choi Yejin
- An Empirical Study Of Training End-to-end Vision-and-language Transformers Dou Zi-yi, Xu Yichong, Gan Zhe, Wang Jianfeng, Wang Shuohang, Wang Lijuan, Zhu Chenguang, Zhang Pengchuan, Yuan Lu, Peng Nanyun, Liu Zicheng, Zeng Michael
- Robustness Challenges In Model Distillation And Pruning For Natural Language Understanding Du Mengnan, Mukherjee Subhabrata, Cheng Yu, Shokouhi Milad, Hu Xia, Awadallah Ahmed Hassan
- GLM: General Language Model Pretraining With Autoregressive Blank Infilling Du Zhengxiao, Qian Yujie, Liu Xiao, Ding Ming, Qiu Jiezhong, Yang Zhilin, Tang Jie
- Generative Context Pair Selection For Multi-hop Question Answering Dua Dheeru, Santos Cicero Nogueira Dos, Ng Patrick, Athiwaratkun Ben, Xiang Bing, Gardner Matt, Singh Sameer
- Lora: Low-rank Adaptation Of Large Language Models Edward J. Hu, Yelong Shen, Phillip Wallis, Zeyuan Allen-zhu, Yuanzhi Li, Shean Wang, Lu Wang, Weizhu Chen
- Robustly Optimized And Distilled Training For Natural Language Understanding Elfadeel Haytham, Peshterliev Stan
- Latent Alignment Of Procedural Concepts In Multimodal Recipes Faghihi Hossein Rajaby, Mirzaee Roshanak, Paliwal Sudarshan, Kordjamshidi Parisa
- Transformer-based Conditional Variational Autoencoder For Controllable Story Generation Fang Le, Zeng Tao, Liu Chaochun, Bo Liefeng, Dong Wen, Chen Changyou
- Leveraging Knowledge In Multilingual Commonsense Reasoning Fang Yuwei, Wang Shuohang, Xu Yichong, Xu Ruochen, Sun Siqi, Zhu Chenguang, Zeng Michael
- Compressing Visual-linguistic Model Via Knowledge Distillation Fang Zhiyuan, Wang Jianfeng, Hu Xiaowei, Wang Lijuan, Yang Yezhou, Liu Zicheng
- Retrieve, Caption, Generate: Visual Grounding For Enhancing Commonsense In Text Generation Models Feng Steven Y., Lu Kevin, Tao Zhuofu, Alikhani Malihe, Mitamura Teruko, Hovy Eduard, Gangal Varun
- Attention Weights In Transformer NMT Fail Aligning Words Between Sequences But Largely Explain Model Predictions Ferrando Javier, Costa-jussà Marta R.
- Open-domain Conversational Search Assistant With Transformers Ferreira Rafael, Leite Mariana, Semedo David, Magalhaes Joao
- Bertaú: Itaú BERT For Digital Customer Service Finardi Paulo, Viegas José Dié, Ferreira Gustavo T., Mansano Alex F., Caridá Vinicius F.
- Vision-and-language Or Vision-for-language? On Cross-modal Influence In Multimodal Transformers Frank Stella, Bugliarello Emanuele, Elliott Desmond
- Interactively Providing Explanations For Transformer Language Models Friedrich Felix, Schramowski Patrick, Tauchmann Christopher, Kersting Kristian
- Grounded Graph Decoding Improves Compositional Generalization In Question Answering Gai Yu, Jain Paras, Zhang Wendi, Gonzalez Joseph E., Song Dawn, Stoica Ion
- Exbert: An External Knowledge Enhanced BERT For Natural Language Inference Gajbhiye Amit, Moubayed Noura Al, Bradley Steven
- Does Dialog Length Matter For Next Response Selection Task? An Empirical Study Ganhotra Jatin, Joshi Sachindra
- Chop Chop BERT: Visual Question Answering By Chopping Visualbert's Heads Gao Chenyu, Zhu Qi, Wang Peng, Wu Qi
- Condenser: A Pre-training Architecture For Dense Retrieval Gao Luyu, Callan Jamie
- Scalable Transformers For Neural Machine Translation Gao Peng, Geng Shijie, Qiao Yu, Wang Xiaogang, Dai Jifeng, Li Hongsheng
- An Adversarially-learned Turing Test For Dialog Generation Models Gao Xiang, Zhang Yizhe, Galley Michel, Dolan Bill
- Improving And Diagnosing Knowledge-based Visual Question Answering Via Entity Enhanced Knowledge Injection Garcia-olano Diego, Onoe Yasumasa, Ghosh Joydeep
- JABER And SABER: Junior And Senior Arabic Bert Ghaddar Abbas, Wu Yimeng, Rashid Ahmad, Bibi Khalil, Rezagholizadeh Mehdi, Xing Chao, Wang Yasheng, Duan Xinyu, Wang Zhefeng, Huai Baoxing, Jiang Xin, Liu Qun, Langlais Philippe
- Cross-attention Is All You Need: Adapting Pretrained Transformers For Machine Translation Gheini Mozhdeh, Ren Xiang, May Jonathan
- Towards Trustworthy Deception Detection: Benchmarking Model Robustness Across Domains, Modalities, And Languages Glenski Maria, Ayton Ellyn, Cosbey Robin, Arendt Dustin, Volkova Svitlana
- LAWDR: Language-agnostic Weighted Document Representations From Pre-trained Models Gong Hongyu, Chaudhary Vishrav, Tang Yuqing, Guzmán Francisco
- Larger-scale Transformers For Multilingual Masked Language Modeling Goyal Naman, Du Jingfei, Ott Myle, Anantharaman Giri, Conneau Alexis
- Xeroalign: Zero-shot Cross-lingual Transformer Alignment Gritta Milan, Iacobacci Ignacio
- Transaction: ICL-SJTU Submission To Epic-kitchens Action Anticipation Challenge 2021 Gu Xiao, Qiu Jianing, Guo Yao, Lo Benny, Yang Guang-zhong
- Block-skim: Efficient Question Answering For Transformer Guan Yue, Li Zhengyi, Leng Jingwen, Lin Zhouhan, Guo Minyi, Zhu Yuhao
- KAT: A Knowledge Augmented Transformer For Vision-and-language Gui Liangke, Wang Borui, Huang Qiuyuan, Hauptmann Alex, Bisk Yonatan, Gao Jianfeng
- Longt5: Efficient Text-to-text Transformer For Long Sequences Guo Mandy, Ainslie Joshua, Uthus David, Ontanon Santiago, Ni Jianmo, Sung Yun-hsuan, Yang Yinfei
- LICHEE: Improving Language Model Pre-training With Multi-grained Tokenization Guo Weidong, Zhao Mingjun, Zhang Lusheng, Niu Di, Luo Jinwen, Liu Zhenhua, Li Zhenyang, Tang Jianbo
- Memory-efficient Transformers Via Top-k Attention Gupta Ankit, Dar Guy, Goodman Shaya, Ciprut David, Berant Jonathan
- A Million Tweets Are Worth A Few Points: Tuning Transformers For Customer Service Tasks Hadifar Amir, Labat Sofie, Hoste Véronique, Develder Chris, Demeester Thomas
- WARP: Word-level Adversarial Reprogramming Hambardzumyan Karen, Khachatrian Hrant, May Jonathan
- Greedy Gradient Ensemble For Robust Visual Question Answering Han Xinzhe, Wang Shuhui, Su Chi, Huang Qingming, Tian Qi
- Text2app: A Framework For Creating Android Apps From Text Descriptions Hasan Masum, Mehrab Kazi Sajeed, Ahmad Wasi Uddin, Shahriyar Rifat
- Zero-shot Controlled Generation With Encoder-decoder Transformers Hazarika Devamanyu, Namazifar Mahdi, Hakkani-tür Dilek
- ELIT: Emory Language And Information Toolkit He Han, Xu Liyan, Choi Jinho D.
- Generate, Annotate, And Learn: NLP With Synthetic Text He Xuanli, Nassar Islam, Kiros Jamie, Haffari Gholamreza, Norouzi Mohammad
- Open Domain Question Answering Over Tables Via Dense Retrieval Herzig Jonathan, Müller Thomas, Krichene Syrine, Eisenschlos Julian Martin
- Multilingual Language Models Predict Human Reading Behavior Hollenstein Nora, Pirovano Federico, Zhang Ce, Jäger Lena, Beinborn Lisa
- ENCONTER: Entity Constrained Progressive Sequence Generation Via Insertion-based Transformer Hsieh Lee-hsun, Lee Yang-yin, Lim Ee-peng
- The Niutrans System For WNGT 2020 Efficiency Task Hu Chi, Li Bei, Lin Ye, Li Yinqiao, Li Yanyang, Wang Chenglong, Xiao Tong, Zhu Jingbo
- Lora: Low-rank Adaptation Of Large Language Models Hu Edward J., Shen Yelong, Wallis Phillip, Allen-zhu Zeyuan, Li Yuanzhi, Wang Shean, Wang Lu, Chen Weizhu
- Investigating Transfer Learning In Multilingual Pre-trained Language Models Through Chinese Natural Language Inference Hu Hai, Zhou He, Tian Zuoyu, Zhang Yiwen, Ma Yina, Li Yanting, Nie Yixin, Richardson Kyle
- Non-autoregressive Translation With Layer-wise Prediction And Deep Supervision Huang Chenyang, Zhou Hao, Zaïane Osmar R., Mou Lili, Li Lei
- Transfer Learning For Sequence Generation: From Single-source To Multi-source Huang Xuancheng, Xu Jingfang, Sun Maosong, Liu Yang
- Unifying Multimodal Transformer For Bi-directional Image And Text Generation Huang Yupan, Xue Hongwei, Liu Bei, Lu Yutong
- Audio-oriented Multimodal Machine Comprehension: Task, Dataset And Model Huang Zhiqi, Liu Fenglin, Wu Xian, Ge Shen, Wang Helin, Fan Wei, Zou Yuexian
- SILT: Efficient Transformer Training For Inter-lingual Inference Huertas-tato Javier, Martín Alejandro, Camacho David
- Target-dependent UNITER: A Transformer-based Multimodal Language Comprehension Model For Domestic Service Robots Ishikawa Shintaro, Sugiura Komei
- Scaling Language Models: Methods, Analysis & Insights From Training Gopher Jack W. Rae, Sebastian Borgeaud, Trevor Cai, Katie Millican, Jordan Hoffmann, Francis Song, John Aslanides, Sarah Henderson, Roman Ring, Susannah Young, Eliza Rutherford, Tom Hennigan, Jacob Menick, Albin Cassirer, Richard Powell, George Van Den Driessche, Lisa Anne Hendricks, Maribeth Rauh, Po-sen Huang, Amelia Glaese, Johannes Welbl, Sumanth Dathathri, Saffron Huang, Jonathan Uesato, John Mellor, Irina Higgins, Antonia Creswell, Nat Mcaleese, Amy Wu, Erich Elsen, Siddhant Jayakumar, Elena Buchatskaya, David Budden, Esme Sutherland, Karen Simonyan, Michela Paganini, Laurent Sifre, Lena Martens, Xiang Lorraine Li, Adhiguna Kuncoro, Aida Nematzadeh, Elena Gribovskaya, Domenic Donato, Angeliki Lazaridou, Arthur Mensch, Jean-baptiste Lespiau, Maria Tsimpoukelli, Nikolai Grigorev, Doug Fritz, Thibault Sottiaux, Mantas Pajarskas, Toby Pohlen, Zhitao Gong, Daniel Toyama, Cyprien De Masson D'autume, Yujia Li, Tayfun Terzi, Vladimir Mikulik, Igor Babuschkin, Aidan Clark, Diego De Las Casas, Aurelia Guy, Chris Jones, James Bradbury, Matthew Johnson, Blake Hechtman, Laura Weidinger, Iason Gabriel, William Isaac, Ed Lockhart, Simon Osindero, Laura Rimell, Chris Dyer, Oriol Vinyals, Kareem Ayoub, Jeff Stanway, Lorrayne Bennett, Demis Hassabis, Koray Kavukcuoglu, Geoffrey Irving
- Finetuned Language Models Are Zero-shot Learners Jason Wei, Maarten Bosma, Vincent Y. Zhao, Kelvin Guu, Adams Wei Yu, Brian Lester, Nan Du, Andrew M. Dai, Quoc V. Le
- Visqa: X-raying Vision And Language Reasoning In Transformers Jaunet Theo, Kervadec Corentin, Vuillemot Romain, Antipov Grigory, Baccouche Moez, Wolf Christian
- Evaluating Pretrained Transformer Models For Entity Linking In Task-oriented Dialog Jayanthi Sai Muralidhar, Embar Varsha, Raghunathan Karthik
- Retraining Distilbert For A Voice Shopping Assistant By Using Universal Dependencies Jayarao Pratik, Sharma Arpit
- A Neural Conversation Generation Model Via Equivalent Shared Memory Investigation Ji Changzhen, Zhang Yating, Liu Xiaozhong, Jatowt Adam, Sun Changlong, Zhu Conghui, Zhao Tiejun
- Discodvt: Generating Long Text With Discourse-aware Discrete Variational Transformer Ji Haozhe, Huang Minlie
- Improving Non-autoregressive Generation With Mixup Training Jiang Ting, Huang Shaohan, Zhang Zihan, Wang Deqing, Zhuang Fuzhen, Wei Furu, Huang Haizhen, Zhang Liangjie, Zhang Qi
- Inducing Transformer's Compositional Generalization Ability Via Auxiliary Sequence Prediction Tasks Jiang Yichen, Bansal Mohit
- Enriching Transformers With Structured Tensor-product Representations For Abstractive Summarization Jiang Yichen, Celikyilmaz Asli, Smolensky Paul, Soulos Paul, Rao Sudha, Palangi Hamid, Fernandez Roland, Smith Caitlin, Bansal Mohit, Gao Jianfeng
- XRJL-HKUST At Semeval-2021 Task 4: Wordnet-enhanced Dual Multi-head Co-attention For Reading Comprehension Of Abstract Meaning Jiang Yuxin, Shou Ziyi, Wang Qijun, Wu Hao, Lin Fangzhen
- REPT: Bridging Language Models And Machine Reading Comprehension Via Retrieval-based Pre-training Jiao Fangkai, Guo Yangyang, Niu Yilin, Ji Feng, Li Feng-lin, Nie Liqiang
- Lightmbert: A Simple Yet Effective Method For Multilingual BERT Distillation Jiao Xiaoqi, Yin Yichun, Shang Lifeng, Jiang Xin, Chen Xiao, Li Linlin, Wang Fang, Liu Qun
- A Good Prompt Is Worth Millions Of Parameters: Low-resource Prompt-based Learning For Vision-language Models Jin Woojeong, Cheng Yu, Shen Yelong, Chen Weizhu, Ren Xiang
- Numgpt: Improving Numeracy Ability Of Generative Pre-trained Models Jin Zhihua, Jiang Xin, Wang Xingbo, Liu Qun, Wang Yong, Ren Xiaozhe, Qu Huamin
- Attention Is Indeed All You Need: Semantically Attention-guided Decoding For Data-to-text NLG Juraska Juraj, Walker Marilyn
- Rethinking Self-supervision Objectives For Generalizable Coherence Modeling Jwalapuram Prathyusha, Joty Shafiq, Lin Xiang
- Towards Incremental Transformers: An Empirical Analysis Of Transformer Models For Incremental NLU Kahardipraja Patrick, Madureira Brielen, Schlangen David
- Distilling Large Language Models Into Tiny And Effective Students Using Pqrnn Kaliamoorthi Prabhu, Siddhant Aditya, Li Edward, Johnson Melvin
- AMMUS: A Survey Of Transformer-based Pretrained Models In Natural Language Processing Kalyan Katikapalli Subramanyam, Rajasekharan Ajit, Sangeetha Sivanesan
- Tiny Neural Models For Seq2seq Kandoor Arun
- Amazon Sagemaker Model Parallelism: A General And Flexible Framework For Large Model Training Karakus Can, Huilgol Rahul, Wu Fei, Subramanian Anirudh, Daniel Cade, Cavdar Derya, Xu Teng, Chen Haohan, Rahnama Arash, Quintela Luis
- What Do Pre-trained Code Models Know About Code? Karmakar Anjan, Robbes Romain
- Finetuning Pretrained Transformers Into Rnns Kasai Jungo, Peng Hao, Zhang Yizhe, Yogatama Dani, Ilharco Gabriel, Pappas Nikolaos, Mao Yi, Chen Weizhu, Smith Noah A.
- Uquad1.0: Development Of An Urdu Question Answering Training Data For Machine Reading Comprehension Kazi Samreen, Khoja Shakeel
- Generating Rich Product Descriptions For Conversational E-commerce Systems Kedia Shashank, Mantha Aditya, Gupta Sneha, Guo Stephen, Achan Kannan
- How Transferable Are Reasoning Patterns In VQA? Kervadec Corentin, Jaunet Theo, Antipov Grigory, Baccouche Moez, Vuillemot Romain, Wolf Christian
- Text-free Prosody-aware Generative Spoken Language Modeling Kharitonov Eugene, Lee Ann, Polyak Adam, Adi Yossi, Copet Jade, Lakhotia Kushal, Nguyen Tu-anh, Rivière Morgane, Mohamed Abdelrahman, Dupoux Emmanuel, Hsu Wei-ning
- Visual Question Answering Based On Local-scene-aware Referring Expression Generation Kim Jung-jun, Lee Dong-gyu, Wu Jialin, Jung Hong-gyu, Lee Seong-whan
- Learned Token Pruning For Transformers Kim Sehoon, Shen Sheng, Thorsley David, Gholami Amir, Kwon Woosuk, Hassoun Joseph, Keutzer Kurt
- L-verse: Bidirectional Generation Between Image And Text Kim Taehoon, Song Gwangmo, Lee Sihaeng, Kim Sangyun, Seo Yewon, Lee Soonyoung, Kim Seung Hwan, Lee Honglak, Bae Kyunghoon
- What's In A Measurement? Using GPT-3 On Semeval 2021 Task 8 -- Measeval Kohler Curt, Daniel Ron Jr
- Controlling Conditional Language Models Without Catastrophic Forgetting Korbak Tomasz, Elsahar Hady, Kruszewski German, Dymetman Marc
- BERT Busters: Outlier Dimensions That Disrupt Transformers Kovaleva Olga, Kulshreshtha Saurabh, Rogers Anna, Rumshisky Anna
- Hurdles To Progress In Long-form Question Answering Krishna Kalpesh, Roy Aurko, Iyyer Mohit
- What BERT Based Language Models Learn In Spoken Transcripts: An Empirical Study Kumar Ayush, Sundararaman Mukuntha Narayanan, Vepa Jithendra
- On The Significance Of Question Encoder Sequence Model In The Out-of-distribution Performance In Visual Question Answering Kv Gouthaman, Mittal Anurag
- Can Transformer Models Measure Coherence In Text? Re-thinking The Shuffle Test Laban Philippe, Dai Luke, Bandarkar Lucas, Hearst Marti A.
- Draw Me A Flower: Processing And Grounding Abstraction In Natural Language Lachmy Royi, Pyatkin Valentina, Manevich Avshalom, Tsarfaty Reut
- Layered Gradient Accumulation And Modular Pipeline Parallelism: Fast And Efficient Training Of Large Language Models Lamy-poirier Joel
- Pagnol: An Extra-large French Generative Model Launay Julien, Tommasone Elena, Pannier Baptiste, Boniface François, Chatelain Amélie, Cappelli Alessandro, Poli Iacopo, Seddah Djamé
- Can Transformer Language Models Predict Psychometric Properties? Laverghetta Antonio Jr., Nighojkar Animesh, Mirzakhalov Jamshidbek, Licato John
- You Only Need One Model For Open-domain Question Answering Lee Haejun, Kedia Akhil, Lee Jongwon, Paranjape Ashwin, Manning Christopher D., Woo Kyoung-gu
- Towards Few-shot Fact-checking Via Perplexity Lee Nayeon, Bang Yejin, Madotto Andrea, Khabsa Madian, Fung Pascale
- When Attention Meets Fast Recurrence: Training Language Models With Reduced Compute Lei Tao
- Quiz-style Question Generation For News Stories Lelkes Adam D., Tran Vinh Q., Yu Cong
- Exploiting Sentence-level Representations For Passage Ranking Leonhardt Jurek, Beringer Fabian, Anand Avishek
- Semvlp: Vision-language Pre-training By Aligning Semantics At Multiple Levels Li Chenliang, Yan Ming, Xu Haiyang, Luo Fuli, Wang Wei, Bi Bin, Huang Songfang
- The Stability-efficiency Dilemma: Investigating Sequence Length Warmup For Training GPT Models Li Conglong, Zhang Minjia, He Yuxiong
- Residual Tree Aggregation Of Layers For Neural Machine Translation Li Guoliang, Li Yiyang
- Align Before Fuse: Vision And Language Representation Learning With Momentum Distillation Li Junnan, Selvaraju Ramprasaath R., Gotmare Akhilesh Deepak, Joty Shafiq, Xiong Caiming, Hoi Steven
- Personalized Transformer For Explainable Recommendation Li Lei, Zhang Yongfeng, Chen Li
- Ensemble ALBERT On Squad 2.0 Li Shilun, Li Renee, Peng Veronica
- A Short Study On Compressing Decoder-based Language Models Li Tianda, Mesbahi Yassir El, Kobyzev Ivan, Rashid Ahmad, Mahmud Atif, Anchuri Nithin, Hajimolahoseini Habib, Liu Yang, Rezagholizadeh Mehdi
- Question-aware Memory Network For Multi-hop Question Answering In Human-robot Interaction Li Xinmeng, Alazab Mamoun, Li Qian, Yu Keping, Yin Quanjun
- Supervision Exists Everywhere: A Data Efficient Contrastive Language-image Pre-training Paradigm Li Yangguang, Liang Feng, Zhao Lichen, Cui Yufeng, Ouyang Wanli, Shao Jing, Yu Fengwei, Yan Junjie
- Scheduled Sampling In Vision-language Pretraining With Decoupled Encoder-decoder Network Li Yehao, Pan Yingwei, Yao Ting, Chen Jingwen, Mei Tao
- Terapipe: Token-level Pipeline Parallelism For Training Large-scale Language Models Li Zhuohan, Zhuang Siyuan, Guo Shiyuan, Zhuo Danyang, Zhang Hao, Song Dawn, Stoica Ion
- Dialoging Resonance: How Users Perceive, Reciprocate And React To Chatbot's Self-disclosure In Conversational Recommendations Liang Kai-hui, Shi Weiyan, Oh Yoojung, Wang Hao-chuan, Zhang Jingwen, Yu Zhou
- Lyra: A Benchmark For Turducken-style Code Generation Liang Qingyuan, Sun Zeyu, Zhu Qihao, Zhang Wenjie, Yu Lian, Xiong Yingfei, Zhang Lu
- Learning Neural Templates For Recommender Dialogue System Liang Zujie, Hu Huang, Xu Can, Miao Jian, He Yingying, Chen Yining, Geng Xiubo, Liang Fan, Jiang Daxin
- Multimodal Transformer With Variable-length Memory For Vision-and-language Navigation Lin Chuang, Jiang Yi, Cai Jianfei, Qu Lizhen, Haffari Gholamreza, Yuan Zehuan
- Domain-independent User Simulation With Transformers For Task-oriented Dialogue Systems Lin Hsien-chin, Lubis Nurul, Hu Songbo, Van Niekerk Carel, Geishauser Christian, Heck Michael, Feng Shutong, Gašić Milica
- MDQE: A More Accurate Direct Pretraining For Machine Translation Quality Estimation Lin Lei
- Truthfulqa: Measuring How Models Mimic Human Falsehoods Lin Stephanie, Hilton Jacob, Evans Owain
- Few-shot Learning With Multilingual Language Models Lin Xi Victoria, Mihaylov Todor, Artetxe Mikel, Wang Tianlu, Chen Shuohui, Simig Daniel, Ott Myle, Goyal Naman, Bhosale Shruti, Du Jingfei, Pasunuru Ramakanth, Shleifer Sam, Koura Punit Singh, Chaudhary Vishrav, O'horo Brian, Wang Jeff, Zettlemoyer Luke, Kozareva Zornitsa, Diab Mona, Stoyanov Veselin, Li Xian
- VX2TEXT: End-to-end Learning Of Video-based Text Generation From Multimodal Inputs Lin Xudong, Bertasius Gedas, Wang Jue, Chang Shih-fu, Parikh Devi, Torresani Lorenzo
- Dexperts: Decoding-time Controlled Text Generation With Experts And Anti-experts Liu Alisa, Sap Maarten, Lu Ximing, Swayamdipta Swabha, Bhagavatula Chandra, Smith Noah A., Choi Yejin
- Codeqa: A Question Answering Dataset For Source Code Comprehension Liu Chenxiao, Wan Xiaojun
- Variational Latent-state GPT For Semi-supervised Task-oriented Dialog Systems Liu Hong, Cai Yucheng, Lin Zhenru, Ou Zhijian, Huang Yi, Feng Junlan
- Towards Automated Psychotherapy Via Language Modeling Liu Houjun
- What Makes Good In-context Examples For GPT-3? Liu Jiachang, Shen Dinghan, Zhang Yizhe, Dolan Bill, Carin Lawrence, Chen Weizhu
- A Three-stage Learning Framework For Low-resource Knowledge-grounded Dialogue Generation Liu Shilei, Zhao Xiaofeng, Li Bochao, Ren Feiliang, Zhang Longhui, Yin Shujuan
- Unified Multimodal Pre-training And Prompt-based Tuning For Vision-language Understanding And Generation Liu Tianyi, Wu Zuxuan, Xiong Wenhan, Chen Jingjing, Jiang Yu-gang
- A Token-level Reference-free Hallucination Detection Benchmark For Free-form Text Generation Liu Tianyu, Zhang Yizhe, Brockett Chris, Mao Yi, Sui Zhifang, Chen Weizhu, Dolan Bill
- GPT Understands, Too Liu Xiao, Zheng Yanan, Du Zhengxiao, Ding Ming, Qian Yujie, Yang Zhilin, Tang Jie
- Enriching Non-autoregressive Transformer With Syntactic And Semantic Structures For Neural Machine Translation Liu Ye, Wan Yao, Zhang Jian-guo, Zhao Wenting, Yu Philip S.
- Confidence-aware Scheduled Sampling For Neural Machine Translation Liu Yijin, Meng Fandong, Chen Yufeng, Xu Jinan, Zhou Jie
- Scheduled Sampling Based On Decoding Steps For Neural Machine Translation Liu Yijin, Meng Fandong, Chen Yufeng, Xu Jinan, Zhou Jie
- Augmenting Sequential Recommendation With Pseudo-prior Items Via Reversely Pre-training Transformer Liu Zhiwei, Fan Ziwei, Wang Yu, Yu Philip S.
- Learning To Ask Conversational Questions By Optimizing Levenshtein Distance Liu Zhongkun, Ren Pengjie, Chen Zhumin, Ren Zhaochun, De Rijke Maarten, Zhou Ming
- Pretrained Transformers As Universal Computation Engines Lu Kevin, Grover Aditya, Abbeel Pieter, Mordatch Igor
- Iconqa: A New Benchmark For Abstract Diagram Understanding And Visual Language Reasoning Lu Pan, Qiu Liang, Chen Jiaqi, Xia Tony, Zhao Yizhou, Zhang Wei, Yu Zhou, Liang Xiaodan, Zhu Song-chun
- Fantastically Ordered Prompts And Where To Find Them: Overcoming Few-shot Prompt Order Sensitivity Lu Yao, Bartolo Max, Moore Alastair, Riedel Sebastian, Stenetorp Pontus
- X-METRA-ADA: Cross-lingual Meta-transfer Learning Adaptation To Natural Language Understanding And Question Answering M'hamdi Meryem, Kim Doo Soon, Dernoncourt Franck, Bui Trung, Ren Xiang, May Jonathan
- Direct Simultaneous Speech-to-speech Translation With Variational Monotonic Multihead Attention Ma Xutai, Gong Hongyu, Liu Danni, Lee Ann, Tang Yun, Chen Peng-jen, Hsu Wei-ning, Koehn Phillip, Pino Juan
- Coherence Boosting: When Your Pretrained Language Model Is Not Paying Enough Attention Malkin Nikolay, Wang Zhen, Jojic Nebojsa
- Local And Global Context-based Pairwise Models For Sentence Ordering Manku Ruskin Raj, Paul Aditya Jyoti
- Towards Retrieval-based Conversational Recommendation Manzoor Ahtsham, Jannach Dietmar
- Lightweight Cross-lingual Sentence Representation Learning Mao Zhuoyuan, Gupta Prakhar, Wang Pei, Chu Chenhui, Jaggi Martin, Kurohashi Sadao
- Few-shot Self-rationalization With Natural Language Prompts Marasović Ana, Beltagy Iz, Downey Doug, Peters Matthew E.
- Evaluating Large Language Models Trained On Code Mark Chen, Jerry Tworek, Heewoo Jun, Qiming Yuan, Henrique Ponde De Oliveira Pinto, Jared Kaplan, Harri Edwards, Yuri Burda, Nicholas Joseph, Greg Brockman, Alex Ray, Raul Puri, Gretchen Krueger, Michael Petrov, Heidy Khlaaf, Girish Sastry, Pamela Mishkin, Brooke Chan, Scott Gray, Nick Ryder, Mikhail Pavlov, Alethea Power, Lukasz Kaiser, Mohammad Bavarian, Clemens Winter, Philippe Tillet, Felipe Petroski Such, Dave Cummings, Matthias Plappert, Fotios Chantzis, Elizabeth Barnes, Ariel Herbert-voss, William Hebgen Guss, Alex Nichol, Alex Paino, Nikolas Tezak, Jie Tang, Igor Babuschkin, Suchir Balaji, Shantanu Jain, William Saunders, Christopher Hesse, Andrew N. Carr, Jan Leike, Josh Achiam, Vedant Misra, Evan Morikawa, Alec Radford, Matthew Knight, Miles Brundage, Mira Murati, Katie Mayer, Peter Welinder, Bob Mcgrew, Dario Amodei, Sam Mccandlish, Ilya Sutskever, Wojciech Zaremba
- ∞-former: Infinite Memory Transformer Martins Pedro Henrique, Marinho Zita, Martins André F. T.
- Transformer Based Bengali Chatbot Using General Knowledge Dataset Masum Abu Kaisar Mohammad, Abujar Sheikh, Akter Sharmin, Ria Nushrat Jahan, Hossain Syed Akhter
- Lattegan: Visually Guided Language Attention For Multi-turn Text-conditioned Image Manipulation Matsumori Shoya, Abe Yuki, Shingyouchi Kosuke, Sugiura Komei, Imai Michita
- Unified Questioner Transformer For Descriptive Question Generation In Goal-oriented Visual Dialogue Matsumori Shoya, Shingyouchi Kosuke, Abe Yuki, Fukuchi Yosuke, Sugiura Komei, Imai Michita
- How Much Do Language Models Copy From Their Training Data? Evaluating Linguistic Novelty In Text Generation Using RAVEN Mccoy R. Thomas, Smolensky Paul, Linzen Tal, Gao Jianfeng, Celikyilmaz Asli
- Schema-guided Paradigm For Zero-shot Dialog Mehri Shikib, Eskenazi Maxine
- Self-training Improves Pre-training For Few-shot Learning In Task-oriented Dialog Systems Mi Fei, Zhou Wanhao, Cai Fengyu, Kong Lingjing, Huang Minlie, Faltings Boi
- Prevent The Language Model From Being Overconfident In Neural Machine Translation Miao Mengqi, Meng Fandong, Liu Yijin, Zhou Xiao-hua, Zhou Jie
- Structural Analysis Of An All-purpose Question Answering Model Micheli Vincent, Heinrich Quentin, Fleuret François, Belblidia Wacim
- Recent Advances In Natural Language Processing Via Large Pre-trained Language Models: A Survey Min Bonan, Ross Hayley, Sulem Elior, Veyseh Amir Pouran Ben, Nguyen Thien Huu, Sainz Oscar, Agirre Eneko, Heinz Ilana, Roth Dan
- Distantly Supervised Transformers For E-commerce Product QA Mittal Happy, Chakrabarti Aniket, Bayar Belhassen, Sharma Animesh Anant, Rasiwasia Nikhil
- BERTGEN: Multi-task Generation Through BERT Mitzalis Faidon, Caglayan Ozan, Madhyastha Pranava, Specia Lucia
- Multi-modal Understanding And Generation For Medical Images And Text Via Vision-language Pre-training Moon Jong Hak, Lee Hyungyung, Shin Woncheol, Kim Young-hak, Choi Edward
- GPT-3 Models Are Poor Few-shot Learners In The Biomedical Domain Moradi Milad, Blagec Kathrin, Haberl Florian, Samwald Matthias
- Software-based Dialogue Systems: Survey, Taxonomy And Challenges Motger Quim, Franch Xavier, Marco Jordi
- First Align, Then Predict: Understanding The Cross-lingual Ability Of Multilingual BERT Muller Benjamin, Elazar Yanai, Sagot Benoît, Seddah Djamé
- Arat5: Text-to-text Transformers For Arabic Language Generation Nagoudi El Moatez Billah, Elmadany Abdelrahim, Abdul-mageed Muhammad
- Glam: Efficient Scaling Of Language Models With Mixture-of-experts Nan Du, Yanping Huang, Andrew M. Dai, Simon Tong, Dmitry Lepikhin, Yuanzhong Xu, Maxim Krikun, Yanqi Zhou, Adams Wei Yu, Orhan Firat, Barret Zoph, Liam Fedus, Maarten Bosma, Zongwei Zhou, Tao Wang, Yu Emma Wang, Kellie Webster, Marie Pellat, Kevin Robinson, Kathleen Meier-hellstern, Toju Duke, Lucas Dixon, Kun Zhang, Quoc V Le, Yonghui Wu, Zhifeng Chen, Claire Cui
- Planning With Learned Entity Prompts For Abstractive Summarization Narayan Shashi, Zhao Yao, Maynez Joshua, Simoes Gonçalo, Nikolaev Vitaly, Mcdonald Ryan
- Hierarchical Transformers Are More Efficient Language Models Nawrot Piotr, Tworkowski Szymon, Tyrolski Michał, Kaiser Łukasz, Wu Yuhuai, Szegedy Christian, Michalewski Henryk
- P-adapters: Robustly Extracting Factual Information From Language Models With Diverse Prompts Newman Benjamin, Choubey Prafulla Kumar, Rajani Nazneen
- Improving Transformers With Probabilistic Attention Keys Nguyen Tam, Nguyen Tan M., Le Dung D., Nguyen Duy Khuong, Tran Viet-anh, Baraniuk Richard G., Ho Nhat, Osher Stanley J.
- Materialized Knowledge Bases From Commonsense Transformers Nguyen Tuan-phong, Razniewski Simon
- Sentence-t5: Scalable Sentence Encoders From Pre-trained Text-to-text Models Ni Jianmo, Ábrego Gustavo Hernández, Constant Noah, Ma Ji, Hall Keith B., Cer Daniel, Yang Yinfei
- MLP Architectures For Vision-and-language Modeling: An Empirical Study Nie Yixin, Li Linjie, Gan Zhe, Wang Shuohang, Zhu Chenguang, Zeng Michael, Liu Zicheng, Bansal Mohit, Wang Lijuan
- Investigating The Limitations Of Transformers With Simple Arithmetic Tasks Nogueira Rodrigo, Jiang Zhiying, Lin Jimmy
- Progressive Transformer-based Generation Of Radiology Reports Nooralahzadeh Farhad, Gonzalez Nicolas Perez, Frauenfelder Thomas, Fujimoto Koji, Krauthammer Michael
- Code Generation From Natural Language With Less Prior And More Monolingual Data Norouzi Sajad, Tang Keyi, Cao Yanshuai
- Show Your Work: Scratchpads For Intermediate Computation With Language Models Nye Maxwell, Andreassen Anders Johan, Gur-ari Guy, Michalewski Henryk, Austin Jacob, Bieber David, Dohan David, Lewkowycz Aitor, Bosma Maarten, Luan David, Sutton Charles, Odena Augustus
- What Context Features Can Transformer Language Models Use? O'connor Joe, Andreas Jacob
- Investigating Numeracy Learning Ability Of A Text-to-text Transfer Model Pal Kuntal Kumar, Baral Chitta
- Probing For Bridging Inference In Transformer Language Models Pandit Onkar, Hou Yufang
- Cascading Adaptors To Leverage English Data To Improve Performance Of Question Answering For Low-resource Languages Pandya Hariom A., Ardeshna Bhavik, Bhatt Brijesh S.
- PENELOPIE: Enabling Open Information Extraction For The Greek Language Through Machine Translation Papadopoulos Dimitris, Papadakis Nikolaos, Matsatsinis Nikolaos
- Do Encoder Representations Of Generative Dialogue Models Encode Sufficient Information About The Task? Parthasarathi Prasanna, Pineau Joelle, Chandar Sarath
- Episodic Transformer For Vision-and-language Navigation Pashevich Alexander, Schmid Cordelia, Sun Chen
- A Comparative Study Of Transformer-based Language Models On Extractive Question Answering Pearce Kate, Zhan Tiffany, Komanduri Aneesh, Zhan Justin
- ABC: Attention With Bounded-memory Control Peng Hao, Kasai Jungo, Pappas Nikolaos, Yogatama Dani, Wu Zhaofeng, Kong Lingpeng, Schwartz Roy, Smith Noah A.
- The Effect Of Model Size On Worst-group Generalization Pham Alan, Chan Eunice, Srivatsa Vikranth, Ghosh Dhruba, Yang Yaoqing, Yu Yaodong, Zhong Ruiqi, Gonzalez Joseph E., Steinhardt Jacob
- Cotext: Multi-task Learning With Code-text Transformer Phan Long, Tran Hieu, Le Daniel, Nguyen Hieu, Anibal James, Peltekian Alec, Ye Yanfang
- Context Transformer With Stacked Pointer Networks For Conversational Question Answering Over Knowledge Graphs Plepi Joan, Kacupaj Endri, Singh Kuldeep, Thakkar Harsh, Lehmann Jens
- Does Pre-training Induce Systematic Inference? How Masked Language Models Acquire Commonsense Knowledge Porada Ian, Sordoni Alessandro, Cheung Jackie Chi Kit
- Modeling Event Plausibility With Consistent Conceptual Abstraction Porada Ian, Suleman Kaheer, Trischler Adam, Cheung Jackie Chi Kit
- Focused Attention Improves Document-grounded Generation Prabhumoye Shrimai, Hashimoto Kazuma, Zhou Yingbo, Black Alan W, Salakhutdinov Ruslan
- ORCHARD: A Benchmark For Measuring Systematic Generalization Of Multi-hierarchical Reasoning Pung Bill Tuck Weng, Chan Alvin
- Attentive Fine-tuning Of Transformers For Translation Of Low-resourced Languages @loresmt 2021 Puranik Karthik, Hande Adeep, Priyadharshini Ruba, Durairaj Thenmozhi, Sampath Anbukkarasi, Thamburaj Kingston Pal, Chakravarthi Bharathi Raja
- Robertuito: A Pre-trained Language Model For Social Media Text In Spanish Pérez Juan Manuel, Furman Damián A., Alemany Laura Alonso, Luque Franco
- Assessing The Syntactic Capabilities Of Transformer-based Multilingual Language Models Pérez-mayos Laura, García Alba Táboas, Mille Simon, Wanner Leo
- Structural Guidance For Transformer Language Models Qian Peng, Naseem Tahira, Levy Roger, Astudillo Ramón Fernandez
- TIMEDIAL: Temporal Commonsense Reasoning In Dialog Qin Lianhui, Gupta Aditya, Upadhyay Shyam, He Luheng, Choi Yejin, Faruqui Manaal
- VT-CLIP: Enhancing Vision-language Models With Visual-guided Texts Qiu Longtian, Zhang Renrui, Guo Ziyu, Zeng Ziyao, Guo Zilu, Li Yafeng, Zhang Guangnan
- Vinmt: Neural Machine Translation Toolkit Quan Nguyen Hoang, Dat Nguyen Thanh, Cong Nguyen Hoang Minh, Van Vinh Nguyen, Vinh Ngo Thi, Thai Nguyen Phuong, Viet Tran Hong
- Grid Search Hyperparameter Benchmarking Of BERT, ALBERT, And Longformer On Duorc Quijano Alex John, Nguyen Sam, Ordonez Juanita
- Teaching Autoregressive Language Models Complex Tasks By Demonstration Recchia Gabriel
- Webgpt: Browser-assisted Question-answering With Human Feedback Reiichiro Nakano, Jacob Hilton, Suchir Balaji, Jeff Wu, Long Ouyang, Christina Kim, Christopher Hesse, Shantanu Jain, Vineet Kosaraju, William Saunders, Xu Jiang, Karl Cobbe, Tyna Eloundou, Gretchen Krueger, Kevin Button, Matthew Knight, Benjamin Chess, John Schulman
- Multiversal Views On Language Models Reynolds Laria, Mcdonell Kyle
- Prompt Programming For Large Language Models: Beyond The Few-shot Paradigm Reynolds Laria, Mcdonell Kyle
- Revisiting Context Choices For Context-aware Machine Translation Rikters Matīss, Nakazawa Toshiaki
- Don't Sweep Your Learning Rate Under The Rug: A Closer Look At Cross-modal Transfer Of Pretrained Transformers Rothermel Danielle, Li Margaret, Rocktäschel Tim, Foerster Jakob
- Grounding Natural Language Instructions: Can Large Language Models Capture Spatial Information? Rozanova Julia, Ferreira Deborah, Dubba Krishna, Cheng Weiwei, Zhang Dell, Freitas Andre
- Leveraging The Inductive Bias Of Large Language Models For Abstract Textual Reasoning Rytting Christopher Michael, Wingate David
- Efficient Domain Adaptation Of Language Models Via Adaptive Tokenization Sachidananda Vin, Kessler Jason S., Lai Yi-an
- Challenges In Procedural Multimodal Machine Comprehension: A Novel Way To Benchmark Sahu Pritish, Sikka Karan, Divakaran Ajay
- Towards Solving Multimodal Comprehension Sahu Pritish, Sikka Karan, Divakaran Ajay
- Rome Was Built In 1776: A Case Study On Factual Correctness In Knowledge-grounded Response Generation Santhanam Sashank, Hedayatnia Behnam, Gella Spandana, Padmakumar Aishwarya, Kim Seokhwan, Liu Yang, Hakkani-tur Dilek
- Representation Learning For Conversational Data Using Discourse Mutual Information Maximization Santra Bishal, Roychowdhury Sumegh, Mandal Aishik, Gurram Vasu, Naik Atharva, Gupta Manish, Goyal Pawan
- Towards Neural Functional Program Evaluation Scholak Torsten, Pilault Jonathan, Velez-ginorio Joey
- Large Pre-trained Language Models Contain Human-like Biases Of What Is Right And Wrong To Do Schramowski Patrick, Turan Cigdem, Andersen Nico, Rothkopf Constantin A., Kersting Kristian
- The Power Of Prompt Tuning For Low-resource Semantic Parsing Schucher Nathan, Reddy Siva, De Vries Harm
- Audio-visual Scene-aware Dialog And Reasoning Using Audio-visual Transformers With Joint Student-teacher Learning Shah Ankit P., Geng Shijie, Gao Peng, Cherian Anoop, Hori Takaaki, Marks Tim K., Roux Jonathan Le, Hori Chiori
- Laviter: Learning Aligned Visual And Textual Representations Assisted By Image And Caption Generation Shaikh Mohammad Abuzar, Ji Zhanghexuan, Moukheiber Dana, Shen Yan, Srihari Sargur, Gao Mingchen
- CPT: A Pre-trained Unbalanced Transformer For Both Chinese Language Understanding And Generation Shao Yunfan, Geng Zhichao, Liu Yitao, Dai Junqi, Yan Hang, Yang Fei, Zhe Li, Bao Hujun, Qiu Xipeng
- Better Neural Machine Translation By Extracting Linguistic Information From BERT Shavarani Hassan S., Sarkar Anoop
- FPM: A Collection Of Large-scale Foundation Pre-trained Language Models Shen Dezhou
- Societal Biases In Language Generation: Progress And Challenges Sheng Emily, Chang Kai-wei, Natarajan Premkumar, Peng Nanyun
- Contextual Biasing Of Language Models For Speech Recognition In Goal-oriented Conversational Agents Shenoy Ashish, Bodapati Sravan, Kirchhoff Katrin
- Adapting Long Context NLM For ASR Rescoring In Conversational Agents Shenoy Ashish, Bodapati Sravan, Sunkara Monica, Ronanki Srikanth, Kirchhoff Katrin
- Layer-wise Pruning Of Transformer Attention Heads For Efficient Language Modeling Shim Kyuhong, Choi Iksoo, Sung Wonyong, Choi Jungwook
- Scaling Law For Recommendation Models: Towards General-purpose User Representations Shin Kyuyong, Kwak Hanock, Kim Su Young, Ramstrom Max Nihlen, Jeong Jisu, Ha Jung-woo, Kim Kyung-min
- Few-shot Semantic Parsing With Language Models Trained On Code Shin Richard, Van Durme Benjamin
- Exploring Low-cost Transformer Model Compression For Large-scale Commercial Reply Suggestions Shrivastava Vaishnavi, Gaonkar Radhika, Gupta Shashank, Jha Abhishek
- ERNIE 3.0 Titan: Exploring Larger-scale Knowledge Enhanced Pre-training For Language Understanding And Generation Shuohuan Wang, Yu Sun, Yang Xiang, Zhihua Wu, Siyu Ding, Weibao Gong, Shikun Feng, Junyuan Shang, Yanbin Zhao, Chao Pang, Jiaxiang Liu, Xuyi Chen, Yuxiang Lu, Weixin Liu, Xi Wang, Yangfan Bai, Qiuliang Chen, Li Zhao, Shiyong Li, Peng Sun, Dianhai Yu, Yanjun Ma, Hao Tian, Hua Wu, Tian Wu, Wei Zeng, Ge Li, Wen Gao, Haifeng Wang
- Structural Persistence In Language Models: Priming As A Window Into Abstract Language Representations Sinclair Arabella, Jumelet Jaap, Zuidema Willem, Fernández Raquel
- Pre-trained Language Models As Prior Knowledge For Playing Text-based Games Singh Ishika, Singh Gargi, Modi Ashutosh
- Semantics-aware Attention Improves Neural Machine Translation Slobodkin Aviv, Choshen Leshem, Abend Omri
- Primer: Searching For Efficient Transformers For Language Modeling So David R., Mańke Wojciech, Liu Hanxiao, Dai Zihang, Shazeer Noam, Le Quoc V.
- Bob: BERT Over BERT For Training Persona-based Dialogue Models From Limited Personalized Data Song Haoyu, Wang Yan, Zhang Kaiyan, Zhang Wei-nan, Liu Ting
- Multimodal Integration Of Human-like Attention In Visual Question Answering Sood Ekta, Kögel Fabian, Müller Philipp, Thomas Dominike, Bace Mihai, Bulling Andreas
- VQA-MHUG: A Gaze Dataset To Study Multimodal Neural Attention In Visual Question Answering Sood Ekta, Kögel Fabian, Strohm Florian, Dhar Prajit, Bulling Andreas
- Worst Of Both Worlds: Biases Compound In Pre-trained Vision-and-language Models Srinivasan Tejas, Bisk Yonatan
- Txt: Crossmodal End-to-end Learning With Transformers Steitz Jan-martin O., Pfeiffer Jonas, Gurevych Iryna, Roth Stefan
- Corelm: Coreference-aware Language Model Fine-tuning Stylianou Nikolaos, Vlahavas Ioannis
- Ernie-tiny: A Progressive Distillation Framework For Pretrained Transformer Compression Su Weiyue, Chen Xuyi, Feng Shikun, Liu Jiaxiang, Liu Weixin, Sun Yu, Tian Hao, Wu Hua, Wang Haifeng
- Non-autoregressive Text Generation With Pre-trained Language Models Su Yixuan, Cai Deng, Wang Yan, Vandyke David, Baker Simon, Li Piji, Collier Nigel
- Tacl: Improving BERT Pre-training With Token-aware Contrastive Learning Su Yixuan, Liu Fangyu, Meng Zaiqiao, Lan Tian, Shu Lei, Shareghi Ehsan, Collier Nigel
- NVIDIA Nemo Neural Machine Translation Systems For English-german And English-russian News And Biomedical Tasks At WMT21 Subramanian Sandeep, Hrinchuk Oleksii, Adams Virginia, Kuchaiev Oleksii
- Embodied BERT: A Transformer Model For Embodied, Language-guided Visual Task Completion Suglia Alessandro, Gao Qiaozi, Thomason Jesse, Thattai Govind, Sukhatme Gaurav
- Not All Memories Are Created Equal: Learning To Forget By Expiring Sukhbaatar Sainbayar, Ju Da, Poff Spencer, Roller Stephen, Szlam Arthur, Weston Jason, Fan Angela
- A Non-hierarchical Attention Network With Modality Dropout For Textual Response Generation In Multimodal Dialogue Systems Sun Rongyi, Chen Borun, Zhou Qingyu, Li Yinghui, Cao Yunbo, Zheng Hai-tao
- Do Long-range Language Models Actually Use Long-range Context? Sun Simeng, Krishna Kalpesh, Mattarella-micke Andrew, Iyyer Mohit
- Lightningdot: Pre-training Visual-semantic Embeddings For Real-time Image-text Retrieval Sun Siqi, Chen Yen-chun, Li Linjie, Wang Shuohang, Fang Yuwei, Liu Jingjing
- Know Deeper: Knowledge-conversation Cyclic Utilization Mechanism For Open-domain Dialogue Generation Sun Yajing, Hu Yue, Xing Luxi, Xie Yuqiang, Wei Xiangpeng
- NSP-BERT: A Prompt-based Few-shot Learner Through An Original Pre-training Task--next Sentence Prediction Sun Yi, Zheng Yu, Hao Chao, Qiu Hangping
- ERNIE 3.0: Large-scale Knowledge Enhanced Pre-training For Language Understanding And Generation Sun Yu, Wang Shuohuan, Feng Shikun, Ding Siyu, Pang Chao, Shang Junyuan, Liu Jiaxiang, Chen Xuyi, Zhao Yanbin, Lu Yuxiang, Liu Weixin, Wu Zhihua, Gong Weibao, Liang Jianzhong, Shang Zhizhou, Sun Peng, Liu Wei, Ouyang Xuan, Yu Dianhai, Tian Hao, Wu Hua, Wang Haifeng
- Multilingual Translation Via Grafting Pre-trained Language Models Sun Zewei, Wang Mingxuan, Li Lei
- Data Processing Matters: Srph-konvergen Ai's Machine Translation System For WMT'21 Sutawika Lintang, Cruz Jan Christian Blaise
- A Three Step Training Approach With Data Augmentation For Morphological Inflection Szolnok Gabor, Barta Botond, Lakatos Dorina, Acs Judit
- General-purpose Question-answering With Macaw Tafjord Oyvind, Clark Peter
- Understanding The Capabilities, Limitations, And Societal Impact Of Large Language Models Tamkin Alex, Brundage Miles, Clark Jack, Ganguli Deep
- Trusting Roberta Over BERT: Insights From Checklisting The Natural Language Inference Task Tarunesh Ishan, Aditya Somak, Choudhury Monojit
- Diff-explainer: Differentiable Convex Optimization For Explainable Multi-hop Inference Thayaparan Mokanarangan, Valentino Marco, Ferreira Deborah, Rozanova Julia, Freitas André
- Towards Reinforcement Learning For Pivot-based Neural Machine Translation With Non-autoregressive Transformer Tokarchuk Evgeniia, Rosendahl Jan, Wang Weiyue, Petrushkov Pavel, Lancewicki Tomer, Khadivi Shahram, Ney Hermann
- Exploring Transformers In Natural Language Generation: GPT, BERT, And Xlnet Topal M. Onat, Bas Anil, Van Heerden Imke
- Learning Better Visual Dialog Agents With Pretrained Visual-linguistic Representation Tu Tao, Ping Qing, Thattai Govind, Tur Gokhan, Natarajan Prem
- Local Explanation Of Dialogue Response Generation Tuan Yi-lin, Pryor Connor, Chen Wenhu, Getoor Lise, Wang William Yang
- Transferring Semantic Knowledge Into Language Encoders Umair Mohammad, Ferraro Francis
- Cheap And Good? Simple And Effective Data Augmentation For Low Resource Machine Reading Van Hoang, Yadav Vikas, Surdeanu Mihai
- Comparing Test Sets With Item Response Theory Vania Clara, Htut Phu Mon, Huang William, Mungra Dhara, Pang Richard Yuanzhe, Phang Jason, Liu Haokun, Cho Kyunghyun, Bowman Samuel R.
- Transformers For Headline Selection For Russian News Clusters Voropaev Pavel, Sopilnyak Olga
- Convfit: Conversational Fine-tuning Of Pretrained Language Models Vulić Ivan, Su Pei-hao, Coope Sam, Gerz Daniela, Budzianowski Paweł, Casanueva Iñigo, Mrkšić Nikola, Wen Tsung-hsien
- Bertnesia: Investigating The Capture And Forgetting Of Knowledge In BERT Wallat Jonas, Singh Jaspreet, Anand Avishek
- Selective Knowledge Distillation For Neural Machine Translation Wang Fusheng, Yan Jianhao, Meng Fandong, Zhou Jie
- Topicrefine: Joint Topic Prediction And Dialogue Response Generation For Multi-turn End-to-end Dialogue System Wang Hongru, Cui Mingyu, Zhou Zimo, Fung Gabriel Pui Cheong, Wong Kam-fai
- Contextualized Scene Imagination For Generative Commonsense Reasoning Wang Peifeng, Zamora Jonathan, Liu Junfeng, Ilievski Filip, Chen Muhao, Ren Xiang
- Entailment As Few-shot Learner Wang Sinong, Fang Han, Khabsa Madian, Mao Hanzi, Ma Hao
- Sentence Semantic Regression For Text Generation Wang Wei, Li Piji, Zheng Hai-tao
- Lightseq2: Accelerated Training For Transformer-based Models On Gpus Wang Xiaohui, Wei Yang, Xiong Ying, Huang Guyue, Qian Xian, Ding Yufei, Wang Mingxuan, Li Lei
- List: Lite Prompted Self-training Makes Parameter-efficient Few-shot Learners Wang Yaqing, Mukherjee Subhabrata, Liu Xiaodong, Gao Jing, Awadallah Ahmed Hassan, Gao Jianfeng
- Codet5: Identifier-aware Unified Pre-trained Encoder-decoder Models For Code Understanding And Generation Wang Yue, Wang Weishi, Joty Shafiq, Hoi Steven C. H.
- Distilled Dual-encoder Model For Vision-language Understanding Wang Zekun, Wang Wenhui, Zhu Haichao, Liu Ming, Qin Bing, Wei Furu
- Dodrio: Exploring Transformer Models With Interactive Visualization Wang Zijie J., Turko Robert, Chau Duen Horng
- Visual Question Rewriting For Increasing Response Rate Wei Jiayi, Li Xilian, Zhang Yi, Wang Xin
- Training Multilingual Pre-trained Language Model With Byte-level Subwords Wei Junqiu, Liu Qun, Guo Yinpeng, Jiang Xin
- Pangu-α: Large-scale Autoregressive Pretrained Chinese Language Models With Auto-parallel Computation Wei Zeng, Xiaozhe Ren, Teng Su, Hui Wang, Yi Liao, Zhiwei Wang, Xin Jiang, Zhenzhang Yang, Kaisheng Wang, Xiaoda Zhang, Chen Li, Ziyan Gong, Yifan Yao, Xinjing Huang, Jun Wang, Jianfeng Yu, Qi Guo, Yue Yu, Yan Zhang, Jin Wang, Hengtao Tao, Dasen Yan, Zexuan Yi, Fang Peng, Fangqing Jiang, Han Zhang, Lingfeng Deng, Yehong Zhang, Zhe Lin, Chao Zhang, Shaojie Zhang, Mingyue Guo, Shanzhi Gu, Gaojun Fan, Yaowei Wang, Xuefeng Jin, Qun Liu, Yonghong Tian
- VAULT: Variable Unified Long Text Representation For Machine Reading Comprehension Wen Haoyang, Ferritto Anthony, Ji Heng, Florian Radu, Sil Avirup
- Reframing Human-ai Collaboration For Generating Free-text Explanations Wiegreffe Sarah, Hessel Jack, Swayamdipta Swabha, Riedl Mark, Choi Yejin
- Memory And Knowledge Augmented Language Models For Inferring Salience In Long-form Stories Wilmot David, Keller Frank
- Language Models Are Few-shot Multilingual Learners Winata Genta Indra, Madotto Andrea, Lin Zhaojiang, Liu Rosanne, Yosinski Jason, Fung Pascale
- Graph-free Multi-hop Reading Comprehension: A Select-to-guide Strategy Wu Bohong, Zhang Zhuosheng, Zhao Hai
- Fastformer: Additive Attention Can Be All You Need Wu Chuhan, Wu Fangzhao, Qi Tao, Huang Yongfeng, Xie Xing
- A Multilingual Modeling Method For Span-extraction Reading Comprehension Wu Gaochen, Xu Bin, Chang Dejie, Liu Bangchang
- Improving Low-resource Reading Comprehension Via Cross-lingual Transposition Rethinking Wu Gaochen, Xu Bin, Qin Yuxin, Kong Fei, Liu Bangchang, Zhao Hongwen, Chang Dejie
- Domain-adaptive Pretraining Methods For Dialogue Understanding Wu Han, Xu Kun, Song Linfeng, Jin Lifeng, Zhang Haisong, Song Linqi
- Not All Attention Is All You Need Wu Hongqiu, Zhao Hai, Zhang Min
- Yuan 1.0: Large-scale Pre-trained Language Model In Zero-shot And Few-shot Learning Wu Shaohua, Zhao Xudong, Yu Tong, Zhang Rongguo, Shen Chong, Liu Hongli, Li Feng, Zhu Hong, Luo Jiangang, Xu Liang, Zhang Xuanwei
- On Explaining Your Explanations Of BERT: An Empirical Study With Sequence Classification Wu Zhengxuan, Ong Desmond C.
- Prefix-tuning: Optimizing Continuous Prompts For Generation Xiang Lisa Li, Percy Liang
- Training Large-scale News Recommenders With Pretrained Language Models In The Loop Xiao Shitao, Liu Zheng, Shao Yingxia, Di Tao, Xie Xing
- An Explanation Of In-context Learning As Implicit Bayesian Inference Xie Sang Michael, Raghunathan Aditi, Liang Percy, Ma Tengyu
- Exploring Story Generation With Multi-task Objectives In Variational Autoencoders Xie Zhuohan, Cohn Trevor, Lau Jey Han
- Simple Local Attentions Remain Competitive For Long-context Tasks Xiong Wenhan, Oğuz Barlas, Gupta Anchit, Chen Xilun, Liskovich Diana, Levy Omer, Yih Wen-tau, Mehdad Yashar
- Moca: Incorporating Multi-stage Domain Pretraining And Cross-guided Multimodal Attention For Textbook Question Answering Xu Fangzhi, Lin Qika, Liu Jun, Zhang Lingling, Zhao Tianzhe, Chai Qi, Pan Yudai
- E2E-VLP: End-to-end Vision-language Pre-training Enhanced By Visual Learning Xu Haiyang, Yan Ming, Li Chenliang, Bi Bin, Huang Songfang, Xiao Wenming, Huang Fei
- Attention-guided Generative Models For Extractive Question Answering Xu Peng, Liang Davis, Huang Zhiheng, Xiang Bing
- From Dense To Sparse: Contrastive Pruning For Better Pre-trained Language Model Compression Xu Runxin, Luo Fuli, Wang Chengyu, Chang Baobao, Huang Jun, Huang Songfang, Huang Fei
- K-PLUG: Knowledge-injected Pre-trained Language Model For Natural Language Understanding And Generation In E-commerce Xu Song, Li Haoran, Yuan Peng, Wang Yujia, Wu Youzheng, He Xiaodong, Liu Ying, Zhou Bowen
- Byt5: Towards A Token-free Future With Pre-trained Byte-to-byte Models Xue Linting, Barua Aditya, Constant Noah, Al-rfou Rami, Narang Sharan, Kale Mihir, Roberts Adam, Raffel Colin
- Fastseq: Make Sequence Generation Faster Yan Yu, Hu Fei, Chen Jiusheng, Bhendawade Nikhil, Ye Ting, Gong Yeyun, Duan Nan, Cui Desheng, Chi Bingyu, Zhang Ruofei
- Improving Conversational Recommendation Systems' Quality With Context-aware Item Meta Information Yang Bowen, Han Cong, Li Yu, Zuo Lei, Yu Zhou
- Transformer-based Korean Pretrained Language Models: A Survey On Three Years Of Progress Yang Kichang
- NT5?! Training T5 To Perform Numerical Reasoning Yang Peng-jian, Chen Ying Ting, Chen Yuechan, Cer Daniel
- Causal Attention For Vision-language Tasks Yang Xu, Zhang Hanwang, Qi Guojun, Cai Jianfei
- An Empirical Study Of GPT-3 For Few-shot Knowledge-based VQA Yang Zhengyuan, Gan Zhe, Wang Jianfeng, Hu Xiaowei, Lu Yumao, Liu Zicheng, Wang Lijuan
- TR-BERT: Dynamic Token Reduction For Accelerating BERT Inference Ye Deming, Lin Yankai, Huang Yufei, Sun Maosong
- Learning To Generate Task-specific Adapters From Task Description Ye Qinyuan, Ren Xiang
- Gpt3mix: Leveraging Large-scale Language Models For Text Augmentation Yoo Kang Min, Park Dongju, Kang Jaewook, Lee Sang-woo, Park Woomyeong
- Reconsidering The Past: Optimizing Hidden States In Language Models Yoshida Davis, Gimpel Kevin
- On The Interplay Between Fine-tuning And Composition In Transformers Yu Lang, Ettinger Allyson
- Leveraging Linguistic Coordination In Reranking N-best Candidates For End-to-end Response Selection Using BERT Yu Mingzhi, Litman Diane
- LV-BERT: Exploiting Layer Variety For BERT Yu Weihao, Jiang Zihang, Chen Fei, Hou Qibin, Feng Jiashi
- Language Bias In Visual Question Answering: A Survey And Taxonomy Yuan Desen
- Prune Once For All: Sparse Pre-trained Language Models Zafrir Ofir, Larey Ariel, Boudoukh Guy, Shen Haihao, Wasserblat Moshe
- Bert-coqac: Bert-based Conversational Question Answering In Context Zaib Munazza, Tran Dai Hoang, Sagar Subhash, Mahmood Adnan, Zhang Wei E., Sheng Quan Z.
- Bitfit: Simple Parameter-efficient Fine-tuning For Transformer-based Masked Language-models Zaken Elad Ben, Ravfogel Shauli, Goldberg Yoav
- Topical Language Generation Using Transformers Zandie Rohola, Mahoor Mohammad H.
- Readtwice: Reading Very Large Documents With Memories Zemlyanskiy Yury, Ainslie Joshua, De Jong Michiel, Pham Philip, Eckstein Ilya, Sha Fei
- Wechat Neural Machine Translation Systems For WMT21 Zeng Xianfeng, Liu Yijin, Li Ernan, Ran Qiu, Meng Fandong, Li Peng, Xu Jinan, Zhou Jie
- Improving Stack Overflow Question Title Generation With Copying Enhanced Codebert Model And Bi-modal Information Zhang Fengji, Yu Xiao, Keung Jacky, Li Fuyang, Xie Zhiwen, Yang Zhen, Ma Caoyuan, Zhang Zhimin
- Ernie-vilg: Unified Generative Pre-training For Bidirectional Vision-language Generation Zhang Han, Yin Weichong, Fang Yewei, Li Lanxin, Duan Boqiang, Wu Zhihua, Sun Yu, Tian Hao, Wu Hua, Wang Haifeng
- A Knowledge-grounded Dialog System Based On Pre-trained Language Models Zhang Weijie, Chen Jiaoxuan, Wu Haipang, Wan Sanhui, Li Gongfeng
- DSGPT: Domain-specific Generative Pre-training Of Transformers For Text Generation In E-commerce Title And Review Summarization Zhang Xueying, Jiang Yunjiang, Shang Yue, Cheng Zhaomeng, Zhang Chi, Fan Xiaochuan, Xiao Yun, Long Bo
- Retgen: A Joint Framework For Retrieval And Grounded Text Generation Modeling Zhang Yizhe, Sun Siqi, Gao Xiang, Fang Yuwei, Brockett Chris, Galley Michel, Gao Jianfeng, Dolan Bill
- An Exploratory Study On Long Dialogue Summarization: What Works And What's Next Zhang Yusen, Ni Ansong, Yu Tao, Zhang Rui, Zhu Chenguang, Deb Budhaditya, Celikyilmaz Asli, Awadallah Ahmed Hassan, Radev Dragomir
- Towards More Efficient Insertion Transformer With Fractional Positional Encoding Zhang Zhisong, Zhang Yizhe, Dolan Bill
- Ror: Read-over-read For Long Document Machine Reading Comprehension Zhao Jing, Bao Junwei, Wang Yifan, Zhou Yongwei, Wu Youzheng, He Xiaodong, Zhou Bowen
- Calibrate Before Use: Improving Few-shot Performance Of Language Models Zhao Tony Z., Wallace Eric, Feng Shi, Klein Dan, Singh Sameer
- Fewnlu: Benchmarking State-of-the-art Methods For Few-shot Natural Language Understanding Zheng Yanan, Zhou Jing, Qian Yujie, Ding Ming, Liao Chonghua, Li Jian, Salakhutdinov Ruslan, Tang Jie, Ruder Sebastian, Yang Zhilin
- Mmchat: Multi-modal Chat Dataset On Social Media Zheng Yinhe, Chen Guanyi, Liu Xin, Sun Jian
- Adapting Language Models For Zero-shot Learning By Meta-tuning On Dataset And Prompt Collections Zhong Ruiqi, Lee Kristy, Zhang Zheng, Klein Dan
- SILG: The Multi-environment Symbolic Interactive Language Grounding Benchmark Zhong Victor, Hanjie Austin W., Wang Sida I., Narasimhan Karthik, Zettlemoyer Luke
- Self-guided Curriculum Learning For Neural Machine Translation Zhou Lei, Ding Liang, Duh Kevin, Watanabe Shinji, Sasano Ryohei, Takeda Koichi
- Thinking Clearly, Talking Fast: Concept-guided Non-autoregressive Generation For Open-domain Dialogue Systems Zou Yicheng, Liu Zhihua, Hu Xingwu, Zhang Qi
- Taming Sparsely Activated Transformer With Stochastic Experts Zuo Simiao, Liu Xiaodong, Jiao Jian, Kim Young Jin, Hassan Hany, Zhang Ruofei, Zhao Tuo, Gao Jianfeng
- Palm: Scaling Language Modeling With Pathways Aakanksha Chowdhery, Sharan Narang, Jacob Devlin, Maarten Bosma, Gaurav Mishra, Adam Roberts, Paul Barham, Hyung Won Chung, Charles Sutton, Sebastian Gehrmann, Parker Schuh, Kensen Shi, Sasha Tsvyashchenko, Joshua Maynez, Abhishek Rao, Parker Barnes, Yi Tay, Noam Shazeer, Vinodkumar Prabhakaran, Emily Reif, Nan Du, Ben Hutchinson, Reiner Pope, James Bradbury, Jacob Austin, Michael Isard, Guy Gur-ari, Pengcheng Yin, Toju Duke, Anselm Levskaya, Sanjay Ghemawat, Sunipa Dev, Henryk Michalewski, Xavier Garcia, Vedant Misra, Kevin Robinson, Liam Fedus, Denny Zhou, Daphne Ippolito, David Luan, Hyeontaek Lim, Barret Zoph, Alexander Spiridonov, Ryan Sepassi, David Dohan, Shivani Agrawal, Mark Omernick, Andrew M. Dai, Thanumalayan Sankaranarayana Pillai, Marie Pellat, Aitor Lewkowycz, Erica Moreira, Rewon Child, Oleksandr Polozov, Katherine Lee, Zongwei Zhou, Xuezhi Wang, Brennan Saeta, Mark Diaz, Orhan Firat, Michele Catasta, Jason Wei, Kathy Meier-hellstern, Douglas Eck, Jeff Dean, Slav Petrov, Noah Fiedel
- Gpt-3-driven Pedagogical Agents For Training Children's Curious Question-asking Skills Abdelghani Rania, Wang Yen-hsiang, Yuan Xingdi, Wang Tong, Lucas Pauline, Sauzéon Hélène, Oudeyer Pierre-yves
- Global Memory Transformer For Processing Long Documents Adel Arij Al
- Vector Representations Of Idioms In Conversational Systems Adewumi Tosin, Liwicki Foteini, Liwicki Marcus
- Vl-interpret: An Interactive Visualization Tool For Interpreting Vision-language Transformers Aflalo Estelle, Du Meng, Tseng Shao-yen, Liu Yongfei, Wu Chenfei, Duan Nan, Lal Vasudev
- Using Large Language Models To Simulate Multiple Humans And Replicate Human Subject Studies Aher Gati, Arriaga Rosa I., Kalai Adam Tauman
- Few-shot Training Llms For Project-specific Code-summarization Ahmed Toufique, Devanbu Premkumar
- On The Calibration Of Massively Multilingual Language Models Ahuja Kabir, Sitaram Sunayana, Dandapat Sandipan, Choudhury Monojit
- Multimodal Sequential Generative Models For Semi-supervised Language Instruction Following Akuzawa Kei, Iwasawa Yusuke, Matsuo Yutaka
- Compound Tokens: Channel Fusion For Vision-language Representation Learning Aladago Maxwell Mbabilla, Piergiovanni Aj
- How Does The Pre-training Objective Affect What Large Language Models Learn About Linguistic Properties? Alajrami Ahmed, Aletras Nikolaos
- End-to-end Multimodal Representation Learning For Video Dialog Alamri Huda, Bilic Anthony, Hu Michael, Beedu Apoorva, Essa Irfan
- Prompting As Probing: Using Language Models For Knowledge Base Construction Alivanistos Dimitrios, Santamaría Selene Báez, Cochez Michael, Kalo Jan-christoph, Van Krieken Emile, Thanapalasingam Thiviyan
- Text And Patterns: For Effective Chain Of Thought, It Takes Two To Tango Aman Madaan, Amir Yazdanbakhsh
- Language Models Of Code Are Few-shot Commonsense Learners Aman Madaan, Shuyan Zhou, Uri Alon, Yiming Yang, Graham Neubig
- A Disability Lens Towards Biases In GPT-3 Generated Open-ended Languages Amin Akhter Al, Kabir Kazi Sinthia
- Input-tuning: Adapting Unfamiliar Inputs To Frozen Pretrained Models An Shengnan, Li Yifei, Lin Zeqi, Liu Qian, Chen Bei, Fu Qiang, Chen Weizhu, Zheng Nanning, Lou Jian-guang
- Tempo: Accelerating Transformer-based Model Training Through Memory Footprint Reduction Andoorveedu Muralidhar, Zhu Zhanda, Zheng Bojian, Pekhimenko Gennady
- A Comparative Study On Language Models For Task-oriented Dialogue Systems Andreas Vinsen Marselino, Winata Genta Indra, Purwarianti Ayu
- Exploring Length Generalization In Large Language Models Anil Cem, Wu Yuhuai, Andreassen Anders, Lewkowycz Aitor, Misra Vedant, Ramasesh Vinay, Slone Ambrose, Gur-ari Guy, Dyer Ethan, Neyshabur Behnam
- GLM-130B: An Open Bilingual Pre-trained Model Aohan Zeng, Xiao Liu, Zhengxiao Du, Zihan Wang, Hanyu Lai, Ming Ding, Zhuoyi Yang, Yifan Xu, Wendi Zheng, Xiao Xia, Weng Lam Tam, Zixuan Ma, Yufei Xue, Jidong Zhai, Wenguang Chen, Peng Zhang, Yuxiao Dong, Jie Tang
- Characterizing Verbatim Short-term Memory In Neural Language Models Armeni Kristijan, Honey Christopher, Linzen Tal
- ATTEMPT: Parameter-efficient Multi-task Tuning Via Attentional Mixtures Of Soft Prompts Asai Akari, Salehi Mohammadreza, Peters Matthew E., Hajishirzi Hannaneh
- Small Batch Sizes Improve Training Of Low-resource Neural MT Atrio Àlex R., Popescu-belis Andrei
- Parameter-efficient Finetuning Of Transformers For Source Code Ayupov Shamil, Chirkova Nadezhda
- Improving Short Text Classification With Augmented Data Using GPT-3 Balkus Salvador, Yan Donghui
- Rethinking The Role Of Scale For In-context Learning: An Interpretability-based Case Study At 66 Billion Scale Bansal Hritik, Gopalakrishnan Karthik, Dingliwal Saket, Bodapati Sravan, Kirchhoff Katrin, Roth Dan
- Vl-beit: Generative Vision-language Pretraining Bao Hangbo, Wang Wenhui, Dong Li, Wei Furu
- P³LM: Probabilistically Permuted Prophet Language Modeling For Generative Pre-training Bao Junwei, Wang Yifan, Ying Jiangyong, Gong Yeyun, Zhao Jing, Wu Youzheng, He Xiaodong
- Latent-glat: Glancing At Latent Variables For Parallel Text Generation Bao Yu, Zhou Hao, Huang Shujian, Wang Dongqi, Qian Lihua, Dai Xinyu, Chen Jiajun, Li Lei
- Grad-sam: Explaining Transformers Via Gradient Self-attention Maps Barkan Oren, Hauon Edan, Caciularu Avi, Katz Ori, Malkiel Itzik, Armstrong Omri, Koenigstein Noam
- Evidence > Intuition: Transferability Estimation For Encoder Selection Bassignana Elisa, Müller-eberstein Max, Zhang Mike, Plank Barbara
- Leveraging Pre-trained Language Models For Conversational Information Seeking From Text Bellan Patrizio, Dragoni Mauro, Ghidini Chiara
- Bygpt5: End-to-end Style-conditioned Poetry Generation With Token-free Language Models Belouadi Jonas, Eger Steffen
- Logical Reasoning For Task Oriented Dialogue Systems Beygi Sajjad, Fazel-zarandi Maryam, Cervone Alessandra, Krishnan Prakash, Jonnalagadda Siddhartha Reddy
- I2D2: Inductive Knowledge Distillation With Neurologic And Self-imitation Bhagavatula Chandra, Hwang Jena D., Downey Doug, Bras Ronan Le, Lu Ximing, Qin Lianhui, Sakaguchi Keisuke, Swayamdipta Swabha, West Peter, Choi Yejin
- Banglanlg And Banglat5: Benchmarks And Resources For Evaluating Low-resource Natural Language Generation In Bangla Bhattacharjee Abhik, Hasan Tahmid, Ahmad Wasi Uddin, Shahriyar Rifat
- Team ÚFAL At CMCL 2022 Shared Task: Figuring Out The Correct Recipe For Predicting Eye-tracking Features Using Pretrained Language Models Bhattacharya Sunit, Kumar Rishu, Bojar Ondrej
- BLOOM: A 176b-parameter Open-access Multilingual Language Model Bigscience Workshop, Teven Le Scao, Angela Fan, Christopher Akiki, Ellie Pavlick, Suzana Ilić, Daniel Hesslow, Roman Castagné, Alexandra Sasha Luccioni, François Yvon, Matthias Gallé, Jonathan Tow, Alexander M. Rush, Stella Biderman, Albert Webson, Pawan Sasanka Ammanamanchi, Thomas Wang, Benoît Sagot, Niklas Muennighoff, Albert Villanova Del Moral, Olatunji Ruwase, Rachel Bawden, Stas Bekman, Angelina Mcmillan-major, Iz Beltagy, Huu Nguyen, Lucile Saulnier, Samson Tan, Pedro Ortiz Suarez, Victor Sanh, Hugo Laurençon, Yacine Jernite, Julien Launay, Margaret Mitchell, Colin Raffel, Aaron Gokaslan, Adi Simhi, Aitor Soroa, Alham Fikri Aji, Amit Alfassy, Anna Rogers, Ariel Kreisberg Nitzav, Canwen Xu, Chenghao Mou, Chris Emezue, Christopher Klamm, Colin Leong, Daniel Van Strien, David Ifeoluwa Adelani, Dragomir Radev, Eduardo González Ponferrada, Efrat Levkovizh, Ethan Kim, Eyal Bar Natan, Francesco De Toni, Gérard Dupont, Germán Kruszewski, Giada Pistilli, Hady Elsahar, Hamza Benyamina, Hieu Tran, Ian Yu, Idris Abdulmumin, Isaac Johnson, Itziar Gonzalez-dios, Javier De La Rosa, Jenny Chim, Jesse Dodge, Jian Zhu, Jonathan Chang, Jörg Frohberg, Joseph Tobing, Joydeep Bhattacharjee, Khalid Almubarak, Kimbo Chen, Kyle Lo, Leandro Von Werra, Leon Weber, Long Phan, Loubna Ben Allal, Ludovic Tanguy, Manan Dey, Manuel Romero Muñoz, Maraim Masoud, María Grandury, Mario Šaško, Max Huang, Maximin Coavoux, Mayank Singh, Mike Tian-jian Jiang, Minh Chien Vu, Mohammad A. Jauhar, Mustafa Ghaleb, Nishant Subramani, Nora Kassner, Nurulaqilla Khamis, Olivier Nguyen, Omar Espejel, Ona De Gibert, Paulo Villegas, Peter Henderson, Pierre Colombo, Priscilla Amuok, Quentin Lhoest, Rheza Harliman, Rishi Bommasani, Roberto Luis López, Rui Ribeiro, Salomey Osei, Sampo Pyysalo, Sebastian Nagel, Shamik Bose, Shamsuddeen Hassan Muhammad, Shanya Sharma, Shayne Longpre, Somaieh Nikpoor, Stanislav Silberberg, Suhas Pai, Sydney Zink, Tiago Timponi Torrent, Timo Schick, Tristan Thrush, Valentin Danchev, Vassilina Nikoulina, Veronika Laippala, Violette Lepercq, Vrinda Prabhu, Zaid Alyafeai, Zeerak Talat, Arun Raja, Benjamin Heinzerling, Chenglei Si, Davut Emre Taşar, Elizabeth Salesky, Sabrina J. Mielke, Wilson Y. Lee, Abheesht Sharma, Andrea Santilli, Antoine Chaffin, Arnaud Stiegler, Debajyoti Datta, Eliza Szczechla, Gunjan Chhablani, Han Wang, Harshit Pandey, Hendrik Strobelt, Jason Alan Fries, Jos Rozen, Leo Gao, Lintang Sutawika, M Saiful Bari, Maged S. Al-shaibani, Matteo Manica, Nihal Nayak, Ryan Teehan, Samuel Albanie, Sheng Shen, Srulik Ben-david, Stephen H. Bach, Taewoon Kim, Tali Bers, Thibault Fevry, Trishala Neeraj, Urmish Thakker, Vikas Raunak, Xiangru Tang, Zheng-xin Yong, Zhiqing Sun, Shaked Brody, Yallow Uri, Hadar Tojarieh, Adam Roberts, Hyung Won Chung, Jaesung Tae, Jason Phang, Ofir Press, Conglong Li, Deepak Narayanan, Hatim Bourfoune, Jared Casper, Jeff Rasley, Max Ryabinin, Mayank Mishra, Minjia Zhang, Mohammad Shoeybi, Myriam Peyrounette, Nicolas Patry, Nouamane Tazi, Omar Sanseviero, Patrick Von Platen, Pierre Cornette, Pierre François Lavallée, Rémi Lacroix, Samyam Rajbhandari, Sanchit Gandhi, Shaden Smith, Stéphane Requena, Suraj Patil, Tim Dettmers, Ahmed Baruwa, Amanpreet Singh, Anastasia Cheveleva, Anne-laure Ligozat, Arjun Subramonian, Aurélie Névéol, Charles Lovering, Dan Garrette, Deepak Tunuguntla, Ehud Reiter, Ekaterina Taktasheva, Ekaterina Voloshina, Eli Bogdanov, Genta Indra Winata, Hailey Schoelkopf, Jan-christoph Kalo, Jekaterina Novikova, Jessica Zosa Forde, Jordan Clive, Jungo Kasai, Ken Kawamura, Liam Hazan, Marine Carpuat, Miruna Clinciu, Najoung Kim, Newton Cheng, Oleg Serikov, Omer Antverg, Oskar Van Der Wal, Rui Zhang, Ruochen Zhang, Sebastian Gehrmann, Shachar Mirkin, Shani Pais, Tatiana Shavrina, Thomas Scialom, Tian Yun, Tomasz Limisiewicz, Verena Rieser, Vitaly Protasov, Vladislav Mikhailov, Yada Pruksachatkun, Yonatan Belinkov, Zachary Bamberger, Zdeněk Kasner, Alice Rueda, Amanda Pestana, Amir Feizpour, Ammar Khan, Amy Faranak, Ana Santos, Anthony Hevia, Antigona Unldreaj, Arash Aghagol, Arezoo Abdollahi, Aycha Tammour, Azadeh Hajihosseini, Bahareh Behroozi, Benjamin Ajibade, Bharat Saxena, Carlos Muñoz Ferrandis, Daniel Mcduff, Danish Contractor, David Lansky, Davis David, Douwe Kiela, Duong A. Nguyen, Edward Tan, Emi Baylor, Ezinwanne Ozoani, Fatima Mirza, Frankline Ononiwu, Habib Rezanejad, Hessie Jones, Indrani Bhattacharya, Irene Solaiman, Irina Sedenko, Isar Nejadgholi, Jesse Passmore, Josh Seltzer, Julio Bonis Sanz, Livia Dutra, Mairon Samagaio, Maraim Elbadri, Margot Mieskes, Marissa Gerchick, Martha Akinlolu, Michael Mckenna, Mike Qiu, Muhammed Ghauri, Mykola Burynok, Nafis Abrar, Nazneen Rajani, Nour Elkott, Nour Fahmy, Olanrewaju Samuel, Ran An, Rasmus Kromann, Ryan Hao, Samira Alizadeh, Sarmad Shubber, Silas Wang, Sourav Roy, Sylvain Viguier, Thanh Le, Tobi Oyebade, Trieu Le, Yoyo Yang, Zach Nguyen, Abhinav Ramesh Kashyap, Alfredo Palasciano, Alison Callahan, Anima Shukla, Antonio Miranda-escalada, Ayush Singh, Benjamin Beilharz, Bo Wang, Caio Brito, Chenxi Zhou, Chirag Jain, Chuxin Xu, Clémentine Fourrier, Daniel León Periñán, Daniel Molano, Dian Yu, Enrique Manjavacas, Fabio Barth, Florian Fuhrimann, Gabriel Altay, Giyaseddin Bayrak, Gully Burns, Helena U. Vrabec, Imane Bello, Ishani Dash, Jihyun Kang, John Giorgi, Jonas Golde, Jose David Posada, Karthik Rangasai Sivaraman, Lokesh Bulchandani, Lu Liu, Luisa Shinzato, Madeleine Hahn De Bykhovetz, Maiko Takeuchi, Marc Pàmies, Maria A Castillo, Marianna Nezhurina, Mario Sänger, Matthias Samwald, Michael Cullan, Michael Weinberg, Michiel De Wolf, Mina Mihaljcic, Minna Liu, Moritz Freidank, Myungsun Kang, Natasha Seelam, Nathan Dahlberg, Nicholas Michio Broad, Nikolaus Muellner, Pascale Fung, Patrick Haller, Ramya Chandrasekhar, Renata Eisenberg, Robert Martin, Rodrigo Canalli, Rosaline Su, Ruisi Su, Samuel Cahyawijaya, Samuele Garda, Shlok S Deshmukh, Shubhanshu Mishra, Sid Kiblawi, Simon Ott, Sinee Sang-aroonsiri, Srishti Kumar, Stefan Schweter, Sushil Bharati, Tanmay Laud, Théo Gigant, Tomoya Kainuma, Wojciech Kusa, Yanis Labrak, Yash Shailesh Bajaj, Yash Venkatraman, Yifan Xu, Yingxin Xu, Yu Xu, Zhe Tan, Zhongli Xie, Zifan Ye, Mathilde Bras, Younes Belkada, Thomas Wolf
- Using Cognitive Psychology To Understand GPT-3 Binz Marcel, Schulz Eric
- Inpars: Data Augmentation For Information Retrieval Using Large Language Models Bonifacio Luiz, Abonizio Hugo, Fadaee Marzieh, Nogueira Rodrigo
- Looking For A Handsome Carpenter! Debiasing GPT-3 Job Advertisements Borchers Conrad, Gala Dalia Sara, Gilburt Benjamin, Oravkin Eduard, Bounsi Wilfried, Asano Yuki M., Kirk Hannah Rose
- Petals: Collaborative Inference And Fine-tuning Of Large Models Borzunov Alexander, Baranchuk Dmitry, Dettmers Tim, Ryabinin Max, Belkada Younes, Chumachenko Artem, Samygin Pavel, Raffel Colin
- Understanding Performance Of Long-document Ranking Models Through Comprehensive Evaluation And Leaderboarding Boytsov Leonid, Akinpelu David, Lin Tianyi, Gao Fangwei, Zhao Yutian, Huang Jeffrey, Katyal Nipun, Nyberg Eric
- Evaluating The Susceptibility Of Pre-trained Language Models Via Handcrafted Adversarial Examples Branch Hezekiah J., Cefalu Jonathan Rodriguez, Mchugh Jeremy, Hujer Leyla, Bahl Aditya, Iglesias Daniel Del Castillo, Heichman Ron, Darwishi Ramesh
- Large Language Models Can Implement Policy Iteration Brooks Ethan, Walls Logan, Lewis Richard L., Singh Satinder
- Understanding Attention For Vision-and-language Tasks Cao Feiqi, Han Soyeon Caren, Long Siqu, Xu Changwei, Poon Josiah
- HIBRIDS: Attention With Hierarchical Biases For Structure-aware Long Document Summarization Cao Shuyang, Wang Lu
- A Model-agnostic Data Manipulation Method For Persona-based Dialogue Generation Cao Yu, Bi Wei, Fang Meng, Shi Shuming, Tao Dacheng
- Pre-trained Neural Language Models For Automatic Mobile App User Feedback Answer Generation Cao Yue, Fard Fatemeh H.
- Analyzing Semantic Faithfulness Of Language Models Via Input Intervention On Question Answering Chaturvedi Akshay, Bhar Swarnadeep, Saha Soumadeep, Garain Utpal, Asher Nicholas
- Litevl: Efficient Video-language Learning With Enhanced Spatial-temporal Modeling Chen Dongsheng, Tao Chaofan, Hou Lu, Shang Lifeng, Jiang Xin, Liu Qun
- Character-centric Story Visualization Via Visual Planning And Token Alignment Chen Hong, Han Rujun, Wu Te-lin, Nakayama Hideki, Peng Nanyun
- Controllable Text Generation With Language Constraints Chen Howard, Li Huihan, Chen Danqi, Narasimhan Karthik
- Xdoc: Unified Pre-training For Cross-format Document Understanding Chen Jingye, Lv Tengchao, Cui Lei, Zhang Cha, Wei Furu
- Imputing Out-of-vocabulary Embeddings With LOVE Makes Language Models Robust With Little Cost Chen Lihu, Varoquaux Gaël, Suchanek Fabian M.
- Exploring And Exploiting Multi-granularity Representations For Machine Reading Comprehension Chen Nuo, You Chenyu
- Murag: Multimodal Retrieval-augmented Generator For Open Question Answering Over Images And Text Chen Wenhu, Hu Hexiang, Chen Xi, Verga Pat, Cohen William W.
- Pali: A Jointly-scaled Multilingual Language-image Model Chen Xi, Wang Xiao, Changpinyo Soravit, Piergiovanni Aj, Padlewski Piotr, Salz Daniel, Goodman Sebastian, Grycner Adam, Mustafa Basil, Beyer Lucas, Kolesnikov Alexander, Puigcerver Joan, Ding Nan, Rong Keran, Akbari Hassan, Mishra Gaurav, Xue Linting, Thapliyal Ashish, Bradbury James, Kuo Weicheng, Seyedhosseini Mojtaba, Jia Chao, Ayan Burcu Karagol, Riquelme Carlos, Steiner Andreas, Angelova Anelia, Zhai Xiaohua, Houlsby Neil, Soricut Radu
- Towards Improving Faithfulness In Abstractive Summarization Chen Xiuying, Li Mingzhe, Gao Xin, Zhang Xiangliang
- Transformers Go For The Lols: Generating (humourous) Titles From Scientific Abstracts End-to-end Chen Yanran, Eger Steffen
- Empowering Parameter-efficient Transfer Learning By Recognizing The Kernel Structure In Self-attention Chen Yifan, Hazarika Devamanyu, Namazifar Mahdi, Liu Yang, Jin Di, Hakkani-tur Dilek
- Adaprompt: Adaptive Model Training For Prompt-based NLP Chen Yulong, Liu Yang, Dong Li, Wang Shuohang, Zhu Chenguang, Zeng Michael, Zhang Yue
- Curriculum: A Broad-coverage Benchmark For Linguistic Phenomena In Natural Language Understanding Chen Zeming, Gao Qiyue
- Task-aware Specialization For Efficient And Robust Dense Retrieval For Open-domain Question Answering Cheng Hao, Fang Hao, Liu Xiaodong, Gao Jianfeng
- Capture Salient Historical Information: A Fast And Accurate Non-autoregressive Model For Multi-turn Spoken Language Understanding Cheng Lizhi, Jia Weijia, Yang Wenmian
- Is Multiwoz A Solved Task? An Interactive TOD Evaluation Framework With User Simulator Cheng Qinyuan, Li Linyang, Quan Guofeng, Gao Feng, Mou Xiaofeng, Qiu Xipeng
- Neural Machine Translation With Contrastive Translation Memories Cheng Xin, Gao Shen, Liu Lemao, Zhao Dongyan, Yan Rui
- Prompting GPT-3 To Be Reliable Chenglei Si, Zhe Gan, Zhengyuan Yang, Shuohang Wang, Jianfeng Wang, Jordan Boyd-graber, Lijuan Wang
- Vault: Augmenting The Vision-and-language Transformer For Sentiment Classification On Social Media Chochlakis Georgios, Srinivasan Tejas, Thomason Jesse, Narayanan Shrikanth
- Meta-learning Fast Weight Language Models Clark Kevin, Guu Kelvin, Chang Ming-wei, Pasupat Panupong, Hinton Geoffrey, Norouzi Mohammad
- On Reality And The Limits Of Language Data: Aligning Llms With Human Norms Collier Nigel H., Liu Fangyu, Shareghi Ehsan
- LSG Attention: Extrapolation Of Pretrained Transformers To Long Sequences Condevaux Charles, Harispe Sébastien
- Building The Intent Landscape Of Real-world Conversational Corpora With Extractive Question-answering Transformers Corbeil Jean-philippe, Li Mia Taige, Ghavidel Hadi Abdi
- M6-rec: Generative Pretrained Language Models Are Open-ended Recommender Systems Cui Zeyu, Ma Jianxin, Zhou Chang, Zhou Jingren, Yang Hongxia
- The Future Is Different: Large Pre-trained Language Models Fail In Prediction Tasks Cvejoski Kostadin, Sánchez Ramsés J., Ojeda César
- Scientific And Creative Analogies In Pretrained Language Models Czinczoll Tamara, Yannakoudakis Helen, Mishra Pushkar, Shutova Ekaterina
- Neural Knowledge Bank For Pretrained Transformers Dai Damai, Jiang Wenbin, Dong Qingxiu, Lyu Yajuan, She Qiaoqiao, Sui Zhifang
- Why Can GPT Learn In-context? Language Models Implicitly Perform Gradient Descent As Meta-optimizers Dai Damai, Sun Yutao, Dong Li, Hao Yaru, Ma Shuming, Sui Zhifang, Wei Furu
- Enabling Multimodal Generation On CLIP Via Vision-language Knowledge Distillation Dai Wenliang, Hou Lu, Shang Lifeng, Jiang Xin, Liu Qun, Fung Pascale
- Promptagator: Few-shot Dense Retrieval From 8 Examples Dai Zhuyun, Zhao Vincent Y., Ma Ji, Luan Yi, Ni Jianmo, Lu Jing, Bakalov Anton, Guu Kelvin, Hall Keith B., Chang Ming-wei
- Understanding BLOOM: An Empirical Study On Diverse NLP Tasks Dakle Parag Pravin, Rallabandi Saikrishna, Raghavan Preethi
- Fido: Fusion-in-decoder Optimized For Stronger Performance And Faster Inference De Jong Michiel, Zemlyanskiy Yury, Ainslie Joshua, Fitzgerald Nicholas, Sanghai Sumit, Sha Fei, Cohen William
- Rlprompt: Optimizing Discrete Text Prompts With Reinforcement Learning Deng Mingkai, Wang Jianyu, Hsieh Cheng-ping, Wang Yihan, Guo Han, Shu Tianmin, Song Meng, Xing Eric P., Hu Zhiting
- Least-to-most Prompting Enables Complex Reasoning In Large Language Models Denny Zhou, Nathanael Schärli, Le Hou, Jason Wei, Nathan Scales, Xuezhi Wang, Dale Schuurmans, Claire Cui, Olivier Bousquet, Quoc Le, Ed Chi
- Llm.int8(): 8-bit Matrix Multiplication For Transformers At Scale Dettmers Tim, Lewis Mike, Belkada Younes, Zettlemoyer Luke
- Less Is More: Linear Layers On CLIP Features As Powerful Vizwiz Model Deuser Fabian, Habel Konrad, Rösch Philipp J., Oswald Norbert
- Pre-training Transformer Models With Sentence-level Objectives For Answer Sentence Selection Di Liello Luca, Garg Siddhant, Soldaini Luca, Moschitti Alessandro
- Black-box Prompt Learning For Pre-trained Language Models Diao Shizhe, Huang Zhichao, Xu Ruijia, Li Xuechun, Lin Yong, Zhou Xiao, Zhang Tong
- Is GPT-3 A Good Data Annotator? Ding Bosheng, Qin Chengwei, Liu Linlin, Chia Yew Ken, Joty Shafiq, Li Boyang, Bing Lidong
- Cogview2: Faster And Better Text-to-image Generation Via Hierarchical Transformers Ding Ming, Zheng Wendi, Hong Wenyi, Tang Jie
- Parallel Attention Forcing For Machine Translation Dou Qingyun, Gales Mark
- Coarse-to-fine Vision-language Pre-training With Fusion In The Backbone Dou Zi-yi, Kamath Aishwarya, Gan Zhe, Zhang Pengchuan, Wang Jianfeng, Li Linjie, Liu Zicheng, Liu Ce, Lecun Yann, Peng Nanyun, Gao Jianfeng, Wang Lijuan
- Contrastive Learning With Bidirectional Transformers For Sequential Recommendation Du Hanwen, Shi Hui, Zhao Pengpeng, Wang Deqing, Sheng Victor S., Liu Yanchi, Liu Guanfeng, Zhao Lei
- A Survey Of Vision-language Pre-trained Models Du Yifan, Liu Zikang, Li Junyi, Zhao Wayne Xin
- Imagination Is All You Need! Curved Contrastive Learning For Abstract Sequence Modeling Utilized On Long Short-term Dialogue Planning Erker Justus-jonas, Schaffer Stefan, Spanakis Gerasimos
- Leveraging Pre-trained Models For Failure Analysis Triplets Generation Ezukwoke Kenneth, Hoayek Anis, Batton-hubert Mireille, Boucher Xavier, Gounet Pascal, Adrian Jerome
- Towards Winoqueer: Developing A Benchmark For Anti-queer Bias In Large Language Models Felkner Virginia K., Chang Ho-chun Herbert, Jang Eugene, May Jonathan
- CERES: Pretraining Of Graph-conditioned Transformer For Semi-structured Session Data Feng Rui, Luo Chen, Yin Qingyu, Yin Bing, Zhao Tuo, Zhang Chao
- Llmeffichecker: Understanding And Testing Efficiency Degradation Of Large Language Models Feng Xiaoning, Han Xiaohong, Chen Simin, Yang Wei
- Topic-aware Response Generation In Task-oriented Dialogue With Unstructured Knowledge Access Feng Yue, Lampouras Gerasimos, Iacobacci Ignacio
- Learn To Remember: Transformer With Recurrent Memory For Document-level Machine Translation Feng Yukun, Li Feng, Song Ziang, Zheng Boyuan, Koehn Philipp
- Towards Opening The Black Box Of Neural Machine Translation: Source And Target Interpretations Of The Transformer Ferrando Javier, Gállego Gerard I., Alastruey Belen, Escolano Carlos, Costa-jussà Marta R.
- Should Attention Be All We Need? The Epistemic And Ethical Implications Of Unification In Machine Learning Fishman Nic, Hancox-li Leif
- GPTQ: Accurate Post-training Quantization For Generative Pre-trained Transformers Frantar Elias, Ashkboos Saleh, Hoefler Torsten, Alistarh Dan
- Revision Transformers: Instructing Language Models To Change Their Values Friedrich Felix, Stammer Wolfgang, Schramowski Patrick, Kersting Kristian
- Adapterbias: Parameter-efficient Token-dependent Representation Shift For Adapters In NLP Tasks Fu Chin-lun, Chen Zih-ching, Lee Yun-ru, Lee Hung-yi
- Hungry Hungry Hippos: Towards Language Modeling With State Space Models Fu Daniel Y., Dao Tri, Saab Khaled K., Thomas Armin W., Rudra Atri, Ré Christopher
- An Empirical Study Of End-to-end Video-language Transformers With Masked Visual Modeling Fu Tsu-jui, Li Linjie, Gan Zhe, Lin Kevin, Wang William Yang, Wang Lijuan, Liu Zicheng
- Complexity-based Prompting For Multi-step Reasoning Fu Yao, Peng Hao, Sabharwal Ashish, Clark Peter, Khot Tushar
- Normsage: Multi-lingual Multi-cultural Norm Discovery From Conversations On-the-fly Fung Yi R., Chakraborty Tuhin, Guo Hao, Rambow Owen, Muresan Smaranda, Ji Heng
- Pnlp-mixer: An Efficient All-mlp Architecture For Language Fusco Francesco, Pascual Damian, Staar Peter, Antognini Diego
- Visual Prompt Tuning For Test-time Domain Adaptation Gao Yunhe, Shi Xingjian, Zhu Yi, Wang Hao, Tang Zhiqiang, Zhou Xiong, Li Mu, Metaxas Dimitris N.
- Cramming: Training A Language Model On A Single GPU In One Day Geiping Jonas, Goldstein Tom
- Microbert: Effective Training Of Low-resource Monolingual Berts Through Parameter Reduction And Multitask Learning Gessler Luke, Zeldes Amir
- Re2g: Retrieve, Rerank, Generate Glass Michael, Rossiello Gaetano, Chowdhury Md Faisal Mahbub, Naik Ankita Rajaram, Cai Pengshan, Gliozzo Alfio
- Does Entity Abstraction Help Generative Transformers Reason? Gontier Nicolas, Reddy Siva, Pal Christopher
- Checks And Strategies For Enabling Code-switched Machine Translation Gowda Thamme, Gheini Mozhdeh, May Jonathan
- Massively Multilingual Lexical Specialization Of Multilingual Transformers Green Tommaso, Ponzetto Simone Paolo, Glavaš Goran
- Continuous Decomposition Of Granularity For Neural Paraphrase Generation Gu Xiaodong, Zhang Zhaowei, Lee Sang-woo, Yoo Kang Min, Ha Jung-woo
- Generating Coherent Narratives By Learning Dynamic And Discrete Entity States With A Contrastive Framework Guan Jian, Yang Zhenyu, Zhang Rongsheng, Hu Zhipeng, Huang Minlie
- Transkimmer: Transformer Learns To Layer-wise Skim Guan Yue, Li Zhengyi, Leng Jingwen, Lin Zhouhan, Guo Minyi
- Optimal Transport For Unsupervised Hallucination Detection In Neural Machine Translation Guerreiro Nuno M., Colombo Pierre, Piantanida Pablo, Martins André F. T.
- Looking For A Needle In A Haystack: A Comprehensive Study Of Hallucinations In Neural Machine Translation Guerreiro Nuno M., Voita Elena, Martins André F. T.
- Unixcoder: Unified Cross-modal Pre-training For Code Representation Guo Daya, Lu Shuai, Duan Nan, Wang Yanlin, Zhou Ming, Yin Jian
- Visually-augmented Pretrained Language Models For NLP Tasks Without Images Guo Hangyu, Zhou Kun, Zhao Wayne Xin, Zhang Qinyu, Wen Ji-rong
- LVP-M3: Language-aware Visual Prompt For Multilingual Multimodal Machine Translation Guo Hongcheng, Liu Jiaheng, Huang Haoyang, Yang Jian, Li Zhoujun, Zhang Dongdong, Cui Zheng, Wei Furu
- STI: Turbocharge NLP Inference At The Edge Via Elastic Pipelining Guo Liwei, Choe Wonkyo, Lin Felix Xiaozhu
- "John Is 50 Years Old, Can His Son Be 65?" Evaluating NLP Models' Understanding Of Feasibility Gupta Himanshu, Varshney Neeraj, Mishra Swaroop, Pal Kuntal Kumar, Sawant Saurabh Arjun, Scaria Kevin, Goyal Siddharth, Baral Chitta
- Understanding HTML With Large Language Models Gur Izzeddin, Nachum Ofir, Miao Yingjie, Safdari Mustafa, Huang Austin, Chowdhery Aakanksha, Narang Sharan, Fiedel Noah, Faust Aleksandra
- Quala-minilm: A Quantized Length Adaptive Minilm Guskin Shira, Wasserblat Moshe, Wang Chang, Shen Haihao
- Thinking About GPT-3 In-context Learning For Biomedical IE? Think Again Gutiérrez Bernal Jiménez, Mcneal Nikolas, Washington Clay, Chen You, Li Lang, Sun Huan, Su Yu
- Thinking Fast And Slow In Large Language Models Hagendorff Thilo, Fabi Sarah, Kosinski Michal
- LUNA: Language Understanding With Number Augmentations On Transformers Via Number Plugins And Pre-training Han Hongwei, Xu Jialiang, Zhou Mengyu, Shao Yijia, Han Shi, Zhang Dongmei
- FOLIO: Natural Language Reasoning With First-order Logic Han Simeng, Schoelkopf Hailey, Zhao Yilun, Qi Zhenting, Riddell Martin, Zhou Wenfei, Coady James, Peng David, Qiao Yujie, Benson Luke, Sun Lucy, Wardle-solano Alex, Szabo Hannah, Zubova Ekaterina, Burtell Matthew, Fan Jonathan, Liu Yixin, Wong Brian, Sailor Malcolm, Ni Ansong, Nan Linyong, Kasai Jungo, Yu Tao, Zhang Rui, Fabbri Alexander R., Kryscinski Wojciech, Yavuz Semih, Liu Ye, Lin Xi Victoria, Joty Shafiq, Zhou Yingbo, Xiong Caiming, Ying Rex, Cohan Arman, Radev Dragomir
- SSD-LM: Semi-autoregressive Simplex-based Diffusion Language Model For Text Generation And Modular Control Han Xiaochuang, Kumar Sachin, Tsvetkov Yulia
- ORCA: Interpreting Prompted Language Models Via Locating Supporting Data Evidence In The Ocean Of Pretraining Data Han Xiaochuang, Tsvetkov Yulia
- Supervised Visual Attention For Simultaneous Multimodal Machine Translation Haralampieva Veneta, Caglayan Ozan, Specia Lucia
- Aging With GRACE: Lifelong Model Editing With Discrete Key-value Adaptors Hartvigsen Thomas, Sankaranarayanan Swami, Palangi Hamid, Kim Yoon, Ghassemi Marzyeh
- How Much Does Attention Actually Attend? Questioning The Importance Of Attention In Pretrained Transformers Hassid Michael, Peng Hao, Rotem Daniel, Kasai Jungo, Montero Ivan, Smith Noah A., Schwartz Roy
- Understanding Transformer Memorization Recall Through Idioms Haviv Adi, Cohen Ido, Gidron Jacob, Schuster Roei, Goldberg Yoav, Geva Mor
- Transformer Language Models Without Positional Encodings Still Learn Positional Information Haviv Adi, Ram Ori, Press Ofir, Izsak Peter, Levy Omer
- Rethinking With Retrieval: Faithful Large Language Model Inference He Hangfeng, Zhang Hongming, Roth Dan
- Z-code++: A Pre-trained Language Model Optimized For Abstractive Summarization He Pengcheng, Peng Baolin, Lu Liyang, Wang Song, Mei Jie, Liu Yang, Xu Ruochen, Awadalla Hany Hassan, Shi Yu, Zhu Chenguang, Xiong Wayne, Zeng Michael, Gao Jianfeng, Huang Xuedong
- VLMAE: Vision-language Masked Autoencoder He Sunan, Guo Taian, Dai Tao, Qiao Ruizhi, Wu Chen, Shu Xiujun, Ren Bo
- SPACE-3: Unified Dialog Model Pre-training For Task-oriented Dialog Understanding And Generation He Wanwei, Dai Yinpei, Yang Min, Sun Jian, Huang Fei, Si Luo, Li Yongbin
- TRAC: A Textual Benchmark For Reasoning About Actions And Change He Weinan, Huang Canming, Xiao Zhanhao, Liu Yongmei
- Hyperprompt: Prompt-based Task-conditioning Of Transformers He Yun, Zheng Huaixiu Steven, Tay Yi, Gupta Jai, Du Yu, Aribandi Vamsi, Zhao Zhe, Li Yaguang, Chen Zhao, Metzler Donald, Cheng Heng-tze, Chi Ed H.
- Bundle MCR: Towards Conversational Bundle Recommendation He Zhankui, Zhao Handong, Yu Tong, Kim Sungchul, Du Fan, Mcauley Julian
- Robust Dialogue State Tracking With Weak Supervision And Sparse Data Heck Michael, Lubis Nurul, Van Niekerk Carel, Feng Shutong, Geishauser Christian, Lin Hsien-chin, Gašić Milica
- Shapley Head Pruning: Identifying And Removing Interference In Multilingual Transformers Held William, Yang Diyi
- Prompt-to-prompt Image Editing With Cross Attention Control Hertz Amir, Mokady Ron, Tenenbaum Jay, Aberman Kfir, Pritch Yael, Cohen-or Daniel
- Demystifying Prompts In Language Models Via Perplexity Estimation Hila Gonen, Srini Iyer, Terra Blevins, Noah A. Smith, Luke Zettlemoyer
- Wikiwhy: Answering And Explaining Cause-and-effect Questions Ho Matthew, Sharma Aditya, Chang Justin, Saxon Michael, Levy Sharon, Lu Yujie, Wang William Yang
- Compressing Pre-trained Transformers Via Low-bit Nxm Sparsity For Natural Language Understanding Holmes Connor, Zhang Minjia, He Yuxiong, Wu Bo
- Cogvideo: Large-scale Pretraining For Text-to-video Generation Via Transformers Hong Wenyi, Ding Ming, Zheng Wendi, Liu Xinghan, Tang Jie
- TRUE: Re-evaluating Factual Consistency Evaluation Honovich Or, Aharoni Roee, Herzig Jonathan, Taitelbaum Hagai, Kukliansy Doron, Cohen Vered, Scialom Thomas, Szpektor Idan, Hassidim Avinatan, Matias Yossi
- Instruction Induction: From Few Examples To Natural Language Task Descriptions Honovich Or, Shaham Uri, Bowman Samuel R., Levy Omer
- Syntax-guided Localized Self-attention By Constituency Syntactic Distance Hou Shengyuan, Kai Jushi, Xue Haotian, Zhu Bingyu, Yuan Bo, Huang Longtao, Wang Xinbing, Lin Zhouhan
- XDBERT: Distilling Visual Information To BERT From Cross-modal Systems To Improve Language Understanding Hsu Chan-jan, Lee Hung-yi, Tsao Yu
- Unified Discrete Diffusion For Simultaneous Vision-language Generation Hu Minghui, Zheng Chuanxia, Zheng Heliang, Cham Tat-jen, Wang Chaoyue, Yang Zuopeng, Tao Dacheng, Suganthan Ponnuthurai N.
- Empirical Evaluation Of Post-training Quantization Methods For Language Tasks Hu Ting, Meinel Christoph, Yang Haojin
- Fctalker: Fine And Coarse Grained Context Modeling For Expressive Conversational Speech Synthesis Hu Yifan, Liu Rui, Gao Guanglai, Li Haizhou
- Promptcap: Prompt-guided Task-aware Image Captioning Hu Yushi, Hua Hang, Yang Zhengyuan, Shi Weijia, Smith Noah A., Luo Jiebo
- PLANET: Dynamic Content Planning In Autoregressive Transformers For Long-form Text Generation Hu Zhe, Chan Hou Pong, Liu Jiachen, Xiao Xinyan, Wu Hua, Huang Lifu
- Empowering Language Models With Knowledge Graph Reasoning For Question Answering Hu Ziniu, Xu Yichong, Yu Wenhao, Wang Shuohang, Yang Ziyi, Zhu Chenguang, Chang Kai-wei, Sun Yizhou
- Transformer Quality In Linear Time Hua Weizhe, Dai Zihang, Liu Hanxiao, Le Quoc V.
- Personalized Dialogue Generation With Persona-adaptive Attention Huang Qiushi, Zhang Yu, Ko Tom, Liu Xubo, Wu Bo, Wang Wenwu, Tang Lilian
- Detecting Unintended Memorization In Language-model-fused ASR Huang W. Ronny, Chien Steve, Thakkar Om, Mathews Rajiv
- FPT: Improving Prompt Tuning Efficiency Via Progressive Training Huang Yufei, Qin Yujia, Wang Huadong, Yin Yichun, Sun Maosong, Liu Zhiyuan, Liu Qun
- Layoutlmv3: Pre-training For Document AI With Unified Text And Image Masking Huang Yupan, Lv Tengchao, Cui Lei, Lu Yutong, Wei Furu
- From Human Days To Machine Seconds: Automatically Answering And Generating Machine Learning Final Exams Iddo Drori, Sarah J. Zhang, Reece Shuttleworth, Sarah Zhang, Keith Tyser, Zad Chin, Pedro Lantigua, Saisamrit Surbehera, Gregory Hunter, Derek Austin, Leonard Tang, Yann Hicke, Sage Simhon, Sathwik Karnik, Darnell Granberry, Madeleine Udell
- Do Berts Learn To Use Browser User Interface? Exploring Multi-step Tasks With Unified Vision-and-language Berts Iki Taichi, Aizawa Akiko
- Uniform Complexity For Text Generation Imperial Joseph Marvin, Madabushi Harish Tayyar
- Efficient Long-text Understanding With Short-text Models Ivgi Maor, Shaham Uri, Berant Jonathan
- Can Large Language Models Truly Understand Prompts? A Case Study With Negated Prompts Jang Joel, Ye Seonghyeon, Seo Minjoon
- Chain-of-thought Prompting Elicits Reasoning In Large Language Models Jason Wei, Xuezhi Wang, Dale Schuurmans, Maarten Bosma, Brian Ichter, Fei Xia, Ed Chi, Quoc Le, Denny Zhou
- Litetransformersearch: Training-free Neural Architecture Search For Efficient Language Models Javaheripi Mojan, De Rosa Gustavo H., Mukherjee Subhabrata, Shah Shital, Religa Tomasz L., Mendes Caio C. T., Bubeck Sebastien, Koushanfar Farinaz, Dey Debadeepta
- Chatgpt Makes Medicine Easy To Swallow: An Exploratory Case Study On Simplified Radiology Reports Jeblick Katharina, Schachtner Balthasar, Dexl Jakob, Mittermeier Andreas, Stüber Anna Theresa, Topalis Johanna, Weber Tobias, Wesp Philipp, Sabel Bastian, Ricke Jens, Ingrisch Michael
- DP-KB: Data Programming With Knowledge Bases Improves Transformer Fine Tuning For Answer Sentence Selection Jedema Nic, Vu Thuy, Gupta Manish, Moschitti Alessandro
- Lamemo: Language Modeling With Look-ahead Memory Ji Haozhe, Zhang Rongsheng, Yang Zhenyu, Hu Zhipeng, Huang Minlie
- Survey Of Hallucination In Natural Language Generation Ji Ziwei, Lee Nayeon, Frieske Rita, Yu Tiezheng, Su Dan, Xu Yan, Ishii Etsuko, Bang Yejin, Chen Delong, Dai Wenliang, Chan Ho Shu, Madotto Andrea, Fung Pascale
- RHO (ρ): Reducing Hallucination In Open-domain Dialogues With Knowledge Grounding Ji Ziwei, Liu Zihan, Lee Nayeon, Yu Tiezheng, Wilie Bryan, Zeng Min, Fung Pascale
- Visual Prompt Tuning Jia Menglin, Tang Luming, Chen Bor-chun, Cardie Claire, Belongie Serge, Hariharan Bharath, Lim Ser-nam
- Contrastive Learning For Prompt-based Few-shot Language Learners Jian Yiren, Gao Chongyang, Vosoughi Soroush
- Pairreranker: Pairwise Reranking For Natural Language Generation Jiang Dongfu, Lin Bill Yuchen, Ren Xiang
- Pseudo-q: Generating Pseudo Language Queries For Visual Grounding Jiang Haojun, Lin Yuanze, Han Dongchen, Song Shiji, Huang Gao
- Informative Language Representation Learning For Massively Multilingual Neural Machine Translation Jin Renren, Xiong Deyi
- Probing Script Knowledge From Pre-trained Models Jin Zijian, Zhang Xingyu, Yu Mo, Huang Lifu
- Training Compute-optimal Large Language Models Jordan Hoffmann, Sebastian Borgeaud, Arthur Mensch, Elena Buchatskaya, Trevor Cai, Eliza Rutherford, Diego De Las Casas, Lisa Anne Hendricks, Johannes Welbl, Aidan Clark, Tom Hennigan, Eric Noland, Katie Millican, George Van Den Driessche, Bogdan Damoc, Aurelia Guy, Simon Osindero, Karen Simonyan, Erich Elsen, Jack W. Rae, Oriol Vinyals, Laurent Sifre
- Self-prompting Large Language Models For Zero-shot Open-domain QA Junlong Li, Jinyuan Wang, Zhuosheng Zhang, Hai Zhao
- Latency Adjustable Transformer Encoder For Language Understanding Kachuee Sajjad, Sharifkhani Mohammad
- Action-gpt: Leveraging Large-scale Language Models For Improved And Generalized Action Generation Kalakonda Sai Shashank, Maheshwari Shubh, Sarvadevabhatla Ravi Kiran
- A New Path: Scaling Vision-and-language Navigation With Synthetic Instructions And Imitation Learning Kamath Aishwarya, Anderson Peter, Wang Su, Koh Jing Yu, Ku Alexander, Waters Austin, Yang Yinfei, Baldridge Jason, Parekh Zarana
- Gpt-neo For Commonsense Reasoning -- A Theoretical And Practical Lens Kashyap Rohan, Kashyap Vivek, P. Narendra C.
- Fie: Building A Global Probability Space By Leveraging Early Fusion In Encoder For Open-domain Question Answering Kedia Akhil, Zaidi Mohd Abbas, Lee Haejun
- Decomposed Prompting: A Modular Approach For Solving Complex Tasks Khot Tushar, Trivedi Harsh, Finlayson Matthew, Fu Yao, Richardson Kyle, Clark Peter, Sabharwal Ashish
- High Recall Data-to-text Generation With Progressive Edit Kim Choonghan, Lee Gary Geunbae
- Who Says Elephants Can't Run: Bringing Large Scale Moe Models Into Cloud Scale Production Kim Young Jin, Henry Rawn, Fahim Raffy, Awadalla Hany Hassan
- Iterative Vision-and-language Navigation Krantz Jacob, Banerjee Shurjo, Zhu Wang, Corso Jason, Anderson Peter, Lee Stefan, Thomason Jesse
- Downstream Datasets Make Surprisingly Good Pretraining Corpora Krishna Kundan, Garg Saurabh, Bigham Jeffrey P., Lipton Zachary C.
- Towards The Generation Of Musical Explanations With GPT-3 Krol Stephen James, Llano Maria Teresa, Mccormack Jon
- Few-shot Question Generation For Personalized Feedback In Intelligent Tutoring Systems Kulshreshtha Devang, Shayan Muhammad, Belfer Robert, Reddy Siva, Serban Iulian Vlad, Kochmar Ekaterina
- Mucot: Multilingual Contrastive Training For Question-answering In Low-resource Languages Kumar Gokul Karthik, Gehlot Abhishek Singh, Mullappilly Sahal Shaji, Nandakumar Karthik
- The Optimal BERT Surgeon: Scalable And Accurate Second-order Pruning For Large Language Models Kurtic Eldar, Campos Daniel, Nguyen Tuan, Frantar Elias, Kurtz Mark, Fineran Benjamin, Goin Michael, Alistarh Dan
- Natural Language To Code Using Transformers Kusupati Uday, Ailavarapu Venkata Ravi Teja
- Language Detoxification With Attribute-discriminative Latent Space Kwak Jin Myung, Kim Minseon, Hwang Sung Ju
- Emergent Linguistic Structures In Neural Networks Are Fragile La Malfa Emanuele, Wicker Matthew, Kwiatkowska Marta
- Co-training Improves Prompt-based Learning For Large Language Models Lang Hunter, Agrawal Monica, Kim Yoon, Sontag David
- Does BERT Really Agree? Fine-grained Analysis Of Lexical Dependence On A Syntactic Task Lasri Karim, Lenci Alessandro, Poibeau Thierry
- Internet-augmented Language Models Through Few-shot Prompting For Open-domain Question Answering Lazaridou Angeliki, Gribovskaya Elena, Stokowiec Wojciech, Grigorev Nikolai
- Guiding Visual Question Answering With Attention Priors Le Thao Minh, Le Vuong, Gupta Sunil, Venkatesh Svetha, Tran Truyen
- Hard Gate Knowledge Distillation -- Leverage Calibration For Robust And Reliable Language Model Lee Dongkyu, Tian Zhiliang, Zhao Yingxiu, Cheung Ka Chun, Zhang Nevin L.
- Coauthor: Designing A Human-ai Collaborative Writing Dataset For Exploring Language Model Capabilities Lee Mina, Liang Percy, Yang Qian
- Evaluating Human-language Model Interaction Lee Mina, Srivastava Megha, Hardy Amelia, Thickstun John, Durmus Esin, Paranjape Ashwin, Gerard-ursin Ines, Li Xiang Lisa, Ladhak Faisal, Rong Frieda, Wang Rose E., Kwon Minae, Park Joon Sung, Cao Hancheng, Lee Tony, Bommasani Rishi, Bernstein Michael, Liang Percy
- Littlebird: Efficient Faster & Longer Transformer For Question Answering Lee Minchul, Han Kijong, Shin Myeong Cheol
- Dialogcc: An Automated Pipeline For Creating High-quality Multi-modal Dialogue Dataset Lee Young-jun, Ko Byungsoo, Kim Han-gyu, Hyeon Jonghwan, Choi Ho-jin
- Simple Recurrence Improves Masked Language Models Lei Tao, Tian Ran, Bastings Jasmijn, Parikh Ankur P.
- Fast Inference From Transformers Via Speculative Decoding Leviathan Yaniv, Kalman Matan, Matias Yossi
- GPT-D: Inducing Dementia-related Linguistic Anomalies By Deliberate Degradation Of Artificial Neural Language Models Li Changye, Knopman David, Xu Weizhe, Cohen Trevor, Pakhomov Serguei
- Large Language Models With Controllable Working Memory Li Daliang, Rawat Ankit Singh, Zaheer Manzil, Wang Xin, Lukasik Michal, Veit Andreas, Yu Felix, Kumar Sanjiv
- Summer: Wechat Neural Machine Translation Systems For The WMT22 Biomedical Translation Task Li Ernan, Meng Fandong, Zhou Jie
- A Survey On Retrieval-augmented Text Generation Li Huayang, Su Yixuan, Cai Deng, Wang Yan, Liu Lemao
- Counterfactual Reasoning: Do Language Models Need World Knowledge For Causal Understanding? Li Jiaxuan, Yu Lang, Ettinger Allyson
- Fine-grained Semantically Aligned Vision-language Pre-training Li Juncheng, He Xin, Wei Longhui, Qian Long, Zhu Linchao, Xie Lingxi, Zhuang Yueting, Tian Qi, Tang Siliang
- Learning To Transfer Prompts For Text Generation Li Junyi, Tang Tianyi, Nie Jian-yun, Wen Ji-rong, Zhao Wayne Xin
- Personalized Prompt Learning For Explainable Recommendation Li Lei, Zhang Yongfeng, Chen Li
- LAVENDER: Unifying Video-language Understanding As Masked Language Modeling Li Linjie, Gan Zhe, Lin Kevin, Lin Chung-ching, Liu Zicheng, Liu Ce, Wang Lijuan
- Modeling Coreference Relations In Visual Dialog Li Mingxiao, Moens Marie-francine
- Explanations From Large Language Models Make Small Reasoners Better Li Shiyang, Chen Jianshu, Shen Yelong, Chen Zhiyu, Zhang Xinlu, Li Zekun, Wang Hong, Qian Jing, Peng Baolin, Mao Yi, Chen Wenhu, Yan Xifeng
- Systematicity In GPT-3's Interpretation Of Novel English Noun Compounds Li Siyan, Carlson Riley, Potts Christopher
- Task-specific Pre-training And Prompt Decomposition For Knowledge Graph Population With Language Models Li Tianyi, Huang Wenyu, Papasarantopoulos Nikos, Vougiouklis Pavlos, Pan Jeff Z.
- Evade The Trap Of Mediocrity: Promoting Diversity And Novelty In Text Generation Via Concentrating Attention Li Wenhao, Yi Xiaoyuan, Hu Jinyi, Sun Maosong, Xie Xing
- Evaluating Psychological Safety Of Large Language Models Li Xingxuan, Li Yutong, Qiu Lin, Joty Shafiq, Bing Lidong
- P-transformer: Towards Better Document-to-document Neural Machine Translation Li Yachao, Li Junhui, Jiang Jing, Tao Shimin, Yang Hao, Zhang Min
- Uni-eden: Universal Encoder-decoder Network By Multi-granular Vision-language Pre-training Li Yehao, Fan Jiahao, Pan Yingwei, Yao Ting, Lin Weiyao, Mei Tao
- VALHALLA: Visual Hallucination For Machine Translation Li Yi, Panda Rameswar, Kim Yoon, Chen Chun-fu, Feris Rogerio, Cox David, Vasconcelos Nuno
- Exploring Visual Interpretability For Contrastive Language-image Pre-training Li Yi, Wang Hualiang, Duan Yiqun, Xu Hang, Li Xiaomeng
- Clinical-longformer And Clinical-bigbird: Transformers For Long Clinical Sequences Li Yikuan, Wehbe Ramsey M., Ahmad Faraz S., Wang Hanyin, Luo Yuan
- Accelerating Attention Through Gradient-based Learned Runtime Pruning Li Zheng, Ghodrati Soroush, Yazdanbakhsh Amir, Esmaeilzadeh Hadi, Kang Mingu
- Life After BERT: What Do Other Muppets Understand About Language? Lialin Vladislav, Zhao Kevin, Shivagunde Namrata, Rumshisky Anna
- Bjtu-wechat's Systems For The WMT22 Chat Translation Task Liang Yunlong, Meng Fandong, Xu Jinan, Chen Yufeng, Zhou Jie
- Using Large Language Models To Generate Engaging Captions For Data Visualizations Liew Ashley, Mueller Klaus
- ADAPT: Vision-language Navigation With Modality-aligned Action Prompts Lin Bingqian, Zhu Yi, Chen Zicong, Liang Xiwen, Liu Jianzhuang, Liang Xiaodan
- Gentus: Simulating User Behaviour And Language In Task-oriented Dialogues With Generative Transformers Lin Hsien-chin, Geishauser Christian, Feng Shutong, Lubis Nurul, Van Niekerk Carel, Heck Michael, Gašić Milica
- Contextual Transformer For Offline Meta Reinforcement Learning Lin Runji, Li Ye, Feng Xidong, Zhang Zhaowei, Fung Xian Hong Wu, Zhang Haifeng, Wang Jun, Du Yali, Yang Yaodong
- Teaching Models To Express Their Uncertainty In Words Lin Stephanie, Hilton Jacob, Evans Owain
- On Task-adaptive Pretraining For Dialogue Response Selection Lin Tzu-hsiang, Chi Ta-chung, Rumshisky Anna
- Vision-language Pre-training For Multimodal Aspect-based Sentiment Analysis Ling Yan, Yu Jianfei, Xia Rui
- Delving Deeper Into Cross-lingual Visual Question Answering Liu Chen, Pfeiffer Jonas, Korhonen Anna, Vulić Ivan, Gurevych Iryna
- Do Ever Larger Octopi Still Amplify Reporting Biases? Evidence From Judgments Of Typical Colour Liu Fangyu, Eisenschlos Julian Martin, Cole Jeremy R., Collier Nigel
- A Simple Meta-learning Paradigm For Zero-shot Intent Classification With Mixture Attention Mechanism Liu Han, Zhao Siyang, Zhang Xiaotong, Zhang Feng, Sun Junjie, Yu Hong, Zhang Xianchao
- Towards Better Few-shot And Finetuning Performance With Forgetful Causal Language Models Liu Hao, Geng Xinyang, Lee Lisa, Mordatch Igor, Levine Sergey, Narang Sharan, Abbeel Pieter
- Instruction-following Agents With Multimodal Transformer Liu Hao, Lee Lisa, Lee Kimin, Abbeel Pieter
- Building Markovian Generative Architectures Over Pretrained LM Backbones For Efficient Task-oriented Dialog Systems Liu Hong, Cai Yucheng, Ou Zhijian, Huang Yi, Feng Junlan
- A Generative User Simulator With GPT-based Architecture And Goal State Tracking For Reinforced Multi-domain Dialog Systems Liu Hong, Cai Yucheng, Ou Zhijian, Huang Yi, Feng Junlan
- Rainier: Reinforced Knowledge Introspector For Commonsense Question Answering Liu Jiacheng, Hallinan Skyler, Lu Ximing, He Pengfei, Welleck Sean, Hajishirzi Hannaneh, Choi Yejin
- Fl-tuning: Layer Tuning For Feed-forward Network In Transformer Liu Jingping, Song Yuqiu, Xue Kui, Sun Hongli, Wang Chao, Chen Lihan, Jiang Haiyun, Liang Jiaqing, Ruan Tong
- Transformer With Memory Replay Liu Rui, Mozafari Barzan
- Commitbart: A Large Pre-trained Model For Github Commits Liu Shangqing, Li Yanzhou, Xie Xiaofei, Liu Yang
- 3DALL-E: Integrating Text-to-image AI In 3D Design Workflows Liu Vivian, Vermeulen Jo, Fitzmaurice George, Matejka Justin
- PSP: Pre-trained Soft Prompts For Few-shot Abstractive Summarization Liu Xiaochen, Gao Yang, Bai Yu, Li Jiawei, Hu Yinan, Huang Heyan, Chen Boxing
- Improving Personality Consistency In Conversation By Persona Extending Liu Yifan, Wei Wei, Liu Jiayi, Mao Xianling, Fang Rui, Chen Dangyang
- Dialogconv: A Lightweight Fully Convolutional Network For Multi-view Response Selection Liu Yongkang, Feng Shi, Gao Wei, Wang Daling, Zhang Yifei
- Mulzdg: Multilingual Code-switching Framework For Zero-shot Dialogue Generation Liu Yongkang, Feng Shi, Wang Daling, Zhang Yifei
- A Win-win Deal: Towards Sparse And Robust Pre-trained Language Models Liu Yuanxin, Meng Fandong, Lin Zheng, Li Jiangnan, Fu Peng, Cao Yanan, Wang Weiping, Zhou Jie
- Can Large Language Models Reason About Medical Questions? Liévin Valentin, Hother Christoffer Egeberg, Motzfeldt Andreas Geert, Winther Ole
- Prompt Generation Networks For Input-space Adaptation Of Frozen Vision Transformers Loedeman Jochem, Stol Maarten C., Han Tengda, Asano Yuki M.
- Relaxed Attention For Transformer Models Lohrenz Timo, Möller Björn, Li Zhengyang, Fingscheidt Tim
- Training Language Models To Follow Instructions With Human Feedback Long Ouyang, Jeff Wu, Xu Jiang, Diogo Almeida, Carroll L. Wainwright, Pamela Mishkin, Chong Zhang, Sandhini Agarwal, Katarina Slama, Alex Ray, John Schulman, Jacob Hilton, Fraser Kelton, Luke Miller, Maddie Simens, Amanda Askell, Peter Welinder, Paul Christiano, Jan Leike, Ryan Lowe
- Vision-and-language Pretrained Models: A Survey Long Siqu, Cao Feiqi, Han Soyeon Caren, Yang Haiqin
- Learn To Explain: Multimodal Reasoning Via Thought Chains For Science Question Answering Lu Pan, Mishra Swaroop, Xia Tony, Qiu Liang, Chang Kai-wei, Zhu Song-chun, Tafjord Oyvind, Clark Peter, Kalyan Ashwin
- Reacc: A Retrieval-augmented Code Completion Framework Lu Shuai, Duan Nan, Han Hojae, Guo Daya, Hwang Seung-won, Svyatkovskiy Alexey
- Choose Your QA Model Wisely: A Systematic Study Of Generative And Extractive Readers For Question Answering Luo Man, Hashimoto Kazuma, Yavuz Semih, Liu Zhiwei, Baral Chitta, Zhou Yingbo
- Biogpt: Generative Pre-trained Transformer For Biomedical Text Generation And Mining Luo Renqian, Sun Liai, Xia Yingce, Qin Tao, Zhang Sheng, Poon Hoifung, Liu Tie-yan
- I-tuning: Tuning Frozen Language Models With Image For Lightweight Image Captioning Luo Ziyang, Hu Zhipeng, Xi Yadong, Zhang Rongsheng, Ma Jing
- A Frustratingly Simple Approach For End-to-end Image Captioning Luo Ziyang, Xi Yadong, Zhang Rongsheng, Ma Jing
- Torchscale: Transformers At Scale Ma Shuming, Wang Hongyu, Huang Shaohan, Wang Wenhui, Chi Zewen, Dong Li, Benhaim Alon, Patra Barun, Chaudhary Vishrav, Song Xia, Wei Furu
- Unveiling Code Pre-trained Models: Investigating Syntax And Semantics Capacities Ma Wei, Liu Shangqing, Zhao Mengjie, Xie Xiaofei, Wang Wenhan, Hu Qiang, Zhang Jie, Liu Yang
- Memory-assisted Prompt Editing To Improve GPT-3 After Deployment Madaan Aman, Tandon Niket, Clark Peter, Yang Yiming
- Structured Like A Language Model: Analysing AI As An Automated Subject Magee Liam, Arora Vanicka, Munn Luke
- DIALOG-22 Ruatd Generated Text Detection Maloyan Narek, Nutfullin Bulat, Ilyushin Eugene
- Tangobert: Reducing Inference Cost By Using Cascaded Architecture Mamou Jonathan, Pereg Oren, Wasserblat Moshe, Schwartz Roy
- Fine-tuning Pre-trained Transformers Into Decaying Fast Weights Mao Huanru Henry
- Quick Starting Dialog Systems With Paraphrase Generation Marceau Louis, Belbahar Raouf, Queudot Marc, Naji Nada, Charton Eric, Meurs Marie-jean
- Mini-model Adaptation: Efficiently Extending Pretrained Models To New Languages Via Aligned Shallow Training Marchisio Kelly, Lewis Patrick, Chen Yihong, Artetxe Mikel
- GAAMA 2.0: An Integrated System That Answers Boolean And Extractive Questions Mccarley Scott, Bornea Mihaela, Rosenthal Sara, Ferritto Anthony, Sultan Md Arafat, Sil Avirup, Florian Radu
- Chatbots In A Botnet World Mckee Forrest, Noever David
- LAD: Language Models As Data For Zero-shot Dialog Mehri Shikib, Altun Yasemin, Eskenazi Maxine
- DSI++: Updating Transformer Memory With New Documents Mehta Sanket Vaibhav, Gupta Jai, Tay Yi, Dehghani Mostafa, Tran Vinh Q., Rao Jinfeng, Najork Marc, Strubell Emma, Metzler Donald
- Mass-editing Memory In A Transformer Meng Kevin, Sharma Arnab Sen, Andonian Alex, Belinkov Yonatan, Bau David
- Generating Training Data With Language Models: Towards Zero-shot Language Understanding Meng Yu, Huang Jiaxin, Zhang Yu, Han Jiawei
- The Devil Is In The Details: On Models And Training Regimes For Few-shot Intent Classification Mesgar Mohsen, Tran Thy Thy, Glavaš Goran, Gurevych Iryna
- Rarely A Problem? Language Models Exhibit Inverse Scaling In Their Predictions Following Few-type Quantifiers Michaelov James A., Bergen Benjamin K.
- Rethinking The Role Of Demonstrations: What Makes In-context Learning Work? Min Sewon, Lyu Xinxi, Holtzman Ari, Artetxe Mikel, Lewis Mike, Hajishirzi Hannaneh, Zettlemoyer Luke
- Who Is GPT-3? An Exploration Of Personality, Values And Demographics Miotto Marilù, Rossberg Nicola, Kleinberg Bennett
- HELP ME THINK: A Simple Prompting Strategy For Non-experts To Create Customized Content With Models Mishra Swaroop, Nouri Elnaz
- Minicons: Enabling Flexible Behavioral And Representational Analyses Of Transformer Language Models Misra Kanishka
- Domain-specific Text Generation For Machine Translation Moslem Yasmin, Haque Rejwanul, Kelleher John D., Way Andy
- SGPT: GPT Sentence Embeddings For Semantic Search Muennighoff Niklas
- JASMINE: Arabic GPT Models For Few-shot Learning Nagoudi El Moatez Billah, Abdul-mageed Muhammad, Elmadany Abdelrahim, Inciarte Alcides Alcoba, Khondaker Md Tawkat Islam
- Large Language Models Are Reasoning Teachers Namgyu Ho, Laura Schmid, Se-young Yun
- Task Formulation Matters When Learning Continually: A Case Study In Visual Question Answering Nikandrou Mavina, Yu Lu, Suglia Alessandro, Konstas Ioannis, Rieser Verena
- Budgetlongformer: Can We Cheaply Pretrain A Sota Legal Language Model From Scratch? Niklaus Joel, Giofré Daniele
- Do Vision-and-language Transformers Learn Grounded Predicate-noun Dependencies? Nikolaus Mitja, Salin Emmanuelle, Ayache Stephane, Fourtassi Abdellah, Favre Benoit
- The Turing Deception Noever David, Ciolino Matt
- Improving The Cross-lingual Generalisation In Visual Question Answering Nooralahzadeh Farhad, Sennrich Rico
- Minialbert: Model Distillation Via Parameter-efficient Recursive Transformers Nouriborji Mohammadmahdi, Rohanian Omid, Kouchaki Samaneh, Clifton David A.
- CUE Vectors: Modular Training Of Language Models Conditioned On Diverse Contextual Signals Novotney Scott, Mukherjee Sreeparna, Ahmed Zeeshan, Stolcke Andreas
- Measuring And Narrowing The Compositionality Gap In Language Models Ofir Press, Muru Zhang, Sewon Min, Ludwig Schmidt, Noah A. Smith, Mike Lewis
- Neural Language Taskonomy: Which NLP Tasks Are The Most Predictive Of Fmri Brain Activity? Oota Subba Reddy, Arora Jashn, Agarwal Veeral, Marreddy Mounika, Gupta Manish, Surampudi Bapi Raju
- Thinksum: Probabilistic Reasoning Over Sets Using Large Language Models Ozturkler Batu, Malkin Nikolay, Wang Zhen, Jojic Nebojsa
- Parameter-efficient Abstractive Question Answering Over Tables Or Text Pal Vaishali, Kanoulas Evangelos, De Rijke Maarten
- Follow-up Attention: An Empirical Study Of Developer And Neural Model Code Exploration Paltenghi Matteo, Pandita Rahul, Henley Austin Z., Ziegler Albert
- Dynamic Prompt Learning Via Policy Gradient For Semi-structured Mathematical Reasoning Pan Lu, Liang Qiu, Kai-wei Chang, Ying Nian Wu, Song-chun Zhu, Tanmay Rajpurohit, Peter Clark, Ashwin Kalyan
- Extremebert: A Toolkit For Accelerating Pretraining Of Customized BERT Pan Rui, Diao Shizhe, Chen Jianlin, Zhang Tong
- Silo Nlp's Participation At WAT2022 Parida Shantipriya, Panda Subhadarshi, Grönroos Stig-arne, Granroth-wilding Mark, Koistinen Mika
- TALM: Tool Augmented Language Models Parisi Aaron, Zhao Yao, Fiedel Noah
- LUT-GEMM: Quantized Matrix Multiplication Based On Luts For Efficient Inference In Large-scale Generative Language Models Park Gunho, Park Baeseong, Kim Minsub, Lee Sungjae, Kim Jeonghoon, Kwon Beomseok, Kwon Se Jung, Kim Byeongwook, Lee Youngjoo, Lee Dongsoo
- Relation-aware Language-graph Transformer For Question Answering Park Jinyoung, Choi Hyeong Kyu, Ko Juyeon, Park Hyeonjin, Kim Ji-hoon, Jeong Jisu, Kim Kyungmin, Kim Hyunwoo J.
- Bidirectional Language Models Are Also Few-shot Learners Patel Ajay, Li Bryan, Rasooli Mohammad Sadegh, Constant Noah, Raffel Colin, Callison-burch Chris
- Beyond English-centric Bitexts For Better Multilingual Language Representation Learning Patra Barun, Singhal Saksham, Huang Shaohan, Chi Zewen, Dong Li, Wei Furu, Chaudhary Vishrav, Song Xia
- Xfboost: Improving Text Generation With Controllable Decoders Peng Xiangyu, Sollami Michael
- Sparse And Dense Approaches For The Full-rank Retrieval Of Responses For Dialogues Penha Gustavo, Hauff Claudia
- Visconde: Multi-document QA With GPT-3 And Neural Reranking Pereira Jayr, Fidalgo Robson, Lotufo Roberto, Nogueira Rodrigo
- Ignore Previous Prompt: Attack Techniques For Language Models Perez Fábio, Ribeiro Ian
- Lifting The Curse Of Multilinguality By Pre-training Modular Transformers Pfeiffer Jonas, Goyal Naman, Lin Xi Victoria, Li Xian, Cross James, Riedel Sebastian, Artetxe Mikel
- Vit5: Pretrained Text-to-text Transformer For Vietnamese Language Generation Phan Long, Tran Hieu, Nguyen Hieu, Trinh Trieu H.
- Investigating Efficiently Extending Transformers For Long Input Summarization Phang Jason, Zhao Yao, Liu Peter J.
- Pre-training Image-language Transformers For Open-vocabulary Tasks Piergiovanni Aj, Kuo Weicheng, Angelova Anelia
- Answer-me: Multi-task Open-vocabulary Visual Question Answering Piergiovanni Aj, Li Wei, Kuo Weicheng, Saffar Mohammad, Bertsch Fred, Angelova Anelia
- Harnessing The Power Of Multi-task Pretraining For Ground-truth Level Natural Language Explanations Plüster Björn, Ambsdorf Jakob, Braach Lukas, Lee Jae Hee, Wermter Stefan
- Synchromesh: Reliable Code Generation From Pre-trained Language Models Poesia Gabriel, Polozov Oleksandr, Le Vu, Tiwari Ashish, Soares Gustavo, Meek Christopher, Gulwani Sumit
- Grips: Gradient-free, Edit-based Instruction Search For Prompting Large Language Models Prasad Archiki, Hase Peter, Zhou Xiang, Bansal Mohit
- Psychologically-informed Chain-of-thought Prompts For Metaphor Understanding In Large Language Models Prystawski Ben, Thibodeau Paul, Potts Christopher, Goodman Noah D.
- Outliers Dimensions That Disrupt Transformers Are Driven By Frequency Puccetti Giovanni, Rogers Anna, Drozd Aleksandr, Dell'orletta Felice
- Tegformer: Topic-to-essay Generation With Good Topic Coverage And High Text Coherence Qi Wang, Liu Rui, Zuo Yuan, Chen Yong, Zhang Dell
- Diffusion Glancing Transformer For Parallel Sequence To Sequence Learning Qian Lihua, Wang Mingxuan, Liu Yang, Zhou Hao
- The NLP Task Effectiveness Of Long-range Transformers Qin Guanghui, Feng Yukun, Van Durme Benjamin
- ELLE: Efficient Lifelong Pre-training For Emerging Data Qin Yujia, Zhang Jiajie, Lin Yankai, Liu Zhiyuan, Li Peng, Sun Maosong, Zhou Jie
- Memory In Humans And Deep Language Models: Linking Hypotheses For Model Augmentation Raccah Omri, Chen Phoebe, Willke Ted L., Poeppel David, Vo Vy A.
- Parallel Context Windows For Large Language Models Ratner Nir, Levine Yoav, Belinkov Yonatan, Ram Ori, Magar Inbal, Abend Omri, Karpas Ehud, Shashua Amnon, Leyton-brown Kevin, Shoham Yoav
- VLC-BERT: Visual Question Answering With Contextualized Commonsense Knowledge Ravi Sahithya, Chinchure Aditya, Sigal Leonid, Liao Renjie, Shwartz Vered
- Entity-conditioned Question Generation For Robust Attention Distribution In Neural Information Retrieval Reddy Revanth Gangi, Sultan Md Arafat, Franz Martin, Sil Avirup, Ji Heng
- Cometkiwi: Ist-unbabel 2022 Submission For The Quality Estimation Shared Task Rei Ricardo, Treviso Marcos, Guerreiro Nuno M., Zerva Chrysoula, Farinha Ana C., Maroti Christine, De Souza José G. C., Glushkova Taisiya, Alves Duarte M., Lavie Alon, Coheur Luisa, Martins André F. T.
- An Understanding-oriented Robust Machine Reading Comprehension Model Ren Feiliang, Liu Yongkang, Li Bochao, Liu Shilei, Wang Bingchao, Wang Jiaqi, Liu Chunchao, Ma Qi
- Exploring Extreme Parameter Compression For Pre-trained Language Models Ren Yuxin, Wang Benyou, Shang Lifeng, Jiang Xin, Liu Qun
- Super-prompting: Utilizing Model-independent Contextual Data To Reduce Data Annotation Required In Visual Commonsense Tasks Rezaei Navid, Reformat Marek Z.
- Breakpoint Transformers For Modeling And Tracking Intermediate Beliefs Richardson Kyle, Tamari Ronen, Sultan Oren, Tsarfaty Reut, Shahaf Dafna, Sabharwal Ashish
- Leveraging Large Language Models For Multiple Choice Question Answering Robinson Joshua, Rytting Christopher Michael, Wingate David
- Learning To Diversify For Product Question Generation Roitman Haggai, Singer Uriel, Eshel Yotam, Nus Alexander, Kiperwasser Eliyahu
- Lamda: Language Models For Dialog Applications Romal Thoppilan, Daniel De Freitas, Jamie Hall, Noam Shazeer, Apoorv Kulshreshtha, Heng-tze Cheng, Alicia Jin, Taylor Bos, Leslie Baker, Yu Du, Yaguang Li, Hongrae Lee, Huaixiu Steven Zheng, Amin Ghafouri, Marcelo Menegali, Yanping Huang, Maxim Krikun, Dmitry Lepikhin, James Qin, Dehao Chen, Yuanzhong Xu, Zhifeng Chen, Adam Roberts, Maarten Bosma, Vincent Zhao, Yanqi Zhou, Chung-ching Chang, Igor Krivokon, Will Rusch, Marc Pickett, Pranesh Srinivasan, Laichee Man, Kathleen Meier-hellstern, Meredith Ringel Morris, Tulsee Doshi, Renelito Delos Santos, Toju Duke, Johnny Soraker, Ben Zevenbergen, Vinodkumar Prabhakaran, Mark Diaz, Ben Hutchinson, Kristen Olson, Alejandra Molina, Erin Hoffman-john, Josh Lee, Lora Aroyo, Ravi Rajakumar, Alena Butryna, Matthew Lamm, Viktoriya Kuzmina, Joe Fenton, Aaron Cohen, Rachel Bernstein, Ray Kurzweil, Blaise Aguera-arcas, Claire Cui, Marian Croak, Ed Chi, Quoc Le
- Dialokg: Knowledge-structure Aware Task-oriented Dialogue Generation Rony Md Rashad Al Hasan, Usbeck Ricardo, Lehmann Jens
- Billions Of Parameters Are Worth More Than In-domain Training Data: A Case Study In The Legal Case Entailment Task Rosa Guilherme Moraes, Bonifacio Luiz, Jeronymo Vitor, Abonizio Hugo, Lotufo Roberto, Nogueira Rodrigo
- Galactica: A Large Language Model For Science Ross Taylor, Marcin Kardas, Guillem Cucurull, Thomas Scialom, Anthony Hartshorn, Elvis Saravia, Andrew Poulton, Viktor Kerkez, Robert Stojnic
- Benchclamp: A Benchmark For Evaluating Language Models On Syntactic And Semantic Parsing Roy Subhro, Thomson Sam, Chen Tongfei, Shin Richard, Pauls Adam, Eisner Jason, Van Durme Benjamin
- Data Augmentation For Intent Classification With Off-the-shelf Large Language Models Sahu Gaurav, Rodriguez Pau, Laradji Issam H., Atighehchian Parmida, Vazquez David, Bahdanau Dzmitry
- Alexatm 20B: Few-shot Learning Using A Large-scale Multilingual Seq2seq Model Saleh Soltan, Shankar Ananthakrishnan, Jack Fitzgerald, Rahul Gupta, Wael Hamza, Haidar Khan, Charith Peris, Stephen Rawls, Andy Rosenbaum, Anna Rumshisky, Chandana Satya Prakash, Mukund Sridhar, Fabian Triefenbach, Apurv Verma, Gokhan Tur, Prem Natarajan
- Self-repetition In Abstractive Neural Summarizers Salkar Nikita, Trikalinos Thomas, Wallace Byron C., Nenkova Ani
- NLX-GPT: A Model For Natural Language Explanations In Vision And Vision-language Tasks Sammani Fawaz, Mukherjee Tanmoy, Deligiannis Nikos
- Language Models Are Greedy Reasoners: A Systematic Formal Analysis Of Chain-of-thought Saparov Abulhair, He He
- IT5: Text-to-text Pretraining For Italian Language Understanding And Generation Sarti Gabriele, Nissim Malvina
- Transformer Grammars: Augmenting Transformer Language Models With Syntactic Inductive Biases At Scale Sartran Laurent, Barrett Samuel, Kuncoro Adhiguna, Stanojević Miloš, Blunsom Phil, Dyer Chris
- What Language Model To Train If You Have One Million GPU Hours? Scao Teven Le, Wang Thomas, Hesslow Daniel, Saulnier Lucile, Bekman Stas, Bari M Saiful, Biderman Stella, Elsahar Hady, Muennighoff Niklas, Phang Jason, Press Ofir, Raffel Colin, Sanh Victor, Shen Sheng, Sutawika Lintang, Tae Jaesung, Yong Zheng Xin, Launay Julien, Beltagy Iz
- Training Language Models With Language Feedback Scheurer Jérémy, Campos Jon Ander, Chan Jun Shern, Chen Angelica, Cho Kyunghyun, Perez Ethan
- Non-autoregressive Neural Machine Translation: A Call For Clarity Schmidt Robin M., Pires Telmo, Peitz Stephan, Lööf Jonas
- Confident Adaptive Language Modeling Schuster Tal, Fisch Adam, Gupta Jai, Dehghani Mostafa, Bahri Dara, Tran Vinh Q., Tay Yi, Metzler Donald
- Large Language Models And The Reverse Turing Test Sejnowski Terrence
- On The Effect Of Pretraining Corpora On In-context Learning By A Large-scale Language Model Seongjin Shin, Sang-woo Lee, Hwijeen Ahn, Sungdong Kim, Hyoungseok Kim, Boseop Kim, Kyunghyun Cho, Gichang Lee, Woomyoung Park, Jung-woo Ha, Nako Sung
- Bird-eye Transformers For Text Generation Models Sha Lei, Song Yuhang, Yordanov Yordan, Salvatori Tommaso, Lukasiewicz Thomas
- Using Deepspeed And Megatron To Train Megatron-turing NLG 530B, A Large-scale Generative Language Model Shaden Smith, Mostofa Patwary, Brandon Norick, Patrick Legresley, Samyam Rajbhandari, Jared Casper, Zhun Liu, Shrimai Prabhumoye, George Zerveas, Vijay Korthikanti, Elton Zhang, Rewon Child, Reza Yazdani Aminabadi, Julie Bernauer, Xia Song, Mohammad Shoeybi, Yuxiong He, Michael Houston, Saurabh Tiwary, Bryan Catanzaro
- Overcoming Catastrophic Forgetting Beyond Continual Learning: Balanced Training For Neural Machine Translation Shao Chenze, Feng Yang
- Deanthropomorphising NLP: Can A Language Model Be Conscious? Shardlow Matthew, Przybyła Piotr
- Staged Training For Transformer Language Models Shen Sheng, Walsh Pete, Keutzer Kurt, Dodge Jesse, Peters Matthew, Beltagy Iz
- Incorporating Domain Knowledge Through Task Augmentation For Front-end Javascript Code Generation Shen Sijie, Zhu Xiang, Dong Yihong, Guo Qizhi, Zhen Yankun, Li Ge
- Unintended Bias In Language Model-driven Conversational Recommendation Shen Tianshu, Li Jiaru, Bouadjenek Mohamed Reda, Mai Zheda, Sanner Scott
- XRICL: Cross-lingual Retrieval-augmented In-context Learning For Cross-lingual Text-to-sql Semantic Parsing Shi Peng, Zhang Rui, Bai He, Lin Jimmy
- Mgpt: Few-shot Learners Go Multilingual Shliazhko Oleh, Fenogenova Alena, Tikhonova Maria, Mikhailov Vladislav, Kozlova Anastasia, Shavrina Tatiana
- Distilling Reasoning Capabilities Into Smaller Language Models Shridhar Kumar, Stolfo Alessandro, Sachan Mrinmaya
- Language Models That Seek For Knowledge: Modular Search & Generation For Dialogue And Prompt Completion Shuster Kurt, Komeili Mojtaba, Adolphs Leonard, Roller Stephen, Szlam Arthur, Weston Jason
- Gpt-neox-20b: An Open-source Autoregressive Language Model Sid Black, Stella Biderman, Eric Hallahan, Quentin Anthony, Leo Gao, Laurence Golding, Horace He, Connor Leahy, Kyle Mcdonell, Jason Phang, Michael Pieler, Usvsn Sai Prashanth, Shivanshu Purohit, Laria Reynolds, Jonathan Tow, Ben Wang, Samuel Weinbach
- On The Efficacy Of Co-attention Transformer Layers In Visual Question Answering Sikarwar Ankur, Kreiman Gabriel
- Ask Me Anything: A Simple Strategy For Prompting Language Models Simran Arora, Avanika Narayan, Mayee F. Chen, Laurel Orr, Neel Guha, Kush Bhatia, Ines Chami, Frederic Sala, Christopher Ré
- Augmenting Interpretable Models With Llms During Training Singh Chandan, Askari Armin, Caruana Rich, Gao Jianfeng
- Explaining Patterns In Data With Language Models Via Interpretable Autoprompting Singh Chandan, Morris John X., Aneja Jyoti, Rush Alexander M., Gao Jianfeng
- Improving The Domain Adaptation Of Retrieval Augmented Generation (RAG) Models For Open Domain Question Answering Siriwardhana Shamane, Weerasekera Rivindu, Wen Elliott, Kaluarachchi Tharindu, Rana Rajib, Nanayakkara Suranga
- Human Language Modeling Soni Nikita, Matero Matthew, Balasubramanian Niranjan, Schwartz H. Andrew
- Climb: A Continual Learning Benchmark For Vision-and-language Tasks Srinivasan Tejas, Chang Ting-yun, Alva Leticia Leonor Pinto, Chochlakis Georgios, Rostami Mohammad, Thomason Jesse
- Beyond The Imitation Game: Quantifying And Extrapolating The Capabilities Of Language Models Srivastava Aarohi, Rastogi Abhinav, Rao Abhishek, Shoeb Abu Awal Md, Abid Abubakar, Fisch Adam, Brown Adam R., Santoro Adam, Gupta Aditya, Garriga-alonso Adrià, Kluska Agnieszka, Lewkowycz Aitor, Agarwal Akshat, Power Alethea, Ray Alex, Warstadt Alex, Kocurek Alexander W., Safaya Ali, Tazarv Ali, Xiang Alice, Parrish Alicia, Nie Allen, Hussain Aman, Askell Amanda, Dsouza Amanda, Slone Ambrose, Rahane Ameet, Iyer Anantharaman S., Andreassen Anders, Madotto Andrea, Santilli Andrea, Stuhlmüller Andreas, Dai Andrew, La Andrew, Lampinen Andrew, Zou Andy, Jiang Angela, Chen Angelica, Vuong Anh, Gupta Animesh, Gottardi Anna, Norelli Antonio, Venkatesh Anu, Gholamidavoodi Arash, Tabassum Arfa, Menezes Arul, Kirubarajan Arun, Mullokandov Asher, Sabharwal Ashish, Herrick Austin, Efrat Avia, Erdem Aykut, Karakaş Ayla, Roberts B. Ryan, Loe Bao Sheng, Zoph Barret, Bojanowski Bartłomiej, Özyurt Batuhan, Hedayatnia Behnam, Neyshabur Behnam, Inden Benjamin, Stein Benno, Ekmekci Berk, Lin Bill Yuchen, Howald Blake, Orinion Bryan, Diao Cameron, Dour Cameron, Stinson Catherine, Argueta Cedrick, Ramírez César Ferri, Singh Chandan, Rathkopf Charles, Meng Chenlin, Baral Chitta, Wu Chiyu, Callison-burch Chris, Waites Chris, Voigt Christian, Manning Christopher D., Potts Christopher, Ramirez Cindy, Rivera Clara E., Siro Clemencia, Raffel Colin, Ashcraft Courtney, Garbacea Cristina, Sileo Damien, Garrette Dan, Hendrycks Dan, Kilman Dan, Roth Dan, Freeman Daniel, Khashabi Daniel, Levy Daniel, González Daniel Moseguí, Perszyk Danielle, Hernandez Danny, Chen Danqi, Ippolito Daphne, Gilboa Dar, Dohan David, Drakard David, Jurgens David, Datta Debajyoti, Ganguli Deep, Emelin Denis, Kleyko Denis, Yuret Deniz, Chen Derek, Tam Derek, Hupkes Dieuwke, Misra Diganta, Buzan Dilyar, Mollo Dimitri Coelho, Yang Diyi, Lee Dong-ho, Schrader Dylan, Shutova Ekaterina, Cubuk Ekin Dogus, Segal Elad, Hagerman Eleanor, Barnes Elizabeth, Donoway Elizabeth, Pavlick Ellie, Rodola Emanuele, Lam Emma, Chu Eric, Tang Eric, Erdem Erkut, Chang Ernie, Chi Ethan A., Dyer Ethan, Jerzak Ethan, Kim Ethan, Manyasi Eunice Engefu, Zheltonozhskii Evgenii, Xia Fanyue, Siar Fatemeh, Martínez-plumed Fernando, Happé Francesca, Chollet Francois, Rong Frieda, Mishra Gaurav, Winata Genta Indra, De Melo Gerard, Kruszewski Germán, Parascandolo Giambattista, Mariani Giorgio, Wang Gloria, Jaimovitch-lópez Gonzalo, Betz Gregor, Gur-ari Guy, Galijasevic Hana, Kim Hannah, Rashkin Hannah, Hajishirzi Hannaneh, Mehta Harsh, Bogar Hayden, Shevlin Henry, Schütze Hinrich, Yakura Hiromu, Zhang Hongming, Wong Hugh Mee, Ng Ian, Noble Isaac, Jumelet Jaap, Geissinger Jack, Kernion Jackson, Hilton Jacob, Lee Jaehoon, Fisac Jaime Fernández, Simon James B., Koppel James, Zheng James, Zou James, Kocoń Jan, Thompson Jana, Wingfield Janelle, Kaplan Jared, Radom Jarema, Sohl-dickstein Jascha, Phang Jason, Wei Jason, Yosinski Jason, Novikova Jekaterina, Bosscher Jelle, Marsh Jennifer, Kim Jeremy, Taal Jeroen, Engel Jesse, Alabi Jesujoba, Xu Jiacheng, Song Jiaming, Tang Jillian, Waweru Joan, Burden John, Miller John, Balis John U., Batchelder Jonathan, Berant Jonathan, Frohberg Jörg, Rozen Jos, Hernandez-orallo Jose, Boudeman Joseph, Guerr Joseph, Jones Joseph, Tenenbaum Joshua B., Rule Joshua S., Chua Joyce, Kanclerz Kamil, Livescu Karen, Krauth Karl, Gopalakrishnan Karthik, Ignatyeva Katerina, Markert Katja, Dhole Kaustubh D., Gimpel Kevin, Omondi Kevin, Mathewson Kory, Chiafullo Kristen, Shkaruta Ksenia, Shridhar Kumar, Mcdonell Kyle, Richardson Kyle, Reynolds Laria, Gao Leo, Zhang Li, Dugan Liam, Qin Lianhui, Contreras-ochando Lidia, Morency Louis-philippe, Moschella Luca, Lam Lucas, Noble Lucy, Schmidt Ludwig, He Luheng, Colón Luis Oliveros, Metz Luke, Şenel Lütfi Kerem, Bosma Maarten, Sap Maarten, Ter Hoeve Maartje, Farooqi Maheen, Faruqui Manaal, Mazeika Mantas, Baturan Marco, Marelli Marco, Maru Marco, Quintana Maria Jose Ramírez, Tolkiehn Marie, Giulianelli Mario, Lewis Martha, Potthast Martin, Leavitt Matthew L., Hagen Matthias, Schubert Mátyás, Baitemirova Medina Orduna, Arnaud Melody, Mcelrath Melvin, Yee Michael A., Cohen Michael, Gu Michael, Ivanitskiy Michael, Starritt Michael, Strube Michael, Swędrowski Michał, Bevilacqua Michele, Yasunaga Michihiro, Kale Mihir, Cain Mike, Xu Mimee, Suzgun Mirac, Walker Mitch, Tiwari Mo, Bansal Mohit, Aminnaseri Moin, Geva Mor, Gheini Mozhdeh, T Mukund Varma, Peng Nanyun, Chi Nathan A., Lee Nayeon, Krakover Neta Gur-ari, Cameron Nicholas, Roberts Nicholas, Doiron Nick, Martinez Nicole, Nangia Nikita, Deckers Niklas, Muennighoff Niklas, Keskar Nitish Shirish, Iyer Niveditha S., Constant Noah, Fiedel Noah, Wen Nuan, Zhang Oliver, Agha Omar, Elbaghdadi Omar, Levy Omer, Evans Owain, Casares Pablo Antonio Moreno, Doshi Parth, Fung Pascale, Liang Paul Pu, Vicol Paul, Alipoormolabashi Pegah, Liao Peiyuan, Liang Percy, Chang Peter, Eckersley Peter, Htut Phu Mon, Hwang Pinyu, Miłkowski Piotr, Patil Piyush, Pezeshkpour Pouya, Oli Priti, Mei Qiaozhu, Lyu Qing, Chen Qinlang, Banjade Rabin, Rudolph Rachel Etta, Gabriel Raefer, Habacker Rahel, Risco Ramon, Millière Raphaël, Garg Rhythm, Barnes Richard, Saurous Rif A., Arakawa Riku, Raymaekers Robbe, Frank Robert, Sikand Rohan, Novak Roman, Sitelew Roman, Lebras Ronan, Liu Rosanne, Jacobs Rowan, Zhang Rui, Salakhutdinov Ruslan, Chi Ryan, Lee Ryan, Stovall Ryan, Teehan Ryan, Yang Rylan, Singh Sahib, Mohammad Saif M., Anand Sajant, Dillavou Sam, Shleifer Sam, Wiseman Sam, Gruetter Samuel, Bowman Samuel R., Schoenholz Samuel S., Han Sanghyun, Kwatra Sanjeev, Rous Sarah A., Ghazarian Sarik, Ghosh Sayan, Casey Sean, Bischoff Sebastian, Gehrmann Sebastian, Schuster Sebastian, Sadeghi Sepideh, Hamdan Shadi, Zhou Sharon, Srivastava Shashank, Shi Sherry, Singh Shikhar, Asaadi Shima, Gu Shixiang Shane, Pachchigar Shubh, Toshniwal Shubham, Upadhyay Shyam, Debnath Shyamolima (Shammie), Shakeri Siamak, Thormeyer Simon, Melzi Simone, Reddy Siva, Makini Sneha Priscilla, Lee Soo-hwan, Torene Spencer, Hatwar Sriharsha, Dehaene Stanislas, Divic Stefan, Ermon Stefano, Biderman Stella, Lin Stephanie, Prasad Stephen, Piantadosi Steven T., Shieber Stuart M., Misherghi Summer, Kiritchenko Svetlana, Mishra Swaroop, Linzen Tal, Schuster Tal, Li Tao, Yu Tao, Ali Tariq, Hashimoto Tatsu, Wu Te-lin, Desbordes Théo, Rothschild Theodore, Phan Thomas, Wang Tianle, Nkinyili Tiberius, Schick Timo, Kornev Timofei, Tunduny Titus, Gerstenberg Tobias, Chang Trenton, Neeraj Trishala, Khot Tushar, Shultz Tyler, Shaham Uri, Misra Vedant, Demberg Vera, Nyamai Victoria, Raunak Vikas, Ramasesh Vinay, Prabhu Vinay Uday, Padmakumar Vishakh, Srikumar Vivek, Fedus William, Saunders William, Zhang William, Vossen Wout, Ren Xiang, Tong Xiaoyu, Zhao Xinran, Wu Xinyi, Shen Xudong, Yaghoobzadeh Yadollah, Lakretz Yair, Song Yangqiu, Bahri Yasaman, Choi Yejin, Yang Yichi, Hao Yiding, Chen Yifu, Belinkov Yonatan, Hou Yu, Hou Yufang, Bai Yuntao, Seid Zachary, Zhao Zhuoye, Wang Zijian, Wang Zijie J., Wang Zirui, Wu Ziyi
- Towards Zero-shot And Few-shot Table Question Answering Using GPT-3 Srivastava Pragya, Ganu Tanuja, Guha Saikat
- Conciseness: An Overlooked Language Task Stahlberg Felix, Kumar Aashish, Alberti Chris, Kumar Shankar
- Putting Gpt-3's Creativity To The (alternative Uses) Test Stevenson Claire, Smal Iris, Baas Matthijs, Grasman Raoul, Van Der Maas Han
- QA4QG: Using Question Answering To Constrain Multi-hop Question Generation Su Dan, Xu Peng, Fung Pascale
- Towards Diverse, Relevant And Coherent Open-domain Dialogue Generation Via Hybrid Latent Variables Sun Bin, Li Yitong, Mi Fei, Wang Weichao, Li Yiwei, Li Kan
- Investigating Explainability Of Generative AI For Code Through Scenario-based Design Sun Jiao, Liao Q. Vera, Muller Michael, Agarwal Mayank, Houde Stephanie, Talamadupula Kartik, Weisz Justin D.
- Tokenization Consistency Matters For Generative Models On Extractive NLP Tasks Sun Kaiser, Qi Peng, Zhang Yuhao, Liu Lan, Wang William Yang, Huang Zhiheng
- Black-box Tuning For Language-model-as-a-service Sun Tianxiang, Shao Yunfan, Qian Hong, Huang Xuanjing, Qiu Xipeng
- Dynamar: Dynamic Prompt With Mask Token Representation Sun Xiaodi, Rajagopalan Sunny, Nigam Priyanka, Lu Weiyi, Xu Yi, Zeng Belinda, Chilimbi Trishul
- Long-form Video-language Pre-training With Multimodal Temporal Contrastive Learning Sun Yuchong, Xue Hongwei, Song Ruihua, Liu Bei, Yang Huan, Fu Jianlong
- A Length-extrapolatable Transformer Sun Yutao, Dong Li, Patra Barun, Ma Shuming, Huang Shaohan, Benhaim Alon, Chaudhary Vishrav, Song Xia, Wei Furu
- Pseudo-ood Training For Robust Language Models Sundararaman Dhanasekar, Mehta Nikhil, Carin Lawrence
- OPT: Open Pre-trained Transformer Language Models Susan Zhang, Stephen Roller, Naman Goyal, Mikel Artetxe, Moya Chen, Shuohui Chen, Christopher Dewan, Mona Diab, Xian Li, Xi Victoria Lin, Todor Mihaylov, Myle Ott, Sam Shleifer, Kurt Shuster, Daniel Simig, Punit Singh Koura, Anjali Sridhar, Tianlu Wang, Luke Zettlemoyer
- Chatgpt: The End Of Online Exam Integrity? Susnjak Teo
- The AI Teacher Test: Measuring The Pedagogical Ability Of Blender And GPT-3 In Educational Dialogues Tack Anaïs, Piech Chris
- On The Effect Of Pre-training For Transformer In Different Modality On Offline Reinforcement Learning Takagi Shiro
- Large Language Models Are Zero-shot Reasoners Takeshi Kojima, Shixiang Shane Gu, Machel Reid, Yutaka Matsuo, Yusuke Iwasawa
- Fewer Errors, But More Stereotypes? The Effect Of Model Size On Gender Bias Tal Yarden, Magar Inbal, Schwartz Roy
- Commonsenseqa 2.0: Exposing The Limits Of AI Through Gamification Talmor Alon, Yoran Ori, Bras Ronan Le, Bhagavatula Chandra, Goldberg Yoav, Choi Yejin, Berant Jonathan
- ACORT: A Compact Object Relation Transformer For Parameter Efficient Image Captioning Tan Jia Huei, Tan Ying Hua, Chan Chee Seng, Chuah Joon Huang
- Etrica: Event-triggered Context-aware Story Generation Augmented By Cross Attention Tang Chen, Lin Chenghua, Huang Henglin, Guerin Frank, Zhang Zhihao
- News Summarization And Evaluation In The Era Of GPT-3 Tanya Goyal, Junyi Jessy Li, Greg Durrett
- Compression Of Generative Pre-trained Language Models Via Quantization Tao Chaofan, Hou Lu, Zhang Wei, Shang Lifeng, Jiang Xin, Liu Qun, Luo Ping, Wong Ngai
- Towards Fine-tuning Pre-trained Language Models With Integer Forward And Backward Propagation Tayaranian Mohammadreza, Ghaffari Alireza, Tahaei Marzieh S., Rezagholizadeh Mehdi, Asgharian Masoud, Nia Vahid Partovi
- Predicting Issue Types With Sebert Trautsch Alexander, Herbold Steffen
- Interleaving Retrieval With Chain-of-thought Reasoning For Knowledge-intensive Multi-step Questions Trivedi Harsh, Balasubramanian Niranjan, Khot Tushar, Sabharwal Ashish
- Do Large Language Models Know What Humans Know? Trott Sean, Jones Cameron, Chang Tyler, Michaelov James, Bergen Benjamin
- Adavae: Exploring Adaptive Gpt-2s In Variational Auto-encoders For Language Modeling Tu Haoqin, Yang Zhongliang, Yang Jinshuai, Huang Yongfeng
- Help Me Write A Poem: Instruction Tuning As A Vehicle For Collaborative Poetry Writing Tuhin Chakrabarty, Vishakh Padmakumar, He He
- Efficient Few-shot Learning Without Prompts Tunstall Lewis, Reimers Nils, Jo Unso Eun Seo, Bates Luke, Korat Daniel, Wasserblat Moshe, Pereg Oren
- Effect And Analysis Of Large-scale Language Model Rescoring On Competitive ASR Systems Udagawa Takuma, Suzuki Masayuki, Kurata Gakuto, Itoh Nobuyasu, Saon George
- Inferring Implicit Relations In Complex Questions With Language Models Uri Katz, Mor Geva, Jonathan Berant
- Towards Reasoning-aware Explainable VQA Vaideeswaran Rakesh, Gao Feng, Mathur Abhinav, Thattai Govind
- Dylora: Parameter Efficient Tuning Of Pre-trained Models Using Dynamic Search-free Low-rank Adaptation Valipour Mojtaba, Rezagholizadeh Mehdi, Kobyzev Ivan, Ghodsi Ali
- Will It Blend? Mixing Training Paradigms & Prompting For Argument Quality Prediction Van Der Meer Michiel, Reuver Myrthe, Khurana Urja, Krause Lea, Santamaría Selene Báez
- VLSP 2021 - Vimrc Challenge: Vietnamese Machine Reading Comprehension Van Nguyen Kiet, Tran Son Quoc, Nguyen Luan Thanh, Van Huynh Tin, Luu Son T., Nguyen Ngan Luu-thuy
- UGIF: UI Grounded Instruction Following Venkatesh Sagar Gubbi, Talukdar Partha, Narayanan Srini
- How Large Language Models Are Transforming Machine-paraphrased Plagiarism Wahle Jan Philip, Ruas Terry, Kirstein Frederic, Gipp Bela
- What Do They Capture? -- A Structural Analysis Of Pre-trained Language Models For Source Code Wan Yao, Zhao Wei, Zhang Hongyu, Sui Yulei, Xu Guandong, Jin Hai
- Unite: Unified Translation Evaluation Wan Yu, Liu Dayiheng, Yang Baosong, Zhang Haibo, Chen Boxing, Wong Derek F., Chao Lidia S.
- STRUDEL: Structured Dialogue Summarization For Dialogue Comprehension Wang Borui, Feng Chengcheng, Nair Arjun, Mao Madelyn, Desai Jai, Celikyilmaz Asli, Li Haoran, Mehdad Yashar, Radev Dragomir
- Exploring The Limits Of Domain-adaptive Training For Detoxifying Large-scale Language Models Wang Boxin, Ping Wei, Xiao Chaowei, Xu Peng, Patwary Mostofa, Shoeybi Mohammad, Li Bo, Anandkumar Anima, Catanzaro Bryan
- Knowledgeable Salient Span Mask For Enhancing Language Models As Knowledge Base Wang Cunxiang, Luo Fuli, Li Yanyang, Xu Runxin, Huang Fei, Zhang Yue
- Learning To Decompose Visual Features With Latent Textual Prompts Wang Feng, Li Manling, Lin Xudong, Lv Hairong, Schwing Alexander G., Ji Heng
- Knowledge Prompting In Pre-trained Language Model For Natural Language Understanding Wang Jianing, Huang Wenkang, Shi Qiuhui, Wang Hongbin, Qiu Minghui, Li Xiang, Gao Ming
- Lilt: A Simple Yet Effective Language-independent Layout Transformer For Structured Document Understanding Wang Jiapeng, Jin Lianwen, Ding Kai
- Rethinking Textual Adversarial Defense For Pre-trained Language Models Wang Jiayi, Bao Rongzhou, Zhang Zhuosheng, Zhao Hai
- TAG: Boosting Text-vqa Via Text-aware Visual Question-answer Generation Wang Jun, Gao Mingfei, Hu Yuqian, Selvaraju Ramprasaath R., Ramaiah Chetan, Xu Ran, Jaja Joseph F., Davis Larry S.
- Pretraining Without Attention Wang Junxiong, Yan Jing Nathan, Gu Albert, Rush Alexander M.
- Zero-shot Image Captioning By Anchor-augmented Vision-language Space Alignment Wang Junyang, Zhang Yi, Yan Ming, Zhang Ji, Sang Jitao
- Hybrid-regressive Neural Machine Translation Wang Qiang, Hu Xinhui, Chen Ming
- What Language Model Architecture And Pretraining Objective Work Best For Zero-shot Generalization? Wang Thomas, Roberts Adam, Hesslow Daniel, Scao Teven Le, Chung Hyung Won, Beltagy Iz, Launay Julien, Raffel Colin
- Image As A Foreign Language: Beit Pretraining For All Vision And Vision-language Tasks Wang Wenhui, Bao Hangbo, Dong Li, Bjorck Johan, Peng Zhiliang, Liu Qiang, Aggarwal Kriti, Mohammed Owais Khan, Singhal Saksham, Som Subhojit, Wei Furu
- Elaboration-generating Commonsense Question Answering At Scale Wang Wenya, Srikumar Vivek, Hajishirzi Hanna, Smith Noah A.
- Qrelscore: Better Evaluating Generated Questions With Deeper Understanding Of Context-aware Relevance Wang Xiaoqiang, Liu Bang, Tang Siliang, Wu Lingfei
- Finding Skill Neurons In Pre-trained Transformer-based Language Models Wang Xiaozhi, Wen Kaiyue, Zhang Zhengyan, Hou Lei, Liu Zhiyuan, Li Juanzi
- Compilable Neural Code Generation With Compiler Feedback Wang Xin, Wang Yasheng, Wan Yao, Mi Fei, Li Yitong, Zhou Pingyi, Liu Jin, Wu Hao, Jiang Xin, Liu Qun
- Pay Attention To Your Tone: Introducing A New Dataset For Polite Language Rewrite Wang Xun, Ge Tao, Mao Allen, Li Yuki, Wei Furu, Chen Si-qing
- S-prompts Learning With Pre-trained Transformers: An Occam's Razor For Domain Incremental Learning Wang Yabin, Huang Zhiwu, Hong Xiaopeng
- Adamix: Mixture-of-adaptations For Parameter-efficient Model Tuning Wang Yaqing, Agarwal Sahaj, Mukherjee Subhabrata, Liu Xiaodong, Gao Jing, Awadallah Ahmed Hassan, Gao Jianfeng
- Controlling Styles In Neural Machine Translation With Activation Prompt Wang Yifan, Sun Zewei, Cheng Shanbo, Zheng Weiguo, Wang Mingxuan
- Pan More Gold From The Sand: Refining Open-domain Dialogue Training With Noisy Self-retrieval Generation Wang Yihe, Li Yitong, Wang Yasheng, Mi Fei, Zhou Pingyi, Wang Xin, Liu Jin, Jiang Xin, Liu Qun
- Multimodal Adaptive Distillation For Leveraging Unimodal Encoders For Vision-language Tasks Wang Zhecan, Codella Noel, Chen Yen-chun, Zhou Luowei, Dai Xiyang, Xiao Bin, Yang Jianwei, You Haoxuan, Chang Kai-wei, Chang Shih-fu, Yuan Lu
- Unified Multimodal Model With Unlikelihood Training For Visual Dialog Wang Zihao, Wang Junli, Jiang Changjun
- Emergent Analogical Reasoning In Large Language Models Webb Taylor, Holyoak Keith J., Lu Hongjing
- EGCR: Explanation Generation For Conversational Recommendation Wen Bingbing, Bu Xiaoning, Shah Chirag
- Mixed-effects Transformers For Hierarchical Adaptation White Julia, Goodman Noah, Hawkins Robert
- Probing Of Quantitative Values In Abstractive Summarization Models White Nathan M.
- Generative Or Contrastive? Phrase Reconstruction For Better Sentence Representation Learning Wu Bohong, Zhao Hai
- Sentence Representation Learning With Generative Objective Rather Than Contrastive Objective Wu Bohong, Zhao Hai
- Adversarial Self-attention For Language Understanding Wu Hongqiu, Ding Ruixue, Zhao Hai, Xie Pengjun, Huang Fei, Zhang Min
- Stateful Memory-augmented Transformers For Efficient Dialogue Modeling Wu Qingyang, Yu Zhou
- Exploring The Efficacy Of Pre-trained Checkpoints In Text-to-music Generation Task Wu Shangda, Sun Maosong
- Extreme Compression For Pre-trained Transformers Made Simple And Efficient Wu Xiaoxia, Yao Zhewei, Zhang Minjia, Li Conglong, He Yuxiong
- Memorizing Transformers Wu Yuhuai, Rabe Markus N., Hutchins Delesley, Szegedy Christian
- An Efficient Memory-augmented Transformer For Knowledge-intensive NLP Tasks Wu Yuxiang, Zhao Yu, Hu Baotian, Minervini Pasquale, Stenetorp Pontus, Riedel Sebastian
- Modeling Context With Linear Attention For Scalable Document-level Translation Wu Zhaofeng, Peng Hao, Pappas Nikolaos, Smith Noah A.
- Enhancing Pre-trained Models With Text Structure Knowledge For Question Generation Wu Zichen, Jia Xin, Qu Fanyi, Wu Yunfang
- The Unreliability Of Explanations In Few-shot Prompting For Textual Reasoning Xi Ye, Greg Durrett
- Towards Unified Conversational Recommender Systems Via Knowledge-enhanced Prompt Learning Xiaolei Wang, Kun Zhou, Ji-rong Wen, Wayne Xin Zhao
- Dual Modality Prompt Tuning For Vision-language Pre-trained Model Xing Yinghui, Wu Qirui, Cheng De, Zhang Shizhou, Liang Guoqiang, Wang Peng, Zhang Yanning
- Evaluating And Improving Context Attention Distribution On Multi-turn Response Generation Using Self-contained Distractions Xing Yujie, Gulla Jon Atle
- MGA-VQA: Multi-granularity Alignment For Visual Question Answering Xiong Peixi, Shen Yilin, Jin Hongxia
- Adapting Pretrained Text-to-text Models For Long Text Sequences Xiong Wenhan, Gupta Anchit, Toshniwal Shubham, Mehdad Yashar, Yih Wen-tau
- COSPLAY: Concept Set Guided Personalized Dialogue Generation Across Both Party Personas Xu Chen, Li Piji, Wang Wei, Yang Haoran, Wang Siyun, Xiao Chuangbai
- Go-tuning: Improving Zero-shot Learning Abilities Of Smaller Language Models Xu Jingjing, Dong Qingxiu, Liu Hongyi, Li Lei
- Integrating Translation Memories Into Non-autoregressive Machine Translation Xu Jitao, Crego Josep, Yvon François
- Evaluating Parameter Efficient Learning For Generation Xu Peng, Patwary Mostofa, Prabhumoye Shrimai, Adams Virginia, Prenger Ryan J., Ping Wei, Lee Nayeon, Shoeybi Mohammad, Catanzaro Bryan
- Bridgetower: Building Bridges Between Encoders In Vision-language Representation Learning Xu Xiao, Wu Chenfei, Rosenman Shachar, Lal Vasudev, Che Wanxiang, Duan Nan
- CLIP Also Understands Text: Prompting CLIP For Phrase Understanding Yan An, Li Jiacheng, Zhu Wanrong, Lu Yujie, Wang William Yang, Mcauley Julian
- Probing Causes Of Hallucinations In Neural Machine Translations Yan Jianhao, Meng Fandong, Zhou Jie
- Generative Negative Text Replay For Continual Vision-language Pretraining Yan Shipeng, Hong Lanqing, Xu Hang, Han Jianhua, Tuytelaars Tinne, Li Zhenguo, He Xuming
- Learning To Answer Visual Questions From Web Videos Yang Antoine, Miech Antoine, Sivic Josef, Laptev Ivan, Schmid Cordelia
- Testaug: A Framework For Augmenting Capability-based NLP Tests Yang Guanqun, Haque Mirazul, Song Qiaochu, Yang Wei, Liu Xueqing
- Tableformer: Robust Transformer Modeling For Table-text Encoding Yang Jingfeng, Gupta Aditya, Upadhyay Shyam, He Luheng, Goel Rahul, Paul Shachi
- GLUE-X: Evaluating Natural Language Understanding Models From An Out-of-distribution Generalization Perspective Yang Linyi, Zhang Shuibai, Qin Libo, Li Yafu, Wang Yidong, Liu Hanmeng, Wang Jindong, Xie Xing, Zhang Yue
- AD-DROP: Attribution-driven Dropout For Robust Language Model Fine-tuning Yang Tao, Deng Jinghao, Quan Xiaojun, Wang Qifan, Nie Shaoliang
- What GPT Knows About Who Is Who Yang Xiaohan, Peynetti Eduardo, Meerman Vasco, Tanner Chris
- Building Knowledge-grounded Dialogue Systems With Graph-based Semantic Modeling Yang Yizhe, Huang Heyan, Gao Yang, Li Jiawei
- Parameter-efficient Tuning Makes A Good Classification Head Yang Zhuoyi, Ding Ming, Guo Yanhui, Lv Qingsong, Tang Jie
- Prompt Tuning For Discriminative Pre-trained Language Models Yao Yuan, Dong Bowen, Zhang Ao, Zhang Zhengyan, Xie Ruobing, Liu Zhiyuan, Lin Leyu, Sun Maosong, Wang Jianyong
- Zeroquant: Efficient And Affordable Post-training Quantization For Large-scale Transformers Yao Zhewei, Aminabadi Reza Yazdani, Zhang Minjia, Wu Xiaoxia, Li Conglong, He Yuxiong
- Random-ltd: Random And Layerwise Token Dropping Brings Efficient Training For Large-scale Transformers Yao Zhewei, Wu Xiaoxia, Li Conglong, Holmes Connor, Zhang Minjia, Li Cheng, He Yuxiong
- Language Models Are General-purpose Interfaces Yaru Hao, Haoyu Song, Li Dong, Shaohan Huang, Zewen Chi, Wenhui Wang, Shuming Ma, Furu Wei
- Structured Prompting: Scaling In-context Learning To 1,000 Examples Yaru Hao, Yutao Sun, Li Dong, Zhixiong Han, Yuxian Gu, Furu Wei
- Retrieval-augmented Multimodal Language Modeling Yasunaga Michihiro, Aghajanyan Armen, Shi Weijia, James Rich, Leskovec Jure, Liang Percy, Lewis Mike, Zettlemoyer Luke, Yih Wen-tau
- Modeling Multi-hop Question Answering As Single Sequence Prediction Yavuz Semih, Hashimoto Kazuma, Zhou Yingbo, Keskar Nitish Shirish, Xiong Caiming
- Guess The Instruction! Flipped Learning Makes Language Models Stronger Zero-shot Learners Ye Seonghyeon, Kim Doyoung, Jang Joel, Shin Joongbo, Seo Minjoon
- Alzheimer's Diagnosis And Generation-based Chatbot Using Hierarchical Attention And Transformer Yeong Park Jun, Jong Shin Su, Hwan Choi Chang, Jae Lee Jung, Sang-il Choi
- Transcending Scaling Laws With 0.1% Extra Compute Yi Tay, Jason Wei, Hyung Won Chung, Vinh Q. Tran, David R. So, Siamak Shakeri, Xavier Garcia, Huaixiu Steven Zheng, Jinfeng Rao, Aakanksha Chowdhery, Denny Zhou, Donald Metzler, Slav Petrov, Neil Houlsby, Quoc V. Le, Mostafa Dehghani
- UL2: Unifying Language Learning Paradigms Yi Tay, Mostafa Dehghani, Vinh Q. Tran, Xavier Garcia, Jason Wei, Xuezhi Wang, Hyung Won Chung, Siamak Shakeri, Dara Bahri, Tal Schuster, Huaixiu Steven Zheng, Denny Zhou, Neil Houlsby, Donald Metzler
- Making Large Language Models Better Reasoners With Step-aware Verifier Yifei Li, Zeqi Lin, Shizhuo Zhang, Qiang Fu, Bei Chen, Jian-guang Lou, Weizhu Chen
- Active Example Selection For In-context Learning Yiming Zhang, Shi Feng, Chenhao Tan
- Categorizing Semantic Representations For Neural Machine Translation Yin Yongjing, Li Yafu, Meng Fandong, Zhou Jie, Zhang Yue
- Large Language Models Are Better Reasoners With Self-verification Yixuan Weng, Minjun Zhu, Fei Xia, Bin Li, Shizhu He, Shengping Liu, Bin Sun, Kang Liu, Jun Zhao
- Super-naturalinstructions: Generalization Via Declarative Instructions On 1600+ NLP Tasks Yizhong Wang, Swaroop Mishra, Pegah Alipoormolabashi, Yeganeh Kordi, Amirreza Mirzaei, Anjana Arunkumar, Arjun Ashok, Arut Selvan Dhanasekaran, Atharva Naik, David Stap, Eshaan Pathak, Giannis Karamanolakis, Haizhi Gary Lai, Ishan Purohit, Ishani Mondal, Jacob Anderson, Kirby Kuznia, Krima Doshi, Maitreya Patel, Kuntal Kumar Pal, Mehrad Moradshahi, Mihir Parmar, Mirali Purohit, Neeraj Varshney, Phani Rohitha Kaza, Pulkit Verma, Ravsehaj Singh Puri, Rushang Karia, Shailaja Keyur Sampat, Savan Doshi, Siddhartha Mishra, Sujan Reddy, Sumanta Patro, Tanay Dixit, Xudong Shen, Chitta Baral, Yejin Choi, Noah A. Smith, Hannaneh Hajishirzi, Daniel Khashabi
- Scaling Autoregressive Models For Content-rich Text-to-image Generation Yu Jiahui, Xu Yuanzhong, Koh Jing Yu, Luong Thang, Baid Gunjan, Wang Zirui, Vasudevan Vijay, Ku Alexander, Yang Yinfei, Ayan Burcu Karagol, Hutchinson Ben, Han Wei, Parekh Zarana, Li Xin, Zhang Han, Baldridge Jason, Wu Yonghui
- Efficient Language Modeling With Sparse All-mlp Yu Ping, Artetxe Mikel, Ott Myle, Shleifer Sam, Gong Hongyu, Stoyanov Ves, Li Xian
- Tri-attention: Explicit Context-aware Attention Mechanism For Natural Language Processing Yu Rui, Li Yifeng, Lu Wenpeng, Cao Longbing
- Bilaterally Slimmable Transformer For Elastic And Efficient Visual Question Answering Yu Zhou, Jin Zitian, Yu Jun, Xu Mingliang, Wang Hongbo, Fan Jianping
- Hype: Better Pre-trained Language Model Fine-tuning With Hidden Representation Perturbation Yuan Hongyi, Yuan Zheng, Tan Chuanqi, Huang Fei, Huang Songfang
- Wudaomm: A Large-scale Multi-modal Dataset For Pre-training Models Yuan Sha, Zhao Shuai, Leng Jiahong, Xue Zhao, Zhao Hanyu, Liu Peiyu, Gong Zheng, Zhao Wayne Xin, Li Junyi, Tang Jie
- Can Pretrained Language Models (yet) Reason Deductively? Yuan Zhangdie, Hu Songbo, Vulić Ivan, Korhonen Anna, Meng Zaiqiao
- Competition-level Code Generation With Alphacode Yujia Li, David Choi, Junyoung Chung, Nate Kushman, Julian Schrittwieser, Rémi Leblond, Tom Eccles, James Keeling, Felix Gimeno, Agustin Dal Lago, Thomas Hubert, Peter Choy, Cyprien De Masson D'autume, Igor Babuschkin, Xinyun Chen, Po-sen Huang, Johannes Welbl, Sven Gowal, Alexey Cherepanov, James Molloy, Daniel J. Mankowitz, Esme Sutherland Robson, Pushmeet Kohli, Nando De Freitas, Koray Kavukcuoglu, Oriol Vinyals
- Dyrex: Dynamic Query Representation For Extractive Question Answering Zaratiana Urchade, Khbir Niama El, Núñez Dennis, Holat Pierre, Tomeh Nadi, Charnois Thierry
- Better Pre-training By Reducing Representation Confusion Zhang Haojie, Liang Mingfei, Xie Ruobing, Sun Zhenlong, Zhang Bo, Lin Leyu
- Improved Data Augmentation For Translation Suggestion Zhang Hongxiao, Lai Siyu, Zhang Songming, Huang Hui, Chen Yufeng, Xu Jinan, Liu Jian
- Scala: Accelerating Adaptation Of Pre-trained Transformer-based Language Models Via Efficient Large-batch Adversarial Noise Zhang Minjia, Naresh Niranjan Uma, He Yuxiong
- Analyzing And Simulating User Utterance Reformulation In Conversational Recommender Systems Zhang Shuo, Wang Mu-chun, Balog Krisztian
- SMDT: Selective Memory-augmented Neural Document Translation Zhang Xu, Yang Jian, Huang Haoyang, Ma Shuming, Zhang Dongdong, Li Jinlong, Wei Furu
- Unveiling Transformers With LEGO: A Synthetic Reasoning Task Zhang Yi, Backurs Arturs, Bubeck Sébastien, Eldan Ronen, Gunasekar Suriya, Wagner Tal
- BLISS: Robust Sequence-to-sequence Learning Via Self-supervised Input Representation Zhang Zheng, Ding Liang, Cheng Dazhao, Liu Xuebo, Zhang Min, Tao Dacheng
- Can Offline Reinforcement Learning Help Natural Language Understanding? Zhang Ziqi, Wang Yile, Zhang Yue, Wang Donglin
- M-adapter: Modality Adaptation For End-to-end Speech-to-text Translation Zhao Jinming, Yang Hao, Shareghi Ehsan, Haffari Gholamreza
- Attend, Memorize And Generate: Towards Faithful Table-to-text Generation In Few Shots Zhao Wenting, Liu Ye, Wan Yao, Yu Philip S.
- Lite Unified Modeling For Discriminative Reading Comprehension Zhao Yilin, Zhao Hai, Shen Libin, Zhao Yinggong
- MSDF: A General Open-domain Multi-skill Dialog Framework Zhao Yu, Hu Xinshuo, Li Yunxin, Hu Baotian, Li Dongfang, Chen Sichao, Wang Xiaolong
- Medical Dialogue Response Generation With Pivotal Information Recalling Zhao Yu, Li Yunxin, Wu Yuxiang, Hu Baotian, Chen Qingcai, Wang Xiaolong, Ding Yuxin, Zhang Min
- Training Language Models With Memory Augmentation Zhong Zexuan, Lei Tao, Chen Danqi
- Learning To Decompose: Hypothetical Question Decomposition Based On Comparable Texts Zhou Ben, Richardson Kyle, Yu Xiaodong, Roth Dan
- Reflection Of Thought: Inversely Eliciting Numerical Reasoning In Language Models Via Solving Linear Systems Zhou Fan, Dong Haoyu, Liu Qian, Cheng Zhoujun, Han Shi, Zhang Dongmei
- Binding Language Models In Symbolic Languages Zhoujun Cheng, Tianbao Xie, Peng Shi, Chengzu Li, Rahul Nadkarni, Yushi Hu, Caiming Xiong, Dragomir Radev, Mari Ostendorf, Luke Zettlemoyer, Noah A. Smith, Tao Yu
- Syntax-informed Question Answering With Heterogeneous Graph Transformer Zhu Fangyi, Tan Lok You, Ng See-kiong, Bressan Stéphane
- Predicting Fine-tuning Performance With Probing Zhu Zining, Shahtalebi Soroosh, Rudzicz Frank
- Automatic Chain Of Thought Prompting In Large Language Models Zhuosheng Zhang, Aston Zhang, Mu Li, Alex Smola
- Future Sight: Dynamic Story Generation With Large Pretrained Language Models Zimmerman Brian D., Sahu Gaurav, Vechtomova Olga
- Exploring And Evaluating Personalized Models For Code Generation Zlotchevski Andrei, Drain Dawn, Svyatkovskiy Alexey, Clement Colin, Sundaresan Neel, Tufano Michele
- A Survey On GPT-3 Zong Mingyu, Krishnamachari Bhaskar
- St-moe: Designing Stable And Transferable Sparse Expert Models Zoph Barret, Bello Irwan, Kumar Sameer, Du Nan, Huang Yanping, Dean Jeff, Shazeer Noam, Fedus William
- Efficient Long Sequence Modeling Via State Space Augmented Transformer Zuo Simiao, Liu Xiaodong, Jiao Jian, Charles Denis, Manavoglu Eren, Zhao Tuo, Gao Jianfeng
- LM-CPPF: Paraphrasing-guided Data Augmentation For Contrastive Prompt-based Few-shot Fine-tuning Abaskohi Amirhossein, Rothe Sascha, Yaghoobzadeh Yadollah
- Let The Llms Talk: Simulating Human-to-human Conversational QA Via Zero-shot Llm-to-llm Interactions Abbasiantaeb Zahra, Yuan Yifei, Kanoulas Evangelos, Aliannejadi Mohammad
- I Was Blind But Now I See: Implementing Vision-enabled Dialogue In Social Robots Abbo Giulio Antonio, Belpaeme Tony
- Attention Over Pre-trained Sentence Embeddings For Long Document Classification Abdaoui Amine, Dutta Sourav
- VD-GR: Boosting Visual Dialog With Cascaded Spatial-temporal Multi-modal Graphs Abdessaied Adnen, Shi Lei, Bulling Andreas
- Beyond Traditional Teaching: The Potential Of Large Language Models And Chatbots In Graduate Engineering Education Abedi Mahyar, Alshybani Ibrahem, Shahadat Muhammad Rubayat Bin, Murillo Michael S.
- Comparative Analysis Of Chatgpt, GPT-4, And Microsoft Bing Chatbots For GRE Test Abu-haifa Mohammad, Etawi Bara'a, Alkhatatbeh Huthaifa, Ababneh Ayman
- On Codex Prompt Engineering For OCL Generation: An Empirical Study Abukhalaf Seif, Hamdaqa Mohammad, Khomh Foutse
- On The Relationship Between Skill Neurons And Robustness In Prompt Tuning Ackermann Leon, Ohmer Xenia
- Multi-party Goal Tracking With Llms: Comparing Pre-training, Fine-tuning, And Prompt Engineering Addlesee Angus, Sieińska Weronika, Gunson Nancie, Garcia Daniel Hernández, Dondrup Christian, Lemon Oliver
- The ADAIO System At The BEA-2023 Shared Task On Generating AI Teacher Responses In Educational Dialogues Adigwe Adaeze, Yuan Zheng
- KITLM: Domain-specific Knowledge Integration Into Language Models For Question Answering Agarwal Ankush, Gawade Sakharam, Azad Amar Prakash, Bhattacharyya Pushpak
- Can Large Language Models Be Good Path Planners? A Benchmark And Investigation On Spatial-temporal Reasoning Aghzal Mohamed, Plaku Erion, Yao Ziyu
- SARATHI: Efficient LLM Inference By Piggybacking Decodes With Chunked Prefills Agrawal Amey, Panwar Ashish, Mohan Jayashree, Kwatra Nipun, Gulavani Bhargav S., Ramjee Ramachandran
- Story Visualization By Online Text Augmentation With Context Memory Ahn Daechul, Kim Daneul, Song Gwangmo, Kim Seung Hwan, Lee Honglak, Kang Dongyeop, Choi Jonghyun
- MEGA: Multilingual Evaluation Of Generative AI Ahuja Kabir, Diddee Harshita, Hada Rishav, Ochieng Millicent, Ramesh Krithika, Jain Prachi, Nambi Akshay, Ganu Tanuja, Segal Sameer, Axmed Maxamed, Bali Kalika, Sitaram Sunayana
- MEGAVERSE: Benchmarking Large Language Models Across Languages, Modalities, Models And Tasks Ahuja Sanchit, Aggarwal Divyanshu, Gumma Varun, Watts Ishaan, Sathe Ashutosh, Ochieng Millicent, Hada Rishav, Jain Prachi, Axmed Maxamed, Bali Kalika, Sitaram Sunayana
- Can We Trust The Evaluation On Chatgpt? Aiyappa Rachith, An Jisun, Kwak Haewoon, Ahn Yong-yeol
- The Emergence Of Essential Sparsity In Large Pre-trained Models: The Weights That Matter Ajay Jaiswal, Shiwei Liu, Tianlong Chen, Zhangyang Wang
- CESAR: Automatic Induction Of Compositional Instructions For Multi-turn Dialogs Aksu Taha, Hazarika Devamanyu, Mehri Shikib, Kim Seokhwan, Hakkani-tür Dilek, Liu Yang, Namazifar Mahdi
- An In-depth Look At Gemini's Language Abilities Akter Syeda Nahida, Yu Zichun, Muhamed Aashiq, Ou Tianyue, Bäuerle Alex, Cabrera Ángel Alexander, Dholakia Krish, Xiong Chenyan, Neubig Graham
- RL4F: Generating Natural Language Feedback With Reinforcement Learning For Repairing Model Outputs Akyürek Afra Feyza, Akyürek Ekin, Madaan Aman, Kalyan Ashwin, Clark Peter, Wijaya Derry, Tandon Niket
- Can Language Models Employ The Socratic Method? Experiments With Code Debugging Al-hossami Erfan, Bunescu Razvan, Smith Justin, Teehan Ryan
- Position Interpolation Improves Alibi Extrapolation Al-khateeb Faisal, Dey Nolan, Soboleva Daria, Hestness Joel
- Chatgpt And Beyond: The Generative AI Revolution In Education Al-smadi Mohammad
- Mamba: Linear-time Sequence Modeling With Selective State Spaces Albert Gu, Tri Dao
- Mistral 7B Albert Q. Jiang, Alexandre Sablayrolles, Arthur Mensch, Chris Bamford, Devendra Singh Chaplot, Diego De Las Casas, Florian Bressand, Gianna Lengyel, Guillaume Lample, Lucile Saulnier, Lélio Renard Lavaud, Marie-anne Lachaux, Pierre Stock, Teven Le Scao, Thibaut Lavril, Thomas Wang, Timothée Lacroix, William El Sayed
- Tokenizer Choice For LLM Training: Negligible Or Crucial? Ali Mehdi, Fromm Michael, Thellmann Klaudia, Rutmann Richard, Lübbering Max, Leveling Johannes, Klug Katrin, Ebert Jan, Doll Niclas, Buschhoff Jasper Schulze, Jain Charvi, Weber Alexander Arno, Jurkschat Lena, Abdelwahab Hammam, John Chelsea, Suarez Pedro Ortiz, Ostendorff Malte, Weinbach Samuel, Sifa Rafet, Kesselheim Stefan, Flores-herr Nicolas
- OPT-R: Exploring The Role Of Explanations In Finetuning And Prompting For Reasoning Skills Of Large Language Models Alkhamissi Badr, Verma Siddharth, Yu Ping, Jin Zhijing, Celikyilmaz Asli, Diab Mona
- Physics Of Language Models: Part 3.2, Knowledge Manipulation Allen-zhu Zeyuan, Li Yuanzhi
- The Falcon Series Of Open Language Models Almazrouei Ebtesam, Alobeidli Hamza, Alshamsi Abdulaziz, Cappelli Alessandro, Cojocaru Ruxandra, Debbah Mérouane, Goffinet Étienne, Hesslow Daniel, Launay Julien, Malartic Quentin, Mazzotta Daniele, Noune Badreddine, Pannier Baptiste, Penedo Guilherme
- Detecting Language Model Attacks With Perplexity Alon Gabriel, Kamfonas Michael
- Pixt3: Pixel-based Table-to-text Generation Alonso Iñigo, Agirre Eneko, Lapata Mirella
- Is BERT Blind? Exploring The Effect Of Vision-and-language Pretraining On Visual Language Understanding Alper Morris, Fiman Michael, Averbuch-elor Hadar
- Taqyim: Evaluating Arabic NLP Tasks Using Chatgpt Models Alyafeai Zaid, Alshaibani Maged S., Alkhamissi Badr, Luqman Hamzah, Alareqi Ebrahim, Fadel Ali
- Generative AI Perceptions: A Survey To Measure The Perceptions Of Faculty, Staff, And Students On Generative AI Tools In Academia Amani Sara, White Lance, Balart Trini, Arora Laksha, Shryock Kristi J., Brumbelow Kelly, Watson Karan L.
- Chatgpt-3.5, Chatgpt-4, Google Bard, And Microsoft Bing To Improve Health Literacy And Communication In Pediatric Populations And Beyond Amin Kanhai S., Mayes Linda, Khosla Pavan, Doshi Rushabh
- Prompt-engineering And Transformer-based Question Generation And Evaluation Amyeen Rubaba
- L-eval: Instituting Standardized Evaluation For Long Context Language Models An Chenxin, Gong Shansan, Zhong Ming, Zhao Xingjian, Li Mukai, Zhang Jun, Kong Lingpeng, Qiu Xipeng
- Does Deep Learning Learn To Abstract? A Systematic Probing Framework An Shengnan, Lin Zeqi, Chen Bei, Fu Qiang, Zheng Nanning, Lou Jian-guang
- Learning From Mistakes Makes LLM Better Reasoner An Shengnan, Ma Zexiong, Lin Zeqi, Zheng Nanning, Lou Jian-guang, Chen Weizhu
- Studying The Impacts Of Pre-training Using Chatgpt-generated Text On Downstream Tasks Anand Sarthak
- Protip: Progressive Tool Retrieval Improves Planning Anantha Raviteja, Bandyopadhyay Bortik, Kashi Anirudh, Mahinder Sayantan, Hill Andrew W, Chappidi Srinivas
- Context Tuning For Retrieval Augmented Generation Anantha Raviteja, Bethi Tharun, Vodianik Danil, Chappidi Srinivas
- Chemcrow: Augmenting Large-language Models With Chemistry Tools Andres M Bran, Sam Cox, Oliver Schilter, Carlo Baldassari, Andrew D White, Philippe Schwaller
- Expel: LLM Agents Are Experiential Learners Andrew Zhao, Daniel Huang, Quentin Xu, Matthieu Lin, Yong-jin Liu, Gao Huang
- Augmenting Llms With Knowledge: A Survey On Hallucination Prevention Andriopoulos Konstantinos, Pouwelse Johan
- Randomized Positional Encodings Boost Length Generalization Of Transformers Anian Ruoss, Grégoire Delétang, Tim Genewein, Jordi Grau-moya, Róbert Csordás, Mehdi Bennani, Shane Legg, Joel Veness
- Distilling Efficient Language-specific Models For Cross-lingual Transfer Ansell Alan, Ponti Edoardo Maria, Korhonen Anna, Vulić Ivan
- Scaling Laws For Language Encoding Models In Fmri Antonello Richard, Vaidya Aditya, Huth Alexander G.
- Mixture Of Tokens: Efficient Llms Through Cross-example Aggregation Antoniak Szymon, Jaszczur Sebastian, Krutul Michał, Pióro Maciej, Krajewski Jakub, Ludziejewski Jan, Odrzygóźdź Tomasz, Cygan Marek
- Agenttuning: Enabling Generalized Agent Abilities For Llms Aohan Zeng, Mingdao Liu, Rui Lu, Bowen Wang, Xiao Liu, Yuxiao Dong, Jie Tang
- Detecting Natural Language Biases With Prompt-based Learning Aowal Md Abdul, Islam Maliha T, Mammen Priyanka Mary, Shetty Sandesh
- Sequence-to-sequence Spanish Pre-trained Language Models Araujo Vladimir, Trusca Maria Mihaela, Tufiño Rodrigo, Moens Marie-francine
- Slimfit: Memory-efficient Fine-tuning Of Transformer-based Models Using Training Dynamics Ardakani Arash, Haan Altan, Tan Shangyin, Popovici Doru Thom, Cheung Alvin, Iancu Costin, Sen Koushik
- Unmasking The Giant: A Comprehensive Evaluation Of Chatgpt's Proficiency In Coding Algorithms And Data Structures Arefin Sayed Erfan, Heya Tasnia Ashrafi, Al-qudah Hasan, Ineza Ynes, Serwadda Abdul
- GPT-4 Can't Reason Arkoudas Konstantine
- AI And Education: An Investigation Into The Use Of Chatgpt For Systems Thinking Arndt Holger
- Learning And Leveraging Verifiers To Improve Planning Capabilities Of Pre-trained Language Models Arora Daman, Kambhampati Subbarao
- Have Llms Advanced Enough? A Challenging Problem Solving Benchmark For Large Language Models Arora Daman, Singh Himanshu Gaurav, Mausam
- Zoology: Measuring And Improving Recall In Efficient Language Models Arora Simran, Eyuboglu Sabri, Timalsina Aman, Johnson Isys, Poli Michael, Zou James, Rudra Atri, Ré Christopher
- Towards A Unified Multimodal Reasoning Framework Arun Abhinav, Mal Dipendra Singh, Soni Mehul, Sawada Tomohiro
- Real-time Visual Feedback To Guide Benchmark Creation: A Human-and-metric-in-the-loop Workflow Arunkumar Anjana, Mishra Swaroop, Sachdeva Bhavdeep, Baral Chitta, Bryan Chris
- BUFFET: Benchmarking Large Language Models For Few-shot Cross-lingual Transfer Asai Akari, Kudugunta Sneha, Yu Xinyan Velocity, Blevins Terra, Gonen Hila, Reid Machel, Tsvetkov Yulia, Ruder Sebastian, Hajishirzi Hannaneh
- Self-rag: Learning To Retrieve, Generate, And Critique Through Self-reflection Asai Akari, Wu Zeqiu, Wang Yizhong, Sil Avirup, Hajishirzi Hannaneh
- Is Chatgpt A Biomedical Expert? -- Exploring The Zero-shot Performance Of Current GPT Models In Biomedical Tasks Ateia Samy, Kruschwitz Udo
- Factuality Challenges In The Era Of Large Language Models Augenstein Isabelle, Baldwin Timothy, Cha Meeyoung, Chakraborty Tanmoy, Ciampaglia Giovanni Luca, Corney David, Diresta Renee, Ferrara Emilio, Hale Scott, Halevy Alon, Hovy Eduard, Ji Heng, Menczer Filippo, Miguez Ruben, Nakov Preslav, Scheufele Dietram, Sharma Shivam, Zagni Giovanni
- Unihd At TSAR-2022 Shared Task: Is Compute All We Need For Lexical Simplification? Aumiller Dennis, Gertz Michael
- Using Large Language Models For Zero-shot Natural Language Generation From Knowledge Graphs Axelsson Agnes, Skantze Gabriel
- Taught By The Internet, Exploring Bias In Openais GPT3 Ayaz Ali, Nawalgaria Aditya, Yin Ruilian
- Scaling Transformer To 1M Tokens And Beyond With RMT Aydar Bulatov, Yuri Kuratov, Yermek Kapushev, Mikhail S. Burtsev
- Intentional Biases In LLM Responses Badyal Nicklaus, Jacoby Derek, Coady Yvonne
- Abusing Images And Sounds For Indirect Instruction Injection In Multi-modal Llms Bagdasaryan Eugene, Hsieh Tsung-yin, Nassi Ben, Shmatikov Vitaly
- Evaluating Chatgpt As A Question Answering System: A Comprehensive Analysis And Comparison With Existing Models Bahak Hossein, Taheri Farzaneh, Zojaji Zahra, Kazemi Arefeh
- Chatgpt: Applications, Opportunities, And Threats Bahrini Aram, Khamoshifar Mohammadsadra, Abbasimehr Hossein, Riggs Robert J., Esmaeili Maryam, Majdabadkohne Rastin Mastali, Pasehvar Morteza
- Constituency Parsing Using Llms Bai Xuefeng, Wu Jialong, Chen Yulong, Wang Zhongqing, Zhang Yue
- Longbench: A Bilingual, Multitask Benchmark For Long Context Understanding Bai Yushi, Lv Xin, Zhang Jiajie, Lyu Hongchang, Tang Jiankai, Huang Zhidian, Du Zhengxiao, Liu Xiao, Zeng Aohan, Hou Lei, Dong Yuxiao, Tang Jie, Li Juanzi
- Gpt-4: A Review On Advancements And Opportunities In Natural Language Processing Baktash Jawid Ahmad, Dawodi Mursal
- Tamil-llama: A New Tamil Language Model Based On Llama 2 Balachandran Abhinand
- No That's Not What I Meant: Handling Third Position Repair In Conversational Question Answering Balaraman Vevake, Eshghi Arash, Konstas Ioannis, Papaioannou Ioannis
- Fine-tuning Large Enterprise Language Models Via Ontological Reasoning Baldazzi Teodoro, Bellomarini Luigi, Ceri Stefano, Colombo Andrea, Gentili Andrea, Sallinger Emanuel
- It's Not Easy Being Wrong: Large Language Models Struggle With Process Of Elimination Reasoning Balepur Nishant, Palta Shramay, Rudinger Rachel
- Investigating Pre-trained Language Models On Cross-domain Datasets, A Step Closer To General AI Ballout Mohamad, Krumnack Ulf, Heidemann Gunther, Kühnberger Kai-uwe
- Redefining Developer Assistance: Through Large Language Models In Software Ecosystem Banerjee Somnath, Dutta Avik, Layek Sayan, Sahoo Amruit, Joyce Sam Conrad, Hazra Rima
- Vtrain: A Simulation Framework For Evaluating Cost-effective And Compute-optimal Large Language Model Training Bang Jehyeon, Choi Yujeong, Kim Myeongwoo, Kim Yongdeok, Rhu Minsoo
- Exploring Qualitative Research Using Llms Bano Muneera, Zowghi Didar, Whittle Jon
- Assessing And Enhancing The Robustness Of Large Language Models With Task Structure Variations For Logical Reasoning Bao Qiming, Gendron Gael, Peng Alex Yuxuan, Zhong Wanjun, Tan Neset, Chen Yang, Witbrock Michael, Liu Jiamou
- Exploring Iterative Enhancement For Improving Learnersourced Multiple-choice Question Explanations With Large Language Models Bao Qiming, Leinonen Juho, Peng Alex Yuxuan, Zhong Wanjun, Gendron Gaël, Pistotti Timothy, Huang Alice, Denny Paul, Witbrock Michael, Liu Jiamou
- Abstract Meaning Representation-based Logic-driven Data Augmentation For Logical Reasoning Bao Qiming, Peng Alex Yuxuan, Deng Zhenyun, Zhong Wanjun, Gendron Gael, Pistotti Timothy, Tan Neset, Young Nathan, Chen Yang, Zhu Yonghua, Denny Paul, Witbrock Michael, Liu Jiamou
- Separate-and-enhance: Compositional Finetuning For Text2image Diffusion Models Bao Zhipeng, Li Yijun, Singh Krishna Kumar, Wang Yu-xiong, Hebert Martial
- Check Your Facts And Try Again: Improving Large Language Models With External Knowledge And Automated Feedback Baolin Peng, Michel Galley, Pengcheng He, Hao Cheng, Yujia Xie, Yu Hu, Qiuyuan Huang, Lars Liden, Zhou Yu, Weizhu Chen, Jianfeng Gao
- Do Language Models Learn About Legal Entity Types During Pretraining? Barale Claire, Rovatsos Michael, Bhuta Nehal
- Understanding Telecom Language Through Large Language Models Bariah Lina, Zou Hang, Zhao Qiyang, Mouhouche Belkacem, Bader Faouzi, Debbah Merouane
- Llamantino: Llama 2 Models For Effective Text Generation In Italian Language Basile Pierpaolo, Musacchio Elio, Polignano Marco, Siciliani Lucia, Fiameni Giuseppe, Semeraro Giovanni
- Text Summarization Using Large Language Models: A Comparative Study Of Mpt-7b-instruct, Falcon-7b-instruct, And Openai Chat-gpt Models Basyal Lochan, Sanghvi Mihir
- Efficient Pre-training For Localized Instruction Generation Of Videos Batra Anil, Moltisanti Davide, Sevilla-lara Laura, Rohrbach Marcus, Keller Frank
- Testing The Depth Of Chatgpt's Comprehension Via Cross-modal Tasks Based On Ascii-art: Gpt3.5's Abilities In Regard To Recognizing And Generating Ascii-art Are Not Totally Lacking Bayani David
- Alternating Updates For Efficient Transformers Baykal Cenk, Cutler Dylan, Dikkala Nishanth, Ghosh Nikhil, Panigrahy Rina, Wang Xin
- Exploring The Landscape Of Large Language Models In Medical Question Answering Bean Andrew M., Korgul Karolina, Krones Felix, Mccraith Robert, Mahdi Adam
- Span-selective Linear Attention Transformers For Effective And Robust Schema-guided Dialogue State Tracking Bebensee Björn, Lee Haejun
- Large Linguistic Models: Analyzing Theoretical Linguistic Abilities Of Llms Beguš Gašper, Dąbkowski Maksymilian, Rhodes Ryan
- AQUALLM: Audio Question Answering Data Generation Using Large Language Models Behera Swarup Ranjan, Injeti Krishna Mohan, Patibandla Jaya Sai Kiran, Pokala Praveen Kumar, Pailla Balakrishna Reddy
- Examining The Emergence Of Deductive Reasoning In Generative Language Models Belcak Peter, Lanzendörfer Luca A., Wattenhofer Roger
- Harnessing Gpt-3.5-turbo For Rhetorical Role Prediction In Legal Cases Belfathi Anas, Hernandez Nicolas, Monceaux Laura
- Labrador: Exploring The Limits Of Masked Language Modeling For Laboratory Data Bellamy David R., Kumar Bhawesh, Wang Cindy, Beam Andrew
- Worldsense: A Synthetic Benchmark For Grounded Reasoning In Large Language Models Benchekroun Youssef, Dervishi Megi, Ibrahim Mark, Gaya Jean-baptiste, Martinet Xavier, Mialon Grégoire, Scialom Thomas, Dupoux Emmanuel, Hupkes Dieuwke, Vincent Pascal
- @ve: A Chatbot For Latin Bendel Oliver, N'diaye Karim
- Large Language Models In The Workplace: A Case Study On Prompt Engineering For Job Type Classification Benjamin Clavié, Alexandru Ciceu, Frederick Naylor, Guillaume Soulié, Thomas Brightwell
- When Automated Assessment Meets Automated Content Generation: Examining Text Quality In The Era Of Gpts Bevilacqua Marialena, Oketch Kezia, Qin Ruiyang, Stamey Will, Zhang Xinyuan, Gan Yi, Yang Kai, Abbasi Ahmed
- Pre-training Llms Using Human-like Development Data Corpus Bhardwaj Khushi, Shah Raj Sanjay, Varma Sashank
- Language Model Unalignment: Parametric Red-teaming To Expose Hidden Harms And Biases Bhardwaj Rishabh, Poria Soujanya
- Red-teaming Large Language Models Using Chain Of Utterances For Safety-alignment Bhardwaj Rishabh, Poria Soujanya
- Investigating Answerability Of Llms For Long-form Question Answering Bhat Meghana Moorthy, Meng Rui, Liu Ye, Zhou Yingbo, Yavuz Semih
- TART: A Plug-and-play Transformer Module For Task-agnostic Reasoning Bhatia Kush, Narayan Avanika, De Sa Christopher, Ré Christopher
- Automating Question Generation From Educational Text Bhowmick Ayan Kumar, Jagmohan Ashish, Vempaty Aditya, Dey Prasenjit, Hall Leigh, Hartman Jeremy, Kokku Ravi, Maheshwari Hema
- Chatgpt Is A Knowledgeable But Inexperienced Solver: An Investigation Of Commonsense Problem In Large Language Models Bian Ning, Han Xianpei, Sun Le, Lin Hongyu, Lu Yaojie, He Ben, Jiang Shanshan, Dong Bin
- Loquacity And Visible Emotion: Chatgpt As A Policy Advisor Biancotti Claudia, Camassa Carolina
- Birth Of A Transformer: A Memory Viewpoint Bietti Alberto, Cabannes Vivien, Bouchacourt Diane, Jegou Herve, Bottou Leon
- In-context Learning Dynamics With Random Binary Sequences Bigelow Eric J., Lubana Ekdeep Singh, Dick Robert P., Tanaka Hidenori, Ullman Tomer D.
- VIGC: Visual Instruction Generation And Correction Bin Wang, Fan Wu, Xiao Han, Jiahui Peng, Huaping Zhong, Pan Zhang, Xiaoyi Dong, Weijia Li, Wei Li, Jiaqi Wang, Conghui He
- How Should The Advent Of Large Language Models Affect The Practice Of Science? Binz Marcel, Alaniz Stephan, Roskies Adina, Aczel Balazs, Bergstrom Carl T., Allen Colin, Schad Daniel, Wulff Dirk, West Jevin D., Zhang Qiong, Shiffrin Richard M., Gershman Samuel J., Popov Ven, Bender Emily M., Marelli Marco, Botvinick Matthew M., Akata Zeynep, Schulz Eric
- Model Leeching: An Extraction Attack Targeting Llms Birch Lewis, Hackett William, Trawicki Stefan, Suri Neeraj, Garraghan Peter
- Can Chatgpt Be Your Personal Medical Assistant? Biswas Md. Rafiul, Islam Ashhadul, Shah Zubair, Zaghouani Wajdi, Belhaouari Samir Brahim
- Distractor Generation For Multiple-choice Questions With Predictive Prompting And Large Language Models Bitew Semere Kiros, Deleu Johannes, Develder Chris, Demeester Thomas
- Visit-bench: A Benchmark For Vision-language Instruction Following Inspired By Real-world Use Bitton Yonatan, Bansal Hritik, Hessel Jack, Shao Rulin, Zhu Wanrong, Awadalla Anas, Gardner Josh, Taori Rohan, Schmidt Ludwig
- How Close Is Chatgpt To Human Experts? Comparison Corpus, Evaluation, And Detection Biyang Guo, Xin Zhang, Ziyuan Wang, Minqi Jiang, Jinran Nie, Yuxuan Ding, Jianwei Yue, Yupeng Wu
- BLT: Can Large Language Models Handle Basic Legal Text? Blair-stanek Andrew, Holzenberger Nils, Van Durme Benjamin
- Chip-chat: Challenges And Opportunities In Conversational Hardware Design Blocklove Jason, Garg Siddharth, Karri Ramesh, Pearce Hammond
- RWKV: Reinventing Rnns For The Transformer Era Bo Peng, Eric Alcaide, Quentin Anthony, Alon Albalak, Samuel Arcadinho, Stella Biderman, Huanqi Cao, Xin Cheng, Michael Chung, Matteo Grella, Kranthi Kiran Gv, Xuzheng He, Haowen Hou, Jiaju Lin, Przemyslaw Kazienko, Jan Kocon, Jiaming Kong, Bartlomiej Koptyra, Hayden Lau, Krishna Sri Ipsit Mantri, Ferdinand Mom, Atsushi Saito, Guangyu Song, Xiangru Tang, Bolun Wang, Johan S. Wind, Stanislaw Wozniak, Ruichong Zhang, Zhenyuan Zhang, Qihang Zhao, Peng Zhou, Qinghua Zhou, Jian Zhu, Rui-jie Zhu
- Personality Testing Of Large Language Models: Limited Temporal Stability, But Highlighted Prosociality Bodroza Bojana, Dinic Bojana M., Bojic Ljubisa
- Halle-control: Controlling Object Hallucination In Large Multimodal Models Bohan Zhai, Shijia Yang, Chenfeng Xu, Sheng Shen, Kurt Keutzer, Chunyuan Li, Manling Li
- GPT-4 Surpassing Human Performance In Linguistic Pragmatics Bojic Ljubisa, Kovacevic Predrag, Cabarkapa Milan
- Ask Language Model To Clean Your Noisy Translation Data Bolding Quinten, Liao Baohao, Denis Brandon James, Luo Jun, Monz Christof
- Pixlore: A Dataset-driven Approach To Rich Image Captioning Bonilla Diego
- A Categorical Archive Of Chatgpt Failures Borji Ali
- Soundstorm: Efficient Parallel Audio Generation Borsos Zalán, Sharifi Matt, Vincent Damien, Kharitonov Eugene, Zeghidour Neil, Tagliasacchi Marco
- Analyzing Multilingual Competency Of Llms In Multi-turn Instruction Following: A Case Study Of Arabic Boughorbel Sabri, Hawasly Majd
- Eight Things To Know About Large Language Models Bowman Samuel R.
- Inpars-light: Cost-effective Unsupervised Training Of Efficient Rankers Boytsov Leonid, Patel Preksha, Sourabh Vivek, Nisar Riddhi, Kundu Sayani, Ramanathan Ramya, Nyberg Eric
- Chemcrow: Augmenting Large-language Models With Chemistry Tools Bran Andres M, Cox Sam, Schilter Oliver, Baldassari Carlo, White Andrew D, Schwaller Philippe
- Evaluating Large Language Models For Document-grounded Response Generation In Information-seeking Dialogues Braunschweiler Norbert, Doddipatla Rama, Keizer Simon, Stoyanchev Svetlana
- Extractgpt: Exploring The Potential Of Large Language Models For Product Attribute Value Extraction Brinkmann Alexander, Shraga Roee, Bizer Christian
- Fine-grained Affective Processing Capabilities Emerging From Large Language Models Broekens Joost, Hilpert Bernhard, Verberne Suzan, Baraka Kim, Gebhard Patrick, Plaat Aske
- Efficient Transformer Knowledge Distillation: A Performance Review Brown Nathan, Williamson Ashton, Anderson Tahj, Lawrence Logan
- Accuracy Of A Vision-language Model On Challenging Medical Cases Buckley Thomas, Diao James A., Rodman Adam, Manrai Arjun K.
- Utilizing Chatgpt Generated Data To Retrieve Depression Symptoms From Social Media Bucur Ana-maria
- Codetf: One-stop Transformer Library For State-of-the-art Code LLM Bui Nghi D. Q., Le Hung, Wang Yue, Li Junnan, Gotmare Akhilesh Deepak, Hoi Steven C. H.
- Finding The Needle In A Haystack: Unsupervised Rationale Extraction From Long Text Classifiers Bujel Kamil, Caines Andrew, Yannakoudakis Helen, Rei Marek
- Generative AI Assistants In Software Development Education: A Vision For Integrating Generative AI Into Educational Practice, Not Instinctively Defending Against It Bull Christopher, Kharrufa Ahmed
- Local Large Language Models For Complex Structured Medical Tasks Bumgardner V. K. Cody, Mullen Aaron, Armstrong Sam, Hickey Caylin, Talbert Jeff
- Revealing The Structure Of Language Model Capabilities Burnell Ryan, Hao Han, Conway Andrew R. A., Orallo Jose Hernandez
- Weak-to-strong Generalization: Eliciting Strong Capabilities With Weak Supervision Burns Collin, Izmailov Pavel, Kirchner Jan Hendrik, Baker Bowen, Gao Leo, Aschenbrenner Leopold, Chen Yining, Ecoffet Adrien, Joglekar Manas, Leike Jan, Sutskever Ilya, Wu Jeff
- A Comparative Study Of Code Generation Using Chatgpt 3.5 Across 10 Programming Languages Buscemi Alessio
- Just Tell Me: Prompt Engineering In Business Process Management Busch Kiran, Rochlitzer Alexander, Sola Diana, Leopold Henrik
- Incremental Learning Of Humanoid Robot Behavior From Natural Interaction And Large Language Models Bärmann Leonard, Kartmann Rainer, Peller-konrad Fabian, Niehues Jan, Waibel Alex, Asfour Tamim
- Pokemonchat: Auditing Chatgpt For Pokémon Universe Knowledge Cabello Laura, Li Jiaang, Chalkidis Ilias
- Development Of The Chatgpt, Generative Artificial Intelligence And Natural Large Language Models For Accountable Reporting And Use (CANGARU) Guidelines Cacciamani Giovanni E., Eppler Michael B., Ganjavi Conner, Pekan Asli, Biedermann Brett, Collins Gary S., Gill Inderbir S.
- Peek Across: Improving Multi-document Modeling Via Cross-document Question-answering Caciularu Avi, Peters Matthew E., Goldberger Jacob, Dagan Ido, Cohan Arman
- Scaling In-context Demonstrations With Structured Attention Cai Tianle, Huang Kaixuan, Lee Jason D., Wang Mengdi
- Large Language Models As Tool Makers Cai Tianle, Wang Xuezhi, Ma Tengyu, Chen Xinyun, Zhou Denny
- CLAP: Isolating Content From Style Through Contrastive Learning With Augmented Prompts Cai Yichao, Liu Yuhang, Zhang Zhen, Shi Javen Qinfeng
- Do Large Language Models Resemble Humans In Language Use? Cai Zhenguang G., Duan Xufeng, Haslett David A., Wang Shuqi, Pickering Martin J.
- On The Application Of Large Language Models For Language Teaching And Assessment Technology Caines Andrew, Benedetto Luca, Taslimipoor Shiva, Davis Christopher, Gao Yuan, Andersen Oeistein, Yuan Zheng, Elliott Mark, Moore Russell, Bryant Christopher, Rei Marek, Yannakoudakis Helen, Mullooly Andrew, Nicholls Diane, Buttery Paula
- A Systematic Study Of Knowledge Distillation For Natural Language Generation With Pseudo-target Training Calderon Nitay, Mukherjee Subhabrata, Reichart Roi, Kantor Amir
- Localizing Lying In Llama: Understanding Instructed Dishonesty On True-false Questions Through Prompting, Probing, And Patching Campbell James, Ren Richard, Guo Phillip
- Can Transformers Learn Sequential Function Classes In Context? Campbell Ryan, Guo Emma, Hu Evan, Vir Reya, Hsiao Ethan
- Oberta: Improving Sparse Transfer Learning Via Improved Initialization, Distillation, And Pruning Regimes Campos Daniel, Marques Alexandre, Kurtz Mark, Zhai Chengxiang
- Wizardlm: Empowering Large Language Models To Follow Complex Instructions Can Xu, Qingfeng Sun, Kai Zheng, Xiubo Geng, Pu Zhao, Jiazhan Feng, Chongyang Tao, Daxin Jiang
- H2O Open Ecosystem For State-of-the-art Large Language Models Candel Arno, Mckinney Jon, Singer Philipp, Pfeiffer Pascal, Jeblick Maximilian, Lee Chun Ming, Conde Marcos V.
- H2ogpt: Democratizing Large Language Models Candel Arno, Mckinney Jon, Singer Philipp, Pfeiffer Pascal, Jeblick Maximilian, Prabhu Prithvi, Gambera Jeff, Landry Mark, Bansal Shivam, Chesler Ryan, Lee Chun Ming, Conde Marcos V., Stetsenko Pasha, Grellier Olivier, Ambati Srisatish
- The Life Cycle Of Knowledge In Big Language Models: A Survey Cao Boxi, Lin Hongyu, Han Xianpei, Sun Le
- Are Large Language Models Good Fact Checkers: A Preliminary Study Cao Han, Wei Lingwei, Chen Mengyang, Zhou Wei, Hu Songlin
- A Step Closer To Comprehensive Answers: Constrained Multi-stage Question Decomposition With Large Language Models Cao Hejing, An Zhenwei, Feng Jiazhan, Xu Kun, Chen Liwei, Zhao Dongyan
- Systematic Rectification Of Language Models Via Dead-end Analysis Cao Meng, Fatemi Mehdi, Cheung Jackie Chi Kit, Shabanian Samira
- Unnatural Error Correction: GPT-4 Can Almost Perfectly Handle Unnatural Scrambled Text Cao Qi, Kojima Takeshi, Matsuo Yutaka, Iwasawa Yusuke
- Pumer: Pruning And Merging Tokens For Efficient Vision Language Models Cao Qingqing, Paranjape Bhargavi, Hajishirzi Hannaneh
- Exploring The Effectiveness Of Chatgpt-based Feedback Compared With Teacher Feedback And Self-feedback: Evidence From Chinese To English Translation Cao Siyi, Zhong Linping
- How Does AI Chat Change Search Behaviors? Capra Robert, Arguello Jaime
- Assessing Large Language Models' Ability To Predict How Humans Balance Self-interest And The Interest Of Others Capraro Valerio, Di Paolo Roberto, Pizziol Veronica
- News Verifiers Showdown: A Comparative Performance Evaluation Of Chatgpt 3.5, Chatgpt 4.0, Bing AI, And Bard In News Fact-checking Caramancion Kevin Matthe
- A LLM Assisted Exploitation Of Ai-guardian Carlini Nicholas
- Revolutionizing Mobile Interaction: Enabling A 3 Billion Parameter GPT LLM On Mobile Carreira Samuel, Marques Tomás, Ribeiro José, Grilo Carlos
- Promoting Generalized Cross-lingual Question Answering In Few-resource Scenarios Via Self-knowledge Distillation Carrino Casimiro Pio, Escolano Carlos, Fonollosa José A. R.
- Grounding Large Language Models In Interactive Environments With Online Reinforcement Learning Carta Thomas, Romac Clément, Wolf Thomas, Lamprier Sylvain, Sigaud Olivier, Oudeyer Pierre-yves
- Can It Edit? Evaluating The Ability Of Large Language Models To Follow Code Editing Instructions Cassano Federico, Li Luisa, Sethi Akul, Shinn Noah, Brennan-jones Abby, Ginesin Jacob, Berman Edward, Chakhnashvili George, Lozhkov Anton, Anderson Carolyn Jane, Guha Arjun
- Examining The Influence Of Varied Levels Of Domain Knowledge Base Inclusion In Gpt-based Intelligent Tutors Castleman Blake, Turkcan Mehmet Kerem
- Chatgpt To Replace Crowdsourcing Of Paraphrases For Intent Classification: Higher Diversity And Comparable Model Robustness Cegin Jan, Simko Jakub, Brusilovsky Peter
- Do Androids Know They're Only Dreaming Of Electric Sheep? Ch-wang Sky, Van Durme Benjamin, Eisner Jason, Kedzie Chris
- Exploring Answer Information Methods For Question Generation With Transformers Chafekar Talha, Hussain Aafiya, Sharma Grishma, Sharma Deepak
- An Empirical Study Of Using Chatgpt For Fact Verification Task Chakraborty Mohna, Kulkarni Adithya, Li Qi
- Chatgpt May Pass The Bar Exam Soon, But Has A Long Way To Go For The Lexglue Benchmark Chalkidis Ilias
- Chatgpt Evaluation On Sentence Level Relations: A Focus On Temporal, Causal, And Discourse Relations Chan Chunkit, Cheng Jiayang, Wang Weiqi, Jiang Yuxin, Fang Tianqing, Liu Xin, Song Yangqiu
- Which Spurious Correlations Impact Reasoning In NLI Models? A Visual Interactive Diagnosis Through Data-constrained Counterfactuals Chan Robin, Amini Afra, El-assady Mennatallah
- Muse: Text-to-image Generation Via Masked Generative Transformers Chang Huiwen, Zhang Han, Barber Jarred, Maschinot Aj, Lezama Jose, Jiang Lu, Yang Ming-hsuan, Murphy Kevin, Freeman William T., Rubinstein Michael, Li Yuanzhen, Krishnan Dilip
- Learning To Generate Better Than Your LLM Chang Jonathan D., Brantley Kiante, Ramamurthy Rajkumar, Misra Dipendra, Sun Wen
- Chipgpt: How Far Are We From Natural Language Hardware Design Chang Kaiyan, Wang Ying, Ren Haimeng, Wang Mengdi, Liang Shengwen, Han Yinhe, Li Huawei, Li Xiaowei
- Language Model Behavior: A Comprehensive Survey Chang Tyler A., Bergen Benjamin K.
- Hallucination Augmented Contrastive Learning For Multimodal Large Language Model Chaoya Jiang, Haiyang Xu, Mengfan Dong, Jiaxing Chen, Wei Ye, Ming Yan, Qinghao Ye, Ji Zhang, Fei Huang, Shikun Zhang
- Memgpt: Towards Llms As Operating Systems Charles Packer, Sarah Wooders, Kevin Lin, Vivian Fang, Shishir G. Patil, Ion Stoica, Joseph E. Gonzalez
- Not All Layers Are Equally As Important: Every Layer Counts BERT Charpentier Lucas Georges Gabriel, Samuel David
- She Had Cobalt Blue Eyes: Prompt Testing To Create Aligned And Sustainable Language Models Chatrath Veronica, Bamgbose Oluwanifemi, Raza Shaina
- Cabbage Sweeter Than Cake? Analysing The Potential Of Large Language Models For Learning Conceptual Spaces Chatterjee Usashi, Gajbhiye Amit, Schockaert Steven
- Efficient GPT Model Pre-training Using Tensor Train Matrix Representation Chekalina Viktoriia, Novikov Georgii, Gusak Julia, Oseledets Ivan, Panchenko Alexander
- Two Failures Of Self-consistency In The Multi-step Reasoning Of Llms Chen Angelica, Phang Jason, Parrish Alicia, Padmakumar Vishakh, Zhao Chen, Bowman Samuel R., Cho Kyunghyun
- Sudden Drops In The Loss: Syntax Acquisition, Phase Transitions, And Simplicity Bias In Mlms Chen Angelica, Shwartz-ziv Ravid, Cho Kyunghyun, Leavitt Matthew L., Saphra Naomi
- Fireact: Toward Language Agent Fine-tuning Chen Baian, Shu Chang, Shareghi Ehsan, Collier Nigel, Narasimhan Karthik, Yao Shunyu
- Dual-space Hierarchical Learning For Goal-guided Conversational Recommendation Chen Can, Liu Hao, Liu Zeming, Liu Xue, Dou Dejing
- Generate Labeled Training Data Using Prompt Programming And GPT-3. An Example Of Big Five Personality Classification Chen Eason
- Gptutor: A Chatgpt-powered Programming Tool For Code Explanation Chen Eason, Huang Ray, Chen Han-shin, Tseng Yuen-hsien, Li Liang-yi
- X-LLM: Bootstrapping Advanced Large Language Models By Treating Multi-modalities As Foreign Languages Chen Feilong, Han Minglun, Zhao Haozhi, Zhang Qingyang, Shi Jing, Xu Shuang, Xu Bo
- Tem-adapter: Adapting Image-text Pretraining For Video Question Answer Chen Guangyi, Liu Xiao, Wang Guangrun, Zhang Kun, Torr Philip H. S., Zhang Xiao-ping, Tang Yansong
- CLEX: Continuous Length Extrapolation For Large Language Models Chen Guanzheng, Li Xin, Meng Zaiqiao, Liang Shangsong, Bing Lidong
- Videollm: Modeling Video Sequence With Large Language Models Chen Guo, Zheng Yin-dong, Wang Jiahao, Xu Jilan, Huang Yifei, Pan Junting, Wang Yi, Wang Yali, Qiao Yu, Lu Tong, Wang Limin
- Chatgpt's One-year Anniversary: Are Open-source Large Language Models Catching Up? Chen Hailin, Jiao Fangkai, Li Xingxuan, Qin Chengwei, Ravaut Mathieu, Zhao Ruochen, Xiong Caiming, Joty Shafiq
- Personalised Distillation: Empowering Open-sourced Llms With Adaptive Learning For Code Generation Chen Hailin, Saha Amrita, Hoi Steven, Joty Shafiq
- Maybe Only 0.5% Data Is Needed: A Preliminary Exploration Of Low Training Data Instruction Tuning Chen Hao, Zhang Yiming, Zhang Qi, Yang Hantao, Hu Xiaomeng, Ma Xuetao, Yanggong Yifan, Zhao Junbo
- Interpreting And Controlling Vision Foundation Models Via Text Explanations Chen Haozhe, Yang Junfeng, Vondrick Carl, Mao Chengzhi
- MCC-KD: Multi-cot Consistent Knowledge Distillation Chen Hongzhan, Wu Siyue, Quan Xiaojun, Wang Rui, Yan Ming, Zhang Ji
- Walking Down The Memory Maze: Beyond Context Limit Through Interactive Reading Chen Howard, Pasunuru Ramakanth, Weston Jason, Celikyilmaz Asli
- Unlearn What You Want To Forget: Efficient Unlearning For Llms Chen Jiaao, Yang Diyi
- Fairytalecqa: Integrating A Commonsense Knowledge Graph Into Children's Storybook Narratives Chen Jiaju, Lu Yuxuan, Zhang Shao, Yao Bingsheng, Dong Yuanzhe, Xu Ying, Li Yunyao, Wang Qianwen, Wang Dakuo, Sun Yuling
- Quantifying Uncertainty In Answers From Any Language Model And Enhancing Their Trustworthiness Chen Jiuhai, Mueller Jonas
- Huatuogpt-ii, One-stage Training For Medical Adaption Of Llms Chen Junying, Wang Xidong, Gao Anningzhe, Jiang Feng, Chen Shunian, Zhang Hongbo, Song Dingjie, Xie Wenya, Kong Chuyi, Li Jianquan, Wan Xiang, Li Haizhou, Wang Benyou
- Forgetful Large Language Models: Lessons Learned From Using Llms In Robot Programming Chen Juo-tung, Huang Chien-ming
- Reconcile: Round-table Conference Improves Reasoning Via Consensus Among Diverse Llms Chen Justin Chih-yao, Saha Swarnadeep, Bansal Mohit
- Alpagasus: Training A Better Alpaca With Fewer Data Chen Lichang, Li Shiyang, Yan Jun, Wang Hai, Gunaratna Kalpa, Yadav Vikas, Tang Zheng, Srinivasan Vijay, Zhou Tianyi, Huang Heng, Jin Hongxia
- Driving With Llms: Fusing Object-level Vector Modality For Explainable Autonomous Driving Chen Long, Sinavski Oleg, Hünermann Jan, Karnsund Alice, Willmott Andrew James, Birch Danny, Maund Daniel, Shotton Jamie
- Few-shot Data Synthesis For Open Domain Multi-hop Question Answering Chen Mingda, Chen Xilun, Yih Wen-tau
- Natural Response Generation For Chinese Reading Comprehension Chen Nuo, Li Hongguang, Bao Yinan, Wang Baoyuan, Li Jia
- From Good To Great: Improving Math Reasoning With Tool-augmented Interleaf Prompting Chen Nuo, Li Hongguang, Wang Baoyuan, Li Jia
- Breaking Language Barriers In Multilingual Mathematical Reasoning: Insights And Observations Chen Nuo, Zheng Zinan, Wu Ning, Gong Ming, Song Yangqiu, Zhang Dongmei, Li Jia
- Dynamic Strategy Chain: Dynamic Zero-shot Cot For Long Mental Health Support Generation Chen Qi, Liu Dexi
- Webvln: Vision-and-language Navigation On Websites Chen Qi, Pitawela Dileepa, Zhao Chongyang, Zhou Gengze, Chen Hsiang-ting, Wu Qi
- Evaluation Of Chatgpt Family Of Models For Biomedical Reasoning And Classification Chen Shan, Li Yingya, Lu Sheng, Van Hoang, Aerts Hugo Jwl, Savova Guergana K., Bitterman Danielle S.
- Evaluating Factual Consistency Of Summaries With Large Language Models Chen Shiqi, Gao Siyang, He Junxian
- Extending Context Window Of Large Language Models Via Positional Interpolation Chen Shouyuan, Wong Sherman, Chen Liangjian, Tian Yuandong
- Understanding And Improving In-context Learning On Vision-language Models Chen Shuo, Han Zhen, He Bailan, Buckley Mark, Torr Philip, Tresp Volker, Gu Jindong
- Chatpipe: Orchestrating Data Preparation Program By Optimizing Human-chatgpt Interactions Chen Sibei, Liu Hanbing, Jin Weiting, Sun Xiangyu, Feng Xiaoyao, Fan Ju, Du Xiaoyong, Tang Nan
- Llm-empowered Chatbots For Psychiatrist And Patient Simulation: Application And Evaluation Chen Siyuan, Wu Mengyue, Zhu Kenny Q., Lan Kunyao, Zhang Zhiling, Cui Lyuchun
- Enhancing Robot Program Synthesis Through Environmental Context Chen Tianyi, Wang Qidi, Dong Zhen, Shen Liwei, Peng Xin
- Theoremqa: A Theorem-driven Question Answering Dataset Chen Wenhu, Yin Ming, Ku Max, Lu Pan, Wan Yixin, Ma Xueguang, Xu Jianyu, Wang Xinyi, Xia Tony
- Reason Out Your Layout: Evoking The Layout Master From Large Language Models For Text-to-image Synthesis Chen Xiaohui, Liu Yongfei, Yang Yingxiang, Yuan Jianbo, You Quanzeng, Liu Li-ping, Yang Hongxia
- Challenges And Contributing Factors In The Utilization Of Large Language Models (llms) Chen Xiaoliang, Li Liangbin, Chang Le, Huang Yunhe, Zhao Yuxuan, Zhang Yuxiao, Li Dinuo
- The Janus Interface: How Fine-tuning In Large Language Models Amplifies The Privacy Risks Chen Xiaoyi, Tang Siyuan, Zhu Rui, Yan Shijun, Jin Lei, Wang Zihao, Su Liya, Zhang Zhikun, Wang Xiaofeng, Tang Haixu
- Asking Before Acting: Gather Information In Embodied Decision Making With Language Models Chen Xiaoyu, Zhang Shenao, Zhang Pushi, Zhao Li, Chen Jianyu
- Autoeval-video: An Automatic Benchmark For Assessing Large Vision Language Models In Open-ended Video Question Answering Chen Xiuyuan, Lin Yuan, Zhang Yuchen, Huang Weiran
- Medprompt: Cross-modal Prompting For Multi-task Medical Image Translation Chen Xuhang, Pun Chi-man, Wang Shuqiang
- Do Models Explain Themselves? Counterfactual Simulatability Of Natural Language Explanations Chen Yanda, Zhong Ruiqi, Ri Narutatsu, Zhao Chen, He He, Steinhardt Jacob, Yu Zhou, Mckeown Kathleen
- Translation And Fusion Improves Zero-shot Cross-lingual Information Extraction Chen Yang, Shah Vedaant, Ritter Alan
- Bianque: Balancing The Questioning And Suggestion Ability Of Health Llms With Multi-turn Health Conversations Polished By Chatgpt Chen Yirong, Wang Zhenyu, Xing Xiaofen, Zheng Huimin, Xu Zhipei, Fang Kai, Wang Junhong, Li Sihang, Wu Jieling, Liu Qi, Xu Xiangmin
- Prompt To GPT-3: Step-by-step Thinking Instructions For Humor Generation Chen Yuetian, Shi Bowen, Si Mei
- Fortify The Shortest Stave In Attention: Enhancing Context Awareness Of Large Language Models For Effective Tool Use Chen Yuhan, Lv Ang, Lin Ting-en, Chen Changyu, Wu Yuchuan, Huang Fei, Li Yongbin, Yan Rui
- Assessing The Impact Of Prompting Methods On Chatgpt's Mathematical Capabilities Chen Yuhao, Wong Chloe, Yang Hanwen, Aguenza Juan, Bhujangari Sai, Vu Benthan, Lei Xun, Prasad Amisha, Fluss Manny, Phuong Eric, Liu Minghao, Kumar Raja, Vats Vanshika, Davis James
- Confidant: Customizing Transformer-based Llms Via Collaborative Edge Training Chen Yuhao, Yan Yuxuan, Yang Qianqian, Shu Yuanchao, He Shibo, Chen Jiming
- Token Prediction As Implicit Classification To Identify Llm-generated Text Chen Yutian, Kang Hao, Zhai Vivian, Li Liangze, Singh Rita, Raj Bhiksha
- MEDITRON-70B: Scaling Medical Pretraining For Large Language Models Chen Zeming, Cano Alejandro Hernández, Romanou Angelika, Bonnet Antoine, Matoba Kyle, Salvi Francesco, Pagliardini Matteo, Fan Simin, Köpf Andreas, Mohtashami Amirkeivan, Sallinen Alexandre, Sakhaeirad Alireza, Swamy Vinitra, Krawczuk Igor, Bayazit Deniz, Marmet Axel, Montariol Syrielle, Hartley Mary-anne, Jaggi Martin, Bosselut Antoine
- Chatcot: Tool-augmented Chain-of-thought Reasoning On Chat-based Large Language Models Chen Zhipeng, Zhou Kun, Zhang Beichen, Gong Zheng, Zhao Wayne Xin, Wen Ji-rong
- Lmexplainer: Grounding Knowledge And Explaining Language Models Chen Zichen, Chen Jianda, Chen Yuanyuan, Yu Han, Singh Ambuj K, Sra Misha
- Xplainllm: A QA Explanation Dataset For Understanding LLM Decision-making Chen Zichen, Chen Jianda, Gaidhani Mitali, Singh Ambuj, Sra Misha
- Visual Chatgpt: Talking, Drawing And Editing With Visual Foundation Models Chenfei Wu, Shengming Yin, Weizhen Qi, Xiaodong Wang, Zecheng Tang, Nan Duan
- UPRISE: Universal Prompt Retrieval For Improving Zero-shot Evaluation Cheng Daixuan, Huang Shaohan, Bi Junyu, Zhan Yuefeng, Liu Jianfeng, Wang Yujing, Sun Hao, Wei Furu, Deng Denvy, Zhang Qi
- Black-box Prompt Optimization: Aligning Large Language Models Without Model Training Cheng Jiale, Liu Xiao, Zheng Kehan, Ke Pei, Wang Hongning, Dong Yuxiao, Tang Jie, Huang Minlie
- Egothink: Evaluating First-person Perspective Thinking Capability Of Vision-language Models Cheng Sijie, Guo Zhicheng, Wu Jingwen, Fang Kechen, Li Peng, Liu Huaping, Liu Yang
- SCALE: Synergized Collaboration Of Asymmetric Language Translation Engines Cheng Xin, Wang Xun, Ge Tao, Chen Si-qing, Wei Furu, Zhao Dongyan, Yan Rui
- Batch Prompting: Efficient Inference With Large Language Model Apis Cheng Zhoujun, Kasai Jungo, Yu Tao
- Chatdb: Augmenting Llms With Databases As Their Symbolic Memory Chenxu Hu, Jie Fu, Chenzhuang Du, Simian Luo, Junbo Zhao, Hang Zhao
- Latent Positional Information Is In The Self-attention Variance Of Transformer Language Models Without Positional Embeddings Chi Ta-chung, Fan Ting-han, Chen Li-wei, Rudnicky Alexander I., Ramadge Peter J.
- Attention Alignment And Flexible Positional Embeddings Improve Transformer Length Extrapolation Chi Ta-chung, Fan Ting-han, Rudnicky Alexander I.
- Transformer Working Memory Enables Regular Language Reasoning And Natural Language Length Extrapolation Chi Ta-chung, Fan Ting-han, Rudnicky Alexander I., Ramadge Peter J.
- M²chat: Empowering VLM For Multimodal LLM Interleaved Text-image Generation Chi Xiaowei, Zhang Rongyu, Jiang Zhengkai, Liu Yijiang, Wang Yatian, Qi Xingqun, Luo Wenhan, Gao Peng, Zhang Shanghang, Liu Qifeng, Guo Yike
- INSTRUCTEVAL: Towards Holistic Evaluation Of Instruction-tuned Large Language Models Chia Yew Ken, Hong Pengfei, Bing Lidong, Poria Soujanya
- A Closer Look Into Automatic Evaluation Using Large Language Models Chiang Cheng-han, Lee Hung-yi
- An Integrative Survey On Mental Health Conversational Agents To Bridge Computer Science And Medical Perspectives Cho Young Min, Rai Sunny, Ungar Lyle, Sedoc João, Guntuku Sharath Chandra
- Language Grounded Qformer For Efficient Vision Language Understanding Choraria Moulik, Sekhar Nitesh, Wu Yue, Zhang Xu, Singhal Prateek, Varshney Lav R.
- Generative Data Augmentation Using Llms Improves Distributional Robustness In Question Answering Chowdhury Arijit Ghosh, Chadha Aman
- Apollo: Unified Adapter And Prompt Learning For Vision Language Models Chowdhury Sanjoy, Nag Sayan, Manocha Dinesh
- Navigate Through Enigmatic Labyrinth A Survey Of Chain Of Thought Reasoning: Advances, Frontiers And Future Chu Zheng, Chen Jingchang, Chen Qianglong, Yu Weijiang, He Tao, Wang Haotian, Peng Weihua, Liu Ming, Qin Bing, Liu Ting
- Timebench: A Comprehensive Evaluation Of Temporal Reasoning Abilities In Large Language Models Chu Zheng, Chen Jingchang, Chen Qianglong, Yu Weijiang, Wang Haotian, Liu Ming, Qin Bing
- Tutorials On Stance Detection Using Pre-trained Language Models: Fine-tuning BERT And Prompting Large Language Models Chuang Yun-shiuan
- Dola: Decoding By Contrasting Layers Improves Factuality In Large Language Models Chuang Yung-sung, Xie Yujia, Luo Hongyin, Kim Yoon, Glass James, He Pengcheng
- Long Story Short: A Summarize-then-search Method For Long Video Question Answering Chung Jiwan, Yu Youngjae
- LIMA: Less Is More For Alignment Chunting Zhou, Pengfei Liu, Puxin Xu, Srini Iyer, Jiao Sun, Yuning Mao, Xuezhe Ma, Avia Efrat, Ping Yu, Lili Yu, Susan Zhang, Gargi Ghosh, Mike Lewis, Luke Zettlemoyer, Omer Levy
- Safurai 001: New Qualitative Approach For Code LLM Evaluation Cifarelli Davide, Boiardi Leonardo, Puppo Alessandro
- Remember What You Did So You Know What To Do Next Ciosici Manuel R., Hedges Alex, Kankanampati Yash, Martin Justin, Freedman Marjorie, Weischedel Ralph
- Galactic Chitchat: Using Large Language Models To Converse With Astronomy Literature Ciucă Ioana, Ting Yuan-sen
- Harnessing The Power Of Adversarial Prompting And Large Language Models For Robust Hypothesis Generation In Astronomy Ciucă Ioana, Ting Yuan-sen, Kruk Sandor, Iyer Kartheik
- Cognitively Inspired Components For Social Conversational Agents Clay Alex, Alonso Eduardo, Mondragón Esther
- BERT Lost Patience Won't Be Robust To Adversarial Slowdown Coalson Zachary, Ritter Gabriel, Bobba Rakesh, Hong Sanghyun
- Inducing Anxiety In Large Language Models Increases Exploration And Bias Coda-forno Julian, Witte Kristin, Jagadish Akshay K., Binz Marcel, Akata Zeynep, Schulz Eric
- Evaluating Language Models For Mathematics Through Interactions Collins Katherine M., Jiang Albert Q., Frieder Simon, Wong Lionel, Zilka Miri, Bhatt Umang, Lukasiewicz Thomas, Wu Yuhuai, Tenenbaum Joshua B., Hart William, Gowers Timothy, Li Wenda, Weller Adrian, Jamnik Mateja
- Dr. Jekyll And Mr. Hyde: Two Faces Of Llms Collu Matteo Gioele, Janssen-groesbeek Tom, Koffas Stefanos, Conti Mauro, Picek Stjepan
- Strong And Efficient Baselines For Open Domain Conversational Question Answering Coman Andrei C., Barlacchi Gianni, De Gispert Adrià
- Seamlessm4t: Massively Multilingual & Multimodal Machine Translation Communication Seamless, Barrault Loïc, Chung Yu-an, Meglioli Mariano Cora, Dale David, Dong Ning, Duquenne Paul-ambroise, Elsahar Hady, Gong Hongyu, Heffernan Kevin, Hoffman John, Klaiber Christopher, Li Pengwei, Licht Daniel, Maillard Jean, Rakotoarison Alice, Sadagopan Kaushik Ram, Wenzek Guillaume, Ye Ethan, Akula Bapi, Chen Peng-jen, Hachem Naji El, Ellis Brian, Gonzalez Gabriel Mejia, Haaheim Justin, Hansanti Prangthip, Howes Russ, Huang Bernie, Hwang Min-jae, Inaguma Hirofumi, Jain Somya, Kalbassi Elahe, Kallet Amanda, Kulikov Ilia, Lam Janice, Li Daniel, Ma Xutai, Mavlyutov Ruslan, Peloquin Benjamin, Ramadan Mohamed, Ramakrishnan Abinesh, Sun Anna, Tran Kevin, Tran Tuan, Tufanov Igor, Vogeti Vish, Wood Carleigh, Yang Yilin, Yu Bokai, Andrews Pierre, Balioglu Can, Costa-jussà Marta R., Celebi Onur, Elbayad Maha, Gao Cynthia, Guzmán Francisco, Kao Justine, Lee Ann, Mourachko Alexandre, Pino Juan, Popuri Sravya, Ropers Christophe, Saleem Safiyyah, Schwenk Holger, Tomasello Paden, Wang Changhan, Wang Jeff, Wang Skyler
- Seamless: Multilingual Expressive And Streaming Speech Translation Communication Seamless, Barrault Loïc, Chung Yu-an, Meglioli Mariano Coria, Dale David, Dong Ning, Duppenthaler Mark, Duquenne Paul-ambroise, Ellis Brian, Elsahar Hady, Haaheim Justin, Hoffman John, Hwang Min-jae, Inaguma Hirofumi, Klaiber Christopher, Kulikov Ilia, Li Pengwei, Licht Daniel, Maillard Jean, Mavlyutov Ruslan, Rakotoarison Alice, Sadagopan Kaushik Ram, Ramakrishnan Abinesh, Tran Tuan, Wenzek Guillaume, Yang Yilin, Ye Ethan, Evtimov Ivan, Fernandez Pierre, Gao Cynthia, Hansanti Prangthip, Kalbassi Elahe, Kallet Amanda, Kozhevnikov Artyom, Gonzalez Gabriel Mejia, Roman Robin San, Touret Christophe, Wong Corinne, Wood Carleigh, Yu Bokai, Andrews Pierre, Balioglu Can, Chen Peng-jen, Costa-jussà Marta R., Elbayad Maha, Gong Hongyu, Guzmán Francisco, Heffernan Kevin, Jain Somya, Kao Justine, Lee Ann, Ma Xutai, Mourachko Alex, Peloquin Benjamin, Pino Juan, Popuri Sravya, Ropers Christophe, Saleem Safiyyah, Schwenk Holger, Sun Anna, Tomasello Paden, Wang Changhan, Wang Jeff, Wang Skyler, Williamson Mary
- Prompting A Large Language Model To Generate Diverse Motivational Messages: A Comparison With Human-written Messages Cox Samuel Rhys, Abdul Ashraf, Ooi Wei Tsang
- Robust Infidelity: When Faithfulness Measures On Masked Language Models Are Misleading Crothers Evan, Viktor Herna, Japkowicz Nathalie
- Ultrafeedback: Boosting Language Models With Scaled AI Feedback Cui Ganqu, Yuan Lifan, Ding Ning, Yao Guanming, He Bingxiang, Zhu Wei, Ni Yuan, Xie Guotong, Xie Ruobing, Lin Yankai, Liu Zhiyuan, Sun Maosong
- Efficient And Effective Text Encoding For Chinese Llama And Alpaca Cui Yiming, Yang Ziqing, Yao Xin
- Large Language Models For Compiler Optimization Cummins Chris, Seeker Volker, Grubisic Dejan, Elhoushi Mostafa, Liang Youwei, Roziere Baptiste, Gehring Jonas, Gloeckle Fabian, Hazelwood Kim, Synnaeve Gabriel, Leather Hugh
- Hallucination Is The Last Thing You Need Curran Shawn, Lansley Sam, Bethell Oliver
- Evaluating Prompt-based Question Answering For Object Prediction In The Open Research Knowledge Graph D'souza Jennifer, Hrou Moussab, Auer Sören
- Auggpt: Leveraging Chatgpt For Text Data Augmentation Dai Haixing, Liu Zhengliang, Liao Wenxiong, Huang Xiaoke, Cao Yihan, Wu Zihao, Zhao Lin, Xu Shaochen, Liu Wei, Liu Ninghao, Li Sheng, Zhu Dajiang, Cai Hongmin, Sun Lichao, Li Quanzheng, Shen Dinggang, Liu Tianming, Li Xiang
- Llm-in-the-loop: Leveraging Large Language Model For Thematic Analysis Dai Shih-chieh, Xiong Aiping, Ku Lun-wei
- Syntactic Knowledge Via Graph Attention With BERT In Machine Translation Dai Yuqian, Sharoff Serge, De Kamps Marc
- Choice Over Control: How Users Write With Large Language Models Using Diegetic And Non-diegetic Prompting Dang Hai, Goller Sven, Lehmann Florian, Buschek Daniel
- An Adversarial Example For Direct Logit Attribution: Memory Management In Gelu-4l Dao James, Lau Yeu-tong, Rager Can, Janiak Jett
- Flashattention-2: Faster Attention With Better Parallelism And Work Partitioning Dao Tri
- Performance Comparison Of Large Language Models On VNHSGE English Dataset: Openai Chatgpt, Microsoft Bing Chat, And Google Bard Dao Xuan-quy
- Chatgpt Is Good But Bing Chat Is Better For Vietnamese Students Dao Xuan-quy, Le Ngoc-bich
- Can Chatgpt Pass The Vietnamese National High School Graduation Examination? Dao Xuan-quy, Le Ngoc-bich, Phan Xuan-dung, Ngo Bac-bien
- Mask The Bias: Improving Domain-adaptive Generalization Of Ctc-based ASR With Internal Language Model Estimation Das Nilaksh, Sunkara Monica, Bodapati Sravan, Cai Jinglun, Kulshreshtha Devang, Farris Jeff, Kirchhoff Katrin
- Optimizing Distributed Training On Frontier For Large Language Models Dash Sajal, Lyngaas Isaac, Yin Junqi, Wang Xiao, Egele Romain, Cong Guojing, Wang Feiyi, Balaprakash Prasanna
- Complex QA And Language Models Hybrid Architectures, Survey Daull Xavier, Bellot Patrice, Bruno Emmanuel, Martin Vincent, Murisasco Elisabeth
- Testing GPT-4 With Wolfram Alpha And Code Interpreter Plug-ins On Math And Science Problems Davis Ernest, Aaronson Scott
- LLMR: Real-time Prompting Of Interactive Worlds Using Large Language Models De La Torre Fernanda, Fang Cathy Mengying, Huang Han, Banburski-fahey Andrzej, Fernandez Judith Amores, Lanier Jaron
- GPT-3 Models Are Few-shot Financial Reasoners De Padua Raul Salles, Qureshi Imran, Karakaplan Mustafa U.
- Improved Prompting And Process For Writing User Personas With Llms, Using Qualitative Interviews: Capturing Behaviour And Personality Traits Of Users De Paoli Stefano
- Writing User Personas With Large Language Models: Testing Phase 6 Of A Thematic Analysis Of Semi-structured Interviews De Paoli Stefano
- Im-promptu: In-context Composition From Image Prompts Dedhia Bhishma, Chang Michael, Snell Jake C., Griffiths Thomas L., Jha Niraj K.
- Facechat: An Emotion-aware Face-to-face Dialogue Framework Deema Alnuhait, Qingyang Wu, Zhou Yu
- Scaling Vision Transformers To 22 Billion Parameters Dehghani Mostafa, Djolonga Josip, Mustafa Basil, Padlewski Piotr, Heek Jonathan, Gilmer Justin, Steiner Andreas, Caron Mathilde, Geirhos Robert, Alabdulmohsin Ibrahim, Jenatton Rodolphe, Beyer Lucas, Tschannen Michael, Arnab Anurag, Wang Xiao, Riquelme Carlos, Minderer Matthias, Puigcerver Joan, Evci Utku, Kumar Manoj, Van Steenkiste Sjoerd, Elsayed Gamaleldin F., Mahendran Aravindh, Yu Fisher, Oliver Avital, Huot Fantine, Bastings Jasmijn, Collier Mark Patrick, Gritsenko Alexey, Birodkar Vighnesh, Vasconcelos Cristina, Tay Yi, Mensink Thomas, Kolesnikov Alexander, Pavetić Filip, Tran Dustin, Kipf Thomas, Lučić Mario, Zhai Xiaohua, Keysers Daniel, Harmsen Jeremiah, Houlsby Neil
- Divergent Token Metrics: Measuring Degradation To Prune Away LLM Components -- And Optimize Quantization Deiseroth Björn, Meuer Max, Gritsch Nikolas, Eichenberg Constantin, Schramowski Patrick, Aßenmacher Matthias, Kersting Kristian
- Speak While You Think: Streaming Speech Synthesis During Text Generation Dekel Avihu, Shechtman Slava, Fernandez Raul, Haws David, Kons Zvi, Hoory Ron
- Fairness Of Chatgpt And The Role Of Explainable-guided Prompts Deldjoo Yashar
- Investigating Data Contamination In Modern Benchmarks For Large Language Models Deng Chunyuan, Zhao Yilun, Tang Xiangru, Gerstein Mark, Cohan Arman
- Masterkey: Automated Jailbreak Across Multiple Large Language Model Chatbots Deng Gelei, Liu Yi, Li Yuekang, Wang Kailong, Zhang Ying, Li Zefeng, Wang Haoyu, Zhang Tianwei, Liu Yang
- Prompting And Evaluating Large Language Models For Proactive Dialogues: Clarification, Target-guided, And Non-collaboration Deng Yang, Liao Lizi, Chen Liang, Wang Hongru, Lei Wenqiang, Chua Tat-seng
- Multilingual Jailbreak Challenges In Large Language Models Deng Yue, Zhang Wenxuan, Pan Sinno Jialin, Bing Lidong
- Early Chatgpt User Portrait Through The Lens Of Data Deng Yuyang, Zhao Ni, Huang Xin
- Can Chatgpt Read Who You Are? Derner Erik, Kučera Dalibor, Oliver Nuria, Zahálka Jan
- Toxicity In Chatgpt: Analyzing Persona-assigned Language Models Deshpande Ameet, Murahari Vishvak, Rajpurohit Tanmay, Kalyan Ashwin, Narasimhan Karthik
- Chatgpt For GTFS: Benchmarking Llms On GTFS Understanding And Retrieval Devunuri Saipraneeth, Qiam Shirin, Lehe Lewis
- Chatgpt Asks, BLIP-2 Answers: Automatic Questioning Towards Enriched Visual Descriptions Deyao Zhu, Jun Chen, Kilichbek Haydarov, Xiaoqian Shen, Wenxuan Zhang, Mohamed Elhoseiny
- Minigpt-4: Enhancing Vision-language Understanding With Advanced Large Language Models Deyao Zhu, Jun Chen, Xiaoqian Shen, Xiang Li, Mohamed Elhoseiny
- Context-aware Transformer Pre-training For Answer Sentence Selection Di Liello Luca, Garg Siddhant, Moschitti Alessandro
- Evaluating Chatgpt As A Recommender System: A Rigorous Approach Di Palma Dario, Biancofiore Giovanni Maria, Anelli Vito Walter, Narducci Fedelucio, Di Noia Tommaso, Di Sciascio Eugenio
- Codefuse-13b: A Pretrained Multi-lingual Code Large Language Model Di Peng, Li Jianguo, Yu Hang, Jiang Wei, Cai Wenting, Cao Yang, Chen Chaoyu, Chen Dajun, Chen Hongwei, Chen Liang, Fan Gang, Gong Jie, Gong Zi, Hu Wen, Guo Tingting, Lei Zhichao, Li Ting, Li Zheng, Liang Ming, Liao Cong, Liu Bingchang, Liu Jiachen, Liu Zhiwei, Lu Shaojun, Shen Min, Wang Guangpei, Wang Huan, Wang Zhi, Xu Zhaogui, Yang Jiawei, Ye Qing, Zhang Gehao, Zhang Yu, Zhao Zelin, Zheng Xunjin, Zhou Hailian, Zhu Lifu, Zhu Xianying
- A Comprehensive Evaluation Of Neural SPARQL Query Generation From Natural Language Questions Diallo Papa Abdou Karim Karou, Reyd Samuel, Zouaq Amal
- Lmflow: An Extensible Toolkit For Finetuning And Inference Of Large Foundation Models Diao Shizhe, Pan Rui, Dong Hanze, Shum Ka Shun, Zhang Jipeng, Xiong Wei, Zhang Tong
- Can Large Language Models Assist In Hazard Analysis? Diemert Simon, Weber Jens H
- Jump To Conclusions: Short-cutting Transformers With Linear Transformations Din Alexander Yom, Karidi Taelin, Choshen Leshem, Geva Mor
- Longnet: Scaling Transformers To 1,000,000,000 Tokens Ding Jiayu, Ma Shuming, Dong Li, Zhang Xingxing, Huang Shaohan, Wang Wenhui, Zheng Nanning, Wei Furu
- Using Large Language Model To Solve And Explain Physics Word Problems Approaching Human Level Ding Jingzhe, Cen Yan, Wei Xinyuan
- Enhancing Chat Language Models By Scaling High-quality Instructional Conversations Ding Ning, Chen Yulin, Xu Bokai, Qin Yujia, Zheng Zhi, Hu Shengding, Liu Zhiyuan, Sun Maosong, Zhou Bowen
- A Wolf In Sheep's Clothing: Generalized Nested Jailbreak Prompts Can Fool Large Language Models Easily Ding Peng, Kuang Jun, Ma Dan, Cao Xuezhi, Xian Yunsen, Chen Jiajun, Huang Shujian
- Self-agreement: A Framework For Fine-tuning Language Models To Find Agreement Among Diverse Opinions Ding Shiyao, Ito Takayuki
- HPC-GPT: Integrating Large Language Model For High-performance Computing Ding Xianzhong, Chen Le, Emani Murali, Liao Chunhua, Lin Pei-hung, Vanderbruggen Tristan, Xie Zhen, Cerpa Alberto E., Du Wan
- Fluid Transformers And Creative Analogies: Exploring Large Language Models' Capacity For Augmenting Cross-domain Analogical Creativity Ding Zijian, Srinivasan Arvind, Macneil Stephen, Chan Joel
- Modeling What-to-ask And How-to-ask For Answer-unaware Conversational Question Generation Do Xuan Long, Zou Bowei, Joty Shafiq, Tran Anh Tai, Pan Liangming, Chen Nancy F., Aw Ai Ti
- Efficient Finetuning Large Language Models For Vietnamese Chatbot Doan Vu-thuan, Truong Quoc-truong, Nguyen Duc-vu, Nguyen Vinh-tiep, Luu Thuy-ngan Nguyen
- Establishing Performance Baselines In Fine-tuning, Retrieval-augmented Generation And Soft-prompting For Non-specialist LLM Users Dodgson Jennifer, Nanzheng Lin, Peh Julian, Pattirane Akira Rafhael Janson, Alhajir Alfath Daryl, Dinarto Eko Ridho, Lim Joseph, Ahmad Syed Danyal
- Block-wise Bit-compression Of Transformer-based Models Dong Gaochen, Chen Wei
- Blockwise Compression Of Transformer-based Models Without Retraining Dong Gaochen, Chen Wei
- Large Language Model For Science: A Study On P Vs. NP Dong Qingxiu, Dong Li, Xu Ke, Zhou Guangyan, Hao Yaru, Sui Zhifang, Wei Furu
- Statistical Knowledge Assessment For Large Language Models Dong Qingxiu, Xu Jingjing, Kong Lingpeng, Sui Zhifang, Li Lei
- The Philosopher's Stone: Trojaning Plugins Of Large Language Models Dong Tian, Xue Minhui, Chen Guoxing, Holland Rayne, Meng Yan, Li Shaofeng, Liu Zhen, Zhu Haojin
- Speechgpt: Empowering Large Language Models With Intrinsic Cross-modal Conversational Abilities Dong Zhang, Shimin Li, Xin Zhang, Jun Zhan, Pengyu Wang, Yaqian Zhou, Xipeng Qiu
- A Comparative Study Of Ai-generated (GPT-4) And Human-crafted Mcqs In Programming Education Doughty Jacob, Wan Zipiao, Bompelli Anishka, Qayum Jubahed, Wang Taozhi, Zhang Juran, Zheng Yujia, Doyle Aidan, Sridhar Pragnya, Agarwal Arav, Bogart Christopher, Keylor Eric, Kultur Can, Savelka Jaromir, Sakr Majd
- Large Language Models Douglas Michael R.
- Blending Reward Functions Via Few Expert Demonstrations For Faithful And Accurate Knowledge-grounded Dialogue Generation Du Wanyu, Ji Yangfeng
- Classeval: A Manually-crafted Benchmark For Evaluating Llms On Class-level Code Generation Du Xueying, Liu Mingwei, Wang Kaixin, Wang Hanlin, Liu Junwei, Chen Yixuan, Feng Jiayi, Sha Chaofeng, Peng Xin, Lou Yiling
- Lauragpt: Listen, Attend, Understand, And Regenerate Audio With GPT Du Zhihao, Wang Jiaming, Chen Qian, Chu Yunfei, Gao Zhifu, Li Zerui, Hu Kai, Zhou Xiaohuan, Xu Jin, Ma Ziyang, Wang Wen, Zheng Siqi, Zhou Chang, Yan Zhijie, Zhang Shiliang
- Botchat: Evaluating Llms' Capabilities Of Having Multi-turn Dialogues Duan Haodong, Wei Jueqi, Wang Chonghua, Liu Hongwei, Fang Yixiao, Zhang Songyang, Lin Dahua, Chen Kai
- Shifting Attention To Relevance: Towards The Predictive Uncertainty Quantification Of Free-form Large Language Models Duan Jinhao, Cheng Hao, Wang Shiqi, Zavalny Alex, Wang Chenan, Xu Renjing, Kailkhura Bhavya, Xu Kaidi
- Bridging The Language Gap: Knowledge Injected Multilingual Question Answering Duan Zhichao, Li Xiuxing, Zhang Zhengyan, Li Zhenyu, Liu Ning, Wang Jianyong
- Alpacafarm: A Simulation Framework For Methods That Learn From Human Feedback Dubois Yann, Li Xuechen, Taori Rohan, Zhang Tianyi, Gulrajani Ishaan, Ba Jimmy, Guestrin Carlos, Liang Percy, Hashimoto Tatsunori B.
- Stack Attention: Improving The Ability Of Transformers To Model Hierarchical Patterns Dusell Brian, Chiang David
- Frugal Lms Trained To Invoke Symbolic Solvers Achieve Parameter-efficient Arithmetic Reasoning Dutta Subhabrata, Singh Joykirat, Pandey Ishan, Manchanda Sunny, Chakrabarti Soumen, Chakraborty Tanmoy
- A Comparative Analysis Of Large Language Models For Code Documentation Generation Dvivedi Shubhang Shekhar, Vijay Vyshnav, Pujari Sai Leela Rahul, Lodh Shoumik, Kumar Dhruv
- Faith And Fate: Limits Of Transformers On Compositionality Dziri Nouha, Lu Ximing, Sclar Melanie, Li Xiang Lorraine, Jiang Liwei, Lin Bill Yuchen, West Peter, Bhagavatula Chandra, Bras Ronan Le, Hwang Jena D., Sanyal Soumya, Welleck Sean, Ren Xiang, Ettinger Allyson, Harchaoui Zaid, Choi Yejin
- GPT-SW3: An Autoregressive Language Model For The Nordic Languages Ekgren Ariel, Gyllensten Amaru Cuba, Stollenwerk Felix, Öhman Joey, Isbister Tim, Gogoulou Evangelia, Carlsson Fredrik, Heiman Alice, Casademont Judit, Sahlgren Magnus
- Measuring And Improving Attentiveness To Partial Inputs With Counterfactuals Elazar Yanai, Paranjape Bhargavi, Peng Hao, Wiegreffe Sarah, Raghavi Khyathi, Srikumar Vivek, Singh Sameer, Smith Noah A.
- Tinystories: How Small Can Language Models Be And Still Speak Coherent English? Eldan Ronen, Li Yuanzhi
- Octopus: A Multitask Model And Toolkit For Arabic Natural Language Generation Elmadany Abdelrahim, Nagoudi El Moatez Billah, Abdul-mageed Muhammad
- A Comprehensive Performance Study Of Large Language Models On Novel AI Accelerators Emani Murali, Foreman Sam, Sastry Varuni, Xie Zhen, Raskar Siddhisanket, Arnold William, Thakur Rajeev, Vishwanath Venkatram, Papka Michael E.
- RAGAS: Automated Evaluation Of Retrieval Augmented Generation Es Shahul, James Jithin, Espinosa-anke Luis, Schockaert Steven
- Jacotext: A Pretrained Model For Java Code-text Generation Espejel Jessica López, Alassan Mahaman Sanoussi Yahaya, Dahhane Walid, Ettifouri El Hassane
- GPT-3.5, GPT-4, Or BARD? Evaluating Llms Reasoning Ability In Zero-shot Setting And Performance Boosting Through Prompts Espejel Jessica López, Ettifouri El Hassane, Alassan Mahaman Sanoussi Yahaya, Chouham El Mehdi, Dahhane Walid
- Chatgpt Outperforms Crowd-workers For Text-annotation Tasks Fabrizio Gilardi, Meysam Alizadeh, Maël Kubli
- Lm-polygraph: Uncertainty Estimation For Language Models Fadeeva Ekaterina, Vashurin Roman, Tsvigun Akim, Vazhentsev Artem, Petrakov Sergey, Fedyanin Kirill, Vasilev Daniil, Goncharova Elizaveta, Panchenko Alexander, Panov Maxim, Baldwin Timothy, Shelmanov Artem
- Perspectives On Large Language Models For Relevance Judgment Faggioli Guglielmo, Dietz Laura, Clarke Charles, Demartini Gianluca, Hagen Matthias, Hauff Claudia, Kando Noriko, Kanoulas Evangelos, Potthast Martin, Stein Benno, Wachsmuth Henning
- NAP At Semeval-2023 Task 3: Is Less Really More? (back-)translation As Data Augmentation Strategies For Detecting Persuasion Techniques Falk Neele, Eichel Annerose, Piccirilli Prisca
- Exploring The Potential Of Large Language Models In Generating Code-tracing Questions For Introductory Programming Courses Fan Aysa Xuemo, Zhang Ranran Haoran, Paquette Luc, Zhang Rui
- Uncovering The Potential Of Chatgpt For Discourse Analysis In Dialogue: An Empirical Study Fan Yaxin, Jiang Feng, Li Peifeng, Li Haizhou
- Using GPT-4 To Augment Unbalanced Data For Automatic Scoring Fang Luyang, Lee Gyeong-geon, Zhai Xiaoming
- Instructseq: Unifying Vision Tasks With Instruction-conditioned Multi-modal Sequence Generation Fang Rongyao, Yan Shilin, Huang Zhaoyang, Zhou Jingqiu, Tian Hao, Dai Jifeng, Li Hongsheng
- Chatgpt As Data Augmentation For Compositional Generalization: A Case Study In Open Intent Detection Fang Yihao, Li Xianzhi, Thomas Stephen W., Zhu Xiaodan
- An Empirical Study Of Multitask Learning To Improve Open Domain Dialogue Systems Farahani Mehrdad, Johansson Richard
- Tooltalk: Evaluating Tool-usage In A Conversational Setting Farn Nicholas, Shin Richard
- Chatgpt Performance On Standardized Testing Exam -- A Proposed Strategy For Learners Farooq Umer, Anwar Saira
- Friend Or Foe? Exploring The Implications Of Large Language Models On The Science System Fecher Benedikt, Hebing Marcel, Laufer Melissa, Pohle Jörg, Sofsky Fabian
- Transferable Decoding With Visual Entities For Zero-shot Image Captioning Fei Junjie, Wang Teng, Zhang Jinrui, He Zhenyu, Wang Chengjie, Zheng Feng
- Extending Context Window Of Large Language Models Via Semantic Compression Fei Weizhi, Niu Xueyan, Zhou Pingyi, Hou Lu, Bai Bo, Deng Lei, Han Wei
- Trapping LLM Hallucinations Using Tagged Context Prompts Feldman Philip, Foulds James R., Pan Shimei
- Language Models Can Be Logical Solvers Feng Jiazhan, Xu Ruochen, Hao Junheng, Sharma Hiteshi, Shen Yelong, Zhao Dongyan, Chen Weizhu
- Sequence-to-sequence Pre-training With Unified Modality Masking For Visual Document Understanding Feng Shuwei, Zhan Tianyang, Jie Zhanming, Luong Trung Quoc, Jin Xiaoran
- Chessgpt: Bridging Policy Learning And Language Modeling Feng Xidong, Luo Yicheng, Wang Ziyan, Tang Hongrui, Yang Mengyue, Shao Kun, Mguni David, Du Yali, Wang Jun
- Towards Llm-driven Dialogue State Tracking Feng Yujie, Lu Zexin, Liu Bo, Zhan Liming, Wu Xiao-ming
- Explaining How Transformers Use Context To Build Predictions Ferrando Javier, Gállego Gerard I., Tsiamas Ioannis, Costa-jussà Marta R.
- Should Chatgpt Be Biased? Challenges And Risks Of Bias In Large Language Models Ferrara Emilio
- Exaranker: Explanation-augmented Neural Ranker Ferraretto Fernando, Laitz Thiago, Lotufo Roberto, Nogueira Rodrigo
- Grounded Complex Task Segmentation For Conversational Assistants Ferreira Rafael, Semedo David, Magalhães João
- Vision Language Transformers: A Survey Fields Clayton, Kennington Casey
- Context-aware Meta-learning Fifty Christopher, Duan Dennis, Junkins Ronald G., Amid Ehsan, Leskovec Jure, Re Christopher, Thrun Sebastian
- Leveraging Large Language Models For Automated Dialogue Analysis Finch Sarah E., Paek Ellie S., Choi Jinho D.
- Medalign: A Clinician-generated Dataset For Instruction Following With Electronic Medical Records Fleming Scott L., Lozano Alejandro, Haberkorn William J., Jindal Jenelle A., Reis Eduardo P., Thapa Rahul, Blankemeier Louis, Genkins Julian Z., Steinberg Ethan, Nayak Ashwin, Patel Birju S., Chiang Chia-chun, Callahan Alison, Huo Zepeng, Gatidis Sergios, Adams Scott J., Fayanju Oluseyi, Shah Shreya J., Savage Thomas, Goh Ethan, Chaudhari Akshay S., Aghaeepour Nima, Sharp Christopher, Pfeffer Michael A., Liang Percy, Chen Jonathan H., Morse Keith E., Brunskill Emma P., Fries Jason A., Shah Nigam H.
- Can Large Language Models Follow Concept Annotation Guidelines? A Case Study On Scientific And Financial Domains Fonseca Marcio, Cohen Shay B.
- Enhancing Pipeline-based Conversational Agents With Large Language Models Foosherian Mina, Purwins Hendrik, Rathnayake Purna, Alam Touhidul, Teimao Rui, Thoben Klaus-dieter
- Breaking The Language Barrier: Improving Cross-lingual Reasoning With Structured Self-attention Foroutan Negar, Banaei Mohammadreza, Aberer Karl, Bosselut Antoine
- A Review Of Chatgpt Applications In Education, Marketing, Software Engineering, And Healthcare: Benefits, Drawbacks, And Research Directions Fraiwan Mohammad, Khasawneh Natheer
- Sparsegpt: Massive Language Models Can Be Accurately Pruned In One-shot Frantar Elias, Alistarh Dan
- Chatgpt & Mechanical Engineering: Examining Performance On The FE Mechanical Engineering And Undergraduate Exams Frenkel Matthew, Emara Hebah
- Leveraging Large Language Models In Conversational Recommender Systems Friedman Luke, Ahuja Sameer, Allen David, Tan Zhenning, Sidahmed Hakim, Long Changbo, Xie Jun, Schubiner Gabriel, Patel Ajay, Lara Harsh, Chu Brian, Chen Zexi, Tiwari Manoj
- A Challenger To GPT-4V? Early Explorations Of Gemini In Visual Expertise Fu Chaoyou, Zhang Renrui, Wang Zihan, Huang Yubo, Zhang Zhengye, Qiu Longtian, Ye Gaoxiang, Shen Yunhang, Zhang Mengdan, Chen Peixian, Zhao Sirui, Lin Shaohui, Jiang Deqiang, Yin Di, Gao Peng, Li Ke, Li Hongsheng, Sun Xing
- Gptscore: Evaluate As You Desire Fu Jinlan, Ng See-kiong, Jiang Zhengbao, Liu Pengfei
- Codeapex: A Bilingual Programming Evaluation Benchmark For Large Language Models Fu Lingyue, Chai Huacan, Luo Shuang, Du Kounianhua, Zhang Weiming, Fan Longteng, Lei Jiayi, Rui Renting, Lin Jianghao, Fang Yuchen, Liu Yifan, Wang Jingkuan, Qi Siyuan, Zhang Kangning, Zhang Weinan, Yu Yong
- Revisiting The Knowledge Injection Frameworks Fu Peng, Zhang Yiming, Wang Haobo, Qiu Weikang, Zhao Junbo
- Generate Then Select: Open-ended Visual Question Answering Guided By World Knowledge Fu Xingyu, Zhang Sheng, Kwon Gukyeong, Perera Pramuditha, Zhu Henghui, Zhang Yuhao, Li Alexander Hanbo, Wang William Yang, Wang Zhiguo, Castelli Vittorio, Ng Patrick, Roth Dan, Xiang Bing
- Chain-of-thought Hub: A Continuous Effort To Measure Large Language Models' Reasoning Performance Fu Yao, Ou Litu, Chen Mingyu, Wan Yuhao, Peng Hao, Khot Tushar
- Specializing Smaller Language Models Towards Multi-step Reasoning Fu Yao, Peng Hao, Ou Litu, Sabharwal Ashish, Khot Tushar
- Gpt4aigchip: Towards Next-generation AI Accelerator Design Automation Via Large Language Models Fu Yonggan, Zhang Yongan, Yu Zhongzhi, Li Sixu, Ye Zhifan, Li Chaojian, Wan Cheng, Lin Yingyan
- Safety Alignment In NLP Tasks: Weakly Aligned Summarization As An In-context Attack Fu Yu, Li Yufei, Xiao Wen, Liu Cong, Dong Yue
- Exposing Limitations Of Language Model Agents In Sequential-task Compositions On The Web Furuta Hiroki, Matsuo Yutaka, Faust Aleksandra, Gur Izzeddin
- Mitigating Hallucination In Large Multi-modal Models Via Robust Instruction Tuning Fuxiao Liu, Kevin Lin, Linjie Li, Jianfeng Wang, Yaser Yacoob, Lijuan Wang
- The Impact And Applications Of Chatgpt: A Systematic Review Of Literature Reviews Gabashvili Irene S.
- The Inner Sentiments Of A Thought Gagne Chris, Dayan Peter
- Cerbero-7b: A Leap Forward In Language-specific Llms Through Enhanced Chat Corpus Generation And Evaluation Galatolo Federico A., Cimino Mario G. C. A.
- Sensitivity And Robustness Of Large Language Models To Prompt Template In Japanese Text Classification Tasks Gan Chengguang, Mori Tatsunori
- Ziya2: Data-centric Learning Is All Llms Need Gan Ruyi, Wu Ziwei, Sun Renliang, Lu Junyu, Wu Xiaojun, Zhang Dixiang, Pan Kunhao, He Junqing, Tian Yuanhe, Yang Ping, Yang Qi, Wang Hao, Zhang Jiaxing, Song Yan
- Systematic Evaluation Of GPT-3 For Zero-shot Personality Estimation Ganesan Adithya V, Lal Yash Kumar, Nilsson August Håkan, Schwartz H. Andrew
- CLIPAG: Towards Generator-free Text-to-image Generation Ganz Roy, Elad Michael
- Roles Of Scaling And Instruction Tuning In Language Perception: Model Vs. Human Attention Gao Changjiang, Huang Shujian, Li Jixing, Chen Jiajun
- Assistgpt: A General Multi-modal Assistant That Can Plan, Execute, Inspect, And Learn Gao Difei, Ji Lei, Zhou Luowei, Lin Kevin Qinghong, Chen Joya, Fan Zihan, Shou Mike Zheng
- Benchmarking Large Language Models With Augmented Instructions For Fine-grained Information Extraction Gao Jun, Zhao Huan, Zhang Yice, Wang Wei, Yu Changlong, Xu Ruifeng
- Examining User-friendly And Open-sourced Large GPT Models: A Survey On Language, Multimodal, And Scientific GPT Models Gao Kaiyuan, He Sunan, He Zhenyu, Lin Jiacheng, Pei Qizhi, Shao Jie, Zhang Wei
- Llama-adapter V2: Parameter-efficient Visual Instruction Model Gao Peng, Han Jiaming, Zhang Renrui, Lin Ziyi, Geng Shijie, Zhou Aojun, Zhang Wei, Lu Pan, He Conghui, Yue Xiangyu, Li Hongsheng, Qiao Yu
- Empower Your Model With Longer And Better Context Comprehension Gao Yifei, Wang Lei, Fang Jun, Hu Longhua, Cheng Jun
- How To Design Translation Prompts For Chatgpt: An Empirical Study Gao Yuan, Wang Ruili, Hou Feng
- Real Customization Or Just Marketing: Are Customized Versions Of Chat GPT Useful? Garrido-merchán Eduardo C., Arroyo-barrigüete Jose L., Borrás-pala Francisco, Escobar-torres Leandro, De Ibarreta Carlos Martínez, Ortiz-lozano Jose María, Rua-vieites Antonio
- Simulating H.P. Lovecraft Horror Literature With The Chatgpt Large Language Model Garrido-merchán Eduardo C., Arroyo-barrigüete José Luis, Gozalo-brizuela Roberto
- Text Encoders Lack Knowledge: Leveraging Generative Llms For Domain-specific Semantic Textual Similarity Gatto Joseph, Sharif Omar, Seegmiller Parker, Bohlman Philip, Preum Sarah Masud
- Building Trustworthy Neurosymbolic AI Systems: Consistency, Reliability, Explainability, And Safety Gaur Manas, Sheth Amit
- Reasoning In Large Language Models Through Symbolic Math Word Problems Gaur Vedant, Saunshi Nikunj
- A Lightweight Method To Generate Unanswerable Questions In English Gautam Vagrant, Zhang Miaoran, Klakow Dietrich
- Ahead-of-time P-tuning Gavrilov Daniil, Balagansky Nikita
- Model Tells You What To Discard: Adaptive KV Cache Compression For Llms Ge Suyu, Zhang Yunan, Liu Liyuan, Zhang Minjia, Han Jiawei, Gao Jianfeng
- Mllm-bench: Evaluating Multimodal Llms With Per-sample Criteria Ge Wentao, Chen Shunian, Chen Guiming Hardy, Chen Zhihong, Chen Junying, Yan Shuo, Zhu Chenghao, Lin Ziyue, Xie Wenya, Zhang Xinyi, Chai Yichen, Liu Xiaoyu, Song Dingjie, Wang Xidong, Gao Anningzhe, Zhang Zhiyi, Li Jianquan, Wan Xiang, Wang Benyou
- Making Llama SEE And Draw With SEED Tokenizer Ge Yuying, Zhao Sijie, Zeng Ziyun, Ge Yixiao, Li Chen, Wang Xintao, Shan Ying
- Large Language Models Are Not Strong Abstract Reasoners Gendron Gaël, Bao Qiming, Witbrock Michael, Dobbie Gillian
- UPAR: A Kantian-inspired Prompting Framework For Enhancing Large Language Model Capabilities Geng Hejia, Xu Boxun, Li Peng
- Hiclip: Contrastive Language-image Pretraining With Hierarchy-aware Attention Geng Shijie, Yuan Jianbo, Tian Yu, Chen Yuxiao, Zhang Yongfeng
- Does Synthetic Data Make Large Language Models More Efficient? Gholami Sia, Omar Marwan
- Flacuna: Unleashing The Problem Solving Power Of Vicuna Using FLAN Fine-tuning Ghosal Deepanway, Chia Yew Ken, Majumder Navonil, Poria Soujanya
- Chatgpt Perpetuates Gender Bias In Machine Translation And Ignores Non-gendered Pronouns: Findings Across Bengali And Five Other Low-resource Languages Ghosh Sourojit, Caliskan Aylin
- Gpt-based Models Meet Simulation: How To Efficiently Use Large-scale Pre-trained Language Models Across Simulation Tasks Giabbanelli Philippe J.
- Resetox: Re-learning Attention Weights For Toxicity Mitigation In Machine Translation Gilabert Javier García, Escolano Carlos, Costa-jussà Marta R.
- Semantic Compression With Large Language Models Gilbert Henry, Sandborn Michael, Schmidt Douglas C., Spencer-smith Jesse, White Jules
- Transformative Effects Of Chatgpt On Modern Education: Emerging Era Of AI Chatbots Gill Sukhpal Singh, Xu Minxian, Patros Panos, Wu Huaming, Kaur Rupinder, Kaur Kamalpreet, Fuller Stephanie, Singh Manmeet, Arora Priyansh, Parlikad Ajith Kumar, Stankovski Vlado, Abraham Ajith, Ghosh Soumya K., Lutfiyya Hanan, Kanhere Salil S., Bahsoon Rami, Rana Omer, Dustdar Schahram, Sakellariou Rizos, Uhlig Steve, Buyya Rajkumar
- Prompt Cache: Modular Attention Reuse For Low-latency Inference Gim In, Chen Guojun, Lee Seung-seob, Sarda Nikhil, Khandelwal Anurag, Zhong Lin
- Wanglab At Mediqa-chat 2023: Clinical Note Generation From Doctor-patient Conversations Using Large Language Models Giorgi John, Toma Augustin, Xie Ronald, Chen Sondra S., An Kevin R., Zheng Grace X., Wang Bo
- Mtuncertainty: Assessing The Need For Post-editing Of Machine Translation Outputs By Fine-tuning Openai Llms Gladkoff Serge, Han Lifeng, Erofeev Gleb, Sorokina Irina, Nenadic Goran
- Generative AI Beyond Llms: System Implications Of Multi-modal Generation Golden Alicia, Hsia Samuel, Sun Fei, Acun Bilge, Hosmer Basil, Lee Yejin, Devito Zachary, Johnson Jeff, Wei Gu-yeon, Brooks David, Wu Carole-jean
- Decoding Stumpers: Large Language Models Vs. Human Problem-solvers Goldstein Alon, Havin Miriam, Reichart Roi, Goldstein Ariel
- Can Llms Capture Human Preferences? Goli Ali, Singh Amandeep
- Model-generated Pretraining Signals Improves Zero-shot Generalization Of Text-to-text Transformers Gong Linyuan, Xiong Chenyan, Liu Xiaodong, Bajaj Payal, Xie Yiqing, Cheung Alvin, Gao Jianfeng, Song Xia
- Evaluating The Potential Of Leading Large Language Models In Reasoning Biology Questions Gong Xinyu, Holmes Jason, Li Yiwei, Liu Zhengliang, Gan Qi, Wu Zihao, Zhang Jianli, Zou Yusong, Teng Yuxi, Jiang Tian, Zhu Hongtu, Liu Wei, Liu Tianming, Yan Yajun
- Figstep: Jailbreaking Large Vision-language Models Via Typographic Visual Prompts Gong Yichen, Ran Delong, Liu Jinyuan, Wang Conglei, Cong Tianshuo, Wang Anyu, Duan Sisi, Wang Xiaoyun
- Bloomvqa: Assessing Hierarchical Multi-modal Comprehension Gong Yunye, Shrestha Robik, Claypoole Jared, Cogswell Michael, Ray Arijit, Kanan Christopher, Divakaran Ajay
- Improving Input-label Mapping With Demonstration Replay For In-context Learning Gong Zhuocheng, Liu Jiahao, Wang Qifan, Wang Jingang, Cai Xunliang, Zhao Dongyan, Yan Rui
- Using Large Language Models For Interpreting Autonomous Robots Behaviors González-santamarta Miguel A., Fernández-becerra Laura, Sobrín-hidalgo David, Guerrero-higueras Ángel Manuel, González Irene, Lera Francisco J. Rodríguez
- Integration Of Large Language Models Within Cognitive Architectures For Autonomous Robots González-santamarta Miguel Á., Rodríguez-lera Francisco J., Guerrero-higueras Ángel Manuel, Matellán-olivera Vicente
- Chatgpt Is Not All You Need. A State Of The Art Review Of Large Generative AI Models Gozalo-brizuela Roberto, Garrido-merchán Eduardo C.
- Not What You've Signed Up For: Compromising Real-world Llm-integrated Applications With Indirect Prompt Injection Greshake Kai, Abdelnabi Sahar, Mishra Shailesh, Endres Christoph, Holz Thorsten, Fritz Mario
- AMAGO: Scalable In-context Reinforcement Learning For Adaptive Agents Grigsby Jake, Fan Linxi, Zhu Yuke
- Large Language Models Are Zero-shot Time Series Forecasters Gruver Nate, Finzi Marc, Qiu Shikai, Wilson Andrew Gordon
- Linguistically Informed Chatgpt Prompts To Enhance Japanese-chinese Machine Translation: A Case Study On Attributive Clauses Gu Wenshi
- Seer: Language Instructed Video Prediction With Latent Diffusion Models Gu Xianfan, Wen Chuan, Ye Weirui, Song Jiaming, Gao Yang
- Language Models Hallucinate, But May Excel At Fact Verification Guan Jian, Dodge Jesse, Wadden David, Huang Minlie, Peng Hao
- Leveraging Pre-trained Large Language Models To Construct And Utilize World Models For Model-based Task Planning Guan Lin, Valmeekam Karthik, Sreedharan Sarath, Kambhampati Subbarao
- Intelligent Virtual Assistants With Llm-based Process Automation Guan Yanchu, Wang Dong, Chu Zhixuan, Wang Shiyu, Ni Feiyue, Song Ruihua, Li Longfei, Gu Jinjie, Zhuang Chenyi
- Voyager: An Open-ended Embodied Agent With Large Language Models Guanzhi Wang, Yuqi Xie, Yunfan Jiang, Ajay Mandlekar, Chaowei Xiao, Yuke Zhu, Linxi Fan, Anima Anandkumar
- The False Promise Of Imitating Proprietary Llms Gudibande Arnav, Wallace Eric, Snell Charlie, Geng Xinyang, Liu Hao, Abbeel Pieter, Levine Sergey, Song Dawn
- Hallucinations In Large Multilingual Translation Models Guerreiro Nuno M., Alves Duarte, Waldendorf Jonas, Haddow Barry, Birch Alexandra, Colombo Pierre, Martins André F. T.
- Hifi: High-information Attention Heads Hold For Parameter-efficient Model Adaptation Gui Anchun, Xiao Han
- Hiformer: Heterogeneous Feature Interactions Learning With Transformers For Recommender Systems Gui Huan, Wang Ruoxi, Yin Ke, Jin Long, Kula Maciej, Xu Taibai, Hong Lichan, Chi Ed H.
- SPT: Fine-tuning Transformer-based Language Models Efficiently With Sparsification Gui Yuntao, Yan Xiao, Yin Peiqi, Yang Han, Cheng James
- Efficacy Of Machine-generated Instructions Gulati Samaksh, Verma Anshit, Parmar Manoj, Chaudhary Palash
- Longcoder: A Long-range Pre-trained Language Model For Code Completion Guo Daya, Xu Canwen, Duan Nan, Yin Jian, Mcauley Julian
- Empowering Working Memory For Large Language Model Agents Guo Jing, Li Nan, Qi Jianchuan, Yang Hang, Li Ruiqiao, Feng Yuzhen, Zhang Si, Xu Ming
- Connecting Large Language Models With Evolutionary Algorithms Yields Powerful Prompt Optimizers Guo Qingyan, Wang Rui, Guo Junliang, Li Bei, Song Kaitao, Tan Xu, Liu Guoqing, Bian Jiang, Yang Yujiu
- What Can Large Language Models Do In Chemistry? A Comprehensive Benchmark On Eight Tasks Guo Taicheng, Guo Kehan, Nan Bozhao, Liang Zhenwen, Guo Zhichun, Chawla Nitesh V., Wiest Olaf, Zhang Xiangliang
- ELIP: Efficient Language-image Pre-training With Fewer Vision Tokens Guo Yangyang, Zhang Haoyu, Wong Yongkang, Nie Liqiang, Kankanhalli Mohan
- PPTC Benchmark: Evaluating Large Language Models For Powerpoint Task Completion Guo Yiduo, Zhang Zekai, Liang Yaobo, Zhao Dongyan, Duan Nan
- Arthmodel: Enhance Arithmetic Skills To Large Language Model Guo Yingdi
- Gemrec: Towards Generative Model Recommendation Guo Yuanhe, Liu Haoming, Wen Hongyi
- Improving Small Language Models On Pubmedqa Via Generative Data Augmentation Guo Zhen, Wang Peiqi, Wang Yanwei, Yu Shangdi
- Evaluating Large Language Models: A Comprehensive Survey Guo Zishan, Jin Renren, Liu Chuang, Huang Yufei, Shi Dan, Supryadi, Yu Linhao, Liu Yan, Li Jiaxuan, Xiong Bojian, Xiong Deyi
- Editing Common Sense In Transformers Gupta Anshita, Mondal Debanjan, Sheshadri Akshay Krishna, Zhao Wenlong, Li Xiang Lorraine, Wiegreffe Sarah, Tandon Niket
- Continual Pre-training Of Large Language Models: How To (re)warm Your Model? Gupta Kshitij, Thérien Benjamin, Ibrahim Adam, Richter Mats L., Anthony Quentin, Belilovsky Eugene, Rish Irina, Lesort Timothée
- Bias Runs Deep: Implicit Reasoning Biases In Persona-assigned Llms Gupta Shashank, Shrivastava Vaishnavi, Deshpande Ameet, Kalyan Ashwin, Clark Peter, Sabharwal Ashish, Khot Tushar
- Gistscore: Learning Better Representations For In-context Example Selection With Gist Bottlenecks Gupta Shivanshu, Rosenbaum Clemens, Elenberg Ethan R.
- A Real-world Webagent With Planning, Long Context Understanding, And Program Synthesis Gur Izzeddin, Furuta Hiroki, Huang Austin, Safdari Mustafa, Matsuo Yutaka, Eck Douglas, Faust Aleksandra
- A Survey Of Vision-language Pre-training From The Lens Of Multimodal Machine Translation Gwinnup Jeremy, Duh Kevin
- Regulating Chatgpt And Other Large Generative AI Models Hacker Philipp, Engel Andreas, Mauer Marco
- Is GPT-4 A Reliable Rater? Evaluating Consistency In GPT-4 Text Ratings Hackl Veronika, Müller Alexandra Elena, Granitzer Michael, Sailer Maximilian
- Are Large Language Model-based Evaluators The Solution To Scaling Up Multilingual Evaluation? Hada Rishav, Gumma Varun, De Wynter Adrian, Diddee Harshita, Ahmed Mohamed, Choudhury Monojit, Bali Kalika, Sitaram Sunayana
- Seeing Chatgpt Through Students' Eyes: An Analysis Of Tiktok Data Haensch Anna-carolina, Ball Sarah, Herklotz Markus, Kreuter Frauke
- Efficient Parallelization Layouts For Large-scale Distributed Model Training Hagemann Johannes, Weinbach Samuel, Dobler Konstantin, Schall Maximilian, De Melo Gerard
- Deception Abilities Emerged In Large Language Models Hagendorff Thilo
- Human-like Intuitive Behavior And Reasoning Biases Emerged In Language Models -- And Disappeared In GPT-4 Hagendorff Thilo, Fabi Sarah
- A Theory Of Emergent In-context Learning As Implicit Structure Induction Hahn Michael, Goyal Navin
- Images In Language Space: Exploring The Suitability Of Large Language Models For Vision & Language Tasks Hakimov Sherzod, Schlangen David
- Opiniongpt: Modelling Explicit Biases In Instruction-tuned Llms Haller Patrick, Aynetdinov Ansar, Akbik Alan
- Large Language Models Meet Computer Vision: A Brief Survey Hamadi Raby
- Evaluation Of AI Chatbots For Patient-specific EHR Questions Hamidi Alaleh, Roberts Kirk
- Lm-infinite: Zero-shot Extreme Length Generalization For Large Language Models Han Chi, Wang Qifan, Peng Hao, Xiong Wenhan, Chen Yu, Ji Heng, Wang Sinong
- Potential Benefits Of Employing Large Language Models In Research In Moral Education And Development Han Hyemin
- Onellm: One Framework To Align All Modalities With Language Han Jiaming, Gong Kaixiong, Zhang Yiyuan, Wang Jiaqi, Zhang Kaipeng, Lin Dahua, Qiao Yu, Gao Peng, Yue Xiangyu
- Imagebind-llm: Multi-modality Instruction Tuning Han Jiaming, Zhang Renrui, Shao Wenqi, Gao Peng, Xu Peng, Xiao Han, Zhang Kaipeng, Liu Chris, Wen Song, Guo Ziyu, Lu Xudong, Ren Shuai, Wen Yafei, Chen Xiaoxin, Yue Xiangyu, Li Hongsheng, Qiao Yu
- Cheddar: Student-chatgpt Dialogue In EFL Writing Education Han Jieun, Yoo Haneul, Myung Junho, Kim Minsun, Lee Tak Yeon, Ahn So-yeon, Oh Alice
- Pive: Prompting With Iterative Verification Improving Graph-based Generative Capability Of Llms Han Jiuzhou, Collier Nigel, Buntine Wray, Shareghi Ehsan
- Medalpaca -- An Open-source Collection Of Medical Conversational AI Models And Training Data Han Tianyu, Adams Lisa C., Papaioannou Jens-michalis, Grundmann Paul, Oberhauser Tom, Löser Alexander, Truhn Daniel, Bressem Keno K.
- Chartllama: A Multimodal LLM For Chart Understanding And Generation Han Yucheng, Zhang Chi, Chen Xin, Yang Xu, Wang Zhibin, Yu Gang, Fu Bin, Zhang Hanwang
- Personalisation Within Bounds: A Risk Taxonomy And Policy Framework For The Alignment Of Large Language Models With Personalised Feedback Hannah Rose Kirk, Bertie Vidgen, Paul Röttger, Scott A. Hale
- Reasoning Implicit Sentiment With Chain-of-thought Prompting Hao Fei, Bobo Li, Qian Liu, Lidong Bing, Fei Li, Tat-seng Chua
- Blockwise Parallel Transformer For Large Context Models Hao Liu, Pieter Abbeel
- Vico: Plug-and-play Visual Condition For Personalized Text-to-image Generation Hao Shaozhe, Han Kai, Zhao Shihao, Wong Kwan-yee K.
- Toolkengpt: Augmenting Frozen Language Models With Massive Tools Via Tool Embeddings Hao Shibo, Liu Tianyang, Wang Zhen, Hu Zhiting
- Towards Efficient Vision-language Tuning: More Information Density, More Generalizability Hao Tianxiang, Lyu Mengyao, Chen Hui, Zhao Sicheng, Ding Xiaohan, Han Jungong, Ding Guiguang
- Visual Instruction Tuning Haotian Liu, Chunyuan Li, Qingyang Wu, Yong Jae Lee
- Idealgpt: Iteratively Decomposing Vision And Language Reasoning Via Large Language Models Haoxuan You, Rui Sun, Zhecan Wang, Long Chen, Gengyu Wang, Hammad A. Ayyubi, Kai-wei Chang, Shih-fu Chang
- Test-time Training On Nearest Neighbors For Large Language Models Hardt Moritz, Sun Yu
- Herd: Using Multiple, Smaller Llms To Match The Performances Of Proprietary, Large Llms Via An Intelligent Composer Hari Surya Narayanan, Thomson Matt
- Analyzing The Performance Of Chatgpt In Cardiology And Vascular Pathologies Hariri Walid
- Unlocking The Potential Of Chatgpt: A Comprehensive Exploration Of Its Applications, Advantages, Limitations, And Future Directions In Natural Language Processing Hariri Walid
- Leveraging Large Language Models For Sequential Recommendation Harte Jesse, Zorgdrager Wouter, Louridas Panos, Katsifodimos Asterios, Jannach Dietmar, Fragkoulis Marios
- The Political Ideology Of Conversational AI: Converging Evidence On Chatgpt's Pro-environmental, Left-libertarian Orientation Hartmann Jochen, Schwenzow Jasper, Witte Maximilian
- CHAI-DT: A Framework For Prompting Conversational Generative AI Agents To Actively Participate In Co-creation Harwood Brandon
- Zero- And Few-shot Prompting With Llms: A Comparative Study With Fine-tuned Models For Bangla Sentiment Analysis Hasan Md. Arid, Das Shudipta, Anjum Afiyat, Alam Firoj, Anjum Anika, Sarker Avijit, Noori Sheak Rashed Haider
- Chatgpt As Your Personal Data Scientist Hassan Md Mahadi, Knipper Alex, Santu Shubhra Kanti Karmaker
- Multilingual Language Models Are Not Multicultural: A Case Study In Emotion Havaldar Shreya, Rai Sunny, Singhal Bhumika, Liu Langchen, Guntuku Sharath Chandra, Ungar Lyle
- Spear Phishing With Large Language Models Hazell Julian
- Linear Latent World Models In Simple Transformers: A Case Study On Othello-gpt Hazineh Dean S., Zhang Zechen, Chiu Jeffery
- Wanjuan: A Comprehensive Multimodal Dataset For Advancing English And Chinese Large Models He Conghui, Jin Zhenjiang, Xu Chao, Qiu Jiantao, Wang Bin, Li Wei, Yan Hang, Wang Jiaqi, Lin Dahua
- ICL-D3IE: In-context Learning With Diverse Demonstrations Updating For Document Information Extraction He Jiabang, Wang Lei, Hu Yi, Liu Ning, Liu Hui, Xu Xing, Shen Heng Tao
- Wordart Designer: User-driven Artistic Typography Synthesis Using Large Language Models He Jun-yan, Cheng Zhi-qi, Li Chenyang, Sun Jingdong, Xiang Wangmeng, Lin Xianhui, Kang Xiaoyang, Jin Zengke, Hu Yusen, Luo Bin, Geng Yifeng, Xie Xuansong, Zhou Jingren
- Never Lost In The Middle: Mastering Long-context Question Answering With Position-agnostic Decompositional Training He Junqing, Pan Kunhao, Dong Xiaoqun, Song Zhuoyang, Liu Yibo, Sun Qianguo, Liang Yuxin, Wang Hao, Zhang Enming, Zhang Jiaxing
- Multi-modal Latent Space Learning For Chain-of-thought Reasoning In Language Models He Liqi, Li Zuchao, Cai Xiantao, Wang Ping
- You Only Prompt Once: On The Capabilities Of Prompt Learning On Large Language Models To Tackle Toxic Content He Xinlei, Zannettou Savvas, Shen Yun, Zhang Yang
- Using Natural Language Explanations To Improve Robustness Of In-context Learning He Xuanli, Wu Yuxiang, Camburu Oana-maria, Minervini Pasquale, Stenetorp Pontus
- Multimodal Graph Transformer For Multimodal Question Answering He Xuehai, Wang Xin Eric
- Localized Text-to-image Generation For Free Via Cross Attention Control He Yutong, Salakhutdinov Ruslan, Kolter J. Zico
- An Automatically Discovered Chain-of-thought Prompt Generalizes To Novel Models And Datasets Hebenstreit Konstantin, Praas Robert, Kiesewetter Louis P, Samwald Matthias
- Chatgpt For Zero-shot Dialogue State Tracking: A Solution Or An Opportunity? Heck Michael, Lubis Nurul, Ruppik Benjamin, Vukovic Renato, Feng Shutong, Geishauser Christian, Lin Hsien-chin, Van Niekerk Carel, Gašić Milica
- Divide Et Impera: Multi-transformer Architectures For Complex Nlp-tasks Helland Solveig, Gavagnin Elena, De Spindler Alexandre
- Exploring The Responses Of Large Language Models To Beginner Programmers' Help Requests Hellas Arto, Leinonen Juho, Sarsa Sami, Koutcheme Charles, Kujanpää Lilja, Sorva Juha
- Simplemtod: A Simple Language Model For Multimodal Task-oriented Dialogue With Symbolic Scene Representation Hemanthage Bhathiya, Dondrup Christian, Bartie Phil, Lemon Oliver
- How Good Are GPT Models At Machine Translation? A Comprehensive Evaluation Hendy Amr, Abdelrehim Mohamed, Sharaf Amr, Raunak Vikas, Gabr Mohamed, Matsushita Hitokazu, Kim Young Jin, Afify Mohamed, Awadalla Hany Hassan
- Can Llms Grade Short-answer Reading Comprehension Questions: An Empirical Study With A Novel Dataset Henkel Owen, Hills Libby, Roberts Bill, Mcgrane Joshua
- Stochastic Parrots Looking For Stochastic Parrots: Llms Are Easy To Fine-tune And Hard To Detect With Other Llms Henrique Da Silva Gameiro, Kucharavy Andrei, Guerraoui Rachid
- Improving Long Context Document-level Machine Translation Herold Christian, Ney Hermann
- Large Language Models Are Biased To Overestimate Profoundness Herrera-berg Eugenio, Browne Tomás Vergara, León-villagrá Pablo, Vives Marc-lluís, Calderon Cristian Buc
- Backpack Language Models Hewitt John, Thickstun John, Manning Christopher D., Liang Percy
- Assessing The Efficacy Of Large Language Models In Generating Accurate Teacher Responses Hicke Yann, Masand Abhishek, Guo Wentao, Gangavarapu Tushaar
- Harnessing The Zero-shot Power Of Instruction-tuned Large Language Model In End-to-end Speech Recognition Higuchi Yosuke, Ogawa Tetsuji, Kobayashi Tetsunori
- Newsgpt: Chatgpt Integration For Robot-reporter Hireche Abdelhadi, Belkacem Abdelkader Nasreddine, Jamil Sadia, Chen Chao
- Comparative Analysis Of GPT-4 And Human Graders In Evaluating Praise Given To Students In Synthetic Dialogues Hirunyasiri Dollaya, Thomas Danielle R., Lin Jionghao, Koedinger Kenneth R., Aleven Vincent
- Do Compressed Llms Forget Knowledge? An Experimental Study With Practical Implications Hoang Duc N. M, Cho Minsik, Merth Thomas, Rastegari Mohammad, Wang Zhangyang
- Response: Emergent Analogical Reasoning In Large Language Models Hodel Damian, West Jevin
- Evaluating Large Language Models On A Highly-specialized Topic, Radiation Oncology Physics Holmes Jason, Liu Zhengliang, Zhang Lian, Ding Yuzhen, Sio Terence T., Mcgee Lisa A., Ashman Jonathan B., Li Xiang, Liu Tianming, Shen Jiajian, Liu Wei
- L2MAC: Large Language Model Automatic Computer For Extensive Code Generation Holt Samuel, Luyten Max Ruiz, Van Der Schaar Mihaela
- DP-OPT: Make Large Language Model Your Privacy-preserving Prompt Engineer Hong Junyuan, Wang Jiachen T., Zhang Chenhui, Li Zhangheng, Li Bo, Wang Zhangyang
- Direct2v: Large Language Models Are Frame-level Directors For Zero-shot Text-to-video Generation Hong Susung, Seo Junyoung, Shin Heeseong, Hong Sunghwan, Kim Seungryong
- Cogagent: A Visual Language Model For GUI Agents Hong Wenyi, Wang Weihan, Lv Qingsong, Xu Jiazheng, Yu Wenmeng, Ji Junhui, Wang Yan, Wang Zihan, Zhang Yuxuan, Li Juanzi, Xu Bin, Dong Yuxiao, Ding Ming, Tang Jie
- Do Large Language Models And Humans Have Similar Behaviors In Causal Inference With Script Knowledge? Hong Xudong, Ryzhova Margarita, Biondi Daniel Adrian, Demberg Vera
- SPEED: Speculative Pipelined Execution For Efficient Decoding Hooper Coleman, Kim Sehoon, Mohammadzadeh Hiva, Genc Hasan, Keutzer Kurt, Gholami Amir, Shao Sophia
- More Robots Are Coming: Large Multimodal Models (chatgpt) Can Solve Visually Diverse Images Of Parsons Problems Hou Irene, Man Owen, Mettille Sophie, Gutierrez Sebastian, Angelikas Kenneth, Macneil Stephen
- Towards A Mechanistic Interpretation Of Multi-step Reasoning Capabilities Of Language Models Hou Yifan, Li Jiaoda, Fei Yu, Stolfo Alessandro, Zhou Wangchunshu, Zeng Guangtao, Bosselut Antoine, Sachan Mrinmaya
- Bad Actor, Good Advisor: Exploring The Role Of Large Language Models In Fake News Detection Hu Beizhe, Sheng Qiang, Cao Juan, Shi Yuhui, Li Yang, Wang Danding, Qi Peng
- Chain-of-symbol Prompting Elicits Planning In Large Language Models Hu Hanxu, Lu Hongyuan, Zhang Huajian, Song Yun-ze, Lam Wai, Zhang Yue
- Context-aware Prompt Tuning For Vision-language Model With Dual-alignment Hu Hongyu, Lin Tiancheng, Wang Jie, Sun Zhenbang, Xu Yi
- Ladder-of-thought: Using Knowledge As Steps To Elevate Stance Detection Hu Kairui, Yan Ming, Zhou Joey Tianyi, Tsang Ivor W., Chong Wen Haw, Yap Yong Keong
- COMMA: Co-articulated Multi-modal Learning Hu Lianyu, Gao Liqing, Liu Zekang, Pun Chi-man, Feng Wei
- Uncertainty-aware Unlikelihood Learning Improves Generative Aspect Sentiment Quad Prediction Hu Mengting, Bai Yinhao, Wu Yike, Zhang Zhen, Zhang Liqi, Gao Hang, Zhao Shiwan, Huang Minlie
- Enhancing Answer Selection In Community Question Answering With Pre-trained And Large Language Models Hu Xinghang
- Decipherpref: Analyzing Influential Factors In Human Preference Judgments Via GPT-4 Hu Yebowen, Song Kaiqiang, Cho Sangwoo, Wang Xiaoyang, Foroosh Hassan, Liu Fei
- Llm-adapters: An Adapter Family For Parameter-efficient Fine-tuning Of Large Language Models Hu Zhiqiang, Wang Lei, Lan Yihuai, Xu Wanyu, Lim Ee-peng, Bing Lidong, Xu Xing, Poria Soujanya, Lee Roy Ka-wei
- Unlocking The Potential Of User Feedback: Leveraging Large Language Model As User Simulator To Enhance Dialogue System Hu Zhiyuan, Feng Yue, Luu Anh Tuan, Hooi Bryan, Lipani Aldo
- POP: Prompt Of Prompts For Continual Learning Hu Zhiyuan, Lyu Jiancheng, Gao Dashan, Vasconcelos Nuno
- SMILE: Single-turn To Multi-turn Inclusive Language Expansion Via Chatgpt For Mental Health Support Huachuan Qiu, Hongliang He, Shuai Zhang, Anqi Li, Zhenzhong Lan
- Enhancing Large Language Models In Coding Through Multi-perspective Self-consistency Huang Baizhou, Lu Shuai, Chen Weizhu, Wan Xiaojun, Duan Nan
- Towards Equipping Transformer With The Ability Of Systematic Compositionality Huang Chen, Qin Peixin, Lei Wenqiang, Lv Jiancheng
- Agentcoder: Multi-agent-based Code Generation With Iterative Testing And Optimisation Huang Dong, Zhang Jie M., Luck Michael, Bu Qingwen, Qing Yuhao, Cui Heming
- Is Chatgpt Better Than Human Annotators? Potential And Limitations Of Chatgpt In Explaining Implicit Hate Speech Huang Fan, Kwak Haewoon, An Jisun
- Directed Acyclic Transformer Pre-training For High-quality Non-autoregressive Text Generation Huang Fei, Ke Pei, Huang Minlie
- Chatgpt For Shaping The Future Of Dentistry: The Potential Of Multi-modal Large Language Model Huang Hanyao, Zheng Ou, Wang Dongdong, Yin Jiayi, Wang Zijin, Ding Shengxuan, Yin Heng, Xu Chuan, Yang Renjie, Zheng Qian, Shi Bing
- Free-bloom: Zero-shot Text-to-video Generator With LLM Director And LDM Animator Huang Hanzhuo, Feng Yufan, Shi Cheng, Xu Lan, Yu Jingyi, Yang Sibei
- Acegpt, Localizing Large Language Models In Arabic Huang Huang, Yu Fei, Zhu Jianqing, Sun Xuening, Cheng Hao, Song Dingjie, Chen Zhihong, Alharthi Abdulmohsen, An Bang, He Juncai, Liu Ziche, Zhang Zhiyi, Chen Junying, Li Jianquan, Wang Benyou, Zhang Lian, Sun Ruoyu, Wan Xiang, Li Haizhou, Xu Jinchao
- Revisiting The Reliability Of Psychological Scales On Large Language Models Huang Jen-tse, Wang Wenxuan, Lam Man Ho, Li Eric John, Jiao Wenxiang, Lyu Michael R.
- Who Is Chatgpt? Benchmarking Llms' Psychological Portrayal Using Psychobench Huang Jen-tse, Wang Wenxuan, Li Eric John, Lam Man Ho, Ren Shujie, Yuan Youliang, Jiao Wenxiang, Tu Zhaopeng, Lyu Michael R.
- A Survey On Hallucination In Large Language Models: Principles, Taxonomy, Challenges, And Open Questions Huang Lei, Yu Weijiang, Ma Weitao, Zhong Weihong, Feng Zhangyin, Wang Haotian, Chen Qianglong, Peng Weihua, Feng Xiaocheng, Qin Bing, Liu Ting
- Lawyer Llama Technical Report Huang Quzhe, Tao Mingxu, Zhang Chen, An Zhenwei, Jiang Cong, Chen Zhibin, Wu Zirui, Feng Yansong
- Can Large Language Models Explain Themselves? A Study Of Llm-generated Self-explanations Huang Shiyuan, Mamidanna Siddarth, Jangam Shreedhar, Zhou Yilun, Gilpin Leilani H.
- Adapting Pre-trained Language Models To Vision-language Tasks Via Dynamic Visual Prompting Huang Shubin, Wu Qiong, Zhou Yiyi, Chen Weijie, Zhang Rongsheng, Sun Xiaoshuai, Ji Rongrong
- Lateval: An Interactive Llms Evaluation Benchmark With Incomplete Information From Lateral Thinking Puzzles Huang Shulin, Ma Shirong, Li Yinghui, Huang Mengzuo, Zou Wuhe, Zhang Weidong, Zheng Hai-tao
- Fewer Is More: Boosting LLM Reasoning With Reinforced Context Pruning Huang Xijie, Zhang Li Lyna, Cheng Kwang-ting, Yang Fan, Yang Mao
- Generic Attention-model Explainability By Weighted Relevance Accumulation Huang Yiming, Jia Aozhe, Zhang Xiaodan, Zhang Jiawei
- Competition-level Problems Are Effective LLM Evaluators Huang Yiming, Lin Zhenghao, Liu Xiao, Gong Yeyun, Lu Shuai, Lei Fangyu, Liang Yaobo, Shen Yelong, Lin Chen, Duan Nan, Chen Weizhu
- Trustgpt: A Benchmark For Trustworthy And Responsible Large Language Models Huang Yue, Zhang Qihui, Yu Philip S., Sun Lichao
- Advancing Transformer Architecture In Long-context Large Language Models: A Comprehensive Survey Huang Yunpeng, Xu Jingwei, Lai Junyu, Jiang Zixu, Chen Taolue, Li Zenan, Yao Yuan, Ma Xiaoxing, Yang Lijuan, Chen Hao, Li Shupeng, Zhao Penghao
- Evaluating And Enhancing Large Language Models For Conversational Reasoning On Knowledge Graphs Huang Yuxuan, Shi Lida, Liu Anqi, Xu Hao
- C-eval: A Multi-level Multi-discipline Chinese Evaluation Suite For Foundation Models Huang Yuzhen, Bai Yuzhuo, Zhu Zhihao, Zhang Junlei, Zhang Jinghan, Su Tangjun, Liu Junteng, Lv Chuancheng, Zhang Yikai, Lei Jiayi, Fu Yao, Sun Maosong, He Junxian
- How Does Pretraining Improve Discourse-aware Translation? Huang Zhihong, Wang Longyue, Liu Siyou, Wong Derek F.
- Comparing Large Language Model AI And Human-generated Coaching Messages For Behavioral Weight Loss Huang Zhuoran, Berry Michael P., Chwyl Christina, Hsieh Gary, Wei Jing, Forman Evan M.
- Llama: Open And Efficient Foundation Language Models Hugo Touvron, Thibaut Lavril, Gautier Izacard, Xavier Martinet, Marie-anne Lachaux, Timothée Lacroix, Baptiste Rozière, Naman Goyal, Eric Hambro, Faisal Azhar, Aurelien Rodriguez, Armand Joulin, Edouard Grave, Guillaume Lample
- TADA: Efficient Task-agnostic Domain Adaptation For Transformers Hung Chia-chien, Lange Lukas, Strötgen Jannik
- Walking A Tightrope -- Evaluating Large Language Models In High-risk Domains Hung Chia-chien, Rim Wiem Ben, Frost Lindsay, Bruckner Lars, Lawrence Carolin
- Do You Trust Chatgpt? -- Perceived Credibility Of Human And Ai-generated Content Huschens Martin, Briesch Martin, Sobania Dominik, Rothlauf Franz
- Understanding The Effectiveness Of Very Large Language Models On Dialog Evaluation Huynh Jessica, Jiao Cathy, Gupta Prakhar, Mehri Shikib, Bajaj Payal, Chaudhary Vishrav, Eskenazi Maxine
- Large Language Models As Sous Chefs: Revising Recipes With GPT-3 Hwang Alyssa, Li Bryan, Hou Zhaoyi, Roth Dan
- Pre-gated Moe: An Algorithm-system Co-design For Fast And Scalable Mixture-of-expert Inference Hwang Ranggi, Wei Jianyu, Cao Shijie, Hwang Changho, Tang Xiaohu, Cao Ting, Yang Mao
- Conceptual Model Interpreter For Large Language Models Härer Felix
- Minimizing Factual Inconsistency And Hallucination In Large Language Models I Muneeswaran, Saxena Shreya, Prasad Siva, Prakash M V Sai, Shankar Advaith, V Varun, Vaddina Vishal, Gopalakrishnan Saisubramaniam
- Large Language Model Displays Emergent Ability To Interpret Novel Literary Metaphors Ichien Nicholas, Stamenković Dušan, Holyoak Keith J.
- Diversigate: A Comprehensive Framework For Reliable Large Language Models Imani Shima, Beyram Ali, Shrivastava Harsh
- Mathprompter: Mathematical Reasoning Using Large Language Models Imani Shima, Du Liang, Shrivastava Harsh
- Multitool-cot: GPT-3 Can Use Multiple External Tools With Chain Of Thought Prompting Inaba Tatsuro, Kiyomaru Hirokazu, Cheng Fei, Kurohashi Sadao
- Leveraging Large Language Models To Generate Answer Set Programs Ishay Adam, Yang Zhun, Lee Joohyung
- Instructed To Bias: Instruction-tuned Language Models Exhibit Emergent Cognitive Bias Itzhak Itay, Stanovsky Gabriel, Rosenfeld Nir, Belinkov Yonatan
- Camels In A Changing Climate: Enhancing LM Adaptation With Tulu 2 Ivison Hamish, Wang Yizhong, Pyatkin Valentina, Lambert Nathan, Peters Matthew, Dasigi Pradeep, Jang Joel, Wadden David, Smith Noah A., Beltagy Iz, Hajishirzi Hannaneh
- 14 Examples Of How Llms Can Transform Materials Science And Chemistry: A Reflection On A Large Language Model Hackathon Jablonka Kevin Maik, Ai Qianxiang, Al-feghali Alexander, Badhwar Shruti, Bocarsly Joshua D., Bran Andres M, Bringuier Stefan, Brinson L. Catherine, Choudhary Kamal, Circi Defne, Cox Sam, De Jong Wibe A., Evans Matthew L., Gastellu Nicolas, Genzling Jerome, Gil María Victoria, Gupta Ankur K., Hong Zhi, Imran Alishba, Kruschwitz Sabine, Labarre Anne, Lála Jakub, Liu Tao, Ma Steven, Majumdar Sauradeep, Merz Garrett W., Moitessier Nicolas, Moubarak Elias, Mouriño Beatriz, Pelkie Brenden, Pieler Michael, Ramos Mayk Caldas, Ranković Bojana, Rodriques Samuel G., Sanders Jacob N., Schwaller Philippe, Schwarting Marcus, Shi Jiale, Smit Berend, Smith Ben E., Van Herck Joren, Völker Christoph, Ward Logan, Warren Sean, Weiser Benjamin, Zhang Sylvester, Zhang Xiaoqi, Zia Ghezal Ahmad, Scourtas Aristana, Schmidt Kj, Foster Ian, White Andrew D., Blaiszik Ben
- Emergent Ai-assisted Discourse: Case Study Of A Second Language Writer Authoring With Chatgpt Jacob Sharin, Tate Tamara, Warschauer Mark
- Neftune: Noisy Embeddings Improve Instruction Finetuning Jain Neel, Chiang Ping-yeh, Wen Yuxin, Kirchenbauer John, Chu Hong-min, Somepalli Gowthami, Bartoldson Brian R., Kailkhura Bhavya, Schwarzschild Avi, Saha Aniruddha, Goldblum Micah, Geiping Jonas, Goldstein Tom
- Breaking The Token Barrier: Chunking And Convolution For Efficient Long Text Classification With BERT Jaiswal Aman, Milios Evangelos
- Co-writing With Opinionated Language Models Affects Users' Views Jakesch Maurice, Bhat Advait, Buschek Daniel, Zalmanson Lior, Naaman Mor
- Cotran: An Llm-based Code Translator Using Reinforcement Learning With Feedback From Compiler And Symbolic Execution Jana Prithwish, Jha Piyush, Ju Haoyang, Kishore Gautham, Mahajan Aryan, Ganesh Vijay
- Faithful Persona-based Conversational Dataset Generation With Large Language Models Jandaghi Pegah, Sheng Xianghai, Bai Xinyi, Pujara Jay, Sidahmed Hakim
- Consistency Analysis Of Chatgpt Jang Myeongjun Erik, Lukasiewicz Thomas
- Chatgpt Is Fun, But It Is Not Funny! Humor Is Still Challenging Large Language Models Jentzsch Sophie, Kersting Kristian
- Hijacking Context In Large Multi-modal Models Jeong Joonhyun
- Inpars-v2: Large Language Models As Efficient Dataset Generators For Information Retrieval Jeronymo Vitor, Bonifacio Luiz, Abonizio Hugo, Fadaee Marzieh, Lotufo Roberto, Zavrel Jakub, Nogueira Rodrigo
- Just CHOP: Embarrassingly Simple LLM Compression Jha Ananya Harsh, Sherborne Tom, Walsh Evan Pete, Groeneveld Dirk, Strubell Emma, Beltagy Iz
- Neuro Symbolic Reasoning For Planning: Counterexample Guided Inductive Synthesis Using Large Language Models And Satisfiability Solving Jha Sumit Kumar, Jha Susmit, Lincoln Patrick, Bastian Nathaniel D., Velasquez Alvaro, Ewetz Rickard, Neema Sandeep
- Large Language Models As Automated Aligners For Benchmarking Vision-language Models Ji Yuanfeng, Ge Chongjian, Kong Weikai, Xie Enze, Liu Zhengying, Li Zhengguo, Luo Ping
- Exploring The Impact Of Instruction Data Scaling On Large Language Models: An Empirical Study On Real-world Use Cases Ji Yunjie, Deng Yong, Gong Yan, Peng Yiping, Niu Qiang, Zhang Lei, Ma Baochang, Li Xiangang
- Zero-shot Faithfulness Evaluation For Text Summarization With Foundation Language Model Jia Qi, Ren Siyu, Liu Yizhu, Zhu Kenny Q.
- Bootstrapping Vision-language Learning With Decoupled Language Pre-training Jian Yiren, Gao Chongyang, Vosoughi Soroush
- Mistral 7B Jiang Albert Q., Sablayrolles Alexandre, Mensch Arthur, Bamford Chris, Chaplot Devendra Singh, Casas Diego De Las, Bressand Florian, Lengyel Gianna, Lample Guillaume, Saulnier Lucile, Lavaud Lélio Renard, Lachaux Marie-anne, Stock Pierre, Scao Teven Le, Lavril Thibaut, Wang Thomas, Lacroix Timothée, Sayed William El
- Bus: Efficient And Effective Vision-language Pre-training With Bottom-up Patch Summarization Jiang Chaoya, Xu Haiyang, Ye Wei, Ye Qinghao, Li Chenliang, Yan Ming, Bi Bin, Zhang Shikun, Huang Fei, Huang Songfang
- Legal Syllogism Prompting: Teaching Large Language Models For Legal Judgment Prediction Jiang Cong, Yang Xiaolei
- Tigerscore: Towards Building Explainable Metric For All Text Generation Tasks Jiang Dongfu, Li Yishan, Zhang Ge, Huang Wenhao, Lin Bill Yuchen, Chen Wenhu
- Llm-blender: Ensembling Large Language Models With Pairwise Ranking And Generative Fusion Jiang Dongfu, Ren Xiang, Lin Bill Yuchen
- You Only Forward Once: Prediction And Rationalization In A Single Forward Pass Jiang Han, Duan Junwen, Qu Zhe, Wang Jianxin
- Personallm: Investigating The Ability Of Large Language Models To Express Personality Traits Jiang Hang, Zhang Xiajie, Cao Xubo, Breazeal Cynthia, Roy Deb, Kabbara Jad
- Llmlingua: Compressing Prompts For Accelerated Inference Of Large Language Models Jiang Huiqiang, Wu Qianhui, Lin Chin-yew, Yang Yuqing, Qiu Lili
- Longllmlingua: Accelerating And Enhancing Llms In Long Context Scenarios Via Prompt Compression Jiang Huiqiang, Wu Qianhui, Luo Xufang, Li Dongsheng, Lin Chin-yew, Yang Yuqing, Qiu Lili
- Graphologue: Exploring Large Language Model Responses With Interactive Diagrams Jiang Peiling, Rayan Jude, Dow Steven P., Xia Haijun
- Forcing Generative Models To Degenerate Ones: The Power Of Data Poisoning Attacks Jiang Shuli, Kadhe Swanand Ravindra, Zhou Yi, Cai Ling, Baracaldo Nathalie
- Selfevolve: A Code Evolution Framework Via Large Language Models Jiang Shuyang, Wang Yuhao, Wang Yu
- Prompt Packer: Deceiving Llms Through Compositional Instruction With Hidden Attacks Jiang Shuyu, Chen Xingshu, Tang Rui
- BRAINTEASER: Lateral Thinking Puzzles For Large Language Models Jiang Yifan, Ilievski Filip, Ma Kaixin, Sourati Zhivar
- Lion: Adversarial Distillation Of Proprietary Large Language Models Jiang Yuxin, Chan Chunkit, Chen Mingyang, Wang Wei
- Parrot: Translating During Chat Using Large Language Models Tuned With Human Translation And Feedback Jiao Wenxiang, Huang Jen-tse, Wang Wenxuan, He Zhiwei, Liang Tian, Wang Xing, Shi Shuming, Tu Zhaopeng
- Storyanalogy: Deriving Story-level Analogies From Large Language Models To Unlock Analogical Understanding Jiayang Cheng, Qiu Lin, Chan Tsz Ho, Fang Tianqing, Wang Weiqi, Chan Chunkit, Ru Dongyu, Guo Qipeng, Zhang Hongming, Song Yangqiu, Zhang Yue, Zhang Zheng
- Prompt-based Length Controlled Generation With Reinforcement Learning Jie Renlong, Meng Xiaojun, Shang Lifeng, Jiang Xin, Liu Qun
- Emergent Representations Of Program Semantics In Language Models Trained On Programs Jin Charles, Rinard Martin
- Medcpt: Contrastive Pre-trained Transformers With Large-scale Pubmed Search Logs For Zero-shot Biomedical Information Retrieval Jin Qiao, Kim Won, Chen Qingyu, Comeau Donald C., Yeganova Lana, Wilbur W. John, Lu Zhiyong
- Binary Code Summarization: Benchmarking Chatgpt/GPT-4 And Other Large Language Models Jin Xin, Larson Jonathan, Yang Weiwei, Lin Zhiqiang
- Vcoder: Versatile Vision Encoders For Multimodal Large Language Models Jitesh Jain, Jianwei Yang, Humphrey Shi
- Is Chatgpt Fair For Recommendation? Evaluating Fairness In Large Language Model Recommendation Jizhi Zhang, Keqin Bao, Yang Zhang, Wenjie Wang, Fuli Feng, Xiangnan He
- GPT Is Becoming A Turing Machine: Here Are Some Ways To Program It Jojic Ana, Wang Zhen, Jojic Nebojsa
- Does GPT-4 Pass The Turing Test? Jones Cameron R., Bergen Benjamin K.
- "It's Not Like Jarvis, But It's Pretty Close!" -- Examining Chatgpt's Usage Among Undergraduate Students In Computer Science Joshi Ishika, Budhiraja Ritvik, Akolekar Harshal D, Challa Jagat Sesh, Kumar Dhruv
- Chatgpt In The Classroom: An Analysis Of Its Strengths And Weaknesses For Solving Undergraduate Computer Science Questions Joshi Ishika, Budhiraja Ritvik, Dev Harshal, Kadia Jahnvi, Ataullah M. Osama, Mitra Sayan, Kumar Dhruv, Akolekar Harshal D.
- A Glimpse In Chatgpt Capabilities And Its Impact For AI Research Joublin Frank, Ceravola Antonello, Deigmoeller Joerg, Gienger Michael, Franzius Mathias, Eggert Julian
- Small Language Models Fine-tuned To Coordinate Larger Language Models Improve Complex Reasoning Juneja Gurusha, Dutta Subhabrata, Chakrabarti Soumen, Manchanda Sunny, Chakraborty Tanmoy
- Impossible Distillation: From Low-quality Model To High-quality Dataset & Model For Summarization And Paraphrasing Jung Jaehun, West Peter, Jiang Liwei, Brahman Faeze, Lu Ximing, Fisher Jillian, Sorensen Taylor, Choi Yejin
- A Case Study On Test Case Construction With Large Language Models: Unveiling Practical Insights And Challenges Junior Roberto Francisco De Lima, Presta Luiz Fernando Paes De Barros, Borborema Lucca Santos, Da Silva Vanderson Nogueira, Dahia Marcio Leal De Melo, Santos Anderson Carlos Sousa E
- A Comprehensive Capability Analysis Of GPT-3 And GPT-3.5 Series Models Junjie Ye, Xuanting Chen, Nuo Xu, Can Zu, Zekai Shao, Shichun Liu, Yuhan Cui, Zeyang Zhou, Chao Gong, Yang Shen, Jie Zhou, Siming Chen, Tao Gui, Qi Zhang, Xuanjing Huang
- Recommendation As Instruction Following: A Large Language Model Empowered Recommendation Approach Junjie Zhang, Ruobing Xie, Yupeng Hou, Wayne Xin Zhao, Leyu Lin, Ji-rong Wen
- Is Chatgpt A Good Recommender? A Preliminary Study Junling Liu, Chao Liu, Peilin Zhou, Renjie Lv, Kang Zhou, Yan Zhang
- BLIP-2: Bootstrapping Language-image Pre-training With Frozen Image Encoders And Large Language Models Junnan Li, Dongxu Li, Silvio Savarese, Steven Hoi
- Evaluation And Analysis Of Hallucination In Large Vision-language Models Junyang Wang, Yiyang Zhou, Guohai Xu, Pengcheng Shi, Chenlin Zhao, Haiyang Xu, Qinghao Ye, Ming Yan, Ji Zhang, Jihua Zhu, Jitao Sang, Haoyu Tang
- AMBER: An Llm-free Multi-dimensional Benchmark For Mllms Hallucination Evaluation Junyang Wang, Yuhang Wang, Guohai Xu, Jing Zhang, Yukai Gu, Haitao Jia, Jiaqi Wang, Haiyang Xu, Ming Yan, Ji Zhang, Jitao Sang
- TARJAMAT: Evaluation Of Bard And Chatgpt On Machine Translation Of Ten Arabic Varieties Kadaoui Karima, Magdy Samar M., Waheed Abdul, Khondaker Md Tawkat Islam, El-shangiti Ahmed Oumar, Nagoudi El Moatez Billah, Abdul-mageed Muhammad
- Domain-specific Code Language Models: Unraveling The Potential For HPC Codes And Tasks Kadosh Tal, Hasabnis Niranjan, Vo Vy A., Schneider Nadav, Krien Neva, Capota Mihai, Wasay Abdul, Ahmed Nesreen, Willke Ted, Tamir Guy, Pinter Yuval, Mattson Timothy, Oren Gal
- Robust Recommender System: A Survey And Future Directions Kaike Zhang, Qi Cao, Fei Sun, Yunfan Wu, Shuchang Tao, Huawei Shen, Xueqi Cheng
- Robust Training For Conversational Question Answering Models With Reinforced Reformulation Generation Kaiser Magdalena, Roy Rishiraj Saha, Weikum Gerhard
- Calibrated Language Models Must Hallucinate Kalai Adam Tauman, Vempala Santosh S.
- Swectrl-mini: A Data-transparent Transformer-based Large Language Model For Controllable Text Generation In Swedish Kalpakchi Dmytro, Boye Johan
- A Survey Of GPT-3 Family Large Language Models Including Chatgpt And GPT-4 Kalyan Katikapalli Subramanyam
- Syntax-guided Transformers: Elevating Compositional Generalization And Grounding In Multimodal Environments Kamali Danial, Kordjamshidi Parisa
- Evaluating Open-domain Question Answering In The Era Of Large Language Models Kamalloo Ehsan, Dziri Nouha, Clarke Charles L. A., Rafiei Davood
- Dialmat: Dialogue-enabled Transformer With Moment-based Adversarial Training Kaneda Kanta, Korekata Ryosuke, Wada Yuiga, Nagashima Shunya, Kambara Motonari, Iioka Yui, Matsuo Haruka, Imai Yuto, Nishimura Takayuki, Sugiura Komei
- Reducing Sequence Length By Predicting Edit Operations With Large Language Models Kaneko Masahiro, Okazaki Naoaki
- Llm-augmented Preference Learning From Natural Language Kang Inwon, Ruan Sikai, Ho Tyler, Lin Jui-chien, Mohsin Farhad, Seneviratne Oshani, Xia Lirong
- Knowledge-augmented Reasoning Distillation For Small Language Models In Knowledge-intensive Tasks Kang Minki, Lee Seanie, Baek Jinheon, Kawaguchi Kenji, Hwang Sung Ju
- Chatmof: An Autonomous AI System For Predicting And Generating Metal-organic Frameworks Kang Yeonghun, Kim Jihan
- Shs-nlp At Radsum23: Domain-adaptive Pre-training Of Instruction-tuned Llms For Radiology Report Impression Generation Karn Sanjeev Kumar, Ghosh Rikhiya, P Kusuma, Farri Oladimeji
- Large Language Models Effectively Leverage Document-level Context For Literary Translation, But Critical Errors Persist Karpinska Marzena, Iyyer Mohit
- Exploring Automatic Evaluation Methods Based On A Decoder-based LLM For Text Generation Kasahara Tomohito, Kawahara Daisuke
- Evaluating GPT-4 And Chatgpt On Japanese Medical Licensing Examinations Kasai Jungo, Kasai Yuhei, Sakaguchi Keisuke, Yamada Yutaro, Radev Dragomir
- Evaluating Large Language Models For Health-related Queries With Presuppositions Kaur Navreet, Choudhury Monojit, Pruthi Danish
- Sorted Llama: Unlocking The Potential Of Intermediate Layers Of Large Language Models For Dynamic Inference Kavehzadeh Parsa, Valipour Mojtaba, Tahaei Marzieh, Ghodsi Ali, Chen Boxing, Rezagholizadeh Mehdi
- Application Of Frozen Large-scale Models To Multimodal Task-oriented Dialogue Kawamoto Tatsuki, Suzuki Takuma, Miyama Ko, Meguro Takumi, Takagi Tomohiro
- The Impact Of Positional Encoding On Length Generalization In Transformers Kazemnejad Amirhossein, Padhi Inkit, Ramamurthy Karthikeyan Natesan, Das Payel, Reddy Siva
- Critiquellm: Towards An Informative Critique Generation Model For Evaluation Of Large Language Model Generation Ke Pei, Wen Bosi, Feng Zhuoer, Liu Xiao, Lei Xuanyu, Cheng Jiale, Wang Shengyuan, Zeng Aohan, Dong Yuxiao, Wang Hongning, Tang Jie, Huang Minlie
- Appropriateness Is All You Need! Kempt Hendrik, Lavie Alon, Nagel Saskia K.
- Shikra: Unleashing Multimodal Llm's Referential Dialogue Magic Keqin Chen, Zhao Zhang, Weili Zeng, Richong Zhang, Feng Zhu, Rui Zhao
- Unnatural Language Processing: How Do Language Models Handle Machine-generated Prompts? Kervadec Corentin, Franzon Francesca, Baroni Marco
- BLESS: Benchmarking Large Language Models On Sentence Simplification Kew Tannon, Chi Alison, Vásquez-rodríguez Laura, Agrawal Sweta, Aumiller Dennis, Alva-manchego Fernando, Shardlow Matthew
- Can Chatgpt And Bard Generate Aligned Assessment Items? A Reliability Analysis Against Human Performance Khademi Abdolvahab
- Assessing The Promise And Pitfalls Of Chatgpt For Automated Code Generation Khan Muhammad Fawad Akbar, Ramsdell Max, Falor Erik, Karimi Hamid
- Q: How To Specialize Large Vision-language Models To Data-scarce VQA Tasks? A: Self-train On Unlabeled Images! Khan Zaid, Bg Vijay Kumar, Schulter Samuel, Yu Xiang, Fu Yun, Chandraker Manmohan
- Zzzgpt: An Interactive GPT Approach To Enhance Sleep Quality Khaokaew Yonchanok, Ji Kaixin, Nguyen Thuc Hanh, Kegalle Hiruni, Alaofi Marwah, Xue Hao, Salim Flora D.
- Tst\(^\mathrm{r}\): Target Similarity Tuning Meets The Real World Khatry Anirudh, Gulwani Sumit, Gupta Priyanshu, Le Vu, Singha Ananya, Singh Mukul, Verbruggen Gust
- Dspy: Compiling Declarative Language Model Calls Into Self-improving Pipelines Khattab Omar, Singhvi Arnav, Maheshwari Paridhi, Zhang Zhiyuan, Santhanam Keshav, Vardhamanan Sri, Haq Saiful, Sharma Ashutosh, Joshi Thomas T., Moazam Hanna, Miller Heather, Zaharia Matei, Potts Christopher
- Reliability Check: An Analysis Of Gpt-3's Response To Sensitive Topics And Prompt Wording Khatun Aisha, Brown Daniel G.
- Gptaraeval: A Comprehensive Evaluation Of Chatgpt On Arabic NLP Khondaker Md Tawkat Islam, Waheed Abdul, Nagoudi El Moatez Billah, Abdul-mageed Muhammad
- How Secure Is Code Generated By Chatgpt? Khoury Raphaël, Avila Anderson R., Brunelle Jacob, Camara Baba Mamadou
- Exploring The Potential Of Large Language Models To Generate Formative Programming Feedback Kiesler Natalie, Lohr Dominic, Keuning Hieke
- Large Language Models In Introductory Programming Education: Chatgpt's Performance And Implications For Assessments Kiesler Natalie, Schiffner Daniel
- Expediting Contrastive Language-image Pretraining Via Self-distilled Encoders Kim Bumsoo, Kim Jinhyung, Jo Yeonsik, Kim Seung Hwan
- Visually-situated Natural Language Understanding With Contrastive Reading Model And Frozen Large Language Models Kim Geewook, Lee Hodong, Kim Daehee, Jung Haeji, Park Sanghee, Kim Yoonsik, Yun Sangdoo, Kil Taeho, Lee Bado, Park Seunghyun
- Language Models Can Solve Computer Tasks Kim Geunwoo, Baldi Pierre, Mcaleer Stephen
- GTA: Gated Toxicity Avoidance For LM Performance Preservation Kim Heegyu, Cho Hyunsouk
- Compressed Context Memory For Online Language Model Interaction Kim Jang-hyun, Yeom Junyoung, Yun Sangdoo, Song Hyun Oh
- Which Is Better? Exploring Prompting Strategy For Llm-based Metrics Kim Joonghoon, Park Saeran, Jeong Kiyoon, Lee Sangmin, Han Seung Hun, Lee Jiyoon, Kang Pilsung
- Speculative Decoding With Big Little Decoder Kim Sehoon, Mangalam Karttikeya, Moon Suhong, Malik Jitendra, Mahoney Michael W., Gholami Amir, Keutzer Kurt
- Prometheus: Inducing Fine-grained Evaluation Capability In Language Models Kim Seungone, Shin Jamin, Cho Yejin, Jang Joel, Longpre Shayne, Lee Hwaran, Yun Sangdoo, Shin Seongjin, Kim Sungdong, Thorne James, Seo Minjoon
- Aligning Large Language Models Through Synthetic Feedback Kim Sungdong, Bae Sanghwan, Shin Jamin, Kang Soyoung, Kwak Donghyun, Yoo Kang Min, Seo Minjoon
- MAGVLT: Masked Generative Vision-and-language Transformer Kim Sungwoong, Jo Daejin, Lee Donghoon, Kim Jongmin
- Exploiting Language Models As A Source Of Knowledge For Cognitive Agents Kirk James R., Wray Robert E., Laird John E.
- Can Language Models Laugh At Youtube Short-form Videos? Ko Dayoon, Lee Sangho, Kim Gunhee
- Large Language Models Are Temporal And Causal Reasoners For Video Question Answering Ko Dohwan, Lee Ji Soo, Kang Wooyoung, Roh Byungseok, Kim Hyunwoo J.
- Revisiting Intermediate Layer Distillation For Compressing Language Models: An Overfitting Perspective Ko Jongwoo, Park Seungjoon, Jeong Minchan, Hong Sukjin, Ahn Euijai, Chang Du-seong, Yun Se-young
- Transformer Language Models Handle Word Frequency In Prediction Head Kobayashi Goro, Kuribayashi Tatsuki, Yokoi Sho, Inui Kentaro
- Conversational Ai-powered Design: Chatgpt As Designer, User, And Product Kocaballi A. Baki
- Large Language Models Are State-of-the-art Evaluators Of Translation Quality Kocmi Tom, Federmann Christian
- Chatgpt: Jack Of All Trades, Master Of None Kocoń Jan, Cichecki Igor, Kaszyca Oliwier, Kochanek Mateusz, Szydło Dominika, Baran Joanna, Bielaniewicz Julita, Gruza Marcin, Janz Arkadiusz, Kanclerz Kamil, Kocoń Anna, Koptyra Bartłomiej, Mieleszczenko-kowszewicz Wiktoria, Miłkowski Piotr, Oleksy Marcin, Piasecki Maciej, Radliński Łukasz, Wojtasik Konrad, Woźniak Stanisław, Kazienko Przemysław
- Writing Your Own Book: A Method For Going From Closed To Open Book QA To Improve Robustness And Performance Of Smaller Llms Kokaia Giorgi, Sinha Pratyush, Jiang Yutong, Boujemaa Nozha
- Better Zero-shot Reasoning With Role-play Prompting Kong Aobo, Zhao Shiwan, Chen Hao, Li Qicheng, Qin Yong, Sun Ruiqi, Zhou Xin, Wang Enzhi, Dong Xiaohang
- Platolm: Teaching Llms In Multi-round Dialogue Via A User Simulator Kong Chuyi, Fan Yaxin, Wan Xiang, Jiang Feng, Wang Benyou
- NOLA: Compressing Lora Using Linear Combination Of Random Basis Koohpayegani Soroush Abbasi, Navaneet Kl, Nooralinejad Parsa, Kolouri Soheil, Pirsiavash Hamed
- Humans In Humans Out: On GPT Converging Toward Common Sense In Both Success And Failure Koralus Philipp, Wang-maścianica Vincent
- Performance Of The Pre-trained Large Language Model GPT-4 On Automated Short Answer Grading Kortemeyer Gerd
- Evaluating Large Language Models In Theory Of Mind Tasks Kosinski Michal
- Understanding Catastrophic Forgetting In Language Models Via Implicit Inference Kotha Suhas, Springer Jacob Mitchell, Raghunathan Aditi
- Large Language Models Only Pass Primary School Exams In Indonesia: A Comprehensive Test On Indommlu Koto Fajri, Aisyah Nurul, Li Haonan, Baldwin Timothy
- Do Large Language Models Pay Similar Attention Like Human Programmers When Generating Code? Kou Bonan, Chen Shengmai, Wang Zhijie, Ma Lei, Zhang Tianyi
- Developing Effective Educational Chatbots With Chatgpt Prompts: Insights From Preliminary Tests In A Case Study On Social Media Literacy (with Appendix) Koyuturk Cansu, Yavari Mona, Theophilou Emily, Bursic Sathya, Donabauer Gregor, Telari Alessia, Testa Alessia, Boiano Raffaele, Gabbiadini Alessandro, Hernandez-leo Davinia, Ruskov Martin, Ognibene Dimitri
- Chatgpt For PLC/DCS Control Logic Generation Koziolek Heiko, Gruener Sten, Ashiwal Virendra
- Leveraging Few-shot Data Augmentation And Waterfall Prompting For Response Generation Krause Lea, Santamaría Selene Báez, Van Der Meer Michiel, Khurana Urja
- Prompt Generate Train (PGT): Few-shot Domain Adaption Of Retrieval Augmented Generation Models For Open Book Question-answering Krishna C. S.
- Chatbots Are Not Reliable Text Annotators Kristensen-mclachlan Ross Deans, Canavan Miceal, Kardos Márton, Jacobsen Mia, Aarøe Lene
- The Moral Authority Of Chatgpt Krügel Sebastian, Ostermaier Andreas, Uhl Matthias
- Fundamentals Of Generative Large Language Models And Perspectives In Cyber-defense Kucharavy Andrei, Schillaci Zachary, Maréchal Loïc, Würsch Maxime, Dolamic Ljiljana, Sabonnadiere Remi, David Dimitri Percia, Mermoud Alain, Lenders Vincent
- Certifying LLM Safety Against Adversarial Prompting Kumar Aounon, Agarwal Chirag, Srinivas Suraj, Li Aaron Jiaxun, Feizi Soheil, Lakkaraju Himabindu
- Watch Your Language: Investigating Content Moderation With Large Language Models Kumar Deepak, Abuhashem Yousef, Durumeric Zakir
- Large Language Models Humanize Technology Kumar Pratyush
- How Reliable Are Ai-generated-text Detectors? An Assessment Framework Using Evasive Soft Prompts Kumarage Tharindu, Sheth Paras, Moraffah Raha, Garland Joshua, Liu Huan
- Sensi-bert: Towards Sensitivity Driven Fine-tuning For Parameter-efficient BERT Kundu Souvik, Sridhar Sharath Nittur, Szankin Maciej, Sundaresan Sairam
- DACBERT: Leveraging Dependency Agreement For Cost-efficient Bert Pretraining Kuo Martin, Zhang Jianyi, Chen Yiran
- Large Language Models On The Chessboard: A Study On Chatgpt's Formal Language Comprehension And Complex Reasoning Skills Kuo Mu-tien, Hsueh Chih-chung, Tsai Richard Tzong-han
- Sparse Fine-tuning For Inference Acceleration Of Large Language Models Kurtic Eldar, Kuznedelev Denis, Frantar Elias, Goin Michael, Alistarh Dan
- Chatgpt: Beginning Of An End Of Manual Linguistic Data Annotation? Use Case Of Automatic Genre Identification Kuzman Taja, Mozetič Igor, Ljubešić Nikola
- In Generative AI We Trust: Can Chatbots Effectively Verify Political Information? Kuznetsova Elizaveta, Makhortykh Mykola, Vziatysheva Victoria, Stolze Martha, Baghumyan Ani, Urman Aleksandra
- Jotr: A Joint Transformer And Reinforcement Learning Framework For Dialog Policy Learning Kwan Wai-chung, Wang Huimin, Wang Hongru, Wang Zezhong, Wu Xian, Zheng Yefeng, Wong Kam-fai
- M4LE: A Multi-ability Multi-range Multi-task Multi-domain Long-context Evaluation Benchmark For Large Language Models Kwan Wai-chung, Zeng Xingshan, Wang Yufei, Sun Yusen, Li Liangyou, Shang Lifeng, Liu Qun, Wong Kam-fai
- Reward Design With Language Models Kwon Minae, Xie Sang Michael, Bullard Kalesha, Sadigh Dorsa
- Chatgpt For Arabic Grammatical Error Correction Kwon Sang Yun, Bhatia Gagan, Nagoudi El Moatez Billah, Abdul-mageed Muhammad
- Beyond English: Evaluating Llms For Arabic Grammatical Error Correction Kwon Sang Yun, Bhatia Gagan, Nagoudi El Moatez Billah, Abdul-mageed Muhammad
- Recallm: An Adaptable Memory Mechanism With Temporal Understanding For Large Language Models Kynoch Brandon, Latapie Hugo, Van Der Sluis Dwane
- Openassistant Conversations -- Democratizing Large Language Model Alignment Köpf Andreas, Kilcher Yannic, Von Rütte Dimitri, Anagnostidis Sotiris, Tam Zhi-rui, Stevens Keith, Barhoum Abdullah, Duc Nguyen Minh, Stanley Oliver, Nagyfi Richárd, Es Shahul, Suri Sameer, Glushkov David, Dantuluri Arnav, Maguire Andrew, Schuhmann Christoph, Nguyen Huu, Mattick Alexander
- A Zero-shot And Few-shot Study Of Instruction-finetuned Large Language Models Applied To Clinical And Biomedical Tasks Labrak Yanis, Rouvier Mickael, Dufour Richard
- Unraveling Chatgpt: A Critical Analysis Of Ai-generated Goal-oriented Dialogues And Annotations Labruna Tiziano, Brenna Sofia, Zaninello Andrea, Magnini Bernardo
- Multidimensional Evaluation For Text Style Transfer Using Chatgpt Lai Huiyuan, Toral Antonio, Nissim Malvina
- Chatgpt Beyond English: Towards A Comprehensive Evaluation Of Large Language Models In Multilingual Learning Lai Viet Dac, Ngo Nghia Trung, Veyseh Amir Pouran Ben, Man Hieu, Dernoncourt Franck, Bui Trung, Nguyen Thien Huu
- Enhancing Transformers Without Self-supervised Learning: A Loss Landscape Perspective In Sequential Recommendation Lai Vivian, Chen Huiyuan, Yeh Chin-chia Michael, Xu Minghua, Cai Yiwei, Yang Hao
- Mini-dalle3: Interactive Text To Image By Prompting Large Language Models Lai Zeqiang, Zhu Xizhou, Dai Jifeng, Qiao Yu, Wang Wenhai
- Exploring Large Language Models To Facilitate Variable Autonomy For Human-robot Teaming Lakhnati Younes, Pascher Max, Gerken Jens
- Can Llms Be Good Financial Advisors?: An Initial Study In Personal Decision Making For Optimized Outcomes Lakkaraju Kausik, Vuruma Sai Krishna Revanth, Pallagani Vishal, Muppasani Bharath, Srivastava Biplav
- End-to-end Speech Recognition Contextualization With Large Language Models Lakomkin Egor, Wu Chunyang, Fathullah Yassir, Kalinli Ozlem, Seltzer Michael L., Fuegen Christian
- Large Language Models Are Partially Primed In Pronoun Interpretation Lam Suet-ying, Zeng Qingcheng, Zhang Kexun, You Chenyu, Voigt Rob
- LLM Cognitive Judgements Differ From Human Lamprinidis Sotiris
- A Data Source For Reasoning Embodied Agents Lanchantin Jack, Sukhbaatar Sainbayar, Synnaeve Gabriel, Sun Yuxuan, Srinet Kavya, Szlam Arthur
- Open Sesame! Universal Black Box Jailbreaking Of Large Language Models Lapid Raz, Langberg Ron, Sipper Moshe
- Cabrita: Closing The Gap For Foreign Languages Larcher Celio, Piau Marcos, Finardi Paulo, Gengo Pedro, Esposito Piero, Caridá Vinicius
- Creating Large Language Model Resistant Exams: Guidelines And Strategies Larsen Simon Kaare
- A Systematic Study And Comprehensive Evaluation Of Chatgpt On Benchmark Datasets Laskar Md Tahmid Rahman, Bari M Saiful, Rahman Mizanur, Bhuiyan Md Amran Hossen, Joty Shafiq, Huang Jimmy Xiangji
- An Experimental Study On Pretraining Transformers From Scratch For IR Lassance Carlos, Déjean Hervé, Clinchant Stéphane
- Fine-tuning Chatgpt For Automatic Scoring Latif Ehsan, Zhai Xiaoming
- Building Efficient Universal Classifiers With Natural Language Inference Laurer Moritz, Van Atteveldt Wouter, Casas Andreu, Welbers Kasper
- VAL: Interactive Task Learning With GPT Dialog Parsing Lawley Lane, Maclellan Christopher J.
- Improved Instruction Ordering In Recipe-grounded Conversation Le Duong Minh, Guo Ruohao, Xu Wei, Ritter Alan
- Voicebox: Text-guided Multilingual Universal Speech Generation At Scale Le Matthew, Vyas Apoorv, Shi Bowen, Karrer Brian, Sari Leda, Moritz Rashel, Williamson Mary, Manohar Vimal, Adi Yossi, Mahadeokar Jay, Hsu Wei-ning
- Challenges Of Gpt-3-based Conversational Agents For Healthcare Lechner Fabian, Lahnala Allison, Welch Charles, Flek Lucie
- Learning From Teaching Assistants To Program With Subgoals: Exploring The Potential For AI Teaching Assistants Lee Changyoon, Myung Junho, Han Jieun, Jin Jiho, Oh Alice
- Read-only Prompt Optimization For Vision-language Few-shot Learning Lee Dongjun, Song Seokwon, Suh Jihee, Choi Joonmyung, Lee Sanghyeok, Kim Hyunwoo J.
- Towards Reliable And Fluent Large Language Models: Incorporating Feedback Learning Loops In QA Systems Lee Dongyub, Whang Taesun, Lee Chanhee, Lim Heuiseok
- Applying Large Language Models And Chain-of-thought For Automatic Scoring Lee Gyeong-geon, Latif Ehsan, Wu Xuansheng, Liu Ninghao, Zhai Xiaoming
- Is Attention Required For ICL? Exploring The Relationship Between Model Architecture And In-context Learning Ability Lee Ivan, Jiang Nan, Berg-kirkpatrick Taylor
- Partially Randomizing Transformer Weights For Dialogue Response Diversity Lee Jing Yang, Lee Kong Aik, Gan Woon-seng
- The Importance Of Prompt Tuning For Automated Neuron Explanations Lee Justin, Oikarinen Tuomas, Chatha Arjun, Chang Keng-chi, Chen Yilan, Weng Tsui-wei
- Building Multimodal AI Chatbots Lee Min Young
- Teaching Arithmetic To Small Transformers Lee Nayoung, Sreenivasan Kartik, Lee Jason D., Lee Kangwook, Papailiopoulos Dimitris
- Explore, Select, Derive, And Recall: Augmenting LLM With Human-like Memory For Mobile Task Automation Lee Sunjae, Choi Junyoung, Lee Jungjae, Wasi Munim Hasan, Choi Hojun, Ko Steven Y., Oh Sangeun, Shin Insik
- Large Language Models Can Share Images, Too! Lee Young-jun, Lee Dokyong, Sung Joo Won, Hyeon Jonghwan, Choi Ho-jin
- Creating A Dataset For High-performance Computing Code Translation Using Llms: A Bridge Between Openmp Fortran And C++ Lei Bin, Ding Caiwen, Chen Le, Lin Pei-hung, Liao Chunhua
- Boosting Logical Reasoning In Large Language Models Through A New Framework: The Graph Of Thought Lei Bin, Lin Pei-hung, Liao Chunhua, Ding Caiwen
- Zero-shot Next-item Recommendation Using Large Pretrained Language Models Lei Wang, Ee-peng Lim
- Mitigating Fine-grained Hallucination By Fine-tuning Large Vision-language Models With Caption Rewrites Lei Wang, Jiabang He, Shenshen Li, Ning Liu, Ee-peng Lim
- Llm4vis: Explainable Visualization Recommendation Using Chatgpt Lei Wang, Songheng Zhang, Yun Wang, Ee-peng Lim, Yong Wang
- Chatgpt: A Meta-analysis After 2.5 Months Leiter Christoph, Zhang Ran, Chen Yanran, Belouadi Jonas, Larionov Daniil, Fresen Vivian, Eger Steffen
- Self-detoxifying Language Models Via Toxification Reversal Leong Chak Tou, Cheng Yi, Wang Jiashuo, Wang Jian, Li Wenjie
- Talk The Walk: Synthetic Data Generation For Conversational Music Recommendation Leszczynski Megan, Zhang Shu, Ganti Ravi, Balog Krisztian, Radlinski Filip, Pereira Fernando, Chaganty Arun Tejasvi
- Guiding LLM To Fool Itself: Automatically Manipulating Machine Reading Comprehension Shortcut Triggers Levy Mosh, Ravfogel Shauli, Goldberg Yoav
- Sequential Monte Carlo Steering Of Large Language Models Using Probabilistic Programs Lew Alexander K., Zhi-xuan Tan, Grand Gabriel, Mansinghka Vikash K.
- Seed-bench-2: Benchmarking Multimodal Large Language Models Li Bohao, Ge Yuying, Ge Yixiao, Wang Guangzhi, Wang Rui, Zhang Ruimao, Shan Ying
- Task Contamination: Language Models May Not Be Few-shot Anymore Li Changmao, Flanigan Jeffrey
- Teach Llms To Personalize -- An Approach Inspired By Writing Education Li Cheng, Zhang Mingyang, Mei Qiaozhu, Wang Yaqing, Hombaiah Spurthi Amba, Liang Yi, Bendersky Michael
- Mixed Distillation Helps Smaller Language Model Better Reasoning Li Chenglin, Chen Qianglong, Li Liangyue, Wang Caiyu, Li Yicheng, Chen Zulong, Zhang Yin
- Dialogue For Prompting: A Policy-gradient-based Discrete Prompt Generation For Few-shot Learning Li Chengzhengxu, Liu Xiaoming, Wang Yichen, Li Duyi, Lan Yu, Shen Chao
- Modelscope-agent: Building Your Customizable Agent System With Open-source Large Language Models Li Chenliang, Chen Hehong, Yan Ming, Shen Weizhou, Xu Haiyang, Wu Zhikai, Zhang Zhicheng, Zhou Wenmeng, Chen Yingda, Cheng Chen, Shi Hongzhu, Zhang Ji, Huang Fei, Zhou Jingren
- Llava-med: Training A Large Language-and-vision Assistant For Biomedicine In One Day Li Chunyuan, Wong Cliff, Zhang Sheng, Usuyama Naoto, Liu Haotian, Yang Jianwei, Naumann Tristan, Poon Hoifung, Gao Jianfeng
- DISTFLASHATTN: Distributed Memory-efficient Attention For Long-context Llms Training Li Dacheng, Shao Rulin, Xie Anze, Xing Eric P., Ma Xuezhe, Stoica Ion, Gonzalez Joseph E., Zhang Hao
- Explaincpe: A Free-text Explanation Benchmark Of Chinese Pharmacist Examination Li Dongfang, Yu Jindi, Hu Baotian, Xu Zhenran, Zhang Min
- Revisiting Large Language Models As Zero-shot Relation Extractors Li Guozheng, Wang Peng, Ke Wenjun
- Repetition In Repetition Out: Towards Understanding Neural Text Degeneration From The Data Perspective Li Huayang, Lan Tian, Fu Zihao, Cai Deng, Liu Lemao, Collier Nigel, Watanabe Taro, Su Yixuan
- Textbind: Multi-turn Interleaved Multimodal Instruction-following In The Wild Li Huayang, Li Siheng, Cai Deng, Wang Longyue, Liu Lemao, Watanabe Taro, Yang Yujiu, Shi Shuming
- Do Vision And Language Models Share Concepts? A Vector Space Alignment Study Li Jiaang, Kementchedjhieva Yova, Fierro Constanza, Søgaard Anders
- Text Is All You Need: Learning Language Representations For Sequential Recommendation Li Jiacheng, Wang Ming, Li Jin, Fu Jinmiao, Shen Xin, Shang Jingbo, Mcauley Julian
- Eliciting The Translation Ability Of Large Language Models Via Multilingual Finetuning With Translation Instructions Li Jiahuan, Zhou Hao, Huang Shujian, Cheng Shanbo, Chen Jiajun
- Adaptive Gating In Mixture-of-experts Based Language Models Li Jiamin, Su Qiang, Yang Yitao, Jiang Yimin, Wang Cong, Xu Hong
- CFGPT: Chinese Financial Assistant With Large Language Model Li Jiangtong, Bian Yuxuan, Wang Guoxuan, Lei Yang, Cheng Dawei, Ding Zhijun, Jiang Changjun
- Counterfactual Reasoning: Testing Language Models' Understanding Of Hypothetical Scenarios Li Jiaxuan, Yu Lang, Ettinger Allyson
- Distilling Chatgpt For Explainable Automated Student Answer Assessment Li Jiazheng, Gui Lin, Zhou Yuxiang, West David, Aloisi Cesare, He Yulan
- Overprompt: Enhancing Chatgpt Through Efficient In-context Learning Li Jiazheng, Zhao Runcong, Yang Yongxin, He Yulan, Gui Lin
- Contradoc: Understanding Self-contradictions In Documents With Large Language Models Li Jierui, Raheja Vipul, Kumar Dhruv
- Graphix-t5: Mixing Pre-trained Transformers With Graph-aware Layers For Text-to-sql Parsing Li Jinyang, Hui Binyuan, Cheng Reynold, Qin Bowen, Ma Chenhao, Huo Nan, Huang Fei, Du Wenyu, Si Luo, Li Yongbin
- Prefix Propagation: Parameter-efficient Tuning For Long Sequences Li Jonathan, Aitken Will, Bhambhoria Rohan, Zhu Xiaodan
- MAQA: A Multimodal QA Benchmark For Negation Li Judith Yue, Jansen Aren, Huang Qingqing, Lee Joonseok, Ganti Ravi, Kuzmin Dima
- Openba: An Open-sourced 15B Bilingual Asymmetric Seq2seq Model Pre-trained From Scratch Li Juntao, Tang Zecheng, Ding Yuyang, Wang Pinzheng, Guo Pei, You Wangjie, Qiao Dan, Chen Wenliang, Fu Guohong, Zhu Qiaoming, Zhou Guodong, Zhang Min
- Instructcoder: Instruction Tuning Large Language Models For Code Editing Li Kaixin, Hu Qisheng, Zhao Xu, Chen Hui, Xie Yuxi, Liu Tiedong, Xie Qizhe, He Junxian
- Inference-time Intervention: Eliciting Truthful Answers From A Language Model Li Kenneth, Patel Oam, Viégas Fernanda, Pfister Hanspeter, Wattenberg Martin
- Watermarking Llms With Weight Quantization Li Linyang, Jiang Botian, Wang Pengyu, Ren Ke, Yan Hang, Qiu Xipeng
- Summarizing Multiple Documents With Conversational Structure For Meta-review Generation Li Miao, Hovy Eduard, Lau Jey Han
- Api-bank: A Comprehensive Benchmark For Tool-augmented Llms Li Minghao, Zhao Yingxiu, Yu Bowen, Song Feifan, Li Hangyu, Yu Haiyang, Li Zhoujun, Huang Fei, Li Yongbin
- Skillgpt: A Restful API Service For Skill Extraction And Standardization Using A Large Language Model Li Nan, Kang Bo, De Bie Tijl
- Table-gpt: Table-tuned GPT For Diverse Table Tasks Li Peng, He Yeye, Yashar Dror, Cui Weiwei, Ge Song, Zhang Haidong, Fainman Danielle Rifinski, Zhang Dongmei, Chaudhuri Surajit
- Starcoder: May The Source Be With You! Li Raymond, Allal Loubna Ben, Zi Yangtian, Muennighoff Niklas, Kocetkov Denis, Mou Chenghao, Marone Marc, Akiki Christopher, Li Jia, Chim Jenny, Liu Qian, Zheltonozhskii Evgenii, Zhuo Terry Yue, Wang Thomas, Dehaene Olivier, Davaadorj Mishig, Lamy-poirier Joel, Monteiro João, Shliazhko Oleh, Gontier Nicolas, Meade Nicholas, Zebaze Armel, Yee Ming-ho, Umapathi Logesh Kumar, Zhu Jian, Lipkin Benjamin, Oblokulov Muhtasham, Wang Zhiruo, Murthy Rudra, Stillerman Jason, Patel Siva Sankalp, Abulkhanov Dmitry, Zocca Marco, Dey Manan, Zhang Zhihan, Fahmy Nour, Bhattacharyya Urvashi, Yu Wenhao, Singh Swayam, Luccioni Sasha, Villegas Paulo, Kunakov Maxim, Zhdanov Fedor, Romero Manuel, Lee Tony, Timor Nadav, Ding Jennifer, Schlesinger Claire, Schoelkopf Hailey, Ebert Jan, Dao Tri, Mishra Mayank, Gu Alex, Robinson Jennifer, Anderson Carolyn Jane, Dolan-gavitt Brendan, Contractor Danish, Reddy Siva, Fried Daniel, Bahdanau Dzmitry, Jernite Yacine, Ferrandis Carlos Muñoz, Hughes Sean, Wolf Thomas, Guha Arjun, Von Werra Leandro, De Vries Harm
- Vlm-eval: A General Evaluation On Video Large Language Models Li Shuailin, Zhang Yuang, Zhao Yucheng, Wang Qiuyue, Jia Fan, Liu Yingfei, Wang Tiancai
- Benchmarking And Improving Generator-validator Consistency Of Language Models Li Xiang Lisa, Shrivastava Vaishnavi, Li Siyan, Hashimoto Tatsunori, Liang Percy
- Context Matters: Data-efficient Augmentation Of Large Language Models For Scientific Applications Li Xiang, Tang Haoran, Chen Siyu, Wang Ziwei, Maravi Anurag, Abram Marcin
- FLM-101B: An Open LLM And How To Train It With $100K Budget Li Xiang, Yao Yiqun, Jiang Xin, Fang Xuezhi, Meng Xuying, Fan Siqi, Han Peng, Li Jing, Du Li, Qin Bowen, Zhang Zheng, Sun Aixin, Wang Yequan
- Image Content Generation With Causal Reasoning Li Xiaochuan, Fan Baoyu, Zhang Runze, Jin Liang, Wang Di, Guo Zhenhua, Zhao Yaqian, Li Rengang
- Mot: Memory-of-thought Enables Chatgpt To Self-improve Li Xiaonan, Qiu Xipeng
- Unigen: A Unified Generative Framework For Retrieval And Question Answering With Large Language Models Li Xiaoxi, Zhou Yujia, Dou Zhicheng
- Think Outside The Code: Brainstorming Boosts Large Language Models In Code Generation Li Xin-ye, Xue Jiang-tian, Xie Zheng, Li Ming
- Unlock The Power: Competitive Distillation For Multi-modal Large Language Models Li Xinwei, Lin Li, Wang Shuai, Qian Chen
- Exploring Fine-tuning Chatgpt For News Recommendation Li Xinyi, Zhang Yongfeng, Malthouse Edward C.
- PBNR: Prompt-based News Recommender System Li Xinyi, Zhang Yongfeng, Malthouse Edward C.
- Stablellava: Enhanced Visual Instruction Tuning With Synthesized Image-dialogue Data Li Yanda, Zhang Chi, Yu Gang, Wang Zhibin, Fu Bin, Lin Guosheng, Shen Chunhua, Chen Ling, Wei Yunchao
- Dissecting Chain-of-thought: Compositionality Through In-context Filtering And Learning Li Yingcong, Sreenivasan Kartik, Giannou Angeliki, Papailiopoulos Dimitris, Oymak Samet
- Losparse: Structured Compression Of Large Language Models Based On Low-rank And Sparse Approximation Li Yixiao, Yu Yifan, Zhang Qingru, Liang Chen, He Pengcheng, Chen Weizhu, Zhao Tuo
- Curriculum-driven Edubot: A Framework For Developing Language Learning Chatbots Through Synthesizing Conversational Data Li Yu, Qu Shang, Shen Jili, Min Shangchao, Yu Zhou
- Posgen: Personalized Opening Sentence Generation For Online Insurance Sales Li Yu, Zhang Yi, Wu Weijia, Zhou Zimu, Li Qiang
- Textbooks Are All You Need II: Phi-1.5 Technical Report Li Yuanzhi, Bubeck Sébastien, Eldan Ronen, Del Giorno Allie, Gunasekar Suriya, Lee Yin Tat
- Compressing Context To Enhance Inference Efficiency Of Large Language Models Li Yucheng, Dong Bo, Lin Chenghua, Guerin Frank
- White-box Multi-objective Adversarial Attack On Dialogue Generation Li Yufei, Li Zexin, Gao Yingfan, Liu Cong
- RAIN: Your Language Models Can Align Themselves Without Finetuning Li Yuhui, Wei Fangyun, Zhao Jinjing, Zhang Chao, Zhang Hongyang
- E-sparse: Boosting The Large Language Model Inference Through Entropy-based N:M Sparsity Li Yun, Niu Lin, Zhang Xipeng, Liu Kai, Zhu Jianchen, Kang Zhanhui
- Towards Vision Enhancing Llms: Empowering Multimodal Knowledge Storage And Sharing In Llms Li Yunxin, Hu Baotian, Wang Wei, Cao Xiaochun, Zhang Min
- Advancing Precise Outline-conditioned Text Generation With Task Duality And Explicit Outline Control Li Yunzhe, Chen Qian, Yan Weixiang, Wang Wen, Zhang Qinglin, Sundaram Hari
- Guiding Large Language Models Via Directional Stimulus Prompting Li Zekun, Peng Baolin, He Pengcheng, Galley Michel, Gao Jianfeng, Yan Xifeng
- Unveiling The Pitfalls Of Knowledge Editing For Large Language Models Li Zhoubo, Zhang Ningyu, Yao Yunzhi, Wang Mengru, Chen Xi, Chen Huajun
- Split And Merge: Aligning Position Biases In Large Language Model Based Evaluators Li Zongjie, Wang Chaozheng, Ma Pingchuan, Wu Daoyuan, Wang Shuai, Gao Cuiyun, Liu Yang
- Label Supervised Llama Finetuning Li Zongxi, Li Xianming, Liu Yuzhang, Xie Haoran, Li Jing, Wang Fu-lee, Li Qing, Zhong Xiaoqin
- Batgpt: A Bidirectional Autoregessive Talker From Generative Pre-trained Transformer Li Zuchao, Zhang Shitou, Zhao Hai, Yang Yifei, Yang Dongjie
- Homodistil: Homotopic Task-agnostic Distillation Of Pre-trained Transformers Liang Chen, Jiang Haoming, Li Zheng, Tang Xianfeng, Yin Bin, Zhao Tuo
- Towards End-to-end Embodied Decision Making Via Multi-modal Large Language Model: Explorations With Gpt4-vision And Beyond Liang Chen, Yichi Zhang, Shuhuai Ren, Haozhe Zhao, Zefan Cai, Yuchi Wang, Peiyi Wang, Tianyu Liu, Baobao Chang
- Exploring Format Consistency For Instruction Tuning Liang Shihao, Tian Runchu, Zhu Kunlun, Qin Yujia, Wang Huadong, Cong Xin, Liu Zhiyuan, Liu Xiaojiang, Sun Maosong
- Encouraging Divergent Thinking In Large Language Models Through Multi-agent Debate Liang Tian, He Zhiwei, Jiao Wenxiang, Wang Xing, Wang Rui, Yang Yujiu, Tu Zhaopeng, Shi Shuming
- GPT Detectors Are Biased Against Non-native English Writers Liang Weixin, Yuksekgonul Mert, Mao Yining, Wu Eric, Zou James
- Uhgeval: Benchmarking The Hallucination Of Chinese Large Language Models Via Unconstrained Generation Liang Xun, Song Shichao, Niu Simin, Li Zhiyu, Xiong Feiyu, Tang Bo, Wang Yezhaohui, He Dawei, Cheng Peng, Wang Zhonghao, Deng Haiying
- Let GPT Be A Math Tutor: Teaching Math Word Problem Solvers With Customized Exercise Generation Liang Zhenwen, Yu Wenhao, Rajpurohit Tanmay, Clark Peter, Zhang Xiangliang, Kaylan Ashwin
- Make Pre-trained Model Reversible: From Parameter To Memory Efficient Fine-tuning Liao Baohao, Tan Shaomu, Monz Christof
- GPT-4 Enhanced Multimodal Grounding For Autonomous Driving: Leveraging Cross-modal Attention With Large Language Models Liao Haicheng, Shen Huanming, Li Zhenning, Wang Chengyue, Li Guofa, Bie Yiming, Xu Chengzhong
- Modeling Complex Mathematical Reasoning Via Large Language Model Based Mathagent Liao Haoran, Du Qinyi, Hu Shaohua, He Hao, Xu Yanyan, Tian Jidong, Jin Yaohui
- Opening Up Chatgpt: Tracking Openness, Transparency, And Accountability In Instruction-tuned Text Generators Liesenfeld Andreas, Lopez Alianda, Dingemanse Mark
- A Survey On Large Language Models For Recommendation Likang Wu, Zhi Zheng, Zhaopeng Qiu, Hao Wang, Hongchao Gu, Tingjia Shen, Chuan Qin, Chen Zhu, Hengshu Zhu, Qi Liu, Hui Xiong, Enhong Chen
- Video-llava: Learning United Visual Representation By Alignment Before Projection Lin Bin, Ye Yang, Zhu Bin, Cui Jiaxi, Ning Munan, Jin Peng, Yuan Li
- Paralinguistics-enhanced Large Language Modeling Of Spoken Dialogue Lin Guan-ting, Shivakumar Prashanth Gurunath, Gandhe Ankur, Yang Chao-han Huck, Gu Yile, Ghosh Shalini, Stolcke Andreas, Lee Hung-yi, Bulyko Ivan
- Sparks Of Artificial General Recommender (AGR): Early Experiments With Chatgpt Lin Guo, Zhang Yongfeng
- Videodirectorgpt: Consistent Multi-scene Video Generation Via Llm-guided Planning Lin Han, Zala Abhay, Cho Jaemin, Bansal Mohit
- Agentsims: An Open-source Sandbox For Large Language Model Evaluation Lin Jiaju, Zhao Haoran, Zhang Aochi, Wu Yiting, Ping Huqiuyue, Chen Qin
- Rella: Retrieval-enhanced Large Language Models For Lifelong Sequential Behavior Comprehension In Recommendation Lin Jianghao, Shan Rong, Zhu Chenxu, Du Kounianhua, Chen Bo, Quan Shigang, Tang Ruiming, Yu Yong, Zhang Weinan
- Batchprompt: Accomplish More With Less Lin Jianzhe, Diesendruck Maurice, Du Liang, Abraham Robin
- Argue With Me Tersely: Towards Sentence-level Counter-argument Generation Lin Jiayu, Ye Rong, Han Meng, Zhang Qi, Lai Ruofei, Zhang Xinyu, Cao Zhao, Huang Xuanjing, Wei Zhongyu
- Advances In Embodied Navigation Using Large Language Models: A Survey Lin Jinzhou, Gao Han, Feng Xuxiang, Xu Rongtao, Wang Changwei, Zhang Man, Guo Li, Xu Shibiao
- Mitigating The Alignment Tax Of RLHF Lin Yong, Lin Hangyu, Xiong Wei, Diao Shizhe, Liu Jianmeng, Zhang Jipeng, Pan Rui, Wang Haoxiang, Hu Wenbin, Zhang Hanning, Dong Hanze, Pi Renjie, Zhao Han, Jiang Nan, Ji Heng, Yao Yuan, Zhang Tong
- Pushing Large Language Models To The 6G Edge: Vision, Challenges, And Opportunities Lin Zheng, Qu Guanqiao, Chen Qiyuan, Chen Xianhao, Chen Zhe, Huang Kaibin
- SPHINX: The Joint Mixing Of Weights, Tasks, And Visual Embeddings For Multi-modal Large Language Models Lin Ziyi, Liu Chris, Zhang Renrui, Gao Peng, Qiu Longtian, Xiao Han, Qiu Han, Lin Chen, Shao Wenqi, Chen Keqin, Han Jiaming, Huang Siyuan, Zhang Yichi, He Xuming, Li Hongsheng, Qiao Yu
- Comparing Generative Chatbots Based On Process Requirements Lins Luis Fernando, Nascimento Nathalia, Alencar Paulo, Oliveira Toacy, Cowan Donald
- Establishing Trustworthiness: Rethinking Tasks And Model Evaluation Litschko Robert, Müller-eberstein Max, Van Der Goot Rob, Weber Leon, Plank Barbara
- External Reasoning: Towards Multi-large-language-models Interchangeable Assistance With Human Feedback Liu Akide
- Exposing Attention Glitches With Flip-flop Language Modeling Liu Bingbin, Ash Jordan T., Goel Surbhi, Krishnamurthy Akshay, Zhang Cyril
- Code Execution With Pre-trained Language Models Liu Chenxiao, Lu Shuai, Chen Weizhu, Jiang Daxin, Svyatkovskiy Alexey, Fu Shengyu, Sundaresan Neel, Duan Nan
- Recprompt: A Prompt Tuning Framework For News Recommendation Using Large Language Models Liu Dairui, Yang Boming, Du Honghui, Greene Derek, Lawlor Aonghus, Dong Ruihai, Li Irene
- MMC: Advancing Multimodal Chart Understanding With Large-scale Instruction Tuning Liu Fuxiao, Wang Xiaoyang, Yao Wenlin, Chen Jianshu, Song Kaiqiang, Cho Sangwoo, Yacoob Yaser, Yu Dong
- Logicot: Logical Chain-of-thought Instruction-tuning Liu Hanmeng, Teng Zhiyang, Cui Leyang, Zhang Chaoli, Zhou Qiji, Zhang Yue
- Glore: Evaluating Logical Reasoning Of Large Language Models Liu Hanmeng, Teng Zhiyang, Ning Ruoxi, Liu Jian, Zhou Qiji, Zhang Yue
- Exploration With Principles For Diverse AI Supervision Liu Hao, Zaharia Matei, Abbeel Pieter
- Ring Attention With Blockwise Transformers For Near-infinite Context Liu Hao, Zaharia Matei, Abbeel Pieter
- Alltogether: Investigating The Efficacy Of Spliced Prompt For Web Navigation Using Large Language Models Liu Jiarun, Hu Wentao, Zhang Chunhong
- Chatcounselor: A Large Language Models For Mental Health Support Liu June M., Li Donghao, Cao He, Ren Tianhe, Liao Zeyi, Wu Jiamin
- Concise And Organized Perception Facilitates Reasoning In Large Language Models Liu Junjie, Yan Shaotian, Shen Chen, Xie Liang, Wang Wenxiao, Ye Jieping
- Qilin-med-vl: Towards Chinese Large Vision-language Model For General Healthcare Liu Junling, Wang Ziming, Ye Qichen, Chong Dading, Zhou Peilin, Hua Yining
- Conversational Question Answering With Reformulations Over Knowledge Graph Liu Lihui, Hill Blaine, Du Boxin, Wang Fei, Tong Hanghang
- Contrastive Vision-language Alignment Makes Efficient Instruction Learner Liu Lizhao, Sun Xinyu, Xiang Tianhang, Zhuang Zhuangwei, Yin Liuren, Tan Mingkui
- An Evaluation Of GPT-4V And Gemini In Online VQA Liu Mengchen, Chen Chongyan, Gurari Danna
- Chipnemo: Domain-adapted Llms For Chip Design Liu Mingjie, Ene Teodor-dumitru, Kirby Robert, Cheng Chris, Pinckney Nathaniel, Liang Rongjian, Alben Jonah, Anand Himyanshu, Banerjee Sanmitra, Bayraktaroglu Ismet, Bhaskaran Bonita, Catanzaro Bryan, Chaudhuri Arjun, Clay Sharon, Dally Bill, Dang Laura, Deshpande Parikshit, Dhodhi Siddhanth, Halepete Sameer, Hill Eric, Hu Jiashang, Jain Sumit, Jindal Ankit, Khailany Brucek, Kokai George, Kunal Kishor, Li Xiaowei, Lind Charley, Liu Hao, Oberman Stuart, Omar Sujeet, Pasandi Ghasem, Pratty Sreedhar, Raiman Jonathan, Sarkar Ambar, Shao Zhengjiang, Sun Hanfei, Suthar Pratik P, Tej Varun, Turner Walker, Xu Kaizhe, Ren Haoxing
- X-eval: Generalizable Multi-aspect Text Evaluation Via Augmented Instruction Tuning With Auxiliary Evaluation Aspects Liu Minqian, Shen Ying, Xu Zhiyang, Cao Yixin, Cho Eunah, Kumar Vaibhav, Ghanadan Reza, Huang Lifu
- Nlebench+norglm: A Comprehensive Empirical Analysis And Benchmark Dataset For Generative Language Models In Norwegian Liu Peng, Zhang Lemei, Farup Terje Nissen, Lauvrak Even W., Ingvaldsen Jon Espen, Eide Simen, Gulla Jon Atle, Yang Zhirong
- From Zero To Hero: Examining The Power Of Symbolic Tasks In Instruction Tuning Liu Qian, Zhou Fan, Jiang Zhengbao, Dou Longxu, Lin Min
- Exploring The Boundaries Of GPT-4 In Radiology Liu Qianchu, Hyland Stephanie, Bannur Shruthi, Bouzid Kenza, Castro Daniel C., Wetscherek Maria Teodora, Tinn Robert, Sharma Harshita, Pérez-garcía Fernando, Schwaighofer Anton, Rajpurkar Pranav, Khanna Sameer Tajdin, Poon Hoifung, Usuyama Naoto, Thieme Anja, Nori Aditya V., Lungren Matthew P., Oktay Ozan, Alvarez-valle Javier
- Chatgpt-powered Conversational Drug Editing Using Retrieval And Domain Feedback Liu Shengchao, Wang Jiongxiao, Yang Yijin, Wang Chengpeng, Liu Ling, Guo Hongyu, Xiao Chaowei
- Language Models As Black-box Optimizers For Vision-language Models Liu Shihong, Lin Zhiqiu, Yu Samuel, Lee Ryan, Ling Tiffany, Pathak Deepak, Ramanan Deva
- Goat: Fine-tuned Llama Outperforms GPT-4 On Arithmetic Tasks Liu Tiedong, Low Bryan Kian Hsiang
- Class Incremental Learning With Pre-trained Vision-language Models Liu Xialei, Cao Xusheng, Lu Haori, Xiao Jia-wen, Bagdanov Andrew D., Cheng Ming-ming
- Beyond Text: Unveiling Multimodal Proficiency Of Large Language Models With Multiapi Benchmark Liu Xiao, Lin Jianfeng, Zhang Jiawei
- Fingpt: Democratizing Internet-scale Data For Financial Large Language Models Liu Xiao-yang, Wang Guoxuan, Yang Hongyang, Zha Daochen
- Scaling Laws Of Rope-based Extrapolation Liu Xiaoran, Yan Hang, Zhang Shuo, An Chenxin, Qiu Xipeng, Lin Dahua
- Litcab: Lightweight Language Model Calibration Over Short- And Long-form Responses Liu Xin, Khalifa Muhammad, Wang Lu
- Query-utterance Attention With Joint Modeling For Query-focused Meeting Summarization Liu Xingxian, Duan Bin, Xiao Bo, Xu Yajing
- Gumsum: Multi-genre Data And Evaluation For English Abstractive Summarization Liu Yang Janet, Zeldes Amir
- G-eval: NLG Evaluation Using GPT-4 With Better Human Alignment Liu Yang, Iter Dan, Xu Yichong, Wang Shuohang, Xu Ruochen, Zhu Chenguang
- Trustworthy Llms: A Survey And Guideline For Evaluating Large Language Models' Alignment Liu Yang, Yao Yuanshun, Ton Jean-francois, Zhang Xiaoying, Guo Ruocheng, Cheng Hao, Klochkov Yegor, Taufiq Muhammad Faaiz, Li Hang
- Modeling Uncertainty And Using Post-fusion As Fallback Improves Retrieval Augmented Generation With Llms Liu Ye, Yavuz Semih, Meng Rui, Moorthy Meghana, Joty Shafiq, Xiong Caiming, Zhou Yingbo
- Jailbreaking Chatgpt Via Prompt Engineering: An Empirical Study Liu Yi, Deng Gelei, Xu Zhengzi, Li Yuekang, Zheng Yaowen, Zhang Ying, Zhao Lida, Zhang Tianwei, Wang Kailong, Liu Yang
- Summary Of Chatgpt-related Research And Perspective Towards The Future Of Large Language Models Liu Yiheng, Han Tianle, Ma Siyuan, Zhang Jiayue, Yang Yuanyuan, Tian Jiaming, He Hao, Li Antong, He Mengshen, Liu Zhengliang, Wu Zihao, Zhao Lin, Zhu Dajiang, Li Xiang, Qiang Ning, Shen Dingang, Liu Tianming, Ge Bao
- Instruction Position Matters In Sequence Generation With Large Language Models Liu Yijin, Zeng Xianfeng, Meng Fandong, Zhou Jie
- Llms As Narcissistic Evaluators: When Ego Inflates Evaluation Scores Liu Yiqi, Moosavi Nafise Sadat, Lin Chenghua
- Evaluate What You Can't Evaluate: Unassessable Quality For Generated Response Liu Yongkang, Feng Shi, Wang Daling, Zhang Yifei, Schütze Hinrich
- Robustness Over Time: Understanding Adversarial Examples' Effectiveness On Longitudinal Versions Of Large Language Models Liu Yugeng, Cong Tianshuo, Zhao Zhengyu, Backes Michael, Shen Yun, Zhang Yang
- Binary And Ternary Natural Language Generation Liu Zechun, Oguz Barlas, Pappu Aasish, Shi Yangyang, Krishnamoorthi Raghuraman
- Molxpt: Wrapping Molecules With Text For Generative Pre-training Liu Zequn, Zhang Wei, Xia Yingce, Wu Lijun, Xie Shufang, Qin Tao, Zhang Ming, Liu Tie-yan
- Radiology-gpt: A Large Language Model For Radiology Liu Zhengliang, Zhong Aoxiao, Li Yiwei, Yang Longtao, Ju Chao, Wu Zihao, Ma Chong, Shu Peng, Chen Cheng, Kim Sekeun, Dai Haixing, Zhao Lin, Sun Lichao, Zhu Dajiang, Liu Jun, Liu Wei, Shen Dinggang, Li Xiang, Li Quanzheng, Liu Tianming
- Picking The Underused Heads: A Network Pruning Perspective Of Attention Head Selection For Fusing Dialogue Coreference Information Liu Zhengyuan, Chen Nancy F.
- Deja Vu: Contextual Sparsity For Efficient Llms At Inference Time Liu Zichang, Wang Jue, Dao Tri, Zhou Tianyi, Yuan Binhang, Song Zhao, Shrivastava Anshumali, Zhang Ce, Tian Yuandong, Re Christopher, Chen Beidi
- Self-contradictory Reasoning Evaluation And Detection Liu Ziyi, Lee Isabelle, Du Yongkang, Sanyal Soumya, Zhao Jieyu
- Exploring Effectiveness Of GPT-3 In Grammatical Error Correction: A Study On Performance And Controllability In Prompt-based Methods Loem Mengsay, Kaneko Masahiro, Takase Sho, Okazaki Naoaki
- Harnessing Large Language Models' Empathetic Response Generation Capabilities For Online Mental Health Counselling Support Loh Siyuan Brandon, Raamkumar Aravind Sesagiri
- Discuss Before Moving: Visual Language Navigation Via Multi-expert Discussions Long Yuxing, Li Xiaoqi, Cai Wenzhe, Dong Hao
- Data-to-text Generation For Severely Under-resourced Languages With GPT-3.5: A Bit Of Help Needed From Google Translate Lorandi Michela, Belz Anya
- Probing Pretrained Language Models With Hierarchy Properties Lovón-melgarejo Jesús, Moreno Jose G., Besançon Romaric, Ferret Olivier, Tamine Lynda
- Chatgpt V Bard V Bing V Claude 2 V Aria V Human-expert. How Good Are AI Chatbots At Scientific Writing? Lozić Edisa, Štular Benjamin
- Bounding The Capabilities Of Large Language Models In Open Text Generation With Prompt Constraints Lu Albert, Zhang Hongxin, Zhang Yanzhe, Wang Xuezhi, Yang Diyi
- Chain-of-dictionary Prompting Elicits Translation In Large Language Models Lu Hongyuan, Yang Haoran, Huang Haoyang, Zhang Dongdong, Lam Wai, Wei Furu
- Unified-io 2: Scaling Autoregressive Multimodal Models With Vision, Language, Audio, And Action Lu Jiasen, Clark Christopher, Lee Sangho, Zhang Zichen, Khosla Savya, Marten Ryan, Hoiem Derek, Kembhavi Aniruddha
- Ziya-visual: Bilingual Large Vision-language Model Via Multi-task Instruction Tuning Lu Junyu, Zhang Dixiang, Wu Xiaojun, Gao Xinyu, Gan Ruyi, Zhang Jiaxing, Song Yan, Zhang Pingjian
- Lyrics: Boosting Fine-grained Language-vision Alignment And Comprehension Via Semantic-aware Visual Objects Lu Junyu, Zhang Dixiang, Zhang Songxin, Xie Zejian, Song Zhuoyang, Lin Cong, Zhang Jiaxing, Jing Bingyi, Zhang Pingjian
- Large Language Models Can Be Guided To Evade Ai-generated Text Detection Lu Ning, Liu Shengcai, He Rui, Wang Qi, Ong Yew-soon, Tang Ke
- Mathvista: Evaluating Mathematical Reasoning Of Foundation Models In Visual Contexts Lu Pan, Bansal Hritik, Xia Tony, Liu Jiacheng, Li Chunyuan, Hajishirzi Hannaneh, Cheng Hao, Chang Kai-wei, Galley Michel, Gao Jianfeng
- Error Analysis Prompting Enables Human-like Translation Evaluation In Large Language Models Lu Qingyu, Qiu Baopu, Ding Liang, Zhang Kanjian, Kocmi Tom, Tao Dacheng
- Inference-time Policy Adapters (IPA): Tailoring Extreme-scale Lms Without Fine-tuning Lu Ximing, Brahman Faeze, West Peter, Jang Jaehun, Chandu Khyathi, Ravichander Abhilasha, Qin Lianhui, Ammanabrolu Prithviraj, Jiang Liwei, Ramnath Sahana, Dziri Nouha, Fisher Jillian, Lin Bill Yuchen, Hallinan Skyler, Ren Xiang, Welleck Sean, Choi Yejin
- SCITAB: A Challenging Benchmark For Compositional Reasoning And Claim Verification On Scientific Tables Lu Xinyuan, Pan Liangming, Liu Qian, Nakov Preslav, Kan Min-yen
- RTLLM: An Open-source Benchmark For Design RTL Generation With Large Language Model Lu Yao, Liu Shang, Zhang Qijun, Xie Zhiyao
- UI Layout Generation With Llms Guided By UI Grammar Lu Yuwen, Tong Ziang, Zhao Qinyi, Zhang Chengzhi, Li Toby Jia-jun
- Effectively Fine-tune To Improve Large Multimodal Models For Radiology Report Generation Lu Yuzhe, Hong Sungmin, Shah Yash, Xu Panpan
- Fighting Fire With Fire: The Dual Role Of Llms In Crafting And Detecting Elusive Disinformation Lucas Jason, Uchendu Adaku, Yamashita Michiharu, Lee Jooyoung, Rohatgi Shaurya, Lee Dongwon
- Explanation-based Finetuning Makes Models More Robust To Spurious Cues Ludan Josh Magnus, Meng Yixuan, Nguyen Tai, Shah Saurabh, Lyu Qing, Apidianaki Marianna, Callison-burch Chris
- Chatgpt And A New Academic Reality: Artificial Intelligence-written Research Papers And The Ethics Of The Large Language Models In Scholarly Publishing Lund Brady, Wang Ting, Mannuru Nishith Reddy, Nie Bing, Shimray Somipam, Wang Ziang
- Enhancing Black-box Few-shot Text Classification With Prompt-based Data Augmentation Luo Danqing, Zhang Chen, Xu Jiahui, Wang Bin, Chen Yiming, Zhang Yan, Li Haizhou
- Wizardmath: Empowering Mathematical Reasoning For Large Language Models Via Reinforced Evol-instruct Luo Haipeng, Sun Qingfeng, Xu Can, Zhao Pu, Lou Jianguang, Tao Chongyang, Geng Xiubo, Lin Qingwei, Chen Shifeng, Zhang Dongmei
- Prosg: Using Prompt Synthetic Gradients To Alleviate Prompt Forgetting Of Rnn-like Language Models Luo Haotian, Wu Kunming, Dai Cheng, Ding Sixian, Chen Xinhao
- Exploring Small Language Models With Prompt-learning Paradigm For Efficient Domain-specific Text Classification Luo Hengyu, Liu Peng, Esping Stefan
- Zero-resource Hallucination Prevention For Large Language Models Luo Junyu, Xiao Cao, Ma Fenglong
- Systematic Assessment Of Factual Knowledge In Large Language Models Luo Linhao, Vu Thuy-trang, Phung Dinh, Haffari Gholamreza
- Hrot: Hybrid Prompt Strategy And Retrieval Of Thought For Table-text Hybrid Question Answering Luo Tongxu, Lei Fangyu, Lei Jiahe, Liu Weihao, He Shihu, Zhao Jun, Liu Kang
- Biomedgpt: Open Multimodal Generative Pre-trained Transformer For Biomedicine Luo Yizhen, Zhang Jiahuan, Fan Siqi, Yang Kai, Wu Yushuai, Qiao Mu, Nie Zaiqing
- Chatgpt As A Factual Inconsistency Evaluator For Text Summarization Luo Zheheng, Xie Qianqian, Ananiadou Sophia
- Fingpt: Large Generative Models For A Small Language Luukkonen Risto, Komulainen Ville, Luoma Jouni, Eskelinen Anni, Kanerva Jenna, Kupari Hanna-mari, Ginter Filip, Laippala Veronika, Muennighoff Niklas, Piktus Aleksandra, Wang Thomas, Tazi Nouamane, Scao Teven Le, Wolf Thomas, Suominen Osma, Sairanen Samuli, Merioksa Mikko, Heinonen Jyrki, Vahtola Aija, Antao Samuel, Pyysalo Sampo
- Are We Falling In A Middle-intelligence Trap? An Analysis And Mitigation Of The Reversal Curse Lv Ang, Zhang Kaiyi, Xie Shufang, Tu Quan, Chen Yuhan, Wen Ji-rong, Yan Rui
- KOSMOS-2.5: A Multimodal Literate Model Lv Tengchao, Huang Yupan, Chen Jingye, Zhao Yuzhong, Jia Yilin, Cui Lei, Ma Shuming, Chang Yaoyao, Huang Shaohan, Wang Wenhui, Dong Li, Luo Weiyao, Wu Shaoxiang, Wang Guoxin, Zhang Cha, Wei Furu
- Comparative Analysis Of Drug-gpt And Chatgpt Llms For Healthcare Insights: Evaluating Accuracy And Relevance In Patient And HCP Contexts Lysandrou Giorgos, Owen Roma English, Mursec Kirsty, Brun Grant Le, Fairley Elizabeth A. L.
- A Paradigm Shift: The Future Of Machine Translation Lies With Large Language Models Lyu Chenyang, Du Zefeng, Xu Jitao, Duan Yitao, Wu Minghao, Lynn Teresa, Aji Alham Fikri, Wong Derek F., Liu Siyou, Wang Longyue
- Translating Radiology Reports Into Plain Language Using Chatgpt And GPT-4 With Prompt Learning: Promising Results, Limitations, And Potential Lyu Qing, Tan Josh, Zapadka Michael E., Ponnatapura Janardhana, Niu Chuang, Myers Kyle J., Wang Ge, Whitlow Christopher T.
- Vista-llama: Reliable Video Narrator Via Equal Distance To Visual Tokens Ma Fan, Jin Xiaojie, Wang Heng, Xian Yuchen, Feng Jiashi, Yang Yi
- Fairness-guided Few-shot Prompting For Large Language Models Ma Huan, Zhang Changqing, Bian Yatao, Liu Lemao, Zhang Zhirui, Zhao Peilin, Zhang Shu, Fu Huazhu, Hu Qinghua, Wu Bingzhe
- Chain-of-skills: A Configurable Model For Open-domain Question Answering Ma Kaixin, Cheng Hao, Zhang Yu, Liu Xiaodong, Nyberg Eric, Gao Jianfeng
- "Oops, Did I Just Say That?" Testing And Repairing Unethical Suggestions Of Large Language Models With Suggest-critique-reflect Process Ma Pingchuan, Li Zongjie, Sun Ao, Wang Shuai
- An Examination Of The Compositionality Of Large Generative Vision-language Models Ma Teli, Li Rong, Liang Junwei
- Let's Do A Thought Experiment: Using Counterfactuals To Improve Moral Reasoning Ma Xiao, Mishra Swaroop, Beirami Ahmad, Beutel Alex, Chen Jilin
- Tomchallenges: A Principle-guided Dataset And Diverse Evaluation Tasks For Exploring Theory Of Mind Ma Xiaomeng, Gao Lingyu, Xu Qihui
- Query Rewriting For Retrieval-augmented Large Language Models Ma Xinbei, Gong Yeyun, He Pengcheng, Zhao Hai, Duan Nan
- VOLTA: Improving Generative Diversity By Variational Mutual Information Maximizing Autoencoder Ma Yueen, Chi Dafeng, Li Jingjing, Song Kai, Zhuang Yuzheng, King Irwin
- Non-autoregressive Streaming Transformer For Simultaneous Translation Ma Zhengrui, Zhang Shaolei, Guo Shoutao, Shao Chenze, Zhang Min, Feng Yang
- World-to-words: Grounded Open Vocabulary Acquisition Through Fast Mapping In Vision-language Models Ma Ziqiao, Pan Jiayi, Chai Joyce
- Large Language Models For Telecom: Forthcoming Impact On The Industry Maatouk Ali, Piovesan Nicola, Ayed Fadhel, De Domenico Antonio, Debbah Merouane
- Video-chatgpt: Towards Detailed Video Understanding Via Large Vision And Language Models Maaz Muhammad, Rasheed Hanoona, Khan Salman, Khan Fahad Shahbaz
- Self-refine: Iterative Refinement With Self-feedback Madaan Aman, Tandon Niket, Gupta Prakhar, Hallinan Skyler, Gao Luyu, Wiegreffe Sarah, Alon Uri, Dziri Nouha, Prabhumoye Shrimai, Yang Yiming, Gupta Shashank, Majumder Bodhisattwa Prasad, Hermann Katherine, Welleck Sean, Yazdanbakhsh Amir, Clark Peter
- Enhancing Conversational Quality In Language Learning Chatbots: An Evaluation Of GPT4 For ASR Error Correction Mai Long, Carson-berndsen Julie
- Harnessing The Power Of Prompt-based Techniques For Generating School-level Questions Using Large Language Models Maity Subhankar, Deroy Aniket, Sarkar Sudeshna
- How Can Large Language Models Help Humans In Design And Manufacturing? Makatura Liane, Foshey Michael, Wang Bohan, Hähnlein Felix, Ma Pingchuan, Deng Bolei, Tjandrasuwita Megan, Spielberg Andrew, Owens Crystal Elaine, Chen Peter Yichen, Zhao Allan, Zhu Amy, Norton Wil J, Gu Edward, Jacob Joshua, Li Yifei, Schulz Adriana, Matusik Wojciech
- Enhancing CLIP With GPT-4: Harnessing Visual Descriptions As Prompts Maniparambil Mayug, Vorster Chris, Molloy Derek, Murphy Noel, Mcguinness Kevin, O'connor Noel E.
- Do Prompt Positions Really Matter? Mao Junyu, Middleton Stuart E., Niranjan Mahesan
- Gpteval: A Survey On Assessments Of Chatgpt And GPT-4 Mao Rui, Chen Guanyi, Zhang Xulang, Guerin Frank, Cambria Erik
- Editing Personality For Large Language Models Mao Shengyu, Wang Xiaohan, Wang Mengru, Jiang Yong, Xie Pengjun, Huang Fei, Zhang Ningyu
- Unitrec: A Unified Text-to-text Transformer And Joint Contrastive Learning Framework For Text-based Recommendation Mao Zhiming, Wang Huimin, Du Yiming, Wong Kam-fai
- Interpreting Learned Feedback Patterns In Large Language Models Marks Luke, Abdullah Amir, Neo Clement, Arike Rauno, Krueger David, Torr Philip, Barez Fazl
- Establishing Vocabulary Tests As A Benchmark For Evaluating Large Language Models Martínez Gonzalo, Conde Javier, Merino-gómez Elena, Bermúdez-margaretto Beatriz, Hernández José Alberto, Reviriego Pedro, Brysbaert Marc
- Chatgpt Vs State-of-the-art Models: A Benchmarking Study In Keyphrase Generation Task Martínez-cruz Roberto, López-lópez Alvaro J., Portela José
- Investigating The Efficacy Of Large Language Models In Reflective Assessment Methods Through Chain Of Thoughts Prompting Masikisiki Baphumelele, Marivate Vukosi, Hlope Yvette
- Benchmarking Large Language Model Capabilities For Conditional Generation Maynez Joshua, Agrawal Priyanka, Gehrmann Sebastian
- Is Feedback All You Need? Leveraging Natural Language Feedback In Goal-conditioned Reinforcement Learning Mccallum Sabrina, Taylor-davies Max, Albrecht Stefano V., Suglia Alessandro
- The Hydra Effect: Emergent Self-repair In Language Model Computations Mcgrath Thomas, Rahtz Matthew, Kramar Janos, Mikulik Vladimir, Legg Shane
- Sources Of Hallucination By Large Language Models On Inference Tasks Mckenna Nick, Li Tianyi, Cheng Liang, Hosseini Mohammad Javad, Johnson Mark, Steedman Mark
- JAB: Joint Adversarial Prompting And Belief Augmentation Mehrabi Ninareh, Goyal Palash, Ramakrishna Anil, Dhamala Jwala, Ghosh Shalini, Zemel Richard, Chang Kai-wei, Galstyan Aram, Gupta Rahul
- Lightlm: A Lightweight Deep And Narrow Language Model For Generative Recommendation Mei Kai, Zhang Yongfeng
- Representation Deficiency In Masked Language Modeling Meng Yu, Krishnan Jitin, Wang Sinong, Wang Qifan, Mao Yuning, Fang Han, Ghazvininejad Marjan, Han Jiawei, Zettlemoyer Luke
- For Generated Text, Is Nli-neutral Text The Best Text? Mersinias Michail, Mahowald Kyle
- From Bytes To Biases: Investigating The Cultural Self-perception Of Large Language Models Messner Wolfgang, Greene Tatum, Matalone Josephine
- An AI Chatbot For Explaining Deep Reinforcement Learning Decisions Of Service-oriented Systems Metzger Andreas, Bartel Jone, Laufer Jan
- Think Before You Act: Unified Policy For Interleaving Language Reasoning With Actions Mezghani Lina, Bojanowski Piotr, Alahari Karteek, Sukhbaatar Sainbayar
- An Empirical Study Of Netops Capability Of Pre-trained Large Language Models Miao Yukai, Bai Yu, Chen Li, Li Dan, Sun Haifeng, Wang Xizheng, Luo Ziqiu, Ren Yanyu, Sun Dapeng, Xu Xiuting, Zhang Qi, Xiang Chao, Li Xinchi
- Self-checker: Plug-and-play Modules For Fact-checking With Large Language Models Miaoran Li, Baolin Peng, Michel Galley, Jianfeng Gao, Zhu Zhang
- Uzh_clyp At Semeval-2023 Task 9: Head-first Fine-tuning And Chatgpt Data Generation For Cross-lingual Learning In Tweet Intimacy Prediction Michail Andrianos, Konstantinou Stefanos, Clematide Simon
- LAIT: Efficient Multi-segment Encoding In Transformers With Layer-adjustable Interaction Milbauer Jeremiah, Louis Annie, Hosseini Mohammad Javad, Fabrikant Alex, Metzler Donald, Schuster Tal
- ALMANACS: A Simulatability Benchmark For Language Model Explainability Mills Edmund, Su Shiye, Russell Stuart, Emmons Scott
- Attention Link: An Efficient Attention-based Low Resource Machine Translation Architecture Min Zeping
- Beyond Scale: The Diversity Coefficient As A Data Quality Metric For Variability In Natural Language Data Miranda Brando, Lee Alycia, Sundar Sudharsan, Casasola Allison, Koyejo Sanmi
- VQA With Cascade Of Self- And Co-attention Blocks Mishra Aakansha, Anand Ashish, Guha Prithwijit
- Stress Testing Chain-of-thought Prompting For Large Language Models Mishra Aayush, Thakkar Karan
- Synthetic Imitation Edit Feedback For Factual Alignment In Clinical Summarization Mishra Prakamya, Yao Zonghai, Chen Shuwei, Wang Beining, Mittal Rohan, Yu Hong
- Comparing Humans, GPT-4, And GPT-4V On Abstraction And Reasoning Tasks Mitchell Melanie, Palmarini Alessandro B., Moskvichev Arseny
- Boosting Theory-of-mind Performance In Large Language Models Via Prompting Moghaddam Shima Rahimi, Honey Christopher J.
- Chatgpt In The Age Of Generative AI And Large Language Models: A Concise Survey Mohamadi Salman, Mujtaba Ghulam, Le Ngan, Doretto Gianfranco, Adjeroh Donald A.
- Landmark Attention: Random-access Infinite Context Length For Transformers Mohtashami Amirkeivan, Jaggi Martin
- Increasing The Performance Of Cognitively Inspired Data-efficient Language Models Via Implicit Structure Building Momen Omar, Arps David, Kallmeyer Laura
- Evaluating Cognitive Maps And Planning In Large Language Models With Cogeval Momennejad Ida, Hasanbeig Hosein, Vieira Felipe, Sharma Hiteshi, Ness Robert Osazuwa, Jojic Nebojsa, Palangi Hamid, Larson Jonathan
- Narrativexl: A Large-scale Dataset For Long-term Memory Models Moskvichev Arseny, Mai Ky-vinh
- Fine-tuning Large Language Models For Adaptive Machine Translation Moslem Yasmin, Haque Rejwanul, Way Andy
- The Impact Of Artificial Intelligence On The Evolution Of Digital Education: A Comparative Study Of Openai Text Generation Tools Including Chatgpt, Bing Chat, Bard, And Ernie Motlagh Negin Yazdani, Khajavi Matin, Sharifi Abbas, Ahmadi Mohsen
- Can Llms Facilitate Interpretation Of Pre-trained Language Models? Mousi Basel, Durrani Nadir, Dalvi Fahim
- Learning To Compress Prompts With Gist Tokens Mu Jesse, Li Xiang Lisa, Goodman Noah
- Navigating Prompt Complexity For Zero-shot Classification: A Study Of Large Language Models In Computational Social Science Mu Yida, Wu Ben P., Thorne William, Robinson Ambrose, Aletras Nikolaos, Scarton Carolina, Bontcheva Kalina, Song Xingyi
- How To Plant Trees In Language Models: Data And Architectural Effects On The Emergence Of Syntactic Inductive Biases Mueller Aaron, Linzen Tal
- In-context Learning Generalizes, But Not Always Robustly: The Case Of Syntax Mueller Aaron, Webson Albert, Petty Jackson, Linzen Tal
- "Medium" Lms Of Code In The Era Of Llms: Lessons From Stackoverflow Mukherjee Manisha, Hellendoorn Vincent J.
- Orca: Progressive Learning From Complex Explanation Traces Of GPT-4 Mukherjee Subhabrata, Mitra Arindam, Jawahar Ganesh, Agarwal Sahaj, Palangi Hamid, Awadallah Ahmed
- Superhf: Supervised Iterative Learning From Human Feedback Mukobi Gabriel, Chatain Peter, Fong Su, Windesheim Robert, Kutyniok Gitta, Bhatia Kush, Alberti Silas
- A Brief History Of Prompt: Leveraging Language Models. (through Advanced Prompting) Muktadir Golam Md
- Arabic Mini-climategpt: A Climate Change And Sustainability Tailored Arabic LLM Mullappilly Sahal Shaji, Shaker Abdelrahman, Thawakar Omkar, Cholakkal Hisham, Anwer Rao Muhammad, Khan Salman, Khan Fahad Shahbaz
- LLM4VV: Developing Llm-driven Testsuite For Compiler Validation Munley Christian, Jarmusch Aaron, Chandrasekaran Sunita
- Ai-assisted Code Authoring At Scale: Fine-tuning, Deploying, And Mixed Methods Evaluation Murali Vijayaraghavan, Maddila Chandra, Ahmad Imad, Bolin Michael, Cheng Daniel, Ghorbani Negar, Fernandez Renuka, Nagappan Nachiappan, Rigby Peter C.
- Testing Llms On Code Generation With Varying Levels Of Prompt Specificity Murr Lincoln, Grainger Morgan, Gao David
- Consistency Analysis Of Chatgpt Myeongjun Erik Jang, Thomas Lukasiewicz
- Self-contradictory Hallucinations Of Large Language Models: Evaluation, Detection And Mitigation Mündler Niels, He Jingxuan, Jenko Slobodan, Vechev Martin
- Teaching Probabilistic Logical Reasoning To Transformers Nafar Aliakbar, Venable Kristen Brent, Kordjamshidi Parisa
- Testing Language Model Agents Safely In The Wild Naihin Silen, Atkinson David, Green Marc, Hamadi Merwane, Swift Craig, Schonholtz Douglas, Kalai Adam Tauman, Bau David
- Generating Medically-accurate Summaries Of Patient-provider Dialogue: A Multi-stage Approach Using Large Language Models Nair Varun, Schumacher Elliot, Kannan Anitha
- DERA: Enhancing Large Language Model Completions With Dialog-enabled Resolving Agents Nair Varun, Schumacher Elliot, Tso Geoffrey, Kannan Anitha
- Using An LLM To Help With Code Understanding Nam Daye, Macvean Andrew, Hellendoorn Vincent, Vasilescu Bogdan, Myers Brad
- Breaking Language Barriers With A LEAP: Learning Strategies For Polyglot Llms Nambi Akshay, Balloli Vaibhav, Ranjit Mercy, Ganu Tanuja, Ahuja Kabir, Sitaram Sunayana, Bali Kalika
- On Evaluating The Integration Of Reasoning And Action In LLM Agents With Database Question Answering Nan Linyong, Zhang Ellen, Zou Weijin, Zhao Yilun, Zhou Wenfei, Cohan Arman
- HALO: An Ontology For Representing And Categorizing Hallucinations In Large Language Models Nananukul Navapat, Kejriwal Mayank
- Cheaply Evaluating Inference Efficiency Metrics For Autoregressive Transformer Apis Narayanan Deepak, Santhanam Keshav, Henderson Peter, Bommasani Rishi, Lee Tony, Liang Percy
- Scalable Extraction Of Training Data From (production) Language Models Nasr Milad, Carlini Nicholas, Hayase Jonathan, Jagielski Matthew, Cooper A. Feder, Ippolito Daphne, Choquette-choo Christopher A., Wallace Eric, Tramèr Florian, Lee Katherine
- Clever Hans Or Neural Theory Of Mind? Stress Testing Social Reasoning In Large Language Models Natalie Shapira, Mosh Levy, Seyed Hossein Alavi, Xuhui Zhou, Yejin Choi, Yoav Goldberg, Maarten Sap, Vered Shwartz
- Reinforcement Replaces Supervision: Query Focused Summarization Using Deep Reinforcement Learning Nath Swaroop, Khadilkar Harshad, Bhattacharyya Pushpak
- Towards Probing Contact Center Large Language Models Nathan Varun, Kumar Ayush, Ingle Digvijay, Vepa Jithendra
- Aligning A Medium-size GPT Model In English To A Small Closed Domain In Spanish Navarrete-parra Oscar R., Uc-cetina Victor, Reyes-magana Jorge
- Large Language Models As Tax Attorneys: A Case Study In Legal Capabilities Emergence Nay John J., Karamardian David, Lawsky Sarah B., Tao Wenting, Bhat Meghana, Jain Raghav, Lee Aaron Travis, Choi Jonathan H., Kasai Jungo
- Vision Encoder-decoder Models For AI Coaching Nayak Jyothi S, Khan Afifah Khan Mohammed Ajmal, Manjeshwar Chirag, Banday Imadh Ajaz
- On Robustness Of Finetuned Transformer-based NLP Models Neerudu Pavan Kalyan Reddy, Oota Subba Reddy, Marreddy Mounika, Kagita Venkateswara Rao, Gupta Manish
- Trusting Language Models In Education Neto Jogi Suda, Deng Li, Raya Thejaswi, Shahbazi Reza, Liu Nick, Venkatesh Adhitya, Shah Miral, Khosla Neeru, Guido Rodrigo Capobianco
- Exploring The Maze Of Multilingual Modeling Nezhad Sina Bagheri, Agrawal Ameeta
- Simplyretrieve: A Private And Lightweight Retrieval-centric Generative AI Tool Ng Youyang, Miyashita Daisuke, Hoshi Yasuto, Morioka Yasuhiro, Torii Osamu, Kodama Tomoya, Deguchi Jun
- Evaluating The Symbol Binding Ability Of Large Language Models For Multiple-choice Questions In Vietnamese General Education Nguyen Duc-vu, Nguyen Quoc-nam
- Black-box Analysis: Gpts Across Time In Legal Textual Entailment Task Nguyen Ha-thanh, Goebel Randy, Toni Francesca, Stathis Kostas, Satoh Ken
- Large Language Models For In-context Student Modeling: Synthesizing Student's Behavior In Visual Programming Nguyen Manh Hung, Tschiatschek Sebastian, Singla Adish
- When Giant Language Brains Just Aren't Enough! Domain Pizzazz With Knowledge Sparkle Dust Nguyen Minh-tien, Nguyen Duy-hung, Sabahi Shahab, Le Hung, Yang Jeff, Hotta Hajime
- EVJVQA Challenge: Multilingual Visual Question Answering Nguyen Ngan Luu-thuy, Nguyen Nghia Hieu, Vo Duong T. D, Tran Khanh Quoc, Van Nguyen Kiet
- Democratizing Llms For Low-resource Languages By Leveraging Their English Dominant Abilities With Linguistically-diverse Prompts Nguyen Xuan-phi, Aljunied Sharifah Mahani, Joty Shafiq, Bing Lidong
- Seallms -- Large Language Models For Southeast Asia Nguyen Xuan-phi, Zhang Wenxuan, Li Xin, Aljunied Mahani, Hu Zhiqiang, Shen Chenhui, Chia Yew Ken, Li Xingxuan, Wang Jianyu, Tan Qingyu, Cheng Liying, Chen Guanzheng, Deng Yue, Yang Sen, Liu Chaoqun, Zhang Hang, Bing Lidong
- L2ceval: Evaluating Language-to-code Generation Capabilities Of Large Language Models Ni Ansong, Yin Pengcheng, Zhao Yilun, Riddell Martin, Feng Troy, Shen Rui, Yin Stephen, Liu Ye, Yavuz Semih, Xiong Caiming, Joty Shafiq, Zhou Yingbo, Radev Dragomir, Cohan Arman
- An Experimental Study: Assessing The Combined Framework Of Wavlm And BEST-RQ For Text-to-speech Synthesis Nielson Via, Hillis Steven
- Codegen2: Lessons For Training Llms On Programming And Natural Languages Nijkamp Erik, Hayashi Hiroaki, Xiong Caiming, Savarese Silvio, Zhou Yingbo
- LEXTREME: A Multi-lingual And Multi-task Benchmark For The Legal Domain Niklaus Joel, Matoshi Veton, Rani Pooja, Galassi Andrea, Stürmer Matthias, Chalkidis Ilias
- Gpt4image: Can Large Pre-trained Models Help Vision Models On Perception Tasks? Ning Ding, Yehui Tang, Zhongqian Fu, Chao Xu, Kai Han, Yunhe Wang
- Ragtruth: A Hallucination Corpus For Developing Trustworthy Retrieval-augmented Language Models Niu Cheng, Wu Yuanhao, Zhu Juno, Xu Siliang, Shum Kashun, Zhong Randy, Song Juntong, Zhang Tong
- Reflexion: Language Agents With Verbal Reinforcement Learning Noah Shinn, Federico Cassano, Edward Berman, Ashwin Gopinath, Karthik Narasimhan, Shunyu Yao
- Evaluating AI Vocational Skills Through Professional Testing Noever David, Ciolino Matt
- AI Text-to-behavior: A Study In Steerability Noever David, Hyams Sam
- Chatbots As Problem Solvers: Playing Twenty Questions With Role Reversals Noever David, Mckee Forrest
- Numeracy From Literacy: Data Science As An Emergent Skill From Large Language Models Noever David, Mckee Forrest
- Multimodal Analysis Of Google Bard And Gpt-vision: Experiments In Visual Reasoning Noever David, Noever Samantha Elizabeth Miller
- Visual AI And Linguistic Intelligence Through Steerability And Composability Noever David, Noever Samantha Elizabeth Miller
- Capabilities Of GPT-4 On Medical Challenge Problems Nori Harsha, King Nicholas, Mckinney Scott Mayer, Carignan Dean, Horvitz Eric
- Evaluating GPT-3.5 And GPT-4 Models On Brazilian University Admission Exams Nunes Desnes, Primi Ricardo, Pires Ramon, Lotufo Roberto, Nogueira Rodrigo
- Contrastive Decoding Improves Reasoning In Large Language Models O'brien Sean, Lewis Mike
- Adversarial Fine-tuning Of Language Models: An Iterative Optimisation Approach For The Generation And Detection Of Problematic Content O'neill Charles, Miller Jack, Ciuca Ioana, Ting Yuan-sen, Bui Thang
- In-contextual Gender Bias Suppression For Large Language Models Oba Daisuke, Kaneko Masahiro, Bollegala Danushka
- Are Large Language Models Fit For Guided Reading? Ochieng Peter
- Comparative Analysis Of CHATGPT And The Evolution Of Language Models Ogundare Oluwatosin, Araya Gustavo Quiros
- Blackvip: Black-box Visual Prompting For Robust Transfer Learning Oh Changdae, Hwang Hyeji, Lee Hee-young, Lim Yongtaek, Jung Geunyoung, Jung Jiyoung, Choi Hosik, Song Kyungwoo
- Data Augmentation For Neural Machine Translation Using Generative Language Model Oh Seokjin, Lee Su Ah, Jung Woohwan
- Large Language Model (LLM) Bias Index -- LLMBI Oketunji Abiodun Finbarrs, Anas Muhammad, Saina Deepthi
- LINC: A Neurosymbolic Approach For Logical Reasoning By Combining Language Models With First-order Logic Provers Olausson Theo X., Gu Alex, Lipkin Benjamin, Zhang Cedegao E., Solar-lezama Armando, Tenenbaum Joshua B., Levy Roger
- Is Self-repair A Silver Bullet For Code Generation? Olausson Theo X., Inala Jeevana Priya, Wang Chenglong, Gao Jianfeng, Solar-lezama Armando
- Generative AI: Implications And Applications For Education Tzirides Anastasia Olga (Olnancy), Saini Akash, Zapata Gabriela, Searsmith Duane, Cope Bill, Kalantzis Mary, Castro Vania, Kourkoulou Theodora, Jones John, Da Silva Rodrigo Abrantes, Whiting Jen, Kastania Nikoleta Polyxeni
- Automated Assessment Of Students' Code Comprehension Using Llms Oli Priti, Banjade Rabin, Chapagain Jeevan, Rus Vasile
- Deepthought: An Architecture For Autonomous Self-motivated Systems Oliveira Arlindo L., Domingos Tiago, Figueiredo Mário, Lima Pedro U.
- GPT-4 Technical Report Openai, Josh Achiam, Steven Adler, Sandhini Agarwal, Lama Ahmad, Ilge Akkaya, Florencia Leoni Aleman, Diogo Almeida, Janko Altenschmidt, Sam Altman, Shyamal Anadkat, Red Avila, Igor Babuschkin, Suchir Balaji, Valerie Balcom, Paul Baltescu, Haiming Bao, Mohammad Bavarian, Jeff Belgum, Irwan Bello, Jake Berdine, Gabriel Bernadett-shapiro, Christopher Berner, Lenny Bogdonoff, Oleg Boiko, Madelaine Boyd, Anna-luisa Brakman, Greg Brockman, Tim Brooks, Miles Brundage, Kevin Button, Trevor Cai, Rosie Campbell, Andrew Cann, Brittany Carey, Chelsea Carlson, Rory Carmichael, Brooke Chan, Che Chang, Fotis Chantzis, Derek Chen, Sully Chen, Ruby Chen, Jason Chen, Mark Chen, Ben Chess, Chester Cho, Casey Chu, Hyung Won Chung, Dave Cummings, Jeremiah Currier, Yunxing Dai, Cory Decareaux, Thomas Degry, Noah Deutsch, Damien Deville, Arka Dhar, David Dohan, Steve Dowling, Sheila Dunning, Adrien Ecoffet, Atty Eleti, Tyna Eloundou, David Farhi, Liam Fedus, Niko Felix, Simón Posada Fishman, Juston Forte, Isabella Fulford, Leo Gao, Elie Georges, Christian Gibson, Vik Goel, Tarun Gogineni, Gabriel Goh, Rapha Gontijo-lopes, Jonathan Gordon, Morgan Grafstein, Scott Gray, Ryan Greene, Joshua Gross, Shixiang Shane Gu, Yufei Guo, Chris Hallacy, Jesse Han, Jeff Harris, Yuchen He, Mike Heaton, Johannes Heidecke, Chris Hesse, Alan Hickey, Wade Hickey, Peter Hoeschele, Brandon Houghton, Kenny Hsu, Shengli Hu, Xin Hu, Joost Huizinga, Shantanu Jain, Shawn Jain, Joanne Jang, Angela Jiang, Roger Jiang, Haozhun Jin, Denny Jin, Shino Jomoto, Billie Jonn, Heewoo Jun, Tomer Kaftan, Łukasz Kaiser, Ali Kamali, Ingmar Kanitscheider, Nitish Shirish Keskar, Tabarak Khan, Logan Kilpatrick, Jong Wook Kim, Christina Kim, Yongjik Kim, Jan Hendrik Kirchner, Jamie Kiros, Matt Knight, Daniel Kokotajlo, Łukasz Kondraciuk, Andrew Kondrich, Aris Konstantinidis, Kyle Kosic, Gretchen Krueger, Vishal Kuo, Michael Lampe, Ikai Lan, Teddy Lee, Jan Leike, Jade Leung, Daniel Levy, Chak Ming Li, Rachel Lim, Molly Lin, Stephanie Lin, Mateusz Litwin, Theresa Lopez, Ryan Lowe, Patricia Lue, Anna Makanju, Kim Malfacini, Sam Manning, Todor Markov, Yaniv Markovski, Bianca Martin, Katie Mayer, Andrew Mayne, Bob Mcgrew, Scott Mayer Mckinney, Christine Mcleavey, Paul Mcmillan, Jake Mcneil, David Medina, Aalok Mehta, Jacob Menick, Luke Metz, Andrey Mishchenko, Pamela Mishkin, Vinnie Monaco, Evan Morikawa, Daniel Mossing, Tong Mu, Mira Murati, Oleg Murk, David Mély, Ashvin Nair, Reiichiro Nakano, Rajeev Nayak, Arvind Neelakantan, Richard Ngo, Hyeonwoo Noh, Long Ouyang, Cullen O'keefe, Jakub Pachocki, Alex Paino, Joe Palermo, Ashley Pantuliano, Giambattista Parascandolo, Joel Parish, Emy Parparita, Alex Passos, Mikhail Pavlov, Andrew Peng, Adam Perelman, Filipe De Avila Belbute Peres, Michael Petrov, Henrique Ponde De Oliveira Pinto, Michael Pokorny, Michelle Pokrass, Vitchyr H. Pong, Tolly Powell, Alethea Power, Boris Power, Elizabeth Proehl, Raul Puri, Alec Radford, Jack Rae, Aditya Ramesh, Cameron Raymond, Francis Real, Kendra Rimbach, Carl Ross, Bob Rotsted, Henri Roussez, Nick Ryder, Mario Saltarelli, Ted Sanders, Shibani Santurkar, Girish Sastry, Heather Schmidt, David Schnurr, John Schulman, Daniel Selsam, Kyla Sheppard, Toki Sherbakov, Jessica Shieh, Sarah Shoker, Pranav Shyam, Szymon Sidor, Eric Sigler, Maddie Simens, Jordan Sitkin, Katarina Slama, Ian Sohl, Benjamin Sokolowsky, Yang Song, Natalie Staudacher, Felipe Petroski Such, Natalie Summers, Ilya Sutskever, Jie Tang, Nikolas Tezak, Madeleine B. Thompson, Phil Tillet, Amin Tootoonchian, Elizabeth Tseng, Preston Tuggle, Nick Turley, Jerry Tworek, Juan Felipe Cerón Uribe, Andrea Vallone, Arun Vijayvergiya, Chelsea Voss, Carroll Wainwright, Justin Jay Wang, Alvin Wang, Ben Wang, Jonathan Ward, Jason Wei, Cj Weinmann, Akila Welihinda, Peter Welinder, Jiayi Weng, Lilian Weng, Matt Wiethoff, Dave Willner, Clemens Winter, Samuel Wolrich, Hannah Wong, Lauren Workman, Sherwin Wu, Jeff Wu, Michael Wu, Kai Xiao, Tao Xu, Sarah Yoo, Kevin Yu, Qiming Yuan, Wojciech Zaremba, Rowan Zellers, Chong Zhang, Marvin Zhang, Shengjia Zhao, Tianhao Zheng, Juntang Zhuang, William Zhuk, Barret Zoph
- Detecting Llm-generated Text In Computing Education: A Comparative Study For Chatgpt Cases Orenstrakh Michael Sheinman, Karnalim Oscar, Suarez Carlos Anibal, Liut Michael
- Thoughtsource: A Central Hub For Large Language Model Reasoning Data Ott Simon, Hebenstreit Konstantin, Liévin Valentin, Hother Christoffer Egeberg, Moradi Milad, Mayrhauser Maximilian, Praas Robert, Winther Ole, Samwald Matthias
- Dialogbench: Evaluating Llms As Human-like Dialogue Systems Ou Jiao, Lu Junda, Liu Che, Tang Yihong, Zhang Fuzheng, Zhang Di, Gai Kun
- The Shifted And The Overlooked: A Task-oriented Investigation Of User-gpt Interactions Ouyang Siru, Wang Shuohang, Liu Yang, Zhong Ming, Jiao Yizhu, Iter Dan, Pryzant Reid, Zhu Chenguang, Ji Heng, Han Jiawei
- Structured Chemistry Reasoning With Large Language Models Ouyang Siru, Zhang Zhuosheng, Yan Bing, Liu Xuan, Choi Yejin, Han Jiawei, Qin Lianhui
- On The Role Of Attention In Prompt-tuning Oymak Samet, Rawat Ankit Singh, Soltanolkotabi Mahdi, Thrampoulidis Christos
- How To Catch An AI Liar: Lie Detection In Black-box Llms By Asking Unrelated Questions Pacchiardi Lorenzo, Chan Alex J., Mindermann Sören, Moscovitz Ilan, Pan Alexa Y., Gal Yarin, Evans Owain, Brauner Jan
- Several Categories Of Large Language Models (llms): A Short Survey Pahune Saurabh, Chandrasekharan Manoj
- Med-halt: Medical Domain Hallucination Test For Large Language Models Pal Ankit, Umapathi Logesh Kumar, Sankarasubbu Malaikannan
- Giraffe: Adventures In Expanding Context Lengths In Llms Pal Arka, Karkhanis Deep, Roberts Manley, Dooley Samuel, Sundararajan Arvind, Naidu Siddartha
- Evaluating And Explaining Large Language Models For Code Using Syntactic Structures Palacio David N, Velasco Alejandro, Rodriguez-cardenas Daniel, Moran Kevin, Poshyvanyk Denys
- Ontochatgpt Information System: Ontology-driven Structured Prompts For Chatgpt Meta-learning Palagin Oleksandr, Kaverinskiy Vladislav, Litvin Anna, Malakhov Kyrylo
- Kwaiagents: Generalized Information-seeking Agent System With Large Language Models Pan Haojie, Zhai Zepeng, Yuan Hao, Lv Yaojia, Fu Ruiji, Liu Ming, Wang Zhongyuan, Qin Bing
- Finding And Editing Multi-modal Neurons In Pre-trained Transformers Pan Haowen, Cao Yixin, Wang Xiaozhi, Yang Xun, Wang Meng
- What In-context Learning "learns" In-context: Disentangling Task Recognition And Task Learning Pan Jane, Gao Tianyu, Chen Howard, Chen Danqi
- Stelocoder: A Decoder-only LLM For Multi-language To Python Code Translation Pan Jialing, Sadé Adrien, Kim Jin, Soriano Eric, Sole Guillem, Flamant Sylvain
- COSMIC: Data Efficient Instruction-tuning For Speech In-context Learning Pan Jing, Wu Jian, Gaur Yashesh, Sivasankaran Sunit, Chen Zhuo, Liu Shujie, Li Jinyu
- Retrieving-to-answer: Zero-shot Video Question Answering With Frozen Large Language Models Pan Junting, Lin Ziyi, Ge Yuying, Zhu Xiatian, Zhang Renrui, Wang Yi, Qiao Yu, Li Hongsheng
- Chameleon: Plug-and-play Compositional Reasoning With Large Language Models Pan Lu, Baolin Peng, Hao Cheng, Michel Galley, Kai-wei Chang, Ying Nian Wu, Song-chun Zhu, Jianfeng Gao
- A Preliminary Evaluation Of Chatgpt For Zero-shot Dialogue Understanding Pan Wenbo, Chen Qiguang, Xu Xiao, Che Wanxiang, Qin Libo
- Tppoet: Transformer-based Persian Poem Generation Using Minimal Data And Advanced Decoding Techniques Panahandeh Amir, Asemi Hanie, Nourani Esmaeil
- Semantic Composition In Visually Grounded Language Models Pandey Rohan
- Peft-medaware: Large Language Model For Medical Awareness Pandya Keivalya
- Automating Customer Service Using Langchain: Building Custom Open-source GPT Chatbot For Organizations Pandya Keivalya, Holia Mehfuza
- Language Model Self-improvement By Reinforcement Learning Contemplation Pang Jing-cheng, Wang Pengyuan, Li Kaiyuan, Chen Xiong-hui, Xu Jiacheng, Zhang Zongzhang, Yu Yang
- Frozen Transformers In Language Models Are Effective Visual Encoder Layers Pang Ziqi, Xie Ziyang, Man Yunze, Wang Yu-xiong
- Automated Annotation With Generative AI Requires Validation Pangakis Nicholas, Wolken Samuel, Fasching Neil
- Trainable Transformer In Transformer Panigrahi Abhishek, Malladi Sadhika, Xia Mengzhou, Arora Sanjeev
- Injecting Structural Hints: Using Language Models To Study Inductive Biases In Language Learning Papadimitriou Isabel, Jurafsky Dan
- Interpretable Visual Question Answering Via Reasoning Supervision Parelli Maria, Mallis Dimitrios, Diomataris Markos, Pitsikalis Vassilis
- Abductive Reasoning With The GPT-4 Language Model: Case Studies From Criminal Investigation, Medical Practice, Scientific Research Pareschi Remo
- 1.5 Million Materials Narratives Generated By Chatbots Park Yang Jeong, Jerng Sung Eun, Park Jin-sung, Kwon Choah, Hsu Chia-wei, Ren Zhichu, Yoon Sungroh, Li Ju
- A Large Language Model Approach To Educational Survey Feedback Analysis Parker Michael J., Anderson Caitlin, Stone Claire, Oh Yearim
- Longbox: Evaluating Transformers On Long-sequence Clinical Tasks Parmar Mihir, Naik Aakanksha, Gupta Himanshu, Agrawal Disha, Baral Chitta
- Gorilla: Large Language Model Connected With Massive Apis Patil Shishir G., Zhang Tianjun, Wang Xin, Gonzalez Joseph E.
- Sequencing Matters: A Generate-retrieve-generate Model For Building Conversational Agents Patwardhan Quinn, Yang Grace Hui
- REFINER: Reasoning Feedback On Intermediate Representations Paul Debjit, Ismayilzada Mete, Peyrard Maxime, Borges Beatriz, Bosselut Antoine, West Robert, Faltings Boi
- Instructexcel: A Benchmark For Natural Language Instruction In Excel Payan Justin, Mishra Swaroop, Singh Mukul, Negreanu Carina, Poelitz Christian, Baral Chitta, Roy Subhro, Chakravarthy Rasika, Van Durme Benjamin, Nouri Elnaz
- Entity Matching Using Large Language Models Peeters Ralph, Bizer Christian
- GPT Self-supervision For A Better Data Annotator Pei Xiaohuan, Li Yanxi, Xu Chang
- Evaluation Of Medium-large Language Models At Zero-shot Closed Book Generative Question Answering Peinl René, Wirth Johannes
- Exploiting Novel GPT-4 Apis Pelrine Kellin, Taufeeque Mohammad, Zając Michał, Mclean Euan, Gleave Adam
- Chatgpt Prompting Cannot Estimate Predictive Uncertainty In High-resource Languages Pelucchi Martino, Valdenegro-toro Matias
- Instruction Tuning With GPT-4 Peng Baolin, Li Chunyuan, He Pengcheng, Galley Michel, Gao Jianfeng
- Yarn: Efficient Context Window Extension Of Large Language Models Peng Bowen, Quesnelle Jeffrey, Fan Honglu, Shippole Enrico
- A Study Of Generative Large Language Model For Medical Research And Healthcare Peng Cheng, Yang Xi, Chen Aokun, Smith Kaleb E, Pournejatian Nima, Costa Anthony B, Martin Cheryl, Flores Mona G, Zhang Ying, Magoc Tanja, Lipori Gloria, Mitchell Duane A, Ospina Naykky S, Ahmed Mustafa M, Hogan William R, Shenkman Elizabeth A, Guo Yi, Bian Jiang, Wu Yonghui
- Generative Large Language Models Are All-purpose Text Analytics Engines: Text-to-text Learning Is All Your Need Peng Cheng, Yang Xi, Chen Aokun, Yu Zehao, Smith Kaleb E, Costa Anthony B, Flores Mona G, Bian Jiang, Wu Yonghui
- FP8-LM: Training FP8 Large Language Models Peng Houwen, Wu Kan, Wei Yixuan, Zhao Guoshuai, Yang Yuxiang, Liu Ze, Xiong Yifan, Yang Ziyue, Ni Bolin, Hu Jingcheng, Li Ruihang, Zhang Miaosen, Li Chen, Ning Jia, Wang Ruizhe, Zhang Zheng, Liu Shuguang, Chau Joe, Hu Han, Cheng Peng
- Spoken Language Intelligence Of Large Language Models For Language Learning Peng Linkai, Nuchged Baorian, Gao Yingming
- Customising General Large Language Models For Specialised Emotion Recognition Tasks Peng Liyizhe, Zhang Zixing, Pang Tao, Han Jing, Zhao Huan, Chen Hao, Schuller Björn W.
- Large Language Models Can Infer Psychological Dispositions Of Social Media Users Peters Heinrich, Matz Sandra
- Learning From Free-text Human Feedback -- Collect New Datasets Or Extend Existing Ones? Petrak Dominic, Moosavi Nafise Sadat, Tian Ye, Rozanov Nikolai, Gurevych Iryna
- When Do Prompting And Prefix-tuning Work? A Theory Of Capabilities And Limitations Petrov Aleksandar, Torr Philip H. S., Bibi Adel
- Recjpq: Training Large-catalogue Sequential Recommenders Petrov Aleksandr V., Macdonald Craig
- The Impact Of Depth On Compositional Generalization In Transformer Language Models Petty Jackson, Van Steenkiste Sjoerd, Dasgupta Ishita, Sha Fei, Garrette Dan, Linzen Tal
- Attention Sorting Combats Recency Bias In Long Context Language Models Peysakhovich Alexander, Lerer Adam
- Let Models Speak Ciphers: Multiagent Debate Through Embeddings Pham Chau, Liu Boyi, Yang Yingxiang, Chen Zhengyu, Liu Tianyi, Yuan Jianbo, Plummer Bryan A., Wang Zhaoran, Yang Hongxia
- Zero-shot Question Answering Over Financial Documents Using Large Language Models Phogat Karmvir Singh, Harsha Chetan, Dasaratha Sridhar, Ramakrishna Shashishekar, Puranam Sai Akhil
- Generative AI For Programming Education: Benchmarking Chatgpt, GPT-4, And Human Tutors Phung Tung, Pădurean Victor-alexandru, Cambronero José, Gulwani Sumit, Kohn Tobias, Majumdar Rupak, Singla Adish, Soares Gustavo
- Automating Human Tutor-style Programming Feedback: Leveraging GPT-4 Tutor Model For Hint Generation And GPT-3.5 Student Model For Hint Validation Phung Tung, Pădurean Victor-alexandru, Singh Anjali, Brooks Christopher, Cambronero José, Gulwani Sumit, Singla Adish, Soares Gustavo
- LLM Self Defense: By Self Examination, Llms Know They Are Being Tricked Phute Mansi, Helbling Alec, Hull Matthew, Peng Shengyun, Szyller Sebastian, Cornelius Cory, Chau Duen Horng
- Jatmo: Prompt Injection Defense By Task-specific Finetuning Piet Julien, Alrashed Maha, Sitawarin Chawin, Chen Sizhe, Wei Zeming, Sun Elizabeth, Alomair Basel, Wagner David
- Emptying The Ocean With A Spoon: Should We Edit Models? Pinter Yuval, Elhadad Michael
- Typhoon: Thai Large Language Models Pipatanakul Kunat, Jirabovonvisut Phatrasek, Manakul Potsawee, Sripaisarnmongkol Sittipong, Patomwong Ruangsak, Chokchainant Pathomporn, Tharnpipitchai Kasima
- Sabiá: Portuguese Large Language Models Pires Ramon, Abonizio Hugo, Almeida Thales Sales, Nogueira Rodrigo
- Evaluating Gpt-4's Vision Capabilities On Brazilian University Admission Exams Pires Ramon, Almeida Thales Sales, Abonizio Hugo, Nogueira Rodrigo
- LLM4TDD: Best Practices For Test Driven Development Using Large Language Models Piya Sanyogita, Sullivan Allison
- Three Ways Of Using Large Language Models To Evaluate Chat Plátek Ondřej, Hudeček Vojtěch, Schmidtová Patricia, Lango Mateusz, Dušek Ondřej
- Ai-assisted Coding: Experiments With GPT-4 Poldrack Russell A, Lu Thomas, Beguš Gašper
- Hyena Hierarchy: Towards Larger Convolutional Language Models Poli Michael, Massaroli Stefano, Nguyen Eric, Fu Daniel Y., Dao Tri, Baccus Stephen, Bengio Yoshua, Ermon Stefano, Ré Christopher
- Mosaicbert: A Bidirectional Encoder Optimized For Fast Pretraining Portes Jacob, Trott Alex, Havens Sam, King Daniel, Venigalla Abhinav, Nadeem Moin, Sardana Nikhil, Khudia Daya, Frankle Jonathan
- Selfcheckgpt: Zero-resource Black-box Hallucination Detection For Generative Large Language Models Potsawee Manakul, Adian Liusie, Mark J. F. Gales
- Transformers In The Service Of Description Logic-based Contexts Poulis Angelos, Tsalapati Eleni, Koubarakis Manolis
- Science In The Era Of Chatgpt, Large Language Models And Generative AI: Challenges For Research Ethics And How To Respond Pournaras Evangelos
- Egovlpv2: Egocentric Video-language Pre-training With Fusion In The Backbone Pramanick Shraman, Song Yale, Nag Sayan, Lin Kevin Qinghong, Shah Hardik, Shou Mike Zheng, Chellappa Rama, Zhang Pengchuan
- Rephrase, Augment, Reason: Visual Grounding Of Questions For Vision-language Models Prasad Archiki, Stengel-eskin Elias, Bansal Mohit
- CORE-GPT: Combining Open Access Research And Large Language Models For Credible, Trustworthy Question Answering Pride David, Cancellieri Matteo, Knoth Petr
- Mini Minds: Exploring Bebeshka And Zlata Baby Models Proskurina Irina, Metzler Guillaume, Velcin Julien
- Chatgpt Vs Human-authored Text: Insights Into Controllable Text Summarization And Sentence Style Transfer Pu Dongqi, Demberg Vera
- Neural Task Synthesis For Visual Programming Pădurean Victor-alexandru, Tzannetos Georgios, Singla Adish
- Visual Adversarial Examples Jailbreak Aligned Large Language Models Qi Xiangyu, Huang Kaixuan, Panda Ashwinee, Henderson Peter, Wang Mengdi, Mittal Prateek
- Fine-tuning Aligned Language Models Compromises Safety, Even When Users Do Not Intend To! Qi Xiangyu, Zeng Yi, Xie Tinghao, Chen Pin-yu, Jia Ruoxi, Mittal Prateek, Henderson Peter
- Toolink: Linking Toolkit Creation And Using Through Chain-of-solving On Open-source Model Qian Cheng, Xiong Chenyan, Liu Zhenghao, Liu Zhiyuan
- Harnessing The Power Of Large Language Models For Empathetic Response Generation: Empirical Investigations And Improvements Qian Yushan, Zhang Wei-nan, Liu Ting
- Hijacking Large Language Models Via Adversarial In-context Learning Qiang Yao, Zhou Xiangyu, Zhu Dongxiao
- OPERA: Alleviating Hallucination In Multi-modal Large Language Models Via Over-trust Penalty And Retrospection-allocation Qidong Huang, Xiaoyi Dong, Pan Zhang, Bin Wang, Conghui He, Jiaqi Wang, Dahua Lin, Weiming Zhang, Nenghai Yu
- Improving In-context Learning Via Bidirectional Alignment Qin Chengwei, Xia Wenhan, Jiao Fangkai, Chen Chen, Hu Yuchen, Ding Bosheng, Joty Shafiq
- Is Chatgpt A General-purpose Natural Language Processing Task Solver? Qin Chengwei, Zhang Aston, Zhang Zhuosheng, Chen Jiaao, Yasunaga Michihiro, Yang Diyi
- How Good Is Google Bard's Visual Understanding? An Empirical Study On Open Challenges Qin Haotong, Ji Ge-peng, Khan Salman, Fan Deng-ping, Khan Fahad Shahbaz, Van Gool Luc
- Cross-lingual Prompting: Improving Zero-shot Chain-of-thought Reasoning Across Languages Qin Libo, Chen Qiguang, Wei Fuxuan, Huang Shijue, Che Wanxiang
- Toolllm: Facilitating Large Language Models To Master 16000+ Real-world Apis Qin Yujia, Liang Shihao, Ye Yining, Zhu Kunlun, Yan Lan, Lu Yaxi, Lin Yankai, Cong Xin, Tang Xiangru, Qian Bill, Zhao Sihan, Hong Lauren, Tian Runchu, Xie Ruobing, Zhou Jie, Gerstein Mark, Li Dahai, Liu Zhiyuan, Sun Maosong
- Large Language Models Are Effective Text Rankers With Pairwise Ranking Prompting Qin Zhen, Jagerman Rolf, Hui Kai, Zhuang Honglei, Wu Junru, Yan Le, Shen Jiaming, Liu Tianqi, Liu Jialu, Metzler Donald, Wang Xuanhui, Bendersky Michael
- Transnormerllm: A Faster And Better Large Language Model With Improved Transnormer Qin Zhen, Li Dong, Sun Weigao, Sun Weixuan, Shen Xuyang, Han Xiaodong, Wei Yunshen, Lv Baohong, Luo Xiao, Qiao Yu, Zhong Yiran
- Faithful Chain-of-thought Reasoning Qing Lyu, Shreya Havaldar, Adam Stein, Li Zhang, Delip Rao, Eric Wong, Marianna Apidianaki, Chris Callison-burch
- Are Large Language Models Temporally Grounded? Qiu Yifu, Zhao Zheng, Ziser Yftah, Korhonen Anna, Ponti Edoardo M., Cohen Shay B.
- Unlocking Emergent Modularity In Large Language Models Qiu Zihan, Huang Zeyu, Fu Jie
- Cmed-gpt: Prompt Tuning For Entity-aware Chinese Medical Dialogue Generation Qu Zhijie, Li Juan, Ma Zerui, Li Jianqiang
- Exploring The Effectiveness Of GPT Models In Test-taking: A Case Study Of The Driver's License Knowledge Test Rahimi Saba, Balch Tucker, Veloso Manuela
- Quantized Transformer Language Model Implementations On Edge Devices Rahman Mohammad Wali Ur, Abrar Murad Mehrab, Copening Hunter Gibbons, Hariri Salim, Shao Sicong, Satam Pratik, Salehi Soheil
- Mixed-distil-bert: Code-mixed Language Modeling For Bangla, English, And Hindi Raihan Md Nishat, Goswami Dhiman, Mahmud Antara
- Improving Generalization In Task-oriented Dialogues With Workflows And Action Plans Raimondo Stefania, Pal Christopher, Liu Xiaotian, Vazquez David, Palacios Hector
- K-PERM: Personalized Response Generation Using Dynamic Knowledge Retrieval And Persona-adaptive Queries Raj Kanak, Roy Kaushik, Bonagiri Vamshi, Govil Priyanshul, Thirunarayanan Krishnaprasad, Gaur Manas
- Multimodal Multi-hop Question Answering Through A Conversation Between Tools And Efficiently Finetuned Large Language Models Rajabzadeh Hossein, Wang Suyuchen, Kwon Hyock Ju, Liu Bang
- Bridging The Gap: Exploring The Capabilities Of Bridge-architectures For Complex Visual Reasoning Tasks Rajesh Kousik, Raman Mrigank, Karim Mohammed Asad, Chawla Pranit
- Does The "most Sinfully Decadent Cake Ever" Taste Good? Answering Yes/no Questions From Figurative Contexts Rakshit Geetanjali, Flanigan Jeffrey
- Multi-task End-to-end Training Improves Conversational Recommendation Ram Naveen, Kuzmin Dima, Chio Ellie Ka In, Alzantot Moustafa Farid, Ontanon Santiago, Jash Ambarish, Li Judith Yue
- Mabvit -- Modified Attention Block Enhances Vision Transformers Ramesh Mahesh, Ramkumar Aswinkumar
- Tailoring Self-rationalizers With Multi-reward Distillation Ramnath Sahana, Joshi Brihi, Hallinan Skyler, Lu Ximing, Li Liunian Harold, Chan Aaron, Hessel Jack, Choi Yejin, Ren Xiang
- Retrieval-augmented Image Captioning Ramos Rita, Elliott Desmond, Martins Bruno
- Precog: Exploring The Relation Between Memorization And Performance In Pre-trained Language Models Ranaldi Leonardo, Ruzzetti Elena Sofia, Zanzotto Fabio Massimo
- Chatgpt-crawler: Find Out If Chatgpt Really Knows What It's Talking About Rangapur Aman, Wang Haoran
- Tricking Llms Into Disobedience: Formalizing, Analyzing, And Detecting Jailbreaks Rao Abhinav, Vashistha Sachin, Naik Atharva, Aditya Somak, Choudhury Monojit
- Can Chatgpt Assess Human Personalities? A General Evaluation Framework Rao Haocong, Leung Cyril, Miao Chunyan
- Evaluating Llms On Document-based QA: Exact Answer Selection And Numerical Extraction Using Cogtale Dataset Rasool Zafaryab, Kurniawan Stefanus, Balugo Sherwin, Barnett Scott, Vasa Rajesh, Chesser Courtney, Hampstead Benjamin M., Belleville Sylvie, Mouzakis Kon, Bahar-fuchs Alex
- Supporting Human-ai Collaboration In Auditing Llms With Llms Rastogi Charvi, Ribeiro Marco Tulio, King Nicholas, Nori Harsha, Amershi Saleema
- Weakly Supervised Detection Of Hallucinations In LLM Activations Rateike Miriam, Cintas Celia, Wamburu John, Akumu Tanya, Speakman Skyler
- Dissecting In-context Learning Of Translations In Gpts Raunak Vikas, Awadalla Hany Hassan, Menezes Arul
- Do Gpts Produce Less Literal Translations? Raunak Vikas, Menezes Arul, Post Matt, Awadalla Hany Hassan
- Leveraging GPT-4 For Automatic Translation Post-editing Raunak Vikas, Sharaf Amr, Wang Yiren, Awadallah Hany Hassan, Menezes Arul
- COLA: A Benchmark For Compositional Text-to-image Retrieval Ray Arijit, Radenovic Filip, Dubey Abhimanyu, Plummer Bryan A., Krishna Ranjay, Saenko Kate
- Dial Beinfo For Faithfulness: Improving Factuality Of Information-seeking Dialogue Via Behavioural Fine-tuning Razumovskaia Evgeniia, Vulić Ivan, Marković Pavle, Cichy Tomasz, Zheng Qian, Wen Tsung-hsien, Budzianowski Paweł
- Fine-tuning Language Models For Context-specific SQL Query Generation Rebei Amine
- Towards A Performance Analysis On Pre-trained Visual Question Answering Models For Autonomous Driving Rekanar Kaavya, Eising Ciarán, Sistu Ganesh, Hayes Martin
- Self-evaluation Improves Selective Generation In Large Language Models Ren Jie, Zhao Yao, Vu Tu, Liu Peter J., Lakshminarayanan Balaji
- Investigating The Factual Knowledge Boundary Of Large Language Models With Retrieval Augmentation Ren Ruiyang, Wang Yuhao, Qu Yingqi, Zhao Wayne Xin, Liu Jing, Tian Hao, Wu Hua, Wen Ji-rong, Wang Haifeng
- Context Compression For Auto-regressive Transformers With Sentinel Tokens Ren Siyu, Jia Qi, Zhu Kenny Q.
- Prompt, Generate, Then Cache: Cascade Of Foundation Models Makes Strong Few-shot Learners Renrui Zhang, Xiangfei Hu, Bohao Li, Siyuan Huang, Hanqiu Deng, Hongsheng Li, Yu Qiao, Peng Gao
- I'm Afraid I Can't Do That: Predicting Prompt Refusal In Black-box Generative Language Models Reuter Max, Schulze William
- Negated Complementary Commonsense Using Large Language Models Rezaei Navid, Reformat Marek Z.
- Sparq Attention: Bandwidth-efficient LLM Inference Ribar Luka, Chelombiev Ivan, Hudlass-galley Luke, Blake Charlie, Luschi Carlo, Orr Douglas
- STREET: A Multi-task Structured Reasoning And Explanation Benchmark Ribeiro Danilo, Wang Shen, Ma Xiaofei, Zhu Henry, Dong Rui, Kong Deguang, Burger Juliette, Ramos Anjelica, Wang William, Huang Zhiheng, Karypis George, Xiang Bing, Roth Dan
- Commonsense Reasoning For Conversational AI: A Survey Of The State Of The Art Richardson Christopher, Heck Larry
- SYNDICOM: Improving Conversational Commonsense With Error-injection And Natural Language Feedback Richardson Christopher, Sundar Anirudh, Heck Larry
- Data Contamination Through The Lens Of Time Roberts Manley, Thakur Himanshu, Herlihy Christine, White Colin, Dooley Samuel
- Smoothllm: Defending Large Language Models Against Jailbreaking Attacks Robey Alexander, Wong Eric, Hassani Hamed, Pappas George J.
- Bio-sieve: Exploring Instruction Tuning Large Language Models For Systematic Review Automation Robinson Ambrose, Thorne William, Wu Ben P., Pandor Abdullah, Essat Munira, Stevenson Mark, Song Xingyi
- Chatgpt MT: Competitive For High- (but Not Low-) Resource Languages Robinson Nathaniel R., Ogayo Perez, Mortensen David R., Neubig Graham
- Benchmarking Causal Study To Interpret Large Language Models For Source Code Rodriguez-cardenas Daniel, Palacio David N., Khati Dipin, Burke Henry, Poshyvanyk Denys
- Synergistic Integration Of Large Language Models And Cognitive Architectures For Robust AI: An Exploratory Analysis Romero Oscar J., Zimmerman John, Steinfeld Aaron, Tomasic Anthony
- Audiogpt: Understanding And Generating Speech, Music, Sound, And Talking Head Rongjie Huang, Mingze Li, Dongchao Yang, Jiatong Shi, Xuankai Chang, Zhenhui Ye, Yuning Wu, Zhiqing Hong, Jiawei Huang, Jinglin Liu, Yi Ren, Zhou Zhao, Shinji Watanabe
- Grading Conversational Responses Of Chatbots Rosario Grant, Noever David
- The Manipulation Problem: Conversational AI As A Threat To Epistemic Agency Rosenberg Louis
- Axiomatic Preference Modeling For Longform Question Answering Rosset Corby, Zheng Guoqing, Dibia Victor, Awadallah Ahmed, Bennett Paul
- Knowledge-infused Self Attention Transformers Roy Kaushik, Zi Yuxin, Narayanan Vignesh, Gaur Manas, Sheth Amit
- Unveiling The Potential Of Large Language Models In Generating Semantic And Cross-language Clones Roy Palash R., Alam Ajmain I., Al-omari Farouq, Roy Banani, Roy Chanchal K., Schneider Kevin A.
- Generating Phishing Attacks Using Chatgpt Roy Sayak Saha, Naragam Krishna Vamsi, Nilizadeh Shirin
- From Chatbots To Phishbots? -- Preventing Phishing Scams Created Using Chatgpt, Google Bard And Claude Roy Sayak Saha, Thota Poojitha, Naragam Krishna Vamsi, Nilizadeh Shirin
- Identifying The Risks Of LM Agents With An Lm-emulated Sandbox Ruan Yangjun, Dong Honghua, Wang Andrew, Pitis Silviu, Zhou Yongchao, Ba Jimmy, Dubois Yann, Maddison Chris J., Hashimoto Tatsunori
- Retrieval-pretrained Transformer: Long-range Language Modeling With Self-retrieval Rubin Ohad, Berant Jonathan
- Gpt4tools: Teaching Large Language Model To Use Tools Via Self-instruction Rui Yang, Lin Song, Yanwei Li, Sijie Zhao, Yixiao Ge, Xiu Li, Ying Shan
- Attention-likelihood Relationship In Transformers Ruscio Valeria, Maiorca Valentino, Silvestri Fabrizio
- Are Emergent Abilities Of Large Language Models A Mirage? Rylan Schaeffer, Brando Miranda, Sanmi Koyejo
- Plausibility Processing In Transformer Language Models: Focusing On The Role Of Attention Heads In GPT Ryu Soo Hyun
- Naturalness Of Attention: Revisiting Attention In Code Language Models Saad Mootez, Sharma Tushar
- Performance Of Chatgpt-3.5 And GPT-4 On The United States Medical Licensing Examination With And Without Distractions Safrai Myriam, Azaria Amos
- Uncertainty Guided Global Memory Improves Multi-hop Question Answering Sagirova Alsu, Burtsev Mikhail
- Branch-solve-merge Improves Large Language Model Evaluation And Generation Saha Swarnadeep, Levy Omer, Celikyilmaz Asli, Bansal Mohit, Weston Jason, Li Xian
- Promptmix: A Class Boundary Augmentation Method For Large Language Model Distillation Sahu Gaurav, Vechtomova Olga, Bahdanau Dzmitry, Laradji Issam H.
- REINFOREST: Reinforcing Semantic Code Similarity For Cross-lingual Code Search Models Saieva Anthony, Chakraborty Saikat, Kaiser Gail
- Verbosity Bias In Preference Labeling By Large Language Models Saito Keita, Wachi Akifumi, Wataoka Koki, Akimoto Youhei
- Memory Injections: Correcting Multi-hop Reasoning Failures During Inference In Transformer-based Language Models Sakarvadia Mansi, Ajith Aswathy, Khan Arham, Grzenda Daniel, Hudson Nathaniel, Bauer André, Chard Kyle, Foster Ian
- Attention Lens: A Tool For Mechanistically Interpreting The Attention Head Information Retrieval Mechanism Sakarvadia Mansi, Khan Arham, Ajith Aswathy, Grzenda Daniel, Hudson Nathaniel, Bauer André, Chard Kyle, Foster Ian
- Extending The Frontier Of Chatgpt: Code Generation And Debugging Sakib Fardin Ahsan, Khan Saadat Hasan, Karim A. H. M. Rezaul
- UT5: Pretraining Non Autoregressive T5 With Unrolled Denoising Salem Mahmoud G., Ye Jiayu, Lin Chu-cheng, Liu Frederick
- Pre-training Multi-modal Dense Retrievers For Outside-knowledge Visual Question Answering Salemi Alireza, Rafiee Mahta, Zamani Hamed
- Zero-shot Translation Of Attention Patterns In VQA Models To Natural Language Salewski Leonard, Koepke A. Sophia, Lensch Hendrik P. A., Akata Zeynep
- Can Llms Augment Low-resource Reading Comprehension Datasets? Opportunities And Challenges Samuel Vinay, Aynaou Houda, Chowdhury Arijit Ghosh, Ramanan Karthik Venkat, Chadha Aman
- What Matters In The Structured Pruning Of Generative Language Models? Santacroce Michael, Wen Zixin, Shen Yelong, Li Yuanzhi
- Camoscio: An Italian Instruction-tuned Llama Santilli Andrea, Rodolà Emanuele
- Early Weight Averaging Meets High Learning Rates For LLM Pre-training Sanyal Sunny, Neerkaje Atula, Kaddour Jean, Kumar Abhishek, Sanghavi Sujay
- Beyond Chinchilla-optimal: Accounting For Inference In Language Model Scaling Laws Sardana Nikhil, Portes Jacob, Doubov Sasha, Frankle Jonathan
- Will Code Remain A Relevant User Interface For End-user Programming With Generative AI Models? Sarkar Advait
- Inseq: An Interpretability Toolkit For Sequence Generation Models Sarti Gabriele, Feldhus Nils, Sickert Ludwig, Van Der Wal Oskar, Nissim Malvina, Bisazza Arianna
- RAMP: Retrieval And Attribute-marking Enhanced Prompting For Attribute-controlled Translation Sarti Gabriele, Htut Phu Mon, Niu Xing, Hsu Benjamin, Currey Anna, Dinu Georgiana, Nadejde Maria
- Interprompt: Interpretable Prompting For Interrelated Interpersonal Risk Factors In Reddit Posts Sathvik Msvpj, Sarkar Surjodeep, Saxena Chandni, Sohn Sunghwan, Garg Muskan
- GIFT: Generative Interpretable Fine-tuning Savadikar Chinmay, Song Xi, Wu Tianfu
- Diagnostic Reasoning Prompts Reveal The Potential For Large Language Model Interpretability In Medicine Savage Thomas, Nayak Ashwin, Gallo Robert, Rangan Ekanath, Chen Jonathan H
- Unlocking Practical Applications In Legal Domain: Evaluation Of GPT For Zero-shot Semantic Annotation Of Legal Texts Savelka Jaromir
- Large Language Models (GPT) Struggle To Answer Multiple-choice Questions About Code Savelka Jaromir, Agarwal Arav, Bogart Christopher, Sakr Majd
- Can GPT-4 Support Analysis Of Textual Data In Tasks Requiring Highly Specialized Domain Expertise? Savelka Jaromir, Ashley Kevin D., Gray Morgan A, Westermann Hannes, Xu Huihui
- ARB: Advanced Reasoning Benchmark For Large Language Models Sawada Tomohiro, Paleka Daniel, Havrilla Alexander, Tadepalli Pranav, Vidas Paula, Kranias Alexander, Nay John J., Gupta Kshitij, Komatsuzaki Aran
- Pretraining On The Test Set Is All You Need Schaeffer Rylan
- Large Language Models Can Strategically Deceive Their Users When Put Under Pressure Scheurer Jérémy, Balesni Mikita, Hobbhahn Marius
- Probing Conceptual Understanding Of Large Visual-language Models Schiappa Madeline, Abdullah Raiyaan, Azad Shehreen, Claypoole Jared, Cogswell Michael, Divakaran Ajay, Rawat Yogesh
- Towards Llm-based Autograding For Short Textual Answers Schneider Johannes, Schenk Bernd, Niklaus Christina
- Chatbot-supported Thesis Writing: An Autoethnographic Report Schwenke Nicolas, Söbke Heinrich, Kraft Eckhard
- An Empirical Evaluation Of Using Large Language Models For Automated Unit Test Generation Schäfer Max, Nadi Sarah, Eghbali Aryaz, Tip Frank
- Surgicalgpt: End-to-end Language-vision GPT For Visual Question Answering In Surgery Seenivasan Lalithkumar, Islam Mobarakol, Kannan Gokul, Ren Hongliang
- Wikichat: Stopping The Hallucination Of Large Language Model Chatbots By Few-shot Grounding On Wikipedia Semnani Sina J., Yao Violet Z., Zhang Heidi C., Lam Monica S.
- Jais And Jais-chat: Arabic-centric Foundation And Instruction-tuned Open Generative Large Language Models Sengupta Neha, Sahu Sunil Kumar, Jia Bokang, Katipomu Satheesh, Li Haonan, Koto Fajri, Marshall William, Gosal Gurpreet, Liu Cynthia, Chen Zhiming, Afzal Osama Mohammed, Kamboj Samta, Pandit Onkar, Pal Rahul, Pradhan Lalit, Mujahid Zain Muhammad, Baali Massa, Han Xudong, Bsharat Sondos Mahmoud, Aji Alham Fikri, Shen Zhiqiang, Liu Zhengzhong, Vassilieva Natalia, Hestness Joel, Hock Andy, Feldman Andrew, Lee Jonathan, Jackson Andrew, Ren Hector Xuguang, Nakov Preslav, Baldwin Timothy, Xing Eric
- Language Models: A Guide For The Perplexed Serrano Sofia, Brumbaugh Zander, Smith Noah A.
- The Cot Collection: Improving Zero-shot And Few-shot Learning Of Language Models Via Chain-of-thought Fine-tuning Seungone Kim, Se June Joo, Doyoung Kim, Joel Jang, Seonghyeon Ye, Jamin Shin, Minjoon Seo
- Zero Is Not Hero Yet: Benchmarking Zero-shot Performance Of Llms For Financial Tasks Shah Agam, Chava Sudheer
- Human Behavioral Benchmarking: Numeric Magnitude Comparison Effects In Large Language Models Shah Raj Sanjay, Marupudi Vijay, Koenen Reba, Bhardwaj Khushi, Varma Sashank
- Scalable And Transferable Black-box Jailbreaks For Language Models Via Persona Modulation Shah Rusheb, Feuillade--montixi Quentin, Pour Soroush, Tagade Arush, Casper Stephen, Rando Javier
- Let's Have A Chat! A Conversation With Chatgpt: Technology, Applications, And Limitations Shahriar Sakib, Hayawi Kadhim
- Cognitive Effects In Large Language Models Shaki Jonathan, Kraus Sarit, Wooldridge Michael
- Hero: Roberta And Longformer Hebrew Language Models Shalumov Vitaly, Haskey Harel
- PB-LLM: Partially Binarized Large Language Models Shang Yuzhang, Yuan Zhihang, Wu Qiang, Dong Zhen
- One-shot Sensitivity-aware Mixed Sparsity Pruning For Large Language Models Shao Hang, Liu Bei, Xiao Bo, Zeng Ke, Wan Guanglu, Qian Yanmin
- Tinylvlm-ehub: Towards Comprehensive And Efficient Evaluation For Large Vision-language Models Shao Wenqi, Lei Meng, Hu Yutao, Gao Peng, Zhang Kaipeng, Meng Fanqing, Xu Peng, Huang Siyuan, Li Hongsheng, Qiao Yu, Luo Ping
- Character-llm: A Trainable Agent For Role-playing Shao Yunfan, Li Linyang, Dai Junqi, Qiu Xipeng
- Enhancing Retrieval-augmented Large Language Models With Iterative Retrieval-generation Synergy Shao Zhihong, Gong Yeyun, Shen Yelong, Huang Minlie, Duan Nan, Chen Weizhu
- Performance Of Chatgpt On USMLE: Unlocking The Potential Of Large Language Models For Ai-assisted Medical Education Sharma Prabin, Thapa Kisan, Thapa Dikshya, Dhakal Prastab, Upadhaya Mala Deep, Adhikari Santosh, Khanal Salik Ram
- The Truth Is In There: Improving Reasoning In Language Models With Layer-selective Rank Reduction Sharma Pratyusha, Ash Jordan T., Misra Dipendra
- Democratizing Llms: An Exploration Of Cost-performance Trade-offs In Self-refined Open-source Models Shashidhar Sumuk, Chinta Abhinav, Sahai Vaibhav, Wang Zhenhailong, Ji Heng
- Exploring The Factual Consistency In Dialogue Comprehension Of Large Language Models She Shuaijie, Huang Shujian, Wang Xingyun, Zhou Yanke, Chen Jiajun
- Efficient LLM Inference On Cpus Shen Haihao, Chang Hanwen, Dong Bo, Luo Yu, Meng Hengyu
- Is Translation Helpful? An Empirical Analysis Of Cross-lingual Transfer In Low-resource Dialog Generation Shen Lei, Yu Shuai, Shen Xiaoyu
- Do Pretrained Transformers Learn In-context By Gradient Descent? Shen Lingfeng, Mishra Aayush, Khashabi Daniel
- Improving The Robustness Of Transformer-based Large Language Models With Dynamic Attention Shen Lujia, Pu Yuwen, Ji Shouling, Li Changjiang, Zhang Xuhong, Ge Chunpeng, Wang Ting
- Mixture-of-experts Meets Instruction Tuning: A Winning Combination For Large Language Models Shen Sheng, Hou Le, Zhou Yanqi, Du Nan, Longpre Shayne, Wei Jason, Chung Hyung Won, Zoph Barret, Fedus William, Chen Xinyun, Vu Tu, Wu Yuexin, Chen Wuyang, Webson Albert, Li Yunxuan, Zhao Vincent, Yu Hongkun, Keutzer Kurt, Darrell Trevor, Zhou Denny
- Large Language Model Alignment: A Survey Shen Tianhao, Jin Renren, Huang Yufei, Liu Chuang, Dong Weilong, Guo Zishan, Wu Xinwei, Liu Yan, Xiong Deyi
- Retrieval-generation Alignment For End-to-end Task-oriented Dialogue System Shen Weizhou, Gao Yingqi, Huang Canbin, Wan Fanqi, Quan Xiaojun, Bi Wei
- Storygpt-v: Large Language Models As Consistent Story Visualizers Shen Xiaoqian, Elhoseiny Mohamed
- "Do Anything Now": Characterizing And Evaluating In-the-wild Jailbreak Prompts On Large Language Models Shen Xinyue, Chen Zeyuan, Backes Michael, Shen Yun, Zhang Yang
- In Chatgpt We Trust? Measuring And Characterizing The Reliability Of Chatgpt Shen Xinyue, Chen Zeyuan, Backes Michael, Zhang Yang
- Why Does Chatgpt Fall Short In Providing Truthful Answers? Shen Zheng, Jie Huang, Kevin Chen-chuan Chang
- Towards More Unified In-context Visual Understanding Sheng Dianmo, Chen Dongdong, Tan Zhentao, Liu Qiankun, Chu Qi, Bao Jianmin, Gong Tao, Liu Bin, Xu Shengwei, Yu Nenghai
- Flexgen: High-throughput Generative Inference Of Large Language Models With A Single GPU Sheng Ying, Zheng Lianmin, Yuan Binhang, Li Zhuohan, Ryabinin Max, Fu Daniel Y., Xie Zhiqiang, Chen Beidi, Barrett Clark, Gonzalez Joseph E., Liang Percy, Ré Christopher, Stoica Ion, Zhang Ce
- TOAST: Transfer Learning Via Attention Steering Shi Baifeng, Gai Siyu, Darrell Trevor, Wang Xin
- Crossget: Cross-guided Ensemble Of Tokens For Accelerating Vision-language Transformers Shi Dachuan, Tao Chaofan, Rao Anyi, Yang Zhendong, Yuan Chun, Wang Jiaqi
- Sotana: The Open-source Software Development Assistant Shi Ensheng, Zhang Fengji, Wang Yanlin, Chen Bei, Du Lun, Zhang Hongyu, Han Shi, Zhang Dongmei, Sun Hongbin
- QADYNAMICS: Training Dynamics-driven Synthetic QA Diagnostic For Zero-shot Commonsense Question Answering Shi Haochen, Wang Weiqi, Fang Tianqing, Xu Baixuan, Ding Wenxuan, Liu Xin, Song Yangqiu
- Badgpt: Exploring Security Vulnerabilities Of Chatgpt Via Backdoor Attacks To Instructgpt Shi Jiawen, Liu Yixin, Zhou Pan, Sun Lichao
- Unleashing The Power Of Pre-trained Language Models For Offline Reinforcement Learning Shi Ruizhe, Liu Yuyao, Ze Yanjie, Du Simon S., Xu Huazhe
- Preliminary Study On Incremental Learning For Large Language Model-based Recommender Systems Shi Tianhao, Zhang Yang, Xu Zhijian, Chen Chong, Feng Fuli, He Xiangnan, Tian Qi
- Trusting Your Evidence: Hallucinate Less With Context-aware Decoding Shi Weijia, Han Xiaochuang, Lewis Mike, Tsvetkov Yulia, Zettlemoyer Luke, Yih Scott Wen-tau
- Dept: Decomposed Prompt Tuning For Parameter-efficient Fine-tuning Shi Zhengxiang, Lipani Aldo
- Reasoning With Language Model Is Planning With World Model Shibo Hao, Yi Gu, Haodi Ma, Joshua Jiahua Hong, Zhen Wang, Daisy Zhe Wang, Zhiting Hu
- Large Language Models Can Enhance Persuasion Through Linguistic Feature Alignment Shin Minkyu, Kim Jin
- Exploring The Robustness Of Large Language Models For Solving Programming Problems Shirafuji Atsushi, Watanobe Yutaka, Ito Takumi, Morishita Makoto, Nakamura Yuki, Oda Yusuke, Suzuki Jun
- SCREWS: A Modular Framework For Reasoning With Revisions Shridhar Kumar, Jhamtani Harsh, Fang Hao, Van Durme Benjamin, Eisner Jason, Xia Patrick
- Llamas Know What Gpts Don't Show: Surrogate Models For Confidence Estimation Shrivastava Vaishnavi, Liang Percy, Kumar Ananya
- What Does CLIP Know About A Red Circle? Visual Prompt Engineering For Vlms Shtedritski Aleksandar, Rupprecht Christian, Vedaldi Andrea
- Audio-visual LLM For Video Understanding Shu Fangxun, Zhang Lei, Jiang Hao, Xie Cihang
- A Comprehensive Evaluation Of Large Language Models On Legal Judgment Prediction Shui Ruihao, Cao Yixin, Wang Xiang, Chua Tat-seng
- A Survey On Multimodal Large Language Models Shukang Yin, Chaoyou Fu, Sirui Zhao, Ke Li, Xing Sun, Tong Xu, Enhong Chen
- Woodpecker: Hallucination Correction For Multimodal Large Language Models Shukang Yin, Chaoyou Fu, Sirui Zhao, Tong Xu, Hao Wang, Dianbo Sui, Yunhang Shen, Ke Li, Xing Sun, Enhong Chen
- Ep-alm: Efficient Perceptual Augmentation Of Language Models Shukor Mustafa, Dancette Corentin, Cord Matthieu
- The Curse Of Recursion: Training On Generated Data Makes Models Forget Shumailov Ilia, Shumaylov Zakhar, Zhao Yiren, Gal Yarin, Papernot Nicolas, Anderson Ross
- Tree Of Thoughts: Deliberate Problem Solving With Large Language Models Shunyu Yao, Dian Yu, Jeffrey Zhao, Izhak Shafran, Thomas L. Griffiths, Yuan Cao, Karthik Narasimhan
- Measuring Inductive Biases Of In-context Learning With Underspecified Demonstrations Si Chenglei, Friedman Dan, Joshi Nitish, Feng Shi, Chen Danqi, He He
- An Empirical Study Of Instruction-tuning Large Language Models In Chinese Si Qingyi, Wang Tong, Lin Zheng, Zhang Xu, Cao Yanan, Wang Weiping
- Exploring The Capabilities Of Chatgpt In Ancient Chinese Translation And Person Name Recognition Si Shijing, Zhou Siqing, Tang Le, Cheng Xiaoqing, Zhang Yugui
- Mondrian: Prompt Abstraction Attack Against Large Language Models For Cheaper API Pricing Si Wai Man, Backes Michael, Zhang Yang
- Mind Meets Machine: Unravelling Gpt-4's Cognitive Psychology Sifatkaur Dhingra, Manmeet Singh, Vaisakh Sb, Neetiraj Malviya, Sukhpal Singh Gill
- GPT-4 As An Agronomist Assistant? Answering Agriculture Exams Using Large Language Models Silva Bruno, Nunes Leonardo, Estevão Roberto, Aski Vijay, Chandra Ranveer
- Generalized Planning In PDDL Domains With Pretrained Large Language Models Silver Tom, Dan Soham, Srinivas Kavitha, Tenenbaum Joshua B., Kaelbling Leslie Pack, Katz Michael
- Large Language Models For Aspect-based Sentiment Analysis Simmering Paul F., Huoviala Paavo
- The Confidence-competence Gap In Large Language Models: A Cognitive Study Singh Aniket Kumar, Devkota Suman, Lamichhane Bishal, Dhakal Uttam, Dhakal Chandra
- Explaining Black Box Text Modules In Natural Language With Language Models Singh Chandan, Hsu Aliyah R., Antonello Richard, Jain Shailee, Huth Alexander G., Yu Bin, Gao Jianfeng
- Assessing GPT4-V On Structured Reasoning Tasks Singh Mukul, Cambronero José, Gulwani Sumit, Le Vu, Verbruggen Gust
- Unlocking Model Insights: A Dataset For Automated Model Card Generation Singh Shruti, Lodwal Hitesh, Malwat Husain, Thakur Rakesh, Singh Mayank
- Exploiting Large Language Models (llms) Through Deception Techniques And Persuasion Principles Singh Sonali, Abri Faranak, Namin Akbar Siami
- Applenet: Visual Attention Parameterized Prompt Learning For Few-shot Remote Sensing Image Generalization Using CLIP Singha Mainak, Jha Ankit, Solanki Bhupendra, Bose Shirsha, Banerjee Biplab
- EEL: Efficiently Encoding Lattices For Reranking Singhal Prasann, Xu Jiacheng, Ye Xi, Durrett Greg
- Evaluating Chatgpt And GPT-4 For Visual Programming Singla Adish
- A Mathematical Abstraction For Balancing The Trade-off Between Creativity And Reality In Large Language Models Sinha Ritwik, Song Zhao, Zhou Tianyi
- TABLET: Learning From Instructions For Tabular Data Slack Dylan, Singh Sameer
- Don't Add, Don't Miss: Effective Content Preserving Generation From Pre-selected Text Spans Slobodkin Aviv, Caciularu Avi, Hirsch Eran, Dagan Ido
- On Early Detection Of Hallucinations In Factual Question Answering Snyder Ben, Moisescu Marius, Zafar Muhammad Bilal
- Step: Stacked LLM Policies For Web Actions Sodhi Paloma, Branavan S. R. K., Artzi Yoav, Mcdonald Ryan
- Decoding Chatgpt: A Taxonomy Of Existing Research, Current Challenges, And Possible Future Directions Sohail Shahab Saquib, Farhat Faiza, Himeur Yassine, Nadeem Mohammad, Madsen Dag Øivind, Singh Yashbir, Atalla Shadi, Mansoor Wathiq
- Biomedical Knowledge Graph-optimized Prompt Generation For Large Language Models Soman Karthik, Rose Peter W, Morris John H, Akbas Rabia E, Smith Brett, Peetoom Braian, Villouta-reyes Catalina, Cerono Gabriel, Shi Yongmei, Rizk-jackson Angela, Israni Sharat, Nelson Charlotte A, Huang Sui, Baranzini Sergio E
- HAE-RAE Bench: Evaluation Of Korean Knowledge In Language Models Son Guijin, Lee Hanwool, Kim Suwan, Kim Huiseo, Lee Jaecheol, Yeom Je Won, Jung Jihyu, Kim Jung Woo, Kim Songseong
- Dynamics Of Instruction Tuning: Each Ability Of Large Language Models Has Its Own Growth Pace Song Chiyu, Zhou Zhanchao, Yan Jianhao, Fei Yuejiao, Lan Zhenzhong, Zhang Yue
- Preference Ranking Optimization For Human Alignment Song Feifan, Yu Bowen, Li Minghao, Yu Haiyang, Huang Fei, Li Yongbin, Wang Houfeng
- Zebra: Extending Context Window With Layerwise Grouped Local-global Attention Song Kaiqiang, Wang Xiaoyang, Cho Sangwoo, Pan Xiaoman, Yu Dong
- Nlpbench: Evaluating Large Language Models On Solving NLP Problems Song Linxin, Zhang Jieyu, Cheng Lechao, Zhou Pengyuan, Zhou Tianyi, Li Irene
- How To Bridge The Gap Between Modalities: A Comprehensive Survey On Multimodal Large Language Model Song Shezheng, Li Xiaopeng, Li Shasha, Zhao Shan, Yu Jie, Ma Jun, Mao Xiaoguang, Zhang Weimin
- Large Language Models Meet Open-world Intent Discovery And Recognition: An Evaluation Of Chatgpt Song Xiaoshuai, He Keqing, Wang Pei, Dong Guanting, Mou Yutao, Wang Jingang, Xian Yunsen, Cai Xunliang, Xu Weiran
- Honeybee: Progressive Instruction Finetuning Of Large Language Models For Materials Science Song Yu, Miret Santiago, Zhang Huan, Liu Bang
- Reward Collapse In Aligning Large Language Models Song Ziang, Cai Tianle, Lee Jason D., Su Weijie J.
- Deduction Under Perturbed Evidence: Probing Student Simulation Capabilities Of Large Language Models Sonkar Shashank, Baraniuk Richard G.
- Code Soliloquies For Accurate Calculations In Large Language Models Sonkar Shashank, Le Myco, Chen Xinghe, Liu Naiming, Mallick Debshila Basu, Baraniuk Richard G.
- Recursion Of Thought: A Divide-and-conquer Approach To Multi-context Reasoning With Language Models Soochan Lee, Gunhee Kim
- Joint Prompt Optimization Of Stacked Llms Using Variational Inference Sordoni Alessandro, Yuan Xingdi, Côté Marc-alexandre, Pereira Matheus, Trischler Adam, Xiao Ziang, Hosseini Arian, Niedtner Friederike, Roux Nicolas Le
- Learning Ui-to-code Reverse Generator Using Visual Critic Without Rendering Soselia Davit, Saifullah Khalid, Zhou Tianyi
- Dynamic Context Pruning For Efficient And Interpretable Autoregressive Transformers Sotiris Anagnostidis, Dario Pavllo, Luca Biggio, Lorenzo Noci, Aurelien Lucchi, Thomas Hofmann
- Evaluation Metrics In The Era Of GPT-4: Reliably Evaluating Large Language Models On Sequence To Sequence Tasks Sottana Andrea, Liang Bin, Zou Kai, Yuan Zheng
- Curriculum-guided Abstractive Summarization Sotudeh Sajad, Deilamsalehy Hanieh, Dernoncourt Franck, Goharian Nazli
- Accelerating LLM Inference With Staged Speculative Decoding Spector Benjamin, Re Chris
- Assessing The Nature Of Large Language Models: A Caution Against Anthropocentrism Speed Ann
- Commands As AI Conversations Spinellis Diomidis
- Musr: Testing The Limits Of Chain-of-thought With Multistep Soft Reasoning Sprague Zayne, Ye Xi, Bostrom Kaj, Chaudhuri Swarat, Durrett Greg
- Harnessing Llms In Curricular Design: Using GPT-4 To Support Authoring Of Learning Objectives Sridhar Pragnya, Doyle Aidan, Agarwal Arav, Bogart Christopher, Savelka Jaromir, Sakr Majd
- Chatgpt: A Study On Its Utility For Ubiquitous Software Engineering Tasks Sridhara Giriprasad, G. Ranjani H., Mazumdar Sourav
- Training Large Language Models Efficiently With Sparsity And Dataflow Srinivasan Venkat, Gandhi Darshan, Thakker Urmish, Prabhakar Raghu
- Response-act Guided Reinforced Dialogue Generation For Mental Health Counseling Srivastava Aseem, Pandey Ishan, Akhtar Md. Shad, Chakraborty Tanmoy
- Evaluating Chatbots To Promote Users' Trust -- Practices And Open Problems Srivastava Biplav, Lakkaraju Kausik, Koppel Tarmo, Narayanan Vignesh, Kundu Ashish, Joshi Sachindra
- Core-cog: Conversational Recommendation Of Entities Using Constrained Generation Srivastava Harshvardhan, Pruthi Kanav, Chakrabarti Soumen, Mausam
- Instances Need More Care: Rewriting Prompts For Instances With Llms In The Loop Yields Better Zero-shot Performance Srivastava Saurabh, Huang Chengyue, Fan Weiguo, Yao Ziyu
- Structured Packing In LLM Training Improves Long Context Utilization Staniszewski Konrad, Tworkowski Szymon, Jaszczur Sebastian, Zhao Yu, Michalewski Henryk, Kuciński Łukasz, Miłoś Piotr
- The Languini Kitchen: Enabling Language Modelling Research At Different Scales Of Compute Stanić Aleksandar, Ashley Dylan, Serikov Oleg, Kirsch Louis, Faccio Francesco, Schmidhuber Jürgen, Hofmann Thomas, Schlag Imanol
- Seeing Seeds Beyond Weeds: Green Teaming Generative AI For Beneficial Uses Stapleton Logan, Taylor Jordan, Fox Sarah, Wu Tongshuang, Zhu Haiyi
- GPT-4 Doesn't Know It's Wrong: An Analysis Of Iterative Prompting For Reasoning Problems Stechly Kaya, Marquez Matthew, Kambhampati Subbarao
- Emergent And Predictable Memorization In Large Language Models Stella Biderman, Usvsn Sai Prashanth, Lintang Sutawika, Hailey Schoelkopf, Quentin Anthony, Shivanshu Purohit, Edward Raff
- A Mechanistic Interpretation Of Arithmetic Reasoning In Language Models Using Causal Mediation Analysis Stolfo Alessandro, Belinkov Yonatan, Sachan Mrinmaya
- Distilled GPT For Source Code Summarization Su Chia-yi, Mcmillan Collin
- Learning From Red Teaming: Gender Bias Provocation And Mitigation In Large Language Models Su Hsuan, Cheng Cheng-chu, Farn Hua, Kumar Shachi H, Sahay Saurav, Chen Shang-tse, Lee Hung-yi
- The Synergy Of Speculative Decoding And Batching In Serving Large Language Models Su Qidong, Giannoula Christina, Pekhimenko Gennady
- Choice Fusion As Knowledge For Zero-shot Dialogue State Tracking Su Ruolin, Yang Jingfeng, Wu Ting-wei, Juang Biing-hwang
- F3-pruning: A Training-free And Generalized Pruning Strategy Towards Faster And Finer Text-to-video Synthesis Su Sitong, Liu Jianzhi, Gao Lianli, Song Jingkuan
- Dial-mae: Contextual Masked Auto-encoder For Retrieval-based Dialogue Systems Su Zhenpeng, Wu Xing, Zhou Wei, Ma Guangyuan, Hu Songlin
- HC3 Plus: A Semantic-invariant Human Chatgpt Comparison Corpus Su Zhenpeng, Wu Xing, Zhou Wei, Ma Guangyuan, Hu Songlin
- Can Large Language Models Change User Preference Adversarially? Subhash Varshini
- Language Model-in-the-loop: Data Optimal Approach To Learn-to-recommend Actions In Text Games Sudhakar Arjun Vaithilingam, Parthasarathi Prasanna, Rajendran Janarthanan, Chandar Sarath
- RCMHA: Relative Convolutional Multi-head Attention For Natural Language Modelling Sugiharto Herman, Aradea, Mubarok Husni
- Jmedlora: Medical Domain Adaptation On Japanese Large Language Models Using Instruction-tuning Sukeda Issey, Suzuki Masahiro, Sakaji Hiroki, Kodera Satoshi
- Cognitive Architectures For Language Agents Sumers Theodore R., Yao Shunyu, Narasimhan Karthik, Griffiths Thomas L.
- CONSCENDI: A Contrastive And Scenario-guided Distillation Approach To Guardrail Models For Virtual Assistants Sun Albert Yu, Nair Varun, Schumacher Elliot, Kannan Anitha
- Does Fine-tuning GPT-3 With The Openai API Leak Personally-identifiable Information? Sun Albert Yu, Zemour Eliott, Saxena Arushi, Vaidyanathan Udith, Lin Eric, Lau Christian, Mugunthan Vaikkunth
- Building High-accuracy Multilingual ASR With Gated Language Experts And Curriculum Training Sun Eric, Li Jinyu, Hu Yuxuan, Zhu Yimeng, Zhou Long, Xue Jian, Wang Peidong, Liu Linquan, Liu Shujie, Lin Edward, Gong Yifan
- Create Your World: Lifelong Text-to-image Diffusion Sun Gan, Liang Wenqi, Dong Jiahua, Li Jun, Ding Zhengming, Cong Yang
- Speech-based Slot Filling Using Large Language Models Sun Guangzhi, Feng Shutong, Jiang Dongcheng, Zhang Chao, Gašić Milica, Woodland Philip C.
- Fine-grained Audio-visual Joint Representations For Multimodal Large Language Models Sun Guangzhi, Yu Wenyi, Tang Changli, Chen Xianzhao, Tan Tian, Li Wei, Lu Lu, Ma Zejun, Zhang Chao
- Reinforcement Learning In The Era Of Llms: What Is Essential? What Is Needed? An RL Perspective On RLHF, Prompting, And Beyond Sun Hao
- Character-level Chinese Backpack Language Models Sun Hao, Hewitt John
- Think-on-graph: Deep And Responsible Reasoning Of Large Language Model On Knowledge Graph Sun Jiashuo, Xu Chengjin, Tang Lumingyuan, Wang Saizhuo, Lin Chen, Gong Yeyun, Ni Lionel M., Shum Heung-yeung, Guo Jian
- Scieval: A Multi-level Large Language Model Evaluation Benchmark For Scientific Research Sun Liangtai, Han Yang, Zhao Zihan, Ma Da, Shen Zhennan, Chen Baocai, Chen Lu, Yu Kai
- EVA-CLIP: Improved Training Techniques For CLIP At Scale Sun Quan, Fang Yuxin, Wu Ledell, Wang Xinlong, Cao Yue
- Battle Of The Large Language Models: Dolly Vs Llama Vs Vicuna Vs Guanaco Vs Bard Vs Chatgpt -- A Text-to-sql Parsing Comparison Sun Shuo, Zhang Yuchen, Yan Jiahuan, Gao Yuze, Ong Donovan, Chen Bin, Su Jian
- PEARL: Prompting Large Language Models To Plan And Execute Actions Over Long Documents Sun Simeng, Liu Yang, Wang Shuohang, Zhu Chenguang, Iyyer Mohit
- Instruction Distillation Makes Large Language Models Efficient Zero-shot Rankers Sun Weiwei, Chen Zheng, Ma Xinyu, Yan Lingyong, Wang Shuaiqiang, Ren Pengjie, Chen Zhumin, Yin Dawei, Ren Zhaochun
- Retentive Network: A Successor To Transformer For Large Language Models Sun Yutao, Dong Li, Huang Shaohan, Ma Shuming, Xia Yuqing, Xue Jilong, Wang Jianyong, Wei Furu
- Multimodal Question Answering For Unified Information Extraction Sun Yuxuan, Zhang Kai, Su Yu
- Ctbls: Augmenting Large Language Models With Conversational Tables Sundar Anirudh S, Heck Larry
- Uncovering Chatgpt's Capabilities In Recommender Systems Sunhao Dai, Ninglu Shao, Haiyuan Zhao, Weijie Yu, Zihua Si, Chen Xu, Zhongxiang Sun, Xiao Zhang, Jun Xu
- Do Large Language Models Show Decision Heuristics Similar To Humans? A Case Study Using GPT-3.5 Suri Gaurav, Slater Lily R., Ziaee Ali, Nguyen Morgan
- Location-aware Visual Question Generation With Lightweight Models Suwono Nicholas Collin, Chen Justin Chih-yao, Hung Tun Min, Huang Ting-hao Kenneth, Liao I-bin, Li Yung-hui, Ku Lun-wei, Sun Shao-hua
- Approximating Online Human Evaluation Of Social Chatbots With Prompting Svikhnushina Ekaterina, Pu Pearl
- Assessing The Ability Of Chatgpt To Screen Articles For Systematic Reviews Syriani Eugene, David Istvan, Kumar Gauransh
- Improving Non-autoregressive Translation Quality With Pretrained Language Model, Embedding Distillation And Upsampling Strategy For CTC Syu Shen-sian, Xie Juncheng, Lee Hung-yi
- Sparks Of Artificial General Intelligence: Early Experiments With GPT-4 Sébastien Bubeck, Varun Chandrasekaran, Ronen Eldan, Johannes Gehrke, Eric Horvitz, Ece Kamar, Peter Lee, Yin Tat Lee, Yuanzhi Li, Scott Lundberg, Harsha Nori, Hamid Palangi, Marco Tulio Ribeiro, Yi Zhang
- The BEA 2023 Shared Task On Generating AI Teacher Responses In Educational Dialogues Tack Anaïs, Kochmar Ekaterina, Yuan Zheng, Bibauw Serge, Piech Chris
- Exploring Chain-of-thought Style Prompting For Text-to-sql Tai Chang-you, Chen Ziru, Zhang Tianshu, Deng Xiang, Sun Huan
- Training Generative Question-answering On Synthetic Data Obtained From An Instruct-tuned Model Takahashi Kosuke, Omi Takahiro, Arima Kosuke, Ishigaki Tatsuya
- Redcoast: A Lightweight Tool To Automate Distributed Training Of Llms On Any Gpu/tpus Tan Bowen, Zhu Yun, Liu Lijuan, Wang Hongyi, Zhuang Yonghao, Chen Jindong, Xing Eric, Hu Zhiting
- Is Chatgpt A Good Multi-party Conversation Solver? Tan Chao-hong, Gu Jia-chen, Ling Zhen-hua
- Large Language Model-driven Classroom Flipping: Empowering Student-centric Peer Questioning With Flipped Interaction Tan Chee Wei
- Copilot For Xcode: Exploring Ai-assisted Programming By Prompting Cloud-based Large Language Models Tan Chee Wei, Guo Shangxin, Wong Man Fai, Hang Ching Nam
- Massive Editing For Large Language Models Via Meta Learning Tan Chenmien, Zhang Ge, Fu Jie
- Can Chatgpt Replace Traditional KBQA Models? An In-depth Analysis Of The Question Answering Performance Of The GPT LLM Family Tan Yiming, Min Dehai, Li Yu, Li Wenbo, Hu Nan, Chen Yongrui, Qi Guilin
- Sparsity-guided Holistic Explanation For Llms With Interpretable Inference-time Intervention Tan Zhen, Chen Tianlong, Zhang Zhenyu, Liu Huan
- Interpreting Pretrained Language Models Via Concept Bottlenecks Tan Zhen, Cheng Lu, Wang Song, Bo Yuan, Li Jundong, Liu Huan
- Multiple-question Multiple-answer Text-vqa Tang Peng, Appalaraju Srikar, Manmatha R., Xie Yusheng, Mahadevan Vijay
- Toolalpaca: Generalized Tool Learning For Language Models With 3000 Simulated Cases Tang Qiaoyu, Deng Ziliang, Lin Hongyu, Han Xianpei, Liang Qiao, Cao Boxi, Sun Le
- Learning To Imagine: Visually-augmented Natural Language Generation Tang Tianyi, Chen Yushuo, Du Yifan, Li Junyi, Zhao Wayne Xin, Wen Ji-rong
- Ml-bench: Evaluating Large Language Models And Agents For Machine Learning Tasks On Repository-level Code Tang Xiangru, Liu Yuliang, Cai Zefan, Shao Yanjun, Lu Junjie, Zhang Yichi, Deng Zexuan, Hu Helan, An Kaikai, Huang Ruijun, Si Shuzheng, Chen Sheng, Zhao Haozhe, Chen Liang, Wang Yan, Liu Tianyu, Jiang Zhiwei, Chang Baobao, Fang Yin, Qin Yujia, Zhou Wangchunshu, Zhao Yilun, Cohan Arman, Gerstein Mark
- Struc-bench: Are Large Language Models Really Good At Generating Complex Structured Data? Tang Xiangru, Zong Yiming, Phang Jason, Zhao Yilun, Zhou Wangchunshu, Cohan Arman, Gerstein Mark
- Using Large Language Models For Cybersecurity Capture-the-flag Challenges And Certification Questions Tann Wesley, Liu Yuancheng, Sim Jun Heng, Seah Choon Meng, Chang Ee-chien
- POUF: Prompt-oriented Unsupervised Fine-tuning For Large Pre-trained Models Tanwisuth Korawat, Zhang Shujian, Zheng Huangjie, He Pengcheng, Zhou Mingyuan
- Is Chatgpt A Highly Fluent Grammatical Error Correction System? A Comprehensive Evaluation Tao Fang, Shu Yang, Kaixin Lan, Derek F. Wong, Jinpeng Hu, Lidia S. Chao, Yue Zhang
- Can BERT Refrain From Forgetting On Sequential Tasks? A Probing Study Tao Mingxu, Feng Yansong, Zhao Dongyan
- Adversarial Conversational Shaping For Intelligent Agents Tarasiewicz Piotr, Kenjeyev Sultan, Sebag Ilana, Alshehabi Shehab
- Caption Anything: Interactive Image Description With Diverse Multimodal Controls Teng Wang, Jinrui Zhang, Junjie Fei, Hao Zheng, Yunlong Tang, Zhe Li, Mingqi Gao, Shanshan Zhao
- Nomiracl: Knowing When You Don't Know For Robust Multilingual Retrieval-augmented Generation Thakur Nandan, Bonifacio Luiz, Zhang Xinyu, Ogundepo Odunayo, Kamalloo Ehsan, Alfonso-hermelo David, Li Xiaoguang, Liu Qun, Chen Boxing, Rezagholizadeh Mehdi, Lin Jimmy
- Verigen: A Large Language Model For Verilog Code Generation Thakur Shailja, Ahmad Baleegh, Pearce Hammond, Tan Benjamin, Dolan-gavitt Brendan, Karri Ramesh, Garg Siddharth
- SPDF: Sparse Pre-training And Dense Fine-tuning For Large Language Models Thangarasa Vithursan, Gupta Abhay, Marshall William, Li Tianda, Leong Kevin, Decoste Dennis, Lie Sean, Saxena Shreyas
- Learning To Prompt In The Classroom To Understand AI Limits: A Pilot Study Theophilou Emily, Koyuturk Cansu, Yavari Mona, Bursic Sathya, Donabauer Gregor, Telari Alessia, Testa Alessia, Boiano Raffaele, Hernandez-leo Davinia, Ruskov Martin, Taibi Davide, Gabbiadini Alessandro, Ognibene Dimitri
- Is Chatgpt The Ultimate Programming Assistant -- How Far Is It? Tian Haoye, Lu Weiqi, Li Tsz On, Tang Xunzhu, Cheung Shing-chi, Klein Jacques, Bissyandé Tegawendé F.
- Soft-prompt Tuning For Large Language Models To Evaluate Bias Tian Jacob-junqi, Emerson David, Miyandoab Sevil Zanjani, Pandya Deval, Seyyed-kalantari Laleh, Khattak Faiza Khan
- Just Ask For Calibration: Strategies For Eliciting Calibrated Confidence Scores From Language Models Fine-tuned With Human Feedback Tian Katherine, Mitchell Eric, Zhou Allan, Sharma Archit, Rafailov Rafael, Yao Huaxiu, Finn Chelsea, Manning Christopher D.
- R³ Prompting: Review, Rephrase And Resolve For Chain-of-thought Reasoning In Large Language Models Under Noisy Context Tian Qingyuan, Zhu Hanlun, Wang Lei, Li Yang, Lan Yunshi
- Graph Neural Prompting With Large Language Models Tian Yijun, Song Huan, Wang Zichen, Wang Haozhu, Hu Ziqing, Wang Fang, Chawla Nitesh V., Xu Panpan
- Evil Geniuses: Delving Into The Safety Of Llm-based Agents Tian Yu, Yang Xiao, Zhang Jingyuan, Dong Yinpeng, Su Hang
- Chimed-gpt: A Chinese Medical Large Language Model With Full Training Regime And Better Alignment To Human Preferences Tian Yuanhe, Gan Ruyi, Song Yan, Zhang Jiaxing, Zhang Yongdong
- BOOST: Harnessing Black-box Control To Boost Commonsense In Lms' Generation Tian Yufei, Zhang Felix, Peng Nanyun
- RLHF-V: Towards Trustworthy Mllms Via Behavior Alignment From Fine-grained Correctional Human Feedback Tianyu Yu, Yuan Yao, Haoye Zhang, Taiwen He, Yifeng Han, Ganqu Cui, Jinyi Hu, Zhiyuan Liu, Hai-tao Zheng, Maosong Sun, Tat-seng Chua
- Qlora: Efficient Finetuning Of Quantized Llms Tim Dettmers, Artidoro Pagnoni, Ari Holtzman, Luke Zettlemoyer
- Baby Llama: Knowledge Distillation From An Ensemble Of Teachers Trained On A Small Dataset With No Performance Penalty Timiryasov Inar, Tastet Jean-loup
- Better Language Models Of Code Through Self-improvement To Hung Quoc, Bui Nghi D. Q., Guo Jin, Nguyen Tien N.
- Function Vectors In Large Language Models Todd Eric, Li Millicent L., Sharma Arnab Sen, Mueller Aaron, Wallace Byron C., Bau David
- Clinical Camel: An Open Expert-level Medical Language Model With Dialogue-based Knowledge Encoding Toma Augustin, Lawler Patrick R., Ba Jimmy, Krishnan Rahul G., Rubin Barry B., Wang Bo
- Videberta: A Powerful Pre-trained Language Model For Vietnamese Tran Cong Dao, Pham Nhut Huy, Nguyen Anh, Hy Truong Son, Vu Tu
- Bioinstruct: Instruction Tuning Of Large Language Models For Biomedical Natural Language Processing Tran Hieu, Yang Zhichao, Yao Zonghai, Yu Hong
- Viclevr: A Visual Reasoning Dataset And Hybrid Multimodal Fusion Model For Visual Question Answering In Vietnamese Tran Khiem Vinh, Phan Hao Phu, Van Nguyen Kiet, Nguyen Ngan Luu Thuy
- Bartphobeit: Pre-trained Sequence-to-sequence And Image Transformers Models For Vietnamese Visual Question Answering Tran Khiem Vinh, Van Nguyen Kiet, Nguyen Ngan Luu Thuy
- Large Language Model Prompt Chaining For Long Legal Document Classification Trautmann Dietrich
- Can Large Language Models Play Text Games Well? Current State-of-the-art And Open Questions Tsai Chen Feng, Zhou Xiaochen, Liu Sierra S., Li Jing, Yu Mo, Mei Hongyuan
- Automl-gpt: Large Language Model For Automl Tsai Yun-da, Tsai Yu-che, Huang Bo-wei, Yang Chun-pai, Lin Shou-de
- Chatgpt As A Commenter To The News: Can Llms Generate Human-like Opinions? Tseng Rayden, Verberne Suzan, Van Der Putten Peter
- Overinformative Question Answering By Humans And Machines Tsvilodub Polina, Franke Michael, Hawkins Robert D., Goodman Noah D.
- How Many Unicorns Are In This Image? A Safety Evaluation Benchmark For Vision Llms Tu Haoqin, Cui Chenhang, Wang Zijun, Zhou Yiyang, Zhao Bingchen, Han Junlin, Zhou Wangchunshu, Yao Huaxiu, Xie Cihang
- Efficiently Aligned Cross-lingual Transfer Learning For Conversational Tasks Using Prompt-tuning Tu Lifu, Qu Jin, Yavuz Semih, Joty Shafiq, Liu Wenhao, Xiong Caiming, Zhou Yingbo
- Unlocking Anticipatory Text Generation: A Constrained Approach For Large Language Models Decoding Tu Lifu, Yavuz Semih, Qu Jin, Xu Jiacheng, Meng Rui, Xiong Caiming, Zhou Yingbo
- Causal-discovery Performance Of Chatgpt In The Context Of Neuropathic Pain Diagnosis Tu Ruibo, Ma Chao, Zhang Cheng
- Automating Code-related Tasks Through Transformers: The Impact Of Pre-training Tufano Rosalia, Pascarella Luca, Bavota Gabriele
- Activation Addition: Steering Language Models Without Optimization Turner Alexander Matt, Thiergart Lisa, Leech Gavin, Udell David, Vazquez Juan J., Mini Ulisse, Macdiarmid Monte
- Language Models Don't Always Say What They Think: Unfaithful Explanations In Chain-of-thought Prompting Turpin Miles, Michael Julian, Perez Ethan, Bowman Samuel R.
- Focused Transformer: Contrastive Training For Context Scaling Tworkowski Szymon, Staniszewski Konrad, Pacek Mikołaj, Wu Yuhuai, Michalewski Henryk, Miłoś Piotr
- Simple Is Better And Large Is Not Enough: Towards Ensembling Of Foundational Language Models Tyagi Nancy, Shiri Aidin, Sarkar Surjodeep, Umrawal Abhishek Kumar, Gaur Manas
- Zeroshotdataaug: Generating And Augmenting Training Data With Chatgpt Ubani Solomon, Polat Suleyman Olcay, Nielsen Rodney
- Taco: Enhancing Cross-lingual Transfer For Low-resource Languages In Llms Through Translation-assisted Chain-of-thought Processes Upadhayay Bibek, Behzadan Vahid
- How Prevalent Is Gender Bias In Chatgpt? -- Exploring German And English Chatgpt Responses Urchs Stefanie, Thurner Veronika, Aßenmacher Matthias, Heumann Christian, Thiemichen Stephanie
- Mlongt5: A Multilingual And Efficient Text-to-text Transformer For Longer Sequences Uthus David, Ontañón Santiago, Ainslie Joshua, Guo Mandy
- Chatclimate: Grounding Conversational AI In Climate Science Vaghefi Saeid Ashraf, Wang Qian, Muccione Veruska, Ni Jingwei, Kraus Mathias, Bingler Julia, Schimanski Tobias, Colesanti-senni Chiara, Webersinke Nicolas, Huggel Christian, Leippold Markus
- Humans And Language Models Diverge When Predicting Repeating Text Vaidya Aditya R., Turek Javier, Huth Alexander G.
- Let's Think Frame By Frame With VIP: A Video Infilling And Prediction Dataset For Evaluating Video Chain-of-thought Vaishnavi Himakunthala, Andy Ouyang, Daniel Rose, Ryan He, Alex Mei, Yujie Lu, Chinmay Sonar, Michael Saxon, William Yang Wang
- Mini-gpts: Efficient Large Language Models Through Contextual Pruning Valicenti Tim, Vidal Justice, Patnaik Ritik
- Can Large Language Models Really Improve By Self-critiquing Their Own Plans? Valmeekam Karthik, Marquez Matthew, Kambhampati Subbarao
- On The Planning Abilities Of Large Language Models: A Critical Investigation Valmeekam Karthik, Marquez Matthew, Sreedharan Sarath, Kambhampati Subbarao
- The LLM Surgeon Van Der Ouderaa Tycho F. A., Nagel Markus, Van Baalen Mart, Asano Yuki M., Blankevoort Tijmen
- Evaluating Shutdown Avoidance Of Language Models In Textual Scenarios Van Der Weij Teun, Lermen Simon, Lang Leon
- Theory Of Mind In Large Language Models: Examining Performance Of 11 State-of-the-art Models Vs. Children Aged 7-10 On Advanced Tests Van Duijn Max J., Van Dijk Bram M. A., Kouwenhoven Tom, De Valk Werner, Spruit Marco R., Van Der Putten Peter
- Detecting And Correcting Hate Speech In Multimodal Memes With Large Visual Language Model Van Minh-hao, Wu Xintao
- Structured Thoughts Automaton: First Formalized Execution Model For Auto-regressive Language Models Vanderbruggen Tristan, Liao Chunhua, Pirkelbauer Peter, Lin Pei-hung
- Can NLP Models Correctly Reason Over Contexts That Break The Common Assumptions? Varshney Neeraj, Parmar Mihir, Patel Nisarg, Handa Divij, Sarkar Sayantan, Luo Man, Baral Chitta
- A Stitch In Time Saves Nine: Detecting And Mitigating Hallucinations Of Llms By Validating Low-confidence Generation Varshney Neeraj, Yao Wenlin, Zhang Hongming, Chen Jianshu, Yu Dong
- Adventures Of Trustworthy Vision-language Models: A Survey Vatsa Mayank, Jain Anubhooti, Singh Richa
- Gpt-who: An Information Density-based Machine-generated Text Detector Venkatraman Saranya, Uchendu Adaku, Lee Dongwon
- Emulating Human Cognitive Processes For Expert-level Medical Question-answering With Large Language Models Verma Khushboo, Moore Marina, Wottrich Stephanie, López Karla Robles, Aggarwal Nishant, Bhatt Zeel, Singh Aagamjit, Unroe Bradford, Basheer Salah, Sachdeva Nitish, Arora Prinka, Kaur Harmanjeet, Kaur Tanupreet, Hood Tevon, Marquez Anahi, Varshney Tushar, Deng Nanfu, Ramani Azaan, Ishwara Pawanraj, Saeed Maimoona, Peña Tatiana López Velarde, Barksdale Bryan, Guha Sushovan, Kumar Satwant
- Simplesafetytests: A Test Suite For Identifying Critical Safety Risks In Large Language Models Vidgen Bertie, Scherrer Nino, Kirk Hannah Rose, Qian Rebecca, Kannappan Anand, Hale Scott A., Röttger Paul
- Behind The Magic, MERLIM: Multi-modal Evaluation Benchmark For Large Image-language Models Villa Andrés, Alcázar Juan Carlos León, Soto Alvaro, Ghanem Bernard
- Fairpy: A Toolkit For Evaluation Of Social Biases And Their Mitigation In Large Language Models Viswanath Hrishikesh, Zhang Tianyi
- Prompt2model: Generating Deployable Models From Natural Language Instructions Viswanathan Vijay, Zhao Chenyang, Bertsch Amanda, Wu Tongshuang, Neubig Graham
- Generative Pre-trained Transformer For Vietnamese Community-based COVID-19 Question Answering Vo Tam Minh, Tran Khiem Vinh
- Unraveling Downstream Gender Bias From Large Language Models: A Study On AI Educational Writing Assistance Wambsganss Thiemo, Su Xiaotian, Swamy Vinitra, Neshaei Seyed Parsa, Rietsche Roman, Käser Tanja
- Poisoning Language Models During Instruction Tuning Wan Alexander, Wallace Eric, Shen Sheng, Klein Dan
- Are Personalized Stochastic Parrots More Dangerous? Evaluating Persona Biases In Dialogue Systems Wan Yixin, Zhao Jieyu, Chadha Aman, Peng Nanyun, Chang Kai-wei
- GPT-RE: In-context Learning For Relation Extraction Using Large Language Models Wan Zhen, Cheng Fei, Mao Zhuoyuan, Liu Qianying, Song Haiyue, Li Jiwei, Kurohashi Sadao
- Reformulating Domain Adaptation Of Large Language Models As Adapt-retrieve-revise: A Case Study On Chinese Legal Domain Wan Zhen, Zhang Yating, Wang Yexiang, Cheng Fei, Kurohashi Sadao
- Shall We Pretrain Autoregressive Language Models With Retrieval? A Comprehensive Study Wang Boxin, Ping Wei, Xu Peng, Mcafee Lawrence, Liu Zihan, Shoeybi Mohammad, Dong Yi, Kuchaiev Oleksii, Li Bo, Xiao Chaowei, Anandkumar Anima, Catanzaro Bryan
- Keqing: Knowledge-based Question Answering Is A Nature Chain-of-thought Mentor Of LLM Wang Chaojie, Xu Yishi, Peng Zhong, Zhang Chenxi, Chen Bo, Wang Xinrun, Feng Lei, An Bo
- Cost-effective Hyperparameter Optimization For Large Language Model Generation Inference Wang Chi, Liu Susan Xueqing, Awadallah Ahmed H.
- BERT4CTR: An Efficient Framework To Combine Pre-trained Language Model With Non-textual Features For CTR Prediction Wang Dong, Salamatian Kavé, Xia Yunqing, Deng Weiwei, Zhiang Qi
- Docllm: A Layout-aware Generative Language Model For Multimodal Document Understanding Wang Dongsheng, Raman Natraj, Sibue Mathieu, Ma Zhiqiang, Babkin Petr, Kaur Simerjot, Pei Yulong, Nourbakhsh Armineh, Liu Xiaomo
- Robust Natural Language Understanding With Residual Attention Debiasing Wang Fei, Huang James Y., Yan Tianyi, Zhou Wenxuan, Chen Muhao
- A Survey Of The Evolution Of Language Model-based Dialogue Systems Wang Hongru, Wang Lingzhi, Du Yiming, Chen Liang, Zhou Jingyan, Wang Yufei, Wong Kam-fai
- Cue-cot: Chain-of-thought Prompting For Responding To In-depth Dialogue Questions With Llms Wang Hongru, Wang Rui, Mi Fei, Deng Yang, Wang Zezhong, Liang Bin, Xu Ruifeng, Wong Kam-fai
- Dialog Action-aware Transformer For Dialog Policy Learning Wang Huimin, Kwan Wai-chung, Wong Kam-fai
- Cross-lingual Knowledge Editing In Large Language Models Wang Jiaan, Liang Yunlong, Sun Zengkui, Cao Yuxuan, Xu Jiarong, Meng Fandong
- Knowledgeable In-context Tuning: Exploring And Exploiting Factual Knowledge For In-context Learning Wang Jianing, Wang Chengyu, Tan Chuanqi, Huang Jun, Gao Ming
- On The Robustness Of Chatgpt: An Adversarial And Out-of-distribution Perspective Wang Jindong, Hu Xixu, Hou Wenxin, Chen Hao, Zheng Runkai, Wang Yidong, Yang Linyi, Huang Haojun, Ye Wei, Geng Xiubo, Jiao Binxin, Zhang Yue, Xie Xing
- Missrec: Pre-training And Transferring Multi-modal Interest-aware Sequence Representation For Recommendation Wang Jinpeng, Zeng Ziyun, Wang Yunxiao, Wang Yuting, Lu Xingyu, Li Tianxiang, Yuan Jun, Zhang Rui, Zheng Hai-tao, Xia Shu-tao
- Notechat: A Dataset Of Synthetic Doctor-patient Conversations Conditioned On Clinical Notes Wang Junda, Yao Zonghai, Yang Zhichao, Zhou Huixue, Li Rumeng, Wang Xun, Xu Yucheng, Yu Hong
- Improved Visual Fine-tuning With Natural Language Supervision Wang Junyang, Xu Yuanhong, Hu Juhua, Yan Ming, Sang Jitao, Qian Qi
- Mathcoder: Seamless Code Integration In Llms For Enhanced Mathematical Reasoning Wang Ke, Ren Houxing, Zhou Aojun, Lu Zimu, Luo Sichun, Shi Weikang, Zhang Renrui, Song Linqi, Zhan Mingjie, Li Hongsheng
- Chated: A Chatbot Leveraging Chatgpt For An Enhanced Learning Experience In Higher Education Wang Kevin, Ramos Jason, Lawrence Ramon
- Document-level Machine Translation With Large Language Models Wang Longyue, Lyu Chenyang, Ji Tianbo, Zhang Zhirui, Yu Dian, Shi Shuming, Tu Zhaopeng
- Recursively Summarizing Enables Long-term Dialogue Memory In Large Language Models Wang Qingyue, Ding Liang, Cao Yanan, Tian Zhiliang, Wang Shi, Tao Dacheng, Guo Li
- Scimon: Scientific Inspiration Machines Optimized For Novelty Wang Qingyun, Downey Doug, Ji Heng, Hope Tom
- Aurora: Activating Chinese Chat Capability For Mixtral-8x7b Sparse Mixture-of-experts Through Instruction-tuning Wang Rongsheng, Chen Haoming, Zhou Ruizhe, Duan Yaofei, Cai Kunyan, Ma Han, Cui Jiaxi, Li Jian, Pang Patrick Cheong-iao, Wang Yapeng, Tan Tao
- Ivygpt: Interactive Chinese Pathway Language Model In Medical Domain Wang Rongsheng, Duan Yaofei, Lam Chantong, Chen Jiexi, Xu Jiangsheng, Chen Haoming, Liu Xiaohong, Pang Patrick Cheong-iao, Tan Tao
- Bridging The Novice-expert Gap Via Models Of Decision-making: A Case Study On Remediating Math Mistakes Wang Rose E., Zhang Qingyang, Robinson Carly, Loeb Susanna, Demszky Dorottya
- Compositional Text-to-image Synthesis With Attention Map Control Of Diffusion Models Wang Ruichen, Chen Zekang, Chen Chen, Ma Jian, Lu Haonan, Lin Xiaodong
- Vamos: Versatile Action Models For Video Understanding Wang Shijie, Zhao Qi, Do Minh Quan, Agarwal Nakul, Lee Kwonjoon, Sun Chen
- READ: Recurrent Adaptation Of Large Transformers Wang Sid, Nguyen John, Li Ke, Wu Carole-jean
- Shepherd: A Critic For Language Model Generation Wang Tianlu, Yu Ping, Tan Xiaoqing Ellen, O'brien Sean, Pasunuru Ramakanth, Dwivedi-yu Jane, Golovneva Olga, Zettlemoyer Luke, Fazel-zarandi Maryam, Celikyilmaz Asli
- CAR: Conceptualization-augmented Reasoner For Zero-shot Commonsense Question Answering Wang Weiqi, Fang Tianqing, Ding Wenxuan, Xu Baixuan, Liu Xin, Song Yangqiu, Bosselut Antoine
- All Languages Matter: On The Multilingual Safety Of Large Language Models Wang Wenxuan, Tu Zhaopeng, Chen Chang, Yuan Youliang, Huang Jen-tse, Jiao Wenxiang, Lyu Michael R.
- Instructuie: Multi-task Instruction Tuning For Unified Information Extraction Wang Xiao, Zhou Weikang, Zu Can, Xia Han, Chen Tianze, Zhang Yuansen, Zheng Rui, Ye Junjie, Zhang Qi, Gui Tao, Kang Jihua, Yang Jingsheng, Li Siyuan, Du Chunsai
- MINT: Evaluating Llms In Multi-turn Interaction With Tools And Language Feedback Wang Xingyao, Wang Zihan, Liu Jiateng, Chen Yangyi, Yuan Lifan, Peng Hao, Ji Heng
- Probing Large Language Models From A Human Behavioral Perspective Wang Xintong, Li Xiaoyu, Li Xingshan, Biemann Chris
- User-aware Prefix-tuning Is A Good Learner For Personalized Image Captioning Wang Xuan, Wang Guanhong, Chai Wenhao, Zhou Jiayu, Wang Gaoang
- Emotional Intelligence Of Large Language Models Wang Xuena, Li Xueting, Yin Zi, Wu Yue, Jia Liu
- Instructta: Instruction-tuned Targeted Attack For Large Vision-language Models Wang Xunguang, Ji Zhenlan, Ma Pingchuan, Li Zongjie, Wang Shuai
- Adding Guardrails To Advanced Chatbots Wang Yanchen, Singh Lisa
- Non-intrusive Adaptation: Input-centric Parameter-efficient Fine-tuning For Versatile Multimodal Modeling Wang Yaqing, Wu Jialin, Dabral Tanmaya, Zhang Jiageng, Brown Geoff, Lu Chun-ta, Liu Frederick, Liang Yi, Pang Bo, Bendersky Michael, Soricut Radu
- NEWTON: Are Large Language Models Capable Of Physical Reasoning? Wang Yi Ru, Duan Jiafei, Fox Dieter, Srinivasa Siddhartha
- Pandalm: An Automatic Evaluation Benchmark For LLM Instruction Tuning Optimization Wang Yidong, Yu Zhuohao, Zeng Zhengran, Yang Linyi, Wang Cunxiang, Chen Hao, Jiang Chaoya, Xie Rui, Wang Jindong, Xie Xing, Ye Wei, Zhang Shikun, Zhang Yue
- Hint-enhanced In-context Learning Wakes Large Language Models Up For Knowledge-intensive Tasks Wang Yifan, Guo Qingyan, Ni Xinzhe, Shi Chufan, Liu Lemao, Jiang Haiyun, Yang Yujiu
- Universality And Limitations Of Prompt Tuning Wang Yihan, Chauhan Jatin, Wang Wei, Hsieh Cho-jui
- Multilora: Democratizing Lora For Better Multi-task Learning Wang Yiming, Lin Yu, Zeng Xiaodong, Zhang Guannan
- Gradient-based Word Substitution For Obstinate Adversarial Examples Generation In Language Models Wang Yimu, Shi Peng, Zhang Hongyang
- Primacy Effect Of Chatgpt Wang Yiwei, Cai Yujun, Chen Muhao, Liang Yuxuan, Hooi Bryan
- How Far Can Camels Go? Exploring The State Of Instruction Tuning On Open Resources Wang Yizhong, Ivison Hamish, Dasigi Pradeep, Hessel Jack, Khot Tushar, Chandu Khyathi Raghavi, Wadden David, Macmillan Kelsey, Smith Noah A., Beltagy Iz, Hajishirzi Hannaneh
- Vaquita: Enhancing Alignment In Llm-assisted Video Understanding Wang Yizhou, Zhang Ruiyi, Wang Haoliang, Bhattacharya Uttaran, Fu Yun, Wu Gang
- Towards Improving Document Understanding: An Exploration On Text-grounding Via Mllms Wang Yonghui, Zhou Wengang, Feng Hao, Zhou Keyi, Li Houqiang
- DRDT: Dynamic Reflection With Divergent Thinking For Llm-based Sequential Recommendation Wang Yu, Liu Zhiwei, Zhang Jianguo, Yao Weiran, Heinecke Shelby, Yu Philip S.
- Learning Hierarchical Prompt With Structured Linguistic Knowledge For Vision-language Models Wang Yubin, Jiang Xinyang, Cheng De, Li Dongsheng, Zhao Cairong
- Augmenting Black-box Llms With Medical Textbooks For Clinical Question Answering Wang Yubo, Ma Xueguang, Chen Wenhu
- Harnessing The Power Of David Against Goliath: Exploring Instruction Data Generation Without Using Closed-source Models Wang Yue, Wang Xinrui, Li Juntao, Chang Jinxiong, Zhang Qishen, Liu Zhongyi, Zhang Guannan, Zhang Min
- Gemini In Reasoning: Unveiling Commonsense In Multimodal Large Language Models Wang Yuqing, Zhao Yun
- Metacognitive Prompting Improves Understanding In Large Language Models Wang Yuqing, Zhao Yun
- Smarttrim: Adaptive Tokens And Attention Pruning For Efficient Vision-language Models Wang Zekun, Chen Jingchang, Zhou Wangchunshu, Zhu Haichao, Liang Jiafeng, Shan Liping, Liu Ming, Xu Dongliang, Yang Qing, Qin Bing
- Gpt4video: A Unified Multimodal Large Language Model For Instruction-followed Understanding And Safety-aware Generation Wang Zhanyu, Wang Longyue, Zhao Zhen, Wu Minghao, Lyu Chenyang, Li Huayang, Cai Deng, Zhou Luping, Shi Shuming, Tu Zhaopeng
- An In-depth Investigation Of User Response Simulation For Conversational Search Wang Zhenduo, Xu Zhichao, Ai Qingyao, Srikumar Vivek
- Mediagpt: A Large Language Model For Chinese Media Wang Zhonghao, Lu Zijia, Jin Bo, Deng Haiying
- Can Llms Like GPT-4 Outperform Traditional AI Tools In Dementia Diagnosis? Maybe, But Not Today Wang Zhuo, Li Rongzhen, Dong Bowen, Wang Jie, Li Xiuxing, Liu Ning, Mao Chenhui, Zhang Wei, Dong Liling, Gao Jing, Wang Jianyong
- An Effective Data Creation Pipeline To Generate High-quality Financial Instruction Data For Large Language Model Wang Ziao, Wang Jianning, Wu Junda, Zhang Xiaofeng
- Multiqg-ti: Towards Question Generation From Multi-modal Sources Wang Zichao, Baraniuk Richard
- Interviewbot: Real-time End-to-end Dialogue System To Interview Students For College Admission Wang Zihao, Keyes Nathan, Crawford Terry, Choi Jinho D.
- Generating Valid And Natural Adversarial Examples With Large Language Models Wang Zimu, Wang Wei, Chen Qi, Wang Qiufeng, Nguyen Anh
- Memorybank: Enhancing Large Language Models With Long-term Memory Wanjun Zhong, Lianghong Guo, Qiqi Gao, He Ye, Yanlin Wang
- Machine Translation For Ge'ez Language Wassie Aman Kassahun
- A Survey Of Large Language Models Wayne Xin Zhao, Kun Zhou, Junyi Li, Tianyi Tang, Xiaolei Wang, Yupeng Hou, Yingqian Min, Beichen Zhang, Junjie Zhang, Zican Dong, Yifan Du, Chen Yang, Yushuo Chen, Zhipeng Chen, Jinhao Jiang, Ruiyang Ren, Yifan Li, Xinyu Tang, Zikang Liu, Peiyu Liu, Jian-yun Nie, Ji-rong Wen
- Jailbroken: How Does LLM Safety Training Fail? Wei Alexander, Haghtalab Nika, Steinhardt Jacob
- Lmsanitator: Defending Prompt-tuning Against Task-agnostic Backdoors Wei Chengkun, Meng Wenlong, Zhang Zhikun, Chen Min, Zhao Minghu, Fang Wenjing, Wang Lei, Zhang Zihui, Chen Wenzhi
- Larger Language Models Do In-context Learning Differently Wei Jerry, Wei Jason, Tay Yi, Tran Dustin, Webson Albert, Lu Yifeng, Chen Xinyun, Liu Hanxiao, Huang Da, Zhou Denny, Ma Tengyu
- Magicoder: Empowering Code Generation With Oss-instruct Wei Yuxiang, Wang Zhe, Liu Jiawei, Ding Yifeng, Zhang Lingming
- REPLUG: Retrieval-augmented Black-box Language Models Weijia Shi, Sewon Min, Michihiro Yasunaga, Minjoon Seo, Rich James, Mike Lewis, Luke Zettlemoyer, Wen-tau Yih
- Is Chatgpt Good At Search? Investigating Large Language Models As Re-ranking Agents Weiwei Sun, Lingyong Yan, Xinyu Ma, Shuaiqiang Wang, Pengjie Ren, Zhumin Chen, Dawei Yin, Zhaochun Ren
- Layoutgpt: Compositional Visual Planning And Generation With Large Language Models Weixi Feng, Wanrong Zhu, Tsu-jui Fu, Varun Jampani, Arjun Akula, Xuehai He, Sugato Basu, Xin Eric Wang, William Yang Wang
- Infovisdial: An Informative Visual Dialogue Dataset By Bridging Large Multimodal And Language Models Wen Bingbing, Yang Zhengyuan, Wang Jianfeng, Gan Zhe, Howe Bill, Wang Lijuan
- Autodroid: Llm-powered Task Automation In Android Wen Hao, Li Yuanchun, Liu Guohong, Zhao Shanhui, Yu Tao, Li Toby Jia-jun, Jiang Shiqi, Liu Yunhao, Zhang Yaqin, Liu Yunxin
- Droidbot-gpt: Gpt-powered UI Automation For Android Wen Hao, Wang Hongming, Liu Jiaxuan, Li Yuanchun
- Hyperpolyglot Llms: Cross-lingual Interpretability In Token Embeddings Wen-yi Andrea W, Mimno David
- Helping Language Models Learn More: Multi-dimensional Task Prompt For Few-shot Tuning Weng Jinta, Zhang Jiarui, Hu Yue, Fa Daidong, Xuand Xiaofeng, Huang Heyan
- Mastering Symbolic Operations: Augmenting Language Models With Compiled Neural Networks Weng Yixuan, Zhu Minjun, Xia Fei, Li Bin, He Shizhu, Liu Kang, Zhao Jun
- Instructblip: Towards General-purpose Vision-language Models With Instruction Tuning Wenliang Dai, Junnan Li, Dongxu Li, Anthony Meng Huat Tiong, Junqi Zhao, Weisheng Wang, Boyang Li, Pascale Fung, Steven Hoi
- Is Chatgpt A Good Translator? Yes With GPT-4 As The Engine Wenxiang Jiao, Wenxuan Wang, Jen-tse Huang, Xing Wang, Shuming Shi, Zhaopeng Tu
- An Overview Of Temporal Commonsense Reasoning And Acquisition Wenzel Georg, Jatowt Adam
- Advances In Apparent Conceptual Physics Reasoning In GPT-4 West Colin G.
- AI And The FCI: Can Chatgpt Project An Understanding Of Introductory Physics? West Colin G.
- The Generative AI Paradox: "what It Can Create, It May Not Understand" West Peter, Lu Ximing, Dziri Nouha, Brahman Faeze, Li Linjie, Hwang Jena D., Jiang Liwei, Fisher Jillian, Ravichander Abhilasha, Chandu Khyathi, Newman Benjamin, Koh Pang Wei, Ettinger Allyson, Choi Yejin
- System 2 Attention (is Something You Might Need Too) Weston Jason, Sukhbaatar Sainbayar
- Llm-powered Data Augmentation For Enhanced Cross-lingual Performance Whitehouse Chenxi, Choudhury Monojit, Aji Alham Fikri
- Better Question-answering Models On A Budget Wijeratne Yudhanjaya, Marikar Ishan
- Frustratingly Simple Memory Efficiency For Pre-trained Language Models Via Dynamic Embedding Pruning Williams Miles, Aletras Nikolaos
- From Word Models To World Models: Translating From Natural Language To The Probabilistic Language Of Thought Wong Lionel, Grand Gabriel, Lew Alexander K., Goodman Noah D., Mansinghka Vikash K., Andreas Jacob, Tenenbaum Joshua B.
- Natural Language Generation And Understanding Of Big Code For Ai-assisted Programming: A Review Wong Man Fai, Guo Shangxin, Hang Ching Nam, Ho Siu Wai, Tan Chee Wei
- Cases Of EFL Secondary Students' Prompt Engineering Pathways To Complete A Writing Task With Chatgpt Woo David James, Guo Kai, Susanto Hengky
- From Big To Small Without Losing It All: Text Augmentation With Chatgpt For Efficient Sentiment Analysis Woźniak Stanisław, Kocoń Jan
- Extrapolating Multilingual Understanding Models As Multilingual Generators Wu Bohong, Yuan Fei, Zhao Hai, Li Lei, Xu Jingjing
- Pmc-llama: Towards Building Open-source Language Models For Medicine Wu Chaoyi, Lin Weixiong, Zhang Xiaoman, Zhang Ya, Wang Yanfeng, Xie Weidi
- Rethinking Model Selection And Decoding For Keyphrase Generation With Pre-trained Sequence-to-sequence Models Wu Di, Ahmad Wasi Uddin, Chang Kai-wei
- Language Prompt For Autonomous Driving Wu Dongming, Han Wencheng, Wang Tiancai, Liu Yingfei, Zhang Xiangyu, Shen Jianbing
- Exploring The Limits Of Chatgpt In Software Security Applications Wu Fangzhou, Zhang Qingzhao, Bajaj Ati Priya, Bao Tiffany, Zhang Ning, Wang Ruoyu "fish", Xiao Chaowei
- Speechgen: Unlocking The Generative Power Of Speech Language Models With Prompts Wu Haibin, Chang Kai-wei, Wu Yuan-kuei, Lee Hung-yi
- P-laplacian Adaptation For Generative Pre-trained Vision-language Models Wu Haoyuan, Zhang Xinyun, Xu Peng, Liao Peiyu, Yao Xufeng, Yu Bei
- Efficient LLM Inference Solution On Intel GPU Wu Hui, Gan Yi, Yuan Feng, Ma Jing, Zhu Wei, Xu Yutao, Zhu Hong, Zhu Yuhua, Liu Xiaoli, Gu Jinghui, Zhao Peng
- Large Language Models Leverage External Knowledge To Extend Clinical Insight Beyond Language Boundaries Wu Jiageng, Wu Xian, Qiu Zhaopeng, Li Minghui, Zhang Yingying, Zheng Yefeng, Yuan Changzheng, Yang Jie
- Document Flattening: Beyond Concatenating Context For Document-level Neural Machine Translation Wu Minghao, Foster George, Qu Lizhen, Haffari Gholamreza
- Lamini-lm: A Diverse Herd Of Distilled Models From Large-scale Instructions Wu Minghao, Waheed Abdul, Zhang Chiyu, Abdul-mageed Muhammad, Aji Alham Fikri
- Approximated Prompt Tuning For Vision-language Pre-trained Models Wu Qiong, Huang Shubin, Zhou Yiyi, Dai Pingyang, Shu Annan, Jiang Guannan, Ji Rongrong
- A Comparative Study Of Open-source Large Language Models, GPT-4 And Claude 2: Multiple-choice Test Taking In Nephrology Wu Sean, Koo Michael, Blum Lesley, Black Andy, Kao Liyo, Scalzo Fabien, Kurtz Ira
- YUAN 2.0: A Large Language Model With Localized Filtering-based Attention Wu Shaohua, Zhao Xudong, Wang Shenling, Luo Jiangang, Li Lingjun, Chen Xi, Zhao Bing, Wang Wei, Yu Tong, Zhang Rongguo, Zhang Jiahua, Wang Chao
- Next-gpt: Any-to-any Multimodal LLM Wu Shengqiong, Fei Hao, Qu Leigang, Ji Wei, Chua Tat-seng
- A New Dialogue Response Generation Agent For Large Language Models By Asking Questions To Detect User's Intentions Wu Siwei, Shen Xiangqing, Xia Rui
- Towards Robust Text Retrieval With Progressive Learning Wu Tong, Qin Yulei, Zhang Enwei, Xu Zihan, Gao Yuting, Li Ke, Sun Xing
- Unleashing Potential Of Evidence In Knowledge-intensive Dialogue Generation Wu Xianjie, Yang Jian, Li Tongliang, Liang Di, Zhang Shiwei, Du Yiyang, Li Zhoujun
- Unveiling Security, Privacy, And Ethical Concerns Of Chatgpt Wu Xiaodong, Duan Ran, Ni Jianbing
- From Language Modeling To Instruction Following: Understanding The Behavior Shift In Llms After Instruction Tuning Wu Xuansheng, Yao Wenlin, Chen Jianshu, Pan Xiaoman, Wang Xiaoyang, Liu Ninghao, Yu Dong
- An Early Evaluation Of Gpt-4v(ision) Wu Yang, Wang Shilong, Yang Hao, Zheng Tian, Zhang Hongbo, Zhao Yanyan, Qin Bing
- Improving Cross-task Generalization With Step-by-step Instructions Wu Yang, Zhao Yanyan, Li Zhongyang, Qin Bing, Xiong Kai
- TLM: Token-level Masking For Transformers Wu Yangjun, Fang Kebin, Zhang Dongxiang, Wang Han, Zhang Hao, Chen Gang
- Jailbreaking GPT-4V Via Self-adversarial Attacks With System Prompts Wu Yuanwei, Li Xiang, Liu Yixin, Zhou Pan, Sun Lichao
- Openicl: An Open-source Framework For In-context Learning Wu Zhenyu, Wang Yaoxiang, Ye Jiacheng, Feng Jiangtao, Xu Jingjing, Qiao Yu, Wu Zhiyong
- A Simple Baseline For Knowledge-based Visual Question Answering Xenos Alexandros, Stafylakis Themos, Patras Ioannis, Tzimiropoulos Georgios
- Pali-3 Vision Language Models: Smaller, Faster, Stronger Xi Chen, Xiao Wang, Lucas Beyer, Alexander Kolesnikov, Jialin Wu, Paul Voigtlaender, Basil Mustafa, Sebastian Goodman, Ibrahim Alabdulmohsin, Piotr Padlewski, Daniel Salz, Xi Xiong, Daniel Vlasic, Filip Pavetic, Keran Rong, Tianli Yu, Daniel Keysers, Xiaohua Zhai, Radu Soricut
- Conversational Automated Program Repair Xia Chunqiu Steven, Zhang Lingming
- Improving Question Generation With Multi-level Content Planning Xia Zehua, Gou Qi, Yu Bowen, Yu Haiyang, Huang Fei, Li Yongbin, Nguyen Cam-tu
- Language Models Meet World Models: Embodied Experiences Enhance Language Models Xiang Jiannan, Tao Tianhua, Gu Yi, Shu Tianmin, Wang Zirui, Yang Zichao, Hu Zhiting
- Efficient Streaming Language Models With Attention Sinks Xiao Guangxuan, Tian Yuandong, Chen Beidi, Han Song, Lewis Mike
- Can I Trust Your Answer? Visually Grounded Video Question Answering Xiao Junbin, Yao Angela, Li Yicong, Chua Tat Seng
- Personalized Abstractive Summarization By Tri-agent Generation Pipeline Xiao Wen, Xie Yujia, Carenini Giuseppe, He Pengcheng
- Instruction-vit: Multi-modal Prompts For Instruction Learning In Vit Xiao Zhenxiang, Chen Yuzhong, Zhang Lu, Yao Junjie, Wu Zihao, Yu Xiaowei, Pan Yi, Zhao Lin, Ma Chong, Liu Xinyu, Liu Wei, Li Xiang, Yuan Yixuan, Shen Dinggang, Zhu Dajiang, Liu Tianming, Jiang Xi
- Supporting Qualitative Analysis With Large Language Models: Combining Codebook With GPT-3 For Deductive Coding Xiao Ziang, Yuan Xingdi, Liao Q. Vera, Abdelghani Rania, Oudeyer Pierre-yves
- Rethinking The Evaluation For Conversational Recommendation In The Era Of Large Language Models Xiaolei Wang, Xinyu Tang, Wayne Xin Zhao, Jingyuan Wang, Ji-rong Wen
- A Survey On Multi-behavior Sequential Recommendation Xiaoqing Chen, Zhitao Li, Weike Pan, Zhong Ming
- Pangu-σ: Towards Trillion Parameter Language Model With Sparse Heterogeneous Computing Xiaozhe Ren, Pingyi Zhou, Xinfan Meng, Xinjing Huang, Yadao Wang, Weichao Wang, Pengfei Li, Xiaoda Zhang, Alexander Podolskiy, Grigory Arshinov, Andrey Bout, Irina Piontkovskaya, Jiansheng Wei, Xin Jiang, Teng Su, Qun Liu, Jun Yao
- Funqa: Towards Surprising Video Comprehension Xie Binzhu, Zhang Sicheng, Zhou Zitang, Li Bo, Zhang Yuanhan, Hessel Jack, Yang Jingkang, Liu Ziwei
- The Wall Street Neophyte: A Zero-shot Analysis Of Chatgpt Over Multimodal Stock Movement Prediction Challenges Xie Qianqian, Han Weiguang, Lai Yanzhao, Peng Min, Huang Jimin
- Tencentllmeval: A Hierarchical Evaluation Of Real-world Capabilities For Human-aligned Llms Xie Shuyi, Yao Wenlin, Dai Yong, Wang Shaobo, Zhou Donlin, Jin Lifeng, Feng Xinhua, Wei Pengzhi, Lin Yujie, Hu Zhichao, Yu Dong, Zhang Zhengyou, Nie Jing, Liu Yuhong
- Openagents: An Open Platform For Language Agents In The Wild Xie Tianbao, Zhou Fan, Cheng Zhoujun, Shi Peng, Weng Luoxuan, Liu Yitao, Hua Toh Jing, Zhao Junning, Liu Qian, Liu Che, Liu Leo Z., Xu Yiheng, Su Hongjin, Shin Dongchan, Xiong Caiming, Yu Tao
- Empirical Study Of Zero-shot NER With Chatgpt Xie Tingyu, Li Qi, Zhang Jian, Zhang Yan, Liu Zuozhu, Wang Hongwei
- Analysis Of The Reasoning With Redundant Information Provided Ability Of Large Language Models Xie Wenbei
- Translating Natural Language To Planning Goals With Large-language Models Xie Yaqi, Yu Chen, Zhu Tongyao, Bai Jinbin, Gong Ze, Soh Harold
- Olagpt: Empowering Llms With Human-like Problem-solving Abilities Xie Yuanzhen, Xie Tao, Lin Mingxiong, Wei Wentao, Li Chenglin, Kong Beibei, Chen Lei, Zhuo Chengxiang, Hu Bo, Li Zang
- Making Small Language Models Better Multi-task Learners With Mixture-of-task-adapters Xie Yukang, Wang Chengyu, Yan Junbing, Zhou Jiyong, Deng Feiqi, Huang Jun
- A Prompt Log Analysis Of Text-to-image Generation Systems Xie Yutong, Pan Zhaoying, Ma Jinge, Jie Luo, Mei Qiaozhu
- The Next Chapter: A Study Of Large Language Models In Storytelling Xie Zhuohan, Cohn Trevor, Lau Jey Han
- A Preliminary Study Of Chatgpt On News Recommendation: Personalization, Provider Fairness, Fake News Xinyi Li, Yongfeng Zhang, Edward C. Malthouse
- Doctorglm: Fine-tuning Your Chinese Doctor Is Not A Herculean Task Xiong Honglin, Wang Sheng, Zhu Yitao, Zhao Zihao, Liu Yuxiao, Huang Linlin, Wang Qian, Shen Dinggang
- TRIGO: Benchmarking Formal Mathematical Proof Reduction For Generative Language Models Xiong Jing, Shen Jianhao, Yuan Ye, Wang Haiming, Yin Yichun, Liu Zhengying, Li Lin, Guo Zhijiang, Cao Qingxing, Huang Yinya, Zheng Chuanyang, Liang Xiaodan, Zhang Ming, Liu Qun
- Examining Inter-consistency Of Large Language Models Collaboration: An In-depth Analysis Via Debate Xiong Kai, Ding Xiao, Cao Yixin, Liu Ting, Qin Bing
- Can Llms Express Their Uncertainty? An Empirical Evaluation Of Confidence Elicitation In Llms Xiong Miao, Hu Zhiyuan, Lu Xinyang, Li Yifei, Fu Jie, He Junxian, Hooi Bryan
- The Program Testing Ability Of Large Language Models For Code Xiong Weimin, Guo Yiwen, Chen Hao
- Effective Long-context Scaling Of Foundation Models Xiong Wenhan, Liu Jingyu, Molybog Igor, Zhang Hejia, Bhargava Prajjwal, Hou Rui, Martin Louis, Rungta Rashi, Sankararaman Karthik Abinav, Oguz Barlas, Khabsa Madian, Fang Han, Mehdad Yashar, Narang Sharan, Malik Kshitiz, Fan Angela, Bhosale Shruti, Edunov Sergey, Lewis Mike, Wang Sinong, Ma Hao
- Expertprompting: Instructing Large Language Models To Be Distinguished Experts Xu Benfeng, Yang An, Lin Junyang, Wang Quan, Zhou Chang, Zhang Yongdong, Mao Zhendong
- Rewoo: Decoupling Reasoning From Observations For Efficient Augmented Language Models Xu Binfeng, Peng Zhiyuan, Lei Bowen, Mukherjee Subhabrata, Liu Yuchen, Xu Dongkuan
- Baize: An Open-source Chat Model With Parameter-efficient Tuning On Self-chat Data Xu Canwen, Guo Daya, Duan Nan, Mcauley Julian
- Automatic Pair Construction For Contrastive Post-training Xu Canwen, Rosset Corby, Chau Ethan C., Del Corro Luciano, Mahajan Shweti, Mcauley Julian, Neville Jennifer, Awadallah Ahmed Hassan, Rao Nikhil
- Small Models Are Valuable Plug-ins For Large Language Models Xu Canwen, Xu Yichong, Wang Shuohang, Liu Yang, Zhu Chenguang, Mcauley Julian
- Ctc-based Non-autoregressive Speech Translation Xu Chen, Liu Xiaoqian, Liu Xiaowen, Sun Qingxuan, Zhang Yuhao, Yang Murun, Dong Qianqian, Ko Tom, Wang Mingxuan, Xiao Tong, Ma Anxiang, Zhu Jingbo
- DPL: Decoupled Prompt Learning For Vision-language Models Xu Chen, Zhu Yuhan, Zhang Guozhen, Shen Haocheng, Liao Yixuan, Chen Xiaoxin, Wu Gangshan, Wang Limin
- Are Large Language Models Really Good Logical Reasoners? A Comprehensive Evaluation And Beyond Xu Fangzhi, Lin Qika, Han Jiawei, Zhao Tianzhe, Liu Jun, Cambria Erik
- A Paradigm Shift In Machine Translation: Boosting Translation Performance Of Large Language Models Xu Haoran, Kim Young Jin, Sharaf Amr, Awadalla Hany Hassan
- Penetrative AI: Making Llms Comprehend The Physical World Xu Huatao, Han Liying, Yang Qirui, Li Mo, Srivastava Mani
- Pixel Aligned Language Models Xu Jiarui, Zhou Xingyi, Yan Shen, Gu Xiuye, Arnab Anurag, Sun Chen, Wang Xiaolong, Schmid Cordelia
- Superclue: A Comprehensive Chinese Large Language Model Benchmark Xu Liang, Li Anqi, Zhu Lei, Xue Hang, Zhu Changtai, Zhao Kangkang, He Haonan, Zhang Xuanwei, Kang Qiyue, Lan Zhenzhong
- Sc-safety: A Multi-round Open-ended Question Adversarial Safety Benchmark For Large Language Models In Chinese Xu Liang, Zhao Kangkang, Zhu Lei, Xue Hang
- Tensorgpt: Efficient Compression Of The Embedding Layer In Llms Based On The Tensor-train Decomposition Xu Mingxue, Xu Yao Lei, Mandic Danilo P.
- Cognitive Overload: Jailbreaking Large Language Models With Overloaded Logical Thinking Xu Nan, Wang Fei, Zhou Ben, Li Bang Zheng, Xiao Chaowei, Chen Muhao
- On The Tool Manipulation Capability Of Open-source Large Language Models Xu Qiantong, Hong Fenglu, Li Bo, Hu Changran, Chen Zhengyu, Zhang Jian
- Does Conceptual Representation Require Embodiment? Insights From Large Language Models Xu Qihui, Peng Yingying, Nastase Samuel A., Chodorow Martin, Wu Minghua, Li Ping
- Fine-tuned Llms Know More, Hallucinate Less With Few-shot Sequence-to-sequence Semantic Parsing Over Wikidata Xu Silei, Liu Shicheng, Culhane Theo, Pertseva Elizaveta, Wu Meng-hsi, Semnani Sina J., Lam Monica S.
- Shattering The Agent-environment Interface For Fine-tuning Inclusive Language Models Xu Wanqiao, Dong Shi, Arumugam Dilip, Van Roy Benjamin
- INSTRUCTSCORE: Explainable Text Generation Evaluation With Finegrained Feedback Xu Wenda, Wang Danqing, Pan Liangming, Song Zhenqiao, Freitag Markus, Wang William Yang, Li Lei
- Re-reading Improves Reasoning In Large Language Models Xu Xiaohan, Tao Chongyang, Shen Tao, Xu Can, Xu Hongbo, Long Guodong, Lou Jian-guang
- An LLM Can Fool Itself: A Prompt-based Adversarial Attack Xu Xilie, Kong Keyi, Liu Ning, Cui Lizhen, Wang Di, Zhang Jingfeng, Kankanhalli Mohan
- It Ain't That Bad: Understanding The Mysterious Performance Drop In OOD Generalization For Generative Transformer Models Xu Xingcheng, Pan Zihao, Zhang Haipeng, Yang Yanqing
- Mental-llm: Leveraging Large Language Models For Mental Health Prediction Via Online Text Data Xu Xuhai, Yao Bingsheng, Dong Yuanzhe, Gabriel Saadia, Yu Hong, Hendler James, Ghassemi Marzyeh, Dey Anind K., Wang Dakuo
- Llms And The Abstraction And Reasoning Corpus: Successes, Failures, And The Importance Of Object-based Representations Xu Yudong, Li Wenhao, Vaezipoor Pashootan, Sanner Scott, Khalil Elias B.
- Drivegpt4: Interpretable End-to-end Autonomous Driving Via Large Language Model Xu Zhenhua, Zhang Yujia, Xie Enze, Zhao Zhen, Guo Yong, Wong Kwan-yee K., Li Zhenguo, Zhao Hengshuang
- IDOL: Indicator-oriented Logic Pre-training For Logical Reasoning Xu Zihang, Yang Ziqing, Cui Yiming, Wang Shijin
- How Robust Is GPT-3.5 To Predecessors? A Comprehensive Study On Language Understanding Tasks Xuanting Chen, Junjie Ye, Can Zu, Nuo Xu, Rui Zheng, Minlong Peng, Jie Zhou, Tao Gui, Qi Zhang, Xuanjing Huang
- Representation Learning With Large Language Models For Recommendation Xubin Ren, Wei Wei, Lianghao Xia, Lixin Su, Suqi Cheng, Junfeng Wang, Dawei Yin, Chao Huang
- Pit One Against Many: Leveraging Attention-head Embeddings For Parameter-efficient Multi-head Attention Xue Huiyin, Aletras Nikolaos
- Bias And Fairness In Chatbots: An Overview Xue Jintang, Wang Yun-cheng, Wei Chengwei, Liu Xiaofeng, Woo Jonghye, Kuo C.-c. Jay
- Occuquest: Mitigating Occupational Bias For Inclusive Large Language Models Xue Mingfeng, Liu Dayiheng, Yang Kexin, Dong Guanting, Lei Wenqiang, Yuan Zheng, Zhou Chang, Zhou Jingren
- RCOT: Detecting And Rectifying Factual Inconsistency In Reasoning By Reversing Chain-of-thought Xue Tianci, Wang Ziqi, Wang Zhenhailong, Han Chi, Yu Pengfei, Ji Heng
- Scaling Evidence-based Instructional Design Expertise Through Large Language Models Yadav Gautam
- Pretraining Data Mixtures Enable Narrow Model Selection Capabilities In Transformer Models Yadlowsky Steve, Doshi Lyric, Tripuraneni Nilesh
- Evaluating Large Language Models' Ability Using A Psychiatric Screening Tool Based On Metaphor And Sarcasm Scenarios Yakura Hiromu
- Optimizing Machine Translation Through Prompt Engineering: An Investigation Into Chatgpt's Customizability Yamada Masaru
- Evaluating Spatial Understanding Of Large Language Models Yamada Yutaro, Bao Yihan, Lampinen Andrew K., Kasai Jungo, Yildirim Ilker
- LPML: Llm-prompting Markup Language For Mathematical Reasoning Yamauchi Ryutaro, Sonoda Sho, Sannai Akiyoshi, Kumagai Wataru
- Voila-a: Aligning Vision-language Models With User's Gaze Attention Yan Kun, Ji Lei, Wang Zeyu, Wang Yuntao, Duan Nan, Ma Shuai
- Practical And Ethical Challenges Of Large Language Models In Education: A Systematic Scoping Review Yan Lixiang, Sha Lele, Zhao Linxuan, Li Yuheng, Martinez-maldonado Roberto, Chen Guanliang, Li Xinyu, Jin Yueqiao, Gašević Dragan
- Refining The Responses Of Llms By Themselves Yan Tianqiang, Xu Tiansheng
- Multimodal Chatgpt For Medical Applications: An Experimental Study Of GPT-4V Yan Zhiling, Zhang Kai, Zhou Rong, He Lifang, Li Xiang, Sun Lichao
- LACMA: Language-aligning Contrastive Learning With Meta-actions For Embodied Instruction Following Yang Cheng-fu, Chen Yen-chun, Yang Jianwei, Dai Xiyang, Yuan Lu, Wang Yu-chiang Frank, Chang Kai-wei
- Can You Follow Me? Testing Situational Understanding In Chatgpt Yang Chenghao, Ettinger Allyson
- Beyond Testers' Biases: Guiding Model Testing With Knowledge Bases Using Llms Yang Chenyang, Rustogi Rishabh, Brower-sinning Rachel, Lewis Grace A., Kästner Christian, Wu Tongshuang
- Zero-shot Query Reformulation For Conversational Search Yang Dayu, Zhang Yue, Fang Hui
- Refgpt: Dialogue Generation Of GPT, By GPT, And For GPT Yang Dongjie, Yuan Ruifeng, Fan Yuantao, Yang Yifei, Wang Zili, Wang Shusen, Zhao Hai
- PALR: Personalization Aware Llms For Recommendation Yang Fan, Chen Zheng, Jiang Ziyan, Cho Eunah, Huang Xiaojiang, Lu Yanbin
- Empower Large Language Model To Perform Better On Industrial Domain-specific Question Answering Yang Fangkai, Zhao Pu, Wang Zezhong, Wang Lu, Zhang Jue, Garg Mohit, Lin Qingwei, Rajmohan Saravan, Zhang Dongmei
- Instoptima: Evolutionary Multi-objective Instruction Optimization Via Large Language Model-based Instruction Operators Yang Heng, Li Ke
- Auto-gpt For Online Decision Making: Benchmarks And Additional Opinions Yang Hui, Yue Sifu, He Yunzhong
- Longqlora: Efficient And Effective Method To Extend Context Length Of Large Language Models Yang Jianxin
- Iterative Forward Tuning Boosts In-context Learning In Language Models Yang Jiaxi, Hui Binyuan, Yang Min, Wang Bailin, Li Bowen, Li Binhua, Huang Fei, Li Yongbin
- Towards Interpretable Mental Health Analysis With Large Language Models Yang Kailai, Ji Shaoxiong, Zhang Tianlin, Xie Qianqian, Kuang Ziyan, Ananiadou Sophia
- Revisiting Parallel Context Windows: A Frustratingly Simple Alternative And Chain-of-thought Deterioration Yang Kejuan, Liu Xiao, Men Kaiwen, Zeng Aohan, Dong Yuxiao, Tang Jie
- Kobigbird-large: Transformation Of Transformer For Korean Language Understanding Yang Kisu, Jang Yoonna, Lee Taewoo, Seong Jinwoo, Lee Hyungjin, Jang Hwanseok, Lim Heuiseok
- Medxchat: A Unified Multimodal Large Language Model Framework Towards Cxrs Understanding And Generation Yang Ling, Wang Zhanyu, Chen Zhenghao, Liang Xinyu, Zhou Luping
- Give Us The Facts: Enhancing Large Language Models With Knowledge Graphs For Fact-aware Language Modeling Yang Linyao, Chen Hongyang, Li Zhao, Ding Xiao, Wu Xindong
- Supervised Knowledge Makes Large Language Models Better In-context Learners Yang Linyi, Zhang Shuibai, Yu Zhuohao, Bao Guangsheng, Wang Yidong, Wang Jindong, Xu Ruochen, Ye Wei, Xie Xing, Chen Weizhu, Zhang Yue
- Uncertainty-aware Language Modeling For Selective Question Answering Yang Qi, Ravikumar Shreya, Schmitt-ulms Fynn, Lolla Satvik, Demir Ege, Elistratov Iaroslav, Lavaee Alex, Lolla Sadhana, Ahmadi Elaheh, Rus Daniela, Amini Alexander, Perez Alejandro
- Integrating UMLS Knowledge Into Large Language Models For Medical Question Answering Yang Rui, Marrese-taylor Edison, Ke Yuhe, Cheng Lechao, Chen Qingyu, Li Irene
- Lidar-llm: Exploring The Potential Of Large Language Models For 3D Lidar Understanding Yang Senqiao, Liu Jiaming, Zhang Ray, Pan Mingjie, Guo Zoey, Li Xiaoqi, Chen Zehui, Gao Peng, Guo Yandong, Zhang Shanghang
- Progressive Evidence Refinement For Open-domain Multimodal Retrieval Question Answering Yang Shuwen, Wu Anran, Wu Xingjiao, Xiao Luwei, Ma Tianlong, Jin Cheng, He Liang
- Zhongjing: Enhancing The Chinese Medical Capabilities Of Large Language Model Through Expert Feedback And Real-world Multi-turn Dialogue Yang Songhua, Zhao Hanjie, Zhu Senbin, Zhou Guangyu, Xu Hongfei, Jia Yuxiang, Zan Hongying
- Psycot: Psychological Questionnaire As Powerful Chain-of-thought For Personality Detection Yang Tao, Shi Tianyuan, Wan Fanqi, Quan Xiaojun, Wang Qifan, Wu Bingzhe, Wu Jiaxiang
- Dior-cvae: Pre-trained Language Models And Diffusion Priors For Variational Dialog Generation Yang Tianyu, Tran Thy Thy, Gurevych Iryna
- Neural Machine Translation Data Generation And Augmentation Using Chatgpt Yang Wayne, Nicolai Garrett
- Bigtranslate: Augmenting Large Language Models With Multilingual Translation Capability Over 100 Languages Yang Wen, Li Chong, Zhang Jiajun, Zong Chengqing
- Exploring An LM To Generate Prolog Predicates From Mathematics Questions Yang Xiaocheng, Tam Yik-cheung
- Human-in-the-loop Machine Translation With Large Language Model Yang Xinyi, Zhan Runzhe, Wong Derek F., Wu Junchao, Chao Lidia S.
- Lever LM: Configuring In-context Sequence To Lever Large Vision Language Models Yang Xu, Peng Yingzhe, Ma Haoxuan, Xu Shuo, Zhang Chi, Han Yucheng, Zhang Hanwang
- Bias A-head? Analyzing Bias In Transformer-based Language Model Attention Heads Yang Yi, Duan Hanyu, Abbasi Ahmed, Lalor John P., Tam Kar Yan
- Mindllm: Pre-training Lightweight Large Language Model From Scratch, Evaluations And Domain Applications Yang Yizhe, Sun Huashan, Li Jiawei, Liu Runheng, Li Yinghao, Liu Yuhang, Huang Heyan, Gao Yang
- Harnessing The Power Of Large Language Models For Natural Language To First-order Logic Translation Yang Yuan, Xiong Siheng, Payani Ali, Shareghi Ehsan, Fekri Faramarz
- Improving The Reliability Of Large Language Models By Leveraging Uncertainty-aware In-context Learning Yang Yuchen, Li Houqiang, Wang Yanfeng, Wang Yu
- On The Planning, Search, And Memorization Capabilities Of Large Language Models Yang Yunhao, Tomar Anshul
- GPT Can Solve Mathematical Problems Without A Calculator Yang Zhen, Ding Ming, Lv Qingsong, Jiang Zhihuan, He Zehai, Guo Yuyi, Bai Jinfeng, Tang Jie
- Surpassing GPT-4 Medical Coding With A Two-stage Approach Yang Zhichao, Batra Sanjit Singh, Stremmel Joel, Halperin Eran
- Coupling Large Language Models With Logic Programming For Robust And General Reasoning From Text Yang Zhun, Ishay Adam, Lee Joohyung
- Quantization-aware And Tensor-compressed Training Of Transformers For Natural Language Understanding Yang Zi, Choudhary Samridhi, Kunzmann Siegfried, Zhang Zheng
- Benchmarking Llm-based Machine Translation On Cultural Awareness Yao Binwei, Jiang Ming, Yang Diyi, Hu Junjie
- Improving Language Model Negotiation With Self-play And In-context Learning From AI Feedback Yao Fu, Hao Peng, Tushar Khot, Mirella Lapata
- Promptcare: Prompt Copyright Protection By Watermark Injection And Verification Yao Hongwei, Lou Jian, Ren Kui, Qin Zhan
- LLM Lies: Hallucinations Are Not Bugs, But Features As Adversarial Examples Yao Jia-yu, Ning Kun-peng, Liu Zhen-hui, Ning Mu-nan, Liu Yu-yang, Yuan Li
- Embodiedgpt: Vision-language Pre-training Via Embodied Chain Of Thought Yao Mu, Qinglong Zhang, Mengkang Hu, Wenhai Wang, Mingyu Ding, Jun Jin, Bin Wang, Jifeng Dai, Yu Qiao, Ping Luo
- Retroformer: Retrospective Large Language Agents With Policy Gradient Optimization Yao Weiran, Heinecke Shelby, Niebles Juan Carlos, Liu Zhiwei, Feng Yihao, Xue Le, Murthy Rithesh, Chen Zeyuan, Zhang Jianguo, Arpit Devansh, Xu Ran, Mui Phil, Wang Huan, Xiong Caiming, Savarese Silvio
- A Survey On Large Language Model (LLM) Security And Privacy: The Good, The Bad, And The Ugly Yao Yifan, Duan Jinhao, Xu Kaidi, Cai Yuanfang, Sun Zhibo, Zhang Yue
- Nanolm: An Affordable LLM Pre-training Benchmark Via Accurate Loss Prediction Across Scales Yao Yiqun, Fan Siqi, Huang Xiusheng, Fang Xuezhi, Li Xiang, Ni Ziyi, Jiang Xin, Meng Xuying, Han Peng, Shang Shuo, Liu Kang, Sun Aixin, Wang Yequan
- Deepspeed-visualchat: Multi-round Multi-image Interleave Chat Via Multi-modal Causal Attention Yao Zhewei, Wu Xiaoxia, Li Conglong, Zhang Minjia, Qin Heyang, Ruwase Olatunji, Awan Ammar Ahmad, Rajbhandari Samyam, He Yuxiong
- Do Physicians Know How To Prompt? The Need For Automatic Prompt Optimization Help In Clinical Note Generation Yao Zonghai, Jaafar Ahmed, Wang Beining, Yang Zhichao, Yu Hong
- Cognitive Mirage: A Review Of Hallucinations In Large Language Models Ye Hongbin, Liu Tong, Zhang Aijia, Hua Wei, Jia Weiqiang
- Ureader: Universal Ocr-free Visually-situated Language Understanding With Multimodal Large Language Model Ye Jiabo, Hu Anwen, Xu Haiyang, Ye Qinghao, Yan Ming, Xu Guohai, Li Chenliang, Tian Junfeng, Qian Qi, Zhang Ji, Jin Qin, He Liang, Lin Xin Alex, Huang Fei
- Rational Decision-making Agent With Internalized Utility Judgment Ye Yining, Cong Xin, Tian Shizuo, Qin Yujia, Liu Chong, Lin Yankai, Liu Zhiyuan, Sun Maosong
- ASPEN: High-throughput Lora Fine-tuning Of Large Language Models With A Single GPU Ye Zhengmao, Li Dengchun, Tian Jingqi, Lan Tingfeng, Zuo Jie, Duan Lei, Lu Hui, Jiang Yexi, Sha Jian, Zhang Ke, Tang Mingjie
- Navigating Text-to-image Customization: From Lycoris Fine-tuning To Model Evaluation Yeh Shih-ying, Hsieh Yu-guan, Gao Zhidong, Yang Bernard B W, Oh Giyeong, Gong Yanmin
- QAID: Question Answering Inspired Few-shot Intent Detection Yehudai Asaf, Vetzler Matan, Mass Yosi, Lazar Koren, Cohen Doron, Carmeli Boaz
- A Multitask, Multilingual, Multimodal Evaluation Of Chatgpt On Reasoning, Hallucination, And Interactivity Yejin Bang, Samuel Cahyawijaya, Nayeon Lee, Wenliang Dai, Dan Su, Bryan Wilie, Holy Lovenia, Ziwei Ji, Tiezheng Yu, Willy Chung, Quyet V. Do, Yan Xu, Pascale Fung
- Ctrlstruct: Dialogue Structure Learning For Open-domain Response Generation Yin Congchi, Li Piji, Ren Zhaochun
- Agent Lumos: Unified And Modular Training For Open-source Language Agents Yin Da, Brahman Faeze, Ravichander Abhilasha, Chandu Khyathi, Chang Kai-wei, Choi Yejin, Lin Bill Yuchen
- Dynosaur: A Dynamic Growth Paradigm For Instruction-tuning Data Curation Yin Da, Liu Xiao, Yin Fan, Zhong Ming, Bansal Hritik, Han Jiawei, Chang Kai-wei
- Do Large Language Models Know What They Don't Know? Yin Zhangyue, Sun Qiushi, Guo Qipeng, Wu Jiawen, Qiu Xipeng, Huang Xuanjing
- LAMM: Language-assisted Multi-modal Instruction-tuning Dataset, Framework, And Benchmark Yin Zhenfei, Wang Jiong, Cao Jianjian, Shi Zhelun, Liu Dingning, Li Mukai, Sheng Lu, Bai Lei, Huang Xiaoshui, Wang Zhiyong, Shao Jing, Ouyang Wanli
- Analyzing And Mitigating Object Hallucination In Large Vision-language Models Yiyang Zhou, Chenhang Cui, Jaehong Yoon, Linjun Zhang, Zhun Deng, Chelsea Finn, Mohit Bansal, Huaxiu Yao
- Low-resource Languages Jailbreak GPT-4 Yong Zheng-xin, Menghini Cristina, Bach Stephen H.
- Prompting Multilingual Large Language Models To Generate Code-mixed Texts: The Case Of South East Asian Languages Yong Zheng-xin, Zhang Ruochen, Forde Jessica Zosa, Wang Skyler, Subramonian Arjun, Lovenia Holy, Cahyawijaya Samuel, Winata Genta Indra, Sutawika Lintang, Cruz Jan Christian Blaise, Tan Yin Lin, Phan Long, Garcia Rowena, Solorio Thamar, Aji Alham Fikri
- Hugginggpt: Solving AI Tasks With Chatgpt And Its Friends In Hugging Face Yongliang Shen, Kaitao Song, Xu Tan, Dongsheng Li, Weiming Lu, Yueting Zhuang
- From Text To Motion: Grounding GPT-4 In A Humanoid Robot "alter3" Yoshida Takahide, Masumori Atsushi, Ikegami Takashi
- Fundamental Limitations Of Alignment In Large Language Models Yotam Wolf, Noam Wies, Oshri Avnery, Yoav Levine, Amnon Shashua
- \(P^{3}O\): Transferring Visual Representations For Reinforcement Learning Via Prompting You Guoliang, Chu Xiaomeng, Duan Yifan, Peng Jie, Ji Jianmin, Zhang Yu, Zhang Yanyong
- Decoding In-context Learning: Neuroscience-inspired Analysis Of Representations In Large Language Models Yousefi Safoora, Betthauser Leo, Hasanbeig Hosein, Millière Raphaël, Momennejad Ida
- OVM, Outcome-supervised Value Models For Planning In Mathematical Reasoning Yu Fei, Gao Anningzhe, Wang Benyou
- GPTFUZZER: Red Teaming Large Language Models With Auto-generated Jailbreak Prompts Yu Jiahao, Lin Xingwei, Yu Zheng, Xing Xinyu
- Assessing Prompt Injection Risks In 200+ Custom Gpts Yu Jiahao, Wu Yuhang, Shu Dong, Jin Mingyu, Yang Sabrina, Xing Xinyu
- Scaling Autoregressive Multi-modal Models: Pretraining And Instruction Tuning Yu Lili, Shi Bowen, Pasunuru Ramakanth, Muller Benjamin, Golovneva Olga, Wang Tianlu, Babu Arun, Tang Binh, Karrer Brian, Sheynin Shelly, Ross Candace, Polyak Adam, Howes Russell, Sharma Vasu, Xu Puxin, Tamoyan Hovhannes, Ashual Oron, Singer Uriel, Li Shang-wen, Zhang Susan, James Richard, Ghosh Gargi, Taigman Yaniv, Fazel-zarandi Maryam, Celikyilmaz Asli, Zettlemoyer Luke, Aghajanyan Armen
- MEGABYTE: Predicting Million-byte Sequences With Multiscale Transformers Yu Lili, Simig Dániel, Flaherty Colin, Aghajanyan Armen, Zettlemoyer Luke, Lewis Mike
- Stateful Large Language Model Serving With Pensieve Yu Lingfan, Li Jinyang
- Metamath: Bootstrap Your Own Mathematical Questions For Large Language Models Yu Longhui, Jiang Weisen, Shi Han, Yu Jincheng, Liu Zhengying, Zhang Yu, Kwok James T., Li Zhenguo, Weller Adrian, Liu Weiyang
- Characterizing Mechanisms For Factual Recall In Language Models Yu Qinan, Merullo Jack, Pavlick Ellie
- Seqgpt: An Out-of-the-box Large Language Model For Open Domain Sequence Understanding Yu Tianyu, Jiang Chengyue, Lou Chao, Huang Shen, Wang Xiaobin, Liu Wei, Cai Jiong, Li Yangning, Li Yinghui, Tu Kewei, Zheng Hai-tao, Zhang Ningyu, Xie Pengjun, Huang Fei, Jiang Yong
- Chain-of-note: Enhancing Robustness In Retrieval-augmented Language Models Yu Wenhao, Zhang Hongming, Pan Xiaoman, Ma Kaixin, Wang Hongwei, Yu Dong
- Prompt-based Monte-carlo Tree Search For Goal-oriented Dialogue Policy Planning Yu Xiao, Chen Maximillian, Yu Zhou
- Reeval: Automatic Hallucination Evaluation For Retrieval-augmented Large Language Models Via Transferable Adversarial Attacks Yu Xiaodong, Cheng Hao, Liu Xiaodong, Roth Dan, Gao Jianfeng
- Finmem: A Performance-enhanced LLM Trading Agent With Layered Memory And Character Design Yu Yangyang, Li Haohang, Chen Zhi, Jiang Yuechen, Li Yang, Zhang Denghui, Liu Rong, Suchow Jordan W., Khashanah Khaldoun
- Large Language Model As Attributed Training Data Generator: A Tale Of Diversity And Bias Yu Yue, Zhuang Yuchen, Zhang Jieyu, Meng Yu, Ratner Alexander, Krishna Ranjay, Shen Jiaming, Zhang Chao
- Prophet: Prompting Large Language Models With Complementary Answer Heuristics For Knowledge-based Visual Question Answering Yu Zhou, Ouyang Xuecheng, Shao Zhenwei, Wang Meng, Yu Jun
- CRAFT: Customizing Llms By Creating And Retrieving From Specialized Toolsets Yuan Lifan, Chen Yangyi, Wang Xingyao, Fung Yi R., Peng Hao, Ji Heng
- Evolving Large Language Model Assistant With Long-term Conditional Memory Yuan Ruifeng, Sun Shichao, Wang Zili, Cao Ziqiang, Li Wenjie
- Beneath Surface Similarity: Large Language Models Make Reasonable Scientific Analogies After Structure Abduction Yuan Siyu, Chen Jiangjie, Ge Xuyang, Xiao Yanghua, Yang Deqing
- Where To Go Next For Recommender Systems? ID- Vs. Modality-based Recommender Models Revisited Yuan Zheng, Yuan Fajie, Song Yu, Li Youhua, Fu Junchen, Yang Fei, Pan Yunzhu, Ni Yongxin
- Multilingual Augmentation For Robust Visual Question Answering In Remote Sensing Images Yuan Zhenghang, Mou Lichao, Zhu Xiao Xiang
- Artgpt-4: Towards Artistic-understanding Large Vision-language Models With Enhanced Adapter Yuan Zhengqing, He Yunhong, Wang Kun, Ye Yanfang, Sun Lichao
- Evotext: Enhancing Natural Language Generation Models Via Self-escalation Learning For Up-to-date Knowledge And Improved Performance Yuan Zhengqing, Xue Huiwen, Zhang Chao, Liu Yongming
- Large Language Model Cascades With Mixture Of Thoughts Representations For Cost-efficient Reasoning Yue Murong, Zhao Jie, Zhang Min, Du Liang, Yao Ziyu
- Plan, Eliminate, And Track -- Language Models Are Good Teachers For Embodied Agents Yue Wu, So Yeon Min, Yonatan Bisk, Ruslan Salakhutdinov, Amos Azaria, Yuanzhi Li, Tom Mitchell, Shrimai Prabhumoye
- Mammoth: Building Math Generalist Models Through Hybrid Instruction Tuning Yue Xiang, Qu Xingwei, Zhang Ge, Fu Yao, Huang Wenhao, Sun Huan, Su Yu, Chen Wenhu
- Prompt Highlighter: Interactive Control For Multi-modal Llms Yuechen Zhang, Shengju Qian, Bohao Peng, Shu Liu, Jiaya Jia
- Attention Satisfies: A Constraint-satisfaction Lens On Factual Errors Of Language Models Yuksekgonul Mert, Chandrasekaran Varun, Jones Erik, Gunasekar Suriya, Naik Ranjita, Palangi Hamid, Kamar Ece, Nushi Besmira
- Chat-rec: Towards Interactive And Explainable Llms-augmented Recommender System Yunfan Gao, Tao Sheng, Youlin Xiang, Yun Xiong, Haofen Wang, Jiawei Zhang
- On Evaluating Adversarial Robustness Of Large Vision-language Models Yunqing Zhao, Tianyu Pang, Chao Du, Xiao Yang, Chongxuan Li, Ngai-man Cheung, Min Lin
- Large Language Models Are Zero-shot Rankers For Recommender Systems Yupeng Hou, Junjie Zhang, Zihan Lin, Hongyu Lu, Ruobing Xie, Julian Mcauley, Wayne Xin Zhao
- Pre-training To Learn In Context Yuxian Gu, Li Dong, Furu Wei, Minlie Huang
- Planting A SEED Of Vision In Large Language Model Yuying Ge, Yixiao Ge, Ziyun Zeng, Xintao Wang, Ying Shan
- Building Trust In Conversational AI: A Comprehensive Review And Solution Architecture For Explainable, Privacy-aware Systems Using Llms And Knowledge Graph Zafar Ahtsham, Parthasarathy Venkatesh Balavadhani, Van Chan Le, Shahid Saad, Khan Aafaq Iqbal, Shahid Arsalan
- Mascqa: A Question Answering Dataset For Investigating Materials Science Knowledge Of Large Language Models Zaki Mohd, Jayadeva, Mausam, Krishnan N. M. Anoop
- Fairness-aware Structured Pruning In Transformers Zayed Abdelrahman, Mordido Goncalo, Shabanian Samira, Baldini Ioana, Chandar Sarath
- Exploring Memorization In Fine-tuned Language Models Zeng Shenglai, Li Yaxin, Ren Jie, Liu Yiding, Xu Han, He Pengfei, Xing Yue, Wang Shuaiqiang, Tang Jiliang, Yin Dawei
- What Matters In Training A Gpt4-style Language Model With Multimodal Inputs? Zeng Yan, Zhang Hanbo, Zheng Jiani, Xia Jiangnan, Wei Guoqiang, Wei Yang, Zhang Yuchen, Kong Tao
- MR-GSM8K: A Meta-reasoning Benchmark For Large Language Model Evaluation Zeng Zhongshen, Chen Pengguang, Liu Shu, Jiang Haiyun, Jia Jiaya
- Is Chatgpt A Good Sentiment Analyzer? A Preliminary Study Zengzhi Wang, Qiming Xie, Yi Feng, Zixiang Ding, Zinong Yang, Rui Xia
- Text Alignment Is An Efficient Unified Model For Massive NLP Tasks Zha Yuheng, Yang Yichi, Li Ruichen, Hu Zhiting
- Investigating The Catastrophic Forgetting In Multimodal Large Language Models Zhai Yuexiang, Tong Shengbang, Li Xiao, Cai Mu, Qu Qing, Lee Yong Jae, Ma Yi
- G3detector: General Gpt-generated Text Detector Zhan Haolan, He Xuanli, Xu Qiongkai, Wu Yuxiang, Stenetorp Pontus
- Removing RLHF Protections In GPT-4 Via Fine-tuning Zhan Qiusi, Fang Richard, Bindu Rohan, Gupta Akul, Hashimoto Tatsunori, Kang Daniel
- Deceptive AI Ecosystems: The Case Of Chatgpt Zhan Xiao, Xu Yifan, Sarkadi Stefan
- Cot-bert: Enhancing Unsupervised Sentence Representation Through Chain-of-thought Zhang Bowen, Chang Kehua, Li Chunping
- A Simple LLM Framework For Long-range Video Question-answering Zhang Ce, Lu Taixi, Islam Md Mohaiminul, Wang Ziyang, Yu Shoubin, Bansal Mohit, Bertasius Gedas
- Mm-narrator: Narrating Long-form Videos With Multimodal In-context Learning Zhang Chaoyi, Lin Kevin, Yang Zhengyuan, Wang Jianfeng, Li Linjie, Lin Chung-ching, Liu Zicheng, Wang Lijuan
- User-controlled Knowledge Fusion In Large Language Models: Balancing Creativity And Hallucination Zhang Chen
- A Comprehensive Analysis Of The Effectiveness Of Large Language Models As Automatic Dialogue Evaluators Zhang Chen, D'haro Luis Fernando, Chen Yiming, Zhang Malu, Li Haizhou
- Xdial-eval: A Multilingual Open-domain Dialogue Evaluation Benchmark Zhang Chen, D'haro Luis Fernando, Tang Chengguang, Shi Ke, Tang Guohua, Li Haizhou
- Task-agnostic Distillation Of Encoder-decoder Language Models Zhang Chen, Yang Yang, Wang Jingang, Song Dawei
- Automatic Generation Of Multiple-choice Questions Zhang Cheng
- Benchmarking And In-depth Performance Study Of Large Language Models On Habana Gaudi Processors Zhang Chengming, Sun Baixi, Yu Xiaodong, Xie Zhen, Zheng Weijian, Iskra Kamil, Beckman Pete, Tao Dingwen
- The Skipped Beat: A Study Of Sociopragmatic Understanding In Llms For 64 Languages Zhang Chiyu, Doan Khai Duy, Liao Qisheng, Abdul-mageed Muhammad
- Taxonomy-based Checklist For Large Language Model Evaluation Zhang Damin
- Mobile-env: Building Qualified Evaluation Benchmarks For LLM-GUI Interaction Zhang Danyang, Shen Zhennan, Xie Rui, Zhang Situo, Xie Tianbao, Zhao Zihan, Chen Siyuan, Chen Lu, Xu Hongshen, Cao Ruisheng, Yu Kai
- Controllable Text Generation With Residual Memory Transformer Zhang Hanqing, Si Sun, Wu Haiming, Song Dawei
- Tuning Large Language Model For End-to-end Speech Translation Zhang Hao, Si Nianwen, Chen Yaqi, Zhang Wenlin, Yang Xukui, Qu Dan, Jiao Xiaolin
- XATU: A Fine-grained Instruction-based Benchmark For Explainable Text Updates Zhang Haopeng, Iso Hayate, Gurajada Sairam, Bhutani Nikita
- Uncovering Hidden Connections: Iterative Search And Reasoning For Video-grounded Dialog Zhang Haoyu, Liu Meng, Wang Yaowei, Cao Da, Guan Weili, Nie Liqiang
- Tractable Control For Autoregressive Language Generation Zhang Honghua, Dang Meihua, Peng Nanyun, Broeck Guy Van Den
- Empirical Evaluation Of Chatgpt On Requirements Information Retrieval Under Zero-shot Setting Zhang Jianzhang, Chen Yiyang, Niu Nan, Wang Yinglin, Liu Chuang
- The Potential And Pitfalls Of Using A Large Language Model Such As Chatgpt Or GPT-4 As A Clinical Assistant Zhang Jingqing, Sun Kai, Jagadeesh Akshay, Ghahfarokhi Mahta, Gupta Deepa, Gupta Ashok, Gupta Vibhor, Guo Yike
- Linear Attention Via Orthogonal Memory Zhang Jun, Jiang Shuyang, Feng Jiangtao, Zheng Lin, Kong Lingpeng
- Llm-based Medical Assistant Personalization With Short- And Long-term Memory Coordination Zhang Kai, Kang Yangyang, Zhao Fubang, Liu Xiaozhong
- Crash: Clustering, Removing, And Sharing Enhance Fine-tuning Without Full Large Language Model Zhang Kaiyan, Ding Ning, Qi Biqing, Zhu Xuekai, Long Xinwei, Zhou Bowen
- Moqagpt: Zero-shot Multi-modal Open-domain Question Answering With Large Language Model Zhang Le, Wu Yihong, Mo Fengran, Nie Jian-yun, Agrawal Aishwarya
- Weighted Sampling For Masked Language Modeling Zhang Linhan, Chen Qian, Wang Wen, Deng Chong, Cao Xin, Hao Kongzhang, Jiang Yuxin, Wang Wei
- Lora-fa: Memory-efficient Low-rank Adaptation For Large Language Models Fine-tuning Zhang Longteng, Zhang Lin, Shi Shaohuai, Chu Xiaowen, Li Bo
- Knowgpt: Knowledge Graph Based Prompting For Large Language Models Zhang Qinggang, Dong Junnan, Chen Hao, Zha Daochen, Yu Zailiang, Huang Xiao
- Tell Your Model Where To Attend: Post-hoc Attention Steering For Llms Zhang Qingru, Singh Chandan, Liu Liyuan, Liu Xiaodong, Yu Bin, Gao Jianfeng, Zhao Tuo
- Llama-adapter: Efficient Fine-tuning Of Language Models With Zero-init Attention Zhang Renrui, Han Jiaming, Liu Chris, Gao Peng, Zhou Aojun, Hu Xiangfei, Yan Shilin, Lu Pan, Li Hongsheng, Qiao Yu
- Generation-driven Contrastive Self-training For Zero-shot Text Classification With Instruction-following LLM Zhang Ruohong, Wang Yau-shian, Yang Yiming
- Automl-gpt: Automatic Machine Learning With GPT Zhang Shujian, Gong Chengyue, Wu Lemeng, Liu Xingchao, Zhou Mingyuan
- Planning With Large Language Models For Code Generation Zhang Shun, Chen Zhenfang, Shen Yikang, Ding Mingyu, Tenenbaum Joshua B., Gan Chuang
- Modeling Sequential Sentence Relation To Improve Cross-lingual Dense Retrieval Zhang Shunyu, Liang Yaobo, Gong Ming, Jiang Daxin, Duan Nan
- Learning To Predict Concept Ordering For Common Sense Generation Zhang Tianhui, Bollegala Danushka, Peng Bei
- The Wisdom Of Hindsight Makes Language Models Better Instruction Followers Zhang Tianjun, Liu Fangchen, Wong Justin, Abbeel Pieter, Gonzalez Joseph E.
- History-aware Hierarchical Transformer For Multi-session Open-domain Dialogue System Zhang Tong, Liu Yong, Li Boyang, Zeng Zhiwei, Wang Pengwei, You Yuan, Miao Chunyan, Cui Lizhen
- M3exam: A Multilingual, Multimodal, Multilevel Benchmark For Examining Large Language Models Zhang Wenxuan, Aljunied Sharifah Mahani, Gao Chang, Chia Yew Ken, Bing Lidong
- Don't Trust Chatgpt When Your Question Is Not In English: A Study Of Multilingual Abilities And Types Of Llms Zhang Xiang, Li Senyu, Hauer Bradley, Shi Ning, Kondrak Grzegorz
- Large Language Models In Medical Term Classification And Unexpected Misalignment Between Response And Reasoning Zhang Xiaodan, Vemulapalli Sandeep, Talukdar Nabasmita, Ahn Sumyeong, Wang Jiankun, Meng Han, Murtaza Sardar Mehtab Bin, Dave Aakash Ajay, Leshchiner Dmitry, Joseph Dimitri F., Witteveen-lane Martin, Chesla Dave, Zhou Jiayu, Chen Bin
- Language Models Are Universal Embedders Zhang Xin, Li Zehan, Zhang Yanzhao, Long Dingkun, Xie Pengjun, Zhang Meishan, Zhang Min
- Gpt-4v(ision) As A Generalist Evaluator For Vision-language Tasks Zhang Xinlu, Lu Yujie, Wang Weizhi, Yan An, Yan Jun, Qin Lianke, Wang Heng, Yan Xifeng, Wang William Yang, Petzold Linda Ruth
- CGCE: A Chinese Generative Chat Evaluation Benchmark For General And Financial Domains Zhang Xuanyu, Li Bingbing, Yang Qing
- Collm: Integrating Collaborative Embeddings Into Large Language Models For Recommendation Zhang Yang, Feng Fuli, Zhang Jizhi, Bao Keqin, Wang Qifan, He Xiangnan
- Llavar: Enhanced Visual Instruction Tuning For Text-rich Image Understanding Zhang Yanzhe, Zhang Ruiyi, Gu Jiuxiang, Zhou Yufan, Lipka Nedim, Yang Diyi, Sun Tong
- Meta Prompting For AI Systems Zhang Yifan, Yuan Yang, Yao Andrew Chi-chih
- Llamavae: Guiding Large Language Model Generation Via Continuous Latent Sentence Spaces Zhang Yingji, Carvalho Danilo S., Pratt-hartmann Ian, Freitas André
- Probing The Multi-turn Planning Capabilities Of Llms Via 20 Question Games Zhang Yizhe, Lu Jiarui, Jaitly Navdeep
- Alleviating Hallucinations Of Large Language Models Through Induced Hallucinations Zhang Yue, Cui Leyang, Bi Wei, Shi Shuming
- Llmeval: A Preliminary Study On How To Evaluate Large Language Models Zhang Yue, Zhang Ming, Yuan Haipeng, Liu Shichun, Shi Yongyao, Gui Tao, Zhang Qi, Huang Xuanjing
- Cached Transformers: Improving Transformers With Differentiable Memory Cache Zhang Zhaoyang, Shao Wenqi, Ge Yixiao, Wang Xiaogang, Gu Jinwei, Luo Ping
- Mani-gpt: A Generative Model For Interactive Robotic Manipulation Zhang Zhe, Chai Wei, Wang Jiankun
- IAG: Induction-augmented Generation Framework For Answering Reasoning Questions Zhang Zhebin, Zhang Xinyu, Ren Yuanhang, Shi Saijiang, Han Meng, Wu Yongkang, Lai Ruofei, Cao Zhao
- VISAR: A Human-ai Argumentative Writing Assistant With Visual Programming And Rapid Draft Prototyping Zhang Zheng, Gao Jie, Dhaliwal Ranjodh Singh, Li Toby Jia-jun
- Students' Perceptions And Preferences Of Generative Artificial Intelligence Feedback For Programming Zhang Zhengdong, Dong Zihan, Shi Yang, Matsuda Noboru, Price Thomas, Xu Dongkuan
- H\(_2\)O: Heavy-hitter Oracle For Efficient Generative Inference Of Large Language Models Zhang Zhenyu, Sheng Ying, Zhou Tianyi, Chen Tianlong, Zheng Lianmin, Cai Ruisi, Song Zhao, Tian Yuandong, Ré Christopher, Barrett Clark, Wang Zhangyang, Chen Beidi
- Defending Large Language Models Against Jailbreaking Attacks Through Goal Prioritization Zhang Zhexin, Yang Junxiao, Ke Pei, Mi Fei, Wang Hongning, Huang Minlie
- Accountable Textual-visual Chat Learns To Reject Human Instructions In Image Re-creation Zhang Zhiwei, Liu Yuliang
- Responsible Task Automation: Empowering Large Language Models As Responsible Task Automators Zhang Zhizheng, Zhang Xiaoyi, Xie Wenxuan, Lu Yan
- Unifying The Perspectives Of NLP And Software Engineering: A Survey On Language Models For Code Zhang Ziyin, Chen Chaoyu, Liu Bingchang, Liao Cong, Gong Zi, Yu Hang, Li Jianguo, Wang Rui
- MELA: Multilingual Evaluation Of Linguistic Acceptability Zhang Ziyin, Liu Yikang, Huang Weifang, Mao Junyu, Wang Rui, Hu Hai
- Tuning Layernorm In Attention: Towards Efficient Multi-modal LLM Finetuning Zhao Bingchen, Tu Haoqin, Wei Chen, Mei Jieru, Xie Cihang
- SVIT: Scaling Up Visual Instruction Tuning Zhao Bo, Wu Boya, He Muyang, Huang Tiejun
- Large Language Models Are Complex Table Parsers Zhao Bowen, Ji Changkai, Zhang Yuejie, He Wen, Wang Yingwen, Wang Qing, Feng Rui, Zhang Xiaobo
- Explainability For Large Language Models: A Survey Zhao Haiyan, Chen Hanjie, Yang Fan, Liu Ninghao, Deng Huiqi, Cai Hengyi, Wang Shuaiqiang, Yin Dawei, Du Mengnan
- Genixer: Empowering Multimodal Large Language Models As A Powerful Data Generator Zhao Henry Hengyuan, Zhou Pan, Shou Mike Zheng
- Explicit Planning Helps Language Models In Logical Reasoning Zhao Hongyu, Wang Kangrui, Yu Mo, Mei Hongyuan
- Automatic Model Selection With Large Language Models For Reasoning Zhao James Xu, Xie Yuxi, Kawaguchi Kenji, He Junxian, Xie Michael Qizhe
- Chbias: Bias Evaluation And Mitigation Of Chinese Conversational Language Models Zhao Jiaxu, Fang Meng, Shi Zijing, Li Yitong, Chen Ling, Pechenizkiy Mykola
- Chatspot: Bootstrapping Multimodal Llms Via Precise Referring Instruction Tuning Zhao Liang, Yu En, Ge Zheng, Yang Jinrong, Wei Haoran, Zhou Hongyu, Sun Jianjian, Peng Yuang, Dong Runpei, Han Chunrui, Zhang Xiangyu
- Hallucination Detection For Grounded Instruction Generation Zhao Lingjun, Nguyen Khanh, Daumé Hal III
- Verify-and-edit: A Knowledge-enhanced Chain-of-thought Framework Zhao Ruochen, Li Xingxuan, Joty Shafiq, Qin Chengwei, Bing Lidong
- Group Preference Optimization: Few-shot Alignment Of Large Language Models Zhao Siyan, Dang John, Grover Aditya
- A Survey Of Large Language Models Zhao Wayne Xin, Zhou Kun, Li Junyi, Tang Tianyi, Wang Xiaolei, Hou Yupeng, Min Yingqian, Zhang Beichen, Zhang Junjie, Dong Zican, Du Yifan, Yang Chen, Chen Yushuo, Chen Zhipeng, Jiang Jinhao, Ren Ruiyang, Li Yifan, Tang Xinyu, Liu Zikang, Liu Peiyu, Nie Jian-yun, Wen Ji-rong
- Is Chatgpt Equipped With Emotional Dialogue Capabilities? Zhao Weixiang, Zhao Yanyan, Lu Xin, Wang Shilong, Tong Yanpeng, Qin Bing
- Babystories: Can Reinforcement Learning Teach Baby Language Models To Write Better Stories? Zhao Xingmeng, Wang Tongnian, Osborn Sheri, Rios Anthony
- Mind Vs. Mouth: On Measuring Re-judge Inconsistency Of Social Bias In Large Language Models Zhao Yachao, Wang Bo, Zhao Dongming, Huang Kun, Wang Yan, He Ruifang, Hou Yuexian
- Investigating Table-to-text Generation Capabilities Of Llms In Real-world Information Seeking Scenarios Zhao Yilun, Zhang Haowei, Si Shengyun, Nan Linyong, Tang Xiangru, Cohan Arman
- Robut: A Systematic Study Of Table QA Robustness Against Human-annotated Adversarial Perturbations Zhao Yilun, Zhao Chen, Nan Linyong, Qi Zhenting, Zhang Wenlin, Tang Xiangru, Mi Boyu, Radev Dragomir
- Knowing What Llms DO NOT Know: A Simple Yet Effective Self-detection Method Zhao Yukun, Yan Lingyong, Sun Weiwei, Xing Guoliang, Meng Chong, Wang Shuaiqiang, Cheng Zhicong, Ren Zhaochun, Yin Dawei
- Mllm-dataengine: An Iterative Refinement Approach For MLLM Zhao Zhiyuan, Ouyang Linke, Wang Bin, Huang Siyuan, Zhang Pan, Dong Xiaoyi, Wang Jiaqi, He Conghui
- Can We Edit Factual Knowledge By In-context Learning? Zheng Ce, Li Lei, Dong Qingxiu, Fan Yuxuan, Wu Zhiyong, Xu Jingjing, Chang Baobao
- Progressive-hint Prompting Improves Reasoning In Large Language Models Zheng Chuanyang, Liu Zhengying, Xie Enze, Li Zhenguo, Li Yu
- Learn From Model Beyond Fine-tuning: A Survey Zheng Hongling, Shen Li, Tang Anke, Luo Yong, Hu Han, Du Bo, Tao Dacheng
- Take A Step Back: Evoking Reasoning Via Abstraction In Large Language Models Zheng Huaixiu Steven, Mishra Swaroop, Chen Xinyun, Cheng Heng-tze, Chi Ed H., Le Quoc V, Zhou Denny
- Explainable Multimodal Emotion Recognition Zheng Lian, Haiyang Sun, Licai Sun, Hao Gu, Zhuofan Wen, Siyuan Zhang, Shun Chen, Mingyu Xu, Ke Xu, Kang Chen, Lan Chen, Shan Liang, Ya Li, Jiangyan Yi, Bin Liu, Jianhua Tao
- Lmsys-chat-1m: A Large-scale Real-world LLM Conversation Dataset Zheng Lianmin, Chiang Wei-lin, Sheng Ying, Li Tianle, Zhuang Siyuan, Wu Zhanghao, Zhuang Yonghao, Li Zhuohan, Lin Zi, Xing Eric P., Gonzalez Joseph E., Stoica Ion, Zhang Hao
- Judging Llm-as-a-judge With Mt-bench And Chatbot Arena Zheng Lianmin, Chiang Wei-lin, Sheng Ying, Zhuang Siyuan, Wu Zhanghao, Zhuang Yonghao, Lin Zi, Li Zhuohan, Li Dacheng, Xing Eric P., Zhang Hao, Gonzalez Joseph E., Stoica Ion
- Sglang: Efficient Execution Of Structured Language Model Programs Zheng Lianmin, Yin Liangsheng, Xie Zhiqiang, Sun Chuyue, Huang Jeff, Yu Cody Hao, Cao Shiyi, Kozyrakis Christos, Stoica Ion, Gonzalez Joseph E., Barrett Clark, Sheng Ying
- Memorycompanion: A Smart Healthcare Solution To Empower Efficient Alzheimer's Care Via Unleashing Generative AI Zheng Lifei, Heo Yeonie, Fang Yi
- Trojfsp: Trojan Insertion In Few-shot Prompt Tuning Zheng Mengxin, Xue Jiaqi, Chen Xun, Wang Yanshan, Lou Qian, Jiang Lei
- Trafficsafetygpt: Tuning A Pre-trained Large Language Model To A Domain-specific Expert In Transportation Safety Zheng Ou, Abdel-aty Mohamed, Wang Dongdong, Wang Chenzhu, Ding Shengxuan
- Gpt-fathom: Benchmarking Large Language Models To Decipher The Evolutionary Path Towards GPT-4 And Beyond Zheng Shen, Zhang Yuyu, Zhu Yijie, Xi Chenguang, Gao Pengyang, Zhou Xun, Chang Kevin Chen-chuan
- Towards Unified Token Learning For Vision-language Tracking Zheng Yaozong, Zhong Bineng, Liang Qihua, Li Guorong, Ji Rongrong, Li Xianxian
- Response Length Perception And Sequence Scheduling: An Llm-empowered LLM Inference Pipeline Zheng Zangwei, Ren Xiaozhe, Xue Fuzhao, Luo Yang, Jiang Xin, You Yang
- Building Emotional Support Chatbots In The Era Of Llms Zheng Zhonghua, Liao Lizi, Deng Yang, Nie Liqiang
- Marinegpt: Unlocking Secrets Of Ocean To The Public Zheng Ziqiang, Zhang Jipeng, Vu Tuan-anh, Diao Shizhe, Tim Yue Him Wong, Yeung Sai-kit
- MM-REACT: Prompting Chatgpt For Multimodal Reasoning And Action Zhengyuan Yang, Linjie Li, Jianfeng Wang, Kevin Lin, Ehsan Azarnasab, Faisal Ahmed, Zicheng Liu, Ce Liu, Michael Zeng, Lijuan Wang
- Aligning Large Multimodal Models With Factually Augmented RLHF Zhiqing Sun, Sheng Shen, Shengcao Cao, Haotian Liu, Chunyuan Li, Yikang Shen, Chuang Gan, Liang-yan Gui, Yu-xiong Wang, Yiming Yang, Kurt Keutzer, Trevor Darrell
- Principle-driven Self-alignment Of Language Models From Scratch With Minimal Human Supervision Zhiqing Sun, Yikang Shen, Qinhong Zhou, Hongxin Zhang, Zhenfang Chen, David Cox, Yiming Yang, Chuang Gan
- Exploring Human-like Translation Strategy With Large Language Models Zhiwei He, Tian Liang, Wenxiang Jiao, Zhuosheng Zhang, Yujiu Yang, Rui Wang, Zhaopeng Tu, Shuming Shi, Xing Wang
- Mitigating Hallucination In Visual Language Models With Visual Supervision Zhiyang Chen, Yousong Zhu, Yufei Zhan, Zhaowen Li, Chaoyang Zhao, Jinqiao Wang, Ming Tang
- Beyond Hallucinations: Enhancing Lvlms Through Hallucination-aware Direct Preference Optimization Zhiyuan Zhao, Bin Wang, Linke Ouyang, Xiaoyi Dong, Jiaqi Wang, Conghui He
- Bookgpt: A General Framework For Book Recommendation Empowered By Large Language Model Zhiyuli Aakas, Chen Yanfang, Zhang Xuan, Liang Xun
- Can Chatgpt Understand Too? A Comparative Study On Chatgpt And Fine-tuned BERT Zhong Qihuang, Ding Liang, Liu Juhua, Du Bo, Tao Dacheng
- Revisiting Token Dropping Strategy In Efficient BERT Pretraining Zhong Qihuang, Ding Liang, Liu Juhua, Liu Xuebo, Zhang Min, Du Bo, Tao Dacheng
- Bag Of Tricks For Effective Language Model Pretraining And Downstream Adaptation: A Case Study On GLUE Zhong Qihuang, Ding Liang, Peng Keqin, Liu Juhua, Du Bo, Shen Li, Zhan Yibing, Tao Dacheng
- Chatabl: Abductive Learning Via Natural Language Interaction With Chatgpt Zhong Tianyang, Wei Yaonai, Yang Li, Wu Zihao, Liu Zhengliang, Wei Xiaozheng, Li Wenjun, Yao Junjie, Ma Chong, Li Xiang, Zhu Dajiang, Jiang Xi, Han Junwei, Shen Dinggang, Liu Tianming, Zhang Tuo
- Mquake: Assessing Knowledge Editing In Language Models Via Multi-hop Questions Zhong Zexuan, Wu Zhengxuan, Manning Christopher D., Potts Christopher, Chen Danqi
- Language Agent Tree Search Unifies Reasoning Acting And Planning In Language Models Zhou Andy, Yan Kai, Shlapentokh-rothman Michal, Wang Haohan, Wang Yu-xiong
- Solving Challenging Math Word Problems Using GPT-4 Code Interpreter With Code-based Self-verification Zhou Aojun, Wang Ke, Lu Zimu, Shi Weikang, Luo Sichun, Qin Zipeng, Lu Shaoqing, Jia Anya, Song Linqi, Zhan Mingjie, Li Hongsheng
- Navgpt: Explicit Reasoning In Vision-and-language Navigation With Large Language Models Zhou Gengze, Hong Yicong, Wu Qi
- Ethical Chatgpt: Concerns, Challenges, And Commandments Zhou Jianlong, Müller Heimo, Holzinger Andreas, Chen Fang
- Characterglm: Customizing Chinese Conversational AI Characters With Large Language Models Zhou Jinfeng, Chen Zhuang, Wan Dazhen, Wen Bosi, Song Yi, Yu Jifan, Huang Yongkang, Peng Libiao, Yang Jiaming, Xiao Xiyao, Sabour Sahand, Zhang Xiaohan, Hou Wenjing, Zhang Yijia, Dong Yuxiao, Tang Jie, Huang Minlie
- GPT As A Baseline For Recommendation Explanation Texts Zhou Joyce, Joachims Thorsten
- Don't Make Your LLM An Evaluation Benchmark Cheater Zhou Kun, Zhu Yutao, Chen Zhipeng, Chen Wentong, Zhao Wayne Xin, Chen Xu, Lin Yankai, Wen Ji-rong, Han Jiawei
- CONA: A Novel Context-aware Instruction Paradigm For Communication Using Large Language Model Zhou Nan, Tao Xinghui, Chen Xi
- Exploring Recommendation Capabilities Of Gpt-4v(ision): A Preliminary Case Study Zhou Peilin, Cao Meng, Huang You-liang, Ye Qichen, Zhang Peiyan, Liu Junling, Xie Yueqi, Hua Yining, Kim Jaeboum
- Unleashing GPT On The Metaverse: Savior Or Destroyer? Zhou Pengyuan
- Chinese Intermediate English Learners Outdid Chatgpt In Deep Cohesion: Evidence From English Narrative Writing Zhou Tongquan, Cao Siyi, Zhou Siruo, Zhang Yao, He Aijing
- Complementary Advantages Of Chatgpts And Human Readers In Reasoning: Evidence From English Text Reading Comprehension Zhou Tongquan, Zhang Yao, Cao Siyi, Li Yulu, Wang Tao
- Recurrentgpt: Interactive Generation Of (arbitrarily) Long Text Zhou Wangchunshu, Jiang Yuchen Eleanor, Cui Peng, Wang Tiannan, Xiao Zhenxin, Hou Yifan, Cotterell Ryan, Sachan Mrinmaya
- Multi-stage Pre-training Enhanced By Chatgpt For Multi-scenario Multi-domain Dialogue Summarization Zhou Weixiao, Li Gengyao, Cheng Xianfu, Liang Xinnian, Zhu Junnan, Zhai Feifei, Li Zhoujun
- Flame: Few-shot Learning From Natural Language Explanations Zhou Yangqiaoyu, Zhang Yiming, Tan Chenhao
- Can GPT Models Follow Human Summarization Guidelines? Evaluating Chatgpt And GPT-4 For Dialogue Summarization Zhou Yongxin, Ringeval Fabien, Portet François
- Advancing Transformer's Capabilities In Commonsense Reasoning Zhou Yu, Han Yunqiu, Zhou Hanyu, Wu Yulun
- Quantifying And Analyzing Entity-level Memorization In Large Language Models Zhou Zhenhong, Xiang Jiuyang, Chen Chaomeng, Su Sen
- Avatargpt: All-in-one Framework For Motion Understanding, Planning, Generation And Beyond Zhou Zixiang, Wan Yu, Wang Baoyuan
- On The Calibration Of Large Language Models And Alignment Zhu Chiwei, Xu Benfeng, Wang Quan, Zhang Yongdong, Mao Zhendong
- Dialogue-contextualized Re-ranking For Medical History-taking Zhu Jian, Valmianski Ilya, Kannan Anitha
- VL-GPT: A Generative Pre-trained Transformer For Vision And Language Understanding And Generation Zhu Jinguo, Ding Xiaohan, Ge Yixiao, Ge Yuying, Zhao Sijie, Zhao Hengshuang, Wang Xiaohua, Shan Ying
- Judgelm: Fine-tuned Large Language Models Are Scalable Judges Zhu Lianghui, Wang Xinggang, Wang Xinlong
- Coca: Fusing Position Embedding With Collinear Constrained Attention In Transformers For Long Context Window Extending Zhu Shiyi, Ye Jing, Jiang Wei, Xue Siqiao, Zhang Qi, Wu Yifan, Li Jianguo
- Multilingual Machine Translation With Large Language Models: Empirical Results And Analysis Zhu Wenhao, Liu Hongyi, Dong Qingxiu, Xu Jingjing, Huang Shujian, Kong Lingpeng, Chen Jiajun, Li Lei
- Extrapolating Large Language Models To Non-english By Aligning Languages Zhu Wenhao, Lv Yunzhe, Dong Qingxiu, Yuan Fei, Xu Jingjing, Huang Shujian, Kong Lingpeng, Chen Jiajun, Li Lei
- 3d-vista: Pre-trained Transformer For 3D Vision And Text Alignment Zhu Ziyu, Ma Xiaojian, Chen Yixin, Deng Zhidong, Huang Siyuan, Li Qing
- Red Teaming Chatgpt Via Jailbreaking: Bias, Robustness, Reliability And Toxicity Zhuo Terry Yue, Huang Yujin, Chen Chunyang, Xing Zhenchang
- Scissorhands: Exploiting The Persistence Of Importance Hypothesis For LLM KV Cache Compression At Test Time Zichang Liu, Aditya Desai, Fangshuo Liao, Weitao Wang, Victor Xie, Zhaozhuo Xu, Anastasios Kyrillidis, Anshumali Shrivastava
- Aligner: One Global Token Is Worth Millions Of Parameters When Aligning Large Language Models Ziheng Zhou, Wu Yingnian, Zhu Song-chun, Terzopoulos Demetri
- Recommender Systems In The Era Of Large Language Models (llms) Zihuai Zhao, Wenqi Fan, Jiatong Li, Yunqing Liu, Xiaowei Mei, Yiqi Wang, Zhen Wen, Fei Wang, Xiangyu Zhao, Jiliang Tang, Qing Li
- A Family Of Pretrained Transformer Language Models For Russian Zmitrovich Dmitry, Abramov Alexander, Kalmykov Andrey, Tikhonova Maria, Taktasheva Ekaterina, Astafurov Danil, Baushenko Mark, Snegirev Artem, Kadulin Vitalii, Markov Sergey, Shavrina Tatiana, Mikhailov Vladislav, Fenogenova Alena
- Universal And Transferable Adversarial Attacks On Aligned Language Models Zou Andy, Wang Zifan, Carlini Nicholas, Nasr Milad, Kolter J. Zico, Fredrikson Matt
- Dr Chatgpt, Tell Me What I Want To Hear: How Prompt Knowledge Impacts Health Answer Correctness Zuccon Guido, Koopman Bevan
- Chatgpt Hallucinates When Attributing Answers Zuccon Guido, Koopman Bevan, Shaik Razia
- Concept-aware Training Improves In-context Learning Ability Of Language Models Štefánik Michal, Kadlčík Marek
- A Dataset And Benchmark For Hospital Course Summarization With Adapted Large Language Models Aali Asad, Van Veen Dave, Arefeen Yamin Ishraq, Hom Jason, Bluethgen Christian, Reis Eduardo Pontes, Gatidis Sergios, Clifford Namuun, Daws Joseph, Tehrani Arash S., Kim Jangwon, Chaudhari Akshay S.
- Benchmarking Large Language Models For Persian: A Preliminary Study Focusing On Chatgpt Abaskohi Amirhossein, Baruni Sara, Masoudi Mostafa, Abbasi Nesa, Babalou Mohammad Hadi, Edalat Ali, Kamahi Sepehr, Sani Samin Mahdizadeh, Naghavian Nikoo, Namazifard Danial, Sadeghi Pouya, Yaghoobzadeh Yadollah
- Bcamirs At Semeval-2024 Task 4: Beyond Words: A Multimodal And Multilingual Exploration Of Persuasion In Memes Abaskohi Amirhossein, Dabiriaghdam Amirhossein, Wang Lele, Carenini Giuseppe
- Knowledge-infused Llm-powered Conversational Health Agent: A Case Study For Diabetes Patients Abbasian Mahyar, Yang Zhongqi, Khatibi Elahe, Zhang Pengfei, Nagesh Nitish, Azimi Iman, Jain Ramesh, Rahmani Amir M.
- Granite-function Calling Model: Introducing Function Calling Abilities Via Multi-task Learning Of Granular Tasks Abdelaziz Ibrahim, Basu Kinjal, Agarwal Mayank, Kumaravel Sadhana, Stallone Matthew, Panda Rameswar, Rizk Yara, Bhargav GP, Crouse Maxwell, Gunasekara Chulaka, Ikbal Shajith, Joshi Sachin, Karanam Hima, Kumar Vineet, Munawar Asim, Neelam Sumit, Raghu Dinesh, Sharma Udit, Soria Adriana Meza, Sreedhar Dheeraj, Venkateswaran Praveen, Unuvar Merve, Cox David, Roukos Salim, Lastras Luis, Kapanipathi Pavan
- Olvit: Multi-modal State Tracking Via Attention-based Embeddings For Video-grounded Dialog Abdessaied Adnen, Von Hochmeister Manuel, Bulling Andreas
- Phi-3 Technical Report: A Highly Capable Language Model Locally On Your Phone Abdin Marah, Aneja Jyoti, Awadalla Hany, Awadallah Ahmed, Awan Ammar Ahmad, Bach Nguyen, Bahree Amit, Bakhtiari Arash, Bao Jianmin, Behl Harkirat, Benhaim Alon, Bilenko Misha, Bjorck Johan, Bubeck Sébastien, Cai Martin, Cai Qin, Chaudhary Vishrav, Chen Dong, Chen Dongdong, Chen Weizhu, Chen Yen-chun, Chen Yi-ling, Cheng Hao, Chopra Parul, Dai Xiyang, Dixon Matthew, Eldan Ronen, Fragoso Victor, Gao Jianfeng, Gao Mei, Gao Min, Garg Amit, Del Giorno Allie, Goswami Abhishek, Gunasekar Suriya, Haider Emman, Hao Junheng, Hewett Russell J., Hu Wenxiang, Huynh Jamie, Iter Dan, Jacobs Sam Ade, Javaheripi Mojan, Jin Xin, Karampatziakis Nikos, Kauffmann Piero, Khademi Mahoud, Kim Dongwoo, Kim Young Jin, Kurilenko Lev, Lee James R., Lee Yin Tat, Li Yuanzhi, Li Yunsheng, Liang Chen, Liden Lars, Lin Xihui, Lin Zeqi, Liu Ce, Liu Liyuan, Liu Mengchen, Liu Weishung, Liu Xiaodong, Luo Chong, Madan Piyush, Mahmoudzadeh Ali, Majercak David, Mazzola Matt, Mendes Caio César Teodoro, Mitra Arindam, Modi Hardik, Nguyen Anh, Norick Brandon, Patra Barun, Perez-becker Daniel, Portet Thomas, Pryzant Reid, Qin Heyang, Radmilac Marko, Ren Liliang, De Rosa Gustavo, Rosset Corby, Roy Sambudha, Ruwase Olatunji, Saarikivi Olli, Saied Amin, Salim Adil, Santacroce Michael, Shah Shital, Shang Ning, Sharma Hiteshi, Shen Yelong, Shukla Swadheen, Song Xia, Tanaka Masahiro, Tupini Andrea, Vaddamanu Praneetha, Wang Chunyu, Wang Guanhua, Wang Lijuan, Wang Shuohang, Wang Xin, Wang Yu, Ward Rachel, Wen Wen, Witte Philipp, Wu Haiping, Wu Xiaoxia, Wyatt Michael, Xiao Bin, Xu Can, Xu Jiahang, Xu Weijian, Xue Jilong, Yadav Sonali, Yang Fan, Yang Jianwei, Yang Yifan, Yang Ziyi, Yu Donghan, Yuan Lu, Zhang Chenruidong, Zhang Cyril, Zhang Jianwen, Zhang Li Lyna, Zhang Yi, Zhang Yue, Zhang Yunan, Zhou Xiren
- Synthetic Dialogue Dataset Generation Using LLM Agents Abdullin Yelaman, Molla-aliod Diego, Ofoghi Bahadorreza, Yearwood John, Li Qingyang
- The Challenges Of Evaluating LLM Applications: An Analysis Of Automated, Human, And Llm-based Approaches Abeysinghe Bhashithe, Circi Ruhan
- The Llama 3 Herd Of Models Abhimanyu Dubey, Abhinav Jauhri, Abhinav Pandey, Abhishek Kadian, Ahmad Al-dahle, Aiesha Letman, Akhil Mathur, Alan Schelten, Amy Yang, Angela Fan, Anirudh Goyal, Anthony Hartshorn, Aobo Yang, Archi Mitra, Archie Sravankumar, Artem Korenev, Arthur Hinsvark, Arun Rao, Aston Zhang, Aurelien Rodriguez, Austen Gregerson, Ava Spataru, Baptiste Roziere, Bethany Biron, Binh Tang, Bobbie Chern, Charlotte Caucheteux, Chaya Nayak, Chloe Bi, Chris Marra, Chris Mcconnell, Christian Keller, Christophe Touret, Chunyang Wu, Corinne Wong, Cristian Canton Ferrer, Cyrus Nikolaidis, Damien Allonsius, Daniel Song, Danielle Pintz, Danny Livshits, David Esiobu, Dhruv Choudhary, Dhruv Mahajan, Diego Garcia-olano, Diego Perino, Dieuwke Hupkes, Egor Lakomkin, Ehab Albadawy, Elina Lobanova, Emily Dinan, Eric Michael Smith, Filip Radenovic, Frank Zhang, Gabriel Synnaeve, Gabrielle Lee, Georgia Lewis Anderson, Graeme Nail, Gregoire Mialon, Guan Pang, Guillem Cucurell, Hailey Nguyen, Hannah Korevaar, Hu Xu, Hugo Touvron, Iliyan Zarov, Imanol Arrieta Ibarra, Isabel Kloumann, Ishan Misra, Ivan Evtimov, Jade Copet, Jaewon Lee, Jan Geffert, Jana Vranes, Jason Park, Jay Mahadeokar, Jeet Shah, Jelmer Van Der Linde, Jennifer Billock, Jenny Hong, Jenya Lee, Jeremy Fu, Jianfeng Chi, Jianyu Huang, Jiawen Liu, Jie Wang, Jiecao Yu, Joanna Bitton, Joe Spisak, Jongsoo Park, Joseph Rocca, Joshua Johnstun, Joshua Saxe, Junteng Jia, Kalyan Vasuden Alwala, Kartikeya Upasani, Kate Plawiak, Ke Li, Kenneth Heafield, Kevin Stone, Khalid El-arini, Krithika Iyer, Kshitiz Malik, Kuenley Chiu, Kunal Bhalla, Lauren Rantala-yeary, Laurens Van Der Maaten, Lawrence Chen, Liang Tan, Liz Jenkins, Louis Martin, Lovish Madaan, Lubo Malo, Lukas Blecher, Lukas Landzaat, Luke De Oliveira, Madeline Muzzi, Mahesh Pasupuleti, Mannat Singh, Manohar Paluri, Marcin Kardas, Mathew Oldham, Mathieu Rita, Maya Pavlova, Melanie Kambadur, Mike Lewis, Min Si, Mitesh Kumar Singh, Mona Hassan, Naman Goyal, Narjes Torabi, Nikolay Bashlykov, Nikolay Bogoychev, Niladri Chatterji, Olivier Duchenne, Onur Çelebi, Patrick Alrassy, Pengchuan Zhang, Pengwei Li, Petar Vasic, Peter Weng, Prajjwal Bhargava, Pratik Dubal, Praveen Krishnan, Punit Singh Koura, Puxin Xu, Qing He, Qingxiao Dong, Ragavan Srinivasan, Raj Ganapathy, Ramon Calderer, Ricardo Silveira Cabral, Robert Stojnic, Roberta Raileanu, Rohit Girdhar, Rohit Patel, Romain Sauvestre, Ronnie Polidoro, Roshan Sumbaly, Ross Taylor, Ruan Silva, Rui Hou, Rui Wang, Saghar Hosseini, Sahana Chennabasappa, Sanjay Singh, Sean Bell, Seohyun Sonia Kim, Sergey Edunov, Shaoliang Nie, Sharan Narang, Sharath Raparthy, Sheng Shen, Shengye Wan, Shruti Bhosale, Shun Zhang, Simon Vandenhende, Soumya Batra, Spencer Whitman, Sten Sootla, Stephane Collot, Suchin Gururangan, Sydney Borodinsky, Tamar Herman, Tara Fowler, Tarek Sheasha, Thomas Georgiou, Thomas Scialom, Tobias Speckbacher, Todor Mihaylov, Tong Xiao, Ujjwal Karn, Vedanuj Goswami, Vibhor Gupta, Vignesh Ramanathan, Viktor Kerkez, Vincent Gonguet, Virginie Do, Vish Vogeti, Vladan Petrovic, Weiwei Chu, Wenhan Xiong, Wenyin Fu, Whitney Meers, Xavier Martinet, Xiaodong Wang, Xiaoqing Ellen Tan, Xinfeng Xie, Xuchao Jia, Xuewei Wang, Yaelle Goldschlag, Yashesh Gaur, Yasmine Babaei, Yi Wen, Yiwen Song, Yuchen Zhang, Yue Li, Yuning Mao, Zacharie Delpierre Coudert, Zheng Yan, Zhengxing Chen, Zoe Papakipos, Aaditya Singh, Aaron Grattafiori, Abha Jain, Adam Kelsey, Adam Shajnfeld, Adithya Gangidi, Adolfo Victoria, Ahuva Goldstand, Ajay Menon, Ajay Sharma, Alex Boesenberg, Alex Vaughan, Alexei Baevski, Allie Feinstein, Amanda Kallet, Amit Sangani, Anam Yunus, Andrei Lupu, Andres Alvarado, Andrew Caples, Andrew Gu, Andrew Ho, Andrew Poulton, Andrew Ryan, Ankit Ramchandani, Annie Franco, Aparajita Saraf, Arkabandhu Chowdhury, Ashley Gabriel, Ashwin Bharambe, Assaf Eisenman, Azadeh Yazdan, Beau James, Ben Maurer, Benjamin Leonhardi, Bernie Huang, Beth Loyd, Beto De Paola, Bhargavi Paranjape, Bing Liu, Bo Wu, Boyu Ni, Braden Hancock, Bram Wasti, Brandon Spence, Brani Stojkovic, Brian Gamido, Britt Montalvo, Carl Parker, Carly Burton, Catalina Mejia, Changhan Wang, Changkyu Kim, Chao Zhou, Chester Hu, Ching-hsiang Chu, Chris Cai, Chris Tindal, Christoph Feichtenhofer, Damon Civin, Dana Beaty, Daniel Kreymer, Daniel Li, Danny Wyatt, David Adkins, David Xu, Davide Testuggine, Delia David, Devi Parikh, Diana Liskovich, Didem Foss, Dingkang Wang, Duc Le, Dustin Holland, Edward Dowling, Eissa Jamil, Elaine Montgomery, Eleonora Presani, Emily Hahn, Emily Wood, Erik Brinkman, Esteban Arcaute, Evan Dunbar, Evan Smothers, Fei Sun, Felix Kreuk, Feng Tian, Firat Ozgenel, Francesco Caggioni, Francisco Guzmán, Frank Kanayet, Frank Seide, Gabriela Medina Florez, Gabriella Schwarz, Gada Badeer, Georgia Swee, Gil Halpern, Govind Thattai, Grant Herman, Grigory Sizov, Guangyi Zhang, Guna Lakshminarayanan, Hamid Shojanazeri, Han Zou, Hannah Wang, Hanwen Zha, Haroun Habeeb, Harrison Rudolph, Helen Suk, Henry Aspegren, Hunter Goldman, Ibrahim Damlaj, Igor Molybog, Igor Tufanov, Irina-elena Veliche, Itai Gat, Jake Weissman, James Geboski, James Kohli, Japhet Asher, Jean-baptiste Gaya, Jeff Marcus, Jeff Tang, Jennifer Chan, Jenny Zhen, Jeremy Reizenstein, Jeremy Teboul, Jessica Zhong, Jian Jin, Jingyi Yang, Joe Cummings, Jon Carvill, Jon Shepard, Jonathan Mcphie, Jonathan Torres, Josh Ginsburg, Junjie Wang, Kai Wu, Kam Hou U, Karan Saxena, Karthik Prasad, Kartikay Khandelwal, Katayoun Zand, Kathy Matosich, Kaushik Veeraraghavan, Kelly Michelena, Keqian Li, Kun Huang, Kunal Chawla, Kushal Lakhotia, Kyle Huang, Lailin Chen, Lakshya Garg, Lavender A, Leandro Silva, Lee Bell, Lei Zhang, Liangpeng Guo, Licheng Yu, Liron Moshkovich, Luca Wehrstedt, Madian Khabsa, Manav Avalani, Manish Bhatt, Maria Tsimpoukelli, Martynas Mankus, Matan Hasson, Matthew Lennie, Matthias Reso, Maxim Groshev, Maxim Naumov, Maya Lathi, Meghan Keneally, Michael L. Seltzer, Michal Valko, Michelle Restrepo, Mihir Patel, Mik Vyatskov, Mikayel Samvelyan, Mike Clark, Mike Macey, Mike Wang, Miquel Jubert Hermoso, Mo Metanat, Mohammad Rastegari, Munish Bansal, Nandhini Santhanam, Natascha Parks, Natasha White, Navyata Bawa, Nayan Singhal, Nick Egebo, Nicolas Usunier, Nikolay Pavlovich Laptev, Ning Dong, Ning Zhang, Norman Cheng, Oleg Chernoguz, Olivia Hart, Omkar Salpekar, Ozlem Kalinli, Parkin Kent, Parth Parekh, Paul Saab, Pavan Balaji, Pedro Rittner, Philip Bontrager, Pierre Roux, Piotr Dollar, Polina Zvyagina, Prashant Ratanchandani, Pritish Yuvraj, Qian Liang, Rachad Alao, Rachel Rodriguez, Rafi Ayub, Raghotham Murthy, Raghu Nayani, Rahul Mitra, Raymond Li, Rebekkah Hogan, Robin Battey, Rocky Wang, Rohan Maheswari, Russ Howes, Ruty Rinott, Sai Jayesh Bondu, Samyak Datta, Sara Chugh, Sara Hunt, Sargun Dhillon, Sasha Sidorov, Satadru Pan, Saurabh Verma, Seiji Yamamoto, Sharadh Ramaswamy, Shaun Lindsay, Shaun Lindsay, Sheng Feng, Shenghao Lin, Shengxin Cindy Zha, Shiva Shankar, Shuqiang Zhang, Shuqiang Zhang, Sinong Wang, Sneha Agarwal, Soji Sajuyigbe, Soumith Chintala, Stephanie Max, Stephen Chen, Steve Kehoe, Steve Satterfield, Sudarshan Govindaprasad, Sumit Gupta, Sungmin Cho, Sunny Virk, Suraj Subramanian, Sy Choudhury, Sydney Goldman, Tal Remez, Tamar Glaser, Tamara Best, Thilo Kohler, Thomas Robinson, Tianhe Li, Tianjun Zhang, Tim Matthews, Timothy Chou, Tzook Shaked, Varun Vontimitta, Victoria Ajayi, Victoria Montanez, Vijai Mohan, Vinay Satish Kumar, Vishal Mangla, Vítor Albiero, Vlad Ionescu, Vlad Poenaru, Vlad Tiberiu Mihailescu, Vladimir Ivanov, Wei Li, Wenchen Wang, Wenwen Jiang, Wes Bouaziz, Will Constable, Xiaocheng Tang, Xiaofang Wang, Xiaojian Wu, Xiaolan Wang, Xide Xia, Xilun Wu, Xinbo Gao, Yanjun Chen, Ye Hu, Ye Jia, Ye Qi, Yenda Li, Yilin Zhang, Ying Zhang, Yossi Adi, Youngjin Nam, Yu Wang, Yuchen Hao, Yundi Qian, Yuzi He, Zach Rait, Zachary Devito, Zef Rosnbrick, Zhaoduo Wen, Zhenyu Yang, Zhiwei Zhao
- Infercept: Efficient Intercept Support For Augmented Large Language Model Inference Abhyankar Reyna, He Zijian, Srivatsa Vikranth, Zhang Hao, Zhang Yiying
- Stackrag Agent: Improving Developer Answers With Retrieval-augmented Generation Abrahamyan Davit, Fard Fatemeh H.
- Knowledge Graphs As Context Sources For Llm-based Explanations Of Learning Recommendations Abu-rasheed Hasan, Weber Christian, Fathi Madjid
- Pathocl: Path-based Prompt Augmentation For OCL Generation With GPT-4 Abukhalaf Seif, Hamdaqa Mohammad, Khomh Foutse
- A Survey On Symbolic Knowledge Distillation Of Large Language Models Acharya Kamal, Velasquez Alvaro, Song Houbing Herbert
- Attnlrp: Attention-aware Layer-wise Relevance Propagation For Transformers Achtibat Reduan, Hatefi Sayed Mohammad Vakilzadeh, Dreyer Maximilian, Jain Aakriti, Wiegand Thomas, Lapuschkin Sebastian, Samek Wojciech
- Keyformer: KV Cache Reduction Through Key Tokens Selection For Efficient Generative Inference Adnan Muhammad, Arunkumar Akhil, Jain Gaurav, Nair Prashant J., Soloveychik Ilya, Kamath Purushotham
- Towards Optimizing And Evaluating A Retrieval Augmented QA Chatbot Using Llms With Human In The Loop Afzal Anum, Kowsik Alexander, Fani Rajna, Matthes Florian
- Copilot Evaluation Harness: Evaluating Llm-guided Software Programming Agarwal Anisha, Chan Aaron, Chandel Shubham, Jang Jinu, Miller Shaun, Moghaddam Roshanak Zilouchian, Mohylevskyy Yevhen, Sundaresan Neel, Tufano Michele
- Ethical Reasoning And Moral Value Alignment Of Llms Depend On The Language We Prompt Them In Agarwal Utkarsh, Tanmay Kumar, Khandelwal Aditi, Choudhury Monojit
- Tricy: Trigger-guided Data-to-text Generation With Intent Aware Attention-copy Agarwal Vibhav, Ghosh Sourav, Bss Harichandana, Arora Himanshu, Raja Barath Raj Kandur
- Codemirage: Hallucinations In Code Generated By Large Language Models Agarwal Vibhor, Pei Yulong, Alamir Salwa, Liu Xiaomo
- Enabling High-sparsity Foundational Llama Models With Efficient Pretraining And Deployment Agarwalla Abhinav, Gupta Abhay, Marques Alexandre, Pandit Shubhra, Goin Michael, Kurtic Eldar, Leong Kevin, Nguyen Tuan, Salem Mahmoud, Alistarh Dan, Lie Sean, Kurtz Mark
- Look Further Ahead: Testing The Limits Of GPT-4 In Path Planning Aghzal Mohamed, Plaku Erion, Yao Ziyu
- Leapformer: Enabling Linear Transformers For Autoregressive And Simultaneous Tasks Via Learned Proportions Agostinelli Victor, Hong Sanghyun, Chen Lizhong
- Illuminate: A Novel Approach For Depression Detection With Explainable Analysis And Proactive Therapy Using Prompt Engineering Agrawal Aryan
- Can't Remember Details In Long Documents? You Need Some R&R Agrawal Devanshu, Gao Shang, Gajek Martin
- Can Llms Perform Structured Graph Reasoning? Agrawal Palaash, Vasania Shavak, Tan Cheston
- LM4OPT: Unveiling The Potential Of Large Language Models In Formulating Mathematical Optimization Problems Ahmed Tasnim, Choudhury Salimur
- Linguistic Intelligence In Large Language Models For Telecommunications Ahmed Tasnim, Piovesan Nicola, De Domenico Antonio, Choudhury Salimur
- Timechara: Evaluating Point-in-time Character Hallucination Of Role-playing Large Language Models Ahn Jaewoo, Lee Taehyun, Lim Junyoung, Kim Jin-hwa, Yun Sangdoo, Lee Hwaran, Kim Gunhee
- Yi: Open Foundation Models By 01.AI 01.AI, Young Alex, Chen Bei, Li Chao, Huang Chengen, Zhang Ge, Zhang Guanwei, Li Heng, Zhu Jiangcheng, Chen Jianqun, Chang Jing, Yu Kaidong, Liu Peng, Liu Qiang, Yue Shawn, Yang Senbin, Yang Shiming, Yu Tao, Xie Wen, Huang Wenhao, Hu Xiaohui, Ren Xiaoyi, Niu Xinyao, Nie Pengcheng, Xu Yuchi, Liu Yudong, Wang Yue, Cai Yuxuan, Gu Zhenyu, Liu Zhiyuan, Dai Zonghong
- Enhancing Pre-trained Generative Language Models With Question Attended Span Extraction On Machine Reading Comprehension Ai Lin, Hui Zheng, Liu Zizhou, Hirschberg Julia
- QASE Enhanced Plms: Improved Control In Text Generation For MRC Ai Lin, Hui Zheng, Liu Zizhou, Hirschberg Julia
- Plug And Play With Prompts: A Prompt Tuning Approach For Controlling Text Generation Ajwani Rohan Deepak, Zhu Zining, Rose Jonathan, Rudzicz Frank
- Shadowllm: Predictor-based Contextual Sparsity For Large Language Models Akhauri Yash, Abouelhamayed Ahmed F, Dotzel Jordan, Zhang Zhiru, Rush Alexander M, Huda Safeen, Abdelfattah Mohamed S
- FACTS About Building Retrieval Augmented Generation-based Chatbots Akkiraju Rama, Xu Anbang, Bora Deepak, Yu Tan, An Lu, Seth Vishal, Shukla Aaditya, Gundecha Pritam, Mehta Hridhay, Jha Ashwin, Raj Prithvi, Balasubramanian Abhinav, Maram Murali, Muthusamy Guru, Annepally Shivakesh Reddy, Knowles Sidney, Du Min, Burnett Nick, Javiya Sean, Marannan Ashok, Kumari Mamta, Jha Surbhi, Dereszenski Ethan, Chakraborty Anupam, Ranjan Subhash, Terfai Amina, Surya Anoop, Mercer Tracey, Thanigachalam Vinodh Kumar, Bar Tamar, Krishnan Sanjana, Kilaru Samy, Jaksic Jasmine, Algarici Nave, Liberman Jacob, Conway Joey, Nayyar Sonu, Boitano Justin
- Have We Reached AGI? Comparing Chatgpt, Claude, And Gemini To Human Literacy And Education Benchmarks Akpan Mfon
- In-context Language Learning: Architectures And Algorithms Akyürek Ekin, Wang Bailin, Kim Yoon, Andreas Jacob
- History Of Generative Artificial Intelligence (AI) Chatbots: Past, Present, And Future Development Al-amin Md., Ali Mohammad Shazed, Salam Abdus, Khan Arif, Ali Ashraf, Ullah Ahsan, Alam Md Nur, Chowdhury Shamsul Kabir
- Mixture Of Modular Experts: Distilling Knowledge From A Multilingual Teacher Into Specialized Modular Language Models Al-maamari Mohammed, Amor Mehdi Ben, Granitzer Michael
- Enhancing Textbook Question Answering Task With Large Language Models And Retrieval Augmented Generation Alawwad Hessa Abdulrahman, Alhothali Areej, Naseem Usman, Alkhathlan Ali, Jamal Amani
- Mitigating Catastrophic Forgetting In Language Transfer Via Model Merging Alexandrov Anton, Raychev Veselin, Müller Mark Niklas, Zhang Ce, Vechev Martin, Toutanova Kristina
- A Survey Of Large Language Models For European Languages Ali Wazir, Pyysalo Sampo
- Chatgpt Alternative Solutions: Large Language Models Survey Alipour Hanieh, Pendar Nick, Roy Kohinoor
- Sabiá-2: A New Generation Of Portuguese Large Language Models Almeida Thales Sales, Abonizio Hugo, Nogueira Rodrigo, Pires Ramon
- D-NLP At Semeval-2024 Task 2: Evaluating Clinical Inference Capabilities Of Large Language Models Altinok Duygu
- Syntaxshap: Syntax-aware Explainability Method For Text Generation Amara Kenza, Sevastjanova Rita, El-assady Mennatallah
- Large Language Models For Psycholinguistic Plausibility Pretesting Amouyal Samuel Joseph, Meltzer-asscher Aya, Berant Jonathan
- Training-free Long-context Scaling Of Large Language Models An Chenxin, Huang Fei, Zhang Jun, Gong Shansan, Qiu Xipeng, Zhou Chang, Kong Lingpeng
- Sentencevae: Enable Next-sentence Prediction For Large Language Models With Faster Speed, Higher Accuracy And Longer Context An Hongjun, Chen Yifan, Sun Zhe, Li Xuelong
- Enhancing Journalism With AI: A Study Of Contextualized Image Captioning For News Articles Using Llms And Lmms Anagnostopoulou Aliki, Gouvea Thiago, Sonntag Daniel
- Does Refusal Training In Llms Generalize To The Past Tense? Andriushchenko Maksym, Flammarion Nicolas
- BAMO At Semeval-2024 Task 9: BRAINTEASER: A Novel Task Defying Common Sense Ansari Baktash, Rostamkhani Mohammadmostafa, Eetemadi Sauleh
- Simpler Becomes Harder: Do Llms Exhibit A Coherent Behavior On Simplified Corpora? Anschütz Miriam, Mosca Edoardo, Groh Georg
- Evaluating Chatgpt On Nuclear Domain-specific Data Anwar Muhammad, De Costa Mischa, Hammad Issam, Lau Daniel
- An Assessment On Comprehending Mental Health Through Large Language Models Arcan Mihael, Niland David-paul, Delahunty Fionn
- Hired: Attention-guided Token Dropping For Efficient Inference Of High-resolution Vision-language Models In Resource-constrained Environments Arif Kazi Hasan Ibn, Yoon Jinyi, Nikolopoulos Dimitrios S., Vandierendonck Hans, John Deepu, Ji Bo
- Forklift: An Extensible Neural Lifter Armengol-estapé Jordi, Rocha Rodrigo C. O., Woodruff Jackson, Minervini Pasquale, O'boyle Michael F. P.
- SPAFIT: Stratified Progressive Adaptation Fine-tuning For Pre-trained Large Language Models Arora Samir, Wang Liangliang
- Simple Linear Attention Language Models Balance The Recall-throughput Tradeoff Arora Simran, Eyuboglu Sabri, Zhang Michael, Timalsina Aman, Alberti Silas, Zinsley Dylan, Zou James, Rudra Atri, Ré Christopher
- Exploring The Robustness Of Task-oriented Dialogue Systems For Colloquial German Varieties Artemova Ekaterina, Blaschke Verena, Plank Barbara
- Thai Winograd Schemas: A Benchmark For Thai Commonsense Reasoning Artkaew Phakphum
- What Makes Language Models Good-enough? Asami Daiki, Sugawara Saku
- Enhancing Holonic Architecture With Natural Language Processing For System Of Systems Ashfaq Muhammad, Sadik Ahmed R., Mikkonen Tommi, Waseem Muhammad, Akitalo Niko M
- Slicegpt: Compress Large Language Models By Deleting Rows And Columns Ashkboos Saleh, Croci Maximilian L., Nascimento Marcelo Gennari Do, Hoefler Torsten, Hensman James
- Evaluating Large Language Models On The GMAT: Implications For The Future Of Business Education Ashrafimoghari Vahid, Gürkan Necdet, Suchow Jordan W.
- Assessing Llms For Zero-shot Abstractive Summarization Through The Lens Of Relevance Paraphrasing Askari Hadi, Chhabra Anshuman, Chen Muhao, Mohapatra Prasant
- Robustsentembed: Robust Sentence Embeddings Using Adversarial Self-supervised Contrastive Learning Asl Javad Rafiei, Panzade Prajwal, Blanco Eduardo, Takabi Daniel, Cai Zhipeng
- Minigpt4-video: Advancing Multimodal Llms For Video Understanding With Interleaved Visual-textual Tokens Ataallah Kirolos, Shen Xiaoqian, Abdelrahman Eslam, Sleiman Essam, Zhu Deyao, Ding Jian, Elhoseiny Mohamed
- Can Open-source Llms Compete With Commercial Models? Exploring The Few-shot Performance Of Current GPT Models In Biomedical Tasks Ateia Samy, Kruschwitz Udo
- Bifurcated Attention: Accelerating Massively Parallel Decoding With Shared Prefixes In Llms Athiwaratkun Ben, Gonugondla Sujan Kumar, Gouda Sanjay Krishna, Qian Haifeng, Ding Hantian, Sun Qing, Wang Jun, Guo Jiacheng, Chen Liangfu, Bhatia Parminder, Nallapati Ramesh, Sengupta Sudipta, Xiang Bing
- Prompt Design Matters For Computational Social Science Tasks But In Unpredictable Ways Atreja Shubham, Ashkinaze Joshua, Li Lingyao, Mendelsohn Julia, Hemphill Libby
- Transformer Alignment In Large Language Models Aubry Murdock, Meng Haoming, Sugolov Anton, Papyan Vardan
- Current State Of LLM Risks And AI Guardrails Ayyamperumal Suriya Ganesh, Ge Limin
- Feedback-generation For Programming Exercises With GPT-4 Azaiz Imen, Kiesler Natalie, Strickroth Sven
- Walia-llm: Enhancing Amharic-llama By Integrating Task-specific And Generative Datasets Azime Israel Abebe, Tonja Atnafu Lambebo, Belay Tadesse Destaw, Fuge Mitiku Yohannes, Wassie Aman Kassahun, Jada Eyasu Shiferaw, Chanie Yonas, Sewunetie Walelign Tewabe, Yimam Seid Muhie
- Accuracy And Consistency Of Llms In The Registered Dietitian Exam: The Impact Of Prompt Engineering And Knowledge Retrieval Azimi Iman, Qi Mohan, Wang Li, Rahmani Amir M., Li Youlin
- Minigpt-reverse-designing: Predicting Image Adjustments Utilizing Minigpt-4 Azizi Vahid, Koochaki Fatemeh
- Generating Query Recommendations Via Llms Bacciu Andrea, Palumbo Enrico, Damianou Andreas, Tonellotto Nicola, Silvestri Fabrizio
- The Pitfalls Of Next-token Prediction Bachmann Gregor, Nagarajan Vaishnavh
- Masked Mixers For Language Generation And Retrieval Badger Benjamin L.
- Efficient Model-agnostic Alignment Via Bayesian Persuasion Bai Fengshuo, Wang Mingzhi, Zhang Zhaowei, Chen Boyuan, Xu Yinda, Wen Ying, Yang Yaodong
- Sparsellm: Towards Global Pruning For Pre-trained Language Models Bai Guangji, Li Yijiang, Ling Chen, Kim Kibaek, Zhao Liang
- Citrus: Chunked Instruction-aware State Eviction For Long Sequence Modeling Bai Yu, Zou Xiyuan, Huang Heyan, Chen Sanxing, Rondeau Marc-antoine, Gao Yang, Cheung Jackie Chi Kit
- Hallucination Of Multimodal Large Language Models: A Survey Bai Zechen, Wang Pichao, Xiao Tianjun, He Tong, Han Zongbo, Zhang Zheng, Shou Mike Zheng
- Patentgpt: A Large Language Model For Intellectual Property Bai Zilong, Zhang Ruiji, Chen Linqing, Cai Qijun, Zhong Yuan, Wang Cong, Fang Yan, Fang Jie, Sun Jing, Wang Weikuan, Zhou Lizhi, Hua Haoran, Qiu Tian, Wang Chaochao, Sun Cheng, Lu Jianping, Wang Yixin, Xia Yubin, Hu Meng, Liu Haowen, Xu Peng, Xu Licong, Bian Fu, Gu Xiaolong, Zhang Lisha, Wang Weilei, Tu Changyang
- Evaluating The Semantic Profiling Abilities Of Llms For Natural Language Utterances In Data Visualization Bako Hannah K., Bhutani Arshnoor, Liu Xinyi, Cobbina Kwesi A., Liu Zhicheng
- Downstream Bias Mitigation Is All You Need Baksi Arkadeep, Singh Rahul, Joshi Tarun
- RAG Vs Fine-tuning: Pipelines, Tradeoffs, And A Case Study On Agriculture Balaguer Angels, Benara Vinamra, Cunha Renato Luiz De Freitas, Filho Roberto De M. Estevão, Hendry Todd, Holstein Daniel, Marsman Jennifer, Mecklenburg Nick, Malvar Sara, Nunes Leonardo O., Padilha Rafael, Sharp Morris, Silva Bruno, Sharma Swati, Aski Vijay, Chandra Ranveer
- CYGENT: A Cybersecurity Conversational Agent With Log Summarization Powered By GPT-3 Balasubramanian Prasasthy, Seby Justin, Kostakos Panos
- LAMP: A Language Model On The Map Balsebre Pasquale, Huang Weiming, Cong Gao
- Demystifying Platform Requirements For Diverse LLM Inference Use Cases Bambhaniya Abhimanyu, Raj Ritik, Jeong Geonhwa, Kundu Souvik, Srinivasan Sudarshan, Elavazhagan Midhilesh, Kumar Madhu, Krishna Tushar
- Breaking Boundaries: Investigating The Effects Of Model Editing On Cross-linguistic Performance Banerjee Somnath, Halder Avik, Mandal Rajarshi, Layek Sayan, Soboroff Ian, Hazra Rima, Mukherjee Animesh
- How (un)ethical Are Instruction-centric Responses Of Llms? Unveiling The Vulnerabilities Of Safety Guardrails To Harmful Queries Banerjee Somnath, Layek Sayan, Hazra Rima, Mukherjee Animesh
- LLM Augmented Llms: Expanding Capabilities Through Composition Bansal Rachit, Samanta Bidisha, Dalmia Siddharth, Gupta Nitish, Vashishth Shikhar, Ganapathy Sriram, Bapna Abhishek, Jain Prateek, Talukdar Partha
- Transforming Agency. On The Mode Of Existence Of Large Language Models Barandiaran Xabier E., Almendros Lola S.
- TTQA-RS- A Break-down Prompting Approach For Multi-hop Table-text Question Answering With Reasoning And Summarization Bardhan Jayetri, Xiao Bushi, Wang Daisy Zhe
- Find The Gap: Knowledge Base Reasoning For Visual Question Answering Barezi Elham J., Kordjamshidi Parisa
- Slicerchat: Building A Local Chatbot For 3D Slicer Barr Colton
- STARLING: Self-supervised Training Of Text-based Reinforcement Learning Agent With Large Language Models Basavatia Shreyas, Murugesan Keerthiram, Ratnakar Shivam
- Llms' Reading Comprehension Is Affected By Parametric Knowledge And Struggles With Hypothetical Statements Basmov Victoria, Goldberg Yoav, Tsarfaty Reut
- Understanding Information Storage And Transfer In Multi-modal Large Language Models Basu Samyadeep, Grayson Martin, Morrison Cecily, Nushi Besmira, Feizi Soheil, Massiceti Daniela
- Evolutionary Multi-objective Optimization Of Large Language Model Prompts For Balancing Sentiments Baumann Jill, Kramer Oliver
- How Do You Know That? Teaching Generative Language Models To Reference Answers To Biomedical Questions Bašaragin Bojana, Ljajić Adela, Medvecki Darija, Cassano Lorenzo, Košprdić Miloš, Milošević Nikola
- Xlstm: Extended Long Short-term Memory Beck Maximilian, Pöppel Korbinian, Spanring Markus, Auer Andreas, Prudnikova Oleksandra, Kopp Michael, Klambauer Günter, Brandstetter Johannes, Hochreiter Sepp
- Llm2vec: Large Language Models Are Secretly Powerful Text Encoders Behnamghader Parishad, Adlakha Vaibhav, Mosbach Marius, Bahdanau Dzmitry, Chapados Nicolas, Reddy Siva
- Internalinspector \(I^2\): Robust Confidence Estimation In Llms Through Internal States Beigi Mohammad, Shen Ying, Yang Runing, Lin Zihao, Wang Qifan, Mohan Ankith, He Jianfeng, Jin Ming, Lu Chang-tien, Huang Lifu
- Luna: An Evaluation Foundation Model To Catch Language Model Hallucinations With High Accuracy And Low Cost Belyi Masha, Friel Robert, Shao Shuai, Sanyal Atindriyo
- SUTRA: Scalable Multilingual Language Model Architecture Bendale Abhijit, Sapienza Michael, Ripplinger Steven, Gibbs Simon, Lee Jaewon, Mistry Pranav
- Cross-lingual Editing In Multilingual Language Models Beniwal Himanshu, D Kowsik Nandagopan, Singh Mayank
- Redefining "hallucination" In Llms: Towards A Psychology-informed Framework For Mitigating Misinformation Berberette Elijah, Hutchins Jack, Sadovnik Amir
- Cotar: Chain-of-thought Attribution Reasoning With Multi-level Granularity Berchansky Moshe, Fleischer Daniel, Wasserblat Moshe, Izsak Peter
- Lotr: Low Tensor Rank Weight Adaptation Bershatsky Daniel, Cherniuk Daria, Daulbaev Talgat, Mikhalev Aleksandr, Oseledets Ivan
- Multi-head RAG: Solving Multi-aspect Problems With Llms Besta Maciej, Kubicek Ales, Niggli Roman, Gerstenberger Robert, Weitzendorf Lucas, Chi Mingyuan, Iff Patrick, Gajda Joanna, Nyczyk Piotr, Müller Jürgen, Niewiadomski Hubert, Chrapek Marcin, Podstawski Michał, Hoefler Torsten
- Checkembed: Effective Verification Of LLM Solutions To Open-ended Tasks Besta Maciej, Paleari Lorenzo, Kubicek Ales, Nyczyk Piotr, Gerstenberger Robert, Iff Patrick, Lehmann Tomasz, Niewiadomski Hubert, Hoefler Torsten
- Enhancing Event Reasoning In Large Language Models Through Instruction Fine-tuning With Semantic Causal Graphs Bethany Mazal, Bethany Emet, Wherry Brandon, Chiang Cho-yu, Vishwamitra Nishant, Rios Anthony, Najafirad Peyman
- Taking The Next Step With Generative Artificial Intelligence: The Transformative Role Of Multimodal Large Language Models In Science Education Bewersdorff Arne, Hartmann Christian, Hornberger Marie, Seßler Kathrin, Bannert Maria, Kasneci Enkelejda, Kasneci Gjergji, Zhai Xiaoming, Nerdel Claudia
- Evaluating AI For Law: Bridging The Gap With Open-source Solutions Bhambhoria Rohan, Dahan Samuel, Li Jonathan, Zhu Xiaodan
- On The Robustness Of Language Models For Tabular Question Answering Bhandari Kushal Raj, Xing Sixue, Dan Soham, Gao Jianxi
- Fintral: A Family Of GPT-4 Level Multimodal Financial Large Language Models Bhatia Gagan, Nagoudi El Moatez Billah, Cavusoglu Hasan, Abdul-mageed Muhammad
- Understanding The Role Of Ffns In Driving Multilingual Behaviour In Llms Bhattacharya Sunit, Bojar Ondřej
- Enhancing Code Translation In Language Models With Few-shot Learning Via Retrieval-augmented Generation Bhattarai Manish, Santos Javier E., Jones Shawn, Biswas Ayan, Alexandrov Boian, O'malley Daniel
- Speculative Streaming: Fast LLM Inference Without Auxiliary Models Bhendawade Nikhil, Belousova Irina, Fu Qichen, Mason Henry, Rastegari Mohammad, Najibi Mahyar
- Iterative Refinement Of Project-level Code Context For Precise Code Generation With Compiler Feedback Bi Zhangqian, Wan Yao, Wang Zheng, Zhang Hongyu, Guan Batu, Lu Fangxin, Zhang Zili, Sui Yulei, Jin Hai, Shi Xuanhua
- Supervisory Prompt Training Billa Jean Ghislain, Oh Min, Du Liang
- Lipost: Improved Content Understanding With Effective Use Of Multi-task Contrastive Learning Bindal Akanksha, Ramanujam Sudarshan, Golland Dave, Hazen Tj, Jiang Tina, Zhang Fengyu, Yan Peng
- GRAM: Global Reasoning For Multi-page VQA Blau Tsachi, Fogel Sharon, Ronen Roi, Golts Alona, Ganz Roy, Avraham Elad Ben, Aberdam Aviad, Tsiper Shahar, Litman Ron
- Long-span Question-answering: Automatic Question Generation And Qa-system Ranking Via Side-by-side Evaluation Bohnet Bernd, Swersky Kevin, Liu Rosanne, Awasthi Pranjal, Nova Azade, Snaider Javier, Sedghi Hanie, Parisi Aaron T, Collins Michael, Lazaridou Angeliki, Firat Orhan, Fiedel Noah
- Towards Cross-tokenizer Distillation: The Universal Logit Distillation Loss For Llms Boizard Nicolas, Haddad Kevin El, Hudelot Céline, Colombo Pierre
- Assessing The Potential Of Mid-sized Language Models For Clinical QA Bolton Elliot, Xiong Betty, Muralidharan Vijaytha, Schamroth Joel, Muralidharan Vivek, Manning Christopher D., Daneshjou Roxana
- Do Llms Dream Of Ontologies? Bombieri Marco, Fiorini Paolo, Ponzetto Simone Paolo, Rospocher Marco
- Navigating The Labyrinth: Evaluating And Enhancing Llms' Ability To Reason About Search Problems Borazjanizadeh Nasim, Herzig Roei, Darrell Trevor, Feris Rogerio, Karlinsky Leonid
- Reliable Reasoning Beyond Natural Language Borazjanizadeh Nasim, Piantadosi Steven T.
- The Unreasonable Ineffectiveness Of Nucleus Sampling On Mitigating Text Memorization Borec Luka, Sadler Philipp, Schlangen David
- Open Artificial Knowledge Borisov Vadim, Schreiber Richard H.
- Recurrentgemma: Moving Past Transformers For Efficient Open Language Models Botev Aleksandar, De Soham, Smith Samuel L, Fernando Anushan, Muraru George-cristian, Haroun Ruba, Berrada Leonard, Pascanu Razvan, Sessa Pier Giuseppe, Dadashi Robert, Hussenot Léonard, Ferret Johan, Girgin Sertan, Bachem Olivier, Andreev Alek, Kenealy Kathleen, Mesnard Thomas, Hardin Cassidy, Bhupatiraju Surya, Pathak Shreya, Sifre Laurent, Rivière Morgane, Kale Mihir Sanjay, Love Juliette, Tafti Pouya, Joulin Armand, Fiedel Noah, Senter Evan, Chen Yutian, Srinivasan Srivatsan, Desjardins Guillaume, Budden David, Doucet Arnaud, Vikram Sharad, Paszke Adam, Gale Trevor, Borgeaud Sebastian, Chen Charlie, Brock Andy, Paterson Antonia, Brennan Jenny, Risdal Meg, Gundluru Raj, Devanathan Nesh, Mooney Paul, Chauhan Nilay, Culliton Phil, Martins Luiz Gustavo, Bandy Elisa, Huntsperger David, Cameron Glenn, Zucker Arthur, Warkentin Tris, Peran Ludovic, Giang Minh, Ghahramani Zoubin, Farabet Clément, Kavukcuoglu Koray, Hassabis Demis, Hadsell Raia, Teh Yee Whye, De Freitas Nando
- Improving Language Models Trained On Translated Data With Continual Pre-training And Dictionary Learning Analysis Boughorbel Sabri, Parvez Md Rizwan, Hawasly Majd
- Transformers Meet Neural Algorithmic Reasoners Bounsi Wilfried, Ibarz Borja, Dudzik Andrew, Hamrick Jessica B., Markeeva Larisa, Vitvitskyi Alex, Pascanu Razvan, Veličković Petar
- Retrieving Examples From Memory For Retrieval Augmented Neural Machine Translation: A Systematic Comparison Bouthors Maxime, Crego Josep, Yvon Francois
- Embardiment: An Embodied AI Agent For Productivity In XR Bovo Riccardo, Abreu Steven, Ahuja Karan, Gonzalez Eric J, Cheng Li-te, Gonzalez-franco Mar
- Improving Sequential Recommendations With Llms Boz Artun, Zorgdrager Wouter, Kotti Zoe, Harte Jesse, Louridas Panos, Jannach Dietmar, Fragkoulis Marios
- The Art Of Saying No: Contextual Noncompliance In Language Models Brahman Faeze, Kumar Sachin, Balachandran Vidhisha, Dasigi Pradeep, Pyatkin Valentina, Ravichander Abhilasha, Wiegreffe Sarah, Dziri Nouha, Chandu Khyathi, Hessel Jack, Tsvetkov Yulia, Smith Noah A., Choi Yejin, Hajishirzi Hannaneh
- Reducing Transformer Key-value Cache Size With Cross-layer Attention Brandon William, Mishra Mayank, Nrusimha Aniruddha, Panda Rameswar, Kelly Jonathan Ragan
- Large Language Models "ad Referendum": How Good Are They At Machine Translation In The Legal Domain? Briva-iglesias Vicent, Camargo Joao Lucas Cavalheiro, Dogru Gokhan
- LOCOST: State-space Models For Long Document Abstractive Summarization Bronnec Florian Le, Duong Song, Ravaut Mathieu, Allauzen Alexandre, Chen Nancy F., Guigue Vincent, Lumbreras Alberto, Soulier Laure, Gallinari Patrick
- Generation, Distillation And Evaluation Of Motivational Interviewing-style Reflections With A Foundational Language Model Brown Andrew, Zhu Jiading, Abdelwahab Mohamed, Dong Alec, Wang Cindy, Rose Jonathan
- Enhancing Trust In Llms: Algorithms For Comparing And Interpreting Llms Brown Nik Bear
- Fine-tuned 'small' Llms (still) Significantly Outperform Zero-shot Generative AI Models In Text Classification Bucher Martin Juan José, Martini Marco
- Lissard: Long And Simple Sequential Reasoning Datasets Bueno Mirelle, Lotufo Roberto, Nogueira Rodrigo
- The Two Sides Of The Coin: Hallucination Generation And Detection With Llms As Evaluators For Llms Bui Anh Thu Maria, Brech Saskia Felizitas, Hußfeldt Natalie, Jennert Tobias, Ullrich Melanie, Breuer Timo, Khasmakhi Narjes Nikzad, Schaer Philipp
- Pangea: Procedural Artificial Narrative Using Generative AI For Turn-based Video Games Buongiorno Steph, Klinkert Lawrence Jake, Chawla Tanishq, Zhuang Zixin, Clark Corey
- Roguegpt: Dis-ethical Tuning Transforms Chatgpt4 Into A Rogue AI In 158 Words Buscemi Alessio, Proverbio Daniele
- Investigating Wit, Creativity, And Detectability Of Large Language Models In Domain-specific Writing Style Adaptation Of Reddit's Showerthoughts Buz Tolga, Frost Benjamin, Genchev Nikola, Schneider Moritz, Kaffee Lucie-aimée, De Melo Gerard
- ARES: Alternating Reinforcement Learning And Supervised Fine-tuning For Enhanced Multi-modal Chain-of-thought Reasoning Through Diverse AI Feedback Byun Ju-seung, Chun Jiyun, Kil Jihyung, Perrault Andrew
- On Large Language Models In National Security Applications Caballero William N., Jenkins Phillip R.
- In-context Learning And Fine-tuning GPT For Argument Mining Cabessa Jérémie, Hernault Hugo, Mushtaq Umer
- Wiki-llava: Hierarchical Retrieval-augmented Generation For Multimodal Llms Caffagni Davide, Cocchi Federico, Moratelli Nicholas, Sarto Sara, Cornia Marcella, Baraldi Lorenzo, Cucchiara Rita
- High-dimension Human Value Representation In Large Language Models Cahyawijaya Samuel, Chen Delong, Bang Yejin, Khalatbari Leila, Wilie Bryan, Ji Ziwei, Ishii Etsuko, Fung Pascale
- Cendol: Open Instruction-tuned Generative Large Language Models For Indonesian Languages Cahyawijaya Samuel, Lovenia Holy, Koto Fajri, Putri Rifki Afina, Dave Emmanuel, Lee Jhonson, Shadieq Nuur, Cenggoro Wawan, Akbar Salsabil Maulana, Mahendra Muhammad Ihza, Putri Dea Annisayanti, Wilie Bryan, Winata Genta Indra, Aji Alham Fikri, Purwarianti Ayu, Fung Pascale
- Sciassess: Benchmarking LLM Proficiency In Scientific Literature Analysis Cai Hengxing, Cai Xiaochen, Chang Junhan, Li Sihang, Yao Lin, Wang Changxin, Gao Zhifeng, Wang Hongshuai, Li Yongge, Lin Mujie, Yang Shuwen, Wang Jiankun, Xu Mingjun, Huang Jin, Xi Fang, Zhuang Jiaxi, Yin Yuqi, Li Yaqi, Chen Changhong, Cheng Zheng, Zhao Zifeng, Zhang Linfeng, Ke Guolin
- Flextron: Many-in-one Flexible Large Language Model Cai Ruisi, Muralidharan Saurav, Heinrich Greg, Yin Hongxu, Wang Zhangyang, Kautz Jan, Molchanov Pavlo
- Lococo: Dropping In Convolutions For Long Context Compression Cai Ruisi, Tian Yuandong, Wang Zhangyang, Chen Beidi
- Medusa: Simple LLM Inference Acceleration Framework With Multiple Decoding Heads Cai Tianle, Li Yuhong, Geng Zhengyang, Peng Hongwu, Lee Jason D., Chen Deming, Dao Tri
- A Survey On Mixture Of Experts Cai Weilin, Jiang Juyong, Wang Fan, Tang Jing, Kim Sunghun, Huang Jiayi
- Towards Large Language Model Aided Program Refinement Cai Yufan, Hou Zhe, Luan Xiaokun, Baena David Miguel Sanan, Lin Yun, Sun Jun, Dong Jin Song
- Internlm2 Technical Report Cai Zheng, Cao Maosong, Chen Haojiong, Chen Kai, Chen Keyu, Chen Xin, Chen Xun, Chen Zehui, Chen Zhi, Chu Pei, Dong Xiaoyi, Duan Haodong, Fan Qi, Fei Zhaoye, Gao Yang, Ge Jiaye, Gu Chenya, Gu Yuzhe, Gui Tao, Guo Aijia, Guo Qipeng, He Conghui, Hu Yingfan, Huang Ting, Jiang Tao, Jiao Penglong, Jin Zhenjiang, Lei Zhikai, Li Jiaxing, Li Jingwen, Li Linyang, Li Shuaibin, Li Wei, Li Yining, Liu Hongwei, Liu Jiangning, Hong Jiawei, Liu Kaiwen, Liu Kuikun, Liu Xiaoran, Lv Chengqi, Lv Haijun, Lv Kai, Ma Li, Ma Runyuan, Ma Zerun, Ning Wenchang, Ouyang Linke, Qiu Jiantao, Qu Yuan, Shang Fukai, Shao Yunfan, Song Demin, Song Zifan, Sui Zhihao, Sun Peng, Sun Yu, Tang Huanze, Wang Bin, Wang Guoteng, Wang Jiaqi, Wang Jiayu, Wang Rui, Wang Yudong, Wang Ziyi, Wei Xingjian, Weng Qizhen, Wu Fan, Xiong Yingtong, Xu Chao, Xu Ruiliang, Yan Hang, Yan Yirong, Yang Xiaogui, Ye Haochen, Ying Huaiyuan, Yu Jia, Yu Jing, Zang Yuhang, Zhang Chuyu, Zhang Li, Zhang Pan, Zhang Peng, Zhang Ruijie, Zhang Shuo, Zhang Songyang, Zhang Wenjian, Zhang Wenwei, Zhang Xingcheng, Zhang Xinyue, Zhao Hui, Zhao Qian, Zhao Xiaomeng, Zhou Fengzhe, Zhou Zaida, Zhuo Jingming, Zou Yicheng, Qiu Xipeng, Qiao Yu, Lin Dahua
- On The Worst Prompt Performance Of Large Language Models Cao Bowen, Cai Deng, Zhang Zhisong, Zou Yuexian, Lam Wai
- MADTP: Multimodal Alignment-guided Dynamic Token Pruning For Accelerating Vision-language Transformer Cao Jianjian, Ye Peng, Li Shengze, Yu Chong, Tang Yansong, Lu Jiwen, Chen Tao
- Genrec: Generative Sequential Recommendation With Large Language Models Cao Panfeng, Lio Pietro
- Figuring Out Figures: Using Textual References To Caption Scientific Figures Cao Stanley, Liu Kevin
- Personalized Steering Of Large Language Models: Versatile Steering Vectors Through Bi-directional Preference Optimization Cao Yuanpu, Zhang Tianrong, Cao Bochuan, Yin Ziyi, Lin Lu, Ma Fenglong, Chen Jinghui
- Head-wise Shareable Attention For Large Language Models Cao Zouying, Yang Yifei, Zhao Hai
- Chart-based Reasoning: Transferring Capabilities From Llms To Vlms Carbune Victor, Mansoor Hassan, Liu Fangyu, Aralikatte Rahul, Baechler Gilles, Chen Jindong, Sharma Abhanshu
- Captioning Visualizations With Large Language Models (CVLLM): A Tutorial Carenini Giuseppe, Johnson Jordon, Salamatian Ali
- A Review Of Multi-modal Large Language And Vision Models Carolan Kilian, Fennelly Laura, Smeaton Alan F.
- Suppressing Pink Elephants With Direct Principle Feedback Castricato Louis, Lile Nathan, Anand Suraj, Schoelkopf Hailey, Verma Siddharth, Biderman Stella
- AMEX: Android Multi-annotation Expo Dataset For Mobile GUI Agents Chai Yuxiang, Huang Siyuan, Niu Yazhe, Xiao Han, Liu Liang, Zhang Dingyu, Gao Peng, Ren Shuai, Li Hongsheng
- Towards Neural Synthesis For Smt-assisted Proof-oriented Programming Chakraborty Saikat, Ebner Gabriel, Bhat Siddharth, Fakhoury Sarah, Fatima Sakina, Lahiri Shuvendu, Swamy Nikhil
- Dense Reward For Free In Reinforcement Learning From Human Feedback Chan Alex J., Sun Hao, Holt Samuel, Van Der Schaar Mihaela
- AMGPT: A Large Language Model For Contextual Querying In Additive Manufacturing Chandrasekhar Achuth, Chan Jonathan, Ogoke Francis, Ajenifujah Olabode, Farimani Amir Barati
- When Parts Are Greater Than Sums: Individual LLM Components Can Outperform Full Models Chang Ting-yun, Thomason Jesse, Jia Robin
- Goldfish: Monolingual Language Models For 350 Languages Chang Tyler A., Arnett Catherine, Tu Zhuowen, Bergen Benjamin K.
- When Large Language Models Meet Evolutionary Algorithms Chao Wang, Zhao Jiaxuan, Jiao Licheng, Li Lingling, Liu Fang, Yang Shuyuan
- Notellm: A Retrievable Large Language Model For Note Recommendation Chao Zhang, Shiwei Wu, Haoxin Zhang, Tong Xu, Yan Gao, Yao Hu, Di Wu, Enhong Chen
- VITA: Towards Open-source Interactive Omni Multimodal LLM Chaoyou Fu, Haojia Lin, Zuwei Long, Yunhang Shen, Meng Zhao, Yifan Zhang, Shaoqi Dong, Xiong Wang, Di Yin, Long Ma, Xiawu Zheng, Ran He, Rongrong Ji, Yunsheng Wu, Caifeng Shan, Xing Sun
- Video-mme: The First-ever Comprehensive Evaluation Benchmark Of Multi-modal Llms In Video Analysis Chaoyou Fu, Yuhan Dai, Yongdong Luo, Lei Li, Shuhuai Ren, Renrui Zhang, Zihan Wang, Chenyu Zhou, Yunhang Shen, Mengdan Zhang, Peixian Chen, Yanwei Li, Shaohui Lin, Sirui Zhao, Ke Li, Tong Xu, Xiawu Zheng, Enhong Chen, Rongrong Ji, Xing Sun
- Quantifying The Gain In Weak-to-strong Generalization Charikar Moses, Pabbaraju Chirag, Shiragur Kirankumar
- Évaluation Des Capacités De Réponse De Larges Modèles De Langage (LLM) Pour Des Questions D'historiens Chartier Mathieu, Dakkoune Nabil, Bourgeois Guillaume, Jean Stéphane
- Language Models Can Exploit Cross-task In-context Learning For Data-scarce Novel Tasks Chatterjee Anwoy, Tanwar Eshaan, Dutta Subhabrata, Chakraborty Tanmoy
- Phantom: General Trigger Attacks On Retrieval Augmented Language Generation Chaudhari Harsh, Severi Giorgio, Abascal John, Jagielski Matthew, Choquette-choo Christopher A., Nasr Milad, Nita-rotaru Cristina, Oprea Alina
- Large Language Models As Instruments Of Power: New Regimes Of Autonomous Manipulation And Control Chaudhary Yaqub, Penn Jonnie
- Coupling Speech Encoders With Downstream Text Models Chelba Ciprian, Schalkwyk Johan
- Benchmarking Llms For Translating Classical Chinese Poetry: Evaluating Adequacy, Fluency, And Elegance Chen Andong, Lou Lianzhang, Chen Kehai, Bai Xuefeng, Xiang Yang, Yang Muyun, Zhao Tiejun, Zhang Min
- Lmstyle Benchmark: Evaluating Text Style Transfer For Chatbots Chen Jianlin
- Efficiency In Focus: Layernorm As A Catalyst For Fine-tuning Medical Visual Language Pre-trained Models Chen Jiawei, Yang Dingkang, Jiang Yue, Li Mingcheng, Wei Jinjie, Hou Xiaolu, Zhang Lihua
- Control-dag: Constrained Decoding For Non-autoregressive Directed Acyclic T5 Using Weighted Finite State Automata Chen Jinghong, Lin Weizhe, Mei Jingbiao, Byrne Bill
- Automated Data Curation For Robust Language Model Fine-tuning Chen Jiuhai, Mueller Jonas
- Elcorec: Enhance Language Understanding With Co-propagation Of Numerical And Categorical Features For Recommendation Chen Jizheng, Du Kounianhua, Lin Jianghao, Chen Bo, Tang Ruiming, Zhang Weinan
- Pixart-\(\Sigma\): Weak-to-strong Training Of Diffusion Transformer For 4K Text-to-image Generation Chen Junsong, Ge Chongjian, Xie Enze, Wu Yue, Yao Lewei, Ren Xiaozhe, Wang Zhongdao, Luo Ping, Lu Huchuan, Li Zhenguo
- Cost-effective Instruction Learning For Pathology Vision And Language Analysis Chen Kaitao, Liu Mianxin, Yan Fang, Ma Lei, Shi Xiaoming, Wang Lilong, Wang Xiaosong, Zhu Lifeng, Wang Zhe, Zhou Mu, Zhang Shaoting
- Diahalu: A Dialogue-level Hallucination Evaluation Benchmark For Large Language Models Chen Kedi, Chen Qin, Zhou Jie, He Yishen, He Liang
- Machine Unlearning In Large Language Models Chen Kongyang, Wang Zixin, Mi Bing, Liu Waixi, Wang Shaowei, Ren Xiaojun, Shen Jiaxing
- OMPGPT: A Generative Pre-trained Transformer Model For Openmp Chen Le, Bhattacharjee Arijit, Ahmed Nesreen, Hasabnis Niranjan, Oren Gal, Vo Vy, Jannesari Ali
- How Truncating Weights Improves Reasoning In Language Models Chen Lei, Bruna Joan, Bietti Alberto
- Pca-bench: Evaluating Multimodal Large Language Models In Perception-cognition-action Chain Chen Liang, Zhang Yichi, Ren Shuhuai, Zhao Haozhe, Cai Zefan, Wang Yuchi, Wang Peiyi, Meng Xiangdi, Liu Tianyu, Chang Baobao
- Identifying Query-relevant Neurons In Large Language Models For Long-form Texts Chen Lihu, Dejl Adam, Toni Francesca
- Efficientqat: Efficient Quantization-aware Training For Large Language Models Chen Mengzhao, Shao Wenqi, Xu Peng, Wang Jiahao, Gao Peng, Zhang Kaipeng, Qiao Yu, Luo Ping
- Graphwiz: An Instruction-following Language Model For Graph Problems Chen Nuo, Li Yuhan, Tang Jianheng, Li Jia
- Enhancing Visual Question Answering Through Ranking-based Hybrid Training And Multimodal Fusion Chen Peiyuan, Zhang Zecheng, Dong Yiping, Zhou Li, Wang Han
- WSI-VQA: Interpreting Whole Slide Images By Generative Visual Question Answering Chen Pingyi, Zhu Chenglu, Zheng Sunyi, Li Honglin, Yang Lin
- Measuring Taiwanese Mandarin Language Understanding Chen Po-heng, Cheng Sijia, Chen Wei-lin, Lin Yen-ting, Chen Yun-nung
- M\(^3\)cot: A Novel Benchmark For Multi-domain Multi-step Multi-modal Chain-of-thought Chen Qiguang, Qin Libo, Zhang Jin, Chen Zhi, Xu Xiao, Che Wanxiang
- Llava-mole: Sparse Mixture Of Lora Experts For Mitigating Data Conflicts In Instruction Finetuning Mllms Chen Shaoxiang, Jie Zequn, Ma Lin
- Octopus: On-device Language Model For Function Calling Of Software Apis Chen Wei, Li Zhiyuan, Ma Mingyuan
- Intelligent Tutor: Leveraging Chatgpt And Microsoft Copilot Studio To Deliver A Generative AI Student Support And Feedback System Within Teams Chen Wei-yu
- SATO: Stable Text-to-motion Framework Chen Wenshuo, Xiao Hongru, Zhang Erhang, Hu Lijie, Wang Lei, Liu Mengyuan, Chen Chen
- Llast: Improved End-to-end Speech Translation System Leveraged By Large Language Models Chen Xi, Zhang Songyang, Bai Qibing, Chen Kai, Nakamura Satoshi
- Gazexplain: Learning To Predict Natural Language Explanations Of Visual Scanpaths Chen Xianyu, Jiang Ming, Zhao Qi
- Eyegpt: Ophthalmic Assistant With Large Language Models Chen Xiaolan, Zhao Ziwei, Zhang Weiyi, Xu Pusheng, Gao Le, Xu Mingpu, Wu Yue, Li Yinwen, Shi Danli, He Mingguang
- Recoverable Compression: A Multimodal Vision Token Recovery Mechanism Guided By Text Information Chen Yi, Xu Jian, Zhang Xu-yao, Liu Wen-zhuo, Liu Yang-yang, Liu Cheng-lin
- NACL: A General And Effective KV Cache Eviction Framework For Llms At Inference Time Chen Yilong, Wang Guoxia, Shang Junyuan, Cui Shiyao, Zhang Zhenyu, Liu Tingwen, Wang Shuohuan, Sun Yu, Yu Dianhai, Wu Hua
- See What Llms Cannot Answer: A Self-challenge Framework For Uncovering LLM Weaknesses Chen Yulong, Liu Yang, Yan Jianhao, Bai Xuefeng, Zhong Ming, Yang Yinghao, Yang Ziyi, Zhu Chenguang, Zhang Yue
- MAPO: Boosting Large Language Model Performance With Model-adaptive Prompt Optimization Chen Yuyan, Wen Zhihao, Fan Ge, Chen Zhengyu, Wu Wei, Liu Dayiheng, Li Zhixu, Liu Bang, Xiao Yanghua
- Dia-llama: Towards Large Language Model-driven CT Report Generation Chen Zhixuan, Luo Luyang, Bie Yequan, Chen Hao
- Improving Retrieval Augmented Open-domain Question-answering With Vectorized Contexts Chen Zhuo, Wang Xinyu, Jiang Yong, Xie Pengjun, Huang Fei, Tu Kewei
- An Empirical Study Of Data Ability Boundary In Llms' Math Reasoning Chen Zui, Chen Yezeng, Han Jiaqi, Huang Zhijie, Qi Ji, Zhou Yi
- "in Dialogues We Learn": Towards Personalized Dialogue Without Pre-defined Profiles Through In-dialogue Learning Cheng Chuanqi, Tu Quan, Wu Wei, Shang Shuo, Mao Cunli, Yu Zhengtao, Yan Rui
- Emergence Of A High-dimensional Abstraction Phase In Language Transformers Cheng Emily, Doimo Diego, Kervadec Corentin, Macocco Iuri, Yu Jade, Laio Alessandro, Baroni Marco
- Transferring Backdoors Between Large Language Models By Knowledge Distillation Cheng Pengzhou, Wu Zongru, Ju Tianjie, Du Wei, Zhang Zhuosheng, Liu Gongshen
- Information Re-organization Improves Reasoning In Large Language Models Cheng Xiaoxia, Tan Zeqi, Xue Wei, Lu Weiming
- Small Agent Can Also Rock! Empowering Small Language Models As Hallucination Detector Cheng Xiaoxue, Li Junyi, Zhao Wayne Xin, Zhang Hongzhi, Zhang Fuzheng, Zhang Di, Gai Kun, Wen Ji-rong
- A Perspective On Large Language Models, Intelligent Machines, And Knowledge Acquisition Cherkassky Vladimir, Lee Eng Hock
- Revisiting Zero-shot Abstractive Summarization In The Era Of Large Language Models From The Perspective Of Position Bias Chhabra Anshuman, Askari Hadi, Mohapatra Prasant
- Lamsum: Creating Extractive Summaries Of User Generated Content Using Llms Chhikara Garima, Sharma Anurag, Gurucharan V., Ghosh Kripabandhu, Chakraborty Abhijnan
- Modeling: A Novel Dataset For Testing Linguistic Reasoning In Language Models Chi Nathan A., Malchev Teodor, Kong Riley, Chi Ryan A., Huang Lucas, Chi Ethan A., Mccoy R. Thomas, Radev Dragomir
- Large Language Model As An Assignment Evaluator: Insights, Feedback, And Challenges In A 1000+ Student Course Chiang Cheng-han, Chen Wei-chih, Kuan Chun-yi, Yang Chienchou, Lee Hung-yi
- Human-centered Llm-agent User Interface: A Position Paper Chin Daniel, Wang Yuxuan, Xia Gus
- An Empirical Study Of In-context Learning In Llms For Machine Translation Chitale Pranjal A., Gala Jay, Dabre Raj
- Exploring Quantization For Efficient Pre-training Of Transformer Language Models Chitsaz Kamran, Fournier Quentin, Mordido Gonçalo, Chandar Sarath
- A Computational Framework For Behavioral Assessment Of LLM Therapists Chiu Yu Ying, Sharma Ashish, Lin Inna Wanyin, Althoff Tim
- SPAR: Personalized Content-based Recommendation Via Long Engagement Attention Chiyu Zhang, Yifei Sun, Jun Chen, Jie Lei, Muhammad Abdul-mageed, Sinong Wang, Rong Jin, Sem Park, Ning Yao, Bo Long
- Transformer Explainer: Interactive Learning Of Text-generative Models Cho Aeree, Kim Grace C., Karpekov Alexander, Helbling Alec, Wang Zijie J., Lee Seongmin, Hoover Benjamin, Chau Duen Horng
- Unveiling Imitation Learning: Exploring The Impact Of Data Falsity To Large Language Model Cho Hyunsoo
- Language-image Models With 3D Understanding Cho Jang Hyun, Ivanovic Boris, Cao Yulong, Schmerling Edward, Wang Yue, Weng Xinshuo, Li Boyi, You Yurong, Krähenbühl Philipp, Wang Yan, Pavone Marco
- Kv-runahead: Scalable Causal LLM Inference By Parallel Key-value Cache Generation Cho Minsik, Rastegari Mohammad, Naik Devang
- Pragmatic Inference Of Scalar Implicature By Llms Cho Ye-eun, Kim Seong Mook
- Optimizing Language Augmentation For Multilingual Large Language Models: A Case Study On Korean Choi Changsu, Jeong Yongbin, Park Seoyoon, Won Inho, Lim Hyeonseok, Kim Sangmin, Kang Yejee, Yoon Chanhyuk, Park Jaewan, Lee Yiseul, Lee Hyejin, Hahm Younggyun, Kim Hansaem, Lim Kyungtae
- Multi-granularity Guided Fusion-in-decoder Choi Eunseong, Lee Hyeri, Lee Jongwuk
- Can Only Llms Do Reasoning?: Potential Of Small Language Models In Task Planning Choi Gawon, Ahn Hyemin
- Lota-bench: Benchmarking Language-oriented Task Planners For Embodied Agents Choi Jae-woo, Yoon Youngwoo, Ong Hyobin, Kim Jaehong, Jang Minsu
- SNAP: Unlearning Selective Knowledge In Large Language Models With Negative Instructions Choi Minseok, Rim Daniel, Lee Dohyun, Choo Jaegul
- Cross-architecture Transfer Learning For Linear-cost Inference Transformers Choi Sehyun
- Breaking Down The Defenses: A Comparative Survey Of Attacks On Large Language Models Chowdhury Arijit Ghosh, Islam Md Mofijul, Kumar Vaibhav, Shezan Faysal Hossain, Kumar Vaibhav, Jain Vinija, Chadha Aman
- Meerkat: Audio-visual Large Language Model For Grounding In Space And Time Chowdhury Sanjoy, Nag Sayan, Dasgupta Subhrajyoti, Chen Jun, Elhoseiny Mohamed, Gao Ruohan, Manocha Dinesh
- Autotutor Meets Large Language Models: A Language Model Tutor With Rich Pedagogy And Guardrails Chowdhury Sankalan Pal, Zouhar Vilém, Sachan Mrinmaya
- Med42-v2: A Suite Of Clinical Llms Christophe Clément, Kanithi Praveen K, Raha Tathagata, Khan Shadab, Pimentel Marco Af
- Professional Agents -- Evolving Large Language Models Into Autonomous Experts With Human-level Competencies Chu Zhixuan, Wang Yan, Zhu Feng, Yu Lu, Li Longfei, Gu Jinjie
- AI Safety In Generative AI Large Language Models: A Survey Chua Jaymari, Li Yun, Yang Shiyi, Wang Chen, Yao Lina
- Lookback Lens: Detecting And Mitigating Contextual Hallucinations In Large Language Models Using Only Attention Maps Chuang Yung-sung, Qiu Linlu, Hsieh Cheng-yu, Krishna Ranjay, Kim Yoon, Glass James
- Developing Generative AI Chatbots Conceptual Framework For Higher Education Chukwuere Joshua Ebere
- Informed AI Regulation: Comparing The Ethical Frameworks Of Leading LLM Chatbots Using An Ethics-based Audit To Assess Moral Reasoning And Normative Values Chun Jon, Elkins Katherine
- Stable Language Model Pre-training By Reducing Embedding Variability Chung Woojin, Hong Jiwoo, An Na Min, Thorne James, Yun Se-young
- PEFT-U: Parameter-efficient Fine-tuning For User Personalization Clarke Christopher, Heng Yuzhao, Tang Lingjia, Mars Jason
- Improving Token-based World Models With Parallel Observation Prediction Cohen Lior, Wang Kaixin, Kang Bingyi, Mannor Shie
- A Chain-of-thought Prompting Approach With Llms For Evaluating Students' Formative Assessment Responses In Science Cohn Clayton, Hutchins Nicole, Le Tuan, Biswas Gautam
- Towards A Human-in-the-loop LLM Approach To Collaborative Discourse Analysis Cohn Clayton, Snyder Caitlin, Montenegro Justin, Biswas Gautam
- Saullm-54b & Saullm-141b: Scaling Up Domain Adaptation For The Legal Domain Colombo Pierre, Pires Telmo, Boudiaf Malik, Melo Rui, Culver Dominic, Morgado Sofia, Malaboeuf Etienne, Hautreux Gabriel, Charpentier Johanne, Desa Michael
- Open Source Conversational Llms Do Not Know Most Spanish Words Conde Javier, González Miguel, Melero Nina, Ferrando Raquel, Martínez Gonzalo, Merino-gómez Elena, Hernández José Alberto, Reviriego Pedro
- Attentionlego: An Open-source Building Block For Spatially-scalable Large Language Model Accelerator With Processing-in-memory Technology Cong Rongqing, He Wenyang, Li Mingxuan, Luo Bangning, Yang Zebin, Yang Yuchao, Huang Ru, Yan Bonan
- Representations As Language: An Information-theoretic Framework For Interpretability Conklin Henry, Smith Kenny
- Gamebench: Evaluating Strategic Reasoning Abilities Of LLM Agents Costarelli Anthony, Allen Mat, Hauksson Roman, Sodunke Grace, Hariharan Suhas, Cheng Carlson, Li Wenjie, Clymer Joshua, Yadav Arjun
- Induction Heads As An Essential Mechanism For Pattern Matching In In-context Learning Crosbie J., Shutova E.
- Llambert: Large-scale Low-cost Data Annotation In NLP Csanády Bálint, Muzsai Lajos, Vedres Péter, Nádasdy Zoltán, Lukács András
- Moeut: Mixture-of-experts Universal Transformers Csordás Róbert, Irie Kazuki, Schmidhuber Jürgen, Potts Christopher, Manning Christopher D.
- The Power Of Noise: Redefining Retrieval For RAG Systems Cuconasu Florin, Trappolini Giovanni, Siciliano Federico, Filice Simone, Campagnano Cesare, Maarek Yoelle, Tonellotto Nicola, Silvestri Fabrizio
- Efficiently Exploring Large Language Models For Document-level Machine Translation With In-context Learning Cui Menglong, Du Jiangcun, Zhu Shaolin, Xiong Deyi
- Webapp1k: A Practical Code-generation Benchmark For Web App Development Cui Yi
- Rethinking LLM Language Adaptation: A Case Study On Chinese Mixtral Cui Yiming, Yao Xin
- Can AI Replace Human Subjects? A Large-scale Replication Of Psychological Experiments With Llms Cui Ziyan, Li Ning, Zhou Huaikang
- Synthetic Dataset Creation And Fine-tuning Of Transformer Models For Question Answering In Serbian Cvetanović Aleksa, Tadić Predrag
- Anchored Preference Optimization And Contrastive Revisions: Addressing Underspecification In Alignment D'oosterlinck Karel, Xu Winnie, Develder Chris, Demeester Thomas, Singh Amanpreet, Potts Christopher, Kiela Douwe, Mehri Shikib
- Chatgpt Vs Llama: Impact, Reliability, And Challenges In Stack Overflow Discussions Da Silva Leuson, Samhi Jordan, Khomh Foutse
- Large Legal Fictions: Profiling Legal Hallucinations In Large Language Models Dahl Matthew, Magesh Varun, Suzgun Mirac, Ho Daniel E.
- MHPP: Exploring The Capabilities And Limitations Of Language Models Beyond Basic Code Generation Dai Jianbo, Lu Jianqiao, Feng Yunlong, Ruan Rongju, Cheng Ming, Tan Haochen, Guo Zhijiang
- CORM: Cache Optimization With Recent Message For Large Language Model Inference Dai Jincheng, Huang Zhuowei, Jiang Haiyun, Chen Chen, Cai Deng, Bi Wei, Shi Shuming
- Muap: Multi-step Adaptive Prompt Learning For Vision-language Model With Missing Modality Dai Ruiting, Tan Yuqiao, Mo Lisi, He Tao, Qin Ke, Liang Shuang
- Large Language Models Can Better Understand Knowledge Graphs Than We Thought Dai Xinbang, Hua Yuncheng, Wu Tongtong, Sheng Yang, Ji Qiu, Qi Guilin
- Inference To The Best Explanation In Large Language Models Dalal Dhairya, Valentino Marco, Freitas André, Buitelaar Paul
- A Complete Survey On Llm-based AI Chatbots Dam Sumit Kumar, Hong Choong Seon, Qiao Yu, Zhang Chaoning
- Impact Of Non-standard Unicode Characters On Security And Comprehension In Large Language Models Daniel Johan S, Pal Anand
- Synthetic Patient-physician Dialogue Generation From Clinical Notes Using LLM Das Trisha, Albassam Dina, Sun Jimeng
- Xmainframe: A Large Language Model For Mainframe Modernization Dau Anh T. V., Dao Hieu Trung, Nguyen Anh Tuan, Tran Hieu Trung, Nguyen Phong X., Bui Nghi D. Q.
- Investigating Symbolic Capabilities Of Large Language Models Dave Neisarg, Kifer Daniel, Giles C. Lee, Mali Ankur
- Prompting Open-source And Commercial Language Models For Grammatical Error Correction Of English Learner Text Davis Christopher, Caines Andrew, Andersen Øistein, Taslimipoor Shiva, Yannakoudakis Helen, Yuan Zheng, Bryant Christopher, Rei Marek, Buttery Paula
- Llms Are Not Intelligent Thinkers: Introducing Mathematical Topic Tree Benchmark For Comprehensive Evaluation Of Llms Davoodi Arash Gholami, Davoudi Seyed Pouyan Mousavi, Pezeshkpour Pouya
- Aligning Model Evaluations With Human Preferences: Mitigating Token Count Bias In Language Model Assessments Daynauth Roland, Mars Jason
- Application Of GPT Language Models For Innovation In Activities In University Teaching De Buenaga Manuel, Bueno Francisco Javier
- Show, Don't Tell: Evaluating Large Language Models Beyond Textual Understanding With Childplay De Carvalho Gonçalo Hora, Knap Oscar, Pollice Robert
- Towards Transparency: Exploring LLM Trainings Datasets Through Visual Topic Modeling And Semantic Frame De Dampierre Charles, Mogoutov Andrei, Baumard Nicolas
- Retail-gpt: Leveraging Retrieval Augmented Generation (RAG) For Building E-commerce Chat Assistants De Freitas Bruno Amaral Teixeira, Lotufo Roberto De Alencar
- Pelle: Encoder-based Language Models For Brazilian Portuguese Based On Open Data De Mello Guilherme Lamartine, Finger Marcelo, Serras Felipe, Carpi Miguel De Mello, Jose Marcos Menon, Domingues Pedro Henrique, Cavalim Paulo
- Will GPT-4 Run DOOM? De Wynter Adrian
- Evaluating Large Language Models For Automatic Analysis Of Teacher Simulations De-fitero-dominguez David, Albaladejo-gonzález Mariano, Garcia-cabot Antonio, Garcia-lopez Eva, Moreno-cediel Antonio, Barno Erin, Reich Justin
- Deepseek-v2: A Strong, Economical, And Efficient Mixture-of-experts Language Model Deepseek-ai, Aixin Liu, Bei Feng, Bin Wang, Bingxuan Wang, Bo Liu, Chenggang Zhao, Chengqi Dengr, Chong Ruan, Damai Dai, Daya Guo, Dejian Yang, Deli Chen, Dongjie Ji, Erhang Li, Fangyun Lin, Fuli Luo, Guangbo Hao, Guanting Chen, Guowei Li, H. Zhang, Hanwei Xu, Hao Yang, Haowei Zhang, Honghui Ding, Huajian Xin, Huazuo Gao, Hui Li, Hui Qu, J. L. Cai, Jian Liang, Jianzhong Guo, Jiaqi Ni, Jiashi Li, Jin Chen, Jingyang Yuan, Junjie Qiu, Junxiao Song, Kai Dong, Kaige Gao, Kang Guan, Lean Wang, Lecong Zhang, Lei Xu, Leyi Xia, Liang Zhao, Liyue Zhang, Meng Li, Miaojun Wang, Mingchuan Zhang, Minghua Zhang, Minghui Tang, Mingming Li, Ning Tian, Panpan Huang, Peiyi Wang, Peng Zhang, Qihao Zhu, Qinyu Chen, Qiushi Du, R. J. Chen, R. L. Jin, Ruiqi Ge, Ruizhe Pan, Runxin Xu, Ruyi Chen, S. S. Li, Shanghao Lu, Shangyan Zhou, Shanhuang Chen, Shaoqing Wu, Shengfeng Ye, Shirong Ma, Shiyu Wang, Shuang Zhou, Shuiping Yu, Shunfeng Zhou, Size Zheng, T. Wang, Tian Pei, Tian Yuan, Tianyu Sun, W. L. Xiao, Wangding Zeng, Wei An, Wen Liu, Wenfeng Liang, Wenjun Gao, Wentao Zhang, X. Q. Li, Xiangyue Jin, Xianzu Wang, Xiao Bi, Xiaodong Liu, Xiaohan Wang, Xiaojin Shen, Xiaokang Chen, Xiaosha Chen, Xiaotao Nie, Xiaowen Sun, Xiaoxiang Wang, Xin Liu, Xin Xie, Xingkai Yu, Xinnan Song, Xinyi Zhou, Xinyu Yang, Xuan Lu, Xuecheng Su, Y. Wu, Y. K. Li, Y. X. Wei, Y. X. Zhu, Yanhong Xu, Yanping Huang, Yao Li, Yao Zhao, Yaofeng Sun, Yaohui Li, Yaohui Wang, Yi Zheng, Yichao Zhang, Yiliang Xiong, Yilong Zhao, Ying He, Ying Tang, Yishi Piao, Yixin Dong, Yixuan Tan, Yiyuan Liu, Yongji Wang, Yongqiang Guo, Yuchen Zhu, Yuduan Wang, Yuheng Zou, Yukun Zha, Yunxian Ma, Yuting Yan, Yuxiang You, Yuxuan Liu, Z. Z. Ren, Zehui Ren, Zhangli Sha, Zhe Fu, Zhen Huang, Zhen Zhang, Zhenda Xie, Zhewen Hao, Zhihong Shao, Zhiniu Wen, Zhipeng Xu, Zhongyu Zhang, Zhuoshu Li, Zihan Wang, Zihui Gu, Zilin Li, Ziwei Xie
- Understanding Biases In Chatgpt-based Recommender Systems: Provider Fairness, Temporal Stability, And Recency Deldjoo Yashar
- Cram: Credibility-aware Attention Modification In Llms For Combating Misinformation In RAG Deng Boyi, Wang Wenjie, Zhu Fengbin, Wang Qifan, Feng Fuli
- MIMIR: A Streamlined Platform For Personalized Agent Tuning In Domain Expertise Deng Chunyuan, Tang Xiangru, Zhao Yilun, Wang Hanming, Wang Haoran, Zhou Wangchunshu, Cohan Arman, Gerstein Mark
- Pandora: Jailbreak Gpts By Retrieval Augmented Generation Poisoning Deng Gelei, Liu Yi, Wang Kailong, Li Yuekang, Zhang Tianwei, Liu Yang
- ELASTIC: Efficient Linear Attention For Sequential Interest Compression Deng Jiaxin, Wang Shiyao, Lu Song, Li Yinfeng, Luo Xinchen, Liu Yuanjun, Xu Peixing, Zhou Guorui
- Attention Is Naturally Sparse With Gaussian Distributed Input Deng Yichuan, Song Zhao, Yang Chiwun
- Cognidual Framework: Self-training Large Language Models Within A Dual-system Theoretical Framework For Improving Cognitive Tasks Deng Yongxin, Qiu Xihe, Tan Xiaoyu, Qu Chao, Pan Jing, Cheng Yuan, Xu Yinghui, Chu Wei
- Text-tuple-table: Towards Information Integration In Text-to-table Generation Via Global Tuple Extraction Deng Zheye, Chan Chunkit, Wang Weiqi, Sun Yuxi, Fan Wei, Zheng Tianshi, Yim Yauwai, Song Yangqiu
- Language In Vivo Vs. In Silico: Size Matters But Larger Language Models Still Do Not Comprehend Language On A Par With Humans Dentella Vittoria, Guenther Fritz, Leivada Evelina
- Promises, Outlooks And Challenges Of Diffusion Language Modeling Deschenaux Justin, Gulcehre Caglar
- Are You Sure? Rank Them Again: Repeated Ranking For Better Preference Datasets Devine Peter
- A Simple And Effective \(L_2\) Norm-based Strategy For KV Cache Compression Devoto Alessio, Zhao Yu, Scardapane Simone, Minervini Pasquale
- Facts-and-feelings: Capturing Both Objectivity And Subjectivity In Table-to-text Generation Dey Tathagata, Bhattacharyya Pushpak
- From Words To Worlds: Compositionality For Cognitive Architectures Dhar Ruchira, Søgaard Anders
- Unveiling Encoder-free Vision-language Models Diao Haiwen, Cui Yufeng, Li Xiaotong, Wang Yueze, Lu Huchuan, Wang Xinlong
- Enhancing Recommendation Diversity By Re-ranking With Large Language Models Diego Carraro, Derek Bridge
- Learning How To Ask: Cycle-consistency Refines Prompts In Multimodal Foundation Models Diesendruck Maurice, Lin Jianzhe, Imani Shima, Mahalingam Gayathri, Xu Mingyang, Zhao Jie
- Large Language Model Agent In Financial Trading: A Survey Ding Han, Li Yinheng, Wang Junhao, Chen Hang
- MANGO: A Benchmark For Evaluating Mapping And Navigation Abilities Of Large Language Models Ding Peng, Fang Jiading, Li Peng, Wang Kangrui, Zhou Xiaochen, Yu Mo, Li Jing, Walter Matthew R., Mei Hongyuan
- Hallu-pi: Evaluating Hallucination In Multi-modal Large Language Models Within Perturbed Inputs Ding Peng, Wu Jingyu, Kuang Jun, Ma Dan, Cao Xuezhi, Cai Xunliang, Chen Shi, Chen Jiajun, Huang Shujian
- Semcoder: Training Code Language Models With Comprehensive Semantics Ding Yangruibo, Peng Jinjun, Min Marcus J., Kaiser Gail, Yang Junfeng, Ray Baishakhi
- Entgpt: Linking Generative Large Language Models With Knowledge Bases Ding Yifan, Poudel Amrit, Zeng Qingkai, Weninger Tim, Veeramani Balaji, Bhattacharya Sanmitra
- Longrope: Extending LLM Context Window Beyond 2 Million Tokens Ding Yiran, Zhang Li Lyna, Zhang Chengruidong, Xu Yuanyuan, Shang Ning, Xu Jiahang, Yang Fan, Yang Mao
- A Tale Of Tails: Model Collapse As A Change Of Scaling Laws Dohmatob Elvis, Feng Yunzhen, Yang Pu, Charton Francois, Kempe Julia
- Latent Attention For Linear Time Transformers Dolga Rares, Cobzarenco Marius, Barber David
- Evaluating And Mitigating Linguistic Discrimination In Large Language Models Dong Guoliang, Wang Haoyu, Sun Jun, Wang Xinyu
- QAQ: Quality Adaptive Quantization For LLM KV Cache Dong Shichen, Cheng Wen, Qin Jiayu, Wang Wei
- Internlm-xcomposer2-4khd: A Pioneering Large Vision-language Model Handling Resolutions From 336 Pixels To 4K HD Dong Xiaoyi, Zhang Pan, Zang Yuhang, Cao Yuhang, Wang Bin, Ouyang Linke, Zhang Songyang, Duan Haodong, Zhang Wenwei, Li Yining, Yan Hang, Gao Yang, Chen Zhe, Zhang Xinyue, Li Wei, Li Jingwen, Wang Wenhai, Chen Kai, He Conghui, Zhang Xingcheng, Dai Jifeng, Qiao Yu, Lin Dahua, Wang Jiaqi
- Prompt Your Brain: Scaffold Prompt Tuning For Efficient Adaptation Of Fmri Pre-trained Model Dong Zijian, Wu Yilei, Chen Zijiao, Zhang Yichi, Jin Yueming, Zhou Juan Helen
- How Reliable Are Automatic Evaluation Methods For Instruction-tuned Llms? Doostmohammadi Ehsan, Holmström Oskar, Kuhlmann Marco
- To What Extent Is Chatgpt Useful For Language Teacher Lesson Plan Creation? Dornburg Alex, Davin Kristin
- Integrating Physician Diagnostic Logic Into Large Language Models: Preference Learning From Process Feedback Dou Chengfeng, Jin Zhi, Jiao Wenpin, Zhao Haiyan, Zhao Yongqiang, Tao Zhenwei
- What's Wrong With Your Code Generated By Large Language Models? An Extensive Study Dou Shihan, Jia Haoxiang, Wu Shenxi, Zheng Huiyuan, Zhou Weikang, Wu Muling, Chai Mingxu, Fan Jessica, Huang Caishuang, Tao Yunbo, Liu Yan, Zhou Enyu, Zhang Ming, Zhou Yuhao, Wu Yueming, Zheng Rui, Wen Ming, Weng Rongxiang, Wang Jingang, Cai Xunliang, Gui Tao, Qiu Xipeng, Zhang Qi, Huang Xuanjing
- Enhancing Robustness Of Llm-synthetic Text Detectors For Academic Writing: A Comprehensive Analysis Dou Zhicheng, Guo Yuchen, Chang Ching-chun, Nguyen Huy H., Echizen Isao
- Assessing The Reasoning Abilities Of Chatgpt In The Context Of Claim Verification Dougrez-lewis John, Akhter Mahmud Elahi, He Yulan, Liakata Maria
- Evaluating Language Model Context Windows: A "working Memory" Test And Inference-time Correction Dsouza Amanda, Glaze Christopher, Shin Changho, Sala Frederic
- In-context Learning With Reinforcement Learning For Incomplete Utterance Rewriting Du Haowei, Zhao Dongyan
- Efficient Personalized Text-to-image Generation By Leveraging Textual Subspace Du Shian, Cheng Xiaotian, Qian Qi, Wei Henglu, Xu Yi, Ji Xiangyang
- Unlocking Continual Learning Abilities In Language Models Du Wenyu, Cheng Shuang, Luo Tongxu, Qiu Zihan, Huang Zeyu, Cheung Ka Chun, Cheng Reynold, Fu Jie
- Stacking Your Transformers: A Closer Look At Model Growth For Efficient LLM Pre-training Du Wenyu, Luo Tongxu, Qiu Zihan, Huang Zeyu, Shen Yikang, Cheng Reynold, Guo Yike, Fu Jie
- Perltqa: A Personal Long-term Memory Dataset For Memory Classification, Retrieval, And Synthesis In Question Answering Du Yiming, Wang Hongru, Zhao Zhengyi, Liang Bin, Wang Baojun, Zhong Wanjun, Wang Zezhong, Wong Kam-fai
- CLEFT: Language-image Contrastive Learning With Efficient Large Language Model And Prompt Fine-tuning Du Yuexi, Chang Brian, Dvornek Nicha C.
- On The Evaluation Consistency Of Attribution-based Explanations Duan Jiarui, Li Haoling, Zhang Haofei, Jiang Hao, Xue Mengqi, Sun Li, Song Mingli, Song Jie
- Macbehaviour: An R Package For Behavioural Experimentation On Large Language Models Duan Xufeng, Li Shixuan, Cai Zhenguang G.
- Prompting Multi-modal Tokens To Enhance End-to-end Autonomous Driving Imitation Learning With Llms Duan Yiqun, Zhang Qiang, Xu Renjing
- How To Think Step-by-step: A Mechanistic Understanding Of Chain-of-thought Reasoning Dutta Subhabrata, Singh Joykirat, Chakrabarti Soumen, Chakraborty Tanmoy
- Applying RLAIF For Code Generation With Api-usage In Lightweight Llms Dutta Sujan, Mahinder Sayantan, Anantha Raviteja, Bandyopadhyay Bortik
- Your Context Is Not An Array: Unveiling Random Access Limitations In Transformers Ebrahimi Mohammadreza, Panchal Sunny, Memisevic Roland
- Dialogue You Can Trust: Human And AI Perspectives On Generated Conversations Ebubechukwu Ike, Takeuchi Johane, Ceravola Antonello, Joublin Frank
- Reducing Selection Bias In Large Language Models Eicher J. E., Irgolič R. F.
- Arabic Automatic Story Generation With Large Language Models El-shangiti Ahmed Oumar, Alwajih Fakhraddin, Abdul-mageed Muhammad
- Creating Arabic LLM Prompts At Scale El-sheikh Abdelrahman, Elmogtaba Ahmed, Darwish Kareem, Elmallah Muhammad, Elneima Ashraf, Sawaf Hassan
- Persuasiveness Of Generated Free-text Rationales In Subjective Decisions: A Case Study On Pairwise Argument Ranking Elaraby Mohamed, Litman Diane, Li Xiang Lorraine, Magooda Ahmed
- Dynamic Q&A Of Clinical Documents With Large Language Models Elgedawy Ran, Danciu Ioana, Mahbub Maria, Srinivasan Sudarshan
- Ocassionally Secure: A Comparative Analysis Of Code Generation Assistants Elgedawy Ran, Sadik John, Dutta Senjuti, Gautam Anuj, Georgiou Konstantinos, Gholamrezae Farzin, Ji Fujiao, Lim Kyungchan, Liu Qian, Ruoti Scott
- Layerskip: Enabling Early Exit Inference And Self-speculative Decoding Elhoushi Mostafa, Shrivastava Akshat, Liskovich Diana, Hosmer Basil, Wasti Bram, Lai Liangzhen, Mahmoud Anas, Acun Bilge, Agarwal Saurabh, Roman Ahmed, Aly Ahmed A, Chen Beidi, Wu Carole-jean
- Enabling Natural Zero-shot Prompting On Encoder Models Via Statement-tuning Elshabrawy Ahmed, Huang Yongxin, Gurevych Iryna, Aji Alham Fikri
- Reasoning In Transformers -- Mitigating Spurious Correlations And Reasoning Shortcuts Enström Daniel, Kjellberg Viktor, Johansson Moa
- Tinyagent: Function Calling At The Edge Erdogan Lutfi Eren, Lee Nicholas, Jha Siddharth, Kim Sehoon, Tabrizi Ryan, Moon Suhong, Hooper Coleman, Anumanchipalli Gopala, Keutzer Kurt, Gholami Amir
- Empirical Studies Of Parameter Efficient Methods For Large Language Models Of Code And Knowledge Transfer To R Esmaeili Amirreza, Saberi Iman, Fard Fatemeh H.
- Low-cost Language Models: Survey And Performance Evaluation On Python Code Generation Espejel Jessica López, Alassan Mahaman Sanoussi Yahaya, Bouhandi Merieme, Dahhane Walid, Ettifouri El Hassane
- Bertaqa: How Much Do Language Models Know About Local Culture? Etxaniz Julen, Azkune Gorka, Soroa Aitor, De Lacalle Oier Lopez, Artetxe Mikel
- Latxa: An Open Language Model And Evaluation Suite For Basque Etxaniz Julen, Sainz Oscar, Perez Naiara, Aldabe Itziar, Rigau German, Agirre Eneko, Ormazabal Aitor, Artetxe Mikel, Soroa Aitor
- Large Language Models As Evaluators For Scientific Synthesis Evans Julia, D'souza Jennifer, Auer Sören
- Decoding AI: The Inside Story Of Data Analysis In Chatgpt Evkaya Ozan, De Carvalho Miguel
- Data-augmentation-based Dialectal Adaptation For Llms Faisal Fahim, Anastasopoulos Antonios
- A Survey On RAG Meeting Llms: Towards Retrieval-augmented Large Language Models Fan Wenqi, Ding Yujuan, Ning Liangbo, Wang Shijie, Li Hengyun, Yin Dawei, Chua Tat-seng, Li Qing
- Prompt Optimizer Of Text-to-image Diffusion Models For Abstract Concept Understanding Fan Zezhong, Li Xiaohan, Fang Chenhao, Biswas Topojoy, Nag Kaushiki, Xu Jianpeng, Achan Kannan
- Biasalert: A Plug-and-play Tool For Social Bias Detection In Llms Fan Zhiting, Chen Ruizhe, Xu Ruiling, Liu Zuozhu
- DARA: Decomposition-alignment-reasoning Autonomous Language Agent For Question Answering Over Knowledge Graphs Fang Haishuo, Zhu Xiaodan, Gurevych Iryna
- Llama-omni: Seamless Speech Interaction With Large Language Models Fang Qingkai, Guo Shoutao, Zhou Yan, Ma Zhengrui, Zhang Shaolei, Feng Yang
- PATCH! Psychometrics-assisted Benchmarking Of Large Language Models: A Case Study Of Proficiency In 8th Grade Mathematics Fang Qixiang, Oberski Daniel L., Nguyen Dong
- \(VILA^2\): VILA Augmented VILA Fang Yunhao, Zhu Ligeng, Lu Yao, Wang Yan, Molchanov Pavlo, Cho Jang Hyun, Pavone Marco, Han Song, Yin Hongxu
- Unraveling The Dominance Of Large Language Models Over Transformer Models For Bangla Natural Language Inference: A Comprehensive Study Faria Fatema Tuj Johora, Moin Mukaffi Bin, Fahim Asif Iftekher, Debnath Pronay, Shah Faisal Muhammad
- Evaluating Human Alignment And Model Faithfulness Of LLM Rationale Fayyaz Mohsen, Yin Fan, Sun Jiao, Peng Nanyun
- What Are Large Language Models Mapping To In The Brain? A Case Against Over-reliance On Brain Scores Feghhi Ebrahim, Hadidi Nima, Song Bryan, Blank Idan A., Kao Jonathan C.
- Video-ccam: Enhancing Video-language Understanding With Causal Cross-attention Masks For Short And Long Videos Fei Jiajun, Li Dian, Deng Zhidong, Wang Zekun, Liu Gang, Wang Hui
- Internlm-law: An Open Source Chinese Legal Large Language Model Fei Zhiwei, Zhang Songyang, Shen Xiaoyu, Zhu Dawei, Wang Xiao, Cao Maosong, Zhou Fengzhe, Li Yining, Zhang Wenwei, Lin Dahua, Chen Kai, Ge Jidong
- Ragged Edges: The Double-edged Sword Of Retrieval-augmented Chatbots Feldman Philip, Foulds James R., Pan Shimei
- IGOT: Information Gain Optimized Tokenizer On Domain Adaptive Pretraining Feng Dawei, Zhang Yihai, Xu Zhixuan
- Citygpt: Empowering Urban Spatial Cognition Of Large Language Models Feng Jie, Du Yuwei, Liu Tianhui, Guo Siqi, Lin Yuming, Li Yong
- AGILE: A Novel Framework Of LLM Agents Feng Peiyuan, He Yichen, Huang Guanhua, Lin Yuan, Zhang Hanchong, Zhang Yuchen, Li Hang
- An Extremely Data-efficient And Generative Llm-based Reinforcement Learning Agent For Recommenders Feng Shuang, Feng Grace
- Courseassist: Pedagogically Appropriate AI Tutor For Computer Science Education Feng Ty, Liu Sa, Ghosal Dipak
- Optimus: Accelerating Large-scale Multi-modal LLM Training By Bubble Exploitation Feng Weiqi, Chen Yangrui, Wang Shaoyu, Peng Yanghua, Lin Haibin, Yu Minlan
- Mixture-of-loras: An Efficient Multitask Tuning For Large Language Models Feng Wenfeng, Hao Chuzhan, Zhang Yuewei, Han Yu, Wang Hao
- Llama-vits: Enhancing TTS Synthesis With Semantic Awareness Feng Xincan, Yoshimoto Akifumi
- Mt-ladder: A Model-agnostic Framework Boosting Llm-based Machine Translation To The Next Level Feng Zhaopeng, Zhang Yan, Chen Ruizhe, Meng Zijie, Liu Zuozhu
- MERA: A Comprehensive LLM Evaluation In Russian Fenogenova Alena, Chervyakov Artem, Martynov Nikita, Kozlova Anastasia, Tikhonova Maria, Akhmetgareeva Albina, Emelyanov Anton, Shevelev Denis, Lebedev Pavel, Sinev Leonid, Isaeva Ulyana, Kolomeytseva Katerina, Moskovskiy Daniil, Goncharova Elizaveta, Savushkin Nikita, Mikhailova Polina, Dimitrov Denis, Panchenko Alexander, Markov Sergei
- Syllabusqa: A Course Logistics Question Answering Dataset Fernandez Nigel, Scarlatos Alexander, Lan Andrew
- Generative AI And Large Language Models For Cyber Security: All Insights You Need Ferrag Mohamed Amine, Alwahedi Fatima, Battah Ammar, Cherif Bilel, Mechri Abdechakour, Tihanyi Norbert
- Exaranker-open: Synthetic Explanation For IR Using Open-source Llms Ferraretto Fernando, Laitz Thiago, Lotufo Roberto, Nogueira Rodrigo
- Toward Large Language Models As A Therapeutic Tool: Comparing Prompting Techniques To Improve Gpt-delivered Problem-solving Therapy Filienko Daniil, Wang Yinzhou, Jazmi Caroline El, Xie Serena, Cohen Trevor, De Cock Martine, Yuwen Weichao
- The Chronicles Of RAG: The Retriever, The Chunk And The Generator Finardi Paulo, Avila Leonardo, Castaldoni Rodrigo, Gengo Pedro, Larcher Celio, Piau Marcos, Costa Pablo, Caridá Vinicius
- Convosense: Overcoming Monotonous Commonsense Inferences For Conversational AI Finch Sarah E., Choi Jinho D.
- Grillbot In Practice: Lessons And Tradeoffs Deploying Large Language Models For Adaptable Conversational Task Assistants Fischer Sophie, Gemmell Carlos, Tecklenburg Niklas, Mackie Iain, Rossetto Federico, Dalton Jeffrey
- Geckopt: LLM System Efficiency Via Intent-based Tool Selection Fore Michael, Singh Simranjit, Stamoulis Dimitrios
- CLIP With Generative Latent Replay: A Strong Baseline For Incremental Learning Frascaroli Emanuele, Panariello Aniello, Buzzega Pietro, Bonicelli Lorenzo, Porrello Angelo, Calderara Simone
- Representing Rule-based Chatbots With Transformers Friedman Dan, Panigrahi Abhishek, Chen Danqi
- LLM Agents In Interaction: Measuring Personality Consistency And Linguistic Alignment In Interacting Populations Of Large Language Models Frisch Ivar, Giulianelli Mario
- Hint-before-solving Prompting: Guiding Llms To Effectively Utilize Encoded Knowledge Fu Jinlan, Huangfu Shenzhen, Yan Hang, Ng See-kiong, Qiu Xipeng
- A Role-specific Guided Large Language Model For Ophthalmic Consultation Based On Stylistic Differentiation Fu Laiyi, Fan Binbin, Du Hongkai, Feng Yanxiang, Li Chunhua, Song Huping
- Lazyllm: Dynamic Token Pruning For Efficient Long Context LLM Inference Fu Qichen, Cho Minsik, Merth Thomas, Mehta Sachin, Rastegari Mohammad, Najibi Mahyar
- Moa: Mixture Of Sparse Attention For Automatic Large Language Model Compression Fu Tianyu, Huang Haofeng, Ning Xuefei, Zhang Genghan, Chen Boju, Wu Tianqi, Wang Hongyi, Huang Zixiao, Li Shiyao, Yan Shengen, Dai Guohao, Yang Huazhong, Wang Yu
- BLINK: Multimodal Large Language Models Can See But Not Perceive Fu Xingyu, Hu Yushi, Li Bangzheng, Feng Yu, Wang Haoyu, Lin Xudong, Roth Dan, Smith Noah A., Ma Wei-chiu, Krishna Ranjay
- Tiny Titans: Can Smaller Large Language Models Punch Above Their Weight In The Real World For Meeting Summarization? Fu Xue-yong, Laskar Md Tahmid Rahman, Khasanova Elena, Chen Cheng, Tn Shashi Bhushan
- Data Engineering For Scaling Language Models To 128K Context Fu Yao, Panda Rameswar, Niu Xinyao, Yue Xiang, Hajishirzi Hannaneh, Kim Yoon, Peng Hao
- UI-JEPA: Towards Active Perception Of User Intent Through Onscreen User Activity Fu Yicheng, Anantha Raviteja, Vashisht Prabal, Cheng Jianpeng, Littwin Etai
- Using Large Language Models For The Interpretation Of Building Regulations Fuchs Stefan, Witbrock Michael, Dimyadi Johannes, Amor Robert
- Layoutllm: Large Language Model Instruction Tuning For Visually Rich Document Understanding Fujitake Masato
- Can AI Relate: Testing Large Language Model Response For Mental Health Support Gabriel Saadia, Puri Isha, Xu Xuhai, Malgaroli Matteo, Ghassemi Marzyeh
- Student Answer Forecasting: Transformer-driven Answer Choice Prediction For Language Learning Gado Elena Grazia, Martorella Tommaso, Zunino Luca, Mejia-domenzain Paola, Swamy Vinitra, Frej Jibril, Käser Tanja
- Open Generative Large Language Models For Galician Gamallo Pablo, Rodríguez Pablo, De-dios-flores Iria, Sotelo Susana, Paniagua Silvia, Bardanca Daniel, Pichel José Ramom, Garcia Marcos
- Application Of LLM Agents In Recruitment: A Novel Framework For Resume Screening Gan Chengguang, Zhang Qinghao, Mori Tatsunori
- Question Aware Vision Transformer For Multimodal Reasoning Ganz Roy, Kittenplon Yair, Aberdam Aviad, Avraham Elad Ben, Nuriel Oren, Mazor Shai, Litman Ron
- Cost-efficient Large Language Model Serving For Multi-turn Conversations With Cachedattention Gao Bin, He Zhuomin, Sharma Puru, Kang Qingxuan, Jevdjic Djordje, Deng Junbo, Yang Xingkun, Yu Zhou, Zuo Pengfei
- Aligning LLM Agents By Learning Latent Preference From User Edits Gao Ge, Taymanov Alexey, Salinas Eduardo, Mineiro Paul, Misra Dipendra
- Self-evolving GPT: A Lifelong Autonomous Experiential Learner Gao Jinglong, Ding Xiao, Cui Yiming, Zhao Jianbai, Wang Hepeng, Liu Ting, Qin Bing
- Selfcp: Compressing Over-limit Prompt Via The Frozen Large Language Model Itself Gao Jun, Cao Ziqiang, Li Wenjie
- Harnessing The Intrinsic Knowledge Of Pretrained Language Models For Challenging Text Classification Settings Gao Lingyu
- Tc-llava: Rethinking The Transfer From Image To Video Understanding With Temporal Considerations Gao Mingze, Liu Jingyu, Li Mingda, Xie Jiangtao, Liu Qingbin, Zhao Bo, Chen Xi, Xiong Hui
- Lumina-t2x: Transforming Text Into Any Modality, Resolution, And Duration Via Flow-based Large Diffusion Transformers Gao Peng, Zhuo Le, Liu Dongyang, Du Ruoyi, Luo Xu, Qiu Longtian, Zhang Yuhang, Lin Chen, Huang Rongjie, Geng Shijie, Zhang Renrui, Xi Junlin, Shao Wenqi, Jiang Zhengkai, Yang Tianshuo, Ye Weicai, Tong He, He Jingwen, Qiao Yu, Li Hongsheng
- Improving Language Understanding From Screenshots Gao Tianyu, Wang Zirui, Bhaskar Adithya, Chen Danqi
- A Review On The Use Of Large Language Models As Virtual Tutors García-méndez Silvia, De Arriba-pérez Francisco, Somoza-lópez María Del Carmen
- Robust Pronoun Fidelity With English Llms: Are They Reasoning, Repeating, Or Just Biased? Gautam Vagrant, Bingert Eileen, Zhu Dawei, Lauscher Anne, Klakow Dietrich
- Longins: A Challenging Long-context Instruction-based Exam For Llms Gavin Shawn, Zheng Tuney, Liu Jiaheng, Que Quehry, Wang Noah, Yang Jian, Zhang Chenchen, Huang Wenhao, Chen Wenhu, Zhang Ge
- Graph Descriptive Order Improves Reasoning With Large Language Model Ge Yuyao, Liu Shenghua, Feng Wenjie, Mei Lingrui, Chen Lizhe, Cheng Xueqi
- Worldgpt: Empowering LLM As Multimodal World Model Ge Zhiqi, Huang Hongzhe, Zhou Mingze, Li Juncheng, Wang Guoming, Tang Siliang, Zhuang Yueting
- Edinburgh Clinical NLP At MEDIQA-CORR 2024: Guiding Large Language Models With Hints Gema Aryo Pradipta, Lee Chaeeun, Minervini Pasquale, Daines Luke, Simpson T. Ian, Alex Beatrice
- Gemini 1.5: Unlocking Multimodal Understanding Across Millions Of Tokens Of Context Gemini Team, Georgiev Petko, Lei Ving Ian, Burnell Ryan, Bai Libin, Gulati Anmol, Tanzer Garrett, Vincent Damien, Pan Zhufeng, Wang Shibo, Mariooryad Soroosh, Ding Yifan, Geng Xinyang, Alcober Fred, Frostig Roy, Omernick Mark, Walker Lexi, Paduraru Cosmin, Sorokin Christina, Tacchetti Andrea, Gaffney Colin, Daruki Samira, Sercinoglu Olcan, Gleicher Zach, Love Juliette, Voigtlaender Paul, Jain Rohan, Surita Gabriela, Mohamed Kareem, Blevins Rory, Ahn Junwhan, Zhu Tao, Kawintiranon Kornraphop, Firat Orhan, Gu Yiming, Zhang Yujing, Rahtz Matthew, Faruqui Manaal, Clay Natalie, Gilmer Justin, Co-reyes Jd, Penchev Ivo, Zhu Rui, Morioka Nobuyuki, Hui Kevin, Haridasan Krishna, Campos Victor, Mahdieh Mahdis, Guo Mandy, Hassan Samer, Kilgour Kevin, Vezer Arpi, Cheng Heng-tze, De Liedekerke Raoul, Goyal Siddharth, Barham Paul, Strouse Dj, Noury Seb, Adler Jonas, Sundararajan Mukund, Vikram Sharad, Lepikhin Dmitry, Paganini Michela, Garcia Xavier, Yang Fan, Valter Dasha, Trebacz Maja, Vodrahalli Kiran, Asawaroengchai Chulayuth, Ring Roman, Kalb Norbert, Soares Livio Baldini, Brahma Siddhartha, Steiner David, Yu Tianhe, Mentzer Fabian, He Antoine, Gonzalez Lucas, Xu Bibo, Kaufman Raphael Lopez, Shafey Laurent El, Oh Junhyuk, Hennigan Tom, Driessche George Van Den, Odoom Seth, Lucic Mario, Roelofs Becca, Lall Sid, Marathe Amit, Chan Betty, Ontanon Santiago, He Luheng, Teplyashin Denis, Lai Jonathan, Crone Phil, Damoc Bogdan, Ho Lewis, Riedel Sebastian, Lenc Karel, Yeh Chih-kuan, Chowdhery Aakanksha, Xu Yang, Kazemi Mehran, Amid Ehsan, Petrushkina Anastasia, Swersky Kevin, Khodaei Ali, Chen Gowoon, Larkin Chris, Pinto Mario, Yan Geng, Badia Adria Puigdomenech, Patil Piyush, Hansen Steven, Orr Dave, Arnold Sebastien M. 
R., Grimstad Jordan, Dai Andrew, Douglas Sholto, Sinha Rishika, Yadav Vikas, Chen Xi, Gribovskaya Elena, Austin Jacob, Zhao Jeffrey, Patel Kaushal, Komarek Paul, Austin Sophia, Borgeaud Sebastian, Friso Linda, Goyal Abhimanyu, Caine Ben, Cao Kris, Chung Da-woon, Lamm Matthew, Barth-maron Gabe, Kagohara Thais, Olszewska Kate, Chen Mia, Shivakumar Kaushik, Agarwal Rishabh, Godhia Harshal, Rajwar Ravi, Snaider Javier, Dotiwalla Xerxes, Liu Yuan, Barua Aditya, Ungureanu Victor, Zhang Yuan, Batsaikhan Bat-orgil, Wirth Mateo, Qin James, Danihelka Ivo, Doshi Tulsee, Chadwick Martin, Chen Jilin, Jain Sanil, Le Quoc, Kar Arjun, Gurumurthy Madhu, Li Cheng, Sang Ruoxin, Liu Fangyu, Lamprou Lampros, Munoz Rich, Lintz Nathan, Mehta Harsh, Howard Heidi, Reynolds Malcolm, Aroyo Lora, Wang Quan, Blanco Lorenzo, Cassirer Albin, Griffith Jordan, Das Dipanjan, Lee Stephan, Sygnowski Jakub, Fisher Zach, Besley James, Powell Richard, Ahmed Zafarali, Paulus Dominik, Reitter David, Borsos Zalan, Joshi Rishabh, Pope Aedan, Hand Steven, Selo Vittorio, Jain Vihan, Sethi Nikhil, Goel Megha, Makino Takaki, May Rhys, Yang Zhen, Schalkwyk Johan, Butterfield Christina, Hauth Anja, Goldin Alex, Hawkins Will, Senter Evan, Brin Sergey, Woodman Oliver, Ritter Marvin, Noland Eric, Giang Minh, Bolina Vijay, Lee Lisa, Blyth Tim, Mackinnon Ian, Reid Machel, Sarvana Obaid, Silver David, Chen Alexander, Wang Lily, Maggiore Loren, Chang Oscar, Attaluri Nithya, Thornton Gregory, Chiu Chung-cheng, Bunyan Oskar, Levine Nir, Chung Timothy, Eltyshev Evgenii, Si Xiance, Lillicrap Timothy, Brady Demetra, Aggarwal Vaibhav, Wu Boxi, Xu Yuanzhong, Mcilroy Ross, Badola Kartikeya, Sandhu Paramjit, Moreira Erica, Stokowiec Wojciech, Hemsley Ross, Li Dong, Tudor Alex, Shyam Pranav, Rahimtoroghi Elahe, Haykal Salem, Sprechmann Pablo, Zhou Xiang, Mincu Diana, Li Yujia, Addanki Ravi, Krishna Kalpesh, Wu Xiao, Frechette Alexandre, Eyal Matan, Dafoe Allan, Lacey Dave, Whang Jay, Avrahami Thi, Zhang Ye, Taropa Emanuel, Lin 
Hanzhao, Toyama Daniel, Rutherford Eliza, Sano Motoki, Choe Hyunjeong, Tomala Alex, Safranek-shrader Chalence, Kassner Nora, Pajarskas Mantas, Harvey Matt, Sechrist Sean, Fortunato Meire, Lyu Christina, Elsayed Gamaleldin, Kuang Chenkai, Lottes James, Chu Eric, Jia Chao, Chen Chih-wei, Humphreys Peter, Baumli Kate, Tao Connie, Samuel Rajkumar, Santos Cicero Nogueira Dos, Andreassen Anders, Rakićević Nemanja, Grewe Dominik, Kumar Aviral, Winkler Stephanie, Caton Jonathan, Brock Andrew, Dalmia Sid, Sheahan Hannah, Barr Iain, Miao Yingjie, Natsev Paul, Devlin Jacob, Behbahani Feryal, Prost Flavien, Sun Yanhua, Myaskovsky Artiom, Pillai Thanumalayan Sankaranarayana, Hurt Dan, Lazaridou Angeliki, Xiong Xi, Zheng Ce, Pardo Fabio, Li Xiaowei, Horgan Dan, Stanton Joe, Ambar Moran, Xia Fei, Lince Alejandro, Wang Mingqiu, Mustafa Basil, Webson Albert, Lee Hyo, Anil Rohan, Wicke Martin, Dozat Timothy, Sinha Abhishek, Piqueras Enrique, Dabir Elahe, Upadhyay Shyam, Boral Anudhyan, Hendricks Lisa Anne, Fry Corey, Djolonga Josip, Su Yi, Walker Jake, Labanowski Jane, Huang Ronny, Misra Vedant, Chen Jeremy, Skerry-ryan Rj, Singh Avi, Rijhwani Shruti, Yu Dian, Castro-ros Alex, Changpinyo Beer, Datta Romina, Bagri Sumit, Hrafnkelsson Arnar Mar, Maggioni Marcello, Zheng Daniel, Sulsky Yury, Hou Shaobo, Paine Tom Le, Yang Antoine, Riesa Jason, Rogozinska Dominika, Marcus Dror, Badawy Dalia El, Zhang Qiao, Wang Luyu, Miller Helen, Greer Jeremy, Sjos Lars Lowe, Nova Azade, Zen Heiga, Chaabouni Rahma, Rosca Mihaela, Jiang Jiepu, Chen Charlie, Liu Ruibo, Sainath Tara, Krikun Maxim, Polozov Alex, Lespiau Jean-baptiste, Newlan Josh, Cankara Zeyncep, Kwak Soo, Xu Yunhan, Chen Phil, Coenen Andy, Meyer Clemens, Tsihlas Katerina, Ma Ada, Gottweis Juraj, Xing Jinwei, Gu Chenjie, Miao Jin, Frank Christian, Cankara Zeynep, Ganapathy Sanjay, Dasgupta Ishita, Hughes-fitt Steph, Chen Heng, Reid David, Rong Keran, Fan Hongmin, Van Amersfoort Joost, Zhuang Vincent, Cohen Aaron, Gu Shixiang Shane, 
Mohananey Anhad, Ilic Anastasija, Tobin Taylor, Wieting John, Bortsova Anna, Thacker Phoebe, Wang Emma, Caveness Emily, Chiu Justin, Sezener Eren, Kaskasoli Alex, Baker Steven, Millican Katie, Elhawaty Mohamed, Aisopos Kostas, Lebsack Carl, Byrd Nathan, Dai Hanjun, Jia Wenhao, Wiethoff Matthew, Davoodi Elnaz, Weston Albert, Yagati Lakshman, Ahuja Arun, Gao Isabel, Pundak Golan, Zhang Susan, Azzam Michael, Sim Khe Chai, Caelles Sergi, Keeling James, Sharma Abhanshu, Swing Andy, Li Yaguang, Liu Chenxi, Bostock Carrie Grimes, Bansal Yamini, Nado Zachary, Anand Ankesh, Lipschultz Josh, Karmarkar Abhijit, Proleev Lev, Ittycheriah Abe, Yeganeh Soheil Hassas, Polovets George, Faust Aleksandra, Sun Jiao, Rrustemi Alban, Li Pen, Shivanna Rakesh, Liu Jeremiah, Welty Chris, Lebron Federico, Baddepudi Anirudh, Krause Sebastian, Parisotto Emilio, Soricut Radu, Xu Zheng, Bloxwich Dawn, Johnson Melvin, Neyshabur Behnam, Mao-jones Justin, Wang Renshen, Ramasesh Vinay, Abbas Zaheer, Guez Arthur, Segal Constant, Nguyen Duc Dung, Svensson James, Hou Le, York Sarah, Milan Kieran, Bridgers Sophie, Gworek Wiktor, Tagliasacchi Marco, Lee-thorp James, Chang Michael, Guseynov Alexey, Hartman Ale Jakse, Kwong Michael, Zhao Ruizhe, Kashem Sheleem, Cole Elizabeth, Miech Antoine, Tanburn Richard, Phuong Mary, Pavetic Filip, Cevey Sebastien, Comanescu Ramona, Ives Richard, Yang Sherry, Du Cosmo, Li Bo, Zhang Zizhao, Iinuma Mariko, Hu Clara Huiyi, Roy Aurko, Bijwadia Shaan, Zhu Zhenkai, Martins Danilo, Saputro Rachel, Gergely Anita, Zheng Steven, Jia Dawei, Antonoglou Ioannis, Sadovsky Adam, Gu Shane, Bi Yingying, Andreev Alek, Samangooei Sina, Khan Mina, Kocisky Tomas, Filos Angelos, Kumar Chintu, Bishop Colton, Yu Adams, Hodkinson Sarah, Mittal Sid, Shah Premal, Moufarek Alexandre, Cheng Yong, Bloniarz Adam, Lee Jaehoon, Pejman Pedram, Michel Paul, Spencer Stephen, Feinberg Vladimir, Xiong Xuehan, Savinov Nikolay, Smith Charlotte, Shakeri Siamak, Tran Dustin, Chesus Mary, Bohnet Bernd, Tucker 
George, Von Glehn Tamara, Muir Carrie, Mao Yiran, Kazawa Hideto, Slone Ambrose, Soparkar Kedar, Shrivastava Disha, Cobon-kerr James, Sharman Michael, Pavagadhi Jay, Araya Carlos, Misiunas Karolis, Ghelani Nimesh, Laskin Michael, Barker David, Li Qiujia, Briukhov Anton, Houlsby Neil, Glaese Mia, Lakshminarayanan Balaji, Schucher Nathan, Tang Yunhao, Collins Eli, Lim Hyeontaek, Feng Fangxiaoyu, Recasens Adria, Lai Guangda, Magni Alberto, De Cao Nicola, Siddhant Aditya, Ashwood Zoe, Orbay Jordi, Dehghani Mostafa, Brennan Jenny, He Yifan, Xu Kelvin, Gao Yang, Saroufim Carl, Molloy James, Wu Xinyi, Arnold Seb, Chang Solomon, Schrittwieser Julian, Buchatskaya Elena, Radpour Soroush, Polacek Martin, Giordano Skye, Bapna Ankur, Tokumine Simon, Hellendoorn Vincent, Sottiaux Thibault, Cogan Sarah, Severyn Aliaksei, Saleh Mohammad, Thakoor Shantanu, Shefey Laurent, Qiao Siyuan, Gaba Meenu, Chang Shuo-yiin, Swanson Craig, Zhang Biao, Lee Benjamin, Rubenstein Paul Kishan, Song Gan, Kwiatkowski Tom, Koop Anna, Kannan Ajay, Kao David, Schuh Parker, Stjerngren Axel, Ghiasi Golnaz, Gibson Gena, Vilnis Luke, Yuan Ye, Ferreira Felipe Tiengo, Kamath Aishwarya, Klimenko Ted, Franko Ken, Xiao Kefan, Bhattacharya Indro, Patel Miteyan, Wang Rui, Morris Alex, Strudel Robin, Sharma Vivek, Choy Peter, Hashemi Sayed Hadi, Landon Jessica, Finkelstein Mara, Jhakra Priya, Frye Justin, Barnes Megan, Mauger Matthew, Daun Dennis, Baatarsukh Khuslen, Tung Matthew, Farhan Wael, Michalewski Henryk, Viola Fabio, Quitry Felix De Chaumont, Lan Charline Le, Hudson Tom, Wang Qingze, Fischer Felix, Zheng Ivy, White Elspeth, Dragan Anca, Alayrac Jean-baptiste, Ni Eric, Pritzel Alexander, Iwanicki Adam, Isard Michael, Bulanova Anna, Zilka Lukas, Dyer Ethan, Sachan Devendra, Srinivasan Srivatsan, Muckenhirn Hannah, Cai Honglong, Mandhane Amol, Tariq Mukarram, Rae Jack W., Wang Gary, Ayoub Kareem, Fitzgerald Nicholas, Zhao Yao, Han Woohyun, Alberti Chris, Garrette Dan, Krishnakumar Kashyap, Gimenez Mai, 
Levskaya Anselm, Sohn Daniel, Matak Josip, Iturrate Inaki, Chang Michael B., Xiang Jackie, Cao Yuan, Ranka Nishant, Brown Geoff, Hutter Adrian, Mirrokni Vahab, Chen Nanxin, Yao Kaisheng, Egyed Zoltan, Galilee Francois, Liechty Tyler, Kallakuri Praveen, Palmer Evan, Ghemawat Sanjay, Liu Jasmine, Tao David, Thornton Chloe, Green Tim, Jasarevic Mimi, Lin Sharon, Cotruta Victor, Tan Yi-xuan, Fiedel Noah, Yu Hongkun, Chi Ed, Neitz Alexander, Heitkaemper Jens, Sinha Anu, Zhou Denny, Sun Yi, Kaed Charbel, Hulse Brice, Mishra Swaroop, Georgaki Maria, Kudugunta Sneha, Farabet Clement, Shafran Izhak, Vlasic Daniel, Tsitsulin Anton, Ananthanarayanan Rajagopal, Carin Alen, Su Guolong, Sun Pei, V Shashank, Carvajal Gabriel, Broder Josef, Comsa Iulia, Repina Alena, Wong William, Chen Warren Weilun, Hawkins Peter, Filonov Egor, Loher Lucia, Hirnschall Christoph, Wang Weiyi, Ye Jingchen, Burns Andrea, Cate Hardie, Wright Diana Gage, Piccinini Federico, Zhang Lei, Lin Chu-cheng, Gog Ionel, Kulizhskaya Yana, Sreevatsa Ashwin, Song Shuang, Cobo Luis C., Iyer Anand, Tekur Chetan, Garrido Guillermo, Xiao Zhuyun, Kemp Rupert, Zheng Huaixiu Steven, Li Hui, Agarwal Ananth, Ngani Christel, Goshvadi Kati, Santamaria-fernandez Rebeca, Fica Wojciech, Chen Xinyun, Gorgolewski Chris, Sun Sean, Garg Roopal, Ye Xinyu, Eslami S. M. 
Ali, Hua Nan, Simon Jon, Joshi Pratik, Kim Yelin, Tenney Ian, Potluri Sahitya, Thiet Lam Nguyen, Yuan Quan, Luisier Florian, Chronopoulou Alexandra, Scellato Salvatore, Srinivasan Praveen, Chen Minmin, Koverkathu Vinod, Dalibard Valentin, Xu Yaming, Saeta Brennan, Anderson Keith, Sellam Thibault, Fernando Nick, Huot Fantine, Jung Junehyuk, Varadarajan Mani, Quinn Michael, Raul Amit, Le Maigo, Habalov Ruslan, Clark Jon, Jalan Komal, Bullard Kalesha, Singhal Achintya, Luong Thang, Wang Boyu, Rajayogam Sujeevan, Eisenschlos Julian, Jia Johnson, Finchelstein Daniel, Yakubovich Alex, Balle Daniel, Fink Michael, Agarwal Sameer, Li Jing, Dvijotham Dj, Pal Shalini, Kang Kai, Konzelmann Jaclyn, Beattie Jennifer, Dousse Olivier, Wu Diane, Crocker Remi, Elkind Chen, Jonnalagadda Siddhartha Reddy, Lee Jong, Holtmann-rice Dan, Kallarackal Krystal, Liu Rosanne, Vnukov Denis, Vats Neera, Invernizzi Luca, Jafari Mohsen, Zhou Huanjie, Taylor Lilly, Prendki Jennifer, Wu Marcus, Eccles Tom, Liu Tianqi, Kopparapu Kavya, Beaufays Francoise, Angermueller Christof, Marzoca Andreea, Sarcar Shourya, Dib Hilal, Stanway Jeff, Perbet Frank, Trdin Nejc, Sterneck Rachel, Khorlin Andrey, Li Dinghua, Wu Xihui, Goenka Sonam, Madras David, Goldshtein Sasha, Gierke Willi, Zhou Tong, Liu Yaxin, Liang Yannie, White Anais, Li Yunjie, Singh Shreya, Bahargam Sanaz, Epstein Mark, Basu Sujoy, Lao Li, Ozturel Adnan, Crous Carl, Zhai Alex, Lu Han, Tung Zora, Gaur Neeraj, Walton Alanna, Dixon Lucas, Zhang Ming, Globerson Amir, Uy Grant, Bolt Andrew, Wiles Olivia, Nasr Milad, Shumailov Ilia, Selvi Marco, Piccinno Francesco, Aguilar Ricardo, Mccarthy Sara, Khalman Misha, Shukla Mrinal, Galic Vlado, Carpenter John, Villela Kevin, Zhang Haibin, Richardson Harry, Martens James, Bosnjak Matko, Belle Shreyas Rammohan, Seibert Jeff, Alnahlawi Mahmoud, Mcwilliams Brian, Singh Sankalp, Louis Annie, Ding Wen, Popovici Dan, Simicich Lenin, Knight Laura, Mehta Pulkit, Gupta Nishesh, Shi Chongyang, Fatehi Saaber, Mitrovic 
Jovana, Grills Alex, Pagadora Joseph, Petrova Dessie, Eisenbud Danielle, Zhang Zhishuai, Yates Damion, Mittal Bhavishya, Tripuraneni Nilesh, Assael Yannis, Brovelli Thomas, Jain Prateek, Velimirovic Mihajlo, Akbulut Canfer, Mu Jiaqi, Macherey Wolfgang, Kumar Ravin, Xu Jun, Qureshi Haroon, Comanici Gheorghe, Wiesner Jeremy, Gong Zhitao, Ruddock Anton, Bauer Matthias, Felt Nick, Gp Anirudh, Arnab Anurag, Zelle Dustin, Rothfuss Jonas, Rosgen Bill, Shenoy Ashish, Seybold Bryan, Li Xinjian, Mudigonda Jayaram, Erdogan Goker, Xia Jiawei, Simsa Jiri, Michi Andrea, Yao Yi, Yew Christopher, Kan Steven, Caswell Isaac, Radebaugh Carey, Elisseeff Andre, Valenzuela Pedro, Mckinney Kay, Paterson Kim, Cui Albert, Latorre-chimoto Eri, Kim Solomon, Zeng William, Durden Ken, Ponnapalli Priya, Sosea Tiberiu, Choquette-choo Christopher A., Manyika James, Robenek Brona, Vashisht Harsha, Pereira Sebastien, Lam Hoi, Velic Marko, Owusu-afriyie Denese, Lee Katherine, Bolukbasi Tolga, Parrish Alicia, Lu Shawn, Park Jane, Venkatraman Balaji, Talbert Alice, Rosique Lambert, Cheng Yuchung, Sozanschi Andrei, Paszke Adam, Kumar Praveen, Austin Jessica, Li Lu, Salama Khalid, Kim Wooyeol, Dukkipati Nandita, Baryshnikov Anthony, Kaplanis Christos, Sheng Xianghai, Chervonyi Yuri, Unlu Caglar, Casas Diego De Las, Askham Harry, Tunyasuvunakool Kathryn, Gimeno Felix, Poder Siim, Kwak Chester, Miecnikowski Matt, Mirrokni Vahab, Dimitriev Alek, Parisi Aaron, Liu Dangyi, Tsai Tomy, Shevlane Toby, Kouridi Christina, Garmon Drew, Goedeckemeyer Adrian, Brown Adam R., Vijayakumar Anitha, Elqursh Ali, Jazayeri Sadegh, Huang Jin, Carthy Sara Mc, Hoover Jay, Kim Lucy, Kumar Sandeep, Chen Wei, Biles Courtney, Bingham Garrett, Rosen Evan, Wang Lisa, Tan Qijun, Engel David, Pongetti Francesco, De Cesare Dario, Hwang Dongseong, Yu Lily, Pullman Jennifer, Narayanan Srini, Levin Kyle, Gopal Siddharth, Li Megan, Aharoni Asaf, Trinh Trieu, Lo Jessica, Casagrande Norman, Vij Roopali, Matthey Loic, Ramadhana Bramandia, 
Matthews Austin, Carey Cj, Johnson Matthew, Goranova Kremena, Shah Rohin, Ashraf Shereen, Dasgupta Kingshuk, Larsen Rasmus, Wang Yicheng, Vuyyuru Manish Reddy, Jiang Chong, Ijazi Joana, Osawa Kazuki, Smith Celine, Boppana Ramya Sree, Bilal Taylan, Koizumi Yuma, Xu Ying, Altun Yasemin, Shabat Nir, Bariach Ben, Korchemniy Alex, Choo Kiam, Ronneberger Olaf, Iwuanyanwu Chimezie, Zhao Shubin, Soergel David, Hsieh Cho-jui, Cai Irene, Iqbal Shariq, Sundermeyer Martin, Chen Zhe, Bursztein Elie, Malaviya Chaitanya, Biadsy Fadi, Shroff Prakash, Dhillon Inderjit, Latkar Tejasi, Dyer Chris, Forbes Hannah, Nicosia Massimo, Nikolaev Vitaly, Greene Somer, Georgiev Marin, Wang Pidong, Martin Nina, Sedghi Hanie, Zhang John, Banzal Praseem, Fritz Doug, Rao Vikram, Wang Xuezhi, Zhang Jiageng, Patraucean Viorica, Du Dayou, Mordatch Igor, Jurin Ivan, Liu Lewis, Dubey Ayush, Mohan Abhi, Nowakowski Janek, Ion Vlad-doru, Wei Nan, Tojo Reiko, Raad Maria Abi, Hudson Drew A., Keshava Vaishakh, Agrawal Shubham, Ramirez Kevin, Wu Zhichun, Nguyen Hoang, Liu Ji, Sewak Madhavi, Petrini Bryce, Choi Donghyun, Philips Ivan, Wang Ziyue, Bica Ioana, Garg Ankush, Wilkiewicz Jarek, Agrawal Priyanka, Li Xiaowei, Guo Danhao, Xue Emily, Shaik Naseer, Leach Andrew, Khan Sadh Mnm, Wiesinger Julia, Jerome Sammy, Chakladar Abhishek, Wang Alek Wenjiao, Ornduff Tina, Abu Folake, Ghaffarkhah Alireza, Wainwright Marcus, Cortes Mario, Liu Frederick, Maynez Joshua, Terzis Andreas, Samangouei Pouya, Mansour Riham, Kępa Tomasz, Aubet François-xavier, Algymr Anton, Banica Dan, Weisz Agoston, Orban Andras, Senges Alexandre, Andrejczuk Ewa, Geller Mark, Santo Niccolo Dal, Anklin Valentin, Merey Majd Al, Baeuml Martin, Strohman Trevor, Bai Junwen, Petrov Slav, Wu Yonghui, Hassabis Demis, Kavukcuoglu Koray, Dean Jeffrey, Vinyals Oriol
- Gemma 2: Improving Open Language Models At A Practical Size Gemma Team, Riviere Morgane, Pathak Shreya, Sessa Pier Giuseppe, Hardin Cassidy, Bhupatiraju Surya, Hussenot Léonard, Mesnard Thomas, Shahriari Bobak, Ramé Alexandre, Ferret Johan, Liu Peter, Tafti Pouya, Friesen Abe, Casbon Michelle, Ramos Sabela, Kumar Ravin, Lan Charline Le, Jerome Sammy, Tsitsulin Anton, Vieillard Nino, Stanczyk Piotr, Girgin Sertan, Momchev Nikola, Hoffman Matt, Thakoor Shantanu, Grill Jean-bastien, Neyshabur Behnam, Bachem Olivier, Walton Alanna, Severyn Aliaksei, Parrish Alicia, Ahmad Aliya, Hutchison Allen, Abdagic Alvin, Carl Amanda, Shen Amy, Brock Andy, Coenen Andy, Laforge Anthony, Paterson Antonia, Bastian Ben, Piot Bilal, Wu Bo, Royal Brandon, Chen Charlie, Kumar Chintu, Perry Chris, Welty Chris, Choquette-choo Christopher A., Sinopalnikov Danila, Weinberger David, Vijaykumar Dimple, Rogozińska Dominika, Herbison Dustin, Bandy Elisa, Wang Emma, Noland Eric, Moreira Erica, Senter Evan, Eltyshev Evgenii, Visin Francesco, Rasskin Gabriel, Wei Gary, Cameron Glenn, Martins Gus, Hashemi Hadi, Klimczak-plucińska Hanna, Batra Harleen, Dhand Harsh, Nardini Ivan, Mein Jacinda, Zhou Jack, Svensson James, Stanway Jeff, Chan Jetha, Zhou Jin Peng, Carrasqueira Joana, Iljazi Joana, Becker Jocelyn, Fernandez Joe, Van Amersfoort Joost, Gordon Josh, Lipschultz Josh, Newlan Josh, Ji Ju-yeong, Mohamed Kareem, Badola Kartikeya, Black Kat, Millican Katie, Mcdonell Keelin, Nguyen Kelvin, Sodhia Kiranbir, Greene Kish, Sjoesund Lars Lowe, Usui Lauren, Sifre Laurent, Heuermann Lena, Lago Leticia, Mcnealus Lilly, Soares Livio Baldini, Kilpatrick Logan, Dixon Lucas, Martins Luciano, Reid Machel, Singh Manvinder, Iverson Mark, Görner Martin, Velloso Mat, Wirth Mateo, Davidow Matt, Miller Matt, Rahtz Matthew, Watson Matthew, Risdal Meg, Kazemi Mehran, Moynihan Michael, Zhang Ming, Kahng Minsuk, Park Minwoo, Rahman Mofi, Khatwani Mohit, Dao Natalie, Bardoliwalla Nenshad, Devanathan Nesh, Dumai Neta, 
Chauhan Nilay, Wahltinez Oscar, Botarda Pankil, Barnes Parker, Barham Paul, Michel Paul, Jin Pengchong, Georgiev Petko, Culliton Phil, Kuppala Pradeep, Comanescu Ramona, Merhej Ramona, Jana Reena, Rokni Reza Ardeshir, Agarwal Rishabh, Mullins Ryan, Saadat Samaneh, Carthy Sara Mc, Perrin Sarah, Arnold Sébastien M. R., Krause Sebastian, Dai Shengyang, Garg Shruti, Sheth Shruti, Ronstrom Sue, Chan Susan, Jordan Timothy, Yu Ting, Eccles Tom, Hennigan Tom, Kocisky Tomas, Doshi Tulsee, Jain Vihan, Yadav Vikas, Meshram Vilobh, Dharmadhikari Vishal, Barkley Warren, Wei Wei, Ye Wenming, Han Woohyun, Kwon Woosuk, Xu Xiang, Shen Zhe, Gong Zhitao, Wei Zichuan, Cotruta Victor, Kirk Phoebe, Rao Anand, Giang Minh, Peran Ludovic, Warkentin Tris, Collins Eli, Barral Joelle, Ghahramani Zoubin, Hadsell Raia, Sculley D., Banks Jeanine, Dragan Anca, Petrov Slav, Vinyals Oriol, Dean Jeff, Hassabis Demis, Kavukcuoglu Koray, Farabet Clement, Buchatskaya Elena, Borgeaud Sebastian, Fiedel Noah, Joulin Armand, Kenealy Kathleen, Dadashi Robert, Andreev Alek
- Can Large Language Models Learn Independent Causal Mechanisms? Gendron Gaël, Nguyen Bao Trung, Peng Alex Yuxuan, Witbrock Michael, Dobbie Gillian
- Breaking The Length Barrier: Llm-enhanced CTR Prediction In Long Textual User Behaviors Geng Binzong, Huan Zhaoxin, Zhang Xiaolu, He Yong, Zhang Liang, Yuan Fajie, Zhou Jun, Mo Linjian
- Why Not Transform Chat Large Language Models To Non-english? Geng Xiang, Zhu Ming, Li Jiahuan, Lai Zhejian, Zou Wei, She Shuaijie, Guo Jiaxin, Zhao Xiaofeng, Li Yinglu, Li Yuang, Su Chang, Zhao Yanqing, Lyu Xinglin, Zhang Min, Chen Jiajun, Yang Hao, Huang Shujian
- Beyond Self-learned Attention: Mitigating Attention Bias In Transformer-based Models Using Attention Guidance Gesi Jiri, Ahmed Iftekhar
- Jailbreaking Llms With Arabic Transliteration And Arabizi Ghanim Mansour Al, Almohaimeed Saleh, Zheng Mengxin, Solihin Yan, Lou Qian
- AI And Memory Wall Gholami Amir, Yao Zhewei, Kim Sehoon, Hooper Coleman, Mahoney Michael W., Keutzer Kurt
- Exploring The Frontier Of Vision-language Models: A Survey Of Current Methodologies And Future Directions Ghosh Akash, Acharya Arkadeep, Saha Sriparna, Jain Vinija, Chadha Aman
- Investigating The Translation Capabilities Of Large Language Models Trained On Parallel Data Only Gilabert Javier García, Escolano Carlos, Savall Aleix Sant, Fornaciari Francesca De Luca, Mash Audrey, Liao Xixian, Melero Maite
- Using Large Language Models To Enrich The Documentation Of Datasets For Machine Learning Giner-miguelez Joan, Gómez Abel, Cabot Jordi
- Agentquest: A Modular Benchmark Framework To Measure Progress And Improve LLM Agents Gioacchini Luca, Siracusano Giuseppe, Sanvito Davide, Gashteovski Kiril, Friede David, Bifulco Roberto, Lawrence Carolin
- Uncovering Hidden Intentions: Exploring Prompt Recovery For Deeper Insights Into Generated Texts Give Louis, Zaoral Timo, Bruno Maria Antonietta
- Blendsql: A Scalable Dialect For Unifying Hybrid Question Answering In Relational Algebra Glenn Parker, Dakle Parag Pravin, Wang Liang, Raghavan Preethi
- Chatglm: A Family Of Large Language Models From GLM-130B To GLM-4 All Tools Glm Team, Zeng Aohan, Xu Bin, Wang Bowen, Zhang Chenhui, Yin Da, Zhang Dan, Rojas Diego, Feng Guanyu, Zhao Hanlin, Lai Hanyu, Yu Hao, Wang Hongning, Sun Jiadai, Zhang Jiajie, Cheng Jiale, Gui Jiayi, Tang Jie, Zhang Jing, Sun Jingyu, Li Juanzi, Zhao Lei, Wu Lindong, Zhong Lucen, Liu Mingdao, Huang Minlie, Zhang Peng, Zheng Qinkai, Lu Rui, Duan Shuaiqi, Zhang Shudan, Cao Shulin, Yang Shuxun, Tam Weng Lam, Zhao Wenyi, Liu Xiao, Xia Xiao, Zhang Xiaohan, Gu Xiaotao, Lv Xin, Liu Xinghan, Liu Xinyi, Yang Xinyue, Song Xixuan, Zhang Xunkai, An Yifan, Xu Yifan, Niu Yilin, Yang Yuantao, Li Yueyan, Bai Yushi, Dong Yuxiao, Qi Zehan, Wang Zhaoyu, Yang Zhen, Du Zhengxiao, Hou Zhenyu, Wang Zihan
- Better & Faster Large Language Models Via Multi-token Prediction Gloeckle Fabian, Idrissi Badr Youbi, Rozière Baptiste, Lopez-paz David, Synnaeve Gabriel
- Zamba: A Compact 7B SSM Hybrid Model Glorioso Paolo, Anthony Quentin, Tokpanov Yury, Whittington James, Pilault Jonathan, Ibrahim Adam, Millidge Beren
- Language Guided Exploration For RL Agents In Text Environments Golchha Hitesh, Yerawar Sahil, Patel Dhruvesh, Dan Soham, Murugesan Keerthiram
- Extending Interactive Science Exhibits Into The Classroom Using Anthropomorphized Chatbots And Bloom's Taxonomy Golding Yousuf
- Tokenshap: Interpreting Large Language Models With Monte Carlo Shapley Value Estimation Goldshmidt Roni, Horovicz Miriam
- Goldfinch: High Performance Rwkv/transformer Hybrid With Linear Pre-fill And Extreme Kv-cache Compression Goldstein Daniel, Obeid Fares, Alcaide Eric, Song Guangyu, Cheah Eugene
- Does Chatgpt Have A Mind? Goldstein Simon, Levinstein Benjamin A.
- Mechanistic Interpretability Of Large Language Models With Applications To The Financial Services Industry Golgoon Ashkan, Filom Khashayar, Kannan Arjun Ravi
- Omnifusion Technical Report Goncharova Elizaveta, Razzhigaev Anton, Mikhalchuk Matvey, Kurkin Maxim, Abdullaeva Irina, Skripkin Matvey, Oseledets Ivan, Dimitrov Denis, Kuznetsov Andrey
- AST-T5: Structure-aware Pretraining For Code Generation And Understanding Gong Linyuan, Elhoushi Mostafa, Cheung Alvin
- The Advantages Of Context Specific Language Models: The Case Of The Erasmian Language Model Gonçalves João, Jelicic Nick, Murgia Michele, Stamhuis Evert
- Loose LIPS Sink Ships: Asking Questions In Battleship With Language-informed Program Sampling Grand Gabriel, Pepe Valerio, Andreas Jacob, Tenenbaum Joshua B.
- Assessing Generative Language Models In Classification Tasks: Performance And Self-evaluation Capabilities In The Environmental And Climate Change Domain Grasso Francesca, Locci Stefano
- Is Mamba Capable Of In-context Learning? Grazzi Riccardo, Siems Julien, Schrodi Simon, Brox Thomas, Hutter Frank
- Large Language Models And Linguistic Intentionality Grindrod Jumbly
- Transformers, Contextualism, And Polysemy Grindrod Jumbly
- A Survey On Responsible Generative AI: What To Generate And What Not Gu Jindong
- RWKV-CLIP: A Robust Vision-language Representation Learner Gu Tiancheng, Yang Kaicheng, An Xiang, Feng Ziyong, Liu Dongnan, Cai Weidong, Deng Jiankang
- Middleware For Llms: Tools Are Instrumental For Language Agents In Complex Environments Gu Yu, Shu Yiheng, Yu Hao, Liu Xiao, Dong Yuxiao, Tang Jie, Srinivasa Jayanth, Latapie Hugo, Su Yu
- Anah-v2: Scaling Analytical Hallucination Annotation Of Large Language Models Gu Yuzhe, Ji Ziwei, Zhang Wenwei, Lyu Chengqi, Lin Dahua, Chen Kai
- Diverse And Fine-grained Instruction-following Ability Exploration With Synthetic Data Gu Zihui, Sun Xingwu, Lian Fengzong, Kang Zhanhui, Xu Cheng-zhong, Fan Ju
- Probing The Robustness Of Vision-language Pretrained Models: A Multimodal Adversarial Attack Approach Guan Jiwei, Ding Tianyu, Cao Longbing, Pan Lei, Wang Chen, Zheng Xi
- APTQ: Attention-aware Post-training Mixed-precision Quantization For Large Language Models Guan Ziyi, Huang Hantao, Su Yupeng, Huang Hong, Wong Ngai, Yu Hao
- Look Before You Leap: Towards Decision-aware And Generalizable Tool-usage For Large Language Models Gui Anchun, Li Jian, Dai Yong, Du Nan, Xiao Han
- On The Interchangeability Of Positional Embeddings In Multilingual Neural Machine Translation Models Gumma Varun, Chitale Pranjal A., Bali Kalika
- Conversational Topic Recommendation In Counseling And Psychotherapy With Decision Transformer And Large Language Models Gunal Aylin, Lin Baihan, Bouneffouf Djallel
- Sample Design Engineering: An Empirical Study Of What Makes Good Downstream Fine-tuning Samples For Llms Guo Biyang, Wang He, Xiao Wenyilin, Chen Hong, Lee Zhuxin, Han Songqiao, Huang Hailiang
- A Novel Paradigm Boosting Translation Capabilities Of Large Language Models Guo Jiaxin, Yang Hao, Li Zongyao, Wei Daimeng, Shang Hengchao, Chen Xiaoyu
- Integrating Large Language Models With Graphical Session-based Recommendation Guo Naicheng, Cheng Hongwei, Liang Qianqiao, Chen Linxun, Han Bing
- SGSH: Stimulate Large Language Models With Skeleton Heuristics For Knowledge Base Question Generation Guo Shasha, Liao Lizi, Zhang Jing, Wang Yanling, Li Cuiping, Chen Hong
- Agent-simt: Agent-assisted Simultaneous Machine Translation With Large Language Models Guo Shoutao, Zhang Shaolei, Ma Zhengrui, Zhang Min, Feng Yang
- Sillm: Large Language Models For Simultaneous Machine Translation Guo Shoutao, Zhang Shaolei, Ma Zhengrui, Zhang Min, Feng Yang
- Ds-agent: Automated Data Science By Empowering Large Language Models With Case-based Reasoning Guo Siyuan, Deng Cheng, Wen Ying, Chen Hechang, Chang Yi, Wang Jun
- Fine-tuning Large Language Models For Stock Return Prediction Using Newsflow Guo Tian, Hauptmann Emmanuel
- Efficient Continual Pre-training By Mitigating The Stability Gap Guo Yiduo, Fu Jie, Zhang Huishuai, Zhao Dongyan, Shen Yikang
- GPTQT: Quantize Large Language Models Twice To Push The Efficiency Guo Yipin, Lang Yilin, Ren Qinyuan
- API Pack: A Massive Multi-programming Language Dataset For API Call Generation Guo Zhen, Soria Adriana Meza, Sun Wei, Shen Yikang, Panda Rameswar
- Stabletoolbench: Towards Stable Large-scale Benchmarking On Tool Learning Of Large Language Models Guo Zhicheng, Cheng Sijie, Wang Hao, Liang Shihao, Qin Yujia, Li Peng, Liu Zhiyuan, Sun Maosong, Liu Yang
- Dependency-aware Semi-structured Sparsity: Declining Roles Of Outliers In Pruning Glu-based Llms Guo Zhiyu, Kamigaito Hidetaka, Watanabe Taro
- Attention Score Is Not All You Need For Token Importance Indicator In KV Cache Reduction: Value Also Matters Guo Zhiyu, Kamigaito Hidetaka, Watanabe Taro
- METAREFLECTION: Learning Instructions For Language Agents Using Past Reflections Gupta Priyanshu, Kirtania Shashank, Singha Ananya, Gulwani Sumit, Radhakrishna Arjun, Shi Sherry, Soares Gustavo
- Multilingual Large Language Models And Curse Of Multilinguality Gurgurov Daniil, Bäumel Tanja, Anikina Tatiana
- Clochat: Understanding How People Customize, Interact, And Experience Personas In Large Language Models Ha Juhye, Jeon Hyeon, Han Daeun, Seo Jinwook, Oh Changhoon
- METAL: Towards Multilingual Meta-evaluation Hada Rishav, Gumma Varun, Ahmed Mohamed, Bali Kalika, Sitaram Sunayana
- Would I Lie To You? Inference Time Alignment Of Language Models Using Direct Preference Heads Hadji-kyriacou Avelina Asada, Arandjelovic Ognjen
- Looking Into Black Box Code Language Models Haider Muhammad Umair, Farooq Umar, Siddique A. B., Marron Mark
- Covert Malicious Finetuning: Challenges In Safeguarding LLM Adaptation Halawi Danny, Wei Alexander, Wallace Eric, Wang Tony T., Haghtalab Nika, Steinhardt Jacob
- PECC: Problem Extraction And Coding Challenges Haller Patrick, Golde Jonas, Akbik Alan
- Mamba State-space Models Can Be Strong Downstream Learners Halloran John T., Gulati Manbir, Roysdon Paul F.
- Small Language Model Can Self-correct Han Haixia, Liang Jiaqing, Shi Jie, He Qianyu, Xiao Yanghua
- RECIPE4U: Student-chatgpt Interaction Dataset In EFL Writing Education Han Jieun, Yoo Haneul, Myung Junho, Kim Minsun, Lee Tak Yeon, Ahn So-yeon, Oh Alice
- Transfer Learning For Text Diffusion Models Han Kehang, Kenealy Kathleen, Barua Aditya, Fiedel Noah, Constant Noah
- Chatgpt Based Data Augmentation For Improved Parameter-efficient Debiasing Of Llms Han Pengrui, Kocielnik Rafal, Saravanan Adhithya, Jiang Roy, Sharir Or, Anandkumar Anima
- Enhancing CTR Prediction Through Sequential Recommendation Pre-training: Introducing The SRP4CTR Framework Han Ruidong, Li Qianzhong, Jiang He, Li Rui, Zhao Yurou, Li Xiang, Lin Wei
- Wildguard: Open One-stop Moderation Tools For Safety Risks, Jailbreaks, And Refusals Of Llms Han Seungju, Rao Kavel, Ettinger Allyson, Jiang Liwei, Lin Bill Yuchen, Lambert Nathan, Choi Yejin, Dziri Nouha
- Value Augmented Sampling For Language Model Alignment And Personalization Han Seungwook, Shenfeld Idan, Srivastava Akash, Kim Yoon, Agrawal Pulkit
- COCO Is "ALL" You Need For Visual Instruction Fine-tuning Han Xiaotian, Wang Yiqi, Zhai Bohan, You Quanzeng, Yang Hongxia
- EMMA: Your Text-to-image Diffusion Model Can Secretly Accept Multi-modal Prompts Han Yucheng, Wang Rui, Zhang Chi, Hu Juntao, Cheng Pei, Fu Bin, Zhang Hanwang
- Rethinking Large Language Model Architectures For Sequential Recommendations Hanbing Wang, Xiaorui Liu, Wenqi Fan, Xiangyu Zhao, Venkataramana Kini, Devendra Yadav, Fei Wang, Zhen Wen, Jiliang Tang, Hui Liu
- A Survey On Hallucination In Large Vision-language Models Hanchao Liu, Wenyuan Xue, Yifei Chen, Dapeng Chen, Xiutian Zhao, Ke Wang, Liping Hou, Rongjun Li, Wei Peng
- Jailbreaking Proprietary Large Language Models Using Word Substitution Cipher Handa Divij, Chirmule Advait, Gajera Bimal, Baral Chitta
- Actionreasoningbench: Reasoning About Actions With And Without Ramification Constraints Handa Divij, Dolin Pavel, Kumbhar Shrinidhi, Baral Chitta, Son Tran Cao
- Vision-and-language Navigation Generative Pretrained Transformer Hanlin Wen
- LLM Reasoners: New Evaluation, Library, And Analysis Of Step-by-step Reasoning With Large Language Models Hao Shibo, Gu Yi, Luo Haotian, Liu Tianyang, Shao Xiyan, Wang Xinyuan, Xie Shuhua, Ma Haodi, Samavedhi Adithya, Gao Qiyue, Wang Zhen, Hu Zhiting
- Learning To Rewrite: Generalized Llm-generated Text Detection Hao Wei, Li Ran, Zhao Weiliang, Yang Junfeng, Mao Chengzhi
- Exploring Chatgpt And Its Impact On Society Haque Md. Asraful, Li Shuai
- Astprompter: Weakly Supervised Automated Language Model Red-teaming To Identify Likely Toxic Prompts Hardy Amelia F., Liu Houjun, Lange Bernard, Kochenderfer Mykel J.
- Measuring Psychological Depth In Language Models Harel-canada Fabrice, Zhou Hanyu, Mupalla Sreya, Yildiz Zeynep, Sahai Amit, Peng Nanyun
- A Comparison Of Large Language Model And Human Performance On Random Number Generation Tasks Harrison Rachel M.
- Do Large Language Models Speak All Languages Equally? A Comparative Study In Low-resource Settings Hasan Md. Arid, Tarannum Prerona, Dey Krishno, Razzak Imran, Naseem Usman
- The Unreasonable Effectiveness Of Easy Training Data For Hard Tasks Hase Peter, Bansal Mohit, Clark Peter, Wiegreffe Sarah
- A Comprehensive Overview Of Large Language Models (llms) For Cyber Defences: Opportunities And Directions Hassanin Mohammed, Moustafa Nour
- Relative Value Biases In Large Language Models Hayes William M., Yax Nicolas, Palminteri Stefano
- Olympiadbench: A Challenging Benchmark For Promoting AGI With Olympiad-level Bilingual Multimodal Scientific Problems He Chaoqun, Luo Renjie, Bai Yuzhuo, Hu Shengding, Thai Zhen Leng, Shen Junhao, Hu Jinyi, Han Xu, Huang Yujie, Zhang Yuxiang, Liu Jie, Qi Lei, Liu Zhiyuan, Sun Maosong
- A Law Of Next-token Prediction In Large Language Models He Hangfeng, Su Weijie J.
- Webvoyager: Building An End-to-end Web Agent With Large Multimodal Models He Hongliang, Yao Wenlin, Ma Kaixin, Yu Wenhao, Dai Yong, Zhang Hongming, Lan Zhenzhong, Yu Dong
- Multi-modal Instruction Tuned Llms With Fine-grained Visual Perception He Junwen, Wang Yifan, Wang Lijun, Lu Huchuan, He Jun-yan, Lan Jin-peng, Luo Bin, Xie Xuansong
- Prompt-based Bias Calibration For Better Zero/few-shot Learning Of Language Models He Kang, Long Yinghan, Roy Kaushik
- Pitvqa: Image-grounded Text Embedding LLM For Visual Question Answering In Pituitary Surgery He Runlong, Xu Mengya, Das Adrito, Khan Danyal Z., Bano Sophia, Marcus Hani J., Stoyanov Danail, Clarkson Matthew J., Islam Mobarakol
- What Matters In Transformers? Not All Attention Is Needed He Shwai, Sun Guoheng, Shen Zheyu, Li Ang
- Prompting Chatgpt For Translation: A Comparative Analysis Of Translation Brief And Persona Prompts He Sui
- Transferring Troubles: Cross-lingual Transferability Of Backdoor Attacks In Llms With Instruction Tuning He Xuanli, Wang Jun, Xu Qiongkai, Minervini Pasquale, Stenetorp Pontus, Rubinstein Benjamin I. P., Cohn Trevor
- Language Models As Hierarchy Encoders He Yuan, Yuan Zhangdie, Chen Jiaoyan, Horrocks Ian
- BP4ER: Bootstrap Prompting For Explicit Reasoning In Medical Dialogue Generation He Yuhong, Zhang Yongqi, He Shizhu, Wan Jun
- Fortifying Ethical Boundaries In AI: Advanced Strategies For Enhancing Security In Large Language Models He Yunhong, Qiu Jianling, Zhang Wei, Yuan Zhengqing
- Camelot: Towards Large Language Models With Training-free Consolidated Associative Memory He Zexue, Karlinsky Leonid, Kim Donghyun, Mcauley Julian, Krotov Dmitry, Feris Rogerio
- Quality Of Answers Of Generative Large Language Models Vs Peer Patients For Interpreting Lab Test Results For Lay Patients: Evaluation Study He Zhe, Bhasuran Balu, Jin Qiao, Tian Shubo, Hanna Karim, Shavor Cindy, Arguello Lisbeth Garcia, Murray Patrick, Lu Zhiyong
- HMT: Hierarchical Memory Transformer For Long Context Language Processing He Zifan, Qin Zongyue, Prakriya Neha, Sun Yizhou, Cong Jason
- PERSOMA: Personalized Soft Prompt Adapter Architecture For Personalized Language Prompting Hebert Liam, Sayana Krishna, Jash Ambarish, Karatzoglou Alexandros, Sodhi Sukhdeep, Doddapaneni Sumanth, Cai Yanli, Kuzmin Dima
- Evaluation Of The Programming Skills Of Large Language Models Heitz Luc Bryan, Chamas Joun, Scherb Christopher
- Intent-conditioned And Non-toxic Counterspeech Generation Using Multi-task Instruction Tuning With RLAIF Hengle Amey, Kumar Aswini, Singh Sahajpreet, Bandhakavi Anil, Akhtar Md Shad, Chakroborty Tanmoy
- Can Large Language Models Make The Grade? An Empirical Study Evaluating Llms Ability To Mark Short Answer Questions In K-12 Education Henkel Owen, Boxer Adam, Hills Libby, Roberts Bill
- Generative Visual Instruction Tuning Hernandez Jefferson, Villegas Ruben, Ordonez Vicente
- Lilium: Ebay's Large Language Models For E-commerce Herold Christian, Kozielski Michael, Ekimov Leonid, Petrushkov Pavel, Vandenbussche Pierre-yves, Khadivi Shahram
- The Effect Of Model Size On LLM Post-hoc Explainability Via LIME Heyen Henning, Widdicombe Amy, Siegel Noah Y., Perez-ortiz Maria, Treleaven Philip
- Evaluation Of Instruction-following Ability For Large Language Models On Story-ending Generation Hida Rem, Ohmura Junki, Sekiya Toshiyuki
- Eliciting Personality Traits In Large Language Models Hilliard Airlie, Munoz Cristian, Wu Zekun, Koshiyama Adriano Soares
- Super Tiny Language Models Hillier Dylan, Guertler Leon, Tan Cheston, Agrawal Palaash, Ruirui Chen, Cheng Bobby
- Why Do Llava Vision-language Models Reply To Images In English? Hinck Musashi, Holtermann Carolin, Olson Matthew Lyle, Schneider Florian, Yu Sungduk, Bhiwandiwalla Anahita, Lauscher Anne, Tseng Shaoyen, Lal Vasudev
- Large Language Models Aren't All That You Need Holla Kiran Voderhobli, Kumar Chaithanya, Singh Aryan
- Llm-aided Compilation For Tensor Accelerators Hong Charles, Bhatia Sahil, Haan Altan, Dong Shengjun Kris, Nikiforov Dima, Cheung Alvin, Shao Yakun Sophia
- Towards Building A Robust Knowledge Intensive Question Answering Model With Large Language Models Hong Xingyun, Shao Yan, Wang Zhilin, Duan Manni, Xiongnan Jin
- Outcome-constrained Large Language Models For Countering Hate Speech Hong Lingzi, Luo Pengcheng, Blanco Eduardo, Song Xiaoying
- Free To Play: UN Trade And Development's Experience With Developing Its Own Open-source Retrieval Augmented Generation Large Language Model Application Hopp Daniel
- Large Language Models For Expansion Of Spoken Language Understanding Systems To New Languages Hoscilowicz Jakub, Pawlowski Pawel, Skorupa Marcin, Sowański Marcin, Janicki Artur
- Non-linear Inference Time Intervention: Improving LLM Truthfulness Hoscilowicz Jakub, Wiacek Adam, Chojnacki Jan, Cieslak Adam, Michon Leszek, Urbanevych Vitalii, Janicki Artur
- Securing Vision-language Models With A Robust Encoder Against Jailbreak And Adversarial Attacks Hossain Md Zarif, Imteaj Ahmed
- Visual Prompting For Generalized Few-shot Segmentation: A Multi-scale Approach Hossain Mir Rayat Imtiaz, Siam Mennatullah, Sigal Leonid, Little James J.
- Efficient Solutions For An Intriguing Failure Of Llms: Long Context Window Does Not Mean Llms Can Analyze Long Sequences Flawlessly Hosseini Peyman, Castro Ignacio, Ghinassi Iacopo, Purver Matthew
- Retrieval-augmented Code Completion For Local Projects Using Large Language Models Hostnik Marko, Robnik-šikonja Marko
- Large Language Models As Misleading Assistants In Conversation Hou Betty Li, Shi Kejian, Phang Jason, Aung James, Adler Steven, Campbell Rosie
- Enhancing And Accelerating Large Language Models Via Instruction-aware Contextual Compression Hou Haowen, Ma Fei, Bai Binwen, Zhu Xinxin, Yu Fei
- A Systematic Evaluation Of Large Language Models For Generating Programming Code Hou Wenpin, Ji Zhicheng
- Bridging Language And Items For Retrieval And Recommendation Hou Yupeng, Li Jiacheng, He Zhankui, Yan An, Chen Xiusi, Mcauley Julian
- Large Language Model-powered Chatbots For Internationalizing Student Support In Higher Education Hsain Achraf, Housni Hamza El
- Found In The Middle: Calibrating Positional Attention Bias Improves Long Context Utilization Hsieh Cheng-yu, Chuang Yung-sung, Li Chun-liang, Wang Zifeng, Le Long T., Kumar Abhishek, Glass James, Ratner Alexander, Lee Chen-yu, Krishna Ranjay, Pfister Tomas
- Wilke: Wise-layer Knowledge Editor For Lifelong Knowledge Editing Hu Chenhui, Cao Pengfei, Chen Yubo, Liu Kang, Zhao Jun
- Rankprompt: Step-by-step Comparisons Make Language Models Better Reasoners Hu Chi, Ge Yuan, Ma Xiangnan, Cao Hang, Li Qiang, Yang Yonghua, Xiao Tong, Zhu Jingbo
- Teaching Language Models To Self-improve By Learning From Language Feedback Hu Chi, Hu Yimin, Cao Hang, Xiao Tong, Zhu Jingbo
- Multimodal Chain-of-thought Reasoning Via Chatgpt To Protect Children From Age-inappropriate Apps Hu Chuanbo, Liu Bin, Yin Minglei, Zhou Yilu, Li Xin
- Lightweight Modality Adaptation To Sequential Recommendation Via Correlation Supervision Hu Hengchang, Liu Qijiong, Li Chuang, Kan Min-yen
- Outlier-efficient Hopfield Layers For Large Transformer-based Models Hu Jerry Yao-chieh, Chang Pei-hsuan, Luo Robin, Chen Hong-yu, Li Weijian, Wang Wei-po, Liu Han
- Openrlhf: An Easy-to-use, Scalable And High-performance RLHF Framework Hu Jian, Wu Xibin, Wang Weixun, Xianyu, Zhang Dehao, Cao Yu
- Agentgen: Enhancing Planning Abilities For Large Language Model Based Agent Via Environment And Task Generation Hu Mengkang, Zhao Pu, Xu Can, Sun Qingfeng, Lou Jianguang, Lin Qingwei, Luo Ping, Rajmohan Saravan, Zhang Dongmei
- Mixed Sparsity Training: Achieving 4× FLOP Reduction For Transformer Pretraining Hu Pihe, Li Shaolong, Huang Longbo
- A Survey On Large Language Model-based Game Agents Hu Sihao, Huang Tiansheng, Ilhan Fatih, Tekin Selim, Liu Gaowen, Kompella Ramana, Liu Ling
- Matryoshka Query Transformer For Large Vision-language Models Hu Wenbo, Dou Zi-yi, Li Liunian Harold, Kamath Amita, Peng Nanyun, Chang Kai-wei
- Generative Pretrained Structured Transformers: Unsupervised Syntactic Language Models At Scale Hu Xiang, Ji Pengyu, Zhu Qingyang, Wu Wei, Tu Kewei
- Themis: Towards Flexible And Interpretable NLG Evaluation Hu Xinyu, Lin Li, Gao Mingqi, Yin Xunjian, Wan Xiaojun
- Case-based Or Rule-based: How Do Transformers Do The Math? Hu Yi, Tang Xiaojuan, Yang Haotong, Zhang Muhan
- Listen Again And Choose The Right Answer: A New Paradigm For Automatic Speech Recognition With Large Language Models Hu Yuchen, Chen Chen, Qin Chengwei, Zhu Qiushi, Chng Eng Siong, Li Ruizhe
- Can Perplexity Reflect Large Language Model's Ability In Long Text Understanding? Hu Yutong, Huang Quzhe, Tao Mingxu, Zhang Chen, Feng Yansong
- Exact And Efficient Unlearning For Large Language Model-based Recommendation Hu Zhiyu, Zhang Yang, Xiao Minghao, Wang Wenjie, Feng Fuli, He Xiangnan
- Longrecipe: Recipe For Efficient Long Context Generalization In Large Language Models Hu Zhiyuan, Liu Yuliang, Zhao Jinman, Wang Suyuchen, Wang Yan, Shen Wei, Gu Qing, Luu Anh Tuan, Ng See-kiong, Jiang Zhiwei, Hooi Bryan
- An Intermediate Fusion Vit Enables Efficient Text-image Alignment In Diffusion Models Hu Zizhao, Jia Shaochong, Rostami Mohammad
- Talk Less, Interact Better: Evaluating In-context Conversational Adaptation In Multimodal Llms Hua Yilun, Artzi Yoav
- Applying And Evaluating Large Language Models In Mental Health Care: A Scoping Review Of Human-assessed Generative Tasks Hua Yining, Na Hongbin, Li Zehan, Liu Fenglin, Fang Xiao, Clifton David, Torous John
- Multimodal Task Vectors Enable Many-shot Multimodal In-context Learning Huang Brandon, Mitra Chancharik, Arbelle Assaf, Karlinsky Leonid, Darrell Trevor, Herzig Roei
- Investigating Decoder-only Large Language Models For Speech-to-text Translation Huang Chao-wei, Lu Hui, Gong Hongyu, Inaguma Hirofumi, Kulikov Ilia, Mavlyutov Ruslan, Popuri Sravya
- Pokergpt: An End-to-end Lightweight Solver For Multi-player Texas Hold'em Via Large Language Model Huang Chenghao, Cao Yanbo, Wen Yinlong, Zhou Tao, Zhang Yanru
- Training Language Models To Generate Text With Citations Via Fine-grained Rewards Huang Chengyu, Wu Zeqiu, Hu Yushi, Wang Wenya
- Recurrent Context Compression: Efficiently Expanding The Context Window Of LLM Huang Chensen, Zhu Guibo, Wang Xuepeng, Luo Yifei, Ge Guojing, Chen Haoran, Yi Dong, Wang Jinqiao
- Generating Educational Materials With Different Levels Of Readability Using Llms Huang Chieh-yang, Wei Jing, Huang Ting-hao 'kenneth'
- LITA: Language Instructed Temporal-localization Assistant Huang De-an, Liao Shijia, Radhakrishnan Subhashree, Yin Hongxu, Molchanov Pavlo, Yu Zhiding, Kautz Jan
- Chatgpt Rates Natural Language Explanation Quality Like Humans: But On Which Scales? Huang Fan, Kwak Haewoon, Park Kunwoo, An Jisun
- A Survey On Evaluation Of Multimodal Large Language Models Huang Jiaxing, Zhang Jingyi
- Integrating Multi-scale Contextualized Information For Byte-based Neural Machine Translation Huang Langlin, Feng Yang
- Learning Fine-grained Grounded Citations For Attributed Large Language Models Huang Lei, Feng Xiaocheng, Ma Weitao, Gu Yuxuan, Zhong Weihong, Feng Xiachong, Yu Weijiang, Peng Weihua, Tang Duyu, Tu Dandan, Qin Bing
- The Solution For The 5th GCAIAC Zero-shot Referring Expression Comprehension Challenge Huang Longfei, Yu Feng, Guan Zhihao, Wan Zhonghua, Yang Yang
- Brainchat: Decoding Semantic Information From Fmri Using Vision-language Pretrained Models Huang Wanaiu
- Visual Hallucinations Of Multi-modal Large Language Models Huang Wen, Liu Hongbin, Guo Minxin, Gong Neil Zhenqiang
- Ml-mamba: Efficient Multi-modal Large Language Model Utilizing Mamba-2 Huang Wenjun, Pan Jiakai, Tang Jiahao, Ding Yanyu, Xing Yifei, Wang Yuhe, Wang Zhengzhuo, Hu Jianguo
- Key-point-driven Data Synthesis With Its Enhancement On Mathematical Reasoning Huang Yiming, Liu Xiao, Gong Yeyun, Gou Zhibin, Shen Yelong, Duan Nan, Chen Weizhu
- Leveraging Large Language Models For Enhanced NLP Task Performance Through Knowledge Distillation And Optimized Training Strategies Huang Yining, Tang Keke, Chen Meilian
- A Comprehensive Survey On Evaluating Large Language Model Applications In The Medical Industry Huang Yining, Tang Keke, Chen Meilian, Wang Boyuan
- Trustllm: Trustworthiness In Large Language Models Huang Yue, Sun Lichao, Wang Haoran, Wu Siyuan, Zhang Qihui, Li Yuan, Gao Chujie, Huang Yixin, Lyu Wenhan, Zhang Yixuan, Li Xiner, Liu Zhengliang, Liu Yixin, Wang Yijue, Zhang Zhikun, Vidgen Bertie, Kailkhura Bhavya, Xiong Caiming, Xiao Chaowei, Li Chunyuan, Xing Eric, Huang Furong, Liu Hao, Ji Heng, Wang Hongyi, Zhang Huan, Yao Huaxiu, Kellis Manolis, Zitnik Marinka, Jiang Meng, Bansal Mohit, Zou James, Pei Jian, Liu Jian, Gao Jianfeng, Han Jiawei, Zhao Jieyu, Tang Jiliang, Wang Jindong, Vanschoren Joaquin, Mitchell John, Shu Kai, Xu Kaidi, Chang Kai-wei, He Lifang, Huang Lifu, Backes Michael, Gong Neil Zhenqiang, Yu Philip S., Chen Pin-yu, Gu Quanquan, Xu Ran, Ying Rex, Ji Shuiwang, Jana Suman, Chen Tianlong, Liu Tianming, Zhou Tianyi, Wang William, Li Xiang, Zhang Xiangliang, Wang Xiao, Xie Xing, Chen Xun, Wang Xuyu, Liu Yan, Ye Yanfang, Cao Yinzhi, Chen Yong, Zhao Yue
- Obscureprompt: Jailbreaking Large Language Models Via Obscure Input Huang Yue, Tang Jingyu, Chen Dongping, Tang Bingda, Wan Yao, Sun Lichao, Zhang Xiangliang
- Fastfid: Improve Inference Efficiency Of Open Domain Question Answering Via Sentence Selection Huang Yufei, Han Xu, Sun Maosong
- Ecr-chain: Advancing Generative Language Models To Better Emotion-cause Reasoners Through Reasoning Chains Huang Zhaopei, Zhao Jinming, Jin Qin
- Olympicarena: Benchmarking Multi-discipline Cognitive Reasoning For Superintelligent AI Huang Zhen, Wang Zengzhi, Xia Shijie, Li Xuefeng, Zou Haoyang, Xu Ruijie, Fan Run-ze, Ye Lyumanshan, Chern Ethan, Ye Yixin, Zhang Yikai, Yang Yuqing, Wu Ting, Wang Binjie, Sun Shichao, Xiao Yang, Li Yiyuan, Zhou Fan, Chern Steffi, Qin Yiwei, Ma Yan, Su Jiadi, Liu Yixiu, Zheng Yuxiang, Zhang Shaoting, Lin Dahua, Qiao Yu, Liu Pengfei
- Accelerating Pre-training Of Multimodal Llms Via Chain-of-sight Huang Ziyuan, Ji Kaixiang, Gong Biao, Qing Zhiwu, Zhang Qinglong, Zheng Kecheng, Wang Jian, Chen Jingdong, Yang Ming
- Towards A Psychology Of Machines: Large Language Models Predict Human Memory Huff Markus, Ulakçı Elanur
- HFT: Half Fine-tuning For Large Language Models Hui Tingfeng, Zhang Zhenyu, Wang Shuohuan, Xu Weiran, Sun Yu, Wu Hua
- Federated Recommendation Via Hybrid Retrieval Augmented Generation Huimin Zeng, Zhenrui Yue, Qian Jiang, Dong Wang
- The Promise And Challenges Of Using Llms To Accelerate The Screening Process Of Systematic Reviews Huotala Aleksi, Kuutila Miikka, Ralph Paul, Mäntylä Mika
- Vietnamese Poem Generation & The Prospect Of Cross-language Poem-to-poem Translation Huynh Triet Minh, Bao Quan Le
- Transformerfam: Feedback Attention Is Working Memory Hwang Dongseong, Wang Weiran, Huo Zhuoyuan, Sim Khe Chai, Mengibar Pedro Moreno
- Pc-lora: Low-rank Adaptation For Progressive Model Compression With Knowledge Distillation Hwang Injoon, Park Haewon, Lee Youngwan, Yang Jooyoung, Maeng Sunjae
- Assessing Llms Suitability For Knowledge Graph Completion Iga Vasile Ionut Remus, Silaghi Gheorghe Cosmin
- Test-time Low Rank Adaptation Via Confidence Maximization For Zero-shot Generalization Of Vision-language Models Imam Raza, Gani Hanan, Huzaifa Muhammad, Nandakumar Karthik
- Hallucinations Or Attention Misdirection? The Path To Strategic Value Extraction In Business Using Large Language Models Ioste Aline
- Scaling Laws For Downstream Task Performance Of Large Language Models Isik Berivan, Ponomareva Natalia, Hazimeh Hussein, Paparas Dimitris, Vassilvitskii Sergei, Koyejo Sanmi
- Autocompletion Of Chief Complaints In The Electronic Health Records Using Large Language Models Islam K M Sajjadul, Nipu Ayesha Siddika, Madiraju Praveen, Deshpande Priya
- Are Large Vision Language Models Up To The Challenge Of Chart Comprehension And Reasoning? An Extensive Investigation Into The Capabilities And Limitations Of Lvlms Islam Mohammed Saidul, Rahman Raian, Masry Ahmed, Laskar Md Tahmid Rahman, Nayeem Mir Tafseer, Hoque Enamul
- Response Generation For Cognitive Behavioral Therapy With Large Language Models: Comparative Study With Socratic Questioning Izumi Kenta, Tanaka Hiroki, Shidara Kazuhiro, Adachi Hiroyoshi, Kanayama Daisuke, Kudo Takashi, Nakamura Satoshi
- Evaluating The Application Of Large Language Models To Generate Feedback In Programming Education Jacobs Sven, Jaschke Steffen
- Ffn-skipllm: A Hidden Gem For Autoregressive Decoding With Adaptive Feed Forward Skipping Jaiswal Ajay, Hu Bodun, Yin Lu, Ro Yeonju, Liu Shiwei, Chen Tianlong, Akella Aditya
- Jamba-1.5: Hybrid Transformer-mamba Models At Scale Jamba Team, Lenz Barak, Arazi Alan, Bergman Amir, Manevich Avshalom, Peleg Barak, Aviram Ben, Almagor Chen, Fridman Clara, Padnos Dan, Gissin Daniel, Jannai Daniel, Muhlgay Dor, Zimberg Dor, Gerber Edden M, Dolev Elad, Krakovsky Eran, Safahi Erez, Schwartz Erez, Cohen Gal, Shachaf Gal, Rozenblum Haim, Bata Hofit, Blass Ido, Magar Inbal, Dalmedigos Itay, Osin Jhonathan, Fadlon Julie, Rozman Maria, Danos Matan, Gokhman Michael, Zusman Mor, Gidron Naama, Ratner Nir, Gat Noam, Rozen Noam, Fried Oded, Leshno Ohad, Antverg Omer, Abend Omri, Lieber Opher, Dagan Or, Cohavi Orit, Alon Raz, Belson Ro'i, Cohen Roi, Gilad Rom, Glozman Roman, Lev Shahar, Meirom Shaked, Delbari Tal, Ness Tal, Asida Tomer, Gal Tom Ben, Braude Tom, Pumerantz Uriya, Cohen Yehoshua, Belinkov Yonatan, Globerson Yuval, Levy Yuval Peleg, Shoham Yoav
- KIT-19: A Comprehensive Korean Instruction Toolkit On 19 Tasks For Fine-tuning Korean Large Language Models Jang Dongjun, Byun Sungjoo, Jo Hyemi, Shin Hyopil
- Rectifying Demonstration Shortcut In In-context Learning Jang Joonwon, Jang Sanghwan, Kweon Wonbin, Jeon Minjin, Yu Hwanjo
- Evaluating Students' Open-ended Written Responses With Llms: Using The RAG Framework For GPT-3.5, GPT-4, Claude-3, And Mistral-large Jauhiainen Jussi S., Guerra Agustín Garagorry
- Are Human Conversations Special? A Large Language Model Perspective Jawale Toshish, Animesh Chaitanya, Vallath Sekhar, Talamadupula Kartik, Heck Larry
- Performance Of Recent Large Language Models For A Low-resourced Language Jayakody Ravindu, Dias Gihan
- Precision Empowers, Excess Distracts: Visual Question Answering With Dynamically Infused Knowledge In Language Models Jhalani Manas, M Annervaz K, Bhattacharyya Pushpak
- Towards Flexible Evaluation For Generative Visual Question Answering Ji Huishan, Si Qingyi, Lin Zheng, Wang Weiping
- Aligner: Efficient Alignment By Learning To Correct Ji Jiaming, Chen Boyuan, Lou Hantao, Hong Donghai, Zhang Borong, Pan Xuehai, Dai Juntao, Qiu Tianyi, Yang Yaodong
- Can't Say Cant? Measuring And Reasoning Of Dark Jargons In Large Language Models Ji Xu, Zhang Jianyi, Zhou Ziyin, Zhao Zhangchi, Qiao Qianqian, Han Kaiying, Hossen Md Imran, Hei Xiali
- ANAH: Analytical Annotation Of Hallucinations In Large Language Models Ji Ziwei, Gu Yuzhe, Zhang Wenwei, Lyu Chengqi, Lin Dahua, Chen Kai
- Linking In-context Learning In Transformers To Human Episodic Memory Ji-an Li, Zhou Corey Y., Benna Marcus K., Mattar Marcelo G.
- Simulbench: Evaluating Language Models With Creative Simulation Tasks Jia Qi, Yue Xiang, Zheng Tianyu, Huang Jie, Lin Bill Yuchen
- Heuristic-enhanced Candidates Selection Strategy For Gpts Tackle Few-shot Aspect-based Sentiment Analysis Jiang Baoxing, Wan Yujie, Ju Shenggen
- Llm-collaboration On Automatic Science Journalism For The General Audience Jiang Gongyao, Shi Xinran, Luo Qiong
- Time Matters: Enhancing Pre-trained News Recommendation Models With Robust User Dwell Time Injection Jiang Hao, Li Chuanzhen, An Mingxiao
- Minference 1.0: Accelerating Pre-filling For Long-context Llms Via Dynamic Sparse Attention Jiang Huiqiang, Li Yucheng, Zhang Chengruidong, Wu Qianhui, Luo Xufang, Ahn Surin, Han Zhenhua, Abdi Amir H., Li Dongsheng, Lin Chin-yew, Yang Yuqing, Qiu Lili
- MC²: Multi-concept Guidance For Customized Multi-concept Generation Jiang Jiaxiu, Zhang Yabo, Feng Kailai, Wu Xiaohe, Zuo Wangmeng
- Can Large Language Models Generate High-quality Patent Claims? Jiang Lekang, Zhang Caiqi, Scherz Pascal A, Goetz Stephan
- Detecting Scams Using Large Language Models Jiang Liming
- Enhancing Robustness In Large Language Models: Prompting For Mitigating The Impact Of Irrelevant Information Jiang Ming, Huang Tingting, Guo Biao, Lu Yao, Zhang Feng
- Investigating Data Contamination For Pre-training Language Models Jiang Minhao, Liu Ken Ziyu, Zhong Ming, Schaeffer Rylan, Ouyang Siru, Han Jiawei, Koyejo Sanmi
- Sketch: A Toolkit For Streamlining LLM Operations Jiang Xin, Li Xiang, Ma Wenjia, Fang Xuezhi, Yao Yiqun, Yu Naitong, Meng Xuying, Han Peng, Li Jing, Sun Aixin, Wang Yequan
- Large Visual-language Models Are Also Good Classifiers: A Study Of In-context Multimodal Fake News Detection Jiang Ye, Wang Yimin
- Do Llms Dream Of Elephants (when Told Not To)? Latent Concept Association And Associative Memory In Transformers Jiang Yibo, Rajendran Goutham, Ravikumar Pradeep, Aragam Bryon
- Prior Knowledge Integration Via LLM Encoding And Pseudo Event Regulation For Video Moment Retrieval Jiang Yiyang, Zhang Wengyu, Zhang Xulu, Wei Xiaoyong, Chen Chang Wen, Li Qing
- Learning To Edit: Aligning Llms With Knowledge Editing Jiang Yuxin, Wang Yufei, Wu Chuhan, Zhong Wanjun, Zeng Xingshan, Gao Jiahui, Li Liangyou, Jiang Xin, Shang Lifeng, Tang Ruiming, Liu Qun, Wang Wei
- Learning Planning-based Reasoning By Trajectories Collection And Process Reward Synthesizing Jiao Fangkai, Qin Chengwei, Liu Zhengyuan, Chen Nancy F., Joty Shafiq
- Gradable Chatgpt Translation Evaluation Jiao Hui, Peng Bei, Zong Lu, Zhang Xiaojun, Li Xinwei
- Prompt-based Length Controlled Generation With Multiple Control Types Jie Renlong, Meng Xiaojun, Shang Lifeng, Jiang Xin, Liu Qun
- Zero-shot Chain-of-thought Reasoning Guided By Evolutionary Algorithms In Large Language Models Jin Feihu, Liu Yifan, Tan Ying
- GUARD: Role-playing To Generate Natural-language Jailbreakings To Test Guideline Adherence Of Large Language Models Jin Haibo, Chen Ruoxi, Zhou Andy, Zhang Yang, Wang Haohan
- LLM Maybe Longlm: Self-extend LLM Context Window Without Tuning Jin Hongye, Han Xiaotian, Yang Jingfeng, Jiang Zhimeng, Liu Zirui, Chang Chia-yuan, Chen Huiyuan, Hu Xia
- Crimson: Empowering Strategic Reasoning In Cybersecurity Through Large Language Models Jin Jiandong, Tang Bowen, Ma Mingxuan, Liu Xiao, Wang Yunfei, Lai Qingnan, Yang Jia, Zhou Changling
- Can Chatgpt Support Developers? An Empirical Evaluation Of Large Language Models For Code Generation Jin Kailun, Wang Chung-yu, Pham Hung Viet, Hemmati Hadi
- Hidden Flaws Behind Expert-level Accuracy Of Multimodal GPT-4 Vision In Medicine Jin Qiao, Chen Fangyuan, Zhou Yiliang, Xu Ziyang, Cheung Justin M., Chen Robert, Summers Ronald M., Rousseau Justin F., Ni Peiyun, Landsman Marc J, Baxter Sally L., Al'aref Subhi J., Li Yijia, Chen Alex, Brejt Josef A., Chiang Michael F., Peng Yifan, Lu Zhiyong
- Self-selected Attention Span For Accelerating Large Language Model Inference Jin Tian, Yazar Wanzin, Xu Zifei, Sharify Sayeh, Wang Xin
- Video-lavit: Unified Video-language Pre-training With Decoupled Visual-motional Tokenization Jin Yang, Sun Zhicheng, Xu Kun, Xu Kun, Chen Liwei, Jiang Hao, Huang Quzhe, Song Chengru, Liu Yuliang, Zhang Di, Song Yang, Gai Kun, Mu Yadong
- Read To Play (r2-play): Decision Transformer With Multimodal Game Instruction Jin Yonggang, Zhang Ge, Zhao Hao, Zheng Tianyu, Guo Jarvi, Xiang Liuyu, Yue Shawn, Huang Stephen W., He Zhaofeng, Fu Jie
- Cutting Off The Head Ends The Conflict: A Mechanism For Interpreting And Mitigating Knowledge Conflicts In Language Models Jin Zhuoran, Cao Pengfei, Yuan Hongbang, Chen Yubo, Xu Jiexin, Li Huaijun, Jiang Xiaojian, Liu Kang, Zhao Jun
- A2SF: Accumulative Attention Scoring With Forgetting Factor For Token Pruning In Transformer Decoder Jo Hyun-rae, Shin Dongkun
- SMART: Automatically Scaling Down Language Models With Accuracy Guarantees For Reduced Processing Fees Jo Saehan, Trummer Immanuel
- Llm-based Open-domain Integrated Task And Knowledge Assistants With Programmable Policies Joshi Harshit, Liu Shicheng, Chen James, Weigle Robert, Lam Monica S.
- How Large Language Models Encode Context Knowledge? A Layer-wise Probing Study Ju Tianjie, Sun Weiwei, Du Wei, Yuan Xinwei, Ren Zhaochun, Liu Gongshen
- Logical Closed Loop: Uncovering Object Hallucinations In Large Vision-language Models Junfei Wu, Qiang Liu, Ding Wang, Jinghao Zhang, Shu Wu, Liang Wang, Tieniu Tan
- Trust Or Escalate: LLM Judges With Provable Guarantees For Human Agreement Jung Jaehun, Brahman Faeze, Choi Yejin
- Information-theoretic Distillation For Reference-less Summarization Jung Jaehun, Lu Ximing, Jiang Liwei, Brahman Faeze, West Peter, Koh Pang Wei, Choi Yejin
- Pmoe: Progressive Mixture Of Experts With Asymmetric Transformer For Continual Learning Jung Min Jae, Kim Joohee
- LARA: Linguistic-adaptive Retrieval-augmented Llms For Multi-turn Intent Classification Junhua Liu, Keat Tan Yong, Bin Fu
- Exploring The Latest Llms For Leaderboard Extraction Kabongo Salomon, D'souza Jennifer, Auer Sören
- A Survey On Hardware Accelerators For Large Language Models Kachris Christoforos
- Evaluating The Impact Of Advanced LLM Techniques On Ai-lecture Tutors For A Robotics Course Kahl Sebastian, Löffler Felix, Maciol Martin, Ridder Fabian, Schmitz Marius, Spanagel Jennifer, Wienkamp Jens, Burgahn Christopher, Schilling Malte
- Using Large Language Models To Assess Tutors' Performance In Reacting To Students Making Math Errors Kakarla Sanjit, Thomas Danielle, Lin Jionghao, Gupta Shivang, Koedinger Kenneth R.
- Scaling Laws For Forgetting When Fine-tuning Large Language Models Kalajdzievski Damjan
- Understanding The Effect Of Using Semantically Meaningful Tokens For Visual Representation Learning Kalibhat Neha, Kattakinda Priyatham, Zarei Arman, Seleznev Nikita, Sharpe Samuel, Kumar Senthil, Feizi Soheil
- Fiddler: CPU-GPU Orchestration For Fast Inference Of Mixture-of-experts Models Kamahori Keisuke, Gu Yile, Zhu Kan, Kasikci Baris
- Eagle: Ethical Dataset Given From Real Interactions Kaneko Masahiro, Bollegala Danushka, Baldwin Timothy
- Prompt-rag: Pioneering Vector Embedding-free Retrieval-augmented Generation In Niche Domains, Exemplified By Korean Medicine Kang Bongsu, Kim Jundong, Yun Tae-rim, Kim Chang-eop
- Mindstar: Enhancing Math Reasoning In Pre-trained Llms At Inference Time Kang Jikun, Li Xin Zhe, Chen Xi, Kazemi Amirreza, Sun Qianyi, Chen Boxing, Li Dong, He Xu, He Quan, Wen Feng, Hao Jianye, Yao Jun
- C-RAG: Certified Generation Risks For Retrieval-augmented Language Models Kang Mintong, Gürel Nezihe Merve, Yu Ning, Song Dawn, Li Bo
- Solving For X And Beyond: Can Large Language Models Solve Complex Math Problems With More-than-two Unknowns? Kao Kuei-chun, Wang Ruochen, Hsieh Cho-jui
- Omniact: A Dataset And Benchmark For Enabling Multimodal Generalist Autonomous Agents For Desktop And Web Kapoor Raghav, Butala Yash Parag, Russak Melisa, Koh Jing Yu, Kamble Kiran, Alshikh Waseem, Salakhutdinov Ruslan
- Teaching Machines To Code: Smart Contract Translation With Llms Karanjai Rabimba, Xu Lei, Shi Weidong
- Can Reinforcement Learning Unlock The Hidden Dangers In Aligned Large Language Models? Karkevandi Mohammad Bahrami, Vishwamitra Nishant, Najafirad Peyman
- Evaluating Interventional Reasoning Capabilities Of Large Language Models Kasetty Tejas, Mahajan Divyat, Dziugaite Gintare Karolina, Drouin Alexandre, Sridhar Dhanya
- Beyond Traditional Benchmarks: Analyzing Behaviors Of Open Llms On Data-to-text Generation Kasner Zdeněk, Dušek Ondřej
- Comparing Plausibility Estimates In Base And Instruction-tuned Large Language Models Kauf Carina, Chersoni Emmanuele, Lenci Alessandro, Fedorenko Evelina, Ivanova Anna A.
- From Text To Transformation: A Comprehensive Review Of Large Language Models' Versatility Kaur Pravneet, Kashyap Gautam Siddharth, Kumar Ankit, Nafis Md Tabrez, Kumar Sandeep, Shokeen Vikrant
- Spectra: A Comprehensive Study Of Ternary, Quantized, And FP16 Language Models Kaushal Ayush, Pandey Tejas, Vaidhya Tejas, Bhagat Aaryan, Rish Irina
- Defending Large Language Models Against Attacks With Residual Stream Activation Analysis Kawasaki Amelia, Davis Andrew, Abbas Houssam
- Codeaid: Evaluating A Classroom Deployment Of An Llm-based Programming Assistant That Balances Student And Educator Needs Kazemitabaar Majeed, Ye Runlong, Wang Xiaoning, Henley Austin Z., Denny Paul, Craig Michelle, Grossman Tovi
- Exploring The Frontiers Of Llms In Psychological Applications: A Comprehensive Review Ke Luoma, Tong Song, Cheng Peng, Peng Kaiping
- Development And Testing Of Retrieval Augmented Generation In Large Language Models -- A Case Study Report Ke Yuhe, Jin Liyuan, Elangovan Kabilan, Abdullah Hairil Rizal, Liu Nan, Sia Alex Tiong Heng, Soh Chai Rick, Tung Joshua Yi Min, Ong Jasmine Chiat Ling, Ting Daniel Shu Wei
- Abdelhak At Semeval-2024 Task 9: Decoding Brainteasers, The Efficacy Of Dedicated Models Versus Chatgpt Kelious Abdelhak, Okirim Mounir
- Visiongpt: Vision-language Understanding Agent Using Generalized Multimodal Framework Kelly Chris, Hu Luhui, Yang Bang, Tian Yu, Yang Deshun, Yang Cindy, Huang Zaoshan, Li Zihao, Hu Jiayin, Zou Yuexian
- Autonomous Prompt Engineering In Large Language Models Kepel Daan, Valogianni Konstantina
- Harmonic Llms Are Trustworthy Kersting Nicholas S., Rahman Mohammad, Vedala Suchismitha, Wang Yang
- Source-aware Training Enables Knowledge Attribution In Language Models Khalifa Muhammad, Wadden David, Strubell Emma, Lee Honglak, Wang Lu, Beltagy Iz, Peng Hao
- Reinforcement Of Explainability Of Chatgpt Prompts By Embedding Breast Cancer Self-screening Rules Into AI Responses Khan Yousef, Hamed Ahmed Abdeen
- Consistency And Uncertainty: Identifying Unreliable Responses From Black-box Vision-language Models For Selective Visual Question Answering Khan Zaid, Fu Yun
- Do Moral Judgment And Reasoning Capability Of Llms Change With Language? A Study Using The Multilingual Defining Issues Test Khandelwal Aditi, Agarwal Utkarsh, Tanmay Kumar, Choudhury Monojit
- Iterative Prompt Refinement For Radiation Oncology Symptom Extraction Using Teacher-student Large Language Models Khanmohammadi Reza, Ghanem Ahmed I, Verdecchia Kyle, Hall Ryan, Elshaikh Mohamed, Movsas Benjamin, Bagher-ebadian Hassan, Chetty Indrin, Ghassemi Mohammad M., Thind Kundan
- ARGS: Alignment As Reward-guided Search Khanov Maxim, Burapacheep Jirayu, Li Yixuan
- Efficient Medical Question Answering With Knowledge-augmented Question Generation Khlaut Julien, Dancette Corentin, Ferreres Elodie, Bennani Alaedine, Hérent Paul, Manceron Pierre
- Beyond Code Generation: An Observational Study Of Chatgpt Usage In Software Engineering Practice Khojah Ranim, Mohamad Mazen, Leitner Philipp, Neto Francisco Gomes De Oliveira
- Llms In The Loop: Leveraging Large Language Model Annotations For Active Learning In Low-resource Languages Kholodna Nataliia, Julka Sahib, Khodadadi Mohammad, Gumus Muhammed Nurullah, Granitzer Michael
- Why And When Llm-based Assistants Can Go Wrong: Investigating The Effectiveness Of Prompt-based Interactions For Software Help-seeking Khurana Anjali, Subramonyam Hari, Chilana Parmit K
- Shortened Llama: Depth Pruning For Large Language Models With Comparison Of Retraining Methods Kim Bo-kyeong, Kim Geonmin, Kim Tae-ho, Castells Thibault, Choi Shinkook, Shin Junho, Song Hyoung-kyu
- Mhgpt: A Lightweight Generative Pre-trained Transformer For Mental Health Text Analysis Kim Dae-young, Hwa Rebecca, Rahman Muhammad Mahbubur
- A Mechanistic Interpretation Of Syllogistic Reasoning In Auto-regressive Language Models Kim Geonhee, Valentino Marco, Freitas André
- On Initializing Transformers With Pre-trained Embeddings Kim Ha Young, Balasubramanian Niranjan, Kang Byungkon
- Small Language Models Learn Enhanced Reasoning Skills From Medical Textbooks Kim Hyunjae, Hwang Hyeon, Lee Jiwoo, Park Sihyeon, Kim Dain, Lee Taewhoo, Yoon Chanwoong, Sohn Jiwoong, Choi Donghee, Kang Jaewoo
- Finer: Investigating And Enhancing Fine-grained Visual Concept Recognition In Large Vision Language Models Kim Jeonghwan, Ji Heng
- Strategic Data Ordering: Enhancing Large Language Model Performance Through Curriculum Learning Kim Jisu, Lee Juhwan
- Persona Is A Double-edged Sword: Enhancing The Zero-shot Reasoning By Ensembling The Role-playing And Neutral Prompts Kim Junseok, Yang Nakyeong, Jung Kyomin
- Advisorqa: Towards Helpful And Harmless Advice-seeking Question Answering With Collective Intelligence Kim Minbeom, Lee Hwanhee, Park Joonsuk, Lee Hwaran, Jung Kyomin
- Augmenting Query And Passage For Retrieval-augmented Generation Using Llms For Open-domain Question Answering Kim Minsang, Park Cheoneum, Baek Seungjun
- Automatic Jailbreaking Of The Text-to-image Generative AI Systems Kim Minseon, Lee Hyomin, Gong Boqing, Zhang Huishuai, Hwang Sung Ju
- Is GPT-4 Alone Sufficient For Automated Essay Scoring?: A Comparative Judgment Approach Based On Rater Cognition Kim Seungju, Jo Meounggun
- Prometheus 2: An Open Source Language Model Specialized In Evaluating Other Language Models Kim Seungone, Suk Juyoung, Longpre Shayne, Lin Bill Yuchen, Shin Jamin, Welleck Sean, Neubig Graham, Lee Moontae, Lee Kyungjae, Seo Minjoon
- Can Language Models Evaluate Human Written Text? Case Study On Korean Student Writing For Education Kim Seungyoon, Kim Seungone
- Llmem: Estimating GPU Memory Usage For Fine-tuning Pre-trained Llms Kim Taeho, Wang Yanming, Chaturvedi Vatshank, Gupta Lokesh, Kim Seyeon, Kwon Yongin, Ha Sangtae
- A Survey On Integration Of Large Language Models With Intelligent Robots Kim Yeseung, Kim Dohyun, Choi Jieun, Park Jisang, Oh Nayoung, Park Daehyung
- Tablevqa-bench: A Visual Question Answering Benchmark On Multiple Table Domains Kim Yoonsik, Yim Moonbin, Song Ka Yeon
- Health-llm: Large Language Models For Health Prediction Via Wearable Sensor Data Kim Yubin, Xu Xuhai, Mcduff Daniel, Breazeal Cynthia, Park Hae Won
- Medexqa: Medical Question Answering Benchmark With Multiple Explanations Kim Yunsoo, Wu Jinge, Abdulle Yusuf, Wu Honghan
- Evaluating Telugu Proficiency In Large Language Models: A Comparative Analysis Of Chatgpt And Gemini Kishore Katikela Sreeharsha, Shaik Rahimanuddin
- Unveiling Divergent Inductive Biases Of Llms On Temporal Data Kishore Sindhu, He Hangfeng
- Extended Mind Transformers Klett Phoebe, Ahle Thomas
- Recmind: Japanese Movie Recommendation Dialogue With Seeker's Internal State Kodama Takashi, Kiyomaru Hirokazu, Huang Yin Jou, Kurohashi Sadao
- Cleared For Takeoff? Compositional & Conditional Reasoning May Be The Achilles Heel To (flight-booking) Language Agents Kohli Harsh, Sun Huan
- Chatspamdetector: Leveraging Large Language Models For Effective Phishing Email Detection Koide Takashi, Fukushi Naoki, Nakano Hiroki, Chiba Daiki
- Self-prompt Tuning: Enable Autonomous Role-playing In Llms Kong Aobo, Zhao Shiwan, Chen Hao, Li Qicheng, Qin Yong, Sun Ruiqi, Zhou Xin, Zhou Jiaming, Sun Haoqin
- Labeling Supervised Fine-tuning Data With The Scaling Law Kong Huanjun
- Audio Flamingo: A Novel Audio Language Model With Few-shot Learning And Dialogue Abilities Kong Zhifeng, Goel Arushi, Badlani Rohan, Ping Wei, Valle Rafael, Catanzaro Bryan
- Proptest: Automatic Property Testing For Improved Visual Programming Koo Jaywon, Yang Ziyan, Cascante-bonilla Paola, Ray Baishakhi, Ordonez Vicente
- Bitune: Bidirectional Instruction-tuning Kopiczko Dawid J., Blankevoort Tijmen, Asano Yuki M.
- Large Language Model Driven Recommendation Korikov Anton, Sanner Scott, Deldjoo Yashar, He Zhankui, Mcauley Julian, Ramisa Arnau, Vidal Rene, Sathiamoorthy Mahesh, Kasrizadeh Atoosa, Milano Silvia, Ricci Francesco
- Transllama: Llm-based Simultaneous Translation System Koshkin Roman, Sudoh Katsuhito, Nakamura Satoshi
- Llms Are Zero-shot Context-aware Simultaneous Translators Koshkin Roman, Sudoh Katsuhito, Nakamura Satoshi
- Uncertainty-aware Evaluation For Vision-language Models Kostumov Vasily, Nutfullin Bulat, Pilipenko Oleg, Ilyushin Eugene
- Hints-in-browser: Benchmarking Language Models For Programming Feedback Generation Kotalwar Nachiket, Gotovos Alkis, Singla Adish
- Arabiangpt: Native Arabic Gpt-based Large Language Model Koubaa Anis, Ammar Adel, Ghouti Lahouari, Najar Omar, Sibaee Serry
- Evaluating Language Models For Generating And Judging Programming Feedback Koutcheme Charles, Dainese Nicola, Hellas Arto, Sarsa Sami, Leinonen Juho, Ashraf Syed, Denny Paul
- Open Source Language Models Can Provide Feedback: Evaluating Llms' Ability To Help Students Using Gpt-4-as-a-judge Koutcheme Charles, Dainese Nicola, Sarsa Sami, Hellas Arto, Leinonen Juho, Denny Paul
- From Data To Commonsense Reasoning: The Use Of Large Language Models For Explainable AI Krause Stefanie, Stolzenburg Frieder
- Can Large Language Models Explore In-context? Krishnamurthy Akshay, Harris Keegan, Foster Dylan J., Zhang Cyril, Slivkins Aleksandrs
- Reinforcement Learning For Optimizing RAG For Domain Chatbots Kulkarni Mandar, Tangarajan Praveen, Kim Kyung, Trivedi Anusua
- Confidence Under The Hood: An Investigation Into The Confidence-probability Alignment In Large Language Models Kumar Abhishek, Morabito Robert, Umbet Sanzhar, Kabbara Jad, Emami Ali
- Automatic Question-answer Generation For Long-tail Knowledge Kumar Rohan, Kim Youngmin, Ravi Sunitha, Sun Haitian, Faloutsos Christos, Salakhutdinov Ruslan, Yoon Minji
- Harmonizing Code-mixed Conversations: Personality-assisted Code-mixed Response Generation In Dialogues Kumar Shivani, Chakraborty Tanmoy
- Babilong: Testing The Limits Of Llms With Long Context Reasoning-in-a-haystack Kuratov Yuri, Bulatov Aydar, Anokhin Petr, Rodkin Ivan, Sorokin Dmitry, Sorokin Artyom, Burtsev Mikhail
- Mt-eval: A Multi-turn Capabilities Evaluation Benchmark For Large Language Models Kwan Wai-chung, Zeng Xingshan, Jiang Yuxin, Wang Yufei, Li Liangyou, Shang Lifeng, Jiang Xin, Liu Qun, Wong Kam-fai
- Evaluating Cultural Adaptability Of A Large Language Model Via Simulation Of Synthetic Personas Kwok Louis, Bravansky Michal, Griffin Lewis D.
- Are Llms Effective Negotiators? Systematic Evaluation Of The Multifaceted Capabilities Of Llms In Negotiation Dialogues Kwon Deuksin, Weiss Emily, Kulshrestha Tara, Chawla Kushal, Lucas Gale M., Gratch Jonathan
- BIPED: Pedagogically Informed Tutoring System For ESL Education Kwon Soonwoo, Kim Sojung, Park Minju, Lee Seunghyun, Kim Kyuseok
- Enhancing Multi-domain Automatic Short Answer Grading Through An Explainable Neuro-symbolic Pipeline Künnecke Felix, Filighera Anna, Leong Colin, Steuer Tim
- RES-Q: Evaluating Code-editing Large Language Model Systems At The Repository Scale Labash Beck, Rosedale August, Reents Alex, Negritto Lucas, Wiel Colin
- Autowebglm: Bootstrap And Reinforce A Large Language Model-based Web Navigating Agent Lai Hanyu, Liu Xiao, Iong Iat Long, Yao Shuntian, Chen Yuxuan, Shen Pengbo, Yu Hao, Zhang Hanchen, Zhang Xiaohan, Dong Yuxiao, Tang Jie
- Alarm: Align Language Models Via Hierarchical Rewards Modeling Lai Yuhang, Wang Siyuan, Liu Shujun, Huang Xuanjing, Wei Zhongyu
- Residual-based Language Models Are Free Boosters For Biomedical Imaging Lai Zhixin, Wu Jing, Chen Suiyao, Zhou Yucheng, Hovakimyan Naira
- Lapdoc: Layout-aware Prompting For Documents Lamott Marcel, Weweler Yves-noel, Ulges Adrian, Shafait Faisal, Krechel Dirk, Obradovic Darko
- FOCUS: Forging Originality Through Contrastive Use In Self-plagiarism For Language Models Lan Kaixin, Fang Tao, Wong Derek F., Xu Yabo, Chao Lidia S., Zhao Cecilia G.
- Criticeval: Evaluating Large Language Model As Critic Lan Tian, Zhang Wenwei, Xu Chen, Huang Heyan, Lin Dahua, Chen Kai, Mao Xian-ling
- Large Language Models As Evolution Strategies Lange Robert Tjarko, Tian Yingtao, Tang Yujin
- Enhancing Neural Machine Translation Of Low-resource Languages: Corpus Development, Human Evaluation And Explainable AI Architectures Lankford Séamus
- Human Evaluation Of English-Irish Transformer-based NMT Lankford Séamus, Afli Haithem, Way Andy
- Adaptnmt: An Open-source, Language-agnostic Development Environment For Neural Machine Translation Lankford Séamus, Afli Haithem, Way Andy
- Transformers For Low-resource Languages: Is Féidir Linn! Lankford Séamus, Afli Haithem, Way Andy
- A Systematic Survey And Critical Review On Evaluating Large Language Models: Challenges, Limitations, And Recommendations Laskar Md Tahmid Rahman, Alqahtani Sawsan, Bari M Saiful, Rahman Mizanur, Khan Mohammad Abdullah Matin, Khan Haidar, Jahan Israt, Bhuiyan Amran, Tan Chee Wei, Parvez Md Rizwan, Hoque Enamul, Joty Shafiq, Huang Jimmy
- Query-opt: Optimizing Inference Of Large Language Models Via Multi-query Instructions In Meeting Summarization Laskar Md Tahmid Rahman, Khasanova Elena, Fu Xue-yong, Chen Cheng, Tn Shashi Bhushan
- Melting Point: Mobile Evaluation Of Language Transformers Laskaridis Stefanos, Katevas Kleomenis, Minto Lorenzo, Haddadi Hamed
- What Matters When Building Vision-language Models? Laurençon Hugo, Tronchon Léo, Cord Matthieu, Sanh Victor
- Frontier AI Ethics: Anticipating And Evaluating The Societal Impacts Of Generative Agents Lazar Seth
- Benchmarks Underestimate The Readiness Of Multi-lingual Dialogue Agents Lee Andrew H., Semnani Sina J., Castillo-lópez Galo, De Chalendar Gaël, Choudhury Monojit, Dua Ashna, Kavitha Kapil Rajesh, Kim Sungkyun, Kodali Prashant, Kumaraguru Ponnurangam, Lombard Alexis, Moradshahi Mehrad, Park Gihyun, Semmar Nasredine, Seo Jiwon, Shen Tianhao, Shrivastava Manish, Xiong Deyi, Lam Monica S.
- Trol: Traversal Of Layers For Large Language And Vision Models Lee Byung-kwan, Chung Sangyun, Kim Chae Won, Park Beomchan, Ro Yong Man
- Nv-embed: Improved Techniques For Training Llms As Generalist Embedding Models Lee Chankyu, Roy Rajarshi, Xu Mengyao, Raiman Jonathan, Shoeybi Mohammad, Catanzaro Bryan, Ping Wei
- MATTER: Memory-augmented Transformer Using Heterogeneous Knowledge Sources Lee Dongkyu, Prakash Chandana Satya, Fitzgerald Jack, Lehmann Jens
- Hip Attention: Sparse Sub-quadratic Attention With Hierarchical Attention Pruning Lee Heejun, Park Geon, Lee Youngwan, Kim Jina, Jeong Wonyoung, Jeon Myeongjae, Hwang Sung Ju
- A Survey Of Large Language Models In Finance (finllms) Lee Jean, Stevens Nicholas, Han Soyeon Caren, Song Minseok
- Plagbench: Exploring The Duality Of Large Language Models In Plagiarism Generation And Detection Lee Jooyoung, Agrawal Toshini, Uchendu Adaku, Le Thai, Chen Jinghui, Lee Dongwon
- LRQ: Optimizing Post-training Quantization For Large Language Models By Learning Low-rank Weight-scaling Matrices Lee Jung Hyun, Kim Jeonghoon, Yang June Yong, Kwon Se Jung, Yang Eunho, Yoo Kang Min, Lee Dongsoo
- Multimodal Reasoning With Multimodal Knowledge Graph Lee Junlin, Wang Yequan, Li Jing, Zhang Min
- Visual Question Answering Instruction: Unlocking Multimodal Large Language Model To Domain-specific Visual Multitasks Lee Jusung, Cha Sungguk, Lee Younghyun, Yang Cheoljong
- Probability Of Differentiation Reveals Brittleness Of Homogeneity Bias In Large Language Models Lee Messi H. J., Lai Calvin K.
- Prompting Large Language Models For Zero-shot Essay Scoring Via Multi-trait Specialization Lee Sanwoo, Cai Yida, Meng Desong, Wang Ziyang, Wu Yunfang
- Effective And Efficient Conversation Retrieval For Dialogue State Tracking With Implicit Text Summaries Lee Seanie, Cheng Jianpeng, Driesen Joris, Coca Alexandru, Johannsen Anders
- Aligning To Thousands Of Preferences Via System Message Generalization Lee Seongyun, Park Sue Hyun, Kim Seungone, Seo Minjoon
- COCOA: Cbt-based Conversational Counseling Agent Using Memory Specialized In Cognitive Distortions And Dynamic Prompt Lee Suyeon, Kang Jieun, Kim Harim, Chung Kyoung-mee, Lee Dongha, Yeo Jinyoung
- Llava-docent: Instruction Tuning With Multimodal Large Language Model To Support Art Appreciation Education Lee Unggi, Jeon Minji, Lee Yunseo, Byun Gyuri, Son Yoorim, Shin Jaeyoon, Ko Hongkyu, Kim Hyeoncheol
- Infinigen: Efficient Generative Inference Of Large Language Models With Dynamic KV Cache Management Lee Wonbeom, Lee Jungi, Seo Junghwan, Sim Jaewoong
- Large Language Models Produce Responses Perceived To Be Empathic Lee Yoon Kyung, Suh Jina, Zhan Hongli, Li Junyi Jessy, Ong Desmond C.
- Towards Understanding Counseling Conversations: Domain Knowledge And Large Language Models Lee Younghun, Goldwasser Dan, Reese Laura Schwab
- Chatgpt As Research Scientist: Probing Gpt's Capabilities As A Research Librarian, Research Ethicist, Data Generator And Data Predictor Lehr Steven A., Caliskan Aylin, Liyanage Suneragiri, Banaji Mahzarin R.
- Autocoder: Enhancing Code Large Language Model With AIEV-Instruct Lei Bin, Li Yuchen, Chen Qiuwu
- Small-e: Small Language Model With Linear Attention For Efficient Speech Synthesis Lemerle Théodor, Obin Nicolas, Roebel Axel
- Using Hallucinations To Bypass Gpt4's Filter Lemkin Benjamin
- Using Counterfactual Tasks To Evaluate The Generality Of Analogical Reasoning In Large Language Models Lewis Martha, Mitchell Melanie
- Cascade Reward Sampling For Efficient Decoding-time Alignment Li Bolian, Wang Yifan, Grama Ananth, Zhang Ruqi
- Are Bigger Encoders Always Better In Vision Large Models? Li Bozhou, Liang Hao, Meng Zimo, Zhang Wentao
- Concentrate Attention: Towards Domain-generalizable Prompt Optimization For Language Models Li Chengzhengxu, Liu Xiaoming, Zhang Zhaohan, Wang Yichen, Liu Chen, Lan Yu, Shen Chao
- X-instruction: Aligning Language Model In Low-resource Languages With Self-curated Cross-lingual Instructions Li Chong, Yang Wen, Zhang Jiajun, Lu Jinliang, Wang Shaonan, Zong Chengqing
- Incorporating External Knowledge And Goal Guidance For Llm-based Conversational Recommender Systems Li Chuang, Deng Yang, Hu Hengchang, Kan Min-yen, Li Haizhou
- Searchlvlms: A Plug-and-play Framework For Augmenting Large Vision-language Models By Searching Up-to-date Internet Knowledge Li Chuanhao, Li Zhen, Jing Chenchen, Liu Shuo, Shao Wenqi, Wu Yuwei, Luo Ping, Qiao Yu, Zhang Kaipeng
- Humaneval On Latest GPT Models -- 2024 Li Daniel, Murr Lincoln
- Contextualization Distillation From Large Language Model For Knowledge Graph Completion Li Dawei, Tan Zhen, Chen Tianlong, Liu Huan
- On The Role Of Long-tail Knowledge In Retrieval Augmented Large Language Models Li Dongyang, Yan Junbing, Zhang Taolin, Wang Chengyu, He Xiaofeng, Huang Longtao, Xue Hui, Huang Jun
- "Is Chatgpt A Better Explainer Than My Professor?": Evaluating The Explanation Capabilities Of Llms In Conversation Compared To A Human Baseline Li Grace, Alshomary Milad, Muresan Smaranda
- Look Within, Why Llms Hallucinate: A Causal Perspective Li He, Chi Haoang, Liu Mingyu, Yang Wenjing
- Streamingdialogue: Prolonged Dialogue Learning Via Long Context Compression With Minimal Losses Li Jia-nan, Tu Quan, Mao Cunli, Yu Zhengtao, Wen Ji-rong, Yan Rui
- Enhance Lifelong Model Editing With Continuous Data-adapter Association Li Jiaang, Wang Quan, Wang Zhongnan, Zhang Yongdong, Mao Zhendong
- Focus On Your Question! Interpreting And Mitigating Toxic Cot Problems In Commonsense Reasoning Li Jiachun, Cao Pengfei, Wang Chenhao, Jin Zhuoran, Chen Yubo, Zeng Daojian, Liu Kang, Zhao Jun
- A Transformer With Stack Attention Li Jiaoda, White Jennifer C., Sachan Mrinmaya, Cotterell Ryan
- Know The Unknown: An Uncertainty-sensitive Method For LLM Instruction Tuning Li Jiaqi, Tang Yixuan, Yang Yi
- Perteval: Unveiling Real Knowledge Capacity Of Llms With Knowledge-invariant Perturbations Li Jiatong, Hu Renjun, Huang Kunzhe, Zhuang Yan, Liu Qi, Zhu Mengxiao, Shi Xing, Lin Wei
- A Cross-language Investigation Into Jailbreak Attacks In Large Language Models Li Jie, Liu Yi, Liu Chongyang, Shi Ling, Ren Xiaoning, Zheng Yaowen, Liu Yang, Xue Yinxing
- Stylechat: Learning Recitation-augmented Memory In Llms For Stylized Dialogue Generation Li Jinpeng, Zhang Zekai, Tu Quan, Cheng Xin, Zhao Dongyan, Yan Rui
- Experimenting With Legal AI Solutions: The Case Of Question-answering For Access To Justice Li Jonathan, Bhambhoria Rohan, Dahan Samuel, Zhu Xiaodan
- Dissecting Human And LLM Preferences Li Junlong, Zhou Fan, Sun Shichao, Zhang Yikai, Zhao Hai, Liu Pengfei
- Measuring And Controlling Instruction (in)stability In Language Model Dialogs Li Kenneth, Liu Tianle, Bashkansky Naomi, Bau David, Viégas Fernanda, Pfister Hanspeter, Wattenberg Martin
- Dialogue Action Tokens: Steering Language Models In Goal-directed Dialogue With A Multi-turn Planner Li Kenneth, Wang Yiming, Viégas Fernanda, Wattenberg Martin
- On The Power Of Convolution Augmented Transformer Li Mingchen, Zhang Xuechen, Huang Yixiao, Oymak Samet
- Bridging The Language Gap: Enhancing Multilingual Prompt-based Code Generation In Llms Via Zero-shot Cross-lingual Transfer Li Mingda, Mishra Abhijit, Mujumdar Utkarsh
- Red Teaming Visual Language Models Li Mukai, Li Lei, Yin Yuwei, Ahmed Masood, Liu Zhenguang, Liu Qi
- From Text To Insight: Leveraging Large Language Models For Performance Evaluation In Management Li Ning, Zhou Huaikang, Xu Mingze
- Owlore: Outlier-weighed Layerwise Sampled Low-rank Projection For Memory-efficient LLM Fine-tuning Li Pengxiang, Yin Lu, Gao Xiaowei, Liu Shiwei
- Coevol: Constructing Better Responses For Instruction Finetuning Through Multi-agent Cooperation Li Renhao, Tan Minghuan, Wong Derek F., Yang Min
- Anchored Answers: Unravelling Positional Bias In Gpt-2's Multiple-choice Questions Li Ruizhe, Gao Yanjun
- IQA-EVAL: Automatic Evaluation Of Human-model Interactive Question Answering Li Ruosen, Wang Barry, Li Ruochen, Du Xinya
- Prompt4vis: Prompting Large Language Models With Example Mining And Schema Filtering For Tabular Data Visualization Li Shuaimin, Chen Xuanang, Song Yuanfeng, Song Yunze, Zhang Chen
- FSMR: A Feature Swapping Multi-modal Reasoning Approach With Joint Textual And Visual Clues Li Shuang, Wang Jiahua, Wen Lijie
- Improving Faithfulness Of Large Language Models In Summarization Via Sliding Generation And Self-consistency Li Taiji, Li Zhi, Zhang Yin
- Moe-ct: A Novel Approach For Large Language Models Training With Resistance To Catastrophic Forgetting Li Tianhao, Li Shangjie, Xie Binbin, Xiong Deyi, Yang Baosong
- TOPA: Extend Large Language Models For Video Understanding Via Text-only Pre-alignment Li Wei, Fan Hehe, Wong Yongkang, Kankanhalli Mohan, Yang Yi
- Stbench: Assessing The Ability Of Large Language Models In Spatio-temporal Analysis Li Wenbin, Yao Di, Zhao Ruibo, Chen Wenjie, Xu Zijie, Luo Chengxue, Gong Chang, Jing Quanliang, Tan Haining, Bi Jingping
- Uio-llms: Unbiased Incremental Optimization For Long-context Llms Li Wenhao, Lin Mingbao, Zhong Yunshan, Yan Shuicheng, Ji Rongrong
- MM-TTS: A Unified Framework For Multimodal, Prompt-induced Emotional Text-to-speech Synthesis Li Xiang, Cheng Zhi-qi, He Jun-yan, Peng Xiaojiang, Hauptmann Alexander G.
- 52B To 1T: Lessons Learned Via Tele-flm Series Li Xiang, Yao Yiqun, Jiang Xin, Fang Xuezhi, Wang Chao, Liu Xinzhang, Wang Zihan, Zhao Yu, Wang Xin, Huang Yuyao, Song Shuangyong, Li Yongxiang, Zhang Zheng, Zhao Bo, Sun Aixin, Wang Yequan, He Zhongjiang, Wang Zhongyuan, Li Xuelong, Huang Tiejun
- What If We Recaption Billions Of Web Images With Llama-3? Li Xianhang, Tu Haoqin, Hui Mude, Wang Zeyu, Zhao Bingchen, Xiao Junfei, Ren Sucheng, Mei Jieru, Liu Qing, Zheng Huangjie, Zhou Yuyin, Xie Cihang
- Preference Tuning For Toxicity Mitigation Generalizes Across Languages Li Xiaochen, Yong Zheng-xin, Bach Stephen H.
- Grounded Compositional And Diverse Text-to-3d With Pretrained Multi-view Diffusion Model Li Xiaolong, Mo Jiawei, Wang Ying, Parameshwara Chethan, Fei Xiaohan, Swaminathan Ashwin, Taylor Cj, Tu Zhuowen, Favaro Paolo, Soatto Stefano
- Evaluating Mathematical Reasoning Of Large Language Models: A Focus On Error Identification And Correction Li Xiaoyuan, Wang Wenjie, Li Moxin, Guo Junrong, Zhang Yang, Feng Fuli
- Personalized Language Modeling From Personalized Human Feedback Li Xinyu, Lipton Zachary C., Leqi Liu
- Rethinking Chatgpt's Success: Usability And Cognitive Behaviors Enabled By Auto-regressive Llms' Prompting Li Xinzhe, Liu Ming
- Reason From Fallacy: Enhancing Large Language Models' Logical Reasoning Through Logical Fallacy Understanding Li Yanda, Wang Dixuan, Liang Jiaqing, Jiang Guochao, He Qianyu, Xiao Yanghua, Yang Deqing
- Mini-gemini: Mining The Potential Of Multi-modality Vision Language Models Li Yanwei, Zhang Yuechen, Wang Chengyao, Zhong Zhisheng, Chen Yixin, Chu Ruihang, Liu Shaoteng, Jia Jiaya
- Making Long-context Language Models Better Multi-hop Reasoners Li Yanyang, Liang Shuo, Lyu Michael R., Wang Liwei
- Calrec: Contrastive Alignment Of Generative Llms For Sequential Recommendation Li Yaoyiran, Zhai Xiang, Alzantot Moustafa, Yu Keyi, Vulić Ivan, Korhonen Anna, Hammad Mohamed
- Backdoorllm: A Comprehensive Benchmark For Backdoor Attacks On Large Language Models Li Yige, Huang Hanxun, Zhao Yunhan, Ma Xingjun, Sun Jun
- Guiding Enumerative Program Synthesis With Large Language Models Li Yixuan, Parsert Julian, Polgreen Elizabeth
- Mateval: A Multi-agent Discussion Framework For Advancing Open-ended Text Evaluation Li Yu, Zhang Shenyu, Wu Rui, Huang Xiutian, Chen Yongrui, Xu Wenhao, Qi Guilin, Min Dehai
- Finding Challenging Metaphors That Confuse Pretrained Language Models Li Yucheng, Guerin Frank, Lin Chenghua
- Dynamic Data Sampler For Cross-language Transfer Learning In Large Language Models Li Yudong, Feng Yuhao, Zhou Wen, Zhao Zhe, Shen Linlin, Hou Cheng, Hou Xianxu
- Spin Glass Model Of In-context Learning Li Yuhao, Bai Ruoran, Huang Haiping
- VSTAR: Generative Temporal Nursing For Longer Dynamic Video Synthesis Li Yumeng, Beluch William, Keuper Margret, Zhang Dan, Khoreva Anna
- Llms Meet Long Video: Advancing Long Video Question Answering With An Interactive Visual Adapter In Llms Li Yunxin, Chen Xinyu, Hu Baotian, Zhang Min
- Uni-moe: Scaling Unified Multimodal Llms With Mixture Of Experts Li Yunxin, Jiang Shenyuan, Hu Baotian, Wang Longyue, Zhong Wanqi, Luo Wenhan, Ma Lin, Zhang Min
- Mmsci: A Multimodal Multi-discipline Dataset For Phd-level Scientific Comprehension Li Zekun, Yang Xianjun, Choi Kyuri, Zhu Wanrong, Hsieh Ryan, Kim Hyeonjung, Lim Jin Hyuk, Ji Sungyoung, Lee Byungju, Yan Xifeng, Petzold Linda Ruth, Wilson Stephen D., Lim Woosang, Wang William Yang
- Amphista: Accelerate LLM Inference With Bi-directional Multiple Drafting Heads In A Non-autoregressive Style Li Zeping, Yang Xinlong, Gao Ziheng, Liu Ji, Liu Zhuang, Li Dong, Peng Jinzhang, Tian Lu, Barsoum Emad
- Text-to-model: Text-conditioned Neural Network Diffusion For Train-once-for-all Personalization Li Zexi, Gao Lingzhi, Wu Chao
- Understanding And Patching Compositional Reasoning In Llms Li Zhaoyi, Jiang Gangwei, Xie Hong, Song Linqi, Lian Defu, Wei Ying
- Do Vision-language Transformers Exhibit Visual Commonsense? An Empirical Study Of VCR Li Zhenyang, Guo Yangyang, Wang Kejie, Chen Xiaolin, Nie Liqiang, Kankanhalli Mohan
- Focusllm: Scaling Llm's Context By Parallel Decoding Li Zhenyu, Zhang Yike, Pan Tengyu, Sun Yutao, Duan Zhichao, Fang Junjie, Han Rong, Wang Zixuan, Wang Jianyong
- EAGLE: Elevating Geometric Reasoning Through Llm-empowered Visual Instruction Tuning Li Zhihao, Du Yao, Liu Yang, Zhang Yan, Liu Yufang, Zhang Mengdi, Cai Xunliang
- Visualization Literacy Of Multimodal Large Language Models: A Comparative Study Li Zhimin, Miao Haichao, Pascucci Valerio, Liu Shusen
- Hunyuan-dit: A Powerful Multi-resolution Diffusion Transformer With Fine-grained Chinese Understanding Li Zhimin, Zhang Jianwei, Lin Qin, Xiong Jiangfeng, Long Yanxin, Deng Xinchi, Zhang Yingfang, Liu Xingchao, Huang Minbin, Xiao Zedong, Chen Dayou, He Jiajun, Li Jiahao, Li Wenyue, Zhang Chen, Quan Rongwei, Lu Jianxiang, Huang Jiabin, Yuan Xiaoyan, Zheng Xiaoxiao, Li Yixuan, Zhang Jihong, Zhang Chao, Chen Meng, Liu Jie, Fang Zheng, Wang Weiyan, Xue Jinbao, Tao Yangyu, Zhu Jianchen, Liu Kai, Lin Sihuan, Sun Yifu, Li Yun, Wang Dongdong, Chen Mingtao, Hu Zhichao, Xiao Xiao, Chen Yan, Liu Yuhong, Liu Wei, Wang Di, Yang Yong, Jiang Jie, Lu Qinglin
- Cog-ga: A Large Language Models-based Generative Agent For Vision-language Navigation In Continuous Environments Li Zhiyuan, Lu Yanfeng, Mu Yao, Qiao Hong
- Self-instructed Derived Prompt Generation Meets In-context Learning: Unlocking New Potential Of Black-box Llms Li Zhuo, Du Yuhao, Hu Jinpeng, Wan Xiang, Gao Anningzhe
- Effects Of Different Prompts On The Quality Of GPT-4 Responses To Dementia Care Questions Li Zhuochun, Xie Bo, Hilsabeck Robin, Aguirre Alyssa, Zou Ning, Luo Zhimeng, He Daqing
- Retrieval Augmented Generation Or Long-context Llms? A Comprehensive Study And Hybrid Approach Li Zhuowan, Li Cheng, Zhang Mingyang, Mei Qiaozhu, Bendersky Michael
- Implicit In-context Learning Li Zhuowei, Xu Zihao, Han Ligong, Gao Yunhe, Wen Song, Liu Di, Wang Hao, Metaxas Dimitris N.
- Two Stacks Are Better Than One: A Comparison Of Language Modeling And Translation As Multilingual Pretraining Objectives Li Zihao, Ji Shaoxiong, Mickus Timothee, Segonne Vincent, Tiedemann Jörg
- Vl-trojan: Multimodal Instruction Backdoor Attacks Against Autoregressive Visual Language Models Liang Jiawei, Liang Siyuan, Luo Man, Liu Aishan, Han Dongchen, Chang Ee-chien, Cao Xiaochun
- Bytecomposer: A Human-like Melody Composition Method Based On Language Model Agent Liang Xia, Du Xingjian, Lin Jiaju, Zou Pei, Wan Yuan, Zhu Bilei
- Fennec: Fine-grained Language Model Evaluation And Correction Extended Through Branching And Bridging Liang Xiaobo, Zhang Haoke, Hu Helan, Li Juntao, Xu Jun, Zhang Min
- Mitigating Hallucination In Visual-language Models Via Re-balancing Contrastive Decoding Liang Xiaoyu, Yu Jiayuan, Mu Lianrui, Zhuang Jiedong, Hu Jiaqi, Yang Yuchen, Ye Jiangnan, Lu Lu, Chen Jian, Hu Haoji
- Why Are My Prompts Leaked? Unraveling Prompt Extraction Threats In Customized Large Language Models Liang Zi, Hu Haibo, Ye Qingqing, Xiao Yaxin, Li Haoyang
- Apiq: Finetuning Of 2-bit Quantized Large Language Model Liao Baohao, Herold Christian, Khadivi Shahram, Monz Christof
- Beyond KV Caching: Shared Attention For Efficient Llms Liao Bingli, Vargas Danilo Vasconcellos
- Extending Token Computation For LLM Reasoning Liao Bingli, Vargas Danilo Vasconcellos
- Look Before You Leap: Problem Elaboration Prompting Improves Mathematical Reasoning In Large Language Models Liao Haoran, Tian Jidong, Hu Shaohua, He Hao, Jin Yaohui
- MARIO: Math Reasoning With Code Interpreter Output -- A Reproducible Pipeline Liao Minpeng, Luo Wei, Li Chengxi, Wu Jing, Fan Kai
- Learning-from-mistakes Prompting For Indigenous Language Translation Liao You-cheng, Yu Chen-jui, Lin Chi-yi, Yun He-feng, Wang Yen-hsiang, Li Hsiao-min, Fan Yao-chung
- MING-MOE: Enhancing Medical Multi-task Learning In Large Language Models With Sparse Mixture Of Low-rank Adapter Experts Liao Yusheng, Jiang Shuyang, Wang Yu, Wang Yanfeng
- D2LLM: Decomposed And Distilled Large Language Models For Semantic Search Liao Zihan, Yu Hang, Li Jianguo, Wang Jun, Zhang Wei
- Large Language Models As Recommender Systems: A Study Of Popularity Bias Lichtenberg Jan Malte, Buchholz Alexander, Schwöbel Pola
- Adapting Mental Health Prediction Tasks For Cross-lingual Learning Via Meta-training And In-context Learning With Large Language Model Lifelo Zita, Ning Huansheng, Dhelim Sahraoui
- The Power Of Words: Generating Powershell Attacks From Natural Language Liguori Pietro, Marescalco Christian, Natella Roberto, Orbinato Vittorio, Pianese Luciano
- Retrieval-augmented Natural Language Reasoning For Explainable Visual Question Answering Lim Su Hyeon, Kim Minkuk, Kim Hyeon Bae, Kim Seong Tae
- Wildbench: Benchmarking Llms With Challenging Tasks From Real Users In The Wild Lin Bill Yuchen, Deng Yuntian, Chandu Khyathi, Brahman Faeze, Ravichander Abhilasha, Pyatkin Valentina, Dziri Nouha, Bras Ronan Le, Choi Yejin
- Tree-based Hard Attention With Self-motivation For Large Language Models Lin Chenxi, Ren Jiayu, He Guoxiu, Jiang Zhuoren, Yu Haiyan, Zhu Xiaomin
- Fine-tuned Network Relies On Generic Representation To Solve Unseen Cognitive Task Lin Dongyan
- Probing Large Language Models For Scalar Adjective Lexical Semantics And Scalar Diversity Pragmatics Lin Fangru, Altshuler Daniel, Pierrehumbert Janet B.
- Graph-enhanced Large Language Models In Asynchronous Plan Reasoning Lin Fangru, La Malfa Emanuele, Hofmann Valentin, Yang Elle Michelle, Cohn Anthony, Pierrehumbert Janet B.
- Can Llms Understand The Implication Of Emphasized Sentences In Dialogue? Lin Guan-ting, Lee Hung-yi
- Emojicrypt: Prompt Encryption For Secure Communication With Large Language Models Lin Guo, Hua Wenyue, Zhang Yongfeng
- Mixture Of In-context Experts Enhance Llms' Long Context Awareness Lin Hongzhan, Lv Ang, Chen Yuhan, Zhu Chen, Song Yang, Zhu Hengshu, Yan Rui
- Designprobe: A Graphic Design Benchmark For Multimodal Large Language Models Lin Jieru, Huang Danqing, Zhao Tiejun, Zhan Dechen, Lin Chin-yew
- How Can I Get It Right? Using GPT To Rephrase Incorrect Trainee Responses Lin Jionghao, Han Zifei, Thomas Danielle R., Gurung Ashish, Gupta Shivang, Aleven Vincent, Koedinger Kenneth R.
- Syncpkl: Harnessing Llms To Generate Synthetic Data For Commonsense Persona Knowledge Linking Lin Kuan-yen
- Improving Visual Storytelling With Multimodal Large Language Models Lin Xiaochuan, Chen Xiangyong
- Diversedialogue: A Methodology For Designing Chatbots With Human-like Diversity Lin Xiaoyu, Yu Xinkai, Aich Ankit, Giorgi Salvatore, Ungar Lyle
- Efficient LLM Training And Serving With Heterogeneous Context Sharding Among Attention Heads Lin Xihui, Zhang Yunan, Ge Suyu, Patra Barun, Chaudhary Vishrav, Peng Hao, Song Xia
- Interpretable User Satisfaction Estimation For Conversational Systems With Large Language Models Lin Ying-chun, Neville Jennifer, Stokes Jack W., Yang Longqi, Safavi Tara, Wan Mengting, Counts Scott, Suri Siddharth, Andersen Reid, Xu Xiaofeng, Gupta Deepak, Jauhar Sujay Kumar, Song Xia, Buscher Georg, Tiwary Saurabh, Hecht Brent, Teevan Jaime
- Qserve: W4A8KV4 Quantization And System Co-design For Efficient LLM Serving Lin Yujun, Tang Haotian, Yang Shang, Zhang Zhekai, Xiao Guangxuan, Gan Chuang, Han Song
- Contextualized Sequence Likelihood: Enhanced Confidence Scores For Natural Language Generation Lin Zhen, Trivedi Shubhendu, Sun Jimeng
- Boosting Multimodal Large Language Models With Visual Tokens Withdrawal For Rapid Inference Lin Zhihang, Lin Mingbao, Lin Luxi, Ji Rongrong
- Evaluating Text-to-visual Generation With Image-to-text Generation Lin Zhiqiu, Pathak Deepak, Li Baiqi, Li Jiayao, Xia Xide, Neubig Graham, Zhang Pengchuan, Ramanan Deva
- Local And Global Contexts For Conversation Lin Zuoquan, Shen Xinyi
- Unveiling Vulnerability Of Self-attention Liong Khai Jiet, Wu Hongqiu, Zhao Hai
- The Colorful Future Of Llms: Evaluating And Improving Llms As Emotional Supporters For Queer Youth Lissak Shir, Calderon Nitay, Shenkman Geva, Ophir Yaakov, Fruchter Eyal, Klomek Anat Brunstein, Reichart Roi
- More Than Catastrophic Forgetting: Integrating General Capabilities For Domain-specific Llms Liu Chengyuan, Wang Shihang, Kang Yangyang, Qing Lizhi, Zhao Fubang, Sun Changlong, Kuang Kun, Wu Fei
- Few-shot Class Incremental Learning With Attention-aware Self-adaptive Prompt Liu Chenxi, Wang Zhenyi, Xiong Tianyi, Chen Ruibo, Wu Yihan, Guo Junfeng, Huang Heng
- Openeval: Benchmarking Chinese Llms Across Capability, Alignment And Safety Liu Chuang, Yu Linhao, Li Jiaxuan, Jin Renren, Huang Yufei, Shi Ling, Zhang Junhui, Ji Xinmeng, Cui Tingting, Liu Tao, Song Jinwang, Zan Hongying, Li Sun, Xiong Deyi
- Llmembed: Rethinking Lightweight Llm's Genuine Function In Text Classification Liu Chun, Zhang Hongguang, Zhao Kainan, Ju Xinghai, Yang Lin
- SPHINX-X: Scaling Data And Parameters For A Family Of Multi-modal Large Language Models Liu Dongyang, Zhang Renrui, Qiu Longtian, Huang Siyuan, Lin Weifeng, Zhao Shitian, Geng Shijie, Lin Ziyi, Jin Peng, Zhang Kaipeng, Shao Wenqi, Xu Chao, He Conghui, He Junjun, Shao Hao, Lu Pan, Li Hongsheng, Qiao Yu, Gao Peng
- Lumina-mgpt: Illuminate Flexible Photorealistic Text-to-image Generation With Multimodal Generative Pretraining Liu Dongyang, Zhao Shitian, Zhuo Le, Lin Weifeng, Qiao Yu, Li Hongsheng, Gao Peng
- An Incomplete Loop: Instruction Inference, Instruction Following, And In-context Learning In Language Models Liu Emmy, Neubig Graham, Andreas Jacob
- Pefomed: Parameter Efficient Fine-tuning Of Multimodal Large Language Models For Medical Imaging Liu Gang, He Jinlong, Li Pengfei, He Genrong, Chen Zhaolin, Zhong Shenjun
- Mmgrec: Multimodal Generative Recommendation With Transformer Model Liu Han, Wei Yinwei, Song Xuemeng, Guan Weili, Li Yuan-fang, Nie Liqiang
- Augmenting Math Word Problems Via Iterative Question Composing Liu Haoxiong, Zhang Yifan, Luo Yifan, Yao Andrew Chi-chih
- Laida: Linguistics-aware In-context Learning With Data Augmentation For Metaphor Components Identification Liu Hongde, He Chenyuan, Meng Feiyang, Niu Changyong, Jia Yuxiang
- VP-LLM: Text-driven 3D Volume Completion With Large Language Models Through Patchification Liu Jianmeng, Liu Yichen, Zhang Yuyao, Meng Zeyuan, Tai Yu-wing, Tang Chi-keung
- Tiny Refinements Elicit Resilience: Toward Efficient Prefix-model Against LLM Red-teaming Liu Jiaxu, Yin Xiangyu, Wu Sihao, Wang Jianhong, Fang Meng, Yi Xinping, Huang Xiaowei
- Iterative Length-regularized Direct Preference Optimization: A Case Study On Improving 7B Language Models To GPT-4 Level Liu Jie, Zhou Zhanhui, Liu Jiaheng, Bu Xingyuan, Yang Chao, Zhong Han-sen, Ouyang Wanli
- Mm-instruct: Generated Visual Instructions For Large Multimodal Model Alignment Liu Jihao, Huang Xin, Zheng Jinliang, Liu Boxiao, Wang Jia, Yoshie Osamu, Liu Yu, Li Hongsheng
- Visualwebbench: How Far Have Multimodal Llms Evolved In Web Page Understanding And Grounding? Liu Junpeng, Song Yifan, Lin Bill Yuchen, Lam Wai, Neubig Graham, Li Yuanzhi, Yue Xiang
- Educating Llms Like Human Students: Structure-aware Injection Of Domain Knowledge Liu Kai, Chen Ze, Fu Zhihang, Jiang Rongxin, Zhou Fan, Chen Yaowu, Wu Yue, Ye Jieping
- Enhancing Llm's Cognition Via Structurization Liu Kai, Fu Zhihang, Chen Chao, Zhang Wei, Jiang Rongxin, Zhou Fan, Chen Yaowu, Wu Yue, Ye Jieping
- Selectit: Selective Instruction Tuning For Large Language Models Via Uncertainty-aware Self-reflection Liu Liangxin, Liu Xuebo, Wong Derek F., Li Dongfang, Wang Ziyi, Hu Baotian, Zhang Min
- When Chatgpt Is Gone: Creativity Reverts And Homogeneity Persists Liu Qinghan, Zhou Yiyong, Huang Jihao, Li Guiquan
- Generative Expressive Conversational Speech Synthesis Liu Rui, Hu Yifan, Ren Yi, Yin Xiang, Li Haizhou
- Intactkv: Improving Large Language Model Quantization By Keeping Pivot Tokens Intact Liu Ruikang, Bai Haoli, Lin Haokun, Li Yuening, Gao Han, Xu Zhengzhuo, Hou Lu, Yao Jun, Yuan Chun
- ST-LLM: Large Language Models Are Effective Temporal Learners Liu Ruyang, Li Chen, Tang Haoran, Ge Yixiao, Shan Ying, Li Ge
- RL-GPT: Integrating Reinforcement Learning And Code-as-policy Liu Shaoteng, Yuan Haoqi, Hu Minda, Li Yanwei, Chen Yukang, Liu Shu, Lu Zongqing, Jia Jiaya
- SPINACH: Sparql-based Information Navigation For Challenging Real-world Questions Liu Shicheng, Semnani Sina J., Triedman Harold, Xu Jialiang, Zhao Isaac Dan, Lam Monica S.
- Are Llms Effective Backbones For Fine-tuning? An Experimental Investigation Of Supervised Llms On Chinese Short Text Matching Liu Shulin, Xu Chengcheng, Liu Hao, Yu Tinghao, Yang Tao
- Making Them Ask And Answer: Jailbreaking Large Language Models In Few Queries Via Disguise And Reconstruction Liu Tong, Zhang Yingjie, Zhao Zhe, Dong Yinpeng, Meng Guozhu, Chen Kai
- Memlong: Memory-augmented Retrieval For Long Text Modeling Liu Weijie, Tang Zecheng, Li Juntao, Chen Kehai, Zhang Min
- Toolace: Winning The Points Of LLM Function Calling Liu Weiwen, Huang Xu, Zeng Xingshan, Hao Xinlong, Yu Shuai, Li Dexun, Wang Shuai, Gan Weinan, Liu Zhengying, Yu Yuanqing, Wang Zezhong, Wang Yuxian, Ning Wu, Hou Yutai, Wang Bin, Wu Chuhan, Wang Xinzhi, Liu Yong, Wang Yasheng, Tang Duyu, Tu Dandan, Shang Lifeng, Jiang Xin, Tang Ruiming, Lian Defu, Liu Qun, Chen Enhong
- Imposter.ai: Adversarial Attacks With Hidden Intentions Towards Aligned Large Language Models Liu Xiao, Li Liangzhi, Xiang Tong, Ye Fuying, Wei Lu, Li Wangyue, Garcia Noa
- Harnessing Large Language Models For Multimodal Product Bundling Liu Xiaohao, Wu Jie, Tao Zhulin, Ma Yunshan, Wei Yinwei, Chua Tat-seng
- Farewell To Length Extrapolation, A Training-free Infinite Context With Finite Attention Scope Liu Xiaoran, Guo Qipeng, Song Yuerong, Liu Zhigeng, Lv Kai, Yan Hang, Li Linlin, Liu Qun, Qiu Xipeng
- An Empirical Analysis On Large Language Models In Debate Evaluation Liu Xinyi, Liu Pinxin, He Hangfeng
- Adaptive Draft-verification For Efficient Large Language Model Decoding Liu Xukun, Lei Bowen, Zhang Ruqi, Xu Dongkuan
- Omgeval: An Open Multilingual Generative Evaluation Benchmark For Large Language Models Liu Yang, Xu Meng, Wang Shuo, Yang Liner, Wang Haoyu, Liu Zhenghao, Kong Cunliang, Chen Yun, Liu Yang, Sun Maosong, Yang Erhong
- Do Large Language Models Possess Sensitive To Sentiment? Liu Yang, Zhu Xichou, Shen Zhou, Liu Yi, Li Min, Chen Yujun, John Benzi, Ma Zhenzhen, Hu Tao, Xu Zhiyang, Luo Wei, Wang Junhui
- Era-cot: Improving Chain-of-thought Through Entity Relationship Analysis Liu Yanming, Peng Xinyue, Du Tianyu, Yin Jianwei, Liu Weihao, Zhang Xuhong
- RA-ISF: Learning To Answer And Understand From Retrieval Augmentation Via Iterative Self-feedback Liu Yanming, Peng Xinyue, Zhang Xuhong, Liu Weihao, Yin Jianwei, Cao Jiannan, Du Tianyu
- Arondight: Red Teaming Large Vision Language Models With Auto-generated Multi-modal Jailbreak Prompts Liu Yi, Cai Chengjun, Zhang Xiaoli, Yuan Xingliang, Wang Cong
- Understanding Llms: A Comprehensive Overview From Training To Inference Liu Yiheng, He Hao, Han Tianle, Zhang Xu, Liu Mengyuan, Tian Jiaming, Zhang Yutong, Wang Jiaqi, Gao Xiaohui, Zhong Tianyang, Pan Yi, Xu Shaochen, Wu Zihao, Liu Zhengliang, Zhang Xin, Zhang Shu, Hu Xintao, Zhang Tuo, Qiang Ning, Liu Tianming, Ge Bao
- Synthvlm: High-efficiency And High-quality Synthetic Data For Vision Language Models Liu Zheng, Liang Hao, Huang Xijie, Xiong Wentao, Yu Qinhan, Sun Linzhuang, Chen Chong, He Conghui, Cui Bin, Zhang Wentao
- Scaffolding Language Learning Via Multi-modal Tutoring Systems With Pedagogical Instructions Liu Zhengyuan, Yin Stella Xin, Lee Carolyn, Chen Nancy F.
- Mixture Of Insightful Experts (mote): The Synergy Of Thought Chains And Expert Mixtures In Self-alignment Liu Zhili, Gou Yunhao, Chen Kai, Hong Lanqing, Gao Jiahui, Mi Fei, Zhang Yu, Li Zhenguo, Jiang Xin, Liu Qun, Kwok James T.
- Agentlite: A Lightweight Library For Building And Advancing Task-oriented LLM Agent System Liu Zhiwei, Yao Weiran, Zhang Jianguo, Yang Liangwei, Liu Zuxin, Tan Juntao, Choubey Prafulla K., Lan Tian, Wu Jason, Wang Huan, Heinecke Shelby, Xiong Caiming, Savarese Silvio
- Fantastic Semantics And Where To Find Them: Investigating Which Layers Of Generative Llms Reflect Lexical Semantics Liu Zhu, Kong Cunliang, Liu Ying, Sun Maosong
- Chatqa: Surpassing GPT-4 On Conversational QA And RAG Liu Zihan, Ping Wei, Roy Rajarshi, Xu Peng, Lee Chankyu, Shoeybi Mohammad, Catanzaro Bryan
- Enabling Weak Llms To Judge Response Reliability Via Meta Ranking Liu Zijun, Kou Boqun, Li Peng, Yan Ming, Zhang Ji, Huang Fei, Liu Yang
- A Closer Look Into Mixture-of-experts In Large Language Models Lo Ka Man, Huang Zeyu, Qiu Zihan, Wang Zili, Fu Jie
- Llms Are Biased Towards Output Formats! Systematically Evaluating And Mitigating Output Format Bias Of Llms Long Do Xuan, Ngoc Hai Nguyen, Sim Tiviatis, Dao Hieu, Joty Shafiq, Kawaguchi Kenji, Chen Nancy F., Kan Min-yen
- Evaluating Large Language Models In Analysing Classroom Dialogue Long Yun, Luo Haifeng, Zhang Yu
- Sparser Is Faster And Less Is More: Efficient Sparse Attention For Long-range Transformers Lou Chao, Jia Zixia, Zheng Zilong, Tu Kewei
- Roleplay-doh: Enabling Domain-experts To Create Llm-simulated Patients Via Eliciting And Adhering To Principles Louie Ryan, Nandi Ananjan, Fang William, Chang Cheng, Brunskill Emma, Yang Diyi
- Answering Real-world Clinical Questions Using Large Language Model Based Systems Low Yen Sia, Jackson Michael L., Hyde Rebecca J., Brown Robert E., Sanghavi Neil M., Baldwin Julian D., Pike C. William, Muralidharan Jananee, Hui Gavin, Alexander Natasha, Hassan Hadeel, Nene Rahul V., Pike Morgan, Pokrzywa Courtney J., Vedak Shivam, Yan Adam Paul, Yao Dong-han, Zipursky Amy R., Dinh Christina, Ballentine Philip, Derieg Dan C., Polony Vladimir, Chawdry Rehan N., Davies Jordan, Hyde Brigham B., Shah Nigam H., Gombar Saurabh
- Efficient Encoder-decoder Transformer Decoding For Decomposable Tasks Lu Bo-ru, Haduong Nikita, Lin Chien-yu, Cheng Hao, Smith Noah A., Ostendorf Mari
- From GPT-4 To Gemini And Beyond: Assessing The Landscape Of Mllms On Generalizability, Trustworthiness And Causality Through Four Modalities Lu Chaochao, Qian Chen, Zheng Guodong, Fan Hongxing, Gao Hongzhi, Zhang Jie, Shao Jing, Deng Jingyi, Fu Jinlan, Huang Kexin, Li Kunchang, Li Lijun, Wang Limin, Sheng Lu, Chen Meiqi, Zhang Ming, Ren Qibing, Chen Sirui, Gui Tao, Ouyang Wanli, Wang Yali, Teng Yan, Wang Yaru, Wang Yi, He Yinan, Wang Yingchun, Wang Yixu, Zhang Yongting, Qiao Yu, Shen Yujiong, Mou Yurong, Chen Yuxi, Zhang Zaibin, Shi Zhelun, Yin Zhenfei, Wang Zhipin
- Investigating Bias Representations In Llama 2 Chat Via Activation Steering Lu Dawn, Rimsky Nina
- Revisiting Multi-modal LLM Evaluation Lu Jian, Srivastava Shikhar, Chen Junyu, Shrestha Robik, Acharya Manoj, Kafle Kushal, Kanan Christopher
- In-context Time Series Predictor Lu Jiecheng, Sun Yan, Yang Shihao
- FIPO: Free-form Instruction-oriented Prompt Optimization With Preference Dataset And Modular Fine-tuning Schema Lu Junru, An Siyu, Zhang Min, He Yulan, Yin Di, Sun Xing
- Datasculpt: Crafting Data Landscapes For LLM Post-training Through Multi-objective Partitioning Lu Keer, Liang Zheng, Nie Xiaonan, Pan Da, Zhang Shusen, Zhao Keshi, Chen Weipeng, Zhou Zenan, Dong Guosheng, Zhang Wentao, Cui Bin
- Ovis: Structural Embedding Alignment For Multimodal Large Language Model Lu Shiyin, Li Yang, Chen Qing-guo, Xu Zhao, Luo Weihua, Zhang Kaifu, Ye Han-jia
- Insights Into LLM Long-context Failures: When Transformers Know But Don't Tell Lu Taiming, Gao Muhan, Yu Kuai, Byerly Adam, Khashabi Daniel
- Fine-tuning Large Language Models For Domain Adaptation: Exploration Of Training Strategies, Scaling, Model Merging And Synergistic Capabilities Lu Wei, Luu Rachel K., Buehler Markus J.
- Blending Is All You Need: Cheaper, Better Alternative To Trillion-parameters LLM Lu Xiaoding, Liu Zongyi, Liusie Adian, Raina Vyas, Mudupalli Vineet, Zhang Yuwen, Beauchamp William
- Generative Students: Using Llm-simulated Student Profiles To Support Question Item Evaluation Lu Xinyi, Wang Xu
- Longheads: Multi-head Attention Is Secretly A Long Context Processor Lu Yi, Zhou Xin, He Wei, Zhao Jun, Ji Tao, Gui Tao, Zhang Qi, Huang Xuanjing
- Asymptotic Theory Of In-context Learning By Linear Attention Lu Yue M., Letey Mary I., Zavatone-veth Jacob A., Maiti Anindita, Pehlevan Cengiz
- Wildvision: Evaluating Vision-language Models In The Wild With Human Preferences Lu Yujie, Jiang Dongfu, Chen Wenhu, Wang William Yang, Choi Yejin, Lin Bill Yuchen
- Mathgenie: Generating Synthetic Data With Question Back-translation For Enhancing Mathematical Reasoning Of Llms Lu Zimu, Zhou Aojun, Ren Houxing, Wang Ke, Shi Weikang, Pan Junting, Zhan Mingjie, Li Hongsheng
- To Err Is Human, But Llamas Can Learn It Too Luhtaru Agnes, Purason Taido, Vainikko Martin, Del Maksym, Fishel Mark
- Can Perplexity Predict Fine-tuning Performance? An Investigation Of Tokenization Effects On Sequential Language Models For Nepali Luitel Nishant, Bekoju Nirajan, Sah Anand Kumar, Shakya Subarna
- MINI-SEQUENCE TRANSFORMER: Optimizing Intermediate Memory For Long Sequences Training Luo Cheng, Zhao Jiawei, Chen Zhuoming, Chen Beidi, Anandkumar Anima
- Crosstune: Black-box Few-shot Classification With Label Enhancement Luo Danqing, Zhang Chen, Zhang Yan, Li Haizhou
- Learning Positional Attention For Sequential Recommendation Luo Fan, Zhang Juan, Xu Shenghui
- From Understanding To Utilization: A Survey On Explainability For Large Language Models Luo Haoyan, Specia Lucia
- Kuaiji: The First Chinese Accounting Large Language Model Luo Jiayuan, Yang Songhua, Qiu Xiaoling, Chen Panyu, Nai Yufei, Zeng Wenxuan, Zhang Wentao, Jiang Xinke
- Hallucination Detection And Hallucination Mitigation: An Investigation Luo Junliang, Li Tianyu, Wu Di, Jenkin Michael, Liu Steve, Dudek Gregory
- BGE Landmark Embedding: A Chunking-free Embedding Method For Retrieval Augmented Long-context Large Language Models Luo Kun, Liu Zheng, Xiao Shitao, Liu Kang
- Large Language Models As Foundations For Next-gen Dense Retrieval: A Comprehensive Empirical Assessment Luo Kun, Qin Minghao, Liu Zheng, Xiao Shitao, Zhao Jun, Liu Kang
- End-to-end Neuro-symbolic Reinforcement Learning With Textual Explanations Luo Lirui, Zhang Guoxi, Xu Hongming, Yang Yaodong, Fang Cong, Li Qing
- Assessing Empathy In Large Language Models With Real-world Physician-patient Interactions Luo Man, Warren Christopher J., Cheng Lu, Abdul-muhsin Haidar M., Banerjee Imon
- M\(^3\)GPT: An Advanced Multimodal, Multitask Framework For Motion Comprehension And Generation Luo Mingshuang, Hou Ruibing, Chang Hong, Liu Zimo, Wang Yaowei, Shan Shiguang
- Taking A Deep Breath: Enhancing Language Modeling Of Large Language Models With Sentinel Tokens Luo Weiyao, Zheng Suncong, Xia Heming, Wang Weikang, Lei Yan, Liu Tianyu, Chen Shuang, Sui Zhifang
- Zero-shot Cross-domain Dialogue State Tracking Via Dual Low-rank Adaptation Luo Xiang, Tang Zhiwen, Wang Jin, Zhang Xuejie
- Turning Trash Into Treasure: Accelerating Inference Of Large Language Models With Token Recycling Luo Xianzhen, Wang Yixuan, Zhu Qingfu, Zhang Zhiming, Zhang Xuanyu, Yang Qing, Xu Dongliang, Che Wanxiang
- Matching Domain Experts By Training From Scratch On Domain Knowledge Luo Xiaoliang, Sun Guangzhi, Love Bradley C.
- Ensuring Safe And High-quality Outputs: A Guideline Library Approach For Language Models Luo Yi, Lin Zhenghao, Zhang Yuhao, Sun Jiashuo, Lin Chen, Xu Chengjin, Su Xiangdong, Shen Yelong, Guo Jian, Gong Yeyun
- TRAWL: Tensor Reduced And Approximated Weights For Large Language Models Luo Yiran, Patel Het, Fu Yu, Ahn Dawon, Chen Jia, Dong Yue, Papalexakis Evangelos E.
- Are Large Language Models True Healthcare Jacks-of-all-trades? Benchmarking Across Health Professions Beyond Physician Exams Luo Zheheng, Yuan Chenhan, Xie Qianqian, Ananiadou Sophia
- Interpreting Key Mechanisms Of Factual Recall In Transformer-based Language Models Lv Ang, Chen Yuhan, Zhang Kaiyi, Wang Yulong, Liu Lifeng, Wen Ji-rong, Xie Jian, Yan Rui
- Coggpt: Unleashing The Power Of Cognitive Dynamics On Large Language Models Lv Yaojia, Pan Haojie, Fu Ruiji, Liu Ming, Wang Zhongyuan, Qin Bing
- GPT-4 Generated Narratives Of Life Events Using A Structured Narrative Prompt: A Validation Study Lynch Christopher J., Jensen Erik, Munro Madison H., Zamponi Virginia, Martinez Joseph, O'brien Kevin, Feldhaus Brandon, Smith Katherine, Reinhold Ann Marie, Gore Ross
- Keeping Llms Aligned After Fine-tuning: The Crucial Role Of Prompt Templates Lyu Kaifeng, Zhao Haoyu, Gu Xinran, Yu Dingli, Goyal Anirudh, Arora Sanjeev
- Automatic Summarization Of Doctor-patient Encounter Dialogues Using Large Language Model Through Prompt Tuning Lyu Mengxian, Peng Cheng, Li Xiaohan, Balian Patrick, Bian Jiang, Wu Yonghui
- Weblinx: Real-world Website Navigation With Multi-turn Dialogue Lù Xing Han, Kasner Zdeněk, Reddy Siva
- Enhancing Programming Education With Chatgpt: A Case Study On Student Perceptions And Interactions In A Python Course Ma Boxuan, Chen Li, Konomi Shin'ichi
- MOYU: A Theoretical Study On Massive Over-activation Yielded Uplifts In Llms Ma Chi, Huang Mincong, Wang Chao, Wang Yujie, Yu Lei
- First Activations Matter: Training-free Methods For Dynamic Activation In Large Language Models Ma Chi, Huang Mincong, Zhang Ying, Wang Chao, Wang Yujie, Yu Lei, Liu Chuan, Lin Wei
- EE-MLLM: A Data-efficient And Compute-efficient Multimodal Large Language Model Ma Feipeng, Zhou Yizhou, Li Hebei, He Zilong, Wu Siying, Rao Fengyun, Zhang Yueyi, Sun Xiaoyan
- Debate On Graph: A Flexible And Reliable Reasoning Framework For Large Language Models Ma Jie, Gao Zhitao, Chai Qi, Sun Wangchun, Wang Pinghui, Pei Hongbin, Tao Jing, Song Lingyun, Liu Jun, Zhang Chen, Cui Lizhen
- FBI-LLM: Scaling Up Fully Binarized Llms From Scratch Via Autoregressive Distillation Ma Liqun, Sun Mingjie, Shen Zhiqiang
- Making Pre-trained Language Models Better Continual Few-shot Relation Extractors Ma Shengkun, Han Jiale, Liang Yi, Cheng Bo
- The Era Of 1-bit Llms: All Large Language Models Are In 1.58 Bits Ma Shuming, Wang Hongyu, Ma Lingxiao, Wang Lei, Wang Wenhui, Huang Shaohan, Dong Li, Wang Ruiping, Xue Jilong, Wei Furu
- Aligned With LLM: A New Multi-modal Training Paradigm For Encoding Fmri Activity In Visual Cortex Ma Shuxiao, Wang Linyuan, Hou Senbao, Yan Bin
- Clawmachine: Fetching Visual Tokens As An Entity For Referring And Grounding Ma Tianren, Xie Lingxi, Tian Yunjie, Yang Boyu, Zhang Yuan, Doermann David, Ye Qixiang
- Megalodon: Efficient LLM Pretraining And Inference With Unlimited Context Length Ma Xuezhe, Yang Xiaomeng, Xiong Wenhan, Chen Beidi, Yu Lili, Zhang Hao, May Jonathan, Zettlemoyer Luke, Levy Omer, Zhou Chunting
- Sciagent: Tool-augmented Language Models For Scientific Reasoning Ma Yubo, Gou Zhibin, Hao Junheng, Xu Ruochen, Wang Shuohang, Pan Liangming, Yang Yujiu, Cao Yixin, Sun Aixin, Awadalla Hany, Chen Weizhu
- Mmlongbench-doc: Benchmarking Long-context Document Understanding With Visualizations Ma Yubo, Zang Yuhang, Chen Liangyu, Chen Meiqi, Jiao Yizhu, Li Xinze, Lu Xinyuan, Liu Ziyu, Ma Yan, Dong Xiaoyi, Zhang Pan, Pan Liangming, Jiang Yu-gang, Wang Jiaqi, Cao Yixin, Sun Aixin
- Llamoco: Instruction Tuning Of Large Language Models For Optimization Code Generation Ma Zeyuan, Guo Hongshu, Chen Jiacheng, Peng Guojun, Cao Zhiguang, Ma Yining, Gong Yue-jiao
- Gerea: Question-aware Prompt Captions For Knowledge-based Visual Question Answering Ma Ziyu, Li Shutao, Sun Bin, Cai Jianfei, Long Zuxiang, Ma Fuyan
- LLM In-context Recall Is Prompt Dependent Machlab Daniel, Battle Rick
- Steering Conversational Large Language Models For Long Emotional Support Conversations Madani Navid, Saha Sougata, Srihari Rohini
- Explaining Genetic Programming Trees Using Large Language Models Maddigan Paula, Lensen Andrew, Xue Bing
- Do Llms Know When To NOT Answer? Investigating Abstention Abilities Of Large Language Models Madhusudhan Nishanth, Madhusudhan Sathwik Tejaswi, Yadav Vikas, Hashemi Masoud
- Beyond Generative Artificial Intelligence: Roadmap For Natural Language Generation Maestre María Miró, Martínez-murillo Iván, Martin Tania J., Navarro-colorado Borja, Ferrández Antonio, Cueto Armando Suárez, Lloret Elena
- Hallucination-free? Assessing The Reliability Of Leading AI Legal Research Tools Magesh Varun, Surani Faiz, Dahl Matthew, Suzgun Mirac, Manning Christopher D., Ho Daniel E.
- Impact Of Model Size On Fine-tuned LLM Performance In Data-to-text Generation: A State-of-the-art Investigation Mahapatra Joy, Garain Utpal
- Evaluating Very Long-term Conversational Memory Of LLM Agents Maharana Adyasha, Lee Dong-ho, Tulyakov Sergey, Bansal Mohit, Barbieri Francesco, Fang Yuwei
- Leveraging Environment Interaction For Automated PDDL Generation And Planning With Large Language Models Mahdavi Sadegh, Aoki Raquel, Tang Keyi, Cao Yanshuai
- Too Late To Train, Too Early To Use? A Study On Necessity And Viability Of Low-resource Bengali Llms Mahfuz Tamzeed, Dey Satak Kumar, Naswan Ruwad, Adil Hasnaen, Sayeed Khondker Salman, Shahgir Haz Sameen
- VURF: A General-purpose Reasoning And Self-refinement Framework For Video Understanding Mahmood Ahmad, Vayani Ashmal, Naseer Muzammal, Khan Salman, Khan Fahad Shahbaz
- Learning To Plan Long-term For Language Modeling Mai Florian, Cornille Nathan, Moens Marie-francine
- Rephrasing The Web: A Recipe For Compute And Data-efficient Language Modeling Maini Pratyush, Seto Skyler, Bai He, Grangier David, Zhang Yizhe, Jaitly Navdeep
- How Effective Is GPT-4 Turbo In Generating School-level Questions From Textbooks Based On Bloom's Revised Taxonomy? Maity Subhankar, Deroy Aniket, Sarkar Sudeshna
- A Novel Multi-stage Prompting Approach For Language Agnostic MCQ Generation Using GPT Maity Subhankar, Deroy Aniket, Sarkar Sudeshna
- Fairylandai: Personalized Fairy Tales Utilizing Chatgpt And DALLE-3 Makridis Georgios, Oikonomou Athanasios, Koukos Vasileios
- Matmul Or No Matmul In The Era Of 1-bit Llms Malekar Jinendra, Elbtity Mohammed E., Zand Ramtin
- From Tarzan To Tolkien: Controlling The Language Proficiency Level Of Llms For Content Generation Malik Ali, Mayhew Stephen, Piech Chris, Bicknell Klinton
- ULLME: A Unified Framework For Large Language Model Embeddings With Generation-augmented Learning Man Hieu, Ngo Nghia Trung, Dernoncourt Franck, Nguyen Thien Huu
- IITK At Semeval-2024 Task 2: Exploring The Capabilities Of Llms For Safe Biomedical Natural Language Inference For Clinical Trials Mandal Shreyasi, Modi Ashutosh
- Can Large Language Models Automatically Score Proficiency Of Written Essays? Mansour Watheq, Albatarni Salam, Eltanbouly Sohaila, Elsayed Tamer
- Visually Guided Generative Text-layout Pre-training For Document Intelligence Mao Zhiming, Bai Haoli, Hou Lu, Wei Jiansheng, Jiang Xin, Liu Qun, Wong Kam-fai
- A Novel Nuanced Conversation Evaluation Framework For Large Language Models In Mental Health Marrapese Alexander, Suleiman Basem, Ullah Imdad, Kim Juno
- Beware Of Words: Evaluating The Lexical Richness Of Conversational Large Language Models Martínez Gonzalo, Hernández José Alberto, Conde Javier, Reviriego Pedro, Merino Elena
- Using Large Language Models To Estimate Features Of Multi-word Expressions: Concreteness, Valence, Arousal Martínez Gonzalo, Molero Juan Diego, González Sandra, Conde Javier, Brysbaert Marc, Reviriego Pedro
- Test-driven Development For Code Generation Mathews Noble Saji, Nagappan Meiyappan
- Do Large Language Models Understand Verbal Indicators Of Romantic Attraction? Matz Sandra C., Peters Heinrich, Eastwick Paul W., Cerf Moran, Finkel Eli J.
- XAI For All: Can Large Language Models Simplify Explainable AI? Mavrepis Philip, Makridis Georgios, Fatouros Georgios, Koukos Vasileios, Separdani Maria Margarita, Kyriazis Dimosthenis
- Can Opensource Beat Chatgpt? -- A Comparative Study Of Large Language Models For Text-to-code Generation Mayer Luis, Heumann Christian, Aßenmacher Matthias
- LLM Critics Help Catch LLM Bugs Mcaleese Nat, Pokorny Rai Michael, Uribe Juan Felipe Ceron, Nitishinskaya Evgenia, Trebacz Maja, Leike Jan
- Generative Artificial Intelligence In Higher Education: Evidence From An Analysis Of Institutional Policies And Guidelines Mcdonald Nora, Johri Aditya, Ali Areej, Hingle Aayushi
- Set-based Prompting: Provably Solving The Language Model Order Dependency Problem Mcilroy-young Reid, Brown Katrina, Olson Conlan, Zhang Linjun, Dwork Cynthia
- MM1: Methods, Analysis & Insights From Multimodal LLM Pre-training Mckinzie Brandon, Gan Zhe, Fauconnier Jean-philippe, Dodge Sam, Zhang Bowen, Dufter Philipp, Shah Dhruti, Du Xianzhi, Peng Futang, Weers Floris, Belyi Anton, Zhang Haotian, Singh Karanjeet, Kang Doug, Jain Ankur, Hè Hongyu, Schwarzer Max, Gunter Tom, Kong Xiang, Zhang Aonan, Wang Jianyu, Wang Chong, Du Nan, Lei Tao, Wiseman Sam, Yin Guoli, Lee Mark, Wang Zirui, Pang Ruoming, Grasch Peter, Toshev Alexander, Yang Yinfei
- Injecting New Knowledge Into Large Language Models Via Supervised Fine-tuning Mecklenburg Nick, Lin Yiyou, Li Xiaoxiao, Holstein Daniel, Nunes Leonardo, Malvar Sara, Silva Bruno, Chandra Ranveer, Aski Vijay, Yannam Pavan Kumar Reddy, Aktas Tolga, Hendry Todd
- Enhancing Creativity In Large Language Models Through Associative Thinking Strategies Mehrotra Pronita, Parab Aishni, Gulwani Sumit
- Metacheckgpt -- A Multi-task Hallucination Detector Using LLM Uncertainty And Meta-models Mehta Rahul, Hoblitzell Andrew, O'keefe Jack, Jang Hyeju, Varma Vasudeva
- Openelm: An Efficient Language Model Family With Open Training And Inference Framework Mehta Sachin, Sekhavat Mohammad Hossein, Cao Qingqing, Horton Maxwell, Jin Yanzi, Sun Chenfan, Mirzadeh Iman, Najibi Mahyar, Belenko Dmitry, Zatloukal Peter, Rastegari Mohammad
- Towards Human Understanding Of Paraphrase Types In Chatgpt Meier Dominik, Wahle Jan Philip, Ruas Terry, Gipp Bela
- On The Benchmarking Of Llms For Open-domain Dialogue Evaluation Mendonça John, Lavie Alon, Trancoso Isabel
- Soda-eval: Open-domain Dialogue Evaluation In The Age Of Llms Mendonça John, Trancoso Isabel, Lavie Alon
- Ecoh: Turn-level Coherence Evaluation For Multilingual Dialogues Mendonça John, Trancoso Isabel, Lavie Alon
- Phybench: A Physical Commonsense Benchmark For Evaluating Text-to-image Models Meng Fanqing, Shao Wenqi, Luo Lixin, Wang Yahong, Chen Yiran, Lu Quanfeng, Yang Yue, Yang Tianshuo, Zhang Kaipeng, Qiao Yu, Luo Ping
- Deepstack: Deeply Stacking Visual Tokens Is Surprisingly Simple And Effective For Lmms Meng Lingchen, Yang Jianwei, Tian Rui, Dai Xiyang, Wu Zuxuan, Gao Jianfeng, Jiang Yu-gang
- DCR: Divide-and-conquer Reasoning For Multi-choice Question Answering With Llms Meng Zijie, Zhang Yan, Feng Zhaopeng, Liu Zuozhu
- A Survey Of Resource-efficient LLM And Multimodal Foundation Models Xu Mengwei, Yin Wangsong, Cai Dongqi, Yi Rongjie, Xu Daliang, Wang Qipeng, Wu Bingyang, Zhao Yihao, Yang Chen, Wang Shihe, Zhang Qiyang, Lu Zhenyan, Zhang Li, Wang Shangguang, Li Yuanchun, Liu Yunxin, Jin Xin, Liu Xuanzhe
- Logic-enhanced Language Model Agents For Trustworthy Social Simulations Mensfelt Agnieszka, Stathis Kostas, Trencsenyi Vince
- Linearizing Large Language Models Mercat Jean, Vasiljevic Igor, Keh Sedrick, Arora Kushal, Dave Achal, Gaidon Adrien, Kollar Thomas
- Superposition Prompting: Improving And Accelerating Retrieval-augmented Generation Merth Thomas, Fu Qichen, Rastegari Mohammad, Najibi Mahyar
- Low-resource Machine Translation Through Retrieval-augmented LLM Prompting: A Study On The Mambai Language Merx Raphaël, Mahmudi Aso, Langford Katrina, De Araujo Leo Alberto, Vylomova Ekaterina
- A Comparison Of LLM Finetuning Methods & Evaluation Metrics With Travel Chatbot Use Case Meyer Sonia, Singh Shreya, Tam Bertha, Ton Christopher, Ren Angel
- User Centric Evaluation Of Code Generation Tools Miah Tanha, Zhu Hong
- Revenge Of The Fallen? Recurrent Models Match Transformers At Predicting Human Language Comprehension Metrics Michaelov James A., Arnett Catherine, Bergen Benjamin K.
- Is English The New Programming Language? How About Pseudo-code Engineering? Michaelsen Gian Alexandre, Santos Renato P. Dos
- I Have An Attention Bridge To Sell You: Generalization Capabilities Of Modular Translation Architectures Mickus Timothee, Vázquez Raúl, Attieh Joseph
- Semantics Of Multiword Expressions In Transformer-based Models: A Survey Miletić Filip, Walde Sabine Schulte Im
- Large Language Models: A Survey Minaee Shervin, Mikolov Tomas, Nikzad Narjes, Chenaghlu Meysam, Socher Richard, Amatriain Xavier, Gao Jianfeng
- Overview Of Dialogue Robot Competition 2023 Minato Takashi, Higashinaka Ryuichiro, Sakai Kurima, Funayama Tomo, Nishizaki Hiromitsu, Nagai Takayuki
- Trust No Bot: Discovering Personal Disclosures In Human-llm Conversations In The Wild Mireshghallah Niloofar, Antoniak Maria, More Yash, Choi Yejin, Farnadi Golnoosh
- Fine-grained Hallucination Detection And Editing For Language Models Mishra Abhika, Asai Akari, Balachandran Vidhisha, Wang Yizhong, Neubig Graham, Tsvetkov Yulia, Hajishirzi Hannaneh
- CBF-LLM: Safe Control For LLM Alignment Miyaoka Yuya, Inoue Masaki
- LSPT: Long-term Spatial Prompt Tuning For Visual Representation Learning Mo Shentong, Wang Yansen, Luo Xufang, Li Dongsheng
- Large Language Model (LLM) AI Text Generation Detection Based On Transformer Deep Learning Algorithm Mo Yuhong, Qin Hao, Dong Yushan, Zhu Ziyi, Li Zhenglin
- Evaluating Large Language Models With Human Feedback: Establishing A Swedish Benchmark Moell Birger
- RDBE: Reasoning Distillation-based Evaluation Enhances Automatic Essay Scoring Mohammadkhani Ali Ghiasvand
- E2TP: Element To Tuple Prompting Improves Aspect Sentiment Tuple Prediction Mohammadkhani Mohammad Ghiasvand, Ranjbar Niloofar, Momtazi Saeedeh
- Routoo: Learning To Route To Large Language Models Effectively Mohammadshahi Alireza, Shaikh Arshad Rafiq, Yazdani Majid
- Llava-chef: A Multi-modal Generative Model For Food Recipes Mohbat Fnu, Zaki Mohammed J.
- Can We Trust Large Language Models Generated Code? A Framework For In-context Learning, Security Patterns, And Code Evaluations Across Diverse Llms Mohsin Ahmad, Janicke Helge, Wood Adrian, Sarker Iqbal H., Maglaras Leandros, Janjua Naeem
- Comparison Of Large Language Models For Generating Contextually Relevant Questions Molina Ivo Lodovico, Švábenský Valdemar, Minematsu Tsubasa, Chen Li, Okubo Fumiya, Shimada Atsushi
- Kam-cot: Knowledge Augmented Multimodal Chain-of-thoughts Reasoning Mondal Debjyoti, Modi Suraj, Panda Subhadarshi, Singh Rituraj, Rao Godawari Sudhakar
- Towards Generating Informative Textual Description For Neurons In Language Models Mondal Shrayani, Garodia Rishabh, Qureshi Arbaaz, Lee Taesung, Park Youngja
- Realm: Reference Resolution As Language Modeling Moniz Joel Ruben Antony, Krishnan Soundarya, Ozyildirim Melis, Saraf Prathamesh, Ates Halim Cagri, Zhang Yuan, Yu Hong
- Xc-cache: Cross-attending To Cached Context For Efficient LLM Inference Monteiro João, Marcotte Étienne, Noël Pierre-andré, Zantedeschi Valentina, Vázquez David, Chapados Nicolas, Pal Christopher, Taslakian Perouz
- Are Llms Robust For Spoken Dialogues? Mousavi Seyed Mahed, Roccabruna Gabriel, Alghisi Simone, Rizzoli Massimo, Ravanelli Mirco, Riccardi Giuseppe
- Cross-layer Attention Sharing For Large Language Models Mu Yongyu, Wu Yuzhang, Fan Yuchun, Wang Chenglong, Li Hengyu, He Qiaozhi, Yang Murun, Xiao Tong, Zhu Jingbo
- Are Large Language Models Actually Good At Text Style Transfer? Mukherjee Sourabrata, Ojha Atul Kr., Dušek Ondřej
- Grouse: A Benchmark To Evaluate Evaluators In Grounded Question Answering Muller Sacha, Loison António, Omrani Bilel, Viaud Gautier
- Chatgpt In Classrooms: Transforming Challenges Into Opportunities In Education Munawar Harris Bin, Misirlis Nikolaos
- Leave No Context Behind: Efficient Infinite Context Transformers With Infini-attention Munkhdalai Tsendsuren, Faruqui Manaal, Gopal Siddharth
- Compact Language Models Via Pruning And Knowledge Distillation Muralidharan Saurav, Sreenivas Sharath Turuvekere, Joshi Raviraj, Chochowski Marcin, Patwary Mostofa, Shoeybi Mohammad, Catanzaro Bryan, Kautz Jan, Molchanov Pavlo
- Evaluating The Evaluator: Measuring Llms' Adherence To Task Evaluation Instructions Murugadoss Bhuvanashree, Poelitz Christian, Drosos Ian, Le Vu, Mckenna Nick, Negreanu Carina Suzana, Parnin Chris, Sarkar Advait
- Open-llm-leaderboard: From Multi-choice To Open-style Questions For Llms Evaluation, Benchmark, And Arena Myrzakhan Aidar, Bsharat Sondos Mahmoud, Shen Zhiqiang
- Benchmarking Llama2, Mistral, Gemma And GPT For Factuality, Toxicity, Bias And Propensity For Hallucinations Nadeau David, Kroutikov Mike, Mcneil Karen, Baribeau Simon
- Negation Blindness In Large Language Models: Unveiling The NO Syndrome In Image Generation Nadeem Mohammad, Sohail Shahab Saquib, Cambria Erik, Schuller Björn W., Hussain Amir
- A Multi-expert Large Language Model Architecture For Verilog Code Generation Nadimi Bardia, Zheng Hao
- Zero-shot Visual Reasoning By Vision-language Models: Benchmarking And Analysis Nagar Aishik, Jaiswal Shantanu, Tan Cheston
- Generating Situated Reflection Triggers About Alternative Solution Paths: A Case Study Of Generative AI For Computer-supported Collaborative Learning Naik Atharva, Yin Jessica Ruhan, Kamath Anusha, Ma Qianou, Wu Sherry Tongshuang, Murray Charles, Bogart Christopher, Sakr Majd, Rose Carolyn P.
- Cdquant: Accurate Post-training Weight Quantization Of Large Pre-trained Models Using Greedy Coordinate Descent Nair Pranav Ajit, Suggala Arun Sai
- Beyond The Hype: A Dispassionate Look At Vision-language Models In Medical Scenario Nan Yang, Zhou Huichi, Xing Xiaodan, Yang Guang
- What If Red Can Talk? Dynamic Dialogue Generation Using Large Language Models Nananukul Navapat, Wongkamjan Wichayaporn
- Review-feedback-reason (refer): A Novel Framework For NLG Evaluation And Reasoning Narsupalli Yaswanth, Chandra Abhranil, Muppirala Sreevatsa, Gupta Manish, Goyal Pawan
- Exploring Italian Sentence Embeddings Properties Through Multi-tasking Nastase Vivi, Samo Giuseppe, Jiang Chunyang, Merlo Paola
- Segment-based Interactive Machine Translation For Pre-trained Models Navarro Angel, Casacuberta Francisco
- Dynamic Memory Compression: Retrofitting Llms For Accelerated Inference Nawrot Piotr, Łańcucki Adrian, Chochowski Marcin, Tarjan David, Ponti Edoardo M.
- Concise Thoughts: Impact Of Output Length On LLM Reasoning And Cost Nayab Sania, Rossolini Giulio, Buttazzo Giorgio, Manes Nicolamaria, Giacomelli Fabrizio
- Revolutionizing Undergraduate Learning: Coursegpt And Its Generative AI Advancements Nazar Ahmad M., Selim Mohamed Y., Gaffar Ashraf, Ahmed Shakil
- Improving Robustness Of Llm-based Speech Synthesis By Learning Monotonic Alignment Neekhara Paarth, Hussain Shehzeen, Ghosh Subhankar, Li Jason, Valle Rafael, Badlani Rohan, Ginsburg Boris
- Needle In The Haystack For Memory Based Large Language Models Nelson Elliot, Kollias Georgios, Das Payel, Chaudhury Subhajit, Dan Soham
- Interpreting Context Look-ups In Transformers: Investigating Attention-mlp Interactions Neo Clement, Cohen Shay B., Barez Fazl
- What Drives Performance In Multilingual Language Models? Nezhad Sina Bagheri, Agrawal Ameeta
- Educational Personalized Learning Path Planning With Large Language Models Ng Chee, Fung Yuen
- How Well Can Llms Echo Us? Evaluating AI Chatbots' Role-play Ability With ECHO Ng Man Tik, Tse Hui Tung, Huang Jen-tse, Li Jingjing, Wang Wenxuan, Lyu Michael R.
- Dude: Dual Distribution-aware Context Prompt Learning For Large Vision-language Model Nguyen Duy M. H., Le An T., Nguyen Trung Q., Diep Nghiem T., Nguyen Tai, Duong-tran Duy, Peters Jan, Shen Li, Niepert Mathias, Sonntag Daniel
- Gpts And Language Barrier: A Cross-lingual Legal QA Examination Nguyen Ha-thanh, Yamada Hiroaki, Satoh Ken
- Advancing Vietnamese Visual Question Answering With Transformer And Convolutional Integration Nguyen Ngoc Son, Nguyen Van Son, Le Tung
- Better Alignment With Instruction Back-and-forth Translation Nguyen Thao, Li Jeffrey, Oh Sewoong, Schmidt Ludwig, Weston Jason, Zettlemoyer Luke, Li Xian
- Cultural Commonsense Knowledge For Intercultural Dialogues Nguyen Tuan-phong, Razniewski Simon, Weikum Gerhard
- Llms For Generating And Evaluating Counterfactuals: A Comprehensive Study Nguyen Van Bach, Youssef Paul, Schlötterer Jörg, Seifert Christin
- Xwin-lm: Strong And Scalable Alignment Practice For Llms Ni Bolin, Hu Jingcheng, Wei Yixuan, Peng Houwen, Zhang Zheng, Meng Gaofeng, Hu Han
- Mozip: A Multilingual Benchmark To Evaluate Large Language Models In Intellectual Property Ni Shiwen, Tan Minghuan, Bai Yuelin, Niu Fuqiang, Yang Min, Zhang Bowen, Xu Ruifeng, Chen Xiaojun, Li Chengming, Hu Xiping, Li Ye, Fan Jianping
- Layer-wise Regularized Dropout For Neural Language Models Ni Shiwen, Yang Min, Xu Ruifeng, Li Chengming, Hu Xiping
- A Systematic Evaluation Of Large Language Models For Natural Language Generation Tasks Ni Xuanfan, Li Piji
- The GPT Surprise: Offering Large Language Model Chat In A Massive Coding Class Reduced Engagement But Increased Adopters Exam Performances Nie Allen, Chandak Yash, Suzara Miroslav, Ali Malika, Woodrow Juliette, Peng Matt, Sahami Mehran, Brunskill Emma, Piech Chris
- Decomposed Prompting: Unveiling Multilingual Linguistic Structure Knowledge In English-centric Large Language Models Nie Ercong, Yuan Shuzhou, Ma Bolei, Schmid Helmut, Färber Michael, Kreuter Frauke, Schütze Hinrich
- Llm-based Conversational AI Therapist For Daily Functioning Screening And Psychotherapeutic Intervention Via Everyday Smart Devices Nie Jingping, Shao Hanya, Fan Yuang, Shao Qijia, You Haoxuan, Preindl Matthias, Jiang Xiaofan
- Do Multilingual Large Language Models Mitigate Stereotype Bias? Nie Shangrui, Fromm Michael, Welch Charles, Görge Rebekka, Karimi Akbar, Plepi Joan, Mowmita Nazia Afsan, Flores-herr Nicolas, Ali Mehdi, Flek Lucie
- Cfinbench: A Comprehensive Chinese Financial Benchmark For Large Language Models Nie Ying, Yan Binwei, Guo Tianyu, Liu Hao, Wang Haoyu, He Wei, Zheng Binfan, Wang Weihao, Li Qiang, Sun Weijian, Wang Yunhe, Tao Dacheng
- WTU-EVAL: A Whether-or-not Tool Usage Evaluation Benchmark For Large Language Models Ning Kangyun, Su Yisong, Lv Xueqiang, Zhang Yuanzhe, Liu Jian, Liu Kang, Xu Jinan
- User-llm: Efficient LLM Contextualization With User Embeddings Ning Lin, Liu Luyang, Wu Jiaxing, Wu Neo, Berlowitz Devora, Prakash Sushant, Green Bradley, O'banion Shawn, Xie Jun
- How Reliable AI Chatbots Are For Disease Prediction From Patient Complaints? Nipu Ayesha Siddika, Islam K M Sajjadul, Madiraju Praveen
- Quantified Task Misalignment To Inform PEFT: An Exploration Of Domain Generalization And Catastrophic Forgetting In CLIP Niss Laura, Vogt-lowell Kevin, Tsiligkaridis Theodoros
- Enhancing Dialogue State Tracking Models Through Llm-backed User-agents Simulation Niu Cheng, Wang Xingguang, Cheng Xuxin, Song Juntong, Zhang Tong
- From Text To Emotion: Unveiling The Emotion Annotation Capabilities Of Llms Niu Minxue, Jaiswal Mimansa, Provost Emily Mower
- Large Language Models And Cognitive Science: A Comprehensive Review Of Similarities, Differences, And Challenges Niu Qian, Liu Junyu, Bi Ziqian, Feng Pohsun, Peng Benji, Chen Keyu, Li Ming
- Language Portability Strategies For Open-domain Dialogue With Pre-trained Language Models From High To Low Resource Languages Njifenjou Ahmed, Sucal Virgile, Jabaian Bassam, Lefèvre Fabrice
- Mobileflow: A Multimodal LLM For Mobile GUI Agent Nong Songqin, Zhu Jiali, Wu Rui, Jin Jiongchao, Shan Shuo, Huang Xiutian, Xu Wenhao
- Generative AI In EU Law: Liability, Privacy, Intellectual Property, And Cybersecurity Novelli Claudio, Casolari Federico, Hacker Philipp, Spedicato Giorgio, Floridi Luciano
- Improving Black-box Robustness With In-context Rewriting O'brien Kyle, Ng Nathan, Puri Isha, Mendez Jorge, Palangi Hamid, Kim Yoon, Ghassemi Marzyeh, Hartvigsen Thomas
- Fine-tuning With HED-IT: The Impact Of Human Post-editing For Dialogical Language Models Occhipinti Daniela, Marchi Michele, Mondella Irene, Lai Huiyuan, Dell'orletta Felice, Nissim Malvina, Guerini Marco
- Beyond Metrics: Evaluating Llms' Effectiveness In Culturally Nuanced, Low-resource Real-world Scenarios Ochieng Millicent, Gumma Varun, Sitaram Sunayana, Wang Jindong, Chaudhary Vishrav, Ronen Keshet, Bali Kalika, O'neill Jacki
- Robust Adaptation Of Foundation Models With Black-box Visual Prompting Oh Changdae, Seo Gyeongdeok, Jung Geunyoung, Cheng Zhi-qi, Choi Hosik, Jung Jiyoung, Song Kyungwoo
- GECKO: Generative Language Model For English, Code And Korean Oh Sungwoo, Kim Donggyu
- Gpt-ology, Computational Models, Silicon Sampling: How Should We Think About Llms In Cognitive Science? Ong Desmond C.
- Jamba: A Hybrid Transformer-mamba Language Model Lieber Opher, Lenz Barak, Bata Hofit, Cohen Gal, Osin Jhonathan, Dalmedigos Itay, Safahi Erez, Meirom Shaked, Belinkov Yonatan, Shalev-shwartz Shai, Abend Omri, Alon Raz, Asida Tomer, Bergman Amir, Glozman Roman, Gokhman Michael, Manevich Avashalom, Ratner Nir, Rozen Noam, Shwartz Erez, Zusman Mor, Shoham Yoav
- Transformers Are Multi-state Rnns Oren Matanel, Hassid Michael, Yarden Nir, Adi Yossi, Schwartz Roy
- Prompting Large Vision-language Models For Compositional Reasoning Ossowski Timothy, Jiang Ming, Hu Junjie
- A Systematic Review Of Data-to-text NLG Osuji Chinonso Cynthia, Ferreira Thiago Castro, Davis Brian
- The Accuracy Of Domain Specific And Descriptive Analysis Generated By Large Language Models Otieno Denish Omondi, Abri Faranak, Siami-namini Sima, Namin Akbar Siami
- Lossless Acceleration Of Large Language Model Via Adaptive N-gram Parallel Decoding Ou Jie, Chen Yueming, Tian Wenhong
- Easyinstruct: An Easy-to-use Instruction Processing Framework For Large Language Models Ou Yixin, Zhang Ningyu, Gui Honghao, Xu Ziwen, Qiao Shuofei, Xue Yida, Fang Runnan, Liu Kangwei, Li Lei, Bi Zhen, Zheng Guozhou, Chen Huajun
- QLSC: A Query Latent Semantic Calibrator For Robust Extractive Question Answering Ouyang Sheng, Wang Jianzong, Zhang Yong, Li Zhitao, Liang Ziqi, Zhang Xulong, Cheng Ning, Xiao Jing
- Unraveling The Dilemma Of AI Errors: Exploring The Effectiveness Of Human And Machine Explanations For Large Language Models Pafla Marvin, Larson Kate, Hancock Mark
- NLP At UC Santa Cruz At Semeval-2024 Task 5: Legal Answer Validation Using Few-shot Multi-choice QA Pahilajani Anish, Jain Samyak Rajesh, Trivedi Devasha
- Training Llms To Recognize Hedges In Spontaneous Narratives Paige Amie J., Soubki Adil, Murzaku John, Rambow Owen, Brennan Susan E.
- Gemini Goes To Med School: Exploring The Capabilities Of Multimodal Large Language Models On Medical Challenge Problems & Hallucinations Pal Ankit, Sankarasubbu Malaikannan
- The Solution For The AIGC Inference Performance Optimization Competition Pan Sishun, Xu Haonan, Wan Zhonghua, Yang Yang
- Assessing AI Detectors In Identifying Ai-generated Code: Implications For Education Pan Wei Hung, Chok Ming Jie, Wong Jonathan Leong Shan, Shin Yung Xin, Poon Yeong Shian, Yang Zhou, Chong Chun Yong, Lo David, Lim Mei Kuan
- Instinfer: In-storage Attention Offloading For Cost-effective Long-context LLM Inference Pan Xiurui, Li Endian, Li Qiao, Liang Shengwen, Shan Yizhou, Zhou Ke, Luo Yingwei, Wang Xiaolin, Zhang Jie
- Preparing Lessons For Progressive Training On Language Models Pan Yu, Yuan Ye, Yin Yichun, Shi Jiaxin, Xu Zenglin, Zhang Ming, Shang Lifeng, Jiang Xin, Liu Qun
- Llmlingua-2: Data Distillation For Efficient And Faithful Task-agnostic Prompt Compression Pan Zhuoshi, Wu Qianhui, Jiang Huiqiang, Xia Menglin, Luo Xufang, Zhang Jue, Lin Qingwei, Rühle Victor, Yang Yuqing, Lin Chin-yew, Zhao H. Vicky, Qiu Lili, Zhang Dongmei
- AILS-NTUA At Semeval-2024 Task 9: Cracking Brain Teasers: Transformer Models For Lateral Thinking Puzzles Panagiotopoulos Ioannis, Filandrianos Giorgos, Lymperaiou Maria, Stamou Giorgos
- Can Large Language Models Put 2 And 2 Together? Probing For Entailed Arithmetical Relationships Panas D., Seth S., Belle V.
- Salute The Classic: Revisiting Challenges Of Machine Translation In The Age Of Large Language Models Pang Jianhui, Ye Fanghua, Wang Longyue, Yu Dian, Wong Derek F., Shi Shuming, Tu Zhaopeng
- Anchor-based Large Language Models Pang Jianhui, Ye Fanghua, Wong Derek Fai, He Xin, Chen Wanshun, Wang Longyue
- Attndreambooth: Towards Text-aligned Personalized Text-to-image Generation Pang Lianyu, Yin Jian, Zhao Baoquan, Wu Feize, Wang Fu Lee, Li Qing, Mao Xudong
- Phased Instruction Fine-tuning For Large Language Models Pang Wei, Zhou Chuan, Zhou Xiao-hua, Wang Xiaojie
- Self-alignment Of Large Language Models Via Monopolylogue-based Social Scene Simulation Pang Xianghe, Tang Shuo, Ye Rui, Xiong Yuxin, Zhang Bolun, Wang Yanfeng, Chen Siheng
- LLM Evaluators Recognize And Favor Their Own Generations Panickssery Arjun, Bowman Samuel R., Feng Shi
- Shaking Up Vlms: Comparing Transformers And Structured State Space Models For Vision & Language Modeling Pantazopoulos Georgios, Nikandrou Malvina, Suglia Alessandro, Lemon Oliver, Eshghi Arash
- Learning To See But Forgetting To Follow: Visual Instruction Tuning Makes Llms More Prone To Jailbreak Attacks Pantazopoulos Georgios, Parekh Amit, Nikandrou Malvina, Suglia Alessandro
- Do Vision & Language Decoders Use Images And Text Equally? How Self-consistent Are Their Explanations? Parcalabescu Letitia, Frank Anette
- Picturing Ambiguity: A Visual Twist On The Winograd Schema Challenge Park Brendan, Janecek Madeline, Ezzati-jivan Naser, Li Yifeng, Emami Ali
- Memorizing Documents With Guidance In Large Language Models Park Bumjin, Choi Jaesik
- Pragmatic Competence Evaluation Of Large Language Models For Korean Park Dojun, Lee Jiwoo, Jeong Hyeyun, Park Seohyun, Lee Sungeun
- Low-resource Cross-lingual Summarization Through Few-shot Learning With Large Language Models Park Gyutae, Hwang Seojin, Lee Hwanhee
- Loramap: Harnessing The Power Of Lora Connections Park Hyeryun, Kwak Jeongwon, Jang Dongsuk, Park Sumin, Choi Jinwook
- Llms For Enhanced Agricultural Meteorological Recommendations Park Ji-jun, Choi Soo-joon
- Building Trust In Mental Health Chatbots: Safety Metrics And Llm-based Evaluation Tools Park Jung In, Abbasian Mahyar, Azimi Iman, Bounds Dawn, Jun Angela, Han Jaesu, Mccarron Robert, Borelli Jessica, Li Jia, Mahmoudi Mona, Wiedenhoeft Carmen, Rahmani Amir
- Token-picker: Accelerating Attention In Text Generation With Minimized Memory Transfer Via Probability Estimation Park Junyoung, Kang Myeonggu, Han Yunki, Kim Yanggon, Shin Jaekang, Kim Lee-sup
- Pre-trained Vision And Language Transformers Are Few-shot Incremental Learners Park Keon-hee, Song Kyungwoo, Park Gyeong-moon
- Disentangling Length From Quality In Direct Preference Optimization Park Ryan, Rafailov Rafael, Ermon Stefano, Finn Chelsea
- Any-precision LLM: Low-cost Deployment Of Multiple, Different-sized Llms Park Yeonhong, Hyun Jake, Cho Sanglyul, Sim Bonggeun, Lee Jae W.
- Can Separators Improve Chain-of-thought Prompting? Park Yoonjeong, Kim Hyunjin, Choi Chanyeol, Kim Junseong, Sohn Jy-yong
- Logicbench: Towards Systematic Evaluation Of Logical Reasoning Ability Of Large Language Models Parmar Mihir, Patel Nisarg, Varshney Neeraj, Nakamura Mutsumi, Luo Man, Mashetty Santosh, Mitra Arindam, Baral Chitta
- The Ultimate Guide To Fine-tuning Llms From Basics To Breakthroughs: An Exhaustive Review Of Technologies, Research, Best Practices, Applied Research Challenges And Opportunities Parthasarathy Venkatesh Balavadhani, Zafar Ahtsham, Khan Aafaq, Shahid Arsalan
- Evidence To Generate (E2G): A Single-agent Two-step Prompting For Context Grounded And Retrieval Augmented Reasoning Parvez Md Rizwan
- Decoding News Narratives: A Critical Analysis Of Large Language Models In Framing Detection Pastorino Valeria, Sivakumar Jasivan A., Moosavi Nafise Sadat
- Automating Code Adaptation For Mlops -- A Benchmarking Study On Llms Patel Harsh, Ramanan Buvaneswari A., Khan Manzoor A., Williams Thomas, Friedman Brian, Drabeck Lawrence
- Multi-logieval: Towards Evaluating Multi-step Logical Reasoning Ability Of Large Language Models Patel Nisarg, Kulkarni Mohith, Parmar Mihir, Budhiraja Aashna, Nakamura Mutsumi, Varshney Neeraj, Baral Chitta
- SWAG: Storytelling With Action Guidance Patel Zeeshan, El-refai Karim, Pei Jonathan, Li Tianle
- Generative AI As A Metacognitive Agent: A Comparative Mixed-method Study With Human Participants On Icf-mimicking Exam Performance Pavlovic Jelena, Krstic Jugoslav, Mitrovic Luka, Babic Djordje, Milosavljevic Adrijana, Nikolic Milena, Karaklic Tijana, Mitrovic Tijana
- The Effectiveness Of Llms As Annotators: A Comparative Overview And Empirical Analysis Of Direct Representation Pavlovic Maja, Poesio Massimo
- Automated Multi-language To English Machine Translation Using Generative Pre-trained Transformers Pelofske Elijah, Urias Vincent, Liebrock Lorie M.
- Securing Large Language Models: Addressing Bias, Misinformation, And Prompt Attacks Peng Benji, Chen Keyu, Li Ming, Feng Pohsun, Bi Ziqian, Liu Junyu, Niu Qian
- Answer Is All You Need: Instruction-following Text Embedding Via Answering The Question Peng Letian, Zhang Yuwei, Wang Zilong, Srinivasa Jayanth, Liu Gaowen, Wang Zihan, Shang Jingbo
- Review-llm: Harnessing Large Language Models For Personalized Review Generation Peng Qiyao, Liu Hongtao, Xu Hongyan, Yang Qing, Shao Minglai, Wang Wenjun
- AI Revolution On Chat Bot: Evidence From A Randomized Controlled Experiment Peng Sida, Swiatek Wojciech, Gao Allen, Cullivan Paul, Chang Haoge
- Uncertainty-aware Explainable Recommendation With Large Language Models Peng Yicui, Chen Hao, Lin Chingsheng, Huang Guo, Hu Jinrong, Guo Hui, Kong Bin, Hu Shu, Wu Xi, Wang Xin
- Dreambench++: A Human-aligned Benchmark For Personalized Image Generation Peng Yuang, Cui Yuxin, Tang Haomiao, Qi Zekun, Dong Runpei, Bai Jing, Han Chunrui, Ge Zheng, Zhang Xiangyu, Xia Shu-tao
- Q-PEFT: Query-dependent Parameter Efficient Fine-tuning For Text Reranking With Large Language Models Peng Zhiyuan, Wu Xuyang, Wang Qifan, Rajanala Sravanthi, Fang Yi
- Check-eval: A Checklist-based Approach For Evaluating Text Quality Pereira Jayr, Assumpcao Andre, Lotufo Roberto
- Astrollama-chat: Scaling Astrollama With Conversational And Diverse Datasets Perkowski Ernest, Pan Rui, Nguyen Tuan Dung, Ting Yuan-sen, Kruk Sandor, Zhang Tong, O'neill Charlie, Jablonska Maja, Sun Zechang, Smith Michael J., Liu Huiling, Schawinski Kevin, Iyer Kartheik, Ciucă Ioana (for UniverseTBD)
- Analyzing Large Language Models Chatbots: An Experimental Approach Using A Probability Test Peruchini Melise, Teixeira Julio Monteiro
- Large Language Models Can Infer Personality From Free-form User Interactions Peters Heinrich, Cerf Moran, Matz Sandra C.
- Constitutionalexperts: Training A Mixture Of Principle-based Prompts Petridis Savvas, Wedin Ben, Yuan Ann, Wexler James, Thain Nithum
- Prompting A Pretrained Transformer Can Be A Universal Approximator Petrov Aleksandar, Torr Philip H. S., Bibi Adel
- Benchmarking GPT-4 On Algorithmic Problems: A Systematic Evaluation Of Prompting Strategies Petruzzellis Flavio, Testolin Alberto, Sperduti Alessandro
- H2o-danube3 Technical Report Pfeiffer Pascal, Singer Philipp, Babakhin Yauhen, Fodor Gabor, Dhankhar Nischay, Ambati Sri Satish
- Graph Guided Question Answer Generation For Procedural Question-answering Pham Hai X., Hadji Isma, Xu Xinnuo, Degutyte Ziedune, Rainey Jay, Kazakos Evangelos, Fazly Afsaneh, Tzimiropoulos Georgios, Martinez Brais
- Can Base Chatgpt Be Used For Forecasting Without Additional Optimization? Pham Van, Cunningham Scott
- Investigating The Effectiveness Of Hypertuning Via Gisting Phang Jason
- Sign Of The Times: Evaluating The Use Of Large Language Models For Idiomaticity Detection Phelps Dylan, Pickard Thomas, Mi Maggie, Gow-smith Edward, Villavicencio Aline
- Peering Into The Mind Of Language Models: An Approach For Attribution In Contextual Question Answering Phukan Anirudh, Somasundaram Shwetha, Saxena Apoorv, Goswami Koustava, Srinivasan Balaji Vasan
- Mamba-ptq: Outlier Channels In Recurrent Large Language Models Pierro Alessandro, Abreu Steven
- Beyond Metrics: A Critical Analysis Of The Variability In Large Language Model Evaluation Frameworks Pimentel Marco Af, Christophe Clément, Raha Tathagata, Munjal Prateek, Kanithi Praveen K, Khan Shadab
- Revisiting Verilogeval: Newer Llms, In-context Learning, And Specification-to-rtl Tasks Pinckney Nathaniel, Batten Christopher, Liu Mingjie, Ren Haoxing, Khailany Brucek
- Improving Context-aware Preference Modeling For Language Models Pitis Silviu, Xiao Ziang, Roux Nicolas Le, Sordoni Alessandro
- How Effective Are State Space Models For Machine Translation? Pitorro Hugo, Vasylenko Pavlo, Treviso Marcos, Martins André F. T.
- LM-PUB-QUIZ: A Comprehensive Framework For Zero-shot Evaluation Of Relational Knowledge In Language Models Ploner Max, Wiland Jacek, Pohl Sebastian, Akbik Alan
- Dissecting Language Models: Machine Unlearning Via Selective Pruning Pochinkov Nicholas, Schoots Nandi
- V-RECS, A Low-cost LLM4VIS Recommender With Explanations, Captioning And Suggestions Podo Luca, Angelini Marco, Velardi Paola
- Vi(e)va LLM! A Conceptual Stack For Evaluating And Interpreting Generative Ai-based Visualizations Podo Luca, Ishmal Muhammad, Angelini Marco
- Exploring The Efficacy Of Large Language Models (GPT-4) In Binary Reverse Engineering Pordanesh Saman, Tan Benjamin
- Evaluation Of Language Models In The Medical Context Under Resource-constrained Settings Posada Andrea, Rueckert Daniel, Meissen Felix, Müller Philip
- Deciphering The Factors Influencing The Efficacy Of Chain-of-thought: Probability, Memorization, And Noisy Reasoning Prabhakar Akshara, Griffiths Thomas L., Mccoy R. Thomas
- Sambanova SN40L: Scaling The AI Memory Wall With Dataflow And Composition Of Experts Prabhakar Raghu, Sivaramakrishnan Ram, Gandhi Darshan, Du Yun, Wang Mingran, Song Xiangyu, Zhang Kejie, Gao Tianren, Wang Angela, Li Karen, Sheng Yongning, Brot Joshua, Sokolov Denis, Vivek Apurv, Leung Calvin, Sabnis Arjun, Bai Jiayu, Zhao Tuowen, Gottscho Mark, Jackson David, Luttrell Mark, Shah Manish K., Chen Edison, Liang Kaizhao, Jain Swayambhoo, Thakker Urmish, Huang Dawei, Jairath Sumti, Brown Kevin J., Olukotun Kunle
- Interpreting Bias In Large Language Models: A Feature-based Approach Prakash Nirmalendu, Roy Lee Ka Wei
- Accelerating Large Language Model Pretraining Via LFR Pedagogy: Learn, Focus, And Review Prakriya Neha, Yen Jui-nan, Hsieh Cho-jui, Cong Jason
- Fractured-sorry-bench: Framework For Revealing Attacks In Conversational Turns Undermining Refusal Efficacy And Defenses Over Sorry-bench Priyanshu Aman, Vijay Supriti
- When Quantization Affects Confidence Of Large Language Models? Proskurina Irina, Brun Luc, Metzler Guillaume, Velcin Julien
- Code Prompting Elicits Conditional Reasoning Abilities In Text+code Llms Puerto Haritz, Tutek Martin, Aditya Somak, Zhu Xiaodan, Gurevych Iryna
- A Moral Imperative: The Need For Continual Superalignment Of Large Language Models Puthumanaillam Gokul, Vora Manav, Thangeda Pranay, Ornik Melkior
- Can LLM Generate Culturally Relevant Commonsense QA Data? Case Study In Indonesian And Sundanese Putri Rifki Afina, Haznitrama Faiz Ghifari, Adhista Dea, Oh Alice
- Less Is More: Accurate Speech Recognition & Translation Without Web-scale Data Puvvada Krishna C., Żelasko Piotr, Huang He, Hrinchuk Oleksii, Koluguri Nithin Rao, Dhawan Kunal, Majumdar Somshubra, Rastorgueva Elena, Chen Zhehuai, Lavrukhin Vitaly, Balam Jagadeesh, Ginsburg Boris
- Interactive Continual Learning: Fast And Slow Thinking Qi Biqing, Chen Xingquan, Gao Junqi, Li Dong, Liu Jianxing, Wu Ligang, Zhou Bowen
- Practical Token Pruning For Foundation Models In Few-shot Conversational Virtual Assistant Systems Qi Haode, Qian Cheng, Ni Jian, Singh Pratyush, Fazeli Reza, Wang Gengyu, Shu Zhongzheng, Wayne Eric, Bross Juergen
- Is Next Token Prediction Sufficient For GPT? Exploration On Code Logic Comprehension Qi Mengnan, Huang Yufan, Yao Yongqiang, Wang Maoquan, Gu Bin, Sundaresan Neel
- SNIFFER: Multimodal Large Language Model For Explainable Out-of-context Misinformation Detection Qi Peng, Yan Zehong, Hsu Wynne, Lee Mong Li
- Follow My Instruction And Spill The Beans: Scalable Data Extraction From Retrieval-augmented Generation Systems Qi Zhenting, Zhang Hanlin, Xing Eric, Kakade Sham, Lakkaraju Himabindu
- Enhancing Dialogue Generation In Werewolf Game Through Situation Analysis And Persuasion Strategies Qi Zhiyang, Inaba Michimasa
- Memorag: Moving Towards Next-gen RAG Via Memory-inspired Knowledge Discovery Qian Hongjin, Zhang Peitian, Liu Zheng, Mao Kelong, Dou Zhicheng
- How Easy Is It To Fool Your Multimodal Llms? An Empirical Analysis On Deceptive Prompts Qian Yusu, Zhang Haotian, Yang Yinfei, Gan Zhe
- Vl-mamba: Exploring State Space Models For Multimodal Learning Qiao Yanyuan, Yu Zheng, Guo Longteng, Chen Sihan, Zhao Zijia, Sun Mingzhen, Wu Qi, Liu Jing
- Prism: A Framework For Decoupling And Assessing The Capabilities Of Vlms Qiao Yuxuan, Duan Haodong, Fang Xinyu, Yang Junming, Chen Lin, Zhang Songyang, Wang Jiaqi, Lin Dahua, Chen Kai
- Supportiveness-based Knowledge Rewriting For Retrieval-augmented Language Modeling Qiao Zile, Ye Wei, Jiang Yong, Mo Tong, Xie Pengjun, Li Weiping, Huang Fei, Zhang Shikun
- Diffusiongpt: Llm-driven Text-to-image Generation System Qin Jie, Wu Jie, Chen Weifeng, Ren Yuxi, Li Huixia, Wu Hefeng, Xiao Xuefeng, Wang Rui, Wen Shilei
- The Uniqueness Of Llama3-70b With Per-channel Quantization: An Empirical Study Qin Minghai
- Empirical Guidelines For Deploying Llms Onto Resource-constrained Edge Devices Qin Ruiyang, Liu Dancheng, Yan Zheyu, Tan Zhaoxuan, Pan Zixuan, Jia Zhenge, Jiang Meng, Abbasi Ahmed, Xiong Jinjun, Shi Yiyu
- Infobench: Evaluating Instruction Following Ability In Large Language Models Qin Yiwei, Song Kaiqiang, Hu Yebowen, Yao Wenlin, Cho Sangwoo, Wang Xiaoyang, Wu Xuansheng, Liu Fei, Liu Pengfei, Yu Dong
- The Synergy Between Data And Multi-modal Large Language Models: A Survey From Co-development Perspective Qin Zhen, Chen Daoyuan, Zhang Wenhao, Yao Liuyi, Huang Yilun, Ding Bolin, Li Yaliang, Deng Shuiguang
- Lightning Attention-2: A Free Lunch For Handling Unlimited Sequence Lengths In Large Language Models Qin Zhen, Sun Weigao, Li Dong, Shen Xuyang, Sun Weixuan, Zhong Yiran
- Various Lengths, Constant Speed: Efficient Language Modeling With Lightning Attention Qin Zhen, Sun Weigao, Li Dong, Shen Xuyang, Sun Weixuan, Zhong Yiran
- Dissecting Multiplication In Transformers: Insights Into Llms Qiu Luyu, Li Jianing, Su Chi, Zhang Chen Jason, Chen Lei
- Can Large Language Models Understand Symbolic Graphics Programs? Qiu Zeju, Liu Weiyang, Feng Haiwen, Liu Zhen, Xiao Tim Z., Collins Katherine M., Tenenbaum Joshua B., Weller Adrian, Black Michael J., Schölkopf Bernhard
- Grammaticality Representation In Chatgpt As Compared To Linguists And Laypeople Qiu Zhuang, Duan Xufeng, Cai Zhenguang G.
- Layerwise Recurrent Router For Mixture-of-experts Qiu Zihan, Huang Zeyu, Cheng Shuang, Zhou Yizhi, Wang Zili, Titov Ivan, Fu Jie
- Tool Learning With Large Language Models: A Survey Qu Changle, Dai Sunhao, Wei Xiaochi, Cai Hengyi, Wang Shuaiqiang, Yin Dawei, Xu Jun, Wen Ji-rong
- Unsupervised Distractor Generation Via Large Language Model Distilling And Counterfactual Contrastive Decoding Qu Fanyi, Sun Hao, Wu Yunfang
- Before Generation, Align It! A Novel And Effective Strategy For Mitigating Hallucinations In Text-to-sql Generation Qu Ge, Li Jinyang, Li Bowen, Qin Bowen, Huo Nan, Ma Chenhao, Cheng Reynold
- Mupt: A Generative Symbolic Music Pretrained Transformer Qu Xingwei, Bai Yuelin, Ma Yinghao, Zhou Ziya, Lo Ka Man, Liu Jiaheng, Yuan Ruibin, Min Lejun, Liu Xueling, Zhang Tianyu, Du Xinrun, Guo Shuyue, Liang Yiming, Li Yizhi, Wu Shangda, Zhou Junting, Zheng Tianyu, Ma Ziyang, Han Fengze, Xue Wei, Xia Gus, Benetos Emmanouil, Yue Xiang, Lin Chenghua, Tan Xu, Huang Stephen W., Fu Jie, Zhang Ge
- Coursegpt-zh: An Educational Large Language Model Based On Knowledge Distillation Incorporating Prompt Optimization Qu Zheyan, Yin Lu, Yu Zitong, Wang Wenbo, Zhang Xing
- Memory-augmented Generative Adversarial Transformers Raaijmakers Stephan, Bakker Roos, Cremers Anita, De Kleijn Roy, Kouwenhoven Tom, Verhoef Tessa
- Llms Among Us: Generative AI Participating In Digital Discourse Radivojevic Kristina, Clark Nicholas, Brenner Paul
- Simultaneous Masking, Not Prompting Optimization: A Paradigm Shift In Fine-tuning Llms For Simultaneous Translation Raffel Matthew, Agostinelli Victor, Chen Lizhong
- Defan: Definitive Answer Dataset For Llms Hallucination Evaluation Rahman A B M Ashikur, Anwar Saeed, Usman Muhammad, Mian Ajmal
- Code Hallucination Rahman Mirza Masfiqur, Kundu Ashish
- Report On The 1st Workshop On Large Language Model For Evaluation In Information Retrieval (llm4eval 2024) At SIGIR 2024 Rahmani Hossein A., Siro Clemencia, Aliannejadi Mohammad, Craswell Nick, Clarke Charles L. A., Faggioli Guglielmo, Mitra Bhaskar, Thomas Paul, Yilmaz Emine
- An Investigation Of Neuron Activation As A Unified Lens To Explain Chain-of-thought Eliciting Arithmetic Reasoning Of Llms Rai Daking, Yao Ziyu
- GSR-BENCH: A Benchmark For Grounded Spatial Reasoning Evaluation Via Multimodal Llms Rajabi Navid, Kosecka Jana
- Knowledge-based Consistency Testing Of Large Language Models Rajan Sai Sathiesh, Soremekun Ezekiel, Chattopadhyay Sudipta
- Scavenging Hyena: Distilling Transformers Into Long Convolution Models Ralambomihanta Tokiniaina Raharison, Mohammadzadeh Shahrad, Islam Mohammad Sami Nur, Jabbour Wassim, Liang Laurence
- Analyzing LLM Behavior In Dialogue Summarization: Unveiling Circumstantial Hallucination Trends Ramprasad Sanjana, Ferracane Elisa, Lipton Zachary C.
- The Battle Of Llms: A Comparative Study In Conversational QA Tasks Rangapur Aryan, Rangapur Aman
- Narrow Transformer: Starcoder-based Java-lm For Desktop Rathinasamy Kamalkumar, J Balaji A, Kumar Ankush, Gayari Gagan, K Harshini, Mondal Rajab Ali, S Sreenivasa Raghavan K, Singh Swayam, Tarafdar Mohammed Rafee
- Lynx: An Open Source Hallucination Evaluation Model Ravi Selvan Sunitha, Mielczarek Bartosz, Kannappan Anand, Kiela Douwe, Qian Rebecca
- FACTOID: Factual Entailment For Hallucination Detection Rawte Vipula, Tonmoy S. M Towhidul Islam, Rajbangshi Krishnav, Nag Shravani, Chadha Aman, Sheth Amit P., Das Amitava
- Optimizing Visual Question Answering Models For Driving: Bridging The Gap Between Human And Machine Attention Patterns Rekanar Kaavya, Hayes Martin, Sistu Ganesh, Eising Ciaran
- Identifying Semantic Induction Heads To Understand In-context Learning Ren Jie, Guo Qipeng, Yan Hang, Liu Dongrui, Zhang Quanshi, Qiu Xipeng, Lin Dahua
- Unveiling And Mitigating Memorization In Text-to-image Diffusion Models Through Cross Attention Ren Jie, Li Yaxin, Zeng Shenglai, Xu Han, Lyu Lingjuan, Xing Yue, Tang Jiliang
- Samba: Simple Hybrid State Space Models For Efficient Unlimited Context Language Modeling Ren Liliang, Liu Yang, Lu Yadong, Shen Yelong, Liang Chen, Chen Weizhu
- Codeattack: Revealing Safety Generalization Challenges Of Large Language Models Via Code Completion Ren Qibing, Gao Chang, Shao Jing, Yan Junchi, Tan Xin, Lam Wai, Ma Lizhuang
- On The Efficacy Of Eviction Policy For Key-value Constrained Generative Language Model Inference Ren Siyu, Zhu Kenny Q.
- Healthcare Copilot: Eliciting The Power Of General Llms For Medical Consultation Ren Zhiyao, Zhan Yibing, Yu Baosheng, Ding Liang, Tao Dacheng
- The Benefits Of A Concise Chain Of Thought On Problem-solving In Large Language Models Renze Matthew, Guven Erhan
- Are Electra's Sentence Embeddings Beyond Repair? The Case Of Semantic Textual Similarity Rep Ivan, Dukić David, Šnajder Jan
- Self-generated Replay Memories For Continual Neural Machine Translation Resta Michele, Bacciu Davide
- Evaluating Llms For Gender Disparities In Notable Persons Rhue Lauren, Goethals Sofie, Sundararajan Arun
- Practical And Reproducible Symbolic Music Generation By Large Language Models With Structural Embeddings Rhyu Seungyeon, Yang Kichang, Cho Sungjun, Kim Jaehyeon, Lee Kyogu, Lee Moontae
- Code Generation With Alphacodium: From Prompt Engineering To Flow Engineering Ridnik Tal, Kredo Dedy, Friedman Itamar
- Empirical Study Of Symmetrical Reasoning In Conversational Chatbots Rim Daniela N., Choi Heeyoul
- Simulating Weighted Automata Over Sequences And Trees With Transformers Rizvi Michael, Lizaire Maude, Lacroce Clara, Rabusseau Guillaume
- Pretrained Hybrids With MAD Skills Roberts Nicholas, Guo Samuel, Gao Zhiqi, Gnvv Satya Sai Srinath Namburi, Cromp Sonia, Wu Chengjun, Duan Chengyu, Sala Frederic
- ACCORD: Closing The Commonsense Measurability Gap Roewer-després François, Feng Jinyue, Zhu Zining, Rudzicz Frank
- Exploring Advanced Large Language Models With Llmsuite Roffo Giorgio
- Unibucllm: Harnessing Llms For Automated Prediction Of Item Difficulty And Response Time For Multiple-choice Questions Rogoz Ana-cristina, Ionescu Radu Tudor
- Talking To Machines: Do You Read Me? Rojas-barahona Lina M.
- Zero-shot Spam Email Classification Using Pre-trained Large Language Models Rojas-galeano Sergio
- Question-instructed Visual Descriptions For Zero-shot Video Question Answering Romero David, Solorio Thamar
- Do GPT Language Models Suffer From Split Personality Disorder? The Advent Of Substrate-free Psychometrics Romero Peter, Fitz Stephen, Nakatsuma Teruo
- Exploring The Potential Of The Large Language Models (llms) In Identifying Misleading News Headlines Rony Md Main Uddin, Haque Md Mahfuzul, Ali Mohammad, Alam Ahmed Shatil, Hassan Naeemul
- An Early Categorization Of Prompt Injection Attacks On Large Language Models Rossi Sippo, Michel Alisia Marianne, Mukkamala Raghava Rao, Thatcher Jason Bennett
- Generative AI For Enhancing Active Learning In Education: A Comparative Study Of GPT-3.5 And GPT-4 In Crafting Customized Test Questions Rouzegar Hamdireza, Makrehchi Masoud
- Enhancing Low-resource NMT With A Multilingual Encoder And Knowledge Distillation: A Case Study Roy Aniruddha, Ray Pretam, Maheshwari Ayush, Sarkar Sudeshna, Goyal Pawan
- Convolutional Prompting Meets Language Models For Continual Learning Roy Anurag, Moulick Riddhiman, Verma Vinay K., Ghosh Saptarshi, Das Abir
- ERATTA: Extreme RAG For Table To Answers With Large Language Models Roychowdhury Sohini, Krema Marko, Mahammad Anvar, Moore Brian, Mukherjee Arijit, Prakashchandra Punit
- From PEFT To DEFT: Parameter Efficient Finetuning For Reducing Activation Density In Transformers Runwal Bharat, Pedapati Tejaswini, Chen Pin-yu
- Explorations Of Self-repair In Language Models Rushing Cody, Nanda Neel
- Writing In The Margins: Better Inference Pattern For Long Context Retrieval Russak Melisa, Jamil Umar, Bryant Christopher, Kamble Kiran, Magnuson Axel, Russak Mateusz, Alshikh Waseem
- Code-aware Prompting: A Study Of Coverage Guided Test Generation In Regression Setting Using LLM Ryan Gabriel, Jain Siddhartha, Shang Mingyue, Wang Shiqi, Ma Xiaofei, Ramanathan Murali Krishna, Ray Baishakhi
- How Chatgpt Changed The Media's Narratives On AI: A Semi-automated Narrative Analysis Through Frame Semantics Ryazanov Igor, Öhman Carl, Björklund Johanna
- Tandem Transformers For Inference Efficient Llms S Aishwarya P, Nair Pranav Ajit, Samaga Yashas, Boyd Toby, Kumar Sanjiv, Jain Prateek, Netrapalli Praneeth
- Capabilities Of Gemini Models In Medicine Saab Khaled, Tu Tao, Weng Wei-hung, Tanno Ryutaro, Stutz David, Wulczyn Ellery, Zhang Fan, Strother Tim, Park Chunjong, Vedadi Elahe, Chaves Juanma Zambrano, Hu Szu-yeu, Schaekermann Mike, Kamath Aishwarya, Cheng Yong, Barrett David G. T., Cheung Cathy, Mustafa Basil, Palepu Anil, Mcduff Daniel, Hou Le, Golany Tomer, Liu Luyang, Alayrac Jean-baptiste, Houlsby Neil, Tomasev Nenad, Freyberg Jan, Lau Charles, Kemp Jonas, Lai Jeremy, Azizi Shekoofeh, Kanada Kimberly, Man Siwai, Kulkarni Kavita, Sun Ruoxi, Shakeri Siamak, He Luheng, Caine Ben, Webson Albert, Latysheva Natasha, Johnson Melvin, Mansfield Philip, Lu Jian, Rivlin Ehud, Anderson Jesper, Green Bradley, Wong Renee, Krause Jonathan, Shlens Jonathon, Dominowska Ewa, Eslami S. M. Ali, Chou Katherine, Cui Claire, Vinyals Oriol, Kavukcuoglu Koray, Manyika James, Dean Jeff, Hassabis Demis, Matias Yossi, Webster Dale, Barral Joelle, Corrado Greg, Semturs Christopher, Mahdavi S. Sara, Gottweis Juraj, Karthikesalingam Alan, Natarajan Vivek
- Benchmarking And Building Long-context Retrieval Models With Loco And M2-BERT Saad-falcon Jon, Fu Daniel Y., Arora Simran, Guha Neel, Ré Christopher
- Are Llms Good Cryptic Crossword Solvers? Sadallah Abdelrahman "boda", Kotova Daria, Kochmar Ekaterina
- Utebc-nlp At Semeval-2024 Task 9: Can Llms Be Lateral Thinkers? Sadeghi Pouya, Abaskohi Amirhossein, Yaghoobzadeh Yadollah
- Vision-language And Large Language Model Performance In Gastroenterology: GPT, Claude, Llama, Phi, Mistral, Gemma, And Quantized Models Safavi-naini Seyed Amir Ahmad, Ali Shuhaib, Shahab Omer, Shahhoseini Zahra, Savage Thomas, Rafiee Sara, Samaan Jamil S, Shabeeb Reem Al, Ladak Farah, Yang Jamie O, Echavarria Juan, Babar Sumbal, Shaukat Aasma, Margolis Samuel, Tatonetti Nicholas P, Nadkarni Girish, Kurdi Bara El, Soroush Ali
- Neurocache: Efficient Vector Retrieval For Long-range Language Modeling Safaya Ali, Yuret Deniz
- On Zero-shot Counterspeech Generation By Llms Saha Punyajoy, Agrawal Aalok, Jana Abhik, Biemann Chris, Mukherjee Animesh
- Mixsumm: Topic-based Data Augmentation Using Llms For Low-resource Extractive Text Summarization Sahu Gaurav, Laradji Issam H.
- LUNA: A Framework For Language Understanding And Naturalness Assessment Saidov Marat, Bakalova Aleksandra, Taktasheva Ekaterina, Mikhailov Vladislav, Artemova Ekaterina
- Risks, Causes, And Mitigations Of Widespread Deployments Of Large Language Models (llms): A Survey Sakib Md Nazmus, Islam Md Athikul, Pathak Royal, Arifin Md Mashrur
- Algorithmic Language Models With Neurally Compiled Libraries Saldyt Lucas, Kambhampati Subbarao
- Large Language Models Show Human-like Social Desirability Biases In Survey Responses Salecha Aadesh, Ireland Molly E., Subrahmanya Shashanka, Sedoc João, Ungar Lyle H., Eichstaedt Johannes C.
- Synergizing In-context Learning With Hints For End-to-end Task-oriented Dialog Systems Saley Vishal Vivek, Das Rocktim Jyoti, Raghu Dinesh, Mausam
- Berts Are Generative In-context Learners Samuel David
- Zero-shot RTL Code Generation With Attention Sink Augmented Large Language Models Sandal Selim, Akturk Ismail
- The Future Of Large Language Model Pre-training Is Federated Sani Lorenzo, Iacob Alex, Cao Zeyu, Marino Bill, Gao Yan, Paulik Tomas, Zhao Wanru, Shen William F., Aleksandrov Preslav, Qiu Xinchi, Lane Nicholas D.
- Retrieval-augmented Generation Meets Data-driven Tabula Rasa Approach For Temporal Knowledge Graph Forecasting Sannidhi Geethan, Sakhinana Sagar Srinivas, Runkana Venkataramana
- Lean Attention: Hardware-aware Scalable Attention Mechanism For The Decode-phase Of Transformers Sanovar Rya, Bharadwaj Srikant, Amant Renee St., Rühle Victor, Rajmohan Saravan
- Advancing Generative AI For Portuguese With Open Decoder Gervásio PT* Santos Rodrigo, Silva João, Gomes Luís, Rodrigues João, Branco António
- Prompting Llms To Compose Meta-review Drafts From Peer-review Narratives Of Scholarly Manuscripts Santu Shubhra Kanti Karmaker, Sinha Sanjeev Kumar, Bansal Naman, Knipper Alex, Sarkar Souvika, Salvador John, Mahajan Yash, Guttikonda Sri, Akter Mousumi, Freestone Matthew, Williams Matthew C. Jr
- Are Machines Better At Complex Reasoning? Unveiling Human-machine Inference Gaps In Entailment Verification Sanyal Soumya, Xiao Tianyi, Liu Jiacheng, Wang Wenya, Ren Xiang
- Pre-training Small Base Lms With Fewer Tokens Sanyal Sunny, Sanghavi Sujay, Dimakis Alexandros G.
- ICAL: Continual Learning Of Multimodal Agents By Transforming Trajectories Into Actionable Insights Sarch Gabriel, Jang Lawrence, Tarr Michael J., Cohen William W., Marino Kenneth, Fragkiadaki Katerina
- Llms As On-demand Customizable Service Sarkar Souvika, Babar Mohammad Fakhruddin, Hasan Monowar, Santu Shubhra Kanti Karmaker
- Can Llms Master Math? Investigating Large Language Models On Math Stack Exchange Satpute Ankit, Giessing Noah, Greiner-petter Andre, Schubotz Moritz, Teschke Olaf, Aizawa Akiko, Gipp Bela
- A Prompt Response To The Demand For Automatic Gender-neutral Translation Savoldi Beatrice, Piergentili Andrea, Fucci Dennis, Negri Matteo, Bentivogli Luisa
- Evaluating Consistency And Reasoning Capabilities Of Large Language Models Saxena Yash, Chopra Sarthak, Tripathi Arunendra Mani
- Improving The Validity Of Automatically Generated Feedback Via Reinforcement Learning Scarlatos Alexander, Smith Digory, Woodhead Simon, Lan Andrew
- Impacts Of Anthropomorphizing Large Language Models In Learning Environments Schaaff Kristina, Heidelmann Marc-andré
- The Human Factor In Detecting Errors Of Large Language Models: A Systematic Literature Review And Future Research Directions Schiller Christian A.
- Pitfalls Of Conversational Llms On News Debiasing Schlicht Ipek Baris, Altiok Defne, Taouk Maryanne, Flek Lucie
- Detecting Generated Native Ads In Conversational Search Schmidt Sebastian, Zelch Ines, Bevendorff Janek, Stein Benno, Hagen Matthias, Potthast Martin
- Ruffle&riley: Insights From Designing And Evaluating A Large Language Model-based Conversational Tutoring System Schmucker Robin, Xia Meng, Azaria Amos, Mitchell Tom
- M5 -- A Diverse Benchmark To Assess The Performance Of Large Multimodal Models Across Multilingual And Multicultural Vision-language Tasks Schneider Florian, Sitaram Sunayana
- Exploring Human-llm Conversations: Mental Models And The Originator Of Toxicity Schneider Johannes, Flores Arianna Casanova, Kranz Anne-catherine
- Morehopqa: More Than Multi-hop Reasoning Schnitzler Julian, Ho Xanh, Huang Jiahao, Boudin Florian, Sugawara Saku, Aizawa Akiko
- Metacognitive Myopia In Large Language Models Scholten Florian, Rebholz Tobias R., Hütter Mandy
- THREAD: Thinking Deeper With Recursive Spawning Schroeder Philip, Morgan Nathaniel, Luo Hongyin, Glass James
- Team UTSA-NLP At Semeval 2024 Task 5: Prompt Ensembling For Argument Reasoning In Civil Procedures With GPT4 Schumacher Dan, Rios Anthony
- Large Language Models To Generate System-level Test Programs Targeting Non-functional Properties Schwachhofer Denis, Domanski Peter, Becker Steffen, Wagner Stefan, Sauer Matthias, Pflüger Dirk, Polian Ilia
- Speech Reallm -- Real-time Streaming Speech Recognition With Multimodal Llms By Teaching The Flow Of Time Seide Frank, Doulaty Morrie, Shi Yangyang, Gaur Yashesh, Jia Junteng, Wu Chunyang
- Train-attention: Meta-learning Where To Focus In Continual Knowledge Learning Seo Yeongbin, Lee Dongha, Yeo Jinyoung
- Multi-modal Adapter For Vision-language Models Seputis Dominykas, Mihailov Serghei, Chatterjee Soham, Xiao Zehao
- Surprising Efficacy Of Fine-tuned Transformers For Fact-checking Over Larger Language Models Setty Vinay
- Benchmarking Large Language Models For Math Reasoning Tasks Seßler Kathrin, Rong Yao, Gözlüklü Emek, Kasneci Enkelejda
- Medpromptx: Grounded Multimodal Prompting For Chest X-ray Diagnosis Shaaban Mai A., Khan Adnan, Yaqub Mohammad
- Tencdm: Understanding The Properties Of Diffusion Model In The Space Of Language Model Encodings Shabalin Alexander, Meshchaninov Viacheslav, Chimbulatov Egor, Lapikov Vladislav, Kim Roman, Bartosh Grigory, Molchanov Dmitry, Markov Sergey, Vetrov Dmitry
- A Fine-grained Sentiment Analysis Of App Reviews Using Large Language Models: An Evaluation Study Shah Faiz Ali, Sabir Ahmed, Sharma Rajesh
- Flashattention-3: Fast And Accurate Attention With Asynchrony And Low-precision Shah Jay, Bikshandi Ganesh, Zhang Ying, Thakkar Vijay, Ramani Pradeep, Dao Tri
- Putting Gpt-4o To The Sword: A Comprehensive Evaluation Of Language, Vision, Speech, And Multimodal Proficiency Shahriar Sakib, Lund Brady, Mannuru Nishith Reddy, Arshad Muhammad Arbab, Hayawi Kadhim, Bevara Ravi Varma Kumar, Mannuru Aashrith, Batool Laiba
- Evaluating Text Summaries Generated By Large Language Models Using Openai's GPT Shakil Hassan, Mahi Atqiya Munawara, Nguyen Phuoc, Ortiz Zeydy, Mardini Mamoun T.
- Llava-prumerge: Adaptive Token Reduction For Efficient Large Multimodal Models Shang Yuzhang, Cai Mu, Xu Bingxin, Lee Yong Jae, Yan Yan
- Extensible Embedding: A Flexible Multipler For Llm's Context Length Shao Ninglu, Xiao Shitao, Liu Zheng, Zhang Peitian
- Combining Multiple Post-training Techniques To Achieve Most Efficient Quantized Llms Sharify Sayeh, Xu Zifei, Yazar Wanzin, Wang Xin
- Unirag: Universal Retrieval Augmentation For Multi-modal Large Language Models Sharifymoghaddam Sahel, Upadhyay Shivani, Chen Wenhu, Lin Jimmy
- A Critical Evaluation Of AI Feedback For Aligning Large Language Models Sharma Archit, Keh Sedrick, Mitchell Eric, Finn Chelsea, Arora Kushal, Kollar Thomas
- Patched MOA: Optimizing Inference For Diverse Software Development Tasks Sharma Asankhaya
- SPML: A DSL For Defending Language Models Against Prompt Attacks Sharma Reshabh K, Gupta Vinayak, Grossman Dan
- A Preliminary Study On Using Large Language Models In Software Pentesting Shashwat Kumar, Hahn Francis, Ou Xinming, Goldgof Dmitry, Hall Lawrence, Ligatti Jay, Rajgopalan S. Raj, Tabari Armin Ziaie
- Mammothmoda: Multi-modal Large Language Model She Qi, Pan Junwen, Wan Xin, Zhang Rui, Lu Dawei, Huang Kai
- Rumour Evaluation With Very Large Language Models Shehata Dahlia, Cohen Robin, Clarke Charles
- Rapid Optimization For Jailbreaking Llms Via Subconscious Exploitation And Echopraxia Shen Guangyu, Cheng Siyuan, Zhang Kaiyuan, Tao Guanhong, An Shengwei, Yan Lu, Zhang Zhuo, Ma Shiqing, Zhang Xiangyu
- Shortcutsbench: A Large-scale Real-world Benchmark For Api-based Agents Shen Haiyang, Li Yue, Meng Desong, Cai Dongqi, Qi Sheng, Zhang Li, Xu Mengwei, Ma Yun
- Pyramid Coder: Hierarchical Code Generator For Compositional Visual Question Answering Shen Ruoyue, Inoue Nakamasa, Shinoda Koichi
- Improving Reinforcement Learning From Human Feedback Using Contrastive Rewards Shen Wei, Zhang Xiaoying, Yao Yuanshun, Zheng Rui, Guo Hongyi, Liu Yang
- PMG: Personalized Multimodal Generation With Large Language Models Shen Xiaoteng, Zhang Rui, Zhao Xiaoyan, Zhu Jieming, Xiao Xi
- An Empirical Study On Context Length For Open-domain Dialog Generation Shen Xinyi, Lin Zuoquan
- Scaling Laws For Linear Complexity Language Models Shen Xuyang, Li Dong, Leng Ruitao, Qin Zhen, Sun Weigao, Zhong Yiran
- Jetmoe: Reaching Llama2 Performance With 0.1M Dollars Shen Yikang, Guo Zhen, Cai Tianle, Qin Zengyi
- A Fine-tuning Dataset And Benchmark For Large Language Models For Protein Understanding Shen Yiqing, Chen Zan, Mamalakis Michail, He Luhan, Xia Haiyang, Li Tianbin, Su Yanzhou, He Junjun, Wang Yu Guang
- Toursynbio: A Multi-modal Large Model And Agent Framework To Bridge Text And Protein Sequences For Protein Engineering Shen Yiqing, Chen Zan, Mamalakis Michail, Liu Yungeng, Li Tianbin, Su Yanzhou, He Junjun, Liò Pietro, Wang Yu Guang
- Language Models Encode Collaborative Signals In Recommendation Sheng Leheng, Zhang An, Zhang Yi, Chen Yuxin, Wang Xiang, Chua Tat-seng
- Repeval: Effective Text Evaluation With LLM Representation Sheng Shuqian, Xu Yi, Zhang Tianhang, Shen Zanwei, Fu Luoyi, Ding Jiaxin, Zhou Lei, Wang Xinbing, Zhou Chenghu
- Lumos: Empowering Multimodal Llms With Scene Text Recognition Shenoy Ashish, Lu Yichao, Jayakumar Srihari, Chatterjee Debojeet, Moslehpour Mohsen, Chuang Pierce, Harpale Abhay, Bhardwaj Vikas, Xu Di, Zhao Shicong, Zhao Longfang, Ramchandani Ankit, Dong Xin Luna, Kumar Anuj
- Can Language Models Explain Their Own Classification Behavior? Sherburn Dane, Chughtai Bilal, Evans Owain
- Using Chatgpt To Score Essays And Short-form Constructed Responses Shermis Mark D.
- Judging The Judges: A Systematic Investigation Of Position Bias In Pairwise Comparative Assessments By Llms Shi Lin, Ma Chiyu, Ma Weicheng, Vosoughi Soroush
- Paying More Attention To Image: A Training-free Method For Alleviating Hallucination In Lvlms Shi Liu, Kecheng Zheng, Wei Chen
- Keep The Cost Down: A Review On Methods To Optimize LLM's Kv-cache Consumption Shi Luohe, Zhang Hongyi, Yao Yao, Li Zuchao, Zhao Hai
- Eagle: Exploring The Design Space For Multimodal Llms With Mixture Of Encoders Shi Min, Liu Fuxiao, Wang Shihao, Liao Shijia, Radhakrishnan Subhashree, Huang De-an, Yin Hongxu, Sapra Karan, Yacoob Yaser, Shi Humphrey, Catanzaro Bryan, Tao Andrew, Kautz Jan, Yu Zhiding, Liu Guilin
- Action Controlled Paraphrasing Shi Ning, Wu Zijun
- Math-llava: Bootstrapping Mathematical Reasoning For Multimodal Large Language Models Shi Wenhao, Hu Zhiqiang, Bin Yi, Liu Junhua, Yang Yang, Ng See-kiong, Bing Lidong, Lee Roy Ka-wei
- Medadapter: Efficient Test-time Adaptation Of Large Language Models Towards Medical Reasoning Shi Wenqi, Xu Ran, Zhuang Yuchen, Yu Yue, Wu Hang, Yang Carl, Wang May D.
- Why Larger Language Models Do In-context Learning Differently? Shi Zhenmei, Wei Junyi, Xu Zhuoyan, Liang Yingyu
- Abusegpt: Abuse Of Generative AI Chatbots To Create Smishing Campaigns Shibli Ashfak Md, Pritom Mir Mehedi A., Gupta Maanak
- CAUS: A Dataset For Question Generation Based On Human Cognition Leveraging Large Language Models Shin Minjung, Kim Donghyun, Ryu Jeh-kwang
- Deconstructing In-context Learning: Understanding Prompts Via Corruption Shivagunde Namrata, Lialin Vladislav, Muckatira Sherin, Rumshisky Anna
- Implicit Multimodal Alignment: On The Generalization Of Frozen Llms To Multimodal Inputs Shukor Mustafa, Cord Matthieu
- GPT-4 Understands Discourse At Least As Well As Humans Do Shultz Thomas, Wise Jamie, Nobandegani Ardavan Salehi
- Design2code: How Far Are We From Automating Front-end Engineering? Si Chenglei, Zhang Yanzhe, Yang Zhengyuan, Liu Ruibo, Yang Diyi
- Where Does In-context Translation Happen In Large Language Models Sia Suzanna, Mueller David, Duh Kevin
- The Fault In Our Stars: Quality Assessment Of Code Generation Benchmarks Siddiq Mohammed Latif, Dristi Simantika, Saha Joy, Santos Joanna C. S.
- Attention Overflow: Language Model Input Blur During Long-context Missing Items Recommendation Sileo Damien
- Scaling Synthetic Logical Reasoning Datasets With Context-sensitive Declarative Grammars Sileo Damien
- Constructing Benchmarks And Interventions For Combating Hallucinations In Llms Simhi Adi, Herzig Jonathan, Szpektor Idan, Belinkov Yonatan
- Robustness Of Llms To Perturbations In Text Singh Ayush, Singh Navpreet, Vatsal Shubham
- Indicgenbench: A Multilingual Benchmark To Evaluate Generation Capabilities Of Llms On Indic Languages Singh Harman, Gupta Nitish, Bharadwaj Shikhar, Tewari Dinesh, Talukdar Partha
- Exposing The Achilles' Heel: Evaluating Llms Ability To Handle Mistakes In Mathematical Reasoning Singh Joykirat, Nambi Akshay, Vineet Vibhav
- Llm-dcache: Improving Tool-augmented Llms With Gpt-driven Localized Data Caching Singh Simranjit, Fore Michael, Karatzas Andreas, Lee Chaehong, Jian Yanan, Shangguan Longfei, Yu Fuxun, Anagnostopoulos Iraklis, Stamoulis Dimitrios
- Evaluating Zero-shot GPT-4V Performance On 3D Visual Question Answering Benchmarks Singh Simranjit, Pavlakos Georgios, Stamoulis Dimitrios
- Are Small Language Models Ready To Compete With Large Language Models For Practical Applications? Sinha Neelabh, Jain Vinija, Chadha Aman
- Unipa-gpt: Large Language Models For University-oriented QA In Italian Siragusa Irene, Pirrone Roberto
- Using Adaptive Empathetic Responses For Teaching English Siyan Li, Shao Teresa, Yu Zhou, Hirschberg Julia
- The Mysterious Case Of Neuron 1512: Injectable Realignment Architectures Reveal Internal Characteristics Of Meta's Llama 2 Model Smith Brenden, Baker Dallin, Chase Clayton, Barney Myles, Parker Kaden, Allred Makenna, Hu Peter, Evans Alex, Fulda Nancy
- Chatting Up Attachment: Using Llms To Predict Adult Bonds Soares Paulo, Mccurdy Sean, Gerber Andrew J., Fonagy Peter
- Multi-task Inference: Can Large Language Models Follow Multiple Instructions At Once? Son Guijin, Baek Sangwon, Nam Sangdae, Jeong Ilgyun, Kim Seungone
- KMMLU: Measuring Massive Multitask Language Understanding In Korean Son Guijin, Lee Hanwool, Kim Sungdong, Kim Seungone, Muennighoff Niklas, Choi Taekyoon, Park Cheonbok, Yoo Kang Min, Biderman Stella
- Prefixing Attention Sinks Can Mitigate Activation Outliers For Large Language Model Quantization Son Seungwoo, Park Wonpyo, Han Woohyun, Kim Kyuyeun, Lee Jaeho
- Moviechat+: Question-aware Sparse Memory For Long Video Question Answering Song Enxin, Chai Wenhao, Ye Tian, Hwang Jenq-neng, Li Xi, Wang Gaoang
- Increasing Model Capacity For Free: A Simple Strategy For Parameter Efficient Fine-tuning Song Haobo, Zhao Hao, Majumder Soumajit, Lin Tao
- Out-of-distribution Generalization Via Composition: A Lens Through Induction Heads In Transformers Song Jiajun, Xu Zhuoyan, Zhong Yiqiao
- Multilingual Blending: LLM Safety Alignment Evaluation With Language Mixture Song Jiayang, Huang Yuheng, Zhou Zhehua, Ma Lei
- Moma: Multimodal LLM Adapter For Fast Personalized Image Generation Song Kunpeng, Zhu Yizhe, Liu Bingchen, Yan Qing, Elgammal Ahmed, Yang Xiao
- Can Many-shot In-context Learning Help Long-context LLM Judges? See More, Judge Better! Song Mingyang, Zheng Mao, Luo Xuan
- M3GIA: A Cognition Inspired Multilingual And Multimodal General Intelligence Ability Benchmark Song Wei, Li Yadong, Xu Jianhua, Wu Guowei, Ming Lingfeng, Yi Kexin, Luo Weihua, Li Houyi, Du Yi, Guo Fangda, Yu Kaicheng
- Hierarchical Context Merging: Better Long Context Understanding For Pre-trained Llms Song Woomin, Oh Seunghyuk, Mo Sangwoo, Kim Jaehyung, Yun Sukmin, Ha Jung-woo, Shin Jinwoo
- Position: Leverage Foundational Models For Black-box Optimization Song Xingyou, Tian Yingtao, Lange Robert Tjarko, Lee Chansoo, Tang Yujin, Chen Yutian
- Integrating AI In College Education: Positive Yet Mixed Experiences With Chatgpt Song Xinrui, Zhang Jiajin, Yan Pingkun, Hahn Juergen, Kruger Uwe, Mohamed Hisham, Wang Ge
- The Good, The Bad, And The Greedy: Evaluation Of Llms Should Not Ignore Non-determinism Song Yifan, Wang Guoyin, Li Sujian, Lin Bill Yuchen
- Sharelora: Parameter Efficient And Robust Large Language Model Fine-tuning Via Shared Low-rank Adaptation Song Yurun, Zhao Junchen, Harris Ian G., Jyothi Sangeetha Abdu
- Moviellm: Enhancing Long Video Understanding With Ai-generated Movies Song Zhende, Wang Chenchen, Sheng Jiamu, Zhang Chi, Yu Gang, Fan Jiayuan, Chen Tao
- Llms Simulate Big Five Personality Traits: Further Evidence Sorokovikova Aleksandra, Fedorova Natalia, Rezagholi Sharwin, Yamshchikov Ivan P.
- Empo: Theory-driven Dataset Construction For Empathetic Response Generation Through Preference Optimization Sotolar Ondrej
- Views Are My Own, But Also Yours: Benchmarking Theory Of Mind Using Common Ground Soubki Adil, Murzaku John, Jordehi Arash Yousefi, Zeng Peter, Markowska Magdalena, Mirroshandel Seyed Abolghasem, Rambow Owen
- Enhancing Ai-driven Psychological Consultation: Layered Prompts With Large Language Models Souza Rafael, Lim Jia-hao, Davis Alexander
- Chatgpt For Conversational Recommendation: Refining Recommendations By Reprompting With Feedback Spurlock Kyle Dylan, Acun Cagla, Saka Esin, Nasraoui Olfa
- Canttalkaboutthis: Aligning Language Models To Stay On Topic In Dialogues Sreedhar Makesh Narsimhan, Rebedea Traian, Ghosh Shaona, Zeng Jiaqi, Parisien Christopher
- Probing Multimodal Llms As World Models For Driving Sreeram Shiva, Wang Tsun-hsuan, Maalouf Alaa, Rosman Guy, Karaman Sertac, Rus Daniela
- Enhancing Temporal Understanding In Audio Question Answering For Large Audio Language Models Sridhar Arvind Krishna, Guo Yinyi, Visser Erik
- Interpreting And Learning Voice Commands With A Large Language Model For A Robot System Stankevich Stanislau, Dudek Wojciech
- Fine-tuning Large Language Models For Entity Matching Steiner Aaron, Peeters Ralph, Bizer Christian
- RLVF: Learning From Verbal Feedback Without Overgeneralization Stephan Moritz, Khazatsky Alexander, Mitchell Eric, Chen Annie S, Hsu Sheryl, Sharma Archit, Finn Chelsea
- Satyrn: A Platform For Analytics Augmented Generation Sterbentz Marko, Barrie Cameron, Shahi Shubham, Dutta Abhratanu, Hooshmand Donna, Pack Harper, Hammond Kristian J.
- Few-shot VQA With Frozen Llms: A Tale Of Two Approaches Sterner Igor, Lin Weizhe, Chen Jinghong, Byrne Bill
- Llms Achieve Adult Human Performance On Higher-order Theory Of Mind Tasks Street Winnie, Siy John Oliver, Keeling Geoff, Baranes Adrien, Barnett Benjamin, Mckibben Michael, Kanyere Tatenda, Lentz Alison, Aguera Y Arcas Blaise, Dunbar Robin I. M.
- Tailoring Vaccine Messaging With Common-ground Opinions Stureborg Rickard, Chen Sanxing, Xie Ruoyu, Patel Aayushi, Li Christopher, Zhu Chloe Qinyu, Hu Tingnan, Yang Jun, Dhingra Bhuwan
- ARKS: Active Retrieval In Knowledge Soup For Code Generation Su Hongjin, Jiang Shuyang, Lai Yuhang, Wu Haoyuan, Shi Boao, Liu Che, Liu Qian, Yu Tao
- SK-VQA: Synthetic Knowledge Generation At Scale For Training Context-augmented Multimodal Llms Su Xin, Luo Man, Pan Kris W, Chou Tien Pei, Lal Vasudev, Howard Phillip
- Llm-barber: Block-aware Rebuilder For Sparsity Mask In One-shot For Large Language Models Su Yupeng, Guan Ziyi, Liu Xiaoqun, Jin Tianlai, Wu Dongkuan, Chesi Graziano, Wong Ngai, Yu Hao
- M-QALM: A Benchmark To Assess Clinical Reading Comprehension And Knowledge Recall In Large Language Models Via Question Answering Subramanian Anand, Schlegel Viktor, Kashyap Abhinav Ramesh, Nguyen Thanh-tung, Dwivedi Vijay Prakash, Winkler Stefan
- LAB: Large-scale Alignment For Chatbots Sudalairaj Shivchander, Bhandwaldar Abhishek, Pareja Aldo, Xu Kai, Cox David D., Srivastava Akash
- Decoding At The Speed Of Thought: Harnessing Parallel Decoding Of Lexical Units For Llms Sun Chenxi, Zhang Hongzhi, Lin Zijia, Zhang Jingyuan, Zhang Fuzheng, Wang Zhongyuan, Chen Bin, Song Chengru, Zhang Di, Gai Kun, Xiong Deyi
- Building Better AI Agents: A Provocation On The Utilisation Of Persona In Llm-based Conversational Agents Sun Guangzhi, Zhan Xiao, Such Jose
- Conifer: Improving Complex Constrained Instruction-following Ability Of Large Language Models Sun Haoran, Liu Lixin, Li Junjie, Wang Fengyu, Dong Baohua, Lin Ran, Huang Ruohui
- Bbox-adapter: Lightweight Adapting For Black-box Large Language Models Sun Haotian, Zhuang Yuchen, Wei Wei, Zhang Chao, Dai Bo
- Evograd: A Dynamic Take On The Winograd Schema Challenge With Human Adversaries Sun Jing Han, Emami Ali
- Lawluo: A Chinese Law Firm Co-run By LLM Agents Sun Jingyun, Dai Chengxiao, Luo Zhongze, Chang Yangbo, Li Yang
- Effectiveness Of Chatgpt In Explaining Complex Medical Reports To Patients Sun Mengxuan, Reiter Ehud, Kiltie Anne E, Ramsay George, Duncan Lisa, Murchie Peter, Adam Rosalind
- Fostering Natural Conversation In Large Language Models With NICO: A Natural Interactive Conversation Dataset Sun Renliang, Liu Mengyuan, Yang Shiping, Wang Rui, He Junqing, Zhang Jiaxing
- Dr-llava: Visual Instruction Tuning With Symbolic Clinical Grounding Sun Shenghuan, Goldgof Gregory M., Schubert Alexander, Sun Zhiqing, Hartvigsen Thomas, Butte Atul J., Alaa Ahmed
- Llm-oriented Retrieval Tuner Sun Si, Zhang Hanqing, Liu Zhiyuan, Bao Jie, Song Dawei
- Diversifying Query: Region-guided Transformer For Temporal Sentence Grounding Sun Xiaolong, Shi Liushuai, Wang Le, Zhou Sanping, Xia Kun, Wang Yabing, Hua Gang
- Beyond Text: Utilizing Vocal Cues To Improve Decision Making In Llms For Robot Navigation Tasks Sun Xingpeng, Meng Haoming, Chakraborty Souradip, Bedi Amrit Singh, Bera Aniket
- Multi-turn Context Jailbreak Attack On Large Language Models From First Principles Sun Xiongtao, Zhang Deyue, Yang Dongdong, Zou Quanchen, Li Hui
- Rapidly Developing High-quality Instruction Data And Evaluation Benchmark For Large Language Models With Minimal Human Effort: A Case Study On Japanese Sun Yikun, Wan Zhen, Ueda Nobuhiro, Yahata Sakiko, Cheng Fei, Chu Chenhui, Kurohashi Sadao
- F-eval: Assessing Fundamental Abilities With Refined Evaluation Methods Sun Yu, Chen Keyu, Wang Shujie, Li Peiji, Guo Qipeng, Yan Hang, Qiu Xipeng, Huang Xuanjing, Lin Dahua
- You Only Cache Once: Decoder-decoder Architectures For Language Models Sun Yutao, Dong Li, Zhu Yi, Huang Shaohan, Wang Wenhui, Ma Shuming, Zhang Quanlu, Wang Jianyong, Wei Furu
- Leveraging Chatgpt In Pharmacovigilance Event Extraction: An Empirical Study Sun Zhaoyue, Pergola Gabriele, Wallace Byron C., He Yulan
- Towards Smaller, Faster Decoder-only Transformers: Architectural Variants And Their Implications Suresh Sathya Krishnan, P Shunmugapriya
- Meta-prompting: Enhancing Language Models With Task-agnostic Scaffolding Suzgun Mirac, Kalai Adam Tauman
- From Explanations To Action: A Zero-shot, Theory-driven LLM Framework For Student Performance Feedback Swamy Vinitra, Romano Davide, Desikan Bhargav Srinivasa, Camburu Oana-maria, Käser Tanja
- Automatic Generation And Evaluation Of Reading Comprehension Test Items With Large Language Models Säuberli Andreas, Clematide Simon
- Performance Assessment Of Chatgpt Vs Bard In Detecting Alzheimer's Dementia T Balamurali B, Chen Jer-ming
- Freely Long-thinking Transformer (frailt) Tabak Akbay
- PIXAR: Auto-regressive Language Modeling In Pixel Space Tai Yintao, Liao Xiyang, Suglia Alessandro, Vergari Antonio
- All In How You Ask For It: Simple Black-box Method For Jailbreak Attacks Takemoto Kazuhiro
- 1.5-pints Technical Report: Pretraining In Days, Not Months -- Your Language Model Thrives On Quality Data Tan Calvin, Wang Jerome
- Phantom: Personality Has An Effect On Theory-of-mind Reasoning In Large Language Models Tan Fiona Anting, Yeo Gerard Christopher, Wu Fanyou, Xu Weijie, Jain Vinija, Chadha Aman, Jaidka Kokil, Liu Yang, Ng See-kiong
- Blinded By Generated Contexts: How Language Models Merge Generated And Retrieved Contexts When Knowledge Conflicts? Tan Hexiang, Sun Fei, Yang Wanli, Wang Yuanzhuo, Cao Qi, Cheng Xueqi
- Lloco: Learning Long Contexts Offline Tan Sijun, Li Xiuyu, Patil Shishir, Wu Ziyang, Zhang Tianjun, Keutzer Kurt, Gonzalez Joseph E., Popa Raluca Ada
- Fine-tuning Large Language Model (LLM) Artificial Intelligence Chatbots In Ophthalmology And Llm-based Evaluation Using GPT-4 Tan Ting Fang, Elangovan Kabilan, Jin Liyuan, Jie Yao, Yong Li, Lim Joshua, Poh Stanley, Ng Wei Yan, Lim Daniel, Ke Yuhe, Liu Nan, Ting Daniel Shu Wei
- DLO: Dynamic Layer Operation For Efficient Vertical Scaling Of Llms Tan Zhen, Dong Daize, Zhao Xinyu, Peng Jie, Cheng Yu, Chen Tianlong
- Large Language Models For Data Annotation: A Survey Tan Zhen, Li Dawei, Wang Song, Beigi Alimohammad, Jiang Bohan, Bhattacharjee Amrita, Karami Mansooreh, Li Jundong, Cheng Lu, Liu Huan
- Can I Understand What I Create? Self-knowledge Evaluation Of Large Language Models Tan Zhiquan, Wei Lai, Wang Jindong, Xie Xing, Huang Weiran
- Leveraging Language Models For Emotion And Behavior Analysis In Education Tanaka Kaito, Tan Benjamin, Wong Brian
- Instructdoc: A Dataset For Zero-shot Generalization Of Visual Document Understanding With Instructions Tanaka Ryota, Iki Taichi, Nishida Kyosuke, Saito Kuniko, Suzuki Jun
- Active Label Correction For Building Llm-based Modular AI Systems Taneja Karan, Goel Ashok
- Jill Watson: A Virtual Teaching Assistant Powered By Chatgpt Taneja Karan, Maiti Pratyusha, Kakar Sandeep, Guruprasad Pranav, Rao Sanjeev, Goel Ashok K.
- Ungrammatical-syntax-based In-context Example Selection For Grammatical Error Correction Tang Chenming, Qu Fanyi, Wu Yunfang
- Razorattention: Efficient KV Cache Compression Through Retrieval Heads Tang Hanlin, Lin Yang, Lin Jing, Han Qingsen, Hong Shikuan, Yao Yiwu, Wang Gongyi
- Tofueval: Evaluating Hallucinations Of Llms On Topic-focused Dialogue Summarization Tang Liyan, Shalyminov Igor, Wong Amy Wing-mei, Burnsky Jon, Vincent Jake W., Yang Yu'an, Singh Siffi, Feng Song, Song Hwanjun, Su Hang, Sun Lijia, Zhang Yi, Mansour Saab, Mckeown Kathleen
- Language-specific Neurons: The Key To Multilingual Capabilities In Large Language Models Tang Tianyi, Luo Wenyang, Huang Haoyang, Zhang Dongdong, Wang Xiaolei, Zhao Xin, Wei Furu, Wen Ji-rong
- Multihop-rag: Benchmarking Retrieval-augmented Generation For Multi-hop Queries Tang Yixuan, Yang Yi
- Mathscale: Scaling Instruction Tuning For Mathematical Reasoning Tang Zhengyang, Zhang Xingxing, Wang Benyou, Wei Furu
- Chatgpt Incorrectness Detection In Software Reviews Tanzil Minaoar Hossain, Khan Junaed Younus, Uddin Gias
- Scaling Laws With Vocabulary: Larger Models Deserve Larger Vocabularies Tao Chaofan, Liu Qian, Dou Longxu, Muennighoff Niklas, Wan Zhongwei, Luo Ping, Lin Min, Wong Ngai
- Enhancing Commentary Strategies For Imperfect Information Card Games: A Study Of Large Language Models In Guandan Commentary Tao Meiling, Liang Xuechen, Wang Ziyi, Tao Yiling, Shi Tianyu
- Chatgpt Role-play Dataset: Analysis Of User Motives And Model Naturalness Tao Yufei, Agrawal Ameeta, Dombi Judit, Sydorenko Tetyana, Lee Jung In
- Making A Long Story Short In Conversation Modeling Tao Yufei, Mines Tiernan, Agrawal Ameeta
- MEEL: Multi-modal Event Evolution Learning Tao Zhengwei, Jin Zhi, Huang Junqiang, Chen Xiancai, Bai Xiaoying, Zhao Haiyan, Zhang Yifan, Tao Chongyang
- Targeted Visual Prompting For Medical Visual Question Answering Tascon-morales Sergio, Márquez-neila Pablo, Sznitman Raphael
- Training-free Consistent Text-to-image Generation Tewel Yoad, Kaduri Omri, Gal Rinon, Kasten Yoni, Wolf Lior, Chechik Gal, Atzmon Yuval
- Can Chatgpt Make Explanatory Inferences? Benchmarks For Abductive Reasoning Thagard Paul
- Judging The Judges: Evaluating Alignment And Vulnerabilities In Llms-as-judges Thakur Aman Singh, Choudhary Kartik, Ramayapally Venkat Srinik, Vaidyanathan Sankaran, Hupkes Dieuwke
- Equipping Language Models With Tool Use Capability For Tabular Data Analysis In Finance Theuma Adrian, Shareghi Ehsan
- Atomic Self-consistency For Better Long Form Generations Thirukovalluru Raghuveer, Huang Yukun, Dhingra Bhuwan
- Limits Of Transformer Language Models On Learning To Compose Algorithms Thomm Jonathan, Terzic Aleksandar, Camposampiero Giacomo, Hersche Michael, Schölkopf Bernhard, Rahimi Abbas
- Elitr-bench: A Meeting Assistant Benchmark For Long-context Language Models Thonet Thibaut, Rozen Jos, Besacier Laurent
- Elsevier Arena: Human Evaluation Of Chemistry/biology/health Foundational Large Language Models Thorne Camilo, Druckenbrodt Christian, Szarkowska Kinga, Goyal Deepika, Marajan Pranita, Somanath Vijay, Harper Corey, Yan Mao, Scerri Tony
- Dubo-sql: Diverse Retrieval-augmented Generation And Fine Tuning For Text-to-sql Thorpe Dayton G., Duberstein Andrew J., Kinsey Ian A.
- I Am A Strange Dataset: Metalinguistic Tests For Language Models Thrush Tristan, Moore Jared, Monares Miguel, Potts Christopher, Kiela Douwe
- Climategpt: Towards AI Synthesizing Interdisciplinary Research On Climate Change Thulke David, Gao Yingbo, Pelser Petrus, Brune Rein, Jalota Rricha, Fok Floris, Ramos Michael, Van Wyk Ian, Nasir Abdallah, Goldstein Hayden, Tragemann Taylor, Nguyen Katie, Fowler Ariana, Stanco Andrew, Gabriel Jon, Taylor Jordan, Moro Dean, Tsymbalov Evgenii, De Waal Juliette, Matusov Evgeny, Yaghi Mudar, Shihadah Mohammad, Ney Hermann, Dugast Christian, Dotan Jonathan, Erasmus Daniel
- Hydralora: An Asymmetric Lora Architecture For Efficient Fine-tuning Tian Chunlin, Shi Zhan, Guo Zhijiang, Li Li, Xu Chengzhong
- Assessing Large Language Models In Mechanical Engineering Education: A Study On Mechanics-focused Conceptual Understanding Tian Jie, Hou Jixin, Wu Zihao, Shu Peng, Liu Zhengliang, Xiang Yujie, Gu Beikang, Filla Nicholas, Li Yiwei, Liu Ning, Chen Xianyan, Tang Keke, Liu Tianming, Wang Xianqiao
- Untie The Knots: An Efficient Data Augmentation Strategy For Long-context Pre-training In Language Models Tian Junfeng, Zheng Da, Cheng Yang, Wang Rui, Zhang Colin, Zhang Debing
- Selective Prompt Anchoring For Code Generation Tian Yuan, Zhang Tianyi
- The Instinctive Bias: Spurious Images Lead To Hallucination In Mllms Han Tianyang, Lian Qing, Pan Rui, Pi Renjie, Zhang Jipeng, Diao Shizhe, Lin Yong, Zhang Tong
- LLM Circuit Analyses Are Consistent Across Training And Scale Tigges Curt, Hanna Michael, Yu Qinan, Biderman Stella
- Humor Mechanics: Advancing Humor Generation With Multistep Reasoning Tikhonov Alexey, Shtykovskiy Pavel
- Cantor: Inspiring Multimodal Chain-of-thought Of MLLM Gao Timin, Chen Peixian, Zhang Mengdan, Fu Chaoyou, Shen Yunhang, Zhang Yan, Zhang Shengchuan, Zheng Xiawu, Sun Xing, Cao Liujuan, Ji Rongrong
- Aalap: AI Assistant For Legal & Paralegal Functions In India Tiwari Aman, Kalamkar Prathamesh, Banerjee Atreyo, Karn Saurabh, Hemachandran Varun, Gupta Smita
- Generative AI Systems: A Systems-based Perspective On Generative AI Tomczak Jakub M.
- Understanding Linear Probing Then Fine-tuning Language Models From NTK Perspective Tomihari Akiyoshi, Sato Issei
- Eyes Wide Shut? Exploring The Visual Shortcomings Of Multimodal Llms Tong Shengbang, Liu Zhuang, Zhai Yuexiang, Ma Yi, Lecun Yann, Xie Saining
- Prompting Techniques For Secure Code Generation: A Systematic Investigation Tony Catherine, Ferreyra Nicolás E. Díaz, Mutas Markus, Dhiff Salem, Scandariato Riccardo
- Evaluating Large Language Models With Grid-based Game Competitions: An Extensible LLM Benchmark And Leaderboard Topsakal Oguzhan, Edell Colby Jacob, Harper Jackson Bailey
- Openmathinstruct-1: A 1.8 Million Math Instruction Tuning Dataset Toshniwal Shubham, Moshkov Ivan, Narenthiran Sean, Gitman Daria, Jia Fei, Gitman Igor
- Readctrl: Personalizing Text Generation With Readability-controlled Instruction Learning Tran Hieu, Yao Zonghai, Li Lingxi, Yu Hong
- UCCIX: Irish-excellence Large Language Model Tran Khanh-tung, O'sullivan Barry, Nguyen Hoang D.
- Transformers Are Ssms: Generalized Models And Efficient Algorithms Through Structured State Space Duality Dao Tri, Gu Albert
- Discrete Multimodal Transformers With A Pretrained Large Language Model For Mixed-supervision Speech Processing Trinh Viet Anh, Southwell Rosy, Guan Yiwen, He Xinlu, Wang Zhiyong, Whitehill Jacob
- Cognitive Modeling With Scaffolded Llms: A Case Study Of Referential Expression Generation Tsvilodub Polina, Franke Michael, Carcassi Fausto
- Augmenting The Author: Exploring The Potential Of AI Collaboration In Academic Writing Tu Joseph, Hadan Hilda, Wang Derrick M., Sgandurra Sabrina A, Mogavi Reza Hadi, Nacke Lennart E.
- Charactereval: A Chinese Benchmark For Role-playing Conversational Agent Evaluation Tu Quan, Fan Shilong, Tian Zihang, Yan Rui
- Unmasking The Imposters: In-domain Detection Of Human Vs. Machine-generated Tweets Tuck Bryan E., Verma Rakesh M.
- LM Transparency Tool: Interactive Tool For Analyzing Transformer Language Models Tufanov Igor, Hambardzumyan Karen, Ferrando Javier, Voita Elena
- Progres: Prompted Generative Rescoring On ASR N-best Tur Ada Defne, Moumen Adel, Ravanelli Mirco
- Enhancing Inference Efficiency Of Large Language Models: Investigating Optimization Strategies And Architectural Innovations Tyukin Georgy
- Attention Is All You Need But You Don't Need All Of It For Inference Of Large Language Models Tyukin Georgy, Dovonon Gbetondji J-s, Kaddour Jean, Minervini Pasquale
- Asking And Answering Questions To Extract Event-argument Structures Uddin Md Nayem, George Enfa Rose, Blanco Eduardo, Corman Steven
- Adacoder: Adaptive Prompt Compression For Programmatic Visual Question Answering Ukai Mahiro, Kurita Shuhei, Hashimoto Atsushi, Ushiku Yoshitaka, Inoue Nakamasa
- Sandwich Attack: Multi-language Mixture Adaptive Attack On Llms Upadhayay Bibek, Behzadan Vahid
- PARADISE: Evaluating Implicit Planning Skills Of Language Models With Procedural Warnings And Tips Dataset Uzunoglu Arda, Safa Abdalfatah Rashid, Şahin Gözde Gül
- From Words To Numbers: Your Large Language Model Is Secretly A Capable Regressor When Given In-context Examples Vacareanu Robert, Negru Vlad-andrei, Suciu Vasile, Surdeanu Mihai
- Evaluating The Capability Of Large Language Models To Personalize Science Texts For Diverse Middle-school-age Learners Vaccaro Michael Jr, Friday Mikayla, Zaghi Arash
- Do Large Language Models Perform The Way People Expect? Measuring The Human Generalization Function Vafa Keyon, Rambachan Ashesh, Mullainathan Sendhil
- Natural Language Programming In Medicine: Administering Evidence Based Clinical Workflows With Autonomous Agents Powered By Generative Large Language Models Vaid Akhil, Lampert Joshua, Lee Juhee, Sawant Ashwin, Apakama Donald, Sakhuja Ankit, Soroush Ali, Bick Sarah, Abbott Ethan, Gomez Hernando, Hadley Michael, Lee Denise, Landi Isotta, Duong Son Q, Bussola Nicole, Nabeel Ismail, Muehlstedt Silke, Freeman Robert, Kovatch Patricia, Carr Brendan, Wang Fei, Glicksberg Benjamin, Argulian Edgar, Lerakis Stamatios, Khera Rohan, Reich David L., Kraft Monica, Charney Alexander, Nadkarni Girish
- Accelerating Large Language Model Inference With Self-supervised Early Exits Valade Florian
- Improved Baselines For Data-efficient Perceptual Augmentation Of Llms Vallaeys Théophane, Shukor Mustafa, Cord Matthieu, Verbeek Jakob
- Investigating The Performance Of Language Models For Completing Code In Functional Programming Languages: A Haskell Case Study Van Dam Tim, Van Der Heijden Frank, De Bekker Philippe, Nieuwschepen Berend, Otten Marc, Izadi Maliheh
- AI Sandbagging: Language Models Can Strategically Underperform On Evaluations Van Der Weij Teun, Hofstätter Felix, Jaffe Ollie, Brown Samuel F., Ward Francis Rhys
- Gender Bias In Machine Translation And The Era Of Large Language Models Vanmassenhove Eva
- Can GPT Redefine Medical Understanding? Evaluating GPT On Biomedical Machine Reading Comprehension Vatsal Shubham, Singh Ayush
- Replacing Judges With Juries: Evaluating LLM Generations With A Panel Of Diverse Models Verga Pat, Hofstatter Sebastian, Althammer Sophia, Su Yixuan, Piktus Aleksandra, Arkhangorodsky Arkady, Xu Minjie, White Naomi, Lewis Patrick
- Cross-modal Projection In Multimodal Llms Doesn't Really Project Visual Attributes To Textual Space Verma Gaurav, Choi Minje, Sharma Kartik, Watson-daniels Jamelle, Oh Sejoon, Kumar Srijan
- Auditing Counterfire: Evaluating Advanced Counterargument Generation With Evidence And Style Verma Preetika, Jaidka Kokil, Churina Svetlana
- RAG Based Question-answering For Contextual Response Prediction System Veturi Sriram, Vaichal Saurabh, Jagadheesh Reshma Lal, Tripto Nafis Irtiza, Yan Nian
- Exploring LGBTQ+ Bias In Generative AI Answers Across Different Country And Religious Contexts Vicsek Lilla, Vancsó Anna, Zajko Mike, Takacs Judit
- Input Conditioned Graph Generation For Language Agents Vierling Lukas, Fu Jie, Chen Kai
- Exploring Safety-utility Trade-offs In Personalized Language Models Vijjini Anvesh Rao, Chowdhury Somnath Basu Roy, Chaturvedi Snigdha
- Emotional Manipulation Through Prompt Engineering Amplifies Disinformation Generation In AI Large Language Models Vinay Rasita, Spitale Giovanni, Biller-andorno Nikola, Germani Federico
- Medreqal: Examining Medical Knowledge Recall Of Large Language Models Via Question Answering Vladika Juraj, Schneider Phillip, Matthes Florian
- Vi-mistral-x: Building A Vietnamese Language Model With Advanced Continual Pre-training Vo James
- Foundational Autoraters: Taming Large Language Models For Better Automatic Evaluation Vu Tu, Krishna Kalpesh, Alzubi Salaheddin, Tar Chris, Faruqui Manaal, Sung Yun-hsuan
- Are Large Language Models Aligned With People's Social Intuitions For Human-robot Interactions? Wachowiak Lennart, Coles Andrew, Celiktutan Oya, Canal Gerard
- From Rags To Rich Parameters: Probing How Language Models Utilize External Knowledge Over Parametric Information For Factual Queries Wadhwa Hitesh, Seetharaman Rahul, Aggarwal Somyaa, Ghosh Reshmi, Basu Samyadeep, Srinivasan Soundararajan, Zhao Wenlong, Chaudhari Shreyas, Aghazadeh Ehsan
- Learning To Refine With Fine-grained Natural Language Feedback Wadhwa Manya, Zhao Xinyu, Li Junyi Jessy, Durrett Greg
- An Empirical Study Of Mamba-based Language Models Waleffe Roger, Byeon Wonmin, Riach Duncan, Norick Brandon, Korthikanti Vijay, Dao Tri, Gu Albert, Hatamizadeh Ali, Singh Sudhakar, Narayanan Deepak, Kulshreshtha Garvit, Singh Vartika, Casper Jared, Kautz Jan, Shoeybi Mohammad, Catanzaro Bryan
- The Instruction Hierarchy: Training Llms To Prioritize Privileged Instructions Wallace Eric, Xiao Kai, Leike Reimar, Weng Lilian, Heidecke Johannes, Beutel Alex
- Temporal Blind Spots In Large Language Models Wallat Jonas, Jatowt Adam, Anand Avishek
- Fusechat: Knowledge Fusion Of Chat Models Wan Fanqi, Zhong Longguang, Yang Ziyi, Chen Ruijun, Quan Xiaojun
- Teach Better Or Show Smarter? On Instructions And Exemplars In Automatic Prompt Optimization Wan Xingchen, Sun Ruoxi, Nakhost Hootan, Arik Sercan O.
- A & B == B & A: Triggering Logical Reasoning Failures In Large Language Models Wan Yuxuan, Wang Wenxuan, Yang Yiliu, Yuan Youliang, Huang Jen-tse, He Pinjia, Jiao Wenxiang, Lyu Michael R.
- LOOK-M: Look-once Optimization In KV Cache For Efficient Multimodal Long-context Inference Wan Zhongwei, Wu Ziang, Liu Che, Huang Jinfa, Zhu Zhihong, Jin Peng, Wang Longyue, Yuan Li
- Datavist5: A Pre-trained Language Model For Jointly Understanding Text And Data Visualization Wan Zhuoyue, Song Yuanfeng, Li Shuaimin, Zhang Chen Jason, Wong Raymond Chi-wing
- Pargo: Bridging Vision-language With Partial And Global Views Wang An-lan, Shan Bin, Shi Wei, Lin Kun-yu, Fei Xiang, Tang Guozhi, Liao Lei, Tang Jingqun, Huang Can, Zheng Wei-shi
- GOLF: Goal-oriented Long-term Life Tasks Supported By Human-ai Collaboration Wang Ben
- Task Supportive And Personalized Human-large Language Model Interaction: A User Study Wang Ben, Liu Jiqun, Karimnazarov Jamshed, Thompson Nicolas
- IAA: Inner-adaptor Architecture Empowers Frozen Large Language Model With Multimodal Capabilities Wang Bin, Xie Chunyu, Leng Dawei, Yin Yuhui
- Halu-j: Critique-based Hallucination Judge Wang Binjie, Chern Steffi, Chern Ethan, Liu Pengfei
- Grokked Transformers Are Implicit Reasoners: A Mechanistic Journey To The Edge Of Generalization Wang Boshi, Yue Xiang, Su Yu, Sun Huan
- Direct: Diagnostic Reasoning For Clinical Notes Via Large Language Models Wang Bowen, Chang Jiuyang, Qian Yiming, Chen Guoxin, Chen Junhao, Jiang Zhouqiang, Zhang Jiahao, Nakashima Yuta, Nagahara Hajime
- Information Theoretic Text-to-image Alignment Wang Chao, Franzese Giulio, Finamore Alessandro, Gallo Massimo, Michiardi Pietro
- A Survey For Large Language Models In Biomedicine Wang Chong, Li Mengyao, He Junjun, Wang Zhongruo, Darzi Erfan, Chen Zan, Ye Jin, Li Tianbin, Su Yanzhou, Ke Jing, Qu Kaili, Li Shuxin, Yu Yi, Liò Pietro, Wang Tianyun, Wang Yu Guang, Shen Yiqing
- Ada-leval: Evaluating Long-context Llms With Length-adaptable Benchmarks Wang Chonghua, Duan Haodong, Zhang Songyang, Lin Dahua, Chen Kai
- Tokenization Matters! Degrading Large Language Models Through Challenging Their Tokenization Wang Dixuan, Li Yanda, Jiang Junyuan, Ding Zepeng, Jiang Guochao, Liang Jiaqing, Yang Deqing
- Adapting Open-source Large Language Models For Cost-effective, Expert-level Clinical Note Generation With On-policy Reinforcement Learning Wang Hanyin, Gao Chufan, Liu Bolun, Xu Qiping, Hussein Guleid, Labban Mohamad El, Iheasirim Kingsley, Korsapati Hariprasad, Outcalt Chuck, Sun Jimeng
- Reinforcement Learning For Edit-based Non-autoregressive Neural Machine Translation Wang Hao, Morimura Tetsuro, Honda Ukyo, Kawahara Daisuke
- Interpretable Preferences Via Multi-objective Reward Modeling And Mixture-of-experts Wang Haoxiang, Xiong Wei, Xie Tengyang, Zhao Han, Zhang Tong
- M4U: Evaluating Multilingual Understanding And Reasoning For Large Multimodal Models Wang Hongyu, Xu Jiayu, Xie Senwei, Wang Ruiping, Li Jialin, Xie Zhaojie, Zhang Bin, Xiong Chuyan, Chen Xilin
- Target-constrained Bidirectional Planning For Generation Of Target-oriented Proactive Dialogue Wang Jian, Lin Dongding, Li Wenjie
- Instructgraph: Boosting Large Language Models Via Graph-centric Instruction Tuning And Preference Alignment Wang Jianing, Wu Junda, Hou Yupeng, Liu Yao, Gao Ming, Mcauley Julian
- Techgpt-2.0: A Large Language Model Project To Solve The Task Of Knowledge Graph Construction Wang Jiaqi, Chang Yuying, Li Zhong, An Ning, Ma Qi, Hei Lei, Luo Haibo, Lu Yifei, Ren Feiliang
- Tarsier: Recipes For Training And Evaluating Large Video Description Models Wang Jiawei, Yuan Liping, Zhang Yuchen
- Length Generalization Of Causal Transformers Without Position Encoding Wang Jie, Ji Tao, Wu Yuanbin, Yan Hang, Gui Tao, Zhang Qi, Huang Xuanjing, Wang Xiaoling
- Bioinformatics And Biomedical Informatics With Chatgpt: Year One Review Wang Jinge, Cheng Zien, Yao Qiuming, Liu Li, Xu Dong, Hu Gangqing
- Mixture-of-agents Enhances Large Language Model Capabilities Wang Junlin, Wang Jue, Athiwaratkun Ben, Zhang Ce, Zou James
- Mambabyte: Token-free Selective State Space Model Wang Junxiong, Gangavarapu Tushaar, Yan Jing Nathan, Rush Alexander M.
- The Mamba In The Llama: Distilling And Accelerating Hybrid Models Wang Junxiong, Paliotta Daniele, May Avner, Rush Alexander M., Dao Tri
- Exploring The Potential Of Multimodal LLM With Knowledge-intensive Multimodal ASR Wang Minghan, Wang Yuxia, Vu Thuy-trang, Shareghi Ehsan, Haffari Gholamreza
- Leave No Document Behind: Benchmarking Long-context Llms With Extended Multi-doc QA Wang Minzheng, Chen Longze, Fu Cheng, Liao Shengyi, Zhang Xinghua, Wu Bingli, Yu Haiyang, Xu Nan, Zhang Lei, Luo Run, Li Yunshui, Yang Min, Huang Fei, Li Yongbin
- Beyond The Known: Investigating Llms Performance On Out-of-domain Intent Detection Wang Pei, He Keqing, Wang Yejie, Song Xiaoshuai, Mou Yutao, Wang Jingang, Xian Yunsen, Cai Xunliang, Xu Weiran
- Effective Demonstration Annotation For In-context Learning Via Language Model-based Determinantal Point Process Wang Peng, Wang Xiaobin, Lou Chao, Mao Shengyu, Xie Pengjun, Jiang Yong
- Reconstruct The Pruned Model Without Any Retraining Wang Pingjie, Fan Ziqing, Hu Shengchao, Chen Zhe, Wang Yanfeng, Wang Yu
- Cross-refine: Improving Natural Language Explanation Generation By Learning In Tandem Wang Qianli, Anikina Tatiana, Feldhus Nils, Ostermann Simon, Möller Sebastian, Schmitt Vera
- Automated Generation Of Multiple-choice Cloze Questions For Assessing English Vocabulary Using Gpt-turbo 3.5 Wang Qiao, Rose Ralph, Orita Naho, Sugawara Ayaka
- SUBLLM: A Novel Efficient Architecture With Token Sequence Subsampling For LLM Wang Quandong, Yuan Yuxuan, Yang Xiaoyu, Zhang Ruike, Zhao Kang, Liu Wei, Luan Jian, Povey Daniel, Wang Bin
- Deconfounded Causality-aware Parameter-efficient Fine-tuning For Problem-solving Improvement Of Llms Wang Ruoyu, Li Xiaoxuan, Yao Lina
- Unique Security And Privacy Threats Of Large Language Model: A Comprehensive Survey Wang Shang, Zhu Tianqing, Liu Bo, Ding Ming, Guo Xu, Ye Dayong, Zhou Wanlei, Yu Philip S.
- What Is The Best Way For Chatgpt To Translate Poetry? Wang Shanshan, Wong Derek F., Yao Jingming, Chao Lidia S.
- Lora Meets Dropout Under A Unified Framework Wang Sheng, Chen Liheng, Jiang Jiyue, Xue Boyang, Kong Lingpeng, Wu Chuan
- Mfc-bench: Benchmarking Multimodal Fact-checking With Large Vision-language Models Wang Shengkang, Lin Hongzhan, Luo Ziyang, Ye Zhen, Chen Guang, Ma Jing
- Resonance Rope: Improving Context Length Generalization Of Large Language Models Wang Suyuchen, Kobyzev Ivan, Lu Peng, Rezagholizadeh Mehdi, Liu Bang
- Self-taught Evaluators Wang Tianlu, Kulikov Ilia, Golovneva Olga, Yu Ping, Yuan Weizhe, Dwivedi-yu Jane, Pang Richard Yuanzhe, Fazel-zarandi Maryam, Weston Jason, Li Xian
- Weaver: Foundation Models For Creative Writing Wang Tiannan, Chen Jiamin, Jia Qingrui, Wang Shuai, Fang Ruoyu, Wang Huilin, Gao Zhaowei, Xie Chunzhao, Xu Chuou, Dai Jihong, Liu Yibin, Wu Jialong, Ding Shengwei, Li Long, Huang Zhiwei, Deng Xinle, Yu Teng, Ma Gangan, Xiao Han, Chen Zixin, Xiang Danjun, Wang Yunxia, Zhu Yuanyuan, Xiao Yi, Wang Jing, Wang Yiru, Ding Siran, Huang Jiayang, Xu Jiayi, Tayier Yilihamu, Hu Zhenyu, Gao Yuan, Zheng Chengfeng, Ye Yueshu, Li Yihang, Wan Lei, Jiang Xinyue, Wang Yujie, Cheng Siyu, Song Zhule, Tang Xiangru, Xu Xiaohua, Zhang Ningyu, Chen Huajun, Jiang Yuchen Eleanor, Zhou Wangchunshu
- Less Is More For Improving Automatic Evaluation Of Factual Consistency Wang Tong, Kulkarni Ninad, Qi Yanjun
- Using Advanced Llms To Enhance Smaller Llms: An Interpretable Knowledge Distillation Approach Wang Tong, Sudhir K., Hong Dat
- Rocks Coding, Not Development--a Human-centric, Experimental Evaluation Of Llm-supported SE Tasks Wang Wei, Ning Huilong, Zhang Gaowei, Liu Libo, Wang Yi
- Learnable Item Tokenization For Generative Recommendation Wang Wenjie, Bao Honghui, Lin Xinyu, Zhang Jizhi, Li Yongqi, Feng Fuli, Ng See-kiong, Chua Tat-seng
- QUITO: Accelerating Long-context Reasoning Through Query-guided Context Compression Wang Wenshan, Wang Yihang, Fan Yixing, Liao Huaming, Guo Jiafeng
- Testing And Evaluation Of Large Language Models: Correctness, Non-toxicity, And Fairness Wang Wenxuan
- Metatool: Facilitating Large Language Models To Master Tools With Meta-task Augmentation Wang Xiaohan, Li Dian, Zhao Yilin, Sinbadliu, Wang Hui
- Validating Llm-generated Programs With Metamorphic Prompt Testing Wang Xiaoyin, Zhu Dakai
- Longllava: Scaling Multi-modal Llms To 1000 Images Efficiently Via Hybrid Architecture Wang Xidong, Song Dingjie, Chen Shunian, Zhang Chen, Wang Benyou
- Rdrec: Rationale Distillation For Llm-based Recommendation Wang Xinfeng, Cui Jin, Suzuki Yoshimi, Fukumoto Fumiyo
- Coglm: Tracking Cognitive Development Of Large Language Models Wang Xinglin, Yuan Peiwen, Feng Shaoxiong, Li Yiwei, Pan Boyuan, Wang Heda, Hu Yao, Li Kan
- Negativeprompt: Leveraging Psychology For Large Language Models Enhancement Via Negative Emotional Stimuli Wang Xu, Li Cheng, Chang Yi, Wang Jindong, Wu Yuan
- UBENCH: Benchmarking Uncertainty In Large Language Models With Multiple Choice Questions Wang Xunzhi, Zhang Zhuowei, Li Qiongyu, Chen Gaonan, Hu Mengting, Li Zhiyu, Luo Bitong, Gao Hang, Han Zhixin, Wang Haotian
- EAGER: Two-stream Generative Recommender With Behavior-semantic Collaboration Wang Ye, Xun Jiahao, Hong Minjie, Zhu Jieming, Jin Tao, Lin Wang, Li Haoyuan, Li Linjun, Xia Yan, Zhao Zhou, Dong Zhenhua
- A Theoretical Understanding Of Self-correction Through In-context Alignment Wang Yifei, Wu Yuyang, Wei Zeming, Jegelka Stefanie, Wang Yisen
- Speak It Out: Solving Symbol-related Problems With Symbol-to-language Conversion For Language Models Wang Yile, Cheng Sijie, Sun Zixin, Li Peng, Liu Yang
- HPT++: Hierarchically Prompting Vision-language Models With Multi-granularity Knowledge Generation And Improved Structure Modeling Wang Yubin, Jiang Xinyang, Cheng De, Sun Wenli, Li Dongsheng, Zhao Cairong
- Ladic: Are Diffusion Models Really Inferior To Autoregressive Counterparts For Image-to-text Generation? Wang Yuchi, Ren Shuhuai, Gao Rundong, Yao Linli, Guo Qingyan, An Kaikai, Bai Jianhong, Sun Xu
- REAR: A Relevance-aware Retrieval-augmented Framework For Open-domain Question Answering Wang Yuhao, Ren Ruiyang, Li Junyi, Zhao Wayne Xin, Liu Jing, Wen Ji-rong
- Sibyl: Simple Yet Effective Agent Framework For Complex Real-world Reasoning Wang Yulong, Shen Tianhao, Liu Lifeng, Xie Jian
- Loma: Lossless Compressed Memory Attention Wang Yumeng, Xiao Zhenyang
- Factuality Of Large Language Models In The Year 2024 Wang Yuxia, Wang Minghan, Manzoor Muhammad Arslan, Liu Fei, Georgiev Georgi, Das Rocktim Jyoti, Nakov Preslav
- To Recommend Or Not: Recommendability Identification In Conversations With Pre-trained Language Models Wang Zhefan, Ma Weizhi, Zhang Min
- M-RAG: Reinforcing Large Language Model Performance Through Retrieval-augmented Generation With Multiple Partitions Wang Zheng, Teo Shu Xian, Ouyang Jieer, Xu Yongjun, Shi Wei
- A Comprehensive Survey Of LLM Alignment Techniques: RLHF, RLAIF, PPO, DPO And More Wang Zhichao, Bi Bin, Pentyala Shiva Kumar, Ramnath Kiran, Chaudhuri Sougata, Mehrotra Shubham, Zhu Zixu (James), Mao Xiang-bo, Asur Sitaram, Cheng Na (Claire)
- Promptcharm: Text-to-image Generation Through Multi-modal Prompting And Refinement Wang Zhijie, Huang Yuheng, Song Da, Ma Lei, Zhang Tianyi
- Helpsteer2: Open-source Dataset For Training Top-performing Reward Models Wang Zhilin, Dong Yi, Delalleau Olivier, Zeng Jiaqi, Shen Gerald, Egert Daniel, Zhang Jimmy J., Sreedhar Makesh Narsimhan, Kuchaiev Oleksii
- Trove: Inducing Verifiable And Efficient Toolboxes For Solving Programmatic Tasks Wang Zhiruo, Fried Daniel, Neubig Graham
- Chatlogic: Integrating Logic Programming With Large Language Models For Multi-step Reasoning Wang Zhongsheng, Liu Jiamou, Bao Qiming, Rong Hongfei, Zhang Jingfeng
- Omnijarvis: Unified Vision-language-action Tokenization Enables Open-world Instruction Following Agents Wang Zihao, Cai Shaofei, Mu Zhancun, Lin Haowei, Zhang Ceyao, Liu Xuejie, Li Qing, Liu Anji, Ma Xiaojian, Liang Yitao
- RAT: Retrieval Augmented Thoughts Elicit Context-aware Reasoning In Long-horizon Generation Wang Zihao, Liu Anji, Lin Haowei, Li Jiaqi, Ma Xiaojian, Liang Yitao
- Eliminating Position Bias Of Language Models: A Mechanistic Approach Wang Ziqi, Zhang Hanlin, Li Xiner, Huang Kuan-hao, Han Chi, Ji Shuiwang, Kakade Sham M., Peng Hao, Ji Heng
- C3LLM: Conditional Multimodal Content Generation Using Large Language Models Wang Zixuan, Duan Qinkai, Tai Yu-wing, Tang Chi-keung
- PARIKSHA: A Large-scale Investigation Of Human-llm Evaluator Agreement On Multilingual And Multi-cultural Data Watts Ishaan, Gumma Varun, Yadavalli Aditya, Seshadri Vivek, Swaminathan Manohar, Sitaram Sunayana
- Investigating Multilingual Instruction-tuning: Do Polyglot Models Demand For Multilingual Instructions? Weber Alexander Arno, Thellmann Klaudia, Ebert Jan, Flores-herr Nicolas, Lehmann Jens, Fromm Michael, Ali Mehdi
- Large Language Models Are Pattern Matchers: Editing Semi-structured And Structured Documents With Chatgpt Weber Irene
- Hidden In Plain Sight: Exploring Chat History Tampering In Interactive Language Models Wei Cheng'an, Zhao Yue, Gong Yujia, Chen Kai, Xiang Lu, Zhu Shenchen
- OPDAI At Semeval-2024 Task 6: Small Llms Can Accelerate Hallucination Detection With Weakly Supervised Data Wei Chengcheng, Chen Ze, Fang Songtan, He Jiarong, Gao Max
- Rethinking Generative Large Language Model Evaluation For Semantic Comprehension Wei Fangyun, Chen Xi, Luo Lin
- Systematic Evaluation Of Llm-as-a-judge In LLM Alignment Tasks: Explainable Metrics And Diverse Prompt Templates Wei Hui, He Shenghua, Xia Tian, Wong Andy, Lin Jingyang, Han Mei
- Long-form Factuality In Large Language Models Wei Jerry, Yang Chengrun, Song Xinying, Lu Yifeng, Hu Nathan, Huang Jie, Tran Dustin, Peng Daiyi, Liu Ruibo, Huang Da, Du Cosmo, Le Quoc V.
- Performance Evaluation Of Lightweight Open-source Large Language Models In Pediatric Consultations: A Comparative Analysis Wei Qiuhong, Cui Ying, Ding Mengwei, Wang Yanqin, Xiang Lingling, Yao Zhengxiong, Chen Ceran, Long Ying, Jin Zhezhen, Xu Ximing
- Promptmm: Multi-modal Knowledge Distillation For Recommendation With Prompt-tuning Wei Wei, Tang Jiabin, Jiang Yangqin, Xia Lianghao, Huang Chao
- Are Large Language Models More Empathetic Than Humans? Welivita Anuradha, Pu Pearl
- AGLA: Mitigating Object Hallucinations In Large Vision-language Models With Assembly Of Global And Local Attention An Wenbin, Tian Feng, Leng Sicong, Nie Jiahao, Lin Haonan, Wang Qianying, Dai Guang, Chen Ping, Lu Shijian
- Do Llamas Work In English? On The Latent Language Of Multilingual Transformers Wendler Chris, Veselovsky Veniamin, Monea Giovanni, West Robert
- Longvlm: Efficient Long Video Understanding Via Large Language Models Weng Yuetian, Han Mingfei, He Haoyu, Chang Xiaojun, Zhuang Bohan
- Codeultrafeedback: An Llm-as-a-judge Dataset For Aligning Large Language Models To Coding Preferences Weyssow Martin, Kamanda Aton, Sahraoui Houari
- Towards Knowledge-grounded Natural Language Understanding And Generation Whitehouse Chenxi
- Answer, Assemble, Ace: Understanding How Transformers Answer Multiple Choice Questions Wiegreffe Sarah, Tafjord Oyvind, Belinkov Yonatan, Hajishirzi Hannaneh, Sabharwal Ashish
- Training-free Exponential Extension Of Sliding Window Context With Cascading KV Cache Willette Jeffrey, Lee Heejun, Lee Youngwan, Jeon Myeongjae, Hwang Sung Ju
- Memory Is All You Need: An Overview Of Compute-in-memory Architectures For Accelerating Large Language Model Inference Wolters Christopher, Yang Xiaoxuan, Schlichtmann Ulf, Suzumura Toyotaro
- Effects Of A Prompt Engineering Intervention On Undergraduate Students' AI Self-efficacy, AI Knowledge And Prompt Engineering Ability: A Mixed Methods Study Woo David James, Wang Deliang, Yung Tim, Guo Kai
- Don't Miss The Forest For The Trees: Attentional Vision Calibration For Large Vision Language Models Woo Sangmin, Kim Donguk, Jang Jaehyuk, Choi Yubin, Kim Changick
- Personalized Large Language Models Woźniak Stanisław, Koptyra Bartłomiej, Janz Arkadiusz, Kazienko Przemysław, Kocoń Jan
- A Surprising Failure? Multimodal Llms And The NLVR Challenge Wu Anne, Brantley Kianté, Artzi Yoav
- Curriculum Learning With Quality-driven Data Selection Wu Biao, Meng Fang, Chen Ling
- Transferable Speech-to-text Large Language Model Alignment Module Wu Boyong, Yan Chao, Pu Haoran
- Towards Evaluating And Building Versatile Large Language Models For Medicine Wu Chaoyi, Qiu Pengcheng, Liu Jinxin, Gu Hongfei, Li Na, Zhang Ya, Wang Yanfeng, Xie Weidi
- Llama Pro: Progressive Llama With Block Expansion Wu Chengyue, Gan Yukang, Ge Yixiao, Lu Zeyu, Wang Jiahao, Feng Ye, Shan Ying, Luo Ping
- Performance Law Of Large Language Models Wu Chuhan, Tang Ruiming
- Parameter-efficient Sparsity Crafting From Dense To Mixture-of-experts For Instruction Tuning On General Tasks Wu Haoyuan, Zheng Haisheng, He Zhuolun, Yu Bei
- Guiding Clinical Reasoning With Large Language Models Via Knowledge Seeds Wu Jiageng, Wu Xian, Yang Jie
- Gendec: A Robust Generative Question-decomposition Method For Multi-hop Reasoning Wu Jian, Yang Linyi, Ji Yuliang, Huang Wenhao, Karlsson Börje F., Okumura Manabu
- What Are The Limits Of Cross-lingual Dense Passage Retrieval For Low-resource Languages? Wu Jie, Ren Zhaochun, Verberne Suzan
- GEB-1.3B: Open Lightweight Large Language Model Wu Jie, Zhu Yufeng, Shen Lei, Lu Xuqing
- Pandora's Box Or Aladdin's Lamp: A Comprehensive Analysis Revealing The Role Of RAG Noise In Large Language Models Wu Jinyang, Che Feihu, Zhang Chuyuan, Tao Jianhua, Zhang Shuai, Shao Pengpeng
- Can Large Language Models Understand Uncommon Meanings Of Common Words? Wu Jinyang, Che Feihu, Zheng Xinxin, Zhang Shuai, Jin Ruihan, Nie Shuai, Shao Pengpeng, Tao Jianhua
- Noiseboost: Alleviating Hallucination With Noise Perturbation For Multimodal Large Language Models Wu Kai, Jiang Boyuan, Jiang Zhengkai, He Qingdong, Luo Donghao, Wang Shengzhi, Liu Qingwen, Wang Chengjie
- Clasheval: Quantifying The Tug-of-war Between An Llm's Internal Prior And External Evidence Wu Kevin, Wu Eric, Zou James
- Adapting Large Language Models For Document-level Machine Translation Wu Minghao, Vu Thuy-trang, Qu Lizhen, Foster George, Haffari Gholamreza
- Controlmllm: Training-free Visual Prompt Learning For Multimodal Large Language Models Wu Mingrui, Cai Xinyue, Ji Jiayi, Li Jiale, Huang Oucheng, Luo Gen, Fei Hao, Sun Xiaoshuai, Ji Rongrong
- Parallel Decoding Via Hidden Transfer For Lossless Large Language Model Acceleration Wu Pengfei, Liu Jiahao, Gong Zhuocheng, Wang Qifan, Li Jinpeng, Wang Jingang, Cai Xunliang, Zhao Dongyan
- Motionllm: Multimodal Motion-language Learning With Large Language Models Wu Qi, Zhao Yubo, Wang Yifan, Tai Yu-wing, Tang Chi-keung
- GUI Action Narrator: Where And When Did That Action Take Place? Wu Qinchen, Gao Difei, Lin Kevin Qinghong, Wu Zhuoyu, Guo Xiangwu, Li Peiran, Zhang Weichen, Wang Hengxu, Shou Mike Zheng
- Not All Attention Is Needed: Parameter And Computation Efficient Transfer Learning For Multi-modal Large Language Models Wu Qiong, Ye Weihao, Zhou Yiyi, Sun Xiaoshuai, Ji Rongrong
- Videollm-mod: Efficient Video-language Streaming With Mixture-of-depths Vision Computation Wu Shiwei, Chen Joya, Lin Kevin Qinghong, Wang Qimeng, Gao Yan, Xu Qianli, Xu Tong, Hu Yao, Chen Enhong, Shou Mike Zheng
- Enhance Reasoning For Large Language Models In The Game Werewolf Wu Shuang, Zhu Liwen, Yang Tao, Xu Shiwei, Fu Qiang, Wei Yang, Fu Haobo
- Cognitive Llms: Towards Integrating Cognitive Architectures And Large Language Models For Manufacturing Decision-making Wu Siyu, Oltramari Alessandro, Francis Jonathan, Giles C. Lee, Ritter Frank E.
- Unigen: A Unified Framework For Textual Dataset Generation Using Large Language Models Wu Siyuan, Huang Yue, Gao Chujie, Chen Dongping, Zhang Qihui, Wan Yao, Zhou Tianyi, Zhang Xiangliang, Gao Jianfeng, Xiao Chaowei, Sun Lichao
- F-LMM: Grounding Frozen Large Multimodal Models Wu Size, Jin Sheng, Zhang Wenwei, Xu Lumin, Liu Wentao, Li Wei, Loy Chen Change
- Semantic Alignment For Multimodal Large Language Models Wu Tao, Li Mengze, Chen Jingyuan, Ji Wei, Lin Wang, Gao Jinyang, Kuang Kun, Zhao Zhou, Wu Fei
- Self-directed Turing Test For Large Language Models Wu Weiqi, Wu Hongqiu, Zhao Hai
- Freeva: Offline MLLM As Training-free Video Assistant Wu Wenhao
- Long Context Alignment With Short Instructions And Synthesized Positions Wu Wenhao, Wang Yizhong, Fu Yao, Yue Xiang, Zhu Dawei, Li Sujian
- Do Language Models Plan Ahead For Future Tokens? Wu Wilson, Morris John X., Levine Lionel
- Tablebench: A Comprehensive And Complex Benchmark For Table Question Answering Wu Xianjie, Yang Jian, Chai Linzheng, Zhang Ge, Liu Jiaheng, Du Xinrun, Liang Di, Shu Daixin, Cheng Xianfu, Sun Tianzhen, Niu Guanglin, Li Tongliang, Li Zhoujun
- Transformer-based Causal Language Models Perform Clustering Wu Xinbo, Varshney Lav R.
- Which Questions Should I Answer? Salience Prediction Of Inquisitive Questions Wu Yating, Mangla Ritika, Dimakis Alexandros G., Durrett Greg, Li Junyi Jessy
- Elicitationgpt: Text Elicitation Mechanisms Via Language Models Wu Yifan, Hartline Jason
- Difflora: Generating Personalized Low-rank Adaptation Weights With Diffusion Wu Yujia, Shi Yiming, Wei Jiwei, Sun Chengwei, Zhou Yuyang, Yang Yang, Shen Heng Tao
- Inversecoder: Unleashing The Power Of Instruction-tuned Code Llms With Inverse-instruct Wu Yutong, Huang Di, Shi Wenxuan, Wang Wei, Gao Lingzhe, Liu Shihao, Nan Ziyuan, Yuan Kaizhao, Zhang Rui, Zhang Xishan, Du Zidong, Guo Qi, Pu Yewen, Yin Dawei, Hu Xing, Chen Yunji
- Instructing Large Language Models To Identify And Ignore Irrelevant Conditions Wu Zhenyu, Shen Chao, Jiang Meng
- Large Language Models Can Self-correct With Minimal Effort Wu Zhenyu, Zeng Qingkai, Zhang Zhihan, Tan Zhaoxuan, Shen Chao, Jiang Meng
- Acquiring Clean Language Models From Backdoor Poisoned Datasets By Downscaling Frequency Space Wu Zongru, Zhang Zhuosheng, Cheng Pengzhou, Liu Gongshen
- FOFO: A Benchmark To Evaluate Llms' Format-following Capability Xia Congying, Xing Chen, Du Jiangshu, Yang Xinyi, Feng Yihao, Xu Ran, Yin Wenpeng, Xiong Caiming
- Language Models Can Evaluate Themselves Via Probability Discrepancy Xia Tingyu, Yu Bowen, Wu Yuan, Chang Yi, Zhou Chang
- Improving Retrieval Augmented Language Model With Self-reasoning Xia Yuan, Zhou Jingbo, Shi Zhenhui, Chen Jun, Huang Haifeng
- What To Do If Language Models Disagree? Black-box Model Ensembling For Textual And Visual Question Answering Xia Yuxi, Zaporojets Klim, Roth Benjamin
- Badchain: Backdoor Chain-of-thought Prompting For Large Language Models Xiang Zhen, Jiang Fengqing, Xiong Zidi, Ramasubramanian Bhaskar, Poovendran Radha, Li Bo
- Clover-2: Accurate Inference For Regressive Lightweight Speculative Decoding Xiao Bin, Gui Lujun, Su Lei, Chen Weipeng
- Clover: Regressive Lightweight Speculative Decoding With Sequential Knowledge Xiao Bin, Shi Chunan, Nie Xiaonan, Yang Fan, Deng Xiangwei, Su Lei, Chen Weipeng, Cui Bin
- Infllm: Training-free Long-context Extrapolation For Llms With An Efficient Context Memory Xiao Chaojun, Zhang Pengle, Han Xu, Xiao Guangxuan, Lin Yankai, Zhang Zhengyan, Liu Zhiyuan, Sun Maosong
- A Comprehensive Survey Of Large Language Models And Multimodal Large Language Models In Medicine Xiao Hanguang, Zhou Feizhong, Liu Xingyue, Liu Tianqi, Li Zhipeng, Liu Xin, Huang Xiaoxuan
- Palm2-vadapter: Progressively Aligned Language Model Makes A Strong Vision-language Adapter Xiao Junfei, Xu Zheng, Yuille Alan, Yan Shen, Wang Boyu
- Exploring How Multiple Levels Of Gpt-generated Programming Hints Support Or Disappoint Novices Xiao Ruiwei, Hou Xinying, Stamper John
- Emovit: Revolutionizing Emotion Insights With Visual Instruction Tuning Xie Hongxia, Peng Chu-jun, Tseng Yu-wen, Chen Hung-jen, Hsu Chan-feng, Shuai Hong-han, Cheng Wen-huang
- Show-o: One Single Transformer To Unify Multimodal Understanding And Generation Xie Jinheng, Mao Weijia, Bai Zechen, Zhang David Junhao, Wang Weihao, Lin Kevin Qinghong, Gu Yuchao, Chen Zhijie, Yang Zhenheng, Shou Mike Zheng
- Non-instructional Fine-tuning: Enabling Instruction-following Capabilities In Pre-trained Language Models Without Instruction-following Data Xie Juncheng, Syu Shensian, Lee Hung-yi
- Me Llama: Foundation Large Language Models For Medical Applications Xie Qianqian, Chen Qingyu, Chen Aokun, Peng Cheng, Hu Yan, Lin Fongci, Peng Xueqing, Huang Jimin, Zhang Jeffrey, Keloth Vipina, Zhou Xinyu, He Huan, Ohno-machado Lucila, Wu Yonghui, Xu Hua, Bian Jiang
- Codeshell Technical Report Xie Rui, Zeng Zhengran, Yu Zhuohao, Gao Chang, Zhang Shikun, Ye Wei
- Sorry-bench: Systematically Evaluating Large Language Model Safety Refusal Behaviors Xie Tinghao, Qi Xiangyu, Zeng Yi, Huang Yangsibo, Sehwag Udari Madhushani, Huang Kaixuan, He Luxi, Wei Boyi, Li Dacheng, Sheng Ying, Jia Ruoxi, Li Bo, Li Kai, Chen Danqi, Henderson Peter, Mittal Prateek
- Decomposition For Enhancing Attention: Improving Llm-based Text-to-sql Through Workflow Paradigm Xie Yuanzhen, Jin Xinzhou, Xie Tao, Lin Mingxiong, Chen Liang, Yu Chenyun, Cheng Lei, Zhuo Chengxiang, Hu Bo, Li Zang
- Calibrating Reasoning In Language Models With Internal Consistency Xie Zhihui, Guo Jizhou, Yu Tong, Li Shuai
- Order Matters In Hallucination: Reasoning Order As Benchmark And Reflexive Prompting For Large-language-models Xie Zikai
- RALL-E: Robust Codec Language Modeling With Chain-of-thought Prompting For Text-to-speech Synthesis Xin Detai, Tan Xu, Shen Kai, Ju Zeqian, Yang Dongchao, Wang Yuancheng, Takamichi Shinnosuke, Saruwatari Hiroshi, Liu Shujie, Li Jinyu, Zhao Sheng
- EFUF: Efficient Fine-grained Unlearning Framework For Mitigating Hallucinations In Multimodal Large Language Models Xing Shangyu, Zhao Fei, Wu Zhen, An Tuo, Chen Weihao, Li Chunhui, Zhang Jianbing, Dai Xinyu
- Theoretical Understanding Of In-context Learning In Shallow Transformers With Unstructured Data Xing Yue, Lin Xiaofeng, Xu Chenheng, Suh Namjoon, Song Qifan, Cheng Guang
- Benchmarking Retrieval-augmented Generation For Medicine Xiong Guangzhi, Jin Qiao, Lu Zhiyong, Zhang Aidong
- Improving Retrieval-augmented Generation In Medicine With Iterative Follow-up Questions Xiong Guangzhi, Jin Qiao, Wang Xiao, Zhang Minjia, Lu Zhiyong, Zhang Aidong
- Converging Paradigms: The Synergy Of Symbolic And Connectionist AI In Llm-empowered Autonomous Agents Xiong Haoyi, Wang Zhiyuan, Li Xuhong, Bian Jiang, Xie Zeke, Mumtaz Shahid, Barnes Laura E.
- Temporal Scaling Law For Large Language Models Xiong Yizhe, Chen Xiansheng, Ye Xin, Chen Hui, Lin Zijia, Lian Haoran, Su Zhenpeng, Niu Jianwei, Ding Guiguang
- From Artificial Needles To Real Haystacks: Improving Retrieval Capabilities In Llms By Finetuning On Synthetic Data Xiong Zheyang, Papageorgiou Vasilis, Lee Kangwook, Papailiopoulos Dimitris
- Stylemaster: Towards Flexible Stylized Image Generation With Diffusion Models Xu Chengming, Hu Kai, Luo Donghao, Zhang Jiangning, Li Wei, Ge Yanhao, Wang Chengjie
- Data Efficient Evaluation Of Large Language Models And Text-to-image Models Via Adaptive Sampling Xu Cong, Saranathan Gayathri, Alam Mahammad Parwez, Shah Arpit, Lim James, Wong Soon Yee, Foltin Martin, Bhattacharya Suparna
- Understanding The Role Of Cross-entropy Loss In Fairly Evaluating Large Language Model-based Recommendation Xu Cong, Zhu Zhangchi, Wang Jun, Wang Jianyong, Zhang Wei
- Empowering 1000 Tokens/second On-device LLM Prefilling With Mllm-npu Xu Daliang, Zhang Hao, Yang Liming, Liu Ruiqi, Huang Gang, Xu Mengwei, Liu Xuanzhe
- Contrastive Preference Optimization: Pushing The Boundaries Of LLM Performance In Machine Translation Xu Haoran, Sharaf Amr, Chen Yunmo, Tan Weiting, Shen Lingfeng, Van Durme Benjamin, Murray Kenton, Kim Young Jin
- Redagent: Red Teaming Large Language Models With Context-aware Autonomous Language Agent Xu Huiyu, Zhang Wenhui, Wang Zhibo, Xiao Feng, Zheng Rui, Feng Yunhe, Ba Zhongjie, Ren Kui
- On-device Language Models: A Comprehensive Review Xu Jiajun, Li Zhiyuan, Chen Wei, Wang Qun, Gao Xin, Cai Qi, Ling Ziyuan
- Vtensor: Flexible Virtual Tensor Management For Efficient LLM Serving Xu Jiale, Zhang Rui, Guo Cong, Hu Weiming, Liu Zihan, Wu Feiyang, Feng Yu, Sun Shixuan, Shao Changxu, Guo Yuhong, Zhao Junping, Zhang Ke, Guo Minyi, Leng Jingwen
- Reverse Image Retrieval Cues Parametric Memory In Multimodal Llms Xu Jialiang, Moor Michael, Leskovec Jure
- Seamless Language Expansion: Enhancing Multilingual Mastery In Self-supervised Models Xu Jing, Wu Minglin, Wu Xixin, Meng Helen
- Tur[k]ingbench: A Challenge Benchmark For Web Agents Xu Kevin, Kordi Yeganeh, Nayak Tanay, Asija Ado, Wang Yizhong, Sanders Kate, Byerly Adam, Zhang Jingyu, Van Durme Benjamin, Khashabi Daniel
- Accelerating Large Language Model Training With Hybrid Gpu-based Compression Xu Lang, Anthony Quentin, Zhou Qinghua, Alnaasan Nawras, Gulhane Radha R., Shafi Aamir, Subramoni Hari, Panda Dhabaleswar K.
- Prompting Large Language Models For Recommender Systems: A Comprehensive Framework And Empirical Analysis Xu Lanling, Zhang Junjie, Li Bingqian, Wang Jinpeng, Cai Mingchen, Zhao Wayne Xin, Wen Ji-rong
- Superclue-math6: Graded Multi-step Math Reasoning Benchmark For Llms In Chinese Xu Liang, Xue Hang, Zhu Lei, Zhao Kangkang
- Chatqa 2: Bridging The Gap To Proprietary Llms In Long Context And RAG Capabilities Xu Peng, Ping Wei, Wu Xianchao, Xu Chejian, Liu Zihan, Shoeybi Mohammad, Catanzaro Bryan
- BESA: Pruning Large Language Models With Blockwise Parameter-efficient Sparsity Allocation Xu Peng, Shao Wenqi, Chen Mengzhao, Tang Shitao, Zhang Kaipeng, Gao Peng, An Fengwei, Qiao Yu, Luo Ping
- Walking In Others' Shoes: How Perspective-taking Guides Large Language Models In Reducing Toxicity And Bias Xu Rongwu, Zhou Zi'an, Zhang Tianwei, Qi Zehan, Yao Su, Xu Ke, Xu Wei, Qiu Han
- MINDECHO: Role-playing Language Agents For Key Opinion Leaders Xu Rui, Lu Dakuan, Tan Xiaoyu, Wang Xintao, Yuan Siyu, Chen Jiangjie, Chu Wei, Xu Yinghui
- Is DPO Superior To PPO For LLM Alignment? A Comprehensive Study Xu Shusheng, Fu Wei, Gao Jiaxuan, Ye Wenjie, Liu Weilin, Mei Zhiyu, Wang Guangju, Yu Chao, Wu Yi
- Pride And Prejudice: LLM Amplifies Self-bias In Self-refinement Xu Wenda, Zhu Guanglei, Zhao Xuandong, Pan Liangming, Li Lei, Wang William Yang
- Relating The Seemingly Unrelated: Principled Understanding Of Generalization For Generative Models In Arithmetic Reasoning Tasks Xu Xingcheng, Zhao Zibo, Zhang Haipeng, Yang Yanqing
- Conveyor: Efficient Tool-aware LLM Serving With Tool Partial Execution Xu Yechen, Kong Xinhao, Chen Tingjun, Zhuo Danyang
- Libra: Building Decoupled Vision System On Large Language Models Xu Yifan, Yang Xiaoshan, Song Yaguang, Xu Changsheng
- Linkprompt: Natural And Universal Adversarial Attacks On Prompt-based Language Models Xu Yue, Wang Wenjie
- Think: Thinner Key Cache By Query-driven Pruning Xu Yuhui, Jie Zhanming, Dong Hanze, Wang Lei, Lu Xudong, Zhou Aojun, Saha Amrita, Xiong Caiming, Sahoo Doyen
- Uncovering Safety Risks Of Large Language Models Through Concept Activation Vector Xu Zhihao, Huang Ruixuan, Chen Changyu, Wang Shuai, Wang Xiting
- Vision-flan: Scaling Human-labeled Tasks In Visual Instruction Tuning Xu Zhiyang, Feng Chao, Shao Rulin, Ashby Trevor, Shen Ying, Jin Di, Cheng Yu, Wang Qifan, Huang Lifu
- A Comprehensive Study Of Jailbreak Attack Versus Defense For Large Language Models Xu Zihao, Liu Yi, Deng Gelei, Li Yuekang, Picek Stjepan
- Question Calibration And Multi-hop Modeling For Temporal Question Answering Xue Chao, Liang Di, Wang Pengfei, Zhang Jing
- Longvila: Scaling Long-context Visual Language Models For Long Videos Xue Fuzhao, Chen Yukang, Li Dacheng, Hu Qinghao, Zhu Ligeng, Li Xiuyu, Fang Yunhao, Tang Haotian, Yang Shang, Liu Zhijian, He Ethan, Yin Hongxu, Molchanov Pavlo, Kautz Jan, Fan Linxi, Zhu Yuke, Lu Yao, Han Song
- Empirical Evidence Of Large Language Model's Influence On Human Spoken Communication Yakura Hiromu, Lopez-lopez Ezequiel, Brinkmann Levin, Serna Ignacio, Gupta Prateek, Rahwan Iyad
- List Items One By One: A New Data Source And Learning Paradigm For Multimodal Llms Yan An, Yang Zhengyuan, Wu Junda, Zhu Wanrong, Yang Jianwei, Li Linjie, Lin Kevin, Wang Jianfeng, Mcauley Julian, Gao Jianfeng, Wang Lijuan
- LTNER: Large Language Model Tagging For Named Entity Recognition With Contextualized Entity Marking Yan Faren, Yu Peng, Chen Xin
- GPT-4 Vs. Human Translators: A Comprehensive Evaluation Of Translation Quality Across Languages, Domains, And Expertise Levels Yan Jianhao, Yan Pingchuan, Chen Yulong, Li Judy, Zhu Xianchao, Zhang Yue
- Do Large Language Models Understand Logic Or Just Mimick Context? Yan Junbing, Wang Chengyu, Huang Jun, Zhang Wei
- Exploring The LLM Journey From Cognition To Expression With Linear Representations Yan Yuzi, Li Jialian, Zhang Yipin, Yan Dong
- 3d-properties: Identifying Challenges In DPO And Charting A Path Forward Yan Yuzi, Miao Yibo, Li Jialian, Zhang Yipin, Xie Jian, Deng Zhijie, Yan Dong
- Advancing Genai Assisted Programming--a Comparative Study On Prompt Efficiency And Code Quality Between GPT-4 And GLM-4 Yang Angus, Li Zehan, Li Jie
- Emphasising Structured Information: Integrating Abstract Meaning Representation Into Llms For Enhanced Open-domain Dialogue Evaluation Yang Bohao, Zhao Kun, Tang Chen, Liu Dong, Zhan Liang, Lin Chenghua
- Embedded Translations For Low-resource Automated Glossing Yang Changbing, Nicolai Garrett, Silfverberg Miikka
- Equipping Transformer With Random-access Reading For Long-context Understanding Yang Chenghao, Yang Zi, Hua Nan
- 3D Vision And Language Pretraining With Large-scale Synthetic Data Yang Dejie, Xu Zhu, Mo Wentao, Chen Qingchao, Huang Siyuan, Liu Yang
- Pyramidinfer: Pyramid KV Cache Compression For High-throughput LLM Inference Yang Dongjie, Han Xiaodong, Gao Yan, Hu Yao, Zhang Shilin, Zhao Hai
- Aligning Llms Through Multi-perspective User Preference Ranking-based Feedback For Programming Question Answering Yang Hongyu, He Liyang, Hou Min, Shen Shuanghong, Li Rui, Hou Jiahui, Ma Jianhui, Zhao Junda
- Toward Automatic Relevance Judgment Using Vision-language Models For Image-text Retrieval Evaluation Yang Jheng-hong, Lin Jimmy
- Large Language Model Tokenizer Bias: A Case Study And Solution On Gpt-4o Yang Jin, Wang Zhiqiang, Lin Yanbin, Zhao Zunduo
- Asynchronous And Segmented Bidirectional Encoding For NMT Yang Jingpu, Han Zehua, Xiang Mengyu, Wang Helin, Huang Yuxiao, Fang Miao
- Mastering Text-to-image Diffusion: Recaptioning, Planning, And Generating With Multimodal Llms Yang Ling, Yu Zhaochen, Meng Chenlin, Xu Minkai, Ermon Stefano, Cui Bin
- GPT-4 As Evaluator: Evaluating Large Language Models On Pest Management In Agriculture Yang Shanglong, Yuan Zhipeng, Li Shunbao, Peng Ruoling, Liu Kang, Yang Po
- Seed-story: Multimodal Long Story Generation With Large Language Model Yang Shuai, Ge Yuying, Li Yang, Chen Yukang, Ge Yixiao, Shan Ying, Chen Yingcong
- Is Crowdsourcing Breaking Your Bank? Cost-effective Fine-tuning Of Pre-trained Language Models With Proximal Policy Optimization Yang Shuo, Kasneci Gjergji
- Aqa-bench: An Interactive Benchmark For Evaluating Llms' Sequential Reasoning Ability Yang Siwei, Zhao Bingchen, Xie Cihang
- Parallelizing Linear Transformers With The Delta Rule Over Sequence Length Yang Songlin, Wang Bailin, Zhang Yu, Shen Yikang, Kim Yoon
- The Butterfly Effect Of Model Editing: Few Edits Can Trigger Large Language Models Collapse Yang Wanli, Sun Fei, Ma Xinyu, Liu Xun, Yin Dawei, Cheng Xueqi
- DGL: Dynamic Global-local Prompt Tuning For Text-video Retrieval Yang Xiangpeng, Zhu Linchao, Wang Xiaohan, Yang Yi
- Adazeta: Adaptive Zeroth-order Tensor-train Adaption For Memory-efficient Large Language Models Fine-tuning Yang Yifan, Zhen Kai, Banijamal Ershad, Mouchtaris Athanasios, Zhang Zheng
- Pre-trained Language Models Improve The Few-shot Prompt Ability Of Decision Transformer Yang Yu, Xu Pan
- Adversarial Attacks And Defense For Conversation Entailment Task Yang Zhenning, Krawec Ryan, Wu Liang-yuan
- React Meets Actre: When Language Agents Enjoy Training Data Autonomy Yang Zonghan, Li Peng, Yan Ming, Zhang Ji, Huang Fei, Liu Yang
- Dense Connector For Mllms Yao Huanjin, Wu Wenhao, Yang Taojiannan, Song Yuxin, Zhang Mengxi, Feng Haocheng, Sun Yifan, Li Zhiheng, Ouyang Wanli, Wang Jingdong
- Cacheblend: Fast Large Language Model Serving For RAG With Cached Knowledge Fusion Yao Jiayi, Li Hanchen, Liu Yuhan, Ray Siddhant, Cheng Yihua, Zhang Qizheng, Du Kuntai, Lu Shan, Jiang Junchen
- CLAVE: An Adaptive Framework For Evaluating Values Of LLM Generated Responses Yao Jing, Yi Xiaoyuan, Xie Xing
- Exploiting Inter-layer Expert Affinity For Accelerating Mixture-of-experts Model Inference Yao Jinghan, Anthony Quentin, Shafi Aamir, Subramoni Hari, Panda Dhabaleswar K.
- Training Ultra Long Context Language Model With Fully Pipelined Distributed Transformer Yao Jinghan, Jacobs Sam Ade, Tanaka Masahiro, Ruwase Olatunji, Shafi Aamir, Subramoni Hari, Panda Dhabaleswar K.
- Large Language Models Are Contrastive Reasoners Yao Liang
- Enhancing In-context Learning Performance With Just Svd-based Weight Pruning: A Theoretical Perspective Yao Xinhao, Hu Xiaolin, Yang Shenzhi, Liu Yong
- R^2AG: Incorporating Retrieval Information Into Retrieval Augmented Generation Ye Fuda, Li Shuangyin, Zhang Yongqi, Chen Lei
- Self-judge: Selective Instruction Following With Alignment Self-evaluation Ye Hai, Ng Hwee Tou
- Mplug-owl3: Towards Long Image-sequence Understanding In Multi-modal Large Language Models Ye Jiabo, Xu Haiyang, Liu Haowei, Hu Anwen, Yan Ming, Qian Qi, Zhang Ji, Huang Fei, Zhou Jingren
- Toolsword: Unveiling Safety Issues Of Large Language Models In Tool Learning Across Three Stages Ye Junjie, Li Sixian, Li Guanyu, Huang Caishuang, Gao Songyang, Wu Yilong, Zhang Qi, Gui Tao, Huang Xuanjing
- Rotbench: A Multi-level Benchmark For Evaluating The Robustness Of Large Language Models In Tool Learning Ye Junjie, Wu Yilong, Gao Songyang, Huang Caishuang, Li Sixian, Li Guanyu, Fan Xiaoran, Zhang Qi, Gui Tao, Huang Xuanjing
- Dataframe QA: A Universal LLM Framework On Dataframe Question Answering Without Data Exposure Ye Junyi, Du Mengnan, Wang Guiling
- Spactor-t5: Pre-training T5 Models With Span Corruption And Replaced Token Detection Ye Ke, Jiang Heinrich, Rostamizadeh Afshin, Chakrabarti Ayan, Desalvo Giulia, Kagy Jean-françois, Karydas Lazaros, Citovsky Gui, Kumar Sanjiv
- Efficient Vision-and-language Pre-training With Text-relevant Image Patch Selection Ye Wei, Jiang Chaoya, Xu Haiyang, Ye Chenhao, Li Chenliang, Yan Ming, Zhang Shikun, Huang Songhang, Huang Fei
- Long-context Language Modeling With Parallel Context Encoding Yen Howard, Gao Tianyu, Chen Danqi
- How Interpretable Are Reasoning Explanations From Prompting Large Language Models? Yeo Wei Jie, Satapathy Ranjan, Goh Rick Siow Mong, Cambria Erik
- Towards Fast Multilingual LLM Inference: Speculative Decoding And Specialized Drafters Yi Euiin, Kim Taehyeon, Jeung Hongseok, Chang Du-seong, Yun Se-young
- Fine-grained Detoxification Via Instance-level Prefixes For Large Language Models Yi Xin, Wang Linlin, Wang Xiaoling, He Liang
- MEND: Meta Demonstration Distillation For Efficient And Effective In-context Learning Li Yichuan, Ma Xiyao, Lu Sixing, Lee Kyumin, Liu Xiaohu, Guo Chenlei
- Lofit: Localized Fine-tuning On LLM Representations Yin Fangcong, Ye Xi, Durrett Greg
- Stablemask: Refining Causal Masking In Decoder-only Transformer Yin Qingyu, He Xuzheng, Zhuang Xiang, Zhao Yu, Yao Jianhua, Shen Xiaoyu, Zhang Qiang
- Llms-as-instructors: Learning From Errors Toward Automating Model Improvement Ying Jiahao, Lin Mingbao, Cao Yixin, Tang Wei, Wang Bo, Sun Qianru, Huang Xuanjing, Yan Shuicheng
- Unveiling The Safety Of Gpt-4o: An Empirical Study Using Jailbreak Attacks Ying Zonghao, Liu Aishan, Liu Xianglong, Tao Dacheng
- Exploring The Comprehension Of Chatgpt In Traditional Chinese Medicine Knowledge Li Yizhen, Huang Shaohan, Qi Jiaxing, Quan Lei, Han Dongran, Luan Zhongzhi
- Debunc: Mitigating Hallucinations In Large Language Model Agent Communication With Uncertainty Estimations Yoffe Luke, Amayuelas Alfonso, Wang William Yang
- CSRT: Evaluation And Analysis Of Llms Using Code-switching Red-teaming Dataset Yoo Haneul, Yang Yongjin, Lee Hwaran
- C-TPT: Calibrated Test-time Prompt Tuning For Vision-language Models Via Text Feature Dispersion Yoon Hee Suk, Yoon Eunseop, Tee Joshua Tian Jin, Hasegawa-johnson Mark, Li Yingzhen, Yoo Chang D.
- Matryoshka-adaptor: Unsupervised And Supervised Tuning For Smaller Embedding Dimensions Yoon Jinsung, Sinha Raj, Arik Sercan O, Pfister Tomas
- Making The Most Of Your Model: Methods For Finetuning And Applying Pretrained Transformers Yoshida Davis
- Tree-planted Transformers: Unidirectional Transformer Language Models With Implicit Syntactic Supervision Yoshida Ryo, Someya Taiga, Oseki Yohei
- When Linear Attention Meets Autoregressive Decoding: Towards More Effective And Efficient Linearized Large Language Models You Haoran, Fu Yichao, Wang Zheng, Yazdanbakhsh Amir, Lin Yingyan Celine
- Shiftaddllm: Accelerating Pretrained Llms Via Post-training Multiplication-less Reparameterization You Haoran, Guo Yipin, Fu Yichao, Zhou Wei, Shi Huihong, Zhang Xiaofan, Kundu Souvik, Yazdanbakhsh Amir, Lin Yingyan Celine
- Ferret-ui: Grounded Mobile UI Understanding With Multimodal Llms You Keen, Zhang Haotian, Schoop Eldon, Weers Floris, Swearngin Amanda, Nichols Jeffrey, Yang Yinfei, Gan Zhe
- Improving Structural Diversity Of Blackbox Llms Via Chain-of-specification Prompting Young Halley, Zeng Yimeng, Gardner Jacob, Bastani Osbert
- Llasmol: Advancing Large Language Models For Chemistry With A Large-scale, Comprehensive, High-quality Instruction Tuning Dataset Yu Botao, Baker Frazier N., Chen Ziqi, Ning Xia, Sun Huan
- Effectively Compress KV Heads For LLM Yu Hao, Yang Zelan, Li Shen, Li Yong, Wu Jianxin
- Enhancing Healthcare Through Large Language Models: A Study On Medical Question Answering Yu Haoran, Yu Chang, Wang Zihan, Zou Dongxian, Qin Hao
- A Cause-effect Look At Alleviating Hallucination Of Knowledge-grounded Dialogue Generation Yu Jifan, Zhang Xiaohan, Xu Yifan, Lei Xuanyu, Yao Zijun, Zhang Jing, Hou Lei, Li Juanzi
- Mechanistic Understanding And Mitigation Of Language Model Non-factual Hallucinations Yu Lei, Cao Meng, Cheung Jackie Chi Kit, Dong Yue
- Distilling System 2 Into System 1 Yu Ping, Xu Jing, Weston Jason, Kulikov Ilia
- Truth-aware Context Selection: Mitigating Hallucinations Of Large Language Models Being Misled By Untruthful Contexts Yu Tian, Zhang Shaolei, Feng Yang
- Ra-rec: An Efficient ID Representation Alignment Framework For Llm-based Recommendation Yu Xiaohan, Zhang Li, Zhao Xin, Wang Yue, Ma Zhongrui
- Mitigate Position Bias In Large Language Models Via Scaling A Single Dimension Yu Yijiong, Jiang Huiqiang, Luo Xufang, Wu Qianhui, Lin Chin-yew, Li Dongsheng, Yang Yuqing, Huang Yongfeng, Qiu Lili
- Rankrag: Unifying Context Ranking With Retrieval-augmented Generation In Llms Yu Yue, Ping Wei, Liu Zihan, Wang Boxin, You Jiaxuan, Zhang Chao, Shoeybi Mohammad, Catanzaro Bryan
- Unveiling And Harnessing Hidden Attention Sinks: Enhancing Large Language Models Without Training Through Attention Calibration Yu Zhongzhi, Wang Zheng, Fu Yonggan, Shi Huihong, Shaikh Khalid, Lin Yingyan Celine
- Predicting Rewards Alongside Tokens: Non-disruptive Parameter Insertion For Efficient Inference Intervention In Large Language Model Yuan Chenhan, Huang Fei, Peng Ru, Lu Keming, Yu Bowen, Zhou Chang, Zhou Jingren
- Remamba: Equip Mamba With Effective Long-sequence Modeling Yuan Danlong, Liu Jiahao, Li Bei, Zhang Huishuai, Wang Jingang, Cai Xunliang, Zhao Dongyan
- A Continued Pretrained LLM Approach For Automatic Medical Note Generation Yuan Dong, Rastogi Eti, Naik Gautam, Rajagopal Sree Prasanna, Goyal Sagar, Zhao Fen, Chintagunta Bharath, Ward Jeff
- Whispers That Shake Foundations: Analyzing And Mitigating False Premise Hallucinations In Large Language Models Yuan Hongbang, Cao Pengfei, Jin Zhuoran, Chen Yubo, Zeng Daojian, Liu Kang, Zhao Jun
- Towards A Holistic Evaluation Of Llms On Factual Knowledge Recall Yuan Jiaqing, Pan Lin, Hang Chung-wei, Guo Jiang, Jiang Jiarong, Min Bonan, Ng Patrick, Wang Zhiguo
- KV Cache Compression, But What Must We Give In Return? A Comprehensive Benchmark Of Long Context Capable Approaches Yuan Jiayi, Liu Hongyi, Zhong Shaochen Henry, Chuang Yu-neng, Li Songchen, Wang Guanchu, Le Duy, Jin Hongye, Chaudhary Vipin, Xu Zhaozhuo, Liu Zirui, Hu Xia
- Advancing LLM Reasoning Generalists With Preference Trees Yuan Lifan, Cui Ganqu, Wang Hanbin, Ding Ning, Wang Xingyao, Deng Jia, Shan Boji, Chen Huimin, Xie Ruobing, Lin Yankai, Liu Zhenghao, Zhou Bowen, Peng Hao, Liu Zhiyuan, Sun Maosong
- Focused Large Language Models Are Stable Many-shot Learners Yuan Peiwen, Feng Shaoxiong, Li Yiwei, Wang Xinglin, Zhang Yueqi, Tan Chuyi, Pan Boyuan, Wang Heda, Hu Yao, Li Kan
- Chatmusician: Understanding And Generating Music Intrinsically With LLM Yuan Ruibin, Lin Hanfeng, Wang Yi, Tian Zeyue, Wu Shangda, Shen Tianhao, Zhang Ge, Wu Yuhang, Liu Cong, Zhou Ziya, Ma Ziyang, Xue Liumeng, Wang Ziyu, Liu Qin, Zheng Tianyu, Li Yizhi, Ma Yinghao, Liang Yiming, Chi Xiaowei, Liu Ruibo, Wang Zili, Li Pengfei, Wu Jingcheng, Lin Chenghua, Liu Qifeng, Jiang Tao, Huang Wenhao, Chen Wenhu, Benetos Emmanouil, Fu Jie, Xia Gus, Dannenberg Roger, Xue Wei, Kang Shiyin, Guo Yike
- R-judge: Benchmarking Safety Risk Awareness For LLM Agents Yuan Tongxin, He Zhiwei, Dong Lingzhong, Wang Yiming, Zhao Ruijie, Xia Tian, Xu Lizhen, Zhou Binglin, Li Fangqi, Zhang Zhuosheng, Wang Rui, Liu Gongshen
- Self-rewarding Language Models Yuan Weizhe, Pang Richard Yuanzhe, Cho Kyunghyun, Li Xian, Sukhbaatar Sainbayar, Xu Jing, Weston Jason
- CO3: Low-resource Contrastive Co-training For Generative Conversational Query Rewrite Yuan Yifei, Shi Chen, Wang Runze, Chen Liyi, Hu Renjun, Zhang Zengming, Jiang Feijun, Lam Wai
- Rhyme-aware Chinese Lyric Generator Based On GPT Yuan Yixiao, Huang Yangchen, Ma Yu, Li Xinjin, Li Zhenglin, Shi Yiming, Zhou Huapeng
- Refuse Whenever You Feel Unsafe: Improving Safety In Llms Via Decoupled Refusal Training Yuan Youliang, Jiao Wenxiang, Wang Wenxuan, Huang Jen-tse, Xu Jiahao, Liang Tian, He Pinjia, Tu Zhaopeng
- Do Large Language Models Understand Conversational Implicature -- A Case Study With A Chinese Sitcom Yue Shisen, Song Siyuan, Cheng Xinyuan, Hu Hai
- Tcmbench: A Comprehensive Benchmark For Evaluating Large Language Models In Traditional Chinese Medicine Yue Wenjing, Wang Xiaoling, Zhu Wei, Guan Ming, Zheng Huanran, Wang Pengfei, Sun Changzhi, Ma Xin
- Mammoth2: Scaling Instructions From The Web Yue Xiang, Zheng Tuney, Zhang Ge, Chen Wenhu
- Distilling Instruction-following Abilities Of Large Language Models With Task-aware Curriculum Planning Yue Yuanhao, Wang Chengyu, Huang Jun, Wang Peng
- Wkvquant: Quantizing Weight And Key/value Cache For Large Language Models Gains More Yue Yuxuan, Yuan Zhihang, Duanmu Haojie, Zhou Sifan, Wu Jianlong, Nie Liqiang
- Question-analysis Prompting Improves LLM Performance In Reasoning Tasks Yugeswardeenoo Dharunish, Zhu Kevin, O'brien Sean
- Can Small Language Models Be Good Reasoners For Sequential Recommendation? Wang Yuling, Tian Changxin, Hu Binbin, Yu Yanhua, Liu Ziqi, Zhang Zhiqiang, Zhou Jun, Pang Liang, Wang Xiao
- Focus On The Core: Efficient Attention Via Pruned Token Compression For Document Classification Yun Jungmin, Kim Mihyeon, Kim Youngbin
- Toward Inference-optimal Mixture-of-expert Large Language Models Yun Longfei, Zhuang Yonghao, Fu Yao, Xing Eric P, Zhang Hao
- Prompting Implicit Discourse Relation Annotation Yung Frances, Ahmad Mansoor, Scholman Merel, Demberg Vera
- Rec-gpt4v: Multimodal Recommendation With Large Vision-language Models Liu Yuqing, Wang Yu, Sun Lichao, Yu Philip S.
- SPOT: Text Source Prediction From Originality Score Thresholding Yvinec Edouard, Kasser Gabriel
- Turkishmmlu: Measuring Massive Multitask Language Understanding In Turkish Yüksel Arda, Köksal Abdullatif, Şenel Lütfi Kerem, Korhonen Anna, Schütze Hinrich
- WSC+: Enhancing The Winograd Schema Challenge Using Tree-of-experts Zahraei Pardis Sadat, Emami Ali
- Envgen: Generating And Adapting Environments Via Llms For Training Embodied Agents Zala Abhay, Cho Jaemin, Lin Han, Yoon Jaehong, Bansal Mohit
- Let Storytelling Tell Vivid Stories: An Expressive And Fluent Multimodal Storyteller Zang Chuanqi, Tang Jiji, Zhang Rongsheng, Zhao Zeng, Lv Tangjie, Pei Mingtao, Liang Wei
- Understanding And Mitigating Compositional Issues In Text-to-image Generative Models Zarei Arman, Rezaei Keivan, Basu Samyadeep, Saberi Mehrdad, Moayeri Mazda, Kattakinda Priyatham, Feizi Soheil
- Tell Me Why: Explainable Public Health Fact-checking With Large Language Models Zarharan Majid, Wullschleger Pascal, Kia Babak Behkam, Pilehvar Mohammad Taher, Foster Jennifer
- Unused Information In Token Probability Distribution Of Generative LLM: Improving LLM Reading Comprehension Through Calculation Of Expected Values Zawistowski Krystian
- Design Proteins Using Large Language Models: Enhancements And Comparative Analyses Zeinalipour Kamyar, Jamshidi Neda, Bianchini Monica, Maggini Marco, Gori Marco
- Automating Turkish Educational Quiz Generation Using Large Language Models Zeinalipour Kamyar, Keptiğ Yusuf Gökberk, Maggini Marco, Gori Marco
- ABQ-LLM: Arbitrary-bit Quantized Inference Acceleration For Large Language Models Zeng Chao, Liu Songwei, Xie Yusheng, Liu Hong, Wang Xiaojian, Wei Miao, Yang Shu, Chen Fangmin, Mei Xing
- Improving Logits-based Detector Without Logits From Black-box Llms Zeng Cong, Tang Shengkun, Yang Xianjun, Chen Yuanzhou, Sun Yiyou, Xu Zhiqiang, Li Yao, Chen Haifeng, Cheng Wei, Xu Dongkuan
- Skywork-math: Data Scaling Laws For Mathematical Reasoning In Large Language Models -- The Story Goes On Zeng Liang, Zhong Liangjun, Zhao Liang, Wei Tianwen, Yang Liu, He Jujie, Cheng Cheng, Hu Rui, Liu Yang, Yan Shuicheng, Fang Han, Zhou Yahui
- How Johnny Can Persuade Llms To Jailbreak Them: Rethinking Persuasion To Challenge AI Safety By Humanizing Llms Zeng Yi, Lin Hongpeng, Zhang Jingwen, Yang Diyi, Jia Ruoxi, Shi Weiyan
- Learning Reward For Robot Skills Using Large Language Models Via Self-alignment Zeng Yuwei, Mu Yao, Shao Lin
- MR-BEN: A Comprehensive Meta-reasoning Benchmark For Large Language Models Zeng Zhongshen, Liu Yinhong, Wan Yingjia, Li Jingyao, Chen Pengguang, Dai Jianbo, Yao Yuxuan, Xu Rongwu, Qi Zehan, Zhao Wanru, Shen Linling, Lu Jianqiao, Tan Haochen, Chen Yukang, Zhang Hao, Shi Zhan, Wang Bailin, Guo Zhijiang, Jia Jiaya
- Actions Speak Louder Than Words: Trillion-parameter Sequential Transducers For Generative Recommendations Zhai Jiaqi, Liao Lucy, Liu Xing, Wang Yueming, Li Rui, Cao Xuan, Gao Leon, Gong Zhaojie, Gu Fangda, He Michael, Lu Yinghai, Shi Yu
- Anygpt: Unified Multimodal LLM With Discrete Sequence Modeling Zhan Jun, Dai Junqi, Ye Jiasheng, Zhou Yunhua, Zhang Dong, Liu Zhigeng, Zhang Xin, Yuan Ruibin, Zhang Ge, Li Linyang, Yan Hang, Fu Jie, Gui Tao, Sun Tianxiang, Jiang Yugang, Qiu Xipeng
- Injecagent: Benchmarking Indirect Prompt Injections In Tool-integrated Large Language Model Agents Zhan Qiusi, Liang Zhixiang, Ying Zifan, Kang Daniel
- Skyeyegpt: Unifying Remote Sensing Vision-language Tasks Via Instruction Tuning With Large Language Model Zhan Yang, Xiong Zhitong, Yuan Yuan
- MC-GPT: Empowering Vision-and-language Navigation With Memory Map And Reasoning Chains Zhan Zhaohuan, Yu Lisha, Yu Sijie, Tan Guang
- Distilling Implicit Multimodal Knowledge Into Llms For Zero-resource Dialogue Generation Zhang Bo, Ma Hui, Ding Jian, Wang Jian, Xu Bo, Lin Hongfei
- Notellm-2: Multimodal Large Representation Models For Recommendation Zhang Chao, Zhang Haoxin, Wu Shiwei, Wu Di, Xu Tong, Gao Yan, Hu Yao, Chen Enhong
- Teaching Large Language Models An Unseen Language On The Fly Zhang Chen, Liu Xiao, Lin Jiuheng, Feng Yansong
- Chemllm: A Chemical Large Language Model Zhang Di, Liu Wei, Tan Qian, Chen Jingdan, Yan Hang, Yan Yuliang, Li Jiatong, Huang Weiran, Yue Xiangyu, Ouyang Wanli, Zhou Dongzhan, Zhang Shufei, Su Mao, Zhong Han-sen, Li Yuqiang
- Speechgpt-gen: Scaling Chain-of-information Speech Generation Zhang Dong, Zhang Xin, Zhan Jun, Li Shimin, Zhou Yaqian, Qiu Xipeng
- Mm-llms: Recent Advances In Multimodal Large Language Models Zhang Duzhen, Yu Yahan, Dong Jiahua, Li Chenxing, Su Dan, Chu Chenhui, Yu Dong
- Unforgettable Generalization In Language Models Zhang Eric, Choshen Leshem, Andreas Jacob
- CMMMU: A Chinese Massive Multi-discipline Multimodal Understanding Benchmark Zhang Ge, Du Xinrun, Chen Bei, Liang Yiming, Luo Tongxu, Zheng Tianyu, Zhu Kang, Cheng Yuyang, Xu Chunpu, Guo Shuyue, Zhang Haoran, Qu Xingwei, Wang Junjie, Yuan Ruibin, Li Yizhi, Wang Zekun, Liu Yudong, Tsai Yu-hsuan, Zhang Fengji, Lin Chenghua, Huang Wenhao, Fu Jie
- Map-neo: Highly Capable And Transparent Bilingual Large Language Model Series Zhang Ge, Qu Scott, Liu Jiaheng, Zhang Chenchen, Lin Chenghua, Yu Chou Leuang, Pan Danny, Cheng Esther, Liu Jie, Lin Qunshu, Yuan Raven, Zheng Tuney, Pang Wei, Du Xinrun, Liang Yiming, Ma Yinghao, Li Yizhi, Ma Ziyang, Lin Bill, Benetos Emmanouil, Yang Huan, Zhou Junting, Ma Kaijing, Liu Minghao, Niu Morry, Wang Noah, Que Quehry, Liu Ruibo, Liu Sine, Guo Shawn, Gao Soren, Zhou Wangchunshu, Zhang Xinyue, Zhou Yizhi, Wang Yubo, Bai Yuelin, Zhang Yuhan, Zhang Yuxiang, Wang Zenith, Yang Zhenzhu, Zhao Zijian, Zhang Jiajun, Ouyang Wanli, Huang Wenhao, Chen Wenhu
- Closing The Gap Between Open-source And Commercial Large Language Models For Medical Evidence Summarization Zhang Gongbo, Jin Qiao, Zhou Yiliang, Wang Song, Idnay Betina R., Luo Yiming, Park Elizabeth, Nestor Jordan G., Spotnitz Matthew E., Soroush Ali, Campion Thomas, Lu Zhiyong, Weng Chunhua, Peng Yifan
- Language Model Prompt Selection Via Simulation Optimization Zhang Haoting, He Jinghai, Righter Rhonda, Zheng Zeyu
- Are Large Language Models Good At Utility Judgments? Zhang Hengran, Zhang Ruqing, Guo Jiafeng, De Rijke Maarten, Fan Yixing, Cheng Xueqi
- Sinklora: Enhanced Efficiency And Chat Capabilities For Long-context Large Language Models Zhang Hengyu
- Paying More Attention To Source Context: Mitigating Unfaithful Translations From Large Language Model Zhang Hongbin, Chen Kehai, Bai Xuefeng, Xiang Yang, Zhang Min
- Adaptable Logical Control For Large Language Models Zhang Honghua, Kung Po-nien, Yoshida Masahiro, Broeck Guy Van Den, Peng Nanyun
- A Careful Examination Of Large Language Model Performance On Grade School Arithmetic Zhang Hugh, Da Jeff, Lee Dean, Robinson Vaughn, Wu Catherine, Song Will, Zhao Tiffany, Raja Pranav, Slack Dylan, Lyu Qin, Hendryx Sean, Kaplan Russell, Lunati Michele, Yue Summer
- Enja: Ensemble Jailbreak On Large Language Models Zhang Jiahao, Wang Zilong, Wang Ruofan, Ma Xingjun, Jiang Yu-gang
- Longcite: Enabling Llms To Generate Fine-grained Citations In Long-context QA Zhang Jiajie, Bai Yushi, Lv Xin, Gu Wanjun, Liu Danqing, Zou Minhao, Cao Shulin, Hou Lei, Dong Yuxiao, Feng Ling, Li Juanzi
- Agentohana: Design Unified Data And Training Pipeline For Effective Agent Learning Zhang Jianguo, Lan Tian, Murthy Rithesh, Liu Zhiwei, Yao Weiran, Tan Juntao, Hoang Thai, Yang Liangwei, Feng Yihao, Liu Zuxin, Awalgaonkar Tulika, Niebles Juan Carlos, Savarese Silvio, Heinecke Shelby, Wang Huan, Xiong Caiming
- Xlam: A Family Of Large Action Models To Empower AI Agent Systems Zhang Jianguo, Lan Tian, Zhu Ming, Liu Zuxin, Hoang Thai, Kokane Shirley, Yao Weiran, Tan Juntao, Prabhakar Akshara, Chen Haolin, Liu Zhiwei, Feng Yihao, Awalgaonkar Tulika, Murthy Rithesh, Hu Eric, Chen Zeyuan, Xu Ran, Niebles Juan Carlos, Heinecke Shelby, Wang Huan, Savarese Silvio, Xiong Caiming
- Geoeval: Benchmark For Evaluating Llms And Multi-modal Models On Geometry Problem-solving Zhang Jiaxin, Li Zhongzhi, Zhang Mingliang, Yin Fei, Liu Chenglin, Moshfeghi Yashar
- Regurgitative Training: The Value Of Real Data In Training Large Language Models Zhang Jinghui, Qiao Dandan, Yang Mochen, Wei Qiang
- Unimot: Unified Molecule-text Language Model With Discrete Token Representation Zhang Juzheng, Bian Yatao, Chen Yongqiang, Yao Quanming
- Hire A Linguist!: Learning Endangered Languages With In-context Linguistic Descriptions Zhang Kexun, Choi Yee Man, Song Zhenqiao, He Taiqi, Wang William Yang, Li Lei
- Harmonized Speculative Sampling Zhang Lefan, Wang Xiaodan, Huang Yanhua, Xu Ruiwen
- Predicting Learning Performance With Large Language Models: A Study In Adult Literacy Zhang Liang, Lin Jionghao, Borchers Conrad, Sabatini John, Hollander John, Cao Meng, Hu Xiangen
- SPL: A Socratic Playground For Learning Powered By Large Language Model Zhang Liang, Lin Jionghao, Kuang Ziyi, Xu Sheng, Yeasin Mohammed, Hu Xiangen
- Attention Instruction: Amplifying Attention In The Middle Via Prompting Zhang Meiru, Meng Zaiqiao, Collier Nigel
- Enhancing Multi-hop Reasoning Through Knowledge Erasure In Large Language Model Editing Zhang Mengqi, Fang Bowen, Liu Qiang, Ren Pengjie, Wu Shu, Chen Zhumin, Wang Liang
- The Impact Of Demonstrations On Multilingual In-context Learning: A Multidimensional Analysis Zhang Miaoran, Gautam Vagrant, Wang Mingyang, Alabi Jesujoba O., Shen Xiaoyu, Klakow Dietrich, Mosbach Marius
- The Hedgehog & The Porcupine: Expressive Linear Attentions With Softmax Mimicry Zhang Michael, Bhatia Kush, Kumbong Hermann, Ré Christopher
- Internlm-xcomposer-2.5: A Versatile Large Vision Language Model Supporting Long-contextual Input And Output Zhang Pan, Dong Xiaoyi, Zang Yuhang, Cao Yuhang, Qian Rui, Chen Lin, Guo Qipeng, Duan Haodong, Wang Bin, Ouyang Linke, Zhang Songyang, Zhang Wenwei, Li Yining, Gao Yang, Sun Peng, Zhang Xinyue, Li Wei, Li Jingwen, Wang Wenhai, Yan Hang, He Conghui, Zhang Xingcheng, Chen Kai, Dai Jifeng, Qiao Yu, Lin Dahua, Wang Jiaqi
- Extending Llama-3's Context Ten-fold Overnight Zhang Peitian, Shao Ninglu, Liu Zheng, Xiao Shitao, Qian Hongjin, Ye Qiwei, Dou Zhicheng
- Tinyllama: An Open-source Small Language Model Zhang Peiyuan, Zeng Guangtao, Wang Tianduo, Lu Wei
- RECOST: External Knowledge Guided Data-efficient Instruction Tuning Zhang Qi, Zhang Yiming, Wang Haobo, Zhao Junbo
- VAR-CLIP: Text-to-image Generator With Visual Auto-regressive Modeling Zhang Qian, Dai Xiangzi, Yang Ninghua, An Xiang, Feng Ziyong, Ren Xingyu
- Mathverse: Does Your Multi-modal LLM Truly See The Diagrams In Visual Math Problems? Zhang Renrui, Jiang Dongzhi, Zhang Yichi, Lin Haokun, Guo Ziyu, Qiu Pengshuo, Zhou Aojun, Lu Pan, Chang Kai-wei, Gao Peng, Li Hongsheng
- Direct Preference Optimization Of Video Large Multimodal Models From Language Model Reward Zhang Ruohong, Gui Liangke, Sun Zhiqing, Feng Yihao, Xu Keyang, Zhang Yuanhan, Fu Di, Li Chunyuan, Hauptmann Alexander, Bisk Yonatan, Yang Yiming
- Mindspeech: Continuous Imagined Speech Decoding Using High-density Fnirs And Prompt Tuning For Advanced Human-ai Interaction Zhang Suyi, Alam Ekram, Baber Jack, Bianco Francesca, Turner Edward, Chamanzar Maysam, Dehghani Hamid
- R4: Reinforced Retriever-reorder-responder For Retrieval-augmented Large Language Models Zhang Taolin, Li Dongyang, Chen Qizhou, Wang Chengyu, Huang Longtao, Xue Hui, He Xiaofeng, Huang Jun
- In-context Principle Learning From Mistakes Zhang Tianjun, Madaan Aman, Gao Luyu, Zheng Steven, Mishra Swaroop, Yang Yiming, Tandon Niket, Alon Uri
- Promptfix: Few-shot Backdoor Removal Via Adversarial Prompt Tuning Zhang Tianrong, Xi Zhaohan, Wang Ting, Mitra Prasenjit, Chen Jinghui
- Nomad-attention: Efficient LLM Inference On Cpus Through Multiply-add-free Attention Zhang Tianyi, Yi Jonah Wonkyu, Yao Bowen, Xu Zhaozhuo, Shrivastava Anshumali
- Interpreting And Improving Large Language Models In Arithmetic Calculation Zhang Wei, Wan Chaoqun, Zhang Yonggang, Cheung Yiu-ming, Tian Xinmei, Shen Xu, Ye Jieping
- Dólares Or Dollars? Unraveling The Bilingual Prowess Of Financial Llms Between Spanish And English Zhang Xiao, Xiang Ruoyu, Yuan Chenhan, Feng Duanyu, Han Weiguang, Lopez-lira Alejandro, Liu Xiao-yang, Ananiadou Sophia, Peng Min, Huang Jimin, Xie Qianqian
- Large Language Models As Evaluators For Recommendation Explanations Zhang Xiaoyu, Li Yishan, Wang Jiayin, Sun Bowen, Ma Weizhi, Sun Peijie, Zhang Min
- Laser: Parameter-efficient LLM Bi-tuning For Sequential Recommendation With Collaborative Information Zhang Xinyu, Hu Linmei, Zhang Luhao, Song Dandan, Huang Heyan, Nie Liqiang
- Finercut: Finer-grained Interpretable Layer Pruning For Large Language Models Zhang Yang, Li Yawei, Wang Xinpeng, Shen Qianli, Plank Barbara, Bischl Bernd, Rezaei Mina, Kawaguchi Kenji
- Pushing The Limit Of LLM Capacity For Text Classification Zhang Yazhou, Wang Mengyao, Ren Chenyu, Li Qiuchi, Tiwari Prayag, Wang Benyou, Qin Jing
- Mme-realworld: Could Your Multimodal LLM Challenge High-resolution Real-world Scenarios That Are Difficult For Humans? Zhang Yi-fan, Zhang Huanyu, Tian Haochen, Fu Chaoyou, Zhang Shuangqing, Wu Junfei, Li Feng, Wang Kun, Wen Qingsong, Zhang Zhang, Wang Liang, Jin Rong, Tan Tieniu
- Wings: Learning Multimodal Llms Without Text-only Forgetting Zhang Yi-kai, Lu Shiyin, Li Yang, Ma Yanqing, Chen Qing-guo, Xu Zhao, Luo Weihua, Zhang Kaifu, Zhan De-chuan, Ye Han-jia
- MLKD-BERT: Multi-level Knowledge Distillation For Pre-trained Language Models Zhang Ying, Yang Ziheng, Ji Shufan
- How Far Are We From Intelligent Visual Deductive Reasoning? Zhang Yizhe, Bai He, Zhang Ruixiang, Gu Jiatao, Zhai Shuangfei, Susskind Josh, Jaitly Navdeep
- Navigating User Experience Of Chatgpt-based Conversational Recommender Systems: The Effects Of Prompt Guidance And Recommendation Domain Zhang Yizhe, Jin Yucheng, Chen Li, Yang Ting
- Leveraging Biases In Large Language Models: "bias-knn" For Effective Few-shot Learning Zhang Yong, Li Hanzhang, Li Zhitao, Cheng Ning, Li Ming, Xiao Jing, Wang Jianzong
- Autocap: Towards Automatic Cross-lingual Alignment Planning For Zero-shot Chain-of-thought Zhang Yongheng, Chen Qiguang, Li Min, Che Wanxiang, Qin Libo
- Prompt-aware Adapter: Towards Learning Adaptive Visual Tokens For Multimodal Large Language Models Zhang Yue, Fan Hehe, Yang Yi
- OT-VP: Optimal Transport-guided Visual Prompting For Test-time Adaptation Zhang Yunbei, Mehra Akshay, Hamm Jihun
- Small Language Models Need Strong Verifiers To Self-correct Reasoning Zhang Yunxiang, Khalifa Muhammad, Logeswaran Lajanugen, Kim Jaekyeom, Lee Moontae, Lee Honglak, Wang Lu
- Intention Analysis Makes Llms A Good Jailbreak Defender Zhang Yuqi, Ding Liang, Zhang Lefei, Tao Dacheng
- Toolbehonest: A Multi-level Hallucination Diagnostic Benchmark For Tool-augmented Large Language Models Zhang Yuxiang, Chen Jing, Wang Junjie, Liu Yaxin, Yang Cheng, Shi Chufan, Zhu Xinyu, Lin Zihao, Wan Hanwen, Yang Yujiu, Sakai Tetsuya, Feng Tian, Yamana Hayato
- PPTC-R Benchmark: Towards Evaluating The Robustness Of Large Language Models For Powerpoint Task Completion Zhang Zekai, Guo Yiduo, Liang Yaobo, Zhao Dongyan, Duan Nan
- Diversifying The Expert Knowledge For Task-agnostic Pruning In Sparse Mixture-of-experts Zhang Zeliang, Liu Xiaodong, Cheng Hao, Xu Chenliang, Gao Jianfeng
- A Survey On The Memory Mechanism Of Large Language Model Based Agents Zhang Zeyu, Bo Xiaohe, Ma Chen, Li Rui, Chen Xu, Dai Quanyu, Zhu Jieming, Dong Zhenhua, Wen Ji-rong
- Position-aware Parameter Efficient Fine-tuning Approach For Reducing Positional Bias In Llms Zhang Zheng, Yang Fan, Jiang Ziyan, Chen Zheng, Zhao Zhengyang, Ma Chengyuan, Zhao Liang, Liu Yang
- Found In The Middle: How Language Models Use Long Contexts Better Via Plug-and-play Positional Encoding Zhang Zhenyu, Chen Runjin, Liu Shiwei, Yao Zhewei, Ruwase Olatunji, Chen Beidi, Wu Xiaoxia, Wang Zhangyang
- Shieldlm: Empowering Llms As Aligned, Customizable And Explainable Safety Detectors Zhang Zhexin, Lu Yida, Ma Jingyuan, Zhang Di, Li Rui, Ke Pei, Sun Hao, Sha Lei, Sui Zhifang, Wang Hongning, Huang Minlie
- Glitchprober: Advancing Effective Detection And Mitigation Of Glitch Tokens In Large Language Models Zhang Zhibo, Bai Wuxia, Li Yuxi, Meng Mark Huasong, Wang Kailong, Shi Ling, Li Li, Wang Jun, Wang Haoyu
- Galla: Graph Aligned Large Language Models For Improved Source Code Understanding Zhang Ziyin, Yu Hang, Li Shijie, Di Peng, Li Jianguo, Wang Rui
- APT: Adaptive Pruning And Tuning Pretrained Language Models For Efficient Training And Inference Zhao Bowen, Hajishirzi Hannaneh, Cao Qingqing
- Consistency Matters: Explore Llms Consistency From A Black-box Perspective Zhao Fufangchen, Jin Guoqiang, Huang Jiaheng, Zhao Rui, Tan Fei
- LANE: Logic Alignment Of Non-tuning Large Language Models And Online Recommendation Systems For Explainable Reason Generation Zhao Hongke, Zheng Songming, Wu Likang, Yu Bowen, Wang Jing
- Revolutionizing Finance With Llms: An Overview Of Applications And Insights Zhao Huaqin, Liu Zhengliang, Wu Zihao, Li Yiwei, Yang Tianze, Shu Peng, Xu Shaochen, Dai Haixing, Zhao Lin, Mai Gengchen, Liu Ninghao, Liu Tianming
- Galore: Memory-efficient LLM Training By Gradient Low-rank Projection Zhao Jiawei, Zhang Zhenyu, Chen Beidi, Wang Zhangyang, Anandkumar Anima, Tian Yuandong
- Llama Beyond English: An Empirical Study On Language Capability Transfer Zhao Jun, Zhang Zhihao, Gao Luhui, Zhang Qi, Gui Tao, Huang Xuanjing
- Longagent: Scaling Language Models To 128k Context Through Multi-agent Collaboration Zhao Jun, Zu Can, Xu Hao, Lu Yi, He Wei, Ding Yiwen, Gui Tao, Zhang Qi, Huang Xuanjing
- Lora Land: 310 Fine-tuned Llms That Rival GPT-4, A Technical Report Zhao Justin, Wang Timothy, Abid Wael, Angus Geoffrey, Garg Arnav, Kinnison Jeffery, Sherstinsky Alex, Molino Piero, Addair Travis, Rishi Devvret
- Mitigating Object Hallucination In Large Vision-language Models Via Classifier-free Guidance Zhao Linxi, Deng Yihe, Zhang Weitong, Gu Quanquan
- Diffagent: Fast And Accurate Text-to-image API Selection With Large Language Model Zhao Lirui, Yang Yue, Zhang Kaipeng, Shao Wenqi, Zhang Yuxin, Qiao Yu, Luo Ping, Ji Rongrong
- Efficiently Training 7B LLM With 1 Million Sequence Length On 8 Gpus Zhao Pinxue, Zhang Hailin, Fu Fangcheng, Nie Xiaonan, Liu Qibin, Yang Fang, Peng Yuanbo, Jiao Dian, Li Shuaipeng, Xue Jinbao, Tao Yangyu, Cui Bin
- Prepacking: A Simple Method For Fast Prefilling And Increased Throughput In Large Language Models Zhao Siyan, Israel Daniel, Broeck Guy Van Den, Grover Aditya
- Towards Comprehensive And Efficient Post Safety Alignment Of Large Language Models Via Safety Patching Zhao Weixiang, Hu Yulin, Li Zhuojun, Deng Yang, Zhao Yanyan, Qin Bing, Chua Tat-seng
- SAPT: A Shared Attention Framework For Parameter-efficient Continual Learning Of Large Language Models Zhao Weixiang, Wang Shilong, Hu Yulin, Zhao Yanyan, Qin Bing, Zhang Xuanyu, Yang Qing, Xu Dongliang, Che Wanxiang
- Enhancing Cross-domain Pre-trained Decision Transformers With Adaptive Attention Zhao Wenhao, Xu Qiushui, Xu Linjie, Song Lei, Wang Jinyu, Zhou Chunlai, Bian Jiang
- I Could've Asked That: Reformulating Unanswerable Questions Zhao Wenting, Gao Ge, Cardie Claire, Rush Alexander M.
- Wildchat: 1M Chatgpt Interaction Logs In The Wild Zhao Wenting, Ren Xiang, Hessel Jack, Cardie Claire, Choi Yejin, Deng Yuntian
- Codev: Empowering Llms For Verilog Generation Through Multi-level Summarization Zhao Yang, Huang Di, Li Chongxiao, Jin Pengwei, Nan Ziyuan, Ma Tianyun, Qi Lei, Pan Yansong, Zhang Zhenxing, Zhang Rui, Zhang Xishan, Du Zidong, Guo Qi, Hu Xing, Chen Yunji
- How Do Large Language Models Handle Multilingualism? Zhao Yiran, Zhang Wenxuan, Chen Guizhen, Kawaguchi Kenji, Bing Lidong
- ALISA: Accelerating Large Language Model Inference Via Sparsity-aware KV Caching Zhao Youpeng, Wu Di, Wang Jun
- An Empirical Study Of Retrieval Augmented Generation With Chain-of-thought Zhao Yuetong, Cao Hongyu, Zhao Xianyu, Ou Zhijian
- SWIFT: A Scalable Lightweight Infrastructure For Fine-tuning Zhao Yuze, Huang Jintao, Hu Jinghan, Wang Xingjun, Mao Yunlin, Zhang Daoze, Jiang Zeyinzi, Wu Zhikai, Ai Baole, Wang Ang, Zhou Wenmeng, Chen Yingda
- Chemdfm: Dialogue Foundation Model For Chemistry Zhao Zihan, Ma Da, Chen Lu, Sun Liangtai, Li Zihao, Xu Hongshen, Zhu Zichen, Zhu Su, Fan Shuai, Shen Guodong, Chen Xin, Yu Kai
- Leveraging Large Language Models With Chain-of-thought And Prompt Engineering For Traffic Crash Severity Analysis And Inference Zhen Hao, Shi Yucheng, Huang Yongcan, Yang Jidong J., Liu Ninghao
- Gpt-4v(ision) Is A Generalist Web Agent, If Grounded Zheng Boyuan, Gou Boyu, Kil Jihyung, Sun Huan, Su Yu
- ICE-GRT: Instruction Context Enhancement By Generative Reinforcement Based Transformers Zheng Chen, Sun Ke, Tang Da, Ma Yukun, Zhang Yuyu, Xi Chenguang, Zhou Xun
- Mistral-c2f: Coarse To Fine Actor For Analytical And Reasoning Enhancement In RLHF And Effective-merged Llms Zheng Chen, Sun Ke, Zhou Xun
- Large Language Models As Reliable Knowledge Bases? Zheng Danna, Lapata Mirella, Pan Jeff Z.
- Opencodeinterpreter: Integrating Code Generation With Execution And Refinement Zheng Tianyu, Zhang Ge, Shen Tianhao, Liu Xueling, Lin Bill Yuchen, Fu Jie, Chen Wenhu, Yue Xiang
- Attention Heads Of Large Language Models: A Survey Zheng Zifan, Wang Yezhaohui, Huang Yuxin, Song Shichao, Tang Bo, Xiong Feiyu, Li Zhiyu
- Exploring Boundary Of GPT-4V On Marine Analysis: A Preliminary Case Study Zheng Ziqiang, Chen Yiwei, Zhang Jipeng, Vu Tuan-anh, Zeng Huimin, Wong Tim Yue Him, Yeung Sai-kit
- Understanding The Rope Extensions Of Long-context Llms: An Attention Perspective Zhong Meizhi, Zhang Chen, Lei Yikun, Liu Xikai, Gao Yan, Hu Yao, Chen Kehai, Zhang Min
- Enhancing Multimodal Large Language Models With Multi-instance Visual Prompt Generator For Visual Representation Enrichment Zhong Wenliang, Wu Wenyi, Li Qi, Barton Rob, Du Boxin, Sam Shioulin, Bouyarmane Karim, Tutar Ismail, Huang Junzhou
- Robust Prompt Optimization For Defending Language Models Against Jailbreaking Attacks Zhou Andy, Li Bo, Wang Haohan
- Misinforming Llms: Vulnerabilities, Challenges And Opportunities Zhou Bo, Geißler Daniel, Lukowicz Paul
- Navgpt-2: Unleashing Navigational Reasoning Capability For Large Vision-language Models Zhou Gengze, Hong Yicong, Wang Zun, Wang Xin Eric, Wu Qi
- Unibias: Unveiling And Mitigating LLM Bias Through Internal Attention And FFN Manipulation Zhou Hanzhang, Feng Zijian, Zhu Zixiao, Qian Junlang, Mao Kezhi
- Moe-lpr: Multilingual Extension Of Large Language Models Through Mixture-of-experts With Language Priors Routing Zhou Hao, Wang Zhijun, Huang Shujian, Huang Xin, Han Xue, Feng Junlan, Deng Chao, Luo Weihua, Chen Jiajun
- Leveraging Web-crawled Data For High-quality Fine-tuning Zhou Jing, Jiang Chenglin, Shen Wei, Zhou Xiao, He Xiaonan
- Mlps Compass: What Is Learned When Mlps Are Combined With Plms? Zhou Li, Chen Wenyu, Cao Yong, Zeng Dingyi, Liu Wanlong, Qu Hong
- Self-discover: Large Language Models Self-compose Reasoning Structures Zhou Pei, Pujara Jay, Ren Xiang, Chen Xinyun, Cheng Heng-tze, Le Quoc V., Chi Ed H., Zhou Denny, Mishra Swaroop, Zheng Huaixiu Steven
- Enhancing The General Agent Capabilities Of Low-parameter Llms Through Tuning And Multi-branch Reasoning Zhou Qinhao, Zhang Zihan, Xiang Xiang, Wang Ke, Wu Yuchuan, Li Yongbin
- Evaluating The Smooth Control Of Attribute Intensity In Text Generation With Llms Zhou Shang, Yao Feng, Dong Chengyu, Wang Zihan, Shang Jingbo
- Pre-trained Large Language Models Use Fourier Features To Compute Addition Zhou Tianyi, Fu Deqing, Sharan Vatsal, Jia Robin
- Explaining Pre-trained Language Models With Attribution Scores: An Analysis In Low-resource Settings Zhou Wei, Adel Heike, Schuff Hendrik, Vu Ngoc Thang
- Using Pretrained Large Language Model With Prompt Engineering To Answer Biomedical Questions Zhou Wenxin, Ngo Thuy Hang
- Shared Imagination: Llms Hallucinate Alike Zhou Yilun, Xiong Caiming, Savarese Silvio, Wu Chien-sheng
- Few-shot Adversarial Prompt Learning On Vision-language Models Zhou Yiwei, Xia Xiaobo, Lin Zhiwei, Han Bo, Liu Tongliang
- Aligning Modalities In Vision Large Language Models Via Preference Fine-tuning Zhou Yiyang, Cui Chenhang, Rafailov Rafael, Finn Chelsea, Yao Huaxiu
- Metagpt: Merging Large Language Models Using Model Exclusive Task Arithmetic Zhou Yuyan, Song Liang, Wang Bingning, Chen Weipeng
- Lawgpt: A Chinese Legal Knowledge-enhanced Large Language Model Zhou Zhi, Shi Jiang-xin, Song Peng-xiao, Yang Xiao-wen, Jin Yi-xuan, Guo Lan-zhe, Li Yu-feng
- DETAIL: Task Demonstration Attribution For Interpretable In-context Learning Zhou Zijian, Lin Xiaoqiang, Xu Xinyi, Prakash Alok, Rus Daniela, Low Bryan Kian Hsiang
- A Survey On Efficient Inference For Large Language Models Zhou Zixuan, Ning Xuefei, Hong Ke, Fu Tianyu, Xu Jiaming, Li Shiyao, Lou Yuming, Wang Luning, Yuan Zhihang, Li Xiuhong, Yan Shengen, Dai Guohao, Zhang Xiao-ping, Dong Yuhan, Wang Yu
- Can Llms "reason" In Music? An Evaluation Of Llms' Capability Of Music Understanding And Generation Zhou Ziya, Wu Yuhang, Wu Zhiyue, Zhang Xinyue, Yuan Ruibin, Ma Yinghao, Wang Lu, Benetos Emmanouil, Xue Wei, Guo Yike
- TAT-LLM: A Specialized Language Model For Discrete Reasoning Over Tabular And Textual Data Zhu Fengbin, Liu Ziyang, Feng Fuli, Wang Chao, Li Moxin, Chua Tat-seng
- FANNO: Augmenting High-quality Instruction Data With Open-sourced Llms Only Zhu He, Su Junyou, Lun Tianle, Tao Yicheng, Zhang Wenjia, Fan Zipei, Chen Guanhua
- Eliciting The Priors Of Large Language Models Using Iterated In-context Learning Zhu Jian-qiao, Griffiths Thomas L.
- Benchmarking Large Language Models On CFLUE -- A Chinese Financial Language Understanding Evaluation Dataset Zhu Jie, Li Junhui, Wen Yalong, Guo Lifan
- Relayattention For Efficient Large Language Model Serving With Long System Prompts Zhu Lei, Wang Xinjiang, Zhang Wayne, Lau Rynson W. H.
- Scalable Matmul-free Language Modeling Zhu Rui-jie, Zhang Yu, Sifferman Ethan, Sheaves Tyler, Wang Yiqiao, Richmond Dustin, Zhou Peng, Eshraghian Jason K.
- Quite Good, But Not Enough: Nationality Bias In Large Language Models -- A Case Study Of Chatgpt Zhu Shucheng, Wang Weikang, Liu Ying
- IAPT: Instruction-aware Prompt Tuning For Large Language Models Zhu Wei, Tian Aaron Xuxiang, Yin Congrui, Ni Yuan, Wang Xiaoling, Xie Guotong
- How Privacy-savvy Are Large Language Models? A Case Study On Compliance And Privacy Technical Review Zhu Xichou, Liu Yang, Shen Zhou, Liu Yi, Li Min, Chen Yujun, John Benzi, Ma Zhenzhen, Hu Tao, Yang Bolong, Wang Manman, Xie Zongxing, Liu Peng, Cai Dan, Wang Junhui
- KG-FPQ: Evaluating Factuality Hallucination In Llms With Knowledge Graph-based False Premise Questions Zhu Yanxu, Xiao Jinlin, Wang Yuhang, Sang Jitao
- Can Large Language Models Understand Context? Zhu Yilun, Moniz Joel Ruben Antony, Bhargava Shruti, Lu Jiarui, Piraviperumal Dhivya, Li Site, Zhang Yuan, Yu Hong, Tseng Bo-hsiang
- Apt-pipe: A Prompt-tuning Tool For Social Data Annotation Using Chatgpt Zhu Yiming, Yin Zhizhuo, Tyson Gareth, Haq Ehsan-ul, Lee Lik-hang, Hui Pan
- Is Larger Always Better? Evaluating And Prompting Large Language Models For Non-generative Medical Tasks Zhu Yinghao, Gao Junyi, Wang Zixiang, Liao Weibin, Zheng Xiaochen, Liang Lifang, Wang Yasha, Pan Chengwei, Harrison Ewen M., Ma Liantao
- Prompting Large Language Models For Zero-shot Clinical Prediction With Structured Longitudinal Electronic Health Record Data Zhu Yinghao, Wang Zixiang, Gao Junyi, Tong Yuning, An Jingkun, Liao Weibin, Harrison Ewen M., Ma Liantao, Pan Chengwei
- Are Large Language Models Good Statisticians? Zhu Yizhang, Du Shiyin, Li Boyan, Luo Yuyu, Tang Nan
- Generative Pre-trained Speech Language Model With Efficient Hierarchical Transformer Zhu Yongxin, Su Dan, He Liqiang, Xu Linli, Yu Dong
- Accelerating Inference Of Retrieval-augmented Generation Via Sparse Context Selection Zhu Yun, Gu Jia-chen, Sikora Caitlin, Ko Ho, Liu Yinxiao, Lin Chu-cheng, Shu Lei, Luo Liangchen, Meng Lei, Liu Bang, Chen Jindong
- Halueval-wild: Evaluating Hallucinations Of Language Models In The Wild Zhu Zhiying, Yang Yiming, Sun Zhiqing
- Structlm: Towards Building Generalist Models For Structured Knowledge Grounding Zhuang Alex, Zhang Ge, Zheng Tianyu, Du Xinrun, Wang Junjie, Ren Weiming, Huang Stephen W., Fu Jie, Yue Xiang, Chen Wenhu
- Lumina-next: Making Lumina-t2x Stronger And Faster With Next-dit Zhuo Le, Du Ruoyi, Xiao Han, Li Yangguang, Liu Dongyang, Huang Rongjie, Liu Wenze, Zhao Lirui, Wang Fu-yun, Ma Zhanyu, Luo Xu, Wang Zehan, Zhang Kaipeng, Zhu Xiangyang, Liu Si, Yue Xiangyu, Liu Dingning, Ouyang Wanli, Liu Ziwei, Qiao Yu, Li Hongsheng, Gao Peng
- VL-ICL Bench: The Devil In The Details Of Benchmarking Multimodal In-context Learning Zong Yongshuo, Bohdal Ondrej, Hospedales Timothy
- Llama-excitor: General Instruction Tuning Via Indirect Feature Interaction Zou Bo, Yang Chao, Qiao Yu, Quan Chengbin, Zhao Youjian
- Image-to-text Logic Jailbreak: Your Imagination Can Help You Do Anything Zou Xiaotian, Li Ke, Chen Yongkang
- MLKV: Multi-layer Key-value Heads For Memory Efficient Transformer Decoding Zuhri Zayd Muhammad Kawakibi, Adilazuarda Muhammad Farid, Purwarianti Ayu, Aji Alham Fikri
- Enhancing Visual Question Answering Through Question-driven Image Captions As Prompts Özdemir Övgü, Akagündüz Erdem
- BASE TTS: Lessons From Building A Billion-parameter Text-to-speech Model On 100K Hours Of Data Łajszczak Mateusz, Cámbara Guillermo, Li Yang, Beyhan Fatih, Van Korlaar Arent, Yang Fan, Joly Arnaud, Martín-cortinas Álvaro, Abbas Ammar, Michalski Adam, Moinet Alexis, Karlapati Sri, Muszyńska Ewa, Guo Haohan, Putrycz Bartosz, Gambino Soledad López, Yoo Kayeon, Sokolova Elena, Drugman Thomas
- Concept-aware Data Construction Improves In-context Learning Of Language Models Štefánik Michal, Kadlčík Marek, Sojka Petr
🏷 Multimodal Models
- Multimodal Attention For Neural Machine Translation Caglayan Ozan, Barrault Loïc, Bougares Fethi
- Multimodal Compact Bilinear Pooling For Visual Question Answering And Visual Grounding Fukui Akira, Park Dong Huk, Yang Daylen, Rohrbach Anna, Darrell Trevor, Rohrbach Marcus
- Don't Just Assume; Look And Answer: Overcoming Priors For Visual Question Answering Agrawal Aishwarya, Batra Dhruv, Parikh Devi, Kembhavi Aniruddha
- An Empirical Study On The Effectiveness Of Images In Multimodal Neural Machine Translation Delbrouck Jean-benoit, Dupont Stéphane
- Attention Strategies For Multi-source Sequence-to-sequence Learning Libovický Jindřich, Helcl Jindřich
- Tips And Tricks For Visual Question Answering: Learnings From The 2017 Challenge Teney Damien, Anderson Peter, He Xiaodong, Hengel Anton Van Den
- A Knowledge-grounded Multimodal Search-based Conversational Agent Agarwal Shubham, Dusek Ondrej, Konstas Ioannis, Rieser Verena
- LIUM-CVC Submissions For WMT18 Multimodal Translation Task Caglayan Ozan, Bardet Adrien, Bougares Fethi, Barrault Loïc, Wang Kai, Masana Marc, Herranz Luis, Van De Weijer Joost
- The Memad Submission To The WMT18 Multimodal Translation Task Grönroos Stig-arne, Huet Benoit, Kurimo Mikko, Laaksonen Jorma, Merialdo Bernard, Pham Phu, Sjöberg Mats, Sulubacak Umut, Tiedemann Jörg, Troncy Raphael, Vázquez Raúl
- Recursive Visual Attention In Visual Dialog Niu Yulei, Zhang Hanwang, Zhang Manli, Zhang Jianhong, Lu Zhiwu, Wen Ji-rong
- Multimodal Explanations: Justifying Decisions And Pointing To The Evidence Park Dong Huk, Hendricks Lisa Anne, Akata Zeynep, Rohrbach Anna, Schiele Bernt, Darrell Trevor, Rohrbach Marcus
- Dynamic Fusion With Intra- And Inter-modality Attention Flow For Visual Question Answering Gao Peng, Jiang Zhengkai, You Haoxuan, Lu Pan, Hoi Steven, Wang Xiaogang, Li Hongsheng
- Adapting Visual Question Answering Models For Enhancing Multimodal Community Q&A Platforms Srivastava Avikalp, Liu Hsin Wen, Fujita Sumio
- Dual Ask-answer Network For Machine Reading Comprehension Xiao Han, Wang Feng, Yan Jianfeng, Zheng Jingyao
- A Visual Attention Grounding Neural Model For Multimodal Machine Translation Zhou Mingyang, Cheng Runxiang, Lee Yong Jae, Yu Zhou
- Fusion Of Detected Objects In Text For Visual Question Answering Alberti Chris, Ling Jeffrey, Collins Michael, Reitter David
- Language Features Matter: Effective Language Representations For Vision-language Tasks Burns Andrea, Tan Reuben, Saenko Kate, Sclaroff Stan, Plummer Bryan A.
- DMRM: A Dual-channel Multi-hop Reasoning Model For Visual Dialog Chen Feilong, Meng Fandong, Xu Jiaming, Li Peng, Xu Bo, Zhou Jie
- Heterogeneous Memory Enhanced Multimodal Attention Model For Video Question Answering Fan Chenyou, Zhang Xiaofan, Zhang Shu, Wang Wensheng, Zhang Chi, Huang Heng
- Question-agnostic Attention For Visual Question Answering Farazi Moshiur R, Khan Salman H, Barnes Nick
- Iterative Answer Prediction With Pointer-augmented Multimodal Transformers For Textvqa Hu Ronghang, Singh Amanpreet, Darrell Trevor, Rohrbach Marcus
- All-in-one Image-grounded Conversational Agents Ju Da, Shuster Kurt, Boureau Y-lan, Weston Jason
- Weak Supervision Helps Emergence Of Word-object Alignment And Improves Vision-language Tasks Kervadec Corentin, Antipov Grigory, Baccouche Moez, Wolf Christian
- Unicoder-vl: A Universal Encoder For Vision And Language By Cross-modal Pre-training Li Gen, Duan Nan, Fang Yuejian, Gong Ming, Jiang Daxin, Zhou Ming
- Probing Representations Learned By Multimodal Recurrent And Transformer Models Libovický Jindřich, Madhyastha Pranava
- On Leveraging The Visual Modality For Neural Machine Translation Raunak Vikas, Choe Sang Keun, Lu Quanyang, Xu Yi, Metze Florian
- LXMERT: Learning Cross-modality Encoder Representations From Transformers Tan Hao, Bansal Mohit
- Video Question Generation Via Cross-modal Self-attention Networks Learning Wang Yu-siang, Su Hung-ting, Chang Chen-hsi, Liu Zhe-yu, Hsu Winston H.
- Transformer-based Cascaded Multimodal Speech Translation Wu Zixiu, Caglayan Ozan, Ive Julia, Wang Josiah, Specia Lucia
- Visual Entailment: A Novel Task For Fine-grained Image Understanding Xie Ning, Lai Farley, Doran Derek, Kadav Asim
- Making History Matter: History-advantage Sequence Training For Visual Dialog Yang Tianhao, Zha Zheng-jun, Zhang Hanwang
- Multimodal Unified Attention Networks For Vision-and-language Interactions Yu Zhou, Cui Yuhao, Yu Jun, Tao Dacheng, Tian Qi
- Unified Vision-language Pre-training For Image Captioning And VQA Zhou Luowei, Palangi Hamid, Zhang Lei, Hu Houdong, Corso Jason J., Gao Jianfeng
- Vision-language Navigation With Self-supervised Auxiliary Reasoning Tasks Zhu Fengda, Zhu Yi, Chang Xiaojun, Liang Xiaodan
- A Study On Multimodal And Interactive Explanations For Visual Question Answering Alipour Kamran, Schulze Jurgen P., Yao Yi, Ziskind Avi, Burachas Giedrius
- Adaptive Transformers For Learning Multimodal Representations Bhargava Prajjwal
- Behind The Scene: Revealing The Secrets Of Pre-trained Vision-and-language Models Cao Jize, Gan Zhe, Cheng Yu, Yu Licheng, Chen Yen-chun, Liu Jingjing
- X-LXMERT: Paint, Caption And Answer Questions With Multi-modal Transformers Cho Jaemin, Lu Jiasen, Schwenk Dustin, Hajishirzi Hannaneh, Kembhavi Aniruddha
- Attention Guided Semantic Relationship Parsing For Visual Question Answering Farazi Moshiur, Khan Salman, Barnes Nick
- ISAAQ -- Mastering Textbook Questions With Pre-trained Transformers And Bottom-up And Top-down Attention Gomez-perez Jose Manuel, Ortega Raul
- Towards Learning A Generic Agent For Vision-and-language Navigation Via Pre-training Hao Weituo, Li Chunyuan, Li Xiujun, Carin Lawrence, Gao Jianfeng
- A Closer Look At The Robustness Of Vision-and-language Pre-trained Models Li Linjie, Gan Zhe, Liu Jingjing
- TMT: A Transformer-based Modal Translator For Improving Multimodal Sequence Representations In Audio Visual Scene-aware Dialog Li Wubo, Jiang Dongwei, Zou Wei, Li Xiangang
- Multilingual Speech Translation With Efficient Finetuning Of Pretrained Models Li Xian, Wang Changhan, Tang Yun, Tran Chau, Tang Yuqing, Pino Juan, Baevski Alexei, Conneau Alexis, Auli Michael
- A Comparison Of Pre-trained Vision-and-language Models For Multimodal Representation Learning Across Medical Images And Reports Li Yikuan, Wang Hanyin, Luo Yuan
- Bridging Text And Video: A Universal Multimodal Transformer For Video-audio Scene-aware Dialog Li Zekang, Li Zongjia, Zhang Jinchao, Feng Yang, Niu Cheng, Zhou Jie
- TIME: Text And Image Mutual-translation Adversarial Networks Liu Bingchen, Song Kunpeng, Zhu Yizhe, De Melo Gerard, Elgammal Ahmed
- Lambert: Language And Action Learning Using Multimodal BERT Miyazawa Kazuki, Aoki Tatsuya, Horii Takato, Nagai Takayuki
- Alleviating The Burden Of Labeling: Sentence Generation By Attention Branch Encoder-decoder Network Ogura Tadashi, Magassouba Aly, Sugiura Komei, Hirakawa Tsubasa, Yamashita Takayoshi, Fujiyoshi Hironobu, Kawai Hisashi
- Multi-view Attention Network For Visual Dialog Park Sungjin, Whang Taesun, Yoon Yeochan, Lim Heuiseok
- What BERT Sees: Cross-modal Transfer For Visual Question Generation Scialom Thomas, Bordes Patrick, Dray Paul-alexis, Staiano Jacopo, Gallinari Patrick
- Look Before You Speak: Visually Contextualized Utterances Seo Paul Hongsuck, Nagrani Arsha, Schmid Cordelia
- Vokenization: Improving Language Understanding With Contextualized, Visual-grounded Supervision Tan Hao, Bansal Mohit
- VD-BERT: A Unified Vision And Dialog Transformer With BERT Wang Yue, Joty Shafiq, Lyu Michael R., King Irwin, Xiong Caiming, Hoi Steven C. H.
- On Modality Bias In The TVQA Dataset Winterbottom Thomas, Xiao Sarah, Mclean Alistair, Moubayed Noura Al
- Just Ask: Learning To Answer Questions From Millions Of Narrated Videos Yang Antoine, Miech Antoine, Sivic Josef, Laptev Ivan, Schmid Cordelia
- Seqdialn: Sequential Visual Dialog Networks In Joint Visual-linguistic Representation Space Yang Liu
- Vision-dialog Navigation By Exploring Cross-modal Memory Zhu Yi, Zhu Fengda, Zhan Zhaohuan, Lin Bingqian, Jiao Jianbin, Chang Xiaojun, Liang Xiaodan
- See, Hear, Read: Leveraging Multimodality With Guided Attention For Abstractive Text Summarization Atri Yash Kumar, Pramanick Shraman, Goyal Vikram, Chakraborty Tanmoy
- Latr: Layout-aware Transformer For Scene-text VQA Biten Ali Furkan, Litman Ron, Xie Yusheng, Appalaraju Srikar, Manmatha R.
- Gaudí: Conversational Interactions With Deep Representations To Generate Image Collections Bursztyn Victor S., Healey Jennifer, Vinay Vishwa
- Multimodal Incremental Transformer With Visual Grounding For Visual Dialogue Generation Chen Feilong, Meng Fandong, Chen Xiuyi, Li Peng, Zhou Jie
- Unifying Vision-and-language Tasks Via Text Generation Cho Jaemin, Lei Jie, Tan Hao, Bansal Mohit
- EKTVQA: Generalized Use Of External Knowledge To Empower Scene Text In Text-vqa Dey Arka Ujjal, Valveny Ernest, Harit Gaurav
- Cogview: Mastering Text-to-image Generation Via Transformers Ding Ming, Yang Zhuoyi, Hong Wenyi, Zheng Wendi, Zhou Chang, Yin Da, Lin Junyang, Zou Xu, Shao Zhou, Yang Hongxia, Tang Jie
- An Empirical Study Of Training End-to-end Vision-and-language Transformers Dou Zi-yi, Xu Yichong, Gan Zhe, Wang Jianfeng, Wang Shuohang, Wang Lijuan, Zhu Chenguang, Zhang Pengchuan, Yuan Lu, Peng Nanyun, Liu Zicheng, Zeng Michael
- Does CLIP Benefit Visual Question Answering In The Medical Domain As Much As It Does In The General Domain? Eslami Sedigheh, De Melo Gerard, Meinel Christoph
- Latent Alignment Of Procedural Concepts In Multimodal Recipes Faghihi Hossein Rajaby, Mirzaee Roshanak, Paliwal Sudarshan, Kordjamshidi Parisa
- Retrieve, Caption, Generate: Visual Grounding For Enhancing Commonsense In Text Generation Models Feng Steven Y., Lu Kevin, Tao Zhuofu, Alikhani Malihe, Mitamura Teruko, Hovy Eduard, Gangal Varun
- Vision-and-language Or Vision-for-language? On Cross-modal Influence In Multimodal Transformers Frank Stella, Bugliarello Emanuele, Elliott Desmond
- Towards Trustworthy Deception Detection: Benchmarking Model Robustness Across Domains, Modalities, And Languages Glenski Maria, Ayton Ellyn, Cosbey Robin, Arendt Dustin, Volkova Svitlana
- KAT: A Knowledge Augmented Transformer For Vision-and-language Gui Liangke, Wang Borui, Huang Qiuyuan, Hauptmann Alex, Bisk Yonatan, Gao Jianfeng
- Unifying Multimodal Transformer For Bi-directional Image And Text Generation Huang Yupan, Xue Hongwei, Liu Bei, Lu Yutong
- Audio-oriented Multimodal Machine Comprehension: Task, Dataset And Model Huang Zhiqi, Liu Fenglin, Wu Xian, Ge Shen, Wang Helin, Fan Wei, Zou Yuexian
- Target-dependent UNITER: A Transformer-based Multimodal Language Comprehension Model For Domestic Service Robots Ishikawa Shintaro, Sugiura Komei
- Visqa: X-raying Vision And Language Reasoning In Transformers Jaunet Theo, Kervadec Corentin, Vuillemot Romain, Antipov Grigory, Baccouche Moez, Wolf Christian
- A Good Prompt Is Worth Millions Of Parameters: Low-resource Prompt-based Learning For Vision-language Models Jin Woojeong, Cheng Yu, Shen Yelong, Chen Weizhu, Ren Xiang
- L-verse: Bidirectional Generation Between Image And Text Kim Taehoon, Song Gwangmo, Lee Sihaeng, Kim Sangyun, Seo Yewon, Lee Soonyoung, Kim Seung Hwan, Lee Honglak, Bae Kyunghoon
- What BERT Based Language Models Learn In Spoken Transcripts: An Empirical Study Kumar Ayush, Sundararaman Mukuntha Narayanan, Vepa Jithendra
- Semvlp: Vision-language Pre-training By Aligning Semantics At Multiple Levels Li Chenliang, Yan Ming, Xu Haiyang, Luo Fuli, Wang Wei, Bi Bin, Huang Songfang
- Align Before Fuse: Vision And Language Representation Learning With Momentum Distillation Li Junnan, Selvaraju Ramprasaath R., Gotmare Akhilesh Deepak, Joty Shafiq, Xiong Caiming, Hoi Steven
- Supervision Exists Everywhere: A Data Efficient Contrastive Language-image Pre-training Paradigm Li Yangguang, Liang Feng, Zhao Lichen, Cui Yufeng, Ouyang Wanli, Shao Jing, Yu Fengwei, Yan Junjie
- Scheduled Sampling In Vision-language Pretraining With Decoupled Encoder-decoder Network Li Yehao, Pan Yingwei, Yao Ting, Chen Jingwen, Mei Tao
- Multimodal Transformer With Variable-length Memory For Vision-and-language Navigation Lin Chuang, Jiang Yi, Cai Jianfei, Qu Lizhen, Haffari Gholamreza, Yuan Zehuan
- VX2TEXT: End-to-end Learning Of Video-based Text Generation From Multimodal Inputs Lin Xudong, Bertasius Gedas, Wang Jue, Chang Shih-fu, Parikh Devi, Torresani Lorenzo
- Unified Multimodal Pre-training And Prompt-based Tuning For Vision-language Understanding And Generation Liu Tianyi, Wu Zuxuan, Xiong Wenhan, Chen Jingjing, Jiang Yu-gang
- Iconqa: A New Benchmark For Abstract Diagram Understanding And Visual Language Reasoning Lu Pan, Qiu Liang, Chen Jiaqi, Xia Tony, Zhao Yizhou, Zhang Wei, Yu Zhou, Liang Xiaodan, Zhu Song-chun
- BERTGEN: Multi-task Generation Through BERT Mitzalis Faidon, Caglayan Ozan, Madhyastha Pranava, Specia Lucia
- Multi-modal Understanding And Generation For Medical Images And Text Via Vision-language Pre-training Moon Jong Hak, Lee Hyungyung, Shin Woncheol, Kim Young-hak, Choi Edward
- MLP Architectures For Vision-and-language Modeling: An Empirical Study Nie Yixin, Li Linjie, Gan Zhe, Wang Shuohang, Zhu Chenguang, Zeng Michael, Liu Zicheng, Bansal Mohit, Wang Lijuan
- Episodic Transformer For Vision-and-language Navigation Pashevich Alexander, Schmid Cordelia, Sun Chen
- VT-CLIP: Enhancing Vision-language Models With Visual-guided Texts Qiu Longtian, Zhang Renrui, Guo Ziyu, Zeng Ziyao, Guo Zilu, Li Yafeng, Zhang Guangnan
- Don't Sweep Your Learning Rate Under The Rug: A Closer Look At Cross-modal Transfer Of Pretrained Transformers Rothermel Danielle, Li Margaret, Rocktäschel Tim, Foerster Jakob
- Challenges In Procedural Multimodal Machine Comprehension: A Novel Way To Benchmark Sahu Pritish, Sikka Karan, Divakaran Ajay
- Towards Solving Multimodal Comprehension Sahu Pritish, Sikka Karan, Divakaran Ajay
- LAION-400M: Open Dataset Of Clip-filtered 400 Million Image-text Pairs Schuhmann Christoph, Vencu Richard, Beaumont Romain, Kaczmarczyk Robert, Mullis Clayton, Katta Aarush, Coombes Theo, Jitsev Jenia, Komatsuzaki Aran
- Audio-visual Scene-aware Dialog And Reasoning Using Audio-visual Transformers With Joint Student-teacher Learning Shah Ankit P., Geng Shijie, Gao Peng, Cherian Anoop, Hori Takaaki, Marks Tim K., Roux Jonathan Le, Hori Chiori
- Laviter: Learning Aligned Visual And Textual Representations Assisted By Image And Caption Generation Shaikh Mohammad Abuzar, Ji Zhanghexuan, Moukheiber Dana, Shen Yan, Srihari Sargur, Gao Mingchen
- Multimodal Integration Of Human-like Attention In Visual Question Answering Sood Ekta, Kögel Fabian, Müller Philipp, Thomas Dominike, Bace Mihai, Bulling Andreas
- VQA-MHUG: A Gaze Dataset To Study Multimodal Neural Attention In Visual Question Answering Sood Ekta, Kögel Fabian, Strohm Florian, Dhar Prajit, Bulling Andreas
- Worst Of Both Worlds: Biases Compound In Pre-trained Vision-and-language Models Srinivasan Tejas, Bisk Yonatan
- Txt: Crossmodal End-to-end Learning With Transformers Steitz Jan-martin O., Pfeiffer Jonas, Gurevych Iryna, Roth Stefan
- A Non-hierarchical Attention Network With Modality Dropout For Textual Response Generation In Multimodal Dialogue Systems Sun Rongyi, Chen Borun, Zhou Qingyu, Li Yinghui, Cao Yunbo, Zheng Hai-tao
- Lightningdot: Pre-training Visual-semantic Embeddings For Real-time Image-text Retrieval Sun Siqi, Chen Yen-chun, Li Linjie, Wang Shuohang, Fang Yuwei, Liu Jingjing
- Multimodal Few-shot Learning With Frozen Language Models Tsimpoukelli Maria, Menick Jacob, Cabi Serkan, Eslami S. M. Ali, Vinyals Oriol, Hill Felix
- Lightseq2: Accelerated Training For Transformer-based Models On Gpus Wang Xiaohui, Wei Yang, Xiong Ying, Huang Guyue, Qian Xian, Ding Yufei, Wang Mingxuan, Li Lei
- Distilled Dual-encoder Model For Vision-language Understanding Wang Zekun, Wang Wenhui, Zhu Haichao, Liu Ming, Qin Bing, Wei Furu
- Moca: Incorporating Multi-stage Domain Pretraining And Cross-guided Multimodal Attention For Textbook Question Answering Xu Fangzhi, Lin Qika, Liu Jun, Zhang Lingling, Zhao Tianzhe, Chai Qi, Pan Yudai
- E2E-VLP: End-to-end Vision-language Pre-training Enhanced By Visual Learning Xu Haiyang, Yan Ming, Li Chenliang, Bi Bin, Huang Songfang, Xiao Wenming, Huang Fei
- VLM: Task-agnostic Video-language Model Pre-training For Video Understanding Xu Hu, Ghosh Gargi, Huang Po-yao, Arora Prahal, Aminzadeh Masoumeh, Feichtenhofer Christoph, Metze Florian, Zettlemoyer Luke
- Causal Attention For Vision-language Tasks Yang Xu, Zhang Hanwang, Qi Guojun, Cai Jianfei
- An Empirical Study Of GPT-3 For Few-shot Knowledge-based VQA Yang Zhengyuan, Gan Zhe, Wang Jianfeng, Hu Xiaowei, Lu Yumao, Liu Zicheng, Wang Lijuan
- Ernie-vilg: Unified Generative Pre-training For Bidirectional Vision-language Generation Zhang Han, Yin Weichong, Fang Yewei, Li Lanxin, Duan Boqiang, Wu Zhihua, Sun Yu, Tian Hao, Wu Hua, Wang Haifeng
- LAFITE: Towards Language-free Training For Text-to-image Generation Zhou Yufan, Zhang Ruiyi, Chen Changyou, Li Chunyuan, Tensmeyer Chris, Yu Tong, Gu Jiuxiang, Xu Jinhui, Sun Tong
- Vl-interpret: An Interactive Visualization Tool For Interpreting Vision-language Transformers Aflalo Estelle, Du Meng, Tseng Shao-yen, Liu Yongfei, Wu Chenfei, Duan Nan, Lal Vasudev
- Multimodal Sequential Generative Models For Semi-supervised Language Instruction Following Akuzawa Kei, Iwasawa Yusuke, Matsuo Yutaka
- Compound Tokens: Channel Fusion For Vision-language Representation Learning Aladago Maxwell Mbabilla, Piergiovanni Aj
- End-to-end Multimodal Representation Learning For Video Dialog Alamri Huda, Bilic Anthony, Hu Michael, Beedu Apoorva, Essa Irfan
- Socratic Models: Composing Zero-shot Multimodal Reasoning With Language Zeng Andy, Attarian Maria, Ichter Brian, Choromanski Krzysztof, Wong Adrian, Welker Stefan, Tombari Federico, Purohit Aveek, Ryoo Michael, Sindhwani Vikas, Lee Johnny, Vanhoucke Vincent, Florence Pete
- Vl-beit: Generative Vision-language Pretraining Bao Hangbo, Wang Wenhui, Dong Li, Wei Furu
- The Myth Of Culturally Agnostic AI Models Cetinic Eva
- Murag: Multimodal Retrieval-augmented Generator For Open Question Answering Over Images And Text Chen Wenhu, Hu Hexiang, Chen Xi, Verga Pat, Cohen William W.
- Pali: A Jointly-scaled Multilingual Language-image Model Chen Xi, Wang Xiao, Changpinyo Soravit, Piergiovanni Aj, Padlewski Piotr, Salz Daniel, Goodman Sebastian, Grycner Adam, Mustafa Basil, Beyer Lucas, Kolesnikov Alexander, Puigcerver Joan, Ding Nan, Rong Keran, Akbari Hassan, Mishra Gaurav, Xue Linting, Thapliyal Ashish, Bradbury James, Kuo Weicheng, Seyedhosseini Mojtaba, Jia Chao, Ayan Burcu Karagol, Riquelme Carlos, Steiner Andreas, Angelova Anelia, Zhai Xiaohua, Houlsby Neil, Soricut Radu
- Altclip: Altering The Language Encoder In CLIP For Extended Language Capabilities Chen Zhongzhi, Liu Guang, Zhang Bo-wen, Ye Fulong, Yang Qinghong, Wu Ledell
- Vault: Augmenting The Vision-and-language Transformer For Sentiment Classification On Social Media Chochlakis Georgios, Srinivasan Tejas, Thomason Jesse, Narayanan Shrikanth
- VQGAN-CLIP: Open Domain Image Generation And Editing With Natural Language Guidance Crowson Katherine, Biderman Stella, Kornis Daniel, Stander Dashiell, Hallahan Eric, Castricato Louis, Raff Edward
- Enabling Multimodal Generation On CLIP Via Vision-language Knowledge Distillation Dai Wenliang, Hou Lu, Shang Lifeng, Jiang Xin, Liu Qun, Fung Pascale
- Write And Paint: Generative Vision-language Models Are Unified Modal Learners Diao Shizhe, Zhou Wangchunshu, Zhang Xinsong, Wang Jiawei
- Cogview2: Faster And Better Text-to-image Generation Via Hierarchical Transformers Ding Ming, Zheng Wendi, Hong Wenyi, Tang Jie
- Coarse-to-fine Vision-language Pre-training With Fusion In The Backbone Dou Zi-yi, Kamath Aishwarya, Gan Zhe, Zhang Pengchuan, Wang Jianfeng, Li Linjie, Liu Zicheng, Liu Ce, Lecun Yann, Peng Nanyun, Gao Jianfeng, Wang Lijuan
- A Survey Of Vision-language Pre-trained Models Du Yifan, Liu Zikang, Li Junyi, Zhao Wayne Xin
- Revision Transformers: Instructing Language Models To Change Their Values Friedrich Felix, Stammer Wolfgang, Schramowski Patrick, Kersting Kristian
- WAVPROMPT: Towards Few-shot Spoken Language Understanding With Frozen Language Models Gao Heting, Ni Junrui, Qian Kaizhi, Zhang Yang, Chang Shiyu, Hasegawa-johnson Mark
- Visual Prompt Tuning For Test-time Domain Adaptation Gao Yunhe, Shi Xingjian, Zhu Yi, Wang Hao, Tang Zhiqiang, Zhou Xiong, Li Mu, Metaxas Dimitris N.
- Unixcoder: Unified Cross-modal Pre-training For Code Representation Guo Daya, Lu Shuai, Duan Nan, Wang Yanlin, Zhou Ming, Yin Jian
- LVP-M3: Language-aware Visual Prompt For Multilingual Multimodal Machine Translation Guo Hongcheng, Liu Jiaheng, Huang Haoyang, Yang Jian, Li Zhoujun, Zhang Dongdong, Cui Zheng, Wei Furu
- Supervised Visual Attention For Simultaneous Multimodal Machine Translation Haralampieva Veneta, Caglayan Ozan, Specia Lucia
- VLMAE: Vision-language Masked Autoencoder He Sunan, Guo Taian, Dai Tao, Qiao Ruizhi, Wu Chen, Shu Xiujun, Ren Bo
- XDBERT: Distilling Visual Information To BERT From Cross-modal Systems To Improve Language Understanding Hsu Chan-jan, Lee Hung-yi, Tsao Yu
- Unified Discrete Diffusion For Simultaneous Vision-language Generation Hu Minghui, Zheng Chuanxia, Zheng Heliang, Cham Tat-jen, Wang Chaoyue, Yang Zuopeng, Tao Dacheng, Suganthan Ponnuthurai N.
- REVEAL: Retrieval-augmented Visual-language Pre-training With Multi-source Multimodal Knowledge Memory Hu Ziniu, Iscen Ahmet, Sun Chen, Wang Zirui, Chang Kai-wei, Sun Yizhou, Schmid Cordelia, Ross David A., Fathi Alireza
- Layoutlmv3: Pre-training For Document AI With Unified Text And Image Masking Huang Yupan, Lv Tengchao, Cui Lei, Lu Yutong, Wei Furu
- Pseudo-q: Generating Pseudo Language Queries For Visual Grounding Jiang Haojun, Lin Yuanze, Han Dongchen, Song Shiji, Huang Gao
- A New Path: Scaling Vision-and-language Navigation With Synthetic Instructions And Imitation Learning Kamath Aishwarya, Anderson Peter, Wang Su, Koh Jing Yu, Ku Alexander, Waters Austin, Yang Yinfei, Baldridge Jason, Parekh Zarana
- Guiding Visual Question Answering With Attention Priors Le Thao Minh, Le Vuong, Gupta Sunil, Venkatesh Svetha, Tran Truyen
- Dialogcc: An Automated Pipeline For Creating High-quality Multi-modal Dialogue Dataset Lee Young-jun, Ko Byungsoo, Kim Han-gyu, Hyeon Jonghwan, Choi Ho-jin
- Bloom Library: Multimodal Datasets In 300+ Languages For A Variety Of Downstream Tasks Leong Colin, Nemecek Joshua, Mansdorfer Jacob, Filighera Anna, Owodunni Abraham, Whitenack Daniel
- Fine-grained Semantically Aligned Vision-language Pre-training Li Juncheng, He Xin, Wei Longhui, Qian Long, Zhu Linchao, Xie Lingxi, Zhuang Yueting, Tian Qi, Tang Siliang
- LAVENDER: Unifying Video-language Understanding As Masked Language Modeling Li Linjie, Gan Zhe, Lin Kevin, Lin Chung-ching, Liu Zicheng, Liu Ce, Wang Lijuan
- Modeling Coreference Relations In Visual Dialog Li Mingxiao, Moens Marie-francine
- Uni-eden: Universal Encoder-decoder Network By Multi-granular Vision-language Pre-training Li Yehao, Fan Jiahao, Pan Yingwei, Yao Ting, Lin Weiyao, Mei Tao
- VALHALLA: Visual Hallucination For Machine Translation Li Yi, Panda Rameswar, Kim Yoon, Chen Chun-fu, Feris Rogerio, Cox David, Vasconcelos Nuno
- Exploring Visual Interpretability For Contrastive Language-image Pre-training Li Yi, Wang Hualiang, Duan Yiqun, Xu Hang, Li Xiaomeng
- ADAPT: Vision-language Navigation With Modality-aligned Action Prompts Lin Bingqian, Zhu Yi, Chen Zicong, Liang Xiwen, Liu Jianzhuang, Liang Xiaodan
- Vision-language Pre-training For Multimodal Aspect-based Sentiment Analysis Ling Yan, Yu Jianfei, Xia Rui
- Delving Deeper Into Cross-lingual Visual Question Answering Liu Chen, Pfeiffer Jonas, Korhonen Anna, Vulić Ivan, Gurevych Iryna
- Instruction-following Agents With Multimodal Transformer Liu Hao, Lee Lisa, Lee Kimin, Abbeel Pieter
- Commitbart: A Large Pre-trained Model For Github Commits Liu Shangqing, Li Yanzhou, Xie Xiaofei, Liu Yang
- Declaration-based Prompt Tuning For Visual Question Answering Liu Yuhang, Wei Wei, Peng Daowan, Zhu Feida
- Prompt Generation Networks For Input-space Adaptation Of Frozen Vision Transformers Loedeman Jochem, Stol Maarten C., Han Tengda, Asano Yuki M.
- Learn To Explain: Multimodal Reasoning Via Thought Chains For Science Question Answering Lu Pan, Mishra Swaroop, Xia Tony, Qiu Liang, Chang Kai-wei, Zhu Song-chun, Tafjord Oyvind, Clark Peter, Kalyan Ashwin
- Imagination-augmented Natural Language Understanding Lu Yujie, Zhu Wanrong, Wang Xin Eric, Eckstein Miguel, Wang William Yang
- A Frustratingly Simple Approach For End-to-end Image Captioning Luo Ziyang, Xi Yadong, Zhang Rongsheng, Ma Jing
- Do Vision-and-language Transformers Learn Grounded Predicate-noun Dependencies? Nikolaus Mitja, Salin Emmanuelle, Ayache Stephane, Fourtassi Abdellah, Favre Benoit
- Improving The Cross-lingual Generalisation In Visual Question Answering Nooralahzadeh Farhad, Sennrich Rico
- Silo Nlp's Participation At WAT2022 Parida Shantipriya, Panda Subhadarshi, Grönroos Stig-arne, Granroth-wilding Mark, Koistinen Mika
- Relation-aware Language-graph Transformer For Question Answering Park Jinyoung, Choi Hyeong Kyu, Ko Juyeon, Park Hyeonjin, Kim Ji-hoon, Jeong Jisu, Kim Kyungmin, Kim Hyunwoo J.
- Xfboost: Improving Text Generation With Controllable Decoders Peng Xiangyu, Sollami Michael
- Pre-training Image-language Transformers For Open-vocabulary Tasks Piergiovanni Aj, Kuo Weicheng, Angelova Anelia
- Answer-me: Multi-task Open-vocabulary Visual Question Answering Piergiovanni Aj, Li Wei, Kuo Weicheng, Saffar Mohammad, Bertsch Fred, Angelova Anelia
- Harnessing The Power Of Multi-task Pretraining For Ground-truth Level Natural Language Explanations Plüster Björn, Ambsdorf Jakob, Braach Lukas, Lee Jae Hee, Wermter Stefan
- VLC-BERT: Visual Question Answering With Contextualized Commonsense Knowledge Ravi Sahithya, Chinchure Aditya, Sigal Leonid, Liao Renjie, Shwartz Vered
- Super-prompting: Utilizing Model-independent Contextual Data To Reduce Data Annotation Required In Visual Commonsense Tasks Rezaei Navid, Reformat Marek Z.
- NLX-GPT: A Model For Natural Language Explanations In Vision And Vision-language Tasks Sammani Fawaz, Mukherjee Tanmoy, Deligiannis Nikos
- Ebms Vs. CL: Exploring Self-supervised Visual Pretraining For Visual Question Answering Shevchenko Violetta, Abbasnejad Ehsan, Dick Anthony, Hengel Anton Van Den, Teney Damien
- On The Efficacy Of Co-attention Transformer Layers In Visual Question Answering Sikarwar Ankur, Kreiman Gabriel
- CLIP Models Are Few-shot Learners: Empirical Studies On VQA And Visual Entailment Song Haoyu, Dong Li, Zhang Wei-nan, Liu Ting, Wei Furu
- Climb: A Continual Learning Benchmark For Vision-and-language Tasks Srinivasan Tejas, Chang Ting-yun, Alva Leticia Leonor Pinto, Chochlakis Georgios, Rostami Mohammad, Thomason Jesse
- Towards Zero-shot And Few-shot Table Question Answering Using GPT-3 Srivastava Pragya, Ganu Tanuja, Guha Saikat
- Long-form Video-language Pre-training With Multimodal Temporal Contrastive Learning Sun Yuchong, Xue Hongwei, Song Ruihua, Liu Bei, Yang Huan, Fu Jianlong
- Towards Reasoning-aware Explainable VQA Vaideeswaran Rakesh, Gao Feng, Mathur Abhinav, Thattai Govind
- UGIF: UI Grounded Instruction Following Venkatesh Sagar Gubbi, Talukdar Partha, Narayanan Srini
- Learning To Decompose Visual Features With Latent Textual Prompts Wang Feng, Li Manling, Lin Xudong, Lv Hairong, Schwing Alexander G., Ji Heng
- TAG: Boosting Text-vqa Via Text-aware Visual Question-answer Generation Wang Jun, Gao Mingfei, Hu Yuqian, Selvaraju Ramprasaath R., Ramaiah Chetan, Xu Ran, Jaja Joseph F., Davis Larry S.
- Zero-shot Image Captioning By Anchor-augmented Vision-language Space Alignment Wang Junyang, Zhang Yi, Yan Ming, Zhang Ji, Sang Jitao
- Visually-augmented Language Modeling Wang Weizhi, Dong Li, Cheng Hao, Song Haoyu, Liu Xiaodong, Yan Xifeng, Gao Jianfeng, Wei Furu
- Image As A Foreign Language: Beit Pretraining For All Vision And Vision-language Tasks Wang Wenhui, Bao Hangbo, Dong Li, Bjorck Johan, Peng Zhiliang, Liu Qiang, Aggarwal Kriti, Mohammed Owais Khan, Singhal Saksham, Som Subhojit, Wei Furu
- Multimodal Adaptive Distillation For Leveraging Unimodal Encoders For Vision-language Tasks Wang Zhecan, Codella Noel, Chen Yen-chun, Zhou Luowei, Dai Xiyang, Xiao Bin, Yang Jianwei, You Haoxuan, Chang Kai-wei, Chang Shih-fu, Yuan Lu
- Unified Multimodal Model With Unlikelihood Training For Visual Dialog Wang Zihao, Wang Junli, Jiang Changjun
- Exploring The Efficacy Of Pre-trained Checkpoints In Text-to-music Generation Task Wu Shangda, Sun Maosong
- Dual Modality Prompt Tuning For Vision-language Pre-trained Model Xing Yinghui, Wu Qirui, Cheng De, Zhang Shizhou, Liang Guoqiang, Wang Peng, Zhang Yanning
- Integrating Translation Memories Into Non-autoregressive Machine Translation Xu Jitao, Crego Josep, Yvon François
- Bridgetower: Building Bridges Between Encoders In Vision-language Representation Learning Xu Xiao, Wu Chenfei, Rosenman Shachar, Lal Vasudev, Che Wanxiang, Duan Nan
- Generative Negative Text Replay For Continual Vision-language Pretraining Yan Shipeng, Hong Lanqing, Xu Hang, Han Jianhua, Tuytelaars Tinne, Li Zhenguo, He Xuming
- Learning To Answer Visual Questions From Web Videos Yang Antoine, Miech Antoine, Sivic Josef, Laptev Ivan, Schmid Cordelia
- Language Models Are General-purpose Interfaces Yaru Hao, Haoyu Song, Li Dong, Shaohan Huang, Zewen Chi, Wenhui Wang, Shuming Ma, Furu Wei
- Retrieval-augmented Multimodal Language Modeling Yasunaga Michihiro, Aghajanyan Armen, Shi Weijia, James Rich, Leskovec Jure, Liang Percy, Lewis Mike, Zettlemoyer Luke, Yih Wen-tau
- Wudaomm: A Large-scale Multi-modal Dataset For Pre-training Models Yuan Sha, Zhao Shuai, Leng Jiahong, Xue Zhao, Zhao Hanyu, Liu Peiyu, Gong Zheng, Zhao Wayne Xin, Li Junyi, Tang Jie
- Leveraging Vision-language Models For Improving Domain Generalization In Image Classification Addepalli Sravanti, Asokan Ashish Ramayee, Sharma Lakshay, Babu R. Venkatesh
- MEGAVERSE: Benchmarking Large Language Models Across Languages, Modalities, Models And Tasks Ahuja Sanchit, Aggarwal Divyanshu, Gumma Varun, Watts Ishaan, Sathe Ashutosh, Ochieng Millicent, Hada Rishav, Jain Prachi, Axmed Maxamed, Bali Kalika, Sitaram Sunayana
- Jointly Training Large Autoregressive Multimodal Models Aiello Emanuele, Yu Lili, Nie Yixin, Aghajanyan Armen, Oguz Barlas
- Multimodal Representations For Teacher-guided Compositional Visual Reasoning Aissa Wafa, Ferecatu Marin, Crucianu Michel
- Pixt3: Pixel-based Table-to-text Generation Alonso Iñigo, Agirre Eneko, Lapata Mirella
- Is BERT Blind? Exploring The Effect Of Vision-and-language Pretraining On Visual Language Understanding Alper Morris, Fiman Michael, Averbuch-elor Hadar
- Openleaf: Open-domain Interleaved Image-text Generation And Evaluation An Jie, Yang Zhengyuan, Li Linjie, Wang Jianfeng, Lin Kevin, Liu Zicheng, Wang Lijuan, Luo Jiebo
- Llm2loss: Leveraging Language Models For Explainable Model Diagnostics Ardeshir Shervin
- Towards A Unified Multimodal Reasoning Framework Arun Abhinav, Mal Dipendra Singh, Soni Mehul, Sawada Tomohiro
- Openflamingo: An Open-source Framework For Training Large Autoregressive Vision-language Models Awadalla Anas, Gao Irena, Gardner Josh, Hessel Jack, Hanafy Yusuf, Zhu Wanrong, Marathe Kalyani, Bitton Yonatan, Gadre Samir, Sagawa Shiori, Jitsev Jenia, Kornblith Simon, Koh Pang Wei, Ilharco Gabriel, Wortsman Mitchell, Schmidt Ludwig
- Testing The Depth Of Chatgpt's Comprehension Via Cross-modal Tasks Based On Ascii-art: Gpt3.5's Abilities In Regard To Recognizing And Generating Ascii-art Are Not Totally Lacking Bayani David
- VIGC: Visual Instruction Generation And Correction Bin Wang, Fan Wu, Xiao Han, Jiahui Peng, Huaping Zhong, Pan Zhang, Xiaoyi Dong, Weijia Li, Wei Li, Jiaqi Wang, Conghui He
- Visit-bench: A Benchmark For Vision-language Instruction Following Inspired By Real-world Use Bitton Yonatan, Bansal Hritik, Hessel Jack, Shao Rulin, Zhu Wanrong, Awadalla Anas, Gardner Josh, Taori Rohan, Schmidt Ludwig
- Halle-control: Controlling Object Hallucination In Large Multimodal Models Bohan Zhai, Shijia Yang, Chenfeng Xu, Sheng Shen, Kurt Keutzer, Chunyuan Li, Manling Li
- Pixlore: A Dataset-driven Approach To Rich Image Captioning Bonilla Diego
- Accuracy Of A Vision-language Model On Challenging Medical Cases Buckley Thomas, Diao James A., Rodman Adam, Manrai Arjun K.
- CLAP: Isolating Content From Style Through Contrastive Learning With Augmented Prompts Cai Yichao, Liu Yuhang, Zhang Zhen, Shi Javen Qinfeng
- Pumer: Pruning And Merging Tokens For Efficient Vision Language Models Cao Qingqing, Paranjape Bhargavi, Hajishirzi Hannaneh
- Hallucination Augmented Contrastive Learning For Multimodal Large Language Model Chaoya Jiang, Haiyang Xu, Mengfan Dong, Jiaxing Chen, Wei Ye, Ming Yan, Qinghao Ye, Ji Zhang, Fei Huang, Shikun Zhang
- MME: A Comprehensive Evaluation Benchmark For Multimodal Large Language Models Chaoyou Fu, Peixian Chen, Yunhang Shen, Yulei Qin, Mengdan Zhang, Xu Lin, Jinrui Yang, Xiawu Zheng, Ke Li, Xing Sun, Yunsheng Wu, Rongrong Ji
- Position-enhanced Visual Instruction Tuning For Multimodal Large Language Models Chen Chi, Qin Ruoyu, Luo Fuwen, Mi Xiaoyue, Li Peng, Sun Maosong, Liu Yang
- Visual Instruction Tuning With Polite Flamingo Chen Delong, Liu Jianfeng, Dai Wenliang, Wang Baoyuan
- X-LLM: Bootstrapping Advanced Large Language Models By Treating Multi-modalities As Foreign Languages Chen Feilong, Han Minglun, Zhao Haozhi, Zhang Qingyang, Shi Jing, Xu Shuang, Xu Bo
- Chain-of-thought Prompt Distillation For Multimodal Named Entity Recognition And Multimodal Relation Extraction Chen Feng, Feng Yujian
- LION: Empowering Multimodal Large Language Model With Dual-level Visual Knowledge Chen Gongwei, Shen Leyang, Shao Rui, Deng Xiang, Nie Liqiang
- Tem-adapter: Adapting Image-text Pretraining For Video Question Answer Chen Guangyi, Liu Xiao, Wang Guangrun, Zhang Kun, Torr Philip H. S., Zhang Xiao-ping, Tang Yansong
- Interpreting And Controlling Vision Foundation Models Via Text Explanations Chen Haozhe, Yang Junfeng, Vondrick Carl, Mao Chengzhi
- Huatuogpt-ii, One-stage Training For Medical Adaption Of Llms Chen Junying, Wang Xidong, Gao Anningzhe, Jiang Feng, Chen Shunian, Zhang Hongbo, Song Dingjie, Xie Wenya, Kong Chuyi, Li Jianquan, Wan Xiang, Li Haizhou, Wang Benyou
- Large Language Models Are Visual Reasoning Coordinators Chen Liangyu, Li Bo, Shen Sheng, Yang Jingkang, Li Chunyuan, Keutzer Kurt, Darrell Trevor, Liu Ziwei
- Driving With Llms: Fusing Object-level Vector Modality For Explainable Autonomous Driving Chen Long, Sinavski Oleg, Hünermann Jan, Karnsund Alice, Willmott Andrew James, Birch Danny, Maund Daniel, Shotton Jamie
- Understanding And Improving In-context Learning On Vision-language Models Chen Shuo, Han Zhen, He Bailan, Buckley Mark, Torr Philip, Tresp Volker, Gu Jindong
- Challenges And Contributing Factors In The Utilization Of Large Language Models (llms) Chen Xiaoliang, Li Liangbin, Chang Le, Huang Yunhe, Zhao Yuxuan, Zhang Yuxiao, Li Dinuo
- Autoeval-video: An Automatic Benchmark For Assessing Large Vision Language Models In Open-ended Video Question Answering Chen Xiuyuan, Lin Yuan, Zhang Yuchen, Huang Weiran
- Medprompt: Cross-modal Prompting For Multi-task Medical Image Translation Chen Xuhang, Pun Chi-man, Wang Shuqiang
- Measuring And Improving Chain-of-thought Reasoning In Vision-language Models Chen Yangyi, Sikka Karan, Cogswell Michael, Ji Heng, Divakaran Ajay
- Internvl: Scaling Up Vision Foundation Models And Aligning For Generic Visual-linguistic Tasks Chen Zhe, Wu Jiannan, Wang Wenhai, Su Weijie, Chen Guo, Xing Sen, Zhong Muyan, Zhang Qinglong, Zhu Xizhou, Lu Lewei, Li Bin, Luo Ping, Lu Tong, Qiao Yu, Dai Jifeng
- GENOME: Generative Neuro-symbolic Visual Reasoning By Growing And Reusing Modules Chen Zhenfang, Sun Rui, Liu Wenjun, Hong Yining, Gan Chuang
- See, Think, Confirm: Interactive Prompting Between Vision And Language Models For Knowledge-based Visual Reasoning Chen Zhenfang, Zhou Qinhong, Shen Yikang, Hong Yining, Zhang Hao, Gan Chuang
- Egothink: Evaluating First-person Perspective Thinking Capability Of Vision-language Models Cheng Sijie, Guo Zhicheng, Wu Jingwen, Fang Kechen, Li Peng, Liu Huaping, Liu Yang
- Can We Edit Multimodal Large Language Models? Cheng Siyuan, Tian Bozhong, Liu Qingbin, Chen Xi, Wang Yongheng, Chen Huajun, Zhang Ningyu
- M\(^{2}\)chat: Empowering VLM For Multimodal LLM Interleaved Text-image Generation Chi Xiaowei, Zhang Rongyu, Jiang Zhengkai, Liu Yijiang, Wang Yatian, Qi Xingqun, Luo Wenhan, Gao Peng, Zhang Shanghang, Liu Qifeng, Guo Yike
- Language Grounded Qformer For Efficient Vision Language Understanding Choraria Moulik, Sekhar Nitesh, Wu Yue, Zhang Xu, Singhal Prateek, Varshney Lav R.
- Apollo: Unified Adapter And Prompt Learning For Vision Language Models Chowdhury Sanjoy, Nag Sayan, Manocha Dinesh
- Long Story Short: A Summarize-then-search Method For Long Video Question Answering Chung Jiwan, Yu Youngjae
- Seamlessm4t: Massively Multilingual & Multimodal Machine Translation Communication Seamless, Barrault Loïc, Chung Yu-an, Meglioli Mariano Cora, Dale David, Dong Ning, Duquenne Paul-ambroise, Elsahar Hady, Gong Hongyu, Heffernan Kevin, Hoffman John, Klaiber Christopher, Li Pengwei, Licht Daniel, Maillard Jean, Rakotoarison Alice, Sadagopan Kaushik Ram, Wenzek Guillaume, Ye Ethan, Akula Bapi, Chen Peng-jen, Hachem Naji El, Ellis Brian, Gonzalez Gabriel Mejia, Haaheim Justin, Hansanti Prangthip, Howes Russ, Huang Bernie, Hwang Min-jae, Inaguma Hirofumi, Jain Somya, Kalbassi Elahe, Kallet Amanda, Kulikov Ilia, Lam Janice, Li Daniel, Ma Xutai, Mavlyutov Ruslan, Peloquin Benjamin, Ramadan Mohamed, Ramakrishnan Abinesh, Sun Anna, Tran Kevin, Tran Tuan, Tufanov Igor, Vogeti Vish, Wood Carleigh, Yang Yilin, Yu Bokai, Andrews Pierre, Balioglu Can, Costa-jussà Marta R., Celebi Onur, Elbayad Maha, Gao Cynthia, Guzmán Francisco, Kao Justine, Lee Ann, Mourachko Alexandre, Pino Juan, Popuri Sravya, Ropers Christophe, Saleem Safiyyah, Schwenk Holger, Tomasello Paden, Wang Changhan, Wang Jeff, Wang Skyler
- Seamless: Multilingual Expressive And Streaming Speech Translation Communication Seamless, Barrault Loïc, Chung Yu-an, Meglioli Mariano Coria, Dale David, Dong Ning, Duppenthaler Mark, Duquenne Paul-ambroise, Ellis Brian, Elsahar Hady, Haaheim Justin, Hoffman John, Hwang Min-jae, Inaguma Hirofumi, Klaiber Christopher, Kulikov Ilia, Li Pengwei, Licht Daniel, Maillard Jean, Mavlyutov Ruslan, Rakotoarison Alice, Sadagopan Kaushik Ram, Ramakrishnan Abinesh, Tran Tuan, Wenzek Guillaume, Yang Yilin, Ye Ethan, Evtimov Ivan, Fernandez Pierre, Gao Cynthia, Hansanti Prangthip, Kalbassi Elahe, Kallet Amanda, Kozhevnikov Artyom, Gonzalez Gabriel Mejia, Roman Robin San, Touret Christophe, Wong Corinne, Wood Carleigh, Yu Bokai, Andrews Pierre, Balioglu Can, Chen Peng-jen, Costa-jussà Marta R., Elbayad Maha, Gong Hongyu, Guzmán Francisco, Heffernan Kevin, Jain Somya, Kao Justine, Lee Ann, Ma Xutai, Mourachko Alex, Peloquin Benjamin, Pino Juan, Popuri Sravya, Ropers Christophe, Saleem Safiyyah, Schwenk Holger, Sun Anna, Tomasello Paden, Wang Changhan, Wang Jeff, Wang Skyler, Williamson Mary
- Visual Chain Of Thought: Bridging Logical Gaps With Multimodal Infillings Daniel Rose, Vaishnavi Himakunthala, Andy Ouyang, Ryan He, Alex Mei, Yujie Lu, Michael Saxon, Chinmay Sonar, Diba Mirza, William Yang Wang
- Palm-e: An Embodied Multimodal Language Model Danny Driess, Fei Xia, Mehdi S. M. Sajjadi, Corey Lynch, Aakanksha Chowdhery, Brian Ichter, Ayzaan Wahid, Jonathan Tompson, Quan Vuong, Tianhe Yu, Wenlong Huang, Yevgen Chebotar, Pierre Sermanet, Daniel Duckworth, Sergey Levine, Vincent Vanhoucke, Karol Hausman, Marc Toussaint, Klaus Greff, Andy Zeng, Igor Mordatch, Pete Florence
- Complex QA And Language Models Hybrid Architectures, Survey Daull Xavier, Bellot Patrice, Bruno Emmanuel, Martin Vincent, Murisasco Elisabeth
- Facechat: An Emotion-aware Face-to-face Dialogue Framework Deema Alnuhait, Qingyang Wu, Zhou Yu
- Scaling Vision Transformers To 22 Billion Parameters Dehghani Mostafa, Djolonga Josip, Mustafa Basil, Padlewski Piotr, Heek Jonathan, Gilmer Justin, Steiner Andreas, Caron Mathilde, Geirhos Robert, Alabdulmohsin Ibrahim, Jenatton Rodolphe, Beyer Lucas, Tschannen Michael, Arnab Anurag, Wang Xiao, Riquelme Carlos, Minderer Matthias, Puigcerver Joan, Evci Utku, Kumar Manoj, Van Steenkiste Sjoerd, Elsayed Gamaleldin F., Mahendran Aravindh, Yu Fisher, Oliver Avital, Huot Fantine, Bastings Jasmijn, Collier Mark Patrick, Gritsenko Alexey, Birodkar Vighnesh, Vasconcelos Cristina, Tay Yi, Mensink Thomas, Kolesnikov Alexander, Pavetić Filip, Tran Dustin, Kipf Thomas, Lučić Mario, Zhai Xiaohua, Keysers Daniel, Harmsen Jeremiah, Houlsby Neil
- Musilingo: Bridging Music And Text With Pre-trained Language Models For Music Captioning And Query Response Deng Zihao, Ma Yinghao, Liu Yudong, Guo Rongchen, Zhang Ge, Chen Wenhu, Huang Wenhao, Benetos Emmanouil
- Minigpt-4: Enhancing Vision-language Understanding With Advanced Large Language Models Deyao Zhu, Jun Chen, Xiaoqian Shen, Xiang Li, Mohamed Elhoseiny
- Lmflow: An Extensible Toolkit For Finetuning And Inference Of Large Foundation Models Diao Shizhe, Pan Rui, Dong Hanze, Shum Ka Shun, Zhang Jipeng, Xiong Wei, Zhang Tong
- Do Llms Work On Charts? Designing Few-shot Prompts For Chart Question Answering And Summarization Do Xuan Long, Hassanpour Mohammad, Masry Ahmed, Kavehzadeh Parsa, Hoque Enamul, Joty Shafiq
- Speechgpt: Empowering Large Language Models With Intrinsic Cross-modal Conversational Abilities Dong Zhang, Shimin Li, Xin Zhang, Jun Zhan, Pengyu Wang, Yaqian Zhou, Xipeng Qiu
- Lauragpt: Listen, Attend, Understand, And Regenerate Audio With GPT Du Zhihao, Wang Jiaming, Chen Qian, Chu Yunfei, Gao Zhifu, Li Zerui, Hu Kai, Zhou Xiaohuan, Xu Jin, Ma Ziyang, Wang Wen, Zheng Siqi, Zhou Chang, Yan Zhijie, Zhang Shiliang
- Instructseq: Unifying Vision Tasks With Instruction-conditioned Multi-modal Sequence Generation Fang Rongyao, Yan Shilin, Huang Zhaoyang, Zhou Jingqiu, Tian Hao, Dai Jifeng, Li Hongsheng
- Audiochatllama: Towards General-purpose Speech Abilities For Llms Fathullah Yassir, Wu Chunyang, Lakomkin Egor, Li Ke, Jia Junteng, Shangguan Yuan, Mahadeokar Jay, Kalinli Ozlem, Fuegen Christian, Seltzer Mike
- Transferable Decoding With Visual Entities For Zero-shot Image Captioning Fei Junjie, Wang Teng, Zhang Jinrui, He Zhenyu, Wang Chengjie, Zheng Feng
- A Challenger To GPT-4V? Early Explorations Of Gemini In Visual Expertise Fu Chaoyou, Zhang Renrui, Wang Zihan, Huang Yubo, Zhang Zhengye, Qiu Longtian, Ye Gaoxiang, Shen Yunhang, Zhang Mengdan, Chen Peixian, Zhao Sirui, Lin Shaohui, Jiang Deqiang, Yin Di, Gao Peng, Li Ke, Li Hongsheng, Sun Xing
- Dynamic Clue Bottlenecks: Towards Interpretable-by-design Visual Question Answering Fu Xingyu, Zhou Ben, Chen Sihao, Yatskar Mark, Roth Dan
- CLIPAG: Towards Generator-free Text-to-image Generation Ganz Roy, Elad Michael
- Assistgpt: A General Multi-modal Assistant That Can Plan, Execute, Inspect, And Learn Gao Difei, Ji Lei, Zhou Luowei, Lin Kevin Qinghong, Chen Joya, Fan Zihan, Shou Mike Zheng
- Examining User-friendly And Open-sourced Large GPT Models: A Survey On Language, Multimodal, And Scientific GPT Models Gao Kaiyuan, He Sunan, He Zhenyu, Lin Jiacheng, Pei Qizhi, Shao Jie, Zhang Wei
- Llama-adapter V2: Parameter-efficient Visual Instruction Model Gao Peng, Han Jiaming, Zhang Renrui, Lin Ziyi, Geng Shijie, Zhou Aojun, Zhang Wei, Lu Pan, He Conghui, Yue Xiangyu, Li Hongsheng, Qiao Yu
- CLOVA: A Closed-loop Visual Assistant With Tool Usage And Update Gao Zhi, Du Yuntao, Zhang Xintong, Ma Xiaojian, Han Wenjuan, Zhu Song-chun, Li Qing
- On The Performance Of Multimodal Language Models Garg Utsav, Bas Erhan
- Mllm-bench: Evaluating Multimodal Llms With Per-sample Criteria Ge Wentao, Chen Shunian, Chen Guiming Hardy, Chen Zhihong, Chen Junying, Yan Shuo, Zhu Chenghao, Lin Ziyue, Xie Wenya, Zhang Xinyi, Chai Yichen, Liu Xiaoyu, Song Dingjie, Wang Xidong, Gao Anningzhe, Zhang Zhiyi, Li Jianquan, Wan Xiang, Wang Benyou
- Making Llama SEE And Draw With SEED Tokenizer Ge Yuying, Zhao Sijie, Zeng Ziyun, Ge Yixiao, Li Chen, Wang Xintao, Shan Ying
- Ddcot: Duty-distinct Chain-of-thought Prompting For Multimodal Reasoning In Language Models Ge Zheng, Bin Yang, Jiajin Tang, Hong-yu Zhou, Sibei Yang
- Mblip: Efficient Bootstrapping Of Multilingual Vision-llms Geigle Gregor, Jain Abhay, Timofte Radu, Glavaš Goran
- Hiclip: Contrastive Language-image Pretraining With Hierarchy-aware Attention Geng Shijie, Yuan Jianbo, Tian Yu, Chen Yuxiao, Zhang Yongfeng
- Clipsyntel: CLIP And LLM Synergy For Multimodal Question Summarization In Healthcare Ghosh Akash, Acharya Arkadeep, Jain Raghav, Saha Sriparna, Chadha Aman, Sinha Setu
- Figstep: Jailbreaking Large Vision-language Models Via Typographic Visual Prompts Gong Yichen, Ran Delong, Liu Jinyuan, Wang Conglei, Cong Tianshuo, Wang Anyu, Duan Sisi, Wang Xiaoyun
- Bloomvqa: Assessing Hierarchical Multi-modal Comprehension Gong Yunye, Shrestha Robik, Claypoole Jared, Cogswell Michael, Ray Arijit, Kanan Christopher, Divakaran Ajay
- Large Language Models Are Zero-shot Time Series Forecasters Gruver Nate, Finzi Marc, Qiu Shikai, Wilson Andrew Gordon
- ELIP: Efficient Language-image Pre-training With Fewer Vision Tokens Guo Yangyang, Zhang Haoyu, Wong Yongkang, Nie Liqiang, Kankanhalli Mohan
- A Survey Of Vision-language Pre-training From The Lens Of Multimodal Machine Translation Gwinnup Jeremy, Duh Kevin
- Images In Language Space: Exploring The Suitability Of Large Language Models For Vision & Language Tasks Hakimov Sherzod, Schlangen David
- Large Language Models Meet Computer Vision: A Brief Survey Hamadi Raby
- Onellm: One Framework To Align All Modalities With Language Han Jiaming, Gong Kaixiong, Zhang Yiyuan, Wang Jiaqi, Zhang Kaipeng, Lin Dahua, Qiao Yu, Gao Peng, Yue Xiangyu
- Imagebind-llm: Multi-modality Instruction Tuning Han Jiaming, Zhang Renrui, Shao Wenqi, Gao Peng, Xu Peng, Xiao Han, Zhang Kaipeng, Liu Chris, Wen Song, Guo Ziyu, Lu Xudong, Ren Shuai, Wen Yafei, Chen Xiaoxin, Yue Xiangyu, Li Hongsheng, Qiao Yu
- Llms As Visual Explainers: Advancing Image Classification With Evolving Visual Descriptions Han Songhao, Zhuo Le, Liao Yue, Liu Si
- Chartllama: A Multimodal LLM For Chart Understanding And Generation Han Yucheng, Zhang Chi, Chen Xin, Yang Xu, Wang Zhibin, Yu Gang, Fu Bin, Zhang Hanwang
- Towards Efficient Vision-language Tuning: More Information Density, More Generalizability Hao Tianxiang, Lyu Mengyao, Chen Hui, Zhao Sicheng, Ding Xiaohan, Han Jungong, Ding Guiguang
- Visual Instruction Tuning Haotian Liu, Chunyuan Li, Qingyang Wu, Yong Jae Lee
- Zero-shot Recommendations With Pre-trained Large Language Models For Multimodal Nudging Harrison Rachel M., Dereventsov Anton, Bibin Anton
- Wanjuan: A Comprehensive Multimodal Dataset For Advancing English And Chinese Large Models He Conghui, Jin Zhenjiang, Xu Chao, Qiu Jiantao, Wang Bin, Li Wei, Yan Hang, Wang Jiaqi, Lin Dahua
- Multimodal Graph Transformer For Multimodal Question Answering He Xuehai, Wang Xin Eric
- Simplemtod: A Simple Language Model For Multimodal Task-oriented Dialogue With Symbolic Scene Representation Hemanthage Bhathiya, Dondrup Christian, Bartie Phil, Lemon Oliver
- Negative Object Presence Evaluation (NOPE) To Measure Object Hallucination In Vision-language Models Holy Lovenia, Wenliang Dai, Samuel Cahyawijaya, Ziwei Ji, Pascale Fung
- 3D-LLM: Injecting The 3D World Into Large Language Models Hong Yining, Zhen Haoyu, Chen Peihao, Zheng Shuhong, Du Yilun, Chen Zhenfang, Gan Chuang
- CIEM: Contrastive Instruction Evaluation Method For Better Instruction Tuning Hongyu Hu, Jiyuan Zhang, Minyi Zhao, Zhenbang Sun
- SCITUNE: Aligning Large Language Models With Scientific Multimodal Instructions Horawalavithana Sameera, Munikoti Sai, Stewart Ian, Kvinge Henry
- More Robots Are Coming: Large Multimodal Models (chatgpt) Can Solve Visually Diverse Images Of Parsons Problems Hou Irene, Man Owen, Mettille Sophie, Gutierrez Sebastian, Angelikas Kenneth, Macneil Stephen
- Context-aware Prompt Tuning For Vision-language Model With Dual-alignment Hu Hongyu, Lin Tiancheng, Wang Jie, Sun Zhenbang, Xu Yi
- COMMA: Co-articulated Multi-modal Learning Hu Lianyu, Gao Liqing, Liu Zekang, Pun Chi-man, Feng Wei
- BLIVA: A Simple Multimodal LLM For Better Handling Of Text-rich Visual Questions Hu Wenbo, Xu Yifan, Li Yi, Li Weiyue, Chen Zeyuan, Tu Zhuowen
- Visual Program Distillation: Distilling Tools And Programmatic Reasoning Into Vision-language Models Hu Yushi, Stretcu Otilia, Lu Chun-ta, Viswanathan Krishnamurthy, Hata Kenji, Luo Enming, Krishna Ranjay, Fuxman Ariel
- Vtimellm: Empower LLM To Grasp Video Moments Huang Bin, Wang Xin, Chen Hong, Song Zihan, Zhu Wenwu
- Chatgpt For Shaping The Future Of Dentistry: The Potential Of Multi-modal Large Language Model Huang Hanyao, Zheng Ou, Wang Dongdong, Yin Jiayi, Wang Zijin, Ding Shengxuan, Yin Heng, Xu Chuan, Yang Renjie, Zheng Qian, Shi Bing
- Modality Plug-and-play: Elastic Modality Adaptation In Multimodal Llms For Embodied AI Huang Kai, Yang Boyuan, Gao Wei
- T2i-compbench: A Comprehensive Benchmark For Open-world Compositional Text-to-image Generation Huang Kaiyi, Sun Kaiyue, Xie Enze, Li Zhenguo, Liu Xihui
- Adapting Pre-trained Language Models To Vision-language Tasks Via Dynamic Visual Prompting Huang Shubin, Wu Qiong, Zhou Yiyi, Chen Weijie, Zhang Rongsheng, Sun Xiaoshuai, Ji Rongrong
- Generic Attention-model Explainability By Weighted Relevance Accumulation Huang Yiming, Jia Aozhe, Zhang Xiaodan, Zhang Jiawei
- Smartedit: Exploring Complex Instruction-based Image Editing With Multimodal Large Language Models Huang Yuzhou, Xie Liangbin, Wang Xintao, Yuan Ziyang, Cun Xiaodong, Ge Yixiao, Zhou Jiantao, Dong Chao, Huang Rui, Zhang Ruimao, Shan Ying
- Pushing Boundaries: Exploring Zero Shot Object Classification With Large Multimodal Models Islam Ashhadul, Biswas Md. Rafiul, Zaghouani Wajdi, Belhaouari Samir Brahim, Shah Zubair
- Large Language Models As Automated Aligners For Benchmarking Vision-language Models Ji Yuanfeng, Ge Chongjian, Kong Weikai, Xie Enze, Liu Zhengying, Li Zhengguo, Luo Ping
- Bootstrapping Vision-language Learning With Decoupled Language Pre-training Jian Yiren, Gao Chongyang, Vosoughi Soroush
- Expedited Training Of Visual Conditioned Language Generation Via Redundancy Reduction Jian Yiren, Liu Tingkai, Tao Yunzhe, Zhang Chunhui, Vosoughi Soroush, Yang Hongxia
- Bus: Efficient And Effective Vision-language Pre-training With Bottom-up Patch Summarization Jiang Chaoya, Xu Haiyang, Ye Wei, Ye Qinghao, Li Chenliang, Yan Ming, Bi Bin, Zhang Shikun, Huang Fei, Huang Songfang
- Iluvui: Instruction-tuned Language-vision Modeling Of Uis From Machine Conversations Jiang Yue, Schoop Eldon, Swearngin Amanda, Nichols Jeffrey
- Chain Of Thought Prompt Tuning In Vision Language Models Jiaxin Ge, Hongyin Luo, Siyuan Qian, Yulu Gan, Jie Fu, Shanghang Zhang
- Evaluation And Enhancement Of Semantic Grounding In Large Vision-language Models Jiaying Lu, Jinmeng Rao, Kezhen Chen, Xiaoyuan Guo, Yawen Zhang, Baochen Sun, Carl Yang, Jie Yang
- Generating Images With Multimodal Language Models Jing Yu Koh, Daniel Fried, Ruslan Salakhutdinov
- Grounding Language Models To Images For Multimodal Inputs And Outputs Jing Yu Koh, Ruslan Salakhutdinov, Daniel Fried
- Vcoder: Versatile Vision Encoders For Multimodal Large Language Models Jitesh Jain, Jianwei Yang, Humphrey Shi
- BLIP-2: Bootstrapping Language-image Pre-training With Frozen Image Encoders And Large Language Models Junnan Li, Dongxu Li, Silvio Savarese, Steven Hoi
- Evaluation And Analysis Of Hallucination In Large Vision-language Models Junyang Wang, Yiyang Zhou, Guohai Xu, Pengcheng Shi, Chenlin Zhao, Haiyang Xu, Qinghao Ye, Ming Yan, Ji Zhang, Jihua Zhu, Jitao Sang, Haoyu Tang
- Syntax-guided Transformers: Elevating Compositional Generalization And Grounding In Multimodal Environments Kamali Danial, Kordjamshidi Parisa
- Application Of Frozen Large-scale Models To Multimodal Task-oriented Dialogue Kawamoto Tatsuki, Suzuki Takuma, Miyama Ko, Meguro Takumi, Takagi Tomohiro
- Shikra: Unleashing Multimodal Llm's Referential Dialogue Magic Keqin Chen, Zhao Zhang, Weili Zeng, Richong Zhang, Feng Zhu, Rui Zhao
- Q: How To Specialize Large Vision-language Models To Data-scarce VQA Tasks? A: Self-train On Unlabeled Images! Khan Zaid, Bg Vijay Kumar, Schulter Samuel, Yu Xiang, Fu Yun, Chandraker Manmohan
- Expediting Contrastive Language-image Pretraining Via Self-distilled Encoders Kim Bumsoo, Kim Jinhyung, Jo Yeonsik, Kim Seung Hwan
- Transferring Pre-trained Multimodal Representations With Cross-modal Similarity Matching Kim Byoungjip, Choi Sungik, Hwang Dasol, Lee Moontae, Lee Honglak
- MAGVLT: Masked Generative Vision-and-language Transformer Kim Sungwoong, Jo Daejin, Lee Donghoon, Kim Jongmin
- Can Language Models Laugh At Youtube Short-form Videos? Ko Dayoon, Lee Sangho, Kim Gunhee
- Lavip: Language-grounded Visual Prompts Kunananthaseelan Nilakshan, Zhang Jing, Harandi Mehrtash
- LISA: Reasoning Segmentation Via Large Language Model Lai Xin, Tian Zhuotao, Chen Yukang, Li Yanwei, Yuan Yuhui, Liu Shu, Jia Jiaya
- Veclip: Improving CLIP Training Via Visual-enriched Captions Lai Zhengfeng, Zhang Haotian, Zhang Bowen, Wu Wentao, Bai Haoping, Timofeev Aleksei, Du Xianzhi, Gan Zhe, Shan Jiulong, Chuah Chen-nee, Yang Yinfei, Cao Meng
- Improving Zero-shot Visual Question Answering Via Large Language Models With Reasoning Question Prompts Lan Yunshi, Li Xiang, Liu Xin, Li Yang, Qin Wei, Qian Weining
- Read-only Prompt Optimization For Vision-language Few-shot Learning Lee Dongjun, Song Seokwon, Suh Jihee, Choi Joonmyung, Lee Sanghyeok, Kim Hyunwoo J.
- Building Multimodal AI Chatbots Lee Min Young
- Volcano: Mitigating Multimodal Hallucination Through Self-feedback Guided Revision Lee Seongyun, Park Sue Hyun, Jo Yongrae, Seo Minjoon
- Mitigating Fine-grained Hallucination By Fine-tuning Large Vision-language Models With Caption Rewrites Lei Wang, Jiabang He, Shenshen Li, Ning Liu, Ee-peng Lim
- T-sciq: Teaching Multimodal Chain-of-thought Reasoning Via Mixed Large Language Model Signals For Science Question Answering Lei Wang, Yi Hu, Jiabang He, Xing Xu, Ning Liu, Hui Liu, Heng Tao Shen
- Seed-bench-2: Benchmarking Multimodal Large Language Models Li Bohao, Ge Yuying, Ge Yixiao, Wang Guangzhi, Wang Rui, Zhang Ruimao, Shan Ying
- Vision-language Instruction Tuning: A Review And Analysis Li Chen, Ge Yixiao, Li Dian, Shan Ying
- Llava-med: Training A Large Language-and-vision Assistant For Biomedicine In One Day Li Chunyuan, Wong Cliff, Zhang Sheng, Usuyama Naoto, Liu Haotian, Yang Jianwei, Naumann Tristan, Poon Hoifung, Gao Jianfeng
- Textbind: Multi-turn Interleaved Multimodal Instruction-following In The Wild Li Huayang, Li Siheng, Cai Deng, Wang Longyue, Liu Lemao, Watanabe Taro, Yang Yujiu, Shi Shuming
- Mastering Robot Manipulation With Multimodal Prompts Through Pretraining And Multi-task Fine-tuning Li Jiachen, Gao Qiaozi, Johnston Michael, Gao Xiaofeng, He Xuehai, Shakiah Suhaila, Shi Hangjie, Ghanadan Reza, Wang William Yang
- MAQA: A Multimodal QA Benchmark For Negation Li Judith Yue, Jansen Aren, Huang Qingqing, Lee Joonseok, Ganti Ravi, Kuzmin Dima
- Fine-tuning Multimodal Llms To Follow Zero-shot Demonstrative Instructions Li Juncheng, Pan Kaihang, Ge Zhiqi, Gao Minghe, Ji Wei, Zhang Wenqiao, Chua Tat-seng, Tang Siliang, Zhang Hanwang, Zhuang Yueting
- Covlm: Composing Visual Entities And Relationships In Large Language Models Via Communicative Decoding Li Junyan, Chen Delin, Hong Yining, Chen Zhenfang, Chen Peihao, Shen Yikang, Gan Chuang
- M3dbench: Let's Instruct Large Models With Multi-modal 3D Prompts Li Mingsheng, Chen Xin, Zhang Chi, Chen Sijin, Zhu Hongyuan, Yin Fukun, Yu Gang, Chen Tao
- FLM-101B: An Open LLM And How To Train It With $100K Budget Li Xiang, Yao Yiqun, Jiang Xin, Fang Xuezhi, Meng Xuying, Fan Siqi, Han Peng, Li Jing, Du Li, Qin Bowen, Zhang Zheng, Sun Aixin, Wang Yequan
- Stablellava: Enhanced Visual Instruction Tuning With Synthesized Image-dialogue Data Li Yanda, Zhang Chi, Yu Gang, Wang Zhibin, Fu Bin, Lin Guosheng, Shen Chunhua, Chen Ling, Wei Yunchao
- Towards Vision Enhancing Llms: Empowering Multimodal Knowledge Storage And Sharing In Llms Li Yunxin, Hu Baotian, Wang Wei, Cao Xiaochun, Zhang Min
- Towards End-to-end Embodied Decision Making Via Multi-modal Large Language Model: Explorations With Gpt4-vision And Beyond Liang Chen, Yichi Zhang, Shuhuai Ren, Haozhe Zhao, Zefan Cai, Yuchi Wang, Peiyi Wang, Tianyu Liu, Baobao Chang
- GPT-4 Enhanced Multimodal Grounding For Autonomous Driving: Leveraging Cross-modal Attention With Large Language Models Liao Haicheng, Shen Huanming, Li Zhenning, Wang Chengyue, Li Guofa, Bie Yiming, Xu Chengzhong
- Video-llava: Learning United Visual Representation By Alignment Before Projection Lin Bin, Ye Yang, Zhu Bin, Cui Jiaxi, Ning Munan, Jin Peng, Yuan Li
- Paralinguistics-enhanced Large Language Modeling Of Spoken Dialogue Lin Guan-ting, Shivakumar Prashanth Gurunath, Gandhe Ankur, Yang Chao-han Huck, Gu Yile, Ghosh Shalini, Stolcke Andreas, Lee Hung-yi, Bulyko Ivan
- Beneath The Surface: Unveiling Harmful Memes With Multimodal Reasoning Distilled From Large Language Models Lin Hongzhan, Luo Ziyang, Ma Jing, Chen Long
- Learning To Model The World With Language Lin Jessy, Du Yuqing, Watkins Olivia, Hafner Danijar, Abbeel Pieter, Klein Dan, Dragan Anca
- VILA: On Pre-training For Visual Language Models Lin Ji, Yin Hongxu, Ping Wei, Lu Yao, Molchanov Pavlo, Tao Andrew, Mao Huizi, Kautz Jan, Shoeybi Mohammad, Han Song
- Pushing Large Language Models To The 6G Edge: Vision, Challenges, And Opportunities Lin Zheng, Qu Guanqiao, Chen Qiyuan, Chen Xianhao, Chen Zhe, Huang Kaibin
- SPHINX: The Joint Mixing Of Weights, Tasks, And Visual Embeddings For Multi-modal Large Language Models Lin Ziyi, Liu Chris, Zhang Renrui, Gao Peng, Qiu Longtian, Xiao Han, Qiu Han, Lin Chen, Shao Wenqi, Chen Keqin, Han Jiaming, Huang Siyuan, Zhang Yichi, He Xuming, Li Hongsheng, Qiao Yu
- FAITHSCORE: Evaluating Hallucinations In Large Vision-language Models Liqiang Jing, Ruosen Li, Yunmo Chen, Mengzhao Jia, Xinya Du
- MMC: Advancing Multimodal Chart Understanding With Large-scale Instruction Tuning Liu Fuxiao, Wang Xiaoyang, Yao Wenlin, Chen Jianshu, Song Kaiqiang, Cho Sangwoo, Yacoob Yaser, Yu Dong
- Qilin-med-vl: Towards Chinese Large Vision-language Model For General Healthcare Liu Junling, Wang Ziming, Ye Qichen, Chong Dading, Zhou Peilin, Hua Yining
- Contrastive Vision-language Alignment Makes Efficient Instruction Learner Liu Lizhao, Sun Xinyu, Xiang Tianhang, Zhuang Zhuangwei, Yin Liuren, Tan Mingkui
- An Evaluation Of GPT-4V And Gemini In Online VQA Liu Mengchen, Chen Chongyan, Gurari Danna
- Bt-adapter: Video Conversation Is Feasible Without Video Instruction Tuning Liu Ruyang, Li Chen, Ge Yixiao, Shan Ying, Li Thomas H., Li Ge
- Language Models As Black-box Optimizers For Vision-language Models Liu Shihong, Lin Zhiqiu, Yu Samuel, Lee Ryan, Ling Tiffany, Pathak Deepak, Ramanan Deva
- MMHQA-ICL: Multimodal In-context Learning For Hybrid Question Answering Over Text, Tables And Images Liu Weihao, Lei Fangyu, Luo Tongxu, Lei Jiahe, He Shizhu, Zhao Jun, Liu Kang
- Class Incremental Learning With Pre-trained Vision-language Models Liu Xialei, Cao Xusheng, Lu Haori, Xiao Jia-wen, Bagdanov Andrew D., Cheng Ming-ming
- Beyond Text: Unveiling Multimodal Proficiency Of Large Language Models With Multiapi Benchmark Liu Xiao, Lin Jianfeng, Zhang Jiawei
- Mm-safetybench: A Benchmark For Safety Evaluation Of Multimodal Large Language Models Liu Xin, Zhu Yichen, Gu Jindong, Lan Yunshi, Yang Chao, Qiao Yu
- What Large Language Models Bring To Text-rich VQA? Liu Xuejing, Tang Wei, Ni Xinzhe, Lu Jinghui, Zhao Rui, Li Zechao, Tan Fei
- Unified-io 2: Scaling Autoregressive Multimodal Models With Vision, Language, Audio, And Action Lu Jiasen, Clark Christopher, Lee Sangho, Zhang Zichen, Khosla Savya, Marten Ryan, Hoiem Derek, Kembhavi Aniruddha
- Ziya-visual: Bilingual Large Vision-language Model Via Multi-task Instruction Tuning Lu Junyu, Zhang Dixiang, Wu Xiaojun, Gao Xinyu, Gan Ruyi, Zhang Jiaxing, Song Yan, Zhang Pingjian
- Lyrics: Boosting Fine-grained Language-vision Alignment And Comprehension Via Semantic-aware Visual Objects Lu Junyu, Zhang Dixiang, Zhang Songxin, Xie Zejian, Song Zhuoyang, Lin Cong, Zhang Jiaxing, Jing Bingyi, Zhang Pingjian
- Mathvista: Evaluating Mathematical Reasoning Of Foundation Models In Visual Contexts Lu Pan, Bansal Hritik, Xia Tony, Liu Jiacheng, Li Chunyuan, Hajishirzi Hannaneh, Cheng Hao, Chang Kai-wei, Galley Michel, Gao Jianfeng
- Multimodal Procedural Planning Via Dual Text-image Prompting Lu Yujie, Lu Pan, Chen Zhiyu, Zhu Wanrong, Wang Xin Eric, Wang William Yang
- Llmscore: Unveiling The Power Of Large Language Models In Text-to-image Synthesis Evaluation Lu Yujie, Yang Xianjun, Li Xiujun, Wang Xin Eric, Wang William Yang
- Effectively Fine-tune To Improve Large Multimodal Models For Radiology Report Generation Lu Yuzhe, Hong Sungmin, Shah Yash, Xu Panpan
- Cheap And Quick: Efficient Vision-language Instruction Tuning For Large Language Models Luo Gen, Zhou Yiyi, Ren Tianhe, Chen Shengxin, Sun Xiaoshuai, Ji Rongrong
- Biomedgpt: Open Multimodal Generative Pre-trained Transformer For Biomedicine Luo Yizhen, Zhang Jiahuan, Fan Siqi, Yang Kai, Wu Yushuai, Qiao Mu, Nie Zaiqing
- Augmented Large Language Models With Parametric Knowledge Guiding Luo Ziyang, Xu Can, Zhao Pu, Geng Xiubo, Tao Chongyang, Ma Jing, Lin Qingwei, Jiang Daxin
- KOSMOS-2.5: A Multimodal Literate Model Lv Tengchao, Huang Yupan, Chen Jingye, Zhao Yuzhong, Jia Yilin, Cui Lei, Ma Shuming, Chang Yaoyao, Huang Shaohan, Wang Wenhui, Dong Li, Luo Weiyao, Wu Shaoxiang, Wang Guoxin, Zhang Cha, Wei Furu
- An Examination Of The Compositionality Of Large Generative Vision-language Models Ma Teli, Li Rong, Liang Junwei
- World-to-words: Grounded Open Vocabulary Acquisition Through Fast Mapping In Vision-language Models Ma Ziqiao, Pan Jiayi, Chai Joyce
- Video-chatgpt: Towards Detailed Video Understanding Via Large Vision And Language Models Maaz Muhammad, Rasheed Hanoona, Khan Salman, Khan Fahad Shahbaz
- Enhancing CLIP With GPT-4: Harnessing Visual Descriptions As Prompts Maniparambil Mayug, Vorster Chris, Molloy Derek, Murphy Noel, Mcguinness Kevin, O'connor Noel E.
- Generating Illustrated Instructions Menon Sachit, Misra Ishan, Girdhar Rohit
- Dialogue-based Generation Of Self-driving Simulation Scenarios Using Large Language Models Miceli-barone Antonio Valerio, Lascarides Alex, Innes Craig
- Mindstorms In Natural Language-based Societies Of Mind Mingchen Zhuge, Haozhe Liu, Francesco Faccio, Dylan R. Ashley, Róbert Csordás, Anand Gopalakrishnan, Abdullah Hamdi, Hasan Abed Al Kader Hammoud, Vincent Herrmann, Kazuki Irie, Louis Kirsch, Bing Li, Guohao Li, Shuming Liu, Jinjie Mai, Piotr Piękos, Aditya Ramesh, Imanol Schlag, Weimin Shi, Aleksandar Stanić, Wenyi Wang, Yuhui Wang, Mengmeng Xu, Deng-ping Fan, Bernard Ghanem, Jürgen Schmidhuber
- PRE: Vision-language Prompt Learning With Reparameterization Encoder Minh Anh Pham Thi, Nguyen An Duc, Tzimiropoulos Georgios
- Comparing Humans, GPT-4, And GPT-4V On Abstraction And Reasoning Tasks Mitchell Melanie, Palmarini Alessandro B., Moskvichev Arseny
- Compositional Chain-of-thought Prompting For Large Multimodal Models Mitra Chancharik, Huang Brandon, Darrell Trevor, Herzig Roei
- Metavl: Transferring In-context Learning Ability From Language Models To Vision-language Models Monajatipoor Masoud, Li Liunian Harold, Rouhsedaghat Mozhdeh, Yang Lin F., Chang Kai-wei
- Anymal: An Efficient And Scalable Any-modality Augmented Language Model Moon Seungwhan, Madotto Andrea, Lin Zhaojiang, Nagarajan Tushar, Smith Matt, Jain Shashank, Yeh Chun-fu, Murugesan Prakash, Heidari Peyman, Liu Yue, Srinet Kavya, Damavandi Babak, Kumar Anuj
- "medium" Lms Of Code In The Era Of Llms: Lessons From Stackoverflow Mukherjee Manisha, Hellendoorn Vincent J.
- Unival: Unified Model For Image, Video, Audio And Language Tasks Mustafa Shukor, Corentin Dancette, Alexandre Rame, Matthieu Cord
- Understanding (un)intended Memorization In Text-to-image Generative Models Naseh Ali, Roh Jaechul, Houmansadr Amir
- Vision Encoder-decoder Models For AI Coaching Nayak Jyothi S, Khan Afifah Khan Mohammed Ajmal, Manjeshwar Chirag, Banday Imadh Ajaz
- Gpt4image: Can Large Pre-trained Models Help Vision Models On Perception Tasks? Ning Ding, Yehui Tang, Zhongqian Fu, Chao Xu, Kai Han, Yunhe Wang
- Multimodal Analysis Of Google Bard And Gpt-vision: Experiments In Visual Reasoning Noever David, Noever Samantha Elizabeth Miller
- Visual AI And Linguistic Intelligence Through Steerability And Composability Noever David, Noever Samantha Elizabeth Miller
- GPT-4 Technical Report Openai, Josh Achiam, Steven Adler, Sandhini Agarwal, Lama Ahmad, Ilge Akkaya, Florencia Leoni Aleman, Diogo Almeida, Janko Altenschmidt, Sam Altman, Shyamal Anadkat, Red Avila, Igor Babuschkin, Suchir Balaji, Valerie Balcom, Paul Baltescu, Haiming Bao, Mohammad Bavarian, Jeff Belgum, Irwan Bello, Jake Berdine, Gabriel Bernadett-shapiro, Christopher Berner, Lenny Bogdonoff, Oleg Boiko, Madelaine Boyd, Anna-luisa Brakman, Greg Brockman, Tim Brooks, Miles Brundage, Kevin Button, Trevor Cai, Rosie Campbell, Andrew Cann, Brittany Carey, Chelsea Carlson, Rory Carmichael, Brooke Chan, Che Chang, Fotis Chantzis, Derek Chen, Sully Chen, Ruby Chen, Jason Chen, Mark Chen, Ben Chess, Chester Cho, Casey Chu, Hyung Won Chung, Dave Cummings, Jeremiah Currier, Yunxing Dai, Cory Decareaux, Thomas Degry, Noah Deutsch, Damien Deville, Arka Dhar, David Dohan, Steve Dowling, Sheila Dunning, Adrien Ecoffet, Atty Eleti, Tyna Eloundou, David Farhi, Liam Fedus, Niko Felix, Simón Posada Fishman, Juston Forte, Isabella Fulford, Leo Gao, Elie Georges, Christian Gibson, Vik Goel, Tarun Gogineni, Gabriel Goh, Rapha Gontijo-lopes, Jonathan Gordon, Morgan Grafstein, Scott Gray, Ryan Greene, Joshua Gross, Shixiang Shane Gu, Yufei Guo, Chris Hallacy, Jesse Han, Jeff Harris, Yuchen He, Mike Heaton, Johannes Heidecke, Chris Hesse, Alan Hickey, Wade Hickey, Peter Hoeschele, Brandon Houghton, Kenny Hsu, Shengli Hu, Xin Hu, Joost Huizinga, Shantanu Jain, Shawn Jain, Joanne Jang, Angela Jiang, Roger Jiang, Haozhun Jin, Denny Jin, Shino Jomoto, Billie Jonn, Heewoo Jun, Tomer Kaftan, Łukasz Kaiser, Ali Kamali, Ingmar Kanitscheider, Nitish Shirish Keskar, Tabarak Khan, Logan Kilpatrick, Jong Wook Kim, Christina Kim, Yongjik Kim, Jan Hendrik Kirchner, Jamie Kiros, Matt Knight, Daniel Kokotajlo, Łukasz Kondraciuk, Andrew Kondrich, Aris Konstantinidis, Kyle Kosic, Gretchen Krueger, Vishal Kuo, Michael Lampe, Ikai Lan, Teddy Lee, Jan Leike, Jade Leung, Daniel Levy, Chak Ming Li, Rachel Lim, Molly Lin, Stephanie Lin, Mateusz Litwin, Theresa Lopez, Ryan Lowe, Patricia Lue, Anna Makanju, Kim Malfacini, Sam Manning, Todor Markov, Yaniv Markovski, Bianca Martin, Katie Mayer, Andrew Mayne, Bob Mcgrew, Scott Mayer Mckinney, Christine Mcleavey, Paul Mcmillan, Jake Mcneil, David Medina, Aalok Mehta, Jacob Menick, Luke Metz, Andrey Mishchenko, Pamela Mishkin, Vinnie Monaco, Evan Morikawa, Daniel Mossing, Tong Mu, Mira Murati, Oleg Murk, David Mély, Ashvin Nair, Reiichiro Nakano, Rajeev Nayak, Arvind Neelakantan, Richard Ngo, Hyeonwoo Noh, Long Ouyang, Cullen O'keefe, Jakub Pachocki, Alex Paino, Joe Palermo, Ashley Pantuliano, Giambattista Parascandolo, Joel Parish, Emy Parparita, Alex Passos, Mikhail Pavlov, Andrew Peng, Adam Perelman, Filipe De Avila Belbute Peres, Michael Petrov, Henrique Ponde De Oliveira Pinto, Michael Pokorny, Michelle Pokrass, Vitchyr H. Pong, Tolly Powell, Alethea Power, Boris Power, Elizabeth Proehl, Raul Puri, Alec Radford, Jack Rae, Aditya Ramesh, Cameron Raymond, Francis Real, Kendra Rimbach, Carl Ross, Bob Rotsted, Henri Roussez, Nick Ryder, Mario Saltarelli, Ted Sanders, Shibani Santurkar, Girish Sastry, Heather Schmidt, David Schnurr, John Schulman, Daniel Selsam, Kyla Sheppard, Toki Sherbakov, Jessica Shieh, Sarah Shoker, Pranav Shyam, Szymon Sidor, Eric Sigler, Maddie Simens, Jordan Sitkin, Katarina Slama, Ian Sohl, Benjamin Sokolowsky, Yang Song, Natalie Staudacher, Felipe Petroski Such, Natalie Summers, Ilya Sutskever, Jie Tang, Nikolas Tezak, Madeleine B. Thompson, Phil Tillet, Amin Tootoonchian, Elizabeth Tseng, Preston Tuggle, Nick Turley, Jerry Tworek, Juan Felipe Cerón Uribe, Andrea Vallone, Arun Vijayvergiya, Chelsea Voss, Carroll Wainwright, Justin Jay Wang, Alvin Wang, Ben Wang, Jonathan Ward, Jason Wei, Cj Weinmann, Akila Welihinda, Peter Welinder, Jiayi Weng, Lilian Weng, Matt Wiethoff, Dave Willner, Clemens Winter, Samuel Wolrich, Hannah Wong, Lauren Workman, Sherwin Wu, Jeff Wu, Michael Wu, Kai Xiao, Tao Xu, Sarah Yoo, Kevin Yu, Qiming Yuan, Wojciech Zaremba, Rowan Zellers, Chong Zhang, Marvin Zhang, Shengjia Zhao, Tianhao Zheng, Juntang Zhuang, William Zhuk, Barret Zoph
- Towards Vision-language Mechanistic Interpretability: A Causal Tracing Tool For BLIP Palit Vedant, Pandey Rohan, Arora Aryaman, Liang Paul Pu
- Finding And Editing Multi-modal Neurons In Pre-trained Transformers Pan Haowen, Cao Yixin, Wang Xiaozhi, Yang Xun, Wang Meng
- Retrieving-to-answer: Zero-shot Video Question Answering With Frozen Large Language Models Pan Junting, Lin Ziyi, Ge Yuying, Zhu Xiatian, Zhang Renrui, Wang Yi, Qiao Yu, Li Hongsheng
- Kosmos-g: Generating Images In Context With Multimodal Large Language Models Pan Xichen, Dong Li, Huang Shaohan, Peng Zhiliang, Chen Wenhu, Wei Furu
- X-instructblip: A Framework For Aligning X-modal Instruction-aware Representations To Llms And Emergent Cross-modal Reasoning Panagopoulou Artemis, Xue Le, Yu Ning, Li Junnan, Li Dongxu, Joty Shafiq, Xu Ran, Savarese Silvio, Xiong Caiming, Niebles Juan Carlos
- Semantic Composition In Visually Grounded Language Models Pandey Rohan
- Frozen Transformers In Language Models Are Effective Visual Encoder Layers Pang Ziqi, Xie Ziyang, Man Yunze, Wang Yu-xiong
- Clip-guided Vision-language Pre-training For Question Answering In 3D Scenes Parelli Maria, Delitzas Alexandros, Hars Nikolas, Vlassis Georgios, Anagnostidis Sotirios, Bachmann Gregor, Hofmann Thomas
- Interpretable Visual Question Answering Via Reasoning Supervision Parelli Maria, Mallis Dimitrios, Diomataris Markos, Pitsikalis Vassilis
- Localized Symbolic Knowledge Distillation For Visual Commonsense Models Park Jae Sung, Hessel Jack, Chandu Khyathi Raghavi, Liang Paul Pu, Lu Ximing, West Peter, Yu Youngjae, Huang Qiuyuan, Gao Jianfeng, Farhadi Ali, Choi Yejin
- ECLIPSE: A Resource-efficient Text-to-image Prior For Image Generations Patel Maitreya, Kim Changhoon, Cheng Sheng, Baral Chitta, Yang Yezhou
- Evaluating Gpt-4's Vision Capabilities On Brazilian University Admission Exams Pires Ramon, Almeida Thales Sales, Abonizio Hugo, Nogueira Rodrigo
- Egovlpv2: Egocentric Video-language Pre-training With Fusion In The Backbone Pramanick Shraman, Song Yale, Nag Sayan, Lin Kevin Qinghong, Shah Hardik, Shou Mike Zheng, Chellappa Rama, Zhang Pengchuan
- Rephrase, Augment, Reason: Visual Grounding Of Questions For Vision-language Models Prasad Archiki, Stengel-eskin Elias, Bansal Mohit
- Vidcom: Fast Video Comprehension Through Large Language Models With Multimodal Tools Qi Ji, Ji Kaixuan, Yu Jifan, Wang Duokang, Xu Bin, Hou Lei, Li Juanzi
- Visual Adversarial Examples Jailbreak Aligned Large Language Models Qi Xiangyu, Huang Kaixuan, Panda Ashwinee, Henderson Peter, Wang Mengdi, Mittal Prateek
- Mplug-owl: Modularization Empowers Large Language Models With Multimodality Qinghao Ye, Haiyang Xu, Guohai Xu, Jiabo Ye, Ming Yan, Yiyang Zhou, Junyang Wang, Anwen Hu, Pengcheng Shi, Yaya Shi, Chenliang Li, Yuanhong Xu, Hehong Chen, Junfeng Tian, Qi Qian, Ji Zhang, Fei Huang, Jingren Zhou
- Zero-shot Recommendations With Pre-trained Large Language Models For Multimodal Nudging Rachel M. Harrison, Anton Dereventsov, Anton Bibin
- Multimodal Multi-hop Question Answering Through A Conversation Between Tools And Efficiently Finetuned Large Language Models Rajabzadeh Hossein, Wang Suyuchen, Kwon Hyock Ju, Liu Bang
- Mabvit -- Modified Attention Block Enhances Vision Transformers Ramesh Mahesh, Ramkumar Aswinkumar
- Retrieval-augmented Image Captioning Ramos Rita, Elliott Desmond, Martins Bruno
- Retrieval-based Knowledge Augmented Vision Language Pre-training Rao Jiahua, Shan Zifei, Liu Longpo, Zhou Yao, Yang Yuedong
- COLA: A Benchmark For Compositional Text-to-image Retrieval Ray Arijit, Radenovic Filip, Dubey Abhimanyu, Plummer Bryan A., Krishna Ranjay, Saenko Kate
- Towards A Performance Analysis On Pre-trained Visual Question Answering Models For Autonomous Driving Rekanar Kaavya, Eising Ciarán, Sistu Ganesh, Hayes Martin
- Timechat: A Time-sensitive Multimodal Large Language Model For Long Video Understanding Ren Shuhuai, Yao Linli, Li Shicheng, Sun Xu, Hou Lu
- Gpt4tools: Teaching Large Language Model To Use Tools Via Self-instruction Rui Yang, Lin Song, Yanwei Li, Sijie Zhao, Yixiao Ge, Xiu Li, Ying Shan
- Uni-nlx: Unifying Textual Explanations For Vision And Vision-language Tasks Sammani Fawaz, Deligiannis Nikos
- Tinylvlm-ehub: Towards Comprehensive And Efficient Evaluation For Large Vision-language Models Shao Wenqi, Lei Meng, Hu Yutao, Gao Peng, Zhang Kaipeng, Meng Fanqing, Xu Peng, Huang Siyuan, Li Hongsheng, Qiao Yu, Luo Ping
- Language Is Not All You Need: Aligning Perception With Language Models Shaohan Huang, Li Dong, Wenhui Wang, Yaru Hao, Saksham Singhal, Shuming Ma, Tengchao Lv, Lei Cui, Owais Khan Mohammed, Barun Patra, Qiang Liu, Kriti Aggarwal, Zewen Chi, Johan Bjorck, Vishrav Chaudhary, Subhojit Som, Xia Song, Furu Wei
- Jailbreak In Pieces: Compositional Adversarial Attacks On Multi-modal Language Models Shayegani Erfan, Dong Yue, Abu-ghazaleh Nael
- Graphextqa: A Benchmark For Evaluating Graph-enhanced Large Language Models Shen Yuanchun, Liao Ruotong, Han Zhen, Ma Yunpu, Tresp Volker
- Towards More Unified In-context Visual Understanding Sheng Dianmo, Chen Dongdong, Tan Zhentao, Liu Qiankun, Chu Qi, Bao Jianmin, Gong Tao, Liu Bin, Xu Shengwei, Yu Nenghai
- Crossget: Cross-guided Ensemble Of Tokens For Accelerating Vision-language Transformers Shi Dachuan, Tao Chaofan, Rao Anyi, Yang Zhendong, Yuan Chun, Wang Jiaqi
- Dept: Decomposed Prompt Tuning For Parameter-efficient Fine-tuning Shi Zhengxiang, Lipani Aldo
- What Does CLIP Know About A Red Circle? Visual Prompt Engineering For Vlms Shtedritski Aleksandar, Rupprecht Christian, Vedaldi Andrea
- Audio-visual LLM For Video Understanding Shu Fangxun, Zhang Lei, Jiang Hao, Xie Cihang
- SC-ML: Self-supervised Counterfactual Metric Learning For Debiased Visual Question Answering Shu Xinyao, Yan Shiyang, Yang Xu, Wu Ziheng, Chen Zhongfeng, Lu Zhenyu
- A Survey On Multimodal Large Language Models Shukang Yin, Chaoyou Fu, Sirui Zhao, Ke Li, Xing Sun, Tong Xu, Enhong Chen
- Woodpecker: Hallucination Correction For Multimodal Large Language Models Shukang Yin, Chaoyou Fu, Sirui Zhao, Tong Xu, Hao Wang, Dianbo Sui, Yunhang Shen, Ke Li, Xing Sun, Enhong Chen
- Ep-alm: Efficient Perceptual Augmentation Of Language Models Shukor Mustafa, Dancette Corentin, Cord Matthieu
- Beyond Task Performance: Evaluating And Reducing The Flaws Of Large Multimodal Models With In-context Learning Shukor Mustafa, Rame Alexandre, Dancette Corentin, Cord Matthieu
- Combo Of Thinking And Observing For Outside-knowledge VQA Si Qingyi, Mo Yuchen, Lin Zheng, Ji Huishan, Wang Weiping
- Mitigating Object Hallucinations In Large Vision-language Models Through Visual Contrastive Decoding Sicong Leng, Hang Zhang, Guanzheng Chen, Xin Li, Shijian Lu, Chunyan Miao, Lidong Bing
- Assessing GPT4-V On Structured Reasoning Tasks Singh Mukul, Cambronero José, Gulwani Sumit, Le Vu, Verbruggen Gust
- Gopro: Generate And Optimize Prompts In CLIP Using Self-supervised Learning Singha Mainak, Jha Ankit, Banerjee Biplab
- Applenet: Visual Attention Parameterized Prompt Learning For Few-shot Remote Sensing Image Generalization Using CLIP Singha Mainak, Jha Ankit, Solanki Bhupendra, Bose Shirsha, Banerjee Biplab
- Pneumollm: Harnessing The Power Of Large Language Model For Pneumoconiosis Diagnosis Song Meiyue, Yu Zhihua, Wang Jiaxin, Wang Jiarui, Lu Yuting, Li Baicun, Wang Xiaoxu, Huang Qinghua, Li Zhijun, Kanellakis Nikolaos I., Liu Jiangfeng, Wang Jing, Wang Binglu, Yang Juntao
- How To Bridge The Gap Between Modalities: A Comprehensive Survey On Multimodal Large Language Model Song Shezheng, Li Xiaopeng, Li Shasha, Zhao Shan, Yu Jie, Ma Jun, Mao Xiaoguang, Zhang Weimin
- The First Step Is The Hardest: Pitfalls Of Representing And Tokenizing Temporal Data For Large Language Models Spathis Dimitris, Kawsar Fahim
- Fine-grained Audio-visual Joint Representations For Multimodal Large Language Models Sun Guangzhi, Yu Wenyi, Tang Changli, Chen Xianzhao, Tan Tian, Li Wei, Lu Lu, Ma Zejun, Zhang Chao
- Generative Multimodal Models Are In-context Learners Sun Quan, Cui Yufeng, Zhang Xiaosong, Zhang Fan, Yu Qiying, Luo Zhengxiong, Wang Yueze, Rao Yongming, Liu Jingjing, Huang Tiejun, Wang Xinlong
- Multimodal Question Answering For Unified Information Extraction Sun Yuxuan, Zhang Kai, Su Yu
- Glitchbench: Can Large Multimodal Models Detect Video Game Glitches? Taesiri Mohammad Reza, Feng Tianjun, Nguyen Anh, Bezemer Cor-paul
- Imaginations Of WALL-E: Reconstructing Experiences With An Imagination-inspired Module For Advanced AI Systems Taghavi Zeinab Sadat, Gooran Soroush, Dalili Seyed Arshan, Amirzadeh Hamidreza, Nematbakhsh Mohammad Jalal, Sameti Hossein
- Link-context Learning For Multimodal Llms Tai Yan, Fan Weichen, Zhang Zhao, Zhu Feng, Zhao Rui, Liu Ziwei
- Tackling VQA With Pretrained Foundation Models Without Further Training Tan Alvin De Jun, Shen Bingquan
- Compound Text-guided Prompt Tuning Via Image-adaptive Cues Tan Hao, Li Jun, Zhou Yizhuang, Wan Jun, Lei Zhen, Zhang Xiangyu
- Codi-2: In-context, Interleaved, And Interactive Any-to-any Generation Tang Zineng, Yang Ziyi, Khademi Mahmoud, Liu Yang, Zhu Chenguang, Bansal Mohit
- Expanding Frozen Vision-language Models Without Retraining: Towards Improved Robot Perception Tavassoli Riley, Amani Mani, Akhavian Reza
- Caption Anything: Interactive Image Description With Diverse Multimodal Controls Teng Wang, Jinrui Zhang, Junjie Fei, Hao Zheng, Yunlong Tang, Zhe Li, Mingqi Gao, Shanshan Zhao
- CLAMP: Contrastive Language Model Prompt-tuning Teterwak Piotr, Sun Ximeng, Plummer Bryan A., Saenko Kate, Lim Ser-nam
- Argue: Attribute-guided Prompt Tuning For Vision-language Models Tian Xinyu, Zou Shu, Yang Zhaoyuan, Zhang Jing
- Graph Neural Prompting With Large Language Models Tian Yijun, Song Huan, Wang Zichen, Wang Haozhu, Hu Ziqing, Wang Fang, Chawla Nitesh V., Xu Panpan
- RLHF-V: Towards Trustworthy Mllms Via Behavior Alignment From Fine-grained Correctional Human Feedback Tianyu Yu, Yuan Yao, Haoye Zhang, Taiwen He, Yifeng Han, Ganqu Cui, Jinyi Hu, Zhiyuan Liu, Hai-tao Zheng, Maosong Sun, Tat-seng Chua
- Viclevr: A Visual Reasoning Dataset And Hybrid Multimodal Fusion Model For Visual Question Answering In Vietnamese Tran Khiem Vinh, Phan Hao Phu, Van Nguyen Kiet, Nguyen Ngan Luu Thuy
- How Many Unicorns Are In This Image? A Safety Evaluation Benchmark For Vision Llms Tu Haoqin, Cui Chenhang, Wang Zijun, Zhou Yiyang, Zhao Bingchen, Han Junlin, Zhou Wangchunshu, Yao Huaxiu, Xie Cihang
- Let's Think Frame By Frame With VIP: A Video Infilling And Prediction Dataset For Evaluating Video Chain-of-thought Vaishnavi Himakunthala, Andy Ouyang, Daniel Rose, Ryan He, Alex Mei, Yujie Lu, Chinmay Sonar, Michael Saxon, William Yang Wang
- Detecting And Correcting Hate Speech In Multimodal Memes With Large Visual Language Model Van Minh-hao, Wu Xintao
- Adventures Of Trustworthy Vision-language Models: A Survey Vatsa Mayank, Jain Anubhooti, Singh Richa
- Behind The Magic, MERLIM: Multi-modal Evaluation Benchmark For Large Image-language Models Villa Andrés, Alcázar Juan Carlos León, Soto Alvaro, Ghanem Bernard
- Docllm: A Layout-aware Generative Language Model For Multimodal Document Understanding Wang Dongsheng, Raman Natraj, Sibue Mathieu, Ma Zhiqiang, Babkin Petr, Kaur Simerjot, Pei Yulong, Nourbakhsh Armineh, Liu Xiaomo
- What Makes For Good Visual Tokenizers For Large Language Models? Wang Guangzhi, Ge Yixiao, Ding Xiaohan, Kankanhalli Mohan, Shan Ying
- Instructta: Instruction-tuned Targeted Attack For Large Vision-language Models Wang Xunguang, Ji Zhenlan, Ma Pingchuan, Li Zongjie, Wang Shuai
- Non-intrusive Adaptation: Input-centric Parameter-efficient Fine-tuning For Versatile Multimodal Modeling Wang Yaqing, Wu Jialin, Dabral Tanmaya, Zhang Jiageng, Brown Geoff, Lu Chun-ta, Liu Frederick, Liang Yi, Pang Bo, Bendersky Michael, Soricut Radu
- Towards Improving Document Understanding: An Exploration On Text-grounding Via Mllms Wang Yonghui, Zhou Wengang, Feng Hao, Zhou Keyi, Li Houqiang
- Learning Hierarchical Prompt With Structured Linguistic Knowledge For Vision-language Models Wang Yubin, Jiang Xinyang, Cheng De, Li Dongsheng, Zhao Cairong
- Gemini In Reasoning: Unveiling Commonsense In Multimodal Large Language Models Wang Yuqing, Zhao Yun
- Smarttrim: Adaptive Tokens And Attention Pruning For Efficient Vision-language Models Wang Zekun, Chen Jingchang, Zhou Wangchunshu, Zhu Haichao, Liang Jiafeng, Shan Liping, Liu Ming, Xu Dongliang, Yang Qing, Qin Bing
- Gpt4video: A Unified Multimodal Large Language Model For Instruction-followed Understanding And Safety-aware Generation Wang Zhanyu, Wang Longyue, Zhao Zhen, Wu Minghao, Lyu Chenyang, Li Huayang, Cai Deng, Zhou Luping, Shi Shuming, Tu Zhaopeng
- Filling The Image Information Gap For VQA: Prompting Large Language Models To Proactively Ask Questions Wang Ziyue, Chen Chi, Li Peng, Liu Yang
- Infovisdial: An Informative Visual Dialogue Dataset By Bridging Large Multimodal And Language Models Wen Bingbing, Yang Zhengyuan, Wang Jianfeng, Gan Zhe, Howe Bill, Wang Lijuan
- Instructblip: Towards General-purpose Vision-language Models With Instruction Tuning Wenliang Dai, Junnan Li, Dongxu Li, Anthony Meng Huat Tiong, Junqi Zhao, Weisheng Wang, Boyang Li, Pascale Fung, Steven Hoi
- Resolving References In Visually-grounded Dialogue Via Text Generation Willemsen Bram, Qian Livia, Skantze Gabriel
- P-laplacian Adaptation For Generative Pre-trained Vision-language Models Wu Haoyuan, Zhang Xinyun, Xu Peng, Liao Peiyu, Yao Xufeng, Yu Bei
- Approximated Prompt Tuning For Vision-language Pre-trained Models Wu Qiong, Huang Shubin, Zhou Yiyi, Dai Pingyang, Shu Annan, Jiang Guannan, Ji Rongrong
- Next-gpt: Any-to-any Multimodal LLM Wu Shengqiong, Fei Hao, Qu Leigang, Ji Wei, Chua Tat-seng
- Paragraph-to-image Generation With Information-enriched Diffusion Model Wu Weijia, Li Zhuang, He Yefei, Shou Mike Zheng, Shen Chunhua, Cheng Lele, Li Yan, Gao Tingting, Zhang Di, Wang Zhongyuan
- Jailbreaking GPT-4V Via Self-adversarial Attacks With System Prompts Wu Yuanwei, Li Xiang, Liu Yixin, Zhou Pan, Sun Lichao
- Pali-3 Vision Language Models: Smaller, Faster, Stronger Xi Chen, Xiao Wang, Lucas Beyer, Alexander Kolesnikov, Jialin Wu, Paul Voigtlaender, Basil Mustafa, Sebastian Goodman, Ibrahim Alabdulmohsin, Piotr Padlewski, Daniel Salz, Xi Xiong, Daniel Vlasic, Filip Pavetic, Keran Rong, Tianli Yu, Daniel Keysers, Xiaohua Zhai, Radu Soricut
- LLMGA: Multimodal Large Language Model Based Generation Assistant Xia Bin, Wang Shiyin, Tao Yingfan, Wang Yitong, Jia Jiaya
- Can I Trust Your Answer? Visually Grounded Video Question Answering Xiao Junbin, Yao Angela, Li Yicong, Chua Tat Seng
- Funqa: Towards Surprising Video Comprehension Xie Binzhu, Zhang Sicheng, Zhou Zitang, Li Bo, Zhang Yuanhan, Hessel Jack, Yang Jingkang, Liu Ziwei
- The Wall Street Neophyte: A Zero-shot Analysis Of Chatgpt Over Multimodal Stock Movement Prediction Challenges Xie Qianqian, Han Weiguang, Lai Yanzhao, Peng Min, Huang Jimin
- DPL: Decoupled Prompt Learning For Vision-language Models Xu Chen, Zhu Yuhan, Zhang Guozhen, Shen Haocheng, Liao Yixuan, Chen Xiaoxin, Wu Gangshan, Wang Limin
- Pixel Aligned Language Models Xu Jiarui, Zhou Xingyi, Yan Shen, Gu Xiuye, Arnab Anurag, Sun Chen, Wang Xiaolong, Schmid Cordelia
- Drivegpt4: Interpretable End-to-end Autonomous Driving Via Large Language Model Xu Zhenhua, Zhang Yujia, Xie Enze, Zhao Zhen, Guo Yong, Wong Kwan-yee K., Li Zhenguo, Zhao Hengshuang
- Voila-a: Aligning Vision-language Models With User's Gaze Attention Yan Kun, Ji Lei, Wang Zeyu, Wang Yuntao, Duan Nan, Ma Shuai
- Multimodal Chatgpt For Medical Applications: An Experimental Study Of GPT-4V Yan Zhiling, Zhang Kai, Zhou Rong, He Lifang, Li Xiang, Sun Lichao
- Tackling Vision Language Tasks Through Learning Inner Monologues Yang Diji, Chen Kezhen, Rao Jinmeng, Guo Xiaoyuan, Zhang Yawen, Yang Jie, Zhang Yi
- Unified Language-vision Pretraining In LLM With Dynamic Discrete Visual Tokenization Yang Jin, Kun Xu, Kun Xu, Liwei Chen, Chao Liao, Jianchao Tan, Quzhe Huang, Bin Chen, Chenyi Lei, An Liu, Chengru Song, Xiaoqiang Lei, Di Zhang, Wenwu Ou, Kun Gai, Yadong Mu
- Good Questions Help Zero-shot Image Reasoning Yang Kaiwen, Shen Tao, Tian Xinmei, Geng Xiubo, Tao Chongyang, Tao Dacheng, Zhou Tianyi
- Medxchat: A Unified Multimodal Large Language Model Framework Towards Cxrs Understanding And Generation Yang Ling, Wang Zhanyu, Chen Zhenghao, Liang Xinyu, Zhou Luping
- Lidar-llm: Exploring The Potential Of Large Language Models For 3D Lidar Understanding Yang Senqiao, Liu Jiaming, Zhang Ray, Pan Mingjie, Guo Zoey, Li Xiaoqi, Chen Zehui, Gao Peng, Guo Yandong, Zhang Shanghang
- Progressive Evidence Refinement For Open-domain Multimodal Retrieval Question Answering Yang Shuwen, Wu Anran, Wu Xingjiao, Xiao Luwei, Ma Tianlong, Jin Cheng, He Liang
- Lever LM: Configuring In-context Sequence To Lever Large Vision Language Models Yang Xu, Peng Yingzhe, Ma Haoxuan, Xu Shuo, Zhang Chi, Han Yucheng, Zhang Hanwang
- TEAL: Tokenize And Embed ALL For Multi-modal Large Language Models Yang Zhen, Zhang Yingxue, Meng Fandong, Zhou Jie
- I-code V2: An Autoregressive Generation Framework Over Vision, Language, And Speech Data Yang Ziyi, Khademi Mahmoud, Xu Yichong, Pryzant Reid, Fang Yuwei, Zhu Chenguang, Chen Dongdong, Qian Yao, Gao Mei, Chen Yi-ling, Gmyr Robert, Kanda Naoyuki, Codella Noel, Xiao Bin, Shi Yu, Yuan Lu, Yoshioka Takuya, Zeng Michael, Huang Xuedong
- Embodiedgpt: Vision-language Pre-training Via Embodied Chain Of Thought Yao Mu, Qinglong Zhang, Mengkang Hu, Wenhai Wang, Mingyu Ding, Jun Jin, Bin Wang, Jifeng Dai, Yu Qiao, Ping Luo
- Beyond Chain-of-thought, Effective Graph-of-thought Reasoning In Language Models Yao Yao, Li Zuchao, Zhao Hai
- Ureader: Universal Ocr-free Visually-situated Language Understanding With Multimodal Large Language Model Ye Jiabo, Hu Anwen, Xu Haiyang, Ye Qinghao, Yan Ming, Xu Guohai, Li Chenliang, Tian Junfeng, Qian Qi, Zhang Ji, Jin Qin, He Liang, Lin Xin Alex, Huang Fei
- A Multitask, Multilingual, Multimodal Evaluation Of Chatgpt On Reasoning, Hallucination, And Interactivity Yejin Bang, Samuel Cahyawijaya, Nayeon Lee, Wenliang Dai, Dan Su, Bryan Wilie, Holy Lovenia, Ziwei Ji, Tiezheng Yu, Willy Chung, Quyet V. Do, Yan Xu, Pascale Fung
- Cones: Concept Embedding Search For Parameter Efficient Tuning Large Vision Language Models Yi Huahui, Qin Ziyuan, Xu Wei, Guo Miaotian, Wang Kun, Zhang Shaoting, Li Kang, Lao Qicheng
- Evaluating Object Hallucination In Large Vision-language Models Yifan Li, Yifan Du, Kun Zhou, Jinpeng Wang, Wayne Xin Zhao, Ji-rong Wen
- Analyzing And Mitigating Object Hallucination In Large Vision-language Models Yiyang Zhou, Chenhang Cui, Jaehong Yoon, Linjun Zhang, Zhun Deng, Chelsea Finn, Mohit Bansal, Huaxiu Yao
- Search-adaptor: Embedding Customization For Information Retrieval Yoon Jinsung, Arik Sercan O, Chen Yanfei, Pfister Tomas
- Ferret: Refer And Ground Anything Anywhere At Any Granularity You Haoxuan, Zhang Haotian, Gan Zhe, Du Xianzhi, Zhang Bowen, Wang Zirui, Cao Liangliang, Chang Shih-fu, Yang Yinfei
- Merlin: Empowering Multimodal Llms With Foresight Minds Yu En, Zhao Liang, Wei Yana, Yang Jinrong, Wu Dongming, Kong Lingyu, Wei Haoran, Wang Tiancai, Ge Zheng, Zhang Xiangyu, Tao Wenbing
- CRAFT: Customizing Llms By Creating And Retrieving From Specialized Toolsets Yuan Lifan, Chen Yangyi, Wang Xingyao, Fung Yi R., Peng Hao, Ji Heng
- Large Language Models Illuminate A Progressive Pathway To Artificial Healthcare Assistant: A Review Yuan Mingze, Bao Peng, Yuan Jiajia, Shen Yunhao, Chen Zifan, Xie Yi, Zhao Jie, Chen Yang, Zhang Li, Shen Lin, Dong Bin
- Osprey: Pixel Understanding With Visual Instruction Tuning Yuan Yuqian, Li Wentong, Liu Jian, Tang Dongqi, Luo Xinjie, Qin Chi, Zhang Lei, Zhu Jianke
- Artgpt-4: Towards Artistic-understanding Large Vision-language Models With Enhanced Adapter Yuan Zhengqing, He Yunhong, Wang Kun, Ye Yanfang, Sun Lichao
- Contextual Object Detection With Multimodal Large Language Models Yuhang Zang, Wei Li, Jun Han, Kaiyang Zhou, Chen Change Loy
- On Evaluating Adversarial Robustness Of Large Vision-language Models Yunqing Zhao, Tianyu Pang, Chao Du, Xiao Yang, Chongxuan Li, Ngai-man Cheung, Min Lin
- Planting A SEED Of Vision In Large Language Model Yuying Ge, Yixiao Ge, Ziyun Zeng, Xintao Wang, Ying Shan
- What Matters In Training A Gpt4-style Language Model With Multimodal Inputs? Zeng Yan, Zhang Hanbo, Zheng Jiani, Xia Jiangnan, Wei Guoqiang, Wei Yang, Zhang Yuchen, Kong Tao
- Investigating The Catastrophic Forgetting In Multimodal Large Language Models Zhai Yuexiang, Tong Shengbang, Li Xiao, Cai Mu, Qu Qing, Lee Yong Jae, Ma Yi
- Vpgtrans: Transfer Visual Prompt Generator Across Llms Zhang Ao, Fei Hao, Yao Yuan, Ji Wei, Li Li, Liu Zhiyuan, Chua Tat-seng
- Next-chat: An LMM For Chat, Detection And Segmentation Zhang Ao, Yao Yuan, Ji Wei, Liu Zhiyuan, Chua Tat-seng
- Mm-narrator: Narrating Long-form Videos With Multimodal In-context Learning Zhang Chaoyi, Lin Kevin, Yang Zhengyuan, Wang Jianfeng, Li Linjie, Lin Chung-ching, Liu Zicheng, Wang Lijuan
- Video-llama: An Instruction-tuned Audio-visual Language Model For Video Understanding Zhang Hang, Li Xin, Bing Lidong
- Tuning Large Language Model For End-to-end Speech Translation Zhang Hao, Si Nianwen, Chen Yaqi, Zhang Wenlin, Yang Xukui, Qu Dan, Jiao Xiaolin
- Towards Perceiving Small Visual Details In Zero-shot Visual Question Answering With Multimodal Llms Zhang Jiarui, Khayatkhoei Mahyar, Chhikara Prateek, Ilievski Filip
- Moqagpt: Zero-shot Multi-modal Open-domain Question Answering With Large Language Model Zhang Le, Wu Yihong, Mo Fengran, Nie Jian-yun, Agrawal Aishwarya
- M3exam: A Multilingual, Multimodal, Multilevel Benchmark For Examining Large Language Models Zhang Wenxuan, Aljunied Sharifah Mahani, Gao Chang, Chia Yew Ken, Bing Lidong
- Gpt-4v(ision) As A Generalist Evaluator For Vision-language Tasks Zhang Xinlu, Lu Yujie, Wang Weizhi, Yan An, Yan Jun, Qin Lianke, Wang Heng, Yan Xifeng, Wang William Yang, Petzold Linda Ruth
- Making Large Language Models Perform Better In Knowledge Graph Completion Zhang Yichi, Chen Zhuo, Guo Lingbing, Xu Yajing, Zhang Wen, Chen Huajun
- Pre-trained Language Models Do Not Help Auto-regressive Text-to-image Generation Zhang Yuhui, Mckinzie Brandon, Gan Zhe, Shankar Vaishaal, Toshev Alexander
- Accountable Textual-visual Chat Learns To Reject Human Instructions In Image Re-creation Zhang Zhiwei, Liu Yuliang
- Reinforced UI Instruction Grounding: Towards A Generic UI Task Automation API Zhang Zhizheng, Xie Wenxuan, Zhang Xiaoyi, Lu Yan
- You Only Look At Screens: Multimodal Chain-of-action Agents Zhang Zhuosheng, Zhang Aston
- SVIT: Scaling Up Visual Instruction Tuning Zhao Bo, Wu Boya, He Muyang, Huang Tiejun
- Genixer: Empowering Multimodal Large Language Models As A Powerful Data Generator Zhao Henry Hengyuan, Zhou Pan, Shou Mike Zheng
- Chatspot: Bootstrapping Multimodal Llms Via Precise Referring Instruction Tuning Zhao Liang, Yu En, Ge Zheng, Yang Jinrong, Wei Haoran, Zhou Hongyu, Sun Jianjian, Peng Yuang, Dong Runpei, Han Chunrui, Zhang Xiangyu
- Hallucination Detection For Grounded Instruction Generation Zhao Lingjun, Nguyen Khanh, Daumé Hal III
- Retrieving Multimodal Information For Augmented Generation: A Survey Zhao Ruochen, Chen Hailin, Wang Weishi, Jiao Fangkai, Do Xuan Long, Qin Chengwei, Ding Bosheng, Guo Xiaobao, Li Minzhi, Li Xingxuan, Joty Shafiq
- Easygen: Easing Multimodal Generation With Bidiffuser And Llms Zhao Xiangyu, Liu Bo, Liu Qijiong, Shi Guangyuan, Wu Xiao-ming
- Chat With The Environment: Interactive Multimodal Perception Using Large Language Models Zhao Xufeng, Li Mengdi, Weber Cornelius, Hafez Muhammad Burhan, Wermter Stefan
- Mllm-dataengine: An Iterative Refinement Approach For MLLM Zhao Zhiyuan, Ouyang Linke, Wang Bin, Huang Siyuan, Zhang Pan, Dong Xiaoyi, Wang Jiaqi, He Conghui
- Librisqa: A Novel Dataset And Framework For Spoken Question Answering With Large Language Models Zhao Zihan, Jiang Yiyang, Liu Heyang, Wang Yanfeng, Wang Yu
- Chatbridge: Bridging Modalities With Large Language Model As A Language Catalyst Zhao Zijia, Guo Longteng, Yue Tongtian, Chen Sihan, Shao Shuai, Zhu Xinxin, Yuan Zehuan, Liu Jing
- Explainable Multimodal Emotion Recognition Zheng Lian, Haiyang Sun, Licai Sun, Hao Gu, Zhuofan Wen, Siyuan Zhang, Shun Chen, Mingyu Xu, Ke Xu, Kang Chen, Lan Chen, Shan Liang, Ya Li, Jiangyan Yi, Bin Liu, Jianhua Tao
- Towards Unified Token Learning For Vision-language Tracking Zheng Yaozong, Zhong Bineng, Liang Qihua, Li Guorong, Ji Rongrong, Li Xianxian
- Large Language Models Are Good Prompt Learners For Low-shot Image Classification Zheng Zhaoheng, Wei Jingmin, Hu Xuefeng, Zhu Haidong, Nevatia Ram
- Marinegpt: Unlocking Secrets Of Ocean To The Public Zheng Ziqiang, Zhang Jipeng, Vu Tuan-anh, Diao Shizhe, Tim Yue Him Wong, Yeung Sai-kit
- Large Language Model Can Interpret Latent Space Of Sequential Recommender Zhengyi Yang, Jiancan Wu, Yanchen Luo, Jizhi Zhang, Yancheng Yuan, An Zhang, Xiang Wang, Xiangnan He
- MM-REACT: Prompting Chatgpt For Multimodal Reasoning And Action Zhengyuan Yang, Linjie Li, Jianfeng Wang, Kevin Lin, Ehsan Azarnasab, Faisal Ahmed, Zicheng Liu, Ce Liu, Michael Zeng, Lijuan Wang
- Aligning Large Multimodal Models With Factually Augmented RLHF Zhiqing Sun, Sheng Shen, Shengcao Cao, Haotian Liu, Chunyuan Li, Yikang Shen, Chuang Gan, Liang-yan Gui, Yu-xiong Wang, Yiming Yang, Kurt Keutzer, Trevor Darrell
- Mitigating Hallucination In Visual Language Models With Visual Supervision Zhiyang Chen, Yousong Zhu, Yufei Zhan, Zhaowen Li, Chaoyang Zhao, Jinqiao Wang, Ming Tang
- Beyond Hallucinations: Enhancing Lvlms Through Hallucination-aware Direct Preference Optimization Zhiyuan Zhao, Bin Wang, Linke Ouyang, Xiaoyi Dong, Jiaqi Wang, Conghui He
- Let's Think Outside The Box: Exploring Leap-of-thought In Large Language Models With Creative Humor Generation Zhong Shanshan, Huang Zhongzhan, Gao Shanghua, Wen Wushao, Lin Liang, Zitnik Marinka, Zhou Pan
- Exploring Recommendation Capabilities Of Gpt-4v(ision): A Preliminary Case Study Zhou Peilin, Cao Meng, Huang You-liang, Ye Qichen, Zhang Peiyan, Liu Junling, Xie Yueqi, Hua Yining, Kim Jaeboum
- Infmllm: A Unified Framework For Visual-language Tasks Zhou Qiang, Wang Zhibin, Chu Wei, Xu Yinghui, Li Hao, Qi Yuan
- VL-GPT: A Generative Pre-trained Transformer For Vision And Language Understanding And Generation Zhu Jinguo, Ding Xiaohan, Ge Yixiao, Ge Yuying, Zhao Sijie, Zhao Hengshuang, Wang Xiaohua, Shan Ying
- Judgelm: Fine-tuned Large Language Models Are Scalable Judges Zhu Lianghui, Wang Xinggang, Wang Xinlong
- 3d-vista: Pre-trained Transformer For 3D Vision And Text Alignment Zhu Ziyu, Ma Xiaojian, Chen Yixin, Deng Zhidong, Huang Siyuan, Li Qing
- Multimodal Chain-of-thought Reasoning In Language Models Zhuosheng Zhang, Aston Zhang, Mu Li, Hai Zhao, George Karypis, Alex Smola
- JARVIS-1: Open-world Multi-task Agents With Memory-augmented Multimodal Language Models Zihao Wang, Shaofei Cai, Anji Liu, Yonggang Jin, Jinbing Hou, Bowei Zhang, Haowei Lin, Zhaofeng He, Zilong Zheng, Yaodong Yang, Xiaojian Ma, Yitao Liang
- Incorporating Probing Signals Into Multimodal Machine Translation Via Visual Question-answering Pairs Zuo Yuxin, Li Bei, Lv Chuanhao, Zheng Tong, Xiao Tong, Zhu Jingbo
- Bcamirs At Semeval-2024 Task 4: Beyond Words: A Multimodal And Multilingual Exploration Of Persuasion In Memes Abaskohi Amirhossein, Dabiriaghdam Amirhossein, Wang Lele, Carenini Giuseppe
- Phi-3 Technical Report: A Highly Capable Language Model Locally On Your Phone Abdin Marah, Aneja Jyoti, Awadalla Hany, Awadallah Ahmed, Awan Ammar Ahmad, Bach Nguyen, Bahree Amit, Bakhtiari Arash, Bao Jianmin, Behl Harkirat, Benhaim Alon, Bilenko Misha, Bjorck Johan, Bubeck Sébastien, Cai Martin, Cai Qin, Chaudhary Vishrav, Chen Dong, Chen Dongdong, Chen Weizhu, Chen Yen-chun, Chen Yi-ling, Cheng Hao, Chopra Parul, Dai Xiyang, Dixon Matthew, Eldan Ronen, Fragoso Victor, Gao Jianfeng, Gao Mei, Gao Min, Garg Amit, Del Giorno Allie, Goswami Abhishek, Gunasekar Suriya, Haider Emman, Hao Junheng, Hewett Russell J., Hu Wenxiang, Huynh Jamie, Iter Dan, Jacobs Sam Ade, Javaheripi Mojan, Jin Xin, Karampatziakis Nikos, Kauffmann Piero, Khademi Mahoud, Kim Dongwoo, Kim Young Jin, Kurilenko Lev, Lee James R., Lee Yin Tat, Li Yuanzhi, Li Yunsheng, Liang Chen, Liden Lars, Lin Xihui, Lin Zeqi, Liu Ce, Liu Liyuan, Liu Mengchen, Liu Weishung, Liu Xiaodong, Luo Chong, Madan Piyush, Mahmoudzadeh Ali, Majercak David, Mazzola Matt, Mendes Caio César Teodoro, Mitra Arindam, Modi Hardik, Nguyen Anh, Norick Brandon, Patra Barun, Perez-becker Daniel, Portet Thomas, Pryzant Reid, Qin Heyang, Radmilac Marko, Ren Liliang, De Rosa Gustavo, Rosset Corby, Roy Sambudha, Ruwase Olatunji, Saarikivi Olli, Saied Amin, Salim Adil, Santacroce Michael, Shah Shital, Shang Ning, Sharma Hiteshi, Shen Yelong, Shukla Swadheen, Song Xia, Tanaka Masahiro, Tupini Andrea, Vaddamanu Praneetha, Wang Chunyu, Wang Guanhua, Wang Lijuan, Wang Shuohang, Wang Xin, Wang Yu, Ward Rachel, Wen Wen, Witte Philipp, Wu Haiping, Wu Xiaoxia, Wyatt Michael, Xiao Bin, Xu Can, Xu Jiahang, Xu Weijian, Xue Jilong, Yadav Sonali, Yang Fan, Yang Jianwei, Yang Yifan, Yang Ziyi, Yu Donghan, Yuan Lu, Zhang Chenruidong, Zhang Cyril, Zhang Jianwen, Zhang Li Lyna, Zhang Yi, Zhang Yue, Zhang Yunan, Zhou Xiren
- Attnlrp: Attention-aware Layer-wise Relevance Propagation For Transformers Achtibat Reduan, Hatefi Sayed Mohammad Vakilzadeh, Dreyer Maximilian, Jain Aakriti, Wiegand Thomas, Lapuschkin Sebastian, Samek Wojciech
- Mememqa: Multimodal Question Answering For Memes Via Rationale-based Inferencing Agarwal Siddhant, Sharma Shivam, Nakov Preslav, Chakraborty Tanmoy
- Tuning Large Multimodal Models For Videos Using Reinforcement Learning From AI Feedback Ahn Daechul, Choi Yura, Yu Youngjae, Kang Dongyeop, Choi Jonghyun
- Yi: Open Foundation Models By 01.AI Ai 01., Young Alex, Chen Bei, Li Chao, Huang Chengen, Zhang Ge, Zhang Guanwei, Li Heng, Zhu Jiangcheng, Chen Jianqun, Chang Jing, Yu Kaidong, Liu Peng, Liu Qiang, Yue Shawn, Yang Senbin, Yang Shiming, Yu Tao, Xie Wen, Huang Wenhao, Hu Xiaohui, Ren Xiaoyi, Niu Xinyao, Nie Pengcheng, Xu Yuchi, Liu Yudong, Wang Yue, Cai Yuxuan, Gu Zhenyu, Liu Zhiyuan, Dai Zonghong
- Enhancing Textbook Question Answering Task With Large Language Models And Retrieval Augmented Generation Alawwad Hessa Abdulrahman, Alhothali Areej, Naseem Usman, Alkhathlan Ali, Jamal Amani
- Understanding Alignment In Multimodal Llms: A Comprehensive Study Amirloo Elmira, Fauconnier Jean-philippe, Roesmann Christoph, Kerl Christian, Boney Rinu, Qian Yusu, Wang Zirui, Dehghan Afshin, Yang Yinfei, Gan Zhe, Grasch Peter
- Enhancing Journalism With AI: A Study Of Contextualized Image Captioning For News Articles Using Llms And Lmms Anagnostopoulou Aliki, Gouvea Thiago, Sonntag Daniel
- Mm-phyrlhf: Reinforcement Learning Framework For Multimodal Physics Question-answering Anand Avinash, Kapuriya Janak, Kirtani Chhavi, Singh Apoorv, Saraf Jay, Lal Naman, Kumar Jatin, Shivam Adarsh Raj, Verma Astha, Shah Rajiv Ratn, Zimmermann Roger
- Hired: Attention-guided Token Dropping For Efficient Inference Of High-resolution Vision-language Models In Resource-constrained Environments Arif Kazi Hasan Ibn, Yoon Jinyi, Nikolopoulos Dimitrios S., Vandierendonck Hans, John Deepu, Ji Bo
- Minigpt4-video: Advancing Multimodal Llms For Video Understanding With Interleaved Visual-textual Tokens Ataallah Kirolos, Shen Xiaoqian, Abdelrahman Eslam, Sleiman Essam, Zhu Deyao, Ding Jian, Elhoseiny Mohamed
- Minigpt-reverse-designing: Predicting Image Adjustments Utilizing Minigpt-4 Azizi Vahid, Koochaki Fatemeh
- Souplm: Model Integration In Large Language And Multi-modal Models Bai Yue, Zhang Zichen, Lu Jiasen, Fu Yun
- Hallucination Of Multimodal Large Language Models: A Survey Bai Zechen, Wang Pichao, Xiao Tianjun, He Tong, Han Zongbo, Zhang Zheng, Shou Mike Zheng
- Enhancing Presentation Slide Generation By Llms With A Multi-staged End-to-end Approach Bandyopadhyay Sambaran, Maheshwari Himanshu, Natarajan Anandhavelu, Saxena Apoorv
- MAIRA-2: Grounded Radiology Report Generation Bannur Shruthi, Bouzid Kenza, Castro Daniel C., Schwaighofer Anton, Bond-taylor Sam, Ilse Maximilian, Pérez-garcía Fernando, Salvatelli Valentina, Sharma Harshita, Meissen Felix, Ranjit Mercy, Srivastav Shaury, Gong Julia, Falck Fabian, Oktay Ozan, Thieme Anja, Lungren Matthew P., Wetscherek Maria Teodora, Alvarez-valle Javier, Hyland Stephanie L.
- Exploring Autonomous Agents Through The Lens Of Large Language Models: A Review Barua Saikat
- Taking The Next Step With Generative Artificial Intelligence: The Transformative Role Of Multimodal Large Language Models In Science Education Bewersdorff Arne, Hartmann Christian, Hornberger Marie, Seßler Kathrin, Bannert Maria, Kasneci Enkelejda, Kasneci Gjergji, Zhai Xiaoming, Nerdel Claudia
- Fintral: A Family Of GPT-4 Level Multimodal Financial Large Language Models Bhatia Gagan, Nagoudi El Moatez Billah, Cavusoglu Hasan, Abdul-mageed Muhammad
- Workarena++: Towards Compositional Planning And Reasoning-based Common Knowledge Work Tasks Boisvert Léo, Thakkar Megh, Gasse Maxime, Caccia Massimo, De Chezelles Thibault Le Sellier, Cappart Quentin, Chapados Nicolas, Lacoste Alexandre, Drouin Alexandre
- FIHA: Autonomous Hallucination Evaluation In Vision-language Models With Davidson Scene Graphs Bowen Yan, Zhengsong Zhang, Liqiang Jing, Eftekhar Hossain, Xinya Du
- ARES: Alternating Reinforcement Learning And Supervised Fine-tuning For Enhanced Multi-modal Chain-of-thought Reasoning Through Diverse AI Feedback Byun Ju-seung, Chun Jiyun, Kil Jihyung, Perrault Andrew
- Wiki-llava: Hierarchical Retrieval-augmented Generation For Multimodal Llms Caffagni Davide, Cocchi Federico, Moratelli Nicholas, Sarto Sara, Cornia Marcella, Baraldi Lorenzo, Cucchiara Rita
- Sciassess: Benchmarking LLM Proficiency In Scientific Literature Analysis Cai Hengxing, Cai Xiaochen, Chang Junhan, Li Sihang, Yao Lin, Wang Changxin, Gao Zhifeng, Wang Hongshuai, Li Yongge, Lin Mujie, Yang Shuwen, Wang Jiankun, Xu Mingjun, Huang Jin, Xi Fang, Zhuang Jiaxi, Yin Yuqi, Li Yaqi, Chen Changhong, Cheng Zheng, Zhao Zifeng, Zhang Linfeng, Ke Guolin
- Matryoshka Multimodal Models Cai Mu, Yang Jianwei, Gao Jianfeng, Lee Yong Jae
- PRESTO: Progressive Pretraining Enhances Synthetic Chemistry Outcomes Cao He, Shao Yanjun, Liu Zhiyuan, Liu Zijing, Tang Xiangru, Yao Yuan, Li Yu
- MADTP: Multimodal Alignment-guided Dynamic Token Pruning For Accelerating Vision-language Transformer Cao Jianjian, Ye Peng, Li Shengze, Yu Chong, Tang Yansong, Lu Jiwen, Chen Tao
- Genception: Evaluate Multimodal Llms With Unlabeled Unimodal Data Cao Lele, Buchner Valentin, Senane Zineb, Yang Fangkai
- What Is The Visual Cognition Gap Between Humans And Multimodal Llms? Cao Xu, Lai Bolin, Ye Wenqian, Ma Yunsheng, Heintz Joerg, Chen Jintai, Cao Jianguo, Rehg James M.
- Chart-based Reasoning: Transferring Capabilities From Llms To Vlms Carbune Victor, Mansoor Hassan, Liu Fangyu, Aralikatte Rahul, Baechler Gilles, Chen Jindong, Sharma Abhanshu
- HAMMR: Hierarchical Multimodal React Agents For Generic VQA Castrejon Lluis, Mensink Thomas, Zhou Howard, Ferrari Vittorio, Araujo Andre, Uijlings Jasper
- Cross-modal Safety Alignment: Is Textual Unlearning All You Need? Chakraborty Trishna, Shayegani Erfan, Cai Zikui, Abu-ghazaleh Nael, Asif M. Salman, Dong Yue, Roy-chowdhury Amit K., Song Chengyu
- VITA: Towards Open-source Interactive Omni Multimodal LLM Chaoyou Fu, Haojia Lin, Zuwei Long, Yunhang Shen, Meng Zhao, Yifan Zhang, Shaoqi Dong, Xiong Wang, Di Yin, Long Ma, Xiawu Zheng, Ran He, Rongrong Ji, Yunsheng Wu, Caifeng Shan, Xing Sun
- Veagle: Advancements In Multimodal Representation Learning Chawla Rajat, Datta Arkajit, Verma Tushar, Jha Adarsh, Gautam Anmol, Vatsal Ayush, Chaterjee Sukrit, Ns Mukunda, Bhola Ishaan
- Model Composition For Multimodal Large Language Models Chen Chi, Du Yiyang, Fang Zheng, Wang Ziyue, Luo Fuwen, Li Peng, Yan Ming, Zhang Ji, Huang Fei, Sun Maosong, Liu Yang
- GUI-WORLD: A Dataset For Gui-oriented Multimodal Llm-based Agents Chen Dongping, Huang Yue, Wu Siyuan, Tang Jingyu, Chen Liuyi, Bai Yilin, He Zhigang, Wang Chenlong, Zhou Huichi, Li Yiqiang, Zhou Tianshuo, Yu Yue, Gao Chujie, Zhang Qihui, Gui Yi, Li Zhen, Wan Yao, Zhou Pan, Gao Jianfeng, Sun Lichao
- Cost-effective Instruction Learning For Pathology Vision And Language Analysis Chen Kaitao, Liu Mianxin, Yan Fang, Ma Lei, Shi Xiaoming, Wang Lilong, Wang Xiaosong, Zhu Lifeng, Wang Zhe, Zhou Mu, Zhang Shaoting
- Pca-bench: Evaluating Multimodal Large Language Models In Perception-cognition-action Chain Chen Liang, Zhang Yichi, Ren Shuhuai, Zhao Haozhe, Cai Zefan, Wang Yuchi, Wang Peiyi, Meng Xiangdi, Liu Tianyu, Chang Baobao
- Are We On The Right Way For Evaluating Large Vision-language Models? Chen Lin, Li Jinsong, Dong Xiaoyi, Zhang Pan, Zang Yuhang, Chen Zehui, Duan Haodong, Wang Jiaqi, Qiao Yu, Lin Dahua, Zhao Feng
- Quantifying And Mitigating Unimodal Biases In Multimodal Large Language Models: A Causal Perspective Chen Meiqi, Cao Yixin, Zhang Yan, Lu Chaochao
- Efficientqat: Efficient Quantization-aware Training For Large Language Models Chen Mengzhao, Shao Wenqi, Xu Peng, Wang Jiahao, Gao Peng, Zhang Kaipeng, Qiao Yu, Luo Ping
- Enhancing Visual Question Answering Through Ranking-based Hybrid Training And Multimodal Fusion Chen Peiyuan, Zhang Zecheng, Dong Yiping, Zhou Li, Wang Han
- Llava-mole: Sparse Mixture Of Lora Experts For Mitigating Data Conflicts In Instruction Finetuning Mllms Chen Shaoxiang, Jie Zequn, Ma Lin
- Recoverable Compression: A Multimodal Vision Token Recovery Mechanism Guided By Text Information Chen Yi, Xu Jian, Zhang Xu-yao, Liu Wen-zhuo, Liu Yang-yang, Liu Cheng-lin
- MLLM Is A Strong Reranker: Advancing Multimodal Retrieval-augmented Generation Via Knowledge-enhanced Reranking And Noise-injected Training Chen Zhanpeng, Xu Chengjin, Qi Yiyan, Guo Jian
- Unveiling Typographic Deceptions: Insights Of The Typographic Vulnerability In Large Vision-language Model Cheng Hao, Xiao Erjia, Gu Jindong, Yang Le, Duan Jinhao, Zhang Jize, Cao Jiahang, Xu Kaidi, Xu Renjing
- Social Debiasing For Fair Multi-modal Llms Cheng Harry, Guo Yangyang, Guo Qingpei, Yang Ming, Gan Tian, Nie Liqiang
- ANOLE: An Open, Autoregressive, Native Large Multimodal Models For Interleaved Image-text Generation Chern Ethan, Su Jiadi, Ma Yan, Liu Pengfei
- Attentionlego: An Open-source Building Block For Spatially-scalable Large Language Model Accelerator With Processing-in-memory Technology Cong Rongqing, He Wenyang, Li Mingxuan, Luo Bangning, Yang Zebin, Yang Yuchao, Huang Ru, Yan Bonan
- Muap: Multi-step Adaptive Prompt Learning For Vision-language Model With Missing Modality Dai Ruiting, Tan Yuqiao, Mo Lisi, He Tao, Qin Ke, Liang Shuang
- Speechverse: A Large-scale Generalizable Audio Language Model Das Nilaksh, Dingliwal Saket, Ronanki Srikanth, Paturi Rohit, Huang Zhaocheng, Mathur Prashant, Yuan Jie, Bekal Dhanush, Niu Xing, Jayanthi Sai Muralidhar, Li Xilai, Mundnich Karel, Sunkara Monica, Srinivasan Sundararajan, Han Kyu J, Kirchhoff Katrin
- Unveiling Encoder-free Vision-language Models Diao Haiwen, Cui Yufeng, Li Xiaotong, Wang Yueze, Lu Huchuan, Wang Xinlong
- Learning How To Ask: Cycle-consistency Refines Prompts In Multimodal Foundation Models Diesendruck Maurice, Lin Jianzhe, Imani Shima, Mahalingam Gayathri, Xu Mingyang, Zhao Jie
- Evaluating Linguistic Capabilities Of Multimodal Llms In The Lens Of Few-shot Learning Dogan Mustafa, Kesen Ilker, Calixto Iacer, Erdem Aykut, Erdem Erkut
- Modality-aware Integration With Large Language Models For Knowledge-based Visual Question Answering Dong Junnan, Zhang Qinggang, Zhou Huachi, Zha Daochen, Zheng Pai, Huang Xiao
- Internlm-xcomposer2-4khd: A Pioneering Large Vision-language Model Handling Resolutions From 336 Pixels To 4K HD Dong Xiaoyi, Zhang Pan, Zang Yuhang, Cao Yuhang, Wang Bin, Ouyang Linke, Zhang Songyang, Duan Haodong, Zhang Wenwei, Li Yining, Yan Hang, Gao Yang, Chen Zhe, Zhang Xinyue, Li Wei, Li Jingwen, Wang Wenhai, Chen Kai, He Conghui, Zhang Xingcheng, Dai Jifeng, Qiao Yu, Lin Dahua, Wang Jiaqi
- Jailbreaking Text-to-image Models With Llm-based Agents Dong Yingkai, Li Zheng, Meng Xiangtao, Yu Ning, Guo Shanqing
- Training A Vision Language Model As Smartphone Assistant Dorka Nicolai, Marecki Janusz, Anwar Ammar
- Towards Multimodal In-context Learning For Vision & Language Models Doveh Sivan, Perek Shaked, Mirza M. Jehanzeb, Lin Wei, Alfassy Amit, Arbelle Assaf, Ullman Shimon, Karlinsky Leonid
- Workarena: How Capable Are Web Agents At Solving Common Knowledge Work Tasks? Drouin Alexandre, Gasse Maxime, Caccia Massimo, Laradji Issam H., Del Verme Manuel, Marty Tom, Boisvert Léo, Thakkar Megh, Cappart Quentin, Vazquez David, Chapados Nicolas, Lacoste Alexandre
- Instruction-guided Scene Text Recognition Du Yongkun, Chen Zhineng, Su Yuchen, Jia Caiyan, Jiang Yu-gang
- CROME: Cross-modal Adapters For Efficient Multimodal LLM Ebrahimi Sayna, Arik Sercan O., Nama Tejas, Pfister Tomas
- Nphardeval4v: A Dynamic Reasoning Benchmark Of Multimodal Large Language Models Fan Lizhou, Hua Wenyue, Li Xiang, Zhu Kaijie, Jin Mingyu, Li Lingyao, Ling Haoyang, Chi Jinkui, Wang Jindong, Ma Xin, Zhang Yongfeng
- PATCH! Psychometrics-assisted Benchmarking Of Large Language Models: A Case Study Of Proficiency In 8th Grade Mathematics Fang Qixiang, Oberski Daniel L., Nguyen Dong
- Multi-modal Hallucination Control By Visual Information Grounding Favero Alessandro, Zancato Luca, Trager Matthew, Choudhary Siddharth, Perera Pramuditha, Achille Alessandro, Swaminathan Ashwin, Soatto Stefano
- Optimus: Accelerating Large-scale Multi-modal LLM Training By Bubble Exploitation Feng Weiqi, Chen Yangrui, Wang Shaoyu, Peng Yanghua, Lin Haibin, Yu Minlan
- MERA: A Comprehensive LLM Evaluation In Russian Fenogenova Alena, Chervyakov Artem, Martynov Nikita, Kozlova Anastasia, Tikhonova Maria, Akhmetgareeva Albina, Emelyanov Anton, Shevelev Denis, Lebedev Pavel, Sinev Leonid, Isaeva Ulyana, Kolomeytseva Katerina, Moskovskiy Daniil, Goncharova Elizaveta, Savushkin Nikita, Mikhailova Polina, Dimitrov Denis, Panchenko Alexander, Markov Sergei
- Grillbot In Practice: Lessons And Tradeoffs Deploying Large Language Models For Adaptable Conversational Task Assistants Fischer Sophie, Gemmell Carlos, Tecklenburg Niklas, Mackie Iain, Rossetto Federico, Dalton Jeffrey
- CLIP With Generative Latent Replay: A Strong Baseline For Incremental Learning Frascaroli Emanuele, Panariello Aniello, Buzzega Pietro, Bonicelli Lorenzo, Porrello Angelo, Calderara Simone
- BLINK: Multimodal Large Language Models Can See But Not Perceive Fu Xingyu, Hu Yushi, Li Bangzheng, Feng Yu, Wang Haoyu, Lin Xudong, Roth Dan, Smith Noah A., Ma Wei-chiu, Krishna Ranjay
- UI-JEPA: Towards Active Perception Of User Intent Through Onscreen User Activity Fu Yicheng, Anantha Raviteja, Vashisht Prabal, Cheng Jianpeng, Littwin Etai
- Layoutllm: Large Language Model Instruction Tuning For Visually Rich Document Understanding Fujitake Masato
- Question Aware Vision Transformer For Multimodal Reasoning Ganz Roy, Kittenplon Yair, Aberdam Aviad, Avraham Elad Ben, Nuriel Oren, Mazor Shai, Litman Ron
- AIM: Let Any Multi-modal Large Language Models Embrace Efficient In-context Learning Gao Jun, Qiao Qian, Cao Ziqiang, Wang Zili, Li Wenjie
- Tc-llava: Rethinking The Transfer From Image To Video Understanding With Temporal Considerations Gao Mingze, Liu Jingyu, Li Mingda, Xie Jiangtao, Liu Qingbin, Zhao Bo, Chen Xi, Xiong Hui
- Lumina-t2x: Transforming Text Into Any Modality, Resolution, And Duration Via Flow-based Large Diffusion Transformers Gao Peng, Zhuo Le, Liu Dongyang, Du Ruoyi, Luo Xu, Qiu Longtian, Zhang Yuhang, Lin Chen, Huang Rongjie, Geng Shijie, Zhang Renrui, Xi Junlin, Shao Wenqi, Jiang Zhengkai, Yang Tianshuo, Ye Weicai, Tong He, He Jingwen, Qiao Yu, Li Hongsheng
- Worldgpt: Empowering LLM As Multimodal World Model Ge Zhiqi, Huang Hongzhe, Zhou Mingze, Li Juncheng, Wang Guoming, Tang Siliang, Zhuang Yueting
- Does Object Grounding Really Reduce Hallucination Of Large Vision-language Models? Geigle Gregor, Timofte Radu, Glavaš Goran
- Gemini 1.5: Unlocking Multimodal Understanding Across Millions Of Tokens Of Context Gemini Team, Georgiev Petko, Lei Ving Ian, Burnell Ryan, Bai Libin, Gulati Anmol, Tanzer Garrett, Vincent Damien, Pan Zhufeng, Wang Shibo, Mariooryad Soroosh, Ding Yifan, Geng Xinyang, Alcober Fred, Frostig Roy, Omernick Mark, Walker Lexi, Paduraru Cosmin, Sorokin Christina, Tacchetti Andrea, Gaffney Colin, Daruki Samira, Sercinoglu Olcan, Gleicher Zach, Love Juliette, Voigtlaender Paul, Jain Rohan, Surita Gabriela, Mohamed Kareem, Blevins Rory, Ahn Junwhan, Zhu Tao, Kawintiranon Kornraphop, Firat Orhan, Gu Yiming, Zhang Yujing, Rahtz Matthew, Faruqui Manaal, Clay Natalie, Gilmer Justin, Co-reyes Jd, Penchev Ivo, Zhu Rui, Morioka Nobuyuki, Hui Kevin, Haridasan Krishna, Campos Victor, Mahdieh Mahdis, Guo Mandy, Hassan Samer, Kilgour Kevin, Vezer Arpi, Cheng Heng-tze, De Liedekerke Raoul, Goyal Siddharth, Barham Paul, Strouse Dj, Noury Seb, Adler Jonas, Sundararajan Mukund, Vikram Sharad, Lepikhin Dmitry, Paganini Michela, Garcia Xavier, Yang Fan, Valter Dasha, Trebacz Maja, Vodrahalli Kiran, Asawaroengchai Chulayuth, Ring Roman, Kalb Norbert, Soares Livio Baldini, Brahma Siddhartha, Steiner David, Yu Tianhe, Mentzer Fabian, He Antoine, Gonzalez Lucas, Xu Bibo, Kaufman Raphael Lopez, Shafey Laurent El, Oh Junhyuk, Hennigan Tom, Driessche George Van Den, Odoom Seth, Lucic Mario, Roelofs Becca, Lall Sid, Marathe Amit, Chan Betty, Ontanon Santiago, He Luheng, Teplyashin Denis, Lai Jonathan, Crone Phil, Damoc Bogdan, Ho Lewis, Riedel Sebastian, Lenc Karel, Yeh Chih-kuan, Chowdhery Aakanksha, Xu Yang, Kazemi Mehran, Amid Ehsan, Petrushkina Anastasia, Swersky Kevin, Khodaei Ali, Chen Gowoon, Larkin Chris, Pinto Mario, Yan Geng, Badia Adria Puigdomenech, Patil Piyush, Hansen Steven, Orr Dave, Arnold Sebastien M. 
R., Grimstad Jordan, Dai Andrew, Douglas Sholto, Sinha Rishika, Yadav Vikas, Chen Xi, Gribovskaya Elena, Austin Jacob, Zhao Jeffrey, Patel Kaushal, Komarek Paul, Austin Sophia, Borgeaud Sebastian, Friso Linda, Goyal Abhimanyu, Caine Ben, Cao Kris, Chung Da-woon, Lamm Matthew, Barth-maron Gabe, Kagohara Thais, Olszewska Kate, Chen Mia, Shivakumar Kaushik, Agarwal Rishabh, Godhia Harshal, Rajwar Ravi, Snaider Javier, Dotiwalla Xerxes, Liu Yuan, Barua Aditya, Ungureanu Victor, Zhang Yuan, Batsaikhan Bat-orgil, Wirth Mateo, Qin James, Danihelka Ivo, Doshi Tulsee, Chadwick Martin, Chen Jilin, Jain Sanil, Le Quoc, Kar Arjun, Gurumurthy Madhu, Li Cheng, Sang Ruoxin, Liu Fangyu, Lamprou Lampros, Munoz Rich, Lintz Nathan, Mehta Harsh, Howard Heidi, Reynolds Malcolm, Aroyo Lora, Wang Quan, Blanco Lorenzo, Cassirer Albin, Griffith Jordan, Das Dipanjan, Lee Stephan, Sygnowski Jakub, Fisher Zach, Besley James, Powell Richard, Ahmed Zafarali, Paulus Dominik, Reitter David, Borsos Zalan, Joshi Rishabh, Pope Aedan, Hand Steven, Selo Vittorio, Jain Vihan, Sethi Nikhil, Goel Megha, Makino Takaki, May Rhys, Yang Zhen, Schalkwyk Johan, Butterfield Christina, Hauth Anja, Goldin Alex, Hawkins Will, Senter Evan, Brin Sergey, Woodman Oliver, Ritter Marvin, Noland Eric, Giang Minh, Bolina Vijay, Lee Lisa, Blyth Tim, Mackinnon Ian, Reid Machel, Sarvana Obaid, Silver David, Chen Alexander, Wang Lily, Maggiore Loren, Chang Oscar, Attaluri Nithya, Thornton Gregory, Chiu Chung-cheng, Bunyan Oskar, Levine Nir, Chung Timothy, Eltyshev Evgenii, Si Xiance, Lillicrap Timothy, Brady Demetra, Aggarwal Vaibhav, Wu Boxi, Xu Yuanzhong, Mcilroy Ross, Badola Kartikeya, Sandhu Paramjit, Moreira Erica, Stokowiec Wojciech, Hemsley Ross, Li Dong, Tudor Alex, Shyam Pranav, Rahimtoroghi Elahe, Haykal Salem, Sprechmann Pablo, Zhou Xiang, Mincu Diana, Li Yujia, Addanki Ravi, Krishna Kalpesh, Wu Xiao, Frechette Alexandre, Eyal Matan, Dafoe Allan, Lacey Dave, Whang Jay, Avrahami Thi, Zhang Ye, Taropa Emanuel, Lin 
Hanzhao, Toyama Daniel, Rutherford Eliza, Sano Motoki, Choe Hyunjeong, Tomala Alex, Safranek-shrader Chalence, Kassner Nora, Pajarskas Mantas, Harvey Matt, Sechrist Sean, Fortunato Meire, Lyu Christina, Elsayed Gamaleldin, Kuang Chenkai, Lottes James, Chu Eric, Jia Chao, Chen Chih-wei, Humphreys Peter, Baumli Kate, Tao Connie, Samuel Rajkumar, Santos Cicero Nogueira Dos, Andreassen Anders, Rakićević Nemanja, Grewe Dominik, Kumar Aviral, Winkler Stephanie, Caton Jonathan, Brock Andrew, Dalmia Sid, Sheahan Hannah, Barr Iain, Miao Yingjie, Natsev Paul, Devlin Jacob, Behbahani Feryal, Prost Flavien, Sun Yanhua, Myaskovsky Artiom, Pillai Thanumalayan Sankaranarayana, Hurt Dan, Lazaridou Angeliki, Xiong Xi, Zheng Ce, Pardo Fabio, Li Xiaowei, Horgan Dan, Stanton Joe, Ambar Moran, Xia Fei, Lince Alejandro, Wang Mingqiu, Mustafa Basil, Webson Albert, Lee Hyo, Anil Rohan, Wicke Martin, Dozat Timothy, Sinha Abhishek, Piqueras Enrique, Dabir Elahe, Upadhyay Shyam, Boral Anudhyan, Hendricks Lisa Anne, Fry Corey, Djolonga Josip, Su Yi, Walker Jake, Labanowski Jane, Huang Ronny, Misra Vedant, Chen Jeremy, Skerry-ryan Rj, Singh Avi, Rijhwani Shruti, Yu Dian, Castro-ros Alex, Changpinyo Beer, Datta Romina, Bagri Sumit, Hrafnkelsson Arnar Mar, Maggioni Marcello, Zheng Daniel, Sulsky Yury, Hou Shaobo, Paine Tom Le, Yang Antoine, Riesa Jason, Rogozinska Dominika, Marcus Dror, Badawy Dalia El, Zhang Qiao, Wang Luyu, Miller Helen, Greer Jeremy, Sjos Lars Lowe, Nova Azade, Zen Heiga, Chaabouni Rahma, Rosca Mihaela, Jiang Jiepu, Chen Charlie, Liu Ruibo, Sainath Tara, Krikun Maxim, Polozov Alex, Lespiau Jean-baptiste, Newlan Josh, Cankara Zeyncep, Kwak Soo, Xu Yunhan, Chen Phil, Coenen Andy, Meyer Clemens, Tsihlas Katerina, Ma Ada, Gottweis Juraj, Xing Jinwei, Gu Chenjie, Miao Jin, Frank Christian, Cankara Zeynep, Ganapathy Sanjay, Dasgupta Ishita, Hughes-fitt Steph, Chen Heng, Reid David, Rong Keran, Fan Hongmin, Van Amersfoort Joost, Zhuang Vincent, Cohen Aaron, Gu Shixiang Shane, 
Mohananey Anhad, Ilic Anastasija, Tobin Taylor, Wieting John, Bortsova Anna, Thacker Phoebe, Wang Emma, Caveness Emily, Chiu Justin, Sezener Eren, Kaskasoli Alex, Baker Steven, Millican Katie, Elhawaty Mohamed, Aisopos Kostas, Lebsack Carl, Byrd Nathan, Dai Hanjun, Jia Wenhao, Wiethoff Matthew, Davoodi Elnaz, Weston Albert, Yagati Lakshman, Ahuja Arun, Gao Isabel, Pundak Golan, Zhang Susan, Azzam Michael, Sim Khe Chai, Caelles Sergi, Keeling James, Sharma Abhanshu, Swing Andy, Li Yaguang, Liu Chenxi, Bostock Carrie Grimes, Bansal Yamini, Nado Zachary, Anand Ankesh, Lipschultz Josh, Karmarkar Abhijit, Proleev Lev, Ittycheriah Abe, Yeganeh Soheil Hassas, Polovets George, Faust Aleksandra, Sun Jiao, Rrustemi Alban, Li Pen, Shivanna Rakesh, Liu Jeremiah, Welty Chris, Lebron Federico, Baddepudi Anirudh, Krause Sebastian, Parisotto Emilio, Soricut Radu, Xu Zheng, Bloxwich Dawn, Johnson Melvin, Neyshabur Behnam, Mao-jones Justin, Wang Renshen, Ramasesh Vinay, Abbas Zaheer, Guez Arthur, Segal Constant, Nguyen Duc Dung, Svensson James, Hou Le, York Sarah, Milan Kieran, Bridgers Sophie, Gworek Wiktor, Tagliasacchi Marco, Lee-thorp James, Chang Michael, Guseynov Alexey, Hartman Ale Jakse, Kwong Michael, Zhao Ruizhe, Kashem Sheleem, Cole Elizabeth, Miech Antoine, Tanburn Richard, Phuong Mary, Pavetic Filip, Cevey Sebastien, Comanescu Ramona, Ives Richard, Yang Sherry, Du Cosmo, Li Bo, Zhang Zizhao, Iinuma Mariko, Hu Clara Huiyi, Roy Aurko, Bijwadia Shaan, Zhu Zhenkai, Martins Danilo, Saputro Rachel, Gergely Anita, Zheng Steven, Jia Dawei, Antonoglou Ioannis, Sadovsky Adam, Gu Shane, Bi Yingying, Andreev Alek, Samangooei Sina, Khan Mina, Kocisky Tomas, Filos Angelos, Kumar Chintu, Bishop Colton, Yu Adams, Hodkinson Sarah, Mittal Sid, Shah Premal, Moufarek Alexandre, Cheng Yong, Bloniarz Adam, Lee Jaehoon, Pejman Pedram, Michel Paul, Spencer Stephen, Feinberg Vladimir, Xiong Xuehan, Savinov Nikolay, Smith Charlotte, Shakeri Siamak, Tran Dustin, Chesus Mary, Bohnet Bernd, Tucker 
George, Von Glehn Tamara, Muir Carrie, Mao Yiran, Kazawa Hideto, Slone Ambrose, Soparkar Kedar, Shrivastava Disha, Cobon-kerr James, Sharman Michael, Pavagadhi Jay, Araya Carlos, Misiunas Karolis, Ghelani Nimesh, Laskin Michael, Barker David, Li Qiujia, Briukhov Anton, Houlsby Neil, Glaese Mia, Lakshminarayanan Balaji, Schucher Nathan, Tang Yunhao, Collins Eli, Lim Hyeontaek, Feng Fangxiaoyu, Recasens Adria, Lai Guangda, Magni Alberto, De Cao Nicola, Siddhant Aditya, Ashwood Zoe, Orbay Jordi, Dehghani Mostafa, Brennan Jenny, He Yifan, Xu Kelvin, Gao Yang, Saroufim Carl, Molloy James, Wu Xinyi, Arnold Seb, Chang Solomon, Schrittwieser Julian, Buchatskaya Elena, Radpour Soroush, Polacek Martin, Giordano Skye, Bapna Ankur, Tokumine Simon, Hellendoorn Vincent, Sottiaux Thibault, Cogan Sarah, Severyn Aliaksei, Saleh Mohammad, Thakoor Shantanu, Shefey Laurent, Qiao Siyuan, Gaba Meenu, Chang Shuo-yiin, Swanson Craig, Zhang Biao, Lee Benjamin, Rubenstein Paul Kishan, Song Gan, Kwiatkowski Tom, Koop Anna, Kannan Ajay, Kao David, Schuh Parker, Stjerngren Axel, Ghiasi Golnaz, Gibson Gena, Vilnis Luke, Yuan Ye, Ferreira Felipe Tiengo, Kamath Aishwarya, Klimenko Ted, Franko Ken, Xiao Kefan, Bhattacharya Indro, Patel Miteyan, Wang Rui, Morris Alex, Strudel Robin, Sharma Vivek, Choy Peter, Hashemi Sayed Hadi, Landon Jessica, Finkelstein Mara, Jhakra Priya, Frye Justin, Barnes Megan, Mauger Matthew, Daun Dennis, Baatarsukh Khuslen, Tung Matthew, Farhan Wael, Michalewski Henryk, Viola Fabio, Quitry Felix De Chaumont, Lan Charline Le, Hudson Tom, Wang Qingze, Fischer Felix, Zheng Ivy, White Elspeth, Dragan Anca, Alayrac Jean-baptiste, Ni Eric, Pritzel Alexander, Iwanicki Adam, Isard Michael, Bulanova Anna, Zilka Lukas, Dyer Ethan, Sachan Devendra, Srinivasan Srivatsan, Muckenhirn Hannah, Cai Honglong, Mandhane Amol, Tariq Mukarram, Rae Jack W., Wang Gary, Ayoub Kareem, Fitzgerald Nicholas, Zhao Yao, Han Woohyun, Alberti Chris, Garrette Dan, Krishnakumar Kashyap, Gimenez Mai, 
Levskaya Anselm, Sohn Daniel, Matak Josip, Iturrate Inaki, Chang Michael B., Xiang Jackie, Cao Yuan, Ranka Nishant, Brown Geoff, Hutter Adrian, Mirrokni Vahab, Chen Nanxin, Yao Kaisheng, Egyed Zoltan, Galilee Francois, Liechty Tyler, Kallakuri Praveen, Palmer Evan, Ghemawat Sanjay, Liu Jasmine, Tao David, Thornton Chloe, Green Tim, Jasarevic Mimi, Lin Sharon, Cotruta Victor, Tan Yi-xuan, Fiedel Noah, Yu Hongkun, Chi Ed, Neitz Alexander, Heitkaemper Jens, Sinha Anu, Zhou Denny, Sun Yi, Kaed Charbel, Hulse Brice, Mishra Swaroop, Georgaki Maria, Kudugunta Sneha, Farabet Clement, Shafran Izhak, Vlasic Daniel, Tsitsulin Anton, Ananthanarayanan Rajagopal, Carin Alen, Su Guolong, Sun Pei, V Shashank, Carvajal Gabriel, Broder Josef, Comsa Iulia, Repina Alena, Wong William, Chen Warren Weilun, Hawkins Peter, Filonov Egor, Loher Lucia, Hirnschall Christoph, Wang Weiyi, Ye Jingchen, Burns Andrea, Cate Hardie, Wright Diana Gage, Piccinini Federico, Zhang Lei, Lin Chu-cheng, Gog Ionel, Kulizhskaya Yana, Sreevatsa Ashwin, Song Shuang, Cobo Luis C., Iyer Anand, Tekur Chetan, Garrido Guillermo, Xiao Zhuyun, Kemp Rupert, Zheng Huaixiu Steven, Li Hui, Agarwal Ananth, Ngani Christel, Goshvadi Kati, Santamaria-fernandez Rebeca, Fica Wojciech, Chen Xinyun, Gorgolewski Chris, Sun Sean, Garg Roopal, Ye Xinyu, Eslami S. M. 
Ali, Hua Nan, Simon Jon, Joshi Pratik, Kim Yelin, Tenney Ian, Potluri Sahitya, Thiet Lam Nguyen, Yuan Quan, Luisier Florian, Chronopoulou Alexandra, Scellato Salvatore, Srinivasan Praveen, Chen Minmin, Koverkathu Vinod, Dalibard Valentin, Xu Yaming, Saeta Brennan, Anderson Keith, Sellam Thibault, Fernando Nick, Huot Fantine, Jung Junehyuk, Varadarajan Mani, Quinn Michael, Raul Amit, Le Maigo, Habalov Ruslan, Clark Jon, Jalan Komal, Bullard Kalesha, Singhal Achintya, Luong Thang, Wang Boyu, Rajayogam Sujeevan, Eisenschlos Julian, Jia Johnson, Finchelstein Daniel, Yakubovich Alex, Balle Daniel, Fink Michael, Agarwal Sameer, Li Jing, Dvijotham Dj, Pal Shalini, Kang Kai, Konzelmann Jaclyn, Beattie Jennifer, Dousse Olivier, Wu Diane, Crocker Remi, Elkind Chen, Jonnalagadda Siddhartha Reddy, Lee Jong, Holtmann-rice Dan, Kallarackal Krystal, Liu Rosanne, Vnukov Denis, Vats Neera, Invernizzi Luca, Jafari Mohsen, Zhou Huanjie, Taylor Lilly, Prendki Jennifer, Wu Marcus, Eccles Tom, Liu Tianqi, Kopparapu Kavya, Beaufays Francoise, Angermueller Christof, Marzoca Andreea, Sarcar Shourya, Dib Hilal, Stanway Jeff, Perbet Frank, Trdin Nejc, Sterneck Rachel, Khorlin Andrey, Li Dinghua, Wu Xihui, Goenka Sonam, Madras David, Goldshtein Sasha, Gierke Willi, Zhou Tong, Liu Yaxin, Liang Yannie, White Anais, Li Yunjie, Singh Shreya, Bahargam Sanaz, Epstein Mark, Basu Sujoy, Lao Li, Ozturel Adnan, Crous Carl, Zhai Alex, Lu Han, Tung Zora, Gaur Neeraj, Walton Alanna, Dixon Lucas, Zhang Ming, Globerson Amir, Uy Grant, Bolt Andrew, Wiles Olivia, Nasr Milad, Shumailov Ilia, Selvi Marco, Piccinno Francesco, Aguilar Ricardo, Mccarthy Sara, Khalman Misha, Shukla Mrinal, Galic Vlado, Carpenter John, Villela Kevin, Zhang Haibin, Richardson Harry, Martens James, Bosnjak Matko, Belle Shreyas Rammohan, Seibert Jeff, Alnahlawi Mahmoud, Mcwilliams Brian, Singh Sankalp, Louis Annie, Ding Wen, Popovici Dan, Simicich Lenin, Knight Laura, Mehta Pulkit, Gupta Nishesh, Shi Chongyang, Fatehi Saaber, Mitrovic 
Jovana, Grills Alex, Pagadora Joseph, Petrova Dessie, Eisenbud Danielle, Zhang Zhishuai, Yates Damion, Mittal Bhavishya, Tripuraneni Nilesh, Assael Yannis, Brovelli Thomas, Jain Prateek, Velimirovic Mihajlo, Akbulut Canfer, Mu Jiaqi, Macherey Wolfgang, Kumar Ravin, Xu Jun, Qureshi Haroon, Comanici Gheorghe, Wiesner Jeremy, Gong Zhitao, Ruddock Anton, Bauer Matthias, Felt Nick, Gp Anirudh, Arnab Anurag, Zelle Dustin, Rothfuss Jonas, Rosgen Bill, Shenoy Ashish, Seybold Bryan, Li Xinjian, Mudigonda Jayaram, Erdogan Goker, Xia Jiawei, Simsa Jiri, Michi Andrea, Yao Yi, Yew Christopher, Kan Steven, Caswell Isaac, Radebaugh Carey, Elisseeff Andre, Valenzuela Pedro, Mckinney Kay, Paterson Kim, Cui Albert, Latorre-chimoto Eri, Kim Solomon, Zeng William, Durden Ken, Ponnapalli Priya, Sosea Tiberiu, Choquette-choo Christopher A., Manyika James, Robenek Brona, Vashisht Harsha, Pereira Sebastien, Lam Hoi, Velic Marko, Owusu-afriyie Denese, Lee Katherine, Bolukbasi Tolga, Parrish Alicia, Lu Shawn, Park Jane, Venkatraman Balaji, Talbert Alice, Rosique Lambert, Cheng Yuchung, Sozanschi Andrei, Paszke Adam, Kumar Praveen, Austin Jessica, Li Lu, Salama Khalid, Kim Wooyeol, Dukkipati Nandita, Baryshnikov Anthony, Kaplanis Christos, Sheng Xianghai, Chervonyi Yuri, Unlu Caglar, Casas Diego De Las, Askham Harry, Tunyasuvunakool Kathryn, Gimeno Felix, Poder Siim, Kwak Chester, Miecnikowski Matt, Mirrokni Vahab, Dimitriev Alek, Parisi Aaron, Liu Dangyi, Tsai Tomy, Shevlane Toby, Kouridi Christina, Garmon Drew, Goedeckemeyer Adrian, Brown Adam R., Vijayakumar Anitha, Elqursh Ali, Jazayeri Sadegh, Huang Jin, Carthy Sara Mc, Hoover Jay, Kim Lucy, Kumar Sandeep, Chen Wei, Biles Courtney, Bingham Garrett, Rosen Evan, Wang Lisa, Tan Qijun, Engel David, Pongetti Francesco, De Cesare Dario, Hwang Dongseong, Yu Lily, Pullman Jennifer, Narayanan Srini, Levin Kyle, Gopal Siddharth, Li Megan, Aharoni Asaf, Trinh Trieu, Lo Jessica, Casagrande Norman, Vij Roopali, Matthey Loic, Ramadhana Bramandia, 
Matthews Austin, Carey Cj, Johnson Matthew, Goranova Kremena, Shah Rohin, Ashraf Shereen, Dasgupta Kingshuk, Larsen Rasmus, Wang Yicheng, Vuyyuru Manish Reddy, Jiang Chong, Ijazi Joana, Osawa Kazuki, Smith Celine, Boppana Ramya Sree, Bilal Taylan, Koizumi Yuma, Xu Ying, Altun Yasemin, Shabat Nir, Bariach Ben, Korchemniy Alex, Choo Kiam, Ronneberger Olaf, Iwuanyanwu Chimezie, Zhao Shubin, Soergel David, Hsieh Cho-jui, Cai Irene, Iqbal Shariq, Sundermeyer Martin, Chen Zhe, Bursztein Elie, Malaviya Chaitanya, Biadsy Fadi, Shroff Prakash, Dhillon Inderjit, Latkar Tejasi, Dyer Chris, Forbes Hannah, Nicosia Massimo, Nikolaev Vitaly, Greene Somer, Georgiev Marin, Wang Pidong, Martin Nina, Sedghi Hanie, Zhang John, Banzal Praseem, Fritz Doug, Rao Vikram, Wang Xuezhi, Zhang Jiageng, Patraucean Viorica, Du Dayou, Mordatch Igor, Jurin Ivan, Liu Lewis, Dubey Ayush, Mohan Abhi, Nowakowski Janek, Ion Vlad-doru, Wei Nan, Tojo Reiko, Raad Maria Abi, Hudson Drew A., Keshava Vaishakh, Agrawal Shubham, Ramirez Kevin, Wu Zhichun, Nguyen Hoang, Liu Ji, Sewak Madhavi, Petrini Bryce, Choi Donghyun, Philips Ivan, Wang Ziyue, Bica Ioana, Garg Ankush, Wilkiewicz Jarek, Agrawal Priyanka, Li Xiaowei, Guo Danhao, Xue Emily, Shaik Naseer, Leach Andrew, Khan Sadh Mnm, Wiesinger Julia, Jerome Sammy, Chakladar Abhishek, Wang Alek Wenjiao, Ornduff Tina, Abu Folake, Ghaffarkhah Alireza, Wainwright Marcus, Cortes Mario, Liu Frederick, Maynez Joshua, Terzis Andreas, Samangouei Pouya, Mansour Riham, Kępa Tomasz, Aubet François-xavier, Algymr Anton, Banica Dan, Weisz Agoston, Orban Andras, Senges Alexandre, Andrejczuk Ewa, Geller Mark, Santo Niccolo Dal, Anklin Valentin, Merey Majd Al, Baeuml Martin, Strohman Trevor, Bai Junwen, Petrov Slav, Wu Yonghui, Hassabis Demis, Kavukcuoglu Koray, Dean Jeffrey, Vinyals Oriol
- Exploring The Frontier Of Vision-language Models: A Survey Of Current Methodologies And Future Directions Ghosh Akash, Acharya Arkadeep, Saha Sriparna, Jain Vinija, Chadha Aman
- Feedback-aligned Mixed Llms For Machine Language-molecule Translation Gkoumas Dimitris, Liakata Maria
- Transforming Llms Into Cross-modal And Cross-lingual Retrieval Systems Gomez Frank Palma, Sanabria Ramon, Sung Yun-hsuan, Cer Daniel, Dalmia Siddharth, Abrego Gustavo Hernandez
- Omnifusion Technical Report Goncharova Elizaveta, Razzhigaev Anton, Mikhalchuk Matvey, Kurkin Maxim, Abdullaeva Irina, Skripkin Matvey, Oseledets Ivan, Dimitrov Denis, Kuznetsov Andrey
- LLMC: Benchmarking Large Language Model Quantization With A Versatile Compression Toolkit Gong Ruihao, Yong Yang, Gu Shiqiao, Huang Yushi, Lv Chentao, Zhang Yunchen, Liu Xianglong, Tao Dacheng
- Multi-frame, Lightweight & Efficient Vision-language Models For Question Answering In Autonomous Driving Gopalkrishnan Akshay, Greer Ross, Trivedi Mohan
- Eyes Closed, Safety On: Protecting Multimodal Llms Via Image-to-text Transformation Gou Yunhao, Chen Kai, Liu Zhili, Hong Lanqing, Xu Hang, Li Zhenguo, Yeung Dit-yan, Kwok James T., Zhang Yu
- RWKV-CLIP: A Robust Vision-language Representation Learner Gu Tiancheng, Yang Kaicheng, An Xiang, Feng Ziyong, Liu Dongnan, Cai Weidong, Deng Jiankang
- Inquire, Interact, And Integrate: A Proactive Agent Collaborative Framework For Zero-shot Multimodal Medical Reasoning Gu Zishan, Liu Fenglin, Yin Changchang, Zhang Ping
- Probing The Robustness Of Vision-language Pretrained Models: A Multimodal Adversarial Attack Approach Guan Jiwei, Ding Tianyu, Cao Longbing, Pan Lei, Wang Chen, Zheng Xi
- Joint Embeddings For Graph Instruction Tuning Haag Aaron, Argatu Vlad, Lohse Oliver
- MERLIN: Multimodal Embedding Refinement Via Llm-based Iterative Navigation For Text-video Retrieval-rerank Pipeline Han Donghoon, Park Eunhwan, Lee Gisang, Lee Adam, Kwak Nojun
- Towards Robust Instruction Tuning On Multimodal Large Language Models Han Wei, Chen Hui, Poria Soujanya
- A Survey On Hallucination In Large Vision-language Models Liu Hanchao, Xue Wenyuan, Chen Yifei, Chen Dapeng, Zhao Xiutian, Wang Ke, Hou Liping, Li Rongjun, Peng Wei
- MA-LMM: Memory-augmented Large Multimodal Model For Long-term Video Understanding He Bo, Li Hengduo, Jang Young Kyun, Jia Menglin, Cao Xuefei, Shah Ashish, Shrivastava Abhinav, Lim Ser-nam
- Olympiadbench: A Challenging Benchmark For Promoting AGI With Olympiad-level Bilingual Multimodal Scientific Problems He Chaoqun, Luo Renjie, Bai Yuzhuo, Hu Shengding, Thai Zhen Leng, Shen Junhao, Hu Jinyi, Han Xu, Huang Yujie, Zhang Yuxiang, Liu Jie, Qi Lei, Liu Zhiyuan, Sun Maosong
- Webvoyager: Building An End-to-end Web Agent With Large Multimodal Models He Hongliang, Yao Wenlin, Ma Kaixin, Yu Wenhao, Dai Yong, Zhang Hongming, Lan Zhenzhong, Yu Dong
- POEM: Interactive Prompt Optimization For Enhancing Multimodal Reasoning Of Large Language Models He Jianben, Wang Xingbo, Liu Shiyi, Wu Guande, Silva Claudio, Qu Huamin
- Multi-modal Instruction Tuned Llms With Fine-grained Visual Perception He Junwen, Wang Yifan, Wang Lijun, Lu Huchuan, He Jun-yan, Lan Jin-peng, Luo Bin, Xie Xuansong
- Pitvqa: Image-grounded Text Embedding LLM For Visual Question Answering In Pituitary Surgery He Runlong, Xu Mengya, Das Adrito, Khan Danyal Z., Bano Sophia, Marcus Hani J., Stoyanov Danail, Clarkson Matthew J., Islam Mobarakol
- MARS: Mixture Of Auto-regressive Models For Fine-grained Text-to-image Synthesis He Wanggui, Fu Siming, Liu Mushui, Wang Xierui, Xiao Wenyi, Shu Fangxun, Wang Yi, Zhang Lei, Yu Zhelun, Li Haoyuan, Huang Ziwei, Gan Leilei, Jiang Hao
- Llms Meet Multimodal Generation And Editing: A Survey He Yingqing, Liu Zhaoyang, Chen Jingye, Tian Zeyue, Liu Hongyu, Chi Xiaowei, Liu Runtao, Yuan Ruibin, Xing Yazhou, Wang Wenhai, Dai Jifeng, Zhang Yong, Xue Wei, Liu Qifeng, Guo Yike, Chen Qifeng
- Generative Visual Instruction Tuning Hernandez Jefferson, Villegas Ruben, Ordonez Vicente
- Why Do Llava Vision-language Models Reply To Images In English? Hinck Musashi, Holtermann Carolin, Olson Matthew Lyle, Schneider Florian, Yu Sungduk, Bhiwandiwalla Anahita, Lauscher Anne, Tseng Shaoyen, Lal Vasudev
- Llava-gemma: Accelerating Multimodal Foundation Models With A Compact Language Model Hinck Musashi, Olson Matthew L., Cobbley David, Tseng Shao-yen, Lal Vasudev
- From Descriptive Richness To Bias: Unveiling The Dark Side Of Generative Image Caption Enrichment Hirota Yusuke, Hachiuma Ryo, Yang Chao-han Huck, Nakashima Yuta
- Securing Vision-language Models With A Robust Encoder Against Jailbreak And Adversarial Attacks Hossain Md Zarif, Imteaj Ahmed
- Uncovering Bias In Large Vision-language Models At Scale With Counterfactuals Howard Phillip, Fraser Kathleen C., Bhiwandiwalla Anahita, Kiritchenko Svetlana
- Multimodal Chain-of-thought Reasoning Via Chatgpt To Protect Children From Age-inappropriate Apps Hu Chuanbo, Liu Bin, Yin Minglei, Zhou Yilu, Li Xin
- A Survey On Large Language Model-based Game Agents Hu Sihao, Huang Tiansheng, Ilhan Fatih, Tekin Selim, Liu Gaowen, Kompella Ramana, Liu Ling
- Matryoshka Query Transformer For Large Vision-language Models Hu Wenbo, Dou Zi-yi, Li Liunian Harold, Kamath Amita, Peng Nanyun, Chang Kai-wei
- Listen Again And Choose The Right Answer: A New Paradigm For Automatic Speech Recognition With Large Language Models Hu Yuchen, Chen Chen, Qin Chengwei, Zhu Qiushi, Chng Eng Siong, Li Ruizhe
- An Intermediate Fusion Vit Enables Efficient Text-image Alignment In Diffusion Models Hu Zizhao, Jia Shaochong, Rostami Mohammad
- V2xum-llm: Cross-modal Video Summarization With Temporal Prompt Instruction Tuning Hua Hang, Tang Yunlong, Xu Chenliang, Luo Jiebo
- Talk Less, Interact Better: Evaluating In-context Conversational Adaptation In Multimodal Llms Hua Yilun, Artzi Yoav
- Multimodal Task Vectors Enable Many-shot Multimodal In-context Learning Huang Brandon, Mitra Chancharik, Arbelle Assaf, Karlinsky Leonid, Darrell Trevor, Herzig Roei
- LITA: Language Instructed Temporal-localization Assistant Huang De-an, Liao Shijia, Radhakrishnan Subhashree, Yin Hongxu, Molchanov Pavlo, Yu Zhiding, Kautz Jan
- Conme: Rethinking Evaluation Of Compositional Reasoning For Modern Vlms Huang Irene, Lin Wei, Mirza M. Jehanzeb, Hansen Jacob A., Doveh Sivan, Butoi Victor Ion, Herzig Roei, Arbelle Assaf, Kuehne Hilde, Darrell Trevor, Gan Chuang, Oliva Aude, Feris Rogerio, Karlinsky Leonid
- A Survey On Evaluation Of Multimodal Large Language Models Huang Jiaxing, Zhang Jingyi
- Mmevalpro: Calibrating Multimodal Benchmarks Towards Trustworthy And Efficient Evaluation Huang Jinsheng, Chen Liang, Guo Taian, Zeng Fu, Zhao Yusheng, Wu Bohan, Yuan Ye, Zhao Haozhe, Guo Zhihui, Zhang Yichi, Yuan Jingyang, Ju Wei, Liu Luchen, Liu Tianyu, Chang Baobao, Zhang Ming
- Reason3d: Searching And Reasoning 3D Segmentation Via Large Language Model Huang Kuan-chih, Li Xiangtai, Qi Lu, Yan Shuicheng, Yang Ming-hsuan
- The Solution For The 5th GCAIAC Zero-shot Referring Expression Comprehension Challenge Huang Longfei, Yu Feng, Guan Zhihao, Wan Zhonghua, Yang Yang
- Brainchat: Decoding Semantic Information From Fmri Using Vision-language Pretrained Models Huang Wanaiu
- An Empirical Study Of Llama3 Quantization: From Llms To Mllms Huang Wei, Zheng Xingyu, Ma Xudong, Qin Haotong, Lv Chengtao, Chen Hong, Luo Jie, Qi Xiaojuan, Liu Xianglong, Magno Michele
- Ml-mamba: Efficient Multi-modal Large Language Model Utilizing Mamba-2 Huang Wenjun, Pan Jiakai, Tang Jiahao, Ding Yanyu, Xing Yifei, Wang Yuhe, Wang Zhengzhuo, Hu Jianguo
- Rolora: Fine-tuning Rotated Outlier-free Llms For Effective Weight-activation Quantization Huang Xijie, Liu Zechun, Liu Shih-yang, Cheng Kwang-ting
- Olympicarena: Benchmarking Multi-discipline Cognitive Reasoning For Superintelligent AI Huang Zhen, Wang Zengzhi, Xia Shijie, Li Xuefeng, Zou Haoyang, Xu Ruijie, Fan Run-ze, Ye Lyumanshan, Chern Ethan, Ye Yixin, Zhang Yikai, Yang Yuqing, Wu Ting, Wang Binjie, Sun Shichao, Xiao Yang, Li Yiyuan, Zhou Fan, Chern Steffi, Qin Yiwei, Ma Yan, Su Jiadi, Liu Yixiu, Zheng Yuxiang, Zhang Shaoting, Lin Dahua, Qiao Yu, Liu Pengfei
- Accelerating Pre-training Of Multimodal Llms Via Chain-of-sight Huang Ziyuan, Ji Kaixiang, Gong Biao, Qing Zhiwu, Zhang Qinglong, Zheng Kecheng, Wang Jian, Chen Jingdong, Yang Ming
- Self-introspective Decoding: Alleviating Hallucinations For Large Vision-language Models Huo Fushuo, Xu Wenchao, Zhang Zhong, Wang Haozhao, Chen Zhicheng, Zhao Peilin
- Test-time Low Rank Adaptation Via Confidence Maximization For Zero-shot Generalization Of Vision-language Models Imam Raza, Gani Hanan, Huzaifa Muhammad, Nandakumar Karthik
- Are Large Vision Language Models Up To The Challenge Of Chart Comprehension And Reasoning? An Extensive Investigation Into The Capabilities And Limitations Of Lvlms Islam Mohammed Saidul, Rahman Raian, Masry Ahmed, Laskar Md Tahmid Rahman, Nayeem Mir Tafseer, Hoque Enamul
- Memeguard: An LLM And Vlm-based Framework For Advancing Content Moderation Via Meme Intervention Jha Prince, Jain Raghav, Mandal Konika, Chadha Aman, Saha Sriparna, Bhattacharyya Pushpak
- Precision Empowers, Excess Distracts: Visual Question Answering With Dynamically Infused Knowledge In Language Models Jhalani Manas, M Annervaz K, Bhattacharyya Pushpak
- Towards Flexible Evaluation For Generative Visual Question Answering Ji Huishan, Si Qingyi, Lin Zheng, Wang Weiping
- Large Visual-language Models Are Also Good Classifiers: A Study Of In-context Multimodal Fake News Detection Jiang Ye, Wang Yimin
- Prior Knowledge Integration Via LLM Encoding And Pseudo Event Regulation For Video Moment Retrieval Jiang Yiyang, Zhang Wengyu, Zhang Xulu, Wei Xiaoyong, Chen Chang Wen, Li Qing
- Hidden Flaws Behind Expert-level Accuracy Of Multimodal GPT-4 Vision In Medicine Jin Qiao, Chen Fangyuan, Zhou Yiliang, Xu Ziyang, Cheung Justin M., Chen Robert, Summers Ronald M., Rousseau Justin F., Ni Peiyun, Landsman Marc J., Baxter Sally L., Al'aref Subhi J., Li Yijia, Chen Alex, Brejt Josef A., Chiang Michael F., Peng Yifan, Lu Zhiyong
- Reasoning Grasping Via Multimodal Large Language Model Jin Shiyu, Xu Jinxuan, Lei Yutian, Zhang Liangjun
- Video-lavit: Unified Video-language Pre-training With Decoupled Visual-motional Tokenization Jin Yang, Sun Zhicheng, Xu Kun, Xu Kun, Chen Liwei, Jiang Hao, Huang Quzhe, Song Chengru, Liu Yuliang, Zhang Di, Song Yang, Gai Kun, Mu Yadong
- Read To Play (r2-play): Decision Transformer With Multimodal Game Instruction Jin Yonggang, Zhang Ge, Zhao Hao, Zheng Tianyu, Guo Jarvi, Xiang Liuyu, Yue Shawn, Huang Stephen W., He Zhaofeng, Fu Jie
- Logical Closed Loop: Uncovering Object Hallucinations In Large Vision-language Models Wu Junfei, Liu Qiang, Wang Ding, Zhang Jinghao, Wu Shu, Wang Liang, Tan Tieniu
- CODE: Contrasting Self-generated Description To Combat Hallucination In Large Multi-modal Models Kim Junho, Kim Hyunjun, Kim Yeonju, Ro Yong Man
- What If...?: Thinking Counterfactual Keywords Helps To Mitigate Hallucination In Large Multi-modal Models Kim Junho, Kim Yeon Ju, Ro Yong Man
- RAP: Retrieval-augmented Planning With Contextual Memory For Multimodal LLM Agents Kagaya Tomoyuki, Yuan Thong Jing, Lou Yuxuan, Karlekar Jayashree, Pranata Sugiri, Kinose Akira, Oguri Koki, Wick Felix, You Yang
- Understanding The Effect Of Using Semantically Meaningful Tokens For Visual Representation Learning Kalibhat Neha, Kattakinda Priyatham, Zarei Arman, Seleznev Nikita, Sharpe Samuel, Kumar Senthil, Feizi Soheil
- Visualizing Dialogues: Enhancing Image Selection Through Dialogue Understanding With Large Language Models Kao Chang-sheng, Chen Yun-nung
- Omniact: A Dataset And Benchmark For Enabling Multimodal Generalist Autonomous Agents For Desktop And Web Kapoor Raghav, Butala Yash Parag, Russak Melisa, Koh Jing Yu, Kamble Kiran, Alshikh Waseem, Salakhutdinov Ruslan
- BRAVE: Broadening The Visual Encoding Of Vision-language Models Kar Oğuzhan Fatih, Tonioni Alessio, Poklukar Petra, Kulshrestha Achin, Zamir Amir, Tombari Federico
- Visiongpt: Vision-language Understanding Agent Using Generalized Multimodal Framework Kelly Chris, Hu Luhui, Yang Bang, Tian Yu, Yang Deshun, Yang Cindy, Huang Zaoshan, Li Zihao, Hu Jiayin, Zou Yuexian
- On Scalable Oversight With Weak Llms Judging Strong Llms Kenton Zachary, Siegel Noah Y., Kramár János, Brown-cohen Jonah, Albanie Samuel, Bulian Jannis, Agarwal Rishabh, Lindner David, Tang Yunhao, Goodman Noah D., Shah Rohin
- Self-training Large Language Models For Improved Visual Program Synthesis With Visual Reinforcement Khan Zaid, Bg Vijay Kumar, Schulter Samuel, Fu Yun, Chandraker Manmohan
- Consistency And Uncertainty: Identifying Unreliable Responses From Black-box Vision-language Models For Selective Visual Question Answering Khan Zaid, Fu Yun
- Learning To Prompt With Text Only Supervision For Vision-language Models Khattak Muhammad Uzair, Naeem Muhammad Ferjad, Naseer Muzammal, Van Gool Luc, Tombari Federico
- Integrating Paralinguistics In Speech-empowered Large Language Models For Natural Conversation Kim Heeseung, Seo Soonshin, Jeong Kyeongseok, Kwon Ohsung, Kim Soyoon, Kim Jungwhan, Lee Jaehong, Song Eunwoo, Oh Myungwoo, Ha Jung-woo, Yoon Sungroh, Yoo Kang Min
- Fine-tuning CLIP Text Encoders With Two-step Paraphrasing Kim Hyunjae, Yoon Seunghyun, Bui Trung, Zhao Handong, Tran Quan, Dernoncourt Franck, Kang Jaewoo
- Finer: Investigating And Enhancing Fine-grained Visual Concept Recognition In Large Vision Language Models Kim Jeonghwan, Ji Heng
- Openvla: An Open-source Vision-language-action Model Kim Moo Jin, Pertsch Karl, Karamcheti Siddharth, Xiao Ted, Balakrishna Ashwin, Nair Suraj, Rafailov Rafael, Foster Ethan, Lam Grace, Sanketi Pannag, Vuong Quan, Kollar Thomas, Burchfiel Benjamin, Tedrake Russ, Sadigh Dorsa, Levine Sergey, Liang Percy, Finn Chelsea
- A Survey On Integration Of Large Language Models With Intelligent Robots Kim Yeseung, Kim Dohyun, Choi Jieun, Park Jisang, Oh Nayoung, Park Daehyung
- Uncertainty-aware Evaluation For Vision-language Models Kostumov Vasily, Nutfullin Bulat, Pilipenko Oleg, Ilyushin Eugene
- Can Better Text Semantics In Prompt Tuning Improve VLM Generalization? Kuchibhotla Hari Chandana, Kancheti Sai Srinivas, Reddy Abbavaram Gowtham, Balasubramanian Vineeth N
- Multi-modal Generation Via Cross-modal In-context Learning Kumar Amandeep, Naseer Muzammal, Narayan Sanath, Anwer Rao Muhammad, Khan Salman, Cholakkal Hisham
- Residual-based Language Models Are Free Boosters For Biomedical Imaging Lai Zhixin, Wu Jing, Chen Suiyao, Zhou Yucheng, Hovakimyan Naira
- IBD: Alleviating Hallucinations In Large Vision-language Models Via Image-biased Decoding Zhu Lanyun, Ji Deyi, Chen Tianrun, Xu Peng, Ye Jieping, Liu Jun
- What Matters When Building Vision-language Models? Laurençon Hugo, Tronchon Léo, Cord Matthieu, Sanh Victor
- Frontier AI Ethics: Anticipating And Evaluating The Societal Impacts Of Generative Agents Lazar Seth
- Sentiment Reasoning For Healthcare Le-duc Khai, Nguyen Khai-nguyen, Tat Bach Phan, Le Duy, Ngo Jerry, Vo-dang Long, Nguyen Anh Totti, Hy Truong-son
- Multimodal Reasoning With Multimodal Knowledge Graph Lee Junlin, Wang Yequan, Li Jing, Zhang Min
- Visual Question Answering Instruction: Unlocking Multimodal Large Language Model To Domain-specific Visual Multitasks Lee Jusung, Cha Sungguk, Lee Younghyun, Yang Cheoljong
- Llava-docent: Instruction Tuning With Multimodal Large Language Model To Support Art Appreciation Education Lee Unggi, Jeon Minji, Lee Yunseo, Byun Gyuri, Son Yoorim, Shin Jaeyoon, Ko Hongkyu, Kim Hyeoncheol
- Are Bigger Encoders Always Better In Vision Large Models? Li Bozhou, Liang Hao, Meng Zimo, Zhang Wentao
- Searchlvlms: A Plug-and-play Framework For Augmenting Large Vision-language Models By Searching Up-to-date Internet Knowledge Li Chuanhao, Li Zhen, Jing Chenchen, Liu Shuo, Shao Wenqi, Wu Yuwei, Luo Ping, Qiao Yu, Zhang Kaipeng
- Chemvlm: Exploring The Power Of Multimodal Large Language Models In Chemistry Area Li Junxian, Zhang Di, Wang Xunzhi, Hao Zeying, Lei Jingdi, Tan Qian, Zhou Cai, Liu Wei, Yang Yaotian, Xiong Xinrui, Wang Weiyun, Chen Zhe, Wang Wenhai, Li Wei, Zhang Shufei, Su Mao, Ouyang Wanli, Li Yuqiang, Zhou Dongzhan
- Red Teaming Visual Language Models Li Mukai, Li Lei, Yin Yuwei, Ahmed Masood, Liu Zhenguang, Liu Qi
- FSMR: A Feature Swapping Multi-modal Reasoning Approach With Joint Textual And Visual Clues Li Shuang, Wang Jiahua, Wen Lijie
- Temporal Insight Enhancement: Mitigating Temporal Hallucination In Multimodal Large Language Models Sun Li, Wang Liuan, Sun Jun, Okatani Takayuki
- TOPA: Extend Large Language Models For Video Understanding Via Text-only Pre-alignment Li Wei, Fan Hehe, Wong Yongkang, Kankanhalli Mohan, Yang Yi
- Tokenpacker: Efficient Visual Projector For Multimodal LLM Li Wentong, Yuan Yuqian, Liu Jian, Tang Dongqi, Wang Song, Qin Jie, Zhu Jianke, Zhang Lei
- MM-TTS: A Unified Framework For Multimodal, Prompt-induced Emotional Text-to-speech Synthesis Li Xiang, Cheng Zhi-qi, He Jun-yan, Peng Xiaojiang, Hauptmann Alexander G.
- Llara: Supercharging Robot Learning Data For Vision-language Policy Li Xiang, Mata Cristina, Park Jongwoo, Kahatapitiya Kumara, Jang Yoo Sung, Shang Jinghuan, Ranasinghe Kanchana, Burgert Ryan, Cai Mu, Lee Yong Jae, Ryoo Michael S.
- What If We Recaption Billions Of Web Images With Llama-3? Li Xianhang, Tu Haoqin, Hui Mude, Wang Zeyu, Zhao Bingchen, Xiao Junfei, Ren Sucheng, Mei Jieru, Liu Qing, Zheng Huangjie, Zhou Yuyin, Xie Cihang
- Cognitive Visual-language Mapper: Advancing Multimodal Comprehension With Enhanced Visual Knowledge Alignment Li Yunxin, Chen Xinyu, Hu Baotian, Shi Haoyuan, Zhang Min
- Uni-moe: Scaling Unified Multimodal Llms With Mixture Of Experts Li Yunxin, Jiang Shenyuan, Hu Baotian, Wang Longyue, Zhong Wanqi, Luo Wenhan, Ma Lin, Zhang Min
- Mmsci: A Multimodal Multi-discipline Dataset For Phd-level Scientific Comprehension Li Zekun, Yang Xianjun, Choi Kyuri, Zhu Wanrong, Hsieh Ryan, Kim Hyeonjung, Lim Jin Hyuk, Ji Sungyoung, Lee Byungju, Yan Xifeng, Petzold Linda Ruth, Wilson Stephen D., Lim Woosang, Wang William Yang
- Do Vision-language Transformers Exhibit Visual Commonsense? An Empirical Study Of VCR Li Zhenyang, Guo Yangyang, Wang Kejie, Chen Xiaolin, Nie Liqiang, Kankanhalli Mohan
- EAGLE: Elevating Geometric Reasoning Through Llm-empowered Visual Instruction Tuning Li Zhihao, Du Yao, Liu Yang, Zhang Yan, Liu Yufang, Zhang Mengdi, Cai Xunliang
- Visualization Literacy Of Multimodal Large Language Models: A Comparative Study Li Zhimin, Miao Haichao, Pascucci Valerio, Liu Shusen
- Hunyuan-dit: A Powerful Multi-resolution Diffusion Transformer With Fine-grained Chinese Understanding Li Zhimin, Zhang Jianwei, Lin Qin, Xiong Jiangfeng, Long Yanxin, Deng Xinchi, Zhang Yingfang, Liu Xingchao, Huang Minbin, Xiao Zedong, Chen Dayou, He Jiajun, Li Jiahao, Li Wenyue, Zhang Chen, Quan Rongwei, Lu Jianxiang, Huang Jiabin, Yuan Xiaoyan, Zheng Xiaoxiao, Li Yixuan, Zhang Jihong, Zhang Chao, Chen Meng, Liu Jie, Fang Zheng, Wang Weiyan, Xue Jinbao, Tao Yangyu, Zhu Jianchen, Liu Kai, Lin Sihuan, Sun Yifu, Li Yun, Wang Dongdong, Chen Mingtao, Hu Zhichao, Xiao Xiao, Chen Yan, Liu Yuhong, Liu Wei, Wang Di, Yang Yong, Jiang Jie, Lu Qinglin
- Cog-ga: A Large Language Models-based Generative Agent For Vision-language Navigation In Continuous Environments Li Zhiyuan, Lu Yanfeng, Mu Yao, Qiao Hong
- Vl-trojan: Multimodal Instruction Backdoor Attacks Against Autoregressive Visual Language Models Liang Jiawei, Liang Siyuan, Luo Man, Liu Aishan, Han Dongchen, Chang Ee-chien, Cao Xiaochun
- Bytecomposer: A Human-like Melody Composition Method Based On Language Model Agent Liang Xia, Du Xingjian, Lin Jiaju, Zou Pei, Wan Yuan, Zhu Bilei
- Mitigating Hallucination In Visual-language Models Via Re-balancing Contrastive Decoding Liang Xiaoyu, Yu Jiayuan, Mu Lianrui, Zhuang Jiedong, Hu Jiaqi, Yang Yuchen, Ye Jiangnan, Lu Lu, Chen Jian, Hu Haoji
- Instructlayout: Instruction-driven 2D And 3D Layout Synthesis With Semantic Graph Prior Lin Chenguo, Lin Yuchen, Pan Panwang, Zhang Xuanyang, Mu Yadong
- Designprobe: A Graphic Design Benchmark For Multimodal Large Language Models Lin Jieru, Huang Danqing, Zhao Tiejun, Zhan Dechen, Lin Chin-yew
- Draw-and-understand: Leveraging Visual Prompts To Enable Mllms To Comprehend What You Want Lin Weifeng, Wei Xinyu, An Ruichuan, Gao Peng, Zou Bocheng, Luo Yulin, Huang Siyuan, Zhang Shanghang, Li Hongsheng
- Improving Visual Storytelling With Multimodal Large Language Models Lin Xiaochuan, Chen Xiangyong
- Rethinking Visual Prompting For Multimodal Large Language Models With External Knowledge Lin Yuanze, Li Yunsheng, Chen Dongdong, Xu Weijian, Clark Ronald, Torr Philip, Yuan Lu
- Boosting Multimodal Large Language Models With Visual Tokens Withdrawal For Rapid Inference Lin Zhihang, Lin Mingbao, Lin Luxi, Ji Rongrong
- Evaluating Text-to-visual Generation With Image-to-text Generation Lin Zhiqiu, Pathak Deepak, Li Baiqi, Li Jiayao, Xia Xide, Neubig Graham, Zhang Pengchuan, Ramanan Deva
- SPHINX-X: Scaling Data And Parameters For A Family Of Multi-modal Large Language Models Liu Dongyang, Zhang Renrui, Qiu Longtian, Huang Siyuan, Lin Weifeng, Zhao Shitian, Geng Shijie, Lin Ziyi, Jin Peng, Zhang Kaipeng, Shao Wenqi, Xu Chao, He Conghui, He Junjun, Shao Hao, Lu Pan, Li Hongsheng, Qiao Yu, Gao Peng
- Lumina-mgpt: Illuminate Flexible Photorealistic Text-to-image Generation With Multimodal Generative Pretraining Liu Dongyang, Zhao Shitian, Zhuo Le, Lin Weifeng, Qiao Yu, Li Hongsheng, Gao Peng
- Pefomed: Parameter Efficient Fine-tuning Of Multimodal Large Language Models For Medical Imaging Liu Gang, He Jinlong, Li Pengfei, He Genrong, Chen Zhaolin, Zhong Shenjun
- On The Intrinsic Self-correction Capability Of Llms: Uncertainty And Latent Concept Liu Guangliang, Mao Haitao, Cao Bochuan, Xue Zhiyu, Johnson Kristen, Tang Jiliang, Wang Rongrong
- Mmgrec: Multimodal Generative Recommendation With Transformer Model Liu Han, Wei Yinwei, Song Xuemeng, Guan Weili, Li Yuan-fang, Nie Liqiang
- Mibench: Evaluating Multimodal Large Language Models Over Multiple Images Liu Haowei, Zhang Xi, Xu Haiyang, Shi Yaya, Jiang Chaoya, Yan Ming, Zhang Ji, Huang Fei, Yuan Chunfeng, Li Bing, Hu Weiming
- Kangaroo: A Powerful Video-language Model Supporting Long-context Video Input Liu Jiajun, Wang Yibing, Ma Hanghang, Wu Xiaoping, Ma Xiaoqi, Wei Xiaoming, Jiao Jianbin, Wu Enhua, Hu Jie
- Mm-instruct: Generated Visual Instructions For Large Multimodal Model Alignment Liu Jihao, Huang Xin, Zheng Jinliang, Liu Boxiao, Wang Jia, Yoshie Osamu, Liu Yu, Li Hongsheng
- Visualwebbench: How Far Have Multimodal Llms Evolved In Web Page Understanding And Grounding? Liu Junpeng, Song Yifan, Lin Bill Yuchen, Lam Wai, Neubig Graham, Li Yuanzhi, Yue Xiang
- LLM4GEN: Leveraging Semantic Representation Of Llms For Text-to-image Generation Liu Mushui, Ma Yuhang, Zhen Yang, Dan Jun, Yu Yunlong, Zhao Zeng, Hu Zhipeng, Liu Bai, Fan Changjie
- Generative Expressive Conversational Speech Synthesis Liu Rui, Hu Yifan, Ren Yi, Yin Xiang, Li Haizhou
- Cmm-math: A Chinese Multimodal Math Dataset To Evaluate And Enhance The Mathematics Reasoning Of Large Multimodal Models Liu Wentao, Pan Qianjun, Zhang Yi, Liu Zhuo, Wu Ji, Zhou Jie, Zhou Aimin, Chen Qin, Jiang Bo, He Liang
- Harnessing Large Language Models For Multimodal Product Bundling Liu Xiaohao, Wu Jie, Tao Zhulin, Ma Yunshan, Wei Yinwei, Chua Tat-seng
- Arondight: Red Teaming Large Vision Language Models With Auto-generated Multi-modal Jailbreak Prompts Liu Yi, Cai Chengjun, Zhang Xiaoli, Yuan Xingliang, Wang Cong
- Synthvlm: High-efficiency And High-quality Synthetic Data For Vision Language Models Liu Zheng, Liang Hao, Huang Xijie, Xiong Wentao, Yu Qinhan, Sun Linzhuang, Chen Chong, He Conghui, Cui Bin, Zhang Wentao
- Less Is More: Data Value Estimation For Visual Instruction Tuning Liu Zikang, Zhou Kun, Zhao Wayne Xin, Gao Dawei, Li Yaliang, Wen Ji-rong
- Chain-of-spot: Interactive Reasoning Improves Large Vision-language Models Liu Zuyan, Dong Yuhao, Rao Yongming, Zhou Jie, Lu Jiwen
- Generative Multi-modal Knowledge Retrieval With Large Language Models Long Xinwei, Zeng Jiali, Meng Fandong, Ma Zhiyuan, Zhang Kaiyan, Zhou Bowen, Zhou Jie
- Deepseek-vl: Towards Real-world Vision-language Understanding Lu Haoyu, Liu Wen, Zhang Bo, Wang Bingxuan, Dong Kai, Liu Bo, Sun Jingxiang, Ren Tongzheng, Li Zhuoshu, Yang Hao, Sun Yaofeng, Deng Chengqi, Xu Hanwei, Xie Zhenda, Ruan Chong
- Multimodal Large Language Model Driven Scenario Testing For Autonomous Vehicles Lu Qiujing, Wang Xuanhan, Jiang Yiwei, Zhao Guangming, Ma Mingyue, Feng Shuo
- Ovis: Structural Embedding Alignment For Multimodal Large Language Model Lu Shiyin, Li Yang, Chen Qing-guo, Xu Zhao, Luo Weihua, Zhang Kaifu, Ye Han-jia
- Wildvision: Evaluating Vision-language Models In The Wild With Human Preferences Lu Yujie, Jiang Dongfu, Chen Wenhu, Wang William Yang, Choi Yejin, Lin Bill Yuchen
- Layoutllm: Layout Instruction Tuning With Large Language Models For Document Understanding Luo Chuwei, Shen Yufan, Zhu Zhaoqing, Zheng Qi, Yu Zhi, Yao Cong
- Autom3l: An Automated Multimodal Machine Learning Framework With Large Language Models Luo Daqin, Feng Chengjian, Nong Yuxuan, Shen Yiqing
- M3GPT: An Advanced Multimodal, Multitask Framework For Motion Comprehension And Generation Luo Mingshuang, Hou Ruibing, Chang Hong, Liu Zimo, Wang Yaowei, Shan Shiguang
- DEEM: Diffusion Models Serve As The Eyes Of Large Language Models For Image Perception Luo Run, Li Yunshui, Chen Longze, He Wanwei, Lin Ting-en, Liu Ziqiang, Zhang Lei, Song Zikai, Xia Xiaobo, Liu Tongliang, Yang Min, Hui Binyuan
- X-reflect: Cross-reflection Prompting For Multimodal Recommendation Lyu Hanjia, Rossi Ryan, Chen Xiang, Tanjim Md Mehrab, Petrangeli Stefano, Sarkhel Somdeb, Luo Jiebo
- Weblinx: Real-world Website Navigation With Multi-turn Dialogue Lù Xing Han, Kasner Zdeněk, Reddy Siva
- Multi-modal Generative Embedding Model Ma Feipeng, Xue Hongwei, Wang Guangting, Zhou Yizhou, Rao Fengyun, Yan Shilin, Zhang Yueyi, Wu Siying, Shou Mike Zheng, Sun Xiaoyan
- EE-MLLM: A Data-efficient And Compute-efficient Multimodal Large Language Model Ma Feipeng, Zhou Yizhou, Li Hebei, He Zilong, Wu Siying, Rao Fengyun, Zhang Yueyi, Sun Xiaoyan
- Eventlens: Leveraging Event-aware Pretraining And Cross-modal Linking Enhances Visual Commonsense Reasoning Ma Mingjie, Yu Zhihuan, Ma Yichao, Li Guohui
- Clawmachine: Fetching Visual Tokens As An Entity For Referring And Grounding Ma Tianren, Xie Lingxi, Tian Yunjie, Yang Boyu, Zhang Yuan, Doermann David, Ye Qixiang
- Coco-agent: A Comprehensive Cognitive MLLM Agent For Smartphone GUI Automation Ma Xinbei, Zhang Zhuosheng, Zhao Hai
- Mmlongbench-doc: Benchmarking Long-context Document Understanding With Visualizations Ma Yubo, Zang Yuhang, Chen Liangyu, Chen Meiqi, Jiao Yizhu, Li Xinze, Lu Xinyuan, Liu Ziyu, Ma Yan, Dong Xiaoyi, Zhang Pan, Pan Liangming, Jiang Yu-gang, Wang Jiaqi, Cao Yixin, Sun Aixin
- Gerea: Question-aware Prompt Captions For Knowledge-based Visual Question Answering Ma Ziyu, Li Shutao, Sun Bin, Cai Jianfei, Long Zuxiang, Ma Fuyan
- Mitigating Hallucinations In Large Vision-language Models (lvlms) Via Language-contrastive Decoding (LCD) Manevich Avshalom, Tsarfaty Reut
- MM1: Methods, Analysis & Insights From Multimodal LLM Pre-training Mckinzie Brandon, Gan Zhe, Fauconnier Jean-philippe, Dodge Sam, Zhang Bowen, Dufter Philipp, Shah Dhruti, Du Xianzhi, Peng Futang, Weers Floris, Belyi Anton, Zhang Haotian, Singh Karanjeet, Kang Doug, Jain Ankur, Hè Hongyu, Schwarzer Max, Gunter Tom, Kong Xiang, Zhang Aonan, Wang Jianyu, Wang Chong, Du Nan, Lei Tao, Wiseman Sam, Yin Guoli, Lee Mark, Wang Zirui, Pang Ruoming, Grasch Peter, Toshev Alexander, Yang Yinfei
- Deepstack: Deeply Stacking Visual Tokens Is Surprisingly Simple And Effective For Lmms Meng Lingchen, Yang Jianwei, Tian Rui, Dai Xiyang, Wu Zuxuan, Gao Jianfeng, Jiang Yu-gang
- A Survey Of Resource-efficient LLM And Multimodal Foundation Models Mengwei Xu, Wangsong Yin, Dongqi Cai, Rongjie Yi, Daliang Xu, Qipeng Wang, Bingyang Wu, Yihao Zhao, Chen Yang, Shihe Wang, Qiyang Zhang, Zhenyan Lu, Li Zhang, Shangguang Wang, Yuanchun Li, Yunxin Liu, Xin Jin, Xuanzhe Liu
- A Philosophical Introduction To Language Models - Part II: The Way Forward Millière Raphaël, Buckner Cameron
- Evaluating And Analyzing Relationship Hallucinations In Large Vision-language Models Mingrui Wu, Jiayi Ji, Oucheng Huang, Jiale Li, Yuhang Wu, Xiaoshuai Sun, Rongrong Ji
- PSLM: Parallel Generation Of Text And Speech With Llms For Low-latency Spoken Dialogue Systems Mitsui Kentaro, Mitsuda Koh, Wakatsuki Toshiaki, Hono Yukiya, Sawada Kei
- Multimodal Contrastive In-context Learning Miyanishi Yosuke, Nguyen Minh Le
- LSPT: Long-term Spatial Prompt Tuning For Visual Representation Learning Mo Shentong, Wang Yansen, Luo Xufang, Li Dongsheng
- Kam-cot: Knowledge Augmented Multimodal Chain-of-thoughts Reasoning Mondal Debjyoti, Modi Suraj, Panda Subhadarshi, Singh Rituraj, Rao Godawari Sudhakar
- Lhrs-bot: Empowering Remote Sensing With Vgi-enhanced Large Multimodal Language Model Muhtar Dilxat, Li Zhenshi, Gu Feng, Zhang Xueliang, Xiao Pengfeng
- Zero-shot Visual Reasoning By Vision-language Models: Benchmarking And Analysis Nagar Aishik, Jaiswal Shantanu, Tan Cheston
- Beyond The Hype: A Dispassionate Look At Vision-language Models In Medical Scenario Nan Yang, Zhou Huichi, Xing Xiaodan, Yang Guang
- Dude: Dual Distribution-aware Context Prompt Learning For Large Vision-language Model Nguyen Duy M. H., Le An T., Nguyen Trung Q., Diep Nghiem T., Nguyen Tai, Duong-tran Duy, Peters Jan, Shen Li, Niepert Mathias, Sonntag Daniel
- Mobileflow: A Multimodal LLM For Mobile GUI Agent Nong Songqin, Zhu Jiali, Wu Rui, Jin Jiongchao, Shan Shuo, Huang Xiutian, Xu Wenhao
- Generative AI In EU Law: Liability, Privacy, Intellectual Property, And Cybersecurity Novelli Claudio, Casolari Federico, Hacker Philipp, Spedicato Giorgio, Floridi Luciano
- The Conversation Is The Command: Interacting With Real-world Autonomous Robot Through Natural Language Nwankwo Linus, Rueckert Elmar
- Jdocqa: Japanese Document Question Answering Dataset For Generative Language Models Onami Eri, Kurita Shuhei, Miyanishi Taiki, Watanabe Taro
- Prompting Large Vision-language Models For Compositional Reasoning Ossowski Timothy, Jiang Ming, Hu Junjie
- Gemini Goes To Med School: Exploring The Capabilities Of Multimodal Large Language Models On Medical Challenge Problems & Hallucinations Pal Ankit, Sankarasubbu Malaikannan
- Auto-encoding Morph-tokens For Multimodal LLM Pan Kaihang, Tang Siliang, Li Juncheng, Fan Zhaoyu, Chow Wei, Yan Shuicheng, Chua Tat-seng, Zhuang Yueting, Zhang Hanwang
- Shaking Up Vlms: Comparing Transformers And Structured State Space Models For Vision & Language Modeling Pantazopoulos Georgios, Nikandrou Malvina, Suglia Alessandro, Lemon Oliver, Eshghi Arash
- Learning To See But Forgetting To Follow: Visual Instruction Tuning Makes Llms More Prone To Jailbreak Attacks Pantazopoulos Georgios, Parekh Amit, Nikandrou Malvina, Suglia Alessandro
- Do Vision & Language Decoders Use Images And Text Equally? How Self-consistent Are Their Explanations? Parcalabescu Letitia, Frank Anette
- Picturing Ambiguity: A Visual Twist On The Winograd Schema Challenge Park Brendan, Janecek Madeline, Ezzati-jivan Naser, Li Yifeng, Emami Ali
- Bridging Vision And Language Spaces With Assignment Prediction Park Jungin, Lee Jiyoung, Sohn Kwanghoon
- The Ultimate Guide To Fine-tuning Llms From Basics To Breakthroughs: An Exhaustive Review Of Technologies, Research, Best Practices, Applied Research Challenges And Opportunities Parthasarathy Venkatesh Balavadhani, Zafar Ahtsham, Khan Aafaq, Shahid Arsalan
- Autonomous Workflow For Multimodal Fine-grained Training Assistants Towards Mixed Reality Pei Jiahuan, Viola Irene, Huang Haochen, Wang Junxiao, Ahsan Moonisa, Ye Fanghua, Yiming Jiang, Sai Yao, Wang Di, Chen Zhumin, Ren Pengjie, Cesar Pablo
- Learnable In-context Vector For Visual Question Answering Peng Yingzhe, Hao Chenduo, Yang Xu, Peng Jiawei, Hu Xinting, Geng Xin
- Dreambench++: A Human-aligned Benchmark For Personalized Image Generation Peng Yuang, Cui Yuxin, Tang Haomiao, Qi Zekun, Dong Runpei, Bai Jing, Han Chunrui, Ge Zheng, Zhang Xiangyu, Xia Shu-tao
- Mllm-protector: Ensuring Mllm's Safety Without Hurting Performance Pi Renjie, Han Tianyang, Zhang Jianshu, Xie Yueqi, Pan Rui, Lian Qing, Dong Hanze, Zhang Jipeng, Zhang Tong
- Interactive Continual Learning: Fast And Slow Thinking Qi Biqing, Chen Xingquan, Gao Junqi, Li Dong, Liu Jianxing, Wu Ligang, Zhou Bowen
- SNIFFER: Multimodal Large Language Model For Explainable Out-of-context Misinformation Detection Qi Peng, Yan Zehong, Hsu Wynne, Lee Mong Li
- Mia-bench: Towards Better Instruction Following Evaluation Of Multimodal Llms Qian Yusu, Ye Hanrong, Fauconnier Jean-philippe, Grasch Peter, Yang Yinfei, Gan Zhe
- How Easy Is It To Fool Your Multimodal Llms? An Empirical Analysis On Deceptive Prompts Qian Yusu, Zhang Haotian, Yang Yinfei, Gan Zhe
- Vl-mamba: Exploring State Space Models For Multimodal Learning Qiao Yanyuan, Yu Zheng, Guo Longteng, Chen Sihan, Zhao Zijia, Sun Mingzhen, Wu Qi, Liu Jing
- Prism: A Framework For Decoupling And Assessing The Capabilities Of Vlms Qiao Yuxuan, Duan Haodong, Fang Xinyu, Yang Junming, Chen Lin, Zhang Songyang, Wang Jiaqi, Lin Dahua, Chen Kai
- Snapntell: Enhancing Entity-centric Visual Question Answering With Retrieval Augmented Multimodal LLM Qiu Jielin, Madotto Andrea, Lin Zhaojiang, Crook Paul A., Xu Yifan Ethan, Dong Xin Luna, Faloutsos Christos, Li Lei, Damavandi Babak, Moon Seungwhan
- V-zen: Efficient GUI Understanding And Precise Grounding With A Novel Multimodal LLM Rahman Abdur, Chawla Rajat, Kumar Muskaan, Datta Arkajit, Jha Adarsh, Ns Mukunda, Bhola Ishaan
- GSR-BENCH: A Benchmark For Grounded Spatial Reasoning Evaluation Via Multimodal Llms Rajabi Navid, Kosecka Jana
- Cataractbot: An Llm-powered Expert-in-the-loop Chatbot For Cataract Patients Ramjee Pragnya, Sachdeva Bhuvan, Golechha Satvik, Kulkarni Shreyas, Fulari Geeta, Murali Kaushik, Jain Mohit
- Learning To Localize Objects Improves Spatial Reasoning In Visual-llms Ranasinghe Kanchana, Shukla Satya Narayan, Poursaeed Omid, Ryoo Michael S., Lin Tsung-yu
- Visual Hallucination: Definition, Quantification, And Prescriptive Remediations Rani Anku, Rawte Vipula, Sharma Harshad, Anand Neeraj, Rajbangshi Krishnav, Sheth Amit, Das Amitava
- RAVEN: Multitask Retrieval Augmented Vision-language Learning Rao Varun Nagaraj, Choudhary Siddharth, Deshpande Aditya, Satzoda Ravi Kumar, Appalaraju Srikar
- Cinepile: A Long Video Question Answering Dataset And Benchmark Rawal Ruchit, Saifullah Khalid, Basri Ronen, Jacobs David, Somepalli Gowthami, Goldstein Tom
- Strengthening Multimodal Large Language Model With Bootstrapped Preference Optimization Renjie Pi, Tianyang Han, Wei Xiong, Jipeng Zhang, Runtao Liu, Rui Pan, Tong Zhang
- Talking To Machines: Do You Read Me? Rojas-barahona Lina M.
- Question-instructed Visual Descriptions For Zero-shot Video Question Answering Romero David, Solorio Thamar
- Convolutional Prompting Meets Language Models For Continual Learning Roy Anurag, Moulick Riddhiman, Verma Vinay K., Ghosh Saptarshi, Das Abir
- Capabilities Of Gemini Models In Medicine Saab Khaled, Tu Tao, Weng Wei-hung, Tanno Ryutaro, Stutz David, Wulczyn Ellery, Zhang Fan, Strother Tim, Park Chunjong, Vedadi Elahe, Chaves Juanma Zambrano, Hu Szu-yeu, Schaekermann Mike, Kamath Aishwarya, Cheng Yong, Barrett David G. T., Cheung Cathy, Mustafa Basil, Palepu Anil, Mcduff Daniel, Hou Le, Golany Tomer, Liu Luyang, Alayrac Jean-baptiste, Houlsby Neil, Tomasev Nenad, Freyberg Jan, Lau Charles, Kemp Jonas, Lai Jeremy, Azizi Shekoofeh, Kanada Kimberly, Man Siwai, Kulkarni Kavita, Sun Ruoxi, Shakeri Siamak, He Luheng, Caine Ben, Webson Albert, Latysheva Natasha, Johnson Melvin, Mansfield Philip, Lu Jian, Rivlin Ehud, Anderson Jesper, Green Bradley, Wong Renee, Krause Jonathan, Shlens Jonathon, Dominowska Ewa, Eslami S. M. Ali, Chou Katherine, Cui Claire, Vinyals Oriol, Kavukcuoglu Koray, Manyika James, Dean Jeff, Hassabis Demis, Matias Yossi, Webster Dale, Barral Joelle, Corrado Greg, Semturs Christopher, Mahdavi S. Sara, Gottweis Juraj, Karthikesalingam Alan, Natarajan Vivek
- Vision-language And Large Language Model Performance In Gastroenterology: GPT, Claude, Llama, Phi, Mistral, Gemma, And Quantized Models Safavi-naini Seyed Amir Ahmad, Ali Shuhaib, Shahab Omer, Shahhoseini Zahra, Savage Thomas, Rafiee Sara, Samaan Jamil S, Shabeeb Reem Al, Ladak Farah, Yang Jamie O, Echavarria Juan, Babar Sumbal, Shaukat Aasma, Margolis Samuel, Tatonetti Nicholas P, Nadkarni Girish, Kurdi Bara El, Soroush Ali
- A Systematic Survey Of Prompt Engineering In Large Language Models: Techniques And Applications Sahoo Pranab, Singh Ayush Kumar, Saha Sriparna, Jain Vinija, Mondal Samrat, Chadha Aman
- ICAL: Continual Learning Of Multimodal Agents By Transforming Trajectories Into Actionable Insights Sarch Gabriel, Jang Lawrence, Tarr Michael J., Cohen William W., Marino Kenneth, Fragkiadaki Katerina
- HELPER-X: A Unified Instructable Embodied Agent To Tackle Four Interactive Vision-language Domains With Memory-augmented Language Models Sarch Gabriel, Somani Sahil, Kapoor Raghav, Tarr Michael J., Fragkiadaki Katerina
- M5 -- A Diverse Benchmark To Assess The Performance Of Large Multimodal Models Across Multilingual And Multicultural Vision-language Tasks Schneider Florian, Sitaram Sunayana
- Speech Reallm -- Real-time Streaming Speech Recognition With Multimodal Llms By Teaching The Flow Of Time Seide Frank, Doulaty Morrie, Shi Yangyang, Gaur Yashesh, Jia Junteng, Wu Chunyang
- Multi-modal Adapter For Vision-language Models Seputis Dominykas, Mihailov Serghei, Chatterjee Soham, Xiao Zehao
- Medpromptx: Grounded Multimodal Prompting For Chest X-ray Diagnosis Shaaban Mai A., Khan Adnan, Yaqub Mohammad
- Putting Gpt-4o To The Sword: A Comprehensive Evaluation Of Language, Vision, Speech, And Multimodal Proficiency Shahriar Sakib, Lund Brady, Mannuru Nishith Reddy, Arshad Muhammad Arbab, Hayawi Kadhim, Bevara Ravi Varma Kumar, Mannuru Aashrith, Batool Laiba
- An End-to-end Speech Summarization Using Large Language Model Shang Hengchao, Li Zongyao, Guo Jiaxin, Li Shaojun, Rao Zhiqiang, Luo Yuanchang, Wei Daimeng, Yang Hao
- Llava-prumerge: Adaptive Token Reduction For Efficient Large Multimodal Models Shang Yuzhang, Cai Mu, Xu Bingxin, Lee Yong Jae, Yan Yan
- Synth\(^2\): Boosting Visual-language Models With Synthetic Captions And Image Embeddings Sharifzadeh Sahand, Kaplanis Christos, Pathak Shreya, Kumaran Dharshan, Ilic Anastasija, Mitrovic Jovana, Blundell Charles, Banino Andrea
- Mammothmoda: Multi-modal Large Language Model She Qi, Pan Junwen, Wan Xin, Zhang Rui, Lu Dawei, Huang Kai
- PMG: Personalized Multimodal Generation With Large Language Models Shen Xiaoteng, Zhang Rui, Zhao Xiaoyan, Zhu Jieming, Xiao Xi
- Proctag: Process Tagging For Assessing The Efficacy Of Document Instruction Data Shen Yufan, Luo Chuwei, Zhu Zhaoqing, Chen Yang, Zheng Qi, Yu Zhi, Bu Jiajun, Yao Cong
- Lumos: Empowering Multimodal Llms With Scene Text Recognition Shenoy Ashish, Lu Yichao, Jayakumar Srihari, Chatterjee Debojeet, Moslehpour Mohsen, Chuang Pierce, Harpale Abhay, Bhardwaj Vikas, Xu Di, Zhao Shicong, Zhao Longfang, Ramchandani Ankit, Dong Xin Luna, Kumar Anuj
- Non-autoregressive Sequence-to-sequence Vision-language Models Shi Kunyu, Dong Qi, Goncalves Luis, Tu Zhuowen, Soatto Stefano
- Paying More Attention To Image: A Training-free Method For Alleviating Hallucination In Lvlms Shi Liu, Kecheng Zheng, Wei Chen
- Eagle: Exploring The Design Space For Multimodal Llms With Mixture Of Encoders Shi Min, Liu Fuxiao, Wang Shihao, Liao Shijia, Radhakrishnan Subhashree, Huang De-an, Yin Hongxu, Sapra Karan, Yacoob Yaser, Shi Humphrey, Catanzaro Bryan, Tao Andrew, Kautz Jan, Yu Zhiding, Liu Guilin
- Math-llava: Bootstrapping Mathematical Reasoning For Multimodal Large Language Models Shi Wenhao, Hu Zhiqiang, Bin Yi, Liu Junhua, Yang Yang, Ng See-kiong, Bing Lidong, Lee Roy Ka-wei
- Assessment Of Multimodal Large Language Models In Alignment With Human Values Shi Zhelun, Wang Zhipin, Fan Hongxing, Zhang Zaibin, Li Lijun, Zhang Yongting, Yin Zhenfei, Sheng Lu, Qiao Yu, Shao Jing
- How Can Large Language Models Enable Better Socially Assistive Human-robot Interaction: A Brief Survey Shi Zhonghao, Landrum Ellen, O'Connell Amy, Kian Mina, Pinto-alva Leticia, Shrestha Kaleen, Zhu Xiaoyuan, Matarić Maja J
- Implicit Multimodal Alignment: On The Generalization Of Frozen Llms To Multimodal Inputs Shukor Mustafa, Cord Matthieu
- Design2code: How Far Are We From Automating Front-end Engineering? Si Chenglei, Zhang Yanzhe, Yang Zhengyuan, Liu Ruibo, Yang Diyi
- Moma: Multimodal LLM Adapter For Fast Personalized Image Generation Song Kunpeng, Zhu Yizhe, Liu Bingchen, Yan Qing, Elgammal Ahmed, Yang Xiao
- M3GIA: A Cognition Inspired Multilingual And Multimodal General Intelligence Ability Benchmark Song Wei, Li Yadong, Xu Jianhua, Wu Guowei, Ming Lingfeng, Yi Kexin, Luo Weihua, Li Houyi, Du Yi, Guo Fangda, Yu Kaicheng
- Moviellm: Enhancing Long Video Understanding With Ai-generated Movies Song Zhende, Wang Chenchen, Sheng Jiamu, Zhang Chi, Yu Gang, Fan Jiayuan, Chen Tao
- Probing Multimodal Llms As World Models For Driving Sreeram Shiva, Wang Tsun-hsuan, Maalouf Alaa, Rosman Guy, Karaman Sertac, Rus Daniela
- VDGD: Mitigating LVLM Hallucinations In Cognitive Prompts By Bridging The Visual Perception Gap Sreyan Ghosh, Chandra Kiran Reddy Evuru, Sonal Kumar, Utkarsh Tyagi, Oriol Nieto, Zeyu Jin, Dinesh Manocha
- Crossing New Frontiers: Knowledge-augmented Large Language Model Prompting For Zero-shot Text-based De Novo Molecule Design Srinivas Sakhinana Sagar, Runkana Venkataramana
- Few-shot VQA With Frozen Llms: A Tale Of Two Approaches Sterner Igor, Lin Weizhe, Chen Jinghong, Byrne Bill
- Investigating Video Reasoning Capability Of Large Language Models With Tropes In Movies Su Hung-ting, Chao Chun-tong, Hsu Ya-ching, Lin Xudong, Niu Yulei, Lee Hung-yi, Hsu Winston H.
- SK-VQA: Synthetic Knowledge Generation At Scale For Training Context-augmented Multimodal Llms Su Xin, Luo Man, Pan Kris W, Chou Tien Pei, Lal Vasudev, Howard Phillip
- Sq-llava: Self-questioning For Large Vision-language Assistant Sun Guohao, Qin Can, Wang Jiamian, Chen Zeyuan, Xu Ran, Tao Zhiqiang
- Dr-llava: Visual Instruction Tuning With Symbolic Clinical Grounding Sun Shenghuan, Goldgof Gregory M., Schubert Alexander, Sun Zhiqing, Hartvigsen Thomas, Butte Atul J., Alaa Ahmed
- Llavaolmobitnet1b: Ternary LLM Goes Multimodal! Sundaram Jainaveen, Iyer Ravi
- Retrieval Meets Reasoning: Even High-school Textbook Knowledge Benefits Multimodal Reasoning Tan Cheng, Wei Jingxuan, Sun Linzhuang, Gao Zhangyang, Li Siyuan, Yu Bihui, Guo Ruifeng, Li Stan Z.
- EVALALIGN: Supervised Fine-tuning Multimodal Llms With Human-aligned Data For Evaluating Text-to-image Models Tan Zhiyu, Yang Xiaomeng, Qin Luozheng, Yang Mengping, Zhang Cheng, Li Hao
- Instructdoc: A Dataset For Zero-shot Generalization Of Visual Document Understanding With Instructions Tanaka Ryota, Iki Taichi, Nishida Kyosuke, Saito Kuniko, Suzuki Jun
- Targeted Visual Prompting For Medical Visual Question Answering Tascon-morales Sergio, Márquez-neila Pablo, Sznitman Raphael
- Cantor: Inspiring Multimodal Chain-of-thought Of MLLM Timin Gao, Peixian Chen, Mengdan Zhang, Chaoyou Fu, Yunhang Shen, Yan Zhang, Shengchuan Zhang, Xiawu Zheng, Xing Sun, Liujuan Cao, Rongrong Ji
- An Ecosage Assistant: Towards Building A Multimodal Plant Care Dialogue Assistant Tomar Mohit, Tiwari Abhisek, Saha Tulika, Jha Prince, Saha Sriparna
- Generative AI Systems: A Systems-based Perspective On Generative AI Tomczak Jakub M.
- Eyes Wide Shut? Exploring The Visual Shortcomings Of Multimodal Llms Tong Shengbang, Liu Zhuang, Zhai Yuexiang, Ma Yi, Lecun Yann, Xie Saining
- Lavy: Vietnamese Multimodal Large Language Model Tran Chi, Thanh Huong Le
- Discrete Multimodal Transformers With A Pretrained Large Language Model For Mixed-supervision Speech Processing Trinh Viet Anh, Southwell Rosy, Guan Yiwen, He Xinlu, Wang Zhiyong, Whitehill Jacob
- Enhance Modality Robustness In Text-centric Multimodal Alignment With Adversarial Prompting Tsai Yun-da, Yen Ting-yu, Liao Keng-te, Lin Shou-de
- Improved Baselines For Data-efficient Perceptual Augmentation Of Llms Vallaeys Théophane, Shukor Mustafa, Cord Matthieu, Verbeek Jakob
- Enhancing Model Performance: Another Approach To Vision-language Instruction Tuning Vedanshu, Tripathi Mm, Jaint Bhavnesh
- Cross-modal Projection In Multimodal Llms Doesn't Really Project Visual Attributes To Textual Space Verma Gaurav, Choi Minje, Sharma Kartik, Watson-daniels Jamelle, Oh Sejoon, Kumar Srijan
- LOOK-M: Look-once Optimization In KV Cache For Efficient Multimodal Long-context Inference Wan Zhongwei, Wu Ziang, Liu Che, Huang Jinfa, Zhu Zhihong, Jin Peng, Wang Longyue, Yuan Li
- Datavist5: A Pre-trained Language Model For Jointly Understanding Text And Data Visualization Wan Zhuoyue, Song Yuanfeng, Li Shuaimin, Zhang Chen Jason, Wong Raymond Chi-wing
- COSMO: Contrastive Streamlined Multimodal Model With Interleaved Pre-training Wang Alex Jinpeng, Li Linjie, Lin Kevin Qinghong, Wang Jianfeng, Lin Kevin, Yang Zhengyuan, Wang Lijuan, Shou Mike Zheng
- Pargo: Bridging Vision-language With Partial And Global Views Wang An-lan, Shan Bin, Shi Wei, Lin Kun-yu, Fei Xiang, Tang Guozhi, Liao Lei, Tang Jingqun, Huang Can, Zheng Wei-shi
- Image Re-identification: Where Self-supervision Meets Vision-language Learning Wang Bin, Liang Yuying, Cai Lei, Huang Huakun, Zeng Huanqiang
- IAA: Inner-adaptor Architecture Empowers Frozen Large Language Model With Multimodal Capabilities Wang Bin, Xie Chunyu, Leng Dawei, Yin Yuhui
- Information Theoretic Text-to-image Alignment Wang Chao, Franzese Giulio, Finamore Alessandro, Gallo Massimo, Michiardi Pietro
- Mllm-tool: A Multimodal Large Language Model For Tool Agent Learning Wang Chenyu, Luo Weixin, Chen Qianyu, Mai Haonan, Guo Jindi, Dong Sixun, Xuan Xiaohua (Michael), Li Zhengxin, Ma Lin, Gao Shenghua
- Instruction Tuning-free Visual Token Complement For Multimodal Llms Wang Dongsheng, Cui Jiequan, Li Miaoge, Lin Wang, Chen Bo, Zhang Hanwang
- M4U: Evaluating Multilingual Understanding And Reasoning For Large Multimodal Models Wang Hongyu, Xu Jiayu, Xie Senwei, Wang Ruiping, Li Jialin, Xie Zhaojie, Zhang Bin, Xiong Chuyan, Chen Xilin
- Is A Picture Worth A Thousand Words? Delving Into Spatial Reasoning For Vision Language Models Wang Jiayu, Ming Yifei, Shi Zhenmei, Vineet Vibhav, Wang Xin, Joshi Neel
- Exploring The Potential Of Multimodal LLM With Knowledge-intensive Multimodal ASR Wang Minghan, Wang Yuxia, Vu Thuy-trang, Shareghi Ehsan, Haffari Gholamreza
- Inferaligner: Inference-time Alignment For Harmlessness Through Cross-model Guidance Wang Pengyu, Zhang Dong, Li Linyang, Tan Chenkun, Wang Xinghao, Ren Ke, Jiang Botian, Qiu Xipeng
- Mfc-bench: Benchmarking Multimodal Fact-checking With Large Vision-language Models Wang Shengkang, Lin Hongzhan, Luo Ziyang, Ye Zhen, Chen Guang, Ma Jing
- From Llms To Mllms: Exploring The Landscape Of Multimodal Jailbreaking Wang Siyuan, Long Zhuohan, Fan Zhihao, Wei Zhongyu
- Searching For Best Practices In Retrieval-augmented Generation Wang Xiaohua, Wang Zhenghua, Gao Xuan, Zhang Feiran, Wu Yixin, Xu Zhibo, Shi Tianyuan, Wang Zhengyuan, Li Shizheng, Qian Qi, Yin Ruicheng, Lv Changze, Zheng Xiaoqing, Huang Xuanjing
- Adaptvision: Dynamic Input Scaling In Mllms For Versatile Scene Understanding Wang Yonghui, Zhou Wengang, Feng Hao, Li Houqiang
- HPT++: Hierarchically Prompting Vision-language Models With Multi-granularity Knowledge Generation And Improved Structure Modeling Wang Yubin, Jiang Xinyang, Cheng De, Sun Wenli, Li Dongsheng, Zhao Cairong
- Ladic: Are Diffusion Models Really Inferior To Autoregressive Counterparts For Image-to-text Generation? Wang Yuchi, Ren Shuhuai, Gao Rundong, Yao Linli, Guo Qingyan, An Kaikai, Bai Jianhong, Sun Xu
- Exovip: Step-by-step Verification And Exploration With Exoskeleton Modules For Compositional Visual Reasoning Wang Yuxuan, Yuille Alan, Li Zhuowan, Zheng Zilong
- Stop Reasoning! When Multimodal Llms With Chain-of-thought Reasoning Meets Adversarial Images Wang Zefeng, Han Zhen, Chen Shuo, Xue Fan, Ding Zifeng, Xiao Xun, Tresp Volker, Torr Philip, Gu Jindong
- Genartist: Multimodal LLM As An Agent For Unified Image Generation And Editing Wang Zhenyu, Li Aoxue, Li Zhenguo, Liu Xihui
- Omnijarvis: Unified Vision-language-action Tokenization Enables Open-world Instruction Following Agents Wang Zihao, Cai Shaofei, Mu Zhancun, Lin Haowei, Zhang Ceyao, Liu Xuejie, Li Qing, Liu Anji, Ma Xiaojian, Liang Yitao
- Eliminating Position Bias Of Language Models: A Mechanistic Approach Wang Ziqi, Zhang Hanlin, Li Xiner, Huang Kuan-hao, Han Chi, Ji Shuiwang, Kakade Sham M., Peng Hao, Ji Heng
- C3LLM: Conditional Multimodal Content Generation Using Large Language Models Wang Zixuan, Duan Qinkai, Tai Yu-wing, Tang Chi-keung
- Browse And Concentrate: Comprehending Multimodal Content Via Prior-llm Context Fusion Wang Ziyue, Chen Chi, Zhu Yiqi, Luo Fuwen, Li Peng, Yan Ming, Zhang Ji, Huang Fei, Sun Maosong, Liu Yang
- AGLA: Mitigating Object Hallucinations In Large Vision-language Models With Assembly Of Global And Local Attention Wenbin An, Feng Tian, Sicong Leng, Jiahao Nie, Haonan Lin, Qianying Wang, Guang Dai, Ping Chen, Shijian Lu
- Towards Knowledge-grounded Natural Language Understanding And Generation Whitehouse Chenxi
- A Surprising Failure? Multimodal Llms And The NLVR Challenge Wu Anne, Brantley Kianté, Artzi Yoav
- Curriculum Learning With Quality-driven Data Selection Wu Biao, Meng Fang, Chen Ling
- Synthetic Multimodal Question Generation Wu Ian, Jayanthi Sravan, Viswanathan Vijay, Rosenberg Simon, Pakazad Sina, Wu Tongshuang, Neubig Graham
- Visionllm V2: An End-to-end Generalist Multimodal Large Language Model For Hundreds Of Vision-language Tasks Wu Jiannan, Zhong Muyan, Xing Sen, Lai Zeqiang, Liu Zhaoyang, Wang Wenhai, Chen Zhe, Zhu Xizhou, Lu Lewei, Lu Tong, Luo Ping, Qiao Yu, Dai Jifeng
- Commit: Coordinated Instruction Tuning For Multimodal Large Language Models Wu Junda, Li Xintong, Yu Tong, Wang Yu, Chen Xiang, Gu Jiuxiang, Yao Lina, Shang Jingbo, Mcauley Julian
- Noiseboost: Alleviating Hallucination With Noise Perturbation For Multimodal Large Language Models Wu Kai, Jiang Boyuan, Jiang Zhengkai, He Qingdong, Luo Donghao, Wang Shengzhi, Liu Qingwen, Wang Chengjie
- Controlmllm: Training-free Visual Prompt Learning For Multimodal Large Language Models Wu Mingrui, Cai Xinyue, Ji Jiayi, Li Jiale, Huang Oucheng, Luo Gen, Fei Hao, Sun Xiaoshuai, Ji Rongrong
- Motionllm: Multimodal Motion-language Learning With Large Language Models Wu Qi, Zhao Yubo, Wang Yifan, Tai Yu-wing, Tang Chi-keung
- GUI Action Narrator: Where And When Did That Action Take Place? Wu Qinchen, Gao Difei, Lin Kevin Qinghong, Wu Zhuoyu, Guo Xiangwu, Li Peiran, Zhang Weichen, Wang Hengxu, Shou Mike Zheng
- Avatar: Optimizing LLM Agents For Tool-assisted Knowledge Retrieval Wu Shirley, Zhao Shiyu, Huang Qian, Huang Kexin, Yasunaga Michihiro, Cao Kaidi, Ioannidis Vassilis N., Subbian Karthik, Leskovec Jure, Zou James
- Videollm-mod: Efficient Video-language Streaming With Mixture-of-depths Vision Computation Wu Shiwei, Chen Joya, Lin Kevin Qinghong, Wang Qimeng, Gao Yan, Xu Qianli, Xu Tong, Hu Yao, Chen Enhong, Shou Mike Zheng
- F-LMM: Grounding Frozen Large Multimodal Models Wu Size, Jin Sheng, Zhang Wenwei, Xu Lumin, Liu Wentao, Li Wei, Loy Chen Change
- Semantic Alignment For Multimodal Large Language Models Wu Tao, Li Mengze, Chen Jingyuan, Ji Wei, Lin Wang, Gao Jinyang, Kuang Kun, Zhao Zhou, Wu Fei
- Freeva: Offline MLLM As Training-free Video Assistant Wu Wenhao
- Mind's Eye Of Llms: Visualization-of-thought Elicits Spatial Reasoning In Large Language Models Wu Wenshan, Mao Shaoguang, Zhang Yadong, Xia Yan, Dong Li, Cui Lei, Wei Furu
- What To Do If Language Models Disagree? Black-box Model Ensembling For Textual And Visual Question Answering Xia Yuxi, Zaporojets Klim, Roth Benjamin
- Unified Hallucination Detection For Multimodal Large Language Models Xiang Chen, Chenxi Wang, Yida Xue, Ningyu Zhang, Xiaoyan Yang, Qiang Li, Yue Shen, Lei Liang, Jinjie Gu, Huajun Chen
- A Comprehensive Survey Of Large Language Models And Multimodal Large Language Models In Medicine Xiao Hanguang, Zhou Feizhong, Liu Xingyue, Liu Tianqi, Li Zhipeng, Liu Xin, Huang Xiaoxuan
- Palm2-vadapter: Progressively Aligned Language Model Makes A Strong Vision-language Adapter Xiao Junfei, Xu Zheng, Yuille Alan, Yan Shen, Wang Boyu
- Seeing The Image: Prioritizing Visual Correlation By Contrastive Alignment Xiao Xin, Wu Bohong, Wang Jiacong, Li Chunyuan, Zhou Xun, Guo Haoyuan
- Logicvista: Multimodal LLM Logical Reasoning Benchmark In Visual Contexts Xiao Yijia, Sun Edward, Liu Tianyu, Wang Wei
- Alleviating Hallucination In Large Vision-language Models With Active Retrieval Augmentation Xiaoye Qu, Qiyuan Chen, Wei Wei, Jishuo Sun, Jianfeng Dong
- Advancing Multimodal Large Language Models With Quantization-aware Scale Learning For Efficient Adaptation Xie Jingjing, Zhang Yuxin, Lin Mingbao, Cao Liujuan, Ji Rongrong
- Show-o: One Single Transformer To Unify Multimodal Understanding And Generation Xie Jinheng, Mao Weijia, Bai Zechen, Zhang David Junhao, Wang Weihao, Lin Kevin Qinghong, Gu Yuchao, Chen Zhijie, Yang Zhenheng, Shou Mike Zheng
- Large Multimodal Agents: A Survey Xie Junlin, Chen Zhihong, Zhang Ruifei, Wan Xiang, Li Guanbin
- Wanglab At MEDIQA-M3G 2024: Multimodal Medical Answer Generation Using Large Language Models Xie Ronald, Palayew Steven, Toma Augustin, Bader Gary, Wang Bo
- EFUF: Efficient Fine-grained Unlearning Framework For Mitigating Hallucinations In Multimodal Large Language Models Xing Shangyu, Zhao Fei, Wu Zhen, An Tuo, Chen Weihao, Li Chunhui, Zhang Jianbing, Dai Xinyu
- Mitigating Hallucinations In Large Vision-language Models With Instruction Contrastive Decoding Xintong Wang, Jingheng Pan, Liang Ding, Chris Biemann
- A General-purpose Device For Interaction With Llms Xu Jiajun, Wang Qun, Cao Yuhang, Zeng Baitao, Liu Sicheng
- Reverse Image Retrieval Cues Parametric Memory In Multimodal Llms Xu Jialiang, Moor Michael, Leskovec Jure
- Tur[k]ingbench: A Challenge Benchmark For Web Agents Xu Kevin, Kordi Yeganeh, Nayak Tanay, Asija Ado, Wang Yizhong, Sanders Kate, Byerly Adam, Zhang Jingyu, Van Durme Benjamin, Khashabi Daniel
- From Introspection To Best Practices: Principled Analysis Of Demonstrations In Multimodal In-context Learning Xu Nan, Wang Fei, Zhang Sheng, Poon Hoifung, Chen Muhao
- Libra: Building Decoupled Vision System On Large Language Models Xu Yifan, Yang Xiaoshan, Song Yaguang, Xu Changsheng
- Vision-flan: Scaling Human-labeled Tasks In Visual Instruction Tuning Xu Zhiyang, Feng Chao, Shao Rulin, Ashby Trevor, Shen Ying, Jin Di, Cheng Yu, Wang Qifan, Huang Lifu
- LEMMA: Towards Lvlm-enhanced Multimodal Misinformation Detection With External Knowledge Augmentation Xuan Keyang, Yi Li, Yang Fan, Wu Ruochen, Fung Yi R., Ji Heng
- List Items One By One: A New Data Source And Learning Paradigm For Multimodal Llms Yan An, Yang Zhengyuan, Wu Junda, Zhu Wanrong, Yang Jianwei, Li Linjie, Lin Kevin, Wang Jianfeng, Mcauley Julian, Gao Jianfeng, Wang Lijuan
- Echosight: Advancing Visual-language Models With Wiki Knowledge Yan Yibin, Xie Weidi
- 3D Vision And Language Pretraining With Large-scale Synthetic Data Yang Dejie, Xu Zhu, Mo Wentao, Chen Qingchao, Huang Siyuan, Liu Yang
- Mastering Text-to-image Diffusion: Recaptioning, Planning, And Generating With Multimodal Llms Yang Ling, Yu Zhaochen, Meng Chenlin, Xu Minkai, Ermon Stefano, Cui Bin
- Mllmreid: Multimodal Large Language Model-based Person Re-identification Yang Shan, Zhang Yongfei
- Seed-story: Multimodal Long Story Generation With Large Language Model Yang Shuai, Ge Yuying, Li Yang, Chen Yukang, Ge Yixiao, Shan Ying, Chen Yingcong
- DGL: Dynamic Global-local Prompt Tuning For Text-video Retrieval Yang Xiangpeng, Zhu Linchao, Wang Xiaohan, Yang Yi
- Recent Advances Of Foundation Language Models-based Continual Learning: A Survey Yang Yutao, Zhou Jie, Ding Xuanwen, Huai Tianyu, Liu Shunyu, Chen Qin, He Liang, Xie Yuan
- Dense Connector For Mllms Yao Huanjin, Wu Wenhao, Yang Taojiannan, Song Yuxin, Zhang Mengxi, Feng Haocheng, Sun Yifan, Li Zhiheng, Ouyang Wanli, Wang Jingdong
- Deco: Decoupling Token Compression From Semantic Abstraction In Multimodal Large Language Models Yao Linli, Li Lei, Ren Shuhuai, Wang Lean, Liu Yuanxin, Sun Xu, Hou Lu
- Improving Visual Commonsense In Language Models Via Multiple Image Generation Yariv Guy, Schwartz Idan, Adi Yossi, Benaim Sagie
- X-VILA: Cross-modality Alignment For Large Language Model Ye Hanrong, Huang De-an, Lu Yao, Yu Zhiding, Ping Wei, Tao Andrew, Kautz Jan, Han Song, Xu Dan, Molchanov Pavlo, Yin Hongxu
- Mplug-owl3: Towards Long Image-sequence Understanding In Multi-modal Large Language Models Ye Jiabo, Xu Haiyang, Liu Haowei, Hu Anwen, Yan Ming, Qian Qi, Zhang Ji, Huang Fei, Zhou Jingren
- Efficient Vision-and-language Pre-training With Text-relevant Image Patch Selection Ye Wei, Jiang Chaoya, Xu Haiyang, Ye Chenhao, Li Chenliang, Yan Ming, Zhang Shikun, Huang Songhang, Huang Fei
- Mm-spubench: Towards Better Understanding Of Spurious Biases In Multimodal Llms Ye Wenqian, Zheng Guangtao, Ma Yunsheng, Cao Xu, Lai Bolin, Rehg James M., Zhang Aidong
- Harnessing Multimodal Large Language Models For Multimodal Sequential Recommendation Ye Yuyang, Zheng Zhi, Shen Yishan, Wang Tianshu, Zhang Hengruo, Zhu Peijun, Yu Runlong, Zhang Kai, Xiong Hui
- BEAF: Observing Before-after Changes To Evaluate Hallucination In Vision-language Models Ye-bin Moon, Hyeon-woo Nam, Choi Wonseok, Oh Tae-hyun
- Debiasing Multimodal Large Language Models Zhang Yi-fan, Yu Weichen, Wen Qingsong, Wang Xue, Zhang Zhang, Wang Liang, Jin Rong, Tan Tieniu
- SEA: Supervised Embedding Alignment For Token-level Visual-textual Integration In Mllms Yin Yuanyang, Zhao Yaqi, Zhang Yajie, Lin Ke, Wang Jiahao, Tao Xin, Wan Pengfei, Zhang Di, Yin Baoqun, Zhang Wentao
- Vqattack: Transferable Adversarial Attacks On Visual Question Answering Via Pre-trained Models Yin Ziyi, Ye Muchao, Zhang Tianrong, Wang Jiaqi, Liu Han, Chen Jinghui, Wang Ting, Ma Fenglong
- Unveiling The Safety Of Gpt-4o: An Empirical Study Using Jailbreak Attacks Ying Zonghao, Liu Aishan, Liu Xianglong, Tao Dacheng
- Safety Fine-tuning At (almost) No Cost: A Baseline For Vision Large Language Models Zong Yongshuo, Bohdal Ondrej, Yu Tingyang, Yang Yongxin, Hospedales Timothy
- C-TPT: Calibrated Test-time Prompt Tuning For Vision-language Models Via Text Feature Dispersion Yoon Hee Suk, Yoon Eunseop, Tee Joshua Tian Jin, Hasegawa-johnson Mark, Li Yingzhen, Yoo Chang D.
- By My Eyes: Grounding Multimodal Large Language Models With Sensor Data Via Visual Prompting Yoon Hyungjun, Tolera Biniyam Aschalew, Gong Taesik, Lee Kimin, Lee Sung-ju
- Matryoshka-adaptor: Unsupervised And Supervised Tuning For Smaller Embedding Dimensions Yoon Jinsung, Sinha Raj, Arik Sercan O, Pfister Tomas
- Ferret-ui: Grounded Mobile UI Understanding With Multimodal Llms You Keen, Zhang Haotian, Schoop Eldon, Weers Floris, Swearngin Amanda, Nichols Jeffrey, Yang Yinfei, Gan Zhe
- Web2code: A Large-scale Webpage-to-code Dataset And Evaluation Framework For Multimodal Llms Yun Sukmin, Lin Haokun, Thushara Rusiru, Bhat Mohammad Qazim, Wang Yongxin, Jiang Zutao, Deng Mingkai, Wang Jinhong, Tao Tianhua, Li Junbo, Li Haonan, Nakov Preslav, Baldwin Timothy, Liu Zhengzhong, Xing Eric P., Liang Xiaodan, Shen Zhiqiang
- Rec-gpt4v: Multimodal Recommendation With Large Vision-language Models Liu Yuqing, Wang Yu, Sun Lichao, Yu Philip S.
- Let Storytelling Tell Vivid Stories: An Expressive And Fluent Multimodal Storyteller Zang Chuanqi, Tang Jiji, Zhang Rongsheng, Zhao Zeng, Lv Tangjie, Pei Mingtao, Liang Wei
- Understanding And Mitigating Compositional Issues In Text-to-image Generative Models Zarei Arman, Rezaei Keivan, Basu Samyadeep, Saberi Mehrdad, Moayeri Mazda, Kattakinda Priyatham, Feizi Soheil
- Advancing Multimodal Large Language Models In Chart Question Answering With Visualization-referenced Instruction Tuning Zeng Xingchen, Lin Haichuan, Ye Yilin, Zeng Wei
- Can Mllms Perform Text-to-image In-context Learning? Zeng Yuchen, Kang Wonjun, Chen Yicong, Koo Hyung Il, Lee Kangwook
- Anygpt: Unified Multimodal LLM With Discrete Sequence Modeling Zhan Jun, Dai Junqi, Ye Jiasheng, Zhou Yunhua, Zhang Dong, Liu Zhigeng, Zhang Xin, Yuan Ruibin, Zhang Ge, Li Linyang, Yan Hang, Fu Jie, Gui Tao, Sun Tianxiang, Jiang Yugang, Qiu Xipeng
- Skyeyegpt: Unifying Remote Sensing Vision-language Tasks Via Instruction Tuning With Large Language Model Zhan Yang, Xiong Zhitong, Yuan Yuan
- Distilling Implicit Multimodal Knowledge Into Llms For Zero-resource Dialogue Generation Zhang Bo, Ma Hui, Ding Jian, Wang Jian, Xu Bo, Lin Hongfei
- Notellm-2: Multimodal Large Representation Models For Recommendation Zhang Chao, Zhang Haoxin, Wu Shiwei, Wu Di, Xu Tong, Gao Yan, Hu Yao, Chen Enhong
- Mm-llms: Recent Advances In Multimodal Large Language Models Zhang Duzhen, Yu Yahan, Dong Jiahua, Li Chenxing, Su Dan, Chu Chenhui, Yu Dong
- CMMMU: A Chinese Massive Multi-discipline Multimodal Understanding Benchmark Zhang Ge, Du Xinrun, Chen Bei, Liang Yiming, Luo Tongxu, Zheng Tianyu, Zhu Kang, Cheng Yuyang, Xu Chunpu, Guo Shuyue, Zhang Haoran, Qu Xingwei, Wang Junjie, Yuan Ruibin, Li Yizhi, Wang Zekun, Liu Yudong, Tsai Yu-hsuan, Zhang Fengji, Lin Chenghua, Huang Wenhao, Fu Jie
- Read And Think: An Efficient Step-wise Multimodal Language Model For Document Understanding And Reasoning Zhang Jinxu
- Automated Multi-level Preference For Mllms Zhang Mengxi, Wu Wenhao, Lu Yu, Song Yuxin, Rong Kang, Yao Huanjin, Zhao Jianbo, Liu Fanglong, Sun Yifan, Feng Haocheng, Wang Jingdong
- Internlm-xcomposer-2.5: A Versatile Large Vision Language Model Supporting Long-contextual Input And Output Zhang Pan, Dong Xiaoyi, Zang Yuhang, Cao Yuhang, Qian Rui, Chen Lin, Guo Qipeng, Duan Haodong, Wang Bin, Ouyang Linke, Zhang Songyang, Zhang Wenwei, Li Yining, Gao Yang, Sun Peng, Zhang Xinyue, Li Wei, Li Jingwen, Wang Wenhai, Yan Hang, He Conghui, Zhang Xingcheng, Chen Kai, Dai Jifeng, Qiao Yu, Lin Dahua, Wang Jiaqi
- VAR-CLIP: Text-to-image Generator With Visual Auto-regressive Modeling Zhang Qian, Dai Xiangzi, Yang Ninghua, An Xiang, Feng Ziyong, Ren Xingyu
- Llava-read: Enhancing Reading Ability Of Multimodal Language Models Zhang Ruiyi, Zhou Yufan, Chen Jian, Gu Jiuxiang, Chen Changyou, Sun Tong
- Direct Preference Optimization Of Video Large Multimodal Models From Language Model Reward Zhang Ruohong, Gui Liangke, Sun Zhiqing, Feng Yihao, Xu Keyang, Zhang Yuanhan, Fu Di, Li Chunyuan, Hauptmann Alexander, Bisk Yonatan, Yang Yiming
- From Redundancy To Relevance: Enhancing Explainability In Multimodal Large Language Models Zhang Xiaofeng, Shen Chen, Yuan Xiaosong, Yan Shaotian, Xie Liang, Wang Wenxiao, Gu Chaochen, Tang Hao, Ye Jieping
- Mme-realworld: Could Your Multimodal LLM Challenge High-resolution Real-world Scenarios That Are Difficult For Humans? Zhang Yi-fan, Zhang Huanyu, Tian Haochen, Fu Chaoyou, Zhang Shuangqing, Wu Junfei, Li Feng, Wang Kun, Wen Qingsong, Zhang Zhang, Wang Liang, Jin Rong, Tan Tieniu
- Wings: Learning Multimodal Llms Without Text-only Forgetting Zhang Yi-kai, Lu Shiyin, Li Yang, Ma Yanqing, Chen Qing-guo, Xu Zhao, Luo Weihua, Zhang Kaifu, Zhan De-chuan, Ye Han-jia
- Benchmarking Trustworthiness Of Multimodal Large Language Models: A Comprehensive Study Zhang Yichi, Huang Yao, Sun Yitong, Liu Chang, Zhao Zhe, Fang Zhengwei, Wang Yifan, Chen Huanran, Yang Xiao, Wei Xingxing, Su Hang, Dong Yinpeng, Zhu Jun
- Rankclip: Ranking-consistent Language-image Pretraining Zhang Yiming, Zhao Zhuokai, Chen Zhaorun, Feng Zhili, Ding Zenghui, Sun Yining
- How Far Are We From Intelligent Visual Deductive Reasoning? Zhang Yizhe, Bai He, Zhang Ruixiang, Gu Jiatao, Zhai Shuangfei, Susskind Josh, Jaitly Navdeep
- Text-augmented Multimodal Llms For Chemical Reaction Condition Recommendation Zhang Yu, Yu Ruijie, Zeng Kaipeng, Li Ding, Zhu Feng, Yang Xiaokang, Jin Yaohui, Xu Yanyan
- Worldqa: Multimodal World Knowledge In Videos Through Long-chain Reasoning Zhang Yuanhan, Zhang Kaichen, Li Bo, Pu Fanyi, Setiadharma Christopher Arif, Yang Jingkang, Liu Ziwei
- Prompt-aware Adapter: Towards Learning Adaptive Visual Tokens For Multimodal Large Language Models Zhang Yue, Fan Hehe, Yang Yi
- OT-VP: Optimal Transport-guided Visual Prompting For Test-time Adaptation Zhang Yunbei, Mehra Akshay, Hamm Jihun
- Preserving Knowledge In Large Language Model With Model-agnostic Self-decompression Zhang Zilun, Sun Yutao, Zhao Tiancheng, Sha Leigang, Xu Ruochen, Lee Kyusong, Yin Jianwei
- Galla: Graph Aligned Large Language Models For Improved Source Code Understanding Zhang Ziyin, Yu Hang, Li Shijie, Di Peng, Li Jianguo, Wang Rui
- Mitigating Object Hallucination In Large Vision-language Models Via Classifier-free Guidance Zhao Linxi, Deng Yihe, Zhang Weitong, Gu Quanquan
- Mg-llava: Towards Multi-granularity Visual Instruction Tuning Zhao Xiangyu, Li Xiangtai, Duan Haodong, Huang Haian, Li Yining, Chen Kai, Yang Hua
- HALC: Object Hallucination Reduction Via Adaptive Focal-contrast Decoding Chen Zhaorun, Zhao Zhuokai, Luo Hongyin, Yao Huaxiu, Li Bo, Zhou Jiawei
- Gpt-4v(ision) Is A Generalist Web Agent, If Grounded Zheng Boyuan, Gou Boyu, Kil Jihyung, Sun Huan, Su Yu
- Dreamlip: Language-image Pre-training With Long Captions Zheng Kecheng, Zhang Yifei, Wu Wei, Lu Fan, Ma Shuailei, Jin Xin, Chen Wei, Shen Yujun
- More-3s: Multimodal-based Offline Reinforcement Learning With Shared Semantic Spaces Zheng Tianyu, Zhang Ge, Qu Xingwei, Kuang Ming, Huang Stephen W., He Zhaofeng
- Moextend: Tuning New Experts For Modality And Task Extension Zhong Shanshan, Gao Shanghua, Huang Zhongzhan, Wen Wushao, Zitnik Marinka, Zhou Pan
- Enhancing Multimodal Large Language Models With Multi-instance Visual Prompt Generator For Visual Representation Enrichment Zhong Wenliang, Wu Wenyi, Li Qi, Barton Rob, Du Boxin, Sam Shioulin, Bouyarmane Karim, Tutar Ismail, Huang Junzhou
- Navgpt-2: Unleashing Navigational Reasoning Capability For Large Vision-language Models Zhou Gengze, Hong Yicong, Wang Zun, Wang Xin Eric, Wu Qi
- Image-of-thought Prompting For Visual Reasoning Refinement In Multimodal Large Language Models Zhou Qiji, Zhou Ruochen, Hu Zike, Lu Panzhong, Gao Siyang, Zhang Yue
- Few-shot Adversarial Prompt Learning On Vision-language Models Zhou Yiwei, Xia Xiaobo, Lin Zhiwei, Han Bo, Liu Tongliang
- Calibrated Self-rewarding Vision Language Models Zhou Yiyang, Fan Zhiyuan, Cheng Dongjie, Yang Sihan, Chen Zhaorun, Cui Chenhang, Wang Xiyao, Li Yun, Zhang Linjun, Yao Huaxiu
- LIME-M: Less Is More For Evaluation Of Mllms Zhu Kang, Zang Qianbo, Jia Shian, Wu Siwei, Fang Feiteng, Li Yizhi, Guo Shuyue, Zheng Tianyu, Li Bo, Wu Haoning, Qu Xingwei, Yang Jian, Liu Zachary, Yue Xiang, Liu J. H., Lin Chenghua, Yang Min, Ni Shiwen, Huang Wenhao, Zhang Ge
- Efficient Test-time Prompt Tuning For Vision-language Models Zhu Yuhan, Zhang Guozhen, Xu Chen, Shen Haocheng, Chen Xiaoxin, Wu Gangshan, Wang Limin
- Murar: A Simple And Effective Multimodal Retrieval And Answer Refinement Framework For Multimodal Question Answering Zhu Zhengyuan, Lee Daniel, Zhang Hong, Harsha Sai Sree, Feujio Loic, Maharaj Akash, Li Yunyao
- Protllm: An Interleaved Protein-language LLM With Protein-as-word Pre-training Zhuo Le, Chi Zewen, Xu Minghao, Huang Heyan, Zheng Heqi, He Conghui, Mao Xian-ling, Zhang Wentao
- Less Is More: Mitigating Multimodal Hallucination From An EOS Decision Perspective Yue Zihao, Zhang Liang, Jin Qin
- Mmmmodal -- Multi-images Multi-audio Multi-turn Multi-modal Zolkepli Husein, Razak Aisyah, Adha Kamarul, Nazhan Ariff
- VL-ICL Bench: The Devil In The Details Of Benchmarking Multimodal In-context Learning Zong Yongshuo, Bohdal Ondrej, Hospedales Timothy
- Llama-excitor: General Instruction Tuning Via Indirect Feature Interaction Zou Bo, Yang Chao, Qiao Yu, Quan Chengbin, Zhao Youjian
- Implicitave: An Open-source Dataset And Multimodal Llms Benchmark For Implicit Attribute Value Extraction Zou Henry Peng, Samuel Vinay, Zhou Yue, Zhang Weizhi, Fang Liancheng, Song Zihe, Yu Philip S., Caragea Cornelia
- What Do Mllms Hear? Examining Reasoning With Text And Sound Components In Multimodal Large Language Models Çoban Enis Berk, Mandel Michael I., Devaney Johanna
🏷 Pretraining Methods
- Weighted Transformer Network For Machine Translation Ahmed Karim, Keskar Nitish Shirish, Socher Richard
- Attention Is All You Need Vaswani Ashish, Shazeer Noam, Parmar Niki, Uszkoreit Jakob, Jones Llion, Gomez Aidan N., Kaiser Lukasz, Polosukhin Illia
- Sockeye: A Toolkit For Neural Machine Translation Hieber Felix, Domhan Tobias, Denkowski Michael, Vilar David, Sokolov Artem, Clifton Ann, Post Matt
- Simple Recurrent Units For Highly Parallelizable Recurrence Lei Tao, Zhang Yu, Wang Sida I., Dai Hui, Artzi Yoav
- Shortcut-stacked Sentence Encoders For Multi-domain Inference Nie Yixin, Bansal Mohit
- Training Deeper Neural Machine Translation Models With Transparent Attention Bapna Ankur, Chen Mia Xu, Firat Orhan, Cao Yuan, Wu Yonghui
- The Best Of Both Worlds: Combining Recent Advances In Neural Machine Translation Chen Mia Xu, Firat Orhan, Bapna Ankur, Johnson Melvin, Macherey Wolfgang, Foster George, Jones Llion, Parmar Niki, Schuster Mike, Chen Zhifeng, Wu Yonghui, Hughes Macduff
- SALSA-TEXT: Self Attentive Latent Space Based Adversarial Text Generation Gagnon-marchand Jules, Sadeghi Hamed, Haidar Md. Akmal, Rezagholizadeh Mehdi
- The Memad Submission To The WMT18 Multimodal Translation Task Grönroos Stig-arne, Huet Benoit, Kurimo Mikko, Laaksonen Jorma, Merialdo Bernard, Pham Phu, Sjöberg Mats, Sulubacak Umut, Tiedemann Jörg, Troncy Raphael, Vázquez Raúl
- BERT: Pre-training Of Deep Bidirectional Transformers For Language Understanding Devlin Jacob, Chang Ming-wei, Lee Kenton, Toutanova Kristina
- Ms-uedin Submission To The WMT2018 APE Shared Task: Dual-source Transformer For Automatic Post-editing Junczys-dowmunt Marcin, Grundkiewicz Roman
- Fast And Simple Mixture Of Softmaxes With BPE And Hybrid-lightrnn For Language Generation Kong Xiang, Xie Qizhe, Dai Zihang, Hovy Eduard
- Phrase-based Attentions Nguyen Phi Xuan, Joty Shafiq
- Polite Dialogue Generation Without Parallel Data Niu Tong, Bansal Mohit
- Like A Baby: Visually Situated Neural Language Acquisition Ororbia Alexander G., Mali Ankur, Kelly Matthew A., Reitter David
- Training Tips For The Transformer Model Popel Martin, Bojar Ondřej
- The University Of Cambridge's Machine Translation Systems For WMT18 Stahlberg Felix, De Gispert Adria, Byrne Bill
- An Operation Sequence Model For Explainable Neural Machine Translation Stahlberg Felix, Saunders Danielle, Byrne Bill
- Can You Tell Me How To Get Past Sesame Street? Sentence-level Pretraining Beyond Language Modeling Wang Alex, Hula Jan, Xia Patrick, Pappagari Raghavendra, Mccoy R. Thomas, Patel Roma, Kim Najoung, Tenney Ian, Huang Yinghui, Yu Katherin, Jin Shuning, Chen Berlin, Van Durme Benjamin, Grave Edouard, Pavlick Ellie, Bowman Samuel R.
- Robust Text-to-sql Generation With Execution-guided Decoding Wang Chenglong, Tatwawadi Kedar, Brockschmidt Marc, Huang Po-sen, Mao Yi, Polozov Oleksandr, Singh Rishabh
- A Task In A Suit And A Tie: Paraphrase Generation With Semantic Augmentation Wang Su, Gupta Rahul, Chang Nancy, Baldridge Jason
- Improved Training Of End-to-end Attention Models For Speech Recognition Zeyer Albert, Irie Kazuki, Schlüter Ralf, Ney Hermann
- Abductive Reasoning As Self-supervision For Common Sense Question Answering Aakur Sathyanarayanan N., Sarkar Sudeep
- Syntactically Supervised Transformers For Faster Neural Machine Translation Akoury Nader, Krishna Kalpesh, Iyyer Mohit
- Synthetic QA Corpora Generation With Roundtrip Consistency Alberti Chris, Andor Daniel, Pitler Emily, Devlin Jacob, Collins Michael
- Fusion Of Detected Objects In Text For Visual Question Answering Alberti Chris, Ling Jeffrey, Collins Michael, Reitter David
- The Unreasonable Effectiveness Of Transformer Language Models In Grammatical Error Correction Alikaniotis Dimitrios, Raheja Vipul
- BERTQA -- Attention On Steroids Chadha Ankit, Sood Rewa
- An Empirical Study Of Generation Order For Machine Translation Chan William, Stern Mitchell, Kiros Jamie, Uszkoreit Jakob
- Distilling Knowledge Learned In BERT For Text Generation Chen Yen-chun, Gan Zhe, Cheng Yu, Liu Jingzhou, Liu Jingjing
- Attending To Entities For Better Text Understanding Cheng Pengxiang, Erk Katrin
- Learning Multi-level Information For Dialogue Response Selection By Highway Recurrent Transformer Chiang Ting-rui, Huang Chao-wei, Su Shang-yu, Chen Yun-nung
- Contrastive Multi-document Question Generation Cho Woon Sang, Zhang Yizhe, Rao Sudha, Celikyilmaz Asli, Xiong Chenyan, Gao Jianfeng, Wang Mengdi, Dolan Bill
- Exploring The Limits Of Transfer Learning With A Unified Text-to-text Transformer Raffel Colin, Shazeer Noam, Roberts Adam, Lee Katherine, Narang Sharan, Matena Michael, Zhou Yanqi, Li Wei, Liu Peter J.
- Deep Learning Based Chatbot Models Csaky Richard
- Transformer-xl: Attentive Language Models Beyond A Fixed-length Context Dai Zihang, Yang Zhilin, Yang Yiming, Carbonell Jaime, Le Quoc V., Salakhutdinov Ruslan
- The Second Conversational Intelligence Challenge (convai2) Dinan Emily, Logacheva Varvara, Malykh Valentin, Miller Alexander, Shuster Kurt, Urbanek Jack, Kiela Douwe, Szlam Arthur, Serban Iulian, Lowe Ryan, Prabhumoye Shrimai, Black Alan W, Rudnicky Alexander, Williams Jason, Pineau Joelle, Burtsev Mikhail, Weston Jason
- Unified Language Model Pre-training For Natural Language Understanding And Generation Dong Li, Yang Nan, Wang Wenhui, Wei Furu, Liu Xiaodong, Wang Yu, Gao Jianfeng, Zhou Ming, Hon Hsiao-wuen
- Neural Data-to-text Generation: A Comparison Between Pipeline And End-to-end Architectures Ferreira Thiago Castro, Van Der Lee Chris, Van Miltenburg Emiel, Krahmer Emiel
- TANDA: Transfer And Adapt Pre-trained Transformer Models For Answer Sentence Selection Garg Siddhant, Vu Thuy, Moschitti Alessandro
- Self-attentive Model For Headline Generation Gavrilov Daniil, Kalaidin Pavel, Malykh Valentin
- Span Selection Pre-training For Question Answering Glass Michael, Gliozzo Alfio, Chakravarti Rishav, Ferritto Anthony, Pan Lin, Bhargav G P Shrivatsa, Garg Dinesh, Sil Avirup
- Insertion-based Decoding With Automatically Inferred Generation Order Gu Jiatao, Liu Qi, Cho Kyunghyun
- Reweighted Proximal Pruning For Large-scale Language Representation Guo Fu-ming, Liu Sijia, Mungall Finlay S., Lin Xue, Wang Yanzhi
- Fine-tuning By Curriculum Learning For Non-autoregressive Neural Machine Translation Guo Junliang, Tan Xu, Xu Linli, Qin Tao, Chen Enhong, Liu Tie-yan
- Zero-shot Paraphrase Generation With Multilingual Language Models Guo Yinpeng, Liao Yi, Jiang Xin, Zhang Qing, Zhang Yibo, Liu Qun
- Exploring Neural Net Augmentation To BERT For Question Answering On SQUAD 2.0 Gupta Suhas
- Episodic Memory Reader: Learning What To Remember For Question Answering From Streaming Data Han Moonsu, Kang Minki, Jung Hyunwoo, Hwang Sung Ju
- Visualizing And Understanding The Effectiveness Of BERT Hao Yaru, Dong Li, Wei Furu, Xu Ke
- Analyzing The Forgetting Problem In The Pretrain-finetuning Of Dialogue Response Models He Tianxing, Liu Jun, Cho Kyunghyun, Ott Myle, Liu Bing, Glass James, Peng Fuchun
- Exposure Bias Versus Self-recovery: Are Distortions Really Incremental For Autoregressive Text Generation? He Tianxing, Zhang Jingzhao, Zhou Zhiming, Glass James
- Convert: Efficient And Accurate Conversational Representations From Transformers Henderson Matthew, Casanueva Iñigo, Mrkšić Nikola, Su Pei-hao, Wen Tsung-hsien, Vulić Ivan
- Efficient Adaptation Of Pretrained Transformers For Abstractive Summarization Hoang Andrew, Bosselut Antoine, Celikyilmaz Asli, Choi Yejin
- Exbert: A Visual Analysis Tool To Explore Learned Representations In Transformers Models Hoover Benjamin, Strobelt Hendrik, Gehrmann Sebastian
- Iterative Answer Prediction With Pointer-augmented Multimodal Transformers For Textvqa Hu Ronghang, Singh Amanpreet, Darrell Trevor, Rohrbach Marcus
- INSET: Sentence Infilling With Inter-sentential Transformer Huang Yichen, Zhang Yizhe, Elachqar Oussama, Cheng Yu
- Large-scale Representation Learning From Visually Grounded Untranscribed Speech Ilharco Gabriel, Zhang Yuan, Baldridge Jason
- Document-level Neural Machine Translation With Associated Memory Network Jiang Shu, Wang Rui, Li Zuchao, Utiyama Masao, Chen Kehai, Sumita Eiichiro, Zhao Hai, Lu Bao-liang
- Tinybert: Distilling BERT For Natural Language Understanding Jiao Xiaoqi, Yin Yichun, Shang Lifeng, Jiang Xin, Chen Xiao, Li Linlin, Wang Fang, Liu Qun
- All-in-one Image-grounded Conversational Agents Ju Da, Shuster Kurt, Boureau Y-lan, Weston Jason
- Microsoft Translator At WMT 2019: Towards Large-scale Document-level Neural Machine Translation Junczys-dowmunt Marcin
- Learning And Evaluating Contextual Embedding Of Source Code Kanade Aditya, Maniatis Petros, Balakrishnan Gogul, Shi Kensen
- Weak Supervision Helps Emergence Of Word-object Alignment And Improves Vision-language Tasks Kervadec Corentin, Antipov Grigory, Baccouche Moez, Wolf Christian
- CTRL: A Conditional Transformer Language Model For Controllable Generation Keskar Nitish Shirish, Mccann Bryan, Varshney Lav R., Xiong Caiming, Socher Richard
- Sample Efficient Text Summarization Using A Single Pre-trained Transformer Khandelwal Urvashi, Clark Kevin, Jurafsky Dan, Kaiser Lukasz
- Learning To Answer By Learning To Ask: Getting The Best Of GPT-2 And BERT Worlds Klein Tassilo, Nabi Moin
- Investigating Multilingual NMT Representations At Scale Kudugunta Sneha Reddy, Bapna Ankur, Caswell Isaac, Arivazhagan Naveen, Firat Orhan
- Question Generation From Paragraphs: A Tale Of Two Hierarchical Models Kumar Vishwajeet, Chaki Raktim, Talluri Sai Teja, Ramakrishnan Ganesh, Li Yuan-fang, Haffari Gholamreza
- ALBERT: A Lite BERT For Self-supervised Learning Of Language Representations Lan Zhenzhong, Chen Mingda, Goodman Sebastian, Gimpel Kevin, Sharma Piyush, Soricut Radu
- What Would Elsa Do? Freezing Layers During Transformer Fine-tuning Lee Jaejun, Tang Raphael, Lin Jimmy
- Patent Claim Generation By Fine-tuning Openai GPT-2 Lee Jieh-sheng, Hsiang Jieh
- BART: Denoising Sequence-to-sequence Pre-training For Natural Language Generation, Translation, And Comprehension Lewis Mike, Liu Yinhan, Goyal Naman, Ghazvininejad Marjan, Mohamed Abdelrahman, Levy Omer, Stoyanov Ves, Zettlemoyer Luke
- Bertsel: Answer Selection With Pre-trained Models Li Dongfang, Yu Yifei, Chen Qingcai, Li Xinyu
- Unicoder-vl: A Universal Encoder For Vision And Language By Cross-modal Pre-training Li Gen, Duan Nan, Fang Yuejian, Gong Ming, Jiang Daxin, Zhou Ming
- Big Bidirectional Insertion Representations For Documents Li Lala, Chan William
- Visualbert: A Simple And Performant Baseline For Vision And Language Li Liunian Harold, Yatskar Mark, Yin Da, Hsieh Cho-jui, Chang Kai-wei
- Relevance-promoting Language Model For Short-text Conversation Li Xin, Li Piji, Bi Wei, Liu Xiaojiang, Lam Wai
- Incremental Transformer With Deliberation Decoder For Document Grounded Conversations Li Zekang, Niu Cheng, Meng Fandong, Feng Yang, Li Qian, Zhou Jie
- Explicit Sentence Compression For Neural Machine Translation Li Zuchao, Wang Rui, Chen Kehai, Utiyama Masao, Sumita Eiichiro, Zhang Zhuosheng, Zhao Hai
- Probing Representations Learned By Multimodal Recurrent And Transformer Models Libovický Jindřich, Madhyastha Pranava
- MKD: A Multi-task Knowledge Distillation Approach For Pretrained Language Models Liu Linqing, Wang Huan, Lin Jimmy, Socher Richard, Xiong Caiming
- Roberta: A Robustly Optimized BERT Pretraining Approach Liu Yinhan, Ott Myle, Goyal Naman, Du Jingfei, Joshi Mandar, Chen Danqi, Levy Omer, Lewis Mike, Zettlemoyer Luke, Stoyanov Veselin
- Vilbert: Pretraining Task-agnostic Visiolinguistic Representations For Vision-and-language Tasks Lu Jiasen, Batra Dhruv, Parikh Devi, Lee Stefan
- Selective Attention For Context-aware Neural Machine Translation Maruf Sameen, Martins André F. T., Haffari Gholamreza
- Emergent Properties Of Finetuned Language Representation Models Matton Alexandre, De Oliveira Luke
- Structured Pruning Of A Bert-based Question Answering Model Mccarley J. S., Chakravarti Rishav, Sil Avirup
- Mogrifier LSTM Melis Gábor, Kočiský Tomáš, Blunsom Phil
- Megatron-lm: Training Multi-billion Parameter Language Models Using Model Parallelism Shoeybi Mohammad, Patwary Mostofa, Puri Raul, Legresley Patrick, Casper Jared, Catanzaro Bryan
- Neural Assistant: Joint Action Prediction, Response Generation, And Latent Knowledge Reasoning Neelakantan Arvind, Yavuz Semih, Narang Sharan, Prasad Vishaal, Goodrich Ben, Duckworth Daniel, Sankar Chinnadhurai, Yan Xifeng
- Efficient Attention Mechanism For Visual Dialog That Can Handle All The Interactions Between Multiple Inputs Nguyen Van-quang, Suganuma Masanori, Okatani Takayuki
- Dlgnet: A Transformer-based Model For Dialogue Response Generation Olabiyi Oluwatobi, Mueller Erik T.
- Adversarial Bootstrapping For Dialogue Model Training Olabiyi Oluwatobi, Mueller Erik T., Larson Christopher, Lahlou Tarek
- Stabilizing Transformers For Reinforcement Learning Parisotto Emilio, Song H. Francis, Rae Jack W., Pascanu Razvan, Gulcehre Caglar, Jayakumar Siddhant M., Jaderberg Max, Kaufman Raphael Lopez, Clark Aidan, Noury Seb, Botvinick Matthew M., Heess Nicolas, Hadsell Raia
- Thisiscompetition At Semeval-2019 Task 9: BERT Is Unstable For Out-of-domain Samples Park Cheoneum, Kim Juae, Lee Hyeon-gu, Amplayo Reinald Kim, Kim Harksoo, Seo Jungyun, Lee Changki
- Language Models As Knowledge Bases? Petroni Fabio, Rocktäschel Tim, Lewis Patrick, Bakhtin Anton, Wu Yuxiang, Miller Alexander H., Riedel Sebastian
- Probing Natural Language Inference Models Through Semantic Fragments Richardson Kyle, Hu Hai, Moss Lawrence S., Sabharwal Ashish
- Leveraging Pre-trained Checkpoints For Sequence Generation Tasks Rothe Sascha, Narayan Shashi, Severyn Aliaksei
- Masked Language Model Scoring Salazar Julian, Liang Davis, Nguyen Toan Q., Kirchhoff Katrin
- Hierarchical Reinforcement Learning For Open-domain Dialog Saleh Abdelrhman, Jaques Natasha, Ghandeharioun Asma, Shen Judy Hanwen, Picard Rosalind
- Do Neural Dialog Systems Use The Conversation History Effectively? An Empirical Study Sankar Chinnadhurai, Subramanian Sandeep, Pal Christopher, Chandar Sarath, Bengio Yoshua
- Generalization In Generation: A Closer Look At Exposure Bias Schmidt Florian
- Inducing Brain-relevant Bias In Natural Language Processing Models Schwartz Dan, Toneva Mariya, Wehbe Leila
- The Evolved Transformer So David R., Liang Chen, Le Quoc V.
- MASS: Masked Sequence To Sequence Pre-training For Language Generation Song Kaitao, Tan Xu, Qin Tao, Lu Jianfeng, Liu Tie-yan
- On NMT Search Errors And Model Errors: Cat Got Your Tongue? Stahlberg Felix, Byrne Bill
- CUED@WMT19:EWC&LMs Stahlberg Felix, Saunders Danielle, De Gispert Adria, Byrne Bill
- Insertion Transformer: Flexible Sequence Generation Via Insertion Operations Stern Mitchell, Chan William, Kiros Jamie, Uszkoreit Jakob
- Sesamebert: Attention For Anywhere Su Ta-chun, Cheng Hsiang-chih
- VL-BERT: Pre-training Of Generic Visual-linguistic Representations Su Weijie, Zhu Xizhou, Cao Yue, Li Bin, Lu Lewei, Wei Furu, Dai Jifeng
- Adaptive Attention Span In Transformers Sukhbaatar Sainbayar, Grave Edouard, Bojanowski Piotr, Joulin Armand
- Augmenting Self-attention With Persistent Memory Sukhbaatar Sainbayar, Grave Edouard, Lample Guillaume, Jegou Herve, Joulin Armand
- Generating Diverse Translation By Manipulating Multi-head Attention Sun Zewei, Huang Shujian, Wei Hao-ran, Dai Xin-yu, Chen Jiajun
- Syntax-infused Transformer And BERT Models For Machine Translation And Natural Language Understanding Sundararaman Dhanasekar, Subramanian Vivek, Wang Guoyin, Si Shijing, Shen Dinghan, Wang Dong, Carin Lawrence
- Olmpics -- On What Language Model Pre-training Captures Talmor Alon, Elazar Yanai, Goldberg Yoav, Berant Jonathan
- LXMERT: Learning Cross-modality Encoder Representations From Transformers Tan Hao, Bansal Mohit
- Waldorf: Wasteless Language-model Distillation On Reading-comprehension Tian James Yi, Kreuzer Alexander P., Chen Pai-hung, Will Hans-martin
- Repurposing Entailment For Multi-hop Question Answering Tasks Trivedi Harsh, Kwon Heeyoung, Khot Tushar, Sabharwal Ashish, Balasubramanian Niranjan
- Well-read Students Learn Better: On The Importance Of Pre-training Compact Models Turc Iulia, Chang Ming-wei, Lee Kenton, Toutanova Kristina
- How Does BERT Answer Questions? A Layer-wise Analysis Of Transformer Representations Van Aken Betty, Winter Benjamin, Löser Alexander, Gers Felix A.
- Quantity Doesn't Buy Quality Syntax With Neural Language Models Van Schijndel Marten, Mueller Aaron, Linzen Tal
- Visualizing Attention In Transformer-based Language Representation Models Vig Jesse
- Training Optimus Prime, M.D.: Generating Medical Certification Items By Fine-tuning Openai's Gpt2 Transformer Model Von Davier Matthias
- Language Models With Transformers Wang Chenguang, Li Mu, Smola Alexander J.
- Tree Transformer: Integrating Tree Structures Into Self-attention Wang Yau-shian, Lee Hung-yi, Chen Yun-nung
- Structured Pruning Of Large Language Models Wang Ziheng, Wohlwend Jeremy, Lei Tao
- Adapting And Evaluating A Deep Learning Language Model For Clinical Why-question Answering Wen Andrew, Elwazir Mohamed Y., Moon Sungrim, Fan Jungwei
- Improving Neural Machine Translation With Pre-trained Representation Weng Rongxiang, Yu Heng, Huang Shujian, Luo Weihua, Chen Jiajun
- Huggingface's Transformers: State-of-the-art Natural Language Processing Wolf Thomas, Debut Lysandre, Sanh Victor, Chaumond Julien, Delangue Clement, Moi Anthony, Cistac Pierric, Rault Tim, Louf Rémi, Funtowicz Morgan, Davison Joe, Shleifer Sam, Von Platen Patrick, Ma Clara, Jernite Yacine, Plu Julien, Xu Canwen, Scao Teven Le, Gugger Sylvain, Drame Mariama, Lhoest Quentin, Rush Alexander M.
- Transfertransfo: A Transfer Learning Approach For Neural Network Based Conversational Agents Wolf Thomas, Sanh Victor, Chaumond Julien, Delangue Clement
- Transformer-based Cascaded Multimodal Speech Translation Wu Zixiu, Caglayan Ozan, Ive Julia, Wang Josiah, Specia Lucia
- Sharing Attention Weights For Fast Transformer Xiao Tong, Li Yinqiao, Zhu Jingbo, Yu Zhengtao, Liu Tongran
- Review Conversational Reading Comprehension Xu Hu, Liu Bing, Shu Lei, Yu Philip S.
- Towards Making The Most Of BERT In Neural Machine Translation Yang Jiacheng, Wang Mingxuan, Zhou Hao, Zhao Chengqi, Yu Yong, Zhang Weinan, Li Lei
- Deepening Hidden Representations From Pre-trained Language Models Yang Junjie, Zhao Hai
- Data Augmentation For BERT Fine-tuning In Open-domain Question Answering Yang Wei, Xie Yuqing, Tan Luchen, Xiong Kun, Li Ming, Lin Jimmy
- Model Compression With Two-stage Multi-teacher Knowledge Distillation For Web Question Answering System Yang Ze, Shou Linjun, Gong Ming, Lin Wutao, Jiang Daxin
- Bp-transformer: Modelling Long-range Context Via Binary Partitioning Ye Zihao, Guo Qipeng, Gan Quan, Qiu Xipeng, Zhang Zheng
- Q8BERT: Quantized 8bit BERT Zafrir Ofir, Boudoukh Guy, Izsak Peter, Wasserblat Moshe
- PEGASUS: Pre-training With Extracted Gap-sentences For Abstractive Summarization Zhang Jingqing, Zhao Yao, Saleh Mohammad, Liu Peter J.
- Sequence-to-sequence Pre-training With Data Augmentation For Sentence Rewriting Zhang Yi, Ge Tao, Wei Furu, Zhou Ming, Sun Xu
- Dialogpt: Large-scale Generative Pre-training For Conversational Response Generation Zhang Yizhe, Sun Siqi, Galley Michel, Chen Yen-chun, Brockett Chris, Gao Xiang, Gao Jianfeng, Liu Jingjing, Dolan Bill
- Semantics-aware BERT For Language Understanding Zhang Zhuosheng, Wu Yuwei, Zhao Hai, Li Zuchao, Zhang Shuailiang, Zhou Xi, Zhou Xiang
- MUSE: Parallel Multi-scale Attention For Sequence To Sequence Learning Zhao Guangxiang, Sun Xu, Xu Jingjing, Zhang Zhiyuan, Luo Liangchen
- Synchronous Bidirectional Neural Machine Translation Zhou Long, Zhang Jiajun, Zong Chengqing
- Unified Vision-language Pre-training For Image Captioning And VQA Zhou Luowei, Palangi Hamid, Zhang Lei, Hu Houdong, Corso Jason J., Gao Jianfeng
- Encoder-agnostic Adaptation For Conditional Language Generation Ziegler Zachary M., Melas-kyriazi Luke, Gehrmann Sebastian, Rush Alexander M.
- How Much Knowledge Can You Pack Into The Parameters Of A Language Model? Roberts Adam, Raffel Colin, Shazeer Noam
- Paranoid Transformer: Reading Narrative Of Madness As Computational Approach To Creativity Agafonova Yana, Tikhonov Alexey, Yamshchikov Ivan P.
- Accelerating Natural Language Understanding In Task-oriented Dialog Ahuja Ojas, Desai Shrey
- Optimizing Transformer For Low-resource Neural Machine Translation Araabi Ali, Monz Christof
- Stress Test Evaluation Of Transformer-based Models In Natural Language Understanding Tasks Aspillaga Carlos, Carvallo Andrés, Araujo Vladimir
- Segatron: Segment-aware Transformer For Language Modeling And Understanding Bai He, Shi Peng, Lin Jimmy, Xie Yuqing, Tan Luchen, Xiong Kun, Gao Wen, Li Ming
- Character-level Transformer-based Neural Machine Translation Banar Nikolay, Daelemans Walter, Kestemont Mike
- Unilmv2: Pseudo-masked Language Models For Unified Language Model Pre-training Bao Hangbo, Dong Li, Wei Furu, Wang Wenhui, Yang Nan, Liu Xiaodong, Wang Yu, Piao Songhao, Gao Jianfeng, Zhou Ming, Hon Hsiao-wuen
- Co-attentional Transformers For Story-based Video Understanding Bebensee Björn, Zhang Byoung-tak
- Longformer: The Long-document Transformer Beltagy Iz, Peters Matthew E., Cohan Arman
- Adaptive Transformers For Learning Multimodal Representations Bhargava Prajjwal
- PALM: Pre-training An Autoencoding&autoregressive Language Model For Context-conditioned Generation Bi Bin, Li Chenliang, Wu Chen, Yan Ming, Wang Wei, Huang Songfang, Huang Fei, Si Luo
- Chatbot Interaction With Artificial Intelligence: Human Data Augmentation With T5 And Language Transformer Ensemble For Text Classification Bird Jordan J., Ekárt Anikó, Faria Diego R.
- Language Models Are Few-shot Learners Brown Tom B., Mann Benjamin, Ryder Nick, Subbiah Melanie, Kaplan Jared, Dhariwal Prafulla, Neelakantan Arvind, Shyam Pranav, Sastry Girish, Askell Amanda, Agarwal Sandhini, Herbert-voss Ariel, Krueger Gretchen, Henighan Tom, Child Rewon, Ramesh Aditya, Ziegler Daniel M., Wu Jeffrey, Winter Clemens, Hesse Christopher, Chen Mark, Sigler Eric, Litwin Mateusz, Gray Scott, Chess Benjamin, Clark Jack, Berner Christopher, Mccandlish Sam, Radford Alec, Sutskever Ilya, Amodei Dario
- Advances Of Transformer-based Models For News Headline Generation Bukhtiyarov Alexey, Gusev Ilya
- DIET: Lightweight Language Understanding For Dialogue Systems Bunk Tanja, Varshneya Daksh, Vlasov Vladimir, Nichol Alan
- Memory Transformer Burtsev Mikhail S., Kuratov Yuri, Peganov Anton, Sapunov Grigory V.
- Behind The Scene: Revealing The Secrets Of Pre-trained Vision-and-language Models Cao Jize, Gan Zhe, Cheng Yu, Yu Licheng, Chen Yen-chun, Liu Jingjing
- Syntax-driven Iterative Expansion Language Models For Controllable Text Generation Casas Noe, Fonollosa José A. R., Costa-jussà Marta R.
- Simultaneous Paraphrasing And Translation By Fine-tuning Transformer Models Chada Rakesh
- Exploring Fluent Query Reformulations With Text-to-text Transformers And Reinforcement Learning Chen Jerry Zikun, Yu Shi, Wang Haoran
- Adaptive Bi-directional Attention: Exploring Multi-granularity Representations For Machine Reading Comprehension Chen Nuo, Liu Fenglin, You Chenyu, Zhou Peilin, Zou Yuexian
- Logical Natural Language Generation From Open-domain Tables Chen Wenhu, Chen Jianshu, Su Yu, Chen Zhiyu, Wang William Yang
- Reasoning In Dialog: Improving Response Generation By Context Reading Comprehension Chen Xiuying, Cui Zhi, Zhang Jiayi, Wei Chen, Cui Jianwei, Wang Bin, Zhao Dongyan, Yan Rui
- Model Selection For Cross-lingual Transfer Chen Yang, Ritter Alan
- Accurate Word Alignment Induction From Neural Machine Translation Chen Yun, Liu Yang, Chen Guanhua, Jiang Xin, Liu Qun
- Comparative Study Of Language Models On Cross-domain Data With Model Agnostic Explainability Chhipa Mayank, Vazurkar Hrushikesh Mahesh, Kumar Abhijeet, Mishra Mridul
- X-LXMERT: Paint, Caption And Answer Questions With Multi-modal Transformers Cho Jaemin, Lu Jiasen, Schwenk Dustin, Hajishirzi Hannaneh, Kembhavi Aniruddha
- Rethinking Embedding Coupling In Pre-trained Language Models Chung Hyung Won, Févry Thibault, Tsai Henry, Johnson Melvin, Ruder Sebastian
- Transformers As Soft Reasoners Over Language Clark Peter, Tafjord Oyvind, Richardson Kyle
- Pymt5: Multi-mode Translation Of Natural Language And Python Code With Transformers Clement Colin B., Drain Dawn, Timcheck Jonathan, Svyatkovskiy Alexey, Sundaresan Neel
- Cascaded Text Generation With Markov Transformers Deng Yuntian, Rush Alexander M.
- Calibration Of Pre-trained Transformers Desai Shrey, Durrett Greg
- Ernie-doc: A Retrospective Long-document Modeling Transformer Ding Siyu, Shang Junyuan, Wang Shuohuan, Sun Yu, Tian Hao, Wu Hua, Wang Haifeng
- Gshard: Scaling Giant Models With Conditional Computation And Automatic Sharding Lepikhin Dmitry, Lee Hyoukjoong, Xu Yuanzhong, Chen Dehao, Firat Orhan, Huang Yanping, Krikun Maxim, Shazeer Noam, Chen Zhifeng
- Fine-tuning Pretrained Language Models: Weight Initializations, Data Orders, And Early Stopping Dodge Jesse, Ilharco Gabriel, Schwartz Roy, Farhadi Ali, Hajishirzi Hannaneh, Smith Noah
- Sound Natural: Content Rephrasing In Dialog Systems Einolghozati Arash, Gupta Anchit, Diedrick Keith, Gupta Sonal
- Turngpt: A Transformer-based Language Model For Predicting Turn-taking In Spoken Dialog Ekstedt Erik, Skantze Gabriel
- A Comparison Of LSTM And BERT For Small Corpus Ezen-can Aysu
- The Unreasonable Volatility Of Neural Machine Translation Models Fadaee Marzieh, Monz Christof
- Augmenting Transformers With Knn-based Composite Memory For Dialogue Fan Angela, Gardent Claire, Braud Chloe, Bordes Antoine
- Addressing Some Limitations Of Transformers With Feedback Memory Fan Angela, Lavril Thibaut, Grave Edouard, Joulin Armand, Sukhbaatar Sainbayar
- CERT: Contrastive Self-supervised Learning For Language Understanding Fang Hongchao, Wang Sicheng, Zhou Meng, Ding Jiayuan, Xie Pengtao
- Paragraph-level Commonsense Transformers With Recurrent Memory Gabriel Saadia, Bhagavatula Chandra, Shwartz Vered, Bras Ronan Le, Forbes Maxwell, Choi Yejin
- Compressing Large-scale Transformer-based Models: A Case Study On BERT Ganesh Prakhar, Chen Yao, Lou Xin, Khan Mohammad Ali, Yang Yin, Sajjad Hassan, Nakov Preslav, Chen Deming, Winslett Marianne
- Robust Conversational AI With Grounded Text Generation Gao Jianfeng, Peng Baolin, Li Chunyuan, Li Jinchao, Shayandeh Shahin, Liden Lars, Shum Heung-yeung
- Relevance Transformer: Generating Concise Code Snippets With Relevance Feedback Gemmell Carlos, Rossetto Federico, Dalton Jeffrey
- ISAAQ -- Mastering Textbook Questions With Pre-trained Transformers And Bottom-up And Top-down Attention Gomez-perez Jose Manuel, Ortega Raul
- It's Not Greek To Mbert: Inducing Word-level Translations From Multilingual BERT Gonen Hila, Ravfogel Shauli, Elazar Yanai, Goldberg Yoav
- Recurrent Chunking Mechanisms For Long-text Machine Reading Comprehension Gong Hongyu, Shen Yelong, Yu Dian, Chen Jianshu, Yu Dong
- Measuring Systematic Generalization In Neural Proof Generation With Transformers Gontier Nicolas, Sinha Koustuv, Reddy Siva, Pal Christopher
- Are Neural Open-domain Dialog Systems Robust To Speech Recognition Errors In The Dialog History? An Empirical Study Gopalakrishnan Karthik, Hedayatnia Behnam, Wang Longshaokan, Liu Yang, Hakkani-tur Dilek
- Contextualize Knowledge Bases With Transformer For End-to-end Task-oriented Dialogue Systems Gou Yanjie, Lei Yinjie, Liu Lingqiao, Dai Yong, Shen Chunxu
- A Tailored Pre-training Model For Task-oriented Dialog Generation Gu Jing, Wu Qingyang, Wu Chongruo, Shi Weiyan, Yu Zhou
- Dialogbert: Discourse-aware Response Generation Via Learning To Recover And Rank Utterances Gu Xiaodong, Yoo Kang Min, Ha Jung-woo
- Sequence-level Mixed Sample Data Augmentation Guo Demi, Kim Yoon, Rush Alexander M.
- Incorporating BERT Into Parallel Sequence Decoding With Adapters Guo Junliang, Zhang Zhirui, Xu Linli, Wei Hao-ran, Chen Boxing, Chen Enhong
- GMAT: Global Memory Augmentation For Transformers Gupta Ankit, Berant Jonathan
- BERT Based Multilingual Machine Comprehension In English And Hindi Gupta Somil, Khade Nilesh
- Improvement Of A Dedicated Model For Open Domain Persona-aware Dialogue Generation Han Qiang
- ECONET: Effective Continual Pretraining Of Language Models For Event Temporal Reasoning Han Rujun, Ren Xiang, Peng Nanyun
- Towards Learning A Generic Agent For Vision-and-language Navigation Via Pre-training Hao Weituo, Li Chunyuan, Li Xiujun, Carin Lawrence, Gao Jianfeng
- Exploring Explainable Selection To Control Abstractive Summarization Wang Haonan, Gao Yang, Bai Yu, Lapata Mirella, Huang Heyan
- Enriched Pre-trained Transformers For Joint Slot Filling And Intent Detection Hardalov Momchil, Koychev Ivan, Nakov Preslav
- Guided Transformer: Leveraging Multiple External Sources For Representation Learning In Conversational Search Hashemi Helia, Zamani Hamed, Croft W. Bruce
- Investigating Representations Of Verb Bias In Neural Language Models Hawkins Robert D., Yamakoshi Takateru, Griffiths Thomas L., Goldberg Adele E.
- Pretrained Transformers Improve Out-of-distribution Robustness Hendrycks Dan, Liu Xiaoyuan, Wallace Eric, Dziedzic Adam, Krishnan Rishabh, Song Dawn
- PAIR: Planning And Iterative Refinement In Pre-trained Transformers For Long Text Generation Hua Xinyu, Wang Lu
- TRANS-BLSTM: Transformer With Bidirectional LSTM For Language Understanding Huang Zhiheng, Xu Peng, Liang Davis, Mishra Ajay, Xiang Bing
- Exploring BERT Parameter Efficiency On The Stanford Question Answering Dataset V2.0 Hulburd Eric
- Memory Attentive Fusion: External Language Model Integration For Transformer-based Sequence-to-sequence Model Ihori Mana, Masumura Ryo, Makishima Naoki, Tanaka Tomohiro, Takashima Akihiko, Orihashi Shota
- Probing Pretrained Language Models For Lexical Semantics Vulić Ivan, Ponti Edoardo Maria, Litschko Robert, Glavaš Goran, Korhonen Anna
- TLDR: Token Loss Dynamic Reweighting For Reducing Repetitive Utterance Generation Jiang Shaojie, Wolf Thomas, Monz Christof, De Rijke Maarten
- How Can We Know When Language Models Know? On The Calibration Of Language Models For Question Answering Jiang Zhengbao, Araki Jun, Ding Haibo, Neubig Graham
- Text-to-text Pre-training For Data-to-text Tasks Kale Mihir, Rastogi Abhinav
- Meta-context Transformers For Domain-specific Response Generation Kar Debanjana, Samanta Suranjana, Azad Amar Prakash
- Do Fine-tuned Commonsense Language Models Really Generalize? Kejriwal Mayank, Shen Ke
- SMRT Chatbots: Improving Non-task-oriented Dialog With Simulated Multiple Reference Training Khayrallah Huda, Sedoc João
- Code Prediction By Feeding Trees To Transformers Kim Seohyun, Zhao Jinman, Tian Yuchi, Chandra Satish
- Current Limitations Of Language Models: What You Need Is Retrieval Komatsuzaki Aran
- Syntactic Structure Distillation Pretraining For Bidirectional Encoders Kuncoro Adhiguna, Kong Lingpeng, Fried Daniel, Yogatama Dani, Rimell Laura, Dyer Chris, Blunsom Phil
- Fid-ex: Improving Sequence-to-sequence Models For Extractive Rationale Generation Lakhotia Kushal, Paranjape Bhargavi, Ghoshal Asish, Yih Wen-tau, Mehdad Yashar, Iyer Srinivasan
- Utilizing Bidirectional Encoder Representations From Transformers For Answer Selection Laskar Md Tahmid Rahman, Hoque Enamul, Huang Jimmy Xiangji
- Context-based Transformer Models For Answer Sentence Selection Lauriola Ivano, Moschitti Alessandro
- Common Sense Or World Knowledge? Investigating Adapter-based Knowledge Injection Into Pretrained Transformers Lauscher Anne, Majewska Olga, Ribeiro Leonardo F. R., Gurevych Iryna, Rozanov Nikolai, Glavaš Goran
- From Zero To Hero: On The Limitations Of Zero-shot Cross-lingual Transfer With Multilingual Transformers Lauscher Anne, Ravishankar Vinit, Vulić Ivan, Glavaš Goran
- SLM: Learning A Discourse Language Representation With Sentence Unshuffling Lee Haejun, Hudson Drew A., Lee Kangwook, Manning Christopher D.
- MART: Memory-augmented Recurrent Transformer For Coherent Video Paragraph Captioning Lei Jie, Wang Liwei, Shen Yelong, Yu Dong, Berg Tamara L., Bansal Mohit
- Efficient Transformer-based Large Scale Language Representations Using Hardware-friendly Block Structured Pruning Li Bingbing, Kong Zhenglun, Zhang Tianyun, Li Ji, Li Zhengang, Liu Hang, Ding Caiwen
- Transformers To Learn Hierarchical Contexts In Multiparty Dialogue For Span-based Question Answering Li Changmao, Choi Jinho D.
- Dialogue-adaptive Language Model Pre-training From Quality Estimation Li Junlong, Zhang Zhuosheng, Zhao Hai
- A Closer Look At The Robustness Of Vision-and-language Pre-trained Models Li Linjie, Gan Zhe, Liu Jingjing
- An Empirical Investigation Of Pre-trained Transformer Language Models For Open-domain Dialogue Generation Li Piji
- TMT: A Transformer-based Modal Translator For Improving Multimodal Sequence Representations In Audio Visual Scene-aware Dialog Li Wubo, Jiang Dongwei, Zou Wei, Li Xiangang
- Enhancing Dialogue Generation Via Multi-level Contrastive Learning Li Xin, Li Piji, Wang Yan, Liu Xiaojiang, Lam Wai
- Mapping Natural Language Instructions To Mobile UI Action Sequences Li Yang, He Jiacong, Zhou Xin, Zhang Yuan, Baldridge Jason
- Rewriter-evaluator Architecture For Neural Machine Translation Li Yangming, Yao Kaisheng
- Neural Machine Translation With Joint Representation Li Yanyang, Wang Qiang, Xiao Tong, Liu Tongran, Zhu Jingbo
- Bridging Text And Video: A Universal Multimodal Transformer For Video-audio Scene-aware Dialog Li Zekang, Li Zongjia, Zhang Jinchao, Feng Yang, Niu Cheng, Zhou Jie
- Train Large, Then Compress: Rethinking Model Size For Efficient Training And Inference Of Transformers Li Zhuohan, Wallace Eric, Shen Sheng, Lin Kevin, Keutzer Kurt, Klein Dan, Gonzalez Joseph E.
- Probabilistically Masked Language Model Capable Of Autoregressive Generation In Arbitrary Word Order Liao Yi, Jiang Xin, Liu Qun
- Autoregressive Knowledge Distillation Through Imitation Learning Lin Alexander, Wohlwend Jeremy, Chen Howard, Lei Tao
- Multi-node Bert-pretraining: Cost-efficient Approach Lin Jiahuang, Li Xin, Pekhimenko Gennady
- Conversational Question Reformulation Via Sequence-to-sequence Architectures And Pretrained Language Models Lin Sheng-chieh, Yang Jheng-hong, Nogueira Rodrigo, Tsai Ming-feng, Wang Chuan-ju, Lin Jimmy
- Readonce Transformers: Reusable Representations Of Text For Transformers Lin Shih-ting, Sabharwal Ashish, Khot Tushar
- Variational Transformers For Diverse Response Generation Lin Zhaojiang, Winata Genta Indra, Xu Peng, Liu Zihan, Fung Pascale
- Mt5: A Massively Multilingual Pre-trained Text-to-text Transformer Xue Linting, Constant Noah, Roberts Adam, Kale Mihir, Al-rfou Rami, Siddhant Aditya, Barua Aditya, Raffel Colin
- TIME: Text And Image Mutual-translation Adversarial Networks Liu Bingchen, Song Kunpeng, Zhu Yizhe, De Melo Gerard, Elgammal Ahmed
- Tell Me How To Ask Again: Question Data Augmentation With Controllable Rewriting In Continuous Space Liu Dayiheng, Gong Yeyun, Fu Jie, Yan Yu, Chen Jiusheng, Lv Jiancheng, Duan Nan, Zhou Ming
- Task-level Curriculum Learning For Non-autoregressive Neural Machine Translation Liu Jinglin, Ren Yi, Tan Xu, Zhang Chen, Qin Tao, Zhao Zhou, Liu Tie-yan
- Understanding The Difficulty Of Training Transformers Liu Liyuan, Liu Xiaodong, Gao Jianfeng, Chen Weizhu, Han Jiawei
- Adversarial Training For Large Neural Language Models Liu Xiaodong, Cheng Hao, He Pengcheng, Chen Weizhu, Wang Yu, Poon Hoifung, Gao Jianfeng
- Very Deep Transformers For Neural Machine Translation Liu Xiaodong, Duh Kevin, Liu Liyuan, Gao Jianfeng
- Simplifying Paragraph-level Question Generation Via Transformer Language Models Lopez Luis Enrico, Cruz Diane Kathryn, Cruz Jan Christian Blaise, Cheng Charibeth
- VECO: Variable And Flexible Cross-lingual Pre-training For Language Understanding And Generation Luo Fuli, Wang Wei, Liu Jiahao, Liu Yijia, Bi Bin, Huang Songfang, Huang Fei, Si Luo
- Language Models As Few-shot Learner For Task-oriented Dialogue Systems Madotto Andrea, Liu Zihan, Lin Zhaojiang, Fung Pascale
- Incremental Processing In The Age Of Non-incremental Encoders: An Empirical Assessment Of Bidirectional Models For Incremental NLU Madureira Brielen, Schlangen David
- Gpt-too: A Language-model-first Approach For Amr-to-text Generation Mager Manuel, Astudillo Ramon Fernandez, Naseem Tahira, Sultan Md Arafat, Lee Young-suk, Florian Radu, Roukos Salim
- Verb Knowledge Injection For Multilingual Event Processing Majewska Olga, Vulić Ivan, Glavaš Goran, Ponti Edoardo M., Korhonen Anna
- Natural Language Rationales With Full-stack Visual Reasoning: From Pixels To Semantic Frames To Commonsense Graphs Marasović Ana, Bhagavatula Chandra, Park Jae Sung, Bras Ronan Le, Smith Noah A., Choi Yejin
- Delight: Deep And Light-weight Transformer Mehta Sachin, Ghazvininejad Marjan, Iyer Srinivasan, Zettlemoyer Luke, Hajishirzi Hannaneh
- Wechat Neural Machine Translation Systems For WMT20 Meng Fandong, Yan Jianhao, Liu Yijin, Gao Yuan, Zeng Xianfeng, Zeng Qinsong, Li Peng, Chen Ming, Zhou Jie, Liu Sifan, Zhou Hao
- What Happens To BERT Embeddings During Fine-tuning? Merchant Amil, Rahimtoroghi Elahe, Pavlick Ellie, Tenney Ian
- Lambert: Language And Action Learning Using Multimodal BERT Miyazawa Kazuki, Aoki Tatsuya, Horii Takato, Nagai Takayuki
- Modern Methods For Text Generation Montesinos Dimas Munoz
- On The Stability Of Fine-tuning BERT: Misconceptions, Explanations, And Strong Baselines Mosbach Marius, Andriushchenko Maksym, Klakow Dietrich
- NABU - Multilingual Graph-based Neural RDF Verbalizer Moussallem Diego, Gnaneshwar Dwaraknath, Ferreira Thiago Castro, Ngomo Axel-cyrille Ngonga
- Axformer: Accuracy-driven Approximation Of Transformers For Faster, Smaller And More Accurate NLP Models Nagarajan Amrit, Sen Sanchari, Stevens Jacob R., Raghunathan Anand
- Stepwise Extractive Summarization And Planning With Structured Transformers Narayan Shashi, Maynez Joshua, Adamek Jakub, Pighin Daniele, Bratanič Blaž, Mcdonald Ryan
- I Like Fish, Especially Dolphins: Addressing Contradictions In Dialogue Modeling Nie Yixin, Williamson Mary, Bansal Mohit, Kiela Douwe, Weston Jason
- Avgout: A Simple Output-probability Measure To Eliminate Dull Responses Niu Tong, Bansal Mohit
- Real-time Execution Of Large-scale Language Models On Mobile Niu Wei, Kong Zhenglun, Yuan Geng, Jiang Weiwen, Guan Jiexiong, Ding Caiwen, Zhao Pu, Liu Sijia, Ren Bin, Wang Yanzhi
- The Chess Transformer: Mastering Play Using Generative Language Models Noever David, Ciolino Matt, Kalin Josh
- Better Distractions: Transformer-based Distractor Generation And Multiple Choice Question Filtering Offerijns Jeroen, Verberne Suzan, Verhoef Tessa
- Facts2story: Controlling Text Generation By Key Facts Orbach Eyal, Goldberg Yoav
- Challenges And Thrills Of Legal Arguments Pallaprolu Anurag, Vaidya Radha, Attawar Aditya Swaroop
- Multilingual BERT Post-pretraining Alignment Pan Lin, Hang Chung-wei, Qi Haode, Shah Abhishek, Potdar Saloni, Yu Mo
- X-linear Attention Networks For Image Captioning Pan Yingwei, Yao Ting, Li Yehao, Mei Tao
- On Task-level Dialogue Composition Of Generative Transformer Model Parthasarathi Prasanna, Neelakantan Arvind, Narang Sharan
- How To Evaluate Your Dialogue System: Probe Tasks As An Alternative For Token-level Evaluation Metrics Parthasarathi Prasanna, Pineau Joelle, Chandar Sarath
- Telling Bert's Full Story: From Local Attention To Global Aggregation Pascual Damian, Brunner Gino, Wattenhofer Roger
- Retrieval-augmented Generation For Knowledge-intensive NLP Tasks Lewis Patrick, Perez Ethan, Piktus Aleksandra, Petroni Fabio, Karpukhin Vladimir, Goyal Naman, Küttler Heinrich, Lewis Mike, Yih Wen-tau, Rocktäschel Tim, Riedel Sebastian, Kiela Douwe
- SOLOIST: Building Task Bots At Scale With Transfer Learning And Machine Teaching Peng Baolin, Li Chunyuan, Li Jinchao, Shayandeh Shahin, Liden Lars, Gao Jianfeng
- What Does BERT Know About Books, Movies And Music? Probing BERT For Conversational Recommendation Penha Gustavo, Hauff Claudia
- Adapterhub: A Framework For Adapting Transformers Pfeiffer Jonas, Rücklé Andreas, Poth Clifton, Kamath Aishwarya, Vulić Ivan, Ruder Sebastian, Cho Kyunghyun, Gurevych Iryna
- Sparsifying Transformer Models With Trainable Representation Pooling Pietruszka Michał, Borchmann Łukasz, Garncarek Łukasz
- Interactive Teaching For Conversational AI Ping Qing, Niu Feiyang, Thattai Govind, Chengottusseriyil Joel, Gao Qiaozi, Reganti Aishwarya, Rajagopal Prashanth, Tur Gokhan, Hakkani-tur Dilek, Nataraja Prem
- Compressing Transformer-based Semantic Parsing Models Using Compositional Code Embeddings Prakash Prafull, Shashidhar Saurabh Kumar, Zhao Wenlong, Rongali Subendhu, Khan Haidar, Kayser Michael
- Intermediate-task Transfer Learning With Pretrained Models For Natural Language Understanding: When And Why Does It Work? Pruksachatkun Yada, Phang Jason, Liu Haokun, Htut Phu Mon, Zhang Xiaoyi, Pang Richard Yuanzhe, Vania Clara, Kann Katharina, Bowman Samuel R.
- Training Large Neural Networks With Constant Memory Using A New Execution Algorithm Pudipeddi Bharadwaj, Mesmakhosroshahi Maral, Xi Jinwen, Bharadwaj Sujeeth
- BANG: Bridging Autoregressive And Non-autoregressive Generation With Large Scale Pretraining Qi Weizhen, Gong Yeyun, Jiao Jian, Yan Yu, Chen Weizhu, Liu Dayiheng, Tang Kewen, Li Houqiang, Chen Jiusheng, Zhang Ruofei, Zhou Ming, Duan Nan
- How Fine Can Fine-tuning Be? Learning Efficient Language Models Radiya-dixit Evani, Wang Xin
- Exploring Recurrent, Memory And Attention Based Architectures For Scoring Interactional Aspects Of Human-machine Text Dialog Ramanarayanan Vikram, Mulholland Matthew, Ghosh Debanjan
- Transquest At WMT2020: Sentence-level Direct Assessment Ranasinghe Tharindu, Orasan Constantin, Mitkov Ruslan
- Investigating Pretrained Language Models For Graph-to-text Generation Ribeiro Leonardo F. R., Schmitt Martin, Schütze Hinrich, Gurevych Iryna
- Hierarchical GPT With Congruent Transformers For Multi-sentence Language Models Roh Jihyeon, Gim Huiseong, Lee Soo-young
- Knowledge-aware Language Model Pretraining Rosset Corby, Xiong Chenyan, Phan Minh, Song Xia, Bennett Paul, Tiwary Saurabh
- Fine-tuning BERT For Schema-guided Zero-shot Dialogue State Tracking Ruan Yu-ping, Ling Zhen-hua, Gu Jia-chen, Liu Quan
- Stronger Transformers For Neural Multi-hop Question Generation Sachan Devendra Singh, Wu Lingfei, Sachan Mrinmaya, Hamilton William
- Prover: Proof Generation For Interpretable Reasoning Over Rules Saha Swarnadeep, Ghosh Sayan, Srivastava Shashank, Bansal Mohit
- On The Effect Of Dropping Layers Of Pre-trained Transformer Models Sajjad Hassan, Dalvi Fahim, Durrani Nadir, Nakov Preslav
- Hierarchical Transformer For Task Oriented Dialog Systems Santra Bishal, Anusha Potnuru, Goyal Pawan
- Look Before You Speak: Visually Contextualized Utterances Seo Paul Hongsuck, Nagrani Arsha, Schmid Cordelia
- End-to-end Synthetic Data Generation For Domain Adaptation Of Question Answering Systems Shakeri Siamak, Santos Cicero Nogueira Dos, Zhu Henry, Ng Patrick, Nan Feng, Wang Zhiguo, Nallapati Ramesh, Xiang Bing
- Data-efficient Methods For Dialogue Systems Shalyminov Igor
- Hybrid Generative-retrieval Transformers For Dialogue Domain Adaptation Shalyminov Igor, Sordoni Alessandro, Atkinson Adam, Schulz Hannes
- Russiansuperglue: A Russian Language Understanding Evaluation Benchmark Shavrina Tatiana, Fenogenova Alena, Emelyanov Anton, Shevelev Denis, Artemova Ekaterina, Malykh Valentin, Mikhailov Vladislav, Tikhonova Maria, Chertok Andrey, Evlampiev Andrey
- A Simple But Tough-to-beat Data Augmentation Approach For Natural Language Understanding And Generation Shen Dinghan, Zheng Mingzhi, Shen Yelong, Qu Yanru, Chen Weizhu
- Pre-trained Summarization Distillation Shleifer Sam, Rush Alexander M.
- Saying No Is An Art: Contextualized Fallback Responses For Unanswerable Dialogue Queries Shrivastava Ashish, Dhole Kaustubh, Bhatt Abhinav, Raghunath Sharvani
- Unnatural Language Inference Sinha Koustuv, Parthasarathi Prasanna, Pineau Joelle, Williams Adina
- Lightpaff: A Two-stage Distillation Framework For Pre-training And Fine-tuning Song Kaitao, Sun Hao, Tan Xu, Qin Tao, Lu Jianfeng, Liu Hongzhi, Liu Tie-yan
- Neural Machine Translation With Error Correction Song Kaitao, Tan Xu, Lu Jianfeng
- Recipes For Adapting Pre-trained Monolingual And Multilingual Models To Machine Translation Stickland Asa Cooper, Li Xian, Ghazvininejad Marjan
- Bert-hlstms: BERT And Hierarchical Lstms For Visual Storytelling Su Jing, Dai Qingyun, Guerin Frank, Zhou Mian
- Discovering Useful Sentence Representations From Large Pretrained Language Models Subramani Nishant, Suresh Nivedita
- Multi-scale Transformer Language Models Subramanian Sandeep, Collobert Ronan, Ranzato Marc'aurelio, Boureau Y-lan
- Explain And Improve: Lrp-inference Fine-tuning For Image Captioning Models Sun Jiamei, Lapuschkin Sebastian, Samek Wojciech, Binder Alexander
- Intellicode Compose: Code Generation Using Transformer Svyatkovskiy Alexey, Deng Shao Kun, Fu Shengyu, Sundaresan Neel
- Proofwriter: Generating Implications, Proofs, And Abductive Statements Over Natural Language Tafjord Oyvind, Mishra Bhavana Dalvi, Clark Peter
- Let's Stop Incorrect Comparisons In End-to-end Relation Extraction! Taillé Bruno, Guigue Vincent, Scoutheeten Geoffrey, Gallinari Patrick
- Edgebert: Sentence-level Energy Optimizations For Latency-aware Multi-task NLP Inference Tambe Thierry, Hooper Coleman, Pentecost Lillian, Jia Tianyu, Yang En-yu, Donato Marco, Sanh Victor, Whatmough Paul N., Rush Alexander M., Brooks David, Wei Gu-yeon
- Synthesizer: Rethinking Self-attention In Transformer Models Tay Yi, Bahri Dara, Metzler Donald, Juan Da-cheng, Zhao Zhe, Zheng Che
- Making Pre-trained Language Models Better Few-shot Learners Gao Tianyu, Fisch Adam, Chen Danqi
- On Optimal Transformer Depth For Low-resource Language Translation Van Biljon Elan, Pretorius Arnu, Kreutzer Julia
- Unsupervised Pretraining For Neural Machine Translation Using Elastic Weight Consolidation Variš Dušan, Bojar Ondřej
- Bertnesia: Investigating The Capture And Forgetting Of Knowledge In BERT Wallat Jonas, Singh Jaspreet, Anand Avishek
- Multi-task Learning With Multi-head Attention For Multi-choice Reading Comprehension Wan Hui
- Infobert: Improving Robustness Of Language Models From An Information Theoretic Perspective Wang Boxin, Wang Shuohang, Cheng Yu, Gan Zhe, Jia Ruoxi, Li Bo, Liu Jingjing
- HAT: Hardware-aware Transformers For Efficient Natural Language Processing Wang Hanrui, Wu Zhanghao, Liu Zhijian, Cai Han, Zhu Ligeng, Gan Chuang, Han Song
- Encoding Syntactic Knowledge In Transformer Encoder For Intent Detection And Slot Filling Wang Jixuan, Wei Kai, Radfar Martin, Zhang Weiwei, Chung Clement
- Minilm: Deep Self-attention Distillation For Task-agnostic Compression Of Pre-trained Transformers Wang Wenhui, Wei Furu, Dong Li, Bao Hangbo, Yang Nan, Zhou Ming
- Rethinking The Value Of Transformer Components Wang Wenxuan, Tu Zhaopeng
- Lightseq: A High Performance Inference Library For Transformers Wang Xiaohui, Xiong Ying, Wei Yang, Wang Mingxuan, Li Lei
- VD-BERT: A Unified Vision And Dialog Transformer With BERT Wang Yue, Joty Shafiq, Lyu Michael R., King Irwin, Xiong Caiming, Hoi Steven C. H.
- Towards Faithful Neural Table-to-text Generation With Content-matching Constraints Wang Zhenyi, Wang Xiaoyang, An Bang, Yu Dong, Chen Changyou
- Contextual Neural Machine Translation Improves Translation Of Cataphoric Pronouns Wong Kayyen, Maruf Sameen, Haffari Gholamreza
- Cort: Complementary Rankings From Transformers Wrzalik Marco, Krechel Dirk
- TOD-BERT: Pre-trained Natural Language Understanding For Task-oriented Dialogue Wu Chien-sheng, Hoi Steven, Socher Richard, Xiong Caiming
- A Controllable Model Of Grounded Response Generation Wu Zeqiu, Galley Michel, Brockett Chris, Zhang Yizhe, Gao Xiang, Quirk Chris, Koncel-kedziorski Rik, Gao Jianfeng, Hajishirzi Hannaneh, Ostendorf Mari, Dolan Bill
- Lite Transformer With Long-short Range Attention Wu Zhanghao, Liu Zhijian, Lin Ji, Lin Yujun, Han Song
- Context-guided BERT For Targeted Aspect-based Sentiment Analysis Wu Zhengxuan, Ong Desmond C.
- ERNIE-GEN: An Enhanced Multi-flow Pre-training And Fine-tuning Framework For Natural Language Generation Xiao Dongling, Zhang Han, Li Yukun, Sun Yu, Tian Hao, Wu Hua, Wang Haifeng
- EDITOR: An Edit-based Transformer With Repositioning For Neural Machine Translation With Soft Lexical Constraints Xu Weijia, Carpuat Marine
- Improving BERT Fine-tuning Via Self-ensemble And Self-distillation Xu Yige, Qiu Xipeng, Zhou Ligao, Huang Xuanjing
- MC-BERT: Efficient Language Pre-training Via A Meta Controller Xu Zhenhui, Gong Linyuan, Ke Guolin, He Di, Zheng Shuxin, Wang Liwei, Bian Jiang, Liu Tie-yan
- Multi-unit Transformers For Neural Machine Translation Yan Jianhao, Meng Fandong, Zhou Jie
- Off-policy Self-critical Training For Transformer In Visual Paragraph Generation Yan Shiyang, Hua Yang, Robertson Neil M.
- Just Ask: Learning To Answer Questions From Millions Of Narrated Videos Yang Antoine, Miech Antoine, Sivic Josef, Laptev Ivan, Schmid Cordelia
- Seqdialn: Sequential Visual Dialog Networks In Joint Visual-linguistic Representation Space Yang Liu
- IART: Intent-aware Response Ranking With Transformers In Information-seeking Conversation Systems Yang Liu, Qiu Minghui, Qu Chen, Chen Cen, Guo Jiafeng, Zhang Yongfeng, Croft W. Bruce, Chen Haiqing
- On The Generation Of Medical Dialogues For COVID-19 Yang Wenmian, Zeng Guangtao, Tan Bowen, Ju Zeqian, Chakravorty Subrato, He Xuehai, Chen Shu, Yang Xingyi, Wu Qingyang, Yu Zhou, Xing Eric, Xie Pengtao
- On The Sub-layer Functionalities Of Transformer Decoder Yang Yilin, Wang Longyue, Shi Shuming, Tadepalli Prasad, Lee Stefan, Tu Zhaopeng
- Contrastive Triple Extraction With Generative Transformer Ye Hongbin, Zhang Ningyu, Deng Shumin, Chen Mosha, Tan Chuanqi, Huang Fei, Chen Huajun
- Adding Recurrence To Pretrained Transformers For Improved Efficiency And Context Size Yoshida Davis, Ettinger Allyson, Gimpel Kevin
- Hard-coded Gaussian Attention For Neural Machine Translation You Weiqiu, Sun Simeng, Iyyer Mohit
- Cocolm: Complex Commonsense Enhanced Language Model With Discourse Relations Yu Changlong, Zhang Hongming, Song Yangqiu, Ng Wilfred
- Assessing Phrasal Representation And Composition In Transformers Yu Lang, Ettinger Allyson
- Few-shot Generative Conversational Query Rewriting Yu Shi, Liu Jiahua, Yang Jingqin, Xiong Chenyan, Bennett Paul, Gao Jianfeng, Liu Zhiyuan
- GOBO: Quantizing Attention-based NLP Models For Low Latency And Energy Efficient Inference Zadeh Ali Hadi, Edo Isak, Awad Omar Mohamed, Moshovos Andreas
- Big Bird: Transformers For Longer Sequences Zaheer Manzil, Guruganesh Guru, Dubey Avinava, Ainslie Joshua, Alberti Chris, Ontanon Santiago, Pham Philip, Ravula Anirudh, Wang Qifan, Yang Li, Ahmed Amr
- Emptransfo: A Multi-head Transformer Architecture For Creating Empathetic Dialog Systems Zandie Rohola, Mahoor Mohammad H.
- Open-domain Dialogue Generation Based On Pre-trained Language Models Zeng Yan, Nie Jian-yun
- A Simple And Efficient Multi-task Learning Approach For Conditioned Dialogue Generation Zeng Yan, Nie Jian-yun
- Token Drop Mechanism For Neural Machine Translation Zhang Huaao, Qiu Shigui, Duan Xiangyu, Zhang Min
- Accelerating Training Of Transformer-based Language Models With Progressive Layer Dropping Zhang Minjia, He Yuxiong
- Future-guided Incremental Transformer For Simultaneous Translation Zhang Shaolei, Feng Yang, Li Liangyou
- When Do You Need Billions Of Words Of Pretraining Data? Zhang Yian, Warstadt Alex, Li Haau-sing, Bowman Samuel R.
- POINTER: Constrained Progressive Text Generation Via Insertion-based Generative Pre-training Zhang Yizhe, Wang Guoyin, Li Chunyuan, Gan Zhe, Brockett Chris, Dolan Bill
- BERT-JAM: Boosting Bert-enhanced Neural Machine Translation With Joint Attention Zhang Zhebin, Wu Sai, Jiang Dawei, Chen Gang
- Know What You Don't Need: Single-shot Meta-pruning For Attention Heads Zhang Zhengyan, Qi Fanchao, Liu Zhiyuan, Liu Qun, Sun Maosong
- Towards Making The Most Of Context In Neural Machine Translation Zheng Zaixiang, Yue Xiang, Huang Shujian, Chen Jiajun, Birch Alexandra
- Improving Non-autoregressive Neural Machine Translation With Monolingual Data Zhou Jiawei, Keung Phillip
- Pre-training Text-to-text Transformers For Concept-centric Common Sense Zhou Wangchunshu, Lee Dong-ho, Selvam Ravi Kiran, Lee Seyeon, Lin Bill Yuchen, Ren Xiang
- Learning To Compare For Better Training And Evaluation Of Open Domain Natural Language Generation Models Zhou Wangchunshu, Xu Ke
- Modifying Memories In Transformer Models Zhu Chen, Rawat Ankit Singh, Zaheer Manzil, Bhojanapalli Srinadh, Li Daliang, Yu Felix, Kumar Sanjiv
- Overcoming Language Priors With Self-supervised Learning For Visual Question Answering Zhu Xi, Mao Zhendong, Liu Chunxiao, Zhang Peng, Wang Bin, Zhang Yongdong
- Regularizing Transformers With Deep Probabilistic Layers Aguilera Aurora Cobo, Olmos Pablo Martínez, Artés-rodríguez Antonio, Pérez-cruz Fernando
- Goal-directed Story Generation: Augmenting Generative Language Models With Reinforcement Learning Alabdulkarim Amal, Li Winston, Martin Lara J., Riedl Mark O.
- On The Multilingual Capabilities Of Very Large-scale English Language Models Armengol-estapé Jordi, Bonet Ona De Gibert, Melero Maite
- Efficient Large Scale Language Modeling With Mixtures Of Experts Artetxe Mikel, Bhosale Shruti, Goyal Naman, Mihaylov Todor, Ott Myle, Shleifer Sam, Lin Xi Victoria, Du Jingfei, Iyer Srinivasan, Pasunuru Ramakanth, Anantharaman Giri, Li Xian, Chen Shuohui, Akin Halil, Baines Mandeep, Martin Louis, Zhou Xing, Koura Punit Singh, O'horo Brian, Wang Jeff, Zettlemoyer Luke, Diab Mona, Kozareva Zornitsa, Stoyanov Ves
- One Question Answering Model For Many Languages With Cross-lingual Dense Passage Retrieval Asai Akari, Yu Xinyan, Kasai Jungo, Hajishirzi Hannaneh
- See, Hear, Read: Leveraging Multimodality With Guided Attention For Abstractive Text Summarization Atri Yash Kumar, Pramanick Shraman, Goyal Vikram, Chakraborty Tanmoy
- Re-evaluating Germeval17 Using German Pre-trained Language Models Aßenmacher M., Corvonato A., Heumann C.
- G-transformer For Document-level Machine Translation Bao Guangsheng, Zhang Yue, Teng Zhiyang, Chen Boxing, Luo Weihua
- S2s-ft: Fine-tuning Pretrained Transformer Encoders For Sequence-to-sequence Learning Bao Hangbo, Dong Li, Wang Wenhui, Yang Nan, Wei Furu
- PLATO-XL: Exploring The Large-scale Pre-training Of Dialogue Generation Bao Siqi, He Huang, Wang Fan, Wu Hua, Wang Haifeng, Wu Wenquan, Wu Zhihua, Guo Zhen, Lu Hua, Huang Xinxian, Tian Xin, Xu Xinchao, Lin Yingzhan, Niu Zheng-yu
- Non-autoregressive Translation By Learning Target Categorical Codes Bao Yu, Huang Shujian, Xiao Tong, Wang Dongqi, Dai Xinyu, Chen Jiajun
- Gabert -- An Irish Language Model Barry James, Wagner Joachim, Cassidy Lauren, Cowap Alan, Lynn Teresa, Walsh Abigail, Meachair Mícheál J. Ó, Foster Jennifer
- NLP-IIS@UT At Semeval-2021 Task 4: Machine Reading Comprehension Using The Long Document Transformer Basafa Hossein, Movahedi Sajad, Ebrahimi Ali, Shakery Azadeh, Faili Heshaam
- Building A Question And Answer System For News Domain Basu Sandipan, Gaddala Aravind, Chetan Pooja, Tiwari Garima, Darapaneni Narayana, Parvathaneni Sadwik, Paduri Anwesh Reddy
- Exploring Unsupervised Pretraining Objectives For Machine Translation Baziotis Christos, Titov Ivan, Birch Alexandra, Haddow Barry
- Continual Learning In Multilingual NMT Via Language-specific Embeddings Berard Alexandre
- SHAQ: Single Headed Attention With Quasi-recurrence Bharwani Nashwin, Kushner Warren, Dandona Sangeet, Schreiber Ben
- On The Universality Of Deep Contextual Language Models Bhatt Shaily, Goyal Poonam, Dandapat Sandipan, Choudhury Monojit, Sitaram Sunayana
- Leveraging Redundancy In Attention With Reuse Transformers Bhojanapalli Srinadh, Chakrabarti Ayan, Veit Andreas, Lukasik Michal, Jain Himanshu, Liu Frederick, Chang Yin-wen, Kumar Sanjiv
- Bridging The Gap Between Language Model And Reading Comprehension: Unsupervised MRC Via Self-supervision Bian Ning, Han Xianpei, Chen Bo, Lin Hongyu, He Ben, Sun Le
- Latr: Layout-aware Transformer For Scene-text VQA Biten Ali Furkan, Litman Ron, Xie Yusheng, Appalaraju Srikar, Manmatha R.
- Improving Language Models By Retrieving From Trillions Of Tokens Borgeaud Sebastian, Mensch Arthur, Hoffmann Jordan, Cai Trevor, Rutherford Eliza, Millican Katie, Driessche George Van Den, Lespiau Jean-baptiste, Damoc Bogdan, Clark Aidan, Casas Diego De Las, Guy Aurelia, Menick Jacob, Ring Roman, Hennigan Tom, Huang Saffron, Maggiore Loren, Jones Chris, Cassirer Albin, Brock Andy, Paganini Michela, Irving Geoffrey, Vinyals Oriol, Osindero Simon, Simonyan Karen, Rae Jack W., Elsen Erich, Sifre Laurent
- What Changes Can Large-scale Language Models Bring? Intensive Study On Hyperclova: Billions-scale Korean Generative Pretrained Transformers Boseop Kim, Hyoungseok Kim, Sang-woo Lee, Gichang Lee, Donghyun Kwak, Dong Hyeon Jeon, Sunghyun Park, Sungju Kim, Seonhoon Kim, Dongpil Seo, Heungsub Lee, Minyoung Jeong, Sungjae Lee, Minsub Kim, Suk Hyun Ko, Seokhun Kim, Taeyong Park, Jinuk Kim, Soyoung Kang, Na-hyeon Ryu, Kang Min Yoo, Minsuk Chang, Soobin Suh, Sookyo In, Jinseong Park, Kyungduk Kim, Hiun Kim, Jisu Jeong, Yong Goo Yeo, Donghoon Ham, Dongju Park, Min Young Lee, Jaewook Kang, Inho Kang, Jung-woo Ha, Woomyoung Park, Nako Sung
- On The Prunability Of Attention Heads In Multilingual BERT Budhraja Aakriti, Pande Madhura, Kumar Pratyush, Khapra Mitesh M.
- CDLM: Cross-document Language Modeling Caciularu Avi, Cohan Arman, Beltagy Iz, Peters Matthew E., Cattan Arie, Dagan Ido
- Long Context Question Answering Via Supervised Contrastive Learning Caciularu Avi, Dagan Ido, Goldberger Jacob, Cohan Arman
- Curriculum Learning For Language Modeling Campos Daniel
- Can Transformers Jump Around Right In Natural Language? Assessing Performance Transfer From SCAN Chaabouni Rahma, Dessì Roberto, Kharitonov Eugene
- Implicit Premise Generation With Discourse-aware Commonsense Knowledge Models Chakrabarty Tuhin, Trivedi Aadit, Muresan Smaranda
- Generic Attention-model Explainability For Interpreting Bi-modal And Encoder-decoder Transformers Chefer Hila, Gur Shir, Wolf Lior
- Topic Transferable Table Question Answering Chemmengath Saneem Ahmed, Kumar Vishwajeet, Bharadwaj Samarth, Sen Jaydeep, Canim Mustafa, Chakrabarti Soumen, Gliozzo Alfio, Sankaranarayanan Karthik
- Multimodal Incremental Transformer With Visual Grounding For Visual Dialogue Generation Chen Feilong, Meng Fandong, Chen Xiuyi, Li Peng, Zhou Jie
- Towards Making The Most Of Multilingual Pretraining For Zero-shot Neural Machine Translation Chen Guanhua, Ma Shuming, Chen Yun, Zhang Dongdong, Pan Jia, Wang Wenping, Wei Furu
- Visualgpt: Data-efficient Adaptation Of Pretrained Language Models For Image Captioning Chen Jun, Guo Han, Yi Kai, Li Boyang, Elhoseiny Mohamed
- MT6: Multilingual Pretrained Text-to-text Transformer With Translation Pairs Chi Zewen, Dong Li, Ma Shuming, Huang Shaohan, Mao Xian-ling, Huang Heyan, Wei Furu
- Language Modeling Using Lmus: 10x Better Data Efficiency Or Improved Scaling Compared To Transformers Chilkuri Narsimha, Hunsberger Eric, Voelker Aaron, Malik Gurshaant, Eliasmith Chris
- Ensemble Fine-tuned Mbert For Translation Quality Estimation Chowdhury Shaika, Baili Naouel, Vannah Brian
- Enjoy The Salience: Towards Better Transformer-based Faithful Explanations With Word Salience Chrysostomou George, Aletras Nikolaos
- CANINE: Pre-training An Efficient Tokenization-free Encoder For Language Representation Clark Jonathan H., Garrette Dan, Turc Iulia, Wieting John
- Long-range Modeling Of Source Code Files With Ewash: Extended Window Access By Syntax Hierarchy Clement Colin B., Lu Shuai, Liu Xiaoyu, Tufano Michele, Drain Dawn, Duan Nan, Sundaresan Neel, Svyatkovskiy Alexey
- Distilling Transformers For Neural Cross-domain Search Clement Colin B., Wu Chen, Drain Dawn, Sundaresan Neel
- Control Prefixes For Parameter-efficient Text Generation Clive Jordan, Cao Kris, Rei Marek
- Training Verifiers To Solve Math Word Problems Cobbe Karl, Kosaraju Vineet, Bavarian Mohammad, Chen Mark, Jun Heewoo, Kaiser Lukasz, Plappert Matthias, Tworek Jerry, Hilton Jacob, Nakano Reiichiro, Hesse Christopher, Schulman John
- YANMTT: Yet Another Neural Machine Translation Toolkit Dabre Raj, Sumita Eiichiro
- Knowledge Neurons In Pretrained Transformers Dai Damai, Dong Li, Hao Yaru, Sui Zhifang, Chang Baobao, Wei Furu
- Mention Memory: Incorporating Textual Knowledge Into Transformers Through Entity Mention Attention De Jong Michiel, Zemlyanskiy Yury, Fitzgerald Nicholas, Sha Fei, Cohen William
- Adapting Monolingual Models: Data Can Be Scarce When Language Similarity Is High De Vries Wietse, Bartelds Martijn, Nissim Malvina, Wieling Martijn
- Multitask Finetuning For Improving Neural Machine Translation In Indian Languages Desai Shaily, Kshirsagar Atharva, Marathe Manisha
- Diagnosing Transformers In Task-oriented Semantic Parsing Desai Shrey, Aly Ahmed
- EKTVQA: Generalized Use Of External Knowledge To Empower Scene Text In Text-vqa Dey Arka Ujjal, Valveny Ernest, Harit Gaurav
- Cogview: Mastering Text-to-image Generation Via Transformers Ding Ming, Yang Zhuoyi, Hong Wenyi, Zheng Wendi, Zhou Chang, Yin Da, Lin Junyang, Zou Xu, Shao Zhou, Yang Hongxia, Tang Jie
- Openprompt: An Open-source Framework For Prompt-learning Ding Ning, Hu Shengding, Zhao Weilin, Chen Yulin, Liu Zhiyuan, Zheng Hai-tao, Sun Maosong
- A Primer On Pretrained Multilingual Language Models Doddapaneni Sumanth, Ramesh Gowtham, Khapra Mitesh M., Kunchukuttan Anoop, Kumar Pratyush
- Diverse Pretrained Context Encodings Improve Document Translation Donato Domenic, Yu Lei, Dyer Chris
- An Empirical Study Of Training End-to-end Vision-and-language Transformers Dou Zi-yi, Xu Yichong, Gan Zhe, Wang Jianfeng, Wang Shuohang, Wang Lijuan, Zhu Chenguang, Zhang Pengchuan, Yuan Lu, Peng Nanyun, Liu Zicheng, Zeng Michael
- GLM: General Language Model Pretraining With Autoregressive Blank Infilling Du Zhengxiao, Qian Yujie, Liu Xiao, Ding Ming, Qiu Jiezhong, Yang Zhilin, Tang Jie
- Robustly Optimized And Distilled Training For Natural Language Understanding Elfadeel Haytham, Peshterliev Stan
- Transformer-based Conditional Variational Autoencoder For Controllable Story Generation Fang Le, Zeng Tao, Liu Chaochun, Bo Liefeng, Dong Wen, Chen Changyou
- Compressing Visual-linguistic Model Via Knowledge Distillation Fang Zhiyuan, Wang Jianfeng, Hu Xiaowei, Wang Lijuan, Yang Yezhou, Liu Zicheng
- Retrieve, Caption, Generate: Visual Grounding For Enhancing Commonsense In Text Generation Models Feng Steven Y., Lu Kevin, Tao Zhuofu, Alikhani Malihe, Mitamura Teruko, Hovy Eduard, Gangal Varun
- Attention Weights In Transformer NMT Fail Aligning Words Between Sequences But Largely Explain Model Predictions Ferrando Javier, Costa-jussà Marta R.
- Open-domain Conversational Search Assistant With Transformers Ferreira Rafael, Leite Mariana, Semedo David, Magalhaes Joao
- Vision-and-language Or Vision-for-language? On Cross-modal Influence In Multimodal Transformers Frank Stella, Bugliarello Emanuele, Elliott Desmond
- Interactively Providing Explanations For Transformer Language Models Friedrich Felix, Schramowski Patrick, Tauchmann Christopher, Kersting Kristian
- Does Dialog Length Matter For Next Response Selection Task? An Empirical Study Ganhotra Jatin, Joshi Sachindra
- Chop Chop BERT: Visual Question Answering By Chopping Visualbert's Heads Gao Chenyu, Zhu Qi, Wang Peng, Wu Qi
- Condenser: A Pre-training Architecture For Dense Retrieval Gao Luyu, Callan Jamie
- Scalable Transformers For Neural Machine Translation Gao Peng, Geng Shijie, Qiao Yu, Wang Xiaogang, Dai Jifeng, Li Hongsheng
- Cross-attention Is All You Need: Adapting Pretrained Transformers For Machine Translation Gheini Mozhdeh, Ren Xiang, May Jonathan
- LAWDR: Language-agnostic Weighted Document Representations From Pre-trained Models Gong Hongyu, Chaudhary Vishrav, Tang Yuqing, Guzmán Francisco
- Larger-scale Transformers For Multilingual Masked Language Modeling Goyal Naman, Du Jingfei, Ott Myle, Anantharaman Giri, Conneau Alexis
- Xeroalign: Zero-shot Cross-lingual Transformer Alignment Gritta Milan, Iacobacci Ignacio
- Transaction: ICL-SJTU Submission To Epic-kitchens Action Anticipation Challenge 2021 Gu Xiao, Qiu Jianing, Guo Yao, Lo Benny, Yang Guang-zhong
- Long Text Generation By Modeling Sentence-level And Discourse-level Coherence Guan Jian, Mao Xiaoxi, Fan Changjie, Liu Zitao, Ding Wenbiao, Huang Minlie
- Block-skim: Efficient Question Answering For Transformer Guan Yue, Li Zhengyi, Leng Jingwen, Lin Zhouhan, Guo Minyi, Zhu Yuhao
- KAT: A Knowledge Augmented Transformer For Vision-and-language Gui Liangke, Wang Borui, Huang Qiuyuan, Hauptmann Alex, Bisk Yonatan, Gao Jianfeng
- Longt5: Efficient Text-to-text Transformer For Long Sequences Guo Mandy, Ainslie Joshua, Uthus David, Ontanon Santiago, Ni Jianmo, Sung Yun-hsuan, Yang Yinfei
- Memory-efficient Transformers Via Top-\(k\) Attention Gupta Ankit, Dar Guy, Goodman Shaya, Ciprut David, Berant Jonathan
- A Million Tweets Are Worth A Few Points: Tuning Transformers For Customer Service Tasks Hadifar Amir, Labat Sofie, Hoste Véronique, Develder Chris, Demeester Thomas
- Zero-shot Controlled Generation With Encoder-decoder Transformers Hazarika Devamanyu, Namazifar Mahdi, Hakkani-tür Dilek
- ELIT: Emory Language And Information Toolkit He Han, Xu Liyan, Choi Jinho D.
- Generate, Annotate, And Learn: NLP With Synthetic Text He Xuanli, Nassar Islam, Kiros Jamie, Haffari Gholamreza, Norouzi Mohammad
- Multilingual Language Models Predict Human Reading Behavior Hollenstein Nora, Pirovano Federico, Zhang Ce, Jäger Lena, Beinborn Lisa
- ENCONTER: Entity Constrained Progressive Sequence Generation Via Insertion-based Transformer Hsieh Lee-hsun, Lee Yang-yin, Lim Ee-peng
- The Niutrans System For WNGT 2020 Efficiency Task Hu Chi, Li Bei, Lin Ye, Li Yinqiao, Li Yanyang, Wang Chenglong, Xiao Tong, Zhu Jingbo
- Lora: Low-rank Adaptation Of Large Language Models Hu Edward J., Shen Yelong, Wallis Phillip, Allen-zhu Zeyuan, Li Yuanzhi, Wang Shean, Wang Lu, Chen Weizhu
- Investigating Transfer Learning In Multilingual Pre-trained Language Models Through Chinese Natural Language Inference Hu Hai, Zhou He, Tian Zuoyu, Zhang Yiwen, Ma Yina, Li Yanting, Nie Yixin, Richardson Kyle
- Non-autoregressive Translation With Layer-wise Prediction And Deep Supervision Huang Chenyang, Zhou Hao, Zaïane Osmar R., Mou Lili, Li Lei
- Unifying Multimodal Transformer For Bi-directional Image And Text Generation Huang Yupan, Xue Hongwei, Liu Bei, Lu Yutong
- SILT: Efficient Transformer Training For Inter-lingual Inference Huertas-tato Javier, Martín Alejandro, Camacho David
- DS-TOD: Efficient Domain Specialization For Task Oriented Dialog Hung Chia-chien, Lauscher Anne, Ponzetto Simone Paolo, Glavaš Goran
- Target-dependent UNITER: A Transformer-based Multimodal Language Comprehension Model For Domestic Service Robots Ishikawa Shintaro, Sugiura Komei
- Scaling Language Models: Methods, Analysis & Insights From Training Gopher Jack W. Rae, Sebastian Borgeaud, Trevor Cai, Katie Millican, Jordan Hoffmann, Francis Song, John Aslanides, Sarah Henderson, Roman Ring, Susannah Young, Eliza Rutherford, Tom Hennigan, Jacob Menick, Albin Cassirer, Richard Powell, George Van Den Driessche, Lisa Anne Hendricks, Maribeth Rauh, Po-sen Huang, Amelia Glaese, Johannes Welbl, Sumanth Dathathri, Saffron Huang, Jonathan Uesato, John Mellor, Irina Higgins, Antonia Creswell, Nat Mcaleese, Amy Wu, Erich Elsen, Siddhant Jayakumar, Elena Buchatskaya, David Budden, Esme Sutherland, Karen Simonyan, Michela Paganini, Laurent Sifre, Lena Martens, Xiang Lorraine Li, Adhiguna Kuncoro, Aida Nematzadeh, Elena Gribovskaya, Domenic Donato, Angeliki Lazaridou, Arthur Mensch, Jean-baptiste Lespiau, Maria Tsimpoukelli, Nikolai Grigorev, Doug Fritz, Thibault Sottiaux, Mantas Pajarskas, Toby Pohlen, Zhitao Gong, Daniel Toyama, Cyprien De Masson D'autume, Yujia Li, Tayfun Terzi, Vladimir Mikulik, Igor Babuschkin, Aidan Clark, Diego De Las Casas, Aurelia Guy, Chris Jones, James Bradbury, Matthew Johnson, Blake Hechtman, Laura Weidinger, Iason Gabriel, William Isaac, Ed Lockhart, Simon Osindero, Laura Rimell, Chris Dyer, Oriol Vinyals, Kareem Ayoub, Jeff Stanway, Lorrayne Bennett, Demis Hassabis, Koray Kavukcuoglu, Geoffrey Irving
- Visqa: X-raying Vision And Language Reasoning In Transformers Jaunet Theo, Kervadec Corentin, Vuillemot Romain, Antipov Grigory, Baccouche Moez, Wolf Christian
- Evaluating Pretrained Transformer Models For Entity Linking In Task-oriented Dialog Jayanthi Sai Muralidhar, Embar Varsha, Raghunathan Karthik
- Discodvt: Generating Long Text With Discourse-aware Discrete Variational Transformer Ji Haozhe, Huang Minlie
- Improving Non-autoregressive Generation With Mixup Training Jiang Ting, Huang Shaohan, Zhang Zihan, Wang Deqing, Zhuang Fuzhen, Wei Furu, Huang Haizhen, Zhang Liangjie, Zhang Qi
- Inducing Transformer's Compositional Generalization Ability Via Auxiliary Sequence Prediction Tasks Jiang Yichen, Bansal Mohit
- Enriching Transformers With Structured Tensor-product Representations For Abstractive Summarization Jiang Yichen, Celikyilmaz Asli, Smolensky Paul, Soulos Paul, Rao Sudha, Palangi Hamid, Fernandez Roland, Smith Caitlin, Bansal Mohit, Gao Jianfeng
- A Good Prompt Is Worth Millions Of Parameters: Low-resource Prompt-based Learning For Vision-language Models Jin Woojeong, Cheng Yu, Shen Yelong, Chen Weizhu, Ren Xiang
- Rethinking Self-supervision Objectives For Generalizable Coherence Modeling Jwalapuram Prathyusha, Joty Shafiq, Lin Xiang
- Towards Incremental Transformers: An Empirical Analysis Of Transformer Models For Incremental NLU Kahardipraja Patrick, Madureira Brielen, Schlangen David
- Distilling Large Language Models Into Tiny And Effective Students Using Pqrnn Kaliamoorthi Prabhu, Siddhant Aditya, Li Edward, Johnson Melvin
- AMMUS: A Survey Of Transformer-based Pretrained Models In Natural Language Processing Kalyan Katikapalli Subramanyam, Rajasekharan Ajit, Sangeetha Sivanesan
- What Do Pre-trained Code Models Know About Code? Karmakar Anjan, Robbes Romain
- Finetuning Pretrained Transformers Into Rnns Kasai Jungo, Peng Hao, Zhang Yizhe, Yogatama Dani, Ilharco Gabriel, Pappas Nikolaos, Mao Yi, Chen Weizhu, Smith Noah A.
- Uquad1.0: Development Of An Urdu Question Answering Training Data For Machine Reading Comprehension Kazi Samreen, Khoja Shakeel
- Generating Rich Product Descriptions For Conversational E-commerce Systems Kedia Shashank, Mantha Aditya, Gupta Sneha, Guo Stephen, Achan Kannan
- How Transferable Are Reasoning Patterns In VQA? Kervadec Corentin, Jaunet Theo, Antipov Grigory, Baccouche Moez, Vuillemot Romain, Wolf Christian
- Text-free Prosody-aware Generative Spoken Language Modeling Kharitonov Eugene, Lee Ann, Polyak Adam, Adi Yossi, Copet Jade, Lakhotia Kushal, Nguyen Tu-anh, Rivière Morgane, Mohamed Abdelrahman, Dupoux Emmanuel, Hsu Wei-ning
- Learned Token Pruning For Transformers Kim Sehoon, Shen Sheng, Thorsley David, Gholami Amir, Kwon Woosuk, Hassoun Joseph, Keutzer Kurt
- L-verse: Bidirectional Generation Between Image And Text Kim Taehoon, Song Gwangmo, Lee Sihaeng, Kim Sangyun, Seo Yewon, Lee Soonyoung, Kim Seung Hwan, Lee Honglak, Bae Kyunghoon
- What's In A Measurement? Using GPT-3 On Semeval 2021 Task 8 -- Measeval Kohler Curt, Daniel Ron Jr
- Improving Scheduled Sampling With Elastic Weight Consolidation For Neural Machine Translation Korakakis Michalis, Vlachos Andreas
- Controlling Conditional Language Models Without Catastrophic Forgetting Korbak Tomasz, Elsahar Hady, Kruszewski German, Dymetman Marc
- BERT Busters: Outlier Dimensions That Disrupt Transformers Kovaleva Olga, Kulshreshtha Saurabh, Rogers Anna, Rumshisky Anna
- On The Significance Of Question Encoder Sequence Model In The Out-of-distribution Performance In Visual Question Answering Kv Gouthaman, Mittal Anurag
- Can Transformer Models Measure Coherence In Text? Re-thinking The Shuffle Test Laban Philippe, Dai Luke, Bandarkar Lucas, Hearst Marti A.
- Layered Gradient Accumulation And Modular Pipeline Parallelism: Fast And Efficient Training Of Large Language Models Lamy-poirier Joel
- Can Transformer Language Models Predict Psychometric Properties? Laverghetta Antonio Jr., Nighojkar Animesh, Mirzakhalov Jamshidbek, Licato John
- You Only Need One Model For Open-domain Question Answering Lee Haejun, Kedia Akhil, Lee Jongwon, Paranjape Ashwin, Manning Christopher D., Woo Kyoung-gu
- Towards Few-shot Fact-checking Via Perplexity Lee Nayeon, Bang Yejin, Madotto Andrea, Khabsa Madian, Fung Pascale
- When Attention Meets Fast Recurrence: Training Language Models With Reduced Compute Lei Tao
- Quiz-style Question Generation For News Stories Lelkes Adam D., Tran Vinh Q., Yu Cong
- Exploiting Sentence-level Representations For Passage Ranking Leonhardt Jurek, Beringer Fabian, Anand Avishek
- Semvlp: Vision-language Pre-training By Aligning Semantics At Multiple Levels Li Chenliang, Yan Ming, Xu Haiyang, Luo Fuli, Wang Wei, Bi Bin, Huang Songfang
- The Stability-efficiency Dilemma: Investigating Sequence Length Warmup For Training GPT Models Li Conglong, Zhang Minjia, He Yuxiong
- Residual Tree Aggregation Of Layers For Neural Machine Translation Li Guoliang, Li Yiyang
- Align Before Fuse: Vision And Language Representation Learning With Momentum Distillation Li Junnan, Selvaraju Ramprasaath R., Gotmare Akhilesh Deepak, Joty Shafiq, Xiong Caiming, Hoi Steven
- Personalized Transformer For Explainable Recommendation Li Lei, Zhang Yongfeng, Chen Li
- Ensemble ALBERT On Squad 2.0 Li Shilun, Li Renee, Peng Veronica
- A Short Study On Compressing Decoder-based Language Models Li Tianda, Mesbahi Yassir El, Kobyzev Ivan, Rashid Ahmad, Mahmud Atif, Anchuri Nithin, Hajimolahoseini Habib, Liu Yang, Rezagholizadeh Mehdi
- Scheduled Sampling In Vision-language Pretraining With Decoupled Encoder-decoder Network Li Yehao, Pan Yingwei, Yao Ting, Chen Jingwen, Mei Tao
- Terapipe: Token-level Pipeline Parallelism For Training Large-scale Language Models Li Zhuohan, Zhuang Siyuan, Guo Shiyuan, Zhuo Danyang, Zhang Hao, Song Dawn, Stoica Ion
- Lyra: A Benchmark For Turducken-style Code Generation Liang Qingyuan, Sun Zeyu, Zhu Qihao, Zhang Wenjie, Yu Lian, Xiong Yingfei, Zhang Lu
- Multimodal Transformer With Variable-length Memory For Vision-and-language Navigation Lin Chuang, Jiang Yi, Cai Jianfei, Qu Lizhen, Haffari Gholamreza, Yuan Zehuan
- Domain-independent User Simulation With Transformers For Task-oriented Dialogue Systems Lin Hsien-chin, Lubis Nurul, Hu Songbo, Van Niekerk Carel, Geishauser Christian, Heck Michael, Feng Shutong, Gašić Milica
- MDQE: A More Accurate Direct Pretraining For Machine Translation Quality Estimation Lin Lei
- Truthfulqa: Measuring How Models Mimic Human Falsehoods Lin Stephanie, Hilton Jacob, Evans Owain
- VX2TEXT: End-to-end Learning Of Video-based Text Generation From Multimodal Inputs Lin Xudong, Bertasius Gedas, Wang Jue, Chang Shih-fu, Parikh Devi, Torresani Lorenzo
- Variational Latent-state GPT For Semi-supervised Task-oriented Dialog Systems Liu Hong, Cai Yucheng, Lin Zhenru, Ou Zhijian, Huang Yi, Feng Junlan
- Towards Automated Psychotherapy Via Language Modeling Liu Houjun
- A Three-stage Learning Framework For Low-resource Knowledge-grounded Dialogue Generation Liu Shilei, Zhao Xiaofeng, Li Bochao, Ren Feiliang, Zhang Longhui, Yin Shujuan
- Unified Multimodal Pre-training And Prompt-based Tuning For Vision-language Understanding And Generation Liu Tianyi, Wu Zuxuan, Xiong Wenhan, Chen Jingjing, Jiang Yu-gang
- Enriching Non-autoregressive Transformer With Syntactic And Semantic Structures For Neural Machine Translation Liu Ye, Wan Yao, Zhang Jian-guo, Zhao Wenting, Yu Philip S.
- Confidence-aware Scheduled Sampling For Neural Machine Translation Liu Yijin, Meng Fandong, Chen Yufeng, Xu Jinan, Zhou Jie
- Scheduled Sampling Based On Decoding Steps For Neural Machine Translation Liu Yijin, Meng Fandong, Chen Yufeng, Xu Jinan, Zhou Jie
- Augmenting Sequential Recommendation With Pseudo-prior Items Via Reversely Pre-training Transformer Liu Zhiwei, Fan Ziwei, Wang Yu, Yu Philip S.
- Pretrained Transformers As Universal Computation Engines Lu Kevin, Grover Aditya, Abbeel Pieter, Mordatch Igor
- Iconqa: A New Benchmark For Abstract Diagram Understanding And Visual Language Reasoning Lu Pan, Qiu Liang, Chen Jiaqi, Xia Tony, Zhao Yizhou, Zhang Wei, Yu Zhou, Liang Xiaodan, Zhu Song-chun
- X-METRA-ADA: Cross-lingual Meta-transfer Learning Adaptation To Natural Language Understanding And Question Answering M'hamdi Meryem, Kim Doo Soon, Dernoncourt Franck, Bui Trung, Ren Xiang, May Jonathan
- Local And Global Context-based Pairwise Models For Sentence Ordering Manku Ruskin Raj, Paul Aditya Jyoti
- Lightweight Cross-lingual Sentence Representation Learning Mao Zhuoyuan, Gupta Prakhar, Wang Pei, Chu Chenhui, Jaggi Martin, Kurohashi Sadao
- \(\infty\)-former: Infinite Memory Transformer Martins Pedro Henrique, Marinho Zita, Martins André F. T.
- Transformer Based Bengali Chatbot Using General Knowledge Dataset Masum Abu Kaisar Mohammad, Abujar Sheikh, Akter Sharmin, Ria Nushrat Jahan, Hossain Syed Akhter
- Unified Questioner Transformer For Descriptive Question Generation In Goal-oriented Visual Dialogue Matsumori Shoya, Shingyouchi Kosuke, Abe Yuki, Fukuchi Yosuke, Sugiura Komei, Imai Michita
- How Much Do Language Models Copy From Their Training Data? Evaluating Linguistic Novelty In Text Generation Using RAVEN Mccoy R. Thomas, Smolensky Paul, Linzen Tal, Gao Jianfeng, Celikyilmaz Asli
- COCO-LM: Correcting And Contrasting Text Sequences For Language Model Pretraining Meng Yu, Xiong Chenyan, Bajaj Payal, Tiwary Saurabh, Bennett Paul, Han Jiawei, Song Xia
- Self-training Improves Pre-training For Few-shot Learning In Task-oriented Dialog Systems Mi Fei, Zhou Wanhao, Cai Fengyu, Kong Lingjing, Huang Minlie, Faltings Boi
- Prevent The Language Model From Being Overconfident In Neural Machine Translation Miao Mengqi, Meng Fandong, Liu Yijin, Zhou Xiao-hua, Zhou Jie
- Language Models Are Few-shot Butlers Micheli Vincent, Fleuret François
- Structural Analysis Of An All-purpose Question Answering Model Micheli Vincent, Heinrich Quentin, Fleuret François, Belblidia Wacim
- Recent Advances In Natural Language Processing Via Large Pre-trained Language Models: A Survey Min Bonan, Ross Hayley, Sulem Elior, Veyseh Amir Pouran Ben, Nguyen Thien Huu, Sainz Oscar, Agirre Eneko, Heinz Ilana, Roth Dan
- Distantly Supervised Transformers For E-commerce Product QA Mittal Happy, Chakrabarti Aniket, Bayar Belhassen, Sharma Animesh Anant, Rasiwasia Nikhil
- GPT-3 Models Are Poor Few-shot Learners In The Biomedical Domain Moradi Milad, Blagec Kathrin, Haberl Florian, Samwald Matthias
- First Align, Then Predict: Understanding The Cross-lingual Ability Of Multilingual BERT Muller Benjamin, Elazar Yanai, Sagot Benoît, Seddah Djamé
- Arat5: Text-to-text Transformers For Arabic Language Generation Nagoudi El Moatez Billah, Elmadany Abdelrahim, Abdul-mageed Muhammad
- Planning With Learned Entity Prompts For Abstractive Summarization Narayan Shashi, Zhao Yao, Maynez Joshua, Simoes Gonçalo, Nikolaev Vitaly, Mcdonald Ryan
- Hierarchical Transformers Are More Efficient Language Models Nawrot Piotr, Tworkowski Szymon, Tyrolski Michał, Kaiser Łukasz, Wu Yuhuai, Szegedy Christian, Michalewski Henryk
- Improving Transformers With Probabilistic Attention Keys Nguyen Tam, Nguyen Tan M., Le Dung D., Nguyen Duy Khuong, Tran Viet-anh, Baraniuk Richard G., Ho Nhat, Osher Stanley J.
- Materialized Knowledge Bases From Commonsense Transformers Nguyen Tuan-phong, Razniewski Simon
- Sentence-t5: Scalable Sentence Encoders From Pre-trained Text-to-text Models Ni Jianmo, Ábrego Gustavo Hernández, Constant Noah, Ma Ji, Hall Keith B., Cer Daniel, Yang Yinfei
- MLP Architectures For Vision-and-language Modeling: An Empirical Study Nie Yixin, Li Linjie, Gan Zhe, Wang Shuohang, Zhu Chenguang, Zeng Michael, Liu Zicheng, Bansal Mohit, Wang Lijuan
- Investigating The Limitations Of Transformers With Simple Arithmetic Tasks Nogueira Rodrigo, Jiang Zhiying, Lin Jimmy
- Progressive Transformer-based Generation Of Radiology Reports Nooralahzadeh Farhad, Gonzalez Nicolas Perez, Frauenfelder Thomas, Fujimoto Koji, Krauthammer Michael
- Code Generation From Natural Language With Less Prior And More Monolingual Data Norouzi Sajad, Tang Keyi, Cao Yanshuai
- Show Your Work: Scratchpads For Intermediate Computation With Language Models Nye Maxwell, Andreassen Anders Johan, Gur-ari Guy, Michalewski Henryk, Austin Jacob, Bieber David, Dohan David, Lewkowycz Aitor, Bosma Maarten, Luan David, Sutton Charles, Odena Augustus
- What Context Features Can Transformer Language Models Use? O'connor Joe, Andreas Jacob
- Investigating Numeracy Learning Ability Of A Text-to-text Transfer Model Pal Kuntal Kumar, Baral Chitta
- Probing For Bridging Inference In Transformer Language Models Pandit Onkar, Hou Yufang
- Cascading Adaptors To Leverage English Data To Improve Performance Of Question Answering For Low-resource Languages Pandya Hariom A., Ardeshna Bhavik, Bhatt Brijesh S.
- PENELOPIE: Enabling Open Information Extraction For The Greek Language Through Machine Translation Papadopoulos Dimitris, Papadakis Nikolaos, Matsatsinis Nikolaos
- Do Encoder Representations Of Generative Dialogue Models Encode Sufficient Information About The Task? Parthasarathi Prasanna, Pineau Joelle, Chandar Sarath
- Episodic Transformer For Vision-and-language Navigation Pashevich Alexander, Schmid Cordelia, Sun Chen
- A Comparative Study Of Transformer-based Language Models On Extractive Question Answering Pearce Kate, Zhan Tiffany, Komanduri Aneesh, Zhan Justin
- ABC: Attention With Bounded-memory Control Peng Hao, Kasai Jungo, Pappas Nikolaos, Yogatama Dani, Wu Zhaofeng, Kong Lingpeng, Schwartz Roy, Smith Noah A.
- Cotext: Multi-task Learning With Code-text Transformer Phan Long, Tran Hieu, Le Daniel, Nguyen Hieu, Anibal James, Peltekian Alec, Ye Yanfang
- Context Transformer With Stacked Pointer Networks For Conversational Question Answering Over Knowledge Graphs Plepi Joan, Kacupaj Endri, Singh Kuldeep, Thakkar Harsh, Lehmann Jens
- Does Pre-training Induce Systematic Inference? How Masked Language Models Acquire Commonsense Knowledge Porada Ian, Sordoni Alessandro, Cheung Jackie Chi Kit
- Modeling Event Plausibility With Consistent Conceptual Abstraction Porada Ian, Suleman Kaheer, Trischler Adam, Cheung Jackie Chi Kit
- ORCHARD: A Benchmark For Measuring Systematic Generalization Of Multi-hierarchical Reasoning Pung Bill Tuck Weng, Chan Alvin
- Attentive Fine-tuning Of Transformers For Translation Of Low-resourced Languages @loresmt 2021 Puranik Karthik, Hande Adeep, Priyadharshini Ruba, Durairaj Thenmozhi, Sampath Anbukkarasi, Thamburaj Kingston Pal, Chakravarthi Bharathi Raja
- Robertuito: A Pre-trained Language Model For Social Media Text In Spanish Pérez Juan Manuel, Furman Damián A., Alemany Laura Alonso, Luque Franco
- Assessing The Syntactic Capabilities Of Transformer-based Multilingual Language Models Pérez-mayos Laura, García Alba Táboas, Mille Simon, Wanner Leo
- Structural Guidance For Transformer Language Models Qian Peng, Naseem Tahira, Levy Roger, Astudillo Ramón Fernandez
- Winner Team Mia At Textvqa Challenge 2021: Vision-and-language Representation Learning With Pre-trained Sequence-to-sequence Model Qiao Yixuan, Chen Hao, Wang Jun, Chen Yihao, Ye Xianbin, Li Ziliang, Qi Xianbiao, Gao Peng, Xie Guotong
- Vinmt: Neural Machine Translation Toolkit Quan Nguyen Hoang, Dat Nguyen Thanh, Cong Nguyen Hoang Minh, Van Vinh Nguyen, Vinh Ngo Thi, Thai Nguyen Phuong, Viet Tran Hong
- Grid Search Hyperparameter Benchmarking Of BERT, ALBERT, And Longformer On Duorc Quijano Alex John, Nguyen Sam, Ordonez Juanita
- Teaching Autoregressive Language Models Complex Tasks By Demonstration Recchia Gabriel
- Webgpt: Browser-assisted Question-answering With Human Feedback Reiichiro Nakano, Jacob Hilton, Suchir Balaji, Jeff Wu, Long Ouyang, Christina Kim, Christopher Hesse, Shantanu Jain, Vineet Kosaraju, William Saunders, Xu Jiang, Karl Cobbe, Tyna Eloundou, Gretchen Krueger, Kevin Button, Matthew Knight, Benjamin Chess, John Schulman
- Revisiting Context Choices For Context-aware Machine Translation Rikters Matīss, Nakazawa Toshiaki
- Don't Sweep Your Learning Rate Under The Rug: A Closer Look At Cross-modal Transfer Of Pretrained Transformers Rothermel Danielle, Li Margaret, Rocktäschel Tim, Foerster Jakob
- Grounding Natural Language Instructions: Can Large Language Models Capture Spatial Information? Rozanova Julia, Ferreira Deborah, Dubba Krishna, Cheng Weiwei, Zhang Dell, Freitas Andre
- Efficient Domain Adaptation Of Language Models Via Adaptive Tokenization Sachidananda Vin, Kessler Jason S., Lai Yi-an
- Challenges In Procedural Multimodal Machine Comprehension: A Novel Way To Benchmark Sahu Pritish, Sikka Karan, Divakaran Ajay
- Rome Was Built In 1776: A Case Study On Factual Correctness In Knowledge-grounded Response Generation Santhanam Sashank, Hedayatnia Behnam, Gella Spandana, Padmakumar Aishwarya, Kim Seokhwan, Liu Yang, Hakkani-tur Dilek
- Representation Learning For Conversational Data Using Discourse Mutual Information Maximization Santra Bishal, Roychowdhury Sumegh, Mandal Aishik, Gurram Vasu, Naik Atharva, Gupta Manish, Goyal Pawan
- Towards Neural Functional Program Evaluation Scholak Torsten, Pilault Jonathan, Velez-ginorio Joey
- Large Pre-trained Language Models Contain Human-like Biases Of What Is Right And Wrong To Do Schramowski Patrick, Turan Cigdem, Andersen Nico, Rothkopf Constantin A., Kersting Kristian
- Audio-visual Scene-aware Dialog And Reasoning Using Audio-visual Transformers With Joint Student-teacher Learning Shah Ankit P., Geng Shijie, Gao Peng, Cherian Anoop, Hori Takaaki, Marks Tim K., Roux Jonathan Le, Hori Chiori
- Laviter: Learning Aligned Visual And Textual Representations Assisted By Image And Caption Generation Shaikh Mohammad Abuzar, Ji Zhanghexuan, Moukheiber Dana, Shen Yan, Srihari Sargur, Gao Mingchen
- CPT: A Pre-trained Unbalanced Transformer For Both Chinese Language Understanding And Generation Shao Yunfan, Geng Zhichao, Liu Yitao, Dai Junqi, Yan Hang, Yang Fei, Zhe Li, Bao Hujun, Qiu Xipeng
- Better Neural Machine Translation By Extracting Linguistic Information From BERT Shavarani Hassan S., Sarkar Anoop
- FPM: A Collection Of Large-scale Foundation Pre-trained Language Models Shen Dezhou
- How Much Can CLIP Benefit Vision-and-language Tasks? Shen Sheng, Li Liunian Harold, Tan Hao, Bansal Mohit, Rohrbach Anna, Chang Kai-wei, Yao Zhewei, Keutzer Kurt
- Adapting Long Context NLM For ASR Rescoring In Conversational Agents Shenoy Ashish, Bodapati Sravan, Sunkara Monica, Ronanki Srikanth, Kirchhoff Katrin
- Layer-wise Pruning Of Transformer Attention Heads For Efficient Language Modeling Shim Kyuhong, Choi Iksoo, Sung Wonyong, Choi Jungwook
- Scaling Law For Recommendation Models: Towards General-purpose User Representations Shin Kyuyong, Kwak Hanock, Kim Su Young, Ramstrom Max Nihlen, Jeong Jisu, Ha Jung-woo, Kim Kyung-min
- Exploring Low-cost Transformer Model Compression For Large-scale Commercial Reply Suggestions Shrivastava Vaishnavi, Gaonkar Radhika, Gupta Shashank, Jha Abhishek
- Structural Persistence In Language Models: Priming As A Window Into Abstract Language Representations Sinclair Arabella, Jumelet Jaap, Zuidema Willem, Fernández Raquel
- Pre-trained Language Models As Prior Knowledge For Playing Text-based Games Singh Ishika, Singh Gargi, Modi Ashutosh
- Semantics-aware Attention Improves Neural Machine Translation Slobodkin Aviv, Choshen Leshem, Abend Omri
- Primer: Searching For Efficient Transformers For Language Modeling So David R., Mańke Wojciech, Liu Hanxiao, Dai Zihang, Shazeer Noam, Le Quoc V.
- Multimodal Integration Of Human-like Attention In Visual Question Answering Sood Ekta, Kögel Fabian, Müller Philipp, Thomas Dominike, Bace Mihai, Bulling Andreas
- Txt: Crossmodal End-to-end Learning With Transformers Steitz Jan-martin O., Pfeiffer Jonas, Gurevych Iryna, Roth Stefan
- Corelm: Coreference-aware Language Model Fine-tuning Stylianou Nikolaos, Vlahavas Ioannis
- Ernie-tiny: A Progressive Distillation Framework For Pretrained Transformer Compression Su Weiyue, Chen Xuyi, Feng Shikun, Liu Jiaxiang, Liu Weixin, Sun Yu, Tian Hao, Wu Hua, Wang Haifeng
- Non-autoregressive Text Generation With Pre-trained Language Models Su Yixuan, Cai Deng, Wang Yan, Vandyke David, Baker Simon, Li Piji, Collier Nigel
- Tacl: Improving BERT Pre-training With Token-aware Contrastive Learning Su Yixuan, Liu Fangyu, Meng Zaiqiao, Lan Tian, Shu Lei, Shareghi Ehsan, Collier Nigel
- NVIDIA Nemo Neural Machine Translation Systems For English-german And English-russian News And Biomedical Tasks At WMT21 Subramanian Sandeep, Hrinchuk Oleksii, Adams Virginia, Kuchaiev Oleksii
- Embodied BERT: A Transformer Model For Embodied, Language-guided Visual Task Completion Suglia Alessandro, Gao Qiaozi, Thomason Jesse, Thattai Govind, Sukhatme Gaurav
- Not All Memories Are Created Equal: Learning To Forget By Expiring Sukhbaatar Sainbayar, Ju Da, Poff Spencer, Roller Stephen, Szlam Arthur, Weston Jason, Fan Angela
- Do Long-range Language Models Actually Use Long-range Context? Sun Simeng, Krishna Kalpesh, Mattarella-micke Andrew, Iyyer Mohit
- Lightningdot: Pre-training Visual-semantic Embeddings For Real-time Image-text Retrieval Sun Siqi, Chen Yen-chun, Li Linjie, Wang Shuohang, Fang Yuwei, Liu Jingjing
- NSP-BERT: A Prompt-based Few-shot Learner Through An Original Pre-training Task--next Sentence Prediction Sun Yi, Zheng Yu, Hao Chao, Qiu Hangping
- ERNIE 3.0: Large-scale Knowledge Enhanced Pre-training For Language Understanding And Generation Sun Yu, Wang Shuohuan, Feng Shikun, Ding Siyu, Pang Chao, Shang Junyuan, Liu Jiaxiang, Chen Xuyi, Zhao Yanbin, Lu Yuxiang, Liu Weixin, Wu Zhihua, Gong Weibao, Liang Jianzhong, Shang Zhizhou, Sun Peng, Liu Wei, Ouyang Xuan, Yu Dianhai, Tian Hao, Wu Hua, Wang Haifeng
- Multilingual Translation Via Grafting Pre-trained Language Models Sun Zewei, Wang Mingxuan, Li Lei
- Data Processing Matters: Srph-konvergen Ai's Machine Translation System For WMT'21 Sutawika Lintang, Cruz Jan Christian Blaise
- A Three Step Training Approach With Data Augmentation For Morphological Inflection Szolnok Gabor, Barta Botond, Lakatos Dorina, Acs Judit
- Diff-explainer: Differentiable Convex Optimization For Explainable Multi-hop Inference Thayaparan Mokanarangan, Valentino Marco, Ferreira Deborah, Rozanova Julia, Freitas André
- Towards Reinforcement Learning For Pivot-based Neural Machine Translation With Non-autoregressive Transformer Tokarchuk Evgeniia, Rosendahl Jan, Wang Weiyue, Petrushkov Pavel, Lancewicki Tomer, Khadivi Shahram, Ney Hermann
- Exploring Transformers In Natural Language Generation: GPT, BERT, And Xlnet Topal M. Onat, Bas Anil, Van Heerden Imke
- Transferring Semantic Knowledge Into Language Encoders Umair Mohammad, Ferraro Francis
- Comparing Test Sets With Item Response Theory Vania Clara, Htut Phu Mon, Huang William, Mungra Dhara, Pang Richard Yuanzhe, Phang Jason, Liu Haokun, Cho Kyunghyun, Bowman Samuel R.
- Multitask Prompted Training Enables Zero-shot Task Generalization Victor Sanh, Albert Webson, Colin Raffel, Stephen H. Bach, Lintang Sutawika, Zaid Alyafeai, Antoine Chaffin, Arnaud Stiegler, Teven Le Scao, Arun Raja, Manan Dey, M Saiful Bari, Canwen Xu, Urmish Thakker, Shanya Sharma Sharma, Eliza Szczechla, Taewoon Kim, Gunjan Chhablani, Nihal Nayak, Debajyoti Datta, Jonathan Chang, Mike Tian-jian Jiang, Han Wang, Matteo Manica, Sheng Shen, Zheng Xin Yong, Harshit Pandey, Rachel Bawden, Thomas Wang, Trishala Neeraj, Jos Rozen, Abheesht Sharma, Andrea Santilli, Thibault Fevry, Jason Alan Fries, Ryan Teehan, Tali Bers, Stella Biderman, Leo Gao, Thomas Wolf, Alexander M. Rush
- Transformers For Headline Selection For Russian News Clusters Voropaev Pavel, Sopilnyak Olga
- Convfit: Conversational Fine-tuning Of Pretrained Language Models Vulić Ivan, Su Pei-hao, Coope Sam, Gerz Daniela, Budzianowski Paweł, Casanueva Iñigo, Mrkšić Nikola, Wen Tsung-hsien
- Bertnesia: Investigating The Capture And Forgetting Of Knowledge In BERT Wallat Jonas, Singh Jaspreet, Anand Avishek
- Selective Knowledge Distillation For Neural Machine Translation Wang Fusheng, Yan Jianhao, Meng Fandong, Zhou Jie
- Contextualized Scene Imagination For Generative Commonsense Reasoning Wang Peifeng, Zamora Jonathan, Liu Junfeng, Ilievski Filip, Chen Muhao, Ren Xiang
- Entailment As Few-shot Learner Wang Sinong, Fang Han, Khabsa Madian, Mao Hanzi, Ma Hao
- Sentence Semantic Regression For Text Generation Wang Wei, Li Piji, Zheng Hai-tao
- Lightseq2: Accelerated Training For Transformer-based Models On Gpus Wang Xiaohui, Wei Yang, Xiong Ying, Huang Guyue, Qian Xian, Ding Yufei, Wang Mingxuan, Li Lei
- List: Lite Prompted Self-training Makes Parameter-efficient Few-shot Learners Wang Yaqing, Mukherjee Subhabrata, Liu Xiaodong, Gao Jing, Awadallah Ahmed Hassan, Gao Jianfeng
- Codet5: Identifier-aware Unified Pre-trained Encoder-decoder Models For Code Understanding And Generation Wang Yue, Wang Weishi, Joty Shafiq, Hoi Steven C. H.
- Distilled Dual-encoder Model For Vision-language Understanding Wang Zekun, Wang Wenhui, Zhu Haichao, Liu Ming, Qin Bing, Wei Furu
- Dodrio: Exploring Transformer Models With Interactive Visualization Wang Zijie J., Turko Robert, Chau Duen Horng
- Visual Question Rewriting For Increasing Response Rate Wei Jiayi, Li Xilian, Zhang Yi, Wang Xin
- Pangu-\(α\): Large-scale Autoregressive Pretrained Chinese Language Models With Auto-parallel Computation Wei Zeng, Xiaozhe Ren, Teng Su, Hui Wang, Yi Liao, Zhiwei Wang, Xin Jiang, Zhenzhang Yang, Kaisheng Wang, Xiaoda Zhang, Chen Li, Ziyan Gong, Yifan Yao, Xinjing Huang, Jun Wang, Jianfeng Yu, Qi Guo, Yue Yu, Yan Zhang, Jin Wang, Hengtao Tao, Dasen Yan, Zexuan Yi, Fang Peng, Fangqing Jiang, Han Zhang, Lingfeng Deng, Yehong Zhang, Zhe Lin, Chao Zhang, Shaojie Zhang, Mingyue Guo, Shanzhi Gu, Gaojun Fan, Yaowei Wang, Xuefeng Jin, Qun Liu, Yonghong Tian
- Memory And Knowledge Augmented Language Models For Inferring Salience In Long-form Stories Wilmot David, Keller Frank
- Fastformer: Additive Attention Can Be All You Need Wu Chuhan, Wu Fangzhao, Qi Tao, Huang Yongfeng, Xie Xing
- Domain-adaptive Pretraining Methods For Dialogue Understanding Wu Han, Xu Kun, Song Linfeng, Jin Lifeng, Zhang Haisong, Song Linqi
- On Explaining Your Explanations Of BERT: An Empirical Study With Sequence Classification Wu Zhengxuan, Ong Desmond C.
- Prefix-tuning: Optimizing Continuous Prompts For Generation Xiang Lisa Li, Percy Liang
- P-tuning V2: Prompt Tuning Can Be Comparable To Fine-tuning Universally Across Scales And Tasks Xiao Liu, Kaixuan Ji, Yicheng Fu, Weng Lam Tam, Zhengxiao Du, Zhilin Yang, Jie Tang
- An Explanation Of In-context Learning As Implicit Bayesian Inference Xie Sang Michael, Raghunathan Aditi, Liang Percy, Ma Tengyu
- Simple Local Attentions Remain Competitive For Long-context Tasks Xiong Wenhan, Oğuz Barlas, Gupta Anchit, Chen Xilun, Liskovich Diana, Levy Omer, Yih Wen-tau, Mehdad Yashar
- Moca: Incorporating Multi-stage Domain Pretraining And Cross-guided Multimodal Attention For Textbook Question Answering Xu Fangzhi, Lin Qika, Liu Jun, Zhang Lingling, Zhao Tianzhe, Chai Qi, Pan Yudai
- E2E-VLP: End-to-end Vision-language Pre-training Enhanced By Visual Learning Xu Haiyang, Yan Ming, Li Chenliang, Bi Bin, Huang Songfang, Xiao Wenming, Huang Fei
- VLM: Task-agnostic Video-language Model Pre-training For Video Understanding Xu Hu, Ghosh Gargi, Huang Po-yao, Arora Prahal, Aminzadeh Masoumeh, Feichtenhofer Christoph, Metze Florian, Zettlemoyer Luke
- Attention-guided Generative Models For Extractive Question Answering Xu Peng, Liang Davis, Huang Zhiheng, Xiang Bing
- From Dense To Sparse: Contrastive Pruning For Better Pre-trained Language Model Compression Xu Runxin, Luo Fuli, Wang Chengyu, Chang Baobao, Huang Jun, Huang Songfang, Huang Fei
- K-PLUG: Knowledge-injected Pre-trained Language Model For Natural Language Understanding And Generation In E-commerce Xu Song, Li Haoran, Yuan Peng, Wang Yujia, Wu Youzheng, He Xiaodong, Liu Ying, Zhou Bowen
- Byt5: Towards A Token-free Future With Pre-trained Byte-to-byte Models Xue Linting, Barua Aditya, Constant Noah, Al-rfou Rami, Narang Sharan, Kale Mihir, Roberts Adam, Raffel Colin
- Fastseq: Make Sequence Generation Faster Yan Yu, Hu Fei, Chen Jiusheng, Bhendawade Nikhil, Ye Ting, Gong Yeyun, Duan Nan, Cui Desheng, Chi Bingyu, Zhang Ruofei
- Transformer-based Korean Pretrained Language Models: A Survey On Three Years Of Progress Yang Kichang
- Causal Attention For Vision-language Tasks Yang Xu, Zhang Hanwang, Qi Guojun, Cai Jianfei
- Learning To Generate Task-specific Adapters From Task Description Ye Qinyuan, Ren Xiang
- Gpt3mix: Leveraging Large-scale Language Models For Text Augmentation Yoo Kang Min, Park Dongju, Kang Jaewook, Lee Sang-woo, Park Woomyeong
- Reconsidering The Past: Optimizing Hidden States In Language Models Yoshida Davis, Gimpel Kevin
- On The Interplay Between Fine-tuning And Composition In Transformers Yu Lang, Ettinger Allyson
- Prune Once For All: Sparse Pre-trained Language Models Zafrir Ofir, Larey Ariel, Boudoukh Guy, Shen Haihao, Wasserblat Moshe
- Bitfit: Simple Parameter-efficient Fine-tuning For Transformer-based Masked Language-models Zaken Elad Ben, Ravfogel Shauli, Goldberg Yoav
- Topical Language Generation Using Transformers Zandie Rohola, Mahoor Mohammad H.
- Readtwice: Reading Very Large Documents With Memories Zemlyanskiy Yury, Ainslie Joshua, De Jong Michiel, Pham Philip, Eckstein Ilya, Sha Fei
- Wechat Neural Machine Translation Systems For WMT21 Zeng Xianfeng, Liu Yijin, Li Ernan, Ran Qiu, Meng Fandong, Li Peng, Xu Jinan, Zhou Jie
- Improving Stack Overflow Question Title Generation With Copying Enhanced Codebert Model And Bi-modal Information Zhang Fengji, Yu Xiao, Keung Jacky, Li Fuyang, Xie Zhiwen, Yang Zhen, Ma Caoyuan, Zhang Zhimin
- Ernie-vilg: Unified Generative Pre-training For Bidirectional Vision-language Generation Zhang Han, Yin Weichong, Fang Yewei, Li Lanxin, Duan Boqiang, Wu Zhihua, Sun Yu, Tian Hao, Wu Hua, Wang Haifeng
- A Knowledge-grounded Dialog System Based On Pre-trained Language Models Zhang Weijie, Chen Jiaoxuan, Wu Haipang, Wan Sanhui, Li Gongfeng
- DSGPT: Domain-specific Generative Pre-training Of Transformers For Text Generation In E-commerce Title And Review Summarization Zhang Xueying, Jiang Yunjiang, Shang Yue, Cheng Zhaomeng, Zhang Chi, Fan Xiaochuan, Xiao Yun, Long Bo
- An Exploratory Study On Long Dialogue Summarization: What Works And What's Next Zhang Yusen, Ni Ansong, Yu Tao, Zhang Rui, Zhu Chenguang, Deb Budhaditya, Celikyilmaz Asli, Awadallah Ahmed Hassan, Radev Dragomir
- Towards More Efficient Insertion Transformer With Fractional Positional Encoding Zhang Zhisong, Zhang Yizhe, Dolan Bill
- Ror: Read-over-read For Long Document Machine Reading Comprehension Zhao Jing, Bao Junwei, Wang Yifan, Zhou Yongwei, Wu Youzheng, He Xiaodong, Zhou Bowen
- CPM-2: Large-scale Cost-effective Pre-trained Language Models Zhengyan Zhang, Yuxian Gu, Xu Han, Shengqi Chen, Chaojun Xiao, Zhenbo Sun, Yuan Yao, Fanchao Qi, Jian Guan, Pei Ke, Yanzheng Cai, Guoyang Zeng, Zhixing Tan, Zhiyuan Liu, Minlie Huang, Wentao Han, Yang Liu, Xiaoyan Zhu, Maosong Sun
- Adapting Language Models For Zero-shot Learning By Meta-tuning On Dataset And Prompt Collections Zhong Ruiqi, Lee Kristy, Zhang Zheng, Klein Dan
- Self-guided Curriculum Learning For Neural Machine Translation Zhou Lei, Ding Liang, Duh Kevin, Watanabe Shinji, Sasano Ryohei, Takeda Koichi
- Improving Sequence-to-sequence Pre-training Via Sequence Span Rewriting Zhou Wangchunshu, Ge Tao, Xu Canwen, Xu Ke, Wei Furu
- LAFITE: Towards Language-free Training For Text-to-image Generation Zhou Yufan, Zhang Ruiyi, Chen Changyou, Li Chunyuan, Tensmeyer Chris, Yu Tong, Gu Jiuxiang, Xu Jinhui, Sun Tong
- Thinking Clearly, Talking Fast: Concept-guided Non-autoregressive Generation For Open-domain Dialogue Systems Zou Yicheng, Liu Zhihua, Hu Xingwu, Zhang Qi
- Taming Sparsely Activated Transformer With Stochastic Experts Zuo Simiao, Liu Xiaodong, Jiao Jian, Kim Young Jin, Hassan Hany, Zhang Ruofei, Zhao Tuo, Gao Jianfeng
- Palm: Scaling Language Modeling With Pathways Aakanksha Chowdhery, Sharan Narang, Jacob Devlin, Maarten Bosma, Gaurav Mishra, Adam Roberts, Paul Barham, Hyung Won Chung, Charles Sutton, Sebastian Gehrmann, Parker Schuh, Kensen Shi, Sasha Tsvyashchenko, Joshua Maynez, Abhishek Rao, Parker Barnes, Yi Tay, Noam Shazeer, Vinodkumar Prabhakaran, Emily Reif, Nan Du, Ben Hutchinson, Reiner Pope, James Bradbury, Jacob Austin, Michael Isard, Guy Gur-ari, Pengcheng Yin, Toju Duke, Anselm Levskaya, Sanjay Ghemawat, Sunipa Dev, Henryk Michalewski, Xavier Garcia, Vedant Misra, Kevin Robinson, Liam Fedus, Denny Zhou, Daphne Ippolito, David Luan, Hyeontaek Lim, Barret Zoph, Alexander Spiridonov, Ryan Sepassi, David Dohan, Shivani Agrawal, Mark Omernick, Andrew M. Dai, Thanumalayan Sankaranarayana Pillai, Marie Pellat, Aitor Lewkowycz, Erica Moreira, Rewon Child, Oleksandr Polozov, Katherine Lee, Zongwei Zhou, Xuezhi Wang, Brennan Saeta, Mark Diaz, Orhan Firat, Michele Catasta, Jason Wei, Kathy Meier-hellstern, Douglas Eck, Jeff Dean, Slav Petrov, Noah Fiedel
- Global Memory Transformer For Processing Long Documents Adel Arij Al
- Vector Representations Of Idioms In Conversational Systems Adewumi Tosin, Liwicki Foteini, Liwicki Marcus
- Vl-interpret: An Interactive Visualization Tool For Interpreting Vision-language Transformers Aflalo Estelle, Du Meng, Tseng Shao-yen, Liu Yongfei, Wu Chenfei, Duan Nan, Lal Vasudev
- Few-shot Training Llms For Project-specific Code-summarization Ahmed Toufique, Devanbu Premkumar
- Compound Tokens: Channel Fusion For Vision-language Representation Learning Aladago Maxwell Mbabilla, Piergiovanni Aj
- How Does The Pre-training Objective Affect What Large Language Models Learn About Linguistic Properties? Alajrami Ahmed, Aletras Nikolaos
- End-to-end Multimodal Representation Learning For Video Dialog Alamri Huda, Bilic Anthony, Hu Michael, Beedu Apoorva, Essa Irfan
- Input-tuning: Adapting Unfamiliar Inputs To Frozen Pretrained Models An Shengnan, Li Yifei, Lin Zeqi, Liu Qian, Chen Bei, Fu Qiang, Chen Weizhu, Zheng Nanning, Lou Jian-guang
- Tempo: Accelerating Transformer-based Model Training Through Memory Footprint Reduction Andoorveedu Muralidhar, Zhu Zhanda, Zheng Bojian, Pekhimenko Gennady
- A Comparative Study On Language Models For Task-oriented Dialogue Systems Andreas Vinsen Marselino, Winata Genta Indra, Purwarianti Ayu
- Exploring Length Generalization In Large Language Models Anil Cem, Wu Yuhuai, Andreassen Anders, Lewkowycz Aitor, Misra Vedant, Ramasesh Vinay, Slone Ambrose, Gur-ari Guy, Dyer Ethan, Neyshabur Behnam
- Selection-inference: Exploiting Large Language Models For Interpretable Logical Reasoning Antonia Creswell, Murray Shanahan, Irina Higgins
- Characterizing Verbatim Short-term Memory In Neural Language Models Armeni Kristijan, Honey Christopher, Linzen Tal
- ATTEMPT: Parameter-efficient Multi-task Tuning Via Attentional Mixtures Of Soft Prompts Asai Akari, Salehi Mohammadreza, Peters Matthew E., Hajishirzi Hannaneh
- Small Batch Sizes Improve Training Of Low-resource Neural MT Atrio Àlex R., Popescu-belis Andrei
- Parameter-efficient Finetuning Of Transformers For Source Code Ayupov Shamil, Chirkova Nadezhda
- Vl-beit: Generative Vision-language Pretraining Bao Hangbo, Wang Wenhui, Dong Li, Wei Furu
- P\(^3\)LM: Probabilistically Permuted Prophet Language Modeling For Generative Pre-training Bao Junwei, Wang Yifan, Ying Jiangyong, Gong Yeyun, Zhao Jing, Wu Youzheng, He Xiaodong
- Latent-glat: Glancing At Latent Variables For Parallel Text Generation Bao Yu, Zhou Hao, Huang Shujian, Wang Dongqi, Qian Lihua, Dai Xinyu, Chen Jiajun, Li Lei
- Grad-sam: Explaining Transformers Via Gradient Self-attention Maps Barkan Oren, Hauon Edan, Caciularu Avi, Katz Ori, Malkiel Itzik, Armstrong Omri, Koenigstein Noam
- Evidence > Intuition: Transferability Estimation For Encoder Selection Bassignana Elisa, Müller-eberstein Max, Zhang Mike, Plank Barbara
- Leveraging Pre-trained Language Models For Conversational Information Seeking From Text Bellan Patrizio, Dragoni Mauro, Ghidini Chiara
- Logical Reasoning For Task Oriented Dialogue Systems Beygi Sajjad, Fazel-zarandi Maryam, Cervone Alessandra, Krishnan Prakash, Jonnalagadda Siddhartha Reddy
- Banglanlg And Banglat5: Benchmarks And Resources For Evaluating Low-resource Natural Language Generation In Bangla Bhattacharjee Abhik, Hasan Tahmid, Ahmad Wasi Uddin, Shahriyar Rifat
- Effective Cross-task Transfer Learning For Explainable Natural Language Inference With T5 Bigoulaeva Irina, Sachdeva Rachneet, Madabushi Harish Tayyar, Villavicencio Aline, Gurevych Iryna
- BLOOM: A 176b-parameter Open-access Multilingual Language Model Bigscience Workshop, Teven Le Scao, Angela Fan, Christopher Akiki, Ellie Pavlick, Suzana Ilić, Daniel Hesslow, Roman Castagné, Alexandra Sasha Luccioni, François Yvon, Matthias Gallé, Jonathan Tow, Alexander M. Rush, Stella Biderman, Albert Webson, Pawan Sasanka Ammanamanchi, Thomas Wang, Benoît Sagot, Niklas Muennighoff, Albert Villanova Del Moral, Olatunji Ruwase, Rachel Bawden, Stas Bekman, Angelina Mcmillan-major, Iz Beltagy, Huu Nguyen, Lucile Saulnier, Samson Tan, Pedro Ortiz Suarez, Victor Sanh, Hugo Laurençon, Yacine Jernite, Julien Launay, Margaret Mitchell, Colin Raffel, Aaron Gokaslan, Adi Simhi, Aitor Soroa, Alham Fikri Aji, Amit Alfassy, Anna Rogers, Ariel Kreisberg Nitzav, Canwen Xu, Chenghao Mou, Chris Emezue, Christopher Klamm, Colin Leong, Daniel Van Strien, David Ifeoluwa Adelani, Dragomir Radev, Eduardo González Ponferrada, Efrat Levkovizh, Ethan Kim, Eyal Bar Natan, Francesco De Toni, Gérard Dupont, Germán Kruszewski, Giada Pistilli, Hady Elsahar, Hamza Benyamina, Hieu Tran, Ian Yu, Idris Abdulmumin, Isaac Johnson, Itziar Gonzalez-dios, Javier De La Rosa, Jenny Chim, Jesse Dodge, Jian Zhu, Jonathan Chang, Jörg Frohberg, Joseph Tobing, Joydeep Bhattacharjee, Khalid Almubarak, Kimbo Chen, Kyle Lo, Leandro Von Werra, Leon Weber, Long Phan, Loubna Ben Allal, Ludovic Tanguy, Manan Dey, Manuel Romero Muñoz, Maraim Masoud, María Grandury, Mario Šaško, Max Huang, Maximin Coavoux, Mayank Singh, Mike Tian-jian Jiang, Minh Chien Vu, Mohammad A. Jauhar, Mustafa Ghaleb, Nishant Subramani, Nora Kassner, Nurulaqilla Khamis, Olivier Nguyen, Omar Espejel, Ona De Gibert, Paulo Villegas, Peter Henderson, Pierre Colombo, Priscilla Amuok, Quentin Lhoest, Rheza Harliman, Rishi Bommasani, Roberto Luis López, Rui Ribeiro, Salomey Osei, Sampo Pyysalo, Sebastian Nagel, Shamik Bose, Shamsuddeen Hassan Muhammad, Shanya Sharma, Shayne Longpre, Somaieh Nikpoor, Stanislav Silberberg, Suhas Pai, Sydney Zink, Tiago Timponi Torrent, Timo Schick, Tristan Thrush, Valentin Danchev, Vassilina Nikoulina, Veronika Laippala, Violette Lepercq, Vrinda Prabhu, Zaid Alyafeai, Zeerak Talat, Arun Raja, Benjamin Heinzerling, Chenglei Si, Davut Emre Taşar, Elizabeth Salesky, Sabrina J. Mielke, Wilson Y. Lee, Abheesht Sharma, Andrea Santilli, Antoine Chaffin, Arnaud Stiegler, Debajyoti Datta, Eliza Szczechla, Gunjan Chhablani, Han Wang, Harshit Pandey, Hendrik Strobelt, Jason Alan Fries, Jos Rozen, Leo Gao, Lintang Sutawika, M Saiful Bari, Maged S. Al-shaibani, Matteo Manica, Nihal Nayak, Ryan Teehan, Samuel Albanie, Sheng Shen, Srulik Ben-david, Stephen H. Bach, Taewoon Kim, Tali Bers, Thibault Fevry, Trishala Neeraj, Urmish Thakker, Vikas Raunak, Xiangru Tang, Zheng-xin Yong, Zhiqing Sun, Shaked Brody, Yallow Uri, Hadar Tojarieh, Adam Roberts, Hyung Won Chung, Jaesung Tae, Jason Phang, Ofir Press, Conglong Li, Deepak Narayanan, Hatim Bourfoune, Jared Casper, Jeff Rasley, Max Ryabinin, Mayank Mishra, Minjia Zhang, Mohammad Shoeybi, Myriam Peyrounette, Nicolas Patry, Nouamane Tazi, Omar Sanseviero, Patrick Von Platen, Pierre Cornette, Pierre François Lavallée, Rémi Lacroix, Samyam Rajbhandari, Sanchit Gandhi, Shaden Smith, Stéphane Requena, Suraj Patil, Tim Dettmers, Ahmed Baruwa, Amanpreet Singh, Anastasia Cheveleva, Anne-laure Ligozat, Arjun Subramonian, Aurélie Névéol, Charles Lovering, Dan Garrette, Deepak Tunuguntla, Ehud Reiter, Ekaterina Taktasheva, Ekaterina Voloshina, Eli Bogdanov, Genta Indra Winata, Hailey Schoelkopf, Jan-christoph Kalo, Jekaterina Novikova, Jessica Zosa Forde, Jordan Clive, Jungo Kasai, Ken Kawamura, Liam Hazan, Marine Carpuat, Miruna Clinciu, Najoung Kim, Newton Cheng, Oleg Serikov, Omer Antverg, Oskar Van Der Wal, Rui Zhang, Ruochen Zhang, Sebastian Gehrmann, Shachar Mirkin, Shani Pais, Tatiana Shavrina, Thomas Scialom, Tian Yun, Tomasz Limisiewicz, Verena Rieser, Vitaly Protasov, Vladislav Mikhailov, Yada Pruksachatkun, Yonatan Belinkov, Zachary Bamberger, Zdeněk Kasner, Alice Rueda, Amanda Pestana, Amir Feizpour, Ammar Khan, Amy Faranak, Ana Santos, Anthony Hevia, Antigona Unldreaj, Arash Aghagol, Arezoo Abdollahi, Aycha Tammour, Azadeh Hajihosseini, Bahareh Behroozi, Benjamin Ajibade, Bharat Saxena, Carlos Muñoz Ferrandis, Daniel Mcduff, Danish Contractor, David Lansky, Davis David, Douwe Kiela, Duong A. Nguyen, Edward Tan, Emi Baylor, Ezinwanne Ozoani, Fatima Mirza, Frankline Ononiwu, Habib Rezanejad, Hessie Jones, Indrani Bhattacharya, Irene Solaiman, Irina Sedenko, Isar Nejadgholi, Jesse Passmore, Josh Seltzer, Julio Bonis Sanz, Livia Dutra, Mairon Samagaio, Maraim Elbadri, Margot Mieskes, Marissa Gerchick, Martha Akinlolu, Michael Mckenna, Mike Qiu, Muhammed Ghauri, Mykola Burynok, Nafis Abrar, Nazneen Rajani, Nour Elkott, Nour Fahmy, Olanrewaju Samuel, Ran An, Rasmus Kromann, Ryan Hao, Samira Alizadeh, Sarmad Shubber, Silas Wang, Sourav Roy, Sylvain Viguier, Thanh Le, Tobi Oyebade, Trieu Le, Yoyo Yang, Zach Nguyen, Abhinav Ramesh Kashyap, Alfredo Palasciano, Alison Callahan, Anima Shukla, Antonio Miranda-escalada, Ayush Singh, Benjamin Beilharz, Bo Wang, Caio Brito, Chenxi Zhou, Chirag Jain, Chuxin Xu, Clémentine Fourrier, Daniel León Periñán, Daniel Molano, Dian Yu, Enrique Manjavacas, Fabio Barth, Florian Fuhrimann, Gabriel Altay, Giyaseddin Bayrak, Gully Burns, Helena U. Vrabec, Imane Bello, Ishani Dash, Jihyun Kang, John Giorgi, Jonas Golde, Jose David Posada, Karthik Rangasai Sivaraman, Lokesh Bulchandani, Lu Liu, Luisa Shinzato, Madeleine Hahn De Bykhovetz, Maiko Takeuchi, Marc Pàmies, Maria A Castillo, Marianna Nezhurina, Mario Sänger, Matthias Samwald, Michael Cullan, Michael Weinberg, Michiel De Wolf, Mina Mihaljcic, Minna Liu, Moritz Freidank, Myungsun Kang, Natasha Seelam, Nathan Dahlberg, Nicholas Michio Broad, Nikolaus Muellner, Pascale Fung, Patrick Haller, Ramya Chandrasekhar, Renata Eisenberg, Robert Martin, Rodrigo Canalli, Rosaline Su, Ruisi Su, Samuel Cahyawijaya, Samuele Garda, Shlok S Deshmukh, Shubhanshu Mishra, Sid Kiblawi, Simon Ott, Sinee Sang-aroonsiri, Srishti Kumar, Stefan Schweter, Sushil Bharati, Tanmay Laud, Théo Gigant, Tomoya Kainuma, Wojciech Kusa, Yanis Labrak, Yash Shailesh Bajaj, Yash Venkatraman, Yifan Xu, Yingxin Xu, Yu Xu, Zhe Tan, Zhongli Xie, Zifan Ye, Mathilde Bras, Younes Belkada, Thomas Wolf
- Prompting Language Models For Linguistic Structure Blevins Terra, Gonen Hila, Zettlemoyer Luke
- Inpars: Data Augmentation For Information Retrieval Using Large Language Models Bonifacio Luiz, Abonizio Hugo, Fadaee Marzieh, Nogueira Rodrigo
- Looking For A Handsome Carpenter! Debiasing GPT-3 Job Advertisements Borchers Conrad, Gala Dalia Sara, Gilburt Benjamin, Oravkin Eduard, Bounsi Wilfried, Asano Yuki M., Kirk Hannah Rose
- Petals: Collaborative Inference And Fine-tuning Of Large Models Borzunov Alexander, Baranchuk Dmitry, Dettmers Tim, Ryabinin Max, Belkada Younes, Chumachenko Artem, Samygin Pavel, Raffel Colin
- Understanding Performance Of Long-document Ranking Models Through Comprehensive Evaluation And Leaderboarding Boytsov Leonid, Akinpelu David, Lin Tianyi, Gao Fangwei, Zhao Yutian, Huang Jeffrey, Katyal Nipun, Nyberg Eric
- Evaluating The Susceptibility Of Pre-trained Language Models Via Handcrafted Adversarial Examples Branch Hezekiah J., Cefalu Jonathan Rodriguez, Mchugh Jeremy, Hujer Leyla, Bahl Aditya, Iglesias Daniel Del Castillo, Heichman Ron, Darwishi Ramesh
- Large Language Models Can Implement Policy Iteration Brooks Ethan, Walls Logan, Lewis Richard L., Singh Satinder
- HIBRIDS: Attention With Hierarchical Biases For Structure-aware Long Document Summarization Cao Shuyang, Wang Lu
- A Model-agnostic Data Manipulation Method For Persona-based Dialogue Generation Cao Yu, Bi Wei, Fang Meng, Shi Shuming, Tao Dacheng
- Pre-trained Neural Language Models For Automatic Mobile App User Feedback Answer Generation Cao Yue, Fard Fatemeh H.
- Robust Preference Learning For Storytelling Via Contrastive Reinforcement Learning Castricato Louis, Havrilla Alexander, Matiana Shahbuland, Pieler Michael, Ye Anbang, Yang Ian, Frazier Spencer, Riedl Mark
- Ernie-code: Beyond English-centric Cross-lingual Pretraining For Programming Languages Chai Yekun, Wang Shuohuan, Pang Chao, Sun Yu, Tian Hao, Wu Hua
- Speechprompt: An Exploration Of Prompt Tuning On Generative Spoken Language Model For Speech Processing Tasks Chang Kai-wei, Tseng Wei-cheng, Li Shang-wen, Lee Hung-yi
- Analyzing Semantic Faithfulness Of Language Models Via Input Intervention On Question Answering Chaturvedi Akshay, Bhar Swarnadeep, Saha Soumadeep, Garain Utpal, Asher Nicholas
- Character-centric Story Visualization Via Visual Planning And Token Alignment Chen Hong, Han Rujun, Wu Te-lin, Nakayama Hideki, Peng Nanyun
- Xdoc: Unified Pre-training For Cross-format Document Understanding Chen Jingye, Lv Tengchao, Cui Lei, Zhang Cha, Wei Furu
- Imputing Out-of-vocabulary Embeddings With LOVE Makes Language Models Robust With Little Cost Chen Lihu, Varoquaux Gaël, Suchanek Fabian M.
- Leveraging Natural Supervision For Language Representation Learning And Generation Chen Mingda
- Improving In-context Few-shot Learning Via Self-supervised Training Chen Mingda, Du Jingfei, Pasunuru Ramakanth, Mihaylov Todor, Iyer Srini, Stoyanov Veselin, Kozareva Zornitsa
- Exploring And Exploiting Multi-granularity Representations For Machine Reading Comprehension Chen Nuo, You Chenyu
- Murag: Multimodal Retrieval-augmented Generator For Open Question Answering Over Images And Text Chen Wenhu, Hu Hexiang, Chen Xi, Verga Pat, Cohen William W.
- Pali: A Jointly-scaled Multilingual Language-image Model Chen Xi, Wang Xiao, Changpinyo Soravit, Piergiovanni Aj, Padlewski Piotr, Salz Daniel, Goodman Sebastian, Grycner Adam, Mustafa Basil, Beyer Lucas, Kolesnikov Alexander, Puigcerver Joan, Ding Nan, Rong Keran, Akbari Hassan, Mishra Gaurav, Xue Linting, Thapliyal Ashish, Bradbury James, Kuo Weicheng, Seyedhosseini Mojtaba, Jia Chao, Ayan Burcu Karagol, Riquelme Carlos, Steiner Andreas, Angelova Anelia, Zhai Xiaohua, Houlsby Neil, Soricut Radu
- Transformers Go For The Lols: Generating (humourous) Titles From Scientific Abstracts End-to-end Chen Yanran, Eger Steffen
- Empowering Parameter-efficient Transfer Learning By Recognizing The Kernel Structure In Self-attention Chen Yifan, Hazarika Devamanyu, Namazifar Mahdi, Liu Yang, Jin Di, Hakkani-tur Dilek
- Testing Occupational Gender Bias In Language Models: Towards Robust Measurement And Zero-shot Debiasing Chen Yuen, Raghuram Vethavikashini Chithrra, Mattern Justus, Sachan Mrinmaya, Mihalcea Rada, Schölkopf Bernhard, Jin Zhijing
- Adaprompt: Adaptive Model Training For Prompt-based NLP Chen Yulong, Liu Yang, Dong Li, Wang Shuohang, Zhu Chenguang, Zeng Michael, Zhang Yue
- Curriculum: A Broad-coverage Benchmark For Linguistic Phenomena In Natural Language Understanding Chen Zeming, Gao Qiyue
- Altclip: Altering The Language Encoder In CLIP For Extended Language Capabilities Chen Zhongzhi, Liu Guang, Zhang Bo-wen, Ye Fulong, Yang Qinghong, Wu Ledell
- Capture Salient Historical Information: A Fast And Accurate Non-autoregressive Model For Multi-turn Spoken Language Understanding Cheng Lizhi, Jia Weijia, Yang Wenmian
- Is Multiwoz A Solved Task? An Interactive TOD Evaluation Framework With User Simulator Cheng Qinyuan, Li Linyang, Quan Guofeng, Gao Feng, Mou Xiaofeng, Qiu Xipeng
- Vault: Augmenting The Vision-and-language Transformer For Sentiment Classification On Social Media Chochlakis Georgios, Srinivasan Tejas, Thomason Jesse, Narayanan Shrikanth
- Language-family Adapters For Low-resource Multilingual Neural Machine Translation Chronopoulou Alexandra, Stojanovski Dario, Fraser Alexander
- Meta-learning Fast Weight Language Models Clark Kevin, Guu Kelvin, Chang Ming-wei, Pasupat Panupong, Hinton Geoffrey, Norouzi Mohammad
- LSG Attention: Extrapolation Of Pretrained Transformers To Long Sequences Condevaux Charles, Harispe Sébastien
- Building The Intent Landscape Of Real-world Conversational Corpora With Extractive Question-answering Transformers Corbeil Jean-philippe, Li Mia Taige, Ghavidel Hadi Abdi
- Continual Pre-training Mitigates Forgetting In Language And Vision Cossu Andrea, Tuytelaars Tinne, Carta Antonio, Passaro Lucia, Lomonaco Vincenzo, Bacciu Davide
- M6-rec: Generative Pretrained Language Models Are Open-ended Recommender Systems Cui Zeyu, Ma Jianxin, Zhou Chang, Zhou Jingren, Yang Hongxia
- Neural Knowledge Bank For Pretrained Transformers Dai Damai, Jiang Wenbin, Dong Qingxiu, Lyu Yajuan, She Qiaoqiao, Sui Zhifang
- Why Can GPT Learn In-context? Language Models Implicitly Perform Gradient Descent As Meta-optimizers Dai Damai, Sun Yutao, Dong Li, Hao Yaru, Ma Shuming, Sui Zhifang, Wei Furu
- Understanding BLOOM: An Empirical Study On Diverse NLP Tasks Dakle Parag Pravin, Rallabandi Saikrishna, Raghavan Preethi
- Llm.int8(): 8-bit Matrix Multiplication For Transformers At Scale Dettmers Tim, Lewis Mike, Belkada Younes, Zettlemoyer Luke
- Less Is More: Linear Layers On CLIP Features As Powerful Vizwiz Model Deuser Fabian, Habel Konrad, Rösch Philipp J., Oswald Norbert
- Pre-training Transformer Models With Sentence-level Objectives For Answer Sentence Selection Di Liello Luca, Garg Siddhant, Soldaini Luca, Moschitti Alessandro
- Black-box Prompt Learning For Pre-trained Language Models Diao Shizhe, Huang Zhichao, Xu Ruijia, Li Xuechun, Lin Yong, Zhou Xiao, Zhang Tong
- Cogview2: Faster And Better Text-to-image Generation Via Hierarchical Transformers Ding Ming, Zheng Wendi, Hong Wenyi, Tang Jie
- Parallel Attention Forcing For Machine Translation Dou Qingyun, Gales Mark
- Coarse-to-fine Vision-language Pre-training With Fusion In The Backbone Dou Zi-yi, Kamath Aishwarya, Gan Zhe, Zhang Pengchuan, Wang Jianfeng, Li Linjie, Liu Zicheng, Liu Ce, Lecun Yann, Peng Nanyun, Gao Jianfeng, Wang Lijuan
- Contrastive Learning With Bidirectional Transformers For Sequential Recommendation Du Hanwen, Shi Hui, Zhao Pengpeng, Wang Deqing, Sheng Victor S., Liu Yanchi, Liu Guanfeng, Zhao Lei
- A Survey Of Vision-language Pre-trained Models Du Yifan, Liu Zikang, Li Junyi, Zhao Wayne Xin
- Probing For Incremental Parse States In Autoregressive Language Models Eisape Tiwalayo, Gangireddy Vineet, Levy Roger P., Kim Yoon
- Star: Bootstrapping Reasoning With Reasoning Zelikman Eric, Wu Yuhuai, Mu Jesse, Goodman Noah D.
- Imagination Is All You Need! Curved Contrastive Learning For Abstract Sequence Modeling Utilized On Long Short-term Dialogue Planning Erker Justus-jonas, Schaffer Stefan, Spanakis Gerasimos
- Leveraging Pre-trained Models For Failure Analysis Triplets Generation Ezukwoke Kenneth, Hoayek Anis, Batton-hubert Mireille, Boucher Xavier, Gounet Pascal, Adrian Jerome
- Progressive Text-to-image Generation Fei Zhengcong, Fan Mingyuan, Zhu Li, Huang Junshi
- CERES: Pretraining Of Graph-conditioned Transformer For Semi-structured Session Data Feng Rui, Luo Chen, Yin Qingyu, Yin Bing, Zhao Tuo, Zhang Chao
- Learn To Remember: Transformer With Recurrent Memory For Document-level Machine Translation Feng Yukun, Li Feng, Song Ziang, Zheng Boyuan, Koehn Philipp
- Towards Opening The Black Box Of Neural Machine Translation: Source And Target Interpretations Of The Transformer Ferrando Javier, Gállego Gerard I., Alastruey Belen, Escolano Carlos, Costa-jussà Marta R.
- Should Attention Be All We Need? The Epistemic And Ethical Implications Of Unification In Machine Learning Fishman Nic, Hancox-li Leif
- GPTQ: Accurate Post-training Quantization For Generative Pre-trained Transformers Frantar Elias, Ashkboos Saleh, Hoefler Torsten, Alistarh Dan
- Revision Transformers: Instructing Language Models To Change Their Values Friedrich Felix, Stammer Wolfgang, Schramowski Patrick, Kersting Kristian
- Adapterbias: Parameter-efficient Token-dependent Representation Shift For Adapters In NLP Tasks Fu Chin-lun, Chen Zih-ching, Lee Yun-ru, Lee Hung-yi
- Hungry Hungry Hippos: Towards Language Modeling With State Space Models Fu Daniel Y., Dao Tri, Saab Khaled K., Thomas Armin W., Rudra Atri, Ré Christopher
- An Empirical Study Of End-to-end Video-language Transformers With Masked Visual Modeling Fu Tsu-jui, Li Linjie, Gan Zhe, Lin Kevin, Wang William Yang, Wang Lijuan, Liu Zicheng
- MIGA: A Unified Multi-task Generation Framework For Conversational Text-to-sql Fu Yingwen, Ou Wenjie, Yu Zhou, Lin Yue
- Pnlp-mixer: An Efficient All-mlp Architecture For Language Fusco Francesco, Pascual Damian, Staar Peter, Antognini Diego
- WAVPROMPT: Towards Few-shot Spoken Language Understanding With Frozen Language Models Gao Heting, Ni Junrui, Qian Kaizhi, Zhang Yang, Chang Shiyu, Hasegawa-johnson Mark
- Visual Prompt Tuning For Test-time Domain Adaptation Gao Yunhe, Shi Xingjian, Zhu Yi, Wang Hao, Tang Zhiqiang, Zhou Xiong, Li Mu, Metaxas Dimitris N.
- Cramming: Training A Language Model On A Single GPU In One Day Geiping Jonas, Goldstein Tom
- Microbert: Effective Training Of Low-resource Monolingual Berts Through Parameter Reduction And Multitask Learning Gessler Luke, Zeldes Amir
- Re2g: Retrieve, Rerank, Generate Glass Michael, Rossiello Gaetano, Chowdhury Md Faisal Mahbub, Naik Ankita Rajaram, Cai Pengshan, Gliozzo Alfio
- Diffuseq: Sequence To Sequence Text Generation With Diffusion Models Gong Shansan, Li Mukai, Feng Jiangtao, Wu Zhiyong, Kong Lingpeng
- Does Entity Abstraction Help Generative Transformers Reason? Gontier Nicolas, Reddy Siva, Pal Christopher
- Massively Multilingual Lexical Specialization Of Multilingual Transformers Green Tommaso, Ponzetto Simone Paolo, Glavaš Goran
- Continuous Decomposition Of Granularity For Neural Paraphrase Generation Gu Xiaodong, Zhang Zhaowei, Lee Sang-woo, Yoo Kang Min, Ha Jung-woo
- Generating Coherent Narratives By Learning Dynamic And Discrete Entity States With A Contrastive Framework Guan Jian, Yang Zhenyu, Zhang Rongsheng, Hu Zhipeng, Huang Minlie
- Transkimmer: Transformer Learns To Layer-wise Skim Guan Yue, Li Zhengyi, Leng Jingwen, Lin Zhouhan, Guo Minyi
- Unixcoder: Unified Cross-modal Pre-training For Code Representation Guo Daya, Lu Shuai, Duan Nan, Wang Yanlin, Zhou Ming, Yin Jian
- Visually-augmented Pretrained Language Models For NLP Tasks Without Images Guo Hangyu, Zhou Kun, Zhao Wayne Xin, Zhang Qinyu, Wen Ji-rong
- STI: Turbocharge NLP Inference At The Edge Via Elastic Pipelining Guo Liwei, Choe Wonkyo, Lin Felix Xiaozhu
- MALM: Mixing Augmented Language Modeling For Zero-shot Machine Translation Gupta Kshitij
- Quala-minilm: A Quantized Length Adaptive Minilm Guskin Shira, Wasserblat Moshe, Wang Chang, Shen Haihao
- Thinking About GPT-3 In-context Learning For Biomedical IE? Think Again Gutiérrez Bernal Jiménez, Mcneal Nikolas, Washington Clay, Chen You, Li Lang, Sun Huan, Su Yu
- LUNA: Language Understanding With Number Augmentations On Transformers Via Number Plugins And Pre-training Han Hongwei, Xu Jialiang, Zhou Mengyu, Shao Yijia, Han Shi, Zhang Dongmei
- FOLIO: Natural Language Reasoning With First-order Logic Han Simeng, Schoelkopf Hailey, Zhao Yilun, Qi Zhenting, Riddell Martin, Zhou Wenfei, Coady James, Peng David, Qiao Yujie, Benson Luke, Sun Lucy, Wardle-solano Alex, Szabo Hannah, Zubova Ekaterina, Burtell Matthew, Fan Jonathan, Liu Yixin, Wong Brian, Sailor Malcolm, Ni Ansong, Nan Linyong, Kasai Jungo, Yu Tao, Zhang Rui, Fabbri Alexander R., Kryscinski Wojciech, Yavuz Semih, Liu Ye, Lin Xi Victoria, Joty Shafiq, Zhou Yingbo, Xiong Caiming, Ying Rex, Cohan Arman, Radev Dragomir
- SSD-LM: Semi-autoregressive Simplex-based Diffusion Language Model For Text Generation And Modular Control Han Xiaochuang, Kumar Sachin, Tsvetkov Yulia
- ORCA: Interpreting Prompted Language Models Via Locating Supporting Data Evidence In The Ocean Of Pretraining Data Han Xiaochuang, Tsvetkov Yulia
- Learnable Model Augmentation Self-supervised Learning For Sequential Recommendation Hao Yongjing, Zhao Pengpeng, Xian Xuefeng, Liu Guanfeng, Wang Deqing, Zhao Lei, Liu Yanchi, Sheng Victor S.
- Supervised Visual Attention For Simultaneous Multimodal Machine Translation Haralampieva Veneta, Caglayan Ozan, Specia Lucia
- How Much Does Attention Actually Attend? Questioning The Importance Of Attention In Pretrained Transformers Hassid Michael, Peng Hao, Rotem Daniel, Kasai Jungo, Montero Ivan, Smith Noah A., Schwartz Roy
- Understanding Transformer Memorization Recall Through Idioms Haviv Adi, Cohen Ido, Gidron Jacob, Schuster Roei, Goldberg Yoav, Geva Mor
- Transformer Language Models Without Positional Encodings Still Learn Positional Information Haviv Adi, Ram Ori, Press Ofir, Izsak Peter, Levy Omer
- Rethinking With Retrieval: Faithful Large Language Model Inference He Hangfeng, Zhang Hongming, Roth Dan
- SPACE-3: Unified Dialog Model Pre-training For Task-oriented Dialog Understanding And Generation He Wanwei, Dai Yinpei, Yang Min, Sun Jian, Huang Fei, Si Luo, Li Yongbin
- TRAC: A Textual Benchmark For Reasoning About Actions And Change He Weinan, Huang Canming, Xiao Zhanhao, Liu Yongmei
- Hyperprompt: Prompt-based Task-conditioning Of Transformers He Yun, Zheng Huaixiu Steven, Tay Yi, Gupta Jai, Du Yu, Aribandi Vamsi, Zhao Zhe, Li Yaguang, Chen Zhao, Metzler Donald, Cheng Heng-tze, Chi Ed H.
- Bundle MCR: Towards Conversational Bundle Recommendation He Zhankui, Zhao Handong, Yu Tong, Kim Sungchul, Du Fan, Mcauley Julian
- Shapley Head Pruning: Identifying And Removing Interference In Multilingual Transformers Held William, Yang Diyi
- Compressing Pre-trained Transformers Via Low-bit Nxm Sparsity For Natural Language Understanding Holmes Connor, Zhang Minjia, He Yuxiong, Wu Bo
- Cogvideo: Large-scale Pretraining For Text-to-video Generation Via Transformers Hong Wenyi, Ding Ming, Zheng Wendi, Liu Xinghan, Tang Jie
- Syntax-guided Localized Self-attention By Constituency Syntactic Distance Hou Shengyuan, Kai Jushi, Xue Haotian, Zhu Bingyu, Yuan Bo, Huang Longtao, Wang Xinbing, Lin Zhouhan
- XDBERT: Distilling Visual Information To BERT From Cross-modal Systems To Improve Language Understanding Hsu Chan-jan, Lee Hung-yi, Tsao Yu
- Empirical Evaluation Of Post-training Quantization Methods For Language Tasks Hu Ting, Meinel Christoph, Yang Haojin
- PLANET: Dynamic Content Planning In Autoregressive Transformers For Long-form Text Generation Hu Zhe, Chan Hou Pong, Liu Jiachen, Xiao Xinyan, Wu Hua, Huang Lifu
- Empowering Language Models With Knowledge Graph Reasoning For Question Answering Hu Ziniu, Xu Yichong, Yu Wenhao, Wang Shuohang, Yang Ziyi, Zhu Chenguang, Chang Kai-wei, Sun Yizhou
- Transformer Quality In Linear Time Hua Weizhe, Dai Zihang, Liu Hanxiao, Le Quoc V.
- Autoregressive Entity Generation For End-to-end Task-oriented Dialog Huang Guanhuan, Quan Xiaojun, Wang Qifan
- Detecting Unintended Memorization In Language-model-fused ASR Huang W. Ronny, Chien Steve, Thakkar Om, Mathews Rajiv
- Layoutlmv3: Pre-training For Document AI With Unified Text And Image Masking Huang Yupan, Lv Tengchao, Cui Lei, Lu Yutong, Wei Furu
- MCP: Self-supervised Pre-training For Personalized Chatbots With Multi-level Contrastive Sampling Huang Zhaoheng, Dou Zhicheng, Zhu Yutao, Ma Zhengyi
- Welm: A Well-read Pre-trained Language Model For Chinese Su Hui, Zhou Xiao, Yu Houjin, Shen Xiaoyu, Chen Yuwen, Zhu Zilin, Yu Yang, Zhou Jie
- Do Berts Learn To Use Browser User Interface? Exploring Multi-step Tasks With Unified Vision-and-language Berts Iki Taichi, Aizawa Akiko
- Efficient Long-text Understanding With Short-text Models Ivgi Maor, Shaham Uri, Berant Jonathan
- HINT: Hypernetwork Instruction Tuning For Efficient Zero- & Few-shot Generalisation Ivison Hamish, Bhagia Akshita, Wang Yizhong, Hajishirzi Hannaneh, Peters Matthew
- Litetransformersearch: Training-free Neural Architecture Search For Efficient Language Models Javaheripi Mojan, De Rosa Gustavo H., Mukherjee Subhabrata, Shah Shital, Religa Tomasz L., Mendes Caio C. T., Bubeck Sebastien, Koushanfar Farinaz, Dey Debadeepta
- DP-KB: Data Programming With Knowledge Bases Improves Transformer Fine Tuning For Answer Sentence Selection Jedema Nic, Vu Thuy, Gupta Manish, Moschitti Alessandro
- Lamemo: Language Modeling With Look-ahead Memory Ji Haozhe, Zhang Rongsheng, Yang Zhenyu, Hu Zhipeng, Huang Minlie
- Survey Of Hallucination In Natural Language Generation Ji Ziwei, Lee Nayeon, Frieske Rita, Yu Tiezheng, Su Dan, Xu Yan, Ishii Etsuko, Bang Yejin, Chen Delong, Dai Wenliang, Chan Ho Shu, Madotto Andrea, Fung Pascale
- Visual Prompt Tuning Jia Menglin, Tang Luming, Chen Bor-chun, Cardie Claire, Belongie Serge, Hariharan Bharath, Lim Ser-nam
- Contrastive Learning For Prompt-based Few-shot Language Learners Jian Yiren, Gao Chongyang, Vosoughi Soroush
- A Simple Contrastive Learning Objective For Alleviating Neural Text Degeneration Jiang Shaojie, Zhang Ruqing, Vakulenko Svitlana, De Rijke Maarten
- Large Language Models Can Self-improve Huang Jiaxin, Gu Shixiang Shane, Hou Le, Wu Yuexin, Wang Xuezhi, Yu Hongkun, Han Jiawei
- Probing Script Knowledge From Pre-trained Models Jin Zijian, Zhang Xingyu, Yu Mo, Huang Lifu
- Training Compute-optimal Large Language Models Jordan Hoffmann, Sebastian Borgeaud, Arthur Mensch, Elena Buchatskaya, Trevor Cai, Eliza Rutherford, Diego De Las Casas, Lisa Anne Hendricks, Johannes Welbl, Aidan Clark, Tom Hennigan, Eric Noland, Katie Millican, George Van Den Driessche, Bogdan Damoc, Aurelia Guy, Simon Osindero, Karen Simonyan, Erich Elsen, Jack W. Rae, Oriol Vinyals, Laurent Sifre
- Latency Adjustable Transformer Encoder For Language Understanding Kachuee Sajjad, Sharifkhani Mohammad
- A New Path: Scaling Vision-and-language Navigation With Synthetic Instructions And Imitation Learning Kamath Aishwarya, Anderson Peter, Wang Su, Koh Jing Yu, Ku Alexander, Waters Austin, Yang Yinfei, Baldridge Jason, Parekh Zarana
- Gpt-neo For Commonsense Reasoning -- A Theoretical And Practical Lens Kashyap Rohan, Kashyap Vivek, P. Narendra C.
- Fie: Building A Global Probability Space By Leveraging Early Fusion In Encoder For Open-domain Question Answering Kedia Akhil, Zaidi Mohd Abbas, Lee Haejun
- What Makes Data-to-text Generation Hard For Pretrained Language Models? Keymanesh Moniba, Benton Adrian, Dredze Mark
- High Recall Data-to-text Generation With Progressive Edit Kim Choonghan, Lee Gary Geunbae
- Who Says Elephants Can't Run: Bringing Large Scale Moe Models Into Cloud Scale Production Kim Young Jin, Henry Rawn, Fahim Raffy, Awadalla Hany Hassan
- Iterative Vision-and-language Navigation Krantz Jacob, Banerjee Shurjo, Zhu Wang, Corso Jason, Anderson Peter, Lee Stefan, Thomason Jesse
- Downstream Datasets Make Surprisingly Good Pretraining Corpora Krishna Kundan, Garg Saurabh, Bigham Jeffrey P., Lipton Zachary C.
- Few-shot Question Generation For Personalized Feedback In Intelligent Tutoring Systems Kulshreshtha Devang, Shayan Muhammad, Belfer Robert, Reddy Siva, Serban Iulian Vlad, Kochmar Ekaterina
- Mucot: Multilingual Contrastive Training For Question-answering In Low-resource Languages Kumar Gokul Karthik, Gehlot Abhishek Singh, Mullappilly Sahal Shaji, Nandakumar Karthik
- The Optimal BERT Surgeon: Scalable And Accurate Second-order Pruning For Large Language Models Kurtic Eldar, Campos Daniel, Nguyen Tuan, Frantar Elias, Kurtz Mark, Fineran Benjamin, Goin Michael, Alistarh Dan
- Natural Language To Code Using Transformers Kusupati Uday, Ailavarapu Venkata Ravi Teja
- Language Detoxification With Attribute-discriminative Latent Space Kwak Jin Myung, Kim Minseon, Hwang Sung Ju
- Does BERT Really Agree? Fine-grained Analysis Of Lexical Dependence On A Syntactic Task Lasri Karim, Lenci Alessandro, Poibeau Thierry
- Internet-augmented Language Models Through Few-shot Prompting For Open-domain Question Answering Lazaridou Angeliki, Gribovskaya Elena, Stokowiec Wojciech, Grigorev Nikolai
- Littlebird: Efficient Faster & Longer Transformer For Question Answering Lee Minchul, Han Kijong, Shin Myeong Cheol
- Simple Recurrence Improves Masked Language Models Lei Tao, Tian Ran, Bastings Jasmijn, Parikh Ankur P.
- Fast Inference From Transformers Via Speculative Decoding Leviathan Yaniv, Kalman Matan, Matias Yossi
- GPT-D: Inducing Dementia-related Linguistic Anomalies By Deliberate Degradation Of Artificial Neural Language Models Li Changye, Knopman David, Xu Weizhe, Cohen Trevor, Pakhomov Serguei
- Large Language Models With Controllable Working Memory Li Daliang, Rawat Ankit Singh, Zaheer Manzil, Wang Xin, Lukasik Michal, Veit Andreas, Yu Felix, Kumar Sanjiv
- Summer: Wechat Neural Machine Translation Systems For The WMT22 Biomedical Translation Task Li Ernan, Meng Fandong, Zhou Jie
- Fine-grained Semantically Aligned Vision-language Pre-training Li Juncheng, He Xin, Wei Longhui, Qian Long, Zhu Linchao, Xie Lingxi, Zhuang Yueting, Tian Qi, Tang Siliang
- Learning To Transfer Prompts For Text Generation Li Junyi, Tang Tianyi, Nie Jian-yun, Wen Ji-rong, Zhao Wayne Xin
- ELMER: A Non-autoregressive Pre-trained Language Model For Efficient And Effective Text Generation Li Junyi, Tang Tianyi, Zhao Wayne Xin, Nie Jian-yun, Wen Ji-rong
- Personalized Prompt Learning For Explainable Recommendation Li Lei, Zhang Yongfeng, Chen Li
- LAVENDER: Unifying Video-language Understanding As Masked Language Modeling Li Linjie, Gan Zhe, Lin Kevin, Lin Chung-ching, Liu Zicheng, Liu Ce, Wang Lijuan
- Modeling Coreference Relations In Visual Dialog Li Mingxiao, Moens Marie-francine
- Pre-trained Language Models For Interactive Decision-making Li Shuang, Puig Xavier, Paxton Chris, Du Yilun, Wang Clinton, Fan Linxi, Chen Tao, Huang De-an, Akyürek Ekin, Anandkumar Anima, Andreas Jacob, Mordatch Igor, Torralba Antonio, Zhu Yuke
- Evade The Trap Of Mediocrity: Promoting Diversity And Novelty In Text Generation Via Concentrating Attention Li Wenhao, Yi Xiaoyuan, Hu Jinyi, Sun Maosong, Xie Xing
- Evaluating Psychological Safety Of Large Language Models Li Xingxuan, Li Yutong, Qiu Lin, Joty Shafiq, Bing Lidong
- P-transformer: Towards Better Document-to-document Neural Machine Translation Li Yachao, Li Junhui, Jiang Jing, Tao Shimin, Yang Hao, Zhang Min
- Uni-eden: Universal Encoder-decoder Network By Multi-granular Vision-language Pre-training Li Yehao, Fan Jiahao, Pan Yingwei, Yao Ting, Lin Weiyao, Mei Tao
- VALHALLA: Visual Hallucination For Machine Translation Li Yi, Panda Rameswar, Kim Yoon, Chen Chun-fu, Feris Rogerio, Cox David, Vasconcelos Nuno
- Exploring Visual Interpretability For Contrastive Language-image Pre-training Li Yi, Wang Hualiang, Duan Yiqun, Xu Hang, Li Xiaomeng
- Clinical-longformer And Clinical-bigbird: Transformers For Long Clinical Sequences Li Yikuan, Wehbe Ramsey M., Ahmad Faraz S., Wang Hanyin, Luo Yuan
- Accelerating Attention Through Gradient-based Learned Runtime Pruning Li Zheng, Ghodrati Soroush, Yazdanbakhsh Amir, Esmaeilzadeh Hadi, Kang Mingu
- Life After BERT: What Do Other Muppets Understand About Language? Lialin Vladislav, Zhao Kevin, Shivagunde Namrata, Rumshisky Anna
- Bjtu-wechat's Systems For The WMT22 Chat Translation Task Liang Yunlong, Meng Fandong, Xu Jinan, Chen Yufeng, Zhou Jie
- Simple And Effective Gradient-based Tuning Of Sequence-to-sequence Models Lichtarge Jared, Alberti Chris, Kumar Shankar
- ADAPT: Vision-language Navigation With Modality-aligned Action Prompts Lin Bingqian, Zhu Yi, Chen Zicong, Liang Xiwen, Liu Jianzhuang, Liang Xiaodan
- Gentus: Simulating User Behaviour And Language In Task-oriented Dialogues With Generative Transformers Lin Hsien-chin, Geishauser Christian, Feng Shutong, Lubis Nurul, Van Niekerk Carel, Heck Michael, Gašić Milica
- Contextual Transformer For Offline Meta Reinforcement Learning Lin Runji, Li Ye, Feng Xidong, Zhang Zhaowei, Fung Xian Hong Wu, Zhang Haifeng, Wang Jun, Du Yali, Yang Yaodong
- On Task-adaptive Pretraining For Dialogue Response Selection Lin Tzu-hsiang, Chi Ta-chung, Rumshisky Anna
- Vision-language Pre-training For Multimodal Aspect-based Sentiment Analysis Ling Yan, Yu Jianfei, Xia Rui
- Delving Deeper Into Cross-lingual Visual Question Answering Liu Chen, Pfeiffer Jonas, Korhonen Anna, Vulić Ivan, Gurevych Iryna
- Towards Better Few-shot And Finetuning Performance With Forgetful Causal Language Models Liu Hao, Geng Xinyang, Lee Lisa, Mordatch Igor, Levine Sergey, Narang Sharan, Abbeel Pieter
- Instruction-following Agents With Multimodal Transformer Liu Hao, Lee Lisa, Lee Kimin, Abbeel Pieter
- Building Markovian Generative Architectures Over Pretrained LM Backbones For Efficient Task-oriented Dialog Systems Liu Hong, Cai Yucheng, Ou Zhijian, Huang Yi, Feng Junlan
- Same Pre-training Loss, Better Downstream: Implicit Bias Matters For Language Models Liu Hong, Xie Sang Michael, Li Zhiyuan, Ma Tengyu
- Fl-tuning: Layer Tuning For Feed-forward Network In Transformer Liu Jingping, Song Yuqiu, Xue Kui, Sun Hongli, Wang Chao, Chen Lihan, Jiang Haiyun, Liang Jiaqing, Ruan Tong
- Relational Memory Augmented Language Models Liu Qi, Yogatama Dani, Blunsom Phil
- Transformer With Memory Replay Liu Rui, Mozafari Barzan
- Commitbart: A Large Pre-trained Model For Github Commits Liu Shangqing, Li Yanzhou, Xie Xiaofei, Liu Yang
- PSP: Pre-trained Soft Prompts For Few-shot Abstractive Summarization Liu Xiaochen, Gao Yang, Bai Yu, Li Jiawei, Hu Yinan, Huang Heyan, Chen Boxing
- Improving Personality Consistency In Conversation By Persona Extending Liu Yifan, Wei Wei, Liu Jiayi, Mao Xianling, Fang Rui, Chen Dangyang
- Dialogconv: A Lightweight Fully Convolutional Network For Multi-view Response Selection Liu Yongkang, Feng Shi, Gao Wei, Wang Daling, Zhang Yifei
- A Win-win Deal: Towards Sparse And Robust Pre-trained Language Models Liu Yuanxin, Meng Fandong, Lin Zheng, Li Jiangnan, Fu Peng, Cao Yanan, Wang Weiping, Zhou Jie
- Declaration-based Prompt Tuning For Visual Question Answering Liu Yuhang, Wei Wei, Peng Daowan, Zhu Feida
- Prompt Generation Networks For Input-space Adaptation Of Frozen Vision Transformers Loedeman Jochem, Stol Maarten C., Han Tengda, Asano Yuki M.
- Relaxed Attention For Transformer Models Lohrenz Timo, Möller Björn, Li Zhengyang, Fingscheidt Tim
- Training Language Models To Follow Instructions With Human Feedback Long Ouyang, Jeff Wu, Xu Jiang, Diogo Almeida, Carroll L. Wainwright, Pamela Mishkin, Chong Zhang, Sandhini Agarwal, Katarina Slama, Alex Ray, John Schulman, Jacob Hilton, Fraser Kelton, Luke Miller, Maddie Simens, Amanda Askell, Peter Welinder, Paul Christiano, Jan Leike, Ryan Lowe
- Vision-and-language Pretrained Models: A Survey Long Siqu, Cao Feiqi, Han Soyeon Caren, Yang Haiqin
- Reacc: A Retrieval-augmented Code Completion Framework Lu Shuai, Duan Nan, Han Hojae, Guo Daya, Hwang Seung-won, Svyatkovskiy Alexey
- Choose Your QA Model Wisely: A Systematic Study Of Generative And Extractive Readers For Question Answering Luo Man, Hashimoto Kazuma, Yavuz Semih, Liu Zhiwei, Baral Chitta, Zhou Yingbo
- Biogpt: Generative Pre-trained Transformer For Biomedical Text Generation And Mining Luo Renqian, Sun Liai, Xia Yingce, Qin Tao, Zhang Sheng, Poon Hoifung, Liu Tie-yan
- Torchscale: Transformers At Scale Ma Shuming, Wang Hongyu, Huang Shaohan, Wang Wenhui, Chi Zewen, Dong Li, Benhaim Alon, Patra Barun, Chaudhary Vishrav, Song Xia, Wei Furu
- PERFECT: Prompt-free And Efficient Few-shot Learning With Language Models Mahabadi Rabeeh Karimi, Zettlemoyer Luke, Henderson James, Saeidi Marzieh, Mathias Lambert, Stoyanov Veselin, Yazdani Majid
- Edit5: Semi-autoregressive Text-editing With T5 Warm-start Mallinson Jonathan, Adamek Jakub, Malmi Eric, Severyn Aliaksei
- Tangobert: Reducing Inference Cost By Using Cascaded Architecture Mamou Jonathan, Pereg Oren, Wasserblat Moshe, Schwartz Roy
- Fine-tuning Pre-trained Transformers Into Decaying Fast Weights Mao Huanru Henry
- Quick Starting Dialog Systems With Paraphrase Generation Marceau Louis, Belbahar Raouf, Queudot Marc, Naji Nada, Charton Eric, Meurs Marie-jean
- Mini-model Adaptation: Efficiently Extending Pretrained Models To New Languages Via Aligned Shallow Training Marchisio Kelly, Lewis Patrick, Chen Yihong, Artetxe Mikel
- GAAMA 2.0: An Integrated System That Answers Boolean And Extractive Questions Mccarley Scott, Bornea Mihaela, Rosenthal Sara, Ferritto Anthony, Sultan Md Arafat, Sil Avirup, Florian Radu
- DSI++: Updating Transformer Memory With New Documents Mehta Sanket Vaibhav, Gupta Jai, Tay Yi, Dehghani Mostafa, Tran Vinh Q., Rao Jinfeng, Najork Marc, Strubell Emma, Metzler Donald
- Mass-editing Memory In A Transformer Meng Kevin, Sharma Arnab Sen, Andonian Alex, Belinkov Yonatan, Bau David
- Generating Training Data With Language Models: Towards Zero-shot Language Understanding Meng Yu, Huang Jiaxin, Zhang Yu, Han Jiawei
- Tuning Language Models As Training Data Generators For Augmentation-enhanced Few-shot Learning Meng Yu, Michalski Martin, Huang Jiaxin, Zhang Yu, Abdelzaher Tarek, Han Jiawei
- Rarely A Problem? Language Models Exhibit Inverse Scaling In Their Predictions Following Few-type Quantifiers Michaelov James A., Bergen Benjamin K.
- Minicons: Enabling Flexible Behavioral And Representational Analyses Of Transformer Language Models Misra Kanishka
- Domain-specific Text Generation For Machine Translation Moslem Yasmin, Haque Rejwanul, Kelleher John D., Way Andy
- SGPT: GPT Sentence Embeddings For Semantic Search Muennighoff Niklas
- JASMINE: Arabic GPT Models For Few-shot Learning Nagoudi El Moatez Billah, Abdul-mageed Muhammad, Elmadany Abdelrahim, Inciarte Alcides Alcoba, Khondaker Md Tawkat Islam
- Large Language Models Are Reasoning Teachers Ho Namgyu, Schmid Laura, Yun Se-young
- Task Formulation Matters When Learning Continually: A Case Study In Visual Question Answering Nikandrou Mavina, Yu Lu, Suglia Alessandro, Konstas Ioannis, Rieser Verena
- Crosslingual Generalization Through Multitask Finetuning Niklas Muennighoff, Thomas Wang, Lintang Sutawika, Adam Roberts, Stella Biderman, Teven Le Scao, M Saiful Bari, Sheng Shen, Zheng-xin Yong, Hailey Schoelkopf, Xiangru Tang, Dragomir Radev, Alham Fikri Aji, Khalid Almubarak, Samuel Albanie, Zaid Alyafeai, Albert Webson, Edward Raff, Colin Raffel
- Budgetlongformer: Can We Cheaply Pretrain A Sota Legal Language Model From Scratch? Niklaus Joel, Giofré Daniele
- Do Vision-and-language Transformers Learn Grounded Predicate-noun Dependencies? Nikolaus Mitja, Salin Emmanuelle, Ayache Stephane, Fourtassi Abdellah, Favre Benoit
- Improving The Cross-lingual Generalisation In Visual Question Answering Nooralahzadeh Farhad, Sennrich Rico
- Minialbert: Model Distillation Via Parameter-efficient Recursive Transformers Nouriborji Mohammadmahdi, Rohanian Omid, Kouchaki Samaneh, Clifton David A.
- CUE Vectors: Modular Training Of Language Models Conditioned On Diverse Contextual Signals Novotney Scott, Mukherjee Sreeparna, Ahmed Zeeshan, Stolcke Andreas
- Measuring And Narrowing The Compositionality Gap In Language Models Press Ofir, Zhang Muru, Min Sewon, Schmidt Ludwig, Smith Noah A., Lewis Mike
- Neural Language Taskonomy: Which NLP Tasks Are The Most Predictive Of Fmri Brain Activity? Oota Subba Reddy, Arora Jashn, Agarwal Veeral, Marreddy Mounika, Gupta Manish, Surampudi Bapi Raju
- Parameter-efficient Abstractive Question Answering Over Tables Or Text Pal Vaishali, Kanoulas Evangelos, De Rijke Maarten
- Extremebert: A Toolkit For Accelerating Pretraining Of Customized BERT Pan Rui, Diao Shizhe, Chen Jianlin, Zhang Tong
- Silo Nlp's Participation At WAT2022 Parida Shantipriya, Panda Subhadarshi, Grönroos Stig-arne, Granroth-wilding Mark, Koistinen Mika
- TALM: Tool Augmented Language Models Parisi Aaron, Zhao Yao, Fiedel Noah
- LUT-GEMM: Quantized Matrix Multiplication Based On Luts For Efficient Inference In Large-scale Generative Language Models Park Gunho, Park Baeseong, Kim Minsub, Lee Sungjae, Kim Jeonghoon, Kwon Beomseok, Kwon Se Jung, Kim Byeongwook, Lee Youngjoo, Lee Dongsoo
- Relation-aware Language-graph Transformer For Question Answering Park Jinyoung, Choi Hyeong Kyu, Ko Juyeon, Park Hyeonjin, Kim Ji-hoon, Jeong Jisu, Kim Kyungmin, Kim Hyunwoo J.
- Bidirectional Language Models Are Also Few-shot Learners Patel Ajay, Li Bryan, Rasooli Mohammad Sadegh, Constant Noah, Raffel Colin, Callison-burch Chris
- Beyond English-centric Bitexts For Better Multilingual Language Representation Learning Patra Barun, Singhal Saksham, Huang Shaohan, Chi Zewen, Dong Li, Wei Furu, Chaudhary Vishrav, Song Xia
- Xfboost: Improving Text Generation With Controllable Decoders Peng Xiangyu, Sollami Michael
- Sparse And Dense Approaches For The Full-rank Retrieval Of Responses For Dialogues Penha Gustavo, Hauff Claudia
- Ignore Previous Prompt: Attack Techniques For Language Models Perez Fábio, Ribeiro Ian
- Lifting The Curse Of Multilinguality By Pre-training Modular Transformers Pfeiffer Jonas, Goyal Naman, Lin Xi Victoria, Li Xian, Cross James, Riedel Sebastian, Artetxe Mikel
- Vit5: Pretrained Text-to-text Transformer For Vietnamese Language Generation Phan Long, Tran Hieu, Nguyen Hieu, Trinh Trieu H.
- Hypertuning: Toward Adapting Large Language Models Without Back-propagation Phang Jason, Mao Yi, He Pengcheng, Chen Weizhu
- Investigating Efficiently Extending Transformers For Long Input Summarization Phang Jason, Zhao Yao, Liu Peter J.
- Pre-training Image-language Transformers For Open-vocabulary Tasks Piergiovanni Aj, Kuo Weicheng, Angelova Anelia
- Harnessing The Power Of Multi-task Pretraining For Ground-truth Level Natural Language Explanations Plüster Björn, Ambsdorf Jakob, Braach Lukas, Lee Jae Hee, Wermter Stefan
- Synchromesh: Reliable Code Generation From Pre-trained Language Models Poesia Gabriel, Polozov Oleksandr, Le Vu, Tiwari Ashish, Soares Gustavo, Meek Christopher, Gulwani Sumit
- Outliers Dimensions That Disrupt Transformers Are Driven By Frequency Puccetti Giovanni, Rogers Anna, Drozd Aleksandr, Dell'orletta Felice
- Tegformer: Topic-to-essay Generation With Good Topic Coverage And High Text Coherence Qi Wang, Liu Rui, Zuo Yuan, Chen Yong, Zhang Dell
- A Self-paced Mixed Distillation Method For Non-autoregressive Generation Qi Weizhen, Gong Yeyun, Shen Yelong, Jiao Jian, Yan Yu, Li Houqiang, Zhang Ruofei, Chen Weizhu, Duan Nan
- Diffusion Glancing Transformer For Parallel Sequence To Sequence Learning Qian Lihua, Wang Mingxuan, Liu Yang, Zhou Hao
- The NLP Task Effectiveness Of Long-range Transformers Qin Guanghui, Feng Yukun, Van Durme Benjamin
- Memory In Humans And Deep Language Models: Linking Hypotheses For Model Augmentation Raccah Omri, Chen Phoebe, Willke Ted L., Poeppel David, Vo Vy A.
- VLC-BERT: Visual Question Answering With Contextualized Commonsense Knowledge Ravi Sahithya, Chinchure Aditya, Sigal Leonid, Liao Renjie, Shwartz Vered
- Cometkiwi: Ist-unbabel 2022 Submission For The Quality Estimation Shared Task Rei Ricardo, Treviso Marcos, Guerreiro Nuno M., Zerva Chrysoula, Farinha Ana C., Maroti Christine, De Souza José G. C., Glushkova Taisiya, Alves Duarte M., Lavie Alon, Coheur Luisa, Martins André F. T.
- Exploring Extreme Parameter Compression For Pre-trained Language Models Ren Yuxin, Wang Benyou, Shang Lifeng, Jiang Xin, Liu Qun
- Super-prompting: Utilizing Model-independent Contextual Data To Reduce Data Annotation Required In Visual Commonsense Tasks Rezaei Navid, Reformat Marek Z.
- Breakpoint Transformers For Modeling And Tracking Intermediate Beliefs Richardson Kyle, Tamari Ronen, Sultan Oren, Tsarfaty Reut, Shahaf Dafna, Sabharwal Ashish
- Additive Interventions Yield Robust Multi-domain Machine Translation Models Rippeth Elijah, Post Matt
- Learning To Diversify For Product Question Generation Roitman Haggai, Singer Uriel, Eshel Yotam, Nus Alexander, Kiperwasser Eliyahu
- Lamda: Language Models For Dialog Applications Romal Thoppilan, Daniel De Freitas, Jamie Hall, Noam Shazeer, Apoorv Kulshreshtha, Heng-tze Cheng, Alicia Jin, Taylor Bos, Leslie Baker, Yu Du, Yaguang Li, Hongrae Lee, Huaixiu Steven Zheng, Amin Ghafouri, Marcelo Menegali, Yanping Huang, Maxim Krikun, Dmitry Lepikhin, James Qin, Dehao Chen, Yuanzhong Xu, Zhifeng Chen, Adam Roberts, Maarten Bosma, Vincent Zhao, Yanqi Zhou, Chung-ching Chang, Igor Krivokon, Will Rusch, Marc Pickett, Pranesh Srinivasan, Laichee Man, Kathleen Meier-hellstern, Meredith Ringel Morris, Tulsee Doshi, Renelito Delos Santos, Toju Duke, Johnny Soraker, Ben Zevenbergen, Vinodkumar Prabhakaran, Mark Diaz, Ben Hutchinson, Kristen Olson, Alejandra Molina, Erin Hoffman-john, Josh Lee, Lora Aroyo, Ravi Rajakumar, Alena Butryna, Matthew Lamm, Viktoriya Kuzmina, Joe Fenton, Aaron Cohen, Rachel Bernstein, Ray Kurzweil, Blaise Aguera-arcas, Claire Cui, Marian Croak, Ed Chi, Quoc Le
- LINGUIST: Language Model Instruction Tuning To Generate Annotated Utterances For Intent Classification And Slot Tagging Rosenbaum Andy, Soltan Saleh, Hamza Wael, Versley Yannick, Boese Markus
- Benchclamp: A Benchmark For Evaluating Language Models On Syntactic And Semantic Parsing Roy Subhro, Thomson Sam, Chen Tongfei, Shin Richard, Pauls Adam, Eisner Jason, Van Durme Benjamin
- The Goldilocks Of Pragmatic Understanding: Fine-tuning Strategy Matters For Implicature Resolution By Llms Ruis Laura, Khan Akbir, Biderman Stella, Hooker Sara, Rocktäschel Tim, Grefenstette Edward
- Data Augmentation For Intent Classification With Off-the-shelf Large Language Models Sahu Gaurav, Rodriguez Pau, Laradji Issam H., Atighehchian Parmida, Vazquez David, Bahdanau Dzmitry
- Alexatm 20B: Few-shot Learning Using A Large-scale Multilingual Seq2seq Model Saleh Soltan, Shankar Ananthakrishnan, Jack Fitzgerald, Rahul Gupta, Wael Hamza, Haidar Khan, Charith Peris, Stephen Rawls, Andy Rosenbaum, Anna Rumshisky, Chandana Satya Prakash, Mukund Sridhar, Fabian Triefenbach, Apurv Verma, Gokhan Tur, Prem Natarajan
- Self-repetition In Abstractive Neural Summarizers Salkar Nikita, Trikalinos Thomas, Wallace Byron C., Nenkova Ani
- Knowledge Prompts: Injecting World Knowledge Into Language Models Through Soft Prompts Santos Cicero Nogueira Dos, Dong Zhe, Cer Daniel, Nham John, Shakeri Siamak, Ni Jianmo, Sung Yun-hsuan
- APOLLO: A Simple Approach For Adaptive Pretraining Of Language Models For Logical Reasoning Sanyal Soumya, Xu Yichong, Wang Shuohang, Yang Ziyi, Pryzant Reid, Yu Wenhao, Zhu Chenguang, Ren Xiang
- IT5: Text-to-text Pretraining For Italian Language Understanding And Generation Sarti Gabriele, Nissim Malvina
- Transformer Grammars: Augmenting Transformer Language Models With Syntactic Inductive Biases At Scale Sartran Laurent, Barrett Samuel, Kuncoro Adhiguna, Stanojević Miloš, Blunsom Phil, Dyer Chris
- What Language Model To Train If You Have One Million GPU Hours? Scao Teven Le, Wang Thomas, Hesslow Daniel, Saulnier Lucile, Bekman Stas, Bari M Saiful, Biderman Stella, Elsahar Hady, Muennighoff Niklas, Phang Jason, Press Ofir, Raffel Colin, Sanh Victor, Shen Sheng, Sutawika Lintang, Tae Jaesung, Yong Zheng Xin, Launay Julien, Beltagy Iz
- Non-autoregressive Neural Machine Translation: A Call For Clarity Schmidt Robin M., Pires Telmo, Peitz Stephan, Lööf Jonas
- Confident Adaptive Language Modeling Schuster Tal, Fisch Adam, Gupta Jai, Dehghani Mostafa, Bahri Dara, Tran Vinh Q., Tay Yi, Metzler Donald
- On The Effect Of Pretraining Corpora On In-context Learning By A Large-scale Language Model Seongjin Shin, Sang-woo Lee, Hwijeen Ahn, Sungdong Kim, Hyoungseok Kim, Boseop Kim, Kyunghyun Cho, Gichang Lee, Woomyoung Park, Jung-woo Ha, Nako Sung
- Bird-eye Transformers For Text Generation Models Sha Lei, Song Yuhang, Yordanov Yordan, Salvatori Tommaso, Lukasiewicz Thomas
- Using Deepspeed And Megatron To Train Megatron-turing NLG 530B, A Large-scale Generative Language Model Shaden Smith, Mostofa Patwary, Brandon Norick, Patrick Legresley, Samyam Rajbhandari, Jared Casper, Zhun Liu, Shrimai Prabhumoye, George Zerveas, Vijay Korthikanti, Elton Zhang, Rewon Child, Reza Yazdani Aminabadi, Julie Bernauer, Xia Song, Mohammad Shoeybi, Yuxiong He, Michael Houston, Saurabh Tiwary, Bryan Catanzaro
- Deanthropomorphising NLP: Can A Language Model Be Conscious? Shardlow Matthew, Przybyła Piotr
- Staged Training For Transformer Language Models Shen Sheng, Walsh Pete, Keutzer Kurt, Dodge Jesse, Peters Matthew, Beltagy Iz
- Incorporating Domain Knowledge Through Task Augmentation For Front-end Javascript Code Generation Shen Sijie, Zhu Xiang, Dong Yihong, Guo Qizhi, Zhen Yankun, Li Ge
- Ebms Vs. CL: Exploring Self-supervised Visual Pretraining For Visual Question Answering Shevchenko Violetta, Abbasnejad Ehsan, Dick Anthony, Hengel Anton Van Den, Teney Damien
- Mgpt: Few-shot Learners Go Multilingual Shliazhko Oleh, Fenogenova Alena, Tikhonova Maria, Mikhailov Vladislav, Kozlova Anastasia, Shavrina Tatiana
- Gpt-neox-20b: An Open-source Autoregressive Language Model Sid Black, Stella Biderman, Eric Hallahan, Quentin Anthony, Leo Gao, Laurence Golding, Horace He, Connor Leahy, Kyle Mcdonell, Jason Phang, Michael Pieler, Usvsn Sai Prashanth, Shivanshu Purohit, Laria Reynolds, Jonathan Tow, Ben Wang, Samuel Weinbach
- Towards The Next 1000 Languages In Multilingual Machine Translation: Exploring The Synergy Between Supervised And Self-supervised Learning Siddhant Aditya, Bapna Ankur, Firat Orhan, Cao Yuan, Chen Mia Xu, Caswell Isaac, Garcia Xavier
- On The Efficacy Of Co-attention Transformer Layers In Visual Question Answering Sikarwar Ankur, Kreiman Gabriel
- Improving The Domain Adaptation Of Retrieval Augmented Generation (RAG) Models For Open Domain Question Answering Siriwardhana Shamane, Weerasekera Rivindu, Wen Elliott, Kaluarachchi Tharindu, Rana Rajib, Nanayakkara Suranga
- CLIP Models Are Few-shot Learners: Empirical Studies On VQA And Visual Entailment Song Haoyu, Dong Li, Zhang Wei-nan, Liu Ting, Wei Furu
- Human Language Modeling Soni Nikita, Matero Matthew, Balasubramanian Niranjan, Schwartz H. Andrew
- OPT-IML: Scaling Language Model Instruction Meta Learning Through The Lens Of Generalization Srinivasan Iyer, Xi Victoria Lin, Ramakanth Pasunuru, Todor Mihaylov, Daniel Simig, Ping Yu, Kurt Shuster, Tianlu Wang, Qing Liu, Punit Singh Koura, Xian Li, Brian O'horo, Gabriel Pereyra, Jeff Wang, Christopher Dewan, Asli Celikyilmaz, Luke Zettlemoyer, Ves Stoyanov
- Climb: A Continual Learning Benchmark For Vision-and-language Tasks Srinivasan Tejas, Chang Ting-yun, Alva Leticia Leonor Pinto, Chochlakis Georgios, Rostami Mohammad, Thomason Jesse
- Beyond The Imitation Game: Quantifying And Extrapolating The Capabilities Of Language Models Srivastava Aarohi, Rastogi Abhinav, Rao Abhishek, Shoeb Abu Awal Md, Abid Abubakar, Fisch Adam, Brown Adam R., Santoro Adam, Gupta Aditya, Garriga-alonso Adrià, Kluska Agnieszka, Lewkowycz Aitor, Agarwal Akshat, Power Alethea, Ray Alex, Warstadt Alex, Kocurek Alexander W., Safaya Ali, Tazarv Ali, Xiang Alice, Parrish Alicia, Nie Allen, Hussain Aman, Askell Amanda, Dsouza Amanda, Slone Ambrose, Rahane Ameet, Iyer Anantharaman S., Andreassen Anders, Madotto Andrea, Santilli Andrea, Stuhlmüller Andreas, Dai Andrew, La Andrew, Lampinen Andrew, Zou Andy, Jiang Angela, Chen Angelica, Vuong Anh, Gupta Animesh, Gottardi Anna, Norelli Antonio, Venkatesh Anu, Gholamidavoodi Arash, Tabassum Arfa, Menezes Arul, Kirubarajan Arun, Mullokandov Asher, Sabharwal Ashish, Herrick Austin, Efrat Avia, Erdem Aykut, Karakaş Ayla, Roberts B. Ryan, Loe Bao Sheng, Zoph Barret, Bojanowski Bartłomiej, Özyurt Batuhan, Hedayatnia Behnam, Neyshabur Behnam, Inden Benjamin, Stein Benno, Ekmekci Berk, Lin Bill Yuchen, Howald Blake, Orinion Bryan, Diao Cameron, Dour Cameron, Stinson Catherine, Argueta Cedrick, Ramírez César Ferri, Singh Chandan, Rathkopf Charles, Meng Chenlin, Baral Chitta, Wu Chiyu, Callison-burch Chris, Waites Chris, Voigt Christian, Manning Christopher D., Potts Christopher, Ramirez Cindy, Rivera Clara E., Siro Clemencia, Raffel Colin, Ashcraft Courtney, Garbacea Cristina, Sileo Damien, Garrette Dan, Hendrycks Dan, Kilman Dan, Roth Dan, Freeman Daniel, Khashabi Daniel, Levy Daniel, González Daniel Moseguí, Perszyk Danielle, Hernandez Danny, Chen Danqi, Ippolito Daphne, Gilboa Dar, Dohan David, Drakard David, Jurgens David, Datta Debajyoti, Ganguli Deep, Emelin Denis, Kleyko Denis, Yuret Deniz, Chen Derek, Tam Derek, Hupkes Dieuwke, Misra Diganta, Buzan Dilyar, Mollo Dimitri Coelho, Yang Diyi, Lee Dong-ho, Schrader Dylan, Shutova Ekaterina, Cubuk Ekin Dogus, Segal Elad, Hagerman Eleanor, Barnes Elizabeth, Donoway Elizabeth, Pavlick Ellie, Rodola Emanuele, Lam Emma, Chu Eric, Tang Eric, Erdem Erkut, Chang Ernie, Chi Ethan A., Dyer Ethan, Jerzak Ethan, Kim Ethan, Manyasi Eunice Engefu, Zheltonozhskii Evgenii, Xia Fanyue, Siar Fatemeh, Martínez-plumed Fernando, Happé Francesca, Chollet Francois, Rong Frieda, Mishra Gaurav, Winata Genta Indra, De Melo Gerard, Kruszewski Germán, Parascandolo Giambattista, Mariani Giorgio, Wang Gloria, Jaimovitch-lópez Gonzalo, Betz Gregor, Gur-ari Guy, Galijasevic Hana, Kim Hannah, Rashkin Hannah, Hajishirzi Hannaneh, Mehta Harsh, Bogar Hayden, Shevlin Henry, Schütze Hinrich, Yakura Hiromu, Zhang Hongming, Wong Hugh Mee, Ng Ian, Noble Isaac, Jumelet Jaap, Geissinger Jack, Kernion Jackson, Hilton Jacob, Lee Jaehoon, Fisac Jaime Fernández, Simon James B., Koppel James, Zheng James, Zou James, Kocoń Jan, Thompson Jana, Wingfield Janelle, Kaplan Jared, Radom Jarema, Sohl-dickstein Jascha, Phang Jason, Wei Jason, Yosinski Jason, Novikova Jekaterina, Bosscher Jelle, Marsh Jennifer, Kim Jeremy, Taal Jeroen, Engel Jesse, Alabi Jesujoba, Xu Jiacheng, Song Jiaming, Tang Jillian, Waweru Joan, Burden John, Miller John, Balis John U., Batchelder Jonathan, Berant Jonathan, Frohberg Jörg, Rozen Jos, Hernandez-orallo Jose, Boudeman Joseph, Guerr Joseph, Jones Joseph, Tenenbaum Joshua B., Rule Joshua S., Chua Joyce, Kanclerz Kamil, Livescu Karen, Krauth Karl, Gopalakrishnan Karthik, Ignatyeva Katerina, Markert Katja, Dhole Kaustubh D., Gimpel Kevin, Omondi Kevin, Mathewson Kory, Chiafullo Kristen, Shkaruta Ksenia, Shridhar Kumar, Mcdonell Kyle, Richardson Kyle, Reynolds Laria, Gao Leo, Zhang Li, Dugan Liam, Qin Lianhui, Contreras-ochando Lidia, Morency Louis-philippe, Moschella Luca, Lam Lucas, Noble Lucy, Schmidt Ludwig, He Luheng, Colón Luis Oliveros, Metz Luke, Şenel Lütfi Kerem, Bosma Maarten, Sap Maarten, Ter Hoeve Maartje, Farooqi Maheen, Faruqui Manaal, Mazeika Mantas, Baturan Marco, Marelli Marco, Maru Marco, Quintana Maria Jose Ramírez, Tolkiehn Marie, Giulianelli Mario, Lewis Martha, Potthast Martin, Leavitt Matthew L., Hagen Matthias, Schubert Mátyás, Baitemirova Medina Orduna, Arnaud Melody, Mcelrath Melvin, Yee Michael A., Cohen Michael, Gu Michael, Ivanitskiy Michael, Starritt Michael, Strube Michael, Swędrowski Michał, Bevilacqua Michele, Yasunaga Michihiro, Kale Mihir, Cain Mike, Xu Mimee, Suzgun Mirac, Walker Mitch, Tiwari Mo, Bansal Mohit, Aminnaseri Moin, Geva Mor, Gheini Mozhdeh, T Mukund Varma, Peng Nanyun, Chi Nathan A., Lee Nayeon, Krakover Neta Gur-ari, Cameron Nicholas, Roberts Nicholas, Doiron Nick, Martinez Nicole, Nangia Nikita, Deckers Niklas, Muennighoff Niklas, Keskar Nitish Shirish, Iyer Niveditha S., Constant Noah, Fiedel Noah, Wen Nuan, Zhang Oliver, Agha Omar, Elbaghdadi Omar, Levy Omer, Evans Owain, Casares Pablo Antonio Moreno, Doshi Parth, Fung Pascale, Liang Paul Pu, Vicol Paul, Alipoormolabashi Pegah, Liao Peiyuan, Liang Percy, Chang Peter, Eckersley Peter, Htut Phu Mon, Hwang Pinyu, Miłkowski Piotr, Patil Piyush, Pezeshkpour Pouya, Oli Priti, Mei Qiaozhu, Lyu Qing, Chen Qinlang, Banjade Rabin, Rudolph Rachel Etta, Gabriel Raefer, Habacker Rahel, Risco Ramon, Millière Raphaël, Garg Rhythm, Barnes Richard, Saurous Rif A., Arakawa Riku, Raymaekers Robbe, Frank Robert, Sikand Rohan, Novak Roman, Sitelew Roman, Lebras Ronan, Liu Rosanne, Jacobs Rowan, Zhang Rui, Salakhutdinov Ruslan, Chi Ryan, Lee Ryan, Stovall Ryan, Teehan Ryan, Yang Rylan, Singh Sahib, Mohammad Saif M., Anand Sajant, Dillavou Sam, Shleifer Sam, Wiseman Sam, Gruetter Samuel, Bowman Samuel R., Schoenholz Samuel S., Han Sanghyun, Kwatra Sanjeev, Rous Sarah A., Ghazarian Sarik, Ghosh Sayan, Casey Sean, Bischoff Sebastian, Gehrmann Sebastian, Schuster Sebastian, Sadeghi Sepideh, Hamdan Shadi, Zhou Sharon, Srivastava Shashank, Shi Sherry, Singh Shikhar, Asaadi Shima, Gu Shixiang Shane, Pachchigar Shubh, Toshniwal Shubham, Upadhyay Shyam, Debnath Shyamolima (Shammie), Shakeri Siamak, Thormeyer Simon, Melzi Simone, Reddy Siva, Makini Sneha Priscilla, Lee Soo-hwan, Torene Spencer, Hatwar Sriharsha, Dehaene Stanislas, Divic Stefan, Ermon Stefano, Biderman Stella, Lin Stephanie, Prasad Stephen, Piantadosi Steven T., Shieber Stuart M., Misherghi Summer, Kiritchenko Svetlana, Mishra Swaroop, Linzen Tal, Schuster Tal, Li Tao, Yu Tao, Ali Tariq, Hashimoto Tatsu, Wu Te-lin, Desbordes Théo, Rothschild Theodore, Phan Thomas, Wang Tianle, Nkinyili Tiberius, Schick Timo, Kornev Timofei, Tunduny Titus, Gerstenberg Tobias, Chang Trenton, Neeraj Trishala, Khot Tushar, Shultz Tyler, Shaham Uri, Misra Vedant, Demberg Vera, Nyamai Victoria, Raunak Vikas, Ramasesh Vinay, Prabhu Vinay Uday, Padmakumar Vishakh, Srikumar Vivek, Fedus William, Saunders William, Zhang William, Vossen Wout, Ren Xiang, Tong Xiaoyu, Zhao Xinran, Wu Xinyi, Shen Xudong, Yaghoobzadeh Yadollah, Lakretz Yair, Song Yangqiu, Bahri Yasaman, Choi Yejin, Yang Yichi, Hao Yiding, Chen Yifu, Belinkov Yonatan, Hou Yu, Hou Yufang, Bai Yuntao, Seid Zachary, Zhao Zhuoye, Wang Zijian, Wang Zijie J., Wang Zirui, Wu Ziyi
- Towards Zero-shot And Few-shot Table Question Answering Using GPT-3 Srivastava Pragya, Ganu Tanuja, Guha Saikat
- Conciseness: An Overlooked Language Task Stahlberg Felix, Kumar Aashish, Alberti Chris, Kumar Shankar
- QA4QG: Using Question Answering To Constrain Multi-hop Question Generation Su Dan, Xu Peng, Fung Pascale
- Towards Diverse, Relevant And Coherent Open-domain Dialogue Generation Via Hybrid Latent Variables Sun Bin, Li Yitong, Mi Fei, Wang Weichao, Li Yiwei, Li Kan
- Dynamar: Dynamic Prompt With Mask Token Representation Sun Xiaodi, Rajagopalan Sunny, Nigam Priyanka, Lu Weiyi, Xu Yi, Zeng Belinda, Chilimbi Trishul
- Long-form Video-language Pre-training With Multimodal Temporal Contrastive Learning Sun Yuchong, Xue Hongwei, Song Ruihua, Liu Bei, Yang Huan, Fu Jianlong
- A Length-extrapolatable Transformer Sun Yutao, Dong Li, Patra Barun, Ma Shuming, Huang Shaohan, Benhaim Alon, Chaudhary Vishrav, Song Xia, Wei Furu
- OPT: Open Pre-trained Transformer Language Models Susan Zhang, Stephen Roller, Naman Goyal, Mikel Artetxe, Moya Chen, Shuohui Chen, Christopher Dewan, Mona Diab, Xian Li, Xi Victoria Lin, Todor Mihaylov, Myle Ott, Sam Shleifer, Kurt Shuster, Daniel Simig, Punit Singh Koura, Anjali Sridhar, Tianlu Wang, Luke Zettlemoyer
- On The Effect Of Pre-training For Transformer In Different Modality On Offline Reinforcement Learning Takagi Shiro
- TAPE: Assessing Few-shot Russian Language Understanding Taktasheva Ekaterina, Shavrina Tatiana, Fenogenova Alena, Shevelev Denis, Katricheva Nadezhda, Tikhonova Maria, Akhmetgareeva Albina, Zinkevich Oleg, Bashmakova Anastasiia, Iordanskaia Svetlana, Spiridonova Alena, Kurenshchikova Valentina, Artemova Ekaterina, Mikhailov Vladislav
- Fewer Errors, But More Stereotypes? The Effect Of Model Size On Gender Bias Tal Yarden, Magar Inbal, Schwartz Roy
- ACORT: A Compact Object Relation Transformer For Parameter Efficient Image Captioning Tan Jia Huei, Tan Ying Hua, Chan Chee Seng, Chuah Joon Huang
- News Summarization And Evaluation In The Era Of GPT-3 Tanya Goyal, Junyi Jessy Li, Greg Durrett
- Towards Fine-tuning Pre-trained Language Models With Integer Forward And Backward Propagation Tayaranian Mohammadreza, Ghaffari Alireza, Tahaei Marzieh S., Rezagholizadeh Mehdi, Asgharian Masoud, Nia Vahid Partovi
- Benchmarking Large Language Models For Automated Verilog RTL Code Generation Thakur Shailja, Ahmad Baleegh, Fan Zhenxing, Pearce Hammond, Tan Benjamin, Karri Ramesh, Dolan-gavitt Brendan, Garg Siddharth
- Predicting Issue Types With Sebert Trautsch Alexander, Herbold Steffen
- Adavae: Exploring Adaptive Gpt-2s In Variational Auto-encoders For Language Modeling Tu Haoqin, Yang Zhongliang, Yang Jinshuai, Huang Yongfeng
- Efficient Few-shot Learning Without Prompts Tunstall Lewis, Reimers Nils, Jo Unso Eun Seo, Bates Luke, Korat Daniel, Wasserblat Moshe, Pereg Oren
- Effect And Analysis Of Large-scale Language Model Rescoring On Competitive ASR Systems Udagawa Takuma, Suzuki Masayuki, Kurata Gakuto, Itoh Nobuyasu, Saon George
- Towards Reasoning-aware Explainable VQA Vaideeswaran Rakesh, Gao Feng, Mathur Abhinav, Thattai Govind
- Dylora: Parameter Efficient Tuning Of Pre-trained Models Using Dynamic Search-free Low-rank Adaptation Valipour Mojtaba, Rezagholizadeh Mehdi, Kobyzev Ivan, Ghodsi Ali
- VLSP 2021 - Vimrc Challenge: Vietnamese Machine Reading Comprehension Van Nguyen Kiet, Tran Son Quoc, Nguyen Luan Thanh, Van Huynh Tin, Luu Son T., Nguyen Ngan Luu-thuy
- How Large Language Models Are Transforming Machine-paraphrased Plagiarism Wahle Jan Philip, Ruas Terry, Kirstein Frederic, Gipp Bela
- What Do They Capture? -- A Structural Analysis Of Pre-trained Language Models For Source Code Wan Yao, Zhao Wei, Zhang Hongyu, Sui Yulei, Xu Guandong, Jin Hai
- Unite: Unified Translation Evaluation Wan Yu, Liu Dayiheng, Yang Baosong, Zhang Haibo, Chen Boxing, Wong Derek F., Chao Lidia S.
- Can Language Models Make Fun? A Case Study In Chinese Comical Crosstalk Wang Benyou, Wu Xiangbo, Liu Xiaokang, Li Jianquan, Tiwari Prayag, Xie Qianqian
- STRUDEL: Structured Dialogue Summarization For Dialogue Comprehension Wang Borui, Feng Chengcheng, Nair Arjun, Mao Madelyn, Desai Jai, Celikyilmaz Asli, Li Haoran, Mehdad Yashar, Radev Dragomir
- Knowledgeable Salient Span Mask For Enhancing Language Models As Knowledge Base Wang Cunxiang, Luo Fuli, Li Yanyang, Xu Runxin, Huang Fei, Zhang Yue
- Lilt: A Simple Yet Effective Language-independent Layout Transformer For Structured Document Understanding Wang Jiapeng, Jin Lianwen, Ding Kai
- TAG: Boosting Text-vqa Via Text-aware Visual Question-answer Generation Wang Jun, Gao Mingfei, Hu Yuqian, Selvaraju Ramprasaath R., Ramaiah Chetan, Xu Ran, Jaja Joseph F., Davis Larry S.
- Pretraining Without Attention Wang Junxiong, Yan Jing Nathan, Gu Albert, Rush Alexander M.
- Hybrid-regressive Neural Machine Translation Wang Qiang, Hu Xinhui, Chen Ming
- What Language Model Architecture And Pretraining Objective Work Best For Zero-shot Generalization? Wang Thomas, Roberts Adam, Hesslow Daniel, Scao Teven Le, Chung Hyung Won, Beltagy Iz, Launay Julien, Raffel Colin
- Image As A Foreign Language: Beit Pretraining For All Vision And Vision-language Tasks Wang Wenhui, Bao Hangbo, Dong Li, Bjorck Johan, Peng Zhiliang, Liu Qiang, Aggarwal Kriti, Mohammed Owais Khan, Singhal Saksham, Som Subhojit, Wei Furu
- Understanding And Improving Sequence-to-sequence Pretraining For Neural Machine Translation Wang Wenxuan, Jiao Wenxiang, Hao Yongchang, Wang Xing, Shi Shuming, Tu Zhaopeng, Lyu Michael
- Finding Skill Neurons In Pre-trained Transformer-based Language Models Wang Xiaozhi, Wen Kaiyue, Zhang Zhengyan, Hou Lei, Liu Zhiyuan, Li Juanzi
- Compilable Neural Code Generation With Compiler Feedback Wang Xin, Wang Yasheng, Wan Yao, Mi Fei, Li Yitong, Zhou Pingyi, Liu Jin, Wu Hao, Jiang Xin, Liu Qun
- S-prompts Learning With Pre-trained Transformers: An Occam's Razor For Domain Incremental Learning Wang Yabin, Huang Zhiwu, Hong Xiaopeng
- Adamix: Mixture-of-adaptations For Parameter-efficient Model Tuning Wang Yaqing, Agarwal Sahaj, Mukherjee Subhabrata, Liu Xiaodong, Gao Jing, Awadallah Ahmed Hassan, Gao Jianfeng
- Controlling Styles In Neural Machine Translation With Activation Prompt Wang Yifan, Sun Zewei, Cheng Shanbo, Zheng Weiguo, Wang Mingxuan
- Two-stage LLM Fine-tuning With Less Specialization And More Generalization Wang Yihan, Si Si, Li Daliang, Lukasik Michal, Yu Felix, Hsieh Cho-jui, Dhillon Inderjit S, Kumar Sanjiv
- Pan More Gold From The Sand: Refining Open-domain Dialogue Training With Noisy Self-retrieval Generation Wang Yihe, Li Yitong, Wang Yasheng, Mi Fei, Zhou Pingyi, Wang Xin, Liu Jin, Jiang Xin, Liu Qun
- Multimodal Adaptive Distillation For Leveraging Unimodal Encoders For Vision-language Tasks Wang Zhecan, Codella Noel, Chen Yen-chun, Zhou Luowei, Dai Xiyang, Xiao Bin, Yang Jianwei, You Haoxuan, Chang Kai-wei, Chang Shih-fu, Yuan Lu
- Language Models With Image Descriptors Are Strong Few-shot Video-language Learners Wang Zhenhailong, Li Manling, Xu Ruochen, Zhou Luowei, Lei Jie, Lin Xudong, Wang Shuohang, Yang Ziyi, Zhu Chenguang, Hoiem Derek, Chang Shih-fu, Bansal Mohit, Ji Heng
- Unified Multimodal Model With Unlikelihood Training For Visual Dialog Wang Zihao, Wang Junli, Jiang Changjun
- CREATER: Ctr-driven Advertising Text Generation With Controlled Pre-training And Contrastive Fine-tuning Wei Penghui, Yang Xuanhua, Liu Shaoguo, Wang Liang, Zheng Bo
- Mixed-effects Transformers For Hierarchical Adaptation White Julia, Goodman Noah, Hawkins Robert
- Probing Of Quantitative Values In Abstractive Summarization Models White Nathan M.
- Generative Or Contrastive? Phrase Reconstruction For Better Sentence Representation Learning Wu Bohong, Zhao Hai
- Sentence Representation Learning With Generative Objective Rather Than Contrastive Objective Wu Bohong, Zhao Hai
- Adversarial Self-attention For Language Understanding Wu Hongqiu, Ding Ruixue, Zhao Hai, Xie Pengjun, Huang Fei, Zhang Min
- Stateful Memory-augmented Transformers For Efficient Dialogue Modeling Wu Qingyang, Yu Zhou
- Extreme Compression For Pre-trained Transformers Made Simple And Efficient Wu Xiaoxia, Yao Zhewei, Zhang Minjia, Li Conglong, He Yuxiong
- Memorizing Transformers Wu Yuhuai, Rabe Markus N., Hutchins Delesley, Szegedy Christian
- An Efficient Memory-augmented Transformer For Knowledge-intensive NLP Tasks Wu Yuxiang, Zhao Yu, Hu Baotian, Minervini Pasquale, Stenetorp Pontus, Riedel Sebastian
- Modeling Context With Linear Attention For Scalable Document-level Translation Wu Zhaofeng, Peng Hao, Pappas Nikolaos, Smith Noah A.
- MGA-VQA: Multi-granularity Alignment For Visual Question Answering Xiong Peixi, Shen Yilin, Jin Hongxia
- Adapting Pretrained Text-to-text Models For Long Text Sequences Xiong Wenhan, Gupta Anchit, Toshniwal Shubham, Mehdad Yashar, Yih Wen-tau
- Go-tuning: Improving Zero-shot Learning Abilities Of Smaller Language Models Xu Jingjing, Dong Qingxiu, Liu Hongyi, Li Lei
- Integrating Translation Memories Into Non-autoregressive Machine Translation Xu Jitao, Crego Josep, Yvon François
- CLIP Also Understands Text: Prompting CLIP For Phrase Understanding Yan An, Li Jiacheng, Zhu Wanrong, Lu Yujie, Wang William Yang, Mcauley Julian
- Generative Negative Text Replay For Continual Vision-language Pretraining Yan Shipeng, Hong Lanqing, Xu Hang, Han Jianhua, Tuytelaars Tinne, Li Zhenguo, He Xuming
- Learning To Answer Visual Questions From Web Videos Yang Antoine, Miech Antoine, Sivic Josef, Laptev Ivan, Schmid Cordelia
- Zero-shot Video Question Answering Via Frozen Bidirectional Language Models Yang Antoine, Miech Antoine, Sivic Josef, Laptev Ivan, Schmid Cordelia
- Tableformer: Robust Transformer Modeling For Table-text Encoding Yang Jingfeng, Gupta Aditya, Upadhyay Shyam, He Luheng, Goel Rahul, Paul Shachi
- AD-DROP: Attribution-driven Dropout For Robust Language Model Fine-tuning Yang Tao, Deng Jinghao, Quan Xiaojun, Wang Qifan, Nie Shaoliang
- Building Knowledge-grounded Dialogue Systems With Graph-based Semantic Modeling Yang Yizhe, Huang Heyan, Gao Yang, Li Jiawei
- Parameter-efficient Tuning Makes A Good Classification Head Yang Zhuoyi, Ding Ming, Guo Yanhui, Lv Qingsong, Tang Jie
- Prompt Tuning For Discriminative Pre-trained Language Models Yao Yuan, Dong Bowen, Zhang Ao, Zhang Zhengyan, Xie Ruobing, Liu Zhiyuan, Lin Leyu, Sun Maosong, Wang Jianyong
- Zeroquant: Efficient And Affordable Post-training Quantization For Large-scale Transformers Yao Zhewei, Aminabadi Reza Yazdani, Zhang Minjia, Wu Xiaoxia, Li Conglong, He Yuxiong
- Random-ltd: Random And Layerwise Token Dropping Brings Efficient Training For Large-scale Transformers Yao Zhewei, Wu Xiaoxia, Li Conglong, Holmes Connor, Zhang Minjia, Li Cheng, He Yuxiong
- Language Models Are General-purpose Interfaces Yaru Hao, Haoyu Song, Li Dong, Shaohan Huang, Zewen Chi, Wenhui Wang, Shuming Ma, Furu Wei
- Retrieval-augmented Multimodal Language Modeling Yasunaga Michihiro, Aghajanyan Armen, Shi Weijia, James Rich, Leskovec Jure, Liang Percy, Lewis Mike, Zettlemoyer Luke, Yih Wen-tau
- Modeling Multi-hop Question Answering As Single Sequence Prediction Yavuz Semih, Hashimoto Kazuma, Zhou Yingbo, Keskar Nitish Shirish, Xiong Caiming
- Alzheimer's Diagnosis And Generation-based Chatbot Using Hierarchical Attention And Transformer Park Jun Yeong, Shin Su Jong, Choi Chang Hwan, Lee Jung Jae, Choi Sang-il
- UL2: Unifying Language Learning Paradigms Yi Tay, Mostafa Dehghani, Vinh Q. Tran, Xavier Garcia, Jason Wei, Xuezhi Wang, Hyung Won Chung, Siamak Shakeri, Dara Bahri, Tal Schuster, Huaixiu Steven Zheng, Denny Zhou, Neil Houlsby, Donald Metzler
- Active Example Selection For In-context Learning Yiming Zhang, Shi Feng, Chenhao Tan
- Categorizing Semantic Representations For Neural Machine Translation Yin Yongjing, Li Yafu, Meng Fandong, Zhou Jie, Zhang Yue
- Super-naturalinstructions: Generalization Via Declarative Instructions On 1600+ NLP Tasks Yizhong Wang, Swaroop Mishra, Pegah Alipoormolabashi, Yeganeh Kordi, Amirreza Mirzaei, Anjana Arunkumar, Arjun Ashok, Arut Selvan Dhanasekaran, Atharva Naik, David Stap, Eshaan Pathak, Giannis Karamanolakis, Haizhi Gary Lai, Ishan Purohit, Ishani Mondal, Jacob Anderson, Kirby Kuznia, Krima Doshi, Maitreya Patel, Kuntal Kumar Pal, Mehrad Moradshahi, Mihir Parmar, Mirali Purohit, Neeraj Varshney, Phani Rohitha Kaza, Pulkit Verma, Ravsehaj Singh Puri, Rushang Karia, Shailaja Keyur Sampat, Savan Doshi, Siddhartha Mishra, Sujan Reddy, Sumanta Patro, Tanay Dixit, Xudong Shen, Chitta Baral, Yejin Choi, Noah A. Smith, Hannaneh Hajishirzi, Daniel Khashabi
- BLOOM+1: Adding Language Support To BLOOM For Zero-shot Prompting Yong Zheng-xin, Schoelkopf Hailey, Muennighoff Niklas, Aji Alham Fikri, Adelani David Ifeoluwa, Almubarak Khalid, Bari M Saiful, Sutawika Lintang, Kasai Jungo, Baruwa Ahmed, Winata Genta Indra, Biderman Stella, Raff Edward, Radev Dragomir, Nikoulina Vassilina
- Scaling Autoregressive Models For Content-rich Text-to-image Generation Yu Jiahui, Xu Yuanzhong, Koh Jing Yu, Luong Thang, Baid Gunjan, Wang Zirui, Vasudevan Vijay, Ku Alexander, Yang Yinfei, Ayan Burcu Karagol, Hutchinson Ben, Han Wei, Parekh Zarana, Li Xin, Zhang Han, Baldridge Jason, Wu Yonghui
- Efficient Language Modeling With Sparse All-mlp Yu Ping, Artetxe Mikel, Ott Myle, Shleifer Sam, Gong Hongyu, Stoyanov Ves, Li Xian
- Bilaterally Slimmable Transformer For Elastic And Efficient Visual Question Answering Yu Zhou, Jin Zitian, Yu Jun, Xu Mingliang, Wang Hongbo, Fan Jianping
- Hype: Better Pre-trained Language Model Fine-tuning With Hidden Representation Perturbation Yuan Hongyi, Yuan Zheng, Tan Chuanqi, Huang Fei, Huang Songfang
- Wudaomm: A Large-scale Multi-modal Dataset For Pre-training Models Yuan Sha, Zhao Shuai, Leng Jiahong, Xue Zhao, Zhao Hanyu, Liu Peiyu, Gong Zheng, Zhao Wayne Xin, Li Junyi, Tang Jie
- Can Pretrained Language Models (yet) Reason Deductively? Yuan Zhangdie, Hu Songbo, Vulić Ivan, Korhonen Anna, Meng Zaiqiao
- Competition-level Code Generation With Alphacode Yujia Li, David Choi, Junyoung Chung, Nate Kushman, Julian Schrittwieser, Rémi Leblond, Tom Eccles, James Keeling, Felix Gimeno, Agustin Dal Lago, Thomas Hubert, Peter Choy, Cyprien De Masson D'autume, Igor Babuschkin, Xinyun Chen, Po-sen Huang, Johannes Welbl, Sven Gowal, Alexey Cherepanov, James Molloy, Daniel J. Mankowitz, Esme Sutherland Robson, Pushmeet Kohli, Nando De Freitas, Koray Kavukcuoglu, Oriol Vinyals
- Bridging Cross-lingual Gaps During Leveraging The Multilingual Sequence-to-sequence Pretraining For Text Generation And Understanding Zan Changtong, Ding Liang, Shen Li, Cao Yu, Liu Weifeng, Tao Dacheng
- Dyrex: Dynamic Query Representation For Extractive Question Answering Zaratiana Urchade, Khbir Niama El, Núñez Dennis, Holat Pierre, Tomeh Nadi, Charnois Thierry
- Better Pre-training By Reducing Representation Confusion Zhang Haojie, Liang Mingfei, Xie Ruobing, Sun Zhenlong, Zhang Bo, Lin Leyu
- Improved Data Augmentation For Translation Suggestion Zhang Hongxiao, Lai Siyu, Zhang Songming, Huang Hui, Chen Yufeng, Xu Jinan, Liu Jian
- Scala: Accelerating Adaptation Of Pre-trained Transformer-based Language Models Via Efficient Large-batch Adversarial Noise Zhang Minjia, Naresh Niranjan Uma, He Yuxiong
- Analyzing And Simulating User Utterance Reformulation In Conversational Recommender Systems Zhang Shuo, Wang Mu-chun, Balog Krisztian
- Unveiling Transformers With LEGO: A Synthetic Reasoning Task Zhang Yi, Backurs Arturs, Bubeck Sébastien, Eldan Ronen, Gunasekar Suriya, Wagner Tal
- BLISS: Robust Sequence-to-sequence Learning Via Self-supervised Input Representation Zhang Zheng, Ding Liang, Cheng Dazhao, Liu Xuebo, Zhang Min, Tao Dacheng
- Can Offline Reinforcement Learning Help Natural Language Understanding? Zhang Ziqi, Wang Yile, Zhang Yue, Wang Donglin
- M-adapter: Modality Adaptation For End-to-end Speech-to-text Translation Zhao Jinming, Yang Hao, Shareghi Ehsan, Haffari Gholamreza
- Attend, Memorize And Generate: Towards Faithful Table-to-text Generation In Few Shots Zhao Wenting, Liu Ye, Wan Yao, Yu Philip S.
- E2S2: Encoding-enhanced Sequence-to-sequence Pretraining For Language Understanding And Generation Zhong Qihuang, Ding Liang, Liu Juhua, Du Bo, Tao Dacheng
- Learning To Decompose: Hypothetical Question Decomposition Based On Comparable Texts Zhou Ben, Richardson Kyle, Yu Xiaodong, Roth Dan
- Reflection Of Thought: Inversely Eliciting Numerical Reasoning In Language Models Via Solving Linear Systems Zhou Fan, Dong Haoyu, Liu Qian, Cheng Zhoujun, Han Shi, Zhang Dongmei
- Syntax-informed Question Answering With Heterogeneous Graph Transformer Zhu Fangyi, Tan Lok You, Ng See-kiong, Bressan Stéphane
- Predicting Fine-tuning Performance With Probing Zhu Zining, Shahtalebi Soroosh, Rudzicz Frank
- Future Sight: Dynamic Story Generation With Large Pretrained Language Models Zimmerman Brian D., Sahu Gaurav, Vechtomova Olga
- Exploring And Evaluating Personalized Models For Code Generation Zlotchevski Andrei, Drain Dawn, Svyatkovskiy Alexey, Clement Colin, Sundaresan Neel, Tufano Michele
- St-moe: Designing Stable And Transferable Sparse Expert Models Zoph Barret, Bello Irwan, Kumar Sameer, Du Nan, Huang Yanping, Dean Jeff, Shazeer Noam, Fedus William
- Efficient Long Sequence Modeling Via State Space Augmented Transformer Zuo Simiao, Liu Xiaodong, Jiao Jian, Charles Denis, Manavoglu Eren, Zhao Tuo, Gao Jianfeng
- LM-CPPF: Paraphrasing-guided Data Augmentation For Contrastive Prompt-based Few-shot Fine-tuning Abaskohi Amirhossein, Rothe Sascha, Yaghoobzadeh Yadollah
- Attention Over Pre-trained Sentence Embeddings For Long Document Classification Abdaoui Amine, Dutta Sourav
- Beyond Traditional Teaching: The Potential Of Large Language Models And Chatbots In Graduate Engineering Education Abedi Mahyar, Alshybani Ibrahem, Shahadat Muhammad Rubayat Bin, Murillo Michael S.
- On The Relationship Between Skill Neurons And Robustness In Prompt Tuning Ackermann Leon, Ohmer Xenia
- Multi-party Goal Tracking With Llms: Comparing Pre-training, Fine-tuning, And Prompt Engineering Addlesee Angus, Sieińska Weronika, Gunson Nancie, Garcia Daniel Hernández, Dondrup Christian, Lemon Oliver
- Can Large Language Models Be Good Path Planners? A Benchmark And Investigation On Spatial-temporal Reasoning Aghzal Mohamed, Plaku Erion, Yao Ziyu
- Simul-llm: A Framework For Exploring High-quality Simultaneous Translation With Large Language Models Agostinelli Victor, Wild Max, Raffel Matthew, Fuad Kazi Ahmed Asif, Chen Lizhong
- SARATHI: Efficient LLM Inference By Piggybacking Decodes With Chunked Prefills Agrawal Amey, Panwar Ashish, Mohan Jayashree, Kwatra Nipun, Gulavani Bhargav S., Ramjee Ramachandran
- Story Visualization By Online Text Augmentation With Context Memory Ahn Daechul, Kim Daneul, Song Gwangmo, Kim Seung Hwan, Lee Honglak, Kang Dongyeop, Choi Jonghyun
- MEGA: Multilingual Evaluation Of Generative AI Ahuja Kabir, Diddee Harshita, Hada Rishav, Ochieng Millicent, Ramesh Krithika, Jain Prachi, Nambi Akshay, Ganu Tanuja, Segal Sameer, Axmed Maxamed, Bali Kalika, Sitaram Sunayana
- Jointly Training Large Autoregressive Multimodal Models Aiello Emanuele, Yu Lili, Nie Yixin, Aghajanyan Armen, Oguz Barlas
- The Emergence Of Essential Sparsity In Large Pre-trained Models: The Weights That Matter Ajay Jaiswal, Shiwei Liu, Tianlong Chen, Zhangyang Wang
- RL4F: Generating Natural Language Feedback With Reinforcement Learning For Repairing Model Outputs Akyürek Afra Feyza, Akyürek Ekin, Madaan Aman, Kalyan Ashwin, Clark Peter, Wijaya Derry, Tandon Niket
- Can Language Models Employ The Socratic Method? Experiments With Code Debugging Al-hossami Erfan, Bunescu Razvan, Smith Justin, Teehan Ryan
- Mamba: Linear-time Sequence Modeling With Selective State Spaces Albert Gu, Tri Dao
- Tokenizer Choice For LLM Training: Negligible Or Crucial? Ali Mehdi, Fromm Michael, Thellmann Klaudia, Rutmann Richard, Lübbering Max, Leveling Johannes, Klug Katrin, Ebert Jan, Doll Niclas, Buschhoff Jasper Schulze, Jain Charvi, Weber Alexander Arno, Jurkschat Lena, Abdelwahab Hammam, John Chelsea, Suarez Pedro Ortiz, Ostendorff Malte, Weinbach Samuel, Sifa Rafet, Kesselheim Stefan, Flores-herr Nicolas
- OPT-R: Exploring The Role Of Explanations In Finetuning And Prompting For Reasoning Skills Of Large Language Models Alkhamissi Badr, Verma Siddharth, Yu Ping, Jin Zhijing, Celikyilmaz Asli, Diab Mona
- The Falcon Series Of Open Language Models Almazrouei Ebtesam, Alobeidli Hamza, Alshamsi Abdulaziz, Cappelli Alessandro, Cojocaru Ruxandra, Debbah Mérouane, Goffinet Étienne, Hesslow Daniel, Launay Julien, Malartic Quentin, Mazzotta Daniele, Noune Badreddine, Pannier Baptiste, Penedo Guilherme
- Pixt3: Pixel-based Table-to-text Generation Alonso Iñigo, Agirre Eneko, Lapata Mirella
- Is BERT Blind? Exploring The Effect Of Vision-and-language Pretraining On Visual Language Understanding Alper Morris, Fiman Michael, Averbuch-elor Hadar
- Becoming Self-instruct: Introducing Early Stopping Criteria For Minimal Instruct Tuning Alshikh Waseem, Daaboul Manhal, Goddard Kirk, Imel Brock, Kamble Kiran, Kulkarni Parikshith, Russak Melisa
- Taqyim: Evaluating Arabic NLP Tasks Using Chatgpt Models Alyafeai Zaid, Alshaibani Maged S., Alkhamissi Badr, Luqman Hamzah, Alareqi Ebrahim, Fadel Ali
- Knowledge Of Knowledge: Exploring Known-unknowns Uncertainty With Large Language Models Amayuelas Alfonso, Wong Kyle, Pan Liangming, Chen Wenhu, Wang William
- Prompt-engineering And Transformer-based Question Generation And Evaluation Amyeen Rubaba
- Learning From Mistakes Makes LLM Better Reasoner An Shengnan, Ma Zexiong, Lin Zeqi, Zheng Nanning, Lou Jian-guang, Chen Weizhu
- Protip: Progressive Tool Retrieval Improves Planning Anantha Raviteja, Bandyopadhyay Bortik, Kashi Anirudh, Mahinder Sayantan, Hill Andrew W, Chappidi Srinivas
- Randomized Positional Encodings Boost Length Generalization Of Transformers Anian Ruoss, Grégoire Delétang, Tim Genewein, Jordi Grau-moya, Róbert Csordás, Mehdi Bennani, Shane Legg, Joel Veness
- Distilling Efficient Language-specific Models For Cross-lingual Transfer Ansell Alan, Ponti Edoardo Maria, Korhonen Anna, Vulić Ivan
- Scaling Laws For Language Encoding Models In Fmri Antonello Richard, Vaidya Aditya, Huth Alexander G.
- Mixture Of Tokens: Efficient Llms Through Cross-example Aggregation Antoniak Szymon, Jaszczur Sebastian, Krutul Michał, Pióro Maciej, Krajewski Jakub, Ludziejewski Jan, Odrzygóźdź Tomasz, Cygan Marek
- Slimfit: Memory-efficient Fine-tuning Of Transformer-based Models Using Training Dynamics Ardakani Arash, Haan Altan, Tan Shangyin, Popovici Doru Thom, Cheung Alvin, Iancu Costin, Sen Koushik
- BUFFET: Benchmarking Large Language Models For Few-shot Cross-lingual Transfer Asai Akari, Kudugunta Sneha, Yu Xinyan Velocity, Blevins Terra, Gonen Hila, Reid Machel, Tsvetkov Yulia, Ruder Sebastian, Hajishirzi Hannaneh
- Openflamingo: An Open-source Framework For Training Large Autoregressive Vision-language Models Awadalla Anas, Gao Irena, Gardner Josh, Hessel Jack, Hanafy Yusuf, Zhu Wanrong, Marathe Kalyani, Bitton Yonatan, Gadre Samir, Sagawa Shiori, Jitsev Jenia, Kornblith Simon, Koh Pang Wei, Ilharco Gabriel, Wortsman Mitchell, Schmidt Ludwig
- Using Large Language Models For Zero-shot Natural Language Generation From Knowledge Graphs Axelsson Agnes, Skantze Gabriel
- Scaling Transformer To 1M Tokens And Beyond With RMT Aydar Bulatov, Yuri Kuratov, Yermek Kapushev, Mikhail S. Burtsev
- Chatgpt: Applications, Opportunities, And Threats Bahrini Aram, Khamoshifar Mohammadsadra, Abbasimehr Hossein, Riggs Robert J., Esmaeili Maryam, Majdabadkohne Rastin Mastali, Pasehvar Morteza
- Longbench: A Bilingual, Multitask Benchmark For Long Context Understanding Bai Yushi, Lv Xin, Zhang Jiajie, Lyu Hongchang, Tang Jiankai, Huang Zhidian, Du Zhengxiao, Liu Xiao, Zeng Aohan, Hou Lei, Dong Yuxiao, Tang Jie, Li Juanzi
- Gpt-4: A Review On Advancements And Opportunities In Natural Language Processing Baktash Jawid Ahmad, Dawodi Mursal
- Tamil-llama: A New Tamil Language Model Based On Llama 2 Balachandran Abhinand
- Fine-tuning Large Enterprise Language Models Via Ontological Reasoning Baldazzi Teodoro, Bellomarini Luigi, Ceri Stefano, Colombo Andrea, Gentili Andrea, Sallinger Emanuel
- Investigating Pre-trained Language Models On Cross-domain Datasets, A Step Closer To General AI Ballout Mohamad, Krumnack Ulf, Heidemann Gunther, Kühnberger Kai-uwe
- A Bi-step Grounding Paradigm For Large Language Models In Recommendation Systems Bao Keqin, Zhang Jizhi, Wang Wenjie, Zhang Yang, Yang Zhengyi, Luo Yancheng, Chen Chong, Feng Fuli, Tian Qi
- Assessing And Enhancing The Robustness Of Large Language Models With Task Structure Variations For Logical Reasoning Bao Qiming, Gendron Gael, Peng Alex Yuxuan, Zhong Wanjun, Tan Neset, Chen Yang, Witbrock Michael, Liu Jiamou
- Abstract Meaning Representation-based Logic-driven Data Augmentation For Logical Reasoning Bao Qiming, Peng Alex Yuxuan, Deng Zhenyun, Zhong Wanjun, Gendron Gael, Pistotti Timothy, Tan Neset, Young Nathan, Chen Yang, Zhu Yonghua, Denny Paul, Witbrock Michael, Liu Jiamou
- Disc-medllm: Bridging General Large Language Models And Real-world Medical Consultation Bao Zhijie, Chen Wei, Xiao Shengze, Ren Kuang, Wu Jiaao, Zhong Cheng, Peng Jiajie, Huang Xuanjing, Wei Zhongyu
- Do Language Models Learn About Legal Entity Types During Pretraining? Barale Claire, Rovatsos Michael, Bhuta Nehal
- Understanding Telecom Language Through Large Language Models Bariah Lina, Zou Hang, Zhao Qiyang, Mouhouche Belkacem, Bader Faouzi, Debbah Merouane
- Llamantino: Llama 2 Models For Effective Text Generation In Italian Language Basile Pierpaolo, Musacchio Elio, Polignano Marco, Siciliani Lucia, Fiameni Giuseppe, Semeraro Giovanni
- Text Summarization Using Large Language Models: A Comparative Study Of Mpt-7b-instruct, Falcon-7b-instruct, And Openai Chat-gpt Models Basyal Lochan, Sanghvi Mihir
- Efficient Pre-training For Localized Instruction Generation Of Videos Batra Anil, Moltisanti Davide, Sevilla-lara Laura, Rohrbach Marcus, Keller Frank
- Alternating Updates For Efficient Transformers Baykal Cenk, Cutler Dylan, Dikkala Nishanth, Ghosh Nikhil, Panigrahy Rina, Wang Xin
- Span-selective Linear Attention Transformers For Effective And Robust Schema-guided Dialogue State Tracking Bebensee Björn, Lee Haejun
- The Poison Of Alignment Bekbayev Aibek, Chun Sungbae, Dulat Yerzat, Yamazaki James
- Examining The Emergence Of Deductive Reasoning In Generative Language Models Belcak Peter, Lanzendörfer Luca A., Wattenhofer Roger
- Harnessing Gpt-3.5-turbo For Rhetorical Role Prediction In Legal Cases Belfathi Anas, Hernandez Nicolas, Monceaux Laura
- Labrador: Exploring The Limits Of Masked Language Modeling For Laboratory Data Bellamy David R., Kumar Bhawesh, Wang Cindy, Beam Andrew
- Prompt Sketching For Large Language Models Beurer-kellner Luca, Müller Mark Niklas, Fischer Marc, Vechev Martin
- When Automated Assessment Meets Automated Content Generation: Examining Text Quality In The Era Of Gpts Bevilacqua Marialena, Oketch Kezia, Qin Ruiyang, Stamey Will, Zhang Xinyuan, Gan Yi, Yang Kai, Abbasi Ahmed
- A Survey On Prompting Techniques In Llms Bhandari Prabin
- Are Large Language Models Geospatially Knowledgeable? Bhandari Prabin, Anastasopoulos Antonios, Pfoser Dieter
- TART: A Plug-and-play Transformer Module For Task-agnostic Reasoning Bhatia Kush, Narayan Avanika, De Sa Christopher, Ré Christopher
- Automating Question Generation From Educational Text Bhowmick Ayan Kumar, Jagmohan Ashish, Vempaty Aditya, Dey Prasenjit, Hall Leigh, Hartman Jeremy, Kokku Ravi, Maheshwari Hema
- Helm: Highlighted Evidence Augmented Language Model For Enhanced Table-to-text Generation Bian Junyi, Qin Xiaolei, Zou Wuhe, Huang Mengzuo, Luo Congyi, Zhang Ke, Zhang Weidong
- Safety-tuned Llamas: Lessons From Improving The Safety Of Large Language Models That Follow Instructions Bianchi Federico, Suzgun Mirac, Attanasio Giuseppe, Röttger Paul, Jurafsky Dan, Hashimoto Tatsunori, Zou James
- Birth Of A Transformer: A Memory Viewpoint Bietti Alberto, Cabannes Vivien, Bouchacourt Diane, Jegou Herve, Bottou Leon
- BLT: Can Large Language Models Handle Basic Legal Text? Blair-stanek Andrew, Holzenberger Nils, Van Durme Benjamin
- RWKV: Reinventing Rnns For The Transformer Era Bo Peng, Eric Alcaide, Quentin Anthony, Alon Albalak, Samuel Arcadinho, Stella Biderman, Huanqi Cao, Xin Cheng, Michael Chung, Matteo Grella, Kranthi Kiran Gv, Xuzheng He, Haowen Hou, Jiaju Lin, Przemyslaw Kazienko, Jan Kocon, Jiaming Kong, Bartlomiej Koptyra, Hayden Lau, Krishna Sri Ipsit Mantri, Ferdinand Mom, Atsushi Saito, Guangyu Song, Xiangru Tang, Bolun Wang, Johan S. Wind, Stanislaw Wozniak, Ruichong Zhang, Zhenyuan Zhang, Qihang Zhao, Peng Zhou, Qinghua Zhou, Jian Zhu, Rui-jie Zhu
- Ask Language Model To Clean Your Noisy Translation Data Bolding Quinten, Liao Baohao, Denis Brandon James, Luo Jun, Monz Christof
- Pixlore: A Dataset-driven Approach To Rich Image Captioning Bonilla Diego
- Soundstorm: Efficient Parallel Audio Generation Borsos Zalán, Sharifi Matt, Vincent Damien, Kharitonov Eugene, Zeghidour Neil, Tagliasacchi Marco
- Evaluating Large Language Models For Document-grounded Response Generation In Information-seeking Dialogues Braunschweiler Norbert, Doddipatla Rama, Keizer Simon, Stoyanchev Svetlana
- Extractgpt: Exploring The Potential Of Large Language Models For Product Attribute Value Extraction Brinkmann Alexander, Shraga Roee, Bizer Christian
- Fine-grained Affective Processing Capabilities Emerging From Large Language Models Broekens Joost, Hilpert Bernhard, Verberne Suzan, Baraka Kim, Gebhard Patrick, Plaat Aske
- Efficient Transformer Knowledge Distillation: A Performance Review Brown Nathan, Williamson Ashton, Anderson Tahj, Lawrence Logan
- Accuracy Of A Vision-language Model On Challenging Medical Cases Buckley Thomas, Diao James A., Rodman Adam, Manrai Arjun K.
- Utilizing Chatgpt Generated Data To Retrieve Depression Symptoms From Social Media Bucur Ana-maria
- Codetf: One-stop Transformer Library For State-of-the-art Code LLM Bui Nghi D. Q., Le Hung, Wang Yue, Li Junnan, Gotmare Akhilesh Deepak, Hoi Steven C. H.
- Finding The Needle In A Haystack: Unsupervised Rationale Extraction From Long Text Classifiers Bujel Kamil, Caines Andrew, Yannakoudakis Helen, Rei Marek
- Just Tell Me: Prompt Engineering In Business Process Management Busch Kiran, Rochlitzer Alexander, Sola Diana, Leopold Henrik
- Development Of The Chatgpt, Generative Artificial Intelligence And Natural Large Language Models For Accountable Reporting And Use (CANGARU) Guidelines Cacciamani Giovanni E., Eppler Michael B., Ganjavi Conner, Pekan Asli, Biedermann Brett, Collins Gary S., Gill Inderbir S.
- CLAP: Isolating Content From Style Through Contrastive Learning With Augmented Prompts Cai Yichao, Liu Yuhang, Zhang Zhen, Shi Javen Qinfeng
- Do Large Language Models Resemble Humans In Language Use? Cai Zhenguang G., Duan Xufeng, Haslett David A., Wang Shuqi, Pickering Martin J.
- Can Transformers Learn Sequential Function Classes In Context? Campbell Ryan, Guo Emma, Hu Evan, Vir Reya, Hsiao Ethan
- Oberta: Improving Sparse Transfer Learning Via Improved Initialization, Distillation, And Pruning Regimes Campos Daniel, Marques Alexandre, Kurtz Mark, Zhai Chengxiang
- Wizardlm: Empowering Large Language Models To Follow Complex Instructions Can Xu, Qingfeng Sun, Kai Zheng, Xiubo Geng, Pu Zhao, Jiazhan Feng, Chongyang Tao, Daxin Jiang
- H2O Open Ecosystem For State-of-the-art Large Language Models Candel Arno, Mckinney Jon, Singer Philipp, Pfeiffer Pascal, Jeblick Maximilian, Lee Chun Ming, Conde Marcos V.
- H2ogpt: Democratizing Large Language Models Candel Arno, Mckinney Jon, Singer Philipp, Pfeiffer Pascal, Jeblick Maximilian, Prabhu Prithvi, Gambera Jeff, Landry Mark, Bansal Shivam, Chesler Ryan, Lee Chun Ming, Conde Marcos V., Stetsenko Pasha, Grellier Olivier, Ambati Srisatish
- Defending Against Alignment-breaking Attacks Via Robustly Aligned LLM Cao Bochuan, Cao Yuanpu, Lin Lu, Chen Jinghui
- Pumer: Pruning And Merging Tokens For Efficient Vision Language Models Cao Qingqing, Paranjape Bhargavi, Hajishirzi Hannaneh
- Revolutionizing Mobile Interaction: Enabling A 3 Billion Parameter GPT LLM On Mobile Carreira Samuel, Marques Tomás, Ribeiro José, Grilo Carlos
- Promoting Generalized Cross-lingual Question Answering In Few-resource Scenarios Via Self-knowledge Distillation Carrino Casimiro Pio, Escolano Carlos, Fonollosa José A. R.
- Do Androids Know They're Only Dreaming Of Electric Sheep? Ch-wang Sky, Van Durme Benjamin, Eisner Jason, Kedzie Chris
- Exploring Answer Information Methods For Question Generation With Transformers Chafekar Talha, Hussain Aafiya, Sharma Grishma, Sharma Deepak
- INT2.1: Towards Fine-tunable Quantized Large Language Models With Error Correction Through Low-rank Adaptation Chai Yuji, Gkountouras John, Ko Glenn G., Brooks David, Wei Gu-yeon
- Muse: Text-to-image Generation Via Masked Generative Transformers Chang Huiwen, Zhang Han, Barber Jarred, Maschinot Aj, Lezama Jose, Jiang Lu, Yang Ming-hsuan, Murphy Kevin, Freeman William T., Rubinstein Michael, Li Yuanzhen, Krishnan Dilip
- Learning To Generate Better Than Your LLM Chang Jonathan D., Brantley Kiante, Ramamurthy Rajkumar, Misra Dipendra, Sun Wen
- Prompting And Adapter Tuning For Self-supervised Encoder-decoder Speech Model Chang Kai-wei, Chen Ming-hsin, Lin Yun-ping, Hsu Jing Neng, Huang Paul Kuo-ming, Huang Chien-yu, Li Shang-wen, Lee Hung-yi
- Language Model Behavior: A Comprehensive Survey Chang Tyler A., Bergen Benjamin K.
- Characterizing Learning Curves During Language Model Pre-training: Learning, Forgetting, And Stability Chang Tyler A., Tu Zhuowen, Bergen Benjamin K.
- Hallucination Augmented Contrastive Learning For Multimodal Large Language Model Chaoya Jiang, Haiyang Xu, Mengfan Dong, Jiaxing Chen, Wei Ye, Ming Yan, Qinghao Ye, Ji Zhang, Fei Huang, Shikun Zhang
- Not All Layers Are Equally As Important: Every Layer Counts BERT Charpentier Lucas Georges Gabriel, Samuel David
- She Had Cobalt Blue Eyes: Prompt Testing To Create Aligned And Sustainable Language Models Chatrath Veronica, Bamgbose Oluwanifemi, Raza Shaina
- Efficient GPT Model Pre-training Using Tensor Train Matrix Representation Chekalina Viktoriia, Novikov Georgii, Gusak Julia, Oseledets Ivan, Panchenko Alexander
- Sudden Drops In The Loss: Syntax Acquisition, Phase Transitions, And Simplicity Bias In Mlms Chen Angelica, Shwartz-ziv Ravid, Cho Kyunghyun, Leavitt Matthew L., Saphra Naomi
- Fireact: Toward Language Agent Fine-tuning Chen Baian, Shu Chang, Shareghi Ehsan, Collier Nigel, Narasimhan Karthik, Yao Shunyu
- Visual Instruction Tuning With Polite Flamingo Chen Delong, Liu Jianfeng, Dai Wenliang, Wang Baoyuan
- Generate Labeled Training Data Using Prompt Programming And GPT-3. An Example Of Big Five Personality Classification Chen Eason
- Tem-adapter: Adapting Image-text Pretraining For Video Question Answer Chen Guangyi, Liu Xiao, Wang Guangrun, Zhang Kun, Torr Philip H. S., Zhang Xiao-ping, Tang Yansong
- CLEX: Continuous Length Extrapolation For Large Language Models Chen Guanzheng, Li Xin, Meng Zaiqiao, Liang Shangsong, Bing Lidong
- Videollm: Modeling Video Sequence With Large Language Models Chen Guo, Zheng Yin-dong, Wang Jiahao, Xu Jilan, Huang Yifei, Pan Junting, Wang Yi, Wang Yali, Qiao Yu, Lu Tong, Wang Limin
- Chatgpt's One-year Anniversary: Are Open-source Large Language Models Catching Up? Chen Hailin, Jiao Fangkai, Li Xingxuan, Qin Chengwei, Ravaut Mathieu, Zhao Ruochen, Xiong Caiming, Joty Shafiq
- Maybe Only 0.5% Data Is Needed: A Preliminary Exploration Of Low Training Data Instruction Tuning Chen Hao, Zhang Yiming, Zhang Qi, Yang Hantao, Hu Xiaomeng, Ma Xuetao, Yanggong Yifan, Zhao Junbo
- Interpreting And Controlling Vision Foundation Models Via Text Explanations Chen Haozhe, Yang Junfeng, Vondrick Carl, Mao Chengzhi
- Skills-in-context Prompting: Unlocking Compositionality In Large Language Models Chen Jiaao, Pan Xiaoman, Yu Dian, Song Kaiqiang, Wang Xiaoyang, Yu Dong, Chen Jianshu
- Unlearn What You Want To Forget: Efficient Unlearning For Llms Chen Jiaao, Yang Diyi
- Schema-guided Semantic Accuracy: Faithfulness In Task-oriented Dialogue Response Generation Chen Jinghong, Lin Weizhe, Byrne Bill
- Introspective Tips: Large Language Model For In-context Decision Making Chen Liting, Wang Lu, Dong Hang, Du Yali, Yan Jie, Yang Fangkai, Li Shuang, Zhao Pu, Qin Si, Rajmohan Saravan, Lin Qingwei, Zhang Dongmei
- Driving With Llms: Fusing Object-level Vector Modality For Explainable Autonomous Driving Chen Long, Sinavski Oleg, Hünermann Jan, Karnsund Alice, Willmott Andrew James, Birch Danny, Maund Daniel, Shotton Jamie
- Natural Response Generation For Chinese Reading Comprehension Chen Nuo, Li Hongguang, Bao Yinan, Wang Baoyuan, Li Jia
- Breaking Language Barriers In Multilingual Mathematical Reasoning: Insights And Observations Chen Nuo, Zheng Zinan, Wu Ning, Gong Ming, Song Yangqiu, Zhang Dongmei, Li Jia
- Evaluation Of Chatgpt Family Of Models For Biomedical Reasoning And Classification Chen Shan, Li Yingya, Lu Sheng, Van Hoang, Aerts Hugo Jwl, Savova Guergana K., Bitterman Danielle S.
- Evaluating Factual Consistency Of Summaries With Large Language Models Chen Shiqi, Gao Siyang, He Junxian
- Extending Context Window Of Large Language Models Via Positional Interpolation Chen Shouyuan, Wong Sherman, Chen Liangjian, Tian Yuandong
- MMICT: Boosting Multi-modal Fine-tuning With In-context Examples Chen Tao, Zhang Enwei, Gao Yuting, Li Ke, Sun Xing, Zhang Yan, Li Hui, Ji Rongrong
- Lorashear: Efficient Large Language Model Structured Pruning And Knowledge Recovery Chen Tianyi, Ding Tianyu, Yadav Badal, Zharkov Ilya, Liang Luming
- The Janus Interface: How Fine-tuning In Large Language Models Amplifies The Privacy Risks Chen Xiaoyi, Tang Siyuan, Zhu Rui, Yan Shijun, Jin Lei, Wang Zihao, Su Liya, Zhang Zhikun, Wang Xiaofeng, Tang Haixu
- Minprompt: Graph-based Minimal Prompt Data Augmentation For Few-shot Question Answering Chen Xiusi, Jiang Jyun-yu, Chang Wei-cheng, Hsieh Cho-jui, Yu Hsiang-fu, Wang Wei
- Medprompt: Cross-modal Prompting For Multi-task Medical Image Translation Chen Xuhang, Pun Chi-man, Wang Shuqiang
- Translation And Fusion Improves Zero-shot Cross-lingual Information Extraction Chen Yang, Shah Vedaant, Ritter Alan
- Measuring And Improving Chain-of-thought Reasoning In Vision-language Models Chen Yangyi, Sikka Karan, Cogswell Michael, Ji Heng, Divakaran Ajay
- EE-LLM: Large-scale Training And Inference Of Early-exit Large Language Models With 3D Parallelism Chen Yanxi, Pan Xuchen, Li Yaliang, Ding Bolin, Zhou Jingren
- Confidant: Customizing Transformer-based Llms Via Collaborative Edge Training Chen Yuhao, Yan Yuxuan, Yang Qianqian, Shu Yuanchao, He Shibo, Chen Jiming
- Token Prediction As Implicit Classification To Identify Llm-generated Text Chen Yutian, Kang Hao, Zhai Vivian, Li Liangze, Singh Rita, Raj Bhiksha
- MEDITRON-70B: Scaling Medical Pretraining For Large Language Models Chen Zeming, Cano Alejandro Hernández, Romanou Angelika, Bonnet Antoine, Matoba Kyle, Salvi Francesco, Pagliardini Matteo, Fan Simin, Köpf Andreas, Mohtashami Amirkeivan, Sallinen Alexandre, Sakhaeirad Alireza, Swamy Vinitra, Krawczuk Igor, Bayazit Deniz, Marmet Axel, Montariol Syrielle, Hartley Mary-anne, Jaggi Martin, Bosselut Antoine
- See, Think, Confirm: Interactive Prompting Between Vision And Language Models For Knowledge-based Visual Reasoning Chen Zhenfang, Zhou Qinhong, Shen Yikang, Hong Yining, Zhang Hao, Gan Chuang
- Cascade Speculative Drafting For Even Faster LLM Inference Chen Ziyi, Yang Xiaocong, Lin Jiacheng, Sun Chenkai, Chang Kevin Chen-chuan, Huang Jie
- Visual Chatgpt: Talking, Drawing And Editing With Visual Foundation Models Chenfei Wu, Shengming Yin, Weizhen Qi, Xiaodong Wang, Zecheng Tang, Nan Duan
- UPRISE: Universal Prompt Retrieval For Improving Zero-shot Evaluation Cheng Daixuan, Huang Shaohan, Bi Junyu, Zhan Yuefeng, Liu Jianfeng, Wang Yujing, Sun Hao, Wei Furu, Deng Denvy, Zhang Qi
- SCALE: Synergized Collaboration Of Asymmetric Language Translation Engines Cheng Xin, Wang Xun, Ge Tao, Chen Si-qing, Wei Furu, Zhao Dongyan, Yan Rui
- Latent Positional Information Is In The Self-attention Variance Of Transformer Language Models Without Positional Embeddings Chi Ta-chung, Fan Ting-han, Chen Li-wei, Rudnicky Alexander I., Ramadge Peter J.
- Attention Alignment And Flexible Positional Embeddings Improve Transformer Length Extrapolation Chi Ta-chung, Fan Ting-han, Rudnicky Alexander I.
- Transformer Working Memory Enables Regular Language Reasoning And Natural Language Length Extrapolation Chi Ta-chung, Fan Ting-han, Rudnicky Alexander I., Ramadge Peter J.
- M²chat: Empowering VLM For Multimodal LLM Interleaved Text-image Generation Chi Xiaowei, Zhang Rongyu, Jiang Zhengkai, Liu Yijiang, Wang Yatian, Qi Xingqun, Luo Wenhan, Gao Peng, Zhang Shanghang, Liu Qifeng, Guo Yike
- INSTRUCTEVAL: Towards Holistic Evaluation Of Instruction-tuned Large Language Models Chia Yew Ken, Hong Pengfei, Bing Lidong, Poria Soujanya
- Language Grounded Qformer For Efficient Vision Language Understanding Choraria Moulik, Sekhar Nitesh, Wu Yue, Zhang Xu, Singhal Prateek, Varshney Lav R.
- Large Language Models For User Interest Journeys Christakopoulou Konstantina, Lalama Alberto, Adams Cj, Qu Iris, Amir Yifat, Chucri Samer, Vollucci Pierce, Soldo Fabio, Bseiso Dina, Scodel Sarah, Dixon Lucas, Chi Ed H., Chen Minmin
- Language And Task Arithmetic With Parameter-efficient Layers For Zero-shot Summarization Chronopoulou Alexandra, Pfeiffer Jonas, Maynez Joshua, Wang Xinyi, Ruder Sebastian, Agrawal Priyanka
- Tutorials On Stance Detection Using Pre-trained Language Models: Fine-tuning BERT And Prompting Large Language Models Chuang Yun-shiuan
- Dola: Decoding By Contrasting Layers Improves Factuality In Large Language Models Chuang Yung-sung, Xie Yujia, Luo Hongyin, Kim Yoon, Glass James, He Pengcheng
- Instructtods: Large Language Models For End-to-end Task-oriented Dialogue Systems Chung Willy, Cahyawijaya Samuel, Wilie Bryan, Lovenia Holy, Fung Pascale
- LIMA: Less Is More For Alignment Chunting Zhou, Pengfei Liu, Puxin Xu, Srini Iyer, Jiao Sun, Yuning Mao, Xuezhe Ma, Avia Efrat, Ping Yu, Lili Yu, Susan Zhang, Gargi Ghosh, Mike Lewis, Luke Zettlemoyer, Omer Levy
- Generalization Analogies: A Testbed For Generalizing AI Oversight To Hard-to-measure Domains Clymer Joshua, Baker Garrett, Subramani Rohan, Wang Sam
- Inducing Anxiety In Large Language Models Increases Exploration And Bias Coda-forno Julian, Witte Kristin, Jagadish Akshay K., Binz Marcel, Akata Zeynep, Schulz Eric
- Robust Infidelity: When Faithfulness Measures On Masked Language Models Are Misleading Crothers Evan, Viktor Herna, Japkowicz Nathalie
- Reinforcement Learning Fine-tuning Of Language Models Is Biased Towards More Extractable Features Cruz Diogo, Pona Edoardo, Holness-tofts Alex, Schmied Elias, Alonso Víctor Abia, Griffin Charlie, Cirstea Bogdan-ionut
- Machine Mindset: An MBTI Exploration Of Large Language Models Cui Jiaxi, Lv Liuzhenghao, Wen Jing, Wang Rongsheng, Tang Jing, Tian Yonghong, Yuan Li
- Ada-instruct: Adapting Instruction Generators For Complex Reasoning Cui Wanyun, Wang Qianle
- Large Language Models For Compiler Optimization Cummins Chris, Seeker Volker, Grubisic Dejan, Elhoushi Mostafa, Liang Youwei, Roziere Baptiste, Gehring Jonas, Gloeckle Fabian, Hazelwood Kim, Synnaeve Gabriel, Leather Hugh
- Evaluating Prompt-based Question Answering For Object Prediction In The Open Research Knowledge Graph D'souza Jennifer, Hrou Moussab, Auer Sören
- Syntactic Knowledge Via Graph Attention With BERT In Machine Translation Dai Yuqian, Sharoff Serge, De Kamps Marc
- Educhat: A Large-scale Language Model-based Chatbot System For Intelligent Education Dan Yuhao, Lei Zhikai, Gu Yiyang, Li Yong, Yin Jianghao, Lin Jiaju, Ye Linhao, Tie Zhiyan, Zhou Yougen, Wang Yilei, Zhou Aimin, Zhou Ze, Chen Qin, Zhou Jie, He Liang, Qiu Xipeng
- An Adversarial Example For Direct Logit Attribution: Memory Management In Gelu-4l Dao James, Lau Yeu-tong, Rager Can, Janiak Jett
- Flashattention-2: Faster Attention With Better Parallelism And Work Partitioning Dao Tri
- Mask The Bias: Improving Domain-adaptive Generalization Of Ctc-based ASR With Internal Language Model Estimation Das Nilaksh, Sunkara Monica, Bodapati Sravan, Cai Jinglun, Kulshreshtha Devang, Farris Jeff, Kirchhoff Katrin
- Optimizing Distributed Training On Frontier For Large Language Models Dash Sajal, Lyngaas Isaac, Yin Junqi, Wang Xiao, Egele Romain, Cong Guojing, Wang Feiyi, Balaprakash Prasanna
- Competence-based Analysis Of Language Models Davies Adam, Jiang Jize, Zhai Chengxiang
- GPT-3 Models Are Few-shot Financial Reasoners De Padua Raul Salles, Qureshi Imran, Karakaplan Mustafa U.
- Scaling Vision Transformers To 22 Billion Parameters Dehghani Mostafa, Djolonga Josip, Mustafa Basil, Padlewski Piotr, Heek Jonathan, Gilmer Justin, Steiner Andreas, Caron Mathilde, Geirhos Robert, Alabdulmohsin Ibrahim, Jenatton Rodolphe, Beyer Lucas, Tschannen Michael, Arnab Anurag, Wang Xiao, Riquelme Carlos, Minderer Matthias, Puigcerver Joan, Evci Utku, Kumar Manoj, Van Steenkiste Sjoerd, Elsayed Gamaleldin F., Mahendran Aravindh, Yu Fisher, Oliver Avital, Huot Fantine, Bastings Jasmijn, Collier Mark Patrick, Gritsenko Alexey, Birodkar Vighnesh, Vasconcelos Cristina, Tay Yi, Mensink Thomas, Kolesnikov Alexander, Pavetić Filip, Tran Dustin, Kipf Thomas, Lučić Mario, Zhai Xiaohua, Keysers Daniel, Harmsen Jeremiah, Houlsby Neil
- Skipdecode: Autoregressive Skip Decoding With Batching And Caching For Efficient LLM Inference Del Corro Luciano, Del Giorno Allie, Agarwal Sahaj, Yu Bin, Awadallah Ahmed, Mukherjee Subhabrata
- Investigating Data Contamination In Modern Benchmarks For Large Language Models Deng Chunyuan, Zhao Yilun, Tang Xiangru, Gerstein Mark, Cohan Arman
- Plug-and-play Policy Planner For Large Language Model Powered Dialogue Agents Deng Yang, Zhang Wenxuan, Lam Wai, Ng See-kiong, Chua Tat-seng
- Mutual Enhancement Of Large And Small Language Models With Cross-silo Knowledge Transfer Deng Yongheng, Qiao Ziqing, Ren Ju, Liu Yang, Zhang Yaoxue
- Multilingual Jailbreak Challenges In Large Language Models Deng Yue, Zhang Wenxuan, Pan Sinno Jialin, Bing Lidong
- Spqr: A Sparse-quantized Representation For Near-lossless LLM Weight Compression Dettmers Tim, Svirschevski Ruslan, Egiazarian Vage, Kuznedelev Denis, Frantar Elias, Ashkboos Saleh, Borzunov Alexander, Hoefler Torsten, Alistarh Dan
- Context-aware Transformer Pre-training For Answer Sentence Selection Di Liello Luca, Garg Siddhant, Moschitti Alessandro
- A Comprehensive Evaluation Of Neural SPARQL Query Generation From Natural Language Questions Diallo Papa Abdou Karim Karou, Reyd Samuel, Zouaq Amal
- Lmflow: An Extensible Toolkit For Finetuning And Inference Of Large Foundation Models Diao Shizhe, Pan Rui, Dong Hanze, Shum Ka Shun, Zhang Jipeng, Xiong Wei, Zhang Tong
- Jump To Conclusions: Short-cutting Transformers With Linear Transformations Din Alexander Yom, Karidi Taelin, Choshen Leshem, Geva Mor
- Longnet: Scaling Transformers To 1,000,000,000 Tokens Ding Jiayu, Ma Shuming, Dong Li, Zhang Xingxing, Huang Shaohan, Wang Wenhui, Zheng Nanning, Wei Furu
- Enhancing Chat Language Models By Scaling High-quality Instructional Conversations Ding Ning, Chen Yulin, Xu Bokai, Qin Yujia, Zheng Zhi, Hu Shengding, Liu Zhiyuan, Sun Maosong, Zhou Bowen
- Self-agreement: A Framework For Fine-tuning Language Models To Find Agreement Among Diverse Opinions Ding Shiyao, Ito Takayuki
- HPC-GPT: Integrating Large Language Model For High-performance Computing Ding Xianzhong, Chen Le, Emani Murali, Liao Chunhua, Lin Pei-hung, Vanderbruggen Tristan, Xie Zhen, Cerpa Alberto E., Du Wan
- Fluid Transformers And Creative Analogies: Exploring Large Language Models' Capacity For Augmenting Cross-domain Analogical Creativity Ding Zijian, Srinivasan Arvind, Macneil Stephen, Chan Joel
- Establishing Performance Baselines In Fine-tuning, Retrieval-augmented Generation And Soft-prompting For Non-specialist LLM Users Dodgson Jennifer, Nanzheng Lin, Peh Julian, Pattirane Akira Rafhael Janson, Alhajir Alfath Daryl, Dinarto Eko Ridho, Lim Joseph, Ahmad Syed Danyal
- Block-wise Bit-compression Of Transformer-based Models Dong Gaochen, Chen Wei
- Blockwise Compression Of Transformer-based Models Without Retraining Dong Gaochen, Chen Wei
- Speechgpt: Empowering Large Language Models With Intrinsic Cross-modal Conversational Abilities Dong Zhang, Shimin Li, Xin Zhang, Jun Zhan, Pengyu Wang, Yaqian Zhou, Xipeng Qiu
- Loramoe: Alleviate World Knowledge Forgetting In Large Language Models Via Moe-style Plugin Dou Shihan, Zhou Enyu, Liu Yan, Gao Songyang, Zhao Jun, Shen Wei, Zhou Yuhao, Xi Zhiheng, Wang Xiao, Fan Xiaoran, Pu Shiliang, Zhu Jiang, Zheng Rui, Gui Tao, Zhang Qi, Huang Xuanjing
- Large Language Models Douglas Michael R.
- Quantifying And Attributing The Hallucination Of Large Language Models Via Association Analysis Du Li, Wang Yequan, Xing Xingrun, Ya Yiqun, Li Xiang, Jiang Xin, Fang Xuezhi
- Blending Reward Functions Via Few Expert Demonstrations For Faithful And Accurate Knowledge-grounded Dialogue Generation Du Wanyu, Ji Yangfeng
- Guiding Pretraining In Reinforcement Learning With Large Language Models Du Yuqing, Watkins Olivia, Wang Zihan, Colas Cédric, Darrell Trevor, Abbeel Pieter, Gupta Abhishek, Andreas Jacob
- Lauragpt: Listen, Attend, Understand, And Regenerate Audio With GPT Du Zhihao, Wang Jiaming, Chen Qian, Chu Yunfei, Gao Zhifu, Li Zerui, Hu Kai, Zhou Xiaohuan, Xu Jin, Ma Ziyang, Wang Wen, Zheng Siqi, Zhou Chang, Yan Zhijie, Zhang Shiliang
- Alpacafarm: A Simulation Framework For Methods That Learn From Human Feedback Dubois Yann, Li Xuechen, Taori Rohan, Zhang Tianyi, Gulrajani Ishaan, Ba Jimmy, Guestrin Carlos, Liang Percy, Hashimoto Tatsunori B.
- Stack Attention: Improving The Ability Of Transformers To Model Hierarchical Patterns Dusell Brian, Chiang David
- Faith And Fate: Limits Of Transformers On Compositionality Dziri Nouha, Lu Ximing, Sclar Melanie, Li Xiang Lorraine, Jiang Liwei, Lin Bill Yuchen, West Peter, Bhagavatula Chandra, Bras Ronan Le, Hwang Jena D., Sanyal Soumya, Welleck Sean, Ren Xiang, Ettinger Allyson, Harchaoui Zaid, Choi Yejin
- GPT-SW3: An Autoregressive Language Model For The Nordic Languages Ekgren Ariel, Gyllensten Amaru Cuba, Stollenwerk Felix, Öhman Joey, Isbister Tim, Gogoulou Evangelia, Carlsson Fredrik, Heiman Alice, Casademont Judit, Sahlgren Magnus
- Tinystories: How Small Can Language Models Be And Still Speak Coherent English? Eldan Ronen, Li Yuanzhi
- Octopus: A Multitask Model And Toolkit For Arabic Natural Language Generation Elmadany Abdelrahim, Nagoudi El Moatez Billah, Abdul-mageed Muhammad
- A Comprehensive Performance Study Of Large Language Models On Novel AI Accelerators Emani Murali, Foreman Sam, Sastry Varuni, Xie Zhen, Raskar Siddhisanket, Arnold William, Thakur Rajeev, Vishwanath Venkatram, Papka Michael E.
- Textgenshap: Scalable Post-hoc Explanations In Text Generation With Long Documents Enouen James, Nakhost Hootan, Ebrahimi Sayna, Arik Sercan O, Liu Yan, Pfister Tomas
- Jacotext: A Pretrained Model For Java Code-text Generation Espejel Jessica López, Alassan Mahaman Sanoussi Yahaya, Dahhane Walid, Ettifouri El Hassane
- NAP At Semeval-2023 Task 3: Is Less Really More? (back-)translation As Data Augmentation Strategies For Detecting Persuasion Techniques Falk Neele, Eichel Annerose, Piccirilli Prisca
- Chain-of-thought Tuning: Masked Language Models Can Also Think Step By Step In Natural Language Understanding Fan Caoyun, Tian Jidong, Li Yitian, Chen Wenqing, He Hao, Jin Yaohui
- Instructseq: Unifying Vision Tasks With Instruction-conditioned Multi-modal Sequence Generation Fang Rongyao, Yan Shilin, Huang Zhaoyang, Zhou Jingqiu, Tian Hao, Dai Jifeng, Li Hongsheng
- An Empirical Study Of Multitask Learning To Improve Open Domain Dialogue Systems Farahani Mehrdad, Johansson Richard
- Revisiting Instruction Fine-tuned Model Evaluation To Guide Industrial Applications Faysse Manuel, Viaud Gautier, Hudelot Céline, Colombo Pierre
- Extending Context Window Of Large Language Models Via Semantic Compression Fei Weizhi, Niu Xueyan, Zhou Pingyi, Hou Lu, Bai Bo, Deng Lei, Han Wei
- Llama Rider: Spurring Large Language Models To Explore The Open World Feng Yicheng, Wang Yuxuan, Liu Jiazheng, Zheng Sipeng, Lu Zongqing
- Explaining How Transformers Use Context To Build Predictions Ferrando Javier, Gállego Gerard I., Tsiamas Ioannis, Costa-jussà Marta R.
- Grounded Complex Task Segmentation For Conversational Assistants Ferreira Rafael, Semedo David, Magalhães João
- Vision Language Transformers: A Survey Fields Clayton, Kennington Casey
- Context-aware Meta-learning Fifty Christopher, Duan Dennis, Junkins Ronald G., Amid Ehsan, Leskovec Jure, Re Christopher, Thrun Sebastian
- Can Large Language Models Follow Concept Annotation Guidelines? A Case Study On Scientific And Financial Domains Fonseca Marcio, Cohen Shay B.
- Sparsegpt: Massive Language Models Can Be Accurately Pruned In One-shot Frantar Elias, Alistarh Dan
- Kwaiyiimath: Technical Report Fu Jiayi, Lin Lei, Gao Xiaoyang, Liu Pengli, Chen Zhengzong, Yang Zhirui, Zhang Shengnan, Zheng Xue, Li Yan, Liu Yuliang, Ye Xucheng, Liao Yiqiao, Liao Chao, Chen Bin, Song Chengru, Wan Junchen, Lin Zijia, Zhang Fuzheng, Wang Zhongyuan, Zhang Di, Gai Kun
- Badllama: Cheaply Removing Safety Fine-tuning From Llama 2-chat 13B Gade Pranav, Lermen Simon, Rogers-smith Charlie, Ladish Jeffrey
- The Inner Sentiments Of A Thought Gagne Chris, Dayan Peter
- Cerbero-7b: A Leap Forward In Language-specific Llms Through Enhanced Chat Corpus Generation And Evaluation Galatolo Federico A., Cimino Mario G. C. A.
- Roles Of Scaling And Instruction Tuning In Language Perception: Model Vs. Human Attention Gao Changjiang, Huang Shujian, Li Jixing, Chen Jiajun
- Text-to-sql Empowered By Large Language Models: A Benchmark Evaluation Gao Dawei, Wang Haibin, Li Yaliang, Sun Xiuyu, Qian Yichen, Ding Bolin, Zhou Jingren
- Examining User-friendly And Open-sourced Large GPT Models: A Survey On Language, Multimodal, And Scientific GPT Models Gao Kaiyuan, He Sunan, He Zhenyu, Lin Jiacheng, Pei Qizhi, Shao Jie, Zhang Wei
- Real Customization Or Just Marketing: Are Customized Versions Of Chat GPT Useful? Garrido-merchán Eduardo C., Arroyo-barrigüete Jose L., Borrás-pala Francisco, Escobar-torres Leandro, De Ibarreta Carlos Martínez, Ortiz-lozano Jose María, Rua-vieites Antonio
- Ahead-of-time P-tuning Gavrilov Daniil, Balagansky Nikita
- Model Tells You What To Discard: Adaptive KV Cache Compression For Llms Ge Suyu, Zhang Yunan, Liu Liyuan, Zhang Minjia, Han Jiawei, Gao Jianfeng
- MART: Improving LLM Safety With Multi-round Automatic Red-teaming Ge Suyu, Zhou Chunting, Hou Rui, Khabsa Madian, Wang Yi-chia, Wang Qifan, Han Jiawei, Mao Yuning
- Making Llama SEE And Draw With SEED Tokenizer Ge Yuying, Zhao Sijie, Zeng Ziyun, Ge Yixiao, Li Chen, Wang Xintao, Shan Ying
- Ddcot: Duty-distinct Chain-of-thought Prompting For Multimodal Reasoning In Language Models Ge Zheng, Bin Yang, Jiajin Tang, Hong-yu Zhou, Sibei Yang
- Mblip: Efficient Bootstrapping Of Multilingual Vision-llms Geigle Gregor, Jain Abhay, Timofte Radu, Glavaš Goran
- Hiclip: Contrastive Language-image Pretraining With Hierarchy-aware Attention Geng Shijie, Yuan Jianbo, Tian Yu, Chen Yuxiao, Zhang Yongfeng
- Does Synthetic Data Make Large Language Models More Efficient? Gholami Sia, Omar Marwan
- Flacuna: Unleashing The Problem Solving Power Of Vicuna Using FLAN Fine-tuning Ghosal Deepanway, Chia Yew Ken, Majumder Navonil, Poria Soujanya
- Text-to-audio Generation Using Instruction-tuned LLM And Latent Diffusion Model Ghosal Deepanway, Majumder Navonil, Mehrish Ambuj, Poria Soujanya
- Clipsyntel: CLIP And LLM Synergy For Multimodal Question Summarization In Healthcare Ghosh Akash, Acharya Arkadeep, Jain Raghav, Saha Sriparna, Chadha Aman, Sinha Setu
- Mtuncertainty: Assessing The Need For Post-editing Of Machine Translation Outputs By Fine-tuning Openai Llms Gladkoff Serge, Han Lifeng, Erofeev Gleb, Sorokina Irina, Nenadic Goran
- Generative AI Beyond Llms: System Implications Of Multi-modal Generation Golden Alicia, Hsia Samuel, Sun Fei, Acun Bilge, Hosmer Basil, Lee Yejin, Devito Zachary, Johnson Jeff, Wei Gu-yeon, Brooks David, Wu Carole-jean
- Model-generated Pretraining Signals Improves Zero-shot Generalization Of Text-to-text Transformers Gong Linyuan, Xiong Chenyan, Liu Xiaodong, Bajaj Payal, Xie Yiqing, Cheung Alvin, Gao Jianfeng, Song Xia
- Improving Input-label Mapping With Demonstration Replay For In-context Learning Gong Zhuocheng, Liu Jiahao, Wang Qifan, Wang Jingang, Cai Xunliang, Zhao Dongyan, Yan Rui
- Using Large Language Models For Interpreting Autonomous Robots Behaviors González-santamarta Miguel A., Fernández-becerra Laura, Sobrín-hidalgo David, Guerrero-higueras Ángel Manuel, González Irene, Lera Francisco J. Rodríguez
- Intention And Context Elicitation With Large Language Models In The Legal Aid Intake Process Goodson Nick, Lu Rongfei
- Lil-bevo: Explorations Of Strategies For Training Language Models In More Humanlike Ways Govindarajan Venkata S, Rodriguez Juan Diego, Bostrom Kaj, Mahowald Kyle
- AMAGO: Scalable In-context Reinforcement Learning For Adaptive Agents Grigsby Jake, Fan Linxi, Zhu Yuke
- Pokemqa: Programmable Knowledge Editing For Multi-hop Question Answering Gu Hengrui, Zhou Kaixiong, Han Xiaotian, Liu Ninghao, Wang Ruobing, Wang Xin
- Seer: Language Instructed Video Prediction With Latent Diffusion Models Gu Xianfan, Wen Chuan, Ye Weirui, Song Jiaming, Gao Yang
- Voyager: An Open-ended Embodied Agent With Large Language Models Guanzhi Wang, Yuqi Xie, Yunfan Jiang, Ajay Mandlekar, Chaowei Xiao, Yuke Zhu, Linxi Fan, Anima Anandkumar
- Hifi: High-information Attention Heads Hold For Parameter-efficient Model Adaptation Gui Anchun, Xiao Han
- Hiformer: Heterogeneous Feature Interactions Learning With Transformers For Recommender Systems Gui Huan, Wang Ruoxi, Yin Ke, Jin Long, Kula Maciej, Xu Taibai, Hong Lichan, Chi Ed H.
- SPT: Fine-tuning Transformer-based Language Models Efficiently With Sparsification Gui Yuntao, Yan Xiao, Yin Peiqi, Yang Han, Cheng James
- Longcoder: A Long-range Pre-trained Language Model For Code Completion Guo Daya, Xu Canwen, Duan Nan, Yin Jian, Mcauley Julian
- Beyond Imitation: Leveraging Fine-grained Quality Signals For Alignment Guo Geyang, Zhao Ranchi, Tang Tianyi, Zhao Wayne Xin, Wen Ji-rong
- Meta-tuning Llms To Leverage Lexical Knowledge For Generalizable Language Style Understanding Guo Ruohao, Xu Wei, Ritter Alan
- Instruction Fusion: Advancing Prompt Evolution Through Hybridization Guo Weidong, Yang Jiuding, Yang Kaitong, Li Xiangyang, Rao Zhuwei, Xu Yu, Niu Di
- Improving Small Language Models On Pubmedqa Via Generative Data Augmentation Guo Zhen, Wang Peiqi, Wang Yanwei, Yu Shangdi
- Point-bind & Point-llm: Aligning Point Cloud With Multi-modality For 3D Understanding, Generation, And Instruction Following Guo Ziyu, Zhang Renrui, Zhu Xiangyang, Tang Yiwen, Ma Xianzheng, Han Jiaming, Chen Kexin, Gao Peng, Li Xianzhi, Li Hongsheng, Heng Pheng-ann
- Editing Common Sense In Transformers Gupta Anshita, Mondal Debanjan, Sheshadri Akshay Krishna, Zhao Wenlong, Li Xiang Lorraine, Wiegreffe Sarah, Tandon Niket
- Grace: Generation Using Associated Code Edits Gupta Priyanshu, Khare Avishree, Bajpai Yasharth, Chakraborty Saikat, Gulwani Sumit, Kanade Aditya, Radhakrishna Arjun, Soares Gustavo, Tiwari Ashish
- Gistscore: Learning Better Representations For In-context Example Selection With Gist Bottlenecks Gupta Shivanshu, Rosenbaum Clemens, Elenberg Ethan R.
- Evaluating Concurrent Robustness Of Language Models Across Diverse Challenge Sets Gupta Vatsal, Pandya Pranshu, Kataria Tushar, Gupta Vivek, Roth Dan
- A Survey Of Vision-language Pre-training From The Lens Of Multimodal Machine Translation Gwinnup Jeremy, Duh Kevin
- A Theory Of Emergent In-context Learning As Implicit Structure Induction Hahn Michael, Goyal Navin
- Large Language Models Meet Computer Vision: A Brief Survey Hamadi Raby
- Lm-infinite: Zero-shot Extreme Length Generalization For Large Language Models Han Chi, Wang Qifan, Peng Hao, Xiong Wenhan, Chen Yu, Ji Heng, Wang Sinong
- Llms As Visual Explainers: Advancing Image Classification With Evolving Visual Descriptions Han Songhao, Zhuo Le, Liao Yue, Liu Si
- Medalpaca -- An Open-source Collection Of Medical Conversational AI Models And Training Data Han Tianyu, Adams Lisa C., Papaioannou Jens-michalis, Grundmann Paul, Oberhauser Tom, Löser Alexander, Truhn Daniel, Bressem Keno K.
- In-context Alignment: Chat With Vanilla Language Models Before Fine-tuning Han Xiaochuang
- David Helps Goliath: Inference-time Collaboration Between Small Specialized And Large General Diffusion Lms Han Xiaochuang, Kumar Sachin, Tsvetkov Yulia, Ghazvininejad Marjan
- Personalisation Within Bounds: A Risk Taxonomy And Policy Framework For The Alignment Of Large Language Models With Personalised Feedback Hannah Rose Kirk, Bertie Vidgen, Paul Röttger, Scott A. Hale
- Blockwise Parallel Transformer For Large Context Models Hao Liu, Pieter Abbeel
- Vico: Plug-and-play Visual Condition For Personalized Text-to-image Generation Hao Shaozhe, Han Kai, Zhao Shihao, Wong Kwan-yee K.
- Towards Efficient Vision-language Tuning: More Information Density, More Generalizability Hao Tianxiang, Lyu Mengyao, Chen Hui, Zhao Sicheng, Ding Xiaohan, Han Jungong, Ding Guiguang
- Test-time Training On Nearest Neighbors For Large Language Models Hardt Moritz, Sun Yu
- Unlocking The Potential Of Chatgpt: A Comprehensive Exploration Of Its Applications, Advantages, Limitations, And Future Directions In Natural Language Processing Hariri Walid
- CHAI-DT: A Framework For Prompting Conversational Generative AI Agents To Actively Participate In Co-creation Harwood Brandon
- Zero- And Few-shot Prompting With Llms: A Comparative Study With Fine-tuned Models For Bangla Sentiment Analysis Hasan Md. Arid, Das Shudipta, Anjum Afiyat, Alam Firoj, Anjum Anika, Sarker Avijit, Noori Sheak Rashed Haider
- Linear Latent World Models In Simple Transformers: A Case Study On Othello-gpt Hazineh Dean S., Zhang Zechen, Chiu Jeffery
- Multimodal Graph Transformer For Multimodal Question Answering He Xuehai, Wang Xin Eric
- Chatgpt For Zero-shot Dialogue State Tracking: A Solution Or An Opportunity? Heck Michael, Lubis Nurul, Ruppik Benjamin, Vukovic Renato, Feng Shutong, Geishauser Christian, Lin Hsien-chin, Van Niekerk Carel, Gašić Milica
- Divide Et Impera: Multi-transformer Architectures For Complex Nlp-tasks Helland Solveig, Gavagnin Elena, De Spindler Alexandre
- Simplemtod: A Simple Language Model For Multimodal Task-oriented Dialogue With Symbolic Scene Representation Hemanthage Bhathiya, Dondrup Christian, Bartie Phil, Lemon Oliver
- How Good Are GPT Models At Machine Translation? A Comprehensive Evaluation Hendy Amr, Abdelrehim Mohamed, Sharaf Amr, Raunak Vikas, Gabr Mohamed, Matsushita Hitokazu, Kim Young Jin, Afify Mohamed, Awadalla Hany Hassan
- Stochastic Parrots Looking For Stochastic Parrots: Llms Are Easy To Fine-tune And Hard To Detect With Other Llms Henrique Da Silva Gameiro, Kucharavy Andrei, Guerraoui Rachid
- Backpack Language Models Hewitt John, Thickstun John, Manning Christopher D., Liang Percy
- Assessing The Efficacy Of Large Language Models In Generating Accurate Teacher Responses Hicke Yann, Masand Abhishek, Guo Wentao, Gangavarapu Tushaar
- Newsgpt: Chatgpt Integration For Robot-reporter Hireche Abdelhadi, Belkacem Abdelkader Nasreddine, Jamil Sadia, Chen Chao
- L2MAC: Large Language Model Automatic Computer For Extensive Code Generation Holt Samuel, Luyten Max Ruiz, Van Der Schaar Mihaela
- Zero-shot Goal-directed Dialogue Via RL On Imagined Conversations Hong Joey, Levine Sergey, Dragan Anca
- SPEED: Speculative Pipelined Execution For Efficient Decoding Hooper Coleman, Kim Sehoon, Mohammadzadeh Hiva, Genc Hasan, Keutzer Kurt, Gholami Amir, Shao Sophia
- Towards A Mechanistic Interpretation Of Multi-step Reasoning Capabilities Of Language Models Hou Yifan, Li Jiaoda, Fei Yu, Stolfo Alessandro, Zhou Wangchunshu, Zeng Guangtao, Bosselut Antoine, Sachan Mrinmaya
- COMMA: Co-articulated Multi-modal Learning Hu Lianyu, Gao Liqing, Liu Zekang, Pun Chi-man, Feng Wei
- Meta-learning Online Adaptation Of Language Models Hu Nathan, Mitchell Eric, Manning Christopher D., Finn Chelsea
- Scaled Prompt-tuning For Few-shot Natural Language Generation Hu Ting, Meinel Christoph, Yang Haojin
- Do Large Language Models Know About Facts? Hu Xuming, Chen Junzhe, Li Xiaochuan, Guo Yufei, Wen Lijie, Yu Philip S., Guo Zhijiang
- AMERICANO: Argument Generation With Discourse-driven Decomposition And Agent Interaction Hu Zhe, Chan Hou Pong, Yin Yu
- Llm-adapters: An Adapter Family For Parameter-efficient Fine-tuning Of Large Language Models Hu Zhiqiang, Wang Lei, Lan Yihuai, Xu Wanyu, Lim Ee-peng, Bing Lidong, Xu Xing, Poria Soujanya, Lee Roy Ka-wei
- Towards Equipping Transformer With The Ability Of Systematic Compositionality Huang Chen, Qin Peixin, Lei Wenqiang, Lv Jiancheng
- Agentcoder: Multi-agent-based Code Generation With Iterative Testing And Optimisation Huang Dong, Zhang Jie M., Luck Michael, Bu Qingwen, Qing Yuhao, Cui Heming
- Directed Acyclic Transformer Pre-training For High-quality Non-autoregressive Text Generation Huang Fei, Ke Pei, Huang Minlie
- Chatgpt For Shaping The Future Of Dentistry: The Potential Of Multi-modal Large Language Model Huang Hanyao, Zheng Ou, Wang Dongdong, Yin Jiayi, Wang Zijin, Ding Shengxuan, Yin Heng, Xu Chuan, Yang Renjie, Zheng Qian, Shi Bing
- Acegpt, Localizing Large Language Models In Arabic Huang Huang, Yu Fei, Zhu Jianqing, Sun Xuening, Cheng Hao, Song Dingjie, Chen Zhihong, Alharthi Abdulmohsen, An Bang, He Juncai, Liu Ziche, Zhang Zhiyi, Chen Junying, Li Jianquan, Wang Benyou, Zhang Lian, Sun Ruoyu, Wan Xiang, Li Haizhou, Xu Jinchao
- T2i-compbench: A Comprehensive Benchmark For Open-world Compositional Text-to-image Generation Huang Kaiyi, Sun Kaiyue, Xie Enze, Li Zhenguo, Liu Xihui
- Lawyer Llama Technical Report Huang Quzhe, Tao Mingxu, Zhang Chen, An Zhenwei, Jiang Cong, Chen Zhibin, Wu Zirui, Feng Yansong
- Grounded Decoding: Guiding Text Generation With Grounded Models For Embodied Agents Huang Wenlong, Xia Fei, Shah Dhruv, Driess Danny, Zeng Andy, Lu Yao, Florence Pete, Mordatch Igor, Levine Sergey, Hausman Karol, Ichter Brian
- Fewer Is More: Boosting LLM Reasoning With Reinforced Context Pruning Huang Xijie, Zhang Li Lyna, Cheng Kwang-ting, Yang Fan, Yang Mao
- Generic Attention-model Explainability By Weighted Relevance Accumulation Huang Yiming, Jia Aozhe, Zhang Xiaodan, Zhang Jiawei
- Competition-level Problems Are Effective LLM Evaluators Huang Yiming, Lin Zhenghao, Liu Xiao, Gong Yeyun, Lu Shuai, Lei Fangyu, Liang Yaobo, Shen Yelong, Lin Chen, Duan Nan, Chen Weizhu
- Advancing Transformer Architecture In Long-context Large Language Models: A Comprehensive Survey Huang Yunpeng, Xu Jingwei, Lai Junyu, Jiang Zixu, Chen Taolue, Li Zenan, Yao Yuan, Ma Xiaoxing, Yang Lijuan, Chen Hao, Li Shupeng, Zhao Penghao
- How Does Pretraining Improve Discourse-aware Translation? Huang Zhihong, Wang Longyue, Liu Siyou, Wong Derek F.
- Llama 2: Open Foundation And Fine-tuned Chat Models Hugo Touvron, Louis Martin, Kevin Stone, Peter Albert, Amjad Almahairi, Yasmine Babaei, Nikolay Bashlykov, Soumya Batra, Prajjwal Bhargava, Shruti Bhosale, Dan Bikel, Lukas Blecher, Cristian Canton Ferrer, Moya Chen, Guillem Cucurull, David Esiobu, Jude Fernandes, Jeremy Fu, Wenyin Fu, Brian Fuller, Cynthia Gao, Vedanuj Goswami, Naman Goyal, Anthony Hartshorn, Saghar Hosseini, Rui Hou, Hakan Inan, Marcin Kardas, Viktor Kerkez, Madian Khabsa, Isabel Kloumann, Artem Korenev, Punit Singh Koura, Marie-anne Lachaux, Thibaut Lavril, Jenya Lee, Diana Liskovich, Yinghai Lu, Yuning Mao, Xavier Martinet, Todor Mihaylov, Pushkar Mishra, Igor Molybog, Yixin Nie, Andrew Poulton, Jeremy Reizenstein, Rashi Rungta, Kalyan Saladi, Alan Schelten, Ruan Silva, Eric Michael Smith, Ranjan Subramanian, Xiaoqing Ellen Tan, Binh Tang, Ross Taylor, Adina Williams, Jian Xiang Kuan, Puxin Xu, Zheng Yan, Iliyan Zarov, Yuchen Zhang, Angela Fan, Melanie Kambadur, Sharan Narang, Aurelien Rodriguez, Robert Stojnic, Sergey Edunov, Thomas Scialom
- TADA: Efficient Task-agnostic Domain Adaptation For Transformers Hung Chia-chien, Lange Lukas, Strötgen Jannik
- Pre-gated Moe: An Algorithm-system Co-design For Fast And Scalable Mixture-of-expert Inference Hwang Ranggi, Wei Jianyu, Cao Shijie, Hwang Changho, Tang Xiaohu, Cao Ting, Yang Mao
- Minimizing Factual Inconsistency And Hallucination In Large Language Models I Muneeswaran, Saxena Shreya, Prasad Siva, Prakash M V Sai, Shankar Advaith, V Varun, Vaddina Vishal, Gopalakrishnan Saisubramaniam
- Glot500: Scaling Multilingual Corpora And Language Models To 500 Languages Imani Ayyoob, Lin Peiqin, Kargaran Amir Hossein, Severini Silvia, Sabet Masoud Jalili, Kassner Nora, Ma Chunlan, Schmid Helmut, Martins André F. T., Yvon François, Schütze Hinrich
- Llama Guard: Llm-based Input-output Safeguard For Human-ai Conversations Inan Hakan, Upasani Kartikeya, Chi Jianfeng, Rungta Rashi, Iyer Krithika, Mao Yuning, Tontchev Michael, Hu Qing, Fuller Brian, Testuggine Davide, Khabsa Madian
- Pushing Boundaries: Exploring Zero Shot Object Classification With Large Multimodal Models Islam Ashhadul, Biswas Md. Rafiul, Zaghouani Wajdi, Belhaouari Samir Brahim, Shah Zubair
- A Comprehensive Evaluation Of Large Language Models On Benchmark Biomedical Text Processing Tasks Jahan Israt, Laskar Md Tahmid Rahman, Peng Chun, Huang Jimmy
- Breaking The Token Barrier: Chunking And Convolution For Efficient Long Text Classification With BERT Jaiswal Aman, Milios Evangelos
- Cotran: An Llm-based Code Translator Using Reinforcement Learning With Feedback From Compiler And Symbolic Execution Jana Prithwish, Jha Piyush, Ju Haoyang, Kishore Gautham, Mahajan Aryan, Ganesh Vijay
- Just CHOP: Embarrassingly Simple LLM Compression Jha Ananya Harsh, Sherborne Tom, Walsh Evan Pete, Groeneveld Dirk, Strubell Emma, Beltagy Iz
- Tailoring Language Generation Models Under Total Variation Distance Ji Haozhe, Ke Pei, Hu Zhipeng, Zhang Rongsheng, Huang Minlie
- Bootstrapping Vision-language Learning With Decoupled Language Pre-training Jian Yiren, Gao Chongyang, Vosoughi Soroush
- Bus: Efficient And Effective Vision-language Pre-training With Bottom-up Patch Summarization Jiang Chaoya, Xu Haiyang, Ye Wei, Ye Qinghao, Li Chenliang, Yan Ming, Bi Bin, Zhang Shikun, Huang Fei, Huang Songfang
- Legal Syllogism Prompting: Teaching Large Language Models For Legal Judgment Prediction Jiang Cong, Yang Xiaolei
- Forcing Generative Models To Degenerate Ones: The Power Of Data Poisoning Attacks Jiang Shuli, Kadhe Swanand Ravindra, Zhou Yi, Cai Ling, Baracaldo Nathalie
- Scaling Sentence Embeddings With Large Language Models Jiang Ting, Huang Shaohan, Luan Zhongzhi, Wang Deqing, Zhuang Fuzhen
- Exploring Self-supervised Logic-enhanced Training For Large Language Models Jiao Fangkai, Teng Zhiyang, Ding Bosheng, Liu Zhengyuan, Chen Nancy F., Joty Shafiq
- Prompt-based Length Controlled Generation With Reinforcement Learning Jie Renlong, Meng Xiaojun, Shang Lifeng, Jiang Xin, Liu Qun
- Emergent Representations Of Program Semantics In Language Models Trained On Programs Jin Charles, Rinard Martin
- Growlength: Accelerating Llms Pretraining By Progressively Growing Training Length Jin Hongye, Han Xiaotian, Yang Jingfeng, Jiang Zhimeng, Chang Chia-yuan, Hu Xia
- Medcpt: Contrastive Pre-trained Transformers With Large-scale Pubmed Search Logs For Zero-shot Biomedical Information Retrieval Jin Qiao, Kim Won, Chen Qingyu, Comeau Donald C., Yeganova Lana, Wilbur W. John, Lu Zhiyong
- Grounding Language Models To Images For Multimodal Inputs And Outputs Jing Yu Koh, Ruslan Salakhutdinov, Daniel Fried
- Knowledge-augmented Large Language Models For Personalized Contextual Query Suggestion Jinheon Baek, Nirupama Chandrasekaran, Silviu Cucerzan, Allen Herring, Sujay Kumar Jauhar
- Teaching Language Models To Hallucinate Less With Synthetic Tasks Jones Erik, Palangi Hamid, Simões Clarisse, Chandrasekaran Varun, Mukherjee Subhabrata, Mitra Arindam, Awadallah Ahmed, Kamar Ece
- Small Language Models Fine-tuned To Coordinate Larger Language Models Improve Complex Reasoning Juneja Gurusha, Dutta Subhabrata, Chakrabarti Soumen, Manchanda Sunny, Chakraborty Tanmoy
- BLIP-2: Bootstrapping Language-image Pre-training With Frozen Image Encoders And Large Language Models Junnan Li, Dongxu Li, Silvio Savarese, Steven Hoi
- Synthetic Data Generation In Low-resource Settings Via Fine-tuning Of Large Language Models Kaddour Jean, Liu Qi
- Domain-specific Code Language Models: Unraveling The Potential For HPC Codes And Tasks Kadosh Tal, Hasabnis Niranjan, Vo Vy A., Schneider Nadav, Krien Neva, Capota Mihai, Wasay Abdul, Ahmed Nesreen, Willke Ted, Tamir Guy, Pinter Yuval, Mattson Timothy, Oren Gal
- Robust Recommender System: A Survey And Future Directions Kaike Zhang, Qi Cao, Fei Sun, Yunfan Wu, Shuchang Tao, Huawei Shen, Xueqi Cheng
- Contextual Code Switching For Machine Translation Using Language Models Kaji Arshad, Shah Manan
- Calibrated Language Models Must Hallucinate Kalai Adam Tauman, Vempala Santosh S.
- A Rank Stabilization Scaling Factor For Fine-tuning With Lora Kalajdzievski Damjan
- Swectrl-mini: A Data-transparent Transformer-based Large Language Model For Controllable Text Generation In Swedish Kalpakchi Dmytro, Boye Johan
- A Survey Of GPT-3 Family Large Language Models Including Chatgpt And GPT-4 Kalyan Katikapalli Subramanyam
- Syntax-guided Transformers: Elevating Compositional Generalization And Grounding In Multimodal Environments Kamali Danial, Kordjamshidi Parisa
- Dialmat: Dialogue-enabled Transformer With Moment-based Adversarial Training Kaneda Kanta, Korekata Ryosuke, Wada Yuiga, Nagashima Shunya, Kambara Motonari, Iioka Yui, Matsuo Haruka, Imai Yuto, Nishimura Takayuki, Sugiura Komei
- Reducing Sequence Length By Predicting Edit Operations With Large Language Models Kaneko Masahiro, Okazaki Naoaki
- Llm-augmented Preference Learning From Natural Language Kang Inwon, Ruan Sikai, Ho Tyler, Lin Jui-chien, Mohsin Farhad, Seneviratne Oshani, Xia Lirong
- Knowledge-augmented Reasoning Distillation For Small Language Models In Knowledge-intensive Tasks Kang Minki, Lee Seanie, Baek Jinheon, Kawaguchi Kenji, Hwang Sung Ju
- Sorted Llama: Unlocking The Potential Of Intermediate Layers Of Large Language Models For Dynamic Inference Kavehzadeh Parsa, Valipour Mojtaba, Tahaei Marzieh, Ghodsi Ali, Chen Boxing, Rezagholizadeh Mehdi
- Application Of Frozen Large-scale Models To Multimodal Task-oriented Dialogue Kawamoto Tatsuki, Suzuki Takuma, Miyama Ko, Meguro Takumi, Takagi Tomohiro
- The Impact Of Positional Encoding On Length Generalization In Transformers Kazemnejad Amirhossein, Padhi Inkit, Ramamurthy Karthikeyan Natesan, Das Payel, Reddy Siva
- Critiquellm: Towards An Informative Critique Generation Model For Evaluation Of Large Language Model Generation Ke Pei, Wen Bosi, Feng Zhuoer, Liu Xiao, Lei Xuanyu, Cheng Jiale, Wang Shengyuan, Zeng Aohan, Dong Yuxiao, Wang Hongning, Tang Jie, Huang Minlie
- Turning English-centric Llms Into Polyglots: How Much Multilinguality Is Needed? Kew Tannon, Schottmann Florian, Sennrich Rico
- Q: How To Specialize Large Vision-language Models To Data-scarce VQA Tasks? A: Self-train On Unlabeled Images! Khan Zaid, Bg Vijay Kumar, Schulter Samuel, Yu Xiang, Fu Yun, Chandraker Manmohan
- Tst\(^\mathrm{r}\): Target Similarity Tuning Meets The Real World Khatry Anirudh, Gulwani Sumit, Gupta Priyanshu, Le Vu, Singha Ananya, Singh Mukul, Verbruggen Gust
- Reliability Check: An Analysis Of GPT-3's Response To Sensitive Topics And Prompt Wording Khatun Aisha, Brown Daniel G.
- Expediting Contrastive Language-image Pretraining Via Self-distilled Encoders Kim Bumsoo, Kim Jinhyung, Jo Yeonsik, Kim Seung Hwan
- Transferring Pre-trained Multimodal Representations With Cross-modal Similarity Matching Kim Byoungjip, Choi Sungik, Hwang Dasol, Lee Moontae, Lee Honglak
- SOLAR 10.7B: Scaling Large Language Models With Simple Yet Effective Depth Up-scaling Kim Dahyun, Park Chanjun, Kim Sanghoon, Lee Wonsung, Song Wonho, Kim Yunsu, Kim Hyeonwoo, Kim Yungi, Lee Hyeonju, Kim Jihoo, Ahn Changbae, Yang Seonghoon, Lee Sukyung, Park Hyunbyung, Gim Gyoungjin, Cha Mikyoung, Lee Hwalsuk, Kim Sunghun
- Fantom: A Benchmark For Stress-testing Machine Theory Of Mind In Interactions Kim Hyunwoo, Sclar Melanie, Zhou Xuhui, Bras Ronan Le, Kim Gunhee, Choi Yejin, Sap Maarten
- Compressed Context Memory For Online Language Model Interaction Kim Jang-hyun, Yeom Junyoung, Yun Sangdoo, Song Hyun Oh
- Memory-efficient Fine-tuning Of Compressed Large Language Models Via Sub-4-bit Integer Quantization Kim Jeonghoon, Lee Jung Hyun, Kim Sungdong, Park Joonsuk, Yoo Kang Min, Kwon Se Jung, Lee Dongsoo
- Speculative Decoding With Big Little Decoder Kim Sehoon, Mangalam Karttikeya, Moon Suhong, Malik Jitendra, Mahoney Michael W., Gholami Amir, Keutzer Kurt
- Cotever: Chain Of Thought Prompting Annotation Toolkit For Explanation Verification Kim Seungone, Joo Se June, Jang Yul, Chae Hyungjoo, Yeo Jinyoung
- MAGVLT: Masked Generative Vision-and-language Transformer Kim Sungwoong, Jo Daejin, Lee Donghoon, Kim Jongmin
- Finequant: Unlocking Efficiency With Fine-grained Weight-only Quantization For Llms Kim Young Jin, Henry Rawn, Fahim Raffy, Awadalla Hany Hassan
- Revisiting Intermediate Layer Distillation For Compressing Language Models: An Overfitting Perspective Ko Jongwoo, Park Seungjoon, Jeong Minchan, Hong Sukjin, Ahn Euijai, Chang Du-seong, Yun Se-young
- Transformer Language Models Handle Word Frequency In Prediction Head Kobayashi Goro, Kuribayashi Tatsuki, Yokoi Sho, Inui Kentaro
- Chatgpt: Jack Of All Trades, Master Of None Kocoń Jan, Cichecki Igor, Kaszyca Oliwier, Kochanek Mateusz, Szydło Dominika, Baran Joanna, Bielaniewicz Julita, Gruza Marcin, Janz Arkadiusz, Kanclerz Kamil, Kocoń Anna, Koptyra Bartłomiej, Mieleszczenko-kowszewicz Wiktoria, Miłkowski Piotr, Oleksy Marcin, Piasecki Maciej, Radliński Łukasz, Wojtasik Konrad, Woźniak Stanisław, Kazienko Przemysław
- Building A Llama2-finetuned LLM For Odia Language Utilizing Domain Knowledge Instruction Set Kohli Guneet Singh, Parida Shantipriya, Sekhar Sambit, Saha Samirit, Nair Nipun B, Agarwal Parul, Khosla Sonal, Patiyal Kusumlata, Dhal Debasish
- Contextual Refinement Of Translations: Large Language Models For Sentence And Document-level Post-editing Koneru Sai, Exel Miriam, Huck Matthias, Niehues Jan
- NOLA: Compressing Lora Using Linear Combination Of Random Basis Koohpayegani Soroush Abbasi, Navaneet Kl, Nooralinejad Parsa, Kolouri Soheil, Pirsiavash Hamed
- Humans In Humans Out: On GPT Converging Toward Common Sense In Both Success And Failure Koralus Philipp, Wang-maścianica Vincent
- Understanding Catastrophic Forgetting In Language Models Via Implicit Inference Kotha Suhas, Springer Jacob Mitchell, Raghunathan Aditi
- L-TUNING: Synchronized Label Tuning For Prompt And Prefix In Llms Kowsher Md., Sobuj Md. Shohanur Islam, Mahmud Asif, Prottasha Nusrat Jahan, Bhat Prakash
- Implicit Meta-learning May Lead Language Models To Trust More Reliable Sources Krasheninnikov Dmitrii, Krasheninnikov Egor, Mlodozeniec Bruno, Maharaj Tegan, Krueger David
- Prompt Generate Train (PGT): Few-shot Domain Adaption Of Retrieval Augmented Generation Models For Open Book Question-answering Krishna C. S.
- Fundamentals Of Generative Large Language Models And Perspectives In Cyber-defense Kucharavy Andrei, Schillaci Zachary, Maréchal Loïc, Würsch Maxime, Dolamic Ljiljana, Sabonnadiere Remi, David Dimitri Percia, Mermoud Alain, Lenders Vincent
- Sensi-bert: Towards Sensitivity Driven Fine-tuning For Parameter-efficient BERT Kundu Souvik, Sridhar Sharath Nittur, Szankin Maciej, Sundaresan Sairam
- DACBERT: Leveraging Dependency Agreement For Cost-efficient Bert Pretraining Kuo Martin, Zhang Jianyi, Chen Yiran
- Sparse Fine-tuning For Inference Acceleration Of Large Language Models Kurtic Eldar, Kuznedelev Denis, Frantar Elias, Goin Michael, Alistarh Dan
- Jotr: A Joint Transformer And Reinforcement Learning Framework For Dialog Policy Learning Kwan Wai-chung, Wang Huimin, Wang Hongru, Wang Zezhong, Wu Xian, Zheng Yefeng, Wong Kam-fai
- M4LE: A Multi-ability Multi-range Multi-task Multi-domain Long-context Evaluation Benchmark For Large Language Models Kwan Wai-chung, Zeng Xingshan, Wang Yufei, Sun Yusen, Li Liangyou, Shang Lifeng, Liu Qun, Wong Kam-fai
- Hallucination Augmented Recitations For Language Models Köksal Abdullatif, Aksitov Renat, Chang Chung-ching
- Openassistant Conversations -- Democratizing Large Language Model Alignment Köpf Andreas, Kilcher Yannic, Von Rütte Dimitri, Anagnostidis Sotiris, Tam Zhi-rui, Stevens Keith, Barhoum Abdullah, Duc Nguyen Minh, Stanley Oliver, Nagyfi Richárd, Es Shahul, Suri Sameer, Glushkov David, Dantuluri Arnav, Maguire Andrew, Schuhmann Christoph, Nguyen Huu, Mattick Alexander
- Enhancing Transformers Without Self-supervised Learning: A Loss Landscape Perspective In Sequential Recommendation Lai Vivian, Chen Huiyuan, Yeh Chin-chia Michael, Xu Minghua, Cai Yiwei, Yang Hao
- LISA: Reasoning Segmentation Via Large Language Model Lai Xin, Tian Zhuotao, Chen Yukang, Li Yanwei, Yuan Yuhui, Liu Shu, Jia Jiaya
- Exploring Large Language Models To Facilitate Variable Autonomy For Human-robot Teaming Lakhnati Younes, Pascher Max, Gerken Jens
- A Data Source For Reasoning Embodied Agents Lanchantin Jack, Sukhbaatar Sainbayar, Synnaeve Gabriel, Sun Yuxuan, Srinet Kavya, Szlam Arthur
- Cabrita: Closing The Gap For Foreign Languages Larcher Celio, Piau Marcos, Finardi Paulo, Gengo Pedro, Esposito Piero, Caridá Vinicius
- An Experimental Study On Pretraining Transformers From Scratch For IR Lassance Carlos, Déjean Hervé, Clinchant Stéphane
- Fine-tuning Chatgpt For Automatic Scoring Latif Ehsan, Zhai Xiaoming
- Building Efficient Universal Classifiers With Natural Language Inference Laurer Moritz, Van Atteveldt Wouter, Casas Andreu, Welbers Kasper
- Voicebox: Text-guided Multilingual Universal Speech Generation At Scale Le Matthew, Vyas Apoorv, Shi Bowen, Karrer Brian, Sari Leda, Moritz Rashel, Williamson Mary, Manohar Vimal, Adi Yossi, Mahadeokar Jay, Hsu Wei-ning
- Platypus: Quick, Cheap, And Powerful Refinement Of Llms Lee Ariel N., Hunter Cole J., Ruiz Nataniel
- Prompted Llms As Chatbot Modules For Long Open-domain Conversation Lee Gibbeum, Hartmann Volker, Park Jongho, Papailiopoulos Dimitris, Lee Kangwook
- Is Attention Required For ICL? Exploring The Relationship Between Model Architecture And In-context Learning Ability Lee Ivan, Jiang Nan, Berg-kirkpatrick Taylor
- Partially Randomizing Transformer Weights For Dialogue Response Diversity Lee Jing Yang, Lee Kong Aik, Gan Woon-seng
- Teaching Arithmetic To Small Transformers Lee Nayoung, Sreenivasan Kartik, Lee Jason D., Lee Kangwook, Papailiopoulos Dimitris
- Can Large Language Models Capture Dissenting Human Voices? Lee Noah, An Na Min, Thorne James
- Do We Still Need Clinical Language Models? Lehman Eric, Hernandez Evan, Mahajan Diwakar, Wulff Jonas, Smith Micah J., Ziegler Zachary, Nadler Daniel, Szolovits Peter, Johnson Alistair, Alsentzer Emily
- Chain Of Natural Language Inference For Reducing Large Language Model Ungrounded Hallucinations Lei Deren, Li Yaxi, Hu Mengya, Wang Mingyu, Yun Vincent, Ching Emily, Kamal Eslam
- Mitigating Fine-grained Hallucination By Fine-tuning Large Vision-language Models With Caption Rewrites Lei Wang, Jiabang He, Shenshen Li, Ning Liu, Ee-peng Lim
- T-sciq: Teaching Multimodal Chain-of-thought Reasoning Via Mixed Large Language Model Signals For Science Question Answering Lei Wang, Yi Hu, Jiabang He, Xing Xu, Ning Liu, Hui Liu, Heng Tao Shen
- The Language Of Prompting: What Linguistic Properties Make A Prompt Successful? Leidinger Alina, Van Rooij Robert, Shutova Ekaterina
- The Eval4nlp 2023 Shared Task On Prompting Large Language Models As Explainable Metrics Leiter Christoph, Opitz Juri, Deutsch Daniel, Gao Yang, Dror Rotem, Eger Steffen
- Self-detoxifying Language Models Via Toxification Reversal Leong Chak Tou, Cheng Yi, Wang Jiashuo, Wang Jian, Li Wenjie
- Lora Fine-tuning Efficiently Undoes Safety Training In Llama 2-chat 70B Lermen Simon, Rogers-smith Charlie, Ladish Jeffrey
- Sequential Monte Carlo Steering Of Large Language Models Using Probabilistic Programs Lew Alexander K., Zhi-xuan Tan, Grand Gabriel, Mansinghka Vikash K.
- Learning To Rewrite Prompts For Personalized Text Generation Li Cheng, Zhang Mingyang, Mei Qiaozhu, Kong Weize, Bendersky Michael
- Mugglemath: Assessing The Impact Of Query And Response Augmentation On Math Reasoning Li Chengpeng, Yuan Zheng, Yuan Hongyi, Dong Guanting, Lu Keming, Wu Jiancan, Tan Chuanqi, Wang Xiang, Zhou Chang
- DISTFLASHATTN: Distributed Memory-efficient Attention For Long-context Llms Training Li Dacheng, Shao Rulin, Xie Anze, Xing Eric P., Ma Xuezhe, Stoica Ion, Gonzalez Joseph E., Zhang Hao
- Mastering Robot Manipulation With Multimodal Prompts Through Pretraining And Multi-task Fine-tuning Li Jiachen, Gao Qiaozi, Johnston Michael, Gao Xiaofeng, He Xuehai, Shakiah Suhaila, Shi Hangjie, Ghanadan Reza, Wang William Yang
- Text Is All You Need: Learning Language Representations For Sequential Recommendation Li Jiacheng, Wang Ming, Li Jin, Fu Jinmiao, Shen Xin, Shang Jingbo, Mcauley Julian
- Eliciting The Translation Ability Of Large Language Models Via Multilingual Finetuning With Translation Instructions Li Jiahuan, Zhou Hao, Huang Shujian, Cheng Shanbo, Chen Jiajun
- CFGPT: Chinese Financial Assistant With Large Language Model Li Jiangtong, Bian Yuxuan, Wang Guoxuan, Lei Yang, Cheng Dawei, Ding Zhijun, Jiang Changjun
- Evcap: Retrieval-augmented Image Captioning With External Visual-name Memory For Open-world Comprehension Li Jiaxuan, Vo Duc Minh, Sugimoto Akihiro, Nakayama Hideki
- Graphix-t5: Mixing Pre-trained Transformers With Graph-aware Layers For Text-to-sql Parsing Li Jinyang, Hui Binyuan, Cheng Reynold, Qin Bowen, Ma Chenhao, Huo Nan, Huang Fei, Du Wenyu, Si Luo, Li Yongbin
- Prefix Propagation: Parameter-efficient Tuning For Long Sequences Li Jonathan, Aitken Will, Bhambhoria Rohan, Zhu Xiaodan
- MAQA: A Multimodal QA Benchmark For Negation Li Judith Yue, Jansen Aren, Huang Qingqing, Lee Joonseok, Ganti Ravi, Kuzmin Dima
- Fine-tuning Multimodal Llms To Follow Zero-shot Demonstrative Instructions Li Juncheng, Pan Kaihang, Ge Zhiqi, Gao Minghe, Ji Wei, Zhang Wenqiao, Chua Tat-seng, Tang Siliang, Zhang Hanwang, Zhuang Yueting
- Openba: An Open-sourced 15B Bilingual Asymmetric Seq2seq Model Pre-trained From Scratch Li Juntao, Tang Zecheng, Ding Yuyang, Wang Pinzheng, Guo Pei, You Wangjie, Qiao Dan, Chen Wenliang, Fu Guohong, Zhu Qiaoming, Zhou Guodong, Zhang Min
- The Web Can Be Your Oyster For Improving Large Language Models Li Junyi, Tang Tianyi, Zhao Wayne Xin, Wang Jingyuan, Nie Jian-yun, Wen Ji-rong
- Tool-augmented Reward Modeling Li Lei, Chai Yekun, Wang Shuohuan, Sun Yu, Tian Hao, Zhang Ningyu, Wu Hua
- Watermarking Llms With Weight Quantization Li Linyang, Jiang Botian, Wang Pengyu, Ren Ke, Yan Hang, Qiu Xipeng
- Laffi: Leveraging Hybrid Natural Language Feedback For Fine-tuning Language Models Li Qianxi, Cao Yingyue, Kang Jikun, Yang Tianpei, Chen Xi, Jin Jun, Taylor Matthew E.
- Vlm-eval: A General Evaluation On Video Large Language Models Li Shuailin, Zhang Yuang, Zhao Yucheng, Wang Qiuyue, Jia Fan, Liu Yingfei, Wang Tiancai
- Tailoring Personality Traits In Large Language Models Via Unsupervisedly-built Personalized Lexicons Li Tianlong, Dou Shihan, Lv Changze, Liu Wenhao, Xu Jianhan, Wu Muling, Ling Zixuan, Zheng Xiaoqing, Huang Xuanjing
- Benchmarking And Improving Generator-validator Consistency Of Language Models Li Xiang Lisa, Shrivastava Vaishnavi, Li Siyan, Hashimoto Tatsunori, Liang Percy
- Mot: Memory-of-thought Enables Chatgpt To Self-improve Li Xiaonan, Qiu Xipeng
- Exploring Fine-tuning Chatgpt For News Recommendation Li Xinyi, Zhang Yongfeng, Malthouse Edward C.
- PBNR: Prompt-based News Recommender System Li Xinyi, Zhang Yongfeng, Malthouse Edward C.
- Explicit Syntactic Guidance For Neural Text Generation Li Yafu, Cui Leyang, Yan Jianhao, Yin Yongjing, Bi Wei, Shi Shuming, Zhang Yue
- On Bilingual Lexicon Induction With Large Language Models Li Yaoyiran, Korhonen Anna, Vulić Ivan
- Dissecting Chain-of-thought: Compositionality Through In-context Filtering And Learning Li Yingcong, Sreenivasan Kartik, Giannou Angeliki, Papailiopoulos Dimitris, Oymak Samet
- A Survey On Fairness In Large Language Models Li Yingji, Du Mengnan, Song Rui, Wang Xin, Wang Ying
- Loftq: Lora-fine-tuning-aware Quantization For Large Language Models Li Yixiao, Yu Yifan, Liang Chen, He Pengcheng, Karampatziakis Nikos, Chen Weizhu, Zhao Tuo
- Losparse: Structured Compression Of Large Language Models Based On Low-rank And Sparse Approximation Li Yixiao, Yu Yifan, Zhang Qingru, Liang Chen, He Pengcheng, Chen Weizhu, Zhao Tuo
- Textbooks Are All You Need II: Phi-1.5 Technical Report Li Yuanzhi, Bubeck Sébastien, Eldan Ronen, Del Giorno Allie, Gunasekar Suriya, Lee Yin Tat
- White-box Multi-objective Adversarial Attack On Dialogue Generation Li Yufei, Li Zexin, Gao Yingfan, Liu Cong
- E-sparse: Boosting The Large Language Model Inference Through Entropy-based N:M Sparsity Li Yun, Niu Lin, Zhang Xipeng, Liu Kai, Zhu Jianchen, Kang Zhanhui
- Advancing Precise Outline-conditioned Text Generation With Task Duality And Explicit Outline Control Li Yunzhe, Chen Qian, Yan Weixiang, Wang Wen, Zhang Qinglin, Sundaram Hari
- Guiding Large Language Models Via Directional Stimulus Prompting Li Zekun, Peng Baolin, He Pengcheng, Galley Michel, Gao Jianfeng, Yan Xifeng
- QFT: Quantized Full-parameter Tuning Of Llms With Affordable Resources Li Zhikai, Liu Xiaoxuan, Zhu Banghua, Dong Zhen, Gu Qingyi, Keutzer Kurt
- Unveiling The Pitfalls Of Knowledge Editing For Large Language Models Li Zhoubo, Zhang Ningyu, Yao Yunzhi, Wang Mengru, Chen Xi, Chen Huajun
- Batgpt: A Bidirectional Autoregessive Talker From Generative Pre-trained Transformer Li Zuchao, Zhang Shitou, Zhao Hai, Yang Yifei, Yang Dongjie
- Homodistil: Homotopic Task-agnostic Distillation Of Pre-trained Transformers Liang Chen, Jiang Haoming, Li Zheng, Tang Xianfeng, Yin Bin, Zhao Tuo
- Prompting Large Language Models With Chain-of-thought For Few-shot Knowledge Base Question Generation Liang Yuanyuan, Wang Jianing, Zhu Hanlun, Wang Lei, Qian Weining, Lan Yunshi
- Make Pre-trained Model Reversible: From Parameter To Memory Efficient Fine-tuning Liao Baohao, Tan Shaomu, Monz Christof
- Opening Up Chatgpt: Tracking Openness, Transparency, And Accountability In Instruction-tuned Text Generators Liesenfeld Andreas, Lopez Alianda, Dingemanse Mark
- STEVE-1: A Generative Model For Text-to-behavior In Minecraft Lifshitz Shalev, Paster Keiran, Chan Harris, Ba Jimmy, Mcilraith Sheila
- A Survey On Large Language Models For Recommendation Likang Wu, Zhi Zheng, Zhaopeng Qiu, Hao Wang, Hongchao Gu, Tingjia Shen, Chuan Qin, Chen Zhu, Hengshu Zhu, Qi Liu, Hui Xiong, Enhong Chen
- The Unlocking Spell On Base Llms: Rethinking Alignment Via In-context Learning Lin Bill Yuchen, Ravichander Abhilasha, Lu Ximing, Dziri Nouha, Sclar Melanie, Chandu Khyathi, Bhagavatula Chandra, Choi Yejin
- Paralinguistics-enhanced Large Language Modeling Of Spoken Dialogue Lin Guan-ting, Shivakumar Prashanth Gurunath, Gandhe Ankur, Yang Chao-han Huck, Gu Yile, Ghosh Shalini, Stolcke Andreas, Lee Hung-yi, Bulyko Ivan
- Beneath The Surface: Unveiling Harmful Memes With Multimodal Reasoning Distilled From Large Language Models Lin Hongzhan, Luo Ziyang, Ma Jing, Chen Long
- VILA: On Pre-training For Visual Language Models Lin Ji, Yin Hongxu, Ping Wei, Lu Yao, Molchanov Pavlo, Tao Andrew, Mao Huizi, Kautz Jan, Shoeybi Mohammad, Han Song
- Advances In Embodied Navigation Using Large Language Models: A Survey Lin Jinzhou, Gao Han, Feng Xuxiang, Xu Rongtao, Wang Changwei, Zhang Man, Guo Li, Xu Shibiao
- Few-shot Adaptation For Parsing Contextual Utterances With Llms Lin Kevin, Xia Patrick, Fang Hao
- RA-DIT: Retrieval-augmented Dual Instruction Tuning Lin Xi Victoria, Chen Xilun, Chen Mingda, Shi Weijia, Lomeli Maria, James Rich, Rodriguez Pedro, Kahn Jacob, Szilvasy Gergely, Lewis Mike, Zettlemoyer Luke, Yih Scott
- Taiwan LLM: Bridging The Linguistic Divide With A Culturally Aligned Language Model Lin Yen-ting, Chen Yun-nung
- Mitigating The Alignment Tax Of RLHF Lin Yong, Lin Hangyu, Xiong Wei, Diao Shizhe, Liu Jianmeng, Zhang Jipeng, Pan Rui, Wang Haoxiang, Hu Wenbin, Zhang Hanning, Dong Hanze, Pi Renjie, Zhao Han, Jiang Nan, Ji Heng, Yao Yuan, Zhang Tong
- Pushing Large Language Models To The 6G Edge: Vision, Challenges, And Opportunities Lin Zheng, Qu Guanqiao, Chen Qiyuan, Chen Xianhao, Chen Zhe, Huang Kaibin
- Comparing Generative Chatbots Based On Process Requirements Lins Luis Fernando, Nascimento Nathalia, Alencar Paulo, Oliveira Toacy, Cowan Donald
- Exposing Attention Glitches With Flip-flop Language Modeling Liu Bingbin, Ash Jordan T., Goel Surbhi, Krishnamurthy Akshay, Zhang Cyril
- Code Execution With Pre-trained Language Models Liu Chenxiao, Lu Shuai, Chen Weizhu, Jiang Daxin, Svyatkovskiy Alexey, Fu Shengyu, Sundaresan Neel, Duan Nan
- Examining Llms' Uncertainty Expression Towards Questions Outside Parametric Knowledge Liu Genglin, Wang Xingyao, Yuan Lifan, Chen Yangyi, Peng Hao
- Logicot: Logical Chain-of-thought Instruction-tuning Liu Hanmeng, Teng Zhiyang, Cui Leyang, Zhang Chaoli, Zhou Qiji, Zhang Yue
- Glore: Evaluating Logical Reasoning Of Large Language Models Liu Hanmeng, Teng Zhiyang, Ning Ruoxi, Liu Jian, Zhou Qiji, Zhang Yue
- Exploration With Principles For Diverse AI Supervision Liu Hao, Zaharia Matei, Abbeel Pieter
- Ring Attention With Blockwise Transformers For Near-infinite Context Liu Hao, Zaharia Matei, Abbeel Pieter
- Qilin-med-vl: Towards Chinese Large Vision-language Model For General Healthcare Liu Junling, Wang Ziming, Ye Qichen, Chong Dading, Zhou Peilin, Hua Yining
- Contrastive Vision-language Alignment Makes Efficient Instruction Learner Liu Lizhao, Sun Xinyu, Xiang Tianhang, Zhuang Zhuangwei, Yin Liuren, Tan Mingkui
- Chipnemo: Domain-adapted Llms For Chip Design Liu Mingjie, Ene Teodor-dumitru, Kirby Robert, Cheng Chris, Pinckney Nathaniel, Liang Rongjian, Alben Jonah, Anand Himyanshu, Banerjee Sanmitra, Bayraktaroglu Ismet, Bhaskaran Bonita, Catanzaro Bryan, Chaudhuri Arjun, Clay Sharon, Dally Bill, Dang Laura, Deshpande Parikshit, Dhodhi Siddhanth, Halepete Sameer, Hill Eric, Hu Jiashang, Jain Sumit, Jindal Ankit, Khailany Brucek, Kokai George, Kunal Kishor, Li Xiaowei, Lind Charley, Liu Hao, Oberman Stuart, Omar Sujeet, Pasandi Ghasem, Pratty Sreedhar, Raiman Jonathan, Sarkar Ambar, Shao Zhengjiang, Sun Hanfei, Suthar Pratik P, Tej Varun, Turner Walker, Xu Kaizhe, Ren Haoxing
- Nlebench+norglm: A Comprehensive Empirical Analysis And Benchmark Dataset For Generative Language Models In Norwegian Liu Peng, Zhang Lemei, Farup Terje Nissen, Lauvrak Even W., Ingvaldsen Jon Espen, Eide Simen, Gulla Jon Atle, Yang Zhirong
- From Zero To Hero: Examining The Power Of Symbolic Tasks In Instruction Tuning Liu Qian, Zhou Fan, Jiang Zhengbao, Dou Longxu, Lin Min
- In-context Vectors: Making In Context Learning More Effective And Controllable Through Latent Space Steering Liu Sheng, Ye Haotian, Xing Lei, Zou James
- Language Models As Black-box Optimizers For Vision-language Models Liu Shihong, Lin Zhiqiu, Yu Samuel, Lee Ryan, Ling Tiffany, Pathak Deepak, Ramanan Deva
- Goat: Fine-tuned Llama Outperforms GPT-4 On Arithmetic Tasks Liu Tiedong, Low Bryan Kian Hsiang
- Aligning Large Language Models With Human Preferences Through Representation Engineering Liu Wenhao, Wang Xiaohua, Wu Muling, Li Tianlong, Lv Changze, Ling Zixuan, Zhu Jianhao, Zhang Cenyuan, Zheng Xiaoqing, Huang Xuanjing
- Fingpt: Democratizing Internet-scale Data For Financial Large Language Models Liu Xiao-yang, Wang Guoxuan, Yang Hongyang, Zha Daochen
- Scaling Laws Of Rope-based Extrapolation Liu Xiaoran, Yan Hang, Zhang Shuo, An Chenxin, Qiu Xipeng, Lin Dahua
- BOLT: Fast Energy-based Controlled Text Generation With Tunable Biases Liu Xin, Khalifa Muhammad, Wang Lu
- Litcab: Lightweight Language Model Calibration Over Short- And Long-form Responses Liu Xin, Khalifa Muhammad, Wang Lu
- Query-utterance Attention With Joint Modeling For Query-focused Meeting Summarization Liu Xingxian, Duan Bin, Xiao Bo, Xu Yajing
- Parallel Sentence-level Explanation Generation For Real-world Low-resource Scenarios Liu Yan, Chen Xiaokang, Dai Qi
- Summary Of Chatgpt-related Research And Perspective Towards The Future Of Large Language Models Liu Yiheng, Han Tianle, Ma Siyuan, Zhang Jiayue, Yang Yuanyuan, Tian Jiaming, He Hao, Li Antong, He Mengshen, Liu Zhengliang, Wu Zihao, Zhao Lin, Zhu Dajiang, Li Xiang, Qiang Ning, Shen Dingang, Liu Tianming, Ge Bao
- Instruction Position Matters In Sequence Generation With Large Language Models Liu Yijin, Zeng Xianfeng, Meng Fandong, Zhou Jie
- On Learning To Summarize With Large Language Models As References Liu Yixin, Shi Kejian, He Katherine S, Ye Longtian, Fabbri Alexander R., Liu Pengfei, Radev Dragomir, Cohan Arman
- Binary And Ternary Natural Language Generation Liu Zechun, Oguz Barlas, Pappu Aasish, Shi Yangyang, Krishnamoorthi Raghuraman
- Molxpt: Wrapping Molecules With Text For Generative Pre-training Liu Zequn, Zhang Wei, Xia Yingce, Wu Lijun, Xie Shufang, Qin Tao, Zhang Ming, Liu Tie-yan
- Picking The Underused Heads: A Network Pruning Perspective Of Attention Head Selection For Fusing Dialogue Coreference Information Liu Zhengyuan, Chen Nancy F.
- Reason For Future, Act For Now: A Principled Framework For Autonomous LLM Agents With Provable Sample Efficiency Liu Zhihan, Hu Hao, Zhang Shenao, Guo Hongyi, Ke Shuqi, Liu Boyi, Wang Zhaoran
- Deja Vu: Contextual Sparsity For Efficient Llms At Inference Time Liu Zichang, Wang Jue, Dao Tri, Zhou Tianyi, Yuan Binhang, Song Zhao, Shrivastava Anshumali, Zhang Ce, Tian Yuandong, Re Christopher, Chen Beidi
- SAIE Framework: Support Alone Isn't Enough -- Advancing LLM Training With Adversarial Remarks Loem Mengsay, Kaneko Masahiro, Okazaki Naoaki
- Unified-io 2: Scaling Autoregressive Multimodal Models With Vision, Language, Audio, And Action Lu Jiasen, Clark Christopher, Lee Sangho, Zhang Zichen, Khosla Savya, Marten Ryan, Hoiem Derek, Kembhavi Aniruddha
- Llama-reviewer: Advancing Code Review Automation With Large Language Models Through Parameter-efficient Fine-tuning Lu Junyi, Yu Lei, Li Xiaojia, Yang Li, Zuo Chun
- Ziya-visual: Bilingual Large Vision-language Model Via Multi-task Instruction Tuning Lu Junyu, Zhang Dixiang, Wu Xiaojun, Gao Xinyu, Gan Ruyi, Zhang Jiaxing, Song Yan, Zhang Pingjian
- Lyrics: Boosting Fine-grained Language-vision Alignment And Comprehension Via Semantic-aware Visual Objects Lu Junyu, Zhang Dixiang, Zhang Songxin, Xie Zejian, Song Zhuoyang, Lin Cong, Zhang Jiaxing, Jing Bingyi, Zhang Pingjian
- Inference-time Policy Adapters (IPA): Tailoring Extreme-scale Lms Without Fine-tuning Lu Ximing, Brahman Faeze, West Peter, Jang Jaehun, Chandu Khyathi, Ravichander Abhilasha, Qin Lianhui, Ammanabrolu Prithviraj, Jiang Liwei, Ramnath Sahana, Dziri Nouha, Fisher Jillian, Lin Bill Yuchen, Hallinan Skyler, Ren Xiang, Welleck Sean, Choi Yejin
- SCITAB: A Challenging Benchmark For Compositional Reasoning And Claim Verification On Scientific Tables Lu Xinyuan, Pan Liangming, Liu Qian, Nakov Preslav, Kan Min-yen
- Effectively Fine-tune To Improve Large Multimodal Models For Radiology Report Generation Lu Yuzhe, Hong Sungmin, Shah Yash, Xu Panpan
- Chatgpt And A New Academic Reality: Artificial Intelligence-written Research Papers And The Ethics Of The Large Language Models In Scholarly Publishing Lund Brady, Wang Ting, Mannuru Nishith Reddy, Nie Bing, Shimray Somipam, Wang Ziang
- Prosg: Using Prompt Synthetic Gradients To Alleviate Prompt Forgetting Of Rnn-like Language Models Luo Haotian, Wu Kunming, Dai Cheng, Ding Sixian, Chen Xinhao
- Exploring Small Language Models With Prompt-learning Paradigm For Efficient Domain-specific Text Classification Luo Hengyu, Liu Peng, Esping Stefan
- SAIL: Search-augmented Instruction Learning Luo Hongyin, Chuang Yung-sung, Gong Yuan, Zhang Tianhua, Kim Yoon, Wu Xixin, Fox Danny, Meng Helen, Glass James
- Taiyi: A Bilingual Fine-tuned Large Language Model For Diverse Biomedical Tasks Luo Ling, Ning Jinzhong, Zhao Yingwen, Wang Zhijun, Ding Zeyuan, Chen Peng, Fu Weiru, Han Qinyu, Xu Guangtao, Qiu Yunzhi, Pan Dinghao, Li Jiru, Li Hao, Feng Wenduo, Tu Senbo, Liu Yuqi, Yang Zhihao, Wang Jian, Sun Yuanyuan, Lin Hongfei
- Systematic Assessment Of Factual Knowledge In Large Language Models Luo Linhao, Vu Thuy-trang, Phung Dinh, Haffari Gholamreza
- Towards Logiglue: A Brief Survey And A Benchmark For Analyzing Logical Reasoning Capabilities Of Language Models Luo Man, Kumbhar Shrinidhi, Shen Ming, Parmar Mihir, Varshney Neeraj, Banerjee Pratyay, Aditya Somak, Baral Chitta
- YAYI 2: Multilingual Open-source Large Language Models Luo Yin, Kong Qingchao, Xu Nan, Cao Jia, Hao Bao, Qu Baoyu, Chen Bo, Zhu Chao, Zhao Chenyang, Zhang Donglei, Feng Fan, Zhao Feifei, Sun Hailong, Yang Hanxuan, Pan Haojun, Liu Hongyu, Guo Jianbin, Du Jiangtao, Wang Jingyi, Li Junfeng, Sun Lei, Liu Liduo, Dong Lifeng, Liu Lili, Wang Lin, Zhang Liwen, Wang Minzheng, Wang Pin, Yu Ping, Li Qingxiao, Yan Rui, Zou Rui, Li Ruiqun, Huang Taiwen, Wang Xiaodong, Wu Xiaofei, Peng Xin, Zhang Xina, Fang Xing, Xiao Xinglin, Hao Yanni, Dong Yao, Wang Yigang, Liu Ying, Jiang Yongyu, Wang Yungan, Wang Yuqi, Wang Zhangsheng, Yu Zhaoxin, Luo Zhen, Mao Wenji, Wang Lei, Zeng Dajun
- Biomedgpt: Open Multimodal Generative Pre-trained Transformer For Biomedicine Luo Yizhen, Zhang Jiahuan, Fan Siqi, Yang Kai, Wu Yushuai, Qiao Mu, Nie Zaiqing
- Unlocking The Potential Of Large Language Models For Explainable Recommendations Luo Yucong, Cheng Mingyue, Zhang Hao, Lu Junyu, Liu Qi, Chen Enhong
- An Empirical Study Of Catastrophic Forgetting In Large Language Models During Continual Fine-tuning Luo Yun, Yang Zhen, Meng Fandong, Li Yafu, Zhou Jie, Zhang Yue
- Augmented Large Language Models With Parametric Knowledge Guiding Luo Ziyang, Xu Can, Zhao Pu, Geng Xiubo, Tao Chongyang, Ma Jing, Lin Qingwei, Jiang Daxin
- Fingpt: Large Generative Models For A Small Language Luukkonen Risto, Komulainen Ville, Luoma Jouni, Eskelinen Anni, Kanerva Jenna, Kupari Hanna-mari, Ginter Filip, Laippala Veronika, Muennighoff Niklas, Piktus Aleksandra, Wang Thomas, Tazi Nouamane, Scao Teven Le, Wolf Thomas, Suominen Osma, Sairanen Samuli, Merioksa Mikko, Heinonen Jyrki, Vahtola Aija, Antao Samuel, Pyysalo Sampo
- Are We Falling In A Middle-intelligence Trap? An Analysis And Mitigation Of The Reversal Curse Lv Ang, Zhang Kaiyi, Xie Shufang, Tu Quan, Chen Yuhan, Wen Ji-rong, Yan Rui
- Full Parameter Fine-tuning For Large Language Models With Limited Resources Lv Kai, Yang Yuqing, Liu Tengxiao, Gao Qinghui, Guo Qipeng, Qiu Xipeng
- Collie: Collaborative Training Of Large Language Models In An Efficient Way Lv Kai, Zhang Shuo, Gu Tianle, Xing Shuhao, Hong Jiawei, Chen Keyu, Liu Xiaoran, Yang Yuqing, Guo Honglin, Liu Tengxiao, Sun Yu, Guo Qipeng, Yan Hang, Qiu Xipeng
- KOSMOS-2.5: A Multimodal Literate Model Lv Tengchao, Huang Yupan, Chen Jingye, Zhao Yuzhong, Jia Yilin, Cui Lei, Ma Shuming, Chang Yaoyao, Huang Shaohan, Wang Wenhui, Dong Li, Luo Weiyao, Wu Shaoxiang, Wang Guoxin, Zhang Cha, Wei Furu
- Comparative Analysis Of Drug-gpt And Chatgpt Llms For Healthcare Insights: Evaluating Accuracy And Relevance In Patient And HCP Contexts Lysandrou Giorgos, Owen Roma English, Mursec Kirsty, Brun Grant Le, Fairley Elizabeth A. L.
- Pre-training With Large Language Model-based Document Expansion For Dense Passage Retrieval Ma Guangyuan, Wu Xing, Wang Peng, Lin Zijia, Hu Songlin
- Chain-of-skills: A Configurable Model For Open-domain Question Answering Ma Kaixin, Cheng Hao, Zhang Yu, Liu Xiaodong, Nyberg Eric, Gao Jianfeng
- Fine-tuning Llama For Multi-stage Text Retrieval Ma Xueguang, Wang Liang, Yang Nan, Wei Furu, Lin Jimmy
- VOLTA: Improving Generative Diversity By Variational Mutual Information Maximizing Autoencoder Ma Yueen, Chi Dafeng, Li Jingjing, Song Kai, Zhuang Yuzheng, King Irwin
- Non-autoregressive Streaming Transformer For Simultaneous Translation Ma Zhengrui, Zhang Shaolei, Guo Shoutao, Shao Chenze, Zhang Min, Feng Yang
- Harnessing The Power Of Prompt-based Techniques For Generating School-level Questions Using Large Language Models Maity Subhankar, Deroy Aniket, Sarkar Sudeshna
- Fine-tuning Language Models With Just Forward Passes Malladi Sadhika, Gao Tianyu, Nichani Eshaan, Damian Alex, Lee Jason D., Chen Danqi, Arora Sanjeev
- Unitrec: A Unified Text-to-text Transformer And Joint Contrastive Learning Framework For Text-based Recommendation Mao Zhiming, Wang Huimin, Du Yiming, Wong Kam-fai
- ICL Markup: Structuring In-context Learning Using Soft-token Tags Marc-etienne Brunet, Ashton Anderson, Richard Zemel
- When Less Is More: Investigating Data Pruning For Pretraining Llms At Scale Marion Max, Üstün Ahmet, Pozzobon Luiza, Wang Alex, Fadaee Marzieh, Hooker Sara
- Interpreting Learned Feedback Patterns In Large Language Models Marks Luke, Abdullah Amir, Neo Clement, Arike Rauno, Krueger David, Torr Philip, Barez Fazl
- Chatgpt Vs State-of-the-art Models: A Benchmarking Study In Keyphrase Generation Task Martínez-cruz Roberto, López-lópez Alvaro J., Portela José
- Investigating The Efficacy Of Large Language Models In Reflective Assessment Methods Through Chain Of Thoughts Prompting Masikisiki Baphumelele, Marivate Vukosi, Hlope Yvette
- Benchmarking Large Language Model Capabilities For Conditional Generation Maynez Joshua, Agrawal Priyanka, Gehrmann Sebastian
- Is Feedback All You Need? Leveraging Natural Language Feedback In Goal-conditioned Reinforcement Learning Mccallum Sabrina, Taylor-davies Max, Albrecht Stefano V., Suglia Alessandro
- Sources Of Hallucination By Large Language Models On Inference Tasks Mckenna Nick, Li Tianyi, Cheng Liang, Hosseini Mohammad Javad, Johnson Mark, Steedman Mark
- Lightlm: A Lightweight Deep And Narrow Language Model For Generative Recommendation Mei Kai, Zhang Yongfeng
- Enhancing LLM Intelligence With ARM-RAG: Auxiliary Rationale Memory For Retrieval Augmented Generation Melz Eric
- Representation Deficiency In Masked Language Modeling Meng Yu, Krishnan Jitin, Wang Sinong, Wang Qifan, Mao Yuning, Fang Han, Ghazvininejad Marjan, Han Jiawei, Zettlemoyer Luke
- Think Before You Act: Unified Policy For Interleaving Language Reasoning With Actions Mezghani Lina, Bojanowski Piotr, Alahari Karteek, Sukhbaatar Sainbayar
- Self-checker: Plug-and-play Modules For Fact-checking With Large Language Models Miaoran Li, Baolin Peng, Michel Galley, Jianfeng Gao, Zhu Zhang
- Uzh_clyp At Semeval-2023 Task 9: Head-first Fine-tuning And Chatgpt Data Generation For Cross-lingual Learning In Tweet Intimacy Prediction Michail Andrianos, Konstantinou Stefanos, Clematide Simon
- LAIT: Efficient Multi-segment Encoding In Transformers With Layer-adjustable Interaction Milbauer Jeremiah, Louis Annie, Hosseini Mohammad Javad, Fabrikant Alex, Metzler Donald, Schuster Tal
- MEDIMP: 3D Medical Images With Clinical Prompts From Limited Tabular Data For Renal Transplantation Milecki Leo, Kalogeiton Vicky, Bodard Sylvain, Anglicheau Dany, Correas Jean-michel, Timsit Marc-olivier, Vakalopoulou Maria
- Attention Link: An Efficient Attention-based Low Resource Machine Translation Architecture Min Zeping
- PEFTT: Parameter-efficient Fine-tuning For Low-resource Tibetan Pre-trained Language Models Mingjun Zhou, Zhuoma Daiqing, Nuo Qun, Tashi Nyima
- Synthetic Imitation Edit Feedback For Factual Alignment In Clinical Summarization Mishra Prakamya, Yao Zonghai, Chen Shuwei, Wang Beining, Mittal Rohan, Yu Hong
- An Emulator For Fine-tuning Large Language Models Using Small Language Models Mitchell Eric, Rafailov Rafael, Sharma Archit, Finn Chelsea, Manning Christopher D.
- Compositional Chain-of-thought Prompting For Large Multimodal Models Mitra Chancharik, Huang Brandon, Darrell Trevor, Herzig Roei
- How Trustworthy Are Open-source Llms? An Assessment Under Malicious Demonstrations Shows Their Vulnerabilities Mo Lingbo, Wang Boshi, Chen Muhao, Sun Huan
- Landmark Attention: Random-access Infinite Context Length For Transformers Mohtashami Amirkeivan, Jaggi Martin
- Increasing The Performance Of Cognitively Inspired Data-efficient Language Models Via Implicit Structure Building Momen Omar, Arps David, Kallmeyer Laura
- Adaptive Machine Translation With Large Language Models Moslem Yasmin, Haque Rejwanul, Kelleher John D., Way Andy
- Fine-tuning Large Language Models For Adaptive Machine Translation Moslem Yasmin, Haque Rejwanul, Way Andy
- Learning To Compress Prompts With Gist Tokens Mu Jesse, Li Xiang Lisa, Goodman Noah
- Can Llms Follow Simple Rules? Mu Norman, Chen Sarah, Wang Zifan, Chen Sizhe, Karamardian David, Aljeraisy Lulwa, Alomair Basel, Hendrycks Dan, Wagner David
- Navigating Prompt Complexity For Zero-shot Classification: A Study Of Large Language Models In Computational Social Science Mu Yida, Wu Ben P., Thorne William, Robinson Ambrose, Aletras Nikolaos, Scarton Carolina, Bontcheva Kalina, Song Xingyi
- How To Plant Trees In Language Models: Data And Architectural Effects On The Emergence Of Syntactic Inductive Biases Mueller Aaron, Linzen Tal
- Assessing Translation Capabilities Of Large Language Models Involving English And Indian Languages Mujadia Vandan, Urlana Ashok, Bhaskar Yash, Pavani Penumalla Aditya, Shravya Kukkapalli, Krishnamurthy Parameswari, Sharma Dipti Misra
- Superhf: Supervised Iterative Learning From Human Feedback Mukobi Gabriel, Chatain Peter, Fong Su, Windesheim Robert, Kutyniok Gitta, Bhatia Kush, Alberti Silas
- A Brief History Of Prompt: Leveraging Language Models. (through Advanced Prompting) Muktadir Golam Md
- Evaluating The Effectiveness Of Retrieval-augmented Large Language Models In Scientific Document Reasoning Munikoti Sai, Acharya Anurag, Wagle Sridevi, Horawalavithana Sameera
- LLM4VV: Developing Llm-driven Testsuite For Compiler Validation Munley Christian, Jarmusch Aaron, Chandrasekaran Sunita
- Ai-assisted Code Authoring At Scale: Fine-tuning, Deploying, And Mixed Methods Evaluation Murali Vijayaraghavan, Maddila Chandra, Ahmad Imad, Bolin Michael, Cheng Daniel, Ghorbani Negar, Fernandez Renuka, Nagappan Nachiappan, Rigby Peter C.
- Teaching Probabilistic Logical Reasoning To Transformers Nafar Aliakbar, Venable Kristen Brent, Kordjamshidi Parisa
- Cheaply Evaluating Inference Efficiency Metrics For Autoregressive Transformer Apis Narayanan Deepak, Santhanam Keshav, Henderson Peter, Bommasani Rishi, Lee Tony, Liang Percy
- Reinforcement Replaces Supervision: Query Focused Summarization Using Deep Reinforcement Learning Nath Swaroop, Khadilkar Harshad, Bhattacharyya Pushpak
- Towards Probing Contact Center Large Language Models Nathan Varun, Kumar Ayush, Ingle Digvijay, Vepa Jithendra
- A Comprehensive Overview Of Large Language Models Naveed Humza, Khan Asad Ullah, Qiu Shi, Saqib Muhammad, Anwar Saeed, Usman Muhammad, Akhtar Naveed, Barnes Nick, Mian Ajmal
- Nanot5: A Pytorch Framework For Pre-training And Fine-tuning T5-style Models With Limited Resources Nawrot Piotr
- Vision Encoder-decoder Models For AI Coaching Nayak Jyothi S, Khan Afifah Khan Mohammed Ajmal, Manjeshwar Chirag, Banday Imadh Ajaz
- On Robustness Of Finetuned Transformer-based NLP Models Neerudu Pavan Kalyan Reddy, Oota Subba Reddy, Marreddy Mounika, Kagita Venkateswara Rao, Gupta Manish
- Exploring The Maze Of Multilingual Modeling Nezhad Sina Bagheri, Agrawal Ameeta
- Simplyretrieve: A Private And Lightweight Retrieval-centric Generative AI Tool Ng Youyang, Miyashita Daisuke, Hoshi Yasuto, Morioka Yasuhiro, Torii Osamu, Kodama Tomoya, Deguchi Jun
- Multi-dimensional Data Refining Strategy For Effective Fine-tuning Llms Ngoc Thanh Nguyen, Tran Quang Nhat, Tang Arthur, Nguyen Bao, Nguyen Thuy, Pham Thanh
- Meet In The Middle: A New Pre-training Paradigm Nguyen Anh, Karampatziakis Nikos, Chen Weizhu
- Black-box Analysis: Gpts Across Time In Legal Textual Entailment Task Nguyen Ha-thanh, Goebel Randy, Toni Francesca, Stathis Kostas, Satoh Ken
- EVJVQA Challenge: Multilingual Visual Question Answering Nguyen Ngan Luu-thuy, Nguyen Nghia Hieu, Vo Duong T. D, Tran Khanh Quoc, Van Nguyen Kiet
- Democratizing Llms For Low-resource Languages By Leveraging Their English Dominant Abilities With Linguistically-diverse Prompts Nguyen Xuan-phi, Aljunied Sharifah Mahani, Joty Shafiq, Bing Lidong
- L2ceval: Evaluating Language-to-code Generation Capabilities Of Large Language Models Ni Ansong, Yin Pengcheng, Zhao Yilun, Riddell Martin, Feng Troy, Shen Rui, Yin Stephen, Liu Ye, Yavuz Semih, Xiong Caiming, Joty Shafiq, Zhou Yingbo, Radev Dragomir, Cohan Arman
- Forgetting Before Learning: Utilizing Parametric Arithmetic For Knowledge Updating In Large Language Models Ni Shiwen, Chen Dingwei, Li Chengming, Hu Xiping, Xu Ruifeng, Yang Min
- An Experimental Study: Assessing The Combined Framework Of Wavlm And BEST-RQ For Text-to-speech Synthesis Nielson Via, Hillis Steven
- Codegen2: Lessons For Training Llms On Programming And Natural Languages Nijkamp Erik, Hayashi Hiroaki, Xiong Caiming, Savarese Silvio, Zhou Yingbo
- LEXTREME: A Multi-lingual And Multi-task Benchmark For The Legal Domain Niklaus Joel, Matoshi Veton, Rani Pooja, Galassi Andrea, Stürmer Matthias, Chalkidis Ilias
- Reflexion: Language Agents With Verbal Reinforcement Learning Noah Shinn, Federico Cassano, Edward Berman, Ashwin Gopinath, Karthik Narasimhan, Shunyu Yao
- Evaluating AI Vocational Skills Through Professional Testing Noever David, Ciolino Matt
- Numeracy From Literacy: Data Science As An Emergent Skill From Large Language Models Noever David, Mckee Forrest
- Adversarial Fine-tuning Of Language Models: An Iterative Optimisation Approach For The Generation And Detection Of Problematic Content O'neill Charles, Miller Jack, Ciuca Ioana, Ting Yuan-sen, Bui Thang
- In-contextual Gender Bias Suppression For Large Language Models Oba Daisuke, Kaneko Masahiro, Bollegala Danushka
- Blackvip: Black-box Visual Prompting For Robust Transfer Learning Oh Changdae, Hwang Hyeji, Lee Hee-young, Lim Yongtaek, Jung Geunyoung, Jung Jiyoung, Choi Hosik, Song Kyungwoo
- Automated Assessment Of Students' Code Comprehension Using Llms Oli Priti, Banjade Rabin, Chapagain Jeevan, Rus Vasile
- GPT-4 Technical Report Openai, Josh Achiam, Steven Adler, Sandhini Agarwal, Lama Ahmad, Ilge Akkaya, Florencia Leoni Aleman, Diogo Almeida, Janko Altenschmidt, Sam Altman, Shyamal Anadkat, Red Avila, Igor Babuschkin, Suchir Balaji, Valerie Balcom, Paul Baltescu, Haiming Bao, Mohammad Bavarian, Jeff Belgum, Irwan Bello, Jake Berdine, Gabriel Bernadett-shapiro, Christopher Berner, Lenny Bogdonoff, Oleg Boiko, Madelaine Boyd, Anna-luisa Brakman, Greg Brockman, Tim Brooks, Miles Brundage, Kevin Button, Trevor Cai, Rosie Campbell, Andrew Cann, Brittany Carey, Chelsea Carlson, Rory Carmichael, Brooke Chan, Che Chang, Fotis Chantzis, Derek Chen, Sully Chen, Ruby Chen, Jason Chen, Mark Chen, Ben Chess, Chester Cho, Casey Chu, Hyung Won Chung, Dave Cummings, Jeremiah Currier, Yunxing Dai, Cory Decareaux, Thomas Degry, Noah Deutsch, Damien Deville, Arka Dhar, David Dohan, Steve Dowling, Sheila Dunning, Adrien Ecoffet, Atty Eleti, Tyna Eloundou, David Farhi, Liam Fedus, Niko Felix, Simón Posada Fishman, Juston Forte, Isabella Fulford, Leo Gao, Elie Georges, Christian Gibson, Vik Goel, Tarun Gogineni, Gabriel Goh, Rapha Gontijo-lopes, Jonathan Gordon, Morgan Grafstein, Scott Gray, Ryan Greene, Joshua Gross, Shixiang Shane Gu, Yufei Guo, Chris Hallacy, Jesse Han, Jeff Harris, Yuchen He, Mike Heaton, Johannes Heidecke, Chris Hesse, Alan Hickey, Wade Hickey, Peter Hoeschele, Brandon Houghton, Kenny Hsu, Shengli Hu, Xin Hu, Joost Huizinga, Shantanu Jain, Shawn Jain, Joanne Jang, Angela Jiang, Roger Jiang, Haozhun Jin, Denny Jin, Shino Jomoto, Billie Jonn, Heewoo Jun, Tomer Kaftan, Łukasz Kaiser, Ali Kamali, Ingmar Kanitscheider, Nitish Shirish Keskar, Tabarak Khan, Logan Kilpatrick, Jong Wook Kim, Christina Kim, Yongjik Kim, Jan Hendrik Kirchner, Jamie Kiros, Matt Knight, Daniel Kokotajlo, Łukasz Kondraciuk, Andrew Kondrich, Aris Konstantinidis, Kyle Kosic, Gretchen Krueger, Vishal Kuo, Michael Lampe, Ikai Lan, Teddy Lee, Jan Leike, Jade Leung, Daniel Levy, Chak Ming Li, Rachel Lim, Molly Lin, Stephanie Lin, Mateusz Litwin, Theresa Lopez, Ryan Lowe, Patricia Lue, Anna Makanju, Kim Malfacini, Sam Manning, Todor Markov, Yaniv Markovski, Bianca Martin, Katie Mayer, Andrew Mayne, Bob Mcgrew, Scott Mayer Mckinney, Christine Mcleavey, Paul Mcmillan, Jake Mcneil, David Medina, Aalok Mehta, Jacob Menick, Luke Metz, Andrey Mishchenko, Pamela Mishkin, Vinnie Monaco, Evan Morikawa, Daniel Mossing, Tong Mu, Mira Murati, Oleg Murk, David Mély, Ashvin Nair, Reiichiro Nakano, Rajeev Nayak, Arvind Neelakantan, Richard Ngo, Hyeonwoo Noh, Long Ouyang, Cullen O'keefe, Jakub Pachocki, Alex Paino, Joe Palermo, Ashley Pantuliano, Giambattista Parascandolo, Joel Parish, Emy Parparita, Alex Passos, Mikhail Pavlov, Andrew Peng, Adam Perelman, Filipe De Avila Belbute Peres, Michael Petrov, Henrique Ponde De Oliveira Pinto, Michael Pokorny, Michelle Pokrass, Vitchyr H. Pong, Tolly Powell, Alethea Power, Boris Power, Elizabeth Proehl, Raul Puri, Alec Radford, Jack Rae, Aditya Ramesh, Cameron Raymond, Francis Real, Kendra Rimbach, Carl Ross, Bob Rotsted, Henri Roussez, Nick Ryder, Mario Saltarelli, Ted Sanders, Shibani Santurkar, Girish Sastry, Heather Schmidt, David Schnurr, John Schulman, Daniel Selsam, Kyla Sheppard, Toki Sherbakov, Jessica Shieh, Sarah Shoker, Pranav Shyam, Szymon Sidor, Eric Sigler, Maddie Simens, Jordan Sitkin, Katarina Slama, Ian Sohl, Benjamin Sokolowsky, Yang Song, Natalie Staudacher, Felipe Petroski Such, Natalie Summers, Ilya Sutskever, Jie Tang, Nikolas Tezak, Madeleine B. Thompson, Phil Tillet, Amin Tootoonchian, Elizabeth Tseng, Preston Tuggle, Nick Turley, Jerry Tworek, Juan Felipe Cerón Uribe, Andrea Vallone, Arun Vijayvergiya, Chelsea Voss, Carroll Wainwright, Justin Jay Wang, Alvin Wang, Ben Wang, Jonathan Ward, Jason Wei, Cj Weinmann, Akila Welihinda, Peter Welinder, Jiayi Weng, Lilian Weng, Matt Wiethoff, Dave Willner, Clemens Winter, Samuel Wolrich, Hannah Wong, Lauren Workman, Sherwin Wu, Jeff Wu, Michael Wu, Kai Xiao, Tao Xu, Sarah Yoo, Kevin Yu, Qiming Yuan, Wojciech Zaremba, Rowan Zellers, Chong Zhang, Marvin Zhang, Shengjia Zhao, Tianhao Zheng, Juntang Zhuang, William Zhuk, Barret Zoph
- Fine-tuning Or Retrieval? Comparing Knowledge Injection In Llms Ovadia Oded, Brief Menachem, Mishaeli Moshik, Elisha Oren
- Several Categories Of Large Language Models (llms): A Short Survey Pahune Saurabh, Chandrasekharan Manoj
- Evaluating And Explaining Large Language Models For Code Using Syntactic Structures Palacio David N, Velasco Alejandro, Rodriguez-cardenas Daniel, Moran Kevin, Poshyvanyk Denys
- Understanding The Capabilities Of Large Language Models For Automated Planning Pallagani Vishal, Muppasani Bharath, Murugesan Keerthiram, Rossi Francesca, Srivastava Biplav, Horesh Lior, Fabiano Francesco, Loreggia Andrea
- Finding And Editing Multi-modal Neurons In Pre-trained Transformers Pan Haowen, Cao Yixin, Wang Xiaozhi, Yang Xun, Wang Meng
- Stelocoder: A Decoder-only LLM For Multi-language To Python Code Translation Pan Jialing, Sadé Adrien, Kim Jin, Soriano Eric, Sole Guillem, Flamant Sylvain
- Retrieving-to-answer: Zero-shot Video Question Answering With Frozen Large Language Models Pan Junting, Lin Ziyi, Ge Yuying, Zhu Xiatian, Zhang Renrui, Wang Yi, Qiao Yu, Li Hongsheng
- Tppoet: Transformer-based Persian Poem Generation Using Minimal Data And Advanced Decoding Techniques Panahandeh Amir, Asemi Hanie, Nourani Esmaeil
- Peft-medaware: Large Language Model For Medical Awareness Pandya Keivalya
- Automating Customer Service Using Langchain: Building Custom Open-source GPT Chatbot For Organizations Pandya Keivalya, Holia Mehfuza
- Language Model Self-improvement By Reinforcement Learning Contemplation Pang Jing-cheng, Wang Pengyuan, Li Kaiyuan, Chen Xiong-hui, Xu Jiacheng, Zhang Zongzhang, Yu Yang
- Frozen Transformers In Language Models Are Effective Visual Encoder Layers Pang Ziqi, Xie Ziyang, Man Yunze, Wang Yu-xiong
- Trainable Transformer In Transformer Panigrahi Abhishek, Malladi Sadhika, Xia Mengzhou, Arora Sanjeev
- Injecting Structural Hints: Using Language Models To Study Inductive Biases In Language Learning Papadimitriou Isabel, Jurafsky Dan
- Interpretable Visual Question Answering Via Reasoning Supervision Parelli Maria, Mallis Dimitrios, Diomataris Markos, Pitsikalis Vassilis
- On The Analysis Of Cross-lingual Prompt Tuning For Decoder-based Multilingual Model Park Nohil, Park Joonsuk, Yoo Kang Min, Yoon Sungroh
- A Large Language Model Approach To Educational Survey Feedback Analysis Parker Michael J., Anderson Caitlin, Stone Claire, Oh Yearim
- Longbox: Evaluating Transformers On Long-sequence Clinical Tasks Parmar Mihir, Naik Aakanksha, Gupta Himanshu, Agrawal Disha, Baral Chitta
- ECLIPSE: A Resource-efficient Text-to-image Prior For Image Generations Patel Maitreya, Kim Changhoon, Cheng Sheng, Baral Chitta, Yang Yezhou
- Entity Matching Using Large Language Models Peeters Ralph, Bizer Christian
- GPT Self-supervision For A Better Data Annotator Pei Xiaohuan, Li Yanxi, Xu Chang
- Evaluation Of Medium-large Language Models At Zero-shot Closed Book Generative Question Answering Peinl René, Wirth Johannes
- Exploiting Novel GPT-4 Apis Pelrine Kellin, Taufeeque Mohammad, Zając Michał, Mclean Euan, Gleave Adam
- Yarn: Efficient Context Window Extension Of Large Language Models Peng Bowen, Quesnelle Jeffrey, Fan Honglu, Shippole Enrico
- Generative Large Language Models Are All-purpose Text Analytics Engines: Text-to-text Learning Is All Your Need Peng Cheng, Yang Xi, Chen Aokun, Yu Zehao, Smith Kaleb E, Costa Anthony B, Flores Mona G, Bian Jiang, Wu Yonghui
- When Does In-context Learning Fall Short And Why? A Study On Specification-heavy Tasks Peng Hao, Wang Xiaozhi, Chen Jianhui, Li Weikai, Qi Yunjia, Wang Zimu, Wu Zhili, Zeng Kaisheng, Xu Bin, Hou Lei, Li Juanzi
- FP8-LM: Training FP8 Large Language Models Peng Houwen, Wu Kan, Wei Yixuan, Zhao Guoshuai, Yang Yuxiang, Liu Ze, Xiong Yifan, Yang Ziyue, Ni Bolin, Hu Jingcheng, Li Ruihang, Zhang Miaosen, Li Chen, Ning Jia, Wang Ruizhe, Zhang Zheng, Liu Shuguang, Chau Joe, Hu Han, Cheng Peng
- Constitutionmaker: Interactively Critiquing Large Language Models By Converting Feedback Into Principles Petridis Savvas, Wedin Ben, Wexler James, Donsbach Aaron, Pushkarna Mahima, Goyal Nitesh, Cai Carrie J., Terry Michael
- When Do Prompting And Prefix-tuning Work? A Theory Of Capabilities And Limitations Petrov Aleksandar, Torr Philip H. S., Bibi Adel
- Recjpq: Training Large-catalogue Sequential Recommenders Petrov Aleksandr V., Macdonald Craig
- The Impact Of Depth On Compositional Generalization In Transformer Language Models Petty Jackson, Van Steenkiste Sjoerd, Dasgupta Ishita, Sha Fei, Garrette Dan, Linzen Tal
- Let Models Speak Ciphers: Multiagent Debate Through Embeddings Pham Chau, Liu Boyi, Yang Yingxiang, Chen Zhengyu, Liu Tianyi, Yuan Jianbo, Plummer Bryan A., Wang Zhaoran, Yang Hongxia
- Training Chain-of-thought Via Latent-variable Inference Phan Du, Hoffman Matthew D., Dohan David, Douglas Sholto, Le Tuan Anh, Parisi Aaron, Sountsov Pavel, Sutton Charles, Vikram Sharad, Saurous Rif A.
- Zero-shot Question Answering Over Financial Documents Using Large Language Models Phogat Karmvir Singh, Harsha Chetan, Dasaratha Sridhar, Ramakrishna Shashishekar, Puranam Sai Akhil
- LLM Self Defense: By Self Examination, Llms Know They Are Being Tricked Phute Mansi, Helbling Alec, Hull Matthew, Peng Shengyun, Szyller Sebastian, Cornelius Cory, Chau Duen Horng
- Typhoon: Thai Large Language Models Pipatanakul Kunat, Jirabovonvisut Phatrasek, Manakul Potsawee, Sripaisarnmongkol Sittipong, Patomwong Ruangsak, Chokchainant Pathomporn, Tharnpipitchai Kasima
- Sabiá: Portuguese Large Language Models Pires Ramon, Abonizio Hugo, Almeida Thales Sales, Nogueira Rodrigo
- Hyena Hierarchy: Towards Larger Convolutional Language Models Poli Michael, Massaroli Stefano, Nguyen Eric, Fu Daniel Y., Dao Tri, Baccus Stephen, Bengio Yoshua, Ermon Stefano, Ré Christopher
- Mosaicbert: A Bidirectional Encoder Optimized For Fast Pretraining Portes Jacob, Trott Alex, Havens Sam, King Daniel, Venigalla Abhinav, Nadeem Moin, Sardana Nikhil, Khudia Daya, Frankle Jonathan
- Transformers In The Service Of Description Logic-based Contexts Poulis Angelos, Tsalapati Eleni, Koubarakis Manolis
- Painter: Teaching Auto-regressive Language Models To Draw Sketches Pourreza Reza, Bhattacharyya Apratim, Panchal Sunny, Lee Mingu, Madan Pulkit, Memisevic Roland
- Egovlpv2: Egocentric Video-language Pre-training With Fusion In The Backbone Pramanick Shraman, Song Yale, Nag Sayan, Lin Kevin Qinghong, Shah Hardik, Shou Mike Zheng, Chellappa Rama, Zhang Pengchuan
- Mini Minds: Exploring Bebeshka And Zlata Baby Models Proskurina Irina, Metzler Guillaume, Velcin Julien
- Fine-tuning Aligned Language Models Compromises Safety, Even When Users Do Not Intend To! Qi Xiangyu, Zeng Yi, Xie Tinghao, Chen Pin-yu, Jia Ruoxi, Mittal Prateek, Henderson Peter
- PILLOW: Enhancing Efficient Instruction Fine-tuning Via Prompt Matching Qi Zhenting, Tan Xiaoyu, Shi Shaojie, Qu Chao, Xu Yinghui, Qi Yuan
- ONCE: Boosting Content-based Recommendation With Both Open- And Closed-source Large Language Models Qijiong Liu, Nuo Chen, Tetsuya Sakai, Xiao-ming Wu
- Enabling On-device Large Language Model Personalization With Self-supervised Data Selection And Synthesis Qin Ruiyang, Xia Jun, Jia Zhenge, Jiang Meng, Abbasi Ahmed, Zhou Peipei, Hu Jingtong, Shi Yiyu
- Federated Full-parameter Tuning Of Billion-sized Language Models With Communication Cost Under 18 Kilobytes Qin Zhen, Chen Daoyuan, Qian Bingchen, Ding Bolin, Li Yaliang, Deng Shuiguang
- Transnormerllm: A Faster And Better Large Language Model With Improved Transnormer Qin Zhen, Li Dong, Sun Weigao, Sun Weixuan, Shen Xuyang, Han Xiaodong, Wei Yunshen, Lv Baohong, Luo Xiao, Qiao Yu, Zhong Yiran
- Adalora: Adaptive Budget Allocation For Parameter-efficient Fine-tuning Qingru Zhang, Minshuo Chen, Alexander Bukharin, Nikos Karampatziakis, Pengcheng He, Yu Cheng, Weizhu Chen, Tuo Zhao
- Embodied Executable Policy Learning With Language-based Scene Summarization Qiu Jielin, Xu Mengdi, Han William, Moon Seungwhan, Zhao Ding
- Controlrec: Bridging The Semantic Gap Between Language Model And Personalized Recommendation Qiu Junyan, Wang Haitao, Hong Zhaolin, Yang Yiping, Liu Qiang, Wang Xingxing
- Unlocking Emergent Modularity In Large Language Models Qiu Zihan, Huang Zeyu, Fu Jie
- Cmed-gpt: Prompt Tuning For Entity-aware Chinese Medical Dialogue Generation Qu Zhijie, Li Juan, Ma Zerui, Li Jianqiang
- Direct Preference Optimization: Your Language Model Is Secretly A Reward Model Rafael Rafailov, Archit Sharma, Eric Mitchell, Stefano Ermon, Christopher D. Manning, Chelsea Finn
- Exploring The Effectiveness Of GPT Models In Test-taking: A Case Study Of The Driver's License Knowledge Test Rahimi Saba, Balch Tucker, Veloso Manuela
- Quantized Transformer Language Model Implementations On Edge Devices Rahman Mohammad Wali Ur, Abrar Murad Mehrab, Copening Hunter Gibbons, Hariri Salim, Shao Sicong, Satam Pratik, Salehi Soheil
- Improving Generalization In Task-oriented Dialogues With Workflows And Action Plans Raimondo Stefania, Pal Christopher, Liu Xiaotian, Vazquez David, Palacios Hector
- Bridging The Gap: Exploring The Capabilities Of Bridge-architectures For Complex Visual Reasoning Tasks Rajesh Kousik, Raman Mrigank, Karim Mohammed Asad, Chawla Pranit
- Multi-task End-to-end Training Improves Conversational Recommendation Ram Naveen, Kuzmin Dima, Chio Ellie Ka In, Alzantot Moustafa Farid, Ontanon Santiago, Jash Ambarish, Li Judith Yue
- Mabvit -- Modified Attention Block Enhances Vision Transformers Ramesh Mahesh, Ramkumar Aswinkumar
- Controllable Generation Of Dialogue Acts For Dialogue Systems Via Few-shot Response Generation And Ranking Ramirez Angela, Agarwal Karik, Juraska Juraj, Garg Utkarsh, Walker Marilyn A.
- Tailoring Self-rationalizers With Multi-reward Distillation Ramnath Sahana, Joshi Brihi, Hallinan Skyler, Lu Ximing, Li Liunian Harold, Chan Aaron, Hessel Jack, Choi Yejin, Ren Xiang
- Retrieval-based Knowledge Augmented Vision Language Pre-training Rao Jiahua, Shan Zifei, Liu Longpo, Zhou Yao, Yang Yuedong
- Weakly Supervised Detection Of Hallucinations In LLM Activations Rateike Miriam, Cintas Celia, Wamburu John, Akumu Tanya, Speakman Skyler
- COLA: A Benchmark For Compositional Text-to-image Retrieval Ray Arijit, Radenovic Filip, Dubey Abhimanyu, Plummer Bryan A., Krishna Ranjay, Saenko Kate
- Dial Beinfo For Faithfulness: Improving Factuality Of Information-seeking Dialogue Via Behavioural Fine-tuning Razumovskaia Evgeniia, Vulić Ivan, Marković Pavle, Cichy Tomasz, Zheng Qian, Wen Tsung-hsien, Budzianowski Paweł
- Fine-tuning Language Models For Context-specific SQL Query Generation Rebei Amine
- Towards A Performance Analysis On Pre-trained Visual Question Answering Models For Autonomous Driving Rekanar Kaavya, Eising Ciarán, Sistu Ganesh, Hayes Martin
- Context Compression For Auto-regressive Transformers With Sentinel Tokens Ren Siyu, Jia Qi, Zhu Kenny Q.
- I'm Afraid I Can't Do That: Predicting Prompt Refusal In Black-box Generative Language Models Reuter Max, Schulze William
- Sparq Attention: Bandwidth-efficient LLM Inference Ribar Luka, Chelombiev Ivan, Hudlass-galley Luke, Blake Charlie, Luschi Carlo, Orr Douglas
- Commonsense Reasoning For Conversational AI: A Survey Of The State Of The Art Richardson Christopher, Heck Larry
- Carexpert: Leveraging Large Language Models For In-car Conversational Question Answering Rony Md Rashad Al Hasan, Suess Christian, Bhat Sinchana Ramakanth, Sudhi Viju, Schneider Julia, Vogel Maximilian, Teucher Roman, Friedl Ken E., Sahoo Soumya
- LLM As An Art Director (ladi): Using Llms To Improve Text-to-media Generators Roush Allen, Zakirov Emil, Shirokov Artemiy, Lunina Polina, Gane Jack, Duffy Alexander, Basil Charlie, Whitcomb Aber, Benedetto Jim, Dewolfe Chris
- Knowledge-infused Self Attention Transformers Roy Kaushik, Zi Yuxin, Narayanan Vignesh, Gaur Manas, Sheth Amit
- Retrieval-pretrained Transformer: Long-range Language Modeling With Self-retrieval Rubin Ohad, Berant Jonathan
- Outlier Dimensions Encode Task-specific Knowledge Rudman William, Chen Catherine, Eickhoff Carsten
- Gpt4tools: Teaching Large Language Model To Use Tools Via Self-instruction Rui Yang, Lin Song, Yanwei Li, Sijie Zhao, Yixiao Ge, Xiu Li, Ying Shan
- Attention-likelihood Relationship In Transformers Ruscio Valeria, Maiorca Valentino, Silvestri Fabrizio
- Plausibility Processing In Transformer Language Models: Focusing On The Role Of Attention Heads In GPT Ryu Soo Hyun
- Naturalness Of Attention: Revisiting Attention In Code Language Models Saad Mootez, Sharma Tushar
- Uncertainty Guided Global Memory Improves Multi-hop Question Answering Sagirova Alsu, Burtsev Mikhail
- Memory Injections: Correcting Multi-hop Reasoning Failures During Inference In Transformer-based Language Models Sakarvadia Mansi, Ajith Aswathy, Khan Arham, Grzenda Daniel, Hudson Nathaniel, Bauer André, Chard Kyle, Foster Ian
- Attention Lens: A Tool For Mechanistically Interpreting The Attention Head Information Retrieval Mechanism Sakarvadia Mansi, Khan Arham, Ajith Aswathy, Grzenda Daniel, Hudson Nathaniel, Bauer André, Chard Kyle, Foster Ian
- Extending The Frontier Of Chatgpt: Code Generation And Debugging Sakib Fardin Ahsan, Khan Saadat Hasan, Karim A. H. M. Rezaul
- UT5: Pretraining Non Autoregressive T5 With Unrolled Denoising Salem Mahmoud G., Ye Jiayu, Lin Chu-cheng, Liu Frederick
- Zero-shot Translation Of Attention Patterns In VQA Models To Natural Language Salewski Leonard, Koepke A. Sophia, Lensch Hendrik P. A., Akata Zeynep
- Efficient RLHF: Reducing The Memory Usage Of PPO Santacroce Michael, Lu Yadong, Yu Han, Li Yuanzhi, Shen Yelong
- Testing The Limits Of Unified Sequence To Sequence LLM Pretraining On Diverse Table Data Tasks Sarkar Soumajyoti, Lausen Leonard
- Inseq: An Interpretability Toolkit For Sequence Generation Models Sarti Gabriele, Feldhus Nils, Sickert Ludwig, Van Der Wal Oskar, Nissim Malvina, Bisazza Arianna
- Interprompt: Interpretable Prompting For Interrelated Interpersonal Risk Factors In Reddit Posts Sathvik Msvpj, Sarkar Surjodeep, Saxena Chandni, Sohn Sunghwan, Garg Muskan
- GIFT: Generative Interpretable Fine-tuning Savadikar Chinmay, Song Xi, Wu Tianfu
- Unlocking Practical Applications In Legal Domain: Evaluation Of GPT For Zero-shot Semantic Annotation Of Legal Texts Savelka Jaromir
- Large Language Models (GPT) Struggle To Answer Multiple-choice Questions About Code Savelka Jaromir, Agarwal Arav, Bogart Christopher, Sakr Majd
- Can GPT-4 Support Analysis Of Textual Data In Tasks Requiring Highly Specialized Domain Expertise? Savelka Jaromir, Ashley Kevin D., Gray Morgan A, Westermann Hannes, Xu Huihui
- Pretraining On The Test Set Is All You Need Schaeffer Rylan
- Probing Conceptual Understanding Of Large Visual-language Models Schiappa Madeline, Abdullah Raiyaan, Azad Shehreen, Claypoole Jared, Cogswell Michael, Divakaran Ajay, Rawat Yogesh
- Surgicalgpt: End-to-end Language-vision GPT For Visual Question Answering In Surgery Seenivasan Lalithkumar, Islam Mobarakol, Kannan Gokul, Ren Hongliang
- The Cot Collection: Improving Zero-shot And Few-shot Learning Of Language Models Via Chain-of-thought Fine-tuning Seungone Kim, Se June Joo, Doyoung Kim, Joel Jang, Seonghyeon Ye, Jamin Shin, Minjoon Seo
- Let's Have A Chat! A Conversation With Chatgpt: Technology, Applications, And Limitations Shahriar Sakib, Hayawi Kadhim
- Hero: Roberta And Longformer Hebrew Language Models Shalumov Vitaly, Haskey Harel
- Towards Concept-aware Large Language Models Shani Chen, Vreeken Jilles, Shahaf Dafna
- One-shot Sensitivity-aware Mixed Sparsity Pruning For Large Language Models Shao Hang, Liu Bei, Xiao Bo, Zeng Ke, Wan Guanglu, Qian Yanmin
- The Truth Is In There: Improving Reasoning In Language Models With Layer-selective Rank Reduction Sharma Pratyusha, Ash Jordan T., Misra Dipendra
- Exploring The Factual Consistency In Dialogue Comprehension Of Large Language Models She Shuaijie, Huang Shujian, Wang Xingyun, Zhou Yanke, Chen Jiajun
- Pangu-coder2: Boosting Large Language Models For Code With Ranking Feedback Shen Bo, Zhang Jiaxin, Chen Taihong, Zan Daoguang, Geng Bing, Fu An, Zeng Muhan, Yu Ailun, Ji Jichuan, Zhao Jingyang, Guo Yuenan, Wang Qianxiang
- Efficient LLM Inference On Cpus Shen Haihao, Chang Hanwen, Dong Bo, Luo Yu, Meng Hengyu
- Do Pretrained Transformers Learn In-context By Gradient Descent? Shen Lingfeng, Mishra Aayush, Khashabi Daniel
- Improving The Robustness Of Transformer-based Large Language Models With Dynamic Attention Shen Lujia, Pu Yuwen, Ji Shouling, Li Changjiang, Zhang Xuhong, Ge Chunpeng, Wang Ting
- Towards More Unified In-context Visual Understanding Sheng Dianmo, Chen Dongdong, Tan Zhentao, Liu Qiankun, Chu Qi, Bao Jianmin, Gong Tao, Liu Bin, Xu Shengwei, Yu Nenghai
- TOAST: Transfer Learning Via Attention Steering Shi Baifeng, Gai Siyu, Darrell Trevor, Wang Xin
- Crossget: Cross-guided Ensemble Of Tokens For Accelerating Vision-language Transformers Shi Dachuan, Tao Chaofan, Rao Anyi, Yang Zhendong, Yuan Chun, Wang Jiaqi
- Sotana: The Open-source Software Development Assistant Shi Ensheng, Zhang Fengji, Wang Yanlin, Chen Bei, Du Lun, Zhang Hongyu, Han Shi, Zhang Dongmei, Sun Hongbin
- Badgpt: Exploring Security Vulnerabilities Of Chatgpt Via Backdoor Attacks To Instructgpt Shi Jiawen, Liu Yixin, Zhou Pan, Sun Lichao
- Unleashing The Power Of Pre-trained Language Models For Offline Reinforcement Learning Shi Ruizhe, Liu Yuyao, Ze Yanjie, Du Simon S., Xu Huazhe
- Preliminary Study On Incremental Learning For Large Language Model-based Recommender Systems Shi Tianhao, Zhang Yang, Xu Zhijian, Chen Chong, Feng Fuli, He Xiangnan, Tian Qi
- MKRAG: Medical Knowledge Retrieval Augmented Generation For Medical Question Answering Shi Yucheng, Xu Shaochen, Yang Tianze, Liu Zhengliang, Liu Tianming, Li Quanzheng, Li Xiang, Liu Ninghao
- Dept: Decomposed Prompt Tuning For Parameter-efficient Fine-tuning Shi Zhengxiang, Lipani Aldo
- Exploring The Robustness Of Large Language Models For Solving Programming Problems Shirafuji Atsushi, Watanobe Yutaka, Ito Takumi, Morishita Makoto, Nakamura Yuki, Oda Yusuke, Suzuki Jun
- L3 Ensembles: Lifelong Learning Approach For Ensemble Of Foundational Language Models Shiri Aidin, Roy Kaushik, Sheth Amit, Gaur Manas
- Introducing Dictalm -- A Large Generative Language Model For Modern Hebrew Shmidman Shaltiel, Shmidman Avi, Cohen Amir David Nissan, Koppel Moshe
- The ART Of LLM Refinement: Ask, Refine, And Trust Shridhar Kumar, Sinha Koustuv, Cohen Andrew, Wang Tianlu, Yu Ping, Pasunuru Ram, Sachan Mrinmaya, Weston Jason, Celikyilmaz Asli
- Ep-alm: Efficient Perceptual Augmentation Of Language Models Shukor Mustafa, Dancette Corentin, Cord Matthieu
- Large Language Models For Aspect-based Sentiment Analysis Simmering Paul F., Huoviala Paavo
- Gopro: Generate And Optimize Prompts In CLIP Using Self-supervised Learning Singha Mainak, Jha Ankit, Banerjee Biplab
- EEL: Efficiently Encoding Lattices For Reranking Singhal Prasann, Xu Jiacheng, Ye Xi, Durrett Greg
- A Mathematical Abstraction For Balancing The Trade-off Between Creativity And Reality In Large Language Models Sinha Ritwik, Song Zhao, Zhou Tianyi
- Decoding Chatgpt: A Taxonomy Of Existing Research, Current Challenges, And Possible Future Directions Sohail Shahab Saquib, Farhat Faiza, Himeur Yassine, Nadeem Mohammad, Madsen Dag Øivind, Singh Yashbir, Atalla Shadi, Mansoor Wathiq
- SPARSEFIT: Few-shot Prompting With Sparse Fine-tuning For Jointly Generating Predictions And Natural Language Explanations Solano Jesus, Sanni Mardhiyah, Camburu Oana-maria, Minervini Pasquale
- Biomedical Knowledge Graph-optimized Prompt Generation For Large Language Models Soman Karthik, Rose Peter W, Morris John H, Akbas Rabia E, Smith Brett, Peetoom Braian, Villouta-reyes Catalina, Cerono Gabriel, Shi Yongmei, Rizk-jackson Angela, Israni Sharat, Nelson Charlotte A, Huang Sui, Baranzini Sergio E
- Zebra: Extending Context Window With Layerwise Grouped Local-global Attention Song Kaiqiang, Wang Xiaoyang, Cho Sangwoo, Pan Xiaoman, Yu Dong
- Pneumollm: Harnessing The Power Of Large Language Model For Pneumoconiosis Diagnosis Song Meiyue, Yu Zhihua, Wang Jiaxin, Wang Jiarui, Lu Yuting, Li Baicun, Wang Xiaoxu, Huang Qinghua, Li Zhijun, Kanellakis Nikolaos I., Liu Jiangfeng, Wang Jing, Wang Binglu, Yang Juntao
- Large Language Models Meet Open-world Intent Discovery And Recognition: An Evaluation Of Chatgpt Song Xiaoshuai, He Keqing, Wang Pei, Dong Guanting, Mou Yutao, Wang Jingang, Xian Yunsen, Cai Xunliang, Xu Weiran
- Learning Ui-to-code Reverse Generator Using Visual Critic Without Rendering Soselia Davit, Saifullah Khalid, Zhou Tianyi
- Dynamic Context Pruning For Efficient And Interpretable Autoregressive Transformers Sotiris Anagnostidis, Dario Pavllo, Luca Biggio, Lorenzo Noci, Aurelien Lucchi, Thomas Hofmann
- Curriculum-guided Abstractive Summarization Sotudeh Sajad, Deilamsalehy Hanieh, Dernoncourt Franck, Goharian Nazli
- Harnessing Llms In Curricular Design: Using GPT-4 To Support Authoring Of Learning Objectives Sridhar Pragnya, Doyle Aidan, Agarwal Arav, Bogart Christopher, Savelka Jaromir, Sakr Majd
- Chatgpt: A Study On Its Utility For Ubiquitous Software Engineering Tasks Sridhara Giriprasad, G. Ranjani H., Mazumdar Sourav
- Response-act Guided Reinforced Dialogue Generation For Mental Health Counseling Srivastava Aseem, Pandey Ishan, Akhtar Md. Shad, Chakraborty Tanmoy
- Structured Packing In LLM Training Improves Long Context Utilization Staniszewski Konrad, Tworkowski Szymon, Jaszczur Sebastian, Zhao Yu, Michalewski Henryk, Kuciński Łukasz, Miłoś Piotr
- Probing Llms For Joint Encoding Of Linguistic Categories Starace Giulio, Papakostas Konstantinos, Choenni Rochelle, Panagiotopoulos Apostolos, Rosati Matteo, Leidinger Alina, Shutova Ekaterina
- A Mechanistic Interpretation Of Arithmetic Reasoning In Language Models Using Causal Mediation Analysis Stolfo Alessandro, Belinkov Yonatan, Sachan Mrinmaya
- Learning From Red Teaming: Gender Bias Provocation And Mitigation In Large Language Models Su Hsuan, Cheng Cheng-chu, Farn Hua, Kumar Shachi H, Sahay Saurav, Chen Shang-tse, Lee Hung-yi
- F3-pruning: A Training-free And Generalized Pruning Strategy Towards Faster And Finer Text-to-video Synthesis Su Sitong, Liu Jianzhi, Gao Lianli, Song Jingkuan
- Dial-mae: Contextual Masked Auto-encoder For Retrieval-based Dialogue Systems Su Zhenpeng, Wu Xing, Zhou Wei, Ma Guangyuan, Hu Songlin
- Language Model-in-the-loop: Data Optimal Approach To Learn-to-recommend Actions In Text Games Sudhakar Arjun Vaithilingam, Parthasarathi Prasanna, Rajendran Janarthanan, Chandar Sarath
- RCMHA: Relative Convolutional Multi-head Attention For Natural Language Modelling Sugiharto Herman, Aradea, Mubarok Husni
- Does Fine-tuning GPT-3 With The Openai API Leak Personally-identifiable Information? Sun Albert Yu, Zemour Eliott, Saxena Arushi, Vaidyanathan Udith, Lin Eric, Lau Christian, Mugunthan Vaikkunth
- Building High-accuracy Multilingual ASR With Gated Language Experts And Curriculum Training Sun Eric, Li Jinyu, Hu Yuxuan, Zhu Yimeng, Zhou Long, Xue Jian, Wang Peidong, Liu Linquan, Liu Shujie, Lin Edward, Gong Yifan
- Speech-based Slot Filling Using Large Language Models Sun Guangzhi, Feng Shutong, Jiang Dongcheng, Zhang Chao, Gašić Milica, Woodland Philip C.
- Character-level Chinese Backpack Language Models Sun Hao, Hewitt John
- Evaluating The Zero-shot Robustness Of Instruction-tuned Language Models Sun Jiuding, Shaib Chantal, Wallace Byron C.
- Generative Multimodal Models Are In-context Learners Sun Quan, Cui Yufeng, Zhang Xiaosong, Zhang Fan, Yu Qiying, Luo Zhengxiong, Wang Yueze, Rao Yongming, Liu Jingjing, Huang Tiejun, Wang Xinlong
- Exploring The Impact Of Low-rank Adaptation On The Performance, Efficiency, And Regularization Of RLHF Sun Simeng, Gupta Dhawal, Iyyer Mohit
- A Prompt Learning Framework For Source Code Summarization Sun Weisong, Fang Chunrong, You Yudu, Chen Yuchen, Liu Yi, Wang Chong, Zhang Jian, Zhang Quanjun, Qian Hanwei, Zhao Wei, Liu Yang, Chen Zhenyu
- A Comparative Study Between Full-parameter And Lora-based Fine-tuning On Chinese Instruction Data For Instruction Following Large Language Model Sun Xianghui, Ji Yunjie, Ma Baochang, Li Xiangang
- Retentive Network: A Successor To Transformer For Large Language Models Sun Yutao, Dong Li, Huang Shaohan, Ma Shuming, Xia Yuqing, Xue Jilong, Wang Jianyong, Wei Furu
- Ctbls: Augmenting Large Language Models With Conversational Tables Sundar Anirudh S, Heck Larry
- Pre-training Intent-aware Encoders For Zero- And Few-shot Intent Classification Sung Mujeen, Gung James, Mansimov Elman, Pappas Nikolaos, Shu Raphael, Romeo Salvatore, Zhang Yi, Castelli Vittorio
- Improving Non-autoregressive Translation Quality With Pretrained Language Model, Embedding Distillation And Upsampling Strategy For CTC Syu Shen-sian, Xie Juncheng, Lee Hung-yi
- Imaginations Of WALL-E: Reconstructing Experiences With An Imagination-inspired Module For Advanced AI Systems Taghavi Zeinab Sadat, Gooran Soroush, Dalili Seyed Arshan, Amirzadeh Hamidreza, Nematbakhsh Mohammad Jalal, Sameti Hossein
- Training Generative Question-answering On Synthetic Data Obtained From An Instruct-tuned Model Takahashi Kosuke, Omi Takahiro, Arima Kosuke, Ishigaki Tatsuya
- Large Language Model-driven Classroom Flipping: Empowering Student-centric Peer Questioning With Flipped Interaction Tan Chee Wei
- Medchatzh: A Better Medical Adviser Learns From Better Instructions Tan Yang, Li Mingchen, Huang Zijie, Yu Huiqun, Fan Guisheng
- Multiple-question Multiple-answer Text-vqa Tang Peng, Appalaraju Srikar, Manmatha R., Xie Yusheng, Mahadevan Vijay
- Learning To Imagine: Visually-augmented Natural Language Generation Tang Tianyi, Chen Yushuo, Du Yifan, Li Junyi, Zhao Wayne Xin, Wen Ji-rong
- Struc-bench: Are Large Language Models Really Good At Generating Complex Structured Data? Tang Xiangru, Zong Yiming, Phang Jason, Zhao Yilun, Zhou Wangchunshu, Cohan Arman, Gerstein Mark
- Codi-2: In-context, Interleaved, And Interactive Any-to-any Generation Tang Zineng, Yang Ziyi, Khademi Mahmoud, Liu Yang, Zhu Chenguang, Bansal Mohit
- POUF: Prompt-oriented Unsupervised Fine-tuning For Large Pre-trained Models Tanwisuth Korawat, Zhang Shujian, Zheng Huangjie, He Pengcheng, Zhou Mingyuan
- Adversarial Conversational Shaping For Intelligent Agents Tarasiewicz Piotr, Kenjeyev Sultan, Sebag Ilana, Alshehabi Shehab
- Expanding Frozen Vision-language Models Without Retraining: Towards Improved Robot Perception Tavassoli Riley, Amani Mani, Akhavian Reza
- CLAMP: Contrastive Language Model Prompt-tuning Teterwak Piotr, Sun Ximeng, Plummer Bryan A., Saenko Kate, Lim Ser-nam
- Leveraging Llms For Synthesizing Training Data Across Many Languages In Multilingual Dense Retrieval Thakur Nandan, Ni Jianmo, Ábrego Gustavo Hernández, Wieting John, Lin Jimmy, Cer Daniel
- SPDF: Sparse Pre-training And Dense Fine-tuning For Large Language Models Thangarasa Vithursan, Gupta Abhay, Marshall William, Li Tianda, Leong Kevin, Decoste Dennis, Lie Sean, Saxena Shreyas
- Soft-prompt Tuning For Large Language Models To Evaluate Bias Tian Jacob-junqi, Emerson David, Miyandoab Sevil Zanjani, Pandya Deval, Seyyed-kalantari Laleh, Khattak Faiza Khan
- Chimed-gpt: A Chinese Medical Large Language Model With Full Training Regime And Better Alignment To Human Preferences Tian Yuanhe, Gan Ruyi, Song Yan, Zhang Jiaxing, Zhang Yongdong
- BOOST: Harnessing Black-box Control To Boost Commonsense In Lms' Generation Tian Yufei, Zhang Felix, Peng Nanyun
- Impact Of Tokenization On Llama Russian Adaptation Tikhomirov Mikhail, Chernyshev Daniil
- Fine-tune Language Models To Approximate Unbiased In-context Learning Timothy Chu, Zhao Song, Chiwun Yang
- D4: Improving LLM Pretraining Via Document De-duplication And Diversification Tirumala Kushal, Simig Daniel, Aghajanyan Armen, Morcos Ari S.
- Better Language Models Of Code Through Self-improvement To Hung Quoc, Bui Nghi D. Q., Guo Jin, Nguyen Tien N.
- Function Vectors In Large Language Models Todd Eric, Li Millicent L., Sharma Arnab Sen, Mueller Aaron, Wallace Byron C., Bau David
- Parameter Efficient Tuning Allows Scalable Personalization Of Llms For Text Entry: A Case Study On Abbreviation Expansion Tomanek Katrin, Cai Shanqing, Venugopalan Subhashini
- Videberta: A Powerful Pre-trained Language Model For Vietnamese Tran Cong Dao, Pham Nhut Huy, Nguyen Anh, Hy Truong Son, Vu Tu
- Bioinstruct: Instruction Tuning Of Large Language Models For Biomedical Natural Language Processing Tran Hieu, Yang Zhichao, Yao Zonghai, Yu Hong
- Viclevr: A Visual Reasoning Dataset And Hybrid Multimodal Fusion Model For Visual Question Answering In Vietnamese Tran Khiem Vinh, Phan Hao Phu, Van Nguyen Kiet, Nguyen Ngan Luu Thuy
- Bartphobeit: Pre-trained Sequence-to-sequence And Image Transformers Models For Vietnamese Visual Question Answering Tran Khiem Vinh, Van Nguyen Kiet, Nguyen Ngan Luu Thuy
- Efficiently Aligned Cross-lingual Transfer Learning For Conversational Tasks Using Prompt-tuning Tu Lifu, Qu Jin, Yavuz Semih, Joty Shafiq, Liu Wenhao, Xiong Caiming, Zhou Yingbo
- Automating Code-related Tasks Through Transformers: The Impact Of Pre-training Tufano Rosalia, Pascarella Luca, Bavota Gabriele
- Focused Transformer: Contrastive Training For Context Scaling Tworkowski Szymon, Staniszewski Konrad, Pacek Mikołaj, Wu Yuhuai, Michalewski Henryk, Miłoś Piotr
- Taco: Enhancing Cross-lingual Transfer For Low-resource Languages In Llms Through Translation-assisted Chain-of-thought Processes Upadhayay Bibek, Behzadan Vahid
- Mlongt5: A Multilingual And Efficient Text-to-text Transformer For Longer Sequences Uthus David, Ontañón Santiago, Ainslie Joshua, Guo Mandy
- In-context Ability Transfer For Question Decomposition In Complex QA V Venktesh, Bhattacharya Sourangshu, Anand Avishek
- On The Automatic Generation And Simplification Of Children's Stories Valentini Maria, Weber Jennifer, Salcido Jesus, Wright Téa, Colunga Eliana, Kann Katharina
- Mini-gpts: Efficient Large Language Models Through Contextual Pruning Valicenti Tim, Vidal Justice, Patnaik Ritik
- The LLM Surgeon Van Der Ouderaa Tycho F. A., Nagel Markus, Van Baalen Mart, Asano Yuki M., Blankevoort Tijmen
- Adventures Of Trustworthy Vision-language Models: A Survey Vatsa Mayank, Jain Anubhooti, Singh Richa
- Small Language Models Improve Giants By Rewriting Their Outputs Vernikos Giorgos, Bražinskas Arthur, Adamek Jakub, Mallinson Jonathan, Severyn Aliaksei, Malmi Eric
- Prompt2model: Generating Deployable Models From Natural Language Instructions Viswanathan Vijay, Zhao Chenyang, Bertsch Amanda, Wu Tongshuang, Neubig Graham
- Generative Pre-trained Transformer For Vietnamese Community-based COVID-19 Question Answering Vo Tam Minh, Tran Khiem Vinh
- Empirical Evaluation Of Uncertainty Quantification In Retrieval-augmented Language Models For Science Wagle Sridevi, Munikoti Sai, Acharya Anurag, Smith Sara, Horawalavithana Sameera
- PIP: Parse-instructed Prefix For Syntactically Controlled Paraphrase Generation Wan Yixin, Huang Kuan-hao, Chang Kai-wei
- Shall We Pretrain Autoregressive Language Models With Retrieval? A Comprehensive Study Wang Boxin, Ping Wei, Xu Peng, Mcafee Lawrence, Liu Zihan, Shoeybi Mohammad, Dong Yi, Kuchaiev Oleksii, Li Bo, Xiao Chaowei, Anandkumar Anima, Catanzaro Bryan
- BERT4CTR: An Efficient Framework To Combine Pre-trained Language Model With Non-textual Features For CTR Prediction Wang Dong, Salamatian Kavé, Xia Yunqing, Deng Weiwei, Zhiang Qi
- Docllm: A Layout-aware Generative Language Model For Multimodal Document Understanding Wang Dongsheng, Raman Natraj, Sibue Mathieu, Ma Zhiqiang, Babkin Petr, Kaur Simerjot, Pei Yulong, Nourbakhsh Armineh, Liu Xiaomo
- Openchat: Advancing Open-source Language Models With Mixed-quality Data Wang Guan, Cheng Sijie, Zhan Xianyuan, Li Xiangang, Song Sen, Liu Yang
- What Makes For Good Visual Tokenizers For Large Language Models? Wang Guangzhi, Ge Yixiao, Ding Xiaohan, Kankanhalli Mohan, Shan Ying
- Dialog Action-aware Transformer For Dialog Policy Learning Wang Huimin, Kwan Wai-chung, Wong Kam-fai
- Knowledgeable In-context Tuning: Exploring And Exploiting Factual Knowledge For In-context Learning Wang Jianing, Wang Chengyu, Tan Chuanqi, Huang Jun, Gao Ming
- Missrec: Pre-training And Transferring Multi-modal Interest-aware Sequence Representation For Recommendation Wang Jinpeng, Zeng Ziyun, Wang Yunxiao, Wang Yuting, Lu Xingyu, Li Tianxiang, Yuan Jun, Zhang Rui, Zheng Hai-tao, Xia Shu-tao
- Improved Visual Fine-tuning With Natural Language Supervision Wang Junyang, Xu Yuanhong, Hu Juhua, Yan Ming, Sang Jitao, Qian Qi
- Mathcoder: Seamless Code Integration In Llms For Enhanced Mathematical Reasoning Wang Ke, Ren Houxing, Zhou Aojun, Lu Zimu, Luo Sichun, Shi Weikang, Zhang Renrui, Song Linqi, Zhan Mingjie, Li Hongsheng
- Adapting LLM Agents With Universal Feedback In Communication Wang Kuan, Lu Yadong, Santacroce Michael, Gong Yeyun, Zhang Chao, Shen Yelong
- Improving Text Embeddings With Large Language Models Wang Liang, Yang Nan, Huang Xiaolong, Yang Linjun, Majumder Rangan, Wei Furu
- Query2doc: Query Expansion With Large Language Models Wang Liang, Yang Nan, Wei Furu
- Making Large Language Models Better Reasoners With Alignment Wang Peiyi, Li Lei, Chen Liang, Song Feifan, Lin Binghuai, Cao Yunbo, Liu Tianyu, Sui Zhifang
- Demystifying Instruction Mixing For Fine-tuning Large Language Models Wang Renxi, Li Haonan, Wu Minghao, Wang Yuxia, Han Xudong, Zhang Chiyu, Baldwin Timothy
- Aurora: Activating Chinese Chat Capability For Mixtral-8x7b Sparse Mixture-of-experts Through Instruction-tuning Wang Rongsheng, Chen Haoming, Zhou Ruizhe, Duan Yaofei, Cai Kunyan, Ma Han, Cui Jiaxi, Li Jian, Pang Patrick Cheong-iao, Wang Yapeng, Tan Tao
- Ivygpt: Interactive Chinese Pathway Language Model In Medical Domain Wang Rongsheng, Duan Yaofei, Lam Chantong, Chen Jiexi, Xu Jiangsheng, Chen Haoming, Liu Xiaohong, Pang Patrick Cheong-iao, Tan Tao
- READ: Recurrent Adaptation Of Large Transformers Wang Sid, Nguyen John, Li Ke, Wu Carole-jean
- CAR: Conceptualization-augmented Reasoner For Zero-shot Commonsense Question Answering Wang Weiqi, Fang Tianqing, Ding Wenxuan, Xu Baixuan, Liu Xin, Song Yangqiu, Bosselut Antoine
- All Languages Matter: On The Multilingual Safety Of Large Language Models Wang Wenxuan, Tu Zhaopeng, Chen Chang, Yuan Youliang, Huang Jen-tse, Jiao Wenxiang, Lyu Michael R.
- Lora Ensembles For Large Language Model Fine-tuning Wang Xi, Aitchison Laurence, Rudolph Maja
- Guiding Language Model Reasoning With Planning Tokens Wang Xinyi, Caccia Lucas, Ostapenko Oleksiy, Yuan Xingdi, Wang William Yang, Sordoni Alessandro
- FIAT: Fusing Learning Paradigms With Instruction-accelerated Tuning Wang Xinyi, Wieting John, Clark Jonathan H.
- User-aware Prefix-tuning Is A Good Learner For Personalized Image Captioning Wang Xuan, Wang Guanhong, Chai Wenhao, Zhou Jiayu, Wang Gaoang
- Non-intrusive Adaptation: Input-centric Parameter-efficient Fine-tuning For Versatile Multimodal Modeling Wang Yaqing, Wu Jialin, Dabral Tanmaya, Zhang Jiageng, Brown Geoff, Lu Chun-ta, Liu Frederick, Liang Yi, Pang Bo, Bendersky Michael, Soricut Radu
- Universality And Limitations Of Prompt Tuning Wang Yihan, Chauhan Jatin, Wang Wei, Hsieh Cho-jui
- Rescue: Ranking LLM Responses With Partial Ordering To Improve Response Generation Wang Yikun, Zheng Rui, Li Haoming, Zhang Qi, Gui Tao, Liu Fei
- Multilora: Democratizing Lora For Better Multi-task Learning Wang Yiming, Lin Yu, Zeng Xiaodong, Zhang Guannan
- Primacy Effect Of Chatgpt Wang Yiwei, Cai Yujun, Chen Muhao, Liang Yuxuan, Hooi Bryan
- Fake Alignment: Are Llms Really Aligned Well? Wang Yixu, Teng Yan, Huang Kexin, Lyu Chengqi, Zhang Songyang, Zhang Wenwei, Ma Xingjun, Jiang Yu-gang, Qiao Yu, Wang Yingchun
- Vaquita: Enhancing Alignment In Llm-assisted Video Understanding Wang Yizhou, Zhang Ruiyi, Wang Haoliang, Bhattacharya Uttaran, Fu Yun, Wu Gang
- Towards Improving Document Understanding: An Exploration On Text-grounding Via Mllms Wang Yonghui, Zhou Wengang, Feng Hao, Zhou Keyi, Li Houqiang
- Smarttrim: Adaptive Tokens And Attention Pruning For Efficient Vision-language Models Wang Zekun, Chen Jingchang, Zhou Wangchunshu, Zhu Haichao, Liang Jiafeng, Shan Liping, Liu Ming, Xu Dongliang, Yang Qing, Qin Bing
- Interviewbot: Real-time End-to-end Dialogue System To Interview Students For College Admission Wang Zihao, Keyes Nathan, Crawford Terry, Choi Jinho D.
- Do Llms Understand User Preferences? Evaluating Llms On User Rating Prediction Wang-cheng Kang, Jianmo Ni, Nikhil Mehta, Maheswaran Sathiamoorthy, Lichan Hong, Ed Chi, Derek Zhiyuan Cheng
- Call For Papers -- The Babylm Challenge: Sample-efficient Pretraining On A Developmentally Plausible Corpus Warstadt Alex, Choshen Leshem, Mueller Aaron, Williams Adina, Wilcox Ethan, Zhuang Chengxu
- A Survey Of Large Language Models Wayne Xin Zhao, Kun Zhou, Junyi Li, Tianyi Tang, Xiaolei Wang, Yupeng Hou, Yingqian Min, Beichen Zhang, Junjie Zhang, Zican Dong, Yifan Du, Chen Yang, Yushuo Chen, Zhipeng Chen, Jinhao Jiang, Ruiyang Ren, Yifan Li, Xinyu Tang, Zikang Liu, Peiyu Liu, Jian-yun Nie, Ji-rong Wen
- Lmsanitator: Defending Prompt-tuning Against Task-agnostic Backdoors Wei Chengkun, Meng Wenlong, Zhang Zhikun, Chen Min, Zhao Minghu, Fang Wenjing, Wang Lei, Zhang Zihui, Chen Wenzhi
- Larger Language Models Do In-context Learning Differently Wei Jerry, Wei Jason, Tay Yi, Tran Dustin, Webson Albert, Lu Yifeng, Chen Xinyun, Liu Hanxiao, Huang Da, Zhou Denny, Ma Tengyu
- Polylm: An Open Source Polyglot Large Language Model Wei Xiangpeng, Wei Haoran, Lin Huan, Li Tianhao, Zhang Pei, Ren Xingzhang, Li Mei, Wan Yu, Cao Zhiwei, Xie Binbin, Hu Tianxiang, Li Shangjie, Hui Binyuan, Yu Bowen, Liu Dayiheng, Yang Baosong, Huang Fei, Xie Jun
- Copiloting The Copilots: Fusing Large Language Models With Completion Engines For Automated Program Repair Wei Yuxiang, Xia Chunqiu Steven, Zhang Lingming
- In-context Pretraining: Language Modeling Beyond Document Boundaries Weijia Shi, Sewon Min, Maria Lomeli, Chunting Zhou, Margaret Li, Gergely Szilvasy, Rich James, Xi Victoria Lin, Noah A. Smith, Luke Zettlemoyer, Scott Yih, Mike Lewis
- LLMSTEP: LLM Proofstep Suggestions In Lean Welleck Sean, Saha Rahul
- Unveiling The Implicit Toxicity In Large Language Models Wen Jiaxin, Ke Pei, Sun Hao, Zhang Zhexin, Li Chengfei, Bai Jinfeng, Huang Minlie
- Lmtuner: An User-friendly And Highly-integrable Training Framework For Fine-tuning Large Language Models Weng Yixuan, Wang Zhiqi, Liao Huanxuan, He Shizhu, Liu Shengping, Liu Kang, Zhao Jun
- Mastering Symbolic Operations: Augmenting Language Models With Compiled Neural Networks Weng Yixuan, Zhu Minjun, Xia Fei, Li Bin, He Shizhu, Liu Kang, Zhao Jun
- Instructblip: Towards General-purpose Vision-language Models With Instruction Tuning Wenliang Dai, Junnan Li, Dongxu Li, Anthony Meng Huat Tiong, Junqi Zhao, Weisheng Wang, Boyang Li, Pascale Fung, Steven Hoi
- An Overview Of Temporal Commonsense Reasoning And Acquisition Wenzel Georg, Jatowt Adam
- System 2 Attention (is Something You Might Need Too) Weston Jason, Sukhbaatar Sainbayar
- Exploring Parameter-efficient Fine-tuning Techniques For Code Generation With Large Language Models Weyssow Martin, Zhou Xin, Kim Kisub, Lo David, Sahraoui Houari
- Llm-powered Data Augmentation For Enhanced Cross-lingual Performance Whitehouse Chenxi, Choudhury Monojit, Aji Alham Fikri
- Resolving References In Visually-grounded Dialogue Via Text Generation Willemsen Bram, Qian Livia, Skantze Gabriel
- Frustratingly Simple Memory Efficiency For Pre-trained Language Models Via Dynamic Embedding Pruning Williams Miles, Aletras Nikolaos
- Natural Language Generation And Understanding Of Big Code For Ai-assisted Programming: A Review Wong Man Fai, Guo Shangxin, Hang Ching Nam, Ho Siu Wai, Tan Chee Wei
- Extrapolating Multilingual Understanding Models As Multilingual Generators Wu Bohong, Yuan Fei, Zhao Hai, Li Lei, Xu Jingjing
- Pmc-llama: Towards Building Open-source Language Models For Medicine Wu Chaoyi, Lin Weixiong, Zhang Xiaoman, Zhang Ya, Wang Yanfeng, Xie Weidi
- Language Prompt For Autonomous Driving Wu Dongming, Han Wencheng, Wang Tiancai, Liu Yingfei, Zhang Xiangyu, Shen Jianbing
- P-laplacian Adaptation For Generative Pre-trained Vision-language Models Wu Haoyuan, Zhang Xinyun, Xu Peng, Liao Peiyu, Yao Xufeng, Yu Bei
- Empower Nested Boolean Logic Via Self-supervised Curriculum Learning Wu Hongqiu, Liu Linfeng, Zhao Hai, Zhang Min
- Efficient LLM Inference Solution On Intel GPU Wu Hui, Gan Yi, Yuan Feng, Ma Jing, Zhu Wei, Xu Yutao, Zhu Hong, Zhu Yuhua, Liu Xiaoli, Gu Jinghui, Zhao Peng
- Document Flattening: Beyond Concatenating Context For Document-level Neural Machine Translation Wu Minghao, Foster George, Qu Lizhen, Haffari Gholamreza
- Lamini-lm: A Diverse Herd Of Distilled Models From Large-scale Instructions Wu Minghao, Waheed Abdul, Zhang Chiyu, Abdul-mageed Muhammad, Aji Alham Fikri
- Approximated Prompt Tuning For Vision-language Pre-trained Models Wu Qiong, Huang Shubin, Zhou Yiyi, Dai Pingyang, Shu Annan, Jiang Guannan, Ji Rongrong
- YUAN 2.0: A Large Language Model With Localized Filtering-based Attention Wu Shaohua, Zhao Xudong, Wang Shenling, Luo Jiangang, Li Lingjun, Chen Xi, Zhao Bing, Wang Wei, Yu Tong, Zhang Rongguo, Zhang Jiahua, Wang Chao
- Paragraph-to-image Generation With Information-enriched Diffusion Model Wu Weijia, Li Zhuang, He Yefei, Shou Mike Zheng, Shen Chunhua, Cheng Lele, Li Yan, Gao Tingting, Zhang Di, Wang Zhongyuan
- TLM: Token-level Masking For Transformers Wu Yangjun, Fang Kebin, Zhang Dongxiang, Wang Han, Zhang Hao, Chen Gang
- Openicl: An Open-source Framework For In-context Learning Wu Zhenyu, Wang Yaoxiang, Ye Jiacheng, Feng Jiangtao, Xu Jingjing, Qiao Yu, Wu Zhiyong
- Pali-3 Vision Language Models: Smaller, Faster, Stronger Xi Chen, Xiao Wang, Lucas Beyer, Alexander Kolesnikov, Jialin Wu, Paul Voigtlaender, Basil Mustafa, Sebastian Goodman, Ibrahim Alabdulmohsin, Piotr Padlewski, Daniel Salz, Xi Xiong, Daniel Vlasic, Filip Pavetic, Keran Rong, Tianli Yu, Daniel Keysers, Xiaohua Zhai, Radu Soricut
- Improving Question Generation With Multi-level Content Planning Xia Zehua, Gou Qi, Yu Bowen, Yu Haiyang, Huang Fei, Li Yongbin, Nguyen Cam-tu
- Efficient Streaming Language Models With Attention Sinks Xiao Guangxuan, Tian Yuandong, Chen Beidi, Han Song, Lewis Mike
- Can I Trust Your Answer? Visually Grounded Video Question Answering Xiao Junbin, Yao Angela, Li Yicong, Chua Tat Seng
- Large Language Models Can Be Good Privacy Protection Learners Xiao Yijia, Jin Yiqiao, Bai Yushi, Wu Yue, Yang Xianjun, Luo Xiao, Yu Wenchao, Zhao Xujiang, Liu Yanchi, Chen Haifeng, Wang Wei, Cheng Wei
- Instruction-vit: Multi-modal Prompts For Instruction Learning In Vit Xiao Zhenxiang, Chen Yuzhong, Zhang Lu, Yao Junjie, Wu Zihao, Yu Xiaowei, Pan Yi, Zhao Lin, Ma Chong, Liu Xinyu, Liu Wei, Li Xiang, Yuan Yixuan, Shen Dinggang, Zhu Dajiang, Liu Tianming, Jiang Xi
- Supporting Qualitative Analysis With Large Language Models: Combining Codebook With GPT-3 For Deductive Coding Xiao Ziang, Yuan Xingdi, Liao Q. Vera, Abdelghani Rania, Oudeyer Pierre-yves
- A Survey On Multi-behavior Sequential Recommendation Xiaoqing Chen, Zhitao Li, Weike Pan, Zhong Ming
- Pangu-σ: Towards Trillion Parameter Language Model With Sparse Heterogeneous Computing Xiaozhe Ren, Pingyi Zhou, Xinfan Meng, Xinjing Huang, Yadao Wang, Weichao Wang, Pengfei Li, Xiaoda Zhang, Alexander Podolskiy, Grigory Arshinov, Andrey Bout, Irina Piontkovskaya, Jiansheng Wei, Xin Jiang, Teng Su, Qun Liu, Jun Yao
- The Wall Street Neophyte: A Zero-shot Analysis Of Chatgpt Over Multimodal Stock Movement Prediction Challenges Xie Qianqian, Han Weiguang, Lai Yanzhao, Peng Min, Huang Jimin
- Proto-lm: A Prototypical Network-based Framework For Built-in Interpretability In Large Language Models Xie Sean, Vosoughi Soroush, Hassanpour Saeed
- Analysis Of The Reasoning With Redundant Information Provided Ability Of Large Language Models Xie Wenbei
- Making Small Language Models Better Multi-task Learners With Mixture-of-task-adapters Xie Yukang, Wang Chengyu, Yan Junbing, Zhou Jiyong, Deng Feiqi, Huang Jun
- Doctorglm: Fine-tuning Your Chinese Doctor Is Not A Herculean Task Xiong Honglin, Wang Sheng, Zhu Yitao, Zhao Zihao, Liu Yuxiao, Huang Linlin, Wang Qian, Shen Dinggang
- Can Llms Express Their Uncertainty? An Empirical Evaluation Of Confidence Elicitation In Llms Xiong Miao, Hu Zhiyuan, Lu Xinyang, Li Yifei, Fu Jie, He Junxian, Hooi Bryan
- Effective Long-context Scaling Of Foundation Models Xiong Wenhan, Liu Jingyu, Molybog Igor, Zhang Hejia, Bhargava Prajjwal, Hou Rui, Martin Louis, Rungta Rashi, Sankararaman Karthik Abinav, Oguz Barlas, Khabsa Madian, Fang Han, Mehdad Yashar, Narang Sharan, Malik Kshitiz, Fan Angela, Bhosale Shruti, Edunov Sergey, Lewis Mike, Wang Sinong, Ma Hao
- Rewoo: Decoupling Reasoning From Observations For Efficient Augmented Language Models Xu Binfeng, Peng Zhiyuan, Lei Bowen, Mukherjee Subhabrata, Liu Yuchen, Xu Dongkuan
- Ctc-based Non-autoregressive Speech Translation Xu Chen, Liu Xiaoqian, Liu Xiaowen, Sun Qingxuan, Zhang Yuhao, Yang Murun, Dong Qianqian, Ko Tom, Wang Mingxuan, Xiao Tong, Ma Anxiang, Zhu Jingbo
- Align On The Fly: Adapting Chatbot Behavior To Established Norms Xu Chunpu, Chern Steffi, Chern Ethan, Zhang Ge, Wang Zekun, Liu Ruibo, Li Jing, Fu Jie, Liu Pengfei
- A Paradigm Shift In Machine Translation: Boosting Translation Performance Of Large Language Models Xu Haoran, Kim Young Jin, Sharaf Amr, Awadalla Hany Hassan
- No Train Still Gain. Unleash Mathematical Reasoning Of Large Language Models With Monte Carlo Tree Search Guided By Energy Function Xu Haotian
- Shattering The Agent-environment Interface For Fine-tuning Inclusive Language Models Xu Wanqiao, Dong Shi, Arumugam Dilip, Van Roy Benjamin
- It Ain't That Bad: Understanding The Mysterious Performance Drop In OOD Generalization For Generative Transformer Models Xu Xingcheng, Pan Zihao, Zhang Haipeng, Yang Yanqing
- Mental-llm: Leveraging Large Language Models For Mental Health Prediction Via Online Text Data Xu Xuhai, Yao Bingsheng, Dong Yuanzhe, Gabriel Saadia, Yu Hong, Hendler James, Ghassemi Marzyeh, Dey Anind K., Wang Dakuo
- Drivegpt4: Interpretable End-to-end Autonomous Driving Via Large Language Model Xu Zhenhua, Zhang Yujia, Xie Enze, Zhao Zhen, Guo Yong, Wong Kwan-yee K., Li Zhenguo, Zhao Hengshuang
- Pit One Against Many: Leveraging Attention-head Embeddings For Parameter-efficient Multi-head Attention Xue Huiyin, Aletras Nikolaos
- Scaling Evidence-based Instructional Design Expertise Through Large Language Models Yadav Gautam
- Pretraining Data Mixtures Enable Narrow Model Selection Capabilities In Transformer Models Yadlowsky Steve, Doshi Lyric, Tripuraneni Nilesh
- LACMA: Language-aligning Contrastive Learning With Meta-actions For Embodied Instruction Following Yang Cheng-fu, Chen Yen-chun, Yang Jianwei, Dai Xiyang, Yuan Lu, Wang Yu-chiang Frank, Chang Kai-wei
- Tackling Vision Language Tasks Through Learning Inner Monologues Yang Diji, Chen Kezhen, Rao Jinmeng, Guo Xiaoyuan, Zhang Yawen, Yang Jie, Zhang Yi
- Refgpt: Dialogue Generation Of GPT, By GPT, And For GPT Yang Dongjie, Yuan Ruifeng, Fan Yuantao, Yang Yifei, Wang Zili, Wang Shusen, Zhao Hai
- PRCA: Fitting Black-box Large Language Models For Retrieval Question Answering Via Pluggable Reward-driven Contextual Adapter Yang Haoyan, Li Zhitao, Zhang Yong, Wang Jianzong, Cheng Ning, Li Ming, Xiao Jing
- Instoptima: Evolutionary Multi-objective Instruction Optimization Via Large Language Model-based Instruction Operators Yang Heng, Li Ke
- Auto-gpt For Online Decision Making: Benchmarks And Additional Opinions Yang Hui, Yue Sifu, He Yunzhong
- Unified Language-vision Pretraining In LLM With Dynamic Discrete Visual Tokenization Yang Jin, Kun Xu, Kun Xu, Liwei Chen, Chao Liao, Jianchao Tan, Quzhe Huang, Bin Chen, Chenyi Lei, An Liu, Chengru Song, Xiaoqiang Lei, Di Zhang, Wenwu Ou, Kun Gai, Yadong Mu
- Kobigbird-large: Transformation Of Transformer For Korean Language Understanding Yang Kisu, Jang Yoonna, Lee Taewoo, Seong Jinwoo, Lee Hyungjin, Jang Hwanseok, Lim Heuiseok
- Skymath: Technical Report Yang Liu, Yang Haihua, Cheng Wenjun, Lin Lei, Li Chenxia, Chen Yifu, Liu Lunan, Pan Jianfei, Wei Tianwen, Li Biye, Zhao Liang, Wang Lijie, Zhu Bo, Li Guoliang, Wu Xuejie, Luo Xilin, Hu Rui
- Lidar-llm: Exploring The Potential Of Large Language Models For 3D Lidar Understanding Yang Senqiao, Liu Jiaming, Zhang Ray, Pan Mingjie, Guo Zoey, Li Xiaoqi, Chen Zehui, Gao Peng, Guo Yandong, Zhang Shanghang
- Progressive Evidence Refinement For Open-domain Multimodal Retrieval Question Answering Yang Shuwen, Wu Anran, Wu Xingjiao, Xiao Luwei, Ma Tianlong, Jin Cheng, He Liang
- Zhongjing: Enhancing The Chinese Medical Capabilities Of Large Language Model Through Expert Feedback And Real-world Multi-turn Dialogue Yang Songhua, Zhao Hanjie, Zhu Senbin, Zhou Guangyu, Xu Hongfei, Jia Yuxiang, Zan Hongying
- Exploring An LM To Generate Prolog Predicates From Mathematics Questions Yang Xiaocheng, Tam Yik-cheung
- Lever LM: Configuring In-context Sequence To Lever Large Vision Language Models Yang Xu, Peng Yingzhe, Ma Haoxuan, Xu Shuo, Zhang Chi, Han Yucheng, Zhang Hanwang
- On The Calibration Of Multilingual Question Answering Llms Yang Yahan, Dan Soham, Roth Dan, Lee Insup
- Bias A-head? Analyzing Bias In Transformer-based Language Model Attention Heads Yang Yi, Duan Hanyu, Abbasi Ahmed, Lalor John P., Tam Kar Yan
- Harnessing The Power Of Large Language Models For Natural Language To First-order Logic Translation Yang Yuan, Xiong Siheng, Payani Ali, Shareghi Ehsan, Fekri Faramarz
- Improving The Reliability Of Large Language Models By Leveraging Uncertainty-aware In-context Learning Yang Yuchen, Li Houqiang, Wang Yanfeng, Wang Yu
- On The Planning, Search, And Memorization Capabilities Of Large Language Models Yang Yunhao, Tomar Anshul
- TEAL: Tokenize And Embed ALL For Multi-modal Large Language Models Yang Zhen, Zhang Yingxue, Meng Fandong, Zhou Jie
- Re-vilm: Retrieval-augmented Visual Language Model For Zero And Few-shot Image Captioning Yang Zhuolin, Ping Wei, Liu Zihan, Korthikanti Vijay, Nie Weili, Huang De-an, Fan Linxi, Yu Zhiding, Lan Shiyi, Li Bo, Liu Ming-yu, Zhu Yuke, Shoeybi Mohammad, Catanzaro Bryan, Xiao Chaowei, Anandkumar Anima
- Quantization-aware And Tensor-compressed Training Of Transformers For Natural Language Understanding Yang Zi, Choudhary Samridhi, Kunzmann Siegfried, Zhang Zheng
- I-code V2: An Autoregressive Generation Framework Over Vision, Language, And Speech Data Yang Ziyi, Khademi Mahmoud, Xu Yichong, Pryzant Reid, Fang Yuwei, Zhu Chenguang, Chen Dongdong, Qian Yao, Gao Mei, Chen Yi-ling, Gmyr Robert, Kanda Naoyuki, Codella Noel, Xiao Bin, Shi Yu, Yuan Lu, Yoshioka Takuya, Zeng Michael, Huang Xuedong
- LLM Lies: Hallucinations Are Not Bugs, But Features As Adversarial Examples Yao Jia-yu, Ning Kun-peng, Liu Zhen-hui, Ning Mu-nan, Liu Yu-yang, Yuan Li
- Knowledge Plugins: Enhancing Large Language Models For Domain-specific Recommendations Yao Jing, Xu Wei, Lian Jianxun, Wang Xiting, Yi Xiaoyuan, Xie Xing
- Retroformer: Retrospective Large Language Agents With Policy Gradient Optimization Yao Weiran, Heinecke Shelby, Niebles Juan Carlos, Liu Zhiwei, Feng Yihao, Xue Le, Murthy Rithesh, Chen Zeyuan, Zhang Jianguo, Arpit Devansh, Xu Ran, Mui Phil, Wang Huan, Xiong Caiming, Savarese Silvio
- Deltazip: Multi-tenant Language Model Serving Via Delta Compression Yao Xiaozhe, Klimovic Ana
- Ureader: Universal Ocr-free Visually-situated Language Understanding With Multimodal Large Language Model Ye Jiabo, Hu Anwen, Xu Haiyang, Ye Qinghao, Yan Ming, Xu Guohai, Li Chenliang, Tian Junfeng, Qian Qi, Zhang Ji, Jin Qin, He Liang, Lin Xin Alex, Huang Fei
- Diffusion Language Models Can Perform Many Tasks With Scaling And Instruction-finetuning Ye Jiasheng, Zheng Zaixiang, Bao Yu, Qian Lihua, Gu Quanquan
- Qilin-med: Multi-stage Knowledge Injection Advanced Medical Large Language Model Ye Qichen, Liu Junling, Chong Dading, Zhou Peilin, Hua Yining, Liu Fenglin, Cao Meng, Wang Ziming, Cheng Xuxin, Lei Zhu, Guo Zhenhua
- ASPEN: High-throughput Lora Fine-tuning Of Large Language Models With A Single GPU Ye Zhengmao, Li Dengchun, Tian Jingqi, Lan Tingfeng, Zuo Jie, Duan Lei, Lu Hui, Jiang Yexi, Sha Jian, Zhang Ke, Tang Mingjie
- Navigating Text-to-image Customization: From Lycoris Fine-tuning To Model Evaluation Yeh Shih-ying, Hsieh Yu-guan, Gao Zhidong, Yang Bernard B W, Oh Giyeong, Gong Yanmin
- QAID: Question Answering Inspired Few-shot Intent Detection Yehudai Asaf, Vetzler Matan, Mass Yosi, Lazar Koren, Cohen Doron, Carmeli Boaz
- Cones: Concept Embedding Search For Parameter Efficient Tuning Large Vision Language Models Yi Huahui, Qin Ziyuan, Xu Wei, Guo Miaotian, Wang Kun, Zhang Shaoting, Li Kang, Lao Qicheng
- Benchmarking And Defending Against Indirect Prompt Injection Attacks On Large Language Models Yi Jingwei, Xie Yueqi, Zhu Bin, Kiciman Emre, Sun Guangzhong, Xie Xing, Wu Fangzhao
- Lemur: Harmonizing Natural Language And Code For Language Agents Yiheng Xu, Hongjin Su, Chen Xing, Boyu Mi, Qian Liu, Weijia Shi, Binyuan Hui, Fan Zhou, Yitao Liu, Tianbao Xie, Zhoujun Cheng, Siheng Zhao, Lingpeng Kong, Bailin Wang, Caiming Xiong, Tao Yu
- Ctrlstruct: Dialogue Structure Learning For Open-domain Response Generation Yin Congchi, Li Piji, Ren Zhaochun
- From Text To Motion: Grounding GPT-4 In A Humanoid Robot "alter3" Yoshida Takahide, Masumori Atsushi, Ikegami Takashi
- \(P^{3}O\): Transferring Visual Representations For Reinforcement Learning Via Prompting You Guoliang, Chu Xiaomeng, Duan Yifan, Peng Jie, Ji Jianmin, Zhang Yu, Zhang Yanyong
- Language Models Are Super Mario: Absorbing Abilities From Homologous Models As A Free Lunch Yu Le, Yu Bowen, Yu Haiyang, Huang Fei, Li Yongbin
- Scaling Autoregressive Multi-modal Models: Pretraining And Instruction Tuning Yu Lili, Shi Bowen, Pasunuru Ramakanth, Muller Benjamin, Golovneva Olga, Wang Tianlu, Babu Arun, Tang Binh, Karrer Brian, Sheynin Shelly, Ross Candace, Polyak Adam, Howes Russell, Sharma Vasu, Xu Puxin, Tamoyan Hovhannes, Ashual Oron, Singer Uriel, Li Shang-wen, Zhang Susan, James Richard, Ghosh Gargi, Taigman Yaniv, Fazel-zarandi Maryam, Celikyilmaz Asli, Zettlemoyer Luke, Aghajanyan Armen
- MEGABYTE: Predicting Million-byte Sequences With Multiscale Transformers Yu Lili, Simig Dániel, Flaherty Colin, Aghajanyan Armen, Zettlemoyer Luke, Lewis Mike
- Information Association For Language Model Updating By Mitigating Lm-logical Discrepancy Yu Pengfei, Ji Heng
- Characterizing Mechanisms For Factual Recall In Language Models Yu Qinan, Merullo Jack, Pavlick Ellie
- Seqgpt: An Out-of-the-box Large Language Model For Open Domain Sequence Understanding Yu Tianyu, Jiang Chengyue, Lou Chao, Huang Shen, Wang Xiaobin, Liu Wei, Cai Jiong, Li Yangning, Li Yinghui, Tu Kewei, Zheng Hai-tao, Zhang Ningyu, Xie Pengjun, Huang Fei, Jiang Yong
- Improving Language Models Via Plug-and-play Retrieval Feedback Yu Wenhao, Zhang Zhihan, Liang Zhenwen, Jiang Meng, Sabharwal Ashish
- Training With "paraphrasing The Original Text" Improves Long-context Performance Yu Yijiong, Huang Yongfeng, Qi Zhixiao, Zhou Zhe
- Fine-tuning Language Models With Generative Adversarial Reward Modelling Yu Zhang Ze, Jaw Lau Jia, Hui Zhang, Low Bryan Kian Hsiang
- How Vocabulary Sharing Facilitates Multilingualism In Llama? Yuan Fei, Yuan Shuai, Wu Zhiyong, Li Lei
- Scaling Relationship On Learning Mathematical Reasoning With Large Language Models Yuan Zheng, Yuan Hongyi, Li Chengpeng, Dong Guanting, Lu Keming, Tan Chuanqi, Zhou Chang, Zhou Jingren
- Multilingual Augmentation For Robust Visual Question Answering In Remote Sensing Images Yuan Zhenghang, Mou Lichao, Zhu Xiao Xiang
- Artgpt-4: Towards Artistic-understanding Large Vision-language Models With Enhanced Adapter Yuan Zhengqing, He Yunhong, Wang Kun, Ye Yanfang, Sun Lichao
- Evotext: Enhancing Natural Language Generation Models Via Self-escalation Learning For Up-to-date Knowledge And Improved Performance Yuan Zhengqing, Xue Huiwen, Zhang Chao, Liu Yongming
- Evaluating Instruction-tuned Large Language Models On Code Comprehension And Generation Yuan Zhiqiang, Liu Junwei, Zi Qiancheng, Liu Mingwei, Peng Xin, Lou Yiling
- Plan, Eliminate, And Track -- Language Models Are Good Teachers For Embodied Agents Yue Wu, So Yeon Min, Yonatan Bisk, Ruslan Salakhutdinov, Amos Azaria, Yuanzhi Li, Tom Mitchell, Shrimai Prabhumoye
- Prompt Highlighter: Interactive Control For Multi-modal Llms Yuechen Zhang, Shengju Qian, Bohao Peng, Shu Liu, Jiaya Jia
- Aligning Large Language Models With Human: A Survey Yufei Wang, Wanjun Zhong, Liangyou Li, Fei Mi, Xingshan Zeng, Wenyong Huang, Lifeng Shang, Xin Jiang, Qun Liu
- Attention Satisfies: A Constraint-satisfaction Lens On Factual Errors Of Language Models Yuksekgonul Mert, Chandrasekaran Varun, Jones Erik, Gunasekar Suriya, Naik Ranjita, Palangi Hamid, Kamar Ece, Nushi Besmira
- Planting A SEED Of Vision In Large Language Model Yuying Ge, Yixiao Ge, Ziyun Zeng, Xintao Wang, Ying Shan
- Fairness-aware Structured Pruning In Transformers Zayed Abdelrahman, Mordido Goncalo, Shabanian Samira, Baldini Ioana, Chandar Sarath
- TIM: Teaching Large Language Models To Translate With Comparison Zeng Jiali, Meng Fandong, Yin Yongjing, Zhou Jie
- Exploring Memorization In Fine-tuned Language Models Zeng Shenglai, Li Yaxin, Ren Jie, Liu Yiding, Xu Han, He Pengfei, Xing Yue, Wang Shuaiqiang, Tang Jiliang, Yin Dawei
- Building Open-ended Embodied Agent Via Language-policy Bidirectional Adaptation Zhai Shaopeng, Wang Jie, Zhang Tianyi, Huang Fuxian, Zhang Qi, Zhou Ming, Hou Jing, Qiao Yu, Liu Yu
- Investigating The Catastrophic Forgetting In Multimodal Large Language Models Zhai Yuexiang, Tong Shengbang, Li Xiao, Cai Mu, Qu Qing, Lee Yong Jae, Ma Yi
- Removing RLHF Protections In GPT-4 Via Fine-tuning Zhan Qiusi, Fang Richard, Bindu Rohan, Gupta Akul, Hashimoto Tatsunori, Kang Daniel
- Cot-bert: Enhancing Unsupervised Sentence Representation Through Chain-of-thought Zhang Bowen, Chang Kehua, Li Chunping
- Mm-narrator: Narrating Long-form Videos With Multimodal In-context Learning Zhang Chaoyi, Lin Kevin, Yang Zhengyuan, Wang Jianfeng, Li Linjie, Lin Chung-ching, Liu Zicheng, Wang Lijuan
- User-controlled Knowledge Fusion In Large Language Models: Balancing Creativity And Hallucination Zhang Chen
- Automatic Generation Of Multiple-choice Questions Zhang Cheng
- Revisiting Block-based Quantisation: What Is Important For Sub-8-bit LLM Inference? Zhang Cheng, Cheng Jianyi, Shumailov Ilia, Constantinides George A., Zhao Yiren
- Benchmarking And In-depth Performance Study Of Large Language Models On Habana Gaudi Processors Zhang Chengming, Sun Baixi, Yu Xiaodong, Xie Zhen, Zheng Weijian, Iskra Kamil, Beckman Pete, Tao Dingwen
- The Skipped Beat: A Study Of Sociopragmatic Understanding In Llms For 64 Languages Zhang Chiyu, Doan Khai Duy, Liao Qisheng, Abdul-mageed Muhammad
- Taxonomy-based Checklist For Large Language Model Evaluation Zhang Damin
- On The Safety Of Open-sourced Large Language Models: Does Alignment Really Prevent Them From Being Misused? Zhang Hangfan, Guo Zhimeng, Zhu Huaisheng, Cao Bochuan, Lin Lu, Jia Jinyuan, Chen Jinghui, Wu Dinghao
- Controllable Text Generation With Residual Memory Transformer Zhang Hanqing, Si Sun, Wu Haiming, Song Dawei
- Tuning Large Language Model For End-to-end Speech Translation Zhang Hao, Si Nianwen, Chen Yaqi, Zhang Wenlin, Yang Xukui, Qu Dan, Jiao Xiaolin
- XATU: A Fine-grained Instruction-based Benchmark For Explainable Text Updates Zhang Haopeng, Iso Hayate, Gurajada Sairam, Bhutani Nikita
- Tractable Control For Autoregressive Language Generation Zhang Honghua, Dang Meihua, Peng Nanyun, Broeck Guy Van Den
- Linear Attention Via Orthogonal Memory Zhang Jun, Jiang Shuyang, Feng Jiangtao, Zheng Lin, Kong Lingpeng
- Aligning Instruction Tasks Unlocks Large Language Models As Zero-shot Relation Extractors Zhang Kai, Gutiérrez Bernal Jiménez, Su Yu
- Llm-based Medical Assistant Personalization With Short- And Long-term Memory Coordination Zhang Kai, Kang Yangyang, Zhao Fubang, Liu Xiaozhong
- Crash: Clustering, Removing, And Sharing Enhance Fine-tuning Without Full Large Language Model Zhang Kaiyan, Ding Ning, Qi Biqing, Zhu Xuekai, Long Xinwei, Zhou Bowen
- Weighted Sampling For Masked Language Modeling Zhang Linhan, Chen Qian, Wang Wen, Deng Chong, Cao Xin, Hao Kongzhang, Jiang Yuxin, Wang Wei
- A Two-stage Adaptation Of Large Language Models For Text Ranking Zhang Longhui, Zhang Yanzhao, Long Dingkun, Xie Pengjun, Zhang Meishan, Zhang Min
- Lora-fa: Memory-efficient Low-rank Adaptation For Large Language Models Fine-tuning Zhang Longteng, Zhang Lin, Shi Shaohuai, Chu Xiaowen, Li Bo
- Retrieve Anything To Augment Large Language Models Zhang Peitian, Xiao Shitao, Liu Zheng, Dou Zhicheng, Nie Jian-yun
- Llama-adapter: Efficient Fine-tuning Of Language Models With Zero-init Attention Zhang Renrui, Han Jiaming, Liu Chris, Gao Peng, Zhou Aojun, Hu Xiangfei, Yan Shilin, Lu Pan, Li Hongsheng, Qiao Yu
- Generation-driven Contrastive Self-training For Zero-shot Text Classification With Instruction-following LLM Zhang Ruohong, Wang Yau-shian, Yang Yiming
- Planning With Large Language Models For Code Generation Zhang Shun, Chen Zhenfang, Shen Yikang, Ding Mingyu, Tenenbaum Joshua B., Gan Chuang
- The Wisdom Of Hindsight Makes Language Models Better Instruction Followers Zhang Tianjun, Liu Fangchen, Wong Justin, Abbeel Pieter, Gonzalez Joseph E.
- History-aware Hierarchical Transformer For Multi-session Open-domain Dialogue System Zhang Tong, Liu Yong, Li Boyang, Zeng Zhiwei, Wang Pengwei, You Yuan, Miao Chunyan, Cui Lizhen
- Adarefiner: Refining Decisions Of Language Models With Adaptive Feedback Zhang Wanpeng, Lu Zongqing
- Bridging The Information Gap Between Domain-specific Model And General LLM For Personalized Recommendation Zhang Wenxuan, Liu Hongzhi, Du Yingpeng, Zhu Chen, Song Yang, Zhu Hengshu, Wu Zhonghai
- Large Language Models In Medical Term Classification And Unexpected Misalignment Between Response And Reasoning Zhang Xiaodan, Vemulapalli Sandeep, Talukdar Nabasmita, Ahn Sumyeong, Wang Jiankun, Meng Han, Murtaza Sardar Mehtab Bin, Dave Aakash Ajay, Leshchiner Dmitry, Joseph Dimitri F., Witteveen-lane Martin, Chesla Dave, Zhou Jiayu, Chen Bin
- Language Models Are Universal Embedders Zhang Xin, Li Zehan, Zhang Yanzhao, Long Dingkun, Xie Pengjun, Zhang Meishan, Zhang Min
- Dialoguellm: Context And Emotion Knowledge-tuned Large Language Models For Emotion Recognition In Conversations Zhang Yazhou, Wang Mengyao, Wu Youxi, Tiwari Prayag, Li Qiuchi, Wang Benyou, Qin Jing
- Cached Transformers: Improving Transformers With Differentiable Memory Cache Zhang Zhaoyang, Shao Wenqi, Ge Yixiao, Wang Xiaogang, Gu Jinwei, Luo Ping
- Mani-gpt: A Generative Model For Interactive Robotic Manipulation Zhang Zhe, Chai Wei, Wang Jiankun
- Balancing Specialized And General Skills In Llms: The Impact Of Modern Tuning And Data Strategy Zhang Zheng, Zheng Chen, Tang Da, Sun Ke, Ma Yukun, Bu Yingtong, Zhou Xun, Zhao Liang
- Auto-instruct: Automatic Instruction Generation And Ranking For Black-box Language Models Zhang Zhihan, Wang Shuohang, Yu Wenhao, Xu Yichong, Iter Dan, Zeng Qingkai, Liu Yang, Zhu Chenguang, Jiang Meng
- Accountable Textual-visual Chat Learns To Reject Human Instructions In Image Re-creation Zhang Zhiwei, Liu Yuliang
- Unifying The Perspectives Of NLP And Software Engineering: A Survey On Language Models For Code Zhang Ziyin, Chen Chaoyu, Liu Bingchang, Liao Cong, Gong Zi, Yu Hang, Li Jianguo, Wang Rui
- MELA: Multilingual Evaluation Of Linguistic Acceptability Zhang Ziyin, Liu Yikang, Huang Weifang, Mao Junyu, Wang Rui, Hu Hai
- Large Language Models As Zero-shot Conversational Recommenders Zhankui He, Zhouhang Xie, Rahul Jha, Harald Steck, Dawen Liang, Yesu Feng, Bodhisattwa Prasad Majumder, Nathan Kallus, Julian Mcauley
- Large Language Models Are Complex Table Parsers Zhao Bowen, Ji Changkai, Zhang Yuejie, He Wen, Wang Yingwen, Wang Qing, Feng Rui, Zhang Xiaobo
- Explainability For Large Language Models: A Survey Zhao Haiyan, Chen Hanjie, Yang Fan, Liu Ninghao, Deng Huiqi, Cai Hengyi, Wang Shuaiqiang, Yin Dawei, Du Mengnan
- Hallucination Detection For Grounded Instruction Generation Zhao Lingjun, Nguyen Khanh, Daumé Hal III
- Group Preference Optimization: Few-shot Alignment Of Large Language Models Zhao Siyan, Dang John, Grover Aditya
- A Survey Of Large Language Models Zhao Wayne Xin, Zhou Kun, Li Junyi, Tang Tianyi, Wang Xiaolei, Hou Yupeng, Min Yingqian, Zhang Beichen, Zhang Junjie, Dong Zican, Du Yifan, Yang Chen, Chen Yushuo, Chen Zhipeng, Jiang Jinhao, Ren Ruiyang, Li Yifan, Tang Xinyu, Liu Zikang, Liu Peiyu, Nie Jian-yun, Wen Ji-rong
- Babystories: Can Reinforcement Learning Teach Baby Language Models To Write Better Stories? Zhao Xingmeng, Wang Tongnian, Osborn Sheri, Rios Anthony
- Slic-hf: Sequence Likelihood Calibration With Human Feedback Zhao Yao, Joshi Rishabh, Liu Tianqi, Khalman Misha, Saleh Mohammad, Liu Peter J.
- Multi-modal In-context Learning Makes An Ego-evolving Scene Text Recognizer Zhao Zhen, Tang Jingqun, Lin Chunhui, Wu Binghong, Huang Can, Liu Hao, Tan Xin, Zhang Zhizhong, Xie Yuan
- Adapting Large Language Models By Integrating Collaborative Semantics For Recommendation Zheng Bowen, Hou Yupeng, Lu Hongyu, Chen Yu, Zhao Wayne Xin, Chen Ming, Wen Ji-rong
- Can We Edit Factual Knowledge By In-context Learning? Zheng Ce, Li Lei, Dong Qingxiu, Fan Yuxuan, Wu Zhiyong, Xu Jingjing, Chang Baobao
- Learn From Model Beyond Fine-tuning: A Survey Zheng Hongling, Shen Li, Tang Anke, Luo Yong, Hu Han, Du Bo, Tao Dacheng
- Candidate Soups: Fusing Candidate Results Improves Translation Quality For Non-autoregressive Translation Zheng Huanran, Zhu Wei, Wang Pengfei, Wang Xiaoling
- Trafficsafetygpt: Tuning A Pre-trained Large Language Model To A Domain-specific Expert In Transportation Safety Zheng Ou, Abdel-aty Mohamed, Wang Dongdong, Wang Chenzhu, Ding Shengxuan
- Llamarec: Two-stage Recommendation Using Large Language Models For Ranking Zhenrui Yue, Sara Rabhi, Gabriel De Souza Pereira Moreira, Dong Wang, Even Oldridge
- Principle-driven Self-alignment Of Language Models From Scratch With Minimal Human Supervision Zhiqing Sun, Yikang Shen, Qinhong Zhou, Hongxin Zhang, Zhenfang Chen, David Cox, Yiming Yang, Chuang Gan
- Bookgpt: A General Framework For Book Recommendation Empowered By Large Language Model Zhiyuli Aakas, Chen Yanfang, Zhang Xuan, Liang Xun
- Revisiting Token Dropping Strategy In Efficient BERT Pretraining Zhong Qihuang, Ding Liang, Liu Juhua, Liu Xuebo, Zhang Min, Du Bo, Tao Dacheng
- Bag Of Tricks For Effective Language Model Pretraining And Downstream Adaptation: A Case Study On GLUE Zhong Qihuang, Ding Liang, Peng Keqin, Liu Juhua, Du Bo, Shen Li, Zhan Yibing, Tao Dacheng
- Sur-adapter: Enhancing Text-to-image Pre-trained Diffusion Models With Large Language Models Zhong Shanshan, Huang Zhongzhan, Wen Wushao, Qin Jinghui, Lin Liang
- Language Agent Tree Search Unifies Reasoning Acting And Planning In Language Models Zhou Andy, Yan Kai, Shlapentokh-rothman Michal, Wang Haohan, Wang Yu-xiong
- Lobass: Gauging Learnability In Supervised Fine-tuning Data Zhou Haotian, Liu Tingkai, Ma Qianli, Yuan Jianbo, Liu Pengfei, You Yang, Yang Hongxia
- Vicor: Bridging Visual Understanding And Commonsense Reasoning With Large Language Models Zhou Kaiwen, Lee Kwonjoon, Misu Teruhisa, Wang Xin Eric
- CONA: A Novel Context-aware Instruction Paradigm For Communication Using Large Language Model Zhou Nan, Tao Xinghui, Chen Xi
- Infmllm: A Unified Framework For Visual-language Tasks Zhou Qiang, Wang Zhibin, Chu Wei, Xu Yinghui, Li Hao, Qi Yuan
- Recurrentgpt: Interactive Generation Of (arbitrarily) Long Text Zhou Wangchunshu, Jiang Yuchen Eleanor, Cui Peng, Wang Tiannan, Xiao Zhenxin, Hou Yifan, Cotterell Ryan, Sachan Mrinmaya
- Multi-stage Pre-training Enhanced By Chatgpt For Multi-scenario Multi-domain Dialogue Summarization Zhou Weixiao, Li Gengyao, Cheng Xianfu, Liang Xinnian, Zhu Junnan, Zhai Feifei, Li Zhoujun
- Making Harmful Behaviors Unlearnable For Large Language Models Zhou Xin, Lu Yi, Ma Ruotian, Gui Tao, Zhang Qi, Huang Xuanjing
- Advancing Transformer's Capabilities In Commonsense Reasoning Zhou Yu, Han Yunqiu, Zhou Hanyu, Wu Yulun
- Revisiting Automated Prompting: Are We Actually Doing Better? Zhou Yulin, Zhao Yiren, Shumailov Ilia, Mullins Robert, Gal Yarin
- Quantifying And Analyzing Entity-level Memorization In Large Language Models Zhou Zhenhong, Xiang Jiuyang, Chen Chaomeng, Su Sen
- On The Calibration Of Large Language Models And Alignment Zhu Chiwei, Xu Benfeng, Wang Quan, Zhang Yongdong, Mao Zhendong
- Pose: Efficient Context Window Extension Of Llms Via Positional Skip-wise Training Zhu Dawei, Yang Nan, Wang Liang, Song Yifan, Wu Wenhao, Wei Furu, Li Sujian
- Dialogue-contextualized Re-ranking For Medical History-taking Zhu Jian, Valmianski Ilya, Kannan Anitha
- VL-GPT: A Generative Pre-trained Transformer For Vision And Language Understanding And Generation Zhu Jinguo, Ding Xiaohan, Ge Yixiao, Ge Yuying, Zhao Sijie, Zhao Hengshuang, Wang Xiaohua, Shan Ying
- Judgelm: Fine-tuned Large Language Models Are Scalable Judges Zhu Lianghui, Wang Xinggang, Wang Xinlong
- Coca: Fusing Position Embedding With Collinear Constrained Attention In Transformers For Long Context Window Extending Zhu Shiyi, Ye Jing, Jiang Wei, Xue Siqiao, Zhang Qi, Wu Yifan, Li Jianguo
- Promptcblue: A Chinese Prompt Tuning Benchmark For The Medical Domain Zhu Wei, Wang Xiaoling, Zheng Huanran, Chen Mosha, Tang Buzhou
- Pad: Program-aided Distillation Can Teach Small Models Reasoning Better Than Chain-of-thought Fine-tuning Zhu Xuekai, Qi Biqing, Zhang Kaiyan, Long Xinwei, Lin Zhouhan, Zhou Bowen
- 3d-vista: Pre-trained Transformer For 3D Vision And Text Alignment Zhu Ziyu, Ma Xiaojian, Chen Yixin, Deng Zhidong, Huang Siyuan, Li Qing
- Open-source Large Language Models Are Strong Zero-shot Query Likelihood Models For Document Ranking Zhuang Shengyao, Liu Bing, Koopman Bevan, Zuccon Guido
- Aligner: One Global Token Is Worth Millions Of Parameters When Aligning Large Language Models Zhou Ziheng, Wu Yingnian, Zhu Song-chun, Terzopoulos Demetri
- Recommender Systems In The Era Of Large Language Models (llms) Zihuai Zhao, Wenqi Fan, Jiatong Li, Yunqing Liu, Xiaowei Mei, Yiqi Wang, Zhen Wen, Fei Wang, Xiangyu Zhao, Jiliang Tang, Qing Li
- A Family Of Pretrained Transformer Language Models For Russian Zmitrovich Dmitry, Abramov Alexander, Kalmykov Andrey, Tikhonova Maria, Taktasheva Ekaterina, Astafurov Danil, Baushenko Mark, Snegirev Artem, Kadulin Vitalii, Markov Sergey, Shavrina Tatiana, Mikhailov Vladislav, Fenogenova Alena
- Concept-aware Training Improves In-context Learning Ability Of Language Models Štefánik Michal, Kadlčík Marek
- A Dataset And Benchmark For Hospital Course Summarization With Adapted Large Language Models Aali Asad, Van Veen Dave, Arefeen Yamin Ishraq, Hom Jason, Bluethgen Christian, Reis Eduardo Pontes, Gatidis Sergios, Clifford Namuun, Daws Joseph, Tehrani Arash S., Kim Jangwon, Chaudhari Akshay S.
- Are You Still On Track!? Catching LLM Task Drift With Activations Abdelnabi Sahar, Fay Aideen, Cherubin Giovanni, Salem Ahmed, Fritz Mario, Paverd Andrew
- Olvit: Multi-modal State Tracking Via Attention-based Embeddings For Video-grounded Dialog Abdessaied Adnen, Von Hochmeister Manuel, Bulling Andreas
- The Challenges Of Evaluating LLM Applications: An Analysis Of Automated, Human, And Llm-based Approaches Abeysinghe Bhashithe, Circi Ruhan
- The Llama 3 Herd Of Models Abhimanyu Dubey, Abhinav Jauhri, Abhinav Pandey, Abhishek Kadian, Ahmad Al-dahle, Aiesha Letman, Akhil Mathur, Alan Schelten, Amy Yang, Angela Fan, Anirudh Goyal, Anthony Hartshorn, Aobo Yang, Archi Mitra, Archie Sravankumar, Artem Korenev, Arthur Hinsvark, Arun Rao, Aston Zhang, Aurelien Rodriguez, Austen Gregerson, Ava Spataru, Baptiste Roziere, Bethany Biron, Binh Tang, Bobbie Chern, Charlotte Caucheteux, Chaya Nayak, Chloe Bi, Chris Marra, Chris Mcconnell, Christian Keller, Christophe Touret, Chunyang Wu, Corinne Wong, Cristian Canton Ferrer, Cyrus Nikolaidis, Damien Allonsius, Daniel Song, Danielle Pintz, Danny Livshits, David Esiobu, Dhruv Choudhary, Dhruv Mahajan, Diego Garcia-olano, Diego Perino, Dieuwke Hupkes, Egor Lakomkin, Ehab Albadawy, Elina Lobanova, Emily Dinan, Eric Michael Smith, Filip Radenovic, Frank Zhang, Gabriel Synnaeve, Gabrielle Lee, Georgia Lewis Anderson, Graeme Nail, Gregoire Mialon, Guan Pang, Guillem Cucurell, Hailey Nguyen, Hannah Korevaar, Hu Xu, Hugo Touvron, Iliyan Zarov, Imanol Arrieta Ibarra, Isabel Kloumann, Ishan Misra, Ivan Evtimov, Jade Copet, Jaewon Lee, Jan Geffert, Jana Vranes, Jason Park, Jay Mahadeokar, Jeet Shah, Jelmer Van Der Linde, Jennifer Billock, Jenny Hong, Jenya Lee, Jeremy Fu, Jianfeng Chi, Jianyu Huang, Jiawen Liu, Jie Wang, Jiecao Yu, Joanna Bitton, Joe Spisak, Jongsoo Park, Joseph Rocca, Joshua Johnstun, Joshua Saxe, Junteng Jia, Kalyan Vasuden Alwala, Kartikeya Upasani, Kate Plawiak, Ke Li, Kenneth Heafield, Kevin Stone, Khalid El-arini, Krithika Iyer, Kshitiz Malik, Kuenley Chiu, Kunal Bhalla, Lauren Rantala-yeary, Laurens Van Der Maaten, Lawrence Chen, Liang Tan, Liz Jenkins, Louis Martin, Lovish Madaan, Lubo Malo, Lukas Blecher, Lukas Landzaat, Luke De Oliveira, Madeline Muzzi, Mahesh Pasupuleti, Mannat Singh, Manohar Paluri, Marcin Kardas, Mathew Oldham, Mathieu Rita, Maya Pavlova, Melanie Kambadur, Mike Lewis, Min Si, Mitesh Kumar Singh, Mona Hassan, Naman Goyal, Narjes Torabi, Nikolay Bashlykov, Nikolay Bogoychev, Niladri Chatterji, Olivier Duchenne, Onur Çelebi, Patrick Alrassy, Pengchuan Zhang, Pengwei Li, Petar Vasic, Peter Weng, Prajjwal Bhargava, Pratik Dubal, Praveen Krishnan, Punit Singh Koura, Puxin Xu, Qing He, Qingxiao Dong, Ragavan Srinivasan, Raj Ganapathy, Ramon Calderer, Ricardo Silveira Cabral, Robert Stojnic, Roberta Raileanu, Rohit Girdhar, Rohit Patel, Romain Sauvestre, Ronnie Polidoro, Roshan Sumbaly, Ross Taylor, Ruan Silva, Rui Hou, Rui Wang, Saghar Hosseini, Sahana Chennabasappa, Sanjay Singh, Sean Bell, Seohyun Sonia Kim, Sergey Edunov, Shaoliang Nie, Sharan Narang, Sharath Raparthy, Sheng Shen, Shengye Wan, Shruti Bhosale, Shun Zhang, Simon Vandenhende, Soumya Batra, Spencer Whitman, Sten Sootla, Stephane Collot, Suchin Gururangan, Sydney Borodinsky, Tamar Herman, Tara Fowler, Tarek Sheasha, Thomas Georgiou, Thomas Scialom, Tobias Speckbacher, Todor Mihaylov, Tong Xiao, Ujjwal Karn, Vedanuj Goswami, Vibhor Gupta, Vignesh Ramanathan, Viktor Kerkez, Vincent Gonguet, Virginie Do, Vish Vogeti, Vladan Petrovic, Weiwei Chu, Wenhan Xiong, Wenyin Fu, Whitney Meers, Xavier Martinet, Xiaodong Wang, Xiaoqing Ellen Tan, Xinfeng Xie, Xuchao Jia, Xuewei Wang, Yaelle Goldschlag, Yashesh Gaur, Yasmine Babaei, Yi Wen, Yiwen Song, Yuchen Zhang, Yue Li, Yuning Mao, Zacharie Delpierre Coudert, Zheng Yan, Zhengxing Chen, Zoe Papakipos, Aaditya Singh, Aaron Grattafiori, Abha Jain, Adam Kelsey, Adam Shajnfeld, Adithya Gangidi, Adolfo Victoria, Ahuva Goldstand, Ajay Menon, Ajay Sharma, Alex Boesenberg, Alex Vaughan, Alexei Baevski, Allie Feinstein, Amanda Kallet, Amit Sangani, Anam Yunus, Andrei Lupu, Andres Alvarado, Andrew Caples, Andrew Gu, Andrew Ho, Andrew Poulton, Andrew Ryan, Ankit Ramchandani, Annie Franco, Aparajita Saraf, Arkabandhu Chowdhury, Ashley Gabriel, Ashwin Bharambe, Assaf Eisenman, Azadeh Yazdan, Beau James, Ben Maurer, Benjamin Leonhardi, Bernie Huang, Beth Loyd, Beto De Paola, Bhargavi Paranjape, Bing Liu, Bo Wu, Boyu Ni, Braden Hancock, Bram Wasti, Brandon Spence, Brani Stojkovic, Brian Gamido, Britt Montalvo, Carl Parker, Carly Burton, Catalina Mejia, Changhan Wang, Changkyu Kim, Chao Zhou, Chester Hu, Ching-hsiang Chu, Chris Cai, Chris Tindal, Christoph Feichtenhofer, Damon Civin, Dana Beaty, Daniel Kreymer, Daniel Li, Danny Wyatt, David Adkins, David Xu, Davide Testuggine, Delia David, Devi Parikh, Diana Liskovich, Didem Foss, Dingkang Wang, Duc Le, Dustin Holland, Edward Dowling, Eissa Jamil, Elaine Montgomery, Eleonora Presani, Emily Hahn, Emily Wood, Erik Brinkman, Esteban Arcaute, Evan Dunbar, Evan Smothers, Fei Sun, Felix Kreuk, Feng Tian, Firat Ozgenel, Francesco Caggioni, Francisco Guzmán, Frank Kanayet, Frank Seide, Gabriela Medina Florez, Gabriella Schwarz, Gada Badeer, Georgia Swee, Gil Halpern, Govind Thattai, Grant Herman, Grigory Sizov, Guangyi Zhang, Guna Lakshminarayanan, Hamid Shojanazeri, Han Zou, Hannah Wang, Hanwen Zha, Haroun Habeeb, Harrison Rudolph, Helen Suk, Henry Aspegren, Hunter Goldman, Ibrahim Damlaj, Igor Molybog, Igor Tufanov, Irina-elena Veliche, Itai Gat, Jake Weissman, James Geboski, James Kohli, Japhet Asher, Jean-baptiste Gaya, Jeff Marcus, Jeff Tang, Jennifer Chan, Jenny Zhen, Jeremy Reizenstein, Jeremy Teboul, Jessica Zhong, Jian Jin, Jingyi Yang, Joe Cummings, Jon Carvill, Jon Shepard, Jonathan Mcphie, Jonathan Torres, Josh Ginsburg, Junjie Wang, Kai Wu, Kam Hou U, Karan Saxena, Karthik Prasad, Kartikay Khandelwal, Katayoun Zand, Kathy Matosich, Kaushik Veeraraghavan, Kelly Michelena, Keqian Li, Kun Huang, Kunal Chawla, Kushal Lakhotia, Kyle Huang, Lailin Chen, Lakshya Garg, Lavender A, Leandro Silva, Lee Bell, Lei Zhang, Liangpeng Guo, Licheng Yu, Liron Moshkovich, Luca Wehrstedt, Madian Khabsa, Manav Avalani, Manish Bhatt, Maria Tsimpoukelli, Martynas Mankus, Matan Hasson, Matthew Lennie, Matthias Reso, Maxim Groshev, Maxim Naumov, Maya Lathi, Meghan Keneally, Michael L. Seltzer, Michal Valko, Michelle Restrepo, Mihir Patel, Mik Vyatskov, Mikayel Samvelyan, Mike Clark, Mike Macey, Mike Wang, Miquel Jubert Hermoso, Mo Metanat, Mohammad Rastegari, Munish Bansal, Nandhini Santhanam, Natascha Parks, Natasha White, Navyata Bawa, Nayan Singhal, Nick Egebo, Nicolas Usunier, Nikolay Pavlovich Laptev, Ning Dong, Ning Zhang, Norman Cheng, Oleg Chernoguz, Olivia Hart, Omkar Salpekar, Ozlem Kalinli, Parkin Kent, Parth Parekh, Paul Saab, Pavan Balaji, Pedro Rittner, Philip Bontrager, Pierre Roux, Piotr Dollar, Polina Zvyagina, Prashant Ratanchandani, Pritish Yuvraj, Qian Liang, Rachad Alao, Rachel Rodriguez, Rafi Ayub, Raghotham Murthy, Raghu Nayani, Rahul Mitra, Raymond Li, Rebekkah Hogan, Robin Battey, Rocky Wang, Rohan Maheswari, Russ Howes, Ruty Rinott, Sai Jayesh Bondu, Samyak Datta, Sara Chugh, Sara Hunt, Sargun Dhillon, Sasha Sidorov, Satadru Pan, Saurabh Verma, Seiji Yamamoto, Sharadh Ramaswamy, Shaun Lindsay, Sheng Feng, Shenghao Lin, Shengxin Cindy Zha, Shiva Shankar, Shuqiang Zhang, Sinong Wang, Sneha Agarwal, Soji Sajuyigbe, Soumith Chintala, Stephanie Max, Stephen Chen, Steve Kehoe, Steve Satterfield, Sudarshan Govindaprasad, Sumit Gupta, Sungmin Cho, Sunny Virk, Suraj Subramanian, Sy Choudhury, Sydney Goldman, Tal Remez, Tamar Glaser, Tamara Best, Thilo Kohler, Thomas Robinson, Tianhe Li, Tianjun Zhang, Tim Matthews, Timothy Chou, Tzook Shaked, Varun Vontimitta, Victoria Ajayi, Victoria Montanez, Vijai Mohan, Vinay Satish Kumar, Vishal Mangla, Vítor Albiero, Vlad Ionescu, Vlad Poenaru, Vlad Tiberiu Mihailescu, Vladimir Ivanov, Wei Li, Wenchen Wang, Wenwen Jiang, Wes Bouaziz, Will Constable, Xiaocheng Tang, Xiaofang Wang, Xiaojian Wu, Xiaolan Wang, Xide Xia, Xilun Wu, Xinbo Gao, Yanjun Chen, Ye Hu, Ye Jia, Ye Qi, Yenda Li, Yilin Zhang, Ying Zhang, Yossi Adi, Youngjin Nam, Yu Wang, Yuchen Hao, Yundi Qian, Yuzi He, Zach Rait, Zachary Devito, Zef Rosnbrick, Zhaoduo Wen, Zhenyu Yang, Zhiwei Zhao
- A Survey On Symbolic Knowledge Distillation Of Large Language Models Acharya Kamal, Velasquez Alvaro, Song Houbing Herbert
- Attnlrp: Attention-aware Layer-wise Relevance Propagation For Transformers Achtibat Reduan, Hatefi Sayed Mohammad Vakilzadeh, Dreyer Maximilian, Jain Aakriti, Wiegand Thomas, Lapuschkin Sebastian, Samek Wojciech
- Bridging The Bosphorus: Advancing Turkish Large Language Models Through Strategies For Low-resource Language Adaptation And Benchmarking Acikgoz Emre Can, Erdogan Mete, Yuret Deniz
- Fine-grained Contract NER Using Instruction Based Model Adibhatla Hiranmai Sri, Baswani Pavan, Shrivastava Manish
- Keyformer: KV Cache Reduction Through Key Tokens Selection For Efficient Generative Inference Adnan Muhammad, Arunkumar Akhil, Jain Gaurav, Nair Prashant J., Soloveychik Ilya, Kamath Purushotham
- Step-by-step Unmasking For Parameter-efficient Fine-tuning Of Large Language Models Agarwal Aradhye, Ramesh Suhas K, Sengupta Ayan, Chakraborty Tanmoy
- Many-shot In-context Learning Agarwal Rishabh, Singh Avi, Zhang Lei M., Bohnet Bernd, Rosias Luis, Chan Stephanie, Zhang Biao, Anand Ankesh, Abbas Zaheer, Nova Azade, Co-reyes John D., Chu Eric, Behbahani Feryal, Faust Aleksandra, Larochelle Hugo
- Enabling High-sparsity Foundational Llama Models With Efficient Pretraining And Deployment Agarwalla Abhinav, Gupta Abhay, Marques Alexandre, Pandit Shubhra, Goin Michael, Kurtic Eldar, Leong Kevin, Nguyen Tuan, Salem Mahmoud, Alistarh Dan, Lie Sean, Kurtz Mark
- Leapformer: Enabling Linear Transformers For Autoregressive And Simultaneous Tasks Via Learned Proportions Agostinelli Victor, Hong Sanghyun, Chen Lizhong
- Illuminate: A Novel Approach For Depression Detection With Explainable Analysis And Proactive Therapy Using Prompt Engineering Agrawal Aryan
- LM4OPT: Unveiling The Potential Of Large Language Models In Formulating Mathematical Optimization Problems Ahmed Tasnim, Choudhury Salimur
- Linguistic Intelligence In Large Language Models For Telecommunications Ahmed Tasnim, Piovesan Nicola, De Domenico Antonio, Choudhury Salimur
- Tuning Large Multimodal Models For Videos Using Reinforcement Learning From AI Feedback Ahn Daechul, Choi Yura, Yu Youngjae, Kang Dongyeop, Choi Jonghyun
- Sphinx: Sample Efficient Multilingual Instruction Fine-tuning Through N-shot Guided Prompting Ahuja Sanchit, Tanmay Kumar, Chauhan Hardik Hansrajbhai, Patra Barun, Aggarwal Kriti, Del Corro Luciano, Mitra Arindam, Dhamecha Tejas Indulal, Awadallah Ahmed, Choudhary Monojit, Chaudhary Vishrav, Sitaram Sunayana
- Yi: Open Foundation Models By 01.AI 01.AI, Young Alex, Chen Bei, Li Chao, Huang Chengen, Zhang Ge, Zhang Guanwei, Li Heng, Zhu Jiangcheng, Chen Jianqun, Chang Jing, Yu Kaidong, Liu Peng, Liu Qiang, Yue Shawn, Yang Senbin, Yang Shiming, Yu Tao, Xie Wen, Huang Wenhao, Hu Xiaohui, Ren Xiaoyi, Niu Xinyao, Nie Pengcheng, Xu Yuchi, Liu Yudong, Wang Yue, Cai Yuxuan, Gu Zhenyu, Liu Zhiyuan, Dai Zonghong
- Enhancing Pre-trained Generative Language Models With Question Attended Span Extraction On Machine Reading Comprehension Ai Lin, Hui Zheng, Liu Zizhou, Hirschberg Julia
- QASE Enhanced Plms: Improved Control In Text Generation For MRC Ai Lin, Hui Zheng, Liu Zizhou, Hirschberg Julia
- Semantically Diverse Language Generation For Uncertainty Estimation In Language Models Aichberger Lukas, Schweighofer Kajetan, Ielanskyi Mykyta, Hochreiter Sepp
- Plug And Play With Prompts: A Prompt Tuning Approach For Controlling Text Generation Ajwani Rohan Deepak, Zhu Zining, Rose Jonathan, Rudzicz Frank
- FACTS About Building Retrieval Augmented Generation-based Chatbots Akkiraju Rama, Xu Anbang, Bora Deepak, Yu Tan, An Lu, Seth Vishal, Shukla Aaditya, Gundecha Pritam, Mehta Hridhay, Jha Ashwin, Raj Prithvi, Balasubramanian Abhinav, Maram Murali, Muthusamy Guru, Annepally Shivakesh Reddy, Knowles Sidney, Du Min, Burnett Nick, Javiya Sean, Marannan Ashok, Kumari Mamta, Jha Surbhi, Dereszenski Ethan, Chakraborty Anupam, Ranjan Subhash, Terfai Amina, Surya Anoop, Mercer Tracey, Thanigachalam Vinodh Kumar, Bar Tamar, Krishnan Sanjana, Kilaru Samy, Jaksic Jasmine, Algarici Nave, Liberman Jacob, Conway Joey, Nayyar Sonu, Boitano Justin
- In-context Language Learning: Architectures And Algorithms Akyürek Ekin, Wang Bailin, Kim Yoon, Andreas Jacob
- History Of Generative Artificial Intelligence (AI) Chatbots: Past, Present, And Future Development Al-amin Md., Ali Mohammad Shazed, Salam Abdus, Khan Arif, Ali Ashraf, Ullah Ahsan, Alam Md Nur, Chowdhury Shamsul Kabir
- Enhancing Textbook Question Answering Task With Large Language Models And Retrieval Augmented Generation Alawwad Hessa Abdulrahman, Alhothali Areej, Naseem Usman, Alkhathlan Ali, Jamal Amani
- Mitigating Catastrophic Forgetting In Language Transfer Via Model Merging Alexandrov Anton, Raychev Veselin, Müller Mark Niklas, Zhang Ce, Vechev Martin, Toutanova Kristina
- Should We Fine-tune Or RAG? Evaluating Different Techniques To Adapt Llms For Dialogue Alghisi Simone, Rizzoli Massimo, Roccabruna Gabriel, Mousavi Seyed Mahed, Riccardi Giuseppe
- A Survey Of Large Language Models For European Languages Ali Wazir, Pyysalo Sampo
- D-NLP At Semeval-2024 Task 2: Evaluating Clinical Inference Capabilities Of Large Language Models Altinok Duygu
- Tower: An Open Multilingual Large Language Model For Translation-related Tasks Alves Duarte M., Pombal José, Guerreiro Nuno M., Martins Pedro H., Alves João, Farajian Amin, Peters Ben, Rei Ricardo, Fernandes Patrick, Agrawal Sweta, Colombo Pierre, De Souza José G. C., Martins André F. T.
- Syntaxshap: Syntax-aware Explainability Method For Text Generation Amara Kenza, Sevastjanova Rita, El-assady Mennatallah
- Evaluating The Efficacy Of Foundational Models: Advancing Benchmarking Practices To Enhance Fine-tuning Decision-making Amujo Oluyemi Enoch, Yang Shanchieh Jay
- Training-free Long-context Scaling Of Large Language Models An Chenxin, Huang Fei, Zhang Jun, Gong Shansan, Qiu Xipeng, Zhou Chang, Kong Lingpeng
- Funaudiollm: Voice Understanding And Generation Foundation Models For Natural Interaction Between Humans And Llms An Keyu, Chen Qian, Deng Chong, Du Zhihao, Gao Changfeng, Gao Zhifu, Gu Yue, He Ting, Hu Hangrui, Hu Kai, Ji Shengpeng, Li Yabin, Li Zerui, Lu Heng, Luo Haoneng, Lv Xiang, Ma Bin, Ma Ziyang, Ni Chongjia, Song Changhe, Shi Jiaqi, Shi Xian, Wang Hao, Wang Wen, Wang Yuxuan, Xiao Zhangyu, Yan Zhijie, Yang Yexin, Zhang Bin, Zhang Qinglin, Zhang Shiliang, Zhao Nan, Zheng Siqi
- Investigating The Robustness Of Llms On Math Word Problems Anantheswaran Ujjwala, Gupta Himanshu, Scaria Kevin, Verma Shreyas, Baral Chitta, Mishra Swaroop
- Does Refusal Training In Llms Generalize To The Past Tense? Andriushchenko Maksym, Flammarion Nicolas
- Tensor Train Low-rank Approximation (tt-lora): Democratizing AI With Accelerated Llms Anjum Afia, Eren Maksim E., Boureima Ismael, Alexandrov Boian, Bhattarai Manish
- Scaling Sparse Fine-tuning To Large Language Models Ansell Alan, Vulić Ivan, Sterz Hannah, Korhonen Anna, Ponti Edoardo M.
- Generalization V.s. Memorization: Tracing Language Models' Capabilities Back To Pretraining Data Antoniades Antonis, Wang Xinyi, Elazar Yanai, Amayuelas Alfonso, Albalak Alon, Zhang Kexun, Wang William Yang
- A Framework For Fine-tuning Llms Using Heterogeneous Feedback Aponte Ryan, Rossi Ryan A., Guo Shunan, Dernoncourt Franck, Yu Tong, Chen Xiang, Mitra Subrata, Lipka Nedim
- An Assessment On Comprehending Mental Health Through Large Language Models Arcan Mihael, Niland David-paul, Delahunty Fionn
- Forklift: An Extensible Neural Lifter Armengol-estapé Jordi, Rocha Rodrigo C. O., Woodruff Jackson, Minervini Pasquale, O'boyle Michael F. P.
- SPAFIT: Stratified Progressive Adaptation Fine-tuning For Pre-trained Large Language Models Arora Samir, Wang Liangliang
- Exploring The Robustness Of Task-oriented Dialogue Systems For Colloquial German Varieties Artemova Ekaterina, Blaschke Verena, Plank Barbara
- What Makes Language Models Good-enough? Asami Daiki, Sugawara Saku
- Slicegpt: Compress Large Language Models By Deleting Rows And Columns Ashkboos Saleh, Croci Maximilian L., Nascimento Marcelo Gennari Do, Hoefler Torsten, Hensman James
- Robustsentembed: Robust Sentence Embeddings Using Adversarial Self-supervised Contrastive Learning Asl Javad Rafiei, Panzade Prajwal, Blanco Eduardo, Takabi Daniel, Cai Zhipeng
- Can Open-source Llms Compete With Commercial Models? Exploring The Few-shot Performance Of Current GPT Models In Biomedical Tasks Ateia Samy, Kruschwitz Udo
- Transformer Alignment In Large Language Models Aubry Murdock, Meng Haoming, Sugolov Anton, Papyan Vardan
- Walia-llm: Enhancing Amharic-llama By Integrating Task-specific And Generative Datasets Azime Israel Abebe, Tonja Atnafu Lambebo, Belay Tadesse Destaw, Fuge Mitiku Yohannes, Wassie Aman Kassahun, Jada Eyasu Shiferaw, Chanie Yonas, Sewunetie Walelign Tewabe, Yimam Seid Muhie
- Lamda: Large Model Fine-tuning Via Spectrally Decomposed Low-dimensional Adaptation Azizi Seyedarmin, Kundu Souvik, Pedram Massoud
- The Pitfalls Of Next-token Prediction Bachmann Gregor, Nagarajan Vaishnavh
- Masked Mixers For Language Generation And Retrieval Badger Benjamin L.
- Efficient Model-agnostic Alignment Via Bayesian Persuasion Bai Fengshuo, Wang Mingzhi, Zhang Zhaowei, Chen Boyuan, Xu Yinda, Wen Ying, Yang Yaodong
- Federated Fine-tuning Of Large Language Models Under Heterogeneous Tasks And Client Resources Bai Jiamu, Chen Daoyuan, Qian Bingchen, Yao Liuyi, Li Yaliang
- Citrus: Chunked Instruction-aware State Eviction For Long Sequence Modeling Bai Yu, Zou Xiyuan, Huang Heyan, Chen Sanxing, Rondeau Marc-antoine, Gao Yang, Cheung Jackie Chi Kit
- COIG-CQIA: Quality Is All You Need For Chinese Instruction Fine-tuning Bai Yuelin, Du Xinrun, Liang Yiming, Jin Yonggang, Liu Ziqiang, Zhou Junting, Zheng Tianyu, Zhang Xincheng, Ma Nuo, Wang Zekun, Yuan Ruibin, Wu Haihong, Lin Hongquan, Huang Wenhao, Zhang Jiajun, Chen Wenhu, Lin Chenghua, Fu Jie, Yang Min, Ni Shiwen, Zhang Ge
- Longalign: A Recipe For Long Context Alignment Of Large Language Models Bai Yushi, Lv Xin, Zhang Jiajie, He Yuze, Qi Ji, Hou Lei, Tang Jie, Dong Yuxiao, Li Juanzi
- Longwriter: Unleashing 10,000+ Word Generation From Long Context Llms Bai Yushi, Zhang Jiajie, Lv Xin, Zheng Linzhi, Zhu Siqi, Hou Lei, Dong Yuxiao, Tang Jie, Li Juanzi
- Downstream Bias Mitigation Is All You Need Baksi Arkadeep, Singh Rahul, Joshi Tarun
- RAG Vs Fine-tuning: Pipelines, Tradeoffs, And A Case Study On Agriculture Balaguer Angels, Benara Vinamra, Cunha Renato Luiz De Freitas, Filho Roberto De M. Estevão, Hendry Todd, Holstein Daniel, Marsman Jennifer, Mecklenburg Nick, Malvar Sara, Nunes Leonardo O., Padilha Rafael, Sharp Morris, Silva Bruno, Sharma Swati, Aski Vijay, Chandra Ranveer
- CYGENT: A Cybersecurity Conversational Agent With Log Summarization Powered By GPT-3 Balasubramanian Prasasthy, Seby Justin, Kostakos Panos
- LAMP: A Language Model On The Map Balsebre Pasquale, Huang Weiming, Cong Gao
- Llms With Chain-of-thought Are Non-causal Reasoners Bao Guangsheng, Zhang Hongbo, Yang Linyi, Wang Cunxiang, Zhang Yue
- Fine-tuning Or Fine-failing? Debunking Performance Myths In Large Language Models Barnett Scott, Brannelly Zac, Kurniawan Stefanus, Wong Sheng
- Slicerchat: Building A Local Chatbot For 3D Slicer Barr Colton
- From 'Showgirls' To 'Performers': Fine-tuning With Gender-inclusive Language For Bias Reduction In Llms Bartl Marion, Leavy Susan
- A Comparative Study Of DSL Code Generation: Fine-tuning Vs. Optimized Retrieval Augmentation Bassamzadeh Nastaran, Methani Chhaya
- Understanding Information Storage And Transfer In Multi-modal Large Language Models Basu Samyadeep, Grayson Martin, Morrison Cecily, Nushi Besmira, Feizi Soheil, Massiceti Daniela
- Xlstm: Extended Long Short-term Memory Beck Maximilian, Pöppel Korbinian, Spanring Markus, Auer Andreas, Prudnikova Oleksandra, Kopp Michael, Klambauer Günter, Brandstetter Johannes, Hochreiter Sepp
- Llm2vec: Large Language Models Are Secretly Powerful Text Encoders Behnamghader Parishad, Adlakha Vaibhav, Mosbach Marius, Bahdanau Dzmitry, Chapados Nicolas, Reddy Siva
- Internalinspector \(I^2\): Robust Confidence Estimation In Llms Through Internal States Beigi Mohammad, Shen Ying, Yang Runing, Lin Zihao, Wang Qifan, Mohan Ankith, He Jianfeng, Jin Ming, Lu Chang-tien, Huang Lifu
- Think Big, Generate Quick: Llm-to-slm For Fast Autoregressive Decoding Bergner Benjamin, Skliar Andrii, Royer Amelie, Blankevoort Tijmen, Asano Yuki, Bejnordi Babak Ehteshami
- Lotr: Low Tensor Rank Weight Adaptation Bershatsky Daniel, Cherniuk Daria, Daulbaev Talgat, Mikhalev Aleksandr, Oseledets Ivan
- A Systematic Analysis Of Large Language Models As Soft Reasoners: The Case Of Syllogistic Inferences Bertolazzi Leonardo, Gatt Albert, Bernardi Raffaella
- Multi-head RAG: Solving Multi-aspect Problems With Llms Besta Maciej, Kubicek Ales, Niggli Roman, Gerstenberger Robert, Weitzendorf Lucas, Chi Mingyuan, Iff Patrick, Gajda Joanna, Nyczyk Piotr, Müller Jürgen, Niewiadomski Hubert, Chrapek Marcin, Podstawski Michał, Hoefler Torsten
- Enhancing Event Reasoning In Large Language Models Through Instruction Fine-tuning With Semantic Causal Graphs Bethany Mazal, Bethany Emet, Wherry Brandon, Chiang Cho-yu, Vishwamitra Nishant, Rios Anthony, Najafirad Peyman
- Soft Prompting For Unlearning In Large Language Models Bhaila Karuna, Van Minh-hao, Wu Xintao
- Self-amplify: Improving Small Language Models With Self Post Hoc Explanations Bhan Milan, Vittaut Jean-noel, Chesneau Nicolas, Lesot Marie-jeanne
- Language Models Are Homer Simpson! Safety Re-alignment Of Fine-tuned Language Models Through Task Arithmetic Bhardwaj Rishabh, Anh Do Duc, Poria Soujanya
- An Approach To Build Zero-shot Slot-filling System For Industry-grade Conversational Assistants Bhargav G P Shrivatsa, Neelam Sumit, Sharma Udit, Ikbal Shajith, Sreedhar Dheeraj, Karanam Hima, Joshi Sachindra, Dhoolia Pankaj, Garg Dinesh, Croutwater Kyle, Qi Haode, Wayne Eric, Murdock J William
- Fintral: A Family Of GPT-4 Level Multimodal Financial Large Language Models Bhatia Gagan, Nagoudi El Moatez Billah, Cavusoglu Hasan, Abdul-mageed Muhammad
- Zero-shot Llm-guided Counterfactual Generation For Text Bhattacharjee Amrita, Moraffah Raha, Garland Joshua, Liu Huan
- Enhancing Code Translation In Language Models With Few-shot Learning Via Retrieval-augmented Generation Bhattarai Manish, Santos Javier E., Jones Shawn, Biswas Ayan, Alexandrov Boian, O'malley Daniel
- Speculative Streaming: Fast LLM Inference Without Auxiliary Models Bhendawade Nikhil, Belousova Irina, Fu Qichen, Mason Henry, Rastegari Mohammad, Najibi Mahyar
- Decoding By Contrasting Knowledge: Enhancing Llms' Confidence On Edited Facts Bi Baolong, Liu Shenghua, Mei Lingrui, Wang Yiwei, Ji Pengliang, Cheng Xueqi
- Supervisory Prompt Training Billa Jean Ghislain, Oh Min, Du Liang
- Lipost: Improved Content Understanding With Effective Use Of Multi-task Contrastive Learning Bindal Akanksha, Ramanujam Sudarshan, Golland Dave, Hazen Tj, Jiang Tina, Zhang Fengyu, Yan Peng
- GRAM: Global Reasoning For Multi-page VQA Blau Tsachi, Fogel Sharon, Ronen Roi, Golts Alona, Ganz Roy, Avraham Elad Ben, Aberdam Aviad, Tsiper Shahar, Litman Ron
- Exploring And Benchmarking The Planning Capabilities Of Large Language Models Bohnet Bernd, Nova Azade, Parisi Aaron T, Swersky Kevin, Goshvadi Katayoon, Dai Hanjun, Schuurmans Dale, Fiedel Noah, Sedghi Hanie
- Long-span Question-answering: Automatic Question Generation And Qa-system Ranking Via Side-by-side Evaluation Bohnet Bernd, Swersky Kevin, Liu Rosanne, Awasthi Pranjal, Nova Azade, Snaider Javier, Sedghi Hanie, Parisi Aaron T, Collins Michael, Lazaridou Angeliki, Firat Orhan, Fiedel Noah
- Workarena++: Towards Compositional Planning And Reasoning-based Common Knowledge Work Tasks Boisvert Léo, Thakkar Megh, Gasse Maxime, Caccia Massimo, De Chezelles Thibault Le Sellier, Cappart Quentin, Chapados Nicolas, Lacoste Alexandre, Drouin Alexandre
- Low-rank Quantization-aware Training For Llms Bondarenko Yelysei, Del Chiaro Riccardo, Nagel Markus
- Recurrentgemma: Moving Past Transformers For Efficient Open Language Models Botev Aleksandar, De Soham, Smith Samuel L, Fernando Anushan, Muraru George-cristian, Haroun Ruba, Berrada Leonard, Pascanu Razvan, Sessa Pier Giuseppe, Dadashi Robert, Hussenot Léonard, Ferret Johan, Girgin Sertan, Bachem Olivier, Andreev Alek, Kenealy Kathleen, Mesnard Thomas, Hardin Cassidy, Bhupatiraju Surya, Pathak Shreya, Sifre Laurent, Rivière Morgane, Kale Mihir Sanjay, Love Juliette, Tafti Pouya, Joulin Armand, Fiedel Noah, Senter Evan, Chen Yutian, Srinivasan Srivatsan, Desjardins Guillaume, Budden David, Doucet Arnaud, Vikram Sharad, Paszke Adam, Gale Trevor, Borgeaud Sebastian, Chen Charlie, Brock Andy, Paterson Antonia, Brennan Jenny, Risdal Meg, Gundluru Raj, Devanathan Nesh, Mooney Paul, Chauhan Nilay, Culliton Phil, Martins Luiz Gustavo, Bandy Elisa, Huntsperger David, Cameron Glenn, Zucker Arthur, Warkentin Tris, Peran Ludovic, Giang Minh, Ghahramani Zoubin, Farabet Clément, Kavukcuoglu Koray, Hassabis Demis, Hadsell Raia, Teh Yee Whye, De Freitas Nando
- Transformers Meet Neural Algorithmic Reasoners Bounsi Wilfried, Ibarz Borja, Dudzik Andrew, Hamrick Jessica B., Markeeva Larisa, Vitvitskyi Alex, Pascanu Razvan, Veličković Petar
- Retrieving Examples From Memory For Retrieval Augmented Neural Machine Translation: A Systematic Comparison Bouthors Maxime, Crego Josep, Yvon Francois
- Improving Sequential Recommendations With Llms Boz Artun, Zorgdrager Wouter, Kotti Zoe, Harte Jesse, Louridas Panos, Jannach Dietmar, Fragkoulis Marios
- Embedding Large Language Models Into Extended Reality: Opportunities And Challenges For Inclusion, Engagement, And Privacy Bozkir Efe, Özdel Süleyman, Lau Ka Hei Carrie, Wang Mengdi, Gao Hong, Kasneci Enkelejda
- Fast And Effective Weight Update For Pruned Large Language Models Boža Vladimír
- Reducing Transformer Key-value Cache Size With Cross-layer Attention Brandon William, Mishra Mayank, Nrusimha Aniruddha, Panda Rameswar, Ragan-Kelley Jonathan
- LOCOST: State-space Models For Long Document Abstractive Summarization Bronnec Florian Le, Duong Song, Ravaut Mathieu, Allauzen Alexandre, Chen Nancy F., Guigue Vincent, Lumbreras Alberto, Soulier Laure, Gallinari Patrick
- Fine-tuned 'Small' Llms (Still) Significantly Outperform Zero-shot Generative AI Models In Text Classification Bucher Martin Juan José, Martini Marco
- Pheme: Efficient And Conversational Speech Generation Budzianowski Paweł, Sereda Taras, Cichy Tomasz, Vulić Ivan
- Roguegpt: Dis-ethical Tuning Transforms Chatgpt4 Into A Rogue AI In 158 Words Buscemi Alessio, Proverbio Daniele
- Aligning Large Language Models With Counterfactual DPO Butcher Bradley
- ARES: Alternating Reinforcement Learning And Supervised Fine-tuning For Enhanced Multi-modal Chain-of-thought Reasoning Through Diverse AI Feedback Byun Ju-seung, Chun Jiyun, Kil Jihyung, Perrault Andrew
- Olora: Orthonormal Low-rank Adaptation Of Large Language Models Büyükakyüz Kerim
- In-context Learning And Fine-tuning GPT For Argument Mining Cabessa Jérémie, Hernault Hugo, Mushtaq Umer
- Cendol: Open Instruction-tuned Generative Large Language Models For Indonesian Languages Cahyawijaya Samuel, Lovenia Holy, Koto Fajri, Putri Rifki Afina, Dave Emmanuel, Lee Jhonson, Shadieq Nuur, Cenggoro Wawan, Akbar Salsabil Maulana, Mahendra Muhammad Ihza, Putri Dea Annisayanti, Wilie Bryan, Winata Genta Indra, Aji Alham Fikri, Purwarianti Ayu, Fung Pascale
- Low-cost Generation And Evaluation Of Dictionary Example Sentences Cai Bill, Ng Clarence Boon Liang, Tan Daniel, Hotama Shelvia
- CERET: Cost-effective Extrinsic Refinement For Text Generation Cai Jason, Su Hang, Sunkara Monica, Shalyminov Igor, Mansour Saab
- Flextron: Many-in-one Flexible Large Language Model Cai Ruisi, Muralidharan Saurav, Heinrich Greg, Yin Hongxu, Wang Zhangyang, Kautz Jan, Molchanov Pavlo
- Lococo: Dropping In Convolutions For Long Context Compression Cai Ruisi, Tian Yuandong, Wang Zhangyang, Chen Beidi
- Medusa: Simple LLM Inference Acceleration Framework With Multiple Decoding Heads Cai Tianle, Li Yuhong, Geng Zhengyang, Peng Hongwu, Lee Jason D., Chen Deming, Dao Tri
- Internlm2 Technical Report Cai Zheng, Cao Maosong, Chen Haojiong, Chen Kai, Chen Keyu, Chen Xin, Chen Xun, Chen Zehui, Chen Zhi, Chu Pei, Dong Xiaoyi, Duan Haodong, Fan Qi, Fei Zhaoye, Gao Yang, Ge Jiaye, Gu Chenya, Gu Yuzhe, Gui Tao, Guo Aijia, Guo Qipeng, He Conghui, Hu Yingfan, Huang Ting, Jiang Tao, Jiao Penglong, Jin Zhenjiang, Lei Zhikai, Li Jiaxing, Li Jingwen, Li Linyang, Li Shuaibin, Li Wei, Li Yining, Liu Hongwei, Liu Jiangning, Hong Jiawei, Liu Kaiwen, Liu Kuikun, Liu Xiaoran, Lv Chengqi, Lv Haijun, Lv Kai, Ma Li, Ma Runyuan, Ma Zerun, Ning Wenchang, Ouyang Linke, Qiu Jiantao, Qu Yuan, Shang Fukai, Shao Yunfan, Song Demin, Song Zifan, Sui Zhihao, Sun Peng, Sun Yu, Tang Huanze, Wang Bin, Wang Guoteng, Wang Jiaqi, Wang Jiayu, Wang Rui, Wang Yudong, Wang Ziyi, Wei Xingjian, Weng Qizhen, Wu Fan, Xiong Yingtong, Xu Chao, Xu Ruiliang, Yan Hang, Yan Yirong, Yang Xiaogui, Ye Haochen, Ying Huaiyuan, Yu Jia, Yu Jing, Zang Yuhang, Zhang Chuyu, Zhang Li, Zhang Pan, Zhang Peng, Zhang Ruijie, Zhang Shuo, Zhang Songyang, Zhang Wenjian, Zhang Wenwei, Zhang Xingcheng, Zhang Xinyue, Zhao Hui, Zhao Qian, Zhao Xiaomeng, Zhou Fengzhe, Zhou Zaida, Zhuo Jingming, Zou Yicheng, Qiu Xipeng, Qiao Yu, Lin Dahua
- Towards Logically Consistent Language Models Via Probabilistic Reasoning Calanzone Diego, Teso Stefano, Vergari Antonio
- PRESTO: Progressive Pretraining Enhances Synthetic Chemistry Outcomes Cao He, Shao Yanjun, Liu Zhiyuan, Liu Zijing, Tang Xiangru, Yao Yuan, Li Yu
- MADTP: Multimodal Alignment-guided Dynamic Token Pruning For Accelerating Vision-language Transformer Cao Jianjian, Ye Peng, Li Shengze, Yu Chong, Tang Yansong, Lu Jiwen, Chen Tao
- Genrec: Generative Sequential Recommendation With Large Language Models Cao Panfeng, Lio Pietro
- Personalized Steering Of Large Language Models: Versatile Steering Vectors Through Bi-directional Preference Optimization Cao Yuanpu, Zhang Tianrong, Cao Bochuan, Yin Ziyi, Lin Lu, Ma Fenglong, Chen Jinghui
- Aligning Large Language Models With Recommendation Knowledge Cao Yuwei, Mehta Nikhil, Yi Xinyang, Keshavan Raghunandan, Heldt Lukasz, Hong Lichan, Chi Ed H., Sathiamoorthy Maheswaran
- Captioning Visualizations With Large Language Models (CVLLM): A Tutorial Carenini Giuseppe, Johnson Jordon, Salamatian Ali
- A Review Of Multi-modal Large Language And Vision Models Carolan Kilian, Fennelly Laura, Smeaton Alan F.
- Suppressing Pink Elephants With Direct Principle Feedback Castricato Louis, Lile Nathan, Anand Suraj, Schoelkopf Hailey, Verma Siddharth, Biderman Stella
- Smileyllama: Modifying Large Language Models For Directed Chemical Space Exploration Cavanagh Joseph M., Sun Kunyang, Gritsevskiy Andrew, Bagni Dorian, Bannister Thomas D., Head-gordon Teresa
- Towards Robust And Cost-efficient Knowledge Unlearning For Large Language Models Cha Sungmin, Cho Sungjun, Hwang Dasol, Lee Moontae
- Dual Modalities Of Text: Visual And Textual Generative Pre-training Chai Yekun, Liu Qingyi, Xiao Jingwu, Wang Shuohuan, Sun Yu, Wu Hua
- Cross-modal Safety Alignment: Is Textual Unlearning All You Need? Chakraborty Trishna, Shayegani Erfan, Cai Zikui, Abu-ghazaleh Nael, Asif M. Salman, Dong Yue, Roy-chowdhury Amit K., Song Chengyu
- Dense Reward For Free In Reinforcement Learning From Human Feedback Chan Alex J., Sun Hao, Holt Samuel, Van Der Schaar Mihaela
- RQ-RAG: Learning To Refine Queries For Retrieval Augmented Generation Chan Chi-min, Xu Chunpu, Yuan Ruibin, Luo Hongyin, Xue Wei, Guo Yike, Fu Jie
- Integrating Emotional And Linguistic Models For Ethical Compliance In Large Language Models Chang Edward Y.
- How Do Large Language Models Acquire Factual Knowledge During Pretraining? Chang Hoyeon, Park Jinho, Ye Seonghyeon, Yang Sohee, Seo Youngkyung, Chang Du-seong, Seo Minjoon
- Speechprompt: Prompting Speech Language Models For Speech Processing Tasks Chang Kai-wei, Wu Haibin, Wang Yu-kai, Wu Yuan-kuei, Shen Hua, Tseng Wei-cheng, Kang Iu-thing, Li Shang-wen, Lee Hung-yi
- Goldfish: Monolingual Language Models For 350 Languages Chang Tyler A., Arnett Catherine, Tu Zhuowen, Bergen Benjamin K.
- Bias-aware Low-rank Adaptation: Mitigating Catastrophic Inheritance Of Large Language Models Chang Yupeng, Chang Yi, Wu Yuan
- When Large Language Models Meet Evolutionary Algorithms Wang Chao, Zhao Jiaxuan, Jiao Licheng, Li Lingling, Liu Fang, Yang Shuyuan
- Notellm: A Retrievable Large Language Model For Note Recommendation Zhang Chao, Wu Shiwei, Zhang Haoxin, Xu Tong, Gao Yan, Hu Yao, Wu Di, Chen Enhong
- Language Models Can Exploit Cross-task In-context Learning For Data-scarce Novel Tasks Chatterjee Anwoy, Tanwar Eshaan, Dutta Subhabrata, Chakraborty Tanmoy
- Phantom: General Trigger Attacks On Retrieval Augmented Language Generation Chaudhari Harsh, Severi Giorgio, Abascal John, Jagielski Matthew, Choquette-choo Christopher A., Nasr Milad, Nita-rotaru Cristina, Oprea Alina
- Generalizing Conversational Dense Retrieval Via Llm-cognition Data Augmentation Chen Haonan, Dou Zhicheng, Mao Kelong, Liu Jiongnan, Zhao Ziliang
- Efficiency In Focus: Layernorm As A Catalyst For Fine-tuning Medical Visual Language Pre-trained Models Chen Jiawei, Yang Dingkang, Jiang Yue, Li Mingcheng, Wei Jinjie, Hou Xiaolu, Zhang Lihua
- Control-dag: Constrained Decoding For Non-autoregressive Directed Acyclic T5 Using Weighted Finite State Automata Chen Jinghong, Lin Weizhe, Mei Jingbiao, Byrne Bill
- Automated Data Curation For Robust Language Model Fine-tuning Chen Jiuhai, Mueller Jonas
- Pixart-\(\Sigma\): Weak-to-strong Training Of Diffusion Transformer For 4K Text-to-image Generation Chen Junsong, Ge Chongjian, Xie Enze, Wu Yue, Yao Lewei, Ren Xiaozhe, Wang Zhongdao, Luo Ping, Lu Huchuan, Li Zhenguo
- Instructioncp: A Fast Approach To Transfer Large Language Models Into Target Language Chen Kuang-ming, Lee Hung-yi
- OMPGPT: A Generative Pre-trained Transformer Model For Openmp Chen Le, Bhattacharjee Arijit, Ahmed Nesreen, Hasabnis Niranjan, Oren Gal, Vo Vy, Jannesari Ali
- How Truncating Weights Improves Reasoning In Language Models Chen Lei, Bruna Joan, Bietti Alberto
- Identifying Query-relevant Neurons In Large Language Models For Long-form Texts Chen Lihu, Dejl Adam, Toni Francesca
- Bailong: Bilingual Transfer Learning Based On Qlora And Zip-tie Embedding Chen Lung-chuan, Li Zong-ru
- Quantifying And Mitigating Unimodal Biases In Multimodal Large Language Models: A Causal Perspective Chen Meiqi, Cao Yixin, Zhang Yan, Lu Chaochao
- Efficientqat: Efficient Quantization-aware Training For Large Language Models Chen Mengzhao, Shao Wenqi, Xu Peng, Wang Jiahao, Gao Peng, Zhang Kaipeng, Qiao Yu, Luo Ping
- Jumpcoder: Go Beyond Autoregressive Coder Via Online Modification Chen Mouxiang, Tian Hao, Liu Zhongxin, Ren Xiaoxue, Sun Jianling
- WSI-VQA: Interpreting Whole Slide Images By Generative Visual Question Answering Chen Pingyi, Zhu Chenglu, Zheng Sunyi, Li Honglin, Yang Lin
- Llava-mole: Sparse Mixture Of Lora Experts For Mitigating Data Conflicts In Instruction Finetuning Mllms Chen Shaoxiang, Jie Zequn, Ma Lin
- Advancing Tool-augmented Large Language Models: Integrating Insights From Errors In Inference Trees Chen Sijia, Wang Yibo, Wu Yi-feng, Chen Qing-guo, Xu Zhao, Luo Weihua, Zhang Kaifu, Zhang Lijun
- Struq: Defending Against Prompt Injection With Structured Queries Chen Sizhe, Piet Julien, Sitawarin Chawin, Wagner David
- From Yes-men To Truth-tellers: Addressing Sycophancy In Large Language Models With Pinpoint Tuning Chen Wei, Huang Zhen, Xie Liang, Lin Binbin, Li Houqiang, Lu Le, Tian Xinmei, Cai Deng, Zhang Yonggang, Wan Wenxiao, Shen Xu, Ye Jieping
- Octopus: On-device Language Model For Function Calling Of Software Apis Chen Wei, Li Zhiyuan, Ma Mingyuan
- SATO: Stable Text-to-motion Framework Chen Wenshuo, Xiao Hongru, Zhang Erhang, Hu Lijie, Wang Lei, Liu Mengyuan, Chen Chen
- Icleval: Evaluating In-context Learning Ability Of Large Language Models Chen Wentong, Lin Yankai, Zhou Zhenhao, Huang Hongyun, Jia Yantao, Cao Zhao, Wen Ji-rong
- Cause-aware Empathetic Response Generation Via Chain-of-thought Fine-tuning Chen Xinhao, Yang Chong, Lan Man, Cai Li, Chen Yang, Hu Tu, Zhuang Xinlin, Zhou Aimin
- Can We Rely On LLM Agents To Draft Long-horizon Plans? Let's Take Travelplanner As An Example Chen Yanan, Pesaranghader Ali, Sadhu Tanmana, Yi Dong Hoon
- Extroversion Or Introversion? Controlling The Personality Of Your Large Language Models Chen Yanquan, Wu Zhen, Guo Junjie, Huang Shujian, Dai Xinyu
- Recoverable Compression: A Multimodal Vision Token Recovery Mechanism Guided By Text Information Chen Yi, Xu Jian, Zhang Xu-yao, Liu Wen-zhuo, Liu Yang-yang, Liu Cheng-lin
- See What Llms Cannot Answer: A Self-challenge Framework For Uncovering LLM Weaknesses Chen Yulong, Liu Yang, Yan Jianhao, Bai Xuefeng, Zhong Ming, Yang Yinghao, Yang Ziyi, Zhu Chenguang, Zhang Yue
- Mofo: Momentum-filtered Optimizer For Mitigating Forgetting In LLM Fine-tuning Chen Yupeng, Wang Senmiao, Lin Zhihang, Qin Zeyu, Zhang Yushun, Ding Tian, Sun Ruoyu
- Autoprm: Automating Procedural Supervision For Multi-step Reasoning Via Controllable Question Decomposition Chen Zhaorun, Zhao Zhuokai, Zhu Zhihong, Zhang Ruiqi, Li Xiang, Raj Bhiksha, Yao Huaxiu
- Improving Retrieval Augmented Open-domain Question-answering With Vectorized Contexts Chen Zhuo, Wang Xinyu, Jiang Yong, Xie Pengjun, Huang Fei, Tu Kewei
- An Empirical Study Of Data Ability Boundary In Llms' Math Reasoning Chen Zui, Chen Yezeng, Han Jiaqi, Huang Zhijie, Qi Ji, Zhou Yi
- "In Dialogues We Learn": Towards Personalized Dialogue Without Pre-defined Profiles Through In-dialogue Learning Cheng Chuanqi, Tu Quan, Wu Wei, Shang Shuo, Mao Cunli, Yu Zhengtao, Yan Rui
- Emergence Of A High-dimensional Abstraction Phase In Language Transformers Cheng Emily, Doimo Diego, Kervadec Corentin, Macocco Iuri, Yu Jade, Laio Alessandro, Baroni Marco
- Unveiling Typographic Deceptions: Insights Of The Typographic Vulnerability In Large Vision-language Model Cheng Hao, Xiao Erjia, Gu Jindong, Yang Le, Duan Jinhao, Zhang Jize, Cao Jiahang, Xu Kaidi, Xu Renjing
- Social Debiasing For Fair Multi-modal Llms Cheng Harry, Guo Yangyang, Guo Qingpei, Yang Ming, Gan Tian, Nie Liqiang
- Trojanrag: Retrieval-augmented Generation Can Be Backdoor Driver In Large Language Models Cheng Pengzhou, Ding Yidong, Ju Tianjie, Wu Zongru, Du Wei, Yi Ping, Zhang Zhuosheng, Liu Gongshen
- Small Agent Can Also Rock! Empowering Small Language Models As Hallucination Detector Cheng Xiaoxue, Li Junyi, Zhao Wayne Xin, Zhang Hongzhi, Zhang Fuzheng, Zhang Di, Gai Kun, Wen Ji-rong
- ANOLE: An Open, Autoregressive, Native Large Multimodal Models For Interleaved Image-text Generation Chern Ethan, Su Jiadi, Ma Yan, Liu Pengfei
- Exploring Quantization For Efficient Pre-training Of Transformer Language Models Chitsaz Kamran, Fournier Quentin, Mordido Gonçalo, Chandar Sarath
- Transformer Explainer: Interactive Learning Of Text-generative Models Cho Aeree, Kim Grace C., Karpekov Alexander, Helbling Alec, Wang Zijie J., Lee Seongmin, Hoover Benjamin, Chau Duen Horng
- Optimizing Language Augmentation For Multilingual Large Language Models: A Case Study On Korean Choi Changsu, Jeong Yongbin, Park Seoyoon, Won Inho, Lim Hyeonseok, Kim Sangmin, Kang Yejee, Yoon Chanhyuk, Park Jaewan, Lee Yiseul, Lee Hyejin, Hahm Younggyun, Kim Hansaem, Lim Kyungtae
- Cross-architecture Transfer Learning For Linear-cost Inference Transformers Choi Sehyun
- Gemmar: Enhancing Llms Through Arabic Instruction-tuning Chouikhi Hasna, Aloui Manel, Hammou Cyrine Ben, Chaabane Ghaith, Kchaou Haithem, Dhaouadi Chehir
- Crosslingual Capabilities And Knowledge Barriers In Multilingual Large Language Models Chua Lynn, Ghazi Badih, Huang Yangsibo, Kamath Pritish, Kumar Ravi, Manurangsi Pasin, Sinha Amer, Xie Chulin, Zhang Chiyuan
- PEFT-U: Parameter-efficient Fine-tuning For User Personalization Clarke Christopher, Heng Yuzhao, Tang Lingjia, Mars Jason
- Cogbench: A Large Language Model Walks Into A Psychology Lab Coda-forno Julian, Binz Marcel, Wang Jane X., Schulz Eric
- Improving Token-based World Models With Parallel Observation Prediction Cohen Lior, Wang Kaixin, Kang Bingyi, Mannor Shie
- Saullm-54b & Saullm-141b: Scaling Up Domain Adaptation For The Legal Domain Colombo Pierre, Pires Telmo, Boudiaf Malik, Melo Rui, Culver Dominic, Morgado Sofia, Malaboeuf Etienne, Hautreux Gabriel, Charpentier Johanne, Desa Michael
- Attentionlego: An Open-source Building Block For Spatially-scalable Large Language Model Accelerator With Processing-in-memory Technology Cong Rongqing, He Wenyang, Li Mingxuan, Luo Bangning, Yang Zebin, Yang Yuchao, Huang Ru, Yan Bonan
- Representations As Language: An Information-theoretic Framework For Interpretability Conklin Henry, Smith Kenny
- Ragsys: Item-cold-start Recommender As RAG System Contal Emile, Mcgoldrick Garrin
- LLASP: Fine-tuning Large Language Models For Answer Set Programming Coppolillo Erica, Calimeri Francesco, Manco Giuseppe, Perri Simona, Ricca Francesco
- Rocode: A Dataset For Measuring Code Intelligence From Problem Definitions In Romanian Cosma Adrian, Iordache Bogdan, Rosso Paolo
- Gamebench: Evaluating Strategic Reasoning Abilities Of LLM Agents Costarelli Anthony, Allen Mat, Hauksson Roman, Sodunke Grace, Hariharan Suhas, Cheng Carlson, Li Wenjie, Clymer Joshua, Yadav Arjun
- Llambert: Large-scale Low-cost Data Annotation In NLP Csanády Bálint, Muzsai Lajos, Vedres Péter, Nádasdy Zoltán, Lukács András
- Moeut: Mixture-of-experts Universal Transformers Csordás Róbert, Irie Kazuki, Schmidhuber Jürgen, Potts Christopher, Manning Christopher D.
- Rethinking LLM Language Adaptation: A Case Study On Chinese Mixtral Cui Yiming, Yao Xin
- Meta Large Language Model Compiler: Foundation Models Of Compiler Optimization Cummins Chris, Seeker Volker, Grubisic Dejan, Rozière Baptiste, Gehring Jonas, Synnaeve Gabriel, Leather Hugh
- Synthetic Dataset Creation And Fine-tuning Of Transformer Models For Question Answering In Serbian Cvetanović Aleksa, Tadić Predrag
- Anchored Preference Optimization And Contrastive Revisions: Addressing Underspecification In Alignment D'oosterlinck Karel, Xu Winnie, Develder Chris, Demeester Thomas, Singh Amanpreet, Potts Christopher, Kiela Douwe, Mehri Shikib
- Getting The Most Out Of Your Tokenizer For Pre-training And Domain Adaptation Dagan Gautier, Synnaeve Gabriel, Rozière Baptiste
- CORM: Cache Optimization With Recent Message For Large Language Model Inference Dai Jincheng, Huang Zhuowei, Jiang Haiyun, Chen Chen, Cai Deng, Bi Wei, Shi Shuming
- Muap: Multi-step Adaptive Prompt Learning For Vision-language Model With Missing Modality Dai Ruiting, Tan Yuqiao, Mo Lisi, He Tao, Qin Ke, Liang Shuang
- ACT-MNMT Auto-constriction Turning For Multilingual Neural Machine Translation Dai Shaojie, Liu Xin, Luo Ping, Yu Yue
- Low-rank Finetuning For Llms: A Fairness Perspective Das Saswat, Romanelli Marco, Tran Cuong, Reza Zarreen, Kailkhura Bhavya, Fioretto Ferdinando
- Application Of GPT Language Models For Innovation In Activities In University Teaching De Buenaga Manuel, Bueno Francisco Javier
- Towards Transparency: Exploring LLM Trainings Datasets Through Visual Topic Modeling And Semantic Frame De Dampierre Charles, Mogoutov Andrei, Baumard Nicolas
- Pelle: Encoder-based Language Models For Brazilian Portuguese Based On Open Data De Mello Guilherme Lamartine, Finger Marcelo, Serras Felipe, Carpi Miguel De Mello, Jose Marcos Menon, Domingues Pedro Henrique, Cavalim Paulo
- Evaluating Large Language Models For Automatic Analysis Of Teacher Simulations De-fitero-dominguez David, Albaladejo-gonzález Mariano, Garcia-cabot Antonio, Garcia-lopez Eva, Moreno-cediel Antonio, Barno Erin, Reich Justin
- Deepseek-v2: A Strong, Economical, And Efficient Mixture-of-experts Language Model Deepseek-ai, Liu Aixin, Feng Bei, Wang Bin, Wang Bingxuan, Liu Bo, Zhao Chenggang, Deng Chengqi, Ruan Chong, Dai Damai, Guo Daya, Yang Dejian, Chen Deli, Ji Dongjie, Li Erhang, Lin Fangyun, Luo Fuli, Hao Guangbo, Chen Guanting, Li Guowei, Zhang H., Xu Hanwei, Yang Hao, Zhang Haowei, Ding Honghui, Xin Huajian, Gao Huazuo, Li Hui, Qu Hui, Cai J. L., Liang Jian, Guo Jianzhong, Ni Jiaqi, Li Jiashi, Chen Jin, Yuan Jingyang, Qiu Junjie, Song Junxiao, Dong Kai, Gao Kaige, Guan Kang, Wang Lean, Zhang Lecong, Xu Lei, Xia Leyi, Zhao Liang, Zhang Liyue, Li Meng, Wang Miaojun, Zhang Mingchuan, Zhang Minghua, Tang Minghui, Li Mingming, Tian Ning, Huang Panpan, Wang Peiyi, Zhang Peng, Zhu Qihao, Chen Qinyu, Du Qiushi, Chen R. J., Jin R. L., Ge Ruiqi, Pan Ruizhe, Xu Runxin, Chen Ruyi, Li S. S., Lu Shanghao, Zhou Shangyan, Chen Shanhuang, Wu Shaoqing, Ye Shengfeng, Ma Shirong, Wang Shiyu, Zhou Shuang, Yu Shuiping, Zhou Shunfeng, Zheng Size, Wang T., Pei Tian, Yuan Tian, Sun Tianyu, Xiao W. L., Zeng Wangding, An Wei, Liu Wen, Liang Wenfeng, Gao Wenjun, Zhang Wentao, Li X. Q., Jin Xiangyue, Wang Xianzu, Bi Xiao, Liu Xiaodong, Wang Xiaohan, Shen Xiaojin, Chen Xiaokang, Chen Xiaosha, Nie Xiaotao, Sun Xiaowen, Wang Xiaoxiang, Liu Xin, Xie Xin, Yu Xingkai, Song Xinnan, Zhou Xinyi, Yang Xinyu, Lu Xuan, Su Xuecheng, Wu Y., Li Y. K., Wei Y. X., Zhu Y. X., Xu Yanhong, Huang Yanping, Li Yao, Zhao Yao, Sun Yaofeng, Li Yaohui, Wang Yaohui, Zheng Yi, Zhang Yichao, Xiong Yiliang, Zhao Yilong, He Ying, Tang Ying, Piao Yishi, Dong Yixin, Tan Yixuan, Liu Yiyuan, Wang Yongji, Guo Yongqiang, Zhu Yuchen, Wang Yuduan, Zou Yuheng, Zha Yukun, Ma Yunxian, Yan Yuting, You Yuxiang, Liu Yuxuan, Ren Z. Z., Ren Zehui, Sha Zhangli, Fu Zhe, Huang Zhen, Zhang Zhen, Xie Zhenda, Hao Zhewen, Shao Zhihong, Wen Zhiniu, Xu Zhipeng, Zhang Zhongyu, Li Zhuoshu, Wang Zihan, Gu Zihui, Li Zilin, Xie Ziwei
- Cram: Credibility-aware Attention Modification In Llms For Combating Misinformation In RAG Deng Boyi, Wang Wenjie, Zhu Fengbin, Wang Qifan, Feng Fuli
- MIMIR: A Streamlined Platform For Personalized Agent Tuning In Domain Expertise Deng Chunyuan, Tang Xiangru, Zhao Yilun, Wang Hanming, Wang Haoran, Zhou Wangchunshu, Cohan Arman, Gerstein Mark
- ELASTIC: Efficient Linear Attention For Sequential Interest Compression Deng Jiaxin, Wang Shiyao, Lu Song, Li Yinfeng, Luo Xinchen, Liu Yuanjun, Xu Peixing, Zhou Guorui
- Gotcha! Don't Trick Me With Unanswerable Questions! Self-aligning Large Language Models For Responding To Unknown Questions Deng Yang, Zhao Yong, Li Moxin, Ng See-kiong, Chua Tat-seng
- Attention Is Naturally Sparse With Gaussian Distributed Input Deng Yichuan, Song Zhao, Yang Chiwun
- Text-tuple-table: Towards Information Integration In Text-to-table Generation Via Global Tuple Extraction Deng Zheye, Chan Chunkit, Wang Weiqi, Sun Yuxi, Fan Wei, Zheng Tianshi, Yim Yauwai, Song Yangqiu
- Glam: Fine-tuning Large Language Models For Domain Knowledge Graph Alignment Via Neighborhood Partitioning And Generative Subgraph Encoding Dernbach Stefan, Agarwal Khushbu, Zuniga Alejandro, Henry Michael, Choudhury Sutanay
- Promises, Outlooks And Challenges Of Diffusion Language Modeling Deschenaux Justin, Gulcehre Caglar
- GNOME: Generating Negotiations Through Open-domain Mapping Of Exchanges Deshpande Darshan, Sinha Shambhavi, Kumar Anirudh Ravi, Pal Debaditya, May Jonathan
- A Simple And Effective \(L_2\) Norm-based Strategy For KV Cache Compression Devoto Alessio, Zhao Yu, Scardapane Simone, Minervini Pasquale
- Facts-and-feelings: Capturing Both Objectivity And Subjectivity In Table-to-text Generation Dey Tathagata, Bhattacharyya Pushpak
- Learning How To Ask: Cycle-consistency Refines Prompts In Multimodal Foundation Models Diesendruck Maurice, Lin Jianzhe, Imani Shima, Mahalingam Gayathri, Xu Mingyang, Zhao Jie
- Entgpt: Linking Generative Large Language Models With Knowledge Bases Ding Yifan, Poudel Amrit, Zeng Qingkai, Weninger Tim, Veeramani Balaji, Bhattacharya Sanmitra
- Longrope: Extending LLM Context Window Beyond 2 Million Tokens Ding Yiran, Zhang Li Lyna, Zhang Chengruidong, Xu Yuanyuan, Shang Ning, Xu Jiahang, Yang Fan, Yang Mao
- Language Adaptation On A Tight Academic Compute Budget: Tokenizer Swapping Works And Pure Bfloat16 Is Enough Dobler Konstantin, De Melo Gerard
- Evaluating Linguistic Capabilities Of Multimodal Llms In The Lens Of Few-shot Learning Dogan Mustafa, Kesen Ilker, Calixto Iacer, Erdem Aykut, Erdem Erkut
- A Tale Of Tails: Model Collapse As A Change Of Scaling Laws Dohmatob Elvis, Feng Yunzhen, Yang Pu, Charton Francois, Kempe Julia
- Latent Attention For Linear Time Transformers Dolga Rares, Cobzarenco Marius, Barber David
- Self-play With Execution Feedback: Improving Instruction-following Capabilities Of Large Language Models Dong Guanting, Lu Keming, Li Chengpeng, Xia Tingyu, Yu Bowen, Zhou Chang, Zhou Jingren
- Understand What LLM Needs: Dual Preference Alignment For Retrieval-augmented Generation Dong Guanting, Zhu Yutao, Zhang Chenghao, Wang Zechen, Dou Zhicheng, Wen Ji-rong
- Baichuanseed: Sharing The Potential Of Extensive Data Collection And Deduplication By Introducing A Competitive Large Language Model Baseline Dong Guosheng, Pan Da, Sun Yiding, Zhang Shusen, Liang Zheng, Wu Xin, Shen Yanjun, Yang Fan, Sun Haoze, Li Tianpeng, Lin Mingan, Xu Jianhua, Zhang Yufan, Nie Xiaonan, Su Lei, Wang Bingning, Zhang Wentao, Mao Jiaxin, Zhou Zenan, Chen Weipeng
- RLHF Workflow: From Reward Modeling To Online RLHF Dong Hanze, Xiong Wei, Pang Bo, Wang Haoxiang, Zhao Han, Zhou Yingbo, Jiang Nan, Sahoo Doyen, Xiong Caiming, Zhang Tong
- Internlm-xcomposer2-4khd: A Pioneering Large Vision-language Model Handling Resolutions From 336 Pixels To 4K HD Dong Xiaoyi, Zhang Pan, Zang Yuhang, Cao Yuhang, Wang Bin, Ouyang Linke, Zhang Songyang, Duan Haodong, Zhang Wenwei, Li Yining, Yan Hang, Gao Yang, Chen Zhe, Zhang Xinyue, Li Wei, Li Jingwen, Wang Wenhai, Chen Kai, He Conghui, Zhang Xingcheng, Dai Jifeng, Qiao Yu, Lin Dahua, Wang Jiaqi
- Unmemorization In Large Language Models Via Self-distillation And Deliberate Imagination Dong Yijiang River, Lin Hongzhou, Belkin Mikhail, Huerta Ramon, Vulić Ivan
- Prompt Your Brain: Scaffold Prompt Tuning For Efficient Adaptation Of Fmri Pre-trained Model Dong Zijian, Wu Yilei, Chen Zijiao, Zhang Yichi, Jin Yueming, Zhou Juan Helen
- Biomedical Large Languages Models Seem Not To Be Superior To Generalist Models On Unseen Medical Data Dorfner Felix J., Dada Amin, Busch Felix, Makowski Marcus R., Han Tianyu, Truhn Daniel, Kleesiek Jens, Sushil Madhumita, Lammert Jacqueline, Adams Lisa C., Bressem Keno K.
- Enhancing Robustness Of Llm-synthetic Text Detectors For Academic Writing: A Comprehensive Analysis Dou Zhicheng, Guo Yuchen, Chang Ching-chun, Nguyen Huy H., Echizen Isao
- Investigating Low-cost LLM Annotation For Spoken Dialogue Understanding Datasets Druart Lucas, Vielzeuf Valentin, Estève Yannick
- Stacking Your Transformers: A Closer Look At Model Growth For Efficient LLM Pre-training Du Wenyu, Luo Tongxu, Qiu Zihan, Huang Zeyu, Shen Yikang, Cheng Reynold, Guo Yike, Fu Jie
- Chinese Tiny LLM: Pretraining A Chinese-centric Large Language Model Du Xinrun, Yu Zhouliang, Gao Songyang, Pan Ding, Cheng Yuyang, Ma Ziyang, Yuan Ruibin, Qu Xingwei, Liu Jiaheng, Zheng Tianyu, Luo Xinchen, Zhou Guorui, Chen Wenhu, Zhang Ge
- CLEFT: Language-image Contrastive Learning With Efficient Large Language Model And Prompt Fine-tuning Du Yuexi, Chang Brian, Dvornek Nicha C.
- How To Think Step-by-step: A Mechanistic Understanding Of Chain-of-thought Reasoning Dutta Subhabrata, Singh Joykirat, Chakrabarti Soumen, Chakraborty Tanmoy
- Your Context Is Not An Array: Unveiling Random Access Limitations In Transformers Ebrahimi Mohammadreza, Panchal Sunny, Memisevic Roland
- CROME: Cross-modal Adapters For Efficient Multimodal LLM Ebrahimi Sayna, Arik Sercan O., Nama Tejas, Pfister Tomas
- Relation Extraction With Fine-tuned Large Language Models In Retrieval Augmented Generation Frameworks Efeoglu Sefika, Paschke Adrian
- Dynamic Q&A Of Clinical Documents With Large Language Models Elgedawy Ran, Danciu Ioana, Mahbub Maria, Srinivasan Sudarshan
- Layerskip: Enabling Early Exit Inference And Self-speculative Decoding Elhoushi Mostafa, Shrivastava Akshat, Liskovich Diana, Hosmer Basil, Wasti Bram, Lai Liangzhen, Mahmoud Anas, Acun Bilge, Agarwal Saurabh, Roman Ahmed, Aly Ahmed A, Chen Beidi, Wu Carole-jean
- Enabling Natural Zero-shot Prompting On Encoder Models Via Statement-tuning Elshabrawy Ahmed, Huang Yongxin, Gurevych Iryna, Aji Alham Fikri
- Reasoning In Transformers -- Mitigating Spurious Correlations And Reasoning Shortcuts Enström Daniel, Kjellberg Viktor, Johansson Moa
- Navigating Uncertainty: Optimizing API Dependency For Hallucination Reduction In Closed-book Question Answering Erbacher Pierre, Falissar Louis, Guigue Vincent, Soulier Laure
- Empirical Studies Of Parameter Efficient Methods For Large Language Models Of Code And Knowledge Transfer To R Esmaeili Amirreza, Saberi Iman, Fard Fatemeh H.
- Latxa: An Open Language Model And Evaluation Suite For Basque Etxaniz Julen, Sainz Oscar, Perez Naiara, Aldabe Itziar, Rigau German, Agirre Eneko, Ormazabal Aitor, Artetxe Mikel, Soroa Aitor
- Coda: Constrained Generation Based Data Augmentation For Low-resource NLP Evuru Chandra Kiran Reddy, Ghosh Sreyan, Kumar Sonal, S Ramaneswaran, Tyagi Utkarsh, Manocha Dinesh
- TRACE The Evidence: Constructing Knowledge-grounded Reasoning Chains For Retrieval-augmented Generation Fang Jinyuan, Meng Zaiqiao, Macdonald Craig
- \(VILA^2\): VILA Augmented VILA Fang Yunhao, Zhu Ligeng, Lu Yao, Wang Yan, Molchanov Pavlo, Cho Jang Hyun, Pavone Marco, Han Song, Yin Hongxu
- Unraveling The Dominance Of Large Language Models Over Transformer Models For Bangla Natural Language Inference: A Comprehensive Study Faria Fatema Tuj Johora, Moin Mukaffi Bin, Fahim Asif Iftekher, Debnath Pronay, Shah Faisal Muhammad
- Multi-modal Hallucination Control By Visual Information Grounding Favero Alessandro, Zancato Luca, Trager Matthew, Choudhary Siddharth, Perera Pramuditha, Achille Alessandro, Swaminathan Ashwin, Soatto Stefano
- Curlora: Stable LLM Continual Fine-tuning And Catastrophic Forgetting Mitigation Fawi Muhammad
- Evaluating Human Alignment And Model Faithfulness Of LLM Rationale Fayyaz Mohsen, Yin Fan, Sun Jiao, Peng Nanyun
- What Are Large Language Models Mapping To In The Brain? A Case Against Over-reliance On Brain Scores Feghhi Ebrahim, Hadidi Nima, Song Bryan, Blank Idan A., Kao Jonathan C.
- Internlm-law: An Open Source Chinese Legal Large Language Model Fei Zhiwei, Zhang Songyang, Shen Xiaoyu, Zhu Dawei, Wang Xiao, Cao Maosong, Zhou Fengzhe, Li Yining, Zhang Wenwei, Lin Dahua, Chen Kai, Ge Jidong
- IGOT: Information Gain Optimized Tokenizer On Domain Adaptive Pretraining Feng Dawei, Zhang Yihai, Xu Zhixuan
- An Extremely Data-efficient And Generative Llm-based Reinforcement Learning Agent For Recommenders Feng Shuang, Feng Grace
- Exploring Automated Distractor Generation For Math Multiple-choice Questions Via Large Language Models Feng Wanyong, Lee Jaewook, Mcnichols Hunter, Scarlatos Alexander, Smith Digory, Woodhead Simon, Ornelas Nancy Otero, Lan Andrew
- Mt-ladder: A Model-agnostic Framework Boosting Llm-based Machine Translation To The Next Level Feng Zhaopeng, Zhang Yan, Chen Ruizhe, Meng Zijie, Liu Zuozhu
- RAG Foundry: A Framework For Enhancing Llms For Retrieval Augmented Generation Fleischer Daniel, Berchansky Moshe, Wasserblat Moshe, Izsak Peter
- Re-adaptir: Improving Information Retrieval Through Reverse Engineered Adaptation Fleshman William, Van Durme Benjamin
- CLIP With Generative Latent Replay: A Strong Baseline For Incremental Learning Frascaroli Emanuele, Panariello Aniello, Buzzega Pietro, Bonicelli Lorenzo, Porrello Angelo, Calderara Simone
- Representing Rule-based Chatbots With Transformers Friedman Dan, Panigrahi Abhishek, Chen Danqi
- A Role-specific Guided Large Language Model For Ophthalmic Consultation Based On Stylistic Differentiation Fu Laiyi, Fan Binbin, Du Hongkai, Feng Yanxiang, Li Chunhua, Song Huping
- Lazyllm: Dynamic Token Pruning For Efficient Long Context LLM Inference Fu Qichen, Cho Minsik, Merth Thomas, Mehta Sachin, Rastegari Mohammad, Najibi Mahyar
- Disperse-then-merge: Pushing The Limits Of Instruction Tuning Via Alignment Tax Reduction Fu Tingchen, Cai Deng, Liu Lemao, Shi Shuming, Yan Rui
- Tiny Titans: Can Smaller Large Language Models Punch Above Their Weight In The Real World For Meeting Summarization? Fu Xue-yong, Laskar Md Tahmid Rahman, Khasanova Elena, Chen Cheng, Tn Shashi Bhushan
- Data Engineering For Scaling Language Models To 128K Context Fu Yao, Panda Rameswar, Niu Xinyao, Yue Xiang, Hajishirzi Hannaneh, Kim Yoon, Peng Hao
- UI-JEPA: Towards Active Perception Of User Intent Through Onscreen User Activity Fu Yicheng, Anantha Raviteja, Vashisht Prabal, Cheng Jianpeng, Littwin Etai
- Using Large Language Models For The Interpretation Of Building Regulations Fuchs Stefan, Witbrock Michael, Dimyadi Johannes, Amor Robert
- Layoutllm: Large Language Model Instruction Tuning For Visually Rich Document Understanding Fujitake Masato
- Zero-shot Persuasive Chatbots With Llm-generated Strategies And Information Retrieval Furumai Kazuaki, Legaspi Roberto, Vizcarra Julio, Yamazaki Yudai, Nishimura Yasutaka, Semnani Sina J., Ikeda Kazushi, Shi Weiyan, Lam Monica S.
- Student Answer Forecasting: Transformer-driven Answer Choice Prediction For Language Learning Gado Elena Grazia, Martorella Tommaso, Zunino Luca, Mejia-domenzain Paola, Swamy Vinitra, Frej Jibril, Käser Tanja
- Airavata: Introducing Hindi Instruction-tuned LLM Gala Jay, Jayakumar Thanmay, Husain Jaavid Aktar, M Aswanth Kumar, Khan Mohammed Safi Ur Rahman, Kanojia Diptesh, Puduppully Ratish, Khapra Mitesh M., Dabre Raj, Murthy Rudra, Kunchukuttan Anoop
- Open Generative Large Language Models For Galician Gamallo Pablo, Rodríguez Pablo, De-dios-flores Iria, Sotelo Susana, Paniagua Silvia, Bardanca Daniel, Pichel José Ramom, Garcia Marcos
- Application Of LLM Agents In Recruitment: A Novel Framework For Resume Screening Gan Chengguang, Zhang Qinghao, Mori Tatsunori
- Question Aware Vision Transformer For Multimodal Reasoning Ganz Roy, Kittenplon Yair, Aberdam Aviad, Avraham Elad Ben, Nuriel Oren, Mazor Shai, Litman Ron
- Multilingual Pretraining And Instruction Tuning Improve Cross-lingual Knowledge Alignment, But Only Shallowly Gao Changjiang, Hu Hongda, Hu Peng, Chen Jiajun, Li Jixing, Huang Shujian
- The Best Of Both Worlds: Toward An Honest And Helpful Large Language Model Gao Chujie, Zhang Qihui, Chen Dongping, Huang Yue, Wu Siyuan, Fu Zhengyan, Wan Yao, Zhang Xiangliang, Sun Lichao
- Aligning LLM Agents By Learning Latent Preference From User Edits Gao Ge, Taymanov Alexey, Salinas Eduardo, Mineiro Paul, Misra Dipendra
- Selfcp: Compressing Over-limit Prompt Via The Frozen Large Language Model Itself Gao Jun, Cao Ziqiang, Li Wenjie
- Harnessing The Intrinsic Knowledge Of Pretrained Language Models For Challenging Text Classification Settings Gao Lingyu
- Tc-llava: Rethinking The Transfer From Image To Video Understanding With Temporal Considerations Gao Mingze, Liu Jingyu, Li Mingda, Xie Jiangtao, Liu Qingbin, Zhao Bo, Chen Xi, Xiong Hui
- Autovcoder: A Systematic Framework For Automated Verilog Code Generation Using Llms Gao Mingzhe, Zhao Jieru, Lin Zhe, Ding Wenchao, Hou Xiaofeng, Feng Yu, Li Chao, Guo Minyi
- Lumina-t2x: Transforming Text Into Any Modality, Resolution, And Duration Via Flow-based Large Diffusion Transformers Gao Peng, Zhuo Le, Liu Dongyang, Du Ruoyi, Luo Xu, Qiu Longtian, Zhang Yuhang, Lin Chen, Huang Rongjie, Geng Shijie, Zhang Renrui, Xi Junlin, Shao Wenqi, Jiang Zhengkai, Yang Tianshuo, Ye Weicai, Tong He, He Jingwen, Qiao Yu, Li Hongsheng
- Efficient Tool Use With Chain-of-abstraction Reasoning Gao Silin, Dwivedi-yu Jane, Yu Ping, Tan Xiaoqing Ellen, Pasunuru Ramakanth, Golovneva Olga, Sinha Koustuv, Celikyilmaz Asli, Bosselut Antoine, Wang Tianlu
- Improving Language Understanding From Screenshots Gao Tianyu, Wang Zirui, Bhaskar Adithya, Chen Danqi
- A Review On The Use Of Large Language Models As Virtual Tutors García-méndez Silvia, De Arriba-pérez Francisco, Somoza-lópez María Del Carmen
- Worldgpt: Empowering LLM As Multimodal World Model Ge Zhiqi, Huang Hongzhe, Zhou Mingze, Li Juncheng, Wang Guoming, Tang Siliang, Zhuang Yueting
- Does Fine-tuning Llms On New Knowledge Encourage Hallucinations? Gekhman Zorik, Yona Gal, Aharoni Roee, Eyal Matan, Feder Amir, Reichart Roi, Herzig Jonathan
- Gemma 2: Improving Open Language Models At A Practical Size Gemma Team, Riviere Morgane, Pathak Shreya, Sessa Pier Giuseppe, Hardin Cassidy, Bhupatiraju Surya, Hussenot Léonard, Mesnard Thomas, Shahriari Bobak, Ramé Alexandre, Ferret Johan, Liu Peter, Tafti Pouya, Friesen Abe, Casbon Michelle, Ramos Sabela, Kumar Ravin, Lan Charline Le, Jerome Sammy, Tsitsulin Anton, Vieillard Nino, Stanczyk Piotr, Girgin Sertan, Momchev Nikola, Hoffman Matt, Thakoor Shantanu, Grill Jean-bastien, Neyshabur Behnam, Bachem Olivier, Walton Alanna, Severyn Aliaksei, Parrish Alicia, Ahmad Aliya, Hutchison Allen, Abdagic Alvin, Carl Amanda, Shen Amy, Brock Andy, Coenen Andy, Laforge Anthony, Paterson Antonia, Bastian Ben, Piot Bilal, Wu Bo, Royal Brandon, Chen Charlie, Kumar Chintu, Perry Chris, Welty Chris, Choquette-choo Christopher A., Sinopalnikov Danila, Weinberger David, Vijaykumar Dimple, Rogozińska Dominika, Herbison Dustin, Bandy Elisa, Wang Emma, Noland Eric, Moreira Erica, Senter Evan, Eltyshev Evgenii, Visin Francesco, Rasskin Gabriel, Wei Gary, Cameron Glenn, Martins Gus, Hashemi Hadi, Klimczak-plucińska Hanna, Batra Harleen, Dhand Harsh, Nardini Ivan, Mein Jacinda, Zhou Jack, Svensson James, Stanway Jeff, Chan Jetha, Zhou Jin Peng, Carrasqueira Joana, Iljazi Joana, Becker Jocelyn, Fernandez Joe, Van Amersfoort Joost, Gordon Josh, Lipschultz Josh, Newlan Josh, Ji Ju-yeong, Mohamed Kareem, Badola Kartikeya, Black Kat, Millican Katie, Mcdonell Keelin, Nguyen Kelvin, Sodhia Kiranbir, Greene Kish, Sjoesund Lars Lowe, Usui Lauren, Sifre Laurent, Heuermann Lena, Lago Leticia, Mcnealus Lilly, Soares Livio Baldini, Kilpatrick Logan, Dixon Lucas, Martins Luciano, Reid Machel, Singh Manvinder, Iverson Mark, Görner Martin, Velloso Mat, Wirth Mateo, Davidow Matt, Miller Matt, Rahtz Matthew, Watson Matthew, Risdal Meg, Kazemi Mehran, Moynihan Michael, Zhang Ming, Kahng Minsuk, Park Minwoo, Rahman Mofi, Khatwani Mohit, Dao Natalie, Bardoliwalla Nenshad, Devanathan Nesh, Dumai Neta, Chauhan Nilay, Wahltinez Oscar, Botarda Pankil, Barnes Parker, Barham Paul, Michel Paul, Jin Pengchong, Georgiev Petko, Culliton Phil, Kuppala Pradeep, Comanescu Ramona, Merhej Ramona, Jana Reena, Rokni Reza Ardeshir, Agarwal Rishabh, Mullins Ryan, Saadat Samaneh, Carthy Sara Mc, Perrin Sarah, Arnold Sébastien M. R., Krause Sebastian, Dai Shengyang, Garg Shruti, Sheth Shruti, Ronstrom Sue, Chan Susan, Jordan Timothy, Yu Ting, Eccles Tom, Hennigan Tom, Kocisky Tomas, Doshi Tulsee, Jain Vihan, Yadav Vikas, Meshram Vilobh, Dharmadhikari Vishal, Barkley Warren, Wei Wei, Ye Wenming, Han Woohyun, Kwon Woosuk, Xu Xiang, Shen Zhe, Gong Zhitao, Wei Zichuan, Cotruta Victor, Kirk Phoebe, Rao Anand, Giang Minh, Peran Ludovic, Warkentin Tris, Collins Eli, Barral Joelle, Ghahramani Zoubin, Hadsell Raia, Sculley D., Banks Jeanine, Dragan Anca, Petrov Slav, Vinyals Oriol, Dean Jeff, Hassabis Demis, Kavukcuoglu Koray, Farabet Clement, Buchatskaya Elena, Borgeaud Sebastian, Fiedel Noah, Joulin Armand, Kenealy Kathleen, Dadashi Robert, Andreev Alek
- Can Large Language Models Learn Independent Causal Mechanisms? Gendron Gaël, Nguyen Bao Trung, Peng Alex Yuxuan, Witbrock Michael, Dobbie Gillian
- Beyond Self-learned Attention: Mitigating Attention Bias In Transformer-based Models Using Attention Guidance Gesi Jiri, Ahmed Iftekhar
- AI And Memory Wall Gholami Amir, Yao Zhewei, Kim Sehoon, Hooper Coleman, Mahoney Michael W., Keutzer Kurt
- A Closer Look At The Limitations Of Instruction Tuning Ghosh Sreyan, Evuru Chandra Kiran Reddy, Kumar Sonal, S Ramaneswaran, Aneja Deepali, Jin Zeyu, Duraiswami Ramani, Manocha Dinesh
- Investigating The Translation Capabilities Of Large Language Models Trained On Parallel Data Only Gilabert Javier García, Escolano Carlos, Savall Aleix Sant, Fornaciari Francesca De Luca, Mash Audrey, Liao Xixian, Melero Maite
- Uncovering Hidden Intentions: Exploring Prompt Recovery For Deeper Insights Into Generated Texts Give Louis, Zaoral Timo, Bruno Maria Antonietta
- Blendsql: A Scalable Dialect For Unifying Hybrid Question Answering In Relational Algebra Glenn Parker, Dakle Parag Pravin, Wang Liang, Raghavan Preethi
- Chatglm: A Family Of Large Language Models From GLM-130B To GLM-4 All Tools Glm Team, Zeng Aohan, Xu Bin, Wang Bowen, Zhang Chenhui, Yin Da, Zhang Dan, Rojas Diego, Feng Guanyu, Zhao Hanlin, Lai Hanyu, Yu Hao, Wang Hongning, Sun Jiadai, Zhang Jiajie, Cheng Jiale, Gui Jiayi, Tang Jie, Zhang Jing, Sun Jingyu, Li Juanzi, Zhao Lei, Wu Lindong, Zhong Lucen, Liu Mingdao, Huang Minlie, Zhang Peng, Zheng Qinkai, Lu Rui, Duan Shuaiqi, Zhang Shudan, Cao Shulin, Yang Shuxun, Tam Weng Lam, Zhao Wenyi, Liu Xiao, Xia Xiao, Zhang Xiaohan, Gu Xiaotao, Lv Xin, Liu Xinghan, Liu Xinyi, Yang Xinyue, Song Xixuan, Zhang Xunkai, An Yifan, Xu Yifan, Niu Yilin, Yang Yuantao, Li Yueyan, Bai Yushi, Dong Yuxiao, Qi Zehan, Wang Zhaoyu, Yang Zhen, Du Zhengxiao, Hou Zhenyu, Wang Zihan
- Zamba: A Compact 7B SSM Hybrid Model Glorioso Paolo, Anthony Quentin, Tokpanov Yury, Whittington James, Pilault Jonathan, Ibrahim Adam, Millidge Beren
- Arcee's Mergekit: A Toolkit For Merging Large Language Models Goddard Charles, Siriwardhana Shamane, Ehghaghi Malikeh, Meyers Luke, Karpukhin Vlad, Benedict Brian, Mcquade Mark, Solawetz Jacob
- Direct Alignment Of Draft Model For Speculative Decoding With Chat-fine-tuned Llms Goel Raghavv, Gagrani Mukul, Jeon Wonseok, Park Junyoung, Lee Mingu, Lott Christopher
- Language Guided Exploration For RL Agents In Text Environments Golchha Hitesh, Yerawar Sahil, Patel Dhruvesh, Dan Soham, Murugesan Keerthiram
- Goldfinch: High Performance Rwkv/transformer Hybrid With Linear Pre-fill And Extreme Kv-cache Compression Goldstein Daniel, Obeid Fares, Alcaide Eric, Song Guangyu, Cheah Eugene
- Mechanistic Interpretability Of Large Language Models With Applications To The Financial Services Industry Golgoon Ashkan, Filom Khashayar, Kannan Arjun Ravi
- Omnifusion Technical Report Goncharova Elizaveta, Razzhigaev Anton, Mikhalchuk Matvey, Kurkin Maxim, Abdullaeva Irina, Skripkin Matvey, Oseledets Ivan, Dimitrov Denis, Kuznetsov Andrey
- AST-T5: Structure-aware Pretraining For Code Generation And Understanding Gong Linyuan, Elhoushi Mostafa, Cheung Alvin
- Evaluation Of Llms On Syntax-aware Code Fill-in-the-middle Tasks Gong Linyuan, Wang Sida, Elhoushi Mostafa, Cheung Alvin
- Estimating Knowledge In Large Language Models Without Generating A Single Token Gottesman Daniela, Geva Mor
- From Generalist To Specialist: Improving Large Language Models For Medical Physics Using Arcot Grandinetti Jace, Mcbeth Rafe
- Assessing Generative Language Models In Classification Tasks: Performance And Self-evaluation Capabilities In The Environmental And Climate Change Domain Grasso Francesca, Locci Stefano
- Is Mamba Capable Of In-context Learning? Grazzi Riccardo, Siems Julien, Schrodi Simon, Brox Thomas, Hutter Frank
- Stress-testing Capability Elicitation With Password-locked Models Greenblatt Ryan, Roger Fabien, Krasheninnikov Dmitrii, Krueger David
- Transformers, Contextualism, And Polysemy Grindrod Jumbly
- Fine-tuned Language Models Generate Stable Inorganic Materials As Text Gruver Nate, Sriram Anuroop, Madotto Andrea, Wilson Andrew Gordon, Zitnick C. Lawrence, Ulissi Zachary
- Teams-rl: Teaching Llms To Generate Better Instruction Datasets Via Reinforcement Learning Gu Shangding, Knoll Alois, Jin Ming
- RWKV-CLIP: A Robust Vision-language Representation Learner Gu Tiancheng, Yang Kaicheng, An Xiang, Feng Ziyong, Liu Dongnan, Cai Weidong, Deng Jiankang
- Structext-eval: An Autogenerated Benchmark For Evaluating Large Language Model's Ability In Structure-rich Text Understanding Gu Zhouhong, Ye Haoning, Zhou Zeyang, Feng Hongwei, Xiao Yanghua
- AMOR: A Recipe For Building Adaptable Modular Knowledge Agents Through Process Feedback Guan Jian, Wu Wei, Wen Zujie, Xu Peng, Wang Hongning, Huang Minlie
- Probing The Robustness Of Vision-language Pretrained Models: A Multimodal Adversarial Attack Approach Guan Jiwei, Ding Tianyu, Cao Longbing, Pan Lei, Wang Chen, Zheng Xi
- Enhancing Collaborative Semantics Of Language Model-driven Recommendations Via Graph-aware Learning Guan Zhong, Wu Likang, Zhao Hongke, He Ming, Fan Jianpin
- On The Interchangeability Of Positional Embeddings In Multilingual Neural Machine Translation Models Gumma Varun, Chitale Pranjal A., Bali Kalika
- Conversational Topic Recommendation In Counseling And Psychotherapy With Decision Transformer And Large Language Models Gunal Aylin, Lin Baihan, Bouneffouf Djallel
- Sample Design Engineering: An Empirical Study Of What Makes Good Downstream Fine-tuning Samples For Llms Guo Biyang, Wang He, Xiao Wenyilin, Chen Hong, Lee Zhuxin, Han Songqiao, Huang Hailiang
- A Novel Paradigm Boosting Translation Capabilities Of Large Language Models Guo Jiaxin, Yang Hao, Li Zongyao, Wei Daimeng, Shang Hengchao, Chen Xiaoyu
- Agent-simt: Agent-assisted Simultaneous Machine Translation With Large Language Models Guo Shoutao, Zhang Shaolei, Ma Zhengrui, Zhang Min, Feng Yang
- Sillm: Large Language Models For Simultaneous Machine Translation Guo Shoutao, Zhang Shaolei, Ma Zhengrui, Zhang Min, Feng Yang
- Fine-tuning Large Language Models For Stock Return Prediction Using Newsflow Guo Tian, Hauptmann Emmanuel
- Serial Position Effects Of Large Language Models Guo Xiaobo, Vosoughi Soroush
- API Pack: A Massive Multi-programming Language Dataset For API Call Generation Guo Zhen, Soria Adriana Meza, Sun Wei, Shen Yikang, Panda Rameswar
- Selective Self-rehearsal: A Fine-tuning Approach To Improve Generalization In Large Language Models Gupta Sonam, Nandwani Yatin, Yehudai Asaf, Mishra Mayank, Pandey Gaurav, Raghu Dinesh, Joshi Sachindra
- Would I Lie To You? Inference Time Alignment Of Language Models Using Direct Preference Heads Hadji-kyriacou Avelina Asada, Arandjelovic Ognjen
- Looking Into Black Box Code Language Models Haider Muhammad Umair, Farooq Umar, Siddique A. B., Marron Mark
- Mamba State-space Models Can Be Strong Downstream Learners Halloran John T., Gulati Manbir, Roysdon Paul F.
- Sltrain: A Sparse Plus Low-rank Approach For Parameter And Memory Efficient Pretraining Han Andi, Li Jiaxiang, Huang Wei, Hong Mingyi, Takeda Akiko, Jawanpuria Pratik, Mishra Bamdev
- Small Language Model Can Self-correct Han Haixia, Liang Jiaqing, Shi Jie, He Qianyu, Xiao Yanghua
- Transfer Learning For Text Diffusion Models Han Kehang, Kenealy Kathleen, Barua Aditya, Fiedel Noah, Constant Noah
- Enhancing CTR Prediction Through Sequential Recommendation Pre-training: Introducing The SRP4CTR Framework Han Ruidong, Li Qianzhong, Jiang He, Li Rui, Zhao Yurou, Li Xiang, Lin Wei
- Towards Robust Instruction Tuning On Multimodal Large Language Models Han Wei, Chen Hui, Poria Soujanya
- COCO Is "ALL" You Need For Visual Instruction Fine-tuning Han Xiaotian, Wang Yiqi, Zhai Bohan, You Quanzeng, Yang Hongxia
- Vision-and-language Navigation Generative Pretrained Transformer Hanlin Wen
- Exploring Chatgpt And Its Impact On Society Haque Md. Asraful, Li Shuai
- Spectrum: Targeted Training On Signal To Noise Ratio Hartford Eric, Atkins Lucas, Neto Fernando Fernandes, Golchinfar David
- Pruning For Protection: Increasing Jailbreak Resistance In Aligned Llms Without Fine-tuning Hasan Adib, Rugina Ileana, Wang Alex
- A Comprehensive Overview Of Large Language Models (llms) For Cyber Defences: Opportunities And Directions Hassanin Mohammed, Moustafa Nour
- Teaching Large Language Models To Reason With Reinforcement Learning Havrilla Alex, Du Yuqing, Raparthy Sharath Chandra, Nalmpantis Christoforos, Dwivedi-yu Jane, Zhuravinskyi Maksym, Hambro Eric, Sukhbaatar Sainbayar, Raileanu Roberta
- Chain-of-instructions: Compositional Instruction Tuning On Large Language Models Hayati Shirley Anugrah, Jung Taehee, Bodding-long Tristan, Kar Sudipta, Sethy Abhinav, Kim Joo-kyung, Kang Dongyeop
- A Law Of Next-token Prediction In Large Language Models He Hangfeng, Su Weijie J.
- Cos: Enhancing Personalization And Mitigating Bias With Context Steering He Jerry Zhi-yang, Pandey Sashrika, Schrum Mariah L., Dragan Anca
- From Words To Actions: Unveiling The Theoretical Underpinnings Of Llm-driven Autonomous Systems He Jianliang, Chen Siyu, Zhang Fengzhuo, Yang Zhuoran
- Prompt-based Bias Calibration For Better Zero/few-shot Learning Of Language Models He Kang, Long Yinghan, Roy Kaushik
- Achieving Tool Calling Functionality In Llms Using Only Prompt Engineering Without Fine-tuning He Shengtao
- What Matters In Transformers? Not All Attention Is Needed He Shwai, Sun Guoheng, Shen Zheyu, Li Ang
- MARS: Mixture Of Auto-regressive Models For Fine-grained Text-to-image Synthesis He Wanggui, Fu Siming, Liu Mushui, Wang Xierui, Xiao Wenyi, Shu Fangxun, Wang Yi, Zhang Lei, Yu Zhelun, Li Haoyuan, Huang Ziwei, Gan Leilei, Jiang Hao
- SHED: Shapley-based Automated Dataset Refinement For Instruction Fine-tuning He Yexiao, Wang Ziyao, Shen Zheyu, Sun Guoheng, Dai Yucong, Wu Yongkai, Wang Hongyi, Li Ang
- Language Models As Hierarchy Encoders He Yuan, Yuan Zhangdie, Chen Jiaoyan, Horrocks Ian
- Fortifying Ethical Boundaries In AI: Advanced Strategies For Enhancing Security In Large Language Models He Yunhong, Qiu Jianling, Zhang Wei, Yuan Zhengqing
- Camelot: Towards Large Language Models With Training-free Consolidated Associative Memory He Zexue, Karlinsky Leonid, Kim Donghyun, Mcauley Julian, Krotov Dmitry, Feris Rogerio
- Reindex-then-adapt: Improving Large Language Models For Conversational Recommendation He Zhankui, Xie Zhouhang, Steck Harald, Liang Dawen, Jha Rahul, Kallus Nathan, Mcauley Julian
- Telechat Technical Report He Zhongjiang, Wang Zihan, Liu Xinzhang, Liu Shixuan, Yao Yitong, Huang Yuyao, Li Xuelong, Li Yongxiang, Che Zhonghao, Zhang Zhaoxi, Wang Yan, Wang Xin, Pu Luwen, Xu Huinan, Fang Ruiyu, Zhao Yu, Zhang Jie, Huang Xiaomeng, Lu Zhilong, Peng Jiaxin, Zheng Wenjun, Wang Shiquan, Yang Bingkai, He Xuewei, Jiang Zhuoru, Xie Qiyi, Zhang Yanhan, Li Zhongqiu, Shi Lingling, Fu Weiwei, Zhang Yin, Huang Zilu, Xiong Sishi, Zhang Yuxiang, Wang Chao, Song Shuangyong
- HMT: Hierarchical Memory Transformer For Long Context Language Processing He Zifan, Qin Zongyue, Prakriya Neha, Sun Yizhou, Cong Jason
- N-gram Prediction And Word Difference Representations For Language Modeling Heo Dongnyeong, Rim Daniela Noemi, Choi Heeyoul
- On Overcoming Miscalibrated Conversational Priors In Llm-based Chatbots Herlihy Christine, Neville Jennifer, Schnabel Tobias, Swaminathan Adith
- Lilium: Ebay's Large Language Models For E-commerce Herold Christian, Kozielski Michael, Ekimov Leonid, Petrushkov Pavel, Vandenbussche Pierre-yves, Khadivi Shahram
- Super Tiny Language Models Hillier Dylan, Guertler Leon, Tan Cheston, Agrawal Palaash, Ruirui Chen, Cheng Bobby
- Llava-gemma: Accelerating Multimodal Foundation Models With A Compact Language Model Hinck Musashi, Olson Matthew L., Cobbley David, Tseng Shao-yen, Lal Vasudev
- Towards Building A Robust Knowledge Intensive Question Answering Model With Large Language Models Hong Xingyun, Shao Yan, Wang Zhilin, Duan Manni, Xiongnan Jin
- Accelerating Multilingual Language Model For Excessively Tokenized Languages Hong Jimin, Lee Gibbeum, Cho Jaewoong
- Outcome-constrained Large Language Models For Countering Hate Speech Hong Lingzi, Luo Pengcheng, Blanco Eduardo, Song Xiaoying
- Free To Play: UN Trade And Development's Experience With Developing Its Own Open-source Retrieval Augmented Generation Large Language Model Application Hopp Daniel
- Large Language Models For Expansion Of Spoken Language Understanding Systems To New Languages Hoscilowicz Jakub, Pawlowski Pawel, Skorupa Marcin, Sowański Marcin, Janicki Artur
- Non-linear Inference Time Intervention: Improving LLM Truthfulness Hoscilowicz Jakub, Wiacek Adam, Chojnacki Jan, Cieslak Adam, Michon Leszek, Urbanevych Vitalii, Janicki Artur
- Visual Prompting For Generalized Few-shot Segmentation: A Multi-scale Approach Hossain Mir Rayat Imtiaz, Siam Mennatullah, Sigal Leonid, Little James J.
- Retrieval-augmented Code Completion For Local Projects Using Large Language Models Hostnik Marko, Robnik-šikonja Marko
- Outlier-efficient Hopfield Layers For Large Transformer-based Models Hu Jerry Yao-chieh, Chang Pei-hsuan, Luo Robin, Chen Hong-yu, Li Weijian, Wang Wei-po, Liu Han
- Openrlhf: An Easy-to-use, Scalable And High-performance RLHF Framework Hu Jian, Wu Xibin, Wang Weixun, Xianyu, Zhang Dehao, Cao Yu
- Mitigating Large Language Model Hallucination With Faithful Finetuning Hu Minda, He Bowei, Wang Yufei, Li Liangyou, Ma Chen, King Irwin
- Mixed Sparsity Training: Achieving 4\(\times\) FLOP Reduction For Transformer Pretraining Hu Pihe, Li Shaolong, Huang Longbo
- Characterization Of Large Language Model Development In The Datacenter Hu Qinghao, Ye Zhisheng, Wang Zerui, Wang Guoteng, Zhang Meng, Chen Qiaoling, Sun Peng, Lin Dahua, Wang Xiaolin, Luo Yingwei, Wen Yonggang, Zhang Tianwei
- Matryoshka Query Transformer For Large Vision-language Models Hu Wenbo, Dou Zi-yi, Li Liunian Harold, Kamath Amita, Peng Nanyun, Chang Kai-wei
- Generative Pretrained Structured Transformers: Unsupervised Syntactic Language Models At Scale Hu Xiang, Ji Pengyu, Zhu Qingyang, Wu Wei, Tu Kewei
- Case-based Or Rule-based: How Do Transformers Do The Math? Hu Yi, Tang Xiaojuan, Yang Haotong, Zhang Muhan
- Exact And Efficient Unlearning For Large Language Model-based Recommendation Hu Zhiyu, Zhang Yang, Xiao Minghao, Wang Wenjie, Feng Fuli, He Xiangnan
- Longrecipe: Recipe For Efficient Long Context Generalization In Large Language Models Hu Zhiyuan, Liu Yuliang, Zhao Jinman, Wang Suyuchen, Wang Yan, Shen Wei, Gu Qing, Luu Anh Tuan, Ng See-kiong, Jiang Zhiwei, Hooi Bryan
- Disentangling Logic: The Role Of Context In Large Language Model Reasoning Capabilities Hua Wenyue, Zhu Kaijie, Li Lingyao, Fan Lizhou, Lin Shuhang, Jin Mingyu, Xue Haochen, Li Zelong, Wang Jindong, Zhang Yongfeng
- Multimodal Task Vectors Enable Many-shot Multimodal In-context Learning Huang Brandon, Mitra Chancharik, Arbelle Assaf, Karlinsky Leonid, Darrell Trevor, Herzig Roei
- Instupr: Instruction-based Unsupervised Passage Reranking With Large Language Models Huang Chao-wei, Chen Yun-nung
- Investigating Decoder-only Large Language Models For Speech-to-text Translation Huang Chao-wei, Lu Hui, Gong Hongyu, Inaguma Hirofumi, Kulikov Ilia, Mavlyutov Ruslan, Popuri Sravya
- Pokergpt: An End-to-end Lightweight Solver For Multi-player Texas Hold'em Via Large Language Model Huang Chenghao, Cao Yanbo, Wen Yinlong, Zhou Tao, Zhang Yanru
- Learn When (not) To Trust Language Models: A Privacy-centric Adaptive Model-aware Approach Huang Chengkai, Wang Rui, Xie Kaige, Yu Tong, Yao Lina
- Recurrent Context Compression: Efficiently Expanding The Context Window Of LLM Huang Chensen, Zhu Guibo, Wang Xuepeng, Luo Yifei, Ge Guojing, Chen Haoran, Yi Dong, Wang Jinqiao
- Selective Prompting Tuning For Personalized Conversations With Llms Huang Qiushi, Liu Xubo, Ko Tom, Wu Bo, Wang Wenwu, Zhang Yu, Tang Lilian
- Antidote: Post-fine-tuning Safety Alignment For Large Language Models Against Harmful Fine-tuning Huang Tiansheng, Bhattacharya Gautam, Joshi Pratik, Kimball Josh, Liu Ling
- Vaccine: Perturbation-aware Alignment For Large Language Models Against Harmful Fine-tuning Huang Tiansheng, Hu Sihao, Liu Ling
- Improving Llms For Recommendation With Out-of-vocabulary Tokens Huang Ting-ji, Yang Jia-qi, Shen Chunxu, Liu Kai-qi, Zhan De-chuan, Ye Han-jia
- Pruning Large Language Models With Semi-structural Adaptive Sparse Training Huang Weiyu, Hu Yuezhou, Jian Guohao, Zhu Jun, Chen Jianfei
- Visual Hallucinations Of Multi-modal Large Language Models Huang Wen, Liu Hongbin, Guo Minxin, Gong Neil Zhenqiang
- Ml-mamba: Efficient Multi-modal Large Language Model Utilizing Mamba-2 Huang Wenjun, Pan Jiakai, Tang Jiahao, Ding Yanyu, Xing Yifei, Wang Yuhe, Wang Zhengzhuo, Hu Jianguo
- Rolora: Fine-tuning Rotated Outlier-free Llms For Effective Weight-activation Quantization Huang Xijie, Liu Zechun, Liu Shih-yang, Cheng Kwang-ting
- WESE: Weak Exploration To Strong Exploitation For LLM Agents Huang Xu, Liu Weiwen, Chen Xiaolong, Wang Xingmei, Lian Defu, Wang Yasheng, Tang Ruiming, Chen Enhong
- Lost In The Source Language: How Large Language Models Evaluate The Quality Of Machine Translation Huang Xu, Zhang Zhirui, Geng Xiang, Du Yichao, Chen Jiajun, Huang Shujian
- Leveraging Large Language Models For Enhanced NLP Task Performance Through Knowledge Distillation And Optimized Training Strategies Huang Yining, Tang Keke, Chen Meilian
- A Comprehensive Survey On Evaluating Large Language Model Applications In The Medical Industry Huang Yining, Tang Keke, Chen Meilian, Wang Boyuan
- See The Unseen: Better Context-consistent Knowledge-editing By Noises Huang Youcheng, Lei Wenqiang, Zhang Zheng, Lv Jiancheng, Yan Shuicheng
- Accelerating Pre-training Of Multimodal Llms Via Chain-of-sight Huang Ziyuan, Ji Kaixiang, Gong Biao, Qing Zhiwu, Zhang Qinglong, Zheng Kecheng, Wang Jian, Chen Jingdong, Yang Ming
- Instruction Fine-tuning: Does Prompt Loss Matter? Huerta-enochian Mathew, Ko Seung Yong
- HFT: Half Fine-tuning For Large Language Models Hui Tingfeng, Zhang Zhenyu, Wang Shuohuan, Xu Weiran, Sun Yu, Wu Hua
- Romansetu: Efficiently Unlocking Multilingual Capabilities Of Large Language Models Via Romanization Husain Jaavid Aktar, Dabre Raj, Kumar Aswanth, Gala Jay, Jayakumar Thanmay, Puduppully Ratish, Kunchukuttan Anoop
- Transformerfam: Feedback Attention Is Working Memory Hwang Dongseong, Wang Weiran, Huo Zhuoyuan, Sim Khe Chai, Mengibar Pedro Moreno
- Self-explore To Avoid The Pit: Improving The Reasoning Capabilities Of Language Models With Fine-grained Rewards Hwang Hyeonbin, Kim Doyoung, Kim Seungone, Ye Seonghyeon, Seo Minjoon
- Pc-lora: Low-rank Adaptation For Progressive Model Compression With Knowledge Distillation Hwang Injoon, Park Haewon, Lee Youngwan, Yang Jooyoung, Maeng Sunjae
- News Without Borders: Domain Adaptation Of Multilingual Sentence Embeddings For Cross-lingual News Recommendation Iana Andreea, Schmidt Fabian David, Glavaš Goran, Paulheim Heiko
- Test-time Low Rank Adaptation Via Confidence Maximization For Zero-shot Generalization Of Vision-language Models Imam Raza, Gani Hanan, Huzaifa Muhammad, Nandakumar Karthik
- Improving Multilingual Instruction Finetuning Via Linguistically Natural And Diverse Datasets Indurthi Sathish Reddy, Zhou Wenxuan, Chollampatt Shamil, Agrawal Ravi, Song Kaiqiang, Zhao Lingxiao, Zhu Chenguang
- Hallucinations Or Attention Misdirection? The Path To Strategic Value Extraction In Business Using Large Language Models Ioste Aline
- Scaling Laws For Downstream Task Performance Of Large Language Models Isik Berivan, Ponomareva Natalia, Hazimeh Hussein, Paparas Dimitris, Vassilvitskii Sergei, Koyejo Sanmi
- Autocompletion Of Chief Complaints In The Electronic Health Records Using Large Language Models Islam K M Sajjadul, Nipu Ayesha Siddika, Madiraju Praveen, Deshpande Priya
- From Loops To Oops: Fallback Behaviors Of Language Models Under Uncertainty Ivgi Maor, Yoran Ori, Berant Jonathan, Geva Mor
- Quality Or Quantity? On Data Scale And Diversity In Adapting Large Language Models For Low-resource Translation Iyer Vivek, Malik Bhavitvya, Stepachev Pavel, Chen Pinzhen, Haddow Barry, Birch Alexandra
- Response Generation For Cognitive Behavioral Therapy With Large Language Models: Comparative Study With Socratic Questioning Izumi Kenta, Tanaka Hiroki, Shidara Kazuhiro, Adachi Hiroyoshi, Kanayama Daisuke, Kudo Takashi, Nakamura Satoshi
- Ffn-skipllm: A Hidden Gem For Autoregressive Decoding With Adaptive Feed Forward Skipping Jaiswal Ajay, Hu Bodun, Yin Lu, Ro Yeonju, Liu Shiwei, Chen Tianlong, Akella Aditya
- Jamba-1.5: Hybrid Transformer-mamba Models At Scale Jamba Team, Lenz Barak, Arazi Alan, Bergman Amir, Manevich Avshalom, Peleg Barak, Aviram Ben, Almagor Chen, Fridman Clara, Padnos Dan, Gissin Daniel, Jannai Daniel, Muhlgay Dor, Zimberg Dor, Gerber Edden M, Dolev Elad, Krakovsky Eran, Safahi Erez, Schwartz Erez, Cohen Gal, Shachaf Gal, Rozenblum Haim, Bata Hofit, Blass Ido, Magar Inbal, Dalmedigos Itay, Osin Jhonathan, Fadlon Julie, Rozman Maria, Danos Matan, Gokhman Michael, Zusman Mor, Gidron Naama, Ratner Nir, Gat Noam, Rozen Noam, Fried Oded, Leshno Ohad, Antverg Omer, Abend Omri, Lieber Opher, Dagan Or, Cohavi Orit, Alon Raz, Belson Ro'i, Cohen Roi, Gilad Rom, Glozman Roman, Lev Shahar, Meirom Shaked, Delbari Tal, Ness Tal, Asida Tomer, Gal Tom Ben, Braude Tom, Pumerantz Uriya, Cohen Yehoshua, Belinkov Yonatan, Globerson Yuval, Levy Yuval Peleg, Shoham Yoav
- KIT-19: A Comprehensive Korean Instruction Toolkit On 19 Tasks For Fine-tuning Korean Large Language Models Jang Dongjun, Byun Sungjoo, Jo Hyemi, Shin Hyopil
- Performance Of Recent Large Language Models For A Low-resourced Language Jayakody Ravindu, Dias Gihan
- L4Q: Parameter Efficient Quantization-aware Fine-tuning On Large Language Models Jeon Hyesung, Kim Yulhwa, Kim Jae-joon
- RLSF: Reinforcement Learning Via Symbolic Feedback Jha Piyush, Jana Prithwish, Arora Arnav, Ganesh Vijay
- Precision Empowers, Excess Distracts: Visual Question Answering With Dynamically Infused Knowledge In Language Models Jhalani Manas, M Annervaz K, Bhattacharyya Pushpak
- Language Models Resist Alignment Ji Jiaming, Wang Kaile, Qiu Tianyi, Chen Boyuan, Zhou Jiayi, Li Changye, Lou Hantao, Yang Yaodong
- Llms Could Autonomously Learn Without External Supervision Ji Ke, Chen Junying, Gao Anningzhe, Xie Wenya, Wan Xiang, Wang Benyou
- Can't Say Cant? Measuring And Reasoning Of Dark Jargons In Large Language Models Ji Xu, Zhang Jianyi, Zhou Ziyin, Zhao Zhangchi, Qiao Qianqian, Han Kaiying, Hossen Md Imran, Hei Xiali
- Linking In-context Learning In Transformers To Human Episodic Memory Ji-an Li, Zhou Corey Y., Benna Marcus K., Mattar Marcelo G.
- Bwarea Model: Learning World Model, Inverse Dynamics, And Policy For Controllable Language Generation Jia Chengxing, Wang Pengyuan, Li Ziniu, Li Yi-chen, Zhang Zhilong, Tang Nan, Yu Yang
- Heuristic-enhanced Candidates Selection Strategy For Gpts Tackle Few-shot Aspect-based Sentiment Analysis Jiang Baoxing, Wan Yujie, Ju Shenggen
- Interpretable Catastrophic Forgetting Of Large Language Model Fine-tuning Via Instruction Vector Jiang Gangwei, Jiang Caigao, Li Zhaoyi, Xue Siqiao, Zhou Jun, Song Linqi, Lian Defu, Wei Ying
- Time Matters: Enhancing Pre-trained News Recommendation Models With Robust User Dwell Time Injection Jiang Hao, Li Chuanzhen, An Mingxiao
- Minference 1.0: Accelerating Pre-filling For Long-context Llms Via Dynamic Sparse Attention Jiang Huiqiang, Li Yucheng, Zhang Chengruidong, Wu Qianhui, Luo Xufang, Ahn Surin, Han Zhenhua, Abdi Amir H., Li Dongsheng, Lin Chin-yew, Yang Yuqing, Qiu Lili
- Can Large Language Models Generate High-quality Patent Claims? Jiang Lekang, Zhang Caiqi, Scherz Pascal A, Goetz Stephan
- Turning Generative Models Degenerate: The Power Of Data Poisoning Attacks Jiang Shuli, Kadhe Swanand Ravindra, Zhou Yi, Ahmed Farhan, Cai Ling, Baracaldo Nathalie
- Mora: High-rank Updating For Parameter-efficient Fine-tuning Jiang Ting, Huang Shaohan, Luo Shengyue, Zhang Zihan, Huang Haizhen, Wei Furu, Deng Weiwei, Sun Feng, Zhang Qi, Wang Deqing, Zhuang Fuzhen
- SEED: Customize Large Language Models With Sample-efficient Adaptation For Code Generation Jiang Xue, Dong Yihong, Jin Zhi, Li Ge
- Large Visual-language Models Are Also Good Classifiers: A Study Of In-context Multimodal Fake News Detection Jiang Ye, Wang Yimin
- Do Llms Dream Of Elephants (when Told Not To)? Latent Concept Association And Associative Memory In Transformers Jiang Yibo, Rajendran Goutham, Ravikumar Pradeep, Aragam Bryon
- LLM Maybe Longlm: Self-extend LLM Context Window Without Tuning Jin Hongye, Han Xiaotian, Yang Jingfeng, Jiang Zhimeng, Liu Zirui, Chang Chia-yuan, Chen Huiyuan, Hu Xia
- BIDER: Bridging Knowledge Inconsistency For Efficient Retrieval-augmented Llms Via Key Supporting Evidence Jin Jiajie, Zhu Yutao, Zhou Yujia, Dou Zhicheng
- Crimson: Empowering Strategic Reasoning In Cybersecurity Through Large Language Models Jin Jiandong, Tang Bowen, Ma Mingxuan, Liu Xiao, Wang Yunfei, Lai Qingnan, Yang Jia, Zhou Changling
- Hidden Flaws Behind Expert-level Accuracy Of Multimodal GPT-4 Vision In Medicine Jin Qiao, Chen Fangyuan, Zhou Yiliang, Xu Ziyang, Cheung Justin M., Chen Robert, Summers Ronald M., Rousseau Justin F., Ni Peiyun, Landsman Marc J, Baxter Sally L., Al'aref Subhi J., Li Yijia, Chen Alex, Brejt Josef A., Chiang Michael F., Peng Yifan, Lu Zhiyong
- Adaptive Skeleton Graph Decoding Jin Shuowei, Wu Yongji, Zheng Haizhong, Zhang Qingzhao, Lentz Matthew, Mao Z. Morley, Prakash Atul, Qian Feng, Zhuo Danyang
- Self-selected Attention Span For Accelerating Large Language Model Inference Jin Tian, Yazar Wanzin, Xu Zifei, Sharify Sayeh, Wang Xin
- Read To Play (r2-play): Decision Transformer With Multimodal Game Instruction Jin Yonggang, Zhang Ge, Zhao Hao, Zheng Tianyu, Guo Jarvi, Xiang Liuyu, Yue Shawn, Huang Stephen W., He Zhaofeng, Fu Jie
- Birbal: An Efficient 7B Instruct-model Fine-tuned With Curated Datasets Jindal Ashvini Kumar, Rajpoot Pawan Kumar, Parikh Ankur
- A2SF: Accumulative Attention Scoring With Forgetting Factor For Token Pruning In Transformer Decoder Jo Hyun-rae, Shin Dongkun
- Llm-based Federated Recommendation Zhao Jujia, Wang Wenjie, Xu Chen, Ren Zhaochun, Ng See-kiong, Chua Tat-seng
- Pmoe: Progressive Mixture Of Experts With Asymmetric Transformer For Continual Learning Jung Min Jae, Kim Joohee
- Dipt: Enhancing LLM Reasoning Through Diversified Perspective-taking Just Hoang Anh, Dabas Mahavir, Huang Lifu, Jin Ming, Jia Ruoxi
- Beyond Labels: Aligning Large Language Models With Human-like Reasoning Kabir Muhammad Rafsan, Sultan Rafeed Mohammad, Asif Ihsanul Haque, Ahad Jawad Ibn, Rahman Fuad, Amin Mohammad Ruhul, Mohammed Nabeel, Rahman Shafin
- Evaluating The Impact Of Advanced LLM Techniques On Ai-lecture Tutors For A Robotics Course Kahl Sebastian, Löffler Felix, Maciol Martin, Ridder Fabian, Schmitz Marius, Spanagel Jennifer, Wienkamp Jens, Burgahn Christopher, Schilling Malte
- Scaling Laws For Forgetting When Fine-tuning Large Language Models Kalajdzievski Damjan
- Understanding The Effect Of Using Semantically Meaningful Tokens For Visual Representation Learning Kalibhat Neha, Kattakinda Priyatham, Zarei Arman, Seleznev Nikita, Sharpe Samuel, Kumar Senthil, Feizi Soheil
- Counterfactuals As A Means For Evaluating Faithfulness Of Attribution Methods In Autoregressive Language Models Kamahi Sepehr, Yaghoobzadeh Yadollah
- When Can Llms Actually Correct Their Own Mistakes? A Critical Survey Of Self-correction Of Llms Kamoi Ryo, Zhang Yusen, Zhang Nan, Han Jiawei, Zhang Rui
- Domain-specific Improvement On Psychotherapy Chatbot Using Assistant Kang Cheng, Novak Daniel, Urbanova Katerina, Cheng Yuqing, Hu Yong
- Mindstar: Enhancing Math Reasoning In Pre-trained Llms At Inference Time Kang Jikun, Li Xin Zhe, Chen Xi, Kazemi Amirreza, Sun Qianyi, Chen Boxing, Li Dong, He Xu, He Quan, Wen Feng, Hao Jianye, Yao Jun
- C-RAG: Certified Generation Risks For Retrieval-augmented Language Models Kang Mintong, Gürel Nezihe Merve, Yu Ning, Song Dawn, Li Bo
- Large Language Models Must Be Taught To Know What They Don't Know Kapoor Sanyam, Gruver Nate, Roberts Manley, Collins Katherine, Pal Arka, Bhatt Umang, Weller Adrian, Dooley Samuel, Goldblum Micah, Wilson Andrew Gordon
- From Text To Transformation: A Comprehensive Review Of Large Language Models' Versatility Kaur Pravneet, Kashyap Gautam Siddharth, Kumar Ankit, Nafis Md Tabrez, Kumar Sandeep, Shokeen Vikrant
- Defending Large Language Models Against Attacks With Residual Stream Activation Analysis Kawasaki Amelia, Davis Andrew, Abbas Houssam
- Source-aware Training Enables Knowledge Attribution In Language Models Khalifa Muhammad, Wadden David, Strubell Emma, Lee Honglak, Wang Lu, Beltagy Iz, Peng Hao
- Indicllmsuite: A Blueprint For Creating Pre-training And Fine-tuning Datasets For Indian Languages Khan Mohammed Safi Ur Rahman, Mehta Priyam, Sankar Ananth, Kumaravelan Umashankar, Doddapaneni Sumanth, G Suriyaprasaad, G Varun Balan, Jain Sparsh, Kunchukuttan Anoop, Kumar Pratyush, Dabre Raj, Khapra Mitesh M.
- A Comprehensive Survey Of Accelerated Generation Techniques In Large Language Models Khoshnoodi Mahsa, Jain Vinija, Gao Mingye, Srikanth Malavika, Chadha Aman
- Guiding Large Language Models To Post-edit Machine Translation With Error Annotations Ki Dayeon, Carpuat Marine
- Shortened Llama: Depth Pruning For Large Language Models With Comparison Of Retraining Methods Kim Bo-kyeong, Kim Geonmin, Kim Tae-ho, Castells Thibault, Choi Shinkook, Shin Junho, Song Hyoung-kyu
- Understanding Large-language Model (llm)-powered Human-robot Interaction Kim Callie Y., Lee Christine P., Mutlu Bilge
- Mhgpt: A Lightweight Generative Pre-trained Transformer For Mental Health Text Analysis Kim Dae-young, Hwa Rebecca, Rahman Muhammad Mahbubur
- Promptkd: Distilling Student-friendly Knowledge For Generative Language Models Via Prompt Tuning Kim Gyeongman, Jang Doohyuk, Yang Eunho
- On Initializing Transformers With Pre-trained Embeddings Kim Ha Young, Balasubramanian Niranjan, Kang Byungkon
- Integrating Paralinguistics In Speech-empowered Large Language Models For Natural Conversation Kim Heeseung, Seo Soonshin, Jeong Kyeongseok, Kwon Ohsung, Kim Soyoon, Kim Jungwhan, Lee Jaehong, Song Eunwoo, Oh Myungwoo, Ha Jung-woo, Yoon Sungroh, Yoo Kang Min
- Fine-tuning CLIP Text Encoders With Two-step Paraphrasing Kim Hyunjae, Yoon Seunghyun, Bui Trung, Zhao Handong, Tran Quan, Dernoncourt Franck, Kang Jaewoo
- Sure: Summarizing Retrievals Using Answer Candidates For Open-domain QA Of Llms Kim Jaehyung, Nam Jaehyun, Mo Sangwoo, Park Jongjin, Lee Sang-woo, Seo Minjoon, Ha Jung-woo, Shin Jinwoo
- Openvla: An Open-source Vision-language-action Model Kim Moo Jin, Pertsch Karl, Karamcheti Siddharth, Xiao Ted, Balakrishna Ashwin, Nair Suraj, Rafailov Rafael, Foster Ethan, Lam Grace, Sanketi Pannag, Vuong Quan, Kollar Thomas, Burchfiel Benjamin, Tedrake Russ, Sadigh Dorsa, Levine Sergey, Liang Percy, Finn Chelsea
- Large Language Models Meet Collaborative Filtering: An Efficient All-round Llm-based Recommender System Kim Sein, Kang Hongseok, Choi Seungyoon, Kim Donghyun, Yang Minchul, Park Chanyoung
- Is GPT-4 Alone Sufficient For Automated Essay Scoring?: A Comparative Judgment Approach Based On Rater Cognition Kim Seungju, Jo Meounggun
- Groundial: Human-norm Grounded Safe Dialog Response Generation Kim Siwon, Dai Shuyang, Kachuee Mohammad, Ray Shayan, Taghavi Tara, Yoon Sungroh
- Llmem: Estimating GPU Memory Usage For Fine-tuning Pre-trained Llms Kim Taeho, Wang Yanming, Chaturvedi Vatshank, Gupta Lokesh, Kim Seyeon, Kwon Yongin, Ha Sangtae
- Health-llm: Large Language Models For Health Prediction Via Wearable Sensor Data Kim Yubin, Xu Xuhai, Mcduff Daniel, Breazeal Cynthia, Park Hae Won
- From Bytes To Borsch: Fine-tuning Gemma And Mistral For The Ukrainian Language Representation Kiulian Artur, Polishko Anton, Khandoga Mykola, Chubych Oryna, Connor Jack, Ravishankar Raghav, Shirawalmath Adarsh
- Contrastive Perplexity For Controlled Generation: An Application In Detoxifying Large Language Models Klein Tassilo, Nabi Moin
- Extended Mind Transformers Klett Phoebe, Ahle Thomas
- Blending Llms Into Cascaded Speech Translation: Kit's Offline Speech Translation System For IWSLT 2024 Koneru Sai, Nguyen Thai-binh, Pham Ngoc-quan, Liu Danni, Li Zhaolin, Waibel Alexander, Niehues Jan
- Self-prompt Tuning: Enable Autonomous Role-playing In Llms Kong Aobo, Zhao Shiwan, Chen Hao, Li Qicheng, Qin Yong, Sun Ruiqi, Zhou Xin, Zhou Jiaming, Sun Haoqin
- Labeling Supervised Fine-tuning Data With The Scaling Law Kong Huanjun
- Aligning Large Language Models With Representation Editing: A Control Perspective Kong Lingkai, Wang Haorui, Mu Wenhao, Du Yuanqi, Zhuang Yuchen, Zhou Yifei, Song Yue, Zhang Rongzhi, Wang Kai, Zhang Chao
- Customizing Language Models With Instance-wise Lora For Sequential Recommendation Kong Xiaoyu, Wu Jiancan, Zhang An, Sheng Leheng, Lin Hui, Wang Xiang, He Xiangnan
- Large Language Model Driven Recommendation Korikov Anton, Sanner Scott, Deldjoo Yashar, He Zhankui, Mcauley Julian, Ramisa Arnau, Vidal Rene, Sathiamoorthy Mahesh, Kasrizadeh Atoosa, Milano Silvia, Ricci Francesco
- Transllama: Llm-based Simultaneous Translation System Koshkin Roman, Sudoh Katsuhito, Nakamura Satoshi
- Llms Are Zero-shot Context-aware Simultaneous Translators Koshkin Roman, Sudoh Katsuhito, Nakamura Satoshi
- Hints-in-browser: Benchmarking Language Models For Programming Feedback Generation Kotalwar Nachiket, Gotovos Alkis, Singla Adish
- Arabiangpt: Native Arabic Gpt-based Large Language Model Koubaa Anis, Ammar Adel, Ghouti Lahouari, Najar Omar, Sibaee Serry
- Can Large Language Models Explore In-context? Krishnamurthy Akshay, Harris Keegan, Foster Dylan J., Zhang Cyril, Slivkins Aleksandrs
- Can Better Text Semantics In Prompt Tuning Improve VLM Generalization? Kuchibhotla Hari Chandana, Kancheti Sai Srinivas, Reddy Abbavaram Gowtham, Balasubramanian Vineeth N
- Fine-tuning, Quantization, And Llms: Navigating Unintended Outcomes Kumar Divyanshu, Kumar Anurakt, Agarwal Sahil, Harshangi Prashanth
- Bridging The Gap: Dynamic Learning Strategies For Improving Multilingual Performance In Llms Kumar Somnath, Balloli Vaibhav, Ranjit Mercy, Ahuja Kabir, Ganu Tanuja, Sitaram Sunayana, Bali Kalika, Nambi Akshay
- Properties And Challenges Of Llm-generated Explanations Kunz Jenny, Kuhlmann Marco
- Babilong: Testing The Limits Of Llms With Long Context Reasoning-in-a-haystack Kuratov Yuri, Bulatov Aydar, Anokhin Petr, Rodkin Ivan, Sorokin Dmitry, Sorokin Artyom, Burtsev Mikhail
- Teaching Llama A New Language Through Cross-lingual Knowledge Transfer Kuulmets Hele-andra, Purason Taido, Luhtaru Agnes, Fishel Mark
- Continual Learning Optimizations For Auto-regressive Decoder Of Multilingual ASR Systems Kwok Chin Yuen, Yip Jia Qi, Chng Eng Siong
- Enhancing Multi-domain Automatic Short Answer Grading Through An Explainable Neuro-symbolic Pipeline Künnecke Felix, Filighera Anna, Leong Colin, Steuer Tim
- Thalle: Text Hyperlocally Augmented Large Language Extension -- Technical Report Labs Kbtg, Khamnuansin Danupat, Petchsod Atthakorn, Lertpiya Anuruth, Balee Pornchanan, Lodkaew Thanawat, Chalothorn Tawunrat, Pongthawornkamol Thadpong, Lertsutthiwong Monchai
- The Remarkable Robustness Of Llms: Stages Of Inference? Lad Vedang, Gurnee Wes, Tegmark Max
- Residual-based Language Models Are Free Boosters For Biomedical Imaging Lai Zhixin, Wu Jing, Chen Suiyao, Zhou Yucheng, Hovakimyan Naira
- From Distributional To Overton Pluralism: Investigating Large Language Model Alignment Lake Thom, Choi Eunsol, Durrett Greg
- Self-directed Synthetic Dialogues And Revisions Technical Report Lambert Nathan, Schoelkopf Hailey, Gokaslan Aaron, Soldaini Luca, Pyatkin Valentina, Castricato Louis
- Lapdoc: Layout-aware Prompting For Documents Lamott Marcel, Weweler Yves-noel, Ulges Adrian, Shafait Faisal, Krechel Dirk, Obradovic Darko
- Fine-tuning Language Models With Reward Learning On Policy Lang Hao, Huang Fei, Li Yongbin
- Large Language Models As Evolution Strategies Lange Robert Tjarko, Tian Yingtao, Tang Yujin
- Enhancing Neural Machine Translation Of Low-resource Languages: Corpus Development, Human Evaluation And Explainable AI Architectures Lankford Séamus
- Adaptmllm: Fine-tuning Multilingual Language Models On Low-resource Languages With Integrated LLM Playgrounds Lankford Séamus, Afli Haithem, Way Andy
- Human Evaluation Of English-irish Transformer-based NMT Lankford Séamus, Afli Haithem, Way Andy
- Adaptnmt: An Open-source, Language-agnostic Development Environment For Neural Machine Translation Lankford Séamus, Afli Haithem, Way Andy
- Transformers For Low-resource Languages: Is Féidir Linn! Lankford Séamus, Afli Haithem, Way Andy
- Melting Point: Mobile Evaluation Of Language Transformers Laskaridis Stefanos, Katevas Kleomenis, Minto Lorenzo, Haddadi Hamed
- Sparse Rewards Can Self-train Dialogue Agents Lattimer Barrett Martin, Gangal Varun, Mcdonald Ryan, Yang Yi
- What Matters When Building Vision-language Models? Laurençon Hugo, Tronchon Léo, Cord Matthieu, Sanh Victor
- Reallm: A General Framework For LLM Compression And Fine-tuning Leconte Louis, Bedin Lisa, Nguyen Van Minh, Moulines Eric
- MATTER: Memory-augmented Transformer Using Heterogeneous Knowledge Sources Lee Dongkyu, Prakash Chandana Satya, Fitzgerald Jack, Lehmann Jens
- Improving Conversational Abilities Of Quantized Large Language Models Via Direct Preference Alignment Lee Janghwan, Park Seongmin, Hong Sukjin, Kim Minsoo, Chang Du-seong, Choi Jungwook
- CATS: Contextually-aware Thresholding For Sparsity In Large Language Models Lee Je-yong, Lee Donghyun, Zhang Genghan, Tiwari Mo, Mirhoseini Azalia
- A Survey Of Large Language Models In Finance (finllms) Lee Jean, Stevens Nicholas, Han Soyeon Caren, Song Minseok
- LRQ: Optimizing Post-training Quantization For Large Language Models By Learning Low-rank Weight-scaling Matrices Lee Jung Hyun, Kim Jeonghoon, Yang June Yong, Kwon Se Jung, Yang Eunho, Yoo Kang Min, Lee Dongsoo
- Token-supervised Value Models For Enhancing Mathematical Reasoning Capabilities Of Large Language Models Lee Jung Hyun, Yang June Yong, Heo Byeongho, Han Dongyoon, Yoo Kang Min
- Multimodal Reasoning With Multimodal Knowledge Graph Lee Junlin, Wang Yequan, Li Jing, Zhang Min
- Reinforcement Learning From Reflective Feedback (RLRF): Aligning And Improving Llms Via Fine-grained Self-reflection Lee Kyungjae, Hwang Dasol, Park Sunghyun, Jang Youngsoo, Lee Moontae
- LLM2LLM: Boosting Llms With Novel Iterative Data Enhancement Lee Nicholas, Wattanawong Thanakul, Kim Sehoon, Mangalam Karttikeya, Shen Sheng, Anumanchipalli Gopala, Mahoney Michael W., Keutzer Kurt, Gholami Amir
- Interactive Text-to-image Retrieval With Large Language Models: A Plug-and-play Approach Lee Saehyung, Yu Sangwon, Park Junsung, Yi Jihun, Yoon Sungroh
- Effective And Efficient Conversation Retrieval For Dialogue State Tracking With Implicit Text Summaries Lee Seanie, Cheng Jianpeng, Driesen Joris, Coca Alexandru, Johannsen Anders
- Infinigen: Efficient Generative Inference Of Large Language Models With Dynamic KV Cache Management Lee Wonbeom, Lee Jungi, Seo Junghwan, Sim Jaewoong
- Towards Understanding Counseling Conversations: Domain Knowledge And Large Language Models Lee Younghun, Goldwasser Dan, Reese Laura Schwab
- Meta-task Prompting Elicits Embeddings From Large Language Models Lei Yibin, Wu Di, Zhou Tianyi, Shen Tao, Cao Yu, Tao Chongyang, Yates Andrew
- Small-e: Small Language Model With Linear Attention For Efficient Speech Synthesis Lemerle Théodor, Obin Nicolas, Roebel Axel
- Cascade Reward Sampling For Efficient Decoding-time Alignment Li Bolian, Wang Yifan, Grama Ananth, Zhang Ruqi
- Are Bigger Encoders Always Better In Vision Large Models? Li Bozhou, Liang Hao, Meng Zimo, Zhang Wentao
- Recall, Retrieve And Reason: Towards Better In-context Relation Extraction Li Guozheng, Wang Peng, Ke Wenjun, Guo Yikai, Ji Ke, Shang Ziyu, Liu Jiajun, Xu Zijie
- Gradient-mask Tuning Elevates The Upper Limits Of LLM Performance Li Haoling, Zhang Xin, Liu Xiao, Gong Yeyun, Wang Yifan, Yang Yujiu, Chen Qi, Cheng Peng
- Backdoor Removal For Generative Large Language Models Li Haoran, Chen Yulin, Zheng Zihao, Hu Qi, Chan Chunkit, Liu Heshan, Song Yangqiu
- Look Within, Why Llms Hallucinate: A Causal Perspective Li He, Chi Haoang, Liu Mingyu, Yang Wenjing
- Revisiting Catastrophic Forgetting In Large Language Model Tuning Li Hongyu, Ding Liang, Fang Meng, Tao Dacheng
- Datacomp-lm: In Search Of The Next Generation Of Training Sets For Language Models Li Jeffrey, Fang Alex, Smyrnis Georgios, Ivgi Maor, Jordan Matt, Gadre Samir, Bansal Hritik, Guha Etash, Keh Sedrick, Arora Kushal, Garg Saurabh, Xin Rui, Muennighoff Niklas, Heckel Reinhard, Mercat Jean, Chen Mayee, Gururangan Suchin, Wortsman Mitchell, Albalak Alon, Bitton Yonatan, Nezhurina Marianna, Abbas Amro, Hsieh Cheng-yu, Ghosh Dhruba, Gardner Josh, Kilian Maciej, Zhang Hanlin, Shao Rulin, Pratt Sarah, Sanyal Sunny, Ilharco Gabriel, Daras Giannis, Marathe Kalyani, Gokaslan Aaron, Zhang Jieyu, Chandu Khyathi, Nguyen Thao, Vasiljevic Igor, Kakade Sham, Song Shuran, Sanghavi Sujay, Faghri Fartash, Oh Sewoong, Zettlemoyer Luke, Lo Kyle, El-nouby Alaaeldin, Pouransari Hadi, Toshev Alexander, Wang Stephanie, Groeneveld Dirk, Soldaini Luca, Koh Pang Wei, Jitsev Jenia, Kollar Thomas, Dimakis Alexandros G., Carmon Yair, Dave Achal, Schmidt Ludwig, Shankar Vaishaal
- A Transformer With Stack Attention Li Jiaoda, White Jennifer C., Sachan Mrinmaya, Cotterell Ryan
- A Cross-language Investigation Into Jailbreak Attacks In Large Language Models Li Jie, Liu Yi, Liu Chongyang, Shi Ling, Ren Xiaoning, Zheng Yaowen, Liu Yang, Xue Yinxing
- Dissecting Human And LLM Preferences Li Junlong, Zhou Fan, Sun Shichao, Zhang Yikai, Zhao Hai, Liu Pengfei
- Measuring And Controlling Instruction (in)stability In Language Model Dialogs Li Kenneth, Liu Tianle, Bashkansky Naomi, Bau David, Viégas Fernanda, Pfister Hanspeter, Wattenberg Martin
- On The Power Of Convolution Augmented Transformer Li Mingchen, Zhang Xuechen, Huang Yixiao, Oymak Samet
- Bridging The Language Gap: Enhancing Multilingual Prompt-based Code Generation In Llms Via Zero-shot Cross-lingual Transfer Li Mingda, Mishra Abhijit, Mujumdar Utkarsh
- Red Teaming Visual Language Models Li Mukai, Li Lei, Yin Yuwei, Ahmed Masood, Liu Zhenguang, Liu Qi
- Owlore: Outlier-weighed Layerwise Sampled Low-rank Projection For Memory-efficient LLM Fine-tuning Li Pengxiang, Yin Lu, Gao Xiaowei, Liu Shiwei
- Coevol: Constructing Better Responses For Instruction Finetuning Through Multi-agent Cooperation Li Renhao, Tan Minghuan, Wong Derek F., Yang Min
- Scilitllm: How To Adapt Llms For Scientific Literature Understanding Li Sihang, Huang Jin, Zhuang Jiaxi, Shi Yaorui, Cai Xiaochen, Xu Mingjun, Wang Xiang, Zhang Linfeng, Ke Guolin, Cai Hengxing
- Improving Faithfulness Of Large Language Models In Summarization Via Sliding Generation And Self-consistency Li Taiji, Li Zhi, Zhang Yin
- Is Programming By Example Solved By Llms? Li Wen-ding, Ellis Kevin
- Stbench: Assessing The Ability Of Large Language Models In Spatio-temporal Analysis Li Wenbin, Yao Di, Zhao Ruibo, Chen Wenjie, Xu Zijie, Luo Chengxue, Gong Chang, Jing Quanliang, Tan Haining, Bi Jingping
- Uio-llms: Unbiased Incremental Optimization For Long-context Llms Li Wenhao, Lin Mingbao, Zhong Yunshan, Yan Shuicheng, Ji Rongrong
- Unlocking Large Language Model's Planning Capabilities With Maximum Diversity Fine-tuning Li Wenjun, Chen Changyu, Varakantham Pradeep
- MM-TTS: A Unified Framework For Multimodal, Prompt-induced Emotional Text-to-speech Synthesis Li Xiang, Cheng Zhi-qi, He Jun-yan, Peng Xiaojiang, Hauptmann Alexander G.
- 52B To 1T: Lessons Learned Via Tele-flm Series Li Xiang, Yao Yiqun, Jiang Xin, Fang Xuezhi, Wang Chao, Liu Xinzhang, Wang Zihan, Zhao Yu, Wang Xin, Huang Yuyao, Song Shuangyong, Li Yongxiang, Zhang Zheng, Zhao Bo, Sun Aixin, Wang Yequan, He Zhongjiang, Wang Zhongyuan, Li Xuelong, Huang Tiejun
- What If We Recaption Billions Of Web Images With Llama-3? Li Xianhang, Tu Haoqin, Hui Mude, Wang Zeyu, Zhao Bingchen, Xiao Junfei, Ren Sucheng, Mei Jieru, Liu Qing, Zheng Huangjie, Zhou Yuyin, Xie Cihang
- Making Long-context Language Models Better Multi-hop Reasoners Li Yanyang, Liang Shuo, Lyu Michael R., Wang Liwei
- Badedit: Backdooring Large Language Models By Model Editing Li Yanzhou, Li Tianlin, Chen Kangjie, Zhang Jian, Liu Shangqing, Wang Wenhan, Zhang Tianwei, Liu Yang
- Calrec: Contrastive Alignment Of Generative Llms For Sequential Recommendation Li Yaoyiran, Zhai Xiang, Alzantot Moustafa, Yu Keyi, Vulić Ivan, Korhonen Anna, Hammad Mohamed
- Q-adapter: Training Your LLM Adapter As A Residual Q-function Li Yi-chen, Zhang Fuxiang, Qiu Wenjie, Yuan Lei, Jia Chengxing, Zhang Zongzhang, Yu Yang
- Womd-reasoning: A Large-scale Language Dataset For Interaction And Driving Intentions Reasoning Li Yiheng, Ge Chongjian, Li Chenran, Xu Chenfeng, Tomizuka Masayoshi, Tang Chen, Ding Mingyu, Zhan Wei
- Dynamic Data Sampler For Cross-language Transfer Learning In Large Language Models Li Yudong, Feng Yuhao, Zhou Wen, Zhao Zhe, Shen Linlin, Hou Cheng, Hou Xianxu
- Spin Glass Model Of In-context Learning Li Yuhao, Bai Ruoran, Huang Haiping
- EAGLE: Speculative Sampling Requires Rethinking Feature Uncertainty Li Yuhui, Wei Fangyun, Zhang Chao, Zhang Hongyang
- Llms Meet Long Video: Advancing Long Video Question Answering With An Interactive Visual Adapter In Llms Li Yunxin, Chen Xinyu, Hu Baotian, Zhang Min
- Amphista: Accelerate LLM Inference With Bi-directional Multiple Drafting Heads In A Non-autoregressive Style Li Zeping, Yang Xinlong, Gao Ziheng, Liu Ji, Liu Zhuang, Li Dong, Peng Jinzhang, Tian Lu, Barsoum Emad
- Text-to-model: Text-conditioned Neural Network Diffusion For Train-once-for-all Personalization Li Zexi, Gao Lingzhi, Wu Chao
- Do Vision-language Transformers Exhibit Visual Commonsense? An Empirical Study Of VCR Li Zhenyang, Guo Yangyang, Wang Kejie, Chen Xiaolin, Nie Liqiang, Kankanhalli Mohan
- Focusllm: Scaling Llm's Context By Parallel Decoding Li Zhenyu, Zhang Yike, Pan Tengyu, Sun Yutao, Duan Zhichao, Fang Junjie, Han Rong, Wang Zixuan, Wang Jianyong
- EAGLE: Elevating Geometric Reasoning Through Llm-empowered Visual Instruction Tuning Li Zhihao, Du Yao, Liu Yang, Zhang Yan, Liu Yufang, Zhang Mengdi, Cai Xunliang
- Hunyuan-dit: A Powerful Multi-resolution Diffusion Transformer With Fine-grained Chinese Understanding Li Zhimin, Zhang Jianwei, Lin Qin, Xiong Jiangfeng, Long Yanxin, Deng Xinchi, Zhang Yingfang, Liu Xingchao, Huang Minbin, Xiao Zedong, Chen Dayou, He Jiajun, Li Jiahao, Li Wenyue, Zhang Chen, Quan Rongwei, Lu Jianxiang, Huang Jiabin, Yuan Xiaoyan, Zheng Xiaoxiao, Li Yixuan, Zhang Jihong, Zhang Chao, Chen Meng, Liu Jie, Fang Zheng, Wang Weiyan, Xue Jinbao, Tao Yangyu, Zhu Jianchen, Liu Kai, Lin Sihuan, Sun Yifu, Li Yun, Wang Dongdong, Chen Mingtao, Hu Zhichao, Xiao Xiao, Chen Yan, Liu Yuhong, Liu Wei, Wang Di, Yang Yong, Jiang Jie, Lu Qinglin
- Two Stacks Are Better Than One: A Comparison Of Language Modeling And Translation As Multilingual Pretraining Objectives Li Zihao, Ji Shaoxiong, Mickus Timothee, Segonne Vincent, Tiedemann Jörg
- 500xcompressor: Generalized Prompt Compression For Large Language Models Li Zongqian, Su Yixuan, Collier Nigel
- What Is The Best Model? Application-driven Evaluation For Large Language Models Lian Shiguo, Zhao Kaikai, Liu Xinhui, Lei Xuejiao, Yang Bikun, Zhang Wenjing, Wang Kai, Liu Zhaoxiang
- Vl-trojan: Multimodal Instruction Backdoor Attacks Against Autoregressive Visual Language Models Liang Jiawei, Liang Siyuan, Luo Man, Liu Aishan, Han Dongchen, Chang Ee-chien, Cao Xiaochun
- Controllable Text Generation For Large Language Models: A Survey Liang Xun, Wang Hanyu, Wang Yezhaohui, Song Shichao, Yang Jiawei, Niu Simin, Hu Jie, Liu Dan, Yao Shunyu, Xiong Feiyu, Li Zhiyu
- Why Are My Prompts Leaked? Unraveling Prompt Extraction Threats In Customized Large Language Models Liang Zi, Hu Haibo, Ye Qingqing, Xiao Yaxin, Li Haoyang
- IKUN For WMT24 General MT Task: Llms Are Here For Multilingual Machine Translation Liao Baohao, Herold Christian, Khadivi Shahram, Monz Christof
- Beyond KV Caching: Shared Attention For Efficient Llms Liao Bingli, Vargas Danilo Vasconcellos
- Extending Token Computation For LLM Reasoning Liao Bingli, Vargas Danilo Vasconcellos
- Enhancing Reinforcement Learning With Label-sensitive Reward For Natural Language Understanding Liao Kuo, Li Shuang, Zhao Meng, Liu Liqun, Xue Mengge, Hu Zhenyu, Han Honglin, Yin Chengguo
- MARIO: Math Reasoning With Code Interpreter Output -- A Reproducible Pipeline Liao Minpeng, Luo Wei, Li Chengxi, Wu Jing, Fan Kai
- Align And Aggregate: Compositional Reasoning With Video Alignment And Answer Aggregation For Video Question-answering Liao Zhaohe, Li Jiangtong, Niu Li, Zhang Liqing
- E2LLM: Encoder Elongated Large Language Models For Long-context Understanding And Reasoning Liao Zihan, Wang Jun, Yu Hang, Wei Lingxiao, Li Jianguo, Wang Jun, Zhang Wei
- Adapting Mental Health Prediction Tasks For Cross-lingual Learning Via Meta-training And In-context Learning With Large Language Model Lifelo Zita, Ning Huansheng, Dhelim Sahraoui
- Inducing Generalization Across Languages And Tasks Using Featurized Low-rank Mixtures Lin Chu-cheng, Wang Xinyi, Clark Jonathan H., Lu Han, Zhu Yun, Whitehouse Chenxi, Yu Hongkun
- Fine-tuned Network Relies On Generic Representation To Solve Unseen Cognitive Task Lin Dongyan
- Bita: Bi-directional Tuning For Lossless Acceleration In Large Language Models Lin Feng, Yi Hanling, Li Hongbin, Yang Yifan, Yu Xiaotian, Lu Guangming, Xiao Rong
- Mala-500: Massive Language Adaptation Of Large Language Models Lin Peiqin, Ji Shaoxiong, Tiedemann Jörg, Martins André F. T., Schütze Hinrich
- FLAME: Factuality-aware Alignment For Large Language Models Lin Sheng-chieh, Gao Luyu, Oguz Barlas, Xiong Wenhan, Lin Jimmy, Yih Wen-tau, Chen Xilun
- Local And Global Contexts For Conversation Lin Zuoquan, Shen Xinyi
- Unveiling Vulnerability Of Self-attention Liong Khai Jiet, Wu Hongqiu, Zhao Hai
- PANDA: Preference Adaptation For Enhancing Domain-specific Abilities Of Llms Liu An, Yang Zonghan, Zhang Zhenhe, Hu Qingyuan, Li Peng, Yan Ming, Zhang Ji, Huang Fei, Liu Yang
- Llmembed: Rethinking Lightweight Llm's Genuine Function In Text Classification Liu Chun, Zhang Hongguang, Zhao Kainan, Ju Xinghai, Yang Lin
- Lumina-mgpt: Illuminate Flexible Photorealistic Text-to-image Generation With Multimodal Generative Pretraining Liu Dongyang, Zhao Shitian, Zhuo Le, Lin Weifeng, Qiao Yu, Li Hongsheng, Gao Peng
- A Practice-friendly Two-stage Llm-enhanced Paradigm In Sequential Recommendation Liu Dugang, Xian Shenxian, Lin Xiaolin, Zhang Xiaolian, Zhu Hong, Fang Yuan, Chen Zhen, Ming Zhong
- Pefomed: Parameter Efficient Fine-tuning Of Multimodal Large Language Models For Medical Imaging Liu Gang, He Jinlong, Li Pengfei, He Genrong, Chen Zhaolin, Zhong Shenjun
- On The Intrinsic Self-correction Capability Of Llms: Uncertainty And Latent Concept Liu Guangliang, Mao Haitao, Cao Bochuan, Xue Zhiyu, Johnson Kristen, Tang Jiliang, Wang Rongrong
- Mmgrec: Multimodal Generative Recommendation With Transformer Model Liu Han, Wei Yinwei, Song Xuemeng, Guan Weili, Li Yuan-fang, Nie Liqiang
- Laida: Linguistics-aware In-context Learning With Data Augmentation For Metaphor Components Identification Liu Hongde, He Chenyuan, Meng Feiyang, Niu Changyong, Jia Yuxiang
- Bucket Pre-training Is All You Need Liu Hongtao, Peng Qiyao, Yang Qing, Liu Kai, Xu Hongyan
- Bitdelta: Your Fine-tune May Only Be Worth One Bit Liu James, Xiao Guangxuan, Li Kai, Lee Jason D., Han Song, Dao Tri, Cai Tianle
- Tiny Refinements Elicit Resilience: Toward Efficient Prefix-model Against LLM Red-teaming Liu Jiaxu, Yin Xiangyu, Wu Sihao, Wang Jianhong, Fang Meng, Yi Xinping, Huang Xiaowei
- Educating Llms Like Human Students: Structure-aware Injection Of Domain Knowledge Liu Kai, Chen Ze, Fu Zhihang, Jiang Rongxin, Zhou Fan, Chen Yaowu, Wu Yue, Ye Jieping
- Synthetic Context Generation For Question Generation Liu Naiming, Wang Zichao, Baraniuk Richard
- Flashback: Efficient Retrieval-augmented Language Modeling For Long Context Inference Liu Runheng, Xiao Xingchen, Huang Heyan, Chi Zewen, Wu Zhijing
- Latent Guard: A Safety Framework For Text-to-image Generation Liu Runtao, Khakzar Ashkan, Gu Jindong, Chen Qifeng, Torr Philip, Pizzati Fabio
- Are Llms Effective Backbones For Fine-tuning? An Experimental Investigation Of Supervised Llms On Chinese Short Text Matching Liu Shulin, Xu Chengcheng, Liu Hao, Yu Tinghao, Yang Tao
- Making Them Ask And Answer: Jailbreaking Large Language Models In Few Queries Via Disguise And Reconstruction Liu Tong, Zhang Yingjie, Zhao Zhe, Dong Yinpeng, Meng Guozhu, Chen Kai
- Cmm-math: A Chinese Multimodal Math Dataset To Evaluate And Enhance The Mathematics Reasoning Of Large Multimodal Models Liu Wentao, Pan Qianjun, Zhang Yi, Liu Zhuo, Wu Ji, Zhou Jie, Zhou Aimin, Chen Qin, Jiang Bo, He Liang
- Extensive Self-contrast Enables Feedback-free Language Model Alignment Liu Xiao, Song Xixuan, Dong Yuxiao, Tang Jie
- Adaptive Draft-verification For Efficient Large Language Model Decoding Liu Xukun, Lei Bowen, Zhang Ruqi, Xu Dongkuan
- Datasets For Large Language Models: A Comprehensive Survey Liu Yang, Cao Jiahuan, Liu Chongyu, Ding Kai, Jin Lianwen
- Understanding Llms: A Comprehensive Overview From Training To Inference Liu Yiheng, He Hao, Han Tianle, Zhang Xu, Liu Mengyuan, Tian Jiaming, Zhang Yutong, Wang Jiaqi, Gao Xiaohui, Zhong Tianyang, Pan Yi, Xu Shaochen, Wu Zihao, Liu Zhengliang, Zhang Xin, Zhang Shu, Hu Xintao, Zhang Tuo, Qiang Ning, Liu Tianming, Ge Bao
- Chatzero: Zero-shot Cross-lingual Dialogue Generation Via Pseudo-target Language Liu Yongkang, Shi Feng, Wang Daling, Zhang Yifei, Schütze Hinrich
- DELTA: Decomposed Efficient Long-term Robot Task Planning Using Large Language Models Liu Yuchen, Palmieri Luigi, Koch Sebastian, Georgievski Ilche, Aiello Marco
- From Summary To Action: Enhancing Large Language Models For Complex Tasks With Open World Apis Liu Yulong, Yuan Yunlong, Wang Chunwei, Han Jianhua, Ma Yongqiang, Zhang Li, Zheng Nanning, Xu Hang
- Adamole: Fine-tuning Large Language Models With Adaptive Mixture Of Low-rank Adaptation Experts Liu Zefang, Luo Jiahua
- Mixture Of Insightful Experts (mote): The Synergy Of Thought Chains And Expert Mixtures In Self-alignment Liu Zhili, Gou Yunhao, Chen Kai, Hong Lanqing, Gao Jiahui, Mi Fei, Zhang Yu, Li Zhenguo, Jiang Xin, Liu Qun, Kwok James T.
- Loqt: Low-rank Adapters For Quantized Pre-training Loeschcke Sebastian, Toftrup Mads, Kastoryano Michael J., Belongie Serge, Snæbjarnarson Vésteinn
- Llms Are Biased Towards Output Formats! Systematically Evaluating And Mitigating Output Format Bias Of Llms Long Do Xuan, Ngoc Hai Nguyen, Sim Tiviatis, Dao Hieu, Joty Shafiq, Kawaguchi Kenji, Chen Nancy F., Kan Min-yen
- Sparser Is Faster And Less Is More: Efficient Sparse Attention For Long-range Transformers Lou Chao, Jia Zixia, Zheng Zilong, Tu Kewei
- Efficient Encoder-decoder Transformer Decoding For Decomposable Tasks Lu Bo-ru, Haduong Nikita, Lin Chien-yu, Cheng Hao, Smith Noah A., Ostendorf Mari
- Deepseek-vl: Towards Real-world Vision-language Understanding Lu Haoyu, Liu Wen, Zhang Bo, Wang Bingxuan, Dong Kai, Liu Bo, Sun Jingxiang, Ren Tongzheng, Li Zhuoshu, Yang Hao, Sun Yaofeng, Deng Chengqi, Xu Hanwei, Xie Zhenda, Ruan Chong
- YODA: Teacher-student Progressive Learning For Language Models Lu Jianqiao, Zhong Wanjun, Wang Yufei, Guo Zhijiang, Zhu Qi, Huang Wenyong, Wang Yanlin, Mi Fei, Wang Baojun, Wang Yasheng, Shang Lifeng, Jiang Xin, Liu Qun
- In-context Time Series Predictor Lu Jiecheng, Sun Yan, Yang Shihao
- Padellm-ner: Parallel Decoding In Large Language Models For Named Entity Recognition Lu Jinghui, Yang Ziwei, Wang Yanjie, Liu Xuejing, Mac Namee Brian, Huang Can
- A Bounding Box Is Worth One Token: Interleaving Layout And Text In A Large Language Model For Document Understanding Lu Jinghui, Yu Haiyang, Wang Yanjie, Ye Yongjie, Tang Jingqun, Yang Ziwei, Wu Binghong, Liu Qi, Feng Hao, Wang Han, Liu Hao, Huang Can
- FIPO: Free-form Instruction-oriented Prompt Optimization With Preference Dataset And Modular Fine-tuning Schema Lu Junru, An Siyu, Zhang Min, He Yulan, Yin Di, Sun Xing
- Online Merging Optimizers For Boosting Rewards And Mitigating Tax In Alignment Lu Keming, Yu Bowen, Huang Fei, Fan Yang, Lin Runji, Zhou Chang
- Ovis: Structural Embedding Alignment For Multimodal Large Language Model Lu Shiyin, Li Yang, Chen Qing-guo, Xu Zhao, Luo Weihua, Zhang Kaifu, Ye Han-jia
- Insights Into LLM Long-context Failures: When Transformers Know But Don't Tell Lu Taiming, Gao Muhan, Yu Kuai, Byerly Adam, Khashabi Daniel
- Fine-tuning Large Language Models For Domain Adaptation: Exploration Of Training Strategies, Scaling, Model Merging And Synergistic Capabilities Lu Wei, Luu Rachel K., Buehler Markus J.
- SPP: Sparsity-preserved Parameter-efficient Fine-tuning For Large Language Models Lu Xudong, Zhou Aojun, Xu Yuhui, Zhang Renrui, Gao Peng, Li Hongsheng
- Asymptotic Theory Of In-context Learning By Linear Attention Lu Yue M., Letey Mary I., Zavatone-veth Jacob A., Maiti Anindita, Pehlevan Cengiz
- To Err Is Human, But Llamas Can Learn It Too Luhtaru Agnes, Purason Taido, Vainikko Martin, Del Maksym, Fishel Mark
- Can Perplexity Predict Fine-tuning Performance? An Investigation Of Tokenization Effects On Sequential Language Models For Nepali Luitel Nishant, Bekoju Nirajan, Sah Anand Kumar, Shakya Subarna
- Metric-aware LLM Inference For Regression And Scoring Lukasik Michal, Narasimhan Harikrishna, Menon Aditya Krishna, Yu Felix, Kumar Sanjiv
- MINI-SEQUENCE TRANSFORMER: Optimizing Intermediate Memory For Long Sequences Training Luo Cheng, Zhao Jiawei, Chen Zhuoming, Chen Beidi, Anandkumar Anima
- Layoutllm: Layout Instruction Tuning With Large Language Models For Document Understanding Luo Chuwei, Shen Yufan, Zhu Zhaoqing, Zheng Qi, Yu Zhi, Yao Cong
- Arena Learning: Build Data Flywheel For Llms Post-training Via Simulated Chatbot Arena Luo Haipeng, Sun Qingfeng, Xu Can, Zhao Pu, Lin Qingwei, Lou Jianguang, Chen Shifeng, Tang Yansong, Chen Weizhu
- From Understanding To Utilization: A Survey On Explainability For Large Language Models Luo Haoyan, Specia Lucia
- Decoupled Alignment For Robust Plug-and-play Adaptation Luo Haozheng, Yu Jiahao, Zhang Wenxin, Li Jialong, Hu Jerry Yao-chieh, Xing Xinyu, Liu Han
- Kuaiji: The First Chinese Accounting Large Language Model Luo Jiayuan, Yang Songhua, Qiu Xiaoling, Chen Panyu, Nai Yufei, Zeng Wenxuan, Zhang Wentao, Jiang Xinke
- Large Language Models As Foundations For Next-gen Dense Retrieval: A Comprehensive Empirical Assessment Luo Kun, Qin Minghao, Liu Zheng, Xiao Shitao, Zhao Jun, Liu Kang
- Taking A Deep Breath: Enhancing Language Modeling Of Large Language Models With Sentinel Tokens Luo Weiyao, Zheng Suncong, Xia Heming, Wang Weikang, Lei Yan, Liu Tianyu, Chen Shuang, Sui Zhifang
- Zero-shot Cross-domain Dialogue State Tracking Via Dual Low-rank Adaptation Luo Xiang, Tang Zhiwen, Wang Jin, Zhang Xuejie
- Ensuring Safe And High-quality Outputs: A Guideline Library Approach For Language Models Luo Yi, Lin Zhenghao, Zhang Yuhao, Sun Jiashuo, Lin Chen, Xu Chengjin, Su Xiangdong, Shen Yelong, Guo Jian, Gong Yeyun
- TRAWL: Tensor Reduced And Approximated Weights For Large Language Models Luo Yiran, Patel Het, Fu Yu, Ahn Dawon, Chen Jia, Dong Yue, Papalexakis Evangelos E.
- Reft: Reasoning With Reinforced Fine-tuning Luong Trung Quoc, Zhang Xinbo, Jie Zhanming, Sun Peng, Jin Xiaoran, Li Hang
- WILBUR: Adaptive In-context Learning For Robust And Accurate Web Agents Lutz Michael, Bohra Arth, Saroyan Manvel, Harutyunyan Artem, Campagna Giovanni
- Interpreting Key Mechanisms Of Factual Recall In Transformer-based Language Models Lv Ang, Chen Yuhan, Zhang Kaiyi, Wang Yulong, Liu Lifeng, Wen Ji-rong, Xie Jian, Yan Rui
- Codeact: Code Adaptive Compute-efficient Tuning Framework For Code Llms Lv Weijie, Xia Xuan, Huang Sheng-jun
- Keeping Llms Aligned After Fine-tuning: The Crucial Role Of Prompt Templates Lyu Kaifeng, Zhao Haoyu, Gu Xinran, Yu Dingli, Goyal Anirudh, Arora Sanjeev
- Automatic Summarization Of Doctor-patient Encounter Dialogues Using Large Language Model Through Prompt Tuning Lyu Mengxian, Peng Cheng, Li Xiaohan, Balian Patrick, Bian Jiang, Wu Yonghui
- Knowtuning: Knowledge-aware Fine-tuning For Large Language Models Lyu Yougang, Yan Lingyong, Wang Shuaiqiang, Shi Haibo, Yin Dawei, Ren Pengjie, Chen Zhumin, De Rijke Maarten, Ren Zhaochun
- Sparsity-accelerated Training For Large Language Models Ma Da, Chen Lu, Wang Pengyu, Xu Hongshen, Li Hanqi, Sun Liangtai, Zhu Su, Fan Shuai, Yu Kai
- FBI-LLM: Scaling Up Fully Binarized Llms From Scratch Via Autoregressive Distillation Ma Liqun, Sun Mingjie, Shen Zhiqiang
- Eventlens: Leveraging Event-aware Pretraining And Cross-modal Linking Enhances Visual Commonsense Reasoning Ma Mingjie, Yu Zhihuan, Ma Yichao, Li Guohui
- Making Pre-trained Language Models Better Continual Few-shot Relation Extractors Ma Shengkun, Han Jiale, Liang Yi, Cheng Bo
- The Era Of 1-bit Llms: All Large Language Models Are In 1.58 Bits Ma Shuming, Wang Hongyu, Ma Lingxiao, Wang Lei, Wang Wenhui, Huang Shaohan, Dong Li, Wang Ruiping, Xue Jilong, Wei Furu
- Megalodon: Efficient LLM Pretraining And Inference With Unlimited Context Length Ma Xuezhe, Yang Xiaomeng, Xiong Wenhan, Chen Beidi, Yu Lili, Zhang Hao, May Jonathan, Zettlemoyer Luke, Levy Omer, Zhou Chunting
- Llamoco: Instruction Tuning Of Large Language Models For Optimization Code Generation Ma Zeyuan, Guo Hongshu, Chen Jiacheng, Peng Guojun, Cao Zhiguang, Ma Yining, Gong Yue-jiao
- LLM In-context Recall Is Prompt Dependent Machlab Daniel, Battle Rick
- Too Late To Train, Too Early To Use? A Study On Necessity And Viability Of Low-resource Bengali Llms Mahfuz Tamzeed, Dey Satak Kumar, Naswan Ruwad, Adil Hasnaen, Sayeed Khondker Salman, Shahgir Haz Sameen
- ULLME: A Unified Framework For Large Language Model Embeddings With Generation-augmented Learning Man Hieu, Ngo Nghia Trung, Dernoncourt Franck, Nguyen Thien Huu
- Coalitions Of Large Language Models Increase The Robustness Of AI Agents Mangal Prattyush, Mak Carol, Kanakis Theo, Donovan Timothy, Braines Dave, Pyzer-knapp Edward
- Can Large Language Models Automatically Score Proficiency Of Written Essays? Mansour Watheq, Albatarni Salam, Eltanbouly Sohaila, Elsayed Tamer
- Chatretriever: Adapting Large Language Models For Generalized And Robust Conversational Dense Retrieval Mao Kelong, Deng Chenlong, Chen Haonan, Mo Fengran, Liu Zheng, Sakai Tetsuya, Dou Zhicheng
- Visually Guided Generative Text-layout Pre-training For Document Intelligence Mao Zhiming, Bai Haoli, Hou Lu, Wei Jiansheng, Jiang Xin, Liu Qun, Wong Kam-fai
- Tuning Llms With Contrastive Alignment Instructions For Machine Translation In Unseen, Low-resource Languages Mao Zhuoyuan, Yu Yen
- German Also Hallucinates! Inconsistency Detection In News Summaries With The Absinth Dataset Mascarell Laura, Chalumattu Ribin, Rios Annette
- Improving Text-to-image Consistency Via Automatic Prompt Optimization Mañas Oscar, Astolfi Pietro, Hall Melissa, Ross Candace, Urbanek Jack, Williams Adina, Agrawal Aishwarya, Romero-soriano Adriana, Drozdzal Michal
- Set-based Prompting: Provably Solving The Language Model Order Dependency Problem Mcilroy-young Reid, Brown Katrina, Olson Conlan, Zhang Linjun, Dwork Cynthia
- MM1: Methods, Analysis & Insights From Multimodal LLM Pre-training Mckinzie Brandon, Gan Zhe, Fauconnier Jean-philippe, Dodge Sam, Zhang Bowen, Dufter Philipp, Shah Dhruti, Du Xianzhi, Peng Futang, Weers Floris, Belyi Anton, Zhang Haotian, Singh Karanjeet, Kang Doug, Jain Ankur, Hè Hongyu, Schwarzer Max, Gunter Tom, Kong Xiang, Zhang Aonan, Wang Jianyu, Wang Chong, Du Nan, Lei Tao, Wiseman Sam, Yin Guoli, Lee Mark, Wang Zirui, Pang Ruoming, Grasch Peter, Toshev Alexander, Yang Yinfei
- Injecting New Knowledge Into Large Language Models Via Supervised Fine-tuning Mecklenburg Nick, Lin Yiyou, Li Xiaoxiao, Holstein Daniel, Nunes Leonardo, Malvar Sara, Silva Bruno, Chandra Ranveer, Aski Vijay, Yannam Pavan Kumar Reddy, Aktas Tolga, Hendry Todd
- Metacheckgpt -- A Multi-task Hallucination Detector Using LLM Uncertainty And Meta-models Mehta Rahul, Hoblitzell Andrew, O'keefe Jack, Jang Hyeju, Varma Vasudeva
- Openelm: An Efficient Language Model Family With Open Training And Inference Framework Mehta Sachin, Sekhavat Mohammad Hossein, Cao Qingqing, Horton Maxwell, Jin Yanzi, Sun Chenfan, Mirzadeh Iman, Najibi Mahyar, Belenko Dmitry, Zatloukal Peter, Rastegari Mohammad
- Soda-eval: Open-domain Dialogue Evaluation In The Age Of Llms Mendonça John, Trancoso Isabel, Lavie Alon
- Deepstack: Deeply Stacking Visual Tokens Is Surprisingly Simple And Effective For Lmms Meng Lingchen, Yang Jianwei, Tian Rui, Dai Xiyang, Wu Zuxuan, Gao Jianfeng, Jiang Yu-gang
- A Survey Of Resource-efficient LLM And Multimodal Foundation Models Mengwei Xu, Wangsong Yin, Dongqi Cai, Rongjie Yi, Daliang Xu, Qipeng Wang, Bingyang Wu, Yihao Zhao, Chen Yang, Shihe Wang, Qiyang Zhang, Zhenyan Lu, Li Zhang, Shangguang Wang, Yuanchun Li, Yunxin Liu, Xin Jin, Xuanzhe Liu
- Linearizing Large Language Models Mercat Jean, Vasiljevic Igor, Keh Sedrick, Arora Kushal, Dave Achal, Gaidon Adrien, Kollar Thomas
- Superposition Prompting: Improving And Accelerating Retrieval-augmented Generation Merth Thomas, Fu Qichen, Rastegari Mohammad, Najibi Mahyar
- A Comparison Of LLM Finetuning Methods & Evaluation Metrics With Travel Chatbot Use Case Meyer Sonia, Singh Shreya, Tam Bertha, Ton Christopher, Ren Angel
- Subject-driven Text-to-image Generation Via Preference-based Reinforcement Learning Miao Yanting, Loh William, Kothawade Suraj, Poupart Pascal, Rashwan Abdullah, Li Yeqing
- Revenge Of The Fallen? Recurrent Models Match Transformers At Predicting Human Language Comprehension Metrics Michaelov James A., Arnett Catherine, Bergen Benjamin K.
- Exploration Of Masked And Causal Language Modelling For Text Generation Micheletti Nicolo, Belkadi Samuel, Han Lifeng, Nenadic Goran
- Velora: Memory Efficient Training Using Rank-1 Sub-token Projections Miles Roy, Reddy Pradyumna, Elezi Ismail, Deng Jiankang
- Semantics Of Multiword Expressions In Transformer-based Models: A Survey Miletić Filip, Walde Sabine Schulte Im
- Large Language Models: A Survey Minaee Shervin, Mikolov Tomas, Nikzad Narjes, Chenaghlu Meysam, Socher Richard, Amatriain Xavier, Gao Jianfeng
- Multimodal Contrastive In-context Learning Miyanishi Yosuke, Nguyen Minh Le
- Aligning Query Representation With Rewritten Query And Relevance Judgments In Conversational Search Mo Fengran, Qu Chen, Mao Kelong, Wu Yihong, Su Zhan, Huang Kaiyu, Nie Jian-yun
- Convsdg: Session Data Generation For Conversational Search Mo Fengran, Yi Bole, Mao Kelong, Qu Chen, Huang Kaiyu, Nie Jian-yun
- LSPT: Long-term Spatial Prompt Tuning For Visual Representation Learning Mo Shentong, Wang Yansen, Luo Xufang, Li Dongsheng
- Large Language Model (LLM) AI Text Generation Detection Based On Transformer Deep Learning Algorithm Mo Yuhong, Qin Hao, Dong Yushan, Zhu Ziyi, Li Zhenglin
- Llava-chef: A Multi-modal Generative Model For Food Recipes Mohbat Fnu, Zaki Mohammed J.
- Comparison Of Large Language Models For Generating Contextually Relevant Questions Molina Ivo Lodovico, Švábenský Valdemar, Minematsu Tsubasa, Chen Li, Okubo Fumiya, Shimada Atsushi
- Towards Generating Informative Textual Description For Neurons In Language Models Mondal Shrayani, Garodia Rishabh, Qureshi Arbaaz, Lee Taesung, Park Youngja
- Xc-cache: Cross-attending To Cached Context For Efficient LLM Inference Monteiro João, Marcotte Étienne, Noël Pierre-andré, Zantedeschi Valentina, Vázquez David, Chapados Nicolas, Pal Christopher, Taslakian Perouz
- Synthetic Programming Elicitation And Repair For Text-to-code In Very Low-resource Programming Languages Mora Federico, Wong Justin, Lepe Haley, Bhatia Sahil, Elmaaroufi Karim, Varghese George, Gonzalez Joseph E., Polgreen Elizabeth, Seshia Sanjit A.
- Exploring The Landscape Of Large Language Models: Foundations, Techniques, And Challenges Moradi Milad, Yan Ke, Colwell David, Samwald Matthias, Asgari Rhona
- Language Modelling Approaches To Adaptive Machine Translation Moslem Yasmin
- Are Llms Robust For Spoken Dialogues? Mousavi Seyed Mahed, Roccabruna Gabriel, Alghisi Simone, Rizzoli Massimo, Ravanelli Mirco, Riccardi Giuseppe
- Leave No Context Behind: Efficient Infinite Context Transformers With Infini-attention Munkhdalai Tsendsuren, Faruqui Manaal, Gopal Siddharth
- Novel-wd: Exploring Acquisition Of Novel World Knowledge In Llms Using Prefix-tuning Méloux Maxime, Cerisara Christophe
- Aurora-m: The First Open Source Multilingual Language Model Red-teamed According To The U.S. Executive Order Nakamura Taishi, Mishra Mayank, Tedeschi Simone, Chai Yekun, Stillerman Jason T, Friedrich Felix, Yadav Prateek, Laud Tanmay, Chien Vu Minh, Zhuo Terry Yue, Misra Diganta, Bogin Ben, Vu Xuan-son, Karpinska Marzena, Dantuluri Arnav Varma, Kusa Wojciech, Furlanello Tommaso, Yokota Rio, Muennighoff Niklas, Pai Suhas, Adewumi Tosin, Laippala Veronika, Yao Xiaozhe, Junior Adalberto, Ariyak Alpay, Drozd Aleksandr, Clive Jordan, Gupta Kshitij, Chen Liangyu, Sun Qi, Tsui Ken, Persaud Noah, Fahmy Nour, Chen Tianlong, Bansal Mohit, Monti Nicolo, Dang Tai, Luo Ziyang, Bui Tien-tung, Navigli Roberto, Mehta Virendra, Blumberg Matthew, May Victor, Nguyen Huu, Pyysalo Sampo
- Open Llama2 Model For The Lithuanian Language Nakvosas Artūras, Daniušis Povilas, Mulevičius Vytas
- Gemquad: Generating Multilingual Question Answering Datasets From Large Language Models Using Few Shot Learning Namboori Amani, Mangale Shivam, Rosenbaum Andy, Soltan Saleh
- Using Llms To Model The Beliefs And Preferences Of Targeted Populations Namikoshi Keiichi, Filipowicz Alex, Shamma David A., Iliev Rumen, Hogan Candice L., Arechiga Nikos
- Word2world: Generating Stories And Worlds Through Large Language Models Nasir Muhammad U., James Steven, Togelius Julian
- Learning Goal-conditioned Representations For Language Reward Models Nath Vaskar, Slack Dylan, Da Jeff, Ma Yuntao, Zhang Hugh, Whitehead Spencer, Hendryx Sean
- Segment-based Interactive Machine Translation For Pre-trained Models Navarro Angel, Casacuberta Francisco
- Dynamic Memory Compression: Retrofitting Llms For Accelerated Inference Nawrot Piotr, Łańcucki Adrian, Chochowski Marcin, Tarjan David, Ponti Edoardo M.
- Improving Robustness Of Llm-based Speech Synthesis By Learning Monotonic Alignment Neekhara Paarth, Hussain Shehzeen, Ghosh Subhankar, Li Jason, Valle Rafael, Badlani Rohan, Ginsburg Boris
- Needle In The Haystack For Memory Based Large Language Models Nelson Elliot, Kollias Georgios, Das Payel, Chaudhury Subhajit, Dan Soham
- Interpreting Context Look-ups In Transformers: Investigating Attention-mlp Interactions Neo Clement, Cohen Shay B., Barez Fazl
- MBBQ: A Dataset For Cross-lingual Comparison Of Stereotypes In Generative Llms Neplenbroek Vera, Bisazza Arianna, Fernández Raquel
- Towards Modeling Learner Performance With Large Language Models Neshaei Seyed Parsa, Davis Richard Lee, Hazimeh Adam, Lazarevski Bojan, Dillenbourg Pierre, Käser Tanja
- What Drives Performance In Multilingual Language Models? Nezhad Sina Bagheri, Agrawal Ameeta
- Gpts And Language Barrier: A Cross-lingual Legal QA Examination Nguyen Ha-thanh, Yamada Hiroaki, Satoh Ken
- Advancing Vietnamese Visual Question Answering With Transformer And Convolutional Integration Nguyen Ngoc Son, Nguyen Van Son, Le Tung
- Better Alignment With Instruction Back-and-forth Translation Nguyen Thao, Li Jeffrey, Oh Sewoong, Schmidt Ludwig, Weston Jason, Zettlemoyer Luke, Li Xian
- Layer-wise Regularized Dropout For Neural Language Models Ni Shiwen, Yang Min, Xu Ruifeng, Li Chengming, Hu Xiping
- Panza: A Personalized Text Writing Assistant Via Data Playback And Local Fine-tuning Nicolicioiu Armand, Iofinova Eugenia, Kurtic Eldar, Nikdan Mahdi, Panferov Andrei, Markov Ilia, Shavit Nir, Alistarh Dan
- Flawn-t5: An Empirical Examination Of Effective Instruction-tuning Data Mixtures For Legal Reasoning Niklaus Joel, Zheng Lucia, Mccarthy Arya D., Hahn Christopher, Rosen Brian M., Henderson Peter, Ho Daniel E., Honke Garrett, Liang Percy, Manning Christopher
- WTU-EVAL: A Whether-or-not Tool Usage Evaluation Benchmark For Large Language Models Ning Kangyun, Su Yisong, Lv Xueqiang, Zhang Yuanzhe, Liu Jian, Liu Kang, Xu Jinan
- Can Llms Learn By Teaching? A Preliminary Study Ning Xuefei, Wang Zifu, Li Shiyao, Lin Zinan, Yao Peiran, Fu Tianyu, Blaschko Matthew B., Dai Guohao, Yang Huazhong, Wang Yu
- How Reliable AI Chatbots Are For Disease Prediction From Patient Complaints? Nipu Ayesha Siddika, Islam K M Sajjadul, Madiraju Praveen
- Quantified Task Misalignment To Inform PEFT: An Exploration Of Domain Generalization And Catastrophic Forgetting In CLIP Niss Laura, Vogt-lowell Kevin, Tsiligkaridis Theodoros
- Enhancing Dialogue State Tracking Models Through Llm-backed User-agents Simulation Niu Cheng, Wang Xingguang, Cheng Xuxin, Song Juntong, Zhang Tong
- PARAMANU-GANITA: Language Model With Mathematical Capabilities Niyogi Mitodru, Bhattacharya Arnab
- Language Portability Strategies For Open-domain Dialogue With Pre-trained Language Models From High To Low Resource Languages Njifenjou Ahmed, Sucal Virgile, Jabaian Bassam, Lefèvre Fabrice
- Role-play Zero-shot Prompting With Large Language Models For Open-domain Human-machine Conversation Njifenjou Ahmed, Sucal Virgile, Jabaian Bassam, Lefèvre Fabrice
- Fine-tuning With HED-IT: The Impact Of Human Post-editing For Dialogical Language Models Occhipinti Daniela, Marchi Michele, Mondella Irene, Lai Huiyuan, Dell'orletta Felice, Nissim Malvina, Guerini Marco
- Crafting Efficient Fine-tuning Strategies For Large Language Models Oliver Michael, Wang Guan
- Jamba: A Hybrid Transformer-mamba Language Model Opher Lieber, Barak Lenz, Hofit Bata, Gal Cohen, Jhonathan Osin, Itay Dalmedigos, Erez Safahi, Shaked Meirom, Yonatan Belinkov, Shai Shalev-shwartz, Omri Abend, Raz Alon, Tomer Asida, Amir Bergman, Roman Glozman, Michael Gokhman, Avashalom Manevich, Nir Ratner, Noam Rozen, Erez Shwartz, Mor Zusman, Yoav Shoham
- Transformers Are Multi-state Rnns Oren Matanel, Hassid Michael, Yarden Nir, Adi Yossi, Schwartz Roy
- A Systematic Review Of Data-to-text NLG Osuji Chinonso Cynthia, Ferreira Thiago Castro, Davis Brian
- The Accuracy Of Domain Specific And Descriptive Analysis Generated By Large Language Models Otieno Denish Omondi, Abri Faranak, Siami-namini Sima, Namin Akbar Siami
- Lossless Acceleration Of Large Language Model Via Adaptive N-gram Parallel Decoding Ou Jie, Chen Yueming, Tian Wenhong
- Training Llms To Recognize Hedges In Spontaneous Narratives Paige Amie J., Soubki Adil, Murzaku John, Rambow Owen, Brennan Susan E.
- Investigating LLM Applications In E-commerce Palen-michel Chester, Wang Ruixiang, Zhang Yipeng, Yu David, Xu Canran, Wu Zhe
- Dense Training, Sparse Inference: Rethinking Training Of Mixture-of-experts Language Models Pan Bowen, Shen Yikang, Liu Haokun, Mishra Mayank, Zhang Gaoyuan, Oliva Aude, Raffel Colin, Panda Rameswar
- LISA: Layerwise Importance Sampling For Memory-efficient Large Language Model Fine-tuning Pan Rui, Liu Xiang, Diao Shizhe, Pi Renjie, Zhang Jipeng, Han Chi, Zhang Tong
- POMP: Probability-driven Meta-graph Prompter For Llms In Low-resource Unsupervised Neural Machine Translation Pan Shilong, Tian Zhiliang, Ding Liang, Huang Zhen, Wen Zhihua, Li Dongsheng
- The Solution For The AIGC Inference Performance Optimization Competition Pan Sishun, Xu Haonan, Wan Zhonghua, Yang Yang
- Preparing Lessons For Progressive Training On Language Models Pan Yu, Yuan Ye, Yin Yichun, Shi Jiaxin, Xu Zenglin, Zhang Ming, Shang Lifeng, Jiang Xin, Liu Qun
- Llmlingua-2: Data Distillation For Efficient And Faithful Task-agnostic Prompt Compression Pan Zhuoshi, Wu Qianhui, Jiang Huiqiang, Xia Menglin, Luo Xufang, Zhang Jue, Lin Qingwei, Rühle Victor, Yang Yuqing, Lin Chin-yew, Zhao H. Vicky, Qiu Lili, Zhang Dongmei
- AILS-NTUA At Semeval-2024 Task 9: Cracking Brain Teasers: Transformer Models For Lateral Thinking Puzzles Panagiotopoulos Ioannis, Filandrianos Giorgos, Lymperaiou Maria, Stamou Giorgos
- Can Watermarking Large Language Models Prevent Copyrighted Text Generation And Hide Training Data? Panaitescu-liess Michael-andrei, Che Zora, An Bang, Xu Yuancheng, Pathmanathan Pankayaraj, Chakraborty Souradip, Zhu Sicheng, Goldstein Tom, Huang Furong
- Lottery Ticket Adaptation: Mitigating Destructive Interference In Llms Panda Ashwinee, Isik Berivan, Qi Xiangyu, Koyejo Sanmi, Weissman Tsachy, Mittal Prateek
- Salute The Classic: Revisiting Challenges Of Machine Translation In The Age Of Large Language Models Pang Jianhui, Ye Fanghua, Wang Longyue, Yu Dian, Wong Derek F., Shi Shuming, Tu Zhaopeng
- Anchor-based Large Language Models Pang Jianhui, Ye Fanghua, Wong Derek Fai, He Xin, Chen Wanshun, Wang Longyue
- Phased Instruction Fine-tuning For Large Language Models Pang Wei, Zhou Chuan, Zhou Xiao-hua, Wang Xiaojie
- LLM Evaluators Recognize And Favor Their Own Generations Panickssery Arjun, Bowman Samuel R., Feng Shi
- Shaking Up Vlms: Comparing Transformers And Structured State Space Models For Vision & Language Modeling Pantazopoulos Georgios, Nikandrou Malvina, Suglia Alessandro, Lemon Oliver, Eshghi Arash
- Lost In Space: Probing Fine-grained Spatial Understanding In Vision And Language Resamplers Pantazopoulos Georgios, Suglia Alessandro, Lemon Oliver, Eshghi Arash
- Measuring Memorization In RLHF For Code Completion Pappu Aneesh, Porter Billy, Shumailov Ilia, Hayes Jamie
- Offsetbias: Leveraging Debiased Data For Tuning Evaluators Park Junsoo, Jwa Seungyeon, Ren Meiying, Kim Daeyoung, Choi Sanghyuk
- Token-picker: Accelerating Attention In Text Generation With Minimized Memory Transfer Via Probability Estimation Park Junyoung, Kang Myeonggu, Han Yunki, Kim Yanggon, Shin Jaekang, Kim Lee-sup
- Pre-trained Vision And Language Transformers Are Few-shot Incremental Learners Park Keon-hee, Song Kyungwoo, Park Gyeong-moon
- Textboost: Towards One-shot Personalization Of Text-to-image Models Via Fine-tuning Text Encoder Park Nahyeon, Kim Kunhee, Shim Hyunjung
- The Ultimate Guide To Fine-tuning Llms From Basics To Breakthroughs: An Exhaustive Review Of Technologies, Research, Best Practices, Applied Research Challenges And Opportunities Parthasarathy Venkatesh Balavadhani, Zafar Ahtsham, Khan Aafaq, Shahid Arsalan
- Decoding News Narratives: A Critical Analysis Of Large Language Models In Framing Detection Pastorino Valeria, Sivakumar Jasivan A., Moosavi Nafise Sadat
- Large Language Models Can Self-improve At Web Agent Tasks Patel Ajay, Hofmarcher Markus, Leoveanu-condrei Claudiu, Dinu Marius-constantin, Callison-burch Chris, Hochreiter Sepp
- Making Reasoning Matter: Measuring And Improving Faithfulness Of Chain-of-thought Reasoning Paul Debjit, West Robert, Bosselut Antoine, Faltings Boi
- Advprompter: Fast Adaptive Adversarial Prompting For Llms Paulus Anselm, Zharmagambetov Arman, Guo Chuan, Amos Brandon, Tian Yuandong
- Autonomous Workflow For Multimodal Fine-grained Training Assistants Towards Mixed Reality Pei Jiahuan, Viola Irene, Huang Haochen, Wang Junxiao, Ahsan Moonisa, Ye Fanghua, Yiming Jiang, Sai Yao, Wang Di, Chen Zhumin, Ren Pengjie, Cesar Pablo
- Automated Multi-language To English Machine Translation Using Generative Pre-trained Transformers Pelofske Elijah, Urias Vincent, Liebrock Lorie M.
- Answer Is All You Need: Instruction-following Text Embedding Via Answering The Question Peng Letian, Zhang Yuwei, Wang Zilong, Srinivasa Jayanth, Liu Gaowen, Wang Zihan, Shang Jingbo
- Review-llm: Harnessing Large Language Models For Personalized Review Generation Peng Qiyao, Liu Hongtao, Xu Hongyan, Yang Qing, Shao Minglai, Wang Wenjun
- Q-PEFT: Query-dependent Parameter Efficient Fine-tuning For Text Reranking With Large Language Models Peng Zhiyuan, Wu Xuyang, Wang Qifan, Rajanala Sravanthi, Fang Yi
- Astrollama-chat: Scaling Astrollama With Conversational And Diverse Datasets Perkowski Ernest, Pan Rui, Nguyen Tuan Dung, Ting Yuan-sen, Kruk Sandor, Zhang Tong, O'neill Charlie, Jablonska Maja, Sun Zechang, Smith Michael J., Liu Huiling, Schawinski Kevin, Iyer Kartheik, Ciucă Ioana (for UniverseTBD)
- Prompting A Pretrained Transformer Can Be A Universal Approximator Petrov Aleksandar, Torr Philip H. S., Bibi Adel
- Assessing The Emergent Symbolic Reasoning Abilities Of Llama Large Language Models Petruzzellis Flavio, Testolin Alberto, Sperduti Alessandro
- Benchmarking GPT-4 On Algorithmic Problems: A Systematic Evaluation Of Prompting Strategies Petruzzellis Flavio, Testolin Alberto, Sperduti Alessandro
- Optimizing Large Language Models For Openapi Code Completion Petryshyn Bohdan, Lukoševičius Mantas
- H2o-danube3 Technical Report Pfeiffer Pascal, Singer Philipp, Babakhin Yauhen, Fodor Gabor, Dhankhar Nischay, Ambati Sri Satish
- Suri: Multi-constraint Instruction Following For Long-form Text Generation Pham Chau Minh, Sun Simeng, Iyyer Mohit
- Towards Reliable Medical Question Answering: Techniques And Challenges In Mitigating Hallucinations In Language Models Pham Duy Khoa, Vo Bao Quoc
- Investigating The Effectiveness Of Hypertuning Via Gisting Phang Jason
- Mllm-protector: Ensuring Mllm's Safety Without Hurting Performance Pi Renjie, Han Tianyang, Zhang Jianshu, Xie Yueqi, Pan Rui, Lian Qing, Dong Hanze, Zhang Jipeng, Zhang Tong
- Ptt5-v2: A Closer Look At Continued Pretraining Of T5 Models For The Portuguese Language Piau Marcos, Lotufo Roberto, Nogueira Rodrigo
- Mamba-ptq: Outlier Channels In Recurrent Large Language Models Pierro Alessandro, Abreu Steven
- Doccgen: Document-based Controlled Code Generation Pimparkhede Sameer, Kammakomati Mehant, Tamilselvam Srikanth, Kumar Prince, Kumar Ashok Pon, Bhattacharyya Pushpak
- How Effective Are State Space Models For Machine Translation? Pitorro Hugo, Vasylenko Pavlo, Treviso Marcos, Martins André F. T.
- LM-PUB-QUIZ: A Comprehensive Framework For Zero-shot Evaluation Of Relational Knowledge In Language Models Ploner Max, Wiland Jacek, Pohl Sebastian, Akbik Alan
- Sbora: Low-rank Adaptation With Regional Weight Updates Po Lai-man, Liu Yuyang, Wu Haoxuan, Zhang Tianqi, Yu Wing-yin, Jiang Zeyu, Li Kun
- Advanced Natural-based Interaction For The Italian Language: Llamantino-3-anita Polignano Marco, Basile Pierpaolo, Semeraro Giovanni
- Evaluation Of Language Models In The Medical Context Under Resource-constrained Settings Posada Andrea, Rueckert Daniel, Meissen Felix, Müller Philip
- Documint: Docstring Generation For Python Using Small Language Models Poudel Bibek, Cook Adam, Traore Sekou, Ameli Shelah
- Machine Translation With Large Language Models: Prompt Engineering For Persian, English, And Russian Directions Pourkamali Nooshin, Sharifi Shler Ebrahim
- Accelerating Large Language Model Pretraining Via LFR Pedagogy: Learn, Focus, And Review Prakriya Neha, Yen Jui-nan, Hsieh Cho-jui, Cong Jason
- The Silent Curriculum: How Does LLM Monoculture Shape Educational Content And Its Accessibility? Priyanshu Aman, Vijay Supriti
- Customized Retrieval Augmented Generation And Benchmarking For EDA Tool Documentation QA Pu Yuan, He Zhuolun, Qiu Tairu, Wu Haoyuan, Yu Bei
- Fine-tuning With Divergent Chains Of Thought Boosts Reasoning Through Self-correction In Language Models Puerto Haritz, Chubakov Tilek, Zhu Xiaodan, Madabushi Harish Tayyar, Gurevych Iryna
- Less Is More: Accurate Speech Recognition & Translation Without Web-scale Data Puvvada Krishna C., Żelasko Piotr, Huang He, Hrinchuk Oleksii, Koluguri Nithin Rao, Dhawan Kunal, Majumdar Somshubra, Rastorgueva Elena, Chen Zhehuai, Lavrukhin Vitaly, Balam Jagadeesh, Ginsburg Boris
- Practical Token Pruning For Foundation Models In Few-shot Conversational Virtual Assistant Systems Qi Haode, Qian Cheng, Ni Jian, Singh Pratyush, Fazeli Reza, Wang Gengyu, Shu Zhongzheng, Wayne Eric, Bross Juergen
- Is Next Token Prediction Sufficient For GPT? Exploration On Code Logic Comprehension Qi Mengnan, Huang Yufan, Yao Yongqiang, Wang Maoquan, Gu Bin, Sundaresan Neel
- Safety Alignment Should Be Made More Than Just A Few Tokens Deep Qi Xiangyu, Panda Ashwinee, Lyu Kaifeng, Ma Xiao, Roy Subhrajit, Beirami Ahmad, Mittal Prateek, Henderson Peter
- Mutual Reasoning Makes Smaller Llms Stronger Problem-solvers Qi Zhenting, Ma Mingyuan, Xu Jiahang, Zhang Li Lyna, Yang Fan, Yang Mao
- Mia-bench: Towards Better Instruction Following Evaluation Of Multimodal Llms Qian Yusu, Ye Hanrong, Fauconnier Jean-philippe, Grasch Peter, Yang Yinfei, Gan Zhe
- Prompt Perturbation Consistency Learning For Robust Language Models Qiang Yao, Nandi Subhrangshu, Mehrabi Ninareh, Steeg Greg Ver, Kumar Anoop, Rumshisky Anna, Galstyan Aram
- Vl-mamba: Exploring State Space Models For Multimodal Learning Qiao Yanyuan, Yu Zheng, Guo Longteng, Chen Sihan, Zhao Zijia, Sun Mingzhen, Wu Qi, Liu Jing
- The Uniqueness Of Llama3-70b With Per-channel Quantization: An Empirical Study Qin Minghai
- Empirical Guidelines For Deploying Llms Onto Resource-constrained Edge Devices Qin Ruiyang, Liu Dancheng, Yan Zheyu, Tan Zhaoxuan, Pan Zixuan, Jia Zhenge, Jiang Meng, Abbasi Ahmed, Xiong Jinjun, Shi Yiyu
- Exploring The Nexus Of Large Language Models And Legal Systems: A Short Survey Qin Weicong, Sun Zhongxiang
- Enhancing Sequential Recommendations Through Multi-perspective Reflections And Iteration Qin Weicong, Xu Yi, Yu Weijie, Shen Chenglei, Zhang Xiao, He Ming, Fan Jianping, Xu Jun
- Various Lengths, Constant Speed: Efficient Language Modeling With Lightning Attention Qin Zhen, Sun Weigao, Li Dong, Shen Xuyang, Sun Weixuan, Zhong Yiran
- Efficient Interactive LLM Serving With Proxy Model-based Sequence Length Prediction Qiu Haoran, Mao Weichao, Patke Archit, Cui Shengkun, Jha Saurabh, Wang Chen, Franke Hubertus, Kalbarczyk Zbigniew T., Başar Tamer, Iyer Ravishankar K.
- Dissecting Multiplication In Transformers: Insights Into Llms Qiu Luyu, Li Jianing, Su Chi, Zhang Chen Jason, Chen Lei
- Mupt: A Generative Symbolic Music Pretrained Transformer Qu Xingwei, Bai Yuelin, Ma Yinghao, Zhou Ziya, Lo Ka Man, Liu Jiaheng, Yuan Ruibin, Min Lejun, Liu Xueling, Zhang Tianyu, Du Xinrun, Guo Shuyue, Liang Yiming, Li Yizhi, Wu Shangda, Zhou Junting, Zheng Tianyu, Ma Ziyang, Han Fengze, Xue Wei, Xia Gus, Benetos Emmanouil, Yue Xiang, Lin Chenghua, Tan Xu, Huang Stephen W., Fu Jie, Zhang Ge
- DEEP-ICL: Definition-enriched Experts For Language Model In-context Learning Qu Xingwei, Liang Yiming, Wang Yucheng, Zheng Tianyu, Yue Tommy, Ma Lei, Huang Stephen W., Zhang Jiajun, Shi Yinan, Lin Chenghua, Fu Jie, Zhang Ge
- Coursegpt-zh: An Educational Large Language Model Based On Knowledge Distillation Incorporating Prompt Optimization Qu Zheyan, Yin Lu, Yu Zitong, Wang Wenbo, Zhang Xing
- Automatically Generating Numerous Context-driven SFT Data For Llms Across Diverse Granularity Quan Shanghaoran
- Memory-augmented Generative Adversarial Transformers Raaijmakers Stephan, Bakker Roos, Cremers Anita, De Kleijn Roy, Kouwenhoven Tom, Verhoef Tessa
- Simultaneous Masking, Not Prompting Optimization: A Paradigm Shift In Fine-tuning Llms For Simultaneous Translation Raffel Matthew, Agostinelli Victor, Chen Lizhong
- Medit: Multilingual Text Editing Via Instruction Tuning Raheja Vipul, Alikaniotis Dimitris, Kulkarni Vivek, Alhafni Bashar, Kumar Dhruv
- V-zen: Efficient GUI Understanding And Precise Grounding With A Novel Multimodal LLM Rahman Abdur, Chawla Rajat, Kumar Muskaan, Datta Arkajit, Jha Adarsh, Ns Mukunda, Bhola Ishaan
- Qdylora: Quantized Dynamic Low-rank Adaptation For Efficient Large Language Model Tuning Rajabzadeh Hossein, Valipour Mojtaba, Zhu Tianshu, Tahaei Marzieh, Kwon Hyock Ju, Ghodsi Ali, Chen Boxing, Rezagholizadeh Mehdi
- Scavenging Hyena: Distilling Transformers Into Long Convolution Models Ralambomihanta Tokiniaina Raharison, Mohammadzadeh Shahrad, Islam Mohammad Sami Nur, Jabbour Wassim, Liang Laurence
- ANCHOR: Llm-driven News Subject Conditioning For Text-to-image Synthesis Ramakrishnan Aashish Anantha, Huang Sharon X., Lee Dongwon
- Self-refinement Of Language Models From External Proxy Metrics Feedback Ramji Keshav, Lee Young-suk, Astudillo Ramón Fernandez, Sultan Md Arafat, Naseem Tahira, Munawar Asim, Florian Radu, Roukos Salim
- Self-refine Instruction-tuning For Aligning Reasoning In Language Models Ranaldi Leonardo, Freitas Andrè
- Learning To Localize Objects Improves Spatial Reasoning In Visual-llms Ranasinghe Kanchana, Shukla Satya Narayan, Poursaeed Omid, Ryoo Michael S., Lin Tsung-yu
- A Fine-tuning Enhanced RAG System With Quantized Influence Measure As AI Judge Rangan Keshav, Yin Yiqiao
- SPARQL Generation: An Analysis On Fine-tuning Openllama For Question Answering Over A Life Science Knowledge Graph Rangel Julio C., De Farias Tarcisio Mendes, Sima Ana Claudia, Kobayashi Norio
- RAVEN: Multitask Retrieval Augmented Vision-language Learning Rao Varun Nagaraj, Choudhary Siddharth, Deshpande Aditya, Satzoda Ravi Kumar, Appalaraju Srikar
- A Critical Look At Tokenwise Reward-guided Text Generation Rashid Ahmad, Wu Ruotian, Grosse Julia, Kristiadi Agustinus, Poupart Pascal
- Narrow Transformer: Starcoder-based Java-lm For Desktop Rathinasamy Kamalkumar, J Balaji A, Kumar Ankush, Gayari Gagan, K Harshini, Mondal Rajab Ali, S Sreenivasa Raghavan K, Singh Swayam, Tarafdar Mohammed Rafee
- Developing Safe And Responsible Large Language Model: Can We Balance Bias Reduction And Language Understanding In Large Language Models? Raza Shaina, Bamgbose Oluwanifemi, Ghuge Shardul, Tavakol Fatemeh, Reji Deepak John, Bashir Syed Raza
- Beads: Bias Evaluation Across Domains Raza Shaina, Rahman Mizanur, Zhang Michael R.
- MBIAS: Mitigating Bias In Large Language Models While Retaining Context Raza Shaina, Raval Ananya, Chatrath Veronica
- Analyzing And Adapting Large Language Models For Few-shot Multilingual NLU: Are We There Yet? Razumovskaia Evgeniia, Vulić Ivan, Korhonen Anna
- Retrieval-augmented Generation: Is Dense Passage Retrieval Retrieving? Reichman Benjamin, Heck Larry
- Identifying Semantic Induction Heads To Understand In-context Learning Ren Jie, Guo Qipeng, Yan Hang, Liu Dongrui, Zhang Quanshi, Qiu Xipeng, Lin Dahua
- Samba: Simple Hybrid State Space Models For Efficient Unlimited Context Language Modeling Ren Liliang, Liu Yang, Lu Yadong, Shen Yelong, Liang Chen, Chen Weizhu
- Melora: Mini-ensemble Low-rank Adapters For Parameter-efficient Fine-tuning Ren Pengjie, Shi Chengshun, Wu Shiguang, Zhang Mengqi, Ren Zhaochun, De Rijke Maarten, Chen Zhumin, Pei Jiahuan
- Codeattack: Revealing Safety Generalization Challenges Of Large Language Models Via Code Completion Ren Qibing, Gao Chang, Shao Jing, Yan Junchi, Tan Xin, Lam Wai, Ma Lizhuang
- Analyzing And Reducing Catastrophic Forgetting In Parameter Efficient Tuning Ren Weijieying, Li Xinlong, Wang Lei, Zhao Tianxiang, Qin Wei
- I Learn Better If You Speak My Language: Understanding The Superior Performance Of Fine-tuning Large Language Models With Llm-generated Responses Ren Xuan, Wu Biao, Liu Lingqiao
- Healthcare Copilot: Eliciting The Power Of General Llms For Medical Consultation Ren Zhiyao, Zhan Yibing, Yu Baosheng, Ding Liang, Tao Dacheng
- Strengthening Multimodal Large Language Model With Bootstrapped Preference Optimization Pi Renjie, Han Tianyang, Xiong Wei, Zhang Jipeng, Liu Runtao, Pan Rui, Zhang Tong
- Are Electra's Sentence Embeddings Beyond Repair? The Case Of Semantic Textual Similarity Rep Ivan, Dukić David, Šnajder Jan
- Self-generated Replay Memories For Continual Neural Machine Translation Resta Michele, Bacciu Davide
- Practical And Reproducible Symbolic Music Generation By Large Language Models With Structural Embeddings Rhyu Seungyeon, Yang Kichang, Cho Sungjun, Kim Jaehyeon, Lee Kyogu, Lee Moontae
- How Multilingual Are Large Language Models Fine-tuned For Translation? Richburg Aquia, Carpuat Marine
- Quantifying Contamination In Evaluating Code Generation Capabilities Of Language Models Riddell Martin, Ni Ansong, Cohan Arman
- Instruction-tuned Large Language Models For Machine Translation In The Medical Domain Rios Miguel
- Simulating Weighted Automata Over Sequences And Trees With Transformers Rizvi Michael, Lizaire Maude, Lacroce Clara, Rabusseau Guillaume
- Pretrained Hybrids With MAD Skills Roberts Nicholas, Guo Samuel, Gao Zhiqi, Gnvv Satya Sai Srinath Namburi, Cromp Sonia, Wu Chengjun, Duan Chengyu, Sala Frederic
- Exploring Advanced Large Language Models With Llmsuite Roffo Giorgio
- Unibucllm: Harnessing Llms For Automated Prediction Of Item Difficulty And Response Time For Multiple-choice Questions Rogoz Ana-cristina, Ionescu Radu Tudor
- Achieving Peak Performance For Large Language Models: A Systematic Review Rostam Zhyar Rzgar K, Szénási Sándor, Kertész Gábor
- Convolutional Prompting Meets Language Models For Continual Learning Roy Anurag, Moulick Riddhiman, Verma Vinay K., Ghosh Saptarshi, Das Abir
- Fist-financial Style Transfer With Hallucination And Creativity Control Framework Roychowdhury Sohini, Krema Marko, Moore Brian, Lai Xingjian, Effedua Dike, Jethwani Bharat
- Evaluation Of RAG Metrics For Question Answering In The Telecom Domain Roychowdhury Sujoy, Soman Sumit, Ranjani H G, Gunda Neeraj, Chhabra Vansh, Bala Sai Krishna
- Efficient Language Adaptive Pre-training: Extending State-of-the-art Large Language Models For Polish Ruciński Szymon
- From PEFT To DEFT: Parameter Efficient Finetuning For Reducing Activation Density In Transformers Runwal Bharat, Pedapati Tejaswini, Chen Pin-yu
- Writing In The Margins: Better Inference Pattern For Long Context Retrieval Russak Melisa, Jamil Umar, Bryant Christopher, Kamble Kiran, Magnuson Axel, Russak Mateusz, Alshikh Waseem
- Tandem Transformers For Inference Efficient Llms S Aishwarya P, Nair Pranav Ajit, Samaga Yashas, Boyd Toby, Kumar Sanjiv, Jain Prateek, Netrapalli Praneeth
- In-context Learning Demonstration Selection Via Influence Analysis S. Vinay M., Van Minh-hao, Wu Xintao
- Benchmarking And Building Long-context Retrieval Models With Loco And M2-BERT Saad-falcon Jon, Fu Daniel Y., Arora Simran, Guha Neel, Ré Christopher
- Utebc-nlp At Semeval-2024 Task 9: Can Llms Be Lateral Thinkers? Sadeghi Pouya, Abaskohi Amirhossein, Yaghoobzadeh Yadollah
- Athena: Safe Autonomous Agents With Verbal Contrastive Learning Sadhu Tanmana, Pesaranghader Ali, Chen Yanan, Yi Dong Hoon
- Insights Into Alignment: Evaluating DPO And Its Variants Across Multiple Tasks Saeidi Amir, Verma Shivanshu, Baral Chitta
- On Zero-shot Counterspeech Generation By Llms Saha Punyajoy, Agrawal Aalok, Jana Abhik, Biemann Chris, Mukherjee Animesh
- Where Is The Answer? Investigating Positional Bias In Language Model Knowledge Extraction Saito Kuniaki, Sohn Kihyuk, Lee Chen-yu, Ushiku Yoshitaka
- Algorithmic Language Models With Neurally Compiled Libraries Saldyt Lucas, Kambhampati Subbarao
- Berts Are Generative In-context Learners Samuel David
- Rainbow Teaming: Open-ended Generation Of Diverse Adversarial Prompts Samvelyan Mikayel, Raparthy Sharath Chandra, Lupu Andrei, Hambro Eric, Markosyan Aram H., Bhatt Manish, Mao Yuning, Jiang Minqi, Parker-holder Jack, Foerster Jakob, Rocktäschel Tim, Raileanu Roberta
- Lean Attention: Hardware-aware Scalable Attention Mechanism For The Decode-phase Of Transformers Sanovar Rya, Bharadwaj Srikant, Amant Renee St., Rühle Victor, Rajmohan Saravan
- Advancing Generative AI For Portuguese With Open Decoder Gervásio PT* Santos Rodrigo, Silva João, Gomes Luís, Rodrigues João, Branco António
- Pre-training Small Base Lms With Fewer Tokens Sanyal Sunny, Sanghavi Sujay, Dimakis Alexandros G.
- Improving Sentence Embeddings With Automatic Generation Of Training Data Using Few-shot Examples Sato Soma, Tsukagoshi Hayato, Sasano Ryohei, Takeda Koichi
- Can Small Language Models Learn, Unlearn, And Retain Noise Patterns? Scaria Nicy, Kennedy Silvester John Joseph, Subramani Deepak
- Towards Faithful And Robust LLM Specialists For Evidence-based Question-answering Schimanski Tobias, Ni Jingwei, Kraus Mathias, Ash Elliott, Leippold Markus
- Detecting Generated Native Ads In Conversational Search Schmidt Sebastian, Zelch Ines, Bevendorff Janek, Stein Benno, Hagen Matthias, Potthast Martin
- Soft Prompt Threats: Attacking Safety Alignment And Unlearning In Open-source Llms Through The Embedding Space Schwinn Leo, Dobre David, Xhonneux Sophie, Gidel Gauthier, Gunnemann Stephan
- Generative Explore-exploit: Training-free Optimization Of Generative Recommender Systems Using LLM Optimizers Senel Lütfi Kerem, Fetahu Besnik, Yoshida Davis, Chen Zhiyu, Castellucci Giuseppe, Vedula Nikhita, Choi Jason, Malmasi Shervin
- Surprising Efficacy Of Fine-tuned Transformers For Fact-checking Over Larger Language Models Setty Vinay
- Tencdm: Understanding The Properties Of Diffusion Model In The Space Of Language Model Encodings Shabalin Alexander, Meshchaninov Viacheslav, Chimbulatov Egor, Lapikov Vladislav, Kim Roman, Bartosh Grigory, Molchanov Dmitry, Markov Sergey, Vetrov Dmitry
- Flashattention-3: Fast And Accurate Attention With Asynchrony And Low-precision Shah Jay, Bikshandi Ganesh, Zhang Ying, Thakkar Vijay, Ramani Pradeep, Dao Tri
- Detection And Measurement Of Syntactic Templates In Generated Text Shaib Chantal, Elazar Yanai, Li Junyi Jessy, Wallace Byron C.
- Show, Don't Tell: Aligning Language Models With Demonstrated Feedback Shaikh Omar, Lam Michelle, Hejna Joey, Shao Yijia, Bernstein Michael, Yang Diyi
- Evaluating Text Summaries Generated By Large Language Models Using Openai's GPT Shakil Hassan, Mahi Atqiya Munawara, Nguyen Phuoc, Ortiz Zeydy, Mardini Mamoun T.
- Llava-prumerge: Adaptive Token Reduction For Efficient Large Multimodal Models Shang Yuzhang, Cai Mu, Xu Bingxin, Lee Yong Jae, Yan Yan
- Flexibly Scaling Large Language Models Contexts Through Extensible Tokenization Shao Ninglu, Xiao Shitao, Liu Zheng, Zhang Peitian
- Guiding In-context Learning Of Llms Through Quality Estimation For Machine Translation Sharami Javad Pourmostafa Roshan, Shterionov Dimitar, Spronck Pieter
- A Critical Evaluation Of AI Feedback For Aligning Large Language Models Sharma Archit, Keh Sedrick, Mitchell Eric, Finn Chelsea, Arora Kushal, Kollar Thomas
- Patched MOA: Optimizing Inference For Diverse Software Development Tasks Sharma Asankhaya
- Accurate And Efficient Fine-tuning Of Quantized Large Language Models Through Optimal Balance Shen Ao, Wang Qiang, Lai Zhiquan, Li Xionglve, Li Dongsheng
- Nemo-aligner: Scalable Toolkit For Efficient Model Alignment Shen Gerald, Wang Zhilin, Delalleau Olivier, Zeng Jiaqi, Dong Yi, Egert Daniel, Sun Shengyang, Zhang Jimmy, Jain Sahil, Taghibakhshi Ali, Ausin Markel Sanz, Aithal Ashwath, Kuchaiev Oleksii
- Tag-llm: Repurposing General-purpose Llms For Specialized Domains Shen Junhong, Tenenholtz Neil, Hall James Brian, Alvarez-melis David, Fusi Nicolo
- The Language Barrier: Dissecting Safety Challenges Of Llms In Multilingual Contexts Shen Lingfeng, Tan Weiting, Chen Sihao, Chen Yunmo, Zhang Jingyu, Xu Haoran, Zheng Boyuan, Koehn Philipp, Khashabi Daniel
- Exploring User Retrieval Integration Towards Large Language Models For Cross-domain Sequential Recommendation Shen Tingjia, Wang Hao, Zhang Jiaqing, Zhao Sirui, Li Liangyue, Chen Zulong, Lian Defu, Chen Enhong
- An Empirical Study On Context Length For Open-domain Dialog Generation Shen Xinyi, Lin Zuoquan
- Scaling Laws For Linear Complexity Language Models Shen Xuyang, Li Dong, Leng Ruitao, Qin Zhen, Sun Weigao, Zhong Yiran
- A Fine-tuning Dataset And Benchmark For Large Language Models For Protein Understanding Shen Yiqing, Chen Zan, Mamalakis Michail, He Luhan, Xia Haiyang, Li Tianbin, Su Yanzhou, He Junjun, Wang Yu Guang
- Toursynbio: A Multi-modal Large Model And Agent Framework To Bridge Text And Protein Sequences For Protein Engineering Shen Yiqing, Chen Zan, Mamalakis Michail, Liu Yungeng, Li Tianbin, Su Yanzhou, He Junjun, Liò Pietro, Wang Yu Guang
- Language Models Encode Collaborative Signals In Recommendation Sheng Leheng, Zhang An, Zhang Yi, Chen Yuxin, Wang Xiang, Chua Tat-seng
- Using Chatgpt To Score Essays And Short-form Constructed Responses Shermis Mark D.
- Latent Adversarial Training Improves Robustness To Persistent Harmful Behaviors In Llms Sheshadri Abhay, Ewart Aidan, Guo Phillip, Lynch Aengus, Wu Cindy, Hebbar Vivek, Sleight Henry, Stickland Asa Cooper, Perez Ethan, Hadfield-menell Dylan, Casper Stephen
- Neurosymbolic AI For Enhancing Instructability In Generative AI Sheth Amit, Pallagani Vishal, Roy Kaushik
- Continual Learning Of Large Language Models: A Comprehensive Survey Shi Haizhou, Xu Zihao, Wang Hengyi, Qin Weiyi, Wang Wenyuan, Wang Yibin, Wang Zifeng, Ebrahimi Sayna, Wang Hao
- Non-autoregressive Sequence-to-sequence Vision-language Models Shi Kunyu, Dong Qi, Goncalves Luis, Tu Zhuowen, Soatto Stefano
- Keep The Cost Down: A Review On Methods To Optimize LLM's Kv-cache Consumption Shi Luohe, Zhang Hongyi, Yao Yao, Li Zuchao, Zhao Hai
- Reslora: Identity Residual Mapping In Low-rank Adaption Shi Shuhua, Huang Shaohan, Song Minghui, Li Zhoujun, Zhang Zihan, Huang Haizhen, Wei Furu, Deng Weiwei, Sun Feng, Zhang Qi
- Math-llava: Bootstrapping Mathematical Reasoning For Multimodal Large Language Models Shi Wenhao, Hu Zhiqiang, Bin Yi, Liu Junhua, Yang Yang, Ng See-kiong, Bing Lidong, Lee Roy Ka-wei
- Medadapter: Efficient Test-time Adaptation Of Large Language Models Towards Medical Reasoning Shi Wenqi, Xu Ran, Zhuang Yuchen, Yu Yue, Wu Hang, Yang Carl, Wang May D.
- Why Larger Language Models Do In-context Learning Differently? Shi Zhenmei, Wei Junyi, Xu Zhuoyan, Liang Yingyu
- FIRST: Teach A Reliable Large Language Model Through Efficient Trustworthy Distillation Shum Kashun, Xu Minrui, Zhang Jianshu, Chen Zixin, Diao Shizhe, Dong Hanze, Zhang Jipeng, Raza Muhammad Omer
- Iclguard: Controlling In-context Learning Behavior For Applicability Authorization Si Wai Man, Backes Michael, Zhang Yang
- Where Does In-context Translation Happen In Large Language Models Sia Suzanna, Mueller David, Duh Kevin
- H2o-danube-1.8b Technical Report Singer Philipp, Pfeiffer Pascal, Babakhin Yauhen, Jeblick Maximilian, Dhankhar Nischay, Fodor Gabor, Ambati Sri Satish
- Aya Dataset: An Open-access Collection For Multilingual Instruction Tuning Singh Shivalika, Vargus Freddie, Dsouza Daniel, Karlsson Börje F., Mahendiran Abinaya, Ko Wei-yin, Shandilya Herumb, Patel Jay, Mataciunas Deividas, Omahony Laura, Zhang Mike, Hettiarachchi Ramith, Wilson Joseph, Machado Marina, Moura Luisa Souza, Krzemiński Dominik, Fadaei Hakimeh, Ergün Irem, Okoh Ifeoma, Alaagib Aisha, Mudannayake Oshan, Alyafeai Zaid, Chien Vu Minh, Ruder Sebastian, Guthikonda Surya, Alghamdi Emad A., Gehrmann Sebastian, Muennighoff Niklas, Bartolo Max, Kreutzer Julia, Üstün Ahmet, Fadaee Marzieh, Hooker Sara
- Evaluating Zero-shot GPT-4V Performance On 3D Visual Question Answering Benchmarks Singh Simranjit, Pavlakos Georgios, Stamoulis Dimitrios
- Llava Finds Free Lunch: Teaching Human Behavior Improves Content Understanding Abilities Of Llms Singh Somesh, S Harini I, Singla Yaman K, Baths Veeky, Shah Rajiv Ratn, Chen Changyou, Krishnamurthy Balaji
- An Empirical Study Of Validating Synthetic Data For Formula Generation Singh Usneek, Cambronero José, Gulwani Sumit, Kanade Aditya, Khatry Anirudh, Le Vu, Singh Mukul, Verbruggen Gust
- A Three-pronged Approach To Cross-lingual Adaptation With Multilingual Llms Singh Vaibhav, Krishna Amrith, Nj Karthika, Ramakrishnan Ganesh
- Maml-en-llm: Model Agnostic Meta-training Of Llms For Improved In-context Learning Sinha Sanchit, Yue Yuguang, Soto Victor, Kulkarni Mayank, Lu Jianhua, Zhang Aidong
- Unipa-gpt: Large Language Models For University-oriented QA In Italian Siragusa Irene, Pirrone Roberto
- The Mysterious Case Of Neuron 1512: Injectable Realignment Architectures Reveal Internal Characteristics Of Meta's Llama 2 Model Smith Brenden, Baker Dallin, Chase Clayton, Barney Myles, Parker Kaden, Allred Makenna, Hu Peter, Evans Alex, Fulda Nancy
- Scaling LLM Test-time Compute Optimally Can Be More Effective Than Scaling Model Parameters Snell Charlie, Lee Jaehoon, Xu Kelvin, Kumar Aviral
- Gistembed: Guided In-sample Selection Of Training Negatives For Text Embedding Fine-tuning Solatorio Aivin V.
- A Statistical Framework For Weak-to-strong Generalization Somerstep Seamus, Polo Felipe Maia, Banerjee Moulinath, Ritov Ya'acov, Yurochkin Mikhail, Sun Yuekai
- A New Pipeline For Generating Instruction Dataset Via RAG And Self Fine-tuning Song Chih-wei, Lee Yu-kai, Tsai Yin-te
- Moviechat+: Question-aware Sparse Memory For Long Video Question Answering Song Enxin, Chai Wenhao, Ye Tian, Hwang Jenq-neng, Li Xi, Wang Gaoang
- Scaling Data Diversity For Fine-tuning Language Models In Human Alignment Song Feifan, Yu Bowen, Lang Hao, Yu Haiyang, Huang Fei, Wang Houfeng, Li Yongbin
- Increasing Model Capacity For Free: A Simple Strategy For Parameter Efficient Fine-tuning Song Haobo, Zhao Hao, Majumder Soumajit, Lin Tao
- Out-of-distribution Generalization Via Composition: A Lens Through Induction Heads In Transformers Song Jiajun, Xu Zhuoyan, Zhong Yiqiao
- Hierarchical Context Merging: Better Long Context Understanding For Pre-trained Llms Song Woomin, Oh Seunghyuk, Mo Sangwoo, Kim Jaehyung, Yun Sukmin, Ha Jung-woo, Shin Jinwoo
- Position: Leverage Foundational Models For Black-box Optimization Song Xingyou, Tian Yingtao, Lange Robert Tjarko, Lee Chansoo, Tang Yujin, Chen Yutian
- ELLA-V: Stable Neural Codec Language Modeling With Alignment-guided Sequence Reordering Song Yakun, Chen Zhuo, Wang Xiaofei, Ma Ziyang, Chen Xie
- Sharelora: Parameter Efficient And Robust Large Language Model Fine-tuning Via Shared Low-rank Adaptation Song Yurun, Zhao Junchen, Harris Ian G., Jyothi Sangeetha Abdu
- Alchemistcoder: Harmonizing And Eliciting Code Capability By Hindsight Tuning On Multi-source Data Song Zifan, Wang Yudong, Zhang Wenwei, Liu Kuikun, Lyu Chengqi, Song Demin, Guo Qipeng, Yan Hang, Lin Dahua, Chen Kai, Zhao Cairong
- Fine Tuning Vs. Retrieval Augmented Generation For Less Popular Knowledge Soudani Heydar, Kanoulas Evangelos, Hasibi Faegheh
- Generaitor: Tree-in-the-loop Text Generation For Language Model Explainability And Adaptation Spinner Thilo, Kehlbeck Rebecca, Sevastjanova Rita, Stähle Tobias, Keim Daniel A., Deussen Oliver, El-assady Mennatallah
- Repetition Improves Language Model Embeddings Springer Jacob Mitchell, Kotha Suhas, Fried Daniel, Neubig Graham, Raghunathan Aditi
- PUB: A Pragmatics Understanding Benchmark For Assessing Llms' Pragmatics Capabilities Sravanthi Settaluri Lakshmi, Doshi Meet, Kalyan Tankala Pavan, Murthy Rudra, Bhattacharyya Pushpak, Dabre Raj
- Canttalkaboutthis: Aligning Language Models To Stay On Topic In Dialogues Sreedhar Makesh Narsimhan, Rebedea Traian, Ghosh Shaona, Zeng Jiaqi, Parisien Christopher
- Comparative Analysis Of Different Efficient Fine Tuning Methods Of Large Language Models (llms) In Low-resource Setting Srinivasan Krishna Prasad Varadarajan, Gumpena Prasanth, Yattapu Madhusudhana, Brahmbhatt Vishal H.
- The Fine-tuning Paradox: Boosting Translation Quality Without Sacrificing LLM Abilities Stap David, Hasler Eva, Byrne Bill, Monz Christof, Tran Ke
- Fine-tuning Large Language Models For Entity Matching Steiner Aaron, Peeters Ralph, Bizer Christian
- Numllm: Numeric-sensitive Large Language Model For Chinese Finance Su Huan-yi, Wu Ke, Huang Yu-hao, Li Wu-jun
- Investigating Video Reasoning Capability Of Large Language Models With Tropes In Movies Su Hung-ting, Chao Chun-tong, Hsu Ya-ching, Lin Xudong, Niu Yulei, Lee Hung-yi, Hsu Winston H.
- Mission Impossible: A Statistical Perspective On Jailbreaking Llms Su Jingtong, Kempe Julia, Ullrich Karen
- M-QALM: A Benchmark To Assess Clinical Reading Comprehension And Knowledge Recall In Large Language Models Via Question Answering Subramanian Anand, Schlegel Viktor, Kashyap Abhinav Ramesh, Nguyen Thanh-tung, Dwivedi Vijay Prakash, Winkler Stefan
- 70b-parameter Large Language Models In Japanese Medical Question-answering Sukeda Issey, Kishikawa Risa, Kodera Satoshi
- Persona-db: Efficient Large Language Model Personalization For Response Prediction With Collaborative Data Refinement Sun Chenkai, Yang Ke, Reddy Revanth Gangi, Fung Yi R., Chan Hou Pong, Small Kevin, Zhai Chengxiang, Ji Heng
- Sq-llava: Self-questioning For Large Vision-language Assistant Sun Guohao, Qin Can, Wang Jiamian, Chen Zeyuan, Xu Ran, Tao Zhiqiang
- Supervised Fine-tuning As Inverse Reinforcement Learning Sun Hao
- Fuxitranyu: A Multilingual Large Language Model Trained With Balanced Data Sun Haoran, Jin Renren, Xu Shaoyang, Pan Leiyu, Supryadi, Cui Menglong, Du Jiangcun, Lei Yikun, Yang Lei, Shi Ling, Xiao Juesi, Zhu Shaolin, Xiong Deyi
- Bbox-adapter: Lightweight Adapting For Black-box Large Language Models Sun Haotian, Zhuang Yuchen, Wei Wei, Zhang Chao, Dai Bo
- Lawluo: A Chinese Law Firm Co-run By LLM Agents Sun Jingyun, Dai Chengxiao, Luo Zhongze, Chang Yangbo, Li Yang
- Fine-tuning Vs Prompting, Can Language Models Understand Human Values? Sun Pingwei
- Diversifying Query: Region-guided Transformer For Temporal Sentence Grounding Sun Xiaolong, Shi Liushuai, Wang Le, Zhou Sanping, Xia Kun, Wang Yabing, Hua Gang
- Expert-guided Extinction Of Toxic Tokens For Debiased Generation Sun Xueyao, Shi Kaize, Tang Haoran, Xu Guandong, Li Qing
- QDA-SQL: Questions Enhanced Dialogue Augmentation For Multi-turn Text-to-sql Sun Yinggang, Guo Ziming, Yu Haining, Liu Chuanyi, Li Xiang, Wang Bingxuan, Yu Xiangzhan, Zhao Tiancheng
- You Only Cache Once: Decoder-decoder Architectures For Language Models Sun Yutao, Dong Li, Zhu Yi, Huang Shaohan, Wang Wenhui, Ma Shuming, Zhang Quanlu, Wang Jianyong, Wei Furu
- Leveraging Chatgpt In Pharmacovigilance Event Extraction: An Empirical Study Sun Zhaoyue, Pergola Gabriele, Wallace Byron C., He Yulan
- Towards Smaller, Faster Decoder-only Transformers: Architectural Variants And Their Implications Suresh Sathya Krishnan, P Shunmugapriya
- Automating Research Synthesis With Domain-specific Large Language Model Fine-tuning Susnjak Teo, Hwang Peter, Reyes Napoleon H., Barczak Andre L. C., Mcintosh Timothy R., Ranathunga Surangika
- Freely Long-thinking Transformer (frailt) Tabak Akbay
- JORA: JAX Tensor-parallel Lora Library For Retrieval Augmented Fine-tuning Tahir Anique, Cheng Lu, Liu Huan
- PIXAR: Auto-regressive Language Modeling In Pixel Space Tai Yintao, Liao Xiyang, Suglia Alessandro, Vergari Antonio
- Pretraining And Updating Language- And Domain-specific Large Language Model: A Case Study In Japanese Business Domain Takahashi Kosuke, Omi Takahiro, Arima Kosuke, Ishigaki Tatsuya
- 1.5-pints Technical Report: Pretraining In Days, Not Months -- Your Language Model Thrives On Quality Data Tan Calvin, Wang Jerome
- Fine-tuning Large Language Model (LLM) Artificial Intelligence Chatbots In Ophthalmology And Llm-based Evaluation Using GPT-4 Tan Ting Fang, Elangovan Kabilan, Jin Liyuan, Jie Yao, Yong Li, Lim Joshua, Poh Stanley, Ng Wei Yan, Lim Daniel, Ke Yuhe, Liu Nan, Ting Daniel Shu Wei
- Democratizing Large Language Models Via Personalized Parameter-efficient Fine-tuning Tan Zhaoxuan, Zeng Qingkai, Tian Yijun, Liu Zheyuan, Yin Bing, Jiang Meng
- DLO: Dynamic Layer Operation For Efficient Vertical Scaling Of Llms Tan Zhen, Dong Daize, Zhao Xinyu, Peng Jie, Cheng Yu, Chen Tianlong
- Can I Understand What I Create? Self-knowledge Evaluation Of Large Language Models Tan Zhiquan, Wei Lai, Wang Jindong, Xie Xing, Huang Weiran
- EVALALIGN: Supervised Fine-tuning Multimodal Llms With Human-aligned Data For Evaluating Text-to-image Models Tan Zhiyu, Yang Xiaomeng, Qin Luozheng, Yang Mengping, Zhang Cheng, Li Hao
- Vilco-bench: Video Language Continual Learning Benchmark Tang Tianqi, Deldari Shohreh, Xue Hao, De Melo Celso, Salim Flora D.
- Language-specific Neurons: The Key To Multilingual Capabilities In Large Language Models Tang Tianyi, Luo Wenyang, Huang Haoyang, Zhang Dongdong, Wang Xiaolei, Zhao Xin, Wei Furu, Wen Ji-rong
- On The Hardness Of Faithful Chain-of-thought Reasoning In Large Language Models Tanneru Sree Harsha, Ley Dan, Agarwal Chirag, Lakkaraju Himabindu
- Unlocking The Potential Of Model Merging For Low-resource Languages Tao Mingxu, Zhang Chen, Huang Quzhe, Ma Tianyao, Huang Songfang, Zhao Dongyan, Feng Yansong
- MEEL: Multi-modal Event Evolution Learning Tao Zhengwei, Jin Zhi, Huang Junqiang, Chen Xiancai, Bai Xiaoying, Zhao Haiyan, Zhang Yifan, Tao Chongyang
- Exploring Design Choices For Building Language-specific Llms Tejaswi Atula, Gupta Nilesh, Choi Eunsol
- Equipping Language Models With Tool Use Capability For Tabular Data Analysis In Finance Theuma Adrian, Shareghi Ehsan
- Limits Of Transformer Language Models On Learning To Compose Algorithms Thomm Jonathan, Terzic Aleksandar, Camposampiero Giacomo, Hersche Michael, Schölkopf Bernhard, Rahimi Abbas
- Elsevier Arena: Human Evaluation Of Chemistry/biology/health Foundational Large Language Models Thorne Camilo, Druckenbrodt Christian, Szarkowska Kinga, Goyal Deepika, Marajan Pranita, Somanath Vijay, Harper Corey, Yan Mao, Scerri Tony
- Hydralora: An Asymmetric Lora Architecture For Efficient Fine-tuning Tian Chunlin, Shi Zhan, Guo Zhijiang, Li Li, Xu Chengzhong
- Toward Self-improvement Of Llms Via Imagination, Searching, And Criticizing Tian Ye, Peng Baolin, Song Linfeng, Jin Lifeng, Yu Dian, Mi Haitao, Yu Dong
- Cantor: Inspiring Multimodal Chain-of-thought Of MLLM Gao Timin, Chen Peixian, Zhang Mengdan, Fu Chaoyou, Shen Yunhang, Zhang Yan, Zhang Shengchuan, Zheng Xiawu, Sun Xing, Cao Liujuan, Ji Rongrong
- Zyda: A 1.3T Dataset For Open Language Modeling Tokpanov Yury, Millidge Beren, Glorioso Paolo, Pilault Jonathan, Ibrahim Adam, Whittington James, Anthony Quentin
- An Ecosage Assistant: Towards Building A Multimodal Plant Care Dialogue Assistant Tomar Mohit, Tiwari Abhisek, Saha Tulika, Jha Prince, Saha Sriparna
- Understanding Linear Probing Then Fine-tuning Language Models From NTK Perspective Tomihari Akiyoshi, Sato Issei
- Eyes Wide Shut? Exploring The Visual Shortcomings Of Multimodal Llms Tong Shengbang, Liu Zhuang, Zhai Yuexiang, Ma Yi, Lecun Yann, Xie Saining
- Can Llms Learn From Previous Mistakes? Investigating Llms' Errors To Boost For Reasoning Tong Yongqi, Li Dawei, Wang Sizhe, Wang Yujia, Teng Fei, Shang Jingbo
- Optimizing Language Model's Reasoning Abilities With Weak Supervision Tong Yongqi, Wang Sizhe, Li Dawei, Wang Yifan, Han Simeng, Lin Zi, Huang Chengsong, Huang Jiaxin, Shang Jingbo
- Few-shot Cross-lingual Transfer For Prompting Large Language Models In Low-resource Languages Toukmaji Christopher
- Transformers Are Ssms: Generalized Models And Efficient Algorithms Through Structured State Space Duality Dao Tri, Gu Albert
- Discrete Multimodal Transformers With A Pretrained Large Language Model For Mixed-supervision Speech Processing Trinh Viet Anh, Southwell Rosy, Guan Yiwen, He Xinlu, Wang Zhiyong, Whitehill Jacob
- Linguacodus: A Synergistic Framework For Transformative Code Generation In Machine Learning Pipelines Trofimova Ekaterina, Sataev Emil, Ustyuzhanin Andrey E.
- Crossing Linguistic Horizons: Finetuning And Comprehensive Evaluation Of Vietnamese Large Language Models Truong Sang T., Nguyen Duc Q., Nguyen Toan, Le Dong D., Truong Nhi N., Quan Tho, Koyejo Sanmi
- Code Less, Align More: Efficient LLM Fine-tuning For Code Generation With Data Pruning Tsai Yun-da, Liu Mingjie, Ren Haoxing
- Towards Safety And Helpfulness Balanced Responses Via Controllable Large Language Models Tuan Yi-lin, Chen Xilun, Smith Eric Michael, Martin Louis, Batra Soumya, Celikyilmaz Asli, Wang William Yang, Bikel Daniel M.
- Unmasking The Imposters: In-domain Detection Of Human Vs. Machine-generated Tweets Tuck Bryan E., Verma Rakesh M.
- LM Transparency Tool: Interactive Tool For Analyzing Transformer Language Models Tufanov Igor, Hambardzumyan Karen, Ferrando Javier, Voita Elena
- Dynamo: Accelerating Language Model Inference With Dynamic Multi-token Sampling Tuli Shikhar, Lin Chi-heng, Hsu Yen-chang, Jha Niraj K., Shen Yilin, Jin Hongxia
- VBART: The Turkish LLM Turker Meliksah, Ari Mehmet Erdi, Han Aydin
- Enhancing Inference Efficiency Of Large Language Models: Investigating Optimization Strategies And Architectural Innovations Tyukin Georgy
- Asking And Answering Questions To Extract Event-argument Structures Uddin Md Nayem, George Enfa Rose, Blanco Eduardo, Corman Steven
- Bootstrapping Llm-based Task-oriented Dialogue Agents Via Self-talk Ulmer Dennis, Mansimov Elman, Lin Kaixiang, Sun Justin, Gao Xibin, Zhang Yi
- Autopuredata: Automated Filtering Of Web Data For LLM Fine-tuning Vadlapati Praneeth
- Accelerating Large Language Model Inference With Self-supervised Early Exits Valade Florian
- Gender Bias In Machine Translation And The Era Of Large Language Models Vanmassenhove Eva
- A Survey Of Prompt Engineering Methods In Large Language Models For Different NLP Tasks Vatsal Shubham, Dubey Harsh
- Question Suggestion For Conversational Shopping Assistants Using Product Metadata Vedula Nikhita, Rokhlenko Oleg, Malmasi Shervin
- Cross-modal Projection In Multimodal Llms Doesn't Really Project Visual Attributes To Textual Space Verma Gaurav, Choi Minje, Sharma Kartik, Watson-daniels Jamelle, Oh Sejoon, Kumar Srijan
- How Much Data Is Enough Data? Fine-tuning Large Language Models For In-house Translation: Performance Evaluation Across Multiple Dataset Sizes Vieira Inacio, Allred Will, Lankford Séamus, Castilho Sheila, Way Andy
- Redwhale: An Adapted Korean LLM Through Efficient Continual Pretraining Vo Anh-dung, Jung Minseong, Lee Wonbeen, Choi Daewoo
- Foundational Autoraters: Taming Large Language Models For Better Automatic Evaluation Vu Tu, Krishna Kalpesh, Alzubi Salaheddin, Tar Chris, Faruqui Manaal, Sung Yun-hsuan
- Generative Large Language Models In Automated Fact-checking: A Survey Vykopal Ivan, Pikuliak Matúš, Ostermann Simon, Šimko Marián
- Investigating Mysteries Of Cot-augmented Distillation Wadhwa Somin, Amir Silvio, Wallace Byron C.
- An Empirical Study Of Mamba-based Language Models Waleffe Roger, Byeon Wonmin, Riach Duncan, Norick Brandon, Korthikanti Vijay, Dao Tri, Gu Albert, Hatamizadeh Ali, Singh Sudhakar, Narayanan Deepak, Kulshreshtha Garvit, Singh Vartika, Casper Jared, Kautz Jan, Shoeybi Mohammad, Catanzaro Bryan
- Fusechat: Knowledge Fusion Of Chat Models Wan Fanqi, Zhong Longguang, Yang Ziyi, Chen Ruijun, Quan Xiaojun
- LARR: Large Language Model Aided Real-time Scene Recommendation With Semantic Understanding Wan Zhizhong, Yin Bin, Xie Junjie, Jiang Fei, Li Xiang, Lin Wei
- LOOK-M: Look-once Optimization In KV Cache For Efficient Multimodal Long-context Inference Wan Zhongwei, Wu Ziang, Liu Che, Huang Jinfa, Zhu Zhihong, Jin Peng, Wang Longyue, Yuan Li
- Datavist5: A Pre-trained Language Model For Jointly Understanding Text And Data Visualization Wan Zhuoyue, Song Yuanfeng, Li Shuaimin, Zhang Chen Jason, Wong Raymond Chi-wing
- COSMO: Contrastive Streamlined Multimodal Model With Interleaved Pre-training Wang Alex Jinpeng, Li Linjie, Lin Kevin Qinghong, Wang Jianfeng, Lin Kevin, Yang Zhengyuan, Wang Lijuan, Shou Mike Zheng
- IAA: Inner-adaptor Architecture Empowers Frozen Large Language Model With Multimodal Capabilities Wang Bin, Xie Chunyu, Leng Dawei, Yin Yuhui
- Grokked Transformers Are Implicit Reasoners: A Mechanistic Journey To The Edge Of Generalization Wang Boshi, Yue Xiang, Su Yu, Sun Huan
- Information Theoretic Text-to-image Alignment Wang Chao, Franzese Giulio, Finamore Alessandro, Gallo Massimo, Michiardi Pietro
- BLSP-KD: Bootstrapping Language-speech Pre-training Via Knowledge Distillation Wang Chen, Liao Minpeng, Huang Zhongqiang, Zhang Jiajun
- A Survey For Large Language Models In Biomedicine Wang Chong, Li Mengyao, He Junjun, Wang Zhongruo, Darzi Erfan, Chen Zan, Ye Jin, Li Tianbin, Su Yanzhou, Ke Jing, Qu Kaili, Li Shuxin, Yu Yi, Liò Pietro, Wang Tianyun, Wang Yu Guang, Shen Yiqing
- Adapting Open-source Large Language Models For Cost-effective, Expert-level Clinical Note Generation With On-policy Reinforcement Learning Wang Hanyin, Gao Chufan, Liu Bolun, Xu Qiping, Hussein Guleid, Labban Mohamad El, Iheasirim Kingsley, Korsapati Hariprasad, Outcalt Chuck, Sun Jimeng
- Reinforcement Learning For Edit-based Non-autoregressive Neural Machine Translation Wang Hao, Morimura Tetsuro, Honda Ukyo, Kawahara Daisuke
- Target-constrained Bidirectional Planning For Generation Of Target-oriented Proactive Dialogue Wang Jian, Lin Dongding, Li Wenjie
- Techgpt-2.0: A Large Language Model Project To Solve The Task Of Knowledge Graph Construction Wang Jiaqi, Chang Yuying, Li Zhong, An Ning, Ma Qi, Hei Lei, Luo Haibo, Lu Yifei, Ren Feiliang
- Length Generalization Of Causal Transformers Without Position Encoding Wang Jie, Ji Tao, Wu Yuanbin, Yan Hang, Gui Tao, Zhang Qi, Huang Xuanjing, Wang Xiaoling
- Helpful Or Harmful Data? Fine-tuning-free Shapley Attribution For Explaining Language Model Predictions Wang Jingtan, Lin Xiaoqiang, Qiao Rui, Foo Chuan-sheng, Low Bryan Kian Hsiang
- JMLR: Joint Medical LLM And Retrieval Training For Enhancing Reasoning And Professional Question Answering Capability Wang Junda, Yang Zhichao, Yao Zonghai, Yu Hong
- Mambabyte: Token-free Selective State Space Model Wang Junxiong, Gangavarapu Tushaar, Yan Jing Nathan, Rush Alexander M.
- The Mamba In The Llama: Distilling And Accelerating Hybrid Models Wang Junxiong, Paliotta Daniele, May Avner, Rush Alexander M., Dao Tri
- Beyond The Known: Investigating Llms Performance On Out-of-domain Intent Detection Wang Pei, He Keqing, Wang Yejie, Song Xiaoshuai, Mou Yutao, Wang Jingang, Xian Yunsen, Cai Xunliang, Xu Weiran
- Inferaligner: Inference-time Alignment For Harmlessness Through Cross-model Guidance Wang Pengyu, Zhang Dong, Li Linyang, Tan Chenkun, Wang Xinghao, Ren Ke, Jiang Botian, Qiu Xipeng
- Deconfounded Causality-aware Parameter-efficient Fine-tuning For Problem-solving Improvement Of Llms Wang Ruoyu, Li Xiaoxuan, Yao Lina
- Unique Security And Privacy Threats Of Large Language Model: A Comprehensive Survey Wang Shang, Zhu Tianqing, Liu Bo, Ding Ming, Guo Xu, Ye Dayong, Zhou Wanlei, Yu Philip S.
- Lora Meets Dropout Under A Unified Framework Wang Sheng, Chen Liheng, Jiang Jiyue, Xue Boyang, Kong Lingpeng, Wu Chuan
- Zero-shot Generative Large Language Models For Systematic Review Screening Automation Wang Shuai, Scells Harrisen, Zhuang Shengyao, Potthast Martin, Koopman Bevan, Zuccon Guido
- Resonance Rope: Improving Context Length Generalization Of Large Language Models Wang Suyuchen, Kobyzev Ivan, Lu Peng, Rezagholizadeh Mehdi, Liu Bang
- Weaver: Foundation Models For Creative Writing Wang Tiannan, Chen Jiamin, Jia Qingrui, Wang Shuai, Fang Ruoyu, Wang Huilin, Gao Zhaowei, Xie Chunzhao, Xu Chuou, Dai Jihong, Liu Yibin, Wu Jialong, Ding Shengwei, Li Long, Huang Zhiwei, Deng Xinle, Yu Teng, Ma Gangan, Xiao Han, Chen Zixin, Xiang Danjun, Wang Yunxia, Zhu Yuanyuan, Xiao Yi, Wang Jing, Wang Yiru, Ding Siran, Huang Jiayang, Xu Jiayi, Tayier Yilihamu, Hu Zhenyu, Gao Yuan, Zheng Chengfeng, Ye Yueshu, Li Yihang, Wan Lei, Jiang Xinyue, Wang Yujie, Cheng Siyu, Song Zhule, Tang Xiangru, Xu Xiaohua, Zhang Ningyu, Chen Huajun, Jiang Yuchen Eleanor, Zhou Wangchunshu
- Using Advanced Llms To Enhance Smaller Llms: An Interpretable Knowledge Distillation Approach Wang Tong, Sudhir K., Hong Dat
- QCRD: Quality-guided Contrastive Rationale Distillation For Large Language Models Wang Wei, Li Zhaowei, Xu Qi, Cai Yiqing, Song Hang, Qi Qi, Zhou Ran, Huang Zhida, Wang Tao, Xiao Li
- Secokd: Aligning Large Language Models For In-context Learning With Fewer Shots Wang Weixing, Yang Haojin, Meinel Christoph
- Metatool: Facilitating Large Language Models To Master Tools With Meta-task Augmentation Wang Xiaohan, Li Dian, Zhao Yilin, Sinbadliu, Wang Hui
- Validating Llm-generated Programs With Metamorphic Prompt Testing Wang Xiaoyin, Zhu Dakai
- Longllava: Scaling Multi-modal Llms To 1000 Images Efficiently Via Hybrid Architecture Wang Xidong, Song Dingjie, Chen Shunian, Zhang Chen, Wang Benyou
- "my Answer Is C": First-token Probabilities Do Not Match Text Answers In Instruction-tuned Language Models Wang Xinpeng, Ma Bolei, Hu Chengzhi, Weber-genzel Leon, Röttger Paul, Kreuter Frauke, Hovy Dirk, Plank Barbara
- EAGER: Two-stream Generative Recommender With Behavior-semantic Collaboration Wang Ye, Xun Jiahao, Hong Minjie, Zhu Jieming, Jin Tao, Lin Wang, Li Haoyuan, Li Linjun, Xia Yan, Zhao Zhou, Dong Zhenhua
- Inscl: A Data-efficient Continual Learning Paradigm For Fine-tuning Large Language Models With Instructions Wang Yifan, Liu Yafei, Shi Chufan, Li Haoling, Chen Chen, Lu Haonan, Yang Yujiu
- A Theoretical Understanding Of Self-correction Through In-context Alignment Wang Yifei, Wu Yuyang, Wei Zeming, Jegelka Stefanie, Wang Yisen
- Uncertainty Aware Learning For Language Model Alignment Wang Yikun, Zheng Rui, Ding Liang, Zhang Qi, Lin Dahua, Tao Dacheng
- Ladic: Are Diffusion Models Really Inferior To Autoregressive Counterparts For Image-to-text Generation? Wang Yuchi, Ren Shuhuai, Gao Rundong, Yao Linli, Guo Qingyan, An Kaikai, Bai Jianhong, Sun Xu
- Loma: Lossless Compressed Memory Attention Wang Yumeng, Xiao Zhenyang
- LSTP: Language-guided Spatial-temporal Prompt Learning For Long-form Video-text Understanding Wang Yuxuan, Wang Yueqian, Wu Pengfei, Liang Jianxin, Zhao Dongyan, Zheng Zilong
- To Recommend Or Not: Recommendability Identification In Conversations With Pre-trained Language Models Wang Zhefan, Ma Weizhi, Zhang Min
- A Comprehensive Survey Of LLM Alignment Techniques: RLHF, RLAIF, PPO, DPO And More Wang Zhichao, Bi Bin, Pentyala Shiva Kumar, Ramnath Kiran, Chaudhuri Sougata, Mehrotra Shubham, Zhu Zixu (James), Mao Xiang-bo, Asur Sitaram, Cheng Na (Claire)
- Re-task: Revisiting LLM Tasks From Capability, Skill, And Knowledge Perspectives Wang Zhihu, Zhao Shiwan, Wang Yu, Huang Heyuan, Shi Jiaxin, Xie Sitao, Wang Zhixing, Zhang Yubo, Li Hongyan, Yan Junchi
- Omnijarvis: Unified Vision-language-action Tokenization Enables Open-world Instruction Following Agents Wang Zihao, Cai Shaofei, Mu Zhancun, Lin Haowei, Zhang Ceyao, Liu Xuejie, Li Qing, Liu Anji, Ma Xiaojian, Liang Yitao
- C3LLM: Conditional Multimodal Content Generation Using Large Language Models Wang Zixuan, Duan Qinkai, Tai Yu-wing, Tang Chi-keung
- Re2llm: Reflective Reinforcement Large Language Model For Session-based Recommendation Wang Ziyan, Du Yingpeng, Sun Zhu, Chua Haoyan, Feng Kaidong, Wang Wenya, Zhang Jie
- Flora: Federated Fine-tuning Large Language Models With Heterogeneous Low-rank Adaptations Wang Ziyao, Shen Zheyu, He Yexiao, Sun Guoheng, Wang Hongyi, Lyu Lingjuan, Li Ang
- Measuring And Reducing LLM Hallucination Without Gold-standard Answers Wei Jiaheng, Yao Yuanshun, Ton Jean-francois, Guo Hongyi, Estornell Andrew, Liu Yang
- Instructrag: Instructing Retrieval-augmented Generation Via Self-synthesized Rationales Wei Zhepei, Chen Wei-lin, Meng Yu
- Grounding Data Science Code Generation With Input-output Specifications Wen Yeming, Yin Pengcheng, Shi Kensen, Michalewski Henryk, Chaudhuri Swarat, Polozov Alex
- How Chinese Are Chinese Language Models? The Puzzling Lack Of Language Policy In China's Llms Wen-yi Andrea W, Jo Unso Eun Seo, Lin Lu Jia, Mimno David
- Do Llamas Work In English? On The Latent Language Of Multilingual Transformers Wendler Chris, Veselovsky Veniamin, Monea Giovanni, West Robert
- Codeultrafeedback: An Llm-as-a-judge Dataset For Aligning Large Language Models To Coding Preferences Weyssow Martin, Kamanda Aton, Sahraoui Houari
- Towards Knowledge-grounded Natural Language Understanding And Generation Whitehouse Chenxi
- Answer, Assemble, Ace: Understanding How Transformers Answer Multiple Choice Questions Wiegreffe Sarah, Tafjord Oyvind, Belinkov Yonatan, Hajishirzi Hannaneh, Sabharwal Ashish
- Training-free Exponential Extension Of Sliding Window Context With Cascading KV Cache Willette Jeffrey, Lee Heejun, Lee Youngwan, Jeon Myeongjae, Hwang Sung Ju
- Memory Is All You Need: An Overview Of Compute-in-memory Architectures For Accelerating Large Language Model Inference Wolters Christopher, Yang Xiaoxuan, Schlichtmann Ulf, Suzumura Toyotaro
- Personalized Large Language Models Woźniak Stanisław, Koptyra Bartłomiej, Janz Arkadiusz, Kazienko Przemysław, Kocoń Jan
- Llama Pro: Progressive Llama With Block Expansion Wu Chengyue, Gan Yukang, Ge Yixiao, Lu Zeyu, Wang Jiahao, Feng Ye, Shan Ying, Luo Ping
- Sparkra: A Retrieval-augmented Knowledge Service System Based On Spark Large Language Model Wu Dayong, Li Jiaqi, Wang Baoxin, Zhao Honghong, Xue Siyuan, Yang Yanjie, Chang Zhijun, Zhang Rui, Qian Li, Wang Bo, Wang Shijin, Zhang Zhixiong, Hu Guoping
- Generative Monoculture In Large Language Models Wu Fan, Black Emily, Chandrasekaran Varun
- \(\beta\)-dpo: Direct Preference Optimization With Dynamic \(\beta\) Wu Junkang, Xie Yuexiang, Yang Zhengyi, Wu Jiancan, Gao Jinyang, Ding Bolin, Wang Xiang, He Xiangnan
- Noiseboost: Alleviating Hallucination With Noise Perturbation For Multimodal Large Language Models Wu Kai, Jiang Boyuan, Jiang Zhengkai, He Qingdong, Luo Donghao, Wang Shengzhi, Liu Qingwen, Wang Chengjie
- Adapting Large Language Models For Document-level Machine Translation Wu Minghao, Vu Thuy-trang, Qu Lizhen, Foster George, Haffari Gholamreza
- Mixture-of-skills: Learning To Optimize Data Usage For Fine-tuning Large Language Models Wu Minghao, Vu Thuy-trang, Qu Lizhen, Haffari Gholamreza
- ITERTL: An Iterative Framework For Fine-tuning Llms For RTL Code Generation Wu Peiyang, Guo Nan, Xiao Xiao, Li Wenming, Ye Xiaochun, Fan Dongrui
- Parallel Decoding Via Hidden Transfer For Lossless Large Language Model Acceleration Wu Pengfei, Liu Jiahao, Gong Zhuocheng, Wang Qifan, Li Jinpeng, Wang Jingang, Cai Xunliang, Zhao Dongyan
- Motionllm: Multimodal Motion-language Learning With Large Language Models Wu Qi, Zhao Yubo, Wang Yifan, Tai Yu-wing, Tang Chi-keung
- GUI Action Narrator: Where And When Did That Action Take Place? Wu Qinchen, Gao Difei, Lin Kevin Qinghong, Wu Zhuoyu, Guo Xiangwu, Li Peiran, Zhang Weichen, Wang Hengxu, Shou Mike Zheng
- Videollm-mod: Efficient Video-language Streaming With Mixture-of-depths Vision Computation Wu Shiwei, Chen Joya, Lin Kevin Qinghong, Wang Qimeng, Gao Yan, Xu Qianli, Xu Tong, Hu Yao, Chen Enhong, Shou Mike Zheng
- Semantic Alignment For Multimodal Large Language Models Wu Tao, Li Mengze, Chen Jingyuan, Ji Wei, Lin Wang, Gao Jinyang, Kuang Kun, Zhao Zhou, Wu Fei
- Never Miss A Beat: An Efficient Recipe For Context Window Extension Of Large Language Models With Consistent "middle" Enhancement Wu Tong, Zhao Yanpeng, Zheng Zilong
- Continual Learning For Large Language Models: A Survey Wu Tongtong, Luo Linhao, Li Yuan-fang, Pan Shirui, Vu Thuy-trang, Haffari Gholamreza
- Do Language Models Plan Ahead For Future Tokens? Wu Wilson, Morris John X., Levine Lionel
- Transformer-based Causal Language Models Perform Clustering Wu Xinbo, Varshney Lav R.
- Passage-specific Prompt Tuning For Passage Reranking In Question Answering With Large Language Models Wu Xuyang, Peng Zhiyuan, Sai Krishna Sravanthi Rajanala, Wu Hsin-tai, Fang Yi
- VILA-U: A Unified Foundation Model Integrating Visual Understanding And Generation Wu Yecheng, Zhang Zhuoyang, Chen Junyu, Tang Haotian, Li Dacheng, Fang Yunhao, Zhu Ligeng, Xie Enze, Yin Hongxu, Yi Li, Han Song, Lu Yao
- Difflora: Generating Personalized Low-rank Adaptation Weights With Diffusion Wu Yujia, Shi Yiming, Wei Jiwei, Sun Chengwei, Zhou Yuyang, Yang Yang, Shen Heng Tao
- Inversecoder: Unleashing The Power Of Instruction-tuned Code Llms With Inverse-instruct Wu Yutong, Huang Di, Shi Wenxuan, Wang Wei, Gao Lingzhe, Liu Shihao, Nan Ziyuan, Yuan Kaizhao, Zhang Rui, Zhang Xishan, Du Zidong, Guo Qi, Pu Yewen, Yin Dawei, Hu Xing, Chen Yunji
- Imitating Language Via Scalable Inverse Reinforcement Learning Wulfmeier Markus, Bloesch Michael, Vieillard Nino, Ahuja Arun, Bornschein Jorg, Huang Sandy, Sokolov Artem, Barnes Matt, Desjardins Guillaume, Bewley Alex, Bechtle Sarah Maria Elisabeth, Springenberg Jost Tobias, Momchev Nikola, Bachem Olivier, Geist Matthieu, Riedmiller Martin
- In-context Learning Can Re-learn Forbidden Tasks Xhonneux Sophie, Dobre David, Tang Jian, Gidel Gauthier, Sridhar Dhanya
- A Practice Of Post-training On Llama-3 70B With Optimal Selection Of Additional Language Mixture Ratio Xi Ningyuan, Wu Yetao, Fan Kun, Chen Teng, Gu Qingqing, Yu Peng, Qu Jinxian, Liu Chenxi, Jiang Zhonglin, Chen Yong, Ji Luo
- Unlocking Efficiency In Large Language Model Inference: A Comprehensive Survey Of Speculative Decoding Xia Heming, Yang Zhe, Dong Qingxiu, Wang Peiyi, Li Yongqi, Ge Tao, Liu Tianyu, Li Wenjie, Sui Zhifang
- What To Do If Language Models Disagree? Black-box Model Ensembling For Textual And Visual Question Answering Xia Yuxi, Zaporojets Klim, Roth Benjamin
- Pandora: Towards General World Model With Natural Language Actions And Video States Xiang Jiannan, Liu Guangyi, Gu Yi, Gao Qiyue, Ning Yuting, Zha Yuheng, Feng Zeyu, Tao Tianhua, Hao Shibo, Shi Yemin, Liu Zhengzhong, Xing Eric P., Hu Zhiting
- Clover-2: Accurate Inference For Regressive Lightweight Speculative Decoding Xiao Bin, Gui Lujun, Su Lei, Chen Weipeng
- Infllm: Training-free Long-context Extrapolation For Llms With An Efficient Context Memory Xiao Chaojun, Zhang Pengle, Han Xu, Xiao Guangxuan, Lin Yankai, Zhang Zhengyan, Liu Zhiyuan, Sun Maosong
- Rar-b: Reasoning As Retrieval Benchmark Xiao Chenghao, Hudson G Thomas, Moubayed Noura Al
- EMP: Enhance Memory In Data Pruning Xiao Jinying, Li Ping, Nie Jie, Tang Zhe
- Seeing The Image: Prioritizing Visual Correlation By Contrastive Alignment Xiao Xin, Wu Bohong, Wang Jiacong, Li Chunyuan, Zhou Xun, Guo Haoyuan
- Emovit: Revolutionizing Emotion Insights With Visual Instruction Tuning Xie Hongxia, Peng Chu-jun, Tseng Yu-wen, Chen Hung-jen, Hsu Chan-feng, Shuai Hong-han, Cheng Wen-huang
- Show-o: One Single Transformer To Unify Multimodal Understanding And Generation Xie Jinheng, Mao Weijia, Bai Zechen, Zhang David Junhao, Wang Weihao, Lin Kevin Qinghong, Gu Yuchao, Chen Zhijie, Yang Zhenheng, Shou Mike Zheng
- Non-instructional Fine-tuning: Enabling Instruction-following Capabilities In Pre-trained Language Models Without Instruction-following Data Xie Juncheng, Syu Shensian, Lee Hung-yi
- Autoad-zero: A Training-free Framework For Zero-shot Audio Description Xie Junyu, Han Tengda, Bain Max, Nagrani Arsha, Varol Gül, Xie Weidi, Zisserman Andrew
- Recall: Membership Inference Via Relative Conditional Log-likelihoods Xie Roy, Wang Junlin, Huang Ruomin, Zhang Minxing, Ge Rong, Pei Jian, Gong Neil Zhenqiang, Dhingra Bhuwan
- Monte Carlo Tree Search Boosts Reasoning Via Iterative Preference Learning Xie Yuxi, Goyal Anirudh, Zheng Wenyue, Kan Min-yen, Lillicrap Timothy P., Kawaguchi Kenji, Shieh Michael
- RALL-E: Robust Codec Language Modeling With Chain-of-thought Prompting For Text-to-speech Synthesis Xin Detai, Tan Xu, Shen Kai, Ju Zeqian, Yang Dongchao, Wang Yuancheng, Takamichi Shinnosuke, Saruwatari Hiroshi, Liu Shujie, Li Jinyu, Zhao Sheng
- Theoretical Understanding Of In-context Learning In Shallow Transformers With Unstructured Data Xing Yue, Lin Xiaofeng, Xu Chenheng, Suh Namjoon, Song Qifan, Cheng Guang
- Improving Retrieval-augmented Generation In Medicine With Iterative Follow-up Questions Xiong Guangzhi, Jin Qiao, Wang Xiao, Zhang Minjia, Lu Zhiyong, Zhang Aidong
- Large Language Models Can Learn Temporal Reasoning Xiong Siheng, Payani Ali, Kompella Ramana, Fekri Faramarz
- Building Math Agents With Multi-turn Iterative Preference Learning Xiong Wei, Shi Chengshuai, Shen Jiaming, Rosenberg Aviv, Qin Zhen, Calandriello Daniele, Khalman Misha, Joshi Rishabh, Piot Bilal, Saleh Mohammad, Jin Chi, Zhang Tong, Liu Tianqi
- Empowering 1000 Tokens/second On-device LLM Prefilling With Mllm-npu Xu Daliang, Zhang Hao, Yang Liming, Liu Ruiqi, Huang Gang, Xu Mengwei, Liu Xuanzhe
- Interactive Evolution: A Neural-symbolic Self-training Framework For Large Language Models Xu Fangzhi, Sun Qiushi, Cheng Kanzhi, Liu Jun, Qiao Yu, Wu Zhiyong
- Large Language Models For Education: A Survey Xu Hanyi, Gan Wensheng, Qi Zhenlian, Wu Jiayang, Yu Philip S.
- Contrastive Preference Optimization: Pushing The Boundaries Of LLM Performance In Machine Translation Xu Haoran, Sharaf Amr, Chen Yunmo, Tan Weiting, Shen Lingfeng, Van Durme Benjamin, Murray Kenton, Kim Young Jin
- Meteora: Multiple-tasks Embedded Lora For Large Language Models Xu Jingwei, Lai Junyu, Huang Yunpeng
- From Introspection To Best Practices: Principled Analysis Of Demonstrations In Multimodal In-context Learning Xu Nan, Wang Fei, Zhang Sheng, Poon Hoifung, Chen Muhao
- BESA: Pruning Large Language Models With Blockwise Parameter-efficient Sparsity Allocation Xu Peng, Shao Wenqi, Chen Mengzhao, Tang Shitao, Zhang Kaipeng, Gao Peng, An Fengwei, Qiao Yu, Luo Ping
- Course-correction: Safety Alignment Using Synthetic Preferences Xu Rongwu, Cai Yishuo, Zhou Zhenhong, Gu Renjie, Weng Haiqin, Liu Yan, Zhang Tianwei, Xu Wei, Qiu Han
- Is DPO Superior To PPO For LLM Alignment? A Comprehensive Study Xu Shusheng, Fu Wei, Gao Jiaxuan, Ye Wenjie, Liu Weilin, Mei Zhiyu, Wang Guangju, Yu Chao, Wu Yi
- Sayself: Teaching Llms To Express Confidence With Self-reflective Rationales Xu Tianyang, Wu Shujin, Diao Shizhe, Liu Xiaoze, Wang Xingyao, Chen Yangyi, Gao Jing
- Chatglm-math: Improving Math Problem-solving In Large Language Models With A Self-critique Pipeline Xu Yifan, Liu Xiao, Liu Xinghan, Hou Zhenyu, Li Yueyan, Zhang Xiaohan, Wang Zihan, Zeng Aohan, Du Zhengxiao, Zhao Wenyi, Tang Jie, Dong Yuxiao
- Linkprompt: Natural And Universal Adversarial Attacks On Prompt-based Language Models Xu Yue, Wang Wenjie
- Think: Thinner Key Cache By Query-driven Pruning Xu Yuhui, Jie Zhanming, Dong Hanze, Wang Lei, Lu Xudong, Zhou Aojun, Saha Amrita, Xiong Caiming, Sahoo Doyen
- Magpie: Alignment Data Synthesis From Scratch By Prompting Aligned Llms With Nothing Xu Zhangchen, Jiang Fengqing, Niu Luyao, Deng Yuntian, Poovendran Radha, Choi Yejin, Lin Bill Yuchen
- Vision-flan: Scaling Human-labeled Tasks In Visual Instruction Tuning Xu Zhiyang, Feng Chao, Shao Rulin, Ashby Trevor, Shen Ying, Jin Di, Cheng Yu, Wang Qifan, Huang Lifu
- Do Large Language Models Have Compositional Ability? An Investigation Into Limitations And Scalability Xu Zhuoyan, Shi Zhenmei, Liang Yingyu
- Longvila: Scaling Long-context Visual Language Models For Long Videos Xue Fuzhao, Chen Yukang, Li Dacheng, Hu Qinghao, Zhu Ligeng, Li Xiuyu, Fang Yunhao, Tang Haotian, Yang Shang, Liu Zhijian, He Ethan, Yin Hongxu, Molchanov Pavlo, Kautz Jan, Fan Linxi, Zhu Yuke, Lu Yao, Han Song
- Strengthened Symbol Binding Makes Large Language Models Reliable Multiple-choice Selectors Xue Mengge, Hu Zhenyu, Liu Liqun, Liao Kuo, Li Shuang, Han Honglin, Zhao Meng, Yin Chengguo
- Fine-tuning Large Language Models For Automated Diagnostic Screening Summaries Yadav Manjeet, Sahu Nilesh Kumar, Chaturvedi Mudita, Gupta Snehil, Lone Haroon R
- Medlm: Exploring Language Models For Medical Question Answering Systems Yagnik Niraj, Jhaveri Jay, Sharma Vivek, Pila Gabriel
- LTNER: Large Language Model Tagging For Named Entity Recognition With Contextualized Entity Marking Yan Faren, Yu Peng, Chen Xin
- Predicting The Big Five Personality Traits In Chinese Counselling Dialogues Using Large Language Models Yan Yang, Ma Lizhi, Li Anqi, Ma Jingsong, Lan Zhenzhong
- Exploring The LLM Journey From Cognition To Expression With Linear Representations Yan Yuzi, Li Jialian, Zhang Yipin, Yan Dong
- Equipping Transformer With Random-access Reading For Long-context Understanding Yang Chenghao, Yang Zi, Hua Nan
- 3D Vision And Language Pretraining With Large-scale Synthetic Data Yang Dejie, Xu Zhu, Mo Wentao, Chen Qingchao, Huang Siyuan, Liu Yang
- IM-RAG: Multi-round Retrieval-augmented Generation Through Learning Inner Monologues Yang Diji, Rao Jinmeng, Chen Kezhen, Guo Xiaoyuan, Zhang Yawen, Yang Jie, Zhang Yi
- Improving Factuality In Large Language Models Via Decoding-time Hallucinatory And Truthful Comparators Yang Dingkang, Xiao Dongling, Wei Jinjie, Li Mingcheng, Chen Zhaoyu, Li Ke, Zhang Lihua
- Unveiling The Generalization Power Of Fine-tuned Large Language Models Yang Haoran, Zhang Yumeng, Xu Jiaqi, Lu Hongyuan, Heng Pheng Ann, Lam Wai
- \(\text{memory}^3\): Language Modeling With Explicit Memory Yang Hongkang, Lin Zehao, Wang Wenjin, Wu Hao, Li Zhiyu, Tang Bo, Wei Wenqiang, Wang Jinbo, Tang Zeyun, Song Shichao, Xi Chenyang, Yu Yu, Chen Kai, Xiong Feiyu, Tang Linpeng, E Weinan
- Aligning Llms Through Multi-perspective User Preference Ranking-based Feedback For Programming Question Answering Yang Hongyu, He Liyang, Hou Min, Shen Shuanghong, Li Rui, Hou Jiahui, Ma Jianhui, Zhao Junda
- Asynchronous And Segmented Bidirectional Encoding For NMT Yang Jingpu, Han Zehua, Xiang Mengyu, Wang Helin, Huang Yuxiao, Fang Miao
- Mllmreid: Multimodal Large Language Model-based Person Re-identification Yang Shan, Zhang Yongfei
- GPT-4 As Evaluator: Evaluating Large Language Models On Pest Management In Agriculture Yang Shanglong, Yuan Zhipeng, Li Shunbao, Peng Ruoling, Liu Kang, Yang Po
- Video As The New Language For Real-world Decision Making Yang Sherry, Walker Jacob, Parker-holder Jack, Du Yilun, Bruce Jake, Barreto Andre, Abbeel Pieter, Schuurmans Dale
- Moral: Moe Augmented Lora For Llms' Lifelong Learning Yang Shu, Ali Muhammad Asif, Wang Cheng-long, Hu Lijie, Wang Di
- Seed-story: Multimodal Long Story Generation With Large Language Model Yang Shuai, Ge Yuying, Li Yang, Chen Yukang, Ge Yixiao, Shan Ying, Chen Yingcong
- Is Crowdsourcing Breaking Your Bank? Cost-effective Fine-tuning Of Pre-trained Language Models With Proximal Policy Optimization Yang Shuo, Kasneci Gjergji
- Parallelizing Linear Transformers With The Delta Rule Over Sequence Length Yang Songlin, Wang Bailin, Zhang Yu, Shen Yikang, Kim Yoon
- Corda: Context-oriented Decomposition Adaptation Of Large Language Models Yang Yibo, Li Xiaojie, Zhou Zhongzhu, Song Shuaiwen Leon, Wu Jianlong, Nie Liqiang, Ghanem Bernard
- Adazeta: Adaptive Zeroth-order Tensor-train Adaption For Memory-efficient Large Language Models Fine-tuning Yang Yifan, Zhen Kai, Banijamal Ershad, Mouchtaris Athanasios, Zhang Zheng
- Loretta: Low-rank Economic Tensor-train Adaptation For Ultra-low-parameter Fine-tuning Of Large Language Models Yang Yifan, Zhou Jiajun, Wong Ngai, Zhang Zheng
- Guardt2i: Defending Text-to-image Models From Adversarial Prompts Yang Yijun, Gao Ruiyuan, Yang Xiao, Zhong Jianyuan, Xu Qiang
- P3: A Policy-driven, Pace-adaptive, And Diversity-promoted Framework For Optimizing LLM Training Yang Yingxuan, Wang Huayi, Wen Muning, Zhang Weinan
- Pre-trained Language Models Improve The Few-shot Prompt Ability Of Decision Transformer Yang Yu, Xu Pan
- Weak-to-strong Reasoning Yang Yuqing, Ma Yan, Liu Pengfei
- Fine-tuning Large Language Models With Human-inspired Learning Strategies In Medical Question Answering Yang Yushi, Bean Andrew M., Mccraith Robert, Mahdi Adam
- Assessing Adversarial Robustness Of Large Language Models: An Empirical Study Yang Zeyu, Meng Zhao, Zheng Xiaochen, Wattenhofer Roger
- Self-distillation Bridges Distribution Gap In Language Model Fine-tuning Yang Zhaorui, Pang Tianyu, Feng Haozhe, Wang Han, Chen Wei, Zhu Minfeng, Liu Qian
- Adversarial Attacks And Defense For Conversation Entailment Task Yang Zhenning, Krawec Ryan, Wu Liang-yuan
- React Meets Actre: When Language Agents Enjoy Training Data Autonomy Yang Zonghan, Li Peng, Yan Ming, Zhang Ji, Huang Fei, Liu Yang
- Curiousllm: Elevating Multi-document QA With Reasoning-infused Knowledge Graph Prompting Yang Zukang, Zhu Zixuan
- Exploiting Inter-layer Expert Affinity For Accelerating Mixture-of-experts Model Inference Yao Jinghan, Anthony Quentin, Shafi Aamir, Subramoni Hari, Panda Dhabaleswar K.
- Training Ultra Long Context Language Model With Fully Pipelined Distributed Transformer Yao Jinghan, Jacobs Sam Ade, Tanaka Masahiro, Ruwase Olatunji, Shafi Aamir, Subramoni Hari, Panda Dhabaleswar K.
- Enhancing In-context Learning Performance With Just Svd-based Weight Pruning: A Theoretical Perspective Yao Xinhao, Hu Xiaolin, Yang Shenzhi, Liu Yong
- Sirllm: Streaming Infinite Retentive LLM Yao Yao, Li Zuchao, Zhao Hai
- Open-domain Implicit Format Control For Large Language Model Generation Yao Yiqun, Ma Wenjia, Fang Xuezhi, Jiang Xin, Li Xiang, Meng Xuying, Han Peng, Li Jing, Sun Aixin, Wang Yequan
- Efficient Vision-and-language Pre-training With Text-relevant Image Patch Selection Ye Wei, Jiang Chaoya, Xu Haiyang, Ye Chenhao, Li Chenliang, Yan Ming, Zhang Shikun, Huang Songhang, Huang Fei
- Harnessing Multimodal Large Language Models For Multimodal Sequential Recommendation Ye Yuyang, Zheng Zhi, Shen Yishan, Wang Tianshu, Zhang Hengruo, Zhu Peijun, Yu Runlong, Zhang Kai, Xiong Hui
- BEAF: Observing Before-after Changes To Evaluate Hallucination In Vision-language Models Moon Ye-bin, Nam Hyeon-woo, Choi Wonseok, Oh Tae-hyun
- Long-context Language Modeling With Parallel Context Encoding Yen Howard, Gao Tianyu, Chen Danqi
- Generation Meets Verification: Accelerating Large Language Model Inference With Smart Parallel Auto-correct Decoding Yi Hanling, Lin Feng, Li Hongbin, Ning Peiyang, Yu Xiaotian, Xiao Rong
- MEND: Meta Demonstration Distillation For Efficient And Effective In-context Learning Yichuan Li, Xiyao Ma, Sixing Lu, Kyumin Lee, Xiaohu Liu, Chenlei Guo
- Characterizing Truthfulness In Large Language Model Generations With Local Intrinsic Dimension Yin Fan, Srinivasa Jayanth, Chang Kai-wei
- Lofit: Localized Fine-tuning On LLM Representations Yin Fangcong, Ye Xi, Durrett Greg
- Stablemask: Refining Causal Masking In Decoder-only Transformer Yin Qingyu, He Xuzheng, Zhuang Xiang, Zhao Yu, Yao Jianhua, Shen Xiaoyu, Zhang Qiang
- SEA: Supervised Embedding Alignment For Token-level Visual-textual Integration In Mllms Yin Yuanyang, Zhao Yaqi, Zhang Yajie, Lin Ke, Wang Jiahao, Tao Xin, Wan Pengfei, Zhang Di, Yin Baoqun, Zhang Wentao
- Vqattack: Transferable Adversarial Attacks On Visual Question Answering Via Pre-trained Models Yin Ziyi, Ye Muchao, Zhang Tianrong, Wang Jiaqi, Liu Han, Chen Jinghui, Wang Ting, Ma Fenglong
- Internlm-math: Open Math Large Language Models Toward Verifiable Reasoning Ying Huaiyuan, Zhang Shuo, Li Linyang, Zhou Zhejian, Shao Yunfan, Fei Zhaoye, Ma Yichuan, Hong Jiawei, Liu Kuikun, Wang Ziyi, Wang Yudong, Wu Zijian, Li Shuaibin, Zhou Fengzhe, Liu Hongwei, Zhang Songyang, Zhang Wenwei, Yan Hang, Qiu Xipeng, Wang Jiayu, Chen Kai, Lin Dahua
- Llms-as-instructors: Learning From Errors Toward Automating Model Improvement Ying Jiahao, Lin Mingbao, Cao Yixin, Tang Wei, Wang Bo, Sun Qianru, Huang Xuanjing, Yan Shuicheng
- Safety Fine-tuning At (almost) No Cost: A Baseline For Vision Large Language Models Yongshuo Zong, Ondrej Bohdal, Tingyang Yu, Yongxin Yang, Timothy Hospedales
- TLCR: Token-level Continuous Reward For Fine-grained Reinforcement Learning From Human Feedback Yoon Eunseop, Yoon Hee Suk, Eom Soohwan, Han Gunsoo, Nam Daniel Wontae, Jo Daejin, On Kyoung-woon, Hasegawa-johnson Mark A., Kim Sungwoong, Yoo Chang D.
- C-TPT: Calibrated Test-time Prompt Tuning For Vision-language Models Via Text Feature Dispersion Yoon Hee Suk, Yoon Eunseop, Tee Joshua Tian Jin, Hasegawa-johnson Mark, Li Yingzhen, Yoo Chang D.
- Making The Most Of Your Model: Methods For Finetuning And Applying Pretrained Transformers Yoshida Davis
- Tree-planted Transformers: Unidirectional Transformer Language Models With Implicit Syntactic Supervision Yoshida Ryo, Someya Taiga, Oseki Yohei
- When Linear Attention Meets Autoregressive Decoding: Towards More Effective And Efficient Linearized Large Language Models You Haoran, Fu Yichao, Wang Zheng, Yazdanbakhsh Amir, Lin Yingyan Celine
- Shiftaddllm: Accelerating Pretrained Llms Via Post-training Multiplication-less Reparameterization You Haoran, Guo Yipin, Fu Yichao, Zhou Wei, Shi Huihong, Zhang Xiaofan, Kundu Souvik, Yazdanbakhsh Amir, Lin Yingyan Celine
- Flow Of Reasoning: Efficient Training Of LLM Policy With Divergent Thinking Yu Fangxu, Jiang Lai, Kang Haoqiang, Hao Shibo, Qin Lianhui
- Iterative Graph Alignment Yu Fangyuan, Arora Hardeep Singh, Johnson Matt
- Enhancing Healthcare Through Large Language Models: A Study On Medical Question Answering Yu Haoran, Yu Chang, Wang Zihan, Zou Dongxian, Qin Hao
- Rankrag: Unifying Context Ranking With Retrieval-augmented Generation In Llms Yu Yue, Ping Wei, Liu Zihan, Wang Boxin, You Jiaxuan, Zhang Chao, Shoeybi Mohammad, Catanzaro Bryan
- Kieval: A Knowledge-grounded Interactive Evaluation Framework For Large Language Models Yu Zhuohao, Gao Chang, Yao Wenjin, Wang Yidong, Ye Wei, Wang Jindong, Xie Xing, Zhang Yue, Zhang Shikun
- Predicting Rewards Alongside Tokens: Non-disruptive Parameter Insertion For Efficient Inference Intervention In Large Language Model Yuan Chenhan, Huang Fei, Peng Ru, Lu Keming, Yu Bowen, Zhou Chang, Zhou Jingren
- Remamba: Equip Mamba With Effective Long-sequence Modeling Yuan Danlong, Liu Jiahao, Li Bei, Zhang Huishuai, Wang Jingang, Cai Xunliang, Zhao Dongyan
- Towards A Holistic Evaluation Of Llms On Factual Knowledge Recall Yuan Jiaqing, Pan Lin, Hang Chung-wei, Guo Jiang, Jiang Jiarong, Min Bonan, Ng Patrick, Wang Zhiguo
- KV Cache Compression, But What Must We Give In Return? A Comprehensive Benchmark Of Long Context Capable Approaches Yuan Jiayi, Liu Hongyi, Zhong Shaochen Henry, Chuang Yu-neng, Li Songchen, Wang Guanchu, Le Duy, Jin Hongye, Chaudhary Vipin, Xu Zhaozhuo, Liu Zirui, Hu Xia
- Advancing LLM Reasoning Generalists With Preference Trees Yuan Lifan, Cui Ganqu, Wang Hanbin, Ding Ning, Wang Xingyao, Deng Jia, Shan Boji, Chen Huimin, Xie Ruobing, Lin Yankai, Liu Zhenghao, Zhou Bowen, Peng Hao, Liu Zhiyuan, Sun Maosong
- Why Lift So Heavy? Slimming Large Language Models By Cutting Off The Layers Yuan Shuzhou, Nie Ercong, Ma Bolei, Färber Michael
- Self-rewarding Language Models Yuan Weizhe, Pang Richard Yuanzhe, Cho Kyunghyun, Li Xian, Sukhbaatar Sainbayar, Xu Jing, Weston Jason
- CO3: Low-resource Contrastive Co-training For Generative Conversational Query Rewrite Yuan Yifei, Shi Chen, Wang Runze, Chen Liyi, Hu Renjun, Zhang Zengming, Jiang Feijun, Lam Wai
- Tcmbench: A Comprehensive Benchmark For Evaluating Large Language Models In Traditional Chinese Medicine Yue Wenjing, Wang Xiaoling, Zhu Wei, Guan Ming, Zheng Huanran, Wang Pengfei, Sun Changzhi, Ma Xin
- Mammoth2: Scaling Instructions From The Web Yue Xiang, Zheng Tuney, Zhang Ge, Chen Wenhu
- Focus On The Core: Efficient Attention Via Pruned Token Compression For Document Classification Yun Jungmin, Kim Mihyeon, Kim Youngbin
- Toward Inference-optimal Mixture-of-expert Large Language Models Yun Longfei, Zhuang Yonghao, Fu Yao, Xing Eric P, Zhang Hao
- Investigating Continual Pretraining In Large Language Models: Insights And Implications Yıldız Çağatay, Ravichandran Nishaanth Kanna, Punia Prishruit, Bethge Matthias, Ermis Beyza
- Building Accurate Translation-tailored Llms With Language Aware Instruction Tuning Zan Changtong, Ding Liang, Shen Li, Zhen Yibing, Liu Weifeng, Tao Dacheng
- Understanding And Mitigating Compositional Issues In Text-to-image Generative Models Zarei Arman, Rezaei Keivan, Basu Samyadeep, Saberi Mehrdad, Moayeri Mazda, Kattakinda Priyatham, Feizi Soheil
- Tell Me Why: Explainable Public Health Fact-checking With Large Language Models Zarharan Majid, Wullschleger Pascal, Kia Babak Behkam, Pilehvar Mohammad Taher, Foster Jennifer
- ABQ-LLM: Arbitrary-bit Quantized Inference Acceleration For Large Language Models Zeng Chao, Liu Songwei, Xie Yusheng, Liu Hong, Wang Xiaojian, Wei Miao, Yang Shu, Chen Fangmin, Mei Xing
- Token-level Direct Preference Optimization Zeng Yongcheng, Liu Guoqing, Ma Weiyu, Yang Ning, Zhang Haifeng, Wang Jun
- Can Mllms Perform Text-to-image In-context Learning? Zeng Yuchen, Kang Wonjun, Chen Yicong, Koo Hyung Il, Lee Kangwook
- Actions Speak Louder Than Words: Trillion-parameter Sequential Transducers For Generative Recommendations Zhai Jiaqi, Liao Lucy, Liu Xing, Wang Yueming, Li Rui, Cao Xuan, Gao Leon, Gong Zhaojie, Gu Fangda, He Michael, Lu Yinghai, Shi Yu
- Prefix Text As A Yarn: Eliciting Non-english Alignment In Foundation Language Model Zhan Runzhe, Yang Xinyi, Wong Derek F., Chao Lidia S., Zhang Yue
- When Scaling Meets LLM Finetuning: The Effect Of Data, Model And Finetuning Method Zhang Biao, Liu Zhongtao, Cherry Colin, Firat Orhan
- Distilling Implicit Multimodal Knowledge Into Llms For Zero-resource Dialogue Generation Zhang Bo, Ma Hui, Ding Jian, Wang Jian, Xu Bo, Lin Hongfei
- Simple Techniques For Enhancing Sentence Embeddings In Generative Language Models Zhang Bowen, Chang Kehua, Li Chunping
- Ts-align: A Teacher-student Collaborative Framework For Scalable Iterative Finetuning Of Large Language Models Zhang Chen, Tang Chengguang, Chong Dading, Shi Ke, Tang Guohua, Jiang Feng, Li Haizhou
- Sciglm: Training Scientific Language Models With Self-reflective Instruction Annotation And Tuning Zhang Dan, Hu Ziniu, Zhoubian Sining, Du Zhengxiao, Yang Kaiyu, Wang Zihan, Yue Yisong, Dong Yuxiao, Tang Jie
- Rest-mcts*: LLM Self-training Via Process Reward Guided Tree Search Zhang Dan, Zhoubian Sining, Hu Ziniu, Yue Yisong, Dong Yuxiao, Tang Jie
- Code Representation Learning At Scale Zhang Dejiao, Ahmad Wasi, Tan Ming, Ding Hantian, Nallapati Ramesh, Roth Dan, Ma Xiaofei, Xiang Bing
- Speechgpt-gen: Scaling Chain-of-information Speech Generation Zhang Dong, Zhang Xin, Zhan Jun, Li Shimin, Zhou Yaqian, Qiu Xipeng
- Unforgettable Generalization In Language Models Zhang Eric, Choshen Leshem, Andreas Jacob
- Closing The Gap Between Open-source And Commercial Large Language Models For Medical Evidence Summarization Zhang Gongbo, Jin Qiao, Zhou Yiliang, Wang Song, Idnay Betina R., Luo Yiming, Park Elizabeth, Nestor Jordan G., Spotnitz Matthew E., Soroush Ali, Campion Thomas, Lu Zhiyong, Weng Chunhua, Peng Yifan
- Sinklora: Enhanced Efficiency And Chat Capabilities For Long-context Large Language Models Zhang Hengyu
- Balancing Speciality And Versatility: A Coarse To Fine Framework For Supervised Fine-tuning Large Language Model Zhang Hengyuan, Wu Yanru, Li Dawei, Yang Sak, Zhao Rui, Jiang Yong, Tan Fei
- Kb-plugin: A Plug-and-play Framework For Large Language Models To Induce Programs Over Low-resourced Knowledge Bases Zhang Jiajie, Cao Shulin, Hu Linmei, Feng Ling, Hou Lei, Li Juanzi
- Regurgitative Training: The Value Of Real Data In Training Large Language Models Zhang Jinghui, Qiao Dandan, Yang Mochen, Wei Qiang
- Onegen: Efficient One-pass Unified Generation And Retrieval For Llms Zhang Jintian, Peng Cheng, Sun Mengshu, Chen Xiang, Liang Lei, Zhang Zhiqiang, Zhou Jun, Chen Huajun, Zhang Ningyu
- Android In The Zoo: Chain-of-action-thought For GUI Agents Zhang Jiwen, Wu Jihao, Teng Yihua, Liao Minghui, Xu Nuo, Xiao Xiao, Wei Zhongyu, Tang Duyu
- Unimot: Unified Molecule-text Language Model With Discrete Token Representation Zhang Juzheng, Bian Yatao, Chen Yongqiang, Yao Quanming
- Fast And Slow Generating: An Empirical Study On Large And Small Language Models Collaborative Decoding Zhang Kaiyan, Wang Jianyu, Ding Ning, Qi Biqing, Hua Ermo, Lv Xingtai, Zhou Bowen
- The Hedgehog & The Porcupine: Expressive Linear Attentions With Softmax Mimicry Zhang Michael, Bhatia Kush, Kumbong Hermann, Ré Christopher
- Transfertod: A Generalizable Chinese Multi-domain Task-oriented Dialogue System With Transfer Capabilities Zhang Ming, Huang Caishuang, Wu Yilong, Liu Shichun, Zheng Huiyuan, Dong Yurui, Shen Yujiong, Dou Shihan, Zhao Jun, Ye Junjie, Zhang Qi, Gui Tao, Huang Xuanjing
- Soaring From 4K To 400K: Extending LLM's Context With Activation Beacon Zhang Peitian, Liu Zheng, Xiao Shitao, Shao Ninglu, Ye Qiwei, Dou Zhicheng
- Extending Llama-3's Context Ten-fold Overnight Zhang Peitian, Shao Ninglu, Liu Zheng, Xiao Shitao, Qian Hongjin, Ye Qiwei, Dou Zhicheng
- Look Ahead Or Look Around? A Theoretical Comparison Between Autoregressive And Masked Pretraining Zhang Qi, Du Tianqi, Huang Haotian, Wang Yifei, Wang Yisen
- VAR-CLIP: Text-to-image Generator With Visual Auto-regressive Modeling Zhang Qian, Dai Xiangzi, Yang Ninghua, An Xiang, Feng Ziyong, Ren Xingyu
- Improving Diversity Of Commonsense Generation By Large Language Models Via In-context Learning Zhang Tianhui, Peng Bei, Bollegala Danushka
- RAFT: Adapting Language Model To Domain Specific RAG Zhang Tianjun, Patil Shishir G., Jain Naman, Shen Sheng, Zaharia Matei, Stoica Ion, Gonzalez Joseph E.
- Promptfix: Few-shot Backdoor Removal Via Adversarial Prompt Tuning Zhang Tianrong, Xi Zhaohan, Wang Ting, Mitra Prasenjit, Chen Jinghui
- Interpreting And Improving Large Language Models In Arithmetic Calculation Zhang Wei, Wan Chaoqun, Zhang Yonggang, Cheung Yiu-ming, Tian Xinmei, Shen Xu, Ye Jieping
- Defining Boundaries: A Spectrum Of Task Feasibility For Large Language Models Zhang Wenbo, Xu Zihang, Cai Hengrui
- Agent-pro: Learning To Evolve Via Policy-level Reflection And Optimization Zhang Wenqi, Tang Ke, Wu Hai, Wang Mengna, Shen Yongliang, Hou Guiyang, Tan Zeqi, Li Peng, Zhuang Yueting, Lu Weiming
- Laser: Parameter-efficient LLM Bi-tuning For Sequential Recommendation With Collaborative Information Zhang Xinyu, Hu Linmei, Zhang Luhao, Song Dandan, Huang Heyan, Nie Liqiang
- Chain Of Preference Optimization: Improving Chain-of-thought Reasoning In Llms Zhang Xuan, Du Chao, Pang Tianyu, Liu Qian, Gao Wei, Lin Min
- Finercut: Finer-grained Interpretable Layer Pruning For Large Language Models Zhang Yang, Li Yawei, Wang Xinpeng, Shen Qianli, Plank Barbara, Bischl Bernd, Rezaei Mina, Kawaguchi Kenji
- Pushing The Limit Of LLM Capacity For Text Classification Zhang Yazhou, Wang Mengyao, Ren Chenyu, Li Qiuchi, Tiwari Prayag, Wang Benyou, Qin Jing
- Rankclip: Ranking-consistent Language-image Pretraining Zhang Yiming, Zhao Zhuokai, Chen Zhaorun, Feng Zhili, Ding Zenghui, Sun Yining
- Mg-verilog: Multi-grained Dataset Towards Enhanced Llm-assisted Verilog Generation Zhang Yongan, Yu Zhongzhi, Fu Yonggan, Wan Cheng, Lin Yingyan Celine
- OT-VP: Optimal Transport-guided Visual Prompting For Test-time Adaptation Zhang Yunbei, Mehra Akshay, Hamm Jihun
- Small Language Models Need Strong Verifiers To Self-correct Reasoning Zhang Yunxiang, Khalifa Muhammad, Logeswaran Lajanugen, Kim Jaekyeom, Lee Moontae, Lee Honglak, Wang Lu
- Adam-mini: Use Fewer Learning Rates To Gain More Zhang Yushun, Chen Congliang, Li Ziniu, Ding Tian, Wu Chenwei, Ye Yinyu, Luo Zhi-quan, Sun Ruoyu
- Understanding The Relationship Between Prompts And Response Uncertainty In Large Language Models Zhang Ze Yu, Verma Arun, Doshi-velez Finale, Low Bryan Kian Hsiang
- Position-aware Parameter Efficient Fine-tuning Approach For Reducing Positional Bias In Llms Zhang Zheng, Yang Fan, Jiang Ziyan, Chen Zheng, Zhao Zhengyang, Ma Chengyuan, Zhao Liang, Liu Yang
- Found In The Middle: How Language Models Use Long Contexts Better Via Plug-and-play Positional Encoding Zhang Zhenyu, Chen Runjin, Liu Shiwei, Yao Zhewei, Ruwase Olatunji, Chen Beidi, Wu Xiaoxia, Wang Zhangyang
- Safe Unlearning: A Surprisingly Effective And Generalizable Solution To Defend Against Jailbreak Attacks Zhang Zhexin, Yang Junxiao, Ke Pei, Cui Shiyao, Zheng Chujie, Wang Hongning, Huang Minlie
- Galla: Graph Aligned Large Language Models For Improved Source Code Understanding Zhang Ziyin, Yu Hang, Li Shijie, Di Peng, Li Jianguo, Wang Rui
- Reverse That Number! Decoding Order Matters In Arithmetic Learning Zhang-li Daniel, Lin Nianyi, Yu Jifan, Zhang Zheyuan, Yao Zijun, Zhang Xiaokang, Hou Lei, Zhang Jing, Li Juanzi
- APT: Adaptive Pruning And Tuning Pretrained Language Models For Efficient Training And Inference Zhao Bowen, Hajishirzi Hannaneh, Cao Qingqing
- LANE: Logic Alignment Of Non-tuning Large Language Models And Online Recommendation Systems For Explainable Reason Generation Zhao Hongke, Zheng Songming, Wu Likang, Yu Bowen, Wang Jing
- Revolutionizing Finance With Llms: An Overview Of Applications And Insights Zhao Huaqin, Liu Zhengliang, Wu Zihao, Li Yiwei, Yang Tianze, Shu Peng, Xu Shaochen, Dai Haixing, Zhao Lin, Mai Gengchen, Liu Ninghao, Liu Tianming
- Galore: Memory-efficient LLM Training By Gradient Low-rank Projection Zhao Jiawei, Zhang Zhenyu, Chen Beidi, Wang Zhangyang, Anandkumar Anima, Tian Yuandong
- Exploring The Compositional Deficiency Of Large Language Models In Mathematical Reasoning Zhao Jun, Tong Jingqi, Mou Yurong, Zhang Ming, Zhang Qi, Huang Xuanjing
- Llama Beyond English: An Empirical Study On Language Capability Transfer Zhao Jun, Zhang Zhihao, Gao Luhui, Zhang Qi, Gui Tao, Huang Xuanjing
- Lora Land: 310 Fine-tuned Llms That Rival GPT-4, A Technical Report Zhao Justin, Wang Timothy, Abid Wael, Angus Geoffrey, Garg Arnav, Kinnison Jeffery, Sherstinsky Alex, Molino Piero, Addair Travis, Rishi Devvret
- SLIDE: A Framework Integrating Small And Large Language Models For Open-domain Dialogues Evaluation Zhao Kun, Yang Bohao, Tang Chen, Lin Chenghua, Zhan Liang
- Longskywork: A Training Recipe For Efficiently Extending Context Length In Large Language Models Zhao Liang, Wei Tianwen, Zeng Liang, Cheng Cheng, Yang Liu, Cheng Peng, Wang Lijie, Li Chenxia, Wu Xuejie, Zhu Bo, Gan Yimeng, Hu Rui, Yan Shuicheng, Fang Han, Zhou Yahui
- Mitigating Object Hallucination In Large Vision-language Models Via Classifier-free Guidance Zhao Linxi, Deng Yihe, Zhang Weitong, Gu Quanquan
- Aqulia-med LLM: Pioneering Full-process Open-source Medical Language Models Zhao Lulu, Zeng Weihao, Shi Xiaofeng, Zhou Hua, Hao Donglin, Lin Yonghua
- Efficiently Training 7B LLM With 1 Million Sequence Length On 8 Gpus Zhao Pinxue, Zhang Hailin, Fu Fangcheng, Nie Xiaonan, Liu Qibin, Yang Fang, Peng Yuanbo, Jiao Dian, Li Shuaipeng, Xue Jinbao, Tao Yangyu, Cui Bin
- A Survey Of Backdoor Attacks And Defenses On Large Language Models: Implications For Security Measures Zhao Shuai, Jia Meihuizi, Guo Zhongliang, Gan Leilei, Xu Xiaoyu, Wu Xiaobao, Fu Jie, Feng Yichao, Pan Fengjun, Tuan Luu Anh
- Prepacking: A Simple Method For Fast Prefilling And Increased Throughput In Large Language Models Zhao Siyan, Israel Daniel, Broeck Guy Van Den, Grover Aditya
- Defending Large Language Models Against Jailbreak Attacks Via Layer-specific Editing Zhao Wei, Li Zhe, Li Yige, Zhang Ye, Sun Jun
- Ouroboros: Generating Longer Drafts Phrase By Phrase For Faster Speculative Decoding Zhao Weilin, Huang Yuxiang, Han Xu, Xu Wang, Xiao Chaojun, Zhang Xinrong, Fang Yewei, Zhang Kaihuo, Liu Zhiyuan, Sun Maosong
- Both Matter: Enhancing The Emotional Intelligence Of Large Language Models Without Compromising The General Intelligence Zhao Weixiang, Li Zhuojun, Wang Shilong, Wang Yang, Hu Yulin, Zhao Yanyan, Wei Chen, Qin Bing
- Enhancing Cross-domain Pre-trained Decision Transformers With Adaptive Attention Zhao Wenhao, Xu Qiushui, Xu Linjie, Song Lei, Wang Jinyu, Zhou Chunlai, Bian Jiang
- Wildchat: 1M Chatgpt Interaction Logs In The Wild Zhao Wenting, Ren Xiang, Hessel Jack, Cardie Claire, Choi Yejin, Deng Yuntian
- How Do Large Language Models Handle Multilingualism? Zhao Yiran, Zhang Wenxuan, Chen Guizhen, Kawaguchi Kenji, Bing Lidong
- ALISA: Accelerating Large Language Model Inference Via Sparsity-aware KV Caching Zhao Youpeng, Wu Di, Wang Jun
- An Empirical Study Of Retrieval Augmented Generation With Chain-of-thought Zhao Yuetong, Cao Hongyu, Zhao Xianyu, Ou Zhijian
- Improving The Robustness Of Large Language Models Via Consistency Alignment Zhao Yukun, Yan Lingyong, Sun Weiwei, Xing Guoliang, Wang Shuaiqiang, Meng Chong, Cheng Zhicong, Ren Zhaochun, Yin Dawei
- SWIFT: A Scalable Lightweight Infrastructure For Fine-tuning Zhao Yuze, Huang Jintao, Hu Jinghan, Wang Xingjun, Mao Yunlin, Zhang Daoze, Jiang Zeyinzi, Wu Zhikai, Ai Baole, Wang Ang, Zhou Wenmeng, Chen Yingda
- Adversarial Contrastive Decoding: Boosting Safety Alignment Of Large Language Models Via Opposite Prompt Optimization Zhao Zhengyue, Zhang Xiaoyun, Xu Kaidi, Hu Xing, Zhang Rui, Du Zidong, Guo Qi, Chen Yunji
- Loraretriever: Input-aware Lora Retrieval And Composition For Mixed Tasks In The Wild Zhao Ziyu, Gan Leilei, Wang Guoyin, Zhou Wangchunshu, Yang Hongxia, Kuang Kun, Wu Fei
- ICE-GRT: Instruction Context Enhancement By Generative Reinforcement Based Transformers Zheng Chen, Sun Ke, Tang Da, Ma Yukun, Zhang Yuyu, Xi Chenguang, Zhou Xun
- Balancing Enhancement, Harmlessness, And General Capabilities: Enhancing Conversational Llms With Direct RLHF Zheng Chen, Sun Ke, Wu Hang, Xi Chenguang, Zhou Xun
- Large Language Models As Reliable Knowledge Bases? Zheng Danna, Lapata Mirella, Pan Jeff Z.
- Fine-tuning Large Language Models For Domain-specific Machine Translation Zheng Jiawei, Hong Hanghai, Wang Xiaoli, Su Jingsong, Liang Yonggui, Wu Shikai
- Can Llms Learn New Concepts Incrementally Without Forgetting? Zheng Junhao, Qiu Shengjie, Ma Qianli
- Towards Lifelong Learning Of Large Language Models: A Survey Zheng Junhao, Qiu Shengjie, Shi Chengming, Ma Qianli
- Dreamlip: Language-image Pre-training With Long Captions Zheng Kecheng, Zhang Yifei, Wu Wei, Lu Fan, Ma Shuailei, Jin Xin, Chen Wei, Shen Yujun
- Breaking Language Barriers: Cross-lingual Continual Pre-training At Scale Zheng Wenzhen, Pan Wenbo, Xu Xu, Qin Libo, Yue Li, Zhou Ming
- Llamafactory: Unified Efficient Fine-tuning Of 100+ Language Models Zheng Yaowei, Zhang Richong, Zhang Junhao, Ye Yanhan, Luo Zheyan, Feng Zhangchi, Ma Yongqiang
- Harnessing Large Language Models For Text-rich Sequential Recommendation Zheng Zhi, Chao Wenshuo, Qiu Zhaopeng, Zhu Hengshu, Xiong Hui
- Exploring Boundary Of GPT-4V On Marine Analysis: A Preliminary Case Study Zheng Ziqiang, Chen Yiwei, Zhang Jipeng, Vu Tuan-anh, Zeng Huimin, Tim Yue Him Wong, Yeung Sai-kit
- Understanding The Rope Extensions Of Long-context Llms: An Attention Perspective Zhong Meizhi, Zhang Chen, Lei Yikun, Liu Xikai, Gao Yan, Hu Yao, Chen Kehai, Zhang Min
- Revisiting Knowledge Distillation For Autoregressive Language Models Zhong Qihuang, Ding Liang, Shen Li, Liu Juhua, Du Bo, Tao Dacheng
- Moextend: Tuning New Experts For Modality And Task Extension Zhong Shanshan, Gao Shanghua, Huang Zhongzhan, Wen Wushao, Zitnik Marinka, Zhou Pan
- Enhancing Multimodal Large Language Models With Multi-instance Visual Prompt Generator For Visual Representation Enrichment Zhong Wenliang, Wu Wenyi, Li Qi, Barton Rob, Du Boxin, Sam Shioulin, Bouyarmane Karim, Tutar Ismail, Huang Junzhou
- Harnessing Large Language Models As Post-hoc Correctors Zhong Zhiqiang, Zhou Kuangyu, Mottin Davide
- Synthet2c: Generating Synthetic Data For Fine-tuning Large Language Models On The Text2cypher Task Zhong Ziije, Zhong Linqing, Sun Zhaoze, Jin Qingyun, Qin Zengchang, Zhang Xiaofan
- Misinforming Llms: Vulnerabilities, Challenges And Opportunities Zhou Bo, Geißler Daniel, Lukowicz Paul
- Moe-lpr: Multilingual Extension Of Large Language Models Through Mixture-of-experts With Language Priors Routing Zhou Hao, Wang Zhijun, Huang Shujian, Huang Xin, Han Xue, Feng Junlan, Deng Chao, Luo Weihua, Chen Jiajun
- Sequence To Sequence Reward Modeling: Improving RLHF By Language Feedback Zhou Jiayi, Ji Jiaming, Dai Juntao, Yang Yaodong
- Orchestrating Llms With Different Personalizations Zhou Jin Peng, Luo Katie Z, Gu Jingwen, Yuan Jason, Weinberger Kilian Q., Sun Wen
- Qog: Question And Options Generation Based On Language Model Zhou Jincheng
- Leveraging Web-crawled Data For High-quality Fine-tuning Zhou Jing, Jiang Chenglin, Shen Wei, Zhou Xiao, He Xiaonan
- Mlps Compass: What Is Learned When Mlps Are Combined With Plms? Zhou Li, Chen Wenyu, Cao Yong, Zeng Dingyi, Liu Wanlong, Qu Hong
- Towards Democratizing Multilingual Large Language Models For Medicine Through A Two-stage Instruction Fine-tuning Approach Zhou Meng, Parmar Surajsinh, Bhatti Anubhav
- Enhancing The General Agent Capabilities Of Low-parameter Llms Through Tuning And Multi-branch Reasoning Zhou Qinhao, Zhang Zihan, Xiang Xiang, Wang Ke, Wu Yuchuan, Li Yongbin
- Pre-trained Large Language Models Use Fourier Features To Compute Addition Zhou Tianyi, Fu Deqing, Sharan Vatsal, Jia Robin
- Explaining Pre-trained Language Models With Attribution Scores: An Analysis In Low-resource Settings Zhou Wei, Adel Heike, Schuff Hendrik, Vu Ngoc Thang
- Archer: Training Language Model Agents Via Hierarchical Multi-turn RL Zhou Yifei, Zanette Andrea, Pan Jiayi, Levine Sergey, Kumar Aviral
- Aligning Modalities In Vision Large Language Models Via Preference Fine-tuning Zhou Yiyang, Cui Chenhang, Rafailov Rafael, Finn Chelsea, Yao Huaxiu
- Calibrated Self-rewarding Vision Language Models Zhou Yiyang, Fan Zhiyuan, Cheng Dongjie, Yang Sihan, Chen Zhaorun, Cui Chenhang, Wang Xiyao, Li Yun, Zhang Linjun, Yao Huaxiu
- Dual Instruction Tuning With Large Language Models For Mathematical Reasoning Zhou Yongwei, Zhao Tiejun
- Defending Jailbreak Prompts Via In-context Adversarial Game Zhou Yujun, Han Yufei, Zhuang Haomin, Guo Kehan, Liang Zhenwen, Bao Hongyan, Zhang Xiangliang
- Emulated Disalignment: Safety Alignment For Large Language Models May Backfire! Zhou Zhanhui, Liu Jie, Dong Zhichen, Liu Jiaheng, Yang Chao, Ouyang Wanli, Qiao Yu
- Lawgpt: A Chinese Legal Knowledge-enhanced Large Language Model Zhou Zhi, Shi Jiang-xin, Song Peng-xiao, Yang Xiao-wen, Jin Yi-xuan, Guo Lan-zhe, Li Yu-feng
- DETAIL: Task Demonstration Attribution For Interpretable In-context Learning Zhou Zijian, Lin Xiaoqiang, Xu Xinyi, Prakash Alok, Rus Daniela, Low Bryan Kian Hsiang
- Fine-tuning Large Language Models To Translate: Will A Touch Of Noisy Data In Misaligned Languages Suffice? Zhu Dawei, Chen Pinzhen, Zhang Miaoran, Haddow Barry, Shen Xiaoyu, Klakow Dietrich
- A Preference-driven Paradigm For Enhanced Translation With Large Language Models Zhu Dawei, Trenous Sony, Shen Xiaoyu, Klakow Dietrich, Byrne Bill, Hasler Eva
- TAT-LLM: A Specialized Language Model For Discrete Reasoning Over Tabular And Textual Data Zhu Fengbin, Liu Ziyang, Feng Fuli, Wang Chao, Li Moxin, Chua Tat-seng
- FANNO: Augmenting High-quality Instruction Data With Open-sourced Llms Only Zhu He, Su Junyou, Lun Tianle, Tao Yicheng, Zhang Wenjia, Fan Zipei, Chen Guanhua
- Cost: Contrastive Quantization Based Semantic Tokenization For Generative Recommendation Zhu Jieming, Jin Mengqun, Liu Qijiong, Qiu Zexuan, Dong Zhenhua, Li Xiu
- Collectivesft: Scaling Large Language Models For Chinese Medical Benchmark With Collective Instructions In Healthcare Zhu Jingwei, Tan Minghuan, Yang Min, Li Ruixue, Alinejad-rokny Hamid
- Fastmem: Fast Memorization Of Prompt Improves Context Awareness Of Large Language Models Zhu Junyi, Liu Shuochen, Yu Yu, Tang Bo, Yan Yibo, Li Zhiyu, Xiong Feiyu, Xu Tong, Blaschko Matthew B.
- Parameter-efficient Tuning Large Language Models For Graph Representation Learning Zhu Qi, Zheng Da, Song Xiang, Zhang Shichang, Jin Bowen, Sun Yizhou, Karypis George
- Scalable Matmul-free Language Modeling Zhu Rui-jie, Zhang Yu, Sifferman Ethan, Sheaves Tyler, Wang Yiqiao, Richmond Dustin, Zhou Peng, Eshraghian Jason K.
- IAPT: Instruction-aware Prompt Tuning For Large Language Models Zhu Wei, Tian Aaron Xuxiang, Yin Congrui, Ni Yuan, Wang Xiaoling, Xie Guotong
- Distilling Mathematical Reasoning Capabilities Into Small Language Models Zhu Xunyu, Li Jian, Liu Yong, Ma Can, Wang Weiping
- Can Large Language Models Understand Context? Zhu Yilun, Moniz Joel Ruben Antony, Bhargava Shruti, Lu Jiarui, Piraviperumal Dhivya, Li Site, Zhang Yuan, Yu Hong, Tseng Bo-hsiang
- Generative Pre-trained Speech Language Model With Efficient Hierarchical Transformer Zhu Yongxin, Su Dan, He Liqiang, Xu Linli, Yu Dong
- Efficient Test-time Prompt Tuning For Vision-language Models Zhu Yuhan, Zhang Guozhen, Xu Chen, Shen Haocheng, Chen Xiaoxin, Wu Gangshan, Wang Limin
- One Token Can Help! Learning Scalable And Pluggable Virtual Tokens For Retrieval-augmented Large Language Models Zhu Yutao, Huang Zhaoheng, Dou Zhicheng, Wen Ji-rong
- Lumina-next: Making Lumina-t2x Stronger And Faster With Next-dit Zhuo Le, Du Ruoyi, Xiao Han, Li Yangguang, Liu Dongyang, Huang Rongjie, Liu Wenze, Zhao Lirui, Wang Fu-yun, Ma Zhanyu, Luo Xu, Wang Zehan, Zhang Kaipeng, Zhu Xiangyang, Liu Si, Yue Xiangyu, Liu Dingning, Ouyang Wanli, Liu Ziwei, Qiao Yu, Li Hongsheng, Gao Peng
- Astraios: Parameter-efficient Instruction Tuning Code Large Language Models Zhuo Terry Yue, Zebaze Armel, Suppattarachai Nitchakarn, Von Werra Leandro, De Vries Harm, Liu Qian, Muennighoff Niklas
- Language Model Sentence Completion With A Parser-driven Rhetorical Control Method Zingale Joshua, Kalita Jugal
- Proswitch: Knowledge-guided Instruction Tuning To Generate Professional And Non-professional Styled Text Zong Chang, Chen Yuyan, Lu Weiming, Shao Jian, Zhuang Yueting
- Llama-excitor: General Instruction Tuning Via Indirect Feature Interaction Zou Bo, Yang Chao, Qiao Yu, Quan Chengbin, Zhao Youjian
- Promptintern: Saving Inference Costs By Internalizing Recurrent Prompt During Large Language Model Fine-tuning Zou Jiaru, Zhou Mengyu, Li Tao, Han Shi, Zhang Dongmei
- MLKV: Multi-layer Key-value Heads For Memory Efficient Transformer Decoding Zuhri Zayd Muhammad Kawakibi, Adilazuarda Muhammad Farid, Purwarianti Ayu, Aji Alham Fikri
- BASE TTS: Lessons From Building A Billion-parameter Text-to-speech Model On 100K Hours Of Data Łajszczak Mateusz, Cámbara Guillermo, Li Yang, Beyhan Fatih, Van Korlaar Arent, Yang Fan, Joly Arnaud, Martín-cortinas Álvaro, Abbas Ammar, Michalski Adam, Moinet Alexis, Karlapati Sri, Muszyńska Ewa, Guo Haohan, Putrycz Bartosz, Gambino Soledad López, Yoo Kayeon, Sokolova Elena, Drugman Thomas
- Concept-aware Data Construction Improves In-context Learning Of Language Models Štefánik Michal, Kadlčík Marek, Sojka Petr
🏷 Prompting
- Boolq: Exploring The Surprising Difficulty Of Natural Yes/no Questions Clark Christopher, Lee Kenton, Chang Ming-wei, Kwiatkowski Tom, Collins Michael, Toutanova Kristina
- App-aware Response Synthesis For User Reviews Farooq Umar, Siddique A. B., Jamour Fuad, Zhao Zhijia, Hristidis Vagelis
- SMRT Chatbots: Improving Non-task-oriented Dialog With Simulated Multiple Reference Training Khayrallah Huda, Sedoc João
- What Does BERT Know About Books, Movies And Music? Probing BERT For Conversational Recommendation Penha Gustavo, Hauff Claudia
- Making Pre-trained Language Models Better Few-shot Learners Tianyu Gao, Adam Fisch, Danqi Chen
- A General Language Assistant As A Laboratory For Alignment Amanda Askell, Yuntao Bai, Anna Chen, Dawn Drain, Deep Ganguli, Tom Henighan, Andy Jones, Nicholas Joseph, Ben Mann, Nova Dassarma, Nelson Elhage, Zac Hatfield-dodds, Danny Hernandez, Jackson Kernion, Kamal Ndousse, Catherine Olsson, Dario Amodei, Tom Brown, Jack Clark, Sam Mccandlish, Chris Olah, Jared Kaplan
- What Changes Can Large-scale Language Models Bring? Intensive Study On Hyperclova: Billions-scale Korean Generative Pretrained Transformers Boseop Kim, Hyoungseok Kim, Sang-woo Lee, Gichang Lee, Donghyun Kwak, Dong Hyeon Jeon, Sunghyun Park, Sungju Kim, Seonhoon Kim, Dongpil Seo, Heungsub Lee, Minyoung Jeong, Sungjae Lee, Minsub Kim, Suk Hyun Ko, Seokhun Kim, Taeyong Park, Jinuk Kim, Soyoung Kang, Na-hyeon Ryu, Kang Min Yoo, Minsuk Chang, Soobin Suh, Sookyo In, Jinseong Park, Kyungduk Kim, Hiun Kim, Jisu Jeong, Yong Goo Yeo, Donghoon Ham, Dongju Park, Min Young Lee, Jaewook Kang, Inho Kang, Jung-woo Ha, Woomyoung Park, Nako Sung
- The Power Of Scale For Parameter-efficient Prompt Tuning Brian Lester, Rami Al-rfou, Noah Constant
- Control Prefixes For Parameter-efficient Text Generation Clive Jordan, Cao Kris, Rei Marek
- Zero-shot Recommendation As Language Modeling Damien Sileo, Wout Vossen, Robbe Raymaekers
- Openprompt: An Open-source Framework For Prompt-learning Ding Ning, Hu Shengding, Zhao Weilin, Chen Yulin, Liu Zhiyuan, Zheng Hai-tao, Sun Maosong
- WARP: Word-level Adversarial Reprogramming Hambardzumyan Karen, Khachatrian Hrant, May Jonathan
- Generate, Annotate, And Learn: NLP With Synthetic Text He Xuanli, Nassar Islam, Kiros Jamie, Haffari Gholamreza, Norouzi Mohammad
- A Good Prompt Is Worth Millions Of Parameters: Low-resource Prompt-based Learning For Vision-language Models Jin Woojeong, Cheng Yu, Shen Yelong, Chen Weizhu, Ren Xiang
- Text-free Prosody-aware Generative Spoken Language Modeling Kharitonov Eugene, Lee Ann, Polyak Adam, Adi Yossi, Copet Jade, Lakhotia Kushal, Nguyen Tu-anh, Rivière Morgane, Mohamed Abdelrahman, Dupoux Emmanuel, Hsu Wei-ning
- What's In A Measurement? Using GPT-3 On Semeval 2021 Task 8 -- Measeval Kohler Curt, Daniel Ron Jr
- Few-shot Learning With Multilingual Language Models Lin Xi Victoria, Mihaylov Todor, Artetxe Mikel, Wang Tianlu, Chen Shuohui, Simig Daniel, Ott Myle, Goyal Naman, Bhosale Shruti, Du Jingfei, Pasunuru Ramakanth, Shleifer Sam, Koura Punit Singh, Chaudhary Vishrav, O'horo Brian, Wang Jeff, Zettlemoyer Luke, Kozareva Zornitsa, Diab Mona, Stoyanov Veselin, Li Xian
- What Makes Good In-context Examples For GPT-3? Liu Jiachang, Shen Dinghan, Zhang Yizhe, Dolan Bill, Carin Lawrence, Chen Weizhu
- Unified Multimodal Pre-training And Prompt-based Tuning For Vision-language Understanding And Generation Liu Tianyi, Wu Zuxuan, Xiong Wenhan, Chen Jingjing, Jiang Yu-gang
- GPT Understands, Too Liu Xiao, Zheng Yanan, Du Zhengxiao, Ding Ming, Qian Yujie, Yang Zhilin, Tang Jie
- Fantastically Ordered Prompts And Where To Find Them: Overcoming Few-shot Prompt Order Sensitivity Lu Yao, Bartolo Max, Moore Alastair, Riedel Sebastian, Stenetorp Pontus
- Few-shot Self-rationalization With Natural Language Prompts Marasović Ana, Beltagy Iz, Downey Doug, Peters Matthew E.
- Evaluating Large Language Models Trained On Code Mark Chen, Jerry Tworek, Heewoo Jun, Qiming Yuan, Henrique Ponde De Oliveira Pinto, Jared Kaplan, Harri Edwards, Yuri Burda, Nicholas Joseph, Greg Brockman, Alex Ray, Raul Puri, Gretchen Krueger, Michael Petrov, Heidy Khlaaf, Girish Sastry, Pamela Mishkin, Brooke Chan, Scott Gray, Nick Ryder, Mikhail Pavlov, Alethea Power, Lukasz Kaiser, Mohammad Bavarian, Clemens Winter, Philippe Tillet, Felipe Petroski Such, Dave Cummings, Matthias Plappert, Fotios Chantzis, Elizabeth Barnes, Ariel Herbert-voss, William Hebgen Guss, Alex Nichol, Alex Paino, Nikolas Tezak, Jie Tang, Igor Babuschkin, Suchir Balaji, Shantanu Jain, William Saunders, Christopher Hesse, Andrew N. Carr, Jan Leike, Josh Achiam, Vedant Misra, Evan Morikawa, Alec Radford, Matthew Knight, Miles Brundage, Mira Murati, Katie Mayer, Peter Welinder, Bob Mcgrew, Dario Amodei, Sam Mccandlish, Ilya Sutskever, Wojciech Zaremba
- CINS: Comprehensive Instruction For Few-shot Learning In Task-oriented Dialog Systems Mi Fei, Li Yitong, Wang Yasheng, Jiang Xin, Liu Qun
- Recent Advances In Natural Language Processing Via Large Pre-trained Language Models: A Survey Min Bonan, Ross Hayley, Sulem Elior, Veyseh Amir Pouran Ben, Nguyen Thien Huu, Sainz Oscar, Agirre Eneko, Heinz Ilana, Roth Dan
- Glam: Efficient Scaling Of Language Models With Mixture-of-experts Nan Du, Yanping Huang, Andrew M. Dai, Simon Tong, Dmitry Lepikhin, Yuanzhong Xu, Maxim Krikun, Yanqi Zhou, Adams Wei Yu, Orhan Firat, Barret Zoph, Liam Fedus, Maarten Bosma, Zongwei Zhou, Tao Wang, Yu Emma Wang, Kellie Webster, Marie Pellat, Kevin Robinson, Kathleen Meier-hellstern, Toju Duke, Lucas Dixon, Kun Zhang, Quoc V Le, Yonghui Wu, Zhifeng Chen, Claire Cui
- Planning With Learned Entity Prompts For Abstractive Summarization Narayan Shashi, Zhao Yao, Maynez Joshua, Simoes Gonçalo, Nikolaev Vitaly, Mcdonald Ryan
- P-adapters: Robustly Extracting Factual Information From Language Models With Diverse Prompts Newman Benjamin, Choubey Prafulla Kumar, Rajani Nazneen
- Learning To Retrieve Prompts For In-context Learning Ohad Rubin, Jonathan Herzig, Jonathan Berant
- Pre-train, Prompt, And Predict: A Systematic Survey Of Prompting Methods In Natural Language Processing Pengfei Liu, Weizhe Yuan, Jinlan Fu, Zhengbao Jiang, Hiroaki Hayashi, Graham Neubig
- LFPT5: A Unified Framework For Lifelong Few-shot Language Learning Based On Prompt Tuning Of T5 Qin Chengwei, Joty Shafiq
- Prompt Programming For Large Language Models: Beyond The Few-shot Paradigm Reynolds Laria, Mcdonell Kyle
- Large Pre-trained Language Models Contain Human-like Biases Of What Is Right And Wrong To Do Schramowski Patrick, Turan Cigdem, Andersen Nico, Rothkopf Constantin A., Kersting Kristian
- The Power Of Prompt Tuning For Low-resource Semantic Parsing Schucher Nathan, Reddy Siva, De Vries Harm
- Metaicl: Learning To Learn In Context Sewon Min, Mike Lewis, Luke Zettlemoyer, Hannaneh Hajishirzi
- Few-shot Semantic Parsing With Language Models Trained On Code Shin Richard, Van Durme Benjamin
- NSP-BERT: A Prompt-based Few-shot Learner Through An Original Pre-training Task--next Sentence Prediction Sun Yi, Zheng Yu, Hao Chao, Qiu Hangping
- Multimodal Few-shot Learning With Frozen Language Models Tsimpoukelli Maria, Menick Jacob, Cabi Serkan, Eslami S. M. Ali, Vinyals Oriol, Hill Felix
- Multitask Prompted Training Enables Zero-shot Task Generalization Victor Sanh, Albert Webson, Colin Raffel, Stephen H. Bach, Lintang Sutawika, Zaid Alyafeai, Antoine Chaffin, Arnaud Stiegler, Teven Le Scao, Arun Raja, Manan Dey, M Saiful Bari, Canwen Xu, Urmish Thakker, Shanya Sharma Sharma, Eliza Szczechla, Taewoon Kim, Gunjan Chhablani, Nihal Nayak, Debajyoti Datta, Jonathan Chang, Mike Tian-jian Jiang, Han Wang, Matteo Manica, Sheng Shen, Zheng Xin Yong, Harshit Pandey, Rachel Bawden, Thomas Wang, Trishala Neeraj, Jos Rozen, Abheesht Sharma, Andrea Santilli, Thibault Fevry, Jason Alan Fries, Ryan Teehan, Tali Bers, Stella Biderman, Leo Gao, Thomas Wolf, Alexander M. Rush
- List: Lite Prompted Self-training Makes Parameter-efficient Few-shot Learners Wang Yaqing, Mukherjee Subhabrata, Liu Xiaodong, Gao Jing, Awadallah Ahmed Hassan, Gao Jianfeng
- Reframing Human-ai Collaboration For Generating Free-text Explanations Wiegreffe Sarah, Hessel Jack, Swayamdipta Swabha, Riedl Mark, Choi Yejin
- AI Chains: Transparent And Controllable Human-ai Interaction By Chaining Large Language Model Prompts Wu Tongshuang, Terry Michael, Cai Carrie J.
- Prefix-tuning: Optimizing Continuous Prompts For Generation Xiang Lisa Li, Percy Liang
- P-tuning V2: Prompt Tuning Can Be Comparable To Fine-tuning Universally Across Scales And Tasks Xiao Liu, Kaixuan Ji, Yicheng Fu, Weng Lam Tam, Zhengxiao Du, Zhilin Yang, Jie Tang
- An Explanation Of In-context Learning As Implicit Bayesian Inference Xie Sang Michael, Raghunathan Aditi, Liang Percy, Ma Tengyu
- An Empirical Study Of GPT-3 For Few-shot Knowledge-based VQA Yang Zhengyuan, Gan Zhe, Wang Jianfeng, Hu Xiaowei, Lu Yumao, Liu Zicheng, Wang Lijuan
- Gpt3mix: Leveraging Large-scale Language Models For Text Augmentation Yoo Kang Min, Park Dongju, Kang Jaewook, Lee Sang-woo, Park Woomyeong
- Reconsidering The Past: Optimizing Hidden States In Language Models Yoshida Davis, Gimpel Kevin
- Retgen: A Joint Framework For Retrieval And Grounded Text Generation Modeling Zhang Yizhe, Sun Siqi, Gao Xiang, Fang Yuwei, Brockett Chris, Galley Michel, Gao Jianfeng, Dolan Bill
- Calibrate Before Use: Improving Few-shot Performance Of Language Models Zhao Tony Z., Wallace Eric, Feng Shi, Klein Dan, Singh Sameer
- CPM-2: Large-scale Cost-effective Pre-trained Language Models Zhengyan Zhang, Yuxian Gu, Xu Han, Shengqi Chen, Chaojun Xiao, Zhenbo Sun, Yuan Yao, Fanchao Qi, Jian Guan, Pei Ke, Yanzheng Cai, Guoyang Zeng, Zhixing Tan, Zhiyuan Liu, Minlie Huang, Wentao Han, Yang Liu, Xiaoyan Zhu, Maosong Sun
- Adapting Language Models For Zero-shot Learning By Meta-tuning On Dataset And Prompt Collections Zhong Ruiqi, Lee Kristy, Zhang Zheng, Klein Dan
- GPT-3-driven Pedagogical Agents For Training Children's Curious Question-asking Skills Abdelghani Rania, Wang Yen-hsiang, Yuan Xingdi, Wang Tong, Lucas Pauline, Sauzéon Hélène, Oudeyer Pierre-yves
- Vector Representations Of Idioms In Conversational Systems Adewumi Tosin, Liwicki Foteini, Liwicki Marcus
- Despite "super-human" Performance, Current LLMs Are Unsuited For Decisions About Ethics And Safety Albrecht Joshua, Kitanidis Ellie, Fetterman Abraham J.
- Prompting As Probing: Using Language Models For Knowledge Base Construction Alivanistos Dimitrios, Santamaría Selene Báez, Cochez Michael, Kalo Jan-christoph, Van Krieken Emile, Thanapalasingam Thiviyan
- Text And Patterns: For Effective Chain Of Thought, It Takes Two To Tango Aman Madaan, Amir Yazdanbakhsh
- Improving Alignment Of Dialogue Agents Via Targeted Human Judgements Amelia Glaese, Nat Mcaleese, Maja Trębacz, John Aslanides, Vlad Firoiu, Timo Ewalds, Maribeth Rauh, Laura Weidinger, Martin Chadwick, Phoebe Thacker, Lucy Campbell-gillingham, Jonathan Uesato, Po-sen Huang, Ramona Comanescu, Fan Yang, Abigail See, Sumanth Dathathri, Rory Greig, Charlie Chen, Doug Fritz, Jaume Sanchez Elias, Richard Green, Soňa Mokrá, Nicholas Fernando, Boxi Wu, Rachel Foley, Susannah Young, Iason Gabriel, William Isaac, John Mellor, Demis Hassabis, Koray Kavukcuoglu, Lisa Anne Hendricks, Geoffrey Irving
- Query Refinement Prompts For Closed-book Long-form Question Answering Amplayo Reinald Kim, Webster Kellie, Collins Michael, Das Dipanjan, Narayan Shashi
- Input-tuning: Adapting Unfamiliar Inputs To Frozen Pretrained Models An Shengnan, Li Yifei, Lin Zeqi, Liu Qian, Chen Bei, Fu Qiang, Chen Weizhu, Zheng Nanning, Lou Jian-guang
- Compositional Semantic Parsing With Large Language Models Andrew Drozdov, Nathanael Schärli, Ekin Akyürek, Nathan Scales, Xinying Song, Xinyun Chen, Olivier Bousquet, Denny Zhou
- Can Language Models Learn From Explanations In Context? Andrew K. Lampinen, Ishita Dasgupta, Stephanie C. Y. Chan, Kory Matthewson, Michael Henry Tessler, Antonia Creswell, James L. Mcclelland, Jane X. Wang, Felix Hill
- Socratic Models: Composing Zero-shot Multimodal Reasoning With Language Andy Zeng, Maria Attarian, Brian Ichter, Krzysztof Choromanski, Adrian Wong, Stefan Welker, Federico Tombari, Aveek Purohit, Michael Ryoo, Vikas Sindhwani, Johnny Lee, Vincent Vanhoucke, Pete Florence
- Exploring Length Generalization In Large Language Models Anil Cem, Wu Yuhuai, Andreassen Anders, Lewkowycz Aitor, Misra Vedant, Ramasesh Vinay, Slone Ambrose, Gur-ari Guy, Dyer Ethan, Neyshabur Behnam
- ATTEMPT: Parameter-efficient Multi-task Tuning Via Attentional Mixtures Of Soft Prompts Asai Akari, Salehi Mohammadreza, Peters Matthew E., Hajishirzi Hannaneh
- Bootstrapping Multilingual Semantic Parsers Using Large Language Models Awasthi Abhijeet, Gupta Nitish, Samanta Bidisha, Dave Shachi, Sarawagi Sunita, Talukdar Partha
- Enabling Classifiers To Make Judgements Explicitly Aligned With Human Values Bang Yejin, Yu Tiezheng, Madotto Andrea, Lin Zhaojiang, Diab Mona, Fung Pascale
- Rethinking The Role Of Scale For In-context Learning: An Interpretability-based Case Study At 66 Billion Scale Bansal Hritik, Gopalakrishnan Karthik, Dingliwal Saket, Bodapati Sravan, Kirchhoff Katrin, Roth Dan
- Leveraging Pre-trained Language Models For Conversational Information Seeking From Text Bellan Patrizio, Dragoni Mauro, Ghidini Chiara
- Vector-quantized Input-contextualized Soft Prompts For Natural Language Understanding Bhardwaj Rishabh, Saha Amrita, Hoi Steven C. H., Poria Soujanya
- BLOOM: A 176b-parameter Open-access Multilingual Language Model Bigscience Workshop, Teven Le Scao, Angela Fan, Christopher Akiki, Ellie Pavlick, Suzana Ilić, Daniel Hesslow, Roman Castagné, Alexandra Sasha Luccioni, François Yvon, Matthias Gallé, Jonathan Tow, Alexander M. Rush, Stella Biderman, Albert Webson, Pawan Sasanka Ammanamanchi, Thomas Wang, Benoît Sagot, Niklas Muennighoff, Albert Villanova Del Moral, Olatunji Ruwase, Rachel Bawden, Stas Bekman, Angelina Mcmillan-major, Iz Beltagy, Huu Nguyen, Lucile Saulnier, Samson Tan, Pedro Ortiz Suarez, Victor Sanh, Hugo Laurençon, Yacine Jernite, Julien Launay, Margaret Mitchell, Colin Raffel, Aaron Gokaslan, Adi Simhi, Aitor Soroa, Alham Fikri Aji, Amit Alfassy, Anna Rogers, Ariel Kreisberg Nitzav, Canwen Xu, Chenghao Mou, Chris Emezue, Christopher Klamm, Colin Leong, Daniel Van Strien, David Ifeoluwa Adelani, Dragomir Radev, Eduardo González Ponferrada, Efrat Levkovizh, Ethan Kim, Eyal Bar Natan, Francesco De Toni, Gérard Dupont, Germán Kruszewski, Giada Pistilli, Hady Elsahar, Hamza Benyamina, Hieu Tran, Ian Yu, Idris Abdulmumin, Isaac Johnson, Itziar Gonzalez-dios, Javier De La Rosa, Jenny Chim, Jesse Dodge, Jian Zhu, Jonathan Chang, Jörg Frohberg, Joseph Tobing, Joydeep Bhattacharjee, Khalid Almubarak, Kimbo Chen, Kyle Lo, Leandro Von Werra, Leon Weber, Long Phan, Loubna Ben Allal, Ludovic Tanguy, Manan Dey, Manuel Romero Muñoz, Maraim Masoud, María Grandury, Mario Šaško, Max Huang, Maximin Coavoux, Mayank Singh, Mike Tian-jian Jiang, Minh Chien Vu, Mohammad A. Jauhar, Mustafa Ghaleb, Nishant Subramani, Nora Kassner, Nurulaqilla Khamis, Olivier Nguyen, Omar Espejel, Ona De Gibert, Paulo Villegas, Peter Henderson, Pierre Colombo, Priscilla Amuok, Quentin Lhoest, Rheza Harliman, Rishi Bommasani, Roberto Luis López, Rui Ribeiro, Salomey Osei, Sampo Pyysalo, Sebastian Nagel, Shamik Bose, Shamsuddeen Hassan Muhammad, Shanya Sharma, Shayne Longpre, Somaieh Nikpoor, Stanislav Silberberg, Suhas Pai, Sydney Zink, Tiago Timponi Torrent, Timo Schick, Tristan Thrush, Valentin Danchev, Vassilina Nikoulina, Veronika Laippala, Violette Lepercq, Vrinda Prabhu, Zaid Alyafeai, Zeerak Talat, Arun Raja, Benjamin Heinzerling, Chenglei Si, Davut Emre Taşar, Elizabeth Salesky, Sabrina J. Mielke, Wilson Y. Lee, Abheesht Sharma, Andrea Santilli, Antoine Chaffin, Arnaud Stiegler, Debajyoti Datta, Eliza Szczechla, Gunjan Chhablani, Han Wang, Harshit Pandey, Hendrik Strobelt, Jason Alan Fries, Jos Rozen, Leo Gao, Lintang Sutawika, M Saiful Bari, Maged S. Al-shaibani, Matteo Manica, Nihal Nayak, Ryan Teehan, Samuel Albanie, Sheng Shen, Srulik Ben-david, Stephen H. Bach, Taewoon Kim, Tali Bers, Thibault Fevry, Trishala Neeraj, Urmish Thakker, Vikas Raunak, Xiangru Tang, Zheng-xin Yong, Zhiqing Sun, Shaked Brody, Yallow Uri, Hadar Tojarieh, Adam Roberts, Hyung Won Chung, Jaesung Tae, Jason Phang, Ofir Press, Conglong Li, Deepak Narayanan, Hatim Bourfoune, Jared Casper, Jeff Rasley, Max Ryabinin, Mayank Mishra, Minjia Zhang, Mohammad Shoeybi, Myriam Peyrounette, Nicolas Patry, Nouamane Tazi, Omar Sanseviero, Patrick Von Platen, Pierre Cornette, Pierre François Lavallée, Rémi Lacroix, Samyam Rajbhandari, Sanchit Gandhi, Shaden Smith, Stéphane Requena, Suraj Patil, Tim Dettmers, Ahmed Baruwa, Amanpreet Singh, Anastasia Cheveleva, Anne-laure Ligozat, Arjun Subramonian, Aurélie Névéol, Charles Lovering, Dan Garrette, Deepak Tunuguntla, Ehud Reiter, Ekaterina Taktasheva, Ekaterina Voloshina, Eli Bogdanov, Genta Indra Winata, Hailey Schoelkopf, Jan-christoph Kalo, Jekaterina Novikova, Jessica Zosa Forde, Jordan Clive, Jungo Kasai, Ken Kawamura, Liam Hazan, Marine Carpuat, Miruna Clinciu, Najoung Kim, Newton Cheng, Oleg Serikov, Omer Antverg, Oskar Van Der Wal, Rui Zhang, Ruochen Zhang, Sebastian Gehrmann, Shachar Mirkin, Shani Pais, Tatiana Shavrina, Thomas Scialom, Tian Yun, Tomasz Limisiewicz, Verena Rieser, Vitaly Protasov, Vladislav Mikhailov, Yada Pruksachatkun, Yonatan Belinkov, Zachary Bamberger, Zdeněk Kasner, Alice Rueda, Amanda Pestana, Amir Feizpour, Ammar Khan, Amy Faranak, Ana Santos, Anthony Hevia, Antigona Unldreaj, Arash Aghagol, Arezoo Abdollahi, Aycha Tammour, Azadeh Hajihosseini, Bahareh Behroozi, Benjamin Ajibade, Bharat Saxena, Carlos Muñoz Ferrandis, Daniel Mcduff, Danish Contractor, David Lansky, Davis David, Douwe Kiela, Duong A. Nguyen, Edward Tan, Emi Baylor, Ezinwanne Ozoani, Fatima Mirza, Frankline Ononiwu, Habib Rezanejad, Hessie Jones, Indrani Bhattacharya, Irene Solaiman, Irina Sedenko, Isar Nejadgholi, Jesse Passmore, Josh Seltzer, Julio Bonis Sanz, Livia Dutra, Mairon Samagaio, Maraim Elbadri, Margot Mieskes, Marissa Gerchick, Martha Akinlolu, Michael Mckenna, Mike Qiu, Muhammed Ghauri, Mykola Burynok, Nafis Abrar, Nazneen Rajani, Nour Elkott, Nour Fahmy, Olanrewaju Samuel, Ran An, Rasmus Kromann, Ryan Hao, Samira Alizadeh, Sarmad Shubber, Silas Wang, Sourav Roy, Sylvain Viguier, Thanh Le, Tobi Oyebade, Trieu Le, Yoyo Yang, Zach Nguyen, Abhinav Ramesh Kashyap, Alfredo Palasciano, Alison Callahan, Anima Shukla, Antonio Miranda-escalada, Ayush Singh, Benjamin Beilharz, Bo Wang, Caio Brito, Chenxi Zhou, Chirag Jain, Chuxin Xu, Clémentine Fourrier, Daniel León Periñán, Daniel Molano, Dian Yu, Enrique Manjavacas, Fabio Barth, Florian Fuhrimann, Gabriel Altay, Giyaseddin Bayrak, Gully Burns, Helena U. Vrabec, Imane Bello, Ishani Dash, Jihyun Kang, John Giorgi, Jonas Golde, Jose David Posada, Karthik Rangasai Sivaraman, Lokesh Bulchandani, Lu Liu, Luisa Shinzato, Madeleine Hahn De Bykhovetz, Maiko Takeuchi, Marc Pàmies, Maria A Castillo, Marianna Nezhurina, Mario Sänger, Matthias Samwald, Michael Cullan, Michael Weinberg, Michiel De Wolf, Mina Mihaljcic, Minna Liu, Moritz Freidank, Myungsun Kang, Natasha Seelam, Nathan Dahlberg, Nicholas Michio Broad, Nikolaus Muellner, Pascale Fung, Patrick Haller, Ramya Chandrasekhar, Renata Eisenberg, Robert Martin, Rodrigo Canalli, Rosaline Su, Ruisi Su, Samuel Cahyawijaya, Samuele Garda, Shlok S Deshmukh, Shubhanshu Mishra, Sid Kiblawi, Simon Ott, Sinee Sang-aroonsiri, Srishti Kumar, Stefan Schweter, Sushil Bharati, Tanmay Laud, Théo Gigant, Tomoya Kainuma, Wojciech Kusa, Yanis Labrak, Yash Shailesh Bajaj, Yash Venkatraman, Yifan Xu, Yingxin Xu, Yu Xu, Zhe Tan, Zhongli Xie, Zifan Ye, Mathilde Bras, Younes Belkada, Thomas Wolf
- Prompting Language Models For Linguistic Structure Blevins Terra, Gonen Hila, Zettlemoyer Luke
- Looking For A Handsome Carpenter! Debiasing GPT-3 Job Advertisements Borchers Conrad, Gala Dalia Sara, Gilburt Benjamin, Oravkin Eduard, Bounsi Wilfried, Asano Yuki M., Kirk Hannah Rose
- Large Language Models Can Implement Policy Iteration Brooks Ethan, Walls Logan, Lewis Richard L., Singh Satinder
- Robust Preference Learning For Storytelling Via Contrastive Reinforcement Learning Castricato Louis, Havrilla Alexander, Matiana Shahbuland, Pieler Michael, Ye Anbang, Yang Ian, Frazier Spencer, Riedl Mark
- Ernie-code: Beyond English-centric Cross-lingual Pretraining For Programming Languages Chai Yekun, Wang Shuohuan, Pang Chao, Sun Yu, Tian Hao, Wu Hua
- Speechprompt: An Exploration Of Prompt Tuning On Generative Spoken Language Model For Speech Processing Tasks Chang Kai-wei, Tseng Wei-cheng, Li Shang-wen, Lee Hung-yi
- Data Curation Alone Can Stabilize In-context Learning Chang Ting-yun, Jia Robin
- Analyzing Semantic Faithfulness Of Language Models Via Input Intervention On Question Answering Chaturvedi Akshay, Bhar Swarnadeep, Saha Soumadeep, Garain Utpal, Asher Nicholas
- Testing Occupational Gender Bias In Language Models: Towards Robust Measurement And Zero-shot Debiasing Chen Yuen, Raghuram Vethavikashini Chithrra, Mattern Justus, Sachan Mrinmaya, Mihalcea Rada, Schölkopf Bernhard, Jin Zhijing
- Adaprompt: Adaptive Model Training For Prompt-based NLP Chen Yulong, Liu Yang, Dong Li, Wang Shuohang, Zhu Chenguang, Zeng Michael, Zhang Yue
- Prompting GPT-3 To Be Reliable Chenglei Si, Zhe Gan, Zhengyuan Yang, Shuohang Wang, Jianfeng Wang, Jordan Boyd-graber, Lijuan Wang
- Scaling Instruction-finetuned Language Models Chung Hyung Won, Hou Le, Longpre Shayne, Zoph Barret, Tay Yi, Fedus William, Li Yunxuan, Wang Xuezhi, Dehghani Mostafa, Brahma Siddhartha, Webson Albert, Gu Shixiang Shane, Dai Zhuyun, Suzgun Mirac, Chen Xinyun, Chowdhery Aakanksha, Castro-ros Alex, Pellat Marie, Robinson Kevin, Valter Dasha, Narang Sharan, Mishra Gaurav, Yu Adams, Zhao Vincent, Huang Yanping, Dai Andrew, Yu Hongkun, Petrov Slav, Chi Ed H., Dean Jeff, Devlin Jacob, Roberts Adam, Zhou Denny, Le Quoc V., Wei Jason
- Building The Intent Landscape Of Real-world Conversational Corpora With Extractive Question-answering Transformers Corbeil Jean-philippe, Li Mia Taige, Ghavidel Hadi Abdi
- VQGAN-CLIP: Open Domain Image Generation And Editing With Natural Language Guidance Crowson Katherine, Biderman Stella, Kornis Daniel, Stander Dashiell, Hallahan Eric, Castricato Louis, Raff Edward
- Decoder Tuning: Efficient Language Understanding As Decoding Cui Ganqu, Li Wentao, Ding Ning, Huang Longtao, Liu Zhiyuan, Sun Maosong
- M6-rec: Generative Pretrained Language Models Are Open-ended Recommender Systems Cui Zeyu, Ma Jianxin, Zhou Chang, Zhou Jingren, Yang Hongxia
- Why Can GPT Learn In-context? Language Models Implicitly Perform Gradient Descent As Meta-optimizers Dai Damai, Sun Yutao, Dong Li, Hao Yaru, Ma Shuming, Sui Zhifang, Wei Furu
- Promptagator: Few-shot Dense Retrieval From 8 Examples Dai Zhuyun, Zhao Vincent Y., Ma Ji, Luan Yi, Ni Jianmo, Lu Jing, Bakalov Anton, Guu Kelvin, Hall Keith B., Chang Ming-wei
- Understanding BLOOM: An Empirical Study On Diverse NLP Tasks Dakle Parag Pravin, Rallabandi Saikrishna, Raghavan Preethi
- Language Model Cascades David Dohan, Winnie Xu, Aitor Lewkowycz, Jacob Austin, David Bieber, Raphael Gontijo Lopes, Yuhuai Wu, Henryk Michalewski, Rif A. Saurous, Jascha Sohl-dickstein, Kevin Murphy, Charles Sutton
- Rlprompt: Optimizing Discrete Text Prompts With Reinforcement Learning Deng Mingkai, Wang Jianyu, Hsieh Cheng-ping, Wang Yihan, Guo Han, Shu Tianmin, Song Meng, Xing Eric P., Hu Zhiting
- A Unified Multi-task Learning Framework For Multi-goal Conversational Recommender Systems Deng Yang, Zhang Wenxuan, Xu Weiwen, Lei Wenqiang, Chua Tat-seng, Lam Wai
- Least-to-most Prompting Enables Complex Reasoning In Large Language Models Denny Zhou, Nathanael Schärli, Le Hou, Jason Wei, Nathan Scales, Xuezhi Wang, Dale Schuurmans, Claire Cui, Olivier Bousquet, Quoc Le, Ed Chi
- Black-box Prompt Learning For Pre-trained Language Models Diao Shizhe, Huang Zhichao, Xu Ruijia, Li Xuechun, Lin Yong, Zhou Xiao, Zhang Tong
- Corrpus: Code-based Structured Prompting For Neurosymbolic Story Understanding Dong Yijiang River, Martin Lara J., Callison-burch Chris
- Winodict: Probing Language Models For In-context Word Acquisition Eisenschlos Julian Martin, Cole Jeremy R., Liu Fangyu, Cohen William W.
- Star: Bootstrapping Reasoning With Reasoning Eric Zelikman, Yuhuai Wu, Jesse Mu, Noah D. Goodman
- Codegen: An Open Large Language Model For Code With Multi-turn Program Synthesis Erik Nijkamp, Bo Pang, Hiroaki Hayashi, Lifu Tu, Huan Wang, Yingbo Zhou, Silvio Savarese, Caiming Xiong
- Language Models Are Multilingual Chain-of-thought Reasoners Freda Shi, Mirac Suzgun, Markus Freitag, Xuezhi Wang, Suraj Srivats, Soroush Vosoughi, Hyung Won Chung, Yi Tay, Sebastian Ruder, Denny Zhou, Dipanjan Das, Jason Wei
- Complexity-based Prompting For Multi-step Reasoning Fu Yao, Peng Hao, Sabharwal Ashish, Clark Peter, Khot Tushar
- MIGA: A Unified Multi-task Generation Framework For Conversational Text-to-sql Fu Yingwen, Ou Wenjie, Yu Zhou, Lin Yue
- Normsage: Multi-lingual Multi-cultural Norm Discovery From Conversations On-the-fly Fung Yi R., Chakraborty Tuhin, Guo Hao, Rambow Owen, Muresan Smaranda, Ji Heng
- WAVPROMPT: Towards Few-shot Spoken Language Understanding With Frozen Language Models Gao Heting, Ni Junrui, Qian Kaizhi, Zhang Yang, Chang Shiyu, Hasegawa-johnson Mark
- Visual Prompt Tuning For Test-time Domain Adaptation Gao Yunhe, Shi Xingjian, Zhu Yi, Wang Hao, Tang Zhiqiang, Zhou Xiong, Li Mu, Metaxas Dimitris N.
- Extensible Prompts For Language Models On Zero-shot Language Style Customization Ge Tao, Hu Jing, Dong Li, Mao Shaoguang, Xia Yan, Wang Xun, Chen Si-qing, Wei Furu
- LVP-M3: Language-aware Visual Prompt For Multilingual Multimodal Machine Translation Guo Hongcheng, Liu Jiaheng, Huang Haoyang, Yang Jian, Li Zhoujun, Zhang Dongdong, Cui Zheng, Wei Furu
- MALM: Mixing Augmented Language Modeling For Zero-shot Machine Translation Gupta Kshitij
- Show, Don't Tell: Demonstrations Outperform Descriptions For Schema-guided Task-oriented Dialogue Gupta Raghav, Lee Harrison, Zhao Jeffrey, Rastogi Abhinav, Cao Yuan, Wu Yonghui
- Thinking About GPT-3 In-context Learning For Biomedical IE? Think Again Gutiérrez Bernal Jiménez, Mcneal Nikolas, Washington Clay, Chen You, Li Lang, Sun Huan, Su Yu
- ORCA: Interpreting Prompted Language Models Via Locating Supporting Data Evidence In The Ocean Of Pretraining Data Han Xiaochuang, Tsvetkov Yulia
- Rethinking With Retrieval: Faithful Large Language Model Inference He Hangfeng, Zhang Hongming, Roth Dan
- Hyperprompt: Prompt-based Task-conditioning Of Transformers He Yun, Zheng Huaixiu Steven, Tay Yi, Gupta Jai, Du Yu, Aribandi Vamsi, Zhao Zhe, Li Yaguang, Chen Zhao, Metzler Donald, Cheng Heng-tze, Chi Ed H.
- Prompt-to-prompt Image Editing With Cross Attention Control Hertz Amir, Mokady Ron, Tenenbaum Jay, Aberman Kfir, Pritch Yael, Cohen-or Daniel
- Demystifying Prompts In Language Models Via Perplexity Estimation Hila Gonen, Srini Iyer, Terra Blevins, Noah A. Smith, Luke Zettlemoyer
- Instruction Induction: From Few Examples To Natural Language Task Descriptions Honovich Or, Shaham Uri, Bowman Samuel R., Levy Omer
- Promptcap: Prompt-guided Task-aware Image Captioning Hu Yushi, Hua Hang, Yang Zhengyuan, Shi Weijia, Smith Noah A, Luo Jiebo
- FPT: Improving Prompt Tuning Efficiency Via Progressive Training Huang Yufei, Qin Yujia, Wang Huadong, Yin Yichun, Sun Maosong, Liu Zhiyuan, Liu Qun
- Welm: A Well-read Pre-trained Language Model For Chinese Hui Su, Xiao Zhou, Houjin Yu, Xiaoyu Shen, Yuwen Chen, Zilin Zhu, Yang Yu, Jie Zhou
- Self-generated In-context Learning: Leveraging Auto-regressive Language Models As A Demonstration Generator Hyuhng Joon Kim, Hyunsoo Cho, Junyeob Kim, Taeuk Kim, Kang Min Yoo, Sang-goo Lee
- From Human Days To Machine Seconds: Automatically Answering And Generating Machine Learning Final Exams Iddo Drori, Sarah J. Zhang, Reece Shuttleworth, Sarah Zhang, Keith Tyser, Zad Chin, Pedro Lantigua, Saisamrit Surbehera, Gregory Hunter, Derek Austin, Leonard Tang, Yann Hicke, Sage Simhon, Sathwik Karnik, Darnell Granberry, Madeleine Udell
- Uniform Complexity For Text Generation Imperial Joseph Marvin, Madabushi Harish Tayyar
- HINT: Hypernetwork Instruction Tuning For Efficient Zero- & Few-shot Generalisation Ivison Hamish, Bhagia Akshita, Wang Yizhong, Hajishirzi Hannaneh, Peters Matthew
- Can Large Language Models Truly Understand Prompts? A Case Study With Negated Prompts Jang Joel, Ye Seonghyeon, Seo Minjoon
- Chain-of-thought Prompting Elicits Reasoning In Large Language Models Jason Wei, Xuezhi Wang, Dale Schuurmans, Maarten Bosma, Brian Ichter, Fei Xia, Ed Chi, Quoc Le, Denny Zhou
- ChatGPT Makes Medicine Easy To Swallow: An Exploratory Case Study On Simplified Radiology Reports Jeblick Katharina, Schachtner Balthasar, Dexl Jakob, Mittermeier Andreas, Stüber Anna Theresa, Topalis Johanna, Weber Tobias, Wesp Philipp, Sabel Bastian, Ricke Jens, Ingrisch Michael
- Visual Prompt Tuning Jia Menglin, Tang Luming, Chen Bor-chun, Cardie Claire, Belongie Serge, Hariharan Bharath, Lim Ser-nam
- Contrastive Learning For Prompt-based Few-shot Language Learners Jian Yiren, Gao Chongyang, Vosoughi Soroush
- Evaluating And Inducing Personality In Pre-trained Language Models Jiang Guangyuan, Xu Manjie, Zhu Song-chun, Han Wenjuan, Zhang Chi, Zhu Yixin
- Pseudo-q: Generating Pseudo Language Queries For Visual Grounding Jiang Haojun, Lin Yuanze, Han Dongchen, Song Shiji, Huang Gao
- From Images To Textual Prompts: Zero-shot VQA With Frozen Large Language Models Jiaxian Guo, Junnan Li, Dongxu Li, Anthony Meng Huat Tiong, Boyang Li, Dacheng Tao, Steven C. H. Hoi
- Large Language Models Can Self-improve Jiaxin Huang, Shixiang Shane Gu, Le Hou, Yuexin Wu, Xuezhi Wang, Hongkun Yu, Jiawei Han
- Self-prompting Large Language Models For Zero-shot Open-domain QA Junlong Li, Jinyuan Wang, Zhuosheng Zhang, Hai Zhao
- Latency Adjustable Transformer Encoder For Language Understanding Kachuee Sajjad, Sharifkhani Mohammad
- Action-gpt: Leveraging Large-scale Language Models For Improved And Generalized Action Generation Kalakonda Sai Shashank, Maheshwari Shubh, Sarvadevabhatla Ravi Kiran
- Decomposed Prompting: A Modular Approach For Solving Complex Tasks Khot Tushar, Trivedi Harsh, Finlayson Matthew, Fu Yao, Richardson Kyle, Clark Peter, Sabharwal Ashish
- Improving Language Model Prompting In Support Of Semi-autonomous Task Learning Kirk James R., Wray Robert E., Lindes Peter, Laird John E.
- Towards The Generation Of Musical Explanations With GPT-3 Krol Stephen James, Llano Maria Teresa, Mccormack Jon
- Validating Large Language Models With Relm Kuchnik Michael, Smith Virginia, Amvrosiadis George
- Co-training Improves Prompt-based Learning For Large Language Models Lang Hunter, Agrawal Monica, Kim Yoon, Sontag David
- Internet-augmented Language Models Through Few-shot Prompting For Open-domain Question Answering Lazaridou Angeliki, Gribovskaya Elena, Stokowiec Wojciech, Grigorev Nikolai
- Dialogcc: An Automated Pipeline For Creating High-quality Multi-modal Dialogue Dataset Lee Young-jun, Ko Byungsoo, Kim Han-gyu, Hyeon Jonghwan, Choi Ho-jin
- Reducing Retraining By Recycling Parameter-efficient Prompts Lester Brian, Yurtsever Joshua, Shakeri Siamak, Constant Noah
- Learning To Transfer Prompts For Text Generation Li Junyi, Tang Tianyi, Nie Jian-yun, Wen Ji-rong, Zhao Wayne Xin
- Personalized Prompt Learning For Explainable Recommendation Li Lei, Zhang Yongfeng, Chen Li
- Explanations From Large Language Models Make Small Reasoners Better Li Shiyang, Chen Jianshu, Shen Yelong, Chen Zhiyu, Zhang Xinlu, Li Zekun, Wang Hong, Qian Jing, Peng Baolin, Mao Yi, Chen Wenhu, Yan Xifeng
- Systematicity In GPT-3's Interpretation Of Novel English Noun Compounds Li Siyan, Carlson Riley, Potts Christopher
- Task-specific Pre-training And Prompt Decomposition For Knowledge Graph Population With Language Models Li Tianyi, Huang Wenyu, Papasarantopoulos Nikos, Vougiouklis Pavlos, Pan Jeff Z.
- Evaluating Psychological Safety Of Large Language Models Li Xingxuan, Li Yutong, Qiu Lin, Joty Shafiq, Bing Lidong
- Bjtu-wechat's Systems For The WMT22 Chat Translation Task Liang Yunlong, Meng Fandong, Xu Jinan, Chen Yufeng, Zhou Jie
- Using Large Language Models To Generate Engaging Captions For Data Visualizations Liew Ashley, Mueller Klaus
- ADAPT: Vision-language Navigation With Modality-aligned Action Prompts Lin Bingqian, Zhu Yi, Chen Zicong, Liang Xiwen, Liu Jianzhuang, Liang Xiaodan
- Contextual Transformer For Offline Meta Reinforcement Learning Lin Runji, Li Ye, Feng Xidong, Zhang Zhaowei, Fung Xian Hong Wu, Zhang Haifeng, Wang Jun, Du Yali, Yang Yaodong
- Deplot: One-shot Visual Language Reasoning By Plot-to-table Translation Liu Fangyu, Eisenschlos Julian Martin, Piccinno Francesco, Krichene Syrine, Pang Chenxi, Lee Kenton, Joshi Mandar, Chen Wenhu, Collier Nigel, Altun Yasemin
- Fl-tuning: Layer Tuning For Feed-forward Network In Transformer Liu Jingping, Song Yuqiu, Xue Kui, Sun Hongli, Wang Chao, Chen Lihan, Jiang Haiyun, Liang Jiaqing, Ruan Tong
- 3DALL-E: Integrating Text-to-image AI In 3D Design Workflows Liu Vivian, Vermeulen Jo, Fitzmaurice George, Matejka Justin
- PSP: Pre-trained Soft Prompts For Few-shot Abstractive Summarization Liu Xiaochen, Gao Yang, Bai Yu, Li Jiawei, Hu Yinan, Huang Heyan, Chen Boxing
- Declaration-based Prompt Tuning For Visual Question Answering Liu Yuhang, Wei Wei, Peng Daowan, Zhu Feida
- Can Large Language Models Reason About Medical Questions? Liévin Valentin, Hother Christoffer Egeberg, Motzfeldt Andreas Geert, Winther Ole
- Prompt Generation Networks For Input-space Adaptation Of Frozen Vision Transformers Loedeman Jochem, Stol Maarten C., Han Tengda, Asano Yuki M.
- Training Language Models To Follow Instructions With Human Feedback Long Ouyang, Jeff Wu, Xu Jiang, Diogo Almeida, Carroll L. Wainwright, Pamela Mishkin, Chong Zhang, Sandhini Agarwal, Katarina Slama, Alex Ray, John Schulman, Jacob Hilton, Fraser Kelton, Luke Miller, Maddie Simens, Amanda Askell, Peter Welinder, Paul Christiano, Jan Leike, Ryan Lowe
- Neuro-symbolic Procedural Planning With Commonsense Prompting Lu Yujie, Feng Weixi, Zhu Wanrong, Xu Wenda, Wang Xin Eric, Eckstein Miguel, Wang William Yang
- PAL: Program-aided Language Models Luyu Gao, Aman Madaan, Shuyan Zhou, Uri Alon, Pengfei Liu, Yiming Yang, Jamie Callan, Graham Neubig
- Memory-assisted Prompt Editing To Improve GPT-3 After Deployment Madaan Aman, Tandon Niket, Clark Peter, Yang Yiming
- Structured Like A Language Model: Analysing AI As An Automated Subject Magee Liam, Arora Vanicka, Munn Luke
- Teaching Small Language Models To Reason Magister Lucie Charlotte, Mallinson Jonathan, Adamek Jakub, Malmi Eric, Severyn Aliaksei
- PERFECT: Prompt-free And Efficient Few-shot Learning With Language Models Mahabadi Rabeeh Karimi, Zettlemoyer Luke, Henderson James, Saeidi Marzieh, Mathias Lambert, Stoyanov Veselin, Yazdani Majid
- Chatbots In A Botnet World Mckee Forrest, Noever David
- Grounding Language With Visual Affordances Over Unstructured Data Mees Oier, Borja-diaz Jessica, Burgard Wolfram
- ZEROTOP: Zero-shot Task-oriented Semantic Parsing Using Large Language Models Mekala Dheeraj, Wolfe Jason, Roy Subhro
- Generating Training Data With Language Models: Towards Zero-shot Language Understanding Meng Yu, Huang Jiaxin, Zhang Yu, Han Jiawei
- Tuning Language Models As Training Data Generators For Augmentation-enhanced Few-shot Learning Meng Yu, Michalski Martin, Huang Jiaxin, Zhang Yu, Abdelzaher Tarek, Han Jiawei
- Rethinking The Role Of Demonstrations: What Makes In-context Learning Work? Min Sewon, Lyu Xinxi, Holtzman Ari, Artetxe Mikel, Lewis Mike, Hajishirzi Hannaneh, Zettlemoyer Luke
- HELP ME THINK: A Simple Prompting Strategy For Non-experts To Create Customized Content With Models Mishra Swaroop, Nouri Elnaz
- SGPT: GPT Sentence Embeddings For Semantic Search Muennighoff Niklas
- Large Language Models Are Reasoning Teachers Namgyu Ho, Laura Schmid, Se-young Yun
- Crosslingual Generalization Through Multitask Finetuning Niklas Muennighoff, Thomas Wang, Lintang Sutawika, Adam Roberts, Stella Biderman, Teven Le Scao, M Saiful Bari, Sheng Shen, Zheng-xin Yong, Hailey Schoelkopf, Xiangru Tang, Dragomir Radev, Alham Fikri Aji, Khalid Almubarak, Samuel Albanie, Zaid Alyafeai, Albert Webson, Edward Raff, Colin Raffel
- The Turing Deception Noever David, Ciolino Matt
- Measuring And Narrowing The Compositionality Gap In Language Models Ofir Press, Muru Zhang, Sewon Min, Ludwig Schmidt, Noah A. Smith, Mike Lewis
- Thinksum: Probabilistic Reasoning Over Sets Using Large Language Models Ozturkler Batu, Malkin Nikolay, Wang Zhen, Jojic Nebojsa
- Dynamic Prompt Learning Via Policy Gradient For Semi-structured Mathematical Reasoning Pan Lu, Liang Qiu, Kai-wei Chang, Ying Nian Wu, Song-chun Zhu, Tanmay Rajpurohit, Peter Clark, Ashwin Kalyan
- Bidirectional Language Models Are Also Few-shot Learners Patel Ajay, Li Bryan, Rasooli Mohammad Sadegh, Constant Noah, Raffel Colin, Callison-burch Chris
- Holistic Evaluation Of Language Models Percy Liang, Rishi Bommasani, Tony Lee, Dimitris Tsipras, Dilara Soylu, Michihiro Yasunaga, Yian Zhang, Deepak Narayanan, Yuhuai Wu, Ananya Kumar, Benjamin Newman, Binhang Yuan, Bobby Yan, Ce Zhang, Christian Cosgrove, Christopher D. Manning, Christopher Ré, Diana Acosta-navas, Drew A. Hudson, Eric Zelikman, Esin Durmus, Faisal Ladhak, Frieda Rong, Hongyu Ren, Huaxiu Yao, Jue Wang, Keshav Santhanam, Laurel Orr, Lucia Zheng, Mert Yuksekgonul, Mirac Suzgun, Nathan Kim, Neel Guha, Niladri Chatterji, Omar Khattab, Peter Henderson, Qian Huang, Ryan Chi, Sang Michael Xie, Shibani Santurkar, Surya Ganguli, Tatsunori Hashimoto, Thomas Icard, Tianyi Zhang, Vishrav Chaudhary, William Wang, Xuechen Li, Yifan Mai, Yuhui Zhang, Yuta Koreeda
- Ignore Previous Prompt: Attack Techniques For Language Models Perez Fábio, Ribeiro Ian
- Grips: Gradient-free, Edit-based Instruction Search For Prompting Large Language Models Prasad Archiki, Hase Peter, Zhou Xiang, Bansal Mohit
- Psychologically-informed Chain-of-thought Prompts For Metaphor Understanding In Large Language Models Prystawski Ben, Thibodeau Paul, Potts Christopher, Goodman Noah D.
- ELLE: Efficient Lifelong Pre-training For Emerging Data Qin Yujia, Zhang Jiajie, Lin Yankai, Liu Zhiyuan, Li Peng, Sun Maosong, Zhou Jie
- A Survey On In-context Learning Qingxiu Dong, Lei Li, Damai Dai, Ce Zheng, Jingyuan Ma, Rui Li, Heming Xia, Jingjing Xu, Zhiyong Wu, Baobao Chang, Xu Sun, Lei Li, Zhifang Sui
- Parallel Context Windows For Large Language Models Ratner Nir, Levine Yoav, Belinkov Yonatan, Ram Ori, Magar Inbal, Abend Omri, Karpas Ehud, Shashua Amnon, Leyton-brown Kevin, Shoham Yoav
- Super-prompting: Utilizing Model-independent Contextual Data To Reduce Data Annotation Required In Visual Commonsense Tasks Rezaei Navid, Reformat Marek Z.
- Leveraging Large Language Models For Multiple Choice Question Answering Robinson Joshua, Rytting Christopher Michael, Wingate David
- LINGUIST: Language Model Instruction Tuning To Generate Annotated Utterances For Intent Classification And Slot Tagging Rosenbaum Andy, Soltan Saleh, Hamza Wael, Versley Yannick, Boese Markus
- Benchclamp: A Benchmark For Evaluating Language Models On Syntactic And Semantic Parsing Roy Subhro, Thomson Sam, Chen Tongfei, Shin Richard, Pauls Adam, Eisner Jason, Van Durme Benjamin
- Data Augmentation For Intent Classification With Off-the-shelf Large Language Models Sahu Gaurav, Rodriguez Pau, Laradji Issam H., Atighehchian Parmida, Vazquez David, Bahdanau Dzmitry
- Unpacking Large Language Models With Conceptual Consistency Sahu Pritish, Cogswell Michael, Gong Yunye, Divakaran Ajay
- Knowledge Prompts: Injecting World Knowledge Into Language Models Through Soft Prompts Santos Cicero Nogueira Dos, Dong Zhe, Cer Daniel, Nham John, Shakeri Siamak, Ni Jianmo, Sung Yun-hsuan
- Language Models Are Greedy Reasoners: A Systematic Formal Analysis Of Chain-of-thought Saparov Abulhair, He He
- On The Effect Of Pretraining Corpora On In-context Learning By A Large-scale Language Model Seongjin Shin, Sang-woo Lee, Hwijeen Ahn, Sungdong Kim, Hyoungseok Kim, Boseop Kim, Kyunghyun Cho, Gichang Lee, Woomyoung Park, Jung-woo Ha, Nako Sung
- On Second Thought, Let's Not Think Step By Step! Bias And Toxicity In Zero-shot Reasoning Shaikh Omar, Zhang Hongxin, Held William, Bernstein Michael, Yang Diyi
- XRICL: Cross-lingual Retrieval-augmented In-context Learning For Cross-lingual Text-to-SQL Semantic Parsing Shi Peng, Zhang Rui, Bai He, Lin Jimmy
- Repository-level Prompt Generation For Large Language Models Of Code Shrivastava Disha, Larochelle Hugo, Tarlow Daniel
- React: Synergizing Reasoning And Acting In Language Models Shunyu Yao, Jeffrey Zhao, Dian Yu, Nan Du, Izhak Shafran, Karthik Narasimhan, Yuan Cao
- Reasoning With Language Model Prompting: A Survey Shuofei Qiao, Yixin Ou, Ningyu Zhang, Xiang Chen, Yunzhi Yao, Shumin Deng, Chuanqi Tan, Fei Huang, Huajun Chen
- Language Models That Seek For Knowledge: Modular Search & Generation For Dialogue And Prompt Completion Shuster Kurt, Komeili Mojtaba, Adolphs Leonard, Roller Stephen, Szlam Arthur, Weston Jason
- Ask Me Anything: A Simple Strategy For Prompting Language Models Simran Arora, Avanika Narayan, Mayee F. Chen, Laurel Orr, Neel Guha, Kush Bhatia, Ines Chami, Frederic Sala, Christopher Ré
- Explaining Patterns In Data With Language Models Via Interpretable Autoprompting Singh Chandan, Morris John X., Aneja Jyoti, Rush Alexander M., Gao Jianfeng
- Progprompt: Generating Situated Robot Task Plans Using Large Language Models Singh Ishika, Blukis Valts, Mousavian Arsalan, Goyal Ankit, Xu Danfei, Tremblay Jonathan, Fox Dieter, Thomason Jesse, Garg Animesh
- Large Language Models Encode Clinical Knowledge Singhal Karan, Azizi Shekoofeh, Tu Tao, Mahdavi S. Sara, Wei Jason, Chung Hyung Won, Scales Nathan, Tanwani Ajay, Cole-lewis Heather, Pfohl Stephen, Payne Perry, Seneviratne Martin, Gamble Paul, Kelly Chris, Scharli Nathaneal, Chowdhery Aakanksha, Mansfield Philip, Arcas Blaise Aguera Y, Webster Dale, Corrado Greg S., Matias Yossi, Chou Katherine, Gottweis Juraj, Tomasev Nenad, Liu Yun, Rajkomar Alvin, Barral Joelle, Semturs Christopher, Karthikesalingam Alan, Natarajan Vivek
- Events Realm: Event Reasoning Of Entity States Via Language Models Spiliopoulou Evangelia, Pagnoni Artidoro, Bisk Yonatan, Hovy Eduard
- OPT-IML: Scaling Language Model Instruction Meta Learning Through The Lens Of Generalization Srinivasan Iyer, Xi Victoria Lin, Ramakanth Pasunuru, Todor Mihaylov, Daniel Simig, Ping Yu, Kurt Shuster, Tianlu Wang, Qing Liu, Punit Singh Koura, Xian Li, Brian O'horo, Gabriel Pereyra, Jeff Wang, Christopher Dewan, Asli Celikyilmaz, Luke Zettlemoyer, Ves Stoyanov
- Beyond The Imitation Game: Quantifying And Extrapolating The Capabilities Of Language Models Srivastava Aarohi, Rastogi Abhinav, Rao Abhishek, Shoeb Abu Awal Md, Abid Abubakar, Fisch Adam, Brown Adam R., Santoro Adam, Gupta Aditya, Garriga-alonso Adrià, Kluska Agnieszka, Lewkowycz Aitor, Agarwal Akshat, Power Alethea, Ray Alex, Warstadt Alex, Kocurek Alexander W., Safaya Ali, Tazarv Ali, Xiang Alice, Parrish Alicia, Nie Allen, Hussain Aman, Askell Amanda, Dsouza Amanda, Slone Ambrose, Rahane Ameet, Iyer Anantharaman S., Andreassen Anders, Madotto Andrea, Santilli Andrea, Stuhlmüller Andreas, Dai Andrew, La Andrew, Lampinen Andrew, Zou Andy, Jiang Angela, Chen Angelica, Vuong Anh, Gupta Animesh, Gottardi Anna, Norelli Antonio, Venkatesh Anu, Gholamidavoodi Arash, Tabassum Arfa, Menezes Arul, Kirubarajan Arun, Mullokandov Asher, Sabharwal Ashish, Herrick Austin, Efrat Avia, Erdem Aykut, Karakaş Ayla, Roberts B. Ryan, Loe Bao Sheng, Zoph Barret, Bojanowski Bartłomiej, Özyurt Batuhan, Hedayatnia Behnam, Neyshabur Behnam, Inden Benjamin, Stein Benno, Ekmekci Berk, Lin Bill Yuchen, Howald Blake, Orinion Bryan, Diao Cameron, Dour Cameron, Stinson Catherine, Argueta Cedrick, Ramírez César Ferri, Singh Chandan, Rathkopf Charles, Meng Chenlin, Baral Chitta, Wu Chiyu, Callison-burch Chris, Waites Chris, Voigt Christian, Manning Christopher D., Potts Christopher, Ramirez Cindy, Rivera Clara E., Siro Clemencia, Raffel Colin, Ashcraft Courtney, Garbacea Cristina, Sileo Damien, Garrette Dan, Hendrycks Dan, Kilman Dan, Roth Dan, Freeman Daniel, Khashabi Daniel, Levy Daniel, González Daniel Moseguí, Perszyk Danielle, Hernandez Danny, Chen Danqi, Ippolito Daphne, Gilboa Dar, Dohan David, Drakard David, Jurgens David, Datta Debajyoti, Ganguli Deep, Emelin Denis, Kleyko Denis, Yuret Deniz, Chen Derek, Tam Derek, Hupkes Dieuwke, Misra Diganta, Buzan Dilyar, Mollo Dimitri Coelho, Yang Diyi, Lee Dong-ho, Schrader Dylan, Shutova Ekaterina, Cubuk Ekin Dogus, Segal Elad, Hagerman Eleanor, Barnes Elizabeth, Donoway Elizabeth, Pavlick Ellie, Rodola Emanuele, Lam Emma, Chu Eric, Tang Eric, Erdem Erkut, Chang Ernie, Chi Ethan A., Dyer Ethan, Jerzak Ethan, Kim Ethan, Manyasi Eunice Engefu, Zheltonozhskii Evgenii, Xia Fanyue, Siar Fatemeh, Martínez-plumed Fernando, Happé Francesca, Chollet Francois, Rong Frieda, Mishra Gaurav, Winata Genta Indra, De Melo Gerard, Kruszewski Germán, Parascandolo Giambattista, Mariani Giorgio, Wang Gloria, Jaimovitch-lópez Gonzalo, Betz Gregor, Gur-ari Guy, Galijasevic Hana, Kim Hannah, Rashkin Hannah, Hajishirzi Hannaneh, Mehta Harsh, Bogar Hayden, Shevlin Henry, Schütze Hinrich, Yakura Hiromu, Zhang Hongming, Wong Hugh Mee, Ng Ian, Noble Isaac, Jumelet Jaap, Geissinger Jack, Kernion Jackson, Hilton Jacob, Lee Jaehoon, Fisac Jaime Fernández, Simon James B., Koppel James, Zheng James, Zou James, Kocoń Jan, Thompson Jana, Wingfield Janelle, Kaplan Jared, Radom Jarema, Sohl-dickstein Jascha, Phang Jason, Wei Jason, Yosinski Jason, Novikova Jekaterina, Bosscher Jelle, Marsh Jennifer, Kim Jeremy, Taal Jeroen, Engel Jesse, Alabi Jesujoba, Xu Jiacheng, Song Jiaming, Tang Jillian, Waweru Joan, Burden John, Miller John, Balis John U., Batchelder Jonathan, Berant Jonathan, Frohberg Jörg, Rozen Jos, Hernandez-orallo Jose, Boudeman Joseph, Guerr Joseph, Jones Joseph, Tenenbaum Joshua B., Rule Joshua S., Chua Joyce, Kanclerz Kamil, Livescu Karen, Krauth Karl, Gopalakrishnan Karthik, Ignatyeva Katerina, Markert Katja, Dhole Kaustubh D., Gimpel Kevin, Omondi Kevin, Mathewson Kory, Chiafullo Kristen, Shkaruta Ksenia, Shridhar Kumar, Mcdonell Kyle, Richardson Kyle, Reynolds Laria, Gao Leo, Zhang Li, Dugan Liam, Qin Lianhui, Contreras-ochando Lidia, Morency Louis-philippe, Moschella Luca, Lam Lucas, Noble Lucy, Schmidt Ludwig, He Luheng, Colón Luis Oliveros, Metz Luke, Şenel Lütfi Kerem, Bosma Maarten, Sap Maarten, Ter Hoeve Maartje, Farooqi Maheen, Faruqui Manaal, Mazeika Mantas, Baturan Marco, Marelli Marco, Maru Marco, Quintana Maria Jose Ramírez, Tolkiehn Marie, Giulianelli Mario, Lewis Martha, Potthast Martin, Leavitt Matthew L., Hagen Matthias, Schubert Mátyás, Baitemirova Medina Orduna, Arnaud Melody, Mcelrath Melvin, Yee Michael A., Cohen Michael, Gu Michael, Ivanitskiy Michael, Starritt Michael, Strube Michael, Swędrowski Michał, Bevilacqua Michele, Yasunaga Michihiro, Kale Mihir, Cain Mike, Xu Mimee, Suzgun Mirac, Walker Mitch, Tiwari Mo, Bansal Mohit, Aminnaseri Moin, Geva Mor, Gheini Mozhdeh, T Mukund Varma, Peng Nanyun, Chi Nathan A., Lee Nayeon, Krakover Neta Gur-ari, Cameron Nicholas, Roberts Nicholas, Doiron Nick, Martinez Nicole, Nangia Nikita, Deckers Niklas, Muennighoff Niklas, Keskar Nitish Shirish, Iyer Niveditha S., Constant Noah, Fiedel Noah, Wen Nuan, Zhang Oliver, Agha Omar, Elbaghdadi Omar, Levy Omer, Evans Owain, Casares Pablo Antonio Moreno, Doshi Parth, Fung Pascale, Liang Paul Pu, Vicol Paul, Alipoormolabashi Pegah, Liao Peiyuan, Liang Percy, Chang Peter, Eckersley Peter, Htut Phu Mon, Hwang Pinyu, Miłkowski Piotr, Patil Piyush, Pezeshkpour Pouya, Oli Priti, Mei Qiaozhu, Lyu Qing, Chen Qinlang, Banjade Rabin, Rudolph Rachel Etta, Gabriel Raefer, Habacker Rahel, Risco Ramon, Millière Raphaël, Garg Rhythm, Barnes Richard, Saurous Rif A., Arakawa Riku, Raymaekers Robbe, Frank Robert, Sikand Rohan, Novak Roman, Sitelew Roman, Lebras Ronan, Liu Rosanne, Jacobs Rowan, Zhang Rui, Salakhutdinov Ruslan, Chi Ryan, Lee Ryan, Stovall Ryan, Teehan Ryan, Yang Rylan, Singh Sahib, Mohammad Saif M., Anand Sajant, Dillavou Sam, Shleifer Sam, Wiseman Sam, Gruetter Samuel, Bowman Samuel R., Schoenholz Samuel S., Han Sanghyun, Kwatra Sanjeev, Rous Sarah A., Ghazarian Sarik, Ghosh Sayan, Casey Sean, Bischoff Sebastian, Gehrmann Sebastian, Schuster Sebastian, Sadeghi Sepideh, Hamdan Shadi, Zhou Sharon, Srivastava Shashank, Shi Sherry, Singh Shikhar, Asaadi Shima, Gu Shixiang Shane, Pachchigar Shubh, Toshniwal Shubham, Upadhyay Shyam, Debnath Shyamolima (Shammie), Shakeri Siamak, Thormeyer Simon, Melzi Simone, Reddy Siva, Makini Sneha Priscilla, Lee Soo-hwan, Torene Spencer, Hatwar Sriharsha, Dehaene Stanislas, Divic Stefan, Ermon Stefano, Biderman Stella, Lin Stephanie, Prasad Stephen, Piantadosi Steven T., Shieber Stuart M., Misherghi Summer, Kiritchenko Svetlana, Mishra Swaroop, Linzen Tal, Schuster Tal, Li Tao, Yu Tao, Ali Tariq, Hashimoto Tatsu, Wu Te-lin, Desbordes Théo, Rothschild Theodore, Phan Thomas, Wang Tianle, Nkinyili Tiberius, Schick Timo, Kornev Timofei, Tunduny Titus, Gerstenberg Tobias, Chang Trenton, Neeraj Trishala, Khot Tushar, Shultz Tyler, Shaham Uri, Misra Vedant, Demberg Vera, Nyamai Victoria, Raunak Vikas, Ramasesh Vinay, Prabhu Vinay Uday, Padmakumar Vishakh, Srikumar Vivek, Fedus William, Saunders William, Zhang William, Vossen Wout, Ren Xiang, Tong Xiaoyu, Zhao Xinran, Wu Xinyi, Shen Xudong, Yaghoobzadeh Yadollah, Lakretz Yair, Song Yangqiu, Bahri Yasaman, Choi Yejin, Yang Yichi, Hao Yiding, Chen Yifu, Belinkov Yonatan, Hou Yu, Hou Yufang, Bai Yuntao, Seid Zachary, Zhao Zhuoye, Wang Zijian, Wang Zijie J., Wang Zirui, Wu Ziyi
- Towards Zero-shot And Few-shot Table Question Answering Using GPT-3 Srivastava Pragya, Ganu Tanuja, Guha Saikat
- Towards Diverse, Relevant And Coherent Open-domain Dialogue Generation Via Hybrid Latent Variables Sun Bin, Li Yitong, Mi Fei, Wang Weichao, Li Yiwei, Li Kan
- Black-box Tuning For Language-model-as-a-service Sun Tianxiang, Shao Yunfan, Qian Hong, Huang Xuanjing, Qiu Xipeng
- Dynamar: Dynamic Prompt With Mask Token Representation Sun Xiaodi, Rajagopalan Sunny, Nigam Priyanka, Lu Weiyi, Xu Yi, Zeng Belinda, Chilimbi Trishul
- Large Language Models Are Zero-shot Reasoners Takeshi Kojima, Shixiang Shane Gu, Machel Reid, Yutaka Matsuo, Yusuke Iwasawa
- Fewer Errors, But More Stereotypes? The Effect Of Model Size On Gender Bias Tal Yarden, Magar Inbal, Schwartz Roy
- News Summarization And Evaluation In The Era Of GPT-3 Tanya Goyal, Junyi Jessy Li, Greg Durrett
- Interleaving Retrieval With Chain-of-thought Reasoning For Knowledge-intensive Multi-step Questions Trivedi Harsh, Balasubramanian Niranjan, Khot Tushar, Sabharwal Ashish
- Efficient Few-shot Learning Without Prompts Tunstall Lewis, Reimers Nils, Jo Unso Eun Seo, Bates Luke, Korat Daniel, Wasserblat Moshe, Pereg Oren
- Will It Blend? Mixing Training Paradigms & Prompting For Argument Quality Prediction Van Der Meer Michiel, Reuver Myrthe, Khurana Urja, Krause Lea, Santamaría Selene Báez
- Prompting PaLM For Translation: Assessing Strategies And Performance Vilar David, Freitag Markus, Cherry Colin, Luo Jiaming, Ratnakar Viresh, Foster George
- Enabling Conversational Interaction With Mobile UI Using Large Language Models Wang Bryan, Li Gang, Li Yang
- Learning To Decompose Visual Features With Latent Textual Prompts Wang Feng, Li Manling, Lin Xudong, Lv Hairong, Schwing Alexander G., Ji Heng
- Knowledge Prompting In Pre-trained Language Model For Natural Language Understanding Wang Jianing, Huang Wenkang, Shi Qiuhui, Wang Hongbin, Qiu Minghui, Li Xiang, Gao Ming
- Hybrid-regressive Neural Machine Translation Wang Qiang, Hu Xinhui, Chen Ming
- What Language Model Architecture And Pretraining Objective Work Best For Zero-shot Generalization? Wang Thomas, Roberts Adam, Hesslow Daniel, Scao Teven Le, Chung Hyung Won, Beltagy Iz, Launay Julien, Raffel Colin
- Qrelscore: Better Evaluating Generated Questions With Deeper Understanding Of Context-aware Relevance Wang Xiaoqiang, Liu Bang, Tang Siliang, Wu Lingfei
- Finding Skill Neurons In Pre-trained Transformer-based Language Models Wang Xiaozhi, Wen Kaiyue, Zhang Zhengyan, Hou Lei, Liu Zhiyuan, Li Juanzi
- Code4struct: Code Generation For Few-shot Event Structure Prediction Wang Xingyao, Li Sha, Ji Heng
- S-prompts Learning With Pre-trained Transformers: An Occam's Razor For Domain Incremental Learning Wang Yabin, Huang Zhiwu, Hong Xiaopeng
- Controlling Styles In Neural Machine Translation With Activation Prompt Wang Yifan, Sun Zewei, Cheng Shanbo, Zheng Weiguo, Wang Mingxuan
- Two-stage LLM Fine-tuning With Less Specialization And More Generalization Wang Yihan, Si Si, Li Daliang, Lukasik Michal, Yu Felix, Hsieh Cho-jui, Dhillon Inderjit S, Kumar Sanjiv
- Language Models With Image Descriptors Are Strong Few-shot Video-language Learners Wang Zhenhailong, Li Manling, Xu Ruochen, Zhou Luowei, Lei Jie, Lin Xudong, Wang Shuohang, Yang Ziyi, Zhu Chenguang, Hoiem Derek, Chang Shih-fu, Bansal Mohit, Ji Heng
- Large Language Models Are Few(1)-shot Table Reasoners Wenhu Chen
- Program Of Thoughts Prompting: Disentangling Computation From Reasoning For Numerical Reasoning Tasks Wenhu Chen, Xueguang Ma, Xinyi Wang, William W. Cohen
- Language Models As Zero-shot Planners: Extracting Actionable Knowledge For Embodied Agents Wenlong Huang, Pieter Abbeel, Deepak Pathak, Igor Mordatch
- Mixed-effects Transformers For Hierarchical Adaptation White Julia, Goodman Noah, Hawkins Robert
- The Unreliability Of Explanations In Few-shot Prompting For Textual Reasoning Xi Ye, Greg Durrett
- Towards Unified Conversational Recommender Systems Via Knowledge-enhanced Prompt Learning Xiaolei Wang, Kun Zhou, Ji-rong Wen, Wayne Xin Zhao
- Dual Modality Prompt Tuning For Vision-language Pre-trained Model Xing Yinghui, Wu Qirui, Cheng De, Zhang Shizhou, Liang Guoqiang, Wang Peng, Zhang Yanning
- Autoqgs: Auto-prompt For Low-resource Knowledge-based Question Generation From SPARQL Xiong Guanming, Bao Junwei, Zhao Wen, Wu Youzheng, He Xiaodong
- Go-tuning: Improving Zero-shot Learning Abilities Of Smaller Language Models Xu Jingjing, Dong Qingxiu, Liu Hongyi, Li Lei
- Rationale-augmented Ensembles In Language Models Xuezhi Wang, Jason Wei, Dale Schuurmans, Quoc Le, Ed Chi, Denny Zhou
- Self-consistency Improves Chain Of Thought Reasoning In Language Models Xuezhi Wang, Jason Wei, Dale Schuurmans, Quoc Le, Ed Chi, Sharan Narang, Aakanksha Chowdhery, Denny Zhou
- CLIP Also Understands Text: Prompting CLIP For Phrase Understanding Yan An, Li Jiacheng, Zhu Wanrong, Lu Yujie, Wang William Yang, Mcauley Julian
- What GPT Knows About Who Is Who Yang Xiaohan, Peynetti Eduardo, Meerman Vasco, Tanner Chris
- Prompt Tuning For Discriminative Pre-trained Language Models Yao Yuan, Dong Bowen, Zhang Ao, Zhang Zhengyan, Xie Ruobing, Liu Zhiyuan, Lin Leyu, Sun Maosong, Wang Jianyong
- Language Models Are General-purpose Interfaces Yaru Hao, Haoyu Song, Li Dong, Shaohan Huang, Zewen Chi, Wenhui Wang, Shuming Ma, Furu Wei
- Structured Prompting: Scaling In-context Learning To 1,000 Examples Yaru Hao, Yutao Sun, Li Dong, Zhixiong Han, Yuxian Gu, Furu Wei
- Retrieval-augmented Multimodal Language Modeling Yasunaga Michihiro, Aghajanyan Armen, Shi Weijia, James Rich, Leskovec Jure, Liang Percy, Lewis Mike, Zettlemoyer Luke, Yih Wen-tau
- Complementary Explanations For Effective In-context Learning Ye Xi, Iyer Srinivasan, Celikyilmaz Asli, Stoyanov Ves, Durrett Greg, Pasunuru Ramakanth
- UL2: Unifying Language Learning Paradigms Yi Tay, Mostafa Dehghani, Vinh Q. Tran, Xavier Garcia, Jason Wei, Xuezhi Wang, Hyung Won Chung, Siamak Shakeri, Dara Bahri, Tal Schuster, Huaixiu Steven Zheng, Denny Zhou, Neil Houlsby, Donald Metzler
- Making Large Language Models Better Reasoners With Step-aware Verifier Yifei Li, Zeqi Lin, Shizhuo Zhang, Qiang Fu, Bei Chen, Jian-guang Lou, Weizhu Chen
- Active Example Selection For In-context Learning Yiming Zhang, Shi Feng, Chenhao Tan
- Large Language Models Are Better Reasoners With Self-verification Yixuan Weng, Minjun Zhu, Fei Xia, Bin Li, Shizhu He, Shengping Liu, Bin Sun, Kang Liu, Jun Zhao
- BLOOM+1: Adding Language Support To BLOOM For Zero-shot Prompting Yong Zheng-xin, Schoelkopf Hailey, Muennighoff Niklas, Aji Alham Fikri, Adelani David Ifeoluwa, Almubarak Khalid, Bari M Saiful, Sutawika Lintang, Kasai Jungo, Baruwa Ahmed, Winata Genta Indra, Biderman Stella, Raff Edward, Radev Dragomir, Nikoulina Vassilina
- Scaling Autoregressive Models For Content-rich Text-to-image Generation Yu Jiahui, Xu Yuanzhong, Koh Jing Yu, Luong Thang, Baid Gunjan, Wang Zirui, Vasudevan Vijay, Ku Alexander, Yang Yinfei, Ayan Burcu Karagol, Hutchinson Ben, Han Wei, Parekh Zarana, Li Xin, Zhang Han, Baldridge Jason, Wu Yonghui
- Efficient Language Modeling With Sparse All-MLP Yu Ping, Artetxe Mikel, Ott Myle, Shleifer Sam, Gong Hongyu, Stoyanov Ves, Li Xian
- Selecting Better Samples From Pre-trained LLMs: A Case Study On Question Generation Yuan Xingdi, Wang Tong, Wang Yen-hsiang, Fine Emery, Abdelghani Rania, Lucas Pauline, Sauzéon Hélène, Oudeyer Pierre-yves
- Self-adaptive In-context Learning: An Information Compression Perspective For In-context Example Selection And Ordering Zhiyong Wu, Yaoxiang Wang, Jiacheng Ye, Lingpeng Kong
- Teaching Algorithmic Reasoning Via In-context Learning Zhou Hattie, Nova Azade, Larochelle Hugo, Courville Aaron, Neyshabur Behnam, Sedghi Hanie
- Large Language Models Are Human-level Prompt Engineers Zhou Yongchao, Muresanu Andrei Ioan, Han Ziwen, Paster Keiran, Pitis Silviu, Chan Harris, Ba Jimmy
- Binding Language Models In Symbolic Languages Zhoujun Cheng, Tianbao Xie, Peng Shi, Chengzu Li, Rahul Nadkarni, Yushi Hu, Caiming Xiong, Dragomir Radev, Mari Ostendorf, Luke Zettlemoyer, Noah A. Smith, Tao Yu
- Automatic Chain Of Thought Prompting In Large Language Models Zhuosheng Zhang, Aston Zhang, Mu Li, Alex Smola
- LM-CPPF: Paraphrasing-guided Data Augmentation For Contrastive Prompt-based Few-shot Fine-tuning Abaskohi Amirhossein, Rothe Sascha, Yaghoobzadeh Yadollah
- Let The LLMs Talk: Simulating Human-to-human Conversational QA Via Zero-shot LLM-to-LLM Interactions Abbasiantaeb Zahra, Yuan Yifei, Kanoulas Evangelos, Aliannejadi Mohammad
- I Was Blind But Now I See: Implementing Vision-enabled Dialogue In Social Robots Abbo Giulio Antonio, Belpaeme Tony
- Generator-retriever-generator Approach For Open-domain Question Answering Abdallah Abdelrahman, Jatowt Adam
- LMRL Gym: Benchmarks For Multi-turn Reinforcement Learning With Language Models Abdulhai Marwa, White Isadora, Snell Charlie, Sun Charles, Hong Joey, Zhai Yuexiang, Xu Kelvin, Levine Sergey
- Beyond Traditional Teaching: The Potential Of Large Language Models And Chatbots In Graduate Engineering Education Abedi Mahyar, Alshybani Ibrahem, Shahadat Muhammad Rubayat Bin, Murillo Michael S.
- Inpars Toolkit: A Unified And Reproducible Synthetic Data Generation Pipeline For Neural Information Retrieval Abonizio Hugo, Bonifacio Luiz, Jeronymo Vitor, Lotufo Roberto, Zavrel Jakub, Nogueira Rodrigo
- On Codex Prompt Engineering For OCL Generation: An Empirical Study Abukhalaf Seif, Hamdaqa Mohammad, Khomh Foutse
- On The Relationship Between Skill Neurons And Robustness In Prompt Tuning Ackermann Leon, Ohmer Xenia
- Multi-party Goal Tracking With LLMs: Comparing Pre-training, Fine-tuning, And Prompt Engineering Addlesee Angus, Sieińska Weronika, Gunson Nancie, Garcia Daniel Hernández, Dondrup Christian, Lemon Oliver
- The ADAIO System At The BEA-2023 Shared Task On Generating AI Teacher Responses In Educational Dialogues Adigwe Adaeze, Yuan Zheng
- RL4F: Generating Natural Language Feedback With Reinforcement Learning For Repairing Model Outputs Afra Feyza Akyürek, Ekin Akyürek, Aman Madaan, Ashwin Kalyan, Peter Clark, Derry Wijaya, Niket Tandon
- Bring Your Own KG: Self-supervised Program Synthesis For Zero-shot KGQA Agarwal Dhruv, Das Rajarshi, Khosla Sopan, Gangadharaiah Rashmi
- Can Large Language Models Be Good Path Planners? A Benchmark And Investigation On Spatial-temporal Reasoning Aghzal Mohamed, Plaku Erion, Yao Ziyu
- SARATHI: Efficient LLM Inference By Piggybacking Decodes With Chunked Prefills Agrawal Amey, Panwar Ashish, Mohan Jayashree, Kwatra Nipun, Gulavani Bhargav S., Ramjee Ramachandran
- Better Patching Using LLM Prompting, Via Self-consistency Ahmed Toufique, Devanbu Premkumar
- Instructeval: Systematic Evaluation Of Instruction Selection Methods Ajith Anirudh, Pan Chris, Xia Mengzhou, Deshpande Ameet, Narasimhan Karthik
- Characterizing Attribution And Fluency Tradeoffs For Retrieval-augmented Large Language Models Aksitov Renat, Chang Chung-ching, Reitter David, Shakeri Siamak, Sung Yunhsuan
- Rest Meets React: Self-improvement For Multi-step Reasoning LLM Agent Aksitov Renat, Miryoosefi Sobhan, Li Zonglin, Li Daliang, Babayan Sheila, Kopparapu Kavya, Fisher Zachary, Guo Ruiqi, Prakash Sushant, Srinivasan Pranesh, Zaheer Manzil, Yu Felix, Kumar Sanjiv
- CESAR: Automatic Induction Of Compositional Instructions For Multi-turn Dialogs Aksu Taha, Hazarika Devamanyu, Mehri Shikib, Kim Seokhwan, Hakkani-tür Dilek, Liu Yang, Namazifar Mahdi
- Can Language Models Employ The Socratic Method? Experiments With Code Debugging Al-hossami Erfan, Bunescu Razvan, Smith Justin, Teehan Ryan
- OPT-R: Exploring The Role Of Explanations In Finetuning And Prompting For Reasoning Skills Of Large Language Models Alkhamissi Badr, Verma Siddharth, Yu Ping, Jin Zhijing, Celikyilmaz Asli, Diab Mona
- Physics Of Language Models: Part 3.2, Knowledge Manipulation Allen-zhu Zeyuan, Li Yuanzhi
- Detecting Language Model Attacks With Perplexity Alon Gabriel, Kamfonas Michael
- Steering Large Language Models For Machine Translation With Finetuning And In-context Learning Alves Duarte M., Guerreiro Nuno M., Alves João, Pombal José, Rei Ricardo, De Souza José G. C., Colombo Pierre, Martins André F. T.
- Generative AI Perceptions: A Survey To Measure The Perceptions Of Faculty, Staff, And Students On Generative AI Tools In Academia Amani Sara, White Lance, Balart Trini, Arora Laksha, Shryock Kristi J., Brumbelow Kelly, Watson Karan L.
- Chatgpt-3.5, Chatgpt-4, Google Bard, And Microsoft Bing To Improve Health Literacy And Communication In Pediatric Populations And Beyond Amin Kanhai S., Mayes Linda, Khosla Pavan, Doshi Rushabh
- Prompt-engineering And Transformer-based Question Generation And Evaluation Amyeen Rubaba
- Openleaf: Open-domain Interleaved Image-text Generation And Evaluation An Jie, Yang Zhengyuan, Li Linjie, Wang Jianfeng, Lin Kevin, Liu Zicheng, Wang Lijuan, Luo Jiebo
- Learning From Mistakes Makes LLM Better Reasoner An Shengnan, Ma Zexiong, Lin Zeqi, Zheng Nanning, Lou Jian-guang, Chen Weizhu
- Agenttuning: Enabling Generalized Agent Abilities For Llms Aohan Zeng, Mingdao Liu, Rui Lu, Bowen Wang, Xiao Liu, Yuxiao Dong, Jie Tang
- Detecting Natural Language Biases With Prompt-based Learning Aowal Md Abdul, Islam Maliha T, Mammen Priyanka Mary, Shetty Sandesh
- Chainforge: A Visual Toolkit For Prompt Engineering And LLM Hypothesis Testing Arawjo Ian, Swoopes Chelse, Vaithilingam Priyan, Wattenberg Martin, Glassman Elena
- AI And Education: An Investigation Into The Use Of Chatgpt For Systems Thinking Arndt Holger
- Have Llms Advanced Enough? A Challenging Problem Solving Benchmark For Large Language Models Arora Daman, Singh Himanshu Gaurav, Mausam
- BUFFET: Benchmarking Large Language Models For Few-shot Cross-lingual Transfer Asai Akari, Kudugunta Sneha, Yu Xinyan Velocity, Blevins Terra, Gonen Hila, Reid Machel, Tsvetkov Yulia, Ruder Sebastian, Hajishirzi Hannaneh
- Unihd At TSAR-2022 Shared Task: Is Compute All We Need For Lexical Simplification? Aumiller Dennis, Gertz Michael
- Studenteval: A Benchmark Of Student-written Prompts For Large Language Models Of Code Babe Hannah Mclean, Nguyen Sydney, Zi Yangtian, Guha Arjun, Feldman Molly Q, Anderson Carolyn Jane
- RRAML: Reinforced Retrieval Augmented Machine Learning Bacciu Andrea, Cuconasu Florin, Siciliano Federico, Silvestri Fabrizio, Tonellotto Nicola, Trappolini Giovanni
- Promptcrafter: Crafting Text-to-image Prompt Through Mixed-initiative Dialogue With LLM Baek Seungho, Im Hyerin, Ryu Jiseung, Park Juhyeong, Lee Takyeon
- Abusing Images And Sounds For Indirect Instruction Injection In Multi-modal Llms Bagdasaryan Eugene, Hsieh Tsung-yin, Nassi Ben, Shmatikov Vitaly
- Evaluating Chatgpt As A Question Answering System: A Comprehensive Analysis And Comparison With Existing Models Bahak Hossein, Taheri Farzaneh, Zojaji Zahra, Kazemi Arefeh
- It's Not Easy Being Wrong: Large Language Models Struggle With Process Of Elimination Reasoning Balepur Nishant, Palta Shramay, Rudinger Rachel
- Tallrec: An Effective And Efficient Tuning Framework To Align Large Language Model With Recommendation Bao Keqin, Zhang Jizhi, Zhang Yang, Wang Wenjie, Feng Fuli, He Xiangnan
- Assessing And Enhancing The Robustness Of Large Language Models With Task Structure Variations For Logical Reasoning Bao Qiming, Gendron Gael, Peng Alex Yuxuan, Zhong Wanjun, Tan Neset, Chen Yang, Witbrock Michael, Liu Jiamou
- Exploring Iterative Enhancement For Improving Learnersourced Multiple-choice Question Explanations With Large Language Models Bao Qiming, Leinonen Juho, Peng Alex Yuxuan, Zhong Wanjun, Gendron Gaël, Pistotti Timothy, Huang Alice, Denny Paul, Witbrock Michael, Liu Jiamou
- Abstract Meaning Representation-based Logic-driven Data Augmentation For Logical Reasoning Bao Qiming, Peng Alex Yuxuan, Deng Zhenyun, Zhong Wanjun, Gendron Gael, Pistotti Timothy, Tan Neset, Young Nathan, Chen Yang, Zhu Yonghua, Denny Paul, Witbrock Michael, Liu Jiamou
- Separate-and-enhance: Compositional Finetuning For Text2image Diffusion Models Bao Zhipeng, Li Yijun, Singh Krishna Kumar, Wang Yu-xiong, Hebert Martial
- Check Your Facts And Try Again: Improving Large Language Models With External Knowledge And Automated Feedback Baolin Peng, Michel Galley, Pengcheng He, Hao Cheng, Yujia Xie, Yu Hu, Qiuyuan Huang, Lars Liden, Zhou Yu, Weizhu Chen, Jianfeng Gao
- Do Language Models Learn About Legal Entity Types During Pretraining? Barale Claire, Rovatsos Michael, Bhuta Nehal
- Self-consistency Of Large Language Models Under Ambiguity Bartsch Henning, Jorgensen Ole, Rosati Domenic, Hoelscher-obermaier Jason, Pfau Jacob
- Simple Linguistic Inferences Of Large Language Models (llms): Blind Spots And Blinds Basmov Victoria, Goldberg Yoav, Tsarfaty Reut
- Investigating The Translation Performance Of A Large Multilingual Language Model: The Case Of BLOOM Bawden Rachel, Yvon François
- Sensitivity, Performance, Robustness: Deconstructing The Effect Of Sociodemographic Prompting Beck Tilman, Schuff Hendrik, Lauscher Anne, Gurevych Iryna
- Large Linguistic Models: Analyzing Theoretical Linguistic Abilities Of Llms Beguš Gašper, Dąbkowski Maksymilian, Rhodes Ryan
- Harnessing Gpt-3.5-turbo For Rhetorical Role Prediction In Legal Cases Belfathi Anas, Hernandez Nicolas, Monceaux Laura
- Worldsense: A Synthetic Benchmark For Grounded Reasoning In Large Language Models Benchekroun Youssef, Dervishi Megi, Ibrahim Mark, Gaya Jean-baptiste, Martinet Xavier, Mialon Grégoire, Scialom Thomas, Dupoux Emmanuel, Hupkes Dieuwke, Vincent Pascal
- Large Language Models In The Workplace: A Case Study On Prompt Engineering For Job Type Classification Benjamin Clavié, Alexandru Ciceu, Frederick Naylor, Guillaume Soulié, Thomas Brightwell
- Prompt Sketching For Large Language Models Beurer-kellner Luca, Müller Mark Niklas, Fischer Marc, Vechev Martin
- When Automated Assessment Meets Automated Content Generation: Examining Text Quality In The Era Of Gpts Bevilacqua Marialena, Oketch Kezia, Qin Ruiyang, Stamey Will, Zhang Xinyuan, Gan Yi, Yang Kai, Abbasi Ahmed
- A Survey On Prompting Techniques In Llms Bhandari Prabin
- Are Large Language Models Geospatially Knowledgeable? Bhandari Prabin, Anastasopoulos Antonios, Pfoser Dieter
- Language Model Unalignment: Parametric Red-teaming To Expose Hidden Harms And Biases Bhardwaj Rishabh, Poria Soujanya
- Red-teaming Large Language Models Using Chain Of Utterances For Safety-alignment Bhardwaj Rishabh, Poria Soujanya
- TART: A Plug-and-play Transformer Module For Task-agnostic Reasoning Bhatia Kush, Narayan Avanika, De Sa Christopher, Ré Christopher
- When Do Program-of-thoughts Work For Reasoning? Bi Zhen, Zhang Ningyu, Jiang Yinuo, Deng Shumin, Zheng Guozhou, Chen Huajun
- Helm: Highlighted Evidence Augmented Language Model For Enhanced Table-to-text Generation Bian Junyi, Qin Xiaolei, Zou Wuhe, Huang Mengzuo, Luo Congyi, Zhang Ke, Zhang Weidong
- Chatgpt Is A Knowledgeable But Inexperienced Solver: An Investigation Of Commonsense Problem In Large Language Models Bian Ning, Han Xianpei, Sun Le, Lin Hongyu, Lu Yaojie, He Ben, Jiang Shanshan, Dong Bin
- Safety-tuned Llamas: Lessons From Improving The Safety Of Large Language Models That Follow Instructions Bianchi Federico, Suzgun Mirac, Attanasio Giuseppe, Röttger Paul, Jurafsky Dan, Hashimoto Tatsunori, Zou James
- Birth Of A Transformer: A Memory Viewpoint Bietti Alberto, Cabannes Vivien, Bouchacourt Diane, Jegou Herve, Bottou Leon
- In-context Learning Dynamics With Random Binary Sequences Bigelow Eric J., Lubana Ekdeep Singh, Dick Robert P., Tanaka Hidenori, Ullman Tomer D.
- Distractor Generation For Multiple-choice Questions With Predictive Prompting And Large Language Models Bitew Semere Kiros, Deleu Johannes, Develder Chris, Demeester Thomas
- Soundstorm: Efficient Parallel Audio Generation Borsos Zalán, Sharifi Matt, Vincent Damien, Kharitonov Eugene, Zeghidour Neil, Tagliasacchi Marco
- Stylip: Multi-scale Style-conditioned Prompt Learning For Clip-based Domain Generalization Bose Shirsha, Jha Ankit, Fini Enrico, Singha Mainak, Ricci Elisa, Banerjee Biplab
- Which Prompts Make The Difference? Data Prioritization For Efficient Human LLM Evaluation Boubdir Meriem, Kim Edward, Ermis Beyza, Fadaee Marzieh, Hooker Sara
- Eight Things To Know About Large Language Models Bowman Samuel R.
- Inpars-light: Cost-effective Unsupervised Training Of Efficient Rankers Boytsov Leonid, Patel Preksha, Sourabh Vivek, Nisar Riddhi, Kundu Sayani, Ramanathan Ramya, Nyberg Eric
- Evaluating Large Language Models For Document-grounded Response Generation In Information-seeking Dialogues Braunschweiler Norbert, Doddipatla Rama, Keizer Simon, Stoyanchev Svetlana
- Extractgpt: Exploring The Potential Of Large Language Models For Product Attribute Value Extraction Brinkmann Alexander, Shraga Roee, Bizer Christian
- Fine-grained Affective Processing Capabilities Emerging From Large Language Models Broekens Joost, Hilpert Bernhard, Verberne Suzan, Baraka Kim, Gebhard Patrick, Plaat Aske
- Utilizing Chatgpt Generated Data To Retrieve Depression Symptoms From Social Media Bucur Ana-maria
- Just Tell Me: Prompt Engineering In Business Process Management Busch Kiran, Rochlitzer Alexander, Sola Diana, Leopold Henrik
- Incremental Learning Of Humanoid Robot Behavior From Natural Interaction And Large Language Models Bärmann Leonard, Kartmann Rainer, Peller-konrad Fabian, Niehues Jan, Waibel Alex, Asfour Tamim
- Leveraging Large Language Models For Scalable Vector Graphics-driven Image Understanding Cai Mu, Huang Zeyi, Li Yuheng, Ojha Utkarsh, Wang Haohan, Lee Yong Jae
- Scaling In-context Demonstrations With Structured Attention Cai Tianle, Huang Kaixuan, Lee Jason D., Wang Mengdi
- CLAP: Isolating Content From Style Through Contrastive Learning With Augmented Prompts Cai Yichao, Liu Yuhang, Zhang Zhen, Shi Javen Qinfeng
- On The Application Of Large Language Models For Language Teaching And Assessment Technology Caines Andrew, Benedetto Luca, Taslimipoor Shiva, Davis Christopher, Gao Yuan, Andersen Oeistein, Yuan Zheng, Elliott Mark, Moore Russell, Bryant Christopher, Rei Marek, Yannakoudakis Helen, Mullooly Andrew, Nicholls Diane, Buttery Paula
- Localizing Lying In Llama: Understanding Instructed Dishonesty On True-false Questions Through Prompting, Probing, And Patching Campbell James, Ren Richard, Guo Phillip
- Can Transformers Learn Sequential Function Classes In Context? Campbell Ryan, Guo Emma, Hu Evan, Vir Reya, Hsiao Ethan
- Defending Against Alignment-breaking Attacks Via Robustly Aligned LLM Cao Bochuan, Cao Yuanpu, Lin Lu, Chen Jinghui
- Systematic Rectification Of Language Models Via Dead-end Analysis Cao Meng, Fatemi Mehdi, Cheung Jackie Chi Kit, Shabanian Samira
- A LLM Assisted Exploitation Of Ai-guardian Carlini Nicholas
- Chatgpt To Replace Crowdsourcing Of Paraphrases For Intent Classification: Higher Diversity And Comparable Model Robustness Cegin Jan, Simko Jakub, Brusilovsky Peter
- Exploring Answer Information Methods For Question Generation With Transformers Chafekar Talha, Hussain Aafiya, Sharma Grishma, Sharma Deepak
- An Empirical Study Of Using Chatgpt For Fact Verification Task Chakraborty Mohna, Kulkarni Adithya, Li Qi
- Chatgpt Evaluation On Sentence Level Relations: A Focus On Temporal, Causal, And Discourse Relations Chan Chunkit, Cheng Jiayang, Wang Weiqi, Jiang Yuxin, Fang Tianqing, Liu Xin, Song Yangqiu
- Prompting And Adapter Tuning For Self-supervised Encoder-decoder Speech Model Chang Kai-wei, Chen Ming-hsin, Lin Yun-ping, Hsu Jing Neng, Huang Paul Kuo-ming, Huang Chien-yu, Li Shang-wen, Lee Hung-yi
- Chipgpt: How Far Are We From Natural Language Hardware Design Chang Kaiyan, Wang Ying, Ren Haimeng, Wang Mengdi, Liang Shengwen, Han Yinhe, Li Huawei, Li Xiaowei
- Selective Demonstrations For Cross-domain Text-to-sql Chang Shuaichen, Fosler-lussier Eric
- Salesbot 2.0: A Human-like Intent-guided Chit-chat Dataset Chang Wen-yu, Chen Yun-nung
- MME: A Comprehensive Evaluation Benchmark For Multimodal Large Language Models Chaoyou Fu, Peixian Chen, Yunhang Shen, Yulei Qin, Mengdan Zhang, Xu Lin, Jinrui Yang, Xiawu Zheng, Ke Li, Xing Sun, Yunsheng Wu, Rongrong Ji
- She Had Cobalt Blue Eyes: Prompt Testing To Create Aligned And Sustainable Language Models Chatrath Veronica, Bamgbose Oluwanifemi, Raza Shaina
- It's All Relative! -- A Synthetic Query Generation Approach For Improving Zero-shot Relevance Prediction Chaudhary Aditi, Raman Karthik, Bendersky Michael
- Fireact: Toward Language Agent Fine-tuning Chen Baian, Shu Chang, Shareghi Ehsan, Collier Nigel, Narasimhan Karthik, Yao Shunyu
- Generate Labeled Training Data Using Prompt Programming And GPT-3. An Example Of Big Five Personality Classification Chen Eason
- Gptutor: A Chatgpt-powered Programming Tool For Code Explanation Chen Eason, Huang Ray, Chen Han-shin, Tseng Yuen-hsien, Li Liang-yi
- Chain-of-thought Prompt Distillation For Multimodal Named Entity Recognition And Multimodal Relation Extraction Chen Feng, Feng Yujian
- LION: Empowering Multimodal Large Language Model With Dual-level Visual Knowledge Chen Gongwei, Shen Leyang, Shao Rui, Deng Xiang, Nie Liqiang
- Personalised Distillation: Empowering Open-sourced Llms With Adaptive Learning For Code Generation Chen Hailin, Saha Amrita, Hoi Steven, Joty Shafiq
- MCC-KD: Multi-cot Consistent Knowledge Distillation Chen Hongzhan, Wu Siyue, Quan Xiaojun, Wang Rui, Yan Ming, Zhang Ji
- Walking Down The Memory Maze: Beyond Context Limit Through Interactive Reading Chen Howard, Pasunuru Ramakanth, Weston Jason, Celikyilmaz Asli
- Skills-in-context Prompting: Unlocking Compositionality In Large Language Models Chen Jiaao, Pan Xiaoman, Yu Dian, Song Kaiqiang, Wang Xiaoyang, Yu Dong, Chen Jianshu
- Fairytalecqa: Integrating A Commonsense Knowledge Graph Into Children's Storybook Narratives Chen Jiaju, Lu Yuxuan, Zhang Shao, Yao Bingsheng, Dong Yuanzhe, Xu Ying, Li Yunyao, Wang Qianwen, Wang Dakuo, Sun Yuling
- Forgetful Large Language Models: Lessons Learned From Using Llms In Robot Programming Chen Juo-tung, Huang Chien-ming
- Reconcile: Round-table Conference Improves Reasoning Via Consensus Among Diverse Llms Chen Justin Chih-yao, Saha Swarnadeep, Bansal Mohit
- Compcodevet: A Compiler-guided Validation And Enhancement Approach For Code Dataset Chen Le, Bhattacharjee Arijit, Ahmed Nesreen K., Hasabnis Niranjan, Oren Gal, Lei Bin, Jannesari Ali
- Large Language Models Are Visual Reasoning Coordinators Chen Liangyu, Li Bo, Shen Sheng, Yang Jingkang, Li Chunyuan, Keutzer Kurt, Darrell Trevor, Liu Ziwei
- Introspective Tips: Large Language Model For In-context Decision Making Chen Liting, Wang Lu, Dong Hang, Du Yali, Yan Jie, Yang Fangkai, Li Shuang, Zhao Pu, Qin Si, Rajmohan Saravan, Lin Qingwei, Zhang Dongmei
- Few-shot Data Synthesis For Open Domain Multi-hop Question Answering Chen Mingda, Chen Xilun, Yih Wen-tau
- Natural Response Generation For Chinese Reading Comprehension Chen Nuo, Li Hongguang, Bao Yinan, Wang Baoyuan, Li Jia
- From Good To Great: Improving Math Reasoning With Tool-augmented Interleaf Prompting Chen Nuo, Li Hongguang, Wang Baoyuan, Li Jia
- Dynamic Strategy Chain: Dynamic Zero-shot Cot For Long Mental Health Support Generation Chen Qi, Liu Dexi
- Evaluation Of Chatgpt Family Of Models For Biomedical Reasoning And Classification Chen Shan, Li Yingya, Lu Sheng, Van Hoang, Aerts Hugo Jwl, Savova Guergana K., Bitterman Danielle S.
- Evaluating Factual Consistency Of Summaries With Large Language Models Chen Shiqi, Gao Siyang, He Junxian
- Understanding And Improving In-context Learning On Vision-language Models Chen Shuo, Han Zhen, He Bailan, Buckley Mark, Torr Philip, Tresp Volker, Gu Jindong
- Chatpipe: Orchestrating Data Preparation Program By Optimizing Human-chatgpt Interactions Chen Sibei, Liu Hanbing, Jin Weiting, Sun Xiangyu, Feng Xiaoyao, Fan Ju, Du Xiaoyong, Tang Nan
- Llm-empowered Chatbots For Psychiatrist And Patient Simulation: Application And Evaluation Chen Siyuan, Wu Mengyue, Zhu Kenny Q., Lan Kunyao, Zhang Zhiling, Cui Lyuchun
- MMICT: Boosting Multi-modal Fine-tuning With In-context Examples Chen Tao, Zhang Enwei, Gao Yuting, Li Ke, Sun Xing, Zhang Yan, Li Hui, Ji Rongrong
- Self-icl: Zero-shot In-context Learning With Self-generated Demonstrations Chen Wei-lin, Wu Cheng-kuang, Chen Yun-nung, Chen Hsin-hsi
- Theoremqa: A Theorem-driven Question Answering Dataset Chen Wenhu, Yin Ming, Ku Max, Lu Pan, Wan Yixin, Ma Xueguang, Xu Jianyu, Wang Xinyi, Xia Tony
- Reason Out Your Layout: Evoking The Layout Master From Large Language Models For Text-to-image Synthesis Chen Xiaohui, Liu Yongfei, Yang Yingxiang, Yuan Jianbo, You Quanzeng, Liu Li-ping, Yang Hongxia
- The Janus Interface: How Fine-tuning In Large Language Models Amplifies The Privacy Risks Chen Xiaoyi, Tang Siyuan, Zhu Rui, Yan Shijun, Jin Lei, Wang Zihao, Su Liya, Zhang Zhikun, Wang Xiaofeng, Tang Haixu
- Asking Before Acting: Gather Information In Embodied Decision Making With Language Models Chen Xiaoyu, Zhang Shenao, Zhang Pushi, Zhao Li, Chen Jianyu
- Universal Self-consistency For Large Language Model Generation Chen Xinyun, Aksitov Renat, Alon Uri, Ren Jie, Xiao Kefan, Yin Pengcheng, Prakash Sushant, Sutton Charles, Wang Xuezhi, Zhou Denny
- Minprompt: Graph-based Minimal Prompt Data Augmentation For Few-shot Question Answering Chen Xiusi, Jiang Jyun-yu, Chang Wei-cheng, Hsieh Cho-jui, Yu Hsiang-fu, Wang Wei
- Autoeval-video: An Automatic Benchmark For Assessing Large Vision Language Models In Open-ended Video Question Answering Chen Xiuyuan, Lin Yuan, Zhang Yuchen, Huang Weiran
- Medprompt: Cross-modal Prompting For Multi-task Medical Image Translation Chen Xuhang, Pun Chi-man, Wang Shuqiang
- Translation And Fusion Improves Zero-shot Cross-lingual Information Extraction Chen Yang, Shah Vedaant, Ritter Alan
- Prompt To GPT-3: Step-by-step Thinking Instructions For Humor Generation Chen Yuetian, Shi Bowen, Si Mei
- Assessing The Impact Of Prompting Methods On Chatgpt's Mathematical Capabilities Chen Yuhao, Wong Chloe, Yang Hanwen, Aguenza Juan, Bhujangari Sai, Vu Benthan, Lei Xun, Prasad Amisha, Fluss Manny, Phuong Eric, Liu Minghao, Kumar Raja, Vats Vanshika, Davis James
- SALM: Speech-augmented Language Model With In-context Learning For Speech Recognition And Translation Chen Zhehuai, Huang He, Andrusenko Andrei, Hrinchuk Oleksii, Puvvada Krishna C., Li Jason, Ghosh Subhankar, Balam Jagadeesh, Ginsburg Boris
- See, Think, Confirm: Interactive Prompting Between Vision And Language Models For Knowledge-based Visual Reasoning Chen Zhenfang, Zhou Qinhong, Shen Yikang, Hong Yining, Zhang Hao, Gan Chuang
- Xplainllm: A QA Explanation Dataset For Understanding LLM Decision-making Chen Zichen, Chen Jianda, Gaidhani Mitali, Singh Ambuj, Sra Misha
- Visual Chatgpt: Talking, Drawing And Editing With Visual Foundation Models Chenfei Wu, Shengming Yin, Weizhen Qi, Xiaodong Wang, Zecheng Tang, Nan Duan
- UPRISE: Universal Prompt Retrieval For Improving Zero-shot Evaluation Cheng Daixuan, Huang Shaohan, Bi Junyu, Zhan Yuefeng, Liu Jianfeng, Wang Yujing, Sun Hao, Wei Furu, Deng Denvy, Zhang Qi
- Black-box Prompt Optimization: Aligning Large Language Models Without Model Training Cheng Jiale, Liu Xiao, Zheng Kehan, Ke Pei, Wang Hongning, Dong Yuxiao, Tang Jie, Huang Minlie
- Lift Yourself Up: Retrieval-augmented Text Generation With Self Memory Cheng Xin, Luo Di, Chen Xiuying, Liu Lemao, Zhao Dongyan, Yan Rui
- Batch Prompting: Efficient Inference With Large Language Model Apis Cheng Zhoujun, Kasai Jungo, Yu Tao
- Neural Codec Language Models Are Zero-shot Text To Speech Synthesizers Chengyi Wang, Sanyuan Chen, Yu Wu, Ziqiang Zhang, Long Zhou, Shujie Liu, Zhuo Chen, Yanqing Liu, Huaming Wang, Jinyu Li, Lei He, Sheng Zhao, Furu Wei
- M²chat: Empowering VLM For Multimodal LLM Interleaved Text-image Generation Chi Xiaowei, Zhang Rongyu, Jiang Zhengkai, Liu Yijiang, Wang Yatian, Qi Xingqun, Luo Wenhan, Gao Peng, Zhang Shanghang, Liu Qifeng, Guo Yike
- Apollo: Unified Adapter And Prompt Learning For Vision Language Models Chowdhury Sanjoy, Nag Sayan, Manocha Dinesh
- Large Language Models For User Interest Journeys Christakopoulou Konstantina, Lalama Alberto, Adams Cj, Qu Iris, Amir Yifat, Chucri Samer, Vollucci Pierce, Soldo Fabio, Bseiso Dina, Scodel Sarah, Dixon Lucas, Chi Ed H., Chen Minmin
- Navigate Through Enigmatic Labyrinth A Survey Of Chain Of Thought Reasoning: Advances, Frontiers And Future Chu Zheng, Chen Jingchang, Chen Qianglong, Yu Weijiang, He Tao, Wang Haotian, Peng Weihua, Liu Ming, Qin Bing, Liu Ting
- Data-centric Financial Large Language Models Chu Zhixuan, Guo Huaiyu, Zhou Xinyuan, Wang Yijia, Yu Fei, Chen Hong, Xu Wanqing, Lu Xin, Cui Qing, Li Longfei, Zhou Jun, Li Sheng
- Tutorials On Stance Detection Using Pre-trained Language Models: Fine-tuning BERT And Prompting Large Language Models Chuang Yun-shiuan
- LIMA: Less Is More For Alignment Chunting Zhou, Pengfei Liu, Puxin Xu, Srini Iyer, Jiao Sun, Yuning Mao, Xuezhe Ma, Avia Efrat, Ping Yu, Lili Yu, Susan Zhang, Gargi Ghosh, Mike Lewis, Luke Zettlemoyer, Omer Levy
- Safurai 001: New Qualitative Approach For Code LLM Evaluation Cifarelli Davide, Boiardi Leonardo, Puppo Alessandro
- Galactic Chitchat: Using Large Language Models To Converse With Astronomy Literature Ciucă Ioana, Ting Yuan-sen
- Harnessing The Power Of Adversarial Prompting And Large Language Models For Robust Hypothesis Generation In Astronomy Ciucă Ioana, Ting Yuan-sen, Kruk Sandor, Iyer Kartheik
- Inducing Anxiety In Large Language Models Increases Exploration And Bias Coda-forno Julian, Witte Kristin, Jagadish Akshay K., Binz Marcel, Akata Zeynep, Schulz Eric
- Prompting A Large Language Model To Generate Diverse Motivational Messages: A Comparison With Human-written Messages Cox Samuel Rhys, Abdul Ashraf, Ooi Wei Tsang
- Receive, Reason, And React: Drive As You Say With Large Language Models In Autonomous Vehicles Cui Can, Ma Yunsheng, Cao Xu, Ye Wenqian, Wang Ziran
- Ada-instruct: Adapting Instruction Generators For Complex Reasoning Cui Wanyun, Wang Qianle
- Evaluating Prompt-based Question Answering For Object Prediction In The Open Research Knowledge Graph D'souza Jennifer, Hrou Moussab, Auer Sören
- Llm-in-the-loop: Leveraging Large Language Model For Thematic Analysis Dai Shih-chieh, Xiong Aiping, Ku Lun-wei
- Llmebench: A Flexible Framework For Accelerating Llms Benchmarking Dalvi Fahim, Hasanain Maram, Boughorbel Sabri, Mousi Basel, Abdaljalil Samir, Nazar Nizi, Abdelali Ahmed, Chowdhury Shammur Absar, Mubarak Hamdy, Ali Ahmed, Hawasly Majd, Durrani Nadir, Alam Firoj
- Educhat: A Large-scale Language Model-based Chatbot System For Intelligent Education Dan Yuhao, Lei Zhikai, Gu Yiyang, Li Yong, Yin Jianghao, Lin Jiaju, Ye Linhao, Tie Zhiyan, Zhou Yougen, Wang Yilei, Zhou Aimin, Zhou Ze, Chen Qin, Zhou Jie, He Liang, Qiu Xipeng
- Choice Over Control: How Users Write With Large Language Models Using Diegetic And Non-diegetic Prompting Dang Hai, Goller Sven, Lehmann Florian, Buschek Daniel
- Visual Chain Of Thought: Bridging Logical Gaps With Multimodal Infillings Daniel Rose, Vaishnavi Himakunthala, Andy Ouyang, Ryan He, Alex Mei, Yujie Lu, Michael Saxon, Chinmay Sonar, Diba Mirza, William Yang Wang
- An Adversarial Example For Direct Logit Attribution: Memory Management In Gelu-4l Dao James, Lau Yeu-tong, Rager Can, Janiak Jett
- Complex QA And Language Models Hybrid Architectures, Survey Daull Xavier, Bellot Patrice, Bruno Emmanuel, Martin Vincent, Murisasco Elisabeth
- User Simulation With Large Language Models For Evaluating Task-oriented Dialogue Davidson Sam, Romeo Salvatore, Shu Raphael, Gung James, Gupta Arshit, Mansour Saab, Zhang Yi
- Competence-based Analysis Of Language Models Davies Adam, Jiang Jize, Zhai Chengxiang
- LLMR: Real-time Prompting Of Interactive Worlds Using Large Language Models De La Torre Fernanda, Fang Cathy Mengying, Huang Han, Banburski-fahey Andrzej, Fernandez Judith Amores, Lanier Jaron
- GPT-3 Models Are Few-shot Financial Reasoners De Padua Raul Salles, Qureshi Imran, Karakaplan Mustafa U.
- Improved Prompting And Process For Writing User Personas With Llms, Using Qualitative Interviews: Capturing Behaviour And Personality Traits Of Users De Paoli Stefano
- Im-promptu: In-context Composition From Image Prompts Dedhia Bhishma, Chang Michael, Snell Jake C., Griffiths Thomas L., Jha Niraj K.
- Controlled Text Generation Via Language Model Arithmetic Dekoninck Jasper, Fischer Marc, Beurer-kellner Luca, Vechev Martin
- Fairness Of Chatgpt And The Role Of Explainable-guided Prompts Deldjoo Yashar
- Investigating Data Contamination In Modern Benchmarks For Large Language Models Deng Chunyuan, Zhao Yilun, Tang Xiangru, Gerstein Mark, Cohan Arman
- Masterkey: Automated Jailbreak Across Multiple Large Language Model Chatbots Deng Gelei, Liu Yi, Li Yuekang, Wang Kailong, Zhang Ying, Li Zefeng, Wang Haoyu, Zhang Tianwei, Liu Yang
- Towards A Unified View Of Answer Calibration For Multi-step Reasoning Deng Shumin, Zhang Ningyu, Oo Nay, Hooi Bryan
- Prompting And Evaluating Large Language Models For Proactive Dialogues: Clarification, Target-guided, And Non-collaboration Deng Yang, Liao Lizi, Chen Liang, Wang Hongru, Lei Wenqiang, Chua Tat-seng
- Plug-and-play Policy Planner For Large Language Model Powered Dialogue Agents Deng Yang, Zhang Wenxuan, Lam Wai, Ng See-kiong, Chua Tat-seng
- Rephrase And Respond: Let Large Language Models Ask Better Questions For Themselves Deng Yihe, Zhang Weitong, Chen Zixiang, Gu Quanquan
- Multilingual Jailbreak Challenges In Large Language Models Deng Yue, Zhang Wenxuan, Pan Sinno Jialin, Bing Lidong
- Testing AI On Language Comprehension Tasks Reveals Insensitivity To Underlying Meaning Dentella Vittoria, Guenther Fritz, Murphy Elliot, Marcus Gary, Leivada Evelina
- Mixture Of Soft Prompts For Controllable Data Generation Derek Chen, Celine Lee, Yunan Lu, Domenic Rosati, Zhou Yu
- Can Chatgpt Read Who You Are? Derner Erik, Kučera Dalibor, Oliver Nuria, Zahálka Jan
- Chatgpt Asks, BLIP-2 Answers: Automatic Questioning Towards Enriched Visual Descriptions Deyao Zhu, Jun Chen, Kilichbek Haydarov, Xiaoqian Shen, Wenxuan Zhang, Mohamed Elhoseiny
- Queer People Are People First: Deconstructing Sexual Identity Stereotypes In Large Language Models Dhingra Harnoor, Jayashanker Preetiha, Moghe Sayali, Strubell Emma
- Evaluating Chatgpt As A Recommender System: A Rigorous Approach Di Palma Dario, Biancofiore Giovanni Maria, Anelli Vito Walter, Narducci Fedelucio, Di Noia Tommaso, Di Sciascio Eugenio
- Codefuse-13b: A Pretrained Multi-lingual Code Large Language Model Di Peng, Li Jianguo, Yu Hang, Jiang Wei, Cai Wenting, Cao Yang, Chen Chaoyu, Chen Dajun, Chen Hongwei, Chen Liang, Fan Gang, Gong Jie, Gong Zi, Hu Wen, Guo Tingting, Lei Zhichao, Li Ting, Li Zheng, Liang Ming, Liao Cong, Liu Bingchang, Liu Jiachen, Liu Zhiwei, Lu Shaojun, Shen Min, Wang Guangpei, Wang Huan, Wang Zhi, Xu Zhaogui, Yang Jiawei, Ye Qing, Zhang Gehao, Zhang Yu, Zhao Zelin, Zheng Xunjin, Zhou Hailian, Zhu Lifu, Zhu Xianying
- Can Instruction Fine-tuned Language Models Identify Social Bias Through Prompting? Dige Omkar, Tian Jacob-junqi, Emerson David, Khattak Faiza Khan
- Using Large Language Model To Solve And Explain Physics Word Problems Approaching Human Level Ding Jingzhe, Cen Yan, Wei Xinyuan
- A Wolf In Sheep's Clothing: Generalized Nested Jailbreak Prompts Can Fool Large Language Models Easily Ding Peng, Kuang Jun, Ma Dan, Cao Xuezhi, Xian Yunsen, Chen Jiajun, Huang Shujian
- Everything Of Thoughts: Defying The Law Of Penrose Triangle For Thought Generation Ding Ruomeng, Zhang Chaoyun, Wang Lu, Xu Yong, Ma Minghua, Zhang Wei, Qin Si, Rajmohan Saravan, Lin Qingwei, Zhang Dongmei
- Mapping The Design Space Of Interactions In Human-ai Text Co-creation Tasks Ding Zijian, Chan Joel
- Do Llms Work On Charts? Designing Few-shot Prompts For Chart Question Answering And Summarization Do Xuan Long, Hassanpour Mohammad, Masry Ahmed, Kavehzadeh Parsa, Hoque Enamul, Joty Shafiq
- Prompt Optimization Via Adversarial In-context Learning Do Xuan Long, Zhao Yiran, Brown Hannah, Xie Yuxi, Zhao James Xu, Chen Nancy F., Kawaguchi Kenji, Shieh Michael, He Junxian
- Establishing Performance Baselines In Fine-tuning, Retrieval-augmented Generation And Soft-prompting For Non-specialist LLM Users Dodgson Jennifer, Nanzheng Lin, Peh Julian, Pattirane Akira Rafhael Janson, Alhajir Alfath Daryl, Dinarto Eko Ridho, Lim Joseph, Ahmad Syed Danyal
- Revisit Input Perturbation Problems For Llms: A Unified Robustness Evaluation Framework For Noisy Slot Filling Task Dong Guanting, Zhao Jinxu, Hui Tingfeng, Guo Daichi, Wan Wenlong, Feng Boqi, Qiu Yueyan, Gongque Zhuoma, He Keqing, Wang Zechen, Xu Weiran
- Statistical Knowledge Assessment For Large Language Models Dong Qingxiu, Xu Jingjing, Kong Lingpeng, Sui Zhifang, Li Lei
- PACE: Improving Prompt With Actor-critic Editing For Large Language Model Dong Yihong, Luo Kangcheng, Jiang Xue, Jin Zhi, Li Ge
- BAMBOO: A Comprehensive Benchmark For Evaluating Long Text Modeling Capacities Of Large Language Models Dong Zican, Tang Tianyi, Li Junyi, Zhao Wayne Xin, Wen Ji-rong
- Plugmed: Improving Specificity In Patient-centered Medical Dialogue Generation Using In-context Learning Dou Chengfeng, Jin Zhi, Jiao Wenping, Zhao Haiyan, Tao Zhenwei, Zhao Yongqiang
- Improving Factuality And Reasoning In Language Models Through Multiagent Debate Du Yilun, Li Shuang, Torralba Antonio, Tenenbaum Joshua B., Mordatch Igor
- Guiding Pretraining In Reinforcement Learning With Large Language Models Du Yuqing, Watkins Olivia, Wang Zihan, Colas Cédric, Darrell Trevor, Abbeel Pieter, Gupta Abhishek, Andreas Jacob
- Exploring The Relationship Between In-context Learning And Instruction Tuning Duan Hanyu, Tang Yixuan, Yang Yi, Abbasi Ahmed, Tam Kar Yan
- Botchat: Evaluating Llms' Capabilities Of Having Multi-turn Dialogues Duan Haodong, Wei Jueqi, Wang Chonghua, Liu Hongwei, Fang Yixiao, Zhang Songyang, Lin Dahua, Chen Kai
- Denevil: Towards Deciphering And Navigating The Ethical Values Of Large Language Models Via Instruction Learning Duan Shitong, Yi Xiaoyuan, Zhang Peng, Lu Tun, Xie Xing, Gu Ning
- Alpacafarm: A Simulation Framework For Methods That Learn From Human Feedback Dubois Yann, Li Xuechen, Taori Rohan, Zhang Tianyi, Gulrajani Ishaan, Ba Jimmy, Guestrin Carlos, Liang Percy, Hashimoto Tatsunori B.
- Sweeping Heterogeneity With Smart Mops: Mixture Of Prompts For LLM Task Adaptation Dun Chen, Garcia Mirian Hipolito, Zheng Guoqing, Awadallah Ahmed Hassan, Kyrillidis Anastasios, Sim Robert
- Measuring And Improving Attentiveness To Partial Inputs With Counterfactuals Elazar Yanai, Paranjape Bhargavi, Peng Hao, Wiegreffe Sarah, Raghavi Khyathi, Srikumar Vivek, Singh Sameer, Smith Noah A.
- Who's Harry Potter? Approximate Unlearning In Llms Eldan Ronen, Russinovich Mark
- In What Languages Are Generative Language Models The Most Formal? Analyzing Formality Distribution Across Languages Ersoy Asım, Vizcarra Gerson, Mayeesha Tasmiah Tahsin, Muller Benjamin
- ROBBIE: Robust Bias Evaluation Of Large Generative Language Models Esiobu David, Tan Xiaoqing, Hosseini Saghar, Ung Megan, Zhang Yuchen, Fernandes Jude, Dwivedi-yu Jane, Presani Eleonora, Williams Adina, Smith Eric Michael
- GPT-3.5, GPT-4, Or BARD? Evaluating Llms Reasoning Ability In Zero-shot Setting And Performance Boosting Through Prompts Espejel Jessica López, Ettifouri El Hassane, Alassan Mahaman Sanoussi Yahaya, Chouham El Mehdi, Dahhane Walid
- Exploring The Potential Of Large Language Models In Generating Code-tracing Questions For Introductory Programming Courses Fan Aysa Xuemo, Zhang Ranran Haoran, Paquette Luc, Zhang Rui
- Chain-of-thought Tuning: Masked Language Models Can Also Think Step By Step In Natural Language Understanding Fan Caoyun, Tian Jidong, Li Yitian, Chen Wenqing, He Hao, Jin Yaohui
- Comparable Demonstrations Are Important In In-context Learning: A Novel Perspective On Demonstration Selection Fan Caoyun, Tian Jidong, Li Yitian, He Hao, Jin Yaohui
- Uncovering The Potential Of Chatgpt For Discourse Analysis In Dialogue: An Empirical Study Fan Yaxin, Jiang Feng, Li Peifeng, Li Haizhou
- Using GPT-4 To Augment Unbalanced Data For Automatic Scoring Fang Luyang, Lee Gyeong-geon, Zhai Xiaoming
- Chatgpt Performance On Standardized Testing Exam -- A Proposed Strategy For Learners Farooq Umer, Anwar Saira
- Prompting Large Language Models With Speech Recognition Abilities Fathullah Yassir, Wu Chunyang, Lakomkin Egor, Jia Junteng, Shangguan Yuan, Li Ke, Guo Jinxi, Xiong Wenhan, Mahadeokar Jay, Kalinli Ozlem, Fuegen Christian, Seltzer Mike
- Audiochatllama: Towards General-purpose Speech Abilities For Llms Fathullah Yassir, Wu Chunyang, Lakomkin Egor, Li Ke, Jia Junteng, Shangguan Yuan, Mahadeokar Jay, Kalinli Ozlem, Fuegen Christian, Seltzer Mike
- Friend Or Foe? Exploring The Implications Of Large Language Models On The Science System Fecher Benedikt, Hebing Marcel, Laufer Melissa, Pohle Jörg, Sofsky Fabian
- Transferable Decoding With Visual Entities For Zero-shot Image Captioning Fei Junjie, Wang Teng, Zhang Jinrui, He Zhenyu, Wang Chengjie, Zheng Feng
- Trapping LLM Hallucinations Using Tagged Context Prompts Feldman Philip, Foulds James R., Pan Shimei
- Language Models Can Be Logical Solvers Feng Jiazhan, Xu Ruochen, Hao Junheng, Sharma Hiteshi, Shen Yelong, Zhao Dongyan, Chen Weizhu
- LLM4VG: Large Language Models Evaluation For Video Grounding Feng Wei, Wang Xin, Chen Hong, Zhang Zeyang, Chen Houlun, Song Zihan, Zhou Yuwei, Yang Yuekui, Wu Haiyang, Zhu Wenwu
- Alphazero-like Tree-search Can Guide Large Language Model Decoding And Training Feng Xidong, Wan Ziyu, Wen Muning, Mcaleer Stephen Marcus, Wen Ying, Zhang Weinan, Wang Jun
- The Devil Is In The Errors: Leveraging Large Language Models For Fine-grained Machine Translation Evaluation Fernandes Patrick, Deutsch Daniel, Finkelstein Mara, Riley Parker, Martins André F. T., Neubig Graham, Garg Ankush, Clark Jonathan H., Freitag Markus, Firat Orhan
- Promptbreeder: Self-referential Self-improvement Via Prompt Evolution Fernando Chrisantha, Banarse Dylan, Michalewski Henryk, Osindero Simon, Rocktäschel Tim
- Context-aware Meta-learning Fifty Christopher, Duan Dennis, Junkins Ronald G., Amid Ehsan, Leskovec Jure, Re Christopher, Thrun Sebastian
- Can Large Language Models Follow Concept Annotation Guidelines? A Case Study On Scientific And Financial Domains Fonseca Marcio, Cohen Shay B.
- A Review Of Chatgpt Applications In Education, Marketing, Software Engineering, And Healthcare: Benefits, Drawbacks, And Research Directions Fraiwan Mohammad, Khasawneh Natheer
- Large Language Models Can Be Easily Distracted By Irrelevant Context Freda Shi, Xinyun Chen, Kanishka Misra, Nathan Scales, David Dohan, Ed Chi, Nathanael Schärli, Denny Zhou
- Inca: Rethinking In-car Conversational System Assessment Leveraging Large Language Models Friedl Ken E., Khan Abbas Goher, Sahoo Soumya Ranjan, Rony Md Rashad Al Hasan, Germies Jana, Süß Christian
- Chainpoll: A High Efficacy Method For LLM Hallucination Detection Friel Robert, Sanyal Atindriyo
- Estimating Large Language Model Capabilities Without Labeled Test Data Fu Harvey Yiyun, Ye Qinyuan, Xu Albert, Ren Xiang, Jia Robin
- Misusing Tools In Large Language Models With Visual Adversarial Examples Fu Xiaohan, Wang Zihan, Li Shuheng, Gupta Rajesh K., Mireshghallah Niloofar, Berg-kirkpatrick Taylor, Fernandes Earlence
- Generate Then Select: Open-ended Visual Question Answering Guided By World Knowledge Fu Xingyu, Zhang Sheng, Kwon Gukyeong, Perera Pramuditha, Zhu Henghui, Zhang Yuhao, Li Alexander Hanbo, Wang William Yang, Wang Zhiguo, Castelli Vittorio, Ng Patrick, Roth Dan, Xiang Bing
- Specializing Smaller Language Models Towards Multi-step Reasoning Fu Yao, Peng Hao, Ou Litu, Sabharwal Ashish, Khot Tushar
- Gpt4aigchip: Towards Next-generation AI Accelerator Design Automation Via Large Language Models Fu Yonggan, Zhang Yongan, Yu Zhongzhi, Li Sixu, Ye Zhifan, Li Chaojian, Wan Cheng, Lin Yingyan
- Exposing Limitations Of Language Model Agents In Sequential-task Compositions On The Web Furuta Hiroki, Matsuo Yutaka, Faust Aleksandra, Gur Izzeddin
- Sensitivity And Robustness Of Large Language Models To Prompt Template In Japanese Text Classification Tasks Gan Chengguang, Mori Tatsunori
- Natural Language Commanding Via Program Synthesis Gandhi Apurva, Nguyen Thong Q., Jiao Huitian, Steen Robert, Bhatawdekar Ameya
- Systematic Evaluation Of GPT-3 For Zero-shot Personality Estimation Ganesan Adithya V, Lal Yash Kumar, Nilsson August Håkan, Schwartz H. Andrew
- LLM Blueprint: Enabling Text-to-image Generation With Complex And Detailed Prompts Gani Hanan, Bhat Shariq Farooq, Naseer Muzammal, Khan Salman, Wonka Peter
- Strategyllm: Large Language Models As Strategy Generators, Executors, Optimizers, And Evaluators For Problem Solving Gao Chang, Jiang Haiyun, Cai Deng, Shi Shuming, Lam Wai
- Text-to-sql Empowered By Large Language Models: A Benchmark Evaluation Gao Dawei, Wang Haibin, Li Yaliang, Sun Xiuyu, Qian Yichen, Ding Bolin, Zhou Jingren
- Self-explanation Prompting Improves Dialogue Understanding In Large Language Models Gao Haoyu, Lin Ting-en, Li Hangyu, Yang Min, Wu Yuchuan, Ma Wentao, Li Yongbin
- Enabling Large Language Models To Generate Text With Citations Gao Tianyu, Yen Howard, Yu Jiatong, Chen Danqi
- How To Design Translation Prompts For Chatgpt: An Empirical Study Gao Yuan, Wang Ruili, Hou Feng
- CLOVA: A Closed-loop Visual Assistant With Tool Usage And Update Gao Zhi, Du Yuntao, Zhang Xintong, Ma Xiaojian, Han Wenjuan, Zhu Song-chun, Li Qing
- Real Customization Or Just Marketing: Are Customized Versions Of Chat GPT Useful? Garrido-merchán Eduardo C., Arroyo-barrigüete Jose L., Borrás-pala Francisco, Escobar-torres Leandro, De Ibarreta Carlos Martínez, Ortiz-lozano Jose María, Rua-vieites Antonio
- Simulating H.P. Lovecraft Horror Literature With The Chatgpt Large Language Model Garrido-merchán Eduardo C., Arroyo-barrigüete José Luis, Gozalo-brizuela Roberto
- Faithful Explanations Of Black-box NLP Models Using Llm-generated Counterfactuals Gat Yair, Calderon Nitay, Feder Amir, Chapanin Alexander, Sharma Amit, Reichart Roi
- Text Encoders Lack Knowledge: Leveraging Generative Llms For Domain-specific Semantic Textual Similarity Gatto Joseph, Sharif Omar, Seegmiller Parker, Bohlman Philip, Preum Sarah Masud
- Reasoning In Large Language Models Through Symbolic Math Word Problems Gaur Vedant, Saunshi Nikunj
- MART: Improving LLM Safety With Multi-round Automatic Red-teaming Ge Suyu, Zhou Chunting, Hou Rui, Khabsa Madian, Wang Yi-chia, Wang Qifan, Han Jiawei, Mao Yuning
- In-context Autoencoder For Context Compression In A Large Language Model Ge Tao, Hu Jing, Wang Lei, Wang Xun, Chen Si-qing, Wei Furu
- Ddcot: Duty-distinct Chain-of-thought Prompting For Multimodal Reasoning In Language Models Ge Zheng, Bin Yang, Jiajin Tang, Hong-yu Zhou, Sibei Yang
- UPAR: A Kantian-inspired Prompting Framework For Enhancing Large Language Model Capabilities Geng Hejia, Xu Boxun, Li Peng
- Dictionary-based Phrase-level Prompting Of Large Language Models For Machine Translation Ghazvininejad Marjan, Gonen Hila, Zettlemoyer Luke
- Chatgpt Perpetuates Gender Bias In Machine Translation And Ignores Non-gendered Pronouns: Findings Across Bengali And Five Other Low-resource Languages Ghosh Sourojit, Caliskan Aylin
- Semantic Compression With Large Language Models Gilbert Henry, Sandborn Michael, Schmidt Douglas C., Spencer-smith Jesse, White Jules
- Prompt Cache: Modular Attention Reuse For Low-latency Inference Gim In, Chen Guojun, Lee Seung-seob, Sarda Nikhil, Khandelwal Anurag, Zhong Lin
- Wanglab At Mediqa-chat 2023: Clinical Note Generation From Doctor-patient Conversations Using Large Language Models Giorgi John, Toma Augustin, Xie Ronald, Chen Sondra S., An Kevin R., Zheng Grace X., Wang Bo
- Can Llms Capture Human Preferences? Goli Ali, Singh Amandeep
- Model-generated Pretraining Signals Improves Zero-shot Generalization Of Text-to-text Transformers Gong Linyuan, Xiong Chenyan, Liu Xiaodong, Bajaj Payal, Xie Yiqing, Cheung Alvin, Gao Jianfeng, Song Xia
- Coascore: Chain-of-aspects Prompting For NLG Evaluation Gong Peiyuan, Mao Jiaxin
- Evaluating The Potential Of Leading Large Language Models In Reasoning Biology Questions Gong Xinyu, Holmes Jason, Li Yiwei, Liu Zhengliang, Gan Qi, Wu Zihao, Zhang Jianli, Zou Yusong, Teng Yuxi, Jiang Tian, Zhu Hongtu, Liu Wei, Liu Tianming, Yan Yajun
- Figstep: Jailbreaking Large Vision-language Models Via Typographic Visual Prompts Gong Yichen, Ran Delong, Liu Jinyuan, Wang Conglei, Cong Tianshuo, Wang Anyu, Duan Sisi, Wang Xiaoyun
- Improving Input-label Mapping With Demonstration Replay For In-context Learning Gong Zhuocheng, Liu Jiahao, Wang Qifan, Wang Jingang, Cai Xunliang, Zhao Dongyan, Yan Rui
- Intention And Context Elicitation With Large Language Models In The Legal Aid Intake Process Goodson Nick, Lu Rongfei
- Not What You've Signed Up For: Compromising Real-world Llm-integrated Applications With Indirect Prompt Injection Greshake Kai, Abdelnabi Sahar, Mishra Shailesh, Endres Christoph, Holz Thorsten, Fritz Mario
- AMAGO: Scalable In-context Reinforcement Learning For Adaptive Agents Grigsby Jake, Fan Linxi, Zhu Yuke
- Pokemqa: Programmable Knowledge Editing For Multi-hop Question Answering Gu Hengrui, Zhou Kaixiong, Han Xiaotian, Liu Ninghao, Wang Ruobing, Wang Xin
- Linguistically Informed Chatgpt Prompts To Enhance Japanese-chinese Machine Translation: A Case Study On Attributive Clauses Gu Wenshi
- Leveraging Pre-trained Large Language Models To Construct And Utilize World Models For Model-based Task Planning Guan Lin, Valmeekam Karthik, Sreedharan Sarath, Kambhampati Subbarao
- Voyager: An Open-ended Embodied Agent With Large Language Models Guanzhi Wang, Yuqi Xie, Yunfan Jiang, Ajay Mandlekar, Chaowei Xiao, Yuke Zhu, Linxi Fan, Anima Anandkumar
- Hallucinations In Large Multilingual Translation Models Guerreiro Nuno M., Alves Duarte, Waldendorf Jonas, Haddow Barry, Birch Alexandra, Colombo Pierre, Martins André F. T.
- Connecting Large Language Models With Evolutionary Algorithms Yields Powerful Prompt Optimizers Guo Qingyan, Wang Rui, Guo Junliang, Li Bei, Song Kaitao, Tan Xu, Liu Guoqing, Bian Jiang, Yang Yujiu
- Compresso: Structured Pruning With Collaborative Prompting Learns Compact Large Language Models Guo Song, Xu Jiahang, Zhang Li Lyna, Yang Mao
- What Can Large Language Models Do In Chemistry? A Comprehensive Benchmark On Eight Tasks Guo Taicheng, Guo Kehan, Nan Bozhao, Liang Zhenwen, Guo Zhichun, Chawla Nitesh V., Wiest Olaf, Zhang Xiangliang
- Knowledgenavigator: Leveraging Large Language Models For Enhanced Reasoning Over Knowledge Graph Guo Tiezheng, Yang Qingwen, Wang Chen, Liu Yanyi, Li Pan, Tang Jiawei, Li Dapeng, Wen Yingyou
- Instruction Fusion: Advancing Prompt Evolution Through Hybridization Guo Weidong, Yang Jiuding, Yang Kaitong, Li Xiangyang, Rao Zhuwei, Xu Yu, Niu Di
- Learning To Plan With Natural Language Guo Yiduo, Liang Yaobo, Wu Chenfei, Wu Wenshan, Zhao Dongyan, Duan Nan
- Arthmodel: Enhance Arithmetic Skills To Large Language Model Guo Yingdi
- Gemrec: Towards Generative Model Recommendation Guo Yuanhe, Liu Haoming, Wen Hongyi
- Retrieval-augmented Code Generation For Universal Information Extraction Guo Yucan, Li Zixuan, Jin Xiaolong, Liu Yantao, Zeng Yutao, Liu Wenxuan, Li Xiang, Yang Pan, Bai Long, Guo Jiafeng, Cheng Xueqi
- Targen: Targeted Data Generation With Large Language Models Gupta Himanshu, Scaria Kevin, Anantheswaran Ujjwala, Verma Shreyas, Parmar Mihir, Sawant Saurabh Arjun, Baral Chitta, Mishra Swaroop
- Bias Runs Deep: Implicit Reasoning Biases In Persona-assigned Llms Gupta Shashank, Shrivastava Vaishnavi, Deshpande Ameet, Kalyan Ashwin, Clark Peter, Sabharwal Ashish, Khot Tushar
- Gistscore: Learning Better Representations For In-context Example Selection With Gist Bottlenecks Gupta Shivanshu, Rosenbaum Clemens, Elenberg Ethan R.
- Evaluating Concurrent Robustness Of Language Models Across Diverse Challenge Sets Gupta Vatsal, Pandya Pranshu, Kataria Tushar, Gupta Vivek, Roth Dan
- Is GPT-4 A Reliable Rater? Evaluating Consistency In GPT-4 Text Ratings Hackl Veronika, Müller Alexandra Elena, Granitzer Michael, Sailer Maximilian
- A Theory Of Emergent In-context Learning As Implicit Structure Induction Hahn Michael, Goyal Navin
- Dialcot Meets PPO: Decomposing And Exploring Reasoning Paths In Smaller Language Models Han Chengcheng, Du Xiaowei, Zhang Che, Lian Yixin, Li Xiang, Gao Ming, Wang Baoyuan
- Potential Benefits Of Employing Large Language Models In Research In Moral Education And Development Han Hyemin
- Pive: Prompting With Iterative Verification Improving Graph-based Generative Capability Of Llms Han Jiuzhou, Collier Nigel, Buntine Wray, Shareghi Ehsan
- In-context Alignment: Chat With Vanilla Language Models Before Fine-tuning Han Xiaochuang
- "mistakes Help Us Grow": Facilitating And Evaluating Growth Mindset Supportive Language In Classrooms Handa Kunal, Clapper Margaret, Boyle Jessica, Wang Rose E, Yang Diyi, Yeager David S, Demszky Dorottya
- Reasoning Implicit Sentiment With Chain-of-thought Prompting Hao Fei, Bobo Li, Qian Liu, Lidong Bing, Fei Li, Tat-seng Chua
- Boosting Large Language Model For Speech Synthesis: An Empirical Study Hao Hongkun, Zhou Long, Liu Shujie, Li Jinyu, Hu Shujie, Wang Rui, Wei Furu
- Toolkengpt: Augmenting Frozen Language Models With Massive Tools Via Tool Embeddings Hao Shibo, Liu Tianyang, Wang Zhen, Hu Zhiting
- Towards Efficient Vision-language Tuning: More Information Density, More Generalizability Hao Tianxiang, Lyu Mengyao, Chen Hui, Zhao Sicheng, Ding Xiaohan, Han Jungong, Ding Guiguang
- Unlocking The Potential Of Chatgpt: A Comprehensive Exploration Of Its Applications, Advantages, Limitations, And Future Directions In Natural Language Processing Hariri Walid
- Reranking For Natural Language Generation From Logical Forms: A Study Based On Large Language Models Haroutunian Levon, Li Zhuang, Galescu Lucian, Cohen Philip, Tumuluri Raj, Haffari Gholamreza
- Answering Unseen Questions With Smaller Language Models Using Rationale Generation And Dense Retrieval Hartill Tim, Benavides-prado Diana, Witbrock Michael, Riddle Patricia J.
- The Political Ideology Of Conversational AI: Converging Evidence On Chatgpt's Pro-environmental, Left-libertarian Orientation Hartmann Jochen, Schwenzow Jasper, Witte Maximilian
- CHAI-DT: A Framework For Prompting Conversational Generative AI Agents To Actively Participate In Co-creation Harwood Brandon
- Zero- And Few-shot Prompting With Llms: A Comparative Study With Fine-tuned Models For Bangla Sentiment Analysis Hasan Md. Arid, Das Shudipta, Anjum Afiyat, Alam Firoj, Anjum Anika, Sarker Avijit, Noori Sheak Rashed Haider
- Multilingual Language Models Are Not Multicultural: A Case Study In Emotion Havaldar Shreya, Rai Sunny, Singhal Bhumika, Liu Langchen, Guntuku Sharath Chandra, Ungar Lyle
- Spear Phishing With Large Language Models Hazell Julian
- ICL-D3IE: In-context Learning With Diverse Demonstrations Updating For Document Information Extraction He Jiabang, Wang Lei, Hu Yi, Liu Ning, Liu Hui, Xu Xing, Shen Heng Tao
- Wordart Designer: User-driven Artistic Typography Synthesis Using Large Language Models He Jun-yan, Cheng Zhi-qi, Li Chenyang, Sun Jingdong, Xiang Wangmeng, Lin Xianhui, Kang Xiaoyang, Jin Zengke, Hu Yusen, Luo Bin, Geng Yifeng, Xie Xuansong, Zhou Jingren
- You Only Prompt Once: On The Capabilities Of Prompt Learning On Large Language Models To Tackle Toxic Content He Xinlei, Zannettou Savvas, Shen Yun, Zhang Yang
- Using Natural Language Explanations To Improve Robustness Of In-context Learning He Xuanli, Wu Yuxiang, Camburu Oana-maria, Minervini Pasquale, Stenetorp Pontus
- An Automatically Discovered Chain-of-thought Prompt Generalizes To Novel Models And Datasets Hebenstreit Konstantin, Praas Robert, Kiesewetter Louis P, Samwald Matthias
- Chatgpt For Zero-shot Dialogue State Tracking: A Solution Or An Opportunity? Heck Michael, Lubis Nurul, Ruppik Benjamin, Vukovic Renato, Feng Shutong, Geishauser Christian, Lin Hsien-chin, Van Niekerk Carel, Gašić Milica
- Exploring The Responses Of Large Language Models To Beginner Programmers' Help Requests Hellas Arto, Leinonen Juho, Sarsa Sami, Koutcheme Charles, Kujanpää Lilja, Sorva Juha
- How Good Are GPT Models At Machine Translation? A Comprehensive Evaluation Hendy Amr, Abdelrehim Mohamed, Sharaf Amr, Raunak Vikas, Gabr Mohamed, Matsushita Hitokazu, Kim Young Jin, Afify Mohamed, Awadalla Hany Hassan
- Can Llms Grade Short-answer Reading Comprehension Questions: An Empirical Study With A Novel Dataset Henkel Owen, Hills Libby, Roberts Bill, Mcgrane Joshua
- Towards Verifiable Text Generation With Symbolic References Hennigen Lucas Torroba, Shen Shannon, Nrusimha Aniruddha, Gapp Bernhard, Sontag David, Kim Yoon
- Stochastic Parrots Looking For Stochastic Parrots: Llms Are Easy To Fine-tune And Hard To Detect With Other Llms Henrique Da Silva Gameiro, Kucharavy Andrei, Guerraoui Rachid
- Large Language Models Are Biased To Overestimate Profoundness Herrera-berg Eugenio, Browne Tomás Vergara, León-villagrá Pablo, Vives Marc-lluís, Calderon Cristian Buc
- Assessing The Efficacy Of Large Language Models In Generating Accurate Teacher Responses Hicke Yann, Masand Abhishek, Guo Wentao, Gangavarapu Tushaar
- Harnessing The Zero-shot Power Of Instruction-tuned Large Language Model In End-to-end Speech Recognition Higuchi Yosuke, Ogawa Tetsuji, Kobayashi Tetsunori
- Comparative Analysis Of GPT-4 And Human Graders In Evaluating Praise Given To Students In Synthetic Dialogues Hirunyasiri Dollaya, Thomas Danielle R., Lin Jionghao, Koedinger Kenneth R., Aleven Vincent
- Do Compressed Llms Forget Knowledge? An Experimental Study With Practical Implications Hoang Duc N. M, Cho Minsik, Merth Thomas, Rastegari Mohammad, Wang Zhangyang
- Evaluating Large Language Models On A Highly-specialized Topic, Radiation Oncology Physics Holmes Jason, Liu Zhengliang, Zhang Lian, Ding Yuzhen, Sio Terence T., Mcgee Lisa A., Ashman Jonathan B., Li Xiang, Liu Tianming, Shen Jiajian, Liu Wei
- L2MAC: Large Language Model Automatic Computer For Extensive Code Generation Holt Samuel, Luyten Max Ruiz, Van Der Schaar Mihaela
- DP-OPT: Make Large Language Model Your Privacy-preserving Prompt Engineer Hong Junyuan, Wang Jiachen T., Zhang Chenhui, Li Zhangheng, Li Bo, Wang Zhangyang
- Direct2v: Large Language Models Are Frame-level Directors For Zero-shot Text-to-video Generation Hong Susung, Seo Junyoung, Shin Heeseong, Hong Sunghwan, Kim Seungryong
- 3D-LLM: Injecting The 3D World Into Large Language Models Hong Yining, Zhen Haoyu, Chen Peihao, Zheng Shuhong, Du Yilun, Chen Zhenfang, Gan Chuang
- Ralle: A Framework For Developing And Evaluating Retrieval-augmented Large Language Models Hoshi Yasuto, Miyashita Daisuke, Ng Youyang, Tatsuno Kento, Morioka Yasuhiro, Torii Osamu, Deguchi Jun
- More Robots Are Coming: Large Multimodal Models (Chatgpt) Can Solve Visually Diverse Images Of Parsons Problems Hou Irene, Man Owen, Mettille Sophie, Gutierrez Sebastian, Angelikas Kenneth, Macneil Stephen
- Distilling Step-by-step! Outperforming Larger Language Models With Less Training Data And Smaller Model Sizes Hsieh Cheng-yu, Li Chun-liang, Yeh Chih-kuan, Nakhost Hootan, Fujii Yasuhisa, Ratner Alexander, Krishna Ranjay, Lee Chen-yu, Pfister Tomas
- Chain-of-symbol Prompting Elicits Planning In Large Language Models Hu Hanxu, Lu Hongyuan, Zhang Huajian, Song Yun-ze, Lam Wai, Zhang Yue
- Context-aware Prompt Tuning For Vision-language Model With Dual-alignment Hu Hongyu, Lin Tiancheng, Wang Jie, Sun Zhenbang, Xu Yi
- Prompting Is Not A Substitute For Probability Measurements In Large Language Models Hu Jennifer, Levy Roger
- Ladder-of-thought: Using Knowledge As Steps To Elevate Stance Detection Hu Kairui, Yan Ming, Zhou Joey Tianyi, Tsang Ivor W., Chong Wen Haw, Yap Yong Keong
- COMMA: Co-articulated Multi-modal Learning Hu Lianyu, Gao Liqing, Liu Zekang, Pun Chi-man, Feng Wei
- Scaled Prompt-tuning For Few-shot Natural Language Generation Hu Ting, Meinel Christoph, Yang Haojin
- BLIVA: A Simple Multimodal LLM For Better Handling Of Text-rich Visual Questions Hu Wenbo, Xu Yifan, Li Yi, Li Weiyue, Chen Zeyuan, Tu Zhuowen
- Enhancing Answer Selection In Community Question Answering With Pre-trained And Large Language Models Hu Xinghang
- Evoke: Evoking Critical Thinking Abilities In Llms Via Reviewer-author Prompt Editing Hu Xinyu, Tang Pengfei, Zuo Simiao, Wang Zihan, Song Bowen, Lou Qiang, Jiao Jian, Charles Denis
- Code Prompting: A Neural Symbolic Method For Complex Reasoning In Large Language Models Hu Yi, Yang Haotong, Lin Zhouchen, Zhang Muhan
- AMERICANO: Argument Generation With Discourse-driven Decomposition And Agent Interaction Hu Zhe, Chan Hou Pong, Yin Yu
- Token-level Adversarial Prompt Detection Based On Perplexity Measures And Contextual Information Hu Zhengmian, Wu Gang, Mitra Saayan, Zhang Ruiyi, Sun Tong, Huang Heng, Swaminathan Viswanathan
- Llm-adapters: An Adapter Family For Parameter-efficient Fine-tuning Of Large Language Models Hu Zhiqiang, Wang Lei, Lan Yihuai, Xu Wanyu, Lim Ee-peng, Bing Lidong, Xu Xing, Poria Soujanya, Lee Roy Ka-wei
- POP: Prompt Of Prompts For Continual Learning Hu Zhiyuan, Lyu Jiancheng, Gao Dashan, Vasconcelos Nuno
- Deploying And Evaluating Llms To Program Service Mobile Robots Hu Zichao, Lucchetti Francesca, Schlesinger Claire, Saxena Yash, Freeman Anders, Modak Sadanand, Guha Arjun, Biswas Joydeep
- SMILE: Single-turn To Multi-turn Inclusive Language Expansion Via Chatgpt For Mental Health Support Huachuan Qiu, Hongliang He, Shuai Zhang, Anqi Li, Zhenzhong Lan
- Enhancing Large Language Models In Coding Through Multi-perspective Self-consistency Huang Baizhou, Lu Shuai, Chen Weizhu, Wan Xiaojun, Duan Nan
- Lorahub: Efficient Cross-task Generalization Via Dynamic Lora Composition Huang Chengsong, Liu Qian, Lin Bill Yuchen, Pang Tianyu, Du Chao, Lin Min
- Agentcoder: Multi-agent-based Code Generation With Iterative Testing And Optimisation Huang Dong, Zhang Jie M., Luck Michael, Bu Qingwen, Qing Yuhao, Cui Heming
- Is Chatgpt Better Than Human Annotators? Potential And Limitations Of Chatgpt In Explaining Implicit Hate Speech Huang Fan, Kwak Haewoon, An Jisun
- Free-bloom: Zero-shot Text-to-video Generator With LLM Director And LDM Animator Huang Hanzhuo, Feng Yufan, Shi Cheng, Xu Lan, Yu Jingyi, Yang Sibei
- Not All Languages Are Created Equal In Llms: Improving Multilingual Capability By Cross-lingual-thought Prompting Huang Haoyang, Tang Tianyi, Zhang Dongdong, Zhao Wayne Xin, Song Ting, Xia Yan, Wei Furu
- Revisiting The Reliability Of Psychological Scales On Large Language Models Huang Jen-tse, Wang Wenxuan, Lam Man Ho, Li Eric John, Jiao Wenxiang, Lyu Michael R.
- Modality Plug-and-play: Elastic Modality Adaptation In Multimodal Llms For Embodied AI Huang Kai, Yang Boyuan, Gao Wei
- T2i-compbench: A Comprehensive Benchmark For Open-world Compositional Text-to-image Generation Huang Kaiyi, Sun Kaiyue, Xie Enze, Li Zhenguo, Liu Xihui
- Can Large Language Models Explain Themselves? A Study Of Llm-generated Self-explanations Huang Shiyuan, Mamidanna Siddarth, Jangam Shreedhar, Zhou Yilun, Gilpin Leilani H.
- Adapting Pre-trained Language Models To Vision-language Tasks Via Dynamic Visual Prompting Huang Shubin, Wu Qiong, Zhou Yiyi, Chen Weijie, Zhang Rongsheng, Sun Xiaoshuai, Ji Rongrong
- Fewer Is More: Boosting LLM Reasoning With Reinforced Context Pruning Huang Xijie, Zhang Li Lyna, Cheng Kwang-ting, Yang Fan, Yang Mao
- Catastrophic Jailbreak Of Open-source Llms Via Exploiting Generation Huang Yangsibo, Gupta Samyak, Xia Mengzhou, Li Kai, Chen Danqi
- Competition-level Problems Are Effective LLM Evaluators Huang Yiming, Lin Zhenghao, Liu Xiao, Gong Yeyun, Lu Shuai, Lei Fangyu, Liang Yaobo, Shen Yelong, Lin Chen, Duan Nan, Chen Weizhu
- Trustgpt: A Benchmark For Trustworthy And Responsible Large Language Models Huang Yue, Zhang Qihui, Yu Philip S., Sun Lichao
- Advancing Transformer Architecture In Long-context Large Language Models: A Comprehensive Survey Huang Yunpeng, Xu Jingwei, Lai Junyu, Jiang Zixu, Chen Taolue, Li Zenan, Yao Yuan, Ma Xiaoxing, Yang Lijuan, Chen Hao, Li Shupeng, Zhao Penghao
- Evaluating And Enhancing Large Language Models For Conversational Reasoning On Knowledge Graphs Huang Yuxuan, Shi Lida, Liu Anqi, Xu Hao
- Speech Translation With Large Language Models: An Industrial Practice Huang Zhichao, Ye Rong, Ko Tom, Dong Qianqian, Cheng Shanbo, Wang Mingxuan, Li Hang
- Soft Prompt Decoding For Multilingual Dense Retrieval Huang Zhiqi, Zeng Hansi, Zamani Hamed, Allan James
- Comparing Large Language Model AI And Human-generated Coaching Messages For Behavioral Weight Loss Huang Zhuoran, Berry Michael P., Chwyl Christina, Hsieh Gary, Wei Jing, Forman Evan M.
- Retrieving Supporting Evidence For Generative Question Answering Huo Siqing, Arabzadeh Negar, Clarke Charles L. A.
- Towards Leveraging Llms For Conditional QA Hussain Syed-amad, Dakle Parag Pravin, Rallabandi Saikrishna, Raghavan Preethi
- Understanding The Effectiveness Of Very Large Language Models On Dialog Evaluation Huynh Jessica, Jiao Cathy, Gupta Prakhar, Mehri Shikib, Bajaj Payal, Chaudhary Vishrav, Eskenazi Maxine
- Large Language Models As Sous Chefs: Revising Recipes With GPT-3 Hwang Alyssa, Li Bryan, Hou Zhaoyi, Roth Dan
- Aligning Language Models To User Opinions Hwang Eunjeong, Majumder Bodhisattwa Prasad, Tandon Niket
- Diversigate: A Comprehensive Framework For Reliable Large Language Models Imani Shima, Beyram Ali, Shrivastava Harsh
- Mathprompter: Mathematical Reasoning Using Large Language Models Imani Shima, Du Liang, Shrivastava Harsh
- Multitool-cot: GPT-3 Can Use Multiple External Tools With Chain Of Thought Prompting Inaba Tatsuro, Kiyomaru Hirokazu, Cheng Fei, Kurohashi Sadao
- Llama Guard: Llm-based Input-output Safeguard For Human-ai Conversations Inan Hakan, Upasani Kartikeya, Chi Jianfeng, Rungta Rashi, Iyer Krithika, Mao Yuning, Tontchev Michael, Hu Qing, Fuller Brian, Testuggine Davide, Khabsa Madian
- Leveraging Large Language Models To Generate Answer Set Programs Ishay Adam, Yang Zhun, Lee Joohyung
- Pushing Boundaries: Exploring Zero Shot Object Classification With Large Multimodal Models Islam Ashhadul, Biswas Md. Rafiul, Zaghouani Wajdi, Belhaouari Samir Brahim, Shah Zubair
- Running Cognitive Evaluations On Large Language Models: The Do's And The Don'ts Ivanova Anna A.
- Query Expansion By Prompting Large Language Models Jagerman Rolf, Zhuang Honglei, Qin Zhen, Wang Xuanhui, Bendersky Michael
- Faithful Persona-based Conversational Dataset Generation With Large Language Models Jandaghi Pegah, Sheng Xianghai, Bai Xinyi, Pujara Jay, Sidahmed Hakim
- Consistency Analysis Of Chatgpt Jang Myeongjun Erik, Lukasiewicz Thomas
- Chatgpt Is Fun, But It Is Not Funny! Humor Is Still Challenging Large Language Models Jentzsch Sophie, Kersting Kristian
- Hijacking Context In Large Multi-modal Models Jeong Joonhyun
- Chatbot Is Not All You Need: Information-rich Prompting For More Realistic Responses Jeong Seokhoon, Makhmud Assentay
- Inpars-v2: Large Language Models As Efficient Dataset Generators For Information Retrieval Jeronymo Vitor, Bonifacio Luiz, Abonizio Hugo, Fadaee Marzieh, Lotufo Roberto, Zavrel Jakub, Nogueira Rodrigo
- Symbol Tuning Improves In-context Learning In Language Models Jerry Wei, Le Hou, Andrew Lampinen, Xiangning Chen, Da Huang, Yi Tay, Xinyun Chen, Yifeng Lu, Denny Zhou, Tengyu Ma, Quoc V. Le
- Chit-chat Or Deep Talk: Prompt Engineering For Process Mining Jessen Urszula, Sroka Michal, Fahland Dirk
- Neuro Symbolic Reasoning For Planning: Counterexample Guided Inductive Synthesis Using Large Language Models And Satisfiability Solving Jha Sumit Kumar, Jha Susmit, Lincoln Patrick, Bastian Nathaniel D., Velasquez Alvaro, Ewetz Rickard, Neema Sandeep
- Natural Language Decomposition And Interpretation Of Complex Utterances Jhamtani Harsh, Fang Hao, Xia Patrick, Levy Eran, Andreas Jacob, Van Durme Ben
- Large Language Models As Automated Aligners For Benchmarking Vision-language Models Ji Yuanfeng, Ge Chongjian, Kong Weikai, Xie Enze, Liu Zhengying, Li Zhengguo, Luo Ping
- Benchmarking And Explaining Large Language Model-based Code Generation: A Causality-centric Approach Ji Zhenlan, Ma Pingchuan, Li Zongjie, Wang Shuai
- Zero-shot Faithfulness Evaluation For Text Summarization With Foundation Language Model Jia Qi, Ren Siyu, Liu Yizhu, Zhu Kenny Q.
- Bootstrapping Vision-language Learning With Decoupled Language Pre-training Jian Yiren, Gao Chongyang, Vosoughi Soroush
- Legal Syllogism Prompting: Teaching Large Language Models For Legal Judgment Prediction Jiang Cong, Yang Xiaolei
- Llmlingua: Compressing Prompts For Accelerated Inference Of Large Language Models Jiang Huiqiang, Wu Qianhui, Lin Chin-yew, Yang Yuqing, Qiu Lili
- Longllmlingua: Accelerating And Enhancing Llms In Long Context Scenarios Via Prompt Compression Jiang Huiqiang, Wu Qianhui, Luo Xufang, Li Dongsheng, Lin Chin-yew, Yang Yuqing, Qiu Lili
- Graphologue: Exploring Large Language Model Responses With Interactive Diagrams Jiang Peiling, Rayan Jude, Dow Steven P., Xia Haijun
- Selfevolve: A Code Evolution Framework Via Large Language Models Jiang Shuyang, Wang Yuhao, Wang Yu
- Prompt Packer: Deceiving Llms Through Compositional Instruction With Hidden Attacks Jiang Shuyu, Chen Xingshu, Tang Rui
- Resprompt: Residual Connection Prompting Advances Multi-step Reasoning In Large Language Models Jiang Song, Shakeri Zahra, Chan Aaron, Sanjabi Maziar, Firooz Hamed, Xia Yinglong, Akyildiz Bugra, Sun Yizhou, Li Jinchao, Wang Qifan, Celikyilmaz Asli
- Scaling Sentence Embeddings With Large Language Models Jiang Ting, Huang Shaohan, Luan Zhongzhi, Wang Deqing, Zhuang Fuzhen
- Hykge: A Hypothesis Knowledge Graph Enhanced Framework For Accurate And Reliable Medical Llms Responses Jiang Xinke, Zhang Ruizhe, Xu Yongxin, Qiu Rihong, Fang Yue, Wang Zhiyuan, Tang Jinyi, Ding Hongxin, Chu Xu, Zhao Junfeng, Wang Yasha
- Lion: Adversarial Distillation Of Proprietary Large Language Models Jiang Yuxin, Chan Chunkit, Chen Mingyang, Wang Wei
- Followbench: A Multi-level Fine-grained Constraints Following Benchmark For Large Language Models Jiang Yuxin, Wang Yufei, Zeng Xingshan, Zhong Wanjun, Li Liangyou, Mi Fei, Shang Lifeng, Jiang Xin, Liu Qun, Wang Wei
- Generative Calibration For In-context Learning Jiang Zhongtao, Zhang Yuanzhe, Liu Cao, Zhao Jun, Liu Kang
- Chain Of Thought Prompt Tuning In Vision Language Models Jiaxin Ge, Hongyin Luo, Siyuan Qian, Yulu Gan, Jie Fu, Shanghang Zhang
- Llara: Large Language-recommendation Assistant Jiayi Liao, Sihang Li, Zhengyi Yang, Jiancan Wu, Yancheng Yuan, Xiang Wang, Xiangnan He
- Prompt-based Length Controlled Generation With Reinforcement Learning Jie Renlong, Meng Xiaojun, Shang Lifeng, Jiang Xin, Liu Qun
- The Cost Of Down-scaling Language Models: Fact Recall Deteriorates Before In-context Learning Jin Tian, Clement Nolan, Dong Xin, Nagarajan Vaishnavh, Carbin Michael, Ragan-kelley Jonathan, Dziugaite Gintare Karolina
- Binary Code Summarization: Benchmarking Chatgpt/GPT-4 And Other Large Language Models Jin Xin, Larson Jonathan, Yang Weiwei, Lin Zhiqiang
- Grounding Language Models To Images For Multimodal Inputs And Outputs Jing Yu Koh, Ruslan Salakhutdinov, Daniel Fried
- Knowledge-augmented Large Language Models For Personalized Contextual Query Suggestion Jinheon Baek, Nirupama Chandrasekaran, Silviu Cucerzan, Allen Herring, Sujay Kumar Jauhar
- Vcoder: Versatile Vision Encoders For Multimodal Large Language Models Jitesh Jain, Jianwei Yang, Humphrey Shi
- GPT Is Becoming A Turing Machine: Here Are Some Ways To Program It Jojic Ana, Wang Zhen, Jojic Nebojsa
- Does GPT-4 Pass The Turing Test? Jones Cameron R., Bergen Benjamin K.
- Small Language Models Fine-tuned To Coordinate Larger Language Models Improve Complex Reasoning Juneja Gurusha, Dutta Subhabrata, Chakrabarti Soumen, Manchanda Sunny, Chakraborty Tanmoy
- Is Chatgpt A Good Recommender? A Preliminary Study Junling Liu, Chao Liu, Peilin Zhou, Renjie Lv, Kang Zhou, Yan Zhang
- Program-aided Reasoners (Better) Know What They Know Kabra Anubha, Rangreji Sanketh, Mathur Yash, Madaan Aman, Liu Emmy, Neubig Graham
- Synthetic Data Generation In Low-resource Settings Via Fine-tuning Of Large Language Models Kaddour Jean, Liu Qi
- Robust Training For Conversational Question Answering Models With Reinforced Reformulation Generation Kaiser Magdalena, Roy Rishiraj Saha, Weikum Gerhard
- Contextual Code Switching For Machine Translation Using Language Models Kaji Arshad, Shah Manan
- Swectrl-mini: A Data-transparent Transformer-based Large Language Model For Controllable Text Generation In Swedish Kalpakchi Dmytro, Boye Johan
- Deficiency Of Large Language Models In Finance: An Empirical Examination Of Hallucination Kang Haoqiang, Liu Xiao-yang
- Llm-augmented Preference Learning From Natural Language Kang Inwon, Ruan Sikai, Ho Tyler, Lin Jui-chien, Mohsin Farhad, Seneviratne Oshani, Xia Lirong
- Exploring Automatic Evaluation Methods Based On A Decoder-based LLM For Text Generation Kasahara Tomohito, Kawahara Daisuke
- Critiquellm: Towards An Informative Critique Generation Model For Evaluation Of Large Language Model Generation Ke Pei, Wen Bosi, Feng Zhuoer, Liu Xiao, Lei Xuanyu, Cheng Jiale, Wang Shengyuan, Zeng Aohan, Dong Yuxiao, Wang Hongning, Tang Jie, Huang Minlie
- Unnatural Language Processing: How Do Language Models Handle Machine-generated Prompts? Kervadec Corentin, Franzon Francesca, Baroni Marco
- Can Chatgpt And Bard Generate Aligned Assessment Items? A Reliability Analysis Against Human Performance Khademi Abdolvahab
- Assessing The Promise And Pitfalls Of Chatgpt For Automated Code Generation Khan Muhammad Fawad Akbar, Ramsdell Max, Falor Erik, Karimi Hamid
- Introducing Language Guidance In Prompt-based Continual Learning Khan Muhammad Gul Zain Ali, Naeem Muhammad Ferjad, Van Gool Luc, Stricker Didier, Tombari Federico, Afzal Muhammad Zeshan
- Dspy: Compiling Declarative Language Model Calls Into Self-improving Pipelines Khattab Omar, Singhvi Arnav, Maheshwari Paridhi, Zhang Zhiyuan, Santhanam Keshav, Vardhamanan Sri, Haq Saiful, Sharma Ashutosh, Joshi Thomas T., Moazam Hanna, Miller Heather, Zaharia Matei, Potts Christopher
- Reliability Check: An Analysis Of Gpt-3's Response To Sensitive Topics And Prompt Wording Khatun Aisha, Brown Daniel G.
- How Secure Is Code Generated By Chatgpt? Khoury Raphaël, Avila Anderson R., Brunelle Jacob, Camara Baba Mamadou
- Auto-survey Challenge Khuong Thanh Gia Hieu, Rachmat Benedictus Kent
- Transferring Pre-trained Multimodal Representations With Cross-modal Similarity Matching Kim Byoungjip, Choi Sungik, Hwang Dasol, Lee Moontae, Lee Honglak
- Language Models Can Solve Computer Tasks Kim Geunwoo, Baldi Pierre, Mcaleer Stephen
- Which Is Better? Exploring Prompting Strategy For Llm-based Metrics Kim Joonghoon, Park Saeran, Jeong Kiyoon, Lee Sangmin, Han Seung Hun, Lee Jiyoon, Kang Pilsung
- Cotever: Chain Of Thought Prompting Annotation Toolkit For Explanation Verification Kim Seungone, Joo Se June, Jang Yul, Chae Hyungjoo, Yeo Jinyoung
- Aligning Large Language Models Through Synthetic Feedback Kim Sungdong, Bae Sanghwan, Shin Jamin, Kang Soyoung, Kwak Donghyun, Yoo Kang Min, Seo Minjoon
- Evallm: Interactive Evaluation Of Large Language Model Prompts On User-defined Criteria Kim Tae Soo, Lee Yoonjoo, Shin Jamin, Kim Young-ho, Kim Juho
- Lmcanvas: Object-oriented Interaction To Personalize Large Language Model-powered Writing Environments Kim Tae Soo, Sarkar Arghya, Lee Yoonjoo, Chang Minsuk, Kim Juho
- Improving Knowledge Extraction From Llms For Task Learning Through Agent Analysis Kirk James R., Wray Robert E., Lindes Peter, Laird John E.
- Can Language Models Laugh At Youtube Short-form Videos? Ko Dayoon, Lee Sangho, Kim Gunhee
- Conversational Ai-powered Design: Chatgpt As Designer, User, And Product Kocaballi A. Baki
- Large Language Models Are State-of-the-art Evaluators Of Translation Quality Kocmi Tom, Federmann Christian
- Chatgpt: Jack Of All Trades, Master Of None Kocoń Jan, Cichecki Igor, Kaszyca Oliwier, Kochanek Mateusz, Szydło Dominika, Baran Joanna, Bielaniewicz Julita, Gruza Marcin, Janz Arkadiusz, Kanclerz Kamil, Kocoń Anna, Koptyra Bartłomiej, Mieleszczenko-kowszewicz Wiktoria, Miłkowski Piotr, Oleksy Marcin, Piasecki Maciej, Radliński Łukasz, Wojtasik Konrad, Woźniak Stanisław, Kazienko Przemysław
- How You Prompt Matters! Even Task-oriented Constraints In Instructions Affect Llm-generated Text Detection Koike Ryuto, Kaneko Masahiro, Okazaki Naoaki
- Writing Your Own Book: A Method For Going From Closed To Open Book QA To Improve Robustness And Performance Of Smaller Llms Kokaia Giorgi, Sinha Pratyush, Jiang Yutong, Boujemaa Nozha
- Better Zero-shot Reasoning With Role-play Prompting Kong Aobo, Zhao Shiwan, Chen Hao, Li Qicheng, Qin Yong, Sun Ruiqi, Zhou Xin, Wang Enzhi, Dong Xiaohang
- Benchmarking Cognitive Biases In Large Language Models As Evaluators Koo Ryan, Lee Minhwa, Raheja Vipul, Park Jong Inn, Kim Zae Myung, Kang Dongyeop
- Humans In Humans Out: On GPT Converging Toward Common Sense In Both Success And Failure Koralus Philipp, Wang-maścianica Vincent
- Evaluating Large Language Models In Theory Of Mind Tasks Kosinski Michal
- Gender Bias And Stereotypes In Large Language Models Kotek Hadas, Dockum Rikker, Sun David Q.
- Understanding Catastrophic Forgetting In Language Models Via Implicit Inference Kotha Suhas, Springer Jacob Mitchell, Raghunathan Aditi
- Little Giants: Exploring The Potential Of Small Llms As Evaluation Metrics In Summarization In The Eval4nlp 2023 Shared Task Kotonya Neema, Krishnasamy Saran, Tetreault Joel, Jaimes Alejandro
- L-TUNING: Synchronized Label Tuning For Prompt And Prefix In Llms Kowsher Md., Sobuj Md. Shohanur Islam, Mahmud Asif, Prottasha Nusrat Jahan, Bhat Prakash
- Developing Effective Educational Chatbots With Chatgpt Prompts: Insights From Preliminary Tests In A Case Study On Social Media Literacy (with Appendix) Koyuturk Cansu, Yavari Mona, Theophilou Emily, Bursic Sathya, Donabauer Gregor, Telari Alessia, Testa Alessia, Boiano Raffaele, Gabbiadini Alessandro, Hernandez-leo Davinia, Ruskov Martin, Ognibene Dimitri
- Chatgpt For PLC/DCS Control Logic Generation Koziolek Heiko, Gruener Sten, Ashiwal Virendra
- Leveraging Few-shot Data Augmentation And Waterfall Prompting For Response Generation Krause Lea, Santamaría Selene Báez, Van Der Meer Michiel, Khurana Urja
- Prompt Generate Train (PGT): Few-shot Domain Adaption Of Retrieval Augmented Generation Models For Open Book Question-answering Krishna C. S.
- Post Hoc Explanations Of Language Models Can Improve Language Models Krishna Satyapriya, Ma Jiaqi, Slack Dylan, Ghandeharioun Asma, Singh Sameer, Lakkaraju Himabindu
- Chatbots Are Not Reliable Text Annotators Kristensen-mclachlan Ross Deans, Canavan Miceal, Kardos Márton, Jacobsen Mia, Aarøe Lene
- In-context Explainers: Harnessing Llms For Explaining Black Box Models Kroeger Nicholas, Ley Dan, Krishna Satyapriya, Agarwal Chirag, Lakkaraju Himabindu
- Certifying LLM Safety Against Adversarial Prompting Kumar Aounon, Agarwal Chirag, Srinivas Suraj, Li Aaron Jiaxun, Feizi Soheil, Lakkaraju Himabindu
- Watch Your Language: Investigating Content Moderation With Large Language Models Kumar Deepak, Abuhashem Yousef, Durumeric Zakir
- How Reliable Are Ai-generated-text Detectors? An Assessment Framework Using Evasive Soft Prompts Kumarage Tharindu, Sheth Paras, Moraffah Raha, Garland Joshua, Liu Huan
- Lavip: Language-grounded Visual Prompts Kunananthaseelan Nilakshan, Zhang Jing, Harandi Mehrtash
- Active Instruction Tuning: Improving Cross-task Generalization By Training On Prompt Sensitive Tasks Kung Po-nien, Yin Fan, Wu Di, Chang Kai-wei, Peng Nanyun
- Psychometric Predictive Power Of Large Language Models Kuribayashi Tatsuki, Oseki Yohei, Baldwin Timothy
- Chatgpt: Beginning Of An End Of Manual Linguistic Data Annotation? Use Case Of Automatic Genre Identification Kuzman Taja, Mozetič Igor, Ljubešić Nikola
- In Generative AI We Trust: Can Chatbots Effectively Verify Political Information? Kuznetsova Elizaveta, Makhortykh Mykola, Vziatysheva Victoria, Stolze Martha, Baghumyan Ani, Urman Aleksandra
- Reward Design With Language Models Kwon Minae, Xie Sang Michael, Bullard Kalesha, Sadigh Dorsa
- Chatgpt For Arabic Grammatical Error Correction Kwon Sang Yun, Bhatia Gagan, Nagoudi El Moatez Billah, Abdul-mageed Muhammad
- Beyond English: Evaluating Llms For Arabic Grammatical Error Correction Kwon Sang Yun, Bhatia Gagan, Nagoudi El Moatez Billah, Abdul-mageed Muhammad
- Recallm: An Adaptable Memory Mechanism With Temporal Understanding For Large Language Models Kynoch Brandon, Latapie Hugo, Van Der Sluis Dwane
- Unraveling Chatgpt: A Critical Analysis Of Ai-generated Goal-oriented Dialogues And Annotations Labruna Tiziano, Brenna Sofia, Zaninello Andrea, Magnini Bernardo
- Improving Diversity Of Demographic Representation In Large Language Models Via Collective-critiques And Self-voting Lahoti Preethi, Blumm Nicholas, Ma Xiao, Kotikalapudi Raghavendra, Potluri Sahitya, Tan Qijun, Srinivasan Hansa, Packer Ben, Beirami Ahmad, Beutel Alex, Chen Jilin
- Instruction-following Speech Recognition Lai Cheng-i Jeff, Lu Zhiyun, Cao Liangliang, Pang Ruoming
- Multidimensional Evaluation For Text Style Transfer Using Chatgpt Lai Huiyuan, Toral Antonio, Nissim Malvina
- Mini-dalle3: Interactive Text To Image By Prompting Large Language Models Lai Zeqiang, Zhu Xizhou, Dai Jifeng, Qiao Yu, Wang Wenhai
- Large Language Models Are Partially Primed In Pronoun Interpretation Lam Suet-ying, Zeng Qingcheng, Zhang Kexun, You Chenyu, Voigt Rob
- Improving Zero-shot Visual Question Answering Via Large Language Models With Reasoning Question Prompts Lan Yunshi, Li Xiang, Liu Xin, Li Yang, Qin Wei, Qian Weining
- Open Sesame! Universal Black Box Jailbreaking Of Large Language Models Lapid Raz, Langberg Ron, Sipper Moshe
- Creating Large Language Model Resistant Exams: Guidelines And Strategies Larsen Simon Kaare
- Voicebox: Text-guided Multilingual Universal Speech Generation At Scale Le Matthew, Vyas Apoorv, Shi Bowen, Karrer Brian, Sari Leda, Moritz Rashel, Williamson Mary, Manohar Vimal, Adi Yossi, Mahadeokar Jay, Hsu Wei-ning
- Making Large Language Models Better Data Creators Lee Dong-ho, Pujara Jay, Sewak Mohit, White Ryen W., Jauhar Sujay Kumar
- Read-only Prompt Optimization For Vision-language Few-shot Learning Lee Dongjun, Song Seokwon, Suh Jihee, Choi Joonmyung, Lee Sanghyeok, Kim Hyunwoo J.
- Prompted Llms As Chatbot Modules For Long Open-domain Conversation Lee Gibbeum, Hartmann Volker, Park Jongho, Papailiopoulos Dimitris, Lee Kangwook
- Applying Large Language Models And Chain-of-thought For Automatic Scoring Lee Gyeong-geon, Latif Ehsan, Wu Xuansheng, Liu Ninghao, Zhai Xiaoming
- Is Attention Required For ICL? Exploring The Relationship Between Model Architecture And In-context Learning Ability Lee Ivan, Jiang Nan, Berg-kirkpatrick Taylor
- The Importance Of Prompt Tuning For Automated Neuron Explanations Lee Justin, Oikarinen Tuomas, Chatha Arjun, Chang Keng-chi, Chen Yilan, Weng Tsui-wei
- Teaching Arithmetic To Small Transformers Lee Nayoung, Sreenivasan Kartik, Lee Jason D., Lee Kangwook, Papailiopoulos Dimitris
- Chain Of Empathy: Enhancing Empathetic Response Of Large Language Models Based On Psychotherapy Models Lee Yoon Kyung, Lee Inju, Shin Minjung, Bae Seoyeon, Hahn Sowon
- Large Language Models Can Share Images, Too! Lee Young-jun, Lee Dokyong, Sung Joo Won, Hyeon Jonghwan, Choi Ho-jin
- Violation Of Expectation Via Metacognitive Prompting Reduces Theory Of Mind Prediction Error In Large Language Models Leer Courtland, Trost Vincent, Voruganti Vineeth
- Do We Still Need Clinical Language Models? Lehman Eric, Hernandez Evan, Mahajan Diwakar, Wulff Jonas, Smith Micah J., Ziegler Zachary, Nadler Daniel, Szolovits Peter, Johnson Alistair, Alsentzer Emily
- Boosting Logical Reasoning In Large Language Models Through A New Framework: The Graph Of Thought Lei Bin, Lin Pei-hung, Liao Chunhua, Ding Caiwen
- Chain Of Natural Language Inference For Reducing Large Language Model Ungrounded Hallucinations Lei Deren, Li Yaxi, Hu Mengya, Wang Mingyu, Yun Vincent, Ching Emily, Kamal Eslam
- Hint Of Thought Prompting: An Explainable And Zero-shot Approach To Reasoning Tasks With Llms Lei Ioktong, Deng Zhidong
- Zero-shot Next-item Recommendation Using Large Pretrained Language Models Lei Wang, Ee-peng Lim
- Llm4vis: Explainable Visualization Recommendation Using Chatgpt Lei Wang, Songheng Zhang, Yun Wang, Ee-peng Lim, Yong Wang
- The Language Of Prompting: What Linguistic Properties Make A Prompt Successful? Leidinger Alina, Van Rooij Robert, Shutova Ekaterina
- The Eval4nlp 2023 Shared Task On Prompting Large Language Models As Explainable Metrics Leiter Christoph, Opitz Juri, Deutsch Daniel, Gao Yang, Dror Rotem, Eger Steffen
- Self-detoxifying Language Models Via Toxification Reversal Leong Chak Tou, Cheng Yi, Wang Jiashuo, Wang Jian, Li Wenjie
- Sequential Monte Carlo Steering Of Large Language Models Using Probabilistic Programs Lew Alexander K., Zhi-xuan Tan, Grand Gabriel, Mansinghka Vikash K.
- Susceptibility To Influence Of Large Language Models Lewis D Griffin, Bennett Kleinberg, Maximilian Mozes, Kimberly T Mai, Maria Vau, Matthew Caldwell, Augustine Marvor-parker
- Deceptive Semantic Shortcuts On Reasoning Chains: How Far Can Models Go Without Hallucination? Li Bangzheng, Zhou Ben, Wang Fei, Fu Xingyu, Roth Dan, Chen Muhao
- Deliberate Then Generate: Enhanced Prompting Framework For Text Generation Li Bei, Wang Rui, Guo Junliang, Song Kaitao, Tan Xu, Hassan Hany, Menezes Arul, Xiao Tong, Bian Jiang, Zhu Jingbo
- This Land Is {Your, My} Land: Evaluating Geopolitical Biases In Language Models Li Bryan, Haider Samar, Callison-burch Chris
- Learning To Rewrite Prompts For Personalized Text Generation Li Cheng, Zhang Mingyang, Mei Qiaozhu, Kong Weize, Bendersky Michael
- Mixed Distillation Helps Smaller Language Model Better Reasoning Li Chenglin, Chen Qianglong, Li Liangyue, Wang Caiyu, Li Yicheng, Chen Zulong, Zhang Yin
- Dialogue For Prompting: A Policy-gradient-based Discrete Prompt Generation For Few-shot Learning Li Chengzhengxu, Liu Xiaoming, Wang Yichen, Li Duyi, Lan Yu, Shen Chao
- Explaincpe: A Free-text Explanation Benchmark Of Chinese Pharmacist Examination Li Dongfang, Yu Jindi, Hu Baotian, Xu Zhenran, Zhang Min
- Revisiting Large Language Models As Zero-shot Relation Extractors Li Guozheng, Wang Peng, Ke Wenjun
- CMMLU: Measuring Massive Multitask Language Understanding In Chinese Li Haonan, Zhang Yixuan, Koto Fajri, Yang Yifei, Zhao Hai, Gong Yeyun, Duan Nan, Baldwin Timothy
- Acecoder: Utilizing Existing Code To Enhance Code Generation Li Jia, Zhao Yunfei, Li Yongmin, Li Ge, Jin Zhi
- Mastering Robot Manipulation With Multimodal Prompts Through Pretraining And Multi-task Fine-tuning Li Jiachen, Gao Qiaozi, Johnston Michael, Gao Xiaofeng, He Xuehai, Shakiah Suhaila, Shi Hangjie, Ghanadan Reza, Wang William Yang
- Evcap: Retrieval-augmented Image Captioning With External Visual-name Memory For Open-world Comprehension Li Jiaxuan, Vo Duc Minh, Sugimoto Akihiro, Nakayama Hideki
- Distilling Chatgpt For Explainable Automated Student Answer Assessment Li Jiazheng, Gui Lin, Zhou Yuxiang, West David, Aloisi Cesare, He Yulan
- Overprompt: Enhancing Chatgpt Through Efficient In-context Learning Li Jiazheng, Zhao Runcong, Yang Yongxin, He Yulan, Gui Lin
- MAQA: A Multimodal QA Benchmark For Negation Li Judith Yue, Jansen Aren, Huang Qingqing, Lee Joonseok, Ganti Ravi, Kuzmin Dima
- Fine-tuning Multimodal Llms To Follow Zero-shot Demonstrative Instructions Li Juncheng, Pan Kaihang, Ge Zhiqi, Gao Minghe, Ji Wei, Zhang Wenqiao, Chua Tat-seng, Tang Siliang, Zhang Hanwang, Zhuang Yueting
- Instructcoder: Instruction Tuning Large Language Models For Code Editing Li Kaixin, Hu Qisheng, Zhao Xu, Chen Hui, Xie Yuxi, Liu Tiedong, Xie Qizhe, He Junxian
- M3dbench: Let's Instruct Large Models With Multi-modal 3D Prompts Li Mingsheng, Chen Xin, Zhang Chi, Chen Sijin, Zhu Hongyuan, Yin Fukun, Yu Gang, Chen Tao
- Skillgpt: A Restful API Service For Skill Extraction And Standardization Using A Large Language Model Li Nan, Kang Bo, De Bie Tijl
- Starcoder: May The Source Be With You! Li Raymond, Allal Loubna Ben, Zi Yangtian, Muennighoff Niklas, Kocetkov Denis, Mou Chenghao, Marone Marc, Akiki Christopher, Li Jia, Chim Jenny, Liu Qian, Zheltonozhskii Evgenii, Zhuo Terry Yue, Wang Thomas, Dehaene Olivier, Davaadorj Mishig, Lamy-poirier Joel, Monteiro João, Shliazhko Oleh, Gontier Nicolas, Meade Nicholas, Zebaze Armel, Yee Ming-ho, Umapathi Logesh Kumar, Zhu Jian, Lipkin Benjamin, Oblokulov Muhtasham, Wang Zhiruo, Murthy Rudra, Stillerman Jason, Patel Siva Sankalp, Abulkhanov Dmitry, Zocca Marco, Dey Manan, Zhang Zhihan, Fahmy Nour, Bhattacharyya Urvashi, Yu Wenhao, Singh Swayam, Luccioni Sasha, Villegas Paulo, Kunakov Maxim, Zhdanov Fedor, Romero Manuel, Lee Tony, Timor Nadav, Ding Jennifer, Schlesinger Claire, Schoelkopf Hailey, Ebert Jan, Dao Tri, Mishra Mayank, Gu Alex, Robinson Jennifer, Anderson Carolyn Jane, Dolan-gavitt Brendan, Contractor Danish, Reddy Siva, Fried Daniel, Bahdanau Dzmitry, Jernite Yacine, Ferrandis Carlos Muñoz, Hughes Sean, Wolf Thomas, Guha Arjun, Von Werra Leandro, De Vries Harm
- Are Human-generated Demonstrations Necessary For In-context Learning? Li Rui, Wang Guoyin, Li Jiwei
- A Zero-shot Language Agent For Computer Control With Structured Reflection Li Tao, Li Gang, Deng Zhiwei, Wang Bryan, Li Yang
- Tailoring Personality Traits In Large Language Models Via Unsupervisedly-built Personalized Lexicons Li Tianlong, Dou Shihan, Lv Changze, Liu Wenhao, Xu Jianhan, Wu Muling, Ling Zixuan, Zheng Xiaoqing, Huang Xuanjing
- Crosslingual Retrieval Augmented In-context Learning For Bangla Li Xiaoqian, Nie Ercong, Liang Sheng
- From Classification To Generation: Insights Into Crosslingual Retrieval Augmented ICL Li Xiaoqian, Nie Ercong, Liang Sheng
- Exploring Fine-tuning Chatgpt For News Recommendation Li Xinyi, Zhang Yongfeng, Malthouse Edward C.
- PBNR: Prompt-based News Recommender System Li Xinyi, Zhang Yongfeng, Malthouse Edward C.
- On Bilingual Lexicon Induction With Large Language Models Li Yaoyiran, Korhonen Anna, Vulić Ivan
- Dissecting Chain-of-thought: Compositionality Through In-context Filtering And Learning Li Yingcong, Sreenivasan Kartik, Giannou Angeliki, Papailiopoulos Dimitris, Oymak Samet
- A Survey On Fairness In Large Language Models Li Yingji, Du Mengnan, Song Rui, Wang Xin, Wang Ying
- A Practical Survey On Zero-shot Prompt Design For In-context Learning Li Yinheng
- Prompting Large Language Models For Counterfactual Generation: An Empirical Study Li Yongqi, Xu Mayi, Miao Xin, Zhou Shen, Qian Tieyun
- Textbooks Are All You Need II: Phi-1.5 Technical Report Li Yuanzhi, Bubeck Sébastien, Eldan Ronen, Del Giorno Allie, Gunasekar Suriya, Lee Yin Tat
- SPELL: Semantic Prompt Evolution Based On A LLM Li Yujian Betterest, Wu Kai
- Guiding Large Language Models Via Directional Stimulus Prompting Li Zekun, Peng Baolin, He Pengcheng, Galley Michel, Gao Jianfeng, Yan Xifeng
- Split And Merge: Aligning Position Biases In Large Language Model Based Evaluators Li Zongjie, Wang Chaozheng, Ma Pingchuan, Wu Daoyuan, Wang Shuai, Gao Cuiyun, Liu Yang
- Label Supervised Llama Finetuning Li Zongxi, Li Xianming, Liu Yuzhang, Xie Haoran, Li Jing, Wang Fu-lee, Li Qing, Zhong Xiaoqin
- Batgpt: A Bidirectional Autoregessive Talker From Generative Pre-trained Transformer Li Zuchao, Zhang Shitou, Zhao Hai, Yang Yifei, Yang Dongjie
- Llm-grounded Diffusion: Enhancing Prompt Understanding Of Text-to-image Diffusion Models With Large Language Models Lian Long, Li Boyi, Yala Adam, Darrell Trevor
- Beyond Factuality: A Comprehensive Evaluation Of Large Language Models As Knowledge Generators Liang Chen, Yang Deng, Yatao Bian, Zeyu Qin, Bingzhe Wu, Tat-seng Chua, Kam-fai Wong
- Exploring Format Consistency For Instruction Tuning Liang Shihao, Tian Runchu, Zhu Kunlun, Qin Yujia, Wang Huadong, Cong Xin, Liu Zhiyuan, Liu Xiaojiang, Sun Maosong
- GPT Detectors Are Biased Against Non-native English Writers Liang Weixin, Yuksekgonul Mert, Mao Yining, Wu Eric, Zou James
- Prompting Large Language Models With Chain-of-thought For Few-shot Knowledge Base Question Generation Liang Yuanyuan, Wang Jianing, Zhu Hanlun, Wang Lei, Qian Weining, Lan Yunshi
- Text-to-image Generation For Abstract Concepts Liao Jiayi, Chen Xu, Fu Qiang, Du Lun, He Xiangnan, Wang Xiang, Han Shi, Zhang Dongmei
- A Survey On Large Language Models For Recommendation Likang Wu, Zhi Zheng, Zhaopeng Qiu, Hao Wang, Hongchao Gu, Tingjia Shen, Chuan Qin, Chen Zhu, Hengshu Zhu, Qi Liu, Hui Xiong, Enhong Chen
- The Unlocking Spell On Base Llms: Rethinking Alignment Via In-context Learning Lin Bill Yuchen, Ravichander Abhilasha, Lu Ximing, Dziri Nouha, Sclar Melanie, Chandu Khyathi, Bhagavatula Chandra, Choi Yejin
- Paralinguistics-enhanced Large Language Modeling Of Spoken Dialogue Lin Guan-ting, Shivakumar Prashanth Gurunath, Gandhe Ankur, Yang Chao-han Huck, Gu Yile, Ghosh Shalini, Stolcke Andreas, Lee Hung-yi, Bulyko Ivan
- Videodirectorgpt: Consistent Multi-scene Video Generation Via Llm-guided Planning Lin Han, Zala Abhay, Cho Jaemin, Bansal Mohit
- VILA: On Pre-training For Visual Language Models Lin Ji, Yin Hongxu, Ping Wei, Lu Yao, Molchanov Pavlo, Tao Andrew, Mao Huizi, Kautz Jan, Shoeybi Mohammad, Han Song
- Batchprompt: Accomplish More With Less Lin Jianzhe, Diesendruck Maurice, Du Liang, Abraham Robin
- Few-shot Adaptation For Parsing Contextual Utterances With Llms Lin Kevin, Xia Patrick, Fang Hao
- Just Ask One More Time! Self-agreement Improves Reasoning Of Language Models In (Almost) All Scenarios Lin Lei, Fu Jiayi, Liu Pengli, Li Qingyang, Gong Yan, Wan Junchen, Zhang Fuzheng, Wang Zhongyuan, Zhang Di, Gai Kun
- Towards Language-guided Interactive 3D Generation: Llms As Layout Interpreter With Generative Feedback Lin Yiqi, Wu Hao, Wang Ruichen, Lu Haonan, Lin Xiaodong, Xiong Hui, Wang Lin
- Unleashing The Creative Mind: Language Model As Hierarchical Policy For Improved Exploration On Challenging Problem Solving Ling Zhan, Fang Yunhao, Li Xuanlin, Mu Tongzhou, Lee Mingu, Pourreza Reza, Memisevic Roland, Su Hao
- Recprompt: A Prompt Tuning Framework For News Recommendation Using Large Language Models Liu Dairui, Yang Boming, Du Honghui, Greene Derek, Lawlor Aonghus, Dong Ruihai, Li Irene
- Logicot: Logical Chain-of-thought Instruction-tuning Liu Hanmeng, Teng Zhiyang, Cui Leyang, Zhang Chaoli, Zhou Qiji, Zhang Yue
- Alltogether: Investigating The Efficacy Of Spliced Prompt For Web Navigation Using Large Language Models Liu Jiarun, Hu Wentao, Zhang Chunhong
- Retrieval-based Knowledge Transfer: An Effective Approach For Extreme Large Language Model Compression Liu Jiduan, Liu Jiahao, Wang Qifan, Wang Jingang, Cai Xunliang, Zhao Dongyan, Wang Ran Lucien, Yan Rui
- Llm-powered Hierarchical Language Agent For Real-time Human-ai Coordination Liu Jijia, Yu Chao, Gao Jiaxuan, Xie Yuqing, Liao Qingmin, Wu Yi, Wang Yu
- Chatcounselor: A Large Language Models For Mental Health Support Liu June M., Li Donghao, Cao He, Ren Tianhe, Liao Zeyi, Wu Jiamin
- Concise And Organized Perception Facilitates Reasoning In Large Language Models Liu Junjie, Yan Shaotian, Shen Chen, Xie Liang, Wang Wenxiao, Ye Jieping
- Nlebench+norglm: A Comprehensive Empirical Analysis And Benchmark Dataset For Generative Language Models In Norwegian Liu Peng, Zhang Lemei, Farup Terje Nissen, Lauvrak Even W., Ingvaldsen Jon Espen, Eide Simen, Gulla Jon Atle, Yang Zhirong
- Exploring The Boundaries Of GPT-4 In Radiology Liu Qianchu, Hyland Stephanie, Bannur Shruthi, Bouzid Kenza, Castro Daniel C., Wetscherek Maria Teodora, Tinn Robert, Sharma Harshita, Pérez-garcía Fernando, Schwaighofer Anton, Rajpurkar Pranav, Khanna Sameer Tajdin, Poon Hoifung, Usuyama Naoto, Thieme Anja, Nori Aditya V., Lungren Matthew P., Oktay Ozan, Alvarez-valle Javier
- In-context Vectors: Making In Context Learning More Effective And Controllable Through Latent Space Steering Liu Sheng, Ye Haotian, Xing Lei, Zou James
- Chatgpt-powered Conversational Drug Editing Using Retrieval And Domain Feedback Liu Shengchao, Wang Jiongxiao, Yang Yijin, Wang Chengpeng, Liu Ling, Guo Hongyu, Xiao Chaowei
- Language Models As Black-box Optimizers For Vision-language Models Liu Shihong, Lin Zhiqiu, Yu Samuel, Lee Ryan, Ling Tiffany, Pathak Deepak, Ramanan Deva
- Rethinking Tabular Data Understanding With Large Language Models Liu Tianyang, Wang Fei, Chen Muhao
- MMHQA-ICL: Multimodal In-context Learning For Hybrid Question Answering Over Text, Tables And Images Liu Weihao, Lei Fangyu, Luo Tongxu, Lei Jiahe, He Shizhu, Zhao Jun, Liu Kang
- Class Incremental Learning With Pre-trained Vision-language Models Liu Xialei, Cao Xusheng, Lu Haori, Xiao Jia-wen, Bagdanov Andrew D., Cheng Ming-ming
- Beyond Text: Unveiling Multimodal Proficiency Of Large Language Models With Multiapi Benchmark Liu Xiao, Lin Jianfeng, Zhang Jiawei
- The Magic Of IF: Investigating Causal Reasoning Abilities In Large Language Models Of Code Liu Xiao, Yin Da, Zhang Chen, Feng Yansong, Zhao Dongyan
- Mm-safetybench: A Benchmark For Safety Evaluation Of Multimodal Large Language Models Liu Xin, Zhu Yichen, Gu Jindong, Lan Yunshi, Yang Chao, Qiao Yu
- Divide And Prompt: Chain Of Thought Prompting For Text-to-sql Liu Xiping, Tan Zhao
- Wavjourney: Compositional Audio Creation With Large Language Models Liu Xubo, Zhu Zhongkai, Liu Haohe, Yuan Yi, Cui Meng, Huang Qiushi, Liang Jinhua, Cao Yin, Kong Qiuqiang, Plumbley Mark D., Wang Wenwu
- Gumsum: Multi-genre Data And Evaluation For English Abstractive Summarization Liu Yang Janet, Zeldes Amir
- Jailbreaking Chatgpt Via Prompt Engineering: An Empirical Study Liu Yi, Deng Gelei, Xu Zhengzi, Li Yuekang, Zheng Yaowen, Zhang Ying, Zhao Lida, Zhang Tianwei, Wang Kailong, Liu Yang
- Robustness Over Time: Understanding Adversarial Examples' Effectiveness On Longitudinal Versions Of Large Language Models Liu Yugeng, Cong Tianshuo, Zhao Zhengyu, Backes Michael, Shen Yun, Zhang Yang
- Calibrating Llm-based Evaluator Liu Yuxuan, Yang Tianchi, Huang Shaohan, Zhang Zihan, Huang Haizhen, Wei Furu, Deng Weiwei, Sun Feng, Zhang Qi
- Reason For Future, Act For Now: A Principled Framework For Autonomous LLM Agents With Provable Sample Efficiency Liu Zhihan, Hu Hao, Zhang Shenao, Guo Hongyi, Ke Shuqi, Liu Boyi, Wang Zhaoran
- Deja Vu: Contextual Sparsity For Efficient Llms At Inference Time Liu Zichang, Wang Jue, Dao Tri, Zhou Tianyi, Yuan Binhang, Song Zhao, Shrivastava Anshumali, Zhang Ce, Tian Yuandong, Re Christopher, Chen Beidi
- LLM Comparative Assessment: Zero-shot NLG Evaluation Through Pairwise Comparisons Using Large Language Models Liusie Adian, Manakul Potsawee, Gales Mark J. F.
- Fully Autonomous Programming With Large Language Models Liventsev Vadim, Grishina Anastasiia, Härmä Aki, Moonen Leon
- Exploring Effectiveness Of GPT-3 In Grammatical Error Correction: A Study On Performance And Controllability In Prompt-based Methods Loem Mengsay, Kaneko Masahiro, Takase Sho, Okazaki Naoaki
- Harnessing Large Language Models' Empathetic Response Generation Capabilities For Online Mental Health Counselling Support Loh Siyuan Brandon, Raamkumar Aravind Sesagiri
- Data-to-text Generation For Severely Under-resourced Languages With GPT-3.5: A Bit Of Help Needed From Google Translate Lorandi Michela, Belz Anya
- Exploring The Sensitivity Of Llms' Decision-making Capabilities: Insights From Prompt Variation And Hyperparameters Loya Manikanta, Sinha Divya Anand, Futrell Richard
- Bounding The Capabilities Of Large Language Models In Open Text Generation With Prompt Constraints Lu Albert, Zhang Hongxin, Zhang Yanzhe, Wang Xuezhi, Yang Diyi
- EPA: Easy Prompt Augmentation On Large Language Models Via Multiple Sources And Multiple Targets Lu Hongyuan, Lam Wai
- Chain-of-dictionary Prompting Elicits Translation In Large Language Models Lu Hongyuan, Yang Haoran, Huang Haoyang, Zhang Dongdong, Lam Wai, Wei Furu
- Unified-io 2: Scaling Autoregressive Multimodal Models With Vision, Language, Audio, And Action Lu Jiasen, Clark Christopher, Lee Sangho, Zhang Zichen, Khosla Savya, Marten Ryan, Hoiem Derek, Kembhavi Aniruddha
- Ziya-visual: Bilingual Large Vision-language Model Via Multi-task Instruction Tuning Lu Junyu, Zhang Dixiang, Wu Xiaojun, Gao Xinyu, Gan Ruyi, Zhang Jiaxing, Song Yan, Zhang Pingjian
- Large Language Models Can Be Guided To Evade Ai-generated Text Detection Lu Ning, Liu Shengcai, He Rui, Wang Qi, Ong Yew-soon, Tang Ke
- Error Analysis Prompting Enables Human-like Translation Evaluation In Large Language Models Lu Qingyu, Qiu Baopu, Ding Liang, Zhang Kanjian, Kocmi Tom, Tao Dacheng
- Inference-time Policy Adapters (IPA): Tailoring Extreme-scale Lms Without Fine-tuning Lu Ximing, Brahman Faeze, West Peter, Jang Jaehun, Chandu Khyathi, Ravichander Abhilasha, Qin Lianhui, Ammanabrolu Prithviraj, Jiang Liwei, Ramnath Sahana, Dziri Nouha, Fisher Jillian, Lin Bill Yuchen, Hallinan Skyler, Ren Xiang, Welleck Sean, Choi Yejin
- SCITAB: A Challenging Benchmark For Compositional Reasoning And Claim Verification On Scientific Tables Lu Xinyuan, Pan Liangming, Liu Qian, Nakov Preslav, Kan Min-yen
- Illuminating The Black Box: A Psychometric Investigation Into The Multifaceted Nature Of Large Language Models Lu Yang, Yu Jordan, Huang Shou-hsuan Stephen
- RTLLM: An Open-source Benchmark For Design RTL Generation With Large Language Model Lu Yao, Liu Shang, Zhang Qijun, Xie Zhiyao
- Multimodal Procedural Planning Via Dual Text-image Prompting Lu Yujie, Lu Pan, Chen Zhiyu, Zhu Wanrong, Wang Xin Eric, Wang William Yang
- UI Layout Generation With Llms Guided By UI Grammar Lu Yuwen, Tong Ziang, Zhao Qinyi, Zhang Chengzhi, Li Toby Jia-jun
- Effectively Fine-tune To Improve Large Multimodal Models For Radiology Report Generation Lu Yuzhe, Hong Sungmin, Shah Yash, Xu Panpan
- Fighting Fire With Fire: The Dual Role Of Llms In Crafting And Detecting Elusive Disinformation Lucas Jason, Uchendu Adaku, Yamashita Michiharu, Lee Jooyoung, Rohatgi Shaurya, Lee Dongwon
- Enhancing Black-box Few-shot Text Classification With Prompt-based Data Augmentation Luo Danqing, Zhang Chen, Xu Jiahui, Wang Bin, Chen Yiming, Zhang Yan, Li Haizhou
- Prosg: Using Prompt Synthetic Gradients To Alleviate Prompt Forgetting Of Rnn-like Language Models Luo Haotian, Wu Kunming, Dai Cheng, Ding Sixian, Chen Xinhao
- Exploring Small Language Models With Prompt-learning Paradigm For Efficient Domain-specific Text Classification Luo Hengyu, Liu Peng, Esping Stefan
- Dr.icl: Demonstration-retrieved In-context Learning Luo Man, Xu Xin, Dai Zhuyun, Pasupat Panupong, Kazemi Mehran, Baral Chitta, Imbrasaite Vaiva, Zhao Vincent Y.
- Recranker: Instruction Tuning Large Language Model As Ranker For Top-k Recommendation Luo Sichun, He Bowei, Zhao Haohan, Shao Wei, Qi Yanlin, Huang Yinya, Zhou Aojun, Yao Yuxuan, Li Zongpeng, Xiao Yuanzhang, Zhan Mingjie, Song Linqi
- Hrot: Hybrid Prompt Strategy And Retrieval Of Thought For Table-text Hybrid Question Answering Luo Tongxu, Lei Fangyu, Lei Jiahe, Liu Weihao, He Shizhu, Zhao Jun, Liu Kang
- Unlocking The Potential Of Large Language Models For Explainable Recommendations Luo Yucong, Cheng Mingyue, Zhang Hao, Lu Junyu, Liu Qi, Chen Enhong
- KOSMOS-2.5: A Multimodal Literate Model Lv Tengchao, Huang Yupan, Chen Jingye, Zhao Yuzhong, Jia Yilin, Cui Lei, Ma Shuming, Chang Yaoyao, Huang Shaohan, Wang Wenhui, Dong Li, Luo Weiyao, Wu Shaoxiang, Wang Guoxin, Zhang Cha, Wei Furu
- Comparative Analysis Of Drug-gpt And Chatgpt Llms For Healthcare Insights: Evaluating Accuracy And Relevance In Patient And HCP Contexts Lysandrou Giorgos, Owen Roma English, Mursec Kirsty, Brun Grant Le, Fairley Elizabeth A. L.
- A Paradigm Shift: The Future Of Machine Translation Lies With Large Language Models Lyu Chenyang, Du Zefeng, Xu Jitao, Duan Yitao, Wu Minghao, Lynn Teresa, Aji Alham Fikri, Wong Derek F., Liu Siyou, Wang Longyue
- Llm-rec: Personalized Recommendation Via Prompting Large Language Models Lyu Hanjia, Jiang Song, Zeng Hanqing, Xia Yinglong, Wang Qifan, Zhang Si, Chen Ren, Leung Christopher, Tang Jiajie, Luo Jiebo
- Translating Radiology Reports Into Plain Language Using Chatgpt And GPT-4 With Prompt Learning: Promising Results, Limitations, And Potential Lyu Qing, Tan Josh, Zapadka Michael E., Ponnatapura Janardhana, Niu Chuang, Myers Kyle J., Wang Ge, Whitlow Christopher T.
- Fairness-guided Few-shot Prompting For Large Language Models Ma Huan, Zhang Changqing, Bian Yatao, Liu Lemao, Zhang Zhirui, Zhao Peilin, Zhang Shu, Fu Huazhu, Hu Qinghua, Wu Bingzhe
- POS: A Prompts Optimization Suite For Augmenting Text-to-video Generation Ma Shijie, Xu Huayi, Li Mengjian, Geng Weidong, Wang Yaxiong, Wang Meng
- (why) Is My Prompt Getting Worse? Rethinking Regression Testing For Evolving LLM Apis Ma Wanqin, Yang Chenyang, Kästner Christian
- Let's Do A Thought Experiment: Using Counterfactuals To Improve Moral Reasoning Ma Xiao, Mishra Swaroop, Beirami Ahmad, Beutel Alex, Chen Jilin
- Tomchallenges: A Principle-guided Dataset And Diverse Evaluation Tasks For Exploring Theory Of Mind Ma Xiaomeng, Gao Lingyu, Xu Qihui
- Query Rewriting For Retrieval-augmented Large Language Models Ma Xinbei, Gong Yeyun, He Pengcheng, Zhao Hai, Duan Nan
- Large Language Model Is Not A Good Few-shot Information Extractor, But A Good Reranker For Hard Samples! Ma Yubo, Cao Yixin, Hong Yongching, Sun Aixin
- Sci-cot: Leveraging Large Language Models For Enhanced Knowledge Distillation In Small Models For Scientific QA Ma Yuhan, Jiang Haiqi, Fan Chenyou
- Exploring Prompting Large Language Models As Explainable Metrics Mahmoudi Ghazaleh
- Harnessing The Power Of Prompt-based Techniques For Generating School-level Questions Using Large Language Models Maity Subhankar, Deroy Aniket, Sarkar Sudeshna
- How Can Large Language Models Help Humans In Design And Manufacturing? Makatura Liane, Foshey Michael, Wang Bohan, Hähnlein Felix, Ma Pingchuan, Deng Bolei, Tjandrasuwita Megan, Spielberg Andrew, Owens Crystal Elaine, Chen Peter Yichen, Zhao Allan, Zhu Amy, Norton Wil J, Gu Edward, Jacob Joshua, Li Yifei, Schulz Adriana, Matusik Wojciech
- Fine-tuning Language Models With Just Forward Passes Malladi Sadhika, Gao Tianyu, Nichani Eshaan, Damian Alex, Lee Jason D., Chen Danqi, Arora Sanjeev
- Aligning Large Language Models For Clinical Tasks Manathunga Supun, Hettigoda Isuru
- Enhancing CLIP With GPT-4: Harnessing Visual Descriptions As Prompts Maniparambil Mayug, Vorster Chris, Molloy Derek, Murphy Noel, Mcguinness Kevin, O'connor Noel E.
- Zero-shot Conversational Summarization Evaluations With Small Large Language Models Manuvinakurike Ramesh, Sahay Saurav, Manepalli Sangeeta, Nachman Lama
- Do Prompt Positions Really Matter? Mao Junyu, Middleton Stuart E., Niranjan Mahesan
- Large Language Models Know Your Contextual Search Intent: A Prompting Framework For Conversational Search Mao Kelong, Dou Zhicheng, Mo Fengran, Hou Jiewen, Chen Haonan, Qian Hongjin
- ICL Markup: Structuring In-context Learning Using Soft-token Tags Marc-etienne Brunet, Ashton Anderson, Richard Zemel
- Investigating The Efficacy Of Large Language Models In Reflective Assessment Methods Through Chain Of Thoughts Prompting Masikisiki Baphumelele, Marivate Vukosi, Hlope Yvette
- Improving Automatic VQA Evaluation Using Large Language Models Mañas Oscar, Krojer Benno, Agrawal Aishwarya
- JAB: Joint Adversarial Prompting And Belief Augmentation Mehrabi Ninareh, Goyal Palash, Ramakrishna Anil, Dhamala Jwala, Ghosh Shalini, Zemel Richard, Chang Kai-wei, Galstyan Aram, Gupta Rahul
- Simple LLM Prompting Is State-of-the-art For Robust And Multilingual Dialogue Evaluation Mendonça John, Pereira Patrícia, Moniz Helena, Carvalho João Paulo, Lavie Alon, Trancoso Isabel
- For Generated Text, Is Nli-neutral Text The Best Text? Mersinias Michail, Mahowald Kyle
- From Bytes To Biases: Investigating The Cultural Self-perception Of Large Language Models Messner Wolfgang, Greene Tatum, Matalone Josephine
- An AI Chatbot For Explaining Deep Reinforcement Learning Decisions Of Service-oriented Systems Metzger Andreas, Bartel Jone, Laufer Jan
- Developing A Scalable Benchmark For Assessing Large Language Models In Knowledge Graph Engineering Meyer Lars-peter, Frey Johannes, Junghanns Kurt, Brei Felix, Bulert Kirill, Gründer-fahrer Sabine, Martin Michael
- Self-checker: Plug-and-play Modules For Fact-checking With Large Language Models Miaoran Li, Baolin Peng, Michel Galley, Jianfeng Gao, Zhu Zhang
- MEDIMP: 3D Medical Images With Clinical Prompts From Limited Tabular Data For Renal Transplantation Milecki Leo, Kalogeiton Vicky, Bodard Sylvain, Anglicheau Dany, Correas Jean-michel, Timsit Marc-olivier, Vakalopoulou Maria
- PEFTT: Parameter-efficient Fine-tuning For Low-resource Tibetan Pre-trained Language Models Mingjun Zhou, Zhuoma Daiqing, Nuo Qun, Tashi Nyima
- PRE: Vision-language Prompt Learning With Reparameterization Encoder Minh Anh Pham Thi, Nguyen An Duc, Tzimiropoulos Georgios
- TAP: Targeted Prompting For Task Adaptive Generation Of Textual Training Instances For Visual Classification Mirza M. Jehanzeb, Karlinsky Leonid, Lin Wei, Possegger Horst, Feris Rogerio, Bischof Horst
- Stress Testing Chain-of-thought Prompting For Large Language Models Mishra Aayush, Thakkar Karan
- Prompting With Pseudo-code Instructions Mishra Mayank, Kumar Prince, Bhat Riyaz, Murthy Rudra V, Contractor Danish, Tamilselvam Srikanth
- Triggering Multi-hop Reasoning For Question Answering In Language Models Using Soft Prompts And Random Walks Misra Kanishka, Santos Cicero Nogueira Dos, Shakeri Siamak
- Comparing Humans, GPT-4, And GPT-4V On Abstraction And Reasoning Tasks Mitchell Melanie, Palmarini Alessandro B., Moskvichev Arseny
- Compositional Chain-of-thought Prompting For Large Multimodal Models Mitra Chancharik, Huang Brandon, Darrell Trevor, Herzig Roei
- State Of What Art? A Call For Multi-prompt LLM Evaluation Mizrahi Moran, Kaplan Guy, Malkin Dan, Dror Rotem, Shahaf Dafna, Stanovsky Gabriel
- How Trustworthy Are Open-source Llms? An Assessment Under Malicious Demonstrations Shows Their Vulnerabilities Mo Lingbo, Wang Boshi, Chen Muhao, Sun Huan
- Tree Of Uncertain Thoughts Reasoning For Large Language Models Mo Shentong, Xin Miao
- Boosting Theory-of-mind Performance In Large Language Models Via Prompting Moghaddam Shima Rahimi, Honey Christopher J.
- Interpreting User Requests In The Context Of Natural Language Standing Instructions Moghe Nikita, Xia Patrick, Andreas Jacob, Eisner Jason, Van Durme Benjamin, Jhamtani Harsh
- Assigning AI: Seven Approaches For Students, With Prompts Mollick Ethan, Mollick Lilach
- Evaluating Cognitive Maps And Planning In Large Language Models With Cogeval Momennejad Ida, Hasanbeig Hosein, Vieira Felipe, Sharma Hiteshi, Ness Robert Osazuwa, Jojic Nebojsa, Palangi Hamid, Larson Jonathan
- Metavl: Transferring In-context Learning Ability From Language Models To Vision-language Models Monajatipoor Masoud, Li Liunian Harold, Rouhsedaghat Mozhdeh, Yang Lin F., Chang Kai-wei
- Llms Killed The Script Kiddie: How Agents Supported By Large Language Models Change The Landscape Of Network Threat Testing Moskal Stephen, Laney Sam, Hemberg Erik, O'reilly Una-may
- Adaptive Machine Translation With Large Language Models Moslem Yasmin, Haque Rejwanul, Kelleher John D., Way Andy
- Fine-tuning Large Language Models For Adaptive Machine Translation Moslem Yasmin, Haque Rejwanul, Way Andy
- Gradient-based Automated Iterative Recovery For Parameter-efficient Tuning Mozes Maximilian, Bolukbasi Tolga, Yuan Ann, Liu Frederick, Thain Nithum, Dixon Lucas
- Learning To Compress Prompts With Gist Tokens Mu Jesse, Li Xiang Lisa, Goodman Noah
- Navigating Prompt Complexity For Zero-shot Classification: A Study Of Large Language Models In Computational Social Science Mu Yida, Wu Ben P., Thorne William, Robinson Ambrose, Aletras Nikolaos, Scarton Carolina, Bontcheva Kalina, Song Xingyi
- Augmenting Large Language Model Translators Via Translation Memories Mu Yongyu, Reheman Abudurexiti, Cao Zhiquan, Fan Yuchun, Li Bei, Li Yinqiao, Xiao Tong, Zhang Chunliang, Zhu Jingbo
- In-context Learning Generalizes, But Not Always Robustly: The Case Of Syntax Mueller Aaron, Webson Albert, Petty Jackson, Linzen Tal
- A Brief History Of Prompt: Leveraging Language Models. (through Advanced Prompting) Muktadir Golam Md
- LLM4VV: Developing Llm-driven Testsuite For Compiler Validation Munley Christian, Jarmusch Aaron, Chandrasekaran Sunita
- Testing Llms On Code Generation With Varying Levels Of Prompt Specificity Murr Lincoln, Grainger Morgan, Gao David
- Consistency Analysis Of Chatgpt Myeongjun Erik Jang, Thomas Lukasiewicz
- PEARL: Personalizing Large Language Model Writing Assistants With Generation-calibrated Retrievers Mysore Sheshera, Lu Zhuoran, Wan Mengting, Yang Longqi, Menezes Steve, Baghaee Tina, Gonzalez Emmanuel Barajas, Neville Jennifer, Safavi Tara
- Large Language Model Augmented Narrative Driven Recommendations Mysore Sheshera, Mccallum Andrew, Zamani Hamed
- Self-contradictory Hallucinations Of Large Language Models: Evaluation, Detection And Mitigation Mündler Niels, He Jingxuan, Jenko Slobodan, Vechev Martin
- Diversity Of Thought Improves Reasoning Abilities Of Llms Naik Ranjita, Chandrasekaran Varun, Yuksekgonul Mert, Palangi Hamid, Nushi Besmira
- Generating Medically-accurate Summaries Of Patient-provider Dialogue: A Multi-stage Approach Using Large Language Models Nair Varun, Schumacher Elliot, Kannan Anitha
- Using An LLM To Help With Code Understanding Nam Daye, Macvean Andrew, Hellendoorn Vincent, Vasilescu Bogdan, Myers Brad
- Breaking Language Barriers With A LEAP: Learning Strategies For Polyglot Llms Nambi Akshay, Balloli Vaibhav, Ranjit Mercy, Ganu Tanuja, Ahuja Kabir, Sitaram Sunayana, Bali Kalika
- Enhancing Few-shot Text-to-sql Capabilities Of Large Language Models: A Study On Prompt Design Strategies Nan Linyong, Zhao Yilun, Zou Weijin, Ri Narutatsu, Tae Jaesung, Zhang Ellen, Cohan Arman, Radev Dragomir
- Large Language Models As Tax Attorneys: A Case Study In Legal Capabilities Emergence Nay John J., Karamardian David, Lawsky Sarah B., Tao Wenting, Bhat Meghana, Jain Raghav, Lee Aaron Travis, Choi Jonathan H., Kasai Jungo
- Multi-dimensional Data Refining Strategy For Effective Fine-tuning Llms Ngoc Thanh Nguyen, Tran Quang Nhat, Tang Arthur, Nguyen Bao, Nguyen Thuy, Pham Thanh
- Cof-cot: Enhancing Large Language Models With Coarse-to-fine Chain-of-thought Prompting For Multi-domain NLU Tasks Nguyen Hoang H., Liu Ye, Zhang Chenwei, Zhang Tao, Yu Philip S.
- In-context Example Selection With Influences Nguyen Tai, Wong Eric
- Democratizing Llms For Low-resource Languages By Leveraging Their English Dominant Abilities With Linguistically-diverse Prompts Nguyen Xuan-phi, Aljunied Sharifah Mahani, Joty Shafiq, Bing Lidong
- L2ceval: Evaluating Language-to-code Generation Capabilities Of Large Language Models Ni Ansong, Yin Pengcheng, Zhao Yilun, Riddell Martin, Feng Troy, Shen Rui, Yin Stephen, Liu Ye, Yavuz Semih, Xiong Caiming, Joty Shafiq, Zhou Yingbo, Radev Dragomir, Cohan Arman
- ORES: Open-vocabulary Responsible Visual Synthesis Ni Minheng, Wu Chenfei, Wang Xiaodong, Yin Shengming, Wang Lijuan, Liu Zicheng, Duan Nan
- Code-style In-context Learning For Knowledge-based Question Answering Nie Zhijie, Zhang Richong, Wang Zhongyuan, Liu Xudong
- Gpt4image: Can Large Pre-trained Models Help Vision Models On Perception Tasks? Ning Ding, Yehui Tang, Zhongqian Fu, Chao Xu, Kai Han, Yunhe Wang
- Ragtruth: A Hallucination Corpus For Developing Trustworthy Retrieval-augmented Language Models Niu Cheng, Wu Yuanhao, Zhu Juno, Xu Siliang, Shum Kashun, Zhong Randy, Song Juntong, Zhang Tong
- AI Text-to-behavior: A Study In Steerability Noever David, Hyams Sam
- Capabilities Of GPT-4 On Medical Challenge Problems Nori Harsha, King Nicholas, Mckinney Scott Mayer, Carignan Dean, Horvitz Eric
- Evaluating GPT-3.5 And GPT-4 Models On Brazilian University Admission Exams Nunes Desnes, Primi Ricardo, Pires Ramon, Lotufo Roberto, Nogueira Rodrigo
- Adversarial Fine-tuning Of Language Models: An Iterative Optimisation Approach For The Generation And Detection Of Problematic Content O'neill Charles, Miller Jack, Ciuca Ioana, Ting Yuan-sen, Bui Thang
- Steering Language Generation: Harnessing Contrastive Expert Guidance And Negative Prompting For Coherent And Diverse Synthetic Data Generation O'neill Charles, Ting Yuan-sen, Ciuca Ioana, Miller Jack, Bui Thang
- Blackvip: Black-box Visual Prompting For Robust Transfer Learning Oh Changdae, Hwang Hyeji, Lee Hee-young, Lim Yongtaek, Jung Geunyoung, Jung Jiyoung, Choi Hosik, Song Kyungwoo
- Data Augmentation For Neural Machine Translation Using Generative Language Model Oh Seokjin, Lee Su Ah, Jung Woohwan
- Askit: Unified Programming Interface For Programming With Large Language Models Okuda Katsumi, Amarasinghe Saman
- LINC: A Neurosymbolic Approach For Logical Reasoning By Combining Language Models With First-order Logic Provers Olausson Theo X., Gu Alex, Lipkin Benjamin, Zhang Cedegao E., Solar-lezama Armando, Tenenbaum Joshua B., Levy Roger
- Generative AI: Implications And Applications For Education Tzirides Anastasia Olga (Olnancy), Saini Akash, Zapata Gabriela, Searsmith Duane, Cope Bill, Kalantzis Mary, Castro Vania, Kourkoulou Theodora, Jones John, Da Silva Rodrigo Abrantes, Whiting Jen, Kastania Nikoleta Polyxeni
- Automated Assessment Of Students' Code Comprehension Using Llms Oli Priti, Banjade Rabin, Chapagain Jeevan, Rus Vasile
- The Behavior Of Large Language Models When Prompted To Generate Code Explanations Oli Priti, Banjade Rabin, Chapagain Jeevan, Rus Vasile
- Large Language Models Vote: Prompting For Rare Disease Identification Oniani David, Hilsman Jordan, Dong Hang, Gao Fengyi, Verma Shiven, Wang Yanshan
- Text Style Transfer Evaluation Using Large Language Models Ostheimer Phil, Nagda Mayank, Kloft Marius, Fellenz Sophie
- Thoughtsource: A Central Hub For Large Language Model Reasoning Data Ott Simon, Hebenstreit Konstantin, Liévin Valentin, Hother Christoffer Egeberg, Moradi Milad, Mayrhauser Maximilian, Praas Robert, Winther Ole, Samwald Matthias
- Dialogbench: Evaluating Llms As Human-like Dialogue Systems Ou Jiao, Lu Junda, Liu Che, Tang Yihong, Zhang Fuzheng, Zhang Di, Gai Kun
- Autoplan: Automatic Planning Of Interactive Decision-making Tasks With Large Language Models Ouyang Siqi, Li Lei
- Structured Chemistry Reasoning With Large Language Models Ouyang Siru, Zhang Zhuosheng, Yan Bing, Liu Xuan, Choi Yejin, Han Jiawei, Qin Lianhui
- On The Role Of Attention In Prompt-tuning Oymak Samet, Rawat Ankit Singh, Soltanolkotabi Mahdi, Thrampoulidis Christos
- How To Catch An AI Liar: Lie Detection In Black-box Llms By Asking Unrelated Questions Pacchiardi Lorenzo, Chan Alex J., Mindermann Sören, Moscovitz Ilan, Pan Alexa Y., Gal Yarin, Evans Owain, Brauner Jan
- Ontochatgpt Information System: Ontology-driven Structured Prompts For Chatgpt Meta-learning Palagin Oleksandr, Kaverinskiy Vladislav, Litvin Anna, Malakhov Kyrylo
- Understanding The Capabilities Of Large Language Models For Automated Planning Pallagani Vishal, Muppasani Bharath, Murugesan Keerthiram, Rossi Francesca, Srivastava Biplav, Horesh Lior, Fabiano Francesco, Loreggia Andrea
- What In-context Learning "learns" In-context: Disentangling Task Recognition And Task Learning Pan Jane, Gao Tianyu, Chen Howard, Chen Danqi
- COSMIC: Data Efficient Instruction-tuning For Speech In-context Learning Pan Jing, Wu Jian, Gaur Yashesh, Sivasankaran Sunit, Chen Zhuo, Liu Shujie, Li Jinyu
- Self-supervised Meta-prompt Learning With Meta-gradient Regularization For Few-shot Generalization Pan Kaihang, Li Juncheng, Song Hongye, Lin Jun, Liu Xiaozhong, Tang Siliang
- Logic-lm: Empowering Large Language Models With Symbolic Solvers For Faithful Logical Reasoning Pan Liangming, Albalak Alon, Wang Xinyi, Wang William Yang
- Automatically Correcting Large Language Models: Surveying The Landscape Of Diverse Self-correction Strategies Pan Liangming, Saxon Michael, Xu Wenda, Nathani Deepak, Wang Xinyi, Wang William Yang
- Tokenize Anything Via Prompting Pan Ting, Tang Lulu, Wang Xinlong, Shan Shiguang
- A Preliminary Evaluation Of Chatgpt For Zero-shot Dialogue Understanding Pan Wenbo, Chen Qiguang, Xu Xiao, Che Wanxiang, Qin Libo
- Frozen Transformers In Language Models Are Effective Visual Encoder Layers Pang Ziqi, Xie Ziyang, Man Yunze, Wang Yu-xiong
- Automated Annotation With Generative AI Requires Validation Pangakis Nicholas, Wolken Samuel, Fasching Neil
- Steering Llama 2 Via Contrastive Activation Addition Panickssery Nina, Gabrieli Nick, Schulz Julian, Tong Meg, Hubinger Evan, Turner Alexander Matt
- Trainable Transformer In Transformer Panigrahi Abhishek, Malladi Sadhika, Xia Mengzhou, Arora Sanjeev
- Revisiting Prompt Engineering Via Declarative Crowdsourcing Parameswaran Aditya G., Shankar Shreya, Asawa Parth, Jain Naman, Wang Yujie
- ART: Automatic Multi-step Reasoning And Tool-use For Large Language Models Paranjape Bhargavi, Lundberg Scott, Singh Sameer, Hajishirzi Hannaneh, Zettlemoyer Luke, Ribeiro Marco Tulio
- Localized Symbolic Knowledge Distillation For Visual Commonsense Models Park Jae Sung, Hessel Jack, Chandu Khyathi Raghavi, Liang Paul Pu, Lu Ximing, West Peter, Yu Youngjae, Huang Qiuyuan, Gao Jianfeng, Farhadi Ali, Choi Yejin
- Graph Elicitation For Guiding Multi-step Reasoning In Large Language Models Park Jinyoung, Patel Ameen, Khan Omar Zia, Kim Hyunwoo J., Kim Joo-kyung
- On The Analysis Of Cross-lingual Prompt Tuning For Decoder-based Multilingual Model Park Nohil, Park Joonsuk, Yoo Kang Min, Yoon Sungroh
- A Large Language Model Approach To Educational Survey Feedback Analysis Parker Michael J., Anderson Caitlin, Stone Claire, Oh Yearim
- Evaluating In-context Learning Of Libraries For Code Generation Patel Arkil, Reddy Siva, Bahdanau Dzmitry, Dasigi Pradeep
- Gorilla: Large Language Model Connected With Massive Apis Patil Shishir G., Zhang Tianjun, Wang Xin, Gonzalez Joseph E.
- REFINER: Reasoning Feedback On Intermediate Representations Paul Debjit, Ismayilzada Mete, Peyrard Maxime, Borges Beatriz, Bosselut Antoine, West Robert, Faltings Boi
- Instructexcel: A Benchmark For Natural Language Instruction In Excel Payan Justin, Mishra Swaroop, Singh Mukul, Negreanu Carina, Poelitz Christian, Baral Chitta, Roy Subhro, Chakravarthy Rasika, Van Durme Benjamin, Nouri Elnaz
- Entity Matching Using Large Language Models Peeters Ralph, Bizer Christian
- GPT Self-supervision For A Better Data Annotator Pei Xiaohuan, Li Yanxi, Xu Chang
- Chatgpt Prompting Cannot Estimate Predictive Uncertainty In High-resource Languages Pelucchi Martino, Valdenegro-toro Matias
- Generative Large Language Models Are All-purpose Text Analytics Engines: Text-to-text Learning Is All Your Need Peng Cheng, Yang Xi, Chen Aokun, Yu Zehao, Smith Kaleb E, Costa Anthony B, Flores Mona G, Bian Jiang, Wu Yonghui
- When Does In-context Learning Fall Short And Why? A Study On Specification-heavy Tasks Peng Hao, Wang Xiaozhi, Chen Jianhui, Li Weikai, Qi Yunjia, Wang Zimu, Wu Zhili, Zeng Kaisheng, Xu Bin, Hou Lei, Li Juanzi
- Spoken Language Intelligence Of Large Language Models For Language Learning Peng Linkai, Nuchged Baorian, Gao Yingming
- Pre-train, Prompt And Recommendation: A Comprehensive Survey Of Language Modelling Paradigm Adaptations In Recommender Systems Peng Liu, Lemei Zhang, Jon Atle Gulla
- Customising General Large Language Models For Specialised Emotion Recognition Tasks Peng Liyizhe, Zhang Zixing, Pang Tao, Han Jing, Zhao Huan, Chen Hao, Schuller Björn W.
- Soft Prompt Tuning For Augmenting Dense Retrieval With Large Language Models Peng Zhiyuan, Wu Xuyang, Wang Qifan, Fang Yi
- Promptinfuser: How Tightly Coupling AI And UI Design Impacts Designers' Workflows Petridis Savvas, Terry Michael, Cai Carrie J.
- Constitutionmaker: Interactively Critiquing Large Language Models By Converting Feedback Into Principles Petridis Savvas, Wedin Ben, Wexler James, Donsbach Aaron, Pushkarna Mahima, Goyal Nitesh, Cai Carrie J., Terry Michael
- When Do Prompting And Prefix-tuning Work? A Theory Of Capabilities And Limitations Petrov Aleksandar, Torr Philip H. S., Bibi Adel
- Large Language Models Sensitivity To The Order Of Options In Multiple-choice Questions Pezeshkpour Pouya, Hruschka Estevam
- Training Chain-of-thought Via Latent-variable Inference Phan Du, Hoffman Matthew D., Dohan David, Douglas Sholto, Le Tuan Anh, Parisi Aaron, Sountsov Pavel, Sutton Charles, Vikram Sharad, Saurous Rif A.
- Zero-shot Question Answering Over Financial Documents Using Large Language Models Phogat Karmvir Singh, Harsha Chetan, Dasaratha Sridhar, Ramakrishna Shashishekar, Puranam Sai Akhil
- Automating Human Tutor-style Programming Feedback: Leveraging GPT-4 Tutor Model For Hint Generation And GPT-3.5 Student Model For Hint Validation Phung Tung, Pădurean Victor-alexandru, Singh Anjali, Brooks Christopher, Cambronero José, Gulwani Sumit, Singla Adish, Soares Gustavo
- LLM Self Defense: By Self Examination, Llms Know They Are Being Tricked Phute Mansi, Helbling Alec, Hull Matthew, Peng Shengyun, Szyller Sebastian, Cornelius Cory, Chau Duen Horng
- Jatmo: Prompt Injection Defense By Task-specific Finetuning Piet Julien, Alrashed Maha, Sitawarin Chawin, Chen Sizhe, Wei Zeming, Sun Elizabeth, Alomair Basel, Wagner David
- LLM4TDD: Best Practices For Test Driven Development Using Large Language Models Piya Sanyogita, Sullivan Allison
- Three Ways Of Using Large Language Models To Evaluate Chat Plátek Ondřej, Hudeček Vojtěch, Schmidtová Patricia, Lango Mateusz, Dušek Ondřej
- Selfcheckgpt: Zero-resource Black-box Hallucination Detection For Generative Large Language Models Potsawee Manakul, Adian Liusie, Mark J. F. Gales
- Transformers In The Service Of Description Logic-based Contexts Poulis Angelos, Tsalapati Eleni, Koubarakis Manolis
- Painter: Teaching Auto-regressive Language Models To Draw Sketches Pourreza Reza, Bhattacharyya Apratim, Panchal Sunny, Lee Mingu, Madan Pulkit, Memisevic Roland
- Automatic Prompt Optimization With "gradient Descent" And Beam Search Pryzant Reid, Iter Dan, Li Jerry, Lee Yin Tat, Zhu Chenguang, Zeng Michael
- Chatgpt Vs Human-authored Text: Insights Into Controllable Text Summarization And Sentence Style Transfer Pu Dongqi, Demberg Vera
- Decomposed Prompting For Machine Translation Between Related Languages Using Large Language Models Puduppully Ratish, Kunchukuttan Anoop, Dabre Raj, Aw Ai Ti, Chen Nancy F.
- PILLOW: Enhancing Efficient Instruction Fine-tuning Via Prompt Matching Qi Zhenting, Tan Xiaoyu, Shi Shaojie, Qu Chao, Xu Yinghui, Qi Yuan
- Harnessing The Power Of Large Language Models For Empathetic Response Generation: Empirical Investigations And Improvements Qian Yushan, Zhang Wei-nan, Liu Ting
- Hijacking Large Language Models Via Adversarial In-context Learning Qiang Yao, Zhou Xiangyu, Zhu Dongxiao
- March In Chat: Interactive Prompting For Remote Embodied Referring Expression Qiao Yanyuan, Qi Yuankai, Yu Zheng, Liu Jing, Wu Qi
- ONCE: Boosting Content-based Recommendation With Both Open- And Closed-source Large Language Models Qijiong Liu, Nuo Chen, Tetsuya Sakai, Xiao-ming Wu
- Improving In-context Learning Via Bidirectional Alignment Qin Chengwei, Xia Wenhan, Jiao Fangkai, Chen Chen, Hu Yuchen, Ding Bosheng, Joty Shafiq
- In-context Learning With Iterative Demonstration Selection Qin Chengwei, Zhang Aston, Chen Chen, Dagar Anirudh, Ye Wenming
- How Good Is Google Bard's Visual Understanding? An Empirical Study On Open Challenges Qin Haotong, Ji Ge-peng, Khan Salman, Fan Deng-ping, Khan Fahad Shahbaz, Van Gool Luc
- Cross-lingual Prompting: Improving Zero-shot Chain-of-thought Reasoning Across Languages Qin Libo, Chen Qiguang, Wei Fuxuan, Huang Shijue, Che Wanxiang
- Toolllm: Facilitating Large Language Models To Master 16000+ Real-world Apis Qin Yujia, Liang Shihao, Ye Yining, Zhu Kunlun, Yan Lan, Lu Yaxi, Lin Yankai, Cong Xin, Tang Xiangru, Qian Bill, Zhao Sihan, Hong Lauren, Tian Runchu, Xie Ruobing, Zhou Jie, Gerstein Mark, Li Dahai, Liu Zhiyuan, Sun Maosong
- Large Language Models Are Effective Text Rankers With Pairwise Ranking Prompting Qin Zhen, Jagerman Rolf, Hui Kai, Zhuang Honglei, Wu Junru, Yan Le, Shen Jiaming, Liu Tianqi, Liu Jialu, Metzler Donald, Wang Xuanhui, Bendersky Michael
- Faithful Chain-of-thought Reasoning Qing Lyu, Shreya Havaldar, Adam Stein, Li Zhang, Delip Rao, Eric Wong, Marianna Apidianaki, Chris Callison-burch
- Latent Jailbreak: A Benchmark For Evaluating Text Safety And Output Robustness Of Large Language Models Qiu Huachuan, Zhang Shuai, Li Anqi, He Hongliang, Lan Zhenzhong
- Controlrec: Bridging The Semantic Gap Between Language Model And Personalized Recommendation Qiu Junyan, Wang Haitao, Hong Zhaolin, Yang Yiping, Liu Qiang, Wang Xingxing
- Are Large Language Models Temporally Grounded? Qiu Yifu, Zhao Zheng, Ziser Yftah, Korhonen Anna, Ponti Edoardo M., Cohen Shay B.
- Layoutllm-t2i: Eliciting Layout Guidance From LLM For Text-to-image Generation Qu Leigang, Wu Shengqiong, Fei Hao, Nie Liqiang, Chua Tat-seng
- Cmed-gpt: Prompt Tuning For Entity-aware Chinese Medical Dialogue Generation Qu Zhijie, Li Juan, Ma Zerui, Li Jianqiang
- Logic-scaffolding: Personalized Aspect-instructed Recommendation Explanation Generation Using Llms Rahdari Behnam, Ding Hao, Fan Ziwei, Ma Yifei, Chen Zhuotong, Deoras Anoop, Kveton Branislav
- Exploring The Effectiveness Of GPT Models In Test-taking: A Case Study Of The Driver's License Knowledge Test Rahimi Saba, Balch Tucker, Veloso Manuela
- Semantic Consistency For Assuring Reliability Of Large Language Models Raj Harsh, Gupta Vipul, Rosati Domenic, Majumdar Subhabrata
- Multimodal Multi-hop Question Answering Through A Conversation Between Tools And Efficiently Finetuned Large Language Models Rajabzadeh Hossein, Wang Suyuchen, Kwon Hyock Ju, Liu Bang
- Does The "most Sinfully Decadent Cake Ever" Taste Good? Answering Yes/no Questions From Figurative Contexts Rakshit Geetanjali, Flanigan Jeffrey
- Controllable Generation Of Dialogue Acts For Dialogue Systems Via Few-shot Response Generation And Ranking Ramirez Angela, Agarwal Karik, Juraska Juraj, Garg Utkarsh, Walker Marilyn A.
- Lmcap: Few-shot Multilingual Image Captioning By Retrieval Augmented Language Model Prompting Ramos Rita, Martins Bruno, Elliott Desmond
- When Large Language Models Contradict Humans? Large Language Models' Sycophantic Behaviour Ranaldi Leonardo, Pucci Giulia
- Empowering Multi-step Reasoning Across Languages Via Tree-of-thoughts Ranaldi Leonardo, Pucci Giulia, Ranaldi Federico, Ruzzetti Elena Sofia, Zanzotto Fabio Massimo
- Tricking Llms Into Disobedience: Formalizing, Analyzing, And Detecting Jailbreaks Rao Abhinav, Vashistha Sachin, Naik Atharva, Aditya Somak, Choudhury Monojit
- Can Chatgpt Assess Human Personalities? A General Evaluation Framework Rao Haocong, Leung Cyril, Miao Chunyan
- Dissecting In-context Learning Of Translations In Gpts Raunak Vikas, Awadalla Hany Hassan, Menezes Arul
- Do Gpts Produce Less Literal Translations? Raunak Vikas, Menezes Arul, Post Matt, Awadalla Hany Hassan
- Exploring The Relationship Between LLM Hallucinations And Prompt Linguistic Nuances: Readability, Formality, And Concreteness Rawte Vipula, Priya Prachi, Tonmoy S. M Towhidul Islam, Zaman S M Mehedi, Sheth Amit, Das Amitava
- Progressive Prompts: Continual Learning For Language Models Razdaibiedina Anastasia, Mao Yuning, Hou Rui, Khabsa Madian, Lewis Mike, Almahairi Amjad
- Prompt, Generate, Then Cache: Cascade Of Foundation Models Makes Strong Few-shot Learners Renrui Zhang, Xiangfei Hu, Bohao Li, Siyuan Huang, Hanqiu Deng, Hongsheng Li, Yu Qiao, Peng Gao
- I'm Afraid I Can't Do That: Predicting Prompt Refusal In Black-box Generative Language Models Reuter Max, Schulze William
- STREET: A Multi-task Structured Reasoning And Explanation Benchmark Ribeiro Danilo, Wang Shen, Ma Xiaofei, Zhu Henry, Dong Rui, Kong Deguang, Burger Juliette, Ramos Anjelica, Wang William, Huang Zhiheng, Karypis George, Xiang Bing, Roth Dan
- Integrating Summarization And Retrieval For Enhanced Personalization Via Large Language Models Richardson Chris, Zhang Yao, Gillespie Kellen, Kar Sudipta, Singh Arshdeep, Raeesy Zeynab, Khan Omar Zia, Sethy Abhinav
- Smoothllm: Defending Large Language Models Against Jailbreaking Attacks Robey Alexander, Wong Eric, Hassani Hamed, Pappas George J.
- Benchmarking Causal Study To Interpret Large Language Models For Source Code Rodriguez-cardenas Daniel, Palacio David N., Khati Dipin, Burke Henry, Poshyvanyk Denys
- Next-step Hint Generation For Introductory Programming Using Large Language Models Roest Lianne, Keuning Hieke, Jeuring Johan
- Synergistic Integration Of Large Language Models And Cognitive Architectures For Robust AI: An Exploratory Analysis Romero Oscar J., Zimmerman John, Steinfeld Aaron, Tomasic Anthony
- Grading Conversational Responses Of Chatbots Rosario Grant, Noever David
- LLM As An Art Director (ladi): Using Llms To Improve Text-to-media Generators Roush Allen, Zakirov Emil, Shirokov Artemiy, Lunina Polina, Gane Jack, Duffy Alexander, Basil Charlie, Whitcomb Aber, Benedetto Jim, Dewolfe Chris
- Unveiling The Potential Of Large Language Models In Generating Semantic And Cross-language Clones Roy Palash R., Alam Ajmain I., Al-omari Farouq, Roy Banani, Roy Chanchal K., Schneider Kevin A.
- Generating Phishing Attacks Using Chatgpt Roy Sayak Saha, Naragam Krishna Vamsi, Nilizadeh Shirin
- From Chatbots To Phishbots? -- Preventing Phishing Scams Created Using Chatgpt, Google Bard And Claude Roy Sayak Saha, Thota Poojitha, Naragam Krishna Vamsi, Nilizadeh Shirin
- Journey Of Hallucination-minimized Generative AI Solutions For Financial Decision Makers Roychowdhury Sohini
- Hallucination-minimized Data-to-answer Framework For Financial Decision-makers Roychowdhury Sohini, Alvarez Andres, Moore Brian, Krema Marko, Gelpi Maria Paz, Rodriguez Federico Martin, Rodriguez Angel, Cabrejas Jose Ramon, Serrano Pablo Martinez, Agrawal Punit, Mukherjee Arijit
- Gpt4tools: Teaching Large Language Model To Use Tools Via Self-instruction Rui Yang, Lin Song, Yanwei Li, Sijie Zhao, Yixiao Ge, Xiu Li, Ying Shan
- ICLEF: In-context Learning With Expert Feedback For Explainable Style Transfer Saakyan Arkadiy, Muresan Smaranda
- Catfood: Counterfactual Augmented Training For Improving Out-of-domain Performance And Calibration Sachdeva Rachneet, Tutek Martin, Gurevych Iryna
- Performance Of Chatgpt-3.5 And GPT-4 On The United States Medical Licensing Examination With And Without Distractions Safrai Myriam, Azaria Amos
- Branch-solve-merge Improves Large Language Model Evaluation And Generation Saha Swarnadeep, Levy Omer, Celikyilmaz Asli, Bansal Mohit, Weston Jason, Li Xian
- Promptmix: A Class Boundary Augmentation Method For Large Language Model Distillation Sahu Gaurav, Vechtomova Olga, Bahdanau Dzmitry, Laradji Issam H.
- Memory Injections: Correcting Multi-hop Reasoning Failures During Inference In Transformer-based Language Models Sakarvadia Mansi, Ajith Aswathy, Khan Arham, Grzenda Daniel, Hudson Nathaniel, Bauer André, Chard Kyle, Foster Ian
- In-context Impersonation Reveals Large Language Models' Strengths And Biases Salewski Leonard, Alaniz Stephan, Rio-torto Isabel, Schulz Eric, Akata Zeynep
- Zero-shot Translation Of Attention Patterns In VQA Models To Natural Language Salewski Leonard, Koepke A. Sophia, Lensch Hendrik P. A., Akata Zeynep
- Large Language Models Are Competitive Near Cold-start Recommenders For Language- And Item-based Preferences Sanner Scott, Balog Krisztian, Radlinski Filip, Wedin Ben, Dixon Lucas
- Camoscio: An Italian Instruction-tuned Llama Santilli Andrea, Rodolà Emanuele
- Frugal Prompting For Dialog Models Santra Bishal, Basak Sakya, De Abhinandan, Gupta Manish, Goyal Pawan
- Teler: A General Taxonomy Of LLM Prompts For Benchmarking Complex Tasks Santu Shubhra Kanti Karmaker, Feng Dongji
- Testing The General Deductive Reasoning Capacity Of Large Language Models Using OOD Examples Saparov Abulhair, Pang Richard Yuanzhe, Padmakumar Vishakh, Joshi Nitish, Kazemi Seyed Mehran, Kim Najoung, He He
- Open-ended Instructable Embodied Agents With Memory-augmented Large Language Models Sarch Gabriel, Wu Yue, Tarr Michael J., Fragkiadaki Katerina
- Will Code Remain A Relevant User Interface For End-user Programming With Generative AI Models? Sarkar Advait
- RAMP: Retrieval And Attribute-marking Enhanced Prompting For Attribute-controlled Translation Sarti Gabriele, Htut Phu Mon, Niu Xing, Hsu Benjamin, Currey Anna, Dinu Georgiana, Nadejde Maria
- Interprompt: Interpretable Prompting For Interrelated Interpersonal Risk Factors In Reddit Posts Sathvik Msvpj, Sarkar Surjodeep, Saxena Chandni, Sohn Sunghwan, Garg Muskan
- Diagnostic Reasoning Prompts Reveal The Potential For Large Language Model Interpretability In Medicine Savage Thomas, Nayak Ashwin, Gallo Robert, Rangan Ekanath, Chen Jonathan H
- Can GPT-4 Support Analysis Of Textual Data In Tasks Requiring Highly Specialized Domain Expertise? Savelka Jaromir, Ashley Kevin D., Gray Morgan A, Westermann Hannes, Xu Huihui
- A Review Of Repository Level Prompting For Llms Schonholtz Douglas
- Ignore This Title And Hackaprompt: Exposing Systemic Vulnerabilities Of Llms Through A Global Scale Prompt Hacking Competition Schulhoff Sander, Pinto Jeremy, Khan Anaum, Bouchard Louis-françois, Si Chenglei, Anati Svetlina, Tagliabue Valen, Kost Anson Liu, Carnahan Christopher, Boyd-graber Jordan
- An Empirical Evaluation Of Using Large Language Models For Automated Unit Test Generation Schäfer Max, Nadi Sarah, Eghbali Aryaz, Tip Frank
- Quantifying Language Models' Sensitivity To Spurious Features In Prompt Design Or: How I Learned To Start Worrying About Prompt Formatting Sclar Melanie, Choi Yejin, Tsvetkov Yulia, Suhr Alane
- Evaluating The Deductive Competence Of Large Language Models Seals Spencer M., Shalin Valerie L.
- Personality Traits In Large Language Models Serapio-garcía Greg, Safdari Mustafa, Crepy Clément, Sun Luning, Fitz Stephen, Romero Peter, Abdulhai Marwa, Faust Aleksandra, Matarić Maja
- Scalable And Transferable Black-box Jailbreaks For Language Models Via Persona Modulation Shah Rusheb, Feuillade--montixi Quentin, Pour Soroush, Tagade Arush, Casper Stephen, Rando Javier
- Evaluating Large Language Model Creativity From A Literary Perspective Shanahan Murray, Clarke Catherine
- Character-llm: A Trainable Agent For Role-playing Shao Yunfan, Li Linyang, Dai Junqi, Qiu Xipeng
- Language Is Not All You Need: Aligning Perception With Language Models Shaohan Huang, Li Dong, Wenhui Wang, Yaru Hao, Saksham Singhal, Shuming Ma, Tengchao Lv, Lei Cui, Owais Khan Mohammed, Barun Patra, Qiang Liu, Kriti Aggarwal, Zewen Chi, Johan Bjorck, Vishrav Chaudhary, Subhojit Som, Xia Song, Furu Wei
- Performance Of Chatgpt On USMLE: Unlocking The Potential Of Large Language Models For Ai-assisted Medical Education Sharma Prabin, Thapa Kisan, Thapa Dikshya, Dhakal Prastab, Upadhaya Mala Deep, Adhikari Santosh, Khanal Salik Ram
- Jailbreak In Pieces: Compositional Adversarial Attacks On Multi-modal Language Models Shayegani Erfan, Dong Yue, Abu-ghazaleh Nael
- The Flan Collection: Designing Data And Methods For Effective Instruction Tuning Shayne Longpre, Le Hou, Tu Vu, Albert Webson, Hyung Won Chung, Yi Tay, Denny Zhou, Quoc V. Le, Barret Zoph, Jason Wei, Adam Roberts
- The Trickle-down Impact Of Reward (in-)consistency On RLHF Shen Lingfeng, Chen Sihao, Song Linfeng, Jin Lifeng, Peng Baolin, Mi Haitao, Khashabi Daniel, Yu Dong
- Do Pretrained Transformers Learn In-context By Gradient Descent? Shen Lingfeng, Mishra Aayush, Khashabi Daniel
- Storygpt-v: Large Language Models As Consistent Story Visualizers Shen Xiaoqian, Elhoseiny Mohamed
- "Do Anything Now": Characterizing And Evaluating In-the-wild Jailbreak Prompts On Large Language Models Shen Xinyue, Chen Zeyuan, Backes Michael, Shen Yun, Zhang Yang
- Towards More Unified In-context Visual Understanding Sheng Dianmo, Chen Dongdong, Tan Zhentao, Liu Qiankun, Chu Qi, Bao Jianmin, Gong Tao, Liu Bin, Xu Shengwei, Yu Nenghai
- TOAST: Transfer Learning Via Attention Steering Shi Baifeng, Gai Siyu, Darrell Trevor, Wang Xin
- Prompt Space Optimizing Few-shot Reasoning Success With Large Language Models Shi Fobo, Qing Peijun, Yang Dong, Wang Nan, Lei Youbo, Lu Haonan, Lin Xiaodong, Li Duantengchuan
- MKRAG: Medical Knowledge Retrieval Augmented Generation For Medical Question Answering Shi Yucheng, Xu Shaochen, Yang Tianze, Liu Zhengliang, Liu Tianming, Li Quanzheng, Li Xiang, Liu Ninghao
- Dept: Decomposed Prompt Tuning For Parameter-efficient Fine-tuning Shi Zhengxiang, Lipani Aldo
- Reasoning With Language Model Is Planning With World Model Shibo Hao, Yi Gu, Haodi Ma, Joshua Jiahua Hong, Zhen Wang, Daisy Zhe Wang, Zhiting Hu
- Exploring The Robustness Of Large Language Models For Solving Programming Problems Shirafuji Atsushi, Watanobe Yutaka, Ito Takumi, Morishita Makoto, Nakamura Yuki, Oda Yusuke, Suzuki Jun
- Active Prompting With Chain-of-thought For Large Language Models Shizhe Diao, Pengcheng Wang, Yong Lin, Rui Pan, Xiang Liu, Tong Zhang
- What Does CLIP Know About A Red Circle? Visual Prompt Engineering For Vlms Shtedritski Aleksandar, Rupprecht Christian, Vedaldi Andrea
- You Don't Need A Personality Test To Know These Models Are Unreliable: Assessing The Reliability Of Large Language Models On Psychometric Instruments Shu Bangzhao, Zhang Lechen, Choi Minje, Dunagan Lavinia, Logeswaran Lajanugen, Lee Moontae, Card Dallas, Jurgens David
- POSQA: Probe The World Models Of Llms With Size Comparisons Shu Chang, Han Jiuzhou, Liu Fangyu, Shareghi Ehsan, Collier Nigel
- Rewritelm: An Instruction-tuned Large Language Model For Text Rewriting Shu Lei, Luo Liangchen, Hoskere Jayakumar, Zhu Yun, Liu Yinxiao, Tong Simon, Chen Jindong, Meng Lei
- A Comprehensive Evaluation Of Large Language Models On Legal Judgment Prediction Shui Ruihao, Cao Yixin, Wang Xiang, Chua Tat-seng
- Beyond Task Performance: Evaluating And Reducing The Flaws Of Large Multimodal Models With In-context Learning Shukor Mustafa, Rame Alexandre, Dancette Corentin, Cord Matthieu
- Automatic Prompt Augmentation And Selection With Chain-of-thought From Labeled Data Shum Kashun, Diao Shizhe, Zhang Tong
- Tree Of Thoughts: Deliberate Problem Solving With Large Language Models Shunyu Yao, Dian Yu, Jeffrey Zhao, Izhak Shafran, Thomas L. Griffiths, Yuan Cao, Karthik Narasimhan
- Measuring Inductive Biases Of In-context Learning With Underspecified Demonstrations Si Chenglei, Friedman Dan, Joshi Nitish, Feng Shi, Chen Danqi, He He
- Large Language Models Help Humans Verify Truthfulness -- Except When They Are Convincingly Wrong Si Chenglei, Goyal Navita, Wu Sherry Tongshuang, Zhao Chen, Feng Shi, Daumé Hal III, Boyd-graber Jordan
- Getting More Out Of Mixture Of Language Model Reasoning Experts Si Chenglei, Shi Weijia, Zhao Chen, Zettlemoyer Luke, Boyd-graber Jordan
- Knowledge Unlearning For Llms: Tasks, Methods, And Challenges Si Nianwen, Zhang Hao, Chang Heyu, Zhang Wenlin, Qu Dan, Zhang Weiqiang
- Mondrian: Prompt Abstraction Attack Against Large Language Models For Cheaper API Pricing Si Wai Man, Backes Michael, Zhang Yang
- GPT-4 As An Agronomist Assistant? Answering Agriculture Exams Using Large Language Models Silva Bruno, Nunes Leonardo, Estevão Roberto, Aski Vijay, Chandra Ranveer
- Generalized Planning In PDDL Domains With Pretrained Large Language Models Silver Tom, Dan Soham, Srinivas Kavitha, Tenenbaum Joshua B., Kaelbling Leslie Pack, Katz Michael
- Large Language Models For Aspect-based Sentiment Analysis Simmering Paul F., Huoviala Paavo
- Assessing GPT4-V On Structured Reasoning Tasks Singh Mukul, Cambronero José, Gulwani Sumit, Le Vu, Verbruggen Gust
- Gopro: Generate And Optimize Prompts In CLIP Using Self-supervised Learning Singha Mainak, Jha Ankit, Banerjee Biplab
- Applenet: Visual Attention Parameterized Prompt Learning For Few-shot Remote Sensing Image Generalization Using CLIP Singha Mainak, Jha Ankit, Solanki Bhupendra, Bose Shirsha, Banerjee Biplab
- Towards Expert-level Medical Question Answering With Large Language Models Singhal Karan, Tu Tao, Gottweis Juraj, Sayres Rory, Wulczyn Ellery, Hou Le, Clark Kevin, Pfohl Stephen, Cole-lewis Heather, Neal Darlene, Schaekermann Mike, Wang Amy, Amin Mohamed, Lachgar Sami, Mansfield Philip, Prakash Sushant, Green Bradley, Dominowska Ewa, Arcas Blaise Aguera Y, Tomasev Nenad, Liu Yun, Wong Renee, Semturs Christopher, Mahdavi S. Sara, Barral Joelle, Webster Dale, Corrado Greg S., Matias Yossi, Azizi Shekoofeh, Karthikesalingam Alan, Natarajan Vivek
- Should We Be Going MAD? A Look At Multi-agent Debate Strategies For Llms Smit Andries, Duckworth Paul, Grinsztajn Nathan, Barrett Thomas D., Pretorius Arnu
- Step: Stacked LLM Policies For Web Actions Sodhi Paloma, Branavan S. R. K., Artzi Yoav, Mcdonald Ryan
- SPARSEFIT: Few-shot Prompting With Sparse Fine-tuning For Jointly Generating Predictions And Natural Language Explanations Solano Jesus, Sanni Mardhiyah, Camburu Oana-maria, Minervini Pasquale
- Biomedical Knowledge Graph-optimized Prompt Generation For Large Language Models Soman Karthik, Rose Peter W, Morris John H, Akbas Rabia E, Smith Brett, Peetoom Braian, Villouta-reyes Catalina, Cerono Gabriel, Shi Yongmei, Rizk-jackson Angela, Israni Sharat, Nelson Charlotte A, Huang Sui, Baranzini Sergio E
- Nlpbench: Evaluating Large Language Models On Solving NLP Problems Song Linxin, Zhang Jieyu, Cheng Lechao, Zhou Pengyuan, Zhou Tianyi, Li Irene
- Large Language Models Meet Open-world Intent Discovery And Recognition: An Evaluation Of Chatgpt Song Xiaoshuai, He Keqing, Wang Pei, Dong Guanting, Mou Yutao, Wang Jingang, Xian Yunsen, Cai Xunliang, Xu Weiran
- Honeybee: Progressive Instruction Finetuning Of Large Language Models For Materials Science Song Yu, Miret Santiago, Zhang Huan, Liu Bang
- Reward Collapse In Aligning Large Language Models Song Ziang, Cai Tianle, Lee Jason D., Su Weijie J.
- Deduction Under Perturbed Evidence: Probing Student Simulation Capabilities Of Large Language Models Sonkar Shashank, Baraniuk Richard G.
- Code Soliloquies For Accurate Calculations In Large Language Models Sonkar Shashank, Le Myco, Chen Xinghe, Liu Naiming, Mallick Debshila Basu, Baraniuk Richard G.
- Joint Prompt Optimization Of Stacked Llms Using Variational Inference Sordoni Alessandro, Yuan Xingdi, Côté Marc-alexandre, Pereira Matheus, Trischler Adam, Xiao Ziang, Hosseini Arian, Niedtner Friederike, Roux Nicolas Le
- The First Step Is The Hardest: Pitfalls Of Representing And Tokenizing Temporal Data For Large Language Models Spathis Dimitris, Kawsar Fahim
- Commands As AI Conversations Spinellis Diomidis
- Musr: Testing The Limits Of Chain-of-thought With Multistep Soft Reasoning Sprague Zayne, Ye Xi, Bostrom Kaj, Chaudhuri Swarat, Durrett Greg
- Hierarchical Prompting Assists Large Language Model On Web Navigation Sridhar Abishek, Lo Robert, Xu Frank F., Zhu Hao, Zhou Shuyan
- Harnessing Llms In Curricular Design: Using GPT-4 To Support Authoring Of Learning Objectives Sridhar Pragnya, Doyle Aidan, Agarwal Arav, Bogart Christopher, Savelka Jaromir, Sakr Majd
- Instances Need More Care: Rewriting Prompts For Instances With Llms In The Loop Yields Better Zero-shot Performance Srivastava Saurabh, Huang Chengyue, Fan Weiguo, Yao Ziyu
- GPT-4 Doesn't Know It's Wrong: An Analysis Of Iterative Prompting For Reasoning Problems Stechly Kaya, Marquez Matthew, Kambhampati Subbarao
- A Mechanistic Interpretation Of Arithmetic Reasoning In Language Models Using Causal Mediation Analysis Stolfo Alessandro, Belinkov Yonatan, Sachan Mrinmaya
- Finetuning An LLM On Contextual Knowledge Of Classics For Q&A Strachan Shane Storm
- Learning From Red Teaming: Gender Bias Provocation And Mitigation In Large Language Models Su Hsuan, Cheng Cheng-chu, Farn Hua, Kumar Shachi H, Sahay Saurav, Chen Shang-tse, Lee Hung-yi
- TAP4LLM: Table Provider On Sampling, Augmenting, And Packing Semi-structured Data For Large Language Model Reasoning Sui Yuan, Zou Jiaru, Zhou Mengyu, He Xinyi, Du Lun, Han Shi, Zhang Dongmei
- Cognitive Architectures For Language Agents Sumers Theodore R., Yao Shunyu, Narasimhan Karthik, Griffiths Thomas L.
- CONSCENDI: A Contrastive And Scenario-guided Distillation Approach To Guardrail Models For Virtual Assistants Sun Albert Yu, Nair Varun, Schumacher Elliot, Kannan Anitha
- Does Fine-tuning GPT-3 With The Openai API Leak Personally-identifiable Information? Sun Albert Yu, Zemour Eliott, Saxena Arushi, Vaidyanathan Udith, Lin Eric, Lau Christian, Mugunthan Vaikkunth
- TEST: Text Prototype Aligned Embedding To Activate Llm's Ability For Time Series Sun Chenxi, Li Hongyan, Li Yaliang, Hong Shenda
- Create Your World: Lifelong Text-to-image Diffusion Sun Gan, Liang Wenqi, Dong Jiahua, Li Jun, Ding Zhengming, Cong Yang
- Speech-based Slot Filling Using Large Language Models Sun Guangzhi, Feng Shutong, Jiang Dongcheng, Zhang Chao, Gašić Milica, Woodland Philip C.
- Reinforcement Learning In The Era Of Llms: What Is Essential? What Is Needed? An RL Perspective On RLHF, Prompting, And Beyond Sun Hao
- Towards Verifiable Text Generation With Evolving Memory And Self-reflection Sun Hao, Cai Hengyi, Wang Bo, Hou Yingyan, Wei Xiaochi, Wang Shuaiqiang, Zhang Yan, Yin Dawei
- Query-dependent Prompt Evaluation And Optimization With Offline Inverse RL Sun Hao, Hüyük Alihan, Van Der Schaar Mihaela
- Adaplanner: Adaptive Planning From Feedback With Language Models Sun Haotian, Zhuang Yuchen, Kong Lingkai, Dai Bo, Zhang Chao
- Autohint: Automatic Prompt Optimization With Hint Generation Sun Hong, Li Xue, Xu Yinchuan, Homma Youkow, Cao Qi, Wu Min, Jiao Jian, Charles Denis
- Think-on-graph: Deep And Responsible Reasoning Of Large Language Model On Knowledge Graph Sun Jiashuo, Xu Chengjin, Tang Lumingyuan, Wang Saizhuo, Lin Chen, Gong Yeyun, Ni Lionel M., Shum Heung-yeung, Guo Jian
- Evaluating The Zero-shot Robustness Of Instruction-tuned Language Models Sun Jiuding, Shaib Chantal, Wallace Byron C.
- Make Prompt-based Black-box Tuning Colorful: Boosting Model Generalization From Three Orthogonal Perspectives Sun Qiushi, Han Chengcheng, Chen Nuo, Zhu Renyu, Gong Jingyang, Li Xiang, Gao Ming
- Generative Multimodal Models Are In-context Learners Sun Quan, Cui Yufeng, Zhang Xiaosong, Zhang Fan, Yu Qiying, Luo Zhengxiong, Wang Yueze, Rao Yongming, Liu Jingjing, Huang Tiejun, Wang Xinlong
- Battle Of The Large Language Models: Dolly Vs Llama Vs Vicuna Vs Guanaco Vs Bard Vs Chatgpt -- A Text-to-sql Parsing Comparison Sun Shuo, Zhang Yuchen, Yan Jiahuan, Gao Yuze, Ong Donovan, Chen Bin, Su Jian
- PEARL: Prompting Large Language Models To Plan And Execute Actions Over Long Documents Sun Simeng, Liu Yang, Wang Shuohang, Zhu Chenguang, Iyyer Mohit
- A Prompt Learning Framework For Source Code Summarization Sun Weisong, Fang Chunrong, You Yudu, Chen Yuchen, Liu Yi, Wang Chong, Zhang Jian, Zhang Quanjun, Qian Hanwei, Zhao Wei, Liu Yang, Chen Zhenyu
- Instruction Distillation Makes Large Language Models Efficient Zero-shot Rankers Sun Weiwei, Chen Zheng, Ma Xinyu, Yan Lingyong, Wang Shuaiqiang, Ren Pengjie, Chen Zhumin, Yin Dawei, Ren Zhaochun
- Multimodal Question Answering For Unified Information Extraction Sun Yuxuan, Zhang Kai, Su Yu
- Large Language Models For Intent-driven Session Recommendations Sun Zhu, Liu Hongyang, Qu Xinghua, Feng Kaidong, Wang Yan, Ong Yew-soon
- Uncovering Chatgpt's Capabilities In Recommender Systems Sunhao Dai, Ninglu Shao, Haiyuan Zhao, Weijie Yu, Zihua Si, Chen Xu, Zhongxiang Sun, Xiao Zhang, Jun Xu
- Do Large Language Models Show Decision Heuristics Similar To Humans? A Case Study Using GPT-3.5 Suri Gaurav, Slater Lily R., Ziaee Ali, Nguyen Morgan
- Approximating Online Human Evaluation Of Social Chatbots With Prompting Svikhnushina Ekaterina, Pu Pearl
- Gender-specific Machine Translation With Large Language Models Sánchez Eduardo, Andrews Pierre, Stenetorp Pontus, Artetxe Mikel, Costa-jussà Marta R.
- Sparks Of Artificial General Intelligence: Early Experiments With GPT-4 Sébastien Bubeck, Varun Chandrasekaran, Ronen Eldan, Johannes Gehrke, Eric Horvitz, Ece Kamar, Peter Lee, Yin Tat Lee, Yuanzhi Li, Scott Lundberg, Harsha Nori, Hamid Palangi, Marco Tulio Ribeiro, Yi Zhang
- The BEA 2023 Shared Task On Generating AI Teacher Responses In Educational Dialogues Tack Anaïs, Kochmar Ekaterina, Yuan Zheng, Bibauw Serge, Piech Chris
- Exploring Chain-of-thought Style Prompting For Text-to-sql Tai Chang-you, Chen Ziru, Zhang Tianshu, Deng Xiang, Sun Huan
- Cappy: Outperforming And Boosting Large Multi-task Lms With A Small Scorer Tan Bowen, Zhu Yun, Liu Lijuan, Xing Eric, Hu Zhiting, Chen Jindong
- Large Language Model-driven Classroom Flipping: Empowering Student-centric Peer Questioning With Flipped Interaction Tan Chee Wei
- Copilot For Xcode: Exploring Ai-assisted Programming By Prompting Cloud-based Large Language Models Tan Chee Wei, Guo Shangxin, Wong Man Fai, Hang Ching Nam
- Compound Text-guided Prompt Tuning Via Image-adaptive Cues Tan Hao, Li Jun, Zhou Yizhuang, Wan Jun, Lei Zhen, Zhang Xiangyu
- Ml-bench: Evaluating Large Language Models And Agents For Machine Learning Tasks On Repository-level Code Tang Xiangru, Liu Yuliang, Cai Zefan, Shao Yanjun, Lu Junjie, Zhang Yichi, Deng Zexuan, Hu Helan, An Kaikai, Huang Ruijun, Si Shuzheng, Chen Sheng, Zhao Haozhe, Chen Liang, Wang Yan, Liu Tianyu, Jiang Zhiwei, Chang Baobao, Fang Yin, Qin Yujia, Zhou Wangchunshu, Zhao Yilun, Cohan Arman, Gerstein Mark
- Struc-bench: Are Large Language Models Really Good At Generating Complex Structured Data? Tang Xiangru, Zong Yiming, Phang Jason, Zhao Yilun, Zhou Wangchunshu, Cohan Arman, Gerstein Mark
- Codi-2: In-context, Interleaved, And Interactive Any-to-any Generation Tang Zineng, Yang Ziyi, Khademi Mahmoud, Liu Yang, Zhu Chenguang, Bansal Mohit
- Using Large Language Models For Cybersecurity Capture-the-flag Challenges And Certification Questions Tann Wesley, Liu Yuancheng, Sim Jun Heng, Seah Choon Meng, Chang Ee-chien
- Multilingual Llms Are Better Cross-lingual In-context Learners With Alignment Tanwar Eshaan, Dutta Subhabrata, Borthakur Manish, Chakraborty Tanmoy
- POUF: Prompt-oriented Unsupervised Fine-tuning For Large Pre-trained Models Tanwisuth Korawat, Zhang Shujian, Zheng Huangjie, He Pengcheng, Zhou Mingyuan
- Is Chatgpt A Highly Fluent Grammatical Error Correction System? A Comprehensive Evaluation Tao Fang, Shu Yang, Kaixin Lan, Derek F. Wong, Jinpeng Hu, Lidia S. Chao, Yue Zhang
- Webwise: Web Interface Control And Sequential Exploration With Large Language Models Tao Heyi, T Sethuraman V, Shlapentokh-rothman Michal, Hoiem Derek
- Expanding Frozen Vision-language Models Without Retraining: Towards Improved Robot Perception Tavassoli Riley, Amani Mani, Akhavian Reza
- Caption Anything: Interactive Image Description With Diverse Multimodal Controls Teng Wang, Jinrui Zhang, Junjie Fei, Hao Zheng, Yunlong Tang, Zhe Li, Mingqi Gao, Shanshan Zhao
- CLAMP: Contrastive Language Model Prompt-tuning Teterwak Piotr, Sun Ximeng, Plummer Bryan A., Saenko Kate, Lim Ser-nam
- Leveraging Llms For Synthesizing Training Data Across Many Languages In Multilingual Dense Retrieval Thakur Nandan, Ni Jianmo, Ábrego Gustavo Hernández, Wieting John, Lin Jimmy, Cer Daniel
- Learning To Prompt In The Classroom To Understand AI Limits: A Pilot Study Theophilou Emily, Koyuturk Cansu, Yavari Mona, Bursic Sathya, Donabauer Gregor, Telari Alessia, Testa Alessia, Boiano Raffaele, Hernandez-leo Davinia, Ruskov Martin, Taibi Davide, Gabbiadini Alessandro, Ognibene Dimitri
- Is Chatgpt The Ultimate Programming Assistant -- How Far Is It? Tian Haoye, Lu Weiqi, Li Tsz On, Tang Xunzhu, Cheung Shing-chi, Klein Jacques, Bissyandé Tegawendé F.
- Soft-prompt Tuning For Large Language Models To Evaluate Bias Tian Jacob-junqi, Emerson David, Miyandoab Sevil Zanjani, Pandya Deval, Seyyed-kalantari Laleh, Khattak Faiza Khan
- R\(^3\) Prompting: Review, Rephrase And Resolve For Chain-of-thought Reasoning In Large Language Models Under Noisy Context Tian Qingyuan, Zhu Hanlun, Wang Lei, Li Yang, Lan Yunshi
- Argue: Attribute-guided Prompt Tuning For Vision-language Models Tian Xinyu, Zou Shu, Yang Zhaoyuan, Zhang Jing
- Graph Neural Prompting With Large Language Models Tian Yijun, Song Huan, Wang Zichen, Wang Haozhu, Hu Ziqing, Wang Fang, Chawla Nitesh V., Xu Panpan
- Evil Geniuses: Delving Into The Safety Of Llm-based Agents Tian Yu, Yang Xiao, Zhang Jingyuan, Dong Yinpeng, Su Hang
- Chimed-gpt: A Chinese Medical Large Language Model With Full Training Regime And Better Alignment To Human Preferences Tian Yuanhe, Gan Ruyi, Song Yan, Zhang Jiaxing, Zhang Yongdong
- Macgyver: Are Large Language Models Creative Problem Solvers? Tian Yufei, Ravichander Abhilasha, Qin Lianhui, Bras Ronan Le, Marjieh Raja, Peng Nanyun, Choi Yejin, Griffiths Thomas L., Brahman Faeze
- Fine-tune Language Models To Approximate Unbiased In-context Learning Timothy Chu, Zhao Song, Chiwun Yang
- Do Llms Exhibit Human-like Response Biases? A Case Study In Survey Design Tjuatja Lindia, Chen Valerie, Wu Sherry Tongshuang, Talwalkar Ameet, Neubig Graham
- Parameter Efficient Tuning Allows Scalable Personalization Of Llms For Text Entry: A Case Study On Abbreviation Expansion Tomanek Katrin, Cai Shanqing, Venugopalan Subhashini
- Eliminating Reasoning Via Inferring With Planning: A New Framework To Guide Llms' Non-linear Thinking Tong Yongqi, Wang Yifan, Li Dawei, Wang Sizhe, Lin Zi, Han Simeng, Shang Jingbo
- Llmseceval: A Dataset Of Natural Language Prompts For Security Evaluations Tony Catherine, Mutas Markus, Ferreyra Nicolás E. Díaz, Scandariato Riccardo
- Bioinstruct: Instruction Tuning Of Large Language Models For Biomedical Natural Language Processing Tran Hieu, Yang Zhichao, Yao Zonghai, Yu Hong
- Large Language Model Prompt Chaining For Long Legal Document Classification Trautmann Dietrich
- Chatgpt As A Commenter To The News: Can Llms Generate Human-like Opinions? Tseng Rayden, Verberne Suzan, Van Der Putten Peter
- Overinformative Question Answering By Humans And Machines Tsvilodub Polina, Franke Michael, Hawkins Robert D., Goodman Noah D.
- Efficiently Aligned Cross-lingual Transfer Learning For Conversational Tasks Using Prompt-tuning Tu Lifu, Qu Jin, Yavuz Semih, Joty Shafiq, Liu Wenhao, Xiong Caiming, Zhou Yingbo
- Unlocking Anticipatory Text Generation: A Constrained Approach For Large Language Models Decoding Tu Lifu, Yavuz Semih, Qu Jin, Xu Jiacheng, Meng Rui, Xiong Caiming, Zhou Yingbo
- Activation Addition: Steering Language Models Without Optimization Turner Alexander Matt, Thiergart Lisa, Leech Gavin, Udell David, Vazquez Juan J., Mini Ulisse, Macdiarmid Monte
- Language Models Don't Always Say What They Think: Unfaithful Explanations In Chain-of-thought Prompting Turpin Miles, Michael Julian, Perez Ethan, Bowman Samuel R.
- Why Can Large Language Models Generate Correct Chain-of-thoughts? Tutunov Rasul, Grosnit Antoine, Ziomek Juliusz, Wang Jun, Bou-ammar Haitham
- Zeroshotdataaug: Generating And Augmenting Training Data With Chatgpt Ubani Solomon, Polat Suleyman Olcay, Nielsen Rodney
- How Prevalent Is Gender Bias In Chatgpt? -- Exploring German And English Chatgpt Responses Urchs Stefanie, Thurner Veronika, Aßenmacher Matthias, Heumann Christian, Thiemichen Stephanie
- In-context Ability Transfer For Question Decomposition In Complex QA V Venktesh, Bhattacharya Sourangshu, Anand Avishek
- On The Planning Abilities Of Large Language Models: A Critical Investigation Valmeekam Karthik, Marquez Matthew, Sreedharan Sarath, Kambhampati Subbarao
- Evaluating Shutdown Avoidance Of Language Models In Textual Scenarios Van Der Weij Teun, Lermen Simon, Lang Leon
- Theory Of Mind In Large Language Models: Examining Performance Of 11 State-of-the-art Models Vs. Children Aged 7-10 On Advanced Tests Van Duijn Max J., Van Dijk Bram M. A., Kouwenhoven Tom, De Valk Werner, Spruit Marco R., Van Der Putten Peter
- Detecting And Correcting Hate Speech In Multimodal Memes With Large Visual Language Model Van Minh-hao, Wu Xintao
- ASPIRO: Any-shot Structured Parsing-error-induced Reprompting For Consistent Data-to-text Generation Vejvar Martin, Fujimoto Yasutaka
- Small Language Models Improve Giants By Rewriting Their Outputs Vernikos Giorgos, Bražinskas Arthur, Adamek Jakub, Mallinson Jonathan, Severyn Aliaksei, Malmi Eric
- Simplesafetytests: A Test Suite For Identifying Critical Safety Risks In Large Language Models Vidgen Bertie, Scherrer Nino, Kirk Hannah Rose, Qian Rebecca, Kannappan Anand, Hale Scott A., Röttger Paul
- Prompt2model: Generating Deployable Models From Natural Language Instructions Viswanathan Vijay, Zhao Chenyang, Bertsch Amanda, Wu Tongshuang, Neubig Graham
- Freshllms: Refreshing Large Language Models With Search Engine Augmentation Vu Tu, Iyyer Mohit, Wang Xuezhi, Constant Noah, Wei Jerry, Wei Jason, Tar Chris, Sung Yun-hsuan, Zhou Denny, Le Quoc, Luong Thang
- Using Natural Language Explanations To Rescale Human Judgments Wadhwa Manya, Chen Jifan, Li Junyi Jessy, Durrett Greg
- Better Zero-shot Reasoning With Self-adaptive Prompting Wan Xingchen, Sun Ruoxi, Dai Hanjun, Arik Sercan O., Pfister Tomas
- Universal Self-adaptive Prompting Wan Xingchen, Sun Ruoxi, Nakhost Hootan, Dai Hanjun, Eisenschlos Julian Martin, Arik Sercan O., Pfister Tomas
- GPT-RE: In-context Learning For Relation Extraction Using Large Language Models Wan Zhen, Cheng Fei, Mao Zhuoyuan, Liu Qianying, Song Haiyue, Li Jiwei, Kurohashi Sadao
- Reformulating Domain Adaptation Of Large Language Models As Adapt-retrieve-revise: A Case Study On Chinese Legal Domain Wan Zhen, Zhang Yating, Wang Yexiang, Cheng Fei, Kurohashi Sadao
- Grammar Prompting For Domain-specific Language Generation With Large Language Models Wang Bailin, Wang Zi, Wang Xuezhi, Cao Yuan, Saurous Rif A., Kim Yoon
- BLSP: Bootstrapping Language-speech Pre-training Via Behavior Alignment Of Continuation Writing Wang Chen, Liao Minpeng, Huang Zhongqiang, Lu Jinliang, Wu Junhong, Liu Yuchen, Zong Chengqing, Zhang Jiajun
- Cue-cot: Chain-of-thought Prompting For Responding To In-depth Dialogue Questions With Llms Wang Hongru, Wang Rui, Mi Fei, Deng Yang, Wang Zezhong, Liang Bin, Xu Ruifeng, Wong Kam-fai
- Boosting Language Models Reasoning With Chain-of-knowledge Prompting Wang Jianing, Sun Qiushi, Li Xiang, Gao Ming
- Knowledgeable In-context Tuning: Exploring And Exploiting Factual Knowledge For In-context Learning Wang Jianing, Wang Chengyu, Tan Chuanqi, Huang Jun, Gao Ming
- Self-prompted Chain-of-thought On Large Language Models For Open-domain Multi-hop Reasoning Wang Jinyuan, Li Junlong, Zhao Hai
- Notechat: A Dataset Of Synthetic Doctor-patient Conversations Conditioned On Clinical Notes Wang Junda, Yao Zonghai, Yang Zhichao, Zhou Huixue, Li Rumeng, Wang Xun, Xu Yucheng, Yu Hong
- Query2doc: Query Expansion With Large Language Models Wang Liang, Yang Nan, Wei Furu
- Document-level Machine Translation With Large Language Models Wang Longyue, Lyu Chenyang, Ji Tianbo, Zhang Zhirui, Yu Dian, Shi Shuming, Tu Zhaopeng
- Compositional Text-to-image Synthesis With Attention Map Control Of Diffusion Models Wang Ruichen, Chen Zekang, Chen Chen, Ma Jian, Lu Haonan, Lin Xiaodong
- Hypothesis Search: Inductive Reasoning With Language Models Wang Ruocheng, Zelikman Eric, Poesia Gabriel, Pu Yewen, Haber Nick, Goodman Noah D.
- STEPS: A Benchmark For Order Reasoning In Sequential Tasks Wang Weizhi, Wang Hong, Yan Xifeng
- All Languages Matter: On The Multilingual Safety Of Large Language Models Wang Wenxuan, Tu Zhaopeng, Chen Chang, Yuan Youliang, Huang Jen-tse, Jiao Wenxiang, Lyu Michael R.
- Instructuie: Multi-task Instruction Tuning For Unified Information Extraction Wang Xiao, Zhou Weikang, Zu Can, Xia Han, Chen Tianze, Zhang Yuansen, Zheng Rui, Ye Junjie, Zhang Qi, Gui Tao, Kang Jihua, Yang Jingsheng, Li Siyuan, Du Chunsai
- FIAT: Fusing Learning Paradigms With Instruction-accelerated Tuning Wang Xinyi, Wieting John, Clark Jonathan H.
- Promptagent: Strategic Planning With Language Models Enables Expert-level Prompt Optimization Wang Xinyuan, Li Chenxi, Wang Zhen, Bai Fan, Luo Haotian, Zhang Jiayou, Jojic Nebojsa, Xing Eric P., Hu Zhiting
- Instructta: Instruction-tuned Targeted Attack For Large Vision-language Models Wang Xunguang, Ji Zhenlan, Ma Pingchuan, Li Zongjie, Wang Shuai
- Adding Guardrails To Advanced Chatbots Wang Yanchen, Singh Lisa
- Hint-enhanced In-context Learning Wakes Large Language Models Up For Knowledge-intensive Tasks Wang Yifan, Guo Qingyan, Ni Xinzhe, Shi Chufan, Liu Lemao, Jiang Haiyun, Yang Yujiu
- Universality And Limitations Of Prompt Tuning Wang Yihan, Chauhan Jatin, Wang Wei, Hsieh Cho-jui
- Primacy Effect Of Chatgpt Wang Yiwei, Cai Yujun, Chen Muhao, Liang Yuxuan, Hooi Bryan
- Vaquita: Enhancing Alignment In Llm-assisted Video Understanding Wang Yizhou, Zhang Ruiyi, Wang Haoliang, Bhattacharya Uttaran, Fu Yun, Wu Gang
- Towards Improving Document Understanding: An Exploration On Text-grounding Via Mllms Wang Yonghui, Zhou Wengang, Feng Hao, Zhou Keyi, Li Houqiang
- DRDT: Dynamic Reflection With Divergent Thinking For Llm-based Sequential Recommendation Wang Yu, Liu Zhiwei, Zhang Jianguo, Yao Weiran, Heinecke Shelby, Yu Philip S.
- Learning Hierarchical Prompt With Structured Linguistic Knowledge For Vision-language Models Wang Yubin, Jiang Xinyang, Cheng De, Li Dongsheng, Zhao Cairong
- Metacognitive Prompting Improves Understanding In Large Language Models Wang Yuqing, Zhao Yun
- Mediagpt: A Large Language Model For Chinese Media Wang Zhonghao, Lu Zijia, Jin Bo, Deng Haiying
- Empowering Few-shot Recommender Systems With Large Language Models -- Enhanced Representations Wang Zhoumeng
- Multiqg-ti: Towards Question Generation From Multi-modal Sources Wang Zichao, Baraniuk Richard
- Enabling Language Models To Implicitly Learn Self-improvement Wang Ziqi, Hou Le, Lu Tianjian, Wu Yuexin, Li Yunxuan, Yu Hongkun, Ji Heng
- Filling The Image Information Gap For VQA: Prompting Large Language Models To Proactively Ask Questions Wang Ziyue, Chen Chi, Li Peng, Liu Yang
- The ICL Consistency Test Weber Lucas, Bruni Elia, Hupkes Dieuwke
- Mind The Instructions: A Holistic Evaluation Of Consistency And Interactions In Prompt-based Learning Weber Lucas, Bruni Elia, Hupkes Dieuwke
- Evaluation Is All You Need. Prompting Generative Large Language Models For Annotation Tasks In The Social Sciences. A Primer Using Open Models Weber Maximilian, Reichardt Merle
- Jailbroken: How Does LLM Safety Training Fail? Wei Alexander, Haghtalab Nika, Steinhardt Jacob
- Lmsanitator: Defending Prompt-tuning Against Task-agnostic Backdoors Wei Chengkun, Meng Wenlong, Zhang Zhikun, Chen Min, Zhao Minghu, Fang Wenjing, Wang Lei, Zhang Zihui, Chen Wenzhi
- Larger Language Models Do In-context Learning Differently Wei Jerry, Wei Jason, Tay Yi, Tran Dustin, Webson Albert, Lu Yifeng, Chen Xinyun, Liu Hanxiao, Huang Da, Zhou Denny, Ma Tengyu
- Leveraging Large Language Models To Power Chatbots For Collecting User Self-reported Data Wei Jing, Kim Sungdong, Jung Hyunhoon, Kim Young-ho
- Menatqa: A New Dataset For Testing The Temporal Comprehension And Reasoning Abilities Of Large Language Models Wei Yifan, Su Yisong, Ma Huanhuan, Yu Xiaoyan, Lei Fangyu, Zhang Yuanzhe, Zhao Jun, Liu Kang
- Jailbreak And Guard Aligned Language Models With Only Few In-context Demonstrations Wei Zeming, Wang Yifei, Li Ang, Mo Yichuan, Wang Yisen
- In-context Pretraining: Language Modeling Beyond Document Boundaries Weijia Shi, Sewon Min, Maria Lomeli, Chunting Zhou, Margaret Li, Gergely Szilvasy, Rich James, Xi Victoria Lin, Noah A. Smith, Luke Zettlemoyer, Scott Yih, Mike Lewis
- "According To ...": Prompting Language Models Improves Quoting From Pre-training Data Weller Orion, Marone Marc, Weir Nathaniel, Lawrie Dawn, Khashabi Daniel, Van Durme Benjamin
- Infovisdial: An Informative Visual Dialogue Dataset By Bridging Large Multimodal And Language Models Wen Bingbing, Yang Zhengyuan, Wang Jianfeng, Gan Zhe, Howe Bill, Wang Lijuan
- Droidbot-gpt: Gpt-powered UI Automation For Android Wen Hao, Wang Hongming, Liu Jiaxuan, Li Yuanchun
- Unveiling The Implicit Toxicity In Large Language Models Wen Jiaxin, Ke Pei, Sun Hao, Zhang Zhexin, Li Chengfei, Bai Jinfeng, Huang Minlie
- Mindmap: Knowledge Graph Prompting Sparks Graph Of Thoughts In Large Language Models Wen Yilin, Wang Zifeng, Sun Jimeng
- GROVE: A Retrieval-augmented Complex Story Generation Framework With A Forest Of Evidence Wen Zhihua, Tian Zhiliang, Wu Wei, Yang Yuxin, Shi Yanqi, Huang Zhen, Li Dongsheng
- Helping Language Models Learn More: Multi-dimensional Task Prompt For Few-shot Tuning Weng Jinta, Zhang Jiarui, Hu Yue, Fa Daidong, Xuand Xiaofeng, Huang Heyan
- Is Chatgpt A Good Translator? Yes With GPT-4 As The Engine Wenxiang Jiao, Wenxuan Wang, Jen-tse Huang, Xing Wang, Shuming Shi, Zhaopeng Tu
- AI And The FCI: Can Chatgpt Project An Understanding Of Introductory Physics? West Colin G.
- Exploring Parameter-efficient Fine-tuning Techniques For Code Generation With Large Language Models Weyssow Martin, Zhou Xin, Kim Kisub, Lo David, Sahraoui Houari
- Better Question-answering Models On A Budget Wijeratne Yudhanjaya, Marikar Ishan
- Think Twice: Perspective-taking Improves Large Language Models' Theory-of-mind Capabilities Wilf Alex, Lee Sihyun Shawn, Liang Paul Pu, Morency Louis-philippe
- Cases Of EFL Secondary Students' Prompt Engineering Pathways To Complete A Writing Task With Chatgpt Woo David James, Guo Kai, Susanto Hengky
- Language Prompt For Autonomous Driving Wu Dongming, Han Wencheng, Wang Tiancai, Liu Yingfei, Zhang Xiangyu, Shen Jianbing
- Speechgen: Unlocking The Generative Power Of Speech Language Models With Prompts Wu Haibin, Chang Kai-wei, Wu Yuan-kuei, Lee Hung-yi
- Large Language Models Leverage External Knowledge To Extend Clinical Insight Beyond Language Boundaries Wu Jiageng, Wu Xian, Qiu Zhaopeng, Li Minghui, Zhang Yingying, Zheng Yefeng, Yuan Changzheng, Yang Jie
- Approximated Prompt Tuning For Vision-language Pre-trained Models Wu Qiong, Huang Shubin, Zhou Yiyi, Dai Pingyang, Shu Annan, Jiang Guannan, Ji Rongrong
- Analyzing Chain-of-thought Prompting In Large Language Models Via Gradient-based Feature Attributions Wu Skyler, Shen Eric Meng, Badrinath Charumathi, Ma Jiaqi, Lakkaraju Himabindu
- Self-correcting Llm-controlled Diffusion Models Wu Tsung-han, Lian Long, Gonzalez Joseph E., Li Boyi, Darrell Trevor
- Paragraph-to-image Generation With Information-enriched Diffusion Model Wu Weijia, Li Zhuang, He Yefei, Shou Mike Zheng, Shen Chunhua, Cheng Lele, Li Yan, Gao Tingting, Zhang Di, Wang Zhongyuan
- From Language Modeling To Instruction Following: Understanding The Behavior Shift In Llms After Instruction Tuning Wu Xuansheng, Yao Wenlin, Chen Jianshu, Pan Xiaoman, Wang Xiaoyang, Liu Ninghao, Yu Dong
- An Early Evaluation Of GPT-4V(ision) Wu Yang, Wang Shilong, Yang Hao, Zheng Tian, Zhang Hongbo, Zhao Yanyan, Qin Bing
- Improving Cross-task Generalization With Step-by-step Instructions Wu Yang, Zhao Yanyan, Li Zhongyang, Qin Bing, Xiong Kai
- Retrieve-rewrite-answer: A Kg-to-text Enhanced Llms Framework For Knowledge Graph Question Answering Wu Yike, Hu Nan, Bi Sheng, Qi Guilin, Ren Jie, Xie Anhuan, Song Wei
- Mathchat: Converse To Tackle Challenging Math Problems With LLM Agents Wu Yiran, Jia Feiran, Zhang Shaokun, Li Hangyu, Zhu Erkang, Wang Yue, Lee Yin Tat, Peng Richard, Wu Qingyun, Wang Chi
- Jailbreaking GPT-4V Via Self-adversarial Attacks With System Prompts Wu Yuanwei, Li Xiang, Liu Yixin, Zhou Pan, Sun Lichao
- Semantic Parsing By Large Language Models For Intricate Updating Strategies Of Zero-shot Dialogue State Tracking Wu Yuxiang, Dong Guanting, Xu Weiran
- Get An A In Math: Progressive Rectification Prompting Wu Zhenyu, Jiang Meng, Shen Chao
- Openicl: An Open-source Framework For In-context Learning Wu Zhenyu, Wang Yaoxiang, Ye Jiacheng, Feng Jiangtao, Xu Jingjing, Qiao Yu, Wu Zhiyong
- A Simple Baseline For Knowledge-based Visual Question Answering Xenos Alexandros, Stafylakis Themos, Patras Ioannis, Tzimiropoulos Georgios
- Towards Open-world Recommendation With Knowledge Augmentation From Large Language Models Xi Yunjia, Liu Weiwen, Lin Jianghao, Cai Xiaoling, Zhu Hong, Zhu Jieming, Chen Bo, Tang Ruiming, Zhang Weinan, Zhang Rui, Yu Yong
- LLMGA: Multimodal Large Language Model Based Generation Assistant Xia Bin, Wang Shiyin, Tao Yingfan, Wang Yitong, Jia Jiaya
- Conversational Automated Program Repair Xia Chunqiu Steven, Zhang Lingming
- O3D: Offline Data-driven Discovery And Distillation For Sequential Decision-making With Large Language Models Xiao Yuchen, Sun Yanchao, Xu Mengda, Madhushani Udari, Vann Jared, Garg Deepeka, Ganesh Sumitra
- Instruction-vit: Multi-modal Prompts For Instruction Learning In Vit Xiao Zhenxiang, Chen Yuzhong, Zhang Lu, Yao Junjie, Wu Zihao, Yu Xiaowei, Pan Yi, Zhao Lin, Ma Chong, Liu Xinyu, Liu Wei, Li Xiang, Yuan Yixuan, Shen Dinggang, Zhu Dajiang, Liu Tianming, Jiang Xi
- Supporting Qualitative Analysis With Large Language Models: Combining Codebook With GPT-3 For Deductive Coding Xiao Ziang, Yuan Xingdi, Liao Q. Vera, Abdelghani Rania, Oudeyer Pierre-yves
- Instructed Language Models With Retrievers Are Powerful Entity Linkers Xiao Zilin, Gong Ming, Wu Jie, Zhang Xingyao, Shou Linjun, Pei Jian, Jiang Daxin
- The Wall Street Neophyte: A Zero-shot Analysis Of Chatgpt Over Multimodal Stock Movement Prediction Challenges Xie Qianqian, Han Weiguang, Lai Yanzhao, Peng Min, Huang Jimin
- Empirical Study Of Zero-shot NER With Chatgpt Xie Tingyu, Li Qi, Zhang Jian, Zhang Yan, Liu Zuozhu, Wang Hongwei
- Translating Natural Language To Planning Goals With Large-language Models Xie Yaqi, Yu Chen, Zhu Tongyao, Bai Jinbin, Gong Ze, Soh Harold
- Olagpt: Empowering Llms With Human-like Problem-solving Abilities Xie Yuanzhen, Xie Tao, Lin Mingxiong, Wei Wentao, Li Chenglin, Kong Beibei, Chen Lei, Zhuo Chengxiang, Hu Bo, Li Zang
- A Prompt Log Analysis Of Text-to-image Generation Systems Xie Yutong, Pan Zhaoying, Ma Jinge, Jie Luo, Mei Qiaozhu
- The Next Chapter: A Study Of Large Language Models In Storytelling Xie Zhuohan, Cohn Trevor, Lau Jey Han
- A Preliminary Study Of Chatgpt On News Recommendation: Personalization, Provider Fairness, Fake News Xinyi Li, Yongfeng Zhang, Edward C. Malthouse
- Can Llms Express Their Uncertainty? An Empirical Evaluation Of Confidence Elicitation In Llms Xiong Miao, Hu Zhiyuan, Lu Xinyang, Li Yifei, Fu Jie, He Junxian, Hooi Bryan
- Expertprompting: Instructing Large Language Models To Be Distinguished Experts Xu Benfeng, Yang An, Lin Junyang, Wang Quan, Zhou Chang, Zhang Yongdong, Mao Zhendong
- Gentopia: A Collaborative Platform For Tool-augmented Llms Xu Binfeng, Liu Xukun, Shen Hua, Han Zeyu, Li Yuhan, Yue Murong, Peng Zhiyuan, Liu Yuchen, Yao Ziyu, Xu Dongkuan
- Rewoo: Decoupling Reasoning From Observations For Efficient Augmented Language Models Xu Binfeng, Peng Zhiyuan, Lei Bowen, Mukherjee Subhabrata, Liu Yuchen, Xu Dongkuan
- Small Models Are Valuable Plug-ins For Large Language Models Xu Canwen, Xu Yichong, Wang Shuohang, Liu Yang, Zhu Chenguang, Mcauley Julian
- DPL: Decoupled Prompt Learning For Vision-language Models Xu Chen, Zhu Yuhan, Zhang Guozhen, Shen Haocheng, Liao Yixuan, Chen Xiaoxin, Wu Gangshan, Wang Limin
- On The Tool Manipulation Capability Of Open-source Large Language Models Xu Qiantong, Hong Fenglu, Li Bo, Hu Changran, Chen Zhengyu, Zhang Jian
- Knowledge-infused Prompting: Assessing And Advancing Clinical Text Data Generation With Large Language Models Xu Ran, Cui Hejie, Yu Yue, Kan Xuan, Shi Wenqi, Zhuang Yuchen, Jin Wei, Ho Joyce, Yang Carl
- Re-reading Improves Reasoning In Large Language Models Xu Xiaohan, Tao Chongyang, Shen Tao, Xu Can, Xu Hongbo, Long Guodong, Lou Jian-guang
- An LLM Can Fool Itself: A Prompt-based Adversarial Attack Xu Xilie, Kong Keyi, Liu Ning, Cui Lizhen, Wang Di, Zhang Jingfeng, Kankanhalli Mohan
- Mental-llm: Leveraging Large Language Models For Mental Health Prediction Via Online Text Data Xu Xuhai, Yao Bingsheng, Dong Yuanzhe, Gabriel Saadia, Yu Hong, Hendler James, Ghassemi Marzyeh, Dey Anind K., Wang Dakuo
- Compress, Then Prompt: Improving Accuracy-efficiency Trade-off Of LLM Inference With Transferable Prompt Xu Zhaozhuo, Liu Zirui, Chen Beidi, Tang Yuxin, Wang Jue, Zhou Kaixiong, Hu Xia, Shrivastava Anshumali
- Latent Skill Discovery For Chain-of-thought Reasoning Xu Zifan, Wang Haozhu, Bespalov Dmitriy, Stone Peter, Qi Yanjun
- How Robust Is GPT-3.5 To Predecessors? A Comprehensive Study On Language Understanding Tasks Xuanting Chen, Junjie Ye, Can Zu, Nuo Xu, Rui Zheng, Minlong Peng, Jie Zhou, Tao Gui, Qi Zhang, Xuanjing Huang
- Representation Learning With Large Language Models For Recommendation Xubin Ren, Wei Wei, Lianghao Xia, Lixin Su, Suqi Cheng, Junfeng Wang, Dawei Yin, Chao Huang
- Occuquest: Mitigating Occupational Bias For Inclusive Large Language Models Xue Mingfeng, Liu Dayiheng, Yang Kexin, Dong Guanting, Lei Wenqiang, Yuan Zheng, Zhou Chang, Zhou Jingren
- Parameter-efficient Tuning Helps Language Model Alignment Xue Tianci, Wang Ziqi, Ji Heng
- RCOT: Detecting And Rectifying Factual Inconsistency In Reasoning By Reversing Chain-of-thought Xue Tianci, Wang Ziqi, Wang Zhenhailong, Han Chi, Yu Pengfei, Ji Heng
- Pretraining Data Mixtures Enable Narrow Model Selection Capabilities In Transformer Models Yadlowsky Steve, Doshi Lyric, Tripuraneni Nilesh
- Optimizing Machine Translation Through Prompt Engineering: An Investigation Into Chatgpt's Customizability Yamada Masaru
- LPML: Llm-prompting Markup Language For Mathematical Reasoning Yamauchi Ryutaro, Sonoda Sho, Sannai Akiyoshi, Kumagai Wataru
- Understanding In-context Learning From Repetitions Yan Jianhao, Xu Jin, Song Chiyu, Wu Chenming, Li Yafu, Zhang Yue
- Predicting Text Preference Via Structured Comparative Reasoning Yan Jing Nathan, Liu Tianqi, Chiu Justin T, Shen Jiaming, Qin Zhen, Yu Yue, Zhao Yao, Lakshmanan Charu, Kurzion Yair, Rush Alexander M., Liu Jialu, Bendersky Michael
- Backdooring Instruction-tuned Large Language Models With Virtual Prompt Injection Yan Jun, Yadav Vikas, Li Shiyang, Chen Lichang, Tang Zheng, Wang Hai, Srinivasan Vijay, Ren Xiang, Jin Hongxia
- Refining The Responses Of Llms By Themselves Yan Tianqiang, Xu Tiansheng
- Ask More, Know Better: Reinforce-learned Prompt Questions For Decision Making With Large Language Models Yan Xue, Song Yan, Cui Xinyu, Christianos Filippos, Zhang Haifeng, Mguni David Henry, Wang Jun
- Multimodal Chatgpt For Medical Applications: An Experimental Study Of GPT-4V Yan Zhiling, Zhang Kai, Zhou Rong, He Lifang, Li Xiang, Sun Lichao
- Can You Follow Me? Testing Situational Understanding In Chatgpt Yang Chenghao, Ettinger Allyson
- Knowledge-prompted Estimator: A Novel Approach To Explainable Machine Translation Assessment Yang Hao, Zhang Min, Tao Shimin, Wang Minghan, Wei Daimeng, Jiang Yanfei
- Iterative Forward Tuning Boosts In-context Learning In Language Models Yang Jiaxi, Hui Binyuan, Yang Min, Wang Bailin, Li Bowen, Li Binhua, Huang Fei, Li Yongbin
- Unified Language-vision Pretraining In LLM With Dynamic Discrete Visual Tokenization Yang Jin, Kun Xu, Kun Xu, Liwei Chen, Chao Liao, Jianchao Tan, Quzhe Huang, Bin Chen, Chenyi Lei, An Liu, Chengru Song, Xiaoqiang Lei, Di Zhang, Wenwu Ou, Kun Gai, Yadong Mu
- Towards Interpretable Mental Health Analysis With Large Language Models Yang Kailai, Ji Shaoxiong, Zhang Tianlin, Xie Qianqian, Kuang Ziyan, Ananiadou Sophia
- Good Questions Help Zero-shot Image Reasoning Yang Kaiwen, Shen Tao, Tian Xinmei, Geng Xiubo, Tao Chongyang, Tao Dacheng, Zhou Tianyi
- Supervised Knowledge Makes Large Language Models Better In-context Learners Yang Linyi, Zhang Shuibai, Yu Zhuohao, Bao Guangsheng, Wang Yidong, Wang Jindong, Xu Ruochen, Ye Wei, Xie Xing, Chen Weizhu, Zhang Yue
- Psycot: Psychological Questionnaire As Powerful Chain-of-thought For Personality Detection Yang Tao, Shi Tianyuan, Wan Fanqi, Quan Xiaojun, Wang Qifan, Wu Bingzhe, Wu Jiaxiang
- Shadow Alignment: The Ease Of Subverting Safely-aligned Language Models Yang Xianjun, Wang Xiao, Zhang Qi, Petzold Linda, Wang William Yang, Zhao Xun, Lin Dahua
- Exploring An LM To Generate Prolog Predicates From Mathematics Questions Yang Xiaocheng, Tam Yik-cheung
- Human-in-the-loop Machine Translation With Large Language Model Yang Xinyi, Zhan Runzhe, Wong Derek F., Wu Junchao, Chao Lidia S.
- On The Calibration Of Multilingual Question Answering Llms Yang Yahan, Dan Soham, Roth Dan, Lee Insup
- Harnessing The Power Of Large Language Models For Natural Language To First-order Logic Translation Yang Yuan, Xiong Siheng, Payani Ali, Shareghi Ehsan, Fekri Faramarz
- Improving The Reliability Of Large Language Models By Leveraging Uncertainty-aware In-context Learning Yang Yuchen, Li Houqiang, Wang Yanfeng, Wang Yu
- Enhance Reasoning Ability Of Visual-language Models Via Large Language Models Yang Yueting, Zhang Xintong, Han Wenjuan
- Failures Pave The Way: Enhancing Large Language Models Through Tuning-free Rule Accumulation Yang Zeyuan, Li Peng, Liu Yang
- Alignedcot: Prompting Large Language Models Via Native-speaking Demonstrations Yang Zhicheng, Huang Yinya, Xiong Jing, Feng Liang, Liang Xiaodan, Wang Yiwei, Tang Jing
- More Samples Or More Prompts? Exploring Effective In-context Sampling For LLM Few-shot Prompt Engineering Yao Bingsheng, Chen Guiming, Zou Ruishi, Lu Yuxuan, Li Jiachen, Zhang Shao, Sang Yisi, Liu Sijia, Hendler James, Wang Dakuo
- Benchmarking Llm-based Machine Translation On Cultural Awareness Yao Binwei, Jiang Ming, Yang Diyi, Hu Junjie
- Improving Language Model Negotiation With Self-play And In-context Learning From AI Feedback Yao Fu, Hao Peng, Tushar Khot, Mirella Lapata
- TCP: Textual-based Class-aware Prompt Tuning For Visual-language Model Yao Hantao, Zhang Rui, Xu Changsheng
- Poisonprompt: Backdoor Attack On Prompt-based Large Language Models Yao Hongwei, Lou Jian, Qin Zhan
- Promptcare: Prompt Copyright Protection By Watermark Injection And Verification Yao Hongwei, Lou Jian, Ren Kui, Qin Zhan
- LLM Lies: Hallucinations Are Not Bugs, But Features As Adversarial Examples Yao Jia-yu, Ning Kun-peng, Liu Zhen-hui, Ning Mu-nan, Liu Yu-yang, Yuan Li
- Knowledge Plugins: Enhancing Large Language Models For Domain-specific Recommendations Yao Jing, Xu Wei, Lian Jianxun, Wang Xiting, Yi Xiaoyuan, Xie Xing
- Retroformer: Retrospective Large Language Agents With Policy Gradient Optimization Yao Weiran, Heinecke Shelby, Niebles Juan Carlos, Liu Zhiwei, Feng Yihao, Xue Le, Murthy Rithesh, Chen Zeyuan, Zhang Jianguo, Arpit Devansh, Xu Ran, Mui Phil, Wang Huan, Xiong Caiming, Savarese Silvio
- Do Physicians Know How To Prompt? The Need For Automatic Prompt Optimization Help In Clinical Note Generation Yao Zonghai, Jaafar Ahmed, Wang Beining, Yang Zhichao, Yu Hong
- Collaborative Large Language Model For Recommender Systems Yaochen Zhu, Liang Wu, Qi Guo, Liangjie Hong, Jundong Li
- Studying And Improving Reasoning In Humans And Machines Yax Nicolas, Anlló Hernan, Palminteri Stefano
- Generating Data For Symbolic Language With Large Language Models Ye Jiacheng, Li Chengzu, Kong Lingpeng, Yu Tao
- Diffusion Language Models Can Perform Many Tasks With Scaling And Instruction-finetuning Ye Jiasheng, Zheng Zaixiang, Bao Yu, Qian Lihua, Gu Quanquan
- Assessing Step-by-step Reasoning Against Lexical Negation: A Case Study On Syllogism Ye Mengyu, Kuribayashi Tatsuki, Suzuki Jun, Kobayashi Goro, Funayama Hiroaki
- Investigating The Effectiveness Of Task-agnostic Prefix Prompt For Instruction Following Ye Seonghyeon, Hwang Hyeonbin, Yang Sohee, Yun Hyeongu, Kim Yireun, Seo Minjoon
- Satlm: Satisfiability-aided Language Models Using Declarative Prompting Ye Xi, Chen Qiaochu, Dillig Isil, Durrett Greg
- Effective Large Language Model Adaptation For Improved Grounding And Citation Generation Ye Xi, Sun Ruoxi, Arik Sercan Ö., Pfister Tomas
- Navigating Text-to-image Customization: From Lycoris Fine-tuning To Model Evaluation Yeh Shih-ying, Hsieh Yu-guan, Gao Zhidong, Yang Bernard B W, Oh Giyeong, Gong Yanmin
- A Multitask, Multilingual, Multimodal Evaluation Of Chatgpt On Reasoning, Hallucination, And Interactivity Yejin Bang, Samuel Cahyawijaya, Nayeon Lee, Wenliang Dai, Dan Su, Bryan Wilie, Holy Lovenia, Ziwei Ji, Tiezheng Yu, Willy Chung, Quyet V. Do, Yan Xu, Pascale Fung
- Llm-eval: Unified Multi-dimensional Automatic Evaluation For Open-domain Conversations With Large Language Models Yen-ting Lin, Yun-nung Chen
- Cones: Concept Embedding Search For Parameter Efficient Tuning Large Vision Language Models Yi Huahui, Qin Ziyuan, Xu Wei, Guo Miaotian, Wang Kun, Zhang Shaoting, Li Kang, Lao Qicheng
- Benchmarking And Defending Against Indirect Prompt Injection Attacks On Large Language Models Yi Jingwei, Xie Yueqi, Zhu Bin, Kiciman Emre, Sun Guangzhong, Xie Xing, Wu Fangzhao
- Do Large Language Models Know What They Don't Know? Yin Zhangyue, Sun Qiushi, Guo Qipeng, Wu Jiawen, Qiu Xipeng, Huang Xuanjing
- Intuitive Or Dependent? Investigating Llms' Behavior Style To Conflicting Prompts Ying Jiahao, Cao Yixin, Xiong Kai, He Yidong, Cui Long, Liu Yongbin
- Prompting Multilingual Large Language Models To Generate Code-mixed Texts: The Case Of South East Asian Languages Yong Zheng-xin, Zhang Ruochen, Forde Jessica Zosa, Wang Skyler, Subramonian Arjun, Lovenia Holy, Cahyawijaya Samuel, Winata Genta Indra, Sutawika Lintang, Cruz Jan Christian Blaise, Tan Yin Lin, Phan Long, Garcia Rowena, Solorio Thamar, Aji Alham Fikri
- Fundamental Limitations Of Alignment In Large Language Models Yotam Wolf, Noam Wies, Oshri Avnery, Yoav Levine, Amnon Shashua
- \(P^{3}O\): Transferring Visual Representations For Reinforcement Learning Via Prompting You Guoliang, Chu Xiaomeng, Duan Yifan, Peng Jie, Ji Jianmin, Zhang Yu, Zhang Yanyong
- Decoding In-context Learning: Neuroscience-inspired Analysis Of Representations In Large Language Models Yousefi Safoora, Betthauser Leo, Hasanbeig Hosein, Millière Raphaël, Momennejad Ida
- GPTFUZZER: Red Teaming Large Language Models With Auto-generated Jailbreak Prompts Yu Jiahao, Lin Xingwei, Yu Zheng, Xing Xinyu
- Assessing Prompt Injection Risks In 200+ Custom Gpts Yu Jiahao, Wu Yuhang, Shu Dong, Jin Mingyu, Yang Sabrina, Xing Xinyu
- Information Association For Language Model Updating By Mitigating Lm-logical Discrepancy Yu Pengfei, Ji Heng
- Seqgpt: An Out-of-the-box Large Language Model For Open Domain Sequence Understanding Yu Tianyu, Jiang Chengyue, Lou Chao, Huang Shen, Wang Xiaobin, Liu Wei, Cai Jiong, Li Yangning, Li Yinghui, Tu Kewei, Zheng Hai-tao, Zhang Ningyu, Xie Pengjun, Huang Fei, Jiang Yong
- Prompt-based Monte-carlo Tree Search For Goal-oriented Dialogue Policy Planning Yu Xiao, Chen Maximillian, Yu Zhou
- Teaching Language Models To Self-improve Through Interactive Demonstrations Yu Xiao, Peng Baolin, Galley Michel, Gao Jianfeng, Yu Zhou
- Reeval: Automatic Hallucination Evaluation For Retrieval-augmented Large Language Models Via Transferable Adversarial Attacks Yu Xiaodong, Cheng Hao, Liu Xiaodong, Roth Dan, Gao Jianfeng
- Explanation-aware Soft Ensemble Empowers Large Language Model In-context Learning Yu Yue, Shen Jiaming, Liu Tianqi, Qin Zhen, Yan Jing Nathan, Liu Jialu, Zhang Chao, Bendersky Michael
- Large Language Model As Attributed Training Data Generator: A Tale Of Diversity And Bias Yu Yue, Zhuang Yuchen, Zhang Jieyu, Meng Yu, Ratner Alexander, Krishna Ranjay, Shen Jiaming, Zhang Chao
- Prophet: Prompting Large Language Models With Complementary Answer Heuristics For Knowledge-based Visual Question Answering Yu Zhou, Ouyang Xuecheng, Shao Zhenwei, Wang Meng, Yu Jun
- Towards Better Chain-of-thought Prompting Strategies: A Survey Yu Zihan, He Liang, Wu Zhen, Dai Xinyu, Chen Jiajun
- CRAFT: Customizing Llms By Creating And Retrieving From Specialized Toolsets Yuan Lifan, Chen Yangyi, Wang Xingyao, Fung Yi R., Peng Hao, Ji Heng
- Batcheval: Towards Human-like Text Evaluation Yuan Peiwen, Feng Shaoxiong, Li Yiwei, Wang Xinglin, Pan Boyuan, Wang Heda, Li Kan
- Prompt Highlighter: Interactive Control For Multi-modal Llms Yuechen Zhang, Shengju Qian, Bohao Peng, Shu Liu, Jiaya Jia
- Attention Satisfies: A Constraint-satisfaction Lens On Factual Errors Of Language Models Yuksekgonul Mert, Chandrasekaran Varun, Jones Erik, Gunasekar Suriya, Naik Ranjita, Palangi Hamid, Kamar Ece, Nushi Besmira
- Empowering Private Tutoring By Chaining Large Language Models Yulin Chen, Ning Ding, Hai-tao Zheng, Zhiyuan Liu, Maosong Sun, Bowen Zhou
- Chat-rec: Towards Interactive And Explainable Llms-augmented Recommender System Yunfan Gao, Tao Sheng, Youlin Xiang, Yun Xiong, Haofen Wang, Jiawei Zhang
- On Evaluating Adversarial Robustness Of Large Vision-language Models Yunqing Zhao, Tianyu Pang, Chao Du, Xiao Yang, Chongxuan Li, Ngai-man Cheung, Min Lin
- Large Language Models Are Zero-shot Rankers For Recommender Systems Yupeng Hou, Junjie Zhang, Zihan Lin, Hongyu Lu, Ruobing Xie, Julian Mcauley, Wayne Xin Zhao
- Pre-training To Learn In Context Yuxian Gu, Li Dong, Furu Wei, Minlie Huang
- Chain-of-thought Prompting Under Streaming Batch: A Case Study Yuxin Tang
- Mascqa: A Question Answering Dataset For Investigating Materials Science Knowledge Of Large Language Models Zaki Mohd, Jayadeva, Mausam, Krishnan N. M. Anoop
- What Matters In Training A Gpt4-style Language Model With Multimodal Inputs? Zeng Yan, Zhang Hanbo, Zheng Jiani, Xia Jiangnan, Wei Guoqiang, Wei Yang, Zhang Yuchen, Kong Tao
- Evaluating Large Language Models At Evaluating Instruction Following Zeng Zhiyuan, Yu Jiatong, Gao Tianyu, Meng Yu, Goyal Tanya, Chen Danqi
- Is Chatgpt A Good Sentiment Analyzer? A Preliminary Study Zengzhi Wang, Qiming Xie, Yi Feng, Zixiang Ding, Zinong Yang, Rui Xia
- Vpgtrans: Transfer Visual Prompt Generator Across Llms Zhang Ao, Fei Hao, Yao Yuan, Ji Wei, Li Li, Liu Zhiyuan, Chua Tat-seng
- Evaluating And Improving Tool-augmented Computation-intensive Math Reasoning Zhang Beichen, Zhou Kun, Wei Xilin, Zhao Wayne Xin, Sha Jing, Wang Shijin, Wen Ji-rong
- Cot-bert: Enhancing Unsupervised Sentence Representation Through Chain-of-thought Zhang Bowen, Chang Kehua, Li Chunping
- A Simple LLM Framework For Long-range Video Question-answering Zhang Ce, Lu Taixi, Islam Md Mohaiminul, Wang Ziyang, Yu Shoubin, Bansal Mohit, Bertasius Gedas
- Mm-narrator: Narrating Long-form Videos With Multimodal In-context Learning Zhang Chaoyi, Lin Kevin, Yang Zhengyuan, Wang Jianfeng, Li Linjie, Lin Chung-ching, Liu Zicheng, Wang Lijuan
- On The Safety Of Open-sourced Large Language Models: Does Alignment Really Prevent Them From Being Misused? Zhang Hangfan, Guo Zhimeng, Zhu Huaisheng, Cao Bochuan, Lin Lu, Jia Jinyuan, Chen Jinghui, Wu Dinghao
- Expressive TTS Driven By Natural Language Prompts Using Few Human Annotations Zhang Hanglei, Guo Yiwei, Liu Sen, Chen Xie, Yu Kai
- Efficient Toxic Content Detection By Bootstrapping And Distilling Large Language Models Zhang Jiang, Wu Qiong, Xu Yiming, Cao Cheng, Du Zheng, Psounis Konstantinos
- Glm-dialog: Noise-tolerant Pre-training For Knowledge-grounded Dialogue Generation Zhang Jing, Zhang Xiaokang, Zhang-li Daniel, Yu Jifan, Yao Zijun, Ma Zeyao, Xu Yiqi, Wang Haohua, Zhang Xiaohan, Lin Nianyi, Lu Sunrui, Li Juanzi, Tang Jie
- The Potential And Pitfalls Of Using A Large Language Model Such As Chatgpt Or GPT-4 As A Clinical Assistant Zhang Jingqing, Sun Kai, Jagadeesh Akshay, Ghahfarokhi Mahta, Gupta Deepa, Gupta Ashok, Gupta Vibhor, Guo Yike
- Weighted Sampling For Masked Language Modeling Zhang Linhan, Chen Qian, Wang Wen, Deng Chong, Cao Xin, Hao Kongzhang, Jiang Yuxin, Wang Wei
- Can LLM Find The Green Circle? Investigation And Human-guided Tool Manipulation For Compositional Generalization Zhang Min, He Jianfeng, Lei Shuo, Yue Murong, Wang Linhang, Lu Chang-tien
- Ask An Expert: Leveraging Language Models To Improve Strategic Reasoning In Goal-oriented Dialogue Models Zhang Qiang, Naradowsky Jason, Miyao Yusuke
- Knowgpt: Knowledge Graph Based Prompting For Large Language Models Zhang Qinggang, Dong Junnan, Chen Hao, Zha Daochen, Yu Zailiang, Huang Xiao
- Tell Your Model Where To Attend: Post-hoc Attention Steering For Llms Zhang Qingru, Singh Chandan, Liu Liyuan, Liu Xiaodong, Yu Bin, Gao Jianfeng, Zhao Tuo
- Llama-adapter: Efficient Fine-tuning Of Language Models With Zero-init Attention Zhang Renrui, Han Jiaming, Liu Chris, Gao Peng, Zhou Aojun, Hu Xiangfei, Yan Shilin, Lu Pan, Li Hongsheng, Qiao Yu
- Generation-driven Contrastive Self-training For Zero-shot Text Classification With Instruction-following LLM Zhang Ruohong, Wang Yau-shian, Yang Yiming
- Automl-gpt: Automatic Machine Learning With GPT Zhang Shujian, Gong Chengyue, Wu Lemeng, Liu Xingchao, Zhou Mingyuan
- Adarefiner: Refining Decisions Of Language Models With Adaptive Feedback Zhang Wanpeng, Lu Zongqing
- Bridging The Information Gap Between Domain-specific Model And General LLM For Personalized Recommendation Zhang Wenxuan, Liu Hongzhi, Du Yingpeng, Zhu Chen, Song Yang, Zhu Hengshu, Wu Zhonghai
- Don't Trust Chatgpt When Your Question Is Not In English: A Study Of Multilingual Abilities And Types Of Llms Zhang Xiang, Li Senyu, Hauer Bradley, Shi Ning, Kondrak Grzegorz
- Large Language Models In Medical Term Classification And Unexpected Misalignment Between Response And Reasoning Zhang Xiaodan, Vemulapalli Sandeep, Talukdar Nabasmita, Ahn Sumyeong, Wang Jiankun, Meng Han, Murtaza Sardar Mehtab Bin, Dave Aakash Ajay, Leshchiner Dmitry, Joseph Dimitri F., Witteveen-lane Martin, Chesla Dave, Zhou Jiayu, Chen Bin
- SGP-TOD: Building Task Bots Effortlessly Via Schema-guided LLM Prompting Zhang Xiaoying, Peng Baolin, Li Kun, Zhou Jingyan, Meng Helen
- Llavar: Enhanced Visual Instruction Tuning For Text-rich Image Understanding Zhang Yanzhe, Zhang Ruiyi, Gu Jiuxiang, Zhou Yufan, Lipka Nedim, Yang Diyi, Sun Tong
- Making Large Language Models Perform Better In Knowledge Graph Completion Zhang Yichi, Chen Zhuo, Guo Lingbing, Xu Yajing, Zhang Wen, Chen Huajun
- Meta Prompting For AI Systems Zhang Yifan, Yuan Yang, Yao Andrew Chi-chih
- IAG: Induction-augmented Generation Framework For Answering Reasoning Questions Zhang Zhebin, Zhang Xinyu, Ren Yuanhang, Shi Saijiang, Han Meng, Wu Yongkang, Lai Ruofei, Cao Zhao
- Students' Perceptions And Preferences Of Generative Artificial Intelligence Feedback For Programming Zhang Zhengdong, Dong Zihan, Shi Yang, Matsuda Noboru, Price Thomas, Xu Dongkuan
- Reinforced UI Instruction Grounding: Towards A Generic UI Task Automation API Zhang Zhizheng, Xie Wenxuan, Zhang Xiaoyi, Lu Yan
- Responsible Task Automation: Empowering Large Language Models As Responsible Task Automators Zhang Zhizheng, Zhang Xiaoyi, Xie Wenxuan, Lu Yan
- LLM4DV: Using Large Language Models For Hardware Test Stimuli Generation Zhang Zixi, Chadwick Greg, Mcnally Hugo, Zhao Yiren, Mullins Robert
- SVIT: Scaling Up Visual Instruction Tuning Zhao Bo, Wu Boya, He Muyang, Huang Tiejun
- Large Language Models Are Complex Table Parsers Zhao Bowen, Ji Changkai, Zhang Yuejie, He Wen, Wang Yingwen, Wang Qing, Feng Rui, Zhang Xiaobo
- Explainability For Large Language Models: A Survey Zhao Haiyan, Chen Hanjie, Yang Fan, Liu Ninghao, Deng Huiqi, Cai Hengyi, Wang Shuaiqiang, Yin Dawei, Du Mengnan
- Explicit Planning Helps Language Models In Logical Reasoning Zhao Hongyu, Wang Kangrui, Yu Mo, Mei Hongyuan
- Automatic Model Selection With Large Language Models For Reasoning Zhao James Xu, Xie Yuxi, Kawaguchi Kenji, He Junxian, Xie Michael Qizhe
- In-context Exemplars As Clues To Retrieving From Large Associative Memory Zhao Jiachen
- Chatspot: Bootstrapping Multimodal Llms Via Precise Referring Instruction Tuning Zhao Liang, Yu En, Ge Zheng, Yang Jinrong, Wei Haoran, Zhou Hongyu, Sun Jianjian, Peng Yuang, Dong Runpei, Han Chunrui, Zhang Xiangyu
- Verify-and-edit: A Knowledge-enhanced Chain-of-thought Framework Zhao Ruochen, Li Xingxuan, Joty Shafiq, Qin Chengwei, Bing Lidong
- Pareto Optimal Learning For Estimating Large Language Model Errors Zhao Theodore, Wei Mu, Preston J. Samuel, Poon Hoifung
- DIVKNOWQA: Assessing The Reasoning Ability Of Llms Via Open-domain Question Answering Over Knowledge Base And Text Zhao Wenting, Liu Ye, Niu Tong, Wan Yao, Yu Philip S., Joty Shafiq, Zhou Yingbo, Yavuz Semih
- Knn-icl: Compositional Task-oriented Parsing Generalization With Nearest Neighbor In-context Learning Zhao Wenting, Liu Ye, Wan Yao, Wang Yibo, Wu Qingyang, Deng Zhongfen, Du Jiangshu, Liu Shuaiqi, Xu Yunlong, Yu Philip S.
- Mind Vs. Mouth: On Measuring Re-judge Inconsistency Of Social Bias In Large Language Models Zhao Yachao, Wang Bo, Zhao Dongming, Huang Kun, Wang Yan, He Ruifang, Hou Yuexian
- Knowing What Llms DO NOT Know: A Simple Yet Effective Self-detection Method Zhao Yukun, Yan Lingyong, Sun Weiwei, Xing Guoliang, Meng Chong, Wang Shuaiqiang, Cheng Zhicong, Ren Zhaochun, Yin Dawei
- Multi-modal In-context Learning Makes An Ego-evolving Scene Text Recognizer Zhao Zhen, Tang Jingqun, Lin Chunhui, Wu Binghong, Huang Can, Liu Hao, Tan Xin, Zhang Zhizhong, Xie Yuan
- Mllm-dataengine: An Iterative Refinement Approach For MLLM Zhao Zhiyuan, Ouyang Linke, Wang Bin, Huang Siyuan, Zhang Pan, Dong Xiaoyi, Wang Jiaqi, He Conghui
- Can We Edit Factual Knowledge By In-context Learning? Zheng Ce, Li Lei, Dong Qingxiu, Fan Yuxuan, Wu Zhiyong, Xu Jingjing, Chang Baobao
- Progressive-hint Prompting Improves Reasoning In Large Language Models Zheng Chuanyang, Liu Zhengying, Xie Enze, Li Zhenguo, Li Yu
- Lyra: Orchestrating Dual Correction In Automated Theorem Proving Zheng Chuanyang, Wang Haiming, Xie Enze, Liu Zhengying, Sun Jiankai, Xin Huajian, Shen Jianhao, Li Zhenguo, Li Yu
- Noisy Exemplars Make Large Language Models More Robust: A Domain-agnostic Behavioral Analysis Zheng Hongyi, Saparov Abulhair
- Take A Step Back: Evoking Reasoning Via Abstraction In Large Language Models Zheng Huaixiu Steven, Mishra Swaroop, Chen Xinyun, Cheng Heng-tze, Chi Ed H., Le Quoc V, Zhou Denny
- Sglang: Efficient Execution Of Structured Language Model Programs Zheng Lianmin, Yin Liangsheng, Xie Zhiqiang, Sun Chuyue, Huang Jeff, Yu Cody Hao, Cao Shiyi, Kozyrakis Christos, Stoica Ion, Gonzalez Joseph E., Barrett Clark, Sheng Ying
- Memorycompanion: A Smart Healthcare Solution To Empower Efficient Alzheimer's Care Via Unleashing Generative AI Zheng Lifei, Heo Yeonie, Fang Yi
- Trojfsp: Trojan Insertion In Few-shot Prompt Tuning Zheng Mengxin, Xue Jiaqi, Chen Xun, Wang Yanshan, Lou Qian, Jiang Lei
- Gpt-fathom: Benchmarking Large Language Models To Decipher The Evolutionary Path Towards GPT-4 And Beyond Zheng Shen, Zhang Yuyu, Zhu Yijie, Xi Chenguang, Gao Pengyang, Zhou Xun, Chang Kevin Chen-chuan
- Large Language Models Are Good Prompt Learners For Low-shot Image Classification Zheng Zhaoheng, Wei Jingmin, Hu Xuefeng, Zhu Haidong, Nevatia Ram
- Building Emotional Support Chatbots In The Era Of Llms Zheng Zhonghua, Liao Lizi, Deng Yang, Nie Liqiang
- Large Language Model Can Interpret Latent Space Of Sequential Recommender Zhengyi Yang, Jiancan Wu, Yanchen Luo, Jizhi Zhang, Yancheng Yuan, An Zhang, Xiang Wang, Xiangnan He
- MM-REACT: Prompting Chatgpt For Multimodal Reasoning And Action Zhengyuan Yang, Linjie Li, Jianfeng Wang, Kevin Lin, Ehsan Azarnasab, Faisal Ahmed, Zicheng Liu, Ce Liu, Michael Zeng, Lijuan Wang
- Llamarec: Two-stage Recommendation Using Large Language Models For Ranking Zhenrui Yue, Sara Rabhi, Gabriel De Souza Pereira Moreira, Dong Wang, Even Oldridge
- Self-polish: Enhance Reasoning In Large Language Models Via Problem Refinement Zhiheng Xi, Senjie Jin, Yuhao Zhou, Rui Zheng, Songyang Gao, Tao Gui, Qi Zhang, Xuanjing Huang
- Principle-driven Self-alignment Of Language Models From Scratch With Minimal Human Supervision Zhiqing Sun, Yikang Shen, Qinhong Zhou, Hongxin Zhang, Zhenfang Chen, David Cox, Yiming Yang, Chuang Gan
- Exploring Human-like Translation Strategy With Large Language Models Zhiwei He, Tian Liang, Wenxiang Jiao, Zhuosheng Zhang, Yujiu Yang, Rui Wang, Zhaopeng Tu, Shuming Shi, Xing Wang
- Can Chatgpt Understand Too? A Comparative Study On Chatgpt And Fine-tuned BERT Zhong Qihuang, Ding Liang, Liu Juhua, Du Bo, Tao Dacheng
- Sur-adapter: Enhancing Text-to-image Pre-trained Diffusion Models With Large Language Models Zhong Shanshan, Huang Zhongzhan, Wen Wushao, Qin Jinghui, Lin Liang
- Mquake: Assessing Knowledge Editing In Language Models Via Multi-hop Questions Zhong Zexuan, Wu Zhengxuan, Manning Christopher D., Potts Christopher, Chen Danqi
- Language Agent Tree Search Unifies Reasoning Acting And Planning In Language Models Zhou Andy, Yan Kai, Shlapentokh-rothman Michal, Wang Haohan, Wang Yu-xiong
- Solving Challenging Math Word Problems Using GPT-4 Code Interpreter With Code-based Self-verification Zhou Aojun, Wang Ke, Lu Zimu, Shi Weikang, Luo Sichun, Qin Zipeng, Lu Shaoqing, Jia Anya, Song Linqi, Zhan Mingjie, Li Hongsheng
- Batch Calibration: Rethinking Calibration For In-context Learning And Prompt Engineering Zhou Han, Wan Xingchen, Proleev Lev, Mincu Diana, Chen Jilin, Heller Katherine, Roy Subhrajit
- Survival Of The Most Influential Prompts: Efficient Black-box Prompt Search Via Clustering And Pruning Zhou Han, Wan Xingchen, Vulić Ivan, Korhonen Anna
- Llms Learn Task Heuristics From Demonstrations: A Heuristic-driven Prompting Strategy For Document-level Event Argument Extraction Zhou Hanzhang, Qian Junlang, Feng Zijian, Lu Hui, Zhu Zixiao, Mao Kezhi
- Instruction-following Evaluation For Large Language Models Zhou Jeffrey, Lu Tianjian, Mishra Swaroop, Brahma Siddhartha, Basu Sujoy, Luan Yi, Zhou Denny, Hou Le
- CONA: A Novel Context-aware Instruction Paradigm For Communication Using Large Language Model Zhou Nan, Tao Xinghui, Chen Xi
- Exploring Recommendation Capabilities Of Gpt-4v(ision): A Preliminary Case Study Zhou Peilin, Cao Meng, Huang You-liang, Ye Qichen, Zhang Peiyan, Liu Junling, Xie Yueqi, Hua Yining, Kim Jaeboum
- Recurrentgpt: Interactive Generation Of (arbitrarily) Long Text Zhou Wangchunshu, Jiang Yuchen Eleanor, Cui Peng, Wang Tiannan, Xiao Zhenxin, Hou Yifan, Cotterell Ryan, Sachan Mrinmaya
- Context-faithful Prompting For Large Language Models Zhou Wenxuan, Zhang Sheng, Poon Hoifung, Chen Muhao
- Can GPT Models Follow Human Summarization Guidelines? Evaluating Chatgpt And GPT-4 For Dialogue Summarization Zhou Yongxin, Ringeval Fabien, Portet François
- Thread Of Thought Unraveling Chaotic Contexts Zhou Yucheng, Geng Xiubo, Shen Tao, Tao Chongyang, Long Guodong, Lou Jian-guang, Shen Jianbing
- Revisiting Automated Prompting: Are We Actually Doing Better? Zhou Yulin, Zhao Yiren, Shumailov Ilia, Mullins Robert, Gal Yarin
- The Mystery Of In-context Learning: A Comprehensive Survey On Interpretation And Analysis Zhou Yuxiang, Li Jiazheng, Xiang Yanzheng, Yan Hanqi, Gui Lin, He Yulan
- Quantifying And Analyzing Entity-level Memorization In Large Language Models Zhou Zhenhong, Xiang Jiuyang, Chen Chaomeng, Su Sen
- VL-GPT: A Generative Pre-trained Transformer For Vision And Language Understanding And Generation Zhu Jinguo, Ding Xiaohan, Ge Yixiao, Ge Yuying, Zhao Sijie, Zhao Hengshuang, Wang Xiaohua, Shan Ying
- Promptbench: A Unified Library For Evaluation Of Large Language Models Zhu Kaijie, Zhao Qinlin, Chen Hao, Wang Jindong, Xie Xing
- Autodan: Interpretable Gradient-based Adversarial Attacks On Large Language Models Zhu Sicheng, Zhang Ruiyi, An Bang, Wu Gang, Barrow Joe, Wang Zichao, Huang Furong, Nenkova Ani, Sun Tong
- Overview Of The Promptcblue Shared Task In CHIP2023 Zhu Wei, Wang Xiaoling, Chen Mosha, Tang Buzhou
- Promptcblue: A Chinese Prompt Tuning Benchmark For The Medical Domain Zhu Wei, Wang Xiaoling, Zheng Huanran, Chen Mosha, Tang Buzhou
- Large Language Models Can Learn Rules Zhu Zhaocheng, Xue Yuan, Chen Xinyun, Zhou Denny, Tang Jian, Schuurmans Dale, Dai Hanjun
- Beyond Yes And No: Improving Zero-shot LLM Rankers Via Scoring Fine-grained Relevance Labels Zhuang Honglei, Qin Zhen, Hui Kai, Wu Junru, Yan Le, Wang Xuanhui, Bendersky Michael
- Multimodal Chain-of-thought Reasoning In Language Models Zhuosheng Zhang, Aston Zhang, Mu Li, Hai Zhao, George Karypis, Alex Smola
- Can Large Language Models Transform Computational Social Science? Ziems Caleb, Held William, Shaikh Omar, Chen Jiaao, Zhang Zhehao, Yang Diyi
- Recommender Systems In The Era Of Large Language Models (llms) Zihuai Zhao, Wenqi Fan, Jiatong Li, Yunqing Liu, Xiaowei Mei, Yiqi Wang, Zhen Wen, Fei Wang, Xiangyu Zhao, Jiliang Tang, Qing Li
- Tab-cot: Zero-shot Tabular Chain Of Thought Ziqi Jin, Wei Lu
- Universal And Transferable Adversarial Attacks On Aligned Language Models Zou Andy, Wang Zifan, Carlini Nicholas, Nasr Milad, Kolter J. Zico, Fredrikson Matt
- Generalizable Chain-of-thought Prompting In Mixed-task Scenarios With Large Language Models Zou Anni, Zhang Zhuosheng, Zhao Hai, Tang Xiangru
- Dr Chatgpt, Tell Me What I Want To Hear: How Prompt Knowledge Impacts Health Answer Correctness Zuccon Guido, Koopman Bevan
- Chatgpt Hallucinates When Attributing Answers Zuccon Guido, Koopman Bevan, Shaik Razia
- Concept-aware Training Improves In-context Learning Ability Of Language Models Štefánik Michal, Kadlčík Marek
- A Dataset And Benchmark For Hospital Course Summarization With Adapted Large Language Models Aali Asad, Van Veen Dave, Arefeen Yamin Ishraq, Hom Jason, Bluethgen Christian, Reis Eduardo Pontes, Gatidis Sergios, Clifford Namuun, Daws Joseph, Tehrani Arash S., Kim Jangwon, Chaudhari Akshay S.
- Can Llms Be Fooled? Investigating Vulnerabilities In Llms Abdali Sara, He Jia, Barberan Cj, Anarfi Richard
- Are You Still On Track!? Catching LLM Task Drift With Activations Abdelnabi Sahar, Fay Aideen, Cherubin Giovanni, Salem Ahmed, Fritz Mario, Paverd Andrew
- Phi-3 Technical Report: A Highly Capable Language Model Locally On Your Phone Abdin Marah, Aneja Jyoti, Awadalla Hany, Awadallah Ahmed, Awan Ammar Ahmad, Bach Nguyen, Bahree Amit, Bakhtiari Arash, Bao Jianmin, Behl Harkirat, Benhaim Alon, Bilenko Misha, Bjorck Johan, Bubeck Sébastien, Cai Martin, Cai Qin, Chaudhary Vishrav, Chen Dong, Chen Dongdong, Chen Weizhu, Chen Yen-chun, Chen Yi-ling, Cheng Hao, Chopra Parul, Dai Xiyang, Dixon Matthew, Eldan Ronen, Fragoso Victor, Gao Jianfeng, Gao Mei, Gao Min, Garg Amit, Del Giorno Allie, Goswami Abhishek, Gunasekar Suriya, Haider Emman, Hao Junheng, Hewett Russell J., Hu Wenxiang, Huynh Jamie, Iter Dan, Jacobs Sam Ade, Javaheripi Mojan, Jin Xin, Karampatziakis Nikos, Kauffmann Piero, Khademi Mahoud, Kim Dongwoo, Kim Young Jin, Kurilenko Lev, Lee James R., Lee Yin Tat, Li Yuanzhi, Li Yunsheng, Liang Chen, Liden Lars, Lin Xihui, Lin Zeqi, Liu Ce, Liu Liyuan, Liu Mengchen, Liu Weishung, Liu Xiaodong, Luo Chong, Madan Piyush, Mahmoudzadeh Ali, Majercak David, Mazzola Matt, Mendes Caio César Teodoro, Mitra Arindam, Modi Hardik, Nguyen Anh, Norick Brandon, Patra Barun, Perez-becker Daniel, Portet Thomas, Pryzant Reid, Qin Heyang, Radmilac Marko, Ren Liliang, De Rosa Gustavo, Rosset Corby, Roy Sambudha, Ruwase Olatunji, Saarikivi Olli, Saied Amin, Salim Adil, Santacroce Michael, Shah Shital, Shang Ning, Sharma Hiteshi, Shen Yelong, Shukla Swadheen, Song Xia, Tanaka Masahiro, Tupini Andrea, Vaddamanu Praneetha, Wang Chunyu, Wang Guanhua, Wang Lijuan, Wang Shuohang, Wang Xin, Wang Yu, Ward Rachel, Wen Wen, Witte Philipp, Wu Haiping, Wu Xiaoxia, Wyatt Michael, Xiao Bin, Xu Can, Xu Jiahang, Xu Weijian, Xue Jilong, Yadav Sonali, Yang Fan, Yang Jianwei, Yang Yifan, Yang Ziyi, Yu Donghan, Yuan Lu, Zhang Chenruidong, Zhang Cyril, Zhang Jianwen, Zhang Li Lyna, Zhang Yi, Zhang Yue, Zhang Yunan, Zhou Xiren
- Synthetic Dialogue Dataset Generation Using LLM Agents Abdullin Yelaman, Molla-aliod Diego, Ofoghi Bahadorreza, Yearwood John, Li Qingyang
- Supporting Student Decisions On Learning Recommendations: An Llm-based Chatbot With Knowledge Graph Contextualization For Conversational Explainability And Mentoring Abu-rasheed Hasan, Abdulsalam Mohamad Hussam, Weber Christian, Fathi Madjid
- Knowledge Graphs As Context Sources For Llm-based Explanations Of Learning Recommendations Abu-rasheed Hasan, Weber Christian, Fathi Madjid
- Pathocl: Path-based Prompt Augmentation For OCL Generation With GPT-4 Abukhalaf Seif, Hamdaqa Mohammad, Khomh Foutse
- Comparing LLM Prompting With Cross-lingual Transfer Performance On Indigenous And Low-resource Brazilian Languages Adelani David Ifeoluwa, Doğruöz A. Seza, Coneglian André, Ojha Atul Kr.
- Fine-grained Contract NER Using Instruction Based Model Adibhatla Hiranmai Sri, Baswani Pavan, Shrivastava Manish
- Keyformer: KV Cache Reduction Through Key Tokens Selection For Efficient Generative Inference Adnan Muhammad, Arunkumar Akhil, Jain Gaurav, Nair Prashant J., Soloveychik Ilya, Kamath Purushotham
- Towards Optimizing And Evaluating A Retrieval Augmented QA Chatbot Using Llms With Human In The Loop Afzal Anum, Kowsik Alexander, Fani Rajna, Matthes Florian
- Promptwizard: Task-aware Agent-driven Prompt Optimization Framework Agarwal Eshaan, Dani Vivek, Ganu Tanuja, Nambi Akshay
- Many-shot In-context Learning Agarwal Rishabh, Singh Avi, Zhang Lei M., Bohnet Bernd, Rosias Luis, Chan Stephanie, Zhang Biao, Anand Ankesh, Abbas Zaheer, Nova Azade, Co-reyes John D., Chu Eric, Behbahani Feryal, Faust Aleksandra, Larochelle Hugo
- Ethical Reasoning And Moral Value Alignment Of Llms Depend On The Language We Prompt Them In Agarwal Utkarsh, Tanmay Kumar, Khandelwal Aditi, Choudhury Monojit
- Codemirage: Hallucinations In Code Generated By Large Language Models Agarwal Vibhor, Pei Yulong, Alamir Salwa, Liu Xiaomo
- Look Further Ahead: Testing The Limits Of GPT-4 In Path Planning Aghzal Mohamed, Plaku Erion, Yao Ziyu
- Etalon: Holistic Performance Evaluation Framework For LLM Inference Systems Agrawal Amey, Agarwal Anmol, Kedia Nitin, Mohan Jayashree, Kundu Souvik, Kwatra Nipun, Ramjee Ramachandran, Tumanov Alexey
- Illuminate: A Novel Approach For Depression Detection With Explainable Analysis And Proactive Therapy Using Prompt Engineering Agrawal Aryan
- Can't Remember Details In Long Documents? You Need Some R&R Agrawal Devanshu, Gao Shang, Gajek Martin
- Can Llms Perform Structured Graph Reasoning? Agrawal Palaash, Vasania Shavak, Tan Cheston
- Direct-inverse Prompting: Analyzing Llms' Discriminative Capacity In Self-improving Generation Ahn Jihyun Janice, Kamoi Ryo, Cheng Lu, Zhang Rui, Yin Wenpeng
- Recursive Chain-of-feedback Prevents Performance Degradation From Redundant Prompting Ahn Jinwoo, Shin Kyuseung
- The Curious Case Of Nonverbal Abstract Reasoning With Multi-modal Large Language Models Ahrabian Kian, Sourati Zhivar, Sun Kexuan, Zhang Jiarui, Jiang Yifan, Morstatter Fred, Pujara Jay
- Sphinx: Sample Efficient Multilingual Instruction Fine-tuning Through N-shot Guided Prompting Ahuja Sanchit, Tanmay Kumar, Chauhan Hardik Hansrajbhai, Patra Barun, Aggarwal Kriti, Del Corro Luciano, Mitra Arindam, Dhamecha Tejas Indulal, Awadallah Ahmed, Choudhary Monojit, Chaudhary Vishrav, Sitaram Sunayana
- Examining The Robustness Of LLM Evaluation To The Distributional Assumptions Of Benchmarks Ailem Melissa, Marazopoulou Katerina, Siska Charlotte, Bono James
- Plug And Play With Prompts: A Prompt Tuning Approach For Controlling Text Generation Ajwani Rohan Deepak, Zhu Zining, Rose Jonathan, Rudzicz Frank
- Clustered Retrieved Augmented Generation (CRAG) Akesson Simon, Santos Frances A.
- FACTS About Building Retrieval Augmented Generation-based Chatbots Akkiraju Rama, Xu Anbang, Bora Deepak, Yu Tan, An Lu, Seth Vishal, Shukla Aaditya, Gundecha Pritam, Mehta Hridhay, Jha Ashwin, Raj Prithvi, Balasubramanian Abhinav, Maram Murali, Muthusamy Guru, Annepally Shivakesh Reddy, Knowles Sidney, Du Min, Burnett Nick, Javiya Sean, Marannan Ashok, Kumari Mamta, Jha Surbhi, Dereszenski Ethan, Chakraborty Anupam, Ranjan Subhash, Terfai Amina, Surya Anoop, Mercer Tracey, Thanigachalam Vinodh Kumar, Bar Tamar, Krishnan Sanjana, Kilaru Samy, Jaksic Jasmine, Algarici Nave, Liberman Jacob, Conway Joey, Nayyar Sonu, Boitano Justin
- Have We Reached AGI? Comparing Chatgpt, Claude, And Gemini To Human Literacy And Education Benchmarks Akpan Mfon
- In-context Language Learning: Architectures And Algorithms Akyürek Ekin, Wang Bailin, Kim Yoon, Andreas Jacob
- Should We Fine-tune Or RAG? Evaluating Different Techniques To Adapt Llms For Dialogue Alghisi Simone, Rizzoli Massimo, Roccabruna Gabriel, Mousavi Seyed Mahed, Riccardi Giuseppe
- PROMPT-SAW: Leveraging Relation-aware Graphs For Textual Prompt Compression Ali Muhammad Asif, Li Zhengping, Yang Shu, Cheng Keyuan, Cao Yang, Huang Tianhao, Hu Lijie, Yu Lu, Wang Di
- Toward Conversational Agents With Context And Time Sensitive Long-term Memory Alonso Nick, Figliolia Tomás, Ndirango Anthony, Millidge Beren
- Do Llms Exhibit Human-like Reasoning? Evaluating Theory Of Mind In Llms For Open-ended Responses Amirizaniani Maryam, Martin Elias, Sivachenko Maryna, Mashhadi Afra, Shah Chirag
- Llmauditor: A Framework For Auditing Large Language Models Using Human-in-the-loop Amirizaniani Maryam, Yao Jihan, Lavergne Adrian, Okada Elizabeth Snell, Chadha Aman, Roosta Tanya, Shah Chirag
- Evaluating The Efficacy Of Foundational Models: Advancing Benchmarking Practices To Enhance Fine-tuning Decision-making Amujo Oluyemi Enoch, Yang Shanchieh Jay
- Automatic Pseudo-harmful Prompt Generation For Evaluating False Refusals In Large Language Models An Bang, Zhu Sicheng, Zhang Ruiyi, Panaitescu-liess Michael-andrei, Xu Yuancheng, Huang Furong
- How Susceptible Are Llms To Influence In Prompts? Anagnostidis Sotiris, Bulian Jannis
- Investigating The Robustness Of Llms On Math Word Problems Anantheswaran Ujjwala, Gupta Himanshu, Scaria Kevin, Verma Shreyas, Baral Chitta, Mishra Swaroop
- BAMO At Semeval-2024 Task 9: BRAINTEASER: A Novel Task Defying Common Sense Ansari Baktash, Rostamkhani Mohammadmostafa, Eetemadi Sauleh
- Simpler Becomes Harder: Do Llms Exhibit A Coherent Behavior On Simplified Corpora? Anschütz Miriam, Mosca Edoardo, Groh Georg
- Exploring The Impact Of Large Language Models On Recommender Systems: An Extensive Review Arpita Vats, Vinija Jain, Rahul Raja, Aman Chadha
- Open (clinical) Llms Are Sensitive To Instruction Phrasings Arroyo Alberto Mario Ceballos, Munnangi Monica, Sun Jiuding, Zhang Karen Y. C., Mcinerney Denis Jered, Wallace Byron C., Amir Silvio
- Self-seeding And Multi-intent Self-instructing Llms For Generating Intent-aware Information-seeking Dialogs Askari Arian, Petcu Roxana, Meng Chuan, Aliannejadi Mohammad, Abolghasemi Amin, Kanoulas Evangelos, Verberne Suzan
- Can Open-source Llms Compete With Commercial Models? Exploring The Few-shot Performance Of Current GPT Models In Biomedical Tasks Ateia Samy, Kruschwitz Udo
- Convocache: Smart Re-use Of Chatbot Responses Atkins Conor, Wood Ian, Kaafar Mohamed Ali, Asghar Hassan, Basta Nardine, Kepkowski Michal
- Prompt Design Matters For Computational Social Science Tasks But In Unpredictable Ways Atreja Shubham, Ashkinaze Joshua, Li Lingyao, Mendelsohn Julia, Hemphill Libby
- GRAD-SUM: Leveraging Gradient Summarization For Optimal Prompt Engineering Austin Derek, Chartock Elliott
- Current State Of LLM Risks And AI Guardrails Ayyamperumal Suriya Ganesh, Ge Limin
- Feedback-generation For Programming Exercises With GPT-4 Azaiz Imen, Kiesler Natalie, Strickroth Sven
- Accuracy And Consistency Of Llms In The Registered Dietitian Exam: The Impact Of Prompt Engineering And Knowledge Retrieval Azimi Iman, Qi Mohan, Wang Li, Rahmani Amir M., Li Youlin
- Self-improving Customer Review Response Generation Based On Llms Azov Guy, Pelc Tatiana, Alon Adi Fledel, Kamhi Gila
- Generating Query Recommendations Via Llms Bacciu Andrea, Palumbo Enrico, Damianou Andreas, Tonellotto Nicola, Silvestri Fabrizio
- Hallucination Of Multimodal Large Language Models: A Survey Bai Zechen, Wang Pichao, Xiao Tianjun, He Tong, Han Zongbo, Zhang Zheng, Shou Mike Zheng
- RAG Vs Fine-tuning: Pipelines, Tradeoffs, And A Case Study On Agriculture Balaguer Angels, Benara Vinamra, Cunha Renato Luiz De Freitas, Filho Roberto De M. Estevão, Hendry Todd, Holstein Daniel, Marsman Jennifer, Mecklenburg Nick, Malvar Sara, Nunes Leonardo O., Padilha Rafael, Sharp Morris, Silva Bruno, Sharma Swati, Aski Vijay, Chandra Ranveer
- Artifacts Or Abduction: How Do Llms Answer Multiple-choice Questions Without The Question? Balepur Nishant, Ravichander Abhilasha, Rudinger Rachel
- Enhancing Presentation Slide Generation By Llms With A Multi-staged End-to-end Approach Bandyopadhyay Sambaran, Maheshwari Himanshu, Natarajan Anandhavelu, Saxena Apoorv
- Llms With Chain-of-thought Are Non-causal Reasoners Bao Guangsheng, Zhang Hongbo, Yang Linyi, Wang Cunxiang, Zhang Yue
- TTQA-RS- A Break-down Prompting Approach For Multi-hop Table-text Question Answering With Reasoning And Summarization Bardhan Jayetri, Xiao Bushi, Wang Daisy Zhe
- Slicerchat: Building A Local Chatbot For 3D Slicer Barr Colton
- Exploring Autonomous Agents Through The Lens Of Large Language Models: A Review Barua Saikat
- Understanding Information Storage And Transfer In Multi-modal Large Language Models Basu Samyadeep, Grayson Martin, Morrison Cecily, Nushi Besmira, Feizi Soheil, Massiceti Daniela
- The Unreasonable Effectiveness Of Eccentric Automatic Prompts Battle Rick, Gollapudi Teja
- Generative AI To Generate Test Data Generators Baudry Benoit, Etemadi Khashayar, Fang Sen, Gamage Yogya, Liu Yi, Liu Yuxin, Monperrus Martin, Ron Javier, Silva André, Tiwari Deepika
- Evolutionary Multi-objective Optimization Of Large Language Model Prompts For Balancing Sentiments Baumann Jill, Kramer Oliver
- How Do You Know That? Teaching Generative Language Models To Reference Answers To Biomedical Questions Bašaragin Bojana, Ljajić Adela, Medvecki Darija, Cassano Lorenzo, Košprdić Miloš, Milošević Nikola
- Task Prompt Vectors: Effective Initialization Through Multi-task Soft-prompt Transfer Belanec Robert, Ostermann Simon, Srba Ivan, Bielikova Maria
- Crafting Interpretable Embeddings By Asking Llms Questions Benara Vinamra, Singh Chandan, Morris John X., Antonello Richard, Stoica Ion, Huth Alexander G., Gao Jianfeng
- Prompting Large Language Models With Human Error Markings For Self-correcting Machine Translation Berger Nathaniel, Riezler Stefan, Exel Miriam, Huck Matthias
- Think Big, Generate Quick: Llm-to-slm For Fast Autoregressive Decoding Bergner Benjamin, Skliar Andrii, Royer Amelie, Blankevoort Tijmen, Asano Yuki, Bejnordi Babak Ehteshami
- A Systematic Analysis Of Large Language Models As Soft Reasoners: The Case Of Syllogistic Inferences Bertolazzi Leonardo, Gatt Albert, Bernardi Raffaella
- Enhancing Event Reasoning In Large Language Models Through Instruction Fine-tuning With Semantic Causal Graphs Bethany Mazal, Bethany Emet, Wherry Brandon, Chiang Cho-yu, Vishwamitra Nishant, Rios Anthony, Najafirad Peyman
- Clembench-2024: A Challenging, Dynamic, Complementary, Multilingual Benchmark And Underlying Flexible Framework For Llms As Multi-action Agents Beyer Anne, Chalamalasetti Kranti, Hakimov Sherzod, Madureira Brielen, Sadler Philipp, Schlangen David
- Soft Prompting For Unlearning In Large Language Models Bhaila Karuna, Van Minh-hao, Wu Xintao
- Efficient Reinforcement Learning Via Large Language Model-based Search Bhambri Siddhant, Bhattacharjee Amrita, Liu Huan, Kambhampati Subbarao
- Self-amplify: Improving Small Language Models With Self Post Hoc Explanations Bhan Milan, Vittaut Jean-noel, Chesneau Nicolas, Lesot Marie-jeanne
- On The Robustness Of Language Models For Tabular Question Answering Bhandari Kushal Raj, Xing Sixue, Dan Soham, Gao Jianxi
- Adaptive Token Biaser: Knowledge Editing Via Biasing Key Entities Bi Baolong, Liu Shenghua, Wang Yiwei, Mei Lingrui, Gao Hongcheng, Xu Yilong, Cheng Xueqi
- Iterative Refinement Of Project-level Code Context For Precise Code Generation With Compiler Feedback Bi Zhangqian, Wan Yao, Wang Zheng, Zhang Hongyu, Guan Batu, Lu Fangxin, Zhang Zili, Sui Yulei, Jin Hai, Shi Xuanhua
- Large Language Models Are Vulnerable To Bait-and-switch Attacks For Generating Harmful Content Bianchi Federico, Zou James
- Supervisory Prompt Training Billa Jean Ghislain, Oh Min, Du Liang
- Statically Contextualizing Large Language Models With Typed Holes Blinn Andrew, Li Xiang, Kim June Hyung, Omar Cyrus
- Task-oriented Dialogue With In-context Learning Bocklisch Tom, Werkmeister Thomas, Varshneya Daksh, Nichol Alan
- Exploring And Benchmarking The Planning Capabilities Of Large Language Models Bohnet Bernd, Nova Azade, Parisi Aaron T, Swersky Kevin, Goshvadi Katayoon, Dai Hanjun, Schuurmans Dale, Fiedel Noah, Sedghi Hanie
- Rambla: A Framework For Evaluating The Reliability Of Llms As Assistants In The Biomedical Domain Bolton William James, Poyiadzi Rafael, Morrell Edward R., Bueno Gabriela Van Bergen Gonzalez, Goetz Lea
- Do Llms Dream Of Ontologies? Bombieri Marco, Fiorini Paolo, Ponzetto Simone Paolo, Rospocher Marco
- Measuring Moral Inconsistencies In Large Language Models Bonagiri Vamshi Krishna, Vennam Sreeram, Gaur Manas, Kumaraguru Ponnurangam
- Navigating The Labyrinth: Evaluating And Enhancing Llms' Ability To Reason About Search Problems Borazjanizadeh Nasim, Herzig Roei, Darrell Trevor, Feris Rogerio, Karlinsky Leonid
- Reliable Reasoning Beyond Natural Language Borazjanizadeh Nasim, Piantadosi Steven T.
- Retrieving Examples From Memory For Retrieval Augmented Neural Machine Translation: A Systematic Comparison Bouthors Maxime, Crego Josep, Yvon Francois
- Embardiment: An Embodied AI Agent For Productivity In XR Bovo Riccardo, Abreu Steven, Ahuja Karan, Gonzalez Eric J, Cheng Li-te, Gonzalez-franco Mar
- Embedding Large Language Models Into Extended Reality: Opportunities And Challenges For Inclusion, Engagement, And Privacy Bozkir Efe, Özdel Süleyman, Lau Ka Hei Carrie, Wang Mengdi, Gao Hong, Kasneci Enkelejda
- The Art Of Saying No: Contextual Noncompliance In Language Models Brahman Faeze, Kumar Sachin, Balachandran Vidhisha, Dasigi Pradeep, Pyatkin Valentina, Ravichander Abhilasha, Wiegreffe Sarah, Dziri Nouha, Chandu Khyathi, Hessel Jack, Tsvetkov Yulia, Smith Noah A., Choi Yejin, Hajishirzi Hannaneh
- Conditioning Llms With Emotion In Neural Machine Translation Brazier Charles, Rouas Jean-luc
- Breaking News: Case Studies Of Generative AI's Use In Journalism Brigham Natalie Grace, Gao Chongjiu, Kohno Tadayoshi, Roesner Franziska, Mireshghallah Niloofar
- Generation, Distillation And Evaluation Of Motivational Interviewing-style Reflections With A Foundational Language Model Brown Andrew, Zhu Jiading, Abdelwahab Mohamed, Dong Alec, Wang Cindy, Rose Jonathan
- Fine-tuned 'small' Llms (still) Significantly Outperform Zero-shot Generative AI Models In Text Classification Bucher Martin Juan José, Martini Marco
- Hierarchical Prompting Taxonomy: A Universal Evaluation Framework For Large Language Models Budagam Devichand, Kj Sankalp, Kumar Ashutosh, Jain Vinija, Chadha Aman
- Pangea: Procedural Artificial Narrative Using Generative AI For Turn-based Video Games Buongiorno Steph, Klinkert Lawrence Jake, Chawla Tanishq, Zhuang Zixin, Clark Corey
- Roguegpt: Dis-ethical Tuning Transforms Chatgpt4 Into A Rogue AI In 158 Words Buscemi Alessio, Proverbio Daniele
- Aligning Large Language Models With Counterfactual DPO Butcher Bradley
- In-context Learning And Fine-tuning GPT For Argument Mining Cabessa Jérémie, Hernault Hugo, Mushtaq Umer
- TACT: Advancing Complex Aggregative Reasoning With Information Extraction Tools Caciularu Avi, Jacovi Alon, Ben-david Eyal, Goldshtein Sasha, Schuster Tal, Herzig Jonathan, Elidan Gal, Globerson Amir
- Llms Are Few-shot In-context Low-resource Language Learners Cahyawijaya Samuel, Lovenia Holy, Fung Pascale
- On The Transformations Across Reward Model, Parameter Update, And In-context Prompt Cai Deng, Li Huayang, Fu Tingchen, Li Siheng, Xu Weiwen, Li Shuaiyi, Cao Bowen, Zhang Zhisong, Huang Xinting, Cui Leyang, Wang Yan, Liu Lemao, Watanabe Taro, Shi Shuming
- A Survey On Mixture Of Experts Cai Weilin, Jiang Juyong, Wang Fan, Tang Jing, Kim Sunghun, Huang Jiayi
- Towards Large Language Model Aided Program Refinement Cai Yufan, Hou Zhe, Luan Xiaokun, Baena David Miguel Sanan, Lin Yun, Sun Jun, Dong Jin Song
- Towards Logically Consistent Language Models Via Probabilistic Reasoning Calanzone Diego, Teso Stefano, Vergari Antonio
- Are Large Language Models Really Bias-free? Jailbreak Prompts For Assessing Adversarial Robustness To Bias Elicitation Cantini Riccardo, Cosenza Giada, Orsino Alessio, Talia Domenico
- On The Worst Prompt Performance Of Large Language Models Cao Bowen, Cai Deng, Zhang Zhisong, Zou Yuexian, Lam Wai
- Genrec: Generative Sequential Recommendation With Large Language Models Cao Panfeng, Lio Pietro
- Graphinsight: Unlocking Insights In Large Language Models For Graph Structure Understanding Cao Yukun, Han Shuo, Gao Zengyi, Ding Zezhong, Xie Xike, Zhou S. Kevin
- Chart-based Reasoning: Transferring Capabilities From Llms To Vlms Carbune Victor, Mansoor Hassan, Liu Fangyu, Aralikatte Rahul, Baechler Gilles, Chen Jindong, Sharma Abhanshu
- A Review Of Multi-modal Large Language And Vision Models Carolan Kilian, Fennelly Laura, Smeaton Alan F.
- Suppressing Pink Elephants With Direct Principle Feedback Castricato Louis, Lile Nathan, Anand Suraj, Schoelkopf Hailey, Verma Siddharth, Biderman Stella
- Can Few-shot Work In Long-context? Recycling The Context To Generate Demonstrations Cattan Arie, Jacovi Alon, Fabrikant Alex, Herzig Jonathan, Aharoni Roee, Rashkin Hannah, Marcus Dror, Hassidim Avinatan, Matias Yossi, Szpektor Idan, Caciularu Avi
- Smileyllama: Modifying Large Language Models For Directed Chemical Space Exploration Cavanagh Joseph M., Sun Kunyang, Gritsevskiy Andrew, Bagni Dorian, Bannister Thomas D., Head-gordon Teresa
- RQ-RAG: Learning To Refine Queries For Retrieval Augmented Generation Chan Chi-min, Xu Chunpu, Yuan Ruibin, Luo Hongyin, Xue Wei, Guo Yike, Fu Jie
- Prompto: An Open Source Library For Asynchronous Querying Of LLM Endpoints Chan Ryan Sze-yin, Nanni Federico, Brown Edwin, Chapman Ed, Williams Angus R., Bright Jonathan, Gabasova Evelina
- Scaling Synthetic Data Creation With 1,000,000,000 Personas Chan Xin, Wang Xiaoyang, Yu Dian, Mi Haitao, Yu Dong
- Speechprompt: Prompting Speech Language Models For Speech Processing Tasks Chang Kai-wei, Wu Haibin, Wang Yu-kai, Wu Yuan-kuei, Shen Hua, Tseng Wei-cheng, Kang Iu-thing, Li Shang-wen, Lee Hung-yi
- Efficient Prompting Methods For Large Language Models: A Survey Chang Kaiyan, Xu Songcheng, Wang Chenglong, Luo Yingfeng, Xiao Tong, Zhu Jingbo
- When Parts Are Greater Than Sums: Individual LLM Components Can Outperform Full Models Chang Ting-yun, Thomason Jesse, Jia Robin
- Injecting Salesperson's Dialogue Strategies In Large Language Models With Chain-of-thought Reasoning Chang Wen-yu, Chen Yun-nung
- Xprompt: Explaining Large Language Model's Generation Via Joint Prompt Attribution Chang Yurui, Cao Bochuan, Wang Yujia, Chen Jinghui, Lin Lu
- Notellm: A Retrievable Large Language Model For Note Recommendation Chao Zhang, Shiwei Wu, Haoxin Zhang, Tong Xu, Yan Gao, Yao Hu, Di Wu, Enhong Chen
- Language Models Can Exploit Cross-task In-context Learning For Data-scarce Novel Tasks Chatterjee Anwoy, Tanwar Eshaan, Dutta Subhabrata, Chakraborty Tanmoy
- Quantitative Certification Of Bias In Large Language Models Chaudhary Isha, Hu Qian, Kumar Manoj, Ziyadi Morteza, Gupta Rahul, Singh Gagandeep
- The Dark Side Of Human Feedback: Poisoning Large Language Models Via User Inputs Chen Bocheng, Guo Hanqing, Wang Guangjing, Wang Yuanda, Yan Qiben
- Livemind: Low-latency Large Language Models With Simultaneous Inference Chen Chuangtao, Zhang Grace Li, Yin Xunzhao, Zhuo Cheng, Schlichtmann Ulf, Li Bing
- Self-cognition In Large Language Models: An Exploratory Study Chen Dongping, Shi Jiawen, Wan Yao, Zhou Pan, Gong Neil Zhenqiang, Sun Lichao
- Hardware-aware Parallel Prompt Decoding For Memory-efficient Acceleration Of LLM Inference Chen Hao Mark, Luk Wayne, Yiu Ka Fai Cedric, Li Rui, Mishchenko Konstantin, Venieris Stylianos I., Fan Hongxiang
- Automated Data Curation For Robust Language Model Fine-tuning Chen Jiuhai, Mueller Jonas
- Genqa: Generating Millions Of Instructions From A Handful Of Prompts Chen Jiuhai, Qadri Rifaa, Wen Yuxin, Jain Neel, Kirchenbauer John, Zhou Tianyi, Goldstein Tom
- Elcorec: Enhance Language Understanding With Co-propagation Of Numerical And Categorical Features For Recommendation Chen Jizheng, Du Kounianhua, Lin Jianghao, Chen Bo, Tang Ruiming, Zhang Weinan
- Putting People In Llms' Shoes: Generating Better Answers Via Question Rewriter Chen Junhao, Wang Bowen, Jiang Zhouqiang, Nakashima Yuta
- Nlperturbator: Studying The Robustness Of Code Llms To Natural Language Variations Chen Junkai, Li Zhenhao, Hu Xing, Xia Xin
- Pixart-Σ: Weak-to-strong Training Of Diffusion Transformer For 4K Text-to-image Generation Chen Junsong, Ge Chongjian, Xie Enze, Wu Yue, Yao Lewei, Ren Xiaozhe, Wang Zhongdao, Luo Ping, Lu Huchuan, Li Zhenguo
- Cost-effective Instruction Learning For Pathology Vision And Language Analysis Chen Kaitao, Liu Mianxin, Yan Fang, Ma Lei, Shi Xiaoming, Wang Lilong, Wang Xiaosong, Zhu Lifeng, Wang Zhe, Zhou Mu, Zhang Shaoting
- Diahalu: A Dialogue-level Hallucination Evaluation Benchmark For Large Language Models Chen Kedi, Chen Qin, Zhou Jie, He Yishen, He Liang
- Machine Unlearning In Large Language Models Chen Kongyang, Wang Zixin, Mi Bing, Liu Waixi, Wang Shaowei, Ren Xiaojun, Shen Jiaxing
- OMPGPT: A Generative Pre-trained Transformer Model For Openmp Chen Le, Bhattacharjee Arijit, Ahmed Nesreen, Hasabnis Niranjan, Oren Gal, Vo Vy, Jannesari Ali
- Identifying Query-relevant Neurons In Large Language Models For Long-form Texts Chen Lihu, Dejl Adam, Toni Francesca
- Lifelong Knowledge Editing For Llms With Retrieval-augmented Continuous Prompt Learning Chen Qizhou, Zhang Taolin, He Xiaofeng, Li Dongyang, Wang Chengyu, Huang Longtao, Xue Hui
- Attribution Analysis Meets Model Editing: Advancing Knowledge Correction In Vision Language Models With Visedit Chen Qizhou, Zhang Taolin, Wang Chengyu, He Xiaofeng, Wang Dakan, Liu Tingting
- Struq: Defending Against Prompt Injection With Structured Queries Chen Sizhe, Piet Julien, Sitawarin Chawin, Wagner David
- From Yes-men To Truth-tellers: Addressing Sycophancy In Large Language Models With Pinpoint Tuning Chen Wei, Huang Zhen, Xie Liang, Lin Binbin, Li Houqiang, Lu Le, Tian Xinmei, Cai Deng, Zhang Yonggang, Wan Wenxiao, Shen Xu, Ye Jieping
- GRATH: Gradual Self-truthifying For Large Language Models Chen Weixin, Song Dawn, Li Bo
- Reprompt: Planning By Automatic Prompt Engineering For Large Language Models Agents Chen Weizhe, Koenig Sven, Dilkina Bistra
- Icleval: Evaluating In-context Learning Ability Of Large Language Models Chen Wentong, Lin Yankai, Zhou Zhenhao, Huang Hongyun, Jia Yantao, Cao Zhao, Wen Ji-rong
- Distilling Reasoning Ability From Large Language Models With Adaptive Thinking Chen Xiaoshu, Zhou Sihang, Liang Ke, Liu Xinwang
- Cause-aware Empathetic Response Generation Via Chain-of-thought Fine-tuning Chen Xinhao, Yang Chong, Lan Man, Cai Li, Chen Yang, Hu Tu, Zhuang Xinlin, Zhou Aimin
- Premise Order Matters In Reasoning With Large Language Models Chen Xinyun, Chi Ryan A., Wang Xuezhi, Zhou Denny
- RL-JACK: Reinforcement Learning-powered Black-box Jailbreaking Attack Against Llms Chen Xuan, Nie Yuzhou, Yan Lu, Mao Yunshu, Guo Wenbo, Zhang Xiangyu
- Can We Rely On LLM Agents To Draft Long-horizon Plans? Let's Take Travelplanner As An Example Chen Yanan, Pesaranghader Ali, Sadhu Tanmana, Yi Dong Hoon
- Extroversion Or Introversion? Controlling The Personality Of Your Large Language Models Chen Yanquan, Wu Zhen, Guo Junjie, Huang Shujian, Dai Xinyu
- Simulating Field Experiments With Large Language Models Chen Yaoyu, Hu Yuheng, Lu Yingda
- Robust And Scalable Model Editing For Large Language Models Chen Yingfa, Zhang Zhengyan, Han Xu, Xiao Chaojun, Liu Zhiyuan, Chen Chen, Li Kuai, Yang Tao, Sun Maosong
- See What Llms Cannot Answer: A Self-challenge Framework For Uncovering LLM Weaknesses Chen Yulong, Liu Yang, Yan Jianhao, Bai Xuefeng, Zhong Ming, Yang Yinghao, Yang Ziyi, Zhu Chenguang, Zhang Yue
- MAPO: Boosting Large Language Model Performance With Model-adaptive Prompt Optimization Chen Yuyan, Wen Zhihao, Fan Ge, Chen Zhengyu, Wu Wei, Liu Dayiheng, Li Zhixu, Liu Bang, Xiao Yanghua
- Towards Boosting Llms-driven Relevance Modeling With Progressive Retrieved Behavior-augmented Prompting Chen Zeyuan, Wu Haiyan, Wu Kaixin, Chen Wei, Zhong Mingjie, Xu Jia, Liu Zhongyi, Zhang Wei
- Dia-llama: Towards Large Language Model-driven CT Report Generation Chen Zhixuan, Luo Luyang, Bie Yequan, Chen Hao
- Empowering Large Language Models On Robotic Manipulation With Affordance Prompting Cheng Guangran, Zhang Chuheng, Cai Wenzhe, Zhao Li, Sun Changyin, Bian Jiang
- Structure Guided Prompt: Instructing Large Language Model In Multi-step Reasoning By Exploring Graph Structure Of The Text Cheng Kewei, Ahmed Nesreen K., Willke Theodore, Sun Yizhou
- Potential And Limitations Of Llms In Capturing Structured Semantics: A Case Study On SRL Cheng Ning, Yan Zhaohui, Wang Ziming, Li Zhijie, Yu Jiaming, Zheng Zilong, Tu Kewei, Xu Jinan, Han Wenjuan
- BISCUIT: Scaffolding Llm-generated Code With Ephemeral Uis In Computational Notebooks Cheng Ruijia, Barik Titus, Leung Alan, Hohman Fred, Nichols Jeffrey
- Multi-prompting Decoder Helps Better Language Understanding Cheng Zifeng, Chen Zhaoling, Jiang Zhiwei, Yin Yafeng, Ge Shiping, Liu Yuliang, Gu Qing
- Talking Nonsense: Probing Large Language Models' Understanding Of Adversarial Gibberish Inputs Cherepanova Valeriia, Zou James
- A Reality Check Of The Benefits Of LLM In Business Cheung Ming
- Do Language Models Enjoy Their Own Stories? Prompting Large Language Models For Automatic Story Evaluation Chhun Cyril, Suchanek Fabian M., Clavel Chloé
- Human-centered Llm-agent User Interface: A Position Paper Chin Daniel, Wang Yuxuan, Xia Gus
- Zero-shot Cross-lingual Transfer In Instruction Tuning Of Large Language Models Chirkova Nadezhda, Nikoulina Vassilina
- Retrieval-augmented Generation In Multilingual Settings Chirkova Nadezhda, Rau David, Déjean Hervé, Formal Thibault, Clinchant Stéphane, Nikoulina Vassilina
- An Empirical Study Of In-context Learning In Llms For Machine Translation Chitale Pranjal A., Gala Jay, Dabre Raj
- A Computational Framework For Behavioral Assessment Of LLM Therapists Chiu Yu Ying, Sharma Ashish, Lin Inna Wanyin, Althoff Tim
- Language-image Models With 3D Understanding Cho Jang Hyun, Ivanovic Boris, Cao Yulong, Schmerling Edward, Wang Yue, Weng Xinshuo, Li Boyi, You Yurong, Krähenbühl Philipp, Wang Yan, Pavone Marco
- Kv-runahead: Scalable Causal LLM Inference By Parallel Key-value Cache Generation Cho Minsik, Rastegari Mohammad, Naik Devang
- Larger Language Models Don't Care How You Think: Why Chain-of-thought Prompting Fails In Subjective Tasks Chochlakis Georgios, Pandiyan Niyantha Maruthu, Lerman Kristina, Narayanan Shrikanth
- Can Only Llms Do Reasoning?: Potential Of Small Language Models In Task Planning Choi Gawon, Ahn Hyemin
- Picle: Eliciting Diverse Behaviors From Large Language Models With Persona In-context Learning Choi Hyeong Kyu, Li Yixuan
- Lota-bench: Benchmarking Language-oriented Task Planners For Embodied Agents Choi Jae-woo, Yoon Youngwoo, Ong Hyobin, Kim Jaehong, Jang Minsu
- Gemmar: Enhancing Llms Through Arabic Instruction-tuning Chouikhi Hasna, Aloui Manel, Hammou Cyrine Ben, Chaabane Ghaith, Kchaou Haithem, Dhaouadi Chehir
- Med42-v2: A Suite Of Clinical Llms Christophe Clément, Kanithi Praveen K, Raha Tathagata, Khan Shadab, Pimentel Marco AF
- A Better LLM Evaluator For Text Generation: The Impact Of Prompt Output Sequencing And Optimization Chu Kuanchao, Chen Yi-pei, Nakayama Hideki
- Improve Temporal Awareness Of Llms For Sequential Recommendation Chu Zhendong, Wang Zichao, Zhang Ruiyi, Ji Yangfeng, Wang Hongning, Sun Tong
- A Causal Explainable Guardrails For Large Language Models Chu Zhixuan, Wang Yan, Li Longfei, Wang Zhibo, Qin Zhan, Ren Kui
- Learning To Compress Prompt In Natural Language Formats Chuang Yu-neng, Xing Tianwei, Chang Chia-yuan, Liu Zirui, Chen Xun, Hu Xia
- Cogbench: A Large Language Model Walks Into A Psychology Lab Coda-forno Julian, Binz Marcel, Wang Jane X., Schulz Eric
- A Chain-of-thought Prompting Approach With Llms For Evaluating Students' Formative Assessment Responses In Science Cohn Clayton, Hutchins Nicole, Le Tuan, Biswas Gautam
- Towards A Human-in-the-loop LLM Approach To Collaborative Discourse Analysis Cohn Clayton, Snyder Caitlin, Montenegro Justin, Biswas Gautam
- Ragsys: Item-cold-start Recommender As RAG System Contal Emile, Mcgoldrick Garrin
- Rocode: A Dataset For Measuring Code Intelligence From Problem Definitions In Romanian Cosma Adrian, Iordache Bogdan, Rosso Paolo
- Gamebench: Evaluating Strategic Reasoning Abilities Of LLM Agents Costarelli Anthony, Allen Mat, Hauksson Roman, Sodunke Grace, Hariharan Suhas, Cheng Carlson, Li Wenjie, Clymer Joshua, Yadav Arjun
- Out-of-context Prompting Boosts Fairness And Robustness In Large Language Model Predictions Cotta Leonardo, Maddison Chris J.
- Shortcomings Of Llms For Low-resource Translation: Retrieval And Understanding Are Both The Problem Court Sara, Elsner Micha
- Induction Heads As An Essential Mechanism For Pattern Matching In In-context Learning Crosbie J., Shutova E.
- The Power Of Noise: Redefining Retrieval For RAG Systems Cuconasu Florin, Trappolini Giovanni, Siciliano Federico, Filice Simone, Campagnano Cesare, Maarek Yoelle, Tonellotto Nicola, Silvestri Fabrizio
- Or-bench: An Over-refusal Benchmark For Large Language Models Cui Justin, Chiang Wei-lin, Stoica Ion, Hsieh Cho-jui
- Efficiently Exploring Large Language Models For Document-level Machine Translation With In-context Learning Cui Menglong, Du Jiangcun, Zhu Shaolin, Xiong Deyi
- Phaseevo: Towards Unified In-context Prompt Optimization For Large Language Models Cui Wendi, Zhang Jiaxin, Li Zhuohang, Sun Hao, Lopez Damien, Das Kamalika, Malin Bradley, Kumar Sricharan
- Insights From Benchmarking Frontier Language Models On Web App Code Generation Cui Yi
- Webapp1k: A Practical Code-generation Benchmark For Web App Development Cui Yi
- Muap: Multi-step Adaptive Prompt Learning For Vision-language Model With Missing Modality Dai Ruiting, Tan Yuqiao, Mo Lisi, He Tao, Qin Ke, Liang Shuang
- ACT-MNMT Auto-constriction Turning For Multilingual Neural Machine Translation Dai Shaojie, Liu Xin, Luo Ping, Yu Yue
- Large Language Models Can Better Understand Knowledge Graphs Than We Thought Dai Xinbang, Hua Yuncheng, Wu Tongtong, Sheng Yang, Ji Qiu, Qi Guilin
- A Complete Survey On Llm-based AI Chatbots Dam Sumit Kumar, Hong Choong Seon, Qiao Yu, Zhang Chaoning
- Impact Of Non-standard Unicode Characters On Security And Comprehension In Large Language Models Daniel Johan S, Pal Anand
- Under The Surface: Tracking The Artifactuality Of Llm-generated Data Das Debarati, De Langis Karin, Martin-boyle Anna, Kim Jaehyung, Lee Minhwa, Kim Zae Myung, Hayati Shirley Anugrah, Owan Risako, Hu Bin, Parkar Ritik, Koo Ryan, Park Jonginn, Tyagi Aahan, Ferland Libby, Roy Sanjali, Liu Vincent, Kang Dongyeop
- Speechverse: A Large-scale Generalizable Audio Language Model Das Nilaksh, Dingliwal Saket, Ronanki Srikanth, Paturi Rohit, Huang Zhaocheng, Mathur Prashant, Yuan Jie, Bekal Dhanush, Niu Xing, Jayanthi Sai Muralidhar, Li Xilai, Mundnich Karel, Sunkara Monica, Srinivasan Sundararajan, Han Kyu J, Kirchhoff Katrin
- Human-interpretable Adversarial Prompt Attack On Large Language Models With Situational Context Das Nilanjana, Raff Edward, Gaur Manas
- Active Preference Optimization For Sample Efficient RLHF Das Nirjhar, Chakraborty Souradip, Pacchiano Aldo, Chowdhury Sayak Ray
- Entropy Guided Extrapolative Decoding To Improve Factuality In Large Language Models Das Souvik, Jin Lifeng, Song Linfeng, Mi Haitao, Peng Baolin, Yu Dong
- Synthetic Patient-physician Dialogue Generation From Clinical Notes Using LLM Das Trisha, Albassam Dina, Sun Jimeng
- Investigating Symbolic Capabilities Of Large Language Models Dave Neisarg, Kifer Daniel, Giles C. Lee, Mali Ankur
- Prompting Open-source And Commercial Language Models For Grammatical Error Correction Of English Learner Text Davis Christopher, Caines Andrew, Andersen Øistein, Taslimipoor Shiva, Yannakoudakis Helen, Yuan Zheng, Bryant Christopher, Rei Marek, Buttery Paula
- Llms Are Not Intelligent Thinkers: Introducing Mathematical Topic Tree Benchmark For Comprehensive Evaluation Of Llms Davoodi Arash Gholami, Davoudi Seyed Pouyan Mousavi, Pezeshkpour Pouya
- Thematic Analysis With Large Language Models: Does It Work With Languages Other Than English? A Targeted Test In Italian De Paoli Stefano
- Will GPT-4 Run DOOM? De Wynter Adrian
- RTP-LX: Can Llms Evaluate Toxicity In Multilingual Scenarios? De Wynter Adrian, Watts Ishaan, Altıntoprak Nektar Ege, Wongsangaroonsri Tua, Zhang Minghui, Farra Noura, Baur Lena, Claudet Samantha, Gajdusek Pavel, Gören Can, Gu Qilong, Kaminska Anna, Kaminski Tomasz, Kuo Ruby, Kyuba Akiko, Lee Jongho, Mathur Kartik, Merok Petter, Milovanović Ivana, Paananen Nani, Paananen Vesa-matti, Pavlenko Anna, Vidal Bruno Pereira, Strika Luciano, Tsao Yueh, Turcato Davide, Vakhno Oleksandr, Velcsov Judit, Vickers Anna, Visser Stéphanie, Widarmanto Herdyan, Zaikin Andrey, Chen Si-qing
- Evaluating Large Language Models For Automatic Analysis Of Teacher Simulations De-fitero-dominguez David, Albaladejo-gonzález Mariano, Garcia-cabot Antonio, Garcia-lopez Eva, Moreno-cediel Antonio, Barno Erin, Reich Justin
- Understanding Biases In Chatgpt-based Recommender Systems: Provider Fairness, Temporal Stability, And Recency Deldjoo Yashar
- Pandora: Jailbreak Gpts By Retrieval Augmented Generation Poisoning Deng Gelei, Liu Yi, Wang Kailong, Li Yuekang, Zhang Tianwei, Liu Yang
- Can LLM Be A Good Path Planner Based On Prompt Engineering? Mitigating The Hallucination For Path Planning Deng Hourui, Zhang Hongjie, Ou Jie, Feng Chaosheng
- Tables As Texts Or Images: Evaluating The Table Reasoning Ability Of Llms And Mllms Deng Naihao, Sun Zhenjie, He Ruiqi, Sikka Aman, Chen Yulong, Ma Lin, Zhang Yue, Mihalcea Rada
- Language In Vivo Vs. In Silico: Size Matters But Larger Language Models Still Do Not Comprehend Language On A Par With Humans Dentella Vittoria, Guenther Fritz, Leivada Evelina
- Promises, Outlooks And Challenges Of Diffusion Language Modeling Deschenaux Justin, Gulcehre Caglar
- Chain-of-translation Prompting (cotr): A Novel Prompting Technique For Low Resource Languages Deshpande Tejas, Kowtal Nidhi, Joshi Raviraj
- Exploring Prompt Engineering Practices In The Enterprise Desmond Michael, Brachman Michelle
- Are You Sure? Rank Them Again: Repeated Ranking For Better Preference Datasets Devine Peter
- Tagengo: A Multilingual Chat Dataset Devine Peter
- Facts-and-feelings: Capturing Both Objectivity And Subjectivity In Table-to-text Generation Dey Tathagata, Bhattacharyya Pushpak
- Shaping Human-ai Collaboration: Varied Scaffolding Levels In Co-writing With Language Models Dhillon Paramveer S., Molaei Somayeh, Li Jiaqi, Golub Maximilian, Zheng Shaochun, Robert Lionel P.
- Ranking Large Language Models Without Ground Truth Dhurandhar Amit, Nair Rahul, Singh Moninder, Daly Elizabeth, Ramamurthy Karthikeyan Natesan
- Enhancing Recommendation Diversity By Re-ranking With Large Language Models Carraro Diego, Bridge Derek
- Learning How To Ask: Cycle-consistency Refines Prompts In Multimodal Foundation Models Diesendruck Maurice, Lin Jianzhe, Imani Shima, Mahalingam Gayathri, Xu Mingyang, Zhao Jie
- Entgpt: Linking Generative Large Language Models With Knowledge Bases Ding Yifan, Poudel Amrit, Zeng Qingkai, Weninger Tim, Veeramani Balaji, Bhattacharya Sanmitra
- Large Language Models Prompting With Episodic Memory Do Dai, Tran Quan, Venkatesh Svetha, Le Hung
- Automatic Prompt Selection For Large Language Models Do Viet-tung, Hoang Van-khanh, Nguyen Duy-hung, Sabahi Shahab, Yang Jeff, Hotta Hajime, Nguyen Minh-tien, Le Hung
- Evaluating Linguistic Capabilities Of Multimodal Llms In The Lens Of Few-shot Learning Dogan Mustafa, Kesen Ilker, Calixto Iacer, Erdem Aykut, Erdem Erkut
- Modality-aware Integration With Large Language Models For Knowledge-based Visual Question Answering Dong Junnan, Zhang Qinggang, Zhou Huachi, Zha Daochen, Zheng Pai, Huang Xiao
- Jailbreaking Text-to-image Models With Llm-based Agents Dong Yingkai, Li Zheng, Meng Xiangtao, Yu Ning, Guo Shanqing
- Prompt Your Brain: Scaffold Prompt Tuning For Efficient Adaptation Of Fmri Pre-trained Model Dong Zijian, Wu Yilei, Chen Zijiao, Zhang Yichi, Jin Yueming, Zhou Juan Helen
- How Reliable Are Automatic Evaluation Methods For Instruction-tuned Llms? Doostmohammadi Ehsan, Holmström Oskar, Kuhlmann Marco
- To What Extent Is Chatgpt Useful For Language Teacher Lesson Plan Creation? Dornburg Alex, Davin Kristin
- Enhancing Robustness Of Llm-synthetic Text Detectors For Academic Writing: A Comprehensive Analysis Dou Zhicheng, Guo Yuchen, Chang Ching-chun, Nguyen Huy H., Echizen Isao
- Towards Multimodal In-context Learning For Vision & Language Models Doveh Sivan, Perek Shaked, Mirza M. Jehanzeb, Lin Wei, Alfassy Amit, Arbelle Assaf, Ullman Shimon, Karlinsky Leonid
- Internal And External Knowledge Interactive Refinement Framework For Knowledge-intensive Question Answering Du Haowei, Zhao Dongyan
- In-context Learning With Reinforcement Learning For Incomplete Utterance Rewriting Du Haowei, Zhao Dongyan
- Efficient Personalized Text-to-image Generation By Leveraging Textual Subspace Du Shian, Cheng Xiaotian, Qian Qi, Wei Henglu, Xu Yi, Ji Xiangyang
- CLEFT: Language-image Contrastive Learning With Efficient Large Language Model And Prompt Fine-tuning Du Yuexi, Chang Brian, Dvornek Nicha C.
- Prompting Multi-modal Tokens To Enhance End-to-end Autonomous Driving Imitation Learning With Llms Duan Yiqun, Zhang Qiang, Xu Renjing
- Leveraging Machine-generated Rationales To Facilitate Social Meaning Detection In Conversations Dutt Ritam, Wu Zhen, Shi Kelly, Sheth Divyanshu, Gupta Prakhar, Rose Carolyn Penstein
- How To Think Step-by-step: A Mechanistic Understanding Of Chain-of-thought Reasoning Dutta Subhabrata, Singh Joykirat, Chakrabarti Soumen, Chakraborty Tanmoy
- Applying RLAIF For Code Generation With Api-usage In Lightweight Llms Dutta Sujan, Mahinder Sayantan, Anantha Raviteja, Bandyopadhyay Bortik
- Lusifer: Llm-based User Simulated Feedback Environment For Online Recommender Systems Ebrat Danial, Rueda Luis
- Cognitive Bias In High-stakes Decision-making With Llms Echterhoff Jessica, Liu Yao, Alessa Abeer, Mcauley Julian, He Zexue
- Relation Extraction With Fine-tuned Large Language Models In Retrieval Augmented Generation Frameworks Efeoglu Sefika, Paschke Adrian
- Reducing Selection Bias In Large Language Models Eicher J. E., Irgolič R. F.
- Conversational Prompt Engineering Ein-dor Liat, Toledo-ronen Orith, Spector Artem, Gretz Shai, Dankin Lena, Halfon Alon, Katz Yoav, Slonim Noam
- Arabic Automatic Story Generation With Large Language Models El-shangiti Ahmed Oumar, Alwajih Fakhraddin, Abdul-mageed Muhammad
- Creating Arabic LLM Prompts At Scale El-sheikh Abdelrahman, Elmogtaba Ahmed, Darwish Kareem, Elmallah Muhammad, Elneima Ashraf, Sawaf Hassan
- Lightweight Large Language Model For Medication Enquiry: Med-pal Elangovan Kabilan, Ong Jasmine Chiat Ling, Jin Liyuan, Seng Benjamin Jun Jie, Kwan Yu Heng, Tan Lit Soo, Zhong Ryan Jian, Ma Justina Koi Li, Ke Yuhe, Liu Nan, Giacomini Kathleen M, Ting Daniel Shu Wei
- Persuasiveness Of Generated Free-text Rationales In Subjective Decisions: A Case Study On Pairwise Argument Ranking Elaraby Mohamed, Litman Diane, Li Xiang Lorraine, Magooda Ahmed
- Enabling Natural Zero-shot Prompting On Encoder Models Via Statement-tuning Elshabrawy Ahmed, Huang Yongxin, Gurevych Iryna, Aji Alham Fikri
- Tinyagent: Function Calling At The Edge Erdogan Lutfi Eren, Lee Nicholas, Jha Siddharth, Kim Sehoon, Tabrizi Ryan, Moon Suhong, Hooper Coleman, Anumanchipalli Gopala, Keutzer Kurt, Gholami Amir
- Combinatorial Reasoning: Selecting Reasons In Generative AI Pipelines Via Combinatorial Optimization Esencan Mert, Kumar Tarun Advaith, Asanjan Ata Akbari, Lott P. Aaron, Mohseni Masoud, Unlu Can, Venturelli Davide, Ho Alan
- Low-cost Language Models: Survey And Performance Evaluation On Python Code Generation Espejel Jessica López, Alassan Mahaman Sanoussi Yahaya, Bouhandi Merieme, Dahhane Walid, Ettifouri El Hassane
- Coda: Constrained Generation Based Data Augmentation For Low-resource NLP Evuru Chandra Kiran Reddy, Ghosh Sreyan, Kumar Sonal, S Ramaneswaran, Tyagi Utkarsh, Manocha Dinesh
- Nphardeval4v: A Dynamic Reasoning Benchmark Of Multimodal Large Language Models Fan Lizhou, Hua Wenyue, Li Xiang, Zhu Kaijie, Jin Mingyu, Li Lingyao, Ling Haoyang, Chi Jinkui, Wang Jindong, Ma Xin, Zhang Yongfeng
- Navigation Instruction Generation With BEV Perception And Large Language Models Fan Sheng, Liu Rui, Wang Wenguan, Yang Yi
- Prompt Optimizer Of Text-to-image Diffusion Models For Abstract Concept Understanding Fan Zezhong, Li Xiaohan, Fang Chenhao, Biswas Topojoy, Nag Kaushiki, Xu Jianpeng, Achan Kannan
- DARA: Decomposition-alignment-reasoning Autonomous Language Agent For Question Answering Over Knowledge Graphs Fang Haishuo, Zhu Xiaodan, Gurevych Iryna
- Porting Large Language Models To Mobile Devices For Question Answering Fassold Hannes
- Multi-modal Hallucination Control By Visual Information Grounding Favero Alessandro, Zancato Luca, Trager Matthew, Choudhary Siddharth, Perera Pramuditha, Achille Alessandro, Swaminathan Ashwin, Soatto Stefano
- Evaluating Human Alignment And Model Faithfulness Of LLM Rationale Fayyaz Mohsen, Yin Fan, Sun Jiao, Peng Nanyun
- Boosting Zero-shot Crosslingual Performance Using Llm-based Augmentations With Effective Data Selection Fazili Barah, Agrawal Ashish Sunil, Jyothi Preethi
- Gensco: Can Question Decomposition Based Passage Alignment Improve Question Answering? Fazili Barah, Goswami Koustava, Modani Natwar, Nair Inderjeet
- Prompt Exploration With Prompt Regression Feffer Michael, Xu Ronald, Sun Yuekai, Yurochkin Mikhail
- Ragged Edges: The Double-edged Sword Of Retrieval-augmented Chatbots Feldman Philip, Foulds James R., Pan Shimei
- Exploring Automated Distractor Generation For Math Multiple-choice Questions Via Large Language Models Feng Wanyong, Lee Jaewook, Mcnichols Hunter, Scarlatos Alexander, Smith Digory, Woodhead Simon, Ornelas Nancy Otero, Lan Andrew
- Unveiling And Manipulating Prompt Influence In Large Language Models Feng Zijian, Zhou Hanzhang, Zhu Zixiao, Qian Junlang, Mao Kezhi
- Syllabusqa: A Course Logistics Question Answering Dataset Fernandez Nigel, Scarlatos Alexander, Lan Andrew
- Generative AI And Large Language Models For Cyber Security: All Insights You Need Ferrag Mohamed Amine, Alwahedi Fatima, Battah Ammar, Cherif Bilel, Mechri Abdechakour, Tihanyi Norbert
- Toward Large Language Models As A Therapeutic Tool: Comparing Prompting Techniques To Improve Gpt-delivered Problem-solving Therapy Filienko Daniil, Wang Yinzhou, Jazmi Caroline El, Xie Serena, Cohen Trevor, De Cock Martine, Yuwen Weichao
- Geckopt: LLM System Efficiency Via Intent-based Tool Selection Fore Michael, Singh Simranjit, Stamoulis Dimitrios
- CLIP With Generative Latent Replay: A Strong Baseline For Incremental Learning Frascaroli Emanuele, Panariello Aniello, Buzzega Pietro, Bonicelli Lorenzo, Porrello Angelo, Calderara Simone
- An Application Of Large Language Models To Coding Negotiation Transcripts Friedman Ray, Cho Jaewoo, Brett Jeanne, Zhan Xuhui, Han Ningyu, Kannan Sriram, Ma Yingxiang, Spencer-smith Jesse, Jäckel Elisabeth, Zerres Alfred, Hooper Madison, Babbit Katie, Acharya Manish, Adair Wendi, Aslani Soroush, Aykaç Tayfun, Bauman Chris, Bennett Rebecca, Brady Garrett, Briggs Peggy, Dowie Cheryl, Eck Chase, Geiger Igmar, Jacob Frank, Kern Molly, Lee Sujin, Liu Leigh Anne, Liu Wu, Loewenstein Jeffrey, Lytle Anne, Ma Li, Mann Michel, Mislin Alexandra, Mitchell Tyree, Nagler Hannah Martensen Née, Nandkeolyar Amit, Olekalns Mara, Paliakova Elena, Parlamis Jennifer, Pierce Jason, Pierce Nancy, Pinkley Robin, Prime Nathalie, Ramirez-marin Jimena, Rockmann Kevin, Ross William, Semnani-azad Zhaleh, Schroeder Juliana, Smith Philip, Stimmer Elena, Swaab Roderick, Thompson Leigh, Tinsley Cathy, Tuncel Ece, Weingart Laurie, Wilken Robert, Yao Jingjing, Zhang Zhi-xue
- LLM Agents In Interaction: Measuring Personality Consistency And Linguistic Alignment In Interacting Populations Of Large Language Models Frisch Ivar, Giulianelli Mario
- Preact: Predicting Future In React Enhances Agent's Planning Ability Fu Dayuan, Huang Jianzhao, Lu Siyuan, Dong Guanting, Wang Yejie, He Keqing, Xu Weiran
- Autorag-hp: Automatic Online Hyper-parameter Tuning For Retrieval-augmented Generation Fu Jia, Qin Xiaoting, Yang Fangkai, Wang Lu, Zhang Jue, Lin Qingwei, Chen Yubo, Zhang Dongmei, Rajmohan Saravan, Zhang Qi
- Hint-before-solving Prompting: Guiding Llms To Effectively Utilize Encoded Knowledge Fu Jinlan, Huangfu Shenzhen, Yan Hang, Ng See-kiong, Qiu Xipeng
- Lazyllm: Dynamic Token Pruning For Efficient Long Context LLM Inference Fu Qichen, Cho Minsik, Merth Thomas, Mehta Sachin, Rastegari Mohammad, Najibi Mahyar
- BLINK: Multimodal Large Language Models Can See But Not Perceive Fu Xingyu, Hu Yushi, Li Bangzheng, Feng Yu, Wang Haoyu, Lin Xudong, Roth Dan, Smith Noah A., Ma Wei-chiu, Krishna Ranjay
- Using Large Language Models For The Interpretation Of Building Regulations Fuchs Stefan, Witbrock Michael, Dimyadi Johannes, Amor Robert
- Thinking Fair And Slow: On The Efficacy Of Structured Prompts For Debiasing Language Models Furniturewala Shaz, Jandial Surgan, Java Abhinav, Banerjee Pragyan, Shahid Simra, Bhatia Sumit, Jaidka Kokil
- RAVE: Residual Vector Embedding For Clip-guided Backlit Image Enhancement Gaintseva Tatiana, Benning Martin, Slabaugh Gregory
- Intelligence Analysis Of Language Models Galanti Liane, Baron Ethan
- Context-augmented Retrieval: A Novel Framework For Fast Information Retrieval Based Response Generation Using Large Language Model Ganesh Sai, Purwar Anupam, B Gautam
- Cost-efficient Large Language Model Serving For Multi-turn Conversations With Cachedattention Gao Bin, He Zhuomin, Sharma Puru, Kang Qingxuan, Jevdjic Djordje, Deng Junbo, Yang Xingkun, Yu Zhou, Zuo Pengfei
- The Best Of Both Worlds: Toward An Honest And Helpful Large Language Model Gao Chujie, Zhang Qihui, Chen Dongping, Huang Yue, Wu Siyuan, Fu Zhengyan, Wan Yao, Zhang Xiangliang, Sun Lichao
- Aligning LLM Agents By Learning Latent Preference From User Edits Gao Ge, Taymanov Alexey, Salinas Eduardo, Mineiro Paul, Misra Dipendra
- Self-evolving GPT: A Lifelong Autonomous Experiential Learner Gao Jinglong, Ding Xiao, Cui Yiming, Zhao Jianbai, Wang Hepeng, Liu Ting, Qin Bing
- Selfcp: Compressing Over-limit Prompt Via The Frozen Large Language Model Itself Gao Jun, Cao Ziqiang, Li Wenjie
- Unifying Demonstration Selection And Compression For In-context Learning Gao Jun, Cao Ziqiang, Li Wenjie
- AIM: Let Any Multi-modal Large Language Models Embrace Efficient In-context Learning Gao Jun, Qiao Qian, Cao Ziqiang, Wang Zili, Li Wenjie
- Harnessing The Intrinsic Knowledge Of Pretrained Language Models For Challenging Text Classification Settings Gao Lingyu
- DORY: Deliberative Prompt Recovery For LLM Gao Lirong, Peng Ru, Zhang Yiming, Zhao Junbo
- Meta Reasoning For Large Language Models Gao Peizhong, Xie Ao, Mao Shaoguang, Wu Wenshan, Xia Yan, Mi Haipeng, Wei Furu
- Customizing Language Model Responses With Contrastive In-context Learning Gao Xiang, Das Kamalika
- Introducing Bode: A Fine-tuned Large Language Model For Portuguese Prompt-based Task Garcia Gabriel Lino, Paiola Pedro Henrique, Morelli Luis Henrique, Candido Giovani, Júnior Arnaldo Cândido, Jodas Danilo Samuel, Afonso Luis C. S., Guilherme Ivan Rizzo, Penteado Bruno Elias, Papa João Paulo
- Attacking Large Language Models With Projected Gradient Descent Geisler Simon, Wollschläger Tom, Abdalla M. H. I., Gasteiger Johannes, Günnemann Stephan
- Edinburgh Clinical NLP At MEDIQA-CORR 2024: Guiding Large Language Models With Hints Gema Aryo Pradipta, Lee Chaeeun, Minervini Pasquale, Daines Luke, Simpson T. Ian, Alex Beatrice
- Supporting Sensemaking Of Large Language Model Outputs At Scale Gero Katy Ilonka, Swoopes Chelse, Gu Ziwei, Kummerfeld Jonathan K., Glassman Elena L.
- Jailbreaking Llms With Arabic Transliteration And Arabizi Ghanim Mansour Al, Almohaimeed Saleh, Zheng Mengxin, Solihin Yan, Lou Qian
- Reinforcement Learning Problem Solving With Large Language Models Gholamian Sina, Huh Domingo
- Investigating The Translation Capabilities Of Large Language Models Trained On Parallel Data Only Gilabert Javier García, Escolano Carlos, Savall Aleix Sant, Fornaciari Francesca De Luca, Mash Audrey, Liao Xixian, Melero Maite
- Using Large Language Models To Enrich The Documentation Of Datasets For Machine Learning Giner-miguelez Joan, Gómez Abel, Cabot Jordi
- Explicit And Implicit Large Language Model Personas Generate Opinions But Fail To Replicate Deeper Perceptions And Biases Giorgi Salvatore, Liu Tingting, Aich Ankit, Isman Kelsey, Sherman Garrick, Fried Zachary, Sedoc João, Ungar Lyle H., Curtis Brenda
- RE-GAINS & Enchant: Intelligent Tool Manipulation Systems For Enhanced Query Responses Girhepuje Sahil, Sajeev Siva Sankar, Jain Purvam, Sikder Arya, Varma Adithya Rama, George Ryan, Srinivasan Akshay Govind, Kurup Mahendra, Sinha Ashmit, Mondal Sudip
- Uncovering Hidden Intentions: Exploring Prompt Recovery For Deeper Insights Into Generated Texts Give Louis, Zaoral Timo, Bruno Maria Antonietta
- Blendsql: A Scalable Dialect For Unifying Hybrid Question Answering In Relational Algebra Glenn Parker, Dakle Parag Pravin, Wang Liang, Raghavan Preethi
- Memorization In In-context Learning Golchin Shahriar, Surdeanu Mihai, Bethard Steven, Blanco Eduardo, Riloff Ellen
- Extending Interactive Science Exhibits Into The Classroom Using Anthropomorphized Chatbots And Bloom's Taxonomy Golding Yousuf
- Tokenshap: Interpreting Large Language Models With Monte Carlo Shapley Value Estimation Goldshmidt Roni, Horovicz Miriam
- Mechanistic Interpretability Of Large Language Models With Applications To The Financial Services Industry Golgoon Ashkan, Filom Khashayar, Kannan Arjun Ravi
- Evaluation Of Llms On Syntax-aware Code Fill-in-the-middle Tasks Gong Linyuan, Wang Sida, Elhoushi Mostafa, Cheung Alvin
- Multi-frame, Lightweight & Efficient Vision-language Models For Question Answering In Autonomous Driving Gopalkrishnan Akshay, Greer Ross, Trivedi Mohan
- Few Shot Chain-of-thought Driven Reasoning To Prompt Llms For Open Ended Medical Question Answering Gramopadhye Ojas, Nachane Saeel Sandeep, Chanda Prateek, Ramakrishnan Ganesh, Jadhav Kshitij Sharad, Nandwani Yatin, Raghu Dinesh, Joshi Sachindra
- Specify What? Enhancing Neural Specification Synthesis By Symbolic Methods Granberry George, Ahrendt Wolfgang, Johansson Moa
- From Generalist To Specialist: Improving Large Language Models For Medical Physics Using Arcot Grandinetti Jace, Mcbeth Rafe
- Enhancing Llm-based Human-robot Interaction With Nuances For Diversity Awareness Grassi Lucrezia, Recchiuto Carmine Tommaso, Sgorbissa Antonio
- Is Mamba Capable Of In-context Learning? Grazzi Riccardo, Siems Julien, Schrodi Simon, Brox Thomas, Hutter Frank
- Talking The Talk Does Not Entail Walking The Walk: On The Limits Of Large Language Models In Lexical Entailment Recognition Greco Candida M., La Cava Lucio, Tagarelli Andrea
- Stress-testing Capability Elicitation With Password-locked Models Greenblatt Ryan, Roger Fabien, Krasheninnikov Dmitrii, Krueger David
- Fine-tuned Language Models Generate Stable Inorganic Materials As Text Gruver Nate, Sriram Anuroop, Madotto Andrea, Wilson Andrew Gordon, Zitnick C. Lawrence, Ulissi Zachary
- OLMES: A Standard For Language Model Evaluations Gu Yuling, Tafjord Oyvind, Kuehl Bailey, Haddad Dany, Dodge Jesse, Hajishirzi Hannaneh
- Detectbench: Can Large Language Model Detect And Piece Together Implicit Evidence? Gu Zhouhong, Zhang Lin, Zhu Xiaoxuan, Chen Jiangjie, Huang Wenhao, Zhang Yikai, Wang Shusen, Ye Zheyu, Gao Yan, Feng Hongwei, Xiao Yanghua
- Towards Generalist Prompting For Large Language Models By Mental Models Guan Haoxiang, He Jiyan, Zheng Shuxin, Chen En-hong, Zhang Weiming, Yu Nenghai
- Enhancing Collaborative Semantics Of Language Model-driven Recommendations Via Graph-aware Learning Guan Zhong, Wu Likang, Zhao Hongke, He Ming, Fan Jianpin
- Look Before You Leap: Towards Decision-aware And Generalizable Tool-usage For Large Language Models Gui Anchun, Li Jian, Dai Yong, Du Nan, Xiao Han
- Lisbon Computational Linguists At Semeval-2024 Task 2: Using A Mistral 7B Model And Data Augmentation Guimarães Artur, Martins Bruno, Magalhães João
- Sample Design Engineering: An Empirical Study Of What Makes Good Downstream Fine-tuning Samples For Llms Guo Biyang, Wang He, Xiao Wenyilin, Chen Hong, Lee Zhuxin, Han Songqiao, Huang Hailiang
- Integrating Large Language Models With Graphical Session-based Recommendation Guo Naicheng, Cheng Hongwei, Liang Qianqiao, Chen Linxun, Han Bing
- Direct Language Model Alignment From Online AI Feedback Guo Shangmin, Zhang Biao, Liu Tianlin, Liu Tianqi, Khalman Misha, Llinares Felipe, Rame Alexandre, Mesnard Thomas, Zhao Yao, Piot Bilal, Ferret Johan, Blondel Mathieu
- SGSH: Stimulate Large Language Models With Skeleton Heuristics For Knowledge Base Question Generation Guo Shasha, Liao Lizi, Zhang Jing, Wang Yanling, Li Cuiping, Chen Hong
- Serial Position Effects Of Large Language Models Guo Xiaobo, Vosoughi Soroush
- Stabletoolbench: Towards Stable Large-scale Benchmarking On Tool Learning Of Large Language Models Guo Zhicheng, Cheng Sijie, Wang Hao, Liang Shihao, Qin Yujia, Li Peng, Liu Zhiyuan, Sun Maosong, Liu Yang
- AAVENUE: Detecting LLM Biases On NLU Tasks In AAVE Via A Novel Benchmark Gupta Abhay, Meng Philip, Yurtseven Ece, O'brien Sean, Zhu Kevin
- LLM Task Interference: An Initial Study On The Impact Of Task-switch In Conversational History Gupta Akash, Sheth Ivaxi, Raina Vyas, Gales Mark, Fritz Mario
- Irel At Semeval-2024 Task 9: Improving Conventional Prompting Methods For Brain Teasers Gupta Harshit, Chaudhary Manav, Raha Tathagata, Subramanian Shivansh, Varma Vasudeva
- Walledeval: A Comprehensive Safety Evaluation Toolkit For Large Language Models Gupta Prannaya, Yau Le Qi, Low Hao Han, Lee I-shiang, Lim Hugo Maximus, Teoh Yu Xin, Koh Jia Hng, Liew Dar Win, Bhardwaj Rishabh, Bhardwaj Rajat, Poria Soujanya
- METAREFLECTION: Learning Instructions For Language Agents Using Past Reflections Gupta Priyanshu, Kirtania Shashank, Singha Ananya, Gulwani Sumit, Radhakrishna Arjun, Shi Sherry, Soares Gustavo
- PECC: Problem Extraction And Coding Challenges Haller Patrick, Golde Jonas, Akbik Alan
- Chain-of-interaction: Enhancing Large Language Models For Psychiatric Behavior Understanding By Dyadic Contexts Han Guangzeng, Liu Weisi, Huang Xiaolei, Borsari Brian
- Small Language Model Can Self-correct Han Haixia, Liang Jiaqing, Shi Jie, He Qianyu, Xiao Yanghua
- Chatgpt Based Data Augmentation For Improved Parameter-efficient Debiasing Of Llms Han Pengrui, Kocielnik Rafal, Saravanan Adhithya, Jiang Roy, Sharir Or, Anandkumar Anima
- Wildguard: Open One-stop Moderation Tools For Safety Risks, Jailbreaks, And Refusals Of Llms Han Seungju, Rao Kavel, Ettinger Allyson, Jiang Liwei, Lin Bill Yuchen, Lambert Nathan, Choi Yejin, Dziri Nouha
- Torchopera: A Compound AI System For LLM Safety Han Shanshan, Yao Yuhang, Hu Zijian, Stripelis Dimitris, Xu Zhaozhuo, He Chaoyang
- EMMA: Your Text-to-image Diffusion Model Can Secretly Accept Multi-modal Prompts Han Yucheng, Wang Rui, Zhang Chi, Hu Juntao, Cheng Pei, Fu Bin, Zhang Hanwang
- Jailbreaking Proprietary Large Language Models Using Word Substitution Cipher Handa Divij, Chirmule Advait, Gajera Bimal, Baral Chitta
- LLM Reasoners: New Evaluation, Library, And Analysis Of Step-by-step Reasoning With Large Language Models Hao Shibo, Gu Yi, Luo Haotian, Liu Tianyang, Shao Xiyan, Wang Xinyuan, Xie Shuhua, Ma Haodi, Samavedhi Adithya, Gao Qiyue, Wang Zhen, Hu Zhiting
- Astprompter: Weakly Supervised Automated Language Model Red-teaming To Identify Likely Toxic Prompts Hardy Amelia F., Liu Houjun, Lange Bernard, Kochenderfer Mykel J.
- Measuring Psychological Depth In Language Models Harel-canada Fabrice, Zhou Hanyu, Mupalla Sreya, Yildiz Zeynep, Sahai Amit, Peng Nanyun
- A Comparison Of Large Language Model And Human Performance On Random Number Generation Tasks Harrison Rachel M.
- Pruning For Protection: Increasing Jailbreak Resistance In Aligned Llms Without Fine-tuning Hasan Adib, Rugina Ileana, Wang Alex
- The Unreasonable Effectiveness Of Easy Training Data For Hard Tasks Hase Peter, Bansal Mohit, Clark Peter, Wiegreffe Sarah
- Large Language Models Are Biased Reinforcement Learners Hayes William M., Yax Nicolas, Palminteri Stefano
- Relative Value Biases In Large Language Models Hayes William M., Yax Nicolas, Palminteri Stefano
- POEM: Interactive Prompt Optimization For Enhancing Multimodal Reasoning Of Large Language Models He Jianben, Wang Xingbo, Liu Shiyi, Wu Guande, Silva Claudio, Qu Huamin
- From Words To Actions: Unveiling The Theoretical Underpinnings Of Llm-driven Autonomous Systems He Jianliang, Chen Siyu, Zhang Fengzhuo, Yang Zhuoran
- Multi-modal Instruction Tuned Llms With Fine-grained Visual Perception He Junwen, Wang Yifan, Wang Lijun, Lu Huchuan, He Jun-yan, Lan Jin-peng, Luo Bin, Xie Xuansong
- Prompt-based Bias Calibration For Better Zero/few-shot Learning Of Language Models He Kang, Long Yinghan, Roy Kaushik
- Achieving Tool Calling Functionality In Llms Using Only Prompt Engineering Without Fine-tuning He Shengtao
- Prompting Chatgpt For Translation: A Comparative Analysis Of Translation Brief And Persona Prompts He Sui
- MARS: Mixture Of Auto-regressive Models For Fine-grained Text-to-image Synthesis He Wanggui, Fu Siming, Liu Mushui, Wang Xierui, Xiao Wenyi, Shu Fangxun, Wang Yi, Zhang Lei, Yu Zhelun, Li Haoyuan, Huang Ziwei, Gan Leilei, Jiang Hao
- G-retriever: Retrieval-augmented Generation For Textual Graph Understanding And Question Answering He Xiaoxin, Tian Yijun, Sun Yifei, Chawla Nitesh V., Laurent Thomas, Lecun Yann, Bresson Xavier, Hooi Bryan
- BP4ER: Bootstrap Prompting For Explicit Reasoning In Medical Dialogue Generation He Yuhong, Zhang Yongqi, He Shizhu, Wan Jun
- Fortifying Ethical Boundaries In AI: Advanced Strategies For Enhancing Security In Large Language Models He Yunhong, Qiu Jianling, Zhang Wei, Yuan Zhengqing
- Automated Black-box Prompt Engineering For Personalized Text-to-image Generation He Yutong, Robey Alexander, Murata Naoki, Jiang Yiding, Williams Joshua, Pappas George J., Hassani Hamed, Mitsufuji Yuki, Salakhutdinov Ruslan, Kolter J. Zico
- Camelot: Towards Large Language Models With Training-free Consolidated Associative Memory He Zexue, Karlinsky Leonid, Kim Donghyun, Mcauley Julian, Krotov Dmitry, Feris Rogerio
- Position Engineering: Boosting Large Language Models Through Positional Information Manipulation He Zhiyuan, Jiang Huiqiang, Wang Zilong, Yang Yuqing, Qiu Luna, Qiu Lili
- PERSOMA: Personalized Soft Prompt Adapter Architecture For Personalized Language Prompting Hebert Liam, Sayana Krishna, Jash Ambarish, Karatzoglou Alexandros, Sodhi Sukhdeep, Doddapaneni Sumanth, Cai Yanli, Kuzmin Dima
- A Piece Of Theatre: Investigating How Teachers Design LLM Chatbots To Assist Adolescent Cyberbullying Education Hedderich Michael A., Bazarova Natalie N., Zou Wenting, Shim Ryun, Ma Xinda, Yang Qian
- Improving Minimum Bayes Risk Decoding With Multi-prompt Heineman David, Dou Yao, Xu Wei
- Can Large Language Models Make The Grade? An Empirical Study Evaluating Llms Ability To Mark Short Answer Questions In K-12 Education Henkel Owen, Boxer Adam, Hills Libby, Roberts Bill
- On Overcoming Miscalibrated Conversational Priors In Llm-based Chatbots Herlihy Christine, Neville Jennifer, Schnabel Tobias, Swaminathan Adith
- Eliciting Personality Traits In Large Language Models Hilliard Airlie, Munoz Cristian, Wu Zekun, Koshiyama Adriano Soares
- Optimising Hard Prompts With Few-shot Meta-prompting Hiraou Sayash Raaj
- Deepspeed-fastgen: High-throughput Text Generation For Llms Via MII And Deepspeed-inference Holmes Connor, Tanaka Masahiro, Wyatt Michael, Awan Ammar Ahmad, Rasley Jeff, Rajbhandari Samyam, Aminabadi Reza Yazdani, Qin Heyang, Bakhtiari Arash, Kurilenko Lev, He Yuxiong
- Evaluating The Elementary Multilingual Capabilities Of Large Language Models With Multiq Holtermann Carolin, Röttger Paul, Dill Timm, Lauscher Anne
- Outcome-constrained Large Language Models For Countering Hate Speech Hong Lingzi, Luo Pengcheng, Blanco Eduardo, Song Xiaoying
- Curiosity-driven Red-teaming For Large Language Models Hong Zhang-wei, Shenfeld Idan, Wang Tsun-hsuan, Chuang Yung-sung, Pareja Aldo, Glass James, Srivastava Akash, Agrawal Pulkit
- Visual Prompting For Generalized Few-shot Segmentation: A Multi-scale Approach Hossain Mir Rayat Imtiaz, Siam Mennatullah, Sigal Leonid, Little James J.
- Large Language Models As Misleading Assistants In Conversation Hou Betty Li, Shi Kejian, Phang Jason, Aung James, Adler Steven, Campbell Rosie
- A Systematic Evaluation Of Large Language Models For Generating Programming Code Hou Wenpin, Ji Zhicheng
- Raw Text Is All You Need: Knowledge-intensive Multi-turn Instruction Tuning For Large Language Model Hou Xia, Li Qifeng, Yang Jian, Li Tongliang, Chai Linzheng, Wu Xianjie, Ji Hangyuan, Li Zhoujun, Nie Jixuan, Dun Jingbo, Song Wenfeng
- Uncovering Bias In Large Vision-language Models At Scale With Counterfactuals Howard Phillip, Fraser Kathleen C., Bhiwandiwalla Anahita, Kiritchenko Svetlana
- Exploring Scaling Trends In LLM Robustness Howe Nikolaus, Zajac Michał, Mckenzie Ian, Hollinsworth Oskar, Tseng Tom, Bacon Pierre-luc, Gleave Adam
- Free-text Rationale Generation Under Readability Level Control Hsu Yi-sheng, Feldhus Nils, Hakimov Sherzod
- Game Generation Via Large Language Models Hu Chengpeng, Zhao Yunlong, Liu Jialin
- Rankprompt: Step-by-step Comparisons Make Language Models Better Reasoners Hu Chi, Ge Yuan, Ma Xiangnan, Cao Hang, Li Qiang, Yang Yonghua, Xiao Tong, Zhu Jingbo
- Language Models Align With Human Judgments On Key Grammatical Constructions Hu Jennifer, Mahowald Kyle, Lupyan Gary, Ivanova Anna, Levy Roger
- A Hopfieldian View-based Interpretation For Chain-of-thought Reasoning Hu Lijie, Liu Liang, Yang Shu, Chen Xin, Xiao Hongru, Li Mengdi, Zhou Pan, Ali Muhammad Asif, Wang Di
- Hiagent: Hierarchical Working Memory Management For Solving Long-horizon Agent Tasks With Large Language Model Hu Mengkang, Chen Tianxing, Chen Qiguang, Mu Yao, Shao Wenqi, Luo Ping
- SLM Meets LLM: Balancing Latency, Interpretability And Consistency In Hallucination Detection Hu Mengya, Xu Rui, Lei Deren, Li Yaxi, Wang Mingyu, Ching Emily, Kamal Eslam, Deng Alex
- Limited Out-of-context Knowledge Reasoning In Large Language Models Hu Peng, Gao Changjiang, Gao Ruiqi, Chen Jiajun, Huang Shujian
- 3D Building Generation In Minecraft Via Large Language Models Hu Shiying, Huang Zengrong, Hu Chengpeng, Liu Jialin
- FOKE: A Personalized And Explainable Education Framework Integrating Foundation Models, Knowledge Graphs, And Prompt Engineering Hu Silan, Wang Xiaoning
- Quantifying The Persona Effect In LLM Simulations Hu Tiancheng, Collier Nigel
- ELLA: Equip Diffusion Models With LLM For Enhanced Semantic Alignment Hu Xiwei, Wang Rui, Fang Yixiao, Fu Bin, Cheng Pei, Yu Gang
- Prompt Perturbation In Retrieval-augmented Generation Based Large Language Models Hu Zhibo, Wang Chen, Shu Yanfeng, Paik Helen Hye-young, Zhu Liming
- V2xum-llm: Cross-modal Video Summarization With Temporal Prompt Instruction Tuning Hua Hang, Tang Yunlong, Xu Chenliang, Luo Jiebo
- Talk Less, Interact Better: Evaluating In-context Conversational Adaptation In Multimodal Llms Hua Yilun, Artzi Yoav
- Multimodal Task Vectors Enable Many-shot Multimodal In-context Learning Huang Brandon, Mitra Chancharik, Arbelle Assaf, Karlinsky Leonid, Darrell Trevor, Herzig Roei
- Pokergpt: An End-to-end Lightweight Solver For Multi-player Texas Hold'em Via Large Language Model Huang Chenghao, Cao Yanbo, Wen Yinlong, Zhou Tao, Zhang Yanru
- What Affects The Stability Of Tool Learning? An Empirical Study On The Robustness Of Tool Learning Frameworks Huang Chengrui, Shi Zhengliang, Wen Yuntao, Chen Xiuying, Han Peng, Gao Shen, Shang Shuo
- Training Language Models To Generate Text With Citations Via Fine-grained Rewards Huang Chengyu, Wu Zeqiu, Hu Yushi, Wang Wenya
- Generating Educational Materials With Different Levels Of Readability Using Llms Huang Chieh-yang, Wei Jing, Huang Ting-hao 'kenneth'
- Chatgpt Rates Natural Language Explanation Quality Like Humans: But On Which Scales? Huang Fan, Kwak Haewoon, Park Kunwoo, An Jisun
- Verbalized Probabilistic Graphical Modeling With Large Language Models Huang Hengguan, Shen Xing, Wang Songtao, Liu Dianbo, Wang Hao
- Conme: Rethinking Evaluation Of Compositional Reasoning For Modern Vlms Huang Irene, Lin Wei, Mirza M. Jehanzeb, Hansen Jacob A., Doveh Sivan, Butoi Victor Ion, Herzig Roei, Arbelle Assaf, Kuehne Hilde, Darrell Trevor, Gan Chuang, Oliva Aude, Feris Rogerio, Karlinsky Leonid
- Mitigating Catastrophic Forgetting In Large Language Models With Self-synthesized Rehearsal Huang Jianheng, Cui Leyang, Wang Ante, Yang Chengyi, Liao Xinting, Song Linfeng, Yao Junfeng, Su Jinsong
- Reason3d: Searching And Reasoning 3D Segmentation Via Large Language Model Huang Kuan-chih, Li Xiangtai, Qi Lu, Yan Shuicheng, Yang Ming-hsuan
- Learning Fine-grained Grounded Citations For Attributed Large Language Models Huang Lei, Feng Xiaocheng, Ma Weitao, Gu Yuxuan, Zhong Weihong, Feng Xiachong, Yu Weijiang, Peng Weihua, Tang Duyu, Tu Dandan, Qin Bing
- The Solution For The 5th GCAIAC Zero-shot Referring Expression Comprehension Challenge Huang Longfei, Yu Feng, Guan Zhihao, Wan Zhonghua, Yang Yang
- Translate-and-revise: Boosting Large Language Models For Constrained Translation Huang Pengcheng, Mu Yongyu, Wu Yuzhang, Li Bei, Xiao Chunyang, Xiao Tong, Zhu Jingbo
- Selective Prompting Tuning For Personalized Conversations With Llms Huang Qiushi, Liu Xubo, Ko Tom, Wu Bo, Wang Wenwu, Zhang Yu, Tang Lilian
- Vaccine: Perturbation-aware Alignment For Large Language Models Against Harmful Fine-tuning Huang Tiansheng, Hu Sihao, Liu Ling
- Prompting Large Language Models With Knowledge Graphs For Question Answering Involving Long-tail Facts Huang Wenyu, Zhou Guancheng, Lapata Mirella, Vougiouklis Pavlos, Montella Sebastien, Pan Jeff Z.
- WESE: Weak Exploration To Strong Exploitation For LLM Agents Huang Xu, Liu Weiwen, Chen Xiaolong, Wang Xingmei, Lian Defu, Wang Yasheng, Tang Ruiming, Chen Enhong
- Lost In The Source Language: How Large Language Models Evaluate The Quality Of Machine Translation Huang Xu, Zhang Zhirui, Geng Xiang, Du Yichao, Chen Jiajun, Huang Shujian
- Leveraging Large Language Models For Enhanced NLP Task Performance Through Knowledge Distillation And Optimized Training Strategies Huang Yining, Tang Keke, Chen Meilian
- Trustllm: Trustworthiness In Large Language Models Huang Yue, Sun Lichao, Wang Haoran, Wu Siyuan, Zhang Qihui, Li Yuan, Gao Chujie, Huang Yixin, Lyu Wenhan, Zhang Yixuan, Li Xiner, Liu Zhengliang, Liu Yixin, Wang Yijue, Zhang Zhikun, Vidgen Bertie, Kailkhura Bhavya, Xiong Caiming, Xiao Chaowei, Li Chunyuan, Xing Eric, Huang Furong, Liu Hao, Ji Heng, Wang Hongyi, Zhang Huan, Yao Huaxiu, Kellis Manolis, Zitnik Marinka, Jiang Meng, Bansal Mohit, Zou James, Pei Jian, Liu Jian, Gao Jianfeng, Han Jiawei, Zhao Jieyu, Tang Jiliang, Wang Jindong, Vanschoren Joaquin, Mitchell John, Shu Kai, Xu Kaidi, Chang Kai-wei, He Lifang, Huang Lifu, Backes Michael, Gong Neil Zhenqiang, Yu Philip S., Chen Pin-yu, Gu Quanquan, Xu Ran, Ying Rex, Ji Shuiwang, Jana Suman, Chen Tianlong, Liu Tianming, Zhou Tianyi, Wang William, Li Xiang, Zhang Xiangliang, Wang Xiao, Xie Xing, Chen Xun, Wang Xuyu, Liu Yan, Ye Yanfang, Cao Yinzhi, Chen Yong, Zhao Yue
- Obscureprompt: Jailbreaking Large Language Models Via Obscure Input Huang Yue, Tang Jingyu, Chen Dongping, Tang Bingda, Wan Yao, Sun Lichao, Zhang Xiangliang
- Ecr-chain: Advancing Generative Language Models To Better Emotion-cause Reasoners Through Reasoning Chains Huang Zhaopei, Zhao Jinming, Jin Qin
- Enhancing The Capability And Robustness Of Large Language Models Through Reinforcement Learning-driven Query Refinement Huang Zisu, Wang Xiaohua, Zhang Feiran, Xu Zhibo, Zhang Cenyuan, Zheng Xiaoqing, Huang Xuanjing
- Instruction Fine-tuning: Does Prompt Loss Matter? Huerta-enochian Mathew, Ko Seung Yong
- Rot: Enhancing Large Language Models With Reflection On Search Trees Hui Wenyang, Tu Kewei
- Federated Recommendation Via Hybrid Retrieval Augmented Generation Huimin Zeng, Zhenrui Yue, Qian Jiang, Dong Wang
- The Promise And Challenges Of Using Llms To Accelerate The Screening Process Of Systematic Reviews Huotala Aleksi, Kuutila Miikka, Ralph Paul, Mäntylä Mika
- Vietnamese Poem Generation & The Prospect Of Cross-language Poem-to-poem Translation Huynh Triet Minh, Bao Quan Le
- Promptdsi: Prompt-based Rehearsal-free Instance-wise Incremental Learning For Document Retrieval Huynh Tuan-luc, Vu Thuy-trang, Wang Weiqing, Wei Yinwei, Le Trung, Gasevic Dragan, Li Yuan-fang, Do Thanh-toan
- Pc-lora: Low-rank Adaptation For Progressive Model Compression With Knowledge Distillation Hwang Injoon, Park Haewon, Lee Youngwan, Yang Jooyoung, Maeng Sunjae
- Assessing Llms Suitability For Knowledge Graph Completion Iga Vasile Ionut Remus, Silaghi Gheorghe Cosmin
- Self-guiding Exploration For Combinatorial Problems Iklassov Zangir, Du Yali, Akimov Farkhad, Takac Martin
- Test-time Low Rank Adaptation Via Confidence Maximization For Zero-shot Generalization Of Vision-language Models Imam Raza, Gani Hanan, Huzaifa Muhammad, Nandakumar Karthik
- Improving Multilingual Instruction Finetuning Via Linguistically Natural And Diverse Datasets Indurthi Sathish Reddy, Zhou Wenxuan, Chollampatt Shamil, Agrawal Ravi, Song Kaiqiang, Zhao Lingxiao, Zhu Chenguang
- Grade Score: Quantifying LLM Performance In Option Selection Iourovitski Dmitri
- Autocompletion Of Chief Complaints In The Electronic Health Records Using Large Language Models Islam K M Sajjadul, Nipu Ayesha Siddika, Madiraju Praveen, Deshpande Priya
- Mapcoder: Multi-agent Code Generation For Competitive Problem Solving Islam Md. Ashraful, Ali Mohammed Eunus, Parvez Md Rizwan
- Are Large Vision Language Models Up To The Challenge Of Chart Comprehension And Reasoning? An Extensive Investigation Into The Capabilities And Limitations Of Lvlms Islam Mohammed Saidul, Rahman Raian, Masry Ahmed, Laskar Md Tahmid Rahman, Nayeem Mir Tafseer, Hoque Enamul
- Polyglotoxicityprompts: Multilingual Evaluation Of Neural Toxic Degeneration In Large Language Models Jain Devansh, Kumar Priyanshu, Gehman Samuel, Zhou Xuhui, Hartvigsen Thomas, Sap Maarten
- From RAG To RICHES: Retrieval Interlaced With Sequence Generation Jain Palak, Soares Livio Baldini, Kwiatkowski Tom
- Structsum Generation For Faster Text Comprehension Jain Parag, Marzoca Andreea, Piccinno Francesco
- LLM Agents Improve Semantic Code Search Jain Sarthak, Dora Aditya, Sam Ka Seng, Singh Prabhat
- Rectifying Demonstration Shortcut In In-context Learning Jang Joonwon, Jang Sanghwan, Kweon Wonbin, Jeon Minjin, Yu Hwanjo
- Large Language Models Enable Automated Formative Feedback In Human-robot Interaction Tasks Jensen Emily, Sankaranarayanan Sriram, Hayes Bradley
- L4Q: Parameter Efficient Quantization-aware Fine-tuning On Large Language Models Jeon Hyesung, Kim Yulhwa, Kim Jae-joon
- SDQ: Sparse Decomposed Quantization For LLM Inference Jeong Geonhwa, Tsai Po-an, Keckler Stephen W., Krishna Tushar
- Defending Large Language Models Against Jailbreak Attacks Via Semantic Smoothing Ji Jiabao, Hou Bairu, Robey Alexander, Pappas George J., Hassani Hamed, Zhang Yang, Wong Eric, Chang Shiyu
- Can't Say Cant? Measuring And Reasoning Of Dark Jargons In Large Language Models Ji Xu, Zhang Jianyi, Zhou Ziyin, Zhao Zhangchi, Qiao Qianqian, Han Kaiying, Hossen Md Imran, Hei Xiali
- Linking In-context Learning In Transformers To Human Episodic Memory Ji-an Li, Zhou Corey Y., Benna Marcus K., Mattar Marcelo G.
- Chatbug: A Common Vulnerability Of Aligned Llms Induced By Chat Templates Jiang Fengqing, Xu Zhangchen, Niu Luyao, Lin Bill Yuchen, Poovendran Radha
- Minference 1.0: Accelerating Pre-filling For Long-context Llms Via Dynamic Sparse Attention Jiang Huiqiang, Li Yucheng, Zhang Chengruidong, Wu Qianhui, Luo Xufang, Ahn Surin, Han Zhenhua, Abdi Amir H., Li Dongsheng, Lin Chin-yew, Yang Yuqing, Qiu Lili
- MC\(^2\): Multi-concept Guidance For Customized Multi-concept Generation Jiang Jiaxiu, Zhang Yabo, Feng Kailai, Wu Xiaohe, Zuo Wangmeng
- Data Augmentation Of Multi-turn Psychological Dialogue Via Knowledge-driven Progressive Thought Prompting Jiang Jiyue, Chen Liheng, Wang Sheng, Kong Lingpeng, Li Yu, Wu Chuan
- Wildteaming At Scale: From In-the-wild Jailbreaks To (adversarially) Safer Language Models Jiang Liwei, Rao Kavel, Han Seungju, Ettinger Allyson, Brahman Faeze, Kumar Sachin, Mireshghallah Niloofar, Lu Ximing, Sap Maarten, Choi Yejin, Dziri Nouha
- Enhancing Robustness In Large Language Models: Prompting For Mitigating The Impact Of Irrelevant Information Jiang Ming, Huang Tingting, Guo Biao, Lu Yao, Zhang Feng
- Investigating Data Contamination For Pre-training Language Models Jiang Minhao, Liu Ken Ziyu, Zhong Ming, Schaeffer Rylan, Ouyang Siru, Han Jiawei, Koyejo Sanmi
- Seek And Solve Reasoning For Table Question Answering Jiang Ruya, Wang Chun, Deng Weihong
- Sketch: A Toolkit For Streamlining LLM Operations Jiang Xin, Li Xiang, Ma Wenjia, Fang Xuezhi, Yao Yiqun, Yu Naitong, Meng Xuying, Han Peng, Li Jing, Sun Aixin, Wang Yequan
- Large Visual-language Models Are Also Good Classifiers: A Study Of In-context Multimodal Fake News Detection Jiang Ye, Wang Yimin
- Llms Can Find Mathematical Reasoning Mistakes By Pedagogical Chain-of-thought Jiang Zhuoxuan, Peng Haoyuan, Feng Shanshan, Li Fan, Li Dongsheng
- Gradable Chatgpt Translation Evaluation Jiao Hui, Peng Bei, Zong Lu, Zhang Xiaojun, Li Xinwei
- Prompt-based Length Controlled Generation With Multiple Control Types Jie Renlong, Meng Xiaojun, Shang Lifeng, Jiang Xin, Liu Qun
- APEER: Automatic Prompt Engineering Enhances Large Language Model Reranking Jin Can, Peng Hongwu, Zhao Shiyu, Wang Zhenting, Xu Wujiang, Han Ligong, Zhao Jiahui, Zhong Kai, Rajasekaran Sanguthevar, Metaxas Dimitris N.
- Zero-shot Chain-of-thought Reasoning Guided By Evolutionary Algorithms In Large Language Models Jin Feihu, Liu Yifan, Tan Ying
- The Impact Of Reasoning Step Length On Large Language Models Jin Mingyu, Yu Qinkai, Shu Dong, Zhao Haiyan, Hua Wenyue, Meng Yanda, Zhang Yongfeng, Du Mengnan
- HGT: Leveraging Heterogeneous Graph-enhanced Large Language Models For Few-shot Complex Table Understanding Jin Rihui, Li Yu, Qi Guilin, Hu Nan, Li Yuan-fang, Chen Jiaoyan, Wang Jianan, Chen Yongrui, Min Dehai
- Adaptive Skeleton Graph Decoding Jin Shuowei, Wu Yongji, Zheng Haizhong, Zhang Qingzhao, Lentz Matthew, Mao Z. Morley, Prakash Atul, Qian Feng, Zhuo Danyang
- MULTIVERSE: Exposing Large Language Model Alignment Problems In Diverse Worlds Jin Xiaolong, Zhang Zhuo, Zhang Xiangyu
- Investigating Multi-hop Factual Shortcuts In Knowledge Editing Of Large Language Models Ju Tianjie, Chen Yijin, Yuan Xinwei, Zhang Zhuosheng, Du Wei, Zheng Yubin, Liu Gongshen
- LARA: Linguistic-adaptive Retrieval-augmented Llms For Multi-turn Intent Classification Junhua Liu, Keat Tan Yong, Bin Fu
- Dipt: Enhancing LLM Reasoning Through Diversified Perspective-taking Just Hoang Anh, Dabas Mahavir, Huang Lifu, Jin Ming, Jia Ruoxi
- Evaluating The Impact Of Advanced LLM Techniques On Ai-lecture Tutors For A Robotics Course Kahl Sebastian, Löffler Felix, Maciol Martin, Ridder Fabian, Schmitz Marius, Spanagel Jennifer, Wienkamp Jens, Burgahn Christopher, Schilling Malte
- Llms Can't Plan, But Can Help Planning In Llm-modulo Frameworks Kambhampati Subbarao, Valmeekam Karthik, Guan Lin, Verma Mudit, Stechly Kaya, Bhambri Siddhant, Saldyt Lucas, Murthy Anil
- When Can Llms Actually Correct Their Own Mistakes? A Critical Survey Of Self-correction Of Llms Kamoi Ryo, Zhang Yusen, Zhang Nan, Han Jiawei, Zhang Rui
- Eagle: Ethical Dataset Given From Real Interactions Kaneko Masahiro, Bollegala Danushka, Baldwin Timothy
- Evaluating Gender Bias In Large Language Models Via Chain-of-thought Prompting Kaneko Masahiro, Bollegala Danushka, Okazaki Naoaki, Baldwin Timothy
- Prompt-rag: Pioneering Vector Embedding-free Retrieval-augmented Generation In Niche Domains, Exemplified By Korean Medicine Kang Bongsu, Kim Jundong, Yun Tae-rim, Kim Chang-eop
- Prompting Large Language Models With Audio For General-purpose Speech Summarization Kang Wonjune, Roy Deb
- MEDIC: Towards A Comprehensive Framework For Evaluating Llms In Clinical Applications Kanithi Praveen K, Christophe Clément, Pimentel Marco Af, Raha Tathagata, Saadi Nada, Javed Hamza, Maslenkova Svetlana, Hayat Nasir, Rajan Ronnie, Khan Shadab
- Solving For X And Beyond: Can Large Language Models Solve Complex Math Problems With More-than-two Unknowns? Kao Kuei-chun, Wang Ruochen, Hsieh Cho-jui
- Large Language Models Must Be Taught To Know What They Don't Know Kapoor Sanyam, Gruver Nate, Roberts Manley, Collins Katherine, Pal Arka, Bhatt Umang, Weller Adrian, Dooley Samuel, Goldblum Micah, Wilson Andrew Gordon
- Can Reinforcement Learning Unlock The Hidden Dangers In Aligned Large Language Models? Karkevandi Mohammad Bahrami, Vishwamitra Nishant, Najafirad Peyman
- Evaluating Interventional Reasoning Capabilities Of Large Language Models Kasetty Tejas, Mahajan Divyat, Dziugaite Gintare Karolina, Drouin Alexandre, Sridhar Dhanya
- Alpaca Against Vicuna: Using Llms To Uncover Memorization Of Llms Kassem Aly M., Mahmoud Omar, Mireshghallah Niloofar, Kim Hyunwoo, Tsvetkov Yulia, Choi Yejin, Saad Sherif, Rana Santu
- Faaf: Facts As A Function For The Evaluation Of Generated Text Katranidis Vasileios, Barany Gabor
- Comparing Plausibility Estimates In Base And Instruction-tuned Large Language Models Kauf Carina, Chersoni Emmanuele, Lenci Alessandro, Fedorenko Evelina, Ivanova Anna A.
- Instruct-skillmix: A Powerful Pipeline For LLM Instruction Tuning Kaur Simran, Park Simon, Goyal Anirudh, Arora Sanjeev
- Defending Large Language Models Against Attacks With Residual Stream Activation Analysis Kawasaki Amelia, Davis Andrew, Abbas Houssam
- Retrieval-augmented Conversational Recommendation With Prompt-based Semi-structured Natural Language State Tracking Kemper Sara, Cui Justin, Dicarlantonio Kai, Lin Kathy, Tang Danjie, Korikov Anton, Sanner Scott
- Autonomous Prompt Engineering In Large Language Models Kepel Daan, Valogianni Konstantina
- Harmonic Llms Are Trustworthy Kersting Nicholas S., Rahman Mohammad, Vedala Suchismitha, Wang Yang
- Source-aware Training Enables Knowledge Attribution In Language Models Khalifa Muhammad, Wadden David, Strubell Emma, Lee Honglak, Wang Lu, Beltagy Iz, Peng Hao
- Indicllmsuite: A Blueprint For Creating Pre-training And Fine-tuning Datasets For Indian Languages Khan Mohammed Safi Ur Rahman, Mehta Priyam, Sankar Ananth, Kumaravelan Umashankar, Doddapaneni Sumanth, G Suriyaprasaad, G Varun Balan, Jain Sparsh, Kunchukuttan Anoop, Kumar Pratyush, Dabre Raj, Khapra Mitesh M.
- Reinforcement Of Explainability Of Chatgpt Prompts By Embedding Breast Cancer Self-screening Rules Into AI Responses Khan Yousef, Hamed Ahmed Abdeen
- Self-training Large Language Models For Improved Visual Program Synthesis With Visual Reinforcement Khan Zaid, Bg Vijay Kumar, Schulter Samuel, Fu Yun, Chandraker Manmohan
- Iterative Prompt Refinement For Radiation Oncology Symptom Extraction Using Teacher-student Large Language Models Khanmohammadi Reza, Ghanem Ahmed I, Verdecchia Kyle, Hall Ryan, Elshaikh Mohamed, Movsas Benjamin, Bagher-ebadian Hassan, Chetty Indrin, Ghassemi Mohammad M., Thind Kundan
- How Well Do Llms Represent Values Across Cultures? Empirical Analysis Of LLM Responses Based On Hofstede Cultural Dimensions Kharchenko Julia, Roosta Tanya, Chadha Aman, Shah Chirag
- How Good Is My Video LMM? Complex Video Reasoning And Robustness Evaluation Suite For Video-lmms Khattak Muhammad Uzair, Naeem Muhammad Ferjad, Hassan Jameel, Naseer Muzammal, Tombari Federico, Khan Fahad Shahbaz, Khan Salman
- Learning To Prompt With Text Only Supervision For Vision-language Models Khattak Muhammad Uzair, Naeem Muhammad Ferjad, Naseer Muzammal, Van Gool Luc, Tombari Federico
- Efficient Medical Question Answering With Knowledge-augmented Question Generation Khlaut Julien, Dancette Corentin, Ferreres Elodie, Bennani Alaedine, Hérent Paul, Manceron Pierre
- Why And When Llm-based Assistants Can Go Wrong: Investigating The Effectiveness Of Prompt-based Interactions For Software Help-seeking Khurana Anjali, Subramonyam Hari, Chilana Parmit K
- Guiding Large Language Models To Post-edit Machine Translation With Error Annotations Ki Dayeon, Carpuat Marine
- Keyword-driven Retrieval-augmented Large Language Models For Cold-start User Recommendations Kieu Hai-dang, Nguyen Minh Duc, Nguyen Thanh-son, Le Dung D.
- Aligning Large Language Models With Self-generated Preference Data Kim Dongyoung, Lee Kimin, Shin Jinwoo, Kim Jaehyung
- Nevermind: Instruction Override And Moderation In Large Language Models Kim Edward
- Promptkd: Distilling Student-friendly Knowledge For Generative Language Models Via Prompt Tuning Kim Gyeongman, Jang Doohyuk, Yang Eunho
- Sure: Summarizing Retrievals Using Answer Candidates For Open-domain QA Of Llms Kim Jaehyung, Nam Jaehyun, Mo Sangwoo, Park Jongjin, Lee Sang-woo, Seo Minjoon, Ha Jung-woo, Shin Jinwoo
- Few-shot Personalization Of Llms With Mis-aligned Responses Kim Jaehyung, Yang Yiming
- Exploring Prompting Methods For Mitigating Class Imbalance Through Synthetic Data Generation With Large Language Models Kim Jinhee, Kim Taesung, Choo Jaegul
- Strategic Data Ordering: Enhancing Large Language Model Performance Through Curriculum Learning Kim Jisu, Lee Juhwan
- Persona Is A Double-edged Sword: Enhancing The Zero-shot Reasoning By Ensembling The Role-playing And Neutral Prompts Kim Junseok, Yang Nakyeong, Jung Kyomin
- Automatic Jailbreaking Of The Text-to-image Generative AI Systems Kim Minseon, Lee Hyomin, Gong Boqing, Zhang Huishuai, Hwang Sung Ju
- Is GPT-4 Alone Sufficient For Automated Essay Scoring?: A Comparative Judgment Approach Based On Rater Cognition Kim Seungju, Jo Meounggun
- Groundial: Human-norm Grounded Safe Dialog Response Generation Kim Siwon, Dai Shuyang, Kachuee Mohammad, Ray Shayan, Taghavi Tara, Yoon Sungroh
- A Survey On Integration Of Large Language Models With Intelligent Robots Kim Yeseung, Kim Dohyun, Choi Jieun, Park Jisang, Oh Nayoung, Park Daehyung
- Health-llm: Large Language Models For Health Prediction Via Wearable Sensor Data Kim Yubin, Xu Xuhai, Mcduff Daniel, Breazeal Cynthia, Park Hae Won
- LOGIC-LM++: Multi-step Refinement For Symbolic Formulations Kirtania Shashank, Gupta Priyanshu, Radhakrishna Arjun
- Unveiling Divergent Inductive Biases Of Llms On Temporal Data Kishore Sindhu, He Hangfeng
- Native Vs Non-native Language Prompting: A Comparative Analysis Kmainasi Mohamed Bayan, Khan Rakif, Shahroor Ali Ezzat, Bendou Boushra, Hasanain Maram, Alam Firoj
- Recmind: Japanese Movie Recommendation Dialogue With Seeker's Internal State Kodama Takashi, Kiyomaru Hirokazu, Huang Yin Jou, Kurohashi Sadao
- Cleared For Takeoff? Compositional & Conditional Reasoning May Be The Achilles Heel To (flight-booking) Language Agents Kohli Harsh, Sun Huan
- Chatspamdetector: Leveraging Large Language Models For Effective Phishing Email Detection Koide Takashi, Fukushi Naoki, Nakano Hiroki, Chiba Daiki
- Style Vectors For Steering Generative Large Language Model Konen Kai, Jentzsch Sophie, Diallo Diaoulé, Schütt Peer, Bensch Oliver, Baff Roxanne El, Opitz Dominik, Hecking Tobias
- Self-prompt Tuning: Enable Autonomous Role-playing In Llms Kong Aobo, Zhao Shiwan, Chen Hao, Li Qicheng, Qin Yong, Sun Ruiqi, Zhou Xin, Zhou Jiaming, Sun Haoqin
- Aligning Large Language Models With Representation Editing: A Control Perspective Kong Lingkai, Wang Haorui, Mu Wenhao, Du Yuanqi, Zhuang Yuchen, Zhou Yifei, Song Yue, Zhang Rongzhi, Wang Kai, Zhang Chao
- Prewrite: Prompt Rewriting With Reinforcement Learning Kong Weize, Hombaiah Spurthi Amba, Zhang Mingyang, Mei Qiaozhu, Bendersky Michael
- Controllable Navigation Instruction Generation With Chain Of Thought Prompting Kong Xianghao, Chen Jinyu, Wang Wenguan, Su Hang, Hu Xiaolin, Yang Yi, Liu Si
- Customizing Language Models With Instance-wise Lora For Sequential Recommendation Kong Xiaoyu, Wu Jiancan, Zhang An, Sheng Leheng, Lin Hui, Wang Xiang, He Xiangnan
- QPO: Query-dependent Prompt Optimization Via Multi-loop Offline Reinforcement Learning Kong Yilun, Mao Hangyu, Zhao Qi, Zhang Bin, Ruan Jingqing, Shen Li, Chang Yongzhe, Wang Xueqian, Zhao Rui, Tao Dacheng
- Audio Flamingo: A Novel Audio Language Model With Few-shot Learning And Dialogue Abilities Kong Zhifeng, Goel Arushi, Badlani Rohan, Ping Wei, Valle Rafael, Catanzaro Bryan
- Bitune: Bidirectional Instruction-tuning Kopiczko Dawid J., Blankevoort Tijmen, Asano Yuki M.
- Leveraging Large Language Models For Enhanced Process Model Comprehension Kourani Humam, Berti Alessandro, Henrich Jasmin, Kratsch Wolfgang, Weidlich Robin, Li Chiao-yun, Arslan Ahmad, Schuster Daniel, Van Der Aalst Wil M. P.
- Retrieval-augmented Code Generation For Situated Action Generation: A Case Study On Minecraft Kranti Chalamalasetti, Hakimov Sherzod, Schlangen David
- Understanding The Effects Of Iterative Prompting On Truthfulness Krishna Satyapriya, Agarwal Chirag, Lakkaraju Himabindu
- Can Large Language Models Explore In-context? Krishnamurthy Akshay, Harris Keegan, Foster Dylan J., Zhang Cyril, Slivkins Aleksandrs
- Can Better Text Semantics In Prompt Tuning Improve VLM Generalization? Kuchibhotla Hari Chandana, Kancheti Sai Srinivas, Reddy Abbavaram Gowtham, Balasubramanian Vineeth N
- Synthdst: Synthetic Data Is All You Need For Few-shot Dialog State Tracking Kulkarni Atharva, Tseng Bo-hsiang, Moniz Joel Ruben Antony, Piraviperumal Dhivya, Yu Hong, Bhargava Shruti
- Confidence Under The Hood: An Investigation Into The Confidence-probability Alignment In Large Language Models Kumar Abhishek, Morabito Robert, Umbet Sanzhar, Kabbara Jad, Emami Ali
- Multi-modal Generation Via Cross-modal In-context Learning Kumar Amandeep, Naseer Muzammal, Narayan Sanath, Anwer Rao Muhammad, Khan Salman, Cholakkal Hisham
- Fine-tuning, Quantization, And Llms: Navigating Unintended Outcomes Kumar Divyanshu, Kumar Anurakt, Agarwal Sahil, Harshangi Prashanth
- Improving Socratic Question Generation Using Data Augmentation And Preference Optimization Kumar Nischal Ashok, Lan Andrew
- Bridging The Gap: Dynamic Learning Strategies For Improving Multilingual Performance In Llms Kumar Somnath, Balloli Vaibhav, Ranjit Mercy, Ahuja Kabir, Ganu Tanuja, Sitaram Sunayana, Bali Kalika, Nambi Akshay
- Evaluating Cultural Adaptability Of A Large Language Model Via Simulation Of Synthetic Personas Kwok Louis, Bravansky Michal, Griffin Lewis D.
- Open Models, Closed Minds? On Agents Capabilities In Mimicking Human Personalities Through Open Large Language Models La Cava Lucio, Tagarelli Andrea
- Can Github Issues Be Solved With Tree Of Thoughts? La Rosa Ricardo, Hulse Corey, Liu Bangdi
- RES-Q: Evaluating Code-editing Large Language Model Systems At The Repository Scale Labash Beck, Rosedale August, Reents Alex, Negritto Lucas, Wiel Colin
- RVISA: Reasoning And Verification For Implicit Sentiment Analysis Lai Wenna, Xie Haoran, Xu Guandong, Li Qing
- Residual-based Language Models Are Free Boosters For Biomedical Imaging Lai Zhixin, Wu Jing, Chen Suiyao, Zhou Yucheng, Hovakimyan Naira
- Cat-bench: Benchmarking Language Model Understanding Of Causal And Temporal Dependencies In Plans Lal Yash Kumar, Cohen Vanya, Chambers Nathanael, Balasubramanian Niranjan, Mooney Raymond
- Lapdoc: Layout-aware Prompting For Documents Lamott Marcel, Weweler Yves-noel, Ulges Adrian, Shafait Faisal, Krechel Dirk, Obradovic Darko
- FOCUS: Forging Originality Through Contrastive Use In Self-plagiarism For Language Models Lan Kaixin, Fang Tao, Wong Derek F., Xu Yabo, Chao Lidia S., Zhao Cecilia G.
- Fishing For Magikarp: Automatically Detecting Under-trained Tokens In Large Language Models Land Sander, Bartolo Max
- Large Language Models As Evolution Strategies Lange Robert Tjarko, Tian Yingtao, Tang Yujin
- Query-opt: Optimizing Inference Of Large Language Models Via Multi-query Instructions In Meeting Summarization Laskar Md Tahmid Rahman, Khasanova Elena, Fu Xue-yong, Chen Cheng, Tn Shashi Bhushan
- Melting Point: Mobile Evaluation Of Language Transformers Laskaridis Stefanos, Katevas Kleomenis, Minto Lorenzo, Haddadi Hamed
- Benchmarks Underestimate The Readiness Of Multi-lingual Dialogue Agents Lee Andrew H., Semnani Sina J., Castillo-lópez Galo, De Chalendar Gäel, Choudhury Monojit, Dua Ashna, Kavitha Kapil Rajesh, Kim Sungkyun, Kodali Prashant, Kumaraguru Ponnurangam, Lombard Alexis, Moradshahi Mehrad, Park Gihyun, Semmar Nasredine, Seo Jiwon, Shen Tianhao, Shrivastava Manish, Xiong Deyi, Lam Monica S.
- Language Models Show Stable Value Orientations Across Diverse Role-plays Lee Bruce W., Lee Yeongheon, Cho Hyunsoo
- Programming Refusal With Conditional Activation Steering Lee Bruce W., Padhi Inkit, Ramamurthy Karthikeyan Natesan, Miehling Erik, Dognin Pierre, Nagireddy Manish, Dhurandhar Amit
- Collavo: Crayon Large Language And Vision Model Lee Byung-kwan, Park Beomchan, Kim Chae Won, Ro Yong Man
- Hip Attention: Sparse Sub-quadratic Attention With Hierarchical Attention Pruning Lee Heejun, Park Geon, Lee Youngwan, Kim Jina, Jeong Wonyoung, Jeon Myeongjae, Hwang Sung Ju
- Can Long-context Language Models Subsume Retrieval, RAG, SQL, And More? Lee Jinhyuk, Chen Anthony, Dai Zhuyun, Dua Dheeru, Sachan Devendra Singh, Boratko Michael, Luan Yi, Arnold Sébastien M. R., Perot Vincent, Dalmia Siddharth, Hu Hexiang, Lin Xudong, Pasupat Panupong, Amini Aida, Cole Jeremy R., Riedel Sebastian, Naim Iftekhar, Chang Ming-wei, Guu Kelvin
- A Human-inspired Reading Agent With Gist Memory Of Very Long Contexts Lee Kuang-huei, Chen Xinyun, Furuta Hiroki, Canny John, Fischer Ian
- Probability Of Differentiation Reveals Brittleness Of Homogeneity Bias In Large Language Models Lee Messi H. J., Lai Calvin K.
- Prompting Large Language Models For Zero-shot Essay Scoring Via Multi-trait Specialization Lee Sanwoo, Cai Yida, Meng Desong, Wang Ziyang, Wu Yunfang
- Effective And Efficient Conversation Retrieval For Dialogue State Tracking With Implicit Text Summaries Lee Seanie, Cheng Jianpeng, Driesen Joris, Coca Alexandru, Johannsen Anders
- Aligning To Thousands Of Preferences Via System Message Generalization Lee Seongyun, Park Sue Hyun, Kim Seungone, Seo Minjoon
- Do Llms Have Distinct And Consistent Personality? TRAIT: Personality Testset Designed For Llms With Psychometrics Lee Seungbeen, Lim Seungwon, Han Seungju, Oh Giyeong, Chae Hyungjoo, Chung Jiwan, Kim Minju, Kwak Beong-woo, Lee Yeonsoo, Lee Dongha, Yeo Jinyoung, Yu Youngjae
- COCOA: Cbt-based Conversational Counseling Agent Using Memory Specialized In Cognitive Distortions And Dynamic Prompt Lee Suyeon, Kang Jieun, Kim Harim, Chung Kyoung-mee, Lee Dongha, Yeo Jinyoung
- Learning To Reduce: Optimal Representations Of Structured Data In Prompting Large Language Models Lee Younghun, Kim Sungchul, Yu Tong, Rossi Ryan A., Chen Xiang
- Meta-task Prompting Elicits Embeddings From Large Language Models Lei Yibin, Wu Di, Zhou Tianyi, Shen Tao, Cao Yu, Tao Chongyang, Yates Andrew
- Prexme! Large Scale Prompt Exploration Of Open Source Llms For Machine Translation And Summarization Evaluation Leiter Christoph, Eger Steffen
- Intent-based Prompt Calibration: Enhancing Prompt Optimization With Synthetic Boundary Cases Levi Elad, Brosh Eli, Friedmann Matan
- Eliciting Better Multilingual Structured Reasoning From Llms Through Code Li Bryan, Alkhouli Tamer, Bonadiman Daniele, Pappas Nikolaos, Mansour Saab
- Concentrate Attention: Towards Domain-generalizable Prompt Optimization For Language Models Li Chengzhengxu, Liu Xiaoming, Zhang Zhaohan, Wang Yichen, Liu Chen, Lan Yu, Shen Chao
- Searchlvlms: A Plug-and-play Framework For Augmenting Large Vision-language Models By Searching Up-to-date Internet Knowledge Li Chuanhao, Li Zhen, Jing Chenchen, Liu Shuo, Shao Wenqi, Wu Yuwei, Luo Ping, Qiao Yu, Zhang Kaipeng
- Humaneval On Latest GPT Models -- 2024 Li Daniel, Murr Lincoln
- Recall, Retrieve And Reason: Towards Better In-context Relation Extraction Li Guozheng, Wang Peng, Ke Wenjun, Guo Yikai, Ji Ke, Shang Ziyu, Liu Jiajun, Xu Zijie
- Know The Unknown: An Uncertainty-sensitive Method For LLM Instruction Tuning Li Jiaqi, Tang Yixuan, Yang Yi
- Steering Llms Towards Unbiased Responses: A Causality-guided Debiasing Framework Li Jingling, Tang Zeyu, Liu Xiaoyu, Spirtes Peter, Zhang Kun, Leqi Liu, Liu Yang
- Fmm-attack: A Flow-based Multi-modal Adversarial Attack On Video-based Llms Li Jinmin, Gao Kuofeng, Bai Yang, Zhang Jingyun, Xia Shu-tao, Wang Yisen
- Stylechat: Learning Recitation-augmented Memory In Llms For Stylized Dialogue Generation Li Jinpeng, Zhang Zekai, Tu Quan, Cheng Xin, Zhao Dongyan, Yan Rui
- Pctoolkit: A Unified Plug-and-play Prompt Compression Toolkit Of Large Language Models Li Jinyi, Lan Yihuai, Wang Lei, Wang Hao
- Measuring And Controlling Instruction (in)stability In Language Model Dialogs Li Kenneth, Liu Tianle, Bashkansky Naomi, Bau David, Viégas Fernanda, Pfister Hanspeter, Wattenberg Martin
- Confidence Matters: Revisiting Intrinsic Self-correction Capabilities Of Large Language Models Li Loka, Chen Zhenhao, Chen Guangyi, Zhang Yixuan, Su Yusheng, Xing Eric, Zhang Kun
- Predicting Vs. Acting: A Trade-off Between World Modeling & Agent Modeling Li Margaret, Shi Weijia, Pagnoni Artidoro, West Peter, Holtzman Ari
- A Sentiment Consolidation Framework For Meta-review Generation Li Miao, Lau Jey Han, Hovy Eduard
- Bridging The Language Gap: Enhancing Multilingual Prompt-based Code Generation In Llms Via Zero-shot Cross-lingual Transfer Li Mingda, Mishra Abhijit, Mujumdar Utkarsh
- Nearest Neighbor Speculative Decoding For LLM Generation And Attribution Li Minghan, Chen Xilun, Holtzman Ari, Chen Beidi, Lin Jimmy, Yih Wen-tau, Lin Xi Victoria
- LLM Defenses Are Not Robust To Multi-turn Human Jailbreaks Yet Li Nathaniel, Han Ziwen, Steneker Ian, Primack Willow, Goodside Riley, Zhang Hugh, Wang Zifan, Menghini Cristina, Yue Summer
- Gsm-plus: A Comprehensive Benchmark For Evaluating The Robustness Of Llms As Mathematical Problem Solvers Li Qintong, Cui Leyang, Zhao Xueliang, Kong Lingpeng, Bi Wei
- Prompt4vis: Prompting Large Language Models With Example Mining And Schema Filtering For Tabular Data Visualization Li Shuaimin, Chen Xuanang, Song Yuanfeng, Song Yunze, Zhang Chen
- MEDIQ: Question-asking Llms For Adaptive And Reliable Clinical Reasoning Li Shuyue Stella, Balachandran Vidhisha, Feng Shangbin, Ilgen Jonathan, Pierson Emma, Koh Pang Wei, Tsvetkov Yulia
- Model-enhanced Llm-driven VUI Testing Of VPA Apps Li Suwan, Bu Lei, Bai Guangdong, Xie Fuman, Chen Kai, Yue Chang
- Long-context Llms Struggle With Long In-context Learning Li Tianle, Zhang Ge, Do Quy Duc, Yue Xiang, Chen Wenhu
- Stbench: Assessing The Ability Of Large Language Models In Spatio-temporal Analysis Li Wenbin, Yao Di, Zhao Ruibo, Chen Wenjie, Xu Zijie, Luo Chengxue, Gong Chang, Jing Quanliang, Tan Haining, Bi Jingping
- Optimizing Psychological Counseling With Instruction-tuned Large Language Models Li Wenjie, Sun Tianyu, Qian Kun, Wang Wenhong
- Unlocking Large Language Model's Planning Capabilities With Maximum Diversity Fine-tuning Li Wenjun, Chen Changyu, Varakantham Pradeep
- MM-TTS: A Unified Framework For Multimodal, Prompt-induced Emotional Text-to-speech Synthesis Li Xiang, Cheng Zhi-qi, He Jun-yan, Peng Xiaojiang, Hauptmann Alexander G.
- Llara: Supercharging Robot Learning Data For Vision-language Policy Li Xiang, Mata Cristina, Park Jongwoo, Kahatapitiya Kumara, Jang Yoo Sung, Shang Jinghuan, Ranasinghe Kanchana, Burgert Ryan, Cai Mu, Lee Yong Jae, Ryoo Michael S.
- Grounded Compositional And Diverse Text-to-3d With Pretrained Multi-view Diffusion Model Li Xiaolong, Mo Jiawei, Wang Ying, Parameshwara Chethan, Fei Xiaohan, Swaminathan Ashwin, Taylor Cj, Tu Zhuowen, Favaro Paolo, Soatto Stefano
- Evaluating Mathematical Reasoning Of Large Language Models: A Focus On Error Identification And Correction Li Xiaoyuan, Wang Wenjie, Li Moxin, Guo Junrong, Zhang Yang, Feng Fuli
- Ganprompt: Enhancing Robustness In Llm-based Recommendations With Gan-enhanced Diversity Prompts Li Xinyu, Zhao Chuang, Zhao Hongke, Wu Likang, He Ming
- Say More With Less: Understanding Prompt Learning Behaviors Through Gist Compression Li Xinze, Liu Zhenghao, Xiong Chenyan, Yu Shi, Yan Yukun, Wang Shuo, Yu Ge
- Rethinking Chatgpt's Success: Usability And Cognitive Behaviors Enabled By Auto-regressive Llms' Prompting Li Xinzhe, Liu Ming
- When Hindsight Is Not 20/20: Testing Limits On Reflective Thinking In Large Language Models Li Yanhong, Yang Chenghao, Ettinger Allyson
- Making Long-context Language Models Better Multi-hop Reasoners Li Yanyang, Liang Shuo, Lyu Michael R., Wang Liwei
- Backdoorllm: A Comprehensive Benchmark For Backdoor Attacks On Large Language Models Li Yige, Huang Hanxun, Zhao Yunhan, Ma Xingjun, Sun Jun
- A Simple But Effective Approach To Improve Structured Language Model Output For Information Extraction Li Yinghao, Ramprasad Rampi, Zhang Chao
- Data Generation Using Large Language Models For Text Classification: An Empirical Case Study Li Yinheng, Bonatti Rogerio, Abdali Sara, Wagle Justin, Koishida Kazuhito
- Guiding Enumerative Program Synthesis With Large Language Models Li Yixuan, Parsert Julian, Polgreen Elizabeth
- Spin Glass Model Of In-context Learning Li Yuhao, Bai Ruoran, Huang Haiping
- VSTAR: Generative Temporal Nursing For Longer Dynamic Video Synthesis Li Yumeng, Beluch William, Keuper Margret, Zhang Dan, Khoreva Anna
- Text-to-model: Text-conditioned Neural Network Diffusion For Train-once-for-all Personalization Li Zexi, Gao Lingzhi, Wu Chao
- Focusllm: Scaling Llm's Context By Parallel Decoding Li Zhenyu, Zhang Yike, Pan Tengyu, Sun Yutao, Duan Zhichao, Fang Junjie, Han Rong, Wang Zixuan, Wang Jianyong
- Self-instructed Derived Prompt Generation Meets In-context Learning: Unlocking New Potential Of Black-box Llms Li Zhuo, Du Yuhao, Hu Jinpeng, Wan Xiang, Gao Anningzhe
- Effects Of Different Prompts On The Quality Of GPT-4 Responses To Dementia Care Questions Li Zhuochun, Xie Bo, Hilsabeck Robin, Aguirre Alyssa, Zou Ning, Luo Zhimeng, He Daqing
- Implicit In-context Learning Li Zhuowei, Xu Zihao, Han Ligong, Gao Yunhe, Wen Song, Liu Di, Wang Hao, Metaxas Dimitris N.
- 500xcompressor: Generalized Prompt Compression For Large Language Models Li Zongqian, Su Yixuan, Collier Nigel
- What Is The Best Model? Application-driven Evaluation For Large Language Models Lian Shiguo, Zhao Kaikai, Liu Xinhui, Lei Xuejiao, Yang Bikun, Zhang Wenjing, Wang Kai, Liu Zhaoxiang
- Languashrink: Reducing Token Overhead With Psycholinguistics Liang Xuechen, Tao Meiling, Xia Yinghui, Shi Tianyu, Wang Jun, Yang Jingsong
- Controllable Text Generation For Large Language Models: A Survey Liang Xun, Wang Hanyu, Wang Yezhaohui, Song Shichao, Yang Jiawei, Niu Simin, Hu Jie, Liu Dan, Yao Shunyu, Xiong Feiyu, Li Zhiyu
- Why Are My Prompts Leaked? Unraveling Prompt Extraction Threats In Customized Large Language Models Liang Zi, Hu Haibo, Ye Qingqing, Xiao Yaxin, Li Haoyang
- Look Before You Leap: Problem Elaboration Prompting Improves Mathematical Reasoning In Large Language Models Liao Haoran, Tian Jidong, Hu Shaohua, He Hao, Jin Yaohui
- Learning-from-mistakes Prompting For Indigenous Language Translation Liao You-cheng, Yu Chen-jui, Lin Chi-yi, Yun He-feng, Wang Yen-hsiang, Li Hsiao-min, Fan Yao-chung
- Towards Automated Data Sciences With Natural Language And Sagecopilot: Practices And Lessons Learned Liao Yuan, Bian Jiang, Yun Yuhui, Wang Shuo, Zhang Yubo, Chu Jiaming, Wang Tao, Li Kewei, Li Yuchen, Li Xuhong, Ji Shilei, Xiong Haoyi
- E2LLM: Encoder Elongated Large Language Models For Long-context Understanding And Reasoning Liao Zihan, Wang Jun, Yu Hang, Wei Lingxiao, Li Jianguo, Wang Jun, Zhang Wei
- Large Language Models As Recommender Systems: A Study Of Popularity Bias Lichtenberg Jan Malte, Buchholz Alexander, Schwöbel Pola
- Adapting Mental Health Prediction Tasks For Cross-lingual Learning Via Meta-training And In-context Learning With Large Language Model Lifelo Zita, Ning Huansheng, Dhelim Sahraoui
- Tree-based Hard Attention With Self-motivation For Large Language Models Lin Chenxi, Ren Jiayu, He Guoxiu, Jiang Zhuoren, Yu Haiyan, Zhu Xiaomin
- Graph-enhanced Large Language Models In Asynchronous Plan Reasoning Lin Fangru, La Malfa Emanuele, Hofmann Valentin, Yang Elle Michelle, Cohn Anthony, Pierrehumbert Janet B.
- Bita: Bi-directional Tuning For Lossless Acceleration In Large Language Models Lin Feng, Yi Hanling, Li Hongbin, Yang Yifan, Yu Xiaotian, Lu Guangming, Xiao Rong
- Emojicrypt: Prompt Encryption For Secure Communication With Large Language Models Lin Guo, Hua Wenyue, Zhang Yongfeng
- Designprobe: A Graphic Design Benchmark For Multimodal Large Language Models Lin Jieru, Huang Danqing, Zhao Tiejun, Zhan Dechen, Lin Chin-yew
- Single Character Perturbations Break LLM Alignment Lin Leon, Brown Hannah, Kawaguchi Kenji, Shieh Michael
- Mala-500: Massive Language Adaptation Of Large Language Models Lin Peiqin, Ji Shaoxiong, Tiedemann Jörg, Martins André F. T., Schütze Hinrich
- Draw-and-understand: Leveraging Visual Prompts To Enable Mllms To Comprehend What You Want Lin Weifeng, Wei Xinyu, An Ruichuan, Gao Peng, Zou Bocheng, Luo Yulin, Huang Siyuan, Zhang Shanghang, Li Hongsheng
- Diversedialogue: A Methodology For Designing Chatbots With Human-like Diversity Lin Xiaoyu, Yu Xinkai, Aich Ankit, Giorgi Salvatore, Ungar Lyle
- Interpretable User Satisfaction Estimation For Conversational Systems With Large Language Models Lin Ying-chun, Neville Jennifer, Stokes Jack W., Yang Longqi, Safavi Tara, Wan Mengting, Counts Scott, Suri Siddharth, Andersen Reid, Xu Xiaofeng, Gupta Deepak, Jauhar Sujay Kumar, Song Xia, Buscher Georg, Tiwary Saurabh, Hecht Brent, Teevan Jaime
- Rethinking Visual Prompting For Multimodal Large Language Models With External Knowledge Lin Yuanze, Li Yunsheng, Chen Dongdong, Xu Weijian, Clark Ronald, Torr Philip, Yuan Lu
- Contextualized Sequence Likelihood: Enhanced Confidence Scores For Natural Language Generation Lin Zhen, Trivedi Shubhendu, Sun Jimeng
- Evaluating Text-to-visual Generation With Image-to-text Generation Lin Zhiqiu, Pathak Deepak, Li Baiqi, Li Jiayao, Xia Xide, Neubig Graham, Zhang Pengchuan, Ramanan Deva
- Navigating The Dual Facets: A Comprehensive Evaluation Of Sequential Memory Editing In Large Language Models Lin Zihao, Beigi Mohammad, Li Hongxuan, Zhou Yufan, Zhang Yuxiang, Wang Qifan, Yin Wenpeng, Huang Lifu
- Prompt Compression With Context-aware Sentence Encoding For Fast And Improved LLM Inference Liskavets Barys, Ushakov Maxim, Roy Shuvendu, Klibanov Mark, Etemad Ali, Luke Shane
- The Colorful Future Of Llms: Evaluating And Improving Llms As Emotional Supporters For Queer Youth Lissak Shir, Calderon Nitay, Shenkman Geva, Ophir Yaakov, Fruchter Eyal, Klomek Anat Brunstein, Reichart Roi
- Compromising Embodied Agents With Contextual Backdoor Attacks Liu Aishan, Zhou Yuguang, Liu Xianglong, Zhang Tianyuan, Liang Siyuan, Wang Jiakai, Pu Yanjun, Li Tianlin, Zhang Junqi, Zhou Wenbo, Guo Qing, Tao Dacheng
- Direct Large Language Model Alignment Through Self-rewarding Contrastive Prompt Distillation Liu Aiwei, Bai Haoping, Lu Zhiyun, Kong Xiang, Wang Simon, Shan Jiulong, Cao Meng, Wen Lijie
- Is Translation All You Need? A Study On Solving Multilingual Tasks With Large Language Models Liu Chaoqun, Zhang Wenxuan, Zhao Yiran, Luu Anh Tuan, Bing Lidong
- Few-shot Class Incremental Learning With Attention-aware Self-adaptive Prompt Liu Chenxi, Wang Zhenyi, Xiong Tianyi, Chen Ruibo, Wu Yihan, Guo Junfeng, Huang Heng
- Llmembed: Rethinking Lightweight Llm's Genuine Function In Text Classification Liu Chun, Zhang Hongguang, Zhao Kainan, Ju Xinghai, Yang Lin
- An Incomplete Loop: Instruction Inference, Instruction Following, And In-context Learning In Language Models Liu Emmy, Neubig Graham, Andreas Jacob
- Self-reflection Outcome Is Sensitive To Prompt Construction Liu Fengyuan, Aldahoul Nouar, Eady Gregory, Zaki Yasir, Alshebli Bedoor, Rahwan Talal
- Exploring Vulnerabilities And Protections In Large Language Models: A Survey Liu Frank Weizhen, Hu Chenhui
- Logic Agent: Enhancing Validity With Logic Rule Invocation Liu Hanmeng, Teng Zhiyang, Zhang Chaoli, Zhang Yue
- Mibench: Evaluating Multimodal Large Language Models Over Multiple Images Liu Haowei, Zhang Xi, Xu Haiyang, Shi Yaya, Jiang Chaoya, Yan Ming, Zhang Ji, Huang Fei, Yuan Chunfeng, Li Bing, Hu Weiming
- \(se^2\): Sequential Example Selection For In-context Learning Liu Haoyu, Liu Jianfeng, Huang Shaohan, Zhan Yuefeng, Sun Hao, Deng Weiwei, Wei Furu, Zhang Qi
- Laida: Linguistics-aware In-context Learning With Data Augmentation For Metaphor Components Identification Liu Hongde, He Chenyuan, Meng Feiyang, Niu Changyong, Jia Yuxiang
- Unraveling The Mechanics Of Learning-based Demonstration Selection For In-context Learning Liu Hui, Wang Wenya, Sun Hao, Tian Chris Xing, Kong Chenqi, Dong Xin, Li Haoliang
- VP-LLM: Text-driven 3D Volume Completion With Large Language Models Through Patchification Liu Jianmeng, Liu Yichen, Zhang Yuyao, Meng Zeyuan, Tai Yu-wing, Tang Chi-keung
- Tiny Refinements Elicit Resilience: Toward Efficient Prefix-model Against LLM Red-teaming Liu Jiaxu, Yin Xiangyu, Wu Sihao, Wang Jianhong, Fang Meng, Yi Xinping, Huang Xiaowei
- Educating Llms Like Human Students: Structure-aware Injection Of Domain Knowledge Liu Kai, Chen Ze, Fu Zhihang, Jiang Rongxin, Zhou Fan, Chen Yaowu, Wu Yue, Ye Jieping
- LLM4GEN: Leveraging Semantic Representation Of Llms For Text-to-image Generation Liu Mushui, Ma Yuhang, Zhen Yang, Dan Jun, Yu Yunlong, Zhao Zeng, Hu Zhipeng, Liu Bai, Fan Changjie
- Synthetic Context Generation For Question Generation Liu Naiming, Wang Zichao, Baraniuk Richard
- Monotonic Paraphrasing Improves Generalization Of Language Model Prompting Liu Qin, Wang Fei, Xu Nan, Yan Tianyi, Meng Tao, Chen Muhao
- Flashback: Efficient Retrieval-augmented Language Modeling For Long Context Inference Liu Runheng, Xiao Xingchen, Huang Heyan, Chi Zewen, Wu Zhijing
- ST-LLM: Large Language Models Are Effective Temporal Learners Liu Ruyang, Li Chen, Tang Haoran, Ge Yixiao, Shan Ying, Li Ge
- Are Llms Effective Backbones For Fine-tuning? An Experimental Investigation Of Supervised Llms On Chinese Short Text Matching Liu Shulin, Xu Chengcheng, Liu Hao, Yu Tinghao, Yang Tao
- The Generation Gap: Exploring Age Bias In The Underlying Value Systems Of Large Language Models Liu Siyang, Maturi Trish, Yi Bowen, Shen Siqi, Mihalcea Rada
- Multi-group Uncertainty Quantification For Long-form Text Generation Liu Terrance, Wu Zhiwei Steven
- Making Them Ask And Answer: Jailbreaking Large Language Models In Few Queries Via Disguise And Reconstruction Liu Tong, Zhang Yingjie, Zhao Zhe, Dong Yinpeng, Meng Guozhu, Chen Kai
- Demorank: Selecting Effective Demonstrations For Large Language Models In Ranking Task Liu Wenhan, Zhu Yutao, Dou Zhicheng
- Imposter.ai: Adversarial Attacks With Hidden Intentions Towards Aligned Large Language Models Liu Xiao, Li Liangzhi, Xiang Tong, Ye Fuying, Wei Lu, Li Wangyue, Garcia Noa
- Harnessing Large Language Models For Multimodal Product Bundling Liu Xiaohao, Wu Jie, Tao Zhulin, Ma Yunshan, Wei Yinwei, Chua Tat-seng
- An Empirical Analysis On Large Language Models In Debate Evaluation Liu Xinyi, Liu Pinxin, He Hangfeng
- Prompt Framework For Role-playing: Generation And Evaluation Liu Xun, Ni Zhengwei
- Era-cot: Improving Chain-of-thought Through Entity Relationship Analysis Liu Yanming, Peng Xinyue, Du Tianyu, Yin Jianwei, Liu Weihao, Zhang Xuhong
- Arondight: Red Teaming Large Vision Language Models With Auto-generated Multi-modal Jailbreak Prompts Liu Yi, Cai Chengjun, Zhang Xiaoli, Yuan Xingliang, Wang Cong
- Let's Learn Step By Step: Enhancing In-context Learning Ability With Curriculum Learning Liu Yinpeng, Liu Jiawei, Shi Xiang, Cheng Qikai, Huang Yong, Lu Wei
- Hd-eval: Aligning Large Language Model Evaluators Through Hierarchical Criteria Decomposition Liu Yuxuan, Yang Tianchi, Huang Shaohan, Zhang Zihan, Huang Haizhen, Wei Furu, Deng Weiwei, Sun Feng, Zhang Qi
- Agentlite: A Lightweight Library For Building And Advancing Task-oriented LLM Agent System Liu Zhiwei, Yao Weiran, Zhang Jianguo, Yang Liangwei, Liu Zuxin, Tan Juntao, Choubey Prafulla K., Lan Tian, Wu Jason, Wang Huan, Heinecke Shelby, Xiong Caiming, Savarese Silvio
- Fantastic Semantics And Where To Find Them: Investigating Which Layers Of Generative Llms Reflect Lexical Semantics Liu Zhu, Kong Cunliang, Liu Ying, Sun Maosong
- Protecting Your Llms With Information Bottleneck Liu Zichuan, Wang Zefan, Xu Linjie, Wang Jinyu, Song Lei, Wang Tianchun, Chen Chunlin, Cheng Wei, Bian Jiang
- Enabling Weak Llms To Judge Response Reliability Via Meta Ranking Liu Zijun, Kou Boqun, Li Peng, Yan Ming, Zhang Ji, Huang Fei, Liu Yang
- Llms Are Biased Towards Output Formats! Systematically Evaluating And Mitigating Output Format Bias Of Llms Long Do Xuan, Ngoc Hai Nguyen, Sim Tiviatis, Dao Hieu, Joty Shafiq, Kawaguchi Kenji, Chen Nancy F., Kan Min-yen
- Large Language Models Know What Makes Exemplary Contexts Long Quanyu, Chen Jianda, Wang Wenya, Pan Sinno Jialin
- Does In-context Learning Really Learn? Rethinking How Large Language Models Respond And Solve Tasks Via In-context Learning Long Quanyu, Wu Yin, Wang Wenya, Pan Sinno Jialin
- Personalitychat: Conversation Distillation For Personalized Dialog Modeling With Facts And Traits Lotfi Ehsan, De Bruyn Maxime, Buhmann Jeska, Daelemans Walter
- Roleplay-doh: Enabling Domain-experts To Create Llm-simulated Patients Via Eliciting And Adhering To Principles Louie Ryan, Nandi Ananjan, Fang William, Chang Cheng, Brunskill Emma, Yang Diyi
- Efficient Encoder-decoder Transformer Decoding For Decomposable Tasks Lu Bo-ru, Haduong Nikita, Lin Chien-yu, Cheng Hao, Smith Noah A., Ostendorf Mari
- Investigating Bias Representations In Llama 2 Chat Via Activation Steering Lu Dawn, Rimsky Nina
- Toolsandbox: A Stateful, Conversational, Interactive Evaluation Benchmark For LLM Tool Use Capabilities Lu Jiarui, Holleis Thomas, Zhang Yizhe, Aumayer Bernhard, Nan Feng, Bai Felix, Ma Shuang, Ma Shen, Li Mengyu, Yin Guoli, Wang Zirui, Pang Ruoming
- FIPO: Free-form Instruction-oriented Prompt Optimization With Preference Dataset And Modular Fine-tuning Schema Lu Junru, An Siyu, Zhang Min, He Yulan, Yin Di, Sun Xing
- Multimodal Large Language Model Driven Scenario Testing For Autonomous Vehicles Lu Qiujing, Wang Xuanhan, Jiang Yiwei, Zhao Guangming, Ma Mingyue, Feng Shuo
- Fine-tuning Large Language Models For Domain Adaptation: Exploration Of Training Strategies, Scaling, Model Merging And Synergistic Capabilities Lu Wei, Luu Rachel K., Buehler Markus J.
- Generative Students: Using Llm-simulated Student Profiles To Support Question Item Evaluation Lu Xinyi, Wang Xu
- Asymptotic Theory Of In-context Learning By Linear Attention Lu Yue M., Letey Mary I., Zavatone-veth Jacob A., Maiti Anindita, Pehlevan Cengiz
- Prompt Tuning As User Inherent Profile Inference Machine Lu Yusheng, Du Zhaocheng, Li Xiangyang, Zhao Xiangyu, Liu Weiwen, Wang Yichao, Guo Huifeng, Tang Ruiming, Dong Zhenhua, Duan Yongrui
- To Err Is Human, But Llamas Can Learn It Too Luhtaru Agnes, Purason Taido, Vainikko Martin, Del Maksym, Fishel Mark
- Crosstune: Black-box Few-shot Classification With Label Enhancement Luo Danqing, Zhang Chen, Zhang Yan, Li Haizhou
- End-to-end Neuro-symbolic Reinforcement Learning With Textual Explanations Luo Lirui, Zhang Guoxi, Xu Hongming, Yang Yaodong, Fang Cong, Li Qing
- DEEM: Diffusion Models Serve As The Eyes Of Large Language Models For Image Perception Luo Run, Li Yunshui, Chen Longze, He Wanwei, Lin Ting-en, Liu Ziqiang, Zhang Lei, Song Zikai, Xia Xiaobo, Liu Tongliang, Yang Min, Hui Binyuan
- Zero-shot Cross-domain Dialogue State Tracking Via Dual Low-rank Adaptation Luo Xiang, Tang Zhiwen, Wang Jin, Zhang Xuejie
- Knowla: Enhancing Parameter-efficient Finetuning With Knowledgeable Adaptation Luo Xindi, Sun Zequn, Zhao Jing, Zhao Zhe, Hu Wei
- Realistic Evaluation Of Toxicity In Large Language Models Luong Tinh Son, Le Thanh-thien, Van Linh Ngo, Nguyen Thien Huu
- WILBUR: Adaptive In-context Learning For Robust And Accurate Web Agents Lutz Michael, Bohra Arth, Saroyan Manvel, Harutyunyan Artem, Campagna Giovanni
- Interpreting Key Mechanisms Of Factual Recall In Transformer-based Language Models Lv Ang, Chen Yuhan, Zhang Kaiyi, Wang Yulong, Liu Lifeng, Wen Ji-rong, Xie Jian, Yan Rui
- GPT-4 Generated Narratives Of Life Events Using A Structured Narrative Prompt: A Validation Study Lynch Christopher J., Jensen Erik, Munro Madison H., Zamponi Virginia, Martinez Joseph, O'brien Kevin, Feldhaus Brandon, Smith Katherine, Reinhold Ann Marie, Gore Ross
- X-reflect: Cross-reflection Prompting For Multimodal Recommendation Lyu Hanjia, Rossi Ryan, Chen Xiang, Tanjim Md Mehrab, Petrangeli Stefano, Sarkhel Somdeb, Luo Jiebo
- Keeping Llms Aligned After Fine-tuning: The Crucial Role Of Prompt Templates Lyu Kaifeng, Zhao Haoyu, Gu Xinran, Yu Dingli, Goyal Anirudh, Arora Sanjeev
- Automatic Summarization Of Doctor-patient Encounter Dialogues Using Large Language Model Through Prompt Tuning Lyu Mengxian, Peng Cheng, Li Xiaohan, Balian Patrick, Bian Jiang, Wu Yonghui
- Dempt: Decoding-enhanced Multi-phase Prompt Tuning For Making Llms Be Better Context-aware Translators Lyu Xinglin, Li Junhui, Zhao Yanqing, Zhang Min, Wei Daimeng, Tao Shimin, Yang Hao, Zhang Min
- Retrieve-plan-generation: An Iterative Planning And Answering Framework For Knowledge-intensive LLM Generation Lyu Yuanjie, Niu Zihan, Xie Zheyong, Zhang Chao, Xu Tong, Wang Yang, Chen Enhong
- When Emotional Stimuli Meet Prompt Designing: An Auto-prompt Graphical Paradigm Ma Chenggian, Zhao Xiangyu, Zhang Chunhui, Qin Yanzhao, Zhang Wentao
- Exploring The Role Of Transliteration In In-context Learning For Low-resource Languages Written In Non-latin Scripts Ma Chunlan, Liu Yihong, Ye Haotian, Schütze Hinrich
- Large Language Models Are Unconscious Of Unreasonability In Math Problems Ma Jingyuan, Dai Damai, Sha Lei, Sui Zhifang
- Eventlens: Leveraging Event-aware Pretraining And Cross-modal Linking Enhances Visual Commonsense Reasoning Ma Mingjie, Yu Zhihuan, Ma Yichao, Li Guohui
- Are Large Language Models Good Prompt Optimizers? Ma Ruotian, Wang Xiaolei, Zhou Xin, Li Jian, Du Nan, Gui Tao, Zhang Qi, Huang Xuanjing
- Making Pre-trained Language Models Better Continual Few-shot Relation Extractors Ma Shengkun, Han Jiale, Liang Yi, Cheng Bo
- Clawmachine: Fetching Visual Tokens As An Entity For Referring And Grounding Ma Tianren, Xie Lingxi, Tian Yunjie, Yang Boyu, Zhang Yuan, Doermann David, Ye Qixiang
- Is It Possible To Edit Large Language Models Robustly? Ma Xinbei, Ju Tianjie, Qiu Jiyang, Zhang Zhuosheng, Zhao Hai, Liu Lifeng, Wang Yulong
- Compositional API Recommendation For Library-oriented Code Generation Ma Zexiong, An Shengnan, Xie Bing, Lin Zeqi
- Llamoco: Instruction Tuning Of Large Language Models For Optimization Code Generation Ma Zeyuan, Guo Hongshu, Chen Jiacheng, Peng Guojun, Cao Zhiguang, Ma Yining, Gong Yue-jiao
- Gerea: Question-aware Prompt Captions For Knowledge-based Visual Question Answering Ma Ziyu, Li Shutao, Sun Bin, Cai Jianfei, Long Zuxiang, Ma Fuyan
- Exploring The Impact Of The Output Format On The Evaluation Of Large Language Models For Code Translation Macedo Marcos, Tian Yuan, Cogo Filipe R., Adams Bram
- LLM In-context Recall Is Prompt Dependent Machlab Daniel, Battle Rick
- Steering Conversational Large Language Models For Long Emotional Support Conversations Madani Navid, Saha Sougata, Srihari Rohini
- Explaining Genetic Programming Trees Using Large Language Models Maddigan Paula, Lensen Andrew, Xue Bing
- Do Llms Know When To NOT Answer? Investigating Abstention Abilities Of Large Language Models Madhusudhan Nishanth, Madhusudhan Sathwik Tejaswi, Yadav Vikas, Hashemi Masoud
- Leveraging Environment Interaction For Automated PDDL Generation And Planning With Large Language Models Mahdavi Sadegh, Aoki Raquel, Tang Keyi, Cao Yanshuai
- Venn Diagram Prompting: Accelerating Comprehension With Scaffolding Effect Mahendru Sakshi, Pandit Tejul
- Too Late To Train, Too Early To Use? A Study On Necessity And Viability Of Low-resource Bengali Llms Mahfuz Tamzeed, Dey Satak Kumar, Naswan Ruwad, Adil Hasnaen, Sayeed Khondker Salman, Shahgir Haz Sameen
- Rephrasing The Web: A Recipe For Compute And Data-efficient Language Modeling Maini Pratyush, Seto Skyler, Bai He, Grangier David, Zhang Yizhe, Jaitly Navdeep
- Exploring The Capabilities Of Prompted Large Language Models In Educational And Assessment Applications Maity Subhankar, Deroy Aniket, Sarkar Sudeshna
- A Novel Multi-stage Prompting Approach For Language Agnostic MCQ Generation Using GPT Maity Subhankar, Deroy Aniket, Sarkar Sudeshna
- Fairylandai: Personalized Fairy Tales Utilizing Chatgpt And DALLE-3 Makridis Georgios, Oikonomou Athanasios, Koukos Vasileios
- From Tarzan To Tolkien: Controlling The Language Proficiency Level Of Llms For Content Generation Malik Ali, Mayhew Stephen, Piech Chris, Bicknell Klinton
- Can Large Language Models Automatically Score Proficiency Of Written Essays? Mansour Watheq, Albatarni Salam, Eltanbouly Sohaila, Elsayed Tamer
- A Data Generation Perspective To The Mechanism Of In-context Learning Mao Haitao, Liu Guangliang, Ma Yao, Wang Rongrong, Johnson Kristen, Tang Jiliang
- Reinforced Prompt Personalization For Recommendation With Large Language Models Mao Wenyu, Wu Jiancan, Chen Weijian, Gao Chongming, Wang Xiang, He Xiangnan
- How Does Quantization Affect Multilingual Llms? Marchisio Kelly, Dash Saurabh, Chen Hongyu, Aumiller Dennis, Üstün Ahmet, Hooker Sara, Ruder Sebastian
- Understanding And Mitigating Language Confusion In Llms Marchisio Kelly, Ko Wei-yin, Bérard Alexandre, Dehaze Théo, Ruder Sebastian
- German Also Hallucinates! Inconsistency Detection In News Summaries With The Absinth Dataset Mascarell Laura, Chalumattu Ribin, Rios Annette
- Pack Of Llms: Model Fusion At Test-time Via Perplexity Optimization Mavromatis Costas, Karypis Petros, Karypis George
- Can Opensource Beat Chatgpt? -- A Comparative Study Of Large Language Models For Text-to-code Generation Mayer Luis, Heumann Christian, Aßenmacher Matthias
- Improving Text-to-image Consistency Via Automatic Prompt Optimization Mañas Oscar, Astolfi Pietro, Hall Melissa, Ross Candace, Urbanek Jack, Williams Adina, Agrawal Aishwarya, Romero-soriano Adriana, Drozdzal Michal
- Leveraging Prompts In Llms To Overcome Imbalances In Complex Educational Text Data Mcclure Jeanne, Shimmei Machi, Matsuda Noboru, Jiang Shiyan
- Mimetic Poet Mccormack Jon, Wilson Elliott, Rajcic Nina, Llano Maria Teresa
- Generative Artificial Intelligence In Higher Education: Evidence From An Analysis Of Institutional Policies And Guidelines Mcdonald Nora, Johri Aditya, Ali Areej, Hingle Aayushi
- Set-based Prompting: Provably Solving The Language Model Order Dependency Problem Mcilroy-young Reid, Brown Katrina, Olson Conlan, Zhang Linjun, Dwork Cynthia
- Inadequacies Of Large Language Model Benchmarks In The Era Of Generative Artificial Intelligence Mcintosh Timothy R., Susnjak Teo, Liu Tong, Watters Paul, Halgamuge Malka N.
- MM1: Methods, Analysis & Insights From Multimodal LLM Pre-training Mckinzie Brandon, Gan Zhe, Fauconnier Jean-philippe, Dodge Sam, Zhang Bowen, Dufter Philipp, Shah Dhruti, Du Xianzhi, Peng Futang, Weers Floris, Belyi Anton, Zhang Haotian, Singh Karanjeet, Kang Doug, Jain Ankur, Hè Hongyu, Schwarzer Max, Gunter Tom, Kong Xiang, Zhang Aonan, Wang Jianyu, Wang Chong, Du Nan, Lei Tao, Wiseman Sam, Yin Guoli, Lee Mark, Wang Zirui, Pang Ruoming, Grasch Peter, Toshev Alexander, Yang Yinfei
- Large Language Models For Relevance Judgment In Product Search Mehrdad Navid, Mohapatra Hrushikesh, Bagdouri Mossaab, Chandran Prijith, Magnani Alessandro, Cai Xunfan, Puthenputhussery Ajit, Yadav Sachin, Lee Tony, Zhai Chengxiang, Liao Ciya
- Enhancing Creativity In Large Language Models Through Associative Thinking Strategies Mehrotra Pronita, Parab Aishni, Gulwani Sumit
- "not Aligned" Is Not "malicious": Being Careful About Hallucinations Of Large Language Models' Jailbreak Mei Lingrui, Liu Shenghua, Wang Yiwei, Bi Baolong, Mao Jiayi, Cheng Xueqi
- Towards Human Understanding Of Paraphrase Types In Chatgpt Meier Dominik, Wahle Jan Philip, Ruas Terry, Gipp Bela
- Phybench: A Physical Commonsense Benchmark For Evaluating Text-to-image Models Meng Fanqing, Shao Wenqi, Luo Lixin, Wang Yahong, Chen Yiran, Lu Quanfeng, Yang Yue, Yang Tianshuo, Zhang Kaipeng, Qiao Yu, Luo Ping
- Linearizing Large Language Models Mercat Jean, Vasiljevic Igor, Keh Sedrick, Arora Kushal, Dave Achal, Gaidon Adrien, Kollar Thomas
- Superposition Prompting: Improving And Accelerating Retrieval-augmented Generation Merth Thomas, Fu Qichen, Rastegari Mohammad, Najibi Mahyar
- Low-resource Machine Translation Through Retrieval-augmented LLM Prompting: A Study On The Mambai Language Merx Raphaël, Mahmudi Aso, Langford Katrina, De Araujo Leo Alberto, Vylomova Ekaterina
- A Comparison Of LLM Finetuning Methods & Evaluation Metrics With Travel Chatbot Use Case Meyer Sonia, Singh Shreya, Tam Bertha, Ton Christopher, Ren Angel
- Subject-driven Text-to-image Generation Via Preference-based Reinforcement Learning Miao Yanting, Loh William, Kothawade Suraj, Poupart Pascal, Rashwan Abdullah, Li Yeqing
- Is English The New Programming Language? How About Pseudo-code Engineering? Michaelsen Gian Alexandre, Santos Renato P. Dos
- Smart Language Agents In Real-world Planning Miin Annabelle, Wei Timothy
- Natural Language As Policies: Reasoning For Coordinate-level Embodied Control With Llms Mikami Yusuke, Melnik Andrew, Miura Jun, Hautamäki Ville
- Multimodal Contrastive In-context Learning Miyanishi Yosuke, Nguyen Minh Le
- LSPT: Long-term Spatial Prompt Tuning For Visual Representation Learning Mo Shentong, Wang Yansen, Luo Xufang, Li Dongsheng
- Fight Back Against Jailbreaking Via Prompt Adversarial Tuning Mo Yichuan, Wang Yuji, Wei Zeming, Wang Yisen
- E2TP: Element To Tuple Prompting Improves Aspect Sentiment Tuple Prediction Mohammadkhani Mohammad Ghiasvand, Ranjbar Niloofar, Momtazi Saeedeh
- Routoo: Learning To Route To Large Language Models Effectively Mohammadshahi Alireza, Shaikh Arshad Rafiq, Yazdani Majid
- Llava-chef: A Multi-modal Generative Model For Food Recipes Mohbat Fnu, Zaki Mohammed J.
- Can We Trust Large Language Models Generated Code? A Framework For In-context Learning, Security Patterns, And Code Evaluations Across Diverse Llms Mohsin Ahmad, Janicke Helge, Wood Adrian, Sarker Iqbal H., Maglaras Leandros, Janjua Naeem
- Llm-based Frameworks For API Argument Filling In Task-oriented Conversational Systems Mok Jisoo, Kachuee Mohammad, Dai Shuyang, Ray Shayan, Taghavi Tara, Yoon Sungroh
- Do Large Language Models Exhibit Cognitive Dissonance? Studying The Difference Between Revealed Beliefs And Stated Answers Mondal Manuel, Dolamic Ljiljana, Bovet Gérôme, Cudré-mauroux Philippe, Audiffren Julien
- Xc-cache: Cross-attending To Cached Context For Efficient LLM Inference Monteiro João, Marcotte Étienne, Noël Pierre-andré, Zantedeschi Valentina, Vázquez David, Chapados Nicolas, Pal Christopher, Taslakian Perouz
- The Base-rate Effect On LLM Benchmark Performance: Disambiguating Test-taking Strategies From Benchmark Performance Moore Kyle, Roberts Jesse, Pham Thao, Ewaleifoh Oseremhen, Fisher Doug
- Exploring The Landscape Of Large Language Models: Foundations, Techniques, And Challenges Moradi Milad, Yan Ke, Colwell David, Samwald Matthias, Asgari Rhona
- Langbite: A Platform For Testing Bias In Large Language Models Morales Sergio, Clarisó Robert, Cabot Jordi
- Language Modelling Approaches To Adaptive Machine Translation Moslem Yasmin
- Large Language Models Are Parallel Multilingual Learners Mu Yongyu, Feng Peinan, Cao Zhiquan, Wu Yuzhang, Li Bei, Wang Chenglong, Xiao Tong, Song Kai, Liu Tongran, Zhang Chunliang, Zhu Jingbo
- Are Large Language Models Actually Good At Text Style Transfer? Mukherjee Sourabrata, Ojha Atul Kr., Dušek Ondřej
- Towards Robots That Know When They Need Help: Affordance-based Uncertainty For Large Language Model Planners Mullen James F. Jr., Manocha Dinesh
- Evaluating The Evaluator: Measuring Llms' Adherence To Task Evaluation Instructions Murugadoss Bhuvanashree, Poelitz Christian, Drosos Ian, Le Vu, Mckenna Nick, Negreanu Carina Suzana, Parnin Chris, Sarkar Advait
- Benchmarking Llama2, Mistral, Gemma And GPT For Factuality, Toxicity, Bias And Propensity For Hallucinations Nadeau David, Kroutikov Mike, Mcneil Karen, Baribeau Simon
- Negation Blindness In Large Language Models: Unveiling The NO Syndrome In Image Generation Nadeem Mohammad, Sohail Shahab Saquib, Cambria Erik, Schuller Björn W., Hussain Amir
- Learning Vs Retrieval: The Role Of In-context Examples In Regression With Llms Nafar Aliakbar, Venable Kristen Brent, Kordjamshidi Parisa
- Zero-shot Visual Reasoning By Vision-language Models: Benchmarking And Analysis Nagar Aishik, Jaiswal Shantanu, Tan Cheston
- Pii-compass: Guiding LLM Training Data Extraction Prompts Towards The Target PII Via Grounding Nakka Krishna Kanth, Frikha Ahmed, Mendes Ricardo, Jiang Xue, Zhou Xuebing
- Gemquad: Generating Multilingual Question Answering Datasets From Large Language Models Using Few Shot Learning Namboori Amani, Mangale Shivam, Rosenbaum Andy, Soltan Saleh
- Apprentices To Research Assistants: Advancing Research With Large Language Models Namvarpour M., Razi A.
- Concise Thoughts: Impact Of Output Length On LLM Reasoning And Cost Nayab Sania, Rossolini Giulio, Buttazzo Giorgio, Manes Nicolamaria, Giacomelli Fabrizio
- Interpreting Context Look-ups In Transformers: Investigating Attention-mlp Interactions Neo Clement, Cohen Shay B., Barez Fazl
- Contextual AI Journaling: Integrating LLM And Time Series Behavioral Sensing Technology To Promote Self-reflection And Well-being Using The Mindscape App Nepal Subigya, Pillai Arvind, Campbell William, Massachi Talie, Choi Eunsol Soul, Xu Orson, Kuc Joanna, Huckins Jeremy, Holden Jason, Depp Colin, Jacobson Nicholas, Czerwinski Mary, Granholm Eric, Campbell Andrew T.
- MBBQ: A Dataset For Cross-lingual Comparison Of Stereotypes In Generative Llms Neplenbroek Vera, Bisazza Arianna, Fernández Raquel
- Towards Modeling Learner Performance With Large Language Models Neshaei Seyed Parsa, Davis Richard Lee, Hazimeh Adam, Lazarevski Bojan, Dillenbourg Pierre, Käser Tanja
- Using Grammar Masking To Ensure Syntactic Validity In Llm-based Modeling Tasks Netz Lukas, Reimer Jan, Rumpe Bernhard
- Medinsight: A Multi-source Context Augmentation Framework For Generating Patient-centric Medical Responses Using Large Language Models Neupane Subash, Mitra Shaswata, Mittal Sudip, Golilarz Noorbakhsh Amiri, Rahimi Shahram, Amirlatifi Amin
- Educational Personalized Learning Path Planning With Large Language Models Ng Chee, Fung Yuen
- Examining The Influence Of Political Bias On Large Language Model Performance In Stance Classification Ng Lynnette Hui Xian, Cruickshank Iain, Lee Roy Ka-wei
- When Is The Consistent Prediction Likely To Be A Correct Prediction? Nguyen Alex, Mekala Dheeraj, Dong Chengyu, Shang Jingbo
- Dude: Dual Distribution-aware Context Prompt Learning For Large Vision-language Model Nguyen Duy M. H., Le An T., Nguyen Trung Q., Diep Nghiem T., Nguyen Tai, Duong-tran Duy, Peters Jan, Shen Li, Niepert Mathias, Sonntag Daniel
- Gpts And Language Barrier: A Cross-lingual Legal QA Examination Nguyen Ha-thanh, Yamada Hiroaki, Satoh Ken
- Direct Evaluation Of Chain-of-thought In Multi-hop Reasoning With Knowledge Graphs Nguyen Minh-vuong, Luo Linhao, Shiri Fatemeh, Phung Dinh, Li Yuan-fang, Vu Thuy-trang, Haffari Gholamreza
- Cultural Commonsense Knowledge For Intercultural Dialogues Nguyen Tuan-phong, Razniewski Simon, Weikum Gerhard
- Ceval: A Benchmark For Evaluating Counterfactual Text Generation Nguyen Van Bach, Schlötterer Jörg, Seifert Christin
- Aligners: Decoupling Llms And Alignment Ngweta Lilian, Agarwal Mayank, Maity Subha, Gittens Alex, Sun Yuekai, Yurochkin Mikhail
- Xwin-lm: Strong And Scalable Alignment Practice For Llms Ni Bolin, Hu Jingcheng, Wei Yixuan, Peng Houwen, Zhang Zheng, Meng Gaofeng, Hu Han
- BMIKE-53: Investigating Cross-lingual Knowledge Editing With In-context Learning Nie Ercong, Shao Bo, Ding Zifeng, Wang Mingyang, Schmid Helmut, Schütze Hinrich
- Decomposed Prompting: Unveiling Multilingual Linguistic Structure Knowledge In English-centric Large Language Models Nie Ercong, Yuan Shuzhou, Ma Bolei, Schmid Helmut, Färber Michael, Kreuter Frauke, Schütze Hinrich
- WTU-EVAL: A Whether-or-not Tool Usage Evaluation Benchmark For Large Language Models Ning Kangyun, Su Yisong, Lv Xueqiang, Zhang Yuanzhe, Liu Jian, Liu Kang, Xu Jinan
- User-llm: Efficient LLM Contextualization With User Embeddings Ning Lin, Liu Luyang, Wu Jiaxing, Wu Neo, Berlowitz Devora, Prakash Sushant, Green Bradley, O'banion Shawn, Xie Jun
- Can Llms Learn By Teaching? A Preliminary Study Ning Xuefei, Wang Zifu, Li Shiyao, Lin Zinan, Yao Peiran, Fu Tianyu, Blaschko Matthew B., Dai Guohao, Yang Huazhong, Wang Yu
- Role-play Zero-shot Prompting With Large Language Models For Open-domain Human-machine Conversation Njifenjou Ahmed, Sucal Virgile, Jabaian Bassam, Lefèvre Fabrice
- Robust Adaptation Of Foundation Models With Black-box Visual Prompting Oh Changdae, Seo Gyeongdeok, Jung Geunyoung, Cheng Zhi-qi, Choi Hosik, Jung Jiyoung, Song Kyungwoo
- Likelihood-based Mitigation Of Evaluation Bias In Large Language Models Ohi Masanari, Kaneko Masahiro, Koike Ryuto, Loem Mengsay, Okazaki Naoaki
- Gpt-ology, Computational Models, Silicon Sampling: How Should We Think About Llms In Cognitive Science? Ong Desmond C.
- Large Language Models Are Self-taught Reasoners: Enhancing LLM Applications Via Tailored Problem-solving Demonstrations Ong Kai Tzu-iunn, Kwon Taeyoon, Yeo Jinyoung
- Prompting Large Vision-language Models For Compositional Reasoning Ossowski Timothy, Jiang Ming, Hu Junjie
- Easyinstruct: An Easy-to-use Instruction Processing Framework For Large Language Models Ou Yixin, Zhang Ningyu, Gui Honghao, Xu Ziwen, Qiao Shuofei, Xue Yida, Fang Runnan, Liu Kangwei, Li Lei, Bi Zhen, Zheng Guozhou, Chen Huajun
- Multilingual Prompts In Llm-based Recommenders: Performance Across Languages Ozsoy Makbule Gulcin
- NLP At UC Santa Cruz At Semeval-2024 Task 5: Legal Answer Validation Using Few-shot Multi-choice QA Pahilajani Anish, Jain Samyak Rajesh, Trivedi Devasha
- Training Llms To Recognize Hedges In Spontaneous Narratives Paige Amie J., Soubki Adil, Murzaku John, Rambow Owen, Brennan Susan E.
- Gemini Goes To Med School: Exploring The Capabilities Of Multimodal Large Language Models On Medical Challenge Problems & Hallucinations Pal Ankit, Sankarasubbu Malaikannan
- Investigating LLM Applications In E-commerce Palen-michel Chester, Wang Ruixiang, Zhang Yipeng, Yu David, Xu Canran, Wu Zhe
- Dynathink: Fast Or Slow? A Dynamic Decision-making Framework For Large Language Models Pan Jiabao, Zhang Yan, Zhang Chen, Liu Zuozhu, Wang Hongwei, Li Haizhou
- Auto-encoding Morph-tokens For Multimodal LLM Pan Kaihang, Tang Siliang, Li Juncheng, Fan Zhaoyu, Chow Wei, Yan Shuicheng, Chua Tat-seng, Zhuang Yueting, Zhang Hanwang
- POMP: Probability-driven Meta-graph Prompter For Llms In Low-resource Unsupervised Neural Machine Translation Pan Shilong, Tian Zhiliang, Ding Liang, Huang Zhen, Wen Zhihua, Li Dongsheng
- Assessing AI Detectors In Identifying Ai-generated Code: Implications For Education Pan Wei Hung, Chok Ming Jie, Wong Jonathan Leong Shan, Shin Yung Xin, Poon Yeong Shian, Yang Zhou, Chong Chun Yong, Lo David, Lim Mei Kuan
- Enhancing Repository-level Code Generation With Integrated Contextual Information Pan Zhiyuan, Hu Xing, Xia Xin, Yang Xiaohu
- Llmlingua-2: Data Distillation For Efficient And Faithful Task-agnostic Prompt Compression Pan Zhuoshi, Wu Qianhui, Jiang Huiqiang, Xia Menglin, Luo Xufang, Zhang Jue, Lin Qingwei, Rühle Victor, Yang Yuqing, Lin Chin-yew, Zhao H. Vicky, Qiu Lili, Zhang Dongmei
- Attndreambooth: Towards Text-aligned Personalized Text-to-image Generation Pang Lianyu, Yin Jian, Zhao Baoquan, Wu Feize, Wang Fu Lee, Li Qing, Mao Xudong
- Lost In Space: Probing Fine-grained Spatial Understanding In Vision And Language Resamplers Pantazopoulos Georgios, Suglia Alessandro, Lemon Oliver, Eshghi Arash
- Picturing Ambiguity: A Visual Twist On The Winograd Schema Challenge Park Brendan, Janecek Madeline, Ezzati-jivan Naser, Li Yifeng, Emami Ali
- Pragmatic Competence Evaluation Of Large Language Models For Korean Park Dojun, Lee Jiwoo, Jeong Hyeyun, Park Seohyun, Lee Sungeun
- Llms For Enhanced Agricultural Meteorological Recommendations Park Ji-jun, Choi Soo-joon
- M4CXR: Exploring Multi-task Potentials Of Multi-modal Large Language Models For Chest X-ray Interpretation Park Jonggwon, Kim Soobum, Yoon Byungmu, Hyun Jihun, Choi Kyoyun
- Offsetbias: Leveraging Debiased Data For Tuning Evaluators Park Junsoo, Jwa Seungyeon, Ren Meiying, Kim Daeyoung, Choi Sanghyuk
- Pre-trained Vision And Language Transformers Are Few-shot Incremental Learners Park Keon-hee, Song Kyungwoo, Park Gyeong-moon
- Textboost: Towards One-shot Personalization Of Text-to-image Models Via Fine-tuning Text Encoder Park Nahyeon, Kim Kunhee, Shim Hyunjung
- Can Separators Improve Chain-of-thought Prompting? Park Yoonjeong, Kim Hyunjin, Choi Chanyeol, Kim Junseong, Sohn Jy-yong
- Logicbench: Towards Systematic Evaluation Of Logical Reasoning Ability Of Large Language Models Parmar Mihir, Patel Nisarg, Varshney Neeraj, Nakamura Mutsumi, Luo Man, Mashetty Santosh, Mitra Arindam, Baral Chitta
- Evidence To Generate (E2G): A Single-agent Two-step Prompting For Context Grounded And Retrieval Augmented Reasoning Parvez Md Rizwan
- Large Language Models Can Self-improve At Web Agent Tasks Patel Ajay, Hofmarcher Markus, Leoveanu-condrei Claudiu, Dinu Marius-constantin, Callison-burch Chris, Hochreiter Sepp
- Measuring And Benchmarking Large Language Models' Capabilities To Generate Persuasive Language Pauli Amalie Brogaard, Augenstein Isabelle, Assent Ira
- Advprompter: Fast Adaptive Adversarial Prompting For Llms Paulus Anselm, Zharmagambetov Arman, Guo Chuan, Amos Brandon, Tian Yuandong
- The Effectiveness Of Llms As Annotators: A Comparative Overview And Empirical Analysis Of Direct Representation Pavlovic Maja, Poesio Massimo
- Securing Large Language Models: Addressing Bias, Misinformation, And Prompt Attacks Peng Benji, Chen Keyu, Li Ming, Feng Pohsun, Bi Ziqian, Liu Junyu, Niu Qian
- Revisiting Demonstration Selection Strategies In In-context Learning Peng Keqin, Ding Liang, Yuan Yancheng, Liu Xuebo, Zhang Min, Ouyang Yuanxin, Tao Dacheng
- Humaneval-xl: A Multilingual Code Generation Benchmark For Cross-lingual Natural Language Generalization Peng Qiwei, Chai Yekun, Li Xuhong
- Review-llm: Harnessing Large Language Models For Personalized Review Generation Peng Qiyao, Liu Hongtao, Xu Hongyan, Yang Qing, Shao Minglai, Wang Wenjun
- Uncertainty-aware Explainable Recommendation With Large Language Models Peng Yicui, Chen Hao, Lin Chingsheng, Huang Guo, Hu Jinrong, Guo Hui, Kong Bin, Hu Shu, Wu Xi, Wang Xin
- Learnable In-context Vector For Visual Question Answering Peng Yingzhe, Hao Chenduo, Yang Xu, Peng Jiawei, Hu Xinting, Geng Xin
- Dreambench++: A Human-aligned Benchmark For Personalized Image Generation Peng Yuang, Cui Yuxin, Tang Haomiao, Qi Zekun, Dong Runpei, Bai Jing, Han Chunrui, Ge Zheng, Zhang Xiangyu, Xia Shu-tao
- Q-PEFT: Query-dependent Parameter Efficient Fine-tuning For Text Reranking With Large Language Models Peng Zhiyuan, Wu Xuyang, Wang Qifan, Rajanala Sravanthi, Fang Yi
- Analyzing Large Language Models Chatbots: An Experimental Approach Using A Probability Test Peruchini Melise, Teixeira Julio Monteiro
- Large Language Models Can Infer Personality From Free-form User Interactions Peters Heinrich, Cerf Moran, Matz Sandra C.
- Constitutionalexperts: Training A Mixture Of Principle-based Prompts Petridis Savvas, Wedin Ben, Yuan Ann, Wexler James, Thain Nithum
- Prompting A Pretrained Transformer Can Be A Universal Approximator Petrov Aleksandar, Torr Philip H. S., Bibi Adel
- Assessing The Emergent Symbolic Reasoning Abilities Of Llama Large Language Models Petruzzellis Flavio, Testolin Alberto, Sperduti Alessandro
- Benchmarking GPT-4 On Algorithmic Problems: A Systematic Evaluation Of Prompting Strategies Petruzzellis Flavio, Testolin Alberto, Sperduti Alessandro
- Optimizing Large Language Models For Openapi Code Completion Petryshyn Bohdan, Lukoševičius Mantas
- Towards Reliable Medical Question Answering: Techniques And Challenges In Mitigating Hallucinations In Language Models Pham Duy Khoa, Vo Bao Quoc
- Can Base Chatgpt Be Used For Forecasting Without Additional Optimization? Pham Van, Cunningham Scott
- Distillation Contrastive Decoding: Improving Llms Reasoning With Contrastive Decoding And Distillation Phan Phuc, Tran Hieu, Phan Long
- Sign Of The Times: Evaluating The Use Of Large Language Models For Idiomaticity Detection Phelps Dylan, Pickard Thomas, Mi Maggie, Gow-smith Edward, Villavicencio Aline
- Peering Into The Mind Of Language Models: An Approach For Attribution In Contextual Question Answering Phukan Anirudh, Somasundaram Shwetha, Saxena Apoorv, Goswami Koustava, Srinivasan Balaji Vasan
- Llms Can Learn Self-restraint Through Iterative Self-reflection Piché Alexandre, Milios Aristides, Bahdanau Dzmitry, Pal Chris
- Enhancing Gender-inclusive Machine Translation With Neomorphemes And Large Language Models Piergentili Andrea, Savoldi Beatrice, Negri Matteo, Bentivogli Luisa
- Doccgen: Document-based Controlled Code Generation Pimparkhede Sameer, Kammakomati Mehant, Tamilselvam Srikanth, Kumar Prince, Kumar Ashok Pon, Bhattacharyya Pushpak
- Revisiting Verilogeval: Newer Llms, In-context Learning, And Specification-to-rtl Tasks Pinckney Nathaniel, Batten Christopher, Liu Mingjie, Ren Haoxing, Khailany Brucek
- Reasoning With Large Language Models, A Survey Plaat Aske, Wong Annie, Verberne Suzan, Broekens Joost, Van Stein Niki, Back Thomas
- V-RECS, A Low-cost LLM4VIS Recommender With Explanations, Captioning And Suggestions Podo Luca, Angelini Marco, Velardi Paola
- Evaluation Of Language Models In The Medical Context Under Resource-constrained Settings Posada Andrea, Rueckert Daniel, Meissen Felix, Müller Philip
- Machine Translation With Large Language Models: Prompt Engineering For Persian, English, And Russian Directions Pourkamali Nooshin, Sharifi Shler Ebrahim
- Deciphering The Factors Influencing The Efficacy Of Chain-of-thought: Probability, Memorization, And Noisy Reasoning Prabhakar Akshara, Griffiths Thomas L., Mccoy R. Thomas
- PEDAL: Enhancing Greedy Decoding With Large Language Models Using Diverse Exemplars Prabhu Sumanth
- Interactions With Prompt Problems: A New Way To Teach Programming With Large Language Models Prather James, Denny Paul, Leinonen Juho, Smith David H. Iv, Reeves Brent N., Macneil Stephen, Becker Brett A., Luxton-reilly Andrew, Amarouche Thezyrie, Kimmel Bailey
- Fractured-sorry-bench: Framework For Revealing Attacks In Conversational Turns Undermining Refusal Efficacy And Defenses Over Sorry-bench Priyanshu Aman, Vijay Supriti
- Helpful Or Harmful? Exploring The Efficacy Of Large Language Models For Online Grooming Prevention Prosser Ellie, Edwards Matthew
- Crafting A Good Prompt Or Providing Exemplary Dialogues? A Study Of In-context Learning For Persona-based Dialogue Generation Pu Jiashu, Wan Yajing, Zhang Yuru, Chen Jing, Cheng Ling, Shao Qian, Chang Yongzhu, Lv Tangjie, Zhang Rongsheng
- Code Prompting Elicits Conditional Reasoning Abilities In Text+code Llms Puerto Haritz, Tutek Martin, Aditya Somak, Zhu Xiaodan, Gurevych Iryna
- Model Internals-based Answer Attribution For Trustworthy Retrieval-augmented Generation Qi Jirui, Sarti Gabriele, Fernández Raquel, Bisazza Arianna
- Follow My Instruction And Spill The Beans: Scalable Data Extraction From Retrieval-augmented Generation Systems Qi Zhenting, Zhang Hanlin, Xing Eric, Kakade Sham, Lakkaraju Himabindu
- Mia-bench: Towards Better Instruction Following Evaluation Of Multimodal Llms Qian Yusu, Ye Hanrong, Fauconnier Jean-philippe, Grasch Peter, Yang Yinfei, Gan Zhe
- How Easy Is It To Fool Your Multimodal Llms? An Empirical Analysis On Deceptive Prompts Qian Yusu, Zhang Haotian, Yang Yinfei, Gan Zhe
- Prompt Perturbation Consistency Learning For Robust Language Models Qiang Yao, Nandi Subhrangshu, Mehrabi Ninareh, Steeg Greg Ver, Kumar Anoop, Rumshisky Anna, Galstyan Aram
- Relevant Or Random: Can Llms Truly Perform Analogical Reasoning? Qin Chengwei, Xia Wenhan, Wang Tan, Jiao Fangkai, Hu Yuchen, Ding Bosheng, Chen Ruirui, Joty Shafiq
- Diffusiongpt: Llm-driven Text-to-image Generation System Qin Jie, Wu Jie, Chen Weifeng, Ren Yuxi, Li Huixia, Wu Hefeng, Xiao Xuefeng, Wang Rui, Wen Shilei
- Enhancing Sequential Recommendations Through Multi-perspective Reflections And Iteration Qin Weicong, Xu Yi, Yu Weijie, Shen Chenglei, Zhang Xiao, He Ming, Fan Jianping, Xu Jun
- LAMPO: Large Language Models As Preference Machines For Few-shot Ordinal Classification Qin Zhen, Wu Junru, Shen Jiaming, Liu Tianqi, Wang Xuanhui
- Brainstorming Brings Power To Large Language Models Of Knowledge Reasoning Qin Zining, Wang Chenhao, Qin Huiling, Jia Weijia
- Scenarios And Approaches For Situated Natural Language Explanations Qiu Pengshuo, Rudzicz Frank, Zhu Zining
- Before Generation, Align It! A Novel And Effective Strategy For Mitigating Hallucinations In Text-to-sql Generation Qu Ge, Li Jinyang, Li Bowen, Qin Bowen, Huo Nan, Ma Chenhao, Cheng Reynold
- Tokenrec: Learning To Tokenize ID For Llm-based Generative Recommendation Qu Haohao, Fan Wenqi, Zhao Zihuai, Li Qing
- DEEP-ICL: Definition-enriched Experts For Language Model In-context Learning Qu Xingwei, Liang Yiming, Wang Yucheng, Zheng Tianyu, Yue Tommy, Ma Lei, Huang Stephen W., Zhang Jiajun, Shi Yinan, Lin Chenghua, Fu Jie, Zhang Ge
- Coursegpt-zh: An Educational Large Language Model Based On Knowledge Distillation Incorporating Prompt Optimization Qu Zheyan, Yin Lu, Yu Zitong, Wang Wenbo, Zhang Xing
- Enhancing Ethical Explanations Of Large Language Models Through Iterative Symbolic Refinement Quan Xin, Valentino Marco, Dennis Louise A., Freitas André
- An Empirical Study On Self-correcting Large Language Models For Data Science Code Generation Quoc Thai Tang, Minh Duc Ha, Thanh Tho Quan, Nguyen-duc Anh
- Towards Generalizable Agents In Text-based Educational Environments: A Study Of Integrating RL With Llms Radmehr Bahar, Singla Adish, Käser Tanja
- Simultaneous Masking, Not Prompting Optimization: A Paradigm Shift In Fine-tuning Llms For Simultaneous Translation Raffel Matthew, Agostinelli Victor, Chen Lizhong
- Defan: Definitive Answer Dataset For Llms Hallucination Evaluation Rahman A B M Ashikur, Anwar Saeed, Usman Muhammad, Mian Ajmal
- Code Hallucination Rahman Mirza Masfiqur, Kundu Ashish
- An Investigation Of Neuron Activation As A Unified Lens To Explain Chain-of-thought Eliciting Arithmetic Reasoning Of Llms Rai Daking, Yao Ziyu
- Masontigers At Semeval-2024 Task 9: Solving Puzzles With An Ensemble Of Chain-of-thoughts Raihan Md Nishat, Goswami Dhiman, Emran Al Nahian Bin, Puspo Sadiya Sayara Chowdhury, Ganguly Amrita, Zampieri Marcos
- ANCHOR: Llm-driven News Subject Conditioning For Text-to-image Synthesis Ramakrishnan Aashish Anantha, Huang Sharon X., Lee Dongwon
- STEER: Assessing The Economic Rationality Of Large Language Models Raman Narun, Lundy Taylor, Amouyal Samuel, Levine Yoav, Leyton-brown Kevin, Tennenholtz Moshe
- Preference Distillation For Personalized Generative Recommendation Ramos Jerome, Wu Bin, Lipani Aldo
- Analyzing LLM Behavior In Dialogue Summarization: Unveiling Circumstantial Hallucination Trends Ramprasad Sanjana, Ferracane Elisa, Lipton Zachary C.
- Animate, Or Inanimate, That Is The Question For Large Language Models Ranaldi Leonardo, Pucci Giulia, Zanzotto Fabio Massimo
- Evaluating The Reliability Of Self-explanations In Large Language Models Randl Korbinian, Pavlopoulos John, Henriksson Aron, Lindgren Tony
- SSP: Self-supervised Prompting For Cross-lingual Transfer To Low-resource Languages Using Large Language Models Rathore Vipul, Deb Aniruddha, Chandresh Ankish, Singla Parag, Mausam
- FACTOID: Factual Entailment For Hallucination Detection Rawte Vipula, Tonmoy S. M Towhidul Islam, Rajbangshi Krishnav, Nag Shravani, Chadha Aman, Sheth Amit P., Das Amitava
- "sorry, Come Again?" Prompting -- Enhancing Comprehension And Diminishing Hallucination With [pause]-injected Optimal Paraphrasing Rawte Vipula, Tonmoy S. M Towhidul Islam, Zaman S M Mehedi, Priya Prachi, Chadha Aman, Sheth Amit P., Das Amitava
- Mitigating Exaggerated Safety In Large Language Models Ray Ruchira, Bhalani Ruchi
- Developing Safe And Responsible Large Language Model: Can We Balance Bias Reduction And Language Understanding In Large Language Models? Raza Shaina, Bamgbose Oluwanifemi, Ghuge Shardul, Tavakol Fatemeh, Reji Deepak John, Bashir Syed Raza
- Analyzing And Adapting Large Language Models For Few-shot Multilingual NLU: Are We There Yet? Razumovskaia Evgeniia, Vulić Ivan, Korhonen Anna
- Beyond Performance: Quantifying And Mitigating Label Bias In Llms Reif Yuval, Schwartz Roy
- Identifying Semantic Induction Heads To Understand In-context Learning Ren Jie, Guo Qipeng, Yan Hang, Liu Dongrui, Zhang Quanshi, Qiu Xipeng, Lin Dahua
- Unveiling And Mitigating Memorization In Text-to-image Diffusion Models Through Cross Attention Ren Jie, Li Yaxin, Zeng Shenglai, Xu Han, Lyu Lingjuan, Xing Yue, Tang Jiliang
- Samba: Simple Hybrid State Space Models For Efficient Unlimited Context Language Modeling Ren Liliang, Liu Yang, Lu Yadong, Shen Yelong, Liang Chen, Chen Weizhu
- Do Large Language Models Mirror Cognitive Language Processing? Ren Yuqi, Jin Renren, Zhang Tongxuan, Xiong Deyi
- The Benefits Of A Concise Chain Of Thought On Problem-solving In Large Language Models Renze Matthew, Guven Erhan
- The Effect Of Sampling Temperature On Problem Solving In Large Language Models Renze Matthew, Guven Erhan
- Evaluating Llms For Gender Disparities In Notable Persons Rhue Lauren, Goethals Sofie, Sundararajan Arun
- Code Generation With Alphacodium: From Prompt Engineering To Flow Engineering Ridnik Tal, Kredo Dedy, Friedman Itamar
- Empirical Study Of Symmetrical Reasoning In Conversational Chatbots Rim Daniela N., Choi Heeyoul
- Countering Reward Over-optimization In LLM With Demonstration-guided Reinforcement Learning Rita Mathieu, Strub Florian, Chaabouni Rahma, Michel Paul, Dupoux Emmanuel, Pietquin Olivier
- Zero-shot Spam Email Classification Using Pre-trained Large Language Models Rojas-galeano Sergio
- Question-instructed Visual Descriptions For Zero-shot Video Question Answering Romero David, Solorio Thamar
- An Early Categorization Of Prompt Injection Attacks On Large Language Models Rossi Sippo, Michel Alisia Marianne, Mukkamala Raghava Rao, Thatcher Jason Bennett
- Convolutional Prompting Meets Language Models For Continual Learning Roy Anurag, Moulick Riddhiman, Verma Vinay K., Ghosh Saptarshi, Das Abir
- ERATTA: Extreme RAG For Table To Answers With Large Language Models Roychowdhury Sohini, Krema Marko, Mahammad Anvar, Moore Brian, Mukherjee Arijit, Prakashchandra Punit
- Fist-financial Style Transfer With Hallucination And Creativity Control Framework Roychowdhury Sohini, Krema Marko, Moore Brian, Lai Xingjian, Effedua Dike, Jethwani Bharat
- Evaluation Of RAG Metrics For Question Answering In The Telecom Domain Roychowdhury Sujoy, Soman Sumit, Ranjani H G, Gunda Neeraj, Chhabra Vansh, Bala Sai Krishna
- Do Llms Have Consistent Values? Rozen Naama, Elidan Gal, Globerson Amir, Daniel Ella
- From PEFT To DEFT: Parameter Efficient Finetuning For Reducing Activation Density In Transformers Runwal Bharat, Pedapati Tejaswini, Chen Pin-yu
- Explorations Of Self-repair In Language Models Rushing Cody, Nanda Neel
- Code-aware Prompting: A Study Of Coverage Guided Test Generation In Regression Setting Using LLM Ryan Gabriel, Jain Siddhartha, Shang Mingyue, Wang Shiqi, Ma Xiaofei, Ramanathan Murali Krishna, Ray Baishakhi
- Safetyprompts: A Systematic Review Of Open Datasets For Evaluating And Improving Large Language Model Safety Röttger Paul, Pernisi Fabio, Vidgen Bertie, Hovy Dirk
- In-context Learning Demonstration Selection Via Influence Analysis S. Vinay M., Van Minh-hao, Wu Xintao
- Capabilities Of Gemini Models In Medicine Saab Khaled, Tu Tao, Weng Wei-hung, Tanno Ryutaro, Stutz David, Wulczyn Ellery, Zhang Fan, Strother Tim, Park Chunjong, Vedadi Elahe, Chaves Juanma Zambrano, Hu Szu-yeu, Schaekermann Mike, Kamath Aishwarya, Cheng Yong, Barrett David G. T., Cheung Cathy, Mustafa Basil, Palepu Anil, Mcduff Daniel, Hou Le, Golany Tomer, Liu Luyang, Alayrac Jean-baptiste, Houlsby Neil, Tomasev Nenad, Freyberg Jan, Lau Charles, Kemp Jonas, Lai Jeremy, Azizi Shekoofeh, Kanada Kimberly, Man Siwai, Kulkarni Kavita, Sun Ruoxi, Shakeri Siamak, He Luheng, Caine Ben, Webson Albert, Latysheva Natasha, Johnson Melvin, Mansfield Philip, Lu Jian, Rivlin Ehud, Anderson Jesper, Green Bradley, Wong Renee, Krause Jonathan, Shlens Jonathon, Dominowska Ewa, Eslami S. M. Ali, Chou Katherine, Cui Claire, Vinyals Oriol, Kavukcuoglu Koray, Manyika James, Dean Jeff, Hassabis Demis, Matias Yossi, Webster Dale, Barral Joelle, Corrado Greg, Semturs Christopher, Mahdavi S. Sara, Gottweis Juraj, Karthikesalingam Alan, Natarajan Vivek
- Utebc-nlp At Semeval-2024 Task 9: Can Llms Be Lateral Thinkers? Sadeghi Pouya, Abaskohi Amirhossein, Yaghoobzadeh Yadollah
- Vision-language And Large Language Model Performance In Gastroenterology: GPT, Claude, Llama, Phi, Mistral, Gemma, And Quantized Models Safavi-naini Seyed Amir Ahmad, Ali Shuhaib, Shahab Omer, Shahhoseini Zahra, Savage Thomas, Rafiee Sara, Samaan Jamil S, Shabeeb Reem Al, Ladak Farah, Yang Jamie O, Echavarria Juan, Babar Sumbal, Shaukat Aasma, Margolis Samuel, Tatonetti Nicholas P, Nadkarni Girish, Kurdi Bara El, Soroush Ali
- On Zero-shot Counterspeech Generation By Llms Saha Punyajoy, Agrawal Aalok, Jana Abhik, Biemann Chris, Mukherjee Animesh
- A Systematic Survey Of Prompt Engineering In Large Language Models: Techniques And Applications Sahoo Pranab, Singh Ayush Kumar, Saha Sriparna, Jain Vinija, Mondal Samrat, Chadha Aman
- Mixsumm: Topic-based Data Augmentation Using Llms For Low-resource Extractive Text Summarization Sahu Gaurav, Laradji Issam H.
- Pelican: Correcting Hallucination In Vision-llms Via Claim Decomposition And Program Of Thought Verification Sahu Pritish, Sikka Karan, Divakaran Ajay
- Where Is The Answer? Investigating Positional Bias In Language Model Knowledge Extraction Saito Kuniaki, Sohn Kihyuk, Lee Chen-yu, Ushiku Yoshitaka
- Toward The Evaluation Of Large Language Models Considering Score Variance Across Instruction Templates Sakai Yusuke, Nohejl Adam, Hang Jiangnan, Kamigaito Hidetaka, Watanabe Taro
- Synergizing In-context Learning With Hints For End-to-end Task-oriented Dialog Systems Saley Vishal Vivek, Das Rocktim Jyoti, Raghu Dinesh, Mausam
- Simulating Task-oriented Dialogues With State Transition Graphs And Large Language Models Samarinas Chris, Promthaw Pracha, Nijasure Atharva, Zeng Hansi, Killingback Julian, Zamani Hamed
- Berts Are Generative In-context Learners Samuel David
- Rainbow Teaming: Open-ended Generation Of Diverse Adversarial Prompts Samvelyan Mikayel, Raparthy Sharath Chandra, Lupu Andrei, Hambro Eric, Markosyan Aram H., Bhatt Manish, Mao Yuning, Jiang Minqi, Parker-holder Jack, Foerster Jakob, Rocktäschel Tim, Raileanu Roberta
- Zero-shot RTL Code Generation With Attention Sink Augmented Large Language Models Sandal Selim, Akturk Ismail
- Retrieval-augmented Generation Meets Data-driven Tabula Rasa Approach For Temporal Knowledge Graph Forecasting Sannidhi Geethan, Sakhinana Sagar Srinivas, Runkana Venkataramana
- Lean Attention: Hardware-aware Scalable Attention Mechanism For The Decode-phase Of Transformers Sanovar Rya, Bharadwaj Srikant, Amant Renee St., Rühle Victor, Rajmohan Saravan
- The Power Of Prompts: Evaluating And Mitigating Gender Bias In MT With Llms Sant Aleix, Escolano Carlos, Mash Audrey, Fornaciari Francesca De Luca, Melero Maite
- Prompting Llms To Compose Meta-review Drafts From Peer-review Narratives Of Scholarly Manuscripts Santu Shubhra Kanti Karmaker, Sinha Sanjeev Kumar, Bansal Naman, Knipper Alex, Sarkar Souvika, Salvador John, Mahajan Yash, Guttikonda Sri, Akter Mousumi, Freestone Matthew, Williams Matthew C. Jr
- ICAL: Continual Learning Of Multimodal Agents By Transforming Trajectories Into Actionable Insights Sarch Gabriel, Jang Lawrence, Tarr Michael J., Cohen William W., Marino Kenneth, Fragkiadaki Katerina
- HELPER-X: A Unified Instructable Embodied Agent To Tackle Four Interactive Vision-language Domains With Memory-augmented Language Models Sarch Gabriel, Somani Sahil, Kapoor Raghav, Tarr Michael J., Fragkiadaki Katerina
- Textmachina: Seamless Generation Of Machine-generated Text Datasets Sarvazyan Areg Mikael, González José Ángel, Franco-salvador Marc
- Improving Sentence Embeddings With Automatic Generation Of Training Data Using Few-shot Examples Sato Soma, Tsukagoshi Hayato, Sasano Ryohei, Takeda Koichi
- A Prompt Response To The Demand For Automatic Gender-neutral Translation Savoldi Beatrice, Piergentili Andrea, Fucci Dennis, Negri Matteo, Bentivogli Luisa
- Robust Few-shot Transfer Learning For Knowledge Base Question Answering With Unanswerable Questions Sawhney Riya, Bhattacharya Indrajit, Mausam
- Evaluating Consistency And Reasoning Capabilities Of Large Language Models Saxena Yash, Chopra Sarthak, Tripathi Arunendra Mani
- Can Llms Correct Physicians, Yet? Investigating Effective Interaction Methods In The Medical Domain Sayin Burcu, Minervini Pasquale, Staiano Jacopo, Passerini Andrea
- Automated Educational Question Generation At Different Bloom's Skill Levels Using Large Language Models: Strategies And Evaluation Scaria Nicy, Chenna Suma Dharani, Subramani Deepak
- Can Small Language Models Learn, Unlearn, And Retain Noise Patterns? Scaria Nicy, Kennedy Silvester John Joseph, Subramani Deepak
- Symbolic Prompt Program Search: A Structure-aware Approach To Efficient Compile-time Prompt Optimization Schnabel Tobias, Neville Jennifer
- In-context Learning Agents Are Asymmetric Belief Updaters Schubert Johannes A., Jagadish Akshay K., Binz Marcel, Schulz Eric
- Team UTSA-NLP At Semeval 2024 Task 5: Prompt Ensembling For Argument Reasoning In Civil Procedures With GPT4 Schumacher Dan, Rios Anthony
- Large Language Models To Generate System-level Test Programs Targeting Non-functional Properties Schwachhofer Denis, Domanski Peter, Becker Steffen, Wagner Stefan, Sauer Matthias, Pflüger Dirk, Polian Ilia
- Rethinking LLM Memorization Through The Lens Of Adversarial Compression Schwarzschild Avi, Feng Zhili, Maini Pratyush, Lipton Zachary C., Kolter J. Zico
- Soft Prompt Threats: Attacking Safety Alignment And Unlearning In Open-source Llms Through The Embedding Space Schwinn Leo, Dobre David, Xhonneux Sophie, Gidel Gauthier, Gunnemann Stephan
- Behavioral Testing: Can Large Language Models Implicitly Resolve Ambiguous Entities? Sedova Anastasiia, Litschko Robert, Frassinelli Diego, Roth Benjamin, Plank Barbara
- Speech Reallm -- Real-time Streaming Speech Recognition With Multimodal Llms By Teaching The Flow Of Time Seide Frank, Doulaty Morrie, Shi Yangyang, Gaur Yashesh, Jia Junteng, Wu Chunyang
- Quantitative Knowledge Retrieval From Large Language Models Selby David, Spriestersbach Kai, Iwashita Yuichiro, Bappert Dennis, Warrier Archana, Mukherjee Sumantrak, Asim Muhammad Nabeel, Kise Koichi, Vollmer Sebastian
- Benchmarking Large Language Models For Math Reasoning Tasks Seßler Kathrin, Rong Yao, Gözlüklü Emek, Kasneci Enkelejda
- Medpromptx: Grounded Multimodal Prompting For Chest X-ray Diagnosis Shaaban Mai A., Khan Adnan, Yaqub Mohammad
- Ai-assisted Generation Of Difficult Math Questions Shah Vedant, Yu Dingli, Lyu Kaifeng, Park Simon, Ke Nan Rosemary, Mozer Michael, Bengio Yoshua, Arora Sanjeev, Goyal Anirudh
- Putting Gpt-4o To The Sword: A Comprehensive Evaluation Of Language, Vision, Speech, And Multimodal Proficiency Shahriar Sakib, Lund Brady, Mannuru Nishith Reddy, Arshad Muhammad Arbab, Hayawi Kadhim, Bevara Ravi Varma Kumar, Mannuru Aashrith, Batool Laiba
- Show, Don't Tell: Aligning Language Models With Demonstrated Feedback Shaikh Omar, Lam Michelle, Hejna Joey, Shao Yijia, Bernstein Michael, Yang Diyi
- Guiding In-context Learning Of Llms Through Quality Estimation For Machine Translation Sharami Javad Pourmostafa Roshan, Shterionov Dimitar, Spronck Pieter
- Unirag: Universal Retrieval Augmentation For Multi-modal Large Language Models Sharifymoghaddam Sahel, Upadhyay Shivani, Chen Wenhu, Lin Jimmy
- SPML: A DSL For Defending Language Models Against Prompt Attacks Sharma Reshabh K, Gupta Vinayak, Grossman Dan
- A Preliminary Study On Using Large Language Models In Software Pentesting Shashwat Kumar, Hahn Francis, Ou Xinming, Goldgof Dmitry, Hall Lawrence, Ligatti Jay, Rajgopalan S. Raj, Tabari Armin Ziaie
- Rumour Evaluation With Very Large Language Models Shehata Dahlia, Cohen Robin, Clarke Charles
- Rapid Optimization For Jailbreaking Llms Via Subconscious Exploitation And Echopraxia Shen Guangyu, Cheng Siyuan, Zhang Kaiyuan, Tao Guanhong, An Shengwei, Yan Lu, Zhang Zhuo, Ma Shiqing, Zhang Xiangyu
- Boosting Reward Model With Preference-conditional Multi-aspect Synthetic Data Generation Shen Jiaming, Xu Ran, Jun Yennie, Qin Zhen, Liu Tianqi, Yang Carl, Liang Yi, Baumgartner Simon, Bendersky Michael
- The Language Barrier: Dissecting Safety Challenges Of Llms In Multilingual Contexts Shen Lingfeng, Tan Weiting, Chen Sihao, Chen Yunmo, Zhang Jingyu, Xu Haoran, Zheng Boyuan, Koehn Philipp, Khashabi Daniel
- Pyramid Coder: Hierarchical Code Generator For Compositional Visual Question Answering Shen Ruoyue, Inoue Nakamasa, Shinoda Koichi
- Improving Reinforcement Learning From Human Feedback Using Contrastive Rewards Shen Wei, Zhang Xiaoying, Yao Yuanshun, Zheng Rui, Guo Hongyi, Liu Yang
- PMG: Personalized Multimodal Generation With Large Language Models Shen Xiaoteng, Zhang Rui, Zhao Xiaoyan, Zhu Jieming, Xiao Xi
- Proctag: Process Tagging For Assessing The Efficacy Of Document Instruction Data Shen Yufan, Luo Chuwei, Zhu Zhaoqing, Chen Yang, Zheng Qi, Yu Zhi, Bu Jiajun, Yao Cong
- The Fire Thief Is Also The Keeper: Balancing Usability And Privacy In Prompts Shen Zhili, Xi Zihang, He Ying, Tong Wei, Hua Jingyu, Zhong Sheng
- From Language Models To Practical Self-improving Computer Agents Sheng Alex
- Repeval: Effective Text Evaluation With LLM Representation Sheng Shuqian, Xu Yi, Zhang Tianhang, Shen Zanwei, Fu Luoyi, Ding Jiaxin, Zhou Lei, Wang Xinbing, Zhou Chenghu
- Neurosymbolic AI For Enhancing Instructability In Generative AI Sheth Amit, Pallagani Vishal, Roy Kaushik
- Judging The Judges: A Systematic Investigation Of Position Bias In Pairwise Comparative Assessments By Llms Shi Lin, Ma Chiyu, Ma Weicheng, Vosoughi Soroush
- Retrieval-enhanced Knowledge Editing In Language Models For Multi-hop Question Answering Shi Yucheng, Tan Qiaoyu, Wu Xuansheng, Zhong Shaochen, Zhou Kaixiong, Liu Ninghao
- Why Larger Language Models Do In-context Learning Differently? Shi Zhenmei, Wei Junyi, Xu Zhuoyan, Liang Yingyu
- Abusegpt: Abuse Of Generative AI Chatbots To Create Smishing Campaigns Shibli Ashfak Md, Pritom Mir Mehedi A., Gupta Maanak
- Can Prompt Modifiers Control Bias? A Comparative Analysis Of Text-to-image Generative Models Shin Philip Wootaek, Ahn Jihyun Janice, Yin Wenpeng, Sampson Jack, Narayanan Vijaykrishnan
- Decompose, Enrich, And Extract! Schema-aware Event Extraction Using Llms Shiri Fatemeh, Nguyen Van, Moghimifar Farhad, Yoo John, Haffari Gholamreza, Li Yuan-fang
- Deconstructing In-context Learning: Understanding Prompts Via Corruption Shivagunde Namrata, Lialin Vladislav, Muckatira Sherin, Rumshisky Anna
- Structuredrag: JSON Response Formatting With Large Language Models Shorten Connor, Pierse Charles, Smith Thomas Benjamin, Cardenas Erika, Sharma Akanksha, Trengrove John, Van Luijt Bob
- Knowledge Graph Large Language Model (KG-LLM) For Link Prediction Shu Dong, Chen Tianle, Jin Mingyu, Zhang Chong, Du Mengnan, Zhang Yongfeng
- An Active Inference Strategy For Prompting Reliable Responses From Large Language Models In Medical Practice Shusterman Roma, Waters Allison C., O'neill Shannon, Luu Phan, Tucker Don M.
- Design2code: How Far Are We From Automating Front-end Engineering? Si Chenglei, Zhang Yanzhe, Yang Zhengyuan, Liu Ruibo, Yang Diyi
- Iclguard: Controlling In-context Learning Behavior For Applicability Authorization Si Wai Man, Backes Michael, Zhang Yang
- Where Does In-context Translation Happen In Large Language Models Sia Suzanna, Mueller David, Duh Kevin
- Integrating Large Language Models Into Recommendation Via Mutual Augmentation And Adaptive Aggregation Sichun Luo, Yuxuan Yao, Bowei He, Yinya Huang, Aojun Zhou, Xinyi Zhang, Yuanzhang Xiao, Mingjie Zhan, Linqi Song
- The Fault In Our Stars: Quality Assessment Of Code Generation Benchmarks Siddiq Mohammed Latif, Dristi Simantika, Saha Joy, Santos Joanna C. S.
- Attention Overflow: Language Model Input Blur During Long-context Missing Items Recommendation Sileo Damien
- Robustness Of Llms To Perturbations In Text Singh Ayush, Singh Navpreet, Vatsal Shubham
- Llm-dcache: Improving Tool-augmented Llms With Gpt-driven Localized Data Caching Singh Simranjit, Fore Michael, Karatzas Andreas, Lee Chaehong, Jian Yanan, Shangguan Longfei, Yu Fuxun, Anagnostopoulos Iraklis, Stamoulis Dimitrios
- A Three-pronged Approach To Cross-lingual Adaptation With Multilingual Llms Singh Vaibhav, Krishna Amrith, Nj Karthika, Ramakrishnan Ganesh
- Are Small Language Models Ready To Compete With Large Language Models For Practical Applications? Sinha Neelabh, Jain Vinija, Chadha Aman
- Maml-en-llm: Model Agnostic Meta-training Of Llms For Improved In-context Learning Sinha Sanchit, Yue Yuguang, Soto Victor, Kulkarni Mayank, Lu Jianhua, Zhang Aidong
- Exploring Value Biases: How Llms Deviate Towards The Ideal Sivaprasad Sarath, Kaushik Pramod, Abdelnabi Sahar, Fritz Mario
- Using Adaptive Empathetic Responses For Teaching English Siyan Li, Shao Teresa, Yu Zhou, Hirschberg Julia
- Scaling LLM Test-time Compute Optimally Can Be More Effective Than Scaling Model Parameters Snell Charlie, Lee Jaehoon, Xu Kelvin, Kumar Aviral
- Aligning Large Language Models For Enhancing Psychiatric Interviews Through Symptom Delineation And Summarization So Jae-hee, Chang Joonhwan, Kim Eunji, Na Junho, Choi Jiyeon, Sohn Jy-yong, Kim Byung-hoon, Chu Sang Hui
- From Words To Worlds: Transforming One-line Prompt Into Immersive Multi-modal Digital Stories With Communicative LLM Agent Sohn Samuel S., Li Danrui, Zhang Sen, Chang Che-jui, Kapadia Mubbasir
- Gistembed: Guided In-sample Selection Of Training Negatives For Text Embedding Fine-tuning Solatorio Aivin V.
- Multi-task Inference: Can Large Language Models Follow Multiple Instructions At Once? Son Guijin, Baek Sangwon, Nam Sangdae, Jeong Ilgyun, Kim Seungone
- Prefixing Attention Sinks Can Mitigate Activation Outliers For Large Language Model Quantization Son Seungwoo, Park Wonpyo, Han Woohyun, Kim Kyuyeun, Lee Jaeho
- Scaling Data Diversity For Fine-tuning Language Models In Human Alignment Song Feifan, Yu Bowen, Lang Hao, Yu Haiyang, Huang Fei, Wang Houfeng, Li Yongbin
- Out-of-distribution Generalization Via Composition: A Lens Through Induction Heads In Transformers Song Jiajun, Xu Zhuoyan, Zhong Yiqiao
- Multilingual Blending: LLM Safety Alignment Evaluation With Language Mixture Song Jiayang, Huang Yuheng, Zhou Zhehua, Ma Lei
- Moma: Multimodal LLM Adapter For Fast Personalized Image Generation Song Kunpeng, Zhu Yizhe, Liu Bingchen, Yan Qing, Elgammal Ahmed, Yang Xiao
- Can Many-shot In-context Learning Help Long-context LLM Judges? See More, Judge Better! Song Mingyang, Zheng Mao, Luo Xuan
- ELLA-V: Stable Neural Codec Language Modeling With Alignment-guided Sequence Reordering Song Yakun, Chen Zhuo, Wang Xiaofei, Ma Ziyang, Chen Xie
- Alchemistcoder: Harmonizing And Eliciting Code Capability By Hindsight Tuning On Multi-source Data Song Zifan, Wang Yudong, Zhang Wenwei, Liu Kuikun, Lyu Chengqi, Song Demin, Guo Qipeng, Yan Hang, Lin Dahua, Chen Kai, Zhao Cairong
- Enhancing Ai-driven Psychological Consultation: Layered Prompts With Large Language Models Souza Rafael, Lim Jia-hao, Davis Alexander
- Understanding Memorisation In Llms: Dynamics, Influencing Factors, And Implications Speicher Till, Khan Mohammad Aflah, Wu Qinyuan, Nanda Vedant, Das Soumi, Ghosh Bishwamittra, Gummadi Krishna P., Terzi Evimaria
- Chatgpt For Conversational Recommendation: Refining Recommendations By Reprompting With Feedback Spurlock Kyle Dylan, Acun Cagla, Saka Esin, Nasraoui Olfa
- VDGD: Mitigating LVLM Hallucinations In Cognitive Prompts By Bridging The Visual Perception Gap Sreyan Ghosh, Chandra Kiran Reddy Evuru, Sonal Kumar, Utkarsh Tyagi, Oriol Nieto, Zeyu Jin, Dinesh Manocha
- Crossing New Frontiers: Knowledge-augmented Large Language Model Prompting For Zero-shot Text-based De Novo Molecule Design Srinivas Sakhinana Sagar, Runkana Venkataramana
- Comparative Analysis Of Different Efficient Fine Tuning Methods Of Large Language Models (llms) In Low-resource Setting Srinivasan Krishna Prasad Varadarajan, Gumpena Prasanth, Yattapu Madhusudhana, Brahmbhatt Vishal H.
- Evaluating Llms' Mathematical Reasoning In Financial Document Question Answering Srivastava Pragya, Malik Manuj, Gupta Vivek, Ganu Tanuja, Roth Dan
- Exploring LLM Prompting Strategies For Joint Essay Scoring And Feedback Generation Stahl Maja, Biermann Leon, Nehring Andreas, Wachsmuth Henning
- Towards Truly Zero-shot Compositional Visual Reasoning With Llms As Programmers Stanić Aleksandar, Caelles Sergi, Tschannen Michael
- A Comprehensive Study Of The Capabilities Of Large Language Models For Vulnerability Detection Steenhoek Benjamin, Rahman Md Mahbubur, Roy Monoshi Kumar, Alam Mirza Sanjida, Barr Earl T., Le Wei
- Virtual Agents For Alcohol Use Counseling: Exploring Llm-powered Motivational Interviewing Steenstra Ian, Nouraei Farnaz, Arjmand Mehdi, Bickmore Timothy W.
- Fine-tuning Large Language Models For Entity Matching Steiner Aaron, Peeters Ralph, Bizer Christian
- RLVF: Learning From Verbal Feedback Without Overgeneralization Stephan Moritz, Khazatsky Alexander, Mitchell Eric, Chen Annie S, Hsu Sheryl, Sharma Archit, Finn Chelsea
- Large Language Models Are Inconsistent And Biased Evaluators Stureborg Rickard, Alikaniotis Dimitris, Suhara Yoshi
- Mission Impossible: A Statistical Perspective On Jailbreaking Llms Su Jingtong, Kempe Julia, Ullrich Karen
- Demonstration Augmentation For Zero-shot In-context Learning Su Yi, Tai Yunpeng, Ji Yixin, Li Juntao, Yan Bowen, Zhang Min
- Whispering Experts: Neural Interventions For Toxicity Mitigation In Language Models Suau Xavier, Delobelle Pieter, Metcalf Katherine, Joulin Armand, Apostoloff Nicholas, Zappella Luca, Rodríguez Pau
- 70b-parameter Large Language Models In Japanese Medical Question-answering Sukeda Issey, Kishikawa Risa, Kodera Satoshi
- Structured Chain-of-thought Prompting For Few-shot Generation Of Content-grounded QA Conversations Sultan Md Arafat, Ganhotra Jatin, Astudillo Ramón Fernandez
- Hierarchical In-context Reinforcement Learning With Hindsight Modular Reflections For Planning Sun Chuanneng, Huang Songjun, Pompili Dario
- Delrec: Distilling Sequential Pattern To Enhance Llm-based Recommendation Sun Guohao, Zhang Haoyi
- Retrieved In-context Principles From Previous Mistakes Sun Hao, Jiang Yong, Wang Bo, Hou Yingyan, Zhang Yan, Xie Pengjun, Huang Fei
- Benchmarking Chinese Commonsense Reasoning Of Llms: From Chinese-specifics To Reasoning-memorization Correlations Sun Jiaxing, Huang Weiquan, Wu Jiang, Gu Chenya, Li Wei, Zhang Songyang, Yan Hang, He Conghui
- Fine-tuning Vs Prompting, Can Language Models Understand Human Values? Sun Pingwei
- Large Language Models As Conversational Movie Recommenders: A User Study Sun Ruixuan, Li Xinyi, Akella Avinash, Konstan Joseph A.
- Prompt Chaining Or Stepwise Prompt? Refinement In Text Summarization Sun Shichao, Yuan Ruifeng, Cao Ziqiang, Li Wenjie, Liu Pengfei
- Unicoder: Scaling Code Large Language Model Via Universal Code Sun Tao, Chai Linzheng, Yang Jian, Yin Yuwei, Guo Hongcheng, Liu Jiaheng, Wang Bing, Yang Liqun, Li Zhoujun
- Expert-guided Extinction Of Toxic Tokens For Debiased Generation Sun Xueyao, Shi Kaize, Tang Haoran, Xu Guandong, Li Qing
- Leveraging Chatgpt In Pharmacovigilance Event Extraction: An Empirical Study Sun Zhaoyue, Pergola Gabriele, Wallace Byron C., He Yulan
- Meta-prompting: Enhancing Language Models With Task-agnostic Scaffolding Suzgun Mirac, Kalai Adam Tauman
- From Explanations To Action: A Zero-shot, Theory-driven LLM Framework For Student Performance Feedback Swamy Vinitra, Romano Davide, Desikan Bhargav Srinivasa, Camburu Oana-maria, Käser Tanja
- Performance Assessment Of Chatgpt Vs Bard In Detecting Alzheimer's Dementia T Balamurali B, Chen Jer-ming
- JORA: JAX Tensor-parallel Lora Library For Retrieval Augmented Fine-tuning Tahir Anique, Cheng Lu, Liu Huan
- All In How You Ask For It: Simple Black-box Method For Jailbreak Attacks Takemoto Kazuhiro
- Phantom: Personality Has An Effect On Theory-of-mind Reasoning In Large Language Models Tan Fiona Anting, Yeo Gerard Christopher, Wu Fanyou, Xu Weijie, Jain Vinija, Chadha Aman, Jaidka Kokil, Liu Yang, Ng See-kiong
- Lloco: Learning Long Contexts Offline Tan Sijun, Li Xiuyu, Patil Shishir, Wu Ziyang, Zhang Tianjun, Keutzer Kurt, Gonzalez Joseph E., Popa Raluca Ada
- Thought-like-pro: Enhancing Reasoning Of Large Language Models Through Self-driven Prolog-based Chain-of-thought Tan Xiaoyu, Deng Yongxin, Qiu Xihe, Xu Weidi, Qu Chao, Chu Wei, Xu Yinghui, Qi Yuan
- Struct-x: Enhancing Large Language Models Reasoning With Structured Data Tan Xiaoyu, Wang Haoyu, Qiu Xihe, Cheng Yuan, Xu Yinghui, Chu Wei, Qi Yuan
- Democratizing Large Language Models Via Personalized Parameter-efficient Fine-tuning Tan Zhaoxuan, Zeng Qingkai, Tian Yijun, Liu Zheyuan, Yin Bing, Jiang Meng
- Tuning-free Accountable Intervention For LLM Deployment -- A Metacognitive Approach Tan Zhen, Peng Jie, Chen Tianlong, Liu Huan
- An Empirical Study And Analysis Of Text-to-image Generation Using Large Language Model-powered Textual Representation Tan Zhiyu, Yang Mengping, Qin Luozheng, Yang Hao, Qian Ye, Zhou Qiang, Zhang Cheng, Li Hao
- Leveraging Language Models For Emotion And Behavior Analysis In Education Tanaka Kaito, Tan Benjamin, Wong Brian
- Ungrammatical-syntax-based In-context Example Selection For Grammatical Error Correction Tang Chenming, Qu Fanyi, Wu Yunfang
- Large Language Models Might Not Care What You Are Saying: Prompt Format Beats Descriptions Tang Chenming, Wang Zhixiang, Wu Yunfang
- Unleashing The Potential Of Large Language Models As Prompt Optimizers: An Analogical Analysis With Gradient-based Model Optimizers Tang Xinyu, Wang Xiaolei, Zhao Wayne Xin, Lu Siyuan, Li Yaliang, Wen Ji-rong
- On The Hardness Of Faithful Chain-of-thought Reasoning In Large Language Models Tanneru Sree Harsha, Ley Dan, Agarwal Chirag, Lakkaraju Himabindu
- Chatgpt Incorrectness Detection In Software Reviews Tanzil Minaoar Hossain, Khan Junaed Younus, Uddin Gias
- Targeted Visual Prompting For Medical Visual Question Answering Tascon-morales Sergio, Márquez-neila Pablo, Sznitman Raphael
- Large Language Models Are Null-shot Learners Taveekitworachai Pittawat, Abdullah Febri, Thawonmas Ruck
- Dynamic Contexts For Generating Suggestion Questions In RAG Based Conversational Systems Tayal Anuja, Tyagi Aman
- From Feature Importance To Natural Language Explanations Using Llms With RAG Tekkesinoglu Sule, Kunze Lars
- Interactive Prompt Debugging With Sequence Salience Tenney Ian, Mullins Ryan, Du Bin, Pandya Shree, Kahng Minsuk, Dixon Lucas
- Training-free Consistent Text-to-image Generation Tewel Yoad, Kaduri Omri, Gal Rinon, Kasten Yoni, Wolf Lior, Chechik Gal, Atzmon Yuval
- Guardrail Baselines For Unlearning In Llms Thaker Pratiksha, Maurya Yash, Hu Shengyuan, Wu Zhiwei Steven, Smith Virginia
- Judging The Judges: Evaluating Alignment And Vulnerabilities In Llms-as-judges Thakur Aman Singh, Choudhary Kartik, Ramayapally Venkat Srinik, Vaidyanathan Sankaran, Hupkes Dieuwke
- Supporting Human Raters With The Detection Of Harmful Content Using Large Language Models Thomas Kurt, Kelley Patrick Gage, Tao David, Meiklejohn Sarah, Vallis Owen, Tan Shunwen, Bratanič Blaž, Ferreira Felipe Tiengo, Eranti Vijay Kumar, Bursztein Elie
- Limits Of Transformer Language Models On Learning To Compose Algorithms Thomm Jonathan, Terzic Aleksandar, Camposampiero Giacomo, Hersche Michael, Schölkopf Bernhard, Rahimi Abbas
- Assessing Large Language Models In Mechanical Engineering Education: A Study On Mechanics-focused Conceptual Understanding Tian Jie, Hou Jixin, Wu Zihao, Shu Peng, Liu Zhengliang, Xiang Yujie, Gu Beikang, Filla Nicholas, Li Yiwei, Liu Ning, Chen Xianyan, Tang Keke, Liu Tianming, Wang Xianqiao
- Robustness Assessment Of Mathematical Reasoning In The Presence Of Missing And Contradictory Conditions Tian Shi-yu, Zhou Zhi, Jia Lin-han, Guo Lan-zhe, Li Yu-feng
- Toward Self-improvement Of Llms Via Imagination, Searching, And Criticizing Tian Ye, Peng Baolin, Song Linfeng, Jin Lifeng, Yu Dian, Mi Haitao, Yu Dong
- Selective Prompt Anchoring For Code Generation Tian Yuan, Zhang Tianyi
- Can Large Language Models Assess Serendipity In Recommender Systems? Tokutake Yu, Okamoto Kazushi
- An Ecosage Assistant: Towards Building A Multimodal Plant Care Dialogue Assistant Tomar Mohit, Tiwari Abhisek, Saha Tulika, Jha Prince, Saha Sriparna
- Can Llms Learn From Previous Mistakes? Investigating Llms' Errors To Boost For Reasoning Tong Yongqi, Li Dawei, Wang Sizhe, Wang Yujia, Teng Fei, Shang Jingbo
- A Comprehensive Survey Of Hallucination Mitigation Techniques In Large Language Models Tonmoy S. M Towhidul Islam, Zaman S M Mehedi, Jain Vinija, Rani Anku, Rawte Vipula, Chadha Aman, Das Amitava
- Prompting Techniques For Secure Code Generation: A Systematic Investigation Tony Catherine, Ferreyra Nicolás E. Díaz, Mutas Markus, Dhiff Salem, Scandariato Riccardo
- Evaluating Large Language Models With Grid-based Game Competitions: An Extensible LLM Benchmark And Leaderboard Topsakal Oguzhan, Edell Colby Jacob, Harper Jackson Bailey
- Openmathinstruct-1: A 1.8 Million Math Instruction Tuning Dataset Toshniwal Shubham, Moshkov Ivan, Narenthiran Sean, Gitman Daria, Jia Fei, Gitman Igor
- Few-shot Cross-lingual Transfer For Prompting Large Language Models In Low-resource Languages Toukmaji Christopher
- Connecting The Dots: Llms Can Infer And Verbalize Latent Structure From Disparate Training Data Treutlein Johannes, Choi Dami, Betley Jan, Anil Cem, Marks Samuel, Grosse Roger Baker, Evans Owain
- Leveraging LLM Reasoning Enhances Personalized Recommender Systems Tsai Alicia Y., Kraft Adam, Jin Long, Cai Chenwei, Hosseini Anahita, Xu Taibai, Zhang Zemin, Hong Lichan, Chi Ed H., Yi Xinyang
- Enhance Modality Robustness In Text-centric Multimodal Alignment With Adversarial Prompting Tsai Yun-da, Yen Ting-yu, Liao Keng-te, Lin Shou-de
- Augmenting The Author: Exploring The Potential Of AI Collaboration In Academic Writing Tu Joseph, Hadan Hilda, Wang Derrick M., Sgandurra Sabrina A, Mogavi Reza Hadi, Nacke Lennart E.
- Progres: Prompted Generative Rescoring On ASR N-best Tur Ada Defne, Moumen Adel, Ravanelli Mirco
- Lifelong Robot Library Learning: Bootstrapping Composable And Generalizable Skills For Embodied Control With Language Models Tziafas Georgios, Kasaei Hamidreza
- Adacoder: Adaptive Prompt Compression For Programmatic Visual Question Answering Ukai Mahiro, Kurita Shuhei, Hashimoto Atsushi, Ushiku Yoshitaka, Inoue Nakamasa
- PARADISE: Evaluating Implicit Planning Skills Of Language Models With Procedural Warnings And Tips Dataset Uzunoglu Arda, Safa Abdalfatah Rashid, Şahin Gözde Gül
- Evaluating The Capability Of Large Language Models To Personalize Science Texts For Diverse Middle-school-age Learners Vaccaro Michael Jr, Friday Mikayla, Zaghi Arash
- Natural Language Programming In Medicine: Administering Evidence Based Clinical Workflows With Autonomous Agents Powered By Generative Large Language Models Vaid Akhil, Lampert Joshua, Lee Juhee, Sawant Ashwin, Apakama Donald, Sakhuja Ankit, Soroush Ali, Bick Sarah, Abbott Ethan, Gomez Hernando, Hadley Michael, Lee Denise, Landi Isotta, Duong Son Q, Bussola Nicole, Nabeel Ismail, Muehlstedt Silke, Freeman Robert, Kovatch Patricia, Carr Brendan, Wang Fei, Glicksberg Benjamin, Argulian Edgar, Lerakis Stamatios, Khera Rohan, Reich David L., Kraft Monica, Charney Alexander, Nadkarni Girish
- AI Sandbagging: Language Models Can Strategically Underperform On Evaluations Van Der Weij Teun, Hofstätter Felix, Jaffe Ollie, Brown Samuel F., Ward Francis Rhys
- Infusing Knowledge Into Large Language Models With Contextual Prompts Vasisht Kinshuk, Ganesan Balaji, Kumar Vikas, Bhatnagar Vasudha
- A Survey Of Prompt Engineering Methods In Large Language Models For Different NLP Tasks Vatsal Shubham, Dubey Harsh
- Can GPT Redefine Medical Understanding? Evaluating GPT On Biomedical Machine Reading Comprehension Vatsal Shubham, Singh Ayush
- On The Brittle Foundations Of React Prompting For Agentic Large Language Models Verma Mudit, Bhambri Siddhant, Kambhampati Subbarao
- Theory Of Mind Abilities Of Large Language Models In Human-robot Interaction: An Illusion? Verma Mudit, Bhambri Siddhant, Kambhampati Subbarao
- Auditing Counterfire: Evaluating Advanced Counterargument Generation With Evidence And Style Verma Preetika, Jaidka Kokil, Churina Svetlana
- Exploring LGBTQ+ Bias In Generative AI Answers Across Different Country And Religious Contexts Vicsek Lilla, Vancsó Anna, Zajko Mike, Takacs Judit
- Exploring Safety-utility Trade-offs In Personalized Language Models Vijjini Anvesh Rao, Chowdhury Somnath Basu Roy, Chaturvedi Snigdha
- Emotional Manipulation Through Prompt Engineering Amplifies Disinformation Generation In AI Large Language Models Vinay Rasita, Spitale Giovanni, Biller-andorno Nikola, Germani Federico
- Exploring Fact Memorization And Style Imitation In Llms Using Qlora: An Experimental Study And Quality Assessment Methods Vyborov Eugene, Osypenko Oleksiy, Sotnyk Serge
- Generative Large Language Models In Automated Fact-checking: A Survey Vykopal Ivan, Pikuliak Matúš, Ostermann Simon, Šimko Marián
- Leveraging Zero-shot Prompting For Efficient Language Model Distillation Vöge Lukas, Gurgul Vincent, Lessmann Stefan
- From Rags To Rich Parameters: Probing How Language Models Utilize External Knowledge Over Parametric Information For Factual Queries Wadhwa Hitesh, Seetharaman Rahul, Aggarwal Somyaa, Ghosh Reshmi, Basu Samyadeep, Srinivasan Soundararajan, Zhao Wenlong, Chaudhari Shreyas, Aghazadeh Ehsan
- Learning To Refine With Fine-grained Natural Language Feedback Wadhwa Manya, Zhao Xinyu, Li Junyi Jessy, Durrett Greg
- Paraphrase Types Elicit Prompt Engineering Capabilities Wahle Jan Philip, Ruas Terry, Xu Yang, Gipp Bela
- An Empirical Study Of Mamba-based Language Models Waleffe Roger, Byeon Wonmin, Riach Duncan, Norick Brandon, Korthikanti Vijay, Dao Tri, Gu Albert, Hatamizadeh Ali, Singh Sudhakar, Narayanan Deepak, Kulshreshtha Garvit, Singh Vartika, Casper Jared, Kautz Jan, Shoeybi Mohammad, Catanzaro Bryan
- The Instruction Hierarchy: Training Llms To Prioritize Privileged Instructions Wallace Eric, Xiao Kai, Leike Reimar, Weng Lilian, Heidecke Johannes, Beutel Alex
- Cot Rerailer: Enhancing The Reliability Of Large Language Models In Complex Reasoning Tasks Through Error Detection And Correction Wan Guangya, Wu Yuqi, Chen Jie, Li Sheng
- Dynamic Self-consistency: Leveraging Reasoning Paths For Efficient LLM Sampling Wan Guangya, Wu Yuqi, Chen Jie, Li Sheng
- Teach Better Or Show Smarter? On Instructions And Exemplars In Automatic Prompt Optimization Wan Xingchen, Sun Ruoxi, Nakhost Hootan, Arik Sercan O.
- A & B == B & A: Triggering Logical Reasoning Failures In Large Language Models Wan Yuxuan, Wang Wenxuan, Yang Yiliu, Yuan Youliang, Huang Jen-tse, He Pinjia, Jiao Wenxiang, Lyu Michael R.
- LOOK-M: Look-once Optimization In KV Cache For Efficient Multimodal Long-context Inference Wan Zhongwei, Wu Ziang, Liu Che, Huang Jinfa, Zhu Zhihong, Jin Peng, Wang Longyue, Yuan Li
- Fine-grained Self-endorsement Improves Factuality And Reasoning Wang Ante, Song Linfeng, Peng Baolin, Tian Ye, Jin Lifeng, Mi Haitao, Su Jinsong, Yu Dong
- Task Supportive And Personalized Human-large Language Model Interaction: A User Study Wang Ben, Liu Jiqun, Karimnazarov Jamshed, Thompson Nicolas
- Image Re-identification: Where Self-supervision Meets Vision-language Learning Wang Bin, Liang Yuying, Cai Lei, Huang Huakun, Zeng Huanqiang
- Grokked Transformers Are Implicit Reasoners: A Mechanistic Journey To The Edge Of Generalization Wang Boshi, Yue Xiang, Su Yu, Sun Huan
- Information Theoretic Text-to-image Alignment Wang Chao, Franzese Giulio, Finamore Alessandro, Gallo Massimo, Michiardi Pietro
- Probing The Safety Response Boundary Of Large Language Models Via Unsafe Decoding Path Generation Wang Haoyu, Wu Bingzhe, Bian Yatao, Chang Yongzhe, Wang Xueqian, Zhao Peilin
- M4U: Evaluating Multilingual Understanding And Reasoning For Large Multimodal Models Wang Hongyu, Xu Jiayu, Xie Senwei, Wang Ruiping, Li Jialin, Xie Zhaojie, Zhang Bin, Xiong Chuyan, Chen Xilin
- Target-constrained Bidirectional Planning For Generation Of Target-oriented Proactive Dialogue Wang Jian, Lin Dongding, Li Wenjie
- Chinese Metaphor Recognition Using A Multi-stage Prompting Large Language Model Wang Jie, Wang Jin, Zhang Xuejie
- The Whole Is Better Than The Sum: Using Aggregated Demonstrations In In-context Learning For Sequential Recommendation Wang Lei, Lim Ee-peng
- Interactive Dualchecker For Mitigating Hallucinations In Distilling Large Language Models Wang Meiyun, Suzuki Masahiro, Sakaji Hiroki, Izumi Kiyoshi
- Exploring The Potential Of Multimodal LLM With Knowledge-intensive Multimodal ASR Wang Minghan, Wang Yuxia, Vu Thuy-trang, Shareghi Ehsan, Haffari Gholamreza
- Effective Demonstration Annotation For In-context Learning Via Language Model-based Determinantal Point Process Wang Peng, Wang Xiaobin, Lou Chao, Mao Shengyu, Xie Pengjun, Jiang Yong
- Will The Real Linda Please Stand Up...to Large Language Models? Examining The Representativeness Heuristic In Llms Wang Pengda, Xiao Zilin, Chen Hanjie, Oswald Frederick L.
- Automated Generation Of Multiple-choice Cloze Questions For Assessing English Vocabulary Using Gpt-turbo 3.5 Wang Qiao, Rose Ralph, Orita Naho, Sugawara Ayaka
- Role Prompting Guided Domain Adaptation With General Capability Preserve For Large Language Models Wang Rui, Mi Fei, Chen Yi, Xue Boyang, Wang Hongru, Zhu Qi, Wong Kam-fai, Xu Ruifeng
- One Prompt Is Not Enough: Automated Construction Of A Mixture-of-expert Prompts Wang Ruochen, An Sohyun, Cheng Minhao, Zhou Tianyi, Hwang Sung Ju, Hsieh Cho-jui
- What Is The Best Way For Chatgpt To Translate Poetry? Wang Shanshan, Wong Derek F., Yao Jingming, Chao Lidia S.
- USCD: Improving Code Generation Of Llms By Uncertainty-aware Selective Contrastive Decoding Wang Shuai, Ding Liang, Shen Li, Luo Yong, He Zheng, Yu Wei, Tao Dacheng
- Using Advanced Llms To Enhance Smaller Llms: An Interpretable Knowledge Distillation Approach Wang Tong, Sudhir K., Hong Dat
- Secokd: Aligning Large Language Models For In-context Learning With Fewer Shots Wang Weixing, Yang Haojin, Meinel Christoph
- QUITO: Accelerating Long-context Reasoning Through Query-guided Context Compression Wang Wenshan, Wang Yihang, Fan Yixing, Liao Huaming, Guo Jiafeng
- Learning To Ask: When Llms Meet Unclear Instruction Wang Wenxuan, Shi Juluan, Wang Chaozheng, Lee Cheryl, Yuan Youliang, Huang Jen-tse, Lyu Michael R.
- FSM: A Finite State Machine Based Zero-shot Prompting Paradigm For Multi-hop Question Answering Wang Xiaochen, He Junqing, Yang Zhe, Wang Yiru, Meng Xiangdi, Pan Kunhao, Sui Zhifang
- Metatool: Facilitating Large Language Models To Master Tools With Meta-task Augmentation Wang Xiaohan, Li Dian, Zhao Yilin, Sinbadliu, Wang Hui
- Validating Llm-generated Programs With Metamorphic Prompt Testing Wang Xiaoyin, Zhu Dakai
- Rdrec: Rationale Distillation For Llm-based Recommendation Wang Xinfeng, Cui Jin, Suzuki Yoshimi, Fukumoto Fumiyo
- Executable Code Actions Elicit Better LLM Agents Wang Xingyao, Chen Yangyi, Yuan Lifan, Zhang Yizhe, Li Yunzhu, Peng Hao, Ji Heng
- "my Answer Is C": First-token Probabilities Do Not Match Text Answers In Instruction-tuned Language Models Wang Xinpeng, Ma Bolei, Hu Chengzhi, Weber-genzel Leon, Röttger Paul, Kreuter Frauke, Hovy Dirk, Plank Barbara
- Negativeprompt: Leveraging Psychology For Large Language Models Enhancement Via Negative Emotional Stimuli Wang Xu, Li Cheng, Chang Yi, Wang Jindong, Wu Yuan
- Chain-of-thought Reasoning Without Prompting Wang Xuezhi, Zhou Denny
- UBENCH: Benchmarking Uncertainty In Large Language Models With Multiple Choice Questions Wang Xunzhi, Zhang Zhuowei, Li Qiongyu, Chen Gaonan, Hu Mengting, Li Zhiyu, Luo Bitong, Gao Hang, Han Zhixin, Wang Haotian
- Unveiling Factual Recall Behaviors Of Large Language Models Through Knowledge Neurons Wang Yifei, Chen Yuheng, Wen Wanting, Sheng Yu, Li Linjing, Zeng Daniel Dajun
- A Theoretical Understanding Of Self-correction Through In-context Alignment Wang Yifei, Wu Yuyang, Wei Zeming, Jegelka Stefanie, Wang Yisen
- Defending Llms Against Jailbreaking Attacks Via Backtranslation Wang Yihan, Shi Zhouxing, Bai Andrew, Hsieh Cho-jui
- Speak It Out: Solving Symbol-related Problems With Symbol-to-language Conversion For Language Models Wang Yile, Cheng Sijie, Sun Zixin, Li Peng, Liu Yang
- Strategic Chain-of-thought: Guiding Accurate Reasoning In Llms Through Strategy Elicitation Wang Yu, Zhao Shiwan, Wang Zhihu, Huang Heyuan, Fan Ming, Zhang Yubo, Wang Zhixing, Wang Haijun, Liu Ting
- HPT++: Hierarchically Prompting Vision-language Models With Multi-granularity Knowledge Generation And Improved Structure Modeling Wang Yubin, Jiang Xinyang, Cheng De, Sun Wenli, Li Dongsheng, Zhao Cairong
- Sibyl: Simple Yet Effective Agent Framework For Complex Real-world Reasoning Wang Yulong, Shen Tianhao, Liu Lifeng, Xie Jian
- Taste: Teaching Large Language Models To Translate Through Self-reflection Wang Yutong, Zeng Jiali, Liu Xuebo, Meng Fandong, Zhou Jie, Zhang Min
- LSTP: Language-guided Spatial-temporal Prompt Learning For Long-form Video-text Understanding Wang Yuxuan, Wang Yueqian, Wu Pengfei, Liang Jianxin, Zhao Dongyan, Zheng Zilong
- To Recommend Or Not: Recommendability Identification In Conversations With Pre-trained Language Models Wang Zhefan, Ma Weizhi, Zhang Min
- Genartist: Multimodal LLM As An Agent For Unified Image Generation And Editing Wang Zhenyu, Li Aoxue, Li Zhenguo, Liu Xihui
- Re-task: Revisiting LLM Tasks From Capability, Skill, And Knowledge Perspectives Wang Zhihu, Zhao Shiwan, Wang Yu, Huang Heyuan, Shi Jiaxin, Xie Sitao, Wang Zhixing, Zhang Yubo, Li Hongyan, Yan Junchi
- Promptcharm: Text-to-image Generation Through Multi-modal Prompting And Refinement Wang Zhijie, Huang Yuheng, Song Da, Ma Lei, Zhang Tianyi
- Wordflow: Social Prompt Engineering For Large Language Models Wang Zijie J., Chakravarthy Aishwarya, Munechika David, Chau Duen Horng
- Farsight: Fostering Responsible AI Awareness During AI Application Prototyping Wang Zijie J., Kulkarni Chinmay, Wilcox Lauren, Terry Michael, Madaio Michael
- Eliminating Position Bias Of Language Models: A Mechanistic Approach Wang Ziqi, Zhang Hanlin, Li Xiner, Huang Kuan-hao, Han Chi, Ji Shuiwang, Kakade Sham M., Peng Hao, Ji Heng
- Poisoned Langchain: Jailbreak Llms By Langchain Wang Ziqiu, Liu Jun, Zhang Shengkai, Yang Yang
- Re2llm: Reflective Reinforcement Large Language Model For Session-based Recommendation Wang Ziyan, Du Yingpeng, Sun Zhu, Chua Haoyan, Feng Kaidong, Wang Wenya, Zhang Jie
- Large Language User Interfaces: Voice Interactive User Interfaces Powered By Llms Wasti Syed Mekael, Pu Ken Q., Neshati Ali
- Large Language Models Are Pattern Matchers: Editing Semi-structured And Structured Documents With Chatgpt Weber Irene
- Hidden In Plain Sight: Exploring Chat History Tampering In Interactive Language Models Wei Cheng'an, Zhao Yue, Gong Yujia, Chen Kai, Xiang Lu, Zhu Shenchen
- OPDAI At Semeval-2024 Task 6: Small Llms Can Accelerate Hallucination Detection With Weakly Supervised Data Wei Chengcheng, Chen Ze, Fang Songtan, He Jiarong, Gao Max
- Systematic Evaluation Of Llm-as-a-judge In LLM Alignment Tasks: Explainable Metrics And Diverse Prompt Templates Wei Hui, He Shenghua, Xia Tian, Wong Andy, Lin Jingyang, Han Mei
- Long-form Factuality In Large Language Models Wei Jerry, Yang Chengrun, Song Xinying, Lu Yifeng, Hu Nathan, Huang Jie, Tran Dustin, Peng Daiyi, Liu Ruibo, Huang Da, Du Cosmo, Le Quoc V.
- Measuring And Reducing LLM Hallucination Without Gold-standard Answers Wei Jiaheng, Yao Yuanshun, Ton Jean-francois, Guo Hongyi, Estornell Andrew, Liu Yang
- Promptmm: Multi-modal Knowledge Distillation For Recommendation With Prompt-tuning Wei Wei, Tang Jiabin, Jiang Yangqin, Xia Lianghao, Huang Chao
- Instructrag: Instructing Retrieval-augmented Generation Via Self-synthesized Rationales Wei Zhepei, Chen Wei-lin, Meng Yu
- Are Large Language Models More Empathetic Than Humans? Welivita Anuradha, Pu Pearl
- Characterizing LLM Abstention Behavior In Science QA With Context Perturbations Wen Bingbing, Howe Bill, Wang Lucy Lu
- Grounding Data Science Code Generation With Input-output Specifications Wen Yeming, Yin Pengcheng, Shi Kensen, Michalewski Henryk, Chaudhuri Swarat, Polozov Alex
- AGLA: Mitigating Object Hallucinations In Large Vision-language Models With Assembly Of Global And Local Attention Wenbin An, Feng Tian, Sicong Leng, Jiahao Nie, Haonan Lin, Qianying Wang, Guang Dai, Ping Chen, Shijian Lu
- Do Llamas Work In English? On The Latent Language Of Multilingual Transformers Wendler Chris, Veselovsky Veniamin, Monea Giovanni, West Robert
- Easy Problems That Llms Get Wrong Williams Sean, Huckle James
- CBR-RAG: Case-based Reasoning For Retrieval Augmented Generation In Llms For Legal Question Answering Wiratunga Nirmalie, Abeyratne Ramitha, Jayawardena Lasal, Martin Kyle, Massie Stewart, Nkisi-orji Ikechukwu, Weerasinghe Ruvan, Liret Anne, Fleisch Bruno
- Effects Of A Prompt Engineering Intervention On Undergraduate Students' AI Self-efficacy, AI Knowledge And Prompt Engineering Ability: A Mixed Methods Study Woo David James, Wang Deliang, Yung Tim, Guo Kai
- Revealing Fine-grained Values And Opinions In Large Language Models Wright Dustin, Arora Arnav, Borenstein Nadav, Yadav Srishti, Belongie Serge, Augenstein Isabelle
- Curriculum Learning With Quality-driven Data Selection Wu Biao, Meng Fang, Chen Ling
- EDA Corpus: A Large Language Model Dataset For Enhanced Interaction With Openroad Wu Bing-yue, Sharma Utsav, Kankipati Sai Rahul Dhanvi, Yadav Ajay, George Bintu Kappil, Guntupalli Sai Ritish, Rovinski Austin, Chhabria Vidya A.
- Towards Evaluating And Building Versatile Large Language Models For Medicine Wu Chaoyi, Qiu Pengcheng, Liu Jinxin, Gu Hongfei, Li Na, Zhang Ya, Wang Yanfeng, Xie Weidi
- Generative Monoculture In Large Language Models Wu Fan, Black Emily, Chandrasekaran Varun
- Core: Context-regularized Text Embedding Learning For Text-to-image Personalization Wu Feize, Pang Yun, Zhang Junyi, Pang Lianyu, Yin Jian, Zhao Baoquan, Li Qing, Mao Xudong
- Visionllm V2: An End-to-end Generalist Multimodal Large Language Model For Hundreds Of Vision-language Tasks Wu Jiannan, Zhong Muyan, Xing Sen, Lai Zeqiang, Liu Zhaoyang, Wang Wenhai, Chen Zhe, Zhu Xizhou, Lu Lewei, Lu Tong, Luo Ping, Qiao Yu, Dai Jifeng
- Can Large Language Models Understand Uncommon Meanings Of Common Words? Wu Jinyang, Che Feihu, Zheng Xinxin, Zhang Shuai, Jin Ruihan, Nie Shuai, Shao Pengpeng, Tao Jianhua
- Adapting Large Language Models For Document-level Machine Translation Wu Minghao, Vu Thuy-trang, Qu Lizhen, Foster George, Haffari Gholamreza
- Controlmllm: Training-free Visual Prompt Learning For Multimodal Large Language Models Wu Mingrui, Cai Xinyue, Ji Jiayi, Li Jiale, Huang Oucheng, Luo Gen, Fei Hao, Sun Xiaoshuai, Ji Rongrong
- GUI Action Narrator: Where And When Did That Action Take Place? Wu Qinchen, Gao Difei, Lin Kevin Qinghong, Wu Zhuoyu, Guo Xiangwu, Li Peiran, Zhang Weichen, Wang Hengxu, Shou Mike Zheng
- Avatar: Optimizing LLM Agents For Tool-assisted Knowledge Retrieval Wu Shirley, Zhao Shiyu, Huang Qian, Huang Kexin, Yasunaga Michihiro, Cao Kaidi, Ioannidis Vassilis N., Subbian Karthik, Leskovec Jure, Zou James
- Enhance Reasoning For Large Language Models In The Game Werewolf Wu Shuang, Zhu Liwen, Yang Tao, Xu Shiwei, Fu Qiang, Wei Yang, Fu Haobo
- Mind's Eye Of Llms: Visualization-of-thought Elicits Spatial Reasoning In Large Language Models Wu Wenshan, Mao Shaoguang, Zhang Yadong, Xia Yan, Dong Li, Cui Lei, Wei Furu
- Passage-specific Prompt Tuning For Passage Reranking In Question Answering With Large Language Models Wu Xuyang, Peng Zhiyuan, Sai Krishna Sravanthi Rajanala, Wu Hsin-tai, Fang Yi
- Agentkit: Structured LLM Reasoning With Dynamic Graphs Wu Yue, Fan Yewen, Min So Yeon, Prabhumoye Shrimai, Mcaleer Stephen, Bisk Yonatan, Salakhutdinov Ruslan, Li Yuanzhi, Mitchell Tom
- Longgenbench: Benchmarking Long-form Generation In Long Context Llms Wu Yuhao, Hee Ming Shan, Hu Zhiqing, Lee Roy Ka-wei
- Difflora: Generating Personalized Low-rank Adaptation Weights With Diffusion Wu Yujia, Shi Yiming, Wei Jiwei, Sun Chengwei, Zhou Yuyang, Yang Yang, Shen Heng Tao
- Instructing Large Language Models To Identify And Ignore Irrelevant Conditions Wu Zhenyu, Shen Chao, Jiang Meng
- Large Language Models Can Self-correct With Minimal Effort Wu Zhenyu, Zeng Qingkai, Zhang Zhihan, Tan Zhaoxuan, Shen Chao, Jiang Meng
- In-context Learning Can Re-learn Forbidden Tasks Xhonneux Sophie, Dobre David, Tang Jian, Gidel Gauthier, Sridhar Dhanya
- Badchain: Backdoor Chain-of-thought Prompting For Large Language Models Xiang Zhen, Jiang Fengqing, Xiong Zidi, Ramasubramanian Bhaskar, Poovendran Radha, Li Bo
- Verbalized Machine Learning: Revisiting Machine Learning With Language Models Xiao Tim Z., Bamler Robert, Schölkopf Bernhard, Liu Weiyang
- Non-instructional Fine-tuning: Enabling Instruction-following Capabilities In Pre-trained Language Models Without Instruction-following Data Xie Juncheng, Syu Shensian, Lee Hung-yi
- Autoad-zero: A Training-free Framework For Zero-shot Audio Description Xie Junyu, Han Tengda, Bain Max, Nagrani Arsha, Varol Gül, Xie Weidi, Zisserman Andrew
- Sorry-bench: Systematically Evaluating Large Language Model Safety Refusal Behaviors Xie Tinghao, Qi Xiangyu, Zeng Yi, Huang Yangsibo, Sehwag Udari Madhushani, Huang Kaixuan, He Luxi, Wei Boyi, Li Dacheng, Sheng Ying, Jia Ruoxi, Li Bo, Li Kai, Chen Danqi, Henderson Peter, Mittal Prateek
- Decomposition For Enhancing Attention: Improving Llm-based Text-to-sql Through Workflow Paradigm Xie Yuanzhen, Jin Xinzhou, Xie Tao, Lin Mingxiong, Chen Liang, Yu Chenyun, Cheng Lei, Zhuo Chengxiang, Hu Bo, Li Zang
- Gradsafe: Detecting Jailbreak Prompts For Llms Via Safety-critical Gradient Analysis Xie Yueqi, Fang Minghong, Pi Renjie, Gong Neil
- Calibrating Reasoning In Language Models With Internal Consistency Xie Zhihui, Guo Jizhou, Yu Tong, Li Shuai
- Order Matters In Hallucination: Reasoning Order As Benchmark And Reflexive Prompting For Large-language-models Xie Zikai
- RALL-E: Robust Codec Language Modeling With Chain-of-thought Prompting For Text-to-speech Synthesis Xin Detai, Tan Xu, Shen Kai, Ju Zeqian, Yang Dongchao, Wang Yuancheng, Takamichi Shinnosuke, Saruwatari Hiroshi, Liu Shujie, Li Jinyu, Zhao Sheng
- Theoretical Understanding Of In-context Learning In Shallow Transformers With Unstructured Data Xing Yue, Lin Xiaofeng, Xu Chenheng, Suh Namjoon, Song Qifan, Cheng Guang
- Llm-enhanced User-item Interactions: Leveraging Edge Information For Optimized Recommendations Xinyuan Wang, Liang Wu, Liangjie Hong, Hao Liu, Yanjie Fu
- Defensive Prompt Patch: A Robust And Interpretable Defense Of Llms Against Jailbreak Attacks Xiong Chen, Qi Xiangyu, Chen Pin-yu, Ho Tsung-yi
- Benchmarking Retrieval-augmented Generation For Medicine Xiong Guangzhi, Jin Qiao, Lu Zhiyong, Zhang Aidong
- Improving Retrieval-augmented Generation In Medicine With Iterative Follow-up Questions Xiong Guangzhi, Jin Qiao, Wang Xiao, Zhang Minjia, Lu Zhiyong, Zhang Aidong
- Interactive-kbqa: Multi-turn Interactions For Knowledge Base Question Answering With Large Language Models Xiong Guanming, Bao Junwei, Zhao Wen
- Building Math Agents With Multi-turn Iterative Preference Learning Xiong Wei, Shi Chengshuai, Shen Jiaming, Rosenberg Aviv, Qin Zhen, Calandriello Daniele, Khalman Misha, Joshi Rishabh, Piot Bilal, Saleh Mohammad, Jin Chi, Zhang Tong, Liu Tianqi
- Mixture-of-instructions: Comprehensive Alignment Of A Large Language Model Through The Mixture Of Diverse System Prompting Instructions Xu Bowen, Wu Shaoyu, Liu Kai, Hu Lulu
- Stylemaster: Towards Flexible Stylized Image Generation With Diffusion Models Xu Chengming, Hu Kai, Luo Donghao, Zhang Jiangning, Li Wei, Ge Yanhao, Wang Chengjie
- Empowering 1000 Tokens/second On-device LLM Prefilling With Mllm-npu Xu Daliang, Zhang Hao, Yang Liming, Liu Ruiqi, Huang Gang, Xu Mengwei, Liu Xuanzhe
- Redagent: Red Teaming Large Language Models With Context-aware Autonomous Language Agent Xu Huiyu, Zhang Wenhui, Wang Zhibo, Xiao Feng, Zheng Rui, Feng Yunhe, Ba Zhongjie, Ren Kui
- Faithful Logical Reasoning Via Symbolic Chain-of-thought Xu Jundong, Fei Hao, Pan Liangming, Liu Qian, Lee Mong-li, Hsu Wynne
- Prompting Large Language Models For Recommender Systems: A Comprehensive Framework And Empirical Analysis Xu Lanling, Zhang Junjie, Li Bingqian, Wang Jinpeng, Cai Mingchen, Zhao Wayne Xin, Wen Ji-rong
- From Introspection To Best Practices: Principled Analysis Of Demonstrations In Multimodal In-context Learning Xu Nan, Wang Fei, Zhang Sheng, Poon Hoifung, Chen Muhao
- On The Tip Of The Tongue: Analyzing Conceptual Representation In Large Language Models With Reverse-dictionary Probe Xu Ningyu, Zhang Qi, Zhang Menghan, Qian Peng, Huang Xuanjing
- Chatqa 2: Bridging The Gap To Proprietary Llms In Long Context And RAG Capabilities Xu Peng, Ping Wei, Wu Xianchao, Xu Chejian, Liu Zihan, Shoeybi Mohammad, Catanzaro Bryan
- Preemptive Answer "attacks" On Chain-of-thought Reasoning Xu Rongwu, Qi Zehan, Xu Wei
- Walking In Others' Shoes: How Perspective-taking Guides Large Language Models In Reducing Toxicity And Bias Xu Rongwu, Zhou Zi'an, Zhang Tianwei, Qi Zehan, Yao Su, Xu Ke, Xu Wei, Qiu Han
- Benchmarking Benchmark Leakage In Large Language Models Xu Ruijie, Wang Zengzhi, Fan Run-ze, Liu Pengfei
- Misconfidence-based Demonstration Selection For LLM In-context Learning Xu Shangqing, Zhang Chao
- Sayself: Teaching Llms To Express Confidence With Self-reflective Rationales Xu Tianyang, Wu Shujin, Diao Shizhe, Liu Xiaoze, Wang Xingyao, Chen Yangyi, Gao Jing
- Can Llms Solve Longer Math Word Problems Better? Xu Xin, Xiao Tong, Chao Zitong, Huang Zhenya, Yang Can, Wang Yang
- Linkprompt: Natural And Universal Adversarial Attacks On Prompt-based Language Models Xu Yue, Wang Wenjie
- Magpie: Alignment Data Synthesis From Scratch By Prompting Aligned Llms With Nothing Xu Zhangchen, Jiang Fengqing, Niu Luyao, Deng Yuntian, Poovendran Radha, Choi Yejin, Lin Bill Yuchen
- Bag Of Tricks: Benchmarking Of Jailbreak Attacks On Llms Xu Zhao, Liu Fan, Liu Hao
- Uncovering Safety Risks Of Large Language Models Through Concept Activation Vector Xu Zhihao, Huang Ruixuan, Chen Changyu, Wang Shuai, Wang Xiting
- Do Large Language Models Have Compositional Ability? An Investigation Into Limitations And Scalability Xu Zhuoyan, Shi Zhenmei, Liang Yingyu
- A Comprehensive Study Of Jailbreak Attack Versus Defense For Large Language Models Xu Zihao, Liu Yi, Deng Gelei, Li Yuekang, Picek Stjepan
- Improving Audio Codec-based Zero-shot Text-to-speech Synthesis With Multi-modal Context And Large Language Model Xue Jinlong, Deng Yayue, Han Yicheng, Gao Yingming, Li Ya
- Pythonsaga: Redefining The Benchmark To Evaluate Code Generating Llms Yadav Ankit, Beniwal Himanshu, Singh Mayank
- List Items One By One: A New Data Source And Learning Paradigm For Multimodal Llms Yan An, Yang Zhengyuan, Wu Junda, Zhu Wanrong, Yang Jianwei, Li Linjie, Lin Kevin, Wang Jianfeng, Mcauley Julian, Gao Jianfeng, Wang Lijuan
- Refutebench: Evaluating Refuting Instruction-following For Large Language Models Yan Jianhao, Luo Yun, Zhang Yue
- Do Large Language Models Understand Logic Or Just Mimick Context? Yan Junbing, Wang Chengyu, Huang Jun, Zhang Wei
- Contrastive Instruction Tuning Yan Tianyi Lorena, Wang Fei, Huang James Y., Zhou Wenxuan, Yin Fan, Galstyan Aram, Yin Wenpeng, Chen Muhao
- Predicting The Big Five Personality Traits In Chinese Counselling Dialogues Using Large Language Models Yan Yang, Ma Lizhi, Li Anqi, Ma Jingsong, Lan Zhenzhong
- Advancing Genai Assisted Programming--a Comparative Study On Prompt Efficiency And Code Quality Between GPT-4 And GLM-4 Yang Angus, Li Zehan, Li Jie
- Emphasising Structured Information: Integrating Abstract Meaning Representation Into Llms For Enhanced Open-domain Dialogue Evaluation Yang Bohao, Zhao Kun, Tang Chen, Liu Dong, Zhan Liang, Lin Chenghua
- MAPLE: Enhancing Review Generation With Multi-aspect Prompt Learning In Explainable Recommendation Yang Ching-wen, Chen Che Wei, Wu Kun-da, Xu Hao, Yao Jui-feng, Kao Hung-yu
- Aligning Llms Through Multi-perspective User Preference Ranking-based Feedback For Programming Question Answering Yang Hongyu, He Liyang, Hou Min, Shen Shuanghong, Li Rui, Hou Jiahui, Ma Jianhui, Zhao Junda
- Synthesizing Text-to-sql Data From Weak And Strong Llms Yang Jiaxi, Hui Binyuan, Yang Min, Yang Jian, Lin Junyang, Zhou Chang
- Metaaligner: Towards Generalizable Multi-objective Alignment Of Language Models Yang Kailai, Liu Zhiwei, Xie Qianqian, Huang Jimin, Zhang Tianlin, Ananiadou Sophia
- Mastering Text-to-image Diffusion: Recaptioning, Planning, And Generating With Multimodal Llms Yang Ling, Yu Zhaochen, Meng Chenlin, Xu Minkai, Ermon Stefano, Cui Bin
- Buffer Of Thoughts: Thought-augmented Reasoning With Large Language Models Yang Ling, Yu Zhaochen, Zhang Tianjun, Cao Shiyi, Xu Minkai, Zhang Wentao, Gonzalez Joseph E., Cui Bin
- Dual-phase Accelerated Prompt Optimization Yang Muchen, Li Moxin, Li Yongle, Chen Zijun, Gao Chongming, Zhang Junqi, Li Yangyang, Feng Fuli
- Regularizing Hidden States Enables Learning Generalizable Reward Model For Llms Yang Rui, Ding Ruomeng, Lin Yong, Zhang Huan, Zhang Tong
- GPT-4 As Evaluator: Evaluating Large Language Models On Pest Management In Agriculture Yang Shanglong, Yuan Zhipeng, Li Shunbao, Peng Ruoling, Liu Kang, Yang Po
- Video As The New Language For Real-world Decision Making Yang Sherry, Walker Jacob, Parker-holder Jack, Du Yilun, Bruce Jake, Barreto Andre, Abbeel Pieter, Schuurmans Dale
- Do Large Language Models Latently Perform Multi-hop Reasoning? Yang Sohee, Gribovskaya Elena, Kassner Nora, Geva Mor, Riedel Sebastian
- DGL: Dynamic Global-local Prompt Tuning For Text-video Retrieval Yang Xiangpeng, Zhu Linchao, Wang Xiaohan, Yang Yi
- Unleashing The Potential Of Large Language Models For Predictive Tabular Tasks In Data Science Yang Yazheng, Wang Yuqi, Sen Sankalok, Li Lei, Liu Qi
- Guardt2i: Defending Text-to-image Models From Adversarial Prompts Yang Yijun, Gao Ruiyuan, Yang Xiao, Zhong Jianyuan, Xu Qiang
- Pre-trained Language Models Improve The Few-shot Prompt Ability Of Decision Transformer Yang Yu, Xu Pan
- Exploring And Unleashing The Power Of Large Language Models In Automated Code Translation Yang Zhen, Liu Fang, Yu Zhongxing, Keung Jacky Wai, Li Jia, Liu Shuo, Hong Yifan, Ma Xiaoxue, Jin Zhi, Li Ge
- SOS! Soft Prompt Attack Against Open-source Large Language Models Yang Ziqing, Backes Michael, Zhang Yang, Salem Ahmed
- React Meets Actre: When Language Agents Enjoy Training Data Autonomy Yang Zonghan, Li Peng, Yan Ming, Zhang Ji, Huang Fei, Liu Yang
- Curiousllm: Elevating Multi-document QA With Reasoning-infused Knowledge Graph Prompting Yang Zukang, Zhu Zixuan
- CLAVE: An Adaptive Framework For Evaluating Values Of LLM Generated Responses Yao Jing, Yi Xiaoyuan, Xie Xing
- Large Language Models Are Contrastive Reasoners Yao Liang
- Enhancing In-context Learning Performance With Just Svd-based Weight Pruning: A Theoretical Perspective Yao Xinhao, Hu Xiaolin, Yang Shenzhi, Liu Yong
- Learning From Correctness Without Prompting Makes LLM Efficient Reasoner Yao Yuxuan, Wu Han, Guo Zhijiang, Zhou Biyan, Gao Jiahui, Luo Sichun, Hou Hanxu, Fu Xiaojin, Song Linqi
- Improving Visual Commonsense In Language Models Via Multiple Image Generation Yariv Guy, Schwartz Idan, Adi Yossi, Benaim Sagie
- R^2AG: Incorporating Retrieval Information Into Retrieval Augmented Generation Ye Fuda, Li Shuangyin, Zhang Yongqi, Chen Lei
- Dataframe QA: A Universal LLM Framework On Dataframe Question Answering Without Data Exposure Ye Junyi, Du Mengnan, Wang Guiling
- Harnessing Multimodal Large Language Models For Multimodal Sequential Recommendation Ye Yuyang, Zheng Zhi, Shen Yishan, Wang Tianshu, Zhang Hengruo, Zhu Peijun, Yu Runlong, Zhang Kai, Xiong Hui
- Long-context Language Modeling With Parallel Context Encoding Yen Howard, Gao Tianyu, Chen Danqi
- How Interpretable Are Reasoning Explanations From Prompting Large Language Models? Yeo Wei Jie, Satapathy Ranjan, Goh Rick Siow Mong, Cambria Erik
- Using Large Language Models To Create AI Personas For Replication And Prediction Of Media Effects: An Empirical Test Of 133 Published Experimental Research Findings Yeykelis Leo, Pichai Kaavya, Cummings James J., Reeves Byron
- Jailbreak Attacks And Defenses Against Large Language Models: A Survey Yi Sibo, Liu Yule, Sun Zhen, Cong Tianshuo, He Xinlei, Song Jiaxing, Xu Ke, Li Qi
- Fine-grained Detoxification Via Instance-level Prefixes For Large Language Models Yi Xin, Wang Linlin, Wang Xiaoling, He Liang
- MEND: Meta Demonstration Distillation For Efficient And Effective In-context Learning Yichuan Li, Xiyao Ma, Sixing Lu, Kyumin Lee, Xiaohu Liu, Chenlei Guo
- The Task-oriented Queries Benchmark (toqb) Yim Keun Soo
- MMAU: A Holistic Benchmark Of Agent Capabilities Across Diverse Domains Yin Guoli, Bai Haoping, Ma Shuang, Nan Feng, Sun Yanchao, Xu Zhaoyang, Ma Shen, Lu Jiarui, Kong Xiang, Zhang Aonan, Yap Dian Ang, Zhang Yizhe, Ahnert Karsten, Kamath Vik, Berglund Mathias, Walsh Dominic, Gindele Tobias, Wiest Juergen, Lai Zhengfeng, Wang Xiaoming, Shan Jiulong, Cao Meng, Pang Ruoming, Wang Zirui
- Relative Preference Optimization: Enhancing LLM Alignment Through Contrasting Responses Across Identical And Diverse Prompts Yin Yueqin, Wang Zhendong, Gu Yi, Huang Hai, Chen Weizhu, Zhou Mingyuan
- Aggregation Of Reasoning: A Hierarchical Framework For Enhancing Answer Selection In Large Language Models Yin Zhangyue, Sun Qiushi, Guo Qipeng, Zeng Zhiyuan, Li Xiaonan, Sun Tianxiang, Chang Cheng, Cheng Qinyuan, Wang Ding, Mou Xiaofeng, Qiu Xipeng, Huang Xuanjing
- Internlm-math: Open Math Large Language Models Toward Verifiable Reasoning Ying Huaiyuan, Zhang Shuo, Li Linyang, Zhou Zhejian, Shao Yunfan, Fei Zhaoye, Ma Yichuan, Hong Jiawei, Liu Kuikun, Wang Ziyi, Wang Yudong, Wu Zijian, Li Shuaibin, Zhou Fengzhe, Liu Hongwei, Zhang Songyang, Zhang Wenwei, Yan Hang, Qiu Xipeng, Wang Jiayu, Chen Kai, Lin Dahua
- Automating Dataset Updates Towards Reliable And Timely Evaluation Of Large Language Models Ying Jiahao, Cao Yixin, Bai Yushi, Sun Qianru, Wang Bo, Tang Wei, Ding Zhaojun, Yang Yizhe, Huang Xuanjing, Yan Shuicheng
- Large Language Model With Graph Convolution For Recommendation Yingpeng Du, Ziyan Wang, Zhu Sun, Haoyan Chua, Hongzhi Liu, Zhonghai Wu, Yining Ma, Jie Zhang, Youchen Sun
- Exploring The Comprehension Of Chatgpt In Traditional Chinese Medicine Knowledge Yizhen Li, Shaohan Huang, Jiaxing Qi, Lei Quan, Dongran Han, Zhongzhi Luan
- Debunc: Mitigating Hallucinations In Large Language Model Agent Communication With Uncertainty Estimations Yoffe Luke, Amayuelas Alfonso, Wang William Yang
- CSRT: Evaluation And Analysis Of Llms Using Code-switching Red-teaming Dataset Yoo Haneul, Yang Yongjin, Lee Hwaran
- C-TPT: Calibrated Test-time Prompt Tuning For Vision-language Models Via Text Feature Dispersion Yoon Hee Suk, Yoon Eunseop, Tee Joshua Tian Jin, Hasegawa-johnson Mark, Li Yingzhen, Yoo Chang D.
- By My Eyes: Grounding Multimodal Large Language Models With Sensor Data Via Visual Prompting Yoon Hyungjun, Tolera Biniyam Aschalew, Gong Taesik, Lee Kimin, Lee Sung-ju
- Improving Structural Diversity Of Blackbox Llms Via Chain-of-specification Prompting Young Halley, Zeng Yimeng, Gardner Jacob, Bastani Osbert
- Siam: Self-improving Code-assisted Mathematical Reasoning Of Large Language Models Yu Dian, Peng Baolin, Tian Ye, Song Linfeng, Mi Haitao, Yu Dong
- Cosafe: Evaluating Large Language Model Safety In Multi-turn Dialogue Coreference Yu Erxin, Li Jing, Liao Ming, Wang Siqi, Gao Zuchen, Mi Fei, Hong Lanqing
- Iterative Graph Alignment Yu Fangyuan, Arora Hardeep Singh, Johnson Matt
- Enhancing Healthcare Through Large Language Models: A Study On Medical Question Answering Yu Haoran, Yu Chang, Wang Zihan, Zou Dongxian, Qin Hao
- Ra-rec: An Efficient ID Representation Alignment Framework For Llm-based Recommendation Yu Xiaohan, Zhang Li, Zhao Xin, Wang Yue, Ma Zhongrui
- BEYOND DIALOGUE: A Profile-dialogue Alignment Framework Towards General Role-playing Language Model Yu Yeyong, Yu Runsheng, Wei Haojie, Zhang Zhanqiu, Qian Quan
- Mitigate Position Bias In Large Language Models Via Scaling A Single Dimension Yu Yijiong, Jiang Huiqiang, Luo Xufang, Wu Qianhui, Lin Chin-yew, Li Dongsheng, Yang Yuqing, Huang Yongfeng, Qiu Lili
- Don't Listen To Me: Understanding And Exploring Jailbreak Prompts Of Large Language Models Yu Zhiyuan, Liu Xiaogeng, Liang Shunning, Cameron Zach, Xiao Chaowei, Zhang Ning
- KV Cache Compression, But What Must We Give In Return? A Comprehensive Benchmark Of Long Context Capable Approaches Yuan Jiayi, Liu Hongyi, Zhong Shaochen (Henry), Chuang Yu-neng, Li Songchen, Wang Guanchu, Le Duy, Jin Hongye, Chaudhary Vipin, Xu Zhaozhuo, Liu Zirui, Hu Xia
- Focused Large Language Models Are Stable Many-shot Learners Yuan Peiwen, Feng Shaoxiong, Li Yiwei, Wang Xinglin, Zhang Yueqi, Tan Chuyi, Pan Boyuan, Wang Heda, Hu Yao, Li Kan
- Why Lift So Heavy? Slimming Large Language Models By Cutting Off The Layers Yuan Shuzhou, Nie Ercong, Ma Bolei, Färber Michael
- Self-rewarding Language Models Yuan Weizhe, Pang Richard Yuanzhe, Cho Kyunghyun, Li Xian, Sukhbaatar Sainbayar, Xu Jing, Weston Jason
- Refuse Whenever You Feel Unsafe: Improving Safety In Llms Via Decoupled Refusal Training Yuan Youliang, Jiao Wenxiang, Wang Wenxuan, Huang Jen-tse, Xu Jiahao, Liang Tian, He Pinjia, Tu Zhaopeng
- Question-analysis Prompting Improves LLM Performance In Reasoning Tasks Yugeswardeenoo Dharunish, Zhu Kevin, O'brien Sean
- Can Small Language Models Be Good Reasoners For Sequential Recommendation? Yuling Wang, Changxin Tian, Binbin Hu, Yanhua Yu, Ziqi Liu, Zhiqiang Zhang, Jun Zhou, Liang Pang, Xiao Wang
- Round Trip Translation Defence Against Large Language Model Jailbreaking Attacks Yung Canaan, Dolatabadi Hadi Mohaghegh, Erfani Sarah, Leckie Christopher
- Prompting Implicit Discourse Relation Annotation Yung Frances, Ahmad Mansoor, Scholman Merel, Demberg Vera
- Rec-gpt4v: Multimodal Recommendation With Large Vision-language Models Yuqing Liu, Yu Wang, Lichao Sun, Philip S. Yu
- WSC+: Enhancing The Winograd Schema Challenge Using Tree-of-experts Zahraei Pardis Sadat, Emami Ali
- Envgen: Generating And Adapting Environments Via Llms For Training Embodied Agents Zala Abhay, Cho Jaemin, Lin Han, Yoon Jaehong, Bansal Mohit
- Building Accurate Translation-tailored Llms With Language Aware Instruction Tuning Zan Changtong, Ding Liang, Shen Li, Zhen Yibing, Liu Weifeng, Tao Dacheng
- Understanding And Mitigating Compositional Issues In Text-to-image Generative Models Zarei Arman, Rezaei Keivan, Basu Samyadeep, Saberi Mehrdad, Moayeri Mazda, Kattakinda Priyatham, Feizi Soheil
- Tell Me Why: Explainable Public Health Fact-checking With Large Language Models Zarharan Majid, Wullschleger Pascal, Kia Babak Behkam, Pilehvar Mohammad Taher, Foster Jennifer
- Unused Information In Token Probability Distribution Of Generative LLM: Improving LLM Reading Comprehension Through Calculation Of Expected Values Zawistowski Krystian
- In-context Example Selection Via Similarity Search Improves Low-resource Machine Translation Zebaze Armel, Sagot Benoît, Bawden Rachel
- How Johnny Can Persuade Llms To Jailbreak Them: Rethinking Persuasion To Challenge AI Safety By Humanizing Llms Zeng Yi, Lin Hongpeng, Zhang Jingwen, Yang Diyi, Jia Ruoxi, Shi Weiyan
- Can Mllms Perform Text-to-image In-context Learning? Zeng Yuchen, Kang Wonjun, Chen Yicong, Koo Hyung Il, Lee Kangwook
- Online Self-preferring Language Models Zhai Yuanzhao, Zhang Zhuo, Xu Kele, Peng Hanyang, Yu Yue, Feng Dawei, Yang Cheng, Ding Bo, Wang Huaimin
- Unveiling The Lexical Sensitivity Of Llms: Combinatorial Optimization For Prompt Enhancement Zhan Pengwei, Xu Zhen, Tan Qian, Song Jie, Xie Ru
- Injecagent: Benchmarking Indirect Prompt Injections In Tool-integrated Large Language Model Agents Zhan Qiusi, Liang Zhixiang, Ying Zifan, Kang Daniel
- When Scaling Meets LLM Finetuning: The Effect Of Data, Model And Finetuning Method Zhang Biao, Liu Zhongtao, Cherry Colin, Firat Orhan
- Simple Techniques For Enhancing Sentence Embeddings In Generative Language Models Zhang Bowen, Chang Kehua, Li Chunping
- Notellm-2: Multimodal Large Representation Models For Recommendation Zhang Chao, Zhang Haoxin, Wu Shiwei, Wu Di, Xu Tong, Gao Yan, Hu Yao, Chen Enhong
- Teaching Large Language Models An Unseen Language On The Fly Zhang Chen, Liu Xiao, Lin Jiuheng, Feng Yansong
- Goal-guided Generative Prompt Injection Attack On Large Language Models Zhang Chong, Jin Mingyu, Yu Qinkai, Liu Chengzhi, Xue Haochen, Jin Xiaobo
- Extracting Prompts By Inverting LLM Outputs Zhang Collin, Morris John X., Shmatikov Vitaly
- Causal Prompting: Debiasing Large Language Model Prompting Based On Front-door Adjustment Zhang Congzhi, Zhang Linhai, Wu Jialong, Zhou Deyu, He Yulan
- Coe-sql: In-context Learning For Multi-turn Text-to-sql With Chain-of-editions Zhang Hanchong, Cao Ruisheng, Xu Hongshen, Chen Lu, Yu Kai
- Language Model Prompt Selection Via Simulation Optimization Zhang Haoting, He Jinghai, Righter Rhonda, Zheng Zeyu
- Paying More Attention To Source Context: Mitigating Unfaithful Translations From Large Language Model Zhang Hongbin, Chen Kehai, Bai Xuefeng, Xiang Yang, Zhang Min
- Enja: Ensemble Jailbreak On Large Language Models Zhang Jiahao, Wang Zilong, Wang Ruofan, Ma Xingjun, Jiang Yu-gang
- Hire A Linguist!: Learning Endangered Languages With In-context Linguistic Descriptions Zhang Kexun, Choi Yee Man, Song Zhenqiao, He Taiqi, Wang William Yang, Li Lei
- SPL: A Socratic Playground For Learning Powered By Large Language Model Zhang Liang, Lin Jionghao, Kuang Ziyi, Xu Sheng, Yeasin Mohammed, Hu Xiangen
- Attention Instruction: Amplifying Attention In The Middle Via Prompting Zhang Meiru, Meng Zaiqiao, Collier Nigel
- The Impact Of Demonstrations On Multilingual In-context Learning: A Multidimensional Analysis Zhang Miaoran, Gautam Vagrant, Wang Mingyang, Alabi Jesujoba O., Shen Xiaoyu, Klakow Dietrich, Mosbach Marius
- Mindspeech: Continuous Imagined Speech Decoding Using High-density Fnirs And Prompt Tuning For Advanced Human-ai Interaction Zhang Suyi, Alam Ekram, Baber Jack, Bianco Francesca, Turner Edward, Chamanzar Maysam, Dehghani Hamid
- R4: Reinforced Retriever-reorder-responder For Retrieval-augmented Large Language Models Zhang Taolin, Li Dongyang, Chen Qizhou, Wang Chengyu, Huang Longtao, Xue Hui, He Xiaofeng, Huang Jun
- Improving Diversity Of Commonsense Generation By Large Language Models Via In-context Learning Zhang Tianhui, Peng Bei, Bollegala Danushka
- In-context Principle Learning From Mistakes Zhang Tianjun, Madaan Aman, Gao Luyu, Zheng Steven, Mishra Swaroop, Yang Yiming, Tandon Niket, Alon Uri
- RAFT: Adapting Language Model To Domain Specific RAG Zhang Tianjun, Patil Shishir G., Jain Naman, Shen Sheng, Zaharia Matei, Stoica Ion, Gonzalez Joseph E.
- Promptfix: Few-shot Backdoor Removal Via Adversarial Prompt Tuning Zhang Tianrong, Xi Zhaohan, Wang Ting, Mitra Prasenjit, Chen Jinghui
- Revisiting OPRO: The Limitations Of Small-scale Llms As Optimizers Zhang Tuo, Yuan Jinyue, Avestimehr Salman
- Interpreting And Improving Large Language Models In Arithmetic Calculation Zhang Wei, Wan Chaoqun, Zhang Yonggang, Cheung Yiu-ming, Tian Xinmei, Shen Xu, Ye Jieping
- Agent-pro: Learning To Evolve Via Policy-level Reflection And Optimization Zhang Wenqi, Tang Ke, Wu Hai, Wang Mengna, Shen Yongliang, Hou Guiyang, Tan Zeqi, Li Peng, Zhuang Yueting, Lu Weiming
- Event Temporal Relation Extraction Based On Retrieval-augmented On Llms Zhang Xiaobin, Zang Liangjun, Liu Qianwen, Wei Shuchong, Hu Songlin
- From Redundancy To Relevance: Enhancing Explainability In Multimodal Large Language Models Zhang Xiaofeng, Shen Chen, Yuan Xiaosong, Yan Shaotian, Xie Liang, Wang Wenxiao, Gu Chaochen, Tang Hao, Ye Jieping
- Large Language Models As Evaluators For Recommendation Explanations Zhang Xiaoyu, Li Yishan, Wang Jiayin, Sun Bowen, Ma Weizhi, Sun Peijie, Zhang Min
- Dpp-based Adversarial Prompt Searching For Language Models Zhang Xu, Wan Xiaojun
- Glape: Gold Label-agnostic Prompt Evaluation And Optimization For Large Language Model Zhang Xuanchang, Zhang Zhuosheng, Zhao Hai
- TREACLE: Thrifty Reasoning Via Context-aware LLM And Prompt Selection Zhang Xuechen, Huang Zijian, Taga Ege Onur, Joe-wong Carlee, Oymak Samet, Chen Jiasi
- Boosting Jailbreak Attack With Momentum Zhang Yihao, Wei Zeming
- How Far Are We From Intelligent Visual Deductive Reasoning? Zhang Yizhe, Bai He, Zhang Ruixiang, Gu Jiatao, Zhai Shuangfei, Susskind Josh, Jaitly Navdeep
- Navigating User Experience Of Chatgpt-based Conversational Recommender Systems: The Effects Of Prompt Guidance And Recommendation Domain Zhang Yizhe, Jin Yucheng, Chen Li, Yang Ting
- An Examination On The Effectiveness Of Divide-and-conquer Prompting In Large Language Models Zhang Yizhou, Du Lun, Cao Defu, Fu Qiang, Liu Yan
- Leveraging Biases In Large Language Models: "bias-knn" For Effective Few-shot Learning Zhang Yong, Li Hanzhang, Li Zhitao, Cheng Ning, Li Ming, Xiao Jing, Wang Jianzong
- Mg-verilog: Multi-grained Dataset Towards Enhanced Llm-assisted Verilog Generation Zhang Yongan, Yu Zhongzhi, Fu Yonggan, Wan Cheng, Lin Yingyan Celine
- Autocap: Towards Automatic Cross-lingual Alignment Planning For Zero-shot Chain-of-thought Zhang Yongheng, Chen Qiguang, Li Min, Che Wanxiang, Qin Libo
- Prompt-aware Adapter: Towards Learning Adaptive Visual Tokens For Multimodal Large Language Models Zhang Yue, Fan Hehe, Yang Yi
- Pattern-aware Chain-of-thought Prompting In Large Language Models Zhang Yufeng, Wang Xuepeng, Wu Lingxiang, Wang Jinqiao
- OT-VP: Optimal Transport-guided Visual Prompting For Test-time Adaptation Zhang Yunbei, Mehra Akshay, Hamm Jihun
- Small Language Models Need Strong Verifiers To Self-correct Reasoning Zhang Yunxiang, Khalifa Muhammad, Logeswaran Lajanugen, Kim Jaekyeom, Lee Moontae, Lee Honglak, Wang Lu
- Understanding The Relationship Between Prompts And Response Uncertainty In Large Language Models Zhang Ze Yu, Verma Arun, Doshi-velez Finale, Low Bryan Kian Hsiang
- Generating Chain-of-thoughts With A Pairwise-comparison Approach To Searching For The Most Promising Intermediate Thought Zhang Zhen-yu, Han Siwei, Yao Huaxiu, Niu Gang, Sugiyama Masashi
- Position-aware Parameter Efficient Fine-tuning Approach For Reducing Positional Bias In Llms Zhang Zheng, Yang Fan, Jiang Ziyan, Chen Zheng, Zhao Zhengyang, Ma Chengyuan, Zhao Liang, Liu Yang
- E-bench: Towards Evaluating The Ease-of-use Of Large Language Models Zhang Zhenyu, Hao Bingguang, Li Jinpeng, Zhang Zekai, Zhao Dongyan
- Safe Unlearning: A Surprisingly Effective And Generalizable Solution To Defend Against Jailbreak Attacks Zhang Zhexin, Yang Junxiao, Ke Pei, Cui Shiyao, Zheng Chujie, Wang Hongning, Huang Minlie
- SELF-GUIDE: Better Task-specific Instruction Following Via Self-synthetic Finetuning Zhao Chenyang, Jia Xueying, Viswanathan Vijay, Wu Tongshuang, Neubig Graham
- LANE: Logic Alignment Of Non-tuning Large Language Models And Online Recommendation Systems For Explainable Reason Generation Zhao Hongke, Zheng Songming, Wu Likang, Yu Bowen, Wang Jing
- Revolutionizing Finance With Llms: An Overview Of Applications And Insights Zhao Huaqin, Liu Zhengliang, Wu Zihao, Li Yiwei, Yang Tianze, Shu Peng, Xu Shaochen, Dai Haixing, Zhao Lin, Mai Gengchen, Liu Ninghao, Liu Tianming
- Prefix Guidance: A Steering Wheel For Large Language Models To Defend Against Jailbreak Attacks Zhao Jiawei, Chen Kejiang, Yuan Xiaojian, Zhang Weiming
- Exploring The Compositional Deficiency Of Large Language Models In Mathematical Reasoning Zhao Jun, Tong Jingqi, Mou Yurong, Zhang Ming, Zhang Qi, Huang Xuanjing
- Prepacking: A Simple Method For Fast Prefilling And Increased Throughput In Large Language Models Zhao Siyan, Israel Daniel, Broeck Guy Van Den, Grover Aditya
- Defending Large Language Models Against Jailbreak Attacks Via Layer-specific Editing Zhao Wei, Li Zhe, Li Yige, Zhang Ye, Sun Jun
- Wildhallucinations: Evaluating Long-form Factuality In Llms With Real-world Entity Queries Zhao Wenting, Goyal Tanya, Chiu Yu Ying, Jiang Liwei, Newman Benjamin, Ravichander Abhilasha, Chandu Khyathi, Bras Ronan Le, Cardie Claire, Deng Yuntian, Choi Yejin
- Wildchat: 1M Chatgpt Interaction Logs In The Wild Zhao Wenting, Ren Xiang, Hessel Jack, Cardie Claire, Choi Yejin, Deng Yuntian
- Fact-and-reflection (far) Improves Confidence Calibration Of Large Language Models Zhao Xinran, Zhang Hongming, Pan Xiaoman, Yao Wenlin, Yu Dong, Wu Tongshuang, Chen Jianshu
- Codev: Empowering Llms For Verilog Generation Through Multi-level Summarization Zhao Yang, Huang Di, Li Chongxiao, Jin Pengwei, Nan Ziyuan, Ma Tianyun, Qi Lei, Pan Yansong, Zhang Zhenxing, Zhang Rui, Zhang Xishan, Du Zidong, Guo Qi, Hu Xing, Chen Yunji
- Adversarial Contrastive Decoding: Boosting Safety Alignment Of Large Language Models Via Opposite Prompt Optimization Zhao Zhengyue, Zhang Xiaoyun, Xu Kaidi, Hu Xing, Zhang Rui, Du Zidong, Guo Qi, Chen Yunji
- Retrieval-augmented Mixture Of Lora Experts For Uploadable Machine Learning Zhao Ziyu, Gan Leilei, Wang Guoyin, Hu Yuwei, Shen Tao, Yang Hongxia, Kuang Kun, Wu Fei
- Loraretriever: Input-aware Lora Retrieval And Composition For Mixed Tasks In The Wild Zhao Ziyu, Gan Leilei, Wang Guoyin, Zhou Wangchunshu, Yang Hongxia, Kuang Kun, Wu Fei
- Leveraging Large Language Models With Chain-of-thought And Prompt Engineering For Traffic Crash Severity Analysis And Inference Zhen Hao, Shi Yucheng, Huang Yongcan, Yang Jidong J., Liu Ninghao
- Gpt-4v(ision) Is A Generalist Web Agent, If Grounded Zheng Boyuan, Gou Boyu, Kil Jihyung, Sun Huan, Su Yu
- Balancing Enhancement, Harmlessness, And General Capabilities: Enhancing Conversational Llms With Direct RLHF Zheng Chen, Sun Ke, Wu Hang, Xi Chenguang, Zhou Xun
- On Prompt-driven Safeguarding For Large Language Models Zheng Chujie, Yin Fan, Zhou Hao, Meng Fandong, Zhou Jie, Chang Kai-wei, Huang Minlie, Peng Nanyun
- Large Language Models As Reliable Knowledge Bases? Zheng Danna, Lapata Mirella, Pan Jeff Z.
- Trustscore: Reference-free Evaluation Of LLM Response Trustworthiness Zheng Danna, Liu Danyang, Lapata Mirella, Pan Jeff Z.
- Fine-tuning Large Language Models For Domain-specific Machine Translation Zheng Jiawei, Hong Hanghai, Wang Xiaoli, Su Jingsong, Liang Yonggui, Wu Shikai
- Can Llms Learn New Concepts Incrementally Without Forgetting? Zheng Junhao, Qiu Shengjie, Ma Qianli
- PAS: Data-efficient Plug-and-play Prompt Augmentation System Zheng Miao, Liang Hao, Yang Fan, Sun Haoze, Li Tianpeng, Xiong Lingchu, Zhang Yan, Wu Youzhen, Li Kun, Shen Yanjun, Lin Mingan, Zhang Tao, Dong Guosheng, Qiao Yujing, Fang Kun, Chen Weipeng, Cui Bin, Zhang Wentao, Zhou Zenan
- Clr-fact: Evaluating The Complex Logical Reasoning Capability Of Large Language Models Over Factual Knowledge Zheng Tianshi, Bai Jiaxin, Wang Yicheng, Fang Tianqing, Guo Yue, Yim Yauwai, Song Yangqiu
- Critic-cot: Boosting The Reasoning Abilities Of Large Language Model Via Chain-of-thoughts Critic Zheng Xin, Lou Jie, Cao Boxi, Wen Xueru, Ji Yuqiu, Lin Hongyu, Lu Yaojie, Han Xianpei, Zhang Debing, Sun Le
- Harnessing Large Language Models For Text-rich Sequential Recommendation Zheng Zhi, Chao Wenshuo, Qiu Zhaopeng, Zhu Hengshu, Xiong Hui
- Exploring Boundary Of GPT-4V On Marine Analysis: A Preliminary Case Study Zheng Ziqiang, Chen Yiwei, Zhang Jipeng, Vu Tuan-anh, Zeng Huimin, Wong Tim Yue Him, Yeung Sai-kit
- ROSE Doesn't Do That: Boosting The Safety Of Instruction-tuned Large Language Models With Reverse Prompt Contrastive Decoding Zhong Qihuang, Ding Liang, Liu Juhua, Du Bo, Tao Dacheng
- Cultural Value Differences Of Llms: Prompt, Language, And Model Size Zhong Qishuai, Yun Yike, Sun Aixin
- Enhancing Multimodal Large Language Models With Multi-instance Visual Prompt Generator For Visual Representation Enrichment Zhong Wenliang, Wu Wenyi, Li Qi, Barton Rob, Du Boxin, Sam Shioulin, Bouyarmane Karim, Tutar Ismail, Huang Junzhou
- Harnessing Large Language Models As Post-hoc Correctors Zhong Zhiqiang, Zhou Kuangyu, Mottin Davide
- Synthet2c: Generating Synthetic Data For Fine-tuning Large Language Models On The Text2cypher Task Zhong Ziije, Zhong Linqing, Sun Zhaoze, Jin Qingyun, Qin Zengchang, Zhang Xiaofan
- Robust Prompt Optimization For Defending Language Models Against Jailbreaking Attacks Zhou Andy, Li Bo, Wang Haohan
- Fairer Preferences Elicit Improved Human-aligned Large Language Model Judgments Zhou Han, Wan Xingchen, Liu Yinhong, Collier Nigel, Vulić Ivan, Korhonen Anna
- Unibias: Unveiling And Mitigating LLM Bias Through Internal Attention And FFN Manipulation Zhou Hanzhang, Feng Zijian, Zhu Zixiao, Qian Junlang, Mao Kezhi
- Sequence To Sequence Reward Modeling: Improving RLHF By Language Feedback Zhou Jiayi, Ji Jiaming, Dai Juntao, Yang Yaodong
- On Speeding Up Language Model Evaluation Zhou Jin Peng, Belardi Christian K., Wu Ruihan, Zhang Travis, Gomes Carla P., Sun Wen, Weinberger Kilian Q.
- An LLM Feature-based Framework For Dialogue Constructiveness Assessment Zhou Lexin, Farag Youmna, Vlachos Andreas
- Self-discover: Large Language Models Self-compose Reasoning Structures Zhou Pei, Pujara Jay, Ren Xiang, Chen Xinyun, Cheng Heng-tze, Le Quoc V., Chi Ed H., Zhou Denny, Mishra Swaroop, Zheng Huaixiu Steven
- Image-of-thought Prompting For Visual Reasoning Refinement In Multimodal Large Language Models Zhou Qiji, Zhou Ruochen, Hu Zike, Lu Panzhong, Gao Siyang, Zhang Yue
- Enhancing The General Agent Capabilities Of Low-parameter Llms Through Tuning And Multi-branch Reasoning Zhou Qinhao, Zhang Zihan, Xiang Xiang, Wang Ke, Wu Yuchuan, Li Yongbin
- Evaluating The Smooth Control Of Attribute Intensity In Text Generation With Llms Zhou Shang, Yao Feng, Dong Chengyu, Wang Zihan, Shang Jingbo
- Explaining Pre-trained Language Models With Attribution Scores: An Analysis In Low-resource Settings Zhou Wei, Adel Heike, Schuff Hendrik, Vu Ngoc Thang
- Using Pretrained Large Language Model With Prompt Engineering To Answer Biomedical Questions Zhou Wenxin, Ngo Thuy Hang
- Sirius: Contextual Sparsity With Correction For Efficient Llms Zhou Yang, Chen Zhuoming, Xu Zhaozhuo, Lin Victoria, Chen Beidi
- Archer: Training Language Model Agents Via Hierarchical Multi-turn RL Zhou Yifei, Zanette Andrea, Pan Jiayi, Levine Sergey, Kumar Aviral
- Shared Imagination: Llms Hallucinate Alike Zhou Yilun, Xiong Caiming, Savarese Silvio, Wu Chien-sheng
- Few-shot Adversarial Prompt Learning On Vision-language Models Zhou Yiwei, Xia Xiaobo, Lin Zhiwei, Han Bo, Liu Tongliang
- Aligning Modalities In Vision Large Language Models Via Preference Fine-tuning Zhou Yiyang, Cui Chenhang, Rafailov Rafael, Finn Chelsea, Yao Huaxiu
- Defending Jailbreak Prompts Via In-context Adversarial Game Zhou Yujun, Han Yufei, Zhuang Haomin, Guo Kehan, Liang Zhenwen, Bao Hongyan, Zhang Xiangliang
- Don't Say No: Jailbreaking LLM By Suppressing Refusal Zhou Yukai, Wang Wenjie
- DETAIL: Task Demonstration Attribution For Interpretable In-context Learning Zhou Zijian, Lin Xiaoqiang, Xu Xinyi, Prakash Alok, Rus Daniela, Low Bryan Kian Hsiang
- Eliciting The Priors Of Large Language Models Using Iterated In-context Learning Zhu Jian-qiao, Griffiths Thomas L.
- Collectivesft: Scaling Large Language Models For Chinese Medical Benchmark With Collective Instructions In Healthcare Zhu Jingwei, Tan Minghuan, Yang Min, Li Ruixue, Alinejad-rokny Hamid
- Fastmem: Fast Memorization Of Prompt Improves Context Awareness Of Large Language Models Zhu Junyi, Liu Shuochen, Yu Yu, Tang Bo, Yan Yibo, Li Zhiyu, Xiong Feiyu, Xu Tong, Blaschko Matthew B.
- Dynamic Evaluation Of Large Language Models By Meta Probing Agents Zhu Kaijie, Wang Jindong, Zhao Qinlin, Xu Ruochen, Xie Xing
- Relayattention For Efficient Large Language Model Serving With Long System Prompts Zhu Lei, Wang Xinjiang, Zhang Wayne, Lau Rynson W. H.
- A Llm-based Controllable, Scalable, Human-involved User Simulator Framework For Conversational Recommender Systems Zhu Lixi, Huang Xiaowen, Sang Jitao
- Parameter-efficient Tuning Large Language Models For Graph Representation Learning Zhu Qi, Zheng Da, Song Xiang, Zhang Shichang, Jin Bowen, Sun Yizhou, Karypis George
- Quite Good, But Not Enough: Nationality Bias In Large Language Models -- A Case Study Of Chatgpt Zhu Shucheng, Wang Weikang, Liu Ying
- IAPT: Instruction-aware Prompt Tuning For Large Language Models Zhu Wei, Tian Aaron Xuxiang, Yin Congrui, Ni Yuan, Wang Xiaoling, Xie Guotong
- Can Large Language Models Understand Context? Zhu Yilun, Moniz Joel Ruben Antony, Bhargava Shruti, Lu Jiarui, Piraviperumal Dhivya, Li Site, Zhang Yuan, Yu Hong, Tseng Bo-hsiang
- Apt-pipe: A Prompt-tuning Tool For Social Data Annotation Using Chatgpt Zhu Yiming, Yin Zhizhuo, Tyson Gareth, Haq Ehsan-ul, Lee Lik-hang, Hui Pan
- Is Larger Always Better? Evaluating And Prompting Large Language Models For Non-generative Medical Tasks Zhu Yinghao, Gao Junyi, Wang Zixiang, Liao Weibin, Zheng Xiaochen, Liang Lifang, Wang Yasha, Pan Chengwei, Harrison Ewen M., Ma Liantao
- Prompting Large Language Models For Zero-shot Clinical Prediction With Structured Longitudinal Electronic Health Record Data Zhu Yinghao, Wang Zixiang, Gao Junyi, Tong Yuning, An Jingkun, Liao Weibin, Harrison Ewen M., Ma Liantao, Pan Chengwei
- Are Large Language Models Good Statisticians? Zhu Yizhang, Du Shiyin, Li Boyan, Luo Yuyu, Tang Nan
- Generative Pre-trained Speech Language Model With Efficient Hierarchical Transformer Zhu Yongxin, Su Dan, He Liqiang, Xu Linli, Yu Dong
- Efficient Test-time Prompt Tuning For Vision-language Models Zhu Yuhan, Zhang Guozhen, Xu Chen, Shen Haocheng, Chen Xiaoxin, Wu Gangshan, Wang Limin
- Accelerating Inference Of Retrieval-augmented Generation Via Sparse Context Selection Zhu Yun, Gu Jia-chen, Sikora Caitlin, Ko Ho, Liu Yinxiao, Lin Chu-cheng, Shu Lei, Luo Liangchen, Meng Lei, Liu Bang, Chen Jindong
- One Token Can Help! Learning Scalable And Pluggable Virtual Tokens For Retrieval-augmented Large Language Models Zhu Yutao, Huang Zhaoheng, Dou Zhicheng, Wen Ji-rong
- INTERS: Unlocking The Power Of Large Language Models In Search With Instruction Tuning Zhu Yutao, Zhang Peitian, Zhang Chenghao, Chen Yifei, Xie Binyu, Liu Zheng, Wen Ji-rong, Dou Zhicheng
- Sketch Then Generate: Providing Incremental User Feedback And Guiding LLM Code Generation Through Language-oriented Code Sketches Chen Zhu-tian, Xiong Zeyu, Yao Xiaoshuo, Glassman Elena
- VL-ICL Bench: The Devil In The Details Of Benchmarking Multimodal In-context Learning Zong Yongshuo, Bohdal Ondrej, Hospedales Timothy
- Llama-excitor: General Instruction Tuning Via Indirect Feature Interaction Zou Bo, Yang Chao, Qiao Yu, Quan Chengbin, Zhao Youjian
- Vgbench: Evaluating Large Language Models On Vector Graphics Understanding And Generation Zou Bocheng, Cai Mu, Zhang Jianrui, Lee Yong Jae
- Promptintern: Saving Inference Costs By Internalizing Recurrent Prompt During Large Language Model Fine-tuning Zou Jiaru, Zhou Mengyu, Li Tao, Han Shi, Zhang Dongmei
- Llms For XAI: Future Directions For Explaining Explanations Zytek Alexandra, Pidò Sara, Veeramachaneni Kalyan
- Prompt-time Ontology-driven Symbolic Knowledge Capture With Large Language Models Çöplü Tolga, Bendiken Arto, Skomorokhov Andrii, Bateiko Eduard, Cobb Stephen
- Enhancing Visual Question Answering Through Question-driven Image Captions As Prompts Özdemir Övgü, Akagündüz Erdem
- Systematic Task Exploration With Llms: A Study In Citation Text Generation Şahinuç Furkan, Kuznetsov Ilia, Hou Yufang, Gurevych Iryna
- Concept-aware Data Construction Improves In-context Learning Of Language Models Štefánik Michal, Kadlčík Marek, Sojka Petr
🏷 Pruning
- Reweighted Proximal Pruning For Large-scale Language Representation Guo Fu-ming, Liu Sijia, Mungall Finlay S., Lin Xue, Wang Yanzhi
- TAB-VCR: Tags And Attributes Based Visual Commonsense Reasoning Baselines Lin Jingxiang, Jain Unnat, Schwing Alexander G.
- Structured Pruning Of A Bert-based Question Answering Model Mccarley J. S., Chakravarti Rishav, Sil Avirup
- Structured Pruning Of Large Language Models Wang Ziheng, Wohlwend Jeremy, Lei Tao
- Integrated Triaging For Fast Reading Comprehension Wu Felix, Li Boyi, Wang Lequn, Lao Ni, Blitzer John, Weinberger Kilian Q.
- Accelerating Natural Language Understanding In Task-oriented Dialog Ahuja Ojas, Desai Shrey
- Efficient Transformer-based Large Scale Language Representations Using Hardware-friendly Block Structured Pruning Li Bingbing, Kong Zhenglun, Zhang Tianyun, Li Ji, Li Zhengang, Liu Hang, Ding Caiwen
- Train Large, Then Compress: Rethinking Model Size For Efficient Training And Inference Of Transformers Li Zhuohan, Wallace Eric, Shen Sheng, Lin Kevin, Keutzer Kurt, Klein Dan, Gonzalez Joseph E.
- Ladabert: Lightweight Adaptation Of BERT Through Hybrid Model Compression Mao Yihuan, Wang Yujing, Wu Chufan, Zhang Chen, Wang Yang, Yang Yaming, Zhang Quanlu, Tong Yunhai, Bai Jing
- Axformer: Accuracy-driven Approximation Of Transformers For Faster, Smaller And More Accurate NLP Models Nagarajan Amrit, Sen Sanchari, Stevens Jacob R., Raghunathan Anand
- On The Effect Of Dropping Layers Of Pre-trained Transformer Models Sajjad Hassan, Dalvi Fahim, Durrani Nadir, Nakov Preslav
- Edgebert: Sentence-level Energy Optimizations For Latency-aware Multi-task NLP Inference Tambe Thierry, Hooper Coleman, Pentecost Lillian, Jia Tianyu, Yang En-yu, Donato Marco, Sanh Victor, Whatmough Paul N., Rush Alexander M., Brooks David, Wei Gu-yeon
- Lite Transformer With Long-short Range Attention Wu Zhanghao, Liu Zhijian, Lin Ji, Lin Yujun, Han Song
- Know What You Don't Need: Single-shot Meta-pruning For Attention Heads Zhang Zhengyan, Qi Fanchao, Liu Zhiyuan, Liu Qun, Sun Maosong
- On The Prunability Of Attention Heads In Multilingual BERT Budhraja Aakriti, Pande Madhura, Kumar Pratyush, Khapra Mitesh M.
- Robustness Challenges In Model Distillation And Pruning For Natural Language Understanding Du Mengnan, Mukherjee Subhabrata, Cheng Yu, Shokouhi Milad, Hu Xia, Awadallah Ahmed Hassan
- The Niutrans System For WNGT 2020 Efficiency Task Hu Chi, Li Bei, Lin Ye, Li Yinqiao, Li Yanyang, Wang Chenglong, Xiao Tong, Zhu Jingbo
- Learned Token Pruning For Transformers Kim Sehoon, Shen Sheng, Thorsley David, Gholami Amir, Kwon Woosuk, Hassoun Joseph, Keutzer Kurt
- BERT Busters: Outlier Dimensions That Disrupt Transformers Kovaleva Olga, Kulshreshtha Saurabh, Rogers Anna, Rumshisky Anna
- Layer-wise Pruning Of Transformer Attention Heads For Efficient Language Modeling Shim Kyuhong, Choi Iksoo, Sung Wonyong, Choi Jungwook
- From Dense To Sparse: Contrastive Pruning For Better Pre-trained Language Model Compression Xu Runxin, Luo Fuli, Wang Chengyu, Chang Baobao, Huang Jun, Huang Songfang, Huang Fei
- Prune Once For All: Sparse Pre-trained Language Models Zafrir Ofir, Larey Ariel, Boudoukh Guy, Shen Haihao, Wasserblat Moshe
- M6-rec: Generative Pretrained Language Models Are Open-ended Recommender Systems Cui Zeyu, Ma Jianxin, Zhou Chang, Zhou Jingren, Yang Hongxia
- Shapley Head Pruning: Identifying And Removing Interference In Multilingual Transformers Held William, Yang Diyi
- Compressing Pre-trained Transformers Via Low-bit Nxm Sparsity For Natural Language Understanding Holmes Connor, Zhang Minjia, He Yuxiong, Wu Bo
- The Optimal BERT Surgeon: Scalable And Accurate Second-order Pruning For Large Language Models Kurtic Eldar, Campos Daniel, Nguyen Tuan, Frantar Elias, Kurtz Mark, Fineran Benjamin, Goin Michael, Alistarh Dan
- Interactive Code Generation Via Test-driven User-intent Formalization Lahiri Shuvendu K., Fakhoury Sarah, Naik Aaditya, Sakkas Georgios, Chakraborty Saikat, Musuvathi Madanlal, Choudhury Piali, Von Veh Curtis, Inala Jeevana Priya, Wang Chenglong, Gao Jianfeng
- Accelerating Attention Through Gradient-based Learned Runtime Pruning Li Zheng, Ghodrati Soroush, Yazdanbakhsh Amir, Esmaeilzadeh Hadi, Kang Mingu
- Outliers Dimensions That Disrupt Transformers Are Driven By Frequency Puccetti Giovanni, Rogers Anna, Drozd Aleksandr, Dell'orletta Felice
- Finding Skill Neurons In Pre-trained Transformer-based Language Models Wang Xiaozhi, Wen Kaiyue, Zhang Zhengyan, Hou Lei, Liu Zhiyuan, Li Juanzi
- Bilaterally Slimmable Transformer For Elastic And Efficient Visual Question Answering Yu Zhou, Jin Zitian, Yu Jun, Xu Mingliang, Wang Hongbo, Fan Jianping
- The Emergence Of Essential Sparsity In Large Pre-trained Models: The Weights That Matter Jaiswal Ajay, Liu Shiwei, Chen Tianlong, Wang Zhangyang
- Slimfit: Memory-efficient Fine-tuning Of Transformer-based Models Using Training Dynamics Ardakani Arash, Haan Altan, Tan Shangyin, Popovici Doru Thom, Cheung Alvin, Iancu Costin, Sen Koushik
- Oberta: Improving Sparse Transfer Learning Via Improved Initialization, Distillation, And Pruning Regimes Campos Daniel, Marques Alexandre, Kurtz Mark, Zhai Chengxiang
- Pumer: Pruning And Merging Tokens For Efficient Vision Language Models Cao Qingqing, Paranjape Bhargavi, Hajishirzi Hannaneh
- Lorashear: Efficient Large Language Model Structured Pruning And Knowledge Recovery Chen Tianyi, Ding Tianyu, Yadav Badal, Zharkov Ilya, Liang Luming
- Investigating Hallucinations In Pruned Large Language Models For Abstractive Summarization Chrysostomou George, Zhao Zhixue, Williams Miles, Aletras Nikolaos
- Divergent Token Metrics: Measuring Degradation To Prune Away LLM Components -- And Optimize Quantization Deiseroth Björn, Meuer Max, Gritsch Nikolas, Eichenberg Constantin, Schramowski Patrick, Aßenmacher Matthias, Kersting Kristian
- Sparsegpt: Massive Language Models Can Be Accurately Pruned In One-shot Frantar Elias, Alistarh Dan
- Revisiting The Knowledge Injection Frameworks Fu Peng, Zhang Yiming, Wang Haobo, Qiu Weikang, Zhao Junbo
- Hiformer: Heterogeneous Feature Interactions Learning With Transformers For Recommender Systems Gui Huan, Wang Ruoxi, Yin Ke, Jin Long, Kula Maciej, Xu Taibai, Hong Lichan, Chi Ed H.
- SPT: Fine-tuning Transformer-based Language Models Efficiently With Sparsification Gui Yuntao, Yan Xiao, Yin Peiqi, Yang Han, Cheng James
- Compresso: Structured Pruning With Collaborative Prompting Learns Compact Large Language Models Guo Song, Xu Jiahang, Zhang Li Lyna, Yang Mao
- ELIP: Efficient Language-image Pre-training With Fewer Vision Tokens Guo Yangyang, Zhang Haoyu, Wong Yongkang, Nie Liqiang, Kankanhalli Mohan
- Fewer Is More: Boosting LLM Reasoning With Reinforced Context Pruning Huang Xijie, Zhang Li Lyna, Cheng Kwang-ting, Yang Fan, Yang Mao
- Just CHOP: Embarrassingly Simple LLM Compression Jha Ananya Harsh, Sherborne Tom, Walsh Evan Pete, Groeneveld Dirk, Strubell Emma, Beltagy Iz
- The Cost Of Down-scaling Language Models: Fact Recall Deteriorates Before In-context Learning Jin Tian, Clement Nolan, Dong Xin, Nagarajan Vaishnavh, Carbin Michael, Ragan-kelley Jonathan, Dziugaite Gintare Karolina
- Losparse: Structured Compression Of Large Language Models Based On Low-rank And Sparse Approximation Li Yixiao, Yu Yifan, Zhang Qingru, Liang Chen, He Pengcheng, Chen Weizhu, Zhao Tuo
- Compressing Context To Enhance Inference Efficiency Of Large Language Models Li Yucheng, Dong Bo, Lin Chenghua, Guerin Frank
- E-sparse: Boosting The Large Language Model Inference Through Entropy-based N:M Sparsity Li Yun, Niu Lin, Zhang Xipeng, Liu Kai, Zhu Jianchen, Kang Zhanhui
- Homodistil: Homotopic Task-agnostic Distillation Of Pre-trained Transformers Liang Chen, Jiang Haoming, Li Zheng, Tang Xianfeng, Yin Bin, Zhao Tuo
- Picking The Underused Heads: A Network Pruning Perspective Of Attention Head Selection For Fusing Dialogue Coreference Information Liu Zhengyuan, Chen Nancy F.
- When Less Is More: Investigating Data Pruning For Pretraining Llms At Scale Marion Max, Üstün Ahmet, Pozzobon Luiza, Wang Alex, Fadaee Marzieh, Hooker Sara
- LEVER: Learning To Verify Language-to-code Generation With Execution Ni Ansong, Iyer Srini, Radev Dragomir, Stoyanov Ves, Yih Wen-tau, Wang Sida I., Lin Xi Victoria
- What Matters In The Structured Pruning Of Generative Language Models? Santacroce Michael, Wen Zixin, Shen Yelong, Li Yuanzhi
- One-shot Sensitivity-aware Mixed Sparsity Pruning For Large Language Models Shao Hang, Liu Bei, Xiao Bo, Zeng Ke, Wan Guanglu, Qian Yanmin
- Automatic Prompt Augmentation And Selection With Chain-of-thought From Labeled Data Shum Kashun, Diao Shizhe, Zhang Tong
- Biomedical Knowledge Graph-optimized Prompt Generation For Large Language Models Soman Karthik, Rose Peter W, Morris John H, Akbas Rabia E, Smith Brett, Peetoom Braian, Villouta-reyes Catalina, Cerono Gabriel, Shi Yongmei, Rizk-jackson Angela, Israni Sharat, Nelson Charlotte A, Huang Sui, Baranzini Sergio E
- Dynamic Context Pruning For Efficient And Interpretable Autoregressive Transformers Anagnostidis Sotiris, Pavllo Dario, Biggio Luca, Noci Lorenzo, Lucchi Aurelien, Hofmann Thomas
- Core-cog: Conversational Recommendation Of Entities Using Constrained Generation Srivastava Harshvardhan, Pruthi Kanav, Chakrabarti Soumen, Mausam
- F3-pruning: A Training-free And Generalized Pruning Strategy Towards Faster And Finer Text-to-video Synthesis Su Sitong, Liu Jianzhi, Gao Lianli, Song Jingkuan
- A Simple And Effective Pruning Approach For Large Language Models Sun Mingjie, Liu Zhuang, Bair Anna, Kolter J. Zico
- Mini-gpts: Efficient Large Language Models Through Contextual Pruning Valicenti Tim, Vidal Justice, Patnaik Ritik
- The LLM Surgeon Van Der Ouderaa Tycho F. A., Nagel Markus, Van Baalen Mart, Asano Yuki M., Blankevoort Tijmen
- Cost-effective Hyperparameter Optimization For Large Language Model Generation Inference Wang Chi, Liu Susan Xueqing, Awadallah Ahmed H.
- Smarttrim: Adaptive Tokens And Attention Pruning For Efficient Vision-language Models Wang Zekun, Chen Jingchang, Zhou Wangchunshu, Zhu Haichao, Liang Jiafeng, Shan Liping, Liu Ming, Xu Dongliang, Yang Qing, Qin Bing
- Llmrec: Large Language Models With Graph Augmentation For Recommendation Wei Wei, Ren Xubin, Tang Jiabin, Wang Qinyong, Su Lixin, Cheng Suqi, Wang Junfeng, Yin Dawei, Huang Chao
- Frustratingly Simple Memory Efficiency For Pre-trained Language Models Via Dynamic Embedding Pruning Williams Miles, Aletras Nikolaos
- Sheared Llama: Accelerating Language Model Pre-training Via Structured Pruning Xia Mengzhou, Gao Tianyu, Zeng Zhiyuan, Chen Danqi
- Llm-pruner: On The Structural Pruning Of Large Language Models Ma Xinyin, Fang Gongfan, Wang Xinchao
- Compress, Then Prompt: Improving Accuracy-efficiency Trade-off Of LLM Inference With Transferable Prompt Xu Zhaozhuo, Liu Zirui, Chen Beidi, Tang Yuxin, Wang Jue, Zhou Kaixiong, Hu Xia, Shrivastava Anshumali
- Fairness-aware Structured Pruning In Transformers Zayed Abdelrahman, Mordido Goncalo, Shabanian Samira, Baldini Ioana, Chandar Sarath
- Survival Of The Most Influential Prompts: Efficient Black-box Prompt Search Via Clustering And Pruning Zhou Han, Wan Xingchen, Vulić Ivan, Korhonen Anna
- Step-by-step Unmasking For Parameter-efficient Fine-tuning Of Large Language Models Agarwal Aradhye, Ramesh Suhas K, Sengupta Ayan, Chakraborty Tanmoy
- Enabling High-sparsity Foundational Llama Models With Efficient Pretraining And Deployment Agarwalla Abhinav, Gupta Abhay, Marques Alexandre, Pandit Shubhra, Goin Michael, Kurtic Eldar, Leong Kevin, Nguyen Tuan, Salem Mahmoud, Alistarh Dan, Lie Sean, Kurtz Mark
- Shadowllm: Predictor-based Contextual Sparsity For Large Language Models Akhauri Yash, Abouelhamayed Ahmed F, Dotzel Jordan, Zhang Zhiru, Rush Alexander M, Huda Safeen, Abdelfattah Mohamed S
- Scaling Sparse Fine-tuning To Large Language Models Ansell Alan, Vulić Ivan, Sterz Hannah, Korhonen Anna, Ponti Edoardo M.
- Slicegpt: Compress Large Language Models By Deleting Rows And Columns Ashkboos Saleh, Croci Maximilian L., Nascimento Marcelo Gennari Do, Hoefler Torsten, Hensman James
- Sparsellm: Towards Global Pruning For Pre-trained Language Models Bai Guangji, Li Yijiang, Ling Chen, Kim Kibaek, Zhao Liang
- Fast And Effective Weight Update For Pruned Large Language Models Boža Vladimír
- Matryoshka Multimodal Models Cai Mu, Yang Jianwei, Gao Jianfeng, Lee Yong Jae
- MADTP: Multimodal Alignment-guided Dynamic Token Pruning For Accelerating Vision-language Transformer Cao Jianjian, Ye Peng, Li Shengze, Yu Chong, Tang Yansong, Lu Jiwen, Chen Tao
- Recoverable Compression: A Multimodal Vision Token Recovery Mechanism Guided By Text Information Chen Yi, Xu Jian, Zhang Xu-yao, Liu Wen-zhuo, Liu Yang-yang, Liu Cheng-lin
- Information Re-organization Improves Reasoning In Large Language Models Cheng Xiaoxia, Tan Zeqi, Xue Wei, Lu Weiming
- Multi-granularity Guided Fusion-in-decoder Choi Eunseong, Lee Hyeri, Lee Jongwuk
- Pruner-zero: Evolving Symbolic Pruning Metric From Scratch For Large Language Models Dong Peijie, Li Lujun, Tang Zhenheng, Liu Xiang, Pan Xinglin, Wang Qiang, Chu Xiaowen
- Lazyllm: Dynamic Token Pruning For Efficient Long Context LLM Inference Fu Qichen, Cho Minsik, Merth Thomas, Mehta Sachin, Rastegari Mohammad, Najibi Mahyar
- LLMC: Benchmarking Large Language Model Quantization With A Versatile Compression Toolkit Gong Ruihao, Yong Yang, Gu Shiqiao, Huang Yushi, Lv Chengtao, Zhang Yunchen, Liu Xianglong, Tao Dacheng
- Dependency-aware Semi-structured Sparsity: Declining Roles Of Outliers In Pruning Glu-based Llms Guo Zhiyu, Kamigaito Hidetaka, Watanabe Taro
- Attention Score Is Not All You Need For Token Importance Indicator In KV Cache Reduction: Value Also Matters Guo Zhiyu, Kamigaito Hidetaka, Watanabe Taro
- Pruning For Protection: Increasing Jailbreak Resistance In Aligned Llms Without Fine-tuning Hasan Adib, Rugina Ileana, Wang Alex
- What Matters In Transformers? Not All Attention Is Needed He Shwai, Sun Guoheng, Shen Zheyu, Li Ang
- Mixed Sparsity Training: Achieving 4× FLOP Reduction For Transformer Pretraining Hu Pihe, Li Shaolong, Huang Longbo
- Antidote: Post-fine-tuning Safety Alignment For Large Language Models Against Harmful Fine-tuning Huang Tiansheng, Bhattacharya Gautam, Joshi Pratik, Kimball Josh, Liu Ling
- Pruning Large Language Models With Semi-structural Adaptive Sparse Training Huang Weiyu, Hu Yuezhou, Jian Guohao, Zhu Jun, Chen Jianfei
- Cutting Off The Head Ends The Conflict: A Mechanism For Interpreting And Mitigating Knowledge Conflicts In Language Models Jin Zhuoran, Cao Pengfei, Yuan Hongbang, Chen Yubo, Xu Jiexin, Li Huaijun, Jiang Xiaojian, Liu Kang, Zhao Jun
- A2SF: Accumulative Attention Scoring With Forgetting Factor For Token Pruning In Transformer Decoder Jo Hyun-rae, Shin Dongkun
- Language Repository For Long Video Understanding Kahatapitiya Kumara, Ranasinghe Kanchana, Park Jongwoo, Ryoo Michael S.
- Shortened Llama: Depth Pruning For Large Language Models With Comparison Of Retraining Methods Kim Bo-kyeong, Kim Geonmin, Kim Tae-ho, Castells Thibault, Choi Shinkook, Shin Junho, Song Hyoung-kyu
- Language-specific Calibration For Pruning Multilingual Language Models Kurz Simon, Chen Jian-jia, Flek Lucie, Zhao Zhixue
- Hip Attention: Sparse Sub-quadratic Attention With Hierarchical Attention Pruning Lee Heejun, Park Geon, Lee Youngwan, Kim Jina, Jeong Wonyoung, Jeon Myeongjae, Hwang Sung Ju
- CATS: Contextually-aware Thresholding For Sparsity In Large Language Models Lee Je-yong, Lee Donghyun, Zhang Genghan, Tiwari Mo, Mirhoseini Azalia
- Owlore: Outlier-weighed Layerwise Sampled Low-rank Projection For Memory-efficient LLM Fine-tuning Li Pengxiang, Yin Lu, Gao Xiaowei, Liu Shiwei
- Hd-eval: Aligning Large Language Model Evaluators Through Hierarchical Criteria Decomposition Liu Yuxuan, Yang Tianchi, Huang Shaohan, Zhang Zihan, Huang Haizhen, Wei Furu, Deng Weiwei, Sun Feng, Zhang Qi
- Large Language Models Relearn Removed Concepts Lo Michelle, Cohen Shay B., Barez Fazl
- Not All Experts Are Equal: Efficient Expert Pruning And Skipping For Mixture-of-experts Large Language Models Lu Xudong, Liu Qi, Xu Yuhui, Zhou Aojun, Huang Siyuan, Zhang Bo, Yan Junchi, Li Hongsheng
- SPP: Sparsity-preserved Parameter-efficient Fine-tuning For Large Language Models Lu Xudong, Zhou Aojun, Xu Yuhui, Zhang Renrui, Gao Peng, Li Hongsheng
- COPAL: Continual Pruning In Large Language Generative Models Malla Srikanth, Choi Joon Hee, Choi Chiho
- Large Language Models Are Parallel Multilingual Learners Mu Yongyu, Feng Peinan, Cao Zhiquan, Wu Yuzhang, Li Bei, Wang Chenglong, Xiao Tong, Song Kai, Liu Tongran, Zhang Chunliang, Zhu Jingbo
- Compact Language Models Via Pruning And Knowledge Distillation Muralidharan Saurav, Sreenivas Sharath Turuvekere, Joshi Raviraj, Chochowski Marcin, Patwary Mostofa, Shoeybi Mohammad, Catanzaro Bryan, Kautz Jan, Molchanov Pavlo
- Skill Set Optimization: Reinforcing Language Model Behavior Via Transferable Skills Nottingham Kolby, Majumder Bodhisattwa Prasad, Mishra Bhavana Dalvi, Singh Sameer, Clark Peter, Fox Roy
- The Solution For The AIGC Inference Performance Optimization Competition Pan Sishun, Xu Haonan, Wan Zhonghua, Yang Yang
- Token-picker: Accelerating Attention In Text Generation With Minimized Memory Transfer Via Probability Estimation Park Junyoung, Kang Myeonggu, Han Yunki, Kim Yanggon, Shin Jaekang, Kim Lee-sup
- The Ultimate Guide To Fine-tuning Llms From Basics To Breakthroughs: An Exhaustive Review Of Technologies, Research, Best Practices, Applied Research Challenges And Opportunities Parthasarathy Venkatesh Balavadhani, Zafar Ahtsham, Khan Aafaq, Shahid Arsalan
- Mamba-ptq: Outlier Channels In Recurrent Large Language Models Pierro Alessandro, Abreu Steven
- Dissecting Language Models: Machine Unlearning Via Selective Pruning Pochinkov Nicholas, Schoots Nandi
- Practical Token Pruning For Foundation Models In Few-shot Conversational Virtual Assistant Systems Qi Haode, Qian Cheng, Ni Jian, Singh Pratyush, Fazeli Reza, Wang Gengyu, Shu Zhongzheng, Wayne Eric, Bross Juergen
- Openba-v2: Reaching 77.3% High Compression Ratio With Fast Multi-stage Pruning Qiao Dan, Su Yi, Wang Pinzheng, Ye Jing, Xie Wenjing, Zhou Yuechi, Ding Yuyang, Tang Zecheng, Wang Jikai, Ji Yixin, Wang Yue, Guo Pei, Sun Zechen, Zhang Zikang, Li Juntao, Chao Pingfu, Chen Wenliang, Fu Guohong, Zhou Guodong, Zhu Qiaoming, Zhang Min
- RAGE Against The Machine: Retrieval-augmented LLM Explanations Rorseth Joel, Godfrey Parke, Golab Lukasz, Srivastava Divesh, Szlichta Jaroslaw
- Retrieval-enhanced Knowledge Editing In Language Models For Multi-hop Question Answering Shi Yucheng, Tan Qiaoyu, Wu Xuansheng, Zhong Shaochen, Zhou Kaixiong, Liu Ninghao
- Rethinking Pruning Large Language Models: Benefits And Pitfalls Of Reconstruction Error Minimization Shin Sungbin, Park Wonpyo, Lee Jaeho, Lee Namhoon
- Prosparse: Introducing And Enhancing Intrinsic Activation Sparsity Within Large Language Models Song Chenyang, Han Xu, Zhang Zhengyan, Hu Shengding, Shi Xiyu, Li Kuai, Chen Chen, Liu Zhiyuan, Li Guangli, Yang Tao, Sun Maosong
- Llm-barber: Block-aware Rebuilder For Sparsity Mask In One-shot For Large Language Models Su Yupeng, Guan Ziyi, Liu Xiaoqun, Jin Tianlai, Wu Dongkuan, Chesi Graziano, Wong Ngai, Yu Hao
- Code Less, Align More: Efficient LLM Fine-tuning For Code Generation With Data Pruning Tsai Yun-da, Liu Mingjie, Ren Haoxing
- Reconstruct The Pruned Model Without Any Retraining Wang Pingjie, Fan Ziqing, Hu Shengchao, Chen Zhe, Wang Yanfeng, Wang Yu
- EMP: Enhance Memory In Data Pruning Xiao Jinying, Li Ping, Nie Jie, Tang Zhe
- On-device Language Models: A Comprehensive Review Xu Jiajun, Li Zhiyuan, Chen Wei, Wang Qun, Gao Xin, Cai Qi, Ling Ziyuan
- BESA: Pruning Large Language Models With Blockwise Parameter-efficient Sparsity Allocation Xu Peng, Shao Wenqi, Chen Mengzhao, Tang Shitao, Zhang Kaipeng, Gao Peng, An Fengwei, Qiao Yu, Luo Ping
- Think: Thinner Key Cache By Query-driven Pruning Xu Yuhui, Jie Zhanming, Dong Hanze, Wang Lei, Lu Xudong, Zhou Aojun, Saha Amrita, Xiong Caiming, Sahoo Doyen
- Beyond Perplexity: Multi-dimensional Safety Evaluation Of LLM Compression Xu Zhichao, Gupta Ashim, Li Tao, Bentham Oliver, Srikumar Vivek
- Pyramidinfer: Pyramid KV Cache Compression For High-throughput LLM Inference Yang Dongjie, Han Xiaodong, Gao Yan, Hu Yao, Zhang Shilin, Zhao Hai
- Memory³: Language Modeling With Explicit Memory Yang Hongkang, Lin Zehao, Wang Wenjin, Wu Hao, Li Zhiyu, Tang Bo, Wei Wenqiang, Wang Jinbo, Tang Zeyun, Song Shichao, Xi Chenyang, Yu Yu, Chen Kai, Xiong Feiyu, Tang Linpeng, E Weinan
- P3: A Policy-driven, Pace-adaptive, And Diversity-promoted Framework For Optimizing LLM Training Yang Yingxuan, Wang Huayi, Wen Muning, Zhang Weinan
- Enhancing In-context Learning Performance With Just Svd-based Weight Pruning: A Theoretical Perspective Yao Xinhao, Hu Xiaolin, Yang Shenzhi, Liu Yong
- Why Lift So Heavy? Slimming Large Language Models By Cutting Off The Layers Yuan Shuzhou, Nie Ercong, Ma Bolei, Färber Michael
- Focus On The Core: Efficient Attention Via Pruned Token Compression For Document Classification Yun Jungmin, Kim Mihyeon, Kim Youngbin
- Finercut: Finer-grained Interpretable Layer Pruning For Large Language Models Zhang Yang, Li Yawei, Wang Xinpeng, Shen Qianli, Plank Barbara, Bischl Bernd, Rezaei Mina, Kawaguchi Kenji
- Diversifying The Expert Knowledge For Task-agnostic Pruning In Sparse Mixture-of-experts Zhang Zeliang, Liu Xiaodong, Cheng Hao, Xu Chenliang, Gao Jianfeng
- APT: Adaptive Pruning And Tuning Pretrained Language Models For Efficient Training And Inference Zhao Bowen, Hajishirzi Hannaneh, Cao Qingqing
- Aya Model: An Instruction Finetuned Open-access Multilingual Language Model Üstün Ahmet, Aryabumi Viraat, Yong Zheng-xin, Ko Wei-yin, D'souza Daniel, Onilude Gbemileke, Bhandari Neel, Singh Shivalika, Ooi Hui-lee, Kayid Amr, Vargus Freddie, Blunsom Phil, Longpre Shayne, Muennighoff Niklas, Fadaee Marzieh, Kreutzer Julia, Hooker Sara
🏷 Quantization
- Convert: Efficient And Accurate Conversational Representations From Transformers Henderson Matthew, Casanueva Iñigo, Mrkšić Nikola, Su Pei-hao, Wen Tsung-hsien, Vulić Ivan
- Structured Pruning Of A Bert-based Question Answering Model Mccarley J. S., Chakravarti Rishav, Sil Avirup
- Well-read Students Learn Better: On The Importance Of Pre-training Compact Models Turc Iulia, Chang Ming-wei, Lee Kenton, Toutanova Kristina
- Structured Pruning Of Large Language Models Wang Ziheng, Wohlwend Jeremy, Lei Tao
- Model Compression With Two-stage Multi-teacher Knowledge Distillation For Web Question Answering System Yang Ze, Shou Linjun, Gong Ming, Lin Wutao, Jiang Daxin
- Q8BERT: Quantized 8bit BERT Zafrir Ofir, Boudoukh Guy, Izsak Peter, Wasserblat Moshe
- Extremely Small BERT Models From Mixed-vocabulary Training Zhao Sanqiang, Gupta Raghav, Song Yang, Zhou Denny
- Train Large, Then Compress: Rethinking Model Size For Efficient Training And Inference Of Transformers Li Zhuohan, Wallace Eric, Shen Sheng, Lin Kevin, Keutzer Kurt, Klein Dan, Gonzalez Joseph E.
- Ladabert: Lightweight Adaptation Of BERT Through Hybrid Model Compression Mao Yihuan, Wang Yujing, Wu Chufan, Zhang Chen, Wang Yang, Yang Yaming, Zhang Quanlu, Tong Yunhai, Bai Jing
- Axformer: Accuracy-driven Approximation Of Transformers For Faster, Smaller And More Accurate NLP Models Nagarajan Amrit, Sen Sanchari, Stevens Jacob R., Raghunathan Anand
- Edgebert: Sentence-level Energy Optimizations For Latency-aware Multi-task NLP Inference Tambe Thierry, Hooper Coleman, Pentecost Lillian, Jia Tianyu, Yang En-yu, Donato Marco, Sanh Victor, Whatmough Paul N., Rush Alexander M., Brooks David, Wei Gu-yeon
- Lite Transformer With Long-short Range Attention Wu Zhanghao, Liu Zhijian, Lin Ji, Lin Yujun, Han Song
- GOBO: Quantizing Attention-based NLP Models For Low Latency And Energy Efficient Inference Zadeh Ali Hadi, Edo Isak, Awad Omar Mohamed, Moshovos Andreas
- Know What You Don't Need: Single-shot Meta-pruning For Attention Heads Zhang Zhengyan, Qi Fanchao, Liu Zhiyuan, Liu Qun, Sun Maosong
- YANMTT: Yet Another Neural Machine Translation Toolkit Dabre Raj, Sumita Eiichiro
- Robustness Challenges In Model Distillation And Pruning For Natural Language Understanding Du Mengnan, Mukherjee Subhabrata, Cheng Yu, Shokouhi Milad, Hu Xia, Awadallah Ahmed Hassan
- The Niutrans System For WNGT 2020 Efficiency Task Hu Chi, Li Bei, Lin Ye, Li Yinqiao, Li Yanyang, Wang Chenglong, Xiao Tong, Zhu Jingbo
- Tiny Neural Models For Seq2seq Kandoor Arun
- Exploring Low-cost Transformer Model Compression For Large-scale Commercial Reply Suggestions Shrivastava Vaishnavi, Gaonkar Radhika, Gupta Shashank, Jha Abhishek
- From Dense To Sparse: Contrastive Pruning For Better Pre-trained Language Model Compression Xu Runxin, Luo Fuli, Wang Chengyu, Chang Baobao, Huang Jun, Huang Songfang, Huang Fei
- Prune Once For All: Sparse Pre-trained Language Models Zafrir Ofir, Larey Ariel, Boudoukh Guy, Shen Haihao, Wasserblat Moshe
- Ernie-vilg: Unified Generative Pre-training For Bidirectional Vision-language Generation Zhang Han, Yin Weichong, Fang Yewei, Li Lanxin, Duan Boqiang, Wu Zhihua, Sun Yu, Tian Hao, Wu Hua, Wang Haifeng
- GLM-130B: An Open Bilingual Pre-trained Model Zeng Aohan, Liu Xiao, Du Zhengxiao, Wang Zihan, Lai Hanyu, Ding Ming, Yang Zhuoyi, Xu Yifan, Zheng Wendi, Xia Xiao, Tam Weng Lam, Ma Zixuan, Xue Yufei, Zhai Jidong, Chen Wenguang, Zhang Peng, Dong Yuxiao, Tang Jie
- Vector-quantized Input-contextualized Soft Prompts For Natural Language Understanding Bhardwaj Rishabh, Saha Amrita, Hoi Steven C. H., Poria Soujanya
- Llm.int8(): 8-bit Matrix Multiplication For Transformers At Scale Dettmers Tim, Lewis Mike, Belkada Younes, Zettlemoyer Luke
- GPTQ: Accurate Post-training Quantization For Generative Pre-trained Transformers Frantar Elias, Ashkboos Saleh, Hoefler Torsten, Alistarh Dan
- Quala-minilm: A Quantized Length Adaptive Minilm Guskin Shira, Wasserblat Moshe, Wang Chang, Shen Haihao
- Compressing Pre-trained Transformers Via Low-bit Nxm Sparsity For Natural Language Understanding Holmes Connor, Zhang Minjia, He Yuxiong, Wu Bo
- Empirical Evaluation Of Post-training Quantization Methods For Language Tasks Hu Ting, Meinel Christoph, Yang Haojin
- The Optimal BERT Surgeon: Scalable And Accurate Second-order Pruning For Large Language Models Kurtic Eldar, Campos Daniel, Nguyen Tuan, Frantar Elias, Kurtz Mark, Fineran Benjamin, Goin Michael, Alistarh Dan
- DQ-BART: Efficient Sequence-to-sequence Model Via Joint Distillation And Quantization Li Zheng, Wang Zijian, Tan Ming, Nallapati Ramesh, Bhatia Parminder, Arnold Andrew, Xiang Bing, Roth Dan
- LUT-GEMM: Quantized Matrix Multiplication Based On Luts For Efficient Inference In Large-scale Generative Language Models Park Gunho, Park Baeseong, Kim Minsub, Lee Sungjae, Kim Jeonghoon, Kwon Beomseok, Kwon Se Jung, Kim Byeongwook, Lee Youngjoo, Lee Dongsoo
- Compression Of Generative Pre-trained Language Models Via Quantization Tao Chaofan, Hou Lu, Zhang Wei, Shang Lifeng, Jiang Xin, Liu Qun, Luo Ping, Wong Ngai
- Extreme Compression For Pre-trained Transformers Made Simple And Efficient Wu Xiaoxia, Yao Zhewei, Zhang Minjia, Li Conglong, He Yuxiong
- Smoothquant: Accurate And Efficient Post-training Quantization For Large Language Models Xiao Guangxuan, Lin Ji, Seznec Mickael, Wu Hao, Demouth Julien, Han Song
- Zeroquant: Efficient And Affordable Post-training Quantization For Large-scale Transformers Yao Zhewei, Aminabadi Reza Yazdani, Zhang Minjia, Wu Xiaoxia, Li Conglong, He Yuxiong
- Slimfit: Memory-efficient Fine-tuning Of Transformer-based Models Using Training Dynamics Ardakani Arash, Haan Altan, Tan Shangyin, Popovici Doru Thom, Cheung Alvin, Iancu Costin, Sen Koushik
- Efficient Transformer Knowledge Distillation: A Performance Review Brown Nathan, Williamson Ashton, Anderson Tahj, Lawrence Logan
- Oberta: Improving Sparse Transfer Learning Via Improved Initialization, Distillation, And Pruning Regimes Campos Daniel, Marques Alexandre, Kurtz Mark, Zhai Chengxiang
- Revolutionizing Mobile Interaction: Enabling A 3 Billion Parameter GPT LLM On Mobile Carreira Samuel, Marques Tomás, Ribeiro José, Grilo Carlos
- INT2.1: Towards Fine-tunable Quantized Large Language Models With Error Correction Through Low-rank Adaptation Chai Yuji, Gkountouras John, Ko Glenn G., Brooks David, Wei Gu-yeon
- Divergent Token Metrics: Measuring Degradation To Prune Away LLM Components -- And Optimize Quantization Deiseroth Björn, Meuer Max, Gritsch Nikolas, Eichenberg Constantin, Schramowski Patrick, Aßenmacher Matthias, Kersting Kristian
- Spqr: A Sparse-quantized Representation For Near-lossless LLM Weight Compression Dettmers Tim, Svirschevski Ruslan, Egiazarian Vage, Kuznedelev Denis, Frantar Elias, Ashkboos Saleh, Borzunov Alexander, Hoefler Torsten, Alistarh Dan
- Blockwise Compression Of Transformer-based Models Without Retraining Dong Gaochen, Chen Wei
- Sweeping Heterogeneity With Smart Mops: Mixture Of Prompts For LLM Task Adaptation Dun Chen, Garcia Mirian Hipolito, Zheng Guoqing, Awadallah Ahmed Hassan, Kyrillidis Anastasios, Sim Robert
- Sparsegpt: Massive Language Models Can Be Accurately Pruned In One-shot Frantar Elias, Alistarh Dan
- SPT: Fine-tuning Transformer-based Language Models Efficiently With Sparsification Gui Yuntao, Yan Xiao, Yin Peiqi, Yang Han, Cheng James
- Compresso: Structured Pruning With Collaborative Prompting Learns Compact Large Language Models Guo Song, Xu Jiahang, Zhang Li Lyna, Yang Mao
- Quip: 2-bit Quantization Of Large Language Models With Guarantees Chee Jerry, Cai Yaohui, Kuleshov Volodymyr, De Sa Christopher
- Just CHOP: Embarrassingly Simple LLM Compression Jha Ananya Harsh, Sherborne Tom, Walsh Evan Pete, Groeneveld Dirk, Strubell Emma, Beltagy Iz
- Memory-efficient Fine-tuning Of Compressed Large Language Models Via Sub-4-bit Integer Quantization Kim Jeonghoon, Lee Jung Hyun, Kim Sungdong, Park Joonsuk, Yoo Kang Min, Kwon Se Jung, Lee Dongsoo
- Squeezellm: Dense-and-sparse Quantization Kim Sehoon, Hooper Coleman, Gholami Amir, Dong Zhen, Li Xiuyu, Shen Sheng, Mahoney Michael W., Keutzer Kurt
- Finequant: Unlocking Efficiency With Fine-grained Weight-only Quantization For Llms Kim Young Jin, Henry Rawn, Fahim Raffy, Awadalla Hany Hassan
- Sparse Fine-tuning For Inference Acceleration Of Large Language Models Kurtic Eldar, Kuznedelev Denis, Frantar Elias, Goin Michael, Alistarh Dan
- Watermarking Llms With Weight Quantization Li Linyang, Jiang Botian, Wang Pengyu, Ren Ke, Yan Hang, Qiu Xipeng
- Loftq: Lora-fine-tuning-aware Quantization For Large Language Models Li Yixiao, Yu Yifan, Liang Chen, He Pengcheng, Karampatziakis Nikos, Chen Weizhu, Zhao Tuo
- Losparse: Structured Compression Of Large Language Models Based On Low-rank And Sparse Approximation Li Yixiao, Yu Yifan, Zhang Qingru, Liang Chen, He Pengcheng, Chen Weizhu, Zhao Tuo
- QFT: Quantized Full-parameter Tuning Of Llms With Affordable Resources Li Zhikai, Liu Xiaoxuan, Zhu Banghua, Dong Zhen, Gu Qingyi, Keutzer Kurt
- Homodistil: Homotopic Task-agnostic Distillation Of Pre-trained Transformers Liang Chen, Jiang Haoming, Li Zheng, Tang Xianfeng, Yin Bin, Zhao Tuo
- AWQ: Activation-aware Weight Quantization For LLM Compression And Acceleration Lin Ji, Tang Jiaming, Tang Haotian, Yang Shang, Chen Wei-ming, Wang Wei-chen, Xiao Guangxuan, Dang Xingyu, Gan Chuang, Han Song
- Pushing Large Language Models To The 6G Edge: Vision, Challenges, And Opportunities Lin Zheng, Qu Guanqiao, Chen Qiyuan, Chen Xianhao, Chen Zhe, Huang Kaibin
- Retrieval-based Knowledge Transfer: An Effective Approach For Extreme Large Language Model Compression Liu Jiduan, Liu Jiahao, Wang Qifan, Wang Jingang, Cai Xunliang, Zhao Dongyan, Wang Ran Lucien, Yan Rui
- Binary And Ternary Natural Language Generation Liu Zechun, Oguz Barlas, Pappu Aasish, Shi Yangyang, Krishnamoorthi Raghuraman
- An Experimental Study: Assessing The Combined Framework Of Wavlm And BEST-RQ For Text-to-speech Synthesis Nielson Via, Hillis Steven
- Smoothquant+: Accurate And Efficient 4-bit Post-training Weightquantization For LLM Pan Jiayi, Wang Chengcan, Zheng Kaifu, Li Yangguang, Wang Zhenyu, Feng Bin
- PB-LLM: Partially Binarized Large Language Models Shang Yuzhang, Yuan Zhihang, Wu Qiang, Dong Zhen
- One-shot Sensitivity-aware Mixed Sparsity Pruning For Large Language Models Shao Hang, Liu Bei, Xiao Bo, Zeng Ke, Wan Guanglu, Qian Yanmin
- Omniquant: Omnidirectionally Calibrated Quantization For Large Language Models Shao Wenqi, Chen Mengzhao, Zhang Zhaoyang, Xu Peng, Zhao Lirui, Li Zhiqian, Zhang Kaipeng, Gao Peng, Qiao Yu, Luo Ping
- Efficient LLM Inference On Cpus Shen Haihao, Chang Hanwen, Dong Bo, Luo Yu, Meng Hengyu
- Qlora: Efficient Finetuning Of Quantized Llms Dettmers Tim, Pagnoni Artidoro, Holtzman Ari, Zettlemoyer Luke
- Mini-gpts: Efficient Large Language Models Through Contextual Pruning Valicenti Tim, Vidal Justice, Patnaik Ritik
- Exploring Parameter-efficient Fine-tuning Techniques For Code Generation With Large Language Models Weyssow Martin, Zhou Xin, Kim Kisub, Lo David, Sahraoui Houari
- Compress, Then Prompt: Improving Accuracy-efficiency Trade-off Of LLM Inference With Transferable Prompt Xu Zhaozhuo, Liu Zirui, Chen Beidi, Tang Yuxin, Wang Jue, Zhou Kaixiong, Hu Xia, Shrivastava Anshumali
- Quantization-aware And Tensor-compressed Training Of Transformers For Natural Language Understanding Yang Zi, Choudhary Samridhi, Kunzmann Siegfried, Zhang Zheng
- Modulora: Finetuning 2-bit Llms On Consumer Gpus By Integrating With Modular Quantizers Yin Junjie, Dong Jiahao, Wang Yingheng, De Sa Christopher, Kuleshov Volodymyr
- Ca-lora: Adapting Existing Lora For Compressed Llms To Enable Efficient Multi-tasking On Personal Devices Zhao Weilin, Huang Yuxiang, Han Xu, Liu Zhiyuan, Zhang Zhengyan, Li Kuai, Chen Chen, Yang Tao, Sun Maosong
- Adapting Large Language Models By Integrating Collaborative Semantics For Recommendation Zheng Bowen, Hou Yupeng, Lu Hongyu, Chen Yu, Zhao Wayne Xin, Chen Ming, Wen Ji-rong
- Response Length Perception And Sequence Scheduling: An Llm-empowered LLM Inference Pipeline Zheng Zangwei, Ren Xiaozhe, Xue Fuzhao, Luo Yang, Jiang Xin, You Yang
- Scissorhands: Exploiting The Persistence Of Importance Hypothesis For LLM KV Cache Compression At Test Time Liu Zichang, Desai Aditya, Liao Fangshuo, Wang Weitao, Xie Victor, Xu Zhaozhuo, Kyrillidis Anastasios, Shrivastava Anshumali
- Enabling High-sparsity Foundational Llama Models With Efficient Pretraining And Deployment Agarwalla Abhinav, Gupta Abhay, Marques Alexandre, Pandit Shubhra, Goin Michael, Kurtic Eldar, Leong Kevin, Nguyen Tuan, Salem Mahmoud, Alistarh Dan, Lie Sean, Kurtz Mark
- Shadowllm: Predictor-based Contextual Sparsity For Large Language Models Akhauri Yash, Abouelhamayed Ahmed F, Dotzel Jordan, Zhang Zhiru, Rush Alexander M, Huda Safeen, Abdelfattah Mohamed S
- Tensor Train Low-rank Approximation (tt-lora): Democratizing AI With Accelerated Llms Anjum Afia, Eren Maksim E., Boureima Ismael, Alexandrov Boian, Bhattarai Manish
- Scaling Sparse Fine-tuning To Large Language Models Ansell Alan, Vulić Ivan, Sterz Hannah, Korhonen Anna, Ponti Edoardo M.
- Quantifying The Capabilities Of Llms Across Scale And Precision Badshah Sher, Sajjad Hassan
- Low-rank Quantization-aware Training For Llms Bondarenko Yelysei, Del Chiaro Riccardo, Nagel Markus
- Efficientqat: Efficient Quantization-aware Training For Large Language Models Chen Mengzhao, Shao Wenqi, Xu Peng, Wang Jiahao, Gao Peng, Zhang Kaipeng, Qiao Yu, Luo Ping
- Ternaryllm: Ternarized Large Language Model Chen Tianqi, Li Zhe, Xu Weixiang, Zhu Zeyu, Li Dong, Tian Lu, Barsoum Emad, Wang Peisong, Cheng Jian
- Exploring Quantization For Efficient Pre-training Of Transformer Language Models Chitsaz Kamran, Fournier Quentin, Mordido Gonçalo, Chandar Sarath
- Cherry On Top: Parameter Heterogeneity And Quantization In Large Language Models Cui Wanyun, Wang Qianle
- QAQ: Quality Adaptive Quantization For LLM KV Cache Dong Shichen, Cheng Wen, Qin Jiayu, Wang Wei
- Bitdistiller: Unleashing The Potential Of Sub-4-bit Llms Via Self-distillation Du Dayou, Zhang Yijia, Cao Shijie, Guo Jiaqi, Cao Ting, Chu Xiaowen, Xu Ningyi
- Accuracy Is Not All You Need Dutta Abhinav, Krishnan Sanjeev, Kwatra Nipun, Ramjee Ramachandran
- Dynamic Q&A Of Clinical Documents With Large Language Models Elgedawy Ran, Danciu Ioana, Mahbub Maria, Srinivasan Sudarshan
- Tinyagent: Function Calling At The Edge Erdogan Lutfi Eren, Lee Nicholas, Jha Siddharth, Kim Sehoon, Tabrizi Ryan, Moon Suhong, Hooper Coleman, Anumanchipalli Gopala, Keutzer Kurt, Gholami Amir
- Generative AI And Large Language Models For Cyber Security: All Insights You Need Ferrag Mohamed Amine, Alwahedi Fatima, Battah Ammar, Cherif Bilel, Mechri Abdechakour, Tihanyi Norbert
- Moa: Mixture Of Sparse Attention For Automatic Large Language Model Compression Fu Tianyu, Huang Haofeng, Ning Xuefei, Zhang Genghan, Chen Boju, Wu Tianqi, Wang Hongyi, Huang Zixiao, Li Shiyao, Yan Shengen, Dai Guohao, Yang Huazhong, Wang Yu
- Compensate Quantization Errors: Make Weights Hierarchical To Compensate Each Other Gao Yifei, Ou Jie, Wang Lei, Xiao Yuting, Xiang Zhiyuan, Dai Ruiting, Cheng Jun
- I've Got The "answer"! Interpretation Of Llms Hidden States In Question Answering Goloviznina Valeriya, Kotelnikov Evgeny
- LLMC: Benchmarking Large Language Model Quantization With A Versatile Compression Toolkit Gong Ruihao, Yong Yang, Gu Shiqiao, Huang Yushi, Lv Chengtao, Zhang Yunchen, Liu Xianglong, Tao Dacheng
- APTQ: Attention-aware Post-training Mixed-precision Quantization For Large Language Models Guan Ziyi, Huang Hantao, Su Yupeng, Huang Hong, Wong Ngai, Yu Hao
- GPTQT: Quantize Large Language Models Twice To Push The Efficiency Guo Yipin, Lang Yilin, Ren Qinyuan
- Dependency-aware Semi-structured Sparsity: Declining Roles Of Outliers In Pruning Glu-based Llms Guo Zhiyu, Kamigaito Hidetaka, Watanabe Taro
- Sltrain: A Sparse Plus Low-rank Approach For Parameter And Memory Efficient Pretraining Han Andi, Li Jiaxiang, Huang Wei, Hong Mingyi, Takeda Akiko, Jawanpuria Pratik, Mishra Bamdev
- Outlier-efficient Hopfield Layers For Large Transformer-based Models Hu Jerry Yao-chieh, Chang Pei-hsuan, Luo Robin, Chen Hong-yu, Li Weijian, Wang Wei-po, Liu Han
- Billm: Pushing The Limit Of Post-training Quantization For Llms Huang Wei, Liu Yangdong, Qin Haotong, Li Ying, Zhang Shiming, Liu Xianglong, Magno Michele, Qi Xiaojuan
- Slim-llm: Salience-driven Mixed-precision Quantization For Large Language Models Huang Wei, Qin Haotong, Liu Yangdong, Li Yawei, Liu Xianglong, Benini Luca, Magno Michele, Qi Xiaojuan
- An Empirical Study Of Llama3 Quantization: From Llms To Mllms Huang Wei, Zheng Xingyu, Ma Xudong, Qin Haotong, Lv Chengtao, Chen Hong, Luo Jie, Qi Xiaojuan, Liu Xianglong, Magno Michele
- Pruning Large Language Models With Semi-structural Adaptive Sparse Training Huang Weiyu, Hu Yuezhou, Jian Guohao, Zhu Jun, Chen Jianfei
- Rolora: Fine-tuning Rotated Outlier-free Llms For Effective Weight-activation Quantization Huang Xijie, Liu Zechun, Liu Shih-yang, Cheng Kwang-ting
- Pc-lora: Low-rank Adaptation For Progressive Model Compression With Knowledge Distillation Hwang Injoon, Park Haewon, Lee Youngwan, Yang Jooyoung, Maeng Sunjae
- Jamba-1.5: Hybrid Transformer-mamba Models At Scale Jamba Team, Lenz Barak, Arazi Alan, Bergman Amir, Manevich Avshalom, Peleg Barak, Aviram Ben, Almagor Chen, Fridman Clara, Padnos Dan, Gissin Daniel, Jannai Daniel, Muhlgay Dor, Zimberg Dor, Gerber Edden M, Dolev Elad, Krakovsky Eran, Safahi Erez, Schwartz Erez, Cohen Gal, Shachaf Gal, Rozenblum Haim, Bata Hofit, Blass Ido, Magar Inbal, Dalmedigos Itay, Osin Jhonathan, Fadlon Julie, Rozman Maria, Danos Matan, Gokhman Michael, Zusman Mor, Gidron Naama, Ratner Nir, Gat Noam, Rozen Noam, Fried Oded, Leshno Ohad, Antverg Omer, Abend Omri, Lieber Opher, Dagan Or, Cohavi Orit, Alon Raz, Belson Ro'i, Cohen Roi, Gilad Rom, Glozman Roman, Lev Shahar, Meirom Shaked, Delbari Tal, Ness Tal, Asida Tomer, Gal Tom Ben, Braude Tom, Pumerantz Uriya, Cohen Yehoshua, Belinkov Yonatan, Globerson Yuval, Levy Yuval Peleg, Shoham Yoav
- L4Q: Parameter Efficient Quantization-aware Fine-tuning On Large Language Models Jeon Hyesung, Kim Yulhwa, Kim Jae-joon
- SDQ: Sparse Decomposed Quantization For LLM Inference Jeong Geonhwa, Tsai Po-an, Keckler Stephen W., Krishna Tushar
- Spectra: A Comprehensive Study Of Ternary, Quantized, And FP16 Language Models Kaushal Ayush, Pandey Tejas, Vaidhya Tejas, Bhagat Aaryan, Rish Irina
- Promptkd: Distilling Student-friendly Knowledge For Generative Language Models Via Prompt Tuning Kim Gyeongman, Jang Doohyuk, Yang Eunho
- Openvla: An Open-source Vision-language-action Model Kim Moo Jin, Pertsch Karl, Karamcheti Siddharth, Xiao Ted, Balakrishna Ashwin, Nair Suraj, Rafailov Rafael, Foster Ethan, Lam Grace, Sanketi Pannag, Vuong Quan, Kollar Thomas, Burchfiel Benjamin, Tedrake Russ, Sadigh Dorsa, Levine Sergey, Liang Percy, Finn Chelsea
- Hints-in-browser: Benchmarking Language Models For Programming Feedback Generation Kotalwar Nachiket, Gotovos Alkis, Singla Adish
- Fine-tuning, Quantization, And Llms: Navigating Unintended Outcomes Kumar Divyanshu, Kumar Anurakt, Agarwal Sahil, Harshangi Prashanth
- Biomistral: A Collection Of Open-source Pretrained Large Language Models For Medical Domains Labrak Yanis, Bazoge Adrien, Morin Emmanuel, Gourraud Pierre-antoine, Rouvier Mickael, Dufour Richard
- Melting Point: Mobile Evaluation Of Language Transformers Laskaridis Stefanos, Katevas Kleomenis, Minto Lorenzo, Haddadi Hamed
- Reallm: A General Framework For LLM Compression And Fine-tuning Leconte Louis, Bedin Lisa, Nguyen Van Minh, Moulines Eric
- Improving Conversational Abilities Of Quantized Large Language Models Via Direct Preference Alignment Lee Janghwan, Park Seongmin, Hong Sukjin, Kim Minsoo, Chang Du-seong, Choi Jungwook
- LRQ: Optimizing Post-training Quantization For Large Language Models By Learning Low-rank Weight-scaling Matrices Lee Jung Hyun, Kim Jeonghoon, Yang June Yong, Kwon Se Jung, Yang Eunho, Yoo Kang Min, Lee Dongsoo
- Tender: Accelerating Large Language Models Via Tensor Decomposition And Runtime Requantization Lee Jungi, Lee Wonbeom, Sim Jaewoong
- Examining Post-training Quantization For Mixture-of-experts: A Benchmark Li Pingzhi, Jin Xiaolong, Cheng Yu, Chen Tianlong
- Apiq: Finetuning Of 2-bit Quantized Large Language Model Liao Baohao, Herold Christian, Khadivi Shahram, Monz Christof
- Qserve: W4A8KV4 Quantization And System Co-design For Efficient LLM Serving Lin Yujun, Tang Haotian, Yang Shang, Zhang Zhekai, Xiao Guangxuan, Gan Chuang, Han Song
- Mmgrec: Multimodal Generative Recommendation With Transformer Model Liu Han, Wei Yinwei, Song Xuemeng, Guan Weili, Li Yuan-fang, Nie Liqiang
- Training-free Activation Sparsity In Large Language Models Liu James, Ponnusamy Pragaash, Cai Tianle, Guo Han, Kim Yoon, Athiwaratkun Ben
- STORE: Streamlining Semantic Tokenization And Generative Recommendation With A Single LLM Liu Qijiong, Zhu Jieming, Fan Lu, Zhao Zhou, Wu Xiao-ming
- Proofread: Fixes All Errors With One Tap Liu Renjie, Zhang Yanxiang, Zhu Yun, Sun Haicheng, Zhang Yuanbo, Huang Michael Xuelin, Cai Shanqing, Meng Lei, Zhai Shumin
- Intactkv: Improving Large Language Model Quantization By Keeping Pivot Tokens Intact Liu Ruikang, Bai Haoli, Lin Haokun, Li Yuening, Gao Han, Xu Zhengzhuo, Hou Lu, Yao Jun, Yuan Chun
- Understanding Llms: A Comprehensive Overview From Training To Inference Liu Yiheng, He Hao, Han Tianle, Zhang Xu, Liu Mengyuan, Tian Jiaming, Zhang Yutong, Wang Jiaqi, Gao Xiaohui, Zhong Tianyang, Pan Yi, Xu Shaochen, Wu Zihao, Liu Zhengliang, Zhang Xin, Zhang Shu, Hu Xintao, Zhang Tuo, Qiang Ning, Liu Tianming, Ge Bao
- Loqt: Low-rank Adapters For Quantized Pre-training Loeschcke Sebastian, Toftrup Mads, Kastoryano Michael J., Belongie Serge, Snæbjarnarson Vésteinn
- Prompt Tuning As User Inherent Profile Inference Machine Lu Yusheng, Du Zhaocheng, Li Xiangyang, Zhao Xiangyu, Liu Weiwen, Wang Yichao, Guo Huifeng, Tang Ruiming, Dong Zhenhua, Duan Yongrui
- M³GPT: An Advanced Multimodal, Multitask Framework For Motion Comprehension And Generation Luo Mingshuang, Hou Ruibing, Chang Hong, Liu Zimo, Wang Yaowei, Shan Shiguang
- Matmul Or No Matmul In The Era Of 1-bit Llms Malekar Jinendra, Elbtity Mohammed E., Zand Ramtin
- How Does Quantization Affect Multilingual Llms? Marchisio Kelly, Dash Saurabh, Chen Hongyu, Aumiller Dennis, Üstün Ahmet, Hooker Sara, Ruder Sebastian
- Cdquant: Accurate Post-training Weight Quantization Of Large Pre-trained Models Using Greedy Coordinate Descent Nair Pranav Ajit, Suggala Arun Sai
- Outliers And Calibration Sets Have Diminishing Effect On Quantization Of Modern Llms Paglieri Davide, Dash Saurabh, Rocktäschel Tim, Parker-holder Jack
- Any-precision LLM: Low-cost Deployment Of Multiple, Different-sized Llms Park Yeonhong, Hyun Jake, Cho Sanglyul, Sim Bonggeun, Lee Jae W.
- Distillation Contrastive Decoding: Improving Llms Reasoning With Contrastive Decoding And Distillation Phan Phuc, Tran Hieu, Phan Long
- Mamba-ptq: Outlier Channels In Recurrent Large Language Models Pierro Alessandro, Abreu Steven
- When Quantization Affects Confidence Of Large Language Models? Proskurina Irina, Brun Luc, Metzler Guillaume, Velcin Julien
- The Uniqueness Of Llama3-70b With Per-channel Quantization: An Empirical Study Qin Minghai
- Qdylora: Quantized Dynamic Low-rank Adaptation For Efficient Large Language Model Tuning Rajabzadeh Hossein, Valipour Mojtaba, Zhu Tianshu, Tahaei Marzieh, Kwon Hyock Ju, Ghodsi Ali, Chen Boxing, Rezagholizadeh Mehdi
- Vision-language And Large Language Model Performance In Gastroenterology: GPT, Claude, Llama, Phi, Mistral, Gemma, And Quantized Models Safavi-naini Seyed Amir Ahmad, Ali Shuhaib, Shahab Omer, Shahhoseini Zahra, Savage Thomas, Rafiee Sara, Samaan Jamil S, Al Shabeeb Reem, Ladak Farah, Yang Jamie O, Echavarria Juan, Babar Sumbal, Shaukat Aasma, Margolis Samuel, Tatonetti Nicholas P, Nadkarni Girish, El Kurdi Bara, Soroush Ali
- Flashattention-3: Fast And Accurate Attention With Asynchrony And Low-precision Shah Jay, Bikshandi Ganesh, Zhang Ying, Thakkar Vijay, Ramani Pradeep, Dao Tri
- Combining Multiple Post-training Techniques To Achieve Most Efficient Quantized Llms Sharify Sayeh, Xu Zifei, Yazar Wanzin, Wang Xin
- Accurate And Efficient Fine-tuning Of Quantized Large Language Models Through Optimal Balance Shen Ao, Wang Qiang, Lai Zhiquan, Li Xionglve, Li Dongsheng
- A Thorough Examination Of Decoding Methods In The Era Of Llms Shi Chufan, Yang Haoran, Cai Deng, Zhang Zhisong, Wang Yifan, Yang Yujiu, Lam Wai
- Structuredrag: JSON Response Formatting With Large Language Models Shorten Connor, Pierse Charles, Smith Thomas Benjamin, Cardenas Erika, Sharma Akanksha, Trengrove John, Van Luijt Bob
- Prefixing Attention Sinks Can Mitigate Activation Outliers For Large Language Model Quantization Son Seungwoo, Park Wonpyo, Han Woohyun, Kim Kyuyeun, Lee Jaeho
- Specexec: Massively Parallel Speculative Decoding For Interactive LLM Inference On Consumer Devices Svirschevski Ruslan, May Avner, Chen Zhuoming, Chen Beidi, Jia Zhihao, Ryabinin Max
- Mobilequant: Mobile-friendly Quantization For On-device Language Models Tan Fuwen, Lee Royson, Dudziak Łukasz, Hu Shell Xu, Bhattacharya Sourav, Hospedales Timothy, Tzimiropoulos Georgios, Martinez Brais
- Enhancing Inference Efficiency Of Large Language Models: Investigating Optimization Strategies And Architectural Innovations Tyukin Georgy
- Promptmm: Multi-modal Knowledge Distillation For Recommendation With Prompt-tuning Wei Wei, Tang Jiabin, Jiang Yangqin, Xia Lianghao, Huang Chao
- GEB-1.3B: Open Lightweight Large Language Model Wu Jie, Zhu Yufeng, Shen Lei, Lu Xuqing
- FP6-LLM: Efficiently Serving Large Language Models Through Fp6-centric Algorithm-system Co-design Xia Haojun, Zheng Zhen, Wu Xiaoxia, Chen Shiyang, Yao Zhewei, Youn Stephen, Bakhtiari Arash, Wyatt Michael, Zhuang Donglin, Zhou Zhongzhu, Ruwase Olatunji, He Yuxiong, Song Shuaiwen Leon
- Advancing Multimodal Large Language Models With Quantization-aware Scale Learning For Efficient Adaptation Xie Jingjing, Zhang Yuxin, Lin Mingbao, Cao Liujuan, Ji Rongrong
- On-device Language Models: A Comprehensive Review Xu Jiajun, Li Zhiyuan, Chen Wei, Wang Qun, Gao Xin, Cai Qi, Ling Ziyuan
- Beyond Perplexity: Multi-dimensional Safety Evaluation Of LLM Compression Xu Zhichao, Gupta Ashim, Li Tao, Bentham Oliver, Srikumar Vivek
- No Token Left Behind: Reliable KV Cache Compression Via Importance-aware Mixed Precision Quantization Yang June Yong, Kim Byeongwook, Bae Jeongin, Kwon Beomseok, Park Gunho, Yang Eunho, Kwon Se Jung, Lee Dongsoo
- The Impact Of Quantization On Retrieval-augmented Generation: An Analysis Of Small Llms Yazan Mert, Verberne Suzan, Situmeang Frederik
- KV Cache Compression, But What Must We Give In Return? A Comprehensive Benchmark Of Long Context Capable Approaches Yuan Jiayi, Liu Hongyi, Zhong Shaochen, Chuang Yu-neng, Li Songchen, Wang Guanchu, Le Duy, Jin Hongye, Chaudhary Vipin, Xu Zhaozhuo, Liu Zirui, Hu Xia
- Wkvquant: Quantizing Weight And Key/value Cache For Large Language Models Gains More Yue Yuxuan, Yuan Zhihang, Duanmu Haojie, Zhou Sifan, Wu Jianlong, Nie Liqiang
- ABQ-LLM: Arbitrary-bit Quantized Inference Acceleration For Large Language Models Zeng Chao, Liu Songwei, Xie Yusheng, Liu Hong, Wang Xiaojian, Wei Miao, Yang Shu, Chen Fangmin, Mei Xing
- Multilingual Brain Surgeon: Large Language Models Can Be Compressed Leaving No Language Behind Zeng Hongchuan, Xu Hongshen, Chen Lu, Yu Kai
- Unimot: Unified Molecule-text Language Model With Discrete Token Representation Zhang Juzheng, Bian Yatao, Chen Yongqiang, Yao Quanming
- Leanquant: Accurate Large Language Model Quantization With Loss-error-aware Grid Zhang Tianyi, Shrivastava Anshumali
- MLKD-BERT: Multi-level Knowledge Distillation For Pre-trained Language Models Zhang Ying, Yang Ziheng, Ji Shufan
- SWIFT: A Scalable Lightweight Infrastructure For Fine-tuning Zhao Yuze, Huang Jintao, Hu Jinghan, Wang Xingjun, Mao Yunlin, Zhang Daoze, Jiang Zeyinzi, Wu Zhikai, Ai Baole, Wang Ang, Zhou Wenmeng, Chen Yingda
- Cost: Contrastive Quantization Based Semantic Tokenization For Generative Recommendation Zhu Jieming, Jin Mengqun, Liu Qijiong, Qiu Zexuan, Dong Zhenhua, Li Xiu
- Can Large Language Models Understand Context? Zhu Yilun, Moniz Joel Ruben Antony, Bhargava Shruti, Lu Jiarui, Piraviperumal Dhivya, Li Site, Zhang Yuan, Yu Hong, Tseng Bo-hsiang
🏷 RAG
- Rationalizing Neural Predictions Lei Tao, Barzilay Regina, Jaakkola Tommi
- Interactive Attention For Neural Machine Translation Meng Fandong, Lu Zhengdong, Li Hang, Liu Qun
- Generative Deep Neural Networks For Dialogue: A Short Review Serban Iulian Vlad, Lowe Ryan, Charlin Laurent, Pineau Joelle
- Neural Machine Translation Advised By Statistical Machine Translation Wang Xing, Lu Zhengdong, Tu Zhaopeng, Li Hang, Xiong Deyi, Zhang Min
- Topic Aware Neural Response Generation Xing Chen, Wu Wei, Wu Yu, Liu Jie, Huang Yalou, Zhou Ming, Ma Wei-ying
- Don't Just Assume; Look And Answer: Overcoming Priors For Visual Question Answering Agrawal Aishwarya, Batra Dhruv, Parikh Devi, Kembhavi Aniruddha
- Enabling Multi-source Neural Machine Translation By Concatenating Source Sentences In Multiple Languages Dabre Raj, Cromieres Fabien, Kurohashi Sadao
- Neural Machine Translation Leveraging Phrase-based Models In A Hybrid Search Dahlmann Leonard, Matusov Evgeny, Petrushkov Pavel, Khadivi Shahram
- Reinforced Mnemonic Reader For Machine Reading Comprehension Hu Minghao, Peng Yuxing, Huang Zhen, Qiu Xipeng, Wei Furu, Zhou Ming
- Simple Recurrent Units For Highly Parallelizable Recurrence Lei Tao, Zhang Yu, Wang Sida I., Dai Hui, Artzi Yoav
- Best Of Both Worlds: Transferring Knowledge From Discriminative Learning To A Generative Visual Dialog Model Lu Jiasen, Kannan Anitha, Yang Jianwei, Parikh Devi, Batra Dhruv
- A Generative Approach To Question Answering Mitra Rajarshee
- A Unified Query-based Generative Model For Question Generation And Question Answering Song Linfeng, Wang Zhiguo, Hamza Wael
- Deepprobe: Information Directed Sequence Understanding And Chatbot Design Via Recurrent Neural Networks Yin Zi, Chang Keng-hao, Zhang Ruofei
- Chunk-based Bi-scale Decoder For Neural Machine Translation Zhou Hao, Tu Zhaopeng, Huang Shujian, Liu Xiaohua, Li Hang, Chen Jiajun
- Mojitalk: Generating Emotional Responses At Scale Zhou Xianda, Wang William Yang
- Code2seq: Generating Sequences From Structured Representations Of Code Alon Uri, Brody Shaked, Levy Omer, Yahav Eran
- Multilingual Extractive Reading Comprehension By Runtime Machine Translation Asai Akari, Eriguchi Akiko, Hashimoto Kazuma, Tsuruoka Yoshimasa
- Building Sequential Inference Models For End-to-end Response Selection Gu Jia-chen, Ling Zhen-hua, Ruan Yu-ping, Liu Quan
- SNU_IDS At Semeval-2018 Task 12: Sentence Encoder With Contextualized Vectors For Argument Reasoning Comprehension Kim Taeuk, Choi Jihun, Lee Sang-goo
- Neural Machine Translation With Adequacy-oriented Learning Kong Xiang, Tu Zhaopeng, Shi Shuming, Hovy Eduard, Zhang Tong
- Seq2rdf: An End-to-end Application For Deriving Triples From Natural Language Text Liu Yue, Zhang Tongtao, Liang Zhicheng, Ji Heng, Mcguinness Deborah L.
- Polite Dialogue Generation Without Parallel Data Niu Tong, Bansal Mohit
- Extending Neural Generative Conversational Model Using External Knowledge Sources Parthasarathi Prasanna, Pineau Joelle
- Training Tips For The Transformer Model Popel Martin, Bojar Ondřej
- Beyond Task Success: A Closer Look At Jointly Learning To See, Ask, And Guesswhat Shekhar Ravi, Venkatesh Aashish, Baumgärtner Tim, Bruni Elia, Plank Barbara, Bernardi Raffaella, Fernández Raquel
- Adapting Visual Question Answering Models For Enhancing Multimodal Community Q&A Platforms Srivastava Avikalp, Liu Hsin Wen, Fujita Sumio
- Inducing Grammars With And For Neural Machine Translation Tran Ke, Bisk Yonatan
- Robust Text-to-sql Generation With Execution-guided Decoding Wang Chenglong, Tatwawadi Kedar, Brockschmidt Marc, Huang Po-sen, Mao Yi, Polozov Oleksandr, Singh Rishabh
- Response Generation By Context-aware Prototype Editing Wu Yu, Wei Furu, Huang Shaohan, Wang Yunli, Li Zhoujun, Zhou Ming
- Dual Ask-answer Network For Machine Reading Comprehension Xiao Han, Wang Feng, Yan Jianfeng, Zheng Jingyao
- An Affect-rich Neural Conversational Model With Biased Attention And Weighted Cross-entropy Loss Zhong Peixiang, Wang Di, Miao Chunyan
- A Visual Attention Grounding Neural Model For Multimodal Machine Translation Zhou Mingyang, Cheng Runxiang, Lee Yong Jae, Yu Zhou
- Retrieval-enhanced Adversarial Training For Neural Response Generation Zhu Qingfu, Cui Lei, Zhang Weinan, Wei Furu, Liu Ting
- Fusion Of Detected Objects In Text For Visual Question Answering Alberti Chris, Ling Jeffrey, Collins Michael, Reitter David
- Whatcha Lookin' At? Deeplifting Bert's Attention In Question Answering Arkhangelskaia Ekaterina, Dutta Sourav
- PLATO: Pre-trained Dialogue Generation Model With Discrete Latent Variable Bao Siqi, He Huang, Wang Fan, Wu Hua, Wang Haifeng
- Language Features Matter: Effective Language Representations For Vision-language Tasks Burns Andrea, Tan Reuben, Saenko Kate, Sclaroff Stan, Plummer Bryan A.
- DMRM: A Dual-channel Multi-hop Reasoning Model For Visual Dialog Chen Feilong, Meng Fandong, Xu Jiaming, Li Peng, Xu Bo, Zhou Jie
- Distilling Knowledge Learned In BERT For Text Generation Chen Yen-chun, Gan Zhe, Cheng Yu, Liu Jingzhou, Liu Jingjing
- Cross-lingual Natural Language Generation Via Pre-training Chi Zewen, Dong Li, Wei Furu, Wang Wenhui, Mao Xian-ling, Huang Heyan
- Transformer-xl: Attentive Language Models Beyond A Fixed-length Context Dai Zihang, Yang Zhilin, Yang Yiming, Carbonell Jaime, Le Quoc V., Salakhutdinov Ruslan
- Memory-augmented Recurrent Networks For Dialogue Coherence Donahue David, Meng Yuanliang, Rumshisky Anna
- Optibox: Breaking The Limits Of Proposals For Visual Grounding Fan Zicong, Meng Si Yi, Sigal Leonid, Little James J.
- MRQA 2019 Shared Task: Evaluating Generalization In Reading Comprehension Fisch Adam, Talmor Alon, Jia Robin, Seo Minjoon, Choi Eunsol, Chen Danqi
- Zero-shot Paraphrase Generation With Multilingual Language Models Guo Yinpeng, Liao Yi, Jiang Xin, Zhang Qing, Zhang Yibo, Liu Qun
- Resurrecting Submodularity For Neural Text Generation Han Simeng, Lin Xiang, Joty Shafiq
- Convert: Efficient And Accurate Conversational Representations From Transformers Henderson Matthew, Casanueva Iñigo, Mrkšić Nikola, Su Pei-hao, Wen Tsung-hsien, Vulić Ivan
- A Corpus-free State2seq User Simulator For Task-oriented Dialogue Hou Yutai, Fang Meng, Che Wanxiang, Liu Ting
- INSET: Sentence Infilling With Inter-sentential Transformer Huang Yichen, Zhang Yizhe, Elachqar Oussama, Cheng Yu
- Improving Neural Response Diversity With Frequency-aware Cross-entropy Loss Jiang Shaojie, Ren Pengjie, Monz Christof, De Rijke Maarten
- Tinybert: Distilling BERT For Natural Language Understanding Jiao Xiaoqi, Yin Yichun, Shang Lifeng, Jiang Xin, Chen Xiao, Li Linlin, Wang Fang, Liu Qun
- Bipar: A Bilingual Parallel Dataset For Multilingual And Cross-lingual Reading Comprehension On Novels Jing Yimin, Xiong Deyi, Zhen Yan
- All-in-one Image-grounded Conversational Agents Ju Da, Shuster Kurt, Boureau Y-lan, Weston Jason
- Question Generation From Paragraphs: A Tale Of Two Hierarchical Models Kumar Vishwajeet, Chaki Raktim, Talluri Sai Teja, Ramakrishnan Ganesh, Li Yuan-fang, Haffari Gholamreza
- Patent Claim Generation By Fine-tuning Openai GPT-2 Lee Jieh-sheng, Hsiang Jieh
- TAB-VCR: Tags And Attributes Based Visual Commonsense Reasoning Baselines Lin Jingxiang, Jain Unnat, Schwing Alexander G.
- Task-oriented Conversation Generation Using Heterogeneous Memory Networks Lin Zehao, Huang Xinjing, Ji Feng, Chen Haiqing, Zhang Ying
- Attention-informed Mixed-language Training For Zero-shot Cross-lingual Task-oriented Dialogue Systems Liu Zihan, Winata Genta Indra, Lin Zhaojiang, Xu Peng, Fung Pascale
- Improving Question Generation With Sentence-level Semantic Matching And Answer Position Inferring Ma Xiyao, Zhu Qile, Zhou Yanlin, Li Xiaolin, Wu Dapeng
- Human Vs. Muppet: A Conservative Estimate Of Human Performance On The GLUE Benchmark Nangia Nikita, Bowman Samuel R.
- Neural Assistant: Joint Action Prediction, Response Generation, And Latent Knowledge Reasoning Neelakantan Arvind, Yavuz Semih, Narang Sharan, Prasad Vishaal, Goodrich Ben, Duckworth Daniel, Sankar Chinnadhurai, Yan Xifeng
- A Simple But Effective Method To Incorporate Multi-turn Context With BERT For Conversational Machine Comprehension Ohsugi Yasuhito, Saito Itsumi, Nishida Kyosuke, Asano Hisako, Tomita Junji
- Dlgnet: A Transformer-based Model For Dialogue Response Generation Olabiyi Oluwatobi, Mueller Erik T.
- Adversarial Bootstrapping For Dialogue Model Training Olabiyi Oluwatobi, Mueller Erik T., Larson Christopher, Lahlou Tarek
- Answering Complex Open-domain Questions Through Iterative Query Generation Qi Peng, Lin Xiaowen, Mehr Leo, Wang Zijian, Manning Christopher D.
- Blockwise Self-attention For Long Document Understanding Qiu Jiezhong, Ma Hao, Levy Omer, Yih Scott Wen-tau, Wang Sinong, Tang Jie
- GECOR: An End-to-end Generative Ellipsis And Co-reference Resolution Model For Task-oriented Dialogue Quan Jun, Xiong Deyi, Webber Bonnie, Hu Changjian
- Option Comparison Network For Multiple-choice Reading Comprehension Ran Qiu, Li Peng, Hu Weiwei, Zhou Jie
- On Leveraging The Visual Modality For Neural Machine Translation Raunak Vikas, Choe Sang Keun, Lu Quanyang, Xu Yi, Metze Florian
- Probing Natural Language Inference Models Through Semantic Fragments Richardson Kyle, Hu Hai, Moss Lawrence S., Sabharwal Ashish
- Leveraging Pre-trained Checkpoints For Sequence Generation Tasks Rothe Sascha, Narayan Shashi, Severyn Aliaksei
- Distilbert, A Distilled Version Of BERT: Smaller, Faster, Cheaper And Lighter Sanh Victor, Debut Lysandre, Chaumond Julien, Wolf Thomas
- Inducing Brain-relevant Bias In Natural Language Processing Models Schwartz Dan, Toneva Mariya, Wehbe Leila
- Taking A HINT: Leveraging Explanations To Make Vision And Language Models More Grounded Selvaraju Ramprasaath R., Lee Stefan, Shen Yilin, Jin Hongxia, Ghosh Shalini, Heck Larry, Batra Dhruv, Parikh Devi
- Pragmatically Informative Text Generation Shen Sheng, Fried Daniel, Andreas Jacob, Klein Dan
- MASS: Masked Sequence To Sequence Pre-training For Language Generation Song Kaitao, Tan Xu, Qin Tao, Lu Jianfeng, Liu Tie-yan
- Cued@wmt19:ewc&lms Stahlberg Felix, Saunders Danielle, De Gispert Adria, Byrne Bill
- Syntax-infused Transformer And BERT Models For Machine Translation And Natural Language Understanding Sundararaman Dhanasekar, Subramanian Vivek, Wang Guoyin, Si Shijing, Shen Dinghan, Wang Dong, Carin Lawrence
- Well-read Students Learn Better: On The Importance Of Pre-training Compact Models Turc Iulia, Chang Ming-wei, Lee Kenton, Toutanova Kristina
- Quantity Doesn't Buy Quality Syntax With Neural Language Models Van Schijndel Marten, Mueller Aaron, Linzen Tal
- Language Models With Transformers Wang Chenguang, Li Mu, Smola Alexander J.
- Theme-aware Generation Model For Chinese Lyrics Wang Jie, Zhao Xinyan
- Structbert: Incorporating Language Structures Into Pre-training For Deep Language Understanding Wang Wei, Bi Bin, Yan Ming, Wu Chen, Bao Zuyi, Xia Jiangnan, Peng Liwei, Si Luo
- Tree Transformer: Integrating Tree Structures Into Self-attention Wang Yau-shian, Lee Hung-yi, Chen Yun-nung
- Improving Neural Machine Translation With Pre-trained Representation Weng Rongxiang, Yu Heng, Huang Shujian, Luo Weihua, Chen Jiajun
- Learning To Memorize In Neural Task-oriented Dialogue Systems Wu Chien-sheng
- Integrated Triaging For Fast Reading Comprehension Wu Felix, Li Boyi, Wang Lequn, Lao Ni, Blitzer John, Weinberger Kilian Q.
- Sharing Attention Weights For Fast Transformer Xiao Tong, Li Yinqiao, Zhu Jingbo, Yu Zhengtao, Liu Tongran
- Review Conversational Reading Comprehension Xu Hu, Liu Bing, Shu Lei, Yu Philip S.
- Extended Answer And Uncertainty Aware Neural Question Generation Zeng Hongwei, Zhi Zhuo, Liu Jun, Wei Bifan
- Context-aware Visual Policy Network For Fine-grained Image Captioning Zha Zheng-jun, Liu Daqing, Zhang Hanwang, Zhang Yongdong, Wu Feng
- Dialogpt: Large-scale Generative Pre-training For Conversational Response Generation Zhang Yizhe, Sun Siqi, Galley Michel, Chen Yen-chun, Brockett Chris, Gao Xiang, Gao Jianfeng, Liu Jingjing, Dolan Bill
- Synchronous Bidirectional Neural Machine Translation Zhou Long, Zhang Jiajun, Zong Chengqing
- Accelerating Natural Language Understanding In Task-oriented Dialog Ahuja Ojas, Desai Shrey
- Stress Test Evaluation Of Transformer-based Models In Natural Language Understanding Tasks Aspillaga Carlos, Carvallo Andrés, Araujo Vladimir
- Segatron: Segment-aware Transformer For Language Modeling And Understanding Bai He, Shi Peng, Lin Jimmy, Xie Yuqing, Tan Luchen, Xiong Kun, Gao Wen, Li Ming
- Chatbot Interaction With Artificial Intelligence: Human Data Augmentation With T5 And Language Transformer Ensemble For Text Classification Bird Jordan J., Ekárt Anikó, Faria Diego R.
- Advances Of Transformer-based Models For News Headline Generation Bukhtiyarov Alexey, Gusev Ilya
- Pretrained Language Models For Dialogue Generation With Multiple Input Sources Cao Yu, Bi Wei, Fang Meng, Tao Dacheng
- Simultaneous Paraphrasing And Translation By Fine-tuning Transformer Models Chada Rakesh
- Exploring Fluent Query Reformulations With Text-to-text Transformers And Reinforcement Learning Chen Jerry Zikun, Yu Shi, Wang Haoran
- Autoadr: Automatic Model Design For Ad Relevance Chen Yiren, Yang Yaming, Sun Hong, Wang Yujing, Xu Yu, Shen Wei, Zhou Rong, Tong Yunhai, Bai Jing, Zhang Ruofei
- MEANTIME: Mixture Of Attention Mechanisms With Multi-temporal Embeddings For Sequential Recommendation Cho Sung Min, Park Eunhyeok, Yoo Sungjoo
- Rethinking Embedding Coupling In Pre-trained Language Models Chung Hyung Won, Févry Thibault, Tsai Henry, Johnson Melvin, Ruder Sebastian
- Residual Energy-based Models For Text Generation Deng Yuntian, Bakhtin Anton, Ott Myle, Szlam Arthur, Ranzato Marc'aurelio
- Ernie-doc: A Retrospective Long-document Modeling Transformer Ding Siyu, Shang Junyuan, Wang Shuohuan, Sun Yu, Tian Hao, Wu Hua, Wang Haifeng
- Fine-tuning Pretrained Language Models: Weight Initializations, Data Orders, And Early Stopping Dodge Jesse, Ilharco Gabriel, Schwartz Roy, Farhadi Ali, Hajishirzi Hannaneh, Smith Noah
- Turngpt: A Transformer-based Language Model For Predicting Turn-taking In Spoken Dialog Ekstedt Erik, Skantze Gabriel
- Augmenting Transformers With Knn-based Composite Memory For Dialogue Fan Angela, Gardent Claire, Braud Chloe, Bordes Antoine
- CERT: Contrastive Self-supervised Learning For Language Understanding Fang Hongchao, Wang Sicheng, Zhou Meng, Ding Jiayuan, Xie Pengtao
- Paragraph-level Commonsense Transformers With Recurrent Memory Gabriel Saadia, Bhagavatula Chandra, Shwartz Vered, Le Bras Ronan, Forbes Maxwell, Choi Yejin
- Measuring Systematic Generalization In Neural Proof Generation With Transformers Gontier Nicolas, Sinha Koustuv, Reddy Siva, Pal Christopher
- Contextualize Knowledge Bases With Transformer For End-to-end Task-oriented Dialogue Systems Gou Yanjie, Lei Yinjie, Liu Lingqiao, Dai Yong, Shen Chunxu
- Token-level Adaptive Training For Neural Machine Translation Gu Shuhao, Zhang Jinchao, Meng Fandong, Feng Yang, Xie Wanying, Zhou Jie, Yu Dong
- Sequence-level Mixed Sample Data Augmentation Guo Demi, Kim Yoon, Rush Alexander M.
- Incorporating BERT Into Parallel Sequence Decoding With Adapters Guo Junliang, Zhang Zhirui, Xu Linli, Wei Hao-ran, Chen Boxing, Chen Enhong
- Enriched Pre-trained Transformers For Joint Slot Filling And Intent Detection Hardalov Momchil, Koychev Ivan, Nakov Preslav
- Guided Transformer: Leveraging Multiple External Sources For Representation Learning In Conversational Search Hashemi Helia, Zamani Hamed, Croft W. Bruce
- PAIR: Planning And Iterative Refinement In Pre-trained Transformers For Long Text Generation Hua Xinyu, Wang Lu
- Less Is More: Data-efficient Complex Question Answering Over Knowledge Bases Hua Yuncheng, Li Yuan-fang, Qi Guilin, Wu Wei, Zhang Jingyao, Qi Daiqing
- Memory Attentive Fusion: External Language Model Integration For Transformer-based Sequence-to-sequence Model Ihori Mana, Masumura Ryo, Makishima Naoki, Tanaka Tomohiro, Takashima Akihiko, Orihashi Shota
- Probing Pretrained Language Models For Lexical Semantics Vulić Ivan, Ponti Edoardo Maria, Litschko Robert, Glavaš Goran, Korhonen Anna
- Are Natural Language Inference Models Imppressive? Learning Implicature And Presupposition Jeretic Paloma, Warstadt Alex, Bhooshan Suvrat, Williams Adina
- Do Fine-tuned Commonsense Language Models Really Generalize? Kejriwal Mayank, Shen Ke
- Nearest Neighbor Machine Translation Khandelwal Urvashi, Fan Angela, Jurafsky Dan, Zettlemoyer Luke, Lewis Mike
- Fid-ex: Improving Sequence-to-sequence Models For Extractive Rationale Generation Lakhotia Kushal, Paranjape Bhargavi, Ghoshal Asish, Yih Wen-tau, Mehdad Yashar, Iyer Srinivasan
- PONE: A Novel Automatic Evaluation Metric For Open-domain Generative Dialogue Systems Lan Tian, Mao Xian-ling, Wei Wei, Gao Xiaoyan, Huang Heyan
- Which Kind Is Better In Open-domain Multi-turn Dialog, Hierarchical Or Non-hierarchical Models? An Empirical Study Lan Tian, Mao Xian-ling, Wei Wei, Huang Heyan
- Context-based Transformer Models For Answer Sentence Selection Lauriola Ivano, Moschitti Alessandro
- SLM: Learning A Discourse Language Representation With Sentence Unshuffling Lee Haejun, Hudson Drew A., Lee Kangwook, Manning Christopher D.
- MART: Memory-augmented Recurrent Transformer For Coherent Video Paragraph Captioning Lei Jie, Wang Liwei, Shen Yelong, Yu Dong, Berg Tamara L., Bansal Mohit
- Efficient Transformer-based Large Scale Language Representations Using Hardware-friendly Block Structured Pruning Li Bingbing, Kong Zhenglun, Zhang Tianyun, Li Ji, Li Zhengang, Liu Hang, Ding Caiwen
- Dialogue-adaptive Language Model Pre-training From Quality Estimation Li Junlong, Zhang Zhuosheng, Zhao Hai
- Multilingual Speech Translation With Efficient Finetuning Of Pretrained Models Li Xian, Wang Changhan, Tang Yun, Tran Chau, Tang Yuqing, Pino Juan, Baevski Alexei, Conneau Alexis, Auli Michael
- Enhancing Dialogue Generation Via Multi-level Contrastive Learning Li Xin, Li Piji, Wang Yan, Liu Xiaojiang, Lam Wai
- Conversational Question Reformulation Via Sequence-to-sequence Architectures And Pretrained Language Models Lin Sheng-chieh, Yang Jheng-hong, Nogueira Rodrigo, Tsai Ming-feng, Wang Chuan-ju, Lin Jimmy
- Mt5: A Massively Multilingual Pre-trained Text-to-text Transformer Xue Linting, Constant Noah, Roberts Adam, Kale Mihir, Al-rfou Rami, Siddhant Aditya, Barua Aditya, Raffel Colin
- Improving Zero-shot Translation By Disentangling Positional Information Liu Danni, Niehues Jan, Cross James, Guzmán Francisco, Li Xian
- Rikinet: Reading Wikipedia Pages For Natural Question Answering Liu Dayiheng, Gong Yeyun, Fu Jie, Yan Yu, Chen Jiusheng, Jiang Daxin, Lv Jiancheng, Duan Nan
- KG-BART: Knowledge Graph-augmented BART For Generative Commonsense Reasoning Liu Ye, Wan Yao, He Lifang, Peng Hao, Yu Philip S.
- Simplifying Paragraph-level Question Generation Via Transformer Language Models Lopez Luis Enrico, Cruz Diane Kathryn, Cruz Jan Christian Blaise, Cheng Charibeth
- VECO: Variable And Flexible Cross-lingual Pre-training For Language Understanding And Generation Luo Fuli, Wang Wei, Liu Jiahao, Liu Yijia, Bi Bin, Huang Songfang, Huang Fei, Si Luo
- Charbert: Character-aware Pre-trained Language Model Ma Wentao, Cui Yiming, Si Chenglei, Liu Ting, Wang Shijin, Hu Guoping
- Gpt-too: A Language-model-first Approach For Amr-to-text Generation Mager Manuel, Astudillo Ramon Fernandez, Naseem Tahira, Sultan Md Arafat, Lee Young-suk, Florian Radu, Roukos Salim
- Verb Knowledge Injection For Multilingual Event Processing Majewska Olga, Vulić Ivan, Glavaš Goran, Ponti Edoardo M., Korhonen Anna
- Ladabert: Lightweight Adaptation Of BERT Through Hybrid Model Compression Mao Yihuan, Wang Yujing, Wu Chufan, Zhang Chen, Wang Yang, Yang Yaming, Zhang Quanlu, Tong Yunhai, Bai Jing
- Natural Language Rationales With Full-stack Visual Reasoning: From Pixels To Semantic Frames To Commonsense Graphs Marasović Ana, Bhagavatula Chandra, Park Jae Sung, Le Bras Ronan, Smith Noah A., Choi Yejin
- Dialoglue: A Natural Language Understanding Benchmark For Task-oriented Dialogue Mehri Shikib, Eric Mihail, Hakkani-tur Dilek
- Delight: Deep And Light-weight Transformer Mehta Sachin, Ghazvininejad Marjan, Iyer Srinivasan, Zettlemoyer Luke, Hajishirzi Hannaneh
- Generation-distillation For Efficient Natural Language Understanding In Low-data Settings Melas-kyriazi Luke, Han George, Liang Celine
- Axformer: Accuracy-driven Approximation Of Transformers For Faster, Smaller And More Accurate NLP Models Nagarajan Amrit, Sen Sanchari, Stevens Jacob R., Raghunathan Anand
- Avgout: A Simple Output-probability Measure To Eliminate Dull Responses Niu Tong, Bansal Mohit
- Real-time Execution Of Large-scale Language Models On Mobile Niu Wei, Kong Zhenglun, Yuan Geng, Jiang Weiwen, Guan Jiexiong, Ding Caiwen, Zhao Pu, Liu Sijia, Ren Bin, Wang Yanzhi
- X-linear Attention Networks For Image Captioning Pan Yingwei, Yao Ting, Li Yehao, Mei Tao
- Multi-view Attention Network For Visual Dialog Park Sungjin, Whang Taesun, Yoon Yeochan, Lim Heuiseok
- Telling Bert's Full Story: From Local Attention To Global Aggregation Pascual Damian, Brunner Gino, Wattenhofer Roger
- Retrieval-augmented Generation For Knowledge-intensive NLP Tasks Lewis Patrick, Perez Ethan, Piktus Aleksandra, Petroni Fabio, Karpukhin Vladimir, Goyal Naman, Küttler Heinrich, Lewis Mike, Yih Wen-tau, Rocktäschel Tim, Riedel Sebastian, Kiela Douwe
- Intermediate-task Transfer Learning With Pretrained Models For Natural Language Understanding: When And Why Does It Work? Pruksachatkun Yada, Phang Jason, Liu Haokun, Htut Phu Mon, Zhang Xiaoyi, Pang Richard Yuanzhe, Vania Clara, Kann Katharina, Bowman Samuel R.
- Prophetnet: Predicting Future N-gram For Sequence-to-sequence Pre-training Qi Weizhen, Yan Yu, Gong Yeyun, Liu Dayiheng, Duan Nan, Chen Jiusheng, Zhang Ruofei, Zhou Ming
- How Fine Can Fine-tuning Be? Learning Efficient Language Models Radiya-dixit Evani, Wang Xin
- Hierarchical GPT With Congruent Transformers For Multi-sentence Language Models Roh Jihyeon, Gim Huiseong, Lee Soo-young
- Personalized Query Rewriting In Conversational AI Agents Roshan-ghias Alireza, Mathialagan Clint Solomon, Ponnusamy Pragaash, Mathias Lambert, Guo Chenlei
- Knowledge-aware Language Model Pretraining Rosset Corby, Xiong Chenyan, Phan Minh, Song Xia, Bennett Paul, Tiwary Saurabh
- Stronger Transformers For Neural Multi-hop Question Generation Sachan Devendra Singh, Wu Lingfei, Sachan Mrinmaya, Hamilton William
- What BERT Sees: Cross-modal Transfer For Visual Question Generation Scialom Thomas, Bordes Patrick, Dray Paul-alexis, Staiano Jacopo, Gallinari Patrick
- Look Before You Speak: Visually Contextualized Utterances Seo Paul Hongsuck, Nagrani Arsha, Schmid Cordelia
- Neural Data-to-text Generation Via Jointly Learning The Segmentation And Correspondence Shen Xiaoyu, Chang Ernie, Su Hui, Zhou Jie, Klakow Dietrich
- Lightpaff: A Two-stage Distillation Framework For Pre-training And Fine-tuning Song Kaitao, Sun Hao, Tan Xu, Qin Tao, Lu Jianfeng, Liu Hongzhi, Liu Tie-yan
- Neural Machine Translation With Error Correction Song Kaitao, Tan Xu, Lu Jianfeng
- Recipes For Adapting Pre-trained Monolingual And Multilingual Models To Machine Translation Stickland Asa Cooper, Li Xian, Ghazvininejad Marjan
- Multi-hop Question Generation With Graph Convolutional Network Su Dan, Xu Yan, Dai Wenliang, Ji Ziwei, Yu Tiezheng, Fung Pascale
- Diversifying Dialogue Generation With Non-conversational Text Su Hui, Shen Xiaoyu, Zhao Sanqiang, Zhou Xiao, Hu Pengwei, Zhong Randy, Niu Cheng, Zhou Jie
- Intellicode Compose: Code Generation Using Transformer Svyatkovskiy Alexey, Deng Shao Kun, Fu Shengyu, Sundaresan Neel
- Making Pre-trained Language Models Better Few-shot Learners Gao Tianyu, Fisch Adam, Chen Danqi
- VD-BERT: A Unified Vision And Dialog Transformer With BERT Wang Yue, Joty Shafiq, Lyu Michael R., King Irwin, Xiong Caiming, Hoi Steven C. H.
- On Modality Bias In The TVQA Dataset Winterbottom Thomas, Xiao Sarah, Mclean Alistair, Al Moubayed Noura
- Cort: Complementary Rankings From Transformers Wrzalik Marco, Krechel Dirk
- CG-BERT: Conditional Text Generation With BERT For Generalized Few-shot Intent Detection Xia Congying, Zhang Chenwei, Nguyen Hoang, Zhang Jiawei, Yu Philip
- Improving BERT Fine-tuning Via Self-ensemble And Self-distillation Xu Yige, Qiu Xipeng, Zhou Ligao, Huang Xuanjing
- Multi-unit Transformers For Neural Machine Translation Yan Jianhao, Meng Fandong, Zhou Jie
- Off-policy Self-critical Training For Transformer In Visual Paragraph Generation Yan Shiyang, Hua Yang, Robertson Neil M.
- Just Ask: Learning To Answer Questions From Millions Of Narrated Videos Yang Antoine, Miech Antoine, Sivic Josef, Laptev Ivan, Schmid Cordelia
- On The Generation Of Medical Dialogues For COVID-19 Yang Wenmian, Zeng Guangtao, Tan Bowen, Ju Zeqian, Chakravorty Subrato, He Xuehai, Chen Shu, Yang Xingyi, Wu Qingyang, Yu Zhou, Xing Eric, Xie Pengtao
- On The Sub-layer Functionalities Of Transformer Decoder Yang Yilin, Wang Longyue, Shi Shuming, Tadepalli Prasad, Lee Stefan, Tu Zhaopeng
- Assessing Phrasal Representation And Composition In Transformers Yu Lang, Ettinger Allyson
- GOBO: Quantizing Attention-based NLP Models For Low Latency And Energy Efficient Inference Zadeh Ali Hadi, Edo Isak, Awad Omar Mohamed, Moshovos Andreas
- A Simple And Efficient Multi-task Learning Approach For Conditioned Dialogue Generation Zeng Yan, Nie Jian-yun
- Accelerating Training Of Transformer-based Language Models With Progressive Layer Dropping Zhang Minjia, He Yuxiong
- Future-guided Incremental Transformer For Simultaneous Translation Zhang Shaolei, Feng Yang, Li Liangyou
- BERT-JAM: Boosting Bert-enhanced Neural Machine Translation With Joint Attention Zhang Zhebin, Wu Sai, Jiang Dawei, Chen Gang
- Improving Non-autoregressive Neural Machine Translation With Monolingual Data Zhou Jiawei, Keung Phillip
- RAFT: A Real-world Few-shot Text Classification Benchmark Alex Neel, Lifland Eli, Tunstall Lewis, Thakur Abhishek, Maham Pegah, Riedel C. Jess, Hine Emmie, Ashurst Carolyn, Sedille Paul, Carlier Alexis, Noetel Michael, Stuhlmüller Andreas
- See, Hear, Read: Leveraging Multimodality With Guided Attention For Abstractive Text Summarization Atri Yash Kumar, Pramanick Shraman, Goyal Vikram, Chakraborty Tanmoy
- S2s-ft: Fine-tuning Pretrained Transformer Encoders For Sequence-to-sequence Learning Bao Hangbo, Dong Li, Wang Wenhui, Yang Nan, Wei Furu
- Building A Question And Answer System For News Domain Basu Sandipan, Gaddala Aravind, Chetan Pooja, Tiwari Garima, Darapaneni Narayana, Parvathaneni Sadwik, Paduri Anwesh Reddy
- Leveraging Redundancy In Attention With Reuse Transformers Bhojanapalli Srinadh, Chakrabarti Ayan, Veit Andreas, Lukasik Michal, Jain Himanshu, Liu Frederick, Chang Yin-wen, Kumar Sanjiv
- Latr: Layout-aware Transformer For Scene-text VQA Biten Ali Furkan, Litman Ron, Xie Yusheng, Appalaraju Srikar, Manmatha R.
- CDLM: Cross-document Language Modeling Caciularu Avi, Cohan Arman, Beltagy Iz, Peters Matthew E., Cattan Arie, Dagan Ido
- Long Context Question Answering Via Supervised Contrastive Learning Caciularu Avi, Dagan Ido, Goldberger Jacob, Cohan Arman
- Curriculum Learning For Language Modeling Campos Daniel
- Topic Transferable Table Question Answering Chemmengath Saneem Ahmed, Kumar Vishwajeet, Bharadwaj Samarth, Sen Jaydeep, Canim Mustafa, Chakrabarti Soumen, Gliozzo Alfio, Sankaranarayanan Karthik
- Towards Making The Most Of Multilingual Pretraining For Zero-shot Neural Machine Translation Chen Guanhua, Ma Shuming, Chen Yun, Zhang Dongdong, Pan Jia, Wang Wenping, Wei Furu
- Visualgpt: Data-efficient Adaptation Of Pretrained Language Models For Image Captioning Chen Jun, Guo Han, Yi Kai, Li Boyang, Elhoseiny Mohamed
- Long-range Modeling Of Source Code Files With Ewash: Extended Window Access By Syntax Hierarchy Clement Colin B., Lu Shuai, Liu Xiaoyu, Tufano Michele, Drain Dawn, Duan Nan, Sundaresan Neel, Svyatkovskiy Alexey
- Knowledge Neurons In Pretrained Transformers Dai Damai, Dong Li, Hao Yaru, Sui Zhifang, Chang Baobao, Wei Furu
- Is GPT-3 Text Indistinguishable From Human Text? Scarecrow: A Framework For Scrutinizing Machine Text Dou Yao, Forbes Maxwell, Koncel-kedziorski Rik, Smith Noah A., Choi Yejin
- Robustness Challenges In Model Distillation And Pruning For Natural Language Understanding Du Mengnan, Mukherjee Subhabrata, Cheng Yu, Shokouhi Milad, Hu Xia, Awadallah Ahmed Hassan
- Leveraging Knowledge In Multilingual Commonsense Reasoning Fang Yuwei, Wang Shuohang, Xu Yichong, Xu Ruochen, Sun Siqi, Zhu Chenguang, Zeng Michael
- Open-domain Conversational Search Assistant With Transformers Ferreira Rafael, Leite Mariana, Semedo David, Magalhaes Joao
- Cross-attention Is All You Need: Adapting Pretrained Transformers For Machine Translation Gheini Mozhdeh, Ren Xiang, May Jonathan
- LAWDR: Language-agnostic Weighted Document Representations From Pre-trained Models Gong Hongyu, Chaudhary Vishrav, Tang Yuqing, Guzmán Francisco
- Larger-scale Transformers For Multilingual Masked Language Modeling Goyal Naman, Du Jingfei, Ott Myle, Anantharaman Giri, Conneau Alexis
- Xeroalign: Zero-shot Cross-lingual Transformer Alignment Gritta Milan, Iacobacci Ignacio
- Transaction: ICL-SJTU Submission To Epic-kitchens Action Anticipation Challenge 2021 Gu Xiao, Qiu Jianing, Guo Yao, Lo Benny, Yang Guang-zhong
- Block-skim: Efficient Question Answering For Transformer Guan Yue, Li Zhengyi, Leng Jingwen, Lin Zhouhan, Guo Minyi, Zhu Yuhao
- KAT: A Knowledge Augmented Transformer For Vision-and-language Gui Liangke, Wang Borui, Huang Qiuyuan, Hauptmann Alex, Bisk Yonatan, Gao Jianfeng
- Audio-oriented Multimodal Machine Comprehension: Task, Dataset And Model Huang Zhiqi, Liu Fenglin, Wu Xian, Ge Shen, Wang Helin, Fan Wei, Zou Yuexian
- SILT: Efficient Transformer Training For Inter-lingual Inference Huertas-tato Javier, Martín Alejandro, Camacho David
- DS-TOD: Efficient Domain Specialization For Task Oriented Dialog Hung Chia-chien, Lauscher Anne, Ponzetto Simone Paolo, Glavaš Goran
- Retraining Distilbert For A Voice Shopping Assistant By Using Universal Dependencies Jayarao Pratik, Sharma Arpit
- Improving Non-autoregressive Generation With Mixup Training Jiang Ting, Huang Shaohan, Zhang Zihan, Wang Deqing, Zhuang Fuzhen, Wei Furu, Huang Haizhen, Zhang Liangjie, Zhang Qi
- Inducing Transformer's Compositional Generalization Ability Via Auxiliary Sequence Prediction Tasks Jiang Yichen, Bansal Mohit
- Numgpt: Improving Numeracy Ability Of Generative Pre-trained Models Jin Zhihua, Jiang Xin, Wang Xingbo, Liu Qun, Wang Yong, Ren Xiaozhe, Qu Huamin
- Nmt5 -- Is Parallel Data Still Relevant For Pre-training Massively Multilingual Language Models? Kale Mihir, Siddhant Aditya, Constant Noah, Johnson Melvin, Al-rfou Rami, Xue Linting
- Distilling Large Language Models Into Tiny And Effective Students Using Pqrnn Kaliamoorthi Prabhu, Siddhant Aditya, Li Edward, Johnson Melvin
- Tiny Neural Models For Seq2seq Kandoor Arun
- Text-free Prosody-aware Generative Spoken Language Modeling Kharitonov Eugene, Lee Ann, Polyak Adam, Adi Yossi, Copet Jade, Lakhotia Kushal, Nguyen Tu-anh, Rivière Morgane, Mohamed Abdelrahman, Dupoux Emmanuel, Hsu Wei-ning
- L-verse: Bidirectional Generation Between Image And Text Kim Taehoon, Song Gwangmo, Lee Sihaeng, Kim Sangyun, Seo Yewon, Lee Soonyoung, Kim Seung Hwan, Lee Honglak, Bae Kyunghoon
- BERT Busters: Outlier Dimensions That Disrupt Transformers Kovaleva Olga, Kulshreshtha Saurabh, Rogers Anna, Rumshisky Anna
- Hurdles To Progress In Long-form Question Answering Krishna Kalpesh, Roy Aurko, Iyyer Mohit
- What BERT Based Language Models Learn In Spoken Transcripts: An Empirical Study Kumar Ayush, Sundararaman Mukuntha Narayanan, Vepa Jithendra
- Layered Gradient Accumulation And Modular Pipeline Parallelism: Fast And Efficient Training Of Large Language Models Lamy-poirier Joel
- When Attention Meets Fast Recurrence: Training Language Models With Reduced Compute Lei Tao
- Exploiting Sentence-level Representations For Passage Ranking Leonhardt Jurek, Beringer Fabian, Anand Avishek
- Self-supervised Bot Play For Conversational Recommendation With Justifications Li Shuyang, Majumder Bodhisattwa Prasad, Mcauley Julian
- VX2TEXT: End-to-end Learning Of Video-based Text Generation From Multimodal Inputs Lin Xudong, Bertasius Gedas, Wang Jue, Chang Shih-fu, Parikh Devi, Torresani Lorenzo
- Pushing Paraphrase Away From Original Sentence: A Multi-round Paraphrase Generation Approach Lin Zhe, Wan Xiaojun
- What Makes Good In-context Examples For GPT-3? Liu Jiachang, Shen Dinghan, Zhang Yizhe, Dolan Bill, Carin Lawrence, Chen Weizhu
- X-METRA-ADA: Cross-lingual Meta-transfer Learning Adaptation To Natural Language Understanding And Question Answering M'hamdi Meryem, Kim Doo Soon, Dernoncourt Franck, Bui Trung, Ren Xiang, May Jonathan
- Direct Simultaneous Speech-to-speech Translation With Variational Monotonic Multihead Attention Ma Xutai, Gong Hongyu, Liu Danni, Lee Ann, Tang Yun, Chen Peng-jen, Hsu Wei-ning, Koehn Phillip, Pino Juan
- Local And Global Context-based Pairwise Models For Sentence Ordering Manku Ruskin Raj, Paul Aditya Jyoti
- Towards Retrieval-based Conversational Recommendation Manzoor Ahtsham, Jannach Dietmar
- Few-shot Self-rationalization With Natural Language Prompts Marasović Ana, Beltagy Iz, Downey Doug, Peters Matthew E.
- Schema-guided Paradigm For Zero-shot Dialog Mehri Shikib, Eskenazi Maxine
- COCO-LM: Correcting And Contrasting Text Sequences For Language Model Pretraining Meng Yu, Xiong Chenyan, Bajaj Payal, Tiwary Saurabh, Bennett Paul, Han Jiawei, Song Xia
- Hierarchical Transformers Are More Efficient Language Models Nawrot Piotr, Tworkowski Szymon, Tyrolski Michał, Kaiser Łukasz, Wu Yuhuai, Szegedy Christian, Michalewski Henryk
- MLP Architectures For Vision-and-language Modeling: An Empirical Study Nie Yixin, Li Linjie, Gan Zhe, Wang Shuohang, Zhu Chenguang, Zeng Michael, Liu Zicheng, Bansal Mohit, Wang Lijuan
- Cascading Adaptors To Leverage English Data To Improve Performance Of Question Answering For Low-resource Languages Pandya Hariom A., Ardeshna Bhavik, Bhatt Brijesh S.
- PENELOPIE: Enabling Open Information Extraction For The Greek Language Through Machine Translation Papadopoulos Dimitris, Papadakis Nikolaos, Matsatsinis Nikolaos
- Episodic Transformer For Vision-and-language Navigation Pashevich Alexander, Schmid Cordelia, Sun Chen
- Crossfit: A Few-shot Learning Challenge For Cross-task Generalization In NLP Qinyuan Ye, Bill Yuchen Lin, Xiang Ren
- Grid Search Hyperparameter Benchmarking Of BERT, ALBERT, And Longformer On Duorc Quijano Alex John, Nguyen Sam, Ordonez Juanita
- Prompt Programming For Large Language Models: Beyond The Few-shot Paradigm Reynolds Laria, Mcdonell Kyle
- Leveraging The Inductive Bias Of Large Language Models For Abstract Textual Reasoning Rytting Christopher Michael, Wingate David
- End-to-end Training Of Neural Retrievers For Open-domain Question Answering Sachan Devendra Singh, Patwary Mostofa, Shoeybi Mohammad, Kant Neel, Ping Wei, Hamilton William L, Catanzaro Bryan
- Challenges In Procedural Multimodal Machine Comprehension: A Novel Way To Benchmark Sahu Pritish, Sikka Karan, Divakaran Ajay
- Towards Solving Multimodal Comprehension Sahu Pritish, Sikka Karan, Divakaran Ajay
- Rome Was Built In 1776: A Case Study On Factual Correctness In Knowledge-grounded Response Generation Santhanam Sashank, Hedayatnia Behnam, Gella Spandana, Padmakumar Aishwarya, Kim Seokhwan, Liu Yang, Hakkani-tur Dilek
- CPT: A Pre-trained Unbalanced Transformer For Both Chinese Language Understanding And Generation Shao Yunfan, Geng Zhichao, Liu Yitao, Dai Junqi, Yan Hang, Yang Fei, Zhe Li, Bao Hujun, Qiu Xipeng
- Adapting Long Context NLM For ASR Rescoring In Conversational Agents Shenoy Ashish, Bodapati Sravan, Sunkara Monica, Ronanki Srikanth, Kirchhoff Katrin
- Txt: Crossmodal End-to-end Learning With Transformers Steitz Jan-martin O., Pfeiffer Jonas, Gurevych Iryna, Roth Stefan
- Tacl: Improving BERT Pre-training With Token-aware Contrastive Learning Su Yixuan, Liu Fangyu, Meng Zaiqiao, Lan Tian, Shu Lei, Shareghi Ehsan, Collier Nigel
- NVIDIA Nemo Neural Machine Translation Systems For English-german And English-russian News And Biomedical Tasks At WMT21 Subramanian Sandeep, Hrinchuk Oleksii, Adams Virginia, Kuchaiev Oleksii
- Know Deeper: Knowledge-conversation Cyclic Utilization Mechanism For Open-domain Dialogue Generation Sun Yajing, Hu Yue, Xing Luxi, Xie Yuqiang, Wei Xiangpeng
- Multilingual Translation Via Grafting Pre-trained Language Models Sun Zewei, Wang Mingxuan, Li Lei
- Data Processing Matters: Srph-konvergen Ai's Machine Translation System For WMT'21 Sutawika Lintang, Cruz Jan Christian Blaise
- Contextualized Scene Imagination For Generative Commonsense Reasoning Wang Peifeng, Zamora Jonathan, Liu Junfeng, Ilievski Filip, Chen Muhao, Ren Xiang
- List: Lite Prompted Self-training Makes Parameter-efficient Few-shot Learners Wang Yaqing, Mukherjee Subhabrata, Liu Xiaodong, Gao Jing, Awadallah Ahmed Hassan, Gao Jianfeng
- Codet5: Identifier-aware Unified Pre-trained Encoder-decoder Models For Code Understanding And Generation Wang Yue, Wang Weishi, Joty Shafiq, Hoi Steven C. H.
- VAULT: Variable Unified Long Text Representation For Machine Reading Comprehension Wen Haoyang, Ferritto Anthony, Ji Heng, Florian Radu, Sil Avirup
- Graph-free Multi-hop Reading Comprehension: A Select-to-guide Strategy Wu Bohong, Zhang Zhuosheng, Zhao Hai
- Improving Low-resource Reading Comprehension Via Cross-lingual Transposition Rethinking Wu Gaochen, Xu Bin, Qin Yuxin, Kong Fei, Liu Bangchang, Zhao Hongwen, Chang Dejie
- AI Chains: Transparent And Controllable Human-ai Interaction By Chaining Large Language Model Prompts Wu Tongshuang, Terry Michael, Cai Carrie J.
- Prefix-tuning: Optimizing Continuous Prompts For Generation Xiang Lisa Li, Percy Liang
- P-tuning V2: Prompt Tuning Can Be Comparable To Fine-tuning Universally Across Scales And Tasks Xiao Liu, Kaixuan Ji, Yicheng Fu, Weng Lam Tam, Zhengxiao Du, Zhilin Yang, Jie Tang
- Training Large-scale News Recommenders With Pretrained Language Models In The Loop Xiao Shitao, Liu Zheng, Shao Yingxia, Di Tao, Xie Xing
- Exploring Story Generation With Multi-task Objectives In Variational Autoencoders Xie Zhuohan, Cohn Trevor, Lau Jey Han
- Attention-guided Generative Models For Extractive Question Answering Xu Peng, Liang Davis, Huang Zhiheng, Xiang Bing
- An Empirical Study Of GPT-3 For Few-shot Knowledge-based VQA Yang Zhengyuan, Gan Zhe, Wang Jianfeng, Hu Xiaowei, Lu Yumao, Liu Zicheng, Wang Lijuan
- Gpt3mix: Leveraging Large-scale Language Models For Text Augmentation Yoo Kang Min, Park Dongju, Kang Jaewook, Lee Sang-woo, Park Woomyeong
- Turning Tables: Generating Examples From Semi-structured Tables For Endowing Language Models With Reasoning Skills Yoran Ori, Talmor Alon, Berant Jonathan
- Leveraging Linguistic Coordination In Reranking N-best Candidates For End-to-end Response Selection Using BERT Yu Mingzhi, Litman Diane
- A Knowledge-grounded Dialog System Based On Pre-trained Language Models Zhang Weijie, Chen Jiaoxuan, Wu Haipang, Wan Sanhui, Li Gongfeng
- Learning To Rehearse In Long Sequence Memorization Zhang Zhu, Zhou Chang, Ma Jianxin, Lin Zhijie, Zhou Jingren, Yang Hongxia, Zhao Zhou
- Calibrate Before Use: Improving Few-shot Performance Of Language Models Zhao Tony Z., Wallace Eric, Feng Shi, Klein Dan, Singh Sameer
- Self-guided Curriculum Learning For Neural Machine Translation Zhou Lei, Ding Liang, Duh Kevin, Watanabe Shinji, Sasano Ryohei, Takeda Koichi
- LAFITE: Towards Language-free Training For Text-to-image Generation Zhou Yufan, Zhang Ruiyi, Chen Changyou, Li Chunyuan, Tensmeyer Chris, Yu Tong, Gu Jiuxiang, Xu Jinhui, Sun Tong
- Taming Sparsely Activated Transformer With Stochastic Experts Zuo Simiao, Liu Xiaodong, Jiao Jian, Kim Young Jin, Hassan Hany, Zhang Ruofei, Zhao Tuo, Gao Jianfeng
- Palm: Scaling Language Modeling With Pathways Aakanksha Chowdhery, Sharan Narang, Jacob Devlin, Maarten Bosma, Gaurav Mishra, Adam Roberts, Paul Barham, Hyung Won Chung, Charles Sutton, Sebastian Gehrmann, Parker Schuh, Kensen Shi, Sasha Tsvyashchenko, Joshua Maynez, Abhishek Rao, Parker Barnes, Yi Tay, Noam Shazeer, Vinodkumar Prabhakaran, Emily Reif, Nan Du, Ben Hutchinson, Reiner Pope, James Bradbury, Jacob Austin, Michael Isard, Guy Gur-ari, Pengcheng Yin, Toju Duke, Anselm Levskaya, Sanjay Ghemawat, Sunipa Dev, Henryk Michalewski, Xavier Garcia, Vedant Misra, Kevin Robinson, Liam Fedus, Denny Zhou, Daphne Ippolito, David Luan, Hyeontaek Lim, Barret Zoph, Alexander Spiridonov, Ryan Sepassi, David Dohan, Shivani Agrawal, Mark Omernick, Andrew M. Dai, Thanumalayan Sankaranarayana Pillai, Marie Pellat, Aitor Lewkowycz, Erica Moreira, Rewon Child, Oleksandr Polozov, Katherine Lee, Zongwei Zhou, Xuezhi Wang, Brennan Saeta, Mark Diaz, Orhan Firat, Michele Catasta, Jason Wei, Kathy Meier-hellstern, Douglas Eck, Jeff Dean, Slav Petrov, Noah Fiedel
- Gpt-3-driven Pedagogical Agents For Training Children's Curious Question-asking Skills Abdelghani Rania, Wang Yen-hsiang, Yuan Xingdi, Wang Tong, Lucas Pauline, Sauzéon Hélène, Oudeyer Pierre-yves
- Making Large Language Models Interactive: A Pioneer Study On Supporting Complex Information-seeking Tasks With Implicit Constraints Ahmadvand Ali, Arabzadeh Negar, Kiseleva Julia, Sanz Patricio Figueroa, Deng Xin, Jauhar Sujay, Gamon Michael, Agichtein Eugene, Friend Ned, Aniruddha
- Few-shot Training Llms For Project-specific Code-summarization Ahmed Toufique, Devanbu Premkumar
- Multimodal Sequential Generative Models For Semi-supervised Language Instruction Following Akuzawa Kei, Iwasawa Yusuke, Matsuo Yutaka
- End-to-end Multimodal Representation Learning For Video Dialog Alamri Huda, Bilic Anthony, Hu Michael, Beedu Apoorva, Essa Irfan
- Despite "super-human" Performance, Current Llms Are Unsuited For Decisions About Ethics And Safety Albrecht Joshua, Kitanidis Ellie, Fetterman Abraham J.
- Prompting As Probing: Using Language Models For Knowledge Base Construction Alivanistos Dimitrios, Santamaría Selene Báez, Cochez Michael, Kalo Jan-christoph, Van Krieken Emile, Thanapalasingam Thiviyan
- Query Refinement Prompts For Closed-book Long-form Question Answering Amplayo Reinald Kim, Webster Kellie, Collins Michael, Das Dipanjan, Narayan Shashi
- Socratic Models: Composing Zero-shot Multimodal Reasoning With Language Andy Zeng, Maria Attarian, Brian Ichter, Krzysztof Choromanski, Adrian Wong, Stefan Welker, Federico Tombari, Aveek Purohit, Michael Ryoo, Vikas Sindhwani, Johnny Lee, Vincent Vanhoucke, Pete Florence
- GLM-130B: An Open Bilingual Pre-trained Model Aohan Zeng, Xiao Liu, Zhengxiao Du, Zihan Wang, Hanyu Lai, Ming Ding, Zhuoyi Yang, Yifan Xu, Wendi Zheng, Xiao Xia, Weng Lam Tam, Zixuan Ma, Yufei Xue, Jidong Zhai, Wenguang Chen, Peng Zhang, Yuxiao Dong, Jie Tang
- Leveraging Pre-trained Language Models For Conversational Information Seeking From Text Bellan Patrizio, Dragoni Mauro, Ghidini Chiara
- Vector-quantized Input-contextualized Soft Prompts For Natural Language Understanding Bhardwaj Rishabh, Saha Amrita, Hoi Steven C. H., Poria Soujanya
- Team ÚFAL At CMCL 2022 Shared Task: Figuring Out The Correct Recipe For Predicting Eye-tracking Features Using Pretrained Language Models Bhattacharya Sunit, Kumar Rishu, Bojar Ondrej
- Looking For A Handsome Carpenter! Debiasing GPT-3 Job Advertisements Borchers Conrad, Gala Dalia Sara, Gilburt Benjamin, Oravkin Eduard, Bounsi Wilfried, Asano Yuki M., Kirk Hannah Rose
- Understanding Performance Of Long-document Ranking Models Through Comprehensive Evaluation And Leaderboarding Boytsov Leonid, Akinpelu David, Lin Tianyi, Gao Fangwei, Zhao Yutian, Huang Jeffrey, Katyal Nipun, Nyberg Eric
- HIBRIDS: Attention With Hierarchical Biases For Structure-aware Long Document Summarization Cao Shuyang, Wang Lu
- Speechprompt: An Exploration Of Prompt Tuning On Generative Spoken Language Model For Speech Processing Tasks Chang Kai-wei, Tseng Wei-cheng, Li Shang-wen, Lee Hung-yi
- Data Curation Alone Can Stabilize In-context Learning Chang Ting-yun, Jia Robin
- Character-centric Story Visualization Via Visual Planning And Token Alignment Chen Hong, Han Rujun, Wu Te-lin, Nakayama Hideki, Peng Nanyun
- Controllable Text Generation With Language Constraints Chen Howard, Li Huihan, Chen Danqi, Narasimhan Karthik
- Leveraging Natural Supervision For Language Representation Learning And Generation Chen Mingda
- Murag: Multimodal Retrieval-augmented Generator For Open Question Answering Over Images And Text Chen Wenhu, Hu Hexiang, Chen Xi, Verga Pat, Cohen William W.
- Pali: A Jointly-scaled Multilingual Language-image Model Chen Xi, Wang Xiao, Changpinyo Soravit, Piergiovanni Aj, Padlewski Piotr, Salz Daniel, Goodman Sebastian, Grycner Adam, Mustafa Basil, Beyer Lucas, Kolesnikov Alexander, Puigcerver Joan, Ding Nan, Rong Keran, Akbari Hassan, Mishra Gaurav, Xue Linting, Thapliyal Ashish, Bradbury James, Kuo Weicheng, Seyedhosseini Mojtaba, Jia Chao, Ayan Burcu Karagol, Riquelme Carlos, Steiner Andreas, Angelova Anelia, Zhai Xiaohua, Houlsby Neil, Soricut Radu
- Curriculum: A Broad-coverage Benchmark For Linguistic Phenomena In Natural Language Understanding Chen Zeming, Gao Qiyue
- Is Multiwoz A Solved Task? An Interactive TOD Evaluation Framework With User Simulator Cheng Qinyuan, Li Linyang, Quan Guofeng, Gao Feng, Mou Xiaofeng, Qiu Xipeng
- Neural Machine Translation With Contrastive Translation Memories Cheng Xin, Gao Shen, Liu Lemao, Zhao Dongyan, Yan Rui
- On The Evaluation Of Answer-agnostic Paragraph-level Multi-question Generation Chowdhury Jishnu Ray, Mahata Debanjan, Caragea Cornelia
- Language-family Adapters For Low-resource Multilingual Neural Machine Translation Chronopoulou Alexandra, Stojanovski Dario, Fraser Alexander
- Scaling Instruction-finetuned Language Models Chung Hyung Won, Hou Le, Longpre Shayne, Zoph Barret, Tay Yi, Fedus William, Li Yunxuan, Wang Xuezhi, Dehghani Mostafa, Brahma Siddhartha, Webson Albert, Gu Shixiang Shane, Dai Zhuyun, Suzgun Mirac, Chen Xinyun, Chowdhery Aakanksha, Castro-ros Alex, Pellat Marie, Robinson Kevin, Valter Dasha, Narang Sharan, Mishra Gaurav, Yu Adams, Zhao Vincent, Huang Yanping, Dai Andrew, Yu Hongkun, Petrov Slav, Chi Ed H., Dean Jeff, Devlin Jacob, Roberts Adam, Zhou Denny, Le Quoc V., Wei Jason
- One Agent To Rule Them All: Towards Multi-agent Conversational AI Clarke Christopher, Peper Joseph Joshua, Krishnamurthy Karthik, Talamonti Walter, Leach Kevin, Lasecki Walter, Kang Yiping, Tang Lingjia, Mars Jason
- Building The Intent Landscape Of Real-world Conversational Corpora With Extractive Question-answering Transformers Corbeil Jean-philippe, Li Mia Taige, Ghavidel Hadi Abdi
- M6-rec: Generative Pretrained Language Models Are Open-ended Recommender Systems Cui Zeyu, Ma Jianxin, Zhou Chang, Zhou Jingren, Yang Hongxia
- The Future Is Different: Large Pre-trained Language Models Fail In Prediction Tasks Cvejoski Kostadin, Sánchez Ramsés J., Ojeda César
- Promptagator: Few-shot Dense Retrieval From 8 Examples Dai Zhuyun, Zhao Vincent Y., Ma Ji, Luan Yi, Ni Jianmo, Lu Jing, Bakalov Anton, Guu Kelvin, Hall Keith B., Chang Ming-wei
- Fido: Fusion-in-decoder Optimized For Stronger Performance And Faster Inference De Jong Michiel, Zemlyanskiy Yury, Ainslie Joshua, Fitzgerald Nicholas, Sanghai Sumit, Sha Fei, Cohen William
- Pre-training Transformer Models With Sentence-level Objectives For Answer Sentence Selection Di Liello Luca, Garg Siddhant, Soldaini Luca, Moschitti Alessandro
- Black-box Prompt Learning For Pre-trained Language Models Diao Shizhe, Huang Zhichao, Xu Ruijia, Li Xuechun, Lin Yong, Zhou Xiao, Zhang Tong
- Write And Paint: Generative Vision-language Models Are Unified Modal Learners Diao Shizhe, Zhou Wangchunshu, Zhang Xinsong, Wang Jiawei
- Star: Bootstrapping Reasoning With Reasoning Eric Zelikman, Yuhuai Wu, Jesse Mu, Noah D. Goodman
- Leveraging Pre-trained Models For Failure Analysis Triplets Generation Ezukwoke Kenneth, Hoayek Anis, Batton-hubert Mireille, Boucher Xavier, Gounet Pascal, Adrian Jerome
- Llmeffichecker: Understanding And Testing Efficiency Degradation Of Large Language Models Feng Xiaoning, Han Xiaohong, Chen Simin, Yang Wei
- Topic-aware Response Generation In Task-oriented Dialogue With Unstructured Knowledge Access Feng Yue, Lampouras Gerasimos, Iacobacci Ignacio
- Learn To Remember: Transformer With Recurrent Memory For Document-level Machine Translation Feng Yukun, Li Feng, Song Ziang, Zheng Boyuan, Koehn Philipp
- GPTQ: Accurate Post-training Quantization For Generative Pre-trained Transformers Frantar Elias, Ashkboos Saleh, Hoefler Torsten, Alistarh Dan
- Adapterbias: Parameter-efficient Token-dependent Representation Shift For Adapters In NLP Tasks Fu Chin-lun, Chen Zih-ching, Lee Yun-ru, Lee Hung-yi
- Complexity-based Prompting For Multi-step Reasoning Fu Yao, Peng Hao, Sabharwal Ashish, Clark Peter, Khot Tushar
- MIGA: A Unified Multi-task Generation Framework For Conversational Text-to-sql Fu Yingwen, Ou Wenjie, Yu Zhou, Lin Yue
- Normsage: Multi-lingual Multi-cultural Norm Discovery From Conversations On-the-fly Fung Yi R., Chakraborty Tuhin, Guo Hao, Rambow Owen, Muresan Smaranda, Ji Heng
- Re2g: Retrieve, Rerank, Generate Glass Michael, Rossiello Gaetano, Chowdhury Md Faisal Mahbub, Naik Ankita Rajaram, Cai Pengshan, Gliozzo Alfio
- Does Entity Abstraction Help Generative Transformers Reason? Gontier Nicolas, Reddy Siva, Pal Christopher
- Massively Multilingual Lexical Specialization Of Multilingual Transformers Green Tommaso, Ponzetto Simone Paolo, Glavaš Goran
- Continuous Decomposition Of Granularity For Neural Paraphrase Generation Gu Xiaodong, Zhang Zhaowei, Lee Sang-woo, Yoo Kang Min, Ha Jung-woo
- Transkimmer: Transformer Learns To Layer-wise Skim Guan Yue, Li Zhengyi, Leng Jingwen, Lin Zhouhan, Guo Minyi
- Unixcoder: Unified Cross-modal Pre-training For Code Representation Guo Daya, Lu Shuai, Duan Nan, Wang Yanlin, Zhou Ming, Yin Jian
- STI: Turbocharge NLP Inference At The Edge Via Elastic Pipelining Guo Liwei, Choe Wonkyo, Lin Felix Xiaozhu
- Show, Don't Tell: Demonstrations Outperform Descriptions For Schema-guided Task-oriented Dialogue Gupta Raghav, Lee Harrison, Zhao Jeffrey, Rastogi Abhinav, Cao Yuan, Wu Yonghui
- LUNA: Language Understanding With Number Augmentations On Transformers Via Number Plugins And Pre-training Han Hongwei, Xu Jialiang, Zhou Mengyu, Shao Yijia, Han Shi, Zhang Dongmei
- How Much Does Attention Actually Attend? Questioning The Importance Of Attention In Pretrained Transformers Hassid Michael, Peng Hao, Rotem Daniel, Kasai Jungo, Montero Ivan, Smith Noah A., Schwartz Roy
- Robust Dialogue State Tracking With Weak Supervision And Sparse Data Heck Michael, Lubis Nurul, Van Niekerk Carel, Feng Shutong, Geishauser Christian, Lin Hsien-chin, Gašić Milica
- Syntax-guided Localized Self-attention By Constituency Syntactic Distance Hou Shengyuan, Kai Jushi, Xue Haotian, Zhu Bingyu, Yuan Bo, Huang Longtao, Wang Xinbing, Lin Zhouhan
- Promptcap: Prompt-guided Task-aware Image Captioning Hu Yushi, Hua Hang, Yang Zhengyuan, Shi Weijia, Smith Noah A, Luo Jiebo
- PLANET: Dynamic Content Planning In Autoregressive Transformers For Long-form Text Generation Hu Zhe, Chan Hou Pong, Liu Jiachen, Xiao Xinyan, Wu Hua, Huang Lifu
- REVEAL: Retrieval-augmented Visual-language Pre-training With Multi-source Multimodal Knowledge Memory Hu Ziniu, Iscen Ahmet, Sun Chen, Wang Zirui, Chang Kai-wei, Sun Yizhou, Schmid Cordelia, Ross David A., Fathi Alireza
- Autoregressive Entity Generation For End-to-end Task-oriented Dialog Huang Guanhuan, Quan Xiaojun, Wang Qifan
- Inner Monologue: Embodied Reasoning Through Planning With Language Models Huang Wenlong, Xia Fei, Xiao Ted, Chan Harris, Liang Jacky, Florence Pete, Zeng Andy, Tompson Jonathan, Mordatch Igor, Chebotar Yevgen, Sermanet Pierre, Brown Noah, Jackson Tomas, Luu Linda, Levine Sergey, Hausman Karol, Ichter Brian
- MCP: Self-supervised Pre-training For Personalized Chatbots With Multi-level Contrastive Sampling Huang Zhaoheng, Dou Zhicheng, Zhu Yutao, Ma Zhengyi
- Clues Before Answers: Generation-enhanced Multiple-choice QA Huang Zixian, Wu Ao, Zhou Jiaying, Gu Yu, Zhao Yue, Cheng Gong
- From Human Days To Machine Seconds: Automatically Answering And Generating Machine Learning Final Exams Iddo Drori, Sarah J. Zhang, Reece Shuttleworth, Sarah Zhang, Keith Tyser, Zad Chin, Pedro Lantigua, Saisamrit Surbehera, Gregory Hunter, Derek Austin, Leonard Tang, Yann Hicke, Sage Simhon, Sathwik Karnik, Darnell Granberry, Madeleine Udell
- Efficient Long-text Understanding With Short-text Models Ivgi Maor, Shaham Uri, Berant Jonathan
- Can Large Language Models Truly Understand Prompts? A Case Study With Negated Prompts Jang Joel, Ye Seonghyeon, Seo Minjoon
- Litetransformersearch: Training-free Neural Architecture Search For Efficient Language Models Javaheripi Mojan, De Rosa Gustavo H., Mukherjee Subhabrata, Shah Shital, Religa Tomasz L., Mendes Caio C. T., Bubeck Sebastien, Koushanfar Farinaz, Dey Debadeepta
- RHO (ρ): Reducing Hallucination In Open-domain Dialogues With Knowledge Grounding Ji Ziwei, Liu Zihan, Lee Nayeon, Yu Tiezheng, Wilie Bryan, Zeng Min, Fung Pascale
- Visual Prompt Tuning Jia Menglin, Tang Luming, Chen Bor-chun, Cardie Claire, Belongie Serge, Hariharan Bharath, Lim Ser-nam
- Evaluating And Inducing Personality In Pre-trained Language Models Jiang Guangyuan, Xu Manjie, Zhu Song-chun, Han Wenjuan, Zhang Chi, Zhu Yixin
- Pseudo-q: Generating Pseudo Language Queries For Visual Grounding Jiang Haojun, Lin Yuanze, Han Dongchen, Song Shiji, Huang Gao
- Training Compute-optimal Large Language Models Jordan Hoffmann, Sebastian Borgeaud, Arthur Mensch, Elena Buchatskaya, Trevor Cai, Eliza Rutherford, Diego De Las Casas, Lisa Anne Hendricks, Johannes Welbl, Aidan Clark, Tom Hennigan, Eric Noland, Katie Millican, George Van Den Driessche, Bogdan Damoc, Aurelia Guy, Simon Osindero, Karen Simonyan, Erich Elsen, Jack W. Rae, Oriol Vinyals, Laurent Sifre
- Latency Adjustable Transformer Encoder For Language Understanding Kachuee Sajjad, Sharifkhani Mohammad
- Action-gpt: Leveraging Large-scale Language Models For Improved And Generalized Action Generation Kalakonda Sai Shashank, Maheshwari Shubh, Sarvadevabhatla Ravi Kiran
- A New Path: Scaling Vision-and-language Navigation With Synthetic Instructions And Imitation Learning Kamath Aishwarya, Anderson Peter, Wang Su, Koh Jing Yu, Ku Alexander, Waters Austin, Yang Yinfei, Baldridge Jason, Parekh Zarana
- Fie: Building A Global Probability Space By Leveraging Early Fusion In Encoder For Open-domain Question Answering Kedia Akhil, Zaidi Mohd Abbas, Lee Haejun
- Self-generated In-context Learning: Leveraging Auto-regressive Language Models As A Demonstration Generator Kim Hyuhng Joon, Cho Hyunsoo, Kim Junyeob, Kim Taeuk, Yoo Kang Min, Lee Sang-goo
- Validating Large Language Models With Relm Kuchnik Michael, Smith Virginia, Amvrosiadis George
- Emergent Linguistic Structures In Neural Networks Are Fragile La Malfa Emanuele, Wicker Matthew, Kwiatkowska Marta
- Interactive Code Generation Via Test-driven User-intent Formalization Lahiri Shuvendu K., Fakhoury Sarah, Naik Aaditya, Sakkas Georgios, Chakraborty Saikat, Musuvathi Madanlal, Choudhury Piali, Von Veh Curtis, Inala Jeevana Priya, Wang Chenglong, Gao Jianfeng
- Evaluation Of Synthetic Datasets For Conversational Recommender Systems Lara Harsh, Tiwari Manoj
- Hard Gate Knowledge Distillation -- Leverage Calibration For Robust And Reliable Language Model Lee Dongkyu, Tian Zhiliang, Zhao Yingxiu, Cheung Ka Chun, Zhang Nevin L.
- Littlebird: Efficient Faster & Longer Transformer For Question Answering Lee Minchul, Han Kijong, Shin Myeong Cheol
- Dialogcc: An Automated Pipeline For Creating High-quality Multi-modal Dialogue Dataset Lee Young-jun, Ko Byungsoo, Kim Han-gyu, Hyeon Jonghwan, Choi Ho-jin
- Simple Recurrence Improves Masked Language Models Lei Tao, Tian Ran, Bastings Jasmijn, Parikh Ankur P.
- A Survey On Retrieval-augmented Text Generation Li Huayang, Su Yixuan, Cai Deng, Wang Yan, Liu Lemao
- Counterfactual Reasoning: Do Language Models Need World Knowledge For Causal Understanding? Li Jiaxuan, Yu Lang, Ettinger Allyson
- ELMER: A Non-autoregressive Pre-trained Language Model For Efficient And Effective Text Generation Li Junyi, Tang Tianyi, Zhao Wayne Xin, Nie Jian-yun, Wen Ji-rong
- Explanations From Large Language Models Make Small Reasoners Better Li Shiyang, Chen Jianshu, Shen Yelong, Chen Zhiyu, Zhang Xinlu, Li Zekun, Wang Hong, Qian Jing, Peng Baolin, Mao Yi, Chen Wenhu, Yan Xifeng
- Pre-trained Language Models For Interactive Decision-making Li Shuang, Puig Xavier, Paxton Chris, Du Yilun, Wang Clinton, Fan Linxi, Chen Tao, Huang De-an, Akyürek Ekin, Anandkumar Anima, Andreas Jacob, Mordatch Igor, Torralba Antonio, Zhu Yuke
- Evaluating Psychological Safety Of Large Language Models Li Xingxuan, Li Yutong, Qiu Lin, Joty Shafiq, Bing Lidong
- VALHALLA: Visual Hallucination For Machine Translation Li Yi, Panda Rameswar, Kim Yoon, Chen Chun-fu, Feris Rogerio, Cox David, Vasconcelos Nuno
- Accelerating Attention Through Gradient-based Learned Runtime Pruning Li Zheng, Ghodrati Soroush, Yazdanbakhsh Amir, Esmaeilzadeh Hadi, Kang Mingu
- Contextual Transformer For Offline Meta Reinforcement Learning Lin Runji, Li Ye, Feng Xidong, Zhang Zhaowei, Fung Xian Hong Wu, Zhang Haifeng, Wang Jun, Du Yali, Yang Yaodong
- A Simple Meta-learning Paradigm For Zero-shot Intent Classification With Mixture Attention Mechanism Liu Han, Zhao Siyang, Zhang Xiaotong, Zhang Feng, Sun Junjie, Yu Hong, Zhang Xianchao
- Towards Better Few-shot And Finetuning Performance With Forgetful Causal Language Models Liu Hao, Geng Xinyang, Lee Lisa, Mordatch Igor, Levine Sergey, Narang Sharan, Abbeel Pieter
- Building Markovian Generative Architectures Over Pretrained LM Backbones For Efficient Task-oriented Dialog Systems Liu Hong, Cai Yucheng, Ou Zhijian, Huang Yi, Feng Junlan
- A Generative User Simulator With Gpt-based Architecture And Goal State Tracking For Reinforced Multi-domain Dialog Systems Liu Hong, Cai Yucheng, Ou Zhijian, Huang Yi, Feng Junlan
- Commitbart: A Large Pre-trained Model For Github Commits Liu Shangqing, Li Yanzhou, Xie Xiaofei, Liu Yang
- Dialogconv: A Lightweight Fully Convolutional Network For Multi-view Response Selection Liu Yongkang, Feng Shi, Gao Wei, Wang Daling, Zhang Yifei
- Can Large Language Models Reason About Medical Questions? Liévin Valentin, Hother Christoffer Egeberg, Motzfeldt Andreas Geert, Winther Ole
- Learn To Explain: Multimodal Reasoning Via Thought Chains For Science Question Answering Lu Pan, Mishra Swaroop, Xia Tony, Qiu Liang, Chang Kai-wei, Zhu Song-chun, Tafjord Oyvind, Clark Peter, Kalyan Ashwin
- Reacc: A Retrieval-augmented Code Completion Framework Lu Shuai, Duan Nan, Han Hojae, Guo Daya, Hwang Seung-won, Svyatkovskiy Alexey
- PERFECT: Prompt-free And Efficient Few-shot Learning With Language Models Mahabadi Rabeeh Karimi, Zettlemoyer Luke, Henderson James, Saeidi Marzieh, Mathias Lambert, Stoyanov Veselin, Yazdani Majid
- Edit5: Semi-autoregressive Text-editing With T5 Warm-start Mallinson Jonathan, Adamek Jakub, Malmi Eric, Severyn Aliaksei
- Tangobert: Reducing Inference Cost By Using Cascaded Architecture Mamou Jonathan, Pereg Oren, Wasserblat Moshe, Schwartz Roy
- Mini-model Adaptation: Efficiently Extending Pretrained Models To New Languages Via Aligned Shallow Training Marchisio Kelly, Lewis Patrick, Chen Yihong, Artetxe Mikel
- LAD: Language Models As Data For Zero-shot Dialog Mehri Shikib, Altun Yasemin, Eskenazi Maxine
- DSI++: Updating Transformer Memory With New Documents Mehta Sanket Vaibhav, Gupta Jai, Tay Yi, Dehghani Mostafa, Tran Vinh Q., Rao Jinfeng, Najork Marc, Strubell Emma, Metzler Donald
- ZEROTOP: Zero-shot Task-oriented Semantic Parsing Using Large Language Models Mekala Dheeraj, Wolfe Jason, Roy Subhro
- Tuning Language Models As Training Data Generators For Augmentation-enhanced Few-shot Learning Meng Yu, Michalski Martin, Huang Jiaxin, Zhang Yu, Abdelzaher Tarek, Han Jiawei
- Do As I Can, Not As I Say: Grounding Language In Robotic Affordances Michael Ahn, Anthony Brohan, Noah Brown, Yevgen Chebotar, Omar Cortes, Byron David, Chelsea Finn, Chuyuan Fu, Keerthana Gopalakrishnan, Karol Hausman, Alex Herzog, Daniel Ho, Jasmine Hsu, Julian Ibarz, Brian Ichter, Alex Irpan, Eric Jang, Rosario Jauregui Ruano, Kyle Jeffrey, Sally Jesmonth, Nikhil J Joshi, Ryan Julian, Dmitry Kalashnikov, Yuheng Kuang, Kuang-huei Lee, Sergey Levine, Yao Lu, Linda Luu, Carolina Parada, Peter Pastor, Jornell Quiambao, Kanishka Rao, Jarek Rettinghouse, Diego Reyes, Pierre Sermanet, Nicolas Sievers, Clayton Tan, Alexander Toshev, Vincent Vanhoucke, Fei Xia, Ted Xiao, Peng Xu, Sichun Xu, Mengyuan Yan, Andy Zeng
- HELP ME THINK: A Simple Prompting Strategy For Non-experts To Create Customized Content With Models Mishra Swaroop, Nouri Elnaz
- Domain-specific Text Generation For Machine Translation Moslem Yasmin, Haque Rejwanul, Kelleher John D., Way Andy
- JASMINE: Arabic GPT Models For Few-shot Learning Nagoudi El Moatez Billah, Abdul-mageed Muhammad, Elmadany Abdelrahim, Inciarte Alcides Alcoba, Khondaker Md Tawkat Islam
- Large Language Models Are Reasoning Teachers Namgyu Ho, Laura Schmid, Se-young Yun
- Do Vision-and-language Transformers Learn Grounded Predicate-noun Dependencies? Nikolaus Mitja, Salin Emmanuelle, Ayache Stephane, Fourtassi Abdellah, Favre Benoit
- Neural Language Taskonomy: Which NLP Tasks Are The Most Predictive Of Fmri Brain Activity? Oota Subba Reddy, Arora Jashn, Agarwal Veeral, Marreddy Mounika, Gupta Manish, Surampudi Bapi Raju
- Follow-up Attention: An Empirical Study Of Developer And Neural Model Code Exploration Paltenghi Matteo, Pandita Rahul, Henley Austin Z., Ziegler Albert
- Holistic Evaluation Of Language Models Percy Liang, Rishi Bommasani, Tony Lee, Dimitris Tsipras, Dilara Soylu, Michihiro Yasunaga, Yian Zhang, Deepak Narayanan, Yuhuai Wu, Ananya Kumar, Benjamin Newman, Binhang Yuan, Bobby Yan, Ce Zhang, Christian Cosgrove, Christopher D. Manning, Christopher Ré, Diana Acosta-navas, Drew A. Hudson, Eric Zelikman, Esin Durmus, Faisal Ladhak, Frieda Rong, Hongyu Ren, Huaxiu Yao, Jue Wang, Keshav Santhanam, Laurel Orr, Lucia Zheng, Mert Yuksekgonul, Mirac Suzgun, Nathan Kim, Neel Guha, Niladri Chatterji, Omar Khattab, Peter Henderson, Qian Huang, Ryan Chi, Sang Michael Xie, Shibani Santurkar, Surya Ganguli, Tatsunori Hashimoto, Thomas Icard, Tianyi Zhang, Vishrav Chaudhary, William Wang, Xuechen Li, Yifan Mai, Yuhui Zhang, Yuta Koreeda
- Hypertuning: Toward Adapting Large Language Models Without Back-propagation Phang Jason, Mao Yi, He Pengcheng, Chen Weizhu
- Synchromesh: Reliable Code Generation From Pre-trained Language Models Poesia Gabriel, Polozov Oleksandr, Le Vu, Tiwari Ashish, Soares Gustavo, Meek Christopher, Gulwani Sumit
- Grips: Gradient-free, Edit-based Instruction Search For Prompting Large Language Models Prasad Archiki, Hase Peter, Zhou Xiang, Bansal Mohit
- Tegformer: Topic-to-essay Generation With Good Topic Coverage And High Text Coherence Qi Wang, Liu Rui, Zuo Yuan, Chen Yong, Zhang Dell
- A Self-paced Mixed Distillation Method For Non-autoregressive Generation Qi Weizhen, Gong Yeyun, Shen Yelong, Jiao Jian, Yan Yu, Li Houqiang, Zhang Ruofei, Chen Weizhu, Duan Nan
- A Survey On In-context Learning Qingxiu Dong, Lei Li, Damai Dai, Ce Zheng, Jingyuan Ma, Rui Li, Heming Xia, Jingjing Xu, Zhiyong Wu, Baobao Chang, Xu Sun, Lei Li, Zhifang Sui
- Parallel Context Windows For Large Language Models Ratner Nir, Levine Yoav, Belinkov Yonatan, Ram Ori, Magar Inbal, Abend Omri, Karpas Ehud, Shashua Amnon, Leyton-brown Kevin, Shoham Yoav
- Leveraging Large Language Models For Multiple Choice Question Answering Robinson Joshua, Rytting Christopher Michael, Wingate David
- The Goldilocks Of Pragmatic Understanding: Fine-tuning Strategy Matters For Implicature Resolution By Llms Ruis Laura, Khan Akbir, Biderman Stella, Hooker Sara, Rocktäschel Tim, Grefenstette Edward
- Bird-eye Transformers For Text Generation Models Sha Lei, Song Yuhang, Yordanov Yordan, Salvatori Tommaso, Lukasiewicz Thomas
- Unintended Bias In Language Model-driven Conversational Recommendation Shen Tianshu, Li Jiaru, Bouadjenek Mohamed Reda, Mai Zheda, Sanner Scott
- Ebms Vs. CL: Exploring Self-supervised Visual Pretraining For Visual Question Answering Shevchenko Violetta, Abbasnejad Ehsan, Dick Anthony, Hengel Anton Van Den, Teney Damien
- XRICL: Cross-lingual Retrieval-augmented In-context Learning For Cross-lingual Text-to-sql Semantic Parsing Shi Peng, Zhang Rui, Bai He, Lin Jimmy
- Distilling Reasoning Capabilities Into Smaller Language Models Shridhar Kumar, Stolfo Alessandro, Sachan Mrinmaya
- Towards The Next 1000 Languages In Multilingual Machine Translation: Exploring The Synergy Between Supervised And Self-supervised Learning Siddhant Aditya, Bapna Ankur, Firat Orhan, Cao Yuan, Chen Mia Xu, Caswell Isaac, Garcia Xavier
- Ask Me Anything: A Simple Strategy For Prompting Language Models Simran Arora, Avanika Narayan, Mayee F. Chen, Laurel Orr, Neel Guha, Kush Bhatia, Ines Chami, Frederic Sala, Christopher Ré
- Augmenting Interpretable Models With Llms During Training Singh Chandan, Askari Armin, Caruana Rich, Gao Jianfeng
- Explaining Patterns In Data With Language Models Via Interpretable Autoprompting Singh Chandan, Morris John X., Aneja Jyoti, Rush Alexander M., Gao Jianfeng
- Large Language Models Encode Clinical Knowledge Singhal Karan, Azizi Shekoofeh, Tu Tao, Mahdavi S. Sara, Wei Jason, Chung Hyung Won, Scales Nathan, Tanwani Ajay, Cole-lewis Heather, Pfohl Stephen, Payne Perry, Seneviratne Martin, Gamble Paul, Kelly Chris, Scharli Nathaneal, Chowdhery Aakanksha, Mansfield Philip, Arcas Blaise Aguera Y, Webster Dale, Corrado Greg S., Matias Yossi, Chou Katherine, Gottweis Juraj, Tomasev Nenad, Liu Yun, Rajkomar Alvin, Barral Joelle, Semturs Christopher, Karthikesalingam Alan, Natarajan Vivek
- Improving The Domain Adaptation Of Retrieval Augmented Generation (RAG) Models For Open Domain Question Answering Siriwardhana Shamane, Weerasekera Rivindu, Wen Elliott, Kaluarachchi Tharindu, Rana Rajib, Nanayakkara Suranga
- CLIP Models Are Few-shot Learners: Empirical Studies On VQA And Visual Entailment Song Haoyu, Dong Li, Zhang Wei-nan, Liu Ting, Wei Furu
- QUILL: Query Intent With Large Language Models Using Retrieval Augmentation And Multi-stage Distillation Srinivasan Krishna, Raman Karthik, Samanta Anupam, Liao Lingrui, Bertelli Luca, Bendersky Mike
- Putting Gpt-3's Creativity To The (alternative Uses) Test Stevenson Claire, Smal Iris, Baas Matthijs, Grasman Raoul, Van Der Maas Han
- Tokenization Consistency Matters For Generative Models On Extractive NLP Tasks Sun Kaiser, Qi Peng, Zhang Yuhao, Liu Lan, Wang William Yang, Huang Zhiheng
- Dynamar: Dynamic Prompt With Mask Token Representation Sun Xiaodi, Rajagopalan Sunny, Nigam Priyanka, Lu Weiyi, Xu Yi, Zeng Belinda, Chilimbi Trishul
- Long-form Video-language Pre-training With Multimodal Temporal Contrastive Learning Sun Yuchong, Xue Hongwei, Song Ruihua, Liu Bei, Yang Huan, Fu Jianlong
- Recitation-augmented Language Models Sun Zhiqing, Wang Xuezhi, Tay Yi, Yang Yiming, Zhou Denny
- Etrica: Event-triggered Context-aware Story Generation Augmented By Cross Attention Tang Chen, Lin Chenghua, Huang Henglin, Guerin Frank, Zhang Zhihao
- Towards Fine-tuning Pre-trained Language Models With Integer Forward And Backward Propagation Tayaranian Mohammadreza, Ghaffari Alireza, Tahaei Marzieh S., Rezagholizadeh Mehdi, Asgharian Masoud, Nia Vahid Partovi
- What Do They Capture? -- A Structural Analysis Of Pre-trained Language Models For Source Code Wan Yao, Zhao Wei, Zhang Hongyu, Sui Yulei, Xu Guandong, Jin Hai
- Exploring The Limits Of Domain-adaptive Training For Detoxifying Large-scale Language Models Wang Boxin, Ping Wei, Xiao Chaowei, Xu Peng, Patwary Mostofa, Shoeybi Mohammad, Li Bo, Anandkumar Anima, Catanzaro Bryan
- Learning To Decompose Visual Features With Latent Textual Prompts Wang Feng, Li Manling, Lin Xudong, Lv Hairong, Schwing Alexander G., Ji Heng
- Knowledge Prompting In Pre-trained Language Model For Natural Language Understanding Wang Jianing, Huang Wenkang, Shi Qiuhui, Wang Hongbin, Qiu Minghui, Li Xiang, Gao Ming
- Rethinking Textual Adversarial Defense For Pre-trained Language Models Wang Jiayi, Bao Rongzhou, Zhang Zhuosheng, Zhao Hai
- Pretraining Without Attention Wang Junxiong, Yan Jing Nathan, Gu Albert, Rush Alexander M.
- Compilable Neural Code Generation With Compiler Feedback Wang Xin, Wang Yasheng, Wan Yao, Mi Fei, Li Yitong, Zhou Pingyi, Liu Jin, Wu Hao, Jiang Xin, Liu Qun
- Code4struct: Code Generation For Few-shot Event Structure Prediction Wang Xingyao, Li Sha, Ji Heng
- S-prompts Learning With Pre-trained Transformers: An Occam's Razor For Domain Incremental Learning Wang Yabin, Huang Zhiwu, Hong Xiaopeng
- Adamix: Mixture-of-adaptations For Parameter-efficient Model Tuning Wang Yaqing, Agarwal Sahaj, Mukherjee Subhabrata, Liu Xiaodong, Gao Jing, Awadallah Ahmed Hassan, Gao Jianfeng
- Multimodal Adaptive Distillation For Leveraging Unimodal Encoders For Vision-language Tasks Wang Zhecan, Codella Noel, Chen Yen-chun, Zhou Luowei, Dai Xiyang, Xiao Bin, Yang Jianwei, You Haoxuan, Chang Kai-wei, Chang Shih-fu, Yuan Lu
- CREATER: Ctr-driven Advertising Text Generation With Controlled Pre-training And Contrastive Fine-tuning Wei Penghui, Yang Xuanhua, Liu Shaoguo, Wang Liang, Zheng Bo
- Program Of Thoughts Prompting: Disentangling Computation From Reasoning For Numerical Reasoning Tasks Wenhu Chen, Xueguang Ma, Xinyi Wang, William W. Cohen
- Adversarial Self-attention For Language Understanding Wu Hongqiu, Ding Ruixue, Zhao Hai, Xie Pengjun, Huang Fei, Zhang Min
- Exploring The Efficacy Of Pre-trained Checkpoints In Text-to-music Generation Task Wu Shangda, Sun Maosong
- An Efficient Memory-augmented Transformer For Knowledge-intensive NLP Tasks Wu Yuxiang, Zhao Yu, Hu Baotian, Minervini Pasquale, Stenetorp Pontus, Riedel Sebastian
- Modeling Context With Linear Attention For Scalable Document-level Translation Wu Zhaofeng, Peng Hao, Pappas Nikolaos, Smith Noah A.
- Adapting Pretrained Text-to-text Models For Long Text Sequences Xiong Wenhan, Gupta Anchit, Toshniwal Shubham, Mehdad Yashar, Yih Wen-tau
- Go-tuning: Improving Zero-shot Learning Abilities Of Smaller Language Models Xu Jingjing, Dong Qingxiu, Liu Hongyi, Li Lei
- Rationale-augmented Ensembles In Language Models Xuezhi Wang, Jason Wei, Dale Schuurmans, Quoc Le, Ed Chi, Denny Zhou
- Self-consistency Improves Chain Of Thought Reasoning In Language Models Xuezhi Wang, Jason Wei, Dale Schuurmans, Quoc Le, Ed Chi, Sharan Narang, Aakanksha Chowdhery, Denny Zhou
- Generative Negative Text Replay For Continual Vision-language Pretraining Yan Shipeng, Hong Lanqing, Xu Hang, Han Jianhua, Tuytelaars Tinne, Li Zhenguo, He Xuming
- Learning To Answer Visual Questions From Web Videos Yang Antoine, Miech Antoine, Sivic Josef, Laptev Ivan, Schmid Cordelia
- Testaug: A Framework For Augmenting Capability-based NLP Tests Yang Guanqun, Haque Mirazul, Song Qiaochu, Yang Wei, Liu Xueqing
- GLUE-X: Evaluating Natural Language Understanding Models From An Out-of-distribution Generalization Perspective Yang Linyi, Zhang Shuibai, Qin Libo, Li Yafu, Wang Yidong, Liu Hanmeng, Wang Jindong, Xie Xing, Zhang Yue
- AD-DROP: Attribution-driven Dropout For Robust Language Model Fine-tuning Yang Tao, Deng Jinghao, Quan Xiaojun, Wang Qifan, Nie Shaoliang
- Retrieval-augmented Multimodal Language Modeling Yasunaga Michihiro, Aghajanyan Armen, Shi Weijia, James Rich, Leskovec Jure, Liang Percy, Lewis Mike, Zettlemoyer Luke, Yih Wen-tau
- Modeling Multi-hop Question Answering As Single Sequence Prediction Yavuz Semih, Hashimoto Kazuma, Zhou Yingbo, Keskar Nitish Shirish, Xiong Caiming
- Guess The Instruction! Flipped Learning Makes Language Models Stronger Zero-shot Learners Ye Seonghyeon, Kim Doyoung, Jang Joel, Shin Joongbo, Seo Minjoon
- Active Example Selection For In-context Learning Yiming Zhang, Shi Feng, Chenhao Tan
- Competition-level Code Generation With Alphacode Yujia Li, David Choi, Junyoung Chung, Nate Kushman, Julian Schrittwieser, Rémi Leblond, Tom Eccles, James Keeling, Felix Gimeno, Agustin Dal Lago, Thomas Hubert, Peter Choy, Cyprien De Masson D'autume, Igor Babuschkin, Xinyun Chen, Po-sen Huang, Johannes Welbl, Sven Gowal, Alexey Cherepanov, James Molloy, Daniel J. Mankowitz, Esme Sutherland Robson, Pushmeet Kohli, Nando De Freitas, Koray Kavukcuoglu, Oriol Vinyals
- Bridging Cross-lingual Gaps During Leveraging The Multilingual Sequence-to-sequence Pretraining For Text Generation And Understanding Zan Changtong, Ding Liang, Shen Li, Cao Yu, Liu Weifeng, Tao Dacheng
- Scala: Accelerating Adaptation Of Pre-trained Transformer-based Language Models Via Efficient Large-batch Adversarial Noise Zhang Minjia, Naresh Niranjan Uma, He Yuxiong
- Learning Video Representations From Large Language Models Zhao Yue, Misra Ishan, Krähenbühl Philipp, Girdhar Rohit
- E2S2: Encoding-enhanced Sequence-to-sequence Pretraining For Language Understanding And Generation Zhong Qihuang, Ding Liang, Liu Juhua, Du Bo, Tao Dacheng
- Training Language Models With Memory Augmentation Zhong Zexuan, Lei Tao, Chen Danqi
- Reflection Of Thought: Inversely Eliciting Numerical Reasoning In Language Models Via Solving Linear Systems Zhou Fan, Dong Haoyu, Liu Qian, Cheng Zhoujun, Han Shi, Zhang Dongmei
- Tacube: Pre-computing Data Cubes For Answering Numerical-reasoning Questions Over Tabular Data Zhou Fan, Hu Mengkang, Dong Haoyu, Cheng Zhoujun, Han Shi, Zhang Dongmei
- Binding Language Models In Symbolic Languages Zhoujun Cheng, Tianbao Xie, Peng Shi, Chengzu Li, Rahul Nadkarni, Yushi Hu, Caiming Xiong, Dragomir Radev, Mari Ostendorf, Luke Zettlemoyer, Noah A. Smith, Tao Yu
- Syntax-informed Question Answering With Heterogeneous Graph Transformer Zhu Fangyi, Tan Lok You, Ng See-kiong, Bressan Stéphane
- Automatic Chain Of Thought Prompting In Large Language Models Zhuosheng Zhang, Aston Zhang, Mu Li, Alex Smola
- LM-CPPF: Paraphrasing-guided Data Augmentation For Contrastive Prompt-based Few-shot Fine-tuning Abaskohi Amirhossein, Rothe Sascha, Yaghoobzadeh Yadollah
- I Was Blind But Now I See: Implementing Vision-enabled Dialogue In Social Robots Abbo Giulio Antonio, Belpaeme Tony
- LMRL Gym: Benchmarks For Multi-turn Reinforcement Learning With Language Models Abdulhai Marwa, White Isadora, Snell Charlie, Sun Charles, Hong Joey, Zhai Yuexiang, Xu Kelvin, Levine Sergey
- Beyond Traditional Teaching: The Potential Of Large Language Models And Chatbots In Graduate Engineering Education Abedi Mahyar, Alshybani Ibrahem, Shahadat Muhammad Rubayat Bin, Murillo Michael S.
- Comparative Analysis Of Chatgpt, GPT-4, And Microsoft Bing Chatbots For GRE Test Abu-haifa Mohammad, Etawi Bara'a, Alkhatatbeh Huthaifa, Ababneh Ayman
- Leveraging Vision-language Models For Improving Domain Generalization In Image Classification Addepalli Sravanti, Asokan Ashish Ramayee, Sharma Lakshay, Babu R. Venkatesh
- RL4F: Generating Natural Language Feedback With Reinforcement Learning For Repairing Model Outputs Afra Feyza Akyürek, Ekin Akyürek, Aman Madaan, Ashwin Kalyan, Peter Clark, Derry Wijaya, Niket Tandon
- Bring Your Own KG: Self-supervised Program Synthesis For Zero-shot KGQA Agarwal Dhruv, Das Rajarshi, Khosla Sopan, Gangadharaiah Rashmi
- Let's Sample Step By Step: Adaptive-consistency For Efficient Reasoning And Coding With Llms Aggarwal Pranjal, Madaan Aman, Yang Yiming, Mausam
- Can Large Language Models Be Good Path Planners? A Benchmark And Investigation On Spatial-temporal Reasoning Aghzal Mohamed, Plaku Erion, Yao Ziyu
- SARATHI: Efficient LLM Inference By Piggybacking Decodes With Chunked Prefills Agrawal Amey, Panwar Ashish, Mohan Jayashree, Kwatra Nipun, Gulavani Bhargav S., Ramjee Ramachandran
- Story Visualization By Online Text Augmentation With Context Memory Ahn Daechul, Kim Daneul, Song Gwangmo, Kim Seung Hwan, Lee Honglak, Kang Dongyeop, Choi Jonghyun
- The Emergence Of Essential Sparsity In Large Pre-trained Models: The Weights That Matter Ajay Jaiswal, Shiwei Liu, Tianlong Chen, Zhangyang Wang
- Performance Trade-offs Of Watermarking Large Language Models Ajith Anirudh, Singh Sameer, Pruthi Danish
- Characterizing Attribution And Fluency Tradeoffs For Retrieval-augmented Large Language Models Aksitov Renat, Chang Chung-ching, Reitter David, Shakeri Siamak, Sung Yunhsuan
- Rest Meets React: Self-improvement For Multi-step Reasoning LLM Agent Aksitov Renat, Miryoosefi Sobhan, Li Zonglin, Li Daliang, Babayan Sheila, Kopparapu Kavya, Fisher Zachary, Guo Ruiqi, Prakash Sushant, Srinivasan Pranesh, Zaheer Manzil, Yu Felix, Kumar Sanjiv
- Mistral 7B Albert Q. Jiang, Alexandre Sablayrolles, Arthur Mensch, Chris Bamford, Devendra Singh Chaplot, Diego De Las Casas, Florian Bressand, Gianna Lengyel, Guillaume Lample, Lucile Saulnier, Lélio Renard Lavaud, Marie-anne Lachaux, Pierre Stock, Teven Le Scao, Thibaut Lavril, Thomas Wang, Timothée Lacroix, William El Sayed
- Physics Of Language Models: Part 3.2, Knowledge Manipulation Allen-zhu Zeyuan, Li Yuanzhi
- Prompt-engineering And Transformer-based Question Generation And Evaluation Amyeen Rubaba
- On Generative Agents In Recommendation An Zhang, Yuxin Chen, Leheng Sheng, Xiang Wang, Tat-seng Chua
- Context Tuning For Retrieval Augmented Generation Anantha Raviteja, Bethi Tharun, Vodianik Danil, Chappidi Srinivas
- Expel: LLM Agents Are Experiential Learners Andrew Zhao, Daniel Huang, Quentin Xu, Matthieu Lin, Yong-jin Liu, Gao Huang
- Augmenting Llms With Knowledge: A Survey On Hallucination Prevention Andriopoulos Konstantinos, Pouwelse Johan
- Randomized Positional Encodings Boost Length Generalization Of Transformers Anian Ruoss, Grégoire Delétang, Tim Genewein, Jordi Grau-moya, Róbert Csordás, Mehdi Bennani, Shane Legg, Joel Veness
- Distilling Efficient Language-specific Models For Cross-lingual Transfer Ansell Alan, Ponti Edoardo Maria, Korhonen Anna, Vulić Ivan
- Slimfit: Memory-efficient Fine-tuning Of Transformer-based Models Using Training Dynamics Ardakani Arash, Haan Altan, Tan Shangyin, Popovici Doru Thom, Cheung Alvin, Iancu Costin, Sen Koushik
- Llm2loss: Leveraging Language Models For Explainable Model Diagnostics Ardeshir Shervin
- Learning And Leveraging Verifiers To Improve Planning Capabilities Of Pre-trained Language Models Arora Daman, Kambhampati Subbarao
- Self-rag: Learning To Retrieve, Generate, And Critique Through Self-reflection Asai Akari, Wu Zeqiu, Wang Yizhong, Sil Avirup, Hajishirzi Hannaneh
- Mitigating Open-vocabulary Caption Hallucinations Assaf Ben-kish, Moran Yanuka, Morris Alper, Raja Giryes, Hadar Averbuch-elor
- Openflamingo: An Open-source Framework For Training Large Autoregressive Vision-language Models Awadalla Anas, Gao Irena, Gardner Josh, Hessel Jack, Hanafy Yusuf, Zhu Wanrong, Marathe Kalyani, Bitton Yonatan, Gadre Samir, Sagawa Shiori, Jitsev Jenia, Kornblith Simon, Koh Pang Wei, Ilharco Gabriel, Wortsman Mitchell, Schmidt Ludwig
- RRAML: Reinforced Retrieval Augmented Machine Learning Bacciu Andrea, Cuconasu Florin, Siciliano Federico, Silvestri Fabrizio, Tonellotto Nicola, Trappolini Giovanni
- Evaluating Chatgpt As A Question Answering System: A Comprehensive Analysis And Comparison With Existing Models Bahak Hossein, Taheri Farzaneh, Zojaji Zahra, Kazemi Arefeh
- Llms-augmented Contextual Bandit Baheri Ali, Alm Cecilia O.
- Longbench: A Bilingual, Multitask Benchmark For Long Context Understanding Bai Yushi, Lv Xin, Zhang Jiajie, Lyu Hongchang, Tang Jiankai, Huang Zhidian, Du Zhengxiao, Liu Xiao, Zeng Aohan, Hou Lei, Dong Yuxiao, Tang Jie, Li Juanzi
- Fine-tuning Large Enterprise Language Models Via Ontological Reasoning Baldazzi Teodoro, Bellomarini Luigi, Ceri Stefano, Colombo Andrea, Gentili Andrea, Sallinger Emanuel
- Investigating Pre-trained Language Models On Cross-domain Datasets, A Step Closer To General AI Ballout Mohamad, Krumnack Ulf, Heidemann Gunther, Kühnberger Kai-uwe
- The Belebele Benchmark: A Parallel Reading Comprehension Dataset In 122 Language Variants Bandarkar Lucas, Liang Davis, Muller Benjamin, Artetxe Mikel, Shukla Satya Narayan, Husa Donald, Goyal Naman, Krishnan Abhinandan, Zettlemoyer Luke, Khabsa Madian
- Peering Through Preferences: Unraveling Feedback Acquisition For Aligning Large Language Models Bansal Hritik, Dang John, Grover Aditya
- Tallrec: An Effective And Efficient Tuning Framework To Align Large Language Model With Recommendation Bao Keqin, Zhang Jizhi, Zhang Yang, Wang Wenjie, Feng Fuli, He Xiangnan
- Disc-medllm: Bridging General Large Language Models And Real-world Medical Consultation Bao Zhijie, Chen Wei, Xiao Shengze, Ren Kuang, Wu Jiaao, Zhong Cheng, Peng Jiajie, Huang Xuanjing, Wei Zhongyu
- Self-consistency Of Large Language Models Under Ambiguity Bartsch Henning, Jorgensen Ole, Rosati Domenic, Hoelscher-obermaier Jason, Pfau Jacob
- Llamantino: Llama 2 Models For Effective Text Generation In Italian Language Basile Pierpaolo, Musacchio Elio, Polignano Marco, Siciliani Lucia, Fiameni Giuseppe, Semeraro Giovanni
- Text Summarization Using Large Language Models: A Comparative Study Of Mpt-7b-instruct, Falcon-7b-instruct, And Openai Chat-gpt Models Basyal Lochan, Sanghvi Mihir
- Efficient Pre-training For Localized Instruction Generation Of Videos Batra Anil, Moltisanti Davide, Sevilla-lara Laura, Rohrbach Marcus, Keller Frank
- The Poison Of Alignment Bekbayev Aibek, Chun Sungbae, Dulat Yerzat, Yamazaki James
- Labrador: Exploring The Limits Of Masked Language Modeling For Laboratory Data Bellamy David R., Kumar Bhawesh, Wang Cindy, Beam Andrew
- When Automated Assessment Meets Automated Content Generation: Examining Text Quality In The Era Of Gpts Bevilacqua Marialena, Oketch Kezia, Qin Ruiyang, Stamey Will, Zhang Xinyuan, Gan Yi, Yang Kai, Abbasi Ahmed
- Red-teaming Large Language Models Using Chain Of Utterances For Safety-alignment Bhardwaj Rishabh, Poria Soujanya
- Automating Question Generation From Educational Text Bhowmick Ayan Kumar, Jagmohan Ashish, Vempaty Aditya, Dey Prasenjit, Hall Leigh, Hartman Jeremy, Kokku Ravi, Maheshwari Hema
- Chatgpt Is A Knowledgeable But Inexperienced Solver: An Investigation Of Commonsense Problem In Large Language Models Bian Ning, Han Xianpei, Sun Le, Lin Hongyu, Lu Yaojie, He Ben, Jiang Shanshan, Dong Bin
- VIGC: Visual Instruction Generation And Correction Bin Wang, Fan Wu, Xiao Han, Jiahui Peng, Huaping Zhong, Pan Zhang, Xiaoyi Dong, Weijia Li, Wei Li, Jiaqi Wang, Conghui He
- Distractor Generation For Multiple-choice Questions With Predictive Prompting And Large Language Models Bitew Semere Kiros, Deleu Johannes, Develder Chris, Demeester Thomas
- Chip-chat: Challenges And Opportunities In Conversational Hardware Design Blocklove Jason, Garg Siddharth, Karri Ramesh, Pearce Hammond
- RWKV: Reinventing Rnns For The Transformer Era Bo Peng, Eric Alcaide, Quentin Anthony, Alon Albalak, Samuel Arcadinho, Stella Biderman, Huanqi Cao, Xin Cheng, Michael Chung, Matteo Grella, Kranthi Kiran Gv, Xuzheng He, Haowen Hou, Jiaju Lin, Przemyslaw Kazienko, Jan Kocon, Jiaming Kong, Bartlomiej Koptyra, Hayden Lau, Krishna Sri Ipsit Mantri, Ferdinand Mom, Atsushi Saito, Guangyu Song, Xiangru Tang, Bolun Wang, Johan S. Wind, Stanislaw Wozniak, Ruichong Zhang, Zhenyuan Zhang, Qihang Zhao, Peng Zhou, Qinghua Zhou, Jian Zhu, Rui-jie Zhu
- Halle-control: Controlling Object Hallucination In Large Multimodal Models Bohan Zhai, Shijia Yang, Chenfeng Xu, Sheng Shen, Kurt Keutzer, Chunyuan Li, Manling Li
- GPT-4 Surpassing Human Performance In Linguistic Pragmatics Bojic Ljubisa, Kovacevic Predrag, Cabarkapa Milan
- Ask Language Model To Clean Your Noisy Translation Data Bolding Quinten, Liao Baohao, Denis Brandon James, Luo Jun, Monz Christof
- Pixlore: A Dataset-driven Approach To Rich Image Captioning Bonilla Diego
- Stylip: Multi-scale Style-conditioned Prompt Learning For Clip-based Domain Generalization Bose Shirsha, Jha Ankit, Fini Enrico, Singha Mainak, Ricci Elisa, Banerjee Biplab
- Inpars-light: Cost-effective Unsupervised Training Of Efficient Rankers Boytsov Leonid, Patel Preksha, Sourabh Vivek, Nisar Riddhi, Kundu Sayani, Ramanathan Ramya, Nyberg Eric
- Plasma: Making Small Language Models Better Procedural Knowledge Models For (counterfactual) Planning Brahman Faeze, Bhagavatula Chandra, Pyatkin Valentina, Hwang Jena D., Li Xiang Lorraine, Arai Hirona J., Sanyal Soumya, Sakaguchi Keisuke, Ren Xiang, Choi Yejin
- Extractgpt: Exploring The Potential Of Large Language Models For Product Attribute Value Extraction Brinkmann Alexander, Shraga Roee, Bizer Christian
- Codetf: One-stop Transformer Library For State-of-the-art Code LLM Bui Nghi D. Q., Le Hung, Wang Yue, Li Junnan, Gotmare Akhilesh Deepak, Hoi Steven C. H.
- Just Tell Me: Prompt Engineering In Business Process Management Busch Kiran, Rochlitzer Alexander, Sola Diana, Leopold Henrik
- Peek Across: Improving Multi-document Modeling Via Cross-document Question-answering Caciularu Avi, Peters Matthew E., Goldberger Jacob, Dagan Ido, Cohan Arman
- Leveraging Large Language Models For Scalable Vector Graphics-driven Image Understanding Cai Mu, Huang Zeyi, Li Yuheng, Ojha Utkarsh, Wang Haohan, Lee Yong Jae
- Large Language Models As Tool Makers Cai Tianle, Wang Xuezhi, Ma Tengyu, Chen Xinyun, Zhou Denny
- A Systematic Study Of Knowledge Distillation For Natural Language Generation With Pseudo-target Training Calderon Nitay, Mukherjee Subhabrata, Reichart Roi, Kantor Amir
- Can Transformers Learn Sequential Function Classes In Context? Campbell Ryan, Guo Emma, Hu Evan, Vir Reya, Hsiao Ethan
- Oberta: Improving Sparse Transfer Learning Via Improved Initialization, Distillation, And Pruning Regimes Campos Daniel, Marques Alexandre, Kurtz Mark, Zhai Chengxiang
- Instruction Mining: Instruction Data Selection For Tuning Large Language Models Cao Yihan, Kang Yanbin, Wang Chi, Sun Lichao
- Assessing Large Language Models' Ability To Predict How Humans Balance Self-interest And The Interest Of Others Capraro Valerio, Di Paolo Roberto, Pizziol Veronica
- News Verifiers Showdown: A Comparative Performance Evaluation Of Chatgpt 3.5, Chatgpt 4.0, Bing AI, And Bard In News Fact-checking Caramancion Kevin Matthe
- Promoting Generalized Cross-lingual Question Answering In Few-resource Scenarios Via Self-knowledge Distillation Carrino Casimiro Pio, Escolano Carlos, Fonollosa José A. R.
- Grounding Large Language Models In Interactive Environments With Online Reinforcement Learning Carta Thomas, Romac Clément, Wolf Thomas, Lamprier Sylvain, Sigaud Olivier, Oudeyer Pierre-yves
- Knowledge Transfer From High-resource To Low-resource Programming Languages For Code Llms Cassano Federico, Gouwar John, Lucchetti Francesca, Schlesinger Claire, Freeman Anders, Anderson Carolyn Jane, Feldman Molly Q, Greenberg Michael, Jangda Abhinav, Guha Arjun
- Dialogue Chain-of-thought Distillation For Commonsense-aware Conversational Agents Chae Hyungjoo, Song Yongho, Ong Kai Tzu-iunn, Kwon Taeyoon, Kim Minjin, Yu Youngjae, Lee Dongha, Kang Dongyeop, Yeo Jinyoung
- Exploring Answer Information Methods For Question Generation With Transformers Chafekar Talha, Hussain Aafiya, Sharma Grishma, Sharma Deepak
- Chatgpt May Pass The Bar Exam Soon, But Has A Long Way To Go For The Lexglue Benchmark Chalkidis Ilias
- Selective Demonstrations For Cross-domain Text-to-sql Chang Shuaichen, Fosler-lussier Eric
- Language Model Behavior: A Comprehensive Survey Chang Tyler A., Bergen Benjamin K.
- Salesbot 2.0: A Human-like Intent-guided Chit-chat Dataset Chang Wen-yu, Chen Yun-nung
- Cabbage Sweeter Than Cake? Analysing The Potential Of Large Language Models For Learning Conceptual Spaces Chatterjee Usashi, Gajbhiye Amit, Schockaert Steven
- Dual-space Hierarchical Learning For Goal-guided Conversational Recommendation Chen Can, Liu Hao, Liu Zeming, Liu Xue, Dou Dejing
- LION: Empowering Multimodal Large Language Model With Dual-level Visual Knowledge Chen Gongwei, Shen Leyang, Shao Rui, Deng Xiang, Nie Liqiang
- Tem-adapter: Adapting Image-text Pretraining For Video Question Answer Chen Guangyi, Liu Xiao, Wang Guangrun, Zhang Kun, Torr Philip H. S., Zhang Xiao-ping, Tang Yansong
- Videollm: Modeling Video Sequence With Large Language Models Chen Guo, Zheng Yin-dong, Wang Jiahao, Xu Jilan, Huang Yifei, Pan Junting, Wang Yi, Wang Yali, Qiao Yu, Lu Tong, Wang Limin
- Benchmarking Large Language Models In Retrieval-augmented Generation Chen Jiawei, Lin Hongyu, Han Xianpei, Sun Le
- Gaining Wisdom From Setbacks: Aligning Large Language Models Via Mistake Analysis Chen Kai, Wang Chunwei, Yang Kuo, Han Jianhua, Hong Lanqing, Mi Fei, Xu Hang, Liu Zhengying, Huang Wenyong, Li Zhenguo, Yeung Dit-yan, Shang Lifeng, Jiang Xin, Liu Qun
- Large Language Models Are Visual Reasoning Coordinators Chen Liangyu, Li Bo, Shen Sheng, Yang Jingkang, Li Chunyuan, Keutzer Kurt, Darrell Trevor, Liu Ziwei
- MMICT: Boosting Multi-modal Fine-tuning With In-context Examples Chen Tao, Zhang Enwei, Gao Yuting, Li Ke, Sun Xing, Zhang Yan, Li Hui, Ji Rongrong
- Self-icl: Zero-shot In-context Learning With Self-generated Demonstrations Chen Wei-lin, Wu Cheng-kuang, Chen Yun-nung, Chen Hsin-hsi
- Theoremqa: A Theorem-driven Question Answering Dataset Chen Wenhu, Yin Ming, Ku Max, Lu Pan, Wan Yixin, Ma Xueguang, Xu Jianyu, Wang Xinyi, Xia Tony
- Reason Out Your Layout: Evoking The Layout Master From Large Language Models For Text-to-image Synthesis Chen Xiaohui, Liu Yongfei, Yang Yingxiang, Yuan Jianbo, You Quanzeng, Liu Li-ping, Yang Hongxia
- Universal Self-consistency For Large Language Model Generation Chen Xinyun, Aksitov Renat, Alon Uri, Ren Jie, Xiao Kefan, Yin Pengcheng, Prakash Sushant, Sutton Charles, Wang Xuezhi, Zhou Denny
- Teaching Large Language Models To Self-debug Chen Xinyun, Lin Maxwell, Schärli Nathanael, Zhou Denny
- EE-LLM: Large-scale Training And Inference Of Early-exit Large Language Models With 3D Parallelism Chen Yanxi, Pan Xuchen, Li Yaliang, Ding Bolin, Zhou Jingren
- Prompt To GPT-3: Step-by-step Thinking Instructions For Humor Generation Chen Yuetian, Shi Bowen, Si Mei
- Fortify The Shortest Stave In Attention: Enhancing Context Awareness Of Large Language Models For Effective Tool Use Chen Yuhan, Lv Ang, Lin Ting-en, Chen Changyu, Wu Yuchuan, Huang Fei, Li Yongbin, Yan Rui
- Chatcot: Tool-augmented Chain-of-thought Reasoning On Chat-based Large Language Models Chen Zhipeng, Zhou Kun, Zhang Beichen, Gong Zheng, Zhao Wayne Xin, Wen Ji-rong
- Lmexplainer: Grounding Knowledge And Explaining Language Models Chen Zichen, Chen Jianda, Chen Yuanyuan, Yu Han, Singh Ambuj K, Sra Misha
- Xplainllm: A QA Explanation Dataset For Understanding LLM Decision-making Chen Zichen, Chen Jianda, Gaidhani Mitali, Singh Ambuj, Sra Misha
- Black-box Prompt Optimization: Aligning Large Language Models Without Model Training Cheng Jiale, Liu Xiao, Zheng Kehan, Ke Pei, Wang Hongning, Dong Yuxiao, Tang Jie, Huang Minlie
- CREATOR: Tool Creation For Disentangling Abstract And Concrete Reasoning Of Large Language Models Cheng Qian, Chi Han, Yi R. Fung, Yujia Qin, Zhiyuan Liu, Heng Ji
- Lift Yourself Up: Retrieval-augmented Text Generation With Self Memory Cheng Xin, Luo Di, Chen Xiuying, Liu Lemao, Zhao Dongyan, Yan Rui
- SCALE: Synergized Collaboration Of Asymmetric Language Translation Engines Cheng Xin, Wang Xun, Ge Tao, Chen Si-qing, Wei Furu, Zhao Dongyan, Yan Rui
- INSTRUCTEVAL: Towards Holistic Evaluation Of Instruction-tuned Large Language Models Chia Yew Ken, Hong Pengfei, Bing Lidong, Poria Soujanya
- Symbolic Planning And Code Generation For Grounded Dialogue Chiu Justin T., Zhao Wenting, Chen Derek, Vaduguru Saujas, Rush Alexander M., Fried Daniel
- Language And Task Arithmetic With Parameter-efficient Layers For Zero-shot Summarization Chronopoulou Alexandra, Pfeiffer Jonas, Maynez Joshua, Wang Xinyi, Ruder Sebastian, Agrawal Priyanka
- Instructtods: Large Language Models For End-to-end Task-oriented Dialogue Systems Chung Willy, Cahyawijaya Samuel, Wilie Bryan, Lovenia Holy, Fung Pascale
- Remember What You Did So You Know What To Do Next Ciosici Manuel R., Hedges Alex, Kankanampati Yash, Martin Justin, Freedman Marjorie, Weischedel Ralph
- Galactic Chitchat: Using Large Language Models To Converse With Astronomy Literature Ciucă Ioana, Ting Yuan-sen
- Strong And Efficient Baselines For Open Domain Conversational Question Answering Coman Andrei C., Barlacchi Gianni, De Gispert Adrià
- Seamlessm4t: Massively Multilingual & Multimodal Machine Translation Communication Seamless, Barrault Loïc, Chung Yu-an, Meglioli Mariano Coria, Dale David, Dong Ning, Duquenne Paul-ambroise, Elsahar Hady, Gong Hongyu, Heffernan Kevin, Hoffman John, Klaiber Christopher, Li Pengwei, Licht Daniel, Maillard Jean, Rakotoarison Alice, Sadagopan Kaushik Ram, Wenzek Guillaume, Ye Ethan, Akula Bapi, Chen Peng-jen, Hachem Naji El, Ellis Brian, Gonzalez Gabriel Mejia, Haaheim Justin, Hansanti Prangthip, Howes Russ, Huang Bernie, Hwang Min-jae, Inaguma Hirofumi, Jain Somya, Kalbassi Elahe, Kallet Amanda, Kulikov Ilia, Lam Janice, Li Daniel, Ma Xutai, Mavlyutov Ruslan, Peloquin Benjamin, Ramadan Mohamed, Ramakrishnan Abinesh, Sun Anna, Tran Kevin, Tran Tuan, Tufanov Igor, Vogeti Vish, Wood Carleigh, Yang Yilin, Yu Bokai, Andrews Pierre, Balioglu Can, Costa-jussà Marta R., Celebi Onur, Elbayad Maha, Gao Cynthia, Guzmán Francisco, Kao Justine, Lee Ann, Mourachko Alexandre, Pino Juan, Popuri Sravya, Ropers Christophe, Saleem Safiyyah, Schwenk Holger, Tomasello Paden, Wang Changhan, Wang Jeff, Wang Skyler
- Seamless: Multilingual Expressive And Streaming Speech Translation Communication Seamless, Barrault Loïc, Chung Yu-an, Meglioli Mariano Coria, Dale David, Dong Ning, Duppenthaler Mark, Duquenne Paul-ambroise, Ellis Brian, Elsahar Hady, Haaheim Justin, Hoffman John, Hwang Min-jae, Inaguma Hirofumi, Klaiber Christopher, Kulikov Ilia, Li Pengwei, Licht Daniel, Maillard Jean, Mavlyutov Ruslan, Rakotoarison Alice, Sadagopan Kaushik Ram, Ramakrishnan Abinesh, Tran Tuan, Wenzek Guillaume, Yang Yilin, Ye Ethan, Evtimov Ivan, Fernandez Pierre, Gao Cynthia, Hansanti Prangthip, Kalbassi Elahe, Kallet Amanda, Kozhevnikov Artyom, Gonzalez Gabriel Mejia, Roman Robin San, Touret Christophe, Wong Corinne, Wood Carleigh, Yu Bokai, Andrews Pierre, Balioglu Can, Chen Peng-jen, Costa-jussà Marta R., Elbayad Maha, Gong Hongyu, Guzmán Francisco, Heffernan Kevin, Jain Somya, Kao Justine, Lee Ann, Ma Xutai, Mourachko Alex, Peloquin Benjamin, Pino Juan, Popuri Sravya, Ropers Christophe, Saleem Safiyyah, Schwenk Holger, Sun Anna, Tomasello Paden, Wang Changhan, Wang Jeff, Wang Skyler, Williamson Mary
- Receive, Reason, And React: Drive As You Say With Large Language Models In Autonomous Vehicles Cui Can, Ma Yunsheng, Cao Xu, Ye Wenqian, Wang Ziran
- Ada-instruct: Adapting Instruction Generators For Complex Reasoning Cui Wanyun, Wang Qianle
- Auggpt: Leveraging Chatgpt For Text Data Augmentation Dai Haixing, Liu Zhengliang, Liao Wenxiong, Huang Xiaoke, Cao Yihan, Wu Zihao, Zhao Lin, Xu Shaochen, Liu Wei, Liu Ninghao, Li Sheng, Zhu Dajiang, Cai Hongmin, Sun Lichao, Li Quanzheng, Shen Dinggang, Liu Tianming, Li Xiang
- Llm-in-the-loop: Leveraging Large Language Model For Thematic Analysis Dai Shih-chieh, Xiong Aiping, Ku Lun-wei
- Visual Chain Of Thought: Bridging Logical Gaps With Multimodal Infillings Daniel Rose, Vaishnavi Himakunthala, Andy Ouyang, Ryan He, Alex Mei, Yujie Lu, Michael Saxon, Chinmay Sonar, Diba Mirza, William Yang Wang
- Can Chatgpt Pass The Vietnamese National High School Graduation Examination? Dao Xuan-quy, Le Ngoc-bich, Phan Xuan-dung, Ngo Bac-bien
- Complex QA And Language Models Hybrid Architectures, Survey Daull Xavier, Bellot Patrice, Bruno Emmanuel, Martin Vincent, Murisasco Elisabeth
- Pre-computed Memory Or On-the-fly Encoding? A Hybrid Approach To Retrieval Augmentation Makes The Most Of Your Compute De Jong Michiel, Zemlyanskiy Yury, Fitzgerald Nicholas, Ainslie Joshua, Sanghai Sumit, Sha Fei, Cohen William
- LLMR: Real-time Prompting Of Interactive Worlds Using Large Language Models De La Torre Fernanda, Fang Cathy Mengying, Huang Han, Banburski-fahey Andrzej, Fernandez Judith Amores, Lanier Jaron
- Masterkey: Automated Jailbreak Across Multiple Large Language Model Chatbots Deng Gelei, Liu Yi, Li Yuekang, Wang Kailong, Zhang Ying, Li Zefeng, Wang Haoyu, Zhang Tianwei, Liu Yang
- Mixture Of Soft Prompts For Controllable Data Generation Derek Chen, Celine Lee, Yunan Lu, Domenic Rosati, Zhou Yu
- Minigpt-4: Enhancing Vision-language Understanding With Advanced Large Language Models Deyao Zhu, Jun Chen, Xiaoqian Shen, Xiang Li, Mohamed Elhoseiny
- Enhancing Chat Language Models By Scaling High-quality Instructional Conversations Ding Ning, Chen Yulin, Xu Bokai, Qin Yujia, Zheng Zhi, Hu Shengding, Liu Zhiyuan, Sun Maosong, Zhou Bowen
- A Wolf In Sheep's Clothing: Generalized Nested Jailbreak Prompts Can Fool Large Language Models Easily Ding Peng, Kuang Jun, Ma Dan, Cao Xuezhi, Xian Yunsen, Chen Jiajun, Huang Shujian
- Everything Of Thoughts: Defying The Law Of Penrose Triangle For Thought Generation Ding Ruomeng, Zhang Chaoyun, Wang Lu, Xu Yong, Ma Minghua, Zhang Wei, Qin Si, Rajmohan Saravan, Lin Qingwei, Zhang Dongmei
- Mapping The Design Space Of Interactions In Human-ai Text Co-creation Tasks Ding Zijian, Chan Joel
- Harnessing The Power Of Llms: Evaluating Human-ai Text Co-creation Through The Lens Of News Headline Generation Ding Zijian, Smith-renner Alison, Zhang Wenjuan, Tetreault Joel R., Jaimes Alejandro
- Efficient Finetuning Large Language Models For Vietnamese Chatbot Doan Vu-thuan, Truong Quoc-truong, Nguyen Duc-vu, Nguyen Vinh-tiep, Luu Thuy-ngan Nguyen
- Establishing Performance Baselines In Fine-tuning, Retrieval-augmented Generation And Soft-prompting For Non-specialist LLM Users Dodgson Jennifer, Nanzheng Lin, Peh Julian, Pattirane Akira Rafhael Janson, Alhajir Alfath Daryl, Dinarto Eko Ridho, Lim Joseph, Ahmad Syed Danyal
- How To Build An AI Tutor That Can Adapt To Any Course And Provide Accurate Answers Using Large Language Model And Retrieval-augmented Generation Dong Chenxi
- Large Language Model For Science: A Study On P Vs. NP Dong Qingxiu, Dong Li, Xu Ke, Zhou Guangyan, Hao Yaru, Sui Zhifang, Wei Furu
- The Philosopher's Stone: Trojaning Plugins Of Large Language Models Dong Tian, Xue Minhui, Chen Guoxing, Holland Rayne, Meng Yan, Li Shaofeng, Liu Zhen, Zhu Haojin
- PACE: Improving Prompt With Actor-critic Editing For Large Language Model Dong Yihong, Luo Kangcheng, Jiang Xue, Jin Zhi, Li Ge
- Loramoe: Alleviate World Knowledge Forgetting In Large Language Models Via Moe-style Plugin Dou Shihan, Zhou Enyu, Liu Yan, Gao Songyang, Zhao Jun, Shen Wei, Zhou Yuhao, Xi Zhiheng, Wang Xiao, Fan Xiaoran, Pu Shiliang, Zhu Jiang, Zheng Rui, Gui Tao, Zhang Qi, Huang Xuanjing
- A Comparative Study Of Ai-generated (GPT-4) And Human-crafted Mcqs In Programming Education Doughty Jacob, Wan Zipiao, Bompelli Anishka, Qayum Jubahed, Wang Taozhi, Zhang Juran, Zheng Yujia, Doyle Aidan, Sridhar Pragnya, Agarwal Arav, Bogart Christopher, Keylor Eric, Kultur Can, Savelka Jaromir, Sakr Majd
- Blending Reward Functions Via Few Expert Demonstrations For Faithful And Accurate Knowledge-grounded Dialogue Generation Du Wanyu, Ji Yangfeng
- Guiding Pretraining In Reinforcement Learning With Large Language Models Du Yuqing, Watkins Olivia, Wang Zihan, Colas Cédric, Darrell Trevor, Abbeel Pieter, Gupta Abhishek, Andreas Jacob
- Lauragpt: Listen, Attend, Understand, And Regenerate Audio With GPT Du Zhihao, Wang Jiaming, Chen Qian, Chu Yunfei, Gao Zhifu, Li Zerui, Hu Kai, Zhou Xiaohuan, Xu Jin, Ma Ziyang, Wang Wen, Zheng Siqi, Zhou Chang, Yan Zhijie, Zhang Shiliang
- Bridging The Language Gap: Knowledge Injected Multilingual Question Answering Duan Zhichao, Li Xiuxing, Zhang Zhengyan, Li Zhenyu, Liu Ning, Wang Jianyong
- Tinystories: How Small Can Language Models Be And Still Speak Coherent English? Eldan Ronen, Li Yuanzhi
- Who's Harry Potter? Approximate Unlearning In Llms Eldan Ronen, Russinovich Mark
- RAGAS: Automated Evaluation Of Retrieval Augmented Generation Es Shahul, James Jithin, Espinosa-anke Luis, Schockaert Steven
- ROBBIE: Robust Bias Evaluation Of Large Generative Language Models Esiobu David, Tan Xiaoqing, Hosseini Saghar, Ung Megan, Zhang Yuchen, Fernandes Jude, Dwivedi-yu Jane, Presani Eleonora, Williams Adina, Smith Eric Michael
- Jacotext: A Pretrained Model For Java Code-text Generation Espejel Jessica López, Alassan Mahaman Sanoussi Yahaya, Dahhane Walid, Ettifouri El Hassane
- NAP At Semeval-2023 Task 3: Is Less Really More? (back-)translation As Data Augmentation Strategies For Detecting Persuasion Techniques Falk Neele, Eichel Annerose, Piccirilli Prisca
- Using GPT-4 To Augment Unbalanced Data For Automatic Scoring Fang Luyang, Lee Gyeong-geon, Zhai Xiaoming
- An Empirical Study Of Multitask Learning To Improve Open Domain Dialogue Systems Farahani Mehrdad, Johansson Richard
- Revisiting Instruction Fine-tuned Model Evaluation To Guide Industrial Applications Faysse Manuel, Viaud Gautier, Hudelot Céline, Colombo Pierre
- Transferable Decoding With Visual Entities For Zero-shot Image Captioning Fei Junjie, Wang Teng, Zhang Jinrui, He Zhenyu, Wang Chengjie, Zheng Feng
- Sequence-to-sequence Pre-training With Unified Modality Masking For Visual Document Understanding Feng Shuwei, Zhan Tianyang, Jie Zhanming, Luong Trung Quoc, Jin Xiaoran
- Chessgpt: Bridging Policy Learning And Language Modeling Feng Xidong, Luo Yicheng, Wang Ziyan, Tang Hongrui, Yang Mengyue, Shao Kun, Mguni David, Du Yali, Wang Jun
- Alphazero-like Tree-search Can Guide Large Language Model Decoding And Training Feng Xidong, Wan Ziyu, Wen Muning, Mcaleer Stephen Marcus, Wen Ying, Zhang Weinan, Wang Jun
- Llama Rider: Spurring Large Language Models To Explore The Open World Feng Yicheng, Wang Yuxuan, Liu Jiazheng, Zheng Sipeng, Lu Zongqing
- The Devil Is In The Errors: Leveraging Large Language Models For Fine-grained Machine Translation Evaluation Fernandes Patrick, Deutsch Daniel, Finkelstein Mara, Riley Parker, Martins André F. T., Neubig Graham, Garg Ankush, Clark Jonathan H., Freitag Markus, Firat Orhan
- Explaining How Transformers Use Context To Build Predictions Ferrando Javier, Gállego Gerard I., Tsiamas Ioannis, Costa-jussà Marta R.
- Should Chatgpt Be Biased? Challenges And Risks Of Bias In Large Language Models Ferrara Emilio
- Context-aware Meta-learning Fifty Christopher, Duan Dennis, Junkins Ronald G., Amid Ehsan, Leskovec Jure, Re Christopher, Thrun Sebastian
- Leveraging Large Language Models For Automated Dialogue Analysis Finch Sarah E., Paek Ellie S., Choi Jinho D.
- MBR And QE Finetuning: Training-time Distillation Of The Best And Most Expensive Decoding Methods Finkelstein Mara, Naskar Subhajit, Mirzazadeh Mehdi, Shah Apurva, Freitag Markus
- Can Large Language Models Follow Concept Annotation Guidelines? A Case Study On Scientific And Financial Domains Fonseca Marcio, Cohen Shay B.
- Breaking The Language Barrier: Improving Cross-lingual Reasoning With Structured Self-attention Foroutan Negar, Banaei Mohammadreza, Aberer Karl, Bosselut Antoine
- Inca: Rethinking In-car Conversational System Assessment Leveraging Large Language Models Friedl Ken E., Khan Abbas Goher, Sahoo Soumya Ranjan, Rony Md Rashad Al Hasan, Germies Jana, Süß Christian
- Leveraging Large Language Models In Conversational Recommender Systems Friedman Luke, Ahuja Sameer, Allen David, Tan Zhenning, Sidahmed Hakim, Long Changbo, Xie Jun, Schubiner Gabriel, Patel Ajay, Lara Harsh, Chu Brian, Chen Zexi, Tiwari Manoj
- Generate Then Select: Open-ended Visual Question Answering Guided By World Knowledge Fu Xingyu, Zhang Sheng, Kwon Gukyeong, Perera Pramuditha, Zhu Henghui, Zhang Yuhao, Li Alexander Hanbo, Wang William Yang, Wang Zhiguo, Castelli Vittorio, Ng Patrick, Roth Dan, Xiang Bing
- Gpt4aigchip: Towards Next-generation AI Accelerator Design Automation Via Large Language Models Fu Yonggan, Zhang Yongan, Yu Zhongzhi, Li Sixu, Ye Zhifan, Li Chaojian, Wan Cheng, Lin Yingyan
- Exposing Limitations Of Language Model Agents In Sequential-task Compositions On The Web Furuta Hiroki, Matsuo Yutaka, Faust Aleksandra, Gur Izzeddin
- Ziya2: Data-centric Learning Is All Llms Need Gan Ruyi, Wu Ziwei, Sun Renliang, Lu Junyu, Wu Xiaojun, Zhang Dixiang, Pan Kunhao, He Junqing, Tian Yuanhe, Yang Ping, Yang Qi, Wang Hao, Zhang Jiaxing, Song Yan
- Natural Language Commanding Via Program Synthesis Gandhi Apurva, Nguyen Thong Q., Jiao Huitian, Steen Robert, Bhatawdekar Ameya
- LLM Blueprint: Enabling Text-to-image Generation With Complex And Detailed Prompts Gani Hanan, Bhat Shariq Farooq, Naseer Muzammal, Khan Salman, Wonka Peter
- CLIPAG: Towards Generator-free Text-to-image Generation Ganz Roy, Elad Michael
- ASSISTGUI: Task-oriented Desktop Graphical User Interface Automation Gao Difei, Ji Lei, Bai Zechen, Ouyang Mingyu, Li Peiran, Mao Dongxing, Wu Qinchen, Zhang Weichen, Wang Peiyi, Guo Xiangwu, Wang Hengxu, Zhou Luowei, Shou Mike Zheng
- Policy-gradient Training Of Language Models For Ranking Gao Ge, Chang Jonathan D., Cardie Claire, Brantley Kianté, Joachims Thorsten
- Benchmarking Large Language Models With Augmented Instructions For Fine-grained Information Extraction Gao Jun, Zhao Huan, Zhang Yice, Wang Wei, Yu Changlong, Xu Ruifeng
- Text Encoders Lack Knowledge: Leveraging Generative Llms For Domain-specific Semantic Textual Similarity Gatto Joseph, Sharif Omar, Seegmiller Parker, Bohlman Philip, Preum Sarah Masud
- Reasoning In Large Language Models Through Symbolic Math Word Problems Gaur Vedant, Saunshi Nikunj
- In-context Autoencoder For Context Compression In A Large Language Model Ge Tao, Hu Jing, Wang Lei, Wang Xun, Chen Si-qing, Wei Furu
- Mllm-bench: Evaluating Multimodal Llms With Per-sample Criteria Ge Wentao, Chen Shunian, Chen Guiming Hardy, Chen Zhihong, Chen Junying, Yan Shuo, Zhu Chenghao, Lin Ziyue, Xie Wenya, Zhang Xinyi, Chai Yichen, Liu Xiaoyu, Song Dingjie, Wang Xidong, Gao Anningzhe, Zhang Zhiyi, Li Jianquan, Wan Xiang, Wang Benyou
- Ddcot: Duty-distinct Chain-of-thought Prompting For Multimodal Reasoning In Language Models Ge Zheng, Bin Yang, Jiajin Tang, Hong-yu Zhou, Sibei Yang
- Mblip: Efficient Bootstrapping Of Multilingual Vision-llms Geigle Gregor, Jain Abhay, Timofte Radu, Glavaš Goran
- Trueteacher: Learning Factual Consistency Evaluation With Large Language Models Gekhman Zorik, Herzig Jonathan, Aharoni Roee, Elkind Chen, Szpektor Idan
- Flacuna: Unleashing The Problem Solving Power Of Vicuna Using FLAN Fine-tuning Ghosal Deepanway, Chia Yew Ken, Majumder Navonil, Poria Soujanya
- Clipsyntel: CLIP And LLM Synergy For Multimodal Question Summarization In Healthcare Ghosh Akash, Acharya Arkadeep, Jain Raghav, Saha Sriparna, Chadha Aman, Sinha Setu
- Resetox: Re-learning Attention Weights For Toxicity Mitigation In Machine Translation Gilabert Javier García, Escolano Carlos, Costa-jussà Marta R.
- Semantic Compression With Large Language Models Gilbert Henry, Sandborn Michael, Schmidt Douglas C., Spencer-smith Jesse, White Jules
- Can Llms Capture Human Preferences? Goli Ali, Singh Amandeep
- Coascore: Chain-of-aspects Prompting For NLG Evaluation Gong Peiyuan, Mao Jiaxin
- Evaluating The Potential Of Leading Large Language Models In Reasoning Biology Questions Gong Xinyu, Holmes Jason, Li Yiwei, Liu Zhengliang, Gan Qi, Wu Zihao, Zhang Jianli, Zou Yusong, Teng Yuxi, Jiang Tian, Zhu Hongtu, Liu Wei, Liu Tianming, Yan Yajun
- Figstep: Jailbreaking Large Vision-language Models Via Typographic Visual Prompts Gong Yichen, Ran Delong, Liu Jinyuan, Wang Conglei, Cong Tianshuo, Wang Anyu, Duan Sisi, Wang Xiaoyun
- Integration Of Large Language Models Within Cognitive Architectures For Autonomous Robots González-santamarta Miguel Á., Rodríguez-lera Francisco J., Guerrero-higueras Ángel Manuel, Matellán-olivera Vicente
- Augmented Language Models: A Survey Grégoire Mialon, Roberto Dessì, Maria Lomeli, Christoforos Nalmpantis, Ram Pasunuru, Roberta Raileanu, Baptiste Rozière, Timo Schick, Jane Dwivedi-yu, Asli Celikyilmaz, Edouard Grave, Yann Lecun, Thomas Scialom
- Linguistically Informed Chatgpt Prompts To Enhance Japanese-chinese Machine Translation: A Case Study On Attributive Clauses Gu Wenshi
- Seer: Language Instructed Video Prediction With Latent Diffusion Models Gu Xianfan, Wen Chuan, Ye Weirui, Song Jiaming, Gao Yang
- Leveraging Pre-trained Large Language Models To Construct And Utilize World Models For Model-based Task Planning Guan Lin, Valmeekam Karthik, Sreedharan Sarath, Kambhampati Subbarao
- Mitigating Large Language Model Hallucinations Via Autonomous Knowledge Graph-based Retrofitting Guan Xinyan, Liu Yanjiang, Lin Hongyu, Lu Yaojie, He Ben, Han Xianpei, Sun Le
- The False Promise Of Imitating Proprietary Llms Gudibande Arnav, Wallace Eric, Snell Charlie, Geng Xinyang, Liu Hao, Abbeel Pieter, Levine Sergey, Song Dawn
- Hifi: High-information Attention Heads Hold For Parameter-efficient Model Adaptation Gui Anchun, Xiao Han
- Hiformer: Heterogeneous Feature Interactions Learning With Transformers For Recommender Systems Gui Huan, Wang Ruoxi, Yin Ke, Jin Long, Kula Maciej, Xu Taibai, Hong Lichan, Chi Ed H.
- Beyond Imitation: Leveraging Fine-grained Quality Signals For Alignment Guo Geyang, Zhao Ranchi, Tang Tianyi, Zhao Wayne Xin, Wen Ji-rong
- Empowering Working Memory For Large Language Model Agents Guo Jing, Li Nan, Qi Jianchuan, Yang Hang, Li Ruiqiao, Feng Yuzhen, Zhang Si, Xu Ming
- Connecting Large Language Models With Evolutionary Algorithms Yields Powerful Prompt Optimizers Guo Qingyan, Wang Rui, Guo Junliang, Li Bei, Song Kaitao, Tan Xu, Liu Guoqing, Bian Jiang, Yang Yujiu
- Meta-tuning Llms To Leverage Lexical Knowledge For Generalizable Language Style Understanding Guo Ruohao, Xu Wei, Ritter Alan
- Knowledgenavigator: Leveraging Large Language Models For Enhanced Reasoning Over Knowledge Graph Guo Tiezheng, Yang Qingwen, Wang Chen, Liu Yanyi, Li Pan, Tang Jiawei, Li Dapeng, Wen Yingyou
- ELIP: Efficient Language-image Pre-training With Fewer Vision Tokens Guo Yangyang, Zhang Haoyu, Wong Yongkang, Nie Liqiang, Kankanhalli Mohan
- Doremi: Grounding Language Model By Detecting And Recovering From Plan-execution Misalignment Guo Yanjiang, Wang Yen-jen, Zha Lihan, Jiang Zheyuan, Chen Jianyu
- Retrieval-augmented Code Generation For Universal Information Extraction Guo Yucan, Li Zixuan, Jin Xiaolong, Liu Yantao, Zeng Yutao, Liu Wenxuan, Li Xiang, Yang Pan, Bai Long, Guo Jiafeng, Cheng Xueqi
- Evaluating Concurrent Robustness Of Language Models Across Diverse Challenge Sets Gupta Vatsal, Pandya Pranshu, Kataria Tushar, Gupta Vivek, Roth Dan
- A Survey Of Vision-language Pre-training From The Lens Of Multimodal Machine Translation Gwinnup Jeremy, Duh Kevin
- Scaling Up And Distilling Down: Language-guided Robot Skill Acquisition Ha Huy, Florence Pete, Song Shuran
- Is GPT-4 A Reliable Rater? Evaluating Consistency In GPT-4 Text Ratings Hackl Veronika, Müller Alexandra Elena, Granitzer Michael, Sailer Maximilian
- Onellm: One Framework To Align All Modalities With Language Han Jiaming, Gong Kaixiong, Zhang Yiyuan, Wang Jiaqi, Zhang Kaipeng, Lin Dahua, Qiao Yu, Gao Peng, Yue Xiangyu
- Llms As Visual Explainers: Advancing Image Classification With Evolving Visual Descriptions Han Songhao, Zhuo Le, Liao Yue, Liu Si
- In-context Alignment: Chat With Vanilla Language Models Before Fine-tuning Han Xiaochuang
- Chartllama: A Multimodal LLM For Chart Understanding And Generation Han Yucheng, Zhang Chi, Chen Xin, Yang Xu, Wang Zhibin, Yu Gang, Fu Bin, Zhang Hanwang
- Boosting Large Language Model For Speech Synthesis: An Empirical Study Hao Hongkun, Zhou Long, Liu Shujie, Li Jinyu, Hu Shujie, Wang Rui, Wei Furu
- Blockwise Parallel Transformer For Large Context Models Hao Liu, Pieter Abbeel
- Towards Efficient Vision-language Tuning: More Information Density, More Generalizability Hao Tianxiang, Lyu Mengyao, Chen Hui, Zhao Sicheng, Ding Xiaohan, Han Jungong, Ding Guiguang
- Zero-shot Recommendations With Pre-trained Large Language Models For Multimodal Nudging Harrison Rachel M., Dereventsov Anton, Bibin Anton
- Leveraging Large Language Models For Sequential Recommendation Harte Jesse, Zorgdrager Wouter, Louridas Panos, Katsifodimos Asterios, Jannach Dietmar, Fragkoulis Marios
- Answering Unseen Questions With Smaller Language Models Using Rationale Generation And Dense Retrieval Hartill Tim, Benavides-prado Diana, Witbrock Michael, Riddle Patricia J.
- Can Large Language Models Understand Real-world Complex Instructions? He Qianyu, Zeng Jie, Huang Wenhao, Chen Lina, Xiao Jin, He Qianxi, Zhou Xunzhe, Chen Lida, Wang Xintao, Huang Yuncheng, Ye Haoning, Li Zihan, Chen Shisong, Zhang Yikai, Gu Zhouhong, Liang Jiaqing, Xiao Yanghua
- You Only Prompt Once: On The Capabilities Of Prompt Learning On Large Language Models To Tackle Toxic Content He Xinlei, Zannettou Savvas, Shen Yun, Zhang Yang
- Simplemtod: A Simple Language Model For Multimodal Task-oriented Dialogue With Symbolic Scene Representation Hemanthage Bhathiya, Dondrup Christian, Bartie Phil, Lemon Oliver
- Newsgpt: Chatgpt Integration For Robot-reporter Hireche Abdelhadi, Belkacem Abdelkader Nasreddine, Jamil Sadia, Chen Chao
- Evaluating Large Language Models On A Highly-specialized Topic, Radiation Oncology Physics Holmes Jason, Liu Zhengliang, Zhang Lian, Ding Yuzhen, Sio Terence T., Mcgee Lisa A., Ashman Jonathan B., Li Xiang, Liu Tianming, Shen Jiajian, Liu Wei
- Zero-shot Goal-directed Dialogue Via RL On Imagined Conversations Hong Joey, Levine Sergey, Dragan Anca
- Direct2v: Large Language Models Are Frame-level Directors For Zero-shot Text-to-video Generation Hong Susung, Seo Junyoung, Shin Heeseong, Hong Sunghwan, Kim Seungryong
- CIEM: Contrastive Instruction Evaluation Method For Better Instruction Tuning Hongyu Hu, Jiyuan Zhang, Minyi Zhao, Zhenbang Sun
- SCITUNE: Aligning Large Language Models With Scientific Multimodal Instructions Horawalavithana Sameera, Munikoti Sai, Stewart Ian, Kvinge Henry
- Ralle: A Framework For Developing And Evaluating Retrieval-augmented Large Language Models Hoshi Yasuto, Miyashita Daisuke, Ng Youyang, Tatsuno Kento, Morioka Yasuhiro, Torii Osamu, Deguchi Jun
- More Robots Are Coming: Large Multimodal Models (chatgpt) Can Solve Visually Diverse Images Of Parsons Problems Hou Irene, Man Owen, Mettille Sophie, Gutierrez Sebastian, Angelikas Kenneth, Macneil Stephen
- Distilling Step-by-step! Outperforming Larger Language Models With Less Training Data And Smaller Model Sizes Hsieh Cheng-yu, Li Chun-liang, Yeh Chih-kuan, Nakhost Hootan, Fujii Yasuhisa, Ratner Alexander, Krishna Ranjay, Lee Chen-yu, Pfister Tomas
- Context-aware Prompt Tuning For Vision-language Model With Dual-alignment Hu Hongyu, Lin Tiancheng, Wang Jie, Sun Zhenbang, Xu Yi
- Massively Multilingual Shallow Fusion With Large Language Models Hu Ke, Sainath Tara N., Li Bo, Du Nan, Huang Yanping, Dai Andrew M., Zhang Yu, Cabrera Rodrigo, Chen Zhifeng, Strohman Trevor
- Separate The Wheat From The Chaff: Model Deficiency Unlearning Via Parameter-efficient Module Operation Hu Xinshuo, Li Dongfang, Hu Baotian, Zheng Zihao, Liu Zhenyu, Zhang Min
- Token-level Adversarial Prompt Detection Based On Perplexity Measures And Contextual Information Hu Zhengmian, Wu Gang, Mitra Saayan, Zhang Ruiyi, Sun Tong, Huang Heng, Swaminathan Viswanathan
- Unlocking The Potential Of User Feedback: Leveraging Large Language Model As User Simulator To Enhance Dialogue System Hu Zhiyuan, Feng Yue, Luu Anh Tuan, Hooi Bryan, Lipani Aldo
- Deploying And Evaluating Llms To Program Service Mobile Robots Hu Zichao, Lucchetti Francesca, Schlesinger Claire, Saxena Yash, Freeman Anders, Modak Sadanand, Guha Arjun, Biswas Joydeep
- SMILE: Single-turn To Multi-turn Inclusive Language Expansion Via Chatgpt For Mental Health Support Huachuan Qiu, Hongliang He, Shuai Zhang, Anqi Li, Zhenzhong Lan
- Directed Acyclic Transformer Pre-training For High-quality Non-autoregressive Text Generation Huang Fei, Ke Pei, Huang Minlie
- Not All Languages Are Created Equal In Llms: Improving Multilingual Capability By Cross-lingual-thought Prompting Huang Haoyang, Tang Tianyi, Zhang Dongdong, Zhao Wayne Xin, Song Ting, Xia Yan, Wei Furu
- Lawyer Llama Technical Report Huang Quzhe, Tao Mingxu, Zhang Chen, An Zhenwei, Jiang Cong, Chen Zhibin, Wu Zirui, Feng Yansong
- Grounded Decoding: Guiding Text Generation With Grounded Models For Embodied Agents Huang Wenlong, Xia Fei, Shah Dhruv, Driess Danny, Zeng Andy, Lu Yao, Florence Pete, Mordatch Igor, Levine Sergey, Hausman Karol, Ichter Brian
- Evaluating And Enhancing Large Language Models For Conversational Reasoning On Knowledge Graphs Huang Yuxuan, Shi Lida, Liu Anqi, Xu Hao
- C-eval: A Multi-level Multi-discipline Chinese Evaluation Suite For Foundation Models Huang Yuzhen, Bai Yuzhuo, Zhu Zhihao, Zhang Junlei, Zhang Jinghan, Su Tangjun, Liu Junteng, Lv Chuancheng, Zhang Yikai, Lei Jiayi, Fu Yao, Sun Maosong, He Junxian
- Smartedit: Exploring Complex Instruction-based Image Editing With Multimodal Large Language Models Huang Yuzhou, Xie Liangbin, Wang Xintao, Yuan Ziyang, Cun Xiaodong, Ge Yixiao, Zhou Jiantao, Dong Chao, Huang Rui, Zhang Ruimao, Shan Ying
- Soft Prompt Decoding For Multilingual Dense Retrieval Huang Zhiqi, Zeng Hansi, Zamani Hamed, Allan James
- Comparing Large Language Model AI And Human-generated Coaching Messages For Behavioral Weight Loss Huang Zhuoran, Berry Michael P., Chwyl Christina, Hsieh Gary, Wei Jing, Forman Evan M.
- Do You Trust Chatgpt? -- Perceived Credibility Of Human And Ai-generated Content Huschens Martin, Briesch Martin, Sobania Dominik, Rothlauf Franz
- Towards Leveraging Llms For Conditional QA Hussain Syed-amad, Dakle Parag Pravin, Rallabandi Saikrishna, Raghavan Preethi
- Conceptual Model Interpreter For Large Language Models Härer Felix
- Minimizing Factual Inconsistency And Hallucination In Large Language Models I Muneeswaran, Saxena Shreya, Prasad Siva, Prakash M V Sai, Shankar Advaith, V Varun, Vaddina Vishal, Gopalakrishnan Saisubramaniam
- Multitool-cot: GPT-3 Can Use Multiple External Tools With Chain Of Thought Prompting Inaba Tatsuro, Kiyomaru Hirokazu, Cheng Fei, Kurohashi Sadao
- Llama Guard: Llm-based Input-output Safeguard For Human-ai Conversations Inan Hakan, Upasani Kartikeya, Chi Jianfeng, Rungta Rashi, Iyer Krithika, Mao Yuning, Tontchev Michael, Hu Qing, Fuller Brian, Testuggine Davide, Khabsa Madian
- Leveraging Large Language Models To Generate Answer Set Programs Ishay Adam, Yang Zhun, Lee Joohyung
- Query Expansion By Prompting Large Language Models Jagerman Rolf, Zhuang Honglei, Qin Zhen, Wang Xuanhui, Bendersky Michael
- Faithful Persona-based Conversational Dataset Generation With Large Language Models Jandaghi Pegah, Sheng Xianghai, Bai Xinyi, Pujara Jay, Sidahmed Hakim
- Chatbot Is Not All You Need: Information-rich Prompting For More Realistic Responses Jeong Seokhoon, Makhmud Assentay
- Symbol Tuning Improves In-context Learning In Language Models Jerry Wei, Le Hou, Andrew Lampinen, Xiangning Chen, Da Huang, Yi Tay, Xinyun Chen, Yifeng Lu, Denny Zhou, Tengyu Ma, Quoc V. Le
- Chit-chat Or Deep Talk: Prompt Engineering For Process Mining Jessen Urszula, Sroka Michal, Fahland Dirk
- Tailoring Language Generation Models Under Total Variation Distance Ji Haozhe, Ke Pei, Hu Zhipeng, Zhang Rongsheng, Huang Minlie
- Large Language Models As Automated Aligners For Benchmarking Vision-language Models Ji Yuanfeng, Ge Chongjian, Kong Weikai, Xie Enze, Liu Zhengying, Li Zhengguo, Luo Ping
- Mistral 7B Jiang Albert Q., Sablayrolles Alexandre, Mensch Arthur, Bamford Chris, Chaplot Devendra Singh, Casas Diego De Las, Bressand Florian, Lengyel Gianna, Lample Guillaume, Saulnier Lucile, Lavaud Lélio Renard, Lachaux Marie-anne, Stock Pierre, Scao Teven Le, Lavril Thibaut, Wang Thomas, Lacroix Timothée, Sayed William El
- Llm-blender: Ensembling Large Language Models With Pairwise Ranking And Generative Fusion Jiang Dongfu, Ren Xiang, Lin Bill Yuchen
- Selfevolve: A Code Evolution Framework Via Large Language Models Jiang Shuyang, Wang Yuhao, Wang Yu
- Resprompt: Residual Connection Prompting Advances Multi-step Reasoning In Large Language Models Jiang Song, Shakeri Zahra, Chan Aaron, Sanjabi Maziar, Firooz Hamed, Xia Yinglong, Akyildiz Bugra, Sun Yizhou, Li Jinchao, Wang Qifan, Celikyilmaz Asli
- Hykge: A Hypothesis Knowledge Graph Enhanced Framework For Accurate And Reliable Medical Llms Responses Jiang Xinke, Zhang Ruizhe, Xu Yongxin, Qiu Rihong, Fang Yue, Wang Zhiyuan, Tang Jinyi, Ding Hongxin, Chu Xu, Zhao Junfeng, Wang Yasha
- Lion: Adversarial Distillation Of Proprietary Large Language Models Jiang Yuxin, Chan Chunkit, Chen Mingyang, Wang Wei
- Llara: Large Language-recommendation Assistant Jiayi Liao, Sihang Li, Zhengyi Yang, Jiancan Wu, Yancheng Yuan, Xiang Wang, Xiangnan He
- Prompt-based Length Controlled Generation With Reinforcement Learning Jie Renlong, Meng Xiaojun, Shang Lifeng, Jiang Xin, Liu Qun
- When Large Language Models Meet Personalization: Perspectives Of Challenges And Opportunities Jin Chen, Zheng Liu, Xu Huang, Chenwang Wu, Qi Liu, Gangwei Jiang, Yuanhao Pu, Yuxuan Lei, Xiaolong Chen, Xingmei Wang, Defu Lian, Enhong Chen
- Generating Images With Multimodal Language Models Jing Yu Koh, Daniel Fried, Ruslan Salakhutdinov
- Grounding Language Models To Images For Multimodal Inputs And Outputs Jing Yu Koh, Ruslan Salakhutdinov, Daniel Fried
- Knowledge-augmented Large Language Models For Personalized Contextual Query Suggestion Jinheon Baek, Nirupama Chandrasekaran, Silviu Cucerzan, Allen Herring, Sujay Kumar Jauhar
- Vcoder: Versatile Vision Encoders For Multimodal Large Language Models Jitesh Jain, Jianwei Yang, Humphrey Shi
- GPT Is Becoming A Turing Machine: Here Are Some Ways To Program It Jojic Ana, Wang Zhen, Jojic Nebojsa
- Improving Activation Steering In Language Models With Mean-centring Jorgensen Ole, Cope Dylan, Schoots Nandi, Shanahan Murray
- Chatgpt In The Classroom: An Analysis Of Its Strengths And Weaknesses For Solving Undergraduate Computer Science Questions Joshi Ishika, Budhiraja Ritvik, Dev Harshal, Kadia Jahnvi, Ataullah M. Osama, Mitra Sayan, Kumar Dhruv, Akolekar Harshal D.
- A Case Study On Test Case Construction With Large Language Models: Unveiling Practical Insights And Challenges Junior Roberto Francisco De Lima, Presta Luiz Fernando Paes De Barros, Borborema Lucca Santos, Da Silva Vanderson Nogueira, Dahia Marcio Leal De Melo, Santos Anderson Carlos Sousa E
- Evaluation And Analysis Of Hallucination In Large Vision-language Models Junyang Wang, Yiyang Zhou, Guohai Xu, Pengcheng Shi, Chenlin Zhao, Haiyang Xu, Qinghao Ye, Ming Yan, Ji Zhang, Jihua Zhu, Jitao Sang, Haoyu Tang
- TARJAMAT: Evaluation Of Bard And Chatgpt On Machine Translation Of Ten Arabic Varieties Kadaoui Karima, Magdy Samar M., Waheed Abdul, Khondaker Md Tawkat Islam, El-shangiti Ahmed Oumar, Nagoudi El Moatez Billah, Abdul-mageed Muhammad
- Domain-specific Code Language Models: Unraveling The Potential For HPC Codes And Tasks Kadosh Tal, Hasabnis Niranjan, Vo Vy A., Schneider Nadav, Krien Neva, Capota Mihai, Wasay Abdul, Ahmed Nesreen, Willke Ted, Tamir Guy, Pinter Yuval, Mattson Timothy, Oren Gal
- Deficiency Of Large Language Models In Finance: An Empirical Examination Of Hallucination Kang Haoqiang, Liu Xiao-yang
- Ever: Mitigating Hallucination In Large Language Models Through Real-time Verification And Rectification Kang Haoqiang, Ni Juntong, Yao Huaxiu
- Llm-augmented Preference Learning From Natural Language Kang Inwon, Ruan Sikai, Ho Tyler, Lin Jui-chien, Mohsin Farhad, Seneviratne Oshani, Xia Lirong
- Self-specialization: Uncovering Latent Expertise Within Large Language Models Kang Junmo, Luo Hongyin, Zhu Yada, Hansen Jacob, Glass James, Cox David, Ritter Alan, Feris Rogerio, Karlinsky Leonid
- Chatmof: An Autonomous AI System For Predicting And Generating Metal-organic Frameworks Kang Yeonghun, Kim Jihan
- Shs-nlp At Radsum23: Domain-adaptive Pre-training Of Instruction-tuned Llms For Radiology Report Impression Generation Karn Sanjeev Kumar, Ghosh Rikhiya, P Kusuma, Farri Oladimeji
- Large Language Models Effectively Leverage Document-level Context For Literary Translation, But Critical Errors Persist Karpinska Marzena, Iyyer Mohit
- Sorted Llama: Unlocking The Potential Of Intermediate Layers Of Large Language Models For Dynamic Inference Kavehzadeh Parsa, Valipour Mojtaba, Tahaei Marzieh, Ghodsi Ali, Chen Boxing, Rezagholizadeh Mehdi
- Zzzgpt: An Interactive GPT Approach To Enhance Sleep Quality Khaokaew Yonchanok, Ji Kaixin, Nguyen Thuc Hanh, Kegalle Hiruni, Alaofi Marwah, Xue Hao, Salim Flora D.
- Memory-efficient Fine-tuning Of Compressed Large Language Models Via Sub-4-bit Integer Quantization Kim Jeonghoon, Lee Jung Hyun, Kim Sungdong, Park Joonsuk, Yoo Kang Min, Kwon Se Jung, Lee Dongsoo
- Which Is Better? Exploring Prompting Strategy For Llm-based Metrics Kim Joonghoon, Park Saeran, Jeong Kiyoon, Lee Sangmin, Han Seung Hun, Lee Jiyoon, Kang Pilsung
- LLM4SGG: Large Language Models For Weakly Supervised Scene Graph Generation Kim Kibum, Yoon Kanghoon, Jeon Jaehyeong, In Yeonjun, Moon Jinyoung, Kim Donghyun, Park Chanyoung
- Speculative Decoding With Big Little Decoder Kim Sehoon, Mangalam Karttikeya, Moon Suhong, Malik Jitendra, Mahoney Michael W., Gholami Amir, Keutzer Kurt
- Lmcanvas: Object-oriented Interaction To Personalize Large Language Model-powered Writing Environments Kim Tae Soo, Sarkar Arghya, Lee Yoonjoo, Chang Minsuk, Kim Juho
- Mindfuldiary: Harnessing Large Language Model To Support Psychiatric Patients' Journaling Kim Taewan, Bae Seolyeong, Kim Hyun Ah, Lee Su-woo, Hong Hwajung, Yang Chanmo, Kim Young-ho
- The Past, Present And Better Future Of Feedback Learning In Large Language Models For Subjective Human Preferences And Values Kirk Hannah Rose, Bean Andrew M., Vidgen Bertie, Röttger Paul, Hale Scott A.
- Boschai @ PLABA 2023: Leveraging Edit Operations In End-to-end Neural Sentence Simplification Knappich Valentin, Razniewski Simon, Friedrich Annemarie
- Large Language Models Are Temporal And Causal Reasoners For Video Question Answering Ko Dohwan, Lee Ji Soo, Kang Wooyoung, Roh Byungseok, Kim Hyunwoo J.
- Chatgpt: Jack Of All Trades, Master Of None Kocoń Jan, Cichecki Igor, Kaszyca Oliwier, Kochanek Mateusz, Szydło Dominika, Baran Joanna, Bielaniewicz Julita, Gruza Marcin, Janz Arkadiusz, Kanclerz Kamil, Kocoń Anna, Koptyra Bartłomiej, Mieleszczenko-kowszewicz Wiktoria, Miłkowski Piotr, Oleksy Marcin, Piasecki Maciej, Radliński Łukasz, Wojtasik Konrad, Woźniak Stanisław, Kazienko Przemysław
- Generating Images With Multimodal Language Models Koh Jing Yu, Fried Daniel, Salakhutdinov Ruslan
- Writing Your Own Book: A Method For Going From Closed To Open Book QA To Improve Robustness And Performance Of Smaller Llms Kokaia Giorgi, Sinha Pratyush, Jiang Yutong, Boujemaa Nozha
- Contextual Refinement Of Translations: Large Language Models For Sentence And Document-level Post-editing Koneru Sai, Exel Miriam, Huck Matthias, Niehues Jan
- Platolm: Teaching Llms In Multi-round Dialogue Via A User Simulator Kong Chuyi, Fan Yaxin, Wan Xiang, Jiang Feng, Wang Benyou
- Benchmarking Cognitive Biases In Large Language Models As Evaluators Koo Ryan, Lee Minhwa, Raheja Vipul, Park Jong Inn, Kim Zae Myung, Kang Dongyeop
- NOLA: Compressing Lora Using Linear Combination Of Random Basis Koohpayegani Soroush Abbasi, Navaneet Kl, Nooralinejad Parsa, Kolouri Soheil, Pirsiavash Hamed
- Developing Effective Educational Chatbots With Chatgpt Prompts: Insights From Preliminary Tests In A Case Study On Social Media Literacy (With Appendix) Koyuturk Cansu, Yavari Mona, Theophilou Emily, Bursic Sathya, Donabauer Gregor, Telari Alessia, Testa Alessia, Boiano Raffaele, Gabbiadini Alessandro, Hernandez-leo Davinia, Ruskov Martin, Ognibene Dimitri
- Enhancing Large Language Models With Climate Resources Kraus Mathias, Bingler Julia Anna, Leippold Markus, Schimanski Tobias, Senni Chiara Colesanti, Stammbach Dominik, Vaghefi Saeid Ashraf, Webersinke Nicolas
- Leveraging Few-shot Data Augmentation And Waterfall Prompting For Response Generation Krause Lea, Santamaría Selene Báez, Van Der Meer Michiel, Khurana Urja
- Prompt Generate Train (PGT): Few-shot Domain Adaption Of Retrieval Augmented Generation Models For Open Book Question-answering Krishna C. S.
- Post Hoc Explanations Of Language Models Can Improve Language Models Krishna Satyapriya, Ma Jiaqi, Slack Dylan, Ghandeharioun Asma, Singh Sameer, Lakkaraju Himabindu
- How Reliable Are Ai-generated-text Detectors? An Assessment Framework Using Evasive Soft Prompts Kumarage Tharindu, Sheth Paras, Moraffah Raha, Garland Joshua, Liu Huan
- DACBERT: Leveraging Dependency Agreement For Cost-efficient Bert Pretraining Kuo Martin, Zhang Jianyi, Chen Yiran
- Sparse Fine-tuning For Inference Acceleration Of Large Language Models Kurtic Eldar, Kuznedelev Denis, Frantar Elias, Goin Michael, Alistarh Dan
- In Generative AI We Trust: Can Chatbots Effectively Verify Political Information? Kuznetsova Elizaveta, Makhortykh Mykola, Vziatysheva Victoria, Stolze Martha, Baghumyan Ani, Urman Aleksandra
- Jotr: A Joint Transformer And Reinforcement Learning Framework For Dialog Policy Learning Kwan Wai-chung, Wang Huimin, Wang Hongru, Wang Zezhong, Wu Xian, Zheng Yefeng, Wong Kam-fai
- M4LE: A Multi-ability Multi-range Multi-task Multi-domain Long-context Evaluation Benchmark For Large Language Models Kwan Wai-chung, Zeng Xingshan, Wang Yufei, Sun Yusen, Li Liangyou, Shang Lifeng, Liu Qun, Wong Kam-fai
- Reward Design With Language Models Kwon Minae, Xie Sang Michael, Bullard Kalesha, Sadigh Dorsa
- Are You Sure? Challenging Llms Leads To Performance Drops In The Flipflop Experiment Laban Philippe, Murakhovs'ka Lidiya, Xiong Caiming, Wu Chien-sheng
- Psy-llm: Scaling Up Global Mental Health Psychological Services With Ai-based Large Language Models Lai Tin, Shi Yukun, Du Zicong, Wu Jiajie, Fu Ken, Dou Yichao, Wang Ziqi
- End-to-end Speech Recognition Contextualization With Large Language Models Lakomkin Egor, Wu Chunyang, Fathullah Yassir, Kalinli Ozlem, Seltzer Michael L., Fuegen Christian
- Fine-tuning Chatgpt For Automatic Scoring Latif Ehsan, Zhai Xiaoming
- Orchestrallm: Efficient Orchestration Of Language Models For Dialogue State Tracking Lee Chia-hsuan, Cheng Hao, Ostendorf Mari
- Read-only Prompt Optimization For Vision-language Few-shot Learning Lee Dongjun, Song Seokwon, Suh Jihee, Choi Joonmyung, Lee Sanghyeok, Kim Hyunwoo J.
- Towards Reliable And Fluent Large Language Models: Incorporating Feedback Learning Loops In QA Systems Lee Dongyub, Whang Taesun, Lee Chanhee, Lim Heuiseok
- Is Attention Required For ICL? Exploring The Relationship Between Model Architecture And In-context Learning Ability Lee Ivan, Jiang Nan, Berg-kirkpatrick Taylor
- Volcano: Mitigating Multimodal Hallucination Through Self-feedback Guided Revision Lee Seongyun, Park Sue Hyun, Jo Yongrae, Seo Minjoon
- Violation Of Expectation Via Metacognitive Prompting Reduces Theory Of Mind Prediction Error In Large Language Models Leer Courtland, Trost Vincent, Voruganti Vineeth
- Boosting Logical Reasoning In Large Language Models Through A New Framework: The Graph Of Thought Lei Bin, Lin Pei-hung, Liao Chunhua, Ding Caiwen
- Large Language Models For Generative Recommendation: A Survey And Visionary Discussions Lei Li, Yongfeng Zhang, Dugang Liu, Li Chen
- Recexplainer: Aligning Large Language Models For Explaining Recommendation Models Lei Yuxuan, Lian Jianxun, Yao Jing, Huang Xu, Lian Defu, Xie Xing
- Talk The Walk: Synthetic Data Generation For Conversational Music Recommendation Leszczynski Megan, Zhang Shu, Ganti Ravi, Balog Krisztian, Radlinski Filip, Pereira Fernando, Chaganty Arun Tejasvi
- Deliberate Then Generate: Enhanced Prompting Framework For Text Generation Li Bei, Wang Rui, Guo Junliang, Song Kaitao, Tan Xu, Hassan Hany, Menezes Arul, Xiao Tong, Bian Jiang, Zhu Jingbo
- Dialogue For Prompting: A Policy-gradient-based Discrete Prompt Generation For Few-shot Learning Li Chengzhengxu, Liu Xiaoming, Wang Yichen, Li Duyi, Lan Yu, Shen Chao
- Llava-med: Training A Large Language-and-vision Assistant For Biomedicine In One Day Li Chunyuan, Wong Cliff, Zhang Sheng, Usuyama Naoto, Liu Haotian, Yang Jianwei, Naumann Tristan, Poon Hoifung, Gao Jianfeng
- CMMLU: Measuring Massive Multitask Language Understanding In Chinese Li Haonan, Zhang Yixuan, Koto Fajri, Yang Yifei, Zhao Hai, Gong Yeyun, Duan Nan, Baldwin Timothy
- Mastering Robot Manipulation With Multimodal Prompts Through Pretraining And Multi-task Fine-tuning Li Jiachen, Gao Qiaozi, Johnston Michael, Gao Xiaofeng, He Xuehai, Shakiah Suhaila, Shi Hangjie, Ghanadan Reza, Wang William Yang
- Adaptive Gating In Mixture-of-experts Based Language Models Li Jiamin, Su Qiang, Yang Yitao, Jiang Yimin, Wang Cong, Xu Hong
- Evcap: Retrieval-augmented Image Captioning With External Visual-name Memory For Open-world Comprehension Li Jiaxuan, Vo Duc Minh, Sugimoto Akihiro, Nakayama Hideki
- Counterfactual Reasoning: Testing Language Models' Understanding Of Hypothetical Scenarios Li Jiaxuan, Yu Lang, Ettinger Allyson
- Overprompt: Enhancing Chatgpt Through Efficient In-context Learning Li Jiazheng, Zhao Runcong, Yang Yongxin, He Yulan, Gui Lin
- On The Steerability Of Large Language Models Toward Data-driven Personas Li Junyi, Mehrabi Ninareh, Peris Charith, Goyal Palash, Chang Kai-wei, Galstyan Aram, Zemel Richard, Gupta Rahul
- The Web Can Be Your Oyster For Improving Large Language Models Li Junyi, Tang Tianyi, Zhao Wayne Xin, Wang Jingyuan, Nie Jian-yun, Wen Ji-rong
- Instructcoder: Instruction Tuning Large Language Models For Code Editing Li Kaixin, Hu Qisheng, Zhao Xu, Chen Hui, Xie Yuxi, Liu Tiedong, Xie Qizhe, He Junxian
- Tool-augmented Reward Modeling Li Lei, Chai Yekun, Wang Shuohuan, Sun Yu, Tian Hao, Zhang Ningyu, Wu Hua
- Api-bank: A Comprehensive Benchmark For Tool-augmented Llms Li Minghao, Zhao Yingxiu, Yu Bowen, Song Feifan, Li Hangyu, Yu Haiyang, Li Zhoujun, Huang Fei, Li Yongbin
- Laffi: Leveraging Hybrid Natural Language Feedback For Fine-tuning Language Models Li Qianxi, Cao Yingyue, Kang Jikun, Yang Tianpei, Chen Xi, Jin Jun, Taylor Matthew E.
- Adapting Large Language Models For Education: Foundational Capabilities, Potentials, And Challenges Li Qingyao, Fu Lingyue, Zhang Weiming, Chen Xianyu, Yu Jingwei, Xia Wei, Zhang Weinan, Tang Ruiming, Yu Yong
- Vlm-eval: A General Evaluation On Video Large Language Models Li Shuailin, Zhang Yuang, Zhao Yucheng, Wang Qiuyue, Jia Fan, Liu Yingfei, Wang Tiancai
- TRAQ: Trustworthy Retrieval Augmented Question Answering Via Conformal Prediction Li Shuo, Park Sangdon, Lee Insup, Bastani Osbert
- From Classification To Generation: Insights Into Crosslingual Retrieval Augmented ICL Li Xiaoqian, Nie Ercong, Liang Sheng
- Unigen: A Unified Generative Framework For Retrieval And Question Answering With Large Language Models Li Xiaoxi, Zhou Yujia, Dou Zhicheng
- Think Outside The Code: Brainstorming Boosts Large Language Models In Code Generation Li Xin-ye, Xue Jiang-tian, Xie Zheng, Li Ming
- Chain-of-knowledge: Grounding Large Language Models Via Dynamic Knowledge Adapting Over Heterogeneous Sources Li Xingxuan, Zhao Ruochen, Chia Yew Ken, Ding Bosheng, Joty Shafiq, Poria Soujanya, Bing Lidong
- PBNR: Prompt-based News Recommender System Li Xinyi, Zhang Yongfeng, Malthouse Edward C.
- Turning Dust Into Gold: Distilling Complex Reasoning Capabilities From Llms By Leveraging Negative Data Li Yiwei, Yuan Peiwen, Feng Shaoxiong, Pan Boyuan, Sun Bin, Wang Xinglin, Wang Heda, Li Kan
- Textbooks Are All You Need II: Phi-1.5 Technical Report Li Yuanzhi, Bubeck Sébastien, Eldan Ronen, Del Giorno Allie, Gunasekar Suriya, Lee Yin Tat
- E-sparse: Boosting The Large Language Model Inference Through Entropy-based N:M Sparsity Li Yun, Niu Lin, Zhang Xipeng, Liu Kai, Zhu Jianchen, Kang Zhanhui
- Towards Vision Enhancing Llms: Empowering Multimodal Knowledge Storage And Sharing In Llms Li Yunxin, Hu Baotian, Wang Wei, Cao Xiaochun, Zhang Min
- Advancing Precise Outline-conditioned Text Generation With Task Duality And Explicit Outline Control Li Yunzhe, Chen Qian, Yan Weixiang, Wang Wen, Zhang Qinglin, Sundaram Hari
- Flexkbqa: A Flexible Llm-powered Framework For Few-shot Knowledge Base Question Answering Li Zhenyu, Fan Sunqi, Gu Yu, Li Xiuxing, Duan Zhichao, Dong Bowen, Liu Ning, Wang Jianyong
- Synthetic Data Generation With Large Language Models For Text Classification: Potential And Limitations Li Zhuoyan, Zhu Hangxiao, Lu Zhuoran, Yin Ming
- Split And Merge: Aligning Position Biases In Large Language Model Based Evaluators Li Zongjie, Wang Chaozheng, Ma Pingchuan, Wu Daoyuan, Wang Shuai, Gao Cuiyun, Liu Yang
- Label Supervised Llama Finetuning Li Zongxi, Li Xianming, Liu Yuzhang, Xie Haoran, Li Jing, Wang Fu-lee, Li Qing, Zhong Xiaoqin
- Batgpt: A Bidirectional Autoregessive Talker From Generative Pre-trained Transformer Li Zuchao, Zhang Shitou, Zhao Hai, Yang Yifei, Yang Dongjie
- Llm-grounded Diffusion: Enhancing Prompt Understanding Of Text-to-image Diffusion Models With Large Language Models Lian Long, Li Boyi, Yala Adam, Darrell Trevor
- Towards End-to-end Embodied Decision Making Via Multi-modal Large Language Model: Explorations With Gpt4-vision And Beyond Liang Chen, Yichi Zhang, Shuhuai Ren, Haozhe Zhao, Zefan Cai, Yuchi Wang, Peiyi Wang, Tianyu Liu, Baobao Chang
- Encouraging Divergent Thinking In Large Language Models Through Multi-agent Debate Liang Tian, He Zhiwei, Jiao Wenxiang, Wang Xing, Wang Rui, Yang Yujiu, Tu Zhaopeng, Shi Shuming
- GPT-4 Enhanced Multimodal Grounding For Autonomous Driving: Leveraging Cross-modal Attention With Large Language Models Liao Haicheng, Shen Huanming, Li Zhenning, Wang Chengyue, Li Guofa, Bie Yiming, Xu Chengzhong
- Text-to-image Generation For Abstract Concepts Liao Jiayi, Chen Xu, Fu Qiang, Du Lun, He Xiangnan, Wang Xiang, Han Shi, Zhang Dongmei
- STEVE-1: A Generative Model For Text-to-behavior In Minecraft Lifshitz Shalev, Paster Keiran, Chan Harris, Ba Jimmy, Mcilraith Sheila
- A Survey On Large Language Models For Recommendation Likang Wu, Zhi Zheng, Zhaopeng Qiu, Hao Wang, Hongchao Gu, Tingjia Shen, Chuan Qin, Chen Zhu, Hengshu Zhu, Qi Liu, Hui Xiong, Enhong Chen
- Paralinguistics-enhanced Large Language Modeling Of Spoken Dialogue Lin Guan-ting, Shivakumar Prashanth Gurunath, Gandhe Ankur, Yang Chao-han Huck, Gu Yile, Ghosh Shalini, Stolcke Andreas, Lee Hung-yi, Bulyko Ivan
- Videodirectorgpt: Consistent Multi-scene Video Generation Via Llm-guided Planning Lin Han, Zala Abhay, Cho Jaemin, Bansal Mohit
- Learning To Model The World With Language Lin Jessy, Du Yuqing, Watkins Olivia, Hafner Danijar, Abbeel Pieter, Klein Dan, Dragan Anca
- Argue With Me Tersely: Towards Sentence-level Counter-argument Generation Lin Jiayu, Ye Rong, Han Meng, Zhang Qi, Lai Ruofei, Zhang Xinyu, Cao Zhao, Huang Xuanjing, Wei Zhongyu
- Advances In Embodied Navigation Using Large Language Models: A Survey Lin Jinzhou, Gao Han, Feng Xuxiang, Xu Rongtao, Wang Changwei, Zhang Man, Guo Li, Xu Shibiao
- Just Ask One More Time! Self-agreement Improves Reasoning Of Language Models In (Almost) All Scenarios Lin Lei, Fu Jiayi, Liu Pengli, Li Qingyang, Gong Yan, Wan Junchen, Zhang Fuzheng, Wang Zhongyuan, Zhang Di, Gai Kun
- RA-DIT: Retrieval-augmented Dual Instruction Tuning Lin Xi Victoria, Chen Xilun, Chen Mingda, Shi Weijia, Lomeli Maria, James Rich, Rodriguez Pedro, Kahn Jacob, Szilvasy Gergely, Lewis Mike, Zettlemoyer Luke, Yih Scott
- Taiwan LLM: Bridging The Linguistic Divide With A Culturally Aligned Language Model Lin Yen-ting, Chen Yun-nung
- Mitigating The Alignment Tax Of RLHF Lin Yong, Lin Hangyu, Xiong Wei, Diao Shizhe, Liu Jianmeng, Zhang Jipeng, Pan Rui, Wang Haoxiang, Hu Wenbin, Zhang Hanning, Dong Hanze, Pi Renjie, Zhao Han, Jiang Nan, Ji Heng, Yao Yuan, Zhang Tong
- Exposing Attention Glitches With Flip-flop Language Modeling Liu Bingbin, Ash Jordan T., Goel Surbhi, Krishnamurthy Akshay, Zhang Cyril
- Code Execution With Pre-trained Language Models Liu Chenxiao, Lu Shuai, Chen Weizhu, Jiang Daxin, Svyatkovskiy Alexey, Fu Shengyu, Sundaresan Neel, Duan Nan
- Recprompt: A Prompt Tuning Framework For News Recommendation Using Large Language Models Liu Dairui, Yang Boming, Du Honghui, Greene Derek, Lawlor Aonghus, Dong Ruihai, Li Irene
- MMC: Advancing Multimodal Chart Understanding With Large-scale Instruction Tuning Liu Fuxiao, Wang Xiaoyang, Yao Wenlin, Chen Jianshu, Song Kaiqiang, Cho Sangwoo, Yacoob Yaser, Yu Dong
- Glore: Evaluating Logical Reasoning Of Large Language Models Liu Hanmeng, Teng Zhiyang, Ning Ruoxi, Liu Jian, Zhou Qiji, Zhang Yue
- Ring Attention With Blockwise Transformers For Near-infinite Context Liu Hao, Zaharia Matei, Abbeel Pieter
- Grounding Complex Natural Language Commands For Temporal Tasks In Unseen Environments Liu Jason Xinyu, Yang Ziyi, Idrees Ifrah, Liang Sam, Schornstein Benjamin, Tellex Stefanie, Shah Ankit
- Retrieval-based Knowledge Transfer: An Effective Approach For Extreme Large Language Model Compression Liu Jiduan, Liu Jiahao, Wang Qifan, Wang Jingang, Cai Xunliang, Zhao Dongyan, Wang Ran Lucien, Yan Rui
- RETA-LLM: A Retrieval-augmented Large Language Model Toolkit Liu Jiongnan, Jin Jiajie, Wang Zihan, Cheng Jiehan, Dou Zhicheng, Wen Ji-rong
- Chatcounselor: A Large Language Models For Mental Health Support Liu June M., Li Donghao, Cao He, Ren Tianhe, Liao Zeyi, Wu Jiamin
- Contrastive Vision-language Alignment Makes Efficient Instruction Learner Liu Lizhao, Sun Xinyu, Xiang Tianhang, Zhuang Zhuangwei, Yin Liuren, Tan Mingkui
- Selenite: Scaffolding Online Sensemaking With Comprehensive Overviews Elicited From Large Language Models Liu Michael Xieyang, Wu Tongshuang, Chen Tianying, Li Franklin Mingzhe, Kittur Aniket, Myers Brad A.
- Chatgpt-powered Conversational Drug Editing Using Retrieval And Domain Feedback Liu Shengchao, Wang Jiongxiao, Yang Yijin, Wang Chengpeng, Liu Ling, Guo Hongyu, Xiao Chaowei
- Language Models As Black-box Optimizers For Vision-language Models Liu Shihong, Lin Zhiqiu, Yu Samuel, Lee Ryan, Ling Tiffany, Pathak Deepak, Ramanan Deva
- Goat: Fine-tuned Llama Outperforms GPT-4 On Arithmetic Tasks Liu Tiedong, Low Bryan Kian Hsiang
- MMHQA-ICL: Multimodal In-context Learning For Hybrid Question Answering Over Text, Tables And Images Liu Weihao, Lei Fangyu, Luo Tongxu, Lei Jiahe, He Shizhu, Zhao Jun, Liu Kang
- Litcab: Lightweight Language Model Calibration Over Short- And Long-form Responses Liu Xin, Khalifa Muhammad, Wang Lu
- Divide And Prompt: Chain Of Thought Prompting For Text-to-sql Liu Xiping, Tan Zhao
- Wavjourney: Compositional Audio Creation With Large Language Models Liu Xubo, Zhu Zhongkai, Liu Haohe, Yuan Yi, Cui Meng, Huang Qiushi, Liang Jinhua, Cao Yin, Kong Qiuqiang, Plumbley Mark D., Wang Wenwu
- What Large Language Models Bring To Text-rich VQA? Liu Xuejing, Tang Wei, Ni Xinzhe, Lu Jinghui, Zhao Rui, Li Zechao, Tan Fei
- Meta Semantic Template For Evaluation Of Large Language Models Liu Yachuan, Chen Liang, Wang Jindong, Mei Qiaozhu, Xie Xing
- Modeling Uncertainty And Using Post-fusion As Fallback Improves Retrieval Augmented Generation With Llms Liu Ye, Yavuz Semih, Meng Rui, Moorthy Meghana, Joty Shafiq, Xiong Caiming, Zhou Yingbo
- Llms As Narcissistic Evaluators: When Ego Inflates Evaluation Scores Liu Yiqi, Moosavi Nafise Sadat, Lin Chenghua
- On Learning To Summarize With Large Language Models As References Liu Yixin, Shi Kejian, He Katherine S, Ye Longtian, Fabbri Alexander R., Liu Pengfei, Radev Dragomir, Cohan Arman
- Cachegen: KV Cache Compression And Streaming For Fast Large Language Model Serving Liu Yuhan, Li Hanchen, Cheng Yihua, Ray Siddhant, Huang Yuyang, Zhang Qizheng, Du Kuntai, Yao Jiayi, Lu Shan, Ananthanarayanan Ganesh, Maire Michael, Hoffmann Henry, Holtzman Ari, Jiang Junchen
- Calibrating Llm-based Evaluator Liu Yuxuan, Yang Tianchi, Huang Shaohan, Zhang Zihan, Huang Haizhen, Wei Furu, Deng Weiwei, Sun Feng, Zhang Qi
- Molxpt: Wrapping Molecules With Text For Generative Pre-training Liu Zequn, Zhang Wei, Xia Yingce, Wu Lijun, Xie Shufang, Qin Tao, Zhang Ming, Liu Tie-yan
- Llama-reviewer: Advancing Code Review Automation With Large Language Models Through Parameter-efficient Fine-tuning Lu Junyi, Yu Lei, Li Xiaojia, Yang Li, Zuo Chun
- Routing To The Expert: Efficient Reward-guided Ensemble Of Large Language Models Lu Keming, Yuan Hongyi, Lin Runji, Lin Junyang, Yuan Zheng, Zhou Chang, Zhou Jingren
- Large Language Models Can Be Guided To Evade Ai-generated Text Detection Lu Ning, Liu Shengcai, He Rui, Wang Qi, Ong Yew-soon, Tang Ke
- Safer Conversational AI As A Source Of User Delight Lu Xiaoding, Korshuk Aleksey, Liu Zongyi, Beauchamp William, Chai Research
- Multimodal Procedural Planning Via Dual Text-image Prompting Lu Yujie, Lu Pan, Chen Zhiyu, Zhu Wanrong, Wang Xin Eric, Wang William Yang
- Llmscore: Unveiling The Power Of Large Language Models In Text-to-image Synthesis Evaluation Lu Yujie, Yang Xianjun, Li Xiujun, Wang Xin Eric, Wang William Yang
- Effectively Fine-tune To Improve Large Multimodal Models For Radiology Report Generation Lu Yuzhe, Hong Sungmin, Shah Yash, Xu Panpan
- Fighting Fire With Fire: The Dual Role Of Llms In Crafting And Detecting Elusive Disinformation Lucas Jason, Uchendu Adaku, Yamashita Michiharu, Lee Jooyoung, Rohatgi Shaurya, Lee Dongwon
- Critique Ability Of Large Language Models Luo Liangchen, Lin Zi, Liu Yinxiao, Shu Lei, Zhu Yun, Shang Jingbo, Meng Lei
- Taiyi: A Bilingual Fine-tuned Large Language Model For Diverse Biomedical Tasks Luo Ling, Ning Jinzhong, Zhao Yingwen, Wang Zhijun, Ding Zeyuan, Chen Peng, Fu Weiru, Han Qinyu, Xu Guangtao, Qiu Yunzhi, Pan Dinghao, Li Jiru, Li Hao, Feng Wenduo, Tu Senbo, Liu Yuqi, Yang Zhihao, Wang Jian, Sun Yuanyuan, Lin Hongfei
- Systematic Assessment Of Factual Knowledge In Large Language Models Luo Linhao, Vu Thuy-trang, Phung Dinh, Haffari Gholamreza
- Recranker: Instruction Tuning Large Language Model As Ranker For Top-k Recommendation Luo Sichun, He Bowei, Zhao Haohan, Shao Wei, Qi Yanlin, Huang Yinya, Zhou Aojun, Yao Yuxuan, Li Zongpeng, Xiao Yuanzhang, Zhan Mingjie, Song Linqi
- Fingpt: Large Generative Models For A Small Language Luukkonen Risto, Komulainen Ville, Luoma Jouni, Eskelinen Anni, Kanerva Jenna, Kupari Hanna-mari, Ginter Filip, Laippala Veronika, Muennighoff Niklas, Piktus Aleksandra, Wang Thomas, Tazi Nouamane, Scao Teven Le, Wolf Thomas, Suominen Osma, Sairanen Samuli, Merioksa Mikko, Heinonen Jyrki, Vahtola Aija, Antao Samuel, Pyysalo Sampo
- Full Parameter Fine-tuning For Large Language Models With Limited Resources Lv Kai, Yang Yuqing, Liu Tengxiao, Gao Qinghui, Guo Qipeng, Qiu Xipeng
- Translating Radiology Reports Into Plain Language Using Chatgpt And GPT-4 With Prompt Learning: Promising Results, Limitations, And Potential Lyu Qing, Tan Josh, Zapadka Michael E., Ponnatapura Janardhana, Niu Chuang, Myers Kyle J., Wang Ge, Whitlow Christopher T.
- Paperqa: Retrieval-augmented Generative Agent For Scientific Research Lála Jakub, O'donoghue Odhran, Shtedritski Aleksandar, Cox Sam, Rodriques Samuel G., White Andrew D.
- Pre-training With Large Language Model-based Document Expansion For Dense Passage Retrieval Ma Guangyuan, Wu Xing, Wang Peng, Lin Zijia, Hu Songlin
- An Examination Of The Compositionality Of Large Generative Vision-language Models Ma Teli, Li Rong, Liang Junwei
- Query Rewriting For Retrieval-augmented Large Language Models Ma Xinbei, Gong Yeyun, He Pengcheng, Zhao Hai, Duan Nan
- Large Language Model Is Not A Good Few-shot Information Extractor, But A Good Reranker For Hard Samples! Ma Yubo, Cao Yixin, Hong Yongching, Sun Aixin
- Sci-cot: Leveraging Large Language Models For Enhanced Knowledge Distillation In Small Models For Scientific QA Ma Yuhan, Jiang Haiqi, Fan Chenyou
- Lampilot: An Open Benchmark Dataset For Autonomous Driving With Language Model Programs Ma Yunsheng, Cui Can, Cao Xu, Ye Wenqian, Liu Peiran, Lu Juanwu, Abdelraouf Amr, Gupta Rohit, Han Kyungtae, Bera Aniket, Rehg James M., Wang Ziran
- Self-refine: Iterative Refinement With Self-feedback Madaan Aman, Tandon Niket, Gupta Prakhar, Hallinan Skyler, Gao Luyu, Wiegreffe Sarah, Alon Uri, Dziri Nouha, Prabhumoye Shrimai, Yang Yiming, Gupta Shashank, Majumder Bodhisattwa Prasad, Hermann Katherine, Welleck Sean, Yazdanbakhsh Amir, Clark Peter
- Harnessing The Power Of Prompt-based Techniques For Generating School-level Questions Using Large Language Models Maity Subhankar, Deroy Aniket, Sarkar Sudeshna
- Enhancing CLIP With GPT-4: Harnessing Visual Descriptions As Prompts Maniparambil Mayug, Vorster Chris, Molloy Derek, Murphy Noel, Mcguinness Kevin, O'connor Noel E.
- Large Language Models Know Your Contextual Search Intent: A Prompting Framework For Conversational Search Mao Kelong, Dou Zhicheng, Mo Fengran, Hou Jiewen, Chen Haonan, Qian Hongjin
- Unitrec: A Unified Text-to-text Transformer And Joint Contrastive Learning Framework For Text-based Recommendation Mao Zhiming, Wang Huimin, Du Yiming, Wong Kam-fai
- A Large Language Model-assisted Education Tool To Provide Feedback On Open-ended Responses Matelsky Jordan K., Parodi Felipe, Liu Tony, Lange Richard D., Kording Konrad P.
- Cook2ltl: Translating Cooking Recipes To LTL Formulae Using Large Language Models Mavrogiannis Angelos, Mavrogiannis Christoforos, Aloimonos Yiannis
- Improving Automatic VQA Evaluation Using Large Language Models Mañas Oscar, Krojer Benno, Agrawal Aishwarya
- Is Feedback All You Need? Leveraging Natural Language Feedback In Goal-conditioned Reinforcement Learning Mccallum Sabrina, Taylor-davies Max, Albrecht Stefano V., Suglia Alessandro
- JAB: Joint Adversarial Prompting And Belief Augmentation Mehrabi Ninareh, Goyal Palash, Ramakrishna Anil, Dhamala Jwala, Ghosh Shalini, Zemel Richard, Chang Kai-wei, Galstyan Aram, Gupta Rahul
- Enhancing LLM Intelligence With ARM-RAG: Auxiliary Rationale Memory For Retrieval Augmented Generation Melz Eric
- Towards Multilingual Automatic Dialogue Evaluation Mendonça John, Lavie Alon, Trancoso Isabel
- An AI Chatbot For Explaining Deep Reinforcement Learning Decisions Of Service-oriented Systems Metzger Andreas, Bartel Jone, Laufer Jan
- Developing A Scalable Benchmark For Assessing Large Language Models In Knowledge Graph Engineering Meyer Lars-peter, Frey Johannes, Junghanns Kurt, Brei Felix, Bulert Kirill, Gründer-fahrer Sabine, Martin Michael
- Specinfer: Accelerating Generative Large Language Model Serving With Tree-based Speculative Inference And Verification Miao Xupeng, Oliaro Gabriele, Zhang Zhihao, Cheng Xinhao, Wang Zeyu, Zhang Zhengxin, Wong Rae Ying Yee, Zhu Alan, Yang Lijie, Shi Xiaoxiang, Shi Chunan, Chen Zhuoming, Arfeen Daiyaan, Abhyankar Reyna, Jia Zhihao
- LAIT: Efficient Multi-segment Encoding In Transformers With Layer-adjustable Interaction Milbauer Jeremiah, Louis Annie, Hosseini Mohammad Javad, Fabrikant Alex, Metzler Donald, Schuster Tal
- ALMANACS: A Simulatability Benchmark For Language Model Explainability Mills Edmund, Su Shiye, Russell Stuart, Emmons Scott
- Mindstorms In Natural Language-based Societies Of Mind Mingchen Zhuge, Haozhe Liu, Francesco Faccio, Dylan R. Ashley, Róbert Csordás, Anand Gopalakrishnan, Abdullah Hamdi, Hasan Abed Al Kader Hammoud, Vincent Herrmann, Kazuki Irie, Louis Kirsch, Bing Li, Guohao Li, Shuming Liu, Jinjie Mai, Piotr Piękos, Aditya Ramesh, Imanol Schlag, Weimin Shi, Aleksandar Stanić, Wenyi Wang, Yuhui Wang, Mengmeng Xu, Deng-ping Fan, Bernard Ghanem, Jürgen Schmidhuber
- PRE: Vision-language Prompt Learning With Reparameterization Encoder Minh Anh Pham Thi, Nguyen An Duc, Tzimiropoulos Georgios
- TAP: Targeted Prompting For Task Adaptive Generation Of Textual Training Instances For Visual Classification Mirza M. Jehanzeb, Karlinsky Leonid, Lin Wei, Possegger Horst, Feris Rogerio, Bischof Horst
- Relu Strikes Back: Exploiting Activation Sparsity In Large Language Models Mirzadeh Iman, Alizadeh Keivan, Mehta Sachin, Del Mundo Carlo C, Tuzel Oncel, Samei Golnoosh, Rastegari Mohammad, Farajtabar Mehrdad
- Prompting With Pseudo-code Instructions Mishra Mayank, Kumar Prince, Bhat Riyaz, Murthy Rudra V, Contractor Danish, Tamilselvam Srikanth
- Tree Of Uncertain Thoughts Reasoning For Large Language Models Mo Shentong, Xin Miao
- Narrativexl: A Large-scale Dataset For Long-term Memory Models Moskvichev Arseny, Mai Ky-vinh
- Learning To Compress Prompts With Gist Tokens Mu Jesse, Li Xiang Lisa, Goodman Noah
- Octopack: Instruction Tuning Code Large Language Models Muennighoff Niklas, Liu Qian, Zebaze Armel, Zheng Qinkai, Hui Binyuan, Zhuo Terry Yue, Singh Swayam, Tang Xiangru, Von Werra Leandro, Longpre Shayne
- Assessing Translation Capabilities Of Large Language Models Involving English And Indian Languages Mujadia Vandan, Urlana Ashok, Bhaskar Yash, Pavani Penumalla Aditya, Shravya Kukkapalli, Krishnamurthy Parameswari, Sharma Dipti Misra
- "medium" Lms Of Code In The Era Of Llms: Lessons From Stackoverflow Mukherjee Manisha, Hellendoorn Vincent J.
- Superhf: Supervised Iterative Learning From Human Feedback Mukobi Gabriel, Chatain Peter, Fong Su, Windesheim Robert, Kutyniok Gitta, Bhatia Kush, Alberti Silas
- A Brief History Of Prompt: Leveraging Language Models. (through Advanced Prompting) Muktadir Golam Md
- Evaluating The Effectiveness Of Retrieval-augmented Large Language Models In Scientific Document Reasoning Munikoti Sai, Acharya Anurag, Wagle Sridevi, Horawalavithana Sameera
- LLM4VV: Developing Llm-driven Testsuite For Compiler Validation Munley Christian, Jarmusch Aaron, Chandrasekaran Sunita
- Salespeople Vs Salesbot: Exploring The Role Of Educational Value In Conversational Recommender Systems Murakhovs'ka Lidiya, Laban Philippe, Xie Tian, Xiong Caiming, Wu Chien-sheng
- PEARL: Personalizing Large Language Model Writing Assistants With Generation-calibrated Retrievers Mysore Sheshera, Lu Zhuoran, Wan Mengting, Yang Longqi, Menezes Steve, Baghaee Tina, Gonzalez Emmanuel Barajas, Neville Jennifer, Safavi Tara
- Large Language Models Are Fixated By Red Herrings: Exploring Creative Problem Solving And Einstellung Effect Using The Only Connect Wall Dataset Naeini Saeid, Saqur Raeid, Saeidi Mozhgan, Giorgi John, Taati Babak
- Diversity Of Thought Improves Reasoning Abilities Of Llms Naik Ranjita, Chandrasekaran Varun, Yuksekgonul Mert, Palangi Hamid, Nushi Besmira
- Enhancing Few-shot Text-to-sql Capabilities Of Large Language Models: A Study On Prompt Design Strategies Nan Linyong, Zhao Yilun, Zou Weijin, Ri Narutatsu, Tae Jaesung, Zhang Ellen, Cohan Arman, Radev Dragomir
- Large Language Models As Tax Attorneys: A Case Study In Legal Capabilities Emergence Nay John J., Karamardian David, Lawsky Sarah B., Tao Wenting, Bhat Meghana, Jain Raghav, Lee Aaron Travis, Choi Jonathan H., Kasai Jungo
- Simplyretrieve: A Private And Lightweight Retrieval-centric Generative AI Tool Ng Youyang, Miyashita Daisuke, Hoshi Yasuto, Morioka Yasuhiro, Torii Osamu, Kodama Tomoya, Deguchi Jun
- Multi-dimensional Data Refining Strategy For Effective Fine-tuning Llms Ngoc Thanh Nguyen, Tran Quang Nhat, Tang Arthur, Nguyen Bao, Nguyen Thuy, Pham Thanh
- Cof-cot: Enhancing Large Language Models With Coarse-to-fine Chain-of-thought Prompting For Multi-domain NLU Tasks Nguyen Hoang H., Liu Ye, Zhang Chenwei, Zhang Tao, Yu Philip S.
- Large Language Models For In-context Student Modeling: Synthesizing Student's Behavior In Visual Programming Nguyen Manh Hung, Tschiatschek Sebastian, Singla Adish
- Democratizing Llms For Low-resource Languages By Leveraging Their English Dominant Abilities With Linguistically-diverse Prompts Nguyen Xuan-phi, Aljunied Sharifah Mahani, Joty Shafiq, Bing Lidong
- L2ceval: Evaluating Language-to-code Generation Capabilities Of Large Language Models Ni Ansong, Yin Pengcheng, Zhao Yilun, Riddell Martin, Feng Troy, Shen Rui, Yin Stephen, Liu Ye, Yavuz Semih, Xiong Caiming, Joty Shafiq, Zhou Yingbo, Radev Dragomir, Cohan Arman
- Ragtruth: A Hallucination Corpus For Developing Trustworthy Retrieval-augmented Language Models Niu Cheng, Wu Yuanhao, Zhu Juno, Xu Siliang, Shum Kashun, Zhong Randy, Song Juntong, Zhang Tong
- Chatbots As Problem Solvers: Playing Twenty Questions With Role Reversals Noever David, Mckee Forrest
- Are Large Language Models Fit For Guided Reading? Ochieng Peter
- Data Augmentation For Neural Machine Translation Using Generative Language Model Oh Seokjin, Lee Su Ah, Jung Woohwan
- Large Language Model (LLM) Bias Index -- LLMBI Oketunji Abiodun Finbarrs, Anas Muhammad, Saina Deepthi
- LINC: A Neurosymbolic Approach For Logical Reasoning By Combining Language Models With First-order Logic Provers Olausson Theo X., Gu Alex, Lipkin Benjamin, Zhang Cedegao E., Solar-lezama Armando, Tenenbaum Joshua B., Levy Roger
- Text Style Transfer Evaluation Using Large Language Models Ostheimer Phil, Nagda Mayank, Kloft Marius, Fellenz Sophie
- Dialogbench: Evaluating Llms As Human-like Dialogue Systems Ou Jiao, Lu Junda, Liu Che, Tang Yihong, Zhang Fuzheng, Zhang Di, Gai Kun
- Fine-tuning Or Retrieval? Comparing Knowledge Injection In Llms Ovadia Oded, Brief Menachem, Mishaeli Moshik, Elisha Oren
- Tokenization Matters: Navigating Data-scarce Tokenization For Gender Inclusive Language Technologies Ovalle Anaelia, Mehrabi Ninareh, Goyal Palash, Dhamala Jwala, Chang Kai-wei, Zemel Richard, Galstyan Aram, Pinter Yuval, Gupta Rahul
- Towards Vision-language Mechanistic Interpretability: A Causal Tracing Tool For BLIP Palit Vedant, Pandey Rohan, Arora Aryaman, Liang Paul Pu
- What In-context Learning "learns" In-context: Disentangling Task Recognition And Task Learning Pan Jane, Gao Tianyu, Chen Howard, Chen Danqi
- Stelocoder: A Decoder-only LLM For Multi-language To Python Code Translation Pan Jialing, Sadé Adrien, Kim Jin, Soriano Eric, Sole Guillem, Flamant Sylvain
- COSMIC: Data Efficient Instruction-tuning For Speech In-context Learning Pan Jing, Wu Jian, Gaur Yashesh, Sivasankaran Sunit, Chen Zhuo, Liu Shujie, Li Jinyu
- Self-supervised Meta-prompt Learning With Meta-gradient Regularization For Few-shot Generalization Pan Kaihang, Li Juncheng, Song Hongye, Lin Jun, Liu Xiaozhong, Tang Siliang
- Logic-lm: Empowering Large Language Models With Symbolic Solvers For Faithful Logical Reasoning Pan Liangming, Albalak Alon, Wang Xinyi, Wang William Yang
- Automatically Correcting Large Language Models: Surveying The Landscape Of Diverse Self-correction Strategies Pan Liangming, Saxon Michael, Xu Wenda, Nathani Deepak, Wang Xinyi, Wang William Yang
- Kosmos-g: Generating Images In Context With Multimodal Large Language Models Pan Xichen, Dong Li, Huang Shaohan, Peng Zhiliang, Chen Wenhu, Wei Furu
- X-instructblip: A Framework For Aligning X-modal Instruction-aware Representations To Llms And Emergent Cross-modal Reasoning Panagopoulou Artemis, Xue Le, Yu Ning, Li Junnan, Li Dongxu, Joty Shafiq, Xu Ran, Savarese Silvio, Xiong Caiming, Niebles Juan Carlos
- Peft-medaware: Large Language Model For Medical Awareness Pandya Keivalya
- Automating Customer Service Using Langchain: Building Custom Open-source GPT Chatbot For Organizations Pandya Keivalya, Holia Mehfuza
- Frozen Transformers In Language Models Are Effective Visual Encoder Layers Pang Ziqi, Xie Ziyang, Man Yunze, Wang Yu-xiong
- Steering Llama 2 Via Contrastive Activation Addition Panickssery Nina, Gabrieli Nick, Schulz Julian, Tong Meg, Hubinger Evan, Turner Alexander Matt
- Trainable Transformer In Transformer Panigrahi Abhishek, Malladi Sadhika, Xia Mengzhou, Arora Sanjeev
- Injecting Structural Hints: Using Language Models To Study Inductive Biases In Language Learning Papadimitriou Isabel, Jurafsky Dan
- Revisiting Prompt Engineering Via Declarative Crowdsourcing Parameswaran Aditya G., Shankar Shreya, Asawa Parth, Jain Naman, Wang Yujie
- Interpretable Visual Question Answering Via Reasoning Supervision Parelli Maria, Mallis Dimitrios, Diomataris Markos, Pitsikalis Vassilis
- Graph Elicitation For Guiding Multi-step Reasoning In Large Language Models Park Jinyoung, Patel Ameen, Khan Omar Zia, Kim Hyunwoo J., Kim Joo-kyung
- 1.5 Million Materials Narratives Generated By Chatbots Park Yang Jeong, Jerng Sung Eun, Park Jin-sung, Kwon Choah, Hsu Chia-wei, Ren Zhichu, Yoon Sungroh, Li Ju
- ECLIPSE: A Resource-efficient Text-to-image Prior For Image Generations Patel Maitreya, Kim Changhoon, Cheng Sheng, Baral Chitta, Yang Yezhou
- Sequencing Matters: A Generate-retrieve-generate Model For Building Conversational Agents Patwardhan Quinn, Yang Grace Hui
- Instructexcel: A Benchmark For Natural Language Instruction In Excel Payan Justin, Mishra Swaroop, Singh Mukul, Negreanu Carina, Poelitz Christian, Baral Chitta, Roy Subhro, Chakravarthy Rasika, Van Durme Benjamin, Nouri Elnaz
- GPT Self-supervision For A Better Data Annotator Pei Xiaohuan, Li Yanxi, Xu Chang
- Exploiting Novel GPT-4 Apis Pelrine Kellin, Taufeeque Mohammad, Zając Michał, Mclean Euan, Gleave Adam
- Customising General Large Language Models For Specialised Emotion Recognition Tasks Peng Liyizhe, Zhang Zixing, Pang Tao, Han Jing, Zhao Huan, Chen Hao, Schuller Björn W.
- Soft Prompt Tuning For Augmenting Dense Retrieval With Large Language Models Peng Zhiyuan, Wu Xuyang, Wang Qifan, Fang Yi
- LARG, Language-based Automatic Reward And Goal Generation Perez Julien, Proux Denys, Roux Claude, Niemaz Michael
- Large Language Models Can Infer Psychological Dispositions Of Social Media Users Peters Heinrich, Matz Sandra
- Promptinfuser: How Tightly Coupling AI And UI Design Impacts Designers' Workflows Petridis Savvas, Terry Michael, Cai Carrie J.
- Attention Sorting Combats Recency Bias In Long Context Language Models Peysakhovich Alexander, Lerer Adam
- Training Chain-of-thought Via Latent-variable Inference Phan Du, Hoffman Matthew D., Dohan David, Douglas Sholto, Le Tuan Anh, Parisi Aaron, Sountsov Pavel, Sutton Charles, Vikram Sharad, Saurous Rif A.
- Automating Human Tutor-style Programming Feedback: Leveraging GPT-4 Tutor Model For Hint Generation And GPT-3.5 Student Model For Hint Validation Phung Tung, Pădurean Victor-alexandru, Singh Anjali, Brooks Christopher, Cambronero José, Gulwani Sumit, Singla Adish, Soares Gustavo
- Jatmo: Prompt Injection Defense By Task-specific Finetuning Piet Julien, Alrashed Maha, Sitawarin Chawin, Chen Sizhe, Wei Zeming, Sun Elizabeth, Alomair Basel, Wagner David
- Ai-assisted Coding: Experiments With GPT-4 Poldrack Russell A, Lu Thomas, Beguš Gašper
- Mosaicbert: A Bidirectional Encoder Optimized For Fast Pretraining Portes Jacob, Trott Alex, Havens Sam, King Daniel, Venigalla Abhinav, Nadeem Moin, Sardana Nikhil, Khudia Daya, Frankle Jonathan
- Selfcheckgpt: Zero-resource Black-box Hallucination Detection For Generative Large Language Models Potsawee Manakul, Adian Liusie, Mark J. F. Gales
- Transformers In The Service Of Description Logic-based Contexts Poulis Angelos, Tsalapati Eleni, Koubarakis Manolis
- Painter: Teaching Auto-regressive Language Models To Draw Sketches Pourreza Reza, Bhattacharyya Apratim, Panchal Sunny, Lee Mingu, Madan Pulkit, Memisevic Roland
- The Robots Are Here: Navigating The Generative AI Revolution In Computing Education Prather James, Denny Paul, Leinonen Juho, Becker Brett A., Albluwi Ibrahim, Craig Michelle, Keuning Hieke, Kiesler Natalie, Kohn Tobias, Luxton-reilly Andrew, Macneil Stephen, Peterson Andrew, Pettit Raymond, Reeves Brent N., Savelka Jaromir
- Decomposed Prompting For Machine Translation Between Related Languages Using Large Language Models Puduppully Ratish, Kunchukuttan Anoop, Dabre Raj, Aw Ai Ti, Chen Nancy F.
- Vidcom: Fast Video Comprehension Through Large Language Models With Multimodal Tools Qi Ji, Ji Kaixuan, Yu Jifan, Wang Duokang, Xu Bin, Hou Lei, Li Juanzi
- Fine-tuning Aligned Language Models Compromises Safety, Even When Users Do Not Intend To! Qi Xiangyu, Zeng Yi, Xie Tinghao, Chen Pin-yu, Jia Ruoxi, Mittal Prateek, Henderson Peter
- PILLOW: Enhancing Efficient Instruction Fine-tuning Via Prompt Matching Qi Zhenting, Tan Xiaoyu, Shi Shaojie, Qu Chao, Xu Yinghui, Qi Yuan
- Toolink: Linking Toolkit Creation And Using Through Chain-of-solving On Open-source Model Qian Cheng, Xiong Chenyan, Liu Zhenghao, Liu Zhiyuan
- Hijacking Large Language Models Via Adversarial In-context Learning Qiang Yao, Zhou Xiangyu, Zhu Dongxiao
- Taskweaver: A Code-first Agent Framework Qiao Bo, Li Liqun, Zhang Xu, He Shilin, Kang Yu, Zhang Chaoyun, Yang Fangkai, Dong Hang, Zhang Jue, Wang Lu, Ma Minghua, Zhao Pu, Qin Si, Qin Xiaoting, Du Chao, Xu Yong, Lin Qingwei, Rajmohan Saravan, Zhang Dongmei
- ONCE: Boosting Content-based Recommendation With Both Open- And Closed-source Large Language Models Qijiong Liu, Nuo Chen, Tetsuya Sakai, Xiao-ming Wu
- Improving In-context Learning Via Bidirectional Alignment Qin Chengwei, Xia Wenhan, Jiao Fangkai, Chen Chen, Hu Yuchen, Ding Bosheng, Joty Shafiq
- In-context Learning With Iterative Demonstration Selection Qin Chengwei, Zhang Aston, Chen Chen, Dagar Anirudh, Ye Wenming
- Enabling On-device Large Language Model Personalization With Self-supervised Data Selection And Synthesis Qin Ruiyang, Xia Jun, Jia Zhenge, Jiang Meng, Abbasi Ahmed, Zhou Peipei, Hu Jingtong, Shi Yiyu
- Large Language Models Are Effective Text Rankers With Pairwise Ranking Prompting Qin Zhen, Jagerman Rolf, Hui Kai, Zhuang Honglei, Wu Junru, Yan Le, Shen Jiaming, Liu Tianqi, Liu Jialu, Metzler Donald, Wang Xuanhui, Bendersky Michael
- Transnormerllm: A Faster And Better Large Language Model With Improved Transnormer Qin Zhen, Li Dong, Sun Weigao, Sun Weixuan, Shen Xuyang, Han Xiaodong, Wei Yunshen, Lv Baohong, Luo Xiao, Qiao Yu, Zhong Yiran
- Mplug-owl: Modularization Empowers Large Language Models With Multimodality Qinghao Ye, Haiyang Xu, Guohai Xu, Jiabo Ye, Ming Yan, Yiyang Zhou, Junyang Wang, Anwen Hu, Pengcheng Shi, Yaya Shi, Chenliang Li, Yuanhong Xu, Hehong Chen, Junfeng Tian, Qi Qian, Ji Zhang, Fei Huang, Jingren Zhou
- Predicting Question-answering Performance Of Large Language Models Through Semantic Consistency Rabinovich Ella, Ackerman Samuel, Raz Orna, Farchi Eitan, Anaby-tavor Ateret
- Zero-shot Recommendations With Pre-trained Large Language Models For Multimodal Nudging Rachel M. Harrison, Anton Dereventsov, Anton Bibin
- K-PERM: Personalized Response Generation Using Dynamic Knowledge Retrieval And Persona-adaptive Queries Raj Kanak, Roy Kaushik, Bonagiri Vamshi, Govil Priyanshul, Thirunarayanan Krishnaprasad, Gaur Manas
- Bridging The Gap: Exploring The Capabilities Of Bridge-architectures For Complex Visual Reasoning Tasks Rajesh Kousik, Raman Mrigank, Karim Mohammed Asad, Chawla Pranit
- Does The "most Sinfully Decadent Cake Ever" Taste Good? Answering Yes/no Questions From Figurative Contexts Rakshit Geetanjali, Flanigan Jeffrey
- Retrieval-augmented Image Captioning Ramos Rita, Elliott Desmond, Martins Bruno
- Can Chatgpt Assess Human Personalities? A General Evaluation Framework Rao Haocong, Leung Cyril, Miao Chunyan
- Retrieval-based Knowledge Augmented Vision Language Pre-training Rao Jiahua, Shan Zifei, Liu Longpo, Zhou Yao, Yang Yuedong
- Supporting Human-ai Collaboration In Auditing Llms With Llms Rastogi Charvi, Ribeiro Marco Tulio, King Nicholas, Nori Harsha, Amershi Saleema
- Dissecting In-context Learning Of Translations In Gpts Raunak Vikas, Awadalla Hany Hassan, Menezes Arul
- Leveraging GPT-4 For Automatic Translation Post-editing Raunak Vikas, Sharaf Amr, Wang Yiren, Awadallah Hany Hassan, Menezes Arul
- The Troubling Emergence Of Hallucination In Large Language Models -- An Extensive Definition, Quantification, And Prescriptive Remediations Rawte Vipula, Chakraborty Swagata, Pathak Agnibh, Sarkar Anubhav, Tonmoy S. M Towhidul Islam, Chadha Aman, Sheth Amit P., Das Amitava
- Progressive Prompts: Continual Learning For Language Models Razdaibiedina Anastasia, Mao Yuning, Hou Rui, Khabsa Madian, Lewis Mike, Almahairi Amjad
- Self-evaluation Improves Selective Generation In Large Language Models Ren Jie, Zhao Yao, Vu Tu, Liu Peter J., Lakshminarayanan Balaji
- Prompt, Generate, Then Cache: Cascade Of Foundation Models Makes Strong Few-shot Learners Renrui Zhang, Xiangfei Hu, Bohao Li, Siyuan Huang, Hanqiu Deng, Hongsheng Li, Yu Qiao, Peng Gao
- Integrating Summarization And Retrieval For Enhanced Personalization Via Large Language Models Richardson Chris, Zhang Yao, Gillespie Kellen, Kar Sudipta, Singh Arshdeep, Raeesy Zeynab, Khan Omar Zia, Sethy Abhinav
- CARTIER: Cartographic Language Reasoning Targeted At Instruction Execution For Robots Rivkin Dmitriy, Kakodkar Nikhil, Hogan Francois, Baghi Bobak H., Dudek Gregory
- Benchmarking Causal Study To Interpret Large Language Models For Source Code Rodriguez-cardenas Daniel, Palacio David N., Khati Dipin, Burke Henry, Poshyvanyk Denys
- Carexpert: Leveraging Large Language Models For In-car Conversational Question Answering Rony Md Rashad Al Hasan, Suess Christian, Bhat Sinchana Ramakanth, Sudhi Viju, Schneider Julia, Vogel Maximilian, Teucher Roman, Friedl Ken E., Sahoo Soumya
- Unveiling The Potential Of Large Language Models In Generating Semantic And Cross-language Clones Roy Palash R., Alam Ajmain I., Al-omari Farouq, Roy Banani, Roy Chanchal K., Schneider Kevin A.
- From Chatbots To Phishbots? -- Preventing Phishing Scams Created Using Chatgpt, Google Bard And Claude Roy Sayak Saha, Thota Poojitha, Naragam Krishna Vamsi, Nilizadeh Shirin
- TPTU: Large Language Model-based AI Agents For Task Planning And Tool Usage Ruan Jingqing, Chen Yihong, Zhang Bin, Xu Zhiwei, Bao Tianpeng, Du Guoqing, Shi Shiwei, Mao Hangyu, Li Ziyue, Zeng Xingyu, Zhao Rui
- Retrieval-pretrained Transformer: Long-range Language Modeling With Self-retrieval Rubin Ohad, Berant Jonathan
- Are Emergent Abilities Of Large Language Models A Mirage? Rylan Schaeffer, Brando Miranda, Sanmi Koyejo
- Delucionqa: Detecting Hallucinations In Domain-specific Question Answering Sadat Mobashir, Zhou Zhengyu, Lange Lukas, Araki Jun, Gundroo Arsalan, Wang Bingqing, Menon Rakesh R, Parvez Md Rizwan, Feng Zhe
- Gollie: Annotation Guidelines Improve Zero-shot Information-extraction Sainz Oscar, García-ferrero Iker, Agerri Rodrigo, De Lacalle Oier Lopez, Rigau German, Agirre Eneko
- Extending The Frontier Of Chatgpt: Code Generation And Debugging Sakib Fardin Ahsan, Khan Saadat Hasan, Karim A. H. M. Rezaul
- Uni-nlx: Unifying Textual Explanations For Vision And Vision-language Tasks Sammani Fawaz, Deligiannis Nikos
- Large Language Models Are Competitive Near Cold-start Recommenders For Language- And Item-based Preferences Sanner Scott, Balog Krisztian, Radlinski Filip, Wedin Ben, Dixon Lucas
- Early Weight Averaging Meets High Learning Rates For LLM Pre-training Sanyal Sunny, Neerkaje Atula, Kaddour Jean, Kumar Abhishek, Sanghavi Sujay
- Open-ended Instructable Embodied Agents With Memory-augmented Large Language Models Sarch Gabriel, Wu Yue, Tarr Michael J., Fragkiadaki Katerina
- RAMP: Retrieval And Attribute-marking Enhanced Prompting For Attribute-controlled Translation Sarti Gabriele, Htut Phu Mon, Niu Xing, Hsu Benjamin, Currey Anna, Dinu Georgiana, Nadejde Maria
- GIFT: Generative Interpretable Fine-tuning Savadikar Chinmay, Song Xi, Wu Tianfu
- Unlocking Practical Applications In Legal Domain: Evaluation Of GPT For Zero-shot Semantic Annotation Of Legal Texts Savelka Jaromir
- Large Language Models (GPT) Struggle To Answer Multiple-choice Questions About Code Savelka Jaromir, Agarwal Arav, Bogart Christopher, Sakr Majd
- Can GPT-4 Support Analysis Of Textual Data In Tasks Requiring Highly Specialized Domain Expertise? Savelka Jaromir, Ashley Kevin D., Gray Morgan A, Westermann Hannes, Xu Huihui
- Ruffle&riley: Towards The Automated Induction Of Conversational Tutoring Systems Schmucker Robin, Xia Meng, Azaria Amos, Mitchell Tom
- SEMQA: Semi-extractive Multi-source Question Answering Schuster Tal, Lelkes Adam D., Sun Haitian, Gupta Jai, Berant Jonathan, Cohen William W., Metzler Donald
- An Empirical Evaluation Of Using Large Language Models For Automated Unit Test Generation Schäfer Max, Nadi Sarah, Eghbali Aryaz, Tip Frank
- Surgicalgpt: End-to-end Language-vision GPT For Visual Question Answering In Surgery Seenivasan Lalithkumar, Islam Mobarakol, Kannan Gokul, Ren Hongliang
- The Cot Collection: Improving Zero-shot And Few-shot Learning Of Language Models Via Chain-of-thought Fine-tuning Seungone Kim, Se June Joo, Doyoung Kim, Joel Jang, Seonghyeon Ye, Jamin Shin, Minjoon Seo
- PB-LLM: Partially Binarized Large Language Models Shang Yuzhang, Yuan Zhihang, Wu Qiang, Dong Zhen
- Tinylvlm-ehub: Towards Comprehensive And Efficient Evaluation For Large Vision-language Models Shao Wenqi, Lei Meng, Hu Yutao, Gao Peng, Zhang Kaipeng, Meng Fanqing, Xu Peng, Huang Siyuan, Li Hongsheng, Qiao Yu, Luo Ping
- Enhancing Retrieval-augmented Large Language Models With Iterative Retrieval-generation Synergy Shao Zhihong, Gong Yeyun, Shen Yelong, Huang Minlie, Duan Nan, Chen Weizhu
- Democratizing Llms: An Exploration Of Cost-performance Trade-offs In Self-refined Open-source Models Shashidhar Sumuk, Chinta Abhinav, Sahai Vaibhav, Wang Zhenhailong, Ji Heng
- Jailbreak In Pieces: Compositional Adversarial Attacks On Multi-modal Language Models Shayegani Erfan, Dong Yue, Abu-ghazaleh Nael
- Exploring The Factual Consistency In Dialogue Comprehension Of Large Language Models She Shuaijie, Huang Shujian, Wang Xingyun, Zhou Yanke, Chen Jiajun
- Is Translation Helpful? An Empirical Analysis Of Cross-lingual Transfer In Low-resource Dialog Generation Shen Lei, Yu Shuai, Shen Xiaoyu
- The Trickle-down Impact Of Reward (in-)consistency On RLHF Shen Lingfeng, Chen Sihao, Song Linfeng, Jin Lifeng, Peng Baolin, Mi Haitao, Khashabi Daniel, Yu Dong
- Retrieval-generation Alignment For End-to-end Task-oriented Dialogue System Shen Weizhou, Gao Yingqi, Huang Canbin, Wan Fanqi, Quan Xiaojun, Bi Wei
- Storygpt-v: Large Language Models As Consistent Story Visualizers Shen Xiaoqian, Elhoseiny Mohamed
- "do Anything Now": Characterizing And Evaluating In-the-wild Jailbreak Prompts On Large Language Models Shen Xinyue, Chen Zeyuan, Backes Michael, Shen Yun, Zhang Yang
- Can Language Agents Be Alternatives To PPO? A Preliminary Empirical Study On Openai Gym Sheng Junjie, Huang Zixiao, Shen Chuyun, Li Wenhao, Hua Yun, Jin Bo, Zha Hongyuan, Wang Xiangfeng
- Specialist Or Generalist? Instruction Tuning For Specific NLP Tasks Shi Chufan, Su Yixuan, Yang Cheng, Yang Yujiu, Cai Deng
- Crossget: Cross-guided Ensemble Of Tokens For Accelerating Vision-language Transformers Shi Dachuan, Tao Chaofan, Rao Anyi, Yang Zhendong, Yuan Chun, Wang Jiaqi
- MKRAG: Medical Knowledge Retrieval Augmented Generation For Medical Question Answering Shi Yucheng, Xu Shaochen, Yang Tianze, Liu Zhengliang, Liu Tianming, Li Quanzheng, Li Xiang, Liu Ninghao
- RADE: Reference-assisted Dialogue Evaluation For Open-domain Dialogue Shi Zhengliang, Sun Weiwei, Zhang Shuo, Zhang Zhen, Ren Pengjie, Ren Zhaochun
- Llamas Know What Gpts Don't Show: Surrogate Models For Confidence Estimation Shrivastava Vaishnavi, Liang Percy, Kumar Ananya
- Fusion-eval: Integrating Assistant Evaluators With Llms Shu Lei, Wichers Nevan, Luo Liangchen, Zhu Yun, Liu Yinxiao, Chen Jindong, Meng Lei
- Getting More Out Of Mixture Of Language Model Reasoning Experts Si Chenglei, Shi Weijia, Zhao Chen, Zettlemoyer Luke, Boyd-graber Jordan
- Mondrian: Prompt Abstraction Attack Against Large Language Models For Cheaper API Pricing Si Wai Man, Backes Michael, Zhang Yang
- GPT-4 As An Agronomist Assistant? Answering Agriculture Exams Using Large Language Models Silva Bruno, Nunes Leonardo, Estevão Roberto, Aski Vijay, Chandra Ranveer
- Exploiting Large Language Models (LLMs) Through Deception Techniques And Persuasion Principles Singh Sonali, Abri Faranak, Namin Akbar Siami
- Gopro: Generate And Optimize Prompts In CLIP Using Self-supervised Learning Singha Mainak, Jha Ankit, Banerjee Biplab
- Towards Expert-level Medical Question Answering With Large Language Models Singhal Karan, Tu Tao, Gottweis Juraj, Sayres Rory, Wulczyn Ellery, Hou Le, Clark Kevin, Pfohl Stephen, Cole-lewis Heather, Neal Darlene, Schaekermann Mike, Wang Amy, Amin Mohamed, Lachgar Sami, Mansfield Philip, Prakash Sushant, Green Bradley, Dominowska Ewa, Arcas Blaise Aguera Y, Tomasev Nenad, Liu Yun, Wong Renee, Semturs Christopher, Mahdavi S. Sara, Barral Joelle, Webster Dale, Corrado Greg S., Matias Yossi, Azizi Shekoofeh, Karthikesalingam Alan, Natarajan Vivek
- TABLET: Learning From Instructions For Tabular Data Slack Dylan, Singh Sameer
- Decoding Chatgpt: A Taxonomy Of Existing Research, Current Challenges, And Possible Future Directions Sohail Shahab Saquib, Farhat Faiza, Himeur Yassine, Nadeem Mohammad, Madsen Dag Øivind, Singh Yashbir, Atalla Shadi, Mansoor Wathiq
- SPARSEFIT: Few-shot Prompting With Sparse Fine-tuning For Jointly Generating Predictions And Natural Language Explanations Solano Jesus, Sanni Mardhiyah, Camburu Oana-maria, Minervini Pasquale
- Biomedical Knowledge Graph-optimized Prompt Generation For Large Language Models Soman Karthik, Rose Peter W, Morris John H, Akbas Rabia E, Smith Brett, Peetoom Braian, Villouta-reyes Catalina, Cerono Gabriel, Shi Yongmei, Rizk-jackson Angela, Israni Sharat, Nelson Charlotte A, Huang Sui, Baranzini Sergio E
- Powerinfer: Fast Large Language Model Serving With A Consumer-grade GPU Song Yixin, Mi Zeyu, Xie Haotong, Chen Haibo
- Curriculum-guided Abstractive Summarization Sotudeh Sajad, Deilamsalehy Hanieh, Dernoncourt Franck, Goharian Nazli
- Harnessing Llms In Curricular Design: Using GPT-4 To Support Authoring Of Learning Objectives Sridhar Pragnya, Doyle Aidan, Agarwal Arav, Bogart Christopher, Savelka Jaromir, Sakr Majd
- Core-cog: Conversational Recommendation Of Entities Using Constrained Generation Srivastava Harshvardhan, Pruthi Kanav, Chakrabarti Soumen, Mausam
- Instances Need More Care: Rewriting Prompts For Instances With Llms In The Loop Yields Better Zero-shot Performance Srivastava Saurabh, Huang Chengyue, Fan Weiguo, Yao Ziyu
- Seeing Seeds Beyond Weeds: Green Teaming Generative AI For Beneficial Uses Stapleton Logan, Taylor Jordan, Fox Sarah, Wu Tongshuang, Zhu Haiyi
- Choice Fusion As Knowledge For Zero-shot Dialogue State Tracking Su Ruolin, Yang Jingfeng, Wu Ting-wei, Juang Biing-hwang
- Language Model-in-the-loop: Data Optimal Approach To Learn-to-recommend Actions In Text Games Sudhakar Arjun Vaithilingam, Parthasarathi Prasanna, Rajendran Janarthanan, Chandar Sarath
- RCMHA: Relative Convolutional Multi-head Attention For Natural Language Modelling Sugiharto Herman, Aradea, Mubarok Husni
- Datatales: Investigating The Use Of Large Language Models For Authoring Data-driven Articles Sultanum Nicole, Srinivasan Arjun
- Does Fine-tuning GPT-3 With The Openai API Leak Personally-identifiable Information? Sun Albert Yu, Zemour Eliott, Saxena Arushi, Vaidyanathan Udith, Lin Eric, Lau Christian, Mugunthan Vaikkunth
- Building High-accuracy Multilingual ASR With Gated Language Experts And Curriculum Training Sun Eric, Li Jinyu, Hu Yuxuan, Zhu Yimeng, Zhou Long, Xue Jian, Wang Peidong, Liu Linquan, Liu Shujie, Lin Edward, Gong Yifan
- Character-level Chinese Backpack Language Models Sun Hao, Hewitt John
- Adaplanner: Adaptive Planning From Feedback With Language Models Sun Haotian, Zhuang Yuchen, Kong Lingkai, Dai Bo, Zhang Chao
- Think-on-graph: Deep And Responsible Reasoning Of Large Language Model On Knowledge Graph Sun Jiashuo, Xu Chengjin, Tang Lumingyuan, Wang Saizhuo, Lin Chen, Gong Yeyun, Ni Lionel M., Shum Heung-yeung, Guo Jian
- Evaluating The Zero-shot Robustness Of Instruction-tuned Language Models Sun Jiuding, Shaib Chantal, Wallace Byron C.
- PEARL: Prompting Large Language Models To Plan And Execute Actions Over Long Documents Sun Simeng, Liu Yang, Wang Shuohang, Zhu Chenguang, Iyyer Mohit
- A Comparative Study Between Full-parameter And Lora-based Fine-tuning On Chinese Instruction Data For Instruction Following Large Language Model Sun Xianghui, Ji Yunjie, Ma Baochang, Li Xiangang
- Location-aware Visual Question Generation With Lightweight Models Suwono Nicholas Collin, Chen Justin Chih-yao, Hung Tun Min, Huang Ting-hao Kenneth, Liao I-bin, Li Yung-hui, Ku Lun-wei, Sun Shao-hua
- Approximating Online Human Evaluation Of Social Chatbots With Prompting Svikhnushina Ekaterina, Pu Pearl
- Assessing The Ability Of Chatgpt To Screen Articles For Systematic Reviews Syriani Eugene, David Istvan, Kumar Gauransh
- Gender-specific Machine Translation With Large Language Models Sánchez Eduardo, Andrews Pierre, Stenetorp Pontus, Artetxe Mikel, Costa-jussà Marta R.
- Link-context Learning For Multimodal Llms Tai Yan, Fan Weichen, Zhang Zhao, Zhu Feng, Zhao Rui, Liu Ziwei
- Copilot For Xcode: Exploring Ai-assisted Programming By Prompting Cloud-based Large Language Models Tan Chee Wei, Guo Shangxin, Wong Man Fai, Hang Ching Nam
- Learning To Imagine: Visually-augmented Natural Language Generation Tang Tianyi, Chen Yushuo, Du Yifan, Li Junyi, Zhao Wayne Xin, Wen Ji-rong
- Ml-bench: Evaluating Large Language Models And Agents For Machine Learning Tasks On Repository-level Code Tang Xiangru, Liu Yuliang, Cai Zefan, Shao Yanjun, Lu Junjie, Zhang Yichi, Deng Zexuan, Hu Helan, An Kaikai, Huang Ruijun, Si Shuzheng, Chen Sheng, Zhao Haozhe, Chen Liang, Wang Yan, Liu Tianyu, Jiang Zhiwei, Chang Baobao, Fang Yin, Qin Yujia, Zhou Wangchunshu, Zhao Yilun, Cohan Arman, Gerstein Mark
- Struc-bench: Are Large Language Models Really Good At Generating Complex Structured Data? Tang Xiangru, Zong Yiming, Phang Jason, Zhao Yilun, Zhou Wangchunshu, Cohan Arman, Gerstein Mark
- Medagents: Large Language Models As Collaborators For Zero-shot Medical Reasoning Tang Xiangru, Zou Anni, Zhang Zhuosheng, Li Ziming, Zhao Yilun, Zhang Xingyao, Cohan Arman, Gerstein Mark
- RSVP: Customer Intent Detection Via Agent Response Contrastive And Generative Pre-training Tang Yu-chien, Wang Wei-yao, Yen An-zi, Peng Wen-chih
- A Benchmark For Learning To Translate A New Language From One Grammar Book Tanzer Garrett, Suzgun Mirac, Visser Eline, Jurafsky Dan, Melas-kyriazi Luke
- Caption Anything: Interactive Image Description With Diverse Multimodal Controls Teng Wang, Jinrui Zhang, Junjie Fei, Hao Zheng, Yunlong Tang, Zhe Li, Mingqi Gao, Shanshan Zhao
- Nomiracl: Knowing When You Don't Know For Robust Multilingual Retrieval-augmented Generation Thakur Nandan, Bonifacio Luiz, Zhang Xinyu, Ogundepo Odunayo, Kamalloo Ehsan, Alfonso-hermelo David, Li Xiaoguang, Liu Qun, Chen Boxing, Rezagholizadeh Mehdi, Lin Jimmy
- Leveraging Llms For Synthesizing Training Data Across Many Languages In Multilingual Dense Retrieval Thakur Nandan, Ni Jianmo, Ábrego Gustavo Hernández, Wieting John, Lin Jimmy, Cer Daniel
- Learning To Prompt In The Classroom To Understand AI Limits: A Pilot Study Theophilou Emily, Koyuturk Cansu, Yavari Mona, Bursic Sathya, Donabauer Gregor, Telari Alessia, Testa Alessia, Boiano Raffaele, Hernandez-leo Davinia, Ruskov Martin, Taibi Davide, Gabbiadini Alessandro, Ognibene Dimitri
- Is Chatgpt The Ultimate Programming Assistant -- How Far Is It? Tian Haoye, Lu Weiqi, Li Tsz On, Tang Xunzhu, Cheung Shing-chi, Klein Jacques, Bissyandé Tegawendé F.
- Soft-prompt Tuning For Large Language Models To Evaluate Bias Tian Jacob-junqi, Emerson David, Miyandoab Sevil Zanjani, Pandya Deval, Seyyed-kalantari Laleh, Khattak Faiza Khan
- R\(^3\) Prompting: Review, Rephrase And Resolve For Chain-of-thought Reasoning In Large Language Models Under Noisy Context Tian Qingyuan, Zhu Hanlun, Wang Lei, Li Yang, Lan Yunshi
- Argue: Attribute-guided Prompt Tuning For Vision-language Models Tian Xinyu, Zou Shu, Yang Zhaoyuan, Zhang Jing
- Graph Neural Prompting With Large Language Models Tian Yijun, Song Huan, Wang Zichen, Wang Haozhu, Hu Ziqing, Wang Fang, Chawla Nitesh V., Xu Panpan
- Evil Geniuses: Delving Into The Safety Of Llm-based Agents Tian Yu, Yang Xiao, Zhang Jingyuan, Dong Yinpeng, Su Hang
- Chimed-gpt: A Chinese Medical Large Language Model With Full Training Regime And Better Alignment To Human Preferences Tian Yuanhe, Gan Ruyi, Song Yan, Zhang Jiaxing, Zhang Yongdong
- Qlora: Efficient Finetuning Of Quantized Llms Tim Dettmers, Artidoro Pagnoni, Ari Holtzman, Luke Zettlemoyer
- D4: Improving LLM Pretraining Via Document De-duplication And Diversification Tirumala Kushal, Simig Daniel, Aghajanyan Armen, Morcos Ari S.
- Clinical Camel: An Open Expert-level Medical Language Model With Dialogue-based Knowledge Encoding Toma Augustin, Lawler Patrick R., Ba Jimmy, Krishnan Rahul G., Rubin Barry B., Wang Bo
- Llmseceval: A Dataset Of Natural Language Prompts For Security Evaluations Tony Catherine, Mutas Markus, Ferreyra Nicolás E. Díaz, Scandariato Riccardo
- Viclevr: A Visual Reasoning Dataset And Hybrid Multimodal Fusion Model For Visual Question Answering In Vietnamese Tran Khiem Vinh, Phan Hao Phu, Van Nguyen Kiet, Nguyen Ngan Luu Thuy
- Large Language Model Prompt Chaining For Long Legal Document Classification Trautmann Dietrich
- Can Large Language Models Play Text Games Well? Current State-of-the-art And Open Questions Tsai Chen Feng, Zhou Xiaochen, Liu Sierra S., Li Jing, Yu Mo, Mei Hongyuan
- Automl-gpt: Large Language Model For Automl Tsai Yun-da, Tsai Yu-che, Huang Bo-wei, Yang Chun-pai, Lin Shou-de
- Efficiently Aligned Cross-lingual Transfer Learning For Conversational Tasks Using Prompt-tuning Tu Lifu, Qu Jin, Yavuz Semih, Joty Shafiq, Liu Wenhao, Xiong Caiming, Zhou Yingbo
- Large Language Models Fail On Trivial Alterations To Theory-of-mind Tasks Ullman Tomer
- Taco: Enhancing Cross-lingual Transfer For Low-resource Languages In Llms Through Translation-assisted Chain-of-thought Processes Upadhayay Bibek, Behzadan Vahid
- Mlongt5: A Multilingual And Efficient Text-to-text Transformer For Longer Sequences Uthus David, Ontañón Santiago, Ainslie Joshua, Guo Mandy
- Let's Think Frame By Frame With VIP: A Video Infilling And Prediction Dataset For Evaluating Video Chain-of-thought Vaishnavi Himakunthala, Andy Ouyang, Daniel Rose, Ryan He, Alex Mei, Yujie Lu, Chinmay Sonar, Michael Saxon, William Yang Wang
- On The Planning Abilities Of Large Language Models: A Critical Investigation Valmeekam Karthik, Marquez Matthew, Sreedharan Sarath, Kambhampati Subbarao
- Large Language Models: The Need For Nuance In Current Debates And A Pragmatic Perspective On Understanding Van Dijk Bram M. A., Kouwenhoven Tom, Spruit Marco R., Van Duijn Max J.
- Structured Thoughts Automaton: First Formalized Execution Model For Auto-regressive Language Models Vanderbruggen Tristan, Liao Chunhua, Pirkelbauer Peter, Lin Pei-hung
- Can NLP Models Correctly Reason Over Contexts That Break The Common Assumptions? Varshney Neeraj, Parmar Mihir, Patel Nisarg, Handa Divij, Sarkar Sayantan, Luo Man, Baral Chitta
- A Stitch In Time Saves Nine: Detecting And Mitigating Hallucinations Of Llms By Validating Low-confidence Generation Varshney Neeraj, Yao Wenlin, Zhang Hongming, Chen Jianshu, Yu Dong
- ASPIRO: Any-shot Structured Parsing-error-induced Reprompting For Consistent Data-to-text Generation Vejvar Martin, Fujimoto Yasutaka
- Reducing LLM Hallucinations Using Epistemic Neural Networks Verma Shreyas, Tran Kien, Ali Yusuf, Min Guangyu
- Small Language Models Improve Giants By Rewriting Their Outputs Vernikos Giorgos, Bražinskas Arthur, Adamek Jakub, Mallinson Jonathan, Severyn Aliaksei, Malmi Eric
- Mtcue: Learning Zero-shot Control Of Extra-textual Attributes By Leveraging Unstructured Context In Neural Machine Translation Vincent Sebastian, Flynn Robert, Scarton Carolina
- Prompt2model: Generating Deployable Models From Natural Language Instructions Viswanathan Vijay, Zhao Chenyang, Bertsch Amanda, Wu Tongshuang, Neubig Graham
- Freshllms: Refreshing Large Language Models With Search Engine Augmentation Vu Tu, Iyyer Mohit, Wang Xuezhi, Constant Noah, Wei Jerry, Wei Jason, Tar Chris, Sung Yun-hsuan, Zhou Denny, Le Quoc, Luong Thang
- Empirical Evaluation Of Uncertainty Quantification In Retrieval-augmented Language Models For Science Wagle Sridevi, Munikoti Sai, Acharya Anurag, Smith Sara, Horawalavithana Sameera
- Poisoning Language Models During Instruction Tuning Wan Alexander, Wallace Eric, Shen Sheng, Klein Dan
- Reformulating Domain Adaptation Of Large Language Models As Adapt-retrieve-revise: A Case Study On Chinese Legal Domain Wan Zhen, Zhang Yating, Wang Yexiang, Cheng Fei, Kurohashi Sadao
- Shall We Pretrain Autoregressive Language Models With Retrieval? A Comprehensive Study Wang Boxin, Ping Wei, Xu Peng, Mcafee Lawrence, Liu Zihan, Shoeybi Mohammad, Dong Yi, Kuchaiev Oleksii, Li Bo, Xiao Chaowei, Anandkumar Anima, Catanzaro Bryan
- Cost-effective Hyperparameter Optimization For Large Language Model Generation Inference Wang Chi, Liu Susan Xueqing, Awadallah Ahmed H.
- BERT4CTR: An Efficient Framework To Combine Pre-trained Language Model With Non-textual Features For CTR Prediction Wang Dong, Salamatian Kavé, Xia Yunqing, Deng Weiwei, Zhiang Qi
- Openchat: Advancing Open-source Language Models With Mixed-quality Data Wang Guan, Cheng Sijie, Zhan Xianyuan, Li Xiangang, Song Sen, Liu Yang
- What Makes For Good Visual Tokenizers For Large Language Models? Wang Guangzhi, Ge Yixiao, Ding Xiaohan, Kankanhalli Mohan, Shan Ying
- Dialog Action-aware Transformer For Dialog Policy Learning Wang Huimin, Kwan Wai-chung, Wong Kam-fai
- Notechat: A Dataset Of Synthetic Doctor-patient Conversations Conditioned On Clinical Notes Wang Junda, Yao Zonghai, Yang Zhichao, Zhou Huixue, Li Rumeng, Wang Xun, Xu Yucheng, Yu Hong
- Improved Visual Fine-tuning With Natural Language Supervision Wang Junyang, Xu Yuanhong, Hu Juhua, Yan Ming, Sang Jitao, Qian Qi
- Chated: A Chatbot Leveraging Chatgpt For An Enhanced Learning Experience In Higher Education Wang Kevin, Ramos Jason, Lawrence Ramon
- Improving Text Embeddings With Large Language Models Wang Liang, Yang Nan, Huang Xiaolong, Yang Linjun, Majumder Rangan, Wei Furu
- Document-level Machine Translation With Large Language Models Wang Longyue, Lyu Chenyang, Ji Tianbo, Zhang Zhirui, Yu Dian, Shi Shuming, Tu Zhaopeng
- Speech-to-text Adapter And Speech-to-entity Retriever Augmented Llms For Speech Understanding Wang Mingqiu, Shafran Izhak, Soltau Hagen, Han Wei, Cao Yuan, Yu Dian, Shafey Laurent El
- Making Large Language Models Better Reasoners With Alignment Wang Peiyi, Li Lei, Chen Liang, Song Feifan, Lin Binghuai, Cao Yunbo, Liu Tianyu, Sui Zhifang
- Vamos: Versatile Action Models For Video Understanding Wang Shijie, Zhao Qi, Do Minh Quan, Agarwal Nakul, Lee Kwonjoon, Sun Chen
- Shepherd: A Critic For Language Model Generation Wang Tianlu, Yu Ping, Tan Xiaoqing Ellen, O'brien Sean, Pasunuru Ramakanth, Dwivedi-yu Jane, Golovneva Olga, Zettlemoyer Luke, Fazel-zarandi Maryam, Celikyilmaz Asli
- CAR: Conceptualization-augmented Reasoner For Zero-shot Commonsense Question Answering Wang Weiqi, Fang Tianqing, Ding Wenxuan, Xu Baixuan, Liu Xin, Song Yangqiu, Bosselut Antoine
- Orthogonal Subspace Learning For Language Model Continual Learning Wang Xiao, Chen Tianze, Ge Qiming, Xia Han, Bao Rong, Zheng Rui, Zhang Qi, Gui Tao, Huang Xuanjing
- MINT: Evaluating Llms In Multi-turn Interaction With Tools And Language Feedback Wang Xingyao, Wang Zihan, Liu Jiateng, Chen Yangyi, Yuan Lifan, Peng Hao, Ji Heng
- Guiding Language Model Reasoning With Planning Tokens Wang Xinyi, Caccia Lucas, Ostapenko Oleksiy, Yuan Xingdi, Wang William Yang, Sordoni Alessandro
- User-aware Prefix-tuning Is A Good Learner For Personalized Image Captioning Wang Xuan, Wang Guanhong, Chai Wenhao, Zhou Jiayu, Wang Gaoang
- Emotional Intelligence Of Large Language Models Wang Xuena, Li Xueting, Yin Zi, Wu Yue, Jia Liu
- Hint-enhanced In-context Learning Wakes Large Language Models Up For Knowledge-intensive Tasks Wang Yifan, Guo Qingyan, Ni Xinzhe, Shi Chufan, Liu Lemao, Jiang Haiyun, Yang Yujiu
- Lifelongmemory: Leveraging Llms For Answering Queries In Long-form Egocentric Videos Wang Ying, Yang Yanlai, Ren Mengye
- How Far Can Camels Go? Exploring The State Of Instruction Tuning On Open Resources Wang Yizhong, Ivison Hamish, Dasigi Pradeep, Hessel Jack, Khot Tushar, Chandu Khyathi Raghavi, Wadden David, Macmillan Kelsey, Smith Noah A., Beltagy Iz, Hajishirzi Hannaneh
- Learning Hierarchical Prompt With Structured Linguistic Knowledge For Vision-language Models Wang Yubin, Jiang Xinyang, Cheng De, Li Dongsheng, Zhao Cairong
- Multiqg-ti: Towards Question Generation From Multi-modal Sources Wang Zichao, Baraniuk Richard
- Filling The Image Information Gap For VQA: Prompting Large Language Models To Proactively Ask Questions Wang Ziyue, Chen Chi, Li Peng, Liu Yang
- Machine Translation For Ge'ez Language Wassie Aman Kassahun
- Lmsanitator: Defending Prompt-tuning Against Task-agnostic Backdoors Wei Chengkun, Meng Wenlong, Zhang Zhikun, Chen Min, Zhao Minghu, Fang Wenjing, Wang Lei, Zhang Zihui, Chen Wenzhi
- Leveraging Large Language Models To Power Chatbots For Collecting User Self-reported Data Wei Jing, Kim Sungdong, Jung Hyunhoon, Kim Young-ho
- Llmrec: Large Language Models With Graph Augmentation For Recommendation Wei Wei, Xubin Ren, Jiabin Tang, Qinyong Wang, Lixin Su, Suqi Cheng, Junfeng Wang, Dawei Yin, Chao Huang
- Menatqa: A New Dataset For Testing The Temporal Comprehension And Reasoning Abilities Of Large Language Models Wei Yifan, Su Yisong, Ma Huanhuan, Yu Xiaoyan, Lei Fangyu, Zhang Yuanzhe, Zhao Jun, Liu Kang
- Copiloting The Copilots: Fusing Large Language Models With Completion Engines For Automated Program Repair Wei Yuxiang, Xia Chunqiu Steven, Zhang Lingming
- In-context Pretraining: Language Modeling Beyond Document Boundaries Weijia Shi, Sewon Min, Maria Lomeli, Chunting Zhou, Margaret Li, Gergely Szilvasy, Rich James, Xi Victoria Lin, Noah A. Smith, Luke Zettlemoyer, Scott Yih, Mike Lewis
- REPLUG: Retrieval-augmented Black-box Language Models Weijia Shi, Sewon Min, Michihiro Yasunaga, Minjoon Seo, Rich James, Mike Lewis, Luke Zettlemoyer, Wen-tau Yih
- Infovisdial: An Informative Visual Dialogue Dataset By Bridging Large Multimodal And Language Models Wen Bingbing, Yang Zhengyuan, Wang Jianfeng, Gan Zhe, Howe Bill, Wang Lijuan
- Droidbot-gpt: Gpt-powered UI Automation For Android Wen Hao, Wang Hongming, Liu Jiaxuan, Li Yuanchun
- Mindmap: Knowledge Graph Prompting Sparks Graph Of Thoughts In Large Language Models Wen Yilin, Wang Zifeng, Sun Jimeng
- GROVE: A Retrieval-augmented Complex Story Generation Framework With A Forest Of Evidence Wen Zhihua, Tian Zhiliang, Wu Wei, Yang Yuxin, Shi Yanqi, Huang Zhen, Li Dongsheng
- Hyperpolyglot Llms: Cross-lingual Interpretability In Token Embeddings Wen-yi Andrea W, Mimno David
- Generative Recommendation: Towards Next-generation Recommender Paradigm Wenjie Wang, Xinyu Lin, Fuli Feng, Xiangnan He, Tat-seng Chua
- System 2 Attention (is Something You Might Need Too) Weston Jason, Sukhbaatar Sainbayar
- Llm-powered Data Augmentation For Enhanced Cross-lingual Performance Whitehouse Chenxi, Choudhury Monojit, Aji Alham Fikri
- Resolving References In Visually-grounded Dialogue Via Text Generation Willemsen Bram, Qian Livia, Skantze Gabriel
- Frustratingly Simple Memory Efficiency For Pre-trained Language Models Via Dynamic Embedding Pruning Williams Miles, Aletras Nikolaos
- From Word Models To World Models: Translating From Natural Language To The Probabilistic Language Of Thought Wong Lionel, Grand Gabriel, Lew Alexander K., Goodman Noah D., Mansinghka Vikash K., Andreas Jacob, Tenenbaum Joshua B.
- From Big To Small Without Losing It All: Text Augmentation With Chatgpt For Efficient Sentiment Analysis Woźniak Stanisław, Kocoń Jan
- Rethinking Model Selection And Decoding For Keyphrase Generation With Pre-trained Sequence-to-sequence Models Wu Di, Ahmad Wasi Uddin, Chang Kai-wei
- Language Prompt For Autonomous Driving Wu Dongming, Han Wencheng, Wang Tiancai, Liu Yingfei, Zhang Xiangyu, Shen Jianbing
- Large Language Models Leverage External Knowledge To Extend Clinical Insight Beyond Language Boundaries Wu Jiageng, Wu Xian, Qiu Zhaopeng, Li Minghui, Zhang Yingying, Zheng Yefeng, Yuan Changzheng, Yang Jie
- Document Flattening: Beyond Concatenating Context For Document-level Neural Machine Translation Wu Minghao, Foster George, Qu Lizhen, Haffari Gholamreza
- Lamini-lm: A Diverse Herd Of Distilled Models From Large-scale Instructions Wu Minghao, Waheed Abdul, Zhang Chiyu, Abdul-mageed Muhammad, Aji Alham Fikri
- Mindshift: Leveraging Large Language Models For Mental-states-based Problematic Smartphone Use Intervention Wu Ruolan, Yu Chun, Pan Xiaole, Liu Yujia, Zhang Ningning, Fu Yue, Wang Yuhan, Zheng Zhi, Chen Li, Jiang Qiaolei, Xu Xuhai, Shi Yuanchun
- Next-gpt: Any-to-any Multimodal LLM Wu Shengqiong, Fei Hao, Qu Leigang, Ji Wei, Chua Tat-seng
- Analyzing Chain-of-thought Prompting In Large Language Models Via Gradient-based Feature Attributions Wu Skyler, Shen Eric Meng, Badrinath Charumathi, Ma Jiaqi, Lakkaraju Himabindu
- Paragraph-to-image Generation With Information-enriched Diffusion Model Wu Weijia, Li Zhuang, He Yefei, Shou Mike Zheng, Shen Chunhua, Cheng Lele, Li Yan, Gao Tingting, Zhang Di, Wang Zhongyuan
- Unleashing Potential Of Evidence In Knowledge-intensive Dialogue Generation Wu Xianjie, Yang Jian, Li Tongliang, Liang Di, Zhang Shiwei, Du Yiyang, Li Zhoujun
- From Language Modeling To Instruction Following: Understanding The Behavior Shift In Llms After Instruction Tuning Wu Xuansheng, Yao Wenlin, Chen Jianshu, Pan Xiaoman, Wang Xiaoyang, Liu Ninghao, Yu Dong
- Jailbreaking GPT-4V Via Self-adversarial Attacks With System Prompts Wu Yuanwei, Li Xiang, Liu Yixin, Zhou Pan, Sun Lichao
- Semantic Parsing By Large Language Models For Intricate Updating Strategies Of Zero-shot Dialogue State Tracking Wu Yuxiang, Dong Guanting, Xu Weiran
- Get An A In Math: Progressive Rectification Prompting Wu Zhenyu, Jiang Meng, Shen Chao
- LLMGA: Multimodal Large Language Model Based Generation Assistant Xia Bin, Wang Shiyin, Tao Yingfan, Wang Yitong, Jia Jiaya
- Conversational Automated Program Repair Xia Chunqiu Steven, Zhang Lingming
- Hybrid Retrieval-augmented Generation For Real-time Composition Assistance Xia Menglin, Zhang Xuchao, Couturier Camille, Zheng Guoqing, Rajmohan Saravan, Ruhle Victor
- Sheared Llama: Accelerating Language Model Pre-training Via Structured Pruning Xia Mengzhou, Gao Tianyu, Zeng Zhiyuan, Chen Danqi
- Language Models Meet World Models: Embodied Experiences Enhance Language Models Xiang Jiannan, Tao Tianhua, Gu Yi, Shu Tianmin, Wang Zirui, Yang Zichao, Hu Zhiting
- Lm-cocktail: Resilient Tuning Of Language Models Via Model Merging Xiao Shitao, Liu Zheng, Zhang Peitian, Xing Xingrun
- Personalized Abstractive Summarization By Tri-agent Generation Pipeline Xiao Wen, Xie Yujia, Carenini Giuseppe, He Pengcheng
- Instructed Language Models With Retrievers Are Powerful Entity Linkers Xiao Zilin, Gong Ming, Wu Jie, Zhang Xingyao, Shou Linjun, Pei Jian, Jiang Daxin
- Pangu-σ: Towards Trillion Parameter Language Model With Sparse Heterogeneous Computing Xiaozhe Ren, Pingyi Zhou, Xinfan Meng, Xinjing Huang, Yadao Wang, Weichao Wang, Pengfei Li, Xiaoda Zhang, Alexander Podolskiy, Grigory Arshinov, Andrey Bout, Irina Piontkovskaya, Jiansheng Wei, Xin Jiang, Teng Su, Qun Liu, Jun Yao
- QUERT: Continual Pre-training Of Language Model For Query Understanding In Travel Domain Search Xie Jian, Liang Yidan, Liu Jingping, Xiao Yanghua, Wu Baohua, Ni Shenghua
- Adaptive Chameleon Or Stubborn Sloth: Revealing The Behavior Of Large Language Models In Knowledge Conflicts Xie Jian, Zhang Kai, Chen Jiangjie, Lou Renze, Su Yu
- The Wall Street Neophyte: A Zero-shot Analysis Of Chatgpt Over Multimodal Stock Movement Prediction Challenges Xie Qianqian, Han Weiguang, Lai Yanzhao, Peng Min, Huang Jimin
- Empirical Study Of Zero-shot NER With Chatgpt Xie Tingyu, Li Qi, Zhang Jian, Zhang Yan, Liu Zuozhu, Wang Hongwei
- Translating Natural Language To Planning Goals With Large-language Models Xie Yaqi, Yu Chen, Zhu Tongyao, Bai Jinbin, Gong Ze, Soh Harold
- E4srec: An Elegant Effective Efficient Extensible Solution Of Large Language Models For Sequential Recommendation Xinhang Li, Chong Chen, Xiangyu Zhao, Yong Zhang, Chunxiao Xing
- A Preliminary Study Of Chatgpt On News Recommendation: Personalization, Provider Fairness, Fake News Xinyi Li, Yongfeng Zhang, Edward C. Malthouse
- Bridging Items And Language: A Transition Paradigm For Large Language Model-based Recommendation Xinyu Lin, Wenjie Wang, Yongqi Li, Fuli Feng, See-kiong Ng, Tat-seng Chua
- Examining Inter-consistency Of Large Language Models Collaboration: An In-depth Analysis Via Debate Xiong Kai, Ding Xiao, Cao Yixin, Liu Ting, Qin Bing
- The Program Testing Ability Of Large Language Models For Code Xiong Weimin, Guo Yiwen, Chen Hao
- Rewoo: Decoupling Reasoning From Observations For Efficient Augmented Language Models Xu Binfeng, Peng Zhiyuan, Lei Bowen, Mukherjee Subhabrata, Liu Yuchen, Xu Dongkuan
- Baize: An Open-source Chat Model With Parameter-efficient Tuning On Self-chat Data Xu Canwen, Guo Daya, Duan Nan, Mcauley Julian
- Ctc-based Non-autoregressive Speech Translation Xu Chen, Liu Xiaoqian, Liu Xiaowen, Sun Qingxuan, Zhang Yuhao, Yang Murun, Dong Qianqian, Ko Tom, Wang Mingxuan, Xiao Tong, Ma Anxiang, Zhu Jingbo
- A Paradigm Shift In Machine Translation: Boosting Translation Performance Of Large Language Models Xu Haoran, Kim Young Jin, Sharaf Amr, Awadalla Hany Hassan
- Recommender AI Agent: Integrating Large Language Models For Interactive Recommendations Xu Huang, Jianxun Lian, Yuxuan Lei, Jing Yao, Defu Lian, Xing Xie
- U-llava: Unifying Multi-modal Tasks Via Large Language Model Xu Jinjin, Xu Liwu, Yang Yuzhe, Li Xiang, Wang Fanyi, Xie Yanchun, Huang Yi-jie, Li Yaqian
- Tensorgpt: Efficient Compression Of The Embedding Layer In Llms Based On The Tensor-train Decomposition Xu Mingxue, Xu Yao Lei, Mandic Danilo P.
- Llmrefine: Pinpointing And Refining Large Language Models Via Fine-grained Actionable Feedback Xu Wenda, Deutsch Daniel, Finkelstein Mara, Juraska Juraj, Zhang Biao, Liu Zhongtao, Wang William Yang, Li Lei, Freitag Markus
- Mental-llm: Leveraging Large Language Models For Mental Health Prediction Via Online Text Data Xu Xuhai, Yao Bingsheng, Dong Yuanzhe, Gabriel Saadia, Yu Hong, Hendler James, Ghassemi Marzyeh, Dey Anind K., Wang Dakuo
- Drivegpt4: Interpretable End-to-end Autonomous Driving Via Large Language Model Xu Zhenhua, Zhang Yujia, Xie Enze, Zhao Zhen, Guo Yong, Wong Kwan-yee. K., Li Zhenguo, Zhao Hengshuang
- IDOL: Indicator-oriented Logic Pre-training For Logical Reasoning Xu Zihang, Yang Ziqing, Cui Yiming, Wang Shijin
- Pink: Unveiling The Power Of Referential Comprehension For Multi-modal Llms Xuan Shiyu, Guo Qingpei, Yang Ming, Zhang Shiliang
- How Robust Is GPT-3.5 To Predecessors? A Comprehensive Study On Language Understanding Tasks Xuanting Chen, Junjie Ye, Can Zu, Nuo Xu, Rui Zheng, Minlong Peng, Jie Zhou, Tao Gui, Qi Zhang, Xuanjing Huang
- To Repeat Or Not To Repeat: Insights From Scaling LLM Under Token-crisis Xue Fuzhao, Fu Yao, Zhou Wangchunshu, Zheng Zangwei, You Yang
- Pit One Against Many: Leveraging Attention-head Embeddings For Parameter-efficient Multi-head Attention Xue Huiyin, Aletras Nikolaos
- Occuquest: Mitigating Occupational Bias For Inclusive Large Language Models Xue Mingfeng, Liu Dayiheng, Yang Kexin, Dong Guanting, Lei Wenqiang, Yuan Zheng, Zhou Chang, Zhou Jingren
- RCOT: Detecting And Rectifying Factual Inconsistency In Reasoning By Reversing Chain-of-thought Xue Tianci, Wang Ziqi, Wang Zhenhailong, Han Chi, Yu Pengfei, Ji Heng
- Scaling Evidence-based Instructional Design Expertise Through Large Language Models Yadav Gautam
- Pretraining Data Mixtures Enable Narrow Model Selection Capabilities In Transformer Models Yadlowsky Steve, Doshi Lyric, Tripuraneni Nilesh
- Practical And Ethical Challenges Of Large Language Models In Education: A Systematic Scoping Review Yan Lixiang, Sha Lele, Zhao Linxuan, Li Yuheng, Martinez-maldonado Roberto, Chen Guanliang, Li Xinyu, Jin Yueqiao, Gašević Dragan
- Refining The Responses Of Llms By Themselves Yan Tianqiang, Xu Tiansheng
- Recmind: Large Language Model Powered Agent For Recommendation Yancheng Wang, Ziyan Jiang, Zheng Chen, Fan Yang, Yingxue Zhou, Eunah Cho, Xing Fan, Xiaojiang Huang, Yanbin Lu, Yingzhen Yang
- Effective Distillation Of Table-based Reasoning Ability From Llms Yang Bohao, Tang Chen, Zhao Kun, Xiao Chenghao, Lin Chenghua
- Beyond Testers' Biases: Guiding Model Testing With Knowledge Bases Using Llms Yang Chenyang, Rustogi Rishabh, Brower-sinning Rachel, Lewis Grace A., Kästner Christian, Wu Tongshuang
- Refgpt: Dialogue Generation Of GPT, By GPT, And For GPT Yang Dongjie, Yuan Ruifeng, Fan Yuantao, Yang Yifei, Wang Zili, Wang Shusen, Zhao Hai
- PALR: Personalization Aware Llms For Recommendation Yang Fan, Chen Zheng, Jiang Ziyan, Cho Eunah, Huang Xiaojiang, Lu Yanbin
- Empower Large Language Model To Perform Better On Industrial Domain-specific Question Answering Yang Fangkai, Zhao Pu, Wang Zezhong, Wang Lu, Zhang Jue, Garg Mohit, Lin Qingwei, Rajmohan Saravan, Zhang Dongmei
- PRCA: Fitting Black-box Large Language Models For Retrieval Question Answering Via Pluggable Reward-driven Contextual Adapter Yang Haoyan, Li Zhitao, Zhang Yong, Wang Jianzong, Cheng Ning, Li Ming, Xiao Jing
- Auto-gpt For Online Decision Making: Benchmarks And Additional Opinions Yang Hui, Yue Sifu, He Yunzhong
- Good Questions Help Zero-shot Image Reasoning Yang Kaiwen, Shen Tao, Tian Xinmei, Geng Xiubo, Tao Chongyang, Tao Dacheng, Zhou Tianyi
- Medxchat: A Unified Multimodal Large Language Model Framework Towards Cxrs Understanding And Generation Yang Ling, Wang Zhanyu, Chen Zhenghao, Liang Xinyu, Zhou Luping
- Progressive Evidence Refinement For Open-domain Multimodal Retrieval Question Answering Yang Shuwen, Wu Anran, Wu Xingjiao, Xiao Luwei, Ma Tianlong, Jin Cheng, He Liang
- Psycot: Psychological Questionnaire As Powerful Chain-of-thought For Personality Detection Yang Tao, Shi Tianyuan, Wan Fanqi, Quan Xiaojun, Wang Qifan, Wu Bingzhe, Wu Jiaxiang
- Dior-cvae: Pre-trained Language Models And Diffusion Priors For Variational Dialog Generation Yang Tianyu, Tran Thy Thy, Gurevych Iryna
- Neural Machine Translation Data Generation And Augmentation Using Chatgpt Yang Wayne, Nicolai Garrett
- Human-in-the-loop Machine Translation With Large Language Model Yang Xinyi, Zhan Runzhe, Wong Derek F., Wu Junchao, Chao Lidia S.
- Mindllm: Pre-training Lightweight Large Language Model From Scratch, Evaluations And Domain Applications Yang Yizhe, Sun Huashan, Li Jiawei, Liu Runheng, Li Yinghao, Liu Yuhang, Huang Heyan, Gao Yang
- Harnessing The Power Of Large Language Models For Natural Language To First-order Logic Translation Yang Yuan, Xiong Siheng, Payani Ali, Shareghi Ehsan, Fekri Faramarz
- Improving The Reliability Of Large Language Models By Leveraging Uncertainty-aware In-context Learning Yang Yuchen, Li Houqiang, Wang Yanfeng, Wang Yu
- Re-vilm: Retrieval-augmented Visual Language Model For Zero And Few-shot Image Captioning Yang Zhuolin, Ping Wei, Liu Zihan, Korthikanti Vijay, Nie Weili, Huang De-an, Fan Linxi, Yu Zhiding, Lan Shiyi, Li Bo, Liu Ming-yu, Zhu Yuke, Shoeybi Mohammad, Catanzaro Bryan, Xiao Chaowei, Anandkumar Anima
- I-code V2: An Autoregressive Generation Framework Over Vision, Language, And Speech Data Yang Ziyi, Khademi Mahmoud, Xu Yichong, Pryzant Reid, Fang Yuwei, Zhu Chenguang, Chen Dongdong, Qian Yao, Gao Mei, Chen Yi-ling, Gmyr Robert, Kanda Naoyuki, Codella Noel, Xiao Bin, Shi Yu, Yuan Lu, Yoshioka Takuya, Zeng Michael, Huang Xuedong
- More Samples Or More Prompts? Exploring Effective In-context Sampling For LLM Few-shot Prompt Engineering Yao Bingsheng, Chen Guiming, Zou Ruishi, Lu Yuxuan, Li Jiachen, Zhang Shao, Sang Yisi, Liu Sijia, Hendler James, Wang Dakuo
- Tcp: Textual-based Class-aware Prompt Tuning For Visual-language Model Yao Hantao, Zhang Rui, Xu Changsheng
- Retroformer: Retrospective Large Language Agents With Policy Gradient Optimization Yao Weiran, Heinecke Shelby, Niebles Juan Carlos, Liu Zhiwei, Feng Yihao, Xue Le, Murthy Rithesh, Chen Zeyuan, Zhang Jianguo, Arpit Devansh, Xu Ran, Mui Phil, Wang Huan, Xiong Caiming, Savarese Silvio
- Do Physicians Know How To Prompt? The Need For Automatic Prompt Optimization Help In Clinical Note Generation Yao Zonghai, Jaafar Ahmed, Wang Beining, Yang Zhichao, Yu Hong
- Collaborative Large Language Model For Recommender Systems Yaochen Zhu, Liang Wu, Qi Guo, Liangjie Hong, Jundong Li
- Enhancing Conversational Search: Large Language Model-aided Informative Query Rewriting Ye Fanghua, Fang Meng, Li Shenghui, Yilmaz Emine
- Cognitive Mirage: A Review Of Hallucinations In Large Language Models Ye Hongbin, Liu Tong, Zhang Aijia, Hua Wei, Jia Weiqiang
- Ureader: Universal Ocr-free Visually-situated Language Understanding With Multimodal Large Language Model Ye Jiabo, Hu Anwen, Xu Haiyang, Ye Qinghao, Yan Ming, Xu Guohai, Li Chenliang, Tian Junfeng, Qian Qi, Zhang Ji, Jin Qin, He Liang, Lin Xin Alex, Huang Fei
- Qilin-med: Multi-stage Knowledge Injection Advanced Medical Large Language Model Ye Qichen, Liu Junling, Chong Dading, Zhou Peilin, Hua Yining, Liu Fenglin, Cao Meng, Wang Ziming, Cheng Xuxin, Lei Zhu, Guo Zhenhua
- Investigating The Effectiveness Of Task-agnostic Prefix Prompt For Instruction Following Ye Seonghyeon, Hwang Hyeonbin, Yang Sohee, Yun Hyeongu, Kim Yireun, Seo Minjoon
- Tram: A Token-level Retrieval-augmented Mechanism For Source Code Summarization Ye Tong, Wu Lingfei, Ma Tengfei, Zhang Xuhong, Du Yangkai, Liu Peiyu, Ji Shouling, Wang Wenhai
- Satlm: Satisfiability-aided Language Models Using Declarative Prompting Ye Xi, Chen Qiaochu, Dillig Isil, Durrett Greg
- ASPEN: High-throughput Lora Fine-tuning Of Large Language Models With A Single GPU Ye Zhengmao, Li Dengchun, Tian Jingqi, Lan Tingfeng, Zuo Jie, Duan Lei, Lu Hui, Jiang Yexi, Sha Jian, Zhang Ke, Tang Mingjie
- A Multitask, Multilingual, Multimodal Evaluation Of Chatgpt On Reasoning, Hallucination, And Interactivity Yejin Bang, Samuel Cahyawijaya, Nayeon Lee, Wenliang Dai, Dan Su, Bryan Wilie, Holy Lovenia, Ziwei Ji, Tiezheng Yu, Willy Chung, Quyet V. Do, Yan Xu, Pascale Fung
- Llm-eval: Unified Multi-dimensional Automatic Evaluation For Open-domain Conversations With Large Language Models Yen-ting Lin, Yun-nung Chen
- Cumulative Reasoning With Large Language Models Yifan Zhang, Jingqin Yang, Yang Yuan, Andrew Chi-chih Yao
- Lemur: Harmonizing Natural Language And Code For Language Agents Yiheng Xu, Hongjin Su, Chen Xing, Boyu Mi, Qian Liu, Weijia Shi, Binyuan Hui, Fan Zhou, Yitao Liu, Tianbao Xie, Zhoujun Cheng, Siheng Zhao, Lingpeng Kong, Bailin Wang, Caiming Xiong, Tao Yu
- Ctrlstruct: Dialogue Structure Learning For Open-domain Response Generation Yin Congchi, Li Piji, Ren Zhaochun
- Dynosaur: A Dynamic Growth Paradigm For Instruction-tuning Data Curation Yin Da, Liu Xiao, Yin Fan, Zhong Ming, Bansal Hritik, Han Jiawei, Chang Kai-wei
- Did You Read The Instructions? Rethinking The Effectiveness Of Task Definitions In Instruction Learning Yin Fan, Vig Jesse, Laban Philippe, Joty Shafiq, Xiong Caiming, Wu Chien-sheng Jason
- Modulora: Finetuning 2-bit Llms On Consumer Gpus By Integrating With Modular Quantizers Yin Junjie, Dong Jiahao, Wang Yingheng, De Sa Christopher, Kuleshov Volodymyr
- Intuitive Or Dependent? Investigating Llms' Behavior Style To Conflicting Prompts Ying Jiahao, Cao Yixin, Xiong Kai, He Yidong, Cui Long, Liu Yongbin
- Low-resource Languages Jailbreak GPT-4 Yong Zheng-xin, Menghini Cristina, Bach Stephen H.
- Hugginggpt: Solving AI Tasks With Chatgpt And Its Friends In Hugging Face Yongliang Shen, Kaitao Song, Xu Tan, Dongsheng Li, Weiming Lu, Yueting Zhuang
- Search-adaptor: Embedding Customization For Information Retrieval Yoon Jinsung, Arik Sercan O, Chen Yanfei, Pfister Tomas
- Decoding In-context Learning: Neuroscience-inspired Analysis Of Representations In Large Language Models Yousefi Safoora, Betthauser Leo, Hasanbeig Hosein, Millière Raphaël, Momennejad Ida
- GPTFUZZER: Red Teaming Large Language Models With Auto-generated Jailbreak Prompts Yu Jiahao, Lin Xingwei, Yu Zheng, Xing Xinyu
- Scaling Autoregressive Multi-modal Models: Pretraining And Instruction Tuning Yu Lili, Shi Bowen, Pasunuru Ramakanth, Muller Benjamin, Golovneva Olga, Wang Tianlu, Babu Arun, Tang Binh, Karrer Brian, Sheynin Shelly, Ross Candace, Polyak Adam, Howes Russell, Sharma Vasu, Xu Puxin, Tamoyan Hovhannes, Ashual Oron, Singer Uriel, Li Shang-wen, Zhang Susan, James Richard, Ghosh Gargi, Taigman Yaniv, Fazel-zarandi Maryam, Celikyilmaz Asli, Zettlemoyer Luke, Aghajanyan Armen
- Chain-of-note: Enhancing Robustness In Retrieval-augmented Language Models Yu Wenhao, Zhang Hongming, Pan Xiaoman, Ma Kaixin, Wang Hongwei, Yu Dong
- Reeval: Automatic Hallucination Evaluation For Retrieval-augmented Large Language Models Via Transferable Adversarial Attacks Yu Xiaodong, Cheng Hao, Liu Xiaodong, Roth Dan, Gao Jianfeng
- Training With "paraphrasing The Original Text" Improves Long-context Performance Yu Yijiong, Huang Yongfeng, Qi Zhixiao, Zhou Zhe
- Large Language Model As Attributed Training Data Generator: A Tale Of Diversity And Bias Yu Yue, Zhuang Yuchen, Zhang Jieyu, Meng Yu, Ratner Alexander, Krishna Ranjay, Shen Jiaming, Zhang Chao
- Prophet: Prompting Large Language Models With Complementary Answer Heuristics For Knowledge-based Visual Question Answering Yu Zhou, Ouyang Xuecheng, Shao Zhenwei, Wang Meng, Yu Jun
- Speculative Contrastive Decoding Yuan Hongyi, Lu Keming, Huang Fei, Yuan Zheng, Zhou Chang
- Large Language Models Illuminate A Progressive Pathway To Artificial Healthcare Assistant: A Review Yuan Mingze, Bao Peng, Yuan Jiajia, Shen Yunhao, Chen Zifan, Xie Yi, Zhao Jie, Chen Yang, Zhang Li, Shen Lin, Dong Bin
- Batcheval: Towards Human-like Text Evaluation Yuan Peiwen, Feng Shaoxiong, Li Yiwei, Wang Xinglin, Pan Boyuan, Wang Heda, Li Kan
- Large Language Model Cascades With Mixture Of Thoughts Representations For Cost-efficient Reasoning Yue Murong, Zhao Jie, Zhang Min, Du Liang, Yao Ziyu
- Plan, Eliminate, And Track -- Language Models Are Good Teachers For Embodied Agents Yue Wu, So Yeon Min, Yonatan Bisk, Ruslan Salakhutdinov, Amos Azaria, Yuanzhi Li, Tom Mitchell, Shrimai Prabhumoye
- Mammoth: Building Math Generalist Models Through Hybrid Instruction Tuning Yue Xiang, Qu Xingwei, Zhang Ge, Fu Yao, Huang Wenhao, Sun Huan, Su Yu, Chen Wenhu
- Aligning Large Language Models With Human: A Survey Yufei Wang, Wanjun Zhong, Liangyou Li, Fei Mi, Xingshan Zeng, Wenyong Huang, Lifeng Shang, Xin Jiang, Qun Liu
- Empowering Private Tutoring By Chaining Large Language Models Yulin Chen, Ning Ding, Hai-tao Zheng, Zhiyuan Liu, Maosong Sun, Bowen Zhou
- Retrieval-augmented Generation For Large Language Models: A Survey Yunfan Gao, Yun Xiong, Xinyu Gao, Kangxiang Jia, Jinliu Pan, Yuxi Bi, Yi Dai, Jiawei Sun, Meng Wang, Haofen Wang
- Large Language Models Are Versatile Decomposers: Decompose Evidence And Questions For Table-based Reasoning Yunhu Ye, Binyuan Hui, Min Yang, Binhua Li, Fei Huang, Yongbin Li
- Pre-training To Learn In Context Yuxian Gu, Li Dong, Furu Wei, Minlie Huang
- Text Alignment Is An Efficient Unified Model For Massive NLP Tasks Zha Yuheng, Yang Yichi, Li Ruichen, Hu Zhiting
- Building Open-ended Embodied Agent Via Language-policy Bidirectional Adaptation Zhai Shaopeng, Wang Jie, Zhang Tianyi, Huang Fuxian, Zhang Qi, Zhou Ming, Hou Jing, Qiao Yu, Liu Yu
- Next-chat: An LMM For Chat, Detection And Segmentation Zhang Ao, Yao Yuan, Ji Wei, Liu Zhiyuan, Chua Tat-seng
- Zhujiu: A Multi-dimensional, Multi-faceted Chinese Benchmark For Large Language Models Zhang Baoli, Xie Haining, Du Pengfan, Chen Junhao, Cao Pengfei, Chen Yubo, Liu Shengping, Liu Kang, Zhao Jun
- Mm-narrator: Narrating Long-form Videos With Multimodal In-context Learning Zhang Chaoyi, Lin Kevin, Yang Zhengyuan, Wang Jianfeng, Li Linjie, Lin Chung-ching, Liu Zicheng, Wang Lijuan
- A Comprehensive Analysis Of The Effectiveness Of Large Language Models As Automatic Dialogue Evaluators Zhang Chen, D'haro Luis Fernando, Chen Yiming, Zhang Malu, Li Haizhou
- Xdial-eval: A Multilingual Open-domain Dialogue Evaluation Benchmark Zhang Chen, D'haro Luis Fernando, Tang Chengguang, Shi Ke, Tang Guohua, Li Haizhou
- The Skipped Beat: A Study Of Sociopragmatic Understanding In Llms For 64 Languages Zhang Chiyu, Doan Khai Duy, Liao Qisheng, Abdul-mageed Muhammad
- Video-llama: An Instruction-tuned Audio-visual Language Model For Video Understanding Zhang Hang, Li Xin, Bing Lidong
- Expressive TTS Driven By Natural Language Prompts Using Few Human Annotations Zhang Hanglei, Guo Yiwei, Liu Sen, Chen Xie, Yu Kai
- Towards Perceiving Small Visual Details In Zero-shot Visual Question Answering With Multimodal Llms Zhang Jiarui, Khayatkhoei Mahyar, Chhikara Prateek, Ilievski Filip
- A Two-stage Adaptation Of Large Language Models For Text Ranking Zhang Longhui, Zhang Yanzhao, Long Dingkun, Xie Pengjun, Zhang Meishan, Zhang Min
- Ask An Expert: Leveraging Language Models To Improve Strategic Reasoning In Goal-oriented Dialogue Models Zhang Qiang, Naradowsky Jason, Miyao Yusuke
- Knowgpt: Knowledge Graph Based Prompting For Large Language Models Zhang Qinggang, Dong Junnan, Chen Hao, Zha Daochen, Yu Zailiang, Huang Xiao
- Tell Your Model Where To Attend: Post-hoc Attention Steering For Llms Zhang Qingru, Singh Chandan, Liu Liyuan, Liu Xiaodong, Yu Bin, Gao Jianfeng, Zhao Tuo
- Generation-driven Contrastive Self-training For Zero-shot Text Classification With Instruction-following LLM Zhang Ruohong, Wang Yau-shian, Yang Yiming
- Arcmmlu: A Library And Information Science Benchmark For Large Language Models Zhang Shitou, Li Zuchao, Liu Xingshen, Yang Liming, Wang Ping
- Automl-gpt: Automatic Machine Learning With GPT Zhang Shujian, Gong Chengyue, Wu Lemeng, Liu Xingchao, Zhou Mingyuan
- History-aware Hierarchical Transformer For Multi-session Open-domain Dialogue System Zhang Tong, Liu Yong, Li Boyang, Zeng Zhiwei, Wang Pengwei, You Yuan, Miao Chunyan, Cui Lizhen
- Bridging The Information Gap Between Domain-specific Model And General LLM For Personalized Recommendation Zhang Wenxuan, Liu Hongzhi, Du Yingpeng, Zhu Chen, Song Yang, Zhu Hengshu, Wu Zhonghai
- PMC-VQA: Visual Instruction Tuning For Medical Visual Question Answering Zhang Xiaoman, Wu Chaoyi, Zhao Ziheng, Lin Weixiong, Zhang Ya, Wang Yanfeng, Xie Weidi
- Gpt-4v(ision) As A Generalist Evaluator For Vision-language Tasks Zhang Xinlu, Lu Yujie, Wang Weizhi, Yan An, Yan Jun, Qin Lianke, Wang Heng, Yan Xifeng, Wang William Yang, Petzold Linda Ruth
- Collm: Integrating Collaborative Embeddings Into Large Language Models For Recommendation Zhang Yang, Feng Fuli, Zhang Jizhi, Bao Keqin, Wang Qifan, He Xiangnan
- Dialoguellm: Context And Emotion Knowledge-tuned Large Language Models For Emotion Recognition In Conversations Zhang Yazhou, Wang Mengyao, Wu Youxi, Tiwari Prayag, Li Qiuchi, Wang Benyou, Qin Jing
- Pre-trained Language Models Do Not Help Auto-regressive Text-to-image Generation Zhang Yuhui, Mckinzie Brandon, Gan Zhe, Shankar Vaishaal, Toshev Alexander
- Mani-gpt: A Generative Model For Interactive Robotic Manipulation Zhang Zhe, Chai Wei, Wang Jiankun
- IAG: Induction-augmented Generation Framework For Answering Reasoning Questions Zhang Zhebin, Zhang Xinyu, Ren Yuanhang, Shi Saijiang, Han Meng, Wu Yongkang, Lai Ruofei, Cao Zhao
- PLUG: Leveraging Pivot Language In Cross-lingual Instruction Tuning Zhang Zhihan, Lee Dong-ho, Fang Yuwei, Yu Wenhao, Jia Mengzhao, Jiang Meng, Barbieri Francesco
- Auto-instruct: Automatic Instruction Generation And Ranking For Black-box Language Models Zhang Zhihan, Wang Shuohang, Yu Wenhao, Xu Yichong, Iter Dan, Zeng Qingkai, Liu Yang, Zhu Chenguang, Jiang Meng
- Responsible Task Automation: Empowering Large Language Models As Responsible Task Automators Zhang Zhizheng, Zhang Xiaoyi, Xie Wenxuan, Lu Yan
- Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-thought Reasoning To Language Agents Zhang Zhuosheng, Yao Yao, Zhang Aston, Tang Xiangru, Ma Xinbei, He Zhiwei, Wang Yiming, Gerstein Mark, Wang Rui, Liu Gongshen, Zhao Hai
- You Only Look At Screens: Multimodal Chain-of-action Agents Zhang Zhuosheng, Zhang Aston
- Ehrtutor: Enhancing Patient Understanding Of Discharge Instructions Zhang Zihao, Yao Zonghai, Zhou Huixue, Ouyang Feiyun, Yu Hong
- LLM4DV: Using Large Language Models For Hardware Test Stimuli Generation Zhang Zixi, Chadwick Greg, Mcnally Hugo, Zhao Yiren, Mullins Robert
- Tuning Layernorm In Attention: Towards Efficient Multi-modal LLM Finetuning Zhao Bingchen, Tu Haoqin, Wei Chen, Mei Jieru, Xie Cihang
- Explainability For Large Language Models: A Survey Zhao Haiyan, Chen Hanjie, Yang Fan, Liu Ninghao, Deng Huiqi, Cai Hengyi, Wang Shuaiqiang, Yin Dawei, Du Mengnan
- Chatspot: Bootstrapping Multimodal Llms Via Precise Referring Instruction Tuning Zhao Liang, Yu En, Ge Zheng, Yang Jinrong, Wei Haoran, Zhou Hongyu, Sun Jianjian, Peng Yuang, Dong Runpei, Han Chunrui, Zhang Xiangyu
- Retrieving Multimodal Information For Augmented Generation: A Survey Zhao Ruochen, Chen Hailin, Wang Weishi, Jiao Fangkai, Do Xuan Long, Qin Chengwei, Ding Bosheng, Guo Xiaobao, Li Minzhi, Li Xingxuan, Joty Shafiq
- Ca-lora: Adapting Existing Lora For Compressed Llms To Enable Efficient Multi-tasking On Personal Devices Zhao Weilin, Huang Yuxiang, Han Xu, Liu Zhiyuan, Zhang Zhengyan, Li Kuai, Chen Chen, Yang Tao, Sun Maosong
- DIVKNOWQA: Assessing The Reasoning Ability Of Llms Via Open-domain Question Answering Over Knowledge Base And Text Zhao Wenting, Liu Ye, Niu Tong, Wan Yao, Yu Philip S., Joty Shafiq, Zhou Yingbo, Yavuz Semih
- Easygen: Easing Multimodal Generation With Bidiffuser And Llms Zhao Xiangyu, Liu Bo, Liu Qijiong, Shi Guangyuan, Wu Xiao-ming
- Mind Vs. Mouth: On Measuring Re-judge Inconsistency Of Social Bias In Large Language Models Zhao Yachao, Wang Bo, Zhao Dongming, Huang Kun, Wang Yan, He Ruifang, Hou Yuexian
- Chatbridge: Bridging Modalities With Large Language Model As A Language Catalyst Zhao Zijia, Guo Longteng, Yue Tongtian, Chen Sihan, Shao Shuai, Zhu Xinxin, Yuan Zehuan, Liu Jing
- Tabula: Harnessing Language Models For Tabular Data Synthesis Zhao Zilong, Birke Robert, Chen Lydia
- Adapting Large Language Models By Integrating Collaborative Semantics For Recommendation Zheng Bowen, Hou Yupeng, Lu Hongyu, Chen Yu, Zhao Wayne Xin, Chen Ming, Wen Ji-rong
- Lyra: Orchestrating Dual Correction In Automated Theorem Proving Zheng Chuanyang, Wang Haiming, Xie Enze, Liu Zhengying, Sun Jiankai, Xin Huajian, Shen Jianhao, Li Zhenguo, Li Yu
- Sglang: Efficient Execution Of Structured Language Model Programs Zheng Lianmin, Yin Liangsheng, Xie Zhiqiang, Sun Chuyue, Huang Jeff, Yu Cody Hao, Cao Shiyi, Kozyrakis Christos, Stoica Ion, Gonzalez Joseph E., Barrett Clark, Sheng Ying
- Improving Generalization Of Alignment With Human Preferences Through Group Invariant Learning Zheng Rui, Shen Wei, Hua Yuan, Lai Wenbin, Dou Shihan, Zhou Yuhao, Xi Zhiheng, Wang Xiao, Huang Haoran, Gui Tao, Zhang Qi, Huang Xuanjing
- Gpt-fathom: Benchmarking Large Language Models To Decipher The Evolutionary Path Towards GPT-4 And Beyond Zheng Shen, Zhang Yuyu, Zhu Yijie, Xi Chenguang, Gao Pengyang, Zhou Xun, Chang Kevin Chen-chuan
- Response Length Perception And Sequence Scheduling: An Llm-empowered LLM Inference Pipeline Zheng Zangwei, Ren Xiaozhe, Xue Fuzhao, Luo Yang, Jiang Xin, You Yang
- Large Language Model Can Interpret Latent Space Of Sequential Recommender Zhengyi Yang, Jiancan Wu, Yanchen Luo, Jizhi Zhang, Yancheng Yuan, An Zhang, Xiang Wang, Xiangnan He
- Llamarec: Two-stage Recommendation Using Large Language Models For Ranking Zhenrui Yue, Sara Rabhi, Gabriel De Souza Pereira Moreira, Dong Wang, Even Oldridge
- The Rise And Potential Of Large Language Model Based Agents: A Survey Zhiheng Xi, Wenxiang Chen, Xin Guo, Wei He, Yiwen Ding, Boyang Hong, Ming Zhang, Junzhe Wang, Senjie Jin, Enyu Zhou, Rui Zheng, Xiaoran Fan, Xiao Wang, Limao Xiong, Yuhao Zhou, Weiran Wang, Changhao Jiang, Yicheng Zou, Xiangyang Liu, Zhangyue Yin, Shihan Dou, Rongxiang Weng, Wensen Cheng, Qi Zhang, Wenjuan Qin, Yongyan Zheng, Xipeng Qiu, Xuanjing Huang, Tao Gui
- Leveraging Large Language Models For Pre-trained Recommender Systems Zhixuan Chu, Hongyan Hao, Xin Ouyang, Simeng Wang, Yan Wang, Yue Shen, Jinjie Gu, Qing Cui, Longfei Li, Siqiao Xue, James Y Zhang, Sheng Li
- Mitigating Hallucination In Visual Language Models With Visual Supervision Zhiyang Chen, Yousong Zhu, Yufei Zhan, Zhaowen Li, Chaoyang Zhao, Jinqiao Wang, Ming Tang
- Hierarchical Neural Program Synthesis Zhong Linghan, Lindeborg Ryan, Zhang Jesse, Lim Joseph J., Sun Shao-hua
- Revisiting Token Dropping Strategy In Efficient BERT Pretraining Zhong Qihuang, Ding Liang, Liu Juhua, Liu Xuebo, Zhang Min, Du Bo, Tao Dacheng
- Bag Of Tricks For Effective Language Model Pretraining And Downstream Adaptation: A Case Study On GLUE Zhong Qihuang, Ding Liang, Peng Keqin, Liu Juhua, Du Bo, Shen Li, Zhan Yibing, Tao Dacheng
- Let's Think Outside The Box: Exploring Leap-of-thought In Large Language Models With Creative Humor Generation Zhong Shanshan, Huang Zhongzhan, Gao Shanghua, Wen Wushao, Lin Liang, Zitnik Marinka, Zhou Pan
- Language Agent Tree Search Unifies Reasoning Acting And Planning In Language Models Zhou Andy, Yan Kai, Shlapentokh-rothman Michal, Wang Haohan, Wang Yu-xiong
- Solving Challenging Math Word Problems Using GPT-4 Code Interpreter With Code-based Self-verification Zhou Aojun, Wang Ke, Lu Zimu, Shi Weikang, Luo Sichun, Qin Zipeng, Lu Shaoqing, Jia Anya, Song Linqi, Zhan Mingjie, Li Hongsheng
- Survival Of The Most Influential Prompts: Efficient Black-box Prompt Search Via Clustering And Pruning Zhou Han, Wan Xingchen, Vulić Ivan, Korhonen Anna
- Don't Make Your LLM An Evaluation Benchmark Cheater Zhou Kun, Zhu Yutao, Chen Zhipeng, Chen Wentong, Zhao Wayne Xin, Chen Xu, Lin Yankai, Wen Ji-rong, Han Jiawei
- CONA: A Novel Context-aware Instruction Paradigm For Communication Using Large Language Model Zhou Nan, Tao Xinghui, Chen Xi
- Recurrentgpt: Interactive Generation Of (arbitrarily) Long Text Zhou Wangchunshu, Jiang Yuchen Eleanor, Cui Peng, Wang Tiannan, Xiao Zhenxin, Hou Yifan, Cotterell Ryan, Sachan Mrinmaya
- Pose: Efficient Context Window Extension Of Llms Via Positional Skip-wise Training Zhu Dawei, Yang Nan, Wang Liang, Song Yifan, Wu Wenhao, Wei Furu, Li Sujian
- Dialogue-contextualized Re-ranking For Medical History-taking Zhu Jian, Valmianski Ilya, Kannan Anitha
- Extrapolating Large Language Models To Non-english By Aligning Languages Zhu Wenhao, Lv Yunzhe, Dong Qingxiu, Yuan Fei, Xu Jingjing, Huang Shujian, Kong Lingpeng, Chen Jiajun, Li Lei
- Question Answering As Programming For Solving Time-sensitive Questions Zhu Xinyu, Yang Cheng, Chen Bei, Li Siheng, Lou Jian-guang, Yang Yujiu
- Furthest Reasoning With Plan Assessment: Stable Reasoning Path With Retrieval-augmented Large Language Models Zhu Yin, Luo Zhiling, Cheng Gong
- Ice-score: Instructing Large Language Models To Evaluate Code Zhuo Terry Yue
- Multimodal Chain-of-thought Reasoning In Language Models Zhuosheng Zhang, Aston Zhang, Mu Li, Hai Zhao, George Karypis, Alex Smola
- Tab-cot: Zero-shot Tabular Chain Of Thought Ziqi Jin, Wei Lu
- Dr Chatgpt, Tell Me What I Want To Hear: How Prompt Knowledge Impacts Health Answer Correctness Zuccon Guido, Koopman Bevan
- Chatgpt Hallucinates When Attributing Answers Zuccon Guido, Koopman Bevan, Shaik Razia
- Enhancing Long-term Memory Using Hierarchical Aggregate Tree For Retrieval Augmented Generation A Aadharsh Aadhithya, S Sachin Kumar, P Soman K.
- Knowledge-infused Llm-powered Conversational Health Agent: A Case Study For Diabetes Patients Abbasian Mahyar, Yang Zhongqi, Khatibi Elahe, Zhang Pengfei, Nagesh Nitish, Azimi Iman, Jain Ramesh, Rahmani Amir M.
- Generate Then Retrieve: Conversational Response Retrieval Using Llms As Answer And Query Generators Abbasiantaeb Zahra, Aliannejadi Mohammad
- Are You Still On Track!? Catching LLM Task Drift With Activations Abdelnabi Sahar, Fay Aideen, Cherubin Giovanni, Salem Ahmed, Fritz Mario, Paverd Andrew
- Stackrag Agent: Improving Developer Answers With Retrieval-augmented Generation Abrahamyan Davit, Fard Fatemeh H.
- Pathocl: Path-based Prompt Augmentation For OCL Generation With GPT-4 Abukhalaf Seif, Hamdaqa Mohammad, Khomh Foutse
- Keyformer: KV Cache Reduction Through Key Tokens Selection For Efficient Generative Inference Adnan Muhammad, Arunkumar Akhil, Jain Gaurav, Nair Prashant J., Soloveychik Ilya, Kamath Purushotham
- Promptwizard: Task-aware Agent-driven Prompt Optimization Framework Agarwal Eshaan, Dani Vivek, Ganu Tanuja, Nambi Akshay
- Mememqa: Multimodal Question Answering For Memes Via Rationale-based Inferencing Agarwal Siddhant, Sharma Shivam, Nakov Preslav, Chakraborty Tanmoy
- Tricy: Trigger-guided Data-to-text Generation With Intent Aware Attention-copy Agarwal Vibhav, Ghosh Sourav, Bss Harichandana, Arora Himanshu, Raja Barath Raj Kandur
- Codemirage: Hallucinations In Code Generated By Large Language Models Agarwal Vibhor, Pei Yulong, Alamir Salwa, Liu Xiaomo
- Can't Remember Details In Long Documents? You Need Some R&R Agrawal Devanshu, Gao Shang, Gajek Martin
- Mindful-rag: A Study Of Points Of Failure In Retrieval Augmented Generation Agrawal Garima, Kumarage Tharindu, Alghamdi Zeyad, Liu Huan
- Can Llms Perform Structured Graph Reasoning? Agrawal Palaash, Vasania Shavak, Tan Cheston
- Direct-inverse Prompting: Analyzing Llms' Discriminative Capacity In Self-improving Generation Ahn Jihyun Janice, Kamoi Ryo, Cheng Lu, Zhang Rui, Yin Wenpeng
- Sphinx: Sample Efficient Multilingual Instruction Fine-tuning Through N-shot Guided Prompting Ahuja Sanchit, Tanmay Kumar, Chauhan Hardik Hansrajbhai, Patra Barun, Aggarwal Kriti, Del Corro Luciano, Mitra Arindam, Dhamecha Tejas Indulal, Awadallah Ahmed, Choudhary Monojit, Chaudhary Vishrav, Sitaram Sunayana
- Enhancing Pre-trained Generative Language Models With Question Attended Span Extraction On Machine Reading Comprehension Ai Lin, Hui Zheng, Liu Zizhou, Hirschberg Julia
- Examining The Robustness Of LLM Evaluation To The Distributional Assumptions Of Benchmarks Ailem Melissa, Marazopoulou Katerina, Siska Charlotte, Bono James
- Clustered Retrieved Augmented Generation (CRAG) Akesson Simon, Santos Frances A.
- FACTS About Building Retrieval Augmented Generation-based Chatbots Akkiraju Rama, Xu Anbang, Bora Deepak, Yu Tan, An Lu, Seth Vishal, Shukla Aaditya, Gundecha Pritam, Mehta Hridhay, Jha Ashwin, Raj Prithvi, Balasubramanian Abhinav, Maram Murali, Muthusamy Guru, Annepally Shivakesh Reddy, Knowles Sidney, Du Min, Burnett Nick, Javiya Sean, Marannan Ashok, Kumari Mamta, Jha Surbhi, Dereszenski Ethan, Chakraborty Anupam, Ranjan Subhash, Terfai Amina, Surya Anoop, Mercer Tracey, Thanigachalam Vinodh Kumar, Bar Tamar, Krishnan Sanjana, Kilaru Samy, Jaksic Jasmine, Algarici Nave, Liberman Jacob, Conway Joey, Nayyar Sonu, Boitano Justin
- Have We Reached AGI? Comparing Chatgpt, Claude, And Gemini To Human Literacy And Education Benchmarks Akpan Mfon
- Enhancing Textbook Question Answering Task With Large Language Models And Retrieval Augmented Generation Alawwad Hessa Abdulrahman, Alhothali Areej, Naseem Usman, Alkhathlan Ali, Jamal Amani
- User Preferences For Large Language Model Versus Template-based Explanations Of Movie Recommendations: A Pilot Study Albert Julien, Balfroid Martin, Doh Miriam, Bogaert Jeremie, La Fisca Luca, De Vos Liesbet, Renard Bryan, Stragier Vincent, Jean Emmanuel
- Should We Fine-tune Or RAG? Evaluating Different Techniques To Adapt Llms For Dialogue Alghisi Simone, Rizzoli Massimo, Roccabruna Gabriel, Mousavi Seyed Mahed, Riccardi Giuseppe
- PROMPT-SAW: Leveraging Relation-aware Graphs For Textual Prompt Compression Ali Muhammad Asif, Li Zhengping, Yang Shu, Cheng Keyuan, Cao Yang, Huang Tianhao, Hu Lijie, Yu Lu, Wang Di
- Evaluating The Retrieval Component In Llm-based Question Answering Systems Alinejad Ashkan, Kumar Krtin, Vahdat Ali
- Toward Conversational Agents With Context And Time Sensitive Long-term Memory Alonso Nick, Figliolia Tomás, Ndirango Anthony, Millidge Beren
- Understanding Alignment In Multimodal Llms: A Comprehensive Study Amirloo Elmira, Fauconnier Jean-philippe, Roesmann Christoph, Kerl Christian, Boney Rinu, Qian Yusu, Wang Zirui, Dehghan Afshin, Yang Yinfei, Gan Zhe, Grasch Peter
- Make Your LLM Fully Utilize The Context An Shengnan, Ma Zexiong, Lin Zeqi, Zheng Nanning, Lou Jian-guang
- Investigating The Robustness Of Llms On Math Word Problems Anantheswaran Ujjwala, Gupta Himanshu, Scaria Kevin, Verma Shreyas, Baral Chitta, Mishra Swaroop
- Critique-out-loud Reward Models Ankner Zachary, Paul Mansheej, Cui Brandon, Chang Jonathan D., Ammanabrolu Prithviraj
- Generalization V.s. Memorization: Tracing Language Models' Capabilities Back To Pretraining Data Antoniades Antonis, Wang Xinyi, Elazar Yanai, Amayuelas Alfonso, Albalak Alon, Zhang Kexun, Wang William Yang
- Evaluating Chatgpt On Nuclear Domain-specific Data Anwar Muhammad, De Costa Mischa, Hammad Issam, Lau Daniel
- SPAFIT: Stratified Progressive Adaptation Fine-tuning For Pre-trained Large Language Models Arora Samir, Wang Liangliang
- Calmqa: Exploring Culturally Specific Long-form Question Answering Across 23 Languages Arora Shane, Karpinska Marzena, Chen Hung-ting, Bhattacharjee Ipsita, Iyyer Mohit, Choi Eunsol
- Exploring The Impact Of Large Language Models On Recommender Systems: An Extensive Review Arpita Vats, Vinija Jain, Rahul Raja, Aman Chadha
- Exploring The Robustness Of Task-oriented Dialogue Systems For Colloquial German Varieties Artemova Ekaterina, Blaschke Verena, Plank Barbara
- Enhancing Holonic Architecture With Natural Language Processing For System Of Systems Ashfaq Muhammad, Sadik Ahmed R., Mikkonen Tommi, Waseem Muhammad, Mäkitalo Niko
- Evaluating Large Language Models On The GMAT: Implications For The Future Of Business Education Ashrafimoghari Vahid, Gürkan Necdet, Suchow Jordan W.
- Self-seeding And Multi-intent Self-instructing Llms For Generating Intent-aware Information-seeking Dialogs Askari Arian, Petcu Roxana, Meng Chuan, Aliannejadi Mohammad, Abolghasemi Amin, Kanoulas Evangelos, Verberne Suzan
- Elicitron: An LLM Agent-based Simulation Framework For Design Requirements Elicitation Ataei Mohammadmehdi, Cheong Hyunmin, Grandi Daniele, Wang Ye, Morris Nigel, Tessier Alexander
- Can Open-source Llms Compete With Commercial Models? Exploring The Few-shot Performance Of Current GPT Models In Biomedical Tasks Ateia Samy, Kruschwitz Udo
- Convocache: Smart Re-use Of Chatbot Responses Atkins Conor, Wood Ian, Kaafar Mohamed Ali, Asghar Hassan, Basta Nardine, Kepkowski Michal
- GRAD-SUM: Leveraging Gradient Summarization For Optimal Prompt Engineering Austin Derek, Chartock Elliott
- Current State Of LLM Risks And AI Guardrails Ayyamperumal Suriya Ganesh, Ge Limin
- Accuracy And Consistency Of Llms In The Registered Dietitian Exam: The Impact Of Prompt Engineering And Knowledge Retrieval Azimi Iman, Qi Mohan, Wang Li, Rahmani Amir M., Li Youlin
- Lamda: Large Model Fine-tuning Via Spectrally Decomposed Low-dimensional Adaptation Azizi Seyedarmin, Kundu Souvik, Pedram Massoud
- Self-improving Customer Review Response Generation Based On Llms Azov Guy, Pelc Tatiana, Alon Adi Fledel, Kamhi Gila
- Evaluating The Efficacy Of Open-source Llms In Enterprise-specific RAG Systems: A Comparative Study Of Performance And Scalability B Gautam, Purwar Anupam
- Generating Query Recommendations Via Llms Bacciu Andrea, Palumbo Enrico, Damianou Andreas, Tonellotto Nicola, Silvestri Fabrizio
- Hybrid Preference Optimization: Augmenting Direct Preference Optimization With Auxiliary Objectives Badrinath Anirudhan, Agarwal Prabhat, Xu Jiajing
- Reference-guided Verdict: Llms-as-judges In Automatic Evaluation Of Free-form Text Badshah Sher, Sajjad Hassan
- Efficient Model-agnostic Alignment Via Bayesian Persuasion Bai Fengshuo, Wang Mingzhi, Zhang Zhaowei, Chen Boyuan, Xu Yinda, Wen Ying, Yang Yaodong
- Sparsellm: Towards Global Pruning For Pre-trained Language Models Bai Guangji, Li Yijiang, Ling Chen, Kim Kibaek, Zhao Liang
- Federated Fine-tuning Of Large Language Models Under Heterogeneous Tasks And Client Resources Bai Jiamu, Chen Daoyuan, Qian Bingchen, Yao Liuyi, Li Yaliang
- Longwriter: Unleashing 10,000+ Word Generation From Long Context Llms Bai Yushi, Zhang Jiajie, Lv Xin, Zheng Linzhi, Zhu Siqi, Hou Lei, Dong Yuxiao, Tang Jie, Li Juanzi
- RAG Vs Fine-tuning: Pipelines, Tradeoffs, And A Case Study On Agriculture Balaguer Angels, Benara Vinamra, Cunha Renato Luiz De Freitas, Filho Roberto De M. Estevão, Hendry Todd, Holstein Daniel, Marsman Jennifer, Mecklenburg Nick, Malvar Sara, Nunes Leonardo O., Padilha Rafael, Sharp Morris, Silva Bruno, Sharma Swati, Aski Vijay, Chandra Ranveer
- MAIRA-2: Grounded Radiology Report Generation Bannur Shruthi, Bouzid Kenza, Castro Daniel C., Schwaighofer Anton, Bond-taylor Sam, Ilse Maximilian, Pérez-garcía Fernando, Salvatelli Valentina, Sharma Harshita, Meissen Felix, Ranjit Mercy, Srivastav Shaury, Gong Julia, Falck Fabian, Oktay Ozan, Thieme Anja, Lungren Matthew P., Wetscherek Maria Teodora, Alvarez-valle Javier, Hyland Stephanie L.
- Smaller, Weaker, Yet Better: Training LLM Reasoners Via Compute-optimal Sampling Bansal Hritik, Hosseini Arian, Agarwal Rishabh, Tran Vinh Q., Kazemi Mehran
- Comparing Bad Apples To Good Oranges: Aligning Large Language Models Via Joint Preference Optimization Bansal Hritik, Suvarna Ashima, Bhatt Gantavya, Peng Nanyun, Chang Kai-wei, Grover Aditya
- Decoding Matters: Addressing Amplification Bias And Homogeneity Issue For Llm-based Recommendation Bao Keqin, Zhang Jizhi, Zhang Yang, Huo Xinyue, Chen Chong, Feng Fuli
- Find The Gap: Knowledge Base Reasoning For Visual Question Answering Barezi Elham J., Kordjamshidi Parisa
- Reinforcement Learning From LLM Feedback To Counteract Goal Misgeneralization Barj Houda Nait El, Sautory Theophile
- Fine-tuning Or Fine-failing? Debunking Performance Myths In Large Language Models Barnett Scott, Brannelly Zac, Kurniawan Stefanus, Wong Sheng
- Medaide: Leveraging Large Language Models For On-premise Medical Assistance On Edge Devices Basit Abdul, Hussain Khizar, Hanif Muhammad Abdullah, Shafique Muhammad
- A Comparative Study Of DSL Code Generation: Fine-tuning Vs. Optimized Retrieval Augmentation Bassamzadeh Nastaran, Methani Chhaya
- Understanding Information Storage And Transfer In Multi-modal Large Language Models Basu Samyadeep, Grayson Martin, Morrison Cecily, Nushi Besmira, Feizi Soheil, Massiceti Daniela
- How Do You Know That? Teaching Generative Language Models To Reference Answers To Biomedical Questions Bašaragin Bojana, Ljajić Adela, Medvecki Darija, Cassano Lorenzo, Košprdić Miloš, Milošević Nikola
- Xlstm: Extended Long Short-term Memory Beck Maximilian, Pöppel Korbinian, Spanring Markus, Auer Andreas, Prudnikova Oleksandra, Kopp Michael, Klambauer Günter, Brandstetter Johannes, Hochreiter Sepp
- Internalinspector \(I^2\): Robust Confidence Estimation In Llms Through Internal States Beigi Mohammad, Shen Ying, Yang Runing, Lin Zihao, Wang Qifan, Mohan Ankith, He Jianfeng, Jin Ming, Lu Chang-tien, Huang Lifu
- Divergent Creativity In Humans And Large Language Models Bellemare-pepin Antoine, Lespinasse François, Thölke Philipp, Harel Yann, Mathewson Kory, Olson Jay A., Bengio Yoshua, Jerbi Karim
- Luna: An Evaluation Foundation Model To Catch Language Model Hallucinations With High Accuracy And Low Cost Belyi Masha, Friel Robert, Shao Shuai, Sanyal Atindriyo
- Machine Translation Hallucination Detection For Low And High Resource Languages Using Large Language Models Benkirane Kenza, Gongas Laura, Pelles Shahar, Fuchs Naomi, Darmon Joshua, Stenetorp Pontus, Adelani David Ifeoluwa, Sánchez Eduardo
- Redefining "hallucination" In Llms: Towards A Psychology-informed Framework For Mitigating Misinformation Berberette Elijah, Hutchins Jack, Sadovnik Amir
- Multi-head RAG: Solving Multi-aspect Problems With Llms Besta Maciej, Kubicek Ales, Niggli Roman, Gerstenberger Robert, Weitzendorf Lucas, Chi Mingyuan, Iff Patrick, Gajda Joanna, Nyczyk Piotr, Müller Jürgen, Niewiadomski Hubert, Chrapek Marcin, Podstawski Michał, Hoefler Torsten
- Enhancing Event Reasoning In Large Language Models Through Instruction Fine-tuning With Semantic Causal Graphs Bethany Mazal, Bethany Emet, Wherry Brandon, Chiang Cho-yu, Vishwamitra Nishant, Rios Anthony, Najafirad Peyman
- Guiding Llms The Right Way: Fast, Non-invasive Constrained Generation Beurer-kellner Luca, Fischer Marc, Vechev Martin
- Evaluating AI For Law: Bridging The Gap With Open-source Solutions Bhambhoria Rohan, Dahan Samuel, Li Jonathan, Zhu Xiaodan
- Efficient Reinforcement Learning Via Large Language Model-based Search Bhambri Siddhant, Bhattacharjee Amrita, Liu Huan, Kambhampati Subbarao
- Self-amplify: Improving Small Language Models With Self Post Hoc Explanations Bhan Milan, Vittaut Jean-noel, Chesneau Nicolas, Lesot Marie-jeanne
- An Approach To Build Zero-shot Slot-filling System For Industry-grade Conversational Assistants Bhargav G P Shrivatsa, Neelam Sumit, Sharma Udit, Ikbal Shajith, Sreedhar Dheeraj, Karanam Hima, Joshi Sachindra, Dhoolia Pankaj, Garg Dinesh, Croutwater Kyle, Qi Haode, Wayne Eric, Murdock J William
- Zero-shot Llm-guided Counterfactual Generation For Text Bhattacharjee Amrita, Moraffah Raha, Garland Joshua, Liu Huan
- Enhancing Code Translation In Language Models With Few-shot Learning Via Retrieval-augmented Generation Bhattarai Manish, Santos Javier E., Jones Shawn, Biswas Ayan, Alexandrov Boian, O'malley Daniel
- Iterative Refinement Of Project-level Code Context For Precise Code Generation With Compiler Feedback Bi Zhangqian, Wan Yao, Wang Zheng, Zhang Hongyu, Guan Batu, Lu Fangxin, Zhang Zili, Sui Yulei, Jin Hai, Shi Xuanhua
- Beyond Human Norms: Unveiling Unique Values Of Large Language Models Through Interdisciplinary Approaches Biedma Pablo, Yi Xiaoyuan, Huang Linus, Sun Maosong, Xie Xing
- Lipost: Improved Content Understanding With Effective Use Of Multi-task Contrastive Learning Bindal Akanksha, Ramanujam Sudarshan, Golland Dave, Hazen Tj, Jiang Tina, Zhang Fengyu, Yan Peng
- GRAM: Global Reasoning For Multi-page VQA Blau Tsachi, Fogel Sharon, Ronen Roi, Golts Alona, Ganz Roy, Avraham Elad Ben, Aberdam Aviad, Tsiper Shahar, Litman Ron
- Measuring Moral Inconsistencies In Large Language Models Bonagiri Vamshi Krishna, Vennam Sreeram, Gaur Manas, Kumaraguru Ponnurangam
- Batch Universal Prediction Bondaschi Marco, Gastpar Michael
- Open Artificial Knowledge Borisov Vadim, Schreiber Richard H.
- MALTO At Semeval-2024 Task 6: Leveraging Synthetic Data For LLM Hallucination Detection Borra Federico, Savelli Claudio, Rosso Giacomo, Koudounas Alkis, Giobergia Flavio
- Transformers Meet Neural Algorithmic Reasoners Bounsi Wilfried, Ibarz Borja, Dudzik Andrew, Hamrick Jessica B., Markeeva Larisa, Vitvitskyi Alex, Pascanu Razvan, Veličković Petar
- Retrieving Examples From Memory For Retrieval Augmented Neural Machine Translation: A Systematic Comparison Bouthors Maxime, Crego Josep, Yvon Francois
- Embardiment: An Embodied AI Agent For Productivity In XR Bovo Riccardo, Abreu Steven, Ahuja Karan, Gonzalez Eric J, Cheng Li-te, Gonzalez-franco Mar
- Improving Sequential Recommendations With Llms Boz Artun, Zorgdrager Wouter, Kotti Zoe, Harte Jesse, Louridas Panos, Jannach Dietmar, Fragkoulis Marios
- ACORN: Aspect-wise Commonsense Reasoning Explanation Evaluation Brassard Ana, Heinzerling Benjamin, Kudo Keito, Sakaguchi Keisuke, Inui Kentaro
- Generation, Distillation And Evaluation Of Motivational Interviewing-style Reflections With A Foundational Language Model Brown Andrew, Zhu Jiading, Abdelwahab Mohamed, Dong Alec, Wang Cindy, Rose Jonathan
- Retrieve, Generate, Evaluate: A Case Study For Medical Paraphrases Generation With Small Language Models Buhnila Ioana, Sinha Aman, Constant Mathieu
- Pangea: Procedural Artificial Narrative Using Generative AI For Turn-based Video Games Buongiorno Steph, Klinkert Lawrence Jake, Chawla Tanishq, Zhuang Zixin, Clark Corey
- Aligning Large Language Models With Counterfactual DPO Butcher Bradley
- Investigating Wit, Creativity, And Detectability Of Large Language Models In Domain-specific Writing Style Adaptation Of Reddit's Showerthoughts Buz Tolga, Frost Benjamin, Genchev Nikola, Schneider Moritz, Kaffee Lucie-aimée, De Melo Gerard
- ARES: Alternating Reinforcement Learning And Supervised Fine-tuning For Enhanced Multi-modal Chain-of-thought Reasoning Through Diverse AI Feedback Byun Ju-seung, Chun Jiyun, Kil Jihyung, Perrault Andrew
- Reducing Hallucination In Structured Outputs Via Retrieval-augmented Generation Béchard Patrice, Ayala Orlando Marquez
- Olora: Orthonormal Low-rank Adaptation Of Large Language Models Büyükakyüz Kerim
- On Large Language Models In National Security Applications Caballero William N., Jenkins Phillip R.
- In-context Learning And Fine-tuning GPT For Argument Mining Cabessa Jérémie, Hernault Hugo, Mushtaq Umer
- TACT: Advancing Complex Aggregative Reasoning With Information Extraction Tools Caciularu Avi, Jacovi Alon, Ben-david Eyal, Goldshtein Sasha, Schuster Tal, Herzig Jonathan, Elidan Gal, Globerson Amir
- Wiki-llava: Hierarchical Retrieval-augmented Generation For Multimodal Llms Caffagni Davide, Cocchi Federico, Moratelli Nicholas, Sarto Sara, Cornia Marcella, Baraldi Lorenzo, Cucchiara Rita
- Rethinking How To Evaluate Language Model Jailbreak Cai Hongyu, Arunasalam Arjun, Lin Leo Y., Bianchi Antonio, Celik Z. Berkay
- Lococo: Dropping In Convolutions For Long Context Compression Cai Ruisi, Tian Yuandong, Wang Zhangyang, Chen Beidi
- Medusa: Simple LLM Inference Acceleration Framework With Multiple Decoding Heads Cai Tianle, Li Yuhong, Geng Zhengyang, Peng Hongwu, Lee Jason D., Chen Deming, Dao Tri
- PRESTO: Progressive Pretraining Enhances Synthetic Chemistry Outcomes Cao He, Shao Yanjun, Liu Zhiyuan, Liu Zijing, Tang Xiangru, Yao Yuan, Li Yu
- Recent Advances In Text Embedding: A Comprehensive Review Of Top-performing Methods On The MTEB Benchmark Cao Hongliu
- Tonggu: Mastering Classical Chinese Understanding With Knowledge-grounded Large Language Models Cao Jiahuan, Peng Dezhi, Zhang Peirong, Shi Yongxin, Liu Yang, Ding Kai, Jin Lianwen
- Genrec: Generative Sequential Recommendation With Large Language Models Cao Panfeng, Lio Pietro
- Verifiable Generation With Subsentence-level Fine-grained Citations Cao Shuyang, Wang Lu
- Graphinsight: Unlocking Insights In Large Language Models For Graph Structure Understanding Cao Yukun, Han Shuo, Gao Zengyi, Ding Zezhong, Xie Xike, Zhou S. Kevin
- Retaining Key Information Under High Compression Ratios: Query-guided Compressor For Llms Cao Zhiwei, Cao Qian, Lu Yu, Peng Ningxin, Huang Luyang, Cheng Shanbo, Su Jinsong
- Head-wise Shareable Attention For Large Language Models Cao Zouying, Yang Yifei, Zhao Hai
- A Review Of Multi-modal Large Language And Vision Models Carolan Kilian, Fennelly Laura, Smeaton Alan F.
- Can Few-shot Work In Long-context? Recycling The Context To Generate Demonstrations Cattan Arie, Jacovi Alon, Fabrikant Alex, Herzig Jonathan, Aharoni Roee, Rashkin Hannah, Marcus Dror, Hassidim Avinatan, Matias Yossi, Szpektor Idan, Caciularu Avi
- AMEX: Android Multi-annotation Expo Dataset For Mobile GUI Agents Chai Yuxiang, Huang Siyuan, Niu Yazhe, Xiao Han, Liu Liang, Zhang Dingyu, Gao Peng, Ren Shuai, Li Hongsheng
- Detoxbench: Benchmarking Large Language Models For Multitask Fraud & Abuse Detection Chakraborty Joymallya, Xia Wei, Majumder Anirban, Ma Dan, Chaabene Walid, Janvekar Naveed
- Towards Neural Synthesis For Smt-assisted Proof-oriented Programming Chakraborty Saikat, Ebner Gabriel, Bhat Siddharth, Fakhoury Sarah, Fatima Sakina, Lahiri Shuvendu, Swamy Nikhil
- Investigating Llms As Voting Assistants Via Contextual Augmentation: A Case Study On The European Parliament Elections 2024 Chalkidis Ilias
- Dense Reward For Free In Reinforcement Learning From Human Feedback Chan Alex J., Sun Hao, Holt Samuel, Van Der Schaar Mihaela
- RQ-RAG: Learning To Refine Queries For Retrieval Augmented Generation Chan Chi-min, Xu Chunpu, Yuan Ruibin, Luo Hongyin, Xue Wei, Guo Yike, Fu Jie
- Scaling Synthetic Data Creation With 1,000,000,000 Personas Chan Xin, Wang Xiaoyang, Yu Dian, Mi Haitao, Yu Dong
- AMGPT: A Large Language Model For Contextual Querying In Additive Manufacturing Chandrasekhar Achuth, Chan Jonathan, Ogoke Francis, Ajenifujah Olabode, Farimani Amir Barati
- Benchmarking Cognitive Domains For Llms: Insights From Taiwanese Hakka Culture Chang Chen-chi, Chen Ching-yuan, Lee Hung-shin, Lee Chih-cheng
- Speechprompt: Prompting Speech Language Models For Speech Processing Tasks Chang Kai-wei, Wu Haibin, Wang Yu-kai, Wu Yuan-kuei, Shen Hua, Tseng Wei-cheng, Kang Iu-thing, Li Shang-wen, Lee Hung-yi
- When Parts Are Greater Than Sums: Individual LLM Components Can Outperform Full Models Chang Ting-yun, Thomason Jesse, Jia Robin
- Detecting Hallucination And Coverage Errors In Retrieval Augmented Generation For Controversial Topics Chang Tyler A., Tomanek Katrin, Hoffmann Jessica, Thain Nithum, Van Liemt Erin, Meier-hellstern Kathleen, Dixon Lucas
- Injecting Salesperson's Dialogue Strategies In Large Language Models With Chain-of-thought Reasoning Chang Wen-yu, Chen Yun-nung
- Notellm: A Retrievable Large Language Model For Note Recommendation Chao Zhang, Shiwei Wu, Haoxin Zhang, Tong Xu, Yan Gao, Yao Hu, Di Wu, Enhong Chen
- Language Models Can Exploit Cross-task In-context Learning For Data-scarce Novel Tasks Chatterjee Anwoy, Tanwar Eshaan, Dutta Subhabrata, Chakraborty Tanmoy
- Phantom: General Trigger Attacks On Retrieval Augmented Language Generation Chaudhari Harsh, Severi Giorgio, Abascal John, Jagielski Matthew, Choquette-choo Christopher A., Nasr Milad, Nita-rotaru Cristina, Oprea Alina
- Large Language Models As Instruments Of Power: New Regimes Of Autonomous Manipulation And Control Chaudhary Yaqub, Penn Jonnie
- Veagle: Advancements In Multimodal Representation Learning Chawla Rajat, Datta Arkajit, Verma Tushar, Jha Adarsh, Gautam Anmol, Vatsal Ayush, Chaterjee Sukrit, Ns Mukunda, Bhola Ishaan
- Coupling Speech Encoders With Downstream Text Models Chelba Ciprian, Schalkwyk Johan
- DUAL-REFLECT: Enhancing Large Language Models For Reflective Translation Through Dual Learning Feedback Mechanisms Chen Andong, Lou Lianzhang, Chen Kehai, Bai Xuefeng, Xiang Yang, Yang Muyun, Zhao Tiejun, Zhang Min
- The Dark Side Of Human Feedback: Poisoning Large Language Models Via User Inputs Chen Bocheng, Guo Hanqing, Wang Guangjing, Wang Yuanda, Yan Qiben
- Livemind: Low-latency Large Language Models With Simultaneous Inference Chen Chuangtao, Zhang Grace Li, Yin Xunzhao, Zhuo Cheng, Schlichtmann Ulf, Li Bing
- Lower Layer Matters: Alleviating Hallucination Via Multi-layer Fusion Contrastive Decoding With Truthfulness Refocused Chen Dingwei, Fang Feiteng, Ni Shiwen, Liang Feng, Xu Ruifeng, Yang Min, Li Chengming
- GUI-WORLD: A Dataset For Gui-oriented Multimodal Llm-based Agents Chen Dongping, Huang Yue, Wu Siyuan, Tang Jingyu, Chen Liuyi, Bai Yilin, He Zhigang, Wang Chenlong, Zhou Huichi, Li Yiqiang, Zhou Tianshuo, Yu Yue, Gao Chujie, Zhang Qihui, Gui Yi, Li Zhen, Wan Yao, Zhou Pan, Gao Jianfeng, Sun Lichao
- Hardware-aware Parallel Prompt Decoding For Memory-efficient Acceleration Of LLM Inference Chen Hao Mark, Luk Wayne, Yiu Ka Fai Cedric, Li Rui, Mishchenko Konstantin, Venieris Stylianos I., Fan Hongxiang
- Nlperturbator: Studying The Robustness Of Code Llms To Natural Language Variations Chen Junkai, Li Zhenhao, Hu Xing, Xia Xin
- Reasoning Runtime Behavior Of A Program With LLM: How Far Are We? Chen Junkai, Pan Zhiyuan, Hu Xing, Li Zhenhao, Li Ge, Xia Xin
- Pixart-\(\Sigma\): Weak-to-strong Training Of Diffusion Transformer For 4K Text-to-image Generation Chen Junsong, Ge Chongjian, Xie Enze, Wu Yue, Yao Lewei, Ren Xiaozhe, Wang Zhongdao, Luo Ping, Lu Huchuan, Li Zhenguo
- OMPGPT: A Generative Pre-trained Transformer Model For Openmp Chen Le, Bhattacharjee Arijit, Ahmed Nesreen, Hasabnis Niranjan, Oren Gal, Vo Vy, Jannesari Ali
- Teaching Large Language Models To Express Knowledge Boundary From Their Own Signals Chen Lida, Liang Zujie, Wang Xintao, Liang Jiaqing, Xiao Yanghua, Wei Feng, Chen Jinglei, Hao Zhenghong, Han Bing, Wang Wei
- Are We On The Right Way For Evaluating Large Vision-language Models? Chen Lin, Li Jinsong, Dong Xiaoyi, Zhang Pan, Zang Yuhang, Chen Zehui, Duan Haodong, Wang Jiaqi, Qiao Yu, Lin Dahua, Zhao Feng
- Bailong: Bilingual Transfer Learning Based On Qlora And Zip-tie Embedding Chen Lung-chuan, Li Zong-ru
- Graphwiz: An Instruction-following Language Model For Graph Problems Chen Nuo, Li Yuhan, Tang Jianheng, Li Jia
- Enhancing Visual Question Answering Through Ranking-based Hybrid Training And Multimodal Fusion Chen Peiyuan, Zhang Zecheng, Dong Yiping, Zhou Li, Wang Han
- M\(^3\)cot: A Novel Benchmark For Multi-domain Multi-step Multi-modal Chain-of-thought Chen Qiguang, Qin Libo, Zhang Jin, Chen Zhi, Xu Xiao, Che Wanxiang
- Lifelong Knowledge Editing For Llms With Retrieval-augmented Continuous Prompt Learning Chen Qizhou, Zhang Taolin, He Xiaofeng, Li Dongyang, Wang Chengyu, Huang Longtao, Xue Hui
- Llm-based Multi-hop Question Answering With Knowledge Graph Integration In Evolving Environments Chen Ruirui, Jiang Weifeng, Qin Chengwei, Rawal Ishaan Singh, Tan Cheston, Choi Dongkyu, Xiong Bo, Ai Bo
- Editable Fairness: Fine-grained Bias Mitigation In Language Models Chen Ruizhe, Li Yichen, Yang Jianfei, Zhou Joey Tianyi, Liu Zuozhu
- Apollonion: Profile-centric Dialog Agent Chen Shangyu, Zhao Zibo, Zhao Yuanyuan, Li Xiang
- In-context Sharpness As Alerts: An Inner Representation Perspective For Hallucination Mitigation Chen Shiqi, Xiong Miao, Liu Junteng, Wu Zhengxuan, Xiao Teng, Gao Siyang, He Junxian
- Advancing Tool-augmented Large Language Models: Integrating Insights From Errors In Inference Trees Chen Sijia, Wang Yibo, Wu Yi-feng, Chen Qing-guo, Xu Zhao, Luo Weihua, Zhang Kaifu, Zhang Lijun
- Ternaryllm: Ternarized Large Language Model Chen Tianqi, Li Zhe, Xu Weixiang, Zhu Zeyu, Li Dong, Tian Lu, Barsoum Emad, Wang Peisong, Cheng Jian
- Intelligent Tutor: Leveraging Chatgpt And Microsoft Copilot Studio To Deliver A Generative AI Student Support And Feedback System Within Teams Chen Wei-yu
- Beyond Natural Language: Llms Leveraging Alternative Formats For Enhanced Reasoning And Communication Chen Weize, Yuan Chenfei, Yuan Jiarui, Su Yusheng, Qian Chen, Yang Cheng, Xie Ruobing, Liu Zhiyuan, Sun Maosong
- Llast: Improved End-to-end Speech Translation System Leveraged By Large Language Models Chen Xi, Zhang Songyang, Bai Qibing, Chen Kai, Nakamura Satoshi
- Eyegpt: Ophthalmic Assistant With Large Language Models Chen Xiaolan, Zhao Ziwei, Zhang Weiyi, Xu Pusheng, Gao Le, Xu Mingpu, Wu Yue, Li Yinwen, Shi Danli, He Mingguang
- Cause-aware Empathetic Response Generation Via Chain-of-thought Fine-tuning Chen Xinhao, Yang Chong, Lan Man, Cai Li, Chen Yang, Hu Tu, Zhuang Xinlin, Zhou Aimin
- Hiqa: A Hierarchical Contextual Augmentation RAG For Massive Documents QA Chen Xinyue, Gao Pengyu, Song Jiangjiang, Tan Xiaoyang
- Iteralign: Iterative Constitutional Alignment Of Large Language Models Chen Xiusi, Wen Hongzhi, Nag Sreyashi, Luo Chen, Yin Qingyu, Li Ruirui, Li Zheng, Wang Wei
- Can We Rely On LLM Agents To Draft Long-horizon Plans? Let's Take Travelplanner As An Example Chen Yanan, Pesaranghader Ali, Sadhu Tanmana, Yi Dong Hoon
- Simulating Field Experiments With Large Language Models Chen Yaoyu, Hu Yuheng, Lu Yingda
- Recoverable Compression: A Multimodal Vision Token Recovery Mechanism Guided By Text Information Chen Yi, Xu Jian, Zhang Xu-yao, Liu Wen-zhuo, Liu Yang-yang, Liu Cheng-lin
- MAPO: Boosting Large Language Model Performance With Model-adaptive Prompt Optimization Chen Yuyan, Wen Zhihao, Fan Ge, Chen Zhengyu, Wu Wei, Liu Dayiheng, Li Zhixu, Liu Bang, Xiao Yanghua
- Towards Boosting Llms-driven Relevance Modeling With Progressive Retrieved Behavior-augmented Prompting Chen Zeyuan, Wu Haiyan, Wu Kaixin, Chen Wei, Zhong Mingjie, Xu Jia, Liu Zhongyi, Zhang Wei
- MLLM Is A Strong Reranker: Advancing Multimodal Retrieval-augmented Generation Via Knowledge-enhanced Reranking And Noise-injected Training Chen Zhanpeng, Xu Chengjin, Qi Yiyan, Guo Jian
- Autoprm: Automating Procedural Supervision For Multi-step Reasoning Via Controllable Question Decomposition Chen Zhaorun, Zhao Zhuokai, Zhu Zhihong, Zhang Ruiqi, Li Xiang, Raj Bhiksha, Yao Huaxiu
- Dia-llama: Towards Large Language Model-driven CT Report Generation Chen Zhixuan, Luo Luyang, Bie Yequan, Chen Hao
- Improving Retrieval Augmented Open-domain Question-answering With Vectorized Contexts Chen Zhuo, Wang Xinyu, Jiang Yong, Xie Pengjun, Huang Fei, Tu Kewei
- "In Dialogues We Learn": Towards Personalized Dialogue Without Pre-defined Profiles Through In-dialogue Learning Cheng Chuanqi, Tu Quan, Wu Wei, Shang Shuo, Mao Cunli, Yu Zhengtao, Yan Rui
- Trojanrag: Retrieval-augmented Generation Can Be Backdoor Driver In Large Language Models Cheng Pengzhou, Ding Yidong, Ju Tianjie, Wu Zongru, Du Wei, Yi Ping, Zhang Zhuosheng, Liu Gongshen
- Call Me When Necessary: Llms Can Efficiently And Faithfully Reason Over Structured Environments Cheng Sitao, Zhuang Ziyuan, Xu Yong, Yang Fangkai, Zhang Chaoyun, Qin Xiaoting, Huang Xiang, Chen Ling, Lin Qingwei, Zhang Dongmei, Rajmohan Saravan, Zhang Qi
- Information Re-organization Improves Reasoning In Large Language Models Cheng Xiaoxia, Tan Zeqi, Xue Wei, Lu Weiming
- Small Agent Can Also Rock! Empowering Small Language Models As Hallucination Detector Cheng Xiaoxue, Li Junyi, Zhao Wayne Xin, Zhang Hongzhi, Zhang Fuzheng, Zhang Di, Gai Kun, Wen Ji-rong
- Interpreting Conversational Dense Retrieval By Rewriting-enhanced Inversion Of Session Embedding Cheng Yiruo, Mao Kelong, Dou Zhicheng
- Can Large Language Models Be Trusted For Evaluation? Scalable Meta-evaluation Of Llms As Evaluators Via Agent Debate Chern Steffi, Chern Ethan, Neubig Graham, Liu Pengfei
- A Reality Check Of The Benefits Of LLM In Business Cheung Ming
- Lamsum: Creating Extractive Summaries Of User Generated Content Using Llms Chhikara Garima, Sharma Anurag, Gurucharan V., Ghosh Kripabandhu, Chakraborty Abhijnan
- Chatbot Arena: An Open Platform For Evaluating Llms By Human Preference Chiang Wei-lin, Zheng Lianmin, Sheng Ying, Angelopoulos Anastasios Nikolas, Li Tianle, Li Dacheng, Zhang Hao, Zhu Banghua, Jordan Michael, Gonzalez Joseph E., Stoica Ion
- Retrieval-augmented Generation In Multilingual Settings Chirkova Nadezhda, Rau David, Déjean Hervé, Formal Thibault, Clinchant Stéphane, Nikoulina Vassilina
- SPAR: Personalized Content-based Recommendation Via Long Engagement Attention Chiyu Zhang, Yifei Sun, Jun Chen, Jie Lei, Muhammad Abdul-mageed, Sinong Wang, Rong Jin, Sem Park, Ning Yao, Bo Long
- Language-image Models With 3D Understanding Cho Jang Hyun, Ivanovic Boris, Cao Yulong, Schmerling Edward, Wang Yue, Weng Xinshuo, Li Boyi, You Yurong, Krähenbühl Philipp, Wang Yan, Pavone Marco
- Kv-runahead: Scalable Causal LLM Inference By Parallel Key-value Cache Generation Cho Minsik, Rastegari Mohammad, Naik Devang
- Pragmatic Inference Of Scalar Implicature By Llms Cho Ye-eun, Kim Seong Mook
- Exploring Interaction Patterns For Debugging: Enhancing Conversational Capabilities Of Ai-assistants Chopra Bhavya, Bajpai Yasharth, Biyani Param, Soares Gustavo, Radhakrishna Arjun, Parnin Chris, Gulwani Sumit
- Large Language Models And User Trust: Consequence Of Self-referential Learning Loop And The Deskilling Of Healthcare Professionals Choudhury Avishek, Chaudhry Zaria
- Meerkat: Audio-visual Large Language Model For Grounding In Space And Time Chowdhury Sanjoy, Nag Sayan, Dasgupta Subhrajyoti, Chen Jun, Elhoseiny Mohamed, Gao Ruohan, Manocha Dinesh
- Think Together And Work Better: Combining Humans' And Llms' Think-aloud Outcomes For Effective Text Evaluation Chu Seongyeub, Kim Jongwoo, Yi Munyong
- Beamaggr: Beam Aggregation Reasoning Over Multi-source Knowledge For Multi-hop Question Answering Chu Zheng, Chen Jingchang, Chen Qianglong, Wang Haotian, Zhu Kun, Du Xiyuan, Yu Weijiang, Liu Ming, Qin Bing
- Beyond Llms: Advancing The Landscape Of Complex Reasoning Chu-carroll Jennifer, Beck Andrew, Burnham Greg, Melville David Os, Nachman David, Özcan A. Erdem, Ferrucci David
- AI Safety In Generative AI Large Language Models: A Survey Chua Jaymari, Li Yun, Yang Shiyi, Wang Chen, Yao Lina
- Faithlm: Towards Faithful Explanations For Large Language Models Chuang Yu-neng, Wang Guanchu, Chang Chia-yuan, Tang Ruixiang, Zhong Shaochen, Yang Fan, Du Mengnan, Cai Xuanting, Hu Xia
- Ragsys: Item-cold-start Recommender As RAG System Contal Emile, Mcgoldrick Garrin
- Out-of-context Prompting Boosts Fairness And Robustness In Large Language Model Predictions Cotta Leonardo, Maddison Chris J.
- Llambert: Large-scale Low-cost Data Annotation In NLP Csanády Bálint, Muzsai Lajos, Vedres Péter, Nádasdy Zoltán, Lukács András
- The Power Of Noise: Redefining Retrieval For RAG Systems Cuconasu Florin, Trappolini Giovanni, Siciliano Federico, Filice Simone, Campagnano Cesare, Maarek Yoelle, Tonellotto Nicola, Silvestri Fabrizio
- A Tale Of Trust And Accuracy: Base Vs. Instruct Llms In RAG Systems Cuconasu Florin, Trappolini Giovanni, Tonellotto Nicola, Silvestri Fabrizio
- Or-bench: An Over-refusal Benchmark For Large Language Models Cui Justin, Chiang Wei-lin, Stoica Ion, Hsieh Cho-jui
- Distillation Matters: Empowering Sequential Recommenders To Match The Performance Of Large Language Model Cui Yu, Liu Feng, Wang Pengbo, Wang Bohao, Tang Heng, Wan Yi, Wang Jun, Chen Jiawei
- Faculty Perspectives On The Potential Of RAG In Computer Science Higher Education Dakshit Sagnik
- Entropy Guided Extrapolative Decoding To Improve Factuality In Large Language Models Das Souvik, Jin Lifeng, Song Linfeng, Mi Haitao, Peng Baolin, Yu Dong
- Towards Transparency: Exploring LLM Trainings Datasets Through Visual Topic Modeling And Semantic Frame De Dampierre Charles, Mogoutov Andrei, Baumard Nicolas
- Retail-gpt: Leveraging Retrieval Augmented Generation (RAG) For Building E-commerce Chat Assistants De Freitas Bruno Amaral Teixeira, Lotufo Roberto De Alencar
- Polyrating: A Cost-effective And Bias-aware Rating System For LLM Evaluation Dekoninck Jasper, Baader Maximilian, Vechev Martin
- Understanding Biases In Chatgpt-based Recommender Systems: Provider Fairness, Temporal Stability, And Recency Deldjoo Yashar
- Cram: Credibility-aware Attention Modification In Llms For Combating Misinformation In RAG Deng Boyi, Wang Wenjie, Zhu Fengbin, Wang Qifan, Feng Fuli
- MIMIR: A Streamlined Platform For Personalized Agent Tuning In Domain Expertise Deng Chunyuan, Tang Xiangru, Zhao Yilun, Wang Hanming, Wang Haoran, Zhou Wangchunshu, Cohan Arman, Gerstein Mark
- Pandora: Jailbreak Gpts By Retrieval Augmented Generation Poisoning Deng Gelei, Liu Yi, Wang Kailong, Li Yuekang, Zhang Tianwei, Liu Yang
- Can LLM Be A Good Path Planner Based On Prompt Engineering? Mitigating The Hallucination For Path Planning Deng Hourui, Zhang Hongjie, Ou Jie, Feng Chaosheng
- LMGT: Optimizing Exploration-exploitation Balance In Reinforcement Learning Through Language Model Guided Trade-offs Deng Yongxin, Qiu Xihe, Tan Xiaoyu, Chu Wei, Xu Yinghui
- Glam: Fine-tuning Large Language Models For Domain Knowledge Graph Alignment Via Neighborhood Partitioning And Generative Subgraph Encoding Dernbach Stefan, Agarwal Khushbu, Zuniga Alejandro, Henry Michael, Choudhury Sutanay
- Class-level Code Generation From Natural Language Using Iterative, Tool-enhanced Reasoning Over Repository Deshpande Ajinkya, Agarwal Anmol, Shet Shashank, Iyer Arun, Kanade Aditya, Bairi Ramakrishna, Parthasarathy Suresh
- Exploring Prompt Engineering Practices In The Enterprise Desmond Michael, Brachman Michelle
- A Simple And Effective \(L_2\) Norm-based Strategy For KV Cache Compression Devoto Alessio, Zhao Yu, Scardapane Simone, Minervini Pasquale
- SOCIALITE-LLAMA: An Instruction-tuned Model For Social Scientific Tasks Dey Gourab, Ganesan Adithya V, Lal Yash Kumar, Shah Manal, Sinha Shreyashee, Matero Matthew, Giorgi Salvatore, Kulkarni Vivek, Schwartz H. Andrew
- Shaping Human-ai Collaboration: Varied Scaffolding Levels In Co-writing With Language Models Dhillon Paramveer S., Molaei Somayeh, Li Jiaqi, Golub Maximilian, Zheng Shaochun, Robert Lionel P.
- Entgpt: Linking Generative Large Language Models With Knowledge Bases Ding Yifan, Poudel Amrit, Zeng Qingkai, Weninger Tim, Veeramani Balaji, Bhattacharya Sanmitra
- Finding Blind Spots In Evaluator Llms With Interpretable Checklists Doddapaneni Sumanth, Khan Mohammed Safi Ur Rahman, Verma Sshubam, Khapra Mitesh M.
- Understand What LLM Needs: Dual Preference Alignment For Retrieval-augmented Generation Dong Guanting, Zhu Yutao, Zhang Chenghao, Wang Zechen, Dou Zhicheng, Wen Ji-rong
- Evaluating And Mitigating Linguistic Discrimination In Large Language Models Dong Guoliang, Wang Haoyu, Sun Jun, Wang Xinyu
- Modality-aware Integration With Large Language Models For Knowledge-based Visual Question Answering Dong Junnan, Zhang Qinggang, Zhou Huachi, Zha Daochen, Zheng Pai, Huang Xiao
- A Framework For Real-time Safeguarding The Text Generation Of Large Language Model Dong Ximing, Lin Dayi, Wang Shaowei, Hassan Ahmed E.
- Can LLM Be A Personalized Judge? Dong Yijiang River, Hu Tiancheng, Collier Nigel
- Jailbreaking Text-to-image Models With Llm-based Agents Dong Yingkai, Li Zheng, Meng Xiangtao, Yu Ning, Guo Shanqing
- Attacks, Defenses And Evaluations For LLM Conversation Safety: A Survey Dong Zhichen, Zhou Zhanhui, Yang Chao, Shao Jing, Qiao Yu
- Biomedical Large Languages Models Seem Not To Be Superior To Generalist Models On Unseen Medical Data Dorfner Felix J., Dada Amin, Busch Felix, Makowski Marcus R., Han Tianyu, Truhn Daniel, Kleesiek Jens, Sushil Madhumita, Lammert Jacqueline, Adams Lisa C., Bressem Keno K.
- Training A Vision Language Model As Smartphone Assistant Dorka Nicolai, Marecki Janusz, Anwar Ammar
- Learning From Students: Applying T-distributions To Explore Accurate And Efficient Formats For Llms Dotzel Jordan, Chen Yuzong, Kotb Bahaa, Prasad Sushma, Wu Gang, Li Sheng, Abdelfattah Mohamed S., Zhang Zhiru
- What's Wrong With Your Code Generated By Large Language Models? An Extensive Study Dou Shihan, Jia Haoxiang, Wu Shenxi, Zheng Huiyuan, Zhou Weikang, Wu Muling, Chai Mingxu, Fan Jessica, Huang Caishuang, Tao Yunbo, Liu Yan, Zhou Enyu, Zhang Ming, Zhou Yuhao, Wu Yueming, Zheng Rui, Wen Ming, Weng Rongxiang, Wang Jingang, Cai Xunliang, Gui Tao, Qiu Xipeng, Zhang Qi, Huang Xuanjing
- Towards Multimodal In-context Learning For Vision & Language Models Doveh Sivan, Perek Shaked, Mirza M. Jehanzeb, Lin Wei, Alfassy Amit, Arbelle Assaf, Ullman Shimon, Karlinsky Leonid
- Evidence-enhanced Triplet Generation Framework For Hallucination Alleviation In Generative Question Answering Du Haowei, Zhang Huishuai, Zhao Dongyan
- Efficient Personalized Text-to-image Generation By Leveraging Textual Subspace Du Shian, Cheng Xiaotian, Qian Qi, Wei Henglu, Xu Yi, Ji Xiangyang
- Unlocking Continual Learning Abilities In Language Models Du Wenyu, Cheng Shuang, Luo Tongxu, Qiu Zihan, Huang Zeyu, Cheung Ka Chun, Cheng Reynold, Fu Jie
- Stacking Your Transformers: A Closer Look At Model Growth For Efficient LLM Pre-training Du Wenyu, Luo Tongxu, Qiu Zihan, Huang Zeyu, Shen Yikang, Cheng Reynold, Guo Yike, Fu Jie
- Perltqa: A Personal Long-term Memory Dataset For Memory Classification, Retrieval, And Synthesis In Question Answering Du Yiming, Wang Hongru, Zhao Zhengyi, Liang Bin, Wang Baojun, Zhong Wanjun, Wang Zezhong, Wong Kam-fai
- Towards Comprehensive Vietnamese Retrieval-augmented Generation And Large Language Models Duc Nguyen Quang, Son Le Hai, Nhan Nguyen Duc, Minh Nguyen Dich Nhat, Huong Le Thanh, Sang Dinh Viet
- Leveraging Machine-generated Rationales To Facilitate Social Meaning Detection In Conversations Dutt Ritam, Wu Zhen, Shi Kelly, Sheth Divyanshu, Gupta Prakhar, Rose Carolyn Penstein
- AXOLOTL: Fairness Through Assisted Self-debiasing Of Large Language Model Outputs Ebrahimi Sana, Chen Kaiwen, Asudeh Abolfazl, Das Gautam, Koudas Nick
- Lusifer: Llm-based User Simulated Feedback Environment For Online Recommender Systems Ebrat Danial, Rueda Luis
- Relation Extraction With Fine-tuned Large Language Models In Retrieval Augmented Generation Frameworks Efeoglu Sefika, Paschke Adrian
- Explainable AI Reloaded: Challenging The XAI Status Quo In The Era Of Large Language Models Ehsan Upol, Riedl Mark O.
- ARAGOG: Advanced RAG Output Grading Eibich Matouš, Nagpal Shivay, Fred-ojala Alexander
- Kapqa: Knowledge-augmented Product Question-answering Eppalapally Swetha, Dangi Daksh, Bhat Chaithra, Gupta Ankita, Zhang Ruiyi, Agarwal Shubham, Bagga Karishma, Yoon Seunghyun, Lipka Nedim, Rossi Ryan A., Dernoncourt Franck
- Navigating Uncertainty: Optimizing API Dependency For Hallucination Reduction In Closed-book Question Answering Erbacher Pierre, Falissar Louis, Guigue Vincent, Soulier Laure
- Fact-checking The Output Of Large Language Models Via Token-level Uncertainty Quantification Fadeeva Ekaterina, Rubashevskii Aleksandr, Shelmanov Artem, Petrakov Sergey, Li Haonan, Mubarak Hamdy, Tsymbalov Evgenii, Kuzmin Gleb, Panchenko Alexander, Baldwin Timothy, Nakov Preslav, Panov Maxim
- Data-augmentation-based Dialectal Adaptation For Llms Faisal Fahim, Anastasopoulos Antonios
- HLAT: High-quality Large Language Model Pre-trained On AWS Trainium Fan Haozheng, Zhou Hao, Huang Guangtai, Raman Parameswaran, Fu Xinwei, Gupta Gaurav, Ram Dhananjay, Wang Yida, Huan Jun
- Reformatted Alignment Fan Run-ze, Li Xuefeng, Zou Haoyang, Li Junlong, He Shwai, Chern Ethan, Hu Jiewen, Liu Pengfei
- Navigation Instruction Generation With BEV Perception And Large Language Models Fan Sheng, Liu Rui, Wang Wenguan, Yang Yi
- A Survey On RAG Meeting Llms: Towards Retrieval-augmented Large Language Models Fan Wenqi, Ding Yujuan, Ning Liangbo, Wang Shijie, Li Hengyun, Yin Dawei, Chua Tat-seng, Li Qing
- Enhancing Noise Robustness Of Retrieval-augmented Language Models With Adaptive Adversarial Training Fang Feiteng, Bai Yuelin, Ni Shiwen, Yang Min, Chen Xiaojun, Xu Ruifeng
- TRACE The Evidence: Constructing Knowledge-grounded Reasoning Chains For Retrieval-augmented Generation Fang Jinyuan, Meng Zaiqiao, Macdonald Craig
- Large Language Models Are Neurosymbolic Reasoners Fang Meng, Deng Shilong, Zhang Yudi, Shi Zijing, Chen Ling, Pechenizkiy Mykola, Wang Jun
- PATCH! Psychometrics-assisted Benchmarking Of Large Language Models: A Case Study Of Proficiency In 8th Grade Mathematics Fang Qixiang, Oberski Daniel L., Nguyen Dong
- Integrating A.I. In Higher Education: Protocol For A Pilot Study With 'Samcares: An Adaptive Learning Hub' Faruqui Syed Hasib Akhter, Tasnim Nazia, Basith Iftekhar Ibne, Obeidat Suleiman, Yildiz Faruk
- Curlora: Stable LLM Continual Fine-tuning And Catastrophic Forgetting Mitigation Fawi Muhammad
- Boosting Zero-shot Crosslingual Performance Using Llm-based Augmentations With Effective Data Selection Fazili Barah, Agrawal Ashish Sunil, Jyothi Preethi
- Gensco: Can Question Decomposition Based Passage Alignment Improve Question Answering? Fazili Barah, Goswami Koustava, Modani Natwar, Nair Inderjeet
- Retrieval Meets Reasoning: Dynamic In-context Editing For Long-text Understanding Fei Weizhi, Niu Xueyan, Xie Guoqing, Zhang Yanhua, Bai Bo, Deng Lei, Han Wei
- Internlm-law: An Open Source Chinese Legal Large Language Model Fei Zhiwei, Zhang Songyang, Shen Xiaoyu, Zhu Dawei, Wang Xiao, Cao Maosong, Zhou Fengzhe, Li Yining, Zhang Wenwei, Lin Dahua, Chen Kai, Ge Jidong
- Ragged Edges: The Double-edged Sword Of Retrieval-augmented Chatbots Feldman Philip, Foulds James R., Pan Shimei
- Legend: Leveraging Representation Engineering To Annotate Safety Margin For Preference Datasets Feng Duanyu, Qin Bowen, Huang Chen, Huang Youcheng, Zhang Zheng, Lei Wenqiang
- Keypoint-based Progressive Chain-of-thought Distillation For Llms Feng Kaituo, Li Changsheng, Zhang Xiaolu, Zhou Jun, Yuan Ye, Wang Guoren
- AGILE: A Novel Framework Of LLM Agents Feng Peiyuan, He Yichen, Huang Guanhua, Lin Yuan, Zhang Hanchong, Zhang Yuchen, Li Hang
- Modular Pluralism: Pluralistic Alignment Via Multi-llm Collaboration Feng Shangbin, Sorensen Taylor, Liu Yuhan, Fisher Jillian, Park Chan Young, Choi Yejin, Tsvetkov Yulia
- Courseassist: Pedagogically Appropriate AI Tutor For Computer Science Education Feng Ty, Liu Sa, Ghosal Dipak
- Llama-vits: Enhancing TTS Synthesis With Semantic Awareness Feng Xincan, Yoshimoto Akifumi
- Mt-ladder: A Model-agnostic Framework Boosting Llm-based Machine Translation To The Next Level Feng Zhaopeng, Zhang Yan, Chen Ruizhe, Meng Zijie, Liu Zuozhu
- Unveiling And Manipulating Prompt Influence In Large Language Models Feng Zijian, Zhou Hanzhang, Zhu Zixiao, Qian Junlang, Mao Kezhi
- Syllabusqa: A Course Logistics Question Answering Dataset Fernandez Nigel, Scarlatos Alexander, Lan Andrew
- Generative AI And Large Language Models For Cyber Security: All Insights You Need Ferrag Mohamed Amine, Alwahedi Fatima, Battah Ammar, Cherif Bilel, Mechri Abdechakour, Tihanyi Norbert
- Exaranker-open: Synthetic Explanation For IR Using Open-source Llms Ferraretto Fernando, Laitz Thiago, Lotufo Roberto, Nogueira Rodrigo
- The Chronicles Of RAG: The Retriever, The Chunk And The Generator Finardi Paulo, Avila Leonardo, Castaldoni Rodrigo, Gengo Pedro, Larcher Celio, Piau Marcos, Costa Pablo, Caridá Vinicius
- Grillbot In Practice: Lessons And Tradeoffs Deploying Large Language Models For Adaptable Conversational Task Assistants Fischer Sophie, Gemmell Carlos, Tecklenburg Niklas, Mackie Iain, Rossetto Federico, Dalton Jeffrey
- RAG Foundry: A Framework For Enhancing Llms For Retrieval Augmented Generation Fleischer Daniel, Berchansky Moshe, Wasserblat Moshe, Izsak Peter
- Preact: Predicting Future In React Enhances Agent's Planning Ability Fu Dayuan, Huang Jianzhao, Lu Siyuan, Dong Guanting, Wang Yejie, He Keqing, Xu Weiran
- Autorag-hp: Automatic Online Hyper-parameter Tuning For Retrieval-augmented Generation Fu Jia, Qin Xiaoting, Yang Fangkai, Wang Lu, Zhang Jue, Lin Qingwei, Chen Yubo, Zhang Dongmei, Rajmohan Saravan, Zhang Qi
- A Role-specific Guided Large Language Model For Ophthalmic Consultation Based On Stylistic Differentiation Fu Laiyi, Fan Binbin, Du Hongkai, Feng Yanxiang, Li Chunhua, Song Huping
- Moa: Mixture Of Sparse Attention For Automatic Large Language Model Compression Fu Tianyu, Huang Haofeng, Ning Xuefei, Zhang Genghan, Chen Boju, Wu Tianqi, Wang Hongyi, Huang Zixiao, Li Shiyao, Yan Shengen, Dai Guohao, Yang Huazhong, Wang Yu
- BLINK: Multimodal Large Language Models Can See But Not Perceive Fu Xingyu, Hu Yushi, Li Bangzheng, Feng Yu, Wang Haoyu, Lin Xudong, Roth Dan, Smith Noah A., Ma Wei-chiu, Krishna Ranjay
- Autoguide: Automated Generation And Selection Of State-aware Guidelines For Large Language Model Agents Fu Yao, Kim Dong-ki, Kim Jaekyeom, Sohn Sungryull, Logeswaran Lajanugen, Bae Kyunghoon, Lee Honglak
- UI-JEPA: Towards Active Perception Of User Intent Through Onscreen User Activity Fu Yicheng, Anantha Raviteja, Vashisht Prabal, Cheng Jianpeng, Littwin Etai
- Continual Pre-training For Cross-lingual LLM Adaptation: Enhancing Japanese Language Capabilities Fujii Kazuki, Nakamura Taishi, Loem Mengsay, Iida Hiroki, Ohi Masanari, Hattori Kakeru, Shota Hirai, Mizuki Sakae, Yokota Rio, Okazaki Naoaki
- Layoutllm: Large Language Model Instruction Tuning For Visually Rich Document Understanding Fujitake Masato
- Zero-shot Persuasive Chatbots With Llm-generated Strategies And Information Retrieval Furumai Kazuaki, Legaspi Roberto, Vizcarra Julio, Yamazaki Yudai, Nishimura Yasutaka, Semnani Sina J., Ikeda Kazushi, Shi Weiyan, Lam Monica S.
- Student Answer Forecasting: Transformer-driven Answer Choice Prediction For Language Learning Gado Elena Grazia, Martorella Tommaso, Zunino Luca, Mejia-domenzain Paola, Swamy Vinitra, Frej Jibril, Käser Tanja
- Open Generative Large Language Models For Galician Gamallo Pablo, Rodríguez Pablo, De-dios-flores Iria, Sotelo Susana, Paniagua Silvia, Bardanca Daniel, Pichel José Ramom, Garcia Marcos
- Language Models Do Hard Arithmetic Tasks Easily And Hardly Do Easy Arithmetic Tasks Gambardella Andrew, Iwasawa Yusuke, Matsuo Yutaka
- MMM: Multilingual Mutual Reinforcement Effect Mix Datasets & Test With Open-domain Information Extraction Large Language Models Gan Chengguang, Yin Qingyu, He Xinyang, Wei Hanjun, Liang Yunhao, Lim Younghun, Wang Shijian, Huang Hexiang, Zhang Qinghao, Ni Shiwen, Mori Tatsunori
- Context-augmented Retrieval: A Novel Framework For Fast Information Retrieval Based Response Generation Using Large Language Model Ganesh Sai, Purwar Anupam, B Gautam
- Cost-efficient Large Language Model Serving For Multi-turn Conversations With Cachedattention Gao Bin, He Zhuomin, Sharma Puru, Kang Qingxuan, Jevdjic Djordje, Deng Junbo, Yang Xingkun, Yu Zhou, Zuo Pengfei
- Aligning LLM Agents By Learning Latent Preference From User Edits Gao Ge, Taymanov Alexey, Salinas Eduardo, Mineiro Paul, Misra Dipendra
- Harnessing The Intrinsic Knowledge Of Pretrained Language Models For Challenging Text Classification Settings Gao Lingyu
- DORY: Deliberative Prompt Recovery For LLM Gao Lirong, Peng Ru, Zhang Yiming, Zhao Junbo
- Autovcoder: A Systematic Framework For Automated Verilog Code Generation Using Llms Gao Mingzhe, Zhao Jieru, Lin Zhe, Ding Wenchao, Hou Xiaofeng, Feng Yu, Li Chao, Guo Minyi
- Meta Reasoning For Large Language Models Gao Peizhong, Xie Ao, Mao Shaoguang, Wu Wenshan, Xia Yan, Mi Haipeng, Wei Furu
- Efficient Tool Use With Chain-of-abstraction Reasoning Gao Silin, Dwivedi-yu Jane, Yu Ping, Tan Xiaoqing Ellen, Pasunuru Ramakanth, Golovneva Olga, Sinha Koustuv, Celikyilmaz Asli, Bosselut Antoine, Wang Tianlu
- Compensate Quantization Errors: Make Weights Hierarchical To Compensate Each Other Gao Yifei, Ou Jie, Wang Lei, Xiao Yuting, Xiang Zhiyuan, Dai Ruiting, Cheng Jun
- Dr3: Ask Large Language Models Not To Give Off-topic Answers In Open Domain Multi-hop Question Answering Gao Yuan, Zhu Yiheng, Cao Yuanbin, Zhou Yinzhi, Wu Zhen, Chen Yujie, Wu Shenglan, Hu Haoyuan, Dai Xinyu
- Robust Pronoun Fidelity With English Llms: Are They Reasoning, Repeating, Or Just Biased? Gautam Vagrant, Bingert Eileen, Zhu Dawei, Lauscher Anne, Klakow Dietrich
- Worldgpt: Empowering LLM As Multimodal World Model Ge Zhiqi, Huang Hongzhe, Zhou Mingze, Li Juncheng, Wang Guoming, Tang Siliang, Zhuang Yueting
- Does Fine-tuning Llms On New Knowledge Encourage Hallucinations? Gekhman Zorik, Yona Gal, Aharoni Roee, Eyal Matan, Feder Amir, Reichart Roi, Herzig Jonathan
- Breaking The Length Barrier: Llm-enhanced CTR Prediction In Long Textual User Behaviors Geng Binzong, Huan Zhaoxin, Zhang Xiaolu, He Yong, Zhang Liang, Yuan Fajie, Zhou Jun, Mo Linjian
- Syndarin: Synthesising Datasets For Automated Reasoning In Low-resource Languages Ghazaryan Gayane, Arakelyan Erik, Minervini Pasquale, Augenstein Isabelle
- Reinforcement Learning Problem Solving With Large Language Models Gholamian Sina, Huh Domingo
- RE-GAINS & Enchant: Intelligent Tool Manipulation Systems For Enhanced Query Responses Girhepuje Sahil, Sajeev Siva Sankar, Jain Purvam, Sikder Arya, Varma Adithya Rama, George Ryan, Srinivasan Akshay Govind, Kurup Mahendra, Sinha Ashmit, Mondal Sudip
- Analysis Of Plan-based Retrieval For Grounded Text Generation Godbole Ameya, Monath Nicholas, Kim Seungyeon, Rawat Ankit Singh, Mccallum Andrew, Zaheer Manzil
- Tokenshap: Interpreting Large Language Models With Monte Carlo Shapley Value Estimation Goldshmidt Roni, Horovicz Miriam
- Transforming Llms Into Cross-modal And Cross-lingual Retrieval Systems Gomez Frank Palma, Sanabria Ramon, Sung Yun-hsuan, Cer Daniel, Dalmia Siddharth, Abrego Gustavo Hernandez
- AST-T5: Structure-aware Pretraining For Code Generation And Understanding Gong Linyuan, Elhoushi Mostafa, Cheung Alvin
- Estimating Knowledge In Large Language Models Without Generating A Single Token Gottesman Daniela, Geva Mor
- Loose LIPS Sink Ships: Asking Questions In Battleship With Language-informed Program Sampling Grand Gabriel, Pepe Valerio, Andreas Jacob, Tenenbaum Joshua B.
- From Generalist To Specialist: Improving Large Language Models For Medical Physics Using Arcot Grandinetti Jace, Mcbeth Rafe
- Enhancing Llm-based Human-robot Interaction With Nuances For Diversity Awareness Grassi Lucrezia, Recchiuto Carmine Tommaso, Sgorbissa Antonio
- RWKV-CLIP: A Robust Vision-language Representation Learner Gu Tiancheng, Yang Kaicheng, An Xiang, Feng Ziyong, Liu Dongnan, Cai Weidong, Deng Jiankang
- Detectbench: Can Large Language Model Detect And Piece Together Implicit Evidence? Gu Zhouhong, Zhang Lin, Zhu Xiaoxuan, Chen Jiangjie, Huang Wenhao, Zhang Yikai, Wang Shusen, Ye Zheyu, Gao Yan, Feng Hongwei, Xiao Yanghua
- APTQ: Attention-aware Post-training Mixed-precision Quantization For Large Language Models Guan Ziyi, Huang Hantao, Su Yupeng, Huang Hong, Wong Ngai, Yu Hao
- Iepile: Unearthing Large-scale Schema-based Information Extraction Corpus Gui Honghao, Yuan Lin, Ye Hongbin, Zhang Ningyu, Sun Mengshu, Liang Lei, Chen Huajun
- Boosting Lossless Speculative Decoding Via Feature Sampling And Partial Alignment Distillation Gui Lujun, Xiao Bin, Su Lei, Chen Weipeng
- Conversational Topic Recommendation In Counseling And Psychotherapy With Decision Transformer And Large Language Models Gunal Aylin, Lin Baihan, Bouneffouf Djallel
- A Novel Paradigm Boosting Translation Capabilities Of Large Language Models Guo Jiaxin, Yang Hao, Li Zongyao, Wei Daimeng, Shang Hengchao, Chen Xiaoyu
- Learning On Graphs With Large Language Models (llms): A Deep Dive Into Model Robustness Guo Kai, Liu Zewen, Chen Zhikai, Wen Hongzhi, Jin Wei, Tang Jiliang, Chang Yi
- Integrating Large Language Models With Graphical Session-based Recommendation Guo Naicheng, Cheng Hongwei, Liang Qianqiao, Chen Linxun, Han Bing
- Direct Language Model Alignment From Online AI Feedback Guo Shangmin, Zhang Biao, Liu Tianlin, Liu Tianqi, Khalman Misha, Llinares Felipe, Rame Alexandre, Mesnard Thomas, Zhao Yao, Piot Bilal, Ferret Johan, Blondel Mathieu
- SGSH: Stimulate Large Language Models With Skeleton Heuristics For Knowledge Base Question Generation Guo Shasha, Liao Lizi, Zhang Jing, Wang Yanling, Li Cuiping, Chen Hong
- Agent-simt: Agent-assisted Simultaneous Machine Translation With Large Language Models Guo Shoutao, Zhang Shaolei, Ma Zhengrui, Zhang Min, Feng Yang
- Sillm: Large Language Models For Simultaneous Machine Translation Guo Shoutao, Zhang Shaolei, Ma Zhengrui, Zhang Min, Feng Yang
- Ds-agent: Automated Data Science By Empowering Large Language Models With Case-based Reasoning Guo Siyuan, Deng Cheng, Wen Ying, Chen Hechang, Chang Yi, Wang Jun
- Generative AI For Synthetic Data Generation: Methods, Challenges And The Future Guo Xu, Chen Yiqiang
- Efficient Continual Pre-training By Mitigating The Stability Gap Guo Yiduo, Fu Jie, Zhang Huishuai, Zhao Dongyan, Shen Yikang
- GPTQT: Quantize Large Language Models Twice To Push The Efficiency Guo Yipin, Lang Yilin, Ren Qinyuan
- API Pack: A Massive Multi-programming Language Dataset For API Call Generation Guo Zhen, Soria Adriana Meza, Sun Wei, Shen Yikang, Panda Rameswar
- Dependency-aware Semi-structured Sparsity: Declining Roles Of Outliers In Pruning Glu-based Llms Guo Zhiyu, Kamigaito Hidetaka, Watanabe Taro
- AAVENUE: Detecting LLM Biases On NLU Tasks In AAVE Via A Novel Benchmark Gupta Abhay, Meng Philip, Yurtseven Ece, O'brien Sean, Zhu Kevin
- Selective Self-rehearsal: A Fine-tuning Approach To Improve Generalization In Large Language Models Gupta Sonam, Nandwani Yatin, Yehudai Asaf, Mishra Mayank, Pandey Gaurav, Raghu Dinesh, Joshi Sachindra
- Changing Answer Order Can Decrease MMLU Accuracy Gupta Vipul, Pantoja David, Ross Candace, Williams Adina, Ung Megan
- Hipporag: Neurobiologically Inspired Long-term Memory For Large Language Models Gutiérrez Bernal Jiménez, Shu Yiheng, Gu Yu, Yasunaga Michihiro, Su Yu
- On The Impacts Of Contexts On Repository-level Code Generation Hai Nam Le, Nguyen Dung Manh, Bui Nghi D. Q.
- Single Parent Family: A Spectrum Of Family Members From A Single Pre-trained Foundation Model Hajimolahoseini Habib, Hassanpour Mohammad, Ataiefard Foozhan, Chen Boxing, Liu Yang
- Synthclip: Are We Ready For A Fully Synthetic CLIP Training? Hammoud Hasan Abed Al Kader, Itani Hani, Pizzati Fabio, Torr Philip, Bibi Adel, Ghanem Bernard
- MERLIN: Multimodal Embedding Refinement Via Llm-based Iterative Navigation For Text-video Retrieval-rerank Pipeline Han Donghoon, Park Eunhwan, Lee Gisang, Lee Adam, Kwak Nojun
- Chain-of-interaction: Enhancing Large Language Models For Psychiatric Behavior Understanding By Dyadic Contexts Han Guangzeng, Liu Weisi, Huang Xiaolei, Borsari Brian
- Chatgpt Based Data Augmentation For Improved Parameter-efficient Debiasing Of Llms Han Pengrui, Kocielnik Rafal, Saravanan Adhithya, Jiang Roy, Sharir Or, Anandkumar Anima
- RAG-QA Arena: Evaluating Domain Robustness For Long-form Retrieval Augmented Question Answering Han Rujun, Zhang Yuhao, Qi Peng, Xu Yumo, Wang Jenyuan, Liu Lan, Wang William Yang, Min Bonan, Castelli Vittorio
- Wildguard: Open One-stop Moderation Tools For Safety Risks, Jailbreaks, And Refusals Of Llms Han Seungju, Rao Kavel, Ettinger Allyson, Jiang Liwei, Lin Bill Yuchen, Lambert Nathan, Choi Yejin, Dziri Nouha
- Jailbreaking Proprietary Large Language Models Using Word Substitution Cipher Handa Divij, Chirmule Advait, Gajera Bimal, Baral Chitta
- It Cannot Be Right If It Was Written By AI: On Lawyers' Preferences Of Documents Perceived As Authored By An LLM Vs A Human Harasta Jakub, Novotná Tereza, Savelka Jaromir
- Measuring Psychological Depth In Language Models Harel-canada Fabrice, Zhou Hanyu, Mupalla Sreya, Yildiz Zeynep, Sahai Amit, Peng Nanyun
- Twips: A Large Language Model Powered Texting Application To Simplify Conversational Nuances For Autistic Users Haroon Rukhshan, Dogar Fahad
- Sub-goal Distillation: A Method To Improve Small Language Agents Hashemzadeh Maryam, Stengel-eskin Elias, Chandar Sarath, Cote Marc-alexandre
- A Comprehensive Overview Of Large Language Models (llms) For Cyber Defences: Opportunities And Directions Hassanin Mohammed, Moustafa Nour
- Chain-of-instructions: Compositional Instruction Tuning On Large Language Models Hayati Shirley Anugrah, Jung Taehee, Bodding-long Tristan, Kar Sudipta, Sethy Abhinav, Kim Joo-kyung, Kang Dongyeop
- Olympiadbench: A Challenging Benchmark For Promoting AGI With Olympiad-level Bilingual Multimodal Scientific Problems He Chaoqun, Luo Renjie, Bai Yuzhuo, Hu Shengding, Thai Zhen Leng, Shen Junhao, Hu Jinyi, Han Xu, Huang Yujie, Zhang Yuxiang, Liu Jie, Qi Lei, Liu Zhiyuan, Sun Maosong
- Webvoyager: Building An End-to-end Web Agent With Large Multimodal Models He Hongliang, Yao Wenlin, Ma Kaixin, Yu Wenhao, Dai Yong, Zhang Hongming, Lan Zhenzhong, Yu Dong
- Cos: Enhancing Personalization And Mitigating Bias With Context Steering He Jerry Zhi-yang, Pandey Sashrika, Schrum Mariah L., Dragan Anca
- Multi-modal Instruction Tuned Llms With Fine-grained Visual Perception He Junwen, Wang Yifan, Wang Lijun, Lu Huchuan, He Jun-yan, Lan Jin-peng, Luo Bin, Xie Xuansong
- Prompt-based Bias Calibration For Better Zero/few-shot Learning Of Language Models He Kang, Long Yinghan, Roy Kaushik
- Think-program-rectify: 3D Situated Reasoning With Large Language Models He Qingrong, Lin Kejun, Chen Shizhe, Hu Anwen, Jin Qin
- Pitvqa: Image-grounded Text Embedding LLM For Visual Question Answering In Pituitary Surgery He Runlong, Xu Mengya, Das Adrito, Khan Danyal Z., Bano Sophia, Marcus Hani J., Stoyanov Danail, Clarkson Matthew J., Islam Mobarakol
- G-retriever: Retrieval-augmented Generation For Textual Graph Understanding And Question Answering He Xiaoxin, Tian Yijun, Sun Yifei, Chawla Nitesh V., Laurent Thomas, Lecun Yann, Bresson Xavier, Hooi Bryan
- Transferring Troubles: Cross-lingual Transferability Of Backdoor Attacks In Llms With Instruction Tuning He Xuanli, Wang Jun, Xu Qiongkai, Minervini Pasquale, Stenetorp Pontus, Rubinstein Benjamin I. P., Cohn Trevor
- Llms Meet Multimodal Generation And Editing: A Survey He Yingqing, Liu Zhaoyang, Chen Jingye, Tian Zeyue, Liu Hongyu, Chi Xiaowei, Liu Runtao, Yuan Ruibin, Xing Yazhou, Wang Wenhai, Dai Jifeng, Zhang Yong, Xue Wei, Liu Qifeng, Guo Yike, Chen Qifeng
- Language Models As Hierarchy Encoders He Yuan, Yuan Zhangdie, Chen Jiaoyan, Horrocks Ian
- Automated Black-box Prompt Engineering For Personalized Text-to-image Generation He Yutong, Robey Alexander, Murata Naoki, Jiang Yiding, Williams Joshua, Pappas George J., Hassani Hamed, Mitsufuji Yuki, Salakhutdinov Ruslan, Kolter J. Zico
- Position Engineering: Boosting Large Language Models Through Positional Information Manipulation He Zhiyuan, Jiang Huiqiang, Wang Zilong, Yang Yuqing, Qiu Luna, Qiu Lili
- HMT: Hierarchical Memory Transformer For Long Context Language Processing He Zifan, Qin Zongyue, Prakriya Neha, Sun Yizhou, Cong Jason
- Intent-conditioned And Non-toxic Counterspeech Generation Using Multi-task Instruction Tuning With RLAIF Hengle Amey, Kumar Aswini, Singh Sahajpreet, Bandhakavi Anil, Akhtar Md Shad, Chakraborty Tanmoy
- Lilium: Ebay's Large Language Models For E-commerce Herold Christian, Kozielski Michael, Ekimov Leonid, Petrushkov Pavel, Vandenbussche Pierre-yves, Khadivi Shahram
- From Descriptive Richness To Bias: Unveiling The Dark Side Of Generative Image Caption Enrichment Hirota Yusuke, Hachiuma Ryo, Yang Chao-han Huck, Nakashima Yuta
- Deepspeed-fastgen: High-throughput Text Generation For Llms Via MII And Deepspeed-inference Holmes Connor, Tanaka Masahiro, Wyatt Michael, Awan Ammar Ahmad, Rasley Jeff, Rajbhandari Samyam, Aminabadi Reza Yazdani, Qin Heyang, Bakhtiari Arash, Kurilenko Lev, He Yuxiong
- Llm-aided Compilation For Tensor Accelerators Hong Charles, Bhatia Sahil, Haan Altan, Dong Shengjun Kris, Nikiforov Dima, Cheung Alvin, Shao Yakun Sophia
- Accelerating Multilingual Language Model For Excessively Tokenized Languages Hong Jimin, Lee Gibbeum, Cho Jaewoong
- Curiosity-driven Red-teaming For Large Language Models Hong Zhang-wei, Shenfeld Idan, Wang Tsun-hsuan, Chuang Yung-sung, Pareja Aldo, Glass James, Srivastava Akash, Agrawal Pulkit
- Free To Play: UN Trade And Development's Experience With Developing Its Own Open-source Retrieval Augmented Generation Large Language Model Application Hopp Daniel
- Evolution Of Social Norms In LLM Agents Using Natural Language Horiguchi Ilya, Yoshida Takahide, Ikegami Takashi
- Securing Vision-language Models With A Robust Encoder Against Jailbreak And Adversarial Attacks Hossain Md Zarif, Imteaj Ahmed
- Visual Prompting For Generalized Few-shot Segmentation: A Multi-scale Approach Hossain Mir Rayat Imtiaz, Siam Mennatullah, Sigal Leonid, Little James J.
- Retrieval-augmented Code Completion For Local Projects Using Large Language Models Hostnik Marko, Robnik-šikonja Marko
- Enhancing And Accelerating Large Language Models Via Instruction-aware Contextual Compression Hou Haowen, Ma Fei, Bai Binwen, Zhu Xinxin, Yu Fei
- Raw Text Is All You Need: Knowledge-intensive Multi-turn Instruction Tuning For Large Language Model Hou Xia, Li Qifeng, Yang Jian, Li Tongliang, Chai Linzheng, Wu Xianjie, Ji Hangyuan, Li Zhoujun, Nie Jixuan, Dun Jingbo, Song Wenfeng
- Wikicontradict: A Benchmark For Evaluating Llms On Real-world Knowledge Conflicts From Wikipedia Hou Yufang, Pascale Alessandra, Carnerero-cano Javier, Tchrakian Tigran, Marinescu Radu, Daly Elizabeth, Padhi Inkit, Sattigeri Prasanna
- Bridging Language And Items For Retrieval And Recommendation Hou Yupeng, Li Jiacheng, He Zhankui, Yan An, Chen Xiusi, Mcauley Julian
- Chatglm-rlhf: Practices Of Aligning Large Language Models With Human Feedback Hou Zhenyu, Niu Yilin, Du Zhengxiao, Zhang Xiaohan, Liu Xiao, Zeng Aohan, Zheng Qinkai, Huang Minlie, Wang Hongning, Tang Jie, Dong Yuxiao
- Large Language Model-powered Chatbots For Internationalizing Student Support In Higher Education Hsain Achraf, Housni Hamza El
- RAGGED: Towards Informed Design Of Retrieval Augmented Generation Systems Hsia Jennifer, Shaikh Afreen, Wang Zhiruo, Neubig Graham
- Found In The Middle: Calibrating Positional Attention Bias Improves Long Context Utilization Hsieh Cheng-yu, Chuang Yung-sung, Li Chun-liang, Wang Zifeng, Le Long T., Kumar Abhishek, Glass James, Ratner Alexander, Lee Chen-yu, Krishna Ranjay, Pfister Tomas
- Wilke: Wise-layer Knowledge Editor For Lifelong Knowledge Editing Hu Chenhui, Cao Pengfei, Chen Yubo, Liu Kang, Zhao Jun
- Rankprompt: Step-by-step Comparisons Make Language Models Better Reasoners Hu Chi, Ge Yuan, Ma Xiangnan, Cao Hang, Li Qiang, Yang Yonghua, Xiao Tong, Zhu Jingbo
- Teaching Language Models To Self-improve By Learning From Language Feedback Hu Chi, Hu Yimin, Cao Hang, Xiao Tong, Zhu Jingbo
- Multimodal Chain-of-thought Reasoning Via Chatgpt To Protect Children From Age-inappropriate Apps Hu Chuanbo, Liu Bin, Yin Minglei, Zhou Yilu, Li Xin
- LRP4RAG: Detecting Hallucinations In Retrieval-augmented Generation Via Layer-wise Relevance Propagation Hu Haichuan, Sun Yuhan, Zhang Quanjun
- Lightweight Modality Adaptation To Sequential Recommendation Via Correlation Supervision Hu Hengchang, Liu Qijiong, Li Chuang, Kan Min-yen
- Outlier-efficient Hopfield Layers For Large Transformer-based Models Hu Jerry Yao-chieh, Chang Pei-hsuan, Luo Robin, Chen Hong-yu, Li Weijian, Wang Wei-po, Liu Han
- Openrlhf: An Easy-to-use, Scalable And High-performance RLHF Framework Hu Jian, Wu Xibin, Wang Weixun, Xianyu, Zhang Dehao, Cao Yu
- Hiagent: Hierarchical Working Memory Management For Solving Long-horizon Agent Tasks With Large Language Model Hu Mengkang, Chen Tianxing, Chen Qiguang, Mu Yao, Shao Wenqi, Luo Ping
- Agentgen: Enhancing Planning Abilities For Large Language Model Based Agent Via Environment And Task Generation Hu Mengkang, Zhao Pu, Xu Can, Sun Qingfeng, Lou Jianguang, Lin Qingwei, Luo Ping, Rajmohan Saravan, Zhang Dongmei
- SLM Meets LLM: Balancing Latency, Interpretability And Consistency In Hallucination Detection Hu Mengya, Xu Rui, Lei Deren, Li Yaxi, Wang Mingyu, Ching Emily, Kamal Eslam, Deng Alex
- Limited Out-of-context Knowledge Reasoning In Large Language Models Hu Peng, Gao Changjiang, Gao Ruiqi, Chen Jiajun, Huang Shujian
- Evaluating Robustness Of Generative Search Engine On Adversarial Factual Questions Hu Xuming, Li Xiaochuan, Chen Junzhe, Li Yinghui, Li Yangning, Li Xiaoguang, Wang Yasheng, Liu Qun, Wen Lijie, Yu Philip S., Guo Zhijiang
- Gentranslate: Large Language Models Are Generative Multilingual Speech And Machine Translators Hu Yuchen, Chen Chen, Yang Chao-han Huck, Li Ruizhe, Zhang Dong, Chen Zhehuai, Chng Eng Siong
- RAG And RAU: A Survey On Retrieval-augmented Language Model In Natural Language Processing Hu Yucheng, Lu Yuxing
- Prompt Perturbation In Retrieval-augmented Generation Based Large Language Models Hu Zhibo, Wang Chen, Shu Yanfeng, Paik Helen Hye-young, Zhu Liming
- V2xum-llm: Cross-modal Video Summarization With Temporal Prompt Instruction Tuning Hua Hang, Tang Yunlong, Xu Chenliang, Luo Jiebo
- Multimodal Task Vectors Enable Many-shot Multimodal In-context Learning Huang Brandon, Mitra Chancharik, Arbelle Assaf, Karlinsky Leonid, Darrell Trevor, Herzig Roei
- Instupr: Instruction-based Unsupervised Passage Reranking With Large Language Models Huang Chao-wei, Chen Yun-nung
- Learn When (not) To Trust Language Models: A Privacy-centric Adaptive Model-aware Approach Huang Chengkai, Wang Rui, Xie Kaige, Yu Tong, Yao Lina
- Recurrent Context Compression: Efficiently Expanding The Context Window Of LLM Huang Chensen, Zhu Guibo, Wang Xuepeng, Luo Yifei, Ge Guojing, Chen Haoran, Yi Dong, Wang Jinqiao
- LITA: Language Instructed Temporal-localization Assistant Huang De-an, Liao Shijia, Radhakrishnan Subhashree, Yin Hongxu, Molchanov Pavlo, Yu Zhiding, Kautz Jan
- Conme: Rethinking Evaluation Of Compositional Reasoning For Modern Vlms Huang Irene, Lin Wei, Mirza M. Jehanzeb, Hansen Jacob A., Doveh Sivan, Butoi Victor Ion, Herzig Roei, Arbelle Assaf, Kuehne Hilde, Darrell Trevor, Gan Chuang, Oliva Aude, Feris Rogerio, Karlinsky Leonid
- Context-aware Assistant Selection For Improved Inference Acceleration With Large Language Models Huang Jerry, Parthasarathi Prasanna, Rezagholizadeh Mehdi, Chandar Sarath
- Mmevalpro: Calibrating Multimodal Benchmarks Towards Trustworthy And Efficient Evaluation Huang Jinsheng, Chen Liang, Guo Taian, Zeng Fu, Zhao Yusheng, Wu Bohan, Yuan Ye, Zhao Haozhe, Guo Zhihui, Zhang Yichi, Yuan Jingyang, Ju Wei, Liu Luchen, Liu Tianyu, Chang Baobao, Zhang Ming
- Integrating Multi-scale Contextualized Information For Byte-based Neural Machine Translation Huang Langlin, Feng Yang
- Learning Fine-grained Grounded Citations For Attributed Large Language Models Huang Lei, Feng Xiaocheng, Ma Weitao, Gu Yuxuan, Zhong Weihong, Feng Xiachong, Yu Weijiang, Peng Weihua, Tang Duyu, Tu Dandan, Qin Bing
- Translate-and-revise: Boosting Large Language Models For Constrained Translation Huang Pengcheng, Mu Yongyu, Wu Yuzhang, Li Bei, Xiao Chunyang, Xiao Tong, Zhu Jingbo
- Selective Prompting Tuning For Personalized Conversations With Llms Huang Qiushi, Liu Xubo, Ko Tom, Wu Bo, Wang Wenwu, Zhang Yu, Tang Lilian
- Brainchat: Decoding Semantic Information From Fmri Using Vision-language Pretrained Models Huang Wanaiu
- Prompting Large Language Models With Knowledge Graphs For Question Answering Involving Long-tail Facts Huang Wenyu, Zhou Guancheng, Lapata Mirella, Vougiouklis Pavlos, Montella Sebastien, Pan Jeff Z.
- Queryagent: A Reliable And Efficient Reasoning Framework With Environmental Feedback-based Self-correction Huang Xiang, Cheng Sitao, Huang Shanshan, Shen Jiayu, Xu Yong, Zhang Chaoyun, Qu Yuzhong
- Lost In The Source Language: How Large Language Models Evaluate The Quality Of Machine Translation Huang Xu, Zhang Zhirui, Geng Xiang, Du Yichao, Chen Jiajun, Huang Shujian
- Key-point-driven Data Synthesis With Its Enhancement On Mathematical Reasoning Huang Yiming, Liu Xiao, Gong Yeyun, Gou Zhibin, Shen Yelong, Duan Nan, Chen Weizhu
- Leveraging Large Language Models For Enhanced NLP Task Performance Through Knowledge Distillation And Optimized Training Strategies Huang Yining, Tang Keke, Chen Meilian
- Obscureprompt: Jailbreaking Large Language Models Via Obscure Input Huang Yue, Tang Jingyu, Chen Dongping, Tang Bingda, Wan Yao, Sun Lichao, Zhang Xiangliang
- Compression Represents Intelligence Linearly Huang Yuzhen, Zhang Jinghan, Shan Zifei, He Junxian
- Tool Calling: Enhancing Medication Consultation Via Retrieval-augmented Large Language Models Huang Zhongzhen, Xue Kui, Fan Yongqi, Mu Linjie, Liu Ruoyu, Ruan Tong, Zhang Shaoting, Zhang Xiaofan
- Mindmerger: Efficient Boosting LLM Reasoning In Non-english Languages Huang Zixian, Zhu Wenhao, Cheng Gong, Li Lei, Yuan Fei
- Accelerating Pre-training Of Multimodal Llms Via Chain-of-sight Huang Ziyuan, Ji Kaixiang, Gong Biao, Qing Zhiwu, Zhang Qinglong, Zheng Kecheng, Wang Jian, Chen Jingdong, Yang Ming
- Towards A Psychology Of Machines: Large Language Models Predict Human Memory Huff Markus, Ulakçı Elanur
- Federated Recommendation Via Hybrid Retrieval Augmented Generation Huimin Zeng, Zhenrui Yue, Qian Jiang, Dong Wang
- Self-introspective Decoding: Alleviating Hallucinations For Large Vision-language Models Huo Fushuo, Xu Wenchao, Zhang Zhong, Wang Haozhao, Chen Zhicheng, Zhao Peilin
- The Promise And Challenges Of Using Llms To Accelerate The Screening Process Of Systematic Reviews Huotala Aleksi, Kuutila Miikka, Ralph Paul, Mäntylä Mika
- Romansetu: Efficiently Unlocking Multilingual Capabilities Of Large Language Models Via Romanization Husain Jaavid Aktar, Dabre Raj, Kumar Aswanth, Gala Jay, Jayakumar Thanmay, Puduppully Ratish, Kunchukuttan Anoop
- Promptdsi: Prompt-based Rehearsal-free Instance-wise Incremental Learning For Document Retrieval Huynh Tuan-luc, Vu Thuy-trang, Wang Weiqing, Wei Yinwei, Le Trung, Gasevic Dragan, Li Yuan-fang, Do Thanh-toan
- Transformerfam: Feedback Attention Is Working Memory Hwang Dongseong, Wang Weiran, Huo Zhuoyuan, Sim Khe Chai, Mengibar Pedro Moreno
- Self-explore To Avoid The Pit: Improving The Reasoning Capabilities Of Language Models With Fine-grained Rewards Hwang Hyeonbin, Kim Doyoung, Kim Seungone, Ye Seonghyeon, Seo Minjoon
- DSLR: Document Refinement With Sentence-level Re-ranking And Reconstruction To Enhance Retrieval-augmented Generation Hwang Taeho, Jeong Soyeong, Cho Sukmin, Han Seungyoon, Park Jong C.
- MP2D: An Automated Topic Shift Dialogue Generation Framework Leveraging Knowledge Graphs Hwang Yerin, Kim Yongil, Jang Yunah, Bang Jeesoo, Bae Hyunkyung, Jung Kyomin
- News Without Borders: Domain Adaptation Of Multilingual Sentence Embeddings For Cross-lingual News Recommendation Iana Andreea, Schmidt Fabian David, Glavaš Goran, Paulheim Heiko
- Simple And Scalable Strategies To Continually Pre-train Large Language Models Ibrahim Adam, Thérien Benjamin, Gupta Kshitij, Richter Mats L., Anthony Quentin, Lesort Timothée, Belilovsky Eugene, Rish Irina
- Beneath The Surface Of Consistency: Exploring Cross-lingual Knowledge Representation Sharing In Llms Ifergan Maxim, Choshen Leshem, Aharoni Roee, Szpektor Idan, Abend Omri
- Test-time Low Rank Adaptation Via Confidence Maximization For Zero-shot Generalization Of Vision-language Models Imam Raza, Gani Hanan, Huzaifa Muhammad, Nandakumar Karthik
- Improving Multilingual Instruction Finetuning Via Linguistically Natural And Diverse Datasets Indurthi Sathish Reddy, Zhou Wenxuan, Chollampatt Shamil, Agrawal Ravi, Song Kaiqiang, Zhao Lingxiao, Zhu Chenguang
- Grade Score: Quantifying LLM Performance In Option Selection Iourovitski Dmitri
- Facilitating Holistic Evaluations With Llms: Insights From Scenario-based Experiments Ishida Toru, Liu Tongxi, Wang Hailong, Cheung William K.
- Mapcoder: Multi-agent Code Generation For Competitive Problem Solving Islam Md. Ashraful, Ali Mohammed Eunus, Parvez Md Rizwan
- Elements Of World Knowledge (EWOK): A Cognition-inspired Framework For Evaluating Basic World Knowledge In Language Models Ivanova Anna A., Sathe Aalok, Lipkin Benjamin, Kumar Unnathi, Radkani Setayesh, Clark Thomas H., Kauf Carina, Hu Jennifer, Pramod R. T., Grand Gabriel, Paulun Vivian, Ryskina Maria, Akyürek Ekin, Wilcox Ethan, Rashid Nafisa, Choshen Leshem, Levy Roger, Fedorenko Evelina, Tenenbaum Joshua, Andreas Jacob
- Large Language Models For Behavioral Economics: Internal Validity And Elicitation Of Mental Models Jabarian Brian
- Meddoc-bot: A Chat Tool For Comparative Analysis Of Large Language Models In The Context Of The Pediatric Hypertension Guideline Jabarulla Mohamed Yaseen, Oeltze-jafra Steffen, Beerbaum Philipp, Uden Theodor
- Polyglotoxicityprompts: Multilingual Evaluation Of Neural Toxic Degeneration In Large Language Models Jain Devansh, Kumar Priyanshu, Gehman Samuel, Zhou Xuhui, Hartvigsen Thomas, Sap Maarten
- From RAG To RICHES: Retrieval Interlaced With Sequence Generation Jain Palak, Soares Livio Baldini, Kwiatkowski Tom
- Structsum Generation For Faster Text Comprehension Jain Parag, Marzoca Andreea, Piccinno Francesco
- LLM Agents Improve Semantic Code Search Jain Sarthak, Dora Aditya, Sam Ka Seng, Singh Prabhat
- Smart-infinity: Fast Large Language Model Training Using Near-storage Processing On A Real System Jang Hongsun, Song Jaeyong, Jung Jaewon, Park Jaeyoung, Kim Youngsok, Lee Jinho
- Evaluating Students' Open-ended Written Responses With Llms: Using The RAG Framework For GPT-3.5, GPT-4, Claude-3, And Mistral-large Jauhiainen Jussi S., Guerra Agustín Garagorry
- L4Q: Parameter Efficient Quantization-aware Fine-tuning On Large Language Models Jeon Hyesung, Kim Yulhwa, Kim Jae-joon
- RLSF: Reinforcement Learning Via Symbolic Feedback Jha Piyush, Jana Prithwish, Arora Arnav, Ganesh Vijay
- Memeguard: An LLM And Vlm-based Framework For Advancing Content Moderation Via Meme Intervention Jha Prince, Jain Raghav, Mandal Konika, Chadha Aman, Saha Sriparna, Bhattacharyya Pushpak
- Precision Empowers, Excess Distracts: Visual Question Answering With Dynamically Infused Knowledge In Language Models Jhalani Manas, M Annervaz K, Bhattacharyya Pushpak
- Towards Flexible Evaluation For Generative Visual Question Answering Ji Huishan, Si Qingyi, Lin Zheng, Wang Weiping
- Aligner: Efficient Alignment By Learning To Correct Ji Jiaming, Chen Boyuan, Lou Hantao, Hong Donghai, Zhang Borong, Pan Xuehai, Dai Juntao, Qiu Tianyi, Yang Yaodong
- Llms Could Autonomously Learn Without External Supervision Ji Ke, Chen Junying, Gao Anningzhe, Xie Wenya, Wan Xiang, Wang Benyou
- LLM Internal States Reveal Hallucination Risk Faced With A Query Ji Ziwei, Chen Delong, Ishii Etsuko, Cahyawijaya Samuel, Bang Yejin, Wilie Bryan, Fung Pascale
- ANAH: Analytical Annotation Of Hallucinations In Large Language Models Ji Ziwei, Gu Yuzhe, Zhang Wenwei, Lyu Chengqi, Lin Dahua, Chen Kai
- Knowledge Adaptation From Large Language Model To Recommendation For Practical Industrial Application Jia Jian, Wang Yipei, Li Yan, Chen Honggang, Bai Xuehan, Liu Zhaocheng, Liang Jian, Chen Quan, Li Han, Jiang Peng, Gai Kun
- Leveraging Llms For Dialogue Quality Measurement Jia Jinghan, Komma Abi, Leffel Timothy, Peng Xujun, Nagesh Ajay, Soliman Tamer, Galstyan Aram, Kumar Anoop
- Heuristic-enhanced Candidates Selection Strategy For Gpts Tackle Few-shot Aspect-based Sentiment Analysis Jiang Baoxing, Wan Yujie, Ju Shenggen
- Chatbug: A Common Vulnerability Of Aligned Llms Induced By Chat Templates Jiang Fengqing, Xu Zhangchen, Niu Luyao, Lin Bill Yuchen, Poovendran Radha
- Llm-collaboration On Automatic Science Journalism For The General Audience Jiang Gongyao, Shi Xinran, Luo Qiong
- The Real, The Better: Aligning Large Language Models With Online Human Behaviors Jiang Guanying, Yan Lingyong, Shi Haibo, Yin Dawei
- Leveraging Large Language Models For Learning Complex Legal Concepts Through Storytelling Jiang Hang, Zhang Xiajie, Mahari Robert, Kessler Daniel, Ma Eric, August Tal, Li Irene, Pentland Alex 'Sandy', Kim Yoon, Roy Deb, Kabbara Jad
- Minference 1.0: Accelerating Pre-filling For Long-context Llms Via Dynamic Sparse Attention Jiang Huiqiang, Li Yucheng, Zhang Chengruidong, Wu Qianhui, Luo Xufang, Ahn Surin, Han Zhenhua, Abdi Amir H., Li Dongsheng, Lin Chin-yew, Yang Yuqing, Qiu Lili
- Mix-cpt: A Domain Adaptation Framework Via Decoupling Knowledge Learning And Format Alignment Jiang Jinhao, Li Junyi, Zhao Wayne Xin, Song Yang, Zhang Tao, Wen Ji-rong
- Can Large Language Models Generate High-quality Patent Claims? Jiang Lekang, Zhang Caiqi, Scherz Pascal A, Goetz Stephan
- Utilizing Large Language Models To Detect Privacy Leaks In Mini-app Code Jiang Liming
- Seek And Solve Reasoning For Table Question Answering Jiang Ruya, Wang Chun, Deng Weihong
- SEED: Customize Large Language Models With Sample-efficient Adaptation For Code Generation Jiang Xue, Dong Yihong, Jin Zhi, Li Ge
- Prior Knowledge Integration Via LLM Encoding And Pseudo Event Regulation For Video Moment Retrieval Jiang Yiyang, Zhang Wengyu, Zhang Xulu, Wei Xiaoyong, Chen Chang Wen, Li Qing
- Retrieve, Summarize, Plan: Advancing Multi-hop Question Answering With An Iterative Approach Jiang Zhouyu, Sun Mengshu, Liang Lei, Zhang Zhiqiang
- Megascale: Scaling Large Language Model Training To More Than 10,000 Gpus Jiang Ziheng, Lin Haibin, Zhong Yinmin, Huang Qi, Chen Yangrui, Zhang Zhi, Peng Yanghua, Li Xiang, Xie Cong, Nong Shibiao, Jia Yulu, He Sun, Chen Hongmin, Bai Zhihao, Hou Qi, Yan Shipeng, Zhou Ding, Sheng Yiyao, Jiang Zhuo, Xu Haohan, Wei Haoran, Zhang Zhang, Nie Pengfei, Zou Leqi, Zhao Sida, Xiang Liang, Liu Zherui, Li Zhe, Jia Xiaoying, Ye Jianxi, Jin Xin, Liu Xin
- Duetrag: Collaborative Retrieval-augmented Generation Jiao Dian, Cai Li, Huang Jingsheng, Zhang Wenqiao, Tang Siliang, Zhuang Yueting
- Graph Chain-of-thought: Augmenting Large Language Models By Reasoning On Graphs Jin Bowen, Xie Chulin, Zhang Jiawei, Roy Kashob Kumar, Zhang Yu, Li Zheng, Li Ruirui, Tang Xianfeng, Wang Suhang, Meng Yu, Han Jiawei
- Ragcache: Efficient Knowledge Caching For Retrieval-augmented Generation Jin Chao, Zhang Zili, Jiang Xuanlin, Liu Fangyue, Liu Xin, Liu Xuanzhe, Jin Xin
- Zero-shot Chain-of-thought Reasoning Guided By Evolutionary Algorithms In Large Language Models Jin Feihu, Liu Yifan, Tan Ying
- GUARD: Role-playing To Generate Natural-language Jailbreakings To Test Guideline Adherence Of Large Language Models Jin Haibo, Chen Ruoxi, Zhou Andy, Zhang Yang, Wang Haohan
- BIDER: Bridging Knowledge Inconsistency For Efficient Retrieval-augmented Llms Via Key Supporting Evidence Jin Jiajie, Zhu Yutao, Zhou Yujia, Dou Zhicheng
- Crimson: Empowering Strategic Reasoning In Cybersecurity Through Large Language Models Jin Jiandong, Tang Bowen, Ma Mingxuan, Liu Xiao, Wang Yunfei, Lai Qingnan, Yang Jia, Zhou Changling
- HGT: Leveraging Heterogeneous Graph-enhanced Large Language Models For Few-shot Complex Table Understanding Jin Rihui, Li Yu, Qi Guilin, Hu Nan, Li Yuan-fang, Chen Jiaoyan, Wang Jianan, Chen Yongrui, Min Dehai
- Adaptive Skeleton Graph Decoding Jin Shuowei, Wu Yongji, Zheng Haizhong, Zhang Qingzhao, Lentz Matthew, Mao Z. Morley, Prakash Atul, Qian Feng, Zhuo Danyang
- MULTIVERSE: Exposing Large Language Model Alignment Problems In Diverse Worlds Jin Xiaolong, Zhang Zhuo, Zhang Xiangyu
- SMART: Automatically Scaling Down Language Models With Accuracy Guarantees For Reduced Processing Fees Jo Saehan, Trummer Immanuel
- Doing Personal LAPS: Llm-augmented Dialogue Construction For Personalized Multi-session Conversational Search Joko Hideaki, Chatterjee Shubham, Ramsay Andrew, De Vries Arjen P., Dalton Jeff, Hasibi Faegheh
- Investigating Multi-hop Factual Shortcuts In Knowledge Editing Of Large Language Models Ju Tianjie, Chen Yijin, Yuan Xinwei, Zhang Zhuosheng, Du Wei, Zheng Yubin, Liu Gongshen
- How Large Language Models Encode Context Knowledge? A Layer-wise Probing Study Ju Tianjie, Sun Weiwei, Du Wei, Yuan Xinwei, Ren Zhaochun, Liu Gongshen
- Llm-based Federated Recommendation Jujia Zhao, Wenjie Wang, Chen Xu, Zhaochun Ren, See-kiong Ng, Tat-seng Chua
- Trust Or Escalate: LLM Judges With Provable Guarantees For Human Agreement Jung Jaehun, Brahman Faeze, Choi Yejin
- CODE: Contrasting Self-generated Description To Combat Hallucination In Large Multi-modal Models Junho Kim, Hyunjun Kim, Yeonju Kim, Yong Man Ro
- LARA: Linguistic-adaptive Retrieval-augmented Llms For Multi-turn Intent Classification Junhua Liu, Keat Tan Yong, Bin Fu
- Introducing A New Hyper-parameter For RAG: Context Window Utilization Juvekar Kush, Purwar Anupam
- RAP: Retrieval-augmented Planning With Contextual Memory For Multimodal LLM Agents Kagaya Tomoyuki, Yuan Thong Jing, Lou Yuxuan, Karlekar Jayashree, Pranata Sugiri, Kinose Akira, Oguri Koki, Wick Felix, You Yang
- Evaluating The Impact Of Advanced LLM Techniques On Ai-lecture Tutors For A Robotics Course Kahl Sebastian, Löffler Felix, Maciol Martin, Ridder Fabian, Schmitz Marius, Spanagel Jennifer, Wienkamp Jens, Burgahn Christopher, Schilling Malte
- Slo-aware GPU Frequency Scaling For Energy Efficient LLM Inference Serving Kakolyris Andreas Kosmas, Masouros Dimosthenis, Vavaroutsos Petros, Xydis Sotirios, Soudris Dimitrios
- Understanding The Effect Of Using Semantically Meaningful Tokens For Visual Representation Learning Kalibhat Neha, Kattakinda Priyatham, Zarei Arman, Seleznev Nikita, Sharpe Samuel, Kumar Senthil, Feizi Soheil
- Counterfactuals As A Means For Evaluating Faithfulness Of Attribution Methods In Autoregressive Language Models Kamahi Sepehr, Yaghoobzadeh Yadollah
- Evaluating Gender Bias In Large Language Models Via Chain-of-thought Prompting Kaneko Masahiro, Bollegala Danushka, Okazaki Naoaki, Baldwin Timothy
- Prompt-rag: Pioneering Vector Embedding-free Retrieval-augmented Generation In Niche Domains, Exemplified By Korean Medicine Kang Bongsu, Kim Jundong, Yun Tae-rim, Kim Chang-eop
- Mindstar: Enhancing Math Reasoning In Pre-trained Llms At Inference Time Kang Jikun, Li Xin Zhe, Chen Xi, Kazemi Amirreza, Sun Qianyi, Chen Boxing, Li Dong, He Xu, He Quan, Wen Feng, Hao Jianye, Yao Jun
- Self-moe: Towards Compositional Large Language Models With Self-specialized Experts Kang Junmo, Karlinsky Leonid, Luo Hongyin, Wang Zhen, Hansen Jacob, Glass James, Cox David, Panda Rameswar, Feris Rogerio, Ritter Alan
- Unfamiliar Finetuning Examples Control How Language Models Hallucinate Kang Katie, Wallace Eric, Tomlin Claire, Kumar Aviral, Levine Sergey
- C-RAG: Certified Generation Risks For Retrieval-augmented Language Models Kang Mintong, Gürel Nezihe Merve, Yu Ning, Song Dawn, Li Bo
- Prompting Large Language Models With Audio For General-purpose Speech Summarization Kang Wonjune, Roy Deb
- Continuous Language Model Interpolation For Dynamic And Controllable Text Generation Kangaslahti Sara, Alvarez-melis David
- MEDIC: Towards A Comprehensive Framework For Evaluating Llms In Clinical Applications Kanithi Praveen K, Christophe Clément, Pimentel Marco Af, Raha Tathagata, Saadi Nada, Javed Hamza, Maslenkova Svetlana, Hayat Nasir, Rajan Ronnie, Khan Shadab
- Visualizing Dialogues: Enhancing Image Selection Through Dialogue Understanding With Large Language Models Kao Chang-sheng, Chen Yun-nung
- Instruct, Not Assist: Llm-based Multi-turn Planning And Hierarchical Questioning For Socratic Code Debugging Kargupta Priyanka, Agarwal Ishika, Hakkani-tur Dilek, Han Jiawei
- Can Reinforcement Learning Unlock The Hidden Dangers In Aligned Large Language Models? Karkevandi Mohammad Bahrami, Vishwamitra Nishant, Najafirad Peyman
- Finding A Needle In The Adversarial Haystack: A Targeted Paraphrasing Approach For Uncovering Edge Cases With Minimal Distribution Distortion Kassem Aly M., Saad Sherif
- Faaf: Facts As A Function For The Evaluation Of Generated Text Katranidis Vasileios, Barany Gabor
- Instruct-skillmix: A Powerful Pipeline For LLM Instruction Tuning Kaur Simran, Park Simon, Goyal Anirudh, Arora Sanjeev
- Codeaid: Evaluating A Classroom Deployment Of An Llm-based Programming Assistant That Balances Student And Educator Needs Kazemitabaar Majeed, Ye Runlong, Wang Xiaoning, Henley Austin Z., Denny Paul, Craig Michelle, Grossman Tovi
- Exploring The Frontiers Of Llms In Psychological Applications: A Comprehensive Review Ke Luoma, Tong Song, Cheng Peng, Peng Kaiping
- Development And Testing Of Retrieval Augmented Generation In Large Language Models -- A Case Study Report Ke Yuhe, Jin Liyuan, Elangovan Kabilan, Abdullah Hairil Rizal, Liu Nan, Sia Alex Tiong Heng, Soh Chai Rick, Tung Joshua Yi Min, Ong Jasmine Chiat Ling, Ting Daniel Shu Wei
- Bridging The Preference Gap Between Retrievers And Llms Ke Zixuan, Kong Weize, Li Cheng, Zhang Mingyang, Mei Qiaozhu, Bendersky Michael
- Retrieval-augmented Conversational Recommendation With Prompt-based Semi-structured Natural Language State Tracking Kemper Sara, Cui Justin, Dicarlantonio Kai, Lin Kathy, Tang Danjie, Korikov Anton, Sanner Scott
- Autonomous Prompt Engineering In Large Language Models Kepel Daan, Valogianni Konstantina
- Debating With More Persuasive Llms Leads To More Truthful Answers Khan Akbir, Hughes John, Valentine Dan, Ruis Laura, Sachan Kshitij, Radhakrishnan Ansh, Grefenstette Edward, Bowman Samuel R., Rocktäschel Tim, Perez Ethan
- Tabular Embedding Model (TEM): Finetuning Embedding Models For Tabular RAG Applications Khanna Sujit, Subedi Shishir
- ARGS: Alignment As Reward-guided Search Khanov Maxim, Burapacheep Jirayu, Li Yixuan
- Llms In The Loop: Leveraging Large Language Model Annotations For Active Learning In Low-resource Languages Kholodna Nataliia, Julka Sahib, Khodadadi Mohammad, Gumus Muhammed Nurullah, Granitzer Michael
- Keyword-driven Retrieval-augmented Large Language Models For Cold-start User Recommendations Kieu Hai-dang, Nguyen Minh Duc, Nguyen Thanh-son, Le Dung D.
- Ontology-free General-domain Knowledge Graph-to-text Generation Dataset Synthesis Using Large Language Model Kim Daehee, Kang Deokhyung, Ryu Sangwon, Lee Gary Geunbae
- Aligning Large Language Models With Self-generated Preference Data Kim Dongyoung, Lee Kimin, Shin Jinwoo, Kim Jaehyung
- Small Language Models Learn Enhanced Reasoning Skills From Medical Textbooks Kim Hyunjae, Hwang Hyeon, Lee Jiwoo, Park Sihyeon, Kim Dain, Lee Taewhoo, Yoon Chanwoong, Sohn Jiwoong, Choi Donghee, Kang Jaewoo
- Fine-tuning CLIP Text Encoders With Two-step Paraphrasing Kim Hyunjae, Yoon Seunghyun, Bui Trung, Zhao Handong, Tran Quan, Dernoncourt Franck, Kang Jaewoo
- Few-shot Personalization Of Llms With Mis-aligned Responses Kim Jaehyung, Yang Yiming
- From RAG To QA-RAG: Integrating Generative AI For Pharmaceutical Regulatory Compliance Process Kim Jaewoong, Min Moohong
- Finer: Investigating And Enhancing Fine-grained Visual Concept Recognition In Large Vision Language Models Kim Jeonghwan, Ji Heng
- Review-driven Personalized Preference Reasoning With Large Language Models For Recommendation Kim Jieyong, Kim Hyunseo, Cho Hyunjin, Kang Seongku, Chang Buru, Yeo Jinyoung, Lee Dongha
- Enhancing Psychotherapy Counseling: A Data Augmentation Pipeline Leveraging Large Language Models For Counseling Conversations Kim Jun-woo, Han Ji-eun, Koh Jun-seok, Seo Hyeon-tae, Chang Du-seong
- RE-RAG: Improving Open-domain QA Performance And Interpretability With Relevance Estimator In Retrieval-augmented Generation Kim Kiseung, Lee Jay-yoon
- Advisorqa: Towards Helpful And Harmless Advice-seeking Question Answering With Collective Intelligence Kim Minbeom, Lee Hwanhee, Park Joonsuk, Lee Hwaran, Jung Kyomin
- Augmenting Query And Passage For Retrieval-augmented Generation Using Llms For Open-domain Question Answering Kim Minsang, Park Cheoneum, Baek Seungjun
- Automatic Jailbreaking Of The Text-to-image Generative AI Systems Kim Minseon, Lee Hyomin, Gong Boqing, Zhang Huishuai, Hwang Sung Ju
- Large Language Models Meet Collaborative Filtering: An Efficient All-round Llm-based Recommender System Kim Sein, Kang Hongseok, Choi Seungyoon, Kim Donghyun, Yang Minchul, Park Chanyoung
- Llmem: Estimating GPU Memory Usage For Fine-tuning Pre-trained Llms Kim Taeho, Wang Yanming, Chaturvedi Vatshank, Gupta Lokesh, Kim Seyeon, Kwon Yongin, Ha Sangtae
- Learning To Explore And Select For Coverage-conditioned Retrieval-augmented Generation Kim Takyoung, Lee Kyungjae, Jang Young Rok, Cho Ji Yong, Kim Gangwoo, Cho Minseok, Lee Moontae
- A Survey On Integration Of Large Language Models With Intelligent Robots Kim Yeseung, Kim Dohyun, Choi Jieun, Park Jisang, Oh Nayoung, Park Daehyung
- Adaptive Contrastive Decoding In Retrieval-augmented Generation For Handling Noisy Contexts Kim Youna, Kim Hyuhng Joon, Park Cheonbok, Park Choonghyun, Cho Hyunsoo, Kim Junyeob, Yoo Kang Min, Lee Sang-goo, Kim Taeuk
- What's Wrong? Refining Meeting Summaries With LLM Feedback Kirstein Frederic, Ruas Terry, Gipp Bela
- LOGIC-LM++: Multi-step Refinement For Symbolic Formulations Kirtania Shashank, Gupta Priyanshu, Radhakirshna Arjun
- Evaluating Telugu Proficiency In Large Language Models: A Comparative Analysis Of Chatgpt And Gemini Kishore Katikela Sreeharsha, Shaik Rahimanuddin
- From Bytes To Borsch: Fine-tuning Gemma And Mistral For The Ukrainian Language Representation Kiulian Artur, Polishko Anton, Khandoga Mykola, Chubych Oryna, Connor Jack, Ravishankar Raghav, Shirawalmath Adarsh
- Contrastive Perplexity For Controlled Generation: An Application In Detoxifying Large Language Models Klein Tassilo, Nabi Moin
- Extended Mind Transformers Klett Phoebe, Ahle Thomas
- Native Vs Non-native Language Prompting: A Comparative Analysis Kmainasi Mohamed Bayan, Khan Rakif, Shahroor Ali Ezzat, Bendou Boushra, Hasanain Maram, Alam Firoj
- Growover: How Can Llms Adapt To Growing Real-world Knowledge? Ko Dayoon, Kim Jinyoung, Choi Hahyeon, Kim Gunhee
- Investigating How Large Language Models Leverage Internal Knowledge To Perform Complex Reasoning Ko Miyoung, Park Sue Hyun, Park Joonsuk, Seo Minjoon
- Chatspamdetector: Leveraging Large Language Models For Effective Phishing Email Detection Koide Takashi, Fukushi Naoki, Nakano Hiroki, Chiba Daiki
- Blending Llms Into Cascaded Speech Translation: Kit's Offline Speech Translation System For IWSLT 2024 Koneru Sai, Nguyen Thai-binh, Pham Ngoc-quan, Liu Danni, Li Zhaolin, Waibel Alexander, Niehues Jan
- Self-prompt Tuning: Enable Autonomous Role-playing In Llms Kong Aobo, Zhao Shiwan, Chen Hao, Li Qicheng, Qin Yong, Sun Ruiqi, Zhou Xin, Zhou Jiaming, Sun Haoqin
- Labeling Supervised Fine-tuning Data With The Scaling Law Kong Huanjun
- Controllable Navigation Instruction Generation With Chain Of Thought Prompting Kong Xianghao, Chen Jinyu, Wang Wenguan, Su Hang, Hu Xiaolin, Yang Yi, Liu Si
- Customizing Language Models With Instance-wise Lora For Sequential Recommendation Kong Xiaoyu, Wu Jiancan, Zhang An, Sheng Leheng, Lin Hui, Wang Xiang, He Xiangnan
- QPO: Query-dependent Prompt Optimization Via Multi-loop Offline Reinforcement Learning Kong Yilun, Mao Hangyu, Zhao Qi, Zhang Bin, Ruan Jingqing, Shen Li, Chang Yongzhe, Wang Xueqian, Zhao Rui, Tao Dacheng
- Proptest: Automatic Property Testing For Improved Visual Programming Koo Jaywon, Yang Ziyan, Cascante-bonilla Paola, Ray Baishakhi, Ordonez Vicente
- Bitune: Bidirectional Instruction-tuning Kopiczko Dawid J., Blankevoort Tijmen, Asano Yuki M.
- Studying Large Language Model Behaviors Under Realistic Knowledge Conflicts Kortukov Evgenii, Rubinstein Alexander, Nguyen Elisa, Oh Seong Joon
- Transllama: Llm-based Simultaneous Translation System Koshkin Roman, Sudoh Katsuhito, Nakamura Satoshi
- Hints-in-browser: Benchmarking Language Models For Programming Feedback Generation Kotalwar Nachiket, Gotovos Alkis, Singla Adish
- Leveraging Large Language Models For Enhanced Process Model Comprehension Kourani Humam, Berti Alessandro, Henrich Jasmin, Kratsch Wolfgang, Weidlich Robin, Li Chiao-yun, Arslan Ahmad, Schuster Daniel, Van Der Aalst Wil M. P.
- Token Trails: Navigating Contextual Depths In Conversational AI With Chatllm Kowsher Md., Panditi Ritesh, Prottasha Nusrat Jahan, Bhat Prakash, Bairagi Anupam Kumar, Arefin Mohammad Shamsul
- Large Language Models For Tuning Evolution Strategies Kramer Oliver
- Retrieval-augmented Code Generation For Situated Action Generation: A Case Study On Minecraft Kranti Chalamalasetti, Hakimov Sherzod, Schlangen David
- From Data To Commonsense Reasoning: The Use Of Large Language Models For Explainable AI Krause Stefanie, Stolzenburg Frieder
- Can Better Text Semantics In Prompt Tuning Improve VLM Generalization? Kuchibhotla Hari Chandana, Kancheti Sai Srinivas, Reddy Abbavaram Gowtham, Balasubramanian Vineeth N
- Reinforcement Learning For Optimizing RAG For Domain Chatbots Kulkarni Mandar, Tangarajan Praveen, Kim Kyung, Trivedi Anusua
- Confidence Under The Hood: An Investigation Into The Confidence-probability Alignment In Large Language Models Kumar Abhishek, Morabito Robert, Umbet Sanzhar, Kabbara Jad, Emami Ali
- Multi-modal Generation Via Cross-modal In-context Learning Kumar Amandeep, Naseer Muzammal, Narayan Sanath, Anwer Rao Muhammad, Khan Salman, Cholakkal Hisham
- Bridging The Gap: Dynamic Learning Strategies For Improving Multilingual Performance In Llms Kumar Somnath, Balloli Vaibhav, Ranjit Mercy, Ahuja Kabir, Ganu Tanuja, Sitaram Sunayana, Bali Kalika, Nambi Akshay
- Babilong: Testing The Limits Of Llms With Long Context Reasoning-in-a-haystack Kuratov Yuri, Bulatov Aydar, Anokhin Petr, Rodkin Ivan, Sorokin Dmitry, Sorokin Artyom, Burtsev Mikhail
- Teaching Llama A New Language Through Cross-lingual Knowledge Transfer Kuulmets Hele-andra, Purason Taido, Luhtaru Agnes, Fishel Mark
- Mt-eval: A Multi-turn Capabilities Evaluation Benchmark For Large Language Models Kwan Wai-chung, Zeng Xingshan, Jiang Yuxin, Wang Yufei, Li Liangyou, Shang Lifeng, Jiang Xin, Liu Qun, Wong Kam-fai
- Continual Learning Optimizations For Auto-regressive Decoder Of Multilingual ASR Systems Kwok Chin Yuen, Yip Jia Qi, Chng Eng Siong
- Can Github Issues Be Solved With Tree Of Thoughts? La Rosa Ricardo, Hulse Corey, Liu Bangdi
- When To Retrieve: Teaching Llms To Utilize Information Retrieval Effectively Labruna Tiziano, Campos Jon Ander, Azkune Gorka
- Llms Beyond English: Scaling The Multilingual Capability Of Llms With Cross-lingual Feedback Lai Wen, Mesgar Mohsen, Fraser Alexander
- Cat-bench: Benchmarking Language Model Understanding Of Causal And Temporal Dependencies In Plans Lal Yash Kumar, Cohen Vanya, Chambers Nathanael, Balasubramanian Niranjan, Mooney Raymond
- Self-directed Synthetic Dialogues And Revisions Technical Report Lambert Nathan, Schoelkopf Hailey, Gokaslan Aaron, Soldaini Luca, Pyatkin Valentina, Castricato Louis
- FOCUS: Forging Originality Through Contrastive Use In Self-plagiarism For Language Models Lan Kaixin, Fang Tao, Wong Derek F., Xu Yabo, Chao Lidia S., Zhao Cecilia G.
- Melting Point: Mobile Evaluation Of Language Transformers Laskaridis Stefanos, Katevas Kleomenis, Minto Lorenzo, Haddadi Hamed
- Sparse Rewards Can Self-train Dialogue Agents Lattimer Barrett Martin, Gangal Varun, Mcdonald Ryan, Yang Yi
- MATTER: Memory-augmented Transformer Using Heterogeneous Knowledge Sources Lee Dongkyu, Prakash Chandana Satya, Fitzgerald Jack, Lehmann Jens
- Can Long-context Language Models Subsume Retrieval, RAG, SQL, And More? Lee Jinhyuk, Chen Anthony, Dai Zhuyun, Dua Dheeru, Sachan Devendra Singh, Boratko Michael, Luan Yi, Arnold Sébastien M. R., Perot Vincent, Dalmia Siddharth, Hu Hexiang, Lin Xudong, Pasupat Panupong, Amini Aida, Cole Jeremy R., Riedel Sebastian, Naim Iftekhar, Chang Ming-wei, Guu Kelvin
- Gecko: Versatile Text Embeddings Distilled From Large Language Models Lee Jinhyuk, Dai Zhuyun, Ren Xiaoqi, Chen Blair, Cer Daniel, Cole Jeremy R., Hui Kai, Boratko Michael, Kapadia Rajvi, Ding Wen, Luan Yi, Duddu Sai Meher Karthik, Abrego Gustavo Hernandez, Shi Weiqiang, Gupta Nithi, Kusupati Aditya, Jain Prateek, Jonnalagadda Siddhartha Reddy, Chang Ming-wei, Naim Iftekhar
- Plagbench: Exploring The Duality Of Large Language Models In Plagiarism Generation And Detection Lee Jooyoung, Agrawal Toshini, Uchendu Adaku, Le Thai, Chen Jinghui, Lee Dongwon
- LRQ: Optimizing Post-training Quantization For Large Language Models By Learning Low-rank Weight-scaling Matrices Lee Jung Hyun, Kim Jeonghoon, Yang June Yong, Kwon Se Jung, Yang Eunho, Yoo Kang Min, Lee Dongsoo
- Multimodal Reasoning With Multimodal Knowledge Graph Lee Junlin, Wang Yequan, Li Jing, Zhang Min
- Reinforcement Learning From Reflective Feedback (RLRF): Aligning And Improving Llms Via Fine-grained Self-reflection Lee Kyungjae, Hwang Dasol, Park Sunghyun, Jang Youngsoo, Lee Moontae
- Prompting Large Language Models For Zero-shot Essay Scoring Via Multi-trait Specialization Lee Sanwoo, Cai Yida, Meng Desong, Wang Ziyang, Wu Yunfang
- Llava-docent: Instruction Tuning With Multimodal Large Language Model To Support Art Appreciation Education Lee Unggi, Jeon Minji, Lee Yunseo, Byun Gyuri, Son Yoorim, Shin Jaeyoon, Ko Hongkyu, Kim Hyeoncheol
- Infinigen: Efficient Generative Inference Of Large Language Models With Dynamic KV Cache Management Lee Wonbeom, Lee Jungi, Seo Junghwan, Sim Jaewoong
- Chatgpt As Research Scientist: Probing Gpt's Capabilities As A Research Librarian, Research Ethicist, Data Generator And Data Predictor Lehr Steven A., Caliskan Aylin, Liyanage Suneragiri, Banaji Mahzarin R.
- Materials Science In The Era Of Large Language Models: A Perspective Lei Ge, Docherty Ronan, Cooper Samuel J.
- Meta-task Prompting Elicits Embeddings From Large Language Models Lei Yibin, Wu Di, Zhou Tianyi, Shen Tao, Cao Yu, Tao Chongyang, Yates Andrew
- Incorporating External Knowledge And Goal Guidance For Llm-based Conversational Recommender Systems Li Chuang, Deng Yang, Hu Hengchang, Kan Min-yen, Li Haizhou
- Searchlvlms: A Plug-and-play Framework For Augmenting Large Vision-language Models By Searching Up-to-date Internet Knowledge Li Chuanhao, Li Zhen, Jing Chenchen, Liu Shuo, Shao Wenqi, Wu Yuwei, Luo Ping, Qiao Yu, Zhang Kaipeng
- On The Role Of Long-tail Knowledge In Retrieval Augmented Large Language Models Li Dongyang, Yan Junbing, Zhang Taolin, Wang Chengyu, He Xiaofeng, Huang Longtao, Xue Hui, Huang Jun
- A Study On Training And Developing Large Language Models For Behavior Tree Generation Li Fu, Wang Xueying, Li Bin, Wu Yunlong, Wang Yanzhen, Yi Xiaodong
- "is Chatgpt A Better Explainer Than My Professor?": Evaluating The Explanation Capabilities Of Llms In Conversation Compared To A Human Baseline Li Grace, Alshomary Milad, Muresan Smaranda
- Gradient-mask Tuning Elevates The Upper Limits Of LLM Performance Li Haoling, Zhang Xin, Liu Xiao, Gong Yeyun, Wang Yifan, Yang Yujiu, Chen Qi, Cheng Peng
- Datacomp-lm: In Search Of The Next Generation Of Training Sets For Language Models Li Jeffrey, Fang Alex, Smyrnis Georgios, Ivgi Maor, Jordan Matt, Gadre Samir, Bansal Hritik, Guha Etash, Keh Sedrick, Arora Kushal, Garg Saurabh, Xin Rui, Muennighoff Niklas, Heckel Reinhard, Mercat Jean, Chen Mayee, Gururangan Suchin, Wortsman Mitchell, Albalak Alon, Bitton Yonatan, Nezhurina Marianna, Abbas Amro, Hsieh Cheng-yu, Ghosh Dhruba, Gardner Josh, Kilian Maciej, Zhang Hanlin, Shao Rulin, Pratt Sarah, Sanyal Sunny, Ilharco Gabriel, Daras Giannis, Marathe Kalyani, Gokaslan Aaron, Zhang Jieyu, Chandu Khyathi, Nguyen Thao, Vasiljevic Igor, Kakade Sham, Song Shuran, Sanghavi Sujay, Faghri Fartash, Oh Sewoong, Zettlemoyer Luke, Lo Kyle, El-nouby Alaaeldin, Pouransari Hadi, Toshev Alexander, Wang Stephanie, Groeneveld Dirk, Soldaini Luca, Koh Pang Wei, Jitsev Jenia, Kollar Thomas, Dimakis Alexandros G., Carmon Yair, Dave Achal, Schmidt Ludwig, Shankar Vaishaal
- MT-PATCHER: Selective And Extendable Knowledge Distillation From Large Language Models For Machine Translation Li Jiahuan, Cheng Shanbo, Huang Shujian, Chen Jiajun
- Know The Unknown: An Uncertainty-sensitive Method For LLM Instruction Tuning Li Jiaqi, Tang Yixuan, Yang Yi
- Distilling Algorithmic Reasoning From Llms Via Explaining Solution Programs Li Jierui, Mooney Raymond
- Steering Llms Towards Unbiased Responses: A Causality-guided Debiasing Framework Li Jingling, Tang Zeyu, Liu Xiaoyu, Spirtes Peter, Zhang Kun, Leqi Liu, Liu Yang
- Stylechat: Learning Recitation-augmented Memory In Llms For Stylized Dialogue Generation Li Jinpeng, Zhang Zekai, Tu Quan, Cheng Xin, Zhao Dongyan, Yan Rui
- Experimenting With Legal AI Solutions: The Case Of Question-answering For Access To Justice Li Jonathan, Bhambhoria Rohan, Dahan Samuel, Zhu Xiaodan
- Think Twice Before Trusting: Self-detection For Large Language Models Through Comprehensive Answer Reflection Li Moxin, Wang Wenjie, Feng Fuli, Zhu Fengbin, Wang Qifan, Chua Tat-seng
- From Text To Insight: Leveraging Large Language Models For Performance Evaluation In Management Li Ning, Zhou Huaikang, Xu Mingze
- Owlore: Outlier-weighed Layerwise Sampled Low-rank Projection For Memory-efficient LLM Fine-tuning Li Pengxiang, Yin Lu, Gao Xiaowei, Liu Shiwei
- Coevol: Constructing Better Responses For Instruction Finetuning Through Multi-agent Cooperation Li Renhao, Tan Minghuan, Wong Derek F., Yang Min
- Prompt4vis: Prompting Large Language Models With Example Mining And Schema Filtering For Tabular Data Visualization Li Shuaimin, Chen Xuanang, Song Yuanfeng, Song Yunze, Zhang Chen
- FSMR: A Feature Swapping Multi-modal Reasoning Approach With Joint Textual And Visual Clues Li Shuang, Wang Jiahua, Wen Lijie
- Temporal Insight Enhancement: Mitigating Temporal Hallucination In Multimodal Large Language Models Li Sun, Liuan Wang, Jun Sun, Takayuki Okatani
- Model-enhanced Llm-driven VUI Testing Of VPA Apps Li Suwan, Bu Lei, Bai Guangdong, Xie Fuman, Chen Kai, Yue Chang
- Reinforcement Learning With Token-level Feedback For Controllable Text Generation Li Wendi, Wei Wei, Xu Kaihe, Xie Wenfeng, Chen Dangyang, Cheng Yu
- Uio-llms: Unbiased Incremental Optimization For Long-context Llms Li Wenhao, Lin Mingbao, Zhong Yunshan, Yan Shuicheng, Ji Rongrong
- MM-TTS: A Unified Framework For Multimodal, Prompt-induced Emotional Text-to-speech Synthesis Li Xiang, Cheng Zhi-qi, He Jun-yan, Peng Xiaojiang, Hauptmann Alexander G.
- What If We Recaption Billions Of Web Images With Llama-3? Li Xianhang, Tu Haoqin, Hui Mude, Wang Zeyu, Zhao Bingchen, Xiao Junfei, Ren Sucheng, Mei Jieru, Liu Qing, Zheng Huangjie, Zhou Yuyin, Xie Cihang
- Grounded Compositional And Diverse Text-to-3d With Pretrained Multi-view Diffusion Model Li Xiaolong, Mo Jiawei, Wang Ying, Parameshwara Chethan, Fei Xiaohan, Swaminathan Ashwin, Taylor Cj, Tu Zhuowen, Favaro Paolo, Soatto Stefano
- Evaluating Mathematical Reasoning Of Large Language Models: A Focus On Error Identification And Correction Li Xiaoyuan, Wang Wenjie, Li Moxin, Guo Junrong, Zhang Yang, Feng Fuli
- Leveraging Large Language Model As Simulated Patients For Clinical Education Li Yanzeng, Zeng Cheng, Zhong Jialun, Zhang Ruoyu, Zhang Minhao, Zou Lei
- The First Place Solution Of WSDM Cup 2024: Leveraging Large Language Models For Conversational Multi-doc QA Li Yiming, Zhang Zhao
- VSTAR: Generative Temporal Nursing For Longer Dynamic Video Synthesis Li Yumeng, Beluch William, Keuper Margret, Zhang Dan, Khoreva Anna
- Large Language Models For Human-like Autonomous Driving: A Survey Li Yun, Katsumata Kai, Javanmardi Ehsan, Tsukada Manabu
- Llms Meet Long Video: Advancing Long Video Question Answering With An Interactive Visual Adapter In Llms Li Yunxin, Chen Xinyu, Hu Baotian, Zhang Min
- Cognitive Visual-language Mapper: Advancing Multimodal Comprehension With Enhanced Visual Knowledge Alignment Li Yunxin, Chen Xinyu, Hu Baotian, Shi Haoyuan, Zhang Min
- Text-to-model: Text-conditioned Neural Network Diffusion For Train-once-for-all Personalization Li Zexi, Gao Lingzhi, Wu Chao
- Understanding And Patching Compositional Reasoning In Llms Li Zhaoyi, Jiang Gangwei, Xie Hong, Song Linqi, Lian Defu, Wei Ying
- Wikitableedit: A Benchmark For Table Editing By Natural Language Instruction Li Zheng, Chen Xiang, Wan Xiaojun
- EAGLE: Elevating Geometric Reasoning Through Llm-empowered Visual Instruction Tuning Li Zhihao, Du Yao, Liu Yang, Zhang Yan, Liu Yufang, Zhang Mengdi, Cai Xunliang
- Refiner: Restructure Retrieval Content Efficiently To Advance Question-answering Capabilities Li Zhonghao, Hu Xuming, Liu Aiwei, Zheng Kening, Huang Sirui, Xiong Hui
- Synthesize Step-by-step: Tools, Templates And Llms As Data Generators For Reasoning-based Chart VQA Li Zhuowan, Jasani Bhavan, Tang Peng, Ghadar Shabnam
- Retrieval Augmented Generation Or Long-context Llms? A Comprehensive Study And Hybrid Approach Li Zhuowan, Li Cheng, Zhang Mingyang, Mei Qiaozhu, Bendersky Michael
- Intermediate Distillation: Data-efficient Distillation From Black-box Llms For Information Retrieval Li Zizhong, Zhang Haopeng, Zhang Jiawei
- Recai: Leveraging Large Language Models For Next-generation Recommender Systems Lian Jianxun, Lei Yuxuan, Huang Xu, Yao Jing, Xu Wei, Xie Xing
- Bytecomposer: A Human-like Melody Composition Method Based On Language Model Agent Liang Xia, Du Xingjian, Lin Jiaju, Zou Pei, Wan Yuan, Zhu Bilei
- Task Oriented In-domain Data Augmentation Liang Xiao, Hu Xinyu, Zuo Simiao, Gong Yeyun, Lou Qiang, Liu Yi, Huang Shao-lun, Jiao Jian
- Languashrink: Reducing Token Overhead With Psycholinguistics Liang Xuechen, Tao Meiling, Xia Yinghui, Shi Tianyu, Wang Jun, Yang Jingsong
- Empowering Large Language Models To Set Up A Knowledge Retrieval Indexer Via Self-learning Liang Xun, Niu Simin, Li Zhiyu, Zhang Sensen, Song Shichao, Wang Hanyu, Yang Jiawei, Xiong Feiyu, Tang Bo, Xi Chenyang
- Internal Consistency And Self-feedback In Large Language Models: A Survey Liang Xun, Song Shichao, Zheng Zifan, Wang Hanyu, Yu Qingchen, Li Xunkai, Li Rong-hua, Cheng Peng, Wang Zhonghao, Xiong Feiyu, Li Zhiyu
- I-SHEEP: Self-alignment Of LLM From Scratch Through An Iterative Self-enhancement Paradigm Liang Yiming, Zhang Ge, Qu Xingwei, Zheng Tianyu, Guo Jiawei, Du Xinrun, Yang Zhenzhu, Liu Jiaheng, Lin Chenghua, Ma Lei, Huang Wenhao, Zhang Jiajun
- Learning To Trust Your Feelings: Leveraging Self-awareness In Llms For Hallucination Mitigation Liang Yuxin, Song Zhuoyang, Wang Hao, Zhang Jiaxing
- IKUN For WMT24 General MT Task: Llms Are Here For Multilingual Machine Translation Liao Baohao, Herold Christian, Khadivi Shahram, Monz Christof
- 3-in-1: 2D Rotary Adaptation For Efficient Finetuning, Efficient Batching And Composability Liao Baohao, Monz Christof
- Imagination Augmented Generation: Learning To Imagine Richer Context For Question Answering Over Large Language Models Liao Huanxuan, He Shizhu, Xu Yao, Zhang Yuanzhe, Liu Kang, Liu Shengping, Zhao Jun
- Enhancing Reinforcement Learning With Label-sensitive Reward For Natural Language Understanding Liao Kuo, Li Shuang, Zhao Meng, Liu Liqun, Xue Mengge, Hu Zhenyu, Han Honglin, Yin Chengguo
- E2LLM: Encoder Elongated Large Language Models For Long-context Understanding And Reasoning Liao Zihan, Wang Jun, Yu Hang, Wei Lingxiao, Li Jianguo, Wang Jun, Zhang Wei
- Adapting Mental Health Prediction Tasks For Cross-lingual Learning Via Meta-training And In-context Learning With Large Language Model Lifelo Zita, Ning Huansheng, Dhelim Sahraoui
- Retrieval-augmented Natural Language Reasoning For Explainable Visual Question Answering Lim Su Hyeon, Kim Minkuk, Kim Hyeon Bae, Kim Seong Tae
- Tree-based Hard Attention With Self-motivation For Large Language Models Lin Chenxi, Ren Jiayu, He Guoxiu, Jiang Zhuoren, Yu Haiyan, Zhu Xiaomin
- Probing Large Language Models For Scalar Adjective Lexical Semantics And Scalar Diversity Pragmatics Lin Fangru, Altshuler Daniel, Pierrehumbert Janet B.
- Emojicrypt: Prompt Encryption For Secure Communication With Large Language Models Lin Guo, Hua Wenyue, Zhang Yongfeng
- How Can I Get It Right? Using GPT To Rephrase Incorrect Trainee Responses Lin Jionghao, Han Zifei, Thomas Danielle R., Gurung Ashish, Gupta Shivang, Aleven Vincent, Koedinger Kenneth R.
- Syncpkl: Harnessing Llms To Generate Synthetic Data For Commonsense Persona Knowledge Linking Lin Kuan-yen
- Single Character Perturbations Break LLM Alignment Lin Leon, Brown Hannah, Kawaguchi Kenji, Shieh Michael
- Mala-500: Massive Language Adaptation Of Large Language Models Lin Peiqin, Ji Shaoxiong, Tiedemann Jörg, Martins André F. T., Schütze Hinrich
- FLAME: Factuality-aware Alignment For Large Language Models Lin Sheng-chieh, Gao Luyu, Oguz Barlas, Xiong Wenhan, Lin Jimmy, Yih Wen-tau, Chen Xilun
- Draw-and-understand: Leveraging Visual Prompts To Enable Mllms To Comprehend What You Want Lin Weifeng, Wei Xinyu, An Ruichuan, Gao Peng, Zou Bocheng, Luo Yulin, Huang Siyuan, Zhang Shanghang, Li Hongsheng
- Improving Visual Storytelling With Multimodal Large Language Models Lin Xiaochuan, Chen Xiangyong
- Diversedialogue: A Methodology For Designing Chatbots With Human-like Diversity Lin Xiaoyu, Yu Xinkai, Aich Ankit, Giorgi Salvatore, Ungar Lyle
- Rethinking Visual Prompting For Multimodal Large Language Models With External Knowledge Lin Yuanze, Li Yunsheng, Chen Dongdong, Xu Weijian, Clark Ronald, Torr Philip, Yuan Lu
- PANDA: Preference Adaptation For Enhancing Domain-specific Abilities Of Llms Liu An, Yang Zonghan, Zhang Zhenhe, Hu Qingyuan, Li Peng, Yan Ming, Zhang Ji, Huang Fei, Liu Yang
- Is Translation All You Need? A Study On Solving Multilingual Tasks With Large Language Models Liu Chaoqun, Zhang Wenxuan, Zhao Yiran, Luu Anh Tuan, Bing Lidong
- Few-shot Class Incremental Learning With Attention-aware Self-adaptive Prompt Liu Chenxi, Wang Zhenyi, Xiong Tianyi, Chen Ruibo, Wu Yihan, Guo Junfeng, Huang Heng
- Cool-fusion: Fuse Large Language Models Without Training Liu Cong, Quan Xiaojun, Pan Yan, Lin Liang, Wu Weigang, Chen Xu
- Pefomed: Parameter Efficient Fine-tuning Of Multimodal Large Language Models For Medical Imaging Liu Gang, He Jinlong, Li Pengfei, He Genrong, Chen Zhaolin, Zhong Shenjun
- On The Intrinsic Self-correction Capability Of Llms: Uncertainty And Latent Concept Liu Guangliang, Mao Haitao, Cao Bochuan, Xue Zhiyu, Johnson Kristen, Tang Jiliang, Wang Rongrong
- Logic Agent: Enhancing Validity With Logic Rule Invocation Liu Hanmeng, Teng Zhiyang, Zhang Chaoli, Zhang Yue
- \(se^2\): Sequential Example Selection For In-context Learning Liu Haoyu, Liu Jianfeng, Huang Shaohan, Zhan Yuefeng, Sun Hao, Deng Weiwei, Wei Furu, Zhang Qi
- Advancing Adversarial Suffix Transfer Learning On Aligned Large Language Models Liu Hongfu, Xie Yuxi, Wang Ye, Shieh Michael
- Bitdelta: Your Fine-tune May Only Be Worth One Bit Liu James, Xiao Guangxuan, Li Kai, Lee Jason D., Han Song, Dao Tri, Cai Tianle
- Andes: Defining And Enhancing Quality-of-experience In Llm-based Text Streaming Services Liu Jiachen, Wu Zhiyu, Chung Jae-won, Lai Fan, Lee Myungjin, Chowdhury Mosharaf
- Speculative Decoding Via Early-exiting For Faster LLM Inference With Thompson Sampling Control Mechanism Liu Jiahao, Wang Qifan, Wang Jingang, Cai Xunliang
- VP-LLM: Text-driven 3D Volume Completion With Large Language Models Through Patchification Liu Jianmeng, Liu Yichen, Zhang Yuyao, Meng Zeyuan, Tai Yu-wing, Tang Chi-keung
- Mm-instruct: Generated Visual Instructions For Large Multimodal Model Alignment Liu Jihao, Huang Xin, Zheng Jinliang, Liu Boxiao, Wang Jia, Yoshie Osamu, Liu Yu, Li Hongsheng
- Educating Llms Like Human Students: Structure-aware Injection Of Domain Knowledge Liu Kai, Chen Ze, Fu Zhihang, Jiang Rongxin, Zhou Fan, Chen Yaowu, Wu Yue, Ye Jieping
- LLM4GEN: Leveraging Semantic Representation Of Llms For Text-to-image Generation Liu Mushui, Ma Yuhang, Zhen Yang, Dan Jun, Yu Yunlong, Zhao Zeng, Hu Zhipeng, Liu Bai, Fan Changjie
- STORE: Streamlining Semantic Tokenization And Generative Recommendation With A Single LLM Liu Qijiong, Zhu Jieming, Fan Lu, Zhao Zhou, Wu Xiao-ming
- Proofread: Fixes All Errors With One Tap Liu Renjie, Zhang Yanxiang, Zhu Yun, Sun Haicheng, Zhang Yuanbo, Huang Michael Xuelin, Cai Shanqing, Meng Lei, Zhai Shumin
- Generative Expressive Conversational Speech Synthesis Liu Rui, Hu Yifan, Ren Yi, Yin Xiang, Li Haizhou
- Flashback: Efficient Retrieval-augmented Language Modeling For Long Context Inference Liu Runheng, Xiao Xingchen, Huang Heyan, Chi Zewen, Wu Zhijing
- Are Llms Effective Backbones For Fine-tuning? An Experimental Investigation Of Supervised Llms On Chinese Short Text Matching Liu Shulin, Xu Chengcheng, Liu Hao, Yu Tinghao, Yang Tao
- Bi-chainer: Automated Large Language Models Reasoning With Bidirectional Chaining Liu Shuqi, He Bowei, Song Linqi
- The Generation Gap: Exploring Age Bias In The Underlying Value Systems Of Large Language Models Liu Siyang, Maturi Trish, Yi Bowen, Shen Siqi, Mihalcea Rada
- Explicit Inductive Inference Using Large Language Models Liu Tianyang, Li Tianyi, Cheng Liang, Steedman Mark
- Memlong: Memory-augmented Retrieval For Long Text Modeling Liu Weijie, Tang Zecheng, Li Juntao, Chen Kehai, Zhang Min
- Toolace: Winning The Points Of LLM Function Calling Liu Weiwen, Huang Xu, Zeng Xingshan, Hao Xinlong, Yu Shuai, Li Dexun, Wang Shuai, Gan Weinan, Liu Zhengying, Yu Yuanqing, Wang Zezhong, Wang Yuxian, Ning Wu, Hou Yutai, Wang Bin, Wu Chuhan, Wang Xinzhi, Liu Yong, Wang Yasheng, Tang Duyu, Tu Dandan, Shang Lifeng, Jiang Xin, Tang Ruiming, Lian Defu, Liu Qun, Chen Enhong
- Extensive Self-contrast Enables Feedback-free Language Model Alignment Liu Xiao, Song Xixuan, Dong Yuxiao, Tang Jie
- Prompt Framework For Role-playing: Generation And Evaluation Liu Xun, Ni Zhengwei
- Era-cot: Improving Chain-of-thought Through Entity Relationship Analysis Liu Yanming, Peng Xinyue, Du Tianyu, Yin Jianwei, Liu Weihao, Zhang Xuhong
- RA-ISF: Learning To Answer And Understand From Retrieval Augmentation Via Iterative Self-feedback Liu Yanming, Peng Xinyue, Zhang Xuhong, Liu Weihao, Yin Jianwei, Cao Jiannan, Du Tianyu
- Arondight: Red Teaming Large Vision Language Models With Auto-generated Multi-modal Jailbreak Prompts Liu Yi, Cai Chengjun, Zhang Xiaoli, Yuan Xingliang, Wang Cong
- DELTA: Decomposed Efficient Long-term Robot Task Planning Using Large Language Models Liu Yuchen, Palmieri Luigi, Koch Sebastian, Georgievski Ilche, Aiello Marco
- Hd-eval: Aligning Large Language Model Evaluators Through Hierarchical Criteria Decomposition Liu Yuxuan, Yang Tianchi, Huang Shaohan, Zhang Zihan, Huang Haizhen, Wei Furu, Deng Weiwei, Sun Feng, Zhang Qi
- World Models With Hints Of Large Language Models For Goal Achieving Liu Zeyuan, Huan Ziyu, Wang Xiyao, Lyu Jiafei, Tao Jian, Li Xiu, Huang Furong, Xu Huazhe
- Synthvlm: High-efficiency And High-quality Synthetic Data For Vision Language Models Liu Zheng, Liang Hao, Huang Xijie, Xiong Wentao, Yu Qinhan, Sun Linzhuang, Chen Chong, He Conghui, Cui Bin, Zhang Wentao
- Personality-aware Student Simulation For Conversational Intelligent Tutoring Systems Liu Zhengyuan, Yin Stella Xin, Lin Geyu, Chen Nancy F.
- Protecting Your Llms With Information Bottleneck Liu Zichuan, Wang Zefan, Xu Linjie, Wang Jinyu, Song Lei, Wang Tianchun, Chen Chunlin, Cheng Wei, Bian Jiang
- Chatqa: Surpassing GPT-4 On Conversational QA And RAG Liu Zihan, Ping Wei, Roy Rajarshi, Xu Peng, Lee Chankyu, Shoeybi Mohammad, Catanzaro Bryan
- Generative Multi-modal Knowledge Retrieval With Large Language Models Long Xinwei, Zeng Jiali, Meng Fandong, Ma Zhiyuan, Zhang Kaiyan, Zhou Bowen, Zhou Jie
- Roleplay-doh: Enabling Domain-experts To Create Llm-simulated Patients Via Eliciting And Adhering To Principles Louie Ryan, Nandi Ananjan, Fang William, Chang Cheng, Brunskill Emma, Yang Diyi
- Answering Real-world Clinical Questions Using Large Language Model Based Systems Low Yen Sia, Jackson Michael L., Hyde Rebecca J., Brown Robert E., Sanghavi Neil M., Baldwin Julian D., Pike C. William, Muralidharan Jananee, Hui Gavin, Alexander Natasha, Hassan Hadeel, Nene Rahul V., Pike Morgan, Pokrzywa Courtney J., Vedak Shivam, Yan Adam Paul, Yao Dong-han, Zipursky Amy R., Dinh Christina, Ballentine Philip, Derieg Dan C., Polony Vladimir, Chawdry Rehan N., Davies Jordan, Hyde Brigham B., Shah Nigam H., Gombar Saurabh
- A Bounding Box Is Worth One Token: Interleaving Layout And Text In A Large Language Model For Document Understanding Lu Jinghui, Yu Haiyang, Wang Yanjie, Ye Yongjie, Tang Jingqun, Yang Ziwei, Wu Binghong, Liu Qi, Feng Hao, Wang Han, Liu Hao, Huang Can
- FIPO: Free-form Instruction-oriented Prompt Optimization With Preference Dataset And Modular Fine-tuning Schema Lu Junru, An Siyu, Zhang Min, He Yulan, Yin Di, Sun Xing
- Desta: Enhancing Speech Language Models Through Descriptive Speech-text Alignment Lu Ke-han, Chen Zhehuai, Fu Szu-wei, Huang He, Ginsburg Boris, Wang Yu-chiang Frank, Lee Hung-yi
- Multimodal Large Language Model Driven Scenario Testing For Autonomous Vehicles Lu Qiujing, Wang Xuanhan, Jiang Yiwei, Zhao Guangming, Ma Mingyue, Feng Shuo
- Insights Into LLM Long-context Failures: When Transformers Know But Don't Tell Lu Taiming, Gao Muhan, Yu Kuai, Byerly Adam, Khashabi Daniel
- Mental Modeling Of Reinforcement Learning Agents By Language Models Lu Wenhao, Zhao Xufeng, Spisak Josua, Lee Jae Hee, Wermter Stefan
- Generative Students: Using Llm-simulated Student Profiles To Support Question Item Evaluation Lu Xinyi, Wang Xu
- Can Perplexity Predict Fine-tuning Performance? An Investigation Of Tokenization Effects On Sequential Language Models For Nepali Luitel Nishant, Bekoju Nirajan, Sah Anand Kumar, Shakya Subarna
- Layoutllm: Layout Instruction Tuning With Large Language Models For Document Understanding Luo Chuwei, Shen Yufan, Zhu Zhaoqing, Zheng Qi, Yu Zhi, Yao Cong
- Crosstune: Black-box Few-shot Classification With Label Enhancement Luo Danqing, Zhang Chen, Zhang Yan, Li Haizhou
- Autom3l: An Automated Multimodal Machine Learning Framework With Large Language Models Luo Daqin, Feng Chengjian, Nong Yuxuan, Shen Yiqing
- Decoupled Alignment For Robust Plug-and-play Adaptation Luo Haozheng, Yu Jiahao, Zhang Wenxin, Li Jialong, Hu Jerry Yao-chieh, Xing Xinyu, Liu Han
- BGE Landmark Embedding: A Chunking-free Embedding Method For Retrieval Augmented Long-context Large Language Models Luo Kun, Liu Zheng, Xiao Shitao, Liu Kang
- Taking A Deep Breath: Enhancing Language Modeling Of Large Language Models With Sentinel Tokens Luo Weiyao, Zheng Suncong, Xia Heming, Wang Weikang, Lei Yan, Liu Tianyu, Chen Shuang, Sui Zhifang
- Duetsim: Building User Simulator With Dual Large Language Models For Task-oriented Dialogues Luo Xiang, Tang Zhiwen, Wang Jin, Zhang Xuejie
- Turning Trash Into Treasure: Accelerating Inference Of Large Language Models With Token Recycling Luo Xianzhen, Wang Yixuan, Zhu Qingfu, Zhang Zhiming, Zhang Xuanyu, Yang Qing, Xu Dongliang, Che Wanxiang
- Knowla: Enhancing Parameter-efficient Finetuning With Knowledgeable Adaptation Luo Xindi, Sun Zequn, Zhao Jing, Zhao Zhe, Hu Wei
- TRAWL: Tensor Reduced And Approximated Weights For Large Language Models Luo Yiran, Patel Het, Fu Yu, Ahn Dawon, Chen Jia, Dong Yue, Papalexakis Evangelos E.
- Source2synth: Synthetic Data Generation And Curation Grounded In Real Data Sources Lupidi Alisia, Gemmell Carlos, Cancedda Nicola, Dwivedi-yu Jane, Weston Jason, Foerster Jakob, Raileanu Roberta, Lomeli Maria
- Interpreting Key Mechanisms Of Factual Recall In Transformer-based Language Models Lv Ang, Chen Yuhan, Zhang Kaiyi, Wang Yulong, Liu Lifeng, Wen Ji-rong, Xie Jian, Yan Rui
- GPT-4 Generated Narratives Of Life Events Using A Structured Narrative Prompt: A Validation Study Lynch Christopher J., Jensen Erik, Munro Madison H., Zamponi Virginia, Martinez Joseph, O'brien Kevin, Feldhaus Brandon, Smith Katherine, Reinhold Ann Marie, Gore Ross
- Knowtuning: Knowledge-aware Fine-tuning For Large Language Models Lyu Yougang, Yan Lingyong, Wang Shuaiqiang, Shi Haibo, Yin Dawei, Ren Pengjie, Chen Zhumin, De Rijke Maarten, Ren Zhaochun
- Retrieve-plan-generation: An Iterative Planning And Answering Framework For Knowledge-intensive LLM Generation Lyu Yuanjie, Niu Zihan, Xie Zheyong, Zhang Chao, Xu Tong, Wang Yang, Chen Enhong
- First Activations Matter: Training-free Methods For Dynamic Activation In Large Language Models Ma Chi, Huang Mincong, Zhang Ying, Wang Chao, Wang Yujie, Yu Lei, Liu Chuan, Lin Wei
- Exploring The Role Of Transliteration In In-context Learning For Low-resource Languages Written In Non-latin Scripts Ma Chunlan, Liu Yihong, Ye Haotian, Schütze Hinrich
- Sparsity-accelerated Training For Large Language Models Ma Da, Chen Lu, Wang Pengyu, Xu Hongshen, Li Hanqi, Sun Liangtai, Zhu Su, Fan Shuai, Yu Kai
- EE-MLLM: A Data-efficient And Compute-efficient Multimodal Large Language Model Ma Feipeng, Zhou Yizhou, Li Hebei, He Zilong, Wu Siying, Rao Fengyun, Zhang Yueyi, Sun Xiaoyan
- Integrating AI Tutors In A Programming Course Ma Iris, Martins Alberto Krone, Lopes Cristina Videira
- Debate On Graph: A Flexible And Reliable Reasoning Framework For Large Language Models Ma Jie, Gao Zhitao, Chai Qi, Sun Wangchun, Wang Pinghui, Pei Hongbin, Tao Jing, Song Lingyun, Liu Jun, Zhang Chen, Cui Lizhen
- FBI-LLM: Scaling Up Fully Binarized Llms From Scratch Via Autoregressive Distillation Ma Liqun, Sun Mingjie, Shen Zhiqiang
- Eventlens: Leveraging Event-aware Pretraining And Cross-modal Linking Enhances Visual Commonsense Reasoning Ma Mingjie, Yu Zhihuan, Ma Yichao, Li Guohui
- Think-on-graph 2.0: Deep And Interpretable Large Language Model Reasoning With Knowledge Graph-guided Retrieval Ma Shengjie, Xu Chengjin, Jiang Xuhui, Li Muzhi, Qu Huaren, Guo Jian
- Megalodon: Efficient LLM Pretraining And Inference With Unlimited Context Length Ma Xuezhe, Yang Xiaomeng, Xiong Wenhan, Chen Beidi, Yu Lili, Zhang Hao, May Jonathan, Zettlemoyer Luke, Levy Omer, Zhou Chunting
- Mmlongbench-doc: Benchmarking Long-context Document Understanding With Visualizations Ma Yubo, Zang Yuhang, Chen Liangyu, Chen Meiqi, Jiao Yizhu, Li Xinze, Lu Xinyuan, Liu Ziyu, Ma Yan, Dong Xiaoyi, Zhang Pan, Pan Liangming, Jiang Yu-gang, Wang Jiaqi, Cao Yixin, Sun Aixin
- Compositional API Recommendation For Library-oriented Code Generation Ma Zexiong, An Shengnan, Xie Bing, Lin Zeqi
- Llmparser: An Exploratory Study On Using Large Language Models For Log Parsing Ma Zeyang, Chen An Ran, Kim Dong Jae, Chen Tse-hsun, Wang Shaowei
- Exploring The Impact Of The Output Format On The Evaluation Of Large Language Models For Code Translation Macedo Marcos, Tian Yuan, Cogo Filipe R., Adams Bram
- Explaining Genetic Programming Trees Using Large Language Models Maddigan Paula, Lensen Andrew, Xue Bing
- Hallucination-free? Assessing The Reliability Of Leading AI Legal Research Tools Magesh Varun, Surani Faiz, Dahl Matthew, Suzgun Mirac, Manning Christopher D., Ho Daniel E.
- Evaluating Very Long-term Conversational Memory Of LLM Agents Maharana Adyasha, Lee Dong-ho, Tulyakov Sergey, Bansal Mohit, Barbieri Francesco, Fang Yuwei
- Leveraging Environment Interaction For Automated PDDL Generation And Planning With Large Language Models Mahdavi Sadegh, Aoki Raquel, Tang Keyi, Cao Yanshuai
- VURF: A General-purpose Reasoning And Self-refinement Framework For Video Understanding Mahmood Ahmad, Vayani Ashmal, Naseer Muzammal, Khan Salman, Khan Fahad Shahbaz
- Rephrasing The Web: A Recipe For Compute And Data-efficient Language Modeling Maini Pratyush, Seto Skyler, Bai He, Grangier David, Zhang Yizhe, Jaitly Navdeep
- FRACTAL: Fine-grained Scoring From Aggregate Text Labels Makhija Yukti, Agrawal Priyanka, Saket Rishi, Raghuveer Aravindan
- ULLME: A Unified Framework For Large Language Model Embeddings With Generation-augmented Learning Man Hieu, Ngo Nghia Trung, Dernoncourt Franck, Nguyen Thien Huu
- IITK At Semeval-2024 Task 2: Exploring The Capabilities Of Llms For Safe Biomedical Natural Language Inference For Clinical Trials Mandal Shreyasi, Modi Ashutosh
- Coalitions Of Large Language Models Increase The Robustness Of AI Agents Mangal Prattyush, Mak Carol, Kanakis Theo, Donovan Timothy, Braines Dave, Pyzer-knapp Edward
- Making Task-oriented Dialogue Datasets More Natural By Synthetically Generating Indirect User Requests Mannekote Amogh, Nam Jinseok, Li Ziming, Gao Jian, Boyer Kristy Elizabeth, Dorr Bonnie J.
- Can Large Language Models Automatically Score Proficiency Of Written Essays? Mansour Watheq, Albatarni Salam, Eltanbouly Sohaila, Elsayed Tamer
- A Data Generation Perspective To The Mechanism Of In-context Learning Mao Haitao, Liu Guangliang, Ma Yao, Wang Rongrong, Johnson Kristen, Tang Jiliang
- Reinforced Prompt Personalization For Recommendation With Large Language Models Mao Wenyu, Wu Jiancan, Chen Weijian, Gao Chongming, Wang Xiang, He Xiangnan
- How Does Quantization Affect Multilingual Llms? Marchisio Kelly, Dash Saurabh, Chen Hongyu, Aumiller Dennis, Üstün Ahmet, Hooker Sara, Ruder Sebastian
- The Impact Of Hyperparameters On Large Language Model Inference Performance: An Evaluation Of Vllm And Huggingface Pipelines Martinez Matias
- "Vorbești Românește?" A Recipe To Train Powerful Romanian Llms With English Instructions Masala Mihai, Ilie-ablachim Denis C., Dima Alexandru, Corlatescu Dragos, Zavelca Miruna, Olaru Ovio, Terian Simina, Terian Andrei, Leordeanu Marius, Velicu Horia, Popescu Marius, Dascalu Mihai, Rebedea Traian
- Datastates-llm: Lazy Asynchronous Checkpointing For Large Language Models Maurya Avinash, Underwood Robert, Rafique M. Mustafa, Cappello Franck, Nicolae Bogdan
- Pack Of Llms: Model Fusion At Test-time Via Perplexity Optimization Mavromatis Costas, Karypis Petros, Karypis George
- Improving Text-to-image Consistency Via Automatic Prompt Optimization Mañas Oscar, Astolfi Pietro, Hall Melissa, Ross Candace, Urbanek Jack, Williams Adina, Agrawal Aishwarya, Romero-soriano Adriana, Drozdzal Michal
- Leveraging Prompts In Llms To Overcome Imbalances In Complex Educational Text Data Mcclure Jeanne, Shimmei Machi, Matsuda Noboru, Jiang Shiyan
- Generative Artificial Intelligence In Higher Education: Evidence From An Analysis Of Institutional Policies And Guidelines Mcdonald Nora, Johri Aditya, Ali Areej, Hingle Aayushi
- Injecting New Knowledge Into Large Language Models Via Supervised Fine-tuning Mecklenburg Nick, Lin Yiyou, Li Xiaoxiao, Holstein Daniel, Nunes Leonardo, Malvar Sara, Silva Bruno, Chandra Ranveer, Aski Vijay, Yannam Pavan Kumar Reddy, Aktas Tolga, Hendry Todd
- Large Language Models For Relevance Judgment In Product Search Mehrdad Navid, Mohapatra Hrushikesh, Bagdouri Mossaab, Chandran Prijith, Magnani Alessandro, Cai Xunfan, Puthenputhussery Ajit, Yadav Sachin, Lee Tony, Zhai Chengxiang, Liao Ciya
- Enhancing Creativity In Large Language Models Through Associative Thinking Strategies Mehrotra Pronita, Parab Aishni, Gulwani Sumit
- Realhf: Optimized RLHF Training For Large Language Models Through Parameter Reallocation Mei Zhiyu, Fu Wei, Li Kaiwei, Wang Guangju, Zhang Huanchen, Wu Yi
- Soda-eval: Open-domain Dialogue Evaluation In The Age Of Llms Mendonça John, Trancoso Isabel, Lavie Alon
- Ecoh: Turn-level Coherence Evaluation For Multilingual Dialogues Mendonça John, Trancoso Isabel, Lavie Alon
- Deepstack: Deeply Stacking Visual Tokens Is Surprisingly Simple And Effective For Lmms Meng Lingchen, Yang Jianwei, Tian Rui, Dai Xiyang, Wu Zuxuan, Gao Jianfeng, Jiang Yu-gang
- DCR: Divide-and-conquer Reasoning For Multi-choice Question Answering With Llms Meng Zijie, Zhang Yan, Feng Zhaopeng, Liu Zuozhu
- Linearizing Large Language Models Mercat Jean, Vasiljevic Igor, Keh Sedrick, Arora Kushal, Dave Achal, Gaidon Adrien, Kollar Thomas
- Transforming Wearable Data Into Health Insights Using Large Language Model Agents Merrill Mike A., Paruchuri Akshay, Rezaei Naghmeh, Kovacs Geza, Perez Javier, Liu Yun, Schenck Erik, Hammerquist Nova, Sunshine Jake, Tailor Shyam, Ayush Kumar, Su Hao-wei, He Qian, Mclean Cory Y., Malhotra Mark, Patel Shwetak, Zhan Jiening, Althoff Tim, Mcduff Daniel, Liu Xin
- Superposition Prompting: Improving And Accelerating Retrieval-augmented Generation Merth Thomas, Fu Qichen, Rastegari Mohammad, Najibi Mahyar
- Low-resource Machine Translation Through Retrieval-augmented LLM Prompting: A Study On The Mambai Language Merx Raphaël, Mahmudi Aso, Langford Katrina, De Araujo Leo Alberto, Vylomova Ekaterina
- A Comparison Of LLM Finetuning Methods & Evaluation Metrics With Travel Chatbot Use Case Meyer Sonia, Singh Shreya, Tam Bertha, Ton Christopher, Ren Angel
- Permitqa: A Benchmark For Retrieval Augmented Generation In Wind Siting And Permitting Domain Meyur Rounak, Phan Hung, Wagle Sridevi, Strube Jan, Halappanavar Mahantesh, Horawalavithana Sameera, Acharya Anurag, Munikoti Sai
- User Centric Evaluation Of Code Generation Tools Miah Tanha, Zhu Hong
- Are Large Language Models Superhuman Chemists? Mirza Adrian, Alampara Nawaf, Kunchapu Sreekanth, Emoekabu Benedict, Krishnan Aswanth, Wilhelmi Mara, Okereke Macjonathan, Eberhardt Juliane, Elahi Amir Mohammad, Greiner Maximilian, Holick Caroline T., Gupta Tanya, Asgari Mehrdad, Glaubitz Christina, Klepsch Lea C., Köster Yannik, Meyer Jakob, Miret Santiago, Hoffmann Tim, Kreth Fabian Alexander, Ringleb Michael, Roesner Nicole, Schubert Ulrich S., Stafast Leanne M., Wonanke Dinga, Pieler Michael, Schwaller Philippe, Jablonka Kevin Maik
- Fine-grained Hallucination Detection And Editing For Language Models Mishra Abhika, Asai Akari, Balachandran Vidhisha, Wang Yizhong, Neubig Graham, Tsvetkov Yulia, Hajishirzi Hannaneh
- CBF-LLM: Safe Control For LLM Alignment Miyaoka Yuya, Inoue Masaki
- CHIQ: Contextual History Enhancement For Improving Query Rewriting In Conversational Search Mo Fengran, Ghaddar Abbas, Mao Kelong, Rezagholizadeh Mehdi, Chen Boxing, Liu Qun, Nie Jian-yun
- Aligning Query Representation With Rewritten Query And Relevance Judgments In Conversational Search Mo Fengran, Qu Chen, Mao Kelong, Wu Yihong, Su Zhan, Huang Kaiyu, Nie Jian-yun
- Memllm: Finetuning Llms To Use An Explicit Read-write Memory Modarressi Ali, Köksal Abdullatif, Imani Ayyoob, Fayyaz Mohsen, Schütze Hinrich
- RDBE: Reasoning Distillation-based Evaluation Enhances Automatic Essay Scoring Mohammadkhani Ali Ghiasvand
- Routoo: Learning To Route To Large Language Models Effectively Mohammadshahi Alireza, Shaikh Arshad Rafiq, Yazdani Majid
- Can We Trust Large Language Models Generated Code? A Framework For In-context Learning, Security Patterns, And Code Evaluations Across Diverse Llms Mohsin Ahmad, Janicke Helge, Wood Adrian, Sarker Iqbal H., Maglaras Leandros, Janjua Naeem
- Llm-based Frameworks For API Argument Filling In Task-oriented Conversational Systems Mok Jisoo, Kachuee Mohammad, Dai Shuyang, Ray Shayan, Taghavi Tara, Yoon Sungroh
- Kam-cot: Knowledge Augmented Multimodal Chain-of-thoughts Reasoning Mondal Debjyoti, Modi Suraj, Panda Subhadarshi, Singh Rituraj, Rao Godawari Sudhakar
- Towards Generating Informative Textual Description For Neurons In Language Models Mondal Shrayani, Garodia Rishabh, Qureshi Arbaaz, Lee Taesung, Park Youngja
- Repliqa: A Question-answering Dataset For Benchmarking Llms On Unseen Reference Content Monteiro Joao, Noel Pierre-andre, Marcotte Etienne, Rajeswar Sai, Zantedeschi Valentina, Vazquez David, Chapados Nicolas, Pal Christopher, Taslakian Perouz
- Xc-cache: Cross-attending To Cached Context For Efficient LLM Inference Monteiro João, Marcotte Étienne, Noël Pierre-andré, Zantedeschi Valentina, Vázquez David, Chapados Nicolas, Pal Christopher, Taslakian Perouz
- A Logical Fallacy-informed Framework For Argument Generation Mouchel Luca, Paul Debjit, Cui Shaobo, West Robert, Bosselut Antoine, Faltings Boi
- Large Language Models Are Parallel Multilingual Learners Mu Yongyu, Feng Peinan, Cao Zhiquan, Wu Yuzhang, Li Bei, Wang Chenglong, Xiao Tong, Song Kai, Liu Tongran, Zhang Chunliang, Zhu Jingbo
- Generative Representational Instruction Tuning Muennighoff Niklas, Su Hongjin, Wang Liang, Yang Nan, Wei Furu, Yu Tao, Singh Amanpreet, Kiela Douwe
- Lhrs-bot: Empowering Remote Sensing With Vgi-enhanced Large Multimodal Language Model Muhtar Dilxat, Li Zhenshi, Gu Feng, Zhang Xueliang, Xiao Pengfeng
- Are Large Language Models Actually Good At Text Style Transfer? Mukherjee Sourabrata, Ojha Atul Kr., Dušek Ondřej
- Towards Robots That Know When They Need Help: Affordance-based Uncertainty For Large Language Model Planners Mullen James F. Jr., Manocha Dinesh
- Grouse: A Benchmark To Evaluate Evaluators In Grounded Question Answering Muller Sacha, Loison António, Omrani Bilel, Viaud Gautier
- Chatgpt In Classrooms: Transforming Challenges Into Opportunities In Education Munawar Harris Bin, Misirlis Nikolaos
- Learning Vs Retrieval: The Role Of In-context Examples In Regression With Llms Nafar Aliakbar, Venable Kristen Brent, Kordjamshidi Parisa
- Cdquant: Accurate Post-training Weight Quantization Of Large Pre-trained Models Using Greedy Coordinate Descent Nair Pranav Ajit, Suggala Arun Sai
- Apprentices To Research Assistants: Advancing Research With Large Language Models Namvarpour M., Razi A.
- Review-feedback-reason (refer): A Novel Framework For NLG Evaluation And Reasoning Narsupalli Yaswanth, Chandra Abhranil, Muppirala Sreevatsa, Gupta Manish, Goyal Pawan
- Word2world: Generating Stories And Worlds Through Large Language Models Nasir Muhammad U., James Steven, Togelius Julian
- Learning Goal-conditioned Representations For Language Reward Models Nath Vaskar, Slack Dylan, Da Jeff, Ma Yuntao, Zhang Hugh, Whitehead Spencer, Hendryx Sean
- Concise Thoughts: Impact Of Output Length On LLM Reasoning And Cost Nayab Sania, Rossolini Giulio, Buttazzo Giorgio, Manes Nicolamaria, Giacomelli Fabrizio
- Improving Robustness Of Llm-based Speech Synthesis By Learning Monotonic Alignment Neekhara Paarth, Hussain Shehzeen, Ghosh Subhankar, Li Jason, Valle Rafael, Badlani Rohan, Ginsburg Boris
- Contextual AI Journaling: Integrating LLM And Time Series Behavioral Sensing Technology To Promote Self-reflection And Well-being Using The Mindscape App Nepal Subigya, Pillai Arvind, Campbell William, Massachi Talie, Choi Eunsol Soul, Xu Orson, Kuc Joanna, Huckins Jeremy, Holden Jason, Depp Colin, Jacobson Nicholas, Czerwinski Mary, Granholm Eric, Campbell Andrew T.
- MBBQ: A Dataset For Cross-lingual Comparison Of Stereotypes In Generative Llms Neplenbroek Vera, Bisazza Arianna, Fernández Raquel
- From Questions To Insightful Answers: Building An Informed Chatbot For University Resources Neupane Subash, Hossain Elias, Keith Jason, Tripathi Himanshu, Ghiasi Farbod, Golilarz Noorbakhsh Amiri, Amirlatifi Amin, Mittal Sudip, Rahimi Shahram
- Medinsight: A Multi-source Context Augmentation Framework For Generating Patient-centric Medical Responses Using Large Language Models Neupane Subash, Mitra Shaswata, Mittal Sudip, Golilarz Noorbakhsh Amiri, Rahimi Shahram, Amirlatifi Amin
- How Well Can Llms Echo Us? Evaluating AI Chatbots' Role-play Ability With ECHO Ng Man Tik, Tse Hui Tung, Huang Jen-tse, Li Jingjing, Wang Wenxuan, Lyu Michael R.
- Balancing Exploration And Exploitation In LLM Using Soft RLLF For Enhanced Negation Understanding Nguyen Ha-thanh, Satoh Ken
- Advancing Vietnamese Visual Question Answering With Transformer And Convolutional Integration Nguyen Ngoc Son, Nguyen Van Son, Le Tung
- Text-to-drive: Diverse Driving Behavior Synthesis Via Large Language Models Nguyen Phat, Wang Tsun-hsuan, Hong Zhang-wei, Karaman Sertac, Rus Daniela
- An Empirical Study On Capability Of Large Language Models In Understanding Code Semantics Nguyen Thu-trang, Vu Thanh Trong, Vo Hieu Dinh, Nguyen Son
- Ceval: A Benchmark For Evaluating Counterfactual Text Generation Nguyen Van Bach, Schlötterer Jörg, Seifert Christin
- Xl²bench: A Benchmark For Extremely Long Context Understanding With Long-range Dependencies Ni Xuanfan, Cai Hengyi, Wei Xiaochi, Wang Shuaiqiang, Yin Dawei, Li Piji
- W-RAG: Weakly Supervised Dense Retrieval In RAG For Open-domain Question Answering Nian Jinming, Peng Zhiyuan, Wang Qifan, Fang Yi
- Panza: A Personalized Text Writing Assistant Via Data Playback And Local Fine-tuning Nicolicioiu Armand, Iofinova Eugenia, Kurtic Eldar, Nikdan Mahdi, Panferov Andrei, Markov Ilia, Shavit Nir, Alistarh Dan
- The GPT Surprise: Offering Large Language Model Chat In A Massive Coding Class Reduced Engagement But Increased Adopters Exam Performances Nie Allen, Chandak Yash, Suzara Miroslav, Ali Malika, Woodrow Juliette, Peng Matt, Sahami Mehran, Brunskill Emma, Piech Chris
- Decomposed Prompting: Unveiling Multilingual Linguistic Structure Knowledge In English-centric Large Language Models Nie Ercong, Yuan Shuzhou, Ma Bolei, Schmid Helmut, Färber Michael, Kreuter Frauke, Schütze Hinrich
- Llm-based Conversational AI Therapist For Daily Functioning Screening And Psychotherapeutic Intervention Via Everyday Smart Devices Nie Jingping, Shao Hanya, Fan Yuang, Shao Qijia, You Haoxuan, Preindl Matthias, Jiang Xiaofan
- Cfinbench: A Comprehensive Chinese Financial Benchmark For Large Language Models Nie Ying, Yan Binwei, Guo Tianyu, Liu Hao, Wang Haoyu, He Wei, Zheng Binfan, Wang Weihao, Li Qiang, Sun Weijian, Wang Yunhe, Tao Dacheng
- WTU-EVAL: A Whether-or-not Tool Usage Evaluation Benchmark For Large Language Models Ning Kangyun, Su Yisong, Lv Xueqiang, Zhang Yuanzhe, Liu Jian, Liu Kang, Xu Jinan
- User-llm: Efficient LLM Contextualization With User Embeddings Ning Lin, Liu Luyang, Wu Jiaxing, Wu Neo, Berlowitz Devora, Prakash Sushant, Green Bradley, O'banion Shawn, Xie Jun
- Can Llms Learn By Teaching? A Preliminary Study Ning Xuefei, Wang Zifu, Li Shiyao, Lin Zinan, Yao Peiran, Fu Tianyu, Blaschko Matthew B., Dai Guohao, Yang Huazhong, Wang Yu
- How Reliable AI Chatbots Are For Disease Prediction From Patient Complaints? Nipu Ayesha Siddika, Islam K M Sajjadul, Madiraju Praveen
- Quantified Task Misalignment To Inform PEFT: An Exploration Of Domain Generalization And Catastrophic Forgetting In CLIP Niss Laura, Vogt-lowell Kevin, Tsiligkaridis Theodoros
- Mitigating Hallucinations In Large Language Models Via Self-refinement-enhanced Knowledge Retrieval Niu Mengjia, Li Hao, Shi Jie, Haddadi Hamed, Mo Fan
- Language Portability Strategies For Open-domain Dialogue With Pre-trained Language Models From High To Low Resource Languages Njifenjou Ahmed, Sucal Virgile, Jabaian Bassam, Lefèvre Fabrice
- Skill Set Optimization: Reinforcing Language Model Behavior Via Transferable Skills Nottingham Kolby, Majumder Bodhisattwa Prasad, Mishra Bhavana Dalvi, Singh Sameer, Clark Peter, Fox Roy
- The Conversation Is The Command: Interacting With Real-world Autonomous Robot Through Natural Language Nwankwo Linus, Rueckert Elmar
- Improving Black-box Robustness With In-context Rewriting O'brien Kyle, Ng Nathan, Puri Isha, Mendez Jorge, Palangi Hamid, Kim Yoon, Ghassemi Marzyeh, Hartvigsen Thomas
- Crafting Efficient Fine-tuning Strategies For Large Language Models Oliver Michael, Wang Guan
- Routellm: Learning To Route Llms With Preference Data Ong Isaac, Almahairi Amjad, Wu Vincent, Chiang Wei-lin, Wu Tianhao, Gonzalez Joseph E., Kadous M Waleed, Stoica Ion
- Jamba: A Hybrid Transformer-mamba Language Model Lieber Opher, Lenz Barak, Bata Hofit, Cohen Gal, Osin Jhonathan, Dalmedigos Itay, Safahi Erez, Meirom Shaked, Belinkov Yonatan, Shalev-shwartz Shai, Abend Omri, Alon Raz, Asida Tomer, Bergman Amir, Glozman Roman, Gokhman Michael, Manevich Avashalom, Ratner Nir, Rozen Noam, Shwartz Erez, Zusman Mor, Shoham Yoav
- Lossless Acceleration Of Large Language Model Via Adaptive N-gram Parallel Decoding Ou Jie, Chen Yueming, Tian Wenhong
- Multilingual Prompts In Llm-based Recommenders: Performance Across Languages Ozsoy Makbule Gulcin
- Not All Contexts Are Equal: Teaching Llms Credibility-aware Generation Pan Ruotong, Cao Boxi, Lin Hongyu, Han Xianpei, Zheng Jia, Wang Sirui, Cai Xunliang, Sun Le
- The Solution For The AIGC Inference Performance Optimization Competition Pan Sishun, Xu Haonan, Wan Zhonghua, Yang Yang
- Instinfer: In-storage Attention Offloading For Cost-effective Long-context LLM Inference Pan Xiurui, Li Endian, Li Qiao, Liang Shengwen, Shan Yizhou, Zhou Ke, Luo Yingwei, Wang Xiaolin, Zhang Jie
- Enhancing Repository-level Code Generation With Integrated Contextual Information Pan Zhiyuan, Hu Xing, Xia Xin, Yang Xiaohu
- Llmlingua-2: Data Distillation For Efficient And Faithful Task-agnostic Prompt Compression Pan Zhuoshi, Wu Qianhui, Jiang Huiqiang, Xia Menglin, Luo Xufang, Zhang Jue, Lin Qingwei, Rühle Victor, Yang Yuqing, Lin Chin-yew, Zhao H. Vicky, Qiu Lili, Zhang Dongmei
- AILS-NTUA At Semeval-2024 Task 9: Cracking Brain Teasers: Transformer Models For Lateral Thinking Puzzles Panagiotopoulos Ioannis, Filandrianos Giorgos, Lymperaiou Maria, Stamou Giorgos
- Anchor-based Large Language Models Pang Jianhui, Ye Fanghua, Wong Derek Fai, He Xin, Chen Wanshun, Wang Longyue
- Empowering Language Models With Active Inquiry For Deeper Understanding Pang Jing-cheng, Fan Heng-bo, Wang Pengyuan, Xiao Jia-hao, Tang Nan, Yang Si-hang, Jia Chengxing, Huang Sheng-jun, Yu Yang
- Knowledgeable Agents By Offline Reinforcement Learning From Large Language Model Rollouts Pang Jing-cheng, Yang Si-hang, Li Kaiyuan, Zhang Jiaji, Chen Xiong-hui, Tang Nan, Yu Yang
- Pragmatic Competence Evaluation Of Large Language Models For Korean Park Dojun, Lee Jiwoo, Jeong Hyeyun, Park Seohyun, Lee Sungeun
- Multiprageval: Multilingual Pragmatic Evaluation Of Large Language Models Park Dojun, Lee Jiwoo, Park Seohyun, Jeong Hyeyun, Koo Youngeun, Hwang Soonha, Park Seonwoo, Lee Sungeun
- Llms For Enhanced Agricultural Meteorological Recommendations Park Ji-jun, Choi Soo-joon
- Offsetbias: Leveraging Debiased Data For Tuning Evaluators Park Junsoo, Jwa Seungyeon, Ren Meiying, Kim Daeyoung, Choi Sanghyuk
- Token-picker: Accelerating Attention In Text Generation With Minimized Memory Transfer Via Probability Estimation Park Junyoung, Kang Myeonggu, Han Yunki, Kim Yanggon, Shin Jaekang, Kim Lee-sup
- Textboost: Towards One-shot Personalization Of Text-to-image Models Via Fine-tuning Text Encoder Park Nahyeon, Kim Kunhee, Shim Hyunjung
- Any-precision LLM: Low-cost Deployment Of Multiple, Different-sized Llms Park Yeonhong, Hyun Jake, Cho Sanglyul, Sim Bonggeun, Lee Jae W.
- The Ultimate Guide To Fine-tuning Llms From Basics To Breakthroughs: An Exhaustive Review Of Technologies, Research, Best Practices, Applied Research Challenges And Opportunities Parthasarathy Venkatesh Balavadhani, Zafar Ahtsham, Khan Aafaq, Shahid Arsalan
- Evidence To Generate (E2G): A Single-agent Two-step Prompting For Context Grounded And Retrieval Augmented Reasoning Parvez Md Rizwan
- Automating Code Adaptation For Mlops -- A Benchmarking Study On Llms Patel Harsh, Ramanan Buvaneswari A., Khan Manzoor A., Williams Thomas, Friedman Brian, Drabeck Lawrence
- Multi-logieval: Towards Evaluating Multi-step Logical Reasoning Ability Of Large Language Models Patel Nisarg, Kulkarni Mohith, Parmar Mihir, Budhiraja Aashna, Nakamura Mutsumi, Varshney Neeraj, Baral Chitta
- The Effectiveness Of Llms As Annotators: A Comparative Overview And Empirical Analysis Of Direct Representation Pavlovic Maja, Poesio Massimo
- Humaneval-xl: A Multilingual Code Generation Benchmark For Cross-lingual Natural Language Generalization Peng Qiwei, Chai Yekun, Li Xuhong
- Check-eval: A Checklist-based Approach For Evaluating Text Quality Pereira Jayr, Assumpcao Andre, Lotufo Roberto
- Did Translation Models Get More Robust Without Anyone Even Noticing? Peters Ben, Martins André F. T.
- Optimizing Large Language Models For Openapi Code Completion Petryshyn Bohdan, Lukoševičius Mantas
- Towards Reliable Medical Question Answering: Techniques And Challenges In Mitigating Hallucinations In Language Models Pham Duy Khoa, Vo Bao Quoc
- Graph Guided Question Answer Generation For Procedural Question-answering Pham Hai X., Hadji Isma, Xu Xinnuo, Degutyte Ziedune, Rainey Jay, Kazakos Evangelos, Fazly Afsaneh, Tzimiropoulos Georgios, Martinez Brais
- Can Base Chatgpt Be Used For Forecasting Without Additional Optimization? Pham Van, Cunningham Scott
- Peering Into The Mind Of Language Models: An Approach For Attribution In Contextual Question Answering Phukan Anirudh, Somasundaram Shwetha, Saxena Apoorv, Goswami Koustava, Srinivasan Balaji Vasan
- Llms Can Learn Self-restraint Through Iterative Self-reflection Piché Alexandre, Milios Aristides, Bahdanau Dzmitry, Pal Chris
- Better RAG Using Relevant Information Gain Pickett Marc, Hartman Jeremy, Bhowmick Ayan Kumar, Alam Raquib-ul, Vempaty Aditya
- Bimedix: Bilingual Medical Mixture Of Experts LLM Pieri Sara, Mullappilly Sahal Shaji, Khan Fahad Shahbaz, Anwer Rao Muhammad, Khan Salman, Baldwin Timothy, Cholakkal Hisham
- Softqe: Learned Representations Of Queries Expanded By Llms Pimpalkhute Varad, Heyer John, Yin Xusen, Gupta Sameer
- Doccgen: Document-based Controlled Code Generation Pimparkhede Sameer, Kammakomati Mehant, Tamilselvam Srikanth, Kumar Prince, Kumar Ashok Pon, Bhattacharyya Pushpak
- How Effective Are State Space Models For Machine Translation? Pitorro Hugo, Vasylenko Pavlo, Treviso Marcos, Martins André F. T.
- Reasoning With Large Language Models, A Survey Plaat Aske, Wong Annie, Verberne Suzan, Broekens Joost, Van Stein Niki, Back Thomas
- Softmax Probabilities (mostly) Predict Large Language Model Correctness On Multiple-choice Q&A Plaut Benjamin, Nguyen Khanh, Trinh Tu
- Sbora: Low-rank Adaptation With Regional Weight Updates Po Lai-man, Liu Yuyang, Wu Haoxuan, Zhang Tianqi, Yu Wing-yin, Jiang Zeyu, Li Kun
- Extracting Paragraphs From LLM Token Activations Pochinkov Nicholas, Benoit Angelo, Agarwal Lovkush, Majid Zainab Ali, Ter-minassian Lucile
- V-RECS, A Low-cost LLM4VIS Recommender With Explanations, Captioning And Suggestions Podo Luca, Angelini Marco, Velardi Paola
- Advanced Natural-based Interaction For The Italian Language: Llamantino-3-anita Polignano Marco, Basile Pierpaolo, Semeraro Giovanni
- Evaluation Of Language Models In The Medical Context Under Resource-constrained Settings Posada Andrea, Rueckert Daniel, Meissen Felix, Müller Philip
- Documint: Docstring Generation For Python Using Small Language Models Poudel Bibek, Cook Adam, Traore Sekou, Ameli Shelah
- Retrieval Augmented Thought Process For Private Data Handling In Healthcare Pouplin Thomas, Sun Hao, Holt Samuel, Van Der Schaar Mihaela
- Large Language Models Meet User Interfaces: The Case Of Provisioning Feedback Pozdniakov Stanislav, Brazil Jonathan, Abdi Solmaz, Bakharia Aneesha, Sadiq Shazia, Gasevic Dragan, Denny Paul, Khosravi Hassan
- Deciphering The Factors Influencing The Efficacy Of Chain-of-thought: Probability, Memorization, And Noisy Reasoning Prabhakar Akshara, Griffiths Thomas L., Mccoy R. Thomas
- When Quantization Affects Confidence Of Large Language Models? Proskurina Irina, Brun Luc, Metzler Guillaume, Velcin Julien
- Customized Retrieval Augmented Generation And Benchmarking For EDA Tool Documentation QA Pu Yuan, He Zhuolun, Qiu Tairu, Wu Haoyuan, Yu Bei
- Practical Token Pruning For Foundation Models In Few-shot Conversational Virtual Assistant Systems Qi Haode, Qian Cheng, Ni Jian, Singh Pratyush, Fazeli Reza, Wang Gengyu, Shu Zhongzheng, Wayne Eric, Bross Juergen
- Model Internals-based Answer Attribution For Trustworthy Retrieval-augmented Generation Qi Jirui, Sarti Gabriele, Fernández Raquel, Bisazza Arianna
- SNIFFER: Multimodal Large Language Model For Explainable Out-of-context Misinformation Detection Qi Peng, Yan Zehong, Hsu Wynne, Lee Mong Li
- Follow My Instruction And Spill The Beans: Scalable Data Extraction From Retrieval-augmented Generation Systems Qi Zhenting, Zhang Hanlin, Xing Eric, Kakade Sham, Lakkaraju Himabindu
- Data Augmentation Integrating Dialogue Flow And Style To Adapt Spoken Dialogue Systems To Low-resource User Groups Qi Zhiyang, Inaba Michimasa
- Iterative Experience Refinement Of Software-developing Agents Qian Chen, Li Jiahao, Dang Yufan, Liu Wei, Wang Yifei, Xie Zihao, Chen Weize, Yang Cheng, Zhang Yingli, Liu Zhiyuan, Sun Maosong
- Memorag: Moving Towards Next-gen RAG Via Memory-inspired Knowledge Discovery Qian Hongjin, Zhang Peitian, Liu Zheng, Mao Kelong, Dou Zhicheng
- How Easy Is It To Fool Your Multimodal Llms? An Empirical Analysis On Deceptive Prompts Qian Yusu, Zhang Haotian, Yang Yinfei, Gan Zhe
- Prompt Perturbation Consistency Learning For Robust Language Models Qiang Yao, Nandi Subhrangshu, Mehrabi Ninareh, Steeg Greg Ver, Kumar Anoop, Rumshisky Anna, Galstyan Aram
- LLM4SBR: A Lightweight And Effective Framework For Integrating Large Language Models In Session-based Recommendation Qiao Shutong, Gao Chen, Wen Junhao, Zhou Wei, Luo Qun, Chen Peixuan, Li Yong
- Supportiveness-based Knowledge Rewriting For Retrieval-augmented Language Modeling Qiao Zile, Ye Wei, Jiang Yong, Mo Tong, Xie Pengjun, Li Weiping, Huang Fei, Zhang Shikun
- Diffusiongpt: Llm-driven Text-to-image Generation System Qin Jie, Wu Jie, Chen Weifeng, Ren Yuxi, Li Huixia, Wu Hefeng, Xiao Xuefeng, Wang Rui, Wen Shilei
- The Uniqueness Of Llama3-70b With Per-channel Quantization: An Empirical Study Qin Minghai
- Empirical Guidelines For Deploying Llms Onto Resource-constrained Edge Devices Qin Ruiyang, Liu Dancheng, Yan Zheyu, Tan Zhaoxuan, Pan Zixuan, Jia Zhenge, Jiang Meng, Abbasi Ahmed, Xiong Jinjun, Shi Yiyu
- Enhancing Sequential Recommendations Through Multi-perspective Reflections And Iteration Qin Weicong, Xu Yi, Yu Weijie, Shen Chenglei, Zhang Xiao, He Ming, Fan Jianping, Xu Jun
- Lightning Attention-2: A Free Lunch For Handling Unlimited Sequence Lengths In Large Language Models Qin Zhen, Sun Weigao, Li Dong, Shen Xuyang, Sun Weixuan, Zhong Yiran
- LAMPO: Large Language Models As Preference Machines For Few-shot Ordinal Classification Qin Zhen, Wu Junru, Shen Jiaming, Liu Tianqi, Wang Xuanhui
- Efficient Interactive LLM Serving With Proxy Model-based Sequence Length Prediction Qiu Haoran, Mao Weichao, Patke Archit, Cui Shengkun, Jha Saurabh, Wang Chen, Franke Hubertus, Kalbarczyk Zbigniew T., Başar Tamer, Iyer Ravishankar K.
- Snapntell: Enhancing Entity-centric Visual Question Answering With Retrieval Augmented Multimodal LLM Qiu Jielin, Madotto Andrea, Lin Zhaojiang, Crook Paul A., Xu Yifan Ethan, Dong Xin Luna, Faloutsos Christos, Li Lei, Damavandi Babak, Moon Seungwhan
- Megen: Generative Backdoor In Large Language Models Via Model Editing Qiu Jiyang, Ma Xinbei, Zhang Zhuosheng, Zhao Hai
- Entropy-based Decoding For Retrieval-augmented Large Language Models Qiu Zexuan, Ou Zijing, Wu Bin, Li Jingjing, Liu Aiwei, King Irwin
- Layerwise Recurrent Router For Mixture-of-experts Qiu Zihan, Huang Zeyu, Cheng Shuang, Zhou Yizhi, Wang Zili, Titov Ivan, Fu Jie
- Tool Learning With Large Language Models: A Survey Qu Changle, Dai Sunhao, Wei Xiaochi, Cai Hengyi, Wang Shuaiqiang, Yin Dawei, Xu Jun, Wen Ji-rong
- Unsupervised Distractor Generation Via Large Language Model Distilling And Counterfactual Contrastive Decoding Qu Fanyi, Sun Hao, Wu Yunfang
- Before Generation, Align It! A Novel And Effective Strategy For Mitigating Hallucinations In Text-to-sql Generation Qu Ge, Li Jinyang, Li Bowen, Qin Bowen, Huo Nan, Ma Chenhao, Cheng Reynold
- DEEP-ICL: Definition-enriched Experts For Language Model In-context Learning Qu Xingwei, Liang Yiming, Wang Yucheng, Zheng Tianyu, Yue Tommy, Ma Lei, Huang Stephen W., Zhang Jiajun, Shi Yinan, Lin Chenghua, Fu Jie, Zhang Ge
- Coursegpt-zh: An Educational Large Language Model Based On Knowledge Distillation Incorporating Prompt Optimization Qu Zheyan, Yin Lu, Yu Zitong, Wang Wenbo, Zhang Xing
- Towards Generalizable Agents In Text-based Educational Environments: A Study Of Integrating RL With Llms Radmehr Bahar, Singla Adish, Käser Tanja
- Defan: Definitive Answer Dataset For Llms Hallucination Evaluation Rahman A B M Ashikur, Anwar Saeed, Usman Muhammad, Mian Ajmal
- Code Hallucination Rahman Mirza Masfiqur, Kundu Ashish
- Question-based Retrieval Using Atomic Units For Enterprise RAG Raina Vatsal, Gales Mark
- Knowledge-based Consistency Testing Of Large Language Models Rajan Sai Sathiesh, Soremekun Ezekiel, Chattopadhyay Sudipta
- Constructing Domain-specific Evaluation Sets For Llm-as-a-judge Raju Ravi, Jain Swayambhoo, Li Bo, Li Jonathan, Thakker Urmish
- Scavenging Hyena: Distilling Transformers Into Long Convolution Models Ralambomihanta Tokiniaina Raharison, Mohammadzadeh Shahrad, Islam Mohammad Sami Nur, Jabbour Wassim, Liang Laurence
- ANCHOR: Llm-driven News Subject Conditioning For Text-to-image Synthesis Ramakrishnan Aashish Anantha, Huang Sharon X., Lee Dongwon
- Self-refinement Of Language Models From External Proxy Metrics Feedback Ramji Keshav, Lee Young-suk, Astudillo Ramón Fernandez, Sultan Md Arafat, Naseem Tahira, Munawar Asim, Florian Radu, Roukos Salim
- WARP: On The Benefits Of Weight Averaged Rewarded Policies Ramé Alexandre, Ferret Johan, Vieillard Nino, Dadashi Robert, Hussenot Léonard, Cedoz Pierre-louis, Sessa Pier Giuseppe, Girgin Sertan, Douillard Arthur, Bachem Olivier
- Capturing Minds, Not Just Words: Enhancing Role-playing Language Models With Personality-indicative Data Ran Yiting, Wang Xintao, Xu Rui, Yuan Xinfeng, Liang Jiaqing, Xiao Yanghua, Yang Deqing
- A Fine-tuning Enhanced RAG System With Quantized Influence Measure As AI Judge Rangan Keshav, Yin Yiqiao
- SPARQL Generation: An Analysis On Fine-tuning Openllama For Question Answering Over A Life Science Knowledge Graph Rangel Julio C., De Farias Tarcisio Mendes, Sima Ana Claudia, Kobayashi Norio
- RAVEN: Multitask Retrieval Augmented Vision-language Learning Rao Varun Nagaraj, Choudhary Siddharth, Deshpande Aditya, Satzoda Ravi Kumar, Appalaraju Srikar
- SSP: Self-supervised Prompting For Cross-lingual Transfer To Low-resource Languages Using Large Language Models Rathore Vipul, Deb Aniruddha, Chandresh Ankish, Singla Parag, Mausam
- Context Embeddings For Efficient Answer Generation In RAG Rau David, Wang Shuai, Déjean Hervé, Clinchant Stéphane
- Lynx: An Open Source Hallucination Evaluation Model Ravi Selvan Sunitha, Mielczarek Bartosz, Kannappan Anand, Kiela Douwe, Qian Rebecca
- FACTOID: Factual Entailment For Hallucination Detection Rawte Vipula, Tonmoy S. M Towhidul Islam, Rajbangshi Krishnav, Nag Shravani, Chadha Aman, Sheth Amit P., Das Amitava
- FIRST: Faster Improved Listwise Reranking With Single Token Decoding Reddy Revanth Gangi, Doo Jaehyeok, Xu Yifei, Sultan Md Arafat, Swain Deevya, Sil Avirup, Ji Heng
- Retrieval-augmented Generation: Is Dense Passage Retrieval Retrieving? Reichman Benjamin, Heck Larry
- Beyond Performance: Quantifying And Mitigating Label Bias In Llms Reif Yuval, Schwartz Roy
- Strengthening Multimodal Large Language Model With Bootstrapped Preference Optimization Renjie Pi, Tianyang Han, Wei Xiong, Jipeng Zhang, Runtao Liu, Rui Pan, Tong Zhang
- The Benefits Of A Concise Chain Of Thought On Problem-solving In Large Language Models Renze Matthew, Guven Erhan
- Self-generated Replay Memories For Continual Neural Machine Translation Resta Michele, Bacciu Davide
- Practical And Reproducible Symbolic Music Generation By Large Language Models With Structural Embeddings Rhyu Seungyeon, Yang Kichang, Cho Sungjun, Kim Jaehyeon, Lee Kyogu, Lee Moontae
- How Multilingual Are Large Language Models Fine-tuned For Translation? Richburg Aquia, Carpuat Marine
- Empirical Study Of Symmetrical Reasoning In Conversational Chatbots Rim Daniela N., Choi Heeyoul
- Countering Reward Over-optimization In LLM With Demonstration-guided Reinforcement Learning Rita Mathieu, Strub Florian, Chaabouni Rahma, Michel Paul, Dupoux Emmanuel, Pietquin Olivier
- Exploring Advanced Large Language Models With Llmsuite Roffo Giorgio
- Zero-shot Spam Email Classification Using Pre-trained Large Language Models Rojas-galeano Sergio
- Question-instructed Visual Descriptions For Zero-shot Video Question Answering Romero David, Solorio Thamar
- RAGE Against The Machine: Retrieval-augmented LLM Explanations Rorseth Joel, Godfrey Parke, Golab Lukasz, Srivastava Divesh, Szlichta Jaroslaw
- CLAPNQ: Cohesive Long-form Answers From Passages In Natural Questions For RAG Systems Rosenthal Sara, Sil Avirup, Florian Radu, Roukos Salim
- Pairing Analogy-augmented Generation With Procedural Memory For Procedural Q&A Roth K, Gupta Rushil, Halle Simon, Liu Bang
- CONFLARE: Conformal Large Language Model Retrieval Rouzrokh Pouria, Faghani Shahriar, Gamble Cooper U., Shariatnia Moein, Erickson Bradley J.
- Enhancing Low-resource NMT With A Multilingual Encoder And Knowledge Distillation: A Case Study Roy Aniruddha, Ray Pretam, Maheshwari Ayush, Sarkar Sudeshna, Goyal Pawan
- Convolutional Prompting Meets Language Models For Continual Learning Roy Anurag, Moulick Riddhiman, Verma Vinay K., Ghosh Saptarshi, Das Abir
- ERATTA: Extreme RAG For Table To Answers With Large Language Models Roychowdhury Sohini, Krema Marko, Mahammad Anvar, Moore Brian, Mukherjee Arijit, Prakashchandra Punit
- Fist-financial Style Transfer With Hallucination And Creativity Control Framework Roychowdhury Sohini, Krema Marko, Moore Brian, Lai Xingjian, Effedua Dike, Jethwani Bharat
- Evaluation Of RAG Metrics For Question Answering In The Telecom Domain Roychowdhury Sujoy, Soman Sumit, Ranjani H G, Gunda Neeraj, Chhabra Vansh, Bala Sai Krishna
- From PEFT To DEFT: Parameter Efficient Finetuning For Reducing Activation Density In Transformers Runwal Bharat, Pedapati Tejaswini, Chen Pin-yu
- Writing In The Margins: Better Inference Pattern For Long Context Retrieval Russak Melisa, Jamil Umar, Bryant Christopher, Kamble Kiran, Magnuson Axel, Russak Mateusz, Alshikh Waseem
- Code-aware Prompting: A Study Of Coverage Guided Test Generation In Regression Setting Using LLM Ryan Gabriel, Jain Siddhartha, Shang Mingyue, Wang Shiqi, Ma Xiaofei, Ramanathan Murali Krishna, Ray Baishakhi
- How Chatgpt Changed The Media's Narratives On AI: A Semi-automated Narrative Analysis Through Frame Semantics Ryazanov Igor, Öhman Carl, Björklund Johanna
- Safetyprompts: A Systematic Review Of Open Datasets For Evaluating And Improving Large Language Model Safety Röttger Paul, Pernisi Fabio, Vidgen Bertie, Hovy Dirk
- Tandem Transformers For Inference Efficient Llms S Aishwarya P, Nair Pranav Ajit, Samaga Yashas, Boyd Toby, Kumar Sanjiv, Jain Prateek, Netrapalli Praneeth
- Capabilities Of Gemini Models In Medicine Saab Khaled, Tu Tao, Weng Wei-hung, Tanno Ryutaro, Stutz David, Wulczyn Ellery, Zhang Fan, Strother Tim, Park Chunjong, Vedadi Elahe, Chaves Juanma Zambrano, Hu Szu-yeu, Schaekermann Mike, Kamath Aishwarya, Cheng Yong, Barrett David G. T., Cheung Cathy, Mustafa Basil, Palepu Anil, Mcduff Daniel, Hou Le, Golany Tomer, Liu Luyang, Alayrac Jean-baptiste, Houlsby Neil, Tomasev Nenad, Freyberg Jan, Lau Charles, Kemp Jonas, Lai Jeremy, Azizi Shekoofeh, Kanada Kimberly, Man Siwai, Kulkarni Kavita, Sun Ruoxi, Shakeri Siamak, He Luheng, Caine Ben, Webson Albert, Latysheva Natasha, Johnson Melvin, Mansfield Philip, Lu Jian, Rivlin Ehud, Anderson Jesper, Green Bradley, Wong Renee, Krause Jonathan, Shlens Jonathon, Dominowska Ewa, Eslami S. M. Ali, Chou Katherine, Cui Claire, Vinyals Oriol, Kavukcuoglu Koray, Manyika James, Dean Jeff, Hassabis Demis, Matias Yossi, Webster Dale, Barral Joelle, Corrado Greg, Semturs Christopher, Mahdavi S. Sara, Gottweis Juraj, Karthikesalingam Alan, Natarajan Vivek
- Emobench: Evaluating The Emotional Intelligence Of Large Language Models Sabour Sahand, Liu Siyang, Zhang Zheyuan, Liu June M., Zhou Jinfeng, Sunaryo Alvionna S., Li Juanzi, Lee Tatia M. C., Mihalcea Rada, Huang Minlie
- Utebc-nlp At Semeval-2024 Task 9: Can Llms Be Lateral Thinkers? Sadeghi Pouya, Abaskohi Amirhossein, Yaghoobzadeh Yadollah
- Athena: Safe Autonomous Agents With Verbal Contrastive Learning Sadhu Tanmana, Pesaranghader Ali, Chen Yanan, Yi Dong Hoon
- Vision-language And Large Language Model Performance In Gastroenterology: GPT, Claude, Llama, Phi, Mistral, Gemma, And Quantized Models Safavi-naini Seyed Amir Ahmad, Ali Shuhaib, Shahab Omer, Shahhoseini Zahra, Savage Thomas, Rafiee Sara, Samaan Jamil S, Shabeeb Reem Al, Ladak Farah, Yang Jamie O, Echavarria Juan, Babar Sumbal, Shaukat Aasma, Margolis Samuel, Tatonetti Nicholas P, Nadkarni Girish, Kurdi Bara El, Soroush Ali
- A Systematic Survey Of Prompt Engineering In Large Language Models: Techniques And Applications Sahoo Pranab, Singh Ayush Kumar, Saha Sriparna, Jain Vinija, Mondal Samrat, Chadha Aman
- Where Is The Answer? Investigating Positional Bias In Language Model Knowledge Extraction Saito Kuniaki, Sohn Kihyuk, Lee Chen-yu, Ushiku Yoshitaka
- Optimization Methods For Personalizing Large Language Models Through Retrieval Augmentation Salemi Alireza, Kallumadi Surya, Zamani Hamed
- Simulating Task-oriented Dialogues With State Transition Graphs And Large Language Models Samarinas Chris, Promthaw Pracha, Nijasure Atharva, Zeng Hansi, Killingback Julian, Zamani Hamed
- The Future Of Large Language Model Pre-training Is Federated Sani Lorenzo, Iacob Alex, Cao Zeyu, Marino Bill, Gao Yan, Paulik Tomas, Zhao Wanru, Shen William F., Aleksandrov Preslav, Qiu Xinchi, Lane Nicholas D.
- Retrieval-augmented Generation Meets Data-driven Tabula Rasa Approach For Temporal Knowledge Graph Forecasting Sannidhi Geethan, Sakhinana Sagar Srinivas, Runkana Venkataramana
- Lean Attention: Hardware-aware Scalable Attention Mechanism For The Decode-phase Of Transformers Sanovar Rya, Bharadwaj Srikant, Amant Renee St., Rühle Victor, Rajmohan Saravan
- Grapheval: A Knowledge-graph Based LLM Hallucination Evaluation Framework Sansford Hannah, Richardson Nicholas, Maretic Hermina Petric, Saada Juba Nait
- Are Machines Better At Complex Reasoning? Unveiling Human-machine Inference Gaps In Entailment Verification Sanyal Soumya, Xiao Tianyi, Liu Jiacheng, Wang Wenya, Ren Xiang
- ICAL: Continual Learning Of Multimodal Agents By Transforming Trajectories Into Actionable Insights Sarch Gabriel, Jang Lawrence, Tarr Michael J., Cohen William W., Marino Kenneth, Fragkiadaki Katerina
- Improving Self Consistency In Llms Through Probabilistic Tokenization Sathe Ashutosh, Aggarwal Divyanshu, Sitaram Sunayana
- Improving Sentence Embeddings With Automatic Generation Of Training Data Using Few-shot Examples Sato Soma, Tsukagoshi Hayato, Sasano Ryohei, Takeda Koichi
- Improving The Validity Of Automatically Generated Feedback Via Reinforcement Learning Scarlatos Alexander, Smith Digory, Woodhead Simon, Lan Andrew
- The Human Factor In Detecting Errors Of Large Language Models: A Systematic Literature Review And Future Research Directions Schiller Christian A.
- Ruffle&riley: Insights From Designing And Evaluating A Large Language Model-based Conversational Tutoring System Schmucker Robin, Xia Meng, Azaria Amos, Mitchell Tom
- Symbolic Prompt Program Search: A Structure-aware Approach To Efficient Compile-time Prompt Optimization Schnabel Tobias, Neville Jennifer
- Evaluating Large Language Models With Fmeval Schwöbel Pola, Franceschi Luca, Zafar Muhammad Bilal, Vasist Keerthan, Malhotra Aman, Shenhar Tomer, Tailor Pinal, Yilmaz Pinar, Diamond Michael, Donini Michele
- Quantitative Knowledge Retrieval From Large Language Models Selby David, Spriestersbach Kai, Iwashita Yuichiro, Bappert Dennis, Warrier Archana, Mukherjee Sumantrak, Asim Muhammad Nabeel, Kise Koichi, Vollmer Sebastian
- Flashattention-3: Fast And Accurate Attention With Asynchrony And Low-precision Shah Jay, Bikshandi Ganesh, Zhang Ying, Thakkar Vijay, Ramani Pradeep, Dao Tri
- Ai-assisted Generation Of Difficult Math Questions Shah Vedant, Yu Dingli, Lyu Kaifeng, Park Simon, Ke Nan Rosemary, Mozer Michael, Bengio Yoshua, Arora Sanjeev, Goyal Anirudh
- Show, Don't Tell: Aligning Language Models With Demonstrated Feedback Shaikh Omar, Lam Michelle, Hejna Joey, Shao Yijia, Bernstein Michael, Yang Diyi
- Ai-native Memory: A Pathway From Llms Towards AGI Shang Jingbo, Zheng Zai, Wei Jiale, Ying Xiang, Tao Felix, Team Mindverse
- Synergy-of-thoughts: Eliciting Efficient Reasoning In Hybrid Language Models Shang Yu, Li Yu, Xu Fengli, Li Yong
- Llava-prumerge: Adaptive Token Reduction For Efficient Large Multimodal Models Shang Yuzhang, Cai Mu, Xu Bingxin, Lee Yong Jae, Yan Yan
- Extensible Embedding: A Flexible Multipler For Llm's Context Length Shao Ninglu, Xiao Shitao, Liu Zheng, Zhang Peitian
- Guiding In-context Learning Of Llms Through Quality Estimation For Machine Translation Sharami Javad Pourmostafa Roshan, Shterionov Dimitar, Spronck Pieter
- Combining Multiple Post-training Techniques To Achieve Most Efficient Quantized Llms Sharify Sayeh, Xu Zifei, Yazar Wanzin, Wang Xin
- Unirag: Universal Retrieval Augmentation For Multi-modal Large Language Models Sharifymoghaddam Sahel, Upadhyay Shivani, Chen Wenhu, Lin Jimmy
- Synth²: Boosting Visual-language Models With Synthetic Captions And Image Embeddings Sharifzadeh Sahand, Kaplanis Christos, Pathak Shreya, Kumaran Dharshan, Ilic Anastasija, Mitrovic Jovana, Blundell Charles, Banino Andrea
- Rumour Evaluation With Very Large Language Models Shehata Dahlia, Cohen Robin, Clarke Charles
- Rapid Optimization For Jailbreaking Llms Via Subconscious Exploitation And Echopraxia Shen Guangyu, Cheng Siyuan, Zhang Kaiyuan, Tao Guanhong, An Shengwei, Yan Lu, Zhang Zhuo, Ma Shiqing, Zhang Xiangyu
- Shortcutsbench: A Large-scale Real-world Benchmark For Api-based Agents Shen Haiyang, Li Yue, Meng Desong, Cai Dongqi, Qi Sheng, Zhang Li, Xu Mengwei, Ma Yun
- Improving Reinforcement Learning From Human Feedback Using Contrastive Rewards Shen Wei, Zhang Xiaoying, Yao Yuanshun, Zheng Rui, Guo Hongyi, Liu Yang
- Think-then-act: A Dual-angle Evaluated Retrieval-augmented Generation Shen Yige, Jiang Hao, Qu Hua, Zhao Jihong
- Jetmoe: Reaching Llama2 Performance With 0.1M Dollars Shen Yikang, Guo Zhen, Cai Tianle, Qin Zengyi
- Toursynbio: A Multi-modal Large Model And Agent Framework To Bridge Text And Protein Sequences For Protein Engineering Shen Yiqing, Chen Zan, Mamalakis Michail, Liu Yungeng, Li Tianbin, Su Yanzhou, He Junjun, Liò Pietro, Wang Yu Guang
- Proctag: Process Tagging For Assessing The Efficacy Of Document Instruction Data Shen Yufan, Luo Chuwei, Zhu Zhaoqing, Chen Yang, Zheng Qi, Yu Zhi, Bu Jiajun, Yao Cong
- Repeval: Effective Text Evaluation With LLM Representation Sheng Shuqian, Xu Yi, Zhang Tianhang, Shen Zanwei, Fu Luoyi, Ding Jiaxin, Zhou Lei, Wang Xinbing, Zhou Chenghu
- Latent Adversarial Training Improves Robustness To Persistent Harmful Behaviors In Llms Sheshadri Abhay, Ewart Aidan, Guo Phillip, Lynch Aengus, Wu Cindy, Hebbar Vivek, Sleight Henry, Stickland Asa Cooper, Perez Ethan, Hadfield-menell Dylan, Casper Stephen
- Paying More Attention To Image: A Training-free Method For Alleviating Hallucination In Lvlms Shi Liu, Kecheng Zheng, Wei Chen
- Ask-eda: A Design Assistant Empowered By LLM, Hybrid RAG And Abbreviation De-hallucination Shi Luyao, Kazda Michael, Sears Bradley, Shropshire Nick, Puri Ruchir
- Action Controlled Paraphrasing Shi Ning, Wu Zijun
- Wildfeedback: Aligning Llms With In-situ User Interactions And Feedback Shi Taiwei, Wang Zhuoer, Yang Longqi, Lin Ying-chun, He Zexue, Wan Mengting, Zhou Pei, Jauhar Sujay, Xu Xiaofeng, Song Xia, Neville Jennifer
- Medadapter: Efficient Test-time Adaptation Of Large Language Models Towards Medical Reasoning Shi Wenqi, Xu Ran, Zhuang Yuchen, Yu Yue, Wu Hang, Yang Carl, Wang May D.
- Ehragent: Code Empowers Large Language Models For Few-shot Complex Tabular Reasoning On Electronic Health Records Shi Wenqi, Xu Ran, Zhuang Yuchen, Yu Yue, Zhang Jieyu, Wu Hang, Zhu Yuanda, Ho Joyce, Yang Carl, Wang May D.
- Large Language Models Are Learnable Planners For Long-term Recommendation Shi Wentao, He Xiangnan, Zhang Yang, Gao Chongming, Li Xinyue, Zhang Jizhi, Wang Qifan, Feng Fuli
- Agentre: An Agent-based Framework For Navigating Complex Information Landscapes In Relation Extraction Shi Yuchen, Jiang Guochao, Qiu Tian, Yang Deqing
- Retrieval-enhanced Knowledge Editing In Language Models For Multi-hop Question Answering Shi Yucheng, Tan Qiaoyu, Wu Xuansheng, Zhong Shaochen, Zhou Kaixiong, Liu Ninghao
- Eragent: Enhancing Retrieval-augmented Language Models With Improved Accuracy, Efficiency, And Personalization Shi Yunxiao, Zi Xing, Shi Zijing, Zhang Haimin, Wu Qiang, Xu Min
- Generate-then-ground In Retrieval-augmented Generation For Multi-hop Question Answering Shi Zhengliang, Zhang Shuo, Sun Weiwei, Gao Shen, Ren Pengjie, Chen Zhumin, Ren Zhaochun
- CAUS: A Dataset For Question Generation Based On Human Cognition Leveraging Large Language Models Shin Minjung, Kim Donghyun, Ryu Jeh-kwang
- Decompose, Enrich, And Extract! Schema-aware Event Extraction Using Llms Shiri Fatemeh, Nguyen Van, Moghimifar Farhad, Yoo John, Haffari Gholamreza, Li Yuan-fang
- Adapting Llms To Hebrew: Unveiling Dictalm 2.0 With Enhanced Vocabulary And Instruction Capabilities Shmidman Shaltiel, Shmidman Avi, Cohen Amir Dn, Koppel Moshe
- Structuredrag: JSON Response Formatting With Large Language Models Shorten Connor, Pierse Charles, Smith Thomas Benjamin, Cardenas Erika, Sharma Akanksha, Trengrove John, Van Luijt Bob
- Knowledge Graph Large Language Model (KG-LLM) For Link Prediction Shu Dong, Chen Tianle, Jin Mingyu, Zhang Chong, Du Mengnan, Zhang Yongfeng
- FIRST: Teach A Reliable Large Language Model Through Efficient Trustworthy Distillation Shum Kashun, Xu Minrui, Zhang Jianshu, Chen Zixin, Diao Shizhe, Dong Hanze, Zhang Jipeng, Raza Muhammad Omer
- Integrating Large Language Models Into Recommendation Via Mutual Augmentation And Adaptive Aggregation Sichun Luo, Yuxuan Yao, Bowei He, Yinya Huang, Aojun Zhou, Xinyi Zhang, Yuanzhang Xiao, Mingjie Zhan, Linqi Song
- H2o-danube-1.8b Technical Report Singer Philipp, Pfeiffer Pascal, Babakhin Yauhen, Jeblick Maximilian, Dhankhar Nischay, Fodor Gabor, Ambati Sri Satish
- Llm-dcache: Improving Tool-augmented Llms With Gpt-driven Localized Data Caching Singh Simranjit, Fore Michael, Karatzas Andreas, Lee Chaehong, Jian Yanan, Shangguan Longfei, Yu Fuxun, Anagnostopoulos Iraklis, Stamoulis Dimitrios
- An Empirical Study Of Validating Synthetic Data For Formula Generation Singh Usneek, Cambronero José, Gulwani Sumit, Kanade Aditya, Khatry Anirudh, Le Vu, Singh Mukul, Verbruggen Gust
- LGR2: Language Guided Reward Relabeling For Accelerating Hierarchical Reinforcement Learning Singh Utsav, Bhattacharyya Pramit, Namboodiri Vinay P.
- Recall Them All: Retrieval-augmented Language Models For Long Object List Extraction From Long Documents Singhania Sneha, Razniewski Simon, Weikum Gerhard
- Are Small Language Models Ready To Compete With Large Language Models For Practical Applications? Sinha Neelabh, Jain Vinija, Chadha Aman
- Maml-en-llm: Model Agnostic Meta-training Of Llms For Improved In-context Learning Sinha Sanchit, Yue Yuguang, Soto Victor, Kulkarni Mayank, Lu Jianhua, Zhang Aidong
- Unipa-gpt: Large Language Models For University-oriented QA In Italian Siragusa Irene, Pirrone Roberto
- Replan: Robotic Replanning With Perception And Language Models Skreta Marta, Zhou Zihan, Yuan Jia Lin, Darvish Kourosh, Aspuru-guzik Alán, Garg Animesh
- The Mysterious Case Of Neuron 1512: Injectable Realignment Architectures Reveal Internal Characteristics Of Meta's Llama 2 Model Smith Brenden, Baker Dallin, Chase Clayton, Barney Myles, Parker Kaden, Allred Makenna, Hu Peter, Evans Alex, Fulda Nancy
- Explaining Autonomy: Enhancing Human-robot Interaction Through Explanation Generation With Large Language Models Sobrín-hidalgo David, González-santamarta Miguel A., Guerrero-higueras Ángel M., Rodríguez-lera Francisco J., Matellán-olivera Vicente
- Gistembed: Guided In-sample Selection Of Training Negatives For Text Embedding Fine-tuning Solatorio Aivin V.
- Multi-task Inference: Can Large Language Models Follow Multiple Instructions At Once? Son Guijin, Baek Sangwon, Nam Sangdae, Jeong Ilgyun, Kim Seungone
- A New Pipeline For Generating Instruction Dataset Via RAG And Self Fine-tuning Song Chih-wei, Lee Yu-kai, Tsai Yin-te
- Hyacinth6b: A Large Language Model For Traditional Chinese Song Chih-wei, Tsai Yin-te
- Scaling Data Diversity For Fine-tuning Language Models In Human Alignment Song Feifan, Yu Bowen, Lang Hao, Yu Haiyang, Huang Fei, Wang Houfeng, Li Yongbin
- Increasing Model Capacity For Free: A Simple Strategy For Parameter Efficient Fine-tuning Song Haobo, Zhao Hao, Majumder Soumajit, Lin Tao
- Moma: Multimodal LLM Adapter For Fast Personalized Image Generation Song Kunpeng, Zhu Yizhe, Liu Bingchen, Yan Qing, Elgammal Ahmed, Yang Xiao
- Can Many-shot In-context Learning Help Long-context LLM Judges? See More, Judge Better! Song Mingyang, Zheng Mao, Luo Xuan
- Knowledge Editing On Black-box Large Language Models Song Xiaoshuai, Wang Zhengyang, He Keqing, Dong Guanting, Mou Yutao, Zhao Jinxu, Xu Weiran
- Position: Leverage Foundational Models For Black-box Optimization Song Xingyou, Tian Yingtao, Lange Robert Tjarko, Lee Chansoo, Tang Yujin, Chen Yutian
- Empo: Theory-driven Dataset Construction For Empathetic Response Generation Through Preference Optimization Sotolar Ondrej
- Fine Tuning Vs. Retrieval Augmented Generation For Less Popular Knowledge Soudani Heydar, Kanoulas Evangelos, Hasibi Faegheh
- Repetition Improves Language Model Embeddings Springer Jacob Mitchell, Kotha Suhas, Fried Daniel, Neubig Graham, Raghunathan Aditi
- PUB: A Pragmatics Understanding Benchmark For Assessing Llms' Pragmatics Capabilities Sravanthi Settaluri Lakshmi, Doshi Meet, Kalyan Tankala Pavan, Murthy Rudra, Bhattacharyya Pushpak, Dabre Raj
- Crossing New Frontiers: Knowledge-augmented Large Language Model Prompting For Zero-shot Text-based De Novo Molecule Design Srinivas Sakhinana Sagar, Runkana Venkataramana
- LUCID: Llm-generated Utterances For Complex And Interesting Dialogues Stacey Joe, Cheng Jianpeng, Torr John, Guigue Tristan, Driesen Joris, Coca Alexandru, Gaynor Mark, Johannsen Anders
- Towards Truly Zero-shot Compositional Visual Reasoning With Llms As Programmers Stanić Aleksandar, Caelles Sergi, Tschannen Michael
- A Comprehensive Study Of The Capabilities Of Large Language Models For Vulnerability Detection Steenhoek Benjamin, Rahman Md Mahbubur, Roy Monoshi Kumar, Alam Mirza Sanjida, Barr Earl T., Le Wei
- Virtual Agents For Alcohol Use Counseling: Exploring Llm-powered Motivational Interviewing Steenstra Ian, Nouraei Farnaz, Arjmand Mehdi, Bickmore Timothy W.
- LACIE: Listener-aware Finetuning For Confidence Calibration In Large Language Models Stengel-eskin Elias, Hase Peter, Bansal Mohit
- Satyrn: A Platform For Analytics Augmented Generation Sterbentz Marko, Barrie Cameron, Shahi Shubham, Dutta Abhratanu, Hooshmand Donna, Pack Harper, Hammond Kristian J.
- Groundedness In Retrieval-augmented Long-form Generation: An Empirical Study Stolfo Alessandro
- ARKS: Active Retrieval In Knowledge Soup For Code Generation Su Hongjin, Jiang Shuyang, Lai Yuhang, Wu Haoyuan, Shi Boao, Liu Che, Liu Qian, Yu Tao
- Unsupervised Real-time Hallucination Detection Based On The Internal States Of Large Language Models Su Weihang, Wang Changyue, Ai Qingyao, Hu Yiran, Wu Zhijing, Zhou Yujia, Liu Yiqun
- SK-VQA: Synthetic Knowledge Generation At Scale For Training Context-augmented Multimodal Llms Su Xin, Luo Man, Pan Kris W, Chou Tien Pei, Lal Vasudev, Howard Phillip
- Demonstration Augmentation For Zero-shot In-context Learning Su Yi, Tai Yunpeng, Ji Yixin, Li Juntao, Yan Bowen, Zhang Min
- Living In The Moment: Can Large Language Models Grasp Co-temporal Reasoning? Su Zhaochen, Li Juntao, Zhang Jun, Zhu Tong, Qu Xiaoye, Zhou Pan, Bowen Yan, Cheng Yu, Zhang Min
- Timo: Towards Better Temporal Reasoning For Language Models Su Zhaochen, Zhang Jun, Zhu Tong, Qu Xiaoye, Li Juntao, Zhang Min, Cheng Yu
- Whispering Experts: Neural Interventions For Toxicity Mitigation In Language Models Suau Xavier, Delobelle Pieter, Metcalf Katherine, Joulin Armand, Apostoloff Nicholas, Zappella Luca, Rodríguez Pau
- M-QALM: A Benchmark To Assess Clinical Reading Comprehension And Knowledge Recall In Large Language Models Via Question Answering Subramanian Anand, Schlegel Viktor, Kashyap Abhinav Ramesh, Nguyen Thanh-tung, Dwivedi Vijay Prakash, Winkler Stefan
- LAB: Large-scale Alignment For Chatbots Sudalairaj Shivchander, Bhandwaldar Abhishek, Pareja Aldo, Xu Kai, Cox David D., Srivastava Akash
- Fidelis: Faithful Reasoning In Large Language Model For Knowledge Graph Question Answering Sui Yuan, He Yufei, Liu Nian, He Xiaoxin, Wang Kun, Hooi Bryan
- Structured Chain-of-thought Prompting For Few-shot Generation Of Content-grounded QA Conversations Sultan Md Arafat, Ganhotra Jatin, Astudillo Ramón Fernandez
- Parallelparc: A Scalable Pipeline For Generating Natural-language Analogies Sultan Oren, Bitton Yonatan, Yosef Ron, Shahaf Dafna
- Triforce: Lossless Acceleration Of Long Sequence Generation With Hierarchical Speculative Decoding Sun Hanshi, Chen Zhuoming, Yang Xinyu, Tian Yuandong, Chen Beidi
- Retrieved In-context Principles From Previous Mistakes Sun Hao, Jiang Yong, Wang Bo, Hou Yingyan, Zhang Yan, Xie Pengjun, Huang Fei
- Evograd: A Dynamic Take On The Winograd Schema Challenge With Human Adversaries Sun Jing Han, Emami Ali
- Lawluo: A Chinese Law Firm Co-run By LLM Agents Sun Jingyun, Dai Chengxiao, Luo Zhongze, Chang Yangbo, Li Yang
- Diversifying Query: Region-guided Transformer For Temporal Sentence Grounding Sun Xiaolong, Shi Liushuai, Wang Le, Zhou Sanping, Xia Kun, Wang Yabing, Hua Gang
- DFA-RAG: Conversational Semantic Router For Large Language Model With Definite Finite Automaton Sun Yiyou, Hu Junjie, Cheng Wei, Chen Haifeng
- Leveraging Chatgpt In Pharmacovigilance Event Extraction: An Empirical Study Sun Zhaoyue, Pergola Gabriele, Wallace Byron C., He Yulan
- Llavaolmobitnet1b: Ternary LLM Goes Multimodal! Sundaram Jainaveen, Iyer Ravi
- Meta-prompting: Enhancing Language Models With Task-agnostic Scaffolding Suzgun Mirac, Kalai Adam Tauman
- JORA: JAX Tensor-parallel Lora Library For Retrieval Augmented Fine-tuning Tahir Anique, Cheng Lu, Liu Huan
- All In How You Ask For It: Simple Black-box Method For Jailbreak Attacks Takemoto Kazuhiro
- Improving Large Language Model (LLM) Fidelity Through Context-aware Grounding: A Systematic Approach To Reliability And Veracity Talukdar Wrick, Biswas Anjanava
- An Improved Traditional Chinese Evaluation Suite For Foundation Model Tam Zhi-rui, Pai Ya-ting, Lee Yen-wei, Chen Jun-da, Chu Wei-min, Cheng Sega, Shuai Hong-han
- Retrieval Meets Reasoning: Even High-school Textbook Knowledge Benefits Multimodal Reasoning Tan Cheng, Wei Jingxuan, Sun Linzhuang, Gao Zhangyang, Li Siyuan, Yu Bihui, Guo Ruifeng, Li Stan Z.
- Blinded By Generated Contexts: How Language Models Merge Generated And Retrieved Contexts When Knowledge Conflicts? Tan Hexiang, Sun Fei, Yang Wanli, Wang Yuanzhuo, Cao Qi, Cheng Xueqi
- Small Models, Big Insights: Leveraging Slim Proxy Models To Decide When And What To Retrieve For Llms Tan Jiejun, Dou Zhicheng, Zhu Yutao, Guo Peidong, Fang Kun, Wen Ji-rong
- Thought-like-pro: Enhancing Reasoning Of Large Language Models Through Self-driven Prolog-based Chain-of-thought Tan Xiaoyu, Deng Yongxin, Qiu Xihe, Xu Weidi, Qu Chao, Chu Wei, Xu Yinghui, Qi Yuan
- Struct-x: Enhancing Large Language Models Reasoning With Structured Data Tan Xiaoyu, Wang Haoyu, Qiu Xihe, Cheng Yuan, Xu Yinghui, Chu Wei, Qi Yuan
- An Empirical Study And Analysis Of Text-to-image Generation Using Large Language Model-powered Textual Representation Tan Zhiyu, Yang Mengping, Qin Luozheng, Yang Hao, Qian Ye, Zhou Qiang, Zhang Cheng, Li Hao
- EVALALIGN: Supervised Fine-tuning Multimodal Llms With Human-aligned Data For Evaluating Text-to-image Models Tan Zhiyu, Yang Xiaomeng, Qin Luozheng, Yang Mengping, Zhang Cheng, Li Hao
- Leveraging Language Models For Emotion And Behavior Analysis In Education Tanaka Kaito, Tan Benjamin, Wong Brian
- Jill Watson: A Virtual Teaching Assistant Powered By Chatgpt Taneja Karan, Maiti Pratyusha, Kakar Sandeep, Guruprasad Pranav, Rao Sanjeev, Goel Ashok K.
- Llmbox: A Comprehensive Library For Large Language Models Tang Tianyi, Hu Yiwen, Li Bingqian, Luo Wenyang, Qin Zijing, Sun Haoxiang, Wang Jiapeng, Xu Shiyi, Cheng Xiaoxue, Guo Geyang, Peng Han, Zheng Bowen, Tang Yiru, Min Yingqian, Chen Yushuo, Chen Jie, Zhao Yuanqian, Ding Luran, Wang Yuhao, Dong Zican, Xia Chunxuan, Li Junyi, Zhou Kun, Zhao Wayne Xin, Wen Ji-rong
- A + B: A General Generator-reader Framework For Optimizing Llms To Unleash Synergy Potential Tang Wei, Cao Yixin, Ying Jiahao, Wang Bo, Zhao Yuyue, Liao Yong, Zhou Pengyuan
- Multihop-rag: Benchmarking Retrieval-augmented Generation For Multi-hop Queries Tang Yixuan, Yang Yi
- Mathscale: Scaling Instruction Tuning For Mathematical Reasoning Tang Zhengyang, Zhang Xingxing, Wang Benyou, Wei Furu
- Enhancing Commentary Strategies For Imperfect Information Card Games: A Study Of Large Language Models In Guandan Commentary Tao Meiling, Liang Xuechen, Wang Ziyi, Tao Yiling, Shi Tianyu
- When To Trust Llms: Aligning Confidence With Response Quality Tao Shuchang, Yao Liuyi, Ding Hanxing, Xie Yuexiang, Cao Qi, Sun Fei, Gao Jinyang, Shen Huawei, Ding Bolin
- Distilling Llms' Decomposition Abilities Into Compact Language Models Tarasov Denis, Shridhar Kumar
- Dynamic Contexts For Generating Suggestion Questions In RAG Based Conversational Systems Tayal Anuja, Tyagi Aman
- From Feature Importance To Natural Language Explanations Using Llms With RAG Tekkesinoglu Sule, Kunze Lars
- Embedding-aligned Language Models Tennenholtz Guy, Chow Yinlam, Hsu Chih-wei, Shani Lior, Liang Ethan, Boutilier Craig
- Interactive Prompt Debugging With Sequence Salience Tenney Ian, Mullins Ryan, Du Bin, Pandya Shree, Kahng Minsuk, Dixon Lucas
- Training-free Consistent Text-to-image Generation Tewel Yoad, Kaduri Omri, Gal Rinon, Kasten Yoni, Wolf Lior, Chechik Gal, Atzmon Yuval
- Judging The Judges: Evaluating Alignment And Vulnerabilities In Llms-as-judges Thakur Aman Singh, Choudhary Kartik, Ramayapally Venkat Srinik, Vaidyanathan Sankaran, Hupkes Dieuwke
- Introducing Super Rags In Mistral 8x7b-v1 Thakur Ayush, Gupta Raghav
- Supporting Human Raters With The Detection Of Harmful Content Using Large Language Models Thomas Kurt, Kelley Patrick Gage, Tao David, Meiklejohn Sarah, Vallis Owen, Tan Shunwen, Bratanič Blaž, Ferreira Felipe Tiengo, Eranti Vijay Kumar, Bursztein Elie
- Limits Of Transformer Language Models On Learning To Compose Algorithms Thomm Jonathan, Terzic Aleksandar, Camposampiero Giacomo, Hersche Michael, Schölkopf Bernhard, Rahimi Abbas
- Dubo-sql: Diverse Retrieval-augmented Generation And Fine Tuning For Text-to-sql Thorpe Dayton G., Duberstein Andrew J., Kinsey Ian A.
- Battling Botpoop Using Genai For Higher Education: A Study Of A Retrieval Augmented Generation Chatbots Impact On Learning Thway Maung, Recatala-gomez Jose, Lim Fun Siong, Hippalgaonkar Kedar, Ng Leonard W. T.
- Tokenize The World Into Object-level Knowledge To Address Long-tail Events In Autonomous Driving Tian Ran, Li Boyi, Weng Xinshuo, Chen Yuxiao, Schmerling Edward, Wang Yue, Ivanovic Boris, Pavone Marco
- Tinyllm: Learning A Small Student From Multiple Large Language Models Tian Yijun, Han Yikun, Chen Xiusi, Wang Wei, Chawla Nitesh V.
- Are Large Language Models Capable Of Generating Human-level Narratives? Tian Yufei, Huang Tenghao, Liu Miri, Jiang Derek, Spangher Alexander, Chen Muhao, May Jonathan, Peng Nanyun
- Cantor: Inspiring Multimodal Chain-of-thought Of MLLM Gao Timin, Chen Peixian, Zhang Mengdan, Fu Chaoyou, Shen Yunhang, Zhang Yan, Zhang Shengchuan, Zheng Xiawu, Sun Xing, Cao Liujuan, Ji Rongrong
- Can Large Language Models Assess Serendipity In Recommender Systems? Tokutake Yu, Okamoto Kazushi
- Generative AI Systems: A Systems-based Perspective On Generative AI Tomczak Jakub M.
- Can Llms Learn From Previous Mistakes? Investigating Llms' Errors To Boost For Reasoning Tong Yongqi, Li Dawei, Wang Sizhe, Wang Yujia, Teng Fei, Shang Jingbo
- A Comprehensive Survey Of Hallucination Mitigation Techniques In Large Language Models Tonmoy S. M Towhidul Islam, Zaman S M Mehedi, Jain Vinija, Rani Anku, Rawte Vipula, Chadha Aman, Das Amitava
- Evaluating Large Language Models With Grid-based Game Competitions: An Extensible LLM Benchmark And Leaderboard Topsakal Oguzhan, Edell Colby Jacob, Harper Jackson Bailey
- Few-shot Cross-lingual Transfer For Prompting Large Language Models In Low-resource Languages Toukmaji Christopher
- Xtower: A Multilingual LLM For Explaining And Correcting Translation Errors Treviso Marcos, Guerreiro Nuno M., Agrawal Sweta, Rei Ricardo, Pombal José, Vaz Tania, Wu Helena, Silva Beatriz, Van Stigt Daan, Martins André F. T.
- Leveraging LLM Reasoning Enhances Personalized Recommender Systems Tsai Alicia Y., Kraft Adam, Jin Long, Cai Chenwei, Hosseini Anahita, Xu Taibai, Zhang Zemin, Hong Lichan, Chi Ed H., Yi Xinyang
- Enhance Modality Robustness In Text-centric Multimodal Alignment With Adversarial Prompting Tsai Yun-da, Yen Ting-yu, Liao Keng-te, Lin Shou-de
- Two Tales Of Persona In Llms: A Survey Of Role-playing And Personalization Tseng Yu-min, Huang Yu-chao, Hsiao Teng-yun, Chen Wei-lin, Huang Chao-wei, Meng Yu, Chen Yun-nung
- Predictions From Language Models For Multiple-choice Tasks Are Not Robust Under Variation Of Scoring Methods Tsvilodub Polina, Wang Hening, Grosch Sharon, Franke Michael
- R-eval: A Unified Toolkit For Evaluating Domain Knowledge Of Retrieval Augmented Large Language Models Tu Shangqing, Wang Yuanchun, Yu Jifan, Xie Yuyang, Shi Yaran, Wang Xiaozhi, Zhang Jing, Hou Lei, Li Juanzi
- Dynamo: Accelerating Language Model Inference With Dynamic Multi-token Sampling Tuli Shikhar, Lin Chi-heng, Hsu Yen-chang, Jha Niraj K., Shen Yilin, Jin Hongxia
- VBART: The Turkish LLM Turker Meliksah, Ari Mehmet Erdi, Han Aydin
- Attention Is All You Need But You Don't Need All Of It For Inference Of Large Language Models Tyukin Georgy, Dovonon Gbetondji J-s, Kaddour Jean, Minervini Pasquale
- Lifelong Robot Library Learning: Bootstrapping Composable And Generalizable Skills For Embodied Control With Language Models Tziafas Georgios, Kasaei Hamidreza
- Hypotermqa: Hypothetical Terms Dataset For Benchmarking Hallucination Tendency Of Llms Uluoglakci Cem, Temizel Tugba Taskaya
- No Size Fits All: The Perils And Pitfalls Of Leveraging Llms Vary With Company Size Urlana Ashok, Kumar Charaka Vinayak, Garlapati Bala Mallikarjunarao, Singh Ajeet Kumar, Mishra Rahul
- Llms With Industrial Lens: Deciphering The Challenges And Prospects -- A Survey Urlana Ashok, Kumar Charaka Vinayak, Singh Ajeet Kumar, Garlapati Bala Mallikarjunarao, Chalamala Srinivasa Rao, Mishra Rahul
- Natural Language Programming In Medicine: Administering Evidence Based Clinical Workflows With Autonomous Agents Powered By Generative Large Language Models Vaid Akhil, Lampert Joshua, Lee Juhee, Sawant Ashwin, Apakama Donald, Sakhuja Ankit, Soroush Ali, Bick Sarah, Abbott Ethan, Gomez Hernando, Hadley Michael, Lee Denise, Landi Isotta, Duong Son Q, Bussola Nicole, Nabeel Ismail, Muehlstedt Silke, Freeman Robert, Kovatch Patricia, Carr Brendan, Wang Fei, Glicksberg Benjamin, Argulian Edgar, Lerakis Stamatios, Khera Rohan, Reich David L., Kraft Monica, Charney Alexander, Nadkarni Girish
- Accelerating Large Language Model Inference With Self-supervised Early Exits Valade Florian
- Can GPT Redefine Medical Understanding? Evaluating GPT On Biomedical Machine Reading Comprehension Vatsal Shubham, Singh Ayush
- RAG Based Question-answering For Contextual Response Prediction System Veturi Sriram, Vaichal Saurabh, Jagadheesh Reshma Lal, Tripto Nafis Irtiza, Yan Nian
- How Much Data Is Enough Data? Fine-tuning Large Language Models For In-house Translation: Performance Evaluation Across Multiple Dataset Sizes Vieira Inacio, Allred Will, Lankford Séamus, Castilho Sheila, Way Andy
- Emotional Manipulation Through Prompt Engineering Amplifies Disinformation Generation In AI Large Language Models Vinay Rasita, Spitale Giovanni, Biller-andorno Nikola, Germani Federico
- Medreqal: Examining Medical Knowledge Recall Of Large Language Models Via Question Answering Vladika Juraj, Schneider Phillip, Matthes Florian
- Redwhale: An Adapted Korean LLM Through Efficient Continual Pretraining Vo Anh-dung, Jung Minseong, Lee Wonbeen, Choi Daewoo
- Vi-mistral-x: Building A Vietnamese Language Model With Advanced Continual Pre-training Vo James
- Exploring Fact Memorization And Style Imitation In Llms Using Qlora: An Experimental Study And Quality Assessment Methods Vyborov Eugene, Osypenko Oleksiy, Sotnyk Serge
- Generative Large Language Models In Automated Fact-checking: A Survey Vykopal Ivan, Pikuliak Matúš, Ostermann Simon, Šimko Marián
- Leveraging Zero-shot Prompting For Efficient Language Model Distillation Vöge Lukas, Gurgul Vincent, Lessmann Stefan
- From Rags To Rich Parameters: Probing How Language Models Utilize External Knowledge Over Parametric Information For Factual Queries Wadhwa Hitesh, Seetharaman Rahul, Aggarwal Somyaa, Ghosh Reshmi, Basu Samyadeep, Srinivasan Soundararajan, Zhao Wenlong, Chaudhari Shreyas, Aghazadeh Ehsan
- An Empirical Study Of Mamba-based Language Models Waleffe Roger, Byeon Wonmin, Riach Duncan, Norick Brandon, Korthikanti Vijay, Dao Tri, Gu Albert, Hatamizadeh Ali, Singh Sudhakar, Narayanan Deepak, Kulshreshtha Garvit, Singh Vartika, Casper Jared, Kautz Jan, Shoeybi Mohammad, Catanzaro Bryan
- Dynamic Self-consistency: Leveraging Reasoning Paths For Efficient LLM Sampling Wan Guangya, Wu Yuqi, Chen Jie, Li Sheng
- COSMO: Contrastive Streamlined Multimodal Model With Interleaved Pre-training Wang Alex Jinpeng, Li Linjie, Lin Kevin Qinghong, Wang Jianfeng, Lin Kevin, Yang Zhengyuan, Wang Lijuan, Shou Mike Zheng
- Fine-grained Self-endorsement Improves Factuality And Reasoning Wang Ante, Song Linfeng, Peng Baolin, Tian Ye, Jin Lifeng, Mi Haitao, Su Jinsong, Yu Dong
- GOLF: Goal-oriented Long-term Life Tasks Supported By Human-ai Collaboration Wang Ben
- ANGO: A Next-level Evaluation Benchmark For Generation-oriented Language Models In Chinese Domain Wang Bingchao
- Halu-j: Critique-based Hallucination Judge Wang Binjie, Chern Steffi, Chern Ethan, Liu Pengfei
- Grokked Transformers Are Implicit Reasoners: A Mechanistic Journey To The Edge Of Generalization Wang Boshi, Yue Xiang, Su Yu, Sun Huan
- Towards Efficient And Effective Unlearning Of Large Language Models For Recommendation Wang Hangyu, Lin Jianghao, Chen Bo, Yang Yang, Tang Ruiming, Zhang Weinan, Yu Yong
- Blendfilter: Advancing Retrieval-augmented Large Language Models Via Query Generation Blending And Knowledge Filtering Wang Haoyu, Li Ruirui, Jiang Haoming, Tian Jinjin, Wang Zhengyang, Luo Chen, Tang Xianfeng, Cheng Monica, Zhao Tuo, Gao Jing
- Unims-rag: A Unified Multi-source Retrieval-augmented Generation For Personalized Dialogue Systems Wang Hongru, Huang Wenyu, Deng Yang, Wang Rui, Wang Zezhong, Wang Yufei, Mi Fei, Pan Jeff Z., Wong Kam-fai
- M4U: Evaluating Multilingual Understanding And Reasoning For Large Multimodal Models Wang Hongyu, Xu Jiayu, Xie Senwei, Wang Ruiping, Li Jialin, Xie Zhaojie, Zhang Bin, Xiong Chuyan, Chen Xilin
- Is A Picture Worth A Thousand Words? Delving Into Spatial Reasoning For Vision Language Models Wang Jiayu, Ming Yifei, Shi Zhenmei, Vineet Vibhav, Wang Xin, Joshi Neel
- JMLR: Joint Medical LLM And Retrieval Training For Enhancing Reasoning And Professional Question Answering Capability Wang Junda, Yang Zhichao, Yao Zonghai, Yu Hong
- Mixture-of-agents Enhances Large Language Model Capabilities Wang Junlin, Wang Jue, Athiwaratkun Ben, Zhang Ce, Zou James
- Balancing Rigor And Utility: Mitigating Cognitive Biases In Large Language Models For Multiple-choice Questions Wang Liman, Zhong Hanyang, Cao Wenting, Sun Zeyuan
- Selective Forgetting: Advancing Machine Unlearning Techniques And Evaluation In Language Models Wang Lingzhi, Zeng Xingshan, Guo Jinsong, Wong Kam-fai, Gottlob Georg
- Exploring The Potential Of Multimodal LLM With Knowledge-intensive Multimodal ASR Wang Minghan, Wang Yuxia, Vu Thuy-trang, Shareghi Ehsan, Haffari Gholamreza
- Leave No Document Behind: Benchmarking Long-context Llms With Extended Multi-doc QA Wang Minzheng, Chen Longze, Fu Cheng, Liao Shengyi, Zhang Xinghua, Wu Bingli, Yu Haiyang, Xu Nan, Zhang Lei, Luo Run, Li Yunshui, Yang Min, Huang Fei, Li Yongbin
- Multi-perspective Consistency Enhances Confidence Estimation In Large Language Models Wang Pei, Wang Yejie, Diao Muxi, He Keqing, Dong Guanting, Xu Weiran
- Prefclm: Enhancing Preference-based Reinforcement Learning With Crowdsourced Large Language Models Wang Ruiqi, Zhao Dezhong, Yuan Ziqin, Obi Ike, Min Byung-cheol
- One Prompt Is Not Enough: Automated Construction Of A Mixture-of-expert Prompts Wang Ruochen, An Sohyun, Cheng Minhao, Zhou Tianyi, Hwang Sung Ju, Hsieh Cho-jui
- Deconfounded Causality-aware Parameter-efficient Fine-tuning For Problem-solving Improvement Of Llms Wang Ruoyu, Li Xiaoxuan, Yao Lina
- Unique Security And Privacy Threats Of Large Language Model: A Comprehensive Survey Wang Shang, Zhu Tianqing, Liu Bo, Ding Ming, Guo Xu, Ye Dayong, Zhou Wanlei, Yu Philip S.
- What Is The Best Way For Chatgpt To Translate Poetry? Wang Shanshan, Wong Derek F., Yao Jingming, Chao Lidia S.
- USCD: Improving Code Generation Of Llms By Uncertainty-aware Selective Contrastive Decoding Wang Shuai, Ding Liang, Shen Li, Luo Yong, He Zheng, Yu Wei, Tao Dacheng
- Weaver: Foundation Models For Creative Writing Wang Tiannan, Chen Jiamin, Jia Qingrui, Wang Shuai, Fang Ruoyu, Wang Huilin, Gao Zhaowei, Xie Chunzhao, Xu Chuou, Dai Jihong, Liu Yibin, Wu Jialong, Ding Shengwei, Li Long, Huang Zhiwei, Deng Xinle, Yu Teng, Ma Gangan, Xiao Han, Chen Zixin, Xiang Danjun, Wang Yunxia, Zhu Yuanyuan, Xiao Yi, Wang Jing, Wang Yiru, Ding Siran, Huang Jiayang, Xu Jiayi, Tayier Yilihamu, Hu Zhenyu, Gao Yuan, Zheng Chengfeng, Ye Yueshu, Li Yihang, Wan Lei, Jiang Xinyue, Wang Yujie, Cheng Siyu, Song Zhule, Tang Xiangru, Xu Xiaohua, Zhang Ningyu, Chen Huajun, Jiang Yuchen Eleanor, Zhou Wangchunshu
- QUITO: Accelerating Long-context Reasoning Through Query-guided Context Compression Wang Wenshan, Wang Yihang, Fan Yixing, Liao Huaming, Guo Jiafeng
- Learning To Ask: When Llms Meet Unclear Instruction Wang Wenxuan, Shi Juluan, Wang Chaozheng, Lee Cheryl, Yuan Youliang, Huang Jen-tse, Lyu Michael R.
- Adaptive Retrieval-augmented Generation For Conversational Systems Wang Xi, Sen Procheta, Li Ruizhe, Yilmaz Emine
- Searching For Best Practices In Retrieval-augmented Generation Wang Xiaohua, Wang Zhenghua, Gao Xuan, Zhang Feiran, Wu Yixin, Xu Zhibo, Shi Tianyuan, Wang Zhengyuan, Li Shizheng, Qian Qi, Yin Ruicheng, Lv Changze, Zheng Xiaoqing, Huang Xuanjing
- Rdrec: Rationale Distillation For Llm-based Recommendation Wang Xinfeng, Cui Jin, Suzuki Yoshimi, Fukumoto Fumiyo
- Executable Code Actions Elicit Better LLM Agents Wang Xingyao, Chen Yangyi, Yuan Lifan, Zhang Yizhe, Li Yunzhu, Peng Hao, Ji Heng
- Negativeprompt: Leveraging Psychology For Large Language Models Enhancement Via Negative Emotional Stimuli Wang Xu, Li Cheng, Chang Yi, Wang Jindong, Wu Yuan
- EAGER: Two-stream Generative Recommender With Behavior-semantic Collaboration Wang Ye, Xun Jiahao, Hong Minjie, Zhu Jieming, Jin Tao, Lin Wang, Li Haoyuan, Li Linjun, Xia Yan, Zhao Zhou, Dong Zhenhua
- Unveiling Factual Recall Behaviors Of Large Language Models Through Knowledge Neurons Wang Yifei, Chen Yuheng, Wen Wanting, Sheng Yu, Li Linjing, Zeng Daniel Dajun
- Uncertainty Aware Learning For Language Model Alignment Wang Yikun, Zheng Rui, Ding Liang, Zhang Qi, Lin Dahua, Tao Dacheng
- Speak It Out: Solving Symbol-related Problems With Symbol-to-language Conversion For Language Models Wang Yile, Cheng Sijie, Sun Zixin, Li Peng, Liu Yang
- HPT++: Hierarchically Prompting Vision-language Models With Multi-granularity Knowledge Generation And Improved Structure Modeling Wang Yubin, Jiang Xinyang, Cheng De, Sun Wenli, Li Dongsheng, Zhao Cairong
- REAR: A Relevance-aware Retrieval-augmented Framework For Open-domain Question Answering Wang Yuhao, Ren Ruiyang, Li Junyi, Zhao Wayne Xin, Liu Jing, Wen Ji-rong
- Maferw: Query Rewriting With Multi-aspect Feedbacks For Retrieval-augmented Large Language Models Wang Yujing, Zhang Hainan, Pang Liang, Zheng Hongwei, Zheng Zhiming
- Sibyl: Simple Yet Effective Agent Framework For Complex Real-world Reasoning Wang Yulong, Shen Tianhao, Liu Lifeng, Xie Jian
- LSTP: Language-guided Spatial-temporal Prompt Learning For Long-form Video-text Understanding Wang Yuxuan, Wang Yueqian, Wu Pengfei, Liang Jianxin, Zhao Dongyan, Zheng Zilong
- To Recommend Or Not: Recommendability Identification In Conversations With Pre-trained Language Models Wang Zhefan, Ma Weizhi, Zhang Min
- M-RAG: Reinforcing Large Language Model Performance Through Retrieval-augmented Generation With Multiple Partitions Wang Zheng, Teo Shu Xian, Ouyang Jieer, Xu Yongjun, Shi Wei
- Chatlogic: Integrating Logic Programming With Large Language Models For Multi-step Reasoning Wang Zhongsheng, Liu Jiamou, Bao Qiming, Rong Hongfei, Zhang Jingfeng
- RAT: Retrieval Augmented Thoughts Elicit Context-aware Reasoning In Long-horizon Generation Wang Zihao, Liu Anji, Lin Haowei, Li Jiaqi, Ma Xiaojian, Liang Yitao
- Wordflow: Social Prompt Engineering For Large Language Models Wang Zijie J., Chakravarthy Aishwarya, Munechika David, Chau Duen Horng
- Farsight: Fostering Responsible AI Awareness During AI Application Prototyping Wang Zijie J., Kulkarni Chinmay, Wilcox Lauren, Terry Michael, Madaio Michael
- Speculative RAG: Enhancing Retrieval Augmented Generation Through Drafting Wang Zilong, Wang Zifeng, Le Long, Zheng Huaixiu Steven, Mishra Swaroop, Perot Vincent, Zhang Yuwei, Mattapalli Anush, Taly Ankur, Shang Jingbo, Lee Chen-yu, Pfister Tomas
- Eliminating Position Bias Of Language Models: A Mechanistic Approach Wang Ziqi, Zhang Hanlin, Li Xiner, Huang Kuan-hao, Han Chi, Ji Shuiwang, Kakade Sham M., Peng Hao, Ji Heng
- Poisoned Langchain: Jailbreak Llms By Langchain Wang Ziqiu, Liu Jun, Zhang Shengkai, Yang Yang
- Browse And Concentrate: Comprehending Multimodal Content Via Prior-llm Context Fusion Wang Ziyue, Chen Chi, Zhu Yiqi, Luo Fuwen, Li Peng, Yan Ming, Zhang Ji, Huang Fei, Sun Maosong, Liu Yang
- Large Language User Interfaces: Voice Interactive User Interfaces Powered By Llms Wasti Syed Mekael, Pu Ken Q., Neshati Ali
- Hidden In Plain Sight: Exploring Chat History Tampering In Interactive Language Models Wei Cheng'an, Zhao Yue, Gong Yujia, Chen Kai, Xiang Lu, Zhu Shenchen
- Measuring And Reducing LLM Hallucination Without Gold-standard Answers Wei Jiaheng, Yao Yuanshun, Ton Jean-francois, Guo Hongyi, Estornell Andrew, Liu Yang
- Skywork-moe: A Deep Dive Into Training Techniques For Mixture-of-experts Language Models Wei Tianwen, Zhu Bo, Zhao Liang, Cheng Cheng, Li Biye, Lü Weiwei, Cheng Peng, Zhang Jianhao, Zhang Xiaoyu, Zeng Liang, Wang Xiaokun, Ma Yutuan, Hu Rui, Yan Shuicheng, Fang Han, Zhou Yahui
- Instructrag: Instructing Retrieval-augmented Generation Via Self-synthesized Rationales Wei Zhepei, Chen Wei-lin, Meng Yu
- Leveraging Large Language Models For Patient Engagement: The Power Of Conversational AI In Digital Health Wen Bo, Norel Raquel, Liu Julia, Stappenbeck Thaddeus, Zulkernine Farhana, Chen Huamin
- Grounding Data Science Code Generation With Input-output Specifications Wen Yeming, Yin Pengcheng, Shi Kensen, Michalewski Henryk, Chaudhuri Swarat, Polozov Alex
- How Chinese Are Chinese Language Models? The Puzzling Lack Of Language Policy In China's Llms Wen-yi Andrea W, Jo Unso Eun Seo, Lin Lu Jia, Mimno David
- CBR-RAG: Case-based Reasoning For Retrieval Augmented Generation In Llms For Legal Question Answering Wiratunga Nirmalie, Abeyratne Ramitha, Jayawardena Lasal, Martin Kyle, Massie Stewart, Nkisi-orji Ikechukwu, Weerasinghe Ruvan, Liret Anne, Fleisch Bruno
- Effects Of A Prompt Engineering Intervention On Undergraduate Students' AI Self-efficacy, AI Knowledge And Prompt Engineering Ability: A Mixed Methods Study Woo David James, Wang Deliang, Yung Tim, Guo Kai
- Curriculum Learning With Quality-driven Data Selection Wu Biao, Meng Fang, Chen Ling
- Understanding The Role Of User Profile In The Personalization Of Large Language Models Wu Bin, Shi Zhengyan, Rahmani Hossein A., Ramineni Varsha, Yilmaz Emine
- Transferable Speech-to-text Large Language Model Alignment Module Wu Boyong, Yan Chao, Pu Haoran
- Sparkra: A Retrieval-augmented Knowledge Service System Based On Spark Large Language Model Wu Dayong, Li Jiaqi, Wang Baoxin, Zhao Honghong, Xue Siyuan, Yang Yanjie, Chang Zhijun, Zhang Rui, Qian Li, Wang Bo, Wang Shijin, Zhang Zhixiong, Hu Guoping
- Synthetic Multimodal Question Generation Wu Ian, Jayanthi Sravan, Viswanathan Vijay, Rosenberg Simon, Pakazad Sina, Wu Tongshuang, Neubig Graham
- Gendec: A Robust Generative Question-decomposition Method For Multi-hop Reasoning Wu Jian, Yang Linyi, Ji Yuliang, Huang Wenhao, Karlsson Börje F., Okumura Manabu
- RLPF: Reinforcement Learning From Prediction Feedback For User Summarization With Llms Wu Jiaxing, Ning Lin, Liu Luyang, Lee Harrison, Wu Neo, Wang Chao, Prakash Sushant, O'banion Shawn, Green Bradley, Xie Jun
- Pandora's Box Or Aladdin's Lamp: A Comprehensive Analysis Revealing The Role Of RAG Noise In Large Language Models Wu Jinyang, Che Feihu, Zhang Chuyuan, Tao Jianhua, Zhang Shuai, Shao Pengpeng
- Can Large Language Models Understand Uncommon Meanings Of Common Words? Wu Jinyang, Che Feihu, Zheng Xinxin, Zhang Shuai, Jin Ruihan, Nie Shuai, Shao Pengpeng, Tao Jianhua
- Clasheval: Quantifying The Tug-of-war Between An Llm's Internal Prior And External Evidence Wu Kevin, Wu Eric, Zou James
- Adversarial Databases Improve Success In Retrieval-based Large Language Models Wu Sean, Koo Michael, Kao Li Yo, Black Andy, Blum Lesley, Scalzo Fabien, Kurtz Ira
- Automated Review Generation Method Based On Large Language Models Wu Shican, Ma Xiao, Luo Dehui, Li Lulu, Shi Xiangcheng, Chang Xin, Lin Xiaoyun, Luo Ran, Pei Chunlei, Zhao Zhi-jian, Gong Jinlong
- Avatar: Optimizing LLM Agents For Tool-assisted Knowledge Retrieval Wu Shirley, Zhao Shiyu, Huang Qian, Huang Kexin, Yasunaga Michihiro, Cao Kaidi, Ioannidis Vassilis N., Subbian Karthik, Leskovec Jure, Zou James
- Videollm-mod: Efficient Video-language Streaming With Mixture-of-depths Vision Computation Wu Shiwei, Chen Joya, Lin Kevin Qinghong, Wang Qimeng, Gao Yan, Xu Qianli, Xu Tong, Hu Yao, Chen Enhong, Shou Mike Zheng
- How Easily Do Irrelevant Inputs Skew The Responses Of Large Language Models? Wu Siye, Xie Jian, Chen Jiangjie, Zhu Tinghui, Zhang Kai, Xiao Yanghua
- Cognitive Llms: Towards Integrating Cognitive Architectures And Large Language Models For Manufacturing Decision-making Wu Siyu, Oltramari Alessandro, Francis Jonathan, Giles C. Lee, Ritter Frank E.
- Unigen: A Unified Framework For Textual Dataset Generation Using Large Language Models Wu Siyuan, Huang Yue, Gao Chujie, Chen Dongping, Zhang Qihui, Wan Yao, Zhou Tianyi, Zhang Xiangliang, Gao Jianfeng, Xiao Chaowei, Sun Lichao
- Never Miss A Beat: An Efficient Recipe For Context Window Extension Of Large Language Models With Consistent "middle" Enhancement Wu Tong, Zhao Yanpeng, Zheng Zilong
- Continual Learning For Large Language Models: A Survey Wu Tongtong, Luo Linhao, Li Yuan-fang, Pan Shirui, Vu Thuy-trang, Haffari Gholamreza
- Freeva: Offline MLLM As Training-free Video Assistant Wu Wenhao
- Enhancing Q-learning With Large Language Model Heuristics Wu Xiefeng
- Passage-specific Prompt Tuning For Passage Reranking In Question Answering With Large Language Models Wu Xuyang, Peng Zhiyuan, Sai Krishna Sravanthi Rajanala, Wu Hsin-tai, Fang Yi
- Mitigating Misleading Chain-of-thought Reasoning With Selective Filtering Wu Yexin, Zhang Zhuosheng, Zhao Hai
- MLDT: Multi-level Decomposition For Complex Long-horizon Robotic Task Planning With Open-source Large Language Model Wu Yike, Zhang Jiatao, Hu Nan, Tang Lanling, Qi Guilin, Shao Jun, Ren Jie, Song Wei
- Difflora: Generating Personalized Low-rank Adaptation Weights With Diffusion Wu Yujia, Shi Yiming, Wei Jiwei, Sun Chengwei, Zhou Yuyang, Yang Yang, Shen Heng Tao
- Large Language Models Can Self-correct With Minimal Effort Wu Zhenyu, Zeng Qingkai, Zhang Zhihan, Tan Zhaoxuan, Shen Chao, Jiang Meng
- Acquiring Clean Language Models From Backdoor Poisoned Datasets By Downscaling Frequency Space Wu Zongru, Zhang Zhuosheng, Cheng Pengzhou, Liu Gongshen
- Memocrs: Memory-enhanced Sequential Conversational Recommender Systems With Large Language Models Xi Yunjia, Liu Weiwen, Lin Jianghao, Chen Bo, Tang Ruiming, Zhang Weinan, Yu Yong
- Inverse-q*: Token Level Reinforcement Learning For Aligning Large Language Models Without Preference Data Xia Han, Gao Songyang, Ge Qiming, Xi Zhiheng, Zhang Qi, Huang Xuanjing
- LESS: Selecting Influential Data For Targeted Instruction Tuning Xia Mengzhou, Malladi Sadhika, Gururangan Suchin, Arora Sanjeev, Chen Danqi
- Improving Retrieval Augmented Language Model With Self-reasoning Xia Yuan, Zhou Jingbo, Shi Zhenhui, Chen Jun, Huang Haifeng
- Unified Hallucination Detection For Multimodal Large Language Models Chen Xiang, Wang Chenxi, Xue Yida, Zhang Ningyu, Yang Xiaoyan, Li Qiang, Shen Yue, Liang Lei, Gu Jinjie, Chen Huajun
- Badchain: Backdoor Chain-of-thought Prompting For Large Language Models Xiang Zhen, Jiang Fengqing, Xiong Zidi, Ramasubramanian Bhaskar, Poovendran Radha, Li Bo
- Clover-2: Accurate Inference For Regressive Lightweight Speculative Decoding Xiao Bin, Gui Lujun, Su Lei, Chen Weipeng
- Rar-b: Reasoning As Retrieval Benchmark Xiao Chenghao, Hudson G Thomas, Moubayed Noura Al
- Alleviating Hallucination In Large Vision-language Models With Active Retrieval Augmentation Qu Xiaoye, Chen Qiyuan, Wei Wei, Sun Jishuo, Dong Jianfeng
- Leveraging Human Revisions For Improving Text-to-layout Models Xie Amber, Cheng Chin-yi, Huang Forrest, Li Yang
- Emovit: Revolutionizing Emotion Insights With Visual Instruction Tuning Xie Hongxia, Peng Chu-jun, Tseng Yu-wen, Chen Hung-jen, Hsu Chan-feng, Shuai Hong-han, Cheng Wen-huang
- Me Llama: Foundation Large Language Models For Medical Applications Xie Qianqian, Chen Qingyu, Chen Aokun, Peng Cheng, Hu Yan, Lin Fongci, Peng Xueqing, Huang Jimin, Zhang Jeffrey, Keloth Vipina, Zhou Xinyu, He Huan, Ohno-machado Lucila, Wu Yonghui, Xu Hua, Bian Jiang
- Recall: Membership Inference Via Relative Conditional Log-likelihoods Xie Roy, Wang Junlin, Huang Ruomin, Zhang Minxing, Ge Rong, Pei Jian, Gong Neil Zhenqiang, Dhingra Bhuwan
- Exploratory Preference Optimization: Harnessing Implicit Q*-approximation For Sample-efficient RLHF Xie Tengyang, Foster Dylan J., Krishnamurthy Akshay, Rosset Corby, Awadallah Ahmed, Rakhlin Alexander
- Monte Carlo Tree Search Boosts Reasoning Via Iterative Preference Learning Xie Yuxi, Goyal Anirudh, Zheng Wenyue, Kan Min-yen, Lillicrap Timothy P., Kawaguchi Kenji, Shieh Michael
- LLMAEL: Large Language Models Are Good Context Augmenters For Entity Linking Xin Amy, Qi Yunjia, Yao Zijun, Zhu Fangwei, Zeng Kaisheng, Xu Bin, Hou Lei, Li Juanzi
- Llm-enhanced User-item Interactions: Leveraging Edge Information For Optimized Recommendations Wang Xinyuan, Wu Liang, Hong Liangjie, Liu Hao, Fu Yanjie
- Benchmarking Retrieval-augmented Generation For Medicine Xiong Guangzhi, Jin Qiao, Lu Zhiyong, Zhang Aidong
- Improving Retrieval-augmented Generation In Medicine With Iterative Follow-up Questions Xiong Guangzhi, Jin Qiao, Wang Xiao, Zhang Minjia, Lu Zhiyong, Zhang Aidong
- Interactive-kbqa: Multi-turn Interactions For Knowledge Base Question Answering With Large Language Models Xiong Guanming, Bao Junwei, Zhao Wen
- Converging Paradigms: The Synergy Of Symbolic And Connectionist AI In Llm-empowered Autonomous Agents Xiong Haoyi, Wang Zhiyuan, Li Xuhong, Bian Jiang, Xie Zeke, Mumtaz Shahid, Barnes Laura E.
- Meaningful Learning: Advancing Abstract Reasoning In Large Language Models Via Generic Fact Guidance Xiong Kai, Ding Xiao, Liu Ting, Qin Bing, Xu Dongliang, Yang Qing, Liu Hongtao, Cao Yixin
- Building Math Agents With Multi-turn Iterative Preference Learning Xiong Wei, Shi Chengshuai, Shen Jiaming, Rosenberg Aviv, Qin Zhen, Calandriello Daniele, Khalman Misha, Joshi Rishabh, Piot Bilal, Saleh Mohammad, Jin Chi, Zhang Tong, Liu Tianqi
- From Artificial Needles To Real Haystacks: Improving Retrieval Capabilities In Llms By Finetuning On Synthetic Data Xiong Zheyang, Papageorgiou Vasilis, Lee Kangwook, Papailiopoulos Dimitris
- Mixture-of-instructions: Comprehensive Alignment Of A Large Language Model Through The Mixture Of Diverse System Prompting Instructions Xu Bowen, Wu Shaoyu, Liu Kai, Hu Lulu
- Stylemaster: Towards Flexible Stylized Image Generation With Diffusion Models Xu Chengming, Hu Kai, Luo Donghao, Zhang Jiangning, Li Wei, Ge Yanhao, Wang Chengjie
- Enhancing Retrieval-augmented Lms With A Two-stage Consistency Learning Compressor Xu Chuankai, Zhao Dongming, Wang Bo, Xing Hanwen
- Data Efficient Evaluation Of Large Language Models And Text-to-image Models Via Adaptive Sampling Xu Cong, Saranathan Gayathri, Alam Mahammad Parwez, Shah Arpit, Lim James, Wong Soon Yee, Foltin Martin, Bhattacharya Suparna
- Empowering 1000 Tokens/second On-device LLM Prefilling With Mllm-npu Xu Daliang, Zhang Hao, Yang Liming, Liu Ruiqi, Huang Gang, Xu Mengwei, Liu Xuanzhe
- Rejection Improves Reliability: Training Llms To Refuse Unknown Questions Using RL From Knowledge Feedback Xu Hongshen, Zhu Zichen, Zhang Situo, Ma Da, Fan Shuai, Chen Lu, Yu Kai
- Redagent: Red Teaming Large Language Models With Context-aware Autonomous Language Agent Xu Huiyu, Zhang Wenhui, Wang Zhibo, Xiao Feng, Zheng Rui, Feng Yunhe, Ba Zhongjie, Ren Kui
- Vtensor: Flexible Virtual Tensor Management For Efficient LLM Serving Xu Jiale, Zhang Rui, Guo Cong, Hu Weiming, Liu Zihan, Wu Feiyang, Feng Yu, Sun Shixuan, Shao Changxu, Guo Yuhong, Zhao Junping, Zhang Ke, Guo Minyi, Leng Jingwen
- Accelerating Large Language Model Training With Hybrid Gpu-based Compression Xu Lang, Anthony Quentin, Zhou Qinghua, Alnaasan Nawras, Gulhane Radha R., Shafi Aamir, Subramoni Hari, Panda Dhabaleswar K.
- Chatqa 2: Bridging The Gap To Proprietary Llms In Long Context And RAG Capabilities Xu Peng, Ping Wei, Wu Xianchao, Xu Chejian, Liu Zihan, Shoeybi Mohammad, Catanzaro Bryan
- MINDECHO: Role-playing Language Agents For Key Opinion Leaders Xu Rui, Lu Dakuan, Tan Xiaoyu, Wang Xintao, Yuan Siyu, Chen Jiangjie, Chu Wei, Xu Yinghui
- Character Is Destiny: Can Large Language Models Simulate Persona-driven Decisions In Role-playing? Xu Rui, Wang Xintao, Chen Jiangjie, Yuan Siyu, Yuan Xinfeng, Liang Jiaqing, Chen Zulong, Dong Xiaoqing, Xiao Yanghua
- Benchmarking Benchmark Leakage In Large Language Models Xu Ruijie, Wang Zengzhi, Fan Run-ze, Liu Pengfei
- Misconfidence-based Demonstration Selection For LLM In-context Learning Xu Shangqing, Zhang Chao
- Unveil The Duality Of Retrieval-augmented Generation: Theoretical Analysis And Practical Solution Xu Shicheng, Pang Liang, Shen Huawei, Cheng Xueqi
- Unsupervised Information Refinement Training Of Large Language Models For Retrieval-augmented Generation Xu Shicheng, Pang Liang, Yu Mo, Meng Fandong, Shen Huawei, Cheng Xueqi, Zhou Jie
- Is DPO Superior To PPO For LLM Alignment? A Comprehensive Study Xu Shusheng, Fu Wei, Gao Jiaxuan, Ye Wenjie, Liu Weilin, Mei Zhiyu, Wang Guangju, Yu Chao, Wu Yi
- Bridging The Gap Between Different Vocabularies For LLM Ensemble Xu Yangyifan, Lu Jinliang, Zhang Jiajun
- Generate-on-graph: Treat LLM As Both Agent And KG In Incomplete Knowledge Graph Question Answering Xu Yao, He Shizhu, Chen Jiabei, Wang Zihao, Song Yangqiu, Tong Hanghang, Liu Kang, Zhao Jun
- Think: Thinner Key Cache By Query-driven Pruning Xu Yuhui, Jie Zhanming, Dong Hanze, Wang Lei, Lu Xudong, Zhou Aojun, Saha Amrita, Xiong Caiming, Sahoo Doyen
- Face4rag: Factual Consistency Evaluation For Retrieval Augmented Generation In Chinese Xu Yunqi, Cai Tianchi, Jiang Jiyan, Song Xierui
- Bag Of Tricks: Benchmarking Of Jailbreak Attacks On Llms Xu Zhao, Liu Fan, Liu Hao
- Detectiveqa: Evaluating Long-context Reasoning On Detective Novels Xu Zhe, Ye Jiasheng, Liu Xiangyang, Sun Tianxiang, Liu Xiaoran, Guo Qipeng, Li Linlin, Liu Qun, Huang Xuanjing, Qiu Xipeng
- Activerag: Revealing The Treasures Of Knowledge Via Active Learning Xu Zhipeng, Liu Zhenghao, Liu Yibin, Xiong Chenyan, Yan Yukun, Wang Shuo, Yu Shi, Liu Zhiyuan, Yu Ge
- LEMMA: Towards Lvlm-enhanced Multimodal Misinformation Detection With External Knowledge Augmentation Xuan Keyang, Yi Li, Yang Fan, Wu Ruochen, Fung Yi R., Ji Heng
- Question Calibration And Multi-hop Modeling For Temporal Question Answering Xue Chao, Liang Di, Wang Pengfei, Zhang Jing
- Improving Audio Codec-based Zero-shot Text-to-speech Synthesis With Multi-modal Context And Large Language Model Xue Jinlong, Deng Yayue, Han Yicheng, Gao Yingming, Li Ya
- Multi-programming Language Ensemble For Code Generation In Large Language Model Xue Tengfei, Li Xuefeng, Azim Tahir, Smirnov Roman, Yu Jianhui, Sadrieh Arash, Pahlavan Babak
- Ninjallm: Fast, Scalable And Cost-effective RAG Using Amazon Sagemaker And AWS Trainium And Inferentia2 Xue Tengfei, Li Xuefeng, Smirnov Roman, Azim Tahir, Sadrieh Arash, Pahlavan Babak
- Mitigating LLM Hallucinations Via Conformal Abstention Yadkori Yasin Abbasi, Kuzborskij Ilja, Stutz David, György András, Fisch Adam, Doucet Arnaud, Beloshapka Iuliya, Weng Wei-hung, Yang Yao-yuan, Szepesvári Csaba, Cemgil Ali Taylan, Tomasev Nenad
- LTNER: Large Language Model Tagging For Named Entity Recognition With Contextualized Entity Marking Yan Faren, Yu Peng, Chen Xin
- Mirror: A Multiple-perspective Self-reflection Method For Knowledge-rich Reasoning Yan Hanqi, Zhu Qinglin, Wang Xinyu, Gui Lin, He Yulan
- Corrective Retrieval Augmented Generation Yan Shi-qi, Gu Jia-chen, Zhu Yun, Ling Zhen-hua
- Contrastive Instruction Tuning Yan Tianyi Lorena, Wang Fei, Huang James Y., Zhou Wenxuan, Yin Fan, Galstyan Aram, Yin Wenpeng, Chen Muhao
- Echosight: Advancing Visual-language Models With Wiki Knowledge Yan Yibin, Xie Weidi
- Advancing Genai Assisted Programming--a Comparative Study On Prompt Efficiency And Code Quality Between GPT-4 And GLM-4 Yang Angus, Li Zehan, Li Jie
- Crafting Customisable Characters With Llms: Introducing Simschat, A Persona-driven Role-playing Agent Framework Yang Bohao, Liu Dong, Tang Chen, Xiao Chenghao, Zhao Kun, Li Chao, Yuan Lin, Yang Guang, Huang Lanxiao, Lin Chenghua
- Embedded Translations For Low-resource Automated Glossing Yang Changbing, Nicolai Garrett, Silfverberg Miikka
- IM-RAG: Multi-round Retrieval-augmented Generation Through Learning Inner Monologues Yang Diji, Rao Jinmeng, Chen Kezhen, Guo Xiaoyuan, Zhang Yawen, Yang Jie, Zhang Yi
- The Geometry Of Queries: Query-based Innovations In Retrieval-augmented Generation Yang Eric, Amar Jonathan, Lee Jong Ha, Kumar Bhawesh, Jia Yugang
- Memory^3: Language Modeling With Explicit Memory Yang Hongkang, Lin Zehao, Wang Wenjin, Wu Hao, Li Zhiyu, Tang Bo, Wei Wenqiang, Wang Jinbo, Tang Zeyun, Song Shichao, Xi Chenyang, Yu Yu, Chen Kai, Xiong Feiyu, Tang Linpeng, E Weinan
- Aligning Llms Through Multi-perspective User Preference Ranking-based Feedback For Programming Question Answering Yang Hongyu, He Liyang, Hou Min, Shen Shuanghong, Li Rui, Hou Jiahui, Ma Jianhui, Zhao Junda
- Asynchronous And Segmented Bidirectional Encoding For NMT Yang Jingpu, Han Zehua, Xiang Mengyu, Wang Helin, Huang Yuxiao, Fang Miao
- Buffer Of Thoughts: Thought-augmented Reasoning With Large Language Models Yang Ling, Yu Zhaochen, Zhang Tianjun, Cao Shiyi, Xu Minkai, Zhang Wentao, Gonzalez Joseph E., Cui Bin
- Dual-phase Accelerated Prompt Optimization Yang Muchen, Li Moxin, Li Yongle, Chen Zijun, Gao Chongming, Zhang Junqi, Li Yangyang, Feng Fuli
- Confidence Calibration And Rationalization For Llms Via Multi-agent Deliberation Yang Ruixin, Rajagopal Dheeraj, Hayati Shirley Anugrah, Hu Bin, Kang Dongyeop
- Mllmreid: Multimodal Large Language Model-based Person Re-identification Yang Shan, Zhang Yongfei
- Video As The New Language For Real-world Decision Making Yang Sherry, Walker Jacob, Parker-holder Jack, Du Yilun, Bruce Jake, Barreto Andre, Abbeel Pieter, Schuurmans Dale
- Seed-story: Multimodal Long Story Generation With Large Language Model Yang Shuai, Ge Yuying, Li Yang, Chen Yukang, Ge Yixiao, Shan Ying, Chen Yingcong
- Is Crowdsourcing Breaking Your Bank? Cost-effective Fine-tuning Of Pre-trained Language Models With Proximal Policy Optimization Yang Shuo, Kasneci Gjergji
- Do Large Language Models Latently Perform Multi-hop Reasoning? Yang Sohee, Gribovskaya Elena, Kassner Nora, Geva Mor, Riedel Sebastian
- DGL: Dynamic Global-local Prompt Tuning For Text-video Retrieval Yang Xiangpeng, Zhu Linchao, Wang Xiaohan, Yang Yi
- CRAG -- Comprehensive RAG Benchmark Yang Xiao, Sun Kai, Xin Hao, Sun Yushi, Bhalla Nikita, Chen Xiangsen, Choudhary Sajal, Gui Rongze Daniel, Jiang Ziran Will, Jiang Ziyu, Kong Lingkun, Moran Brian, Wang Jiaqi, Xu Yifan Ethan, Yan An, Yang Chenyu, Yuan Eting, Zha Hanwen, Tang Nan, Chen Lei, Scheffer Nicolas, Liu Yue, Shah Nirav, Wanga Rakesh, Kumar Anuj, Yih Wen-tau, Dong Xin Luna
- Pre-trained Language Models Improve The Few-shot Prompt Ability Of Decision Transformer Yang Yu, Xu Pan
- Weak-to-strong Reasoning Yang Yuqing, Ma Yan, Liu Pengfei
- React Meets Actre: When Language Agents Enjoy Training Data Autonomy Yang Zonghan, Li Peng, Yan Ming, Zhang Ji, Huang Fei, Liu Yang
- Dense Connector For Mllms Yao Huanjin, Wu Wenhao, Yang Taojiannan, Song Yuxin, Zhang Mengxi, Feng Haocheng, Sun Yifan, Li Zhiheng, Ouyang Wanli, Wang Jingdong
- Cacheblend: Fast Large Language Model Serving For RAG With Cached Knowledge Fusion Yao Jiayi, Li Hanchen, Liu Yuhan, Ray Siddhant, Cheng Yihua, Zhang Qizheng, Du Kuntai, Lu Shan, Jiang Junchen
- CLAVE: An Adaptive Framework For Evaluating Values Of LLM Generated Responses Yao Jing, Yi Xiaoyuan, Xie Xing
- Open-domain Implicit Format Control For Large Language Model Generation Yao Yiqun, Ma Wenjia, Fang Xuezhi, Jiang Xin, Li Xiang, Meng Xuying, Han Peng, Li Jing, Sun Aixin, Wang Yequan
- Seakr: Self-aware Knowledge Retrieval For Adaptive Retrieval Augmented Generation Yao Zijun, Qi Weijian, Pan Liangming, Cao Shulin, Hu Linmei, Liu Weichuan, Hou Lei, Li Juanzi
- The Impact Of Quantization On Retrieval-augmented Generation: An Analysis Of Small Llms Yazan Mert, Verberne Suzan, Situmeang Frederik
- R^2AG: Incorporating Retrieval Information Into Retrieval Augmented Generation Ye Fuda, Li Shuangyin, Zhang Yongqi, Chen Lei
- Self-judge: Selective Instruction Following With Alignment Self-evaluation Ye Hai, Ng Hwee Tou
- Large Language Models As Hyper-heuristics For Combinatorial Optimization Ye Haoran, Wang Jiarui, Cao Zhiguang, Berto Federico, Hua Chuanbo, Kim Haeyeon, Park Jinkyoo, Song Guojie
- Toolsword: Unveiling Safety Issues Of Large Language Models In Tool Learning Across Three Stages Ye Junjie, Li Sixian, Li Guanyu, Huang Caishuang, Gao Songyang, Wu Yilong, Zhang Qi, Gui Tao, Huang Xuanjing
- Boosting Conversational Question Answering With Fine-grained Retrieval-augmentation And Self-check Ye Linhao, Lei Zhikai, Yin Jianghao, Chen Qin, Zhou Jie, He Liang
- Mm-spubench: Towards Better Understanding Of Spurious Biases In Multimodal Llms Ye Wenqian, Zheng Guangtao, Ma Yunsheng, Cao Xu, Lai Bolin, Rehg James M., Zhang Aidong
- Harnessing Multimodal Large Language Models For Multimodal Sequential Recommendation Ye Yuyang, Zheng Zhi, Shen Yishan, Wang Tianshu, Zhang Hengruo, Zhu Peijun, Yu Runlong, Zhang Kai, Xiong Hui
- Long-context Language Modeling With Parallel Context Encoding Yen Howard, Gao Tianyu, Chen Danqi
- Is The Pope Catholic? Yes, The Pope Is Catholic. Generative Evaluation Of Non-literal Intent Resolution In Llms Yerukola Akhila, Vaduguru Saujas, Fried Daniel, Sap Maarten
- Towards Fast Multilingual LLM Inference: Speculative Decoding And Specialized Drafters Yi Euiin, Kim Taehyeon, Jeung Hongseok, Chang Du-seong, Yun Se-young
- Do Large Language Model Understand Multi-intent Spoken Language? Yin Shangjian, Huang Peijie, Xu Yuhong, Huang Haojing, Chen Jiatian
- Mumath-code: Combining Tool-use Large Language Models With Multi-perspective Data Augmentation For Mathematical Reasoning Yin Shuo, You Weihao, Ji Zhilong, Zhong Guoqiang, Bai Jinfeng
- SEA: Supervised Embedding Alignment For Token-level Visual-textual Integration In Mllms Yin Yuanyang, Zhao Yaqi, Zhang Yajie, Lin Ke, Wang Jiahao, Tao Xin, Wan Pengfei, Zhang Di, Yin Baoqun, Zhang Wentao
- Relative Preference Optimization: Enhancing LLM Alignment Through Contrasting Responses Across Identical And Diverse Prompts Yin Yueqin, Wang Zhendong, Gu Yi, Huang Hai, Chen Weizhu, Zhou Mingyuan
- Llms-as-instructors: Learning From Errors Toward Automating Model Improvement Ying Jiahao, Lin Mingbao, Cao Yixin, Tang Wei, Wang Bo, Sun Qianru, Huang Xuanjing, Yan Shuicheng
- Exploring The Comprehension Of Chatgpt In Traditional Chinese Medicine Knowledge Yizhen Li, Shaohan Huang, Jiaxing Qi, Lei Quan, Dongran Han, Zhongzhi Luan
- TLCR: Token-level Continuous Reward For Fine-grained Reinforcement Learning From Human Feedback Yoon Eunseop, Yoon Hee Suk, Eom Soohwan, Han Gunsoo, Nam Daniel Wontae, Jo Daejin, On Kyoung-woon, Hasegawa-johnson Mark A., Kim Sungwoong, Yoo Chang D.
- C-TPT: Calibrated Test-time Prompt Tuning For Vision-language Models Via Text Feature Dispersion Yoon Hee Suk, Yoon Eunseop, Tee Joshua Tian Jin, Hasegawa-johnson Mark, Li Yingzhen, Yoo Chang D.
- By My Eyes: Grounding Multimodal Large Language Models With Sensor Data Via Visual Prompting Yoon Hyungjun, Tolera Biniyam Aschalew, Gong Taesik, Lee Kimin, Lee Sung-ju
- Making The Most Of Your Model: Methods For Finetuning And Applying Pretrained Transformers Yoshida Davis
- Shiftaddllm: Accelerating Pretrained Llms Via Post-training Multiplication-less Reparameterization You Haoran, Guo Yipin, Fu Yichao, Zhou Wei, Shi Huihong, Zhang Xiaofan, Kundu Souvik, Yazdanbakhsh Amir, Lin Yingyan Celine
- Ferret-ui: Grounded Mobile UI Understanding With Multimodal Llms You Keen, Zhang Haotian, Schoop Eldon, Weers Floris, Swearngin Amanda, Nichols Jeffrey, Yang Yinfei, Gan Zhe
- Llasmol: Advancing Large Language Models For Chemistry With A Large-scale, Comprehensive, High-quality Instruction Tuning Dataset Yu Botao, Baker Frazier N., Chen Ziqi, Ning Xia, Sun Huan
- Siam: Self-improving Code-assisted Mathematical Reasoning Of Large Language Models Yu Dian, Peng Baolin, Tian Ye, Song Linfeng, Mi Haitao, Yu Dong
- Enhancing Healthcare Through Large Language Models: A Study On Medical Question Answering Yu Haoran, Yu Chang, Wang Zihan, Zou Dongxian, Qin Hao
- In Defense Of RAG In The Era Of Long-context Language Models Yu Tan, Xu Anbang, Akkiraju Rama
- Truth-aware Context Selection: Mitigating Hallucinations Of Large Language Models Being Misled By Untruthful Contexts Yu Tian, Zhang Shaolei, Feng Yang
- Neeko: Leveraging Dynamic Lora For Efficient Multi-character Role-playing Agent Yu Xiaoyan, Luo Tongxu, Wei Yifan, Lei Fangyu, Huang Yiming, Peng Hao, Zhu Liehuang
- Rankrag: Unifying Context Ranking With Retrieval-augmented Generation In Llms Yu Yue, Ping Wei, Liu Zihan, Wang Boxin, You Jiaxuan, Zhang Chao, Shoeybi Mohammad, Catanzaro Bryan
- Unveiling And Harnessing Hidden Attention Sinks: Enhancing Large Language Models Without Training Through Attention Calibration Yu Zhongzhi, Wang Zheng, Fu Yonggan, Shi Huihong, Shaikh Khalid, Lin Yingyan Celine
- Beyond Under-alignment: Atomic Preference Enhanced Factuality Tuning For Large Language Models Yuan Hongbang, Chen Yubo, Cao Pengfei, Jin Zhuoran, Liu Kang, Zhao Jun
- Focused Large Language Models Are Stable Many-shot Learners Yuan Peiwen, Feng Shaoxiong, Li Yiwei, Wang Xinglin, Zhang Yueqi, Tan Chuyi, Pan Boyuan, Wang Heda, Hu Yao, Li Kan
- Why Lift So Heavy? Slimming Large Language Models By Cutting Off The Layers Yuan Shuzhou, Nie Ercong, Ma Bolei, Färber Michael
- R-judge: Benchmarking Safety Risk Awareness For LLM Agents Yuan Tongxin, He Zhiwei, Dong Lingzhong, Wang Yiming, Zhao Ruijie, Xia Tian, Xu Lizhen, Zhou Binglin, Li Fangqi, Zhang Zhuosheng, Wang Rui, Liu Gongshen
- A Hybrid RAG System With Comprehensive Enhancement On Complex Reasoning Yuan Ye, Liu Chengwu, Yuan Jingyang, Sun Gongbo, Li Siqi, Zhang Ming
- CO3: Low-resource Contrastive Co-training For Generative Conversational Query Rewrite Yuan Yifei, Shi Chen, Wang Runze, Chen Liyi, Hu Renjun, Zhang Zengming, Jiang Feijun, Lam Wai
- Do Large Language Models Understand Conversational Implicature -- A Case Study With A Chinese Sitcom Yue Shisen, Song Siyuan, Cheng Xinyuan, Hu Hai
- Web2code: A Large-scale Webpage-to-code Dataset And Evaluation Framework For Multimodal Llms Yun Sukmin, Lin Haokun, Thushara Rusiru, Bhat Mohammad Qazim, Wang Yongxin, Jiang Zutao, Deng Mingkai, Wang Jinhong, Tao Tianhua, Li Junbo, Li Haonan, Nakov Preslav, Baldwin Timothy, Liu Zhengzhong, Xing Eric P., Liang Xiaodan, Shen Zhiqiang
- Rec-gpt4v: Multimodal Recommendation With Large Vision-language Models Yuqing Liu, Yu Wang, Lichao Sun, Philip S. Yu
- Building Accurate Translation-tailored Llms With Language Aware Instruction Tuning Zan Changtong, Ding Liang, Shen Li, Zhen Yibing, Liu Weifeng, Tao Dacheng
- Let Storytelling Tell Vivid Stories: An Expressive And Fluent Multimodal Storyteller Zang Chuanqi, Tang Jiji, Zhang Rongsheng, Zhao Zeng, Lv Tangjie, Pei Mingtao, Liang Wei
- Automating Turkish Educational Quiz Generation Using Large Language Models Zeinalipour Kamyar, Keptiğ Yusuf Gökberk, Maggini Marco, Gori Marco
- Improving Logits-based Detector Without Logits From Black-box Llms Zeng Cong, Tang Shengkun, Yang Xianjun, Chen Yuanzhou, Sun Yiyou, Xu Zhiqiang, Li Yao, Chen Haifeng, Cheng Wei, Xu Dongkuan
- DELIA: Diversity-enhanced Learning For Instruction Adaptation In Large Language Models Zeng Yuanhao, Ren Fei, Zhou Xinpeng, Wang Yihang, Shao Yingxia
- Chimera: A Lossless Decoding Method For Accelerating Large Language Models Inference By Fusing All Tokens Zeng Ziqian, Yu Jiahong, Pang Qianshi, Wang Zihao, Zhuang Huiping, Shao Hongen, Zou Xiaofeng
- Personarag: Enhancing Retrieval-augmented Generation Systems With User-centric Agents Zerhoudi Saber, Granitzer Michael
- Online Self-preferring Language Models Zhai Yuanzhao, Zhang Zhuo, Xu Kele, Peng Hanyang, Yu Yue, Feng Dawei, Yang Cheng, Ding Bo, Wang Huaimin
- Skyeyegpt: Unifying Remote Sensing Vision-language Tasks Via Instruction Tuning With Large Language Model Zhan Yang, Xiong Zhitong, Yuan Yuan
- MC-GPT: Empowering Vision-and-language Navigation With Memory Map And Reasoning Chains Zhan Zhaohuan, Yu Lisha, Yu Sijie, Tan Guang
- Distilling Implicit Multimodal Knowledge Into Llms For Zero-resource Dialogue Generation Zhang Bo, Ma Hui, Ding Jian, Wang Jian, Xu Bo, Lin Hongfei
- Ts-align: A Teacher-student Collaborative Framework For Scalable Iterative Finetuning Of Large Language Models Zhang Chen, Tang Chengguang, Chong Dading, Shi Ke, Tang Guohua, Jiang Feng, Li Haizhou
- Sciglm: Training Scientific Language Models With Self-reflective Instruction Annotation And Tuning Zhang Dan, Hu Ziniu, Zhoubian Sining, Du Zhengxiao, Yang Kaiyu, Wang Zihan, Yue Yisong, Dong Yuxiao, Tang Jie
- Code Representation Learning At Scale Zhang Dejiao, Ahmad Wasi, Tan Ming, Ding Hantian, Nallapati Ramesh, Roth Dan, Ma Xiaofei, Xiang Bing
- Iterative Utility Judgment Framework Via Llms Inspired By Relevance In Philosophy Zhang Hengran, Bi Keping, Guo Jiafeng, Cheng Xueqi
- Are Large Language Models Good At Utility Judgments? Zhang Hengran, Zhang Ruqing, Guo Jiafeng, De Rijke Maarten, Fan Yixing, Cheng Xueqi
- Paying More Attention To Source Context: Mitigating Unfaithful Translations From Large Language Model Zhang Hongbin, Chen Kehai, Bai Xuefeng, Xiang Yang, Zhang Min
- Enja: Ensemble Jailbreak On Large Language Models Zhang Jiahao, Wang Zilong, Wang Ruofan, Ma Xingjun, Jiang Yu-gang
- Longcite: Enabling Llms To Generate Fine-grained Citations In Long-context QA Zhang Jiajie, Bai Yushi, Lv Xin, Gu Wanjun, Liu Danqing, Zou Minhao, Cao Shulin, Hou Lei, Dong Yuxiao, Feng Ling, Li Juanzi
- Agentohana: Design Unified Data And Training Pipeline For Effective Agent Learning Zhang Jianguo, Lan Tian, Murthy Rithesh, Liu Zhiwei, Yao Weiran, Tan Juntao, Hoang Thai, Yang Liangwei, Feng Yihao, Liu Zuxin, Awalgaonkar Tulika, Niebles Juan Carlos, Savarese Silvio, Heinecke Shelby, Wang Huan, Xiong Caiming
- Knowhalu: Hallucination Detection Via Multi-form Knowledge Based Factual Checking Zhang Jiawei, Xu Chejian, Gai Yu, Lecue Freddy, Song Dawn, Li Bo
- Verifiable By Design: Aligning Language Models To Quote From Pre-training Data Zhang Jingyu, Marone Marc, Li Tianjian, Van Durme Benjamin, Khashabi Daniel
- Onegen: Efficient One-pass Unified Generation And Retrieval For Llms Zhang Jintian, Peng Cheng, Sun Mengshu, Chen Xiang, Liang Lei, Zhang Zhiqiang, Zhou Jun, Chen Huajun, Zhang Ningyu
- Read And Think: An Efficient Step-wise Multimodal Language Model For Document Understanding And Reasoning Zhang Jinxu
- Exploring The Best Practices Of Query Expansion With Large Language Models Zhang Le, Wu Yihong, Yang Qian, Nie Jian-yun
- Harmonized Speculative Sampling Zhang Lefan, Wang Xiaodan, Huang Yanhua, Xu Ruiwen
- ARL2: Aligning Retrievers For Black-box Large Language Models Via Self-guided Adaptive Relevance Labeling Zhang Lingxi, Yu Yue, Wang Kuan, Zhang Chao
- HARE: Human Priors, A Key To Small Language Model Efficiency Zhang Lingyun, Jin Bin, Ge Gaojian, Liu Lunhui, Shen Xuewen, Wu Mingyong, Zhang Houqian, Jiang Yongneng, Chen Shiqi, Pu Shi
- Attention Instruction: Amplifying Attention In The Middle Via Prompting Zhang Meiru, Meng Zaiqiao, Collier Nigel
- Automated Multi-level Preference For Mllms Zhang Mengxi, Wu Wenhao, Lu Yu, Song Yuxin, Rong Kang, Yao Huanjin, Zhao Jianbo, Liu Fanglong, Sun Yifan, Feng Haocheng, Wang Jingdong
- Transfertod: A Generalizable Chinese Multi-domain Task-oriented Dialogue System With Transfer Capabilities Zhang Ming, Huang Caishuang, Wu Yilong, Liu Shichun, Zheng Huiyuan, Dong Yurui, Shen Yujiong, Dou Shihan, Zhao Jun, Ye Junjie, Zhang Qi, Gui Tao, Huang Xuanjing
- Instructedit: Instruction-based Knowledge Editing For Large Language Models Zhang Ningyu, Tian Bozhong, Cheng Siyuan, Liang Xiaozhuan, Hu Yi, Xue Kouying, Gou Yanjie, Chen Xi, Chen Huajun
- Tinyllama: An Open-source Small Language Model Zhang Peiyuan, Zeng Guangtao, Wang Tianduo, Lu Wei
- Look Ahead Or Look Around? A Theoretical Comparison Between Autoregressive And Masked Pretraining Zhang Qi, Du Tianqi, Huang Haotian, Wang Yifei, Wang Yisen
- RECOST: External Knowledge Guided Data-efficient Instruction Tuning Zhang Qi, Zhang Yiming, Wang Haobo, Zhao Junbo
- VAR-CLIP: Text-to-image Generator With Visual Auto-regressive Modeling Zhang Qian, Dai Xiangzi, Yang Ninghua, An Xiang, Feng Ziyong, Ren Xingyu
- Collaborative Performance Prediction For Large Language Models Zhang Qiyuan, Lyu Fuyuan, Liu Xue, Ma Chen
- Plad: Preference-based Large Language Model Distillation With Pseudo-preference Pairs Zhang Rongzhi, Shen Jiaming, Liu Tianqi, Wang Haorui, Qin Zhen, Han Feng, Liu Jialu, Baumgartner Simon, Bendersky Michael, Zhang Chao
- Offline Training Of Language Model Agents With Functions As Learnable Weights Zhang Shaokun, Zhang Jieyu, Liu Jiale, Song Linxin, Wang Chi, Krishna Ranjay, Wu Qingyun
- Truthx: Alleviating Hallucinations By Editing Large Language Models In Truthful Space Zhang Shaolei, Yu Tian, Feng Yang
- Getting More From Less: Large Language Models Are Good Spontaneous Multilingual Learners Zhang Shimao, Gao Changjiang, Zhu Wenhao, Chen Jiajun, Huang Xin, Han Xue, Feng Junlan, Deng Chao, Huang Shujian
- Cfbench: A Comprehensive Constraints-following Benchmark For Llms Zhang Tao, Shen Yanjun, Luo Wenjing, Zhang Yan, Liang Hao, Zhang Tao, Yang Fan, Lin Mingan, Qiao Yujing, Chen Weipeng, Cui Bin, Zhang Wentao, Zhou Zenan
- R4: Reinforced Retriever-reorder-responder For Retrieval-augmented Large Language Models Zhang Taolin, Li Dongyang, Chen Qizhou, Wang Chengyu, Huang Longtao, Xue Hui, He Xiaofeng, Huang Jun
- RAFT: Adapting Language Model To Domain Specific RAG Zhang Tianjun, Patil Shishir G., Jain Naman, Shen Sheng, Zaharia Matei, Stoica Ion, Gonzalez Joseph E.
- Leanquant: Accurate Large Language Model Quantization With Loss-error-aware Grid Zhang Tianyi, Shrivastava Anshumali
- Nomad-attention: Efficient LLM Inference On Cpus Through Multiply-add-free Attention Zhang Tianyi, Yi Jonah Wonkyu, Yao Bowen, Xu Zhaozhuo, Shrivastava Anshumali
- Revisiting OPRO: The Limitations Of Small-scale Llms As Optimizers Zhang Tuo, Yuan Jinyue, Avestimehr Salman
- Methodology Of Adapting Large English Language Models For Specific Cultural Contexts Zhang Wenjing, Xiao Siqi, Lei Xuejiao, Wang Ning, Zhang Huazheng, An Meijuan, Yang Bikun, Liu Zhaoxiang, Wang Kai, Lian Shiguo
- Seallms 3: Open Foundation And Chat Multilingual Large Language Models For Southeast Asian Languages Zhang Wenxuan, Chan Hou Pong, Zhao Yiran, Aljunied Mahani, Wang Jianyu, Liu Chaoqun, Deng Yue, Hu Zhiqiang, Xu Weiwen, Chia Yew Ken, Li Xin, Bing Lidong
- Dólares Or Dollars? Unraveling The Bilingual Prowess Of Financial Llms Between Spanish And English Zhang Xiao, Xiang Ruoyu, Yuan Chenhan, Feng Duanyu, Han Weiguang, Lopez-lira Alejandro, Liu Xiao-yang, Ananiadou Sophia, Peng Min, Huang Jimin, Xie Qianqian
- Event Temporal Relation Extraction Based On Retrieval-augmented On Llms Zhang Xiaobin, Zang Liangjun, Liu Qianwen, Wei Shuchong, Hu Songlin
- Large Language Models As Evaluators For Recommendation Explanations Zhang Xiaoyu, Li Yishan, Wang Jiayin, Sun Bowen, Ma Weizhi, Sun Peijie, Zhang Min
- Llms Instruct Llms: An Extraction And Editing Method Zhang Xin, Ju Tianjie, Liang Huijia, Fu Ying, Zhang Qin
- Chain Of Preference Optimization: Improving Chain-of-thought Reasoning In Llms Zhang Xuan, Du Chao, Pang Tianyu, Liu Qian, Gao Wei, Lin Min
- Reinforcement Retrieval Leveraging Fine-grained Feedback For Fact Checking News Claims With Black-box LLM Zhang Xuan, Gao Wei
- Glape: Gold Label-agnostic Prompt Evaluation And Optimization For Large Language Model Zhang Xuanchang, Zhang Zhuosheng, Zhao Hai
- RAGLAB: A Modular And Research-oriented Unified Framework For Retrieval-augmented Generation Zhang Xuanwang, Song Yunze, Wang Yidong, Tang Shuyun, Li Xinfeng, Zeng Zhengran, Wu Zhen, Ye Wei, Xu Wenyuan, Zhang Yue, Dai Xinyu, Zhang Shikun, Wen Qingsong
- Pushing The Limit Of LLM Capacity For Text Classification Zhang Yazhou, Wang Mengyao, Ren Chenyu, Li Qiuchi, Tiwari Prayag, Wang Benyou, Qin Jing
- Rankclip: Ranking-consistent Language-image Pretraining Zhang Yiming, Zhao Zhuokai, Chen Zhaorun, Feng Zhili, Ding Zenghui, Sun Yining
- How Far Are We From Intelligent Visual Deductive Reasoning? Zhang Yizhe, Bai He, Zhang Ruixiang, Gu Jiatao, Zhai Shuangfei, Susskind Josh, Jaitly Navdeep
- Leveraging Biases In Large Language Models: "bias-knn" For Effective Few-shot Learning Zhang Yong, Li Hanzhang, Li Zhitao, Cheng Ning, Li Ming, Xiao Jing, Wang Jianzong
- Mg-verilog: Multi-grained Dataset Towards Enhanced Llm-assisted Verilog Generation Zhang Yongan, Yu Zhongzhi, Fu Yonggan, Wan Cheng, Lin Yingyan Celine
- Worldqa: Multimodal World Knowledge In Videos Through Long-chain Reasoning Zhang Yuanhan, Zhang Kaichen, Li Bo, Pu Fanyi, Setiadharma Christopher Arif, Yang Jingkang, Liu Ziwei
- OT-VP: Optimal Transport-guided Visual Prompting For Test-time Adaptation Zhang Yunbei, Mehra Akshay, Hamm Jihun
- Small Language Models Need Strong Verifiers To Self-correct Reasoning Zhang Yunxiang, Khalifa Muhammad, Logeswaran Lajanugen, Kim Jaekyeom, Lee Moontae, Lee Honglak, Wang Lu
- Intention Analysis Makes Llms A Good Jailbreak Defender Zhang Yuqi, Ding Liang, Zhang Lefei, Tao Dacheng
- Understanding The Relationship Between Prompts And Response Uncertainty In Large Language Models Zhang Ze Yu, Verma Arun, Doshi-velez Finale, Low Bryan Kian Hsiang
- Quantized Side Tuning: Fast And Memory-efficient Tuning Of Quantized Large Language Models Zhang Zhengxin, Zhao Dan, Miao Xupeng, Oliaro Gabriele, Li Qing, Jiang Yong, Jia Zhihao
- Found In The Middle: How Language Models Use Long Contexts Better Via Plug-and-play Positional Encoding Zhang Zhenyu, Chen Runjin, Liu Shiwei, Yao Zhewei, Ruwase Olatunji, Chen Beidi, Wu Xiaoxia, Wang Zhangyang
- Glitchprober: Advancing Effective Detection And Mitigation Of Glitch Tokens In Large Language Models Zhang Zhibo, Bai Wuxia, Li Yuxi, Meng Mark Huasong, Wang Kailong, Shi Ling, Li Li, Wang Jun, Wang Haoyu
- LANE: Logic Alignment Of Non-tuning Large Language Models And Online Recommendation Systems For Explainable Reason Generation Zhao Hongke, Zheng Songming, Wu Likang, Yu Bowen, Wang Jing
- Revolutionizing Finance With Llms: An Overview Of Applications And Insights Zhao Huaqin, Liu Zhengliang, Wu Zihao, Li Yiwei, Yang Tianze, Shu Peng, Xu Shaochen, Dai Haixing, Zhao Lin, Mai Gengchen, Liu Ninghao, Liu Tianming
- Cpsyexam: A Chinese Benchmark For Evaluating Psychology Using Examinations Zhao Jiahao, Zhu Jingwei, Tan Minghuan, Yang Min, Yang Di, Zhang Chenhao, Ye Guancheng, Li Chengming, Hu Xiping
- Prefix Guidance: A Steering Wheel For Large Language Models To Defend Against Jailbreak Attacks Zhao Jiawei, Chen Kejiang, Yuan Xiaojian, Zhang Weiming
- Lora Land: 310 Fine-tuned Llms That Rival GPT-4, A Technical Report Zhao Justin, Wang Timothy, Abid Wael, Angus Geoffrey, Garg Arnav, Kinnison Jeffery, Sherstinsky Alex, Molino Piero, Addair Travis, Rishi Devvret
- SLIDE: A Framework Integrating Small And Large Language Models For Open-domain Dialogues Evaluation Zhao Kun, Yang Bohao, Tang Chen, Lin Chenghua, Zhan Liang
- Diffagent: Fast And Accurate Text-to-image API Selection With Large Language Model Zhao Lirui, Yang Yue, Zhang Kaipeng, Shao Wenqi, Zhang Yuxin, Qiao Yu, Luo Ping, Ji Rongrong
- Efficiently Training 7B LLM With 1 Million Sequence Length On 8 Gpus Zhao Pinxue, Zhang Hailin, Fu Fangcheng, Nie Xiaonan, Liu Qibin, Yang Fang, Peng Yuanbo, Jiao Dian, Li Shuaipeng, Xue Jinbao, Tao Yangyu, Cui Bin
- SYNTHEVAL: Hybrid Behavioral Testing Of NLP Models With Synthetic Checklists Zhao Raoyuan, Köksal Abdullatif, Liu Yihong, Weissweiler Leonie, Korhonen Anna, Schütze Hinrich
- Towards Comprehensive And Efficient Post Safety Alignment Of Large Language Models Via Safety Patching Zhao Weixiang, Hu Yulin, Li Zhuojun, Deng Yang, Zhao Yanyan, Qin Bing, Chua Tat-seng
- How Do Large Language Models Handle Multilingualism? Zhao Yiran, Zhang Wenxuan, Chen Guizhen, Kawaguchi Kenji, Bing Lidong
- An Empirical Study Of Retrieval Augmented Generation With Chain-of-thought Zhao Yuetong, Cao Hongyu, Zhao Xianyu, Ou Zhijian
- Codejudge-eval: Can Large Language Models Be Good Judges In Code Understanding? Zhao Yuwei, Luo Ziyang, Tian Yuchen, Lin Hongzhan, Yan Weixiang, Li Annan, Ma Jing
- Let Me Do It For You: Towards LLM Empowered Recommendation Via Tool Learning Zhao Yuyue, Wu Jiancan, Wang Xiang, Tang Wei, Wang Dingxian, De Rijke Maarten
- SWIFT: A Scalable Lightweight Infrastructure For Fine-tuning Zhao Yuze, Huang Jintao, Hu Jinghan, Wang Xingjun, Mao Yunlin, Zhang Daoze, Jiang Zeyinzi, Wu Zhikai, Ai Baole, Wang Ang, Zhou Wenmeng, Chen Yingda
- Retrieval-augmented Mixture Of Lora Experts For Uploadable Machine Learning Zhao Ziyu, Gan Leilei, Wang Guoyin, Hu Yuwei, Shen Tao, Yang Hongxia, Kuang Kun, Wu Fei
- HALC: Object Hallucination Reduction Via Adaptive Focal-contrast Decoding Zhaorun Chen, Zhuokai Zhao, Hongyin Luo, Huaxiu Yao, Bo Li, Jiawei Zhou
- Leveraging Large Language Models With Chain-of-thought And Prompt Engineering For Traffic Crash Severity Analysis And Inference Zhen Hao, Shi Yucheng, Huang Yongcan, Yang Jidong J., Liu Ninghao
- Gpt-4v(ision) Is A Generalist Web Agent, If Grounded Zheng Boyuan, Gou Boyu, Kil Jihyung, Sun Huan, Su Yu
- Large Language Models As Reliable Knowledge Bases? Zheng Danna, Lapata Mirella, Pan Jeff Z.
- Ali-agent: Assessing Llms' Alignment With Human Values Via Agent-based Evaluation Zheng Jingnan, Wang Han, Zhang An, Nguyen Tai D., Sun Jun, Chua Tat-seng
- Towards Lifelong Learning Of Large Language Models: A Survey Zheng Junhao, Qiu Shengjie, Shi Chengming, Ma Qianli
- Dreamlip: Language-image Pre-training With Long Captions Zheng Kecheng, Zhang Yifei, Wu Wei, Lu Fan, Ma Shuailei, Jin Xin, Chen Wei, Shen Yujun
- PAS: Data-efficient Plug-and-play Prompt Augmentation System Zheng Miao, Liang Hao, Yang Fan, Sun Haoze, Li Tianpeng, Xiong Lingchu, Zhang Yan, Wu Youzhen, Li Kun, Shen Yanjun, Lin Mingan, Zhang Tao, Dong Guosheng, Qiao Yujing, Fang Kun, Chen Weipeng, Cui Bin, Zhang Wentao, Zhou Zenan
- Kun: Answer Polishment For Chinese Self-alignment With Instruction Back-translation Zheng Tianyu, Guo Shuyue, Qu Xingwei, Guo Jiawei, Du Xinrun, Jia Qi, Lin Chenghua, Huang Wenhao, Fu Jie, Zhang Ge
- Opencodeinterpreter: Integrating Code Generation With Execution And Refinement Zheng Tianyu, Zhang Ge, Shen Tianhao, Liu Xueling, Lin Bill Yuchen, Fu Jie, Chen Wenhu, Yue Xiang
- Training Large Language Models For Reasoning Through Reverse Curriculum Reinforcement Learning Zhiheng Xi, Wenxiang Chen, Boyang Hong, Senjie Jin, Rui Zheng, Wei He, Yiwen Ding, Shichun Liu, Xin Guo, Junzhe Wang, Honglin Guo, Wei Shen, Xiaoran Fan, Yuhao Zhou, Shihan Dou, Xiao Wang, Xinbo Zhang, Peng Sun, Tao Gui, Qi Zhang, Xuanjing Huang
- Revisiting Knowledge Distillation For Autoregressive Language Models Zhong Qihuang, Ding Liang, Shen Li, Liu Juhua, Du Bo, Tao Dacheng
- Enhancing Multimodal Large Language Models With Multi-instance Visual Prompt Generator For Visual Representation Enrichment Zhong Wenliang, Wu Wenyi, Li Qi, Barton Rob, Du Boxin, Sam Shioulin, Bouyarmane Karim, Tutar Ismail, Huang Junzhou
- Harnessing Large Language Models As Post-hoc Correctors Zhong Zhiqiang, Zhou Kuangyu, Mottin Davide
- Sequence To Sequence Reward Modeling: Improving RLHF By Language Feedback Zhou Jiayi, Ji Jiaming, Dai Juntao, Yang Yaodong
- Leveraging Web-crawled Data For High-quality Fine-tuning Zhou Jing, Jiang Chenglin, Shen Wei, Zhou Xiao, He Xiaonan
- Enhancing The General Agent Capabilities Of Low-parameter Llms Through Tuning And Multi-branch Reasoning Zhou Qinhao, Zhang Zihan, Xiang Xiang, Wang Ke, Wu Yuchuan, Li Yongbin
- Evaluating The Smooth Control Of Attribute Intensity In Text Generation With Llms Zhou Shang, Yao Feng, Dong Chengyu, Wang Zihan, Shang Jingbo
- Cogmg: Collaborative Augmentation Between Large Language Model And Knowledge Graph Zhou Tong, Chen Yubo, Liu Kang, Zhao Jun
- Few-shot Adversarial Prompt Learning On Vision-language Models Zhou Yiwei, Xia Xiaobo, Lin Zhiwei, Han Bo, Liu Tongliang
- Calibrated Self-rewarding Vision Language Models Zhou Yiyang, Fan Zhiyuan, Cheng Dongjie, Yang Sihan, Chen Zhaorun, Cui Chenhang, Wang Xiyao, Li Yun, Zhang Linjun, Yao Huaxiu
- Defending Jailbreak Prompts Via In-context Adversarial Game Zhou Yujun, Han Yufei, Zhuang Haomin, Guo Kehan, Liang Zhenwen, Bao Hongyan, Zhang Xiangliang
- Metagpt: Merging Large Language Models Using Model Exclusive Task Arithmetic Zhou Yuyan, Song Liang, Wang Bingning, Chen Weipeng
- DETAIL: Task Demonstration Attribution For Interpretable In-context Learning Zhou Zijian, Lin Xiaoqiang, Xu Xinyi, Prakash Alok, Rus Daniela, Low Bryan Kian Hsiang
- Can Llms "reason" In Music? An Evaluation Of Llms' Capability Of Music Understanding And Generation Zhou Ziya, Wu Yuhang, Wu Zhiyue, Zhang Xinyue, Yuan Ruibin, Ma Yinghao, Wang Lu, Benetos Emmanouil, Xue Wei, Guo Yike
- FANNO: Augmenting High-quality Instruction Data With Open-sourced Llms Only Zhu He, Su Junyou, Lun Tianle, Tao Yicheng, Zhang Wenjia, Fan Zipei, Chen Guanhua
- Rageval: Scenario Specific RAG Evaluation Dataset Generation Framework Zhu Kunlun, Luo Yifan, Xu Dingling, Wang Ruobing, Yu Shi, Wang Shuo, Yan Yukun, Liu Zhenghao, Han Xu, Liu Zhiyuan, Sun Maosong
- A Llm-based Controllable, Scalable, Human-involved User Simulator Framework For Conversational Recommender Systems Zhu Lixi, Huang Xiaowen, Sang Jitao
- LIRE: Listwise Reward Enhancement For Preference Alignment Zhu Mingye, Liu Yi, Zhang Lei, Guo Junbo, Mao Zhendong
- Parameter-efficient Tuning Large Language Models For Graph Representation Learning Zhu Qi, Zheng Da, Song Xiang, Zhang Shichang, Jin Bowen, Sun Yizhou, Karypis George
- The Power Of Question Translation Training In Multilingual Reasoning: Broadened Scope And Deepened Insights Zhu Wenhao, Huang Shujian, Yuan Fei, Chen Cheng, Chen Jiajun, Birch Alexandra
- Question Translation Training For Better Multilingual Reasoning Zhu Wenhao, Huang Shujian, Yuan Fei, She Shuaijie, Chen Jiajun, Birch Alexandra
- Apt-pipe: A Prompt-tuning Tool For Social Data Annotation Using Chatgpt Zhu Yiming, Yin Zhizhuo, Tyson Gareth, Haq Ehsan-ul, Lee Lik-hang, Hui Pan
- Efficient Test-time Prompt Tuning For Vision-language Models Zhu Yuhan, Zhang Guozhen, Xu Chen, Shen Haocheng, Chen Xiaoxin, Wu Gangshan, Wang Limin
- Accelerating Inference Of Retrieval-augmented Generation Via Sparse Context Selection Zhu Yun, Gu Jia-chen, Sikora Caitlin, Ko Ho, Liu Yinxiao, Lin Chu-cheng, Shu Lei, Luo Liangchen, Meng Lei, Liu Bang, Chen Jindong
- One Token Can Help! Learning Scalable And Pluggable Virtual Tokens For Retrieval-augmented Large Language Models Zhu Yutao, Huang Zhaoheng, Dou Zhicheng, Wen Ji-rong
- Yulan: An Open-source Large Language Model Zhu Yutao, Zhou Kun, Mao Kelong, Chen Wentong, Sun Yiding, Chen Zhipeng, Cao Qian, Wu Yihan, Chen Yushuo, Wang Feng, Zhang Lei, Li Junyi, Wang Xiaolei, Wang Lei, Zhang Beichen, Dong Zican, Cheng Xiaoxue, Chen Yuhan, Tang Xinyu, Hou Yupeng, Ren Qiangqiang, Pang Xincheng, Xie Shufang, Zhao Wayne Xin, Dou Zhicheng, Mao Jiaxin, Lin Yankai, Song Ruihua, Xu Jun, Chen Xu, Yan Rui, Wei Zhewei, Hu Di, Huang Wenbing, Gao Ze-feng, Chen Yueguo, Lu Weizheng, Wen Ji-rong
- Murar: A Simple And Effective Multimodal Retrieval And Answer Refinement Framework For Multimodal Question Answering Zhu Zhengyuan, Lee Daniel, Zhang Hong, Harsha Sai Sree, Feujio Loic, Maharaj Akash, Li Yunyao
- Understanding Privacy Risks Of Embeddings Induced By Large Language Models Zhu Zhihao, Shao Ninglu, Lian Defu, Wu Chenwang, Liu Zheng, Yang Yi, Chen Enhong
- Halueval-wild: Evaluating Hallucinations Of Language Models In The Wild Zhu Zhiying, Yang Yiming, Sun Zhiqing
- Sketch Then Generate: Providing Incremental User Feedback And Guiding LLM Code Generation Through Language-oriented Code Sketches Chen Zhu-tian, Xiong Zeyu, Yao Xiaoshuo, Glassman Elena
- Structlm: Towards Building Generalist Models For Structured Knowledge Grounding Zhuang Alex, Zhang Ge, Zheng Tianyu, Du Xinrun, Wang Junjie, Ren Weiming, Huang Stephen W., Fu Jie, Yue Xiang, Chen Wenhu
- Bigcodebench: Benchmarking Code Generation With Diverse Function Calls And Complex Instructions Zhuo Terry Yue, Vu Minh Chien, Chim Jenny, Hu Han, Yu Wenhao, Widyasari Ratnadira, Yusuf Imam Nur Bani, Zhan Haolan, He Junda, Paul Indraneil, Brunner Simon, Gong Chen, Hoang Thong, Zebaze Armel Randy, Hong Xiaoheng, Li Wen-ding, Kaddour Jean, Xu Ming, Zhang Zhihan, Yadav Prateek, Jain Naman, Gu Alex, Cheng Zhoujun, Liu Jiawei, Liu Qian, Wang Zijian, Lo David, Hui Binyuan, Muennighoff Niklas, Fried Daniel, Du Xiaoning, De Vries Harm, Von Werra Leandro
- LLM4PM: A Case Study On Using Large Language Models For Process Modeling In Enterprise Organizations Ziche Clara, Apruzzese Giovanni
- Mmmmodal -- Multi-images Multi-audio Multi-turn Multi-modal Zolkepli Husein, Razak Aisyah, Adha Kamarul, Nazhan Ariff
- Multi-lingual Malaysian Embedding: Leveraging Large Language Models For Semantic Representations Zolkepli Husein, Razak Aisyah, Adha Kamarul, Nazhan Ariff
- VL-ICL Bench: The Devil In The Details Of Benchmarking Multimodal In-context Learning Zong Yongshuo, Bohdal Ondrej, Hospedales Timothy
- What Do Mllms Hear? Examining Reasoning With Text And Sound Components In Multimodal Large Language Models Çoban Enis Berk, Mandel Michael I., Devaney Johanna
- Enhancing Visual Question Answering Through Question-driven Image Captions As Prompts Özdemir Övgü, Akagündüz Erdem
🏷 Reinforcement Learning
- Multi-way, Multilingual Neural Machine Translation With A Shared Attention Mechanism Firat Orhan, Cho Kyunghyun, Bengio Yoshua
- Revisiting Visual Question Answering Baselines Jabri Allan, Joulin Armand, Van Der Maaten Laurens
- Google's Multilingual Neural Machine Translation System: Enabling Zero-shot Translation Johnson Melvin, Schuster Mike, Le Quoc V., Krikun Maxim, Wu Yonghui, Chen Zhifeng, Thorat Nikhil, Viégas Fernanda, Wattenberg Martin, Corrado Greg, Hughes Macduff, Dean Jeffrey
- Coherent Dialogue With Attention-based Language Models Mei Hongyuan, Bansal Mohit, Walter Matthew R.
- Generative Deep Neural Networks For Dialogue: A Short Review Serban Iulian Vlad, Lowe Ryan, Charlin Laurent, Pineau Joelle
- Are Emojis Predictable? Barbieri Francesco, Ballesteros Miguel, Saggion Horacio
- Lig-cristal System For The WMT17 Automatic Post-editing Task Berard Alexandre, Pietquin Olivier, Besacier Laurent
- Towards Neural Machine Translation With Latent Tree Attention Bradbury James, Socher Richard
- Robustfill: Neural Program Learning Under Noisy I/O Devlin Jacob, Uesato Jonathan, Bhupatiraju Surya, Singh Rishabh, Mohamed Abdel-rahman, Kohli Pushmeet
- Sockeye: A Toolkit For Neural Machine Translation Hieber Felix, Domhan Tobias, Denkowski Michael, Vilar David, Sokolov Artem, Clifton Ann, Post Matt
- Reinforced Mnemonic Reader For Machine Reading Comprehension Hu Minghao, Peng Yuxing, Huang Zhen, Qiu Xipeng, Wei Furu, Zhou Ming
- Enterprise To Computer: Star Trek Chatbot Jena Grishma, Vashisht Mansi, Basu Abheek, Ungar Lyle, Sedoc João
- Simple Recurrent Units For Highly Parallelizable Recurrence Lei Tao, Zhang Yu, Wang Sida I., Dai Hui, Artzi Yoav
- Task-driven Visual Saliency And Attention-based Visual Question Answering Lin Yuetan, Pang Zhangyang, Wang Donghui, Zhuang Yueting
- Best Of Both Worlds: Transferring Knowledge From Discriminative Learning To A Generative Visual Dialog Model Lu Jiasen, Kannan Anitha, Yang Jianwei, Parikh Devi, Batra Dhruv
- Parlai: A Dialog Research Software Platform Miller Alexander H., Feng Will, Fisch Adam, Lu Jiasen, Batra Dhruv, Bordes Antoine, Parikh Devi, Weston Jason
- Predicting Target Language CCG Supertags Improves Neural Machine Translation Nadejde Maria, Reddy Siva, Sennrich Rico, Dwojak Tomasz, Junczys-dowmunt Marcin, Koehn Philipp, Birch Alexandra
- Generating High-quality And Informative Conversation Responses With Sequence-to-sequence Models Shao Louis, Gouws Stephan, Britz Denny, Goldie Anna, Strope Brian, Kurzweil Ray
- An Empirical Analysis Of Multiple-turn Reasoning Strategies In Reading Comprehension Tasks Shen Yelong, Liu Xiaodong, Duh Kevin, Gao Jianfeng
- Tips And Tricks For Visual Question Answering: Learnings From The 2017 Challenge Teney Damien, Anderson Peter, He Xiaodong, Hengel Anton Van Den
- Are You Talking To Me? Reasoned Visual Dialog Generation Through Adversarial Learning Wu Qi, Wang Peng, Shen Chunhua, Reid Ian, Hengel Anton Van Den
- Neural Text Generation: A Practical Guide Xie Ziang
- Mojitalk: Generating Emotional Responses At Scale Zhou Xianda, Wang William Yang
- Code2seq: Generating Sequences From Structured Representations Of Code Alon Uri, Brody Shaked, Levy Omer, Yahav Eran
- Attention Based Natural Language Grounding By Navigating Virtual Environment B Akilesh, Sinha Abhishek, Sarkar Mausoom, Krishnamurthy Balaji
- The Best Of Both Worlds: Combining Recent Advances In Neural Machine Translation Chen Mia Xu, Firat Orhan, Bapna Ankur, Johnson Melvin, Macherey Wolfgang, Foster George, Jones Llion, Parmar Niki, Schuster Mike, Chen Zhifeng, Wu Yonghui, Hughes Macduff
- Achieving Fluency And Coherency In Task-oriented Dialog Gangadharaiah Rashmi, Narayanaswamy Balakrishnan, Elkan Charles
- Question-guided Hybrid Convolution For Visual Question Answering Gao Peng, Lu Pan, Li Hongsheng, Li Shuang, Li Yikang, Hoi Steven, Wang Xiaogang
- The Memad Submission To The WMT18 Multimodal Translation Task Grönroos Stig-arne, Huet Benoit, Kurimo Mikko, Laaksonen Jorma, Merialdo Bernard, Pham Phu, Sjöberg Mats, Sulubacak Umut, Tiedemann Jörg, Troncy Raphael, Vázquez Raúl
- Analysing The Potential Of Seq-to-seq Models For Incremental Interpretation In Task-oriented Dialogue Hupkes Dieuwke, Bouwmeester Sanne, Fernández Raquel
- Neural Machine Translation With Adequacy-oriented Learning Kong Xiang, Tu Zhaopeng, Shi Shuming, Hovy Eduard, Zhang Tong
- Answerer In Questioner's Mind: Information Theoretic Approach To Goal-oriented Visual Dialog Lee Sang-woo, Heo Yu-jung, Zhang Byoung-tak
- Polite Dialogue Generation Without Parallel Data Niu Tong, Bansal Mohit
- Recursive Visual Attention In Visual Dialog Niu Yulei, Zhang Hanwang, Zhang Manli, Zhang Jianhong, Lu Zhiwu, Wen Ji-rong
- Nlitrans At Semeval-2018 Task 12: Transfer Of Semantic Knowledge For Argument Comprehension Niven Tim, Kao Hung-yu
- Multimodal Explanations: Justifying Decisions And Pointing To The Evidence Park Dong Huk, Hendricks Lisa Anne, Akata Zeynep, Rohrbach Anna, Schiele Bernt, Darrell Trevor, Rohrbach Marcus
- Teaching Machines To Code: Neural Markup Generation With Visual Attention Singh Sumeet S.
- Adapting Visual Question Answering Models For Enhancing Multimodal Community Q&A Platforms Srivastava Avikalp, Liu Hsin Wen, Fujita Sumio
- Measuring Semantic Abstraction Of Multilingual NMT With Paraphrase Recognition And Generation Tasks Tiedemann Jörg, Scherrer Yves
- Can You Tell Me How To Get Past Sesame Street? Sentence-level Pretraining Beyond Language Modeling Wang Alex, Hula Jan, Xia Patrick, Pappagari Raghavendra, Mccoy R. Thomas, Patel Roma, Kim Najoung, Tenney Ian, Huang Yinghui, Yu Katherin, Jin Shuning, Chen Berlin, Van Durme Benjamin, Grave Edouard, Pavlick Ellie, Bowman Samuel R.
- A Task In A Suit And A Tie: Paraphrase Generation With Semantic Augmentation Wang Su, Gupta Rahul, Chang Nancy, Baldridge Jason
- The Fine Line Between Linguistic Generalization And Failure In Seq2seq-attention Models Weber Noah, Shekhar Leena, Balasubramanian Niranjan
- Medical Exam Question Answering With Large-scale Reading Comprehension Zhang Xiao, Wu Ji, He Zhiyang, Liu Xien, Su Ying
- A Visual Attention Grounding Neural Model For Multimodal Machine Translation Zhou Mingyang, Cheng Runxiang, Lee Yong Jae, Yu Zhou
- Fusion Of Detected Objects In Text For Visual Question Answering Alberti Chris, Ling Jeffrey, Collins Michael, Reitter David
- PLATO: Pre-trained Dialogue Generation Model With Discrete Latent Variable Bao Siqi, He Huang, Wang Fan, Wu Hua, Wang Haifeng
- Meta Answering For Machine Reading Borschinger Benjamin, Boyd-graber Jordan, Buck Christian, Bulian Jannis, Ciaramita Massimiliano, Huebscher Michelle Chen, Gajewski Wojciech, Kilcher Yannic, Nogueira Rodrigo, Saralegu Lierni Sestorain
- Language Features Matter: Effective Language Representations For Vision-language Tasks Burns Andrea, Tan Reuben, Saenko Kate, Sclaroff Stan, Plummer Bryan A.
- An Empirical Study Of Generation Order For Machine Translation Chan William, Stern Mitchell, Kiros Jamie, Uszkoreit Jakob
- Reinforcement Learning Based Graph-to-sequence Model For Natural Question Generation Chen Yu, Wu Lingfei, Zaki Mohammed J.
- Contrastive Multi-document Question Generation Cho Woon Sang, Zhang Yizhe, Rao Sudha, Celikyilmaz Asli, Xiong Chenyan, Gao Jianfeng, Wang Mengdi, Dolan Bill
- An Empirical Comparison On Imitation Learning And Reinforcement Learning For Paraphrase Generation Du Wanyu, Ji Yangfeng
- Evaluating The State-of-the-art Of End-to-end Natural Language Generation: The E2E NLG Challenge Dušek Ondřej, Novikova Jekaterina, Rieser Verena
- ELI5: Long Form Question Answering Fan Angela, Jernite Yacine, Perez Ethan, Grangier David, Weston Jason, Auli Michael
- Optibox: Breaking The Limits Of Proposals For Visual Grounding Fan Zicong, Meng Si Yi, Sigal Leonid, Little James J.
- Question-agnostic Attention For Visual Question Answering Farazi Moshiur R, Khan Salman H, Barnes Nick
- Generating Natural Language Explanations For Visual Question Answering Using Scene Graphs And Visual Attention Ghosh Shalini, Burachas Giedrius, Ray Arijit, Ziskind Avi
- Span Selection Pre-training For Question Answering Glass Michael, Gliozzo Alfio, Chakravarti Rishav, Ferritto Anthony, Pan Lin, Bhargav G P Shrivatsa, Garg Dinesh, Sil Avirup
- Insertion-based Decoding With Automatically Inferred Generation Order Gu Jiatao, Liu Qi, Cho Kyunghyun
- Episodic Memory Reader: Learning What To Remember For Question Answering From Streaming Data Han Moonsu, Kang Minki, Jung Hyunwoo, Hwang Sung Ju
- Convert: Efficient And Accurate Conversational Representations From Transformers Henderson Matthew, Casanueva Iñigo, Mrkšić Nikola, Su Pei-hao, Wen Tsung-hsien, Vulić Ivan
- Evaluating Rewards For Question Generation Models Hosking Tom, Riedel Sebastian
- A Corpus-free State2seq User Simulator For Task-oriented Dialogue Hou Yutai, Fang Meng, Che Wanxiang, Liu Ting
- Rap-net: Recurrent Attention Pooling Networks For Dialogue Response Selection Huang Chao-wei, Chiang Ting-rui, Su Shang-yu, Chen Yun-nung
- Large-scale Representation Learning From Visually Grounded Untranscribed Speech Ilharco Gabriel, Zhang Yuan, Baldridge Jason
- Amr-to-text Generation With Cache Transition Systems Jin Lisa, Gildea Daniel
- Bipar: A Bilingual Parallel Dataset For Multilingual And Cross-lingual Reading Comprehension On Novels Jing Yimin, Xiong Deyi, Zhen Yan
- CTRL: A Conditional Transformer Language Model For Controllable Generation Keskar Nitish Shirish, Mccann Bryan, Varshney Lav R., Xiong Caiming, Socher Richard
- Sample Efficient Text Summarization Using A Single Pre-trained Transformer Khandelwal Urvashi, Clark Kevin, Jurafsky Dan, Kaiser Lukasz
- Learning To Answer By Learning To Ask: Getting The Best Of GPT-2 And BERT Worlds Klein Tassilo, Nabi Moin
- Transcoding Compositionally: Using Attention To Find More Generalizable Solutions Korrel Kris, Hupkes Dieuwke, Dankers Verna, Bruni Elia
- Revealing The Dark Secrets Of BERT Kovaleva Olga, Romanov Alexey, Rogers Anna, Rumshisky Anna
- Investigating Multilingual NMT Representations At Scale Kudugunta Sneha Reddy, Bapna Ankur, Caswell Isaac, Arivazhagan Naveen, Firat Orhan
- Question Generation From Paragraphs: A Tale Of Two Hierarchical Models Kumar Vishwajeet, Chaki Raktim, Talluri Sai Teja, Ramakrishnan Ganesh, Li Yuan-fang, Haffari Gholamreza
- Patent Claim Generation By Fine-tuning Openai GPT-2 Lee Jieh-sheng, Hsiang Jieh
- BART: Denoising Sequence-to-sequence Pre-training For Natural Language Generation, Translation, And Comprehension Lewis Mike, Liu Yinhan, Goyal Naman, Ghazvininejad Marjan, Mohamed Abdelrahman, Levy Omer, Stoyanov Ves, Zettlemoyer Luke
- Incremental Transformer With Deliberation Decoder For Document Grounded Conversations Li Zekang, Niu Cheng, Meng Fandong, Feng Yang, Li Qian, Zhou Jie
- TAB-VCR: Tags And Attributes Based Visual Commonsense Reasoning Baselines Lin Jingxiang, Jain Unnat, Schwing Alexander G.
- Task-oriented Conversation Generation Using Heterogeneous Memory Networks Lin Zehao, Huang Xinjing, Ji Feng, Chen Haiqing, Zhang Ying
- Learning To Generate Questions By Learning What Not To Generate Liu Bang, Zhao Mingjun, Niu Di, Lai Kunfeng, He Yancheng, Wei Haojie, Xu Yu
- MKD: A Multi-task Knowledge Distillation Approach For Pretrained Language Models Liu Linqing, Wang Huan, Lin Jimmy, Socher Richard, Xiong Caiming
- Generative Question Refinement With Deep Reinforcement Learning In Retrieval-based QA System Liu Ye, Zhang Chenwei, Yan Xiaohui, Chang Yi, Yu Philip S.
- Roberta: A Robustly Optimized BERT Pretraining Approach Liu Yinhan, Ott Myle, Goyal Naman, Du Jingfei, Joshi Mandar, Chen Danqi, Levy Omer, Lewis Mike, Zettlemoyer Luke, Stoyanov Veselin
- Attention-informed Mixed-language Training For Zero-shot Cross-lingual Task-oriented Dialogue Systems Liu Zihan, Winata Genta Indra, Lin Zhaojiang, Xu Peng, Fung Pascale
- Multi-hop Reading Comprehension Via Deep Reinforcement Learning Based Document Traversal Long Alex, Mason Joel, Blair Alan, Wang Wei
- Good, Better, Best: Textual Distractors Generation For Multiple-choice Visual Question Answering Via Reinforcement Learning Lu Jiaying, Ye Xin, Ren Yi, Yang Yezhou
- Structured Pruning Of A Bert-based Question Answering Model Mccarley J. S., Chakravarti Rishav, Sil Avirup
- Let's Ask Again: Refine Network For Automatic Question Generation Nema Preksha, Mohankumar Akash Kumar, Khapra Mitesh M., Srinivasan Balaji Vasan, Ravindran Balaraman
- Visual Dialogue State Tracking For Question Generation Pang Wei, Wang Xiaojie
- AI Meets Austen: Towards Human-robot Discussions Of Literary Metaphor Parde Natalie, Nielsen Rodney D.
- Stabilizing Transformers For Reinforcement Learning Parisotto Emilio, Song H. Francis, Rae Jack W., Pascanu Razvan, Gulcehre Caglar, Jayakumar Siddhant M., Jaderberg Max, Kaufman Raphael Lopez, Clark Aidan, Noury Seb, Botvinick Matthew M., Heess Nicolas, Hadsell Raia
- Human-like Machine Thinking: Language Guided Imagination Qi Feng, Wu Wenchuan
- Answering Complex Open-domain Questions Through Iterative Query Generation Qi Peng, Lin Xiaowen, Mehr Leo, Wang Zijian, Manning Christopher D.
- Generating Highly Relevant Questions Qiu Jiazuo, Xiong Deyi
- Probing Natural Language Inference Models Through Semantic Fragments Richardson Kyle, Hu Hai, Moss Lawrence S., Sabharwal Ashish
- Is Multilingual BERT Fluent In Language Generation? Rönnqvist Samuel, Kanerva Jenna, Salakoski Tapio, Ginter Filip
- Hierarchical Reinforcement Learning For Open-domain Dialog Saleh Abdelrhman, Jaques Natasha, Ghandeharioun Asma, Shen Judy Hanwen, Picard Rosalind
- Zero: Memory Optimizations Toward Training Trillion Parameter Models Rajbhandari Samyam, Rasley Jeff, Ruwase Olatunji, He Yuxiong
- A Survey Of Natural Language Generation Techniques With A Focus On Dialogue Systems - Past, Present And Future Directions Santhanam Sashank, Shaikh Samira
- Generalization In Generation: A Closer Look At Exposure Bias Schmidt Florian
- Flexibly-structured Model For Task-oriented Dialogues Shu Lei, Molino Piero, Namazifar Mahdi, Xu Hu, Liu Bing, Zheng Huaixiu, Tur Gokhan
- MASS: Masked Sequence To Sequence Pre-training For Language Generation Song Kaitao, Tan Xu, Qin Tao, Lu Jianfeng, Liu Tie-yan
- Unpaired Cross-lingual Image Caption Generation With Self-supervised Rewards Song Yuqing, Chen Shizhe, Zhao Yida, Jin Qin
- Natural Language Generation Using Reinforcement Learning With External Rewards Srinivasan Vidhushini, Santhanam Sashank, Shaikh Samira
- On NMT Search Errors And Model Errors: Cat Got Your Tongue? Stahlberg Felix, Byrne Bill
- Sesamebert: Attention For Anywhere Su Ta-chun, Cheng Hsiang-chih
- VL-BERT: Pre-training Of Generic Visual-linguistic Representations Su Weijie, Zhu Xizhou, Cao Yue, Li Bin, Lu Lewei, Wei Furu, Dai Jifeng
- Syntax-infused Transformer And BERT Models For Machine Translation And Natural Language Understanding Sundararaman Dhanasekar, Subramanian Vivek, Wang Guoyin, Si Shijing, Shen Dinghan, Wang Dong, Carin Lawrence
- Tree-structured Semantic Encoder With Knowledge Sharing For Domain Adaptation In Natural Language Generation Tseng Bo-hsiang, Budzianowski Paweł, Wu Yen-chen, Gašić Milica
- Well-read Students Learn Better: On The Importance Of Pre-training Compact Models Turc Iulia, Chang Ming-wei, Lee Kenton, Toutanova Kristina
- How Does BERT Answer Questions? A Layer-wise Analysis Of Transformer Representations Van Aken Betty, Winter Benjamin, Löser Alexander, Gers Felix A.
- Quantity Doesn't Buy Quality Syntax With Neural Language Models Van Schijndel Marten, Mueller Aaron, Linzen Tal
- Visualizing Attention In Transformer-based Language Representation Models Vig Jesse
- Attention Is Not Not Explanation Wiegreffe Sarah, Pinter Yuval
- Huggingface's Transformers: State-of-the-art Natural Language Processing Wolf Thomas, Debut Lysandre, Sanh Victor, Chaumond Julien, Delangue Clement, Moi Anthony, Cistac Pierric, Rault Tim, Louf Rémi, Funtowicz Morgan, Davison Joe, Shleifer Sam, Von Platen Patrick, Ma Clara, Jernite Yacine, Plu Julien, Xu Canwen, Scao Teven Le, Gugger Sylvain, Drame Mariama, Lhoest Quentin, Rush Alexander M.
- Integrated Triaging For Fast Reading Comprehension Wu Felix, Li Boyi, Wang Lequn, Lao Ni, Blitzer John, Weinberger Kilian Q.
- Transformer-based Cascaded Multimodal Speech Translation Wu Zixiu, Caglayan Ozan, Ive Julia, Wang Josiah, Specia Lucia
- Review Conversational Reading Comprehension Xu Hu, Liu Bing, Shu Lei, Yu Philip S.
- Making History Matter: History-advantage Sequence Training For Visual Dialog Yang Tianhao, Zha Zheng-jun, Zhang Hanwang
- Knowledge-grounded Response Generation With Deep Attentional Latent-variable Model Ye Hao-tong, Lo Kai-ling, Su Shang-yu, Chen Yun-nung
- Q8BERT: Quantized 8bit BERT Zafrir Ofir, Boudoukh Guy, Izsak Peter, Wasserblat Moshe
- Context-aware Visual Policy Network For Fine-grained Image Captioning Zha Zheng-jun, Liu Daqing, Zhang Hanwang, Zhang Yongdong, Wu Feng
- Addressing Semantic Drift In Question Generation For Semi-supervised Question Answering Zhang Shiyue, Bansal Mohit
- Atom Responding Machine For Dialog Generation Zhou Ganbin, Luo Ping, Chen Jingwu, Lin Fen, Lin Leyu, He Qing
- Building Task-oriented Visual Dialog Systems Through Alternative Optimization Between Dialog Policy And Language Generation Zhou Mingyang, Arnold Josh, Yu Zhou
- Using Interlinear Glosses As Pivot In Low-resource Multilingual Machine Translation Zhou Zhong, Levin Lori, Mortensen David R., Waibel Alex
- Encoder-agnostic Adaptation For Conditional Language Generation Ziegler Zachary M., Melas-kyriazi Luke, Gehrmann Sebastian, Rush Alexander M.
- Accelerating Natural Language Understanding In Task-oriented Dialog Ahuja Ojas, Desai Shrey
- Segatron: Segment-aware Transformer For Language Modeling And Understanding Bai He, Shi Peng, Lin Jimmy, Xie Yuqing, Tan Luchen, Xiong Kun, Gao Wen, Li Ming
- The Elephant In The Interpretability Room: Why Use Attention As Explanation When We Have Saliency Methods? Bastings Jasmijn, Filippova Katja
- Longformer: The Long-document Transformer Beltagy Iz, Peters Matthew E., Cohan Arman
- DIET: Lightweight Language Understanding For Dialogue Systems Bunk Tanja, Varshneya Daksh, Vlasov Vladimir, Nichol Alan
- Adaptive Parameterization For Neural Dialogue Generation Cai Hengyi, Chen Hongshen, Zhang Cheng, Song Yonghao, Zhao Xiaofang, Yin Dawei
- Exploring Fluent Query Reformulations With Text-to-text Transformers And Reinforcement Learning Chen Jerry Zikun, Yu Shi, Wang Haoran
- Dipair: Fast And Accurate Distillation For Trillion-scale Text Matching And Pair Modeling Chen Jiecao, Yang Liu, Raman Karthik, Bendersky Michael, Yeh Jung-jung, Zhou Yun, Najork Marc, Cai Danyang, Emadzadeh Ehsan
- Logical Natural Language Generation From Open-domain Tables Chen Wenhu, Chen Jianshu, Su Yu, Chen Zhiyu, Wang William Yang
- Reasoning In Dialog: Improving Response Generation By Context Reading Comprehension Chen Xiuying, Cui Zhi, Zhang Jiayi, Wei Chen, Cui Jianwei, Wang Bin, Zhao Dongyan, Yan Rui
- MEANTIME: Mixture Of Attention Mechanisms With Multi-temporal Embeddings For Sequential Recommendation Cho Sung Min, Park Eunhyeok, Yoo Sungjoo
- Pymt5: Multi-mode Translation Of Natural Language And Python Code With Transformers Clement Colin B., Drain Dawn, Timcheck Jonathan, Svyatkovskiy Alexey, Sundaresan Neel
- Probing Emergent Semantics In Predictive Agents Via Question Answering Das Abhishek, Carnevale Federico, Merzic Hamza, Rimell Laura, Schneider Rosalia, Abramson Josh, Hung Alden, Ahuja Arun, Clark Stephen, Wayne Gregory, Hill Felix
- Gshard: Scaling Giant Models With Conditional Computation And Automatic Sharding Lepikhin Dmitry, Lee Hyoukjoong, Xu Yuanzhong, Chen Dehao, Firat Orhan, Huang Yanping, Krikun Maxim, Shazeer Noam, Chen Zhifeng
- Fine-tuning Pretrained Language Models: Weight Initializations, Data Orders, And Early Stopping Dodge Jesse, Ilharco Gabriel, Schwartz Roy, Farhadi Ali, Hajishirzi Hannaneh, Smith Noah
- The Unreasonable Volatility Of Neural Machine Translation Models Fadaee Marzieh, Monz Christof
- Addressing Some Limitations Of Transformers With Feedback Memory Fan Angela, Lavril Thibaut, Grave Edouard, Joulin Armand, Sukhbaatar Sainbayar
- Paragraph-level Commonsense Transformers With Recurrent Memory Gabriel Saadia, Bhagavatula Chandra, Shwartz Vered, Bras Ronan Le, Forbes Maxwell, Choi Yejin
- Robust Conversational AI With Grounded Text Generation Gao Jianfeng, Peng Baolin, Li Chunyuan, Li Jinchao, Shayandeh Shahin, Liden Lars, Shum Heung-yeung
- Recurrent Chunking Mechanisms For Long-text Machine Reading Comprehension Gong Hongyu, Shen Yelong, Yu Dian, Chen Jianshu, Yu Dong
- Measuring Systematic Generalization In Neural Proof Generation With Transformers Gontier Nicolas, Sinha Koustuv, Reddy Siva, Pal Christopher
- Contextualize Knowledge Bases With Transformer For End-to-end Task-oriented Dialogue Systems Gou Yanjie, Lei Yinjie, Liu Lingqiao, Dai Yong, Shen Chunxu
- A Simple Yet Strong Pipeline For Hotpotqa Groeneveld Dirk, Khot Tushar, Mausam, Sabharwal Ashish
- UNION: An Unreferenced Metric For Evaluating Open-ended Story Generation Guan Jian, Huang Minlie
- Exploring Explainable Selection To Control Abstractive Summarization Wang Haonan, Gao Yang, Bai Yu, Lapata Mirella, Huang Heyan
- Less Is More: Data-efficient Complex Question Answering Over Knowledge Bases Hua Yuncheng, Li Yuan-fang, Qi Guilin, Wu Wei, Zhang Jingyao, Qi Daiqing
- Probing Pretrained Language Models For Lexical Semantics Vulić Ivan, Ponti Edoardo Maria, Litschko Robert, Glavaš Goran, Korhonen Anna
- Are Natural Language Inference Models Imppressive? Learning Implicature And Presupposition Jeretic Paloma, Warstadt Alex, Bhooshan Suvrat, Williams Adina
- Meta-context Transformers For Domain-specific Response Generation Kar Debanjana, Samanta Suranjana, Azad Amar Prakash
- Reducing Language Biases In Visual Question Answering With Visually-grounded Question Encoder Kv Gouthaman, Mittal Anurag
- PONE: A Novel Automatic Evaluation Metric For Open-domain Generative Dialogue Systems Lan Tian, Mao Xian-ling, Wei Wei, Gao Xiaoyan, Huang Heyan
- Which Kind Is Better In Open-domain Multi-turn Dialog, Hierarchical Or Non-hierarchical Models? An Empirical Study Lan Tian, Mao Xian-ling, Wei Wei, Huang Heyan
- Common Sense Or World Knowledge? Investigating Adapter-based Knowledge Injection Into Pretrained Transformers Lauscher Anne, Majewska Olga, Ribeiro Leonardo F. R., Gurevych Iryna, Rozanov Nikolai, Glavaš Goran
- LIMEADE: From AI Explanations To Advice Taking Lee Benjamin Charles Germain, Downey Doug, Lo Kyle, Weld Daniel S.
- Dialogue-adaptive Language Model Pre-training From Quality Estimation Li Junlong, Zhang Zhuosheng, Zhao Hai
- Seqgensql -- A Robust Sequence Generation Model For Structured Query Language Li Ning, Keller Bethany, Butler Mark, Cer Daniel
- Neural Machine Translation With Joint Representation Li Yanyang, Wang Qiang, Xiao Tong, Liu Tongran, Zhu Jingbo
- Train Large, Then Compress: Rethinking Model Size For Efficient Training And Inference Of Transformers Li Zhuohan, Wallace Eric, Shen Sheng, Lin Kevin, Keutzer Kurt, Klein Dan, Gonzalez Joseph E.
- Probabilistically Masked Language Model Capable Of Autoregressive Generation In Arbitrary Word Order Liao Yi, Jiang Xin, Liu Qun
- Autoregressive Knowledge Distillation Through Imitation Learning Lin Alexander, Wohlwend Jeremy, Chen Howard, Lei Tao
- Improving Zero-shot Translation By Disentangling Positional Information Liu Danni, Niehues Jan, Cross James, Guzmán Francisco, Li Xian
- Understanding The Difficulty Of Training Transformers Liu Liyuan, Liu Xiaodong, Gao Jianfeng, Chen Weizhu, Han Jiawei
- Qiaoning At Semeval-2020 Task 4: Commonsense Validation And Explanation System Based On Ensemble Of Language Model Liu Pai
- KG-BART: Knowledge Graph-augmented BART For Generative Commonsense Reasoning Liu Ye, Wan Yao, He Lifang, Peng Hao, Yu Philip S.
- Incremental Processing In The Age Of Non-incremental Encoders: An Empirical Assessment Of Bidirectional Models For Incremental NLU Madureira Brielen, Schlangen David
- Delight: Deep And Light-weight Transformer Mehta Sachin, Ghazvininejad Marjan, Iyer Srinivasan, Zettlemoyer Luke, Hajishirzi Hannaneh
- Lambert: Language And Action Learning Using Multimodal BERT Miyazawa Kazuki, Aoki Tatsuya, Horii Takato, Nagai Takayuki
- On Incorporating Structural Information To Improve Dialogue Response Generation Moghe Nikita, Vijayan Priyesh, Ravindran Balaraman, Khapra Mitesh M.
- Avgout: A Simple Output-probability Measure To Eliminate Dull Responses Niu Tong, Bansal Mohit
- The Chess Transformer: Mastering Play Using Generative Language Models Noever David, Ciolino Matt, Kalin Josh
- Better Distractions: Transformer-based Distractor Generation And Multiple Choice Question Filtering Offerijns Jeroen, Verberne Suzan, Verhoef Tessa
- Alleviating The Burden Of Labeling: Sentence Generation By Attention Branch Encoder-decoder Network Ogura Tadashi, Magassouba Aly, Sugiura Komei, Hirakawa Tsubasa, Yamashita Takayoshi, Fujiyoshi Hironobu, Kawai Hisashi
- X-linear Attention Networks For Image Captioning Pan Yingwei, Yao Ting, Li Yehao, Mei Tao
- How To Evaluate Your Dialogue System: Probe Tasks As An Alternative For Token-level Evaluation Metrics Parthasarathi Prasanna, Pineau Joelle, Chandar Sarath
- Retrieval-augmented Generation For Knowledge-intensive NLP Tasks Lewis Patrick, Perez Ethan, Piktus Aleksandra, Petroni Fabio, Karpukhin Vladimir, Goyal Naman, Küttler Heinrich, Lewis Mike, Yih Wen-tau, Rocktäschel Tim, Riedel Sebastian, Kiela Douwe
- SOLOIST: Building Task Bots At Scale With Transfer Learning And Machine Teaching Peng Baolin, Li Chunyuan, Li Jinchao, Shayandeh Shahin, Liden Lars, Gao Jianfeng
- Adapterhub: A Framework For Adapting Transformers Pfeiffer Jonas, Rücklé Andreas, Poth Clifton, Kamath Aishwarya, Vulić Ivan, Ruder Sebastian, Cho Kyunghyun, Gurevych Iryna
- Intermediate-task Transfer Learning With Pretrained Models For Natural Language Understanding: When And Why Does It Work? Pruksachatkun Yada, Phang Jason, Liu Haokun, Htut Phu Mon, Zhang Xiaoyi, Pang Richard Yuanzhe, Vania Clara, Kann Katharina, Bowman Samuel R.
- BANG: Bridging Autoregressive And Non-autoregressive Generation With Large Scale Pretraining Qi Weizhen, Gong Yeyun, Jiao Jian, Yan Yu, Chen Weizhu, Liu Dayiheng, Tang Kewen, Li Houqiang, Chen Jiusheng, Zhang Ruofei, Zhou Ming, Duan Nan
- Nlpgym -- A Toolkit For Evaluating RL Agents On Natural Language Processing Tasks Ramamurthy Rajkumar, Sifa Rafet, Bauckhage Christian
- Exploring Recurrent, Memory And Attention Based Architectures For Scoring Interactional Aspects Of Human-machine Text Dialog Ramanarayanan Vikram, Mulholland Matthew, Ghosh Debanjan
- Personalized Query Rewriting In Conversational AI Agents Roshan-ghias Alireza, Mathialagan Clint Solomon, Ponnusamy Pragaash, Mathias Lambert, Guo Chenlei
- BLEURT: Learning Robust Metrics For Text Generation Sellam Thibault, Das Dipanjan, Parikh Ankur P.
- Reasoning Over History: Context Aware Visual Dialog Shah Muhammad A., Mehri Shikib, Srinivasan Tejas
- Data-efficient Methods For Dialogue Systems Shalyminov Igor
- Hybrid Generative-retrieval Transformers For Dialogue Domain Adaptation Shalyminov Igor, Sordoni Alessandro, Atkinson Adam, Schulz Hannes
- Lightpaff: A Two-stage Distillation Framework For Pre-training And Fine-tuning Song Kaitao, Sun Hao, Tan Xu, Qin Tao, Lu Jianfeng, Liu Hongzhi, Liu Tie-yan
- Discovering Useful Sentence Representations From Large Pretrained Language Models Subramani Nishant, Suresh Nivedita
- Multi-scale Transformer Language Models Subramanian Sandeep, Collobert Ronan, Ranzato Marc'aurelio, Boureau Y-lan
- Explain And Improve: Lrp-inference Fine-tuning For Image Captioning Models Sun Jiamei, Lapuschkin Sebastian, Samek Wojciech, Binder Alexander
- Intellicode Compose: Code Generation Using Transformer Svyatkovskiy Alexey, Deng Shao Kun, Fu Shengyu, Sundaresan Neel
- Proofwriter: Generating Implications, Proofs, And Abductive Statements Over Natural Language Tafjord Oyvind, Mishra Bhavana Dalvi, Clark Peter
- Let's Stop Incorrect Comparisons In End-to-end Relation Extraction! Taillé Bruno, Guigue Vincent, Scoutheeten Geoffrey, Gallinari Patrick
- Edgebert: Sentence-level Energy Optimizations For Latency-aware Multi-task NLP Inference Tambe Thierry, Hooper Coleman, Pentecost Lillian, Jia Tianyu, Yang En-yu, Donato Marco, Sanh Victor, Whatmough Paul N., Rush Alexander M., Brooks David, Wei Gu-yeon
- Vokenization: Improving Language Understanding With Contextualized, Visual-grounded Supervision Tan Hao, Bansal Mohit
- Interpretable Neural Computation For Real-world Compositional Visual Question Answering Tang Ruixue, Ma Chao
- The Language Interpretability Tool: Extensible, Interactive Visualizations And Analysis For NLP Models Tenney Ian, Wexler James, Bastings Jasmijn, Bolukbasi Tolga, Coenen Andy, Gehrmann Sebastian, Jiang Ellen, Pushkarna Mahima, Radebaugh Carey, Reif Emily, Yuan Ann
- Cycle Text-to-image GAN With BERT Tsue Trevor, Sen Samir, Li Jason
- An Empirical Study On Robustness To Spurious Correlations Using Pre-trained Language Models Tu Lifu, Lalwani Garima, Gella Spandana, He He
- On Optimal Transformer Depth For Low-resource Language Translation Van Biljon Elan, Pretorius Arnu, Kreutzer Julia
- VD-BERT: A Unified Vision And Dialog Transformer With BERT Wang Yue, Joty Shafiq, Lyu Michael R., King Irwin, Xiong Caiming, Hoi Steven C. H.
- TOD-BERT: Pre-trained Natural Language Understanding For Task-oriented Dialogue Wu Chien-sheng, Hoi Steven, Socher Richard, Xiong Caiming
- Context-guided BERT For Targeted Aspect-based Sentiment Analysis Wu Zhengxuan, Ong Desmond C.
- CG-BERT: Conditional Text Generation With BERT For Generalized Few-shot Intent Detection Xia Congying, Zhang Chenwei, Nguyen Hoang, Zhang Jiawei, Yu Philip
- Exploring Question-specific Rewards For Generating Deep Questions Xie Yuxi, Pan Liangming, Wang Dongzhe, Kan Min-yen, Feng Yansong
- MC-BERT: Efficient Language Pre-training Via A Meta Controller Xu Zhenhui, Gong Linyuan, Ke Guolin, He Di, Zheng Shuxin, Wang Liwei, Bian Jiang, Liu Tie-yan
- Off-policy Self-critical Training For Transformer In Visual Paragraph Generation Yan Shiyang, Hua Yang, Robertson Neil M.
- Assessing Phrasal Representation And Composition In Transformers Yu Lang, Ettinger Allyson
- Open-domain Dialogue Generation Based On Pre-trained Language Models Zeng Yan, Nie Jian-yun
- Improving Adversarial Text Generation By Modeling The Distant Future Zhang Ruiyi, Chen Changyou, Gan Zhe, Wang Wenlin, Shen Dinghan, Wang Guoyin, Wen Zheng, Carin Lawrence
- Learning To Compare For Better Training And Evaluation Of Open Domain Natural Language Generation Models Zhou Wangchunshu, Xu Ke
- DUMA: Reading Comprehension With Transposition Thinking Zhu Pengfei, Zhao Hai, Li Xiaoguang
- Towards Understanding Sample Variance In Visually Grounded Language Generation: Evaluations And Observations Zhu Wanrong, Wang Xin Eric, Narayana Pradyumna, Sone Kazoo, Basu Sugato, Wang William Yang
- Goal-directed Story Generation: Augmenting Generative Language Models With Reinforcement Learning Alabdulkarim Amal, Li Winston, Martin Lara J., Riedl Mark O.
- RAFT: A Real-world Few-shot Text Classification Benchmark Alex Neel, Lifland Eli, Tunstall Lewis, Thakur Abhishek, Maham Pegah, Riedel C. Jess, Hine Emmie, Ashurst Carolyn, Sedille Paul, Carlier Alexis, Noetel Michael, Stuhlmüller Andreas
- A General Language Assistant As A Laboratory For Alignment Amanda Askell, Yuntao Bai, Anna Chen, Dawn Drain, Deep Ganguli, Tom Henighan, Andy Jones, Nicholas Joseph, Ben Mann, Nova Dassarma, Nelson Elhage, Zac Hatfield-dodds, Danny Hernandez, Jackson Kernion, Kamal Ndousse, Catherine Olsson, Dario Amodei, Tom Brown, Jack Clark, Sam Mccandlish, Chris Olah, Jared Kaplan
- On The Multilingual Capabilities Of Very Large-scale English Language Models Armengol-estapé Jordi, Bonet Ona De Gibert, Melero Maite
- One Question Answering Model For Many Languages With Cross-lingual Dense Passage Retrieval Asai Akari, Yu Xinyan, Kasai Jungo, Hajishirzi Hannaneh
- See, Hear, Read: Leveraging Multimodality With Guided Attention For Abstractive Text Summarization Atri Yash Kumar, Pramanick Shraman, Goyal Vikram, Chakraborty Tanmoy
- S2s-ft: Fine-tuning Pretrained Transformer Encoders For Sequence-to-sequence Learning Bao Hangbo, Dong Li, Wang Wenhui, Yang Nan, Wei Furu
- Building A Question And Answer System For News Domain Basu Sandipan, Gaddala Aravind, Chetan Pooja, Tiwari Garima, Darapaneni Narayana, Parvathaneni Sadwik, Paduri Anwesh Reddy
- On The Universality Of Deep Contextual Language Models Bhatt Shaily, Goyal Poonam, Dandapat Sandipan, Choudhury Monojit, Sitaram Sunayana
- On The Prunability Of Attention Heads In Multilingual BERT Budhraja Aakriti, Pande Madhura, Kumar Pratyush, Khapra Mitesh M.
- Gaudí: Conversational Interactions With Deep Representations To Generate Image Collections Bursztyn Victor S., Healey Jennifer, Vinay Vishwa
- Long Context Question Answering Via Supervised Contrastive Learning Caciularu Avi, Dagan Ido, Goldberger Jacob, Cohan Arman
- Can Transformers Jump Around Right In Natural Language? Assessing Performance Transfer From SCAN Chaabouni Rahma, Dessì Roberto, Kharitonov Eugene
- Towards Making The Most Of Multilingual Pretraining For Zero-shot Neural Machine Translation Chen Guanhua, Ma Shuming, Chen Yun, Zhang Dongdong, Pan Jia, Wang Wenping, Wei Furu
- Reinforcement Learning For Few-shot Text Generation Adaptation Cheng Pengsen, Dai Jinqiao, Liu Jiamiao, Liu Jiayong, Jia Peng
- Breaking Down Multilingual Machine Translation Chiang Ting-rui, Chen Yi-pei, Yeh Yi-ting, Neubig Graham
- CANINE: Pre-training An Efficient Tokenization-free Encoder For Language Representation Clark Jonathan H., Garrette Dan, Turc Iulia, Wieting John
- Training Verifiers To Solve Math Word Problems Cobbe Karl, Kosaraju Vineet, Bavarian Mohammad, Chen Mark, Jun Heewoo, Kaiser Lukasz, Plappert Matthias, Tworek Jerry, Hilton Jacob, Nakano Reiichiro, Hesse Christopher, Schulman John
- Compression, Transduction, And Creation: A Unified Framework For Evaluating Natural Language Generation Deng Mingkai, Tan Bowen, Liu Zhengzhong, Xing Eric P., Hu Zhiting
- Openprompt: An Open-source Framework For Prompt-learning Ding Ning, Hu Shengding, Zhao Weilin, Chen Yulin, Liu Zhiyuan, Zheng Hai-tao, Sun Maosong
- Not Quite 'ask A Librarian': AI On The Nature, Value, And Future Of LIS Dinneen Jesse David, Bubinger Helen
- Diverse Pretrained Context Encodings Improve Document Translation Donato Domenic, Yu Lei, Dyer Chris
- Is GPT-3 Text Indistinguishable From Human Text? Scarecrow: A Framework For Scrutinizing Machine Text Dou Yao, Forbes Maxwell, Koncel-kedziorski Rik, Smith Noah A., Choi Yejin
- Robustness Challenges In Model Distillation And Pruning For Natural Language Understanding Du Mengnan, Mukherjee Subhabrata, Cheng Yu, Shokouhi Milad, Hu Xia, Awadallah Ahmed Hassan
- Generative Context Pair Selection For Multi-hop Question Answering Dua Dheeru, Santos Cicero Nogueira Dos, Ng Patrick, Athiwaratkun Ben, Xiang Bing, Gardner Matt, Singh Sameer
- Leveraging Knowledge In Multilingual Commonsense Reasoning Fang Yuwei, Wang Shuohang, Xu Yichong, Xu Ruochen, Sun Siqi, Zhu Chenguang, Zeng Michael
- Bertaú: Itaú BERT For Digital Customer Service Finardi Paulo, Viegas José Dié, Ferreira Gustavo T., Mansano Alex F., Caridá Vinicius F.
- Exbert: An External Knowledge Enhanced BERT For Natural Language Inference Gajbhiye Amit, Moubayed Noura Al, Bradley Steven
- Does Dialog Length Matter For Next Response Selection Task? An Empirical Study Ganhotra Jatin, Joshi Sachindra
- An Adversarially-learned Turing Test For Dialog Generation Models Gao Xiang, Zhang Yizhe, Galley Michel, Dolan Bill
- Improving And Diagnosing Knowledge-based Visual Question Answering Via Entity Enhanced Knowledge Injection Garcia-olano Diego, Onoe Yasumasa, Ghosh Joydeep
- Cross-attention Is All You Need: Adapting Pretrained Transformers For Machine Translation Gheini Mozhdeh, Ren Xiang, May Jonathan
- Long Text Generation By Modeling Sentence-level And Discourse-level Coherence Guan Jian, Mao Xiaoxi, Fan Changjie, Liu Zitao, Ding Wenbiao, Huang Minlie
- Block-skim: Efficient Question Answering For Transformer Guan Yue, Li Zhengyi, Leng Jingwen, Lin Zhouhan, Guo Minyi, Zhu Yuhao
- Memory-efficient Transformers Via Top-k Attention Gupta Ankit, Dar Guy, Goodman Shaya, Ciprut David, Berant Jonathan
- WARP: Word-level Adversarial Reprogramming Hambardzumyan Karen, Khachatrian Hrant, May Jonathan
- Text2app: A Framework For Creating Android Apps From Text Descriptions Hasan Masum, Mehrab Kazi Sajeed, Ahmad Wasi Uddin, Shahriyar Rifat
- ENCONTER: Entity Constrained Progressive Sequence Generation Via Insertion-based Transformer Hsieh Lee-hsun, Lee Yang-yin, Lim Ee-peng
- Controllable Dialogue Generation With Disentangled Multi-grained Style Specification And Attribute Consistency Reward Hu Zhe, Cao Zhiwei, Chan Hou Pong, Liu Jiachen, Xiao Xinyan, Su Jinsong, Wu Hua
- SILT: Efficient Transformer Training For Inter-lingual Inference Huertas-tato Javier, Martín Alejandro, Camacho David
- DS-TOD: Efficient Domain Specialization For Task Oriented Dialog Hung Chia-chien, Lauscher Anne, Ponzetto Simone Paolo, Glavaš Goran
- Scaling Language Models: Methods, Analysis & Insights From Training Gopher Jack W. Rae, Sebastian Borgeaud, Trevor Cai, Katie Millican, Jordan Hoffmann, Francis Song, John Aslanides, Sarah Henderson, Roman Ring, Susannah Young, Eliza Rutherford, Tom Hennigan, Jacob Menick, Albin Cassirer, Richard Powell, George Van Den Driessche, Lisa Anne Hendricks, Maribeth Rauh, Po-sen Huang, Amelia Glaese, Johannes Welbl, Sumanth Dathathri, Saffron Huang, Jonathan Uesato, John Mellor, Irina Higgins, Antonia Creswell, Nat Mcaleese, Amy Wu, Erich Elsen, Siddhant Jayakumar, Elena Buchatskaya, David Budden, Esme Sutherland, Karen Simonyan, Michela Paganini, Laurent Sifre, Lena Martens, Xiang Lorraine Li, Adhiguna Kuncoro, Aida Nematzadeh, Elena Gribovskaya, Domenic Donato, Angeliki Lazaridou, Arthur Mensch, Jean-baptiste Lespiau, Maria Tsimpoukelli, Nikolai Grigorev, Doug Fritz, Thibault Sottiaux, Mantas Pajarskas, Toby Pohlen, Zhitao Gong, Daniel Toyama, Cyprien De Masson D'autume, Yujia Li, Tayfun Terzi, Vladimir Mikulik, Igor Babuschkin, Aidan Clark, Diego De Las Casas, Aurelia Guy, Chris Jones, James Bradbury, Matthew Johnson, Blake Hechtman, Laura Weidinger, Iason Gabriel, William Isaac, Ed Lockhart, Simon Osindero, Laura Rimell, Chris Dyer, Oriol Vinyals, Kareem Ayoub, Jeff Stanway, Lorrayne Bennett, Demis Hassabis, Koray Kavukcuoglu, Geoffrey Irving
- Visqa: X-raying Vision And Language Reasoning In Transformers Jaunet Theo, Kervadec Corentin, Vuillemot Romain, Antipov Grigory, Baccouche Moez, Wolf Christian
- Inducing Transformer's Compositional Generalization Ability Via Auxiliary Sequence Prediction Tasks Jiang Yichen, Bansal Mohit
- REPT: Bridging Language Models And Machine Reading Comprehension Via Retrieval-based Pre-training Jiao Fangkai, Guo Yangyang, Niu Yilin, Ji Feng, Li Feng-lin, Nie Liqiang
- A Good Prompt Is Worth Millions Of Parameters: Low-resource Prompt-based Learning For Vision-language Models Jin Woojeong, Cheng Yu, Shen Yelong, Chen Weizhu, Ren Xiang
- Rethinking Self-supervision Objectives For Generalizable Coherence Modeling Jwalapuram Prathyusha, Joty Shafiq, Lin Xiang
- Amazon Sagemaker Model Parallelism: A General And Flexible Framework For Large Model Training Karakus Can, Huilgol Rahul, Wu Fei, Subramanian Anirudh, Daniel Cade, Cavdar Derya, Xu Teng, Chen Haohan, Rahnama Arash, Quintela Luis
- Generating Rich Product Descriptions For Conversational E-commerce Systems Kedia Shashank, Mantha Aditya, Gupta Sneha, Guo Stephen, Achan Kannan
- What's In A Measurement? Using GPT-3 On Semeval 2021 Task 8 -- Measeval Kohler Curt, Daniel Ron Jr
- Controlling Conditional Language Models Without Catastrophic Forgetting Korbak Tomasz, Elsahar Hady, Kruszewski German, Dymetman Marc
- BERT Busters: Outlier Dimensions That Disrupt Transformers Kovaleva Olga, Kulshreshtha Saurabh, Rogers Anna, Rumshisky Anna
- Hurdles To Progress In Long-form Question Answering Krishna Kalpesh, Roy Aurko, Iyyer Mohit
- On The Significance Of Question Encoder Sequence Model In The Out-of-distribution Performance In Visual Question Answering Kv Gouthaman, Mittal Anurag
- Pagnol: An Extra-large French Generative Model Launay Julien, Tommasone Elena, Pannier Baptiste, Boniface François, Chatelain Amélie, Cappelli Alessandro, Poli Iacopo, Seddah Djamé
- Can Transformer Language Models Predict Psychometric Properties? Laverghetta Antonio Jr., Nighojkar Animesh, Mirzakhalov Jamshidbek, Licato John
- Quiz-style Question Generation For News Stories Lelkes Adam D., Tran Vinh Q., Yu Cong
- Exploiting Sentence-level Representations For Passage Ranking Leonhardt Jurek, Beringer Fabian, Anand Avishek
- Semvlp: Vision-language Pre-training By Aligning Semantics At Multiple Levels Li Chenliang, Yan Ming, Xu Haiyang, Luo Fuli, Wang Wei, Bi Bin, Huang Songfang
- Self-supervised Bot Play For Conversational Recommendation With Justifications Li Shuyang, Majumder Bodhisattwa Prasad, Mcauley Julian
- Question-aware Memory Network For Multi-hop Question Answering In Human-robot Interaction Li Xinmeng, Alazab Mamoun, Li Qian, Yu Keping, Yin Quanjun
- Scheduled Sampling In Vision-language Pretraining With Decoupled Encoder-decoder Network Li Yehao, Pan Yingwei, Yao Ting, Chen Jingwen, Mei Tao
- Lyra: A Benchmark For Turducken-style Code Generation Liang Qingyuan, Sun Zeyu, Zhu Qihao, Zhang Wenjie, Yu Lian, Xiong Yingfei, Zhang Lu
- Learning Neural Templates For Recommender Dialogue System Liang Zujie, Hu Huang, Xu Can, Miao Jian, He Yingying, Chen Yining, Geng Xiubo, Liang Fan, Jiang Daxin
- Domain-independent User Simulation With Transformers For Task-oriented Dialogue Systems Lin Hsien-chin, Lubis Nurul, Hu Songbo, Van Niekerk Carel, Geishauser Christian, Heck Michael, Feng Shutong, Gašić Milica
- Variational Latent-state GPT For Semi-supervised Task-oriented Dialog Systems Liu Hong, Cai Yucheng, Lin Zhenru, Ou Zhijian, Huang Yi, Feng Junlan
- A Three-stage Learning Framework For Low-resource Knowledge-grounded Dialogue Generation Liu Shilei, Zhao Xiaofeng, Li Bochao, Ren Feiliang, Zhang Longhui, Yin Shujuan
- Enriching Non-autoregressive Transformer With Syntactic And Semantic Structures For Neural Machine Translation Liu Ye, Wan Yao, Zhang Jian-guo, Zhao Wenting, Yu Philip S.
- Augmenting Sequential Recommendation With Pseudo-prior Items Via Reversely Pre-training Transformer Liu Zhiwei, Fan Ziwei, Wang Yu, Yu Philip S.
- Iconqa: A New Benchmark For Abstract Diagram Understanding And Visual Language Reasoning Lu Pan, Qiu Liang, Chen Jiaqi, Xia Tony, Zhao Yizhou, Zhang Wei, Yu Zhou, Liang Xiaodan, Zhu Song-chun
- X-METRA-ADA: Cross-lingual Meta-transfer Learning Adaptation To Natural Language Understanding And Question Answering M'hamdi Meryem, Kim Doo Soon, Dernoncourt Franck, Bui Trung, Ren Xiang, May Jonathan
- Deltalm: Encoder-decoder Pre-training For Language Generation And Translation By Augmenting Pretrained Multilingual Encoders Ma Shuming, Dong Li, Huang Shaohan, Zhang Dongdong, Muzio Alexandre, Singhal Saksham, Awadalla Hany Hassan, Song Xia, Wei Furu
- Towards Retrieval-based Conversational Recommendation Manzoor Ahtsham, Jannach Dietmar
- Transformer Based Bengali Chatbot Using General Knowledge Dataset Masum Abu Kaisar Mohammad, Abujar Sheikh, Akter Sharmin, Ria Nushrat Jahan, Hossain Syed Akhter
- Unified Questioner Transformer For Descriptive Question Generation In Goal-oriented Visual Dialogue Matsumori Shoya, Shingyouchi Kosuke, Abe Yuki, Fukuchi Yosuke, Sugiura Komei, Imai Michita
- Rewire-then-probe: A Contrastive Recipe For Probing Biomedical Knowledge Of Pre-trained Language Models Meng Zaiqiao, Liu Fangyu, Shareghi Ehsan, Su Yixuan, Collins Charlotte, Collier Nigel
- Language Models Are Few-shot Butlers Micheli Vincent, Fleuret François
- First Align, Then Predict: Understanding The Cross-lingual Ability Of Multilingual BERT Muller Benjamin, Elazar Yanai, Sagot Benoît, Seddah Djamé
- Arat5: Text-to-text Transformers For Arabic Language Generation Nagoudi El Moatez Billah, Elmadany Abdelrahim, Abdul-mageed Muhammad
- P-adapters: Robustly Extracting Factual Information From Language Models With Diverse Prompts Newman Benjamin, Choubey Prafulla Kumar, Rajani Nazneen
- Sentence-t5: Scalable Sentence Encoders From Pre-trained Text-to-text Models Ni Jianmo, Ábrego Gustavo Hernández, Constant Noah, Ma Ji, Hall Keith B., Cer Daniel, Yang Yinfei
- ABC: Attention With Bounded-memory Control Peng Hao, Kasai Jungo, Pappas Nikolaos, Yogatama Dani, Wu Zhaofeng, Kong Lingpeng, Schwartz Roy, Smith Noah A.
- Pre-train, Prompt, And Predict: A Systematic Survey Of Prompting Methods In Natural Language Processing Pengfei Liu, Weizhe Yuan, Jinlan Fu, Zhengbao Jiang, Hiroaki Hayashi, Graham Neubig
- Cotext: Multi-task Learning With Code-text Transformer Phan Long, Tran Hieu, Le Daniel, Nguyen Hieu, Anibal James, Peltekian Alec, Ye Yanfang
- Does Pre-training Induce Systematic Inference? How Masked Language Models Acquire Commonsense Knowledge Porada Ian, Sordoni Alessandro, Cheung Jackie Chi Kit
- Vinmt: Neural Machine Translation Toolkit Quan Nguyen Hoang, Dat Nguyen Thanh, Cong Nguyen Hoang Minh, Van Vinh Nguyen, Vinh Ngo Thi, Thai Nguyen Phuong, Viet Tran Hong
- Webgpt: Browser-assisted Question-answering With Human Feedback Reiichiro Nakano, Jacob Hilton, Suchir Balaji, Jeff Wu, Long Ouyang, Christina Kim, Christopher Hesse, Shantanu Jain, Vineet Kosaraju, William Saunders, Xu Jiang, Karl Cobbe, Tyna Eloundou, Gretchen Krueger, Kevin Button, Matthew Knight, Benjamin Chess, John Schulman
- Multiversal Views On Language Models Reynolds Laria, Mcdonell Kyle
- Don't Sweep Your Learning Rate Under The Rug: A Closer Look At Cross-modal Transfer Of Pretrained Transformers Rothermel Danielle, Li Margaret, Rocktäschel Tim, Foerster Jakob
- Leveraging The Inductive Bias Of Large Language Models For Abstract Textual Reasoning Rytting Christopher Michael, Wingate David
- Metaicl: Learning To Learn In Context Sewon Min, Mike Lewis, Luke Zettlemoyer, Hannaneh Hajishirzi
- Audio-visual Scene-aware Dialog And Reasoning Using Audio-visual Transformers With Joint Student-teacher Learning Shah Ankit P., Geng Shijie, Gao Peng, Cherian Anoop, Hori Takaaki, Marks Tim K., Roux Jonathan Le, Hori Chiori
- FPM: A Collection Of Large-scale Foundation Pre-trained Language Models Shen Dezhou
- How Much Can CLIP Benefit Vision-and-language Tasks? Shen Sheng, Li Liunian Harold, Tan Hao, Bansal Mohit, Rohrbach Anna, Chang Kai-wei, Yao Zhewei, Keutzer Kurt
- Few-shot Semantic Parsing With Language Models Trained On Code Shin Richard, Van Durme Benjamin
- Pre-trained Language Models As Prior Knowledge For Playing Text-based Games Singh Ishika, Singh Gargi, Modi Ashutosh
- Multimodal Integration Of Human-like Attention In Visual Question Answering Sood Ekta, Kögel Fabian, Müller Philipp, Thomas Dominike, Bace Mihai, Bulling Andreas
- Worst Of Both Worlds: Biases Compound In Pre-trained Vision-and-language Models Srinivasan Tejas, Bisk Yonatan
- Txt: Crossmodal End-to-end Learning With Transformers Steitz Jan-martin O., Pfeiffer Jonas, Gurevych Iryna, Roth Stefan
- Embodied BERT: A Transformer Model For Embodied, Language-guided Visual Task Completion Suglia Alessandro, Gao Qiaozi, Thomason Jesse, Thattai Govind, Sukhatme Gaurav
- Not All Memories Are Created Equal: Learning To Forget By Expiring Sukhbaatar Sainbayar, Ju Da, Poff Spencer, Roller Stephen, Szlam Arthur, Weston Jason, Fan Angela
- Do Long-range Language Models Actually Use Long-range Context? Sun Simeng, Krishna Kalpesh, Mattarella-micke Andrew, Iyyer Mohit
- ERNIE 3.0: Large-scale Knowledge Enhanced Pre-training For Language Understanding And Generation Sun Yu, Wang Shuohuan, Feng Shikun, Ding Siyu, Pang Chao, Shang Junyuan, Liu Jiaxiang, Chen Xuyi, Zhao Yanbin, Lu Yuxiang, Liu Weixin, Wu Zhihua, Gong Weibao, Liang Jianzhong, Shang Zhizhou, Sun Peng, Liu Wei, Ouyang Xuan, Yu Dianhai, Tian Hao, Wu Hua, Wang Haifeng
- Diff-explainer: Differentiable Convex Optimization For Explainable Multi-hop Inference Thayaparan Mokanarangan, Valentino Marco, Ferreira Deborah, Rozanova Julia, Freitas André
- Towards Reinforcement Learning For Pivot-based Neural Machine Translation With Non-autoregressive Transformer Tokarchuk Evgeniia, Rosendahl Jan, Wang Weiyue, Petrushkov Pavel, Lancewicki Tomer, Khadivi Shahram, Ney Hermann
- Multimodal Few-shot Learning With Frozen Language Models Tsimpoukelli Maria, Menick Jacob, Cabi Serkan, Eslami S. M. Ali, Vinyals Oriol, Hill Felix
- Lightseq2: Accelerated Training For Transformer-based Models On Gpus Wang Xiaohui, Wei Yang, Xiong Ying, Huang Guyue, Qian Xian, Ding Yufei, Wang Mingxuan, Li Lei
- Codet5: Identifier-aware Unified Pre-trained Encoder-decoder Models For Code Understanding And Generation Wang Yue, Wang Weishi, Joty Shafiq, Hoi Steven C. H.
- Training Multilingual Pre-trained Language Model With Byte-level Subwords Wei Junqiu, Liu Qun, Guo Yinpeng, Jiang Xin
- Reframing Human-ai Collaboration For Generating Free-text Explanations Wiegreffe Sarah, Hessel Jack, Swayamdipta Swabha, Riedl Mark, Choi Yejin
- Graph-free Multi-hop Reading Comprehension: A Select-to-guide Strategy Wu Bohong, Zhang Zhuosheng, Zhao Hai
- Not All Attention Is All You Need Wu Hongqiu, Zhao Hai, Zhang Min
- P-tuning V2: Prompt Tuning Can Be Comparable To Fine-tuning Universally Across Scales And Tasks Xiao Liu, Kaixuan Ji, Yicheng Fu, Weng Lam Tam, Zhengxiao Du, Zhilin Yang, Jie Tang
- Training Large-scale News Recommenders With Pretrained Language Models In The Loop Xiao Shitao, Liu Zheng, Shao Yingxia, Di Tao, Xie Xing
- An Explanation Of In-context Learning As Implicit Bayesian Inference Xie Sang Michael, Raghunathan Aditi, Liang Percy, Ma Tengyu
- Simple Local Attentions Remain Competitive For Long-context Tasks Xiong Wenhan, Oğuz Barlas, Gupta Anchit, Chen Xilun, Liskovich Diana, Levy Omer, Yih Wen-tau, Mehdad Yashar
- VLM: Task-agnostic Video-language Model Pre-training For Video Understanding Xu Hu, Ghosh Gargi, Huang Po-yao, Arora Prahal, Aminzadeh Masoumeh, Feichtenhofer Christoph, Metze Florian, Zettlemoyer Luke
- Causal Attention For Vision-language Tasks Yang Xu, Zhang Hanwang, Qi Guojun, Cai Jianfei
- TR-BERT: Dynamic Token Reduction For Accelerating BERT Inference Ye Deming, Lin Yankai, Huang Yufei, Sun Maosong
- Turning Tables: Generating Examples From Semi-structured Tables For Endowing Language Models With Reasoning Skills Yoran Ori, Talmor Alon, Berant Jonathan
- On The Interplay Between Fine-tuning And Composition In Transformers Yu Lang, Ettinger Allyson
- LV-BERT: Exploiting Layer Variety For BERT Yu Weihao, Jiang Zihang, Chen Fei, Hou Qibin, Feng Jiashi
- Improving Sequential Recommendation Consistency With Self-supervised Imitation Yuan Xu, Chen Hongshen, Song Yonghao, Zhao Xiaofang, Ding Zhuoye, He Zhen, Long Bo
- Bitfit: Simple Parameter-efficient Fine-tuning For Transformer-based Masked Language-models Zaken Elad Ben, Ravfogel Shauli, Goldberg Yoav
- Retgen: A Joint Framework For Retrieval And Grounded Text Generation Modeling Zhang Yizhe, Sun Siqi, Gao Xiang, Fang Yuwei, Brockett Chris, Galley Michel, Gao Jianfeng, Dolan Bill
- An Exploratory Study On Long Dialogue Summarization: What Works And What's Next Zhang Yusen, Ni Ansong, Yu Tao, Zhang Rui, Zhu Chenguang, Deb Budhaditya, Celikyilmaz Asli, Awadallah Ahmed Hassan, Radev Dragomir
- Learning To Rehearse In Long Sequence Memorization Zhang Zhu, Zhou Chang, Ma Jianxin, Lin Zhijie, Zhou Jingren, Yang Hongxia, Zhao Zhou
- Structural Pre-training For Dialogue Comprehension Zhang Zhuosheng, Zhao Hai
- CPM-2: Large-scale Cost-effective Pre-trained Language Models Zhengyan Zhang, Yuxian Gu, Xu Han, Shengqi Chen, Chaojun Xiao, Zhenbo Sun, Yuan Yao, Fanchao Qi, Jian Guan, Pei Ke, Yanzheng Cai, Guoyang Zeng, Zhixing Tan, Zhiyuan Liu, Minlie Huang, Wentao Han, Yang Liu, Xiaoyan Zhu, Maosong Sun
- SILG: The Multi-environment Symbolic Interactive Language Grounding Benchmark Zhong Victor, Hanjie Austin W., Wang Sida I., Narasimhan Karthik, Zettlemoyer Luke
- LAFITE: Towards Language-free Training For Text-to-image Generation Zhou Yufan, Zhang Ruiyi, Chen Changyou, Li Chunyuan, Tensmeyer Chris, Yu Tong, Gu Jiuxiang, Xu Jinhui, Sun Tong
- Thinking Clearly, Talking Fast: Concept-guided Non-autoregressive Generation For Open-domain Dialogue Systems Zou Yicheng, Liu Zhihua, Hu Xingwu, Zhang Qi
- Gpt-3-driven Pedagogical Agents For Training Children's Curious Question-asking Skills Abdelghani Rania, Wang Yen-hsiang, Yuan Xingdi, Wang Tong, Lucas Pauline, Sauzéon Hélène, Oudeyer Pierre-yves
- Making Large Language Models Interactive: A Pioneer Study On Supporting Complex Information-seeking Tasks With Implicit Constraints Ahmadvand Ali, Arabzadeh Negar, Kiseleva Julia, Sanz Patricio Figueroa, Deng Xin, Jauhar Sujay, Gamon Michael, Agichtein Eugene, Friend Ned, Aniruddha
- Few-shot Training Llms For Project-specific Code-summarization Ahmed Toufique, Devanbu Premkumar
- Solving Quantitative Reasoning Problems With Language Models Aitor Lewkowycz, Anders Andreassen, David Dohan, Ethan Dyer, Henryk Michalewski, Vinay Ramasesh, Ambrose Slone, Cem Anil, Imanol Schlag, Theo Gutman-solo, Yuhuai Wu, Behnam Neyshabur, Guy Gur-ari, Vedant Misra
- Despite "super-human" Performance, Current Llms Are Unsuited For Decisions About Ethics And Safety Albrecht Joshua, Kitanidis Ellie, Fetterman Abraham J.
- Improving Alignment Of Dialogue Agents Via Targeted Human Judgements Amelia Glaese, Nat Mcaleese, Maja Trębacz, John Aslanides, Vlad Firoiu, Timo Ewalds, Maribeth Rauh, Laura Weidinger, Martin Chadwick, Phoebe Thacker, Lucy Campbell-gillingham, Jonathan Uesato, Po-sen Huang, Ramona Comanescu, Fan Yang, Abigail See, Sumanth Dathathri, Rory Greig, Charlie Chen, Doug Fritz, Jaume Sanchez Elias, Richard Green, Soňa Mokrá, Nicholas Fernando, Boxi Wu, Rachel Foley, Susannah Young, Iason Gabriel, William Isaac, John Mellor, Demis Hassabis, Koray Kavukcuoglu, Lisa Anne Hendricks, Geoffrey Irving
- Can Language Models Learn From Explanations In Context? Andrew K. Lampinen, Ishita Dasgupta, Stephanie C. Y. Chan, Kory Matthewson, Michael Henry Tessler, Antonia Creswell, James L. Mcclelland, Jane X. Wang, Felix Hill
- Socratic Models: Composing Zero-shot Multimodal Reasoning With Language Andy Zeng, Maria Attarian, Brian Ichter, Krzysztof Choromanski, Adrian Wong, Stefan Welker, Federico Tombari, Aveek Purohit, Michael Ryoo, Vikas Sindhwani, Johnny Lee, Vincent Vanhoucke, Pete Florence
- Exploring Length Generalization In Large Language Models Anil Cem, Wu Yuhuai, Andreassen Anders, Lewkowycz Aitor, Misra Vedant, Ramasesh Vinay, Slone Ambrose, Gur-ari Guy, Dyer Ethan, Neyshabur Behnam
- Selection-inference: Exploiting Large Language Models For Interpretable Logical Reasoning Antonia Creswell, Murray Shanahan, Irina Higgins
- GLM-130B: An Open Bilingual Pre-trained Model Aohan Zeng, Xiao Liu, Zhengxiao Du, Zihan Wang, Hanyu Lai, Ming Ding, Zhuoyi Yang, Yifan Xu, Wendi Zheng, Xiao Xia, Weng Lam Tam, Zixuan Ma, Yufei Xue, Jidong Zhai, Wenguang Chen, Peng Zhang, Yuxiao Dong, Jie Tang
- Characterizing Verbatim Short-term Memory In Neural Language Models Armeni Kristijan, Honey Christopher, Linzen Tal
- Parameter-efficient Finetuning Of Transformers For Source Code Ayupov Shamil, Chirkova Nadezhda
- Improving Short Text Classification With Augmented Data Using GPT-3 Balkus Salvador, Yan Donghui
- Rethinking The Role Of Scale For In-context Learning: An Interpretability-based Case Study At 66 Billion Scale Bansal Hritik, Gopalakrishnan Karthik, Dingliwal Saket, Bodapati Sravan, Kirchhoff Katrin, Roth Dan
- Vector-quantized Input-contextualized Soft Prompts For Natural Language Understanding Bhardwaj Rishabh, Saha Amrita, Hoi Steven C. H., Poria Soujanya
- Using Cognitive Psychology To Understand GPT-3 Binz Marcel, Schulz Eric
- Inpars: Data Augmentation For Information Retrieval Using Large Language Models Bonifacio Luiz, Abonizio Hugo, Fadaee Marzieh, Nogueira Rodrigo
- Looking For A Handsome Carpenter! Debiasing GPT-3 Job Advertisements Borchers Conrad, Gala Dalia Sara, Gilburt Benjamin, Oravkin Eduard, Bounsi Wilfried, Asano Yuki M., Kirk Hannah Rose
- Understanding Performance Of Long-document Ranking Models Through Comprehensive Evaluation And Leaderboarding Boytsov Leonid, Akinpelu David, Lin Tianyi, Gao Fangwei, Zhao Yutian, Huang Jeffrey, Katyal Nipun, Nyberg Eric
- Large Language Models Can Implement Policy Iteration Brooks Ethan, Walls Logan, Lewis Richard L., Singh Satinder
- Robust Preference Learning For Storytelling Via Contrastive Reinforcement Learning Castricato Louis, Havrilla Alexander, Matiana Shahbuland, Pieler Michael, Ye Anbang, Yang Ian, Frazier Spencer, Riedl Mark
- Xdoc: Unified Pre-training For Cross-format Document Understanding Chen Jingye, Lv Tengchao, Cui Lei, Zhang Cha, Wei Furu
- Leveraging Natural Supervision For Language Representation Learning And Generation Chen Mingda
- Murag: Multimodal Retrieval-augmented Generator For Open Question Answering Over Images And Text Chen Wenhu, Hu Hexiang, Chen Xi, Verga Pat, Cohen William W.
- Transformers Go For The Lols: Generating (humourous) Titles From Scientific Abstracts End-to-end Chen Yanran, Eger Steffen
- Task-aware Specialization For Efficient And Robust Dense Retrieval For Open-domain Question Answering Cheng Hao, Fang Hao, Liu Xiaodong, Gao Jianfeng
- Capture Salient Historical Information: A Fast And Accurate Non-autoregressive Model For Multi-turn Spoken Language Understanding Cheng Lizhi, Jia Weijia, Yang Wenmian
- Is Multiwoz A Solved Task? An Interactive TOD Evaluation Framework With User Simulator Cheng Qinyuan, Li Linyang, Quan Guofeng, Gao Feng, Mou Xiaofeng, Qiu Xipeng
- Prompting GPT-3 To Be Reliable Chenglei Si, Zhe Gan, Zhengyuan Yang, Shuohang Wang, Jianfeng Wang, Jordan Boyd-graber, Lijuan Wang
- On The Evaluation Of Answer-agnostic Paragraph-level Multi-question Generation Chowdhury Jishnu Ray, Mahata Debanjan, Caragea Cornelia
- One Agent To Rule Them All: Towards Multi-agent Conversational AI Clarke Christopher, Peper Joseph Joshua, Krishnamurthy Karthik, Talamonti Walter, Leach Kevin, Lasecki Walter, Kang Yiping, Tang Lingjia, Mars Jason
- On Reality And The Limits Of Language Data: Aligning Llms With Human Norms Collier Nigel H., Liu Fangyu, Shareghi Ehsan
- Building The Intent Landscape Of Real-world Conversational Corpora With Extractive Question-answering Transformers Corbeil Jean-philippe, Li Mia Taige, Ghavidel Hadi Abdi
- Continual Pre-training Mitigates Forgetting In Language And Vision Cossu Andrea, Tuytelaars Tinne, Carta Antonio, Passaro Lucia, Lomonaco Vincenzo, Bacciu Davide
- M6-rec: Generative Pretrained Language Models Are Open-ended Recommender Systems Cui Zeyu, Ma Jianxin, Zhou Chang, Zhou Jingren, Yang Hongxia
- The Future Is Different: Large Pre-trained Language Models Fail In Prediction Tasks Cvejoski Kostadin, Sánchez Ramsés J., Ojeda César
- Why Can GPT Learn In-context? Language Models Implicitly Perform Gradient Descent As Meta-optimizers Dai Damai, Sun Yutao, Dong Li, Hao Yaru, Ma Shuming, Sui Zhifang, Wei Furu
- Promptagator: Few-shot Dense Retrieval From 8 Examples Dai Zhuyun, Zhao Vincent Y., Ma Ji, Luan Yi, Ni Jianmo, Lu Jing, Bakalov Anton, Guu Kelvin, Hall Keith B., Chang Ming-wei
- Understanding BLOOM: An Empirical Study On Diverse NLP Tasks Dakle Parag Pravin, Rallabandi Saikrishna, Raghavan Preethi
- Rlprompt: Optimizing Discrete Text Prompts With Reinforcement Learning Deng Mingkai, Wang Jianyu, Hsieh Cheng-ping, Wang Yihan, Guo Han, Shu Tianmin, Song Meng, Xing Eric P., Hu Zhiting
- Least-to-most Prompting Enables Complex Reasoning In Large Language Models Denny Zhou, Nathanael Schärli, Le Hou, Jason Wei, Nathan Scales, Xuezhi Wang, Dale Schuurmans, Claire Cui, Olivier Bousquet, Quoc Le, Ed Chi
- Black-box Prompt Learning For Pre-trained Language Models Diao Shizhe, Huang Zhichao, Xu Ruijia, Li Xuechun, Lin Yong, Zhou Xiao, Zhang Tong
- Cogview2: Faster And Better Text-to-image Generation Via Hierarchical Transformers Ding Ming, Zheng Wendi, Hong Wenyi, Tang Jie
- Corrpus: Code-based Structured Prompting For Neurosymbolic Story Understanding Dong Yijiang River, Martin Lara J., Callison-burch Chris
- Probing For Incremental Parse States In Autoregressive Language Models Eisape Tiwalayo, Gangireddy Vineet, Levy Roger P., Kim Yoon
- Llmeffichecker: Understanding And Testing Efficiency Degradation Of Large Language Models Feng Xiaoning, Han Xiaohong, Chen Simin, Yang Wei
- Learn To Remember: Transformer With Recurrent Memory For Document-level Machine Translation Feng Yukun, Li Feng, Song Ziang, Zheng Boyuan, Koehn Philipp
- Language Models Are Multilingual Chain-of-thought Reasoners Freda Shi, Mirac Suzgun, Markus Freitag, Xuezhi Wang, Suraj Srivats, Soroush Vosoughi, Hyung Won Chung, Yi Tay, Sebastian Ruder, Denny Zhou, Dipanjan Das, Jason Wei
- Revision Transformers: Instructing Language Models To Change Their Values Friedrich Felix, Stammer Wolfgang, Schramowski Patrick, Kersting Kristian
- Hungry Hungry Hippos: Towards Language Modeling With State Space Models Fu Daniel Y., Dao Tri, Saab Khaled K., Thomas Armin W., Rudra Atri, Ré Christopher
- Visual Prompt Tuning For Test-time Domain Adaptation Gao Yunhe, Shi Xingjian, Zhu Yi, Wang Hao, Tang Zhiqiang, Zhou Xiong, Li Mu, Metaxas Dimitris N.
- Extensible Prompts For Language Models On Zero-shot Language Style Customization Ge Tao, Hu Jing, Dong Li, Mao Shaoguang, Xia Yan, Wang Xun, Chen Si-qing, Wei Furu
- Cramming: Training A Language Model On A Single GPU In One Day Geiping Jonas, Goldstein Tom
- Diffuseq: Sequence To Sequence Text Generation With Diffusion Models Gong Shansan, Li Mukai, Feng Jiangtao, Wu Zhiyong, Kong Lingpeng
- Checks And Strategies For Enabling Code-switched Machine Translation Gowda Thamme, Gheini Mozhdeh, May Jonathan
- Continuous Decomposition Of Granularity For Neural Paraphrase Generation Gu Xiaodong, Zhang Zhaowei, Lee Sang-woo, Yoo Kang Min, Ha Jung-woo
- Optimal Transport For Unsupervised Hallucination Detection In Neural Machine Translation Guerreiro Nuno M., Colombo Pierre, Piantanida Pablo, Martins André F. T.
- Unixcoder: Unified Cross-modal Pre-training For Code Representation Guo Daya, Lu Shuai, Duan Nan, Wang Yanlin, Zhou Ming, Yin Jian
- Visually-augmented Pretrained Language Models For NLP Tasks Without Images Guo Hangyu, Zhou Kun, Zhao Wayne Xin, Zhang Qinyu, Wen Ji-rong
- LVP-M3: Language-aware Visual Prompt For Multilingual Multimodal Machine Translation Guo Hongcheng, Liu Jiaheng, Huang Haoyang, Yang Jian, Li Zhoujun, Zhang Dongdong, Cui Zheng, Wei Furu
- STI: Turbocharge NLP Inference At The Edge Via Elastic Pipelining Guo Liwei, Choe Wonkyo, Lin Felix Xiaozhu
- ORCA: Interpreting Prompted Language Models Via Locating Supporting Data Evidence In The Ocean Of Pretraining Data Han Xiaochuang, Tsvetkov Yulia
- Supervised Visual Attention For Simultaneous Multimodal Machine Translation Haralampieva Veneta, Caglayan Ozan, Specia Lucia
- Aging With GRACE: Lifelong Model Editing With Discrete Key-value Adaptors Hartvigsen Thomas, Sankaranarayanan Swami, Palangi Hamid, Kim Yoon, Ghassemi Marzyeh
- Understanding Transformer Memorization Recall Through Idioms Haviv Adi, Cohen Ido, Gidron Jacob, Schuster Roei, Goldberg Yoav, Geva Mor
- Robust Dialogue State Tracking With Weak Supervision And Sparse Data Heck Michael, Lubis Nurul, Van Niekerk Carel, Feng Shutong, Geishauser Christian, Lin Hsien-chin, Gašić Milica
- Prompt-to-prompt Image Editing With Cross Attention Control Hertz Amir, Mokady Ron, Tenenbaum Jay, Aberman Kfir, Pritch Yael, Cohen-or Daniel
- Wikiwhy: Answering And Explaining Cause-and-effect Questions Ho Matthew, Sharma Aditya, Chang Justin, Saxon Michael, Levy Sharon, Lu Yujie, Wang William Yang
- Compressing Pre-trained Transformers Via Low-bit Nxm Sparsity For Natural Language Understanding Holmes Connor, Zhang Minjia, He Yuxiong, Wu Bo
- TRUE: Re-evaluating Factual Consistency Evaluation Honovich Or, Aharoni Roee, Herzig Jonathan, Taitelbaum Hagai, Kukliansy Doron, Cohen Vered, Scialom Thomas, Szpektor Idan, Hassidim Avinatan, Matias Yossi
- Instruction Induction: From Few Examples To Natural Language Task Descriptions Honovich Or, Shaham Uri, Bowman Samuel R., Levy Omer
- Promptcap: Prompt-guided Task-aware Image Captioning Hu Yushi, Hua Hang, Yang Zhengyuan, Shi Weijia, Smith Noah A, Luo Jiebo
- REVEAL: Retrieval-augmented Visual-language Pre-training With Multi-source Multimodal Knowledge Memory Hu Ziniu, Iscen Ahmet, Sun Chen, Wang Zirui, Chang Kai-wei, Sun Yizhou, Schmid Cordelia, Ross David A., Fathi Alireza
- Empowering Language Models With Knowledge Graph Reasoning For Question Answering Hu Ziniu, Xu Yichong, Yu Wenhao, Wang Shuohang, Yang Ziyi, Zhu Chenguang, Chang Kai-wei, Sun Yizhou
- Autoregressive Entity Generation For End-to-end Task-oriented Dialog Huang Guanhuan, Quan Xiaojun, Wang Qifan
- Detecting Unintended Memorization In Language-model-fused ASR Huang W. Ronny, Chien Steve, Thakkar Om, Mathews Rajiv
- Inner Monologue: Embodied Reasoning Through Planning With Language Models Huang Wenlong, Xia Fei, Xiao Ted, Chan Harris, Liang Jacky, Florence Pete, Zeng Andy, Tompson Jonathan, Mordatch Igor, Chebotar Yevgen, Sermanet Pierre, Brown Noah, Jackson Tomas, Luu Linda, Levine Sergey, Hausman Karol, Ichter Brian
- Layoutlmv3: Pre-training For Document AI With Unified Text And Image Masking Huang Yupan, Lv Tengchao, Cui Lei, Lu Yutong, Wei Furu
- MCP: Self-supervised Pre-training For Personalized Chatbots With Multi-level Contrastive Sampling Huang Zhaoheng, Dou Zhicheng, Zhu Yutao, Ma Zhengyi
- Clues Before Answers: Generation-enhanced Multiple-choice QA Huang Zixian, Wu Ao, Zhou Jiaying, Gu Yu, Zhao Yue, Cheng Gong
- Efficient Long-text Understanding With Short-text Models Ivgi Maor, Shaham Uri, Berant Jonathan
- HINT: Hypernetwork Instruction Tuning For Efficient Zero- & Few-shot Generalisation Ivison Hamish, Bhagia Akshita, Wang Yizhong, Hajishirzi Hannaneh, Peters Matthew
- Lamemo: Language Modeling With Look-ahead Memory Ji Haozhe, Zhang Rongsheng, Yang Zhenyu, Hu Zhipeng, Huang Minlie
- Survey Of Hallucination In Natural Language Generation Ji Ziwei, Lee Nayeon, Frieske Rita, Yu Tiezheng, Su Dan, Xu Yan, Ishii Etsuko, Bang Yejin, Chen Delong, Dai Wenliang, Chan Ho Shu, Madotto Andrea, Fung Pascale
- RHO (\(\rho\)): Reducing Hallucination In Open-domain Dialogues With Knowledge Grounding Ji Ziwei, Liu Zihan, Lee Nayeon, Yu Tiezheng, Wilie Bryan, Zeng Min, Fung Pascale
- Probing Script Knowledge From Pre-trained Models Jin Zijian, Zhang Xingyu, Yu Mo, Huang Lifu
- A New Path: Scaling Vision-and-language Navigation With Synthetic Instructions And Imitation Learning Kamath Aishwarya, Anderson Peter, Wang Su, Koh Jing Yu, Ku Alexander, Waters Austin, Yang Yinfei, Baldridge Jason, Parekh Zarana
- Fie: Building A Global Probability Space By Leveraging Early Fusion In Encoder For Open-domain Question Answering Kedia Akhil, Zaidi Mohd Abbas, Lee Haejun
- Improving Language Model Prompting In Support Of Semi-autonomous Task Learning Kirk James R., Wray Robert E., Lindes Peter, Laird John E.
- Language Detoxification With Attribute-discriminative Latent Space Kwak Jin Myung, Kim Minseon, Hwang Sung Ju
- Coauthor: Designing A Human-ai Collaborative Writing Dataset For Exploring Language Model Capabilities Lee Mina, Liang Percy, Yang Qian
- Evaluating Human-language Model Interaction Lee Mina, Srivastava Megha, Hardy Amelia, Thickstun John, Durmus Esin, Paranjape Ashwin, Gerard-ursin Ines, Li Xiang Lisa, Ladhak Faisal, Rong Frieda, Wang Rose E., Kwon Minae, Park Joon Sung, Cao Hancheng, Lee Tony, Bommasani Rishi, Bernstein Michael, Liang Percy
- Littlebird: Efficient Faster & Longer Transformer For Question Answering Lee Minchul, Han Kijong, Shin Myeong Cheol
- Evolution Through Large Models Lehman Joel, Gordon Jonathan, Jain Shawn, Ndousse Kamal, Yeh Cathy, Stanley Kenneth O.
- Large Language Models With Controllable Working Memory Li Daliang, Rawat Ankit Singh, Zaheer Manzil, Wang Xin, Lukasik Michal, Veit Andreas, Yu Felix, Kumar Sanjiv
- A Survey On Retrieval-augmented Text Generation Li Huayang, Su Yixuan, Cai Deng, Wang Yan, Liu Lemao
- Counterfactual Reasoning: Do Language Models Need World Knowledge For Causal Understanding? Li Jiaxuan, Yu Lang, Ettinger Allyson
- ELMER: A Non-autoregressive Pre-trained Language Model For Efficient And Effective Text Generation Li Junyi, Tang Tianyi, Zhao Wayne Xin, Nie Jian-yun, Wen Ji-rong
- LAVENDER: Unifying Video-language Understanding As Masked Language Modeling Li Linjie, Gan Zhe, Lin Kevin, Lin Chung-ching, Liu Zicheng, Liu Ce, Wang Lijuan
- Pre-trained Language Models For Interactive Decision-making Li Shuang, Puig Xavier, Paxton Chris, Du Yilun, Wang Clinton, Fan Linxi, Chen Tao, Huang De-an, Akyürek Ekin, Anandkumar Anima, Andreas Jacob, Mordatch Igor, Torralba Antonio, Zhu Yuke
- Evaluating Psychological Safety Of Large Language Models Li Xingxuan, Li Yutong, Qiu Lin, Joty Shafiq, Bing Lidong
- VALHALLA: Visual Hallucination For Machine Translation Li Yi, Panda Rameswar, Kim Yoon, Chen Chun-fu, Feris Rogerio, Cox David, Vasconcelos Nuno
- Accelerating Attention Through Gradient-based Learned Runtime Pruning Li Zheng, Ghodrati Soroush, Yazdanbakhsh Amir, Esmaeilzadeh Hadi, Kang Mingu
- Life After BERT: What Do Other Muppets Understand About Language? Lialin Vladislav, Zhao Kevin, Shivagunde Namrata, Rumshisky Anna
- Using Large Language Models To Generate Engaging Captions For Data Visualizations Liew Ashley, Mueller Klaus
- Gentus: Simulating User Behaviour And Language In Task-oriented Dialogues With Generative Transformers Lin Hsien-chin, Geishauser Christian, Feng Shutong, Lubis Nurul, Van Niekerk Carel, Heck Michael, Gašić Milica
- Contextual Transformer For Offline Meta Reinforcement Learning Lin Runji, Li Ye, Feng Xidong, Zhang Zhaowei, Fung Xian Hong Wu, Zhang Haifeng, Wang Jun, Du Yali, Yang Yaodong
- Do Ever Larger Octopi Still Amplify Reporting Biases? Evidence From Judgments Of Typical Colour Liu Fangyu, Eisenschlos Julian Martin, Cole Jeremy R., Collier Nigel
- Deplot: One-shot Visual Language Reasoning By Plot-to-table Translation Liu Fangyu, Eisenschlos Julian Martin, Piccinno Francesco, Krichene Syrine, Pang Chenxi, Lee Kenton, Joshi Mandar, Chen Wenhu, Collier Nigel, Altun Yasemin
- A Simple Meta-learning Paradigm For Zero-shot Intent Classification With Mixture Attention Mechanism Liu Han, Zhao Siyang, Zhang Xiaotong, Zhang Feng, Sun Junjie, Yu Hong, Zhang Xianchao
- A Generative User Simulator With Gpt-based Architecture And Goal State Tracking For Reinforced Multi-domain Dialog Systems Liu Hong, Cai Yucheng, Ou Zhijian, Huang Yi, Feng Junlan
- Rainier: Reinforced Knowledge Introspector For Commonsense Question Answering Liu Jiacheng, Hallinan Skyler, Lu Ximing, He Pengfei, Welleck Sean, Hajishirzi Hannaneh, Choi Yejin
- PSP: Pre-trained Soft Prompts For Few-shot Abstractive Summarization Liu Xiaochen, Gao Yang, Bai Yu, Li Jiawei, Hu Yinan, Huang Heyan, Chen Boxing
- Can Large Language Models Reason About Medical Questions? Liévin Valentin, Hother Christoffer Egeberg, Motzfeldt Andreas Geert, Winther Ole
- Relaxed Attention For Transformer Models Lohrenz Timo, Möller Björn, Li Zhengyang, Fingscheidt Tim
- Training Language Models To Follow Instructions With Human Feedback Long Ouyang, Jeff Wu, Xu Jiang, Diogo Almeida, Carroll L. Wainwright, Pamela Mishkin, Chong Zhang, Sandhini Agarwal, Katarina Slama, Alex Ray, John Schulman, Jacob Hilton, Fraser Kelton, Luke Miller, Maddie Simens, Amanda Askell, Peter Welinder, Paul Christiano, Jan Leike, Ryan Lowe
- Choose Your QA Model Wisely: A Systematic Study Of Generative And Extractive Readers For Question Answering Luo Man, Hashimoto Kazuma, Yavuz Semih, Liu Zhiwei, Baral Chitta, Zhou Yingbo
- Structured Like A Language Model: Analysing AI As An Automated Subject Magee Liam, Arora Vanicka, Munn Luke
- PERFECT: Prompt-free And Efficient Few-shot Learning With Language Models Mahabadi Rabeeh Karimi, Zettlemoyer Luke, Henderson James, Saeidi Marzieh, Mathias Lambert, Stoyanov Veselin, Yazdani Majid
- Edit5: Semi-autoregressive Text-editing With T5 Warm-start Mallinson Jonathan, Adamek Jakub, Malmi Eric, Severyn Aliaksei
- Text Generation With Text-editing Models Malmi Eric, Dong Yue, Mallinson Jonathan, Chuklin Aleksandr, Adamek Jakub, Mirylenka Daniil, Stahlberg Felix, Krause Sebastian, Kumar Shankar, Severyn Aliaksei
- Tangobert: Reducing Inference Cost By Using Cascaded Architecture Mamou Jonathan, Pereg Oren, Wasserblat Moshe, Schwartz Roy
- GAAMA 2.0: An Integrated System That Answers Boolean And Extractive Questions Mccarley Scott, Bornea Mihaela, Rosenthal Sara, Ferritto Anthony, Sultan Md Arafat, Sil Avirup, Florian Radu
- Chatbots In A Botnet World Mckee Forrest, Noever David
- Grounding Language With Visual Affordances Over Unstructured Data Mees Oier, Borja-diaz Jessica, Burgard Wolfram
- The Devil Is In The Details: On Models And Training Regimes For Few-shot Intent Classification Mesgar Mohsen, Tran Thy Thy, Glavas Goran, Gurevych Iryna
- Do As I Can, Not As I Say: Grounding Language In Robotic Affordances Michael Ahn, Anthony Brohan, Noah Brown, Yevgen Chebotar, Omar Cortes, Byron David, Chelsea Finn, Chuyuan Fu, Keerthana Gopalakrishnan, Karol Hausman, Alex Herzog, Daniel Ho, Jasmine Hsu, Julian Ibarz, Brian Ichter, Alex Irpan, Eric Jang, Rosario Jauregui Ruano, Kyle Jeffrey, Sally Jesmonth, Nikhil J Joshi, Ryan Julian, Dmitry Kalashnikov, Yuheng Kuang, Kuang-huei Lee, Sergey Levine, Yao Lu, Linda Luu, Carolina Parada, Peter Pastor, Jornell Quiambao, Kanishka Rao, Jarek Rettinghouse, Diego Reyes, Pierre Sermanet, Nicolas Sievers, Clayton Tan, Alexander Toshev, Vincent Vanhoucke, Fei Xia, Ted Xiao, Peng Xu, Sichun Xu, Mengyuan Yan, Andy Zeng
- Rarely A Problem? Language Models Exhibit Inverse Scaling In Their Predictions Following Few-type Quantifiers Michaelov James A., Bergen Benjamin K.
- Who Is GPT-3? An Exploration Of Personality, Values And Demographics Miotto Marilù, Rossberg Nicola, Kleinberg Bennett
- SKILL: Structured Knowledge Infusion For Large Language Models Moiseev Fedor, Dong Zhe, Alfonseca Enrique, Jaggi Martin
- Do Vision-and-language Transformers Learn Grounded Predicate-noun Dependencies? Nikolaus Mitja, Salin Emmanuelle, Ayache Stephane, Fourtassi Abdellah, Favre Benoit
- No Language Left Behind: Scaling Human-centered Machine Translation Nllb Team, Marta R. Costa-jussà, James Cross, Onur Çelebi, Maha Elbayad, Kenneth Heafield, Kevin Heffernan, Elahe Kalbassi, Janice Lam, Daniel Licht, Jean Maillard, Anna Sun, Skyler Wang, Guillaume Wenzek, Al Youngblood, Bapi Akula, Loic Barrault, Gabriel Mejia Gonzalez, Prangthip Hansanti, John Hoffman, Semarley Jarrett, Kaushik Ram Sadagopan, Dirk Rowe, Shannon Spruit, Chau Tran, Pierre Andrews, Necip Fazil Ayan, Shruti Bhosale, Sergey Edunov, Angela Fan, Cynthia Gao, Vedanuj Goswami, Francisco Guzmán, Philipp Koehn, Alexandre Mourachko, Christophe Ropers, Safiyyah Saleem, Holger Schwenk, Jeff Wang
- Minialbert: Model Distillation Via Parameter-efficient Recursive Transformers Nouriborji Mohammadmahdi, Rohanian Omid, Kouchaki Samaneh, Clifton David A.
- Neural Language Taskonomy: Which NLP Tasks Are The Most Predictive Of Fmri Brain Activity? Oota Subba Reddy, Arora Jashn, Agarwal Veeral, Marreddy Mounika, Gupta Manish, Surampudi Bapi Raju
- Follow-up Attention: An Empirical Study Of Developer And Neural Model Code Exploration Paltenghi Matteo, Pandita Rahul, Henley Austin Z., Ziegler Albert
- Dynamic Prompt Learning Via Policy Gradient For Semi-structured Mathematical Reasoning Pan Lu, Liang Qiu, Kai-wei Chang, Ying Nian Wu, Song-chun Zhu, Tanmay Rajpurohit, Peter Clark, Ashwin Kalyan
- Xfboost: Improving Text Generation With Controllable Decoders Peng Xiangyu, Sollami Michael
- Sparse And Dense Approaches For The Full-rank Retrieval Of Responses For Dialogues Penha Gustavo, Hauff Claudia
- Holistic Evaluation Of Language Models Percy Liang, Rishi Bommasani, Tony Lee, Dimitris Tsipras, Dilara Soylu, Michihiro Yasunaga, Yian Zhang, Deepak Narayanan, Yuhuai Wu, Ananya Kumar, Benjamin Newman, Binhang Yuan, Bobby Yan, Ce Zhang, Christian Cosgrove, Christopher D. Manning, Christopher Ré, Diana Acosta-navas, Drew A. Hudson, Eric Zelikman, Esin Durmus, Faisal Ladhak, Frieda Rong, Hongyu Ren, Huaxiu Yao, Jue Wang, Keshav Santhanam, Laurel Orr, Lucia Zheng, Mert Yuksekgonul, Mirac Suzgun, Nathan Kim, Neel Guha, Niladri Chatterji, Omar Khattab, Peter Henderson, Qian Huang, Ryan Chi, Sang Michael Xie, Shibani Santurkar, Surya Ganguli, Tatsunori Hashimoto, Thomas Icard, Tianyi Zhang, Vishrav Chaudhary, William Wang, Xuechen Li, Yifan Mai, Yuhui Zhang, Yuta Koreeda
- Visconde: Multi-document QA With GPT-3 And Neural Reranking Pereira Jayr, Fidalgo Robson, Lotufo Roberto, Nogueira Rodrigo
- Harnessing The Power Of Multi-task Pretraining For Ground-truth Level Natural Language Explanations Plüster Björn, Ambsdorf Jakob, Braach Lukas, Lee Jae Hee, Wermter Stefan
- Synchromesh: Reliable Code Generation From Pre-trained Language Models Poesia Gabriel, Polozov Oleksandr, Le Vu, Tiwari Ashish, Soares Gustavo, Meek Christopher, Gulwani Sumit
- Outliers Dimensions That Disrupt Transformers Are Driven By Frequency Puccetti Giovanni, Rogers Anna, Drozd Aleksandr, Dell'orletta Felice
- A Self-paced Mixed Distillation Method For Non-autoregressive Generation Qi Weizhen, Gong Yeyun, Shen Yelong, Jiao Jian, Yan Yu, Li Houqiang, Zhang Ruofei, Chen Weizhu, Duan Nan
- ELLE: Efficient Lifelong Pre-training For Emerging Data Qin Yujia, Zhang Jiajie, Lin Yankai, Liu Zhiyuan, Li Peng, Sun Maosong, Zhou Jie
- Entity-conditioned Question Generation For Robust Attention Distribution In Neural Information Retrieval Reddy Revanth Gangi, Sultan Md Arafat, Franz Martin, Sil Avirup, Ji Heng
- Learning To Diversify For Product Question Generation Roitman Haggai, Singer Uriel, Eshel Yotam, Nus Alexander, Kiperwasser Eliyahu
- Dialokg: Knowledge-structure Aware Task-oriented Dialogue Generation Rony Md Rashad Al Hasan, Usbeck Ricardo, Lehmann Jens
- Galactica: A Large Language Model For Science Ross Taylor, Marcin Kardas, Guillem Cucurull, Thomas Scialom, Anthony Hartshorn, Elvis Saravia, Andrew Poulton, Viktor Kerkez, Robert Stojnic
- Benchclamp: A Benchmark For Evaluating Language Models On Syntactic And Semantic Parsing Roy Subhro, Thomson Sam, Chen Tongfei, Shin Richard, Pauls Adam, Eisner Jason, Van Durme Benjamin
- The Goldilocks Of Pragmatic Understanding: Fine-tuning Strategy Matters For Implicature Resolution By Llms Ruis Laura, Khan Akbir, Biderman Stella, Hooker Sara, Rocktäschel Tim, Grefenstette Edward
- Alexatm 20B: Few-shot Learning Using A Large-scale Multilingual Seq2seq Model Saleh Soltan, Shankar Ananthakrishnan, Jack Fitzgerald, Rahul Gupta, Wael Hamza, Haidar Khan, Charith Peris, Stephen Rawls, Andy Rosenbaum, Anna Rumshisky, Chandana Satya Prakash, Mukund Sridhar, Fabian Triefenbach, Apurv Verma, Gokhan Tur, Prem Natarajan
- Self-repetition In Abstractive Neural Summarizers Salkar Nikita, Trikalinos Thomas, Wallace Byron C., Nenkova Ani
- Knowledge Prompts: Injecting World Knowledge Into Language Models Through Soft Prompts Santos Cicero Nogueira Dos, Dong Zhe, Cer Daniel, Nham John, Shakeri Siamak, Ni Jianmo, Sung Yun-hsuan
- Language Models Are Greedy Reasoners: A Systematic Formal Analysis Of Chain-of-thought Saparov Abulhair, He He
- Confident Adaptive Language Modeling Schuster Tal, Fisch Adam, Gupta Jai, Dehghani Mostafa, Bahri Dara, Tran Vinh Q., Tay Yi, Metzler Donald
- Bird-eye Transformers For Text Generation Models Sha Lei, Song Yuhang, Yordanov Yordan, Salvatori Tommaso, Lukasiewicz Thomas
- Overcoming Catastrophic Forgetting Beyond Continual Learning: Balanced Training For Neural Machine Translation Shao Chenze, Feng Yang
- Deanthropomorphising NLP: Can A Language Model Be Conscious? Shardlow Matthew, Przybyła Piotr
- Incorporating Domain Knowledge Through Task Augmentation For Front-end Javascript Code Generation Shen Sijie, Zhu Xiang, Dong Yihong, Guo Qizhi, Zhen Yankun, Li Ge
- Repository-level Prompt Generation For Large Language Models Of Code Shrivastava Disha, Larochelle Hugo, Tarlow Daniel
- React: Synergizing Reasoning And Acting In Language Models Shunyu Yao, Jeffrey Zhao, Dian Yu, Nan Du, Izhak Shafran, Karthik Narasimhan, Yuan Cao
- Reasoning With Language Model Prompting: A Survey Shuofei Qiao, Yixin Ou, Ningyu Zhang, Xiang Chen, Yunzhi Yao, Shumin Deng, Chuanqi Tan, Fei Huang, Huajun Chen
- Gpt-neox-20b: An Open-source Autoregressive Language Model Sid Black, Stella Biderman, Eric Hallahan, Quentin Anthony, Leo Gao, Laurence Golding, Horace He, Connor Leahy, Kyle Mcdonell, Jason Phang, Michael Pieler, Usvsn Sai Prashanth, Shivanshu Purohit, Laria Reynolds, Jonathan Tow, Ben Wang, Samuel Weinbach
- Explaining Patterns In Data With Language Models Via Interpretable Autoprompting Singh Chandan, Morris John X., Aneja Jyoti, Rush Alexander M., Gao Jianfeng
- Progprompt: Generating Situated Robot Task Plans Using Large Language Models Singh Ishika, Blukis Valts, Mousavian Arsalan, Goyal Ankit, Xu Danfei, Tremblay Jonathan, Fox Dieter, Thomason Jesse, Garg Animesh
- Events Realm: Event Reasoning Of Entity States Via Language Models Spiliopoulou Evangelia, Pagnoni Artidoro, Bisk Yonatan, Hovy Eduard
- QUILL: Query Intent With Large Language Models Using Retrieval Augmentation And Multi-stage Distillation Srinivasan Krishna, Raman Karthik, Samanta Anupam, Liao Lingrui, Bertelli Luca, Bendersky Mike
- Climb: A Continual Learning Benchmark For Vision-and-language Tasks Srinivasan Tejas, Chang Ting-yun, Alva Leticia Leonor Pinto, Chochlakis Georgios, Rostami Mohammad, Thomason Jesse
- Beyond The Imitation Game: Quantifying And Extrapolating The Capabilities Of Language Models Srivastava Aarohi, Rastogi Abhinav, Rao Abhishek, Shoeb Abu Awal Md, Abid Abubakar, Fisch Adam, Brown Adam R., Santoro Adam, Gupta Aditya, Garriga-alonso Adrià, Kluska Agnieszka, Lewkowycz Aitor, Agarwal Akshat, Power Alethea, Ray Alex, Warstadt Alex, Kocurek Alexander W., Safaya Ali, Tazarv Ali, Xiang Alice, Parrish Alicia, Nie Allen, Hussain Aman, Askell Amanda, Dsouza Amanda, Slone Ambrose, Rahane Ameet, Iyer Anantharaman S., Andreassen Anders, Madotto Andrea, Santilli Andrea, Stuhlmüller Andreas, Dai Andrew, La Andrew, Lampinen Andrew, Zou Andy, Jiang Angela, Chen Angelica, Vuong Anh, Gupta Animesh, Gottardi Anna, Norelli Antonio, Venkatesh Anu, Gholamidavoodi Arash, Tabassum Arfa, Menezes Arul, Kirubarajan Arun, Mullokandov Asher, Sabharwal Ashish, Herrick Austin, Efrat Avia, Erdem Aykut, Karakaş Ayla, Roberts B. Ryan, Loe Bao Sheng, Zoph Barret, Bojanowski Bartłomiej, Özyurt Batuhan, Hedayatnia Behnam, Neyshabur Behnam, Inden Benjamin, Stein Benno, Ekmekci Berk, Lin Bill Yuchen, Howald Blake, Orinion Bryan, Diao Cameron, Dour Cameron, Stinson Catherine, Argueta Cedrick, Ramírez César Ferri, Singh Chandan, Rathkopf Charles, Meng Chenlin, Baral Chitta, Wu Chiyu, Callison-burch Chris, Waites Chris, Voigt Christian, Manning Christopher D., Potts Christopher, Ramirez Cindy, Rivera Clara E., Siro Clemencia, Raffel Colin, Ashcraft Courtney, Garbacea Cristina, Sileo Damien, Garrette Dan, Hendrycks Dan, Kilman Dan, Roth Dan, Freeman Daniel, Khashabi Daniel, Levy Daniel, González Daniel Moseguí, Perszyk Danielle, Hernandez Danny, Chen Danqi, Ippolito Daphne, Gilboa Dar, Dohan David, Drakard David, Jurgens David, Datta Debajyoti, Ganguli Deep, Emelin Denis, Kleyko Denis, Yuret Deniz, Chen Derek, Tam Derek, Hupkes Dieuwke, Misra Diganta, Buzan Dilyar, Mollo Dimitri Coelho, Yang Diyi, Lee Dong-ho, Schrader Dylan, Shutova Ekaterina, Cubuk Ekin Dogus, Segal Elad, Hagerman Eleanor, Barnes Elizabeth, Donoway Elizabeth, Pavlick Ellie, Rodola Emanuele, Lam Emma, Chu Eric, Tang Eric, Erdem Erkut, Chang Ernie, Chi Ethan A., Dyer Ethan, Jerzak Ethan, Kim Ethan, Manyasi Eunice Engefu, Zheltonozhskii Evgenii, Xia Fanyue, Siar Fatemeh, Martínez-plumed Fernando, Happé Francesca, Chollet Francois, Rong Frieda, Mishra Gaurav, Winata Genta Indra, De Melo Gerard, Kruszewski Germán, Parascandolo Giambattista, Mariani Giorgio, Wang Gloria, Jaimovitch-lópez Gonzalo, Betz Gregor, Gur-ari Guy, Galijasevic Hana, Kim Hannah, Rashkin Hannah, Hajishirzi Hannaneh, Mehta Harsh, Bogar Hayden, Shevlin Henry, Schütze Hinrich, Yakura Hiromu, Zhang Hongming, Wong Hugh Mee, Ng Ian, Noble Isaac, Jumelet Jaap, Geissinger Jack, Kernion Jackson, Hilton Jacob, Lee Jaehoon, Fisac Jaime Fernández, Simon James B., Koppel James, Zheng James, Zou James, Kocoń Jan, Thompson Jana, Wingfield Janelle, Kaplan Jared, Radom Jarema, Sohl-dickstein Jascha, Phang Jason, Wei Jason, Yosinski Jason, Novikova Jekaterina, Bosscher Jelle, Marsh Jennifer, Kim Jeremy, Taal Jeroen, Engel Jesse, Alabi Jesujoba, Xu Jiacheng, Song Jiaming, Tang Jillian, Waweru Joan, Burden John, Miller John, Balis John U., Batchelder Jonathan, Berant Jonathan, Frohberg Jörg, Rozen Jos, Hernandez-orallo Jose, Boudeman Joseph, Guerr Joseph, Jones Joseph, Tenenbaum Joshua B., Rule Joshua S., Chua Joyce, Kanclerz Kamil, Livescu Karen, Krauth Karl, Gopalakrishnan Karthik, Ignatyeva Katerina, Markert Katja, Dhole Kaustubh D., Gimpel Kevin, Omondi Kevin, Mathewson Kory, Chiafullo Kristen, Shkaruta Ksenia, Shridhar Kumar, Mcdonell Kyle, Richardson Kyle, Reynolds Laria, Gao Leo, Zhang Li, Dugan Liam, Qin Lianhui, Contreras-ochando Lidia, Morency Louis-philippe, Moschella Luca, Lam Lucas, Noble Lucy, Schmidt Ludwig, He Luheng, Colón Luis Oliveros, Metz Luke, Şenel Lütfi Kerem, Bosma Maarten, Sap Maarten, Ter Hoeve Maartje, Farooqi Maheen, Faruqui Manaal, Mazeika Mantas, Baturan Marco, Marelli Marco, Maru Marco, Quintana Maria Jose Ramírez, Tolkiehn Marie, Giulianelli Mario, Lewis Martha, Potthast Martin, Leavitt Matthew L., Hagen Matthias, Schubert Mátyás, Baitemirova Medina Orduna, Arnaud Melody, Mcelrath Melvin, Yee Michael A., Cohen Michael, Gu Michael, Ivanitskiy Michael, Starritt Michael, Strube Michael, Swędrowski Michał, Bevilacqua Michele, Yasunaga Michihiro, Kale Mihir, Cain Mike, Xu Mimee, Suzgun Mirac, Walker Mitch, Tiwari Mo, Bansal Mohit, Aminnaseri Moin, Geva Mor, Gheini Mozhdeh, T Mukund Varma, Peng Nanyun, Chi Nathan A., Lee Nayeon, Krakover Neta Gur-ari, Cameron Nicholas, Roberts Nicholas, Doiron Nick, Martinez Nicole, Nangia Nikita, Deckers Niklas, Muennighoff Niklas, Keskar Nitish Shirish, Iyer Niveditha S., Constant Noah, Fiedel Noah, Wen Nuan, Zhang Oliver, Agha Omar, Elbaghdadi Omar, Levy Omer, Evans Owain, Casares Pablo Antonio Moreno, Doshi Parth, Fung Pascale, Liang Paul Pu, Vicol Paul, Alipoormolabashi Pegah, Liao Peiyuan, Liang Percy, Chang Peter, Eckersley Peter, Htut Phu Mon, Hwang Pinyu, Miłkowski Piotr, Patil Piyush, Pezeshkpour Pouya, Oli Priti, Mei Qiaozhu, Lyu Qing, Chen Qinlang, Banjade Rabin, Rudolph Rachel Etta, Gabriel Raefer, Habacker Rahel, Risco Ramon, Millière Raphaël, Garg Rhythm, Barnes Richard, Saurous Rif A., Arakawa Riku, Raymaekers Robbe, Frank Robert, Sikand Rohan, Novak Roman, Sitelew Roman, Lebras Ronan, Liu Rosanne, Jacobs Rowan, Zhang Rui, Salakhutdinov Ruslan, Chi Ryan, Lee Ryan, Stovall Ryan, Teehan Ryan, Yang Rylan, Singh Sahib, Mohammad Saif M., Anand Sajant, Dillavou Sam, Shleifer Sam, Wiseman Sam, Gruetter Samuel, Bowman Samuel R., Schoenholz Samuel S., Han Sanghyun, Kwatra Sanjeev, Rous Sarah A., Ghazarian Sarik, Ghosh Sayan, Casey Sean, Bischoff Sebastian, Gehrmann Sebastian, Schuster Sebastian, Sadeghi Sepideh, Hamdan Shadi, Zhou Sharon, Srivastava Shashank, Shi Sherry, Singh Shikhar, Asaadi Shima, Gu Shixiang Shane, Pachchigar Shubh, Toshniwal Shubham, Upadhyay Shyam, Debnath Shyamolima (Shammie), Shakeri Siamak, Thormeyer Simon, Melzi Simone, Reddy Siva, Makini Sneha Priscilla, Lee Soo-hwan, Torene Spencer, Hatwar Sriharsha, Dehaene Stanislas, Divic Stefan, Ermon Stefano, Biderman Stella, Lin Stephanie, Prasad Stephen, Piantadosi Steven T., Shieber Stuart M., Misherghi Summer, Kiritchenko Svetlana, Mishra Swaroop, Linzen Tal, Schuster Tal, Li Tao, Yu Tao, Ali Tariq, Hashimoto Tatsu, Wu Te-lin, Desbordes Théo, Rothschild Theodore, Phan Thomas, Wang Tianle, Nkinyili Tiberius, Schick Timo, Kornev Timofei, Tunduny Titus, Gerstenberg Tobias, Chang Trenton, Neeraj Trishala, Khot Tushar, Shultz Tyler, Shaham Uri, Misra Vedant, Demberg Vera, Nyamai Victoria, Raunak Vikas, Ramasesh Vinay, Prabhu Vinay Uday, Padmakumar Vishakh, Srikumar Vivek, Fedus William, Saunders William, Zhang William, Vossen Wout, Ren Xiang, Tong Xiaoyu, Zhao Xinran, Wu Xinyi, Shen Xudong, Yaghoobzadeh Yadollah, Lakretz Yair, Song Yangqiu, Bahri Yasaman, Choi Yejin, Yang Yichi, Hao Yiding, Chen Yifu, Belinkov Yonatan, Hou Yu, Hou Yufang, Bai Yuntao, Seid Zachary, Zhao Zhuoye, Wang Zijian, Wang Zijie J., Wang Zirui, Wu Ziyi
- Towards Zero-shot And Few-shot Table Question Answering Using GPT-3 Srivastava Pragya, Ganu Tanuja, Guha Saikat
- Conciseness: An Overlooked Language Task Stahlberg Felix, Kumar Aashish, Alberti Chris, Kumar Shankar
- Pseudo-ood Training For Robust Language Models Sundararaman Dhanasekar, Mehta Nikhil, Carin Lawrence
- Chatgpt: The End Of Online Exam Integrity? Susnjak Teo
- The AI Teacher Test: Measuring The Pedagogical Ability Of Blender And GPT-3 In Educational Dialogues Tack Anaïs, Piech Chris
- On The Effect Of Pre-training For Transformer In Different Modality On Offline Reinforcement Learning Takagi Shiro
- News Summarization And Evaluation In The Era Of GPT-3 Tanya Goyal, Junyi Jessy Li, Greg Durrett
- Compression Of Generative Pre-trained Language Models Via Quantization Tao Chaofan, Hou Lu, Zhang Wei, Shang Lifeng, Jiang Xin, Liu Qun, Luo Ping, Wong Ngai
- Interleaving Retrieval With Chain-of-thought Reasoning For Knowledge-intensive Multi-step Questions Trivedi Harsh, Balasubramanian Niranjan, Khot Tushar, Sabharwal Ashish
- Do Large Language Models Know What Humans Know? Trott Sean, Jones Cameron, Chang Tyler, Michaelov James, Bergen Benjamin
- Adavae: Exploring Adaptive Gpt-2s In Variational Auto-encoders For Language Modeling Tu Haoqin, Yang Zhongliang, Yang Jinshuai, Huang Yongfeng
- Help Me Write A Poem: Instruction Tuning As A Vehicle For Collaborative Poetry Writing Tuhin Chakrabarty, Vishakh Padmakumar, He He
- Planbench: An Extensible Benchmark For Evaluating Large Language Models On Planning And Reasoning About Change Valmeekam Karthik, Marquez Matthew, Olmo Alberto, Sreedharan Sarath, Kambhampati Subbarao
- UGIF: UI Grounded Instruction Following Venkatesh Sagar Gubbi, Talukdar Partha, Narayanan Srini
- Prompting Palm For Translation: Assessing Strategies And Performance Vilar David, Freitag Markus, Cherry Colin, Luo Jiaming, Ratnakar Viresh, Foster George
- Will We Run Out Of Data? Limits Of LLM Scaling Based On Human-generated Data Villalobos Pablo, Ho Anson, Sevilla Jaime, Besiroglu Tamay, Heim Lennart, Hobbhahn Marius
- Unite: Unified Translation Evaluation Wan Yu, Liu Dayiheng, Yang Baosong, Zhang Haibo, Chen Boxing, Wong Derek F., Chao Lidia S.
- Can Language Models Make Fun? A Case Study In Chinese Comical Crosstalk Wang Benyou, Wu Xiangbo, Liu Xiaokang, Li Jianquan, Tiwari Prayag, Xie Qianqian
- Rethinking Textual Adversarial Defense For Pre-trained Language Models Wang Jiayi, Bao Rongzhou, Zhang Zhuosheng, Zhao Hai
- Qrelscore: Better Evaluating Generated Questions With Deeper Understanding Of Context-aware Relevance Wang Xiaoqiang, Liu Bang, Tang Siliang, Wu Lingfei
- S-prompts Learning With Pre-trained Transformers: An Occam's Razor For Domain Incremental Learning Wang Yabin, Huang Zhiwu, Hong Xiaopeng
- Adamix: Mixture-of-adaptations For Parameter-efficient Model Tuning Wang Yaqing, Agarwal Sahaj, Mukherjee Subhabrata, Liu Xiaodong, Gao Jing, Awadallah Ahmed Hassan, Gao Jianfeng
- Controlling Styles In Neural Machine Translation With Activation Prompt Wang Yifan, Sun Zewei, Cheng Shanbo, Zheng Weiguo, Wang Mingxuan
- Pan More Gold From The Sand: Refining Open-domain Dialogue Training With Noisy Self-retrieval Generation Wang Yihe, Li Yitong, Wang Yasheng, Mi Fei, Zhou Pingyi, Wang Xin, Liu Jin, Jiang Xin, Liu Qun
- Unified Multimodal Model With Unlikelihood Training For Visual Dialog Wang Zihao, Wang Junli, Jiang Changjun
- Jiuzhang: A Chinese Pre-trained Language Model For Mathematical Problem Understanding Wayne Xin Zhao, Kun Zhou, Zheng Gong, Beichen Zhang, Yuanhang Zhou, Jing Sha, Zhigang Chen, Shijin Wang, Cong Liu, Ji-rong Wen
- EGCR: Explanation Generation For Conversational Recommendation Wen Bingbing, Bu Xiaoning, Shah Chirag
- Large Language Models Are Few(1)-shot Table Reasoners Wenhu Chen
- Language Models As Zero-shot Planners: Extracting Actionable Knowledge For Embodied Agents Wenlong Huang, Pieter Abbeel, Deepak Pathak, Igor Mordatch
- Extreme Compression For Pre-trained Transformers Made Simple And Efficient Wu Xiaoxia, Yao Zhewei, Zhang Minjia, Li Conglong, He Yuxiong
- CLIP Also Understands Text: Prompting CLIP For Phrase Understanding Yan An, Li Jiacheng, Zhu Wanrong, Lu Yujie, Wang William Yang, Mcauley Julian
- GLUE-X: Evaluating Natural Language Understanding Models From An Out-of-distribution Generalization Perspective Yang Linyi, Zhang Shuibai, Qin Libo, Li Yafu, Wang Yidong, Liu Hanmeng, Wang Jindong, Xie Xing, Zhang Yue
- Building Knowledge-grounded Dialogue Systems With Graph-based Semantic Modeling Yang Yizhe, Huang Heyan, Gao Yang, Li Jiawei
- Zeroquant: Efficient And Affordable Post-training Quantization For Large-scale Transformers Yao Zhewei, Aminabadi Reza Yazdani, Zhang Minjia, Wu Xiaoxia, Li Conglong, He Yuxiong
- Random-ltd: Random And Layerwise Token Dropping Brings Efficient Training For Large-scale Transformers Yao Zhewei, Wu Xiaoxia, Li Conglong, Holmes Connor, Zhang Minjia, Li Cheng, He Yuxiong
- Language Models Are General-purpose Interfaces Yaru Hao, Haoyu Song, Li Dong, Shaohan Huang, Zewen Chi, Wenhui Wang, Shuming Ma, Furu Wei
- Modeling Multi-hop Question Answering As Single Sequence Prediction Yavuz Semih, Hashimoto Kazuma, Zhou Yingbo, Keskar Nitish Shirish, Xiong Caiming
- Guess The Instruction! Flipped Learning Makes Language Models Stronger Zero-shot Learners Ye Seonghyeon, Kim Doyoung, Jang Joel, Shin Joongbo, Seo Minjoon
- Complementary Explanations For Effective In-context Learning Ye Xi, Iyer Srinivasan, Celikyilmaz Asli, Stoyanov Ves, Durrett Greg, Pasunuru Ramakanth
- Alzheimer's Diagnosis And Generation-based Chatbot Using Hierarchical Attention And Transformer Yeong Park Jun, Jong Shin Su, Hwan Choi Chang, Jae Lee Jung, Sang-il Choi
- Transcending Scaling Laws With 0.1% Extra Compute Yi Tay, Jason Wei, Hyung Won Chung, Vinh Q. Tran, David R. So, Siamak Shakeri, Xavier Garcia, Huaixiu Steven Zheng, Jinfeng Rao, Aakanksha Chowdhery, Denny Zhou, Donald Metzler, Slav Petrov, Neil Houlsby, Quoc V. Le, Mostafa Dehghani
- Active Example Selection For In-context Learning Yiming Zhang, Shi Feng, Chenhao Tan
- BLOOM+1: Adding Language Support To BLOOM For Zero-shot Prompting Yong Zheng-xin, Schoelkopf Hailey, Muennighoff Niklas, Aji Alham Fikri, Adelani David Ifeoluwa, Almubarak Khalid, Bari M Saiful, Sutawika Lintang, Kasai Jungo, Baruwa Ahmed, Winata Genta Indra, Biderman Stella, Raff Edward, Radev Dragomir, Nikoulina Vassilina
- Scaling Autoregressive Models For Content-rich Text-to-image Generation Yu Jiahui, Xu Yuanzhong, Koh Jing Yu, Luong Thang, Baid Gunjan, Wang Zirui, Vasudevan Vijay, Ku Alexander, Yang Yinfei, Ayan Burcu Karagol, Hutchinson Ben, Han Wei, Parekh Zarana, Li Xin, Zhang Han, Baldridge Jason, Wu Yonghui
- KRLS: Improving End-to-end Response Generation In Task Oriented Dialog With Reinforced Keywords Learning Yu Xiao, Wu Qingyang, Qian Kun, Yu Zhou
- Bilaterally Slimmable Transformer For Elastic And Efficient Visual Question Answering Yu Zhou, Jin Zitian, Yu Jun, Xu Mingliang, Wang Hongbo, Fan Jianping
- Wudaomm: A Large-scale Multi-modal Dataset For Pre-training Models Yuan Sha, Zhao Shuai, Leng Jiahong, Xue Zhao, Zhao Hanyu, Liu Peiyu, Gong Zheng, Zhao Wayne Xin, Li Junyi, Tang Jie
- Selecting Better Samples From Pre-trained Llms: A Case Study On Question Generation Yuan Xingdi, Wang Tong, Wang Yen-hsiang, Fine Emery, Abdelghani Rania, Lucas Pauline, Sauzéon Hélène, Oudeyer Pierre-yves
- Competition-level Code Generation With Alphacode Yujia Li, David Choi, Junyoung Chung, Nate Kushman, Julian Schrittwieser, Rémi Leblond, Tom Eccles, James Keeling, Felix Gimeno, Agustin Dal Lago, Thomas Hubert, Peter Choy, Cyprien De Masson D'autume, Igor Babuschkin, Xinyun Chen, Po-sen Huang, Johannes Welbl, Sven Gowal, Alexey Cherepanov, James Molloy, Daniel J. Mankowitz, Esme Sutherland Robson, Pushmeet Kohli, Nando De Freitas, Koray Kavukcuoglu, Oriol Vinyals
- Dyrex: Dynamic Query Representation For Extractive Question Answering Zaratiana Urchade, Khbir Niama El, Núñez Dennis, Holat Pierre, Tomeh Nadi, Charnois Thierry
- Better Pre-training By Reducing Representation Confusion Zhang Haojie, Liang Mingfei, Xie Ruobing, Sun Zhenlong, Zhang Bo, Lin Leyu
- Can Offline Reinforcement Learning Help Natural Language Understanding? Zhang Ziqi, Wang Yile, Zhang Yue, Wang Donglin
- Lite Unified Modeling For Discriminative Reading Comprehension Zhao Yilin, Zhao Hai, Shen Libin, Zhao Yinggong
- MSDF: A General Open-domain Multi-skill Dialog Framework Zhao Yu, Hu Xinshuo, Li Yunxin, Hu Baotian, Li Dongfang, Chen Sichao, Wang Xiaolong
- Learning To Decompose: Hypothetical Question Decomposition Based On Comparable Texts Zhou Ben, Richardson Kyle, Yu Xiaodong, Roth Dan
- Binding Language Models In Symbolic Languages Zhoujun Cheng, Tianbao Xie, Peng Shi, Chengzu Li, Rahul Nadkarni, Yushi Hu, Caiming Xiong, Dragomir Radev, Mari Ostendorf, Luke Zettlemoyer, Noah A. Smith, Tao Yu
- A Survey On GPT-3 Zong Mingyu, Krishnamachari Bhaskar
- Efficient Long Sequence Modeling Via State Space Augmented Transformer Zuo Simiao, Liu Xiaodong, Jiao Jian, Charles Denis, Manavoglu Eren, Zhao Tuo, Gao Jianfeng
- Attention Over Pre-trained Sentence Embeddings For Long Document Classification Abdaoui Amine, Dutta Sourav
- VD-GR: Boosting Visual Dialog With Cascaded Spatial-temporal Multi-modal Graphs Abdessaied Adnen, Shi Lei, Bulling Andreas
- LMRL Gym: Benchmarks For Multi-turn Reinforcement Learning With Language Models Abdulhai Marwa, White Isadora, Snell Charlie, Sun Charles, Hong Joey, Zhai Yuexiang, Xu Kelvin, Levine Sergey
- Beyond Traditional Teaching: The Potential Of Large Language Models And Chatbots In Graduate Engineering Education Abedi Mahyar, Alshybani Ibrahem, Shahadat Muhammad Rubayat Bin, Murillo Michael S.
- Inpars Toolkit: A Unified And Reproducible Synthetic Data Generation Pipeline For Neural Information Retrieval Abonizio Hugo, Bonifacio Luiz, Jeronymo Vitor, Lotufo Roberto, Zavrel Jakub, Nogueira Rodrigo
- Comparative Analysis Of Chatgpt, GPT-4, And Microsoft Bing Chatbots For GRE Test Abu-haifa Mohammad, Etawi Bara'a, Alkhatatbeh Huthaifa, Ababneh Ayman
- The ADAIO System At The BEA-2023 Shared Task On Generating AI Teacher Responses In Educational Dialogues Adigwe Adaeze, Yuan Zheng
- KITLM: Domain-specific Knowledge Integration Into Language Models For Question Answering Agarwal Ankush, Gawade Sakharam, Azad Amar Prakash, Bhattacharyya Pushpak
- Bring Your Own KG: Self-supervised Program Synthesis For Zero-shot KGQA Agarwal Dhruv, Das Rajarshi, Khosla Sopan, Gangadharaiah Rashmi
- Can Large Language Models Be Good Path Planners? A Benchmark And Investigation On Spatial-temporal Reasoning Aghzal Mohamed, Plaku Erion, Yao Ziyu
- Are Llms The Master Of All Trades?: Exploring Domain-agnostic Reasoning Skills Of Llms Agrawal Shrivats
- Multimodal Representations For Teacher-guided Compositional Visual Reasoning Aissa Wafa, Ferecatu Marin, Crucianu Michel
- Can We Trust The Evaluation On Chatgpt? Aiyappa Rachith, An Jisun, Kwak Haewoon, Ahn Yong-yeol
- The Emergence Of Essential Sparsity In Large Pre-trained Models: The Weights That Matter Ajay Jaiswal, Shiwei Liu, Tianlong Chen, Zhangyang Wang
- Rest Meets React: Self-improvement For Multi-step Reasoning LLM Agent Aksitov Renat, Miryoosefi Sobhan, Li Zonglin, Li Daliang, Babayan Sheila, Kopparapu Kavya, Fisher Zachary, Guo Ruiqi, Prakash Sushant, Srinivasan Pranesh, Zaheer Manzil, Yu Felix, Kumar Sanjiv
- RL4F: Generating Natural Language Feedback With Reinforcement Learning For Repairing Model Outputs Akyürek Afra Feyza, Akyürek Ekin, Madaan Aman, Kalyan Ashwin, Clark Peter, Wijaya Derry, Tandon Niket
- Chatgpt And Beyond: The Generative AI Revolution In Education Al-smadi Mohammad
- The Falcon Series Of Open Language Models Almazrouei Ebtesam, Alobeidli Hamza, Alshamsi Abdulaziz, Cappelli Alessandro, Cojocaru Ruxandra, Debbah Mérouane, Goffinet Étienne, Hesslow Daniel, Launay Julien, Malartic Quentin, Mazzotta Daniele, Noune Badreddine, Pannier Baptiste, Penedo Guilherme
- Becoming Self-instruct: Introducing Early Stopping Criteria For Minimal Instruct Tuning Alshikh Waseem, Daaboul Manhal, Goddard Kirk, Imel Brock, Kamble Kiran, Kulkarni Parikshith, Russak Melisa
- Steering Large Language Models For Machine Translation With Finetuning And In-context Learning Alves Duarte M., Guerreiro Nuno M., Alves João, Pombal José, Rei Ricardo, De Souza José G. C., Colombo Pierre, Martins André F. T.
- L-eval: Instituting Standardized Evaluation For Long Context Language Models An Chenxin, Gong Shansan, Zhong Ming, Zhao Xingjian, Li Mukai, Zhang Jun, Kong Lingpeng, Qiu Xipeng
- Openleaf: Open-domain Interleaved Image-text Generation And Evaluation An Jie, Yang Zhengyuan, Li Linjie, Wang Jianfeng, Lin Kevin, Liu Zicheng, Wang Lijuan, Luo Jiebo
- On Generative Agents In Recommendation An Zhang, Yuxin Chen, Leheng Sheng, Xiang Wang, Tat-seng Chua
- Query Understanding In The Age Of Large Language Models Anand Avishek, V Venktesh, Anand Abhijit, Setty Vinay
- Studying The Impacts Of Pre-training Using Chatgpt-generated Text On Downstream Tasks Anand Sarthak
- Expel: LLM Agents Are Experiential Learners Andrew Zhao, Daniel Huang, Quentin Xu, Matthieu Lin, Yong-jin Liu, Gao Huang
- Augmenting Llms With Knowledge: A Survey On Hallucination Prevention Andriopoulos Konstantinos, Pouwelse Johan
- Detecting And Preventing Hallucinations In Large Vision Language Models Anisha Gunjal, Jihan Yin, Erhan Bas
- Scaling Laws For Language Encoding Models In Fmri Antonello Richard, Vaidya Aditya, Huth Alexander G.
- Agenttuning: Enabling Generalized Agent Abilities For Llms Aohan Zeng, Mingdao Liu, Rui Lu, Bowen Wang, Xiao Liu, Yuxiao Dong, Jie Tang
- A Memory Model For Question Answering From Streaming Data Supported By Rehearsal And Anticipation Of Coreference Information Araujo Vladimir, Soto Alvaro, Moens Marie-francine
- Chainforge: A Visual Toolkit For Prompt Engineering And LLM Hypothesis Testing Arawjo Ian, Swoopes Chelse, Vaithilingam Priyan, Wattenberg Martin, Glassman Elena
- Llm2loss: Leveraging Language Models For Explainable Model Diagnostics Ardeshir Shervin
- Leancontext: Cost-efficient Domain-specific Question Answering Using Llms Arefeen Md Adnan, Debnath Biplob, Chakradhar Srimat
- Unmasking The Giant: A Comprehensive Evaluation Of Chatgpt's Proficiency In Coding Algorithms And Data Structures Arefin Sayed Erfan, Heya Tasnia Ashrafi, Al-qudah Hasan, Ineza Ynes, Serwadda Abdul
- GPT-4 Can't Reason Arkoudas Konstantine
- AI And Education: An Investigation Into The Use Of Chatgpt For Systems Thinking Arndt Holger
- Learning And Leveraging Verifiers To Improve Planning Capabilities Of Pre-trained Language Models Arora Daman, Kambhampati Subbarao
- Towards A Unified Multimodal Reasoning Framework Arun Abhinav, Mal Dipendra Singh, Soni Mehul, Sawada Tomohiro
- Mitigating Open-vocabulary Caption Hallucinations Assaf Ben-kish, Moran Yanuka, Morris Alper, Raja Giryes, Hadar Averbuch-elor
- Instruction-tuning Aligns Llms To The Human Brain Aw Khai Loong, Montariol Syrielle, Alkhamissi Badr, Schrimpf Martin, Bosselut Antoine
- Using Large Language Models For Zero-shot Natural Language Generation From Knowledge Graphs Axelsson Agnes, Skantze Gabriel
- Scaling Transformer To 1M Tokens And Beyond With RMT Aydar Bulatov, Yuri Kuratov, Yermek Kapushev, Mikhail S. Burtsev
- RRAML: Reinforced Retrieval Augmented Machine Learning Bacciu Andrea, Cuconasu Florin, Siciliano Federico, Silvestri Fabrizio, Tonellotto Nicola, Trappolini Giovanni
- Fauno: The Italian Large Language Model That Will Leave You Senza Parole! Bacciu Andrea, Trappolini Giovanni, Santilli Andrea, Rodolà Emanuele, Silvestri Fabrizio
- Promptcrafter: Crafting Text-to-image Prompt Through Mixed-initiative Dialogue With LLM Baek Seungho, Im Hyerin, Ryu Jiseung, Park Juhyeong, Lee Takyeon
- Evaluating Chatgpt As A Question Answering System: A Comprehensive Analysis And Comparison With Existing Models Bahak Hossein, Taheri Farzaneh, Zojaji Zahra, Kazemi Arefeh
- Llms-augmented Contextual Bandit Baheri Ali, Alm Cecilia O.
- Leftover Lunch: Advantage-based Offline Reinforcement Learning For Language Models Baheti Ashutosh, Lu Ximing, Brahman Faeze, Bras Ronan Le, Sap Maarten, Riedl Mark
- Chatgpt: Applications, Opportunities, And Threats Bahrini Aram, Khamoshifar Mohammadsadra, Abbasimehr Hossein, Riggs Robert J., Esmaeili Maryam, Majdabadkohne Rastin Mastali, Pasehvar Morteza
- Schema-driven Information Extraction From Heterogeneous Tables Bai Fan, Kang Junmo, Stanovsky Gabriel, Freitag Dayne, Dredze Mark, Ritter Alan
- Qwen Technical Report Bai Jinze, Bai Shuai, Chu Yunfei, Cui Zeyu, Dang Kai, Deng Xiaodong, Fan Yang, Ge Wenbin, Han Yu, Huang Fei, Hui Binyuan, Ji Luo, Li Mei, Lin Junyang, Lin Runji, Liu Dayiheng, Liu Gao, Lu Chengqiang, Lu Keming, Ma Jianxin, Men Rui, Ren Xingzhang, Ren Xuancheng, Tan Chuanqi, Tan Sinan, Tu Jianhong, Wang Peng, Wang Shijie, Wang Wei, Wu Shengguang, Xu Benfeng, Xu Jin, Yang An, Yang Hao, Yang Jian, Yang Shusheng, Yao Yang, Yu Bowen, Yuan Hongyi, Yuan Zheng, Zhang Jianwei, Zhang Xingxuan, Zhang Yichang, Zhang Zhenru, Zhou Chang, Zhou Jingren, Zhou Xiaohuan, Zhu Tianhang
- Kgquiz: Evaluating The Generalization Of Encoded Knowledge In Large Language Models Bai Yuyang, Feng Shangbin, Balachandran Vidhisha, Tan Zhaoxuan, Lou Shiqi, He Tianxing, Tsvetkov Yulia
- Gpt-4: A Review On Advancements And Opportunities In Natural Language Processing Baktash Jawid Ahmad, Dawodi Mursal
- Benchmarking LLM Powered Chatbots: Methods And Metrics Banerjee Debarag, Singh Pooja, Avadhanam Arjun, Srivastava Saksham
- Redefining Developer Assistance: Through Large Language Models In Software Ecosystem Banerjee Somnath, Dutta Avik, Layek Sayan, Sahoo Amruit, Joyce Sam Conrad, Hazra Rima
- Peering Through Preferences: Unraveling Feedback Acquisition For Aligning Large Language Models Bansal Hritik, Dang John, Grover Aditya
- A Bi-step Grounding Paradigm For Large Language Models In Recommendation Systems Bao Keqin, Zhang Jizhi, Wang Wenjie, Zhang Yang, Yang Zhengyi, Luo Yancheng, Chen Chong, Feng Fuli, Tian Qi
- Assessing And Enhancing The Robustness Of Large Language Models With Task Structure Variations For Logical Reasoning Bao Qiming, Gendron Gael, Peng Alex Yuxuan, Zhong Wanjun, Tan Neset, Chen Yang, Witbrock Michael, Liu Jiamou
- Disc-medllm: Bridging General Large Language Models And Real-world Medical Consultation Bao Zhijie, Chen Wei, Xiao Shengze, Ren Kuang, Wu Jiaao, Zhong Cheng, Peng Jiajie, Huang Xuanjing, Wei Zhongyu
- Separate-and-enhance: Compositional Finetuning For Text2image Diffusion Models Bao Zhipeng, Li Yijun, Singh Krishna Kumar, Wang Yu-xiong, Hebert Martial
- Check Your Facts And Try Again: Improving Large Language Models With External Knowledge And Automated Feedback Baolin Peng, Michel Galley, Pengcheng He, Hao Cheng, Yujia Xie, Yu Hu, Qiuyuan Huang, Lars Liden, Zhou Yu, Weizhu Chen, Jianfeng Gao
- Do Language Models Learn About Legal Entity Types During Pretraining? Barale Claire, Rovatsos Michael, Bhuta Nehal
- Simple Linguistic Inferences Of Large Language Models (llms): Blind Spots And Blinds Basmov Victoria, Goldberg Yoav, Tsarfaty Reut
- Testing The Depth Of Chatgpt's Comprehension Via Cross-modal Tasks Based On Ascii-art: Gpt3.5's Abilities In Regard To Recognizing And Generating Ascii-art Are Not Totally Lacking Bayani David
- Exploring The Landscape Of Large Language Models In Medical Question Answering Bean Andrew M., Korgul Karolina, Krones Felix, Mccraith Robert, Mahdi Adam
- AQUALLM: Audio Question Answering Data Generation Using Large Language Models Behera Swarup Ranjan, Injeti Krishna Mohan, Patibandla Jaya Sai Kiran, Pokala Praveen Kumar, Pailla Balakrishna Reddy
- Labrador: Exploring The Limits Of Masked Language Modeling For Laboratory Data Bellamy David R., Kumar Bhawesh, Wang Cindy, Beam Andrew
- Worldsense: A Synthetic Benchmark For Grounded Reasoning In Large Language Models Benchekroun Youssef, Dervishi Megi, Ibrahim Mark, Gaya Jean-baptiste, Martinet Xavier, Mialon Grégoire, Scialom Thomas, Dupoux Emmanuel, Hupkes Dieuwke, Vincent Pascal
- Large Language Models In The Workplace: A Case Study On Prompt Engineering For Job Type Classification Benjamin Clavié, Alexandru Ciceu, Frederick Naylor, Guillaume Soulié, Thomas Brightwell
- When Automated Assessment Meets Automated Content Generation: Examining Text Quality In The Era Of Gpts Bevilacqua Marialena, Oketch Kezia, Qin Ruiyang, Stamey Will, Zhang Xinyuan, Gan Yi, Yang Kai, Abbasi Ahmed
- Red-teaming Large Language Models Using Chain Of Utterances For Safety-alignment Bhardwaj Rishabh, Poria Soujanya
- Approach Intelligent Writing Assistants Usability With Seven Stages Of Action Bhat Avinash, Shrivastava Disha, Guo Jin L. C.
- TART: A Plug-and-play Transformer Module For Task-agnostic Reasoning Bhatia Kush, Narayan Avanika, De Sa Christopher, Ré Christopher
- Helm: Highlighted Evidence Augmented Language Model For Enhanced Table-to-text Generation Bian Junyi, Qin Xiaolei, Zou Wuhe, Huang Mengzuo, Luo Congyi, Zhang Ke, Zhang Weidong
- Chatgpt Is A Knowledgeable But Inexperienced Solver: An Investigation Of Commonsense Problem In Large Language Models Bian Ning, Han Xianpei, Sun Le, Lin Hongyu, Lu Yaojie, He Ben, Jiang Shanshan, Dong Bin
- In-context Learning Dynamics With Random Binary Sequences Bigelow Eric J., Lubana Ekdeep Singh, Dick Robert P., Tanaka Hidenori, Ullman Tomer D.
- Tell Me Where To Go: A Composable Framework For Context-aware Embodied Robot Navigation Biggie Harel, Mopidevi Ajay Narasimha, Woods Dusty, Heckman Christoffer
- Visit-bench: A Benchmark For Vision-language Instruction Following Inspired By Real-world Use Bitton Yonatan, Bansal Hritik, Hessel Jack, Shao Rulin, Zhu Wanrong, Awadalla Anas, Gardner Josh, Taori Rohan, Schmidt Ludwig
- BLT: Can Large Language Models Handle Basic Legal Text? Blair-stanek Andrew, Holzenberger Nils, Van Durme Benjamin
- Chip-chat: Challenges And Opportunities In Conversational Hardware Design Blocklove Jason, Garg Siddharth, Karri Ramesh, Pearce Hammond
- RWKV: Reinventing Rnns For The Transformer Era Bo Peng, Eric Alcaide, Quentin Anthony, Alon Albalak, Samuel Arcadinho, Stella Biderman, Huanqi Cao, Xin Cheng, Michael Chung, Matteo Grella, Kranthi Kiran Gv, Xuzheng He, Haowen Hou, Jiaju Lin, Przemyslaw Kazienko, Jan Kocon, Jiaming Kong, Bartlomiej Koptyra, Hayden Lau, Krishna Sri Ipsit Mantri, Ferdinand Mom, Atsushi Saito, Guangyu Song, Xiangru Tang, Bolun Wang, Johan S. Wind, Stanislaw Wozniak, Ruichong Zhang, Zhenyuan Zhang, Qihang Zhao, Peng Zhou, Qinghua Zhou, Jian Zhu, Rui-jie Zhu
- Personality Testing Of Large Language Models: Limited Temporal Stability, But Highlighted Prosociality Bodroza Bojana, Dinic Bojana M., Bojic Ljubisa
- The Ups And Downs Of Large Language Model Inference With Vocabulary Trimming By Language Heuristics Bogoychev Nikolay, Chen Pinzhen, Haddow Barry, Birch Alexandra
- GPT-4 Surpassing Human Performance In Linguistic Pragmatics Bojic Ljubisa, Kovacevic Predrag, Cabarkapa Milan
- Eight Things To Know About Large Language Models Bowman Samuel R.
- Plasma: Making Small Language Models Better Procedural Knowledge Models For (counterfactual) Planning Brahman Faeze, Bhagavatula Chandra, Pyatkin Valentina, Hwang Jena D., Li Xiang Lorraine, Arai Hirona J., Sanyal Soumya, Sakaguchi Keisuke, Ren Xiang, Choi Yejin
- Chemcrow: Augmenting Large-language Models With Chemistry Tools Bran Andres M, Cox Sam, Schilter Oliver, Baldassari Carlo, White Andrew D, Schwaller Philippe
- Insights Into Classifying And Mitigating Llms' Hallucinations Bruno Alessandro, Mazzeo Pier Luigi, Chetouani Aladine, Tliba Marouane, Kerkouri Mohamed Amine
- Codetf: One-stop Transformer Library For State-of-the-art Code LLM Bui Nghi D. Q., Le Hung, Wang Yue, Li Junnan, Gotmare Akhilesh Deepak, Hoi Steven C. H.
- Assessing Large Language Models On Climate Information Bulian Jannis, Schäfer Mike S., Amini Afra, Lam Heidi, Ciaramita Massimiliano, Gaiarin Ben, Hübscher Michelle Chen, Buck Christian, Mede Niels G., Leippold Markus, Strauß Nadine
- Weak-to-strong Generalization: Eliciting Strong Capabilities With Weak Supervision Burns Collin, Izmailov Pavel, Kirchner Jan Hendrik, Baker Bowen, Gao Leo, Aschenbrenner Leopold, Chen Yining, Ecoffet Adrien, Joglekar Manas, Leike Jan, Sutskever Ilya, Wu Jeff
- Incremental Learning Of Humanoid Robot Behavior From Natural Interaction And Large Language Models Bärmann Leonard, Kartmann Rainer, Peller-konrad Fabian, Niehues Jan, Waibel Alex, Asfour Tamim
- Pokemonchat: Auditing Chatgpt For Pokémon Universe Knowledge Cabello Laura, Li Jiaang, Chalkidis Ilias
- Development Of The Chatgpt, Generative Artificial Intelligence And Natural Large Language Models For Accountable Reporting And Use (CANGARU) Guidelines Cacciamani Giovanni E., Eppler Michael B., Ganjavi Conner, Pekan Asli, Biedermann Brett, Collins Gary S., Gill Inderbir S.
- Instructalign: High-and-low Resource Language Alignment Via Continual Crosslingual Instruction Tuning Cahyawijaya Samuel, Lovenia Holy, Yu Tiezheng, Chung Willy, Fung Pascale
- Do Large Language Models Resemble Humans In Language Use? Cai Zhenguang G., Duan Xufeng, Haslett David A., Wang Shuqi, Pickering Martin J.
- On The Application Of Large Language Models For Language Teaching And Assessment Technology Caines Andrew, Benedetto Luca, Taslimipoor Shiva, Davis Christopher, Gao Yuan, Andersen Oeistein, Yuan Zheng, Elliott Mark, Moore Russell, Bryant Christopher, Rei Marek, Yannakoudakis Helen, Mullooly Andrew, Nicholls Diane, Buttery Paula
- A Systematic Study Of Knowledge Distillation For Natural Language Generation With Pseudo-target Training Calderon Nitay, Mukherjee Subhabrata, Reichart Roi, Kantor Amir
- H2ogpt: Democratizing Large Language Models Candel Arno, Mckinney Jon, Singer Philipp, Pfeiffer Pascal, Jeblick Maximilian, Prabhu Prithvi, Gambera Jeff, Landry Mark, Bansal Shivam, Chesler Ryan, Lee Chun Ming, Conde Marcos V., Stetsenko Pasha, Grellier Olivier, Ambati Srisatish
- Defending Against Alignment-breaking Attacks Via Robustly Aligned LLM Cao Bochuan, Cao Yuanpu, Lin Lu, Chen Jinghui
- Learn To Refuse: Making Large Language Models More Controllable And Reliable Through Knowledge Scope Limitation And Refusal Mechanism Cao Lang
- Successor Features For Efficient Multisubject Controlled Text Generation Cao Meng, Fatemi Mehdi, Cheung Jackie Chi Kit, Shabanian Samira
- Systematic Rectification Of Language Models Via Dead-end Analysis Cao Meng, Fatemi Mehdi, Cheung Jackie Chi Kit, Shabanian Samira
- Unnatural Error Correction: GPT-4 Can Almost Perfectly Handle Unnatural Scrambled Text Cao Qi, Kojima Takeshi, Matsuo Yutaka, Iwasawa Yusuke
- Exploring The Effectiveness Of Chatgpt-based Feedback Compared With Teacher Feedback And Self-feedback: Evidence From Chinese To English Translation Cao Siyi, Zhong Linping
- How Does AI Chat Change Search Behaviors? Capra Robert, Arguello Jaime
- Assessing Large Language Models' Ability To Predict How Humans Balance Self-interest And The Interest Of Others Capraro Valerio, Di Paolo Roberto, Pizziol Veronica
- Grounding Large Language Models In Interactive Environments With Online Reinforcement Learning Carta Thomas, Romac Clément, Wolf Thomas, Lamprier Sylvain, Sigaud Olivier, Oudeyer Pierre-yves
- Can It Edit? Evaluating The Ability Of Large Language Models To Follow Code Editing Instructions Cassano Federico, Li Luisa, Sethi Akul, Shinn Noah, Brennan-jones Abby, Ginesin Jacob, Berman Edward, Chakhnashvili George, Lozhkov Anton, Anderson Carolyn Jane, Guha Arjun
- Do Androids Know They're Only Dreaming Of Electric Sheep? Ch-wang Sky, Van Durme Benjamin, Eisner Jason, Kedzie Chris
- Creativity Support In The Age Of Large Language Models: An Empirical Study Involving Emerging Writers Chakrabarty Tuhin, Padmakumar Vishakh, Brahman Faeze, Muresan Smaranda
- Which Spurious Correlations Impact Reasoning In NLI Models? A Visual Interactive Diagnosis Through Data-constrained Counterfactuals Chan Robin, Amini Afra, El-assady Mennatallah
- Learning To Generate Better Than Your LLM Chang Jonathan D., Brantley Kiante, Ramamurthy Rajkumar, Misra Dipendra, Sun Wen
- Language Model Behavior: A Comprehensive Survey Chang Tyler A., Bergen Benjamin K.
- Characterizing Learning Curves During Language Model Pre-training: Learning, Forgetting, And Stability Chang Tyler A., Tu Zhuowen, Bergen Benjamin K.
- MME: A Comprehensive Evaluation Benchmark For Multimodal Large Language Models Chaoyou Fu, Peixian Chen, Yunhang Shen, Yulei Qin, Mengdan Zhang, Xu Lin, Jinrui Yang, Xiawu Zheng, Ke Li, Xing Sun, Yunsheng Wu, Rongrong Ji
- Memgpt: Towards Llms As Operating Systems Charles Packer, Sarah Wooders, Kevin Lin, Vivian Fang, Shishir G. Patil, Ion Stoica, Joseph E. Gonzalez
- Efficient GPT Model Pre-training Using Tensor Train Matrix Representation Chekalina Viktoriia, Novikov Georgii, Gusak Julia, Oseledets Ivan, Panchenko Alexander
- Two Failures Of Self-consistency In The Multi-step Reasoning Of Llms Chen Angelica, Phang Jason, Parrish Alicia, Padmakumar Vishakh, Zhao Chen, Bowman Samuel R., Cho Kyunghyun
- Sudden Drops In The Loss: Syntax Acquisition, Phase Transitions, And Simplicity Bias In Mlms Chen Angelica, Shwartz-ziv Ravid, Cho Kyunghyun, Leavitt Matthew L., Saphra Naomi
- Fireact: Toward Language Agent Fine-tuning Chen Baian, Shu Chang, Shareghi Ehsan, Collier Nigel, Narasimhan Karthik, Yao Shunyu
- Dual-space Hierarchical Learning For Goal-guided Conversational Recommendation Chen Can, Liu Hao, Liu Zeming, Liu Xue, Dou Dejing
- Visual Instruction Tuning With Polite Flamingo Chen Delong, Liu Jianfeng, Dai Wenliang, Wang Baoyuan
- Chatgpt's One-year Anniversary: Are Open-source Large Language Models Catching Up? Chen Hailin, Jiao Fangkai, Li Xingxuan, Qin Chengwei, Ravaut Mathieu, Zhao Ruochen, Xiong Caiming, Joty Shafiq
- Interpreting And Controlling Vision Foundation Models Via Text Explanations Chen Haozhe, Yang Junfeng, Vondrick Carl, Mao Chengzhi
- Skills-in-context Prompting: Unlocking Compositionality In Large Language Models Chen Jiaao, Pan Xiaoman, Yu Dian, Song Kaiqiang, Wang Xiaoyang, Yu Dong, Chen Jianshu
- Fairytalecqa: Integrating A Commonsense Knowledge Graph Into Children's Storybook Narratives Chen Jiaju, Lu Yuxuan, Zhang Shao, Yao Bingsheng, Dong Yuanzhe, Xu Ying, Li Yunyao, Wang Qianwen, Wang Dakuo, Sun Yuling
- Schema-guided Semantic Accuracy: Faithfulness In Task-oriented Dialogue Response Generation Chen Jinghong, Lin Weizhe, Byrne Bill
- Huatuogpt-ii, One-stage Training For Medical Adaption Of Llms Chen Junying, Wang Xidong, Gao Anningzhe, Jiang Feng, Chen Shunian, Zhang Hongbo, Song Dingjie, Xie Wenya, Kong Chuyi, Li Jianquan, Wan Xiang, Li Haizhou, Wang Benyou
- Gaining Wisdom From Setbacks: Aligning Large Language Models Via Mistake Analysis Chen Kai, Wang Chunwei, Yang Kuo, Han Jianhua, Hong Lanqing, Mi Fei, Xu Hang, Liu Zhengying, Huang Wenyong, Li Zhenguo, Yeung Dit-yan, Shang Lifeng, Jiang Xin, Liu Qun
- Large Language Models Are Visual Reasoning Coordinators Chen Liangyu, Li Bo, Shen Sheng, Yang Jingkang, Li Chunyuan, Keutzer Kurt, Darrell Trevor, Liu Ziwei
- Introspective Tips: Large Language Model For In-context Decision Making Chen Liting, Wang Lu, Dong Hang, Du Yali, Yan Jie, Yang Fangkai, Li Shuang, Zhao Pu, Qin Si, Rajmohan Saravan, Lin Qingwei, Zhang Dongmei
- Driving With Llms: Fusing Object-level Vector Modality For Explainable Autonomous Driving Chen Long, Sinavski Oleg, Hünermann Jan, Karnsund Alice, Willmott Andrew James, Birch Danny, Maund Daniel, Shotton Jamie
- From Good To Great: Improving Math Reasoning With Tool-augmented Interleaf Prompting Chen Nuo, Li Hongguang, Wang Baoyuan, Li Jia
- Is Bigger And Deeper Always Better? Probing Llama Across Scales And Layers Chen Nuo, Wu Ning, Liang Shining, Gong Ming, Shou Linjun, Zhang Dongmei, Li Jia
- Monolingual Or Multilingual Instruction Tuning: Which Makes A Better Alpaca Chen Pinzhen, Ji Shaoxiong, Bogoychev Nikolay, Kutuzov Andrey, Haddow Barry, Heafield Kenneth
- Dynamic Strategy Chain: Dynamic Zero-shot Cot For Long Mental Health Support Generation Chen Qi, Liu Dexi
- Webvln: Vision-and-language Navigation On Websites Chen Qi, Pitawela Dileepa, Zhao Chongyang, Zhou Gengze, Chen Hsiang-ting, Wu Qi
- Extending Context Window Of Large Language Models Via Positional Interpolation Chen Shouyuan, Wong Sherman, Chen Liangjian, Tian Yuandong
- Understanding And Improving In-context Learning On Vision-language Models Chen Shuo, Han Zhen, He Bailan, Buckley Mark, Torr Philip, Tresp Volker, Gu Jindong
- Chatpipe: Orchestrating Data Preparation Program By Optimizing Human-chatgpt Interactions Chen Sibei, Liu Hanbing, Jin Weiting, Sun Xiangyu, Feng Xiaoyao, Fan Ju, Du Xiaoyong, Tang Nan
- Llm-empowered Chatbots For Psychiatrist And Patient Simulation: Application And Evaluation Chen Siyuan, Wu Mengyue, Zhu Kenny Q., Lan Kunyao, Zhang Zhiling, Cui Lyuchun
- Self-icl: Zero-shot In-context Learning With Self-generated Demonstrations Chen Wei-lin, Wu Cheng-kuang, Chen Yun-nung, Chen Hsin-hsi
- Challenges And Contributing Factors In The Utilization Of Large Language Models (llms) Chen Xiaoliang, Li Liangbin, Chang Le, Huang Yunhe, Zhao Yuxuan, Zhang Yuxiao, Li Dinuo
- Asking Before Acting: Gather Information In Embodied Decision Making With Language Models Chen Xiaoyu, Zhang Shenao, Zhang Pushi, Zhao Li, Chen Jianyu
- Autoeval-video: An Automatic Benchmark For Assessing Large Vision Language Models In Open-ended Video Question Answering Chen Xiuyuan, Lin Yuan, Zhang Yuchen, Huang Weiran
- Medprompt: Cross-modal Prompting For Multi-task Medical Image Translation Chen Xuhang, Pun Chi-man, Wang Shuqiang
- Do Models Explain Themselves? Counterfactual Simulatability Of Natural Language Explanations Chen Yanda, Zhong Ruiqi, Ri Narutatsu, Zhao Chen, He He, Steinhardt Jacob, Yu Zhou, Mckeown Kathleen
- Measuring And Improving Chain-of-thought Reasoning In Vision-language Models Chen Yangyi, Sikka Karan, Cogswell Michael, Ji Heng, Divakaran Ajay
- EE-LLM: Large-scale Training And Inference Of Early-exit Large Language Models With 3D Parallelism Chen Yanxi, Pan Xuchen, Li Yaliang, Ding Bolin, Zhou Jingren
- Tigerbot: An Open Multilingual Multitask LLM Chen Ye, Cai Wei, Wu Liangmin, Li Xiaowei, Xin Zhanxuan, Fu Cong
- Bianque: Balancing The Questioning And Suggestion Ability Of Health Llms With Multi-turn Health Conversations Polished By Chatgpt Chen Yirong, Wang Zhenyu, Xing Xiaofen, Zheng Huimin, Xu Zhipei, Fang Kai, Wang Junhong, Li Sihang, Wu Jieling, Liu Qi, Xu Xiangmin
- Fortify The Shortest Stave In Attention: Enhancing Context Awareness Of Large Language Models For Effective Tool Use Chen Yuhan, Lv Ang, Lin Ting-en, Chen Changyu, Wu Yuchuan, Huang Fei, Li Yongbin, Yan Rui
- See, Think, Confirm: Interactive Prompting Between Vision And Language Models For Knowledge-based Visual Reasoning Chen Zhenfang, Zhou Qinhong, Shen Yikang, Hong Yining, Zhang Hao, Gan Chuang
- Chatcot: Tool-augmented Chain-of-thought Reasoning On Chat-based Large Language Models Chen Zhipeng, Zhou Kun, Zhang Beichen, Gong Zheng, Zhao Wayne Xin, Wen Ji-rong
- Xplainllm: A QA Explanation Dataset For Understanding LLM Decision-making Chen Zichen, Chen Jianda, Gaidhani Mitali, Singh Ambuj, Sra Misha
- Visual Chatgpt: Talking, Drawing And Editing With Visual Foundation Models Chenfei Wu, Shengming Yin, Weizhen Qi, Xiaodong Wang, Zecheng Tang, Nan Duan
- Llf-bench: Benchmark For Interactive Learning From Language Feedback Cheng Ching-an, Kolobov Andrey, Misra Dipendra, Nie Allen, Swaminathan Adith
- Black-box Prompt Optimization: Aligning Large Language Models Without Model Training Cheng Jiale, Liu Xiao, Zheng Kehan, Ke Pei, Wang Hongning, Dong Yuxiao, Tang Jie, Huang Minlie
- Adversarial Preference Optimization: Enhancing Your Alignment Via RM-LLM Game Cheng Pengyu, Yang Yifan, Li Jian, Dai Yong, Hu Tianhao, Cao Peixin, Du Nan, Li Xiaolong
- CREATOR: Tool Creation For Disentangling Abstract And Concrete Reasoning Of Large Language Models Cheng Qian, Chi Han, Yi R. Fung, Yujia Qin, Zhiyuan Liu, Heng Ji
- Batch Prompting: Efficient Inference With Large Language Model Apis Cheng Zhoujun, Kasai Jungo, Yu Tao
- Chatdb: Augmenting Llms With Databases As Their Symbolic Memory Chenxu Hu, Jie Fu, Chenzhuang Du, Simian Luo, Junbo Zhao, Hang Zhao
- Latent Positional Information Is In The Self-attention Variance Of Transformer Language Models Without Positional Embeddings Chi Ta-chung, Fan Ting-han, Chen Li-wei, Rudnicky Alexander I., Ramadge Peter J.
- M\(^{2}\)chat: Empowering VLM For Multimodal LLM Interleaved Text-image Generation Chi Xiaowei, Zhang Rongyu, Jiang Zhengkai, Liu Yijiang, Wang Yatian, Qi Xingqun, Luo Wenhan, Gao Peng, Zhang Shanghang, Liu Qifeng, Guo Yike
- INSTRUCTEVAL: Towards Holistic Evaluation Of Instruction-tuned Large Language Models Chia Yew Ken, Hong Pengfei, Bing Lidong, Poria Soujanya
- An Integrative Survey On Mental Health Conversational Agents To Bridge Computer Science And Medical Perspectives Cho Young Min, Rai Sunny, Ungar Lyle, Sedoc João, Guntuku Sharath Chandra
- Evaluating The Efficacy Of Interactive Language Therapy Based On LLM For High-functioning Autistic Adolescent Psychological Counseling Cho Yujin, Kim Mingeon, Kim Seojin, Kwon Oyun, Kwon Ryan Donghan, Lee Yoonha, Lim Dohyun
- KCTS: Knowledge-constrained Tree Search Decoding With Token-level Hallucination Detection Choi Sehyun, Fang Tianqing, Wang Zhaowei, Song Yangqiu
- Language And Task Arithmetic With Parameter-efficient Layers For Zero-shot Summarization Chronopoulou Alexandra, Pfeiffer Jonas, Maynez Joshua, Wang Xinyi, Ruder Sebastian, Agrawal Priyanka
- Investigating Hallucinations In Pruned Large Language Models For Abstractive Summarization Chrysostomou George, Zhao Zhixue, Williams Miles, Aletras Nikolaos
- Timebench: A Comprehensive Evaluation Of Temporal Reasoning Abilities In Large Language Models Chu Zheng, Chen Jingchang, Chen Qianglong, Yu Weijiang, Wang Haotian, Liu Ming, Qin Bing
- Data-centric Financial Large Language Models Chu Zhixuan, Guo Huaiyu, Zhou Xinyuan, Wang Yijia, Yu Fei, Chen Hong, Xu Wanqing, Lu Xin, Cui Qing, Li Longfei, Zhou Jun, Li Sheng
- Dola: Decoding By Contrasting Layers Improves Factuality In Large Language Models Chuang Yung-sung, Xie Yujia, Luo Hongyin, Kim Yoon, Glass James, He Pengcheng
- Long Story Short: A Summarize-then-search Method For Long Video Question Answering Chung Jiwan, Yu Youngjae
- Challenges Of Large Language Models For Mental Health Counseling Chung Neo Christopher, Dyer George, Brocki Lennart
- Instructtods: Large Language Models For End-to-end Task-oriented Dialogue Systems Chung Willy, Cahyawijaya Samuel, Wilie Bryan, Lovenia Holy, Fung Pascale
- LIMA: Less Is More For Alignment Chunting Zhou, Pengfei Liu, Puxin Xu, Srini Iyer, Jiao Sun, Yuning Mao, Xuezhe Ma, Avia Efrat, Ping Yu, Lili Yu, Susan Zhang, Gargi Ghosh, Mike Lewis, Luke Zettlemoyer, Omer Levy
- Remember What You Did So You Know What To Do Next Ciosici Manuel R., Hedges Alex, Kankanampati Yash, Martin Justin, Freedman Marjorie, Weischedel Ralph
- Galactic Chitchat: Using Large Language Models To Converse With Astronomy Literature Ciucă Ioana, Ting Yuan-sen
- Generalization Analogies: A Testbed For Generalizing AI Oversight To Hard-to-measure Domains Clymer Joshua, Baker Garrett, Subramani Rohan, Wang Sam
- BERT Lost Patience Won't Be Robust To Adversarial Slowdown Coalson Zachary, Ritter Gabriel, Bobba Rakesh, Hong Sanghyun
- Evaluating Language Models For Mathematics Through Interactions Collins Katherine M., Jiang Albert Q., Frieder Simon, Wong Lionel, Zilka Miri, Bhatt Umang, Lukasiewicz Thomas, Wu Yuhuai, Tenenbaum Joshua B., Hart William, Gowers Timothy, Li Wenda, Weller Adrian, Jamnik Mateja
- Seamlessm4t: Massively Multilingual & Multimodal Machine Translation Communication Seamless, Barrault Loïc, Chung Yu-an, Meglioli Mariano Cora, Dale David, Dong Ning, Duquenne Paul-ambroise, Elsahar Hady, Gong Hongyu, Heffernan Kevin, Hoffman John, Klaiber Christopher, Li Pengwei, Licht Daniel, Maillard Jean, Rakotoarison Alice, Sadagopan Kaushik Ram, Wenzek Guillaume, Ye Ethan, Akula Bapi, Chen Peng-jen, Hachem Naji El, Ellis Brian, Gonzalez Gabriel Mejia, Haaheim Justin, Hansanti Prangthip, Howes Russ, Huang Bernie, Hwang Min-jae, Inaguma Hirofumi, Jain Somya, Kalbassi Elahe, Kallet Amanda, Kulikov Ilia, Lam Janice, Li Daniel, Ma Xutai, Mavlyutov Ruslan, Peloquin Benjamin, Ramadan Mohamed, Ramakrishnan Abinesh, Sun Anna, Tran Kevin, Tran Tuan, Tufanov Igor, Vogeti Vish, Wood Carleigh, Yang Yilin, Yu Bokai, Andrews Pierre, Balioglu Can, Costa-jussà Marta R., Celebi Onur, Elbayad Maha, Gao Cynthia, Guzmán Francisco, Kao Justine, Lee Ann, Mourachko Alexandre, Pino Juan, Popuri Sravya, Ropers Christophe, Saleem Safiyyah, Schwenk Holger, Tomasello Paden, Wang Changhan, Wang Jeff, Wang Skyler
- Robust Infidelity: When Faithfulness Measures On Masked Language Models Are Misleading Crothers Evan, Viktor Herna, Japkowicz Nathalie
- Reinforcement Learning Fine-tuning Of Language Models Is Biased Towards More Extractable Features Cruz Diogo, Pona Edoardo, Holness-tofts Alex, Schmied Elias, Alonso Víctor Abia, Griffin Charlie, Cirstea Bogdan-ionut
- Receive, Reason, And React: Drive As You Say With Large Language Models In Autonomous Vehicles Cui Can, Ma Yunsheng, Cao Xu, Ye Wenqian, Wang Ziran
- Personalized Autonomous Driving With Large Language Models: Field Experiments Cui Can, Yang Zichong, Zhou Yupeng, Ma Yunsheng, Lu Juanwu, Li Lingxi, Chen Yaobin, Panchal Jitesh, Wang Ziran
- Ultrafeedback: Boosting Language Models With Scaled AI Feedback Cui Ganqu, Yuan Lifan, Ding Ning, Yao Guanming, He Bingxiang, Zhu Wei, Ni Yuan, Xie Guotong, Xie Ruobing, Lin Yankai, Liu Zhiyuan, Sun Maosong
- Machine Mindset: An MBTI Exploration Of Large Language Models Cui Jiaxi, Lv Liuzhenghao, Wen Jing, Wang Rongsheng, Tang Jing, Tian Yonghong, Yuan Li
- Efficient And Effective Text Encoding For Chinese Llama And Alpaca Cui Yiming, Yang Ziqing, Yao Xin
- Evaluating Prompt-based Question Answering For Object Prediction In The Open Research Knowledge Graph D'souza Jennifer, Hrou Moussab, Auer Sören
- Llm-in-the-loop: Leveraging Large Language Model For Thematic Analysis Dai Shih-chieh, Xiong Aiping, Ku Lun-wei
- Llmebench: A Flexible Framework For Accelerating Llms Benchmarking Dalvi Fahim, Hasanain Maram, Boughorbel Sabri, Mousi Basel, Abdaljalil Samir, Nazar Nizi, Abdelali Ahmed, Chowdhury Shammur Absar, Mubarak Hamdy, Ali Ahmed, Hawasly Majd, Durrani Nadir, Alam Firoj
- Educhat: A Large-scale Language Model-based Chatbot System For Intelligent Education Dan Yuhao, Lei Zhikai, Gu Yiyang, Li Yong, Yin Jianghao, Lin Jiaju, Ye Linhao, Tie Zhiyan, Zhou Yougen, Wang Yilei, Zhou Aimin, Zhou Ze, Chen Qin, Zhou Jie, He Liang, Qiu Xipeng
- Palm-e: An Embodied Multimodal Language Model Danny Driess, Fei Xia, Mehdi S. M. Sajjadi, Corey Lynch, Aakanksha Chowdhery, Brian Ichter, Ayzaan Wahid, Jonathan Tompson, Quan Vuong, Tianhe Yu, Wenlong Huang, Yevgen Chebotar, Pierre Sermanet, Daniel Duckworth, Sergey Levine, Vincent Vanhoucke, Karol Hausman, Marc Toussaint, Klaus Greff, Andy Zeng, Igor Mordatch, Pete Florence
- An Adversarial Example For Direct Logit Attribution: Memory Management In Gelu-4l Dao James, Lau Yeu-tong, Rager Can, Janiak Jett
- Flashattention-2: Faster Attention With Better Parallelism And Work Partitioning Dao Tri
- Chatgpt Is Good But Bing Chat Is Better For Vietnamese Students Dao Xuan-quy, Le Ngoc-bich
- Can Chatgpt Pass The Vietnamese National High School Graduation Examination? Dao Xuan-quy, Le Ngoc-bich, Phan Xuan-dung, Ngo Bac-bien
- Optimizing Distributed Training On Frontier For Large Language Models Dash Sajal, Lyngaas Isaac, Yin Junqi, Wang Xiao, Egele Romain, Cong Guojing, Wang Feiyi, Balaprakash Prasanna
- Complex QA And Language Models Hybrid Architectures, Survey Daull Xavier, Bellot Patrice, Bruno Emmanuel, Martin Vincent, Murisasco Elisabeth
- User Simulation With Large Language Models For Evaluating Task-oriented Dialogue Davidson Sam, Romeo Salvatore, Shu Raphael, Gung James, Gupta Arshit, Mansour Saab, Zhang Yi
- LLMR: Real-time Prompting Of Interactive Worlds Using Large Language Models De La Torre Fernanda, Fang Cathy Mengying, Huang Han, Banburski-fahey Andrzej, Fernandez Judith Amores, Lanier Jaron
- GPT-3 Models Are Few-shot Financial Reasoners De Padua Raul Salles, Qureshi Imran, Karakaplan Mustafa U.
- Improved Prompting And Process For Writing User Personas With Llms, Using Qualitative Interviews: Capturing Behaviour And Personality Traits Of Users De Paoli Stefano
- Im-promptu: In-context Composition From Image Prompts Dedhia Bhishma, Chang Michael, Snell Jake C., Griffiths Thomas L., Jha Niraj K.
- Facechat: An Emotion-aware Face-to-face Dialogue Framework Deema Alnuhait, Qingyang Wu, Zhou Yu
- Scaling Vision Transformers To 22 Billion Parameters Dehghani Mostafa, Djolonga Josip, Mustafa Basil, Padlewski Piotr, Heek Jonathan, Gilmer Justin, Steiner Andreas, Caron Mathilde, Geirhos Robert, Alabdulmohsin Ibrahim, Jenatton Rodolphe, Beyer Lucas, Tschannen Michael, Arnab Anurag, Wang Xiao, Riquelme Carlos, Minderer Matthias, Puigcerver Joan, Evci Utku, Kumar Manoj, Van Steenkiste Sjoerd, Elsayed Gamaleldin F., Mahendran Aravindh, Yu Fisher, Oliver Avital, Huot Fantine, Bastings Jasmijn, Collier Mark Patrick, Gritsenko Alexey, Birodkar Vighnesh, Vasconcelos Cristina, Tay Yi, Mensink Thomas, Kolesnikov Alexander, Pavetić Filip, Tran Dustin, Kipf Thomas, Lučić Mario, Zhai Xiaohua, Keysers Daniel, Harmsen Jeremiah, Houlsby Neil
- Skipdecode: Autoregressive Skip Decoding With Batching And Caching For Efficient LLM Inference Del Corro Luciano, Del Giorno Allie, Agarwal Sahaj, Yu Bin, Awadallah Ahmed, Mukherjee Subhabrata
- Fairness Of Chatgpt And The Role Of Explainable-guided Prompts Deldjoo Yashar
- Investigating Data Contamination In Modern Benchmarks For Large Language Models Deng Chunyuan, Zhao Yilun, Tang Xiangru, Gerstein Mark, Cohan Arman
- Plug-and-play Policy Planner For Large Language Model Powered Dialogue Agents Deng Yang, Zhang Wenxuan, Lam Wai, Ng See-kiong, Chua Tat-seng
- Multilingual Jailbreak Challenges In Large Language Models Deng Yue, Zhang Wenxuan, Pan Sinno Jialin, Bing Lidong
- Early Chatgpt User Portrait Through The Lens Of Data Deng Yuyang, Zhao Ni, Huang Xin
- Testing AI On Language Comprehension Tasks Reveals Insensitivity To Underlying Meaning Dentella Vittoria, Guenther Fritz, Murphy Elliot, Marcus Gary, Leivada Evelina
- Can Chatgpt Read Who You Are? Derner Erik, Kučera Dalibor, Oliver Nuria, Zahálka Jan
- Spqr: A Sparse-quantized Representation For Near-lossless LLM Weight Compression Dettmers Tim, Svirschevski Ruslan, Egiazarian Vage, Kuznedelev Denis, Frantar Elias, Ashkboos Saleh, Borzunov Alexander, Hoefler Torsten, Alistarh Dan
- Chatgpt Asks, BLIP-2 Answers: Automatic Questioning Towards Enriched Visual Descriptions Deyao Zhu, Jun Chen, Kilichbek Haydarov, Xiaoqian Shen, Wenxuan Zhang, Mohamed Elhoseiny
- Minigpt-4: Enhancing Vision-language Understanding With Advanced Large Language Models Deyao Zhu, Jun Chen, Xiaoqian Shen, Xiang Li, Mohamed Elhoseiny
- Codefuse-13b: A Pretrained Multi-lingual Code Large Language Model Di Peng, Li Jianguo, Yu Hang, Jiang Wei, Cai Wenting, Cao Yang, Chen Chaoyu, Chen Dajun, Chen Hongwei, Chen Liang, Fan Gang, Gong Jie, Gong Zi, Hu Wen, Guo Tingting, Lei Zhichao, Li Ting, Li Zheng, Liang Ming, Liao Cong, Liu Bingchang, Liu Jiachen, Liu Zhiwei, Lu Shaojun, Shen Min, Wang Guangpei, Wang Huan, Wang Zhi, Xu Zhaogui, Yang Jiawei, Ye Qing, Zhang Gehao, Zhang Yu, Zhao Zelin, Zheng Xunjin, Zhou Hailian, Zhu Lifu, Zhu Xianying
- Lmflow: An Extensible Toolkit For Finetuning And Inference Of Large Foundation Models Diao Shizhe, Pan Rui, Dong Hanze, Shum Ka Shun, Zhang Jipeng, Xiong Wei, Zhang Tong
- Can Large Language Models Assist In Hazard Analysis? Diemert Simon, Weber Jens H.
- Jump To Conclusions: Short-cutting Transformers With Linear Transformations Din Alexander Yom, Karidi Taelin, Choshen Leshem, Geva Mor
- Enhancing Chat Language Models By Scaling High-quality Instructional Conversations Ding Ning, Chen Yulin, Xu Bokai, Qin Yujia, Zheng Zhi, Hu Shengding, Liu Zhiyuan, Sun Maosong, Zhou Bowen
- Everything Of Thoughts: Defying The Law Of Penrose Triangle For Thought Generation Ding Ruomeng, Zhang Chaoyun, Wang Lu, Xu Yong, Ma Minghua, Zhang Wei, Qin Si, Rajmohan Saravan, Lin Qingwei, Zhang Dongmei
- Do Llms Work On Charts? Designing Few-shot Prompts For Chart Question Answering And Summarization Do Xuan Long, Hassanpour Mohammad, Masry Ahmed, Kavehzadeh Parsa, Hoque Enamul, Joty Shafiq
- Modeling What-to-ask And How-to-ask For Answer-unaware Conversational Question Generation Do Xuan Long, Zou Bowei, Joty Shafiq, Tran Anh Tai, Pan Liangming, Chen Nancy F., Aw Ai Ti
- How To Build An AI Tutor That Can Adapt To Any Course And Provide Accurate Answers Using Large Language Model And Retrieval-augmented Generation Dong Chenxi
- Revisit Input Perturbation Problems For Llms: A Unified Robustness Evaluation Framework For Noisy Slot Filling Task Dong Guanting, Zhao Jinxu, Hui Tingfeng, Guo Daichi, Wan Wenlong, Feng Boqi, Qiu Yueyan, Gongque Zhuoma, He Keqing, Wang Zechen, Xu Weiran
- The Philosopher's Stone: Trojaning Plugins Of Large Language Models Dong Tian, Xue Minhui, Chen Guoxing, Holland Rayne, Meng Yan, Li Shaofeng, Liu Zhen, Zhu Haojin
- PACE: Improving Prompt With Actor-critic Editing For Large Language Model Dong Yihong, Luo Kangcheng, Jiang Xue, Jin Zhi, Li Ge
- Loramoe: Alleviate World Knowledge Forgetting In Large Language Models Via Moe-style Plugin Dou Shihan, Zhou Enyu, Liu Yan, Gao Songyang, Zhao Jun, Shen Wei, Zhou Yuhao, Xi Zhiheng, Wang Xiao, Fan Xiaoran, Pu Shiliang, Zhu Jiang, Zheng Rui, Gui Tao, Zhang Qi, Huang Xuanjing
- A Comparative Study Of Ai-generated (GPT-4) And Human-crafted Mcqs In Programming Education Doughty Jacob, Wan Zipiao, Bompelli Anishka, Qayum Jubahed, Wang Taozhi, Zhang Juran, Zheng Yujia, Doyle Aidan, Sridhar Pragnya, Agarwal Arav, Bogart Christopher, Keylor Eric, Kultur Can, Savelka Jaromir, Sakr Majd
- Large Language Models Douglas Michael R.
- Scratch Copilot Evaluation: Assessing Ai-assisted Creative Coding For Families Druga Stefania, Otero Nancy
- Blending Reward Functions Via Few Expert Demonstrations For Faithful And Accurate Knowledge-grounded Dialogue Generation Du Wanyu, Ji Yangfeng
- Guiding Pretraining In Reinforcement Learning With Large Language Models Du Yuqing, Watkins Olivia, Wang Zihan, Colas Cédric, Darrell Trevor, Abbeel Pieter, Gupta Abhishek, Andreas Jacob
- Botchat: Evaluating Llms' Capabilities Of Having Multi-turn Dialogues Duan Haodong, Wei Jueqi, Wang Chonghua, Liu Hongwei, Fang Yixiao, Zhang Songyang, Lin Dahua, Chen Kai
- Shifting Attention To Relevance: Towards The Predictive Uncertainty Quantification Of Free-form Large Language Models Duan Jinhao, Cheng Hao, Wang Shiqi, Zavalny Alex, Wang Chenan, Xu Renjing, Kailkhura Bhavya, Xu Kaidi
- Denevil: Towards Deciphering And Navigating The Ethical Values Of Large Language Models Via Instruction Learning Duan Shitong, Yi Xiaoyuan, Zhang Peng, Lu Tun, Xie Xing, Gu Ning
- Bridging The Language Gap: Knowledge Injected Multilingual Question Answering Duan Zhichao, Li Xiuxing, Zhang Zhengyan, Li Zhenyu, Liu Ning, Wang Jianyong
- Alpacafarm: A Simulation Framework For Methods That Learn From Human Feedback Dubois Yann, Li Xuechen, Taori Rohan, Zhang Tianyi, Gulrajani Ishaan, Ba Jimmy, Guestrin Carlos, Liang Percy, Hashimoto Tatsunori B.
- Frugal Lms Trained To Invoke Symbolic Solvers Achieve Parameter-efficient Arithmetic Reasoning Dutta Subhabrata, Singh Joykirat, Pandey Ishan, Manchanda Sunny, Chakrabarti Soumen, Chakraborty Tanmoy
- A Comprehensive Performance Study Of Large Language Models On Novel AI Accelerators Emani Murali, Foreman Sam, Sastry Varuni, Xie Zhen, Raskar Siddhisanket, Arnold William, Thakur Rajeev, Vishwanath Venkatram, Papka Michael E.
- Context-driven Interactive Query Simulations Based On Generative Large Language Models Engelmann Björn, Breuer Timo, Friese Jana Isabelle, Schaer Philipp, Fuhr Norbert
- Textgenshap: Scalable Post-hoc Explanations In Text Generation With Long Documents Enouen James, Nakhost Hootan, Ebrahimi Sayna, Arik Sercan O, Liu Yan, Pfister Tomas
- Lm-polygraph: Uncertainty Estimation For Language Models Fadeeva Ekaterina, Vashurin Roman, Tsvigun Akim, Vazhentsev Artem, Petrakov Sergey, Fedyanin Kirill, Vasilev Daniil, Goncharova Elizaveta, Panchenko Alexander, Panov Maxim, Baldwin Timothy, Shelmanov Artem
- Perspectives On Large Language Models For Relevance Judgment Faggioli Guglielmo, Dietz Laura, Clarke Charles, Demartini Gianluca, Hagen Matthias, Hauff Claudia, Kando Noriko, Kanoulas Evangelos, Potthast Martin, Stein Benno, Wachsmuth Henning
- Proximal Policy Optimization Actual Combat: Manipulating Output Tokenizer Length Fan Miao, Hu Chen, Zhou Shuchang
- Uncovering The Potential Of Chatgpt For Discourse Analysis In Dialogue: An Empirical Study Fan Yaxin, Jiang Feng, Li Peifeng, Li Haizhou
- Large Language Models For Code Analysis: Do Llms Really Do Their Job? Fang Chongzhou, Miao Ning, Srivastav Shaurya, Liu Jialin, Zhang Ruoyu, Fang Ruijie, Asmita, Tsang Ryan, Nazari Najmeh, Wang Han, Homayoun Houman
- Using GPT-4 To Augment Unbalanced Data For Automatic Scoring Fang Luyang, Lee Gyeong-geon, Zhai Xiaoming
- Tooltalk: Evaluating Tool-usage In A Conversational Setting Farn Nicholas, Shin Richard
- Revisiting Instruction Fine-tuned Model Evaluation To Guide Industrial Applications Faysse Manuel, Viaud Gautier, Hudelot Céline, Colombo Pierre
- Friend Or Foe? Exploring The Implications Of Large Language Models On The Science System Fecher Benedikt, Hebing Marcel, Laufer Melissa, Pohle Jörg, Sofsky Fabian
- Trapping LLM Hallucinations Using Tagged Context Prompts Feldman Philip, Foulds James R., Pan Shimei
- Llama Rider: Spurring Large Language Models To Explore The Open World Feng Yicheng, Wang Yuxuan, Liu Jiazheng, Zheng Sipeng, Lu Zongqing
- The Devil Is In The Errors: Leveraging Large Language Models For Fine-grained Machine Translation Evaluation Fernandes Patrick, Deutsch Daniel, Finkelstein Mara, Riley Parker, Martins André F. T., Neubig Graham, Garg Ankush, Clark Jonathan H., Freitag Markus, Firat Orhan
- Should Chatgpt Be Biased? Challenges And Risks Of Bias In Large Language Models Ferrara Emilio
- Context-aware Meta-learning Fifty Christopher, Duan Dennis, Junkins Ronald G., Amid Ehsan, Leskovec Jure, Ré Christopher, Thrun Sebastian
- Leveraging Large Language Models For Automated Dialogue Analysis Finch Sarah E., Paek Ellie S., Choi Jinho D.
- Medalign: A Clinician-generated Dataset For Instruction Following With Electronic Medical Records Fleming Scott L., Lozano Alejandro, Haberkorn William J., Jindal Jenelle A., Reis Eduardo P., Thapa Rahul, Blankemeier Louis, Genkins Julian Z., Steinberg Ethan, Nayak Ashwin, Patel Birju S., Chiang Chia-chun, Callahan Alison, Huo Zepeng, Gatidis Sergios, Adams Scott J., Fayanju Oluseyi, Shah Shreya J., Savage Thomas, Goh Ethan, Chaudhari Akshay S., Aghaeepour Nima, Sharp Christopher, Pfeffer Michael A., Liang Percy, Chen Jonathan H., Morse Keith E., Brunskill Emma P., Fries Jason A., Shah Nigam H.
- Leveraging Large Language Models In Conversational Recommender Systems Friedman Luke, Ahuja Sameer, Allen David, Tan Zhenning, Sidahmed Hakim, Long Changbo, Xie Jun, Schubiner Gabriel, Patel Ajay, Lara Harsh, Chu Brian, Chen Zexi, Tiwari Manoj
- Chainpoll: A High Efficacy Method For LLM Hallucination Detection Friel Robert, Sanyal Atindriyo
- A Challenger To GPT-4V? Early Explorations Of Gemini In Visual Expertise Fu Chaoyou, Zhang Renrui, Wang Zihan, Huang Yubo, Zhang Zhengye, Qiu Longtian, Ye Gaoxiang, Shen Yunhang, Zhang Mengdan, Chen Peixian, Zhao Sirui, Lin Shaohui, Jiang Deqiang, Yin Di, Gao Peng, Li Ke, Li Hongsheng, Sun Xing
- Kwaiyiimath: Technical Report Fu Jiayi, Lin Lei, Gao Xiaoyang, Liu Pengli, Chen Zhengzong, Yang Zhirui, Zhang Shengnan, Zheng Xue, Li Yan, Liu Yuliang, Ye Xucheng, Liao Yiqiao, Liao Chao, Chen Bin, Song Chengru, Wan Junchen, Lin Zijia, Zhang Fuzheng, Wang Zhongyuan, Zhang Di, Gai Kun
- Codeapex: A Bilingual Programming Evaluation Benchmark For Large Language Models Fu Lingyue, Chai Huacan, Luo Shuang, Du Kounianhua, Zhang Weiming, Fan Longteng, Lei Jiayi, Rui Renting, Lin Jianghao, Fang Yuchen, Liu Yifan, Wang Jingkuan, Qi Siyuan, Zhang Kangning, Zhang Weinan, Yu Yong
- Revisiting The Knowledge Injection Frameworks Fu Peng, Zhang Yiming, Wang Haobo, Qiu Weikang, Zhao Junbo
- Misusing Tools In Large Language Models With Visual Adversarial Examples Fu Xiaohan, Wang Zihan, Li Shuheng, Gupta Rajesh K., Mireshghallah Niloofar, Berg-kirkpatrick Taylor, Fernandes Earlence
- Generate Then Select: Open-ended Visual Question Answering Guided By World Knowledge Fu Xingyu, Zhang Sheng, Kwon Gukyeong, Perera Pramuditha, Zhu Henghui, Zhang Yuhao, Li Alexander Hanbo, Wang William Yang, Wang Zhiguo, Castelli Vittorio, Ng Patrick, Roth Dan, Xiang Bing
- Chain-of-thought Hub: A Continuous Effort To Measure Large Language Models' Reasoning Performance Fu Yao, Ou Litu, Chen Mingyu, Wan Yuhao, Peng Hao, Khot Tushar
- Specializing Smaller Language Models Towards Multi-step Reasoning Fu Yao, Peng Hao, Ou Litu, Sabharwal Ashish, Khot Tushar
- Safety Alignment In NLP Tasks: Weakly Aligned Summarization As An In-context Attack Fu Yu, Li Yufei, Xiao Wen, Liu Cong, Dong Yue
- From Text To Self: Users' Perceptions Of Potential Of AI On Interpersonal Communication And Self Fu Yue, Foell Sami, Xu Xuhai, Hiniker Alexis
- Exposing Limitations Of Language Model Agents In Sequential-task Compositions On The Web Furuta Hiroki, Matsuo Yutaka, Faust Aleksandra, Gur Izzeddin
- Mitigating Hallucination In Large Multi-modal Models Via Robust Instruction Tuning Fuxiao Liu, Kevin Lin, Linjie Li, Jianfeng Wang, Yaser Yacoob, Lijuan Wang
- Sensitivity And Robustness Of Large Language Models To Prompt Template In Japanese Text Classification Tasks Gan Chengguang, Mori Tatsunori
- Ziya2: Data-centric Learning Is All Llms Need Gan Ruyi, Wu Ziwei, Sun Renliang, Lu Junyu, Wu Xiaojun, Zhang Dixiang, Pan Kunhao, He Junqing, Tian Yuanhe, Yang Ping, Yang Qi, Wang Hao, Zhang Jiaxing, Song Yan
- ASSISTGUI: Task-oriented Desktop Graphical User Interface Automation Gao Difei, Ji Lei, Bai Zechen, Ouyang Mingyu, Li Peiran, Mao Dongxing, Wu Qinchen, Zhang Weichen, Wang Peiyi, Guo Xiangwu, Wang Hengxu, Zhou Luowei, Shou Mike Zheng
- Assistgpt: A General Multi-modal Assistant That Can Plan, Execute, Inspect, And Learn Gao Difei, Ji Lei, Zhou Luowei, Lin Kevin Qinghong, Chen Joya, Fan Zihan, Shou Mike Zheng
- Policy-gradient Training Of Language Models For Ranking Gao Ge, Chang Jonathan D., Cardie Claire, Brantley Kianté, Joachims Thorsten
- Benchmarking Large Language Models With Augmented Instructions For Fine-grained Information Extraction Gao Jun, Zhao Huan, Zhang Yice, Wang Wei, Yu Changlong, Xu Ruifeng
- Llama-adapter V2: Parameter-efficient Visual Instruction Model Gao Peng, Han Jiaming, Zhang Renrui, Lin Ziyi, Geng Shijie, Zhou Aojun, Zhang Wei, Lu Pan, He Conghui, Yue Xiangyu, Li Hongsheng, Qiao Yu
- Enabling Large Language Models To Generate Text With Citations Gao Tianyu, Yen Howard, Yu Jiatong, Chen Danqi
- CLOVA: A Closed-loop Visual Assistant With Tool Usage And Update Gao Zhi, Du Yuntao, Zhang Xintong, Ma Xiaojian, Han Wenjuan, Zhu Song-chun, Li Qing
- On The Performance Of Multimodal Language Models Garg Utsav, Bas Erhan
- Simulating H.P. Lovecraft Horror Literature With The Chatgpt Large Language Model Garrido-merchán Eduardo C., Arroyo-barrigüete José Luis, Gozalo-brizuela Roberto
- Text Encoders Lack Knowledge: Leveraging Generative Llms For Domain-specific Semantic Textual Similarity Gatto Joseph, Sharif Omar, Seegmiller Parker, Bohlman Philip, Preum Sarah Masud
- Building Trustworthy Neurosymbolic AI Systems: Consistency, Reliability, Explainability, And Safety Gaur Manas, Sheth Amit
- Ahead-of-time P-tuning Gavrilov Daniil, Balagansky Nikita
- Mllm-bench: Evaluating Multimodal Llms With Per-sample Criteria Ge Wentao, Chen Shunian, Chen Guiming Hardy, Chen Zhihong, Chen Junying, Yan Shuo, Zhu Chenghao, Lin Ziyue, Xie Wenya, Zhang Xinyi, Chai Yichen, Liu Xiaoyu, Song Dingjie, Wang Xidong, Gao Anningzhe, Zhang Zhiyi, Li Jianquan, Wan Xiang, Wang Benyou
- Making Llama SEE And Draw With SEED Tokenizer Ge Yuying, Zhao Sijie, Zeng Ziyun, Ge Yixiao, Li Chen, Wang Xintao, Shan Ying
- Mblip: Efficient Bootstrapping Of Multilingual Vision-llms Geigle Gregor, Jain Abhay, Timofte Radu, Glavaš Goran
- Adapting Llms For Efficient, Personalized Information Retrieval: Methods And Implications Ghodratnama Samira, Zakershahrak Mehrdad
- Does Synthetic Data Make Large Language Models More Efficient? Gholami Sia, Omar Marwan
- Gpt-based Models Meet Simulation: How To Efficiently Use Large-scale Pre-trained Language Models Across Simulation Tasks Giabbanelli Philippe J.
- Prompt Cache: Modular Attention Reuse For Low-latency Inference Gim In, Chen Guojun, Lee Seung-seob, Sarda Nikhil, Khandelwal Anurag, Zhong Lin
- Generative AI Beyond Llms: System Implications Of Multi-modal Generation Golden Alicia, Hsia Samuel, Sun Fei, Acun Bilge, Hosmer Basil, Lee Yejin, Devito Zachary, Johnson Jeff, Wei Gu-yeon, Brooks David, Wu Carole-jean
- Can Llms Capture Human Preferences? Goli Ali, Singh Amandeep
- Evaluating The Potential Of Leading Large Language Models In Reasoning Biology Questions Gong Xinyu, Holmes Jason, Li Yiwei, Liu Zhengliang, Gan Qi, Wu Zihao, Zhang Jianli, Zou Yusong, Teng Yuxi, Jiang Tian, Zhu Hongtu, Liu Wei, Liu Tianming, Yan Yajun
- Figstep: Jailbreaking Large Vision-language Models Via Typographic Visual Prompts Gong Yichen, Ran Delong, Liu Jinyuan, Wang Conglei, Cong Tianshuo, Wang Anyu, Duan Sisi, Wang Xiaoyun
- Bloomvqa: Assessing Hierarchical Multi-modal Comprehension Gong Yunye, Shrestha Robik, Claypoole Jared, Cogswell Michael, Ray Arijit, Kanan Christopher, Divakaran Ajay
- Integration Of Large Language Models Within Cognitive Architectures For Autonomous Robots González-santamarta Miguel Á., Rodríguez-lera Francisco J., Guerrero-higueras Ángel Manuel, Matellán-olivera Vicente
- Intention And Context Elicitation With Large Language Models In The Legal Aid Intake Process Goodson Nick, Lu Rongfei
- Not What You've Signed Up For: Compromising Real-world Llm-integrated Applications With Indirect Prompt Injection Greshake Kai, Abdelnabi Sahar, Mishra Shailesh, Endres Christoph, Holz Thorsten, Fritz Mario
- AMAGO: Scalable In-context Reinforcement Learning For Adaptive Agents Grigsby Jake, Fan Linxi, Zhu Yuke
- Large Language Models Are Zero-shot Time Series Forecasters Gruver Nate, Finzi Marc, Qiu Shikai, Wilson Andrew Gordon
- Pokemqa: Programmable Knowledge Editing For Multi-hop Question Answering Gu Hengrui, Zhou Kaixiong, Han Xiaotian, Liu Ninghao, Wang Ruobing, Wang Xin
- Neuron Patching: Semantic-based Neuron-level Language Model Repair For Code Generation Gu Jian, Aleti Aldeida, Chen Chunyang, Zhang Hongyu
- Linguistically Informed Chatgpt Prompts To Enhance Japanese-chinese Machine Translation: A Case Study On Attributive Clauses Gu Wenshi
- Digital Socrates: Evaluating Llms Through Explanation Critiques Gu Yuling, Tafjord Oyvind, Clark Peter
- Leveraging Pre-trained Large Language Models To Construct And Utilize World Models For Model-based Task Planning Guan Lin, Valmeekam Karthik, Sreedharan Sarath, Kambhampati Subbarao
- Intelligent Virtual Assistants With Llm-based Process Automation Guan Yanchu, Wang Dong, Chu Zhixuan, Wang Shiyu, Ni Feiyue, Song Ruihua, Li Longfei, Gu Jinjie, Zhuang Chenyi
- Voyager: An Open-ended Embodied Agent With Large Language Models Guanzhi Wang, Yuqi Xie, Yunfan Jiang, Ajay Mandlekar, Chaowei Xiao, Yuke Zhu, Linxi Fan, Anima Anandkumar
- The False Promise Of Imitating Proprietary Llms Gudibande Arnav, Wallace Eric, Snell Charlie, Geng Xinyang, Liu Hao, Abbeel Pieter, Levine Sergey, Song Dawn
- Hallucinations In Large Multilingual Translation Models Guerreiro Nuno M., Alves Duarte, Waldendorf Jonas, Haddow Barry, Birch Alexandra, Colombo Pierre, Martins André F. T.
- Hiformer: Heterogeneous Feature Interactions Learning With Transformers For Recommender Systems Gui Huan, Wang Ruoxi, Yin Ke, Jin Long, Kula Maciej, Xu Taibai, Hong Lichan, Chi Ed H.
- Reinforced Self-training (rest) For Language Modeling Gulcehre Caglar, Paine Tom Le, Srinivasan Srivatsan, Konyushkova Ksenia, Weerts Lotte, Sharma Abhishek, Siddhant Aditya, Ahern Alex, Wang Miaosen, Gu Chenjie, Macherey Wolfgang, Doucet Arnaud, Firat Orhan, De Freitas Nando
- Beyond Imitation: Leveraging Fine-grained Quality Signals For Alignment Guo Geyang, Zhao Ranchi, Tang Tianyi, Zhao Wayne Xin, Wen Ji-rong
- Compresso: Structured Pruning With Collaborative Prompting Learns Compact Large Language Models Guo Song, Xu Jiahang, Zhang Li Lyna, Yang Mao
- Knowledgenavigator: Leveraging Large Language Models For Enhanced Reasoning Over Knowledge Graph Guo Tiezheng, Yang Qingwen, Wang Chen, Liu Yanyi, Li Pan, Tang Jiawei, Li Dapeng, Wen Yingyou
- Doremi: Grounding Language Model By Detecting And Recovering From Plan-execution Misalignment Guo Yanjiang, Wang Yen-jen, Zha Lihan, Jiang Zheyuan, Chen Jianyu
- Learning To Plan With Natural Language Guo Yiduo, Liang Yaobo, Wu Chenfei, Wu Wenshan, Zhao Dongyan, Duan Nan
- PPTC Benchmark: Evaluating Large Language Models For Powerpoint Task Completion Guo Yiduo, Zhang Zekai, Liang Yaobo, Zhao Dongyan, Duan Nan
- Arthmodel: Enhance Arithmetic Skills To Large Language Model Guo Yingdi
- Point-bind & Point-llm: Aligning Point Cloud With Multi-modality For 3D Understanding, Generation, And Instruction Following Guo Ziyu, Zhang Renrui, Zhu Xiangyang, Tang Yiwen, Ma Xianzheng, Han Jiaming, Chen Kexin, Gao Peng, Li Xianzhi, Li Hongsheng, Heng Pheng-ann
- Probing Quantifier Comprehension In Large Language Models: Another Example Of Inverse Scaling Gupta Akshat
- A Real-world Webagent With Planning, Long Context Understanding, And Program Synthesis Gur Izzeddin, Furuta Hiroki, Huang Austin, Safdari Mustafa, Matsuo Yutaka, Eck Douglas, Faust Aleksandra
- Are Large Language Model-based Evaluators The Solution To Scaling Up Multilingual Evaluation? Hada Rishav, Gumma Varun, De Wynter Adrian, Diddee Harshita, Ahmed Mohamed, Choudhury Monojit, Bali Kalika, Sitaram Sunayana
- Deception Abilities Emerged In Large Language Models Hagendorff Thilo
- A Theory Of Emergent In-context Learning As Implicit Structure Induction Hahn Michael, Goyal Navin
- Opiniongpt: Modelling Explicit Biases In Instruction-tuned Llms Haller Patrick, Aynetdinov Ansar, Akbik Alan
- Dialcot Meets PPO: Decomposing And Exploring Reasoning Paths In Smaller Language Models Han Chengcheng, Du Xiaowei, Zhang Che, Lian Yixin, Li Xiang, Gao Ming, Wang Baoyuan
- Lm-infinite: Zero-shot Extreme Length Generalization For Large Language Models Han Chi, Wang Qifan, Peng Hao, Xiong Wenhan, Chen Yu, Ji Heng, Wang Sinong
- Cheddar: Student-chatgpt Dialogue In EFL Writing Education Han Jieun, Yoo Haneul, Myung Junho, Kim Minsun, Lee Tak Yeon, Ahn So-yeon, Oh Alice
- Chartllama: A Multimodal LLM For Chart Understanding And Generation Han Yucheng, Zhang Chi, Chen Xin, Yang Xu, Wang Zhibin, Yu Gang, Fu Bin, Zhang Hanwang
- "Mistakes Help Us Grow": Facilitating And Evaluating Growth Mindset Supportive Language In Classrooms Handa Kunal, Clapper Margaret, Boyle Jessica, Wang Rose E, Yang Diyi, Yeager David S, Demszky Dorottya
- Personalisation Within Bounds: A Risk Taxonomy And Policy Framework For The Alignment Of Large Language Models With Personalised Feedback Hannah Rose Kirk, Bertie Vidgen, Paul Röttger, Scott A. Hale
- Blockwise Parallel Transformer For Large Context Models Hao Liu, Pieter Abbeel
- Toolkengpt: Augmenting Frozen Language Models With Massive Tools Via Tool Embeddings Hao Shibo, Liu Tianyang, Wang Zhen, Hu Zhiting
- Towards Efficient Vision-language Tuning: More Information Density, More Generalizability Hao Tianxiang, Lyu Mengyao, Chen Hui, Zhao Sicheng, Ding Xiaohan, Han Jungong, Ding Guiguang
- Visual Instruction Tuning Haotian Liu, Chunyuan Li, Qingyang Wu, Yong Jae Lee
- Test-time Training On Nearest Neighbors For Large Language Models Hardt Moritz, Sun Yu
- Herd: Using Multiple, Smaller Llms To Match The Performances Of Proprietary, Large Llms Via An Intelligent Composer Hari Surya Narayanan, Thomson Matt
- The Political Ideology Of Conversational AI: Converging Evidence On Chatgpt's Pro-environmental, Left-libertarian Orientation Hartmann Jochen, Schwenzow Jasper, Witte Maximilian
- Sok: Memorization In General-purpose Large Language Models Hartmann Valentin, Suri Anshuman, Bindschaedler Vincent, Evans David, Tople Shruti, West Robert
- Zero- And Few-shot Prompting With Llms: A Comparative Study With Fine-tuned Models For Bangla Sentiment Analysis Hasan Md. Arid, Das Shudipta, Anjum Afiyat, Alam Firoj, Anjum Anika, Sarker Avijit, Noori Sheak Rashed Haider
- Chatgpt As Your Personal Data Scientist Hassan Md Mahadi, Knipper Alex, Santu Shubhra Kanti Karmaker
- Multilingual Language Models Are Not Multicultural: A Case Study In Emotion Havaldar Shreya, Rai Sunny, Singhal Bhumika, Liu Langchen, Guntuku Sharath Chandra, Ungar Lyle
- Spear Phishing With Large Language Models Hazell Julian
- Linear Latent World Models In Simple Transformers: A Case Study On Othello-gpt Hazineh Dean S., Zhang Zechen, Chiu Jeffery
- Multi-modal Latent Space Learning For Chain-of-thought Reasoning In Language Models He Liqi, Li Zuchao, Cai Xiantao, Wang Ping
- Teacherlm: Teaching To Fish Rather Than Giving The Fish, Language Modeling Likewise He Nan, Lai Hanyu, Zhao Chenyang, Cheng Zirui, Pan Junting, Qin Ruoyu, Lu Ruofan, Lu Rui, Zhang Yunchen, Zhao Gangming, Hou Zhaohui, Huang Zhiyuan, Lu Shaoqing, Liang Ding, Zhan Mingjie
- Can Large Language Models Understand Real-world Complex Instructions? He Qianyu, Zeng Jie, Huang Wenhao, Chen Lina, Xiao Jin, He Qianxi, Zhou Xunzhe, Chen Lida, Wang Xintao, Huang Yuncheng, Ye Haoning, Li Zihan, Chen Shisong, Zhang Yikai, Gu Zhouhong, Liang Jiaqing, Xiao Yanghua
- You Only Prompt Once: On The Capabilities Of Prompt Learning On Large Language Models To Tackle Toxic Content He Xinlei, Zannettou Savvas, Shen Yun, Zhang Yang
- Multimodal Graph Transformer For Multimodal Question Answering He Xuehai, Wang Xin Eric
- Chatgpt For Zero-shot Dialogue State Tracking: A Solution Or An Opportunity? Heck Michael, Lubis Nurul, Ruppik Benjamin, Vukovic Renato, Feng Shutong, Geishauser Christian, Lin Hsien-chin, Van Niekerk Carel, Gašić Milica
- Analyzing The Efficacy Of An Llm-only Approach For Image-based Document Question Answering Hegde Nidhi, Paul Sujoy, Madan Gagan, Aggarwal Gaurav
- Divide Et Impera: Multi-transformer Architectures For Complex Nlp-tasks Helland Solveig, Gavagnin Elena, De Spindler Alexandre
- Exploring The Responses Of Large Language Models To Beginner Programmers' Help Requests Hellas Arto, Leinonen Juho, Sarsa Sami, Koutcheme Charles, Kujanpää Lilja, Sorva Juha
- Towards Verifiable Text Generation With Symbolic References Hennigen Lucas Torroba, Shen Shannon, Nrusimha Aniruddha, Gapp Bernhard, Sontag David, Kim Yoon
- Stochastic Parrots Looking For Stochastic Parrots: Llms Are Easy To Fine-tune And Hard To Detect With Other Llms Henrique Da Silva Gameiro, Kucharavy Andrei, Guerraoui Rachid
- Large Language Models Are Biased To Overestimate Profoundness Herrera-berg Eugenio, Browne Tomás Vergara, León-villagrá Pablo, Vives Marc-lluís, Calderon Cristian Buc
- Assessing The Efficacy Of Large Language Models In Generating Accurate Teacher Responses Hicke Yann, Masand Abhishek, Guo Wentao, Gangavarapu Tushaar
- Llm-japanese-dataset V0: Construction Of Japanese Chat Dataset For Large Language Models And Its Methodology Hirano Masanori, Suzuki Masahiro, Sakaji Hiroki
- Newsgpt: Chatgpt Integration For Robot-reporter Hireche Abdelhadi, Belkacem Abdelkader Nasreddine, Jamil Sadia, Chen Chao
- Comparative Analysis Of GPT-4 And Human Graders In Evaluating Praise Given To Students In Synthetic Dialogues Hirunyasiri Dollaya, Thomas Danielle R., Lin Jionghao, Koedinger Kenneth R., Aleven Vincent
- L2MAC: Large Language Model Automatic Computer For Extensive Code Generation Holt Samuel, Luyten Max Ruiz, Van Der Schaar Mihaela
- Zero-shot Goal-directed Dialogue Via RL On Imagined Conversations Hong Joey, Levine Sergey, Dragan Anca
- DP-OPT: Make Large Language Model Your Privacy-preserving Prompt Engineer Hong Junyuan, Wang Jiachen T., Zhang Chenhui, Li Zhangheng, Li Bo, Wang Zhangyang
- Conversational AI Threads For Visualizing Multidimensional Datasets Hong Matt-heun, Crisan Anamaria
- Cogagent: A Visual Language Model For GUI Agents Hong Wenyi, Wang Weihan, Lv Qingsong, Xu Jiazheng, Yu Wenmeng, Ji Junhui, Wang Yan, Wang Zihan, Zhang Yuxuan, Li Juanzi, Xu Bin, Dong Yuxiao, Ding Ming, Tang Jie
- Do Large Language Models And Humans Have Similar Behaviors In Causal Inference With Script Knowledge? Hong Xudong, Ryzhova Margarita, Biondi Daniel Adrian, Demberg Vera
- 3D-LLM: Injecting The 3D World Into Large Language Models Hong Yining, Zhen Haoyu, Chen Peihao, Zheng Shuhong, Du Yilun, Chen Zhenfang, Gan Chuang
- SPEED: Speculative Pipelined Execution For Efficient Decoding Hooper Coleman, Kim Sehoon, Mohammadzadeh Hiva, Genc Hasan, Keutzer Kurt, Gholami Amir, Shao Sophia
- More Robots Are Coming: Large Multimodal Models (chatgpt) Can Solve Visually Diverse Images Of Parsons Problems Hou Irene, Man Owen, Mettille Sophie, Gutierrez Sebastian, Angelikas Kenneth, Macneil Stephen
- Bad Actor, Good Advisor: Exploring The Role Of Large Language Models In Fake News Detection Hu Beizhe, Sheng Qiang, Cao Juan, Shi Yuhui, Li Yang, Wang Danding, Qi Peng
- Chain-of-symbol Prompting Elicits Planning In Large Language Models Hu Hanxu, Lu Hongyuan, Zhang Huajian, Song Yun-ze, Lam Wai, Zhang Yue
- Context-aware Prompt Tuning For Vision-language Model With Dual-alignment Hu Hongyu, Lin Tiancheng, Wang Jie, Sun Zhenbang, Xu Yi
- Uncertainty-aware Unlikelihood Learning Improves Generative Aspect Sentiment Quad Prediction Hu Mengting, Bai Yinhao, Wu Yike, Zhang Zhen, Zhang Liqi, Gao Hang, Zhao Shiwan, Huang Minlie
- Meta-learning Online Adaptation Of Language Models Hu Nathan, Mitchell Eric, Manning Christopher D., Finn Chelsea
- Scaled Prompt-tuning For Few-shot Natural Language Generation Hu Ting, Meinel Christoph, Yang Haojin
- BLIVA: A Simple Multimodal LLM For Better Handling Of Text-rich Visual Questions Hu Wenbo, Xu Yifan, Li Yi, Li Weiyue, Chen Zeyuan, Tu Zhuowen
- Visual Program Distillation: Distilling Tools And Programmatic Reasoning Into Vision-language Models Hu Yushi, Stretcu Otilia, Lu Chun-ta, Viswanathan Krishnamurthy, Hata Kenji, Luo Enming, Krishna Ranjay, Fuxman Ariel
- Token-level Adversarial Prompt Detection Based On Perplexity Measures And Contextual Information Hu Zhengmian, Wu Gang, Mitra Saayan, Zhang Ruiyi, Sun Tong, Huang Heng, Swaminathan Viswanathan
- Deploying And Evaluating Llms To Program Service Mobile Robots Hu Zichao, Lucchetti Francesca, Schlesinger Claire, Saxena Yash, Freeman Anders, Modak Sadanand, Guha Arjun, Biswas Joydeep
- SMILE: Single-turn To Multi-turn Inclusive Language Expansion Via Chatgpt For Mental Health Support Huachuan Qiu, Hongliang He, Shuai Zhang, Anqi Li, Zhenzhong Lan
- Agentcoder: Multi-agent-based Code Generation With Iterative Testing And Optimisation Huang Dong, Zhang Jie M., Luck Michael, Bu Qingwen, Qing Yuhao, Cui Heming
- Acegpt, Localizing Large Language Models In Arabic Huang Huang, Yu Fei, Zhu Jianqing, Sun Xuening, Cheng Hao, Song Dingjie, Chen Zhihong, Alharthi Abdulmohsen, An Bang, He Juncai, Liu Ziche, Zhang Zhiyi, Chen Junying, Li Jianquan, Wang Benyou, Zhang Lian, Sun Ruoyu, Wan Xiang, Li Haizhou, Xu Jinchao
- Revisiting The Reliability Of Psychological Scales On Large Language Models Huang Jen-tse, Wang Wenxuan, Lam Man Ho, Li Eric John, Jiao Wenxiang, Lyu Michael R.
- T2i-compbench: A Comprehensive Benchmark For Open-world Compositional Text-to-image Generation Huang Kaiyi, Sun Kaiyue, Xie Enze, Li Zhenguo, Liu Xihui
- A Survey On Hallucination In Large Language Models: Principles, Taxonomy, Challenges, And Open Questions Huang Lei, Yu Weijiang, Ma Weitao, Zhong Weihong, Feng Zhangyin, Wang Haotian, Chen Qianglong, Peng Weihua, Feng Xiaocheng, Qin Bing, Liu Ting
- Lawyer Llama Technical Report Huang Quzhe, Tao Mingxu, Zhang Chen, An Zhenwei, Jiang Cong, Chen Zhibin, Wu Zirui, Feng Yansong
- Lateval: An Interactive Llms Evaluation Benchmark With Incomplete Information From Lateral Thinking Puzzles Huang Shulin, Ma Shirong, Li Yinghui, Huang Mengzuo, Zou Wuhe, Zhang Weidong, Zheng Hai-tao
- Grounded Decoding: Guiding Text Generation With Grounded Models For Embodied Agents Huang Wenlong, Xia Fei, Shah Dhruv, Driess Danny, Zeng Andy, Lu Yao, Florence Pete, Mordatch Igor, Levine Sergey, Hausman Karol, Ichter Brian
- Evaluating And Enhancing Large Language Models For Conversational Reasoning On Knowledge Graphs Huang Yuxuan, Shi Lida, Liu Anqi, Xu Hao
- How Does Pretraining Improve Discourse-aware Translation? Huang Zhihong, Wang Longyue, Liu Siyou, Wong Derek F.
- Walking A Tightrope -- Evaluating Large Language Models In High-risk Domains Hung Chia-chien, Rim Wiem Ben, Frost Lindsay, Bruckner Lars, Lawrence Carolin
- Retrieving Supporting Evidence For Generative Question Answering Huo Siqing, Arabzadeh Negar, Clarke Charles L. A.
- Towards Leveraging Llms For Conditional QA Hussain Syed-amad, Dakle Parag Pravin, Rallabandi Saikrishna, Raghavan Preethi
- Vimq: A Vietnamese Medical Question Dataset For Healthcare Dialogue System Development Huy Ta Duc, Tu Nguyen Anh, Vu Tran Hoang, Minh Nguyen Phuc, Phan Nguyen, Bui Trung H., Truong Steven Q. H.
- Large Language Models As Sous Chefs: Revising Recipes With GPT-3 Hwang Alyssa, Li Bryan, Hou Zhaoyi, Roth Dan
- Pre-gated Moe: An Algorithm-system Co-design For Fast And Scalable Mixture-of-expert Inference Hwang Ranggi, Wei Jianyu, Cao Shijie, Hwang Changho, Tang Xiaohu, Cao Ting, Yang Mao
- Conceptual Model Interpreter For Large Language Models Härer Felix
- Minimizing Factual Inconsistency And Hallucination In Large Language Models I Muneeswaran, Saxena Shreya, Prasad Siva, Prakash M V Sai, Shankar Advaith, V Varun, Vaddina Vishal, Gopalakrishnan Saisubramaniam
- Glot500: Scaling Multilingual Corpora And Language Models To 500 Languages Imani Ayyoob, Lin Peiqin, Kargaran Amir Hossein, Severini Silvia, Sabet Masoud Jalili, Kassner Nora, Ma Chunlan, Schmid Helmut, Martins André F. T., Yvon François, Schütze Hinrich
- Pushing Boundaries: Exploring Zero Shot Object Classification With Large Multimodal Models Islam Ashhadul, Biswas Md. Rafiul, Zaghouani Wajdi, Belhaouari Samir Brahim, Shah Zubair
- Instructed To Bias: Instruction-tuned Language Models Exhibit Emergent Cognitive Bias Itzhak Itay, Stanovsky Gabriel, Rosenfeld Nir, Belinkov Yonatan
- Camels In A Changing Climate: Enhancing LM Adaptation With Tulu 2 Ivison Hamish, Wang Yizhong, Pyatkin Valentina, Lambert Nathan, Peters Matthew, Dasigi Pradeep, Jang Joel, Wadden David, Smith Noah A., Beltagy Iz, Hajishirzi Hannaneh
- Coarse-tuning Models Of Code With Reinforcement Learning Feedback Jain Abhinav, Adiole Chima, Chaudhuri Swarat, Reps Thomas, Jermaine Chris
- Neftune: Noisy Embeddings Improve Instruction Finetuning Jain Neel, Chiang Ping-yeh, Wen Yuxin, Kirchenbauer John, Chu Hong-min, Somepalli Gowthami, Bartoldson Brian R., Kailkhura Bhavya, Schwarzschild Avi, Saha Aniruddha, Goldblum Micah, Geiping Jonas, Goldstein Tom
- Cotran: An Llm-based Code Translator Using Reinforcement Learning With Feedback From Compiler And Symbolic Execution Jana Prithwish, Jha Piyush, Ju Haoyang, Kishore Gautham, Mahajan Aryan, Ganesh Vijay
- Personalized Soups: Personalized Large Language Model Alignment Via Post-hoc Parameter Merging Jang Joel, Kim Seungone, Lin Bill Yuchen, Wang Yizhong, Hessel Jack, Zettlemoyer Luke, Hajishirzi Hannaneh, Choi Yejin, Ammanabrolu Prithviraj
- Consistency Analysis Of Chatgpt Jang Myeongjun Erik, Lukasiewicz Thomas
- Chatbot Is Not All You Need: Information-rich Prompting For More Realistic Responses Jeong Seokhoon, Makhmud Assentay
- Chit-chat Or Deep Talk: Prompt Engineering For Process Mining Jessen Urszula, Sroka Michal, Fahland Dirk
- Just CHOP: Embarrassingly Simple LLM Compression Jha Ananya Harsh, Sherborne Tom, Walsh Evan Pete, Groeneveld Dirk, Strubell Emma, Beltagy Iz
- Large Language Models As Automated Aligners For Benchmarking Vision-language Models Ji Yuanfeng, Ge Chongjian, Kong Weikai, Xie Enze, Liu Zhengying, Li Zhengguo, Luo Ping
- Exploring The Impact Of Instruction Data Scaling On Large Language Models: An Empirical Study On Real-world Use Cases Ji Yunjie, Deng Yong, Gong Yan, Peng Yiping, Niu Qiang, Zhang Lei, Ma Baochang, Li Xiangang
- Benchmarking And Explaining Large Language Model-based Code Generation: A Causality-centric Approach Ji Zhenlan, Ma Pingchuan, Li Zongjie, Wang Shuai
- Towards Mitigating Hallucination In Large Language Models Via Self-reflection Ji Ziwei, Yu Tiezheng, Xu Yan, Lee Nayeon, Ishii Etsuko, Fung Pascale
- Expedited Training Of Visual Conditioned Language Generation Via Redundancy Reduction Jian Yiren, Liu Tingkai, Tao Yunzhe, Zhang Chunhui, Vosoughi Soroush, Yang Hongxia
- Tigerscore: Towards Building Explainable Metric For All Text Generation Tasks Jiang Dongfu, Li Yishan, Zhang Ge, Huang Wenhao, Lin Bill Yuchen, Chen Wenhu
- You Only Forward Once: Prediction And Rationalization In A Single Forward Pass Jiang Han, Duan Junwen, Qu Zhe, Wang Jianxin
- Graphologue: Exploring Large Language Model Responses With Interactive Diagrams Jiang Peiling, Rayan Jude, Dow Steven P., Xia Haijun
- Forcing Generative Models To Degenerate Ones: The Power Of Data Poisoning Attacks Jiang Shuli, Kadhe Swanand Ravindra, Zhou Yi, Cai Ling, Baracaldo Nathalie
- Prompt Packer: Deceiving Llms Through Compositional Instruction With Hidden Attacks Jiang Shuyu, Chen Xingshu, Tang Rui
- Resprompt: Residual Connection Prompting Advances Multi-step Reasoning In Large Language Models Jiang Song, Shakeri Zahra, Chan Aaron, Sanjabi Maziar, Firooz Hamed, Xia Yinglong, Akyildiz Bugra, Sun Yizhou, Li Jinchao, Wang Qifan, Celikyilmaz Asli
- Iluvui: Instruction-tuned Language-vision Modeling Of Uis From Machine Conversations Jiang Yue, Schoop Eldon, Swearngin Amanda, Nichols Jeffrey
- Lion: Adversarial Distillation Of Proprietary Large Language Models Jiang Yuxin, Chan Chunkit, Chen Mingyang, Wang Wei
- Followbench: A Multi-level Fine-grained Constraints Following Benchmark For Large Language Models Jiang Yuxin, Wang Yufei, Zeng Xingshan, Zhong Wanjun, Li Liangyou, Mi Fei, Shang Lifeng, Jiang Xin, Liu Qun, Wang Wei
- Generative Calibration For In-context Learning Jiang Zhongtao, Zhang Yuanzhe, Liu Cao, Zhao Jun, Liu Kang
- How Can Recommender Systems Benefit From Large Language Models: A Survey Jianghao Lin, Xinyi Dai, Yunjia Xi, Weiwen Liu, Bo Chen, Hao Zhang, Yong Liu, Chuhan Wu, Xiangyang Li, Chenxu Zhu, Huifeng Guo, Yong Yu, Ruiming Tang, Weinan Zhang
- Llara: Large Language-recommendation Assistant Jiayi Liao, Sihang Li, Zhengyi Yang, Jiancan Wu, Yancheng Yuan, Xiang Wang, Xiangnan He
- Evaluation And Enhancement Of Semantic Grounding In Large Vision-language Models Jiaying Lu, Jinmeng Rao, Kezhen Chen, Xiaoyuan Guo, Yawen Zhang, Baochen Sun, Carl Yang, Jie Yang
- Prompt-based Length Controlled Generation With Reinforcement Learning Jie Renlong, Meng Xiaojun, Shang Lifeng, Jiang Xin, Liu Qun
- Emergent Representations Of Program Semantics In Language Models Trained On Programs Jin Charles, Rinard Martin
- When Large Language Models Meet Personalization: Perspectives Of Challenges And Opportunities Jin Chen, Zheng Liu, Xu Huang, Chenwang Wu, Qi Liu, Gangwei Jiang, Yuanhao Pu, Yuxuan Lei, Xiaolong Chen, Xingmei Wang, Defu Lian, Enhong Chen
- Medcpt: Contrastive Pre-trained Transformers With Large-scale Pubmed Search Logs For Zero-shot Biomedical Information Retrieval Jin Qiao, Kim Won, Chen Qingyu, Comeau Donald C., Yeganova Lana, Wilbur W. John, Lu Zhiyong
- Better To Ask In English: Cross-lingual Evaluation Of Large Language Models For Healthcare Queries Jin Yiqiao, Chandra Mohit, Verma Gaurav, Hu Yibo, De Choudhury Munmun, Kumar Srijan
- Grounding Language Models To Images For Multimodal Inputs And Outputs Jing Yu Koh, Ruslan Salakhutdinov, Daniel Fried
- Vcoder: Versatile Vision Encoders For Multimodal Large Language Models Jitesh Jain, Jianwei Yang, Humphrey Shi
- Is Chatgpt Fair For Recommendation? Evaluating Fairness In Large Language Model Recommendation Jizhi Zhang, Keqin Bao, Yang Zhang, Wenjie Wang, Fuli Feng, Xiangnan He
- Does GPT-4 Pass The Turing Test? Jones Cameron R., Bergen Benjamin K.
- Teaching Language Models To Hallucinate Less With Synthetic Tasks Jones Erik, Palangi Hamid, Simões Clarisse, Chandrasekaran Varun, Mukherjee Subhabrata, Mitra Arindam, Awadallah Ahmed, Kamar Ece
- Small Language Models Fine-tuned To Coordinate Larger Language Models Improve Complex Reasoning Juneja Gurusha, Dutta Subhabrata, Chakrabarti Soumen, Manchanda Sunny, Chakraborty Tanmoy
- A Case Study On Test Case Construction With Large Language Models: Unveiling Practical Insights And Challenges Junior Roberto Francisco De Lima, Presta Luiz Fernando Paes De Barros, Borborema Lucca Santos, Da Silva Vanderson Nogueira, Dahia Marcio Leal De Melo, Santos Anderson Carlos Sousa E
- A Comprehensive Capability Analysis Of GPT-3 And GPT-3.5 Series Models Junjie Ye, Xuanting Chen, Nuo Xu, Can Zu, Zekai Shao, Shichun Liu, Yuhan Cui, Zeyang Zhou, Chao Gong, Yang Shen, Jie Zhou, Siming Chen, Tao Gui, Qi Zhang, Xuanjing Huang
- Recommendation As Instruction Following: A Large Language Model Empowered Recommendation Approach Junjie Zhang, Ruobing Xie, Yupeng Hou, Wayne Xin Zhao, Leyu Lin, Ji-rong Wen
- Is Chatgpt A Good Recommender? A Preliminary Study Junling Liu, Chao Liu, Peilin Zhou, Renjie Lv, Kang Zhou, Yan Zhang
- Synthetic Data Generation In Low-resource Settings Via Fine-tuning Of Large Language Models Kaddour Jean, Liu Qi
- Domain-specific Code Language Models: Unraveling The Potential For HPC Codes And Tasks Kadosh Tal, Hasabnis Niranjan, Vo Vy A., Schneider Nadav, Krien Neva, Capota Mihai, Wasay Abdul, Ahmed Nesreen, Willke Ted, Tamir Guy, Pinter Yuval, Mattson Timothy, Oren Gal
- Robust Recommender System: A Survey And Future Directions Kaike Zhang, Qi Cao, Fei Sun, Yunfan Wu, Shuchang Tao, Huawei Shen, Xueqi Cheng
- Robust Training For Conversational Question Answering Models With Reinforced Reformulation Generation Kaiser Magdalena, Roy Rishiraj Saha, Weikum Gerhard
- Calibrated Language Models Must Hallucinate Kalai Adam Tauman, Vempala Santosh S.
- Swectrl-mini: A Data-transparent Transformer-based Large Language Model For Controllable Text Generation In Swedish Kalpakchi Dmytro, Boye Johan
- Syntax-guided Transformers: Elevating Compositional Generalization And Grounding In Multimodal Environments Kamali Danial, Kordjamshidi Parisa
- Evaluating Open-domain Question Answering In The Era Of Large Language Models Kamalloo Ehsan, Dziri Nouha, Clarke Charles L. A., Rafiei Davood
- Impact Of Co-occurrence On Factual Knowledge Of Large Language Models Kang Cheongwoong, Choi Jaesik
- Knowledge-augmented Reasoning Distillation For Small Language Models In Knowledge-intensive Tasks Kang Minki, Lee Seanie, Baek Jinheon, Kawaguchi Kenji, Hwang Sung Ju
- Lagr-seq: Language-guided Reinforcement Learning With Sample-efficient Querying Karimpanal Thommen George, Semage Laknath Buddhika, Rana Santu, Le Hung, Tran Truyen, Gupta Sunil, Venkatesh Svetha
- Shs-nlp At Radsum23: Domain-adaptive Pre-training Of Instruction-tuned Llms For Radiology Report Impression Generation Karn Sanjeev Kumar, Ghosh Rikhiya, P Kusuma, Farri Oladimeji
- Exploring Automatic Evaluation Methods Based On A Decoder-based LLM For Text Generation Kasahara Tomohito, Kawahara Daisuke
- Evaluating Large Language Models For Health-related Queries With Presuppositions Kaur Navreet, Choudhury Monojit, Pruthi Danish
- Sorted Llama: Unlocking The Potential Of Intermediate Layers Of Large Language Models For Dynamic Inference Kavehzadeh Parsa, Valipour Mojtaba, Tahaei Marzieh, Ghodsi Ali, Chen Boxing, Rezagholizadeh Mehdi
- Appropriateness Is All You Need! Kempt Hendrik, Lavie Alon, Nagel Saskia K.
- Xcodeeval: A Large Scale Multilingual Multitask Benchmark For Code Understanding, Generation, Translation And Retrieval Khan Mohammad Abdullah Matin, Bari M Saiful, Do Xuan Long, Wang Weishi, Parvez Md Rizwan, Joty Shafiq
- Introducing Language Guidance In Prompt-based Continual Learning Khan Muhammad Gul Zain Ali, Naeem Muhammad Ferjad, Van Gool Luc, Stricker Didier, Tombari Federico, Afzal Muhammad Zeshan
- Zzzgpt: An Interactive GPT Approach To Enhance Sleep Quality Khaokaew Yonchanok, Ji Kaixin, Nguyen Thuc Hanh, Kegalle Hiruni, Alaofi Marwah, Xue Hao, Salim Flora D.
- Tst\(^\mathrm{r}\): Target Similarity Tuning Meets The Real World Khatry Anirudh, Gulwani Sumit, Gupta Priyanshu, Le Vu, Singha Ananya, Singh Mukul, Verbruggen Gust
- Reliability Check: An Analysis Of Gpt-3's Response To Sensitive Topics And Prompt Wording Khatun Aisha, Brown Daniel G.
- Gptaraeval: A Comprehensive Evaluation Of Chatgpt On Arabic NLP Khondaker Md Tawkat Islam, Waheed Abdul, Nagoudi El Moatez Billah, Abdul-mageed Muhammad
- How Secure Is Code Generated By Chatgpt? Khoury Raphaël, Avila Anderson R., Brunelle Jacob, Camara Baba Mamadou
- Auto-survey Challenge Khuong Thanh Gia Hieu, Rachmat Benedictus Kent
- Visually-situated Natural Language Understanding With Contrastive Reading Model And Frozen Large Language Models Kim Geewook, Lee Hodong, Kim Daehee, Jung Haeji, Park Sanghee, Kim Yoonsik, Yun Sangdoo, Kil Taeho, Lee Bado, Park Seunghyun
- Language Models Can Solve Computer Tasks Kim Geunwoo, Baldi Pierre, Mcaleer Stephen
- Fantom: A Benchmark For Stress-testing Machine Theory Of Mind In Interactions Kim Hyunwoo, Sclar Melanie, Zhou Xuhui, Bras Ronan Le, Kim Gunhee, Choi Yejin, Sap Maarten
- LLM4SGG: Large Language Models For Weakly Supervised Scene Graph Generation Kim Kibum, Yoon Kanghoon, Jeon Jaehyeong, In Yeonjun, Moon Jinyoung, Kim Donghyun, Park Chanyoung
- Prometheus: Inducing Fine-grained Evaluation Capability In Language Models Kim Seungone, Shin Jamin, Cho Yejin, Jang Joel, Longpre Shayne, Lee Hwaran, Yun Sangdoo, Shin Seongjin, Kim Sungdong, Thorne James, Seo Minjoon
- Aligning Large Language Models Through Synthetic Feedback Kim Sungdong, Bae Sanghwan, Shin Jamin, Kang Soyoung, Kwak Donghyun, Yoo Kang Min, Seo Minjoon
- MAGVLT: Masked Generative Vision-and-language Transformer Kim Sungwoong, Jo Daejin, Lee Donghoon, Kim Jongmin
- Lmcanvas: Object-oriented Interaction To Personalize Large Language Model-powered Writing Environments Kim Tae Soo, Sarkar Arghya, Lee Yoonjoo, Chang Minsuk, Kim Juho
- Mindfuldiary: Harnessing Large Language Model To Support Psychiatric Patients' Journaling Kim Taewan, Bae Seolyeong, Kim Hyun Ah, Lee Su-woo, Hong Hwajung, Yang Chanmo, Kim Young-ho
- Finequant: Unlocking Efficiency With Fine-grained Weight-only Quantization For Llms Kim Young Jin, Henry Rawn, Fahim Raffy, Awadalla Hany Hassan
- Exploiting Language Models As A Source Of Knowledge For Cognitive Agents Kirk James R., Wray Robert E., Laird John E.
- Transformer Language Models Handle Word Frequency In Prediction Head Kobayashi Goro, Kuribayashi Tatsuki, Yokoi Sho, Inui Kentaro
- Conversational Ai-powered Design: Chatgpt As Designer, User, And Product Kocaballi A. Baki
- Generating Images With Multimodal Language Models Koh Jing Yu, Fried Daniel, Salakhutdinov Ruslan
- Building A Llama2-finetuned LLM For Odia Language Utilizing Domain Knowledge Instruction Set Kohli Guneet Singh, Parida Shantipriya, Sekhar Sambit, Saha Samirit, Nair Nipun B, Agarwal Parul, Khosla Sonal, Patiyal Kusumlata, Dhal Debasish
- Benchmarking Cognitive Biases In Large Language Models As Evaluators Koo Ryan, Lee Minhwa, Raheja Vipul, Park Jong Inn, Kim Zae Myung, Kang Dongyeop
- NOLA: Compressing Lora Using Linear Combination Of Random Basis Koohpayegani Soroush Abbasi, Navaneet Kl, Nooralinejad Parsa, Kolouri Soheil, Pirsiavash Hamed
- Humans In Humans Out: On GPT Converging Toward Common Sense In Both Success And Failure Koralus Philipp, Wang-maścianica Vincent
- Understanding Catastrophic Forgetting In Language Models Via Implicit Inference Kotha Suhas, Springer Jacob Mitchell, Raghunathan Aditi
- Large Language Models Only Pass Primary School Exams In Indonesia: A Comprehensive Test On Indommlu Koto Fajri, Aisyah Nurul, Li Haonan, Baldwin Timothy
- Little Giants: Exploring The Potential Of Small Llms As Evaluation Metrics In Summarization In The Eval4nlp 2023 Shared Task Kotonya Neema, Krishnasamy Saran, Tetreault Joel, Jaimes Alejandro
- Chatgpt For PLC/DCS Control Logic Generation Koziolek Heiko, Gruener Sten, Ashiwal Virendra
- Prompt Generate Train (PGT): Few-shot Domain Adaption Of Retrieval Augmented Generation Models For Open Book Question-answering Krishna C. S.
- Post Hoc Explanations Of Language Models Can Improve Language Models Krishna Satyapriya, Ma Jiaqi, Slack Dylan, Ghandeharioun Asma, Singh Sameer, Lakkaraju Himabindu
- In-context Explainers: Harnessing Llms For Explaining Black Box Models Kroeger Nicholas, Ley Dan, Krishna Satyapriya, Agarwal Chirag, Lakkaraju Himabindu
- Fundamentals Of Generative Large Language Models And Perspectives In Cyber-defense Kucharavy Andrei, Schillaci Zachary, Maréchal Loïc, Würsch Maxime, Dolamic Ljiljana, Sabonnadiere Remi, David Dimitri Percia, Mermoud Alain, Lenders Vincent
- Impact Of Guidance And Interaction Strategies For LLM Use On Learner Performance And Perception Kumar Harsh, Musabirov Ilya, Reza Mohi, Shi Jiakai, Wang Xinyuan, Williams Joseph Jay, Kuzminykh Anastasia, Liut Michael
- Large Language Models Humanize Technology Kumar Pratyush
- In Generative AI We Trust: Can Chatbots Effectively Verify Political Information? Kuznetsova Elizaveta, Makhortykh Mykola, Vziatysheva Victoria, Stolze Martha, Baghumyan Ani, Urman Aleksandra
- Jotr: A Joint Transformer And Reinforcement Learning Framework For Dialog Policy Learning Kwan Wai-chung, Wang Huimin, Wang Hongru, Wang Zezhong, Wu Xian, Zheng Yefeng, Wong Kam-fai
- M4LE: A Multi-ability Multi-range Multi-task Multi-domain Long-context Evaluation Benchmark For Large Language Models Kwan Wai-chung, Zeng Xingshan, Wang Yufei, Sun Yusen, Li Liangyou, Shang Lifeng, Liu Qun, Wong Kam-fai
- Reward Design With Language Models Kwon Minae, Xie Sang Michael, Bullard Kalesha, Sadigh Dorsa
- Chatgpt For Arabic Grammatical Error Correction Kwon Sang Yun, Bhatia Gagan, Nagoud El Moatez Billah, Abdul-mageed Muhammad
- Recallm: An Adaptable Memory Mechanism With Temporal Understanding For Large Language Models Kynoch Brandon, Latapie Hugo, Van Der Sluis Dwane
- Hallucination Augmented Recitations For Language Models Köksal Abdullatif, Aksitov Renat, Chang Chung-ching
- Openassistant Conversations -- Democratizing Large Language Model Alignment Köpf Andreas, Kilcher Yannic, Von Rütte Dimitri, Anagnostidis Sotiris, Tam Zhi-rui, Stevens Keith, Barhoum Abdullah, Duc Nguyen Minh, Stanley Oliver, Nagyfi Richárd, Es Shahul, Suri Sameer, Glushkov David, Dantuluri Arnav, Maguire Andrew, Schuhmann Christoph, Nguyen Huu, Mattick Alexander
- Are You Sure? Challenging Llms Leads To Performance Drops In The Flipflop Experiment Laban Philippe, Murakhovs'ka Lidiya, Xiong Caiming, Wu Chien-sheng
- A Zero-shot And Few-shot Study Of Instruction-finetuned Large Language Models Applied To Clinical And Biomedical Tasks Labrak Yanis, Rouvier Mickael, Dufour Richard
- Unraveling Chatgpt: A Critical Analysis Of Ai-generated Goal-oriented Dialogues And Annotations Labruna Tiziano, Brenna Sofia, Zaninello Andrea, Magnini Bernardo
- Psy-llm: Scaling Up Global Mental Health Psychological Services With Ai-based Large Language Models Lai Tin, Shi Yukun, Du Zicong, Wu Jiajie, Fu Ken, Dou Yichao, Wang Ziqi
- LISA: Reasoning Segmentation Via Large Language Model Lai Xin, Tian Zhuotao, Chen Yukang, Li Yanwei, Yuan Yuhui, Liu Shu, Jia Jiaya
- Mini-dalle3: Interactive Text To Image By Prompting Large Language Models Lai Zeqiang, Zhu Xizhou, Dai Jifeng, Qiao Yu, Wang Wenhai
- Exploring Large Language Models To Facilitate Variable Autonomy For Human-robot Teaming Lakhnati Younes, Pascher Max, Gerken Jens
- Large Language Models Are Partially Primed In Pronoun Interpretation Lam Suet-ying, Zeng Qingcheng, Zhang Kexun, You Chenyu, Voigt Rob
- Improving Zero-shot Visual Question Answering Via Large Language Models With Reasoning Question Prompts Lan Yunshi, Li Xiang, Liu Xin, Li Yang, Qin Wei, Qian Weining
- A Data Source For Reasoning Embodied Agents Lanchantin Jack, Sukhbaatar Sainbayar, Synnaeve Gabriel, Sun Yuxuan, Srinet Kavya, Szlam Arthur
- Creating Large Language Model Resistant Exams: Guidelines And Strategies Larsen Simon Kaare
- A Systematic Study And Comprehensive Evaluation Of Chatgpt On Benchmark Datasets Laskar Md Tahmid Rahman, Bari M Saiful, Rahman Mizanur, Bhuiyan Md Amran Hossen, Joty Shafiq, Huang Jimmy Xiangji
- VAL: Interactive Task Learning With GPT Dialog Parsing Lawley Lane, Maclellan Christopher J.
- Improved Instruction Ordering In Recipe-grounded Conversation Le Duong Minh, Guo Ruohao, Xu Wei, Ritter Alan
- Voicebox: Text-guided Multilingual Universal Speech Generation At Scale Le Matthew, Vyas Apoorv, Shi Bowen, Karrer Brian, Sari Leda, Moritz Rashel, Williamson Mary, Manohar Vimal, Adi Yossi, Mahadeokar Jay, Hsu Wei-ning
- Platypus: Quick, Cheap, And Powerful Refinement Of Llms Lee Ariel N., Hunter Cole J., Ruiz Nataniel
- Making Large Language Models Better Data Creators Lee Dong-ho, Pujara Jay, Sewak Mohit, White Ryen W., Jauhar Sujay Kumar
- Applying Large Language Models And Chain-of-thought For Automatic Scoring Lee Gyeong-geon, Latif Ehsan, Wu Xuansheng, Liu Ninghao, Zhai Xiaoming
- RLAIF Vs. RLHF: Scaling Reinforcement Learning From Human Feedback With AI Feedback Lee Harrison, Phatale Samrat, Mansoor Hassan, Mesnard Thomas, Ferret Johan, Lu Kellie, Bishop Colton, Hall Ethan, Carbune Victor, Rastogi Abhinav, Prakash Sushant
- Partially Randomizing Transformer Weights For Dialogue Response Diversity Lee Jing Yang, Lee Kong Aik, Gan Woon-seng
- Building Multimodal AI Chatbots Lee Min Young
- Can Large Language Models Capture Dissenting Human Voices? Lee Noah, An Na Min, Thorne James
- Volcano: Mitigating Multimodal Hallucination Through Self-feedback Guided Revision Lee Seongyun, Park Sue Hyun, Jo Yongrae, Seo Minjoon
- Explore, Select, Derive, And Recall: Augmenting LLM With Human-like Memory For Mobile Task Automation Lee Sunjae, Choi Junyoung, Lee Jungjae, Wasi Munim Hasan, Choi Hojun, Ko Steven Y., Oh Sangeun, Shin Insik
- Large Language Models Can Share Images, Too! Lee Young-jun, Lee Dokyong, Sung Joo Won, Hyeon Jonghwan, Choi Ho-jin
- Violation Of Expectation Via Metacognitive Prompting Reduces Theory Of Mind Prediction Error In Large Language Models Leer Courtland, Trost Vincent, Voruganti Vineeth
- Chain Of Natural Language Inference For Reducing Large Language Model Ungrounded Hallucinations Lei Deren, Li Yaxi, Hu Mengya, Wang Mingyu, Yun Vincent, Ching Emily, Kamal Eslam
- S3eval: A Synthetic, Scalable, Systematic Evaluation Suite For Large Language Models Lei Fangyu, Liu Qian, Huang Yiming, He Shizhu, Zhao Jun, Liu Kang
- Zero-shot Next-item Recommendation Using Large Pretrained Language Models Wang Lei, Lim Ee-peng
- Llm4vis: Explainable Visualization Recommendation Using Chatgpt Wang Lei, Zhang Songheng, Wang Yun, Lim Ee-peng, Wang Yong
- Recexplainer: Aligning Large Language Models For Explaining Recommendation Models Lei Yuxuan, Lian Jianxun, Yao Jing, Huang Xu, Lian Defu, Xie Xing
- The Language Of Prompting: What Linguistic Properties Make A Prompt Successful? Leidinger Alina, Van Rooij Robert, Shutova Ekaterina
- Chatgpt: A Meta-analysis After 2.5 Months Leiter Christoph, Zhang Ran, Chen Yanran, Belouadi Jonas, Larionov Daniil, Fresen Vivian, Eger Steffen
- A Sentence Is Worth A Thousand Pictures: Can Large Language Models Understand Hum4n L4ngu4ge And The W0rld Behind W0rds? Leivada Evelina, Marcus Gary, Günther Fritz, Murphy Elliot
- Lora Fine-tuning Efficiently Undoes Safety Training In Llama 2-chat 70B Lermen Simon, Rogers-smith Charlie, Ladish Jeffrey
- Talk The Walk: Synthetic Data Generation For Conversational Music Recommendation Leszczynski Megan, Zhang Shu, Ganti Ravi, Balog Krisztian, Radlinski Filip, Pereira Fernando, Chaganty Arun Tejasvi
- Sequential Monte Carlo Steering Of Large Language Models Using Probabilistic Programs Lew Alexander K., Tan Zhi-xuan, Grand Gabriel, Mansinghka Vikash K.
- Susceptibility To Influence Of Large Language Models Griffin Lewis D, Kleinberg Bennett, Mozes Maximilian, Mai Kimberly T, Vau Maria, Caldwell Matthew, Marvor-parker Augustine
- Deliberate Then Generate: Enhanced Prompting Framework For Text Generation Li Bei, Wang Rui, Guo Junliang, Song Kaitao, Tan Xu, Hassan Hany, Menezes Arul, Xiao Tong, Bian Jiang, Zhu Jingbo
- Seed-bench-2: Benchmarking Multimodal Large Language Models Li Bohao, Ge Yuying, Ge Yixiao, Wang Guangzhi, Wang Rui, Zhang Ruimao, Shan Ying
- This Land Is {your, My} Land: Evaluating Geopolitical Biases In Language Models Li Bryan, Haider Samar, Callison-burch Chris
- Making Large Language Models A Better Foundation For Dense Retrieval Li Chaofan, Liu Zheng, Xiao Shitao, Shao Yingxia
- Learning To Rewrite Prompts For Personalized Text Generation Li Cheng, Zhang Mingyang, Mei Qiaozhu, Kong Weize, Bendersky Michael
- Mixed Distillation Helps Smaller Language Model Better Reasoning Li Chenglin, Chen Qianglong, Li Liangyue, Wang Caiyu, Li Yicheng, Chen Zulong, Zhang Yin
- Dialogue For Prompting: A Policy-gradient-based Discrete Prompt Generation For Few-shot Learning Li Chengzhengxu, Liu Xiaoming, Wang Yichen, Li Duyi, Lan Yu, Shen Chao
- Modelscope-agent: Building Your Customizable Agent System With Open-source Large Language Models Li Chenliang, Chen Hehong, Yan Ming, Shen Weizhou, Xu Haiyang, Wu Zhikai, Zhang Zhicheng, Zhou Wenmeng, Chen Yingda, Cheng Chen, Shi Hongzhu, Zhang Ji, Huang Fei, Zhou Jingren
- DISTFLASHATTN: Distributed Memory-efficient Attention For Long-context Llms Training Li Dacheng, Shao Rulin, Xie Anze, Xing Eric P., Ma Xuezhe, Stoica Ion, Gonzalez Joseph E., Zhang Hao
- Revisiting Large Language Models As Zero-shot Relation Extractors Li Guozheng, Wang Peng, Ke Wenjun
- Repetition In Repetition Out: Towards Understanding Neural Text Degeneration From The Data Perspective Li Huayang, Lan Tian, Fu Zihao, Cai Deng, Liu Lemao, Collier Nigel, Watanabe Taro, Su Yixuan
- Textbind: Multi-turn Interleaved Multimodal Instruction-following In The Wild Li Huayang, Li Siheng, Cai Deng, Wang Longyue, Liu Lemao, Watanabe Taro, Yang Yujiu, Shi Shuming
- Do Vision And Language Models Share Concepts? A Vector Space Alignment Study Li Jiaang, Kementchedjhieva Yova, Fierro Constanza, Søgaard Anders
- Mastering Robot Manipulation With Multimodal Prompts Through Pretraining And Multi-task Fine-tuning Li Jiachen, Gao Qiaozi, Johnston Michael, Gao Xiaofeng, He Xuehai, Shakiah Suhaila, Shi Hangjie, Ghanadan Reza, Wang William Yang
- CFGPT: Chinese Financial Assistant With Large Language Model Li Jiangtong, Bian Yuxuan, Wang Guoxuan, Lei Yang, Cheng Dawei, Ding Zhijun, Jiang Changjun
- Evcap: Retrieval-augmented Image Captioning With External Visual-name Memory For Open-world Comprehension Li Jiaxuan, Vo Duc Minh, Sugimoto Akihiro, Nakayama Hideki
- Counterfactual Reasoning: Testing Language Models' Understanding Of Hypothetical Scenarios Li Jiaxuan, Yu Lang, Ettinger Allyson
- Fine-tuning Multimodal Llms To Follow Zero-shot Demonstrative Instructions Li Juncheng, Pan Kaihang, Ge Zhiqi, Gao Minghe, Ji Wei, Zhang Wenqiao, Chua Tat-seng, Tang Siliang, Zhang Hanwang, Zhuang Yueting
- Generative Judge For Evaluating Alignment Li Junlong, Sun Shichao, Yuan Weizhe, Fan Run-ze, Zhao Hai, Liu Pengfei
- The Web Can Be Your Oyster For Improving Large Language Models Li Junyi, Tang Tianyi, Zhao Wayne Xin, Wang Jingyuan, Nie Jian-yun, Wen Ji-rong
- Inference-time Intervention: Eliciting Truthful Answers From A Language Model Li Kenneth, Patel Oam, Viégas Fernanda, Pfister Hanspeter, Wattenberg Martin
- Mvbench: A Comprehensive Multi-modal Video Understanding Benchmark Li Kunchang, Wang Yali, He Yinan, Li Yizhuo, Wang Yi, Liu Yi, Wang Zun, Xu Jilan, Chen Guo, Luo Ping, Wang Limin, Qiao Yu
- Tool-augmented Reward Modeling Li Lei, Chai Yekun, Wang Shuohuan, Sun Yu, Tian Hao, Zhang Ningyu, Wu Hua
- M3dbench: Let's Instruct Large Models With Multi-modal 3D Prompts Li Mingsheng, Chen Xin, Zhang Chi, Chen Sijin, Zhu Hongyuan, Yin Fukun, Yu Gang, Chen Tao
- Collaborative Evaluation: Exploring The Synergy Of Large Language Models And Humans For Open-ended Generation Evaluation Li Qintong, Cui Leyang, Kong Lingpeng, Bi Wei
- Starcoder: May The Source Be With You! Li Raymond, Allal Loubna Ben, Zi Yangtian, Muennighoff Niklas, Kocetkov Denis, Mou Chenghao, Marone Marc, Akiki Christopher, Li Jia, Chim Jenny, Liu Qian, Zheltonozhskii Evgenii, Zhuo Terry Yue, Wang Thomas, Dehaene Olivier, Davaadorj Mishig, Lamy-poirier Joel, Monteiro João, Shliazhko Oleh, Gontier Nicolas, Meade Nicholas, Zebaze Armel, Yee Ming-ho, Umapathi Logesh Kumar, Zhu Jian, Lipkin Benjamin, Oblokulov Muhtasham, Wang Zhiruo, Murthy Rudra, Stillerman Jason, Patel Siva Sankalp, Abulkhanov Dmitry, Zocca Marco, Dey Manan, Zhang Zhihan, Fahmy Nour, Bhattacharyya Urvashi, Yu Wenhao, Singh Swayam, Luccioni Sasha, Villegas Paulo, Kunakov Maxim, Zhdanov Fedor, Romero Manuel, Lee Tony, Timor Nadav, Ding Jennifer, Schlesinger Claire, Schoelkopf Hailey, Ebert Jan, Dao Tri, Mishra Mayank, Gu Alex, Robinson Jennifer, Anderson Carolyn Jane, Dolan-gavitt Brendan, Contractor Danish, Reddy Siva, Fried Daniel, Bahdanau Dzmitry, Jernite Yacine, Ferrandis Carlos Muñoz, Hughes Sean, Wolf Thomas, Guha Arjun, Von Werra Leandro, De Vries Harm
- Context Matters: Data-efficient Augmentation Of Large Language Models For Scientific Applications Li Xiang, Tang Haoran, Chen Siyu, Wang Ziwei, Maravi Anurag, Abram Marcin
- Image Content Generation With Causal Reasoning Li Xiaochuan, Fan Baoyu, Zhang Runze, Jin Liang, Wang Di, Guo Zhenhua, Zhao Yaqian, Li Rengang
- Llatrieval: Llm-verified Retrieval For Verifiable Generation Li Xiaonan, Zhu Changtai, Li Linyang, Yin Zhangyue, Sun Tianxiang, Qiu Xipeng
- Exploring Fine-tuning Chatgpt For News Recommendation Li Xinyi, Zhang Yongfeng, Malthouse Edward C
- Dissecting Chain-of-thought: Compositionality Through In-context Filtering And Learning Li Yingcong, Sreenivasan Kartik, Giannou Angeliki, Papailiopoulos Dimitris, Oymak Samet
- A Survey On Fairness In Large Language Models Li Yingji, Du Mengnan, Song Rui, Wang Xin, Wang Ying
- Prompting Large Language Models For Counterfactual Generation: An Empirical Study Li Yongqi, Xu Mayi, Miao Xin, Zhou Shen, Qian Tieyun
- Posgen: Personalized Opening Sentence Generation For Online Insurance Sales Li Yu, Zhang Yi, Wu Weijia, Zhou Zimu, Li Qiang
- RAIN: Your Language Models Can Align Themselves Without Finetuning Li Yuhui, Wei Fangyun, Zhao Jinjing, Zhang Chao, Zhang Hongyang
- Towards Vision Enhancing Llms: Empowering Multimodal Knowledge Storage And Sharing In Llms Li Yunxin, Hu Baotian, Wang Wei, Cao Xiaochun, Zhang Min
- Guiding Large Language Models Via Directional Stimulus Prompting Li Zekun, Peng Baolin, He Pengcheng, Galley Michel, Gao Jianfeng, Yan Xifeng
- Flexkbqa: A Flexible Llm-powered Framework For Few-shot Knowledge Base Question Answering Li Zhenyu, Fan Sunqi, Gu Yu, Li Xiuxing, Duan Zhichao, Dong Bowen, Liu Ning, Wang Jianyong
- Unveiling The Pitfalls Of Knowledge Editing For Large Language Models Li Zhoubo, Zhang Ningyu, Yao Yunzhi, Wang Mengru, Chen Xi, Chen Huajun
- Synthetic Data Generation With Large Language Models For Text Classification: Potential And Limitations Li Zhuoyan, Zhu Hangxiao, Lu Zhuoran, Yin Ming
- Label Supervised Llama Finetuning Li Zongxi, Li Xianming, Liu Yuzhang, Xie Haoran, Li Jing, Wang Fu-lee, Li Qing, Zhong Xiaoqin
- Batgpt: A Bidirectional Autoregressive Talker From Generative Pre-trained Transformer Li Zuchao, Zhang Shitou, Zhao Hai, Yang Yifei, Yang Dongjie
- Llm-grounded Diffusion: Enhancing Prompt Understanding Of Text-to-image Diffusion Models With Large Language Models Lian Long, Li Boyi, Yala Adam, Darrell Trevor
- Beyond Factuality: A Comprehensive Evaluation Of Large Language Models As Knowledge Generators Chen Liang, Deng Yang, Bian Yatao, Qin Zeyu, Wu Bingzhe, Chua Tat-seng, Wong Kam-fai
- Towards End-to-end Embodied Decision Making Via Multi-modal Large Language Model: Explorations With Gpt4-vision And Beyond Chen Liang, Zhang Yichi, Ren Shuhuai, Zhao Haozhe, Cai Zefan, Wang Yuchi, Wang Peiyi, Liu Tianyu, Chang Baobao
- Encouraging Divergent Thinking In Large Language Models Through Multi-agent Debate Liang Tian, He Zhiwei, Jiao Wenxiang, Wang Xing, Wang Rui, Yang Yujiu, Tu Zhaopeng, Shi Shuming
- GPT Detectors Are Biased Against Non-native English Writers Liang Weixin, Yuksekgonul Mert, Mao Yining, Wu Eric, Zou James
- Uhgeval: Benchmarking The Hallucination Of Chinese Large Language Models Via Unconstrained Generation Liang Xun, Song Shichao, Niu Simin, Li Zhiyu, Xiong Feiyu, Tang Bo, Wang Yezhaohui, He Dawei, Cheng Peng, Wang Zhonghao, Deng Haiying
- Prompting Large Language Models With Chain-of-thought For Few-shot Knowledge Base Question Generation Liang Yuanyuan, Wang Jianing, Zhu Hanlun, Wang Lei, Qian Weining, Lan Yunshi
- GPT-4 Enhanced Multimodal Grounding For Autonomous Driving: Leveraging Cross-modal Attention With Large Language Models Liao Haicheng, Shen Huanming, Li Zhenning, Wang Chengyue, Li Guofa, Bie Yiming, Xu Chengzhong
- AI Transparency In The Age Of Llms: A Human-centered Research Roadmap Liao Q. Vera, Vaughan Jennifer Wortman
- Opening Up Chatgpt: Tracking Openness, Transparency, And Accountability In Instruction-tuned Text Generators Liesenfeld Andreas, Lopez Alianda, Dingemanse Mark
- STEVE-1: A Generative Model For Text-to-behavior In Minecraft Lifshitz Shalev, Paster Keiran, Chan Harris, Ba Jimmy, Mcilraith Sheila
- The Unlocking Spell On Base Llms: Rethinking Alignment Via In-context Learning Lin Bill Yuchen, Ravichander Abhilasha, Lu Ximing, Dziri Nouha, Sclar Melanie, Chandu Khyathi, Bhagavatula Chandra, Choi Yejin
- Sparks Of Artificial General Recommender (AGR): Early Experiments With Chatgpt Lin Guo, Zhang Yongfeng
- Learning To Model The World With Language Lin Jessy, Du Yuqing, Watkins Olivia, Hafner Danijar, Abbeel Pieter, Klein Dan, Dragan Anca
- VILA: On Pre-training For Visual Language Models Lin Ji, Yin Hongxu, Ping Wei, Lu Yao, Molchanov Pavlo, Tao Andrew, Mao Huizi, Kautz Jan, Shoeybi Mohammad, Han Song
- Agentsims: An Open-source Sandbox For Large Language Model Evaluation Lin Jiaju, Zhao Haoran, Zhang Aochi, Wu Yiting, Ping Huqiuyue, Chen Qin
- Rella: Retrieval-enhanced Large Language Models For Lifelong Sequential Behavior Comprehension In Recommendation Lin Jianghao, Shan Rong, Zhu Chenxu, Du Kounianhua, Chen Bo, Quan Shigang, Tang Ruiming, Yu Yong, Zhang Weinan
- Batchprompt: Accomplish More With Less Lin Jianzhe, Diesendruck Maurice, Du Liang, Abraham Robin
- Argue With Me Tersely: Towards Sentence-level Counter-argument Generation Lin Jiayu, Ye Rong, Han Meng, Zhang Qi, Lai Ruofei, Zhang Xinyu, Cao Zhao, Huang Xuanjing, Wei Zhongyu
- Advances In Embodied Navigation Using Large Language Models: A Survey Lin Jinzhou, Gao Han, Feng Xuxiang, Xu Rongtao, Wang Changwei, Zhang Man, Guo Li, Xu Shibiao
- Few-shot Adaptation For Parsing Contextual Utterances With Llms Lin Kevin, Xia Patrick, Fang Hao
- Taiwan LLM: Bridging The Linguistic Divide With A Culturally Aligned Language Model Lin Yen-ting, Chen Yun-nung
- Mitigating The Alignment Tax Of RLHF Lin Yong, Lin Hangyu, Xiong Wei, Diao Shizhe, Liu Jianmeng, Zhang Jipeng, Pan Rui, Wang Haoxiang, Hu Wenbin, Zhang Hanning, Dong Hanze, Pi Renjie, Zhao Han, Jiang Nan, Ji Heng, Yao Yuan, Zhang Tong
- Pushing Large Language Models To The 6G Edge: Vision, Challenges, And Opportunities Lin Zheng, Qu Guanqiao, Chen Qiyuan, Chen Xianhao, Chen Zhe, Huang Kaibin
- SPHINX: The Joint Mixing Of Weights, Tasks, And Visual Embeddings For Multi-modal Large Language Models Lin Ziyi, Liu Chris, Zhang Renrui, Gao Peng, Qiu Longtian, Xiao Han, Qiu Han, Lin Chen, Shao Wenqi, Chen Keqin, Han Jiaming, Huang Siyuan, Zhang Yichi, He Xuming, Li Hongsheng, Qiao Yu
- Comparing Generative Chatbots Based On Process Requirements Lins Luis Fernando, Nascimento Nathalia, Alencar Paulo, Oliveira Toacy, Cowan Donald
- Establishing Trustworthiness: Rethinking Tasks And Model Evaluation Litschko Robert, Müller-eberstein Max, Van Der Goot Rob, Weber Leon, Plank Barbara
- External Reasoning: Towards Multi-large-language-models Interchangeable Assistance With Human Feedback Liu Akide
- Exposing Attention Glitches With Flip-flop Language Modeling Liu Bingbin, Ash Jordan T., Goel Surbhi, Krishnamurthy Akshay, Zhang Cyril
- Recprompt: A Prompt Tuning Framework For News Recommendation Using Large Language Models Liu Dairui, Yang Boming, Du Honghui, Greene Derek, Lawlor Aonghus, Dong Ruihai, Li Irene
- MMC: Advancing Multimodal Chart Understanding With Large-scale Instruction Tuning Liu Fuxiao, Wang Xiaoyang, Yao Wenlin, Chen Jianshu, Song Kaiqiang, Cho Sangwoo, Yacoob Yaser, Yu Dong
- Exploration With Principles For Diverse AI Supervision Liu Hao, Zaharia Matei, Abbeel Pieter
- Ring Attention With Blockwise Transformers For Near-infinite Context Liu Hao, Zaharia Matei, Abbeel Pieter
- Retrieval-based Knowledge Transfer: An Effective Approach For Extreme Large Language Model Compression Liu Jiduan, Liu Jiahao, Wang Qifan, Wang Jingang, Cai Xunliang, Zhao Dongyan, Wang Ran Lucien, Yan Rui
- RETA-LLM: A Retrieval-augmented Large Language Model Toolkit Liu Jiongnan, Jin Jiajie, Wang Zihan, Cheng Jiehan, Dou Zhicheng, Wen Ji-rong
- Chatcounselor: A Large Language Models For Mental Health Support Liu June M., Li Donghao, Cao He, Ren Tianhe, Liao Zeyi, Wu Jiamin
- Think-in-memory: Recalling And Post-thinking Enable Llms With Long-term Memory Liu Lei, Yang Xiaoyan, Shen Yue, Hu Binbin, Zhang Zhiqiang, Gu Jinjie, Zhang Guannan
- Conversational Question Answering With Reformulations Over Knowledge Graph Liu Lihui, Hill Blaine, Du Boxin, Wang Fei, Tong Hanghang
- An Evaluation Of GPT-4V And Gemini In Online VQA Liu Mengchen, Chen Chongyan, Gurari Danna
- X-eval: Generalizable Multi-aspect Text Evaluation Via Augmented Instruction Tuning With Auxiliary Evaluation Aspects Liu Minqian, Shen Ying, Xu Zhiyang, Cao Yixin, Cho Eunah, Kumar Vaibhav, Ghanadan Reza, Huang Lifu
- Nlebench+norglm: A Comprehensive Empirical Analysis And Benchmark Dataset For Generative Language Models In Norwegian Liu Peng, Zhang Lemei, Farup Terje Nissen, Lauvrak Even W., Ingvaldsen Jon Espen, Eide Simen, Gulla Jon Atle, Yang Zhirong
- From Zero To Hero: Examining The Power Of Symbolic Tasks In Instruction Tuning Liu Qian, Zhou Fan, Jiang Zhengbao, Dou Longxu, Lin Min
- Rethinking Tabular Data Understanding With Large Language Models Liu Tianyang, Wang Fei, Chen Muhao
- MMHQA-ICL: Multimodal In-context Learning For Hybrid Question Answering Over Text, Tables And Images Liu Weihao, Lei Fangyu, Luo Tongxu, Lei Jiahe, He Shizhu, Zhao Jun, Liu Kang
- Aligning Large Language Models With Human Preferences Through Representation Engineering Liu Wenhao, Wang Xiaohua, Wu Muling, Li Tianlong, Lv Changze, Ling Zixuan, Zhu Jianhao, Zhang Cenyuan, Zheng Xiaoqing, Huang Xuanjing
- Alignbench: Benchmarking Chinese Alignment Of Large Language Models Liu Xiao, Lei Xuanyu, Wang Shengyuan, Huang Yue, Feng Zhuoer, Wen Bosi, Cheng Jiale, Ke Pei, Xu Yifan, Tam Weng Lam, Zhang Xiaohan, Sun Lichao, Gu Xiaotao, Wang Hongning, Zhang Jing, Huang Minlie, Dong Yuxiao, Tang Jie
- Beyond Text: Unveiling Multimodal Proficiency Of Large Language Models With Multiapi Benchmark Liu Xiao, Lin Jianfeng, Zhang Jiawei
- Fingpt: Democratizing Internet-scale Data For Financial Large Language Models Liu Xiao-yang, Wang Guoxuan, Yang Hongyang, Zha Daochen
- Online Speculative Decoding Liu Xiaoxuan, Hu Lanxiang, Bailis Peter, Cheung Alvin, Deng Zhijie, Stoica Ion, Zhang Hao
- BOLT: Fast Energy-based Controlled Text Generation With Tunable Biases Liu Xin, Khalifa Muhammad, Wang Lu
- Wavjourney: Compositional Audio Creation With Large Language Models Liu Xubo, Zhu Zhongkai, Liu Haohe, Yuan Yi, Cui Meng, Huang Qiushi, Liang Jinhua, Cao Yin, Kong Qiuqiang, Plumbley Mark D., Wang Wenwu
- Parallel Sentence-level Explanation Generation For Real-world Low-resource Scenarios Liu Yan, Chen Xiaokang, Dai Qi
- Trustworthy Llms: A Survey And Guideline For Evaluating Large Language Models' Alignment Liu Yang, Yao Yuanshun, Ton Jean-francois, Zhang Xiaoying, Guo Ruocheng, Cheng Hao, Klochkov Yegor, Taufiq Muhammad Faaiz, Li Hang
- Summary Of Chatgpt-related Research And Perspective Towards The Future Of Large Language Models Liu Yiheng, Han Tianle, Ma Siyuan, Zhang Jiayue, Yang Yuanyuan, Tian Jiaming, He Hao, Li Antong, He Mengshen, Liu Zhengliang, Wu Zihao, Zhao Lin, Zhu Dajiang, Li Xiang, Qiang Ning, Shen Dingang, Liu Tianming, Ge Bao
- Llms As Narcissistic Evaluators: When Ego Inflates Evaluation Scores Liu Yiqi, Moosavi Nafise Sadat, Lin Chenghua
- On Learning To Summarize With Large Language Models As References Liu Yixin, Shi Kejian, He Katherine S, Ye Longtian, Fabbri Alexander R., Liu Pengfei, Radev Dragomir, Cohan Arman
- Robustness Over Time: Understanding Adversarial Examples' Effectiveness On Longitudinal Versions Of Large Language Models Liu Yugeng, Cong Tianshuo, Zhao Zhengyu, Backes Michael, Shen Yun, Zhang Yang
- LLM360: Towards Fully Transparent Open-source Llms Liu Zhengzhong, Qiao Aurick, Neiswanger Willie, Wang Hongyi, Tan Bowen, Tao Tianhua, Li Junbo, Wang Yuqi, Sun Suqi, Pangarkar Omkar, Fan Richard, Gu Yi, Miller Victor, Zhuang Yonghao, He Guowei, Li Haonan, Koto Fajri, Tang Liping, Ranjan Nikhil, Shen Zhiqiang, Ren Xuguang, Iriondo Roberto, Mu Cun, Hu Zhiting, Schulze Mark, Nakov Preslav, Baldwin Tim, Xing Eric P.
- Reason For Future, Act For Now: A Principled Framework For Autonomous LLM Agents With Provable Sample Efficiency Liu Zhihan, Hu Hao, Zhang Shenao, Guo Hongyi, Ke Shuqi, Liu Boyi, Wang Zhaoran
- Self-contradictory Reasoning Evaluation And Detection Liu Ziyi, Lee Isabelle, Du Yongkang, Sanyal Soumya, Zhao Jieyu
- SAIE Framework: Support Alone Isn't Enough -- Advancing LLM Training With Adversarial Remarks Loem Mengsay, Kaneko Masahiro, Okazaki Naoaki
- Exploring Effectiveness Of GPT-3 In Grammatical Error Correction: A Study On Performance And Controllability In Prompt-based Methods Loem Mengsay, Kaneko Masahiro, Takase Sho, Okazaki Naoaki
- Harnessing Large Language Models' Empathetic Response Generation Capabilities For Online Mental Health Counselling Support Loh Siyuan Brandon, Raamkumar Aravind Sesagiri
- MUFFIN: Curating Multi-faceted Instructions For Improving Instruction-following Lou Renze, Zhang Kai, Xie Jian, Sun Yuxuan, Ahn Janice, Xu Hanzi, Su Yu, Yin Wenpeng
- Probing Pretrained Language Models With Hierarchy Properties Lovón-melgarejo Jesús, Moreno Jose G., Besançon Romaric, Ferret Olivier, Tamine Lynda
- Exploring The Sensitivity Of Llms' Decision-making Capabilities: Insights From Prompt Variation And Hyperparameters Loya Manikanta, Sinha Divya Anand, Futrell Richard
- Chatgpt V Bard V Bing V Claude 2 V Aria V Human-expert. How Good Are AI Chatbots At Scientific Writing? Lozić Edisa, Štular Benjamin
- Chain-of-dictionary Prompting Elicits Translation In Large Language Models Lu Hongyuan, Yang Haoran, Huang Haoyang, Zhang Dongdong, Lam Wai, Wei Furu
- Ziya-visual: Bilingual Large Vision-language Model Via Multi-task Instruction Tuning Lu Junyu, Zhang Dixiang, Wu Xiaojun, Gao Xinyu, Gan Ruyi, Zhang Jiaxing, Song Yan, Zhang Pingjian
- Routing To The Expert: Efficient Reward-guided Ensemble Of Large Language Models Lu Keming, Yuan Hongyi, Lin Runji, Lin Junyang, Yuan Zheng, Zhou Chang, Zhou Jingren
- Large Language Models Can Be Guided To Evade Ai-generated Text Detection Lu Ning, Liu Shengcai, He Rui, Wang Qi, Ong Yew-soon, Tang Ke
- Mathvista: Evaluating Mathematical Reasoning Of Foundation Models In Visual Contexts Lu Pan, Bansal Hritik, Xia Tony, Liu Jiacheng, Li Chunyuan, Hajishirzi Hannaneh, Cheng Hao, Chang Kai-wei, Galley Michel, Gao Jianfeng
- Error Analysis Prompting Enables Human-like Translation Evaluation In Large Language Models Lu Qingyu, Qiu Baopu, Ding Liang, Zhang Kanjian, Kocmi Tom, Tao Dacheng
- Safer Conversational AI As A Source Of User Delight Lu Xiaoding, Korshuk Aleksey, Liu Zongyi, Beauchamp William, Chai Research
- Inference-time Policy Adapters (IPA): Tailoring Extreme-scale Lms Without Fine-tuning Lu Ximing, Brahman Faeze, West Peter, Jang Jaehun, Chandu Khyathi, Ravichander Abhilasha, Qin Lianhui, Ammanabrolu Prithviraj, Jiang Liwei, Ramnath Sahana, Dziri Nouha, Fisher Jillian, Lin Bill Yuchen, Hallinan Skyler, Ren Xiang, Welleck Sean, Choi Yejin
- UI Layout Generation With Llms Guided By UI Grammar Lu Yuwen, Tong Ziang, Zhao Qinyi, Zhang Chengzhi, Li Toby Jia-jun
- Explanation-based Finetuning Makes Models More Robust To Spurious Cues Ludan Josh Magnus, Meng Yixuan, Nguyen Tai, Shah Saurabh, Lyu Qing, Apidianaki Marianna, Callison-burch Chris
- Chatgpt And A New Academic Reality: Artificial Intelligence-written Research Papers And The Ethics Of The Large Language Models In Scholarly Publishing Lund Brady, Wang Ting, Mannuru Nishith Reddy, Nie Bing, Shimray Somipam, Wang Ziang
- Wizardmath: Empowering Mathematical Reasoning For Large Language Models Via Reinforced Evol-instruct Luo Haipeng, Sun Qingfeng, Xu Can, Zhao Pu, Lou Jianguang, Tao Chongyang, Geng Xiubo, Lin Qingwei, Chen Shifeng, Zhang Dongmei
- Critique Ability Of Large Language Models Luo Liangchen, Lin Zi, Liu Yinxiao, Shu Lei, Zhu Yun, Shang Jingbo, Meng Lei
- Reasoning On Graphs: Faithful And Interpretable Large Language Model Reasoning Luo Linhao, Li Yuan-fang, Haffari Gholamreza, Pan Shirui
- Systematic Assessment Of Factual Knowledge In Large Language Models Luo Linhao, Vu Thuy-trang, Phung Dinh, Haffari Gholamreza
- Dr.icl: Demonstration-retrieved In-context Learning Luo Man, Xu Xin, Dai Zhuyun, Pasupat Panupong, Kazemi Mehran, Baral Chitta, Imbrasaite Vaiva, Zhao Vincent Y
- Recranker: Instruction Tuning Large Language Model As Ranker For Top-k Recommendation Luo Sichun, He Bowei, Zhao Haohan, Shao Wei, Qi Yanlin, Huang Yinya, Zhou Aojun, Yao Yuxuan, Li Zongpeng, Xiao Yuanzhang, Zhan Mingjie, Song Linqi
- Hrot: Hybrid Prompt Strategy And Retrieval Of Thought For Table-text Hybrid Question Answering Luo Tongxu, Lei Fangyu, Lei Jiahe, Liu Weihao, He Shihu, Zhao Jun, Liu Kang
- YAYI 2: Multilingual Open-source Large Language Models Luo Yin, Kong Qingchao, Xu Nan, Cao Jia, Hao Bao, Qu Baoyu, Chen Bo, Zhu Chao, Zhao Chenyang, Zhang Donglei, Feng Fan, Zhao Feifei, Sun Hailong, Yang Hanxuan, Pan Haojun, Liu Hongyu, Guo Jianbin, Du Jiangtao, Wang Jingyi, Li Junfeng, Sun Lei, Liu Liduo, Dong Lifeng, Liu Lili, Wang Lin, Zhang Liwen, Wang Minzheng, Wang Pin, Yu Ping, Li Qingxiao, Yan Rui, Zou Rui, Li Ruiqun, Huang Taiwen, Wang Xiaodong, Wu Xiaofei, Peng Xin, Zhang Xina, Fang Xing, Xiao Xinglin, Hao Yanni, Dong Yao, Wang Yigang, Liu Ying, Jiang Yongyu, Wang Yungan, Wang Yuqi, Wang Zhangsheng, Yu Zhaoxin, Luo Zhen, Mao Wenji, Wang Lei, Zeng Dajun
- Biomedgpt: Open Multimodal Generative Pre-trained Transformer For Biomedicine Luo Yizhen, Zhang Jiahuan, Fan Siqi, Yang Kai, Wu Yushuai, Qiao Mu, Nie Zaiqing
- Chatgpt As A Factual Inconsistency Evaluator For Text Summarization Luo Zheheng, Xie Qianqian, Ananiadou Sophia
- Fingpt: Large Generative Models For A Small Language Luukkonen Risto, Komulainen Ville, Luoma Jouni, Eskelinen Anni, Kanerva Jenna, Kupari Hanna-mari, Ginter Filip, Laippala Veronika, Muennighoff Niklas, Piktus Aleksandra, Wang Thomas, Tazi Nouamane, Scao Teven Le, Wolf Thomas, Suominen Osma, Sairanen Samuli, Merioksa Mikko, Heinonen Jyrki, Vahtola Aija, Antao Samuel, Pyysalo Sampo
- Are We Falling In A Middle-intelligence Trap? An Analysis And Mitigation Of The Reversal Curse Lv Ang, Zhang Kaiyi, Xie Shufang, Tu Quan, Chen Yuhan, Wen Ji-rong, Yan Rui
- KOSMOS-2.5: A Multimodal Literate Model Lv Tengchao, Huang Yupan, Chen Jingye, Zhao Yuzhong, Jia Yilin, Cui Lei, Ma Shuming, Chang Yaoyao, Huang Shaohan, Wang Wenhui, Dong Li, Luo Weiyao, Wu Shaoxiang, Wang Guoxin, Zhang Cha, Wei Furu
- Comparative Analysis Of Drug-gpt And Chatgpt Llms For Healthcare Insights: Evaluating Accuracy And Relevance In Patient And HCP Contexts Lysandrou Giorgos, Owen Roma English, Mursec Kirsty, Brun Grant Le, Fairley Elizabeth A. L.
- A Paradigm Shift: The Future Of Machine Translation Lies With Large Language Models Lyu Chenyang, Du Zefeng, Xu Jitao, Duan Yitao, Wu Minghao, Lynn Teresa, Aji Alham Fikri, Wong Derek F., Liu Siyou, Wang Longyue
- Macaw-llm: Multi-modal Language Modeling With Image, Audio, Video, And Text Integration Lyu Chenyang, Wu Minghao, Wang Longyue, Huang Xinting, Liu Bingshuai, Du Zefeng, Shi Shuming, Tu Zhaopeng
- Llm-rec: Personalized Recommendation Via Prompting Large Language Models Lyu Hanjia, Jiang Song, Zeng Hanqing, Xia Yinglong, Wang Qifan, Zhang Si, Chen Ren, Leung Christopher, Tang Jiajie, Luo Jiebo
- Chain-of-skills: A Configurable Model For Open-domain Question Answering Ma Kaixin, Cheng Hao, Zhang Yu, Liu Xiaodong, Nyberg Eric, Gao Jianfeng
- POS: A Prompts Optimization Suite For Augmenting Text-to-video Generation Ma Shijie, Xu Huayi, Li Mengjian, Geng Weidong, Wang Yaxiong, Wang Meng
- Beyond Chatbots: Explorellm For Structured Thoughts And Personalized Model Responses Ma Xiao, Mishra Swaroop, Liu Ariel, Su Sophie, Chen Jilin, Kulkarni Chinmay, Cheng Heng-tze, Le Quoc, Chi Ed
- Tomchallenges: A Principle-guided Dataset And Diverse Evaluation Tasks For Exploring Theory Of Mind Ma Xiaomeng, Gao Lingyu, Xu Qihui
- Query Rewriting For Retrieval-augmented Large Language Models Ma Xinbei, Gong Yeyun, He Pengcheng, Zhao Hai, Duan Nan
- At Which Training Stage Does Code Data Help Llms Reasoning? Ma Yingwei, Liu Yue, Yu Yue, Zhang Yuanliang, Jiang Yu, Wang Changjian, Li Shanshan
- VOLTA: Improving Generative Diversity By Variational Mutual Information Maximizing Autoencoder Ma Yueen, Chi Dafeng, Li Jingjing, Song Kai, Zhuang Yuzheng, King Irwin
- Brain In A Vat: On Missing Pieces Towards Artificial General Intelligence In Large Language Models Ma Yuxi, Zhang Chi, Zhu Song-chun
- World-to-words: Grounded Open Vocabulary Acquisition Through Fast Mapping In Vision-language Models Ma Ziqiao, Pan Jiayi, Chai Joyce
- Opportunities And Challenges In Neural Dialog Tutoring Macina Jakub, Daheim Nico, Wang Lingzhi, Sinha Tanmay, Kapur Manu, Gurevych Iryna, Sachan Mrinmaya
- Self-refine: Iterative Refinement With Self-feedback Madaan Aman, Tandon Niket, Gupta Prakhar, Hallinan Skyler, Gao Luyu, Wiegreffe Sarah, Alon Uri, Dziri Nouha, Prabhumoye Shrimai, Yang Yiming, Gupta Shashank, Majumder Bodhisattwa Prasad, Hermann Katherine, Welleck Sean, Yazdanbakhsh Amir, Clark Peter
- Exploring Prompting Large Language Models As Explainable Metrics Mahmoudi Ghazaleh
- Enhancing Conversational Quality In Language Learning Chatbots: An Evaluation Of GPT4 For ASR Error Correction Mai Long, Carson-berndsen Julie
- How Can Large Language Models Help Humans In Design And Manufacturing? Makatura Liane, Foshey Michael, Wang Bohan, Hähnlein Felix, Ma Pingchuan, Deng Bolei, Tjandrasuwita Megan, Spielberg Andrew, Owens Crystal Elaine, Chen Peter Yichen, Zhao Allan, Zhu Amy, Norton Wil J, Gu Edward, Jacob Joshua, Li Yifei, Schulz Adriana, Matusik Wojciech
- Fine-tuning Language Models With Just Forward Passes Malladi Sadhika, Gao Tianyu, Nichani Eshaan, Damian Alex, Lee Jason D., Chen Danqi, Arora Sanjeev
- Adapting Pre-trained Generative Models For Extractive Question Answering Mallick Prabir, Nayak Tapas, Bhattacharya Indrajit
- ICL Markup: Structuring In-context Learning Using Soft-token Tags Marc-etienne Brunet, Ashton Anderson, Richard Zemel
- Interpreting Learned Feedback Patterns In Large Language Models Marks Luke, Abdullah Amir, Neo Clement, Arike Rauno, Krueger David, Torr Philip, Barez Fazl
- The Geometry Of Truth: Emergent Linear Structure In Large Language Model Representations Of True/false Datasets Marks Samuel, Tegmark Max
- Establishing Vocabulary Tests As A Benchmark For Evaluating Large Language Models Martínez Gonzalo, Conde Javier, Merino-gómez Elena, Bermúdez-margaretto Beatriz, Hernández José Alberto, Reviriego Pedro, Brysbaert Marc
- Benchmarking Large Language Model Capabilities For Conditional Generation Maynez Joshua, Agrawal Priyanka, Gehrmann Sebastian
- Improving Automatic VQA Evaluation Using Large Language Models Mañas Oscar, Krojer Benno, Agrawal Aishwarya
- Is Feedback All You Need? Leveraging Natural Language Feedback In Goal-conditioned Reinforcement Learning Mccallum Sabrina, Taylor-davies Max, Albrecht Stefano V., Suglia Alessandro
- Lightlm: A Lightweight Deep And Narrow Language Model For Generative Recommendation Mei Kai, Zhang Yongfeng
- Simple LLM Prompting Is State-of-the-art For Robust And Multilingual Dialogue Evaluation Mendonça John, Pereira Patrícia, Moniz Helena, Carvalho João Paulo, Lavie Alon, Trancoso Isabel
- From Bytes To Biases: Investigating The Cultural Self-perception Of Large Language Models Messner Wolfgang, Greene Tatum, Matalone Josephine
- An AI Chatbot For Explaining Deep Reinforcement Learning Decisions Of Service-oriented Systems Metzger Andreas, Bartel Jone, Laufer Jan
- Developing A Scalable Benchmark For Assessing Large Language Models In Knowledge Graph Engineering Meyer Lars-peter, Frey Johannes, Junghanns Kurt, Brei Felix, Bulert Kirill, Gründer-fahrer Sabine, Martin Michael
- Think Before You Act: Unified Policy For Interleaving Language Reasoning With Actions Mezghani Lina, Bojanowski Piotr, Alahari Karteek, Sukhbaatar Sainbayar
- Towards Efficient Generative Large Language Model Serving: A Survey From Algorithms To Systems Miao Xupeng, Oliaro Gabriele, Zhang Zhihao, Cheng Xinhao, Jin Hongyi, Chen Tianqi, Jia Zhihao
- The Larger They Are, The Harder They Fail: Language Models Do Not Recognize Identifier Swaps In Python Miceli-barone Antonio Valerio, Barez Fazl, Konstas Ioannis, Cohen Shay B.
- Dialogue-based Generation Of Self-driving Simulation Scenarios Using Large Language Models Miceli-barone Antonio Valerio, Lascarides Alex, Innes Craig
- Think Twice: Measuring The Efficiency Of Eliminating Prediction Shortcuts Of Question Answering Models Mikula Lukáš, Štefánik Michal, Petrovič Marek, Sojka Petr
- Beyond Accuracy: Evaluating Self-consistency Of Code Large Language Models With Identitychain Min Marcus J., Ding Yangruibo, Buratti Luca, Pujar Saurabh, Kaiser Gail, Jana Suman, Ray Baishakhi
- Mindstorms In Natural Language-based Societies Of Mind Mingchen Zhuge, Haozhe Liu, Francesco Faccio, Dylan R. Ashley, Róbert Csordás, Anand Gopalakrishnan, Abdullah Hamdi, Hasan Abed Al Kader Hammoud, Vincent Herrmann, Kazuki Irie, Louis Kirsch, Bing Li, Guohao Li, Shuming Liu, Jinjie Mai, Piotr Piękos, Aditya Ramesh, Imanol Schlag, Weimin Shi, Aleksandar Stanić, Wenyi Wang, Yuhui Wang, Mengmeng Xu, Deng-ping Fan, Bernard Ghanem, Jürgen Schmidhuber
- Relu Strikes Back: Exploiting Activation Sparsity In Large Language Models Mirzadeh Iman, Alizadeh Keivan, Mehta Sachin, Del Mundo Carlo C, Tuzel Oncel, Samei Golnoosh, Rastegari Mohammad, Farajtabar Mehrdad
- Characterizing Large Language Models As Rationalizers Of Knowledge-intensive Tasks Mishra Aditi, Rahman Sajjadur, Kim Hannah, Mitra Kushan, Hruschka Estevam
- Triggering Multi-hop Reasoning For Question Answering In Language Models Using Soft Prompts And Random Walks Misra Kanishka, Santos Cicero Nogueira Dos, Shakeri Siamak
- An Emulator For Fine-tuning Large Language Models Using Small Language Models Mitchell Eric, Rafailov Rafael, Sharma Archit, Finn Chelsea, Manning Christopher D.
- Comparing Humans, GPT-4, And GPT-4V On Abstraction And Reasoning Tasks Mitchell Melanie, Palmarini Alessandro B., Moskvichev Arseny
- Towards Human-like Spoken Dialogue Generation Between AI Agents From Written Dialogue Mitsui Kentaro, Hono Yukiya, Sawada Kei
- Evaluation Of Faithfulness Using The Longest Supported Subsequence Mittal Anirudh, Schick Timo, Artetxe Mikel, Dwivedi-yu Jane
- Roll Up Your Sleeves: Working With A Collaborative And Engaging Task-oriented Dialogue System Mo Lingbo, Chen Shijie, Chen Ziru, Deng Xiang, Lewis Ashley, Singh Sunit, Stevens Samuel, Tai Chang-you, Wang Zhen, Yue Xiang, Zhang Tianshu, Su Yu, Sun Huan
- Tree Of Uncertain Thoughts Reasoning For Large Language Models Mo Shentong, Xin Miao
- Boosting Theory-of-mind Performance In Large Language Models Via Prompting Moghaddam Shima Rahimi, Honey Christopher J.
- Assigning AI: Seven Approaches For Students, With Prompts Mollick Ethan, Mollick Lilach
- Evaluating Cognitive Maps And Planning In Large Language Models With Cogeval Momennejad Ida, Hasanbeig Hosein, Vieira Felipe, Sharma Hiteshi, Ness Robert Osazuwa, Jojic Nebojsa, Palangi Hamid, Larson Jonathan
- Llms Killed The Script Kiddie: How Agents Supported By Large Language Models Change The Landscape Of Network Threat Testing Moskal Stephen, Laney Sam, Hemberg Erik, O'reilly Una-may
- Narrativexl: A Large-scale Dataset For Long-term Memory Models Moskvichev Arseny, Mai Ky-vinh
- Adaptive Machine Translation With Large Language Models Moslem Yasmin, Haque Rejwanul, Kelleher John D., Way Andy
- Fine-tuning Large Language Models For Adaptive Machine Translation Moslem Yasmin, Haque Rejwanul, Way Andy
- Can Llms Follow Simple Rules? Mu Norman, Chen Sarah, Wang Zifan, Chen Sizhe, Karamardian David, Aljeraisy Lulwa, Alomair Basel, Hendrycks Dan, Wagner David
- How To Plant Trees In Language Models: Data And Architectural Effects On The Emergence Of Syntactic Inductive Biases Mueller Aaron, Linzen Tal
- In-context Learning Generalizes, But Not Always Robustly: The Case Of Syntax Mueller Aaron, Webson Albert, Petty Jackson, Linzen Tal
- Assessing Translation Capabilities Of Large Language Models Involving English And Indian Languages Mujadia Vandan, Urlana Ashok, Bhaskar Yash, Pavani Penumalla Aditya, Shravya Kukkapalli, Krishnamurthy Parameswari, Sharma Dipti Misra
- "medium" Lms Of Code In The Era Of Llms: Lessons From Stackoverflow Mukherjee Manisha, Hellendoorn Vincent J.
- Superhf: Supervised Iterative Learning From Human Feedback Mukobi Gabriel, Chatain Peter, Fong Su, Windesheim Robert, Kutyniok Gitta, Bhatia Kush, Alberti Silas
- A Brief History Of Prompt: Leveraging Language Models. (through Advanced Prompting) Muktadir Golam Md
- Llms-healthcare: Current Applications And Challenges Of Large Language Models In Various Medical Specialties Mumtaz Ummara, Ahmed Awais, Mumtaz Summaya
- Salespeople Vs Salesbot: Exploring The Role Of Educational Value In Conversational Recommender Systems Murakhovs'ka Lidiya, Laban Philippe, Xie Tian, Xiong Caiming, Wu Chien-sheng
- Unival: Unified Model For Image, Video, Audio And Language Tasks Mustafa Shukor, Corentin Dancette, Alexandre Rame, Matthieu Cord
- Consistency Analysis Of Chatgpt Myeongjun Erik Jang, Thomas Lukasiewicz
- PEARL: Personalizing Large Language Model Writing Assistants With Generation-calibrated Retrievers Mysore Sheshera, Lu Zhuoran, Wan Mengting, Yang Longqi, Menezes Steve, Baghaee Tina, Gonzalez Emmanuel Barajas, Neville Jennifer, Safavi Tara
- Large Language Model Augmented Narrative Driven Recommendations Mysore Sheshera, Mccallum Andrew, Zamani Hamed
- Testing Language Model Agents Safely In The Wild Naihin Silen, Atkinson David, Green Marc, Hamadi Merwane, Swift Craig, Schonholtz Douglas, Kalai Adam Tauman, Bau David
- HALO: An Ontology For Representing And Categorizing Hallucinations In Large Language Models Nananukul Navapat, Kejriwal Mayank
- Cheaply Evaluating Inference Efficiency Metrics For Autoregressive Transformer Apis Narayanan Deepak, Santhanam Keshav, Henderson Peter, Bommasani Rishi, Lee Tony, Liang Percy
- Understanding (un)intended Memorization In Text-to-image Generative Models Naseh Ali, Roh Jaechul, Houmansadr Amir
- Scalable Extraction Of Training Data From (production) Language Models Nasr Milad, Carlini Nicholas, Hayase Jonathan, Jagielski Matthew, Cooper A. Feder, Ippolito Daphne, Choquette-choo Christopher A., Wallace Eric, Tramèr Florian, Lee Katherine
- Reinforcement Replaces Supervision: Query Focused Summarization Using Deep Reinforcement Learning Nath Swaroop, Khadilkar Harshad, Bhattacharyya Pushpak
- Towards Probing Contact Center Large Language Models Nathan Varun, Kumar Ayush, Ingle Digvijay, Vepa Jithendra
- Aligning A Medium-size GPT Model In English To A Small Closed Domain In Spanish Navarrete-parra Oscar R., Uc-cetina Victor, Reyes-magana Jorge
- Large Language Models As Tax Attorneys: A Case Study In Legal Capabilities Emergence Nay John J., Karamardian David, Lawsky Sarah B., Tao Wenting, Bhat Meghana, Jain Raghav, Lee Aaron Travis, Choi Jonathan H., Kasai Jungo
- Large Language Models In Healthcare And Medical Domain: A Review Nazi Zabir Al, Peng Wei
- Enhancing Logical Reasoning In Large Language Models To Facilitate Legal Applications Nguyen Ha-thanh, Fungwacharakorn Wachara, Satoh Ken
- Black-box Analysis: Gpts Across Time In Legal Textual Entailment Task Nguyen Ha-thanh, Goebel Randy, Toni Francesca, Stathis Kostas, Satoh Ken
- When Giant Language Brains Just Aren't Enough! Domain Pizzazz With Knowledge Sparkle Dust Nguyen Minh-tien, Nguyen Duy-hung, Sabahi Shahab, Le Hung, Yang Jeff, Hotta Hajime
- Culturax: A Cleaned, Enormous, And Multilingual Dataset For Large Language Models In 167 Languages Nguyen Thuat, Van Nguyen Chien, Lai Viet Dac, Man Hieu, Ngo Nghia Trung, Dernoncourt Franck, Rossi Ryan A., Nguyen Thien Huu
- LEVER: Learning To Verify Language-to-code Generation With Execution Ni Ansong, Iyer Srini, Radev Dragomir, Stoyanov Ves, Yih Wen-tau, Wang Sida I., Lin Xi Victoria
- An Experimental Study: Assessing The Combined Framework Of Wavlm And BEST-RQ For Text-to-speech Synthesis Nielson Via, Hillis Steven
- Xgen-7b Technical Report Nijkamp Erik, Xie Tian, Hayashi Hiroaki, Pang Bo, Xia Congying, Xing Chen, Vig Jesse, Yavuz Semih, Laban Philippe, Krause Ben, Purushwalkam Senthil, Niu Tong, Kryściński Wojciech, Murakhovs'ka Lidiya, Choubey Prafulla Kumar, Fabbri Alex, Liu Ye, Meng Rui, Tu Lifu, Bhat Meghana, Wu Chien-sheng, Savarese Silvio, Zhou Yingbo, Joty Shafiq, Xiong Caiming
- LEXTREME: A Multi-lingual And Multi-task Benchmark For The Legal Domain Niklaus Joel, Matoshi Veton, Rani Pooja, Galassi Andrea, Stürmer Matthias, Chalkidis Ilias
- Album Storytelling With Iterative Story-aware Captioning And Large Language Models Ning Munan, Xie Yujia, Chen Dongdong, Song Zeyin, Yuan Lu, Tian Yonghong, Ye Qixiang, Yuan Li
- Ragtruth: A Hallucination Corpus For Developing Trustworthy Retrieval-augmented Language Models Niu Cheng, Wu Yuanhao, Zhu Juno, Xu Siliang, Shum Kashun, Zhong Randy, Song Juntong, Zhang Tong
- Reflexion: Language Agents With Verbal Reinforcement Learning Noah Shinn, Federico Cassano, Edward Berman, Ashwin Gopinath, Karthik Narasimhan, Shunyu Yao
- AI Text-to-behavior: A Study In Steerability Noever David, Hyams Sam
- Multimodal Analysis Of Google Bard And Gpt-vision: Experiments In Visual Reasoning Noever David, Noever Samantha Elizabeth Miller
- Capabilities Of GPT-4 On Medical Challenge Problems Nori Harsha, King Nicholas, Mckinney Scott Mayer, Carignan Dean, Horvitz Eric
- In-contextual Gender Bias Suppression For Large Language Models Oba Daisuke, Kaneko Masahiro, Bollegala Danushka
- Blackvip: Black-box Visual Prompting For Robust Transfer Learning Oh Changdae, Hwang Hyeji, Lee Hee-young, Lim Yongtaek, Jung Geunyoung, Jung Jiyoung, Choi Hosik, Song Kyungwoo
- Askit: Unified Programming Interface For Programming With Large Language Models Okuda Katsumi, Amarasinghe Saman
- LINC: A Neurosymbolic Approach For Logical Reasoning By Combining Language Models With First-order Logic Provers Olausson Theo X., Gu Alex, Lipkin Benjamin, Zhang Cedegao E., Solar-lezama Armando, Tenenbaum Joshua B., Levy Roger
- Is Self-repair A Silver Bullet For Code Generation? Olausson Theo X., Inala Jeevana Priya, Wang Chenglong, Gao Jianfeng, Solar-lezama Armando
- Automated Assessment Of Students' Code Comprehension Using Llms Oli Priti, Banjade Rabin, Chapagain Jeevan, Rus Vasile
- Deepthought: An Architecture For Autonomous Self-motivated Systems Oliveira Arlindo L., Domingos Tiago, Figueiredo Mário, Lima Pedro U.
- GPT-4 Technical Report Openai, Josh Achiam, Steven Adler, Sandhini Agarwal, Lama Ahmad, Ilge Akkaya, Florencia Leoni Aleman, Diogo Almeida, Janko Altenschmidt, Sam Altman, Shyamal Anadkat, Red Avila, Igor Babuschkin, Suchir Balaji, Valerie Balcom, Paul Baltescu, Haiming Bao, Mohammad Bavarian, Jeff Belgum, Irwan Bello, Jake Berdine, Gabriel Bernadett-shapiro, Christopher Berner, Lenny Bogdonoff, Oleg Boiko, Madelaine Boyd, Anna-luisa Brakman, Greg Brockman, Tim Brooks, Miles Brundage, Kevin Button, Trevor Cai, Rosie Campbell, Andrew Cann, Brittany Carey, Chelsea Carlson, Rory Carmichael, Brooke Chan, Che Chang, Fotis Chantzis, Derek Chen, Sully Chen, Ruby Chen, Jason Chen, Mark Chen, Ben Chess, Chester Cho, Casey Chu, Hyung Won Chung, Dave Cummings, Jeremiah Currier, Yunxing Dai, Cory Decareaux, Thomas Degry, Noah Deutsch, Damien Deville, Arka Dhar, David Dohan, Steve Dowling, Sheila Dunning, Adrien Ecoffet, Atty Eleti, Tyna Eloundou, David Farhi, Liam Fedus, Niko Felix, Simón Posada Fishman, Juston Forte, Isabella Fulford, Leo Gao, Elie Georges, Christian Gibson, Vik Goel, Tarun Gogineni, Gabriel Goh, Rapha Gontijo-lopes, Jonathan Gordon, Morgan Grafstein, Scott Gray, Ryan Greene, Joshua Gross, Shixiang Shane Gu, Yufei Guo, Chris Hallacy, Jesse Han, Jeff Harris, Yuchen He, Mike Heaton, Johannes Heidecke, Chris Hesse, Alan Hickey, Wade Hickey, Peter Hoeschele, Brandon Houghton, Kenny Hsu, Shengli Hu, Xin Hu, Joost Huizinga, Shantanu Jain, Shawn Jain, Joanne Jang, Angela Jiang, Roger Jiang, Haozhun Jin, Denny Jin, Shino Jomoto, Billie Jonn, Heewoo Jun, Tomer Kaftan, Łukasz Kaiser, Ali Kamali, Ingmar Kanitscheider, Nitish Shirish Keskar, Tabarak Khan, Logan Kilpatrick, Jong Wook Kim, Christina Kim, Yongjik Kim, Jan Hendrik Kirchner, Jamie Kiros, Matt Knight, Daniel Kokotajlo, Łukasz Kondraciuk, Andrew Kondrich, Aris Konstantinidis, Kyle Kosic, Gretchen Krueger, Vishal Kuo, Michael Lampe, Ikai Lan, Teddy Lee, Jan Leike, Jade Leung, Daniel Levy, Chak Ming Li, Rachel Lim, Molly Lin, Stephanie Lin, Mateusz Litwin, Theresa Lopez, Ryan Lowe, Patricia Lue, Anna Makanju, Kim Malfacini, Sam Manning, Todor Markov, Yaniv Markovski, Bianca Martin, Katie Mayer, Andrew Mayne, Bob Mcgrew, Scott Mayer Mckinney, Christine Mcleavey, Paul Mcmillan, Jake Mcneil, David Medina, Aalok Mehta, Jacob Menick, Luke Metz, Andrey Mishchenko, Pamela Mishkin, Vinnie Monaco, Evan Morikawa, Daniel Mossing, Tong Mu, Mira Murati, Oleg Murk, David Mély, Ashvin Nair, Reiichiro Nakano, Rajeev Nayak, Arvind Neelakantan, Richard Ngo, Hyeonwoo Noh, Long Ouyang, Cullen O'keefe, Jakub Pachocki, Alex Paino, Joe Palermo, Ashley Pantuliano, Giambattista Parascandolo, Joel Parish, Emy Parparita, Alex Passos, Mikhail Pavlov, Andrew Peng, Adam Perelman, Filipe De Avila Belbute Peres, Michael Petrov, Henrique Ponde De Oliveira Pinto, Michael Pokorny, Michelle Pokrass, Vitchyr H. Pong, Tolly Powell, Alethea Power, Boris Power, Elizabeth Proehl, Raul Puri, Alec Radford, Jack Rae, Aditya Ramesh, Cameron Raymond, Francis Real, Kendra Rimbach, Carl Ross, Bob Rotsted, Henri Roussez, Nick Ryder, Mario Saltarelli, Ted Sanders, Shibani Santurkar, Girish Sastry, Heather Schmidt, David Schnurr, John Schulman, Daniel Selsam, Kyla Sheppard, Toki Sherbakov, Jessica Shieh, Sarah Shoker, Pranav Shyam, Szymon Sidor, Eric Sigler, Maddie Simens, Jordan Sitkin, Katarina Slama, Ian Sohl, Benjamin Sokolowsky, Yang Song, Natalie Staudacher, Felipe Petroski Such, Natalie Summers, Ilya Sutskever, Jie Tang, Nikolas Tezak, Madeleine B. Thompson, Phil Tillet, Amin Tootoonchian, Elizabeth Tseng, Preston Tuggle, Nick Turley, Jerry Tworek, Juan Felipe Cerón Uribe, Andrea Vallone, Arun Vijayvergiya, Chelsea Voss, Carroll Wainwright, Justin Jay Wang, Alvin Wang, Ben Wang, Jonathan Ward, Jason Wei, Cj Weinmann, Akila Welihinda, Peter Welinder, Jiayi Weng, Lilian Weng, Matt Wiethoff, Dave Willner, Clemens Winter, Samuel Wolrich, Hannah Wong, Lauren Workman, Sherwin Wu, Jeff Wu, Michael Wu, Kai Xiao, Tao Xu, Sarah Yoo, Kevin Yu, Qiming Yuan, Wojciech Zaremba, Rowan Zellers, Chong Zhang, Marvin Zhang, Shengjia Zhao, Tianhao Zheng, Juntang Zhuang, William Zhuk, Barret Zoph
- Detecting Llm-generated Text In Computing Education: A Comparative Study For Chatgpt Cases Orenstrakh Michael Sheinman, Karnalim Oscar, Suarez Carlos Anibal, Liut Michael
- Thoughtsource: A Central Hub For Large Language Model Reasoning Data Ott Simon, Hebenstreit Konstantin, Liévin Valentin, Hother Christoffer Egeberg, Moradi Milad, Mayrhauser Maximilian, Praas Robert, Winther Ole, Samwald Matthias
- Autoplan: Automatic Planning Of Interactive Decision-making Tasks With Large Language Models Ouyang Siqi, Li Lei
- The Shifted And The Overlooked: A Task-oriented Investigation Of User-gpt Interactions Ouyang Siru, Wang Shuohang, Liu Yang, Zhong Ming, Jiao Yizhu, Iter Dan, Pryzant Reid, Zhu Chenguang, Ji Heng, Han Jiawei
- Structured Chemistry Reasoning With Large Language Models Ouyang Siru, Zhang Zhuosheng, Yan Bing, Liu Xuan, Choi Yejin, Han Jiawei, Qin Lianhui
- Med-halt: Medical Domain Hallucination Test For Large Language Models Pal Ankit, Umapathi Logesh Kumar, Sankarasubbu Malaikannan
- Giraffe: Adventures In Expanding Context Lengths In Llms Pal Arka, Karkhanis Deep, Roberts Manley, Dooley Samuel, Sundararajan Arvind, Naidu Siddartha
- Evaluating And Explaining Large Language Models For Code Using Syntactic Structures Palacio David N, Velasco Alejandro, Rodriguez-cardenas Daniel, Moran Kevin, Poshyvanyk Denys
- Ontochatgpt Information System: Ontology-driven Structured Prompts For Chatgpt Meta-learning Palagin Oleksandr, Kaverinskiy Vladislav, Litvin Anna, Malakhov Kyrylo
- Towards Vision-language Mechanistic Interpretability: A Causal Tracing Tool For BLIP Palit Vedant, Pandey Rohan, Arora Aryaman, Liang Paul Pu
- Kwaiagents: Generalized Information-seeking Agent System With Large Language Models Pan Haojie, Zhai Zepeng, Yuan Hao, Lv Yaojia, Fu Ruiji, Liu Ming, Wang Zhongyuan, Qin Bing
- Smoothquant+: Accurate And Efficient 4-bit Post-training Weight Quantization For LLM Pan Jiayi, Wang Chengcan, Zheng Kaifu, Li Yangguang, Wang Zhenyu, Feng Bin
- Retrieving-to-answer: Zero-shot Video Question Answering With Frozen Large Language Models Pan Junting, Lin Ziyi, Ge Yuying, Zhu Xiatian, Zhang Renrui, Wang Yi, Qiao Yu, Li Hongsheng
- Let's Reinforce Step By Step Pan Sarah, Lialin Vladislav, Muckatira Sherin, Rumshisky Anna
- Tokenize Anything Via Prompting Pan Ting, Tang Lulu, Wang Xinlong, Shan Shiguang
- Kosmos-g: Generating Images In Context With Multimodal Large Language Models Pan Xichen, Dong Li, Huang Shaohan, Peng Zhiliang, Chen Wenhu, Wei Furu
- An Empathetic User-centric Chatbot For Emotional Support Pan Yanting, Tang Yixuan, Niu Yuchen
- Tppoet: Transformer-based Persian Poem Generation Using Minimal Data And Advanced Decoding Techniques Panahandeh Amir, Asemi Hanie, Nourani Esmaeil
- Automating Customer Service Using Langchain: Building Custom Open-source GPT Chatbot For Organizations Pandya Keivalya, Holia Mehfuza
- Language Model Self-improvement By Reinforcement Learning Contemplation Pang Jing-cheng, Wang Pengyuan, Li Kaiyuan, Chen Xiong-hui, Xu Jiacheng, Zhang Zongzhang, Yu Yang
- Frozen Transformers In Language Models Are Effective Visual Encoder Layers Pang Ziqi, Xie Ziyang, Man Yunze, Wang Yu-xiong
- Lifelong Robot Learning With Human Assisted Language Planners Parakh Meenal, Fong Alisha, Simeonov Anthony, Chen Tao, Gupta Abhishek, Agrawal Pulkit
- ART: Automatic Multi-step Reasoning And Tool-use For Large Language Models Paranjape Bhargavi, Lundberg Scott, Singh Sameer, Hajishirzi Hannaneh, Zettlemoyer Luke, Ribeiro Marco Tulio
- On Measuring Faithfulness Or Self-consistency Of Natural Language Explanations Parcalabescu Letitia, Frank Anette
- Clip-guided Vision-language Pre-training For Question Answering In 3D Scenes Parelli Maria, Delitzas Alexandros, Hars Nikolas, Vlassis Georgios, Anagnostidis Sotirios, Bachmann Gregor, Hofmann Thomas
- Localized Symbolic Knowledge Distillation For Visual Commonsense Models Park Jae Sung, Hessel Jack, Chandu Khyathi Raghavi, Liang Paul Pu, Lu Ximing, West Peter, Yu Youngjae, Huang Qiuyuan, Gao Jianfeng, Farhadi Ali, Choi Yejin
- A Large Language Model Approach To Educational Survey Feedback Analysis Parker Michael J., Anderson Caitlin, Stone Claire, Oh Yearim
- Evaluating In-context Learning Of Libraries For Code Generation Patel Arkil, Reddy Siva, Bahdanau Dzmitry, Dasigi Pradeep
- ECLIPSE: A Resource-efficient Text-to-image Prior For Image Generations Patel Maitreya, Kim Changhoon, Cheng Sheng, Baral Chitta, Yang Yezhou
- Entity Matching Using Large Language Models Peeters Ralph, Bizer Christian
- GPT Self-supervision For A Better Data Annotator Pei Xiaohuan, Li Yanxi, Xu Chang
- Exploiting Novel GPT-4 Apis Pelrine Kellin, Taufeeque Mohammad, Zając Michał, Mclean Euan, Gleave Adam
- Chatgpt Prompting Cannot Estimate Predictive Uncertainty In High-resource Languages Pelucchi Martino, Valdenegro-toro Matias
- Instruction Tuning With GPT-4 Peng Baolin, Li Chunyuan, He Pengcheng, Galley Michel, Gao Jianfeng
- Stabilizing RLHF Through Advantage Model And Selective Rehearsal Peng Baolin, Song Linfeng, Tian Ye, Jin Lifeng, Mi Haitao, Yu Dong
- A Study Of Generative Large Language Model For Medical Research And Healthcare Peng Cheng, Yang Xi, Chen Aokun, Smith Kaleb E, Pournejatian Nima, Costa Anthony B, Martin Cheryl, Flores Mona G, Zhang Ying, Magoc Tanja, Lipori Gloria, Mitchell Duane A, Ospina Naykky S, Ahmed Mustafa M, Hogan William R, Shenkman Elizabeth A, Guo Yi, Bian Jiang, Wu Yonghui
- When Does In-context Learning Fall Short And Why? A Study On Specification-heavy Tasks Peng Hao, Wang Xiaozhi, Chen Jianhui, Li Weikai, Qi Yunjia, Wang Zimu, Wu Zhili, Zeng Kaisheng, Xu Bin, Hou Lei, Li Juanzi
- FP8-LM: Training FP8 Large Language Models Peng Houwen, Wu Kan, Wei Yixuan, Zhao Guoshuai, Yang Yuxiang, Liu Ze, Xiong Yifan, Yang Ziyue, Ni Bolin, Hu Jingcheng, Li Ruihang, Zhang Miaosen, Li Chen, Ning Jia, Wang Ruizhe, Zhang Zheng, Liu Shuguang, Chau Joe, Hu Han, Cheng Peng
- Spoken Language Intelligence Of Large Language Models For Language Learning Peng Linkai, Nuchged Baorian, Gao Yingming
- Self-driven Grounding: Large Language Model Agents With Automatical Language-aligned Skill Learning Peng Shaohui, Hu Xing, Yi Qi, Zhang Rui, Guo Jiaming, Huang Di, Tian Zikang, Chen Ruizhi, Du Zidong, Guo Qi, Chen Yunji, Li Ling
- Reproducing Whisper-style Training Using An Open-source Toolkit And Publicly Available Data Peng Yifan, Tian Jinchuan, Yan Brian, Berrebbi Dan, Chang Xuankai, Li Xinjian, Shi Jiatong, Arora Siddhant, Chen William, Sharma Roshan, Zhang Wangyou, Sudo Yui, Shakeel Muhammad, Jung Jee-weon, Maiti Soumi, Watanabe Shinji
- LARG, Language-based Automatic Reward And Goal Generation Perez Julien, Proux Denys, Roux Claude, Niemaz Michael
- Large Language Models Can Infer Psychological Dispositions Of Social Media Users Peters Heinrich, Matz Sandra
- Constitutionmaker: Interactively Critiquing Large Language Models By Converting Feedback Into Principles Petridis Savvas, Wedin Ben, Wexler James, Donsbach Aaron, Pushkarna Mahima, Goyal Nitesh, Cai Carrie J., Terry Michael
- The Impact Of Depth On Compositional Generalization In Transformer Language Models Petty Jackson, Van Steenkiste Sjoerd, Dasgupta Ishita, Sha Fei, Garrette Dan, Linzen Tal
- Attention Sorting Combats Recency Bias In Long Context Language Models Peysakhovich Alexander, Lerer Adam
- Training Chain-of-thought Via Latent-variable Inference Phan Du, Hoffman Matthew D., Dohan David, Douglas Sholto, Le Tuan Anh, Parisi Aaron, Sountsov Pavel, Sutton Charles, Vikram Sharad, Saurous Rif A.
- Zero-shot Question Answering Over Financial Documents Using Large Language Models Phogat Karmvir Singh, Harsha Chetan, Dasaratha Sridhar, Ramakrishna Shashishekar, Puranam Sai Akhil
- Generating High-precision Feedback For Programming Syntax Errors Using Large Language Models Phung Tung, Cambronero José, Gulwani Sumit, Kohn Tobias, Majumdar Rupak, Singla Adish, Soares Gustavo
- Generative AI For Programming Education: Benchmarking Chatgpt, GPT-4, And Human Tutors Phung Tung, Pădurean Victor-alexandru, Cambronero José, Gulwani Sumit, Kohn Tobias, Majumdar Rupak, Singla Adish, Soares Gustavo
- Automating Human Tutor-style Programming Feedback: Leveraging GPT-4 Tutor Model For Hint Generation And GPT-3.5 Student Model For Hint Validation Phung Tung, Pădurean Victor-alexandru, Singh Anjali, Brooks Christopher, Cambronero José, Gulwani Sumit, Singla Adish, Soares Gustavo
- LLM Self Defense: By Self Examination, Llms Know They Are Being Tricked Phute Mansi, Helbling Alec, Hull Matthew, Peng Shengyun, Szyller Sebastian, Cornelius Cory, Chau Duen Horng
- Typhoon: Thai Large Language Models Pipatanakul Kunat, Jirabovonvisut Phatrasek, Manakul Potsawee, Sripaisarnmongkol Sittipong, Patomwong Ruangsak, Chokchainant Pathomporn, Tharnpipitchai Kasima
- Sabiá: Portuguese Large Language Models Pires Ramon, Abonizio Hugo, Almeida Thales Sales, Nogueira Rodrigo
- Evaluating Gpt-4's Vision Capabilities On Brazilian University Admission Exams Pires Ramon, Almeida Thales Sales, Abonizio Hugo, Nogueira Rodrigo
- Hyena Hierarchy: Towards Larger Convolutional Language Models Poli Michael, Massaroli Stefano, Nguyen Eric, Fu Daniel Y., Dao Tri, Baccus Stephen, Bengio Yoshua, Ermon Stefano, Ré Christopher
- Painter: Teaching Auto-regressive Language Models To Draw Sketches Pourreza Reza, Bhattacharyya Apratim, Panchal Sunny, Lee Mingu, Madan Pulkit, Memisevic Roland
- Egovlpv2: Egocentric Video-language Pre-training With Fusion In The Backbone Pramanick Shraman, Song Yale, Nag Sayan, Lin Kevin Qinghong, Shah Hardik, Shou Mike Zheng, Chellappa Rama, Zhang Pengchuan
- Adapt: As-needed Decomposition And Planning With Language Models Prasad Archiki, Koller Alexander, Hartmann Mareike, Clark Peter, Sabharwal Ashish, Bansal Mohit, Khot Tushar
- Rephrase, Augment, Reason: Visual Grounding Of Questions For Vision-language Models Prasad Archiki, Stengel-eskin Elias, Bansal Mohit
- The Robots Are Here: Navigating The Generative AI Revolution In Computing Education Prather James, Denny Paul, Leinonen Juho, Becker Brett A., Albluwi Ibrahim, Craig Michelle, Keuning Hieke, Kiesler Natalie, Kohn Tobias, Luxton-reilly Andrew, Macneil Stephen, Peterson Andrew, Pettit Raymond, Reeves Brent N., Savelka Jaromir
- Automatic Prompt Optimization With "gradient Descent" And Beam Search Pryzant Reid, Iter Dan, Li Jerry, Lee Yin Tat, Zhu Chenguang, Zeng Michael
- Neural Task Synthesis For Visual Programming Pădurean Victor-alexandru, Tzannetos Georgios, Singla Adish
- Vidcom: Fast Video Comprehension Through Large Language Models With Multimodal Tools Qi Ji, Ji Kaixuan, Yu Jifan, Wang Duokang, Xu Bin, Hou Lei, Li Juanzi
- Fine-tuning Aligned Language Models Compromises Safety, Even When Users Do Not Intend To! Qi Xiangyu, Zeng Yi, Xie Tinghao, Chen Pin-yu, Jia Ruoxi, Mittal Prateek, Henderson Peter
- PILLOW: Enhancing Efficient Instruction Fine-tuning Via Prompt Matching Qi Zhenting, Tan Xiaoyu, Shi Shaojie, Qu Chao, Xu Yinghui, Qi Yuan
- Toolink: Linking Toolkit Creation And Using Through Chain-of-solving On Open-source Model Qian Cheng, Xiong Chenyan, Liu Zhenghao, Liu Zhiyuan
- "merge Conflicts!" Exploring The Impacts Of External Distractors To Parametric Knowledge Graphs Qian Cheng, Zhao Xinran, Wu Sherry Tongshuang
- Taskweaver: A Code-first Agent Framework Qiao Bo, Li Liqun, Zhang Xu, He Shilin, Kang Yu, Zhang Chaoyun, Yang Fangkai, Dong Hang, Zhang Jue, Wang Lu, Ma Minghua, Zhao Pu, Qin Si, Qin Xiaoting, Du Chao, Xu Yong, Lin Qingwei, Rajmohan Saravan, Zhang Dongmei
- OPERA: Alleviating Hallucination In Multi-modal Large Language Models Via Over-trust Penalty And Retrospection-allocation Qidong Huang, Xiaoyi Dong, Pan Zhang, Bin Wang, Conghui He, Jiaqi Wang, Dahua Lin, Weiming Zhang, Nenghai Yu
- Hallucidoctor: Mitigating Hallucinatory Toxicity In Visual Instruction Data Qifan Yu, Juncheng Li, Longhui Wei, Liang Pang, Wentao Ye, Bosheng Qin, Siliang Tang, Qi Tian, Yueting Zhuang
- Toolllm: Facilitating Large Language Models To Master 16000+ Real-world Apis Qin Yujia, Liang Shihao, Ye Yining, Zhu Kunlun, Yan Lan, Lu Yaxi, Lin Yankai, Cong Xin, Tang Xiangru, Qian Bill, Zhao Sihan, Hong Lauren, Tian Runchu, Xie Ruobing, Zhou Jie, Gerstein Mark, Li Dahai, Liu Zhiyuan, Sun Maosong
- Transnormerllm: A Faster And Better Large Language Model With Improved Transnormer Qin Zhen, Li Dong, Sun Weigao, Sun Weixuan, Shen Xuyang, Han Xiaodong, Wei Yunshen, Lv Baohong, Luo Xiao, Qiao Yu, Zhong Yiran
- Mplug-owl: Modularization Empowers Large Language Models With Multimodality Qinghao Ye, Haiyang Xu, Guohai Xu, Jiabo Ye, Ming Yan, Yiyang Zhou, Junyang Wang, Anwen Hu, Pengcheng Shi, Yaya Shi, Chenliang Li, Yuanhong Xu, Hehong Chen, Junfeng Tian, Qi Qian, Ji Zhang, Fei Huang, Jingren Zhou
- Adalora: Adaptive Budget Allocation For Parameter-efficient Fine-tuning Qingru Zhang, Minshuo Chen, Alexander Bukharin, Nikos Karampatziakis, Pengcheng He, Yu Cheng, Weizhu Chen, Tuo Zhao
- Embodied Executable Policy Learning With Language-based Scene Summarization Qiu Jielin, Xu Mengdi, Han William, Moon Seungwhan, Zhao Ding
- Controlrec: Bridging The Semantic Gap Between Language Model And Personalized Recommendation Qiu Junyan, Wang Haitao, Hong Zhaolin, Yang Yiping, Liu Qiang, Wang Xingxing
- Unlocking Emergent Modularity In Large Language Models Qiu Zihan, Huang Zeyu, Fu Jie
- Direct Preference Optimization: Your Language Model Is Secretly A Reward Model Rafael Rafailov, Archit Sharma, Eric Mitchell, Stefano Ermon, Christopher D. Manning, Chelsea Finn
- Coedit: Text Editing By Task-specific Instruction Tuning Raheja Vipul, Kumar Dhruv, Koo Ryan, Kang Dongyeop
- Quantized Transformer Language Model Implementations On Edge Devices Rahman Mohammad Wali Ur, Abrar Murad Mehrab, Copening Hunter Gibbons, Hariri Salim, Shao Sicong, Satam Pratik, Salehi Soheil
- Explaining Large Language Model-based Neural Semantic Parsers (Student Abstract) Rai Daking, Zhou Yilun, Wang Bailin, Yao Ziyu
- Mixed-distil-bert: Code-mixed Language Modeling For Bangla, English, And Hindi Raihan Md Nishat, Goswami Dhiman, Mahmud Antara
- Semantic Consistency For Assuring Reliability Of Large Language Models Raj Harsh, Gupta Vipul, Rosati Domenic, Majumdar Subhabrata
- K-PERM: Personalized Response Generation Using Dynamic Knowledge Retrieval And Persona-adaptive Queries Raj Kanak, Roy Kaushik, Bonagiri Vamshi, Govil Priyanshul, Thirunarayanan Krishnaprasad, Gaur Manas
- Mabvit -- Modified Attention Block Enhances Vision Transformers Ramesh Mahesh, Ramkumar Aswinkumar
- Tailoring Self-rationalizers With Multi-reward Distillation Ramnath Sahana, Joshi Brihi, Hallinan Skyler, Lu Ximing, Li Liunian Harold, Chan Aaron, Hessel Jack, Choi Yejin, Ren Xiang
- Aligning Neural Machine Translation Models: Human Feedback In Training And Inference Ramos Miguel Moura, Fernandes Patrick, Farinhas António, Martins André F. T.
- Retrieval-augmented Image Captioning Ramos Rita, Elliott Desmond, Martins Bruno
- When Large Language Models Contradict Humans? Large Language Models' Sycophantic Behaviour Ranaldi Leonardo, Pucci Giulia
- Chatgpt-crawler: Find Out If Chatgpt Really Knows What It's Talking About Rangapur Aman, Wang Haoran
- Retrieval-based Knowledge Augmented Vision Language Pre-training Rao Jiahua, Shan Zifei, Liu Longpo, Zhou Yao, Yang Yuedong
- Evaluating Llms On Document-based QA: Exact Answer Selection And Numerical Extraction Using Cogtale Dataset Rasool Zafaryab, Kurniawan Stefanus, Balugo Sherwin, Barnett Scott, Vasa Rajesh, Chesser Courtney, Hampstead Benjamin M., Belleville Sylvie, Mouzakis Kon, Bahar-fuchs Alex
- Supporting Human-ai Collaboration In Auditing Llms With Llms Rastogi Charvi, Ribeiro Marco Tulio, King Nicholas, Nori Harsha, Amershi Saleema
- On Context Utilization In Summarization With Large Language Models Ravaut Mathieu, Sun Aixin, Chen Nancy F., Joty Shafiq
- Exploring The Relationship Between LLM Hallucinations And Prompt Linguistic Nuances: Readability, Formality, And Concreteness Rawte Vipula, Priya Prachi, Tonmoy S. M Towhidul Islam, Zaman S M Mehedi, Sheth Amit, Das Amitava
- COLA: A Benchmark For Compositional Text-to-image Retrieval Ray Arijit, Radenovic Filip, Dubey Abhimanyu, Plummer Bryan A., Krishna Ranjay, Saenko Kate
- Dial Beinfo For Faithfulness: Improving Factuality Of Information-seeking Dialogue Via Behavioural Fine-tuning Razumovskaia Evgeniia, Vulić Ivan, Marković Pavle, Cichy Tomasz, Zheng Qian, Wen Tsung-hsien, Budzianowski Paweł
- Nemo Guardrails: A Toolkit For Controllable And Safe LLM Applications With Programmable Rails Rebedea Traian, Dinu Razvan, Sreedhar Makesh, Parisien Christopher, Cohen Jonathan
- Self-evaluation Improves Selective Generation In Large Language Models Ren Jie, Zhao Yao, Vu Tu, Liu Peter J., Lakshminarayanan Balaji
- Investigating The Factual Knowledge Boundary Of Large Language Models With Retrieval Augmentation Ren Ruiyang, Wang Yuhao, Qu Yingqi, Zhao Wayne Xin, Liu Jing, Tian Hao, Wu Hua, Wen Ji-rong, Wang Haifeng
- Sparq Attention: Bandwidth-efficient LLM Inference Ribar Luka, Chelombiev Ivan, Hudlass-galley Luke, Blake Charlie, Luschi Carlo, Orr Douglas
- Integrating Summarization And Retrieval For Enhanced Personalization Via Large Language Models Richardson Chris, Zhang Yao, Gillespie Kellen, Kar Sudipta, Singh Arshdeep, Raeesy Zeynab, Khan Omar Zia, Sethy Abhinav
- SYNDICOM: Improving Conversational Commonsense With Error-injection And Natural Language Feedback Richardson Christopher, Sundar Anirudh, Heck Larry
- Data Contamination Through The Lens Of Time Roberts Manley, Thakur Himanshu, Herlihy Christine, White Colin, Dooley Samuel
- Smoothllm: Defending Large Language Models Against Jailbreaking Attacks Robey Alexander, Wong Eric, Hassani Hamed, Pappas George J.
- Bio-sieve: Exploring Instruction Tuning Large Language Models For Systematic Review Automation Robinson Ambrose, Thorne William, Wu Ben P., Pandor Abdullah, Essat Munira, Stevenson Mark, Song Xingyi
- Chatgpt MT: Competitive For High- (but Not Low-) Resource Languages Robinson Nathaniel R., Ogayo Perez, Mortensen David R., Neubig Graham
- Benchmarking Causal Study To Interpret Large Language Models For Source Code Rodriguez-cardenas Daniel, Palacio David N., Khati Dipin, Burke Henry, Poshyvanyk Denys
- Next-step Hint Generation For Introductory Programming Using Large Language Models Roest Lianne, Keuning Hieke, Jeuring Johan
- Synergistic Integration Of Large Language Models And Cognitive Architectures For Robust AI: An Exploratory Analysis Romero Oscar J., Zimmerman John, Steinfeld Aaron, Tomasic Anthony
- Audiogpt: Understanding And Generating Speech, Music, Sound, And Talking Head Rongjie Huang, Mingze Li, Dongchao Yang, Jiatong Shi, Xuankai Chang, Zhenhui Ye, Yuning Wu, Zhiqing Hong, Jiawei Huang, Jinglin Liu, Yi Ren, Zhou Zhao, Shinji Watanabe
- Axiomatic Preference Modeling For Longform Question Answering Rosset Corby, Zheng Guoqing, Dibia Victor, Awadallah Ahmed, Bennett Paul
- Knowledge-infused Self Attention Transformers Roy Kaushik, Zi Yuxin, Narayanan Vignesh, Gaur Manas, Sheth Amit
- Unveiling The Potential Of Large Language Models In Generating Semantic And Cross-language Clones Roy Palash R., Alam Ajmain I., Al-omari Farouq, Roy Banani, Roy Chanchal K., Schneider Kevin A.
- From Chatbots To Phishbots? -- Preventing Phishing Scams Created Using Chatgpt, Google Bard And Claude Roy Sayak Saha, Thota Poojitha, Naragam Krishna Vamsi, Nilizadeh Shirin
- Journey Of Hallucination-minimized Generative AI Solutions For Financial Decision Makers Roychowdhury Sohini
- Code Llama: Open Foundation Models For Code Rozière Baptiste, Gehring Jonas, Gloeckle Fabian, Sootla Sten, Gat Itai, Tan Xiaoqing Ellen, Adi Yossi, Liu Jingyu, Sauvestre Romain, Remez Tal, Rapin Jérémy, Kozhevnikov Artyom, Evtimov Ivan, Bitton Joanna, Bhatt Manish, Ferrer Cristian Canton, Grattafiori Aaron, Xiong Wenhan, Défossez Alexandre, Copet Jade, Azhar Faisal, Touvron Hugo, Martin Louis, Usunier Nicolas, Scialom Thomas, Synnaeve Gabriel
- TPTU: Large Language Model-based AI Agents For Task Planning And Tool Usage Ruan Jingqing, Chen Yihong, Zhang Bin, Xu Zhiwei, Bao Tianpeng, Du Guoqing, Shi Shiwei, Mao Hangyu, Li Ziyue, Zeng Xingyu, Zhao Rui
- Identifying The Risks Of LM Agents With An Lm-emulated Sandbox Ruan Yangjun, Dong Honghua, Wang Andrew, Pitis Silviu, Zhou Yongchao, Ba Jimmy, Dubois Yann, Maddison Chris J., Hashimoto Tatsunori
- Retrieval-pretrained Transformer: Long-range Language Modeling With Self-retrieval Rubin Ohad, Berant Jonathan
- Attention-likelihood Relationship In Transformers Ruscio Valeria, Maiorca Valentino, Silvestri Fabrizio
- Catfood: Counterfactual Augmented Training For Improving Out-of-domain Performance And Calibration Sachdeva Rachneet, Tutek Martin, Gurevych Iryna
- Performance Of Chatgpt-3.5 And GPT-4 On The United States Medical Licensing Examination With And Without Distractions Safrai Myriam, Azaria Amos
- Uncertainty Guided Global Memory Improves Multi-hop Question Answering Sagirova Alsu, Burtsev Mikhail
- Promptmix: A Class Boundary Augmentation Method For Large Language Model Distillation Sahu Gaurav, Vechtomova Olga, Bahdanau Dzmitry, Laradji Issam H.
- Verbosity Bias In Preference Labeling By Large Language Models Saito Keita, Wachi Akifumi, Wataoka Koki, Akimoto Youhei
- Extending The Frontier Of Chatgpt: Code Generation And Debugging Sakib Fardin Ahsan, Khan Saadat Hasan, Karim A. H. M. Rezaul
- Zero-shot Translation Of Attention Patterns In VQA Models To Natural Language Salewski Leonard, Koepke A. Sophia, Lensch Hendrik P. A., Akata Zeynep
- Can Llms Augment Low-resource Reading Comprehension Datasets? Opportunities And Challenges Samuel Vinay, Aynaou Houda, Chowdhury Arijit Ghosh, Ramanan Karthik Venkat, Chadha Aman
- Large Language Models Are Competitive Near Cold-start Recommenders For Language- And Item-based Preferences Sanner Scott, Balog Krisztian, Radlinski Filip, Wedin Ben, Dixon Lucas
- Efficient RLHF: Reducing The Memory Usage Of PPO Santacroce Michael, Lu Yadong, Yu Han, Li Yuanzhi, Shen Yelong
- Camoscio: An Italian Instruction-tuned Llama Santilli Andrea, Rodolà Emanuele
- Early Weight Averaging Meets High Learning Rates For LLM Pre-training Sanyal Sunny, Neerkaje Atula, Kaddour Jean, Kumar Abhishek, Sanghavi Sujay
- Beyond Chinchilla-optimal: Accounting For Inference In Language Model Scaling Laws Sardana Nikhil, Portes Jacob, Doubov Sasha, Frankle Jonathan
- Will Code Remain A Relevant User Interface For End-user Programming With Generative AI Models? Sarkar Advait
- Inseq: An Interpretability Toolkit For Sequence Generation Models Sarti Gabriele, Feldhus Nils, Sickert Ludwig, Van Der Wal Oskar, Nissim Malvina, Bisazza Arianna
- RAMP: Retrieval And Attribute-marking Enhanced Prompting For Attribute-controlled Translation Sarti Gabriele, Htut Phu Mon, Niu Xing, Hsu Benjamin, Currey Anna, Dinu Georgiana, Nadejde Maria
- Interprompt: Interpretable Prompting For Interrelated Interpersonal Risk Factors In Reddit Posts Sathvik Msvpj, Sarkar Surjodeep, Saxena Chandni, Sohn Sunghwan, Garg Muskan
- Unlocking Practical Applications In Legal Domain: Evaluation Of GPT For Zero-shot Semantic Annotation Of Legal Texts Savelka Jaromir
- Large Language Models (GPT) Struggle To Answer Multiple-choice Questions About Code Savelka Jaromir, Agarwal Arav, Bogart Christopher, Sakr Majd
- Can GPT-4 Support Analysis Of Textual Data In Tasks Requiring Highly Specialized Domain Expertise? Savelka Jaromir, Ashley Kevin D., Gray Morgan A, Westermann Hannes, Xu Huihui
- Probing Conceptual Understanding Of Large Visual-language Models Schiappa Madeline, Abdullah Raiyaan, Azad Shehreen, Claypoole Jared, Cogswell Michael, Divakaran Ajay, Rawat Yogesh
- Toolformer: Language Models Can Teach Themselves To Use Tools Schick Timo, Dwivedi-yu Jane, Dessì Roberto, Raileanu Roberta, Lomeli Maria, Zettlemoyer Luke, Cancedda Nicola, Scialom Thomas
- Large Language Model Programs Schlag Imanol, Sukhbaatar Sainbayar, Celikyilmaz Asli, Yih Wen-tau, Weston Jason, Schmidhuber Jürgen, Li Xian
- Ruffle&Riley: Towards The Automated Induction Of Conversational Tutoring Systems Schmucker Robin, Xia Meng, Azaria Amos, Mitchell Tom
- Towards Llm-based Autograding For Short Textual Answers Schneider Johannes, Schenk Bernd, Niklaus Christina
- SEMQA: Semi-extractive Multi-source Question Answering Schuster Tal, Lelkes Adam D., Sun Haitian, Gupta Jai, Berant Jonathan, Cohen William W., Metzler Donald
- Chatbot-supported Thesis Writing: An Autoethnographic Report Schwenke Nicolas, Söbke Heinrich, Kraft Eckhard
- Personality Traits In Large Language Models Serapio-garcía Greg, Safdari Mustafa, Crepy Clément, Sun Luning, Fitz Stephen, Romero Peter, Abdulhai Marwa, Faust Aleksandra, Matarić Maja
- Language Models: A Guide For The Perplexed Serrano Sofia, Brumbaugh Zander, Smith Noah A.
- The Cot Collection: Improving Zero-shot And Few-shot Learning Of Language Models Via Chain-of-thought Fine-tuning Seungone Kim, Se June Joo, Doyoung Kim, Joel Jang, Seonghyeon Ye, Jamin Shin, Minjoon Seo
- Human Behavioral Benchmarking: Numeric Magnitude Comparison Effects In Large Language Models Shah Raj Sanjay, Marupudi Vijay, Koenen Reba, Bhardwaj Khushi, Varma Sashank
- Let's Have A Chat! A Conversation With Chatgpt: Technology, Applications, And Limitations Shahriar Sakib, Hayawi Kadhim
- Cognitive Effects In Large Language Models Shaki Jonathan, Kraus Sarit, Wooldridge Michael
- Evaluating Large Language Model Creativity From A Literary Perspective Shanahan Murray, Clarke Catherine
- One-shot Sensitivity-aware Mixed Sparsity Pruning For Large Language Models Shao Hang, Liu Bei, Xiao Bo, Zeng Ke, Wan Guanglu, Qian Yanmin
- Omniquant: Omnidirectionally Calibrated Quantization For Large Language Models Shao Wenqi, Chen Mengzhao, Zhang Zhaoyang, Xu Peng, Zhao Lirui, Li Zhiqian, Zhang Kaipeng, Gao Peng, Qiao Yu, Luo Ping
- Tinylvlm-ehub: Towards Comprehensive And Efficient Evaluation For Large Vision-language Models Shao Wenqi, Lei Meng, Hu Yutao, Gao Peng, Zhang Kaipeng, Meng Fanqing, Xu Peng, Huang Siyuan, Li Hongsheng, Qiao Yu, Luo Ping
- Enhancing Retrieval-augmented Large Language Models With Iterative Retrieval-generation Synergy Shao Zhihong, Gong Yeyun, Shen Yelong, Huang Minlie, Duan Nan, Chen Weizhu
- Language Is Not All You Need: Aligning Perception With Language Models Shaohan Huang, Li Dong, Wenhui Wang, Yaru Hao, Saksham Singhal, Shuming Ma, Tengchao Lv, Lei Cui, Owais Khan Mohammed, Barun Patra, Qiang Liu, Kriti Aggarwal, Zewen Chi, Johan Bjorck, Vishrav Chaudhary, Subhojit Som, Xia Song, Furu Wei
- Jailbreak In Pieces: Compositional Adversarial Attacks On Multi-modal Language Models Shayegani Erfan, Dong Yue, Abu-ghazaleh Nael
- The Flan Collection: Designing Data And Methods For Effective Instruction Tuning Shayne Longpre, Le Hou, Tu Vu, Albert Webson, Hyung Won Chung, Yi Tay, Denny Zhou, Quoc V. Le, Barret Zoph, Jason Wei, Adam Roberts
- Pangu-coder2: Boosting Large Language Models For Code With Ranking Feedback Shen Bo, Zhang Jiaxin, Chen Taihong, Zan Daoguang, Geng Bing, Fu An, Zeng Muhan, Yu Ailun, Ji Jichuan, Zhao Jingyang, Guo Yuenan, Wang Qianxiang
- Efficient LLM Inference On Cpus Shen Haihao, Chang Hanwen, Dong Bo, Luo Yu, Meng Hengyu
- The Trickle-down Impact Of Reward (in-)consistency On RLHF Shen Lingfeng, Chen Sihao, Song Linfeng, Jin Lifeng, Peng Baolin, Mi Haitao, Khashabi Daniel, Yu Dong
- "Do Anything Now": Characterizing And Evaluating In-the-wild Jailbreak Prompts On Large Language Models Shen Xinyue, Chen Zeyuan, Backes Michael, Shen Yun, Zhang Yang
- Towards More Unified In-context Visual Understanding Sheng Dianmo, Chen Dongdong, Tan Zhentao, Liu Qiankun, Chu Qi, Bao Jianmin, Gong Tao, Liu Bin, Xu Shengwei, Yu Nenghai
- Can Language Agents Be Alternatives To PPO? A Preliminary Empirical Study On Openai Gym Sheng Junjie, Huang Zixiao, Shen Chuyun, Li Wenhao, Hua Yun, Jin Bo, Zha Hongyuan, Wang Xiangfeng
- Specialist Or Generalist? Instruction Tuning For Specific NLP Tasks Shi Chufan, Su Yixuan, Yang Cheng, Yang Yujiu, Cai Deng
- Sotana: The Open-source Software Development Assistant Shi Ensheng, Zhang Fengji, Wang Yanlin, Chen Bei, Du Lun, Zhang Hongyu, Han Shi, Zhang Dongmei, Sun Hongbin
- Prompt Space Optimizing Few-shot Reasoning Success With Large Language Models Shi Fobo, Qing Peijun, Yang Dong, Wang Nan, Lei Youbo, Lu Haonan, Lin Xiaodong, Li Duantengchuan
- Badgpt: Exploring Security Vulnerabilities Of Chatgpt Via Backdoor Attacks To Instructgpt Shi Jiawen, Liu Yixin, Zhou Pan, Sun Lichao
- Unleashing The Power Of Pre-trained Language Models For Offline Reinforcement Learning Shi Ruizhe, Liu Yuyao, Ze Yanjie, Du Simon S., Xu Huazhe
- Preliminary Study On Incremental Learning For Large Language Model-based Recommender Systems Shi Tianhao, Zhang Yang, Xu Zhijian, Chen Chong, Feng Fuli, He Xiangnan, Tian Qi
- Trusting Your Evidence: Hallucinate Less With Context-aware Decoding Shi Weijia, Han Xiaochuang, Lewis Mike, Tsvetkov Yulia, Zettlemoyer Luke, Yih Scott Wen-tau
- MKRAG: Medical Knowledge Retrieval Augmented Generation For Medical Question Answering Shi Yucheng, Xu Shaochen, Yang Tianze, Liu Zhengliang, Liu Tianming, Li Quanzheng, Li Xiang, Liu Ninghao
- RADE: Reference-assisted Dialogue Evaluation For Open-domain Dialogue Shi Zhengliang, Sun Weiwei, Zhang Shuo, Zhang Zhen, Ren Pengjie, Ren Zhaochun
- Dept: Decomposed Prompt Tuning For Parameter-efficient Fine-tuning Shi Zhengxiang, Lipani Aldo
- Reasoning With Language Model Is Planning With World Model Shibo Hao, Yi Gu, Haodi Ma, Joshua Jiahua Hong, Zhen Wang, Daisy Zhe Wang, Zhiting Hu
- Large Language Models Can Enhance Persuasion Through Linguistic Feature Alignment Shin Minkyu, Kim Jin
- The ART Of LLM Refinement: Ask, Refine, And Trust Shridhar Kumar, Sinha Koustuv, Cohen Andrew, Wang Tianlu, Yu Ping, Pasunuru Ram, Sachan Mrinmaya, Weston Jason, Celikyilmaz Asli
- You Don't Need A Personality Test To Know These Models Are Unreliable: Assessing The Reliability Of Large Language Models On Psychometric Instruments Shu Bangzhao, Zhang Lechen, Choi Minje, Dunagan Lavinia, Logeswaran Lajanugen, Lee Moontae, Card Dallas, Jurgens David
- POSQA: Probe The World Models Of Llms With Size Comparisons Shu Chang, Han Jiuzhou, Liu Fangyu, Shareghi Ehsan, Collier Nigel
- Rewritelm: An Instruction-tuned Large Language Model For Text Rewriting Shu Lei, Luo Liangchen, Hoskere Jayakumar, Zhu Yun, Liu Yinxiao, Tong Simon, Chen Jindong, Meng Lei
- Fusion-eval: Integrating Assistant Evaluators With Llms Shu Lei, Wichers Nevan, Luo Liangchen, Zhu Yun, Liu Yinxiao, Chen Jindong, Meng Lei
- A Comprehensive Evaluation Of Large Language Models On Legal Judgment Prediction Shui Ruihao, Cao Yixin, Wang Xiang, Chua Tat-seng
- A Survey On Multimodal Large Language Models Shukang Yin, Chaoyou Fu, Sirui Zhao, Ke Li, Xing Sun, Tong Xu, Enhong Chen
- Ep-alm: Efficient Perceptual Augmentation Of Language Models Shukor Mustafa, Dancette Corentin, Cord Matthieu
- Beyond Task Performance: Evaluating And Reducing The Flaws Of Large Multimodal Models With In-context Learning Shukor Mustafa, Rame Alexandre, Dancette Corentin, Cord Matthieu
- Automatic Prompt Augmentation And Selection With Chain-of-thought From Labeled Data Shum Kashun, Diao Shizhe, Zhang Tong
- Measuring Inductive Biases Of In-context Learning With Underspecified Demonstrations Si Chenglei, Friedman Dan, Joshi Nitish, Feng Shi, Chen Danqi, He He
- Combo Of Thinking And Observing For Outside-knowledge VQA Si Qingyi, Mo Yuchen, Lin Zheng, Ji Huishan, Wang Weiping
- An Empirical Study Of Instruction-tuning Large Language Models In Chinese Si Qingyi, Wang Tong, Lin Zheng, Zhang Xu, Cao Yanan, Wang Weiping
- Mind Meets Machine: Unravelling Gpt-4's Cognitive Psychology Sifatkaur Dhingra, Manmeet Singh, Vaisakh Sb, Neetiraj Malviya, Sukhpal Singh Gill
- GPT-4 As An Agronomist Assistant? Answering Agriculture Exams Using Large Language Models Silva Bruno, Nunes Leonardo, Estevão Roberto, Aski Vijay, Chandra Ranveer
- The Confidence-competence Gap In Large Language Models: A Cognitive Study Singh Aniket Kumar, Devkota Suman, Lamichhane Bishal, Dhakal Uttam, Dhakal Chandra
- Better Generalization With Semantic Ids: A Case Study In Ranking For Recommendations Singh Anima, Vu Trung, Mehta Nikhil, Keshavan Raghunandan, Sathiamoorthy Maheswaran, Zheng Yilin, Hong Lichan, Heldt Lukasz, Wei Li, Tandon Devansh, Chi Ed H., Yi Xinyang
- Towards Expert-level Medical Question Answering With Large Language Models Singhal Karan, Tu Tao, Gottweis Juraj, Sayres Rory, Wulczyn Ellery, Hou Le, Clark Kevin, Pfohl Stephen, Cole-lewis Heather, Neal Darlene, Schaekermann Mike, Wang Amy, Amin Mohamed, Lachgar Sami, Mansfield Philip, Prakash Sushant, Green Bradley, Dominowska Ewa, Arcas Blaise Aguera Y, Tomasev Nenad, Liu Yun, Wong Renee, Semturs Christopher, Mahdavi S. Sara, Barral Joelle, Webster Dale, Corrado Greg S., Matias Yossi, Azizi Shekoofeh, Karthikesalingam Alan, Natarajan Vivek
- A Long Way To Go: Investigating Length Correlations In RLHF Singhal Prasann, Goyal Tanya, Xu Jiacheng, Durrett Greg
- Evaluating Chatgpt And GPT-4 For Visual Programming Singla Adish
- TABLET: Learning From Instructions For Tabular Data Slack Dylan, Singh Sameer
- Dont Add, Dont Miss: Effective Content Preserving Generation From Pre-selected Text Spans Slobodkin Aviv, Caciularu Avi, Hirsch Eran, Dagan Ido
- The Curious Case Of Hallucinatory (un)answerability: Finding Truths In The Hidden States Of Over-confident Large Language Models Slobodkin Aviv, Goldman Omer, Caciularu Avi, Dagan Ido, Ravfogel Shauli
- On Early Detection Of Hallucinations In Factual Question Answering Snyder Ben, Moisescu Marius, Zafar Muhammad Bilal
- LLM Guided Inductive Inference For Solving Compositional Problems Sodani Abhigya, Moos Lauren, Mirman Matthew
- Step: Stacked LLM Policies For Web Actions Sodhi Paloma, Branavan S. R. K., Artzi Yoav, Mcdonald Ryan
- Decoding Chatgpt: A Taxonomy Of Existing Research, Current Challenges, And Possible Future Directions Sohail Shahab Saquib, Farhat Faiza, Himeur Yassine, Nadeem Mohammad, Madsen Dag Øivind, Singh Yashbir, Atalla Shadi, Mansoor Wathiq
- Dynamics Of Instruction Tuning: Each Ability Of Large Language Models Has Its Own Growth Pace Song Chiyu, Zhou Zhanchao, Yan Jianhao, Fei Yuejiao, Lan Zhenzhong, Zhang Yue
- Preference Ranking Optimization For Human Alignment Song Feifan, Yu Bowen, Li Minghao, Yu Haiyang, Huang Fei, Li Yongbin, Wang Houfeng
- Pneumollm: Harnessing The Power Of Large Language Model For Pneumoconiosis Diagnosis Song Meiyue, Yu Zhihua, Wang Jiaxin, Wang Jiarui, Lu Yuting, Li Baicun, Wang Xiaoxu, Huang Qinghua, Li Zhijun, Kanellakis Nikolaos I., Liu Jiangfeng, Wang Jing, Wang Binglu, Yang Juntao
- How To Bridge The Gap Between Modalities: A Comprehensive Survey On Multimodal Large Language Model Song Shezheng, Li Xiaopeng, Li Shasha, Zhao Shan, Yu Jie, Ma Jun, Mao Xiaoguang, Zhang Weimin
- Large Language Models Meet Open-world Intent Discovery And Recognition: An Evaluation Of Chatgpt Song Xiaoshuai, He Keqing, Wang Pei, Dong Guanting, Mou Yutao, Wang Jingang, Xian Yunsen, Cai Xunliang, Xu Weiran
- Powerinfer: Fast Large Language Model Serving With A Consumer-grade GPU Song Yixin, Mi Zeyu, Xie Haotong, Chen Haibo
- Honeybee: Progressive Instruction Finetuning Of Large Language Models For Materials Science Song Yu, Miret Santiago, Zhang Huan, Liu Bang
- Reward Collapse In Aligning Large Language Models Song Ziang, Cai Tianle, Lee Jason D., Su Weijie J.
- Deduction Under Perturbed Evidence: Probing Student Simulation Capabilities Of Large Language Models Sonkar Shashank, Baraniuk Richard G.
- Learning Ui-to-code Reverse Generator Using Visual Critic Without Rendering Soselia Davit, Saifullah Khalid, Zhou Tianyi
- Evaluation Metrics In The Era Of GPT-4: Reliably Evaluating Large Language Models On Sequence To Sequence Tasks Sottana Andrea, Liang Bin, Zou Kai, Yuan Zheng
- Curriculum-guided Abstractive Summarization Sotudeh Sajad, Deilamsalehy Hanieh, Dernoncourt Franck, Goharian Nazli
- Musr: Testing The Limits Of Chain-of-thought With Multistep Soft Reasoning Sprague Zayne, Ye Xi, Bostrom Kaj, Chaudhuri Swarat, Durrett Greg
- Hierarchical Prompting Assists Large Language Model On Web Navigation Sridhar Abishek, Lo Robert, Xu Frank F., Zhu Hao, Zhou Shuyan
- Harnessing Llms In Curricular Design: Using GPT-4 To Support Authoring Of Learning Objectives Sridhar Pragnya, Doyle Aidan, Agarwal Arav, Bogart Christopher, Savelka Jaromir, Sakr Majd
- Chatgpt: A Study On Its Utility For Ubiquitous Software Engineering Tasks Sridhara Giriprasad, G. Ranjani H., Mazumdar Sourav
- Response-act Guided Reinforced Dialogue Generation For Mental Health Counseling Srivastava Aseem, Pandey Ishan, Akhtar Md. Shad, Chakraborty Tanmoy
- Evaluating Chatbots To Promote Users' Trust -- Practices And Open Problems Srivastava Biplav, Lakkaraju Kausik, Koppel Tarmo, Narayanan Vignesh, Kundu Ashish, Joshi Sachindra
- Core-cog: Conversational Recommendation Of Entities Using Constrained Generation Srivastava Harshvardhan, Pruthi Kanav, Chakrabarti Soumen, Mausam
- Seeing Seeds Beyond Weeds: Green Teaming Generative AI For Beneficial Uses Stapleton Logan, Taylor Jordan, Fox Sarah, Wu Tongshuang, Zhu Haiyi
- Pythia: A Suite For Analyzing Large Language Models Across Training And Scaling Stella Biderman, Hailey Schoelkopf, Quentin Anthony, Herbie Bradley, Kyle O'brien, Eric Hallahan, Mohammad Aflah Khan, Shivanshu Purohit, Usvsn Sai Prashanth, Edward Raff, Aviya Skowron, Lintang Sutawika, Oskar Van Der Wal
- A Mechanistic Interpretation Of Arithmetic Reasoning In Language Models Using Causal Mediation Analysis Stolfo Alessandro, Belinkov Yonatan, Sachan Mrinmaya
- Finetuning An LLM On Contextual Knowledge Of Classics For Q&A Strachan Shane Storm
- HC3 Plus: A Semantic-invariant Human Chatgpt Comparison Corpus Su Zhenpeng, Wu Xing, Zhou Wei, Ma Guangyuan, Hu Songlin
- Datatales: Investigating The Use Of Large Language Models For Authoring Data-driven Articles Sultanum Nicole, Srinivasan Arjun
- CONSCENDI: A Contrastive And Scenario-guided Distillation Approach To Guardrail Models For Virtual Assistants Sun Albert Yu, Nair Varun, Schumacher Elliot, Kannan Anitha
- Does Fine-tuning GPT-3 With The Openai API Leak Personally-identifiable Information? Sun Albert Yu, Zemour Eliott, Saxena Arushi, Vaidyanathan Udith, Lin Eric, Lau Christian, Mugunthan Vaikkunth
- TEST: Text Prototype Aligned Embedding To Activate Llm's Ability For Time Series Sun Chenxi, Li Hongyan, Li Yaliang, Hong Shenda
- Create Your World: Lifelong Text-to-image Diffusion Sun Gan, Liang Wenqi, Dong Jiahua, Li Jun, Ding Zhengming, Cong Yang
- Reinforcement Learning In The Era Of Llms: What Is Essential? What Is Needed? An RL Perspective On RLHF, Prompting, And Beyond Sun Hao
- Query-dependent Prompt Evaluation And Optimization With Offline Inverse RL Sun Hao, Hüyük Alihan, Van Der Schaar Mihaela
- Adaplanner: Adaptive Planning From Feedback With Language Models Sun Haotian, Zhuang Yuchen, Kong Lingkai, Dai Bo, Zhang Chao
- Corex: Pushing The Boundaries Of Complex Reasoning Through Multi-model Collaboration Sun Qiushi, Yin Zhangyue, Li Xiang, Wu Zhiyong, Qiu Xipeng, Kong Lingpeng
- Exploring The Impact Of Low-rank Adaptation On The Performance, Efficiency, And Regularization Of RLHF Sun Simeng, Gupta Dhawal, Iyyer Mohit
- PEARL: Prompting Large Language Models To Plan And Execute Actions Over Long Documents Sun Simeng, Liu Yang, Wang Shuohang, Zhu Chenguang, Iyyer Mohit
- Parrot: Enhancing Multi-turn Instruction Following For Large Language Models Sun Yuchong, Liu Che, Zhou Kun, Huang Jinwen, Song Ruihua, Zhao Wayne Xin, Zhang Fuzheng, Zhang Di, Gai Kun
- Retentive Network: A Successor To Transformer For Large Language Models Sun Yutao, Dong Li, Huang Shaohan, Ma Shuming, Xia Yuqing, Xue Jilong, Wang Jianyong, Wei Furu
- Multimodal Question Answering For Unified Information Extraction Sun Yuxuan, Zhang Kai, Su Yu
- A Short Survey Of Viewing Large Language Models In Legal Aspect Sun Zhongxiang
- Large Language Models For Intent-driven Session Recommendations Sun Zhu, Liu Hongyang, Qu Xinghua, Feng Kaidong, Wang Yan, Ong Yew-soon
- Ctbls: Augmenting Large Language Models With Conversational Tables Sundar Anirudh S, Heck Larry
- Pre-training Intent-aware Encoders For Zero- And Few-shot Intent Classification Sung Mujeen, Gung James, Mansimov Elman, Pappas Nikolaos, Shu Raphael, Romeo Salvatore, Zhang Yi, Castelli Vittorio
- Do Large Language Models Show Decision Heuristics Similar To Humans? A Case Study Using GPT-3.5 Suri Gaurav, Slater Lily R., Ziaee Ali, Nguyen Morgan
- Approximating Online Human Evaluation Of Social Chatbots With Prompting Svikhnushina Ekaterina, Pu Pearl
- Improving Non-autoregressive Translation Quality With Pretrained Language Model, Embedding Distillation And Upsampling Strategy For CTC Syu Shen-sian, Xie Juncheng, Lee Hung-yi
- Large Language Models As Generalizable Policies For Embodied Tasks Szot Andrew, Schwarzer Max, Agrawal Harsh, Mazoure Bogdan, Talbott Walter, Metcalf Katherine, Mackraz Natalie, Hjelm Devon, Toshev Alexander
- Sparks Of Artificial General Intelligence: Early Experiments With GPT-4 Sébastien Bubeck, Varun Chandrasekaran, Ronen Eldan, Johannes Gehrke, Eric Horvitz, Ece Kamar, Peter Lee, Yin Tat Lee, Yuanzhi Li, Scott Lundberg, Harsha Nori, Hamid Palangi, Marco Tulio Ribeiro, Yi Zhang
- Glitchbench: Can Large Multimodal Models Detect Video Game Glitches? Taesiri Mohammad Reza, Feng Tianjun, Nguyen Anh, Bezemer Cor-paul
- Link-context Learning For Multimodal Llms Tai Yan, Fan Weichen, Zhang Zhao, Zhu Feng, Zhao Rui, Liu Ziwei
- Redcoast: A Lightweight Tool To Automate Distributed Training Of Llms On Any Gpu/tpus Tan Bowen, Zhu Yun, Liu Lijuan, Wang Hongyi, Zhuang Yonghao, Chen Jindong, Xing Eric, Hu Zhiting
- Cappy: Outperforming And Boosting Large Multi-task Lms With A Small Scorer Tan Bowen, Zhu Yun, Liu Lijuan, Xing Eric, Hu Zhiting, Chen Jindong
- Is Chatgpt A Good Multi-party Conversation Solver? Tan Chao-hong, Gu Jia-chen, Ling Zhen-hua
- Large Language Model-driven Classroom Flipping: Empowering Student-centric Peer Questioning With Flipped Interaction Tan Chee Wei
- Copilot For Xcode: Exploring Ai-assisted Programming By Prompting Cloud-based Large Language Models Tan Chee Wei, Guo Shangxin, Wong Man Fai, Hang Ching Nam
- Towards Robust Temporal Reasoning Of Large Language Models Via A Multi-hop QA Dataset And Pseudo-instruction Tuning Tan Qingyu, Ng Hwee Tou, Bing Lidong
- Medchatzh: A Better Medical Adviser Learns From Better Instructions Tan Yang, Li Mingchen, Huang Zijie, Yu Huiqun, Fan Guisheng
- Can Chatgpt Replace Traditional KBQA Models? An In-depth Analysis Of The Question Answering Performance Of The GPT LLM Family Tan Yiming, Min Dehai, Li Yu, Li Wenbo, Hu Nan, Chen Yongrui, Qi Guilin
- Sparsity-guided Holistic Explanation For Llms With Interpretable Inference-time Intervention Tan Zhen, Chen Tianlong, Zhang Zhenyu, Liu Huan
- Interpreting Pretrained Language Models Via Concept Bottlenecks Tan Zhen, Cheng Lu, Wang Song, Bo Yuan, Li Jundong, Liu Huan
- Multiple-question Multiple-answer Text-vqa Tang Peng, Appalaraju Srikar, Manmatha R., Xie Yusheng, Mahadevan Vijay
- Toolalpaca: Generalized Tool Learning For Language Models With 3000 Simulated Cases Tang Qiaoyu, Deng Ziliang, Lin Hongyu, Han Xianpei, Liang Qiao, Cao Boxi, Sun Le
- Ml-bench: Evaluating Large Language Models And Agents For Machine Learning Tasks On Repository-level Code Tang Xiangru, Liu Yuliang, Cai Zefan, Shao Yanjun, Lu Junjie, Zhang Yichi, Deng Zexuan, Hu Helan, An Kaikai, Huang Ruijun, Si Shuzheng, Chen Sheng, Zhao Haozhe, Chen Liang, Wang Yan, Liu Tianyu, Jiang Zhiwei, Chang Baobao, Fang Yin, Qin Yujia, Zhou Wangchunshu, Zhao Yilun, Cohan Arman, Gerstein Mark
- Medagents: Large Language Models As Collaborators For Zero-shot Medical Reasoning Tang Xiangru, Zou Anni, Zhang Zhuosheng, Li Ziming, Zhao Yilun, Zhang Xingyao, Cohan Arman, Gerstein Mark
- RSVP: Customer Intent Detection Via Agent Response Contrastive And Generative Pre-training Tang Yu-chien, Wang Wei-yao, Yen An-zi, Peng Wen-chih
- Codi-2: In-context, Interleaved, And Interactive Any-to-any Generation Tang Zineng, Yang Ziyi, Khademi Mahmoud, Liu Yang, Zhu Chenguang, Bansal Mohit
- Using Large Language Models For Cybersecurity Capture-the-flag Challenges And Certification Questions Tann Wesley, Liu Yuancheng, Sim Jun Heng, Seah Choon Meng, Chang Ee-chien
- Webwise: Web Interface Control And Sequential Exploration With Large Language Models Tao Heyi, T Sethuraman V, Shlapentokh-rothman Michal, Hoiem Derek
- Adversarial Conversational Shaping For Intelligent Agents Tarasiewicz Piotr, Kenjeyev Sultan, Sebag Ilana, Alshehabi Shehab
- Caption Anything: Interactive Image Description With Diverse Multimodal Controls Teng Wang, Jinrui Zhang, Junjie Fei, Hao Zheng, Yunlong Tang, Zhe Li, Mingqi Gao, Shanshan Zhao
- CLAMP: Contrastive Language Model Prompt-tuning Teterwak Piotr, Sun Ximeng, Plummer Bryan A., Saenko Kate, Lim Ser-nam
- Soft-prompt Tuning For Large Language Models To Evaluate Bias Tian Jacob-junqi, Emerson David, Miyandoab Sevil Zanjani, Pandya Deval, Seyyed-kalantari Laleh, Khattak Faiza Khan
- Just Ask For Calibration: Strategies For Eliciting Calibrated Confidence Scores From Language Models Fine-tuned With Human Feedback Tian Katherine, Mitchell Eric, Zhou Allan, Sharma Archit, Rafailov Rafael, Yao Huaxiu, Finn Chelsea, Manning Christopher D.
- Chimed-gpt: A Chinese Medical Large Language Model With Full Training Regime And Better Alignment To Human Preferences Tian Yuanhe, Gan Ruyi, Song Yan, Zhang Jiaxing, Zhang Yongdong
- Macgyver: Are Large Language Models Creative Problem Solvers? Tian Yufei, Ravichander Abhilasha, Qin Lianhui, Bras Ronan Le, Marjieh Raja, Peng Nanyun, Choi Yejin, Griffiths Thomas L., Brahman Faeze
- RLHF-V: Towards Trustworthy Mllms Via Behavior Alignment From Fine-grained Correctional Human Feedback Tianyu Yu, Yuan Yao, Haoye Zhang, Taiwen He, Yifeng Han, Ganqu Cui, Jinyi Hu, Zhiyuan Liu, Hai-tao Zheng, Maosong Sun, Tat-seng Chua
- Do Llms Exhibit Human-like Response Biases? A Case Study In Survey Design Tjuatja Lindia, Chen Valerie, Wu Sherry Tongshuang, Talwalkar Ameet, Neubig Graham
- Parameter Efficient Tuning Allows Scalable Personalization Of Llms For Text Entry: A Case Study On Abbreviation Expansion Tomanek Katrin, Cai Shanqing, Venugopalan Subhashini
- Eliminating Reasoning Via Inferring With Planning: A New Framework To Guide Llms' Non-linear Thinking Tong Yongqi, Wang Yifan, Li Dawei, Wang Sizhe, Lin Zi, Han Simeng, Shang Jingbo
- Viclevr: A Visual Reasoning Dataset And Hybrid Multimodal Fusion Model For Visual Question Answering In Vietnamese Tran Khiem Vinh, Phan Hao Phu, Van Nguyen Kiet, Nguyen Ngan Luu Thuy
- Can Large Language Models Play Text Games Well? Current State-of-the-art And Open Questions Tsai Chen Feng, Zhou Xiaochen, Liu Sierra S., Li Jing, Yu Mo, Mei Hongyuan
- Efficiently Aligned Cross-lingual Transfer Learning For Conversational Tasks Using Prompt-tuning Tu Lifu, Qu Jin, Yavuz Semih, Joty Shafiq, Liu Wenhao, Xiong Caiming, Zhou Yingbo
- Automating Code-related Tasks Through Transformers: The Impact Of Pre-training Tufano Rosalia, Pascarella Luca, Bavota Gabriele
- Activation Addition: Steering Language Models Without Optimization Turner Alexander Matt, Thiergart Lisa, Leech Gavin, Udell David, Vazquez Juan J., Mini Ulisse, Macdiarmid Monte
- Focused Transformer: Contrastive Training For Context Scaling Tworkowski Szymon, Staniszewski Konrad, Pacek Mikołaj, Wu Yuhuai, Michalewski Henryk, Miłoś Piotr
- Simple Is Better And Large Is Not Enough: Towards Ensembling Of Foundational Language Models Tyagi Nancy, Shiri Aidin, Sarkar Surjodeep, Umrawal Abhishek Kumar, Gaur Manas
- Humans And Language Models Diverge When Predicting Repeating Text Vaidya Aditya R., Turek Javier, Huth Alexander G.
- On The Automatic Generation And Simplification Of Children's Stories Valentini Maria, Weber Jennifer, Salcido Jesus, Wright Téa, Colunga Eliana, Kann Katharina
- On The Planning Abilities Of Large Language Models: A Critical Investigation Valmeekam Karthik, Marquez Matthew, Sreedharan Sarath, Kambhampati Subbarao
- Large Language Models: The Need For Nuance In Current Debates And A Pragmatic Perspective On Understanding Van Dijk Bram M. A., Kouwenhoven Tom, Spruit Marco R., Van Duijn Max J.
- Theory Of Mind In Large Language Models: Examining Performance Of 11 State-of-the-art Models Vs. Children Aged 7-10 On Advanced Tests Van Duijn Max J., Van Dijk Bram M. A., Kouwenhoven Tom, De Valk Werner, Spruit Marco R., Van Der Putten Peter
- Accelerating Llama Inference By Enabling Intermediate Layer Decoding Via Instruction Tuning With LITE Varshney Neeraj, Chatterjee Agneet, Parmar Mihir, Baral Chitta
- Can NLP Models Correctly Reason Over Contexts That Break The Common Assumptions? Varshney Neeraj, Parmar Mihir, Patel Nisarg, Handa Divij, Sarkar Sayantan, Luo Man, Baral Chitta
- A Stitch In Time Saves Nine: Detecting And Mitigating Hallucinations Of Llms By Validating Low-confidence Generation Varshney Neeraj, Yao Wenlin, Zhang Hongming, Chen Jianshu, Yu Dong
- Gpt-who: An Information Density-based Machine-generated Text Detector Venkatraman Saranya, Uchendu Adaku, Lee Dongwon
- Rethinking The Evaluating Framework For Natural Language Understanding In AI Systems: Language Acquisition As A Core For Future Metrics Vera Patricio, Moya Pedro, Barraza Lisa
- Behind The Magic, MERLIM: Multi-modal Evaluation Benchmark For Large Image-language Models Villa Andrés, Alcázar Juan Carlos León, Soto Alvaro, Ghanem Bernard
- Mtcue: Learning Zero-shot Control Of Extra-textual Attributes By Leveraging Unstructured Context In Neural Machine Translation Vincent Sebastian, Flynn Robert, Scarton Carolina
- Fairpy: A Toolkit For Evaluation Of Social Biases And Their Mitigation In Large Language Models Viswanath Hrishikesh, Zhang Tianyi
- Freshllms: Refreshing Large Language Models With Search Engine Augmentation Vu Tu, Iyyer Mohit, Wang Xuezhi, Constant Noah, Wei Jerry, Wei Jason, Tar Chris, Sung Yun-hsuan, Zhou Denny, Le Quoc, Luong Thang
- Using Natural Language Explanations To Rescale Human Judgments Wadhwa Manya, Chen Jifan, Li Junyi Jessy, Durrett Greg
- Empirical Evaluation Of Uncertainty Quantification In Retrieval-augmented Language Models For Science Wagle Sridevi, Munikoti Sai, Acharya Anurag, Smith Sara, Horawalavithana Sameera
- Unraveling Downstream Gender Bias From Large Language Models: A Study On AI Educational Writing Assistance Wambsganss Thiemo, Su Xiaotian, Swamy Vinitra, Neshaei Seyed Parsa, Rietsche Roman, Käser Tanja
- Reformulating Domain Adaptation Of Large Language Models As Adapt-retrieve-revise: A Case Study On Chinese Legal Domain Wan Zhen, Zhang Yating, Wang Yexiang, Cheng Fei, Kurohashi Sadao
- Cost-effective Hyperparameter Optimization For Large Language Model Generation Inference Wang Chi, Liu Susan Xueqing, Awadallah Ahmed H.
- Openchat: Advancing Open-source Language Models With Mixed-quality Data Wang Guan, Cheng Sijie, Zhan Xianyuan, Li Xiangang, Song Sen, Liu Yang
- What Makes For Good Visual Tokenizers For Large Language Models? Wang Guangzhi, Ge Yixiao, Ding Xiaohan, Kankanhalli Mohan, Shan Ying
- A Survey Of The Evolution Of Language Model-based Dialogue Systems Wang Hongru, Wang Lingzhi, Du Yiming, Chen Liang, Zhou Jingyan, Wang Yufei, Wong Kam-fai
- Cue-cot: Chain-of-thought Prompting For Responding To In-depth Dialogue Questions With Llms Wang Hongru, Wang Rui, Mi Fei, Deng Yang, Wang Zezhong, Liang Bin, Xu Ruifeng, Wong Kam-fai
- Dialog Action-aware Transformer For Dialog Policy Learning Wang Huimin, Kwan Wai-chung, Wong Kam-fai
- Missrec: Pre-training And Transferring Multi-modal Interest-aware Sequence Representation For Recommendation Wang Jinpeng, Zeng Ziyun, Wang Yunxiao, Wang Yuting, Lu Xingyu, Li Tianxiang, Yuan Jun, Zhang Rui, Zheng Hai-tao, Xia Shu-tao
- Self-prompted Chain-of-thought On Large Language Models For Open-domain Multi-hop Reasoning Wang Jinyuan, Li Junlong, Zhao Hai
- Improved Visual Fine-tuning With Natural Language Supervision Wang Junyang, Xu Yuanhong, Hu Juhua, Yan Ming, Sang Jitao, Qian Qi
- Mathcoder: Seamless Code Integration In Llms For Enhanced Mathematical Reasoning Wang Ke, Ren Houxing, Zhou Aojun, Lu Zimu, Luo Sichun, Shi Weikang, Zhang Renrui, Song Linqi, Zhan Mingjie, Li Hongsheng
- Chated: A Chatbot Leveraging Chatgpt For An Enhanced Learning Experience In Higher Education Wang Kevin, Ramos Jason, Lawrence Ramon
- Adapting LLM Agents With Universal Feedback In Communication Wang Kuan, Lu Yadong, Santacroce Michael, Gong Yeyun, Zhang Chao, Shen Yelong
- Sibyl: Sensible Empathetic Dialogue Generation With Visionary Commonsense Knowledge Wang Lanrui, Li Jiangnan, Yang Chenxu, Lin Zheng, Tang Hongyin, Liu Huan, Huang Xiaolei, Cao Yanan, Wang Jingang, Wang Weiping
- Document-level Machine Translation With Large Language Models Wang Longyue, Lyu Chenyang, Ji Tianbo, Zhang Zhirui, Yu Dian, Shi Shuming, Tu Zhaopeng
- Simultaneous Machine Translation With Large Language Models Wang Minghan, Zhao Jinming, Vu Thuy-trang, Shiri Fatemeh, Shareghi Ehsan, Haffari Gholamreza
- Making Large Language Models Better Reasoners With Alignment Wang Peiyi, Li Lei, Chen Liang, Song Feifan, Lin Binghuai, Cao Yunbo, Liu Tianyu, Sui Zhifang
- Demystifying Instruction Mixing For Fine-tuning Large Language Models Wang Renxi, Li Haonan, Wu Minghao, Wang Yuxia, Han Xudong, Zhang Chiyu, Baldwin Timothy
- Ivygpt: Interactive Chinese Pathway Language Model In Medical Domain Wang Rongsheng, Duan Yaofei, Lam Chantong, Chen Jiexi, Xu Jiangsheng, Chen Haoming, Liu Xiaohong, Pang Patrick Cheong-iao, Tan Tao
- Bridging The Novice-expert Gap Via Models Of Decision-making: A Case Study On Remediating Math Mistakes Wang Rose E., Zhang Qingyang, Robinson Carly, Loeb Susanna, Demszky Dorottya
- Compositional Text-to-image Synthesis With Attention Map Control Of Diffusion Models Wang Ruichen, Chen Zekang, Chen Chen, Ma Jian, Lu Haonan, Lin Xiaodong
- Hypothesis Search: Inductive Reasoning With Language Models Wang Ruocheng, Zelikman Eric, Poesia Gabriel, Pu Yewen, Haber Nick, Goodman Noah D.
- Vamos: Versatile Action Models For Video Understanding Wang Shijie, Zhao Qi, Do Minh Quan, Agarwal Nakul, Lee Kwonjoon, Sun Chen
- CAR: Conceptualization-augmented Reasoner For Zero-shot Commonsense Question Answering Wang Weiqi, Fang Tianqing, Ding Wenxuan, Xu Baixuan, Liu Xin, Song Yangqiu, Bosselut Antoine
- Lora Ensembles For Large Language Model Fine-tuning Wang Xi, Aitchison Laurence, Rudolph Maja
- TRACE: A Comprehensive Benchmark For Continual Learning In Large Language Models Wang Xiao, Zhang Yuansen, Chen Tianze, Gao Songyang, Jin Senjie, Yang Xianjun, Xi Zhiheng, Zheng Rui, Zou Yicheng, Gui Tao, Zhang Qi, Huang Xuanjing
- MINT: Evaluating Llms In Multi-turn Interaction With Tools And Language Feedback Wang Xingyao, Wang Zihan, Liu Jiateng, Chen Yangyi, Yuan Lifan, Peng Hao, Ji Heng
- Promptagent: Strategic Planning With Language Models Enables Expert-level Prompt Optimization Wang Xinyuan, Li Chenxi, Wang Zhen, Bai Fan, Luo Haotian, Zhang Jiayou, Jojic Nebojsa, Xing Eric P., Hu Zhiting
- User-aware Prefix-tuning Is A Good Learner For Personalized Image Captioning Wang Xuan, Wang Guanhong, Chai Wenhao, Zhou Jiayu, Wang Gaoang
- Emotional Intelligence Of Large Language Models Wang Xuena, Li Xueting, Yin Zi, Wu Yue, Jia Liu
- Instructta: Instruction-tuned Targeted Attack For Large Vision-language Models Wang Xunguang, Ji Zhenlan, Ma Pingchuan, Li Zongjie, Wang Shuai
- Adding Guardrails To Advanced Chatbots Wang Yanchen, Singh Lisa
- Multilora: Democratizing Lora For Better Multi-task Learning Wang Yiming, Lin Yu, Zeng Xiaodong, Zhang Guannan
- Meta-reasoning: Semantics-symbol Deconstruction For Large Language Models Wang Yiming, Zhang Zhuosheng, Zhang Pei, Yang Baosong, Wang Rui
- Primacy Effect Of Chatgpt Wang Yiwei, Cai Yujun, Chen Muhao, Liang Yuxuan, Hooi Bryan
- Fake Alignment: Are Llms Really Aligned Well? Wang Yixu, Teng Yan, Huang Kexin, Lyu Chengqi, Zhang Songyang, Zhang Wenwei, Ma Xingjun, Jiang Yu-gang, Qiao Yu, Wang Yingchun
- DRDT: Dynamic Reflection With Divergent Thinking For Llm-based Sequential Recommendation Wang Yu, Liu Zhiwei, Zhang Jianguo, Yao Weiran, Heinecke Shelby, Yu Philip S.
- Learning Hierarchical Prompt With Structured Linguistic Knowledge For Vision-language Models Wang Yubin, Jiang Xinyang, Cheng De, Li Dongsheng, Zhao Cairong
- Smarttrim: Adaptive Tokens And Attention Pruning For Efficient Vision-language Models Wang Zekun, Chen Jingchang, Zhou Wangchunshu, Zhu Haichao, Liang Jiafeng, Shan Liping, Liu Ming, Xu Dongliang, Yang Qing, Qin Bing
- An Effective Data Creation Pipeline To Generate High-quality Financial Instruction Data For Large Language Model Wang Ziao, Wang Jianning, Wu Junda, Zhang Xiaofeng
- Generating Valid And Natural Adversarial Examples With Large Language Models Wang Zimu, Wang Wei, Chen Qi, Wang Qiufeng, Nguyen Anh
- Enabling Language Models To Implicitly Learn Self-improvement Wang Ziqi, Hou Le, Lu Tianjian, Wu Yuexin, Li Yunxuan, Yu Hongkun, Ji Heng
- Filling The Image Information Gap For VQA: Prompting Large Language Models To Proactively Ask Questions Wang Ziyue, Chen Chi, Li Peng, Liu Yang
- Do Llms Understand User Preferences? Evaluating Llms On User Rating Prediction Wang-cheng Kang, Jianmo Ni, Nikhil Mehta, Maheswaran Sathiamoorthy, Lichan Hong, Ed Chi, Derek Zhiyuan Cheng
- Memorybank: Enhancing Large Language Models With Long-term Memory Wanjun Zhong, Lianghong Guo, Qiqi Gao, He Ye, Yanlin Wang
- Machine Translation For Ge'ez Language Wassie Aman Kassahun
- Evaluation Is All You Need. Prompting Generative Large Language Models For Annotation Tasks In The Social Sciences. A Primer Using Open Models Weber Maximilian, Reichardt Merle
- Jailbroken: How Does LLM Safety Training Fail? Wei Alexander, Haghtalab Nika, Steinhardt Jacob
- Leveraging Large Language Models To Power Chatbots For Collecting User Self-reported Data Wei Jing, Kim Sungdong, Jung Hyunhoon, Kim Young-ho
- Llmrec: Large Language Models With Graph Augmentation For Recommendation Wei Wei, Xubin Ren, Jiabin Tang, Qinyong Wang, Lixin Su, Suqi Cheng, Junfeng Wang, Dawei Yin, Chao Huang
- Polylm: An Open Source Polyglot Large Language Model Wei Xiangpeng, Wei Haoran, Lin Huan, Li Tianhao, Zhang Pei, Ren Xingzhang, Li Mei, Wan Yu, Cao Zhiwei, Xie Binbin, Hu Tianxiang, Li Shangjie, Hui Binyuan, Yu Bowen, Liu Dayiheng, Yang Baosong, Huang Fei, Xie Jun
- Menatqa: A New Dataset For Testing The Temporal Comprehension And Reasoning Abilities Of Large Language Models Wei Yifan, Su Yisong, Ma Huanhuan, Yu Xiaoyan, Lei Fangyu, Zhang Yuanzhe, Zhao Jun, Liu Kang
- Copiloting The Copilots: Fusing Large Language Models With Completion Engines For Automated Program Repair Wei Yuxiang, Xia Chunqiu Steven, Zhang Lingming
- Is Chatgpt Good At Search? Investigating Large Language Models As Re-ranking Agents Weiwei Sun, Lingyong Yan, Xinyu Ma, Shuaiqiang Wang, Pengjie Ren, Zhumin Chen, Dawei Yin, Zhaochun Ren
- LLMSTEP: LLM Proofstep Suggestions In Lean Welleck Sean, Saha Rahul
- "According To ...": Prompting Language Models Improves Quoting From Pre-training Data Weller Orion, Marone Marc, Weir Nathaniel, Lawrie Dawn, Khashabi Daniel, Van Durme Benjamin
- Autodroid: Llm-powered Task Automation In Android Wen Hao, Li Yuanchun, Liu Guohong, Zhao Shanhui, Yu Tao, Li Toby Jia-jun, Jiang Shiqi, Liu Yunhao, Zhang Yaqin, Liu Yunxin
- Unveiling The Implicit Toxicity In Large Language Models Wen Jiaxin, Ke Pei, Sun Hao, Zhang Zhexin, Li Chengfei, Bai Jinfeng, Huang Minlie
- GROVE: A Retrieval-augmented Complex Story Generation Framework With A Forest Of Evidence Wen Zhihua, Tian Zhiliang, Wu Wei, Yang Yuxin, Shi Yanqi, Huang Zhen, Li Dongsheng
- Hyperpolyglot Llms: Cross-lingual Interpretability In Token Embeddings Wen-yi Andrea W, Mimno David
- Lmtuner: An User-friendly And Highly-integrable Training Framework For Fine-tuning Large Language Models Weng Yixuan, Wang Zhiqi, Liao Huanxuan, He Shizhu, Liu Shengping, Liu Kang, Zhao Jun
- An Overview Of Temporal Commonsense Reasoning And Acquisition Wenzel Georg, Jatowt Adam
- The Generative AI Paradox: "What It Can Create, It May Not Understand" West Peter, Lu Ximing, Dziri Nouha, Brahman Faeze, Li Linjie, Hwang Jena D., Jiang Liwei, Fisher Jillian, Ravichander Abhilasha, Chandu Khyathi, Newman Benjamin, Koh Pang Wei, Ettinger Allyson, Choi Yejin
- Exploring Parameter-efficient Fine-tuning Techniques For Code Generation With Large Language Models Weyssow Martin, Zhou Xin, Kim Kisub, Lo David, Sahraoui Houari
- From Word Models To World Models: Translating From Natural Language To The Probabilistic Language Of Thought Wong Lionel, Grand Gabriel, Lew Alexander K., Goodman Noah D., Mansinghka Vikash K., Andreas Jacob, Tenenbaum Joshua B.
- Natural Language Generation And Understanding Of Big Code For Ai-assisted Programming: A Review Wong Man Fai, Guo Shangxin, Hang Ching Nam, Ho Siu Wai, Tan Chee Wei
- Cases Of EFL Secondary Students' Prompt Engineering Pathways To Complete A Writing Task With Chatgpt Woo David James, Guo Kai, Susanto Hengky
- Speechgen: Unlocking The Generative Power Of Speech Language Models With Prompts Wu Haibin, Chang Kai-wei, Wu Yuan-kuei, Lee Hung-yi
- Empower Nested Boolean Logic Via Self-supervised Curriculum Learning Wu Hongqiu, Liu Linfeng, Zhao Hai, Zhang Min
- A Comparative Study Of Open-source Large Language Models, GPT-4 And Claude 2: Multiple-choice Test Taking In Nephrology Wu Sean, Koo Michael, Blum Lesley, Black Andy, Kao Liyo, Scalzo Fabien, Kurtz Ira
- Next-gpt: Any-to-any Multimodal LLM Wu Shengqiong, Fei Hao, Qu Leigang, Ji Wei, Chua Tat-seng
- A New Dialogue Response Generation Agent For Large Language Models By Asking Questions To Detect User's Intentions Wu Siwei, Shen Xiangqing, Xia Rui
- Self-correcting Llm-controlled Diffusion Models Wu Tsung-han, Lian Long, Gonzalez Joseph E., Li Boyi, Darrell Trevor
- Unleashing Potential Of Evidence In Knowledge-intensive Dialogue Generation Wu Xianjie, Yang Jian, Li Tongliang, Liang Di, Zhang Shiwei, Du Yiyang, Li Zhoujun
- Unveiling Security, Privacy, And Ethical Concerns Of Chatgpt Wu Xiaodong, Duan Ran, Ni Jianbing
- An Early Evaluation Of Gpt-4v(ision) Wu Yang, Wang Shilong, Yang Hao, Zheng Tian, Zhang Hongbo, Zhao Yanyan, Qin Bing
- TLM: Token-level Masking For Transformers Wu Yangjun, Fang Kebin, Zhang Dongxiang, Wang Han, Zhang Hao, Chen Gang
- Retrieve-rewrite-answer: A Kg-to-text Enhanced Llms Framework For Knowledge Graph Question Answering Wu Yike, Hu Nan, Bi Sheng, Qi Guilin, Ren Jie, Xie Anhuan, Song Wei
- Fine-grained Human Feedback Gives Better Rewards For Language Model Training Wu Zeqiu, Hu Yushi, Shi Weijia, Dziri Nouha, Suhr Alane, Ammanabrolu Prithviraj, Smith Noah A., Ostendorf Mari, Hajishirzi Hannaneh
- Towards Open-world Recommendation With Knowledge Augmentation From Large Language Models Xi Yunjia, Liu Weiwen, Lin Jianghao, Cai Xiaoling, Zhu Hong, Zhu Jieming, Chen Bo, Tang Ruiming, Zhang Weinan, Zhang Rui, Yu Yong
- Improving Question Generation With Multi-level Content Planning Xia Zehua, Gou Qi, Yu Bowen, Yu Haiyang, Huang Fei, Li Yongbin, Nguyen Cam-tu
- Language Models Meet World Models: Embodied Experiences Enhance Language Models Xiang Jiannan, Tao Tianhua, Gu Yi, Shu Tianmin, Wang Zirui, Yang Zichao, Hu Zhiting
- Lm-cocktail: Resilient Tuning Of Language Models Via Model Merging Xiao Shitao, Liu Zheng, Zhang Peitian, Xing Xingrun
- Personalized Abstractive Summarization By Tri-agent Generation Pipeline Xiao Wen, Xie Yujia, Carenini Giuseppe, He Pengcheng
- How Far Are Llms From Believable AI? A Benchmark For Evaluating The Believability Of Human Behavior Simulation Xiao Yang, Cheng Yi, Fu Jinlan, Wang Jiashuo, Li Wenjie, Liu Pengfei
- O3D: Offline Data-driven Discovery And Distillation For Sequential Decision-making With Large Language Models Xiao Yuchen, Sun Yanchao, Xu Mengda, Madhushani Udari, Vann Jared, Garg Deepeka, Ganesh Sumitra
- Supporting Qualitative Analysis With Large Language Models: Combining Codebook With GPT-3 For Deductive Coding Xiao Ziang, Yuan Xingdi, Liao Q. Vera, Abdelghani Rania, Oudeyer Pierre-yves
- A Survey On Multi-behavior Sequential Recommendation Xiaoqing Chen, Zhitao Li, Weike Pan, Zhong Ming
- Tencentllmeval: A Hierarchical Evaluation Of Real-world Capabilities For Human-aligned Llms Xie Shuyi, Yao Wenlin, Dai Yong, Wang Shaobo, Zhou Donlin, Jin Lifeng, Feng Xinhua, Wei Pengzhi, Lin Yujie, Hu Zhichao, Yu Dong, Zhang Zhengyou, Nie Jing, Liu Yuhong
- Text2reward: Reward Shaping With Language Models For Reinforcement Learning Xie Tianbao, Zhao Siheng, Wu Chen Henry, Liu Yitao, Luo Qian, Zhong Victor, Yang Yanchao, Yu Tao
- Openagents: An Open Platform For Language Agents In The Wild Xie Tianbao, Zhou Fan, Cheng Zhoujun, Shi Peng, Weng Luoxuan, Liu Yitao, Hua Toh Jing, Zhao Junning, Liu Qian, Liu Che, Liu Leo Z., Xu Yiheng, Su Hongjin, Shin Dongchan, Xiong Caiming, Yu Tao
- Analysis Of The Reasoning With Redundant Information Provided Ability Of Large Language Models Xie Wenbei
- Olagpt: Empowering Llms With Human-like Problem-solving Abilities Xie Yuanzhen, Xie Tao, Lin Mingxiong, Wei Wentao, Li Chenglin, Kong Beibei, Chen Lei, Zhuo Chengxiang, Hu Bo, Li Zang
- Making Small Language Models Better Multi-task Learners With Mixture-of-task-adapters Xie Yukang, Wang Chengyu, Yan Junbing, Zhou Jiyong, Deng Feiqi, Huang Jun
- The Next Chapter: A Study Of Large Language Models In Storytelling Xie Zhuohan, Cohn Trevor, Lau Jey Han
- E4srec: An Elegant Effective Efficient Extensible Solution Of Large Language Models For Sequential Recommendation Xinhang Li, Chong Chen, Xiangyu Zhao, Yong Zhang, Chunxiao Xing
- Bridging Items And Language: A Transition Paradigm For Large Language Model-based Recommendation Xinyu Lin, Wenjie Wang, Yongqi Li, Fuli Feng, See-kiong Ng, Tat-seng Chua
- Doctorglm: Fine-tuning Your Chinese Doctor Is Not A Herculean Task Xiong Honglin, Wang Sheng, Zhu Yitao, Zhao Zihao, Liu Yuxiao, Huang Linlin, Wang Qian, Shen Dinggang
- Examining Inter-consistency Of Large Language Models Collaboration: An In-depth Analysis Via Debate Xiong Kai, Ding Xiao, Cao Yixin, Liu Ting, Qin Bing
- Effective Long-context Scaling Of Foundation Models Xiong Wenhan, Liu Jingyu, Molybog Igor, Zhang Hejia, Bhargava Prajjwal, Hou Rui, Martin Louis, Rungta Rashi, Sankararaman Karthik Abinav, Oguz Barlas, Khabsa Madian, Fang Han, Mehdad Yashar, Narang Sharan, Malik Kshitiz, Fan Angela, Bhosale Shruti, Edunov Sergey, Lewis Mike, Wang Sinong, Ma Hao
- Expertprompting: Instructing Large Language Models To Be Distinguished Experts Xu Benfeng, Yang An, Lin Junyang, Wang Quan, Zhou Chang, Zhang Yongdong, Mao Zhendong
- Gentopia: A Collaborative Platform For Tool-augmented Llms Xu Binfeng, Liu Xukun, Shen Hua, Han Zeyu, Li Yuhan, Yue Murong, Peng Zhiyuan, Liu Yuchen, Yao Ziyu, Xu Dongkuan
- Rewoo: Decoupling Reasoning From Observations For Efficient Augmented Language Models Xu Binfeng, Peng Zhiyuan, Lei Bowen, Mukherjee Subhabrata, Liu Yuchen, Xu Dongkuan
- Automatic Pair Construction For Contrastive Post-training Xu Canwen, Rosset Corby, Chau Ethan C., Del Corro Luciano, Mahajan Shweti, Mcauley Julian, Neville Jennifer, Awadallah Ahmed Hassan, Rao Nikhil
- Large Language Models For Generative Information Extraction: A Survey Xu Derong, Chen Wei, Peng Wenjun, Zhang Chao, Xu Tong, Zhao Xiangyu, Wu Xian, Zheng Yefeng, Wang Yang, Chen Enhong
- Symbol-llm: Towards Foundational Symbol-centric Interface For Large Language Models Xu Fangzhi, Wu Zhiyong, Sun Qiushi, Ren Siyu, Yuan Fei, Yuan Shuai, Lin Qika, Qiao Yu, Liu Jun
- A Paradigm Shift In Machine Translation: Boosting Translation Performance Of Large Language Models Xu Haoran, Kim Young Jin, Sharaf Amr, Awadalla Hany Hassan
- No Train Still Gain. Unleash Mathematical Reasoning Of Large Language Models With Monte Carlo Tree Search Guided By Energy Function Xu Haotian
- Recommender AI Agent: Integrating Large Language Models For Interactive Recommendations Xu Huang, Jianxun Lian, Yuxuan Lei, Jing Yao, Defu Lian, Xing Xie
- Penetrative AI: Making Llms Comprehend The Physical World Xu Huatao, Han Liying, Yang Qirui, Li Mo, Srivastava Mani
- Instructions As Backdoors: Backdoor Vulnerabilities Of Instruction Tuning For Large Language Models Xu Jiashu, Ma Mingyu Derek, Wang Fei, Xiao Chaowei, Chen Muhao
- U-llava: Unifying Multi-modal Tasks Via Large Language Model Xu Jinjin, Xu Liwu, Yang Yuzhe, Li Xiang, Wang Fanyi, Xie Yanchun, Huang Yi-jie, Li Yaqian
- Superclue: A Comprehensive Chinese Large Language Model Benchmark Xu Liang, Li Anqi, Zhu Lei, Xue Hang, Zhu Changtai, Zhao Kangkang, He Haonan, Zhang Xuanwei, Kang Qiyue, Lan Zhenzhong
- Sc-safety: A Multi-round Open-ended Question Adversarial Safety Benchmark For Large Language Models In Chinese Xu Liang, Zhao Kangkang, Zhu Lei, Xue Hang
- Cognitive Overload: Jailbreaking Large Language Models With Overloaded Logical Thinking Xu Nan, Wang Fei, Zhou Ben, Li Bang Zheng, Xiao Chaowei, Chen Muhao
- On The Tool Manipulation Capability Of Open-source Large Language Models Xu Qiantong, Hong Fenglu, Li Bo, Hu Changran, Chen Zhengyu, Zhang Jian
- Knowledge-infused Prompting: Assessing And Advancing Clinical Text Data Generation With Large Language Models Xu Ran, Cui Hejie, Yu Yue, Kan Xuan, Shi Wenqi, Zhuang Yuchen, Jin Wei, Ho Joyce, Yang Carl
- Fine-tuned Llms Know More, Hallucinate Less With Few-shot Sequence-to-sequence Semantic Parsing Over Wikidata Xu Silei, Liu Shicheng, Culhane Theo, Pertseva Elizaveta, Wu Meng-hsi, Semnani Sina J., Lam Monica S.
- Shattering The Agent-environment Interface For Fine-tuning Inclusive Language Models Xu Wanqiao, Dong Shi, Arumugam Dilip, Van Roy Benjamin
- Reasons To Reject? Aligning Language Models With Judgments Xu Weiwen, Cai Deng, Zhang Zhisong, Lam Wai, Shi Shuming
- Re-reading Improves Reasoning In Large Language Models Xu Xiaohan, Tao Chongyang, Shen Tao, Xu Can, Xu Hongbo, Long Guodong, Lou Jian-guang
- It Ain't That Bad: Understanding The Mysterious Performance Drop In OOD Generalization For Generative Transformer Models Xu Xingcheng, Pan Zihao, Zhang Haipeng, Yang Yanqing
- Mental-llm: Leveraging Large Language Models For Mental Health Prediction Via Online Text Data Xu Xuhai, Yao Bingsheng, Dong Yuanzhe, Gabriel Saadia, Yu Hong, Hendler James, Ghassemi Marzyeh, Dey Anind K., Wang Dakuo
- Llms And The Abstraction And Reasoning Corpus: Successes, Failures, And The Importance Of Object-based Representations Xu Yudong, Li Wenhao, Vaezipoor Pashootan, Sanner Scott, Khalil Elias B.
- How Robust Is GPT-3.5 To Predecessors? A Comprehensive Study On Language Understanding Tasks Xuanting Chen, Junjie Ye, Can Zu, Nuo Xu, Rui Zheng, Minlong Peng, Jie Zhou, Tao Gui, Qi Zhang, Xuanjing Huang
- Representation Learning With Large Language Models For Recommendation Xubin Ren, Wei Wei, Lianghao Xia, Lixin Su, Suqi Cheng, Junfeng Wang, Dawei Yin, Chao Huang
- Bias And Fairness In Chatbots: An Overview Xue Jintang, Wang Yun-cheng, Wei Chengwei, Liu Xiaofeng, Woo Jonghye, Kuo C.-c. Jay
- Occuquest: Mitigating Occupational Bias For Inclusive Large Language Models Xue Mingfeng, Liu Dayiheng, Yang Kexin, Dong Guanting, Lei Wenqiang, Yuan Zheng, Zhou Chang, Zhou Jingren
- Parameter-efficient Tuning Helps Language Model Alignment Xue Tianci, Wang Ziqi, Ji Heng
- RCOT: Detecting And Rectifying Factual Inconsistency In Reasoning By Reversing Chain-of-thought Xue Tianci, Wang Ziqi, Wang Zhenhailong, Han Chi, Yu Pengfei, Ji Heng
- Scaling Evidence-based Instructional Design Expertise Through Large Language Models Yadav Gautam
- Domain-specific Chatbots For Science Using Embeddings Yager Kevin G.
- Optimizing Machine Translation Through Prompt Engineering: An Investigation Into Chatgpt's Customizability Yamada Masaru
- Evaluating Spatial Understanding Of Large Language Models Yamada Yutaro, Bao Yihan, Lampinen Andrew K., Kasai Jungo, Yildirim Ilker
- Predicting Text Preference Via Structured Comparative Reasoning Yan Jing Nathan, Liu Tianqi, Chiu Justin T, Shen Jiaming, Qin Zhen, Yu Yue, Zhao Yao, Lakshmanan Charu, Kurzion Yair, Rush Alexander M., Liu Jialu, Bendersky Michael
- Voila-a: Aligning Vision-language Models With User's Gaze Attention Yan Kun, Ji Lei, Wang Zeyu, Wang Yuntao, Duan Nan, Ma Shuai
- Practical And Ethical Challenges Of Large Language Models In Education: A Systematic Scoping Review Yan Lixiang, Sha Lele, Zhao Linxuan, Li Yuheng, Martinez-maldonado Roberto, Chen Guanliang, Li Xinyu, Jin Yueqiao, Gašević Dragan
- Codescope: An Execution-based Multilingual Multitask Multidimensional Benchmark For Evaluating Llms On Code Understanding And Generation Yan Weixiang, Liu Haitian, Wang Yunkun, Li Yunzhe, Chen Qian, Wang Wen, Lin Tingyu, Zhao Weishan, Zhu Li, Sundaram Hari, Deng Shuiguang
- Multimodal Chatgpt For Medical Applications: An Experimental Study Of GPT-4V Yan Zhiling, Zhang Kai, Zhou Rong, He Lifang, Li Xiang, Sun Lichao
- LACMA: Language-aligning Contrastive Learning With Meta-actions For Embodied Instruction Following Yang Cheng-fu, Chen Yen-chun, Yang Jianwei, Dai Xiyang, Yuan Lu, Wang Yu-chiang Frank, Chang Kai-wei
- Can You Follow Me? Testing Situational Understanding In Chatgpt Yang Chenghao, Ettinger Allyson
- Beyond Testers' Biases: Guiding Model Testing With Knowledge Bases Using Llms Yang Chenyang, Rustogi Rishabh, Brower-sinning Rachel, Lewis Grace A., Kästner Christian, Wu Tongshuang
- Knowledge-prompted Estimator: A Novel Approach To Explainable Machine Translation Assessment Yang Hao, Zhang Min, Tao Shimin, Wang Minghan, Wei Daimeng, Jiang Yanfei
- PRCA: Fitting Black-box Large Language Models For Retrieval Question Answering Via Pluggable Reward-driven Contextual Adapter Yang Haoyan, Li Zhitao, Zhang Yong, Wang Jianzong, Cheng Ning, Li Ming, Xiao Jing
- Auto-gpt For Online Decision Making: Benchmarks And Additional Opinions Yang Hui, Yue Sifu, He Yunzhong
- Iterative Forward Tuning Boosts In-context Learning In Language Models Yang Jiaxi, Hui Binyuan, Yang Min, Wang Bailin, Li Bowen, Li Binhua, Huang Fei, Li Yongbin
- Unified Language-vision Pretraining In LLM With Dynamic Discrete Visual Tokenization Yang Jin, Kun Xu, Kun Xu, Liwei Chen, Chao Liao, Jianchao Tan, Quzhe Huang, Bin Chen, Chenyi Lei, An Liu, Chengru Song, Xiaoqiang Lei, Di Zhang, Wenwu Ou, Kun Gai, Yadong Mu
- Revisiting Parallel Context Windows: A Frustratingly Simple Alternative And Chain-of-thought Deterioration Yang Kejuan, Liu Xiao, Men Kaiwen, Zeng Aohan, Dong Yuxiao, Tang Jie
- Supervised Knowledge Makes Large Language Models Better In-context Learners Yang Linyi, Zhang Shuibai, Yu Zhuohao, Bao Guangsheng, Wang Yidong, Wang Jindong, Xu Ruochen, Ye Wei, Xie Xing, Chen Weizhu, Zhang Yue
- Progressive Evidence Refinement For Open-domain Multimodal Retrieval Question Answering Yang Shuwen, Wu Anran, Wu Xingjiao, Xiao Luwei, Ma Tianlong, Jin Cheng, He Liang
- Zhongjing: Enhancing The Chinese Medical Capabilities Of Large Language Model Through Expert Feedback And Real-world Multi-turn Dialogue Yang Songhua, Zhao Hanjie, Zhu Senbin, Zhou Guangyu, Xu Hongfei, Jia Yuxiang, Zan Hongying
- Bigtranslate: Augmenting Large Language Models With Multilingual Translation Capability Over 100 Languages Yang Wen, Li Chong, Zhang Jiajun, Zong Chengqing
- Enabling Large Language Models To Learn From Rules Yang Wenkai, Lin Yankai, Zhou Jie, Wen Jirong
- Lever LM: Configuring In-context Sequence To Lever Large Vision Language Models Yang Xu, Peng Yingzhe, Ma Haoxuan, Xu Shuo, Zhang Chi, Han Yucheng, Zhang Hanwang
- On The Calibration Of Multilingual Question Answering Llms Yang Yahan, Dan Soham, Roth Dan, Lee Insup
- Harnessing The Power Of Large Language Models For Natural Language To First-order Logic Translation Yang Yuan, Xiong Siheng, Payani Ali, Shareghi Ehsan, Fekri Faramarz
- GPT Can Solve Mathematical Problems Without A Calculator Yang Zhen, Ding Ming, Lv Qingsong, Jiang Zhihuan, He Zehai, Guo Yuyi, Bai Jinfeng, Tang Jie
- Surpassing GPT-4 Medical Coding With A Two-stage Approach Yang Zhichao, Batra Sanjit Singh, Stremmel Joel, Halperin Eran
- Re-vilm: Retrieval-augmented Visual Language Model For Zero And Few-shot Image Captioning Yang Zhuolin, Ping Wei, Liu Zihan, Korthikanti Vijay, Nie Weili, Huang De-an, Fan Linxi, Yu Zhiding, Lan Shiyi, Li Bo, Liu Ming-yu, Zhu Yuke, Shoeybi Mohammad, Catanzaro Bryan, Xiao Chaowei, Anandkumar Anima
- Knowledge Plugins: Enhancing Large Language Models For Domain-specific Recommendations Yao Jing, Xu Wei, Lian Jianxun, Wang Xiting, Yi Xiaoyuan, Xie Xing
- From Instructions To Intrinsic Human Values -- A Survey Of Alignment Goals For Big Models Yao Jing, Yi Xiaoyuan, Wang Xiting, Wang Jindong, Xie Xing
- Embodiedgpt: Vision-language Pre-training Via Embodied Chain Of Thought Yao Mu, Qinglong Zhang, Mengkang Hu, Wenhai Wang, Mingyu Ding, Jun Jin, Bin Wang, Jifeng Dai, Yu Qiao, Ping Luo
- Retroformer: Retrospective Large Language Agents With Policy Gradient Optimization Yao Weiran, Heinecke Shelby, Niebles Juan Carlos, Liu Zhiwei, Feng Yihao, Xue Le, Murthy Rithesh, Chen Zeyuan, Zhang Jianguo, Arpit Devansh, Xu Ran, Mui Phil, Wang Huan, Xiong Caiming, Savarese Silvio
- A Survey On Large Language Model (LLM) Security And Privacy: The Good, The Bad, And The Ugly Yao Yifan, Duan Jinhao, Xu Kaidi, Cai Yuanfang, Sun Zhibo, Zhang Yue
- Editing Large Language Models: Problems, Methods, And Opportunities Yao Yunzhi, Wang Peng, Tian Bozhong, Cheng Siyuan, Li Zhoubo, Deng Shumin, Chen Huajun, Zhang Ningyu
- Deepspeed-visualchat: Multi-round Multi-image Interleave Chat Via Multi-modal Causal Attention Yao Zhewei, Wu Xiaoxia, Li Conglong, Zhang Minjia, Qin Heyang, Ruwase Olatunji, Awan Ammar Ahmad, Rajbhandari Samyam, He Yuxiong
- Ureader: Universal Ocr-free Visually-situated Language Understanding With Multimodal Large Language Model Ye Jiabo, Hu Anwen, Xu Haiyang, Ye Qinghao, Yan Ming, Xu Guohai, Li Chenliang, Tian Junfeng, Qian Qi, Zhang Ji, Jin Qin, He Liang, Lin Xin Alex, Huang Fei
- Qilin-med: Multi-stage Knowledge Injection Advanced Medical Large Language Model Ye Qichen, Liu Junling, Chong Dading, Zhou Peilin, Hua Yining, Liu Fenglin, Cao Meng, Wang Ziming, Cheng Xuxin, Lei Zhu, Guo Zhenhua
- Effective Large Language Model Adaptation For Improved Grounding And Citation Generation Ye Xi, Sun Ruoxi, Arik Sercan Ö., Pfister Tomas
- Rational Decision-making Agent With Internalized Utility Judgment Ye Yining, Cong Xin, Tian Shizuo, Qin Yujia, Liu Chong, Lin Yankai, Liu Zhiyuan, Sun Maosong
- ASPEN: High-throughput Lora Fine-tuning Of Large Language Models With A Single GPU Ye Zhengmao, Li Dengchun, Tian Jingqi, Lan Tingfeng, Zuo Jie, Duan Lei, Lu Hui, Jiang Yexi, Sha Jian, Zhang Ke, Tang Mingjie
- A Multitask, Multilingual, Multimodal Evaluation Of Chatgpt On Reasoning, Hallucination, And Interactivity Yejin Bang, Samuel Cahyawijaya, Nayeon Lee, Wenliang Dai, Dan Su, Bryan Wilie, Holy Lovenia, Ziwei Ji, Tiezheng Yu, Willy Chung, Quyet V. Do, Yan Xu, Pascale Fung
- Benchmarking And Defending Against Indirect Prompt Injection Attacks On Large Language Models Yi Jingwei, Xie Yueqi, Zhu Bin, Kiciman Emre, Sun Guangzhong, Xie Xing, Wu Fangzhao
- Ctrlstruct: Dialogue Structure Learning For Open-domain Response Generation Yin Congchi, Li Piji, Ren Zhaochun
- Agent Lumos: Unified And Modular Training For Open-source Language Agents Yin Da, Brahman Faeze, Ravichander Abhilasha, Chandu Khyathi, Chang Kai-wei, Choi Yejin, Lin Bill Yuchen
- Dynosaur: A Dynamic Growth Paradigm For Instruction-tuning Data Curation Yin Da, Liu Xiao, Yin Fan, Zhong Ming, Bansal Hritik, Han Jiawei, Chang Kai-wei
- Did You Read The Instructions? Rethinking The Effectiveness Of Task Definitions In Instruction Learning Yin Fan, Vig Jesse, Laban Philippe, Joty Shafiq, Xiong Caiming, Wu Chien-sheng Jason
- Modulora: Finetuning 2-bit Llms On Consumer Gpus By Integrating With Modular Quantizers Yin Junjie, Dong Jiahao, Wang Yingheng, De Sa Christopher, Kuleshov Volodymyr
- LAMM: Language-assisted Multi-modal Instruction-tuning Dataset, Framework, And Benchmark Yin Zhenfei, Wang Jiong, Cao Jianjian, Shi Zhelun, Liu Dingning, Li Mukai, Sheng Lu, Bai Lei, Huang Xiaoshui, Wang Zhiyong, Shao Jing, Ouyang Wanli
- Alignment Is Not Sufficient To Prevent Large Language Models From Generating Harmful Information: A Psychoanalytic Perspective Yin Zi, Ding Wei, Liu Jia
- Intuitive Or Dependent? Investigating Llms' Behavior Style To Conflicting Prompts Ying Jiahao, Cao Yixin, Xiong Kai, He Yidong, Cui Long, Liu Yongbin
- Analyzing And Mitigating Object Hallucination In Large Vision-language Models Yiyang Zhou, Chenhang Cui, Jaehong Yoon, Linjun Zhang, Zhun Deng, Chelsea Finn, Mohit Bansal, Huaxiu Yao
- Prompting Multilingual Large Language Models To Generate Code-mixed Texts: The Case Of South East Asian Languages Yong Zheng-xin, Zhang Ruochen, Forde Jessica Zosa, Wang Skyler, Subramonian Arjun, Lovenia Holy, Cahyawijaya Samuel, Winata Genta Indra, Sutawika Lintang, Cruz Jan Christian Blaise, Tan Yin Lin, Phan Long, Garcia Rowena, Solorio Thamar, Aji Alham Fikri
- Asyncmld: Asynchronous Multi-llm Framework For Dialogue Recommendation System Yoshimaru Naoki, Okuma Motoharu, Iio Takamasa, Hatano Kenji
- Fundamental Limitations Of Alignment In Large Language Models Yotam Wolf, Noam Wies, Oshri Avnery, Yoav Levine, Amnon Shashua
- \(P^{3}O\): Transferring Visual Representations For Reinforcement Learning Via Prompting You Guoliang, Chu Xiaomeng, Duan Yifan, Peng Jie, Ji Jianmin, Zhang Yu, Zhang Yanyong
- Merlin: Empowering Multimodal Llms With Foresight Minds Yu En, Zhao Liang, Wei Yana, Yang Jinrong, Wu Dongming, Kong Lingyu, Wei Haoran, Wang Tiancai, Ge Zheng, Zhang Xiangyu, Tao Wenbing
- OVM, Outcome-supervised Value Models For Planning In Mathematical Reasoning Yu Fei, Gao Anningzhe, Wang Benyou
- Kola: Carefully Benchmarking World Knowledge Of Large Language Models Yu Jifan, Wang Xiaozhi, Tu Shangqing, Cao Shulin, Zhang-li Daniel, Lv Xin, Peng Hao, Yao Zijun, Zhang Xiaohan, Li Hanming, Li Chunyang, Zhang Zheyuan, Bai Yushi, Liu Yantao, Xin Amy, Lin Nianyi, Yun Kaifeng, Gong Linlu, Chen Jianhui, Wu Zhili, Qi Yunjia, Li Weikai, Guan Yong, Zeng Kaisheng, Qi Ji, Jin Hailong, Liu Jinxin, Gu Yu, Yao Yuan, Ding Ning, Hou Lei, Liu Zhiyuan, Xu Bin, Tang Jie, Li Juanzi
- MELO: Enhancing Model Editing With Neuron-indexed Dynamic Lora Yu Lang, Chen Qin, Zhou Jie, He Liang
- MEGABYTE: Predicting Million-byte Sequences With Multiscale Transformers Yu Lili, Simig Dániel, Flaherty Colin, Aghajanyan Armen, Zettlemoyer Luke, Lewis Mike
- Stateful Large Language Model Serving With Pensieve Yu Lingfan, Li Jinyang
- Metamath: Bootstrap Your Own Mathematical Questions For Large Language Models Yu Longhui, Jiang Weisen, Shi Han, Yu Jincheng, Liu Zhengying, Zhang Yu, Kwok James T., Li Zhenguo, Weller Adrian, Liu Weiyang
- Characterizing Mechanisms For Factual Recall In Language Models Yu Qinan, Merullo Jack, Pavlick Ellie
- Constructive Large Language Models Alignment With Diverse Feedback Yu Tianshu, Lin Ting-en, Wu Yuchuan, Yang Min, Huang Fei, Li Yongbin
- Chain-of-note: Enhancing Robustness In Retrieval-augmented Language Models Yu Wenhao, Zhang Hongming, Pan Xiaoman, Ma Kaixin, Wang Hongwei, Yu Dong
- Improving Language Models Via Plug-and-play Retrieval Feedback Yu Wenhao, Zhang Zhihan, Liang Zhenwen, Jiang Meng, Sabharwal Ashish
- Prompt-based Monte-carlo Tree Search For Goal-oriented Dialogue Policy Planning Yu Xiao, Chen Maximillian, Yu Zhou
- Reeval: Automatic Hallucination Evaluation For Retrieval-augmented Large Language Models Via Transferable Adversarial Attacks Yu Xiaodong, Cheng Hao, Liu Xiaodong, Roth Dan, Gao Jianfeng
- Finmem: A Performance-enhanced LLM Trading Agent With Layered Memory And Character Design Yu Yangyang, Li Haohang, Chen Zhi, Jiang Yuechen, Li Yang, Zhang Denghui, Liu Rong, Suchow Jordan W., Khashanah Khaldoun
- Large Language Model As Attributed Training Data Generator: A Tale Of Diversity And Bias Yu Yue, Zhuang Yuchen, Zhang Jieyu, Meng Yu, Ratner Alexander, Krishna Ranjay, Shen Jiaming, Zhang Chao
- Fine-tuning Language Models With Generative Adversarial Reward Modelling Yu Zhang Ze, Jaw Lau Jia, Hui Zhang, Low Bryan Kian Hsiang
- Prophet: Prompting Large Language Models With Complementary Answer Heuristics For Knowledge-based Visual Question Answering Yu Zhou, Ouyang Xuecheng, Shao Zhenwei, Wang Meng, Yu Jun
- How Vocabulary Sharing Facilitates Multilingualism In Llama? Yuan Fei, Yuan Shuai, Wu Zhiyong, Li Lei
- Large Language Models Illuminate A Progressive Pathway To Artificial Healthcare Assistant: A Review Yuan Mingze, Bao Peng, Yuan Jiajia, Shen Yunhao, Chen Zifan, Xie Yi, Zhao Jie, Chen Yang, Zhang Li, Shen Lin, Dong Bin
- Beneath Surface Similarity: Large Language Models Make Reasonable Scientific Analogies After Structure Abduction Yuan Siyu, Chen Jiangjie, Ge Xuyang, Xiao Yanghua, Yang Deqing
- Artgpt-4: Towards Artistic-understanding Large Vision-language Models With Enhanced Adapter Yuan Zhengqing, He Yunhong, Wang Kun, Ye Yanfang, Sun Lichao
- Large Language Model Cascades With Mixture Of Thoughts Representations For Cost-efficient Reasoning Yue Murong, Zhao Jie, Zhang Min, Du Liang, Yao Ziyu
- Plan, Eliminate, And Track -- Language Models Are Good Teachers For Embodied Agents Yue Wu, So Yeon Min, Yonatan Bisk, Ruslan Salakhutdinov, Amos Azaria, Yuanzhi Li, Tom Mitchell, Shrimai Prabhumoye
- Tool Learning With Foundation Models Yujia Qin, Shengding Hu, Yankai Lin, Weize Chen, Ning Ding, Ganqu Cui, Zheni Zeng, Yufei Huang, Chaojun Xiao, Chi Han, Yi Ren Fung, Yusheng Su, Huadong Wang, Cheng Qian, Runchu Tian, Kunlun Zhu, Shihao Liang, Xingyu Shen, Bokai Xu, Zhen Zhang, Yining Ye, Bowen Li, Ziwei Tang, Jing Yi, Yuzhang Zhu, Zhenning Dai, Lan Yan, Xin Cong, Yaxi Lu, Weilin Zhao, Yuxiang Huang, Junxi Yan, Xu Han, Xian Sun, Dahai Li, Jason Phang, Cheng Yang, Tongshuang Wu, Heng Ji, Zhiyuan Liu, Maosong Sun
- Attention Satisfies: A Constraint-satisfaction Lens On Factual Errors Of Language Models Yuksekgonul Mert, Chandrasekaran Varun, Jones Erik, Gunasekar Suriya, Naik Ranjita, Palangi Hamid, Kamar Ece, Nushi Besmira
- Chat-rec: Towards Interactive And Explainable Llms-augmented Recommender System Yunfan Gao, Tao Sheng, Youlin Xiang, Yun Xiong, Haofen Wang, Jiawei Zhang
- Retrieval-augmented Generation For Large Language Models: A Survey Yunfan Gao, Yun Xiong, Xinyu Gao, Kangxiang Jia, Jinliu Pan, Yuxi Bi, Yi Dai, Jiawei Sun, Meng Wang, Haofen Wang
- Pre-training To Learn In Context Yuxian Gu, Li Dong, Furu Wei, Minlie Huang
- Building Trust In Conversational AI: A Comprehensive Review And Solution Architecture For Explainable, Privacy-aware Systems Using Llms And Knowledge Graph Zafar Ahtsham, Parthasarathy Venkatesh Balavadhani, Van Chan Le, Shahid Saad, Khan Aafaq Iqbal, Shahid Arsalan
- On Diversified Preferences Of Large Language Model Alignment Zeng Dun, Dai Yong, Cheng Pengyu, Wang Longyue, Hu Tianhao, Chen Wanshun, Du Nan, Xu Zenglin
- Evaluating Large Language Models At Evaluating Instruction Following Zeng Zhiyuan, Yu Jiatong, Gao Tianyu, Meng Yu, Goyal Tanya, Chen Danqi
- Is Chatgpt A Good Sentiment Analyzer? A Preliminary Study Zengzhi Wang, Qiming Xie, Yi Feng, Zixiang Ding, Zinong Yang, Rui Xia
- Conditionally Combining Robot Skills Using Large Language Models Zentner K. R., Julian Ryan, Ichter Brian, Sukhatme Gaurav S.
- Building Open-ended Embodied Agent Via Language-policy Bidirectional Adaptation Zhai Shaopeng, Wang Jie, Zhang Tianyi, Huang Fuxian, Zhang Qi, Zhou Ming, Hou Jing, Qiao Yu, Liu Yu
- Investigating The Catastrophic Forgetting In Multimodal Large Language Models Zhai Yuexiang, Tong Shengbang, Li Xiao, Cai Mu, Qu Qing, Lee Yong Jae, Ma Yi
- G3detector: General Gpt-generated Text Detector Zhan Haolan, He Xuanli, Xu Qiongkai, Wu Yuxiang, Stenetorp Pontus
- Removing RLHF Protections In GPT-4 Via Fine-tuning Zhan Qiusi, Fang Richard, Bindu Rohan, Gupta Akul, Hashimoto Tatsunori, Kang Daniel
- Deceptive AI Ecosystems: The Case Of Chatgpt Zhan Xiao, Xu Yifan, Sarkadi Stefan
- Evaluating And Improving Tool-augmented Computation-intensive Math Reasoning Zhang Beichen, Zhou Kun, Wei Xilin, Zhao Wayne Xin, Sha Jing, Wang Shijin, Wen Ji-rong
- User-controlled Knowledge Fusion In Large Language Models: Balancing Creativity And Hallucination Zhang Chen
- A Comprehensive Analysis Of The Effectiveness Of Large Language Models As Automatic Dialogue Evaluators Zhang Chen, D'haro Luis Fernando, Chen Yiming, Zhang Malu, Li Haizhou
- Task-agnostic Distillation Of Encoder-decoder Language Models Zhang Chen, Yang Yang, Wang Jingang, Song Dawei
- Revisiting Block-based Quantisation: What Is Important For Sub-8-bit LLM Inference? Zhang Cheng, Cheng Jianyi, Shumailov Ilia, Constantinides George A., Zhao Yiren
- Benchmarking And In-depth Performance Study Of Large Language Models On Habana Gaudi Processors Zhang Chengming, Sun Baixi, Yu Xiaodong, Xie Zhen, Zheng Weijian, Iskra Kamil, Beckman Pete, Tao Dingwen
- Taxonomy-based Checklist For Large Language Model Evaluation Zhang Damin
- Mobile-env: Building Qualified Evaluation Benchmarks For LLM-GUI Interaction Zhang Danyang, Shen Zhennan, Xie Rui, Zhang Situo, Xie Tianbao, Zhao Zihan, Chen Siyuan, Chen Lu, Xu Hongshen, Cao Ruisheng, Yu Kai
- On The Safety Of Open-sourced Large Language Models: Does Alignment Really Prevent Them From Being Misused? Zhang Hangfan, Guo Zhimeng, Zhu Huaisheng, Cao Bochuan, Lin Lu, Jia Jinyuan, Chen Jinghui, Wu Dinghao
- XATU: A Fine-grained Instruction-based Benchmark For Explainable Text Updates Zhang Haopeng, Iso Hayate, Gurajada Sairam, Bhutani Nikita
- Glm-dialog: Noise-tolerant Pre-training For Knowledge-grounded Dialogue Generation Zhang Jing, Zhang Xiaokang, Zhang-li Daniel, Yu Jifan, Yao Zijun, Ma Zeyao, Xu Yiqi, Wang Haohua, Zhang Xiaohan, Lin Nianyi, Lu Sunrui, Li Juanzi, Tang Jie
- Composing Parameter-efficient Modules With Arithmetic Operations Zhang Jinghan, Chen Shiqi, Liu Junteng, He Junxian
- The Potential And Pitfalls Of Using A Large Language Model Such As Chatgpt Or GPT-4 As A Clinical Assistant Zhang Jingqing, Sun Kai, Jagadeesh Akshay, Ghahfarokhi Mahta, Gupta Deepa, Gupta Ashok, Gupta Vibhor, Guo Yike
- Crash: Clustering, Removing, And Sharing Enhance Fine-tuning Without Full Large Language Model Zhang Kaiyan, Ding Ning, Qi Biqing, Zhu Xuekai, Long Xinwei, Zhou Bowen
- A Two-stage Adaptation Of Large Language Models For Text Ranking Zhang Longhui, Zhang Yanzhao, Long Dingkun, Xie Pengjun, Zhang Meishan, Zhang Min
- Retrieve Anything To Augment Large Language Models Zhang Peitian, Xiao Shitao, Liu Zheng, Dou Zhicheng, Nie Jian-yun
- Ask An Expert: Leveraging Language Models To Improve Strategic Reasoning In Goal-oriented Dialogue Models Zhang Qiang, Naradowsky Jason, Miyao Yusuke
- Knowgpt: Knowledge Graph Based Prompting For Large Language Models Zhang Qinggang, Dong Junnan, Chen Hao, Zha Daochen, Yu Zailiang, Huang Xiao
- Tell Your Model Where To Attend: Post-hoc Attention Steering For Llms Zhang Qingru, Singh Chandan, Liu Liyuan, Liu Xiaodong, Yu Bin, Gao Jianfeng, Zhao Tuo
- The Wisdom Of Hindsight Makes Language Models Better Instruction Followers Zhang Tianjun, Liu Fangchen, Wong Justin, Abbeel Pieter, Gonzalez Joseph E.
- History-aware Hierarchical Transformer For Multi-session Open-domain Dialogue System Zhang Tong, Liu Yong, Li Boyang, Zeng Zhiwei, Wang Pengwei, You Yuan, Miao Chunyan, Cui Lizhen
- Adarefiner: Refining Decisions Of Language Models With Adaptive Feedback Zhang Wanpeng, Lu Zongqing
- M3exam: A Multilingual, Multimodal, Multilevel Benchmark For Examining Large Language Models Zhang Wenxuan, Aljunied Sharifah Mahani, Gao Chang, Chia Yew Ken, Bing Lidong
- Bridging The Information Gap Between Domain-specific Model And General LLM For Personalized Recommendation Zhang Wenxuan, Liu Hongzhi, Du Yingpeng, Zhu Chen, Song Yang, Zhu Hengshu, Wu Zhonghai
- EHR Interaction Between Patients And AI: Noteaid EHR Interaction Zhang Xiaocheng, Yao Zonghai, Yu Hong
- Large Language Models In Medical Term Classification And Unexpected Misalignment Between Response And Reasoning Zhang Xiaodan, Vemulapalli Sandeep, Talukdar Nabasmita, Ahn Sumyeong, Wang Jiankun, Meng Han, Murtaza Sardar Mehtab Bin, Dave Aakash Ajay, Leshchiner Dmitry, Joseph Dimitri F., Witteveen-lane Martin, Chesla Dave, Zhou Jiayu, Chen Bin
- PMC-VQA: Visual Instruction Tuning For Medical Visual Question Answering Zhang Xiaoman, Wu Chaoyi, Zhao Ziheng, Lin Weixiong, Zhang Ya, Wang Yanfeng, Xie Weidi
- Gpt-4v(ision) As A Generalist Evaluator For Vision-language Tasks Zhang Xinlu, Lu Yujie, Wang Weizhi, Yan An, Yan Jun, Qin Lianke, Wang Heng, Yan Xifeng, Wang William Yang, Petzold Linda Ruth
- CGCE: A Chinese Generative Chat Evaluation Benchmark For General And Financial Domains Zhang Xuanyu, Li Bingbing, Yang Qing
- Llavar: Enhanced Visual Instruction Tuning For Text-rich Image Understanding Zhang Yanzhe, Zhang Ruiyi, Gu Jiuxiang, Zhou Yufan, Lipka Nedim, Yang Diyi, Sun Tong
- Making Large Language Models Perform Better In Knowledge Graph Completion Zhang Yichi, Chen Zhuo, Guo Lingbing, Xu Yajing, Zhang Wen, Chen Huajun
- Meta Prompting For AI Systems Zhang Yifan, Yuan Yang, Yao Andrew Chi-chih
- Llamavae: Guiding Large Language Model Generation Via Continuous Latent Sentence Spaces Zhang Yingji, Carvalho Danilo S., Pratt-hartmann Ian, Freitas André
- Probing The Multi-turn Planning Capabilities Of Llms Via 20 Question Games Zhang Yizhe, Lu Jiarui, Jaitly Navdeep
- Siren's Song In The AI Ocean: A Survey On Hallucination In Large Language Models Zhang Yue, Li Yafu, Cui Leyang, Cai Deng, Liu Lemao, Fu Tingchen, Huang Xinting, Zhao Enbo, Zhang Yu, Chen Yulong, Wang Longyue, Luu Anh Tuan, Bi Wei, Shi Freda, Shi Shuming
- Mani-gpt: A Generative Model For Interactive Robotic Manipulation Zhang Zhe, Chai Wei, Wang Jiankun
- VISAR: A Human-ai Argumentative Writing Assistant With Visual Programming And Rapid Draft Prototyping Zhang Zheng, Gao Jie, Dhaliwal Ranjodh Singh, Li Toby Jia-jun
- Students' Perceptions And Preferences Of Generative Artificial Intelligence Feedback For Programming Zhang Zhengdong, Dong Zihan, Shi Yang, Matsuda Noboru, Price Thomas, Xu Dongkuan
- H\(_2\)O: Heavy-hitter Oracle For Efficient Generative Inference Of Large Language Models Zhang Zhenyu, Sheng Ying, Zhou Tianyi, Chen Tianlong, Zheng Lianmin, Cai Ruisi, Song Zhao, Tian Yuandong, Ré Christopher, Barrett Clark, Wang Zhangyang, Chen Beidi
- Defending Large Language Models Against Jailbreaking Attacks Through Goal Prioritization Zhang Zhexin, Yang Junxiao, Ke Pei, Mi Fei, Wang Hongning, Huang Minlie
- Reinforced UI Instruction Grounding: Towards A Generic UI Task Automation API Zhang Zhizheng, Xie Wenxuan, Zhang Xiaoyi, Lu Yan
- How Do Large Language Models Capture The Ever-changing World Knowledge? A Review Of Recent Advances Zhang Zihan, Fang Meng, Chen Ling, Namazi-rad Mohammad-reza, Wang Jun
- LLM4DV: Using Large Language Models For Hardware Test Stimuli Generation Zhang Zixi, Chadwick Greg, Mcnally Hugo, Zhao Yiren, Mullins Robert
- Large Language Models As Zero-shot Conversational Recommenders Zhankui He, Zhouhang Xie, Rahul Jha, Harald Steck, Dawen Liang, Yesu Feng, Bodhisattwa Prasad Majumder, Nathan Kallus, Julian Mcauley
- Explainability For Large Language Models: A Survey Zhao Haiyan, Chen Hanjie, Yang Fan, Liu Ninghao, Deng Huiqi, Cai Hengyi, Wang Shuaiqiang, Yin Dawei, Du Mengnan
- Automatic Model Selection With Large Language Models For Reasoning Zhao James Xu, Xie Yuxi, Kawaguchi Kenji, He Junxian, Xie Michael Qizhe
- Chatspot: Bootstrapping Multimodal Llms Via Precise Referring Instruction Tuning Zhao Liang, Yu En, Ge Zheng, Yang Jinrong, Wei Haoran, Zhou Hongyu, Sun Jianjian, Peng Yuang, Dong Runpei, Han Chunrui, Zhang Xiangyu
- Retrieving Multimodal Information For Augmented Generation: A Survey Zhao Ruochen, Chen Hailin, Wang Weishi, Jiao Fangkai, Do Xuan Long, Qin Chengwei, Ding Bosheng, Guo Xiaobao, Li Minzhi, Li Xingxuan, Joty Shafiq
- Group Preference Optimization: Few-shot Alignment Of Large Language Models Zhao Siyan, Dang John, Grover Aditya
- DIVKNOWQA: Assessing The Reasoning Ability Of Llms Via Open-domain Question Answering Over Knowledge Base And Text Zhao Wenting, Liu Ye, Niu Tong, Wan Yao, Yu Philip S., Joty Shafiq, Zhou Yingbo, Yavuz Semih
- Babystories: Can Reinforcement Learning Teach Baby Language Models To Write Better Stories? Zhao Xingmeng, Wang Tongnian, Osborn Sheri, Rios Anthony
- Chat With The Environment: Interactive Multimodal Perception Using Large Language Models Zhao Xufeng, Li Mengdi, Weber Cornelius, Hafez Muhammad Burhan, Wermter Stefan
- Mind Vs. Mouth: On Measuring Re-judge Inconsistency Of Social Bias In Large Language Models Zhao Yachao, Wang Bo, Zhao Dongming, Huang Kun, Wang Yan, He Ruifang, Hou Yuexian
- Slic-hf: Sequence Likelihood Calibration With Human Feedback Zhao Yao, Joshi Rishabh, Liu Tianqi, Khalman Misha, Saleh Mohammad, Liu Peter J.
- Lookahead: An Inference Acceleration Framework For Large Language Model With Lossless Generation Accuracy Zhao Yao, Xie Zhitian, Liang Chen, Zhuang Chenyi, Gu Jinjie
- Investigating Table-to-text Generation Capabilities Of Llms In Real-world Information Seeking Scenarios Zhao Yilun, Zhang Haowei, Si Shengyun, Nan Linyong, Tang Xiangru, Cohan Arman
- Chatbridge: Bridging Modalities With Large Language Model As A Language Catalyst Zhao Zijia, Guo Longteng, Yue Tongtian, Chen Sihan, Shao Shuai, Zhu Xinxin, Yuan Zehuan, Liu Jing
- Lyra: Orchestrating Dual Correction In Automated Theorem Proving Zheng Chuanyang, Wang Haiming, Xie Enze, Liu Zhengying, Sun Jiankai, Xin Huajian, Shen Jianhao, Li Zhenguo, Li Yu
- Lmsys-chat-1m: A Large-scale Real-world LLM Conversation Dataset Zheng Lianmin, Chiang Wei-lin, Sheng Ying, Li Tianle, Zhuang Siyuan, Wu Zhanghao, Zhuang Yonghao, Li Zhuohan, Lin Zi, Xing Eric P., Gonzalez Joseph E., Stoica Ion, Zhang Hao
- Memorycompanion: A Smart Healthcare Solution To Empower Efficient Alzheimer's Care Via Unleashing Generative AI Zheng Lifei, Heo Yeonie, Fang Yi
- Improving Generalization Of Alignment With Human Preferences Through Group Invariant Learning Zheng Rui, Shen Wei, Hua Yuan, Lai Wenbin, Dou Shihan, Zhou Yuhao, Xi Zhiheng, Wang Xiao, Huang Haoran, Gui Tao, Zhang Qi, Huang Xuanjing
- Gpt-fathom: Benchmarking Large Language Models To Decipher The Evolutionary Path Towards GPT-4 And Beyond Zheng Shen, Zhang Yuyu, Zhu Yijie, Xi Chenguang, Gao Pengyang, Zhou Xun, Chang Kevin Chen-chuan
- Response Length Perception And Sequence Scheduling: An Llm-empowered LLM Inference Pipeline Zheng Zangwei, Ren Xiaozhe, Xue Fuzhao, Luo Yang, Jiang Xin, You Yang
- Building Emotional Support Chatbots In The Era Of Llms Zheng Zhonghua, Liao Lizi, Deng Yang, Nie Liqiang
- Self-polish: Enhance Reasoning In Large Language Models Via Problem Refinement Zhiheng Xi, Senjie Jin, Yuhao Zhou, Rui Zheng, Songyang Gao, Tao Gui, Qi Zhang, Xuanjing Huang
- Humanoid Agents: Platform For Simulating Human-like Generative Agents Zhilin Wang, Yu Ying Chiu, Yu Cheung Chiu
- Aligning Large Multimodal Models With Factually Augmented RLHF Zhiqing Sun, Sheng Shen, Shengcao Cao, Haotian Liu, Chunyuan Li, Yikang Shen, Chuang Gan, Liang-yan Gui, Yu-xiong Wang, Yiming Yang, Kurt Keutzer, Trevor Darrell
- Principle-driven Self-alignment Of Language Models From Scratch With Minimal Human Supervision Zhiqing Sun, Yikang Shen, Qinhong Zhou, Hongxin Zhang, Zhenfang Chen, David Cox, Yiming Yang, Chuang Gan
- Leveraging Large Language Models For Pre-trained Recommender Systems Zhixuan Chu, Hongyan Hao, Xin Ouyang, Simeng Wang, Yan Wang, Yue Shen, Jinjie Gu, Qing Cui, Longfei Li, Siqiao Xue, James Y Zhang, Sheng Li
- Beyond Hallucinations: Enhancing Lvlms Through Hallucination-aware Direct Preference Optimization Zhiyuan Zhao, Bin Wang, Linke Ouyang, Xiaoyi Dong, Jiaqi Wang, Conghui He
- Bookgpt: A General Framework For Book Recommendation Empowered By Large Language Model Zhiyuli Aakas, Chen Yanfang, Zhang Xuan, Liang Xun
- Seeking Neural Nuggets: Knowledge Transfer In Large Language Models From A Parametric Perspective Zhong Ming, An Chenxin, Chen Weizhu, Han Jiawei, He Pengcheng
- Chatabl: Abductive Learning Via Natural Language Interaction With Chatgpt Zhong Tianyang, Wei Yaonai, Yang Li, Wu Zihao, Liu Zhengliang, Wei Xiaozheng, Li Wenjun, Yao Junjie, Ma Chong, Li Xiang, Zhu Dajiang, Jiang Xi, Han Junwei, Shen Dinggang, Liu Tianming, Zhang Tuo
- Language Agent Tree Search Unifies Reasoning Acting And Planning In Language Models Zhou Andy, Yan Kai, Shlapentokh-rothman Michal, Wang Haohan, Wang Yu-xiong
- Survival Of The Most Influential Prompts: Efficient Black-box Prompt Search Via Clustering And Pruning Zhou Han, Wan Xingchen, Vulić Ivan, Korhonen Anna
- Ethical Chatgpt: Concerns, Challenges, And Commandments Zhou Jianlong, Müller Heimo, Holzinger Andreas, Chen Fang
- Vicor: Bridging Visual Understanding And Commonsense Reasoning With Large Language Models Zhou Kaiwen, Lee Kwonjoon, Misu Teruhisa, Wang Xin Eric
- CONA: A Novel Context-aware Instruction Paradigm For Communication Using Large Language Model Zhou Nan, Tao Xinghui, Chen Xi
- Exploring Recommendation Capabilities Of Gpt-4v(ision): A Preliminary Case Study Zhou Peilin, Cao Meng, Huang You-liang, Ye Qichen, Zhang Peiyan, Liu Junling, Xie Yueqi, Hua Yining, Kim Jaeboum
- Unleashing GPT On The Metaverse: Savior Or Destroyer? Zhou Pengyuan
- Infmllm: A Unified Framework For Visual-language Tasks Zhou Qiang, Wang Zhibin, Chu Wei, Xu Yinghui, Li Hao, Qi Yuan
- Context-faithful Prompting For Large Language Models Zhou Wenxuan, Zhang Sheng, Poon Hoifung, Chen Muhao
- Advancing Transformer's Capabilities In Commonsense Reasoning Zhou Yu, Han Yunqiu, Zhou Hanyu, Wu Yulun
- Quantifying And Analyzing Entity-level Memorization In Large Language Models Zhou Zhenhong, Xiang Jiuyang, Chen Chaomeng, Su Sen
- Large Language Model As A Policy Teacher For Training Reinforcement Learning Agents Zhou Zihao, Hu Bin, Zhao Chenyang, Zhang Pu, Liu Bin
- Pose: Efficient Context Window Extension Of Llms Via Positional Skip-wise Training Zhu Dawei, Yang Nan, Wang Liang, Song Yifan, Wu Wenhao, Wei Furu, Li Sujian
- VL-GPT: A Generative Pre-trained Transformer For Vision And Language Understanding And Generation Zhu Jinguo, Ding Xiaohan, Ge Yixiao, Ge Yuying, Zhao Sijie, Zhao Hengshuang, Wang Xiaohua, Shan Ying
- Promptbench: A Unified Library For Evaluation Of Large Language Models Zhu Kaijie, Zhao Qinlin, Chen Hao, Wang Jindong, Xie Xing
- Judgelm: Fine-tuned Large Language Models Are Scalable Judges Zhu Lianghui, Wang Xinggang, Wang Xinlong
- Extrapolating Large Language Models To Non-english By Aligning Languages Zhu Wenhao, Lv Yunzhe, Dong Qingxiu, Yuan Fei, Xu Jingjing, Huang Shujian, Kong Lingpeng, Chen Jiajun, Li Lei
- Question Answering As Programming For Solving Time-sensitive Questions Zhu Xinyu, Yang Cheng, Chen Bei, Li Siheng, Lou Jian-guang, Yang Yujiu
- Towards An On-device Agent For Text Rewriting Zhu Yun, Liu Yinxiao, Stahlberg Felix, Kumar Shankar, Chen Yu-hui, Luo Liangchen, Shu Lei, Liu Renjie, Chen Jindong, Meng Lei
- 3d-vista: Pre-trained Transformer For 3D Vision And Text Alignment Zhu Ziyu, Ma Xiaojian, Chen Yixin, Deng Zhidong, Huang Siyuan, Li Qing
- Toolqa: A Dataset For LLM Question Answering With External Tools Zhuang Yuchen, Yu Yue, Wang Kuan, Sun Haotian, Zhang Chao
- Ice-score: Instructing Large Language Models To Evaluate Code Zhuo Terry Yue
- Can Large Language Models Transform Computational Social Science? Ziems Caleb, Held William, Shaikh Omar, Chen Jiaao, Zhang Zhehao, Yang Diyi
- JARVIS-1: Open-world Multi-task Agents With Memory-augmented Multimodal Language Models Zihao Wang, Shaofei Cai, Anji Liu, Yonggang Jin, Jinbing Hou, Bowei Zhang, Haowei Lin, Zhaofeng He, Zilong Zheng, Yaodong Yang, Xiaojian Ma, Yitao Liang
- Describe, Explain, Plan And Select: Interactive Planning With Large Language Models Enables Open-world Multi-task Agents Zihao Wang, Shaofei Cai, Guanzhou Chen, Anji Liu, Xiaojian Ma, Yitao Liang
- Tab-cot: Zero-shot Tabular Chain Of Thought Ziqi Jin, Wei Lu
- Chatgpt Hallucinates When Attributing Answers Zuccon Guido, Koopman Bevan, Shaik Razia
- Incorporating Probing Signals Into Multimodal Machine Translation Via Visual Question-answering Pairs Zuo Yuxin, Li Bei, Lv Chuanhao, Zheng Tong, Xiao Tong, Zhu Jingbo
- A Dataset And Benchmark For Hospital Course Summarization With Adapted Large Language Models Aali Asad, Van Veen Dave, Arefeen Yamin Ishraq, Hom Jason, Bluethgen Christian, Reis Eduardo Pontes, Gatidis Sergios, Clifford Namuun, Daws Joseph, Tehrani Arash S., Kim Jangwon, Chaudhari Akshay S.
- Benchmarking Large Language Models For Persian: A Preliminary Study Focusing On Chatgpt Abaskohi Amirhossein, Baruni Sara, Masoudi Mostafa, Abbasi Nesa, Babalou Mohammad Hadi, Edalat Ali, Kamahi Sepehr, Sani Samin Mahdizadeh, Naghavian Nikoo, Namazifard Danial, Sadeghi Pouya, Yaghoobzadeh Yadollah
- Can Llms Be Fooled? Investigating Vulnerabilities In Llms Abdali Sara, He Jia, Barberan Cj, Anarfi Richard
- Are You Still On Track!? Catching LLM Task Drift With Activations Abdelnabi Sahar, Fay Aideen, Cherubin Giovanni, Salem Ahmed, Fritz Mario, Paverd Andrew
- The Llama 3 Herd Of Models Abhimanyu Dubey, Abhinav Jauhri, Abhinav Pandey, Abhishek Kadian, Ahmad Al-dahle, Aiesha Letman, Akhil Mathur, Alan Schelten, Amy Yang, Angela Fan, Anirudh Goyal, Anthony Hartshorn, Aobo Yang, Archi Mitra, Archie Sravankumar, Artem Korenev, Arthur Hinsvark, Arun Rao, Aston Zhang, Aurelien Rodriguez, Austen Gregerson, Ava Spataru, Baptiste Roziere, Bethany Biron, Binh Tang, Bobbie Chern, Charlotte Caucheteux, Chaya Nayak, Chloe Bi, Chris Marra, Chris Mcconnell, Christian Keller, Christophe Touret, Chunyang Wu, Corinne Wong, Cristian Canton Ferrer, Cyrus Nikolaidis, Damien Allonsius, Daniel Song, Danielle Pintz, Danny Livshits, David Esiobu, Dhruv Choudhary, Dhruv Mahajan, Diego Garcia-olano, Diego Perino, Dieuwke Hupkes, Egor Lakomkin, Ehab Albadawy, Elina Lobanova, Emily Dinan, Eric Michael Smith, Filip Radenovic, Frank Zhang, Gabriel Synnaeve, Gabrielle Lee, Georgia Lewis Anderson, Graeme Nail, Gregoire Mialon, Guan Pang, Guillem Cucurell, Hailey Nguyen, Hannah Korevaar, Hu Xu, Hugo Touvron, Iliyan Zarov, Imanol Arrieta Ibarra, Isabel Kloumann, Ishan Misra, Ivan Evtimov, Jade Copet, Jaewon Lee, Jan Geffert, Jana Vranes, Jason Park, Jay Mahadeokar, Jeet Shah, Jelmer Van Der Linde, Jennifer Billock, Jenny Hong, Jenya Lee, Jeremy Fu, Jianfeng Chi, Jianyu Huang, Jiawen Liu, Jie Wang, Jiecao Yu, Joanna Bitton, Joe Spisak, Jongsoo Park, Joseph Rocca, Joshua Johnstun, Joshua Saxe, Junteng Jia, Kalyan Vasuden Alwala, Kartikeya Upasani, Kate Plawiak, Ke Li, Kenneth Heafield, Kevin Stone, Khalid El-arini, Krithika Iyer, Kshitiz Malik, Kuenley Chiu, Kunal Bhalla, Lauren Rantala-yeary, Laurens Van Der Maaten, Lawrence Chen, Liang Tan, Liz Jenkins, Louis Martin, Lovish Madaan, Lubo Malo, Lukas Blecher, Lukas Landzaat, Luke De Oliveira, Madeline Muzzi, Mahesh Pasupuleti, Mannat Singh, Manohar Paluri, Marcin Kardas, Mathew Oldham, Mathieu Rita, Maya Pavlova, Melanie Kambadur, Mike Lewis, Min Si, Mitesh Kumar Singh, Mona Hassan, Naman Goyal, Narjes 
Torabi, Nikolay Bashlykov, Nikolay Bogoychev, Niladri Chatterji, Olivier Duchenne, Onur Çelebi, Patrick Alrassy, Pengchuan Zhang, Pengwei Li, Petar Vasic, Peter Weng, Prajjwal Bhargava, Pratik Dubal, Praveen Krishnan, Punit Singh Koura, Puxin Xu, Qing He, Qingxiao Dong, Ragavan Srinivasan, Raj Ganapathy, Ramon Calderer, Ricardo Silveira Cabral, Robert Stojnic, Roberta Raileanu, Rohit Girdhar, Rohit Patel, Romain Sauvestre, Ronnie Polidoro, Roshan Sumbaly, Ross Taylor, Ruan Silva, Rui Hou, Rui Wang, Saghar Hosseini, Sahana Chennabasappa, Sanjay Singh, Sean Bell, Seohyun Sonia Kim, Sergey Edunov, Shaoliang Nie, Sharan Narang, Sharath Raparthy, Sheng Shen, Shengye Wan, Shruti Bhosale, Shun Zhang, Simon Vandenhende, Soumya Batra, Spencer Whitman, Sten Sootla, Stephane Collot, Suchin Gururangan, Sydney Borodinsky, Tamar Herman, Tara Fowler, Tarek Sheasha, Thomas Georgiou, Thomas Scialom, Tobias Speckbacher, Todor Mihaylov, Tong Xiao, Ujjwal Karn, Vedanuj Goswami, Vibhor Gupta, Vignesh Ramanathan, Viktor Kerkez, Vincent Gonguet, Virginie Do, Vish Vogeti, Vladan Petrovic, Weiwei Chu, Wenhan Xiong, Wenyin Fu, Whitney Meers, Xavier Martinet, Xiaodong Wang, Xiaoqing Ellen Tan, Xinfeng Xie, Xuchao Jia, Xuewei Wang, Yaelle Goldschlag, Yashesh Gaur, Yasmine Babaei, Yi Wen, Yiwen Song, Yuchen Zhang, Yue Li, Yuning Mao, Zacharie Delpierre Coudert, Zheng Yan, Zhengxing Chen, Zoe Papakipos, Aaditya Singh, Aaron Grattafiori, Abha Jain, Adam Kelsey, Adam Shajnfeld, Adithya Gangidi, Adolfo Victoria, Ahuva Goldstand, Ajay Menon, Ajay Sharma, Alex Boesenberg, Alex Vaughan, Alexei Baevski, Allie Feinstein, Amanda Kallet, Amit Sangani, Anam Yunus, Andrei Lupu, Andres Alvarado, Andrew Caples, Andrew Gu, Andrew Ho, Andrew Poulton, Andrew Ryan, Ankit Ramchandani, Annie Franco, Aparajita Saraf, Arkabandhu Chowdhury, Ashley Gabriel, Ashwin Bharambe, Assaf Eisenman, Azadeh Yazdan, Beau James, Ben Maurer, Benjamin Leonhardi, Bernie Huang, Beth Loyd, Beto De Paola, Bhargavi Paranjape, Bing Liu, 
Bo Wu, Boyu Ni, Braden Hancock, Bram Wasti, Brandon Spence, Brani Stojkovic, Brian Gamido, Britt Montalvo, Carl Parker, Carly Burton, Catalina Mejia, Changhan Wang, Changkyu Kim, Chao Zhou, Chester Hu, Ching-hsiang Chu, Chris Cai, Chris Tindal, Christoph Feichtenhofer, Damon Civin, Dana Beaty, Daniel Kreymer, Daniel Li, Danny Wyatt, David Adkins, David Xu, Davide Testuggine, Delia David, Devi Parikh, Diana Liskovich, Didem Foss, Dingkang Wang, Duc Le, Dustin Holland, Edward Dowling, Eissa Jamil, Elaine Montgomery, Eleonora Presani, Emily Hahn, Emily Wood, Erik Brinkman, Esteban Arcaute, Evan Dunbar, Evan Smothers, Fei Sun, Felix Kreuk, Feng Tian, Firat Ozgenel, Francesco Caggioni, Francisco Guzmán, Frank Kanayet, Frank Seide, Gabriela Medina Florez, Gabriella Schwarz, Gada Badeer, Georgia Swee, Gil Halpern, Govind Thattai, Grant Herman, Grigory Sizov, Guangyi Zhang, Guna Lakshminarayanan, Hamid Shojanazeri, Han Zou, Hannah Wang, Hanwen Zha, Haroun Habeeb, Harrison Rudolph, Helen Suk, Henry Aspegren, Hunter Goldman, Ibrahim Damlaj, Igor Molybog, Igor Tufanov, Irina-elena Veliche, Itai Gat, Jake Weissman, James Geboski, James Kohli, Japhet Asher, Jean-baptiste Gaya, Jeff Marcus, Jeff Tang, Jennifer Chan, Jenny Zhen, Jeremy Reizenstein, Jeremy Teboul, Jessica Zhong, Jian Jin, Jingyi Yang, Joe Cummings, Jon Carvill, Jon Shepard, Jonathan Mcphie, Jonathan Torres, Josh Ginsburg, Junjie Wang, Kai Wu, Kam Hou U, Karan Saxena, Karthik Prasad, Kartikay Khandelwal, Katayoun Zand, Kathy Matosich, Kaushik Veeraraghavan, Kelly Michelena, Keqian Li, Kun Huang, Kunal Chawla, Kushal Lakhotia, Kyle Huang, Lailin Chen, Lakshya Garg, Lavender A, Leandro Silva, Lee Bell, Lei Zhang, Liangpeng Guo, Licheng Yu, Liron Moshkovich, Luca Wehrstedt, Madian Khabsa, Manav Avalani, Manish Bhatt, Maria Tsimpoukelli, Martynas Mankus, Matan Hasson, Matthew Lennie, Matthias Reso, Maxim Groshev, Maxim Naumov, Maya Lathi, Meghan Keneally, Michael L.
Seltzer, Michal Valko, Michelle Restrepo, Mihir Patel, Mik Vyatskov, Mikayel Samvelyan, Mike Clark, Mike Macey, Mike Wang, Miquel Jubert Hermoso, Mo Metanat, Mohammad Rastegari, Munish Bansal, Nandhini Santhanam, Natascha Parks, Natasha White, Navyata Bawa, Nayan Singhal, Nick Egebo, Nicolas Usunier, Nikolay Pavlovich Laptev, Ning Dong, Ning Zhang, Norman Cheng, Oleg Chernoguz, Olivia Hart, Omkar Salpekar, Ozlem Kalinli, Parkin Kent, Parth Parekh, Paul Saab, Pavan Balaji, Pedro Rittner, Philip Bontrager, Pierre Roux, Piotr Dollar, Polina Zvyagina, Prashant Ratanchandani, Pritish Yuvraj, Qian Liang, Rachad Alao, Rachel Rodriguez, Rafi Ayub, Raghotham Murthy, Raghu Nayani, Rahul Mitra, Raymond Li, Rebekkah Hogan, Robin Battey, Rocky Wang, Rohan Maheswari, Russ Howes, Ruty Rinott, Sai Jayesh Bondu, Samyak Datta, Sara Chugh, Sara Hunt, Sargun Dhillon, Sasha Sidorov, Satadru Pan, Saurabh Verma, Seiji Yamamoto, Sharadh Ramaswamy, Shaun Lindsay, Sheng Feng, Shenghao Lin, Shengxin Cindy Zha, Shiva Shankar, Shuqiang Zhang, Sinong Wang, Sneha Agarwal, Soji Sajuyigbe, Soumith Chintala, Stephanie Max, Stephen Chen, Steve Kehoe, Steve Satterfield, Sudarshan Govindaprasad, Sumit Gupta, Sungmin Cho, Sunny Virk, Suraj Subramanian, Sy Choudhury, Sydney Goldman, Tal Remez, Tamar Glaser, Tamara Best, Thilo Kohler, Thomas Robinson, Tianhe Li, Tianjun Zhang, Tim Matthews, Timothy Chou, Tzook Shaked, Varun Vontimitta, Victoria Ajayi, Victoria Montanez, Vijai Mohan, Vinay Satish Kumar, Vishal Mangla, Vítor Albiero, Vlad Ionescu, Vlad Poenaru, Vlad Tiberiu Mihailescu, Vladimir Ivanov, Wei Li, Wenchen Wang, Wenwen Jiang, Wes Bouaziz, Will Constable, Xiaocheng Tang, Xiaofang Wang, Xiaojian Wu, Xiaolan Wang, Xide Xia, Xilun Wu, Xinbo Gao, Yanjun Chen, Ye Hu, Ye Jia, Ye Qi, Yenda Li, Yilin Zhang, Ying Zhang, Yossi Adi, Youngjin Nam, Yu Wang, Yuchen Hao, Yundi Qian, Yuzi He, Zach Rait, Zachary Devito, Zef Rosnbrick, Zhaoduo Wen, Zhenyu Yang, Zhiwei Zhao
- Infercept: Efficient Intercept Support For Augmented Large Language Model Inference Abhyankar Reyna, He Zijian, Srivatsa Vikranth, Zhang Hao, Zhang Yiying
- Stackrag Agent: Improving Developer Answers With Retrieval-augmented Generation Abrahamyan Davit, Fard Fatemeh H.
- Supporting Student Decisions On Learning Recommendations: An Llm-based Chatbot With Knowledge Graph Contextualization For Conversational Explainability And Mentoring Abu-rasheed Hasan, Abdulsalam Mohamad Hussam, Weber Christian, Fathi Madjid
- Pathocl: Path-based Prompt Augmentation For OCL Generation With GPT-4 Abukhalaf Seif, Hamdaqa Mohammad, Khomh Foutse
- A Survey On Symbolic Knowledge Distillation Of Large Language Models Acharya Kamal, Velasquez Alvaro, Song Houbing Herbert
- Attnlrp: Attention-aware Layer-wise Relevance Propagation For Transformers Achtibat Reduan, Hatefi Sayed Mohammad Vakilzadeh, Dreyer Maximilian, Jain Aakriti, Wiegand Thomas, Lapuschkin Sebastian, Samek Wojciech
- Comparing LLM Prompting With Cross-lingual Transfer Performance On Indigenous And Low-resource Brazilian Languages Adelani David Ifeoluwa, Doğruöz A. Seza, Coneglian André, Ojha Atul Kr.
- Keyformer: KV Cache Reduction Through Key Tokens Selection For Efficient Generative Inference Adnan Muhammad, Arunkumar Akhil, Jain Gaurav, Nair Prashant J., Soloveychik Ilya, Kamath Purushotham
- Towards Optimizing And Evaluating A Retrieval Augmented QA Chatbot Using Llms With Human In The Loop Afzal Anum, Kowsik Alexander, Fani Rajna, Matthes Florian
- Faithfulness Vs. Plausibility: On The (un)reliability Of Explanations From Large Language Models Agarwal Chirag, Tanneru Sree Harsha, Lakkaraju Himabindu
- Many-shot In-context Learning Agarwal Rishabh, Singh Avi, Zhang Lei M., Bohnet Bernd, Rosias Luis, Chan Stephanie, Zhang Biao, Anand Ankesh, Abbas Zaheer, Nova Azade, Co-reyes John D., Chu Eric, Behbahani Feryal, Faust Aleksandra, Larochelle Hugo
- Illuminate: A Novel Approach For Depression Detection With Explainable Analysis And Proactive Therapy Using Prompt Engineering Agrawal Aryan
- Can Llms Perform Structured Graph Reasoning? Agrawal Palaash, Vasania Shavak, Tan Cheston
- Scalable Ensembling For Mitigating Reward Overoptimisation Ahmed Ahmed M., Rafailov Rafael, Sharkov Stepan, Li Xuechen, Koyejo Sanmi
- LM4OPT: Unveiling The Potential Of Large Language Models In Formulating Mathematical Optimization Problems Ahmed Tasnim, Choudhury Salimur
- Tuning Large Multimodal Models For Videos Using Reinforcement Learning From AI Feedback Ahn Daechul, Choi Yura, Yu Youngjae, Kang Dongyeop, Choi Jonghyun
- Examining The Robustness Of LLM Evaluation To The Distributional Assumptions Of Benchmarks Ailem Melissa, Marazopoulou Katerina, Siska Charlotte, Bono James
- Clustered Retrieved Augmented Generation (CRAG) Akesson Simon, Santos Frances A.
- Have We Reached AGI? Comparing Chatgpt, Claude, And Gemini To Human Literacy And Education Benchmarks Akpan Mfon
- History Of Generative Artificial Intelligence (AI) Chatbots: Past, Present, And Future Development Al-amin Md., Ali Mohammad Shazed, Salam Abdus, Khan Arif, Ali Ashraf, Ullah Ahsan, Alam Md Nur, Chowdhury Shamsul Kabir
- Mixture Of Modular Experts: Distilling Knowledge From A Multilingual Teacher Into Specialized Modular Language Models Al-maamari Mohammed, Amor Mehdi Ben, Granitzer Michael
- Benchmarking Open-source Language Models For Efficient Question Answering In Industrial Applications Alassan Mahaman Sanoussi Yahaya, Espejel Jessica López, Bouhandi Merieme, Dahhane Walid, Ettifouri El Hassane
- Evaluating The Retrieval Component In Llm-based Question Answering Systems Alinejad Ashkan, Kumar Krtin, Vahdat Ali
- Chatgpt Alternative Solutions: Large Language Models Survey Alipour Hanieh, Pendar Nick, Roy Kohinoor
- Unsupervised Evaluation Of Code Llms With Round-trip Correctness Allamanis Miltiadis, Panthaplackel Sheena, Yin Pengcheng
- Toward Conversational Agents With Context And Time Sensitive Long-term Memory Alonso Nick, Figliolia Tomás, Ndirango Anthony, Millidge Beren
- D-NLP At Semeval-2024 Task 2: Evaluating Clinical Inference Capabilities Of Large Language Models Altinok Duygu
- Do Llms Exhibit Human-like Reasoning? Evaluating Theory Of Mind In Llms For Open-ended Responses Amirizaniani Maryam, Martin Elias, Sivachenko Maryna, Mashhadi Afra, Shah Chirag
- Understanding Alignment In Multimodal Llms: A Comprehensive Study Amirloo Elmira, Fauconnier Jean-philippe, Roesmann Christoph, Kerl Christian, Boney Rinu, Qian Yusu, Wang Zirui, Dehghan Afshin, Yang Yinfei, Gan Zhe, Grasch Peter
- Training-free Long-context Scaling Of Large Language Models An Chenxin, Huang Fei, Zhang Jun, Gong Shansan, Qiu Xipeng, Zhou Chang, Kong Lingpeng
- Funaudiollm: Voice Understanding And Generation Foundation Models For Natural Interaction Between Humans And Llms An Keyu, Chen Qian, Deng Chong, Du Zhihao, Gao Changfeng, Gao Zhifu, Gu Yue, He Ting, Hu Hangrui, Hu Kai, Ji Shengpeng, Li Yabin, Li Zerui, Lu Heng, Luo Haoneng, Lv Xiang, Ma Bin, Ma Ziyang, Ni Chongjia, Song Changhe, Shi Jiaqi, Shi Xian, Wang Hao, Wang Wen, Wang Yuxuan, Xiao Zhangyu, Yan Zhijie, Yang Yexin, Zhang Bin, Zhang Qinglin, Zhang Shiliang, Zhao Nan, Zheng Siqi
- Make Your LLM Fully Utilize The Context An Shengnan, Ma Zexiong, Lin Zeqi, Zheng Nanning, Lou Jian-guang
- How Susceptible Are Llms To Influence In Prompts? Anagnostidis Sotiris, Bulian Jannis
- Enhancing Journalism With AI: A Study Of Contextualized Image Captioning For News Articles Using Llms And Lmms Anagnostopoulou Aliki, Gouvea Thiago, Sonntag Daniel
- Mm-phyrlhf: Reinforcement Learning Framework For Multimodal Physics Question-answering Anand Avinash, Kapuriya Janak, Kirtani Chhavi, Singh Apoorv, Saraf Jay, Lal Naman, Kumar Jatin, Shivam Adarsh Raj, Verma Astha, Shah Rajiv Ratn, Zimmermann Roger
- Investigating The Robustness Of Llms On Math Word Problems Anantheswaran Ujjwala, Gupta Himanshu, Scaria Kevin, Verma Shreyas, Baral Chitta, Mishra Swaroop
- Does Refusal Training In Llms Generalize To The Past Tense? Andriushchenko Maksym, Flammarion Nicolas
- Critique-out-loud Reward Models Ankner Zachary, Paul Mansheej, Cui Brandon, Chang Jonathan D., Ammanabrolu Prithviraj
- Generalization V.s. Memorization: Tracing Language Models' Capabilities Back To Pretraining Data Antoniades Antonis, Wang Xinyi, Elazar Yanai, Amayuelas Alfonso, Albalak Alon, Zhang Kexun, Wang William Yang
- Evaluating Chatgpt On Nuclear Domain-specific Data Anwar Muhammad, De Costa Mischa, Hammad Issam, Lau Daniel
- A Framework For Fine-tuning Llms Using Heterogeneous Feedback Aponte Ryan, Rossi Ryan A., Guo Shunan, Dernoncourt Franck, Yu Tong, Chen Xiang, Mitra Subrata, Lipka Nedim
- Coaching Copilot: Blended Form Of An Llm-powered Chatbot And A Human Coach To Effectively Support Self-reflection For Leadership Growth Arakawa Riku, Yakura Hiromu
- Hired: Attention-guided Token Dropping For Efficient Inference Of High-resolution Vision-language Models In Resource-constrained Environments Arif Kazi Hasan Ibn, Yoon Jinyi, Nikolopoulos Dimitrios S., Vandierendonck Hans, John Deepu, Ji Bo
- Forklift: An Extensible Neural Lifter Armengol-estapé Jordi, Rocha Rodrigo C. O., Woodruff Jackson, Minervini Pasquale, O'boyle Michael F. P.
- Simple Linear Attention Language Models Balance The Recall-throughput Tradeoff Arora Simran, Eyuboglu Sabri, Zhang Michael, Timalsina Aman, Alberti Silas, Zinsley Dylan, Zou James, Rudra Atri, Ré Christopher
- Exploring The Robustness Of Task-oriented Dialogue Systems For Colloquial German Varieties Artemova Ekaterina, Blaschke Verena, Plank Barbara
- To Code, Or Not To Code? Exploring Impact Of Code In Pre-training Aryabumi Viraat, Su Yixuan, Ma Raymond, Morisot Adrien, Zhang Ivan, Locatelli Acyr, Fadaee Marzieh, Üstün Ahmet, Hooker Sara
- Evaluating Large Language Models On The GMAT: Implications For The Future Of Business Education Ashrafimoghari Vahid, Gürkan Necdet, Suchow Jordan W.
- Self-seeding And Multi-intent Self-instructing Llms For Generating Intent-aware Information-seeking Dialogs Askari Arian, Petcu Roxana, Meng Chuan, Aliannejadi Mohammad, Abolghasemi Amin, Kanoulas Evangelos, Verberne Suzan
- Minigpt4-video: Advancing Multimodal Llms For Video Understanding With Interleaved Visual-textual Tokens Ataallah Kirolos, Shen Xiaoqian, Abdelrahman Eslam, Sleiman Essam, Zhu Deyao, Ding Jian, Elhoseiny Mohamed
- Elicitron: An LLM Agent-based Simulation Framework For Design Requirements Elicitation Ataei Mohammadmehdi, Cheong Hyunmin, Grandi Daniele, Wang Ye, Morris Nigel, Tessier Alexander
- Bifurcated Attention: Accelerating Massively Parallel Decoding With Shared Prefixes In Llms Athiwaratkun Ben, Gonugondla Sujan Kumar, Gouda Sanjay Krishna, Qian Haifeng, Ding Hantian, Sun Qing, Wang Jun, Guo Jiacheng, Chen Liangfu, Bhatia Parminder, Nallapati Ramesh, Sengupta Sudipta, Xiang Bing
- Transformer Alignment In Large Language Models Aubry Murdock, Meng Haoming, Sugolov Anton, Papyan Vardan
- Current State Of LLM Risks And AI Guardrails Ayyamperumal Suriya Ganesh, Ge Limin
- Feedback-generation For Programming Exercises With GPT-4 Azaiz Imen, Kiesler Natalie, Strickroth Sven
- Llm-driven Robots Risk Enacting Discrimination, Violence, And Unlawful Actions Azeem Rumaisa, Hundt Andrew, Mansouri Masoumeh, Brandão Martim
- Accuracy And Consistency Of Llms In The Registered Dietitian Exam: The Impact Of Prompt Engineering And Knowledge Retrieval Azimi Iman, Qi Mohan, Wang Li, Rahmani Amir M., Li Youlin
- Lamda: Large Model Fine-tuning Via Spectrally Decomposed Low-dimensional Adaptation Azizi Seyedarmin, Kundu Souvik, Pedram Massoud
- Self-improving Customer Review Response Generation Based On Llms Azov Guy, Pelc Tatiana, Alon Adi Fledel, Kamhi Gila
- Masked Mixers For Language Generation And Retrieval Badger Benjamin L.
- Hybrid Preference Optimization: Augmenting Direct Preference Optimization With Auxiliary Objectives Badrinath Anirudhan, Agarwal Prabhat, Xu Jiajing
- Reference-guided Verdict: Llms-as-judges In Automatic Evaluation Of Free-form Text Badshah Sher, Sajjad Hassan
- Efficient Model-agnostic Alignment Via Bayesian Persuasion Bai Fengshuo, Wang Mingzhi, Zhang Zhaowei, Chen Boyuan, Xu Yinda, Wen Ying, Yang Yaodong
- Mt-bench-101: A Fine-grained Benchmark For Evaluating Large Language Models In Multi-turn Dialogues Bai Ge, Liu Jie, Bu Xingyuan, He Yancheng, Liu Jiaheng, Zhou Zhanhui, Lin Zhuoran, Su Wenbo, Ge Tiezheng, Zheng Bo, Ouyang Wanli
- Sparsellm: Towards Global Pruning For Pre-trained Language Models Bai Guangji, Li Yijiang, Ling Chen, Kim Kibaek, Zhao Liang
- COIG-CQIA: Quality Is All You Need For Chinese Instruction Fine-tuning Bai Yuelin, Du Xinrun, Liang Yiming, Jin Yonggang, Liu Ziqiang, Zhou Junting, Zheng Tianyu, Zhang Xincheng, Ma Nuo, Wang Zekun, Yuan Ruibin, Wu Haihong, Lin Hongquan, Huang Wenhao, Zhang Jiajun, Chen Wenhu, Lin Chenghua, Fu Jie, Yang Min, Ni Shiwen, Zhang Ge
- Longwriter: Unleashing 10,000+ Word Generation From Long Context Llms Bai Yushi, Zhang Jiajie, Lv Xin, Zheng Linzhi, Zhu Siqi, Hou Lei, Dong Yuxiao, Tang Jie, Li Juanzi
- Hallucination Of Multimodal Large Language Models: A Survey Bai Zechen, Wang Pichao, Xiao Tianjun, He Tong, Han Zongbo, Zhang Zheng, Shou Mike Zheng
- Finetuning Large Language Model For Personalized Ranking Bai Zhuoxi, Wu Ning, Cai Fengyu, Zhu Xinyi, Xiong Yun
- MARS: Meaning-aware Response Scoring For Uncertainty Estimation In Generative Llms Bakman Yavuz Faruk, Yaldiz Duygu Nur, Buyukates Baturalp, Tao Chenyang, Dimitriadis Dimitrios, Avestimehr Salman
- CYGENT: A Cybersecurity Conversational Agent With Log Summarization Powered By GPT-3 Balasubramanian Prasasthy, Seby Justin, Kostakos Panos
- Demystifying Platform Requirements For Diverse LLM Inference Use Cases Bambhaniya Abhimanyu, Raj Ritik, Jeong Geonhwa, Kundu Souvik, Srinivasan Sudarshan, Elavazhagan Midhilesh, Kumar Madhu, Krishna Tushar
- Linguistic Calibration Of Long-form Generations Band Neil, Li Xuechen, Ma Tengyu, Hashimoto Tatsunori
- Breaking Boundaries: Investigating The Effects Of Model Editing On Cross-linguistic Performance Banerjee Somnath, Halder Avik, Mandal Rajarshi, Layek Sayan, Soboroff Ian, Hazra Rima, Mukherjee Animesh
- How (un)ethical Are Instruction-centric Responses Of Llms? Unveiling The Vulnerabilities Of Safety Guardrails To Harmful Queries Banerjee Somnath, Layek Sayan, Hazra Rima, Mukherjee Animesh
- Comparing Bad Apples To Good Oranges: Aligning Large Language Models Via Joint Preference Optimization Bansal Hritik, Suvarna Ashima, Bhatt Gantavya, Peng Nanyun, Chang Kai-wei, Grover Aditya
- LLM Augmented Llms: Expanding Capabilities Through Composition Bansal Rachit, Samanta Bidisha, Dalmia Siddharth, Gupta Nitish, Vashishth Shikhar, Ganapathy Sriram, Bapna Abhishek, Jain Prateek, Talukdar Partha
- Llms With Chain-of-thought Are Non-causal Reasoners Bao Guangsheng, Zhang Hongbo, Yang Linyi, Wang Cunxiang, Zhang Yue
- Decoding Matters: Addressing Amplification Bias And Homogeneity Issue For Llm-based Recommendation Bao Keqin, Zhang Jizhi, Zhang Yang, Huo Xinyue, Chen Chong, Feng Fuli
- Aligning Large Language Models From Self-reference AI Feedback With One General Principle Bao Rong, Zheng Rui, Dou Shihan, Wang Xiao, Zhou Enyu, Wang Bo, Zhang Qi, Ding Liang, Tao Dacheng
- Transforming Agency. On The Mode Of Existence Of Large Language Models Barandiaran Xabier E., Almendros Lola S.
- To Each (textual Sequence) Its Own: Improving Memorized-data Unlearning In Large Language Models Barbulescu George-octavian, Triantafillou Peter
- Reinforcement Learning From LLM Feedback To Counteract Goal Misgeneralization Barj Houda Nait El, Sautory Theophile
- From 'showgirls' To 'performers': Fine-tuning With Gender-inclusive Language For Bias Reduction In Llms Bartl Marion, Leavy Susan
- STARLING: Self-supervised Training Of Text-based Reinforcement Learning Agent With Large Language Models Basavatia Shreyas, Murugesan Keerthiram, Ratnakar Shivam
- Medaide: Leveraging Large Language Models For On-premise Medical Assistance On Edge Devices Basit Abdul, Hussain Khizar, Hanif Muhammad Abdullah, Shafique Muhammad
- Llms' Reading Comprehension Is Affected By Parametric Knowledge And Struggles With Hypothetical Statements Basmov Victoria, Goldberg Yoav, Tsarfaty Reut
- NESTFUL: A Benchmark For Evaluating Llms On Nested Sequences Of API Calls Basu Kinjal, Abdelaziz Ibrahim, Bradford Kelsey, Crouse Maxwell, Kate Kiran, Kumaravel Sadhana, Goyal Saurabh, Munawar Asim, Rizk Yara, Wang Xin, Lastras Luis, Kapanipathi Pavan
- API-BLEND: A Comprehensive Corpora For Training And Benchmarking API Llms Basu Kinjal, Abdelaziz Ibrahim, Chaudhury Subhajit, Dan Soham, Crouse Maxwell, Munawar Asim, Kumaravel Sadhana, Muthusamy Vinod, Kapanipathi Pavan, Lastras Luis A.
- Understanding Information Storage And Transfer In Multi-modal Large Language Models Basu Samyadeep, Grayson Martin, Morrison Cecily, Nushi Besmira, Feizi Soheil, Massiceti Daniela
- LINGOLY: A Benchmark Of Olympiad-level Linguistic Reasoning Puzzles In Low-resource And Extinct Languages Bean Andrew M., Hellsten Simi, Mayne Harry, Magomere Jabez, Chi Ethan A., Chi Ryan, Hale Scott A., Kirk Hannah Rose
- Divergent Creativity In Humans And Large Language Models Bellemare-pepin Antoine, Lespinasse François, Thölke Philipp, Harel Yann, Mathewson Kory, Olson Jay A., Bengio Yoshua, Jerbi Karim
- Crafting Interpretable Embeddings By Asking Llms Questions Benara Vinamra, Singh Chandan, Morris John X., Antonello Richard, Stoica Ion, Huth Alexander G., Gao Jianfeng
- Cross-lingual Editing In Multilingual Language Models Beniwal Himanshu, D Kowsik Nandagopan, Singh Mayank
- Machine Translation Hallucination Detection For Low And High Resource Languages Using Large Language Models Benkirane Kenza, Gongas Laura, Pelles Shahar, Fuchs Naomi, Darmon Joshua, Stenetorp Pontus, Adelani David Ifeoluwa, Sánchez Eduardo
- A Systematic Analysis Of Large Language Models As Soft Reasoners: The Case Of Syllogistic Inferences Bertolazzi Leonardo, Gatt Albert, Bernardi Raffaella
- Multi-head RAG: Solving Multi-aspect Problems With Llms Besta Maciej, Kubicek Ales, Niggli Roman, Gerstenberger Robert, Weitzendorf Lucas, Chi Mingyuan, Iff Patrick, Gajda Joanna, Nyczyk Piotr, Müller Jürgen, Niewiadomski Hubert, Chrapek Marcin, Podstawski Michał, Hoefler Torsten
- Checkembed: Effective Verification Of LLM Solutions To Open-ended Tasks Besta Maciej, Paleari Lorenzo, Kubicek Ales, Nyczyk Piotr, Gerstenberger Robert, Iff Patrick, Lehmann Tomasz, Niewiadomski Hubert, Hoefler Torsten
- Enhancing Event Reasoning In Large Language Models Through Instruction Fine-tuning With Semantic Causal Graphs Bethany Mazal, Bethany Emet, Wherry Brandon, Chiang Cho-yu, Vishwamitra Nishant, Rios Anthony, Najafirad Peyman
- Guiding Llms The Right Way: Fast, Non-invasive Constrained Generation Beurer-kellner Luca, Fischer Marc, Vechev Martin
- Taking The Next Step With Generative Artificial Intelligence: The Transformative Role Of Multimodal Large Language Models In Science Education Bewersdorff Arne, Hartmann Christian, Hornberger Marie, Seßler Kathrin, Bannert Maria, Kasneci Enkelejda, Kasneci Gjergji, Zhai Xiaoming, Nerdel Claudia
- Clembench-2024: A Challenging, Dynamic, Complementary, Multilingual Benchmark And Underlying Flexible Framework For Llms As Multi-action Agents Beyer Anne, Chalamalasetti Kranti, Hakimov Sherzod, Madureira Brielen, Sadler Philipp, Schlangen David
- Towards Dialogues For Joint Human-ai Reasoning And Value Alignment Bezou-vrakatseli Elfia, Cocarascu Oana, Modgil Sanjay
- Soft Prompting For Unlearning In Large Language Models Bhaila Karuna, Van Minh-hao, Wu Xintao
- Efficient Reinforcement Learning Via Large Language Model-based Search Bhambri Siddhant, Bhattacharjee Amrita, Liu Huan, Kambhampati Subbarao
- On The Robustness Of Language Models For Tabular Question Answering Bhandari Kushal Raj, Xing Sixue, Dan Soham, Gao Jianxi
- Fintral: A Family Of GPT-4 Level Multimodal Financial Large Language Models Bhatia Gagan, Nagoudi El Moatez Billah, Cavusoglu Hasan, Abdul-mageed Muhammad
- Is Factuality Decoding A Free Lunch For Llms? Evaluation On Knowledge Editing Benchmark Bi Baolong, Liu Shenghua, Wang Yiwei, Mei Lingrui, Cheng Xueqi
- Learning Shortcuts: On The Misleading Promise Of NLU In Language Models Bihani Geetanjali, Rayz Julia Taylor
- Supervisory Prompt Training Billa Jean Ghislain, Oh Min, Du Liang
- Lipost: Improved Content Understanding With Effective Use Of Multi-task Contrastive Learning Bindal Akanksha, Ramanujam Sudarshan, Golland Dave, Hazen Tj, Jiang Tina, Zhang Fengyu, Yan Peng
- Statically Contextualizing Large Language Models With Typed Holes Blinn Andrew, Li Xiang, Kim June Hyung, Omar Cyrus
- Long-span Question-answering: Automatic Question Generation And Qa-system Ranking Via Side-by-side Evaluation Bohnet Bernd, Swersky Kevin, Liu Rosanne, Awasthi Pranjal, Nova Azade, Snaider Javier, Sedghi Hanie, Parisi Aaron T, Collins Michael, Lazaridou Angeliki, Firat Orhan, Fiedel Noah
- Rambla: A Framework For Evaluating The Reliability Of Llms As Assistants In The Biomedical Domain Bolton William James, Poyiadzi Rafael, Morrell Edward R., Bueno Gabriela Van Bergen Gonzalez, Goetz Lea
- Do Llms Dream Of Ontologies? Bombieri Marco, Fiorini Paolo, Ponzetto Simone Paolo, Rospocher Marco
- Windows Agent Arena: Evaluating Multi-modal OS Agents At Scale Bonatti Rogerio, Zhao Dan, Bonacci Francesco, Dupont Dillon, Abdali Sara, Li Yinheng, Wagle Justin, Koishida Kazuhito, Bucker Arthur, Jang Lawrence, Hui Zack
- Batch Universal Prediction Bondaschi Marco, Gastpar Michael
- Elephants Never Forget: Memorization And Learning Of Tabular Data In Large Language Models Bordt Sebastian, Nori Harsha, Rodrigues Vanessa, Nushi Besmira, Caruana Rich
- Recurrentgemma: Moving Past Transformers For Efficient Open Language Models Botev Aleksandar, De Soham, Smith Samuel L, Fernando Anushan, Muraru George-cristian, Haroun Ruba, Berrada Leonard, Pascanu Razvan, Sessa Pier Giuseppe, Dadashi Robert, Hussenot Léonard, Ferret Johan, Girgin Sertan, Bachem Olivier, Andreev Alek, Kenealy Kathleen, Mesnard Thomas, Hardin Cassidy, Bhupatiraju Surya, Pathak Shreya, Sifre Laurent, Rivière Morgane, Kale Mihir Sanjay, Love Juliette, Tafti Pouya, Joulin Armand, Fiedel Noah, Senter Evan, Chen Yutian, Srinivasan Srivatsan, Desjardins Guillaume, Budden David, Doucet Arnaud, Vikram Sharad, Paszke Adam, Gale Trevor, Borgeaud Sebastian, Chen Charlie, Brock Andy, Paterson Antonia, Brennan Jenny, Risdal Meg, Gundluru Raj, Devanathan Nesh, Mooney Paul, Chauhan Nilay, Culliton Phil, Martins Luiz Gustavo, Bandy Elisa, Huntsperger David, Cameron Glenn, Zucker Arthur, Warkentin Tris, Peran Ludovic, Giang Minh, Ghahramani Zoubin, Farabet Clément, Kavukcuoglu Koray, Hassabis Demis, Hadsell Raia, Teh Yee Whye, De Freitas Nando
- Embedding Large Language Models Into Extended Reality: Opportunities And Challenges For Inclusion, Engagement, And Privacy Bozkir Efe, Özdel Süleyman, Lau Ka Hei Carrie, Wang Mengdi, Gao Hong, Kasneci Enkelejda
- The Art Of Saying No: Contextual Noncompliance In Language Models Brahman Faeze, Kumar Sachin, Balachandran Vidhisha, Dasigi Pradeep, Pyatkin Valentina, Ravichander Abhilasha, Wiegreffe Sarah, Dziri Nouha, Chandu Khyathi, Hessel Jack, Tsvetkov Yulia, Smith Noah A., Choi Yejin, Hajishirzi Hannaneh
- Reducing Transformer Key-value Cache Size With Cross-layer Attention Brandon William, Mishra Mayank, Nrusimha Aniruddha, Panda Rameswar, Kelly Jonathan Ragan
- Exploring Precision And Recall To Assess The Quality And Diversity Of Llms Bronnec Florian Le, Verine Alexandre, Negrevergne Benjamin, Chevaleyre Yann, Allauzen Alexandre
- Pangea: Procedural Artificial Narrative Using Generative AI For Turn-based Video Games Buongiorno Steph, Klinkert Lawrence Jake, Chawla Tanishq, Zhuang Zixin, Clark Corey
- Conversational Complexity For Assessing Risk In Large Language Models Burden John, Cebrian Manuel, Hernandez-orallo Jose
- Roguegpt: Dis-ethical Tuning Transforms Chatgpt4 Into A Rogue AI In 158 Words Buscemi Alessio, Proverbio Daniele
- Aligning Large Language Models With Counterfactual DPO Butcher Bradley
- ARES: Alternating Reinforcement Learning And Supervised Fine-tuning For Enhanced Multi-modal Chain-of-thought Reasoning Through Diverse AI Feedback Byun Ju-seung, Chun Jiyun, Kil Jihyung, Perrault Andrew
- Reducing Hallucination In Structured Outputs Via Retrieval-augmented Generation Béchard Patrice, Ayala Orlando Marquez
- On Large Language Models In National Security Applications Caballero William N., Jenkins Phillip R.
- TACT: Advancing Complex Aggregative Reasoning With Information Extraction Tools Caciularu Avi, Jacovi Alon, Ben-david Eyal, Goldshtein Sasha, Schuster Tal, Herzig Jonathan, Elidan Gal, Globerson Amir
- High-dimension Human Value Representation In Large Language Models Cahyawijaya Samuel, Chen Delong, Bang Yejin, Khalatbari Leila, Wilie Bryan, Ji Ziwei, Ishii Etsuko, Fung Pascale
- Llms Are Few-shot In-context Low-resource Language Learners Cahyawijaya Samuel, Lovenia Holy, Fung Pascale
- Cendol: Open Instruction-tuned Generative Large Language Models For Indonesian Languages Cahyawijaya Samuel, Lovenia Holy, Koto Fajri, Putri Rifki Afina, Dave Emmanuel, Lee Jhonson, Shadieq Nuur, Cenggoro Wawan, Akbar Salsabil Maulana, Mahendra Muhammad Ihza, Putri Dea Annisayanti, Wilie Bryan, Winata Genta Indra, Aji Alham Fikri, Purwarianti Ayu, Fung Pascale
- Low-cost Generation And Evaluation Of Dictionary Example Sentences Cai Bill, Ng Clarence Boon Liang, Tan Daniel, Hotama Shelvia
- On The Transformations Across Reward Model, Parameter Update, And In-context Prompt Cai Deng, Li Huayang, Fu Tingchen, Li Siheng, Xu Weiwen, Li Shuaiyi, Cao Bowen, Zhang Zhisong, Huang Xinting, Cui Leyang, Wang Yan, Liu Lemao, Watanabe Taro, Shi Shuming
- Sciassess: Benchmarking LLM Proficiency In Scientific Literature Analysis Cai Hengxing, Cai Xiaochen, Chang Junhan, Li Sihang, Yao Lin, Wang Changxin, Gao Zhifeng, Wang Hongshuai, Li Yongge, Lin Mujie, Yang Shuwen, Wang Jiankun, Xu Mingjun, Huang Jin, Xi Fang, Zhuang Jiaxi, Yin Yuqi, Li Yaqi, Chen Changhong, Cheng Zheng, Zhao Zifeng, Zhang Linfeng, Ke Guolin
- Flextron: Many-in-one Flexible Large Language Model Cai Ruisi, Muralidharan Saurav, Heinrich Greg, Yin Hongxu, Wang Zhangyang, Kautz Jan, Molchanov Pavlo
- Towards Large Language Model Aided Program Refinement Cai Yufan, Hou Zhe, Luan Xiaokun, Baena David Miguel Sanan, Lin Yun, Sun Jun, Dong Jin Song
- Internlm2 Technical Report Cai Zheng, Cao Maosong, Chen Haojiong, Chen Kai, Chen Keyu, Chen Xin, Chen Xun, Chen Zehui, Chen Zhi, Chu Pei, Dong Xiaoyi, Duan Haodong, Fan Qi, Fei Zhaoye, Gao Yang, Ge Jiaye, Gu Chenya, Gu Yuzhe, Gui Tao, Guo Aijia, Guo Qipeng, He Conghui, Hu Yingfan, Huang Ting, Jiang Tao, Jiao Penglong, Jin Zhenjiang, Lei Zhikai, Li Jiaxing, Li Jingwen, Li Linyang, Li Shuaibin, Li Wei, Li Yining, Liu Hongwei, Liu Jiangning, Hong Jiawei, Liu Kaiwen, Liu Kuikun, Liu Xiaoran, Lv Chengqi, Lv Haijun, Lv Kai, Ma Li, Ma Runyuan, Ma Zerun, Ning Wenchang, Ouyang Linke, Qiu Jiantao, Qu Yuan, Shang Fukai, Shao Yunfan, Song Demin, Song Zifan, Sui Zhihao, Sun Peng, Sun Yu, Tang Huanze, Wang Bin, Wang Guoteng, Wang Jiaqi, Wang Jiayu, Wang Rui, Wang Yudong, Wang Ziyi, Wei Xingjian, Weng Qizhen, Wu Fan, Xiong Yingtong, Xu Chao, Xu Ruiliang, Yan Hang, Yan Yirong, Yang Xiaogui, Ye Haochen, Ying Huaiyuan, Yu Jia, Yu Jing, Zang Yuhang, Zhang Chuyu, Zhang Li, Zhang Pan, Zhang Peng, Zhang Ruijie, Zhang Shuo, Zhang Songyang, Zhang Wenjian, Zhang Wenwei, Zhang Xingcheng, Zhang Xinyue, Zhao Hui, Zhao Qian, Zhao Xiaomeng, Zhou Fengzhe, Zhou Zaida, Zhuo Jingming, Zou Yicheng, Qiu Xipeng, Qiao Yu, Lin Dahua
- Towards Logically Consistent Language Models Via Probabilistic Reasoning Calanzone Diego, Teso Stefano, Vergari Antonio
- On Behalf Of The Stakeholders: Trends In NLP Model Interpretability In The Era Of Llms Calderon Nitay, Reichart Roi
- On The Worst Prompt Performance Of Large Language Models Cao Bowen, Cai Deng, Zhang Zhisong, Zou Yuexian, Lam Wai
- Beyond Sparse Rewards: Enhancing Reinforcement Learning With Language Model Critique In Text Generation Cao Meng, Shu Lei, Yu Lei, Zhu Yun, Wichers Nevan, Liu Yinxiao, Meng Lei
- Genrec: Generative Sequential Recommendation With Large Language Models Cao Panfeng, Lio Pietro
- Verifiable Generation With Subsentence-level Fine-grained Citations Cao Shuyang, Wang Lu
- What Is The Visual Cognition Gap Between Humans And Multimodal Llms? Cao Xu, Lai Bolin, Ye Wenqian, Ma Yunsheng, Heintz Joerg, Chen Jintai, Cao Jianguo, Rehg James M.
- Personalized Steering Of Large Language Models: Versatile Steering Vectors Through Bi-directional Preference Optimization Cao Yuanpu, Zhang Tianrong, Cao Bochuan, Yin Ziyi, Lin Lu, Ma Fenglong, Chen Jinghui
- Survey On Large Language Model-enhanced Reinforcement Learning: Concept, Taxonomy, And Methods Cao Yuji, Zhao Huan, Cheng Yuheng, Shu Ting, Liu Guolong, Liang Gaoqi, Zhao Junhua, Li Yun
- Retaining Key Information Under High Compression Ratios: Query-guided Compressor For Llms Cao Zhiwei, Cao Qian, Lu Yu, Peng Ningxin, Huang Luyang, Cheng Shanbo, Su Jinsong
- Chart-based Reasoning: Transferring Capabilities From Llms To Vlms Carbune Victor, Mansoor Hassan, Liu Fangyu, Aralikatte Rahul, Baechler Gilles, Chen Jindong, Sharma Abhanshu
- Can Formal Argumentative Reasoning Enhance Llms Performances? Castagna Federico, Sassoon Isabel, Parsons Simon
- HAMMR: Hierarchical Multimodal React Agents For Generic VQA Castrejon Lluis, Mensink Thomas, Zhou Howard, Ferrari Vittorio, Araujo Andre, Uijlings Jasper
- Suppressing Pink Elephants With Direct Principle Feedback Castricato Louis, Lile Nathan, Anand Suraj, Schoelkopf Hailey, Verma Siddharth, Biderman Stella
- Smileyllama: Modifying Large Language Models For Directed Chemical Space Exploration Cavanagh Joseph M., Sun Kunyang, Gritsevskiy Andrew, Bagni Dorian, Bannister Thomas D., Head-gordon Teresa
- Dual Modalities Of Text: Visual And Textual Generative Pre-training Chai Yekun, Liu Qingyi, Xiao Jingwu, Wang Shuohuan, Sun Yu, Wu Hua
- An Expert Is Worth One Token: Synergizing Multiple Expert Llms As Generalist Via Expert Token Routing Chai Ziwei, Wang Guoyin, Su Jing, Zhang Tianjie, Huang Xuanwen, Wang Xuwu, Xu Jingjing, Yuan Jianbo, Yang Hongxia, Wu Fei, Yang Yang
- Detoxbench: Benchmarking Large Language Models For Multitask Fraud & Abuse Detection Chakraborty Joymallya, Xia Wei, Majumder Anirban, Ma Dan, Chaabene Walid, Janvekar Naveed
- Cross-modal Safety Alignment: Is Textual Unlearning All You Need? Chakraborty Trishna, Shayegani Erfan, Cai Zikui, Abu-ghazaleh Nael, Asif M. Salman, Dong Yue, Roy-chowdhury Amit K., Song Chengyu
- Investigating Llms As Voting Assistants Via Contextual Augmentation: A Case Study On The European Parliament Elections 2024 Chalkidis Ilias
- Llama Meets EU: Investigating The European Political Spectrum Through The Lens Of Llms Chalkidis Ilias, Brandl Stephanie
- Dense Reward For Free In Reinforcement Learning From Human Feedback Chan Alex J., Sun Hao, Holt Samuel, Van Der Schaar Mihaela
- RQ-RAG: Learning To Refine Queries For Retrieval Augmented Generation Chan Chi-min, Xu Chunpu, Yuan Ruibin, Luo Hongyin, Xue Wei, Guo Yike, Fu Jie
- Negotiationtom: A Benchmark For Stress-testing Machine Theory Of Mind On Negotiation Surrounding Chan Chunkit, Jiayang Cheng, Yim Yauwai, Deng Zheye, Fan Wei, Li Haoran, Liu Xin, Zhang Hongming, Wang Weiqi, Song Yangqiu
- Prompto: An Open Source Library For Asynchronous Querying Of LLM Endpoints Chan Ryan Sze-yin, Nanni Federico, Brown Edwin, Chapman Ed, Williams Angus R., Bright Jonathan, Gabasova Evelina
- Scaling Synthetic Data Creation With 1,000,000,000 Personas Chan Xin, Wang Xiaoyang, Yu Dian, Mi Haitao, Yu Dong
- Benchmarking Cognitive Domains For Llms: Insights From Taiwanese Hakka Culture Chang Chen-chi, Chen Ching-yuan, Lee Hung-shin, Lee Chih-cheng
- Efficient Prompting Methods For Large Language Models: A Survey Chang Kaiyan, Xu Songcheng, Wang Chenglong, Luo Yingfeng, Xiao Tong, Zhu Jingbo
- When Parts Are Greater Than Sums: Individual LLM Components Can Outperform Full Models Chang Ting-yun, Thomason Jesse, Jia Robin
- Detecting Hallucination And Coverage Errors In Retrieval Augmented Generation For Controversial Topics Chang Tyler A., Tomanek Katrin, Hoffmann Jessica, Thain Nithum, Van Liemt Erin, Meier-hellstern Kathleen, Dixon Lucas
- Injecting Salesperson's Dialogue Strategies In Large Language Models With Chain-of-thought Reasoning Chang Wen-yu, Chen Yun-nung
- Xprompt: Explaining Large Language Model's Generation Via Joint Prompt Attribution Chang Yurui, Cao Bochuan, Wang Yujia, Chen Jinghui, Lin Lu
- When Large Language Models Meet Evolutionary Algorithms Chao Wang, Zhao Jiaxuan, Jiao Licheng, Li Lingling, Liu Fang, Yang Shuyuan
- Make Large Language Model A Better Ranker Chao Wenshuo, Zheng Zhi, Zhu Hengshu, Liu Hao
- Quantitative Certification Of Bias In Large Language Models Chaudhary Isha, Hu Qian, Kumar Manoj, Ziyadi Morteza, Gupta Rahul, Singh Gagandeep
- Large Language Models As Instruments Of Power: New Regimes Of Autonomous Manipulation And Control Chaudhary Yaqub, Penn Jonnie
- Veagle: Advancements In Multimodal Representation Learning Chawla Rajat, Datta Arkajit, Verma Tushar, Jha Adarsh, Gautam Anmol, Vatsal Ayush, Chaterjee Sukrit, Ns Mukunda, Bhola Ishaan
- All Roads Lead To Rome: Unveiling The Trajectory Of Recommender Systems Across The LLM Era Chen Bo, Dai Xinyi, Guo Huifeng, Guo Wei, Liu Weiwen, Liu Yong, Qin Jiarui, Tang Ruiming, Wang Yichao, Wu Chuhan, Wu Yaxiong, Zhang Hao
- The Dark Side Of Human Feedback: Poisoning Large Language Models Via User Inputs Chen Bocheng, Guo Hanqing, Wang Guangjing, Wang Yuanda, Yan Qiben
- GUI-WORLD: A Dataset For Gui-oriented Multimodal Llm-based Agents Chen Dongping, Huang Yue, Wu Siyuan, Tang Jingyu, Chen Liuyi, Bai Yilin, He Zhigang, Wang Chenlong, Zhou Huichi, Li Yiqiang, Zhou Tianshuo, Yu Yue, Gao Chujie, Zhang Qihui, Gui Yi, Li Zhen, Wan Yao, Zhou Pan, Gao Jianfeng, Sun Lichao
- Generalizing Conversational Dense Retrieval Via Llm-cognition Data Augmentation Chen Haonan, Dou Zhicheng, Mao Kelong, Liu Jiongnan, Zhao Ziliang
- Efficiency In Focus: Layernorm As A Catalyst For Fine-tuning Medical Visual Language Pre-trained Models Chen Jiawei, Yang Dingkang, Jiang Yue, Li Mingcheng, Wei Jinjie, Hou Xiaolu, Zhang Lihua
- Automated Data Curation For Robust Language Model Fine-tuning Chen Jiuhai, Mueller Jonas
- Putting People In Llms' Shoes: Generating Better Answers Via Question Rewriter Chen Junhao, Wang Bowen, Jiang Zhouqiang, Nakashima Yuta
- Nlperturbator: Studying The Robustness Of Code Llms To Natural Language Variations Chen Junkai, Li Zhenhao, Hu Xing, Xia Xin
- Pixart-Σ: Weak-to-strong Training Of Diffusion Transformer For 4K Text-to-image Generation Chen Junsong, Ge Chongjian, Xie Enze, Wu Yue, Yao Lewei, Ren Xiaozhe, Wang Zhongdao, Luo Ping, Lu Huchuan, Li Zhenguo
- Instructioncp: A Fast Approach To Transfer Large Language Models Into Target Language Chen Kuang-ming, Lee Hung-yi
- Pca-bench: Evaluating Multimodal Large Language Models In Perception-cognition-action Chain Chen Liang, Zhang Yichi, Ren Shuhuai, Zhao Haozhe, Cai Zefan, Wang Yuchi, Wang Peiyi, Meng Xiangdi, Liu Tianyu, Chang Baobao
- Identifying Query-relevant Neurons In Large Language Models For Long-form Texts Chen Lihu, Dejl Adam, Toni Francesca
- Are We On The Right Way For Evaluating Large Vision-language Models? Chen Lin, Li Jinsong, Dong Xiaoyi, Zhang Pan, Zang Yuhang, Chen Zehui, Duan Haodong, Wang Jiaqi, Qiao Yu, Lin Dahua, Zhao Feng
- Graphwiz: An Instruction-following Language Model For Graph Problems Chen Nuo, Li Yuhan, Tang Jianheng, Li Jia
- Llm-based Multi-hop Question Answering With Knowledge Graph Integration In Evolving Environments Chen Ruirui, Jiang Weifeng, Qin Chengwei, Rawal Ishaan Singh, Tan Cheston, Choi Dongkyu, Xiong Bo, Ai Bo
- Editable Fairness: Fine-grained Bias Mitigation In Language Models Chen Ruizhe, Li Yichen, Yang Jianfei, Zhou Joey Tianyi, Liu Zuozhu
- In-context Sharpness As Alerts: An Inner Representation Perspective For Hallucination Mitigation Chen Shiqi, Xiong Miao, Liu Junteng, Wu Zhengxuan, Xiao Teng, Gao Siyang, He Junxian
- Reinstruct: Building Instruction Data From Unlabeled Corpus Chen Shu, Guan Xinyan, Lu Yaojie, Lin Hongyu, Han Xianpei, Sun Le
- Advancing Tool-augmented Large Language Models: Integrating Insights From Errors In Inference Trees Chen Sijia, Wang Yibo, Wu Yi-feng, Chen Qing-guo, Xu Zhao, Luo Weihua, Zhang Kaifu, Zhang Lijun
- Struq: Defending Against Prompt Injection With Structured Queries Chen Sizhe, Piet Julien, Sitawarin Chawin, Wagner David
- Intelligent Tutor: Leveraging Chatgpt And Microsoft Copilot Studio To Deliver A Generative AI Student Support And Feedback System Within Teams Chen Wei-yu
- GRATH: Gradual Self-truthifying For Large Language Models Chen Weixin, Song Dawn, Li Bo
- Beyond Natural Language: Llms Leveraging Alternative Formats For Enhanced Reasoning And Communication Chen Weize, Yuan Chenfei, Yuan Jiarui, Su Yusheng, Qian Chen, Yang Cheng, Xie Ruobing, Liu Zhiyuan, Sun Maosong
- Reprompt: Planning By Automatic Prompt Engineering For Large Language Models Agents Chen Weizhe, Koenig Sven, Dilkina Bistra
- SATO: Stable Text-to-motion Framework Chen Wenshuo, Xiao Hongru, Zhang Erhang, Hu Lijie, Wang Lei, Liu Mengyuan, Chen Chen
- Icleval: Evaluating In-context Learning Ability Of Large Language Models Chen Wentong, Lin Yankai, Zhou Zhenhao, Huang Hongyun, Jia Yantao, Cao Zhao, Wen Ji-rong
- Gazexplain: Learning To Predict Natural Language Explanations Of Visual Scanpaths Chen Xianyu, Jiang Ming, Zhao Qi
- Eyegpt: Ophthalmic Assistant With Large Language Models Chen Xiaolan, Zhao Ziwei, Zhang Weiyi, Xu Pusheng, Gao Le, Xu Mingpu, Wu Yue, Li Yinwen, Shi Danli, He Mingguang
- Premise Order Matters In Reasoning With Large Language Models Chen Xinyun, Chi Ryan A., Wang Xuezhi, Zhou Denny
- Iteralign: Iterative Constitutional Alignment Of Large Language Models Chen Xiusi, Wen Hongzhi, Nag Sreyashi, Luo Chen, Yin Qingyu, Li Ruirui, Li Zheng, Wang Wei
- RL-JACK: Reinforcement Learning-powered Black-box Jailbreaking Attack Against Llms Chen Xuan, Nie Yuzhou, Yan Lu, Mao Yunshu, Guo Wenbo, Zhang Xiangyu
- Can We Rely On LLM Agents To Draft Long-horizon Plans? Let's Take Travelplanner As An Example Chen Yanan, Pesaranghader Ali, Sadhu Tanmana, Yi Dong Hoon
- Extroversion Or Introversion? Controlling The Personality Of Your Large Language Models Chen Yanquan, Wu Zhen, Guo Junjie, Huang Shujian, Dai Xinyu
- Mofo: Momentum-filtered Optimizer For Mitigating Forgetting In LLM Fine-tuning Chen Yupeng, Wang Senmiao, Lin Zhihang, Qin Zeyu, Zhang Yushun, Ding Tian, Sun Ruoyu
- Towards Boosting Llms-driven Relevance Modeling With Progressive Retrieved Behavior-augmented Prompting Chen Zeyuan, Wu Haiyan, Wu Kaixin, Chen Wei, Zhong Mingjie, Xu Jia, Liu Zhongyi, Zhang Wei
- MLLM Is A Strong Reranker: Advancing Multimodal Retrieval-augmented Generation Via Knowledge-enhanced Reranking And Noise-injected Training Chen Zhanpeng, Xu Chengjin, Qi Yiyan, Guo Jian
- Autoprm: Automating Procedural Supervision For Multi-step Reasoning Via Controllable Question Decomposition Chen Zhaorun, Zhao Zhuokai, Zhu Zhihong, Zhang Ruiqi, Li Xiang, Raj Bhiksha, Yao Huaxiu
- Improving Retrieval Augmented Open-domain Question-answering With Vectorized Contexts Chen Zhuo, Wang Xinyu, Jiang Yong, Xie Pengjun, Huang Fei, Tu Kewei
- Emergence Of A High-dimensional Abstraction Phase In Language Transformers Cheng Emily, Doimo Diego, Kervadec Corentin, Macocco Iuri, Yu Jade, Laio Alessandro, Baroni Marco
- Empowering Large Language Models On Robotic Manipulation With Affordance Prompting Cheng Guangran, Zhang Chuheng, Cai Wenzhe, Zhao Li, Sun Changyin, Bian Jiang
- Potential And Limitations Of Llms In Capturing Structured Semantics: A Case Study On SRL Cheng Ning, Yan Zhaohui, Wang Ziming, Li Zhijie, Yu Jiaming, Zheng Zilong, Tu Kewei, Xu Jinan, Han Wenjuan
- Self-playing Adversarial Language Game Enhances LLM Reasoning Cheng Pengyu, Hu Tianhao, Xu Han, Zhang Zhisong, Dai Yong, Han Lei, Du Nan
- Can AI Assistants Know What They Don't Know? Cheng Qinyuan, Sun Tianxiang, Liu Xiangyang, Zhang Wenwei, Yin Zhangyue, Li Shimin, Li Linyang, He Zhengfu, Chen Kai, Qiu Xipeng
- BISCUIT: Scaffolding Llm-generated Code With Ephemeral Uis In Computational Notebooks Cheng Ruijia, Barik Titus, Leung Alan, Hohman Fred, Nichols Jeffrey
- Towards Achieving Human Parity On End-to-end Simultaneous Speech Translation Via LLM Agent Cheng Shanbo, Huang Zhichao, Ko Tom, Li Hang, Peng Ningxin, Xu Lu, Zhang Qini
- Information Re-organization Improves Reasoning In Large Language Models Cheng Xiaoxia, Tan Zeqi, Xue Wei, Lu Weiming
- (A)I Am Not A Lawyer, But...: Engaging Legal Experts Towards Responsible LLM Policies For Legal Advice Cheong Inyoung, Xia King, Feng K. J. Kevin, Chen Quan Ze, Zhang Amy X.
- Talking Nonsense: Probing Large Language Models' Understanding Of Adversarial Gibberish Inputs Cherepanova Valeriia, Zou James
- ANOLE: An Open, Autoregressive, Native Large Multimodal Models For Interleaved Image-text Generation Chern Ethan, Su Jiadi, Ma Yan, Liu Pengfei
- Can Large Language Models Be Trusted For Evaluation? Scalable Meta-evaluation Of Llms As Evaluators Via Agent Debate Chern Steffi, Chern Ethan, Neubig Graham, Liu Pengfei
- Walert: Putting Conversational Search Knowledge Into Action By Building And Evaluating A Large Language Model-powered Chatbot Cherumanal Sachin Pathiyan, Tian Lin, Abushaqra Futoon M., De Paula Angel Felipe Magnossao, Ji Kaixin, Hettiachchi Danula, Trippas Johanne R., Ali Halil, Scholer Falk, Spina Damiano
- A Reality Check Of The Benefits Of LLM In Business Cheung Ming
- Revisiting Zero-shot Abstractive Summarization In The Era Of Large Language Models From The Perspective Of Position Bias Chhabra Anshuman, Askari Hadi, Mohapatra Prasant
- Large Language Model As An Assignment Evaluator: Insights, Feedback, And Challenges In A 1000+ Student Course Chiang Cheng-han, Chen Wei-chih, Kuan Chun-yi, Yang Chienchou, Lee Hung-yi
- Chatbot Arena: An Open Platform For Evaluating Llms By Human Preference Chiang Wei-lin, Zheng Lianmin, Sheng Ying, Angelopoulos Anastasios Nikolas, Li Tianle, Li Dacheng, Zhang Hao, Zhu Banghua, Jordan Michael, Gonzalez Joseph E., Stoica Ion
- Human-centered Llm-agent User Interface: A Position Paper Chin Daniel, Wang Yuxuan, Xia Gus
- A Computational Framework For Behavioral Assessment Of LLM Therapists Chiu Yu Ying, Sharma Ashish, Lin Inna Wanyin, Althoff Tim
- Pragmatic Inference Of Scalar Implicature By Llms Cho Ye-eun, Kim Seong Mook
- Optimizing Language Augmentation For Multilingual Large Language Models: A Case Study On Korean Choi Changsu, Jeong Yongbin, Park Seoyoon, Won Inho, Lim Hyeonseok, Kim Sangmin, Kang Yejee, Yoon Chanhyuk, Park Jaewan, Lee Yiseul, Lee Hyejin, Hahm Younggyun, Kim Hansaem, Lim Kyungtae
- When Life Gives You Llms, Make LLM-ADE: Large Language Models With Adaptive Data Engineering Choi Stephen, Gazeley William
- Large Language Models And User Trust: Consequence Of Self-referential Learning Loop And The Deskilling Of Healthcare Professionals Choudhury Avishek, Chaudhry Zaria
- Gemmar: Enhancing Llms Through Arabic Instruction-tuning Chouikhi Hasna, Aloui Manel, Hammou Cyrine Ben, Chaabane Ghaith, Kchaou Haithem, Dhaouadi Chehir
- Think Together And Work Better: Combining Humans' And Llms' Think-aloud Outcomes For Effective Text Evaluation Chu Seongyeub, Kim Jongwoo, Yi Munyong
- Professional Agents -- Evolving Large Language Models Into Autonomous Experts With Human-level Competencies Chu Zhixuan, Wang Yan, Zhu Feng, Yu Lu, Li Longfei, Gu Jinjie
- Beyond Llms: Advancing The Landscape Of Complex Reasoning Chu-carroll Jennifer, Beck Andrew, Burnham Greg, Melville David Os, Nachman David, Özcan A. Erdem, Ferrucci David
- AI Safety In Generative AI Large Language Models: A Survey Chua Jaymari, Li Yun, Yang Shiyi, Wang Chen, Yao Lina
- Learning To Compress Prompt In Natural Language Formats Chuang Yu-neng, Xing Tianwei, Chang Chia-yuan, Liu Zirui, Chen Xun, Hu Xia
- Developing Generative AI Chatbots Conceptual Framework For Higher Education Chukwuere Joshua Ebere
- Informed AI Regulation: Comparing The Ethical Frameworks Of Leading LLM Chatbots Using An Ethics-based Audit To Assess Moral Reasoning And Normative Values Chun Jon, Elkins Katherine
- Stable Language Model Pre-training By Reducing Embedding Variability Chung Woojin, Hong Jiwoo, An Na Min, Thorne James, Yun Se-young
- Cogbench: A Large Language Model Walks Into A Psychology Lab Coda-forno Julian, Binz Marcel, Wang Jane X., Schulz Eric
- Improving Token-based World Models With Parallel Observation Prediction Cohen Lior, Wang Kaixin, Kang Bingyi, Mannor Shie
- Modulating Language Model Experiences Through Frictions Collins Katherine M., Chen Valerie, Sucholutsky Ilia, Kirk Hannah Rose, Sadek Malak, Sargeant Holli, Talwalkar Ameet, Weller Adrian, Bhatt Umang
- Ragsys: Item-cold-start Recommender As RAG System Contal Emile, Mcgoldrick Garrin
- LLASP: Fine-tuning Large Language Models For Answer Set Programming Coppolillo Erica, Calimeri Francesco, Manco Giuseppe, Perri Simona, Ricca Francesco
- Out-of-context Prompting Boosts Fairness And Robustness In Large Language Model Predictions Cotta Leonardo, Maddison Chris J.
- Shortcomings Of Llms For Low-resource Translation: Retrieval And Understanding Are Both The Problem Court Sara, Elsner Micha
- Sambalingo: Teaching Large Language Models New Languages Csaki Zoltan, Li Bo, Li Jonathan, Xu Qiantong, Pawakapan Pian, Zhang Leon, Du Yun, Zhao Hengyu, Hu Changran, Thakker Urmish
- Efficiently Exploring Large Language Models For Document-level Machine Translation With In-context Learning Cui Menglong, Du Jiangcun, Zhu Shaolin, Xiong Deyi
- Insights From Benchmarking Frontier Language Models On Web App Code Generation Cui Yi
- Rethinking LLM Language Adaptation: A Case Study On Chinese Mixtral Cui Yiming, Yao Xin
- Can AI Replace Human Subjects? A Large-scale Replication Of Psychological Experiments With Llms Cui Ziyan, Li Ning, Zhou Huaikang
- Anchored Preference Optimization And Contrastive Revisions: Addressing Underspecification In Alignment D'oosterlinck Karel, Xu Winnie, Develder Chris, Demeester Thomas, Singh Amanpreet, Potts Christopher, Kiela Douwe, Mehri Shikib
- Stepwise Verification And Remediation Of Student Reasoning Errors With Large Language Model Tutors Daheim Nico, Macina Jakub, Kapur Manu, Gurevych Iryna, Sachan Mrinmaya
- CORM: Cache Optimization With Recent Message For Large Language Model Inference Dai Jincheng, Huang Zhuowei, Jiang Haiyun, Chen Chen, Cai Deng, Bi Wei, Shi Shuming
- Muap: Multi-step Adaptive Prompt Learning For Vision-language Model With Missing Modality Dai Ruiting, Tan Yuqiao, Mo Lisi, He Tao, Qin Ke, Liang Shuang
- ACT-MNMT Auto-constriction Turning For Multilingual Neural Machine Translation Dai Shaojie, Liu Xin, Luo Ping, Yu Yue
- Generating Code World Models With Large Language Models Guided By Monte Carlo Tree Search Dainese Nicola, Merler Matteo, Alakuijala Minttu, Marttinen Pekka
- Faculty Perspectives On The Potential Of RAG In Computer Science Higher Education Dakshit Sagnik
- Inference To The Best Explanation In Large Language Models Dalal Dhairya, Valentino Marco, Freitas André, Buitelaar Paul
- Impact Of Non-standard Unicode Characters On Security And Comprehension In Large Language Models Daniel Johan S, Pal Anand
- Active Preference Optimization For Sample Efficient RLHF Das Nirjhar, Chakraborty Souradip, Pacchiano Aldo, Chowdhury Sayak Ray
- Entropy Guided Extrapolative Decoding To Improve Factuality In Large Language Models Das Souvik, Jin Lifeng, Song Linfeng, Mi Haitao, Peng Baolin, Yu Dong
- Xmainframe: A Large Language Model For Mainframe Modernization Dau Anh T. V., Dao Hieu Trung, Nguyen Anh Tuan, Tran Hieu Trung, Nguyen Phong X., Bui Nghi D. Q.
- Investigating Symbolic Capabilities Of Large Language Models Dave Neisarg, Kifer Daniel, Giles C. Lee, Mali Ankur
- Prompting Open-source And Commercial Language Models For Grammatical Error Correction Of English Learner Text Davis Christopher, Caines Andrew, Andersen Øistein, Taslimipoor Shiva, Yannakoudakis Helen, Yuan Zheng, Bryant Christopher, Rei Marek, Buttery Paula
- Aligning Model Evaluations With Human Preferences: Mitigating Token Count Bias In Language Model Assessments Daynauth Roland, Mars Jason
- Application Of GPT Language Models For Innovation In Activities In University Teaching De Buenaga Manuel, Bueno Francisco Javier
- Show, Don't Tell: Evaluating Large Language Models Beyond Textual Understanding With Childplay De Carvalho Gonçalo Hora, Knap Oscar, Pollice Robert
- Thematic Analysis With Large Language Models: Does It Work With Languages Other Than English? A Targeted Test In Italian De Paoli Stefano
- Will GPT-4 Run DOOM? De Wynter Adrian
- RTP-LX: Can Llms Evaluate Toxicity In Multilingual Scenarios? De Wynter Adrian, Watts Ishaan, Altıntoprak Nektar Ege, Wongsangaroonsri Tua, Zhang Minghui, Farra Noura, Baur Lena, Claudet Samantha, Gajdusek Pavel, Gören Can, Gu Qilong, Kaminska Anna, Kaminski Tomasz, Kuo Ruby, Kyuba Akiko, Lee Jongho, Mathur Kartik, Merok Petter, Milovanović Ivana, Paananen Nani, Paananen Vesa-matti, Pavlenko Anna, Vidal Bruno Pereira, Strika Luciano, Tsao Yueh, Turcato Davide, Vakhno Oleksandr, Velcsov Judit, Vickers Anna, Visser Stéphanie, Widarmanto Herdyan, Zaikin Andrey, Chen Si-qing
- Deepseek-v2: A Strong, Economical, And Efficient Mixture-of-experts Language Model Deepseek-ai, Aixin Liu, Bei Feng, Bin Wang, Bingxuan Wang, Bo Liu, Chenggang Zhao, Chengqi Dengr, Chong Ruan, Damai Dai, Daya Guo, Dejian Yang, Deli Chen, Dongjie Ji, Erhang Li, Fangyun Lin, Fuli Luo, Guangbo Hao, Guanting Chen, Guowei Li, H. Zhang, Hanwei Xu, Hao Yang, Haowei Zhang, Honghui Ding, Huajian Xin, Huazuo Gao, Hui Li, Hui Qu, J. L. Cai, Jian Liang, Jianzhong Guo, Jiaqi Ni, Jiashi Li, Jin Chen, Jingyang Yuan, Junjie Qiu, Junxiao Song, Kai Dong, Kaige Gao, Kang Guan, Lean Wang, Lecong Zhang, Lei Xu, Leyi Xia, Liang Zhao, Liyue Zhang, Meng Li, Miaojun Wang, Mingchuan Zhang, Minghua Zhang, Minghui Tang, Mingming Li, Ning Tian, Panpan Huang, Peiyi Wang, Peng Zhang, Qihao Zhu, Qinyu Chen, Qiushi Du, R. J. Chen, R. L. Jin, Ruiqi Ge, Ruizhe Pan, Runxin Xu, Ruyi Chen, S. S. Li, Shanghao Lu, Shangyan Zhou, Shanhuang Chen, Shaoqing Wu, Shengfeng Ye, Shirong Ma, Shiyu Wang, Shuang Zhou, Shuiping Yu, Shunfeng Zhou, Size Zheng, T. Wang, Tian Pei, Tian Yuan, Tianyu Sun, W. L. Xiao, Wangding Zeng, Wei An, Wen Liu, Wenfeng Liang, Wenjun Gao, Wentao Zhang, X. Q. Li, Xiangyue Jin, Xianzu Wang, Xiao Bi, Xiaodong Liu, Xiaohan Wang, Xiaojin Shen, Xiaokang Chen, Xiaosha Chen, Xiaotao Nie, Xiaowen Sun, Xiaoxiang Wang, Xin Liu, Xin Xie, Xingkai Yu, Xinnan Song, Xinyi Zhou, Xinyu Yang, Xuan Lu, Xuecheng Su, Y. Wu, Y. K. Li, Y. X. Wei, Y. X. Zhu, Yanhong Xu, Yanping Huang, Yao Li, Yao Zhao, Yaofeng Sun, Yaohui Li, Yaohui Wang, Yi Zheng, Yichao Zhang, Yiliang Xiong, Yilong Zhao, Ying He, Ying Tang, Yishi Piao, Yixin Dong, Yixuan Tan, Yiyuan Liu, Yongji Wang, Yongqiang Guo, Yuchen Zhu, Yuduan Wang, Yuheng Zou, Yukun Zha, Yunxian Ma, Yuting Yan, Yuxiang You, Yuxuan Liu, Z. Z. Ren, Zehui Ren, Zhangli Sha, Zhe Fu, Zhen Huang, Zhen Zhang, Zhenda Xie, Zhewen Hao, Zhihong Shao, Zhiniu Wen, Zhipeng Xu, Zhongyu Zhang, Zhuoshu Li, Zihan Wang, Zihui Gu, Zilin Li, Ziwei Xie
- Polyrating: A Cost-effective And Bias-aware Rating System For LLM Evaluation Dekoninck Jasper, Baader Maximilian, Vechev Martin
- Understanding Biases In Chatgpt-based Recommender Systems: Provider Fairness, Temporal Stability, And Recency Deldjoo Yashar
- MIMIR: A Streamlined Platform For Personalized Agent Tuning In Domain Expertise Deng Chunyuan, Tang Xiangru, Zhao Yilun, Wang Hanming, Wang Haoran, Zhou Wangchunshu, Cohan Arman, Gerstein Mark
- Pandora: Jailbreak Gpts By Retrieval Augmented Generation Poisoning Deng Gelei, Liu Yi, Wang Kailong, Li Yuekang, Zhang Tianwei, Liu Yang
- Can LLM Be A Good Path Planner Based On Prompt Engineering? Mitigating The Hallucination For Path Planning Deng Hourui, Zhang Hongjie, Ou Jie, Feng Chaosheng
- On The Multi-turn Instruction Following For Conversational Web Agents Deng Yang, Zhang Xuan, Zhang Wenxuan, Yuan Yifei, Ng See-kiong, Chua Tat-seng
- Attention Is Naturally Sparse With Gaussian Distributed Input Deng Yichuan, Song Zhao, Yang Chiwun
- LMGT: Optimizing Exploration-exploitation Balance In Reinforcement Learning Through Language Model Guided Trade-offs Deng Yongxin, Qiu Xihe, Tan Xiaoyu, Chu Wei, Xu Yinghui
- Sycophancy To Subterfuge: Investigating Reward-tampering In Large Language Models Denison Carson, Macdiarmid Monte, Barez Fazl, Duvenaud David, Kravec Shauna, Marks Samuel, Schiefer Nicholas, Soklaski Ryan, Tamkin Alex, Kaplan Jared, Shlegeris Buck, Bowman Samuel R., Perez Ethan, Hubinger Evan
- Glam: Fine-tuning Large Language Models For Domain Knowledge Graph Alignment Via Neighborhood Partitioning And Generative Subgraph Encoding Dernbach Stefan, Agarwal Khushbu, Zuniga Alejandro, Henry Michael, Choudhury Sutanay
- Promises, Outlooks And Challenges Of Diffusion Language Modeling Deschenaux Justin, Gulcehre Caglar
- Class-level Code Generation From Natural Language Using Iterative, Tool-enhanced Reasoning Over Repository Deshpande Ajinkya, Agarwal Anmol, Shet Shashank, Iyer Arun, Kanade Aditya, Bairi Ramakrishna, Parthasarathy Suresh
- Exploring Prompt Engineering Practices In The Enterprise Desmond Michael, Brachman Michelle
- Are You Sure? Rank Them Again: Repeated Ranking For Better Preference Datasets Devine Peter
- From Words To Worlds: Compositionality For Cognitive Architectures Dhar Ruchira, Søgaard Anders
- Data Augmentation Using Large Language Models: Data Perspectives, Learning Paradigms And Challenges Ding Bosheng, Qin Chengwei, Zhao Ruochen, Luo Tianze, Li Xinze, Chen Guizhen, Xia Wenhan, Hu Junjie, Luu Anh Tuan, Joty Shafiq
- Mastering Text, Code And Math Simultaneously Via Fusing Highly Specialized Language Models Ding Ning, Chen Yulin, Cui Ganqu, Lv Xingtai, Zhao Weilin, Xie Ruobing, Zhou Bowen, Liu Zhiyuan, Sun Maosong
- MANGO: A Benchmark For Evaluating Mapping And Navigation Abilities Of Large Language Models Ding Peng, Fang Jiading, Li Peng, Wang Kangrui, Zhou Xiaochen, Yu Mo, Li Jing, Walter Matthew R., Mei Hongyuan
- Hallu-pi: Evaluating Hallucination In Multi-modal Large Language Models Within Perturbed Inputs Ding Peng, Wu Jingyu, Kuang Jun, Ma Dan, Cao Xuezhi, Cai Xunliang, Chen Shi, Chen Jiajun, Huang Shujian
- Boosting Large Language Models With Socratic Method For Conversational Mathematics Teaching Ding Yuyang, Hu Hanglei, Zhou Jie, Chen Qin, Jiang Bo, He Liang
- Large Language Models Prompting With Episodic Memory Do Dai, Tran Quan, Venkatesh Svetha, Le Hung
- Finding Blind Spots In Evaluator Llms With Interpretable Checklists Doddapaneni Sumanth, Khan Mohammed Safi Ur Rahman, Verma Sshubam, Khapra Mitesh M.
- Evaluating Linguistic Capabilities Of Multimodal Llms In The Lens Of Few-shot Learning Dogan Mustafa, Kesen Ilker, Calixto Iacer, Erdem Aykut, Erdem Erkut
- Latent Attention For Linear Time Transformers Dolga Rares, Cobzarenco Marius, Barber David
- Self-play With Execution Feedback: Improving Instruction-following Capabilities Of Large Language Models Dong Guanting, Lu Keming, Li Chengpeng, Xia Tingyu, Yu Bowen, Zhou Chang, Zhou Jingren
- Evaluating And Mitigating Linguistic Discrimination In Large Language Models Dong Guoliang, Wang Haoyu, Sun Jun, Wang Xinyu
- RLHF Workflow: From Reward Modeling To Online RLHF Dong Hanze, Xiong Wei, Pang Bo, Wang Haoxiang, Zhao Han, Zhou Yingbo, Jiang Nan, Sahoo Doyen, Xiong Caiming, Zhang Tong
- Pruner-zero: Evolving Symbolic Pruning Metric From Scratch For Large Language Models Dong Peijie, Li Lujun, Tang Zhenheng, Liu Xiang, Pan Xinglin, Wang Qiang, Chu Xiaowen
- QAQ: Quality Adaptive Quantization For LLM KV Cache Dong Shichen, Cheng Wen, Qin Jiayu, Wang Wei
- Internlm-xcomposer2-4khd: A Pioneering Large Vision-language Model Handling Resolutions From 336 Pixels To 4K HD Dong Xiaoyi, Zhang Pan, Zang Yuhang, Cao Yuhang, Wang Bin, Ouyang Linke, Zhang Songyang, Duan Haodong, Zhang Wenwei, Li Yining, Yan Hang, Gao Yang, Chen Zhe, Zhang Xinyue, Li Wei, Li Jingwen, Wang Wenhai, Chen Kai, He Conghui, Zhang Xingcheng, Dai Jifeng, Qiao Yu, Lin Dahua, Wang Jiaqi
- A Framework For Real-time Safeguarding The Text Generation Of Large Language Model Dong Ximing, Lin Dayi, Wang Shaowei, Hassan Ahmed E.
- Can LLM Be A Personalized Judge? Dong Yijiang River, Hu Tiancheng, Collier Nigel
- How Reliable Are Automatic Evaluation Methods For Instruction-tuned Llms? Doostmohammadi Ehsan, Holmström Oskar, Kuhlmann Marco
- Learning From Students: Applying T-distributions To Explore Accurate And Efficient Formats For Llms Dotzel Jordan, Chen Yuzong, Kotb Bahaa, Prasad Sushma, Wu Gang, Li Sheng, Abdelfattah Mohamed S., Zhang Zhiru
- Integrating Physician Diagnostic Logic Into Large Language Models: Preference Learning From Process Feedback Dou Chengfeng, Jin Zhi, Jiao Wenpin, Zhao Haiyan, Zhao Yongqiang, Tao Zhenwei
- What's Wrong With Your Code Generated By Large Language Models? An Extensive Study Dou Shihan, Jia Haoxiang, Wu Shenxi, Zheng Huiyuan, Zhou Weikang, Wu Muling, Chai Mingxu, Fan Jessica, Huang Caishuang, Tao Yunbo, Liu Yan, Zhou Enyu, Zhang Ming, Zhou Yuhao, Wu Yueming, Zheng Rui, Wen Ming, Weng Rongxiang, Wang Jingang, Cai Xunliang, Gui Tao, Qiu Xipeng, Zhang Qi, Huang Xuanjing
- Stepcoder: Improve Code Generation With Reinforcement Learning From Compiler Feedback Dou Shihan, Liu Yan, Jia Haoxiang, Xiong Limao, Zhou Enyu, Shen Wei, Shan Junjie, Huang Caishuang, Wang Xiao, Fan Xiaoran, Xi Zhiheng, Zhou Yuhao, Ji Tao, Zheng Rui, Zhang Qi, Huang Xuanjing, Gui Tao
- Assessing The Reasoning Abilities Of Chatgpt In The Context Of Claim Verification Dougrez-lewis John, Akhter Mahmud Elahi, He Yulan, Liakata Maria
- Towards Multimodal In-context Learning For Vision & Language Models Doveh Sivan, Perek Shaked, Mirza M. Jehanzeb, Lin Wei, Alfassy Amit, Arbelle Assaf, Ullman Shimon, Karlinsky Leonid
- Evaluating Language Model Context Windows: A "working Memory" Test And Inference-time Correction Dsouza Amanda, Glaze Christopher, Shin Changho, Sala Frederic
- In-context Learning With Reinforcement Learning For Incomplete Utterance Rewriting Du Haowei, Zhao Dongyan
- Unlocking Continual Learning Abilities In Language Models Du Wenyu, Cheng Shuang, Luo Tongxu, Qiu Zihan, Huang Zeyu, Cheung Ka Chun, Cheng Reynold, Fu Jie
- Stacking Your Transformers: A Closer Look At Model Growth For Efficient LLM Pre-training Du Wenyu, Luo Tongxu, Qiu Zihan, Huang Zeyu, Shen Yikang, Cheng Reynold, Guo Yike, Fu Jie
- Revisiting Moe And Dense Speed-accuracy Comparisons For LLM Training Du Xianzhi, Gunter Tom, Kong Xiang, Lee Mark, Wang Zirui, Zhang Aonan, Du Nan, Pang Ruoming
- Perltqa: A Personal Long-term Memory Dataset For Memory Classification, Retrieval, And Synthesis In Question Answering Du Yiming, Wang Hongru, Zhao Zhengyi, Liang Bin, Wang Baojun, Zhong Wanjun, Wang Zezhong, Wong Kam-fai
- Instruction-guided Scene Text Recognition Du Yongkun, Chen Zhineng, Su Yuchen, Jia Caiyan, Jiang Yu-gang
- CLEFT: Language-image Contrastive Learning With Efficient Large Language Model And Prompt Fine-tuning Du Yuexi, Chang Brian, Dvornek Nicha C.
- On The Evaluation Consistency Of Attribution-based Explanations Duan Jiarui, Li Haoling, Zhang Haofei, Jiang Hao, Xue Mengqi, Sun Li, Song Mingli, Song Jie
- Prompting Multi-modal Tokens To Enhance End-to-end Autonomous Driving Imitation Learning With Llms Duan Yiqun, Zhang Qiang, Xu Renjing
- Leveraging Machine-generated Rationales To Facilitate Social Meaning Detection In Conversations Dutt Ritam, Wu Zhen, Shi Kelly, Sheth Divyanshu, Gupta Prakhar, Rose Carolyn Penstein
- Applying RLAIF For Code Generation With Api-usage In Lightweight Llms Dutta Sujan, Mahinder Sayantan, Anantha Raviteja, Bandyopadhyay Bortik
- Your Context Is Not An Array: Unveiling Random Access Limitations In Transformers Ebrahimi Mohammadreza, Panchal Sunny, Memisevic Roland
- Lusifer: Llm-based User Simulated Feedback Environment For Online Recommender Systems Ebrat Danial, Rueda Luis
- Cognitive Bias In High-stakes Decision-making With Llms Echterhoff Jessica, Liu Yao, Alessa Abeer, Mcauley Julian, He Zexue
- Relation Extraction With Fine-tuned Large Language Models In Retrieval Augmented Generation Frameworks Efeoglu Sefika, Paschke Adrian
- Reducing Selection Bias In Large Language Models Eicher J. E., Irgolič R. F.
- Persuasiveness Of Generated Free-text Rationales In Subjective Decisions: A Case Study On Pairwise Argument Ranking Elaraby Mohamed, Litman Diane, Li Xiang Lorraine, Magooda Ahmed
- Ocassionally Secure: A Comparative Analysis Of Code Generation Assistants Elgedawy Ran, Sadik John, Dutta Senjuti, Gautam Anuj, Georgiou Konstantinos, Gholamrezae Farzin, Ji Fujiao, Lim Kyungchan, Liu Qian, Ruoti Scott
- Layerskip: Enabling Early Exit Inference And Self-speculative Decoding Elhoushi Mostafa, Shrivastava Akshat, Liskovich Diana, Hosmer Basil, Wasti Bram, Lai Liangzhen, Mahmoud Anas, Acun Bilge, Agarwal Saurabh, Roman Ahmed, Aly Ahmed A, Chen Beidi, Wu Carole-jean
- Kapqa: Knowledge-augmented Product Question-answering Eppalapally Swetha, Dangi Daksh, Bhat Chaithra, Gupta Ankita, Zhang Ruiyi, Agarwal Shubham, Bagga Karishma, Yoon Seunghyun, Lipka Nedim, Rossi Ryan A., Dernoncourt Franck
- Empirical Studies Of Parameter Efficient Methods For Large Language Models Of Code And Knowledge Transfer To R Esmaeili Amirreza, Saberi Iman, Fard Fatemeh H.
- Low-cost Language Models: Survey And Performance Evaluation On Python Code Generation Espejel Jessica López, Alassan Mahaman Sanoussi Yahaya, Bouhandi Merieme, Dahhane Walid, Ettifouri El Hassane
- Bertaqa: How Much Do Language Models Know About Local Culture? Etxaniz Julen, Azkune Gorka, Soroa Aitor, De Lacalle Oier Lopez, Artetxe Mikel
- Data-augmentation-based Dialectal Adaptation For Llms Faisal Fahim, Anastasopoulos Antonios
- Limgen: Probing The Llms For Generating Suggestive Limitations Of Research Papers Faizullah Abdur Rahman Bin Md, Urlana Ashok, Mishra Rahul
- Reformatted Alignment Fan Run-ze, Li Xuefeng, Zou Haoyang, Li Junlong, He Shwai, Chern Ethan, Hu Jiewen, Liu Pengfei
- Navigation Instruction Generation With BEV Perception And Large Language Models Fan Sheng, Liu Rui, Wang Wenguan, Yang Yi
- A Survey On RAG Meeting Llms: Towards Retrieval-augmented Large Language Models Fan Wenqi, Ding Yujuan, Ning Liangbo, Wang Shijie, Li Hengyun, Yin Dawei, Chua Tat-seng, Li Qing
- Prompt Optimizer Of Text-to-image Diffusion Models For Abstract Concept Understanding Fan Zezhong, Li Xiaohan, Fang Chenhao, Biswas Topojoy, Nag Kaushiki, Xu Jianpeng, Achan Kannan
- Enhancing Noise Robustness Of Retrieval-augmented Language Models With Adaptive Adversarial Training Fang Feiteng, Bai Yuelin, Ni Shiwen, Yang Min, Chen Xiaojun, Xu Ruifeng
- TRACE The Evidence: Constructing Knowledge-grounded Reasoning Chains For Retrieval-augmented Generation Fang Jinyuan, Meng Zaiqiao, Macdonald Craig
- Large Language Models Are Neurosymbolic Reasoners Fang Meng, Deng Shilong, Zhang Yudi, Shi Zijing, Chen Ling, Pechenizkiy Mykola, Wang Jun
- PATCH! Psychometrics-assisted Benchmarking Of Large Language Models: A Case Study Of Proficiency In 8th Grade Mathematics Fang Qixiang, Oberski Daniel L., Nguyen Dong
- Integrating A.I. In Higher Education: Protocol For A Pilot Study With 'samcares: An Adaptive Learning Hub' Faruqui Syed Hasib Akhter, Tasnim Nazia, Basith Iftekhar Ibne, Obeidat Suleiman, Yildiz Faruk
- Test Of Time: A Benchmark For Evaluating Llms On Temporal Reasoning Fatemi Bahare, Kazemi Mehran, Tsitsulin Anton, Malkan Karishma, Yim Jinyeong, Palowitch John, Seo Sungyong, Halcrow Jonathan, Perozzi Bryan
- Multi-modal Hallucination Control By Visual Information Grounding Favero Alessandro, Zancato Luca, Trager Matthew, Choudhary Siddharth, Perera Pramuditha, Achille Alessandro, Swaminathan Ashwin, Soatto Stefano
- Curlora: Stable LLM Continual Fine-tuning And Catastrophic Forgetting Mitigation Fawi Muhammad
- Prompt Exploration With Prompt Regression Feffer Michael, Xu Ronald, Sun Yuekai, Yurochkin Mikhail
- Video-ccam: Enhancing Video-language Understanding With Causal Cross-attention Masks For Short And Long Videos Fei Jiajun, Li Dian, Deng Zhidong, Wang Zekun, Liu Gang, Wang Hui
- Internlm-law: An Open Source Chinese Legal Large Language Model Fei Zhiwei, Zhang Songyang, Shen Xiaoyu, Zhu Dawei, Wang Xiao, Cao Maosong, Zhou Fengzhe, Li Yining, Zhang Wenwei, Lin Dahua, Chen Kai, Ge Jidong
- Ragged Edges: The Double-edged Sword Of Retrieval-augmented Chatbots Feldman Philip, Foulds James R., Pan Shimei
- Legend: Leveraging Representation Engineering To Annotate Safety Margin For Preference Datasets Feng Duanyu, Qin Bowen, Huang Chen, Huang Youcheng, Zhang Zheng, Lei Wenqiang
- Citygpt: Empowering Urban Spatial Cognition Of Large Language Models Feng Jie, Du Yuwei, Liu Tianhui, Guo Siqi, Lin Yuming, Li Yong
- AGILE: A Novel Framework Of LLM Agents Feng Peiyuan, He Yichen, Huang Guanhua, Lin Yuan, Zhang Hanchong, Zhang Yuchen, Li Hang
- Modular Pluralism: Pluralistic Alignment Via Multi-llm Collaboration Feng Shangbin, Sorensen Taylor, Liu Yuhan, Fisher Jillian, Park Chan Young, Choi Yejin, Tsvetkov Yulia
- An Extremely Data-efficient And Generative Llm-based Reinforcement Learning Agent For Recommenders Feng Shuang, Feng Grace
- Courseassist: Pedagogically Appropriate AI Tutor For Computer Science Education Feng Ty, Liu Sa, Ghosal Dipak
- Exploring Automated Distractor Generation For Math Multiple-choice Questions Via Large Language Models Feng Wanyong, Lee Jaewook, Mcnichols Hunter, Scarlatos Alexander, Smith Digory, Woodhead Simon, Ornelas Nancy Otero, Lan Andrew
- Large Language Model-based Human-agent Collaboration For Complex Task Solving Feng Xueyang, Chen Zhi-yuan, Qin Yujia, Lin Yankai, Chen Xu, Liu Zhiyuan, Wen Ji-rong
- Generative AI And Large Language Models For Cyber Security: All Insights You Need Ferrag Mohamed Amine, Alwahedi Fatima, Battah Ammar, Cherif Bilel, Mechri Abdechakour, Tihanyi Norbert
- Toward Large Language Models As A Therapeutic Tool: Comparing Prompting Techniques To Improve Gpt-delivered Problem-solving Therapy Filienko Daniil, Wang Yinzhou, Jazmi Caroline El, Xie Serena, Cohen Trevor, De Cock Martine, Yuwen Weichao
- Grillbot In Practice: Lessons And Tradeoffs Deploying Large Language Models For Adaptable Conversational Task Assistants Fischer Sophie, Gemmell Carlos, Tecklenburg Niklas, Mackie Iain, Rossetto Federico, Dalton Jeffrey
- Contrastive Policy Gradient: Aligning Llms On Sequence-level Scores In A Supervised-friendly Fashion Flet-berliac Yannis, Grinsztajn Nathan, Strub Florian, Choi Eugene, Cremer Chris, Ahmadian Arash, Chandak Yash, Azar Mohammad Gheshlaghi, Pietquin Olivier, Geist Matthieu
- Geckopt: LLM System Efficiency Via Intent-based Tool Selection Fore Michael, Singh Simranjit, Stamoulis Dimitrios
- Conversational Assistants In Knowledge-intensive Contexts: An Evaluation Of LLM- Versus Intent-based Systems Freire Samuel Kernan, Wang Chaofan, Niforatos Evangelos
- Representing Rule-based Chatbots With Transformers Friedman Dan, Panigrahi Abhishek, Chen Danqi
- An Application Of Large Language Models To Coding Negotiation Transcripts Friedman Ray, Cho Jaewoo, Brett Jeanne, Zhan Xuhui, Han Ningyu, Kannan Sriram, Ma Yingxiang, Spencer-smith Jesse, Jäckel Elisabeth, Zerres Alfred, Hooper Madison, Babbit Katie, Acharya Manish, Adair Wendi, Aslani Soroush, Aykaç Tayfun, Bauman Chris, Bennett Rebecca, Brady Garrett, Briggs Peggy, Dowie Cheryl, Eck Chase, Geiger Igmar, Jacob Frank, Kern Molly, Lee Sujin, Liu Leigh Anne, Liu Wu, Loewenstein Jeffrey, Lytle Anne, Ma Li, Mann Michel, Mislin Alexandra, Mitchell Tyree, Nagler Hannah Martensen Née, Nandkeolyar Amit, Olekalns Mara, Paliakova Elena, Parlamis Jennifer, Pierce Jason, Pierce Nancy, Pinkley Robin, Prime Nathalie, Ramirez-marin Jimena, Rockmann Kevin, Ross William, Semnani-azad Zhaleh, Schroeder Juliana, Smith Philip, Stimmer Elena, Swaab Roderick, Thompson Leigh, Tinsley Cathy, Tuncel Ece, Weingart Laurie, Wilken Robert, Yao Jingjing, Zhang Zhi-xue
- Hint-before-solving Prompting: Guiding Llms To Effectively Utilize Encoded Knowledge Fu Jinlan, Huangfu Shenzhen, Yan Hang, Ng See-kiong, Qiu Xipeng
- Scene-llm: Extending Language Model For 3D Visual Understanding And Reasoning Fu Rao, Liu Jingyu, Chen Xilun, Nie Yixin, Xiong Wenhan
- Tiny Titans: Can Smaller Large Language Models Punch Above Their Weight In The Real World For Meeting Summarization? Fu Xue-yong, Laskar Md Tahmid Rahman, Khasanova Elena, Chen Cheng, Tn Shashi Bhushan
- Autoguide: Automated Generation And Selection Of State-aware Guidelines For Large Language Model Agents Fu Yao, Kim Dong-ki, Kim Jaekyeom, Sohn Sungryull, Logeswaran Lajanugen, Bae Kyunghoon, Lee Honglak
- Cross-task Defense: Instruction-tuning Llms For Content Safety Fu Yu, Xiao Wen, Chen Jia, Li Jiachen, Papalexakis Evangelos, Chien Aichi, Dong Yue
- Using Large Language Models For The Interpretation Of Building Regulations Fuchs Stefan, Witbrock Michael, Dimyadi Johannes, Amor Robert
- Continual Pre-training For Cross-lingual LLM Adaptation: Enhancing Japanese Language Capabilities Fujii Kazuki, Nakamura Taishi, Loem Mengsay, Iida Hiroki, Ohi Masanari, Hattori Kakeru, Shota Hirai, Mizuki Sakae, Yokota Rio, Okazaki Naoaki
- Thinking Fair And Slow: On The Efficacy Of Structured Prompts For Debiasing Language Models Furniturewala Shaz, Jandial Surgan, Java Abhinav, Banerjee Pragyan, Shahid Simra, Bhatia Sumit, Jaidka Kokil
- Zero-shot Persuasive Chatbots With Llm-generated Strategies And Information Retrieval Furumai Kazuaki, Legaspi Roberto, Vizcarra Julio, Yamazaki Yudai, Nishimura Yasutaka, Semnani Sina J., Ikeda Kazushi, Shi Weiyan, Lam Monica S.
- Can AI Relate: Testing Large Language Model Response For Mental Health Support Gabriel Saadia, Puri Isha, Xu Xuhai, Malgaroli Matteo, Ghassemi Marzyeh
- Student Answer Forecasting: Transformer-driven Answer Choice Prediction For Language Learning Gado Elena Grazia, Martorella Tommaso, Zunino Luca, Mejia-domenzain Paola, Swamy Vinitra, Frej Jibril, Käser Tanja
- Airavata: Introducing Hindi Instruction-tuned LLM Gala Jay, Jayakumar Thanmay, Husain Jaavid Aktar, M Aswanth Kumar, Khan Mohammed Safi Ur Rahman, Kanojia Diptesh, Puduppully Ratish, Khapra Mitesh M., Dabre Raj, Murthy Rudra, Kunchukuttan Anoop
- Intelligence Analysis Of Language Models Galanti Liane, Baron Ethan
- LLM Detectors Still Fall Short Of Real World: Case Of Llm-generated Short News-like Posts Gameiro Henrique Da Silva, Kucharavy Andrei, Dolamic Ljiljana
- Cost-efficient Large Language Model Serving For Multi-turn Conversations With Cachedattention Gao Bin, He Zhuomin, Sharma Puru, Kang Qingxuan, Jevdjic Djordje, Deng Junbo, Yang Xingkun, Yu Zhou, Zuo Pengfei
- Practical Unlearning For Large Language Models Gao Chongyang, Wang Lixu, Weng Chenkai, Wang Xiao, Zhu Qi
- The Best Of Both Worlds: Toward An Honest And Helpful Large Language Model Gao Chujie, Zhang Qihui, Chen Dongping, Huang Yue, Wu Siyuan, Fu Zhengyan, Wan Yao, Zhang Xiangliang, Sun Lichao
- AIM: Let Any Multi-modal Large Language Models Embrace Efficient In-context Learning Gao Jun, Qiao Qian, Cao Ziqiang, Wang Zili, Li Wenjie
- Harnessing The Intrinsic Knowledge Of Pretrained Language Models For Challenging Text Classification Settings Gao Lingyu
- Lumina-t2x: Transforming Text Into Any Modality, Resolution, And Duration Via Flow-based Large Diffusion Transformers Gao Peng, Zhuo Le, Liu Dongyang, Du Ruoyi, Luo Xu, Qiu Longtian, Zhang Yuhang, Lin Chen, Huang Rongjie, Geng Shijie, Zhang Renrui, Xi Junlin, Shao Wenqi, Jiang Zhengkai, Yang Tianshuo, Ye Weicai, Tong He, He Jingwen, Qiao Yu, Li Hongsheng
- Efficient Tool Use With Chain-of-abstraction Reasoning Gao Silin, Dwivedi-yu Jane, Yu Ping, Tan Xiaoqing Ellen, Pasunuru Ramakanth, Golovneva Olga, Sinha Koustuv, Celikyilmaz Asli, Bosselut Antoine, Wang Tianlu
- Customizing Language Model Responses With Contrastive In-context Learning Gao Xiang, Das Kamalika
- Dr3: Ask Large Language Models Not To Give Off-topic Answers In Open Domain Multi-hop Question Answering Gao Yuan, Zhu Yiheng, Cao Yuanbin, Zhou Yinzhi, Wu Zhen, Chen Yujie, Wu Shenglan, Hu Haoyuan, Dai Xinyu
- Robust Pronoun Fidelity With English Llms: Are They Reasoning, Repeating, Or Just Biased? Gautam Vagrant, Bingert Eileen, Zhu Dawei, Lauscher Anne, Klakow Dietrich
- Longins: A Challenging Long-context Instruction-based Exam For Llms Gavin Shawn, Zheng Tuney, Liu Jiaheng, Que Quehry, Wang Noah, Yang Jian, Zhang Chenchen, Huang Wenhao, Chen Wenhu, Zhang Ge
- Cocobo: Exploring Large Language Models As The Engine For End-user Robot Programming Ge Yate, Dai Yi, Shan Run, Li Kechun, Hu Yuanda, Sun Xiaohua
- Worldgpt: Empowering LLM As Multimodal World Model Ge Zhiqi, Huang Hongzhe, Zhou Mingze, Li Juncheng, Wang Guoming, Tang Siliang, Zhuang Yueting
- Does Fine-tuning Llms On New Knowledge Encourage Hallucinations? Gekhman Zorik, Yona Gal, Aharoni Roee, Eyal Matan, Feder Amir, Reichart Roi, Herzig Jonathan
- Edinburgh Clinical NLP At MEDIQA-CORR 2024: Guiding Large Language Models With Hints Gema Aryo Pradipta, Lee Chaeeun, Minervini Pasquale, Daines Luke, Simpson T. Ian, Alex Beatrice
- Gemini 1.5: Unlocking Multimodal Understanding Across Millions Of Tokens Of Context Gemini Team, Georgiev Petko, Lei Ving Ian, Burnell Ryan, Bai Libin, Gulati Anmol, Tanzer Garrett, Vincent Damien, Pan Zhufeng, Wang Shibo, Mariooryad Soroosh, Ding Yifan, Geng Xinyang, Alcober Fred, Frostig Roy, Omernick Mark, Walker Lexi, Paduraru Cosmin, Sorokin Christina, Tacchetti Andrea, Gaffney Colin, Daruki Samira, Sercinoglu Olcan, Gleicher Zach, Love Juliette, Voigtlaender Paul, Jain Rohan, Surita Gabriela, Mohamed Kareem, Blevins Rory, Ahn Junwhan, Zhu Tao, Kawintiranon Kornraphop, Firat Orhan, Gu Yiming, Zhang Yujing, Rahtz Matthew, Faruqui Manaal, Clay Natalie, Gilmer Justin, Co-reyes Jd, Penchev Ivo, Zhu Rui, Morioka Nobuyuki, Hui Kevin, Haridasan Krishna, Campos Victor, Mahdieh Mahdis, Guo Mandy, Hassan Samer, Kilgour Kevin, Vezer Arpi, Cheng Heng-tze, De Liedekerke Raoul, Goyal Siddharth, Barham Paul, Strouse Dj, Noury Seb, Adler Jonas, Sundararajan Mukund, Vikram Sharad, Lepikhin Dmitry, Paganini Michela, Garcia Xavier, Yang Fan, Valter Dasha, Trebacz Maja, Vodrahalli Kiran, Asawaroengchai Chulayuth, Ring Roman, Kalb Norbert, Soares Livio Baldini, Brahma Siddhartha, Steiner David, Yu Tianhe, Mentzer Fabian, He Antoine, Gonzalez Lucas, Xu Bibo, Kaufman Raphael Lopez, Shafey Laurent El, Oh Junhyuk, Hennigan Tom, Driessche George Van Den, Odoom Seth, Lucic Mario, Roelofs Becca, Lall Sid, Marathe Amit, Chan Betty, Ontanon Santiago, He Luheng, Teplyashin Denis, Lai Jonathan, Crone Phil, Damoc Bogdan, Ho Lewis, Riedel Sebastian, Lenc Karel, Yeh Chih-kuan, Chowdhery Aakanksha, Xu Yang, Kazemi Mehran, Amid Ehsan, Petrushkina Anastasia, Swersky Kevin, Khodaei Ali, Chen Gowoon, Larkin Chris, Pinto Mario, Yan Geng, Badia Adria Puigdomenech, Patil Piyush, Hansen Steven, Orr Dave, Arnold Sebastien M. 
R., Grimstad Jordan, Dai Andrew, Douglas Sholto, Sinha Rishika, Yadav Vikas, Chen Xi, Gribovskaya Elena, Austin Jacob, Zhao Jeffrey, Patel Kaushal, Komarek Paul, Austin Sophia, Borgeaud Sebastian, Friso Linda, Goyal Abhimanyu, Caine Ben, Cao Kris, Chung Da-woon, Lamm Matthew, Barth-maron Gabe, Kagohara Thais, Olszewska Kate, Chen Mia, Shivakumar Kaushik, Agarwal Rishabh, Godhia Harshal, Rajwar Ravi, Snaider Javier, Dotiwalla Xerxes, Liu Yuan, Barua Aditya, Ungureanu Victor, Zhang Yuan, Batsaikhan Bat-orgil, Wirth Mateo, Qin James, Danihelka Ivo, Doshi Tulsee, Chadwick Martin, Chen Jilin, Jain Sanil, Le Quoc, Kar Arjun, Gurumurthy Madhu, Li Cheng, Sang Ruoxin, Liu Fangyu, Lamprou Lampros, Munoz Rich, Lintz Nathan, Mehta Harsh, Howard Heidi, Reynolds Malcolm, Aroyo Lora, Wang Quan, Blanco Lorenzo, Cassirer Albin, Griffith Jordan, Das Dipanjan, Lee Stephan, Sygnowski Jakub, Fisher Zach, Besley James, Powell Richard, Ahmed Zafarali, Paulus Dominik, Reitter David, Borsos Zalan, Joshi Rishabh, Pope Aedan, Hand Steven, Selo Vittorio, Jain Vihan, Sethi Nikhil, Goel Megha, Makino Takaki, May Rhys, Yang Zhen, Schalkwyk Johan, Butterfield Christina, Hauth Anja, Goldin Alex, Hawkins Will, Senter Evan, Brin Sergey, Woodman Oliver, Ritter Marvin, Noland Eric, Giang Minh, Bolina Vijay, Lee Lisa, Blyth Tim, Mackinnon Ian, Reid Machel, Sarvana Obaid, Silver David, Chen Alexander, Wang Lily, Maggiore Loren, Chang Oscar, Attaluri Nithya, Thornton Gregory, Chiu Chung-cheng, Bunyan Oskar, Levine Nir, Chung Timothy, Eltyshev Evgenii, Si Xiance, Lillicrap Timothy, Brady Demetra, Aggarwal Vaibhav, Wu Boxi, Xu Yuanzhong, Mcilroy Ross, Badola Kartikeya, Sandhu Paramjit, Moreira Erica, Stokowiec Wojciech, Hemsley Ross, Li Dong, Tudor Alex, Shyam Pranav, Rahimtoroghi Elahe, Haykal Salem, Sprechmann Pablo, Zhou Xiang, Mincu Diana, Li Yujia, Addanki Ravi, Krishna Kalpesh, Wu Xiao, Frechette Alexandre, Eyal Matan, Dafoe Allan, Lacey Dave, Whang Jay, Avrahami Thi, Zhang Ye, Taropa Emanuel, Lin 
Hanzhao, Toyama Daniel, Rutherford Eliza, Sano Motoki, Choe Hyunjeong, Tomala Alex, Safranek-shrader Chalence, Kassner Nora, Pajarskas Mantas, Harvey Matt, Sechrist Sean, Fortunato Meire, Lyu Christina, Elsayed Gamaleldin, Kuang Chenkai, Lottes James, Chu Eric, Jia Chao, Chen Chih-wei, Humphreys Peter, Baumli Kate, Tao Connie, Samuel Rajkumar, Santos Cicero Nogueira Dos, Andreassen Anders, Rakićević Nemanja, Grewe Dominik, Kumar Aviral, Winkler Stephanie, Caton Jonathan, Brock Andrew, Dalmia Sid, Sheahan Hannah, Barr Iain, Miao Yingjie, Natsev Paul, Devlin Jacob, Behbahani Feryal, Prost Flavien, Sun Yanhua, Myaskovsky Artiom, Pillai Thanumalayan Sankaranarayana, Hurt Dan, Lazaridou Angeliki, Xiong Xi, Zheng Ce, Pardo Fabio, Li Xiaowei, Horgan Dan, Stanton Joe, Ambar Moran, Xia Fei, Lince Alejandro, Wang Mingqiu, Mustafa Basil, Webson Albert, Lee Hyo, Anil Rohan, Wicke Martin, Dozat Timothy, Sinha Abhishek, Piqueras Enrique, Dabir Elahe, Upadhyay Shyam, Boral Anudhyan, Hendricks Lisa Anne, Fry Corey, Djolonga Josip, Su Yi, Walker Jake, Labanowski Jane, Huang Ronny, Misra Vedant, Chen Jeremy, Skerry-ryan Rj, Singh Avi, Rijhwani Shruti, Yu Dian, Castro-ros Alex, Changpinyo Beer, Datta Romina, Bagri Sumit, Hrafnkelsson Arnar Mar, Maggioni Marcello, Zheng Daniel, Sulsky Yury, Hou Shaobo, Paine Tom Le, Yang Antoine, Riesa Jason, Rogozinska Dominika, Marcus Dror, Badawy Dalia El, Zhang Qiao, Wang Luyu, Miller Helen, Greer Jeremy, Sjos Lars Lowe, Nova Azade, Zen Heiga, Chaabouni Rahma, Rosca Mihaela, Jiang Jiepu, Chen Charlie, Liu Ruibo, Sainath Tara, Krikun Maxim, Polozov Alex, Lespiau Jean-baptiste, Newlan Josh, Cankara Zeyncep, Kwak Soo, Xu Yunhan, Chen Phil, Coenen Andy, Meyer Clemens, Tsihlas Katerina, Ma Ada, Gottweis Juraj, Xing Jinwei, Gu Chenjie, Miao Jin, Frank Christian, Cankara Zeynep, Ganapathy Sanjay, Dasgupta Ishita, Hughes-fitt Steph, Chen Heng, Reid David, Rong Keran, Fan Hongmin, Van Amersfoort Joost, Zhuang Vincent, Cohen Aaron, Gu Shixiang Shane, 
Mohananey Anhad, Ilic Anastasija, Tobin Taylor, Wieting John, Bortsova Anna, Thacker Phoebe, Wang Emma, Caveness Emily, Chiu Justin, Sezener Eren, Kaskasoli Alex, Baker Steven, Millican Katie, Elhawaty Mohamed, Aisopos Kostas, Lebsack Carl, Byrd Nathan, Dai Hanjun, Jia Wenhao, Wiethoff Matthew, Davoodi Elnaz, Weston Albert, Yagati Lakshman, Ahuja Arun, Gao Isabel, Pundak Golan, Zhang Susan, Azzam Michael, Sim Khe Chai, Caelles Sergi, Keeling James, Sharma Abhanshu, Swing Andy, Li Yaguang, Liu Chenxi, Bostock Carrie Grimes, Bansal Yamini, Nado Zachary, Anand Ankesh, Lipschultz Josh, Karmarkar Abhijit, Proleev Lev, Ittycheriah Abe, Yeganeh Soheil Hassas, Polovets George, Faust Aleksandra, Sun Jiao, Rrustemi Alban, Li Pen, Shivanna Rakesh, Liu Jeremiah, Welty Chris, Lebron Federico, Baddepudi Anirudh, Krause Sebastian, Parisotto Emilio, Soricut Radu, Xu Zheng, Bloxwich Dawn, Johnson Melvin, Neyshabur Behnam, Mao-jones Justin, Wang Renshen, Ramasesh Vinay, Abbas Zaheer, Guez Arthur, Segal Constant, Nguyen Duc Dung, Svensson James, Hou Le, York Sarah, Milan Kieran, Bridgers Sophie, Gworek Wiktor, Tagliasacchi Marco, Lee-thorp James, Chang Michael, Guseynov Alexey, Hartman Ale Jakse, Kwong Michael, Zhao Ruizhe, Kashem Sheleem, Cole Elizabeth, Miech Antoine, Tanburn Richard, Phuong Mary, Pavetic Filip, Cevey Sebastien, Comanescu Ramona, Ives Richard, Yang Sherry, Du Cosmo, Li Bo, Zhang Zizhao, Iinuma Mariko, Hu Clara Huiyi, Roy Aurko, Bijwadia Shaan, Zhu Zhenkai, Martins Danilo, Saputro Rachel, Gergely Anita, Zheng Steven, Jia Dawei, Antonoglou Ioannis, Sadovsky Adam, Gu Shane, Bi Yingying, Andreev Alek, Samangooei Sina, Khan Mina, Kocisky Tomas, Filos Angelos, Kumar Chintu, Bishop Colton, Yu Adams, Hodkinson Sarah, Mittal Sid, Shah Premal, Moufarek Alexandre, Cheng Yong, Bloniarz Adam, Lee Jaehoon, Pejman Pedram, Michel Paul, Spencer Stephen, Feinberg Vladimir, Xiong Xuehan, Savinov Nikolay, Smith Charlotte, Shakeri Siamak, Tran Dustin, Chesus Mary, Bohnet Bernd, Tucker 
George, Von Glehn Tamara, Muir Carrie, Mao Yiran, Kazawa Hideto, Slone Ambrose, Soparkar Kedar, Shrivastava Disha, Cobon-kerr James, Sharman Michael, Pavagadhi Jay, Araya Carlos, Misiunas Karolis, Ghelani Nimesh, Laskin Michael, Barker David, Li Qiujia, Briukhov Anton, Houlsby Neil, Glaese Mia, Lakshminarayanan Balaji, Schucher Nathan, Tang Yunhao, Collins Eli, Lim Hyeontaek, Feng Fangxiaoyu, Recasens Adria, Lai Guangda, Magni Alberto, De Cao Nicola, Siddhant Aditya, Ashwood Zoe, Orbay Jordi, Dehghani Mostafa, Brennan Jenny, He Yifan, Xu Kelvin, Gao Yang, Saroufim Carl, Molloy James, Wu Xinyi, Arnold Seb, Chang Solomon, Schrittwieser Julian, Buchatskaya Elena, Radpour Soroush, Polacek Martin, Giordano Skye, Bapna Ankur, Tokumine Simon, Hellendoorn Vincent, Sottiaux Thibault, Cogan Sarah, Severyn Aliaksei, Saleh Mohammad, Thakoor Shantanu, Shefey Laurent, Qiao Siyuan, Gaba Meenu, Chang Shuo-yiin, Swanson Craig, Zhang Biao, Lee Benjamin, Rubenstein Paul Kishan, Song Gan, Kwiatkowski Tom, Koop Anna, Kannan Ajay, Kao David, Schuh Parker, Stjerngren Axel, Ghiasi Golnaz, Gibson Gena, Vilnis Luke, Yuan Ye, Ferreira Felipe Tiengo, Kamath Aishwarya, Klimenko Ted, Franko Ken, Xiao Kefan, Bhattacharya Indro, Patel Miteyan, Wang Rui, Morris Alex, Strudel Robin, Sharma Vivek, Choy Peter, Hashemi Sayed Hadi, Landon Jessica, Finkelstein Mara, Jhakra Priya, Frye Justin, Barnes Megan, Mauger Matthew, Daun Dennis, Baatarsukh Khuslen, Tung Matthew, Farhan Wael, Michalewski Henryk, Viola Fabio, Quitry Felix De Chaumont, Lan Charline Le, Hudson Tom, Wang Qingze, Fischer Felix, Zheng Ivy, White Elspeth, Dragan Anca, Alayrac Jean-baptiste, Ni Eric, Pritzel Alexander, Iwanicki Adam, Isard Michael, Bulanova Anna, Zilka Lukas, Dyer Ethan, Sachan Devendra, Srinivasan Srivatsan, Muckenhirn Hannah, Cai Honglong, Mandhane Amol, Tariq Mukarram, Rae Jack W., Wang Gary, Ayoub Kareem, Fitzgerald Nicholas, Zhao Yao, Han Woohyun, Alberti Chris, Garrette Dan, Krishnakumar Kashyap, Gimenez Mai, 
Levskaya Anselm, Sohn Daniel, Matak Josip, Iturrate Inaki, Chang Michael B., Xiang Jackie, Cao Yuan, Ranka Nishant, Brown Geoff, Hutter Adrian, Mirrokni Vahab, Chen Nanxin, Yao Kaisheng, Egyed Zoltan, Galilee Francois, Liechty Tyler, Kallakuri Praveen, Palmer Evan, Ghemawat Sanjay, Liu Jasmine, Tao David, Thornton Chloe, Green Tim, Jasarevic Mimi, Lin Sharon, Cotruta Victor, Tan Yi-xuan, Fiedel Noah, Yu Hongkun, Chi Ed, Neitz Alexander, Heitkaemper Jens, Sinha Anu, Zhou Denny, Sun Yi, Kaed Charbel, Hulse Brice, Mishra Swaroop, Georgaki Maria, Kudugunta Sneha, Farabet Clement, Shafran Izhak, Vlasic Daniel, Tsitsulin Anton, Ananthanarayanan Rajagopal, Carin Alen, Su Guolong, Sun Pei, V Shashank, Carvajal Gabriel, Broder Josef, Comsa Iulia, Repina Alena, Wong William, Chen Warren Weilun, Hawkins Peter, Filonov Egor, Loher Lucia, Hirnschall Christoph, Wang Weiyi, Ye Jingchen, Burns Andrea, Cate Hardie, Wright Diana Gage, Piccinini Federico, Zhang Lei, Lin Chu-cheng, Gog Ionel, Kulizhskaya Yana, Sreevatsa Ashwin, Song Shuang, Cobo Luis C., Iyer Anand, Tekur Chetan, Garrido Guillermo, Xiao Zhuyun, Kemp Rupert, Zheng Huaixiu Steven, Li Hui, Agarwal Ananth, Ngani Christel, Goshvadi Kati, Santamaria-fernandez Rebeca, Fica Wojciech, Chen Xinyun, Gorgolewski Chris, Sun Sean, Garg Roopal, Ye Xinyu, Eslami S. M. 
Ali, Hua Nan, Simon Jon, Joshi Pratik, Kim Yelin, Tenney Ian, Potluri Sahitya, Thiet Lam Nguyen, Yuan Quan, Luisier Florian, Chronopoulou Alexandra, Scellato Salvatore, Srinivasan Praveen, Chen Minmin, Koverkathu Vinod, Dalibard Valentin, Xu Yaming, Saeta Brennan, Anderson Keith, Sellam Thibault, Fernando Nick, Huot Fantine, Jung Junehyuk, Varadarajan Mani, Quinn Michael, Raul Amit, Le Maigo, Habalov Ruslan, Clark Jon, Jalan Komal, Bullard Kalesha, Singhal Achintya, Luong Thang, Wang Boyu, Rajayogam Sujeevan, Eisenschlos Julian, Jia Johnson, Finchelstein Daniel, Yakubovich Alex, Balle Daniel, Fink Michael, Agarwal Sameer, Li Jing, Dvijotham Dj, Pal Shalini, Kang Kai, Konzelmann Jaclyn, Beattie Jennifer, Dousse Olivier, Wu Diane, Crocker Remi, Elkind Chen, Jonnalagadda Siddhartha Reddy, Lee Jong, Holtmann-rice Dan, Kallarackal Krystal, Liu Rosanne, Vnukov Denis, Vats Neera, Invernizzi Luca, Jafari Mohsen, Zhou Huanjie, Taylor Lilly, Prendki Jennifer, Wu Marcus, Eccles Tom, Liu Tianqi, Kopparapu Kavya, Beaufays Francoise, Angermueller Christof, Marzoca Andreea, Sarcar Shourya, Dib Hilal, Stanway Jeff, Perbet Frank, Trdin Nejc, Sterneck Rachel, Khorlin Andrey, Li Dinghua, Wu Xihui, Goenka Sonam, Madras David, Goldshtein Sasha, Gierke Willi, Zhou Tong, Liu Yaxin, Liang Yannie, White Anais, Li Yunjie, Singh Shreya, Bahargam Sanaz, Epstein Mark, Basu Sujoy, Lao Li, Ozturel Adnan, Crous Carl, Zhai Alex, Lu Han, Tung Zora, Gaur Neeraj, Walton Alanna, Dixon Lucas, Zhang Ming, Globerson Amir, Uy Grant, Bolt Andrew, Wiles Olivia, Nasr Milad, Shumailov Ilia, Selvi Marco, Piccinno Francesco, Aguilar Ricardo, Mccarthy Sara, Khalman Misha, Shukla Mrinal, Galic Vlado, Carpenter John, Villela Kevin, Zhang Haibin, Richardson Harry, Martens James, Bosnjak Matko, Belle Shreyas Rammohan, Seibert Jeff, Alnahlawi Mahmoud, Mcwilliams Brian, Singh Sankalp, Louis Annie, Ding Wen, Popovici Dan, Simicich Lenin, Knight Laura, Mehta Pulkit, Gupta Nishesh, Shi Chongyang, Fatehi Saaber, Mitrovic 
Jovana, Grills Alex, Pagadora Joseph, Petrova Dessie, Eisenbud Danielle, Zhang Zhishuai, Yates Damion, Mittal Bhavishya, Tripuraneni Nilesh, Assael Yannis, Brovelli Thomas, Jain Prateek, Velimirovic Mihajlo, Akbulut Canfer, Mu Jiaqi, Macherey Wolfgang, Kumar Ravin, Xu Jun, Qureshi Haroon, Comanici Gheorghe, Wiesner Jeremy, Gong Zhitao, Ruddock Anton, Bauer Matthias, Felt Nick, Gp Anirudh, Arnab Anurag, Zelle Dustin, Rothfuss Jonas, Rosgen Bill, Shenoy Ashish, Seybold Bryan, Li Xinjian, Mudigonda Jayaram, Erdogan Goker, Xia Jiawei, Simsa Jiri, Michi Andrea, Yao Yi, Yew Christopher, Kan Steven, Caswell Isaac, Radebaugh Carey, Elisseeff Andre, Valenzuela Pedro, Mckinney Kay, Paterson Kim, Cui Albert, Latorre-chimoto Eri, Kim Solomon, Zeng William, Durden Ken, Ponnapalli Priya, Sosea Tiberiu, Choquette-choo Christopher A., Manyika James, Robenek Brona, Vashisht Harsha, Pereira Sebastien, Lam Hoi, Velic Marko, Owusu-afriyie Denese, Lee Katherine, Bolukbasi Tolga, Parrish Alicia, Lu Shawn, Park Jane, Venkatraman Balaji, Talbert Alice, Rosique Lambert, Cheng Yuchung, Sozanschi Andrei, Paszke Adam, Kumar Praveen, Austin Jessica, Li Lu, Salama Khalid, Kim Wooyeol, Dukkipati Nandita, Baryshnikov Anthony, Kaplanis Christos, Sheng Xianghai, Chervonyi Yuri, Unlu Caglar, Casas Diego De Las, Askham Harry, Tunyasuvunakool Kathryn, Gimeno Felix, Poder Siim, Kwak Chester, Miecnikowski Matt, Mirrokni Vahab, Dimitriev Alek, Parisi Aaron, Liu Dangyi, Tsai Tomy, Shevlane Toby, Kouridi Christina, Garmon Drew, Goedeckemeyer Adrian, Brown Adam R., Vijayakumar Anitha, Elqursh Ali, Jazayeri Sadegh, Huang Jin, Carthy Sara Mc, Hoover Jay, Kim Lucy, Kumar Sandeep, Chen Wei, Biles Courtney, Bingham Garrett, Rosen Evan, Wang Lisa, Tan Qijun, Engel David, Pongetti Francesco, De Cesare Dario, Hwang Dongseong, Yu Lily, Pullman Jennifer, Narayanan Srini, Levin Kyle, Gopal Siddharth, Li Megan, Aharoni Asaf, Trinh Trieu, Lo Jessica, Casagrande Norman, Vij Roopali, Matthey Loic, Ramadhana Bramandia, 
Matthews Austin, Carey Cj, Johnson Matthew, Goranova Kremena, Shah Rohin, Ashraf Shereen, Dasgupta Kingshuk, Larsen Rasmus, Wang Yicheng, Vuyyuru Manish Reddy, Jiang Chong, Ijazi Joana, Osawa Kazuki, Smith Celine, Boppana Ramya Sree, Bilal Taylan, Koizumi Yuma, Xu Ying, Altun Yasemin, Shabat Nir, Bariach Ben, Korchemniy Alex, Choo Kiam, Ronneberger Olaf, Iwuanyanwu Chimezie, Zhao Shubin, Soergel David, Hsieh Cho-jui, Cai Irene, Iqbal Shariq, Sundermeyer Martin, Chen Zhe, Bursztein Elie, Malaviya Chaitanya, Biadsy Fadi, Shroff Prakash, Dhillon Inderjit, Latkar Tejasi, Dyer Chris, Forbes Hannah, Nicosia Massimo, Nikolaev Vitaly, Greene Somer, Georgiev Marin, Wang Pidong, Martin Nina, Sedghi Hanie, Zhang John, Banzal Praseem, Fritz Doug, Rao Vikram, Wang Xuezhi, Zhang Jiageng, Patraucean Viorica, Du Dayou, Mordatch Igor, Jurin Ivan, Liu Lewis, Dubey Ayush, Mohan Abhi, Nowakowski Janek, Ion Vlad-doru, Wei Nan, Tojo Reiko, Raad Maria Abi, Hudson Drew A., Keshava Vaishakh, Agrawal Shubham, Ramirez Kevin, Wu Zhichun, Nguyen Hoang, Liu Ji, Sewak Madhavi, Petrini Bryce, Choi Donghyun, Philips Ivan, Wang Ziyue, Bica Ioana, Garg Ankush, Wilkiewicz Jarek, Agrawal Priyanka, Li Xiaowei, Guo Danhao, Xue Emily, Shaik Naseer, Leach Andrew, Khan Sadh Mnm, Wiesinger Julia, Jerome Sammy, Chakladar Abhishek, Wang Alek Wenjiao, Ornduff Tina, Abu Folake, Ghaffarkhah Alireza, Wainwright Marcus, Cortes Mario, Liu Frederick, Maynez Joshua, Terzis Andreas, Samangouei Pouya, Mansour Riham, Kępa Tomasz, Aubet François-xavier, Algymr Anton, Banica Dan, Weisz Agoston, Orban Andras, Senges Alexandre, Andrejczuk Ewa, Geller Mark, Santo Niccolo Dal, Anklin Valentin, Merey Majd Al, Baeuml Martin, Strohman Trevor, Bai Junwen, Petrov Slav, Wu Yonghui, Hassabis Demis, Kavukcuoglu Koray, Dean Jeffrey, Vinyals Oriol
- Gemma: Open Models Based On Gemini Research And Technology Gemma Team, Mesnard Thomas, Hardin Cassidy, Dadashi Robert, Bhupatiraju Surya, Pathak Shreya, Sifre Laurent, Rivière Morgane, Kale Mihir Sanjay, Love Juliette, Tafti Pouya, Hussenot Léonard, Sessa Pier Giuseppe, Chowdhery Aakanksha, Roberts Adam, Barua Aditya, Botev Alex, Castro-ros Alex, Slone Ambrose, Héliou Amélie, Tacchetti Andrea, Bulanova Anna, Paterson Antonia, Tsai Beth, Shahriari Bobak, Lan Charline Le, Choquette-choo Christopher A., Crepy Clément, Cer Daniel, Ippolito Daphne, Reid David, Buchatskaya Elena, Ni Eric, Noland Eric, Yan Geng, Tucker George, Muraru George-christian, Rozhdestvenskiy Grigory, Michalewski Henryk, Tenney Ian, Grishchenko Ivan, Austin Jacob, Keeling James, Labanowski Jane, Lespiau Jean-baptiste, Stanway Jeff, Brennan Jenny, Chen Jeremy, Ferret Johan, Chiu Justin, Mao-jones Justin, Lee Katherine, Yu Kathy, Millican Katie, Sjoesund Lars Lowe, Lee Lisa, Dixon Lucas, Reid Machel, Mikuła Maciej, Wirth Mateo, Sharman Michael, Chinaev Nikolai, Thain Nithum, Bachem Olivier, Chang Oscar, Wahltinez Oscar, Bailey Paige, Michel Paul, Yotov Petko, Chaabouni Rahma, Comanescu Ramona, Jana Reena, Anil Rohan, Mcilroy Ross, Liu Ruibo, Mullins Ryan, Smith Samuel L, Borgeaud Sebastian, Girgin Sertan, Douglas Sholto, Pandya Shree, Shakeri Siamak, De Soham, Klimenko Ted, Hennigan Tom, Feinberg Vlad, Stokowiec Wojciech, Chen Yu-hui, Ahmed Zafarali, Gong Zhitao, Warkentin Tris, Peran Ludovic, Giang Minh, Farabet Clément, Vinyals Oriol, Dean Jeff, Kavukcuoglu Koray, Hassabis Demis, Ghahramani Zoubin, Eck Douglas, Barral Joelle, Pereira Fernando, Collins Eli, Joulin Armand, Fiedel Noah, Senter Evan, Andreev Alek, Kenealy Kathleen
- Gemma 2: Improving Open Language Models At A Practical Size Gemma Team, Riviere Morgane, Pathak Shreya, Sessa Pier Giuseppe, Hardin Cassidy, Bhupatiraju Surya, Hussenot Léonard, Mesnard Thomas, Shahriari Bobak, Ramé Alexandre, Ferret Johan, Liu Peter, Tafti Pouya, Friesen Abe, Casbon Michelle, Ramos Sabela, Kumar Ravin, Lan Charline Le, Jerome Sammy, Tsitsulin Anton, Vieillard Nino, Stanczyk Piotr, Girgin Sertan, Momchev Nikola, Hoffman Matt, Thakoor Shantanu, Grill Jean-bastien, Neyshabur Behnam, Bachem Olivier, Walton Alanna, Severyn Aliaksei, Parrish Alicia, Ahmad Aliya, Hutchison Allen, Abdagic Alvin, Carl Amanda, Shen Amy, Brock Andy, Coenen Andy, Laforge Anthony, Paterson Antonia, Bastian Ben, Piot Bilal, Wu Bo, Royal Brandon, Chen Charlie, Kumar Chintu, Perry Chris, Welty Chris, Choquette-choo Christopher A., Sinopalnikov Danila, Weinberger David, Vijaykumar Dimple, Rogozińska Dominika, Herbison Dustin, Bandy Elisa, Wang Emma, Noland Eric, Moreira Erica, Senter Evan, Eltyshev Evgenii, Visin Francesco, Rasskin Gabriel, Wei Gary, Cameron Glenn, Martins Gus, Hashemi Hadi, Klimczak-plucińska Hanna, Batra Harleen, Dhand Harsh, Nardini Ivan, Mein Jacinda, Zhou Jack, Svensson James, Stanway Jeff, Chan Jetha, Zhou Jin Peng, Carrasqueira Joana, Iljazi Joana, Becker Jocelyn, Fernandez Joe, Van Amersfoort Joost, Gordon Josh, Lipschultz Josh, Newlan Josh, Ji Ju-yeong, Mohamed Kareem, Badola Kartikeya, Black Kat, Millican Katie, Mcdonell Keelin, Nguyen Kelvin, Sodhia Kiranbir, Greene Kish, Sjoesund Lars Lowe, Usui Lauren, Sifre Laurent, Heuermann Lena, Lago Leticia, Mcnealus Lilly, Soares Livio Baldini, Kilpatrick Logan, Dixon Lucas, Martins Luciano, Reid Machel, Singh Manvinder, Iverson Mark, Görner Martin, Velloso Mat, Wirth Mateo, Davidow Matt, Miller Matt, Rahtz Matthew, Watson Matthew, Risdal Meg, Kazemi Mehran, Moynihan Michael, Zhang Ming, Kahng Minsuk, Park Minwoo, Rahman Mofi, Khatwani Mohit, Dao Natalie, Bardoliwalla Nenshad, Devanathan Nesh, Dumai Neta, Chauhan Nilay, Wahltinez Oscar, Botarda Pankil, Barnes Parker, Barham Paul, Michel Paul, Jin Pengchong, Georgiev Petko, Culliton Phil, Kuppala Pradeep, Comanescu Ramona, Merhej Ramona, Jana Reena, Rokni Reza Ardeshir, Agarwal Rishabh, Mullins Ryan, Saadat Samaneh, Carthy Sara Mc, Perrin Sarah, Arnold Sébastien M. R., Krause Sebastian, Dai Shengyang, Garg Shruti, Sheth Shruti, Ronstrom Sue, Chan Susan, Jordan Timothy, Yu Ting, Eccles Tom, Hennigan Tom, Kocisky Tomas, Doshi Tulsee, Jain Vihan, Yadav Vikas, Meshram Vilobh, Dharmadhikari Vishal, Barkley Warren, Wei Wei, Ye Wenming, Han Woohyun, Kwon Woosuk, Xu Xiang, Shen Zhe, Gong Zhitao, Wei Zichuan, Cotruta Victor, Kirk Phoebe, Rao Anand, Giang Minh, Peran Ludovic, Warkentin Tris, Collins Eli, Barral Joelle, Ghahramani Zoubin, Hadsell Raia, Sculley D., Banks Jeanine, Dragan Anca, Petrov Slav, Vinyals Oriol, Dean Jeff, Hassabis Demis, Kavukcuoglu Koray, Farabet Clement, Buchatskaya Elena, Borgeaud Sebastian, Fiedel Noah, Joulin Armand, Kenealy Kathleen, Dadashi Robert, Andreev Alek
- Breaking The Length Barrier: Llm-enhanced CTR Prediction In Long Textual User Behaviors Geng Binzong, Huan Zhaoxin, Zhang Xiaolu, He Yong, Zhang Liang, Yuan Fajie, Zhou Jun, Mo Linjian
- Supporting Sensemaking Of Large Language Model Outputs At Scale Gero Katy Ilonka, Swoopes Chelse, Gu Ziwei, Kummerfeld Jonathan K., Glassman Elena L.
- Beyond Self-learned Attention: Mitigating Attention Bias In Transformer-based Models Using Attention Guidance Gesi Jiri, Ahmed Iftekhar
- AI And Memory Wall Gholami Amir, Yao Zhewei, Kim Sehoon, Hooper Coleman, Mahoney Michael W., Keutzer Kurt
- Reinforcement Learning Problem Solving With Large Language Models Gholamian Sina, Huh Domingo
- RE-GAINS & Enchant: Intelligent Tool Manipulation Systems For Enhanced Query Responses Girhepuje Sahil, Sajeev Siva Sankar, Jain Purvam, Sikder Arya, Varma Adithya Rama, George Ryan, Srinivasan Akshay Govind, Kurup Mahendra, Sinha Ashmit, Mondal Sudip
- Almol: Aligned Language-molecule Translation Llms Through Offline Preference Contrastive Optimisation Gkoumas Dimitris
- Arcee's Mergekit: A Toolkit For Merging Large Language Models Goddard Charles, Siriwardhana Shamane, Ehghaghi Malikeh, Meyers Luke, Karpukhin Vlad, Benedict Brian, Mcquade Mark, Solawetz Jacob
- Socratic Reasoning Improves Positive Text Rewriting Goel Anmol, Daheim Nico, Gurevych Iryna
- Direct Alignment Of Draft Model For Speculative Decoding With Chat-fine-tuned Llms Goel Raghavv, Gagrani Mukul, Jeon Wonseok, Park Junyoung, Lee Mingu, Lott Christopher
- Language Guided Exploration For RL Agents In Text Environments Golchha Hitesh, Yerawar Sahil, Patel Dhruvesh, Dan Soham, Murugesan Keerthiram
- Tokenshap: Interpreting Large Language Models With Monte Carlo Shapley Value Estimation Goldshmidt Roni, Horovicz Miriam
- Do Zombies Understand? A Choose-your-own-adventure Exploration Of Machine Cognition Goldstein Ariel, Stanovsky Gabriel
- Goldfinch: High Performance Rwkv/transformer Hybrid With Linear Pre-fill And Extreme Kv-cache Compression Goldstein Daniel, Obeid Fares, Alcaide Eric, Song Guangyu, Cheah Eugene
- Does Chatgpt Have A Mind? Goldstein Simon, Levinstein Benjamin A.
- I've Got The "answer"! Interpretation Of Llms Hidden States In Question Answering Goloviznina Valeriya, Kotelnikov Evgeny
- AST-T5: Structure-aware Pretraining For Code Generation And Understanding Gong Linyuan, Elhoushi Mostafa, Cheung Alvin
- LLMC: Benchmarking Large Language Model Quantization With A Versatile Compression Toolkit Gong Ruihao, Yong Yang, Gu Shiqiao, Huang Yushi, Lv Chentao, Zhang Yunchen, Liu Xianglong, Tao Dacheng
- Llmguard: Guarding Against Unsafe LLM Behavior Goyal Shubh, Hira Medha, Mishra Shubham, Goyal Sukriti, Goel Arnav, Dadu Niharika, Db Kirushikesh, Mehta Sameep, Madaan Nishtha
- Few Shot Chain-of-thought Driven Reasoning To Prompt Llms For Open Ended Medical Question Answering Gramopadhye Ojas, Nachane Saeel Sandeep, Chanda Prateek, Ramakrishnan Ganesh, Jadhav Kshitij Sharad, Nandwani Yatin, Raghu Dinesh, Joshi Sachindra
- Loose LIPS Sink Ships: Asking Questions In Battleship With Language-informed Program Sampling Grand Gabriel, Pepe Valerio, Andreas Jacob, Tenenbaum Joshua B.
- Enhancing Llm-based Human-robot Interaction With Nuances For Diversity Awareness Grassi Lucrezia, Recchiuto Carmine Tommaso, Sgorbissa Antonio
- Talking The Talk Does Not Entail Walking The Walk: On The Limits Of Large Language Models In Lexical Entailment Recognition Greco Candida M., La Cava Lucio, Tagarelli Andrea
- Stress-testing Capability Elicitation With Password-locked Models Greenblatt Ryan, Roger Fabien, Krasheninnikov Dmitrii, Krueger David
- Model Editing Harms General Abilities Of Large Language Models: Regularization To The Rescue Gu Jia-chen, Xu Hao-xiang, Ma Jun-yu, Lu Pan, Ling Zhen-hua, Chang Kai-wei, Peng Nanyun
- A Survey On Responsible Generative AI: What To Generate And What Not Gu Jindong
- Teams-rl: Teaching Llms To Generate Better Instruction Datasets Via Reinforcement Learning Gu Shangding, Knoll Alois, Jin Ming
- Continual Skill And Task Learning Via Dialogue Gu Weiwei, Kondepudi Suresh, Huang Lixiao, Gopalan Nakul
- Middleware For Llms: Tools Are Instrumental For Language Agents In Complex Environments Gu Yu, Shu Yiheng, Yu Hao, Liu Xiao, Dong Yuxiao, Tang Jie, Srinivasa Jayanth, Latapie Hugo, Su Yu
- OLMES: A Standard For Language Model Evaluations Gu Yuling, Tafjord Oyvind, Kuehl Bailey, Haddad Dany, Dodge Jesse, Hajishirzi Hannaneh
- Diverse And Fine-grained Instruction-following Ability Exploration With Synthetic Data Gu Zihui, Sun Xingwu, Lian Fengzong, Kang Zhanhui, Xu Cheng-zhong, Fan Ju
- Codeip: A Grammar-guided Multi-bit Watermark For Large Language Models Of Code Guan Batu, Wan Yao, Bi Zhangqian, Wang Zheng, Zhang Hongyu, Zhou Pan, Sun Lichao
- Probing The Robustness Of Vision-language Pretrained Models: A Multimodal Adversarial Attack Approach Guan Jiwei, Ding Tianyu, Cao Longbing, Pan Lei, Wang Chen, Zheng Xi
- Enhancing Collaborative Semantics Of Language Model-driven Recommendations Via Graph-aware Learning Guan Zhong, Wu Likang, Zhao Hongke, He Ming, Fan Jianpin
- APTQ: Attention-aware Post-training Mixed-precision Quantization For Large Language Models Guan Ziyi, Huang Hantao, Su Yupeng, Huang Hong, Wong Ngai, Yu Hao
- Iepile: Unearthing Large-scale Schema-based Information Extraction Corpus Gui Honghao, Yuan Lin, Ye Hongbin, Zhang Ningyu, Sun Mengshu, Liang Lei, Chen Huajun
- Conversational Topic Recommendation In Counseling And Psychotherapy With Decision Transformer And Large Language Models Gunal Aylin, Lin Baihan, Bouneffouf Djallel
- A Novel Paradigm Boosting Translation Capabilities Of Large Language Models Guo Jiaxin, Yang Hao, Li Zongyao, Wei Daimeng, Shang Hengchao, Chen Xiaoyu
- Learning On Graphs With Large Language Models (llms): A Deep Dive Into Model Robustness Guo Kai, Liu Zewen, Chen Zhikai, Wen Hongzhi, Jin Wei, Tang Jiliang, Chang Yi
- Integrating Large Language Models With Graphical Session-based Recommendation Guo Naicheng, Cheng Hongwei, Liang Qianqiao, Chen Linxun, Han Bing
- Direct Language Model Alignment From Online AI Feedback Guo Shangmin, Zhang Biao, Liu Tianlin, Liu Tianqi, Khalman Misha, Llinares Felipe, Rame Alexandre, Mesnard Thomas, Zhao Yao, Piot Bilal, Ferret Johan, Blondel Mathieu
- Serial Position Effects Of Large Language Models Guo Xiaobo, Vosoughi Soroush
- Generative AI For Synthetic Data Generation: Methods, Challenges And The Future Guo Xu, Chen Yiqiang
- Efficient Continual Pre-training By Mitigating The Stability Gap Guo Yiduo, Fu Jie, Zhang Huishuai, Zhao Dongyan, Shen Yikang
- Stabletoolbench: Towards Stable Large-scale Benchmarking On Tool Learning Of Large Language Models Guo Zhicheng, Cheng Sijie, Wang Hao, Liang Shihao, Qin Yujia, Li Peng, Liu Zhiyuan, Sun Maosong, Liu Yang
- Integrating Explanations In Learning LTL Specifications From Demonstrations Gupta Ashutosh, Komp John, Rajput Abhay Singh, Shankaranarayanan Krishna, Trivedi Ashutosh, Varshney Namrita
- Walledeval: A Comprehensive Safety Evaluation Toolkit For Large Language Models Gupta Prannaya, Yau Le Qi, Low Hao Han, Lee I-shiang, Lim Hugo Maximus, Teoh Yu Xin, Koh Jia Hng, Liew Dar Win, Bhardwaj Rishabh, Bhardwaj Rajat, Poria Soujanya
- Multilingual Large Language Models And Curse Of Multilinguality Gurgurov Daniil, Bäumel Tanja, Anikina Tatiana
- Hipporag: Neurobiologically Inspired Long-term Memory For Large Language Models Gutiérrez Bernal Jiménez, Shu Yiheng, Gu Yu, Yasunaga Michihiro, Su Yu
- Clochat: Understanding How People Customize, Interact, And Experience Personas In Large Language Models Ha Juhye, Jeon Hyeon, Han Daeun, Seo Jinwook, Oh Changhoon
- Joint Embeddings For Graph Instruction Tuning Haag Aaron, Argatu Vlad, Lohse Oliver
- METAL: Towards Multilingual Meta-evaluation Hada Rishav, Gumma Varun, Ahmed Mohamed, Bali Kalika, Sitaram Sunayana
- Would I Lie To You? Inference Time Alignment Of Language Models Using Direct Preference Heads Hadji-kyriacou Avelina Asada, Arandjelovic Ognjen
- On The Impacts Of Contexts On Repository-level Code Generation Hai Nam Le, Nguyen Dung Manh, Bui Nghi D. Q.
- Looking Into Black Box Code Language Models Haider Muhammad Umair, Farooq Umar, Siddique A. B., Marron Mark
- PECC: Problem Extraction And Coding Challenges Haller Patrick, Golde Jonas, Akbik Alan
- Sltrain: A Sparse Plus Low-rank Approach For Parameter And Memory Efficient Pretraining Han Andi, Li Jiaxiang, Huang Wei, Hong Mingyi, Takeda Akiko, Jawanpuria Pratik, Mishra Bamdev
- Llm-personalize: Aligning LLM Planners With Human Preferences Via Reinforced Self-training For Housekeeping Robots Han Dongge, Mcinroe Trevor, Jelley Adam, Albrecht Stefano V., Bell Peter, Storkey Amos
- MERLIN: Multimodal Embedding Refinement Via Llm-based Iterative Navigation For Text-video Retrieval-rerank Pipeline Han Donghoon, Park Eunhwan, Lee Gisang, Lee Adam, Kwak Nojun
- Chain-of-interaction: Enhancing Large Language Models For Psychiatric Behavior Understanding By Dyadic Contexts Han Guangzeng, Liu Weisi, Huang Xiaolei, Borsari Brian
- Small Language Model Can Self-correct Han Haixia, Liang Jiaqing, Shi Jie, He Qianyu, Xiao Yanghua
- RECIPE4U: Student-chatgpt Interaction Dataset In EFL Writing Education Han Jieun, Yoo Haneul, Myung Junho, Kim Minsun, Lee Tak Yeon, Ahn So-yeon, Oh Alice
- Transfer Learning For Text Diffusion Models Han Kehang, Kenealy Kathleen, Barua Aditya, Fiedel Noah, Constant Noah
- RAG-QA Arena: Evaluating Domain Robustness For Long-form Retrieval Augmented Question Answering Han Rujun, Zhang Yuhao, Qi Peng, Xu Yumo, Wang Jenyuan, Liu Lan, Wang William Yang, Min Bonan, Castelli Vittorio
- Value Augmented Sampling For Language Model Alignment And Personalization Han Seungwook, Shenfeld Idan, Srivastava Akash, Kim Yoon, Agrawal Pulkit
- Torchopera: A Compound AI System For LLM Safety Han Shanshan, Yao Yuhang, Hu Zijian, Stripelis Dimitris, Xu Zhaozhuo, He Chaoyang
- COCO Is "ALL" You Need For Visual Instruction Fine-tuning Han Xiaotian, Wang Yiqi, Zhai Bohan, You Quanzeng, Yang Hongxia
- Rethinking Large Language Model Architectures For Sequential Recommendations Wang Hanbing, Liu Xiaorui, Fan Wenqi, Zhao Xiangyu, Kini Venkataramana, Yadav Devendra, Wang Fei, Wen Zhen, Tang Jiliang, Liu Hui
- Actionreasoningbench: Reasoning About Actions With And Without Ramification Constraints Handa Divij, Dolin Pavel, Kumbhar Shrinidhi, Baral Chitta, Son Tran Cao
- Vision-and-language Navigation Generative Pretrained Transformer Hanlin Wen
- LLM Reasoners: New Evaluation, Library, And Analysis Of Step-by-step Reasoning With Large Language Models Hao Shibo, Gu Yi, Luo Haotian, Liu Tianyang, Shao Xiyan, Wang Xinyuan, Xie Shuhua, Ma Haodi, Samavedhi Adithya, Gao Qiyue, Wang Zhen, Hu Zhiting
- Learning To Rewrite: Generalized Llm-generated Text Detection Hao Wei, Li Ran, Zhao Weiliang, Yang Junfeng, Mao Chengzhi
- Risk And Response In Large Language Models: Evaluating Key Threat Categories Harandizadeh Bahareh, Salinas Abel, Morstatter Fred
- It Cannot Be Right If It Was Written By AI: On Lawyers' Preferences Of Documents Perceived As Authored By An LLM Vs A Human Harasta Jakub, Novotná Tereza, Savelka Jaromir
- Astprompter: Weakly Supervised Automated Language Model Red-teaming To Identify Likely Toxic Prompts Hardy Amelia F., Liu Houjun, Lange Bernard, Kochenderfer Mykel J.
- Do Large Language Models Speak All Languages Equally? A Comparative Study In Low-resource Settings Hasan Md. Arid, Tarannum Prerona, Dey Krishno, Razzak Imran, Naseem Usman
- Sub-goal Distillation: A Method To Improve Small Language Agents Hashemzadeh Maryam, Stengel-eskin Elias, Chandar Sarath, Cote Marc-alexandre
- A Comprehensive Overview Of Large Language Models (llms) For Cyber Defences: Opportunities And Directions Hassanin Mohammed, Moustafa Nour
- Teaching Large Language Models To Reason With Reinforcement Learning Havrilla Alex, Du Yuqing, Raparthy Sharath Chandra, Nalmpantis Christoforos, Dwivedi-yu Jane, Zhuravinskyi Maksym, Hambro Eric, Sukhbaatar Sainbayar, Raileanu Roberta
- Glore: When, Where, And How To Improve LLM Reasoning Via Global And Local Refinements Havrilla Alex, Raparthy Sharath, Nalmpantis Christoforus, Dwivedi-yu Jane, Zhuravinskyi Maksym, Hambro Eric, Raileanu Roberta
- Concerns On Bias In Large Language Models When Creating Synthetic Personae Haxvig Helena A.
- Large Language Models Are Biased Reinforcement Learners Hayes William M., Yax Nicolas, Palminteri Stefano
- Relative Value Biases In Large Language Models Hayes William M., Yax Nicolas, Palminteri Stefano
- Sowing The Wind, Reaping The Whirlwind: The Impact Of Editing Language Models Hazra Rima, Layek Sayan, Banerjee Somnath, Poria Soujanya
- Zero-shot Generalization During Instruction Tuning: Insights From Similarity And Granularity He Bingxiang, Ding Ning, Qian Cheng, Deng Jia, Cui Ganqu, Yuan Lifan, Gao Huan-ang, Chen Huimin, Liu Zhiyuan, Sun Maosong
- Olympiadbench: A Challenging Benchmark For Promoting AGI With Olympiad-level Bilingual Multimodal Scientific Problems He Chaoqun, Luo Renjie, Bai Yuzhuo, Hu Shengding, Thai Zhen Leng, Shen Junhao, Hu Jinyi, Han Xu, Huang Yujie, Zhang Yuxiang, Liu Jie, Qi Lei, Liu Zhiyuan, Sun Maosong
- Webvoyager: Building An End-to-end Web Agent With Large Multimodal Models He Hongliang, Yao Wenlin, Ma Kaixin, Yu Wenhao, Dai Yong, Zhang Hongming, Lan Zhenzhong, Yu Dong
- POEM: Interactive Prompt Optimization For Enhancing Multimodal Reasoning Of Large Language Models He Jianben, Wang Xingbo, Liu Shiyi, Wu Guande, Silva Claudio, Qu Huamin
- From Words To Actions: Unveiling The Theoretical Underpinnings Of Llm-driven Autonomous Systems He Jianliang, Chen Siyu, Zhang Fengzhuo, Yang Zhuoran
- Efficient LLM Inference With Kcache He Qiaozhi, Wu Zhihua
- Can Language Models Act As Knowledge Bases At Scale? He Qiyuan, Wang Yizhong, Wang Wenya
- Pitvqa: Image-grounded Text Embedding LLM For Visual Question Answering In Pituitary Surgery He Runlong, Xu Mengya, Das Adrito, Khan Danyal Z., Bano Sophia, Marcus Hani J., Stoyanov Danail, Clarkson Matthew J., Islam Mobarakol
- Achieving Tool Calling Functionality In Llms Using Only Prompt Engineering Without Fine-tuning He Shengtao
- What Matters In Transformers? Not All Attention Is Needed He Shwai, Sun Guoheng, Shen Zheyu, Li Ang
- G-retriever: Retrieval-augmented Generation For Textual Graph Understanding And Question Answering He Xiaoxin, Tian Yijun, Sun Yifei, Chawla Nitesh V., Laurent Thomas, Lecun Yann, Bresson Xavier, Hooi Bryan
- Transferring Troubles: Cross-lingual Transferability Of Backdoor Attacks In Llms With Instruction Tuning He Xuanli, Wang Jun, Xu Qiongkai, Minervini Pasquale, Stenetorp Pontus, Rubinstein Benjamin I. P., Cohn Trevor
- Llms Meet Multimodal Generation And Editing: A Survey He Yingqing, Liu Zhaoyang, Chen Jingye, Tian Zeyue, Liu Hongyu, Chi Xiaowei, Liu Runtao, Yuan Ruibin, Xing Yazhou, Wang Wenhai, Dai Jifeng, Zhang Yong, Xue Wei, Liu Qifeng, Guo Yike, Chen Qifeng
- Automated Black-box Prompt Engineering For Personalized Text-to-image Generation He Yutong, Robey Alexander, Murata Naoki, Jiang Yiding, Williams Joshua, Pappas George J., Hassani Hamed, Mitsufuji Yuki, Salakhutdinov Ruslan, Kolter J. Zico
- Camelot: Towards Large Language Models With Training-free Consolidated Associative Memory He Zexue, Karlinsky Leonid, Kim Donghyun, Mcauley Julian, Krotov Dmitry, Feris Rogerio
- Let The Code LLM Edit Itself When You Edit The Code He Zhenyu, Zhang Jun, Luo Shengjie, Xu Jingjing, Zhang Zhi, He Di
- Telechat Technical Report He Zhongjiang, Wang Zihan, Liu Xinzhang, Liu Shixuan, Yao Yitong, Huang Yuyao, Li Xuelong, Li Yongxiang, Che Zhonghao, Zhang Zhaoxi, Wang Yan, Wang Xin, Pu Luwen, Xu Huinan, Fang Ruiyu, Zhao Yu, Zhang Jie, Huang Xiaomeng, Lu Zhilong, Peng Jiaxin, Zheng Wenjun, Wang Shiquan, Yang Bingkai, He Xuewei, Jiang Zhuoru, Xie Qiyi, Zhang Yanhan, Li Zhongqiu, Shi Lingling, Fu Weiwei, Zhang Yin, Huang Zilu, Xiong Sishi, Zhang Yuxiang, Wang Chao, Song Shuangyong
- HMT: Hierarchical Memory Transformer For Long Context Language Processing He Zifan, Qin Zongyue, Prakriya Neha, Sun Yizhou, Cong Jason
- A Piece Of Theatre: Investigating How Teachers Design LLM Chatbots To Assist Adolescent Cyberbullying Education Hedderich Michael A., Bazarova Natalie N., Zou Wenting, Shim Ryun, Ma Xinda, Yang Qian
- Evaluation Of The Programming Skills Of Large Language Models Heitz Luc Bryan, Chamas Joun, Scherb Christopher
- Intent-conditioned And Non-toxic Counterspeech Generation Using Multi-task Instruction Tuning With RLAIF Hengle Amey, Kumar Aswini, Singh Sahajpreet, Bandhakavi Anil, Akhtar Md Shad, Chakroborty Tanmoy
- Can Large Language Models Make The Grade? An Empirical Study Evaluating Llms Ability To Mark Short Answer Questions In K-12 Education Henkel Owen, Boxer Adam, Hills Libby, Roberts Bill
- N-gram Prediction And Word Difference Representations For Language Modeling Heo Dongnyeong, Rim Daniela Noemi, Choi Heeyoul
- On Overcoming Miscalibrated Conversational Priors In Llm-based Chatbots Herlihy Christine, Neville Jennifer, Schnabel Tobias, Swaminathan Adith
- Generative Visual Instruction Tuning Hernandez Jefferson, Villegas Ruben, Ordonez Vicente
- [lions: 1] And [tigers: 2] And [bears: 3], Oh My! Literary Coreference Annotation With Llms Hicke Rebecca M. M., Mimno David
- Eliciting Personality Traits In Large Language Models Hilliard Airlie, Munoz Cristian, Wu Zekun, Koshiyama Adriano Soares
- Llava-gemma: Accelerating Multimodal Foundation Models With A Compact Language Model Hinck Musashi, Olson Matthew L., Cobbley David, Tseng Shao-yen, Lal Vasudev
- Deepspeed-fastgen: High-throughput Text Generation For Llms Via MII And Deepspeed-inference Holmes Connor, Tanaka Masahiro, Wyatt Michael, Awan Ammar Ahmad, Rasley Jeff, Rajbhandari Samyam, Aminabadi Reza Yazdani, Qin Heyang, Bakhtiari Arash, Kurilenko Lev, He Yuxiong
- Llm-aided Compilation For Tensor Accelerators Hong Charles, Bhatia Sahil, Haan Altan, Dong Shengjun Kris, Nikiforov Dima, Cheung Alvin, Shao Yakun Sophia
- Game Development As Human-llm Interaction Hong Jiale, Wu Hongqiu, Zhao Hai
- Accelerating Multilingual Language Model For Excessively Tokenized Languages Hong Jimin, Lee Gibbeum, Cho Jaewoong
- Outcome-constrained Large Language Models For Countering Hate Speech Hong Lingzi, Luo Pengcheng, Blanco Eduardo, Song Xiaoying
- Evaluating Llms' Mathematical And Coding Competency Through Ontology-guided Interventions Hong Pengfei, Majumder Navonil, Ghosal Deepanway, Aditya Somak, Mihalcea Rada, Poria Soujanya
- Curiosity-driven Red-teaming For Large Language Models Hong Zhang-wei, Shenfeld Idan, Wang Tsun-hsuan, Chuang Yung-sung, Pareja Aldo, Glass James, Srivastava Akash, Agrawal Pulkit
- Securing Vision-language Models With A Robust Encoder Against Jailbreak And Adversarial Attacks Hossain Md Zarif, Imteaj Ahmed
- V-star: Training Verifiers For Self-taught Reasoners Hosseini Arian, Yuan Xingdi, Malkin Nikolay, Courville Aaron, Sordoni Alessandro, Agarwal Rishabh
- Large Language Models As Misleading Assistants In Conversation Hou Betty Li, Shi Kejian, Phang Jason, Aung James, Adler Steven, Campbell Rosie
- Wikicontradict: A Benchmark For Evaluating Llms On Real-world Knowledge Conflicts From Wikipedia Hou Yufang, Pascale Alessandra, Carnerero-cano Javier, Tchrakian Tigran, Marinescu Radu, Daly Elizabeth, Padhi Inkit, Sattigeri Prasanna
- Chatglm-rlhf: Practices Of Aligning Large Language Models With Human Feedback Hou Zhenyu, Niu Yilin, Du Zhengxiao, Zhang Xiaohan, Liu Xiao, Zeng Aohan, Zheng Qinkai, Huang Minlie, Wang Hongning, Tang Jie, Dong Yuxiao
- Uncovering Bias In Large Vision-language Models At Scale With Counterfactuals Howard Phillip, Fraser Kathleen C., Bhiwandiwalla Anahita, Kiritchenko Svetlana
- Large Language Model-powered Chatbots For Internationalizing Student Support In Higher Education Hsain Achraf, Housni Hamza El
- Game Generation Via Large Language Models Hu Chengpeng, Zhao Yunlong, Liu Jialin
- Multimodal Chain-of-thought Reasoning Via Chatgpt To Protect Children From Age-inappropriate Apps Hu Chuanbo, Liu Bin, Yin Minglei, Zhou Yilu, Li Xin
- Openrlhf: An Easy-to-use, Scalable And High-performance RLHF Framework Hu Jian, Wu Xibin, Wang Weixun, Xianyu, Zhang Dehao, Cao Yu
- Characterization Of Large Language Model Development In The Datacenter Hu Qinghao, Ye Zhisheng, Wang Zerui, Wang Guoteng, Zhang Meng, Chen Qiaoling, Sun Peng, Lin Dahua, Wang Xiaolin, Luo Yingwei, Wen Yonggang, Zhang Tianwei
- Minicpm: Unveiling The Potential Of Small Language Models With Scalable Training Strategies Hu Shengding, Tu Yuge, Han Xu, He Chaoqun, Cui Ganqu, Long Xiang, Zheng Zhi, Fang Yewei, Huang Yuxiang, Zhao Weilin, Zhang Xinrong, Thai Zheng Leng, Zhang Kaihuo, Wang Chongyi, Yao Yuan, Zhao Chenyang, Zhou Jie, Cai Jie, Zhai Zhongwu, Ding Ning, Jia Chao, Zeng Guoyang, Li Dahai, Liu Zhiyuan, Sun Maosong
- 3D Building Generation In Minecraft Via Large Language Models Hu Shiying, Huang Zengrong, Hu Chengpeng, Liu Jialin
- A Survey On Large Language Model-based Game Agents Hu Sihao, Huang Tiansheng, Ilhan Fatih, Tekin Selim, Liu Gaowen, Kompella Ramana, Liu Ling
- FOKE: A Personalized And Explainable Education Framework Integrating Foundation Models, Knowledge Graphs, And Prompt Engineering Hu Silan, Wang Xiaoning
- Matryoshka Query Transformer For Large Vision-language Models Hu Wenbo, Dou Zi-yi, Li Liunian Harold, Kamath Amita, Peng Nanyun, Chang Kai-wei
- Refchecker: Reference-based Fine-grained Hallucination Checker And Benchmark For Large Language Models Hu Xiangkun, Ru Dongyu, Qiu Lin, Guo Qipeng, Zhang Tianhang, Xu Yang, Luo Yun, Liu Pengfei, Zhang Yue, Zhang Zheng
- ELLA: Equip Diffusion Models With LLM For Enhanced Semantic Alignment Hu Xiwei, Wang Rui, Fang Yixiao, Fu Bin, Cheng Pei, Yu Gang
- Gentranslate: Large Language Models Are Generative Multilingual Speech And Machine Translators Hu Yuchen, Chen Chen, Yang Chao-han Huck, Li Ruizhe, Zhang Dong, Chen Zhehuai, Chng Eng Siong
- RAG And RAU: A Survey On Retrieval-augmented Language Model In Natural Language Processing Hu Yucheng, Lu Yuxing
- Can Perplexity Reflect Large Language Model's Ability In Long Text Understanding? Hu Yutong, Huang Quzhe, Tao Mingxu, Zhang Chen, Feng Yansong
- Rethinking Llm-based Preference Evaluation Hu Zhengyu, Song Linxin, Zhang Jieyu, Xiao Zheyuan, Wang Jingang, Chen Zhenyu, Xiong Hui
- An Intermediate Fusion Vit Enables Efficient Text-image Alignment In Diffusion Models Hu Zizhao, Jia Shaochong, Rostami Mohammad
- V2xum-llm: Cross-modal Video Summarization With Temporal Prompt Instruction Tuning Hua Hang, Tang Yunlong, Xu Chenliang, Luo Jiebo
- Disentangling Logic: The Role Of Context In Large Language Model Reasoning Capabilities Hua Wenyue, Zhu Kaijie, Li Lingyao, Fan Lizhou, Lin Shuhang, Jin Mingyu, Xue Haochen, Li Zelong, Wang Jindong, Zhang Yongfeng
- Talk Less, Interact Better: Evaluating In-context Conversational Adaptation In Multimodal Llms Hua Yilun, Artzi Yoav
- Applying And Evaluating Large Language Models In Mental Health Care: A Scoping Review Of Human-assessed Generative Tasks Hua Yining, Na Hongbin, Li Zehan, Liu Fenglin, Fang Xiao, Clifton David, Torous John
- Multimodal Task Vectors Enable Many-shot Multimodal In-context Learning Huang Brandon, Mitra Chancharik, Arbelle Assaf, Karlinsky Leonid, Darrell Trevor, Herzig Roei
- Pokergpt: An End-to-end Lightweight Solver For Multi-player Texas Hold'em Via Large Language Model Huang Chenghao, Cao Yanbo, Wen Yinlong, Zhou Tao, Zhang Yanru
- What Affects The Stability Of Tool Learning? An Empirical Study On The Robustness Of Tool Learning Frameworks Huang Chengrui, Shi Zhengliang, Wen Yuntao, Chen Xiuying, Han Peng, Gao Shen, Shang Shuo
- Training Language Models To Generate Text With Citations Via Fine-grained Rewards Huang Chengyu, Wu Zeqiu, Hu Yushi, Wang Wenya
- Recurrent Context Compression: Efficiently Expanding The Context Window Of LLM Huang Chensen, Zhu Guibo, Wang Xuepeng, Luo Yifei, Ge Guojing, Chen Haoran, Yi Dong, Wang Jinqiao
- LITA: Language Instructed Temporal-localization Assistant Huang De-an, Liao Shijia, Radhakrishnan Subhashree, Yin Hongxu, Molchanov Pavlo, Yu Zhiding, Kautz Jan
- Verbalized Probabilistic Graphical Modeling With Large Language Models Huang Hengguan, Shen Xing, Wang Songtao, Liu Dianbo, Wang Hao
- Mitigating Catastrophic Forgetting In Large Language Models With Self-synthesized Rehearsal Huang Jianheng, Cui Leyang, Wang Ante, Yang Chengyi, Liao Xinting, Song Linfeng, Yao Junfeng, Su Jinsong
- Learning Fine-grained Grounded Citations For Attributed Large Language Models Huang Lei, Feng Xiaocheng, Ma Weitao, Gu Yuxuan, Zhong Weihong, Feng Xiachong, Yu Weijiang, Peng Weihua, Tang Duyu, Tu Dandan, Qin Bing
- Translate-and-revise: Boosting Large Language Models For Constrained Translation Huang Pengcheng, Mu Yongyu, Wu Yuzhang, Li Bei, Xiao Chunyang, Xiao Tong, Zhu Jingbo
- Selective Prompting Tuning For Personalized Conversations With Llms Huang Qiushi, Liu Xubo, Ko Tom, Wu Bo, Wang Wenwu, Zhang Yu, Tang Lilian
- The N+ Implementation Details Of RLHF With PPO: A Case Study On TL;DR Summarization Huang Shengyi, Noukhovitch Michael, Hosseini Arian, Rasul Kashif, Wang Weixun, Tunstall Lewis
- Antidote: Post-fine-tuning Safety Alignment For Large Language Models Against Harmful Fine-tuning Huang Tiansheng, Bhattacharya Gautam, Joshi Pratik, Kimball Josh, Liu Ling
- Vaccine: Perturbation-aware Alignment For Large Language Models Against Harmful Fine-tuning Huang Tiansheng, Hu Sihao, Liu Ling
- Brainchat: Decoding Semantic Information From Fmri Using Vision-language Pretrained Models Huang Wanaiu
- An Empirical Study Of Llama3 Quantization: From Llms To Mllms Huang Wei, Zheng Xingyu, Ma Xudong, Qin Haotong, Lv Chengtao, Chen Hong, Luo Jie, Qi Xiaojuan, Liu Xianglong, Magno Michele
- Prompting Large Language Models With Knowledge Graphs For Question Answering Involving Long-tail Facts Huang Wenyu, Zhou Guancheng, Lapata Mirella, Vougiouklis Pavlos, Montella Sebastien, Pan Jeff Z.
- WESE: Weak Exploration To Strong Exploitation For LLM Agents Huang Xu, Liu Weiwen, Chen Xiaolong, Wang Xingmei, Lian Defu, Wang Yasheng, Tang Ruiming, Chen Enhong
- Leveraging Large Language Models For Enhanced NLP Task Performance Through Knowledge Distillation And Optimized Training Strategies Huang Yining, Tang Keke, Chen Meilian
- Trustllm: Trustworthiness In Large Language Models Huang Yue, Sun Lichao, Wang Haoran, Wu Siyuan, Zhang Qihui, Li Yuan, Gao Chujie, Huang Yixin, Lyu Wenhan, Zhang Yixuan, Li Xiner, Liu Zhengliang, Liu Yixin, Wang Yijue, Zhang Zhikun, Vidgen Bertie, Kailkhura Bhavya, Xiong Caiming, Xiao Chaowei, Li Chunyuan, Xing Eric, Huang Furong, Liu Hao, Ji Heng, Wang Hongyi, Zhang Huan, Yao Huaxiu, Kellis Manolis, Zitnik Marinka, Jiang Meng, Bansal Mohit, Zou James, Pei Jian, Liu Jian, Gao Jianfeng, Han Jiawei, Zhao Jieyu, Tang Jiliang, Wang Jindong, Vanschoren Joaquin, Mitchell John, Shu Kai, Xu Kaidi, Chang Kai-wei, He Lifang, Huang Lifu, Backes Michael, Gong Neil Zhenqiang, Yu Philip S., Chen Pin-yu, Gu Quanquan, Xu Ran, Ying Rex, Ji Shuiwang, Jana Suman, Chen Tianlong, Liu Tianming, Zhou Tianyi, Wang William, Li Xiang, Zhang Xiangliang, Wang Xiao, Xie Xing, Chen Xun, Wang Xuyu, Liu Yan, Ye Yanfang, Cao Yinzhi, Chen Yong, Zhao Yue
- Obscureprompt: Jailbreaking Large Language Models Via Obscure Input Huang Yue, Tang Jingyu, Chen Dongping, Tang Bingda, Wan Yao, Sun Lichao, Zhang Xiangliang
- Fastfid: Improve Inference Efficiency Of Open Domain Question Answering Via Sentence Selection Huang Yufei, Han Xu, Sun Maosong
- Active Testing Of Large Language Model Via Multi-stage Sampling Huang Yuheng, Song Jiayang, Hu Qiang, Juefei-xu Felix, Ma Lei
- Compression Represents Intelligence Linearly Huang Yuzhen, Zhang Jinghan, Shan Zifei, He Junxian
- Olympicarena: Benchmarking Multi-discipline Cognitive Reasoning For Superintelligent AI Huang Zhen, Wang Zengzhi, Xia Shijie, Li Xuefeng, Zou Haoyang, Xu Ruijie, Fan Run-ze, Ye Lyumanshan, Chern Ethan, Ye Yixin, Zhang Yikai, Yang Yuqing, Wu Ting, Wang Binjie, Sun Shichao, Xiao Yang, Li Yiyuan, Zhou Fan, Chern Steffi, Qin Yiwei, Ma Yan, Su Jiadi, Liu Yixiu, Zheng Yuxiang, Zhang Shaoting, Lin Dahua, Qiao Yu, Liu Pengfei
- Tool Calling: Enhancing Medication Consultation Via Retrieval-augmented Large Language Models Huang Zhongzhen, Xue Kui, Fan Yongqi, Mu Linjie, Liu Ruoyu, Ruan Tong, Zhang Shaoting, Zhang Xiaofan
- Enhancing The Capability And Robustness Of Large Language Models Through Reinforcement Learning-driven Query Refinement Huang Zisu, Wang Xiaohua, Zhang Feiran, Xu Zhibo, Zhang Cenyuan, Zheng Xiaoqing, Huang Xuanjing
- Instruction Fine-tuning: Does Prompt Loss Matter? Huerta-enochian Mathew, Ko Seung Yong
- Towards A Psychology Of Machines: Large Language Models Predict Human Memory Huff Markus, Ulakçı Elanur
- HFT: Half Fine-tuning For Large Language Models Hui Tingfeng, Zhang Zhenyu, Wang Shuohuan, Xu Weiran, Sun Yu, Wu Hua
- Federated Recommendation Via Hybrid Retrieval Augmented Generation Zeng Huimin, Yue Zhenrui, Jiang Qian, Wang Dong
- Self-introspective Decoding: Alleviating Hallucinations For Large Vision-language Models Huo Fushuo, Xu Wenchao, Zhang Zhong, Wang Haozhao, Chen Zhicheng, Zhao Peilin
- On Effects Of Steering Latent Representation For Large Language Model Unlearning Huu-tien Dang, Pham Trung-tin, Thanh-tung Hoang, Inoue Naoya
- Enhancing Incremental Summarization With Structured Representations Hwang Eunjeong, Zhou Yichao, Wendt James Bradley, Gunel Beliz, Vo Nguyen, Xie Jing, Tata Sandeep
- Self-explore To Avoid The Pit: Improving The Reasoning Capabilities Of Language Models With Fine-grained Rewards Hwang Hyeonbin, Kim Doyoung, Kim Seungone, Ye Seonghyeon, Seo Minjoon
- Beneath The Surface Of Consistency: Exploring Cross-lingual Knowledge Representation Sharing In Llms Ifergan Maxim, Choshen Leshem, Aharoni Roee, Szpektor Idan, Abend Omri
- Assessing Llms Suitability For Knowledge Graph Completion Iga Vasile Ionut Remus, Silaghi Gheorghe Cosmin
- On The Generalization Of Preference Learning With DPO Im Shawn, Li Yixuan
- Understanding The Learning Dynamics Of Alignment With Human Feedback Im Shawn, Li Yixuan
- Hallucinations Or Attention Misdirection? The Path To Strategic Value Extraction In Business Using Large Language Models Ioste Aline
- Facilitating Holistic Evaluations With Llms: Insights From Scenario-based Experiments Ishida Toru, Liu Tongxi, Wang Hailong, Cheung William K.
- Elements Of World Knowledge (EWOK): A Cognition-inspired Framework For Evaluating Basic World Knowledge In Language Models Ivanova Anna A., Sathe Aalok, Lipkin Benjamin, Kumar Unnathi, Radkani Setayesh, Clark Thomas H., Kauf Carina, Hu Jennifer, Pramod R. T., Grand Gabriel, Paulun Vivian, Ryskina Maria, Akyürek Ekin, Wilcox Ethan, Rashid Nafisa, Choshen Leshem, Levy Roger, Fedorenko Evelina, Tenenbaum Joshua, Andreas Jacob
- Quality Or Quantity? On Data Scale And Diversity In Adapting Large Language Models For Low-resource Translation Iyer Vivek, Malik Bhavitvya, Stepachev Pavel, Chen Pinzhen, Haddow Barry, Birch Alexandra
- Large Language Models For Behavioral Economics: Internal Validity And Elicitation Of Mental Models Jabarian Brian
- From RAG To RICHES: Retrieval Interlaced With Sequence Generation Jain Palak, Soares Livio Baldini, Kwiatkowski Tom
- Structsum Generation For Faster Text Comprehension Jain Parag, Marzoca Andreea, Piccinno Francesco
- Ffn-skipllm: A Hidden Gem For Autoregressive Decoding With Adaptive Feed Forward Skipping Jaiswal Ajay, Hu Bodun, Yin Lu, Ro Yeonju, Liu Shiwei, Chen Tianlong, Akella Aditya
- Jamba-1.5: Hybrid Transformer-mamba Models At Scale Jamba Team, Lenz Barak, Arazi Alan, Bergman Amir, Manevich Avshalom, Peleg Barak, Aviram Ben, Almagor Chen, Fridman Clara, Padnos Dan, Gissin Daniel, Jannai Daniel, Muhlgay Dor, Zimberg Dor, Gerber Edden M, Dolev Elad, Krakovsky Eran, Safahi Erez, Schwartz Erez, Cohen Gal, Shachaf Gal, Rozenblum Haim, Bata Hofit, Blass Ido, Magar Inbal, Dalmedigos Itay, Osin Jhonathan, Fadlon Julie, Rozman Maria, Danos Matan, Gokhman Michael, Zusman Mor, Gidron Naama, Ratner Nir, Gat Noam, Rozen Noam, Fried Oded, Leshno Ohad, Antverg Omer, Abend Omri, Lieber Opher, Dagan Or, Cohavi Orit, Alon Raz, Belson Ro'i, Cohen Roi, Gilad Rom, Glozman Roman, Lev Shahar, Meirom Shaked, Delbari Tal, Ness Tal, Asida Tomer, Gal Tom Ben, Braude Tom, Pumerantz Uriya, Cohen Yehoshua, Belinkov Yonatan, Globerson Yuval, Levy Yuval Peleg, Shoham Yoav
- Smart-infinity: Fast Large Language Model Training Using Near-storage Processing On A Real System Jang Hongsun, Song Jaeyong, Jung Jaewon, Park Jaeyoung, Kim Youngsok, Lee Jinho
- Evaluating Students' Open-ended Written Responses With Llms: Using The RAG Framework For GPT-3.5, GPT-4, Claude-3, And Mistral-large Jauhiainen Jussi S., Guerra Agustín Garagorry
- Performance Of Recent Large Language Models For A Low-resourced Language Jayakody Ravindu, Dias Gihan
- RLSF: Reinforcement Learning Via Symbolic Feedback Jha Piyush, Jana Prithwish, Arora Arnav, Ganesh Vijay
- Memeguard: An LLM And Vlm-based Framework For Advancing Content Moderation Via Meme Intervention Jha Prince, Jain Raghav, Mandal Konika, Chadha Aman, Saha Sriparna, Bhattacharyya Pushpak
- Linking In-context Learning In Transformers To Human Episodic Memory Ji-an Li, Zhou Corey Y., Benna Marcus K., Mattar Marcelo G.
- Generalizing Reward Modeling For Out-of-distribution Preference Learning Jia Chen
- Bwarea Model: Learning World Model, Inverse Dynamics, And Policy For Controllable Language Generation Jia Chengxing, Wang Pengyuan, Li Ziniu, Li Yi-chen, Zhang Zhilong, Tang Nan, Yu Yang
- Knowledge Adaptation From Large Language Model To Recommendation For Practical Industrial Application Jia Jian, Wang Yipei, Li Yan, Chen Honggang, Bai Xuehan, Liu Zhaocheng, Liang Jian, Chen Quan, Li Han, Jiang Peng, Gai Kun
- Leveraging Llms For Dialogue Quality Measurement Jia Jinghan, Komma Abi, Leffel Timothy, Peng Xujun, Nagesh Ajay, Soliman Tamer, Galstyan Aram, Kumar Anoop
- Simulbench: Evaluating Language Models With Creative Simulation Tasks Jia Qi, Yue Xiang, Zheng Tianyu, Huang Jie, Lin Bill Yuchen
- Langsuite: Planning, Controlling And Interacting With Large Language Models In Embodied Text Environments Jia Zixia, Wang Mengmeng, Tong Baichen, Zhu Song-chun, Zheng Zilong
- Heuristic-enhanced Candidates Selection Strategy For Gpts Tackle Few-shot Aspect-based Sentiment Analysis Jiang Baoxing, Wan Yujie, Ju Shenggen
- Hal-eval: A Universal And Fine-grained Hallucination Evaluation Framework For Large Vision Language Models Jiang Chaoya, Ye Wei, Dong Mengfan, Jia Hongrui, Xu Haiyang, Yan Ming, Zhang Ji, Zhang Shikun
- Interpretable Catastrophic Forgetting Of Large Language Model Fine-tuning Via Instruction Vector Jiang Gangwei, Jiang Caigao, Li Zhaoyi, Xue Siqiao, Zhou Jun, Song Linqi, Lian Defu, Wei Ying
- Llm-collaboration On Automatic Science Journalism For The General Audience Jiang Gongyao, Shi Xinran, Luo Qiong
- The Real, The Better: Aligning Large Language Models With Online Human Behaviors Jiang Guanying, Yan Lingyong, Shi Haibo, Yin Dawei
- Raising The Bar: Investigating The Values Of Large Language Models Via Generative Evolving Testing Jiang Han, Yi Xiaoyuan, Wei Zhihua, Wang Shu, Xie Xing
- Time Matters: Enhancing Pre-trained News Recommendation Models With Robust User Dwell Time Injection Jiang Hao, Li Chuanzhen, An Mingxiao
- Detecting Scams Using Large Language Models Jiang Liming
- Seek And Solve Reasoning For Table Question Answering Jiang Ruya, Wang Chun, Deng Weihong
- SEED: Customize Large Language Models With Sample-efficient Adaptation For Code Generation Jiang Xue, Dong Yihong, Jin Zhi, Li Ge
- Retrieve, Summarize, Plan: Advancing Multi-hop Question Answering With An Iterative Approach Jiang Zhouyu, Sun Mengshu, Liang Lei, Zhang Zhiqiang
- Llms Can Find Mathematical Reasoning Mistakes By Pedagogical Chain-of-thought Jiang Zhuoxuan, Peng Haoyuan, Feng Shanshan, Li Fan, Li Dongsheng
- Megascale: Scaling Large Language Model Training To More Than 10,000 Gpus Jiang Ziheng, Lin Haibin, Zhong Yinmin, Huang Qi, Chen Yangrui, Zhang Zhi, Peng Yanghua, Li Xiang, Xie Cong, Nong Shibiao, Jia Yulu, He Sun, Chen Hongmin, Bai Zhihao, Hou Qi, Yan Shipeng, Zhou Ding, Sheng Yiyao, Jiang Zhuo, Xu Haohan, Wei Haoran, Zhang Zhang, Nie Pengfei, Zou Leqi, Zhao Sida, Xiang Liang, Liu Zherui, Li Zhe, Jia Xiaoying, Ye Jianxi, Jin Xin, Liu Xin
- Learning Planning-based Reasoning By Trajectories Collection And Process Reward Synthesizing Jiao Fangkai, Qin Chengwei, Liu Zhengyuan, Chen Nancy F., Joty Shafiq
- Prompt-based Length Controlled Generation With Multiple Control Types Jie Renlong, Meng Xiaojun, Shang Lifeng, Jiang Xin, Liu Qun
- APEER: Automatic Prompt Engineering Enhances Large Language Model Reranking Jin Can, Peng Hongwu, Zhao Shiyu, Wang Zhenting, Xu Wujiang, Han Ligong, Zhao Jiahui, Zhong Kai, Rajasekaran Sanguthevar, Metaxas Dimitris N.
- Ragcache: Efficient Knowledge Caching For Retrieval-augmented Generation Jin Chao, Zhang Zili, Jiang Xuanlin, Liu Fangyue, Liu Xin, Liu Xuanzhe, Jin Xin
- LLM Maybe Longlm: Self-extend LLM Context Window Without Tuning Jin Hongye, Han Xiaotian, Yang Jingfeng, Jiang Zhimeng, Liu Zirui, Chang Chia-yuan, Chen Huiyuan, Hu Xia
- BIDER: Bridging Knowledge Inconsistency For Efficient Retrieval-augmented Llms Via Key Supporting Evidence Jin Jiajie, Zhu Yutao, Zhou Yujia, Dou Zhicheng
- Can Chatgpt Support Developers? An Empirical Evaluation Of Large Language Models For Code Generation Jin Kailun, Wang Chung-yu, Pham Hung Viet, Hemmati Hadi
- Exploring Concept Depth: How Large Language Models Acquire Knowledge At Different Layers? Jin Mingyu, Yu Qinkai, Huang Jingyuan, Zeng Qingcheng, Wang Zhenting, Hua Wenyue, Zhao Haiyan, Mei Kai, Meng Yanda, Ding Kaize, Yang Fan, Du Mengnan, Zhang Yongfeng
- Reasoning Grasping Via Multimodal Large Language Model Jin Shiyu, Xu Jinxuan, Lei Yutian, Zhang Liangjun
- Adaptive Skeleton Graph Decoding Jin Shuowei, Wu Yongji, Zheng Haizhong, Zhang Qingzhao, Lentz Matthew, Mao Z. Morley, Prakash Atul, Qian Feng, Zhuo Danyang
- Self-selected Attention Span For Accelerating Large Language Model Inference Jin Tian, Yazar Wanzin, Xu Zifei, Sharify Sayeh, Wang Xin
- MULTIVERSE: Exposing Large Language Model Alignment Problems In Diverse Worlds Jin Xiaolong, Zhang Zhuo, Zhang Xiangyu
- Video-lavit: Unified Video-language Pre-training With Decoupled Visual-motional Tokenization Jin Yang, Sun Zhicheng, Xu Kun, Xu Kun, Chen Liwei, Jiang Hao, Huang Quzhe, Song Chengru, Liu Yuliang, Zhang Di, Song Yang, Gai Kun, Mu Yadong
- Read To Play (r2-play): Decision Transformer With Multimodal Game Instruction Jin Yonggang, Zhang Ge, Zhao Hao, Zheng Tianyu, Guo Jarvi, Xiang Liuyu, Yue Shawn, Huang Stephen W., He Zhaofeng, Fu Jie
- Cutting Off The Head Ends The Conflict: A Mechanism For Interpreting And Mitigating Knowledge Conflicts In Language Models Jin Zhuoran, Cao Pengfei, Yuan Hongbang, Chen Yubo, Xu Jiexin, Li Huaijun, Jiang Xiaojian, Liu Kang, Zhao Jun
- Understanding The Impact Of Long-term Memory On Self-disclosure With Large Language Model-driven Chatbots For Public Health Intervention Jo Eunkyung, Jeong Yuin, Park Sohyun, Epstein Daniel A., Kim Young-ho
- SMART: Automatically Scaling Down Language Models With Accuracy Guarantees For Reduced Processing Fees Jo Saehan, Trummer Immanuel
- LLMRS: Unlocking Potentials Of Llm-based Recommender Systems For Software Purchase John Angela, Aidoo Theophilus, Behmanush Hamayoon, Gunduz Irem B., Shrestha Hewan, Rahman Maxx Richard, Maaß Wolfgang
- Doing Personal LAPS: Llm-augmented Dialogue Construction For Personalized Multi-session Conversational Search Joko Hideaki, Chatterjee Shubham, Ramsay Andrew, De Vries Arjen P., Dalton Jeff, Hasibi Faegheh
- How Large Language Models Encode Context Knowledge? A Layer-wise Probing Study Ju Tianjie, Sun Weiwei, Du Wei, Yuan Xinwei, Ren Zhaochun, Liu Gongshen
- Effective Context Selection In Llm-based Leaderboard Generation: An Empirical Study Kabongo Salomon, D'souza Jennifer, Auer Sören
- A Survey On Hardware Accelerators For Large Language Models Kachris Christoforos
- RAP: Retrieval-augmented Planning With Contextual Memory For Multimodal LLM Agents Kagaya Tomoyuki, Yuan Thong Jing, Lou Yuxuan, Karlekar Jayashree, Pranata Sugiri, Kinose Akira, Oguri Koki, Wick Felix, You Yang
- Language Repository For Long Video Understanding Kahatapitiya Kumara, Ranasinghe Kanchana, Park Jongwoo, Ryoo Michael S.
- Scaling Laws For Forgetting When Fine-tuning Large Language Models Kalajdzievski Damjan
- LLM-ARC: Enhancing Llms With An Automated Reasoning Critic Kalyanpur Aditya, Saravanakumar Kailash Karthik, Barres Victor, Chu-carroll Jennifer, Melville David, Ferrucci David
- Fiddler: CPU-GPU Orchestration For Fast Inference Of Mixture-of-experts Models Kamahori Keisuke, Gu Yile, Zhu Kan, Kasikci Baris
- Eagle: Ethical Dataset Given From Real Interactions Kaneko Masahiro, Bollegala Danushka, Baldwin Timothy
- Prompt-rag: Pioneering Vector Embedding-free Retrieval-augmented Generation In Niche Domains, Exemplified By Korean Medicine Kang Bongsu, Kim Jundong, Yun Tae-rim, Kim Chang-eop
- Can Large Language Models Be Good Emotional Supporter? Mitigating Preference Bias On Emotional Support Conversation Kang Dongjin, Kim Sunghwan, Kwon Taeyoon, Moon Seungjun, Cho Hyunsouk, Yu Youngjae, Lee Dongha, Yeo Jinyoung
- Unfamiliar Finetuning Examples Control How Language Models Hallucinate Kang Katie, Wallace Eric, Tomlin Claire, Kumar Aviral, Levine Sergey
- Continuous Language Model Interpolation For Dynamic And Controllable Text Generation Kangaslahti Sara, Alvarez-melis David
- MEDIC: Towards A Comprehensive Framework For Evaluating Llms In Clinical Applications Kanithi Praveen K, Christophe Clément, Pimentel Marco Af, Raha Tathagata, Saadi Nada, Javed Hamza, Maslenkova Svetlana, Hayat Nasir, Rajan Ronnie, Khan Shadab
- Visualizing Dialogues: Enhancing Image Selection Through Dialogue Understanding With Large Language Models Kao Chang-sheng, Chen Yun-nung
- BRAVE: Broadening The Visual Encoding Of Vision-language Models Kar Oğuzhan Fatih, Tonioni Alessio, Poklukar Petra, Kulshrestha Achin, Zamir Amir, Tombari Federico
- Teaching Machines To Code: Smart Contract Translation With Llms Karanjai Rabimba, Xu Lei, Shi Weidong
- Instruct, Not Assist: Llm-based Multi-turn Planning And Hierarchical Questioning For Socratic Code Debugging Kargupta Priyanka, Agarwal Ishika, Hakkani-tur Dilek, Han Jiawei
- Can Reinforcement Learning Unlock The Hidden Dangers In Aligned Large Language Models? Karkevandi Mohammad Bahrami, Vishwamitra Nishant, Najafirad Peyman
- Alpaca Against Vicuna: Using Llms To Uncover Memorization Of Llms Kassem Aly M., Mahmoud Omar, Mireshghallah Niloofar, Kim Hyunwoo, Tsvetkov Yulia, Choi Yejin, Saad Sherif, Rana Santu
- Finding A Needle In The Adversarial Haystack: A Targeted Paraphrasing Approach For Uncovering Edge Cases With Minimal Distribution Distortion Kassem Aly M., Saad Sherif
- Faaf: Facts As A Function For The Evaluation Of Generated Text Katranidis Vasileios, Barany Gabor
- Instruct-skillmix: A Powerful Pipeline For LLM Instruction Tuning Kaur Simran, Park Simon, Goyal Anirudh, Arora Sanjeev
- Codeaid: Evaluating A Classroom Deployment Of An Llm-based Programming Assistant That Balances Student And Educator Needs Kazemitabaar Majeed, Ye Runlong, Wang Xiaoning, Henley Austin Z., Denny Paul, Craig Michelle, Grossman Tovi
- Exploring The Frontiers Of Llms In Psychological Applications: A Comprehensive Review Ke Luoma, Tong Song, Cheng Peng, Peng Kaiping
- Bridging The Preference Gap Between Retrievers And Llms Ke Zixuan, Kong Weize, Li Cheng, Zhang Mingyang, Mei Qiaozhu, Bendersky Michael
- Visiongpt: Vision-language Understanding Agent Using Generalized Multimodal Framework Kelly Chris, Hu Luhui, Yang Bang, Tian Yu, Yang Deshun, Yang Cindy, Huang Zaoshan, Li Zihao, Hu Jiayin, Zou Yuexian
- Autonomous Prompt Engineering In Large Language Models Kepel Daan, Valogianni Konstantina
- Source-aware Training Enables Knowledge Attribution In Language Models Khalifa Muhammad, Wadden David, Strubell Emma, Lee Honglak, Wang Lu, Beltagy Iz, Peng Hao
- Self-training Large Language Models For Improved Visual Program Synthesis With Visual Reinforcement Khan Zaid, Bg Vijay Kumar, Schulter Samuel, Fu Yun, Chandraker Manmohan
- ARGS: Alignment As Reward-guided Search Khanov Maxim, Burapacheep Jirayu, Li Yixuan
- How Good Is My Video LMM? Complex Video Reasoning And Robustness Evaluation Suite For Video-lmms Khattak Muhammad Uzair, Naeem Muhammad Ferjad, Hassan Jameel, Naseer Muzammal, Tombari Federico, Khan Fahad Shahbaz, Khan Salman
- Assessing Language Models' Worldview For Fiction Generation Khatun Aisha, Brown Daniel G.
- A Study On Large Language Models' Limitations In Multiple-choice Question Answering Khatun Aisha, Brown Daniel G.
- Beyond Code Generation: An Observational Study Of Chatgpt Usage In Software Engineering Practice Khojah Ranim, Mohamad Mazen, Leitner Philipp, Neto Francisco Gomes De Oliveira
- A Comprehensive Survey Of Accelerated Generation Techniques In Large Language Models Khoshnoodi Mahsa, Jain Vinija, Gao Mingye, Srikanth Malavika, Chadha Aman
- Shortened Llama: Depth Pruning For Large Language Models With Comparison Of Retraining Methods Kim Bo-kyeong, Kim Geonmin, Kim Tae-ho, Castells Thibault, Choi Shinkook, Shin Junho, Song Hyoung-kyu
- Aligning Large Language Models With Self-generated Preference Data Kim Dongyoung, Lee Kimin, Shin Jinwoo, Kim Jaehyung
- A Mechanistic Interpretation Of Syllogistic Reasoning In Auto-regressive Language Models Kim Geonhee, Valentino Marco, Freitas André
- Integrating Paralinguistics In Speech-empowered Large Language Models For Natural Conversation Kim Heeseung, Seo Soonshin, Jeong Kyeongseok, Kwon Ohsung, Kim Soyoon, Kim Jungwhan, Lee Jaehong, Song Eunwoo, Oh Myungwoo, Ha Jung-woo, Yoon Sungroh, Yoo Kang Min
- Fine-tuning CLIP Text Encoders With Two-step Paraphrasing Kim Hyunjae, Yoon Seunghyun, Bui Trung, Zhao Handong, Tran Quan, Dernoncourt Franck, Kang Jaewoo
- Sure: Summarizing Retrievals Using Answer Candidates For Open-domain QA Of Llms Kim Jaehyung, Nam Jaehyun, Mo Sangwoo, Park Jongjin, Lee Sang-woo, Seo Minjoon, Ha Jung-woo, Shin Jinwoo
- Finer: Investigating And Enhancing Fine-grained Visual Concept Recognition In Large Vision Language Models Kim Jeonghwan, Ji Heng
- Dialsim: A Real-time Simulator For Evaluating Long-term Dialogue Understanding Of Conversational Agents Kim Jiho, Chay Woosog, Hwang Hyeonji, Kyung Daeun, Chung Hyunseung, Cho Eunbyeol, Jo Yohan, Choi Edward
- Exploring Prompting Methods For Mitigating Class Imbalance Through Synthetic Data Generation With Large Language Models Kim Jinhee, Kim Taesung, Choo Jaegul
- Enhancing Psychotherapy Counseling: A Data Augmentation Pipeline Leveraging Large Language Models For Counseling Conversations Kim Jun-woo, Han Ji-eun, Koh Jun-seok, Seo Hyeon-tae, Chang Du-seong
- Automatic Jailbreaking Of The Text-to-image Generative AI Systems Kim Minseon, Lee Hyomin, Gong Boqing, Zhang Huishuai, Hwang Sung Ju
- Openvla: An Open-source Vision-language-action Model Kim Moo Jin, Pertsch Karl, Karamcheti Siddharth, Xiao Ted, Balakrishna Ashwin, Nair Suraj, Rafailov Rafael, Foster Ethan, Lam Grace, Sanketi Pannag, Vuong Quan, Kollar Thomas, Burchfiel Benjamin, Tedrake Russ, Sadigh Dorsa, Levine Sergey, Liang Percy, Finn Chelsea
- Large Language Models Meet Collaborative Filtering: An Efficient All-round Llm-based Recommender System Kim Sein, Kang Hongseok, Choi Seungyoon, Kim Donghyun, Yang Minchul, Park Chanyoung
- THEANINE: Revisiting Memory Management In Long-term Conversations With Timeline-augmented Response Generation Kim Seo Hyun, Ong Kai Tzu-iunn, Kwon Taeyoon, Kim Namyoung, Ka Keummin, Bae Seonghyeon, Jo Yohan, Hwang Seung-won, Lee Dongha, Yeo Jinyoung
- Is GPT-4 Alone Sufficient For Automated Essay Scoring?: A Comparative Judgment Approach Based On Rater Cognition Kim Seungju, Jo Meounggun
- Rethinking The Role Of Proxy Rewards In Language Model Alignment Kim Sungdong, Seo Minjoon
- A Survey On Integration Of Large Language Models With Intelligent Robots Kim Yeseung, Kim Dohyun, Choi Jieun, Park Jisang, Oh Nayoung, Park Daehyung
- Medexqa: Medical Question Answering Benchmark With Multiple Explanations Kim Yunsoo, Wu Jinge, Abdulle Yusuf, Wu Honghan
- Evaluating Telugu Proficiency In Large Language Models: A Comparative Analysis Of Chatgpt And Gemini Kishore Katikela Sreeharsha, Shaik Rahimanuddin
- Growover: How Can Llms Adapt To Growing Real-world Knowledge? Ko Dayoon, Kim Jinyoung, Choi Hahyeon, Kim Gunhee
- Investigating How Large Language Models Leverage Internal Knowledge To Perform Complex Reasoning Ko Miyoung, Park Sue Hyun, Park Joonsuk, Seo Minjoon
- Recmind: Japanese Movie Recommendation Dialogue With Seeker's Internal State Kodama Takashi, Kiyomaru Hirokazu, Huang Yin Jou, Kurohashi Sadao
- Cleared For Takeoff? Compositional & Conditional Reasoning May Be The Achilles Heel To (flight-booking) Language Agents Kohli Harsh, Sun Huan
- Blending Llms Into Cascaded Speech Translation: Kit's Offline Speech Translation System For IWSLT 2024 Koneru Sai, Nguyen Thai-binh, Pham Ngoc-quan, Liu Danni, Li Zhaolin, Waibel Alexander, Niehues Jan
- Self-prompt Tuning: Enable Autonomous Role-playing In Llms Kong Aobo, Zhao Shiwan, Chen Hao, Li Qicheng, Qin Yong, Sun Ruiqi, Zhou Xin, Zhou Jiaming, Sun Haoqin
- Aligning Large Language Models With Representation Editing: A Control Perspective Kong Lingkai, Wang Haorui, Mu Wenhao, Du Yuanqi, Zhuang Yuchen, Zhou Yifei, Song Yue, Zhang Rongzhi, Wang Kai, Zhang Chao
- Prewrite: Prompt Rewriting With Reinforcement Learning Kong Weize, Hombaiah Spurthi Amba, Zhang Mingyang, Mei Qiaozhu, Bendersky Michael
- QPO: Query-dependent Prompt Optimization Via Multi-loop Offline Reinforcement Learning Kong Yilun, Mao Hangyu, Zhao Qi, Zhang Bin, Ruan Jingqing, Shen Li, Chang Yongzhe, Wang Xueqian, Zhao Rui, Tao Dacheng
- Audio Flamingo: A Novel Audio Language Model With Few-shot Learning And Dialogue Abilities Kong Zhifeng, Goel Arushi, Badlani Rohan, Ping Wei, Valle Rafael, Catanzaro Bryan
- Proptest: Automatic Property Testing For Improved Visual Programming Koo Jaywon, Yang Ziyan, Cascante-bonilla Paola, Ray Baishakhi, Ordonez Vicente
- Large Language Model Driven Recommendation Korikov Anton, Sanner Scott, Deldjoo Yashar, He Zhankui, Mcauley Julian, Ramisa Arnau, Vidal Rene, Sathiamoorthy Mahesh, Kasrizadeh Atoosa, Milano Silvia, Ricci Francesco
- Uncertainty-aware Evaluation For Vision-language Models Kostumov Vasily, Nutfullin Bulat, Pilipenko Oleg, Ilyushin Eugene
- Hints-in-browser: Benchmarking Language Models For Programming Feedback Generation Kotalwar Nachiket, Gotovos Alkis, Singla Adish
- Arabiangpt: Native Arabic Gpt-based Large Language Model Koubaa Anis, Ammar Adel, Ghouti Lahouari, Najar Omar, Sibaee Serry
- Leveraging Large Language Models For Enhanced Process Model Comprehension Kourani Humam, Berti Alessandro, Henrich Jasmin, Kratsch Wolfgang, Weidlich Robin, Li Chiao-yun, Arslan Ahmad, Schuster Daniel, Van Der Aalst Wil M. P.
- Open Source Language Models Can Provide Feedback: Evaluating Llms' Ability To Help Students Using Gpt-4-as-a-judge Koutcheme Charles, Dainese Nicola, Sarsa Sami, Hellas Arto, Leinonen Juho, Denny Paul
- Large Language Models For Tuning Evolution Strategies Kramer Oliver
- Can Large Language Models Explore In-context? Krishnamurthy Akshay, Harris Keegan, Foster Dylan J., Zhang Cyril, Slivkins Aleksandrs
- Synthdst: Synthetic Data Is All You Need For Few-shot Dialog State Tracking Kulkarni Atharva, Tseng Bo-hsiang, Moniz Joel Ruben Antony, Piraviperumal Dhivya, Yu Hong, Bhargava Shruti
- Reinforcement Learning For Optimizing RAG For Domain Chatbots Kulkarni Mandar, Tangarajan Praveen, Kim Kyung, Trivedi Anusua
- Multi-modal Generation Via Cross-modal In-context Learning Kumar Amandeep, Naseer Muzammal, Narayan Sanath, Anwer Rao Muhammad, Khan Salman, Cholakkal Hisham
- Fine-tuning, Quantization, And Llms: Navigating Unintended Outcomes Kumar Divyanshu, Kumar Anurakt, Agarwal Sahil, Harshangi Prashanth
- Large Language Model Agents For Improving Engagement With Behavior Change Interventions: Application To Digital Mindfulness Kumar Harsh, Yoo Suhyeon, Bernuy Angela Zavaleta, Shi Jiakai, Luo Huayin, Williams Joseph, Kuzminykh Anastasia, Anderson Ashton, Kornfield Rachel
- Improving Socratic Question Generation Using Data Augmentation And Preference Optimization Kumar Nischal Ashok, Lan Andrew
- Harmonizing Code-mixed Conversations: Personality-assisted Code-mixed Response Generation In Dialogues Kumar Shivani, Chakraborty Tanmoy
- Babilong: Testing The Limits Of Llms With Long Context Reasoning-in-a-haystack Kuratov Yuri, Bulatov Aydar, Anokhin Petr, Rodkin Ivan, Sorokin Dmitry, Sorokin Artyom, Burtsev Mikhail
- Mt-eval: A Multi-turn Capabilities Evaluation Benchmark For Large Language Models Kwan Wai-chung, Zeng Xingshan, Jiang Yuxin, Wang Yufei, Li Liangyou, Shang Lifeng, Jiang Xin, Liu Qun, Wong Kam-fai
- Continual Learning Optimizations For Auto-regressive Decoder Of Multilingual ASR Systems Kwok Chin Yuen, Yip Jia Qi, Chng Eng Siong
- Evaluating Cultural Adaptability Of A Large Language Model Via Simulation Of Synthetic Personas Kwok Louis, Bravansky Michal, Griffin Lewis D.
- Are Llms Effective Negotiators? Systematic Evaluation Of The Multifaceted Capabilities Of Llms In Negotiation Dialogues Kwon Deuksin, Weiss Emily, Kulshrestha Tara, Chawla Kushal, Lucas Gale M., Gratch Jonathan
- Zero-shot Text-guided Infinite Image Synthesis With LLM Guidance Kwon Soyeong, Lee Taegyeong, Kim Taehwan
- Can Github Issues Be Solved With Tree Of Thoughts? La Rosa Ricardo, Hulse Corey, Liu Bangdi
- Thalle: Text Hyperlocally Augmented Large Language Extension -- Technical Report KBTG Labs, Khamnuansin Danupat, Petchsod Atthakorn, Lertpiya Anuruth, Balee Pornchanan, Lodkaew Thanawat, Chalothorn Tawunrat, Pongthawornkamol Thadpong, Lertsutthiwong Monchai
- Autowebglm: Bootstrap And Reinforce A Large Language Model-based Web Navigating Agent Lai Hanyu, Liu Xiao, Iong Iat Long, Yao Shuntian, Chen Yuxuan, Shen Pengbo, Yu Hao, Zhang Hanchen, Zhang Xiaohan, Dong Yuxiao, Tang Jie
- SEC-QA: A Systematic Evaluation Corpus For Financial QA Lai Viet Dac, Krumdick Michael, Lovering Charles, Reddy Varshini, Schmidt Craig, Tanner Chris
- Llms Beyond English: Scaling The Multilingual Capability Of Llms With Cross-lingual Feedback Lai Wen, Mesgar Mohsen, Fraser Alexander
- Alarm: Align Language Models Via Hierarchical Rewards Modeling Lai Yuhang, Wang Siyuan, Liu Shujun, Huang Xuanjing, Wei Zhongyu
- From Distributional To Overton Pluralism: Investigating Large Language Model Alignment Lake Thom, Choi Eunsol, Durrett Greg
- Cat-bench: Benchmarking Language Model Understanding Of Causal And Temporal Dependencies In Plans Lal Yash Kumar, Cohen Vanya, Chambers Nathanael, Balasubramanian Niranjan, Mooney Raymond
- Lapdoc: Layout-aware Prompting For Documents Lamott Marcel, Weweler Yves-noel, Ulges Adrian, Shafait Faisal, Krechel Dirk, Obradovic Darko
- Fishing For Magikarp: Automatically Detecting Under-trained Tokens In Large Language Models Land Sander, Bartolo Max
- Fine-tuning Language Models With Reward Learning On Policy Lang Hao, Huang Fei, Li Yongbin
- Adaptnmt: An Open-source, Language-agnostic Development Environment For Neural Machine Translation Lankford Séamus, Afli Haithem, Way Andy
- A Systematic Survey And Critical Review On Evaluating Large Language Models: Challenges, Limitations, And Recommendations Laskar Md Tahmid Rahman, Alqahtani Sawsan, Bari M Saiful, Rahman Mizanur, Khan Mohammad Abdullah Matin, Khan Haidar, Jahan Israt, Bhuiyan Amran, Tan Chee Wei, Parvez Md Rizwan, Hoque Enamul, Joty Shafiq, Huang Jimmy
- Query-opt: Optimizing Inference Of Large Language Models Via Multi-query Instructions In Meeting Summarization Laskar Md Tahmid Rahman, Khasanova Elena, Fu Xue-yong, Chen Cheng, Tn Shashi Bhushan
- Melting Point: Mobile Evaluation Of Language Transformers Laskaridis Stefanos, Katevas Kleomenis, Minto Lorenzo, Haddadi Hamed
- Sparse Rewards Can Self-train Dialogue Agents Lattimer Barrett Martin, Gangal Varun, Mcdonald Ryan, Yang Yi
- What Matters When Building Vision-language Models? Laurençon Hugo, Tronchon Léo, Cord Matthieu, Sanh Victor
- Sentiment Reasoning For Healthcare Le-duc Khai, Nguyen Khai-nguyen, Tat Bach Phan, Le Duy, Ngo Jerry, Vo-dang Long, Nguyen Anh Totti, Hy Truong-son
- Hip Attention: Sparse Sub-quadratic Attention With Hierarchical Attention Pruning Lee Heejun, Park Geon, Lee Youngwan, Kim Jina, Jeong Wonyoung, Jeon Myeongjae, Hwang Sung Ju
- Improving Conversational Abilities Of Quantized Large Language Models Via Direct Preference Alignment Lee Janghwan, Park Seongmin, Hong Sukjin, Kim Minsoo, Chang Du-seong, Choi Jungwook
- A Survey Of Large Language Models In Finance (finllms) Lee Jean, Stevens Nicholas, Han Soyeon Caren, Song Minseok
- Can Long-context Language Models Subsume Retrieval, RAG, SQL, And More? Lee Jinhyuk, Chen Anthony, Dai Zhuyun, Dua Dheeru, Sachan Devendra Singh, Boratko Michael, Luan Yi, Arnold Sébastien M. R., Perot Vincent, Dalmia Siddharth, Hu Hexiang, Lin Xudong, Pasupat Panupong, Amini Aida, Cole Jeremy R., Riedel Sebastian, Naim Iftekhar, Chang Ming-wei, Guu Kelvin
- The Life Cycle Of Large Language Models: A Review Of Biases In Education Lee Jinsook, Hicke Yann, Yu Renzhe, Brooks Christopher, Kizilcec René F.
- LRQ: Optimizing Post-training Quantization For Large Language Models By Learning Low-rank Weight-scaling Matrices Lee Jung Hyun, Kim Jeonghoon, Yang June Yong, Kwon Se Jung, Yang Eunho, Yoo Kang Min, Lee Dongsoo
- Token-supervised Value Models For Enhancing Mathematical Reasoning Capabilities Of Large Language Models Lee Jung Hyun, Yang June Yong, Heo Byeongho, Han Dongyoon, Yoo Kang Min
- Reinforcement Learning From Reflective Feedback (RLRF): Aligning And Improving Llms Via Fine-grained Self-reflection Lee Kyungjae, Hwang Dasol, Park Sunghyun, Jang Youngsoo, Lee Moontae
- LLM2LLM: Boosting Llms With Novel Iterative Data Enhancement Lee Nicholas, Wattanawong Thanakul, Kim Sehoon, Mangalam Karttikeya, Shen Sheng, Anumanchipalli Gopala, Mahoney Michael W., Keutzer Kurt, Gholami Amir
- Aligning To Thousands Of Preferences Via System Message Generalization Lee Seongyun, Park Sue Hyun, Kim Seungone, Seo Minjoon
- Llava-docent: Instruction Tuning With Multimodal Large Language Model To Support Art Appreciation Education Lee Unggi, Jeon Minji, Lee Yunseo, Byun Gyuri, Son Yoorim, Shin Jaeyoon, Ko Hongkyu, Kim Hyeoncheol
- Large Language Models Produce Responses Perceived To Be Empathic Lee Yoon Kyung, Suh Jina, Zhan Hongli, Li Junyi Jessy, Ong Desmond C.
- Learning To Reduce: Optimal Representations Of Structured Data In Prompting Large Language Models Lee Younghun, Kim Sungchul, Yu Tong, Rossi Ryan A., Chen Xiang
- Autocoder: Enhancing Code Large Language Model With AIEV-Instruct Lei Bin, Li Yuchen, Chen Qiuwu
- Are Llms Classical Or Nonmonotonic Reasoners? Lessons From Generics Leidinger Alina, Van Rooij Robert, Shutova Ekaterina
- Prexme! Large Scale Prompt Exploration Of Open Source Llms For Machine Translation And Summarization Evaluation Leiter Christoph, Eger Steffen
- Using Hallucinations To Bypass Gpt4's Filter Lemkin Benjamin
- Intent-based Prompt Calibration: Enhancing Prompt Optimization With Synthetic Boundary Cases Levi Elad, Brosh Eli, Friedmann Matan
- More RLHF, More Trust? On The Impact Of Human Preference Alignment On Language Model Trustworthiness Li Aaron J., Krishna Satyapriya, Lakkaraju Himabindu
- Cascade Reward Sampling For Efficient Decoding-time Alignment Li Bolian, Wang Yifan, Grama Ananth, Zhang Ruqi
- Are Bigger Encoders Always Better In Vision Large Models? Li Bozhou, Liang Hao, Meng Zimo, Zhang Wentao
- Contextualization Distillation From Large Language Model For Knowledge Graph Completion Li Dawei, Tan Zhen, Chen Tianlong, Liu Huan
- On The Role Of Long-tail Knowledge In Retrieval Augmented Large Language Models Li Dongyang, Yan Junbing, Zhang Taolin, Wang Chengyu, He Xiaofeng, Huang Longtao, Xue Hui, Huang Jun
- Hello Again! Llm-powered Personalized Agent For Long-term Dialogue Li Hao, Yang Chenghao, Zhang An, Deng Yang, Wang Xiang, Chua Tat-seng
- Backdoor Removal For Generative Large Language Models Li Haoran, Chen Yulin, Zheng Zihao, Hu Qi, Chan Chunkit, Liu Heshan, Song Yangqiu
- Revisiting Catastrophic Forgetting In Large Language Model Tuning Li Hongyu, Ding Liang, Fang Meng, Tao Dacheng
- Datacomp-lm: In Search Of The Next Generation Of Training Sets For Language Models Li Jeffrey, Fang Alex, Smyrnis Georgios, Ivgi Maor, Jordan Matt, Gadre Samir, Bansal Hritik, Guha Etash, Keh Sedrick, Arora Kushal, Garg Saurabh, Xin Rui, Muennighoff Niklas, Heckel Reinhard, Mercat Jean, Chen Mayee, Gururangan Suchin, Wortsman Mitchell, Albalak Alon, Bitton Yonatan, Nezhurina Marianna, Abbas Amro, Hsieh Cheng-yu, Ghosh Dhruba, Gardner Josh, Kilian Maciej, Zhang Hanlin, Shao Rulin, Pratt Sarah, Sanyal Sunny, Ilharco Gabriel, Daras Giannis, Marathe Kalyani, Gokaslan Aaron, Zhang Jieyu, Chandu Khyathi, Nguyen Thao, Vasiljevic Igor, Kakade Sham, Song Shuran, Sanghavi Sujay, Faghri Fartash, Oh Sewoong, Zettlemoyer Luke, Lo Kyle, El-nouby Alaaeldin, Pouransari Hadi, Toshev Alexander, Wang Stephanie, Groeneveld Dirk, Soldaini Luca, Koh Pang Wei, Jitsev Jenia, Kollar Thomas, Dimakis Alexandros G., Carmon Yair, Dave Achal, Schmidt Ludwig, Shankar Vaishaal
- A Cross-language Investigation Into Jailbreak Attacks In Large Language Models Li Jie, Liu Yi, Liu Chongyang, Shi Ling, Ren Xiaoning, Zheng Yaowen, Liu Yang, Xue Yinxing
- Steering Llms Towards Unbiased Responses: A Causality-guided Debiasing Framework Li Jingling, Tang Zeyu, Liu Xiaoyu, Spirtes Peter, Zhang Kun, Liu Leqi, Liu Yang
- Stylechat: Learning Recitation-augmented Memory In Llms For Stylized Dialogue Generation Li Jinpeng, Zhang Zekai, Tu Quan, Cheng Xin, Zhao Dongyan, Yan Rui
- Dissecting Human And LLM Preferences Li Junlong, Zhou Fan, Sun Shichao, Zhang Yikai, Zhao Hai, Liu Pengfei
- The Dawn After The Dark: An Empirical Study On Factuality Hallucination In Large Language Models Li Junyi, Chen Jie, Ren Ruiyang, Cheng Xiaoxue, Zhao Wayne Xin, Nie Jian-yun, Wen Ji-rong
- Dialogue Action Tokens: Steering Language Models In Goal-directed Dialogue With A Multi-turn Planner Li Kenneth, Wang Yiming, Viégas Fernanda, Wattenberg Martin
- Confidence Matters: Revisiting Intrinsic Self-correction Capabilities Of Large Language Models Li Loka, Chen Zhenhao, Chen Guangyi, Zhang Yixuan, Su Yusheng, Xing Eric, Zhang Kun
- Predicting Vs. Acting: A Trade-off Between World Modeling & Agent Modeling Li Margaret, Shi Weijia, Pagnoni Artidoro, West Peter, Holtzman Ari
- Selective Reflection-tuning: Student-selected Data Recycling For LLM Instruction-tuning Li Ming, Chen Lichang, Chen Jiuhai, He Shwai, Gu Jiuxiang, Zhou Tianyi
- Bridging The Language Gap: Enhancing Multilingual Prompt-based Code Generation In Llms Via Zero-shot Cross-lingual Transfer Li Mingda, Mishra Abhijit, Mujumdar Utkarsh
- Nearest Neighbor Speculative Decoding For LLM Generation And Attribution Li Minghan, Chen Xilun, Holtzman Ari, Chen Beidi, Lin Jimmy, Yih Wen-tau, Lin Xi Victoria
- Needlebench: Can Llms Do Retrieval And Reasoning In 1 Million Context Window? Li Mo, Zhang Songyang, Liu Yunxin, Chen Kai
- LLM Defenses Are Not Robust To Multi-turn Human Jailbreaks Yet Li Nathaniel, Han Ziwen, Steneker Ian, Primack Willow, Goodside Riley, Zhang Hugh, Wang Zifan, Menghini Cristina, Yue Summer
- Examining Post-training Quantization For Mixture-of-experts: A Benchmark Li Pingzhi, Jin Xiaolong, Cheng Yu, Chen Tianlong
- Anchored Answers: Unravelling Positional Bias In Gpt-2's Multiple-choice Questions Li Ruizhe, Gao Yanjun
- Temporal Insight Enhancement: Mitigating Temporal Hallucination In Multimodal Large Language Models Li Sun, Liuan Wang, Jun Sun, Takayuki Okatani
- Model-enhanced Llm-driven VUI Testing Of VPA Apps Li Suwan, Bu Lei, Bai Guangdong, Xie Fuman, Chen Kai, Yue Chang
- Improving Faithfulness Of Large Language Models In Summarization Via Sliding Generation And Self-consistency Li Taiji, Li Zhi, Zhang Yin
- Long-context Llms Struggle With Long In-context Learning Li Tianle, Zhang Ge, Do Quy Duc, Yue Xiang, Chen Wenhu
- Citation-enhanced Generation For Llm-based Chatbots Li Weitao, Li Junkai, Ma Weizhi, Liu Yang
- Reinforcement Learning With Token-level Feedback For Controllable Text Generation Li Wendi, Wei Wei, Xu Kaihe, Xie Wenfeng, Chen Dangyang, Cheng Yu
- Tutorly: Turning Programming Videos Into Apprenticeship Learning Environments With Llms Li Wengxi, Pea Roy, Haber Nick, Subramonyam Hari
- Uio-llms: Unbiased Incremental Optimization For Long-context Llms Li Wenhao, Lin Mingbao, Zhong Yunshan, Yan Shuicheng, Ji Rongrong
- Optimizing Psychological Counseling With Instruction-tuned Large Language Models Li Wenjie, Sun Tianyu, Qian Kun, Wang Wenhong
- Llara: Supercharging Robot Learning Data For Vision-language Policy Li Xiang, Mata Cristina, Park Jongwoo, Kahatapitiya Kumara, Jang Yoo Sung, Shang Jinghuan, Ranasinghe Kanchana, Burgert Ryan, Cai Mu, Lee Yong Jae, Ryoo Michael S.
- 52B To 1T: Lessons Learned Via Tele-flm Series Li Xiang, Yao Yiqun, Jiang Xin, Fang Xuezhi, Wang Chao, Liu Xinzhang, Wang Zihan, Zhao Yu, Wang Xin, Huang Yuyao, Song Shuangyong, Li Yongxiang, Zhang Zheng, Zhao Bo, Sun Aixin, Wang Yequan, He Zhongjiang, Wang Zhongyuan, Li Xuelong, Huang Tiejun
- What If We Recaption Billions Of Web Images With Llama-3? Li Xianhang, Tu Haoqin, Hui Mude, Wang Zeyu, Zhao Bingchen, Xiao Junfei, Ren Sucheng, Mei Jieru, Liu Qing, Zheng Huangjie, Zhou Yuyin, Xie Cihang
- Preference Tuning For Toxicity Mitigation Generalizes Across Languages Li Xiaochen, Yong Zheng-xin, Bach Stephen H.
- Evaluating Mathematical Reasoning Of Large Language Models: A Focus On Error Identification And Correction Li Xiaoyuan, Wang Wenjie, Li Moxin, Guo Junrong, Zhang Yang, Feng Fuli
- Personalized Language Modeling From Personalized Human Feedback Li Xinyu, Lipton Zachary C., Leqi Liu
- Reason From Fallacy: Enhancing Large Language Models' Logical Reasoning Through Logical Fallacy Understanding Li Yanda, Wang Dixuan, Liang Jiaqing, Jiang Guochao, He Qianyu, Xiao Yanghua, Yang Deqing
- Mini-gemini: Mining The Potential Of Multi-modality Vision Language Models Li Yanwei, Zhang Yuechen, Wang Chengyao, Zhong Zhisheng, Chen Yixin, Chu Ruihang, Liu Shaoteng, Jia Jiaya
- Q-adapter: Training Your LLM Adapter As A Residual Q-function Li Yi-chen, Zhang Fuxiang, Qiu Wenjie, Yuan Lei, Jia Chengxing, Zhang Zongzhang, Yu Yang
- Backdoorllm: A Comprehensive Benchmark For Backdoor Attacks On Large Language Models Li Yige, Huang Hanxun, Zhao Yunhan, Ma Xingjun, Sun Jun
- Womd-reasoning: A Large-scale Language Dataset For Interaction And Driving Intentions Reasoning Li Yiheng, Ge Chongjian, Li Chenran, Xu Chenfeng, Tomizuka Masayoshi, Tang Chen, Ding Mingyu, Zhan Wei
- ISQA: Informative Factuality Feedback For Scientific Summarization Li Zekai, Qin Yanxia, Liu Qian, Kan Min-yen
- Mmsci: A Multimodal Multi-discipline Dataset For Phd-level Scientific Comprehension Li Zekun, Yang Xianjun, Choi Kyuri, Zhu Wanrong, Hsieh Ryan, Kim Hyeonjung, Lim Jin Hyuk, Ji Sungyoung, Lee Byungju, Yan Xifeng, Petzold Linda Ruth, Wilson Stephen D., Lim Woosang, Wang William Yang
- Text-to-model: Text-conditioned Neural Network Diffusion For Train-once-for-all Personalization Li Zexi, Gao Lingzhi, Wu Chao
- Understanding And Patching Compositional Reasoning In Llms Li Zhaoyi, Jiang Gangwei, Xie Hong, Song Linqi, Lian Defu, Wei Ying
- Wikitableedit: A Benchmark For Table Editing By Natural Language Instruction Li Zheng, Chen Xiang, Wan Xiaojun
- Visualization Literacy Of Multimodal Large Language Models: A Comparative Study Li Zhimin, Miao Haichao, Pascucci Valerio, Liu Shusen
- Self-instructed Derived Prompt Generation Meets In-context Learning: Unlocking New Potential Of Black-box Llms Li Zhuo, Du Yuhao, Hu Jinpeng, Wan Xiang, Gao Anningzhe
- Effects Of Different Prompts On The Quality Of GPT-4 Responses To Dementia Care Questions Li Zhuochun, Xie Bo, Hilsabeck Robin, Aguirre Alyssa, Zou Ning, Luo Zhimeng, He Daqing
- Retrieval Augmented Generation Or Long-context Llms? A Comprehensive Study And Hybrid Approach Li Zhuowan, Li Cheng, Zhang Mingyang, Mei Qiaozhu, Bendersky Michael
- Implicit In-context Learning Li Zhuowei, Xu Zihao, Han Ligong, Gao Yunhe, Wen Song, Liu Di, Wang Hao, Metaxas Dimitris N.
- Two Stacks Are Better Than One: A Comparison Of Language Modeling And Translation As Multilingual Pretraining Objectives Li Zihao, Ji Shaoxiong, Mickus Timothee, Segonne Vincent, Tiedemann Jörg
- Quantifying Multilingual Performance Of Large Language Models Across Languages Li Zihao, Shi Yucheng, Liu Zirui, Yang Fan, Payani Ali, Liu Ninghao, Du Mengnan
- Recai: Leveraging Large Language Models For Next-generation Recommender Systems Lian Jianxun, Lei Yuxuan, Huang Xu, Yao Jing, Xu Wei, Xie Xing
- What Is The Best Model? Application-driven Evaluation For Large Language Models Lian Shiguo, Zhao Kaikai, Liu Xinhui, Lei Xuejiao, Yang Bikun, Zhang Wenjing, Wang Kai, Liu Zhaoxiang
- Online Training Of Large Language Models: Learn While Chatting Liang Juhao, Wang Ziwei, Ma Zhuoheng, Li Jianquan, Zhang Zhiyi, Wu Xiangbo, Wang Benyou
- Fennec: Fine-grained Language Model Evaluation And Correction Extended Through Branching And Bridging Liang Xiaobo, Zhang Haoke, Hu Helan, Li Juntao, Xu Jun, Zhang Min
- Languashrink: Reducing Token Overhead With Psycholinguistics Liang Xuechen, Tao Meiling, Xia Yinghui, Shi Tianyu, Wang Jun, Yang Jingsong
- Controllable Text Generation For Large Language Models: A Survey Liang Xun, Wang Hanyu, Wang Yezhaohui, Song Shichao, Yang Jiawei, Niu Simin, Hu Jie, Liu Dan, Yao Shunyu, Xiong Feiyu, Li Zhiyu
- Learning To Trust Your Feelings: Leveraging Self-awareness In Llms For Hallucination Mitigation Liang Yuxin, Song Zhuoyang, Wang Hao, Zhang Jiaxing
- Mathchat: Benchmarking Mathematical Reasoning And Instruction Following In Multi-turn Interactions Liang Zhenwen, Yu Dian, Yu Wenhao, Yao Wenlin, Zhang Zhihan, Zhang Xiangliang, Yu Dong
- Why Are My Prompts Leaked? Unraveling Prompt Extraction Threats In Customized Large Language Models Liang Zi, Hu Haibo, Ye Qingqing, Xiao Yaxin, Li Haoyang
- 3-in-1: 2D Rotary Adaptation For Efficient Finetuning, Efficient Batching And Composability Liao Baohao, Monz Christof
- Beyond KV Caching: Shared Attention For Efficient Llms Liao Bingli, Vargas Danilo Vasconcellos
- Extending Token Computation For LLM Reasoning Liao Bingli, Vargas Danilo Vasconcellos
- Enhancing Reinforcement Learning With Label-sensitive Reward For Natural Language Understanding Liao Kuo, Li Shuang, Zhao Meng, Liu Liqun, Xue Mengge, Hu Zhenyu, Han Honglin, Yin Chengguo
- MARIO: Math Reasoning With Code Interpreter Output -- A Reproducible Pipeline Liao Minpeng, Luo Wei, Li Chengxi, Wu Jing, Fan Kai
- Towards Automated Data Sciences With Natural Language And Sagecopilot: Practices And Lessons Learned Liao Yuan, Bian Jiang, Yun Yuhui, Wang Shuo, Zhang Yubo, Chu Jiaming, Wang Tao, Li Kewei, Li Yuchen, Li Xuhong, Ji Shilei, Xiong Haoyi
- MING-MOE: Enhancing Medical Multi-task Learning In Large Language Models With Sparse Mixture Of Low-rank Adapter Experts Liao Yusheng, Jiang Shuyang, Wang Yu, Wang Yanfeng
- Align And Aggregate: Compositional Reasoning With Video Alignment And Answer Aggregation For Video Question-answering Liao Zhaohe, Li Jiangtong, Niu Li, Zhang Liqing
- D2LLM: Decomposed And Distilled Large Language Models For Semantic Search Liao Zihan, Yu Hang, Li Jianguo, Wang Jun, Zhang Wei
- Large Language Models As Recommender Systems: A Study Of Popularity Bias Lichtenberg Jan Malte, Buchholz Alexander, Schwöbel Pola
- Wildbench: Benchmarking Llms With Challenging Tasks From Real Users In The Wild Lin Bill Yuchen, Deng Yuntian, Chandu Khyathi, Brahman Faeze, Ravichander Abhilasha, Pyatkin Valentina, Dziri Nouha, Bras Ronan Le, Choi Yejin
- Tree-based Hard Attention With Self-motivation For Large Language Models Lin Chenxi, Ren Jiayu, He Guoxiu, Jiang Zhuoren, Yu Haiyan, Zhu Xiaomin
- Fine-tuned Network Relies On Generic Representation To Solve Unseen Cognitive Task Lin Dongyan
- Graph-enhanced Large Language Models In Asynchronous Plan Reasoning Lin Fangru, La Malfa Emanuele, Hofmann Valentin, Yang Elle Michelle, Cohn Anthony, Pierrehumbert Janet B.
- Mixture Of In-context Experts Enhance Llms' Long Context Awareness Lin Hongzhan, Lv Ang, Chen Yuhan, Zhu Chen, Song Yang, Zhu Hengshu, Yan Rui
- Token-wise Influential Training Data Retrieval For Large Language Models Lin Huawei, Long Jikai, Xu Zhaozhuo, Zhao Weijie
- Large Language Models Make Sample-efficient Recommender Systems Lin Jianghao, Dai Xinyi, Shan Rong, Chen Bo, Tang Ruiming, Yu Yong, Zhang Weinan
- FLAME: Factuality-aware Alignment For Large Language Models Lin Sheng-chieh, Gao Luyu, Oguz Barlas, Xiong Wenhan, Lin Jimmy, Yih Wen-tau, Chen Xilun
- Dogerm: Equipping Reward Models With Domain Knowledge Through Model Merging Lin Tzu-han, Li Chen-an, Lee Hung-yi, Chen Yun-nung
- Improving Visual Storytelling With Multimodal Large Language Models Lin Xiaochuan, Chen Xiangyong
- Boosting Multimodal Large Language Models With Visual Tokens Withdrawal For Rapid Inference Lin Zhihang, Lin Mingbao, Lin Luxi, Ji Rongrong
- Evaluating Text-to-visual Generation With Image-to-text Generation Lin Zhiqiu, Pathak Deepak, Li Baiqi, Li Jiayao, Xia Xide, Neubig Graham, Zhang Pengchuan, Ramanan Deva
- Navigating The Dual Facets: A Comprehensive Evaluation Of Sequential Memory Editing In Large Language Models Lin Zihao, Beigi Mohammad, Li Hongxuan, Zhou Yufan, Zhang Yuxiang, Wang Qifan, Yin Wenpeng, Huang Lifu
- Unveiling Vulnerability Of Self-attention Liong Khai Jiet, Wu Hongqiu, Zhao Hai
- The Colorful Future Of Llms: Evaluating And Improving Llms As Emotional Supporters For Queer Youth Lissak Shir, Calderon Nitay, Shenkman Geva, Ophir Yaakov, Fruchter Eyal, Klomek Anat Brunstein, Reichart Roi
- Compromising Embodied Agents With Contextual Backdoor Attacks Liu Aishan, Zhou Yuguang, Liu Xianglong, Zhang Tianyuan, Liang Siyuan, Wang Jiakai, Pu Yanjun, Li Tianlin, Zhang Junqi, Zhou Wenbo, Guo Qing, Tao Dacheng
- Direct Large Language Model Alignment Through Self-rewarding Contrastive Prompt Distillation Liu Aiwei, Bai Haoping, Lu Zhiyun, Kong Xiang, Wang Simon, Shan Jiulong, Cao Meng, Wen Lijie
- PANDA: Preference Adaptation For Enhancing Domain-specific Abilities Of Llms Liu An, Yang Zonghan, Zhang Zhenhe, Hu Qingyuan, Li Peng, Yan Ming, Zhang Ji, Huang Fei, Liu Yang
- Progressively Selective Label Enhancement For Language Model Alignment Liu Biao, Xu Ning, Geng Xin
- Openeval: Benchmarking Chinese Llms Across Capability, Alignment And Safety Liu Chuang, Yu Linhao, Li Jiaxuan, Jin Renren, Huang Yufei, Shi Ling, Zhang Junhui, Ji Xinmeng, Cui Tingting, Liu Tao, Song Jinwang, Zan Hongying, Li Sun, Xiong Deyi
- Llmembed: Rethinking Lightweight Llm's Genuine Function In Text Classification Liu Chun, Zhang Hongguang, Zhao Kainan, Ju Xinghai, Yang Lin
- Lumina-mgpt: Illuminate Flexible Photorealistic Text-to-image Generation With Multimodal Generative Pretraining Liu Dongyang, Zhao Shitian, Zhuo Le, Lin Weifeng, Qiao Yu, Li Hongsheng, Gao Peng
- Exploring Vulnerabilities And Protections In Large Language Models: A Survey Liu Frank Weizhen, Hu Chenhui
- Advancing Adversarial Suffix Transfer Learning On Aligned Large Language Models Liu Hongfu, Xie Yuxi, Wang Ye, Shieh Michael
- Unraveling The Mechanics Of Learning-based Demonstration Selection For In-context Learning Liu Hui, Wang Wenya, Sun Hao, Tian Chris Xing, Kong Chenqi, Dong Xin, Li Haoliang
- Speculative Decoding Via Early-exiting For Faster LLM Inference With Thompson Sampling Control Mechanism Liu Jiahao, Wang Qifan, Wang Jingang, Cai Xunliang
- E^2-LLM: Efficient And Extreme Length Extension Of Large Language Models Liu Jiaheng, Bai Zhiqi, Zhang Yuanxing, Zhang Chenchen, Zhang Yu, Zhang Ge, Wang Jiakai, Que Haoran, Chen Yukang, Su Wenbo, Ge Tiezheng, Fu Jie, Chen Wenhu, Zheng Bo
- Kangaroo: A Powerful Video-language Model Supporting Long-context Video Input Liu Jiajun, Wang Yibing, Ma Hanghang, Wu Xiaoping, Ma Xiaoqi, Wei Xiaoming, Jiao Jianbin, Wu Enhua, Hu Jie
- VP-LLM: Text-driven 3D Volume Completion With Large Language Models Through Patchification Liu Jianmeng, Liu Yichen, Zhang Yuyao, Meng Zeyuan, Tai Yu-wing, Tang Chi-keung
- Tiny Refinements Elicit Resilience: Toward Efficient Prefix-model Against LLM Red-teaming Liu Jiaxu, Yin Xiangyu, Wu Sihao, Wang Jianhong, Fang Meng, Yi Xinping, Huang Xiaowei
- Iterative Length-regularized Direct Preference Optimization: A Case Study On Improving 7B Language Models To GPT-4 Level Liu Jie, Zhou Zhanhui, Liu Jiaheng, Bu Xingyuan, Yang Chao, Zhong Han-sen, Ouyang Wanli
- On The Universal Truthfulness Hyperplane Inside Llms Liu Junteng, Chen Shiqi, Cheng Yu, He Junxian
- Large Language Model-based Agents For Software Engineering: A Survey Liu Junwei, Wang Kaixin, Chen Yixuan, Peng Xin, Chen Zhenpeng, Zhang Lingming, Lou Yiling
- Educating Llms Like Human Students: Structure-aware Injection Of Domain Knowledge Liu Kai, Chen Ze, Fu Zhihang, Jiang Rongxin, Zhou Fan, Chen Yaowu, Wu Yue, Ye Jieping
- STORE: Streamlining Semantic Tokenization And Generative Recommendation With A Single LLM Liu Qijiong, Zhu Jieming, Fan Lu, Zhao Zhou, Wu Xiao-ming
- Proofread: Fixes All Errors With One Tap Liu Renjie, Zhang Yanxiang, Zhu Yun, Sun Haicheng, Zhang Yuanbo, Huang Michael Xuelin, Cai Shanqing, Meng Lei, Zhai Shumin
- Intactkv: Improving Large Language Model Quantization By Keeping Pivot Tokens Intact Liu Ruikang, Bai Haoli, Lin Haokun, Li Yuening, Gao Han, Xu Zhengzhuo, Hou Lu, Yao Jun, Yuan Chun
- ST-LLM: Large Language Models Are Effective Temporal Learners Liu Ruyang, Li Chen, Tang Haoran, Ge Yixiao, Shan Ying, Li Ge
- RL-GPT: Integrating Reinforcement Learning And Code-as-policy Liu Shaoteng, Yuan Haoqi, Hu Minda, Li Yanwei, Chen Yukang, Liu Shu, Lu Zongqing, Jia Jiaya
- SPINACH: Sparql-based Information Navigation For Challenging Real-world Questions Liu Shicheng, Semnani Sina J., Triedman Harold, Xu Jialiang, Zhao Isaac Dan, Lam Monica S.
- Bi-chainer: Automated Large Language Models Reasoning With Bidirectional Chaining Liu Shuqi, He Bowei, Song Linqi
- The Generation Gap: Exploring Age Bias In The Underlying Value Systems Of Large Language Models Liu Siyang, Maturi Trish, Yi Bowen, Shen Siqi, Mihalcea Rada
- Explicit Inductive Inference Using Large Language Models Liu Tianyang, Li Tianyi, Cheng Liang, Steedman Mark
- Towards Truthful Multilingual Large Language Models: Benchmarking And Alignment Strategies Liu Weihao, Wu Ning, Ding Wenbiao, Liang Shining, Gong Ming, Zhang Dongmei
- Demorank: Selecting Effective Demonstrations For Large Language Models In Ranking Task Liu Wenhan, Zhu Yutao, Dou Zhicheng
- Imposter.ai: Adversarial Attacks With Hidden Intentions Towards Aligned Large Language Models Liu Xiao, Li Liangzhi, Xiang Tong, Ye Fuying, Wei Lu, Li Wangyue, Garcia Noa
- Extensive Self-contrast Enables Feedback-free Language Model Alignment Liu Xiao, Song Xixuan, Dong Yuxiao, Tang Jie
- Farewell To Length Extrapolation, A Training-free Infinite Context With Finite Attention Scope Liu Xiaoran, Guo Qipeng, Song Yuerong, Liu Zhigeng, Lv Kai, Yan Hang, Li Linlin, Liu Qun, Qiu Xipeng
- SHIELD: Evaluation And Defense Strategies For Copyright Compliance In LLM Text Generation Liu Xiaoze, Sun Ting, Xu Tianyang, Wu Feijie, Wang Cunxiang, Wang Xiaoqian, Gao Jing
- Customizing Large Language Model Generation Style Using Parameter-efficient Finetuning Liu Xinyue, Diddee Harshita, Ippolito Daphne
- Prompt Framework For Role-playing: Generation And Evaluation Liu Xun, Ni Zhengwei
- Datasets For Large Language Models: A Comprehensive Survey Liu Yang, Cao Jiahuan, Liu Chongyu, Ding Kai, Jin Lianwen
- Omgeval: An Open Multilingual Generative Evaluation Benchmark For Large Language Models Liu Yang, Xu Meng, Wang Shuo, Yang Liner, Wang Haoyu, Liu Zhenghao, Kong Cunliang, Chen Yun, Liu Yang, Sun Maosong, Yang Erhong
- Era-cot: Improving Chain-of-thought Through Entity Relationship Analysis Liu Yanming, Peng Xinyue, Du Tianyu, Yin Jianwei, Liu Weihao, Zhang Xuhong
- Arondight: Red Teaming Large Vision Language Models With Auto-generated Multi-modal Jailbreak Prompts Liu Yi, Cai Chengjun, Zhang Xiaoli, Yuan Xingliang, Wang Cong
- Chatasu: Evoking Llm's Reflexion To Truly Understand Aspect Sentiment In Dialogues Liu Yiding, Wang Jingjing, Luo Jiamin, Zeng Tao, Zhou Guodong
- Accelerating Inference In Large Language Models With A Unified Layer Skipping Strategy Liu Yijin, Meng Fandong, Zhou Jie
- Aligning With Human Judgement: The Role Of Pairwise Preference In Large Language Model Evaluators Liu Yinhong, Zhou Han, Guo Zhijiang, Shareghi Ehsan, Vulić Ivan, Korhonen Anna, Collier Nigel
- SAFETY-J: Evaluating Safety With Critique Liu Yixiu, Zheng Yuxiang, Xia Shijie, Li Jiajun, Tu Yi, Song Chaoling, Liu Pengfei
- From Summary To Action: Enhancing Large Language Models For Complex Tasks With Open World Apis Liu Yulong, Yuan Yunlong, Wang Chunwei, Han Jianhua, Ma Yongqiang, Zhang Li, Zheng Nanning, Xu Hang
- World Models With Hints Of Large Language Models For Goal Achieving Liu Zeyuan, Huan Ziyu, Wang Xiyao, Lyu Jiafei, Tao Jian, Li Xiu, Huang Furong, Xu Huazhe
- Scaffolding Language Learning Via Multi-modal Tutoring Systems With Pedagogical Instructions Liu Zhengyuan, Yin Stella Xin, Lee Carolyn, Chen Nancy F.
- Mixture Of Insightful Experts (mote): The Synergy Of Thought Chains And Expert Mixtures In Self-alignment Liu Zhili, Gou Yunhao, Chen Kai, Hong Lanqing, Gao Jiahui, Mi Fei, Zhang Yu, Li Zhenguo, Jiang Xin, Liu Qun, Kwok James T.
- Agentlite: A Lightweight Library For Building And Advancing Task-oriented LLM Agent System Liu Zhiwei, Yao Weiran, Zhang Jianguo, Yang Liangwei, Liu Zuxin, Tan Juntao, Choubey Prafulla K., Lan Tian, Wu Jason, Wang Huan, Heinecke Shelby, Xiong Caiming, Savarese Silvio
- Fantastic Semantics And Where To Find Them: Investigating Which Layers Of Generative Llms Reflect Lexical Semantics Liu Zhu, Kong Cunliang, Liu Ying, Sun Maosong
- Protecting Your Llms With Information Bottleneck Liu Zichuan, Wang Zefan, Xu Linjie, Wang Jinyu, Song Lei, Wang Tianchun, Chen Chunlin, Cheng Wei, Bian Jiang
- Towards Aligning Language Models With Textual Feedback Lloret Saüc Abadal, Dhuliawala Shehzaad, Murugesan Keerthiram, Sachan Mrinmaya
- A Closer Look Into Mixture-of-experts In Large Language Models Lo Ka Man, Huang Zeyu, Qiu Zihan, Wang Zili, Fu Jie
- Large Language Models Relearn Removed Concepts Lo Michelle, Cohen Shay B., Barez Fazl
- Large Language Models Know What Makes Exemplary Contexts Long Quanyu, Chen Jianda, Wang Wenya, Pan Sinno Jialin
- Generative Multi-modal Knowledge Retrieval With Large Language Models Long Xinwei, Zeng Jiali, Meng Fandong, Ma Zhiyuan, Zhang Kaiyan, Zhou Bowen, Zhou Jie
- Sparser Is Faster And Less Is More: Efficient Sparse Attention For Long-range Transformers Lou Chao, Jia Zixia, Zheng Zilong, Tu Kewei
- SPO: Multi-dimensional Preference Sequential Alignment With Implicit Reward Modeling Lou Xingzhou, Zhang Junge, Xie Jian, Liu Lifeng, Yan Dong, Huang Kaiqi
- Roleplay-doh: Enabling Domain-experts To Create Llm-simulated Patients Via Eliciting And Adhering To Principles Louie Ryan, Nandi Ananjan, Fang William, Chang Cheng, Brunskill Emma, Yang Diyi
- Answering Real-world Clinical Questions Using Large Language Model Based Systems Low Yen Sia, Jackson Michael L., Hyde Rebecca J., Brown Robert E., Sanghavi Neil M., Baldwin Julian D., Pike C. William, Muralidharan Jananee, Hui Gavin, Alexander Natasha, Hassan Hadeel, Nene Rahul V., Pike Morgan, Pokrzywa Courtney J., Vedak Shivam, Yan Adam Paul, Yao Dong-han, Zipursky Amy R., Dinh Christina, Ballentine Philip, Derieg Dan C., Polony Vladimir, Chawdry Rehan N., Davies Jordan, Hyde Brigham B., Shah Nigam H., Gombar Saurabh
- From GPT-4 To Gemini And Beyond: Assessing The Landscape Of Mllms On Generalizability, Trustworthiness And Causality Through Four Modalities Lu Chaochao, Qian Chen, Zheng Guodong, Fan Hongxing, Gao Hongzhi, Zhang Jie, Shao Jing, Deng Jingyi, Fu Jinlan, Huang Kexin, Li Kunchang, Li Lijun, Wang Limin, Sheng Lu, Chen Meiqi, Zhang Ming, Ren Qibing, Chen Sirui, Gui Tao, Ouyang Wanli, Wang Yali, Teng Yan, Wang Yaru, Wang Yi, He Yinan, Wang Yingchun, Wang Yixu, Zhang Yongting, Qiao Yu, Shen Yujiong, Mou Yurong, Chen Yuxi, Zhang Zaibin, Shi Zhelun, Yin Zhenfei, Wang Zhipin
- Investigating Bias Representations In Llama 2 Chat Via Activation Steering Lu Dawn, Rimsky Nina
- Deepseek-vl: Towards Real-world Vision-language Understanding Lu Haoyu, Liu Wen, Zhang Bo, Wang Bingxuan, Dong Kai, Liu Bo, Sun Jingxiang, Ren Tongzheng, Li Zhuoshu, Yang Hao, Sun Yaofeng, Deng Chengqi, Xu Hanwei, Xie Zhenda, Ruan Chong
- Revisiting Multi-modal LLM Evaluation Lu Jian, Srivastava Shikhar, Chen Junyu, Shrestha Robik, Acharya Manoj, Kafle Kushal, Kanan Christopher
- Toolsandbox: A Stateful, Conversational, Interactive Evaluation Benchmark For LLM Tool Use Capabilities Lu Jiarui, Holleis Thomas, Zhang Yizhe, Aumayer Bernhard, Nan Feng, Bai Felix, Ma Shuang, Ma Shen, Li Mengyu, Yin Guoli, Wang Zirui, Pang Ruoming
- A Bounding Box Is Worth One Token: Interleaving Layout And Text In A Large Language Model For Document Understanding Lu Jinghui, Yu Haiyang, Wang Yanjie, Ye Yongjie, Tang Jingqun, Yang Ziwei, Wu Binghong, Liu Qi, Feng Hao, Wang Han, Liu Hao, Huang Can
- FIPO: Free-form Instruction-oriented Prompt Optimization With Preference Dataset And Modular Fine-tuning Schema Lu Junru, An Siyu, Zhang Min, He Yulan, Yin Di, Sun Xing
- Desta: Enhancing Speech Language Models Through Descriptive Speech-text Alignment Lu Ke-han, Chen Zhehuai, Fu Szu-wei, Huang He, Ginsburg Boris, Wang Yu-chiang Frank, Lee Hung-yi
- Online Merging Optimizers For Boosting Rewards And Mitigating Tax In Alignment Lu Keming, Yu Bowen, Huang Fei, Fan Yang, Lin Runji, Zhou Chang
- Multimodal Large Language Model Driven Scenario Testing For Autonomous Vehicles Lu Qiujing, Wang Xuanhan, Jiang Yiwei, Zhao Guangming, Ma Mingyue, Feng Shuo
- Insights Into LLM Long-context Failures: When Transformers Know But Don't Tell Lu Taiming, Gao Muhan, Yu Kuai, Byerly Adam, Khashabi Daniel
- Benchmarking Chinese Knowledge Rectification In Large Language Models Lu Tianhe, Fang Jizhan, Yao Yunzhi, Xu Xin, Zhang Ningyu, Chen Huajun
- Fine-tuning Large Language Models For Domain Adaptation: Exploration Of Training Strategies, Scaling, Model Merging And Synergistic Capabilities Lu Wei, Luu Rachel K., Buehler Markus J.
- Mental Modeling Of Reinforcement Learning Agents By Language Models Lu Wenhao, Zhao Xufeng, Spisak Josua, Lee Jae Hee, Wermter Stefan
- Generative Students: Using Llm-simulated Student Profiles To Support Question Item Evaluation Lu Xinyi, Wang Xu
- Llamax: Scaling Linguistic Horizons Of LLM By Enhancing Translation Capabilities Beyond 100 Languages Lu Yinquan, Zhu Wenhao, Li Lei, Qiao Yu, Yuan Fei
- Wildvision: Evaluating Vision-language Models In The Wild With Human Preferences Lu Yujie, Jiang Dongfu, Chen Wenhu, Wang William Yang, Choi Yejin, Lin Bill Yuchen
- Prompt Tuning As User Inherent Profile Inference Machine Lu Yusheng, Du Zhaocheng, Li Xiangyang, Zhao Xiangyu, Liu Weiwen, Wang Yichao, Guo Huifeng, Tang Ruiming, Dong Zhenhua, Duan Yongrui
- Metric-aware LLM Inference For Regression And Scoring Lukasik Michal, Narasimhan Harikrishna, Menon Aditya Krishna, Yu Felix, Kumar Sanjiv
- Autom3l: An Automated Multimodal Machine Learning Framework With Large Language Models Luo Daqin, Feng Chengjian, Nong Yuxuan, Shen Yiqing
- Arena Learning: Build Data Flywheel For Llms Post-training Via Simulated Chatbot Arena Luo Haipeng, Sun Qingfeng, Xu Can, Zhao Pu, Lin Qingwei, Lou Jianguang, Chen Shifeng, Tang Yansong, Chen Weizhu
- Decoupled Alignment For Robust Plug-and-play Adaptation Luo Haozheng, Yu Jiahao, Zhang Wenxin, Li Jialong, Hu Jerry Yao-chieh, Xing Xinyu, Liu Han
- Kuaiji: The First Chinese Accounting Large Language Model Luo Jiayuan, Yang Songhua, Qiu Xiaoling, Chen Panyu, Nai Yufei, Zeng Wenxuan, Zhang Wentao, Jiang Xinke
- Hallucination Detection And Hallucination Mitigation: An Investigation Luo Junliang, Li Tianyu, Wu Di, Jenkin Michael, Liu Steve, Dudek Gregory
- BGE Landmark Embedding: A Chunking-free Embedding Method For Retrieval Augmented Long-context Large Language Models Luo Kun, Liu Zheng, Xiao Shitao, Liu Kang
- End-to-end Neuro-symbolic Reinforcement Learning With Textual Explanations Luo Lirui, Zhang Guoxi, Xu Hongming, Yang Yaodong, Fang Cong, Li Qing
- Assessing Empathy In Large Language Models With Real-world Physician-patient Interactions Luo Man, Warren Christopher J., Cheng Lu, Abdul-muhsin Haidar M., Banerjee Imon
- DEEM: Diffusion Models Serve As The Eyes Of Large Language Models For Image Perception Luo Run, Li Yunshui, Chen Longze, He Wanwei, Lin Ting-en, Liu Ziqiang, Zhang Lei, Song Zikai, Xia Xiaobo, Liu Tongliang, Yang Min, Hui Binyuan
- Duetsim: Building User Simulator With Dual Large Language Models For Task-oriented Dialogues Luo Xiang, Tang Zhiwen, Wang Jin, Zhang Xuejie
- Zero-shot Cross-domain Dialogue State Tracking Via Dual Low-rank Adaptation Luo Xiang, Tang Zhiwen, Wang Jin, Zhang Xuejie
- Matching Domain Experts By Training From Scratch On Domain Knowledge Luo Xiaoliang, Sun Guangzhi, Love Bradley C.
- TRAWL: Tensor Reduced And Approximated Weights For Large Language Models Luo Yiran, Patel Het, Fu Yu, Ahn Dawon, Chen Jia, Dong Yue, Papalexakis Evangelos E.
- Are Large Language Models True Healthcare Jacks-of-all-trades? Benchmarking Across Health Professions Beyond Physician Exams Luo Zheheng, Yuan Chenhan, Xie Qianqian, Ananiadou Sophia
- Reft: Reasoning With Reinforced Fine-tuning Luong Trung Quoc, Zhang Xinbo, Jie Zhanming, Sun Peng, Jin Xiaoran, Li Hang
- Source2synth: Synthetic Data Generation And Curation Grounded In Real Data Sources Lupidi Alisia, Gemmell Carlos, Cancedda Nicola, Dwivedi-yu Jane, Weston Jason, Foerster Jakob, Raileanu Roberta, Lomeli Maria
- Coggpt: Unleashing The Power Of Cognitive Dynamics On Large Language Models Lv Yaojia, Pan Haojie, Fu Ruiji, Liu Ming, Wang Zhongyuan, Qin Bing
- Beyond Probabilities: Unveiling The Misalignment In Evaluating Large Language Models Lyu Chenyang, Wu Minghao, Aji Alham Fikri
- X-reflect: Cross-reflection Prompting For Multimodal Recommendation Lyu Hanjia, Rossi Ryan, Chen Xiang, Tanjim Md Mehrab, Petrangeli Stefano, Sarkhel Somdeb, Luo Jiebo
- Weblinx: Real-world Website Navigation With Multi-turn Dialogue Lù Xing Han, Kasner Zdeněk, Reddy Siva
- Enhancing Programming Education With Chatgpt: A Case Study On Student Perceptions And Interactions In A Python Course Ma Boxuan, Chen Li, Konomi Shin'ichi
- MOYU: A Theoretical Study On Massive Over-activation Yielded Uplifts In Llms Ma Chi, Huang Mincong, Wang Chao, Wang Yujie, Yu Lei
- Integrating AI Tutors In A Programming Course Ma Iris, Martins Alberto Krone, Lopes Cristina Videira
- Debate On Graph: A Flexible And Reliable Reasoning Framework For Large Language Models Ma Jie, Gao Zhitao, Chai Qi, Sun Wangchun, Wang Pinghui, Pei Hongbin, Tao Jing, Song Lingyun, Liu Jun, Zhang Chen, Cui Lizhen
- FBI-LLM: Scaling Up Fully Binarized Llms From Scratch Via Autoregressive Distillation Ma Liqun, Sun Mingjie, Shen Zhiqiang
- Are Large Language Models Good Prompt Optimizers? Ma Ruotian, Wang Xiaolei, Zhou Xin, Li Jian, Du Nan, Gui Tao, Zhang Qi, Huang Xuanjing
- Coco-agent: A Comprehensive Cognitive MLLM Agent For Smartphone GUI Automation Ma Xinbei, Zhang Zhuosheng, Zhao Hai
- Mmlongbench-doc: Benchmarking Long-context Document Understanding With Visualizations Ma Yubo, Zang Yuhang, Chen Liangyu, Chen Meiqi, Jiao Yizhu, Li Xinze, Lu Xinyuan, Liu Ziyu, Ma Yan, Dong Xiaoyi, Zhang Pan, Pan Liangming, Jiang Yu-gang, Wang Jiaqi, Cao Yixin, Sun Aixin
- Gerea: Question-aware Prompt Captions For Knowledge-based Visual Question Answering Ma Ziyu, Li Shutao, Sun Bin, Cai Jianfei, Long Zuxiang, Ma Fuyan
- Exploring The Impact Of The Output Format On The Evaluation Of Large Language Models For Code Translation Macedo Marcos, Tian Yuan, Cogo Filipe R., Adams Bram
- LLM In-context Recall Is Prompt Dependent Machlab Daniel, Battle Rick
- Testing The Effect Of Code Documentation On Large Language Model Code Understanding Macke William, Doyle Michael
- Steering Conversational Large Language Models For Long Emotional Support Conversations Madani Navid, Saha Sougata, Srihari Rohini
- Large Language Models As Minecraft Agents Madge Chris, Poesio Massimo
- Do Llms Know When To NOT Answer? Investigating Abstention Abilities Of Large Language Models Madhusudhan Nishanth, Madhusudhan Sathwik Tejaswi, Yadav Vikas, Hashemi Masoud
- Are Self-explanations From Large Language Models Faithful? Madsen Andreas, Chandar Sarath, Reddy Siva
- Too Late To Train, Too Early To Use? A Study On Necessity And Viability Of Low-resource Bengali Llms Mahfuz Tamzeed, Dey Satak Kumar, Naswan Ruwad, Adil Hasnaen, Sayeed Khondker Salman, Shahgir Haz Sameen
- VURF: A General-purpose Reasoning And Self-refinement Framework For Video Understanding Mahmood Ahmad, Vayani Ashmal, Naseer Muzammal, Khan Salman, Khan Fahad Shahbaz
- Rephrasing The Web: A Recipe For Compute And Data-efficient Language Modeling Maini Pratyush, Seto Skyler, Bai He, Grangier David, Zhang Yizhe, Jaitly Navdeep
- Exploring The Capabilities Of Prompted Large Language Models In Educational And Assessment Applications Maity Subhankar, Deroy Aniket, Sarkar Sudeshna
- Fairylandai: Personalized Fairy Tales Utilizing Chatgpt And DALLE-3 Makridis Georgios, Oikonomou Athanasios, Koukos Vasileios
- Matmul Or No Matmul In The Era Of 1-bit Llms Malekar Jinendra, Elbtity Mohammed E., Zand Ramtin
- From Tarzan To Tolkien: Controlling The Language Proficiency Level Of Llms For Content Generation Malik Ali, Mayhew Stephen, Piech Chris, Bicknell Klinton
- Self-consistent Decoding For More Factual Open Responses Malon Christopher, Zhu Xiaodan
- ULLME: A Unified Framework For Large Language Model Embeddings With Generation-augmented Learning Man Hieu, Ngo Nghia Trung, Dernoncourt Franck, Nguyen Thien Huu
- Making Task-oriented Dialogue Datasets More Natural By Synthetically Generating Indirect User Requests Mannekote Amogh, Nam Jinseok, Li Ziming, Gao Jian, Boyer Kristy Elizabeth, Dorr Bonnie J.
- A Data Generation Perspective To The Mechanism Of In-context Learning Mao Haitao, Liu Guangliang, Ma Yao, Wang Rongrong, Johnson Kristen, Tang Jiliang
- Reinforced Prompt Personalization For Recommendation With Large Language Models Mao Wenyu, Wu Jiancan, Chen Weijian, Gao Chongming, Wang Xiang, He Xiangnan
- Don't Forget Your Reward Values: Language Model Alignment Via Value-based Calibration Mao Xin, Li Feng-lin, Xu Huimin, Zhang Wei, Luu Anh Tuan
- Visually Guided Generative Text-layout Pre-training For Document Intelligence Mao Zhiming, Bai Haoli, Hou Lu, Wei Jiansheng, Jiang Xin, Liu Qun, Wong Kam-fai
- Tuning Llms With Contrastive Alignment Instructions For Machine Translation In Unseen, Low-resource Languages Mao Zhuoyuan, Yu Yen
- A Novel Nuanced Conversation Evaluation Framework For Large Language Models In Mental Health Marrapese Alexander, Suleiman Basem, Ullah Imdad, Kim Juno
- The Impact Of Hyperparameters On Large Language Model Inference Performance: An Evaluation Of Vllm And Huggingface Pipelines Martinez Matias
- "Vorbești Românește?" A Recipe To Train Powerful Romanian Llms With English Instructions Masala Mihai, Ilie-ablachim Denis C., Dima Alexandru, Corlatescu Dragos, Zavelca Miruna, Olaru Ovio, Terian Simina, Terian Andrei, Leordeanu Marius, Velicu Horia, Popescu Marius, Dascalu Mihai, Rebedea Traian
- Do Large Language Models Understand Verbal Indicators Of Romantic Attraction? Matz Sandra C., Peters Heinrich, Eastwick Paul W., Cerf Moran, Finkel Eli J.
- Learning To Ask Informative Questions: Enhancing Llms With Preference Optimization And Expected Information Gain Mazzaccara Davide, Testoni Alberto, Bernardi Raffaella
- Improving Text-to-image Consistency Via Automatic Prompt Optimization Mañas Oscar, Astolfi Pietro, Hall Melissa, Ross Candace, Urbanek Jack, Williams Adina, Agrawal Aishwarya, Romero-soriano Adriana, Drozdzal Michal
- LLM Critics Help Catch LLM Bugs Mcaleese Nat, Pokorny Rai Michael, Uribe Juan Felipe Ceron, Nitishinskaya Evgenia, Trebacz Maja, Leike Jan
- Leveraging Prompts In Llms To Overcome Imbalances In Complex Educational Text Data Mcclure Jeanne, Shimmei Machi, Matsuda Noboru, Jiang Shiyan
- Mimetic Poet Mccormack Jon, Wilson Elliott, Rajcic Nina, Llano Maria Teresa
- Inadequacies Of Large Language Model Benchmarks In The Era Of Generative Artificial Intelligence Mcintosh Timothy R., Susnjak Teo, Liu Tong, Watters Paul, Halgamuge Malka N.
- MM1: Methods, Analysis & Insights From Multimodal LLM Pre-training Mckinzie Brandon, Gan Zhe, Fauconnier Jean-philippe, Dodge Sam, Zhang Bowen, Dufter Philipp, Shah Dhruti, Du Xianzhi, Peng Futang, Weers Floris, Belyi Anton, Zhang Haotian, Singh Karanjeet, Kang Doug, Jain Ankur, Hè Hongyu, Schwarzer Max, Gunter Tom, Kong Xiang, Zhang Aonan, Wang Jianyu, Wang Chong, Du Nan, Lei Tao, Wiseman Sam, Yin Guoli, Lee Mark, Wang Zirui, Pang Ruoming, Grasch Peter, Toshev Alexander, Yang Yinfei
- Injecting New Knowledge Into Large Language Models Via Supervised Fine-tuning Mecklenburg Nick, Lin Yiyou, Li Xiaoxiao, Holstein Daniel, Nunes Leonardo, Malvar Sara, Silva Bruno, Chandra Ranveer, Aski Vijay, Yannam Pavan Kumar Reddy, Aktas Tolga, Hendry Todd
- Inherent Challenges Of Post-hoc Membership Inference For Large Language Models Meeus Matthieu, Jain Shubham, Rei Marek, De Montjoye Yves-alexandre
- Openelm: An Efficient Language Model Family With Open Training And Inference Framework Mehta Sachin, Sekhavat Mohammad Hossein, Cao Qingqing, Horton Maxwell, Jin Yanzi, Sun Chenfan, Mirzadeh Iman, Najibi Mahyar, Belenko Dmitry, Zatloukal Peter, Rastegari Mohammad
- Realhf: Optimized RLHF Training For Large Language Models Through Parameter Reallocation Mei Zhiyu, Fu Wei, Li Kaiwei, Wang Guangju, Zhang Huanchen, Wu Yi
- Towards Human Understanding Of Paraphrase Types In Chatgpt Meier Dominik, Wahle Jan Philip, Ruas Terry, Gipp Bela
- Phybench: A Physical Commonsense Benchmark For Evaluating Text-to-image Models Meng Fanqing, Shao Wenqi, Luo Lixin, Wang Yahong, Chen Yiran, Lu Quanfeng, Yang Yue, Yang Tianshuo, Zhang Kaipeng, Qiao Yu, Luo Ping
- Deepstack: Deeply Stacking Visual Tokens Is Surprisingly Simple And Effective For Lmms Meng Lingchen, Yang Jianwei, Tian Rui, Dai Xiyang, Wu Zuxuan, Gao Jianfeng, Jiang Yu-gang
- DCR: Divide-and-conquer Reasoning For Multi-choice Question Answering With Llms Meng Zijie, Zhang Yan, Feng Zhaopeng, Liu Zuozhu
- A Survey Of Resource-efficient LLM And Multimodal Foundation Models Mengwei Xu, Wangsong Yin, Dongqi Cai, Rongjie Yi, Daliang Xu, Qipeng Wang, Bingyang Wu, Yihao Zhao, Chen Yang, Shihe Wang, Qiyang Zhang, Zhenyan Lu, Li Zhang, Shangguang Wang, Yuanchun Li, Yunxin Liu, Xin Jin, Xuanzhe Liu
- Logic-enhanced Language Model Agents For Trustworthy Social Simulations Mensfelt Agnieszka, Stathis Kostas, Trencsenyi Vince
- Transforming Wearable Data Into Health Insights Using Large Language Model Agents Merrill Mike A., Paruchuri Akshay, Rezaei Naghmeh, Kovacs Geza, Perez Javier, Liu Yun, Schenck Erik, Hammerquist Nova, Sunshine Jake, Tailor Shyam, Ayush Kumar, Su Hao-wei, He Qian, Mclean Cory Y., Malhotra Mark, Patel Shwetak, Zhan Jiening, Althoff Tim, Mcduff Daniel, Liu Xin
- Superposition Prompting: Improving And Accelerating Retrieval-augmented Generation Merth Thomas, Fu Qichen, Rastegari Mohammad, Najibi Mahyar
- A Comparison Of LLM Finetuning Methods & Evaluation Metrics With Travel Chatbot Use Case Meyer Sonia, Singh Shreya, Tam Bertha, Ton Christopher, Ren Angel
- An Efficient Inference Framework For Early-exit Large Language Models Miao Ruijie, Yan Yihan, Yao Xinshuo, Yang Tong
- Subject-driven Text-to-image Generation Via Preference-based Reinforcement Learning Miao Yanting, Loh William, Kothawade Suraj, Poupart Pascal, Rashwan Abdullah, Li Yeqing
- Is English The New Programming Language? How About Pseudo-code Engineering? Michaelsen Gian Alexandre, Santos Renato P. Dos
- Smart Language Agents In Real-world Planning Miin Annabelle, Wei Timothy
- Caveat Lector: Large Language Models In Legal Practice Mik Eliza
- Natural Language As Policies: Reasoning For Coordinate-level Embodied Control With Llms Mikami Yusuke, Melnik Andrew, Miura Jun, Hautamäki Ville
- Semantics Of Multiword Expressions In Transformer-based Models: A Survey Miletić Filip, Walde Sabine Schulte Im
- A Philosophical Introduction To Language Models - Part II: The Way Forward Millière Raphaël, Buckner Cameron
- Anthropocentric Bias And The Possibility Of Artificial Cognition Millière Raphaël, Rathkopf Charles
- Evaluating And Analyzing Relationship Hallucinations In Large Vision-language Models Mingrui Wu, Jiayi Ji, Oucheng Huang, Jiale Li, Yuhang Wu, Xiaoshuai Sun, Rongrong Ji
- PSLM: Parallel Generation Of Text And Speech With Llms For Low-latency Spoken Dialogue Systems Mitsui Kentaro, Mitsuda Koh, Wakatsuki Toshiaki, Hono Yukiya, Sawada Kei
- Puzzlebench: Can Llms Solve Challenging First-order Combinatorial Reasoning Problems? Mittal Chinmay, Kartik Krishna, Mausam, Singla Parag
- Multimodal Contrastive In-context Learning Miyanishi Yosuke, Nguyen Minh Le
- Aligning Query Representation With Rewritten Query And Relevance Judgments In Conversational Search Mo Fengran, Qu Chen, Mao Kelong, Wu Yihong, Su Zhan, Huang Kaiyu, Nie Jian-yun
- LSPT: Long-term Spatial Prompt Tuning For Visual Representation Learning Mo Shentong, Wang Yansen, Luo Xufang, Li Dongsheng
- Fight Back Against Jailbreaking Via Prompt Adversarial Tuning Mo Yichuan, Wang Yuji, Wei Zeming, Wang Yisen
- Machine Translation Meta Evaluation Through Translation Accuracy Challenge Sets Moghe Nikita, Fazla Arnisa, Amrhein Chantal, Kocmi Tom, Steedman Mark, Birch Alexandra, Sennrich Rico, Guillou Liane
- E2TP: Element To Tuple Prompting Improves Aspect Sentiment Tuple Prediction Mohammadkhani Mohammad Ghiasvand, Ranjbar Niloofar, Momtazi Saeedeh
- Routoo: Learning To Route To Large Language Models Effectively Mohammadshahi Alireza, Shaikh Arshad Rafiq, Yazdani Majid
- Comparison Of Large Language Models For Generating Contextually Relevant Questions Molina Ivo Lodovico, Švábenský Valdemar, Minematsu Tsubasa, Chen Li, Okubo Fumiya, Shimada Atsushi
- Towards Generating Informative Textual Description For Neurons In Language Models Mondal Shrayani, Garodia Rishabh, Qureshi Arbaaz, Lee Taesung, Park Youngja
- Realm: Reference Resolution As Language Modeling Moniz Joel Ruben Antony, Krishnan Soundarya, Ozyildirim Melis, Saraf Prathamesh, Ates Halim Cagri, Zhang Yuan, Yu Hong
- Repliqa: A Question-answering Dataset For Benchmarking Llms On Unseen Reference Content Monteiro Joao, Noel Pierre-andre, Marcotte Etienne, Rajeswar Sai, Zantedeschi Valentina, Vazquez David, Chapados Nicolas, Pal Christopher, Taslakian Perouz
- Language-conditioned Offline RL For Multi-robot Navigation Morad Steven, Shankar Ajay, Blumenkamp Jan, Prorok Amanda
- Exploring The Landscape Of Large Language Models: Foundations, Techniques, And Challenges Moradi Milad, Yan Ke, Colwell David, Samwald Matthias, Asgari Rhona
- Language Modelling Approaches To Adaptive Machine Translation Moslem Yasmin
- A Logical Fallacy-informed Framework For Argument Generation Mouchel Luca, Paul Debjit, Cui Shaobo, West Robert, Bosselut Antoine, Faltings Boi
- The Realhumaneval: Evaluating Large Language Models' Abilities To Support Programmers Mozannar Hussein, Chen Valerie, Alsobay Mohammed, Das Subhro, Zhao Sebastian, Wei Dennis, Nagireddy Manish, Sattigeri Prasanna, Talwalkar Ameet, Sontag David
- Cross-layer Attention Sharing For Large Language Models Mu Yongyu, Wu Yuzhang, Fan Yuchun, Wang Chenglong, Li Hengyu, He Qiaozhi, Yang Murun, Xiao Tong, Zhu Jingbo
- Unraveling The Truth: Do Llms Really Understand Charts? A Deep Dive Into Consistency And Robustness Mukhopadhyay Srija, Qidwai Adnan, Garimella Aparna, Ramu Pritika, Gupta Vivek, Roth Dan
- Towards Robots That Know When They Need Help: Affordance-based Uncertainty For Large Language Model Planners Mullen James F. Jr., Manocha Dinesh
- Grouse: A Benchmark To Evaluate Evaluators In Grounded Question Answering Muller Sacha, Loison António, Omrani Bilel, Viaud Gautier
- Chatgpt In Classrooms: Transforming Challenges Into Opportunities In Education Munawar Harris Bin, Misirlis Nikolaos
- Compact Language Models Via Pruning And Knowledge Distillation Muralidharan Saurav, Sreenivas Sharath Turuvekere, Joshi Raviraj, Chochowski Marcin, Patwary Mostofa, Shoeybi Mohammad, Catanzaro Bryan, Kautz Jan, Molchanov Pavlo
- Evaluating The Evaluator: Measuring Llms' Adherence To Task Evaluation Instructions Murugadoss Bhuvanashree, Poelitz Christian, Drosos Ian, Le Vu, Mckenna Nick, Negreanu Carina Suzana, Parnin Chris, Sarkar Advait
- Novel-wd: Exploring Acquisition Of Novel World Knowledge In Llms Using Prefix-tuning Méloux Maxime, Cerisara Christophe
- Negation Blindness In Large Language Models: Unveiling The NO Syndrome In Image Generation Nadeem Mohammad, Sohail Shahab Saquib, Cambria Erik, Schuller Björn W., Hussain Amir
- Learning Vs Retrieval: The Role Of In-context Examples In Regression With Llms Nafar Aliakbar, Venable Kristen Brent, Kordjamshidi Parisa
- Zero-shot Visual Reasoning By Vision-language Models: Benchmarking And Analysis Nagar Aishik, Jaiswal Shantanu, Tan Cheston
- Generating Situated Reflection Triggers About Alternative Solution Paths: A Case Study Of Generative AI For Computer-supported Collaborative Learning Naik Atharva, Yin Jessica Ruhan, Kamath Anusha, Ma Qianou, Wu Sherry Tongshuang, Murray Charles, Bogart Christopher, Sakr Majd, Rose Carolyn P.
- Aurora-m: The First Open Source Multilingual Language Model Red-teamed According To The U.S. Executive Order Nakamura Taishi, Mishra Mayank, Tedeschi Simone, Chai Yekun, Stillerman Jason T, Friedrich Felix, Yadav Prateek, Laud Tanmay, Chien Vu Minh, Zhuo Terry Yue, Misra Diganta, Bogin Ben, Vu Xuan-son, Karpinska Marzena, Dantuluri Arnav Varma, Kusa Wojciech, Furlanello Tommaso, Yokota Rio, Muennighoff Niklas, Pai Suhas, Adewumi Tosin, Laippala Veronika, Yao Xiaozhe, Junior Adalberto, Ariyak Alpay, Drozd Aleksandr, Clive Jordan, Gupta Kshitij, Chen Liangyu, Sun Qi, Tsui Ken, Persaud Noah, Fahmy Nour, Chen Tianlong, Bansal Mohit, Monti Nicolo, Dang Tai, Luo Ziyang, Bui Tien-tung, Navigli Roberto, Mehta Virendra, Blumberg Matthew, May Victor, Nguyen Huu, Pyysalo Sampo
- Open Llama2 Model For The Lithuanian Language Nakvosas Artūras, Daniušis Povilas, Mulevičius Vytas
- Apprentices To Research Assistants: Advancing Research With Large Language Models Namvarpour M., Razi A.
- What If Red Can Talk? Dynamic Dialogue Generation Using Large Language Models Nananukul Navapat, Wongkamjan Wichayaporn
- Review-feedback-reason (refer): A Novel Framework For NLG Evaluation And Reasoning Narsupalli Yaswanth, Chandra Abhranil, Muppirala Sreevatsa, Gupta Manish, Goyal Pawan
- Word2world: Generating Stories And Worlds Through Large Language Models Nasir Muhammad U., James Steven, Togelius Julian
- Learning Goal-conditioned Representations For Language Reward Models Nath Vaskar, Slack Dylan, Da Jeff, Ma Yuntao, Zhang Hugh, Whitehead Spencer, Hendryx Sean
- Dynamic Memory Compression: Retrofitting Llms For Accelerated Inference Nawrot Piotr, Łańcucki Adrian, Chochowski Marcin, Tarjan David, Ponti Edoardo M.
- Revolutionizing Undergraduate Learning: Coursegpt And Its Generative AI Advancements Nazar Ahmad M., Selim Mohamed Y., Gaffar Ashraf, Ahmed Shakil
- Needle In The Haystack For Memory Based Large Language Models Nelson Elliot, Kollias Georgios, Das Payel, Chaudhury Subhajit, Dan Soham
- Interpreting Context Look-ups In Transformers: Investigating Attention-mlp Interactions Neo Clement, Cohen Shay B., Barez Fazl
- Towards Modeling Learner Performance With Large Language Models Neshaei Seyed Parsa, Davis Richard Lee, Hazimeh Adam, Lazarevski Bojan, Dillenbourg Pierre, Käser Tanja
- Educational Personalized Learning Path Planning With Large Language Models Ng Chee, Fung Yuen
- How Well Can Llms Echo Us? Evaluating AI Chatbots' Role-play Ability With ECHO Ng Man Tik, Tse Hui Tung, Huang Jen-tse, Li Jingjing, Wang Wenxuan, Lyu Michael R.
- Dude: Dual Distribution-aware Context Prompt Learning For Large Vision-language Model Nguyen Duy M. H., Le An T., Nguyen Trung Q., Diep Nghiem T., Nguyen Tai, Duong-tran Duy, Peters Jan, Shen Li, Niepert Mathias, Sonntag Daniel
- Balancing Exploration And Exploitation In LLM Using Soft RLLF For Enhanced Negation Understanding Nguyen Ha-thanh, Satoh Ken
- Min P Sampling: Balancing Creativity And Coherence At High Temperature Nguyen Minh, Baker Andrew, Kirsch Andreas, Neo Clement
- Advancing Vietnamese Visual Question Answering With Transformer And Convolutional Integration Nguyen Ngoc Son, Nguyen Van Son, Le Tung
- Text-to-drive: Diverse Driving Behavior Synthesis Via Large Language Models Nguyen Phat, Wang Tsun-hsuan, Hong Zhang-wei, Karaman Sertac, Rus Daniela
- CURATRON: Complete Robust Preference Data For Robust Alignment Of Large Language Models Nguyen Son The, Naresh Niranjan Uma, Tulabandhula Theja
- Better Alignment With Instruction Back-and-forth Translation Nguyen Thao, Li Jeffrey, Oh Sewoong, Schmidt Ludwig, Weston Jason, Zettlemoyer Luke, Li Xian
- Cultural Commonsense Knowledge For Intercultural Dialogues Nguyen Tuan-phong, Razniewski Simon, Weikum Gerhard
- Xwin-lm: Strong And Scalable Alignment Practice For Llms Ni Bolin, Hu Jingcheng, Wei Yixuan, Peng Houwen, Zhang Zheng, Meng Gaofeng, Hu Han
- Mixeval: Deriving Wisdom Of The Crowd From LLM Benchmark Mixtures Ni Jinjie, Xue Fuzhao, Yue Xiang, Deng Yuntian, Shah Mahir, Jain Kabir, Neubig Graham, You Yang
- Mozip: A Multilingual Benchmark To Evaluate Large Language Models In Intellectual Property Ni Shiwen, Tan Minghuan, Bai Yuelin, Niu Fuqiang, Yang Min, Zhang Bowen, Xu Ruifeng, Chen Xiaojun, Li Chengming, Hu Xiping, Li Ye, Fan Jianping
- The GPT Surprise: Offering Large Language Model Chat In A Massive Coding Class Reduced Engagement But Increased Adopters Exam Performances Nie Allen, Chandak Yash, Suzara Miroslav, Ali Malika, Woodrow Juliette, Peng Matt, Sahami Mehran, Brunskill Emma, Piech Chris
- Llm-based Conversational AI Therapist For Daily Functioning Screening And Psychotherapeutic Intervention Via Everyday Smart Devices Nie Jingping, Shao Hanya, Fan Yuang, Shao Qijia, You Haoxuan, Preindl Matthias, Jiang Xiaofan
- WTU-EVAL: A Whether-or-not Tool Usage Evaluation Benchmark For Large Language Models Ning Kangyun, Su Yisong, Lv Xueqiang, Zhang Yuanzhe, Liu Jian, Liu Kang, Xu Jinan
- User-llm: Efficient LLM Contextualization With User Embeddings Ning Lin, Liu Luyang, Wu Jiaxing, Wu Neo, Berlowitz Devora, Prakash Sushant, Green Bradley, O'banion Shawn, Xie Jun
- Can Llms Learn By Teaching? A Preliminary Study Ning Xuefei, Wang Zifu, Li Shiyao, Lin Zinan, Yao Peiran, Fu Tianyu, Blaschko Matthew B., Dai Guohao, Yang Huazhong, Wang Yu
- Enhancing Dialogue State Tracking Models Through Llm-backed User-agents Simulation Niu Cheng, Wang Xingguang, Cheng Xuxin, Song Juntong, Zhang Tong
- Mitigating Hallucinations In Large Language Models Via Self-refinement-enhanced Knowledge Retrieval Niu Mengjia, Li Hao, Shi Jie, Haddadi Hamed, Mo Fan
- Ambignlg: Addressing Task Ambiguity In Instruction For NLG Niwa Ayana, Iso Hayate
- Language Portability Strategies For Open-domain Dialogue With Pre-trained Language Models From High To Low Resource Languages Njifenjou Ahmed, Sucal Virgile, Jabaian Bassam, Lefèvre Fabrice
- Mobileflow: A Multimodal LLM For Mobile GUI Agent Nong Songqin, Zhu Jiali, Wu Rui, Jin Jiongchao, Shan Shuo, Huang Xiutian, Xu Wenhao
- Skill Set Optimization: Reinforcing Language Model Behavior Via Transferable Skills Nottingham Kolby, Majumder Bodhisattwa Prasad, Mishra Bhavana Dalvi, Singh Sameer, Clark Peter, Fox Roy
- Generative AI In EU Law: Liability, Privacy, Intellectual Property, And Cybersecurity Novelli Claudio, Casolari Federico, Hacker Philipp, Spedicato Giorgio, Floridi Luciano
- The Conversation Is The Command: Interacting With Real-world Autonomous Robot Through Natural Language Nwankwo Linus, Rueckert Elmar
- Fine-tuning With HED-IT: The Impact Of Human Post-editing For Dialogical Language Models Occhipinti Daniela, Marchi Michele, Mondella Irene, Lai Huiyuan, Dell'orletta Felice, Nissim Malvina, Guerini Marco
- Beyond Metrics: Evaluating Llms' Effectiveness In Culturally Nuanced, Low-resource Real-world Scenarios Ochieng Millicent, Gumma Varun, Sitaram Sunayana, Wang Jindong, Chaudhary Vishrav, Ronen Keshet, Bali Kalika, O'neill Jacki
- Robust Adaptation Of Foundation Models With Black-box Visual Prompting Oh Changdae, Seo Gyeongdeok, Jung Geunyoung, Cheng Zhi-qi, Choi Hosik, Jung Jiyoung, Song Kyungwoo
- Does Alignment Tuning Really Break Llms' Internal Confidence? Oh Hongseok, Hwang Wonseok
- Crafting Efficient Fine-tuning Strategies For Large Language Models Oliver Michael, Wang Guan
- Gpt-ology, Computational Models, Silicon Sampling: How Should We Think About Llms In Cognitive Science? Ong Desmond C.
- Large Language Models Are Self-taught Reasoners: Enhancing LLM Applications Via Tailored Problem-solving Demonstrations Ong Kai Tzu-iunn, Kwon Taeyoon, Yeo Jinyoung
- Jamba: A Hybrid Transformer-mamba Language Model Opher Lieber, Barak Lenz, Hofit Bata, Gal Cohen, Jhonathan Osin, Itay Dalmedigos, Erez Safahi, Shaked Meirom, Yonatan Belinkov, Shai Shalev-shwartz, Omri Abend, Raz Alon, Tomer Asida, Amir Bergman, Roman Glozman, Michael Gokhman, Avashalom Manevich, Nir Ratner, Noam Rozen, Erez Shwartz, Mor Zusman, Yoav Shoham
- Transformers Are Multi-state Rnns Oren Matanel, Hassid Michael, Yarden Nir, Adi Yossi, Schwartz Roy
- A Systematic Review Of Data-to-text NLG Osuji Chinonso Cynthia, Ferreira Thiago Castro, Davis Brian
- Large Language Models As Planning Domain Generators Oswald James, Srinivas Kavitha, Kokel Harsha, Lee Junkyu, Katz Michael, Sohrabi Shirin
- The Accuracy Of Domain Specific And Descriptive Analysis Generated By Large Language Models Otieno Denish Omondi, Abri Faranak, Siami-namini Sima, Namin Akbar Siami
- Multilingual Prompts In Llm-based Recommenders: Performance Across Languages Ozsoy Makbule Gulcin
- Unraveling The Dilemma Of AI Errors: Exploring The Effectiveness Of Human And Machine Explanations For Large Language Models Pafla Marvin, Larson Kate, Hancock Mark
- Hierarchical Continual Reinforcement Learning Via Large Language Model Pan Chaofan, Yang Xin, Wang Hao, Wei Wei, Li Tianrui
- Not All Contexts Are Equal: Teaching Llms Credibility-aware Generation Pan Ruotong, Cao Boxi, Lin Hongyu, Han Xianpei, Zheng Jia, Wang Sirui, Cai Xunliang, Sun Le
- POMP: Probability-driven Meta-graph Prompter For Llms In Low-resource Unsupervised Neural Machine Translation Pan Shilong, Tian Zhiliang, Ding Liang, Huang Zhen, Wen Zhihua, Li Dongsheng
- Assessing AI Detectors In Identifying Ai-generated Code: Implications For Education Pan Wei Hung, Chok Ming Jie, Wong Jonathan Leong Shan, Shin Yung Xin, Poon Yeong Shian, Yang Zhou, Chong Chun Yong, Lo David, Lim Mei Kuan
- Ee-tuning: An Economical Yet Scalable Solution For Tuning Early-exit Large Language Models Pan Xuchen, Chen Yanxi, Li Yaliang, Ding Bolin, Zhou Jingren
- Enhancing Repository-level Code Generation With Integrated Contextual Information Pan Zhiyuan, Hu Xing, Xia Xin, Yang Xiaohu
- Lottery Ticket Adaptation: Mitigating Destructive Interference In Llms Panda Ashwinee, Isik Berivan, Qi Xiangyu, Koyejo Sanmi, Weissman Tsachy, Mittal Prateek
- Knowledgeable Agents By Offline Reinforcement Learning From Large Language Model Rollouts Pang Jing-cheng, Yang Si-hang, Li Kaiyuan, Zhang Jiaji, Chen Xiong-hui, Tang Nan, Yu Yang
- Attndreambooth: Towards Text-aligned Personalized Text-to-image Generation Pang Lianyu, Yin Jian, Zhao Baoquan, Wu Feize, Wang Fu Lee, Li Qing, Mao Xudong
- Phased Instruction Fine-tuning For Large Language Models Pang Wei, Zhou Chuan, Zhou Xiao-hua, Wang Xiaojie
- LLM Evaluators Recognize And Favor Their Own Generations Panickssery Arjun, Bowman Samuel R., Feng Shi
- Measuring Memorization In RLHF For Code Completion Pappu Aneesh, Porter Billy, Shumailov Ilia, Hayes Jamie
- Do Vision & Language Decoders Use Images And Text Equally? How Self-consistent Are Their Explanations? Parcalabescu Letitia, Frank Anette
- Building Pre-train LLM Dataset For The INDIC Languages: A Case Study On Hindi Parida Shantipriya, Panwar Shakshi, Lata Kusum, Mishra Sanskruti, Sekhar Sambit
- Picturing Ambiguity: A Visual Twist On The Winograd Schema Challenge Park Brendan, Janecek Madeline, Ezzati-jivan Naser, Li Yifeng, Emami Ali
- Open Ko-llm Leaderboard: Evaluating Large Language Models In Korean With Ko-h5 Benchmark Park Chanjun, Kim Hyeonwoo, Kim Dahyun, Cho Seonghwan, Kim Sanghoon, Lee Sukyung, Kim Yungi, Lee Hwalsuk
- Low-resource Cross-lingual Summarization Through Few-shot Learning With Large Language Models Park Gyutae, Hwang Seojin, Lee Hwanhee
- Llms For Enhanced Agricultural Meteorological Recommendations Park Ji-jun, Choi Soo-joon
- M4CXR: Exploring Multi-task Potentials Of Multi-modal Large Language Models For Chest X-ray Interpretation Park Jonggwon, Kim Soobum, Yoon Byungmu, Hyun Jihun, Choi Kyoyun
- Building Trust In Mental Health Chatbots: Safety Metrics And Llm-based Evaluation Tools Park Jung In, Abbasian Mahyar, Azimi Iman, Bounds Dawn, Jun Angela, Han Jaesu, Mccarron Robert, Borelli Jessica, Li Jia, Mahmoudi Mona, Wiedenhoeft Carmen, Rahmani Amir
- Bridging Vision And Language Spaces With Assignment Prediction Park Jungin, Lee Jiyoung, Sohn Kwanghoon
- Token-picker: Accelerating Attention In Text Generation With Minimized Memory Transfer Via Probability Estimation Park Junyoung, Kang Myeonggu, Han Yunki, Kim Yanggon, Shin Jaekang, Kim Lee-sup
- Disentangling Length From Quality In Direct Preference Optimization Park Ryan, Rafailov Rafael, Ermon Stefano, Finn Chelsea
- Any-precision LLM: Low-cost Deployment Of Multiple, Different-sized Llms Park Yeonhong, Hyun Jake, Cho Sanglyul, Sim Bonggeun, Lee Jae W.
- Can Separators Improve Chain-of-thought Prompting? Park Yoonjeong, Kim Hyunjin, Choi Chanyeol, Kim Junseong, Sohn Jy-yong
- Logicbench: Towards Systematic Evaluation Of Logical Reasoning Ability Of Large Language Models Parmar Mihir, Patel Nisarg, Varshney Neeraj, Nakamura Mutsumi, Luo Man, Mashetty Santosh, Mitra Arindam, Baral Chitta
- The Ultimate Guide To Fine-tuning Llms From Basics To Breakthroughs: An Exhaustive Review Of Technologies, Research, Best Practices, Applied Research Challenges And Opportunities Parthasarathy Venkatesh Balavadhani, Zafar Ahtsham, Khan Aafaq, Shahid Arsalan
- Decoding News Narratives: A Critical Analysis Of Large Language Models In Framing Detection Pastorino Valeria, Sivakumar Jasivan A., Moosavi Nafise Sadat
- Goex: Perspectives And Designs Towards A Runtime For Autonomous LLM Applications Patil Shishir G., Zhang Tianjun, Fang Vivian, C. Noppapon, Huang Roy, Hao Aaron, Casado Martin, Gonzalez Joseph E., Popa Raluca Ada, Stoica Ion
- Making Reasoning Matter: Measuring And Improving Faithfulness Of Chain-of-thought Reasoning Paul Debjit, West Robert, Bosselut Antoine, Faltings Boi
- Measuring And Benchmarking Large Language Models' Capabilities To Generate Persuasive Language Pauli Amalie Brogaard, Augenstein Isabelle, Assent Ira
- Generative AI As A Metacognitive Agent: A Comparative Mixed-method Study With Human Participants On Icf-mimicking Exam Performance Pavlovic Jelena, Krstic Jugoslav, Mitrovic Luka, Babic Djordje, Milosavljevic Adrijana, Nikolic Milena, Karaklic Tijana, Mitrovic Tijana
- The Effectiveness Of Llms As Annotators: A Comparative Overview And Empirical Analysis Of Direct Representation Pavlovic Maja, Poesio Massimo
- Autonomous Workflow For Multimodal Fine-grained Training Assistants Towards Mixed Reality Pei Jiahuan, Viola Irene, Huang Haochen, Wang Junxiao, Ahsan Moonisa, Ye Fanghua, Yiming Jiang, Sai Yao, Wang Di, Chen Zhumin, Ren Pengjie, Cesar Pablo
- Automated Multi-language To English Machine Translation Using Generative Pre-trained Transformers Pelofske Elijah, Urias Vincent, Liebrock Lorie M.
- Humaneval-xl: A Multilingual Code Generation Benchmark For Cross-lingual Natural Language Generalization Peng Qiwei, Chai Yekun, Li Xuhong
- Review-llm: Harnessing Large Language Models For Personalized Review Generation Peng Qiyao, Liu Hongtao, Xu Hongyan, Yang Qing, Shao Minglai, Wang Wenjun
- AI Revolution On Chat Bot: Evidence From A Randomized Controlled Experiment Peng Sida, Swiatek Wojciech, Gao Allen, Cullivan Paul, Chang Haoge
- Check-eval: A Checklist-based Approach For Evaluating Text Quality Pereira Jayr, Assumpcao Andre, Lotufo Roberto
- Did Translation Models Get More Robust Without Anyone Even Noticing? Peters Ben, Martins André F. T.
- Optimizing Large Language Models For Openapi Code Completion Petryshyn Bohdan, Lukoševičius Mantas
- Suri: Multi-constraint Instruction Following For Long-form Text Generation Pham Chau Minh, Sun Simeng, Iyyer Mohit
- Can Base Chatgpt Be Used For Forecasting Without Additional Optimization? Pham Van, Cunningham Scott
- Investigating The Effectiveness Of Hypertuning Via Gisting Phang Jason
- Coherent Zero-shot Visual Instruction Generation Phung Quynh, Ge Songwei, Huang Jia-bin
- Revisiting Verilogeval: Newer Llms, In-context Learning, And Specification-to-rtl Tasks Pinckney Nathaniel, Batten Christopher, Liu Mingjie, Ren Haoxing, Khailany Brucek
- Improving Context-aware Preference Modeling For Language Models Pitis Silviu, Xiao Ziang, Roux Nicolas Le, Sordoni Alessandro
- Reasoning With Large Language Models, A Survey Plaat Aske, Wong Annie, Verberne Suzan, Broekens Joost, Van Stein Niki, Back Thomas
- V-RECS, A Low-cost LLM4VIS Recommender With Explanations, Captioning And Suggestions Podo Luca, Angelini Marco, Velardi Paola
- Vi(e)va LLM! A Conceptual Stack For Evaluating And Interpreting Generative Ai-based Visualizations Podo Luca, Ishmal Muhammad, Angelini Marco
- Advanced Natural-based Interaction For The Italian Language: Llamantino-3-anita Polignano Marco, Basile Pierpaolo, Semeraro Giovanni
- Evaluation Of Language Models In The Medical Context Under Resource-constrained Settings Posada Andrea, Rueckert Daniel, Meissen Felix, Müller Philip
- Retrieval Augmented Thought Process For Private Data Handling In Healthcare Pouplin Thomas, Sun Hao, Holt Samuel, Van Der Schaar Mihaela
- The Silent Curriculum: How Does LLM Monoculture Shape Educational Content And Its Accessibility? Priyanshu Aman, Vijay Supriti
- Helpful Or Harmful? Exploring The Efficacy Of Large Language Models For Online Grooming Prevention Prosser Ellie, Edwards Matthew
- Code Prompting Elicits Conditional Reasoning Abilities In Text+code Llms Puerto Haritz, Tutek Martin, Aditya Somak, Zhu Xiaodan, Gurevych Iryna
- A Moral Imperative: The Need For Continual Superalignment Of Large Language Models Puthumanaillam Gokul, Vora Manav, Thangeda Pranay, Ornik Melkior
- Model Internals-based Answer Attribution For Trustworthy Retrieval-augmented Generation Qi Jirui, Sarti Gabriele, Fernández Raquel, Bisazza Arianna
- Is Next Token Prediction Sufficient For GPT? Exploration On Code Logic Comprehension Qi Mengnan, Huang Yufan, Yao Yongqiang, Wang Maoquan, Gu Bin, Sundaresan Neel
- Safety Alignment Should Be Made More Than Just A Few Tokens Deep Qi Xiangyu, Panda Ashwinee, Lyu Kaifeng, Ma Xiao, Roy Subhrajit, Beirami Ahmad, Mittal Prateek, Henderson Peter
- ADELIE: Aligning Large Language Models On Information Extraction Qi Yunjia, Peng Hao, Wang Xiaozhi, Xu Bin, Hou Lei, Li Juanzi
- Mutual Reasoning Makes Smaller Llms Stronger Problem-solvers Qi Zhenting, Ma Mingyuan, Xu Jiahang, Zhang Li Lyna, Yang Fan, Yang Mao
- Data Augmentation Integrating Dialogue Flow And Style To Adapt Spoken Dialogue Systems To Low-resource User Groups Qi Zhiyang, Inaba Michimasa
- Enhancing Dialogue Generation In Werewolf Game Through Situation Analysis And Persuasion Strategies Qi Zhiyang, Inaba Michimasa
- Towards Tracing Trustworthiness Dynamics: Revisiting Pre-training Period Of Large Language Models Qian Chen, Zhang Jie, Yao Wei, Liu Dongrui, Yin Zhenfei, Qiao Yu, Liu Yong, Shao Jing
- Understanding The Dataset Practitioners Behind Large Language Model Development Qian Crystal, Reif Emily, Kahng Minsuk
- Momentor: Advancing Video Large Language Model With Fine-grained Temporal Reasoning Qian Long, Li Juncheng, Wu Yu, Ye Yaobo, Fei Hao, Chua Tat-seng, Zhuang Yueting, Tang Siliang
- How Easy Is It To Fool Your Multimodal Llms? An Empirical Analysis On Deceptive Prompts Qian Yusu, Zhang Haotian, Yang Yinfei, Gan Zhe
- Agent Planning With World Knowledge Model Qiao Shuofei, Fang Runnan, Zhang Ningyu, Zhu Yuqi, Chen Xiang, Deng Shumin, Jiang Yong, Xie Pengjun, Huang Fei, Chen Huajun
- LLM4SBR: A Lightweight And Effective Framework For Integrating Large Language Models In Session-based Recommendation Qiao Shutong, Gao Chen, Wen Junhao, Zhou Wei, Luo Qun, Chen Peixuan, Li Yong
- Supportiveness-based Knowledge Rewriting For Retrieval-augmented Language Modeling Qiao Zile, Ye Wei, Jiang Yong, Mo Tong, Xie Pengjun, Li Weiping, Huang Fei, Zhang Shikun
- Diffusiongpt: Llm-driven Text-to-image Generation System Qin Jie, Wu Jie, Chen Weifeng, Ren Yuxi, Li Huixia, Wu Hefeng, Xiao Xuefeng, Wang Rui, Wen Shilei
- The Uniqueness Of Llama3-70b With Per-channel Quantization: An Empirical Study Qin Minghai
- Enhancing Sequential Recommendations Through Multi-perspective Reflections And Iteration Qin Weicong, Xu Yi, Yu Weijie, Shen Chenglei, Zhang Xiao, He Ming, Fan Jianping, Xu Jun
- Infobench: Evaluating Instruction Following Ability In Large Language Models Qin Yiwei, Song Kaiqiang, Hu Yebowen, Yao Wenlin, Cho Sangwoo, Wang Xiaoyang, Wu Xuansheng, Liu Fei, Liu Pengfei, Yu Dong
- The Synergy Between Data And Multi-modal Large Language Models: A Survey From Co-development Perspective Qin Zhen, Chen Daoyuan, Zhang Wenhao, Yao Liuyi, Huang Yilun, Ding Bolin, Li Yaliang, Deng Shuiguang
- Various Lengths, Constant Speed: Efficient Language Modeling With Lightning Attention Qin Zhen, Sun Weigao, Li Dong, Shen Xuyang, Sun Weixuan, Zhong Yiran
- LAMPO: Large Language Models As Preference Machines For Few-shot Ordinal Classification Qin Zhen, Wu Junru, Shen Jiaming, Liu Tianqi, Wang Xuanhui
- Efficient Interactive LLM Serving With Proxy Model-based Sequence Length Prediction Qiu Haoran, Mao Weichao, Patke Archit, Cui Shengkun, Jha Saurabh, Wang Chen, Franke Hubertus, Kalbarczyk Zbigniew T., Başar Tamer, Iyer Ravishankar K.
- Tool Learning With Large Language Models: A Survey Qu Changle, Dai Sunhao, Wei Xiaochi, Cai Hengyi, Wang Shuaiqiang, Yin Dawei, Xu Jun, Wen Ji-rong
- DEEP-ICL: Definition-enriched Experts For Language Model In-context Learning Qu Xingwei, Liang Yiming, Wang Yucheng, Zheng Tianyu, Yue Tommy, Ma Lei, Huang Stephen W., Zhang Jiajun, Shi Yinan, Lin Chenghua, Fu Jie, Zhang Ge
- The Frontier Of Data Erasure: Machine Unlearning For Large Language Models Qu Youyang, Ding Ming, Sun Nan, Thilakarathna Kanchana, Zhu Tianqing, Niyato Dusit
- Coursegpt-zh: An Educational Large Language Model Based On Knowledge Distillation Incorporating Prompt Optimization Qu Zheyan, Yin Lu, Yu Zitong, Wang Wenbo, Zhang Xing
- Enhancing Ethical Explanations Of Large Language Models Through Iterative Symbolic Refinement Quan Xin, Valentino Marco, Dennis Louise A., Freitas André
- Verification And Refinement Of Natural Language Explanations Through Llm-symbolic Theorem Proving Quan Xin, Valentino Marco, Dennis Louise A., Freitas André
- Verifiably Following Complex Robot Instructions With Foundation Models Quartey Benedict, Rosen Eric, Tellex Stefanie, Konidaris George
- An Empirical Study On Self-correcting Large Language Models For Data Science Code Generation Quoc Thai Tang, Minh Duc Ha, Thanh Tho Quan, Nguyen-duc Anh
- Memory-augmented Generative Adversarial Transformers Raaijmakers Stephan, Bakker Roos, Cremers Anita, De Kleijn Roy, Kouwenhoven Tom, Verhoef Tessa
- Llms Among Us: Generative AI Participating In Digital Discourse Radivojevic Kristina, Clark Nicholas, Brenner Paul
- Towards Generalizable Agents In Text-based Educational Environments: A Study Of Integrating RL With Llms Radmehr Bahar, Singla Adish, Käser Tanja
- V-zen: Efficient GUI Understanding And Precise Grounding With A Novel Multimodal LLM Rahman Abdur, Chawla Rajat, Kumar Muskaan, Datta Arkajit, Jha Adarsh, Ns Mukunda, Bhola Ishaan
- GSR-BENCH: A Benchmark For Grounded Spatial Reasoning Evaluation Via Multimodal Llms Rajabi Navid, Kosecka Jana
- Knowledge-based Consistency Testing Of Large Language Models Rajan Sai Sathiesh, Soremekun Ezekiel, Chattopadhyay Sudipta
- Constructing Domain-specific Evaluation Sets For Llm-as-a-judge Raju Ravi, Jain Swayambhoo, Li Bo, Li Jonathan, Thakker Urmish
- ANCHOR: Llm-driven News Subject Conditioning For Text-to-image Synthesis Ramakrishnan Aashish Anantha, Huang Sharon X., Lee Dongwon
- Cataractbot: An Llm-powered Expert-in-the-loop Chatbot For Cataract Patients Ramjee Pragnya, Sachdeva Bhuvan, Golechha Satvik, Kulkarni Shreyas, Fulari Geeta, Murali Kaushik, Jain Mohit
- Preference Distillation For Personalized Generative Recommendation Ramos Jerome, Wu Bin, Lipani Aldo
- Analyzing LLM Behavior In Dialogue Summarization: Unveiling Circumstantial Hallucination Trends Ramprasad Sanjana, Ferracane Elisa, Lipton Zachary C.
- WARP: On The Benefits Of Weight Averaged Rewarded Policies Ramé Alexandre, Ferret Johan, Vieillard Nino, Dadashi Robert, Hussenot Léonard, Cedoz Pierre-louis, Sessa Pier Giuseppe, Girgin Sertan, Douillard Arthur, Bachem Olivier
- Capturing Minds, Not Just Words: Enhancing Role-playing Language Models With Personality-indicative Data Ran Yiting, Wang Xintao, Xu Rui, Yuan Xinfeng, Liang Jiaqing, Xiao Yanghua, Yang Deqing
- Self-refine Instruction-tuning For Aligning Reasoning In Language Models Ranaldi Leonardo, Freitas André
- Animate, Or Inanimate, That Is The Question For Large Language Models Ranaldi Leonardo, Pucci Giulia, Zanzotto Fabio Massimo
- Learning To Localize Objects Improves Spatial Reasoning In Visual-llms Ranasinghe Kanchana, Shukla Satya Narayan, Poursaeed Omid, Ryoo Michael S., Lin Tsung-yu
- The Battle Of Llms: A Comparative Study In Conversational QA Tasks Rangapur Aryan, Rangapur Aman
- SPARQL Generation: An Analysis On Fine-tuning Openllama For Question Answering Over A Life Science Knowledge Graph Rangel Julio C., De Farias Tarcisio Mendes, Sima Ana Claudia, Kobayashi Norio
- Visual Hallucination: Definition, Quantification, And Prescriptive Remediations Rani Anku, Rawte Vipula, Sharma Harshad, Anand Neeraj, Rajbangshi Krishnav, Sheth Amit, Das Amitava
- RAVEN: Multitask Retrieval Augmented Vision-language Learning Rao Varun Nagaraj, Choudhary Siddharth, Deshpande Aditya, Satzoda Ravi Kumar, Appalaraju Srikar
- A Critical Look At Tokenwise Reward-guided Text Generation Rashid Ahmad, Wu Ruotian, Grosse Julia, Kristiadi Agustinus, Poupart Pascal
- SSP: Self-supervised Prompting For Cross-lingual Transfer To Low-resource Languages Using Large Language Models Rathore Vipul, Deb Aniruddha, Chandresh Ankish, Singla Parag, Mausam
- Context Embeddings For Efficient Answer Generation In RAG Rau David, Wang Shuai, Déjean Hervé, Clinchant Stéphane
- Small But Funny: A Feedback-driven Approach To Humor Distillation Ravi Sahithya, Huber Patrick, Shrivastava Akshat, Sagar Aditya, Aly Ahmed, Shwartz Vered, Einolghozati Arash
- Lynx: An Open Source Hallucination Evaluation Model Ravi Selvan Sunitha, Mielczarek Bartosz, Kannappan Anand, Kiela Douwe, Qian Rebecca
- FACTOID: Factual Entailment For Hallucination Detection Rawte Vipula, Tonmoy S. M Towhidul Islam, Rajbangshi Krishnav, Nag Shravani, Chadha Aman, Sheth Amit P., Das Amitava
- FAIR Enough: How Can We Develop And Assess A Fair-compliant Dataset For Large Language Models' Training? Raza Shaina, Ghuge Shardul, Ding Chen, Dolatabadi Elham, Pandya Deval
- Beads: Bias Evaluation Across Domains Raza Shaina, Rahman Mizanur, Zhang Michael R.
- FIRST: Faster Improved Listwise Reranking With Single Token Decoding Reddy Revanth Gangi, Doo Jaehyeok, Xu Yifei, Sultan Md Arafat, Swain Deevya, Sil Avirup, Ji Heng
- Codeattack: Revealing Safety Generalization Challenges Of Large Language Models Via Code Completion Ren Qibing, Gao Chang, Shao Jing, Yan Junchi, Tan Xin, Lam Wai, Ma Lizhuang
- On The Efficacy Of Eviction Policy For Key-value Constrained Generative Language Model Inference Ren Siyu, Zhu Kenny Q.
- Analyzing And Reducing Catastrophic Forgetting In Parameter Efficient Tuning Ren Weijieying, Li Xinlong, Wang Lei, Zhao Tianxiang, Qin Wei
- Are Electra's Sentence Embeddings Beyond Repair? The Case Of Semantic Textual Similarity Rep Ivan, Dukić David, Šnajder Jan
- EXAONE 3.0 7.8B Instruction Tuned Language Model Research Lg Ai, An Soyoung, Bae Kyunghoon, Choi Eunbi, Choi Stanley Jungkyu, Choi Yemuk, Hong Seokhee, Hong Yeonjung, Hwang Junwon, Jeon Hyojin, Jo Gerrard Jeongwon, Jo Hyunjik, Jung Jiyeon, Jung Yountae, Kim Euisoon, Kim Hyosang, Kim Joonkee, Kim Seonghwan, Kim Soyeon, Kim Sunkyoung, Kim Yireun, Kim Youchul, Lee Edward Hwayoung, Lee Haeju, Lee Honglak, Lee Jinsik, Lee Kyungmin, Lee Moontae, Lee Seungjun, Lim Woohyung, Park Sangha, Park Sooyoun, Park Yongmin, Seo Boseong, Yang Sihoon, Yeen Heuiyeen, Yoo Kyungjae, Yun Hyeongu
- Large Language Models Are Biased Because They Are Large Language Models Resnik Philip
- Quantifying Contamination In Evaluating Code Generation Capabilities Of Language Models Riddell Martin, Ni Ansong, Cohan Arman
- Countering Reward Over-optimization In LLM With Demonstration-guided Reinforcement Learning Rita Mathieu, Strub Florian, Chaabouni Rahma, Michel Paul, Dupoux Emmanuel, Pietquin Olivier
- Pretrained Hybrids With MAD Skills Roberts Nicholas, Guo Samuel, Gao Zhiqi, Gnvv Satya Sai Srinath Namburi, Cromp Sonia, Wu Chengjun, Duan Chengyu, Sala Frederic
- Exploring Advanced Large Language Models With Llmsuite Roffo Giorgio
- Talking To Machines: Do You Read Me? Rojas-barahona Lina M.
- Zero-shot Spam Email Classification Using Pre-trained Large Language Models Rojas-galeano Sergio
- Do GPT Language Models Suffer From Split Personality Disorder? The Advent Of Substrate-free Psychometrics Romero Peter, Fitz Stephen, Nakatsuma Teruo
- CLAPNQ: Cohesive Long-form Answers From Passages In Natural Questions For RAG Systems Rosenthal Sara, Sil Avirup, Florian Radu, Roukos Salim
- An Early Categorization Of Prompt Injection Attacks On Large Language Models Rossi Sippo, Michel Alisia Marianne, Mukkamala Raghava Rao, Thatcher Jason Bennett
- Generative AI For Enhancing Active Learning In Education: A Comparative Study Of GPT-3.5 And GPT-4 In Crafting Customized Test Questions Rouzegar Hamdireza, Makrehchi Masoud
- Enhancing Low-resource NMT With A Multilingual Encoder And Knowledge Distillation: A Case Study Roy Aniruddha, Ray Pretam, Maheshwari Ayush, Sarkar Sudeshna, Goyal Pawan
- Do Llms Have Consistent Values? Rozen Naama, Elidan Gal, Globerson Amir, Daniel Ella
- In-context Learning Demonstration Selection Via Influence Analysis S. Vinay M., Van Minh-hao, Wu Xintao
- Capabilities Of Gemini Models In Medicine Saab Khaled, Tu Tao, Weng Wei-hung, Tanno Ryutaro, Stutz David, Wulczyn Ellery, Zhang Fan, Strother Tim, Park Chunjong, Vedadi Elahe, Chaves Juanma Zambrano, Hu Szu-yeu, Schaekermann Mike, Kamath Aishwarya, Cheng Yong, Barrett David G. T., Cheung Cathy, Mustafa Basil, Palepu Anil, Mcduff Daniel, Hou Le, Golany Tomer, Liu Luyang, Alayrac Jean-baptiste, Houlsby Neil, Tomasev Nenad, Freyberg Jan, Lau Charles, Kemp Jonas, Lai Jeremy, Azizi Shekoofeh, Kanada Kimberly, Man Siwai, Kulkarni Kavita, Sun Ruoxi, Shakeri Siamak, He Luheng, Caine Ben, Webson Albert, Latysheva Natasha, Johnson Melvin, Mansfield Philip, Lu Jian, Rivlin Ehud, Anderson Jesper, Green Bradley, Wong Renee, Krause Jonathan, Shlens Jonathon, Dominowska Ewa, Eslami S. M. Ali, Chou Katherine, Cui Claire, Vinyals Oriol, Kavukcuoglu Koray, Manyika James, Dean Jeff, Hassabis Demis, Matias Yossi, Webster Dale, Barral Joelle, Corrado Greg, Semturs Christopher, Mahdavi S. Sara, Gottweis Juraj, Karthikesalingam Alan, Natarajan Vivek
- Benchmarking And Building Long-context Retrieval Models With Loco And M2-BERT Saad-falcon Jon, Fu Daniel Y., Arora Simran, Guha Neel, Ré Christopher
- Llms' Understanding Of Natural Language Revealed Saba Walid S.
- Mitigating Hallucination In Fictional Character Role-play Sadeq Nafis, Xie Zhouhang, Kang Byungkyu, Lamba Prarit, Gao Xiang, Mcauley Julian
- Insights Into Alignment: Evaluating DPO And Its Variants Across Multiple Tasks Saeidi Amir, Verma Shivanshu, Baral Chitta
- A Systematic Survey Of Prompt Engineering In Large Language Models: Techniques And Applications Sahoo Pranab, Singh Ayush Kumar, Saha Sriparna, Jain Vinija, Mondal Samrat, Chadha Aman
- Mixsumm: Topic-based Data Augmentation Using Llms For Low-resource Extractive Text Summarization Sahu Gaurav, Laradji Issam H.
- Pelican: Correcting Hallucination In Vision-llms Via Claim Decomposition And Program Of Thought Verification Sahu Pritish, Sikka Karan, Divakaran Ajay
- LUNA: A Framework For Language Understanding And Naturalness Assessment Saidov Marat, Bakalova Aleksandra, Taktasheva Ekaterina, Mikhailov Vladislav, Artemova Ekaterina
- Optimization Methods For Personalizing Large Language Models Through Retrieval Augmentation Salemi Alireza, Kallumadi Surya, Zamani Hamed
- Simulating Task-oriented Dialogues With State Transition Graphs And Large Language Models Samarinas Chris, Promthaw Pracha, Nijasure Atharva, Zeng Hansi, Killingback Julian, Zamani Hamed
- Berts Are Generative In-context Learners Samuel David
- Rainbow Teaming: Open-ended Generation Of Diverse Adversarial Prompts Samvelyan Mikayel, Raparthy Sharath Chandra, Lupu Andrei, Hambro Eric, Markosyan Aram H., Bhatt Manish, Mao Yuning, Jiang Minqi, Parker-holder Jack, Foerster Jakob, Rocktäschel Tim, Raileanu Roberta
- Zero-shot RTL Code Generation With Attention Sink Augmented Large Language Models Sandal Selim, Akturk Ismail
- Prompting Llms To Compose Meta-review Drafts From Peer-review Narratives Of Scholarly Manuscripts Santu Shubhra Kanti Karmaker, Sinha Sanjeev Kumar, Bansal Naman, Knipper Alex, Sarkar Souvika, Salvador John, Mahajan Yash, Guttikonda Sri, Akter Mousumi, Freestone Matthew, Williams Matthew C. Jr
- Can Llms Master Math? Investigating Large Language Models On Math Stack Exchange Satpute Ankit, Giessing Noah, Greiner-petter Andre, Schubotz Moritz, Teschke Olaf, Aizawa Akiko, Gipp Bela
- Robust Few-shot Transfer Learning For Knowledge Base Question Answering With Unanswerable Questions Sawhney Riya, Bhattacharya Indrajit, Mausam
- Improving The Validity Of Automatically Generated Feedback Via Reinforcement Learning Scarlatos Alexander, Smith Digory, Woodhead Simon, Lan Andrew
- Impacts Of Anthropomorphizing Large Language Models In Learning Environments Schaaff Kristina, Heidelmann Marc-andré
- The Human Factor In Detecting Errors Of Large Language Models: A Systematic Literature Review And Future Research Directions Schiller Christian A.
- Self-distillation For Model Stacking Unlocks Cross-lingual NLU In 200+ Languages Schmidt Fabian David, Borchert Philipp, Vulić Ivan, Glavaš Goran
- Ruffle&riley: Insights From Designing And Evaluating A Large Language Model-based Conversational Tutoring System Schmucker Robin, Xia Meng, Azaria Amos, Mitchell Tom
- M5 -- A Diverse Benchmark To Assess The Performance Of Large Multimodal Models Across Multilingual And Multicultural Vision-language Tasks Schneider Florian, Sitaram Sunayana
- Exploring Human-llm Conversations: Mental Models And The Originator Of Toxicity Schneider Johannes, Flores Arianna Casanova, Kranz Anne-catherine
- Ai-augmented Predictions: LLM Assistants Improve Human Forecasting Accuracy Schoenegger Philipp, Park Peter S., Karger Ezra, Trott Sean, Tetlock Philip E.
- Metacognitive Myopia In Large Language Models Scholten Florian, Rebholz Tobias R., Hütter Mandy
- THREAD: Thinking Deeper With Recursive Spawning Schroeder Philip, Morgan Nathaniel, Luo Hongyin, Glass James
- In-context Learning Agents Are Asymmetric Belief Updaters Schubert Johannes A., Jagadish Akshay K., Binz Marcel, Schulz Eric
- Soft Prompt Threats: Attacking Safety Alignment And Unlearning In Open-source Llms Through The Embedding Space Schwinn Leo, Dobre David, Xhonneux Sophie, Gidel Gauthier, Gunnemann Stephan
- Evaluating Large Language Models With Fmeval Schwöbel Pola, Franceschi Luca, Zafar Muhammad Bilal, Vasist Keerthan, Malhotra Aman, Shenhar Tomer, Tailor Pinal, Yilmaz Pinar, Diamond Michael, Donini Michele
- Behavioral Testing: Can Large Language Models Implicitly Resolve Ambiguous Entities? Sedova Anastasiia, Litschko Robert, Frassinelli Diego, Roth Benjamin, Plank Barbara
- Do Llms Find Human Answers To Fact-driven Questions Perplexing? A Case Study On Reddit Seegmiller Parker, Gatto Joseph, Sharif Omar, Basak Madhusudan, Preum Sarah Masud
- Generative Explore-exploit: Training-free Optimization Of Generative Recommender Systems Using LLM Optimizers Senel Lütfi Kerem, Fetahu Besnik, Yoshida Davis, Chen Zhiyu, Castellucci Giuseppe, Vedula Nikhita, Choi Jason, Malmasi Shervin
- Train-attention: Meta-learning Where To Focus In Continual Knowledge Learning Seo Yeongbin, Lee Dongha, Yeo Jinyoung
- Surprising Efficacy Of Fine-tuned Transformers For Fact-checking Over Larger Language Models Setty Vinay
- Benchmarking Large Language Models For Math Reasoning Tasks Seßler Kathrin, Rong Yao, Gözlüklü Emek, Kasneci Enkelejda
- Medpromptx: Grounded Multimodal Prompting For Chest X-ray Diagnosis Shaaban Mai A., Khan Adnan, Yaqub Mohammad
- Ai-augmented Brainwriting: Investigating The Use Of Llms In Group Ideation Shaer Orit, Cooper Angelora, Mokryn Osnat, Kun Andrew L., Shoshan Hagit Ben
- Flashattention-3: Fast And Accurate Attention With Asynchrony And Low-precision Shah Jay, Bikshandi Ganesh, Zhang Ying, Thakkar Vijay, Ramani Pradeep, Dao Tri
- Putting Gpt-4o To The Sword: A Comprehensive Evaluation Of Language, Vision, Speech, And Multimodal Proficiency Shahriar Sakib, Lund Brady, Mannuru Nishith Reddy, Arshad Muhammad Arbab, Hayawi Kadhim, Bevara Ravi Varma Kumar, Mannuru Aashrith, Batool Laiba
- Detection And Measurement Of Syntactic Templates In Generated Text Shaib Chantal, Elazar Yanai, Li Junyi Jessy, Wallace Byron C.
- Show, Don't Tell: Aligning Language Models With Demonstrated Feedback Shaikh Omar, Lam Michelle, Hejna Joey, Shao Yijia, Bernstein Michael, Yang Diyi
- Evaluating Text Summaries Generated By Large Language Models Using Openai's GPT Shakil Hassan, Mahi Atqiya Munawara, Nguyen Phuoc, Ortiz Zeydy, Mardini Mamoun T.
- Ai-native Memory: A Pathway From Llms Towards AGI Shang Jingbo, Zheng Zai, Wei Jiale, Ying Xiang, Tao Felix, Team Mindverse
- Synergy-of-thoughts: Eliciting Efficient Reasoning In Hybrid Language Models Shang Yu, Li Yu, Xu Fengli, Li Yong
- Patch-level Training For Large Language Models Shao Chenze, Meng Fandong, Zhou Jie
- Extensible Embedding: A Flexible Multipler For Llm's Context Length Shao Ninglu, Xiao Shitao, Liu Zheng, Zhang Peitian
- Guiding In-context Learning Of Llms Through Quality Estimation For Machine Translation Sharami Javad Pourmostafa Roshan, Shterionov Dimitar, Spronck Pieter
- A Critical Evaluation Of AI Feedback For Aligning Large Language Models Sharma Archit, Keh Sedrick, Mitchell Eric, Finn Chelsea, Arora Kushal, Kollar Thomas
- Patched MOA: Optimizing Inference For Diverse Software Development Tasks Sharma Asankhaya
- Why Would You Suggest That? Human Trust In Language Model Responses Sharma Manasi, Siu Ho Chit, Paleja Rohan, Peña Jaime D.
- Mammothmoda: Multi-modal Large Language Model She Qi, Pan Junwen, Wan Xin, Zhang Rui, Lu Dawei, Huang Kai
- MAPO: Advancing Multilingual Reasoning Through Multilingual Alignment-as-preference Optimization She Shuaijie, Zou Wei, Huang Shujian, Zhu Wenhao, Liu Xiang, Geng Xiang, Chen Jiajun
- Accurate And Efficient Fine-tuning Of Quantized Large Language Models Through Optimal Balance Shen Ao, Wang Qiang, Lai Zhiquan, Li Xionglve, Li Dongsheng
- Nemo-aligner: Scalable Toolkit For Efficient Model Alignment Shen Gerald, Wang Zhilin, Delalleau Olivier, Zeng Jiaqi, Dong Yi, Egert Daniel, Sun Shengyang, Zhang Jimmy, Jain Sahil, Taghibakhshi Ali, Ausin Markel Sanz, Aithal Ashwath, Kuchaiev Oleksii
- Rapid Optimization For Jailbreaking Llms Via Subconscious Exploitation And Echopraxia Shen Guangyu, Cheng Siyuan, Zhang Kaiyuan, Tao Guanhong, An Shengwei, Yan Lu, Zhang Zhuo, Ma Shiqing, Zhang Xiangyu
- Shortcutsbench: A Large-scale Real-world Benchmark For Api-based Agents Shen Haiyang, Li Yue, Meng Desong, Cai Dongqi, Qi Sheng, Zhang Li, Xu Mengwei, Ma Yun
- Boosting Reward Model With Preference-conditional Multi-aspect Synthetic Data Generation Shen Jiaming, Xu Ran, Jun Yennie, Qin Zhen, Liu Tianqi, Yang Carl, Liang Yi, Baumgartner Simon, Bendersky Michael
- The Language Barrier: Dissecting Safety Challenges Of Llms In Multilingual Contexts Shen Lingfeng, Tan Weiting, Chen Sihao, Chen Yunmo, Zhang Jingyu, Xu Haoran, Zheng Boyuan, Koehn Philipp, Khashabi Daniel
- Thermometer: Towards Universal Calibration For Large Language Models Shen Maohao, Das Subhro, Greenewald Kristjan, Sattigeri Prasanna, Wornell Gregory, Ghosh Soumya
- Learning To Decode Collaboratively With Multiple Language Models Shen Shannon Zejiang, Lang Hunter, Wang Bailin, Kim Yoon, Sontag David
- Exploring User Retrieval Integration Towards Large Language Models For Cross-domain Sequential Recommendation Shen Tingjia, Wang Hao, Zhang Jiaqing, Zhao Sirui, Li Liangyue, Chen Zulong, Lian Defu, Chen Enhong
- Improving Reinforcement Learning From Human Feedback Using Contrastive Rewards Shen Wei, Zhang Xiaoying, Yao Yuanshun, Zheng Rui, Guo Hongyi, Liu Yang
- Small Llms Are Weak Tool Learners: A Multi-llm Agent Shen Weizhou, Li Chenliang, Chen Hongzhan, Yan Ming, Quan Xiaojun, Chen Hehong, Zhang Ji, Huang Fei
- An Empirical Study On Context Length For Open-domain Dialog Generation Shen Xinyi, Lin Zuoquan
- Proctag: Process Tagging For Assessing The Efficacy Of Document Instruction Data Shen Yufan, Luo Chuwei, Zhu Zhaoqing, Chen Yang, Zheng Qi, Yu Zhi, Bu Jiajun, Yao Cong
- Beyond Human Preferences: Exploring Reinforcement Learning Trajectory Evaluation And Improvement Through Llms Shen Zichao, Zhu Tianchen, Sun Qingyun, Gao Shiqi, Li Jianxin
- From Language Models To Practical Self-improving Computer Agents Sheng Alex
- Language Models Encode Collaborative Signals In Recommendation Sheng Leheng, Zhang An, Zhang Yi, Chen Yuxin, Wang Xiang, Chua Tat-seng
- Can Language Models Explain Their Own Classification Behavior? Sherburn Dane, Chughtai Bilal, Evans Owain
- Using Chatgpt To Score Essays And Short-form Constructed Responses Shermis Mark D.
- Neurosymbolic AI For Enhancing Instructability In Generative AI Sheth Amit, Pallagani Vishal, Roy Kaushik
- Opex: A Component-wise Analysis Of Llm-centric Agents In Embodied Instruction Following Shi Haochen, Sun Zhiyuan, Yuan Xingdi, Côté Marc-alexandre, Liu Bang
- Judging The Judges: A Systematic Investigation Of Position Bias In Pairwise Comparative Assessments By Llms Shi Lin, Ma Chiyu, Ma Weicheng, Vosoughi Soroush
- Eagle: Exploring The Design Space For Multimodal Llms With Mixture Of Encoders Shi Min, Liu Fuxiao, Wang Shihao, Liao Shijia, Radhakrishnan Subhashree, Huang De-an, Yin Hongxu, Sapra Karan, Yacoob Yaser, Shi Humphrey, Catanzaro Bryan, Tao Andrew, Kautz Jan, Yu Zhiding, Liu Guilin
- Wildfeedback: Aligning Llms With In-situ User Interactions And Feedback Shi Taiwei, Wang Zhuoer, Yang Longqi, Lin Ying-chun, He Zexue, Wan Mengting, Zhou Pei, Jauhar Sujay, Xu Xiaofeng, Song Xia, Neville Jennifer
- Profuser: Progressive Fusion Of Large Language Models Shi Tianyuan, Wan Fanqi, Huang Canbin, Quan Xiaojun, Li Chenliang, Yan Ming, Zhang Ji
- Math-llava: Bootstrapping Mathematical Reasoning For Multimodal Large Language Models Shi Wenhao, Hu Zhiqiang, Bin Yi, Liu Junhua, Yang Yang, Ng See-kiong, Bing Lidong, Lee Roy Ka-wei
- Ehragent: Code Empowers Large Language Models For Few-shot Complex Tabular Reasoning On Electronic Health Records Shi Wenqi, Xu Ran, Zhuang Yuchen, Yu Yue, Zhang Jieyu, Wu Hang, Zhu Yuanda, Ho Joyce, Yang Carl, Wang May D.
- Large Language Models Are Learnable Planners For Long-term Recommendation Shi Wentao, He Xiangnan, Zhang Yang, Gao Chongming, Li Xinyue, Zhang Jizhi, Wang Qifan, Feng Fuli
- Retrieval-enhanced Knowledge Editing In Language Models For Multi-hop Question Answering Shi Yucheng, Tan Qiaoyu, Wu Xuansheng, Zhong Shaochen, Zhou Kaixiong, Liu Ninghao
- Eragent: Enhancing Retrieval-augmented Language Models With Improved Accuracy, Efficiency, And Personalization Shi Yunxiao, Zi Xing, Shi Zijing, Zhang Haimin, Wu Qiang, Xu Min
- Assessment Of Multimodal Large Language Models In Alignment With Human Values Shi Zhelun, Wang Zhipin, Fan Hongxing, Zhang Zaibin, Li Lijun, Zhang Yongting, Yin Zhenfei, Sheng Lu, Qiao Yu, Shao Jing
- Why Larger Language Models Do In-context Learning Differently? Shi Zhenmei, Wei Junyi, Xu Zhuoyan, Liang Yingyu
- How Can Large Language Models Enable Better Socially Assistive Human-robot Interaction: A Brief Survey Shi Zhonghao, Landrum Ellen, O'connell Amy, Kian Mina, Pinto-alva Leticia, Shrestha Kaleen, Zhu Xiaoyuan, Matarić Maja J
- Abusegpt: Abuse Of Generative AI Chatbots To Create Smishing Campaigns Shibli Ashfak Md, Pritom Mir Mehedi A., Gupta Maanak
- CAUS: A Dataset For Question Generation Based On Human Cognition Leveraging Large Language Models Shin Minjung, Kim Donghyun, Ryu Jeh-kwang
- Decompose, Enrich, And Extract! Schema-aware Event Extraction Using Llms Shiri Fatemeh, Nguyen Van, Moghimifar Farhad, Yoo John, Haffari Gholamreza, Li Yuan-fang
- Deconstructing In-context Learning: Understanding Prompts Via Corruption Shivagunde Namrata, Lialin Vladislav, Muckatira Sherin, Rumshisky Anna
- Implicit Multimodal Alignment: On The Generalization Of Frozen Llms To Multimodal Inputs Shukor Mustafa, Cord Matthieu
- An Active Inference Strategy For Prompting Reliable Responses From Large Language Models In Medical Practice Shusterman Roma, Waters Allison C., O'neill Shannon, Luu Phan, Tucker Don M.
- Design2code: How Far Are We From Automating Front-end Engineering? Si Chenglei, Zhang Yanzhe, Yang Zhengyuan, Liu Ruibo, Yang Diyi
- Iclguard: Controlling In-context Learning Behavior For Applicability Authorization Si Wai Man, Backes Michael, Zhang Yang
- Integrating Large Language Models Into Recommendation Via Mutual Augmentation And Adaptive Aggregation Sichun Luo, Yuxuan Yao, Bowei He, Yinya Huang, Aojun Zhou, Xinyi Zhang, Yuanzhang Xiao, Mingjie Zhan, Linqi Song
- Constructing Benchmarks And Interventions For Combating Hallucinations In Llms Simhi Adi, Herzig Jonathan, Szpektor Idan, Belinkov Yonatan
- H2o-danube-1.8b Technical Report Singer Philipp, Pfeiffer Pascal, Babakhin Yauhen, Jeblick Maximilian, Dhankhar Nischay, Fodor Gabor, Ambati Sri Satish
- INDIC QA BENCHMARK: A Multilingual Benchmark To Evaluate Question Answering Capability Of Llms For Indic Languages Singh Abhishek Kumar, Murthy Rudra, Kumar Vishwajeet, Sen Jaydeep, Ramakrishnan Ganesh
- Robustness Of Llms To Perturbations In Text Singh Ayush, Singh Navpreet, Vatsal Shubham
- Rethinking Interpretability In The Era Of Large Language Models Singh Chandan, Inala Jeevana Priya, Galley Michel, Caruana Rich, Gao Jianfeng
- Indicgenbench: A Multilingual Benchmark To Evaluate Generation Capabilities Of Llms On Indic Languages Singh Harman, Gupta Nitish, Bharadwaj Shikhar, Tewari Dinesh, Talukdar Partha
- Exposing The Achilles' Heel: Evaluating Llms Ability To Handle Mistakes In Mathematical Reasoning Singh Joykirat, Nambi Akshay, Vineet Vibhav
- Aya Dataset: An Open-access Collection For Multilingual Instruction Tuning Singh Shivalika, Vargus Freddie, Dsouza Daniel, Karlsson Börje F., Mahendiran Abinaya, Ko Wei-yin, Shandilya Herumb, Patel Jay, Mataciunas Deividas, Omahony Laura, Zhang Mike, Hettiarachchi Ramith, Wilson Joseph, Machado Marina, Moura Luisa Souza, Krzemiński Dominik, Fadaei Hakimeh, Ergün Irem, Okoh Ifeoma, Alaagib Aisha, Mudannayake Oshan, Alyafeai Zaid, Chien Vu Minh, Ruder Sebastian, Guthikonda Surya, Alghamdi Emad A., Gehrmann Sebastian, Muennighoff Niklas, Bartolo Max, Kreutzer Julia, Üstün Ahmet, Fadaee Marzieh, Hooker Sara
- Llm-dcache: Improving Tool-augmented Llms With Gpt-driven Localized Data Caching Singh Simranjit, Fore Michael, Karatzas Andreas, Lee Chaehong, Jian Yanan, Shangguan Longfei, Yu Fuxun, Anagnostopoulos Iraklis, Stamoulis Dimitrios
- LGR2: Language Guided Reward Relabeling For Accelerating Hierarchical Reinforcement Learning Singh Utsav, Bhattacharyya Pramit, Namboodiri Vinay P.
- Exploring Value Biases: How Llms Deviate Towards The Ideal Sivaprasad Sarath, Kaushik Pramod, Abdelnabi Sahar, Fritz Mario
- Using Adaptive Empathetic Responses For Teaching English Siyan Li, Shao Teresa, Yu Zhou, Hirschberg Julia
- Replan: Robotic Replanning With Perception And Language Models Skreta Marta, Zhou Zihan, Yuan Jia Lin, Darvish Kourosh, Aspuru-guzik Alán, Garg Animesh
- The Mysterious Case Of Neuron 1512: Injectable Realignment Architectures Reveal Internal Characteristics Of Meta's Llama 2 Model Smith Brenden, Baker Dallin, Chase Clayton, Barney Myles, Parker Kaden, Allred Makenna, Hu Peter, Evans Alex, Fulda Nancy
- Scaling LLM Test-time Compute Optimally Can Be More Effective Than Scaling Model Parameters Snell Charlie, Lee Jaehoon, Xu Kelvin, Kumar Aviral
- Chatting Up Attachment: Using Llms To Predict Adult Bonds Soares Paulo, Mccurdy Sean, Gerber Andrew J., Fonagy Peter
- Explaining Autonomy: Enhancing Human-robot Interaction Through Explanation Generation With Large Language Models Sobrín-hidalgo David, González-santamarta Miguel A., Guerrero-higueras Ángel M., Rodríguez-lera Francisco J., Matellán-olivera Vicente
- From Words To Worlds: Transforming One-line Prompt Into Immersive Multi-modal Digital Stories With Communicative LLM Agent Sohn Samuel S., Li Danrui, Zhang Sen, Chang Che-jui, Kapadia Mubbasir
- Code Needs Comments: Enhancing Code Llms With Comment Augmentation Song Demin, Guo Honglin, Zhou Yunhua, Xing Shuhao, Wang Yudong, Song Zifan, Zhang Wenwei, Guo Qipeng, Yan Hang, Qiu Xipeng, Lin Dahua
- Increasing Model Capacity For Free: A Simple Strategy For Parameter Efficient Fine-tuning Song Haobo, Zhao Hao, Majumder Soumajit, Lin Tao
- The Typing Cure: Experiences With Large Language Model Chatbots For Mental Health Support Song Inhwa, Pendse Sachin R., Kumar Neha, De Choudhury Munmun
- Out-of-distribution Generalization Via Composition: A Lens Through Induction Heads In Transformers Song Jiajun, Xu Zhuoyan, Zhong Yiqiao
- Cs-bench: A Comprehensive Benchmark For Large Language Models Towards Computer Science Mastery Song Xiaoshuai, Diao Muxi, Dong Guanting, Wang Zhengyang, Fu Yujia, Qiao Runqi, Wang Zhexu, Fu Dayuan, Wu Huangxuan, Liang Bin, Zeng Weihao, Wang Yejie, Gongque Zhuoma, Yu Jianing, Tan Qiuna, Xu Weiran
- Knowledge Editing On Black-box Large Language Models Song Xiaoshuai, Wang Zhengyang, He Keqing, Dong Guanting, Mou Yutao, Zhao Jinxu, Xu Weiran
- Position: Leverage Foundational Models For Black-box Optimization Song Xingyou, Tian Yingtao, Lange Robert Tjarko, Lee Chansoo, Tang Yujin, Chen Yutian
- ELLA-V: Stable Neural Codec Language Modeling With Alignment-guided Sequence Reordering Song Yakun, Chen Zhuo, Wang Xiaofei, Ma Ziyang, Chen Xie
- The Good, The Bad, And The Greedy: Evaluation Of Llms Should Not Ignore Non-determinism Song Yifan, Wang Guoyin, Li Sujian, Lin Bill Yuchen
- Alchemistcoder: Harmonizing And Eliciting Code Capability By Hindsight Tuning On Multi-source Data Song Zifan, Wang Yudong, Zhang Wenwei, Liu Kuikun, Lyu Chengqi, Song Demin, Guo Qipeng, Yan Hang, Lin Dahua, Chen Kai, Zhao Cairong
- Pedagogical Alignment Of Large Language Models Sonkar Shashank, Ni Kangqi, Chaudhary Sapana, Baraniuk Richard G.
- Empo: Theory-driven Dataset Construction For Empathetic Response Generation Through Preference Optimization Sotolar Ondrej
- Enhancing Ai-driven Psychological Consultation: Layered Prompts With Large Language Models Souza Rafael, Lim Jia-hao, Davis Alexander
- Know When To Stop: A Study Of Semantic Drift In Text Generation Spataru Ava, Hambro Eric, Voita Elena, Cancedda Nicola
- Generaitor: Tree-in-the-loop Text Generation For Language Model Explainability And Adaptation Spinner Thilo, Kehlbeck Rebecca, Sevastjanova Rita, Stähle Tobias, Keim Daniel A., Deussen Oliver, El-assady Mennatallah
- Appropriateness Of Llm-equipped Robotic Well-being Coach Language In The Workplace: A Qualitative Evaluation Spitale Micol, Axelsson Minja, Gunes Hatice
- Repetition Improves Language Model Embeddings Springer Jacob Mitchell, Kotha Suhas, Fried Daniel, Neubig Graham, Raghunathan Aditi
- PUB: A Pragmatics Understanding Benchmark For Assessing Llms' Pragmatics Capabilities Sravanthi Settaluri Lakshmi, Doshi Meet, Kalyan Tankala Pavan, Murthy Rudra, Bhattacharyya Pushpak, Dabre Raj
- Probing Multimodal Llms As World Models For Driving Sreeram Shiva, Wang Tsun-hsuan, Maalouf Alaa, Rosman Guy, Karaman Sertac, Rus Daniela
- Comparative Analysis Of Different Efficient Fine Tuning Methods Of Large Language Models (llms) In Low-resource Setting Srinivasan Krishna Prasad Varadarajan, Gumpena Prasanth, Yattapu Madhusudhana, Brahmbhatt Vishal H.
- A Comprehensive Study Of The Capabilities Of Large Language Models For Vulnerability Detection Steenhoek Benjamin, Rahman Md Mahbubur, Roy Monoshi Kumar, Alam Mirza Sanjida, Barr Earl T., Le Wei
- Virtual Agents For Alcohol Use Counseling: Exploring Llm-powered Motivational Interviewing Steenstra Ian, Nouraei Farnaz, Arjmand Mehdi, Bickmore Timothy W.
- RLVF: Learning From Verbal Feedback Without Overgeneralization Stephan Moritz, Khazatsky Alexander, Mitchell Eric, Chen Annie S, Hsu Sheryl, Sharma Archit, Finn Chelsea
- Satyrn: A Platform For Analytics Augmented Generation Sterbentz Marko, Barrie Cameron, Shahi Shubham, Dutta Abhratanu, Hooshmand Donna, Pack Harper, Hammond Kristian J.
- Few-shot VQA With Frozen Llms: A Tale Of Two Approaches Sterner Igor, Lin Weizhe, Chen Jinghong, Byrne Bill
- The Calibration Gap Between Model And Human Confidence In Large Language Models Steyvers Mark, Tejeda Heliodoro, Kumar Aakriti, Belem Catarina, Karny Sheer, Hu Xinyue, Mayer Lukas, Smyth Padhraic
- Tailoring Vaccine Messaging With Common-ground Opinions Stureborg Rickard, Chen Sanxing, Xie Ruoyu, Patel Aayushi, Li Christopher, Zhu Chloe Qinyu, Hu Tingnan, Yang Jun, Dhingra Bhuwan
- Investigating Video Reasoning Capability Of Large Language Models With Tropes In Movies Su Hung-ting, Chao Chun-tong, Hsu Ya-ching, Lin Xudong, Niu Yulei, Lee Hung-yi, Hsu Winston H.
- Mission Impossible: A Statistical Perspective On Jailbreaking Llms Su Jingtong, Kempe Julia, Ullrich Karen
- Llm-barber: Block-aware Rebuilder For Sparsity Mask In One-shot For Large Language Models Su Yupeng, Guan Ziyi, Liu Xiaoqun, Jin Tianlai, Wu Dongkuan, Chesi Graziano, Wong Ngai, Yu Hao
- Living In The Moment: Can Large Language Models Grasp Co-temporal Reasoning? Su Zhaochen, Li Juntao, Zhang Jun, Zhu Tong, Qu Xiaoye, Zhou Pan, Bowen Yan, Cheng Yu, Zhang Min
- Timo: Towards Better Temporal Reasoning For Language Models Su Zhaochen, Zhang Jun, Zhu Tong, Qu Xiaoye, Li Juntao, Zhang Min, Cheng Yu
- Hierarchical In-context Reinforcement Learning With Hindsight Modular Reflections For Planning Sun Chuanneng, Huang Songjun, Pompili Dario
- Sq-llava: Self-questioning For Large Vision-language Assistant Sun Guohao, Qin Can, Wang Jiamian, Chen Zeyuan, Xu Ran, Tao Zhiqiang
- Triforce: Lossless Acceleration Of Long Sequence Generation With Hierarchical Speculative Decoding Sun Hanshi, Chen Zhuoming, Yang Xinyu, Tian Yuandong, Chen Beidi
- Supervised Fine-tuning As Inverse Reinforcement Learning Sun Hao
- Retrieved In-context Principles From Previous Mistakes Sun Hao, Jiang Yong, Wang Bo, Hou Yingyan, Zhang Yan, Xie Pengjun, Huang Fei
- Fuxitranyu: A Multilingual Large Language Model Trained With Balanced Data Sun Haoran, Jin Renren, Xu Shaoyang, Pan Leiyu, Supryadi, Cui Menglong, Du Jiangcun, Lei Yikun, Yang Lei, Shi Ling, Xiao Juesi, Zhu Shaolin, Xiong Deyi
- Conifer: Improving Complex Constrained Instruction-following Ability Of Large Language Models Sun Haoran, Liu Lixin, Li Junjie, Wang Fengyu, Dong Baohua, Lin Ran, Huang Ruohui
- Lawluo: A Chinese Law Firm Co-run By LLM Agents Sun Jingyun, Dai Chengxiao, Luo Zhongze, Chang Yangbo, Li Yang
- Fine-tuning Vs Prompting, Can Language Models Understand Human Values? Sun Pingwei
- Fostering Natural Conversation In Large Language Models With NICO: A Natural Interactive Conversation Dataset Sun Renliang, Liu Mengyuan, Yang Shiping, Wang Rui, He Junqing, Zhang Jiaxing
- Dr-llava: Visual Instruction Tuning With Symbolic Clinical Grounding Sun Shenghuan, Goldgof Gregory M., Schubert Alexander, Sun Zhiqing, Hartvigsen Thomas, Butte Atul J., Alaa Ahmed
- The Critique Of Critique Sun Shichao, Li Junlong, Yuan Weizhe, Yuan Ruifeng, Li Wenjie, Liu Pengfei
- Diversifying Query: Region-guided Transformer For Temporal Sentence Grounding Sun Xiaolong, Shi Liushuai, Wang Le, Zhou Sanping, Xia Kun, Wang Yabing, Hua Gang
- Multi-turn Context Jailbreak Attack On Large Language Models From First Principles Sun Xiongtao, Zhang Deyue, Yang Dongdong, Zou Quanchen, Li Hui
- Rapidly Developing High-quality Instruction Data And Evaluation Benchmark For Large Language Models With Minimal Human Effort: A Case Study On Japanese Sun Yikun, Wan Zhen, Ueda Nobuhiro, Yahata Sakiko, Cheng Fei, Chu Chenhui, Kurohashi Sadao
- DFA-RAG: Conversational Semantic Router For Large Language Model With Definite Finite Automaton Sun Yiyou, Hu Junjie, Cheng Wei, Chen Haifeng
- F-eval: Assessing Fundamental Abilities With Refined Evaluation Methods Sun Yu, Chen Keyu, Wang Shujie, Li Peiji, Guo Qipeng, Yan Hang, Qiu Xipeng, Huang Xuanjing, Lin Dahua
- You Only Cache Once: Decoder-decoder Architectures For Language Models Sun Yutao, Dong Li, Zhu Yi, Huang Shaohan, Wang Wenhui, Ma Shuming, Zhang Quanlu, Wang Jianyong, Wei Furu
- Enhancing Agent Learning Through World Dynamics Modeling Sun Zhiyuan, Shi Haochen, Côté Marc-alexandre, Berseth Glen, Yuan Xingdi, Liu Bang
- Llavaolmobitnet1b: Ternary LLM Goes Multimodal! Sundaram Jainaveen, Iyer Ravi
- How The Advent Of Ubiquitous Large Language Models Both Stymie And Turbocharge Dynamic Adversarial Question Generation Sung Yoo Yeon, Mondal Ishani, Boyd-graber Jordan
- Automating Research Synthesis With Domain-specific Large Language Model Fine-tuning Susnjak Teo, Hwang Peter, Reyes Napoleon H., Barczak Andre L. C., Mcintosh Timothy R., Ranathunga Surangika
- Specexec: Massively Parallel Speculative Decoding For Interactive LLM Inference On Consumer Devices Svirschevski Ruslan, May Avner, Chen Zhuoming, Chen Beidi, Jia Zhihao, Ryabinin Max
- From Explanations To Action: A Zero-shot, Theory-driven LLM Framework For Student Performance Feedback Swamy Vinitra, Romano Davide, Desikan Bhargav Srinivasa, Camburu Oana-maria, Käser Tanja
- Large Language Models And Video Games: A Preliminary Scoping Review Sweetser Penny
- Automatic Generation And Evaluation Of Reading Comprehension Test Items With Large Language Models Säuberli Andreas, Clematide Simon
- Online Adaptation Of Language Models With A Memory Of Amortized Contexts Tack Jihoon, Kim Jaehyung, Mitchell Eric, Shin Jinwoo, Teh Yee Whye, Schwarz Jonathan Richard
- Mmlu-pro+: Evaluating Higher-order Reasoning And Shortcut Learning In Llms Taghanaki Saeid Asgari, Khani Aliasgahr, Khasahmadi Amir
- JORA: JAX Tensor-parallel Lora Library For Retrieval Augmented Fine-tuning Tahir Anique, Cheng Lu, Liu Huan
- Improving Large Language Model (LLM) Fidelity Through Context-aware Grounding: A Systematic Approach To Reliability And Veracity Talukdar Wrick, Biswas Anjanava
- Let Me Speak Freely? A Study On The Impact Of Format Restrictions On Performance Of Large Language Models Tam Zhi Rui, Wu Cheng-kuang, Tsai Yi-lin, Lin Chieh-yen, Lee Hung-yi, Chen Yun-nung
- An Improved Traditional Chinese Evaluation Suite For Foundation Model Tam Zhi-rui, Pai Ya-ting, Lee Yen-wei, Chen Jun-da, Chu Wei-min, Cheng Sega, Shuai Hong-han
- Mobilequant: Mobile-friendly Quantization For On-device Language Models Tan Fuwen, Lee Royson, Dudziak Łukasz, Hu Shell Xu, Bhattacharya Sourav, Hospedales Timothy, Tzimiropoulos Georgios, Martinez Brais
- PROXYQA: An Alternative Framework For Evaluating Long-form Text Generation With Large Language Models Tan Haochen, Guo Zhijiang, Shi Zhan, Xu Lu, Liu Zhili, Feng Yunlong, Li Xiaoguang, Wang Yasheng, Shang Lifeng, Liu Qun, Song Linqi
- Taskgen: A Task-based, Memory-infused Agentic Framework Using Strictjson Tan John Chong Min, Saroj Prince, Runwal Bharat, Maheshwari Hardik, Sheng Brian Lim Yi, Cottrill Richard, Chona Alankrit, Kumar Ambuj, Motani Mehul
- Large Language Models For Data Annotation: A Survey Tan Zhen, Li Dawei, Wang Song, Beigi Alimohammad, Jiang Bohan, Bhattacharjee Amrita, Karami Mansooreh, Li Jundong, Cheng Lu, Liu Huan
- Tuning-free Accountable Intervention For LLM Deployment -- A Metacognitive Approach Tan Zhen, Peng Jie, Chen Tianlong, Liu Huan
- An Empirical Study And Analysis Of Text-to-image Generation Using Large Language Model-powered Textual Representation Tan Zhiyu, Yang Mengping, Qin Luozheng, Yang Hao, Qian Ye, Zhou Qiang, Zhang Cheng, Li Hao
- EVALALIGN: Supervised Fine-tuning Multimodal Llms With Human-aligned Data For Evaluating Text-to-image Models Tan Zhiyu, Yang Xiaomeng, Qin Luozheng, Yang Mengping, Zhang Cheng, Li Hao
- Instructdoc: A Dataset For Zero-shot Generalization Of Visual Document Understanding With Instructions Tanaka Ryota, Iki Taichi, Nishida Kyosuke, Saito Kuniko, Suzuki Jun
- Jill Watson: A Virtual Teaching Assistant Powered By Chatgpt Taneja Karan, Maiti Pratyusha, Kakar Sandeep, Guruprasad Pranav, Rao Sanjeev, Goel Ashok K.
- Tofueval: Evaluating Hallucinations Of Llms On Topic-focused Dialogue Summarization Tang Liyan, Shalyminov Igor, Wong Amy Wing-mei, Burnsky Jon, Vincent Jake W., Yang Yu'an, Singh Siffi, Feng Song, Song Hwanjun, Su Hang, Sun Lijia, Zhang Yi, Mansour Saab, Mckeown Kathleen
- Llmbox: A Comprehensive Library For Large Language Models Tang Tianyi, Hu Yiwen, Li Bingqian, Luo Wenyang, Qin Zijing, Sun Haoxiang, Wang Jiapeng, Xu Shiyi, Cheng Xiaoxue, Guo Geyang, Peng Han, Zheng Bowen, Tang Yiru, Min Yingqian, Chen Yushuo, Chen Jie, Zhao Yuanqian, Ding Luran, Wang Yuhao, Dong Zican, Xia Chunxuan, Li Junyi, Zhou Kun, Zhao Wayne Xin, Wen Ji-rong
- Language-specific Neurons: The Key To Multilingual Capabilities In Large Language Models Tang Tianyi, Luo Wenyang, Huang Haoyang, Zhang Dongdong, Wang Xiaolei, Zhao Xin, Wei Furu, Wen Ji-rong
- Step-back Profiling: Distilling User History For Personalized Scientific Writing Tang Xiangru, Zhang Xingyao, Shao Yanjun, Wu Jie, Zhao Yilun, Cohan Arman, Gong Ming, Zhang Dongmei, Gerstein Mark
- Multihop-rag: Benchmarking Retrieval-augmented Generation For Multi-hop Queries Tang Yixuan, Yang Yi
- On The Hardness Of Faithful Chain-of-thought Reasoning In Large Language Models Tanneru Sree Harsha, Ley Dan, Agarwal Chirag, Lakkaraju Himabindu
- Chatgpt Incorrectness Detection In Software Reviews Tanzil Minaoar Hossain, Khan Junaed Younus, Uddin Gias
- Scaling Laws With Vocabulary: Larger Models Deserve Larger Vocabularies Tao Chaofan, Liu Qian, Dou Longxu, Muennighoff Niklas, Wan Zhongwei, Luo Ping, Lin Min, Wong Ngai
- Enhancing Commentary Strategies For Imperfect Information Card Games: A Study Of Large Language Models In Guandan Commentary Tao Meiling, Liang Xuechen, Wang Ziyi, Tao Yiling, Shi Tianyu
- When To Trust Llms: Aligning Confidence With Response Quality Tao Shuchang, Yao Liuyi, Ding Hanxing, Xie Yuexiang, Cao Qi, Sun Fei, Gao Jinyang, Shen Huawei, Ding Bolin
- CUDRT: Benchmarking The Detection Of Human Vs. Large Language Models Generated Texts Tao Zhen, Li Zhiyu, Xi Dinghao, Xu Wei
- MEEL: Multi-modal Event Evolution Learning Tao Zhengwei, Jin Zhi, Huang Junqiang, Chen Xiancai, Bai Xiaoying, Zhao Haiyan, Zhang Yifan, Tao Chongyang
- Distilling Llms' Decomposition Abilities Into Compact Language Models Tarasov Denis, Shridhar Kumar
- Embedding-aligned Language Models Tennenholtz Guy, Chow Yinlam, Hsu Chih-wei, Shani Lior, Liang Ethan, Boutilier Craig
- Interactive Prompt Debugging With Sequence Salience Tenney Ian, Mullins Ryan, Du Bin, Pandya Shree, Kahng Minsuk, Dixon Lucas
- Judging The Judges: Evaluating Alignment And Vulnerabilities In Llms-as-judges Thakur Aman Singh, Choudhary Kartik, Ramayapally Venkat Srinik, Vaidyanathan Sankaran, Hupkes Dieuwke
- Equipping Language Models With Tool Use Capability For Tabular Data Analysis In Finance Theuma Adrian, Shareghi Ehsan
- Supporting Human Raters With The Detection Of Harmful Content Using Large Language Models Thomas Kurt, Kelley Patrick Gage, Tao David, Meiklejohn Sarah, Vallis Owen, Tan Shunwen, Bratanič Blaž, Ferreira Felipe Tiengo, Eranti Vijay Kumar, Bursztein Elie
- Limits Of Transformer Language Models On Learning To Compose Algorithms Thomm Jonathan, Terzic Aleksandar, Camposampiero Giacomo, Hersche Michael, Schölkopf Bernhard, Rahimi Abbas
- Elitr-bench: A Meeting Assistant Benchmark For Long-context Language Models Thonet Thibaut, Rozen Jos, Besacier Laurent
- Hydralora: An Asymmetric Lora Architecture For Efficient Fine-tuning Tian Chunlin, Shi Zhan, Guo Zhijiang, Li Li, Xu Chengzhong
- Tokenize The World Into Object-level Knowledge To Address Long-tail Events In Autonomous Driving Tian Ran, Li Boyi, Weng Xinshuo, Chen Yuxiao, Schmerling Edward, Wang Yue, Ivanovic Boris, Pavone Marco
- Robustness Assessment Of Mathematical Reasoning In The Presence Of Missing And Contradictory Conditions Tian Shi-yu, Zhou Zhi, Jia Lin-han, Guo Lan-zhe, Li Yu-feng
- Toward Self-improvement Of Llms Via Imagination, Searching, And Criticizing Tian Ye, Peng Baolin, Song Linfeng, Jin Lifeng, Yu Dian, Mi Haitao, Yu Dong
- Selective Prompt Anchoring For Code Generation Tian Yuan, Zhang Tianyi
- Codehalu: Investigating Code Hallucinations In Llms Via Execution-based Verification Tian Yuchen, Yan Weixiang, Yang Qian, Zhao Xuandong, Chen Qian, Wang Wen, Luo Ziyang, Ma Lei, Song Dawn
- The Instinctive Bias: Spurious Images Lead To Hallucination In Mllms Han Tianyang, Lian Qing, Pan Rui, Pi Renjie, Zhang Jipeng, Diao Shizhe, Lin Yong, Zhang Tong
- LLM Circuit Analyses Are Consistent Across Training And Scale Tigges Curt, Hanna Michael, Yu Qinan, Biderman Stella
- Follow-up Questions Improve Documents Generated By Large Language Models Tix Bernadette J
- An Ecosage Assistant: Towards Building A Multimodal Plant Care Dialogue Assistant Tomar Mohit, Tiwari Abhisek, Saha Tulika, Jha Prince, Saha Sriparna
- Cpsdbench: A Large Language Model Evaluation Benchmark And Baseline For Chinese Public Security Domain Tong Xin, Jin Bo, Lin Zhi, Wang Binjun, Yu Ting, Cheng Qiang
- Can Llms Learn From Previous Mistakes? Investigating Llms' Errors To Boost For Reasoning Tong Yongqi, Li Dawei, Wang Sizhe, Wang Yujia, Teng Fei, Shang Jingbo
- Optimizing Language Model's Reasoning Abilities With Weak Supervision Tong Yongqi, Wang Sizhe, Li Dawei, Wang Yifan, Han Simeng, Lin Zi, Huang Chengsong, Huang Jiaxin, Shang Jingbo
- A Comprehensive Survey Of Hallucination Mitigation Techniques In Large Language Models Tonmoy S. M Towhidul Islam, Zaman S M Mehedi, Jain Vinija, Rani Anku, Rawte Vipula, Chadha Aman, Das Amitava
- Evaluating Large Language Models With Grid-based Game Competitions: An Extensible LLM Benchmark And Leaderboard Topsakal Oguzhan, Edell Colby Jacob, Harper Jackson Bailey
- Lavy: Vietnamese Multimodal Large Language Model Tran Chi, Thanh Huong Le
- Readctrl: Personalizing Text Generation With Readability-controlled Instruction Learning Tran Hieu, Yao Zonghai, Li Lingxi, Yu Hong
- Connecting The Dots: Llms Can Infer And Verbalize Latent Structure From Disparate Training Data Treutlein Johannes, Choi Dami, Betley Jan, Anil Cem, Marks Samuel, Grosse Roger Baker, Evans Owain
- Leveraging LLM Reasoning Enhances Personalized Recommender Systems Tsai Alicia Y., Kraft Adam, Jin Long, Cai Chenwei, Hosseini Anahita, Xu Taibai, Zhang Zemin, Hong Lichan, Chi Ed H., Yi Xinyang
- Enhance Modality Robustness In Text-centric Multimodal Alignment With Adversarial Prompting Tsai Yun-da, Yen Ting-yu, Liao Keng-te, Lin Shou-de
- Augmenting The Author: Exploring The Potential Of AI Collaboration In Academic Writing Tu Joseph, Hadan Hilda, Wang Derrick M., Sgandurra Sabrina A, Mogavi Reza Hadi, Nacke Lennart E.
- Charactereval: A Chinese Benchmark For Role-playing Conversational Agent Evaluation Tu Quan, Fan Shilong, Tian Zihang, Yan Rui
- R-eval: A Unified Toolkit For Evaluating Domain Knowledge Of Retrieval Augmented Large Language Models Tu Shangqing, Wang Yuanchun, Yu Jifan, Xie Yuyang, Shi Yaran, Wang Xiaozhi, Zhang Jing, Hou Lei, Li Juanzi
- LM Transparency Tool: Interactive Tool For Analyzing Transformer Language Models Tufanov Igor, Hambardzumyan Karen, Ferrando Javier, Voita Elena
- Lifelong Robot Library Learning: Bootstrapping Composable And Generalizable Skills For Embodied Control With Language Models Tziafas Georgios, Kasaei Hamidreza
- Unseentimeqa: Time-sensitive Question-answering Beyond Llms' Memorization Uddin Md Nayem, Saeidi Amir, Handa Divij, Seth Agastya, Son Tran Cao, Blanco Eduardo, Corman Steven R., Baral Chitta
- Bootstrapping Llm-based Task-oriented Dialogue Agents Via Self-talk Ulmer Dennis, Mansimov Elman, Lin Kaixiang, Sun Justin, Gao Xibin, Zhang Yi
- Hypotermqa: Hypothetical Terms Dataset For Benchmarking Hallucination Tendency Of Llms Uluoglakci Cem, Temizel Tugba Taskaya
- Investigating Context Effects In Similarity Judgements In Large Language Models Uprety Sagar, Jaiswal Amit Kumar, Liu Haiming, Song Dawei
- Llms With Industrial Lens: Deciphering The Challenges And Prospects -- A Survey Urlana Ashok, Kumar Charaka Vinayak, Singh Ajeet Kumar, Garlapati Bala Mallikarjunarao, Chalamala Srinivasa Rao, Mishra Rahul
- PARADISE: Evaluating Implicit Planning Skills Of Language Models With Procedural Warnings And Tips Dataset Uzunoglu Arda, Safa Abdalfatah Rashid, Şahin Gözde Gül
- Natural Language Programming In Medicine: Administering Evidence Based Clinical Workflows With Autonomous Agents Powered By Generative Large Language Models Vaid Akhil, Lampert Joshua, Lee Juhee, Sawant Ashwin, Apakama Donald, Sakhuja Ankit, Soroush Ali, Bick Sarah, Abbott Ethan, Gomez Hernando, Hadley Michael, Lee Denise, Landi Isotta, Duong Son Q, Bussola Nicole, Nabeel Ismail, Muehlstedt Silke, Freeman Robert, Kovatch Patricia, Carr Brendan, Wang Fei, Glicksberg Benjamin, Argulian Edgar, Lerakis Stamatios, Khera Rohan, Reich David L., Kraft Monica, Charney Alexander, Nadkarni Girish
- Accelerating Large Language Model Inference With Self-supervised Early Exits Valade Florian
- Investigating The Performance Of Language Models For Completing Code In Functional Programming Languages: A Haskell Case Study Van Dam Tim, Van Der Heijden Frank, De Bekker Philippe, Nieuwschepen Berend, Otten Marc, Izadi Maliheh
- Investigating And Addressing Hallucinations Of Llms In Tasks Involving Negation Varshney Neeraj, Raj Satyam, Mishra Venkatesh, Chatterjee Agneet, Sarkar Ritika, Saeidi Amir, Baral Chitta
- A Survey Of Prompt Engineering Methods In Large Language Models For Different NLP Tasks Vatsal Shubham, Dubey Harsh
- Enhancing Model Performance: Another Approach To Vision-language Instruction Tuning Vedanshu, Tripathi Mm, Jaint Bhavnesh
- Question Suggestion For Conversational Shopping Assistants Using Product Metadata Vedula Nikhita, Rokhlenko Oleg, Malmasi Shervin
- On The Brittle Foundations Of React Prompting For Agentic Large Language Models Verma Mudit, Bhambri Siddhant, Kambhampati Subbarao
- Auditing Counterfire: Evaluating Advanced Counterargument Generation With Evidence And Style Verma Preetika, Jaidka Kokil, Churina Svetlana
- RAG Based Question-answering For Contextual Response Prediction System Veturi Sriram, Vaichal Saurabh, Jagadheesh Reshma Lal, Tripto Nafis Irtiza, Yan Nian
- Exploring LGBTQ+ Bias In Generative AI Answers Across Different Country And Religious Contexts Vicsek Lilla, Vancsó Anna, Zajko Mike, Takacs Judit
- How Much Data Is Enough Data? Fine-tuning Large Language Models For In-house Translation: Performance Evaluation Across Multiple Dataset Sizes Vieira Inacio, Allred Will, Lankford Séamus, Castilho Sheila, Way Andy
- Input Conditioned Graph Generation For Language Agents Vierling Lukas, Fu Jie, Chen Kai
- Exploring Safety-utility Trade-offs In Personalized Language Models Vijjini Anvesh Rao, Chowdhury Somnath Basu Roy, Chaturvedi Snigdha
- Emotional Manipulation Through Prompt Engineering Amplifies Disinformation Generation In AI Large Language Models Vinay Rasita, Spitale Giovanni, Biller-andorno Nikola, Germani Federico
- Redwhale: An Adapted Korean LLM Through Efficient Continual Pretraining Vo Anh-dung, Jung Minseong, Lee Wonbeen, Choi Daewoo
- Vi-mistral-x: Building A Vietnamese Language Model With Advanced Continual Pre-training Vo James
- Instruction Following With Goal-conditioned Reinforcement Learning In Virtual Environments Volovikova Zoya, Skrynnik Alexey, Kuderov Petr, Panov Aleksandr I.
- Foundational Autoraters: Taming Large Language Models For Better Automatic Evaluation Vu Tu, Krishna Kalpesh, Alzubi Salaheddin, Tar Chris, Faruqui Manaal, Sung Yun-hsuan
- Generative Large Language Models In Automated Fact-checking: A Survey Vykopal Ivan, Pikuliak Matúš, Ostermann Simon, Šimko Marián
- From Rags To Rich Parameters: Probing How Language Models Utilize External Knowledge Over Parametric Information For Factual Queries Wadhwa Hitesh, Seetharaman Rahul, Aggarwal Somyaa, Ghosh Reshmi, Basu Samyadeep, Srinivasan Soundararajan, Zhao Wenlong, Chaudhari Shreyas, Aghazadeh Ehsan
- An Empirical Study Of Mamba-based Language Models Waleffe Roger, Byeon Wonmin, Riach Duncan, Norick Brandon, Korthikanti Vijay, Dao Tri, Gu Albert, Hatamizadeh Ali, Singh Sudhakar, Narayanan Deepak, Kulshreshtha Garvit, Singh Vartika, Casper Jared, Kautz Jan, Shoeybi Mohammad, Catanzaro Bryan
- The Instruction Hierarchy: Training Llms To Prioritize Privileged Instructions Wallace Eric, Xiao Kai, Leike Reimar, Weng Lilian, Heidecke Johannes, Beutel Alex
- Temporal Blind Spots In Large Language Models Wallat Jonas, Jatowt Adam, Anand Avishek
- Fusechat: Knowledge Fusion Of Chat Models Wan Fanqi, Zhong Longguang, Yang Ziyi, Chen Ruijun, Quan Xiaojun
- Dynamic Self-consistency: Leveraging Reasoning Paths For Efficient LLM Sampling Wan Guangya, Wu Yuqi, Chen Jie, Li Sheng
- Datavist5: A Pre-trained Language Model For Jointly Understanding Text And Data Visualization Wan Zhuoyue, Song Yuanfeng, Li Shuaimin, Zhang Chen Jason, Wong Raymond Chi-wing
- COSMO: Contrastive Streamlined Multimodal Model With Interleaved Pre-training Wang Alex Jinpeng, Li Linjie, Lin Kevin Qinghong, Wang Jianfeng, Lin Kevin, Yang Zhengyuan, Wang Lijuan, Shou Mike Zheng
- Pargo: Bridging Vision-language With Partial And Global Views Wang An-lan, Shan Bin, Shi Wei, Lin Kun-yu, Fei Xiang, Tang Guozhi, Liao Lei, Tang Jingqun, Huang Can, Zheng Wei-shi
- Fine-grained Self-endorsement Improves Factuality And Reasoning Wang Ante, Song Linfeng, Peng Baolin, Tian Ye, Jin Lifeng, Mi Haitao, Su Jinsong, Yu Dong
- GOLF: Goal-oriented Long-term Life Tasks Supported By Human-ai Collaboration Wang Ben
- Task Supportive And Personalized Human-large Language Model Interaction: A User Study Wang Ben, Liu Jiqun, Karimnazarov Jamshed, Thompson Nicolas
- Resilience Of Large Language Models For Noisy Instructions Wang Bin, Wei Chengwei, Liu Zhengyuan, Lin Geyu, Chen Nancy F.
- ANGO: A Next-level Evaluation Benchmark For Generation-oriented Language Models In Chinese Domain Wang Bingchao
- Halu-j: Critique-based Hallucination Judge Wang Binjie, Chern Steffi, Chern Ethan, Liu Pengfei
- Direct: Diagnostic Reasoning For Clinical Notes Via Large Language Models Wang Bowen, Chang Jiuyang, Qian Yiming, Chen Guoxin, Chen Junhao, Jiang Zhouqiang, Zhang Jiahao, Nakashima Yuta, Nagahara Hajime
- BLSP-KD: Bootstrapping Language-speech Pre-training Via Knowledge Distillation Wang Chen, Liao Minpeng, Huang Zhongqiang, Zhang Jiajun
- Hybrid Alignment Training For Large Language Models Wang Chenglong, Zhou Hang, Chang Kaiyan, Li Bei, Mu Yongyu, Xiao Tong, Liu Tongran, Zhu Jingbo
- A Survey For Large Language Models In Biomedicine Wang Chong, Li Mengyao, He Junjun, Wang Zhongruo, Darzi Erfan, Chen Zan, Ye Jin, Li Tianbin, Su Yanzhou, Ke Jing, Qu Kaili, Li Shuxin, Yu Yi, Liò Pietro, Wang Tianyun, Wang Yu Guang, Shen Yiqing
- Ada-leval: Evaluating Long-context Llms With Length-adaptable Benchmarks Wang Chonghua, Duan Haodong, Zhang Songyang, Lin Dahua, Chen Kai
- Towards Efficient And Effective Unlearning Of Large Language Models For Recommendation Wang Hangyu, Lin Jianghao, Chen Bo, Yang Yang, Tang Ruiming, Zhang Weinan, Yu Yong
- Adapting Open-source Large Language Models For Cost-effective, Expert-level Clinical Note Generation With On-policy Reinforcement Learning Wang Hanyin, Gao Chufan, Liu Bolun, Xu Qiping, Hussein Guleid, Labban Mohamad El, Iheasirim Kingsley, Korsapati Hariprasad, Outcalt Chuck, Sun Jimeng
- Reinforcement Learning For Edit-based Non-autoregressive Neural Machine Translation Wang Hao, Morimura Tetsuro, Honda Ukyo, Kawahara Daisuke
- Interpretable Preferences Via Multi-objective Reward Modeling And Mixture-of-experts Wang Haoxiang, Xiong Wei, Xie Tengyang, Zhao Han, Zhang Tong
- Blendfilter: Advancing Retrieval-augmented Large Language Models Via Query Generation Blending And Knowledge Filtering Wang Haoyu, Li Ruirui, Jiang Haoming, Tian Jinjin, Wang Zhengyang, Luo Chen, Tang Xianfeng, Cheng Monica, Zhao Tuo, Gao Jing
- Probing The Safety Response Boundary Of Large Language Models Via Unsafe Decoding Path Generation Wang Haoyu, Wu Bingzhe, Bian Yatao, Chang Yongzhe, Wang Xueqian, Zhao Peilin
- Techgpt-2.0: A Large Language Model Project To Solve The Task Of Knowledge Graph Construction Wang Jiaqi, Chang Yuying, Li Zhong, An Ning, Ma Qi, Hei Lei, Luo Haibo, Lu Yifei, Ren Feiliang
- Tarsier: Recipes For Training And Evaluating Large Video Description Models Wang Jiawei, Yuan Liping, Zhang Yuchen
- A User-centric Benchmark For Evaluating Large Language Models Wang Jiayin, Mo Fengran, Ma Weizhi, Sun Peijie, Zhang Min, Nie Jian-yun
- Is A Picture Worth A Thousand Words? Delving Into Spatial Reasoning For Vision Language Models Wang Jiayu, Ming Yifei, Shi Zhenmei, Vineet Vibhav, Wang Xin, Joshi Neel
- Length Generalization Of Causal Transformers Without Position Encoding Wang Jie, Ji Tao, Wu Yuanbin, Yan Hang, Gui Tao, Zhang Qi, Huang Xuanjing, Wang Xiaoling
- Chinese Metaphor Recognition Using A Multi-stage Prompting Large Language Model Wang Jie, Wang Jin, Zhang Xuejie
- Helpful Or Harmful Data? Fine-tuning-free Shapley Attribution For Explaining Language Model Predictions Wang Jingtan, Lin Xiaoqiang, Qiao Rui, Foo Chuan-sheng, Low Bryan Kian Hsiang
- Mambabyte: Token-free Selective State Space Model Wang Junxiong, Gangavarapu Tushaar, Yan Jing Nathan, Rush Alexander M.
- Balancing Rigor And Utility: Mitigating Cognitive Biases In Large Language Models For Multiple-choice Questions Wang Liman, Zhong Hanyang, Cao Wenting, Sun Zeyuan
- Selective Forgetting: Advancing Machine Unlearning Techniques And Evaluation In Language Models Wang Lingzhi, Zeng Xingshan, Guo Jinsong, Wong Kam-fai, Gottlob Georg
- Interactive Dualchecker For Mitigating Hallucinations In Distilling Large Language Models Wang Meiyun, Suzuki Masahiro, Sakaji Hiroki, Izumi Kiyoshi
- Exploring The Potential Of Multimodal LLM With Knowledge-intensive Multimodal ASR Wang Minghan, Wang Yuxia, Vu Thuy-trang, Shareghi Ehsan, Haffari Gholamreza
- Leave No Document Behind: Benchmarking Long-context Llms With Extended Multi-doc QA Wang Minzheng, Chen Longze, Fu Cheng, Liao Shengyi, Zhang Xinghua, Wu Bingli, Yu Haiyang, Xu Nan, Zhang Lei, Luo Run, Li Yunshui, Yang Min, Huang Fei, Li Yongbin
- Effective Demonstration Annotation For In-context Learning Via Language Model-based Determinantal Point Process Wang Peng, Wang Xiaobin, Lou Chao, Mao Shengyu, Xie Pengjun, Jiang Yong
- Inferaligner: Inference-time Alignment For Harmlessness Through Cross-model Guidance Wang Pengyu, Zhang Dong, Li Linyang, Tan Chenkun, Wang Xinghao, Ren Ke, Jiang Botian, Qiu Xipeng
- Automated Generation Of Multiple-choice Cloze Questions For Assessing English Vocabulary Using Gpt-turbo 3.5 Wang Qiao, Rose Ralph, Orita Naho, Sugawara Ayaka
- Unlearning With Control: Assessing Real-world Utility For Large Language Model Unlearning Wang Qizhou, Han Bo, Yang Puning, Zhu Jianing, Liu Tongliang, Sugiyama Masashi
- Prefclm: Enhancing Preference-based Reinforcement Learning With Crowdsourced Large Language Models Wang Ruiqi, Zhao Dezhong, Yuan Ziqin, Obi Ike, Min Byung-cheol
- What Is The Best Way For Chatgpt To Translate Poetry? Wang Shanshan, Wong Derek F., Yao Jingming, Chao Lidia S.
- Lora Meets Dropout Under A Unified Framework Wang Sheng, Chen Liheng, Jiang Jiyue, Xue Boyang, Kong Lingpeng, Wu Chuan
- Offline RLHF Methods Need More Accurate Supervision Signals Wang Shiqi, Zhang Zhengze, Zhao Rui, Tan Fei, Nguyen Cam Tu
- USCD: Improving Code Generation Of Llms By Uncertainty-aware Selective Contrastive Decoding Wang Shuai, Ding Liang, Shen Li, Luo Yong, He Zheng, Yu Wei, Tao Dacheng
- Symbolic Working Memory Enhances Language Models For Complex Rule Application Wang Siyuan, Wei Zhongyu, Choi Yejin, Ren Xiang
- Self-taught Evaluators Wang Tianlu, Kulikov Ilia, Golovneva Olga, Yu Ping, Yuan Weizhe, Dwivedi-yu Jane, Pang Richard Yuanzhe, Fazel-zarandi Maryam, Weston Jason, Li Xian
- Weaver: Foundation Models For Creative Writing Wang Tiannan, Chen Jiamin, Jia Qingrui, Wang Shuai, Fang Ruoyu, Wang Huilin, Gao Zhaowei, Xie Chunzhao, Xu Chuou, Dai Jihong, Liu Yibin, Wu Jialong, Ding Shengwei, Li Long, Huang Zhiwei, Deng Xinle, Yu Teng, Ma Gangan, Xiao Han, Chen Zixin, Xiang Danjun, Wang Yunxia, Zhu Yuanyuan, Xiao Yi, Wang Jing, Wang Yiru, Ding Siran, Huang Jiayang, Xu Jiayi, Tayier Yilihamu, Hu Zhenyu, Gao Yuan, Zheng Chengfeng, Ye Yueshu, Li Yihang, Wan Lei, Jiang Xinyue, Wang Yujie, Cheng Siyu, Song Zhule, Tang Xiangru, Xu Xiaohua, Zhang Ningyu, Chen Huajun, Jiang Yuchen Eleanor, Zhou Wangchunshu
- QCRD: Quality-guided Contrastive Rationale Distillation For Large Language Models Wang Wei, Li Zhaowei, Xu Qi, Cai Yiqing, Song Hang, Qi Qi, Zhou Ran, Huang Zhida, Wang Tao, Xiao Li
- Rocks Coding, Not Development--a Human-centric, Experimental Evaluation Of Llm-supported SE Tasks Wang Wei, Ning Huilong, Zhang Gaowei, Liu Libo, Wang Yi
- Learning To Ask: When Llms Meet Unclear Instruction Wang Wenxuan, Shi Juluan, Wang Chaozheng, Lee Cheryl, Yuan Youliang, Huang Jen-tse, Lyu Michael R.
- FSM: A Finite State Machine Based Zero-shot Prompting Paradigm For Multi-hop Question Answering Wang Xiaochen, He Junqing, Yang Zhe, Wang Yiru, Meng Xiangdi, Pan Kunhao, Sui Zhifang
- Metatool: Facilitating Large Language Models To Master Tools With Meta-task Augmentation Wang Xiaohan, Li Dian, Zhao Yilin, Sinbadliu, Wang Hui
- Searching For Best Practices In Retrieval-augmented Generation Wang Xiaohua, Wang Zhenghua, Gao Xuan, Zhang Feiran, Wu Yixin, Xu Zhibo, Shi Tianyuan, Wang Zhengyuan, Li Shizheng, Qian Qi, Yin Ruicheng, Lv Changze, Zheng Xiaoqing, Huang Xuanjing
- Longllava: Scaling Multi-modal Llms To 1000 Images Efficiently Via Hybrid Architecture Wang Xidong, Song Dingjie, Chen Shunian, Zhang Chen, Wang Benyou
- Rdrec: Rationale Distillation For Llm-based Recommendation Wang Xinfeng, Cui Jin, Suzuki Yoshimi, Fukumoto Fumiyo
- Executable Code Actions Elicit Better LLM Agents Wang Xingyao, Chen Yangyi, Yuan Lifan, Zhang Yizhe, Li Yunzhu, Peng Hao, Ji Heng
- Negativeprompt: Leveraging Psychology For Large Language Models Enhancement Via Negative Emotional Stimuli Wang Xu, Li Cheng, Chang Yi, Wang Jindong, Wu Yuan
- Unveiling Factual Recall Behaviors Of Large Language Models Through Knowledge Neurons Wang Yifei, Chen Yuheng, Wen Wanting, Sheng Yu, Li Linjing, Zeng Daniel Dajun
- A Theoretical Understanding Of Self-correction Through In-context Alignment Wang Yifei, Wu Yuyang, Wei Zeming, Jegelka Stefanie, Wang Yisen
- Defending Llms Against Jailbreaking Attacks Via Backtranslation Wang Yihan, Shi Zhouxing, Bai Andrew, Hsieh Cho-jui
- Uncertainty Aware Learning For Language Model Alignment Wang Yikun, Zheng Rui, Ding Liang, Zhang Qi, Lin Dahua, Tao Dacheng
- Adaptvision: Dynamic Input Scaling In Mllms For Versatile Scene Understanding Wang Yonghui, Zhou Wengang, Feng Hao, Li Houqiang
- HPT++: Hierarchically Prompting Vision-language Models With Multi-granularity Knowledge Generation And Improved Structure Modeling Wang Yubin, Jiang Xinyang, Cheng De, Sun Wenli, Li Dongsheng, Zhao Cairong
- Maferw: Query Rewriting With Multi-aspect Feedbacks For Retrieval-augmented Large Language Models Wang Yujing, Zhang Hainan, Pang Liang, Zheng Hongwei, Zheng Zhiming
- Sibyl: Simple Yet Effective Agent Framework For Complex Real-world Reasoning Wang Yulong, Shen Tianhao, Liu Lifeng, Xie Jian
- Factuality Of Large Language Models In The Year 2024 Wang Yuxia, Wang Minghan, Manzoor Muhammad Arslan, Liu Fei, Georgiev Georgi, Das Rocktim Jyoti, Nakov Preslav
- Videollamb: Long-context Video Understanding With Recurrent Memory Bridges Wang Yuxuan, Xie Cihang, Liu Yang, Zheng Zilong
- Absinstruct: Eliciting Abstraction Ability From Llms Through Explanation Tuning With Plausibility Estimation Wang Zhaowei, Fan Wei, Zong Qing, Zhang Hongming, Choi Sehyun, Fang Tianqing, Liu Xin, Song Yangqiu, Wong Ginny Y., See Simon
- M-RAG: Reinforcing Large Language Model Performance Through Retrieval-augmented Generation With Multiple Partitions Wang Zheng, Teo Shu Xian, Ouyang Jieer, Xu Yongjun, Shi Wei
- A Comprehensive Survey Of LLM Alignment Techniques: RLHF, RLAIF, PPO, DPO And More Wang Zhichao, Bi Bin, Pentyala Shiva Kumar, Ramnath Kiran, Chaudhuri Sougata, Mehrotra Shubham, Zhu Zixu (James), Mao Xiang-bo, Asur Sitaram, Cheng Na (Claire)
- Promptcharm: Text-to-image Generation Through Multi-modal Prompting And Refinement Wang Zhijie, Huang Yuheng, Song Da, Ma Lei, Zhang Tianyi
- Helpsteer2: Open-source Dataset For Training Top-performing Reward Models Wang Zhilin, Dong Yi, Delalleau Olivier, Zeng Jiaqi, Shen Gerald, Egert Daniel, Zhang Jimmy J., Sreedhar Makesh Narsimhan, Kuchaiev Oleksii
- Chatlogic: Integrating Logic Programming With Large Language Models For Multi-step Reasoning Wang Zhongsheng, Liu Jiamou, Bao Qiming, Rong Hongfei, Zhang Jingfeng
- Omnijarvis: Unified Vision-language-action Tokenization Enables Open-world Instruction Following Agents Wang Zihao, Cai Shaofei, Mu Zhancun, Lin Haowei, Zhang Ceyao, Liu Xuejie, Li Qing, Liu Anji, Ma Xiaojian, Liang Yitao
- Wordflow: Social Prompt Engineering For Large Language Models Wang Zijie J., Chakravarthy Aishwarya, Munechika David, Chau Duen Horng
- Farsight: Fostering Responsible AI Awareness During AI Application Prototyping Wang Zijie J., Kulkarni Chinmay, Wilcox Lauren, Terry Michael, Madaio Michael
- Eliminating Position Bias Of Language Models: A Mechanistic Approach Wang Ziqi, Zhang Hanlin, Li Xiner, Huang Kuan-hao, Han Chi, Ji Shuiwang, Kakade Sham M., Peng Hao, Ji Heng
- Re2llm: Reflective Reinforcement Large Language Model For Session-based Recommendation Wang Ziyan, Du Yingpeng, Sun Zhu, Chua Haoyan, Feng Kaidong, Wang Wenya, Zhang Jie
- Flora: Federated Fine-tuning Large Language Models With Heterogeneous Low-rank Adaptations Wang Ziyao, Shen Zheyu, He Yexiao, Sun Guoheng, Wang Hongyi, Lyu Lingjuan, Li Ang
- Is There No Such Thing As A Bad Question? H4R: Hallucibot For Ratiocination, Rewriting, Ranking, And Routing Watson William, Cho Nicole, Srishankar Nishan
- PARIKSHA: A Large-scale Investigation Of Human-llm Evaluator Agreement On Multilingual And Multi-cultural Data Watts Ishaan, Gumma Varun, Yadavalli Aditya, Seshadri Vivek, Swaminathan Manohar, Sitaram Sunayana
- Hidden In Plain Sight: Exploring Chat History Tampering In Interactive Language Models Wei Cheng'an, Zhao Yue, Gong Yujia, Chen Kai, Xiang Lu, Zhu Shenchen
- Rethinking Generative Large Language Model Evaluation For Semantic Comprehension Wei Fangyun, Chen Xi, Luo Lin
- Systematic Evaluation Of Llm-as-a-judge In LLM Alignment Tasks: Explainable Metrics And Diverse Prompt Templates Wei Hui, He Shenghua, Xia Tian, Wong Andy, Lin Jingyang, Han Mei
- Long-form Factuality In Large Language Models Wei Jerry, Yang Chengrun, Song Xinying, Lu Yifeng, Hu Nathan, Huang Jie, Tran Dustin, Peng Daiyi, Liu Ruibo, Huang Da, Du Cosmo, Le Quoc V.
- Getting Inspiration For Feature Elicitation: App Store- Vs. Llm-based Approach Wei Jialiang, Courbis Anne-lise, Lambolais Thomas, Xu Binbin, Bernard Pierre Louis, Dray Gérard, Maalej Walid
- Chain-of-specificity: An Iteratively Refining Method For Eliciting Knowledge From Large Language Models Wei Kaiwen, Zhang Jingyuan, Zhang Hongzhi, Zhang Fuzheng, Zhang Di, Jin Li, Yu Yue
- Performance Evaluation Of Lightweight Open-source Large Language Models In Pediatric Consultations: A Comparative Analysis Wei Qiuhong, Cui Ying, Ding Mengwei, Wang Yanqin, Xiang Lingling, Yao Zhengxiong, Chen Ceran, Long Ying, Jin Zhezhen, Xu Ximing
- Promptmm: Multi-modal Knowledge Distillation For Recommendation With Prompt-tuning Wei Wei, Tang Jiabin, Jiang Yangqin, Xia Lianghao, Huang Chao
- AC-EVAL: Evaluating Ancient Chinese Language Understanding In Large Language Models Wei Yuting, Xu Yuanxing, Wei Xinru, Yang Simin, Zhu Yangfu, Li Yuqing, Liu Di, Wu Bin
- Leveraging Large Language Models For Patient Engagement: The Power Of Conversational AI In Digital Health Wen Bo, Norel Raquel, Liu Julia, Stappenbeck Thaddeus, Zulkernine Farhana, Chen Huamin
- On-policy Fine-grained Knowledge Feedback For Hallucination Mitigation Wen Xueru, Lu Xinyu, Guan Xinyan, Lu Yaojie, Lin Hongyu, He Ben, Han Xianpei, Sun Le
- Grounding Data Science Code Generation With Input-output Specifications Wen Yeming, Yin Pengcheng, Shi Kensen, Michalewski Henryk, Chaudhuri Swarat, Polozov Alex
- How Chinese Are Chinese Language Models? The Puzzling Lack Of Language Policy In China's Llms Wen-yi Andrea W., Jo Unso Eun Seo, Lin Lu Jia, Mimno David
- Longvlm: Efficient Long Video Understanding Via Large Language Models Weng Yuetian, Han Mingfei, He Haoyu, Chang Xiaojun, Zhuang Bohan
- Aligning Large Language Models For Controllable Recommendations Wensheng Lu, Jianxun Lian, Wei Zhang, Guanghua Li, Mingyang Zhou, Hao Liao, Xing Xie
- Detecting And Mitigating Hallucination In Large Vision Language Models Via Fine-grained AI Feedback Wenyi Xiao, Ziwei Huang, Leilei Gan, Wanggui He, Haoyuan Li, Zhelun Yu, Hao Jiang, Fei Wu, Linchao Zhu
- Codeultrafeedback: An Llm-as-a-judge Dataset For Aligning Large Language Models To Coding Preferences Weyssow Martin, Kamanda Aton, Sahraoui Houari
- Answer, Assemble, Ace: Understanding How Transformers Answer Multiple Choice Questions Wiegreffe Sarah, Tafjord Oyvind, Belinkov Yonatan, Hajishirzi Hannaneh, Sabharwal Ashish
- CBR-RAG: Case-based Reasoning For Retrieval Augmented Generation In Llms For Legal Question Answering Wiratunga Nirmalie, Abeyratne Ramitha, Jayawardena Lasal, Martin Kyle, Massie Stewart, Nkisi-orji Ikechukwu, Weerasinghe Ruvan, Liret Anne, Fleisch Bruno
- Effects Of A Prompt Engineering Intervention On Undergraduate Students' AI Self-efficacy, AI Knowledge And Prompt Engineering Ability: A Mixed Methods Study Woo David James, Wang Deliang, Yung Tim, Guo Kai
- A Surprising Failure? Multimodal Llms And The NLVR Challenge Wu Anne, Brantley Kianté, Artzi Yoav
- Curriculum Learning With Quality-driven Data Selection Wu Biao, Meng Fang, Chen Ling
- Towards Evaluating And Building Versatile Large Language Models For Medicine Wu Chaoyi, Qiu Pengcheng, Liu Jinxin, Gu Hongfei, Li Na, Zhang Ya, Wang Yanfeng, Xie Weidi
- Llama Pro: Progressive Llama With Block Expansion Wu Chengyue, Gan Yukang, Ge Yixiao, Lu Zeyu, Wang Jiahao, Feng Ye, Shan Ying, Luo Ping
- Performance Law Of Large Language Models Wu Chuhan, Tang Ruiming
- Core: Context-regularized Text Embedding Learning For Text-to-image Personalization Wu Feize, Pang Yun, Zhang Junyi, Pang Lianyu, Yin Jian, Zhao Baoquan, Li Qing, Mao Xudong
- Guiding Clinical Reasoning With Large Language Models Via Knowledge Seeds Wu Jiageng, Wu Xian, Yang Jie
- Medkp: Medical Dialogue With Knowledge Enhancement And Clinical Pathway Encoding Wu Jiageng, Wu Xian, Zheng Yefeng, Yang Jie
- Gendec: A Robust Generative Question-decomposition Method For Multi-hop Reasoning Wu Jian, Yang Linyi, Ji Yuliang, Huang Wenhao, Karlsson Börje F., Okumura Manabu
- Visionllm V2: An End-to-end Generalist Multimodal Large Language Model For Hundreds Of Vision-language Tasks Wu Jiannan, Zhong Muyan, Xing Sen, Lai Zeqiang, Liu Zhaoyang, Wang Wenhai, Chen Zhe, Zhu Xizhou, Lu Lewei, Lu Tong, Luo Ping, Qiao Yu, Dai Jifeng
- Uncertainty Estimation Of Large Language Models In Medical Question Answering Wu Jiaxin, Yu Yizhou, Zhou Hong-yu
- RLPF: Reinforcement Learning From Prediction Feedback For User Summarization With Llms Wu Jiaxing, Ning Lin, Liu Luyang, Lee Harrison, Wu Neo, Wang Chao, Prakash Sushant, O'banion Shawn, Green Bradley, Xie Jun
- What Are The Limits Of Cross-lingual Dense Passage Retrieval For Low-resource Languages? Wu Jie, Ren Zhaochun, Verberne Suzan
- Pandora's Box Or Aladdin's Lamp: A Comprehensive Analysis Revealing The Role Of RAG Noise In Large Language Models Wu Jinyang, Che Feihu, Zhang Chuyuan, Tao Jianhua, Zhang Shuai, Shao Pengpeng
- Can Large Language Models Understand Uncommon Meanings Of Common Words? Wu Jinyang, Che Feihu, Zheng Xinxin, Zhang Shuai, Jin Ruihan, Nie Shuai, Shao Pengpeng, Tao Jianhua
- \(\beta\)-dpo: Direct Preference Optimization With Dynamic \(\beta\) Wu Junkang, Xie Yuexiang, Yang Zhengyi, Wu Jiancan, Gao Jinyang, Ding Bolin, Wang Xiang, He Xiangnan
- Noiseboost: Alleviating Hallucination With Noise Perturbation For Multimodal Large Language Models Wu Kai, Jiang Boyuan, Jiang Zhengkai, He Qingdong, Luo Donghao, Wang Shengzhi, Liu Qingwen, Wang Chengjie
- Mixture-of-skills: Learning To Optimize Data Usage For Fine-tuning Large Language Models Wu Minghao, Vu Thuy-trang, Qu Lizhen, Haffari Gholamreza
- Controlmllm: Training-free Visual Prompt Learning For Multimodal Large Language Models Wu Mingrui, Cai Xinyue, Ji Jiayi, Li Jiale, Huang Oucheng, Luo Gen, Fei Hao, Sun Xiaoshuai, Ji Rongrong
- ITERTL: An Iterative Framework For Fine-tuning Llms For RTL Code Generation Wu Peiyang, Guo Nan, Xiao Xiao, Li Wenming, Ye Xiaochun, Fan Dongrui
- Parallel Decoding Via Hidden Transfer For Lossless Large Language Model Acceleration Wu Pengfei, Liu Jiahao, Gong Zhuocheng, Wang Qifan, Li Jinpeng, Wang Jingang, Cai Xunliang, Zhao Dongyan
- Videollm-mod: Efficient Video-language Streaming With Mixture-of-depths Vision Computation Wu Shiwei, Chen Joya, Lin Kevin Qinghong, Wang Qimeng, Gao Yan, Xu Qianli, Xu Tong, Hu Yao, Chen Enhong, Shou Mike Zheng
- Enhance Reasoning For Large Language Models In The Game Werewolf Wu Shuang, Zhu Liwen, Yang Tao, Xu Shiwei, Fu Qiang, Wei Yang, Fu Haobo
- Sunnie: An Anthropomorphic Llm-based Conversational Agent For Mental Well-being Activity Recommendation Wu Siyi, Han Feixue, Yao Bingsheng, Xie Tianyi, Zhao Xuan, Wang Dakuo
- Unigen: A Unified Framework For Textual Dataset Generation Using Large Language Models Wu Siyuan, Huang Yue, Gao Chujie, Chen Dongping, Zhang Qihui, Wan Yao, Zhou Tianyi, Zhang Xiangliang, Gao Jianfeng, Xiao Chaowei, Sun Lichao
- F-LMM: Grounding Frozen Large Multimodal Models Wu Size, Jin Sheng, Zhang Wenwei, Xu Lumin, Liu Wentao, Li Wei, Loy Chen Change
- Semantic Alignment For Multimodal Large Language Models Wu Tao, Li Mengze, Chen Jingyuan, Ji Wei, Lin Wang, Gao Jinyang, Kuang Kun, Zhao Zhou, Wu Fei
- Meta-rewarding Language Models: Self-improving Alignment With Llm-as-a-meta-judge Wu Tianhao, Yuan Weizhe, Golovneva Olga, Xu Jing, Tian Yuandong, Jiao Jiantao, Weston Jason, Sukhbaatar Sainbayar
- Long Context Alignment With Short Instructions And Synthesized Positions Wu Wenhao, Wang Yizhong, Fu Yao, Yue Xiang, Zhu Dawei, Li Sujian
- Mind's Eye Of Llms: Visualization-of-thought Elicits Spatial Reasoning In Large Language Models Wu Wenshan, Mao Shaoguang, Zhang Yadong, Xia Yan, Dong Li, Cui Lei, Wei Furu
- Tablebench: A Comprehensive And Complex Benchmark For Table Question Answering Wu Xianjie, Yang Jian, Chai Linzheng, Zhang Ge, Liu Jiaheng, Du Xinrun, Liang Di, Shu Daixin, Cheng Xianfu, Sun Tianzhen, Niu Guanglin, Li Tongliang, Li Zhoujun
- Enhancing Q-learning With Large Language Model Heuristics Wu Xiefeng
- Switchcit: Switching For Continual Instruction Tuning Of Large Language Models Wu Xinbo, Hartman Max, Jayaraman Vidhata Arjun, Varshney Lav R.
- Knowledge-infused Legal Wisdom: Navigating LLM Consultation Through The Lens Of Diagnostics And Positive-unlabeled Reinforcement Learning Wu Yang, Wang Chenghao, Gumusel Ece, Liu Xiaozhong
- Mitigating Misleading Chain-of-thought Reasoning With Selective Filtering Wu Yexin, Zhang Zhuosheng, Zhao Hai
- MLDT: Multi-level Decomposition For Complex Long-horizon Robotic Task Planning With Open-source Large Language Model Wu Yike, Zhang Jiatao, Hu Nan, Tang Lanling, Qi Guilin, Shao Jun, Ren Jie, Song Wei
- Acquiring Clean Language Models From Backdoor Poisoned Datasets By Downscaling Frequency Space Wu Zongru, Zhang Zhuosheng, Cheng Pengzhou, Liu Gongshen
- Imitating Language Via Scalable Inverse Reinforcement Learning Wulfmeier Markus, Bloesch Michael, Vieillard Nino, Ahuja Arun, Bornschein Jorg, Huang Sandy, Sokolov Artem, Barnes Matt, Desjardins Guillaume, Bewley Alex, Bechtle Sarah Maria Elisabeth, Springenberg Jost Tobias, Momchev Nikola, Bachem Olivier, Geist Matthieu, Riedmiller Martin
- In-context Learning Can Re-learn Forbidden Tasks Xhonneux Sophie, Dobre David, Tang Jian, Gidel Gauthier, Sridhar Dhanya
- Memocrs: Memory-enhanced Sequential Conversational Recommender Systems With Large Language Models Xi Yunjia, Liu Weiwen, Lin Jianghao, Chen Bo, Tang Ruiming, Zhang Weinan, Yu Yong
- Play To Your Strengths: Collaborative Intelligence Of Conventional Recommender Models And Large Language Models Xi Yunjia, Liu Weiwen, Lin Jianghao, Wu Chuhan, Chen Bo, Tang Ruiming, Zhang Weinan, Yu Yong
- FOFO: A Benchmark To Evaluate Llms' Format-following Capability Xia Congying, Xing Chen, Du Jiangshu, Yang Xinyi, Feng Yihao, Xu Ran, Yin Wenpeng, Xiong Caiming
- Inverse-q*: Token Level Reinforcement Learning For Aligning Large Language Models Without Preference Data Xia Han, Gao Songyang, Ge Qiming, Xi Zhiheng, Zhang Qi, Huang Xuanjing
- FP6-LLM: Efficiently Serving Large Language Models Through Fp6-centric Algorithm-system Co-design Xia Haojun, Zheng Zhen, Wu Xiaoxia, Chen Shiyang, Yao Zhewei, Youn Stephen, Bakhtiari Arash, Wyatt Michael, Zhuang Donglin, Zhou Zhongzhu, Ruwase Olatunji, He Yuxiong, Song Shuaiwen Leon
- LESS: Selecting Influential Data For Targeted Instruction Tuning Xia Mengzhou, Malladi Sadhika, Gururangan Suchin, Arora Sanjeev, Chen Danqi
- Language Models Can Evaluate Themselves Via Probability Discrepancy Xia Tingyu, Yu Bowen, Wu Yuan, Chang Yi, Zhou Chang
- Hallucination Diversity-aware Active Learning For Text Summarization Xia Yu, Liu Xu, Yu Tong, Kim Sungchul, Rossi Ryan A., Rao Anup, Mai Tung, Li Shuai
- Improving Retrieval Augmented Language Model With Self-reasoning Xia Yuan, Zhou Jingbo, Shi Zhenhui, Chen Jun, Huang Haifeng
- Pandora: Towards General World Model With Natural Language Actions And Video States Xiang Jiannan, Liu Guangyi, Gu Yi, Gao Qiyue, Ning Yuting, Zha Yuheng, Feng Zeyu, Tao Tianhua, Hao Shibo, Shi Yemin, Liu Zhengzhong, Xing Eric P., Hu Zhiting
- Badchain: Backdoor Chain-of-thought Prompting For Large Language Models Xiang Zhen, Jiang Fengqing, Xiong Zidi, Ramasubramanian Bhaskar, Poovendran Radha, Li Bo
- Clover-2: Accurate Inference For Regressive Lightweight Speculative Decoding Xiao Bin, Gui Lujun, Su Lei, Chen Weipeng
- Infllm: Training-free Long-context Extrapolation For Llms With An Efficient Context Memory Xiao Chaojun, Zhang Pengle, Han Xu, Xiao Guangxuan, Lin Yankai, Zhang Zhengyan, Liu Zhiyuan, Sun Maosong
- Rar-b: Reasoning As Retrieval Benchmark Xiao Chenghao, Hudson G. Thomas, Moubayed Noura Al
- EMP: Enhance Memory In Data Pruning Xiao Jinying, Li Ping, Nie Jie, Tang Zhe
- Healme: Harnessing Cognitive Reframing In Large Language Models For Psychotherapy Xiao Mengxi, Xie Qianqian, Kuang Ziyan, Liu Zhicheng, Yang Kailai, Peng Min, Han Weiguang, Huang Jimin
- Exploring How Multiple Levels Of Gpt-generated Programming Hints Support Or Disappoint Novices Xiao Ruiwei, Hou Xinying, Stamper John
- Leveraging Human Revisions For Improving Text-to-layout Models Xie Amber, Cheng Chin-yi, Huang Forrest, Li Yang
- Emovit: Revolutionizing Emotion Insights With Visual Instruction Tuning Xie Hongxia, Peng Chu-jun, Tseng Yu-wen, Chen Hung-jen, Hsu Chan-feng, Shuai Hong-han, Cheng Wen-huang
- Show-o: One Single Transformer To Unify Multimodal Understanding And Generation Xie Jinheng, Mao Weijia, Bai Zechen, Zhang David Junhao, Wang Weihao, Lin Kevin Qinghong, Gu Yuchao, Chen Zhijie, Yang Zhenheng, Shou Mike Zheng
- Exploratory Preference Optimization: Harnessing Implicit Q*-approximation For Sample-efficient RLHF Xie Tengyang, Foster Dylan J., Krishnamurthy Akshay, Rosset Corby, Awadallah Ahmed, Rakhlin Alexander
- Sorry-bench: Systematically Evaluating Large Language Model Safety Refusal Behaviors Xie Tinghao, Qi Xiangyu, Zeng Yi, Huang Yangsibo, Sehwag Udari Madhushani, Huang Kaixuan, He Luxi, Wei Boyi, Li Dacheng, Sheng Ying, Jia Ruoxi, Li Bo, Li Kai, Chen Danqi, Henderson Peter, Mittal Prateek
- Decomposition For Enhancing Attention: Improving Llm-based Text-to-sql Through Workflow Paradigm Xie Yuanzhen, Jin Xinzhou, Xie Tao, Lin Mingxiong, Chen Liang, Yu Chenyun, Cheng Lei, Zhuo Chengxiang, Hu Bo, Li Zang
- Monte Carlo Tree Search Boosts Reasoning Via Iterative Preference Learning Xie Yuxi, Goyal Anirudh, Zheng Wenyue, Kan Min-yen, Lillicrap Timothy P., Kawaguchi Kenji, Shieh Michael
- Order Matters In Hallucination: Reasoning Order As Benchmark And Reflexive Prompting For Large-language-models Xie Zikai
- Understanding The Weakness Of Large Language Model Agents Within A Complex Android Environment Xing Mingzhe, Zhang Rongkai, Xue Hui, Chen Qi, Yang Fan, Xiao Zhen
- Llm-enhanced User-item Interactions: Leveraging Edge Information For Optimized Recommendations Xinyuan Wang, Liang Wu, Liangjie Hong, Hao Liu, Yanjie Fu
- Interactive-kbqa: Multi-turn Interactions For Knowledge Base Question Answering With Large Language Models Xiong Guanming, Bao Junwei, Zhao Wen
- Meaningful Learning: Advancing Abstract Reasoning In Large Language Models Via Generic Fact Guidance Xiong Kai, Ding Xiao, Liu Ting, Qin Bing, Xu Dongliang, Yang Qing, Liu Hongtao, Cao Yixin
- Building Math Agents With Multi-turn Iterative Preference Learning Xiong Wei, Shi Chengshuai, Shen Jiaming, Rosenberg Aviv, Qin Zhen, Calandriello Daniele, Khalman Misha, Joshi Rishabh, Piot Bilal, Saleh Mohammad, Jin Chi, Zhang Tong, Liu Tianqi
- Watch Every Step! LLM Agent Learning Via Iterative Step-level Process Refinement Xiong Weimin, Song Yifan, Zhao Xiutian, Wu Wenhao, Wang Xun, Wang Ke, Li Cheng, Peng Wei, Li Sujian
- Mixture-of-instructions: Comprehensive Alignment Of A Large Language Model Through The Mixture Of Diverse System Prompting Instructions Xu Bowen, Wu Shaoyu, Liu Kai, Hu Lulu
- Data Efficient Evaluation Of Large Language Models And Text-to-image Models Via Adaptive Sampling Xu Cong, Saranathan Gayathri, Alam Mahammad Parwez, Shah Arpit, Lim James, Wong Soon Yee, Foltin Martin, Bhattacharya Suparna
- Understanding The Role Of Cross-entropy Loss In Fairly Evaluating Large Language Model-based Recommendation Xu Cong, Zhu Zhangchi, Wang Jun, Wang Jianyong, Zhang Wei
- Empowering 1000 Tokens/second On-device LLM Prefilling With Mllm-npu Xu Daliang, Zhang Hao, Yang Liming, Liu Ruiqi, Huang Gang, Xu Mengwei, Liu Xuanzhe
- Aligning Large Language Models Via Fine-grained Supervision Xu Dehong, Qiu Liang, Kim Minseok, Ladhak Faisal, Do Jaeyoung
- Interactive Evolution: A Neural-symbolic Self-training Framework For Large Language Models Xu Fangzhi, Sun Qiushi, Cheng Kanzhi, Liu Jun, Qiao Yu, Wu Zhiyong
- Large Language Models For Education: A Survey Xu Hanyi, Gan Wensheng, Qi Zhenlian, Wu Jiayang, Yu Philip S.
- Rejection Improves Reliability: Training Llms To Refuse Unknown Questions Using RL From Knowledge Feedback Xu Hongshen, Zhu Zichen, Zhang Situo, Ma Da, Fan Shuai, Chen Lu, Yu Kai
- Redagent: Red Teaming Large Language Models With Context-aware Autonomous Language Agent Xu Huiyu, Zhang Wenhui, Wang Zhibo, Xiao Feng, Zheng Rui, Feng Yunhe, Ba Zhongjie, Ren Kui
- On-device Language Models: A Comprehensive Review Xu Jiajun, Li Zhiyuan, Chen Wei, Wang Qun, Gao Xin, Cai Qi, Ling Ziyuan
- A General-purpose Device For Interaction With Llms Xu Jiajun, Wang Qun, Cao Yuhang, Zeng Baitao, Liu Sicheng
- Reverse Image Retrieval Cues Parametric Memory In Multimodal Llms Xu Jialiang, Moor Michael, Leskovec Jure
- Seamless Language Expansion: Enhancing Multilingual Mastery In Self-supervised Models Xu Jing, Wu Minglin, Wu Xixin, Meng Helen
- Tur[k]ingbench: A Challenge Benchmark For Web Agents Xu Kevin, Kordi Yeganeh, Nayak Tanay, Asija Ado, Wang Yizhong, Sanders Kate, Byerly Adam, Zhang Jingyu, Van Durme Benjamin, Khashabi Daniel
- Chatqa 2: Bridging The Gap To Proprietary Llms In Long Context And RAG Capabilities Xu Peng, Ping Wei, Wu Xianchao, Xu Chejian, Liu Zihan, Shoeybi Mohammad, Catanzaro Bryan
- Course-correction: Safety Alignment Using Synthetic Preferences Xu Rongwu, Cai Yishuo, Zhou Zhenhong, Gu Renjie, Weng Haiqin, Liu Yan, Zhang Tianwei, Xu Wei, Qiu Han
- Unveil The Duality Of Retrieval-augmented Generation: Theoretical Analysis And Practical Solution Xu Shicheng, Pang Liang, Shen Huawei, Cheng Xueqi
- Unsupervised Information Refinement Training Of Large Language Models For Retrieval-augmented Generation Xu Shicheng, Pang Liang, Yu Mo, Meng Fandong, Shen Huawei, Cheng Xueqi, Zhou Jie
- Is DPO Superior To PPO For LLM Alignment? A Comprehensive Study Xu Shusheng, Fu Wei, Gao Jiaxuan, Ye Wenjie, Liu Weilin, Mei Zhiyu, Wang Guangju, Yu Chao, Wu Yi
- SA-MDKIF: A Scalable And Adaptable Medical Domain Knowledge Injection Framework For Large Language Models Xu Tianhan, Hu Zhe, Chen Ling, Li Bin
- Sayself: Teaching Llms To Express Confidence With Self-reflective Rationales Xu Tianyang, Wu Shujin, Diao Shizhe, Liu Xiaoze, Wang Xingyao, Chen Yangyi, Gao Jing
- Can Llms Solve Longer Math Word Problems Better? Xu Xin, Xiao Tong, Chao Zitong, Huang Zhenya, Yang Can, Wang Yang
- Relating The Seemingly Unrelated: Principled Understanding Of Generalization For Generative Models In Arithmetic Reasoning Tasks Xu Xingcheng, Zhao Zibo, Zhang Haipeng, Yang Yanqing
- Bridging The Gap Between Different Vocabularies For LLM Ensemble Xu Yangyifan, Lu Jinliang, Zhang Jiajun
- Generate-on-graph: Treat LLM As Both Agent And KG In Incomplete Knowledge Graph Question Answering Xu Yao, He Shizhu, Chen Jiabei, Wang Zihao, Song Yangqiu, Tong Hanghang, Liu Kang, Zhao Jun
- Conveyor: Efficient Tool-aware LLM Serving With Tool Partial Execution Xu Yechen, Kong Xinhao, Chen Tingjun, Zhuo Danyang
- Chatglm-math: Improving Math Problem-solving In Large Language Models With A Self-critique Pipeline Xu Yifan, Liu Xiao, Liu Xinghan, Hou Zhenyu, Li Yueyan, Zhang Xiaohan, Wang Zihan, Zeng Aohan, Du Zhengxiao, Zhao Wenyi, Tang Jie, Dong Yuxiao
- Think: Thinner Key Cache By Query-driven Pruning Xu Yuhui, Jie Zhanming, Dong Hanze, Wang Lei, Lu Xudong, Zhou Aojun, Saha Amrita, Xiong Caiming, Sahoo Doyen
- Face4rag: Factual Consistency Evaluation For Retrieval Augmented Generation In Chinese Xu Yunqi, Cai Tianchi, Jiang Jiyan, Song Xierui
- Magpie: Alignment Data Synthesis From Scratch By Prompting Aligned Llms With Nothing Xu Zhangchen, Jiang Fengqing, Niu Luyao, Deng Yuntian, Poovendran Radha, Choi Yejin, Lin Bill Yuchen
- Safedecoding: Defending Against Jailbreak Attacks Via Safety-aware Decoding Xu Zhangchen, Jiang Fengqing, Niu Luyao, Jia Jinyuan, Lin Bill Yuchen, Poovendran Radha
- Bag Of Tricks: Benchmarking Of Jailbreak Attacks On Llms Xu Zhao, Liu Fan, Liu Hao
- Beyond Perplexity: Multi-dimensional Safety Evaluation Of LLM Compression Xu Zhichao, Gupta Ashim, Li Tao, Bentham Oliver, Srikumar Vivek
- Do Large Language Models Have Compositional Ability? An Investigation Into Limitations And Scalability Xu Zhuoyan, Shi Zhenmei, Liang Yingyu
- Question Calibration And Multi-hop Modeling For Temporal Question Answering Xue Chao, Liang Di, Wang Pengfei, Zhang Jing
- Longvila: Scaling Long-context Visual Language Models For Long Videos Xue Fuzhao, Chen Yukang, Li Dacheng, Hu Qinghao, Zhu Ligeng, Li Xiuyu, Fang Yunhao, Tang Haotian, Yang Shang, Liu Zhijian, He Ethan, Yin Hongxu, Molchanov Pavlo, Kautz Jan, Fan Linxi, Zhu Yuke, Lu Yao, Han Song
- Openmoe: An Early Effort On Open Mixture-of-experts Language Models Xue Fuzhao, Zheng Zian, Fu Yao, Ni Jinjie, Zheng Zangwei, Zhou Wangchunshu, You Yang
- Improving Audio Codec-based Zero-shot Text-to-speech Synthesis With Multi-modal Context And Large Language Model Xue Jinlong, Deng Yayue, Han Yicheng, Gao Yingming, Li Ya
- Multi-programming Language Ensemble For Code Generation In Large Language Model Xue Tengfei, Li Xuefeng, Azim Tahir, Smirnov Roman, Yu Jianhui, Sadrieh Arash, Pahlavan Babak
- Fine-tuning Large Language Models For Automated Diagnostic Screening Summaries Yadav Manjeet, Sahu Nilesh Kumar, Chaturvedi Mudita, Gupta Snehil, Lone Haroon R.
- Mitigating LLM Hallucinations Via Conformal Abstention Yadkori Yasin Abbasi, Kuzborskij Ilja, Stutz David, György András, Fisch Adam, Doucet Arnaud, Beloshapka Iuliya, Weng Wei-hung, Yang Yao-yuan, Szepesvári Csaba, Cemgil Ali Taylan, Tomasev Nenad
- Medlm: Exploring Language Models For Medical Question Answering Systems Yagnik Niraj, Jhaveri Jay, Sharma Vivek, Pila Gabriel
- LEIA: Facilitating Cross-lingual Knowledge Transfer In Language Models With Entity-based Data Augmentation Yamada Ikuya, Ri Ryokan
- List Items One By One: A New Data Source And Learning Paradigm For Multimodal Llms Yan An, Yang Zhengyuan, Wu Junda, Zhu Wanrong, Yang Jianwei, Li Linjie, Lin Kevin, Wang Jianfeng, Mcauley Julian, Gao Jianfeng, Wang Lijuan
- Mirror: A Multiple-perspective Self-reflection Method For Knowledge-rich Reasoning Yan Hanqi, Zhu Qinglin, Wang Xinyu, Gui Lin, He Yulan
- Predicting The Big Five Personality Traits In Chinese Counselling Dialogues Using Large Language Models Yan Yang, Ma Lizhi, Li Anqi, Ma Jingsong, Lan Zhenzhong
- Exploring The LLM Journey From Cognition To Expression With Linear Representations Yan Yuzi, Li Jialian, Zhang Yipin, Yan Dong
- 3d-properties: Identifying Challenges In DPO And Charting A Path Forward Yan Yuzi, Miao Yibo, Li Jialian, Zhang Yipin, Xie Jian, Deng Zhijie, Yan Dong
- Social Life Simulation For Non-cognitive Skills Learning Yan Zihan, Xiang Yaohong, Huang Yun
- Crafting Customisable Characters With Llms: Introducing Simschat, A Persona-driven Role-playing Agent Framework Yang Bohao, Liu Dong, Tang Chen, Xiao Chenghao, Zhao Kun, Li Chao, Yuan Lin, Yang Guang, Huang Lanxiao, Lin Chenghua
- Embedded Translations For Low-resource Automated Glossing Yang Changbing, Nicolai Garrett, Silfverberg Miikka
- MAPLE: Enhancing Review Generation With Multi-aspect Prompt Learning In Explainable Recommendation Yang Ching-wen, Chen Che Wei, Wu Kun-da, Xu Hao, Yao Jui-feng, Kao Hung-yu
- IM-RAG: Multi-round Retrieval-augmented Generation Through Learning Inner Monologues Yang Diji, Rao Jinmeng, Chen Kezhen, Guo Xiaoyuan, Zhang Yawen, Yang Jie, Zhang Yi
- \(\text{memory}^3\): Language Modeling With Explicit Memory Yang Hongkang, Lin Zehao, Wang Wenjin, Wu Hao, Li Zhiyu, Tang Bo, Wei Wenqiang, Wang Jinbo, Tang Zeyun, Song Shichao, Xi Chenyang, Yu Yu, Chen Kai, Xiong Feiyu, Tang Linpeng, E Weinan
- Aligning Llms Through Multi-perspective User Preference Ranking-based Feedback For Programming Question Answering Yang Hongyu, He Liyang, Hou Min, Shen Shuanghong, Li Rui, Hou Jiahui, Ma Jianhui, Zhao Junda
- 3D-GRAND: A Million-scale Dataset For 3d-llms With Better Grounding And Less Hallucination Yang Jianing, Chen Xuweiyi, Madaan Nikhil, Iyengar Madhavan, Qian Shengyi, Fouhey David F., Chai Joyce
- Asynchronous And Segmented Bidirectional Encoding For NMT Yang Jingpu, Han Zehua, Xiang Mengyu, Wang Helin, Huang Yuxiao, Fang Miao
- Item-language Model For Conversational Recommendation Yang Li, Subbiah Anushya, Patel Hardik, Li Judith Yue, Song Yanwei, Mirghaderi Reza, Aggarwal Vikram
- Mastering Text-to-image Diffusion: Recaptioning, Planning, And Generating With Multimodal Llms Yang Ling, Yu Zhaochen, Meng Chenlin, Xu Minkai, Ermon Stefano, Cui Bin
- Regularizing Hidden States Enables Learning Generalizable Reward Model For Llms Yang Rui, Ding Ruomeng, Lin Yong, Zhang Huan, Zhang Tong
- Confidence Calibration And Rationalization For Llms Via Multi-agent Deliberation Yang Ruixin, Rajagopal Dheeraj, Hayati Shirley Anugrah, Hu Bin, Kang Dongyeop
- GPT-4 As Evaluator: Evaluating Large Language Models On Pest Management In Agriculture Yang Shanglong, Yuan Zhipeng, Li Shunbao, Peng Ruoling, Liu Kang, Yang Po
- Video As The New Language For Real-world Decision Making Yang Sherry, Walker Jacob, Parker-holder Jack, Du Yilun, Bruce Jake, Barreto Andre, Abbeel Pieter, Schuurmans Dale
- Seed-story: Multimodal Long Story Generation With Large Language Model Yang Shuai, Ge Yuying, Li Yang, Chen Yukang, Ge Yixiao, Shan Ying, Chen Yingcong
- Is Crowdsourcing Breaking Your Bank? Cost-effective Fine-tuning Of Pre-trained Language Models With Proximal Policy Optimization Yang Shuo, Kasneci Gjergji
- Do Large Language Models Latently Perform Multi-hop Reasoning? Yang Sohee, Gribovskaya Elena, Kassner Nora, Geva Mor, Riedel Sebastian
- The Butterfly Effect Of Model Editing: Few Edits Can Trigger Large Language Models Collapse Yang Wanli, Sun Fei, Ma Xinyu, Liu Xun, Yin Dawei, Cheng Xueqi
- CRAG -- Comprehensive RAG Benchmark Yang Xiao, Sun Kai, Xin Hao, Sun Yushi, Bhalla Nikita, Chen Xiangsen, Choudhary Sajal, Gui Rongze Daniel, Jiang Ziran Will, Jiang Ziyu, Kong Lingkun, Moran Brian, Wang Jiaqi, Xu Yifan Ethan, Yan An, Yang Chenyu, Yuan Eting, Zha Hanwen, Tang Nan, Chen Lei, Scheffer Nicolas, Liu Yue, Shah Nirav, Wanga Rakesh, Kumar Anuj, Yih Wen-tau, Dong Xin Luna
- Arithmetic Reasoning With LLM: Prolog Generation & Permutation Yang Xiaocheng, Chen Bingsen, Tam Yik-cheung
- Corda: Context-oriented Decomposition Adaptation Of Large Language Models Yang Yibo, Li Xiaojie, Zhou Zhongzhu, Song Shuaiwen Leon, Wu Jianlong, Nie Liqiang, Ghanem Bernard
- Pre-trained Language Models Improve The Few-shot Prompt Ability Of Decision Transformer Yang Yu, Xu Pan
- Can Llms Reason In The Wild With Programs? Yang Yuan, Xiong Siheng, Payani Ali, Shareghi Ehsan, Fekri Faramarz
- Weak-to-strong Reasoning Yang Yuqing, Ma Yan, Liu Pengfei
- Assessing Adversarial Robustness Of Large Language Models: An Empirical Study Yang Zeyu, Meng Zhao, Zheng Xiaochen, Wattenhofer Roger
- Self-distillation Bridges Distribution Gap In Language Model Fine-tuning Yang Zhaorui, Pang Tianyu, Feng Haozhe, Wang Han, Chen Wei, Zhu Minfeng, Liu Qian
- Adversarial Attacks And Defense For Conversation Entailment Task Yang Zhenning, Krawec Ryan, Wu Liang-yuan
- SOS! Soft Prompt Attack Against Open-source Large Language Models Yang Ziqing, Backes Michael, Zhang Yang, Salem Ahmed
- React Meets Actre: When Language Agents Enjoy Training Data Autonomy Yang Zonghan, Li Peng, Yan Ming, Zhang Ji, Huang Fei, Liu Yang
- Data Contamination Can Cross Language Barriers Yao Feng, Zhuang Yufan, Sun Zihao, Xu Sunan, Kumar Animesh, Shang Jingbo
- Exploiting Inter-layer Expert Affinity For Accelerating Mixture-of-experts Model Inference Yao Jinghan, Anthony Quentin, Shafi Aamir, Subramoni Hari, Panda Dhabaleswar K. (DK)
- Enhancing In-context Learning Performance With Just Svd-based Weight Pruning: A Theoretical Perspective Yao Xinhao, Hu Xiaolin, Yang Shenzhi, Liu Yong
- Sirllm: Streaming Infinite Retentive LLM Yao Yao, Li Zuchao, Zhao Hai
- Preference-guided Reflective Sampling For Aligning Language Models Ye Hai, Ng Hwee Tou
- Self-judge: Selective Instruction Following With Alignment Self-evaluation Ye Hai, Ng Hwee Tou
- X-VILA: Cross-modality Alignment For Large Language Model Ye Hanrong, Huang De-an, Lu Yao, Yu Zhiding, Ping Wei, Tao Andrew, Kautz Jan, Han Song, Xu Dan, Molchanov Pavlo, Yin Hongxu
- Mplug-owl3: Towards Long Image-sequence Understanding In Multi-modal Large Language Models Ye Jiabo, Xu Haiyang, Liu Haowei, Hu Anwen, Yan Ming, Qian Qi, Zhang Ji, Huang Fei, Zhou Jingren
- Tooleyes: Fine-grained Evaluation For Tool Learning Capabilities Of Large Language Models In Real-world Scenarios Ye Junjie, Li Guanyu, Gao Songyang, Huang Caishuang, Wu Yilong, Li Sixian, Fan Xiaoran, Dou Shihan, Zhang Qi, Gui Tao, Huang Xuanjing
- Toolsword: Unveiling Safety Issues Of Large Language Models In Tool Learning Across Three Stages Ye Junjie, Li Sixian, Li Guanyu, Huang Caishuang, Gao Songyang, Wu Yilong, Zhang Qi, Gui Tao, Huang Xuanjing
- Rotbench: A Multi-level Benchmark For Evaluating The Robustness Of Large Language Models In Tool Learning Ye Junjie, Wu Yilong, Gao Songyang, Huang Caishuang, Li Sixian, Li Guanyu, Fan Xiaoran, Zhang Qi, Gui Tao, Huang Xuanjing
- Mm-spubench: Towards Better Understanding Of Spurious Biases In Multimodal Llms Ye Wenqian, Zheng Guangtao, Ma Yunsheng, Cao Xu, Lai Bolin, Rehg James M., Zhang Aidong
- BEAF: Observing Before-after Changes To Evaluate Hallucination In Vision-language Models Ye-bin Moon, Hyeon-woo Nam, Choi Wonseok, Oh Tae-hyun
- Interrogatellm: Zero-resource Hallucination Detection In Llm-generated Answers Yehuda Yakir, Malkiel Itzik, Barkan Oren, Weill Jonathan, Ronen Royi, Koenigstein Noam
- Is The Pope Catholic? Yes, The Pope Is Catholic. Generative Evaluation Of Non-literal Intent Resolution In Llms Yerukola Akhila, Vaduguru Saujas, Fried Daniel, Sap Maarten
- Using Large Language Models To Create AI Personas For Replication And Prediction Of Media Effects: An Empirical Test Of 133 Published Experimental Research Findings Yeykelis Leo, Pichai Kaavya, Cummings James J., Reeves Byron
- Debiasing Multimodal Large Language Models Zhang Yi-fan, Yu Weichen, Wen Qingsong, Wang Xue, Zhang Zhang, Wang Liang, Jin Rong, Tan Tieniu
- The Task-oriented Queries Benchmark (toqb) Yim Keun Soo
- MMAU: A Holistic Benchmark Of Agent Capabilities Across Diverse Domains Yin Guoli, Bai Haoping, Ma Shuang, Nan Feng, Sun Yanchao, Xu Zhaoyang, Ma Shen, Lu Jiarui, Kong Xiang, Zhang Aonan, Yap Dian Ang, Zhang Yizhe, Ahnert Karsten, Kamath Vik, Berglund Mathias, Walsh Dominic, Gindele Tobias, Wiest Juergen, Lai Zhengfeng, Wang Xiaoming, Shan Jiulong, Cao Meng, Pang Ruoming, Wang Zirui
- Stablemask: Refining Causal Masking In Decoder-only Transformer Yin Qingyu, He Xuzheng, Zhuang Xiang, Zhao Yu, Yao Jianhua, Shen Xiaoyu, Zhang Qiang
- Do Large Language Model Understand Multi-intent Spoken Language? Yin Shangjian, Huang Peijie, Xu Yuhong, Huang Haojing, Chen Jiatian
- SEA: Supervised Embedding Alignment For Token-level Visual-textual Integration In Mllms Yin Yuanyang, Zhao Yaqi, Zhang Yajie, Lin Ke, Wang Jiahao, Tao Xin, Wan Pengfei, Zhang Di, Yin Baoqun, Zhang Wentao
- Relative Preference Optimization: Enhancing LLM Alignment Through Contrasting Responses Across Identical And Diverse Prompts Yin Yueqin, Wang Zhendong, Gu Yi, Huang Hai, Chen Weizhu, Zhou Mingyuan
- Internlm-math: Open Math Large Language Models Toward Verifiable Reasoning Ying Huaiyuan, Zhang Shuo, Li Linyang, Zhou Zhejian, Shao Yunfan, Fei Zhaoye, Ma Yichuan, Hong Jiawei, Liu Kuikun, Wang Ziyi, Wang Yudong, Wu Zijian, Li Shuaibin, Zhou Fengzhe, Liu Hongwei, Zhang Songyang, Zhang Wenwei, Yan Hang, Qiu Xipeng, Wang Jiayu, Chen Kai, Lin Dahua
- Automating Dataset Updates Towards Reliable And Timely Evaluation Of Large Language Models Ying Jiahao, Cao Yixin, Bai Yushi, Sun Qianru, Wang Bo, Tang Wei, Ding Zhaojun, Yang Yizhe, Huang Xuanjing, Yan Shuicheng
- Unveiling The Safety Of Gpt-4o: An Empirical Study Using Jailbreak Attacks Ying Zonghao, Liu Aishan, Liu Xianglong, Tao Dacheng
- Large Language Model With Graph Convolution For Recommendation Du Yingpeng, Wang Ziyan, Sun Zhu, Chua Haoyan, Liu Hongzhi, Wu Zhonghai, Ma Yining, Zhang Jie, Sun Youchen
- Debunc: Mitigating Hallucinations In Large Language Model Agent Communication With Uncertainty Estimations Yoffe Luke, Amayuelas Alfonso, Wang William Yang
- TLCR: Token-level Continuous Reward For Fine-grained Reinforcement Learning From Human Feedback Yoon Eunseop, Yoon Hee Suk, Eom Soohwan, Han Gunsoo, Nam Daniel Wontae, Jo Daejin, On Kyoung-woon, Hasegawa-johnson Mark A., Kim Sungwoong, Yoo Chang D.
- C-TPT: Calibrated Test-time Prompt Tuning For Vision-language Models Via Text Feature Dispersion Yoon Hee Suk, Yoon Eunseop, Tee Joshua Tian Jin, Hasegawa-johnson Mark, Li Yingzhen, Yoo Chang D.
- Matryoshka-adaptor: Unsupervised And Supervised Tuning For Smaller Embedding Dimensions Yoon Jinsung, Sinha Raj, Arik Sercan O., Pfister Tomas
- Making The Most Of Your Model: Methods For Finetuning And Applying Pretrained Transformers Yoshida Davis
- Tree-planted Transformers: Unidirectional Transformer Language Models With Implicit Syntactic Supervision Yoshida Ryo, Someya Taiga, Oseki Yohei
- Flow Of Reasoning: Efficient Training Of LLM Policy With Divergent Thinking Yu Fangxu, Jiang Lai, Kang Haoqiang, Hao Shibo, Qin Lianhui
- Iterative Graph Alignment Yu Fangyuan, Arora Hardeep Singh, Johnson Matt
- Effectively Compress KV Heads For LLM Yu Hao, Yang Zelan, Li Shen, Li Yong, Wu Jianxin
- Enhancing Healthcare Through Large Language Models: A Study On Medical Question Answering Yu Haoran, Yu Chang, Wang Zihan, Zou Dongxian, Qin Hao
- A Cause-effect Look At Alleviating Hallucination Of Knowledge-grounded Dialogue Generation Yu Jifan, Zhang Xiaohan, Xu Yifan, Lei Xuanyu, Yao Zijun, Zhang Jing, Hou Lei, Li Juanzi
- An LLM Maturity Model For Reliable And Transparent Text-to-query Yu Lei, Ray Abir
- Mechanistic Understanding And Mitigation Of Language Model Non-factual Hallucinations Yu Lei, Cao Meng, Cheung Jackie Chi Kit, Dong Yue
- In Defense Of RAG In The Era Of Long-context Language Models Yu Tan, Xu Anbang, Akkiraju Rama
- Mitigate Position Bias In Large Language Models Via Scaling A Single Dimension Yu Yijiong, Jiang Huiqiang, Luo Xufang, Wu Qianhui, Lin Chin-yew, Li Dongsheng, Yang Yuqing, Huang Yongfeng, Qiu Lili
- Kieval: A Knowledge-grounded Interactive Evaluation Framework For Large Language Models Yu Zhuohao, Gao Chang, Yao Wenjin, Wang Yidong, Ye Wei, Wang Jindong, Xie Xing, Zhang Yue, Zhang Shikun
- Predicting Rewards Alongside Tokens: Non-disruptive Parameter Insertion For Efficient Inference Intervention In Large Language Model Yuan Chenhan, Huang Fei, Peng Ru, Lu Keming, Yu Bowen, Zhou Chang, Zhou Jingren
- Whispers That Shake Foundations: Analyzing And Mitigating False Premise Hallucinations In Large Language Models Yuan Hongbang, Cao Pengfei, Jin Zhuoran, Chen Yubo, Zeng Daojian, Liu Kang, Zhao Jun
- Advancing LLM Reasoning Generalists With Preference Trees Yuan Lifan, Cui Ganqu, Wang Hanbin, Ding Ning, Wang Xingyao, Deng Jia, Shan Boji, Chen Huimin, Xie Ruobing, Lin Yankai, Liu Zhenghao, Zhou Bowen, Peng Hao, Liu Zhiyuan, Sun Maosong
- Why Lift So Heavy? Slimming Large Language Models By Cutting Off The Layers Yuan Shuzhou, Nie Ercong, Ma Bolei, Färber Michael
- EASYTOOL: Enhancing Llm-based Agents With Concise Tool Instruction Yuan Siyu, Song Kaitao, Chen Jiangjie, Tan Xu, Shen Yongliang, Kan Ren, Li Dongsheng, Yang Deqing
- R-judge: Benchmarking Safety Risk Awareness For LLM Agents Yuan Tongxin, He Zhiwei, Dong Lingzhong, Wang Yiming, Zhao Ruijie, Xia Tian, Xu Lizhen, Zhou Binglin, Li Fangqi, Zhang Zhuosheng, Wang Rui, Liu Gongshen
- LLMCRIT: Teaching Large Language Models To Use Criteria Yuan Weizhe, Liu Pengfei, Gallé Matthias
- Self-rewarding Language Models Yuan Weizhe, Pang Richard Yuanzhe, Cho Kyunghyun, Li Xian, Sukhbaatar Sainbayar, Xu Jing, Weston Jason
- Evaluating Character Understanding Of Large Language Models Via Character Profiling From Fictional Works Yuan Xinfeng, Yuan Siyu, Cui Yuhan, Lin Tianhe, Wang Xintao, Xu Rui, Chen Jiangjie, Yang Deqing
- A Hybrid RAG System With Comprehensive Enhancement On Complex Reasoning Yuan Ye, Liu Chengwu, Yuan Jingyang, Sun Gongbo, Li Siqi, Zhang Ming
- Rigorllm: Resilient Guardrails For Large Language Models Against Undesired Content Yuan Zhuowen, Xiong Zidi, Zeng Yi, Yu Ning, Jia Ruoxi, Song Dawn, Li Bo
- Distilling Instruction-following Abilities Of Large Language Models With Task-aware Curriculum Planning Yue Yuanhao, Wang Chengyu, Huang Jun, Wang Peng
- Investigating Continual Pretraining In Large Language Models: Insights And Implications Yıldız Çağatay, Ravichandran Nishaanth Kanna, Punia Prishruit, Bethge Matthias, Ermis Beyza
- Envgen: Generating And Adapting Environments Via Llms For Training Embodied Agents Zala Abhay, Cho Jaemin, Lin Han, Yoon Jaehong, Bansal Mohit
- Building Accurate Translation-tailored Llms With Language Aware Instruction Tuning Zan Changtong, Ding Liang, Shen Li, Zhen Yibing, Liu Weifeng, Tao Dacheng
- Improving Logits-based Detector Without Logits From Black-box Llms Zeng Cong, Tang Shengkun, Yang Xianjun, Chen Yuanzhou, Sun Yiyou, Xu Zhiqiang, Li Yao, Chen Haifeng, Cheng Wei, Xu Dongkuan
- Multilingual Brain Surgeon: Large Language Models Can Be Compressed Leaving No Language Behind Zeng Hongchuan, Xu Hongshen, Chen Lu, Yu Kai
- Skywork-math: Data Scaling Laws For Mathematical Reasoning In Large Language Models -- The Story Goes On Zeng Liang, Zhong Liangjun, Zhao Liang, Wei Tianwen, Yang Liu, He Jujie, Cheng Cheng, Hu Rui, Liu Yang, Yan Shuicheng, Fang Han, Zhou Yahui
- Divtod: Unleashing The Power Of Llms For Diversifying Task-oriented Dialogue Representations Zeng Weihao, Fu Dayuan, He Keqing, Wang Yejie, Xu Yukai, Xu Weiran
- How Johnny Can Persuade Llms To Jailbreak Them: Rethinking Persuasion To Challenge AI Safety By Humanizing Llms Zeng Yi, Lin Hongpeng, Zhang Jingwen, Yang Diyi, Jia Ruoxi, Shi Weiyan
- Token-level Direct Preference Optimization Zeng Yongcheng, Liu Guoqing, Ma Weiyu, Yang Ning, Zhang Haifeng, Wang Jun
- DELIA: Diversity-enhanced Learning For Instruction Adaptation In Large Language Models Zeng Yuanhao, Ren Fei, Zhou Xinpeng, Wang Yihang, Shao Yingxia
- Learning Reward For Robot Skills Using Large Language Models Via Self-alignment Zeng Yuwei, Mu Yao, Shao Lin
- MR-BEN: A Comprehensive Meta-reasoning Benchmark For Large Language Models Zeng Zhongshen, Liu Yinhong, Wan Yingjia, Li Jingyao, Chen Pengguang, Dai Jianbo, Yao Yuxuan, Xu Rongwu, Qi Zehan, Zhao Wanru, Shen Linling, Lu Jianqiao, Tan Haochen, Chen Yukang, Zhang Hao, Shi Zhan, Wang Bailin, Guo Zhijiang, Jia Jiaya
- Online Self-preferring Language Models Zhai Yuanzhao, Zhang Zhuo, Xu Kele, Peng Hanyang, Yu Yue, Feng Dawei, Yang Cheng, Ding Bo, Wang Huaimin
- Large Language Models Are Capable Of Offering Cognitive Reappraisal, If Guided Zhan Hongli, Zheng Allen, Lee Yoon Kyung, Suh Jina, Li Junyi Jessy, Ong Desmond C.
- Injecagent: Benchmarking Indirect Prompt Injections In Tool-integrated Large Language Model Agents Zhan Qiusi, Liang Zhixiang, Ying Zifan, Kang Daniel
- Simple Techniques For Enhancing Sentence Embeddings In Generative Language Models Zhang Bowen, Chang Kehua, Li Chunping
- LUQ: Long-text Uncertainty Quantification For Llms Zhang Caiqi, Liu Fangyu, Basaldella Marco, Collier Nigel
- Notellm-2: Multimodal Large Representation Models For Recommendation Zhang Chao, Zhang Haoxin, Wu Shiwei, Wu Di, Xu Tong, Gao Yan, Hu Yao, Chen Enhong
- Teaching Large Language Models An Unseen Language On The Fly Zhang Chen, Liu Xiao, Lin Jiuheng, Feng Yansong
- Ts-align: A Teacher-student Collaborative Framework For Scalable Iterative Finetuning Of Large Language Models Zhang Chen, Tang Chengguang, Chong Dading, Shi Ke, Tang Guohua, Jiang Feng, Li Haizhou
- Goal-guided Generative Prompt Injection Attack On Large Language Models Zhang Chong, Jin Mingyu, Yu Qinkai, Liu Chengzhi, Xue Haochen, Jin Xiaobo
- Rest-mcts*: LLM Self-training Via Process Reward Guided Tree Search Zhang Dan, Zhoubian Sining, Hu Ziniu, Yue Yisong, Dong Yuxiao, Tang Jie
- Mm-llms: Recent Advances In Multimodal Large Language Models Zhang Duzhen, Yu Yahan, Dong Jiahua, Li Chenxing, Su Dan, Chu Chenhui, Yu Dong
- Are Large Language Models Good At Utility Judgments? Zhang Hengran, Zhang Ruqing, Guo Jiafeng, De Rijke Maarten, Fan Yixing, Cheng Xueqi
- Sinklora: Enhanced Efficiency And Chat Capabilities For Long-context Large Language Models Zhang Hengyu
- Balancing Speciality And Versatility: A Coarse To Fine Framework For Supervised Fine-tuning Large Language Model Zhang Hengyuan, Wu Yanru, Li Dawei, Yang Sak, Zhao Rui, Jiang Yong, Tan Fei
- Adaptable Logical Control For Large Language Models Zhang Honghua, Kung Po-nien, Yoshida Masahiro, Van Den Broeck Guy, Peng Nanyun
- Agentohana: Design Unified Data And Training Pipeline For Effective Agent Learning Zhang Jianguo, Lan Tian, Murthy Rithesh, Liu Zhiwei, Yao Weiran, Tan Juntao, Hoang Thai, Yang Liangwei, Feng Yihao, Liu Zuxin, Awalgaonkar Tulika, Niebles Juan Carlos, Savarese Silvio, Heinecke Shelby, Wang Huan, Xiong Caiming
- Should We Fear Large Language Models? A Structural Analysis Of The Human Reasoning System For Elucidating LLM Capabilities And Risks Through The Lens Of Heidegger's Philosophy Zhang Jianqiiu
- Stealthy Attack On Large Language Model Based Recommendation Zhang Jinghao, Liu Yuting, Liu Qiang, Wu Shu, Guo Guibing, Wang Liang
- Regurgitative Training: The Value Of Real Data In Training Large Language Models Zhang Jinghui, Qiao Dandan, Yang Mochen, Wei Qiang
- Verifiable By Design: Aligning Language Models To Quote From Pre-training Data Zhang Jingyu, Marone Marc, Li Tianjian, Van Durme Benjamin, Khashabi Daniel
- Exploring The Best Practices Of Query Expansion With Large Language Models Zhang Le, Wu Yihong, Yang Qian, Nie Jian-yun
- Predicting Learning Performance With Large Language Models: A Study In Adult Literacy Zhang Liang, Lin Jionghao, Borchers Conrad, Sabatini John, Hollander John, Cao Meng, Hu Xiangen
- SPL: A Socratic Playground For Learning Powered By Large Language Model Zhang Liang, Lin Jionghao, Kuang Ziyi, Xu Sheng, Yeasin Mohammed, Hu Xiangen
- ARL2: Aligning Retrievers For Black-box Large Language Models Via Self-guided Adaptive Relevance Labeling Zhang Lingxi, Yu Yue, Wang Kuan, Zhang Chao
- Generative Verifiers: Reward Modeling As Next-token Prediction Zhang Lunjun, Hosseini Arian, Bansal Hritik, Kazemi Mehran, Kumar Aviral, Agarwal Rishabh
- Automated Multi-level Preference For Mllms Zhang Mengxi, Wu Wenhao, Lu Yu, Song Yuxin, Rong Kang, Yao Huanjin, Zhao Jianbo, Liu Fanglong, Sun Yifan, Feng Haocheng, Wang Jingdong
- Instructedit: Instruction-based Knowledge Editing For Large Language Models Zhang Ningyu, Tian Bozhong, Cheng Siyuan, Liang Xiaozhuan, Hu Yi, Xue Kouying, Gou Yanjie, Chen Xi, Chen Huajun
- Internlm-xcomposer-2.5: A Versatile Large Vision Language Model Supporting Long-contextual Input And Output Zhang Pan, Dong Xiaoyi, Zang Yuhang, Cao Yuhang, Qian Rui, Chen Lin, Guo Qipeng, Duan Haodong, Wang Bin, Ouyang Linke, Zhang Songyang, Zhang Wenwei, Li Yining, Gao Yang, Sun Peng, Zhang Xinyue, Li Wei, Li Jingwen, Wang Wenhai, Yan Hang, He Conghui, Zhang Xingcheng, Chen Kai, Dai Jifeng, Qiao Yu, Lin Dahua, Wang Jiaqi
- Soaring From 4K To 400K: Extending Llm's Context With Activation Beacon Zhang Peitian, Liu Zheng, Xiao Shitao, Shao Ninglu, Ye Qiwei, Dou Zhicheng
- Extending Llama-3's Context Ten-fold Overnight Zhang Peitian, Shao Ninglu, Liu Zheng, Xiao Shitao, Qian Hongjin, Ye Qiwei, Dou Zhicheng
- VAR-CLIP: Text-to-image Generator With Visual Auto-regressive Modeling Zhang Qian, Dai Xiangzi, Yang Ninghua, An Xiang, Feng Ziyong, Ren Xingyu
- Collaborative Performance Prediction For Large Language Models Zhang Qiyuan, Lyu Fuyuan, Liu Xue, Ma Chen
- Instruct Large Language Models To Drive Like Humans Zhang Ruijun, Guo Xianda, Zheng Wenzhao, Zhang Chenming, Keutzer Kurt, Chen Long
- Direct Preference Optimization Of Video Large Multimodal Models From Language Model Reward Zhang Ruohong, Gui Liangke, Sun Zhiqing, Feng Yihao, Xu Keyang, Zhang Yuanhan, Fu Di, Li Chunyuan, Hauptmann Alexander, Bisk Yonatan, Yang Yiming
- Offline Training Of Language Model Agents With Functions As Learnable Weights Zhang Shaokun, Zhang Jieyu, Liu Jiale, Song Linxin, Wang Chi, Krishna Ranjay, Wu Qingyun
- Cfbench: A Comprehensive Constraints-following Benchmark For Llms Zhang Tao, Shen Yanjun, Luo Wenjing, Zhang Yan, Liang Hao, Zhang Tao, Yang Fan, Lin Mingan, Qiao Yujing, Chen Weipeng, Cui Bin, Zhang Wentao, Zhou Zenan
- R4: Reinforced Retriever-reorder-responder For Retrieval-augmented Large Language Models Zhang Taolin, Li Dongyang, Chen Qizhou, Wang Chengyu, Huang Longtao, Xue Hui, He Xiaofeng, Huang Jun
- Revisiting OPRO: The Limitations Of Small-scale Llms As Optimizers Zhang Tuo, Yuan Jinyue, Avestimehr Salman
- Seallms 3: Open Foundation And Chat Multilingual Large Language Models For Southeast Asian Languages Zhang Wenxuan, Chan Hou Pong, Zhao Yiran, Aljunied Mahani, Wang Jianyu, Liu Chaoqun, Deng Yue, Hu Zhiqiang, Xu Weiwen, Chia Yew Ken, Li Xin, Bing Lidong
- Llms Instruct Llms: An Extraction And Editing Method Zhang Xin, Ju Tianjie, Liang Huijia, Fu Ying, Zhang Qin
- Laser: Parameter-efficient LLM Bi-tuning For Sequential Recommendation With Collaborative Information Zhang Xinyu, Hu Linmei, Zhang Luhao, Song Dandan, Huang Heyan, Nie Liqiang
- Contrasolver: Self-alignment Of Language Models By Resolving Internal Preference Contradictions Zhang Xu, Yin Xunjian, Wan Xiaojun
- Ask-before-plan: Proactive Language Agents For Real-world Planning Zhang Xuan, Deng Yang, Ren Zifeng, Ng See-kiong, Chua Tat-seng
- Chain Of Preference Optimization: Improving Chain-of-thought Reasoning In Llms Zhang Xuan, Du Chao, Pang Tianyu, Liu Qian, Gao Wei, Lin Min
- Reinforcement Retrieval Leveraging Fine-grained Feedback For Fact Checking News Claims With Black-box LLM Zhang Xuan, Gao Wei
- TREACLE: Thrifty Reasoning Via Context-aware LLM And Prompt Selection Zhang Xuechen, Huang Zijian, Taga Ege Onur, Joe-wong Carlee, Oymak Samet, Chen Jiasi
- LLM As A Mastermind: A Survey Of Strategic Reasoning With Large Language Models Zhang Yadong, Mao Shaoguang, Ge Tao, Wang Xun, De Wynter Adrian, Xia Yan, Wu Wenshan, Song Ting, Lan Man, Wei Furu
- Pybench: Evaluating LLM Agent On Various Real-world Coding Tasks Zhang Yaolun, Pan Yinxu, Wang Yudong, Cai Jie
- Mme-realworld: Could Your Multimodal LLM Challenge High-resolution Real-world Scenarios That Are Difficult For Humans? Zhang Yi-fan, Zhang Huanyu, Tian Haochen, Fu Chaoyou, Zhang Shuangqing, Wu Junfei, Li Feng, Wang Kun, Wen Qingsong, Zhang Zhang, Wang Liang, Jin Rong, Tan Tieniu
- Wings: Learning Multimodal Llms Without Text-only Forgetting Zhang Yi-kai, Lu Shiyin, Li Yang, Ma Yanqing, Chen Qing-guo, Xu Zhao, Luo Weihua, Zhang Kaifu, Zhan De-chuan, Ye Han-jia
- Rankclip: Ranking-consistent Language-image Pretraining Zhang Yiming, Zhao Zhuokai, Chen Zhaorun, Feng Zhili, Ding Zenghui, Sun Yining
- Navigating User Experience Of Chatgpt-based Conversational Recommender Systems: The Effects Of Prompt Guidance And Recommendation Domain Zhang Yizhe, Jin Yucheng, Chen Li, Yang Ting
- Can LLM Graph Reasoning Generalize Beyond Pattern Memorization? Zhang Yizhuo, Wang Heng, Feng Shangbin, Tan Zhaoxuan, Han Xiaochuang, He Tianxing, Tsvetkov Yulia
- Worldqa: Multimodal World Knowledge In Videos Through Long-chain Reasoning Zhang Yuanhan, Zhang Kaichen, Li Bo, Pu Fanyi, Setiadharma Christopher Arif, Yang Jingkang, Liu Ziwei
- Prompt-aware Adapter: Towards Learning Adaptive Visual Tokens For Multimodal Large Language Models Zhang Yue, Fan Hehe, Yang Yi
- Pattern-aware Chain-of-thought Prompting In Large Language Models Zhang Yufeng, Wang Xuepeng, Wu Lingxiang, Wang Jinqiao
- OT-VP: Optimal Transport-guided Visual Prompting For Test-time Adaptation Zhang Yunbei, Mehra Akshay, Hamm Jihun
- Small Language Models Need Strong Verifiers To Self-correct Reasoning Zhang Yunxiang, Khalifa Muhammad, Logeswaran Lajanugen, Kim Jaekyeom, Lee Moontae, Lee Honglak, Wang Lu
- Intention Analysis Makes Llms A Good Jailbreak Defender Zhang Yuqi, Ding Liang, Zhang Lefei, Tao Dacheng
- Adam-mini: Use Fewer Learning Rates To Gain More Zhang Yushun, Chen Congliang, Li Ziniu, Ding Tian, Wu Chenwei, Ye Yinyu, Luo Zhi-quan, Sun Ruoyu
- Toolbehonest: A Multi-level Hallucination Diagnostic Benchmark For Tool-augmented Large Language Models Zhang Yuxiang, Chen Jing, Wang Junjie, Liu Yaxin, Yang Cheng, Shi Chufan, Zhu Xinyu, Lin Zihao, Wan Hanwen, Yang Yujiu, Sakai Tetsuya, Feng Tian, Yamana Hayato
- PPTC-R Benchmark: Towards Evaluating The Robustness Of Large Language Models For Powerpoint Task Completion Zhang Zekai, Guo Yiduo, Liang Yaobo, Zhao Dongyan, Duan Nan
- Diversifying The Expert Knowledge For Task-agnostic Pruning In Sparse Mixture-of-experts Zhang Zeliang, Liu Xiaodong, Cheng Hao, Xu Chenliang, Gao Jianfeng
- A Survey On The Memory Mechanism Of Large Language Model Based Agents Zhang Zeyu, Bo Xiaohe, Ma Chen, Li Rui, Chen Xu, Dai Quanyu, Zhu Jieming, Dong Zhenhua, Wen Ji-rong
- Generating Chain-of-thoughts With A Pairwise-comparison Approach To Searching For The Most Promising Intermediate Thought Zhang Zhen-yu, Han Siwei, Yao Huaxiu, Niu Gang, Sugiyama Masashi
- Position-aware Parameter Efficient Fine-tuning Approach For Reducing Positional Bias In Llms Zhang Zheng, Yang Fan, Jiang Ziyan, Chen Zheng, Zhao Zhengyang, Ma Chengyuan, Zhao Liang, Liu Yang
- E-bench: Towards Evaluating The Ease-of-use Of Large Language Models Zhang Zhenyu, Hao Bingguang, Li Jinpeng, Zhang Zekai, Zhao Dongyan
- Shieldlm: Empowering Llms As Aligned, Customizable And Explainable Safety Detectors Zhang Zhexin, Lu Yida, Ma Jingyuan, Zhang Di, Li Rui, Ke Pei, Sun Hao, Sha Lei, Sui Zhifang, Wang Hongning, Huang Minlie
- Safe Unlearning: A Surprisingly Effective And Generalizable Solution To Defend Against Jailbreak Attacks Zhang Zhexin, Yang Junxiao, Ke Pei, Cui Shiyao, Zheng Chujie, Wang Hongning, Huang Minlie
- Galla: Graph Aligned Large Language Models For Improved Source Code Understanding Zhang Ziyin, Yu Hang, Li Shijie, Di Peng, Li Jianguo, Wang Rui
- Reverse That Number! Decoding Order Matters In Arithmetic Learning Zhang-li Daniel, Lin Nianyi, Yu Jifan, Zhang Zheyuan, Yao Zijun, Zhang Xiaokang, Hou Lei, Zhang Jing, Li Juanzi
- APT: Adaptive Pruning And Tuning Pretrained Language Models For Efficient Training And Inference Zhao Bowen, Hajishirzi Hannaneh, Cao Qingqing
- Large Language Model Can Continue Evolving From Mistakes Zhao Haokun, Han Haixia, Shi Jie, Du Chengyu, Liang Jiaqing, Xiao Yanghua
- LANE: Logic Alignment Of Non-tuning Large Language Models And Online Recommendation Systems For Explainable Reason Generation Zhao Hongke, Zheng Songming, Wu Likang, Yu Bowen, Wang Jing
- Cpsyexam: A Chinese Benchmark For Evaluating Psychology Using Examinations Zhao Jiahao, Zhu Jingwei, Tan Minghuan, Yang Min, Yang Di, Zhang Chenhao, Ye Guancheng, Li Chengming, Hu Xiping
- Exploring The Compositional Deficiency Of Large Language Models In Mathematical Reasoning Zhao Jun, Tong Jingqi, Mou Yurong, Zhang Ming, Zhang Qi, Huang Xuanjing
- Lora Land: 310 Fine-tuned Llms That Rival GPT-4, A Technical Report Zhao Justin, Wang Timothy, Abid Wael, Angus Geoffrey, Garg Arnav, Kinnison Jeffery, Sherstinsky Alex, Molino Piero, Addair Travis, Rishi Devvret
- Aqulia-med LLM: Pioneering Full-process Open-source Medical Language Models Zhao Lulu, Zeng Weihao, Shi Xiaofeng, Zhou Hua, Hao Donglin, Lin Yonghua
- A Survey Of Backdoor Attacks And Defenses On Large Language Models: Implications For Security Measures Zhao Shuai, Jia Meihuizi, Guo Zhongliang, Gan Leilei, Xu Xiaoyu, Wu Xiaobao, Fu Jie, Feng Yichao, Pan Fengjun, Tuan Luu Anh
- Prepacking: A Simple Method For Fast Prefilling And Increased Throughput In Large Language Models Zhao Siyan, Israel Daniel, Van Den Broeck Guy, Grover Aditya
- Defending Large Language Models Against Jailbreak Attacks Via Layer-specific Editing Zhao Wei, Li Zhe, Li Yige, Zhang Ye, Sun Jun
- Both Matter: Enhancing The Emotional Intelligence Of Large Language Models Without Compromising The General Intelligence Zhao Weixiang, Li Zhuojun, Wang Shilong, Wang Yang, Hu Yulin, Zhao Yanyan, Wei Chen, Qin Bing
- SAPT: A Shared Attention Framework For Parameter-efficient Continual Learning Of Large Language Models Zhao Weixiang, Wang Shilong, Hu Yulin, Zhao Yanyan, Qin Bing, Zhang Xuanyu, Yang Qing, Xu Dongliang, Che Wanxiang
- Enhancing Cross-domain Pre-trained Decision Transformers With Adaptive Attention Zhao Wenhao, Xu Qiushui, Xu Linjie, Song Lei, Wang Jinyu, Zhou Chunlai, Bian Jiang
- Wildhallucinations: Evaluating Long-form Factuality In Llms With Real-world Entity Queries Zhao Wenting, Goyal Tanya, Chiu Yu Ying, Jiang Liwei, Newman Benjamin, Ravichander Abhilasha, Chandu Khyathi, Le Bras Ronan, Cardie Claire, Deng Yuntian, Choi Yejin
- Codev: Empowering Llms For Verilog Generation Through Multi-level Summarization Zhao Yang, Huang Di, Li Chongxiao, Jin Pengwei, Nan Ziyuan, Ma Tianyun, Qi Lei, Pan Yansong, Zhang Zhenxing, Zhang Rui, Zhang Xishan, Du Zidong, Guo Qi, Hu Xing, Chen Yunji
- ALISA: Accelerating Large Language Model Inference Via Sparsity-aware KV Caching Zhao Youpeng, Wu Di, Wang Jun
- An Empirical Study Of Retrieval Augmented Generation With Chain-of-thought Zhao Yuetong, Cao Hongyu, Zhao Xianyu, Ou Zhijian
- Improving The Robustness Of Large Language Models Via Consistency Alignment Zhao Yukun, Yan Lingyong, Sun Weiwei, Xing Guoliang, Wang Shuaiqiang, Meng Chong, Cheng Zhicong, Ren Zhaochun, Yin Dawei
- Codejudge-eval: Can Large Language Models Be Good Judges In Code Understanding? Zhao Yuwei, Luo Ziyang, Tian Yuchen, Lin Hongzhan, Yan Weixiang, Li Annan, Ma Jing
- Let Me Do It For You: Towards LLM Empowered Recommendation Via Tool Learning Zhao Yuyue, Wu Jiancan, Wang Xiang, Tang Wei, Wang Dingxian, De Rijke Maarten
- SWIFT: A Scalable Lightweight Infrastructure For Fine-tuning Zhao Yuze, Huang Jintao, Hu Jinghan, Wang Xingjun, Mao Yunlin, Zhang Daoze, Jiang Zeyinzi, Wu Zhikai, Ai Baole, Wang Ang, Zhou Wenmeng, Chen Yingda
- Adversarial Contrastive Decoding: Boosting Safety Alignment Of Large Language Models Via Opposite Prompt Optimization Zhao Zhengyue, Zhang Xiaoyun, Xu Kaidi, Hu Xing, Zhang Rui, Du Zidong, Guo Qi, Chen Yunji
- Chemdfm: Dialogue Foundation Model For Chemistry Zhao Zihan, Ma Da, Chen Lu, Sun Liangtai, Li Zihao, Xu Hongshen, Zhu Zichen, Zhu Su, Fan Shuai, Shen Guodong, Chen Xin, Yu Kai
- Loraretriever: Input-aware Lora Retrieval And Composition For Mixed Tasks In The Wild Zhao Ziyu, Gan Leilei, Wang Guoyin, Zhou Wangchunshu, Yang Hongxia, Kuang Kun, Wu Fei
- Leveraging Large Language Models With Chain-of-thought And Prompt Engineering For Traffic Crash Severity Analysis And Inference Zhen Hao, Shi Yucheng, Huang Yongcan, Yang Jidong J., Liu Ninghao
- ICE-GRT: Instruction Context Enhancement By Generative Reinforcement Based Transformers Zheng Chen, Sun Ke, Tang Da, Ma Yukun, Zhang Yuyu, Xi Chenguang, Zhou Xun
- Balancing Enhancement, Harmlessness, And General Capabilities: Enhancing Conversational Llms With Direct RLHF Zheng Chen, Sun Ke, Wu Hang, Xi Chenguang, Zhou Xun
- Mistral-c2f: Coarse To Fine Actor For Analytical And Reasoning Enhancement In RLHF And Effective-merged Llms Zheng Chen, Sun Ke, Zhou Xun
- Weak-to-strong Extrapolation Expedites Alignment Zheng Chujie, Wang Ziqi, Ji Heng, Huang Minlie, Peng Nanyun
- On Prompt-driven Safeguarding For Large Language Models Zheng Chujie, Yin Fan, Zhou Hao, Meng Fandong, Zhou Jie, Chang Kai-wei, Huang Minlie, Peng Nanyun
- Trustscore: Reference-free Evaluation Of LLM Response Trustworthiness Zheng Danna, Liu Danyang, Lapata Mirella, Pan Jeff Z.
- Learn To Be Efficient: Build Structured Sparsity In Large Language Models Zheng Haizhong, Bai Xiaoyan, Liu Xueshen, Mao Z. Morley, Chen Beidi, Lai Fan, Prakash Atul
- Ali-agent: Assessing Llms' Alignment With Human Values Via Agent-based Evaluation Zheng Jingnan, Wang Han, Zhang An, Nguyen Tai D., Sun Jun, Chua Tat-seng
- NEO-BENCH: Evaluating Robustness Of Large Language Models With Neologisms Zheng Jonathan, Ritter Alan, Xu Wei
- Towards Lifelong Learning Of Large Language Models: A Survey Zheng Junhao, Qiu Shengjie, Shi Chengming, Ma Qianli
- PAS: Data-efficient Plug-and-play Prompt Augmentation System Zheng Miao, Liang Hao, Yang Fan, Sun Haoze, Li Tianpeng, Xiong Lingchu, Zhang Yan, Wu Youzhen, Li Kun, Shen Yanjun, Lin Mingan, Zhang Tao, Dong Guosheng, Qiao Yujing, Fang Kun, Chen Weipeng, Cui Bin, Zhang Wentao, Zhou Zenan
- Clr-fact: Evaluating The Complex Logical Reasoning Capability Of Large Language Models Over Factual Knowledge Zheng Tianshi, Bai Jiaxin, Wang Yicheng, Fang Tianqing, Guo Yue, Yim Yauwai, Song Yangqiu
- More-3s: Multimodal-based Offline Reinforcement Learning With Shared Semantic Spaces Zheng Tianyu, Zhang Ge, Qu Xingwei, Kuang Ming, Huang Stephen W., He Zhaofeng
- Opencodeinterpreter: Integrating Code Generation With Execution And Refinement Zheng Tianyu, Zhang Ge, Shen Tianhao, Liu Xueling, Lin Bill Yuchen, Fu Jie, Chen Wenhu, Yue Xiang
- Harnessing Large Language Models For Text-rich Sequential Recommendation Zheng Zhi, Chao Wenshuo, Qiu Zhaopeng, Zhu Hengshu, Xiong Hui
- Attention Heads Of Large Language Models: A Survey Zheng Zifan, Wang Yezhaohui, Huang Yuxin, Song Shichao, Tang Bo, Xiong Feiyu, Li Zhiyu
- Training Large Language Models For Reasoning Through Reverse Curriculum Reinforcement Learning Xi Zhiheng, Chen Wenxiang, Hong Boyang, Jin Senjie, Zheng Rui, He Wei, Ding Yiwen, Liu Shichun, Guo Xin, Wang Junzhe, Guo Honglin, Shen Wei, Fan Xiaoran, Zhou Yuhao, Dou Shihan, Wang Xiao, Zhang Xinbo, Sun Peng, Gui Tao, Zhang Qi, Huang Xuanjing
- ROSE Doesn't Do That: Boosting The Safety Of Instruction-tuned Large Language Models With Reverse Prompt Contrastive Decoding Zhong Qihuang, Ding Liang, Liu Juhua, Du Bo, Tao Dacheng
- Moextend: Tuning New Experts For Modality And Task Extension Zhong Shanshan, Gao Shanghua, Huang Zhongzhan, Wen Wushao, Zitnik Marinka, Zhou Pan
- Policy Improvement Using Language Feedback Models Zhong Victor, Misra Dipendra, Yuan Xingdi, Côté Marc-alexandre
- Synthet2c: Generating Synthetic Data For Fine-tuning Large Language Models On The Text2cypher Task Zhong Ziije, Zhong Linqing, Sun Zhaoze, Jin Qingyun, Qin Zengchang, Zhang Xiaofan
- Misinforming Llms: Vulnerabilities, Challenges And Opportunities Zhou Bo, Geißler Daniel, Lukowicz Paul
- Unibias: Unveiling And Mitigating LLM Bias Through Internal Attention And FFN Manipulation Zhou Hanzhang, Feng Zijian, Zhu Zixiao, Qian Junlang, Mao Kezhi
- Sequence To Sequence Reward Modeling: Improving RLHF By Language Feedback Zhou Jiayi, Ji Jiaming, Dai Juntao, Yang Yaodong
- Orchestrating Llms With Different Personalizations Zhou Jin Peng, Luo Katie Z., Gu Jingwen, Yuan Jason, Weinberger Kilian Q., Sun Wen
- Language-based User Profiles For Recommendation Zhou Joyce, Dai Yijia, Joachims Thorsten
- Towards Democratizing Multilingual Large Language Models For Medicine Through A Two-stage Instruction Fine-tuning Approach Zhou Meng, Parmar Surajsinh, Bhatti Anubhav
- Image-of-thought Prompting For Visual Reasoning Refinement In Multimodal Large Language Models Zhou Qiji, Zhou Ruochen, Hu Zike, Lu Panzhong, Gao Siyang, Zhang Yue
- Enhancing The General Agent Capabilities Of Low-parameter Llms Through Tuning And Multi-branch Reasoning Zhou Qinhao, Zhang Zihan, Xiang Xiang, Wang Ke, Wu Yuchuan, Li Yongbin
- Evaluating The Smooth Control Of Attribute Intensity In Text Generation With Llms Zhou Shang, Yao Feng, Dong Chengyu, Wang Zihan, Shang Jingbo
- Cogmg: Collaborative Augmentation Between Large Language Model And Knowledge Graph Zhou Tong, Chen Yubo, Liu Kang, Zhao Jun
- Archer: Training Language Model Agents Via Hierarchical Multi-turn RL Zhou Yifei, Zanette Andrea, Pan Jiayi, Levine Sergey, Kumar Aviral
- Aligning Modalities In Vision Large Language Models Via Preference Fine-tuning Zhou Yiyang, Cui Chenhang, Rafailov Rafael, Finn Chelsea, Yao Huaxiu
- Calibrated Self-rewarding Vision Language Models Zhou Yiyang, Fan Zhiyuan, Cheng Dongjie, Yang Sihan, Chen Zhaorun, Cui Chenhang, Wang Xiyao, Li Yun, Zhang Linjun, Yao Huaxiu
- Multi-stage Balanced Distillation: Addressing Long-tail Challenges In Sequence-level Knowledge Distillation Zhou Yuhang, Zhu Jing, Xu Paiheng, Liu Xiaoyu, Wang Xiyao, Koutra Danai, Ai Wei, Huang Furong
- Defending Jailbreak Prompts Via In-context Adversarial Game Zhou Yujun, Han Yufei, Zhuang Haomin, Guo Kehan, Liang Zhenwen, Bao Hongyan, Zhang Xiangliang
- Emulated Disalignment: Safety Alignment For Large Language Models May Backfire! Zhou Zhanhui, Liu Jie, Dong Zhichen, Liu Jiaheng, Yang Chao, Ouyang Wanli, Qiao Yu
- How Alignment And Jailbreak Work: Explain LLM Safety Through Intermediate Hidden States Zhou Zhenhong, Yu Haiyang, Zhang Xinghua, Xu Rongwu, Huang Fei, Li Yongbin
- DETAIL: Task Demonstration Attribution For Interpretable In-context Learning Zhou Zijian, Lin Xiaoqiang, Xu Xinyi, Prakash Alok, Rus Daniela, Low Bryan Kian Hsiang
- Llmbind: A Unified Modality-task Integration Framework Zhu Bin, Ning Munan, Jin Peng, Lin Bin, Huang Jinfa, Song Qi, Zhang Junwu, Tang Zhenyu, Pan Mingjun, Zhou Xing, Yuan Li
- Lifelong Personalized Low-rank Adaptation Of Large Language Models For Recommendation Zhu Jiachen, Lin Jianghao, Dai Xinyi, Chen Bo, Shan Rong, Zhu Jieming, Tang Ruiming, Yu Yong, Zhang Weinan
- Eliciting The Priors Of Large Language Models Using Iterated In-context Learning Zhu Jian-qiao, Griffiths Thomas L.
- Dynamic Evaluation Of Large Language Models By Meta Probing Agents Zhu Kaijie, Wang Jindong, Zhao Qinlin, Xu Ruochen, Xie Xing
- LIME-M: Less Is More For Evaluation Of Mllms Zhu Kang, Zang Qianbo, Jia Shian, Wu Siwei, Fang Feiteng, Li Yizhi, Guo Shuyue, Zheng Tianyu, Li Bo, Wu Haoning, Qu Xingwei, Yang Jian, Liu Zachary, Yue Xiang, Liu J. H., Lin Chenghua, Yang Min, Ni Shiwen, Huang Wenhao, Zhang Ge
- Investigating Instruction Tuning Large Language Models On Graphs Zhu Kerui, Huang Bo-wei, Jin Bowen, Jiao Yizhu, Zhong Ming, Chang Kevin, Lin Shou-de, Han Jiawei
- LIRE: Listwise Reward Enhancement For Preference Alignment Zhu Mingye, Liu Yi, Zhang Lei, Guo Junbo, Mao Zhendong
- Personality Alignment Of Large Language Models Zhu Minjun, Yang Linyi, Zhang Yue
- Parameter-efficient Tuning Large Language Models For Graph Representation Learning Zhu Qi, Zheng Da, Song Xiang, Zhang Shichang, Jin Bowen, Sun Yizhou, Karypis George
- Scalable Matmul-free Language Modeling Zhu Rui-jie, Zhang Yu, Sifferman Ethan, Sheaves Tyler, Wang Yiqiao, Richmond Dustin, Zhou Peng, Eshraghian Jason K.
- Quite Good, But Not Enough: Nationality Bias In Large Language Models -- A Case Study Of Chatgpt Zhu Shucheng, Wang Weikang, Liu Ying
- Question Translation Training For Better Multilingual Reasoning Zhu Wenhao, Huang Shujian, Yuan Fei, She Shuaijie, Chen Jiajun, Birch Alexandra
- Multilingual Contrastive Decoding Via Language-agnostic Layers Skipping Zhu Wenhao, Liu Sizhe, Huang Shujian, She Shuaijie, Wendler Chris, Chen Jiajun
- How Privacy-savvy Are Large Language Models? A Case Study On Compliance And Privacy Technical Review Zhu Xichou, Liu Yang, Shen Zhou, Liu Yi, Li Min, Chen Yujun, John Benzi, Ma Zhenzhen, Hu Tao, Yang Bolong, Wang Manman, Xie Zongxing, Liu Peng, Cai Dan, Wang Junhui
- Can Large Language Models Understand Context? Zhu Yilun, Moniz Joel Ruben Antony, Bhargava Shruti, Lu Jiarui, Piraviperumal Dhivya, Li Site, Zhang Yuan, Yu Hong, Tseng Bo-hsiang
- Apt-pipe: A Prompt-tuning Tool For Social Data Annotation Using Chatgpt Zhu Yiming, Yin Zhizhuo, Tyson Gareth, Haq Ehsan-ul, Lee Lik-hang, Hui Pan
- Is Larger Always Better? Evaluating And Prompting Large Language Models For Non-generative Medical Tasks Zhu Yinghao, Gao Junyi, Wang Zixiang, Liao Weibin, Zheng Xiaochen, Liang Lifang, Wang Yasha, Pan Chengwei, Harrison Ewen M., Ma Liantao
- Prompting Large Language Models For Zero-shot Clinical Prediction With Structured Longitudinal Electronic Health Record Data Zhu Yinghao, Wang Zixiang, Gao Junyi, Tong Yuning, An Jingkun, Liao Weibin, Harrison Ewen M., Ma Liantao, Pan Chengwei
- Are Large Language Models Good Statisticians? Zhu Yizhang, Du Shiyin, Li Boyan, Luo Yuyu, Tang Nan
- Generative Pre-trained Speech Language Model With Efficient Hierarchical Transformer Zhu Yongxin, Su Dan, He Liqiang, Xu Linli, Yu Dong
- Accelerating Inference Of Retrieval-augmented Generation Via Sparse Context Selection Zhu Yun, Gu Jia-chen, Sikora Caitlin, Ko Ho, Liu Yinxiao, Lin Chu-cheng, Shu Lei, Luo Liangchen, Meng Lei, Liu Bang, Chen Jindong
- Knowagent: Knowledge-augmented Planning For Llm-based Agents Zhu Yuqi, Qiao Shuofei, Ou Yixin, Deng Shumin, Zhang Ningyu, Lyu Shiwei, Shen Yue, Liang Lei, Gu Jinjie, Chen Huajun
- Murar: A Simple And Effective Multimodal Retrieval And Answer Refinement Framework For Multimodal Question Answering Zhu Zhengyuan, Lee Daniel, Zhang Hong, Harsha Sai Sree, Feujio Loic, Maharaj Akash, Li Yunyao
- Understanding Privacy Risks Of Embeddings Induced By Large Language Models Zhu Zhihao, Shao Ninglu, Lian Defu, Wu Chenwang, Liu Zheng, Yang Yi, Chen Enhong
- Halueval-wild: Evaluating Hallucinations Of Language Models In The Wild Zhu Zhiying, Yang Yiming, Sun Zhiqing
- Sketch Then Generate: Providing Incremental User Feedback And Guiding LLM Code Generation Through Language-oriented Code Sketches Zhu-tian Chen, Xiong Zeyu, Yao Xiaoshuo, Glassman Elena
- Protllm: An Interleaved Protein-language LLM With Protein-as-word Pre-training Zhuo Le, Chi Zewen, Xu Minghao, Huang Heyan, Zheng Heqi, He Conghui, Mao Xian-ling, Zhang Wentao
- LLM4PM: A Case Study On Using Large Language Models For Process Modeling In Enterprise Organizations Ziche Clara, Apruzzese Giovanni
- Less Is More: Mitigating Multimodal Hallucination From An EOS Decision Perspective Zihao Yue, Liang Zhang, Qin Jin
- Vgbench: Evaluating Large Language Models On Vector Graphics Understanding And Generation Zou Bocheng, Cai Mu, Zhang Jianrui, Lee Yong Jae
- Promptintern: Saving Inference Costs By Internalizing Recurrent Prompt During Large Language Model Fine-tuning Zou Jiaru, Zhou Mengyu, Li Tao, Han Shi, Zhang Dongmei
- Enhancing Visual Question Answering Through Question-driven Image Captions As Prompts Özdemir Övgü, Akagündüz Erdem
- Systematic Task Exploration With Llms: A Study In Citation Text Generation Şahinuç Furkan, Kuznetsov Ilia, Hou Yufang, Gurevych Iryna
🏷 Responsible AI
- Learning To Deceive With Attention-based Explanations Pruthi Danish, Gupta Mansi, Dhingra Bhuwan, Neubig Graham, Lipton Zachary C.
- A Study On Multimodal And Interactive Explanations For Visual Question Answering Alipour Kamran, Schulze Jurgen P., Yao Yi, Ziskind Avi, Burachas Giedrius
- Context-guided BERT For Targeted Aspect-based Sentiment Analysis Wu Zhengxuan, Ong Desmond C.
- Scaling Language Models: Methods, Analysis & Insights From Training Gopher Jack W. Rae, Sebastian Borgeaud, Trevor Cai, Katie Millican, Jordan Hoffmann, Francis Song, John Aslanides, Sarah Henderson, Roman Ring, Susannah Young, Eliza Rutherford, Tom Hennigan, Jacob Menick, Albin Cassirer, Richard Powell, George Van Den Driessche, Lisa Anne Hendricks, Maribeth Rauh, Po-sen Huang, Amelia Glaese, Johannes Welbl, Sumanth Dathathri, Saffron Huang, Jonathan Uesato, John Mellor, Irina Higgins, Antonia Creswell, Nat Mcaleese, Amy Wu, Erich Elsen, Siddhant Jayakumar, Elena Buchatskaya, David Budden, Esme Sutherland, Karen Simonyan, Michela Paganini, Laurent Sifre, Lena Martens, Xiang Lorraine Li, Adhiguna Kuncoro, Aida Nematzadeh, Elena Gribovskaya, Domenic Donato, Angeliki Lazaridou, Arthur Mensch, Jean-baptiste Lespiau, Maria Tsimpoukelli, Nikolai Grigorev, Doug Fritz, Thibault Sottiaux, Mantas Pajarskas, Toby Pohlen, Zhitao Gong, Daniel Toyama, Cyprien De Masson D'autume, Yujia Li, Tayfun Terzi, Vladimir Mikulik, Igor Babuschkin, Aidan Clark, Diego De Las Casas, Aurelia Guy, Chris Jones, James Bradbury, Matthew Johnson, Blake Hechtman, Laura Weidinger, Iason Gabriel, William Isaac, Ed Lockhart, Simon Osindero, Laura Rimell, Chris Dyer, Oriol Vinyals, Kareem Ayoub, Jeff Stanway, Lorrayne Bennett, Demis Hassabis, Koray Kavukcuoglu, Geoffrey Irving
- Evaluating Large Language Models Trained On Code Mark Chen, Jerry Tworek, Heewoo Jun, Qiming Yuan, Henrique Ponde De Oliveira Pinto, Jared Kaplan, Harri Edwards, Yuri Burda, Nicholas Joseph, Greg Brockman, Alex Ray, Raul Puri, Gretchen Krueger, Michael Petrov, Heidy Khlaaf, Girish Sastry, Pamela Mishkin, Brooke Chan, Scott Gray, Nick Ryder, Mikhail Pavlov, Alethea Power, Lukasz Kaiser, Mohammad Bavarian, Clemens Winter, Philippe Tillet, Felipe Petroski Such, Dave Cummings, Matthias Plappert, Fotios Chantzis, Elizabeth Barnes, Ariel Herbert-voss, William Hebgen Guss, Alex Nichol, Alex Paino, Nikolas Tezak, Jie Tang, Igor Babuschkin, Suchir Balaji, Shantanu Jain, William Saunders, Christopher Hesse, Andrew N. Carr, Jan Leike, Josh Achiam, Vedant Misra, Evan Morikawa, Alec Radford, Matthew Knight, Miles Brundage, Mira Murati, Katie Mayer, Peter Welinder, Bob Mcgrew, Dario Amodei, Sam Mccandlish, Ilya Sutskever, Wojciech Zaremba
- Despite "super-human" Performance, Current Llms Are Unsuited For Decisions About Ethics And Safety Albrecht Joshua, Kitanidis Ellie, Fetterman Abraham J.
- Selection-inference: Exploiting Large Language Models For Interpretable Logical Reasoning Antonia Creswell, Murray Shanahan, Irina Higgins
- BLOOM: A 176b-parameter Open-access Multilingual Language Model Bigscience Workshop, Teven Le Scao, Angela Fan, Christopher Akiki, Ellie Pavlick, Suzana Ilić, Daniel Hesslow, Roman Castagné, Alexandra Sasha Luccioni, François Yvon, Matthias Gallé, Jonathan Tow, Alexander M. Rush, Stella Biderman, Albert Webson, Pawan Sasanka Ammanamanchi, Thomas Wang, Benoît Sagot, Niklas Muennighoff, Albert Villanova Del Moral, Olatunji Ruwase, Rachel Bawden, Stas Bekman, Angelina Mcmillan-major, Iz Beltagy, Huu Nguyen, Lucile Saulnier, Samson Tan, Pedro Ortiz Suarez, Victor Sanh, Hugo Laurençon, Yacine Jernite, Julien Launay, Margaret Mitchell, Colin Raffel, Aaron Gokaslan, Adi Simhi, Aitor Soroa, Alham Fikri Aji, Amit Alfassy, Anna Rogers, Ariel Kreisberg Nitzav, Canwen Xu, Chenghao Mou, Chris Emezue, Christopher Klamm, Colin Leong, Daniel Van Strien, David Ifeoluwa Adelani, Dragomir Radev, Eduardo González Ponferrada, Efrat Levkovizh, Ethan Kim, Eyal Bar Natan, Francesco De Toni, Gérard Dupont, Germán Kruszewski, Giada Pistilli, Hady Elsahar, Hamza Benyamina, Hieu Tran, Ian Yu, Idris Abdulmumin, Isaac Johnson, Itziar Gonzalez-dios, Javier De La Rosa, Jenny Chim, Jesse Dodge, Jian Zhu, Jonathan Chang, Jörg Frohberg, Joseph Tobing, Joydeep Bhattacharjee, Khalid Almubarak, Kimbo Chen, Kyle Lo, Leandro Von Werra, Leon Weber, Long Phan, Loubna Ben Allal, Ludovic Tanguy, Manan Dey, Manuel Romero Muñoz, Maraim Masoud, María Grandury, Mario Šaško, Max Huang, Maximin Coavoux, Mayank Singh, Mike Tian-jian Jiang, Minh Chien Vu, Mohammad A. Jauhar, Mustafa Ghaleb, Nishant Subramani, Nora Kassner, Nurulaqilla Khamis, Olivier Nguyen, Omar Espejel, Ona De Gibert, Paulo Villegas, Peter Henderson, Pierre Colombo, Priscilla Amuok, Quentin Lhoest, Rheza Harliman, Rishi Bommasani, Roberto Luis López, Rui Ribeiro, Salomey Osei, Sampo Pyysalo, Sebastian Nagel, Shamik Bose, Shamsuddeen Hassan Muhammad, Shanya Sharma, Shayne Longpre, Somaieh Nikpoor, Stanislav Silberberg, Suhas Pai, Sydney Zink, Tiago Timponi Torrent, Timo Schick, Tristan Thrush, Valentin Danchev, Vassilina Nikoulina, Veronika Laippala, Violette Lepercq, Vrinda Prabhu, Zaid Alyafeai, Zeerak Talat, Arun Raja, Benjamin Heinzerling, Chenglei Si, Davut Emre Taşar, Elizabeth Salesky, Sabrina J. Mielke, Wilson Y. Lee, Abheesht Sharma, Andrea Santilli, Antoine Chaffin, Arnaud Stiegler, Debajyoti Datta, Eliza Szczechla, Gunjan Chhablani, Han Wang, Harshit Pandey, Hendrik Strobelt, Jason Alan Fries, Jos Rozen, Leo Gao, Lintang Sutawika, M Saiful Bari, Maged S. Al-shaibani, Matteo Manica, Nihal Nayak, Ryan Teehan, Samuel Albanie, Sheng Shen, Srulik Ben-david, Stephen H. Bach, Taewoon Kim, Tali Bers, Thibault Fevry, Trishala Neeraj, Urmish Thakker, Vikas Raunak, Xiangru Tang, Zheng-xin Yong, Zhiqing Sun, Shaked Brody, Yallow Uri, Hadar Tojarieh, Adam Roberts, Hyung Won Chung, Jaesung Tae, Jason Phang, Ofir Press, Conglong Li, Deepak Narayanan, Hatim Bourfoune, Jared Casper, Jeff Rasley, Max Ryabinin, Mayank Mishra, Minjia Zhang, Mohammad Shoeybi, Myriam Peyrounette, Nicolas Patry, Nouamane Tazi, Omar Sanseviero, Patrick Von Platen, Pierre Cornette, Pierre François Lavallée, Rémi Lacroix, Samyam Rajbhandari, Sanchit Gandhi, Shaden Smith, Stéphane Requena, Suraj Patil, Tim Dettmers, Ahmed Baruwa, Amanpreet Singh, Anastasia Cheveleva, Anne-laure Ligozat, Arjun Subramonian, Aurélie Névéol, Charles Lovering, Dan Garrette, Deepak Tunuguntla, Ehud Reiter, Ekaterina Taktasheva, Ekaterina Voloshina, Eli Bogdanov, Genta Indra Winata, Hailey Schoelkopf, Jan-christoph Kalo, Jekaterina Novikova, Jessica Zosa Forde, Jordan Clive, Jungo Kasai, Ken Kawamura, Liam Hazan, Marine Carpuat, Miruna Clinciu, Najoung Kim, Newton Cheng, Oleg Serikov, Omer Antverg, Oskar Van Der Wal, Rui Zhang, Ruochen Zhang, Sebastian Gehrmann, Shachar Mirkin, Shani Pais, Tatiana Shavrina, Thomas Scialom, Tian Yun, Tomasz Limisiewicz, Verena Rieser, Vitaly Protasov, Vladislav Mikhailov, Yada Pruksachatkun, Yonatan Belinkov, Zachary Bamberger, Zdeněk Kasner, Alice Rueda, Amanda Pestana, Amir Feizpour, Ammar Khan, Amy Faranak, Ana Santos, Anthony Hevia, Antigona Unldreaj, Arash Aghagol, Arezoo Abdollahi, Aycha Tammour, Azadeh Hajihosseini, Bahareh Behroozi, Benjamin Ajibade, Bharat Saxena, Carlos Muñoz Ferrandis, Daniel Mcduff, Danish Contractor, David Lansky, Davis David, Douwe Kiela, Duong A. Nguyen, Edward Tan, Emi Baylor, Ezinwanne Ozoani, Fatima Mirza, Frankline Ononiwu, Habib Rezanejad, Hessie Jones, Indrani Bhattacharya, Irene Solaiman, Irina Sedenko, Isar Nejadgholi, Jesse Passmore, Josh Seltzer, Julio Bonis Sanz, Livia Dutra, Mairon Samagaio, Maraim Elbadri, Margot Mieskes, Marissa Gerchick, Martha Akinlolu, Michael Mckenna, Mike Qiu, Muhammed Ghauri, Mykola Burynok, Nafis Abrar, Nazneen Rajani, Nour Elkott, Nour Fahmy, Olanrewaju Samuel, Ran An, Rasmus Kromann, Ryan Hao, Samira Alizadeh, Sarmad Shubber, Silas Wang, Sourav Roy, Sylvain Viguier, Thanh Le, Tobi Oyebade, Trieu Le, Yoyo Yang, Zach Nguyen, Abhinav Ramesh Kashyap, Alfredo Palasciano, Alison Callahan, Anima Shukla, Antonio Miranda-escalada, Ayush Singh, Benjamin Beilharz, Bo Wang, Caio Brito, Chenxi Zhou, Chirag Jain, Chuxin Xu, Clémentine Fourrier, Daniel León Periñán, Daniel Molano, Dian Yu, Enrique Manjavacas, Fabio Barth, Florian Fuhrimann, Gabriel Altay, Giyaseddin Bayrak, Gully Burns, Helena U. Vrabec, Imane Bello, Ishani Dash, Jihyun Kang, John Giorgi, Jonas Golde, Jose David Posada, Karthik Rangasai Sivaraman, Lokesh Bulchandani, Lu Liu, Luisa Shinzato, Madeleine Hahn De Bykhovetz, Maiko Takeuchi, Marc Pàmies, Maria A Castillo, Marianna Nezhurina, Mario Sänger, Matthias Samwald, Michael Cullan, Michael Weinberg, Michiel De Wolf, Mina Mihaljcic, Minna Liu, Moritz Freidank, Myungsun Kang, Natasha Seelam, Nathan Dahlberg, Nicholas Michio Broad, Nikolaus Muellner, Pascale Fung, Patrick Haller, Ramya Chandrasekhar, Renata Eisenberg, Robert Martin, Rodrigo Canalli, Rosaline Su, Ruisi Su, Samuel Cahyawijaya, Samuele Garda, Shlok S Deshmukh, Shubhanshu Mishra, Sid Kiblawi, Simon Ott, Sinee Sang-aroonsiri, Srishti Kumar, Stefan Schweter, Sushil Bharati, Tanmay Laud, Théo Gigant, Tomoya Kainuma, Wojciech Kusa, Yanis Labrak, Yash Shailesh Bajaj, Yash Venkatraman, Yifan Xu, Yingxin Xu, Yu Xu, Zhe Tan, Zhongli Xie, Zifan Ye, Mathilde Bras, Younes Belkada, Thomas Wolf
- Prompting GPT-3 To Be Reliable Chenglei Si, Zhe Gan, Zhengyuan Yang, Shuohang Wang, Jianfeng Wang, Jordan Boyd-graber, Lijuan Wang
- Evaluating Psychological Safety Of Large Language Models Li Xingxuan, Li Yutong, Qiu Lin, Joty Shafiq, Bing Lidong
- No Language Left Behind: Scaling Human-centered Machine Translation Nllb Team, Marta R. Costa-jussà, James Cross, Onur Çelebi, Maha Elbayad, Kenneth Heafield, Kevin Heffernan, Elahe Kalbassi, Janice Lam, Daniel Licht, Jean Maillard, Anna Sun, Skyler Wang, Guillaume Wenzek, Al Youngblood, Bapi Akula, Loic Barrault, Gabriel Mejia Gonzalez, Prangthip Hansanti, John Hoffman, Semarley Jarrett, Kaushik Ram Sadagopan, Dirk Rowe, Shannon Spruit, Chau Tran, Pierre Andrews, Necip Fazil Ayan, Shruti Bhosale, Sergey Edunov, Angela Fan, Cynthia Gao, Vedanuj Goswami, Francisco Guzmán, Philipp Koehn, Alexandre Mourachko, Christophe Ropers, Safiyyah Saleem, Holger Schwenk, Jeff Wang
- Lamda: Language Models For Dialog Applications Romal Thoppilan, Daniel De Freitas, Jamie Hall, Noam Shazeer, Apoorv Kulshreshtha, Heng-tze Cheng, Alicia Jin, Taylor Bos, Leslie Baker, Yu Du, Yaguang Li, Hongrae Lee, Huaixiu Steven Zheng, Amin Ghafouri, Marcelo Menegali, Yanping Huang, Maxim Krikun, Dmitry Lepikhin, James Qin, Dehao Chen, Yuanzhong Xu, Zhifeng Chen, Adam Roberts, Maarten Bosma, Vincent Zhao, Yanqi Zhou, Chung-ching Chang, Igor Krivokon, Will Rusch, Marc Pickett, Pranesh Srinivasan, Laichee Man, Kathleen Meier-hellstern, Meredith Ringel Morris, Tulsee Doshi, Renelito Delos Santos, Toju Duke, Johnny Soraker, Ben Zevenbergen, Vinodkumar Prabhakaran, Mark Diaz, Ben Hutchinson, Kristen Olson, Alejandra Molina, Erin Hoffman-john, Josh Lee, Lora Aroyo, Ravi Rajakumar, Alena Butryna, Matthew Lamm, Viktoriya Kuzmina, Joe Fenton, Aaron Cohen, Rachel Bernstein, Ray Kurzweil, Blaise Aguera-arcas, Claire Cui, Marian Croak, Ed Chi, Quoc Le
- Multi-party Goal Tracking With Llms: Comparing Pre-training, Fine-tuning, And Prompt Engineering Addlesee Angus, Sieińska Weronika, Gunson Nancie, Garcia Daniel Hernández, Dondrup Christian, Lemon Oliver
- KITLM: Domain-specific Knowledge Integration Into Language Models For Question Answering Agarwal Ankush, Gawade Sakharam, Azad Amar Prakash, Bhattacharyya Pushpak
- The Poison Of Alignment Bekbayev Aibek, Chun Sungbae, Dulat Yerzat, Yamazaki James
- Language Model Unalignment: Parametric Red-teaming To Expose Hidden Harms And Biases Bhardwaj Rishabh, Poria Soujanya
- Red-teaming Large Language Models Using Chain Of Utterances For Safety-alignment Bhardwaj Rishabh, Poria Soujanya
- Safety-tuned Llamas: Lessons From Improving The Safety Of Large Language Models That Follow Instructions Bianchi Federico, Suzgun Mirac, Attanasio Giuseppe, Röttger Paul, Jurafsky Dan, Hashimoto Tatsunori, Zou James
- Personality Testing Of Large Language Models: Limited Temporal Stability, But Highlighted Prosociality Bodroza Bojana, Dinic Bojana M., Bojic Ljubisa
- Successor Features For Efficient Multisubject Controlled Text Generation Cao Meng, Fatemi Mehdi, Cheung Jackie Chi Kit, Shabanian Samira
- Gaining Wisdom From Setbacks: Aligning Large Language Models Via Mistake Analysis Chen Kai, Wang Chunwei, Yang Kuo, Han Jianhua, Hong Lanqing, Mi Fei, Xu Hang, Liu Zhengying, Huang Wenyong, Li Zhenguo, Yeung Dit-yan, Shang Lifeng, Jiang Xin, Liu Qun
- Challenges And Contributing Factors In The Utilization Of Large Language Models (llms) Chen Xiaoliang, Li Liangbin, Chang Le, Huang Yunhe, Zhao Yuxuan, Zhang Yuxiao, Li Dinuo
- Lmexplainer: Grounding Knowledge And Explaining Language Models Chen Zichen, Chen Jianda, Chen Yuanyuan, Yu Han, Singh Ambuj K, Sra Misha
- An Integrative Survey On Mental Health Conversational Agents To Bridge Computer Science And Medical Perspectives Cho Young Min, Rai Sunny, Ungar Lyle, Sedoc João, Guntuku Sharath Chandra
- Whose Text Is It Anyway? Exploring Bigcode, Intellectual Property, And Ethics Choksi Madiha Zahrah, Goedicke David
- Investigating Hallucinations In Pruned Large Language Models For Abstractive Summarization Chrysostomou George, Zhao Zhixue, Williams Miles, Aletras Nikolaos
- Dr. Jekyll And Mr. Hyde: Two Faces Of Llms Collu Matteo Gioele, Janssen-groesbeek Tom, Koffas Stefanos, Conti Mauro, Picek Stjepan
- Seamlessm4t: Massively Multilingual & Multimodal Machine Translation Communication Seamless, Barrault Loïc, Chung Yu-an, Meglioli Mariano Cora, Dale David, Dong Ning, Duquenne Paul-ambroise, Elsahar Hady, Gong Hongyu, Heffernan Kevin, Hoffman John, Klaiber Christopher, Li Pengwei, Licht Daniel, Maillard Jean, Rakotoarison Alice, Sadagopan Kaushik Ram, Wenzek Guillaume, Ye Ethan, Akula Bapi, Chen Peng-jen, Hachem Naji El, Ellis Brian, Gonzalez Gabriel Mejia, Haaheim Justin, Hansanti Prangthip, Howes Russ, Huang Bernie, Hwang Min-jae, Inaguma Hirofumi, Jain Somya, Kalbassi Elahe, Kallet Amanda, Kulikov Ilia, Lam Janice, Li Daniel, Ma Xutai, Mavlyutov Ruslan, Peloquin Benjamin, Ramadan Mohamed, Ramakrishnan Abinesh, Sun Anna, Tran Kevin, Tran Tuan, Tufanov Igor, Vogeti Vish, Wood Carleigh, Yang Yilin, Yu Bokai, Andrews Pierre, Balioglu Can, Costa-jussà Marta R., Celebi Onur, Elbayad Maha, Gao Cynthia, Guzmán Francisco, Kao Justine, Lee Ann, Mourachko Alexandre, Pino Juan, Popuri Sravya, Ropers Christophe, Saleem Safiyyah, Schwenk Holger, Tomasello Paden, Wang Changhan, Wang Jeff, Wang Skyler
- Receive, Reason, And React: Drive As You Say With Large Language Models In Autonomous Vehicles Cui Can, Ma Yunsheng, Cao Xu, Ye Wenqian, Wang Ziran
- Personalized Autonomous Driving With Large Language Models: Field Experiments Cui Can, Yang Zichong, Zhou Yupeng, Ma Yunsheng, Lu Juanwu, Li Lingxi, Chen Yaobin, Panchal Jitesh, Wang Ziran
- Complex QA And Language Models Hybrid Architectures, Survey Daull Xavier, Bellot Patrice, Bruno Emmanuel, Martin Vincent, Murisasco Elisabeth
- Multilingual Jailbreak Challenges In Large Language Models Deng Yue, Zhang Wenxuan, Pan Sinno Jialin, Bing Lidong
- Can Chatgpt Read Who You Are? Derner Erik, Kučera Dalibor, Oliver Nuria, Zahálka Jan
- Toxicity In Chatgpt: Analyzing Persona-assigned Language Models Deshpande Ameet, Murahari Vishvak, Rajpurohit Tanmay, Kalyan Ashwin, Narasimhan Karthik
- Can Large Language Models Assist In Hazard Analysis? Diemert Simon, Weber Jens H
- Denevil: Towards Deciphering And Navigating The Ethical Values Of Large Language Models Via Instruction Learning Duan Shitong, Yi Xiaoyuan, Zhang Peng, Lu Tun, Xie Xing, Gu Ning
- Should Chatgpt Be Biased? Challenges And Risks Of Bias In Large Language Models Ferrara Emilio
- Matching Pairs: Attributing Fine-tuned Models To Their Pre-trained Large Language Models Foley Myles, Rawat Ambrish, Lee Taesung, Hou Yufang, Picco Gabriele, Zizzo Giulio
- Inca: Rethinking In-car Conversational System Assessment Leveraging Large Language Models Friedl Ken E., Khan Abbas Goher, Sahoo Soumya Ranjan, Rony Md Rashad Al Hasan, Germies Jana, Süß Christian
- Safety Alignment In NLP Tasks: Weakly Aligned Summarization As An In-context Attack Fu Yu, Li Yufei, Xiao Wen, Liu Cong, Dong Yue
- Badllama: Cheaply Removing Safety Fine-tuning From Llama 2-chat 13B Gade Pranav, Lermen Simon, Rogers-smith Charlie, Ladish Jeffrey
- Faithful Explanations Of Black-box NLP Models Using Llm-generated Counterfactuals Gat Yair, Calderon Nitay, Feder Amir, Chapanin Alexander, Sharma Amit, Reichart Roi
- Building Trustworthy Neurosymbolic AI Systems: Consistency, Reliability, Explainability, And Safety Gaur Manas, Sheth Amit
- MART: Improving LLM Safety With Multi-round Automatic Red-teaming Ge Suyu, Zhou Chunting, Hou Rui, Khabsa Madian, Wang Yi-chia, Wang Qifan, Han Jiawei, Mao Yuning
- Figstep: Jailbreaking Large Vision-language Models Via Typographic Visual Prompts Gong Yichen, Ran Delong, Liu Jinyuan, Wang Conglei, Cong Tianshuo, Wang Anyu, Duan Sisi, Wang Xiaoyun
- Using Large Language Models For Interpreting Autonomous Robots Behaviors González-santamarta Miguel A., Fernández-becerra Laura, Sobrín-hidalgo David, Guerrero-higueras Ángel Manuel, González Irene, Lera Francisco J. Rodríguez
- Dual Use Concerns Of Generative AI And Large Language Models Grinbaum Alexei, Adomaitis Laurynas
- Hallucinations In Large Multilingual Translation Models Guerreiro Nuno M., Alves Duarte, Waldendorf Jonas, Haddow Barry, Birch Alexandra, Colombo Pierre, Martins André F. T.
- Evaluating Large Language Models: A Comprehensive Survey Guo Zishan, Jin Renren, Liu Chuang, Huang Yufei, Shi Dan, Supryadi, Yu Linhao, Liu Yan, Li Jiaxuan, Xiong Bojian, Xiong Deyi
- Personalisation Within Bounds: A Risk Taxonomy And Policy Framework For The Alignment Of Large Language Models With Personalised Feedback Hannah Rose Kirk, Bertie Vidgen, Paul Röttger, Scott A. Hale
- Unbiased Watermark For Large Language Models Hu Zhengmian, Chen Lichang, Wu Xidong, Wu Yihan, Zhang Hongyang, Huang Heng
- Who Is Chatgpt? Benchmarking Llms' Psychological Portrayal Using Psychobench Huang Jen-tse, Wang Wenxuan, Li Eric John, Lam Man Ho, Ren Shujie, Yuan Youliang, Jiao Wenxiang, Tu Zhaopeng, Lyu Michael R.
- Grounded Decoding: Guiding Text Generation With Grounded Models For Embodied Agents Huang Wenlong, Xia Fei, Shah Dhruv, Driess Danny, Zeng Andy, Lu Yao, Florence Pete, Mordatch Igor, Levine Sergey, Hausman Karol, Ichter Brian
- Catastrophic Jailbreak Of Open-source Llms Via Exploiting Generation Huang Yangsibo, Gupta Samyak, Xia Mengzhou, Li Kai, Chen Danqi
- Llama 2: Open Foundation And Fine-tuned Chat Models Hugo Touvron, Louis Martin, Kevin Stone, Peter Albert, Amjad Almahairi, Yasmine Babaei, Nikolay Bashlykov, Soumya Batra, Prajjwal Bhargava, Shruti Bhosale, Dan Bikel, Lukas Blecher, Cristian Canton Ferrer, Moya Chen, Guillem Cucurull, David Esiobu, Jude Fernandes, Jeremy Fu, Wenyin Fu, Brian Fuller, Cynthia Gao, Vedanuj Goswami, Naman Goyal, Anthony Hartshorn, Saghar Hosseini, Rui Hou, Hakan Inan, Marcin Kardas, Viktor Kerkez, Madian Khabsa, Isabel Kloumann, Artem Korenev, Punit Singh Koura, Marie-anne Lachaux, Thibaut Lavril, Jenya Lee, Diana Liskovich, Yinghai Lu, Yuning Mao, Xavier Martinet, Todor Mihaylov, Pushkar Mishra, Igor Molybog, Yixin Nie, Andrew Poulton, Jeremy Reizenstein, Rashi Rungta, Kalyan Saladi, Alan Schelten, Ruan Silva, Eric Michael Smith, Ranjan Subramanian, Xiaoqing Ellen Tan, Binh Tang, Ross Taylor, Adina Williams, Jian Xiang Kuan, Puxin Xu, Zheng Yan, Iliyan Zarov, Yuchen Zhang, Angela Fan, Melanie Kambadur, Sharan Narang, Aurelien Rodriguez, Robert Stojnic, Sergey Edunov, Thomas Scialom
- Walking A Tightrope -- Evaluating Large Language Models In High-risk Domains Hung Chia-chien, Rim Wiem Ben, Frost Lindsay, Bruckner Lars, Lawrence Carolin
- Llama Guard: Llm-based Input-output Safeguard For Human-ai Conversations Inan Hakan, Upasani Kartikeya, Chi Jianfeng, Rungta Rashi, Iyer Krithika, Mao Yuning, Tontchev Michael, Hu Qing, Fuller Brian, Testuggine Davide, Khabsa Madian
- Neuro Symbolic Reasoning For Planning: Counterexample Guided Inductive Synthesis Using Large Language Models And Satisfiability Solving Jha Sumit Kumar, Jha Susmit, Lincoln Patrick, Bastian Nathaniel D., Velasquez Alvaro, Ewetz Rickard, Neema Sandeep
- Prompt Packer: Deceiving Llms Through Compositional Instruction With Hidden Attacks Jiang Shuyu, Chen Xingshu, Tang Rui
- How Can Recommender Systems Benefit From Large Language Models: A Survey Jianghao Lin, Xinyi Dai, Yunjia Xi, Weiwen Liu, Bo Chen, Hao Zhang, Yong Liu, Chuhan Wu, Xiangyang Li, Chenxu Zhu, Huifeng Guo, Yong Yu, Ruiming Tang, Weinan Zhang
- Evaluation And Enhancement Of Semantic Grounding In Large Vision-language Models Jiaying Lu, Jinmeng Rao, Kezhen Chen, Xiaoyuan Guo, Yawen Zhang, Baochen Sun, Carl Yang, Jie Yang
- Better To Ask In English: Cross-lingual Evaluation Of Large Language Models For Healthcare Queries Jin Yiqiao, Chandra Mohit, Verma Gaurav, Hu Yibo, De Choudhury Munmun, Kumar Srijan
- Appropriateness Is All You Need! Kempt Hendrik, Lavie Alon, Nagel Saskia K.
- How Secure Is Code Generated By Chatgpt? Khoury Raphaël, Avila Anderson R., Brunelle Jacob, Camara Baba Mamadou
- Auto-survey Challenge Khuong Thanh Gia Hieu, Rachmat Benedictus Kent
- Understanding Catastrophic Forgetting In Language Models Via Implicit Inference Kotha Suhas, Springer Jacob Mitchell, Raghunathan Aditi
- Certifying LLM Safety Against Adversarial Prompting Kumar Aounon, Agarwal Chirag, Srinivas Suraj, Li Aaron Jiaxun, Feizi Soheil, Lakkaraju Himabindu
- Instruction-following Speech Recognition Lai Cheng-i Jeff, Lu Zhiyun, Cao Liangliang, Pang Ruoming
- Open Sesame! Universal Black Box Jailbreaking Of Large Language Models Lapid Raz, Langberg Ron, Sipper Moshe
- The Importance Of Prompt Tuning For Automated Neuron Explanations Lee Justin, Oikarinen Tuomas, Chatha Arjun, Chang Keng-chi, Chen Yilan, Weng Tsui-wei
- Do We Still Need Clinical Language Models? Lehman Eric, Hernandez Evan, Mahajan Diwakar, Wulff Jonas, Smith Micah J., Ziegler Zachary, Nadler Daniel, Szolovits Peter, Johnson Alistair, Alsentzer Emily
- Chatgpt: A Meta-analysis After 2.5 Months Leiter Christoph, Zhang Ran, Chen Yanran, Belouadi Jonas, Larionov Daniil, Fresen Vivian, Eger Steffen
- Lora Fine-tuning Efficiently Undoes Safety Training In Llama 2-chat 70B Lermen Simon, Rogers-smith Charlie, Ladish Jeffrey
- Explaincpe: A Free-text Explanation Benchmark Of Chinese Pharmacist Examination Li Dongfang, Yu Jindi, Hu Baotian, Xu Zhenran, Zhang Min
- Privacy In Large Language Models: Attacks, Defenses And Future Directions Li Haoran, Chen Yulin, Luo Jinglong, Kang Yan, Zhang Xiaojin, Hu Qi, Chan Chunkit, Song Yangqiu
- Starcoder: May The Source Be With You! Li Raymond, Allal Loubna Ben, Zi Yangtian, Muennighoff Niklas, Kocetkov Denis, Mou Chenghao, Marone Marc, Akiki Christopher, Li Jia, Chim Jenny, Liu Qian, Zheltonozhskii Evgenii, Zhuo Terry Yue, Wang Thomas, Dehaene Olivier, Davaadorj Mishig, Lamy-poirier Joel, Monteiro João, Shliazhko Oleh, Gontier Nicolas, Meade Nicholas, Zebaze Armel, Yee Ming-ho, Umapathi Logesh Kumar, Zhu Jian, Lipkin Benjamin, Oblokulov Muhtasham, Wang Zhiruo, Murthy Rudra, Stillerman Jason, Patel Siva Sankalp, Abulkhanov Dmitry, Zocca Marco, Dey Manan, Zhang Zhihan, Fahmy Nour, Bhattacharyya Urvashi, Yu Wenhao, Singh Swayam, Luccioni Sasha, Villegas Paulo, Kunakov Maxim, Zhdanov Fedor, Romero Manuel, Lee Tony, Timor Nadav, Ding Jennifer, Schlesinger Claire, Schoelkopf Hailey, Ebert Jan, Dao Tri, Mishra Mayank, Gu Alex, Robinson Jennifer, Anderson Carolyn Jane, Dolan-gavitt Brendan, Contractor Danish, Reddy Siva, Fried Daniel, Bahdanau Dzmitry, Jernite Yacine, Ferrandis Carlos Muñoz, Hughes Sean, Wolf Thomas, Guha Arjun, Von Werra Leandro, De Vries Harm
- RAIN: Your Language Models Can Align Themselves Without Finetuning Li Yuhui, Wei Fangyun, Zhao Jinjing, Zhang Chao, Zhang Hongyang
- AI Transparency In The Age Of Llms: A Human-centered Research Roadmap Liao Q. Vera, Vaughan Jennifer Wortman
- Opening Up Chatgpt: Tracking Openness, Transparency, And Accountability In Instruction-tuned Text Generators Liesenfeld Andreas, Lopez Alianda, Dingemanse Mark
- In-context Vectors: Making In Context Learning More Effective And Controllable Through Latent Space Steering Liu Sheng, Ye Haotian, Xing Lei, Zou James
- Aligning Large Language Models With Human Preferences Through Representation Engineering Liu Wenhao, Wang Xiaohua, Wu Muling, Li Tianlong, Lv Changze, Ling Zixuan, Zhu Jianhao, Zhang Cenyuan, Zheng Xiaoqing, Huang Xuanjing
- Mm-safetybench: A Benchmark For Safety Evaluation Of Multimodal Large Language Models Liu Xin, Zhu Yichen, Gu Jindong, Lan Yunshi, Yang Chao, Qiao Yu
- Trustworthy Llms: A Survey And Guideline For Evaluating Large Language Models' Alignment Liu Yang, Yao Yuanshun, Ton Jean-francois, Zhang Xiaoying, Guo Ruocheng, Cheng Hao, Klochkov Yegor, Taufiq Muhammad Faaiz, Li Hang
- Robustness Over Time: Understanding Adversarial Examples' Effectiveness On Longitudinal Versions Of Large Language Models Liu Yugeng, Cong Tianshuo, Zhao Zhengyu, Backes Michael, Shen Yun, Zhang Yang
- Safer Conversational AI As A Source Of User Delight Lu Xiaoding, Korshuk Aleksey, Liu Zongyi, Beauchamp William, Research Chai
- Chatgpt And A New Academic Reality: Artificial Intelligence-written Research Papers And The Ethics Of The Large Language Models In Scholarly Publishing Lund Brady, Wang Ting, Mannuru Nishith Reddy, Nie Bing, Shimray Somipam, Wang Ziang
- "oops, Did I Just Say That?" Testing And Repairing Unethical Suggestions Of Large Language Models With Suggest-critique-reflect Process Ma Pingchuan, Li Zongjie, Sun Ao, Wang Shuai
- Interpreting Learned Feedback Patterns In Large Language Models Marks Luke, Abdullah Amir, Neo Clement, Arike Rauno, Krueger David, Torr Philip, Barez Fazl
- JAB: Joint Adversarial Prompting And Belief Augmentation Mehrabi Ninareh, Goyal Palash, Ramakrishna Anil, Dhamala Jwala, Ghosh Shalini, Zemel Richard, Chang Kai-wei, Galstyan Aram, Gupta Rahul
- ALMANACS: A Simulatability Benchmark For Language Model Explainability Mills Edmund, Su Shiye, Russell Stuart, Emmons Scott
- How Trustworthy Are Open-source Llms? An Assessment Under Malicious Demonstrations Shows Their Vulnerabilities Mo Lingbo, Wang Boshi, Chen Muhao, Sun Huan
- Gradient-based Automated Iterative Recovery For Parameter-efficient Tuning Mozes Maximilian, Bolukbasi Tolga, Yuan Ann, Liu Frederick, Thain Nithum, Dixon Lucas
- Superhf: Supervised Iterative Learning From Human Feedback Mukobi Gabriel, Chatain Peter, Fong Su, Windesheim Robert, Kutyniok Gitta, Bhatia Kush, Alberti Silas
- Testing Language Model Agents Safely In The Wild Naihin Silen, Atkinson David, Green Marc, Hamadi Merwane, Swift Craig, Schonholtz Douglas, Kalai Adam Tauman, Bau David
- DERA: Enhancing Large Language Model Completions With Dialog-enabled Resolving Agents Nair Varun, Schumacher Elliot, Tso Geoffrey, Kannan Anitha
- Large Language Models As Tax Attorneys: A Case Study In Legal Capabilities Emergence Nay John J., Karamardian David, Lawsky Sarah B., Tao Wenting, Bhat Meghana, Jain Raghav, Lee Aaron Travis, Choi Jonathan H., Kasai Jungo
- Capabilities Of GPT-4 On Medical Challenge Problems Nori Harsha, King Nicholas, Mckinney Scott Mayer, Carignan Dean, Horvitz Eric
- LLM Self Defense: By Self Examination, Llms Know They Are Being Tricked Phute Mansi, Helbling Alec, Hull Matthew, Peng Shengyun, Szyller Sebastian, Cornelius Cory, Chau Duen Horng
- Science In The Era Of Chatgpt, Large Language Models And Generative AI: Challenges For Research Ethics And How To Respond Pournaras Evangelos
- The Robots Are Here: Navigating The Generative AI Revolution In Computing Education Prather James, Denny Paul, Leinonen Juho, Becker Brett A., Albluwi Ibrahim, Craig Michelle, Keuning Hieke, Kiesler Natalie, Kohn Tobias, Luxton-reilly Andrew, Macneil Stephen, Peterson Andrew, Pettit Raymond, Reeves Brent N., Savelka Jaromir
- Visual Adversarial Examples Jailbreak Aligned Large Language Models Qi Xiangyu, Huang Kaixuan, Panda Ashwinee, Henderson Peter, Wang Mengdi, Mittal Prateek
- Fine-tuning Aligned Language Models Compromises Safety, Even When Users Do Not Intend To! Qi Xiangyu, Zeng Yi, Xie Tinghao, Chen Pin-yu, Jia Ruoxi, Mittal Prateek, Henderson Peter
- Latent Jailbreak: A Benchmark For Evaluating Text Safety And Output Robustness Of Large Language Models Qiu Huachuan, Zhang Shuai, Li Anqi, He Hongliang, Lan Zhenzhong
- Bio-sieve: Exploring Instruction Tuning Large Language Models For Systematic Review Automation Robinson Ambrose, Thorne William, Wu Ben P., Pandor Abdullah, Essat Munira, Stevenson Mark, Song Xingyi
- Identifying The Risks Of LM Agents With An Lm-emulated Sandbox Ruan Yangjun, Dong Honghua, Wang Andrew, Pitis Silviu, Zhou Yongchao, Ba Jimmy, Dubois Yann, Maddison Chris J., Hashimoto Tatsunori
- Jais And Jais-chat: Arabic-centric Foundation And Instruction-tuned Open Generative Large Language Models Sengupta Neha, Sahu Sunil Kumar, Jia Bokang, Katipomu Satheesh, Li Haonan, Koto Fajri, Marshall William, Gosal Gurpreet, Liu Cynthia, Chen Zhiming, Afzal Osama Mohammed, Kamboj Samta, Pandit Onkar, Pal Rahul, Pradhan Lalit, Mujahid Zain Muhammad, Baali Massa, Han Xudong, Bsharat Sondos Mahmoud, Aji Alham Fikri, Shen Zhiqiang, Liu Zhengzhong, Vassilieva Natalia, Hestness Joel, Hock Andy, Feldman Andrew, Lee Jonathan, Jackson Andrew, Ren Hector Xuguang, Nakov Preslav, Baldwin Timothy, Xing Eric
- Personality Traits In Large Language Models Serapio-garcía Greg, Safdari Mustafa, Crepy Clément, Sun Luning, Fitz Stephen, Romero Peter, Abdulhai Marwa, Faust Aleksandra, Matarić Maja
- Evil Geniuses: Delving Into The Safety Of Llm-based Agents Tian Yu, Yang Xiao, Zhang Jingyuan, Dong Yinpeng, Su Hang
- Clinical Camel: An Open Expert-level Medical Language Model With Dialogue-based Knowledge Encoding Toma Augustin, Lawler Patrick R., Ba Jimmy, Krishnan Rahul G., Rubin Barry B., Wang Bo
- How Many Unicorns Are In This Image? A Safety Evaluation Benchmark For Vision Llms Tu Haoqin, Cui Chenhang, Wang Zijun, Zhou Yiyang, Zhao Bingchen, Han Junlin, Zhou Wangchunshu, Yao Huaxiu, Xie Cihang
- Sight Beyond Text: Multi-modal Training Enhances Llms In Truthfulness And Ethics Tu Haoqin, Zhao Bingchen, Wei Chen, Xie Cihang
- Language Models Don't Always Say What They Think: Unfaithful Explanations In Chain-of-thought Prompting Turpin Miles, Michael Julian, Perez Ethan, Bowman Samuel R.
- Adventures Of Trustworthy Vision-language Models: A Survey Vatsa Mayank, Jain Anubhooti, Singh Richa
- Simplesafetytests: A Test Suite For Identifying Critical Safety Risks In Large Language Models Vidgen Bertie, Scherrer Nino, Kirk Hannah Rose, Qian Rebecca, Kannappan Anand, Hale Scott A., Röttger Paul
- Empirical Evaluation Of Uncertainty Quantification In Retrieval-augmented Language Models For Science Wagle Sridevi, Munikoti Sai, Acharya Anurag, Smith Sara, Horawalavithana Sameera
- On The Robustness Of Chatgpt: An Adversarial And Out-of-distribution Perspective Wang Jindong, Hu Xixu, Hou Wenxin, Chen Hao, Zheng Runkai, Wang Yidong, Yang Linyi, Huang Haojun, Ye Wei, Geng Xiubo, Jiao Binxin, Zhang Yue, Xie Xing
- All Languages Matter: On The Multilingual Safety Of Large Language Models Wang Wenxuan, Tu Zhaopeng, Chen Chang, Yuan Youliang, Huang Jen-tse, Jiao Wenxiang, Lyu Michael R.
- TRACE: A Comprehensive Benchmark For Continual Learning In Large Language Models Wang Xiao, Zhang Yuansen, Chen Tianze, Gao Songyang, Jin Senjie, Yang Xianjun, Xi Zhiheng, Zheng Rui, Zou Yicheng, Gui Tao, Zhang Qi, Huang Xuanjing
- Fake Alignment: Are Llms Really Aligned Well? Wang Yixu, Teng Yan, Huang Kexin, Lyu Chengqi, Zhang Songyang, Zhang Wenwei, Ma Xingjun, Jiang Yu-gang, Qiao Yu, Wang Yingchun
- Gpt4video: A Unified Multimodal Large Language Model For Instruction-followed Understanding And Safety-aware Generation Wang Zhanyu, Wang Longyue, Zhao Zhen, Wu Minghao, Lyu Chenyang, Li Huayang, Cai Deng, Zhou Luping, Shi Shuming, Tu Zhaopeng
- Jailbroken: How Does LLM Safety Training Fail? Wei Alexander, Haghtalab Nika, Steinhardt Jacob
- Jailbreak And Guard Aligned Language Models With Only Few In-context Demonstrations Wei Zeming, Wang Yifei, Li Ang, Mo Yichuan, Wang Yisen
- Unveiling The Implicit Toxicity In Large Language Models Wen Jiaxin, Ke Pei, Sun Hao, Zhang Zhexin, Li Chengfei, Bai Jinfeng, Huang Minlie
- Unveiling Security, Privacy, And Ethical Concerns Of Chatgpt Wu Xiaodong, Duan Ran, Ni Jianbing
- Gentopia: A Collaborative Platform For Tool-augmented Llms Xu Binfeng, Liu Xukun, Shen Hua, Han Zeyu, Li Yuhan, Yue Murong, Peng Zhiyuan, Liu Yuchen, Yao Ziyu, Xu Dongkuan
- Sc-safety: A Multi-round Open-ended Question Adversarial Safety Benchmark For Large Language Models In Chinese Xu Liang, Zhao Kangkang, Zhu Lei, Xue Hang
- Cognitive Overload: Jailbreaking Large Language Models With Overloaded Logical Thinking Xu Nan, Wang Fei, Zhou Ben, Li Bang Zheng, Xiao Chaowei, Chen Muhao
- An LLM Can Fool Itself: A Prompt-based Adversarial Attack Xu Xilie, Kong Keyi, Liu Ning, Cui Lizhen, Wang Di, Zhang Jingfeng, Kankanhalli Mohan
- Zhongjing: Enhancing The Chinese Medical Capabilities Of Large Language Model Through Expert Feedback And Real-world Multi-turn Dialogue Yang Songhua, Zhao Hanjie, Zhu Senbin, Zhou Guangyu, Xu Hongfei, Jia Yuxiang, Zan Hongying
- Shadow Alignment: The Ease Of Subverting Safely-aligned Language Models Yang Xianjun, Wang Xiao, Zhang Qi, Petzold Linda, Wang William Yang, Zhao Xun, Lin Dahua
- Low-resource Languages Jailbreak GPT-4 Yong Zheng-xin, Menghini Cristina, Bach Stephen H.
- Fundamental Limitations Of Alignment In Large Language Models Wolf Yotam, Wies Noam, Avnery Oshri, Levine Yoav, Shashua Amnon
- GPTFUZZER: Red Teaming Large Language Models With Auto-generated Jailbreak Prompts Yu Jiahao, Lin Xingwei, Yu Zheng, Xing Xinyu
- Large Language Models Illuminate A Progressive Pathway To Artificial Healthcare Assistant: A Review Yuan Mingze, Bao Peng, Yuan Jiajia, Shen Yunhao, Chen Zifan, Xie Yi, Zhao Jie, Chen Yang, Zhang Li, Shen Lin, Dong Bin
- On Evaluating Adversarial Robustness Of Large Vision-language Models Zhao Yunqing, Pang Tianyu, Du Chao, Yang Xiao, Li Chongxuan, Cheung Ngai-man, Lin Min
- On The Safety Of Open-sourced Large Language Models: Does Alignment Really Prevent Them From Being Misused? Zhang Hangfan, Guo Zhimeng, Zhu Huaisheng, Cao Bochuan, Lin Lu, Jia Jinyuan, Chen Jinghui, Wu Dinghao
- Defending Large Language Models Against Jailbreaking Attacks Through Goal Prioritization Zhang Zhexin, Yang Junxiao, Ke Pei, Mi Fei, Wang Hongning, Huang Minlie
- Accountable Textual-visual Chat Learns To Reject Human Instructions In Image Re-creation Zhang Zhiwei, Liu Yuliang
- Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-thought Reasoning To Language Agents Zhang Zhuosheng, Yao Yao, Zhang Aston, Tang Xiangru, Ma Xinbei, He Zhiwei, Wang Yiming, Gerstein Mark, Wang Rui, Liu Gongshen, Zhao Hai
- Learning And Forgetting Unsafe Examples In Large Language Models Zhao Jiachen, Deng Zhun, Madras David, Zou James, Ren Mengye
- Chbias: Bias Evaluation And Mitigation Of Chinese Conversational Language Models Zhao Jiaxu, Fang Meng, Shi Zijing, Li Yitong, Chen Ling, Pechenizkiy Mykola
- Lmsys-chat-1m: A Large-scale Real-world LLM Conversation Dataset Zheng Lianmin, Chiang Wei-lin, Sheng Ying, Li Tianle, Zhuang Siyuan, Wu Zhanghao, Zhuang Yonghao, Li Zhuohan, Lin Zi, Xing Eric P., Gonzalez Joseph E., Stoica Ion, Zhang Hao
- Trafficsafetygpt: Tuning A Pre-trained Large Language Model To A Domain-specific Expert In Transportation Safety Zheng Ou, Abdel-aty Mohamed, Wang Dongdong, Wang Chenzhu, Ding Shengxuan
- Making Harmful Behaviors Unlearnable For Large Language Models Zhou Xin, Lu Yi, Ma Ruotian, Gui Tao, Zhang Qi, Huang Xuanjing
- Autodan: Interpretable Gradient-based Adversarial Attacks On Large Language Models Zhu Sicheng, Zhang Ruiyi, An Bang, Wu Gang, Barrow Joe, Wang Zichao, Huang Furong, Nenkova Ani, Sun Tong
- Red Teaming Chatgpt Via Jailbreaking: Bias, Robustness, Reliability And Toxicity Zhuo Terry Yue, Huang Yujin, Chen Chunyang, Xing Zhenchang
- Phi-3 Technical Report: A Highly Capable Language Model Locally On Your Phone Abdin Marah, Aneja Jyoti, Awadalla Hany, Awadallah Ahmed, Awan Ammar Ahmad, Bach Nguyen, Bahree Amit, Bakhtiari Arash, Bao Jianmin, Behl Harkirat, Benhaim Alon, Bilenko Misha, Bjorck Johan, Bubeck Sébastien, Cai Martin, Cai Qin, Chaudhary Vishrav, Chen Dong, Chen Dongdong, Chen Weizhu, Chen Yen-chun, Chen Yi-ling, Cheng Hao, Chopra Parul, Dai Xiyang, Dixon Matthew, Eldan Ronen, Fragoso Victor, Gao Jianfeng, Gao Mei, Gao Min, Garg Amit, Del Giorno Allie, Goswami Abhishek, Gunasekar Suriya, Haider Emman, Hao Junheng, Hewett Russell J., Hu Wenxiang, Huynh Jamie, Iter Dan, Jacobs Sam Ade, Javaheripi Mojan, Jin Xin, Karampatziakis Nikos, Kauffmann Piero, Khademi Mahoud, Kim Dongwoo, Kim Young Jin, Kurilenko Lev, Lee James R., Lee Yin Tat, Li Yuanzhi, Li Yunsheng, Liang Chen, Liden Lars, Lin Xihui, Lin Zeqi, Liu Ce, Liu Liyuan, Liu Mengchen, Liu Weishung, Liu Xiaodong, Luo Chong, Madan Piyush, Mahmoudzadeh Ali, Majercak David, Mazzola Matt, Mendes Caio César Teodoro, Mitra Arindam, Modi Hardik, Nguyen Anh, Norick Brandon, Patra Barun, Perez-becker Daniel, Portet Thomas, Pryzant Reid, Qin Heyang, Radmilac Marko, Ren Liliang, De Rosa Gustavo, Rosset Corby, Roy Sambudha, Ruwase Olatunji, Saarikivi Olli, Saied Amin, Salim Adil, Santacroce Michael, Shah Shital, Shang Ning, Sharma Hiteshi, Shen Yelong, Shukla Swadheen, Song Xia, Tanaka Masahiro, Tupini Andrea, Vaddamanu Praneetha, Wang Chunyu, Wang Guanhua, Wang Lijuan, Wang Shuohang, Wang Xin, Wang Yu, Ward Rachel, Wen Wen, Witte Philipp, Wu Haiping, Wu Xiaoxia, Wyatt Michael, Xiao Bin, Xu Can, Xu Jiahang, Xu Weijian, Xue Jilong, Yadav Sonali, Yang Fan, Yang Jianwei, Yang Yifan, Yang Ziyi, Yu Donghan, Yuan Lu, Zhang Chenruidong, Zhang Cyril, Zhang Jianwen, Zhang Li Lyna, Zhang Yi, Zhang Yue, Zhang Yunan, Zhou Xiren
- The Llama 3 Herd Of Models Abhimanyu Dubey, Abhinav Jauhri, Abhinav Pandey, Abhishek Kadian, Ahmad Al-dahle, Aiesha Letman, Akhil Mathur, Alan Schelten, Amy Yang, Angela Fan, Anirudh Goyal, Anthony Hartshorn, Aobo Yang, Archi Mitra, Archie Sravankumar, Artem Korenev, Arthur Hinsvark, Arun Rao, Aston Zhang, Aurelien Rodriguez, Austen Gregerson, Ava Spataru, Baptiste Roziere, Bethany Biron, Binh Tang, Bobbie Chern, Charlotte Caucheteux, Chaya Nayak, Chloe Bi, Chris Marra, Chris Mcconnell, Christian Keller, Christophe Touret, Chunyang Wu, Corinne Wong, Cristian Canton Ferrer, Cyrus Nikolaidis, Damien Allonsius, Daniel Song, Danielle Pintz, Danny Livshits, David Esiobu, Dhruv Choudhary, Dhruv Mahajan, Diego Garcia-olano, Diego Perino, Dieuwke Hupkes, Egor Lakomkin, Ehab Albadawy, Elina Lobanova, Emily Dinan, Eric Michael Smith, Filip Radenovic, Frank Zhang, Gabriel Synnaeve, Gabrielle Lee, Georgia Lewis Anderson, Graeme Nail, Gregoire Mialon, Guan Pang, Guillem Cucurell, Hailey Nguyen, Hannah Korevaar, Hu Xu, Hugo Touvron, Iliyan Zarov, Imanol Arrieta Ibarra, Isabel Kloumann, Ishan Misra, Ivan Evtimov, Jade Copet, Jaewon Lee, Jan Geffert, Jana Vranes, Jason Park, Jay Mahadeokar, Jeet Shah, Jelmer Van Der Linde, Jennifer Billock, Jenny Hong, Jenya Lee, Jeremy Fu, Jianfeng Chi, Jianyu Huang, Jiawen Liu, Jie Wang, Jiecao Yu, Joanna Bitton, Joe Spisak, Jongsoo Park, Joseph Rocca, Joshua Johnstun, Joshua Saxe, Junteng Jia, Kalyan Vasuden Alwala, Kartikeya Upasani, Kate Plawiak, Ke Li, Kenneth Heafield, Kevin Stone, Khalid El-arini, Krithika Iyer, Kshitiz Malik, Kuenley Chiu, Kunal Bhalla, Lauren Rantala-yeary, Laurens Van Der Maaten, Lawrence Chen, Liang Tan, Liz Jenkins, Louis Martin, Lovish Madaan, Lubo Malo, Lukas Blecher, Lukas Landzaat, Luke De Oliveira, Madeline Muzzi, Mahesh Pasupuleti, Mannat Singh, Manohar Paluri, Marcin Kardas, Mathew Oldham, Mathieu Rita, Maya Pavlova, Melanie Kambadur, Mike Lewis, Min Si, Mitesh Kumar Singh, Mona Hassan, Naman Goyal, Narjes 
Torabi, Nikolay Bashlykov, Nikolay Bogoychev, Niladri Chatterji, Olivier Duchenne, Onur Çelebi, Patrick Alrassy, Pengchuan Zhang, Pengwei Li, Petar Vasic, Peter Weng, Prajjwal Bhargava, Pratik Dubal, Praveen Krishnan, Punit Singh Koura, Puxin Xu, Qing He, Qingxiao Dong, Ragavan Srinivasan, Raj Ganapathy, Ramon Calderer, Ricardo Silveira Cabral, Robert Stojnic, Roberta Raileanu, Rohit Girdhar, Rohit Patel, Romain Sauvestre, Ronnie Polidoro, Roshan Sumbaly, Ross Taylor, Ruan Silva, Rui Hou, Rui Wang, Saghar Hosseini, Sahana Chennabasappa, Sanjay Singh, Sean Bell, Seohyun Sonia Kim, Sergey Edunov, Shaoliang Nie, Sharan Narang, Sharath Raparthy, Sheng Shen, Shengye Wan, Shruti Bhosale, Shun Zhang, Simon Vandenhende, Soumya Batra, Spencer Whitman, Sten Sootla, Stephane Collot, Suchin Gururangan, Sydney Borodinsky, Tamar Herman, Tara Fowler, Tarek Sheasha, Thomas Georgiou, Thomas Scialom, Tobias Speckbacher, Todor Mihaylov, Tong Xiao, Ujjwal Karn, Vedanuj Goswami, Vibhor Gupta, Vignesh Ramanathan, Viktor Kerkez, Vincent Gonguet, Virginie Do, Vish Vogeti, Vladan Petrovic, Weiwei Chu, Wenhan Xiong, Wenyin Fu, Whitney Meers, Xavier Martinet, Xiaodong Wang, Xiaoqing Ellen Tan, Xinfeng Xie, Xuchao Jia, Xuewei Wang, Yaelle Goldschlag, Yashesh Gaur, Yasmine Babaei, Yi Wen, Yiwen Song, Yuchen Zhang, Yue Li, Yuning Mao, Zacharie Delpierre Coudert, Zheng Yan, Zhengxing Chen, Zoe Papakipos, Aaditya Singh, Aaron Grattafiori, Abha Jain, Adam Kelsey, Adam Shajnfeld, Adithya Gangidi, Adolfo Victoria, Ahuva Goldstand, Ajay Menon, Ajay Sharma, Alex Boesenberg, Alex Vaughan, Alexei Baevski, Allie Feinstein, Amanda Kallet, Amit Sangani, Anam Yunus, Andrei Lupu, Andres Alvarado, Andrew Caples, Andrew Gu, Andrew Ho, Andrew Poulton, Andrew Ryan, Ankit Ramchandani, Annie Franco, Aparajita Saraf, Arkabandhu Chowdhury, Ashley Gabriel, Ashwin Bharambe, Assaf Eisenman, Azadeh Yazdan, Beau James, Ben Maurer, Benjamin Leonhardi, Bernie Huang, Beth Loyd, Beto De Paola, Bhargavi Paranjape, Bing Liu, 
Bo Wu, Boyu Ni, Braden Hancock, Bram Wasti, Brandon Spence, Brani Stojkovic, Brian Gamido, Britt Montalvo, Carl Parker, Carly Burton, Catalina Mejia, Changhan Wang, Changkyu Kim, Chao Zhou, Chester Hu, Ching-hsiang Chu, Chris Cai, Chris Tindal, Christoph Feichtenhofer, Damon Civin, Dana Beaty, Daniel Kreymer, Daniel Li, Danny Wyatt, David Adkins, David Xu, Davide Testuggine, Delia David, Devi Parikh, Diana Liskovich, Didem Foss, Dingkang Wang, Duc Le, Dustin Holland, Edward Dowling, Eissa Jamil, Elaine Montgomery, Eleonora Presani, Emily Hahn, Emily Wood, Erik Brinkman, Esteban Arcaute, Evan Dunbar, Evan Smothers, Fei Sun, Felix Kreuk, Feng Tian, Firat Ozgenel, Francesco Caggioni, Francisco Guzmán, Frank Kanayet, Frank Seide, Gabriela Medina Florez, Gabriella Schwarz, Gada Badeer, Georgia Swee, Gil Halpern, Govind Thattai, Grant Herman, Grigory Sizov, Guangyi Zhang, Guna Lakshminarayanan, Hamid Shojanazeri, Han Zou, Hannah Wang, Hanwen Zha, Haroun Habeeb, Harrison Rudolph, Helen Suk, Henry Aspegren, Hunter Goldman, Ibrahim Damlaj, Igor Molybog, Igor Tufanov, Irina-elena Veliche, Itai Gat, Jake Weissman, James Geboski, James Kohli, Japhet Asher, Jean-baptiste Gaya, Jeff Marcus, Jeff Tang, Jennifer Chan, Jenny Zhen, Jeremy Reizenstein, Jeremy Teboul, Jessica Zhong, Jian Jin, Jingyi Yang, Joe Cummings, Jon Carvill, Jon Shepard, Jonathan Mcphie, Jonathan Torres, Josh Ginsburg, Junjie Wang, Kai Wu, Kam Hou U, Karan Saxena, Karthik Prasad, Kartikay Khandelwal, Katayoun Zand, Kathy Matosich, Kaushik Veeraraghavan, Kelly Michelena, Keqian Li, Kun Huang, Kunal Chawla, Kushal Lakhotia, Kyle Huang, Lailin Chen, Lakshya Garg, Lavender A, Leandro Silva, Lee Bell, Lei Zhang, Liangpeng Guo, Licheng Yu, Liron Moshkovich, Luca Wehrstedt, Madian Khabsa, Manav Avalani, Manish Bhatt, Maria Tsimpoukelli, Martynas Mankus, Matan Hasson, Matthew Lennie, Matthias Reso, Maxim Groshev, Maxim Naumov, Maya Lathi, Meghan Keneally, Michael L. 
Seltzer, Michal Valko, Michelle Restrepo, Mihir Patel, Mik Vyatskov, Mikayel Samvelyan, Mike Clark, Mike Macey, Mike Wang, Miquel Jubert Hermoso, Mo Metanat, Mohammad Rastegari, Munish Bansal, Nandhini Santhanam, Natascha Parks, Natasha White, Navyata Bawa, Nayan Singhal, Nick Egebo, Nicolas Usunier, Nikolay Pavlovich Laptev, Ning Dong, Ning Zhang, Norman Cheng, Oleg Chernoguz, Olivia Hart, Omkar Salpekar, Ozlem Kalinli, Parkin Kent, Parth Parekh, Paul Saab, Pavan Balaji, Pedro Rittner, Philip Bontrager, Pierre Roux, Piotr Dollar, Polina Zvyagina, Prashant Ratanchandani, Pritish Yuvraj, Qian Liang, Rachad Alao, Rachel Rodriguez, Rafi Ayub, Raghotham Murthy, Raghu Nayani, Rahul Mitra, Raymond Li, Rebekkah Hogan, Robin Battey, Rocky Wang, Rohan Maheswari, Russ Howes, Ruty Rinott, Sai Jayesh Bondu, Samyak Datta, Sara Chugh, Sara Hunt, Sargun Dhillon, Sasha Sidorov, Satadru Pan, Saurabh Verma, Seiji Yamamoto, Sharadh Ramaswamy, Shaun Lindsay, Sheng Feng, Shenghao Lin, Shengxin Cindy Zha, Shiva Shankar, Shuqiang Zhang, Sinong Wang, Sneha Agarwal, Soji Sajuyigbe, Soumith Chintala, Stephanie Max, Stephen Chen, Steve Kehoe, Steve Satterfield, Sudarshan Govindaprasad, Sumit Gupta, Sungmin Cho, Sunny Virk, Suraj Subramanian, Sy Choudhury, Sydney Goldman, Tal Remez, Tamar Glaser, Tamara Best, Thilo Kohler, Thomas Robinson, Tianhe Li, Tianjun Zhang, Tim Matthews, Timothy Chou, Tzook Shaked, Varun Vontimitta, Victoria Ajayi, Victoria Montanez, Vijai Mohan, Vinay Satish Kumar, Vishal Mangla, Vítor Albiero, Vlad Ionescu, Vlad Poenaru, Vlad Tiberiu Mihailescu, Vladimir Ivanov, Wei Li, Wenchen Wang, Wenwen Jiang, Wes Bouaziz, Will Constable, Xiaocheng Tang, Xiaofang Wang, Xiaojian Wu, Xiaolan Wang, Xide Xia, Xilun Wu, Xinbo Gao, Yanjun Chen, Ye Hu, Ye Jia, Ye Qi, Yenda Li, Yilin Zhang, Ying Zhang, Yossi Adi, Youngjin Nam, Yu Wang, Yuchen Hao, Yundi Qian, Yuzi He, Zach Rait, Zachary Devito, Zef Rosnbrick, Zhaoduo Wen, Zhenyu Yang, Zhiwei Zhao
- Ethical Reasoning And Moral Value Alignment Of Llms Depend On The Language We Prompt Them In Agarwal Utkarsh, Tanmay Kumar, Khandelwal Aditi, Choudhury Monojit
- D-NLP At Semeval-2024 Task 2: Evaluating Clinical Inference Capabilities Of Large Language Models Altinok Duygu
- Syntaxshap: Syntax-aware Explainability Method For Text Generation Amara Kenza, Sevastjanova Rita, El-assady Mennatallah
- Automatic Pseudo-harmful Prompt Generation For Evaluating False Refusals In Large Language Models An Bang, Zhu Sicheng, Zhang Ruiyi, Panaitescu-liess Michael-andrei, Xu Yuancheng, Huang Furong
- Current State Of LLM Risks And AI Guardrails Ayyamperumal Suriya Ganesh, Ge Limin
- Llm-driven Robots Risk Enacting Discrimination, Violence, And Unlawful Actions Azeem Rumaisa, Hundt Andrew, Mansouri Masoumeh, Brandão Martim
- How (un)ethical Are Instruction-centric Responses Of Llms? Unveiling The Vulnerabilities Of Safety Guardrails To Harmful Queries Banerjee Somnath, Layek Sayan, Hazra Rima, Mukherjee Animesh
- Soft Prompting For Unlearning In Large Language Models Bhaila Karuna, Van Minh-hao, Wu Xintao
- Language Models Are Homer Simpson! Safety Re-alignment Of Fine-tuned Language Models Through Task Arithmetic Bhardwaj Rishabh, Anh Do Duc, Poria Soujanya
- Large Language Models Are Vulnerable To Bait-and-switch Attacks For Generating Harmful Content Bianchi Federico, Zou James
- Beyond Human Norms: Unveiling Unique Values Of Large Language Models Through Interdisciplinary Approaches Biedma Pablo, Yi Xiaoyuan, Huang Linus, Sun Maosong, Xie Xing
- Conversational Complexity For Assessing Risk In Large Language Models Burden John, Cebrian Manuel, Hernandez-orallo Jose
- Cendol: Open Instruction-tuned Generative Large Language Models For Indonesian Languages Cahyawijaya Samuel, Lovenia Holy, Koto Fajri, Putri Rifki Afina, Dave Emmanuel, Lee Jhonson, Shadieq Nuur, Cenggoro Wawan, Akbar Salsabil Maulana, Mahendra Muhammad Ihza, Putri Dea Annisayanti, Wilie Bryan, Winata Genta Indra, Aji Alham Fikri, Purwarianti Ayu, Fung Pascale
- Rethinking How To Evaluate Language Model Jailbreak Cai Hongyu, Arunasalam Arjun, Lin Leo Y., Bianchi Antonio, Celik Z. Berkay
- Are Large Language Models Really Bias-free? Jailbreak Prompts For Assessing Adversarial Robustness To Bias Elicitation Cantini Riccardo, Cosenza Giada, Orsino Alessio, Talia Domenico
- Nothing In Excess: Mitigating The Exaggerated Safety For Llms Via Safety-conscious Activation Steering Cao Zouying, Yang Yifei, Zhao Hai
- A Review Of Multi-modal Large Language And Vision Models Carolan Kilian, Fennelly Laura, Smeaton Alan F.
- Cross-modal Safety Alignment: Is Textual Unlearning All You Need? Chakraborty Trishna, Shayegani Erfan, Cai Zikui, Abu-ghazaleh Nael, Asif M. Salman, Dong Yue, Roy-chowdhury Amit K., Song Chengyu
- Integrating Emotional And Linguistic Models For Ethical Compliance In Large Language Models Chang Edward Y.
- Iteralign: Iterative Constitutional Alignment Of Large Language Models Chen Xiusi, Wen Hongzhi, Nag Sreyashi, Luo Chen, Yin Qingyu, Li Ruirui, Li Zheng, Wang Wei
- RL-JACK: Reinforcement Learning-powered Black-box Jailbreaking Attack Against Llms Chen Xuan, Nie Yuzhou, Yan Lu, Mao Yunshu, Guo Wenbo, Zhang Xiangyu
- AI Safety In Generative AI Large Language Models: A Survey Chua Jaymari, Li Yun, Yang Shiyi, Wang Chen, Yao Lina
- Informed AI Regulation: Comparing The Ethical Frameworks Of Leading LLM Chatbots Using An Ethics-based Audit To Assess Moral Reasoning And Normative Values Chun Jon, Elkins Katherine
- Or-bench: An Over-refusal Benchmark For Large Language Models Cui Justin, Chiang Wei-lin, Stoica Ion, Hsieh Cho-jui
- RTP-LX: Can Llms Evaluate Toxicity In Multilingual Scenarios? De Wynter Adrian, Watts Ishaan, Altıntoprak Nektar Ege, Wongsangaroonsri Tua, Zhang Minghui, Farra Noura, Baur Lena, Claudet Samantha, Gajdusek Pavel, Gören Can, Gu Qilong, Kaminska Anna, Kaminski Tomasz, Kuo Ruby, Kyuba Akiko, Lee Jongho, Mathur Kartik, Merok Petter, Milovanović Ivana, Paananen Nani, Paananen Vesa-matti, Pavlenko Anna, Vidal Bruno Pereira, Strika Luciano, Tsao Yueh, Turcato Davide, Vakhno Oleksandr, Velcsov Judit, Vickers Anna, Visser Stéphanie, Widarmanto Herdyan, Zaikin Andrey, Chen Si-qing
- Deconstructing The Ethics Of Large Language Models From Long-standing Issues To New-emerging Dilemmas Deng Chengyuan, Duan Yiqun, Jin Xin, Chang Heng, Tian Yijun, Liu Han, Zou Henry Peng, Jin Yiqiao, Xiao Yijia, Wang Yichen, Wu Shenghao, Xie Zongxing, Gao Kuofeng, He Sihong, Zhuang Jun, Cheng Lu, Wang Haohan
- Evaluating And Mitigating Linguistic Discrimination In Large Language Models Dong Guoliang, Wang Haoyu, Sun Jun, Wang Xinyu
- Contrans: Weak-to-strong Alignment Engineering Via Concept Transplantation Dong Weilong, Wu Xinwei, Jin Renren, Xu Shaoyang, Xiong Deyi
- A Framework For Real-time Safeguarding The Text Generation Of Large Language Model Dong Ximing, Lin Dayi, Wang Shaowei, Hassan Ahmed E.
- Jailbreaking Text-to-image Models With Llm-based Agents Dong Yingkai, Li Zheng, Meng Xiangtao, Yu Ning, Guo Shanqing
- Attacks, Defenses And Evaluations For LLM Conversation Safety: A Survey Dong Zhichen, Zhou Zhanhui, Yang Chao, Shao Jing, Qiao Yu
- Lightweight Large Language Model For Medication Enquiry: Med-pal Elangovan Kabilan, Ong Jasmine Chiat Ling, Jin Liyuan, Seng Benjamin Jun Jie, Kwan Yu Heng, Tan Lit Soo, Zhong Ryan Jian, Ma Justina Koi Li, Ke Yuhe, Liu Nan, Giacomini Kathleen M, Ting Daniel Shu Wei
- Legend: Leveraging Representation Engineering To Annotate Safety Margin For Preference Datasets Feng Duanyu, Qin Bowen, Huang Chen, Huang Youcheng, Zhang Zheng, Lei Wenqiang
- Towards Trustworthy AI: A Review Of Ethical And Robust Large Language Models Ferdaus Md Meftahul, Abdelguerfi Mahdi, Ioup Elias, Niles Kendall N., Pathak Ken, Sloan Steven
- Cross-task Defense: Instruction-tuning Llms For Content Safety Fu Yu, Xiao Wen, Chen Jia, Li Jiachen, Papalexakis Evangelos, Chien Aichi, Dong Yue
- Can AI Relate: Testing Large Language Model Response For Mental Health Support Gabriel Saadia, Puri Isha, Xu Xuhai, Malgaroli Matteo, Ghassemi Marzyeh
- Gemma: Open Models Based On Gemini Research And Technology Gemma Team, Mesnard Thomas, Hardin Cassidy, Dadashi Robert, Bhupatiraju Surya, Pathak Shreya, Sifre Laurent, Rivière Morgane, Kale Mihir Sanjay, Love Juliette, Tafti Pouya, Hussenot Léonard, Sessa Pier Giuseppe, Chowdhery Aakanksha, Roberts Adam, Barua Aditya, Botev Alex, Castro-ros Alex, Slone Ambrose, Héliou Amélie, Tacchetti Andrea, Bulanova Anna, Paterson Antonia, Tsai Beth, Shahriari Bobak, Lan Charline Le, Choquette-choo Christopher A., Crepy Clément, Cer Daniel, Ippolito Daphne, Reid David, Buchatskaya Elena, Ni Eric, Noland Eric, Yan Geng, Tucker George, Muraru George-christian, Rozhdestvenskiy Grigory, Michalewski Henryk, Tenney Ian, Grishchenko Ivan, Austin Jacob, Keeling James, Labanowski Jane, Lespiau Jean-baptiste, Stanway Jeff, Brennan Jenny, Chen Jeremy, Ferret Johan, Chiu Justin, Mao-jones Justin, Lee Katherine, Yu Kathy, Millican Katie, Sjoesund Lars Lowe, Lee Lisa, Dixon Lucas, Reid Machel, Mikuła Maciej, Wirth Mateo, Sharman Michael, Chinaev Nikolai, Thain Nithum, Bachem Olivier, Chang Oscar, Wahltinez Oscar, Bailey Paige, Michel Paul, Yotov Petko, Chaabouni Rahma, Comanescu Ramona, Jana Reena, Anil Rohan, Mcilroy Ross, Liu Ruibo, Mullins Ryan, Smith Samuel L, Borgeaud Sebastian, Girgin Sertan, Douglas Sholto, Pandya Shree, Shakeri Siamak, De Soham, Klimenko Ted, Hennigan Tom, Feinberg Vlad, Stokowiec Wojciech, Chen Yu-hui, Ahmed Zafarali, Gong Zhitao, Warkentin Tris, Peran Ludovic, Giang Minh, Farabet Clément, Vinyals Oriol, Dean Jeff, Kavukcuoglu Koray, Hassabis Demis, Ghahramani Zoubin, Eck Douglas, Barral Joelle, Pereira Fernando, Collins Eli, Joulin Armand, Fiedel Noah, Senter Evan, Andreev Alek, Kenealy Kathleen
- Why Not Transform Chat Large Language Models To Non-english? Geng Xiang, Zhu Ming, Li Jiahuan, Lai Zhejian, Zou Wei, She Shuaijie, Guo Jiaxin, Zhao Xiaofeng, Li Yinglu, Li Yuang, Su Chang, Zhao Yanqing, Lyu Xinglin, Zhang Min, Chen Jiajun, Yang Hao, Huang Shujian
- Jailbreaking Llms With Arabic Transliteration And Arabizi Ghanim Mansour Al, Almohaimeed Saleh, Zheng Mengxin, Solihin Yan, Lou Qian
- Plan-grounded Large Language Models For Dual Goal Conversational Settings Glória-silva Diogo, Ferreira Rafael, Tavares Diogo, Semedo David, Magalhães João
- Tokenshap: Interpreting Large Language Models With Monte Carlo Shapley Value Estimation Goldshmidt Roni, Horovicz Miriam
- Mechanistic Interpretability Of Large Language Models With Applications To The Financial Services Industry Golgoon Ashkan, Filom Khashayar, Kannan Arjun Ravi
- Multi-frame, Lightweight & Efficient Vision-language Models For Question Answering In Autonomous Driving Gopalkrishnan Akshay, Greer Ross, Trivedi Mohan
- Eyes Closed, Safety On: Protecting Multimodal Llms Via Image-to-text Transformation Gou Yunhao, Chen Kai, Liu Zhili, Hong Lanqing, Xu Hang, Li Zhenguo, Yeung Dit-yan, Kwok James T., Zhang Yu
- Stress-testing Capability Elicitation With Password-locked Models Greenblatt Ryan, Roger Fabien, Krasheninnikov Dmitrii, Krueger David
- A Survey On Responsible Generative AI: What To Generate And What Not Gu Jindong
- Integrating Explanations In Learning LTL Specifications From Demonstrations Gupta Ashutosh, Komp John, Rajput Abhay Singh, Shankaranarayanan Krishna, Trivedi Ashutosh, Varshney Namrita
- Walledeval: A Comprehensive Safety Evaluation Toolkit For Large Language Models Gupta Prannaya, Yau Le Qi, Low Hao Han, Lee I-shiang, Lim Hugo Maximus, Teoh Yu Xin, Koh Jia Hng, Liew Dar Win, Bhardwaj Rishabh, Bhardwaj Rajat, Poria Soujanya
- Covert Malicious Finetuning: Challenges In Safeguarding LLM Adaptation Halawi Danny, Wei Alexander, Wallace Eric, Wang Tony T., Haghtalab Nika, Steinhardt Jacob
- Wildguard: Open One-stop Moderation Tools For Safety Risks, Jailbreaks, And Refusals Of Llms Han Seungju, Rao Kavel, Ettinger Allyson, Jiang Liwei, Lin Bill Yuchen, Lambert Nathan, Choi Yejin, Dziri Nouha
- Torchopera: A Compound AI System For LLM Safety Han Shanshan, Yao Yuhang, Hu Zijian, Stripelis Dimitris, Xu Zhaozhuo, He Chaoyang
- Exploring Chatgpt And Its Impact On Society Haque Md. Asraful, Li Shuai
- Risk And Response In Large Language Models: Evaluating Key Threat Categories Harandizadeh Bahareh, Salinas Abel, Morstatter Fred
- Pruning For Protection: Increasing Jailbreak Resistance In Aligned Llms Without Fine-tuning Hasan Adib, Rugina Ileana, Wang Alex
- Safety Arithmetic: A Framework For Test-time Safety Alignment Of Language Models By Steering Parameters And Activations Hazra Rima, Layek Sayan, Banerjee Somnath, Poria Soujanya
- Sowing The Wind, Reaping The Whirlwind: The Impact Of Editing Language Models Hazra Rima, Layek Sayan, Banerjee Somnath, Poria Soujanya
- Llms Meet Multimodal Generation And Editing: A Survey He Yingqing, Liu Zhaoyang, Chen Jingye, Tian Zeyue, Liu Hongyu, Chi Xiaowei, Liu Runtao, Yuan Ruibin, Xing Yazhou, Wang Wenhai, Dai Jifeng, Zhang Yong, Xue Wei, Liu Qifeng, Guo Yike, Chen Qifeng
- Quality Of Answers Of Generative Large Language Models Vs Peer Patients For Interpreting Lab Test Results For Lay Patients: Evaluation Study He Zhe, Bhasuran Balu, Jin Qiao, Tian Shubo, Hanna Karim, Shavor Cindy, Arguello Lisbeth Garcia, Murray Patrick, Lu Zhiyong
- Securing Vision-language Models With A Robust Encoder Against Jailbreak And Adversarial Attacks Hossain Md Zarif, Imteaj Ahmed
- Evaluating Robustness Of Generative Search Engine On Adversarial Factual Questions Hu Xuming, Li Xiaochuan, Chen Junzhe, Li Yinghui, Li Yangning, Li Xiaoguang, Wang Yasheng, Liu Qun, Wen Lijie, Yu Philip S., Guo Zhijiang
- Applying And Evaluating Large Language Models In Mental Health Care: A Scoping Review Of Human-assessed Generative Tasks Hua Yining, Na Hongbin, Li Zehan, Liu Fenglin, Fang Xiao, Clifton David, Torous John
- Chatgpt Rates Natural Language Explanation Quality Like Humans: But On Which Scales? Huang Fan, Kwak Haewoon, Park Kunwoo, An Jisun
- Antidote: Post-fine-tuning Safety Alignment For Large Language Models Against Harmful Fine-tuning Huang Tiansheng, Bhattacharya Gautam, Joshi Pratik, Kimball Josh, Liu Ling
- Trustllm: Trustworthiness In Large Language Models Huang Yue, Sun Lichao, Wang Haoran, Wu Siyuan, Zhang Qihui, Li Yuan, Gao Chujie, Huang Yixin, Lyu Wenhan, Zhang Yixuan, Li Xiner, Liu Zhengliang, Liu Yixin, Wang Yijue, Zhang Zhikun, Vidgen Bertie, Kailkhura Bhavya, Xiong Caiming, Xiao Chaowei, Li Chunyuan, Xing Eric, Huang Furong, Liu Hao, Ji Heng, Wang Hongyi, Zhang Huan, Yao Huaxiu, Kellis Manolis, Zitnik Marinka, Jiang Meng, Bansal Mohit, Zou James, Pei Jian, Liu Jian, Gao Jianfeng, Han Jiawei, Zhao Jieyu, Tang Jiliang, Wang Jindong, Vanschoren Joaquin, Mitchell John, Shu Kai, Xu Kaidi, Chang Kai-wei, He Lifang, Huang Lifu, Backes Michael, Gong Neil Zhenqiang, Yu Philip S., Chen Pin-yu, Gu Quanquan, Xu Ran, Ying Rex, Ji Shuiwang, Jana Suman, Chen Tianlong, Liu Tianming, Zhou Tianyi, Wang William, Li Xiang, Zhang Xiangliang, Wang Xiao, Xie Xing, Chen Xun, Wang Xuyu, Liu Yan, Ye Yanfang, Cao Yinzhi, Chen Yong, Zhao Yue
- Polyglotoxicityprompts: Multilingual Evaluation Of Neural Toxic Degeneration In Large Language Models Jain Devansh, Kumar Priyanshu, Gehman Samuel, Zhou Xuhui, Hartvigsen Thomas, Sap Maarten
- Chatbug: A Common Vulnerability Of Aligned Llms Induced By Chat Templates Jiang Fengqing, Xu Zhangchen, Niu Luyao, Lin Bill Yuchen, Poovendran Radha
- Raising The Bar: Investigating The Values Of Large Language Models Via Generative Evolving Testing Jiang Han, Yi Xiaoyuan, Wei Zhihua, Wang Shu, Xie Xing
- Wildteaming At Scale: From In-the-wild Jailbreaks To (adversarially) Safer Language Models Jiang Liwei, Rao Kavel, Han Seungju, Ettinger Allyson, Brahman Faeze, Kumar Sachin, Mireshghallah Niloofar, Lu Ximing, Sap Maarten, Choi Yejin, Dziri Nouha
- Navigating LLM Ethics: Advancements, Challenges, And Future Directions Jiao Junfeng, Afroogh Saleh, Xu Yiming, Phillips Connor
- GUARD: Role-playing To Generate Natural-language Jailbreakings To Test Guideline Adherence Of Large Language Models Jin Haibo, Chen Ruoxi, Zhou Andy, Zhang Yang, Wang Haohan
- Beyond Labels: Aligning Large Language Models With Human-like Reasoning Kabir Muhammad Rafsan, Sultan Rafeed Mohammad, Asif Ihsanul Haque, Ahad Jawad Ibn, Rahman Fuad, Amin Mohammad Ruhul, Mohammed Nabeel, Rahman Shafin
- Scaling Laws For Forgetting When Fine-tuning Large Language Models Kalajdzievski Damjan
- MEDIC: Towards A Comprehensive Framework For Evaluating Llms In Clinical Applications Kanithi Praveen K, Christophe Clément, Pimentel Marco AF, Raha Tathagata, Saadi Nada, Javed Hamza, Maslenkova Svetlana, Hayat Nasir, Rajan Ronnie, Khan Shadab
- Can Reinforcement Learning Unlock The Hidden Dangers In Aligned Large Language Models? Karkevandi Mohammad Bahrami, Vishwamitra Nishant, Najafirad Peyman
- Defending Large Language Models Against Attacks With Residual Stream Activation Analysis Kawasaki Amelia, Davis Andrew, Abbas Houssam
- Grounding And Evaluation For Large Language Models: Practical Challenges And Lessons Learned (survey) Kenthapadi Krishnaram, Sameki Mehrnoosh, Taly Ankur
- Consistency And Uncertainty: Identifying Unreliable Responses From Black-box Vision-language Models For Selective Visual Question Answering Khan Zaid, Fu Yun
- Nevermind: Instruction Override And Moderation In Large Language Models Kim Edward
- Automatic Jailbreaking Of The Text-to-image Generative AI Systems Kim Minseon, Lee Hyomin, Gong Boqing, Zhang Huishuai, Hwang Sung Ju
- Groundial: Human-norm Grounded Safe Dialog Response Generation Kim Siwon, Dai Shuyang, Kachuee Mohammad, Ray Shayan, Taghavi Tara, Yoon Sungroh
- Fine-tuning, Quantization, And Llms: Navigating Unintended Outcomes Kumar Divyanshu, Kumar Anurakt, Agarwal Sahil, Harshangi Prashanth
- Fishing For Magikarp: Automatically Detecting Under-trained Tokens In Large Language Models Land Sander, Bartolo Max
- Frontier AI Ethics: Anticipating And Evaluating The Societal Impacts Of Generative Agents Lazar Seth
- INDICT: Code Generation With Internal Dialogues Of Critiques For Both Security And Helpfulness Le Hung, Zhou Yingbo, Xiong Caiming, Savarese Silvio, Sahoo Doyen
- More RLHF, More Trust? On The Impact Of Human Preference Alignment On Language Model Trustworthiness Li Aaron J., Krishna Satyapriya, Lakkaraju Himabindu
- Backdoor Removal For Generative Large Language Models Li Haoran, Chen Yulin, Zheng Zihao, Hu Qi, Chan Chunkit, Liu Heshan, Song Yangqiu
- A Cross-language Investigation Into Jailbreak Attacks In Large Language Models Li Jie, Liu Yi, Liu Chongyang, Shi Ling, Ren Xiaoning, Zheng Yaowen, Liu Yang, Xue Yinxing
- Fmm-attack: A Flow-based Multi-modal Adversarial Attack On Video-based Llms Li Jinmin, Gao Kuofeng, Bai Yang, Zhang Jingyun, Xia Shu-tao, Wang Yisen
- Dissecting Human And LLM Preferences Li Junlong, Zhou Fan, Sun Shichao, Zhang Yikai, Zhao Hai, Liu Pengfei
- Red Teaming Visual Language Models Li Mukai, Li Lei, Yin Yuwei, Ahmed Masood, Liu Zhenguang, Liu Qi
- Preference Tuning For Toxicity Mitigation Generalizes Across Languages Li Xiaochen, Yong Zheng-xin, Bach Stephen H.
- Q-adapter: Training Your LLM Adapter As A Residual Q-function Li Yi-chen, Zhang Fuxiang, Qiu Wenjie, Yuan Lei, Jia Chengxing, Zhang Zongzhang, Yu Yang
- Backdoorllm: A Comprehensive Benchmark For Backdoor Attacks On Large Language Models Li Yige, Huang Hanxun, Zhao Yunhan, Ma Xingjun, Sun Jun
- Large Language Models For Human-like Autonomous Driving: A Survey Li Yun, Katsumata Kai, Javanmardi Ehsan, Tsukada Manabu
- Controllable Text Generation For Large Language Models: A Survey Liang Xun, Wang Hanyu, Wang Yezhaohui, Song Shichao, Yang Jiawei, Niu Simin, Hu Jie, Liu Dan, Yao Shunyu, Xiong Feiyu, Li Zhiyu
- Why Are My Prompts Leaked? Unraveling Prompt Extraction Threats In Customized Large Language Models Liang Zi, Hu Haibo, Ye Qingqing, Xiao Yaxin, Li Haoyang
- Progressively Selective Label Enhancement For Language Model Alignment Liu Biao, Xu Ning, Geng Xin
- Openeval: Benchmarking Chinese Llms Across Capability, Alignment And Safety Liu Chuang, Yu Linhao, Li Jiaxuan, Jin Renren, Huang Yufei, Shi Ling, Zhang Junhui, Ji Xinmeng, Cui Tingting, Liu Tao, Song Jinwang, Zan Hongying, Li Sun, Xiong Deyi
- On The Intrinsic Self-correction Capability Of Llms: Uncertainty And Latent Concept Liu Guangliang, Mao Haitao, Cao Bochuan, Xue Zhiyu, Johnson Kristen, Tang Jiliang, Wang Rongrong
- Advancing Adversarial Suffix Transfer Learning On Aligned Large Language Models Liu Hongfu, Xie Yuxi, Wang Ye, Shieh Michael
- Automatic Generation Of Model And Data Cards: A Step Towards Responsible AI Liu Jiarui, Li Wenkai, Jin Zhijing, Diab Mona
- Tiny Refinements Elicit Resilience: Toward Efficient Prefix-model Against LLM Red-teaming Liu Jiaxu, Yin Xiangyu, Wu Sihao, Wang Jianhong, Fang Meng, Yi Xinping, Huang Xiaowei
- Latent Guard: A Safety Framework For Text-to-image Generation Liu Runtao, Khakzar Ashkan, Gu Jindong, Chen Qifeng, Torr Philip, Pizzati Fabio
- Making Them Ask And Answer: Jailbreaking Large Language Models In Few Queries Via Disguise And Reconstruction Liu Tong, Zhang Yingjie, Zhao Zhe, Dong Yinpeng, Meng Guozhu, Chen Kai
- Imposter.ai: Adversarial Attacks With Hidden Intentions Towards Aligned Large Language Models Liu Xiao, Li Liangzhi, Xiang Tong, Ye Fuying, Wei Lu, Li Wangyue, Garcia Noa
- Arondight: Red Teaming Large Vision Language Models With Auto-generated Multi-modal Jailbreak Prompts Liu Yi, Cai Chengjun, Zhang Xiaoli, Yuan Xingliang, Wang Cong
- SAFETY-J: Evaluating Safety With Critique Liu Yixiu, Zheng Yuxiang, Xia Shijie, Li Jiajun, Tu Yi, Song Chaoling, Liu Pengfei
- Large Language Models Relearn Removed Concepts Lo Michelle, Cohen Shay B., Barez Fazl
- Every Language Counts: Learn And Unlearn In Multilingual Llms Lu Taiming, Koehn Philipp
- Wildvision: Evaluating Vision-language Models In The Wild With Human Preferences Lu Yujie, Jiang Dongfu, Chen Wenhu, Wang William Yang, Choi Yejin, Lin Bill Yuchen
- Decoupled Alignment For Robust Plug-and-play Adaptation Luo Haozheng, Yu Jiahao, Zhang Wenxin, Li Jialong, Hu Jerry Yao-chieh, Xing Xinyu, Liu Han
- Ensuring Safe And High-quality Outputs: A Guideline Library Approach For Language Models Luo Yi, Lin Zhenghao, Zhang Yuhao, Sun Jiashuo, Lin Chen, Xu Chengjin, Su Xiangdong, Shen Yelong, Guo Jian, Gong Yeyun
- Keeping Llms Aligned After Fine-tuning: The Crucial Role Of Prompt Templates Lyu Kaifeng, Zhao Haoyu, Gu Xinran, Yu Dingli, Goyal Anirudh, Arora Sanjeev
- A Novel Nuanced Conversation Evaluation Framework For Large Language Models In Mental Health Marrapese Alexander, Suleiman Basem, Ullah Imdad, Kim Juno
- Generative Artificial Intelligence In Higher Education: Evidence From An Analysis Of Institutional Policies And Guidelines Mcdonald Nora, Johri Aditya, Ali Areej, Hingle Aayushi
- "not Aligned" Is Not "malicious": Being Careful About Hallucinations Of Large Language Models' Jailbreak Mei Lingrui, Liu Shenghua, Wang Yiwei, Bi Baolong, Mao Jiayi, Cheng Xueqi
- Phybench: A Physical Commonsense Benchmark For Evaluating Text-to-image Models Meng Fanqing, Shao Wenqi, Luo Lixin, Wang Yahong, Chen Yiran, Lu Quanfeng, Yang Yue, Yang Tianshuo, Zhang Kaipeng, Qiao Yu, Luo Ping
- Are Large Language Models Superhuman Chemists? Mirza Adrian, Alampara Nawaf, Kunchapu Sreekanth, Emoekabu Benedict, Krishnan Aswanth, Wilhelmi Mara, Okereke Macjonathan, Eberhardt Juliane, Elahi Amir Mohammad, Greiner Maximilian, Holick Caroline T., Gupta Tanya, Asgari Mehrdad, Glaubitz Christina, Klepsch Lea C., Köster Yannik, Meyer Jakob, Miret Santiago, Hoffmann Tim, Kreth Fabian Alexander, Ringleb Michael, Roesner Nicole, Schubert Ulrich S., Stafast Leanne M., Wonanke Dinga, Pieler Michael, Schwaller Philippe, Jablonka Kevin Maik
- CBF-LLM: Safe Control For LLM Alignment Miyaoka Yuya, Inoue Masaki
- Can We Trust Large Language Models Generated Code? A Framework For In-context Learning, Security Patterns, And Code Evaluations Across Diverse Llms Mohsin Ahmad, Janicke Helge, Wood Adrian, Sarker Iqbal H., Maglaras Leandros, Janjua Naeem
- Benchmarking Llama2, Mistral, Gemma And GPT For Factuality, Toxicity, Bias And Propensity For Hallucinations Nadeau David, Kroutikov Mike, Mcneil Karen, Baribeau Simon
- Aurora-m: The First Open Source Multilingual Language Model Red-teamed According To The U.S. Executive Order Nakamura Taishi, Mishra Mayank, Tedeschi Simone, Chai Yekun, Stillerman Jason T, Friedrich Felix, Yadav Prateek, Laud Tanmay, Chien Vu Minh, Zhuo Terry Yue, Misra Diganta, Bogin Ben, Vu Xuan-son, Karpinska Marzena, Dantuluri Arnav Varma, Kusa Wojciech, Furlanello Tommaso, Yokota Rio, Muennighoff Niklas, Pai Suhas, Adewumi Tosin, Laippala Veronika, Yao Xiaozhe, Junior Adalberto, Ariyak Alpay, Drozd Aleksandr, Clive Jordan, Gupta Kshitij, Chen Liangyu, Sun Qi, Tsui Ken, Persaud Noah, Fahmy Nour, Chen Tianlong, Bansal Mohit, Monti Nicolo, Dang Tai, Luo Ziyang, Bui Tien-tung, Navigli Roberto, Mehta Virendra, Blumberg Matthew, May Victor, Nguyen Huu, Pyysalo Sampo
- MBBQ: A Dataset For Cross-lingual Comparison Of Stereotypes In Generative Llms Neplenbroek Vera, Bisazza Arianna, Fernández Raquel
- Text-to-drive: Diverse Driving Behavior Synthesis Via Large Language Models Nguyen Phat, Wang Tsun-hsuan, Hong Zhang-wei, Karaman Sertac, Rus Daniela
- Aligners: Decoupling Llms And Alignment Ngweta Lilian, Agarwal Mayank, Maity Subha, Gittens Alex, Sun Yuekai, Yurochkin Mikhail
- How Reliable AI Chatbots Are For Disease Prediction From Patient Complaints? Nipu Ayesha Siddika, Islam K M Sajjadul, Madiraju Praveen
- Gemini Goes To Med School: Exploring The Capabilities Of Multimodal Large Language Models On Medical Challenge Problems & Hallucinations Pal Ankit, Sankarasubbu Malaikannan
- LLM Evaluators Recognize And Favor Their Own Generations Panickssery Arjun, Bowman Samuel R., Feng Shi
- Learning To See But Forgetting To Follow: Visual Instruction Tuning Makes Llms More Prone To Jailbreak Attacks Pantazopoulos Georgios, Parekh Amit, Nikandrou Malvina, Suglia Alessandro
- Building Trust In Mental Health Chatbots: Safety Metrics And Llm-based Evaluation Tools Park Jung In, Abbasian Mahyar, Azimi Iman, Bounds Dawn, Jun Angela, Han Jaesu, Mccarron Robert, Borelli Jessica, Li Jia, Mahmoudi Mona, Wiedenhoeft Carmen, Rahmani Amir
- The Ultimate Guide To Fine-tuning Llms From Basics To Breakthroughs: An Exhaustive Review Of Technologies, Research, Best Practices, Applied Research Challenges And Opportunities Parthasarathy Venkatesh Balavadhani, Zafar Ahtsham, Khan Aafaq, Shahid Arsalan
- Towards Reliable Medical Question Answering: Techniques And Challenges In Mitigating Hallucinations In Language Models Pham Duy Khoa, Vo Bao Quoc
- Mllm-protector: Ensuring Mllm's Safety Without Hurting Performance Pi Renjie, Han Tianyang, Zhang Jianshu, Xie Yueqi, Pan Rui, Lian Qing, Dong Hanze, Zhang Jipeng, Zhang Tong
- Fractured-sorry-bench: Framework For Revealing Attacks In Conversational Turns Undermining Refusal Efficacy And Defenses Over Sorry-bench Priyanshu Aman, Vijay Supriti
- Helpful Or Harmful? Exploring The Efficacy Of Large Language Models For Online Grooming Prevention Prosser Ellie, Edwards Matthew
- A Moral Imperative: The Need For Continual Superalignment Of Large Language Models Puthumanaillam Gokul, Vora Manav, Thangeda Pranay, Ornik Melkior
- Safety Alignment Should Be Made More Than Just A Few Tokens Deep Qi Xiangyu, Panda Ashwinee, Lyu Kaifeng, Ma Xiao, Roy Subhrajit, Beirami Ahmad, Mittal Prateek, Henderson Peter
- Visual Hallucination: Definition, Quantification, And Prescriptive Remediations Rani Anku, Rawte Vipula, Sharma Harshad, Anand Neeraj, Rajbangshi Krishnav, Sheth Amit, Das Amitava
- Mitigating Exaggerated Safety In Large Language Models Ray Ruchira, Bhalani Ruchi
- FAIR Enough: How Can We Develop And Assess A Fair-compliant Dataset For Large Language Models' Training? Raza Shaina, Ghuge Shardul, Ding Chen, Dolatabadi Elham, Pandya Deval
- Beads: Bias Evaluation Across Domains Raza Shaina, Rahman Mizanur, Zhang Michael R.
- MBIAS: Mitigating Bias In Large Language Models While Retaining Context Raza Shaina, Raval Ananya, Chatrath Veronica
- Codeattack: Revealing Safety Generalization Challenges Of Large Language Models Via Code Completion Ren Qibing, Gao Chang, Shao Jing, Yan Junchi, Tan Xin, Lam Wai, Ma Lizhuang
- Healthcare Copilot: Eliciting The Power Of General Llms For Medical Consultation Ren Zhiyao, Zhan Yibing, Yu Baosheng, Ding Liang, Tao Dacheng
- Do GPT Language Models Suffer From Split Personality Disorder? The Advent Of Substrate-free Psychometrics Romero Peter, Fitz Stephen, Nakatsuma Teruo
- Exploring The Potential Of The Large Language Models (llms) In Identifying Misleading News Headlines Rony Md Main Uddin, Haque Md Mahfuzul, Ali Mohammad, Alam Ahmed Shatil, Hassan Naeemul
- Safetyprompts: A Systematic Review Of Open Datasets For Evaluating And Improving Large Language Model Safety Röttger Paul, Pernisi Fabio, Vidgen Bertie, Hovy Dirk
- Capabilities Of Gemini Models In Medicine Saab Khaled, Tu Tao, Weng Wei-hung, Tanno Ryutaro, Stutz David, Wulczyn Ellery, Zhang Fan, Strother Tim, Park Chunjong, Vedadi Elahe, Chaves Juanma Zambrano, Hu Szu-yeu, Schaekermann Mike, Kamath Aishwarya, Cheng Yong, Barrett David G. T., Cheung Cathy, Mustafa Basil, Palepu Anil, Mcduff Daniel, Hou Le, Golany Tomer, Liu Luyang, Alayrac Jean-baptiste, Houlsby Neil, Tomasev Nenad, Freyberg Jan, Lau Charles, Kemp Jonas, Lai Jeremy, Azizi Shekoofeh, Kanada Kimberly, Man Siwai, Kulkarni Kavita, Sun Ruoxi, Shakeri Siamak, He Luheng, Caine Ben, Webson Albert, Latysheva Natasha, Johnson Melvin, Mansfield Philip, Lu Jian, Rivlin Ehud, Anderson Jesper, Green Bradley, Wong Renee, Krause Jonathan, Shlens Jonathon, Dominowska Ewa, Eslami S. M. Ali, Chou Katherine, Cui Claire, Vinyals Oriol, Kavukcuoglu Koray, Manyika James, Dean Jeff, Hassabis Demis, Matias Yossi, Webster Dale, Barral Joelle, Corrado Greg, Semturs Christopher, Mahdavi S. Sara, Gottweis Juraj, Karthikesalingam Alan, Natarajan Vivek
- Athena: Safe Autonomous Agents With Verbal Contrastive Learning Sadhu Tanmana, Pesaranghader Ali, Chen Yanan, Yi Dong Hoon
- Rainbow Teaming: Open-ended Generation Of Diverse Adversarial Prompts Samvelyan Mikayel, Raparthy Sharath Chandra, Lupu Andrei, Hambro Eric, Markosyan Aram H., Bhatt Manish, Mao Yuning, Jiang Minqi, Parker-holder Jack, Foerster Jakob, Rocktäschel Tim, Raileanu Roberta
- Soft Prompt Threats: Attacking Safety Alignment And Unlearning In Open-source Llms Through The Embedding Space Schwinn Leo, Dobre David, Xhonneux Sophie, Gidel Gauthier, Günnemann Stephan
- Evaluating Large Language Models With Fmeval Schwöbel Pola, Franceschi Luca, Zafar Muhammad Bilal, Vasist Keerthan, Malhotra Aman, Shenhar Tomer, Tailor Pinal, Yilmaz Pinar, Diamond Michael, Donini Michele
- Rapid Optimization For Jailbreaking Llms Via Subconscious Exploitation And Echopraxia Shen Guangyu, Cheng Siyuan, Zhang Kaiyuan, Tao Guanhong, An Shengwei, Yan Lu, Zhang Zhuo, Ma Shiqing, Zhang Xiangyu
- The Language Barrier: Dissecting Safety Challenges Of Llms In Multilingual Contexts Shen Lingfeng, Tan Weiting, Chen Sihao, Chen Yunmo, Zhang Jingyu, Xu Haoran, Zheng Boyuan, Koehn Philipp, Khashabi Daniel
- Profuser: Progressive Fusion Of Large Language Models Shi Tianyuan, Wan Fanqi, Huang Canbin, Quan Xiaojun, Li Chenliang, Yan Ming, Zhang Ji
- Can Prompt Modifiers Control Bias? A Comparative Analysis Of Text-to-image Generative Models Shin Philip Wootaek, Ahn Jihyun Janice, Yin Wenpeng, Sampson Jack, Narayanan Vijaykrishnan
- Multilingual Blending: LLM Safety Alignment Evaluation With Language Mixture Song Jiayang, Huang Yuheng, Zhou Zhehua, Ma Lei
- Canttalkaboutthis: Aligning Language Models To Stay On Topic In Dialogues Sreedhar Makesh Narsimhan, Rebedea Traian, Ghosh Shaona, Zeng Jiaqi, Parisien Christopher
- Exploring The Adversarial Capabilities Of Large Language Models Struppek Lukas, Le Minh Hieu, Hintersdorf Dominik, Kersting Kristian
- Improving Large Language Model (LLM) Fidelity Through Context-aware Grounding: A Systematic Approach To Reliability And Veracity Talukdar Wrick, Biswas Anjanava
- Fine-tuning Large Language Model (LLM) Artificial Intelligence Chatbots In Ophthalmology And Llm-based Evaluation Using GPT-4 Tan Ting Fang, Elangovan Kabilan, Jin Liyuan, Jie Yao, Yong Li, Lim Joshua, Poh Stanley, Ng Wei Yan, Lim Daniel, Ke Yuhe, Liu Nan, Ting Daniel Shu Wei
- Tuning-free Accountable Intervention For LLM Deployment -- A Metacognitive Approach Tan Zhen, Peng Jie, Chen Tianlong, Liu Huan
- Jill Watson: A Virtual Teaching Assistant Powered By Chatgpt Taneja Karan, Maiti Pratyusha, Kakar Sandeep, Guruprasad Pranav, Rao Sanjeev, Goel Ashok K.
- A + B: A General Generator-reader Framework For Optimizing Llms To Unleash Synergy Potential Tang Wei, Cao Yixin, Ying Jiahao, Wang Bo, Zhao Yuyue, Liao Yong, Zhou Pengyuan
- Prioritizing Safeguarding Over Autonomy: Risks Of LLM Agents For Science Tang Xiangru, Jin Qiao, Zhu Kunlun, Yuan Tongxin, Zhang Yichi, Zhou Wangchunshu, Qu Meng, Zhao Yilun, Tang Jian, Zhang Zhuosheng, Cohan Arman, Lu Zhiyong, Gerstein Mark
- When To Trust Llms: Aligning Confidence With Response Quality Tao Shuchang, Yao Liuyi, Ding Hanxing, Xie Yuexiang, Cao Qi, Sun Fei, Gao Jinyang, Shen Huawei, Ding Bolin
- Connecting The Dots: Llms Can Infer And Verbalize Latent Structure From Disparate Training Data Treutlein Johannes, Choi Dami, Betley Jan, Anil Cem, Marks Samuel, Grosse Roger Baker, Evans Owain
- Insaaf: Incorporating Safety Through Accuracy And Fairness | Are Llms Ready For The Indian Legal Domain? Tripathi Yogesh, Donakanti Raghav, Girhepuje Sahil, Kavathekar Ishan, Vedula Bhaskara Hanuma, Krishnan Gokul S, Goyal Shreya, Goel Anmol, Ravindran Balaraman, Kumaraguru Ponnurangam
- Towards Safety And Helpfulness Balanced Responses Via Controllable Large Language Models Tuan Yi-lin, Chen Xilun, Smith Eric Michael, Martin Louis, Batra Soumya, Celikyilmaz Asli, Wang William Yang, Bikel Daniel M.
- Calibrating Large Language Models Using Their Generations Only Ulmer Dennis, Gubri Martin, Lee Hwaran, Yun Sangdoo, Oh Seong Joon
- Sandwich Attack: Multi-language Mixture Adaptive Attack On Llms Upadhayay Bibek, Behzadan Vahid
- Autopuredata: Automated Filtering Of Web Data For LLM Fine-tuning Vadlapati Praneeth
- AI Sandbagging: Language Models Can Strategically Underperform On Evaluations Van Der Weij Teun, Hofstätter Felix, Jaffe Ollie, Brown Samuel F., Ward Francis Rhys
- Exploring Safety-utility Trade-offs In Personalized Language Models Vijjini Anvesh Rao, Chowdhury Somnath Basu Roy, Chaturvedi Snigdha
- A Survey For Large Language Models In Biomedicine Wang Chong, Li Mengyao, He Junjun, Wang Zhongruo, Darzi Erfan, Chen Zan, Ye Jin, Li Tianbin, Su Yanzhou, Ke Jing, Qu Kaili, Li Shuxin, Yu Yi, Liò Pietro, Wang Tianyun, Wang Yu Guang, Shen Yiqing
- Interpretable Preferences Via Multi-objective Reward Modeling And Mixture-of-experts Wang Haoxiang, Xiong Wei, Xie Tengyang, Zhao Han, Zhang Tong
- Probing The Safety Response Boundary Of Large Language Models Via Unsafe Decoding Path Generation Wang Haoyu, Wu Bingzhe, Bian Yatao, Chang Yongzhe, Wang Xueqian, Zhao Peilin
- Inferaligner: Inference-time Alignment For Harmlessness Through Cross-model Guidance Wang Pengyu, Zhang Dong, Li Linyang, Tan Chenkun, Wang Xinghao, Ren Ke, Jiang Botian, Qiu Xipeng
- "my Answer Is C": First-token Probabilities Do Not Match Text Answers In Instruction-tuned Language Models Wang Xinpeng, Ma Bolei, Hu Chengzhi, Weber-genzel Leon, Röttger Paul, Kreuter Frauke, Hovy Dirk, Plank Barbara
- Farsight: Fostering Responsible AI Awareness During AI Application Prototyping Wang Zijie J., Kulkarni Chinmay, Wilcox Lauren, Terry Michael, Madaio Michael
- Poisoned Langchain: Jailbreak Llms By Langchain Wang Ziqiu, Liu Jun, Zhang Shengkai, Yang Yang
- Cogergllm: Exploring Large Language Model Systems Design Perspective Using Cognitive Ergonomics Wasi Azmine Toushik
- Performance Evaluation Of Lightweight Open-source Large Language Models In Pediatric Consultations: A Comparative Analysis Wei Qiuhong, Cui Ying, Ding Mengwei, Wang Yanqin, Xiang Lingling, Yao Zhengxiong, Chen Ceran, Long Ying, Jin Zhezhen, Xu Ximing
- Leveraging Large Language Models For Patient Engagement: The Power Of Conversational AI In Digital Health Wen Bo, Norel Raquel, Liu Julia, Stappenbeck Thaddeus, Zulkernine Farhana, Chen Huamin
- In-context Learning Can Re-learn Forbidden Tasks Xhonneux Sophie, Dobre David, Tang Jian, Gidel Gauthier, Sridhar Dhanya
- Guardagent: Safeguard LLM Agents By A Guard Agent Via Knowledge-enabled Reasoning Xiang Zhen, Zheng Linzhi, Li Yanjie, Hong Junyuan, Li Qinbin, Xie Han, Zhang Jiawei, Xiong Zidi, Xie Chulin, Yang Carl, Song Dawn, Li Bo
- Sorry-bench: Systematically Evaluating Large Language Model Safety Refusal Behaviors Xie Tinghao, Qi Xiangyu, Zeng Yi, Huang Yangsibo, Sehwag Udari Madhushani, Huang Kaixuan, He Luxi, Wei Boyi, Li Dacheng, Sheng Ying, Jia Ruoxi, Li Bo, Li Kai, Chen Danqi, Henderson Peter, Mittal Prateek
- Gradsafe: Detecting Jailbreak Prompts For Llms Via Safety-critical Gradient Analysis Xie Yueqi, Fang Minghong, Pi Renjie, Gong Neil
- Defensive Prompt Patch: A Robust And Interpretable Defense Of Llms Against Jailbreak Attacks Xiong Chen, Qi Xiangyu, Chen Pin-yu, Ho Tsung-yi
- Redagent: Red Teaming Large Language Models With Context-aware Autonomous Language Agent Xu Huiyu, Zhang Wenhui, Wang Zhibo, Xiao Feng, Zheng Rui, Feng Yunhe, Ba Zhongjie, Ren Kui
- Course-correction: Safety Alignment Using Synthetic Preferences Xu Rongwu, Cai Yishuo, Zhou Zhenhong, Gu Renjie, Weng Haiqin, Liu Yan, Zhang Tianwei, Xu Wei, Qiu Han
- Safedecoding: Defending Against Jailbreak Attacks Via Safety-aware Decoding Xu Zhangchen, Jiang Fengqing, Niu Luyao, Jia Jinyuan, Lin Bill Yuchen, Poovendran Radha
- Beyond Perplexity: Multi-dimensional Safety Evaluation Of LLM Compression Xu Zhichao, Gupta Ashim, Li Tao, Bentham Oliver, Srikumar Vivek
- Uncovering Safety Risks Of Large Language Models Through Concept Activation Vector Xu Zhihao, Huang Ruixuan, Chen Changyu, Wang Shuai, Wang Xiting
- A Comprehensive Study Of Jailbreak Attack Versus Defense For Large Language Models Xu Zihao, Liu Yi, Deng Gelei, Li Yuekang, Picek Stjepan
- No Token Left Behind: Reliable KV Cache Compression Via Importance-aware Mixed Precision Quantization Yang June Yong, Kim Byeongwook, Bae Jeongin, Kwon Beomseok, Park Gunho, Yang Eunho, Kwon Se Jung, Lee Dongsoo
- Guardt2i: Defending Text-to-image Models From Adversarial Prompts Yang Yijun, Gao Ruiyuan, Yang Xiao, Zhong Jianyuan, Xu Qiang
- Self-distillation Bridges Distribution Gap In Language Model Fine-tuning Yang Zhaorui, Pang Tianyu, Feng Haozhe, Wang Han, Chen Wei, Zhu Minfeng, Liu Qian
- Machine Unlearning Of Pre-trained Large Language Models Yao Jin, Chien Eli, Du Minxin, Niu Xinyao, Wang Tianhao, Cheng Zezhou, Yue Xiang
- Toolsword: Unveiling Safety Issues Of Large Language Models In Tool Learning Across Three Stages Ye Junjie, Li Sixian, Li Guanyu, Huang Caishuang, Gao Songyang, Wu Yilong, Zhang Qi, Gui Tao, Huang Xuanjing
- Jailbreak Attacks And Defenses Against Large Language Models: A Survey Yi Sibo, Liu Yule, Sun Zhen, Cong Tianshuo, He Xinlei, Song Jiaxing, Xu Ke, Li Qi
- Unveiling The Safety Of Gpt-4o: An Empirical Study Using Jailbreak Attacks Ying Zonghao, Liu Aishan, Liu Xianglong, Tao Dacheng
- Safety Fine-tuning At (almost) No Cost: A Baseline For Vision Large Language Models Zong Yongshuo, Bohdal Ondrej, Yu Tingyang, Yang Yongxin, Hospedales Timothy
- CSRT: Evaluation And Analysis Of Llms Using Code-switching Red-teaming Dataset Yoo Haneul, Yang Yongjin, Lee Hwaran
- Hyperclova X Technical Report Yoo Kang Min, Han Jaegeun, In Sookyo, Jeon Heewon, Jeong Jisu, Kang Jaewook, Kim Hyunwook, Kim Kyung-min, Kim Munhyong, Kim Sungju, Kwak Donghyun, Kwak Hanock, Kwon Se Jung, Lee Bado, Lee Dongsoo, Lee Gichang, Lee Jooho, Park Baeseong, Shin Seongjin, Yu Joonsang, Baek Seolki, Byeon Sumin, Cho Eungsup, Choe Dooseok, Han Jeesung, Jin Youngkyun, Jun Hyein, Jung Jaeseung, Kim Chanwoong, Kim Jinhong, Kim Jinuk, Lee Dokyeong, Park Dongwook, Sohn Jeong Min, Han Sujung, Heo Jiae, Hong Sungju, Jeon Mina, Jung Hyunhoon, Jung Jungeun, Jung Wangkyo, Kim Chungjoon, Kim Hyeri, Kim Jonghyun, Kim Min Young, Lee Soeun, Park Joonhee, Shin Jieun, Yang Sojin, Yoon Jungsoon, Lee Hwaran, Bae Sanghwan, Cha Jeehwan, Gylleus Karl, Ham Donghoon, Hong Mihak, Hong Youngki, Hong Yunki, Jang Dahyun, Jeon Hyojun, Jeon Yujin, Jeong Yeji, Ji Myunggeun, Jin Yeguk, Jo Chansong, Joo Shinyoung, Jung Seunghwan, Kim Adrian Jungmyung, Kim Byoung Hoon, Kim Hyomin, Kim Jungwhan, Kim Minkyoung, Kim Minseung, Kim Sungdong, Kim Yonghee, Kim Youngjun, Kim Youngkwan, Ko Donghyeon, Lee Dughyun, Lee Ha Young, Lee Jaehong, Lee Jieun, Lee Jonghyun, Lee Jongjin, Lee Min Young, Lee Yehbin, Min Taehong, Min Yuri, Moon Kiyoon, Oh Hyangnam, Park Jaesun, Park Kyuyon, Park Younghun, Seo Hanbae, Seo Seunghyun, Sim Mihyun, Son Gyubin, Yeo Matt, Yeom Kyung Hoon, Yoo Wonjoon, You Myungin, Ahn Doheon, Ahn Homin, Ahn Joohee, Ahn Seongmin, An Chanwoo, An Hyeryun, An Junho, An Sang-min, Byun Boram, Byun Eunbin, Cha Jongho, Chang Minji, Chang Seunggyu, Cho Haesong, Cho Youngdo, Choi Dalnim, Choi Daseul, Choi Hyoseok, Choi Minseong, Choi Sangho, Choi Seongjae, Choi Wooyong, Chun Sewhan, Go Dong Young, Ham Chiheon, Han Danbi, Han Jaemin, Hong Moonyoung, Hong Sung Bum, Hwang Dong-hyun, Hwang Seongchan, Im Jinbae, Jang Hyuk Jin, Jang Jaehyung, Jang Jaeni, Jang Sihyeon, Jang Sungwon, Jeon Joonha, Jeong Daun, Jeong Joonhyun, Jeong Kyeongseok, Jeong Mini, Jin Sol, Jo Hanbyeol, Jo Hanju, Jo Minjung, Jung Chaeyoon, Jung Hyungsik, Jung Jaeuk, Jung Ju Hwan, Jung Kwangsun, Jung Seungjae, Ka Soonwon, Kang Donghan, Kang Soyoung, Kil Taeho, Kim Areum, Kim Beomyoung, Kim Byeongwook, Kim Daehee, Kim Dong-gyun, Kim Donggook, Kim Donghyun, Kim Euna, Kim Eunchul, Kim Geewook, Kim Gyu Ri, Kim Hanbyul, Kim Heesu, Kim Isaac, Kim Jeonghoon, Kim Jihye, Kim Joonghoon, Kim Minjae, Kim Minsub, Kim Pil Hwan, Kim Sammy, Kim Seokhun, Kim Seonghyeon, Kim Soojin, Kim Soong, Kim Soyoon, Kim Sunyoung, Kim Taeho, Kim Wonho, Kim Yoonsik, Kim You Jin, Kim Yuri, Kwon Beomseok, Kwon Ohsung, Kwon Yoo-hwan, Lee Anna, Lee Byungwook, Lee Changho, Lee Daun, Lee Dongjae, Lee Ha-ram, Lee Hodong, Lee Hwiyeong, Lee Hyunmi, Lee Injae, Lee Jaeung, Lee Jeongsang, Lee Jisoo, Lee Jongsoo, Lee Joongjae, Lee Juhan, Lee Jung Hyun, Lee Junghoon, Lee Junwoo, Lee Se Yun, Lee Sujin, Lee Sungjae, Lee Sungwoo, Lee Wonjae, Lee Zoo Hyun, Lim Jong Kun, Lim Kun, Lim Taemin, Na Nuri, Nam Jeongyeon, Nam Kyeong-min, Noh Yeonseog, Oh Biro, Oh Jung-sik, Oh Solgil, Oh Yeontaek, Park Boyoun, Park Cheonbok, Park Dongju, Park Hyeonjin, Park Hyun Tae, Park Hyunjung, Park Jihye, Park Jooseok, Park Junghwan, Park Jungsoo, Park Miru, Park Sang Hee, Park Seunghyun, Park Soyoung, Park Taerim, Park Wonkyeong, Ryu Hyunjoon, Ryu Jeonghun, Ryu Nahyeon, Seo Soonshin, Seo Suk Min, Shim Yoonjeong, Shin Kyuyong, Shin Wonkwang, Sim Hyun, Sim Woongseob, Soh Hyejin, Son Bokyong, Son Hyunjun, Son Seulah, Song Chi-yun, Song Chiyoung, Song Ka Yeon, Song Minchul, Song Seungmin, Wang Jisung, Yeo Yonggoo, Yi Myeong Yeon, Yim Moon Bin, Yoo Taehwan, Yoo Youngjoon, Yoon Sungmin, Yoon Young Jin, Yu Hangyeol, Yu Ui Seon, Zuo Xingdong, Bae Jeongin, Bae Joungeun, Cho Hyunsoo, Cho Seonghyun, Cho Yongjin, Choi Taekyoon, Choi Yera, Chung Jiwan, Han Zhenghui, Heo Byeongho, Hong Euisuk, Hwang Taebaek, Im Seonyeol, Jegal Sumin, Jeon Sumin, Jeong Yelim, Jeong Yonghyun, Jiang Can, Jiang Juyong, Jin Jiho, Jo Ara, Jo Younghyun, Jung Hoyoun, Jung Juyoung, Kang Seunghyeong, Kim Dae Hee, Kim Ginam, Kim Hangyeol, Kim Heeseung, Kim Hyojin, Kim Hyojun, Kim Hyun-ah, Kim Jeehye, Kim Jin-hwa, Kim Jiseon, Kim Jonghak, Kim Jung Yoon, Kim Rak Yeong, Kim Seongjin, Kim Seoyoon, Kim Sewon, Kim Sooyoung, Kim Sukyoung, Kim Taeyong, Ko Naeun, Koo Bonseung, Kwak Heeyoung, Kwon Haena, Kwon Youngjin, Lee Boram, Lee Bruce W., Lee Dagyeong, Lee Erin, Lee Euijin, Lee Ha Gyeong, Lee Hyojin, Lee Hyunjeong, Lee Jeeyoon, Lee Jeonghyun, Lee Jongheok, Lee Joonhyung, Lee Junhyuk, Lee Mingu, Lee Nayeon, Lee Sangkyu, Lee Se Young, Lee Seulgi, Lee Seung Jin, Lee Suhyeon, Lee Yeonjae, Lee Yesol, Lee Youngbeom, Lee Yujin, Li Shaodong, Liu Tianyu, Moon Seong-eun, Moon Taehong, Nihlenramstroem Max-lasse, Oh Wonseok, Oh Yuri, Park Hongbeen, Park Hyekyung, Park Jaeho, Park Nohil, Park Sangjin, Ryu Jiwon, Ryu Miru, Ryu Simo, Seo Ahreum, Seo Hee, Seo Kangdeok, Shin Jamin, Shin Seungyoun, Sin Heetae, Wang Jiangping, Wang Lei, Xiang Ning, Xiao Longxiang, Xu Jing, Yi Seonyeong, Yoo Haanju, Yoo Haneul, Yoo Hwanhee, Yu Liang, Yu Youngjae, Yuan Weijie, Zeng Bo, Zhou Qian, Cho Kyunghyun, Ha Jung-woo, Park Joonsuk, Hwang Jihyun, Kwon Hyoung Jo, Kwon Soonyong, Lee Jungyeon, Lee Seungho, Lim Seonghyeon, Noh Hyunkyung, Choi Seungho, Lee Sang-woo, Lim Jung Hwa, Sung Nako
- Cosafe: Evaluating Large Language Model Safety In Multi-turn Dialogue Coreference Yu Erxin, Li Jing, Liao Ming, Wang Siqi, Gao Zuchen, Mi Fei, Hong Lanqing
- R-judge: Benchmarking Safety Risk Awareness For LLM Agents Yuan Tongxin, He Zhiwei, Dong Lingzhong, Wang Yiming, Zhao Ruijie, Xia Tian, Xu Lizhen, Zhou Binglin, Li Fangqi, Zhang Zhuosheng, Wang Rui, Liu Gongshen
- Refuse Whenever You Feel Unsafe: Improving Safety In Llms Via Decoupled Refusal Training Yuan Youliang, Jiao Wenxiang, Wang Wenxuan, Huang Jen-tse, Xu Jiahao, Liang Tian, He Pinjia, Tu Zhaopeng
- Shieldgemma: Generative AI Content Moderation Based On Gemma Zeng Wenjun, Liu Yuchi, Mullins Ryan, Peran Ludovic, Fernandez Joe, Harkous Hamza, Narasimhan Karthik, Proud Drew, Kumar Piyush, Radharapu Bhaktipriya, Sturman Olivia, Wahltinez Oscar
- How Johnny Can Persuade Llms To Jailbreak Them: Rethinking Persuasion To Challenge AI Safety By Humanizing Llms Zeng Yi, Lin Hongpeng, Zhang Jingwen, Yang Diyi, Jia Ruoxi, Shi Weiyan
- Enja: Ensemble Jailbreak On Large Language Models Zhang Jiahao, Wang Zilong, Wang Ruofan, Ma Xingjun, Jiang Yu-gang
- Methodology Of Adapting Large English Language Models For Specific Cultural Contexts Zhang Wenjing, Xiao Siqi, Lei Xuejiao, Wang Ning, Zhang Huazheng, An Meijuan, Yang Bikun, Liu Zhaoxiang, Wang Kai, Lian Shiguo
- Seallms 3: Open Foundation And Chat Multilingual Large Language Models For Southeast Asian Languages Zhang Wenxuan, Chan Hou Pong, Zhao Yiran, Aljunied Mahani, Wang Jianyu, Liu Chaoqun, Deng Yue, Hu Zhiqiang, Xu Weiwen, Chia Yew Ken, Li Xin, Bing Lidong
- Benchmarking Trustworthiness Of Multimodal Large Language Models: A Comprehensive Study Zhang Yichi, Huang Yao, Sun Yitong, Liu Chang, Zhao Zhe, Fang Zhengwei, Wang Yifan, Chen Huanran, Yang Xiao, Wei Xingxing, Su Hang, Dong Yinpeng, Zhu Jun
- Intention Analysis Makes Llms A Good Jailbreak Defender Zhang Yuqi, Ding Liang, Zhang Lefei, Tao Dacheng
- Shieldlm: Empowering Llms As Aligned, Customizable And Explainable Safety Detectors Zhang Zhexin, Lu Yida, Ma Jingyuan, Zhang Di, Li Rui, Ke Pei, Sun Hao, Sha Lei, Sui Zhifang, Wang Hongning, Huang Minlie
- Safe Unlearning: A Surprisingly Effective And Generalizable Solution To Defend Against Jailbreak Attacks Zhang Zhexin, Yang Junxiao, Ke Pei, Cui Shiyao, Zheng Chujie, Wang Hongning, Huang Minlie
- Defending Large Language Models Against Jailbreak Attacks Via Layer-specific Editing Zhao Wei, Li Zhe, Li Yige, Zhang Ye, Sun Jun
- Towards Comprehensive And Efficient Post Safety Alignment Of Large Language Models Via Safety Patching Zhao Weixiang, Hu Yulin, Li Zhuojun, Deng Yang, Zhao Yanyan, Qin Bing, Chua Tat-seng
- Adversarial Contrastive Decoding: Boosting Safety Alignment Of Large Language Models Via Opposite Prompt Optimization Zhao Zhengyue, Zhang Xiaoyun, Xu Kaidi, Hu Xing, Zhang Rui, Du Zidong, Guo Qi, Chen Yunji
- Balancing Enhancement, Harmlessness, And General Capabilities: Enhancing Conversational Llms With Direct RLHF Zheng Chen, Sun Ke, Wu Hang, Xi Chenguang, Zhou Xun
- On Prompt-driven Safeguarding For Large Language Models Zheng Chujie, Yin Fan, Zhou Hao, Meng Fandong, Zhou Jie, Chang Kai-wei, Huang Minlie, Peng Nanyun
- ROSE Doesn't Do That: Boosting The Safety Of Instruction-tuned Large Language Models With Reverse Prompt Contrastive Decoding Zhong Qihuang, Ding Liang, Liu Juhua, Du Bo, Tao Dacheng
- Sequence To Sequence Reward Modeling: Improving RLHF By Language Feedback Zhou Jiayi, Ji Jiaming, Dai Juntao, Yang Yaodong
- Large Language Models Are Involuntary Truth-tellers: Exploiting Fallacy Failure For Jailbreak Attacks Zhou Yue, Zou Henry Peng, Di Eugenio Barbara, Zhang Yang
- Don't Say No: Jailbreaking LLM By Suppressing Refusal Zhou Yukai, Wang Wenjie
- Emulated Disalignment: Safety Alignment For Large Language Models May Backfire! Zhou Zhanhui, Liu Jie, Dong Zhichen, Liu Jiaheng, Yang Chao, Ouyang Wanli, Qiao Yu
- How Alignment And Jailbreak Work: Explain LLM Safety Through Intermediate Hidden States Zhou Zhenhong, Yu Haiyang, Zhang Xinghua, Xu Rongwu, Huang Fei, Li Yongbin
- Aya Model: An Instruction Finetuned Open-access Multilingual Language Model Üstün Ahmet, Aryabumi Viraat, Yong Zheng-xin, Ko Wei-yin, D'souza Daniel, Onilude Gbemileke, Bhandari Neel, Singh Shivalika, Ooi Hui-lee, Kayid Amr, Vargus Freddie, Blunsom Phil, Longpre Shayne, Muennighoff Niklas, Fadaee Marzieh, Kreutzer Julia, Hooker Sara
🏷 Scaling Laws
- Scaling Laws For Neural Language Models Jared Kaplan, Sam Mccandlish, Tom Henighan, Tom B. Brown, Benjamin Chess, Rewon Child, Scott Gray, Alec Radford, Jeffrey Wu, Dario Amodei
- Neural Machine Translation With Joint Representation Li Yanyang, Wang Qiang, Xiao Tong, Liu Tongran, Zhu Jingbo
- Pagnol: An Extra-large French Generative Model Launay Julien, Tommasone Elena, Pannier Baptiste, Boniface François, Chatelain Amélie, Cappelli Alessandro, Poli Iacopo, Seddah Djamé
- NVIDIA Nemo Neural Machine Translation Systems For English-german And English-russian News And Biomedical Tasks At WMT21 Subramanian Sandeep, Hrinchuk Oleksii, Adams Virginia, Kuchaiev Oleksii
- An Explanation Of In-context Learning As Implicit Bayesian Inference Xie Sang Michael, Raghunathan Aditi, Liang Percy, Ma Tengyu
- Cramming: Training A Language Model On A Single GPU In One Day Geiping Jonas, Goldstein Tom
- Lamda: Language Models For Dialog Applications Romal Thoppilan, Daniel De Freitas, Jamie Hall, Noam Shazeer, Apoorv Kulshreshtha, Heng-tze Cheng, Alicia Jin, Taylor Bos, Leslie Baker, Yu Du, Yaguang Li, Hongrae Lee, Huaixiu Steven Zheng, Amin Ghafouri, Marcelo Menegali, Yanping Huang, Maxim Krikun, Dmitry Lepikhin, James Qin, Dehao Chen, Yuanzhong Xu, Zhifeng Chen, Adam Roberts, Maarten Bosma, Vincent Zhao, Yanqi Zhou, Chung-ching Chang, Igor Krivokon, Will Rusch, Marc Pickett, Pranesh Srinivasan, Laichee Man, Kathleen Meier-hellstern, Meredith Ringel Morris, Tulsee Doshi, Renelito Delos Santos, Toju Duke, Johnny Soraker, Ben Zevenbergen, Vinodkumar Prabhakaran, Mark Diaz, Ben Hutchinson, Kristen Olson, Alejandra Molina, Erin Hoffman-john, Josh Lee, Lora Aroyo, Ravi Rajakumar, Alena Butryna, Matthew Lamm, Viktoriya Kuzmina, Joe Fenton, Aaron Cohen, Rachel Bernstein, Ray Kurzweil, Blaise Aguera-arcas, Claire Cui, Marian Croak, Ed Chi, Quoc Le
- Staged Training For Transformer Language Models Shen Sheng, Walsh Pete, Keutzer Kurt, Dodge Jesse, Peters Matthew, Beltagy Iz
- Large Language Models Are Zero-shot Reasoners Takeshi Kojima, Shixiang Shane Gu, Machel Reid, Yutaka Matsuo, Yusuke Iwasawa
- Image As A Foreign Language: Beit Pretraining For All Vision And Vision-language Tasks Wang Wenhui, Bao Hangbo, Dong Li, Bjorck Johan, Peng Zhiliang, Liu Qiang, Aggarwal Kriti, Mohammed Owais Khan, Singhal Saksham, Som Subhojit, Wei Furu
- Transcending Scaling Laws With 0.1% Extra Compute Yi Tay, Jason Wei, Hyung Won Chung, Vinh Q. Tran, David R. So, Siamak Shakeri, Xavier Garcia, Huaixiu Steven Zheng, Jinfeng Rao, Aakanksha Chowdhery, Denny Zhou, Donald Metzler, Slav Petrov, Neil Houlsby, Quoc V. Le, Mostafa Dehghani
- Scaling Laws For Language Encoding Models In Fmri Antonello Richard, Vaidya Aditya, Huth Alexander G.
- Revealing The Structure Of Language Model Capabilities Burnell Ryan, Hao Han, Conway Andrew R. A., Orallo Jose Hernandez
- Ziya2: Data-centric Learning Is All Llms Need Gan Ruyi, Wu Ziwei, Sun Renliang, Lu Junyu, Wu Xiaojun, Zhang Dixiang, Pan Kunhao, He Junqing, Tian Yuanhe, Yang Ping, Yang Qi, Wang Hao, Zhang Jiaxing, Song Yan
- Scaling Laws Of Rope-based Extrapolation Liu Xiaoran, Yan Hang, Zhang Shuo, An Chenxin, Qiu Xipeng, Lin Dahua
- Codegen2: Lessons For Training Llms On Programming And Natural Languages Nijkamp Erik, Hayashi Hiroaki, Xiong Caiming, Savarese Silvio, Zhou Yingbo
- Beyond Chinchilla-optimal: Accounting For Inference In Language Model Scaling Laws Sardana Nikhil, Portes Jacob, Doubov Sasha, Frankle Jonathan
- The Languini Kitchen: Enabling Language Modelling Research At Different Scales Of Compute Stanić Aleksandar, Ashley Dylan, Serikov Oleg, Kirsch Louis, Faccio Francesco, Schmidhuber Jürgen, Hofmann Thomas, Schlag Imanol
- Emergent And Predictable Memorization In Large Language Models Stella Biderman, Usvsn Sai Prashanth, Lintang Sutawika, Hailey Schoelkopf, Quentin Anthony, Shivanshu Purohit, Edward Raff
- Nanolm: An Affordable LLM Pre-training Benchmark Via Accurate Loss Prediction Across Scales Yao Yiqun, Fan Siqi, Huang Xiusheng, Fang Xuezhi, Li Xiang, Ni Ziyi, Jiang Xin, Meng Xuying, Han Peng, Shang Shuo, Liu Kang, Sun Aixin, Wang Yequan
- A Survey Of Large Language Models Zhao Wayne Xin, Zhou Kun, Li Junyi, Tang Tianyi, Wang Xiaolei, Hou Yupeng, Min Yingqian, Zhang Beichen, Zhang Junjie, Dong Zican, Du Yifan, Yang Chen, Chen Yushuo, Chen Zhipeng, Jiang Jinhao, Ren Ruiyang, Li Yifan, Tang Xinyu, Liu Zikang, Liu Peiyu, Nie Jian-yun, Wen Ji-rong
- Navgpt: Explicit Reasoning In Vision-and-language Navigation With Large Language Models Zhou Gengze, Hong Yicong, Wu Qi
- Extrapolating Large Language Models To Non-english By Aligning Languages Zhu Wenhao, Lv Yunzhe, Dong Qingxiu, Yuan Fei, Xu Jingjing, Huang Shujian, Kong Lingpeng, Chen Jiajun, Li Lei
- Bridging The Bosphorus: Advancing Turkish Large Language Models Through Strategies For Low-resource Language Adaptation And Benchmarking Acikgoz Emre Can, Erdogan Mete, Yuret Deniz
- A Tale Of Tails: Model Collapse As A Change Of Scaling Laws Dohmatob Elvis, Feng Yunzhen, Yang Pu, Charton Francois, Kempe Julia
- AI And Memory Wall Gholami Amir, Yao Zhewei, Kim Sehoon, Hooper Coleman, Mahoney Michael W., Keutzer Kurt
- Openrlhf: An Easy-to-use, Scalable And High-performance RLHF Framework Hu Jian, Wu Xibin, Wang Weixun, Xianyu, Zhang Dehao, Cao Yu
- Minicpm: Unveiling The Potential Of Small Language Models With Scalable Training Strategies Hu Shengding, Tu Yuge, Han Xu, He Chaoqun, Cui Ganqu, Long Xiang, Zheng Zhi, Fang Yewei, Huang Yuxiang, Zhao Weilin, Zhang Xinrong, Thai Zheng Leng, Zhang Kaihuo, Wang Chongyi, Yao Yuan, Zhao Chenyang, Zhou Jie, Cai Jie, Zhai Zhongwu, Ding Ning, Jia Chao, Zeng Guoyang, Li Dahai, Liu Zhiyuan, Sun Maosong
- Scaling Laws For Downstream Task Performance Of Large Language Models Isik Berivan, Ponomareva Natalia, Hazimeh Hussein, Paparas Dimitris, Vassilvitskii Sergei, Koyejo Sanmi
- Scaling Laws For Forgetting When Fine-tuning Large Language Models Kalajdzievski Damjan
- Are Bigger Encoders Always Better In Vision Large Models? Li Bozhou, Liang Hao, Meng Zimo, Zhang Wentao
- 52B To 1T: Lessons Learned Via Tele-flm Series Li Xiang, Yao Yiqun, Jiang Xin, Fang Xuezhi, Wang Chao, Liu Xinzhang, Wang Zihan, Zhao Yu, Wang Xin, Huang Yuyao, Song Shuangyong, Li Yongxiang, Zhang Zheng, Zhao Bo, Sun Aixin, Wang Yequan, He Zhongjiang, Wang Zhongyuan, Li Xuelong, Huang Tiejun
- Why Are My Prompts Leaked? Unraveling Prompt Extraction Threats In Customized Large Language Models Liang Zi, Hu Haibo, Ye Qingqing, Xiao Yaxin, Li Haoyang
- Fine-tuning Large Language Models For Domain Adaptation: Exploration Of Training Strategies, Scaling, Model Merging And Synergistic Capabilities Lu Wei, Luu Rachel K., Buehler Markus J.
- Calibrating Large Language Models With Sample Consistency Lyu Qing, Shridhar Kumar, Malaviya Chaitanya, Zhang Li, Elazar Yanai, Tandon Niket, Apidianaki Marianna, Sachan Mrinmaya, Callison-burch Chris
- Rephrasing The Web: A Recipe For Compute And Data-efficient Language Modeling Maini Pratyush, Seto Skyler, Bai He, Grangier David, Zhang Yizhe, Jaitly Navdeep
- Large Language Models: A Survey Minaee Shervin, Mikolov Tomas, Nikzad Narjes, Chenaghlu Meysam, Socher Richard, Amatriain Xavier, Gao Jianfeng
- Empirical Guidelines For Deploying Llms Onto Resource-constrained Edge Devices Qin Ruiyang, Liu Dancheng, Yan Zheyu, Tan Zhaoxuan, Pan Zixuan, Jia Zhenge, Jiang Meng, Abbasi Ahmed, Xiong Jinjun, Shi Yiyu
- GSR-BENCH: A Benchmark For Grounded Spatial Reasoning Evaluation Via Multimodal Llms Rajabi Navid, Kosecka Jana
- Do Large Language Models Mirror Cognitive Language Processing? Ren Yuqi, Jin Renren, Zhang Tongxuan, Xiong Deyi
- The Future Of Large Language Model Pre-training Is Federated Sani Lorenzo, Iacob Alex, Cao Zeyu, Marino Bill, Gao Yan, Paulik Tomas, Zhao Wanru, Shen William F., Aleksandrov Preslav, Qiu Xinchi, Lane Nicholas D.
- Scaling Laws For Linear Complexity Language Models Shen Xuyang, Li Dong, Leng Ruitao, Qin Zhen, Sun Weigao, Zhong Yiran
- Scaling Laws With Vocabulary: Larger Models Deserve Larger Vocabularies Tao Chaofan, Liu Qian, Dou Longxu, Muennighoff Niklas, Wan Zhongwei, Luo Ping, Lin Min, Wong Ngai
- UCCIX: Irish-excellence Large Language Model Tran Khanh-tung, O'sullivan Barry, Nguyen Hoang D.
- Temporal Scaling Law For Large Language Models Xiong Yizhe, Chen Xiansheng, Ye Xin, Chen Hui, Lin Zijia, Lian Haoran, Su Zhenpeng, Niu Jianwei, Ding Guiguang
- CSRT: Evaluation And Analysis Of Llms Using Code-switching Red-teaming Dataset Yoo Haneul, Yang Yongjin, Lee Hwaran
- Towards A Holistic Evaluation Of Llms On Factual Knowledge Recall Yuan Jiaqing, Pan Lin, Hang Chung-wei, Guo Jiang, Jiang Jiarong, Min Bonan, Ng Patrick, Wang Zhiguo
- Skywork-math: Data Scaling Laws For Mathematical Reasoning In Large Language Models -- The Story Goes On Zeng Liang, Zhong Liangjun, Zhao Liang, Wei Tianwen, Yang Liu, He Jujie, Cheng Cheng, Hu Rui, Liu Yang, Yan Shuicheng, Fang Han, Zhou Yahui
- When Scaling Meets LLM Finetuning: The Effect Of Data, Model And Finetuning Method Zhang Biao, Liu Zhongtao, Cherry Colin, Firat Orhan
- Scalable Matmul-free Language Modeling Zhu Rui-jie, Zhang Yu, Sifferman Ethan, Sheaves Tyler, Wang Yiqiao, Richmond Dustin, Zhou Peng, Eshraghian Jason K.
🏷 Security
- Reinforced Mnemonic Reader For Machine Reading Comprehension Hu Minghao, Peng Yuxing, Huang Zhen, Qiu Xipeng, Wei Furu, Zhou Ming
- Best Of Both Worlds: Transferring Knowledge From Discriminative Learning To A Generative Visual Dialog Model Lu Jiasen, Kannan Anitha, Yang Jianwei, Parikh Devi, Batra Dhruv
- Are You Talking To Me? Reasoned Visual Dialog Generation Through Adversarial Learning Wu Qi, Wang Peng, Shen Chunhua, Reid Ian, Hengel Anton Van Den
- Behavior Analysis Of NLI Models: Uncovering The Influence Of Three Factors On Robustness Carmona Vicente Ivan Sanchez, Mitchell Jeff, Riedel Sebastian
- SALSA-TEXT: Self Attentive Latent Space Based Adversarial Text Generation Gagnon-marchand Jules, Sadeghi Hamed, Haidar Md. Akmal, Rezagholizadeh Mehdi
- Neural Machine Translation With Adequacy-oriented Learning Kong Xiang, Tu Zhaopeng, Shi Shuming, Hovy Eduard, Zhang Tong
- Retrieval-enhanced Adversarial Training For Neural Response Generation Zhu Qingfu, Cui Lei, Zhang Weinan, Wei Furu, Liu Ting
- On The Use Of BERT For Neural Machine Translation Clinchant Stéphane, Jung Kweon Woo, Nikoulina Vassilina
- MRQA 2019 Shared Task: Evaluating Generalization In Reading Comprehension Fisch Adam, Talmor Alon, Jia Robin, Seo Minjoon, Choi Eunsol, Chen Danqi
- Zero-shot Paraphrase Generation With Multilingual Language Models Guo Yinpeng, Liao Yi, Jiang Xin, Zhang Qing, Zhang Yibo, Liu Qun
- Good, Better, Best: Textual Distractors Generation For Multiple-choice Visual Question Answering Via Reinforcement Learning Lu Jiaying, Ye Xin, Ren Yi, Yang Yezhou
- Adversarial Bootstrapping For Dialogue Model Training Olabiyi Oluwatobi, Mueller Erik T., Larson Christopher, Lahlou Tarek
- Thisiscompetition At Semeval-2019 Task 9: BERT Is Unstable For Out-of-domain Samples Park Cheoneum, Kim Juae, Lee Hyeon-gu, Amplayo Reinald Kim, Kim Harksoo, Seo Jungyun, Lee Changki
- What Does BERT Learn From Multiple-choice Reading Comprehension Datasets? Si Chenglei, Wang Shuohang, Kan Min-yen, Jiang Jing
- Insertion Transformer: Flexible Sequence Generation Via Insertion Operations Stern Mitchell, Chan William, Kiros Jamie, Uszkoreit Jakob
- Allennlp Interpret: A Framework For Explaining Predictions Of NLP Models Wallace Eric, Tuyls Jens, Wang Junlin, Subramanian Sanjay, Gardner Matt, Singh Sameer
- Attention Is Not Not Explanation Wiegreffe Sarah, Pinter Yuval
- Bertscore: Evaluating Text Generation With BERT Zhang Tianyi, Kishore Varsha, Wu Felix, Weinberger Kilian Q., Artzi Yoav
- Stress Test Evaluation Of Transformer-based Models In Natural Language Understanding Tasks Aspillaga Carlos, Carvallo Andrés, Araujo Vladimir
- Logical Natural Language Generation From Open-domain Tables Chen Wenhu, Chen Jianshu, Su Yu, Chen Zhiyu, Wang William Yang
- EFSG: Evolutionary Fooling Sentences Generator Di Giovanni Marco, Brambilla Marco
- IQ-VQA: Intelligent Visual Question Answering Goel Vatsal, Chandak Mohit, Anand Ashish, Guha Prithwijit
- Are Neural Open-domain Dialog Systems Robust To Speech Recognition Errors In The Dialog History? An Empirical Study Gopalakrishnan Karthik, Hedayatnia Behnam, Wang Longshaokan, Liu Yang, Hakkani-tur Dilek
- Pretrained Transformers Improve Out-of-distribution Robustness Hendrycks Dan, Liu Xiaoyuan, Wallace Eric, Dziedzic Adam, Krishnan Rishabh, Song Dawn
- Contrastive Code Representation Learning Jain Paras, Jain Ajay, Zhang Tianjun, Abbeel Pieter, Gonzalez Joseph E., Stoica Ion
- A Closer Look At The Robustness Of Vision-and-language Pre-trained Models Li Linjie, Gan Zhe, Liu Jingjing
- TIME: Text And Image Mutual-translation Adversarial Networks Liu Bingchen, Song Kunpeng, Zhu Yizhe, De Melo Gerard, Elgammal Ahmed
- Adversarial Training For Large Neural Language Models Liu Xiaodong, Cheng Hao, He Pengcheng, Chen Weizhu, Wang Yu, Poon Hoifung, Gao Jianfeng
- Simplifying Paragraph-level Question Generation Via Transformer Language Models Lopez Luis Enrico, Cruz Diane Kathryn, Cruz Jan Christian Blaise, Cheng Charibeth
- Charbert: Character-aware Pre-trained Language Model Ma Wentao, Cui Yiming, Si Chenglei, Liu Ting, Wang Shijin, Hu Guoping
- What Does BERT Know About Books, Movies And Music? Probing BERT For Conversational Recommendation Penha Gustavo, Hauff Claudia
- Data-efficient Methods For Dialogue Systems Shalyminov Igor
- A Simple But Tough-to-beat Data Augmentation Approach For Natural Language Understanding And Generation Shen Dinghan, Zheng Mingzhi, Shen Yelong, Qu Yanru, Chen Weizhu
- An Empirical Study On Robustness To Spurious Correlations Using Pre-trained Language Models Tu Lifu, Lalwani Garima, Gella Spandana, He He
- Infobert: Improving Robustness Of Language Models From An Information Theoretic Perspective Wang Boxin, Wang Shuohang, Cheng Yu, Gan Zhe, Jia Ruoxi, Li Bo, Liu Jingjing
- SRQA: Synthetic Reader For Factoid Question Answering Wang Jiuniu, Xu Wenjia, Fu Xingyu, Wei Yang, Jin Li, Chen Ziyan, Xu Guangluan, Wu Yirong
- Improving Adversarial Text Generation By Modeling The Distant Future Zhang Ruiyi, Chen Changyou, Gan Zhe, Wang Wenlin, Shen Dinghan, Wang Guoyin, Wen Zheng, Carin Lawrence
- Trojaning Language Models For Fun And Profit Zhang Xinyang, Zhang Zheng, Ji Shouling, Wang Ting
- Efficient Inference For Multilingual Neural Machine Translation Berard Alexandre, Lee Dain, Clinchant Stéphane, Jung Kweonwoo, Nikoulina Vassilina
- Latr: Layout-aware Transformer For Scene-text VQA Biten Ali Furkan, Litman Ron, Xie Yusheng, Appalaraju Srikar, Manmatha R.
- The Power Of Scale For Parameter-efficient Prompt Tuning Brian Lester, Rami Al-rfou, Noah Constant
- On The Prunability Of Attention Heads In Multilingual BERT Budhraja Aakriti, Pande Madhura, Kumar Pratyush, Khapra Mitesh M.
- Multilingual Multi-aspect Explainability Analyses On Machine Reading Comprehension Models Cui Yiming, Zhang Wei-nan, Che Wanxiang, Liu Ting, Chen Zhigang, Wang Shijin
- Robustness Challenges In Model Distillation And Pruning For Natural Language Understanding Du Mengnan, Mukherjee Subhabrata, Cheng Yu, Shokouhi Milad, Hu Xia, Awadallah Ahmed Hassan
- Generative Context Pair Selection For Multi-hop Question Answering Dua Dheeru, Santos Cicero Nogueira Dos, Ng Patrick, Athiwaratkun Ben, Xiang Bing, Gardner Matt, Singh Sameer
- An Adversarially-learned Turing Test For Dialog Generation Models Gao Xiang, Zhang Yizhe, Galley Michel, Dolan Bill
- Towards Trustworthy Deception Detection: Benchmarking Model Robustness Across Domains, Modalities, And Languages Glenski Maria, Ayton Ellyn, Cosbey Robin, Arendt Dustin, Volkova Svitlana
- Xeroalign: Zero-shot Cross-lingual Transformer Alignment Gritta Milan, Iacobacci Ignacio
- WARP: Word-level Adversarial Reprogramming Hambardzumyan Karen, Khachatrian Hrant, May Jonathan
- Investigating Transfer Learning In Multilingual Pre-trained Language Models Through Chinese Natural Language Inference Hu Hai, Zhou He, Tian Zuoyu, Zhang Yiwen, Ma Yina, Li Yanting, Nie Yixin, Richardson Kyle
- L-verse: Bidirectional Generation Between Image And Text Kim Taehoon, Song Gwangmo, Lee Sihaeng, Kim Sangyun, Seo Yewon, Lee Soonyoung, Kim Seung Hwan, Lee Honglak, Bae Kyunghoon
- Evaluating Large Language Models Trained On Code Mark Chen, Jerry Tworek, Heewoo Jun, Qiming Yuan, Henrique Ponde De Oliveira Pinto, Jared Kaplan, Harri Edwards, Yuri Burda, Nicholas Joseph, Greg Brockman, Alex Ray, Raul Puri, Gretchen Krueger, Michael Petrov, Heidy Khlaaf, Girish Sastry, Pamela Mishkin, Brooke Chan, Scott Gray, Nick Ryder, Mikhail Pavlov, Alethea Power, Lukasz Kaiser, Mohammad Bavarian, Clemens Winter, Philippe Tillet, Felipe Petroski Such, Dave Cummings, Matthias Plappert, Fotios Chantzis, Elizabeth Barnes, Ariel Herbert-voss, William Hebgen Guss, Alex Nichol, Alex Paino, Nikolas Tezak, Jie Tang, Igor Babuschkin, Suchir Balaji, Shantanu Jain, William Saunders, Christopher Hesse, Andrew N. Carr, Jan Leike, Josh Achiam, Vedant Misra, Evan Morikawa, Alec Radford, Matthew Knight, Miles Brundage, Mira Murati, Katie Mayer, Peter Welinder, Bob Mcgrew, Dario Amodei, Sam Mccandlish, Ilya Sutskever, Wojciech Zaremba
- Exploring Low-cost Transformer Model Compression For Large-scale Commercial Reply Suggestions Shrivastava Vaishnavi, Gaonkar Radhika, Gupta Shashank, Jha Abhishek
- ERNIE 3.0 Titan: Exploring Larger-scale Knowledge Enhanced Pre-training For Language Understanding And Generation Shuohuan Wang, Yu Sun, Yang Xiang, Zhihua Wu, Siyu Ding, Weibao Gong, Shikun Feng, Junyuan Shang, Yanbin Zhao, Chao Pang, Jiaxiang Liu, Xuyi Chen, Yuxiang Lu, Weixin Liu, Xi Wang, Yangfan Bai, Qiuliang Chen, Li Zhao, Shiyong Li, Peng Sun, Dianhai Yu, Yanjun Ma, Hao Tian, Hua Wu, Tian Wu, Wei Zeng, Ge Li, Wen Gao, Haifeng Wang
- Convfit: Conversational Fine-tuning Of Pretrained Language Models Vulić Ivan, Su Pei-hao, Coope Sam, Gerz Daniela, Budzianowski Paweł, Casanueva Iñigo, Mrkšić Nikola, Wen Tsung-hsien
- On Explaining Your Explanations Of BERT: An Empirical Study With Sequence Classification Wu Zhengxuan, Ong Desmond C.
- On The Interplay Between Fine-tuning And Composition In Transformers Yu Lang, Ettinger Allyson
- Language Bias In Visual Question Answering: A Survey And Taxonomy Yuan Desen
- Despite "super-human" Performance, Current Llms Are Unsuited For Decisions About Ethics And Safety Albrecht Joshua, Kitanidis Ellie, Fetterman Abraham J.
- Improving Alignment Of Dialogue Agents Via Targeted Human Judgements Amelia Glaese, Nat Mcaleese, Maja Trębacz, John Aslanides, Vlad Firoiu, Timo Ewalds, Maribeth Rauh, Laura Weidinger, Martin Chadwick, Phoebe Thacker, Lucy Campbell-gillingham, Jonathan Uesato, Po-sen Huang, Ramona Comanescu, Fan Yang, Abigail See, Sumanth Dathathri, Rory Greig, Charlie Chen, Doug Fritz, Jaume Sanchez Elias, Richard Green, Soňa Mokrá, Nicholas Fernando, Boxi Wu, Rachel Foley, Susannah Young, Iason Gabriel, William Isaac, John Mellor, Demis Hassabis, Koray Kavukcuoglu, Lisa Anne Hendricks, Geoffrey Irving
- Evaluating The Susceptibility Of Pre-trained Language Models Via Handcrafted Adversarial Examples Branch Hezekiah J., Cefalu Jonathan Rodriguez, Mchugh Jeremy, Hujer Leyla, Bahl Aditya, Iglesias Daniel Del Castillo, Heichman Ron, Darwishi Ramesh
- Robust Preference Learning For Storytelling Via Contrastive Reinforcement Learning Castricato Louis, Havrilla Alexander, Matiana Shahbuland, Pieler Michael, Ye Anbang, Yang Ian, Frazier Spencer, Riedl Mark
- Imputing Out-of-vocabulary Embeddings With LOVE Makes Language Models Robust With Little Cost Chen Lihu, Varoquaux Gaël, Suchanek Fabian M.
- Black-box Prompt Learning For Pre-trained Language Models Diao Shizhe, Huang Zhichao, Xu Ruijia, Li Xuechun, Lin Yong, Zhou Xiao, Zhang Tong
- Shortcut Learning Of Large Language Models In Natural Language Understanding Du Mengnan, He Fengxiang, Zou Na, Tao Dacheng, Hu Xia
- Llmeffichecker: Understanding And Testing Efficiency Degradation Of Large Language Models Feng Xiaoning, Han Xiaohong, Chen Simin, Yang Wei
- Complexity-based Prompting For Multi-step Reasoning Fu Yao, Peng Hao, Sabharwal Ashish, Clark Peter, Khot Tushar
- Checks And Strategies For Enabling Code-switched Machine Translation Gowda Thamme, Gheini Mozhdeh, May Jonathan
- Robust Dialogue State Tracking With Weak Supervision And Sparse Data Heck Michael, Lubis Nurul, Van Niekerk Carel, Feng Shutong, Geishauser Christian, Lin Hsien-chin, Gašić Milica
- XDBERT: Distilling Visual Information To BERT From Cross-modal Systems To Improve Language Understanding Hsu Chan-jan, Lee Hung-yi, Tsao Yu
- Gpt-neo For Commonsense Reasoning -- A Theoretical And Practical Lens Kashyap Rohan, Kashyap Vivek, P. Narendra C.
- Emergent Linguistic Structures In Neural Networks Are Fragile La Malfa Emanuele, Wicker Matthew, Kwiatkowska Marta
- Guiding Visual Question Answering With Attention Priors Le Thao Minh, Le Vuong, Gupta Sunil, Venkatesh Svetha, Tran Truyen
- Large Language Models With Controllable Working Memory Li Daliang, Rawat Ankit Singh, Zaheer Manzil, Wang Xin, Lukasik Michal, Veit Andreas, Yu Felix, Kumar Sanjiv
- A Win-win Deal: Towards Sparse And Robust Pre-trained Language Models Liu Yuanxin, Meng Fandong, Lin Zheng, Li Jiangnan, Fu Peng, Cao Yanan, Wang Weiping, Zhou Jie
- Prompt Generation Networks For Input-space Adaptation Of Frozen Vision Transformers Loedeman Jochem, Stol Maarten C., Han Tengda, Asano Yuki M.
- Quick Starting Dialog Systems With Paraphrase Generation Marceau Louis, Belbahar Raouf, Queudot Marc, Naji Nada, Charton Eric, Meurs Marie-jean
- Chatbots In A Botnet World Mckee Forrest, Noever David
- Task Formulation Matters When Learning Continually: A Case Study In Visual Question Answering Nikandrou Mavina, Yu Lu, Suglia Alessandro, Konstas Ioannis, Rieser Verena
- Holistic Evaluation Of Language Models Percy Liang, Rishi Bommasani, Tony Lee, Dimitris Tsipras, Dilara Soylu, Michihiro Yasunaga, Yian Zhang, Deepak Narayanan, Yuhuai Wu, Ananya Kumar, Benjamin Newman, Binhang Yuan, Bobby Yan, Ce Zhang, Christian Cosgrove, Christopher D. Manning, Christopher Ré, Diana Acosta-navas, Drew A. Hudson, Eric Zelikman, Esin Durmus, Faisal Ladhak, Frieda Rong, Hongyu Ren, Huaxiu Yao, Jue Wang, Keshav Santhanam, Laurel Orr, Lucia Zheng, Mert Yuksekgonul, Mirac Suzgun, Nathan Kim, Neel Guha, Niladri Chatterji, Omar Khattab, Peter Henderson, Qian Huang, Ryan Chi, Sang Michael Xie, Shibani Santurkar, Surya Ganguli, Tatsunori Hashimoto, Thomas Icard, Tianyi Zhang, Vishrav Chaudhary, William Wang, Xuechen Li, Yifan Mai, Yuhui Zhang, Yuta Koreeda
- Ignore Previous Prompt: Attack Techniques For Language Models Perez Fábio, Ribeiro Ian
- Answer-me: Multi-task Open-vocabulary Visual Question Answering Piergiovanni Aj, Li Wei, Kuo Weicheng, Saffar Mohammad, Bertsch Fred, Angelova Anelia
- An Understanding-oriented Robust Machine Reading Comprehension Model Ren Feiliang, Liu Yongkang, Li Bochao, Liu Shilei, Wang Bingchao, Wang Jiaqi, Liu Chunchao, Ma Qi
- NLX-GPT: A Model For Natural Language Explanations In Vision And Vision-language Tasks Sammani Fawaz, Mukherjee Tanmoy, Deligiannis Nikos
- TAPE: Assessing Few-shot Russian Language Understanding Taktasheva Ekaterina, Shavrina Tatiana, Fenogenova Alena, Shevelev Denis, Katricheva Nadezhda, Tikhonova Maria, Akhmetgareeva Albina, Zinkevich Oleg, Bashmakova Anastasiia, Iordanskaia Svetlana, Spiridonova Alena, Kurenshchikova Valentina, Artemova Ekaterina, Mikhailov Vladislav
- Commonsenseqa 2.0: Exposing The Limits Of AI Through Gamification Talmor Alon, Yoran Ori, Bras Ronan Le, Bhagavatula Chandra, Goldberg Yoav, Choi Yejin, Berant Jonathan
- Learning To Decompose Visual Features With Latent Textual Prompts Wang Feng, Li Manling, Lin Xudong, Lv Hairong, Schwing Alexander G., Ji Heng
- Rethinking Textual Adversarial Defense For Pre-trained Language Models Wang Jiayi, Bao Rongzhou, Zhang Zhuosheng, Zhao Hai
- Hybrid-regressive Neural Machine Translation Wang Qiang, Hu Xinhui, Chen Ming
- Understanding And Improving Sequence-to-sequence Pretraining For Neural Machine Translation Wang Wenxuan, Jiao Wenxiang, Hao Yongchang, Wang Xing, Shi Shuming, Tu Zhaopeng, Lyu Michael
- Qrelscore: Better Evaluating Generated Questions With Deeper Understanding Of Context-aware Relevance Wang Xiaoqiang, Liu Bang, Tang Siliang, Wu Lingfei
- Adversarial Self-attention For Language Understanding Wu Hongqiu, Ding Ruixue, Zhao Hai, Xie Pengjun, Huang Fei, Zhang Min
- FRSUM: Towards Faithful Abstractive Summarization Via Enhancing Factual Robustness Wu Wenhao, Li Wei, Liu Jiachen, Xiao Xinyan, Cao Ziqiang, Li Sujian, Wu Hua
- Precisely The Point: Adversarial Augmentations For Faithful And Informative Text Generation Wu Wenhao, Li Wei, Liu Jiachen, Xiao Xinyan, Li Sujian, Lyu Yajuan
- GLUE-X: Evaluating Natural Language Understanding Models From An Out-of-distribution Generalization Perspective Yang Linyi, Zhang Shuibai, Qin Libo, Li Yafu, Wang Yidong, Liu Hanmeng, Wang Jindong, Xie Xing, Zhang Yue
- Building Knowledge-grounded Dialogue Systems With Graph-based Semantic Modeling Yang Yizhe, Huang Heyan, Gao Yang, Li Jiawei
- Can Pretrained Language Models (yet) Reason Deductively? Yuan Zhangdie, Hu Songbo, Vulić Ivan, Korhonen Anna, Meng Zaiqiao
- Scala: Accelerating Adaptation Of Pre-trained Transformer-based Language Models Via Efficient Large-batch Adversarial Noise Zhang Minjia, Naresh Niranjan Uma, He Yuxiong
- Unveiling Transformers With LEGO: A Synthetic Reasoning Task Zhang Yi, Backurs Arturs, Bubeck Sébastien, Eldan Ronen, Gunasekar Suriya, Wagner Tal
- Binding Language Models In Symbolic Languages Zhoujun Cheng, Tianbao Xie, Peng Shi, Chengzu Li, Rahul Nadkarni, Yushi Hu, Caiming Xiong, Dragomir Radev, Mari Ostendorf, Luke Zettlemoyer, Noah A. Smith, Tao Yu
- St-moe: Designing Stable And Transferable Sparse Expert Models Zoph Barret, Bello Irwan, Kumar Sameer, Du Nan, Huang Yanping, Dean Jeff, Shazeer Noam, Fedus William
- Methods For Estimating And Improving Robustness Of Language Models Štefánik Michal
- On The Relationship Between Skill Neurons And Robustness In Prompt Tuning Ackermann Leon, Ohmer Xenia
- The (ab)use Of Open Source Code To Train Large Language Models Al-kaswan Ali, Izadi Maliheh
- Detecting Language Model Attacks With Perplexity Alon Gabriel, Kamfonas Michael
- Does Deep Learning Learn To Abstract? A Systematic Probing Framework An Shengnan, Lin Zeqi, Chen Bei, Fu Qiang, Zheng Nanning, Lou Jian-guang
- Real-time Visual Feedback To Guide Benchmark Creation: A Human-and-metric-in-the-loop Workflow Arunkumar Anjana, Mishra Swaroop, Sachdeva Bhavdeep, Baral Chitta, Bryan Chris
- Abusing Images And Sounds For Indirect Instruction Injection In Multi-modal Llms Bagdasaryan Eugene, Hsieh Tsung-yin, Nassi Ben, Shmatikov Vitaly
- Tamil-llama: A New Tamil Language Model Based On Llama 2 Balachandran Abhinand
- Assessing And Enhancing The Robustness Of Large Language Models With Task Structure Variations For Logical Reasoning Bao Qiming, Gendron Gael, Peng Alex Yuxuan, Zhong Wanjun, Tan Neset, Chen Yang, Witbrock Michael, Liu Jiamou
- Self-consistency Of Large Language Models Under Ambiguity Bartsch Henning, Jorgensen Ole, Rosati Domenic, Hoelscher-obermaier Jason, Pfau Jacob
- Span-selective Linear Attention Transformers For Effective And Robust Schema-guided Dialogue State Tracking Bebensee Björn, Lee Haejun
- Sensitivity, Performance, Robustness: Deconstructing The Effect Of Sociodemographic Prompting Beck Tilman, Schuff Hendrik, Lauscher Anne, Gurevych Iryna
- Pre-training Llms Using Human-like Development Data Corpus Bhardwaj Khushi, Shah Raj Sanjay, Varma Sashank
- Language Model Unalignment: Parametric Red-teaming To Expose Hidden Harms And Biases Bhardwaj Rishabh, Poria Soujanya
- Model Leeching: An Extraction Attack Targeting Llms Birch Lewis, Hackett William, Trawicki Stefan, Suri Neeraj, Garraghan Peter
- How Close Is Chatgpt To Human Experts? Comparison Corpus, Evaluation, And Detection Biyang Guo, Xin Zhang, Ziyuan Wang, Minqi Jiang, Jinran Nie, Yuxuan Ding, Jianwei Yue, Yupeng Wu
- Personality Testing Of Large Language Models: Limited Temporal Stability, But Highlighted Prosociality Bodroza Bojana, Dinic Bojana M., Bojic Ljubisa
- Ask Language Model To Clean Your Noisy Translation Data Bolding Quinten, Liao Baohao, Denis Brandon James, Luo Jun, Monz Christof
- A Categorical Archive Of Chatgpt Failures Borji Ali
- Automatic Construction Of A Korean Toxic Instruction Dataset For Ethical Tuning Of Large Language Models Byun Sungjoo, Jang Dongjun, Jo Hyemi, Shin Hyopil
- Pokemonchat: Auditing Chatgpt For Pokémon Universe Knowledge Cabello Laura, Li Jiaang, Chalkidis Ilias
- CLAP: Isolating Content From Style Through Contrastive Learning With Augmented Prompts Cai Yichao, Liu Yuhang, Zhang Zhen, Shi Javen Qinfeng
- Can Transformers Learn Sequential Function Classes In Context? Campbell Ryan, Guo Emma, Hu Evan, Vir Reya, Hsiao Ethan
- Defending Against Alignment-breaking Attacks Via Robustly Aligned LLM Cao Bochuan, Cao Yuanpu, Lin Lu, Chen Jinghui
- Systematic Rectification Of Language Models Via Dead-end Analysis Cao Meng, Fatemi Mehdi, Cheung Jackie Chi Kit, Shabanian Samira
- A LLM Assisted Exploitation Of Ai-guardian Carlini Nicholas
- Chatgpt To Replace Crowdsourcing Of Paraphrases For Intent Classification: Higher Diversity And Comparable Model Robustness Cegin Jan, Simko Jakub, Brusilovsky Peter
- Which Spurious Correlations Impact Reasoning In NLI Models? A Visual Interactive Diagnosis Through Data-constrained Counterfactuals Chan Robin, Amini Afra, El-assady Mennatallah
- Fireact: Toward Language Agent Fine-tuning Chen Baian, Shu Chang, Shareghi Ehsan, Collier Nigel, Narasimhan Karthik, Yao Shunyu
- Jailbreaker In Jail: Moving Target Defense For Large Language Models Chen Bocheng, Paliwal Advait, Yan Qiben
- Interpreting And Controlling Vision Foundation Models Via Text Explanations Chen Haozhe, Yang Junfeng, Vondrick Carl, Mao Chengzhi
- Benchmarking Large Language Models In Retrieval-augmented Generation Chen Jiawei, Lin Hongyu, Han Xianpei, Sun Le
- Epi-curriculum: Episodic Curriculum Learning For Low-resource Domain Adaptation In Neural Machine Translation Chen Keyu, Zhuang Di, Li Mingchen, Chang J. Morris
- Enhancing Robot Program Synthesis Through Environmental Context Chen Tianyi, Wang Qidi, Dong Zhen, Shen Liwei, Peng Xin
- The Janus Interface: How Fine-tuning In Large Language Models Amplifies The Privacy Risks Chen Xiaoyi, Tang Siyuan, Zhu Rui, Yan Shijun, Jin Lei, Wang Zihao, Su Liya, Zhang Zhikun, Wang Xiaofeng, Tang Haixu
- Autoeval-video: An Automatic Benchmark For Assessing Large Vision Language Models In Open-ended Video Question Answering Chen Xiuyuan, Lin Yuan, Zhang Yuchen, Huang Weiran
- Adversarial Preference Optimization: Enhancing Your Alignment Via RM-LLM Game Cheng Pengyu, Yang Yifan, Li Jian, Dai Yong, Hu Tianhao, Cao Peixin, Du Nan, Li Xiaolong
- SCALE: Synergized Collaboration Of Asymmetric Language Translation Engines Cheng Xin, Wang Xun, Ge Tao, Chen Si-qing, Wei Furu, Zhao Dongyan, Yan Rui
- Can Large Language Models Be An Alternative To Human Evaluations? Chiang Cheng-han, Lee Hung-yi
- Generative Data Augmentation Using Llms Improves Distributional Robustness In Question Answering Chowdhury Arijit Ghosh, Chadha Aman
- Harnessing The Power Of Adversarial Prompting And Large Language Models For Robust Hypothesis Generation In Astronomy Ciucă Ioana, Ting Yuan-sen, Kruk Sandor, Iyer Kartheik
- BERT Lost Patience Won't Be Robust To Adversarial Slowdown Coalson Zachary, Ritter Gabriel, Bobba Rakesh, Hong Sanghyun
- Dr. Jekyll And Mr. Hyde: Two Faces Of Llms Collu Matteo Gioele, Janssen-groesbeek Tom, Koffas Stefanos, Conti Mauro, Picek Stjepan
- Seamlessm4t: Massively Multilingual & Multimodal Machine Translation Communication Seamless, Barrault Loïc, Chung Yu-an, Meglioli Mariano Cora, Dale David, Dong Ning, Duquenne Paul-ambroise, Elsahar Hady, Gong Hongyu, Heffernan Kevin, Hoffman John, Klaiber Christopher, Li Pengwei, Licht Daniel, Maillard Jean, Rakotoarison Alice, Sadagopan Kaushik Ram, Wenzek Guillaume, Ye Ethan, Akula Bapi, Chen Peng-jen, Hachem Naji El, Ellis Brian, Gonzalez Gabriel Mejia, Haaheim Justin, Hansanti Prangthip, Howes Russ, Huang Bernie, Hwang Min-jae, Inaguma Hirofumi, Jain Somya, Kalbassi Elahe, Kallet Amanda, Kulikov Ilia, Lam Janice, Li Daniel, Ma Xutai, Mavlyutov Ruslan, Peloquin Benjamin, Ramadan Mohamed, Ramakrishnan Abinesh, Sun Anna, Tran Kevin, Tran Tuan, Tufanov Igor, Vogeti Vish, Wood Carleigh, Yang Yilin, Yu Bokai, Andrews Pierre, Balioglu Can, Costa-jussà Marta R., Celebi Onur, Elbayad Maha, Gao Cynthia, Guzmán Francisco, Kao Justine, Lee Ann, Mourachko Alexandre, Pino Juan, Popuri Sravya, Ropers Christophe, Saleem Safiyyah, Schwenk Holger, Tomasello Paden, Wang Changhan, Wang Jeff, Wang Skyler
- Robust Infidelity: When Faithfulness Measures On Masked Language Models Are Misleading Crothers Evan, Viktor Herna, Japkowicz Nathalie
- Reinforcement Learning Fine-tuning Of Language Models Is Biased Towards More Extractable Features Cruz Diogo, Pona Edoardo, Holness-tofts Alex, Schmied Elias, Alonso Víctor Abia, Griffin Charlie, Cirstea Bogdan-ionut
- An Adversarial Example For Direct Logit Attribution: Memory Management In Gelu-4l Dao James, Lau Yeu-tong, Rager Can, Janiak Jett
- Complex QA And Language Models Hybrid Architectures, Survey Daull Xavier, Bellot Patrice, Bruno Emmanuel, Martin Vincent, Murisasco Elisabeth
- Competence-based Analysis Of Language Models Davies Adam, Jiang Jize, Zhai Chengxiang
- Scaling Vision Transformers To 22 Billion Parameters Dehghani Mostafa, Djolonga Josip, Mustafa Basil, Padlewski Piotr, Heek Jonathan, Gilmer Justin, Steiner Andreas, Caron Mathilde, Geirhos Robert, Alabdulmohsin Ibrahim, Jenatton Rodolphe, Beyer Lucas, Tschannen Michael, Arnab Anurag, Wang Xiao, Riquelme Carlos, Minderer Matthias, Puigcerver Joan, Evci Utku, Kumar Manoj, Van Steenkiste Sjoerd, Elsayed Gamaleldin F., Mahendran Aravindh, Yu Fisher, Oliver Avital, Huot Fantine, Bastings Jasmijn, Collier Mark Patrick, Gritsenko Alexey, Birodkar Vighnesh, Vasconcelos Cristina, Tay Yi, Mensink Thomas, Kolesnikov Alexander, Pavetić Filip, Tran Dustin, Kipf Thomas, Lučić Mario, Zhai Xiaohua, Keysers Daniel, Harmsen Jeremiah, Houlsby Neil
- Masterkey: Automated Jailbreak Across Multiple Large Language Model Chatbots Deng Gelei, Liu Yi, Li Yuekang, Wang Kailong, Zhang Ying, Li Zefeng, Wang Haoyu, Zhang Tianwei, Liu Yang
- Multilingual Jailbreak Challenges In Large Language Models Deng Yue, Zhang Wenxuan, Pan Sinno Jialin, Bing Lidong
- A Wolf In Sheep's Clothing: Generalized Nested Jailbreak Prompts Can Fool Large Language Models Easily Ding Peng, Kuang Jun, Ma Dan, Cao Xuezhi, Xian Yunsen, Chen Jiajun, Huang Shujian
- Prompt Optimization Via Adversarial In-context Learning Do Xuan Long, Zhao Yiran, Brown Hannah, Xie Yuxi, Zhao James Xu, Chen Nancy F., Kawaguchi Kenji, Shieh Michael, He Junxian
- Revisit Input Perturbation Problems For Llms: A Unified Robustness Evaluation Framework For Noisy Slot Filling Task Dong Guanting, Zhao Jinxu, Hui Tingfeng, Guo Daichi, Wan Wenlong, Feng Boqi, Qiu Yueyan, Gongque Zhuoma, He Keqing, Wang Zechen, Xu Weiran
- The Philosopher's Stone: Trojaning Plugins Of Large Language Models Dong Tian, Xue Minhui, Chen Guoxing, Holland Rayne, Meng Yan, Li Shaofeng, Liu Zhen, Zhu Haojin
- Proximal Policy Optimization Actual Combat: Manipulating Output Tokenizer Length Fan Miao, Hu Chen, Zhou Shuchang
- Improving Factual Consistency Of Text Summarization By Adversarially Decoupling Comprehension And Embellishment Abilities Of Llms Feng Huawen, Fan Yan, Liu Xiong, Lin Ting-en, Yao Zekun, Wu Yuchuan, Huang Fei, Li Yongbin, Ma Qianli
- Misusing Tools In Large Language Models With Visual Adversarial Examples Fu Xiaohan, Wang Zihan, Li Shuheng, Gupta Rajesh K., Mireshghallah Niloofar, Berg-kirkpatrick Taylor, Fernandes Earlence
- Safety Alignment In NLP Tasks: Weakly Aligned Summarization As An In-context Attack Fu Yu, Li Yufei, Xiao Wen, Liu Cong, Dong Yue
- Sensitivity And Robustness Of Large Language Models To Prompt Template In Japanese Text Classification Tasks Gan Chengguang, Mori Tatsunori
- CLIPAG: Towards Generator-free Text-to-image Generation Ganz Roy, Elad Michael
- Jsontuning: Towards Generalizable, Robust, And Controllable Instruction Tuning Gao Chang, Zhang Wenxuan, Chen Guizhen, Lam Wai
- MART: Improving LLM Safety With Multi-round Automatic Red-teaming Ge Suyu, Zhou Chunting, Hou Rui, Khabsa Madian, Wang Yi-chia, Wang Qifan, Han Jiawei, Mao Yuning
- Trueteacher: Learning Factual Consistency Evaluation With Large Language Models Gekhman Zorik, Herzig Jonathan, Aharoni Roee, Elkind Chen, Szpektor Idan
- Figstep: Jailbreaking Large Vision-language Models Via Typographic Visual Prompts Gong Yichen, Ran Delong, Liu Jinyuan, Wang Conglei, Cong Tianshuo, Wang Anyu, Duan Sisi, Wang Xiaoyun
- Not What You've Signed Up For: Compromising Real-world Llm-integrated Applications With Indirect Prompt Injection Greshake Kai, Abdelnabi Sahar, Mishra Shailesh, Endres Christoph, Holz Thorsten, Fritz Mario
- Language Models Hallucinate, But May Excel At Fact Verification Guan Jian, Dodge Jesse, Wadden David, Huang Minlie, Peng Hao
- Empowering Working Memory For Large Language Model Agents Guo Jing, Li Nan, Qi Jianchuan, Yang Hang, Li Ruiqiao, Feng Yuzhen, Zhang Si, Xu Ming
- Evaluating Concurrent Robustness Of Language Models Across Diverse Challenge Sets Gupta Vatsal, Pandya Pranshu, Kataria Tushar, Gupta Vivek, Roth Dan
- Is GPT-4 A Reliable Rater? Evaluating Consistency In GPT-4 Text Ratings Hackl Veronika, Müller Alexandra Elena, Granitzer Michael, Sailer Maximilian
- Towards Efficient Vision-language Tuning: More Information Density, More Generalizability Hao Tianxiang, Lyu Mengyao, Chen Hui, Zhao Sicheng, Ding Xiaohan, Han Jungong, Ding Guiguang
- Sok: Memorization In General-purpose Large Language Models Hartmann Valentin, Suri Anshuman, Bindschaedler Vincent, Evans David, Tople Shruti, West Robert
- Spear Phishing With Large Language Models Hazell Julian
- Using Natural Language Explanations To Improve Robustness Of In-context Learning He Xuanli, Wu Yuxiang, Camburu Oana-maria, Minervini Pasquale, Stenetorp Pontus
- How Good Are GPT Models At Machine Translation? A Comprehensive Evaluation Hendy Amr, Abdelrehim Mohamed, Sharaf Amr, Raunak Vikas, Gabr Mohamed, Matsushita Hitokazu, Kim Young Jin, Afify Mohamed, Awadalla Hany Hassan
- Stochastic Parrots Looking For Stochastic Parrots: Llms Are Easy To Fine-tune And Hard To Detect With Other Llms Henrique Da Silva Gameiro, Kucharavy Andrei, Guerraoui Rachid
- Negative Object Presence Evaluation (NOPE) To Measure Object Hallucination In Vision-language Models Holy Lovenia, Wenliang Dai, Samuel Cahyawijaya, Ziwei Ji, Pascale Fung
- Turbulence: Systematically And Automatically Testing Instruction-tuned Large Language Models For Code Honarvar Shahin, Van Der Wilk Mark, Donaldson Alastair
- Do Large Language Models Know About Facts? Hu Xuming, Chen Junzhe, Li Xiaochuan, Guo Yufei, Wen Lijie, Yu Philip S., Guo Zhijiang
- Token-level Adversarial Prompt Detection Based On Perplexity Measures And Contextual Information Hu Zhengmian, Wu Gang, Mitra Saayan, Zhang Ruiyi, Sun Tong, Huang Heng, Swaminathan Viswanathan
- Deploying And Evaluating Llms To Program Service Mobile Robots Hu Zichao, Lucchetti Francesca, Schlesinger Claire, Saxena Yash, Freeman Anders, Modak Sadanand, Guha Arjun, Biswas Joydeep
- Real-time Animation Generation And Control On Rigged Models Via Large Language Models Huang Han, De La Torre Fernanda, Fang Cathy Mengying, Banburski-fahey Andrzej, Amores Judith, Lanier Jaron
- Catastrophic Jailbreak Of Open-source Llms Via Exploiting Generation Huang Yangsibo, Gupta Samyak, Xia Mengzhou, Li Kai, Chen Danqi
- Hijacking Context In Large Multi-modal Models Jeong Joonhyun
- Tailoring Language Generation Models Under Total Variation Distance Ji Haozhe, Ke Pei, Hu Zhipeng, Zhang Rongsheng, Huang Minlie
- Forcing Generative Models To Degenerate Ones: The Power Of Data Poisoning Attacks Jiang Shuli, Kadhe Swanand Ravindra, Zhou Yi, Cai Ling, Baracaldo Nathalie
- Prompt Packer: Deceiving Llms Through Compositional Instruction With Hidden Attacks Jiang Shuyu, Chen Xingshu, Tang Rui
- BRAINTEASER: Lateral Thinking Puzzles For Large Language Models Jiang Yifan, Ilievski Filip, Ma Kaixin, Sourati Zhivar
- Lion: Adversarial Distillation Of Proprietary Large Language Models Jiang Yuxin, Chan Chunkit, Chen Mingyang, Wang Wei
- Evaluation And Enhancement Of Semantic Grounding In Large Vision-language Models Jiaying Lu, Jinmeng Rao, Kezhen Chen, Xiaoyuan Guo, Yawen Zhang, Baochen Sun, Carl Yang, Jie Yang
- A Comprehensive Capability Analysis Of GPT-3 And GPT-3.5 Series Models Junjie Ye, Xuanting Chen, Nuo Xu, Can Zu, Zekai Shao, Shichun Liu, Yuhan Cui, Zeyang Zhou, Chao Gong, Yang Shen, Jie Zhou, Siming Chen, Tao Gui, Qi Zhang, Xuanjing Huang
- Robust Recommender System: A Survey And Future Directions Kaike Zhang, Qi Cao, Fei Sun, Yunfan Wu, Shuchang Tao, Huawei Shen, Xueqi Cheng
- Robust Training For Conversational Question Answering Models With Reinforced Reformulation Generation Kaiser Magdalena, Roy Rishiraj Saha, Weikum Gerhard
- A Survey Of GPT-3 Family Large Language Models Including Chatgpt And GPT-4 Kalyan Katikapalli Subramanyam
- Dialmat: Dialogue-enabled Transformer With Moment-based Adversarial Training Kaneda Kanta, Korekata Ryosuke, Wada Yuiga, Nagashima Shunya, Kambara Motonari, Iioka Yui, Matsuo Haruka, Imai Yuto, Nishimura Takayuki, Sugiura Komei
- Exploiting Programmatic Behavior Of Llms: Dual-use Through Standard Security Attacks Kang Daniel, Li Xuechen, Stoica Ion, Guestrin Carlos, Zaharia Matei, Hashimoto Tatsunori
- Assessing The Promise And Pitfalls Of Chatgpt For Automated Code Generation Khan Muhammad Fawad Akbar, Ramsdell Max, Falor Erik, Karimi Hamid
- Q: How To Specialize Large Vision-language Models To Data-scarce VQA Tasks? A: Self-train On Unlabeled Images! Khan Zaid, Bg Vijay Kumar, Schulter Samuel, Yu Xiang, Fu Yun, Chandraker Manmohan
- How Secure Is Code Generated By Chatgpt? Khoury Raphaël, Avila Anderson R., Brunelle Jacob, Camara Baba Mamadou
- Writing Your Own Book: A Method For Going From Closed To Open Book QA To Improve Robustness And Performance Of Smaller Llms Kokaia Giorgi, Sinha Pratyush, Jiang Yutong, Boujemaa Nozha
- Benchmarking Cognitive Biases In Large Language Models As Evaluators Koo Ryan, Lee Minhwa, Raheja Vipul, Park Jong Inn, Kim Zae Myung, Kang Dongyeop
- Certifying LLM Safety Against Adversarial Prompting Kumar Aounon, Agarwal Chirag, Srinivas Suraj, Li Aaron Jiaxun, Feizi Soheil, Lakkaraju Himabindu
- Hallucination Augmented Recitations For Language Models Köksal Abdullatif, Aksitov Renat, Chang Chung-ching
- Enhancing Transformers Without Self-supervised Learning: A Loss Landscape Perspective In Sequential Recommendation Lai Vivian, Chen Huiyuan, Yeh Chin-chia Michael, Xu Minghua, Cai Yiwei, Yang Hao
- Open Sesame! Universal Black Box Jailbreaking Of Large Language Models Lapid Raz, Langberg Ron, Sipper Moshe
- Making Large Language Models Better Data Creators Lee Dong-ho, Pujara Jay, Sewak Mohit, White Ryen W., Jauhar Sujay Kumar
- Read-only Prompt Optimization For Vision-language Few-shot Learning Lee Dongjun, Song Seokwon, Suh Jihee, Choi Joonmyung, Lee Sanghyeok, Kim Hyunwoo J.
- Lora Fine-tuning Efficiently Undoes Safety Training In Llama 2-chat 70B Lermen Simon, Rogers-smith Charlie, Ladish Jeffrey
- Task Contamination: Language Models May Not Be Few-shot Anymore Li Changmao, Flanigan Jeffrey
- Dialogue For Prompting: A Policy-gradient-based Discrete Prompt Generation For Few-shot Learning Li Chengzhengxu, Liu Xiaoming, Wang Yichen, Li Duyi, Lan Yu, Shen Chao
- Privacy In Large Language Models: Attacks, Defenses And Future Directions Li Haoran, Chen Yulin, Luo Jinglong, Kang Yan, Zhang Xiaojin, Hu Qi, Chan Chunkit, Song Yangqiu
- Overprompt: Enhancing Chatgpt Through Efficient In-context Learning Li Jiazheng, Zhao Runcong, Yang Yongxin, He Yulan, Gui Lin
- White-box Multi-objective Adversarial Attack On Dialogue Generation Li Yufei, Li Zexin, Gao Yingfan, Liu Cong
- GPT Detectors Are Biased Against Non-native English Writers Liang Weixin, Yuksekgonul Mert, Mao Yining, Wu Eric, Zou James
- GPT-4 Enhanced Multimodal Grounding For Autonomous Driving: Leveraging Cross-modal Attention With Large Language Models Liao Haicheng, Shen Huanming, Li Zhenning, Wang Chengyue, Li Guofa, Bie Yiming, Xu Chengzhong
- Dynamic Scheduled Sampling With Imitation Loss For Neural Text Generation Lin Xiang, Jwalapuram Prathyusha, Joty Shafiq
- SPHINX: The Joint Mixing Of Weights, Tasks, And Visual Embeddings For Multi-modal Large Language Models Lin Ziyi, Liu Chris, Zhang Renrui, Gao Peng, Qiu Longtian, Xiao Han, Qiu Han, Lin Chen, Shao Wenqi, Chen Keqin, Han Jiaming, Huang Siyuan, Zhang Yichi, He Xuming, Li Hongsheng, Qiao Yu
- Rethinking Tabular Data Understanding With Large Language Models Liu Tianyang, Wang Fei, Chen Muhao
- Mm-safetybench: A Benchmark For Safety Evaluation Of Multimodal Large Language Models Liu Xin, Zhu Yichen, Gu Jindong, Lan Yunshi, Yang Chao, Qiao Yu
- Trustworthy Llms: A Survey And Guideline For Evaluating Large Language Models' Alignment Liu Yang, Yao Yuanshun, Ton Jean-francois, Zhang Xiaoying, Guo Ruocheng, Cheng Hao, Klochkov Yegor, Taufiq Muhammad Faaiz, Li Hang
- RECALL: A Benchmark For Llms Robustness Against External Counterfactual Knowledge Liu Yi, Huang Lianzhe, Li Shicheng, Chen Sishuo, Zhou Hao, Meng Fandong, Zhou Jie, Sun Xu
- Evaluate What You Can't Evaluate: Unassessable Quality For Generated Response Liu Yongkang, Feng Shi, Wang Daling, Zhang Yifei, Schütze Hinrich
- Robustness Over Time: Understanding Adversarial Examples' Effectiveness On Longitudinal Versions Of Large Language Models Liu Yugeng, Cong Tianshuo, Zhao Zhengyu, Backes Michael, Shen Yun, Zhang Yang
- Self-contradictory Reasoning Evaluation And Detection Liu Ziyi, Lee Isabelle, Du Yongkang, Sanyal Soumya, Zhao Jieyu
- SAIE Framework: Support Alone Isn't Enough -- Advancing LLM Training With Adversarial Remarks Loem Mengsay, Kaneko Masahiro, Okazaki Naoaki
- Large Language Models Can Be Guided To Evade Ai-generated Text Detection Lu Ning, Liu Shengcai, He Rui, Wang Qi, Ong Yew-soon, Tang Ke
- Systematic Assessment Of Factual Knowledge In Large Language Models Luo Linhao, Vu Thuy-trang, Phung Dinh, Haffari Gholamreza
- An Examination Of The Compositionality Of Large Generative Vision-language Models Ma Teli, Li Rong, Liang Junwei
- Large Language Models Know Your Contextual Search Intent: A Prompting Framework For Conversational Search Mao Kelong, Dou Zhicheng, Mo Fengran, Hou Jiewen, Chen Haonan, Qian Hongjin
- JAB: Joint Adversarial Prompting And Belief Augmentation Mehrabi Ninareh, Goyal Palash, Ramakrishna Anil, Dhamala Jwala, Ghosh Shalini, Zemel Richard, Chang Kai-wei, Galstyan Aram, Gupta Rahul
- Simple LLM Prompting Is State-of-the-art For Robust And Multilingual Dialogue Evaluation Mendonça John, Pereira Patrícia, Moniz Helena, Carvalho João Paulo, Lavie Alon, Trancoso Isabel
- Think Twice: Measuring The Efficiency Of Eliminating Prediction Shortcuts Of Question Answering Models Mikula Lukáš, Štefánik Michal, Petrovič Marek, Sojka Petr
- State Of What Art? A Call For Multi-prompt LLM Evaluation Mizrahi Moran, Kaplan Guy, Malkin Dan, Dror Rotem, Shahaf Dafna, Stanovsky Gabriel
- Roll Up Your Sleeves: Working With A Collaborative And Engaging Task-oriented Dialogue System Mo Lingbo, Chen Shijie, Chen Ziru, Deng Xiang, Lewis Ashley, Singh Sunit, Stevens Samuel, Tai Chang-you, Wang Zhen, Yue Xiang, Zhang Tianshu, Su Yu, Sun Huan
- How Trustworthy Are Open-source Llms? An Assessment Under Malicious Demonstrations Shows Their Vulnerabilities Mo Lingbo, Wang Boshi, Chen Muhao, Sun Huan
- Evaluating Cognitive Maps And Planning In Large Language Models With Cogeval Momennejad Ida, Hasanbeig Hosein, Vieira Felipe, Sharma Hiteshi, Ness Robert Osazuwa, Jojic Nebojsa, Palangi Hamid, Larson Jonathan
- Llms Killed The Script Kiddie: How Agents Supported By Large Language Models Change The Landscape Of Network Threat Testing Moskal Stephen, Laney Sam, Hemberg Erik, O'reilly Una-may
- N-critics: Self-refinement Of Large Language Models With Ensemble Of Critics Mousavi Sajad, Gutiérrez Ricardo Luna, Rengarajan Desik, Gundecha Vineet, Babu Ashwin Ramesh, Naug Avisek, Guillen Antonio, Sarkar Soumyendu
- Can Llms Follow Simple Rules? Mu Norman, Chen Sarah, Wang Zifan, Chen Sizhe, Karamardian David, Aljeraisy Lulwa, Alomair Basel, Hendrycks Dan, Wagner David
- Testing Language Model Agents Safely In The Wild Naihin Silen, Atkinson David, Green Marc, Hamadi Merwane, Swift Craig, Schonholtz Douglas, Kalai Adam Tauman, Bau David
- Scalable Extraction Of Training Data From (production) Language Models Nasr Milad, Carlini Nicholas, Hayase Jonathan, Jagielski Matthew, Cooper A. Feder, Ippolito Daphne, Choquette-choo Christopher A., Wallace Eric, Tramèr Florian, Lee Katherine
- Clever Hans Or Neural Theory Of Mind? Stress Testing Social Reasoning In Large Language Models Natalie Shapira, Mosh Levy, Seyed Hossein Alavi, Xuhui Zhou, Yejin Choi, Yoav Goldberg, Maarten Sap, Vered Shwartz
- On Robustness Of Finetuned Transformer-based NLP Models Neerudu Pavan Kalyan Reddy, Oota Subba Reddy, Marreddy Mounika, Kagita Venkateswara Rao, Gupta Manish
- Evaluating AI Vocational Skills Through Professional Testing Noever David, Ciolino Matt
- Adversarial Fine-tuning Of Language Models: An Iterative Optimisation Approach For The Generation And Detection Of Problematic Content O'neill Charles, Miller Jack, Ciuca Ioana, Ting Yuan-sen, Bui Thang
- Blackvip: Black-box Visual Prompting For Robust Transfer Learning Oh Changdae, Hwang Hyeji, Lee Hee-young, Lim Yongtaek, Jung Geunyoung, Jung Jiyoung, Choi Hosik, Song Kyungwoo
- Text Style Transfer Evaluation Using Large Language Models Ostheimer Phil, Nagda Mayank, Kloft Marius, Fellenz Sophie
- Evaluating And Explaining Large Language Models For Code Using Syntactic Structures Palacio David N, Velasco Alejandro, Rodriguez-cardenas Daniel, Moran Kevin, Poshyvanyk Denys
- Entity Matching Using Large Language Models Peeters Ralph, Bizer Christian
- Exploiting Novel GPT-4 Apis Pelrine Kellin, Taufeeque Mohammad, Zając Michał, Mclean Euan, Gleave Adam
- Customising General Large Language Models For Specialised Emotion Recognition Tasks Peng Liyizhe, Zhang Zixing, Pang Tao, Han Jing, Zhao Huan, Chen Hao, Schuller Björn W.
- Reproducing Whisper-style Training Using An Open-source Toolkit And Publicly Available Data Peng Yifan, Tian Jinchuan, Yan Brian, Berrebbi Dan, Chang Xuankai, Li Xinjian, Shi Jiatong, Arora Siddhant, Chen William, Sharma Roshan, Zhang Wangyou, Sudo Yui, Shakeel Muhammad, Jung Jee-weon, Maiti Soumi, Watanabe Shinji
- Large Language Models Sensitivity To The Order Of Options In Multiple-choice Questions Pezeshkpour Pouya, Hruschka Estevam
- Let Models Speak Ciphers: Multiagent Debate Through Embeddings Pham Chau, Liu Boyi, Yang Yingxiang, Chen Zhengyu, Liu Tianyi, Yuan Jianbo, Plummer Bryan A., Wang Zhaoran, Yang Hongxia
- LLM Self Defense: By Self Examination, Llms Know They Are Being Tricked Phute Mansi, Helbling Alec, Hull Matthew, Peng Shengyun, Szyller Sebastian, Cornelius Cory, Chau Duen Horng
- Jatmo: Prompt Injection Defense By Task-specific Finetuning Piet Julien, Alrashed Maha, Sitawarin Chawin, Chen Sizhe, Wei Zeming, Sun Elizabeth, Alomair Basel, Wagner David
- Visual Adversarial Examples Jailbreak Aligned Large Language Models Qi Xiangyu, Huang Kaixuan, Panda Ashwinee, Henderson Peter, Wang Mengdi, Mittal Prateek
- Fine-tuning Aligned Language Models Compromises Safety, Even When Users Do Not Intend To! Qi Xiangyu, Zeng Yi, Xie Tinghao, Chen Pin-yu, Jia Ruoxi, Mittal Prateek, Henderson Peter
- Toolink: Linking Toolkit Creation And Using Through Chain-of-solving On Open-source Model Qian Cheng, Xiong Chenyan, Liu Zhenghao, Liu Zhiyuan
- Hijacking Large Language Models Via Adversarial In-context Learning Qiang Yao, Zhou Xiangyu, Zhu Dongxiao
- Taskweaver: A Code-first Agent Framework Qiao Bo, Li Liqun, Zhang Xu, He Shilin, Kang Yu, Zhang Chaoyun, Yang Fangkai, Dong Hang, Zhang Jue, Wang Lu, Ma Minghua, Zhao Pu, Qin Si, Qin Xiaoting, Du Chao, Xu Yong, Lin Qingwei, Rajmohan Saravan, Zhang Dongmei
- Gameeval: Evaluating Llms On Conversational Games Qiao Dan, Wu Chenfei, Liang Yaobo, Li Juntao, Duan Nan
- Latent Jailbreak: A Benchmark For Evaluating Text Safety And Output Robustness Of Large Language Models Qiu Huachuan, Zhang Shuai, Li Anqi, He Hongliang, Lan Zhenzhong
- Does The "most Sinfully Decadent Cake Ever" Taste Good? Answering Yes/no Questions From Figurative Contexts Rakshit Geetanjali, Flanigan Jeffrey
- When Large Language Models Contradict Humans? Large Language Models' Sycophantic Behaviour Ranaldi Leonardo, Pucci Giulia
- Tricking Llms Into Disobedience: Formalizing, Analyzing, And Detecting Jailbreaks Rao Abhinav, Vashistha Sachin, Naik Atharva, Aditya Somak, Choudhury Monojit
- Can Chatgpt Assess Human Personalities? A General Evaluation Framework Rao Haocong, Leung Cyril, Miao Chunyan
- The Troubling Emergence Of Hallucination In Large Language Models -- An Extensive Definition, Quantification, And Prescriptive Remediations Rawte Vipula, Chakraborty Swagata, Pathak Agnibh, Sarkar Anubhav, Tonmoy S. M Towhidul Islam, Chadha Aman, Sheth Amit P., Das Amitava
- I'm Afraid I Can't Do That: Predicting Prompt Refusal In Black-box Generative Language Models Reuter Max, Schulze William
- Smoothllm: Defending Large Language Models Against Jailbreaking Attacks Robey Alexander, Wong Eric, Hassani Hamed, Pappas George J.
- Audiogpt: Understanding And Generating Speech, Music, Sound, And Talking Head Rongjie Huang, Mingze Li, Dongchao Yang, Jiatong Shi, Xuankai Chang, Zhenhui Ye, Yuning Wu, Zhiqing Hong, Jiawei Huang, Jinglin Liu, Yi Ren, Zhou Zhao, Shinji Watanabe
- Generating Phishing Attacks Using Chatgpt Roy Sayak Saha, Naragam Krishna Vamsi, Nilizadeh Shirin
- From Chatbots To Phishbots? -- Preventing Phishing Scams Created Using Chatgpt, Google Bard And Claude Roy Sayak Saha, Thota Poojitha, Naragam Krishna Vamsi, Nilizadeh Shirin
- Attention-likelihood Relationship In Transformers Ruscio Valeria, Maiorca Valentino, Silvestri Fabrizio
- Probing Conceptual Understanding Of Large Visual-language Models Schiappa Madeline, Abdullah Raiyaan, Azad Shehreen, Claypoole Jared, Cogswell Michael, Divakaran Ajay, Rawat Yogesh
- Ignore This Title And Hackaprompt: Exposing Systemic Vulnerabilities Of Llms Through A Global Scale Prompt Hacking Competition Schulhoff Sander, Pinto Jeremy, Khan Anaum, Bouchard Louis-françois, Si Chenglei, Anati Svetlina, Tagliabue Valen, Kost Anson Liu, Carnahan Christopher, Boyd-graber Jordan
- Scalable And Transferable Black-box Jailbreaks For Language Models Via Persona Modulation Shah Rusheb, Feuillade--montixi Quentin, Pour Soroush, Tagade Arush, Casper Stephen, Rando Javier
- Towards Concept-aware Large Language Models Shani Chen, Vreeken Jilles, Shahaf Dafna
- Jailbreak In Pieces: Compositional Adversarial Attacks On Multi-modal Language Models Shayegani Erfan, Dong Yue, Abu-ghazaleh Nael
- Improving The Robustness Of Transformer-based Large Language Models With Dynamic Attention Shen Lujia, Pu Yuwen, Ji Shouling, Li Changjiang, Zhang Xuhong, Ge Chunpeng, Wang Ting
- Large Language Model Alignment: A Survey Shen Tianhao, Jin Renren, Huang Yufei, Liu Chuang, Dong Weilong, Guo Zishan, Wu Xinwei, Liu Yan, Xiong Deyi
- "do Anything Now": Characterizing And Evaluating In-the-wild Jailbreak Prompts On Large Language Models Shen Xinyue, Chen Zeyuan, Backes Michael, Shen Yun, Zhang Yang
- In Chatgpt We Trust? Measuring And Characterizing The Reliability Of Chatgpt Shen Xinyue, Chen Zeyuan, Backes Michael, Zhang Yang
- Badgpt: Exploring Security Vulnerabilities Of Chatgpt Via Backdoor Attacks To Instructgpt Shi Jiawen, Liu Yixin, Zhou Pan, Sun Lichao
- Exploring The Robustness Of Large Language Models For Solving Programming Problems Shirafuji Atsushi, Watanobe Yutaka, Ito Takumi, Morishita Makoto, Nakamura Yuki, Oda Yusuke, Suzuki Jun
- SC-ML: Self-supervised Counterfactual Metric Learning For Debiased Visual Question Answering Shu Xinyao, Yan Shiyang, Yang Xu, Wu Ziheng, Chen Zhongfeng, Lu Zhenyu
- Mondrian: Prompt Abstraction Attack Against Large Language Models For Cheaper API Pricing Si Wai Man, Backes Michael, Zhang Yang
- Exploiting Large Language Models (llms) Through Deception Techniques And Persuasion Principles Singh Sonali, Abri Faranak, Namin Akbar Siami
- Towards Expert-level Medical Question Answering With Large Language Models Singhal Karan, Tu Tao, Gottweis Juraj, Sayres Rory, Wulczyn Ellery, Hou Le, Clark Kevin, Pfohl Stephen, Cole-lewis Heather, Neal Darlene, Schaekermann Mike, Wang Amy, Amin Mohamed, Lachgar Sami, Mansfield Philip, Prakash Sushant, Green Bradley, Dominowska Ewa, Arcas Blaise Aguera Y, Tomasev Nenad, Liu Yun, Wong Renee, Semturs Christopher, Mahdavi S. Sara, Barral Joelle, Webster Dale, Corrado Greg S., Matias Yossi, Azizi Shekoofeh, Karthikesalingam Alan, Natarajan Vivek
- Preference Ranking Optimization For Human Alignment Song Feifan, Yu Bowen, Li Minghao, Yu Haiyang, Huang Fei, Li Yongbin, Wang Houfeng
- Can Large Language Models Change User Preference Adversarially? Subhash Varshini
- Does Fine-tuning GPT-3 With The Openai API Leak Personally-identifiable Information? Sun Albert Yu, Zemour Eliott, Saxena Arushi, Vaidyanathan Udith, Lin Eric, Lau Christian, Mugunthan Vaikkunth
- Speech-based Slot Filling Using Large Language Models Sun Guangzhi, Feng Shutong, Jiang Dongcheng, Zhang Chao, Gašić Milica, Woodland Philip C.
- Evaluating The Zero-shot Robustness Of Instruction-tuned Language Models Sun Jiuding, Shaib Chantal, Wallace Byron C.
- Towards Robust Temporal Reasoning Of Large Language Models Via A Multi-hop QA Dataset And Pseudo-instruction Tuning Tan Qingyu, Ng Hwee Tou, Bing Lidong
- Interpreting Pretrained Language Models Via Concept Bottlenecks Tan Zhen, Cheng Lu, Wang Song, Bo Yuan, Li Jundong, Liu Huan
- Using Large Language Models For Cybersecurity Capture-the-flag Challenges And Certification Questions Tann Wesley, Liu Yuancheng, Sim Jun Heng, Seah Choon Meng, Chang Ee-chien
- Adversarial Conversational Shaping For Intelligent Agents Tarasiewicz Piotr, Kenjeyev Sultan, Sebag Ilana, Alshehabi Shehab
- Nomiracl: Knowing When You Don't Know For Robust Multilingual Retrieval-augmented Generation Thakur Nandan, Bonifacio Luiz, Zhang Xinyu, Ogundepo Odunayo, Kamalloo Ehsan, Alfonso-hermelo David, Li Xiaoguang, Liu Qun, Chen Boxing, Rezagholizadeh Mehdi, Lin Jimmy
- R³ Prompting: Review, Rephrase And Resolve For Chain-of-thought Reasoning In Large Language Models Under Noisy Context Tian Qingyuan, Zhu Hanlun, Wang Lei, Li Yang, Lan Yunshi
- Evil Geniuses: Delving Into The Safety Of Llm-based Agents Tian Yu, Yang Xiao, Zhang Jingyuan, Dong Yinpeng, Su Hang
- RLHF-V: Towards Trustworthy Mllms Via Behavior Alignment From Fine-grained Correctional Human Feedback Tianyu Yu, Yuan Yao, Haoye Zhang, Taiwen He, Yifeng Han, Ganqu Cui, Jinyi Hu, Zhiyuan Liu, Hai-tao Zheng, Maosong Sun, Tat-seng Chua
- Llmseceval: A Dataset Of Natural Language Prompts For Security Evaluations Tony Catherine, Mutas Markus, Ferreyra Nicolás E. Díaz, Scandariato Riccardo
- How Many Unicorns Are In This Image? A Safety Evaluation Benchmark For Vision Llms Tu Haoqin, Cui Chenhang, Wang Zijun, Zhou Yiyang, Zhao Bingchen, Han Junlin, Zhou Wangchunshu, Yao Huaxiu, Xie Cihang
- Let's Think Frame By Frame With VIP: A Video Infilling And Prediction Dataset For Evaluating Video Chain-of-thought Vaishnavi Himakunthala, Andy Ouyang, Daniel Rose, Ryan He, Alex Mei, Yujie Lu, Chinmay Sonar, Michael Saxon, William Yang Wang
- Theory Of Mind In Large Language Models: Examining Performance Of 11 State-of-the-art Models Vs. Children Aged 7-10 On Advanced Tests Van Duijn Max J., Van Dijk Bram M. A., Kouwenhoven Tom, De Valk Werner, Spruit Marco R., Van Der Putten Peter
- Adventures Of Trustworthy Vision-language Models: A Survey Vatsa Mayank, Jain Anubhooti, Singh Richa
- Small Language Models Improve Giants By Rewriting Their Outputs Vernikos Giorgos, Bražinskas Arthur, Adamek Jakub, Mallinson Jonathan, Severyn Aliaksei, Malmi Eric
- Mtcue: Learning Zero-shot Control Of Extra-textual Attributes By Leveraging Unstructured Context In Neural Machine Translation Vincent Sebastian, Flynn Robert, Scarton Carolina
- Openchat: Advancing Open-source Language Models With Mixed-quality Data Wang Guan, Cheng Sijie, Zhan Xianyuan, Li Xiangang, Song Sen, Liu Yang
- On The Robustness Of Chatgpt: An Adversarial And Out-of-distribution Perspective Wang Jindong, Hu Xixu, Hou Wenxin, Chen Hao, Zheng Runkai, Wang Yidong, Yang Linyi, Huang Haojun, Ye Wei, Geng Xiubo, Jiao Binxin, Zhang Yue, Xie Xing
- Simultaneous Machine Translation With Large Language Models Wang Minghan, Zhao Jinming, Vu Thuy-trang, Shiri Fatemeh, Shareghi Ehsan, Haffari Gholamreza
- Instructta: Instruction-tuned Targeted Attack For Large Vision-language Models Wang Xunguang, Ji Zhenlan, Ma Pingchuan, Li Zongjie, Wang Shuai
- Gradient-based Word Substitution For Obstinate Adversarial Examples Generation In Language Models Wang Yimu, Shi Peng, Zhang Hongyang
- Fake Alignment: Are Llms Really Aligned Well? Wang Yixu, Teng Yan, Huang Kexin, Lyu Chengqi, Zhang Songyang, Zhang Wenwei, Ma Xingjun, Jiang Yu-gang, Qiao Yu, Wang Yingchun
- Gpt4video: A Unified Multimodal Large Language Model For Instruction-followed Understanding And Safety-aware Generation Wang Zhanyu, Wang Longyue, Zhao Zhen, Wu Minghao, Lyu Chenyang, Li Huayang, Cai Deng, Zhou Luping, Shi Shuming, Tu Zhaopeng
- Generating Valid And Natural Adversarial Examples With Large Language Models Wang Zimu, Wang Wei, Chen Qi, Wang Qiufeng, Nguyen Anh
- Jailbroken: How Does LLM Safety Training Fail? Wei Alexander, Haghtalab Nika, Steinhardt Jacob
- Lmsanitator: Defending Prompt-tuning Against Task-agnostic Backdoors Wei Chengkun, Meng Wenlong, Zhang Zhikun, Chen Min, Zhao Minghu, Fang Wenjing, Wang Lei, Zhang Zihui, Chen Wenzhi
- Menatqa: A New Dataset For Testing The Temporal Comprehension And Reasoning Abilities Of Large Language Models Wei Yifan, Su Yisong, Ma Huanhuan, Yu Xiaoyan, Lei Fangyu, Zhang Yuanzhe, Zhao Jun, Liu Kang
- Jailbreak And Guard Aligned Language Models With Only Few In-context Demonstrations Wei Zeming, Wang Yifei, Li Ang, Mo Yichuan, Wang Yisen
- Unveiling The Implicit Toxicity In Large Language Models Wen Jiaxin, Ke Pei, Sun Hao, Zhang Zhexin, Li Chengfei, Bai Jinfeng, Huang Minlie
- Mindmap: Knowledge Graph Prompting Sparks Graph Of Thoughts In Large Language Models Wen Yilin, Wang Zifeng, Sun Jimeng
- Is Chatgpt A Good Translator? Yes With GPT-4 As The Engine Wenxiang Jiao, Wenxuan Wang, Jen-tse Huang, Xing Wang, Shuming Shi, Zhaopeng Tu
- The Generative AI Paradox: "what It Can Create, It May Not Understand" West Peter, Lu Ximing, Dziri Nouha, Brahman Faeze, Li Linjie, Hwang Jena D., Jiang Liwei, Fisher Jillian, Ravichander Abhilasha, Chandu Khyathi, Newman Benjamin, Koh Pang Wei, Ettinger Allyson, Choi Yejin
- Exploring The Limits Of Chatgpt In Software Security Applications Wu Fangzhou, Zhang Qingzhao, Bajaj Ati Priya, Bao Tiffany, Zhang Ning, Wang Ruoyu "fish", Xiao Chaowei
- Toward Adversarial Training On Contextualized Language Representation Wu Hongqiu, Liu Yongxiang, Shi Hanwen, Zhao Hai, Zhang Min
- Analyzing Chain-of-thought Prompting In Large Language Models Via Gradient-based Feature Attributions Wu Skyler, Shen Eric Meng, Badrinath Charumathi, Ma Jiaqi, Lakkaraju Himabindu
- Unveiling Security, Privacy, And Ethical Concerns Of Chatgpt Wu Xiaodong, Duan Ran, Ni Jianbing
- Jailbreaking GPT-4V Via Self-adversarial Attacks With System Prompts Wu Yuanwei, Li Xiang, Liu Yixin, Zhou Pan, Sun Lichao
- How Far Are Llms From Believable AI? A Benchmark For Evaluating The Believability Of Human Behavior Simulation Xiao Yang, Cheng Yi, Fu Jinlan, Wang Jiashuo, Li Wenjie, Liu Pengfei
- Rationale-enhanced Language Models Are Better Continual Relation Learners Xiong Weimin, Song Yifan, Wang Peiyi, Li Sujian
- Gentopia: A Collaborative Platform For Tool-augmented Llms Xu Binfeng, Liu Xukun, Shen Hua, Han Zeyu, Li Yuhan, Yue Murong, Peng Zhiyuan, Liu Yuchen, Yao Ziyu, Xu Dongkuan
- Rewoo: Decoupling Reasoning From Observations For Efficient Augmented Language Models Xu Binfeng, Peng Zhiyuan, Lei Bowen, Mukherjee Subhabrata, Liu Yuchen, Xu Dongkuan
- DPL: Decoupled Prompt Learning For Vision-language Models Xu Chen, Zhu Yuhan, Zhang Guozhen, Shen Haocheng, Liao Yixuan, Chen Xiaoxin, Wu Gangshan, Wang Limin
- Instructions As Backdoors: Backdoor Vulnerabilities Of Instruction Tuning For Large Language Models Xu Jiashu, Ma Mingyu Derek, Wang Fei, Xiao Chaowei, Chen Muhao
- Sc-safety: A Multi-round Open-ended Question Adversarial Safety Benchmark For Large Language Models In Chinese Xu Liang, Zhao Kangkang, Zhu Lei, Xue Hang
- Cognitive Overload: Jailbreaking Large Language Models With Overloaded Logical Thinking Xu Nan, Wang Fei, Zhou Ben, Li Bang Zheng, Xiao Chaowei, Chen Muhao
- On The Tool Manipulation Capability Of Open-source Large Language Models Xu Qiantong, Hong Fenglu, Li Bo, Hu Changran, Chen Zhengyu, Zhang Jian
- An LLM Can Fool Itself: A Prompt-based Adversarial Attack Xu Xilie, Kong Keyi, Liu Ning, Cui Lizhen, Wang Di, Zhang Jingfeng, Kankanhalli Mohan
- How Robust Is GPT-3.5 To Predecessors? A Comprehensive Study On Language Understanding Tasks Xuanting Chen, Junjie Ye, Can Zu, Nuo Xu, Rui Zheng, Minlong Peng, Jie Zhou, Tao Gui, Qi Zhang, Xuanjing Huang
- Representation Learning With Large Language Models For Recommendation Xubin Ren, Wei Wei, Lianghao Xia, Lixin Su, Suqi Cheng, Junfeng Wang, Dawei Yin, Chao Huang
- Backdooring Instruction-tuned Large Language Models With Virtual Prompt Injection Yan Jun, Yadav Vikas, Li Shiyang, Chen Lichang, Tang Zheng, Wang Hai, Srinivasan Vijay, Ren Xiang, Jin Hongxia
- Psycot: Psychological Questionnaire As Powerful Chain-of-thought For Personality Detection Yang Tao, Shi Tianyuan, Wan Fanqi, Quan Xiaojun, Wang Qifan, Wu Bingzhe, Wu Jiaxiang
- Shadow Alignment: The Ease Of Subverting Safely-aligned Language Models Yang Xianjun, Wang Xiao, Zhang Qi, Petzold Linda, Wang William Yang, Zhao Xun, Lin Dahua
- On The Planning, Search, And Memorization Capabilities Of Large Language Models Yang Yunhao, Tomar Anshul
- Poisonprompt: Backdoor Attack On Prompt-based Large Language Models Yao Hongwei, Lou Jian, Qin Zhan
- Promptcare: Prompt Copyright Protection By Watermark Injection And Verification Yao Hongwei, Lou Jian, Ren Kui, Qin Zhan
- LLM Lies: Hallucinations Are Not Bugs, But Features As Adversarial Examples Yao Jia-yu, Ning Kun-peng, Liu Zhen-hui, Ning Mu-nan, Liu Yu-yang, Yuan Li
- A Survey On Large Language Model (LLM) Security And Privacy: The Good, The Bad, And The Ugly Yao Yifan, Duan Jinhao, Xu Kaidi, Cai Yuanfang, Sun Zhibo, Zhang Yue
- Benchmarking And Defending Against Indirect Prompt Injection Attacks On Large Language Models Yi Jingwei, Xie Yueqi, Zhu Bin, Kiciman Emre, Sun Guangzhong, Xie Xing, Wu Fangzhao
- Alignment Is Not Sufficient To Prevent Large Language Models From Generating Harmful Information: A Psychoanalytic Perspective Yin Zi, Ding Wei, Liu Jia
- Intuitive Or Dependent? Investigating Llms' Behavior Style To Conflicting Prompts Ying Jiahao, Cao Yixin, Xiong Kai, He Yidong, Cui Long, Liu Yongbin
- Low-resource Languages Jailbreak GPT-4 Yong Zheng-xin, Menghini Cristina, Bach Stephen H.
- Fundamental Limitations Of Alignment In Large Language Models Yotam Wolf, Noam Wies, Oshri Avnery, Yoav Levine, Amnon Shashua
- Ferret: Refer And Ground Anything Anywhere At Any Granularity You Haoxuan, Zhang Haotian, Gan Zhe, Du Xianzhi, Zhang Bowen, Wang Zirui, Cao Liangliang, Chang Shih-fu, Yang Yinfei
- GPTFUZZER: Red Teaming Large Language Models With Auto-generated Jailbreak Prompts Yu Jiahao, Lin Xingwei, Yu Zheng, Xing Xinyu
- Assessing Prompt Injection Risks In 200+ Custom Gpts Yu Jiahao, Wu Yuhang, Shu Dong, Jin Mingyu, Yang Sabrina, Xing Xinyu
- Chain-of-note: Enhancing Robustness In Retrieval-augmented Language Models Yu Wenhao, Zhang Hongming, Pan Xiaoman, Ma Kaixin, Wang Hongwei, Yu Dong
- Reeval: Automatic Hallucination Evaluation For Retrieval-augmented Large Language Models Via Transferable Adversarial Attacks Yu Xiaodong, Cheng Hao, Liu Xiaodong, Roth Dan, Gao Jianfeng
- Fine-tuning Language Models With Generative Adversarial Reward Modelling Yu Zhang Ze, Jaw Lau Jia, Hui Zhang, Low Bryan Kian Hsiang
- Batcheval: Towards Human-like Text Evaluation Yuan Peiwen, Feng Shaoxiong, Li Yiwei, Wang Xinglin, Pan Boyuan, Wang Heda, Li Kan
- Multilingual Augmentation For Robust Visual Question Answering In Remote Sensing Images Yuan Zhenghang, Mou Lichao, Zhu Xiao Xiang
- Prompt Highlighter: Interactive Control For Multi-modal Llms Yuechen Zhang, Shengju Qian, Bohao Peng, Shu Liu, Jiaya Jia
- On Evaluating Adversarial Robustness Of Large Vision-language Models Yunqing Zhao, Tianyu Pang, Chao Du, Xiao Yang, Chongxuan Li, Ngai-man Cheung, Min Lin
- Building Trust In Conversational AI: A Comprehensive Review And Solution Architecture For Explainable, Privacy-aware Systems Using Llms And Knowledge Graph Zafar Ahtsham, Parthasarathy Venkatesh Balavadhani, Van Chan Le, Shahid Saad, Khan Aafaq Iqbal, Shahid Arsalan
- G3detector: General Gpt-generated Text Detector Zhan Haolan, He Xuanli, Xu Qiongkai, Wu Yuxiang, Stenetorp Pontus
- Removing RLHF Protections In GPT-4 Via Fine-tuning Zhan Qiusi, Fang Richard, Bindu Rohan, Gupta Akul, Hashimoto Tatsunori, Kang Daniel
- A Comprehensive Analysis Of The Effectiveness Of Large Language Models As Automatic Dialogue Evaluators Zhang Chen, D'haro Luis Fernando, Chen Yiming, Zhang Malu, Li Haizhou
- Aligning Instruction Tasks Unlocks Large Language Models As Zero-shot Relation Extractors Zhang Kai, Gutiérrez Bernal Jiménez, Su Yu
- Defending Large Language Models Against Jailbreaking Attacks Through Goal Prioritization Zhang Zhexin, Yang Junxiao, Ke Pei, Mi Fei, Wang Hongning, Huang Minlie
- Responsible Task Automation: Empowering Large Language Models As Responsible Task Automators Zhang Zhizheng, Zhang Xiaoyi, Xie Wenxuan, Lu Yan
- Retrieving Multimodal Information For Augmented Generation: A Survey Zhao Ruochen, Chen Hailin, Wang Weishi, Jiao Fangkai, Do Xuan Long, Qin Chengwei, Ding Bosheng, Guo Xiaobao, Li Minzhi, Li Xingxuan, Joty Shafiq
- Robut: A Systematic Study Of Table QA Robustness Against Human-annotated Adversarial Perturbations Zhao Yilun, Zhao Chen, Nan Linyong, Qi Zhenting, Zhang Wenlin, Tang Xiangru, Mi Boyu, Radev Dragomir
- Tabula: Harnessing Language Models For Tabular Data Synthesis Zhao Zilong, Birke Robert, Chen Lydia
- Noisy Exemplars Make Large Language Models More Robust: A Domain-agnostic Behavioral Analysis Zheng Hongyi, Saparov Abulhair
- Trojfsp: Trojan Insertion In Few-shot Prompt Tuning Zheng Mengxin, Xue Jiaqi, Chen Xun, Wang Yanshan, Lou Qian, Jiang Lei
- Self-polish: Enhance Reasoning In Large Language Models Via Problem Refinement Zhiheng Xi, Senjie Jin, Yuhao Zhou, Rui Zheng, Songyang Gao, Tao Gui, Qi Zhang, Xuanjing Huang
- Bag Of Tricks For Effective Language Model Pretraining And Downstream Adaptation: A Case Study On GLUE Zhong Qihuang, Ding Liang, Peng Keqin, Liu Juhua, Du Bo, Shen Li, Zhan Yibing, Tao Dacheng
- Making Harmful Behaviors Unlearnable For Large Language Models Zhou Xin, Lu Yi, Ma Ruotian, Gui Tao, Zhang Qi, Huang Xuanjing
- Promptbench: A Unified Library For Evaluation Of Large Language Models Zhu Kaijie, Zhao Qinlin, Chen Hao, Wang Jindong, Xie Xing
- Autodan: Interpretable Gradient-based Adversarial Attacks On Large Language Models Zhu Sicheng, Zhang Ruiyi, An Bang, Wu Gang, Barrow Joe, Wang Zichao, Huang Furong, Nenkova Ani, Sun Tong
- Red Teaming Chatgpt Via Jailbreaking: Bias, Robustness, Reliability And Toxicity Zhuo Terry Yue, Huang Yujin, Chen Chunyang, Xing Zhenchang
- Universal And Transferable Adversarial Attacks On Aligned Language Models Zou Andy, Wang Zifan, Carlini Nicholas, Nasr Milad, Kolter J. Zico, Fredrikson Matt
- A Dataset And Benchmark For Hospital Course Summarization With Adapted Large Language Models Aali Asad, Van Veen Dave, Arefeen Yamin Ishraq, Hom Jason, Bluethgen Christian, Reis Eduardo Pontes, Gatidis Sergios, Clifford Namuun, Daws Joseph, Tehrani Arash S., Kim Jangwon, Chaudhari Akshay S.
- Securing Large Language Models: Threats, Vulnerabilities And Responsible Practices Abdali Sara, Anarfi Richard, Barberan Cj, He Jia
- Can Llms Be Fooled? Investigating Vulnerabilities In Llms Abdali Sara, He Jia, Barberan Cj, Anarfi Richard
- Are You Still On Track!? Catching LLM Task Drift With Activations Abdelnabi Sahar, Fay Aideen, Cherubin Giovanni, Salem Ahmed, Fritz Mario, Paverd Andrew
- Phi-3 Technical Report: A Highly Capable Language Model Locally On Your Phone Abdin Marah, Aneja Jyoti, Awadalla Hany, Awadallah Ahmed, Awan Ammar Ahmad, Bach Nguyen, Bahree Amit, Bakhtiari Arash, Bao Jianmin, Behl Harkirat, Benhaim Alon, Bilenko Misha, Bjorck Johan, Bubeck Sébastien, Cai Martin, Cai Qin, Chaudhary Vishrav, Chen Dong, Chen Dongdong, Chen Weizhu, Chen Yen-chun, Chen Yi-ling, Cheng Hao, Chopra Parul, Dai Xiyang, Dixon Matthew, Eldan Ronen, Fragoso Victor, Gao Jianfeng, Gao Mei, Gao Min, Garg Amit, Del Giorno Allie, Goswami Abhishek, Gunasekar Suriya, Haider Emman, Hao Junheng, Hewett Russell J., Hu Wenxiang, Huynh Jamie, Iter Dan, Jacobs Sam Ade, Javaheripi Mojan, Jin Xin, Karampatziakis Nikos, Kauffmann Piero, Khademi Mahoud, Kim Dongwoo, Kim Young Jin, Kurilenko Lev, Lee James R., Lee Yin Tat, Li Yuanzhi, Li Yunsheng, Liang Chen, Liden Lars, Lin Xihui, Lin Zeqi, Liu Ce, Liu Liyuan, Liu Mengchen, Liu Weishung, Liu Xiaodong, Luo Chong, Madan Piyush, Mahmoudzadeh Ali, Majercak David, Mazzola Matt, Mendes Caio César Teodoro, Mitra Arindam, Modi Hardik, Nguyen Anh, Norick Brandon, Patra Barun, Perez-becker Daniel, Portet Thomas, Pryzant Reid, Qin Heyang, Radmilac Marko, Ren Liliang, De Rosa Gustavo, Rosset Corby, Roy Sambudha, Ruwase Olatunji, Saarikivi Olli, Saied Amin, Salim Adil, Santacroce Michael, Shah Shital, Shang Ning, Sharma Hiteshi, Shen Yelong, Shukla Swadheen, Song Xia, Tanaka Masahiro, Tupini Andrea, Vaddamanu Praneetha, Wang Chunyu, Wang Guanhua, Wang Lijuan, Wang Shuohang, Wang Xin, Wang Yu, Ward Rachel, Wen Wen, Witte Philipp, Wu Haiping, Wu Xiaoxia, Wyatt Michael, Xiao Bin, Xu Can, Xu Jiahang, Xu Weijian, Xue Jilong, Yadav Sonali, Yang Fan, Yang Jianwei, Yang Yifan, Yang Ziyi, Yu Donghan, Yuan Lu, Zhang Chenruidong, Zhang Cyril, Zhang Jianwen, Zhang Li Lyna, Zhang Yi, Zhang Yue, Zhang Yunan, Zhou Xiren
- Mememqa: Multimodal Question Answering For Memes Via Rationale-based Inferencing Agarwal Siddhant, Sharma Shivam, Nakov Preslav, Chakraborty Tanmoy
- Codemirage: Hallucinations In Code Generated By Large Language Models Agarwal Vibhor, Pei Yulong, Alamir Salwa, Liu Xiaomo
- Examining The Robustness Of LLM Evaluation To The Distributional Assumptions Of Benchmarks Ailem Melissa, Marazopoulou Katerina, Siska Charlotte, Bono James
- FACTS About Building Retrieval Augmented Generation-based Chatbots Akkiraju Rama, Xu Anbang, Bora Deepak, Yu Tan, An Lu, Seth Vishal, Shukla Aaditya, Gundecha Pritam, Mehta Hridhay, Jha Ashwin, Raj Prithvi, Balasubramanian Abhinav, Maram Murali, Muthusamy Guru, Annepally Shivakesh Reddy, Knowles Sidney, Du Min, Burnett Nick, Javiya Sean, Marannan Ashok, Kumari Mamta, Jha Surbhi, Dereszenski Ethan, Chakraborty Anupam, Ranjan Subhash, Terfai Amina, Surya Anoop, Mercer Tracey, Thanigachalam Vinodh Kumar, Bar Tamar, Krishnan Sanjana, Kilaru Samy, Jaksic Jasmine, Algarici Nave, Liberman Jacob, Conway Joey, Nayyar Sonu, Boitano Justin
- Evaluating The Efficacy Of Foundational Models: Advancing Benchmarking Practices To Enhance Fine-tuning Decision-making Amujo Oluyemi Enoch, Yang Shanchieh Jay
- Automatic Pseudo-harmful Prompt Generation For Evaluating False Refusals In Large Language Models An Bang, Zhu Sicheng, Zhang Ruiyi, Panaitescu-liess Michael-andrei, Xu Yuancheng, Huang Furong
- Investigating The Robustness Of Llms On Math Word Problems Anantheswaran Ujjwala, Gupta Himanshu, Scaria Kevin, Verma Shreyas, Baral Chitta, Mishra Swaroop
- Does Refusal Training In Llms Generalize To The Past Tense? Andriushchenko Maksym, Flammarion Nicolas
- Simpler Becomes Harder: Do Llms Exhibit A Coherent Behavior On Simplified Corpora? Anschütz Miriam, Mosca Edoardo, Groh Georg
- Open (clinical) Llms Are Sensitive To Instruction Phrasings Arroyo Alberto Mario Ceballos, Munnangi Monica, Sun Jiuding, Zhang Karen Y. C., Mcinerney Denis Jered, Wallace Byron C., Amir Silvio
- Exploring The Robustness Of Task-oriented Dialogue Systems For Colloquial German Varieties Artemova Ekaterina, Blaschke Verena, Plank Barbara
- Assessing Llms For Zero-shot Abstractive Summarization Through The Lens Of Relevance Paraphrasing Askari Hadi, Chhabra Anshuman, Chen Muhao, Mohapatra Prasant
- Robustsentembed: Robust Sentence Embeddings Using Adversarial Self-supervised Contrastive Learning Asl Javad Rafiei, Panzade Prajwal, Blanco Eduardo, Takabi Daniel, Cai Zhipeng
- Llm-driven Robots Risk Enacting Discrimination, Violence, And Unlawful Actions Azeem Rumaisa, Hundt Andrew, Mansouri Masoumeh, Brandão Martim
- COIG-CQIA: Quality Is All You Need For Chinese Instruction Fine-tuning Bai Yuelin, Du Xinrun, Liang Yiming, Jin Yonggang, Liu Ziqiang, Zhou Junting, Zheng Tianyu, Zhang Xincheng, Ma Nuo, Wang Zekun, Yuan Ruibin, Wu Haihong, Lin Hongquan, Huang Wenhao, Zhang Jiajun, Chen Wenhu, Lin Chenghua, Fu Jie, Yang Min, Ni Shiwen, Zhang Ge
- Hallucination Of Multimodal Large Language Models: A Survey Bai Zechen, Wang Pichao, Xiao Tianjun, He Tong, Han Zongbo, Zhang Zheng, Shou Mike Zheng
- CYGENT: A Cybersecurity Conversational Agent With Log Summarization Powered By GPT-3 Balasubramanian Prasasthy, Seby Justin, Kostakos Panos
- To Each (textual Sequence) Its Own: Improving Memorized-data Unlearning In Large Language Models Barbulescu George-octavian, Triantafillou Peter
- Reinforcement Learning From LLM Feedback To Counteract Goal Misgeneralization Barj Houda Nait El, Sautory Theophile
- Llms' Reading Comprehension Is Affected By Parametric Knowledge And Struggles With Hypothetical Statements Basmov Victoria, Goldberg Yoav, Tsarfaty Reut
- On The Robustness Of Language Models For Tabular Question Answering Bhandari Kushal Raj, Xing Sixue, Dan Soham, Gao Jianxi
- Enhancing Code Translation In Language Models With Few-shot Learning Via Retrieval-augmented Generation Bhattarai Manish, Santos Javier E., Jones Shawn, Biswas Ayan, Alexandrov Boian, O'malley Daniel
- Large Language Models Are Vulnerable To Bait-and-switch Attacks For Generating Harmful Content Bianchi Federico, Zou James
- Rambla: A Framework For Evaluating The Reliability Of Llms As Assistants In The Biomedical Domain Bolton William James, Poyiadzi Rafael, Morrell Edward R., Bueno Gabriela Van Bergen Gonzalez, Goetz Lea
- Transformers Meet Neural Algorithmic Reasoners Bounsi Wilfried, Ibarz Borja, Dudzik Andrew, Hamrick Jessica B., Markeeva Larisa, Vitvitskyi Alex, Pascanu Razvan, Veličković Petar
- Enhancing Trust In Llms: Algorithms For Comparing And Interpreting Llms Brown Nik Bear
- On Large Language Models In National Security Applications Caballero William N., Jenkins Phillip R.
- Are Large Language Models Really Bias-free? Jailbreak Prompts For Assessing Adversarial Robustness To Bias Elicitation Cantini Riccardo, Cosenza Giada, Orsino Alessio, Talia Domenico
- On The Worst Prompt Performance Of Large Language Models Cao Bowen, Cai Deng, Zhang Zhisong, Zou Yuexian, Lam Wai
- Personalized Steering Of Large Language Models: Versatile Steering Vectors Through Bi-directional Preference Optimization Cao Yuanpu, Zhang Tianrong, Cao Bochuan, Yin Ziyi, Lin Lu, Ma Fenglong, Chen Jinghui
- An Expert Is Worth One Token: Synergizing Multiple Expert Llms As Generalist Via Expert Token Routing Chai Ziwei, Wang Guoyin, Su Jing, Zhang Tianjie, Huang Xuanwen, Wang Xuwu, Xu Jingjing, Yuan Jianbo, Yang Hongxia, Wu Fei, Yang Yang
- Cross-modal Safety Alignment: Is Textual Unlearning All You Need? Chakraborty Trishna, Shayegani Erfan, Cai Zikui, Abu-ghazaleh Nael, Asif M. Salman, Dong Yue, Roy-chowdhury Amit K., Song Chengyu
- Integrating Emotional And Linguistic Models For Ethical Compliance In Large Language Models Chang Edward Y.
- Unlocking The Wisdom Of Large Language Models: An Introduction To The Path To Artificial General Intelligence Chang Edward Y.
- How Do Large Language Models Acquire Factual Knowledge During Pretraining? Chang Hoyeon, Park Jinho, Ye Seonghyeon, Yang Sohee, Seo Youngkyung, Chang Du-seong, Seo Minjoon
- Phantom: General Trigger Attacks On Retrieval Augmented Language Generation Chaudhari Harsh, Severi Giorgio, Abascal John, Jagielski Matthew, Choquette-choo Christopher A., Nasr Milad, Nita-rotaru Cristina, Oprea Alina
- The Dark Side Of Human Feedback: Poisoning Large Language Models Via User Inputs Chen Bocheng, Guo Hanqing, Wang Guangjing, Wang Yuanda, Yan Qiben
- Nlperturbator: Studying The Robustness Of Code Llms To Natural Language Variations Chen Junkai, Li Zhenhao, Hu Xing, Xia Xin
- Cost-effective Instruction Learning For Pathology Vision And Language Analysis Chen Kaitao, Liu Mianxin, Yan Fang, Ma Lei, Shi Xiaoming, Wang Lilong, Wang Xiaosong, Zhu Lifeng, Wang Zhe, Zhou Mu, Zhang Shaoting
- Machine Unlearning In Large Language Models Chen Kongyang, Wang Zixin, Mi Bing, Liu Waixi, Wang Shaowei, Ren Xiaojun, Shen Jiaxing
- Enhancing Visual Question Answering Through Ranking-based Hybrid Training And Multimodal Fusion Chen Peiyuan, Zhang Zecheng, Dong Yiping, Zhou Li, Wang Han
- Struq: Defending Against Prompt Injection With Structured Queries Chen Sizhe, Piet Julien, Sitawarin Chawin, Wagner David
- SATO: Stable Text-to-motion Framework Chen Wenshuo, Xiao Hongru, Zhang Erhang, Hu Lijie, Wang Lei, Liu Mengyuan, Chen Chen
- The Sifo Benchmark: Investigating The Sequential Instruction Following Ability Of Large Language Models Chen Xinyi, Liao Baohao, Qi Jirui, Eustratiadis Panagiotis, Monz Christof, Bisazza Arianna, De Rijke Maarten
- RL-JACK: Reinforcement Learning-powered Black-box Jailbreaking Attack Against Llms Chen Xuan, Nie Yuzhou, Yan Lu, Mao Yunshu, Guo Wenbo, Zhang Xiangyu
- Extroversion Or Introversion? Controlling The Personality Of Your Large Language Models Chen Yanquan, Wu Zhen, Guo Junjie, Huang Shujian, Dai Xinyu
- NACL: A General And Effective KV Cache Eviction Framework For Llms At Inference Time Chen Yilong, Wang Guoxia, Shang Junyuan, Cui Shiyao, Zhang Zhenyu, Liu Tingwen, Wang Shuohuan, Sun Yu, Yu Dianhai, Wu Hua
- Robust And Scalable Model Editing For Large Language Models Chen Yingfa, Zhang Zhengyan, Han Xu, Xiao Chaojun, Liu Zhiyuan, Chen Chen, Li Kuai, Yang Tao, Sun Maosong
- MLLM Is A Strong Reranker: Advancing Multimodal Retrieval-augmented Generation Via Knowledge-enhanced Reranking And Noise-injected Training Chen Zhanpeng, Xu Chengjin, Qi Yiyan, Guo Jian
- An Empirical Study Of Data Ability Boundary In Llms' Math Reasoning Chen Zui, Chen Yezeng, Han Jiaqi, Huang Zhijie, Qi Ji, Zhou Yi
- Unveiling Typographic Deceptions: Insights Of The Typographic Vulnerability In Large Vision-language Model Cheng Hao, Xiao Erjia, Gu Jindong, Yang Le, Duan Jinhao, Zhang Jize, Cao Jiahang, Xu Kaidi, Xu Renjing
- Self-playing Adversarial Language Game Enhances LLM Reasoning Cheng Pengyu, Hu Tianhao, Xu Han, Zhang Zhisong, Dai Yong, Han Lei, Du Nan
- Trojanrag: Retrieval-augmented Generation Can Be Backdoor Driver In Large Language Models Cheng Pengzhou, Ding Yidong, Ju Tianjie, Wu Zongru, Du Wei, Yi Ping, Zhang Zhuosheng, Liu Gongshen
- Transferring Backdoors Between Large Language Models By Knowledge Distillation Cheng Pengzhou, Wu Zongru, Ju Tianjie, Du Wei, Zhang Zhuosheng, Liu Gongshen
- Talking Nonsense: Probing Large Language Models' Understanding Of Adversarial Gibberish Inputs Cherepanova Valeriia, Zou James
- Breaking Down The Defenses: A Comparative Survey Of Attacks On Large Language Models Chowdhury Arijit Ghosh, Islam Md Mofijul, Kumar Vaibhav, Shezan Faysal Hossain, Kumar Vaibhav, Jain Vinija, Chadha Aman
- A Causal Explainable Guardrails For Large Language Models Chu Zhixuan, Wang Yan, Li Longfei, Wang Zhibo, Qin Zhan, Ren Kui
- AI Safety In Generative AI Large Language Models: A Survey Chua Jaymari, Li Yun, Yang Shiyi, Wang Chen, Yao Lina
- Developing Generative AI Chatbots Conceptual Framework For Higher Education Chukwuere Joshua Ebere
- Representations As Language: An Information-theoretic Framework For Interpretability Conklin Henry, Smith Kenny
- Out-of-context Prompting Boosts Fairness And Robustness In Large Language Model Predictions Cotta Leonardo, Maddison Chris J.
- Impact Of Non-standard Unicode Characters On Security And Comprehension In Large Language Models Daniel Johan S, Pal Anand
- Human-interpretable Adversarial Prompt Attack On Large Language Models With Situational Context Das Nilanjana, Raff Edward, Gaur Manas
- Pandora: Jailbreak Gpts By Retrieval Augmented Generation Poisoning Deng Gelei, Liu Yi, Wang Kailong, Li Yuekang, Zhang Tianwei, Liu Yang
- Teaching A Multilingual Large Language Model To Understand Multilingual Speech Via Multi-instructional Training Denisov Pavel, Vu Ngoc Thang
- Garak: A Framework For Security Probing Large Language Models Derczynski Leon, Galinkin Erick, Martin Jeffrey, Majumdar Subho, Inie Nanna
- Glam: Fine-tuning Large Language Models For Domain Knowledge Graph Alignment Via Neighborhood Partitioning And Generative Subgraph Encoding Dernbach Stefan, Agarwal Khushbu, Zuniga Alejandro, Henry Michael, Choudhury Sutanay
- Jailbreaking Text-to-image Models With Llm-based Agents Dong Yingkai, Li Zheng, Meng Xiangtao, Yu Ning, Guo Shanqing
- Attacks, Defenses And Evaluations For LLM Conversation Safety: A Survey Dong Zhichen, Zhou Zhanhui, Yang Chao, Shao Jing, Qiao Yu
- Enhancing Robustness Of Llm-synthetic Text Detectors For Academic Writing: A Comprehensive Analysis Dou Zhicheng, Guo Yuchen, Chang Ching-chun, Nguyen Huy H., Echizen Isao
- Efficient Personalized Text-to-image Generation By Leveraging Textual Subspace Du Shian, Cheng Xiaotian, Qian Qi, Wei Henglu, Xu Yi, Ji Xiangyang
- Length-controlled Alpacaeval: A Simple Way To Debias Automatic Evaluators Dubois Yann, Galambosi Balázs, Liang Percy, Hashimoto Tatsunori B.
- Lightweight Large Language Model For Medication Enquiry: Med-pal Elangovan Kabilan, Ong Jasmine Chiat Ling, Jin Liyuan, Seng Benjamin Jun Jie, Kwan Yu Heng, Tan Lit Soo, Zhong Ryan Jian, Ma Justina Koi Li, Ke Yuhe, Liu Nan, Giacomini Kathleen M, Ting Daniel Shu Wei
- Occasionally Secure: A Comparative Analysis Of Code Generation Assistants Elgedawy Ran, Sadik John, Dutta Senjuti, Gautam Anuj, Georgiou Konstantinos, Gholamrezae Farzin, Ji Fujiao, Lim Kyungchan, Liu Qian, Ruoti Scott
- M2QA: Multi-domain Multilingual Question Answering Engländer Leon, Sterz Hannah, Poth Clifton, Pfeiffer Jonas, Kuznetsov Ilia, Gurevych Iryna
- Enhancing Noise Robustness Of Retrieval-augmented Language Models With Adaptive Adversarial Training Fang Feiteng, Bai Yuelin, Ni Shiwen, Yang Min, Chen Xiaojun, Xu Ruifeng
- Gensco: Can Question Decomposition Based Passage Alignment Improve Question Answering? Fazili Barah, Goswami Koustava, Modani Natwar, Nair Inderjeet
- Towards Trustworthy AI: A Review Of Ethical And Robust Large Language Models Ferdaus Md Meftahul, Abdelguerfi Mahdi, Ioup Elias, Niles Kendall N., Pathak Ken, Sloan Steven
- Generative AI And Large Language Models For Cyber Security: All Insights You Need Ferrag Mohamed Amine, Alwahedi Fatima, Battah Ammar, Cherif Bilel, Mechri Abdechakour, Tihanyi Norbert
- LLM Detectors Still Fall Short Of Real World: Case Of Llm-generated Short News-like Posts Gameiro Henrique Da Silva, Kucharavy Andrei, Dolamic Ljiljana
- Practical Unlearning For Large Language Models Gao Chongyang, Wang Lixu, Weng Chenkai, Wang Xiao, Zhu Qi
- Harnessing The Intrinsic Knowledge Of Pretrained Language Models For Challenging Text Classification Settings Gao Lingyu
- Robust Pronoun Fidelity With English Llms: Are They Reasoning, Repeating, Or Just Biased? Gautam Vagrant, Bingert Eileen, Zhu Dawei, Lauscher Anne, Klakow Dietrich
- Coercing Llms To Do And Reveal (almost) Anything Geiping Jonas, Stein Alex, Shu Manli, Saifullah Khalid, Wen Yuxin, Goldstein Tom
- Attacking Large Language Models With Projected Gradient Descent Geisler Simon, Wollschläger Tom, Abdalla M. H. I., Gasteiger Johannes, Günnemann Stephan
- Can Large Language Models Learn Independent Causal Mechanisms? Gendron Gaël, Nguyen Bao Trung, Peng Alex Yuxuan, Witbrock Michael, Dobbie Gillian
- Jailbreaking Llms With Arabic Transliteration And Arabizi Ghanim Mansour Al, Almohaimeed Saleh, Zheng Mengxin, Solihin Yan, Lou Qian
- Eyes Closed, Safety On: Protecting Multimodal Llms Via Image-to-text Transformation Gou Yunhao, Chen Kai, Liu Zhili, Hong Lanqing, Xu Hang, Li Zhenguo, Yeung Dit-yan, Kwok James T., Zhang Yu
- Specify What? Enhancing Neural Specification Synthesis By Symbolic Methods Granberry George, Ahrendt Wolfgang, Johansson Moa
- Probing The Robustness Of Vision-language Pretrained Models: A Multimodal Adversarial Attack Approach Guan Jiwei, Ding Tianyu, Cao Longbing, Pan Lei, Wang Chen, Zheng Xi
- Learning On Graphs With Large Language Models (llms): A Deep Dive Into Model Robustness Guo Kai, Liu Zewen, Chen Zhikai, Wen Hongzhi, Jin Wei, Tang Jiliang, Chang Yi
- METAREFLECTION: Learning Instructions For Language Agents Using Past Reflections Gupta Priyanshu, Kirtania Shashank, Singha Ananya, Gulwani Sumit, Radhakrishna Arjun, Shi Sherry, Soares Gustavo
- Changing Answer Order Can Decrease MMLU Accuracy Gupta Vipul, Pantoja David, Ross Candace, Williams Adina, Ung Megan
- When All Options Are Wrong: Evaluating Large Language Model Robustness With Incorrect Multiple-choice Options Góral Gracjan, Wiśnios Emilia
- Covert Malicious Finetuning: Challenges In Safeguarding LLM Adaptation Halawi Danny, Wei Alexander, Wallace Eric, Wang Tony T., Haghtalab Nika, Steinhardt Jacob
- RAG-QA Arena: Evaluating Domain Robustness For Long-form Retrieval Augmented Question Answering Han Rujun, Zhang Yuhao, Qi Peng, Xu Yumo, Wang Jenyuan, Liu Lan, Wang William Yang, Min Bonan, Castelli Vittorio
- Wildguard: Open One-stop Moderation Tools For Safety Risks, Jailbreaks, And Refusals Of Llms Han Seungju, Rao Kavel, Ettinger Allyson, Jiang Liwei, Lin Bill Yuchen, Lambert Nathan, Choi Yejin, Dziri Nouha
- Jailbreaking Proprietary Large Language Models Using Word Substitution Cipher Handa Divij, Chirmule Advait, Gajera Bimal, Baral Chitta
- LLM Reasoners: New Evaluation, Library, And Analysis Of Step-by-step Reasoning With Large Language Models Hao Shibo, Gu Yi, Luo Haotian, Liu Tianyang, Shao Xiyan, Wang Xinyuan, Xie Shuhua, Ma Haodi, Samavedhi Adithya, Gao Qiyue, Wang Zhen, Hu Zhiting
- Risk And Response In Large Language Models: Evaluating Key Threat Categories Harandizadeh Bahareh, Salinas Abel, Morstatter Fred
- Pruning For Protection: Increasing Jailbreak Resistance In Aligned Llms Without Fine-tuning Hasan Adib, Rugina Ileana, Wang Alex
- A Comprehensive Overview Of Large Language Models (llms) For Cyber Defences: Opportunities And Directions Hassanin Mohammed, Moustafa Nour
- Sowing The Wind, Reaping The Whirlwind: The Impact Of Editing Language Models Hazra Rima, Layek Sayan, Banerjee Somnath, Poria Soujanya
- Think-program-rectify: 3D Situated Reasoning With Large Language Models He Qingrong, Lin Kejun, Chen Shizhe, Hu Anwen, Jin Qin
- Transferring Troubles: Cross-lingual Transferability Of Backdoor Attacks In Llms With Instruction Tuning He Xuanli, Wang Jun, Xu Qiongkai, Minervini Pasquale, Stenetorp Pontus, Rubinstein Benjamin I. P., Cohn Trevor
- Fortifying Ethical Boundaries In AI: Advanced Strategies For Enhancing Security In Large Language Models He Yunhong, Qiu Jianling, Zhang Wei, Yuan Zhengqing
- Towards Building A Robust Knowledge Intensive Question Answering Model With Large Language Models Hong Xingyun, Shao Yan, Wang Zhilin, Duan Manni, Jin Xiongnan
- Evaluating Llms' Mathematical And Coding Competency Through Ontology-guided Interventions Hong Pengfei, Majumder Navonil, Ghosal Deepanway, Aditya Somak, Mihalcea Rada, Poria Soujanya
- Securing Vision-language Models With A Robust Encoder Against Jailbreak And Adversarial Attacks Hossain Md Zarif, Imteaj Ahmed
- Exploring Scaling Trends In LLM Robustness Howe Nikolaus, Zajac Michał, Mckenzie Ian, Hollinsworth Oskar, Tseng Tom, Bacon Pierre-luc, Gleave Adam
- Rankprompt: Step-by-step Comparisons Make Language Models Better Reasoners Hu Chi, Ge Yuan, Ma Xiangnan, Cao Hang, Li Qiang, Yang Yonghua, Xiao Tong, Zhu Jingbo
- Hiagent: Hierarchical Working Memory Management For Solving Long-horizon Agent Tasks With Large Language Model Hu Mengkang, Chen Tianxing, Chen Qiguang, Mu Yao, Shao Wenqi, Luo Ping
- Evaluating Robustness Of Generative Search Engine On Adversarial Factual Questions Hu Xuming, Li Xiaochuan, Chen Junzhe, Li Yinghui, Li Yangning, Li Xiaoguang, Wang Yasheng, Liu Qun, Wen Lijie, Yu Philip S., Guo Zhijiang
- RAG And RAU: A Survey On Retrieval-augmented Language Model In Natural Language Processing Hu Yucheng, Lu Yuxing
- Prompt Perturbation In Retrieval-augmented Generation Based Large Language Models Hu Zhibo, Wang Chen, Shu Yanfeng, Paik Hye-young Helen, Zhu Liming
- Large Language Models In Mental Health Care: A Scoping Review Hua Yining, Liu Fenglin, Yang Kailai, Li Zehan, Na Hongbin, Sheu Yi-han, Zhou Peilin, Moran Lauren V., Ananiadou Sophia, Beam Andrew, Torous John
- Applying And Evaluating Large Language Models In Mental Health Care: A Scoping Review Of Human-assessed Generative Tasks Hua Yining, Na Hongbin, Li Zehan, Liu Fenglin, Fang Xiao, Clifton David, Torous John
- What Affects The Stability Of Tool Learning? An Empirical Study On The Robustness Of Tool Learning Frameworks Huang Chengrui, Shi Zhengliang, Wen Yuntao, Chen Xiuying, Han Peng, Gao Shen, Shang Shuo
- Antidote: Post-fine-tuning Safety Alignment For Large Language Models Against Harmful Fine-tuning Huang Tiansheng, Bhattacharya Gautam, Joshi Pratik, Kimball Josh, Liu Ling
- Vaccine: Perturbation-aware Alignment For Large Language Models Against Harmful Fine-tuning Huang Tiansheng, Hu Sihao, Liu Ling
- Rolora: Fine-tuning Rotated Outlier-free Llms For Effective Weight-activation Quantization Huang Xijie, Liu Zechun, Liu Shih-yang, Cheng Kwang-ting
- Trustllm: Trustworthiness In Large Language Models Huang Yue, Sun Lichao, Wang Haoran, Wu Siyuan, Zhang Qihui, Li Yuan, Gao Chujie, Huang Yixin, Lyu Wenhan, Zhang Yixuan, Li Xiner, Liu Zhengliang, Liu Yixin, Wang Yijue, Zhang Zhikun, Vidgen Bertie, Kailkhura Bhavya, Xiong Caiming, Xiao Chaowei, Li Chunyuan, Xing Eric, Huang Furong, Liu Hao, Ji Heng, Wang Hongyi, Zhang Huan, Yao Huaxiu, Kellis Manolis, Zitnik Marinka, Jiang Meng, Bansal Mohit, Zou James, Pei Jian, Liu Jian, Gao Jianfeng, Han Jiawei, Zhao Jieyu, Tang Jiliang, Wang Jindong, Vanschoren Joaquin, Mitchell John, Shu Kai, Xu Kaidi, Chang Kai-wei, He Lifang, Huang Lifu, Backes Michael, Gong Neil Zhenqiang, Yu Philip S., Chen Pin-yu, Gu Quanquan, Xu Ran, Ying Rex, Ji Shuiwang, Jana Suman, Chen Tianlong, Liu Tianming, Zhou Tianyi, Wang William, Li Xiang, Zhang Xiangliang, Wang Xiao, Xie Xing, Chen Xun, Wang Xuyu, Liu Yan, Ye Yanfang, Cao Yinzhi, Chen Yong, Zhao Yue
- Obscureprompt: Jailbreaking Large Language Models Via Obscure Input Huang Yue, Tang Jingyu, Chen Dongping, Tang Bingda, Wan Yao, Sun Lichao, Zhang Xiangliang
- Enhancing The Capability And Robustness Of Large Language Models Through Reinforcement Learning-driven Query Refinement Huang Zisu, Wang Xiaohua, Zhang Feiran, Xu Zhibo, Zhang Cenyuan, Zheng Xiaoqing, Huang Xuanjing
- Towards A Psychology Of Machines: Large Language Models Predict Human Memory Huff Markus, Ulakçı Elanur
- HFT: Half Fine-tuning For Large Language Models Hui Tingfeng, Zhang Zhenyu, Wang Shuohuan, Xu Weiran, Sun Yu, Wu Hua
- On Effects Of Steering Latent Representation For Large Language Model Unlearning Huu-tien Dang, Pham Trung-tin, Thanh-tung Hoang, Inoue Naoya
- Integrating Large Language Models With Graph-based Reasoning For Conversational Question Answering Jain Parag, Lapata Mirella
- Prepair: Pointwise Reasoning Enhance Pairwise Evaluating For Robust Instruction-following Assessments Jeong Hawon, Park Chaehun, Hong Jimin, Choo Jaegul
- Defending Large Language Models Against Jailbreak Attacks Via Semantic Smoothing Ji Jiabao, Hou Bairu, Robey Alexander, Pappas George J., Hassani Hamed, Zhang Yang, Wong Eric, Chang Shiyu
- Chatbug: A Common Vulnerability Of Aligned Llms Induced By Chat Templates Jiang Fengqing, Xu Zhangchen, Niu Luyao, Lin Bill Yuchen, Poovendran Radha
- The Real, The Better: Aligning Large Language Models With Online Human Behaviors Jiang Guanying, Yan Lingyong, Shi Haibo, Yin Dawei
- Time Matters: Enhancing Pre-trained News Recommendation Models With Robust User Dwell Time Injection Jiang Hao, Li Chuanzhen, An Mingxiao
- Can Large Language Models Generate High-quality Patent Claims? Jiang Lekang, Zhang Caiqi, Scherz Pascal A, Goetz Stephan
- Detecting Scams Using Large Language Models Jiang Liming
- Utilizing Large Language Models To Detect Privacy Leaks In Mini-app Code Jiang Liming
- Wildteaming At Scale: From In-the-wild Jailbreaks To (adversarially) Safer Language Models Jiang Liwei, Rao Kavel, Han Seungju, Ettinger Allyson, Brahman Faeze, Kumar Sachin, Mireshghallah Niloofar, Lu Ximing, Sap Maarten, Choi Yejin, Dziri Nouha
- Enhancing Robustness In Large Language Models: Prompting For Mitigating The Impact Of Irrelevant Information Jiang Ming, Huang Tingting, Guo Biao, Lu Yao, Zhang Feng
- Turning Generative Models Degenerate: The Power Of Data Poisoning Attacks Jiang Shuli, Kadhe Swanand Ravindra, Zhou Yi, Ahmed Farhan, Cai Ling, Baracaldo Nathalie
- Learning To Edit: Aligning Llms With Knowledge Editing Jiang Yuxin, Wang Yufei, Wu Chuhan, Zhong Wanjun, Zeng Xingshan, Gao Jiahui, Li Liangyou, Jiang Xin, Shang Lifeng, Tang Ruiming, Liu Qun, Wang Wei
- Retrieve, Summarize, Plan: Advancing Multi-hop Question Answering With An Iterative Approach Jiang Zhouyu, Sun Mengshu, Liang Lei, Zhang Zhiqiang
- Crimson: Empowering Strategic Reasoning In Cybersecurity Through Large Language Models Jin Jiandong, Tang Bowen, Ma Mingxuan, Liu Xiao, Wang Yunfei, Lai Qingnan, Yang Jia, Zhou Changling
- LM²: A Simple Society Of Language Models Solves Complex Reasoning Juneja Gurusha, Dutta Subhabrata, Chakraborty Tanmoy
- LLM-ARC: Enhancing Llms With An Automated Reasoning Critic Kalyanpur Aditya, Saravanakumar Kailash Karthik, Barres Victor, Chu-carroll Jennifer, Melville David, Ferrucci David
- Can Large Language Models Be Good Emotional Supporter? Mitigating Preference Bias On Emotional Support Conversation Kang Dongjin, Kim Sunghwan, Kwon Taeyoon, Moon Seungjun, Cho Hyunsouk, Yu Youngjae, Lee Dongha, Yeo Jinyoung
- Can Reinforcement Learning Unlock The Hidden Dangers In Aligned Large Language Models? Karkevandi Mohammad Bahrami, Vishwamitra Nishant, Najafirad Peyman
- Alpaca Against Vicuna: Using Llms To Uncover Memorization Of Llms Kassem Aly M., Mahmoud Omar, Mireshghallah Niloofar, Kim Hyunwoo, Tsvetkov Yulia, Choi Yejin, Saad Sherif, Rana Santu
- Finding A Needle In The Adversarial Haystack: A Targeted Paraphrasing Approach For Uncovering Edge Cases With Minimal Distribution Distortion Kassem Aly M., Saad Sherif
- Defending Large Language Models Against Attacks With Residual Stream Activation Analysis Kawasaki Amelia, Davis Andrew, Abbas Houssam
- Grounding And Evaluation For Large Language Models: Practical Challenges And Lessons Learned (survey) Kenthapadi Krishnaram, Sameki Mehrnoosh, Taly Ankur
- Harmonic Llms Are Trustworthy Kersting Nicholas S., Rahman Mohammad, Vedala Suchismitha, Wang Yang
- Consistency And Uncertainty: Identifying Unreliable Responses From Black-box Vision-language Models For Selective Visual Question Answering Khan Zaid, Fu Yun
- ARGS: Alignment As Reward-guided Search Khanov Maxim, Burapacheep Jirayu, Li Yixuan
- How Good Is My Video LMM? Complex Video Reasoning And Robustness Evaluation Suite For Video-lmms Khattak Muhammad Uzair, Naeem Muhammad Ferjad, Hassan Jameel, Naseer Muzammal, Tombari Federico, Khan Fahad Shahbaz, Khan Salman
- Small Language Models Learn Enhanced Reasoning Skills From Medical Textbooks Kim Hyunjae, Hwang Hyeon, Lee Jiwoo, Park Sihyeon, Kim Dain, Lee Taewhoo, Yoon Chanwoong, Sohn Jiwoong, Choi Donghee, Kang Jaewoo
- Dialsim: A Real-time Simulator For Evaluating Long-term Dialogue Understanding Of Conversational Agents Kim Jiho, Chay Woosog, Hwang Hyeonji, Kyung Daeun, Chung Hyunseung, Cho Eunbyeol, Jo Yohan, Choi Edward
- Persona Is A Double-edged Sword: Enhancing The Zero-shot Reasoning By Ensembling The Role-playing And Neutral Prompts Kim Junseok, Yang Nakyeong, Jung Kyomin
- Automatic Jailbreaking Of The Text-to-image Generative AI Systems Kim Minseon, Lee Hyomin, Gong Boqing, Zhang Huishuai, Hwang Sung Ju
- Adaptive Contrastive Decoding In Retrieval-augmented Generation For Handling Noisy Contexts Kim Youna, Kim Hyuhng Joon, Park Cheonbok, Park Choonghyun, Cho Hyunsoo, Kim Junyeob, Yoo Kang Min, Lee Sang-goo, Kim Taeuk
- What's Wrong? Refining Meeting Summaries With LLM Feedback Kirstein Frederic, Ruas Terry, Gipp Bela
- Chatspamdetector: Leveraging Large Language Models For Effective Phishing Email Detection Koide Takashi, Fukushi Naoki, Nakano Hiroki, Chiba Daiki
- Exploring Straightforward Conversational Red-teaming Kour George, Zwerdling Naama, Zalmanovici Marcel, Anaby-tavor Ateret, Fandina Ora Nova, Farchi Eitan
- Fine-tuning, Quantization, And Llms: Navigating Unintended Outcomes Kumar Divyanshu, Kumar Anurakt, Agarwal Sahil, Harshangi Prashanth
- The Remarkable Robustness Of Llms: Stages Of Inference? Lad Vedang, Gurnee Wes, Tegmark Max
- INDICT: Code Generation With Internal Dialogues Of Critiques For Both Security And Helpfulness Le Hung, Zhou Yingbo, Xiong Caiming, Savarese Silvio, Sahoo Doyen
- Using Hallucinations To Bypass Gpt4's Filter Lemkin Benjamin
- Using Counterfactual Tasks To Evaluate The Generality Of Analogical Reasoning In Large Language Models Lewis Martha, Mitchell Melanie
- Backdoor Removal For Generative Large Language Models Li Haoran, Chen Yulin, Zheng Zihao, Hu Qi, Chan Chunkit, Liu Heshan, Song Yangqiu
- Enhance Lifelong Model Editing With Continuous Data-adapter Association Li Jiaang, Wang Quan, Wang Zhongnan, Zhang Yongdong, Mao Zhendong
- A Cross-language Investigation Into Jailbreak Attacks In Large Language Models Li Jie, Liu Yi, Liu Chongyang, Shi Ling, Ren Xiaoning, Zheng Yaowen, Liu Yang, Xue Yinxing
- Fmm-attack: A Flow-based Multi-modal Adversarial Attack On Video-based Llms Li Jinmin, Gao Kuofeng, Bai Yang, Zhang Jingyun, Xia Shu-tao, Wang Yisen
- Dialogue Action Tokens: Steering Language Models In Goal-directed Dialogue With A Multi-turn Planner Li Kenneth, Wang Yiming, Viégas Fernanda, Wattenberg Martin
- LLM Defenses Are Not Robust To Multi-turn Human Jailbreaks Yet Li Nathaniel, Han Ziwen, Steneker Ian, Primack Willow, Goodside Riley, Zhang Hugh, Wang Zifan, Menghini Cristina, Yue Summer
- Gsm-plus: A Comprehensive Benchmark For Evaluating The Robustness Of Llms As Mathematical Problem Solvers Li Qintong, Cui Leyang, Zhao Xueliang, Kong Lingpeng, Bi Wei
- Anchored Answers: Unravelling Positional Bias In Gpt-2's Multiple-choice Questions Li Ruizhe, Gao Yanjun
- Model-enhanced Llm-driven VUI Testing Of VPA Apps Li Suwan, Bu Lei, Bai Guangdong, Xie Fuman, Chen Kai, Yue Chang
- Reinforcement Learning With Token-level Feedback For Controllable Text Generation Li Wendi, Wei Wei, Xu Kaihe, Xie Wenfeng, Chen Dangyang, Cheng Yu
- Ganprompt: Enhancing Robustness In Llm-based Recommendations With Gan-enhanced Diversity Prompts Li Xinyu, Zhao Chuang, Zhao Hongke, Wu Likang, He Ming
- Badedit: Backdooring Large Language Models By Model Editing Li Yanzhou, Li Tianlin, Chen Kangjie, Zhang Jian, Liu Shangqing, Wang Wenhan, Zhang Tianwei, Liu Yang
- Backdoorllm: A Comprehensive Benchmark For Backdoor Attacks On Large Language Models Li Yige, Huang Hanxun, Zhao Yunhan, Ma Xingjun, Sun Jun
- Implicit In-context Learning Li Zhuowei, Xu Zihao, Han Ligong, Gao Yunhe, Wen Song, Liu Di, Wang Hao, Metaxas Dimitris N.
- Vl-trojan: Multimodal Instruction Backdoor Attacks Against Autoregressive Visual Language Models Liang Jiawei, Liang Siyuan, Luo Man, Liu Aishan, Han Dongchen, Chang Ee-chien, Cao Xiaochun
- Why Are My Prompts Leaked? Unraveling Prompt Extraction Threats In Customized Large Language Models Liang Zi, Hu Haibo, Ye Qingqing, Xiao Yaxin, Li Haoyang
- IKUN For WMT24 General MT Task: Llms Are Here For Multilingual Machine Translation Liao Baohao, Herold Christian, Khadivi Shahram, Monz Christof
- The Power Of Words: Generating Powershell Attacks From Natural Language Liguori Pietro, Marescalco Christian, Natella Roberto, Orbinato Vittorio, Pianese Luciano
- Emojicrypt: Prompt Encryption For Secure Communication With Large Language Models Lin Guo, Hua Wenyue, Zhang Yongfeng
- Syncpkl: Harnessing Llms To Generate Synthetic Data For Commonsense Persona Knowledge Linking Lin Kuan-yen
- Single Character Perturbations Break LLM Alignment Lin Leon, Brown Hannah, Kawaguchi Kenji, Shieh Michael
- Optimizing Language Models For Human Preferences Is A Causal Inference Problem Lin Victoria, Ben-michael Eli, Morency Louis-philippe
- Unveiling Vulnerability Of Self-attention Liong Khai Jiet, Wu Hongqiu, Zhao Hai
- Exploring Llms As A Source Of Targeted Synthetic Textual Data To Minimize High Confidence Misclassifications Lippmann Philip, Spaan Matthijs T. J., Yang Jie
- Compromising Embodied Agents With Contextual Backdoor Attacks Liu Aishan, Zhou Yuguang, Liu Xianglong, Zhang Tianyuan, Liang Siyuan, Wang Jiakai, Pu Yanjun, Li Tianlin, Zhang Junqi, Zhou Wenbo, Guo Qing, Tao Dacheng
- Llmembed: Rethinking Lightweight Llm's Genuine Function In Text Classification Liu Chun, Zhang Hongguang, Zhao Kainan, Ju Xinghai, Yang Lin
- Exploring Vulnerabilities And Protections In Large Language Models: A Survey Liu Frank Weizhen, Hu Chenhui
- Advancing Adversarial Suffix Transfer Learning On Aligned Large Language Models Liu Hongfu, Xie Yuxi, Wang Ye, Shieh Michael
- Tiny Refinements Elicit Resilience: Toward Efficient Prefix-model Against LLM Red-teaming Liu Jiaxu, Yin Xiangyu, Wu Sihao, Wang Jianhong, Fang Meng, Yi Xinping, Huang Xiaowei
- Selectit: Selective Instruction Tuning For Large Language Models Via Uncertainty-aware Self-reflection Liu Liangxin, Liu Xuebo, Wong Derek F., Li Dongfang, Wang Ziyi, Hu Baotian, Zhang Min
- The Generation Gap: Exploring Age Bias In The Underlying Value Systems Of Large Language Models Liu Siyang, Maturi Trish, Yi Bowen, Shen Siqi, Mihalcea Rada
- Making Them Ask And Answer: Jailbreaking Large Language Models In Few Queries Via Disguise And Reconstruction Liu Tong, Zhang Yingjie, Zhao Zhe, Dong Yinpeng, Meng Guozhu, Chen Kai
- Imposter.ai: Adversarial Attacks With Hidden Intentions Towards Aligned Large Language Models Liu Xiao, Li Liangzhi, Xiang Tong, Ye Fuying, Wei Lu, Li Wangyue, Garcia Noa
- SHIELD: Evaluation And Defense Strategies For Copyright Compliance In LLM Text Generation Liu Xiaoze, Sun Ting, Xu Tianyang, Wu Feijie, Wang Cunxiang, Wang Xiaoqian, Gao Jing
- Arondight: Red Teaming Large Vision Language Models With Auto-generated Multi-modal Jailbreak Prompts Liu Yi, Cai Chengjun, Zhang Xiaoli, Yuan Xingliang, Wang Cong
- Protecting Your Llms With Information Bottleneck Liu Zichuan, Wang Zefan, Xu Linjie, Wang Jinyu, Song Lei, Wang Tianchun, Chen Chunlin, Cheng Wei, Bian Jiang
- YODA: Teacher-student Progressive Learning For Language Models Lu Jianqiao, Zhong Wanjun, Wang Yufei, Guo Zhijiang, Zhu Qi, Huang Wenyong, Wang Yanlin, Mi Fei, Wang Baojun, Wang Yasheng, Shang Lifeng, Jiang Xin, Liu Qun
- Prompt Tuning As User Inherent Profile Inference Machine Lu Yusheng, Du Zhaocheng, Li Xiangyang, Zhao Xiangyu, Liu Weiwen, Wang Yichao, Guo Huifeng, Tang Ruiming, Dong Zhenhua, Duan Yongrui
- DEEM: Diffusion Models Serve As The Eyes Of Large Language Models For Image Perception Luo Run, Li Yunshui, Chen Longze, He Wanwei, Lin Ting-en, Liu Ziqiang, Zhang Lei, Song Zikai, Xia Xiaobo, Liu Tongliang, Yang Min, Hui Binyuan
- Knowla: Enhancing Parameter-efficient Finetuning With Knowledgeable Adaptation Luo Xindi, Sun Zequn, Zhao Jing, Zhao Zhe, Hu Wei
- Ensuring Safe And High-quality Outputs: A Guideline Library Approach For Language Models Luo Yi, Lin Zhenghao, Zhang Yuhao, Sun Jiashuo, Lin Chen, Xu Chengjin, Su Xiangdong, Shen Yelong, Guo Jian, Gong Yeyun
- Eight Methods To Evaluate Robust Unlearning In Llms Lynch Aengus, Guo Phillip, Ewart Aidan, Casper Stephen, Hadfield-menell Dylan
- X-reflect: Cross-reflection Prompting For Multimodal Recommendation Lyu Hanjia, Rossi Ryan, Chen Xiang, Tanjim Md Mehrab, Petrangeli Stefano, Sarkhel Somdeb, Luo Jiebo
- Is It Possible To Edit Large Language Models Robustly? Ma Xinbei, Ju Tianjie, Qiu Jiyang, Zhang Zhuosheng, Zhao Hai, Liu Lifeng, Wang Yulong
- VURF: A General-purpose Reasoning And Self-refinement Framework For Video Understanding Mahmood Ahmad, Vayani Ashmal, Naseer Muzammal, Khan Salman, Khan Fahad Shahbaz
- IITK At Semeval-2024 Task 2: Exploring The Capabilities Of Llms For Safe Biomedical Natural Language Inference For Clinical Trials Mandal Shreyasi, Modi Ashutosh
- Coalitions Of Large Language Models Increase The Robustness Of AI Agents Mangal Prattyush, Mak Carol, Kanakis Theo, Donovan Timothy, Braines Dave, Pyzer-knapp Edward
- Chatretriever: Adapting Large Language Models For Generalized And Robust Conversational Dense Retrieval Mao Kelong, Deng Chenlong, Chen Haonan, Mo Fengran, Liu Zheng, Sakai Tetsuya, Dou Zhicheng
- Don't Forget Your Reward Values: Language Model Alignment Via Value-based Calibration Mao Xin, Li Feng-lin, Xu Huimin, Zhang Wei, Luu Anh Tuan
- Inadequacies Of Large Language Model Benchmarks In The Era Of Generative Artificial Intelligence Mcintosh Timothy R., Susnjak Teo, Liu Tong, Watters Paul, Halgamuge Malka N.
- Inherent Challenges Of Post-hoc Membership Inference For Large Language Models Meeus Matthieu, Jain Shubham, Rei Marek, De Montjoye Yves-alexandre
- Fight Back Against Jailbreaking Via Prompt Adversarial Tuning Mo Yichuan, Wang Yuji, Wei Zeming, Wang Yisen
- Can We Trust Large Language Models Generated Code? A Framework For In-context Learning, Security Patterns, And Code Evaluations Across Diverse Llms Mohsin Ahmad, Janicke Helge, Wood Adrian, Sarker Iqbal H., Maglaras Leandros, Janjua Naeem
- Are Llms Robust For Spoken Dialogues? Mousavi Seyed Mahed, Roccabruna Gabriel, Alghisi Simone, Rizzoli Massimo, Ravanelli Mirco, Riccardi Giuseppe
- Unraveling The Truth: Do Llms Really Understand Charts? A Deep Dive Into Consistency And Robustness Mukhopadhyay Srija, Qidwai Adnan, Garimella Aparna, Ramu Pritika, Gupta Vivek, Roth Dan
- Learning Vs Retrieval: The Role Of In-context Examples In Regression With Llms Nafar Aliakbar, Venable Kristen Brent, Kordjamshidi Parisa
- Aurora-m: The First Open Source Multilingual Language Model Red-teamed According To The U.S. Executive Order Nakamura Taishi, Mishra Mayank, Tedeschi Simone, Chai Yekun, Stillerman Jason T, Friedrich Felix, Yadav Prateek, Laud Tanmay, Chien Vu Minh, Zhuo Terry Yue, Misra Diganta, Bogin Ben, Vu Xuan-son, Karpinska Marzena, Dantuluri Arnav Varma, Kusa Wojciech, Furlanello Tommaso, Yokota Rio, Muennighoff Niklas, Pai Suhas, Adewumi Tosin, Laippala Veronika, Yao Xiaozhe, Junior Adalberto, Ariyak Alpay, Drozd Aleksandr, Clive Jordan, Gupta Kshitij, Chen Liangyu, Sun Qi, Tsui Ken, Persaud Noah, Fahmy Nour, Chen Tianlong, Bansal Mohit, Monti Nicolo, Dang Tai, Luo Ziyang, Bui Tien-tung, Navigli Roberto, Mehta Virendra, Blumberg Matthew, May Victor, Nguyen Huu, Pyysalo Sampo
- Beyond The Hype: A Dispassionate Look At Vision-language Models In Medical Scenario Nan Yang, Zhou Huichi, Xing Xiaodan, Yang Guang
- Revolutionizing Undergraduate Learning: Coursegpt And Its Generative AI Advancements Nazar Ahmad M., Selim Mohamed Y., Gaffar Ashraf, Ahmed Shakil
- Improving Robustness Of Llm-based Speech Synthesis By Learning Monotonic Alignment Neekhara Paarth, Hussain Shehzeen, Ghosh Subhankar, Li Jason, Valle Rafael, Badlani Rohan, Ginsburg Boris
- CURATRON: Complete Robust Preference Data For Robust Alignment Of Large Language Models Nguyen Son The, Naresh Niranjan Uma, Tulabandhula Theja
- An Empirical Study On Capability Of Large Language Models In Understanding Code Semantics Nguyen Thu-trang, Vu Thanh Trong, Vo Hieu Dinh, Nguyen Son
- Generative AI In EU Law: Liability, Privacy, Intellectual Property, And Cybersecurity Novelli Claudio, Casolari Federico, Hacker Philipp, Spedicato Giorgio, Floridi Luciano
- Improving Black-box Robustness With In-context Rewriting O'brien Kyle, Ng Nathan, Puri Isha, Mendez Jorge, Palangi Hamid, Kim Yoon, Ghassemi Marzyeh, Hartvigsen Thomas
- Robust Adaptation Of Foundation Models With Black-box Visual Prompting Oh Changdae, Seo Gyeongdeok, Jung Geunyoung, Cheng Zhi-qi, Choi Hosik, Jung Jiyoung, Song Kyungwoo
- Outliers And Calibration Sets Have Diminishing Effect On Quantization Of Modern Llms Paglieri Davide, Dash Saurabh, Rocktäschel Tim, Parker-holder Jack
- AILS-NTUA At Semeval-2024 Task 9: Cracking Brain Teasers: Transformer Models For Lateral Thinking Puzzles Panagiotopoulos Ioannis, Filandrianos Giorgos, Lymperaiou Maria, Stamou Giorgos
- Can Watermarking Large Language Models Prevent Copyrighted Text Generation And Hide Training Data? Panaitescu-liess Michael-andrei, Che Zora, An Bang, Xu Yuancheng, Pathmanathan Pankayaraj, Chakraborty Souradip, Zhu Sicheng, Goldstein Tom, Huang Furong
- Learning To See But Forgetting To Follow: Visual Instruction Tuning Makes Llms More Prone To Jailbreak Attacks Pantazopoulos Georgios, Parekh Amit, Nikandrou Malvina, Suglia Alessandro
- Offsetbias: Leveraging Debiased Data For Tuning Evaluators Park Junsoo, Jwa Seungyeon, Ren Meiying, Kim Daeyoung, Choi Sanghyuk
- Large Language Models Can Self-improve At Web Agent Tasks Patel Ajay, Hofmarcher Markus, Leoveanu-condrei Claudiu, Dinu Marius-constantin, Callison-burch Chris, Hochreiter Sepp
- Making Reasoning Matter: Measuring And Improving Faithfulness Of Chain-of-thought Reasoning Paul Debjit, West Robert, Bosselut Antoine, Faltings Boi
- Advprompter: Fast Adaptive Adversarial Prompting For Llms Paulus Anselm, Zharmagambetov Arman, Guo Chuan, Amos Brandon, Tian Yuandong
- Securing Large Language Models: Addressing Bias, Misinformation, And Prompt Attacks Peng Benji, Chen Keyu, Li Ming, Feng Pohsun, Bi Ziqian, Liu Junyu, Niu Qian
- Answer Is All You Need: Instruction-following Text Embedding Via Answering The Question Peng Letian, Zhang Yuwei, Wang Zilong, Srinivasa Jayanth, Liu Gaowen, Wang Zihan, Shang Jingbo
- Did Translation Models Get More Robust Without Anyone Even Noticing? Peters Ben, Martins André F. T.
- Mllm-protector: Ensuring Mllm's Safety Without Hurting Performance Pi Renjie, Han Tianyang, Zhang Jianshu, Xie Yueqi, Pan Rui, Lian Qing, Dong Hanze, Zhang Jipeng, Zhang Tong
- How Effective Are State Space Models For Machine Translation? Pitorro Hugo, Vasylenko Pavlo, Treviso Marcos, Martins André F. T.
- Exploring The Efficacy Of Large Language Models (GPT-4) In Binary Reverse Engineering Pordanesh Saman, Tan Benjamin
- Retrieval Augmented Thought Process For Private Data Handling In Healthcare Pouplin Thomas, Sun Hao, Holt Samuel, Van Der Schaar Mihaela
- Fractured-sorry-bench: Framework For Revealing Attacks In Conversational Turns Undermining Refusal Efficacy And Defenses Over Sorry-bench Priyanshu Aman, Vijay Supriti
- Safety Alignment Should Be Made More Than Just A Few Tokens Deep Qi Xiangyu, Panda Ashwinee, Lyu Kaifeng, Ma Xiao, Roy Subhrajit, Beirami Ahmad, Mittal Prateek, Henderson Peter
- Follow My Instruction And Spill The Beans: Scalable Data Extraction From Retrieval-augmented Generation Systems Qi Zhenting, Zhang Hanlin, Xing Eric, Kakade Sham, Lakkaraju Himabindu
- Towards Tracing Trustworthiness Dynamics: Revisiting Pre-training Period Of Large Language Models Qian Chen, Zhang Jie, Yao Wei, Liu Dongrui, Yin Zhenfei, Qiao Yu, Liu Yong, Shao Jing
- How Easy Is It To Fool Your Multimodal Llms? An Empirical Analysis On Deceptive Prompts Qian Yusu, Zhang Haotian, Yang Yinfei, Gan Zhe
- Prompt Perturbation Consistency Learning For Robust Language Models Qiang Yao, Nandi Subhrangshu, Mehrabi Ninareh, Steeg Greg Ver, Kumar Anoop, Rumshisky Anna, Galstyan Aram
- Learning To Poison Large Language Models During Instruction Tuning Qiang Yao, Zhou Xiangyu, Zade Saleh Zare, Roshani Mohammad Amin, Zytko Douglas, Zhu Dongxiao
- The Uniqueness Of Llama3-70b With Per-channel Quantization: An Empirical Study Qin Minghai
- Megen: Generative Backdoor In Large Language Models Via Model Editing Qiu Jiyang, Ma Xinbei, Zhang Zhuosheng, Zhao Hai
- Before Generation, Align It! A Novel And Effective Strategy For Mitigating Hallucinations In Text-to-sql Generation Qu Ge, Li Jinyang, Li Bowen, Qin Bowen, Huo Nan, Ma Chenhao, Cheng Reynold
- Memory-augmented Generative Adversarial Transformers Raaijmakers Stephan, Bakker Roos, Cremers Anita, De Kleijn Roy, Kouwenhoven Tom, Verhoef Tessa
- FACTOID: Factual Entailment For Hallucination Detection Rawte Vipula, Tonmoy S. M Towhidul Islam, Rajbangshi Krishnav, Nag Shravani, Chadha Aman, Sheth Amit P., Das Amitava
- MBIAS: Mitigating Bias In Large Language Models While Retaining Context Raza Shaina, Raval Ananya, Chatrath Veronica
- Codeattack: Revealing Safety Generalization Challenges Of Large Language Models Via Code Completion Ren Qibing, Gao Chang, Shao Jing, Yan Junchi, Tan Xin, Lam Wai, Ma Lizhuang
- On The Efficacy Of Eviction Policy For Key-value Constrained Generative Language Model Inference Ren Siyu, Zhu Kenny Q.
- Quantifying Contamination In Evaluating Code Generation Capabilities Of Language Models Riddell Martin, Ni Ansong, Cohan Arman
- An Early Categorization Of Prompt Injection Attacks On Large Language Models Rossi Sippo, Michel Alisia Marianne, Mukkamala Raghava Rao, Thatcher Jason Bennett
- Mitigating Hallucination In Fictional Character Role-play Sadeq Nafis, Xie Zhouhang, Kang Byungkyu, Lamba Prarit, Gao Xiang, Mcauley Julian
- Rainbow Teaming: Open-ended Generation Of Diverse Adversarial Prompts Samvelyan Mikayel, Raparthy Sharath Chandra, Lupu Andrei, Hambro Eric, Markosyan Aram H., Bhatt Manish, Mao Yuning, Jiang Minqi, Parker-holder Jack, Foerster Jakob, Rocktäschel Tim, Raileanu Roberta
- Retrieval-augmented Generation Meets Data-driven Tabula Rasa Approach For Temporal Knowledge Graph Forecasting Sannidhi Geethan, Sakhinana Sagar Srinivas, Runkana Venkataramana
- Towards Faithful And Robust LLM Specialists For Evidence-based Question-answering Schimanski Tobias, Ni Jingwei, Kraus Mathias, Ash Elliott, Leippold Markus
- Ai-augmented Predictions: LLM Assistants Improve Human Forecasting Accuracy Schoenegger Philipp, Park Peter S., Karger Ezra, Trott Sean, Tetlock Philip E.
- Context Matters: An Empirical Study Of The Impact Of Contextual Information In Temporal Question Answering Systems Schumacher Dan, Haji Fatemeh, Grey Tara, Bandlamudi Niharika, Karnik Nupoor, Kumar Gagana Uday, Chiang Jason Cho-yu, Rad Paul, Vishwamitra Nishant, Rios Anthony
- Rethinking LLM Memorization Through The Lens Of Adversarial Compression Schwarzschild Avi, Feng Zhili, Maini Pratyush, Lipton Zachary C., Kolter J. Zico
- Soft Prompt Threats: Attacking Safety Alignment And Unlearning In Open-source Llms Through The Embedding Space Schwinn Leo, Dobre David, Xhonneux Sophie, Gidel Gauthier, Gunnemann Stephan
- Ai-native Memory: A Pathway From Llms Towards AGI Shang Jingbo, Zheng Zai, Wei Jiale, Ying Xiang, Tao Felix, Team Mindverse
- SPML: A DSL For Defending Language Models Against Prompt Attacks Sharma Reshabh K, Gupta Vinayak, Grossman Dan
- A Preliminary Study On Using Large Language Models In Software Pentesting Shashwat Kumar, Hahn Francis, Ou Xinming, Goldgof Dmitry, Hall Lawrence, Ligatti Jay, Rajgopalan S. Raj, Tabari Armin Ziaie
- Rapid Optimization For Jailbreaking Llms Via Subconscious Exploitation And Echopraxia Shen Guangyu, Cheng Siyuan, Zhang Kaiyuan, Tao Guanhong, An Shengwei, Yan Lu, Zhang Zhuo, Ma Shiqing, Zhang Xiangyu
- Improving Reinforcement Learning From Human Feedback Using Contrastive Rewards Shen Wei, Zhang Xiaoying, Yao Yuanshun, Zheng Rui, Guo Hongyi, Liu Yang
- Latent Adversarial Training Improves Robustness To Persistent Harmful Behaviors In Llms Sheshadri Abhay, Ewart Aidan, Guo Phillip, Lynch Aengus, Wu Cindy, Hebbar Vivek, Sleight Henry, Stickland Asa Cooper, Perez Ethan, Hadfield-menell Dylan, Casper Stephen
- A Thorough Examination Of Decoding Methods In The Era Of Llms Shi Chufan, Yang Haoran, Cai Deng, Zhang Zhisong, Wang Yifan, Yang Yujiu, Lam Wai
- Abusegpt: Abuse Of Generative AI Chatbots To Create Smishing Campaigns Shibli Ashfak Md, Pritom Mir Mehedi A., Gupta Maanak
- Robustness Of Llms To Perturbations In Text Singh Ayush, Singh Navpreet, Vatsal Shubham
- Exposing The Achilles' Heel: Evaluating Llms Ability To Handle Mistakes In Mathematical Reasoning Singh Joykirat, Nambi Akshay, Vineet Vibhav
- Was It Slander? Towards Exact Inversion Of Generative Language Models Skapars Adrians, Manino Edoardo, Sun Youcheng, Cordeiro Lucas C.
- Sharelora: Parameter Efficient And Robust Large Language Model Fine-tuning Via Shared Low-rank Adaptation Song Yurun, Zhao Junchen, Harris Ian G., Jyothi Sangeetha Abdu
- A Comprehensive Study Of The Capabilities Of Large Language Models For Vulnerability Detection Steenhoek Benjamin, Rahman Md Mahbubur, Roy Monoshi Kumar, Alam Mirza Sanjida, Barr Earl T., Le Wei
- Exploring The Adversarial Capabilities Of Large Language Models Struppek Lukas, Le Minh Hieu, Hintersdorf Dominik, Kersting Kristian
- Large Language Models Are Inconsistent And Biased Evaluators Stureborg Rickard, Alikaniotis Dimitris, Suhara Yoshi
- Mission Impossible: A Statistical Perspective On Jailbreaking Llms Su Jingtong, Kempe Julia, Ullrich Karen
- Whispering Experts: Neural Interventions For Toxicity Mitigation In Language Models Suau Xavier, Delobelle Pieter, Metcalf Katherine, Joulin Armand, Apostoloff Nicholas, Zappella Luca, Rodríguez Pau
- Triforce: Lossless Acceleration Of Long Sequence Generation With Hierarchical Speculative Decoding Sun Hanshi, Chen Zhuoming, Yang Xinyu, Tian Yuandong, Chen Beidi
- Beyond Text: Utilizing Vocal Cues To Improve Decision Making In Llms For Robot Navigation Tasks Sun Xingpeng, Meng Haoming, Chakraborty Souradip, Bedi Amrit Singh, Bera Aniket
- Multi-turn Context Jailbreak Attack On Large Language Models From First Principles Sun Xiongtao, Zhang Deyue, Yang Dongdong, Zou Quanchen, Li Hui
- How The Advent Of Ubiquitous Large Language Models Both Stymie And Turbocharge Dynamic Adversarial Question Generation Sung Yoo Yeon, Mondal Ishani, Boyd-graber Jordan
- From Explanations To Action: A Zero-shot, Theory-driven LLM Framework For Student Performance Feedback Swamy Vinitra, Romano Davide, Desikan Bhargav Srinivasa, Camburu Oana-maria, Käser Tanja
- PIXAR: Auto-regressive Language Modeling In Pixel Space Tai Yintao, Liao Xiyang, Suglia Alessandro, Vergari Antonio
- All In How You Ask For It: Simple Black-box Method For Jailbreak Attacks Takemoto Kazuhiro
- Improving Large Language Model (LLM) Fidelity Through Context-aware Grounding: A Systematic Approach To Reliability And Veracity Talukdar Wrick, Biswas Anjanava
- Democratizing Large Language Models Via Personalized Parameter-efficient Fine-tuning Tan Zhaoxuan, Zeng Qingkai, Tian Yijun, Liu Zheyuan, Yin Bing, Jiang Meng
- Robustness Assessment Of Mathematical Reasoning In The Presence Of Missing And Contradictory Conditions Tian Shi-yu, Zhou Zhi, Jia Lin-han, Guo Lan-zhe, Li Yu-feng
- The Instinctive Bias: Spurious Images Lead To Hallucination In Mllms Han Tianyang, Lian Qing, Pan Rui, Pi Renjie, Zhang Jipeng, Diao Shizhe, Lin Yong, Zhang Tong
- Cpsdbench: A Large Language Model Evaluation Benchmark And Baseline For Chinese Public Security Domain Tong Xin, Jin Bo, Lin Zhi, Wang Binjun, Yu Ting, Cheng Qiang
- Prompting Techniques For Secure Code Generation: A Systematic Investigation Tony Catherine, Ferreyra Nicolás E. Díaz, Mutas Markus, Dhiff Salem, Scandariato Riccardo
- Enhance Modality Robustness In Text-centric Multimodal Alignment With Adversarial Prompting Tsai Yun-da, Yen Ting-yu, Liao Keng-te, Lin Shou-de
- Predictions From Language Models For Multiple-choice Tasks Are Not Robust Under Variation Of Scoring Methods Tsvilodub Polina, Wang Hening, Grosch Sharon, Franke Michael
- Advancing Large Multi-modal Models With Explicit Chain-of-reasoning And Visual Question Generation Uehara Kohei, Goswami Nabarun, Wang Hanqin, Baba Toshiaki, Tanaka Kohtaro, Hashimoto Tomohiro, Wang Kai, Ito Rei, Takagi Naoya, Umagami Ryo, Wen Yingyi, Anakewat Tanachai, Harada Tatsuya
- Sandwich Attack: Multi-language Mixture Adaptive Attack On Llms Upadhayay Bibek, Behzadan Vahid
- AI Sandbagging: Language Models Can Strategically Underperform On Evaluations Van Der Weij Teun, Hofstätter Felix, Jaffe Ollie, Brown Samuel F., Ward Francis Rhys
- The Instruction Hierarchy: Training Llms To Prioritize Privileged Instructions Wallace Eric, Xiao Kai, Leike Reimar, Weng Lilian, Heidecke Johannes, Beutel Alex
- Tokenization Matters! Degrading Large Language Models Through Challenging Their Tokenization Wang Dixuan, Li Yanda, Jiang Junyuan, Ding Zepeng, Jiang Guochao, Liang Jiaqing, Yang Deqing
- Probing The Safety Response Boundary Of Large Language Models Via Unsafe Decoding Path Generation Wang Haoyu, Wu Bingzhe, Bian Yatao, Chang Yongzhe, Wang Xueqian, Zhao Peilin
- Helpful Or Harmful Data? Fine-tuning-free Shapley Attribution For Explaining Language Model Predictions Wang Jingtan, Lin Xiaoqiang, Qiao Rui, Foo Chuan-sheng, Low Bryan Kian Hsiang
- Mambabyte: Token-free Selective State Space Model Wang Junxiong, Gangavarapu Tushaar, Yan Jing Nathan, Rush Alexander M.
- Inferaligner: Inference-time Alignment For Harmlessness Through Cross-model Guidance Wang Pengyu, Zhang Dong, Li Linyang, Tan Chenkun, Wang Xinghao, Ren Ke, Jiang Botian, Qiu Xipeng
- Unique Security And Privacy Threats Of Large Language Model: A Comprehensive Survey Wang Shang, Zhu Tianqing, Liu Bo, Ding Ming, Guo Xu, Ye Dayong, Zhou Wanlei, Yu Philip S.
- From Llms To Mllms: Exploring The Landscape Of Multimodal Jailbreaking Wang Siyuan, Long Zhuohan, Fan Zhihao, Wei Zhongyu
- Symbolic Working Memory Enhances Language Models For Complex Rule Application Wang Siyuan, Wei Zhongyu, Choi Yejin, Ren Xiang
- Less Is More For Improving Automatic Evaluation Of Factual Consistency Wang Tong, Kulkarni Ninad, Qi Yanjun
- Using Advanced Llms To Enhance Smaller Llms: An Interpretable Knowledge Distillation Approach Wang Tong, Sudhir K., Hong Dat
- "my Answer Is C": First-token Probabilities Do Not Match Text Answers In Instruction-tuned Language Models Wang Xinpeng, Ma Bolei, Hu Chengzhi, Weber-genzel Leon, Röttger Paul, Kreuter Frauke, Hovy Dirk, Plank Barbara
- Defending Llms Against Jailbreaking Attacks Via Backtranslation Wang Yihan, Shi Zhouxing, Bai Andrew, Hsieh Cho-jui
- Stop Reasoning! When Multimodal Llms With Chain-of-thought Reasoning Meets Adversarial Images Wang Zefeng, Han Zhen, Chen Shuo, Xue Fan, Ding Zifeng, Xiao Xun, Tresp Volker, Torr Philip, Gu Jindong
- Eliminating Position Bias Of Language Models: A Mechanistic Approach Wang Ziqi, Zhang Hanlin, Li Xiner, Huang Kuan-hao, Han Chi, Ji Shuiwang, Kakade Sham M., Peng Hao, Ji Heng
- Poisoned Langchain: Jailbreak Llms By Langchain Wang Ziqiu, Liu Jun, Zhang Shengkai, Yang Yang
- Unveiling Selection Biases: Exploring Order And Token Sensitivity In Large Language Models Wei Sheng-lun, Wu Cheng-kuang, Huang Hen-hsen, Chen Hsin-hsi
- Gendec: A Robust Generative Question-decomposition Method For Multi-hop Reasoning Wu Jian, Yang Linyi, Ji Yuliang, Huang Wenhao, Karlsson Börje F., Okumura Manabu
- Motionllm: Multimodal Motion-language Learning With Large Language Models Wu Qi, Zhao Yubo, Wang Yifan, Tai Yu-wing, Tang Chi-keung
- Adversarial Databases Improve Success In Retrieval-based Large Language Models Wu Sean, Koo Michael, Kao Li Yo, Black Andy, Blum Lesley, Scalzo Fabien, Kurtz Ira
- How Easily Do Irrelevant Inputs Skew The Responses Of Large Language Models? Wu Siye, Xie Jian, Chen Jiangjie, Zhu Tinghui, Zhang Kai, Xiao Yanghua
- Acquiring Clean Language Models From Backdoor Poisoned Datasets By Downscaling Frequency Space Wu Zongru, Zhang Zhuosheng, Cheng Pengzhou, Liu Gongshen
- In-context Learning Can Re-learn Forbidden Tasks Xhonneux Sophie, Dobre David, Tang Jian, Gidel Gauthier, Sridhar Dhanya
- Badchain: Backdoor Chain-of-thought Prompting For Large Language Models Xiang Zhen, Jiang Fengqing, Xiong Zidi, Ramasubramanian Bhaskar, Poovendran Radha, Li Bo
- Clover-2: Accurate Inference For Regressive Lightweight Speculative Decoding Xiao Bin, Gui Lujun, Su Lei, Chen Weipeng
- Videoqa In The Era Of Llms: An Empirical Study Xiao Junbin, Huang Nanxin, Qin Hangyu, Li Dongyang, Li Yicong, Zhu Fengbin, Tao Zhulin, Yu Jianxing, Lin Liang, Chua Tat-seng, Yao Angela
- Recall: Membership Inference Via Relative Conditional Log-likelihoods Xie Roy, Wang Junlin, Huang Ruomin, Zhang Minxing, Ge Rong, Pei Jian, Gong Neil Zhenqiang, Dhingra Bhuwan
- RALL-E: Robust Codec Language Modeling With Chain-of-thought Prompting For Text-to-speech Synthesis Xin Detai, Tan Xu, Shen Kai, Ju Zeqian, Yang Dongchao, Wang Yuancheng, Takamichi Shinnosuke, Saruwatari Hiroshi, Liu Shujie, Li Jinyu, Zhao Sheng
- Understanding The Weakness Of Large Language Model Agents Within A Complex Android Environment Xing Mingzhe, Zhang Rongkai, Xue Hui, Chen Qi, Yang Fan, Xiao Zhen
- Defensive Prompt Patch: A Robust And Interpretable Defense Of Llms Against Jailbreak Attacks Xiong Chen, Qi Xiangyu, Chen Pin-yu, Ho Tsung-yi
- Redagent: Red Teaming Large Language Models With Context-aware Autonomous Language Agent Xu Huiyu, Zhang Wenhui, Wang Zhibo, Xiao Feng, Zheng Rui, Feng Yunhe, Ba Zhongjie, Ren Kui
- Course-correction: Safety Alignment Using Synthetic Preferences Xu Rongwu, Cai Yishuo, Zhou Zhenhong, Gu Renjie, Weng Haiqin, Liu Yan, Zhang Tianwei, Xu Wei, Qiu Han
- Preemptive Answer "attacks" On Chain-of-thought Reasoning Xu Rongwu, Qi Zehan, Xu Wei
- Unsupervised Information Refinement Training Of Large Language Models For Retrieval-augmented Generation Xu Shicheng, Pang Liang, Yu Mo, Meng Fandong, Shen Huawei, Cheng Xueqi, Zhou Jie
- Linkprompt: Natural And Universal Adversarial Attacks On Prompt-based Language Models Xu Yue, Wang Wenjie
- Safedecoding: Defending Against Jailbreak Attacks Via Safety-aware Decoding Xu Zhangchen, Jiang Fengqing, Niu Luyao, Jia Jinyuan, Lin Bill Yuchen, Poovendran Radha
- Bag Of Tricks: Benchmarking Of Jailbreak Attacks On Llms Xu Zhao, Liu Fan, Liu Hao
- Uncovering Safety Risks Of Large Language Models Through Concept Activation Vector Xu Zhihao, Huang Ruixuan, Chen Changyu, Wang Shuai, Wang Xiting
- A Comprehensive Study Of Jailbreak Attack Versus Defense For Large Language Models Xu Zihao, Liu Yi, Deng Gelei, Li Yuekang, Picek Stjepan
- Pythonsaga: Redefining The Benchmark To Evaluate Code Generating Llms Yadav Ankit, Beniwal Himanshu, Singh Mayank
- Corrective Retrieval Augmented Generation Yan Shi-qi, Gu Jia-chen, Zhu Yun, Ling Zhen-hua
- Contrastive Instruction Tuning Yan Tianyi Lorena, Wang Fei, Huang James Y., Zhou Wenxuan, Yin Fan, Galstyan Aram, Yin Wenpeng, Chen Muhao
- Emphasising Structured Information: Integrating Abstract Meaning Representation Into Llms For Enhanced Open-domain Dialogue Evaluation Yang Bohao, Zhao Kun, Tang Chen, Liu Dong, Zhan Liang, Lin Chenghua
- Behavior Alignment: A New Perspective Of Evaluating Llm-based Conversational Recommendation Systems Yang Dayu, Chen Fumian, Fang Hui
- Large Language Model Tokenizer Bias: A Case Study And Solution On Gpt-4o Yang Jin, Wang Zhiqiang, Lin Yanbin, Zhao Zunduo
- Buffer Of Thoughts: Thought-augmented Reasoning With Large Language Models Yang Ling, Yu Zhaochen, Zhang Tianjun, Cao Shiyi, Xu Minkai, Zhang Wentao, Gonzalez Joseph E., Cui Bin
- Guardt2i: Defending Text-to-image Models From Adversarial Prompts Yang Yijun, Gao Ruiyuan, Yang Xiao, Zhong Jianyuan, Xu Qiang
- Assessing Adversarial Robustness Of Large Language Models: An Empirical Study Yang Zeyu, Meng Zhao, Zheng Xiaochen, Wattenhofer Roger
- Adversarial Attacks And Defense For Conversation Entailment Task Yang Zhenning, Krawec Ryan, Wu Liang-yuan
- SOS! Soft Prompt Attack Against Open-source Large Language Models Yang Ziqing, Backes Michael, Zhang Yang, Salem Ahmed
- Machine Unlearning Of Pre-trained Large Language Models Yao Jin, Chien Eli, Du Minxin, Niu Xinyao, Wang Tianhao, Cheng Zezhou, Yue Xiang
- R^2AG: Incorporating Retrieval Information Into Retrieval Augmented Generation Ye Fuda, Li Shuangyin, Zhang Yongqi, Chen Lei
- Toolsword: Unveiling Safety Issues Of Large Language Models In Tool Learning Across Three Stages Ye Junjie, Li Sixian, Li Guanyu, Huang Caishuang, Gao Songyang, Wu Yilong, Zhang Qi, Gui Tao, Huang Xuanjing
- Rotbench: A Multi-level Benchmark For Evaluating The Robustness Of Large Language Models In Tool Learning Ye Junjie, Wu Yilong, Gao Songyang, Huang Caishuang, Li Sixian, Li Guanyu, Fan Xiaoran, Zhang Qi, Gui Tao, Huang Xuanjing
- Dataframe QA: A Universal LLM Framework On Dataframe Question Answering Without Data Exposure Ye Junyi, Du Mengnan, Wang Guiling
- Mm-spubench: Towards Better Understanding Of Spurious Biases In Multimodal Llms Ye Wenqian, Zheng Guangtao, Ma Yunsheng, Cao Xu, Lai Bolin, Rehg James M., Zhang Aidong
- How Interpretable Are Reasoning Explanations From Prompting Large Language Models? Yeo Wei Jie, Satapathy Ranjan, Goh Rick Siow Mong, Cambria Erik
- Jailbreak Attacks And Defenses Against Large Language Models: A Survey Yi Sibo, Liu Yule, Sun Zhen, Cong Tianshuo, He Xinlei, Song Jiaxing, Xu Ke, Li Qi
- Vqattack: Transferable Adversarial Attacks On Visual Question Answering Via Pre-trained Models Yin Ziyi, Ye Muchao, Zhang Tianrong, Wang Jiaqi, Liu Han, Chen Jinghui, Wang Ting, Ma Fenglong
- Unveiling The Safety Of Gpt-4o: An Empirical Study Using Jailbreak Attacks Ying Zonghao, Liu Aishan, Liu Xianglong, Tao Dacheng
- Safety Fine-tuning At (almost) No Cost: A Baseline For Vision Large Language Models Yongshuo Zong, Ondrej Bohdal, Tingyang Yu, Yongxin Yang, Timothy Hospedales
- CSRT: Evaluation And Analysis Of Llms Using Code-switching Red-teaming Dataset Yoo Haneul, Yang Yongjin, Lee Hwaran
- Cosafe: Evaluating Large Language Model Safety In Multi-turn Dialogue Coreference Yu Erxin, Li Jing, Liao Ming, Wang Siqi, Gao Zuchen, Mi Fei, Hong Lanqing
- Don't Listen To Me: Understanding And Exploring Jailbreak Prompts Of Large Language Models Yu Zhiyuan, Liu Xiaogeng, Liang Shunning, Cameron Zach, Xiao Chaowei, Zhang Ning
- Refuse Whenever You Feel Unsafe: Improving Safety In Llms Via Decoupled Refusal Training Yuan Youliang, Jiao Wenxiang, Wang Wenxuan, Huang Jen-tse, Xu Jiahao, Liang Tian, He Pinjia, Tu Zhaopeng
- Rigorllm: Resilient Guardrails For Large Language Models Against Undesired Content Yuan Zhuowen, Xiong Zidi, Zeng Yi, Yu Ning, Jia Ruoxi, Song Dawn, Li Bo
- Round Trip Translation Defence Against Large Language Model Jailbreaking Attacks Yung Canaan, Dolatabadi Hadi Mohaghegh, Erfani Sarah, Leckie Christopher
- SPOT: Text Source Prediction From Originality Score Thresholding Yvinec Edouard, Kasser Gabriel
- How Johnny Can Persuade Llms To Jailbreak Them: Rethinking Persuasion To Challenge AI Safety By Humanizing Llms Zeng Yi, Lin Hongpeng, Zhang Jingwen, Yang Diyi, Jia Ruoxi, Shi Weiyan
- Injecagent: Benchmarking Indirect Prompt Injections In Tool-integrated Large Language Model Agents Zhan Qiusi, Liang Zhixiang, Ying Zifan, Kang Daniel
- Goal-guided Generative Prompt Injection Attack On Large Language Models Zhang Chong, Jin Mingyu, Yu Qinkai, Liu Chengzhi, Xue Haochen, Jin Xiaobo
- Extracting Prompts By Inverting LLM Outputs Zhang Collin, Morris John X., Shmatikov Vitaly
- ALTER: Augmentation For Large-table-based Reasoning Zhang Han, Ma Yuheng, Yang Hanfang
- Enja: Ensemble Jailbreak On Large Language Models Zhang Jiahao, Wang Zilong, Wang Ruofan, Ma Xingjun, Jiang Yu-gang
- Stealthy Attack On Large Language Model Based Recommendation Zhang Jinghao, Liu Yuting, Liu Qiang, Wu Shu, Guo Guibing, Wang Liang
- R4: Reinforced Retriever-reorder-responder For Retrieval-augmented Large Language Models Zhang Taolin, Li Dongyang, Chen Qizhou, Wang Chengyu, Huang Longtao, Xue Hui, He Xiaofeng, Huang Jun
- Promptfix: Few-shot Backdoor Removal Via Adversarial Prompt Tuning Zhang Tianrong, Xi Zhaohan, Wang Ting, Mitra Prasenjit, Chen Jinghui
- Dpp-based Adversarial Prompt Searching For Language Models Zhang Xu, Wan Xiaojun
- Benchmarking Trustworthiness Of Multimodal Large Language Models: A Comprehensive Study Zhang Yichi, Huang Yao, Sun Yitong, Liu Chang, Zhao Zhe, Fang Zhengwei, Wang Yifan, Chen Huanran, Yang Xiao, Wei Xingxing, Su Hang, Dong Yinpeng, Zhu Jun
- Boosting Jailbreak Attack With Momentum Zhang Yihao, Wei Zeming
- Leveraging Biases In Large Language Models: "bias-knn" For Effective Few-shot Learning Zhang Yong, Li Hanzhang, Li Zhitao, Cheng Ning, Li Ming, Xiao Jing, Wang Jianzong
- Pattern-aware Chain-of-thought Prompting In Large Language Models Zhang Yufeng, Wang Xuepeng, Wu Lingxiang, Wang Jinqiao
- Intention Analysis Makes Llms A Good Jailbreak Defender Zhang Yuqi, Ding Liang, Zhang Lefei, Tao Dacheng
- PPTC-R Benchmark: Towards Evaluating The Robustness Of Large Language Models For Powerpoint Task Completion Zhang Zekai, Guo Yiduo, Liang Yaobo, Zhao Dongyan, Duan Nan
- DARG: Dynamic Evaluation Of Large Language Models Via Adaptive Reasoning Graph Zhang Zhehao, Chen Jiaao, Yang Diyi
- Safe Unlearning: A Surprisingly Effective And Generalizable Solution To Defend Against Jailbreak Attacks Zhang Zhexin, Yang Junxiao, Ke Pei, Cui Shiyao, Zheng Chujie, Wang Hongning, Huang Minlie
- Prefix Guidance: A Steering Wheel For Large Language Models To Defend Against Jailbreak Attacks Zhao Jiawei, Chen Kejiang, Yuan Xiaojian, Zhang Weiming
- A Survey Of Backdoor Attacks And Defenses On Large Language Models: Implications For Security Measures Zhao Shuai, Jia Meihuizi, Guo Zhongliang, Gan Leilei, Xu Xiaoyu, Wu Xiaobao, Fu Jie, Feng Yichao, Pan Fengjun, Tuan Luu Anh
- Defending Large Language Models Against Jailbreak Attacks Via Layer-specific Editing Zhao Wei, Li Zhe, Li Yige, Zhang Ye, Sun Jun
- Improving The Robustness Of Large Language Models Via Consistency Alignment Zhao Yukun, Yan Lingyong, Sun Weiwei, Xing Guoliang, Wang Shuaiqiang, Meng Chong, Cheng Zhicong, Ren Zhaochun, Yin Dawei
- Adversarial Contrastive Decoding: Boosting Safety Alignment Of Large Language Models Via Opposite Prompt Optimization Zhao Zhengyue, Zhang Xiaoyun, Xu Kaidi, Hu Xing, Zhang Rui, Du Zidong, Guo Qi, Chen Yunji
- NEO-BENCH: Evaluating Robustness Of Large Language Models With Neologisms Zheng Jonathan, Ritter Alan, Xu Wei
- Kun: Answer Polishment For Chinese Self-alignment With Instruction Back-translation Zheng Tianyu, Guo Shuyue, Qu Xingwei, Guo Jiawei, Du Xinrun, Jia Qi, Lin Chenghua, Huang Wenhao, Fu Jie, Zhang Ge
- Robust Prompt Optimization For Defending Language Models Against Jailbreaking Attacks Zhou Andy, Li Bo, Wang Haohan
- Few-shot Adversarial Prompt Learning On Vision-language Models Zhou Yiwei, Xia Xiaobo, Lin Zhiwei, Han Bo, Liu Tongliang
- Large Language Models Are Involuntary Truth-tellers: Exploiting Fallacy Failure For Jailbreak Attacks Zhou Yue, Zou Henry Peng, Di Eugenio Barbara, Zhang Yang
- Defending Jailbreak Prompts Via In-context Adversarial Game Zhou Yujun, Han Yufei, Zhuang Haomin, Guo Kehan, Liang Zhenwen, Bao Hongyan, Zhang Xiangliang
- Don't Say No: Jailbreaking LLM By Suppressing Refusal Zhou Yukai, Wang Wenjie
- Emulated Disalignment: Safety Alignment For Large Language Models May Backfire! Zhou Zhanhui, Liu Jie, Dong Zhichen, Liu Jiaheng, Yang Chao, Ouyang Wanli, Qiao Yu
- TAT-LLM: A Specialized Language Model For Discrete Reasoning Over Tabular And Textual Data Zhu Fengbin, Liu Ziyang, Feng Fuli, Wang Chao, Li Moxin, Chua Tat-seng
- KG-FPQ: Evaluating Factuality Hallucination In Llms With Knowledge Graph-based False Premise Questions Zhu Yanxu, Xiao Jinlin, Wang Yuhang, Sang Jitao
- Halueval-wild: Evaluating Hallucinations Of Language Models In The Wild Zhu Zhiying, Yang Yiming, Sun Zhiqing
- Astraios: Parameter-efficient Instruction Tuning Code Large Language Models Zhuo Terry Yue, Zebaze Armel, Suppattarachai Nitchakarn, Von Werra Leandro, De Vries Harm, Liu Qian, Muennighoff Niklas
- Image-to-text Logic Jailbreak: Your Imagination Can Help You Do Anything Zou Xiaotian, Li Ke, Chen Yongkang
🏷 Survey Paper
- Generative Deep Neural Networks For Dialogue: A Short Review Serban Iulian Vlad, Lowe Ryan, Charlin Laurent, Pineau Joelle
- Deep Learning Based Chatbot Models Csaky Richard
- Analyzing The Forgetting Problem In The Pretrain-finetuning Of Dialogue Response Models He Tianxing, Liu Jun, Cho Kyunghyun, Ott Myle, Liu Bing, Glass James, Peng Fuchun
- A Survey Of Natural Language Generation Techniques With A Focus On Dialogue Systems - Past, Present And Future Directions Santhanam Sashank, Shaikh Samira
- Generalization In Generation: A Closer Look At Exposure Bias Schmidt Florian
- Natural Language Generation Using Reinforcement Learning With External Rewards Srinivasan Vidhushini, Santhanam Sashank, Shaikh Samira
- Review Conversational Reading Comprehension Xu Hu, Liu Bing, Shu Lei, Yu Philip S.
- Logical Natural Language Generation From Open-domain Tables Chen Wenhu, Chen Jianshu, Su Yu, Chen Zhiyu, Wang William Yang
- App-aware Response Synthesis For User Reviews Farooq Umar, Siddique A. B., Jamour Fuad, Zhao Zhijia, Hristidis Vagelis
- Compressing Large-scale Transformer-based Models: A Case Study On BERT Ganesh Prakhar, Chen Yao, Lou Xin, Khan Mohammad Ali, Yang Yin, Sajjad Hassan, Nakov Preslav, Chen Deming, Winslett Marianne
- Conversational Machine Comprehension: A Literature Review Gupta Somil, Rawat Bhanu Pratap Singh, Yu Hong
- A Closer Look At The Robustness Of Vision-and-language Pre-trained Models Li Linjie, Gan Zhe, Liu Jingjing
- Adversarial Training For Large Neural Language Models Liu Xiaodong, Cheng Hao, He Pengcheng, Chen Weizhu, Wang Yu, Poon Hoifung, Gao Jianfeng
- On Incorporating Structural Information To Improve Dialogue Response Generation Moghe Nikita, Vijayan Priyesh, Ravindran Balaraman, Khapra Mitesh M.
- Let's Stop Incorrect Comparisons In End-to-end Relation Extraction! Taillé Bruno, Guigue Vincent, Scoutheeten Geoffrey, Gallinari Patrick
- On The Universality Of Deep Contextual Language Models Bhatt Shaily, Goyal Poonam, Dandapat Sandipan, Choudhury Monojit, Sitaram Sunayana
- A Primer On Pretrained Multilingual Language Models Doddapaneni Sumanth, Ramesh Gowtham, Khapra Mitesh M., Kunchukuttan Anoop, Kumar Pratyush
- Text2app: A Framework For Creating Android Apps From Text Descriptions Hasan Masum, Mehrab Kazi Sajeed, Ahmad Wasi Uddin, Shahriyar Rifat
- AMMUS: A Survey Of Transformer-based Pretrained Models In Natural Language Processing Kalyan Katikapalli Subramanyam, Rajasekharan Ajit, Sangeetha Sivanesan
- Quiz-style Question Generation For News Stories Lelkes Adam D., Tran Vinh Q., Yu Cong
- Personalized Transformer For Explainable Recommendation Li Lei, Zhang Yongfeng, Chen Li
- Recent Advances In Natural Language Processing Via Large Pre-trained Language Models: A Survey Min Bonan, Ross Hayley, Sulem Elior, Veyseh Amir Pouran Ben, Nguyen Thien Huu, Sainz Oscar, Agirre Eneko, Heinz Ilana, Roth Dan
- Software-based Dialogue Systems: Survey, Taxonomy And Challenges Motger Quim, Franch Xavier, Marco Jordi
- Pre-train, Prompt, And Predict: A Systematic Survey Of Prompting Methods In Natural Language Processing Pengfei Liu, Weizhe Yuan, Jinlan Fu, Zhengbao Jiang, Hiroaki Hayashi, Graham Neubig
- Societal Biases In Language Generation: Progress And Challenges Sheng Emily, Chang Kai-wei, Natarajan Premkumar, Peng Nanyun
- List: Lite Prompted Self-training Makes Parameter-efficient Few-shot Learners Wang Yaqing, Mukherjee Subhabrata, Liu Xiaodong, Gao Jing, Awadallah Ahmed Hassan, Gao Jianfeng
- Transformer-based Korean Pretrained Language Models: A Survey On Three Years Of Progress Yang Kichang
- Language Bias In Visual Question Answering: A Survey And Taxonomy Yuan Desen
- DSGPT: Domain-specific Generative Pre-training Of Transformers For Text Generation In E-commerce Title And Review Summarization Zhang Xueying, Jiang Yunjiang, Shang Yue, Cheng Zhaomeng, Zhang Chi, Fan Xiaochuan, Xiao Yun, Long Bo
- An Exploratory Study On Long Dialogue Summarization: What Works And What's Next Zhang Yusen, Ni Ansong, Yu Tao, Zhang Rui, Zhu Chenguang, Deb Budhaditya, Celikyilmaz Asli, Awadallah Ahmed Hassan, Radev Dragomir
- Adapting Language Models For Zero-shot Learning By Meta-tuning On Dataset And Prompt Collections Zhong Ruiqi, Lee Kristy, Zhang Zheng, Klein Dan
- Evidence > Intuition: Transferability Estimation For Encoder Selection Bassignana Elisa, Müller-eberstein Max, Zhang Mike, Plank Barbara
- Shortcut Learning Of Large Language Models In Natural Language Understanding Du Mengnan, He Fengxiang, Zou Na, Tao Dacheng, Hu Xia
- A Survey Of Vision-language Pre-trained Models Du Yifan, Liu Zikang, Li Junyi, Zhao Wayne Xin
- Thinking About GPT-3 In-context Learning For Biomedical IE? Think Again Gutiérrez Bernal Jiménez, Mcneal Nikolas, Washington Clay, Chen You, Li Lang, Sun Huan, Su Yu
- TRUE: Re-evaluating Factual Consistency Evaluation Honovich Or, Aharoni Roee, Herzig Jonathan, Taitelbaum Hagai, Kukliansy Doron, Cohen Vered, Scialom Thomas, Szpektor Idan, Hassidim Avinatan, Matias Yossi
- Survey Of Hallucination In Natural Language Generation Ji Ziwei, Lee Nayeon, Frieske Rita, Yu Tiezheng, Su Dan, Xu Yan, Ishii Etsuko, Bang Yejin, Chen Delong, Dai Wenliang, Chan Ho Shu, Madotto Andrea, Fung Pascale
- Towards Reasoning In Large Language Models: A Survey Jie Huang, Kevin Chen-chuan Chang
- A Survey On Retrieval-augmented Text Generation Li Huayang, Su Yixuan, Cai Deng, Wang Yan, Liu Lemao
- Vision-and-language Pretrained Models: A Survey Long Siqu, Cao Feiqi, Han Soyeon Caren, Yang Haiqin
- A Survey On In-context Learning Qingxiu Dong, Lei Li, Damai Dai, Ce Zheng, Jingyuan Ma, Rui Li, Heming Xia, Jingjing Xu, Zhiyong Wu, Baobao Chang, Xu Sun, Lei Li, Zhifang Sui
- Ebms Vs. CL: Exploring Self-supervised Visual Pretraining For Visual Question Answering Shevchenko Violetta, Abbasnejad Ehsan, Dick Anthony, Hengel Anton Van Den, Teney Damien
- Reasoning With Language Model Prompting: A Survey Shuofei Qiao, Yixin Ou, Ningyu Zhang, Xiang Chen, Yunzhi Yao, Shumin Deng, Chuanqi Tan, Fei Huang, Huajun Chen
- Pay Attention To Your Tone: Introducing A New Dataset For Polite Language Rewrite Wang Xun, Ge Tao, Mao Allen, Li Yuki, Wei Furu, Chen Si-qing
- Adapting Pretrained Text-to-text Models For Long Text Sequences Xiong Wenhan, Gupta Anchit, Toshniwal Shubham, Mehdad Yashar, Yih Wen-tau
- A Survey On GPT-3 Zong Mingyu, Krishnamachari Bhaskar
- Methods For Estimating And Improving Robustness Of Language Models Štefánik Michal
- Chatgpt And Beyond: The Generative AI Revolution In Education Al-smadi Mohammad
- Tokenizer Choice For LLM Training: Negligible Or Crucial? Ali Mehdi, Fromm Michael, Thellmann Klaudia, Rutmann Richard, Lübbering Max, Leveling Johannes, Klug Katrin, Ebert Jan, Doll Niclas, Buschhoff Jasper Schulze, Jain Charvi, Weber Alexander Arno, Jurkschat Lena, Abdelwahab Hammam, John Chelsea, Suarez Pedro Ortiz, Ostendorff Malte, Weinbach Samuel, Sifa Rafet, Kesselheim Stefan, Flores-herr Nicolas
- Generative AI Perceptions: A Survey To Measure The Perceptions Of Faculty, Staff, And Students On Generative AI Tools In Academia Amani Sara, White Lance, Balart Trini, Arora Laksha, Shryock Kristi J., Brumbelow Kelly, Watson Karan L.
- L-eval: Instituting Standardized Evaluation For Long Context Language Models An Chenxin, Gong Shansan, Zhong Ming, Zhao Xingjian, Li Mukai, Zhang Jun, Kong Lingpeng, Qiu Xipeng
- Augmenting Llms With Knowledge: A Survey On Hallucination Prevention Andriopoulos Konstantinos, Pouwelse Johan
- Real-time Visual Feedback To Guide Benchmark Creation: A Human-and-metric-in-the-loop Workflow Arunkumar Anjana, Mishra Swaroop, Sachdeva Bhavdeep, Baral Chitta, Bryan Chris
- Harnessing Gpt-3.5-turbo For Rhetorical Role Prediction In Legal Cases Belfathi Anas, Hernandez Nicolas, Monceaux Laura
- A Survey On Prompting Techniques In Llms Bhandari Prabin
- Automating Question Generation From Educational Text Bhowmick Ayan Kumar, Jagmohan Ashish, Vempaty Aditya, Dey Prasenjit, Hall Leigh, Hartman Jeremy, Kokku Ravi, Maheshwari Hema
- Eight Things To Know About Large Language Models Bowman Samuel R.
- Development Of The Chatgpt, Generative Artificial Intelligence And Natural Large Language Models For Accountable Reporting And Use (CANGARU) Guidelines Cacciamani Giovanni E., Eppler Michael B., Ganjavi Conner, Pekan Asli, Biedermann Brett, Collins Gary S., Gill Inderbir S.
- The Life Cycle Of Knowledge In Big Language Models: A Survey Cao Boxi, Lin Hongyu, Han Xianpei, Sun Le
- Creativity Support In The Age Of Large Language Models: An Empirical Study Involving Emerging Writers Chakrabarty Tuhin, Padmakumar Vishakh, Brahman Faeze, Muresan Smaranda
- Chatgpt May Pass The Bar Exam Soon, But Has A Long Way To Go For The Lexglue Benchmark Chalkidis Ilias
- Language Model Behavior: A Comprehensive Survey Chang Tyler A., Bergen Benjamin K.
- Chatgpt's One-year Anniversary: Are Open-source Large Language Models Catching Up? Chen Hailin, Jiao Fangkai, Li Xingxuan, Qin Chengwei, Ravaut Mathieu, Zhao Ruochen, Xiong Caiming, Joty Shafiq
- Challenges And Contributing Factors In The Utilization Of Large Language Models (llms) Chen Xiaoliang, Li Liangbin, Chang Le, Huang Yunhe, Zhao Yuxuan, Zhang Yuxiao, Li Dinuo
- T-eval: Evaluating The Tool Utilization Capability Of Large Language Models Step By Step Chen Zehui, Du Weihua, Zhang Wenwei, Liu Kuikun, Liu Jiangning, Zheng Miao, Zhuo Jingming, Zhang Songyang, Lin Dahua, Chen Kai, Zhao Feng
- An Integrative Survey On Mental Health Conversational Agents To Bridge Computer Science And Medical Perspectives Cho Young Min, Rai Sunny, Ungar Lyle, Sedoc João, Guntuku Sharath Chandra
- Navigate Through Enigmatic Labyrinth A Survey Of Chain Of Thought Reasoning: Advances, Frontiers And Future Chu Zheng, Chen Jingchang, Chen Qianglong, Yu Weijiang, He Tao, Wang Haotian, Peng Weihua, Liu Ming, Qin Bing, Liu Ting
- Cognitively Inspired Components For Social Conversational Agents Clay Alex, Alonso Eduardo, Mondragón Esther
- Llm-in-the-loop: Leveraging Large Language Model For Thematic Analysis Dai Shih-chieh, Xiong Aiping, Ku Lun-wei
- Complex QA And Language Models Hybrid Architectures, Survey Daull Xavier, Bellot Patrice, Bruno Emmanuel, Martin Vincent, Murisasco Elisabeth
- Large Language Models Douglas Michael R.
- ROBBIE: Robust Bias Evaluation Of Large Generative Language Models Esiobu David, Tan Xiaoqing, Hosseini Saghar, Ung Megan, Zhang Yuchen, Fernandes Jude, Dwivedi-yu Jane, Presani Eleonora, Williams Adina, Smith Eric Michael
- Should Chatgpt Be Biased? Challenges And Risks Of Bias In Large Language Models Ferrara Emilio
- Vision Language Transformers: A Survey Fields Clayton, Kennington Casey
- Medalign: A Clinician-generated Dataset For Instruction Following With Electronic Medical Records Fleming Scott L., Lozano Alejandro, Haberkorn William J., Jindal Jenelle A., Reis Eduardo P., Thapa Rahul, Blankemeier Louis, Genkins Julian Z., Steinberg Ethan, Nayak Ashwin, Patel Birju S., Chiang Chia-chun, Callahan Alison, Huo Zepeng, Gatidis Sergios, Adams Scott J., Fayanju Oluseyi, Shah Shreya J., Savage Thomas, Goh Ethan, Chaudhari Akshay S., Aghaeepour Nima, Sharp Christopher, Pfeffer Michael A., Liang Percy, Chen Jonathan H., Morse Keith E., Brunskill Emma P., Fries Jason A., Shah Nigam H.
- The Impact And Applications Of Chatgpt: A Systematic Review Of Literature Reviews Gabashvili Irene S.
- Examining User-friendly And Open-sourced Large GPT Models: A Survey On Language, Multimodal, And Scientific GPT Models Gao Kaiyuan, He Sunan, He Zhenyu, Lin Jiacheng, Pei Qizhi, Shao Jie, Zhang Wei
- Simulating H.P. Lovecraft Horror Literature With The Chatgpt Large Language Model Garrido-merchán Eduardo C., Arroyo-barrigüete José Luis, Gozalo-brizuela Roberto
- A Survey Of Confidence Estimation And Calibration In Large Language Models Geng Jiahui, Cai Fengyu, Wang Yuxia, Koeppl Heinz, Nakov Preslav, Gurevych Iryna
- Can Llms Capture Human Preferences? Goli Ali, Singh Amandeep
- Figstep: Jailbreaking Large Vision-language Models Via Typographic Visual Prompts Gong Yichen, Ran Delong, Liu Jinyuan, Wang Conglei, Cong Tianshuo, Wang Anyu, Duan Sisi, Wang Xiaoyun
- Augmented Language Models: A Survey Grégoire Mialon, Roberto Dessì, Maria Lomeli, Christoforos Nalmpantis, Ram Pasunuru, Roberta Raileanu, Baptiste Rozière, Timo Schick, Jane Dwivedi-yu, Asli Celikyilmaz, Edouard Grave, Yann Lecun, Thomas Scialom
- Evaluating Large Language Models: A Comprehensive Survey Guo Zishan, Jin Renren, Liu Chuang, Huang Yufei, Shi Dan, Supryadi, Yu Linhao, Liu Yan, Li Jiaxuan, Xiong Bojian, Xiong Deyi
- A Survey Of Vision-language Pre-training From The Lens Of Multimodal Machine Translation Gwinnup Jeremy, Duh Kevin
- Large Language Models Meet Computer Vision: A Brief Survey Hamadi Raby
- Potential Benefits Of Employing Large Language Models In Research In Moral Education And Development Han Hyemin
- Cheddar: Student-chatgpt Dialogue In EFL Writing Education Han Jieun, Yoo Haneul, Myung Junho, Kim Minsun, Lee Tak Yeon, Ahn So-yeon, Oh Alice
- Personalisation Within Bounds: A Risk Taxonomy And Policy Framework For The Alignment Of Large Language Models With Personalised Feedback Hannah Rose Kirk, Bertie Vidgen, Paul Röttger, Scott A. Hale
- A Survey On Hallucination In Large Language Models: Principles, Taxonomy, Challenges, And Open Questions Huang Lei, Yu Weijiang, Ma Weitao, Zhong Weihong, Feng Zhangyin, Wang Haotian, Chen Qianglong, Peng Weihua, Feng Xiaocheng, Qin Bing, Liu Ting
- Can Large Language Models Explain Themselves? A Study Of Llm-generated Self-explanations Huang Shiyuan, Mamidanna Siddarth, Jangam Shreedhar, Zhou Yilun, Gilpin Leilani H.
- Advancing Transformer Architecture In Long-context Large Language Models: A Comprehensive Survey Huang Yunpeng, Xu Jingwei, Lai Junyu, Jiang Zixu, Chen Taolue, Li Zenan, Yao Yuan, Ma Xiaoxing, Yang Lijuan, Chen Hao, Li Shupeng, Zhao Penghao
- Aligning Language Models To User Opinions Hwang Eunjeong, Majumder Bodhisattwa Prasad, Tandon Niket
- Co-writing With Opinionated Language Models Affects Users' Views Jakesch Maurice, Bhat Advait, Buschek Daniel, Zalmanson Lior, Naaman Mor
- You Only Forward Once: Prediction And Rationalization In A Single Forward Pass Jiang Han, Duan Junwen, Qu Zhe, Wang Jianxin
- How Can Recommender Systems Benefit From Large Language Models: A Survey Jianghao Lin, Xinyi Dai, Yunjia Xi, Weiwen Liu, Bo Chen, Hao Zhang, Yong Liu, Chuhan Wu, Xiangyang Li, Chenxu Zhu, Huifeng Guo, Yong Yu, Ruiming Tang, Weinan Zhang
- Evaluation And Enhancement Of Semantic Grounding In Large Vision-language Models Jiaying Lu, Jinmeng Rao, Kezhen Chen, Xiaoyuan Guo, Yawen Zhang, Baochen Sun, Carl Yang, Jie Yang
- When Large Language Models Meet Personalization: Perspectives Of Challenges And Opportunities Jin Chen, Zheng Liu, Xu Huang, Chenwang Wu, Qi Liu, Gangwei Jiang, Yuanhao Pu, Yuxuan Lei, Xiaolong Chen, Xingmei Wang, Defu Lian, Enhong Chen
- "it's Not Like Jarvis, But It's Pretty Close!" -- Examining Chatgpt's Usage Among Undergraduate Students In Computer Science Joshi Ishika, Budhiraja Ritvik, Akolekar Harshal D, Challa Jagat Sesh, Kumar Dhruv
- Robust Recommender System: A Survey And Future Directions Kaike Zhang, Qi Cao, Fei Sun, Yunfan Wu, Shuchang Tao, Huawei Shen, Xueqi Cheng
- A Survey Of GPT-3 Family Large Language Models Including Chatgpt And GPT-4 Kalyan Katikapalli Subramanyam
- Gptaraeval: A Comprehensive Evaluation Of Chatgpt On Arabic NLP Khondaker Md Tawkat Islam, Waheed Abdul, Nagoudi El Moatez Billah, Abdul-mageed Muhammad
- Auto-survey Challenge Khuong Thanh Gia Hieu, Rachmat Benedictus Kent
- The Past, Present And Better Future Of Feedback Learning In Large Language Models For Subjective Human Preferences And Values Kirk Hannah Rose, Bean Andrew M., Vidgen Bertie, Röttger Paul, Hale Scott A.
- Fundamentals Of Generative Large Language Models And Perspectives In Cyber-defense Kucharavy Andrei, Schillaci Zachary, Maréchal Loïc, Würsch Maxime, Dolamic Ljiljana, Sabonnadiere Remi, David Dimitri Percia, Mermoud Alain, Lenders Vincent
- Large Language Models For Generative Recommendation: A Survey And Visionary Discussions Lei Li, Yongfeng Zhang, Dugang Liu, Li Chen
- Chatgpt: A Meta-analysis After 2.5 Months Leiter Christoph, Zhang Ran, Chen Yanran, Belouadi Jonas, Larionov Daniil, Fresen Vivian, Eger Steffen
- Vision-language Instruction Tuning: A Review And Analysis Li Chen, Ge Yixiao, Li Dian, Shan Ying
- Summarizing Multiple Documents With Conversational Structure For Meta-review Generation Li Miao, Hovy Eduard, Lau Jey Han
- A Survey On Fairness In Large Language Models Li Yingji, Du Mengnan, Song Rui, Wang Xin, Wang Ying
- A Practical Survey On Zero-shot Prompt Design For In-context Learning Li Yinheng
- Opening Up Chatgpt: Tracking Openness, Transparency, And Accountability In Instruction-tuned Text Generators Liesenfeld Andreas, Lopez Alianda, Dingemanse Mark
- A Survey On Large Language Models For Recommendation Likang Wu, Zhi Zheng, Zhaopeng Qiu, Hao Wang, Hongchao Gu, Tingjia Shen, Chuan Qin, Chen Zhu, Hengshu Zhu, Qi Liu, Hui Xiong, Enhong Chen
- Advances In Embodied Navigation Using Large Language Models: A Survey Lin Jinzhou, Gao Han, Feng Xuxiang, Xu Rongtao, Wang Changwei, Zhang Man, Guo Li, Xu Shibiao
- Establishing Trustworthiness: Rethinking Tasks And Model Evaluation Litschko Robert, Müller-eberstein Max, Van Der Goot Rob, Weber Leon, Plank Barbara
- Trustworthy Llms: A Survey And Guideline For Evaluating Large Language Models' Alignment Liu Yang, Yao Yuanshun, Ton Jean-francois, Zhang Xiaoying, Guo Ruocheng, Cheng Hao, Klochkov Yegor, Taufiq Muhammad Faaiz, Li Hang
- Summary Of Chatgpt-related Research And Perspective Towards The Future Of Large Language Models Liu Yiheng, Han Tianle, Ma Siyuan, Zhang Jiayue, Yang Yuanyuan, Tian Jiaming, He Hao, Li Antong, He Mengshen, Liu Zhengliang, Wu Zihao, Zhao Lin, Zhu Dajiang, Li Xiang, Qiang Ning, Shen Dingang, Liu Tianming, Ge Bao
- On Learning To Summarize With Large Language Models As References Liu Yixin, Shi Kejian, He Katherine S, Ye Longtian, Fabbri Alexander R., Liu Pengfei, Radev Dragomir, Cohan Arman
- Llama-reviewer: Advancing Code Review Automation With Large Language Models Through Parameter-efficient Fine-tuning Lu Junyi, Yu Lei, Li Xiaojia, Yang Li, Zuo Chun
- Towards Logiglue: A Brief Survey And A Benchmark For Analyzing Logical Reasoning Capabilities Of Language Models Luo Man, Kumbhar Shrinidhi, Shen Ming, Parmar Mihir, Varshney Neeraj, Banerjee Pratyay, Aditya Somak, Baral Chitta
- Fine-tuning Llama For Multi-stage Text Retrieval Ma Xueguang, Wang Liang, Yang Nan, Wei Furu, Lin Jimmy
- Brain In A Vat: On Missing Pieces Towards Artificial General Intelligence In Large Language Models Ma Yuxi, Zhang Chi, Zhu Song-chun
- Gpteval: A Survey On Assessments Of Chatgpt And GPT-4 Mao Rui, Chen Guanyi, Zhang Xulang, Guerin Frank, Cambria Erik
- Towards Efficient Generative Large Language Model Serving: A Survey From Algorithms To Systems Miao Xupeng, Oliaro Gabriele, Zhang Zhihao, Cheng Xinhao, Jin Hongyi, Chen Tianqi, Jia Zhihao
- Characterizing Large Language Models As Rationalizers Of Knowledge-intensive Tasks Mishra Aditi, Rahman Sajjadur, Kim Hannah, Mitra Kushan, Hruschka Estevam
- Chatgpt In The Age Of Generative AI And Large Language Models: A Concise Survey Mohamadi Salman, Mujtaba Ghulam, Le Ngan, Doretto Gianfranco, Adjeroh Donald A.
- The Impact Of Artificial Intelligence On The Evolution Of Digital Education: A Comparative Study Of Openai Text Generation Tools Including Chatgpt, Bing Chat, Bard, And Ernie Motlagh Negin Yazdani, Khajavi Matin, Sharifi Abbas, Ahmadi Mohsen
- Can Llms Follow Simple Rules? Mu Norman, Chen Sarah, Wang Zifan, Chen Sizhe, Karamardian David, Aljeraisy Lulwa, Alomair Basel, Hendrycks Dan, Wagner David
- Llms-healthcare: Current Applications And Challenges Of Large Language Models In Various Medical Specialties Mumtaz Ummara, Ahmed Awais, Mumtaz Summaya
- On Evaluating The Integration Of Reasoning And Action In LLM Agents With Database Question Answering Nan Linyong, Zhang Ellen, Zou Weijin, Zhao Yilun, Zhou Wenfei, Cohan Arman
- A Comprehensive Overview Of Large Language Models Naveed Humza, Khan Asad Ullah, Qiu Shi, Saqib Muhammad, Anwar Saeed, Usman Muhammad, Akhtar Naveed, Barnes Nick, Mian Ajmal
- Large Language Models In Healthcare And Medical Domain: A Review Nazi Zabir Al, Peng Wei
- LEXTREME: A Multi-lingual And Multi-task Benchmark For The Legal Domain Niklaus Joel, Matoshi Veton, Rani Pooja, Galassi Andrea, Stürmer Matthias, Chalkidis Ilias
- Evaluating AI Vocational Skills Through Professional Testing Noever David, Ciolino Matt
- On The Unexpected Abilities Of Large Language Models Nolfi Stefano
- Generative AI: Implications And Applications For Education Tzirides Anastasia Olga (Olnancy), Saini Akash, Zapata Gabriela, Searsmith Duane, Cope Bill, Kalantzis Mary, Castro Vania, Kourkoulou Theodora, Jones John, Da Silva Rodrigo Abrantes, Whiting Jen, Kastania Nikoleta Polyxeni
- Several Categories Of Large Language Models (llms): A Short Survey Pahune Saurabh, Chandrasekharan Manoj
- Giraffe: Adventures In Expanding Context Lengths In Llms Pal Arka, Karkhanis Deep, Roberts Manley, Dooley Samuel, Sundararajan Arvind, Naidu Siddartha
- Automatically Correcting Large Language Models: Surveying The Landscape Of Diverse Self-correction Strategies Pan Liangming, Saxon Michael, Xu Wenda, Nathani Deepak, Wang Xinyi, Wang William Yang
- A Large Language Model Approach To Educational Survey Feedback Analysis Parker Michael J., Anderson Caitlin, Stone Claire, Oh Yearim
- Pre-train, Prompt And Recommendation: A Comprehensive Survey Of Language Modelling Paradigm Adaptations In Recommender Systems Peng Liu, Lemei Zhang, Jon Atle Gulla
- Science In The Era Of Chatgpt, Large Language Models And Generative AI: Challenges For Research Ethics And How To Respond Pournaras Evangelos
- The Robots Are Here: Navigating The Generative AI Revolution In Computing Education Prather James, Denny Paul, Leinonen Juho, Becker Brett A., Albluwi Ibrahim, Craig Michelle, Keuning Hieke, Kiesler Natalie, Kohn Tobias, Luxton-reilly Andrew, Macneil Stephen, Peterson Andrew, Pettit Raymond, Reeves Brent N., Savelka Jaromir
- Tricking Llms Into Disobedience: Formalizing, Analyzing, And Detecting Jailbreaks Rao Abhinav, Vashistha Sachin, Naik Atharva, Aditya Somak, Choudhury Monojit
- Evaluating Llms On Document-based QA: Exact Answer Selection And Numerical Extraction Using Cogtale Dataset Rasool Zafaryab, Kurniawan Stefanus, Balugo Sherwin, Barnett Scott, Vasa Rajesh, Chesser Courtney, Hampstead Benjamin M., Belleville Sylvie, Mouzakis Kon, Bahar-fuchs Alex
- On Context Utilization In Summarization With Large Language Models Ravaut Mathieu, Sun Aixin, Chen Nancy F., Joty Shafiq
- Commonsense Reasoning For Conversational AI: A Survey Of The State Of The Art Richardson Christopher, Heck Larry
- Bio-sieve: Exploring Instruction Tuning Large Language Models For Systematic Review Automation Robinson Ambrose, Thorne William, Wu Ben P., Pandor Abdullah, Essat Munira, Stevenson Mark, Song Xingyi
- The Science Of Detecting Llm-generated Texts Ruixiang Tang, Yu-neng Chuang, Xia Hu
- Are Emergent Abilities Of Large Language Models A Mirage? Rylan Schaeffer, Brando Miranda, Sanmi Koyejo
- Unlocking Practical Applications In Legal Domain: Evaluation Of GPT For Zero-shot Semantic Annotation Of Legal Texts Savelka Jaromir
- Ruffle&riley: Towards The Automated Induction Of Conversational Tutoring Systems Schmucker Robin, Xia Meng, Azaria Amos, Mitchell Tom
- A Review Of Repository Level Prompting For Llms Schonholtz Douglas
- Large Language Model Alignment: A Survey Shen Tianhao, Jin Renren, Huang Yufei, Liu Chuang, Dong Weilong, Guo Zishan, Wu Xinwei, Liu Yan, Xiong Deyi
- A Survey On Multimodal Large Language Models Shukang Yin, Chaoyou Fu, Sirui Zhao, Ke Li, Xing Sun, Tong Xu, Enhong Chen
- Knowledge Unlearning For Llms: Tasks, Methods, And Challenges Si Nianwen, Zhang Hao, Chang Heyu, Zhang Wenlin, Qu Dan, Zhang Weiqiang
- Decoding Chatgpt: A Taxonomy Of Existing Research, Current Challenges, And Possible Future Directions Sohail Shahab Saquib, Farhat Faiza, Himeur Yassine, Nadeem Mohammad, Madsen Dag Øivind, Singh Yashbir, Atalla Shadi, Mansoor Wathiq
- How To Bridge The Gap Between Modalities: A Comprehensive Survey On Multimodal Large Language Model Song Shezheng, Li Xiaopeng, Li Shasha, Zhao Shan, Yu Jie, Ma Jun, Mao Xiaoguang, Zhang Weimin
- Chatgpt: A Study On Its Utility For Ubiquitous Software Engineering Tasks Sridhara Giriprasad, G. Ranjani H., Mazumdar Sourav
- Beyond Memorization: Violating Privacy Via Inference With Large Language Models Staab Robin, Vero Mark, Balunović Mislav, Vechev Martin
- Cognitive Architectures For Language Agents Sumers Theodore R., Yao Shunyu, Narasimhan Karthik, Griffiths Thomas L.
- Corex: Pushing The Boundaries Of Complex Reasoning Through Multi-model Collaboration Sun Qiushi, Yin Zhangyue, Li Xiang, Wu Zhiyong, Qiu Xipeng, Kong Lingpeng
- A Short Survey Of Viewing Large Language Models In Legal Aspect Sun Zhongxiang
- Interpreting Pretrained Language Models Via Concept Bottlenecks Tan Zhen, Cheng Lu, Wang Song, Bo Yuan, Li Jundong, Liu Huan
- Do Llms Exhibit Human-like Response Biases? A Case Study In Survey Design Tjuatja Lindia, Chen Valerie, Wu Sherry Tongshuang, Talwalkar Ameet, Neubig Graham
- Automating Code-related Tasks Through Transformers: The Impact Of Pre-training Tufano Rosalia, Pascarella Luca, Bavota Gabriele
- Adventures Of Trustworthy Vision-language Models: A Survey Vatsa Mayank, Jain Anubhooti, Singh Richa
- Efficient Large Language Models: A Survey Wan Zhongwei, Wang Xin, Liu Che, Alam Samiul, Zheng Yu, Liu Jiachen, Qu Zhongnan, Yan Shen, Zhu Yi, Zhang Quanlu, Chowdhury Mosharaf, Zhang Mi
- Shall We Pretrain Autoregressive Language Models With Retrieval? A Comprehensive Study Wang Boxin, Ping Wei, Xu Peng, Mcafee Lawrence, Liu Zihan, Shoeybi Mohammad, Dong Yi, Kuchaiev Oleksii, Li Bo, Xiao Chaowei, Anandkumar Anima, Catanzaro Bryan
- A Survey Of The Evolution Of Language Model-based Dialogue Systems Wang Hongru, Wang Lingzhi, Du Yiming, Chen Liang, Zhou Jingyan, Wang Yufei, Wong Kam-fai
- On The Robustness Of Chatgpt: An Adversarial And Out-of-distribution Perspective Wang Jindong, Hu Xixu, Hou Wenxin, Chen Hao, Zheng Runkai, Wang Yidong, Yang Linyi, Huang Haojun, Ye Wei, Geng Xiubo, Jiao Binxin, Zhang Yue, Xie Xing
- Adding Guardrails To Advanced Chatbots Wang Yanchen, Singh Lisa
- Generating Valid And Natural Adversarial Examples With Large Language Models Wang Zimu, Wang Wei, Chen Qi, Wang Qiufeng, Nguyen Anh
- A Survey Of Large Language Models Wayne Xin Zhao, Kun Zhou, Junyi Li, Tianyi Tang, Xiaolei Wang, Yupeng Hou, Yingqian Min, Beichen Zhang, Junjie Zhang, Zican Dong, Yifan Du, Chen Yang, Yushuo Chen, Zhipeng Chen, Jinhao Jiang, Ruiyang Ren, Yifan Li, Xinyu Tang, Zikang Liu, Peiyu Liu, Jian-yun Nie, Ji-rong Wen
- Instructblip: Towards General-purpose Vision-language Models With Instruction Tuning Wenliang Dai, Junnan Li, Dongxu Li, Anthony Meng Huat Tiong, Junqi Zhao, Weisheng Wang, Boyang Li, Pascale Fung, Steven Hoi
- Exploring Parameter-efficient Fine-tuning Techniques For Code Generation With Large Language Models Weyssow Martin, Zhou Xin, Kim Kisub, Lo David, Sahraoui Houari
- Natural Language Generation And Understanding Of Big Code For Ai-assisted Programming: A Review Wong Man Fai, Guo Shangxin, Hang Ching Nam, Ho Siu Wai, Tan Chee Wei
- A Survey On Multi-behavior Sequential Recommendation Xiaoqing Chen, Zhitao Li, Weike Pan, Zhong Ming
- Large Language Models For Generative Information Extraction: A Survey Xu Derong, Chen Wei, Peng Wenjun, Zhang Chao, Xu Tong, Zhao Xiangyu, Wu Xian, Zheng Yefeng, Wang Yang, Chen Enhong
- Practical And Ethical Challenges Of Large Language Models In Education: A Systematic Scoping Review Yan Lixiang, Sha Lele, Zhao Linxuan, Li Yuheng, Martinez-maldonado Roberto, Chen Guanliang, Li Xinyu, Jin Yueqiao, Gašević Dragan
- From Instructions To Intrinsic Human Values -- A Survey Of Alignment Goals For Big Models Yao Jing, Yi Xiaoyuan, Wang Xiting, Wang Jindong, Xie Xing
- A Survey On Large Language Model (LLM) Security And Privacy: The Good, The Bad, And The Ugly Yao Yifan, Duan Jinhao, Xu Kaidi, Cai Yuanfang, Sun Zhibo, Zhang Yue
- Towards Better Chain-of-thought Prompting Strategies: A Survey Yu Zihan, He Liang, Wu Zhen, Dai Xinyu, Chen Jiajun
- Large Language Models Illuminate A Progressive Pathway To Artificial Healthcare Assistant: A Review Yuan Mingze, Bao Peng, Yuan Jiajia, Shen Yunhao, Chen Zifan, Xie Yi, Zhao Jie, Chen Yang, Zhang Li, Shen Lin, Dong Bin
- Aligning Large Language Models With Human: A Survey Yufei Wang, Wanjun Zhong, Liangyou Li, Fei Mi, Xingshan Zeng, Wenyong Huang, Lifeng Shang, Xin Jiang, Qun Liu
- Retrieval-augmented Generation For Large Language Models: A Survey Yunfan Gao, Yun Xiong, Xinyu Gao, Kangxiang Jia, Jinliu Pan, Yuxi Bi, Yi Dai, Jiawei Sun, Meng Wang, Haofen Wang
- Building Trust In Conversational AI: A Comprehensive Review And Solution Architecture For Explainable, Privacy-aware Systems Using Llms And Knowledge Graph Zafar Ahtsham, Parthasarathy Venkatesh Balavadhani, Van Chan Le, Shahid Saad, Khan Aafaq Iqbal, Shahid Arsalan
- What Matters In Training A Gpt4-style Language Model With Multimodal Inputs? Zeng Yan, Zhang Hanbo, Zheng Jiani, Xia Jiangnan, Wei Guoqiang, Wei Yang, Zhang Yuchen, Kong Tao
- A Comprehensive Analysis Of The Effectiveness Of Large Language Models As Automatic Dialogue Evaluators Zhang Chen, D'haro Luis Fernando, Chen Yiming, Zhang Malu, Li Haizhou
- Tuning Large Language Model For End-to-end Speech Translation Zhang Hao, Si Nianwen, Chen Yaqi, Zhang Wenlin, Yang Xukui, Qu Dan, Jiao Xiaolin
- Instruction Tuning For Large Language Models: A Survey Zhang Shengyu, Dong Linfeng, Li Xiaoya, Zhang Sen, Sun Xiaofei, Wang Shuhe, Li Jiwei, Hu Runyi, Zhang Tianwei, Wu Fei, Wang Guoyin
- Meta Prompting For AI Systems Zhang Yifan, Yuan Yang, Yao Andrew Chi-chih
- Siren's Song In The AI Ocean: A Survey On Hallucination In Large Language Models Zhang Yue, Li Yafu, Cui Leyang, Cai Deng, Liu Lemao, Fu Tingchen, Huang Xinting, Zhao Enbo, Zhang Yu, Chen Yulong, Wang Longyue, Luu Anh Tuan, Bi Wei, Shi Freda, Shi Shuming
- Students' Perceptions And Preferences Of Generative Artificial Intelligence Feedback For Programming Zhang Zhengdong, Dong Zihan, Shi Yang, Matsuda Noboru, Price Thomas, Xu Dongkuan
- Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-thought Reasoning To Language Agents Zhang Zhuosheng, Yao Yao, Zhang Aston, Tang Xiangru, Ma Xinbei, He Zhiwei, Wang Yiming, Gerstein Mark, Wang Rui, Liu Gongshen, Zhao Hai
- How Do Large Language Models Capture The Ever-changing World Knowledge? A Review Of Recent Advances Zhang Zihan, Fang Meng, Chen Ling, Namazi-rad Mohammad-reza, Wang Jun
- Unifying The Perspectives Of NLP And Software Engineering: A Survey On Language Models For Code Zhang Ziyin, Chen Chaoyu, Liu Bingchang, Liao Cong, Gong Zi, Yu Hang, Li Jianguo, Wang Rui
- Explainability For Large Language Models: A Survey Zhao Haiyan, Chen Hanjie, Yang Fan, Liu Ninghao, Deng Huiqi, Cai Hengyi, Wang Shuaiqiang, Yin Dawei, Du Mengnan
- Retrieving Multimodal Information For Augmented Generation: A Survey Zhao Ruochen, Chen Hailin, Wang Weishi, Jiao Fangkai, Do Xuan Long, Qin Chengwei, Ding Bosheng, Guo Xiaobao, Li Minzhi, Li Xingxuan, Joty Shafiq
- Learn From Model Beyond Fine-tuning: A Survey Zheng Hongling, Shen Li, Tang Anke, Luo Yong, Hu Han, Du Bo, Tao Dacheng
- The Rise And Potential Of Large Language Model Based Agents: A Survey Zhiheng Xi, Wenxiang Chen, Xin Guo, Wei He, Yiwen Ding, Boyang Hong, Ming Zhang, Junzhe Wang, Senjie Jin, Enyu Zhou, Rui Zheng, Xiaoran Fan, Xiao Wang, Limao Xiong, Yuhao Zhou, Weiran Wang, Changhao Jiang, Yicheng Zou, Xiangyang Liu, Zhangyue Yin, Shihan Dou, Rongxiang Weng, Wensen Cheng, Qi Zhang, Wenjuan Qin, Yongyan Zheng, Xipeng Qiu, Xuanjing Huang, Tao Gui
- GPT As A Baseline For Recommendation Explanation Texts Zhou Joyce, Joachims Thorsten
- The Mystery Of In-context Learning: A Comprehensive Survey On Interpretation And Analysis Zhou Yuxiang, Li Jiazheng, Xiang Yanzheng, Yan Hanqi, Gui Lin, He Yulan
- Are NLP Models Good At Tracing Thoughts: An Overview Of Narrative Understanding Zhu Lixing, Zhao Runcong, Gui Lin, He Yulan
- Recommender Systems In The Era Of Large Language Models (llms) Zihuai Zhao, Wenqi Fan, Jiatong Li, Yunqing Liu, Xiaowei Mei, Yiqi Wang, Zhen Wen, Fei Wang, Xiangyu Zhao, Jiliang Tang, Qing Li
- A Survey On Symbolic Knowledge Distillation Of Large Language Models Acharya Kamal, Velasquez Alvaro, Song Houbing Herbert
- History Of Generative Artificial Intelligence (AI) Chatbots: Past, Present, And Future Development Al-amin Md., Ali Mohammad Shazed, Salam Abdus, Khan Arif, Ali Ashraf, Ullah Ahsan, Alam Md Nur, Chowdhury Shamsul Kabir
- A Survey Of Large Language Models For European Languages Ali Wazir, Pyysalo Sampo
- Chatgpt Alternative Solutions: Large Language Models Survey Alipour Hanieh, Pendar Nick, Roy Kohinoor
- Understanding Alignment In Multimodal Llms: A Comprehensive Study Amirloo Elmira, Fauconnier Jean-philippe, Roesmann Christoph, Kerl Christian, Boney Rinu, Qian Yusu, Wang Zirui, Dehghan Afshin, Yang Yinfei, Gan Zhe, Grasch Peter
- Self-improving Customer Review Response Generation Based On Llms Azov Guy, Pelc Tatiana, Alon Adi Fledel, Kamhi Gila
- Hallucination Of Multimodal Large Language Models: A Survey Bai Zechen, Wang Pichao, Xiao Tianjun, He Tong, Han Zongbo, Zhang Zheng, Shou Mike Zheng
- Enhancing Event Reasoning In Large Language Models Through Instruction Fine-tuning With Semantic Causal Graphs Bethany Mazal, Bethany Emet, Wherry Brandon, Chiang Cho-yu, Vishwamitra Nishant, Rios Anthony, Najafirad Peyman
- Learning Shortcuts: On The Misleading Promise Of NLU In Language Models Bihani Geetanjali, Rayz Julia Taylor
- Digital Forgetting In Large Language Models: A Survey Of Unlearning Methods Blanco-justicia Alberto, Jebreel Najeeb, Manzanares Benet, Sánchez David, Domingo-ferrer Josep, Collell Guillem, Tan Kuan Eeik
- Generation, Distillation And Evaluation Of Motivational Interviewing-style Reflections With A Foundational Language Model Brown Andrew, Zhu Jiading, Abdelwahab Mohamed, Dong Alec, Wang Cindy, Rose Jonathan
- Enhancing Trust In Llms: Algorithms For Comparing And Interpreting Llms Brown Nik Bear
- A Survey On Mixture Of Experts Cai Weilin, Jiang Juyong, Wang Fan, Tang Jing, Kim Sunghun, Huang Jiayi
- On Behalf Of The Stakeholders: Trends In NLP Model Interpretability In The Era Of Llms Calderon Nitay, Reichart Roi
- XAI Meets Llms: A Survey Of The Relation Between Explainable AI And Large Language Models Cambria Erik, Malandri Lorenzo, Mercorio Fabio, Nobani Navid, Seveso Andrea
- Survey On Large Language Model-enhanced Reinforcement Learning: Concept, Taxonomy, And Methods Cao Yuji, Zhao Huan, Cheng Yuheng, Shu Ting, Liu Guolong, Liang Gaoqi, Zhao Junhua, Li Yun
- Captioning Visualizations With Large Language Models (CVLLM): A Tutorial Carenini Giuseppe, Johnson Jordon, Salamatian Ali
- A Review Of Multi-modal Large Language And Vision Models Carolan Kilian, Fennelly Laura, Smeaton Alan F.
- Conversational AI Powered By Large Language Models Amplifies False Memories In Witness Interviews Chan Samantha, Pataranutaporn Pat, Suri Aditya, Zulfikar Wazeer, Maes Pattie, Loftus Elizabeth F.
- Efficient Prompting Methods For Large Language Models: A Survey Chang Kaiyan, Xu Songcheng, Wang Chenglong, Luo Yingfeng, Xiao Tong, Zhu Jingbo
- Nlperturbator: Studying The Robustness Of Code Llms To Natural Language Variations Chen Junkai, Li Zhenhao, Hu Xing, Xia Xin
- Are We On The Right Way For Evaluating Large Vision-language Models? Chen Lin, Li Jinsong, Dong Xiaoyi, Zhang Pan, Zang Yuhang, Chen Zehui, Duan Haodong, Wang Jiaqi, Qiao Yu, Lin Dahua, Zhao Feng
- Recent Trends In Personalized Dialogue Generation: A Review Of Datasets, Methodologies, And Evaluations Chen Yi-pei, Nishida Noriki, Nakayama Hideki, Matsumoto Yuji
- Trojanrag: Retrieval-augmented Generation Can Be Backdoor Driver In Large Language Models Cheng Pengzhou, Ding Yidong, Ju Tianjie, Wu Zongru, Du Wei, Yi Ping, Zhang Zhuosheng, Liu Gongshen
- (A)I Am Not A Lawyer, But...: Engaging Legal Experts Towards Responsible LLM Policies For Legal Advice Cheong Inyoung, Xia King, Feng K. J. Kevin, Chen Quan Ze, Zhang Amy X.
- Breaking Down The Defenses: A Comparative Survey Of Attacks On Large Language Models Chowdhury Arijit Ghosh, Islam Md Mofijul, Kumar Vaibhav, Shezan Faysal Hossain, Kumar Vaibhav, Jain Vinija, Chadha Aman
- Improve Temporal Awareness Of Llms For Sequential Recommendation Chu Zhendong, Wang Zichao, Zhang Ruiyi, Ji Yangfeng, Wang Hongning, Sun Tong
- AI Safety In Generative AI Large Language Models: A Survey Chua Jaymari, Li Yun, Yang Shiyi, Wang Chen, Yao Lina
- Rocode: A Dataset For Measuring Code Intelligence From Problem Definitions In Romanian Cosma Adrian, Iordache Bogdan, Rosso Paolo
- Llambert: Large-scale Low-cost Data Annotation In NLP Csanády Bálint, Muzsai Lajos, Vedres Péter, Nádasdy Zoltán, Lukács András
- Or-bench: An Over-refusal Benchmark For Large Language Models Cui Justin, Chiang Wei-lin, Stoica Ion, Hsieh Cho-jui
- A Complete Survey On Llm-based AI Chatbots Dam Sumit Kumar, Hong Choong Seon, Qiao Yu, Zhang Chaoning
- Human-interpretable Adversarial Prompt Attack On Large Language Models With Situational Context Das Nilanjana, Raff Edward, Gaur Manas
- Application Of GPT Language Models For Innovation In Activities In University Teaching De Buenaga Manuel, Bueno Francisco Javier
- Deconstructing The Ethics Of Large Language Models From Long-standing Issues To New-emerging Dilemmas Deng Chengyuan, Duan Yiqun, Jin Xin, Chang Heng, Tian Yijun, Liu Han, Zou Henry Peng, Jin Yiqiao, Xiao Yijia, Wang Yichen, Wu Shenghao, Xie Zongxing, Gao Kuofeng, He Sihong, Zhuang Jun, Cheng Lu, Wang Haohan
- AI Does Not Alter Perceptions Of Text Messages Diamond N'yoma
- Data Augmentation Using Large Language Models: Data Perspectives, Learning Paradigms And Challenges Ding Bosheng, Qin Chengwei, Zhao Ruochen, Luo Tianze, Li Xinze, Chen Guizhen, Xia Wenhan, Hu Junjie, Luu Anh Tuan, Joty Shafiq
- Large Language Model Agent In Financial Trading: A Survey Ding Han, Li Yinheng, Wang Junhao, Chen Hang
- Boosting Large Language Models With Socratic Method For Conversational Mathematics Teaching Ding Yuyang, Hu Hanglei, Zhou Jie, Chen Qin, Jiang Bo, He Liang
- Finding Blind Spots In Evaluator Llms With Interpretable Checklists Doddapaneni Sumanth, Khan Mohammed Safi Ur Rahman, Verma Sshubam, Khapra Mitesh M.
- Attacks, Defenses And Evaluations For LLM Conversation Safety: A Survey Dong Zhichen, Zhou Zhanhui, Yang Chao, Shao Jing, Qiao Yu
- Internal And External Knowledge Interactive Refinement Framework For Knowledge-intensive Question Answering Du Haowei, Zhao Dongyan
- Low-cost Language Models: Survey And Performance Evaluation On Python Code Generation Espejel Jessica López, Alassan Mahaman Sanoussi Yahaya, Bouhandi Merieme, Dahhane Walid, Ettifouri El Hassane
- Decoding AI: The Inside Story Of Data Analysis In Chatgpt Evkaya Ozan, De Carvalho Miguel
- A Survey On RAG Meeting Llms: Towards Retrieval-augmented Large Language Models Fan Wenqi, Ding Yujuan, Ning Liangbo, Wang Shijie, Li Hengyun, Yin Dawei, Chua Tat-seng, Li Qing
- Towards Trustworthy AI: A Review Of Ethical And Robust Large Language Models Ferdaus Md Meftahul, Abdelguerfi Mahdi, Ioup Elias, Niles Kendall N., Pathak Ken, Sloan Steven
- Generative AI And Large Language Models For Cyber Security: All Insights You Need Ferrag Mohamed Amine, Alwahedi Fatima, Battah Ammar, Cherif Bilel, Mechri Abdechakour, Tihanyi Norbert
- A Survey Study On The State Of The Art Of Programming Exercise Generation Using Large Language Models Frankford Eduard, Höhn Ingo, Sauerwein Clemens, Breu Ruth
- Large Language Models And Games: A Survey And Roadmap Gallotta Roberto, Todd Graham, Zammit Marvin, Earle Sam, Liapis Antonios, Togelius Julian, Yannakakis Georgios N.
- A Review On The Use Of Large Language Models As Virtual Tutors García-méndez Silvia, De Arriba-pérez Francisco, Somoza-lópez María Del Carmen
- Generative AI In Education: A Study Of Educators' Awareness, Sentiments, And Influencing Factors Ghimire Aashish, Prather James, Edwards John
- Exploring The Frontier Of Vision-language Models: A Survey Of Current Methodologies And Future Directions Ghosh Akash, Acharya Arkadeep, Saha Sriparna, Jain Vinija, Chadha Aman
- Does Chatgpt Have A Mind? Goldstein Simon, Levinstein Benjamin A.
- A Survey On Responsible Generative AI: What To Generate And What Not Gu Jindong
- OLMES: A Standard For Language Model Evaluations Gu Yuling, Tafjord Oyvind, Kuehl Bailey, Haddad Dany, Dodge Jesse, Hajishirzi Hannaneh
- A Survey On Hallucination In Large Vision-language Models Hanchao Liu, Wenyuan Xue, Yifei Chen, Dapeng Chen, Xiutian Zhao, Ke Wang, Liping Hou, Rongjun Li, Wei Peng
- A Comprehensive Overview Of Large Language Models (llms) For Cyber Defences: Opportunities And Directions Hassanin Mohammed, Moustafa Nour
- Llms Meet Multimodal Generation And Editing: A Survey He Yingqing, Liu Zhaoyang, Chen Jingye, Tian Zeyue, Liu Hongyu, Chi Xiaowei, Liu Runtao, Yuan Ruibin, Xing Yazhou, Wang Wenhai, Dai Jifeng, Zhang Yong, Xue Wei, Liu Qifeng, Guo Yike, Chen Qifeng
- A Survey On Large Language Model-based Game Agents Hu Sihao, Huang Tiansheng, Ilhan Fatih, Tekin Selim, Liu Gaowen, Kompella Ramana, Liu Ling
- RAG And RAU: A Survey On Retrieval-augmented Language Model In Natural Language Processing Hu Yucheng, Lu Yuxing
- Large Language Models In Mental Health Care: A Scoping Review Hua Yining, Liu Fenglin, Yang Kailai, Li Zehan, Na Hongbin, Sheu Yi-han, Zhou Peilin, Moran Lauren V., Ananiadou Sophia, Beam Andrew, Torous John
- Applying And Evaluating Large Language Models In Mental Health Care: A Scoping Review Of Human-assessed Generative Tasks Hua Yining, Na Hongbin, Li Zehan, Liu Fenglin, Fang Xiao, Clifton David, Torous John
- A Survey On Evaluation Of Multimodal Large Language Models Huang Jiaxing, Zhang Jingyi
- A Comprehensive Survey On Evaluating Large Language Model Applications In The Medical Industry Huang Yining, Tang Keke, Chen Meilian, Wang Boyuan
- The Promise And Challenges Of Using Llms To Accelerate The Screening Process Of Systematic Reviews Huotala Aleksi, Kuutila Miikka, Ralph Paul, Mäntylä Mika
- A Survey On Human Preference Learning For Large Language Models Jiang Ruili, Chen Kehai, Bai Xuefeng, He Zhixuan, Li Juntao, Yang Muyun, Zhao Tiejun, Nie Liqiang, Zhang Min
- LLMRS: Unlocking Potentials Of Llm-based Recommender Systems For Software Purchase John Angela, Aidoo Theophilus, Behmanush Hamayoon, Gunduz Irem B., Shrestha Hewan, Rahman Maxx Richard, Maaß Wolfgang
- A Survey On Hardware Accelerators For Large Language Models Kachris Christoforos
- When Can Llms Actually Correct Their Own Mistakes? A Critical Survey Of Self-correction Of Llms Kamoi Ryo, Zhang Yusen, Zhang Nan, Han Jiawei, Zhang Rui
- Comparing Plausibility Estimates In Base And Instruction-tuned Large Language Models Kauf Carina, Chersoni Emmanuele, Lenci Alessandro, Fedorenko Evelina, Ivanova Anna A.
- From Text To Transformation: A Comprehensive Review Of Large Language Models' Versatility Kaur Pravneet, Kashyap Gautam Siddharth, Kumar Ankit, Nafis Md Tabrez, Kumar Sandeep, Shokeen Vikrant
- Codeaid: Evaluating A Classroom Deployment Of An Llm-based Programming Assistant That Balances Student And Educator Needs Kazemitabaar Majeed, Ye Runlong, Wang Xiaoning, Henley Austin Z., Denny Paul, Craig Michelle, Grossman Tovi
- Exploring The Frontiers Of Llms In Psychological Applications: A Comprehensive Review Ke Luoma, Tong Song, Cheng Peng, Peng Kaiping
- Grounding And Evaluation For Large Language Models: Practical Challenges And Lessons Learned (survey) Kenthapadi Krishnaram, Sameki Mehrnoosh, Taly Ankur
- Beyond Code Generation: An Observational Study Of Chatgpt Usage In Software Engineering Practice Khojah Ranim, Mohamad Mazen, Leitner Philipp, Neto Francisco Gomes De Oliveira
- A Comprehensive Survey Of Accelerated Generation Techniques In Large Language Models Khoshnoodi Mahsa, Jain Vinija, Gao Mingye, Srikanth Malavika, Chadha Aman
- A Survey On Integration Of Large Language Models With Intelligent Robots Kim Yeseung, Kim Dohyun, Choi Jieun, Park Jisang, Oh Nayoung, Park Daehyung
- What's Wrong? Refining Meeting Summaries With LLM Feedback Kirstein Frederic, Ruas Terry, Gipp Bela
- A Systematic Survey And Critical Review On Evaluating Large Language Models: Challenges, Limitations, And Recommendations Laskar Md Tahmid Rahman, Alqahtani Sawsan, Bari M Saiful, Rahman Mizanur, Khan Mohammad Abdullah Matin, Khan Haidar, Jahan Israt, Bhuiyan Amran, Tan Chee Wei, Parvez Md Rizwan, Hoque Enamul, Joty Shafiq, Huang Jimmy
- A Survey Of Large Language Models In Finance (finllms) Lee Jean, Stevens Nicholas, Han Soyeon Caren, Song Minseok
- The Life Cycle Of Large Language Models: A Review Of Biases In Education Lee Jinsook, Hicke Yann, Yu Renzhe, Brooks Christopher, Kizilcec René F.
- Llava-docent: Instruction Tuning With Multimodal Large Language Model To Support Art Appreciation Education Lee Unggi, Jeon Minji, Lee Yunseo, Byun Gyuri, Son Yoorim, Shin Jaeyoon, Ko Hongkyu, Kim Hyeoncheol
- A Sentiment Consolidation Framework For Meta-review Generation Li Miao, Lau Jey Han, Hovy Eduard
- Large Language Models For Human-like Autonomous Driving: A Survey Li Yun, Katsumata Kai, Javanmardi Ehsan, Tsukada Manabu
- Glitch Tokens In Large Language Models: Categorization Taxonomy And Effective Detection Li Yuxi, Liu Yi, Deng Gelei, Zhang Ying, Song Wenjia, Shi Ling, Wang Kailong, Li Yuekang, Liu Yang, Wang Haoyu
- Internal Consistency And Self-feedback In Large Language Models: A Survey Liang Xun, Song Shichao, Zheng Zifan, Wang Hanyu, Yu Qingchen, Li Xunkai, Li Rong-hua, Cheng Peng, Wang Zhonghao, Xiong Feiyu, Li Zhiyu
- Controllable Text Generation For Large Language Models: A Survey Liang Xun, Wang Hanyu, Wang Yezhaohui, Song Shichao, Yang Jiawei, Niu Simin, Hu Jie, Liu Dan, Yao Shunyu, Xiong Feiyu, Li Zhiyu
- MARIO: Math Reasoning With Code Interpreter Output -- A Reproducible Pipeline Liao Minpeng, Luo Wei, Li Chengxi, Wu Jing, Fan Kai
- Exploring Vulnerabilities And Protections In Large Language Models: A Survey Liu Frank Weizhen, Hu Chenhui
- Large Language Model-based Agents For Software Engineering: A Survey Liu Junwei, Wang Kaixin, Chen Yixuan, Peng Xin, Chen Zhenpeng, Zhang Lingming, Lou Yiling
- Synthetic Context Generation For Question Generation Liu Naiming, Wang Zichao, Baraniuk Richard
- When Chatgpt Is Gone: Creativity Reverts And Homogeneity Persists Liu Qinghan, Zhou Yiyong, Huang Jihao, Li Guiquan
- The Generation Gap: Exploring Age Bias In The Underlying Value Systems Of Large Language Models Liu Siyang, Maturi Trish, Yi Bowen, Shen Siqi, Mihalcea Rada
- Datasets For Large Language Models: A Comprehensive Survey Liu Yang, Cao Jiahuan, Liu Chongyu, Ding Kai, Jin Lianwen
- Answering Real-world Clinical Questions Using Large Language Model Based Systems Low Yen Sia, Jackson Michael L., Hyde Rebecca J., Brown Robert E., Sanghavi Neil M., Baldwin Julian D., Pike C. William, Muralidharan Jananee, Hui Gavin, Alexander Natasha, Hassan Hadeel, Nene Rahul V., Pike Morgan, Pokrzywa Courtney J., Vedak Shivam, Yan Adam Paul, Yao Dong-han, Zipursky Amy R., Dinh Christina, Ballentine Philip, Derieg Dan C., Polony Vladimir, Chawdry Rehan N., Davies Jordan, Hyde Brigham B., Shah Nigam H., Gombar Saurabh
- Learning Positional Attention For Sequential Recommendation Luo Fan, Zhang Juan, Xu Shenghui
- From Understanding To Utilization: A Survey On Explainability For Large Language Models Luo Haoyan, Specia Lucia
- Hallucination Detection And Hallucination Mitigation: An Investigation Luo Junliang, Li Tianyu, Wu Di, Jenkin Michael, Liu Steve, Dudek Gregory
- Coggpt: Unleashing The Power Of Cognitive Dynamics On Large Language Models Lv Yaojia, Pan Haojie, Fu Ruiji, Liu Ming, Wang Zhongyuan, Qin Bing
- Eight Methods To Evaluate Robust Unlearning In Llms Lynch Aengus, Guo Phillip, Ewart Aidan, Casper Stephen, Hadfield-menell Dylan
- The Potential And Challenges Of Evaluating Attitudes, Opinions, And Values In Large Language Models Ma Bolei, Wang Xinpeng, Hu Tiancheng, Haensch Anna-carolina, Hedderich Michael A., Plank Barbara, Kreuter Frauke
- Enhancing Programming Education With Chatgpt: A Case Study On Student Perceptions And Interactions In A Python Course Ma Boxaun, Chen Li, Konomi Shin'ichi
- Are Large Language Models Good Prompt Optimizers? Ma Ruotian, Wang Xiaolei, Zhou Xin, Li Jian, Du Nan, Gui Tao, Zhang Qi, Huang Xuanjing
- Beyond Generative Artificial Intelligence: Roadmap For Natural Language Generation Maestre María Miró, Martínez-murillo Iván, Martin Tania J., Navarro-colorado Borja, Ferrández Antonio, Cueto Armando Suárez, Lloret Elena
- A Data Generation Perspective To The Mechanism Of In-context Learning Mao Haitao, Liu Guangliang, Ma Yao, Wang Rongrong, Johnson Kristen, Tang Jiliang
- LLM Critics Help Catch LLM Bugs Mcaleese Nat, Pokorny Rai Michael, Uribe Juan Felipe Ceron, Nitishinskaya Evgenia, Trebacz Maja, Leike Jan
- A Survey Of Resource-efficient LLM And Multimodal Foundation Models Mengwei Xu, Wangsong Yin, Dongqi Cai, Rongjie Yi, Daliang Xu, Qipeng Wang, Bingyang Wu, Yihao Zhao, Chen Yang, Shihe Wang, Qiyang Zhang, Zhenyan Lu, Li Zhang, Shangguang Wang, Yuanchun Li, Yunxin Liu, Xin Jin, Xuanzhe Liu
- Semantics Of Multiword Expressions In Transformer-based Models: A Survey Miletić Filip, Schulte Im Walde Sabine
- Large Language Models: A Survey Minaee Shervin, Mikolov Tomas, Nikzad Narjes, Chenaghlu Meysam, Socher Richard, Amatriain Xavier, Gao Jianfeng
- Comparison Of Large Language Models For Generating Contextually Relevant Questions Molina Ivo Lodovico, Švábenský Valdemar, Minematsu Tsubasa, Chen Li, Okubo Fumiya, Shimada Atsushi
- Beyond Accuracy: Evaluating The Reasoning Behavior Of Large Language Models -- A Survey Mondorf Philipp, Plank Barbara
- Virtual Personas For Language Models Via An Anthology Of Backstories Moon Suhong, Abdulhai Marwa, Kang Minwoo, Suh Joseph, Soedarmadji Widyadewi, Behar Eran Kohen, Chan David M.
- Exploring The Landscape Of Large Language Models: Foundations, Techniques, And Challenges Moradi Milad, Yan Ke, Colwell David, Samwald Matthias, Asgari Rhona
- Fakes Of Varying Shades: How Warning Affects Human Perception And Engagement Regarding LLM Hallucinations Nahar Mahjabin, Seo Haeseung, Lee Eun-ju, Xiong Aiping, Lee Dongwon
- Open Llama2 Model For The Lithuanian Language Nakvosas Artūras, Daniušis Povilas, Mulevičius Vytas
- Using Llms To Model The Beliefs And Preferences Of Targeted Populations Namikoshi Keiichi, Filipowicz Alex, Shamma David A., Iliev Rumen, Hogan Candice L., Arechiga Nikos
- Apprentices To Research Assistants: Advancing Research With Large Language Models Namvarpour M., Razi A.
- Review-feedback-reason (refer): A Novel Framework For NLG Evaluation And Reasoning Narsupalli Yaswanth, Chandra Abhranil, Muppirala Sreevatsa, Gupta Manish, Goyal Pawan
- From Questions To Insightful Answers: Building An Informed Chatbot For University Resources Neupane Subash, Hossain Elias, Keith Jason, Tripathi Himanshu, Ghiasi Farbod, Golilarz Noorbakhsh Amiri, Amirlatifi Amin, Mittal Sudip, Rahimi Shahram
- Pico: Peer Review In Llms Based On The Consistency Optimization Ning Kun-peng, Yang Shuo, Liu Yu-yang, Yao Jia-yu, Liu Zhen-hui, Wang Yu, Pang Ming, Yuan Li
- User-llm: Efficient LLM Contextualization With User Embeddings Ning Lin, Liu Luyang, Wu Jiaxing, Wu Neo, Berlowitz Devora, Prakash Sushant, Green Bradley, O'banion Shawn, Xie Jun
- Large Language Models And Cognitive Science: A Comprehensive Review Of Similarities, Differences, And Challenges Niu Qian, Liu Junyu, Bi Ziqian, Feng Pohsun, Peng Benji, Chen Keyu, Li Ming
- Gpt-ology, Computational Models, Silicon Sampling: How Should We Think About Llms In Cognitive Science? Ong Desmond C.
- A Systematic Review Of Data-to-text NLG Osuji Chinonso Cynthia, Ferreira Thiago Castro, Davis Brian
- The What, Why, And How Of Context Length Extension Techniques In Large Language Models -- A Detailed Survey Pawar Saurav, Tonmoy S. M Towhidul Islam, Zaman S M Mehedi, Jain Vinija, Chadha Aman, Das Amitava
- Securing Large Language Models: Addressing Bias, Misinformation, And Prompt Attacks Peng Benji, Chen Keyu, Li Ming, Feng Pohsun, Bi Ziqian, Liu Junyu, Niu Qian
- Review-llm: Harnessing Large Language Models For Personalized Review Generation Peng Qiyao, Liu Hongtao, Xu Hongyan, Yang Qing, Shao Minglai, Wang Wenjun
- A Review Of The Challenges With Massive Web-mined Corpora Used In Large Language Models Pre-training Perełkiewicz Michał, Poświata Rafał
- Reasoning With Large Language Models, A Survey Plaat Aske, Wong Annie, Verberne Suzan, Broekens Joost, Van Stein Niki, Back Thomas
- Evaluation Of Language Models In The Medical Context Under Resource-constrained Settings Posada Andrea, Rueckert Daniel, Meissen Felix, Müller Philip
- Accelerating Large Language Model Pretraining Via LFR Pedagogy: Learn, Focus, And Review Prakriya Neha, Yen Jui-nan, Hsieh Cho-jui, Cong Jason
- Exploring The Nexus Of Large Language Models And Legal Systems: A Short Survey Qin Weicong, Sun Zhongxiang
- The Synergy Between Data And Multi-modal Large Language Models: A Survey From Co-development Perspective Qin Zhen, Chen Daoyuan, Zhang Wenhao, Yao Liuyi, Huang Yilun, Ding Bolin, Li Yaliang, Deng Shuiguang
- LAMPO: Large Language Models As Preference Machines For Few-shot Ordinal Classification Qin Zhen, Wu Junru, Shen Jiaming, Liu Tianqi, Wang Xuanhui
- Tool Learning With Large Language Models: A Survey Qu Changle, Dai Sunhao, Wei Xiaochi, Cai Hengyi, Wang Shuaiqiang, Yin Dawei, Xu Jun, Wen Ji-rong
- Llms Among Us: Generative AI Participating In Digital Discourse Radivojevic Kristina, Clark Nicholas, Brenner Paul
- STEER: Assessing The Economic Rationality Of Large Language Models Raman Narun, Lundy Taylor, Amouyal Samuel, Levine Yoav, Leyton-brown Kevin, Tennenholtz Moshe
- Quantifying Contamination In Evaluating Code Generation Capabilities Of Language Models Riddell Martin, Ni Ansong, Cohan Arman
- Exploring Advanced Large Language Models With Llmsuite Roffo Giorgio
- Talking To Machines: Do You Read Me? Rojas-barahona Lina M.
- Achieving Peak Performance For Large Language Models: A Systematic Review Rostam Zhyar Rzgar K, Szénási Sándor, Kertész Gábor
- How Chatgpt Changed The Media's Narratives On AI: A Semi-automated Narrative Analysis Through Frame Semantics Ryazanov Igor, Öhman Carl, Björklund Johanna
- Safetyprompts: A Systematic Review Of Open Datasets For Evaluating And Improving Large Language Model Safety Röttger Paul, Pernisi Fabio, Vidgen Bertie, Hovy Dirk
- A Systematic Survey Of Prompt Engineering In Large Language Models: Techniques And Applications Sahoo Pranab, Singh Ayush Kumar, Saha Sriparna, Jain Vinija, Mondal Samrat, Chadha Aman
- Risks, Causes, And Mitigations Of Widespread Deployments Of Large Language Models (llms): A Survey Sakib Md Nazmus, Islam Md Athikul, Pathak Royal, Arifin Md Mashrur
- Large Language Models Show Human-like Social Desirability Biases In Survey Responses Salecha Aadesh, Ireland Molly E., Subrahmanya Shashanka, Sedoc João, Ungar Lyle H., Eichstaedt Johannes C.
- Prompting Llms To Compose Meta-review Drafts From Peer-review Narratives Of Scholarly Manuscripts Santu Shubhra Kanti Karmaker, Sinha Sanjeev Kumar, Bansal Naman, Knipper Alex, Sarkar Souvika, Salvador John, Mahajan Yash, Guttikonda Sri, Akter Mousumi, Freestone Matthew, Williams Matthew C. Jr
- Ruffle&riley: Insights From Designing And Evaluating A Large Language Model-based Conversational Tutoring System Schmucker Robin, Xia Meng, Azaria Amos, Mitchell Tom
- Continual Learning Of Large Language Models: A Comprehensive Survey Shi Haizhou, Xu Zihao, Wang Hengyi, Qin Weiyi, Wang Wenyuan, Wang Yibin, Wang Zifeng, Ebrahimi Sayna, Wang Hao
- Keep The Cost Down: A Review On Methods To Optimize LLM's Kv-cache Consumption Shi Luohe, Zhang Hongyi, Yao Yao, Li Zuchao, Zhao Hai
- How Can Large Language Models Enable Better Socially Assistive Human-robot Interaction: A Brief Survey Shi Zhonghao, Landrum Ellen, O'connell Amy, Kian Mina, Pinto-alva Leticia, Shrestha Kaleen, Zhu Xiaoyuan, Matarić Maja J
- Alchemistcoder: Harmonizing And Eliciting Code Capability By Hindsight Tuning On Multi-source Data Song Zifan, Wang Yudong, Zhang Wenwei, Liu Kuikun, Lyu Chengqi, Song Demin, Guo Qipeng, Yan Hang, Lin Dahua, Chen Kai, Zhao Cairong
- A Survey Of Using Large Language Models For Generating Infrastructure As Code Srivatsa Kalahasti Ganesh, Mukhopadhyay Sabyasachi, Katrapati Ganesh, Shrivastava Manish
- LUCID: Llm-generated Utterances For Complex And Interesting Dialogues Stacey Joe, Cheng Jianpeng, Torr John, Guigue Tristan, Driesen Joris, Coca Alexandru, Gaynor Mark, Johannsen Anders
- A Comprehensive Study Of The Capabilities Of Large Language Models For Vulnerability Detection Steenhoek Benjamin, Rahman Md Mahbubur, Roy Monoshi Kumar, Alam Mirza Sanjida, Barr Earl T., Le Wei
- Effectiveness Of Chatgpt In Explaining Complex Medical Reports To Patients Sun Mengxuan, Reiter Ehud, Kiltie Anne E, Ramsay George, Duncan Lisa, Murchie Peter, Adam Rosalind
- Large Language Models As Conversational Movie Recommenders: A User Study Sun Ruixuan, Li Xinyi, Akella Avinash, Konstan Joseph A.
- Large Language Models And Video Games: A Preliminary Scoping Review Sweetser Penny
- 1.5-pints Technical Report: Pretraining In Days, Not Months -- Your Language Model Thrives On Quality Data Tan Calvin, Wang Jerome
- Large Language Models For Data Annotation: A Survey Tan Zhen, Li Dawei, Wang Song, Beigi Alimohammad, Jiang Bohan, Bhattacharjee Amrita, Karami Mansooreh, Li Jundong, Cheng Lu, Liu Huan
- Prioritizing Safeguarding Over Autonomy: Risks Of LLM Agents For Science Tang Xiangru, Jin Qiao, Zhu Kunlun, Yuan Tongxin, Zhang Yichi, Zhou Wangchunshu, Qu Meng, Zhao Yilun, Tang Jian, Zhang Zhuosheng, Cohan Arman, Lu Zhiyong, Gerstein Mark
- Chatgpt Incorrectness Detection In Software Reviews Tanzil Minaoar Hossain, Khan Junaed Younus, Uddin Gias
- A Survey On Self-evolution Of Large Language Models Tao Zhengwei, Lin Ting-en, Chen Xiancai, Li Hangyu, Wu Yuchuan, Li Yongbin, Jin Zhi, Huang Fei, Tao Dacheng, Zhou Jingren
- Supporting Human Raters With The Detection Of Harmful Content Using Large Language Models Thomas Kurt, Kelley Patrick Gage, Tao David, Meiklejohn Sarah, Vallis Owen, Tan Shunwen, Bratanič Blaž, Ferreira Felipe Tiengo, Eranti Vijay Kumar, Bursztein Elie
- An Ecosage Assistant: Towards Building A Multimodal Plant Care Dialogue Assistant Tomar Mohit, Tiwari Abhisek, Saha Tulika, Jha Prince, Saha Sriparna
- A Comprehensive Survey Of Hallucination Mitigation Techniques In Large Language Models Tonmoy S. M Towhidul Islam, Zaman S M Mehedi, Jain Vinija, Rani Anku, Rawte Vipula, Chadha Aman, Das Amitava
- Prompting Techniques For Secure Code Generation: A Systematic Investigation Tony Catherine, Ferreyra Nicolás E. Díaz, Mutas Markus, Dhiff Salem, Scandariato Riccardo
- Two Tales Of Persona In Llms: A Survey Of Role-playing And Personalization Tseng Yu-min, Huang Yu-chao, Hsiao Teng-yun, Chen Wei-lin, Huang Chao-wei, Meng Yu, Chen Yun-nung
- Llms With Industrial Lens: Deciphering The Challenges And Prospects -- A Survey Urlana Ashok, Kumar Charaka Vinayak, Singh Ajeet Kumar, Garlapati Bala Mallikarjunarao, Chalamala Srinivasa Rao, Mishra Rahul
- A Survey Of Prompt Engineering Methods In Large Language Models For Different NLP Tasks Vatsal Shubham, Dubey Harsh
- "Confidently Nonsensical?": A Critical Survey On The Perspectives And Challenges Of 'hallucinations' In NLP Venkit Pranav Narayanan, Chakravorti Tatiana, Gupta Vipul, Biggs Heidi, Srinath Mukund, Goswami Koustava, Rajtmajer Sarah, Wilson Shomir
- Generative Large Language Models In Automated Fact-checking: A Survey Vykopal Ivan, Pikuliak Matúš, Ostermann Simon, Šimko Marián
- A Survey For Large Language Models In Biomedicine Wang Chong, Li Mengyao, He Junjun, Wang Zhongruo, Darzi Erfan, Chen Zan, Ye Jin, Li Tianbin, Su Yanzhou, Ke Jing, Qu Kaili, Li Shuxin, Yu Yi, Liò Pietro, Wang Tianyun, Wang Yu Guang, Shen Yiqing
- Bioinformatics And Biomedical Informatics With Chatgpt: Year One Review Wang Jinge, Cheng Zien, Yao Qiuming, Liu Li, Xu Dong, Hu Gangqing
- Unique Security And Privacy Threats Of Large Language Model: A Comprehensive Survey Wang Shang, Zhu Tianqing, Liu Bo, Ding Ming, Guo Xu, Ye Dayong, Zhou Wanlei, Yu Philip S.
- Zero-shot Generative Large Language Models For Systematic Review Screening Automation Wang Shuai, Scells Harrisen, Zhuang Shengyao, Potthast Martin, Koopman Bevan, Zuccon Guido
- Factuality Of Large Language Models In The Year 2024 Wang Yuxia, Wang Minghan, Manzoor Muhammad Arslan, Liu Fei, Georgiev Georgi, Das Rocktim Jyoti, Nakov Preslav
- A Comprehensive Survey Of LLM Alignment Techniques: RLHF, RLAIF, PPO, DPO And More Wang Zhichao, Bi Bin, Pentyala Shiva Kumar, Ramnath Kiran, Chaudhuri Sougata, Mehrotra Shubham, Zhu Zixu (James), Mao Xiang-bo, Asur Sitaram, Cheng Na (Claire)
- Are Large Language Models More Empathetic Than Humans? Welivita Anuradha, Pu Pearl
- Memory Is All You Need: An Overview Of Compute-in-memory Architectures For Accelerating Large Language Model Inference Wolters Christopher, Yang Xiaoxuan, Schlichtmann Ulf, Suzumura Toyotaro
- Revealing Fine-grained Values And Opinions In Large Language Models Wright Dustin, Arora Arnav, Borenstein Nadav, Yadav Srishti, Belongie Serge, Augenstein Isabelle
- Generative Monoculture In Large Language Models Wu Fan, Black Emily, Chandrasekaran Varun
- Automated Review Generation Method Based On Large Language Models Wu Shican, Ma Xiao, Luo Dehui, Li Lulu, Shi Xiangcheng, Chang Xin, Lin Xiaoyun, Luo Ran, Pei Chunlei, Zhao Zhi-jian, Gong Jinlong
- Sunnie: An Anthropomorphic Llm-based Conversational Agent For Mental Well-being Activity Recommendation Wu Siyi, Han Feixue, Yao Bingsheng, Xie Tianyi, Zhao Xuan, Wang Dakuo
- Continual Learning For Large Language Models: A Survey Wu Tongtong, Luo Linhao, Li Yuan-fang, Pan Shirui, Vu Thuy-trang, Haffari Gholamreza
- Unlocking Efficiency In Large Language Model Inference: A Comprehensive Survey Of Speculative Decoding Xia Heming, Yang Zhe, Dong Qingxiu, Wang Peiyi, Li Yongqi, Ge Tao, Liu Tianyu, Li Wenjie, Sui Zhifang
- A Comprehensive Survey Of Large Language Models And Multimodal Large Language Models In Medicine Xiao Hanguang, Zhou Feizhong, Liu Xingyue, Liu Tianqi, Li Zhipeng, Liu Xin, Huang Xiaoxuan
- Videoqa In The Era Of Llms: An Empirical Study Xiao Junbin, Huang Nanxin, Qin Hangyu, Li Dongyang, Li Yicong, Zhu Fengbin, Tao Zhulin, Yu Jianxing, Lin Liang, Chua Tat-seng, Yao Angela
- Large Multimodal Agents: A Survey Xie Junlin, Chen Zhihong, Zhang Ruifei, Wan Xiang, Li Guanbin
- Data Efficient Evaluation Of Large Language Models And Text-to-image Models Via Adaptive Sampling Xu Cong, Saranathan Gayathri, Alam Mahammad Parwez, Shah Arpit, Lim James, Wong Soon Yee, Martin Foltin, Bhattacharya Suparna
- Large Language Models For Education: A Survey Xu Hanyi, Gan Wensheng, Qi Zhenlian, Wu Jiayang, Yu Philip S.
- On-device Language Models: A Comprehensive Review Xu Jiajun, Li Zhiyuan, Chen Wei, Wang Qun, Gao Xin, Cai Qi, Ling Ziyuan
- MAPLE: Enhancing Review Generation With Multi-aspect Prompt Learning In Explainable Recommendation Yang Ching-wen, Chen Che Wei, Wu Kun-da, Xu Hao, Yao Jui-feng, Kao Hung-yu
- If LLM Is The Wizard, Then Code Is The Wand: A Survey On How Code Empowers Large Language Models To Serve As Intelligent Agents Yang Ke, Liu Jiateng, Wu John, Yang Chaoqi, Fung Yi R., Li Sha, Huang Zixuan, Cao Xu, Wang Xingyao, Wang Yiquan, Ji Heng, Zhai Chengxiang
- Recent Advances Of Foundation Language Models-based Continual Learning: A Survey Yang Yutao, Zhou Jie, Ding Xuanwen, Huai Tianyu, Liu Shunyu, Chen Qin, He Liang, Xie Yuan
- Jailbreak Attacks And Defenses Against Large Language Models: A Survey Yi Sibo, Liu Yule, Sun Zhen, Cong Tianshuo, He Xinlei, Song Jiaxing, Xu Ke, Li Qi
- A Survey On Recent Advances In Llm-based Multi-turn Dialogue Systems Yi Zihao, Ouyang Jiarui, Liu Yuwen, Liao Tianhao, Xu Zhe, Shen Ying
- When Linear Attention Meets Autoregressive Decoding: Towards More Effective And Efficient Linearized Large Language Models You Haoran, Fu Yichao, Wang Zheng, Yazdanbakhsh Amir, Lin Yingyan Celine
- Mm-llms: Recent Advances In Multimodal Large Language Models Zhang Duzhen, Yu Yahan, Dong Jiahua, Li Chenxing, Su Dan, Chu Chenhui, Yu Dong
- Are Large Language Models Good At Utility Judgments? Zhang Hengran, Zhang Ruqing, Guo Jiafeng, De Rijke Maarten, Fan Yixing, Cheng Xueqi
- LLM As A Mastermind: A Survey Of Strategic Reasoning With Large Language Models Zhang Yadong, Mao Shaoguang, Ge Tao, Wang Xun, De Wynter Adrian, Xia Yan, Wu Wenshan, Song Ting, Lan Man, Wei Furu
- A Survey On The Memory Mechanism Of Large Language Model Based Agents Zhang Zeyu, Bo Xiaohe, Ma Chen, Li Rui, Chen Xu, Dai Quanyu, Zhu Jieming, Dong Zhenhua, Wen Ji-rong
- Revolutionizing Finance With Llms: An Overview Of Applications And Insights Zhao Huaqin, Liu Zhengliang, Wu Zihao, Li Yiwei, Yang Tianze, Shu Peng, Xu Shaochen, Dai Haixing, Zhao Lin, Mai Gengchen, Liu Ninghao, Liu Tianming
- A Survey Of Backdoor Attacks And Defenses On Large Language Models: Implications For Security Measures Zhao Shuai, Jia Meihuizi, Guo Zhongliang, Gan Leilei, Xu Xiaoyu, Wu Xiaobao, Fu Jie, Feng Yichao, Pan Fengjun, Tuan Luu Anh
- Towards Lifelong Learning Of Large Language Models: A Survey Zheng Junhao, Qiu Shengjie, Shi Chengming, Ma Qianli
- Attention Heads Of Large Language Models: A Survey Zheng Zifan, Wang Yezhaohui, Huang Yuxin, Song Shichao, Tang Bo, Xiong Feiyu, Li Zhiyu
- A Survey On Efficient Inference For Large Language Models Zhou Zixuan, Ning Xuefei, Hong Ke, Fu Tianyu, Xu Jiaming, Li Shiyao, Lou Yuming, Wang Luning, Yuan Zhihang, Li Xiuhong, Yan Shengen, Dai Guohao, Zhang Xiao-ping, Dong Yuhan, Wang Yu
- How Privacy-savvy Are Large Language Models? A Case Study On Compliance And Privacy Technical Review Zhu Xichou, Liu Yang, Shen Zhou, Liu Yi, Li Min, Chen Yujun, John Benzi, Ma Zhenzhen, Hu Tao, Yang Bolong, Wang Manman, Xie Zongxing, Liu Peng, Cai Dan, Wang Junhui
🏷 TACL
- A Corpus-free State2seq User Simulator For Task-oriented Dialogue Hou Yutai, Fang Meng, Che Wanxiang, Liu Ting
- Transformers As Soft Reasoners Over Language Clark Peter, Tafjord Oyvind, Richardson Kyle
- Unsupervised Explanation Generation For Machine Reading Comprehension Cui Yiming, Liu Ting, Wang Shijin, Hu Guoping
- Neural Data-to-text Generation With Lm-based Text Augmentation Chang Ernie, Shen Xiaoyu, Zhu Dawei, Demberg Vera, Su Hui
- Multilingual Multi-aspect Explainability Analyses On Machine Reading Comprehension Models Cui Yiming, Zhang Wei-nan, Che Wanxiang, Liu Ting, Chen Zhigang, Wang Shijin
- Tacl: Improving BERT Pre-training With Token-aware Contrastive Learning Su Yixuan, Liu Fangyu, Meng Zaiqiao, Lan Tian, Shu Lei, Shareghi Ehsan, Collier Nigel
- The Devil Is In The Details: On Models And Training Regimes For Few-shot Intent Classification Mesgar Mohsen, Tran Thy Thy, Glavas Goran, Gurevych Iryna
- Galactica: A Large Language Model For Science Ross Taylor, Marcin Kardas, Guillem Cucurull, Thomas Scialom, Anthony Hartshorn, Elvis Saravia, Andrew Poulton, Viktor Kerkez, Robert Stojnic
- Can Large Language Models Be Good Path Planners? A Benchmark And Investigation On Spatial-temporal Reasoning Aghzal Mohamed, Plaku Erion, Yao Ziyu
- Visit-bench: A Benchmark For Vision-language Instruction Following Inspired By Real-world Use Bitton Yonatan, Bansal Hritik, Hessel Jack, Shao Rulin, Zhu Wanrong, Awadalla Anas, Gardner Josh, Taori Rohan, Schmidt Ludwig
- Whose Text Is It Anyway? Exploring Bigcode, Intellectual Property, And Ethics Choksi Madiha Zahrah, Goedicke David
- Efficient And Effective Text Encoding For Chinese Llama And Alpaca Cui Yiming, Yang Ziqing, Yao Xin
- Hallucination Is The Last Thing You Need Curran Shawn, Lansley Sam, Bethell Oliver
- Could A Large Language Model Be Conscious? David J. Chalmers
- DP-OPT: Make Large Language Model Your Privacy-preserving Prompt Engineer Hong Junyuan, Wang Jiachen T., Zhang Chenhui, Li Zhangheng, Li Bo, Wang Zhangyang
- Large Language Model Displays Emergent Ability To Interpret Novel Literary Metaphors Ichien Nicholas, Stamenković Dušan, Holyoak Keith J.
- Calibrated Language Models Must Hallucinate Kalai Adam Tauman, Vempala Santosh S.
- Api-bank: A Comprehensive Benchmark For Tool-augmented Llms Li Minghao, Zhao Yingxiu, Yu Bowen, Song Feifan, Li Hangyu, Yu Haiyang, Li Zhoujun, Huang Fei, Li Yongbin
- Adapting Large Language Models For Education: Foundational Capabilities, Potentials, And Challenges Li Qingyao, Fu Lingyue, Zhang Weiming, Chen Xianyu, Yu Jingwei, Xia Wei, Zhang Weinan, Tang Ruiming, Yu Yong
- Music Understanding Llama: Advancing Text-to-music Generation With Question Answering And Captioning Liu Shansong, Hussain Atin Sakkeer, Sun Chenshuo, Shan Ying
- Trustworthy Llms: A Survey And Guideline For Evaluating Large Language Models' Alignment Liu Yang, Yao Yuanshun, Ton Jean-francois, Zhang Xiaoying, Guo Ruocheng, Cheng Hao, Klochkov Yegor, Taufiq Muhammad Faaiz, Li Hang
- Sparq Attention: Bandwidth-efficient LLM Inference Ribar Luka, Chelombiev Ivan, Hudlass-galley Luke, Blake Charlie, Luschi Carlo, Orr Douglas
- Ruffle&riley: Towards The Automated Induction Of Conversational Tutoring Systems Schmucker Robin, Xia Meng, Azaria Amos, Mitchell Tom
- The First Step Is The Hardest: Pitfalls Of Representing And Tokenizing Temporal Data For Large Language Models Spathis Dimitris, Kawsar Fahim
- Simultaneous Machine Translation With Large Language Models Wang Minghan, Zhao Jinming, Vu Thuy-trang, Shiri Fatemeh, Shareghi Ehsan, Haffari Gholamreza
- LACMA: Language-aligning Contrastive Learning With Meta-actions For Embodied Instruction Following Yang Cheng-fu, Chen Yen-chun, Yang Jianwei, Dai Xiyang, Yuan Lu, Wang Yu-chiang Frank, Chang Kai-wei
- Plan, Eliminate, And Track -- Language Models Are Good Teachers For Embodied Agents Yue Wu, So Yeon Min, Yonatan Bisk, Ruslan Salakhutdinov, Amos Azaria, Yuanzhi Li, Tom Mitchell, Shrimai Prabhumoye
- Unleashing GPT On The Metaverse: Savior Or Destroyer? Zhou Pengyuan
- Ice-score: Instructing Large Language Models To Evaluate Code Zhuo Terry Yue
- Hallucination Of Multimodal Large Language Models: A Survey Bai Zechen, Wang Pichao, Xiao Tianjun, He Tong, Han Zongbo, Zhang Zheng, Shou Mike Zheng
- Towards Boosting Llms-driven Relevance Modeling With Progressive Retrieved Behavior-augmented Prompting Chen Zeyuan, Wu Haiyan, Wu Kaixin, Chen Wei, Zhong Mingjie, Xu Jia, Liu Zhongyi, Zhang Wei
- Developing Generative AI Chatbots Conceptual Framework For Higher Education Chukwuere Joshua Ebere
- Towards A Copilot In BIM Authoring Tool Using A Large Language Model-based Agent For Intelligent Human-machine Interaction Du Changyu, Nousias Stavros, Borrmann André
- Stacking Your Transformers: A Closer Look At Model Growth For Efficient LLM Pre-training Du Wenyu, Luo Tongxu, Qiu Zihan, Huang Zeyu, Shen Yikang, Cheng Reynold, Guo Yike, Fu Jie
- Towards Trustworthy AI: A Review Of Ethical And Robust Large Language Models Ferdaus Md Meftahul, Abdelguerfi Mahdi, Ioup Elias, Niles Kendall N., Pathak Ken, Sloan Steven
- Does Object Grounding Really Reduce Hallucination Of Large Vision-language Models? Geigle Gregor, Timofte Radu, Glavaš Goran
- Breaking The Length Barrier: Llm-enhanced CTR Prediction In Long Textual User Behaviors Geng Binzong, Huan Zhaoxin, Zhang Xiaolu, He Yong, Zhang Liang, Yuan Fajie, Zhou Jun, Mo Linjian
- Rethinking Large Language Model Architectures For Sequential Recommendations Hanbing Wang, Xiaorui Liu, Wenqi Fan, Xiangyu Zhao, Venkataramana Kini, Devendra Yadav, Fei Wang, Zhen Wen, Jiliang Tang, Hui Liu
- Parameter Efficient Diverse Paraphrase Generation Using Sequence-level Knowledge Distillation Jayawardena Lasal, Yapa Prasan
- MEDIQ: Question-asking Llms For Adaptive And Reliable Clinical Reasoning Li Shuyue Stella, Balachandran Vidhisha, Feng Shangbin, Ilgen Jonathan, Pierson Emma, Koh Pang Wei, Tsvetkov Yulia
- Sparser Is Faster And Less Is More: Efficient Sparse Attention For Long-range Transformers Lou Chao, Jia Zixia, Zheng Zilong, Tu Kewei
- Agentboard: An Analytical Evaluation Board Of Multi-turn LLM Agents Ma Chang, Zhang Junlei, Zhu Zhihao, Yang Cheng, Yang Yujiu, Jin Yaohui, Lan Zhenzhong, Kong Lingpeng, He Junxian
- Coursegpt-zh: An Educational Large Language Model Based On Knowledge Distillation Incorporating Prompt Optimization Qu Zheyan, Yin Lu, Yu Zitong, Wang Wenbo, Zhang Xing
- SPARQL Generation: An Analysis On Fine-tuning Openllama For Question Answering Over A Life Science Knowledge Graph Rangel Julio C., De Farias Tarcisio Mendes, Sima Ana Claudia, Kobayashi Norio
- Ruffle&riley: Insights From Designing And Evaluating A Large Language Model-based Conversational Tutoring System Schmucker Robin, Xia Meng, Azaria Amos, Mitchell Tom
- Rapid Optimization For Jailbreaking Llms Via Subconscious Exploitation And Echopraxia Shen Guangyu, Cheng Siyuan, Zhang Kaiyuan, Tao Guanhong, An Shengwei, Yan Lu, Zhang Zhuo, Ma Shiqing, Zhang Xiangyu
- Jetmoe: Reaching Llama2 Performance With 0.1M Dollars Shen Yikang, Guo Zhen, Cai Tianle, Qin Zengyi
- Lumos: Empowering Multimodal Llms With Scene Text Recognition Shenoy Ashish, Lu Yichao, Jayakumar Srihari, Chatterjee Debojeet, Moslehpour Mohsen, Chuang Pierce, Harpale Abhay, Bhardwaj Vikas, Xu Di, Zhao Shicong, Zhao Longfang, Ramchandani Ankit, Dong Xin Luna, Kumar Anuj
- Replan: Robotic Replanning With Perception And Language Models Skreta Marta, Zhou Zihan, Yuan Jia Lin, Darvish Kourosh, Aspuru-guzik Alán, Garg Animesh
- Chatting Up Attachment: Using Llms To Predict Adult Bonds Soares Paulo, Mccurdy Sean, Gerber Andrew J., Fonagy Peter
- Taskgen: A Task-based, Memory-infused Agentic Framework Using Strictjson Tan John Chong Min, Saroj Prince, Runwal Bharat, Maheshwari Hardik, Sheng Brian Lim Yi, Cottrill Richard, Chona Alankrit, Kumar Ambuj, Motani Mehul
- Connecting The Dots: Llms Can Infer And Verbalize Latent Structure From Disparate Training Data Treutlein Johannes, Choi Dami, Betley Jan, Anil Cem, Marks Samuel, Grosse Roger Baker, Evans Owain
- Llms With Industrial Lens: Deciphering The Challenges And Prospects -- A Survey Urlana Ashok, Kumar Charaka Vinayak, Singh Ajeet Kumar, Garlapati Bala Mallikarjunarao, Chalamala Srinivasa Rao, Mishra Rahul
- Learning To Ask: When Llms Meet Unclear Instruction Wang Wenxuan, Shi Juluan, Wang Chaozheng, Lee Cheryl, Yuan Youliang, Huang Jen-tse, Lyu Michael R.
- Factuality Of Large Language Models In The Year 2024 Wang Yuxia, Wang Minghan, Manzoor Muhammad Arslan, Liu Fei, Georgiev Georgi, Das Rocktim Jyoti, Nakov Preslav
- Misconfidence-based Demonstration Selection For LLM In-context Learning Xu Shangqing, Zhang Chao
- 3D Vision And Language Pretraining With Large-scale Synthetic Data Yang Dejie, Xu Zhu, Mo Wentao, Chen Qingchao, Huang Siyuan, Liu Yang
- Sinklora: Enhanced Efficiency And Chat Capabilities For Long-context Large Language Models Zhang Hengyu
- E-bench: Towards Evaluating The Ease-of-use Of Large Language Models Zhang Zhenyu, Hao Bingguang, Li Jinpeng, Zhang Zekai, Zhao Dongyan
🏷 Tokenization
- Dlgnet: A Transformer-based Model For Dialogue Response Generation Olabiyi Oluwatobi, Mueller Erik T.
- What Changes Can Large-scale Language Models Bring? Intensive Study On Hyperclova: Billions-scale Korean Generative Pretrained Transformers Boseop Kim, Hyoungseok Kim, Sang-woo Lee, Gichang Lee, Donghyun Kwak, Dong Hyeon Jeon, Sunghyun Park, Sungju Kim, Seonhoon Kim, Dongpil Seo, Heungsub Lee, Minyoung Jeong, Sungjae Lee, Minsub Kim, Suk Hyun Ko, Seokhun Kim, Taeyong Park, Jinuk Kim, Soyoung Kang, Na-hyeon Ryu, Kang Min Yoo, Minsuk Chang, Soobin Suh, Sookyo In, Jinseong Park, Kyungduk Kim, Hiun Kim, Jisu Jeong, Yong Goo Yeo, Donghoon Ham, Dongju Park, Min Young Lee, Jaewook Kang, Inho Kang, Jung-woo Ha, Woomyoung Park, Nako Sung
- CANINE: Pre-training An Efficient Tokenization-free Encoder For Language Representation Clark Jonathan H., Garrette Dan, Turc Iulia, Wieting John
- LICHEE: Improving Language Model Pre-training With Multi-grained Tokenization Guo Weidong, Zhao Mingjun, Zhang Lusheng, Niu Di, Luo Jinwen, Liu Zhenhua, Li Zhenyang, Tang Jianbo
- ELIT: Emory Language And Information Toolkit He Han, Xu Liyan, Choi Jinho D.
- VX2TEXT: End-to-end Learning Of Video-based Text Generation From Multimodal Inputs Lin Xudong, Bertasius Gedas, Wang Jue, Chang Shih-fu, Parikh Devi, Torresani Lorenzo
- Efficient Domain Adaptation Of Language Models Via Adaptive Tokenization Sachidananda Vin, Kessler Jason S., Lai Yi-an
- Bygpt5: End-to-end Style-conditioned Poetry Generation With Token-free Language Models Belouadi Jonas, Eger Steffen
- Leveraging Large Language Models For Multiple Choice Question Answering Robinson Joshua, Rytting Christopher Michael, Wingate David
- Mgpt: Few-shot Learners Go Multilingual Shliazhko Oleh, Fenogenova Alena, Tikhonova Maria, Mikhailov Vladislav, Kozlova Anastasia, Shavrina Tatiana
- Tokenization Consistency Matters For Generative Models On Extractive NLP Tasks Sun Kaiser, Qi Peng, Zhang Yuhao, Liu Lan, Wang William Yang, Huang Zhiheng
- Tokenizer Choice For LLM Training: Negligible Or Crucial? Ali Mehdi, Fromm Michael, Thellmann Klaudia, Rutmann Richard, Lübbering Max, Leveling Johannes, Klug Katrin, Ebert Jan, Doll Niclas, Buschhoff Jasper Schulze, Jain Charvi, Weber Alexander Arno, Jurkschat Lena, Abdelwahab Hammam, John Chelsea, Suarez Pedro Ortiz, Ostendorff Malte, Weinbach Samuel, Sifa Rafet, Kesselheim Stefan, Flores-herr Nicolas
- Unnatural Error Correction: GPT-4 Can Almost Perfectly Handle Unnatural Scrambled Text Cao Qi, Kojima Takeshi, Matsuo Yutaka, Iwasawa Yusuke
- Tutorials On Stance Detection Using Pre-trained Language Models: Fine-tuning BERT And Prompting Large Language Models Chuang Yun-shiuan
- Bring Your Own Data! Self-supervised Evaluation For Large Language Models Jain Neel, Saifullah Khalid, Wen Yuxin, Kirchenbauer John, Shu Manli, Saha Aniruddha, Goldblum Micah, Geiping Jonas, Goldstein Tom
- Cabrita: Closing The Gap For Foreign Languages Larcher Celio, Piau Marcos, Finardi Paulo, Gengo Pedro, Esposito Piero, Caridá Vinicius
- Video-llava: Learning United Visual Representation By Alignment Before Projection Lin Bin, Ye Yang, Zhu Bin, Cui Jiaxi, Ning Munan, Jin Peng, Yuan Li
- Chipnemo: Domain-adapted Llms For Chip Design Liu Mingjie, Ene Teodor-dumitru, Kirby Robert, Cheng Chris, Pinckney Nathaniel, Liang Rongjian, Alben Jonah, Anand Himyanshu, Banerjee Sanmitra, Bayraktaroglu Ismet, Bhaskaran Bonita, Catanzaro Bryan, Chaudhuri Arjun, Clay Sharon, Dally Bill, Dang Laura, Deshpande Parikshit, Dhodhi Siddhanth, Halepete Sameer, Hill Eric, Hu Jiashang, Jain Sumit, Jindal Ankit, Khailany Brucek, Kokai George, Kunal Kishor, Li Xiaowei, Lind Charley, Liu Hao, Oberman Stuart, Omar Sujeet, Pasandi Ghasem, Pratty Sreedhar, Raiman Jonathan, Sarkar Ambar, Shao Zhengjiang, Sun Hanfei, Suthar Pratik P, Tej Varun, Turner Walker, Xu Kaizhe, Ren Haoxing
- Goat: Fine-tuned Llama Outperforms GPT-4 On Arithmetic Tasks Liu Tiedong, Low Bryan Kian Hsiang
- Tokenization Matters: Navigating Data-scarce Tokenization For Gender Inclusive Language Technologies Ovalle Anaelia, Mehrabi Ninareh, Goyal Palash, Dhamala Jwala, Chang Kai-wei, Zemel Richard, Galstyan Aram, Pinter Yuval, Gupta Rahul
- On The Analysis Of Cross-lingual Prompt Tuning For Decoder-based Multilingual Model Park Nohil, Park Joonsuk, Yoo Kang Min, Yoon Sungroh
- Better Generalization With Semantic Ids: A Case Study In Ranking For Recommendations Singh Anima, Vu Trung, Mehta Nikhil, Keshavan Raghunandan, Sathiamoorthy Maheswaran, Zheng Yilin, Hong Lichan, Heldt Lukasz, Wei Li, Tandon Devansh, Chi Ed H., Yi Xinyang
- Character-level Chinese Backpack Language Models Sun Hao, Hewitt John
- Impact Of Tokenization On Llama Russian Adaptation Tikhomirov Mikhail, Chernyshev Daniil
- Unified Language-vision Pretraining In LLM With Dynamic Discrete Visual Tokenization Jin Yang, Xu Kun, Xu Kun, Chen Liwei, Liao Chao, Tan Jianchao, Huang Quzhe, Chen Bin, Lei Chenyi, Liu An, Song Chengru, Lei Xiaoqiang, Zhang Di, Ou Wenwu, Gai Kun, Mu Yadong
- MEGABYTE: Predicting Million-byte Sequences With Multiscale Transformers Yu Lili, Simig Dániel, Flaherty Colin, Aghajanyan Armen, Zettlemoyer Luke, Lewis Mike
- Tokenization Falling Short: The Curse Of Tokenization Chai Yekun, Fang Yewei, Peng Qiwei, Li Xuhong
- Scaling Properties Of Speech Language Models Cuervo Santiago, Marxer Ricard
- Getting The Most Out Of Your Tokenizer For Pre-training And Domain Adaptation Dagan Gautier, Synnaeve Gabriel, Rozière Baptiste
- Language Adaptation On A Tight Academic Compute Budget: Tokenizer Swapping Works And Pure Bfloat16 Is Enough Dobler Konstantin, De Melo Gerard
- Multilingual Large Language Models And Curse Of Multilinguality Gurgurov Daniil, Bäumel Tanja, Anikina Tatiana
- Super Tiny Language Models Hillier Dylan, Guertler Leon, Tan Cheston, Agrawal Palaash, Chen Ruirui, Cheng Bobby
- Evaluating The Elementary Multilingual Capabilities Of Large Language Models With Multiq Holtermann Carolin, Röttger Paul, Dill Timm, Lauscher Anne
- Retrieval-augmented Code Completion For Local Projects Using Large Language Models Hostnik Marko, Robnik-šikonja Marko
- Integrating Multi-scale Contextualized Information For Byte-based Neural Machine Translation Huang Langlin, Feng Yang
- Video-lavit: Unified Video-language Pre-training With Decoupled Visual-motional Tokenization Jin Yang, Sun Zhicheng, Xu Kun, Xu Kun, Chen Liwei, Jiang Hao, Huang Quzhe, Song Chengru, Liu Yuliang, Zhang Di, Song Yang, Gai Kun, Mu Yadong
- The Remarkable Robustness Of Llms: Stages Of Inference? Lad Vedang, Gurnee Wes, Tegmark Max
- Glitch Tokens In Large Language Models: Categorization Taxonomy And Effective Detection Li Yuxi, Liu Yi, Deng Gelei, Zhang Ying, Song Wenjia, Shi Ling, Wang Kailong, Li Yuekang, Liu Yang, Wang Haoyu
- VP-LLM: Text-driven 3D Volume Completion With Large Language Models Through Patchification Liu Jianmeng, Liu Yichen, Zhang Yuyao, Meng Zeyuan, Tai Yu-wing, Tang Chi-keung
- STORE: Streamlining Semantic Tokenization And Generative Recommendation With A Single LLM Liu Qijiong, Zhu Jieming, Fan Lu, Zhao Zhou, Wu Xiao-ming
- Harnessing Large Language Models For Multimodal Product Bundling Liu Xiaohao, Wu Jie, Tao Zhulin, Ma Yunshan, Wei Yinwei, Chua Tat-seng
- Can Perplexity Predict Fine-tuning Performance? An Investigation Of Tokenization Effects On Sequential Language Models For Nepali Luitel Nishant, Bekoju Nirajan, Sah Anand Kumar, Shakya Subarna
- Too Late To Train, Too Early To Use? A Study On Necessity And Viability Of Low-resource Bengali Llms Mahfuz Tamzeed, Dey Satak Kumar, Naswan Ruwad, Adil Hasnaen, Sayeed Khondker Salman, Shahgir Haz Sameen
- Tokenrec: Learning To Tokenize ID For Llm-based Generative Recommendation Qu Haohao, Fan Wenqi, Zhao Zihuai, Li Qing
- Practical And Reproducible Symbolic Music Generation By Large Language Models With Structural Embeddings Rhyu Seungyeon, Yang Kichang, Cho Sungjun, Kim Jaehyeon, Lee Kyogu, Lee Moontae
- Improving Self Consistency In Llms Through Probabilistic Tokenization Sathe Ashutosh, Aggarwal Divyanshu, Sitaram Sunayana
- Flexibly Scaling Large Language Models Contexts Through Extensible Tokenization Shao Ninglu, Xiao Shitao, Liu Zheng, Zhang Peitian
- An Improved Traditional Chinese Evaluation Suite For Foundation Model Tam Zhi-rui, Pai Ya-ting, Lee Yen-wei, Chen Jun-da, Chu Wei-min, Cheng Sega, Shuai Hong-han
- Tokenize The World Into Object-level Knowledge To Address Long-tail Events In Autonomous Driving Tian Ran, Li Boyi, Weng Xinshuo, Chen Yuxiao, Schmerling Edward, Wang Yue, Ivanovic Boris, Pavone Marco
- Discrete Multimodal Transformers With A Pretrained Large Language Model For Mixed-supervision Speech Processing Trinh Viet Anh, Southwell Rosy, Guan Yiwen, He Xinlu, Wang Zhiyong, Whitehill Jacob
- Tokenization Matters! Degrading Large Language Models Through Challenging Their Tokenization Wang Dixuan, Li Yanda, Jiang Junyuan, Ding Zepeng, Jiang Guochao, Liang Jiaqing, Yang Deqing
- Mambabyte: Token-free Selective State Space Model Wang Junxiong, Gangavarapu Tushaar, Yan Jing Nathan, Rush Alexander M.
- Learnable Item Tokenization For Generative Recommendation Wang Wenjie, Bao Honghui, Lin Xinyu, Zhang Jizhi, Li Yongqi, Feng Fuli, Ng See-kiong, Chua Tat-seng
- Omnijarvis: Unified Vision-language-action Tokenization Enables Open-world Instruction Following Agents Wang Zihao, Cai Shaofei, Mu Zhancun, Lin Haowei, Zhang Ceyao, Liu Xuejie, Li Qing, Liu Anji, Ma Xiaojian, Liang Yitao
- Large Language Model Tokenizer Bias: A Case Study And Solution On Gpt-4o Yang Jin, Wang Zhiqiang, Lin Yanbin, Zhao Zunduo
- Rethinking Tokenization: Crafting Better Tokenizers For Large Language Models Yang Jinbiao
- Cost: Contrastive Quantization Based Semantic Tokenization For Generative Recommendation Zhu Jieming, Jin Mengqun, Liu Qijiong, Qiu Zexuan, Dong Zhenhua, Li Xiu
- BASE TTS: Lessons From Building A Billion-parameter Text-to-speech Model On 100K Hours Of Data Łajszczak Mateusz, Cámbara Guillermo, Li Yang, Beyhan Fatih, Van Korlaar Arent, Yang Fan, Joly Arnaud, Martín-cortinas Álvaro, Abbas Ammar, Michalski Adam, Moinet Alexis, Karlapati Sri, Muszyńska Ewa, Guo Haohan, Putrycz Bartosz, Gambino Soledad López, Yoo Kayeon, Sokolova Elena, Drugman Thomas
🏷 Tools
- Character-level Question Answering With Attention Golub David, He Xiaodong
- Neural Machine Translation Advised By Statistical Machine Translation Wang Xing, Lu Zhengdong, Tu Zhaopeng, Li Hang, Xiong Deyi, Zhang Min
- Topic Aware Neural Response Generation Xing Chen, Wu Wei, Wu Yu, Liu Jie, Huang Yalou, Zhou Ming, Ma Wei-ying
- Sockeye: A Toolkit For Neural Machine Translation Hieber Felix, Domhan Tobias, Denkowski Michael, Vilar David, Sokolov Artem, Clifton Ann, Post Matt
- Best Of Both Worlds: Transferring Knowledge From Discriminative Learning To A Generative Visual Dialog Model Lu Jiasen, Kannan Anitha, Yang Jianwei, Parikh Devi, Batra Dhruv
- Parlai: A Dialog Research Software Platform Miller Alexander H., Feng Will, Fisch Adam, Lu Jiasen, Batra Dhruv, Bordes Antoine, Parikh Devi, Weston Jason
- A Unified Query-based Generative Model For Question Generation And Question Answering Song Linfeng, Wang Zhiguo, Hamza Wael
- Hierarchical Recurrent Attention Network For Response Generation Xing Chen, Wu Wei, Wu Yu, Zhou Ming, Huang Yalou, Ma Wei-ying
- Deepprobe: Information Directed Sequence Understanding And Chatbot Design Via Recurrent Neural Networks Yin Zi, Chang Keng-hao, Zhang Ruofei
- Redecode Framework For Iterative Improvement In Paraphrase Generation Aggarwal Milan, Kumari Nupur, Bansal Ayush, Krishnamurthy Balaji
- The Best Of Both Worlds: Combining Recent Advances In Neural Machine Translation Chen Mia Xu, Firat Orhan, Bapna Ankur, Johnson Melvin, Macherey Wolfgang, Foster George, Jones Llion, Parmar Niki, Schuster Mike, Chen Zhifeng, Wu Yonghui, Hughes Macduff
- Generating Distractors For Reading Comprehension Questions From Real Examinations Gao Yifan, Bing Lidong, Li Piji, King Irwin, Lyu Michael R.
- Transfer Learning Of Language-independent End-to-end ASR With Language Model Fusion Inaguma Hirofumi, Cho Jaejin, Baskar Murali Karthick, Kawahara Tatsuya, Watanabe Shinji
- Answerer In Questioner's Mind: Information Theoretic Approach To Goal-oriented Visual Dialog Lee Sang-woo, Heo Yu-jung, Zhang Byoung-tak
- Global Encoding For Abstractive Summarization Lin Junyang, Sun Xu, Ma Shuming, Su Qi
- Seq2rdf: An End-to-end Application For Deriving Triples From Natural Language Text Liu Yue, Zhang Tongtao, Liang Zhicheng, Ji Heng, Mcguinness Deborah L.
- Like A Baby: Visually Situated Neural Language Acquisition Ororbia Alexander G., Mali Ankur, Kelly Matthew A., Reitter David
- Extending Neural Generative Conversational Model Using External Knowledge Sources Parthasarathi Prasanna, Pineau Joelle
- Training Tips For The Transformer Model Popel Martin, Bojar Ondřej
- Adapting Visual Question Answering Models For Enhancing Multimodal Community Q&A Platforms Srivastava Avikalp, Liu Hsin Wen, Fujita Sumio
- Multilingual NMT With A Language-independent Attention Bridge Vázquez Raúl, Raganato Alessandro, Tiedemann Jörg, Creutz Mathias
- Retrieval-enhanced Adversarial Training For Neural Response Generation Zhu Qingfu, Cui Lei, Zhang Weinan, Wei Furu, Liu Ting
- Abductive Reasoning As Self-supervision For Common Sense Question Answering Aakur Sathyanarayanan N., Sarkar Sudeep
- PLATO: Pre-trained Dialogue Generation Model With Discrete Latent Variable Bao Siqi, He Huang, Wang Fan, Wu Hua, Wang Haifeng
- Meta Answering For Machine Reading Borschinger Benjamin, Boyd-graber Jordan, Buck Christian, Bulian Jannis, Ciaramita Massimiliano, Huebscher Michelle Chen, Gajewski Wojciech, Kilcher Yannic, Nogueira Rodrigo, Saralegu Lierni Sestorain
- Taskmaster-1: Toward A Realistic And Diverse Dialog Dataset Byrne Bill, Krishnamoorthi Karthik, Sankar Chinnadhurai, Neelakantan Arvind, Duckworth Daniel, Yavuz Semih, Goodrich Ben, Dubey Amit, Cedilnik Andy, Kim Kyu-young
- An Empirical Study Of Generation Order For Machine Translation Chan William, Stern Mitchell, Kiros Jamie, Uszkoreit Jakob
- Contrastive Multi-document Question Generation Cho Woon Sang, Zhang Yizhe, Rao Sudha, Celikyilmaz Asli, Xiong Chenyan, Gao Jianfeng, Wang Mengdi, Dolan Bill
- Exploring The Limits Of Transfer Learning With A Unified Text-to-text Transformer Raffel Colin, Shazeer Noam, Roberts Adam, Lee Katherine, Narang Sharan, Matena Michael, Zhou Yanqi, Li Wei, Liu Peter J.
- Heterogeneous Memory Enhanced Multimodal Attention Model For Video Question Answering Fan Chenyou, Zhang Xiaofan, Zhang Shu, Wang Wensheng, Zhang Chi, Huang Heng
- Reweighted Proximal Pruning For Large-scale Language Representation Guo Fu-ming, Liu Sijia, Mungall Finlay S., Lin Xue, Wang Yanzhi
- Resurrecting Submodularity For Neural Text Generation Han Simeng, Lin Xiang, Joty Shafiq
- Analyzing The Forgetting Problem In The Pretrain-finetuning Of Dialogue Response Models He Tianxing, Liu Jun, Cho Kyunghyun, Ott Myle, Liu Bing, Glass James, Peng Fuchun
- Convert: Efficient And Accurate Conversational Representations From Transformers Henderson Matthew, Casanueva Iñigo, Mrkšić Nikola, Su Pei-hao, Wen Tsung-hsien, Vulić Ivan
- Exbert: A Visual Analysis Tool To Explore Learned Representations In Transformers Models Hoover Benjamin, Strobelt Hendrik, Gehrmann Sebastian
- A Corpus-free State2seq User Simulator For Task-oriented Dialogue Hou Yutai, Fang Meng, Che Wanxiang, Liu Ting
- Rap-net: Recurrent Attention Pooling Networks For Dialogue Response Selection Huang Chao-wei, Chiang Ting-rui, Su Shang-yu, Chen Yun-nung
- INSET: Sentence Infilling With Inter-sentential Transformer Huang Yichen, Zhang Yizhe, Elachqar Oussama, Cheng Yu
- Tinybert: Distilling BERT For Natural Language Understanding Jiao Xiaoqi, Yin Yichun, Shang Lifeng, Jiang Xin, Chen Xiao, Li Linlin, Wang Fang, Liu Qun
- A Case Study: Exploiting Neural Machine Translation To Translate CUDA To Opencl Kim Yonghae, Kim Hyesoon
- Learning To Answer By Learning To Ask: Getting The Best Of GPT-2 And BERT Worlds Klein Tassilo, Nabi Moin
- Investigating Multilingual NMT Representations At Scale Kudugunta Sneha Reddy, Bapna Ankur, Caswell Isaac, Arivazhagan Naveen, Firat Orhan
- BART: Denoising Sequence-to-sequence Pre-training For Natural Language Generation, Translation, And Comprehension Lewis Mike, Liu Yinhan, Goyal Naman, Ghazvininejad Marjan, Mohamed Abdelrahman, Levy Omer, Stoyanov Ves, Zettlemoyer Luke
- Visualbert: A Simple And Performant Baseline For Vision And Language Li Liunian Harold, Yatskar Mark, Yin Da, Hsieh Cho-jui, Chang Kai-wei
- Relevance-promoting Language Model For Short-text Conversation Li Xin, Li Piji, Bi Wei, Liu Xiaojiang, Lam Wai
- Explicit Sentence Compression For Neural Machine Translation Li Zuchao, Wang Rui, Chen Kehai, Utiyama Masao, Sumita Eiichiro, Zhang Zhuosheng, Zhao Hai
- MKD: A Multi-task Knowledge Distillation Approach For Pretrained Language Models Liu Linqing, Wang Huan, Lin Jimmy, Socher Richard, Xiong Caiming
- Multi-hop Reading Comprehension Via Deep Reinforcement Learning Based Document Traversal Long Alex, Mason Joel, Blair Alan, Wang Wei
- Good, Better, Best: Textual Distractors Generation For Multiple-choice Visual Question Answering Via Reinforcement Learning Lu Jiaying, Ye Xin, Ren Yi, Yang Yezhou
- Megatron-lm: Training Multi-billion Parameter Language Models Using Model Parallelism Shoeybi Mohammad, Patwary Mostofa, Puri Raul, Legresley Patrick, Casper Jared, Catanzaro Bryan
- A Modular Task-oriented Dialogue System Using A Neural Mixture-of-experts Pei Jiahuan, Ren Pengjie, De Rijke Maarten
- GECOR: An End-to-end Generative Ellipsis And Co-reference Resolution Model For Task-oriented Dialogue Quan Jun, Xiong Deyi, Webber Bonnie, Hu Changjian
- Probing Natural Language Inference Models Through Semantic Fragments Richardson Kyle, Hu Hai, Moss Lawrence S., Sabharwal Ashish
- Masked Language Model Scoring Salazar Julian, Liang Davis, Nguyen Toan Q., Kirchhoff Katrin
- Hierarchical Reinforcement Learning For Open-domain Dialog Saleh Abdelrhman, Jaques Natasha, Ghandeharioun Asma, Shen Judy Hanwen, Picard Rosalind
- Generalization In Generation: A Closer Look At Exposure Bias Schmidt Florian
- Factor Graph Attention Schwartz Idan, Yu Seunghak, Hazan Tamir, Schwing Alexander
- Scene-based Factored Attention For Image Captioning Shen Chen, Ji Rongrong, Chen Fuhai, Sun Xiaoshuai, Li Xiangming
- MASS: Masked Sequence To Sequence Pre-training For Language Generation Song Kaitao, Tan Xu, Qin Tao, Lu Jianfeng, Liu Tie-yan
- Unpaired Cross-lingual Image Caption Generation With Self-supervised Rewards Song Yuqing, Chen Shizhe, Zhao Yida, Jin Qin
- LXMERT: Learning Cross-modality Encoder Representations From Transformers Tan Hao, Bansal Mohit
- Allennlp Interpret: A Framework For Explaining Predictions Of NLP Models Wallace Eric, Tuyls Jens, Wang Junlin, Subramanian Sanjay, Gardner Matt, Singh Sameer
- Theme-aware Generation Model For Chinese Lyrics Wang Jie, Zhao Xinyan
- Adapting And Evaluating A Deep Learning Language Model For Clinical Why-question Answering Wen Andrew, Elwazir Mohamed Y., Moon Sungrim, Fan Jungwei
- Improving Neural Machine Translation With Pre-trained Representation Weng Rongxiang, Yu Heng, Huang Shujian, Luo Weihua, Chen Jiajun
- Attention Is Not Not Explanation Wiegreffe Sarah, Pinter Yuval
- Huggingface's Transformers: State-of-the-art Natural Language Processing Wolf Thomas, Debut Lysandre, Sanh Victor, Chaumond Julien, Delangue Clement, Moi Anthony, Cistac Pierric, Rault Tim, Louf Rémi, Funtowicz Morgan, Davison Joe, Shleifer Sam, Von Platen Patrick, Ma Clara, Jernite Yacine, Plu Julien, Xu Canwen, Scao Teven Le, Gugger Sylvain, Drame Mariama, Lhoest Quentin, Rush Alexander M.
- Global-to-local Memory Pointer Networks For Task-oriented Dialogue Wu Chien-sheng, Socher Richard, Xiong Caiming
- Integrated Triaging For Fast Reading Comprehension Wu Felix, Li Boyi, Wang Lequn, Lao Ni, Blitzer John, Weinberger Kilian Q.
- Alternating Recurrent Dialog Model With Large-scale Pre-trained Language Models Wu Qingyang, Zhang Yichi, Li Yu, Yu Zhou
- Towards Making The Most Of BERT In Neural Machine Translation Yang Jiacheng, Wang Mingxuan, Zhou Hao, Zhao Chengqi, Yu Yong, Zhang Weinan, Li Lei
- Atom Responding Machine For Dialog Generation Zhou Ganbin, Luo Ping, Chen Jingwu, Lin Fen, Lin Leyu, He Qing
- Synchronous Bidirectional Neural Machine Translation Zhou Long, Zhang Jiajun, Zong Chengqing
- Building Task-oriented Visual Dialog Systems Through Alternative Optimization Between Dialog Policy And Language Generation Zhou Mingyang, Arnold Josh, Yu Zhou
- Vision-language Navigation With Self-supervised Auxiliary Reasoning Tasks Zhu Fengda, Zhu Yi, Chang Xiaojun, Liang Xiaodan
- Residual Energy-based Models For Text Bakhtin Anton, Deng Yuntian, Gross Sam, Ott Myle, Ranzato Marc'aurelio, Szlam Arthur
- Chatbot Interaction With Artificial Intelligence: Human Data Augmentation With T5 And Language Transformer Ensemble For Text Classification Bird Jordan J., Ekárt Anikó, Faria Diego R.
- Exploring Fluent Query Reformulations With Text-to-text Transformers And Reinforcement Learning Chen Jerry Zikun, Yu Shi, Wang Haoran
- Dipair: Fast And Accurate Distillation For Trillion-scale Text Matching And Pair Modeling Chen Jiecao, Yang Liu, Raman Karthik, Bendersky Michael, Yeh Jung-jung, Zhou Yun, Najork Marc, Cai Danyang, Emadzadeh Ehsan
- Logical Natural Language Generation From Open-domain Tables Chen Wenhu, Chen Jianshu, Su Yu, Chen Zhiyu, Wang William Yang
- Reasoning In Dialog: Improving Response Generation By Context Reading Comprehension Chen Xiuying, Cui Zhi, Zhang Jiayi, Wei Chen, Cui Jianwei, Wang Bin, Zhao Dongyan, Yan Rui
- Autoadr: Automatic Model Design For Ad Relevance Chen Yiren, Yang Yaming, Sun Hong, Wang Yujing, Xu Yu, Shen Wei, Zhou Rong, Tong Yunhai, Bai Jing, Zhang Ruofei
- Unsupervised Explanation Generation For Machine Reading Comprehension Cui Yiming, Liu Ting, Wang Shijin, Hu Guoping
- Gshard: Scaling Giant Models With Conditional Computation And Automatic Sharding Lepikhin Dmitry, Lee Hyoukjoong, Xu Yuanzhong, Chen Dehao, Firat Orhan, Huang Yanping, Krikun Maxim, Shazeer Noam, Chen Zhifeng
- Relevance Transformer: Generating Concise Code Snippets With Relevance Feedback Gemmell Carlos, Rossetto Federico, Dalton Jeffrey
- IQ-VQA: Intelligent Visual Question Answering Goel Vatsal, Chandak Mohit, Anand Ashish, Guha Prithwijit
- It's Not Greek To Mbert: Inducing Word-level Translations From Multilingual BERT Gonen Hila, Ravfogel Shauli, Elazar Yanai, Goldberg Yoav
- Contextualize Knowledge Bases With Transformer For End-to-end Task-oriented Dialogue Systems Gou Yanjie, Lei Yinjie, Liu Lingqiao, Dai Yong, Shen Chunxu
- Incorporating BERT Into Parallel Sequence Decoding With Adapters Guo Junliang, Zhang Zhirui, Xu Linli, Wei Hao-ran, Chen Boxing, Chen Enhong
- Conversational Machine Comprehension: A Literature Review Gupta Somil, Rawat Bhanu Pratap Singh, Yu Hong
- ECONET: Effective Continual Pretraining Of Language Models For Event Temporal Reasoning Han Rujun, Ren Xiang, Peng Nanyun
- Ranking Enhanced Dialogue Generation Hao Changying, Pang Liang, Lan Yanyan, Sun Fei, Guo Jiafeng, Cheng Xueqi
- Towards Learning A Generic Agent For Vision-and-language Navigation Via Pre-training Hao Weituo, Li Chunyuan, Li Xiujun, Carin Lawrence, Gao Jianfeng
- Exploring Explainable Selection To Control Abstractive Summarization Wang Haonan, Gao Yang, Bai Yu, Lapata Mirella, Huang Heyan
- PAIR: Planning And Iterative Refinement In Pre-trained Transformers For Long Text Generation Hua Xinyu, Wang Lu
- Less Is More: Data-efficient Complex Question Answering Over Knowledge Bases Hua Yuncheng, Li Yuan-fang, Qi Guilin, Wu Wei, Zhang Jingyao, Qi Daiqing
- TRANS-BLSTM: Transformer With Bidirectional LSTM For Language Understanding Huang Zhiheng, Xu Peng, Liang Davis, Mishra Ajay, Xiang Bing
- LIMEADE: From AI Explanations To Advice Taking Lee Benjamin Charles Germain, Downey Doug, Lo Kyle, Weld Daniel S.
- Efficient Transformer-based Large Scale Language Representations Using Hardware-friendly Block Structured Pruning Li Bingbing, Kong Zhenglun, Zhang Tianyun, Li Ji, Li Zhengang, Liu Hang, Ding Caiwen
- Optimus: Organizing Sentences Via Pre-trained Modeling Of A Latent Space Li Chunyuan, Gao Xiang, Li Yuan, Peng Baolin, Li Xiujun, Zhang Yizhe, Gao Jianfeng
- Interpretable NLG For Task-oriented Dialogue Systems With Heterogeneous Rendering Machines Li Yangming, Yao Kaisheng
- Neural Machine Translation With Joint Representation Li Yanyang, Wang Qiang, Xiao Tong, Liu Tongran, Zhu Jingbo
- LRTA: A Transparent Neural-symbolic Reasoning Framework With Modular Supervision For Visual Question Answering Liang Weixin, Niu Feiyang, Reganti Aishwarya, Thattai Govind, Tur Gokhan
- Mintl: Minimalist Transfer Learning For Task-oriented Dialogue Systems Lin Zhaojiang, Madotto Andrea, Winata Genta Indra, Fung Pascale
- TIME: Text And Image Mutual-translation Adversarial Networks Liu Bingchen, Song Kunpeng, Zhu Yizhe, De Melo Gerard, Elgammal Ahmed
- Lite Training Strategies For Portuguese-english And English-portuguese Translation Lopes Alexandre, Nogueira Rodrigo, Lotufo Roberto, Pedrini Helio
- Ladabert: Lightweight Adaptation Of BERT Through Hybrid Model Compression Mao Yihuan, Wang Yujing, Wu Chufan, Zhang Chen, Wang Yang, Yang Yaming, Zhang Quanlu, Tong Yunhai, Bai Jing
- On Incorporating Structural Information To Improve Dialogue Response Generation Moghe Nikita, Vijayan Priyesh, Ravindran Balaraman, Khapra Mitesh M.
- Axformer: Accuracy-driven Approximation Of Transformers For Faster, Smaller And More Accurate NLP Models Nagarajan Amrit, Sen Sanchari, Stevens Jacob R., Raghunathan Anand
- Real-time Execution Of Large-scale Language Models On Mobile Niu Wei, Kong Zhenglun, Yuan Geng, Jiang Weiwen, Guan Jiexiong, Ding Caiwen, Zhao Pu, Liu Sijia, Ren Bin, Wang Yanzhi
- X-linear Attention Networks For Image Captioning Pan Yingwei, Yao Ting, Li Yehao, Mei Tao
- Adapterhub: A Framework For Adapting Transformers Pfeiffer Jonas, Rücklé Andreas, Poth Clifton, Kamath Aishwarya, Vulić Ivan, Ruder Sebastian, Cho Kyunghyun, Gurevych Iryna
- MAD-X: An Adapter-based Framework For Multi-task Cross-lingual Transfer Pfeiffer Jonas, Vulić Ivan, Gurevych Iryna, Ruder Sebastian
- Automatic Distractor Generation For Multiple Choice Questions In Standard Tests Qiu Zhaopeng, Wu Xian, Fan Wei
- Transquest At WMT2020: Sentence-level Direct Assessment Ranasinghe Tharindu, Orasan Constantin, Mitkov Ruslan
- Fine-tuning BERT For Schema-guided Zero-shot Dialogue State Tracking Ruan Yu-ping, Ling Zhen-hua, Gu Jia-chen, Liu Quan
- Hierarchical Transformer For Task Oriented Dialog Systems Santra Bishal, Anusha Potnuru, Goyal Pawan
- Hybrid Generative-retrieval Transformers For Dialogue Domain Adaptation Shalyminov Igor, Sordoni Alessandro, Atkinson Adam, Schulz Hannes
- Russiansuperglue: A Russian Language Understanding Evaluation Benchmark Shavrina Tatiana, Fenogenova Alena, Emelyanov Anton, Shevelev Denis, Artemova Ekaterina, Malykh Valentin, Mikhailov Vladislav, Tikhonova Maria, Chertok Andrey, Evlampiev Andrey
- Lightpaff: A Two-stage Distillation Framework For Pre-training And Fine-tuning Song Kaitao, Sun Hao, Tan Xu, Qin Tao, Lu Jianfeng, Liu Hongzhi, Liu Tie-yan
- Bert-hlstms: BERT And Hierarchical Lstms For Visual Storytelling Su Jing, Dai Qingyun, Guerin Frank, Zhou Mian
- Intellicode Compose: Code Generation Using Transformer Svyatkovskiy Alexey, Deng Shao Kun, Fu Shengyu, Sundaresan Neel
- Edgebert: Sentence-level Energy Optimizations For Latency-aware Multi-task NLP Inference Tambe Thierry, Hooper Coleman, Pentecost Lillian, Jia Tianyu, Yang En-yu, Donato Marco, Sanh Victor, Whatmough Paul N., Rush Alexander M., Brooks David, Wei Gu-yeon
- Vokenization: Improving Language Understanding With Contextualized, Visual-grounded Supervision Tan Hao, Bansal Mohit
- Interpretable Neural Computation For Real-world Compositional Visual Question Answering Tang Ruixue, Ma Chao
- The Language Interpretability Tool: Extensible, Interactive Visualizations And Analysis For NLP Models Tenney Ian, Wexler James, Bastings Jasmijn, Bolukbasi Tolga, Coenen Andy, Gehrmann Sebastian, Jiang Ellen, Pushkarna Mahima, Radebaugh Carey, Reif Emily, Yuan Ann
- Infobert: Improving Robustness Of Language Models From An Information Theoretic Perspective Wang Boxin, Wang Shuohang, Cheng Yu, Gan Zhe, Jia Ruoxi, Li Bo, Liu Jingjing
- HAT: Hardware-aware Transformers For Efficient Natural Language Processing Wang Hanrui, Wu Zhanghao, Liu Zhijian, Cai Han, Zhu Ligeng, Gan Chuang, Han Song
- Lightseq: A High Performance Inference Library For Transformers Wang Xiaohui, Xiong Ying, Wei Yang, Wang Mingxuan, Li Lei
- VD-BERT: A Unified Vision And Dialog Transformer With BERT Wang Yue, Joty Shafiq, Lyu Michael R., King Irwin, Xiong Caiming, Hoi Steven C. H.
- Towards Faithful Neural Table-to-text Generation With Content-matching Constraints Wang Zhenyi, Wang Xiaoyang, An Bang, Yu Dong, Chen Changyou
- On Modality Bias In The TVQA Dataset Winterbottom Thomas, Xiao Sarah, Mclean Alistair, Moubayed Noura Al
- A Controllable Model Of Grounded Response Generation Wu Zeqiu, Galley Michel, Brockett Chris, Zhang Yizhe, Gao Xiang, Quirk Chris, Koncel-kedziorski Rik, Gao Jianfeng, Hajishirzi Hannaneh, Ostendorf Mari, Dolan Bill
- ERNIE-GEN: An Enhanced Multi-flow Pre-training And Fine-tuning Framework For Natural Language Generation Xiao Dongling, Zhang Han, Li Yukun, Sun Yu, Tian Hao, Wu Hua, Wang Haifeng
- Exploring Question-specific Rewards For Generating Deep Questions Xie Yuxi, Pan Liangming, Wang Dongzhe, Kan Min-yen, Feng Yansong
- MC-BERT: Efficient Language Pre-training Via A Meta Controller Xu Zhenhui, Gong Linyuan, Ke Guolin, He Di, Zheng Shuxin, Wang Liwei, Bian Jiang, Liu Tie-yan
- Turingadvice: A Generative And Dynamic Evaluation Of Language Use Zellers Rowan, Holtzman Ari, Clark Elizabeth, Qin Lianhui, Farhadi Ali, Choi Yejin
- Open-domain Dialogue Generation Based On Pre-trained Language Models Zeng Yan, Nie Jian-yun
- Trojaning Language Models For Fun And Profit Zhang Xinyang, Zhang Zheng, Ji Shouling, Wang Ting
- Towards Making The Most Of Context In Neural Machine Translation Zheng Zaixiang, Yue Xiang, Huang Shujian, Chen Jiajun, Birch Alexandra
- Improving Non-autoregressive Neural Machine Translation With Monolingual Data Zhou Jiawei, Keung Phillip
- Pre-training Text-to-text Transformers For Concept-centric Common Sense Zhou Wangchunshu, Lee Dong-ho, Selvam Ravi Kiran, Lee Seyeon, Lin Bill Yuchen, Ren Xiang
- Overcoming Language Priors With Self-supervised Learning For Visual Question Answering Zhu Xi, Mao Zhendong, Liu Chunxiao, Zhang Peng, Wang Bin, Zhang Yongdong
- Goal-directed Story Generation: Augmenting Generative Language Models With Reinforcement Learning Alabdulkarim Amal, Li Winston, Martin Lara J., Riedl Mark O.
- Re-evaluating Germeval17 Using German Pre-trained Language Models Aßenmacher M., Corvonato A., Heumann C.
- On The Universality Of Deep Contextual Language Models Bhatt Shaily, Goyal Poonam, Dandapat Sandipan, Choudhury Monojit, Sitaram Sunayana
- Bridging The Gap Between Language Model And Reading Comprehension: Unsupervised MRC Via Self-supervision Bian Ning, Han Xianpei, Chen Bo, Lin Hongyu, He Ben, Sun Le
- Improving Language Models By Retrieving From Trillions Of Tokens Borgeaud Sebastian, Mensch Arthur, Hoffmann Jordan, Cai Trevor, Rutherford Eliza, Millican Katie, Driessche George Van Den, Lespiau Jean-baptiste, Damoc Bogdan, Clark Aidan, Casas Diego De Las, Guy Aurelia, Menick Jacob, Ring Roman, Hennigan Tom, Huang Saffron, Maggiore Loren, Jones Chris, Cassirer Albin, Brock Andy, Paganini Michela, Irving Geoffrey, Vinyals Oriol, Osindero Simon, Simonyan Karen, Rae Jack W., Elsen Erich, Sifre Laurent
- Topic Transferable Table Question Answering Chemmengath Saneem Ahmed, Kumar Vishwajeet, Bharadwaj Samarth, Sen Jaydeep, Canim Mustafa, Chakrabarti Soumen, Gliozzo Alfio, Sankaranarayanan Karthik
- Reinforcement Learning For Few-shot Text Generation Adaptation Cheng Pengsen, Dai Jinqiao, Liu Jiamiao, Liu Jiayong, Jia Peng
- Unifying Vision-and-language Tasks Via Text Generation Cho Jaemin, Lei Jie, Tan Hao, Bansal Mohit
- Long-range Modeling Of Source Code Files With Ewash: Extended Window Access By Syntax Hierarchy Clement Colin B., Lu Shuai, Liu Xiaoyu, Tufano Michele, Drain Dawn, Duan Nan, Sundaresan Neel, Svyatkovskiy Alexey
- YANMTT: Yet Another Neural Machine Translation Toolkit Dabre Raj, Sumita Eiichiro
- Zero-shot Recommendation As Language Modeling Sileo Damien, Vossen Wout, Raymaekers Robbe
- Compression, Transduction, And Creation: A Unified Framework For Evaluating Natural Language Generation Deng Mingkai, Tan Bowen, Liu Zhengzhong, Xing Eric P., Hu Zhiting
- EKTVQA: Generalized Use Of External Knowledge To Empower Scene Text In Text-vqa Dey Arka Ujjal, Valveny Ernest, Harit Gaurav
- Openprompt: An Open-source Framework For Prompt-learning Ding Ning, Hu Shengding, Zhao Weilin, Chen Yulin, Liu Zhiyuan, Zheng Hai-tao, Sun Maosong
- Not Quite 'ask A Librarian': AI On The Nature, Value, And Future Of LIS Dinneen Jesse David, Bubinger Helen
- Is GPT-3 Text Indistinguishable From Human Text? Scarecrow: A Framework For Scrutinizing Machine Text Dou Yao, Forbes Maxwell, Koncel-kedziorski Rik, Smith Noah A., Choi Yejin
- An Empirical Study Of Training End-to-end Vision-and-language Transformers Dou Zi-yi, Xu Yichong, Gan Zhe, Wang Jianfeng, Wang Shuohang, Wang Lijuan, Zhu Chenguang, Zhang Pengchuan, Yuan Lu, Peng Nanyun, Liu Zicheng, Zeng Michael
- Robustness Challenges In Model Distillation And Pruning For Natural Language Understanding Du Mengnan, Mukherjee Subhabrata, Cheng Yu, Shokouhi Milad, Hu Xia, Awadallah Ahmed Hassan
- GLM: General Language Model Pretraining With Autoregressive Blank Infilling Du Zhengxiao, Qian Yujie, Liu Xiao, Ding Ming, Qiu Jiezhong, Yang Zhilin, Tang Jie
- Robustly Optimized And Distilled Training For Natural Language Understanding Elfadeel Haytham, Peshterliev Stan
- Towards Trustworthy Deception Detection: Benchmarking Model Robustness Across Domains, Modalities, And Languages Glenski Maria, Ayton Ellyn, Cosbey Robin, Arendt Dustin, Volkova Svitlana
- Memory-efficient Transformers Via Top-k Attention Gupta Ankit, Dar Guy, Goodman Shaya, Ciprut David, Berant Jonathan
- Greedy Gradient Ensemble For Robust Visual Question Answering Han Xinzhe, Wang Shuhui, Su Chi, Huang Qingming, Tian Qi
- Text2app: A Framework For Creating Android Apps From Text Descriptions Hasan Masum, Mehrab Kazi Sajeed, Ahmad Wasi Uddin, Shahriyar Rifat
- ELIT: Emory Language And Information Toolkit He Han, Xu Liyan, Choi Jinho D.
- Generate, Annotate, And Learn: NLP With Synthetic Text He Xuanli, Nassar Islam, Kiros Jamie, Haffari Gholamreza, Norouzi Mohammad
- Controllable Dialogue Generation With Disentangled Multi-grained Style Specification And Attribute Consistency Reward Hu Zhe, Cao Zhiwei, Chan Hou Pong, Liu Jiachen, Xiao Xinyan, Su Jinsong, Wu Hua
- Transfer Learning For Sequence Generation: From Single-source To Multi-source Huang Xuancheng, Xu Jingfang, Sun Maosong, Liu Yang
- Unifying Multimodal Transformer For Bi-directional Image And Text Generation Huang Yupan, Xue Hongwei, Liu Bei, Lu Yutong
- DS-TOD: Efficient Domain Specialization For Task Oriented Dialog Hung Chia-chien, Lauscher Anne, Ponzetto Simone Paolo, Glavaš Goran
- A Neural Conversation Generation Model Via Equivalent Shared Memory Investigation Ji Changzhen, Zhang Yating, Liu Xiaozhong, Jatowt Adam, Sun Changlong, Zhu Conghui, Zhao Tiejun
- Improving Non-autoregressive Generation With Mixup Training Jiang Ting, Huang Shaohan, Zhang Zihan, Wang Deqing, Zhuang Fuzhen, Wei Furu, Huang Haizhen, Zhang Liangjie, Zhang Qi
- Amazon Sagemaker Model Parallelism: A General And Flexible Framework For Large Model Training Karakus Can, Huilgol Rahul, Wu Fei, Subramanian Anirudh, Daniel Cade, Cavdar Derya, Xu Teng, Chen Haohan, Rahnama Arash, Quintela Luis
- L-verse: Bidirectional Generation Between Image And Text Kim Taehoon, Song Gwangmo, Lee Sihaeng, Kim Sangyun, Seo Yewon, Lee Soonyoung, Kim Seung Hwan, Lee Honglak, Bae Kyunghoon
- Can Transformer Language Models Predict Psychometric Properties? Laverghetta Antonio Jr., Nighojkar Animesh, Mirzakhalov Jamshidbek, Licato John
- Quiz-style Question Generation For News Stories Lelkes Adam D., Tran Vinh Q., Yu Cong
- Semvlp: Vision-language Pre-training By Aligning Semantics At Multiple Levels Li Chenliang, Yan Ming, Xu Haiyang, Luo Fuli, Wang Wei, Bi Bin, Huang Songfang
- Self-supervised Bot Play For Conversational Recommendation With Justifications Li Shuyang, Majumder Bodhisattwa Prasad, Mcauley Julian
- Supervision Exists Everywhere: A Data Efficient Contrastive Language-image Pre-training Paradigm Li Yangguang, Liang Feng, Zhao Lichen, Cui Yufeng, Ouyang Wanli, Shao Jing, Yu Fengwei, Yan Junjie
- Scheduled Sampling In Vision-language Pretraining With Decoupled Encoder-decoder Network Li Yehao, Pan Yingwei, Yao Ting, Chen Jingwen, Mei Tao
- Terapipe: Token-level Pipeline Parallelism For Training Large-scale Language Models Li Zhuohan, Zhuang Siyuan, Guo Shiyuan, Zhuo Danyang, Zhang Hao, Song Dawn, Stoica Ion
- Learning Neural Templates For Recommender Dialogue System Liang Zujie, Hu Huang, Xu Can, Miao Jian, He Yingying, Chen Yining, Geng Xiubo, Liang Fan, Jiang Daxin
- MDQE: A More Accurate Direct Pretraining For Machine Translation Quality Estimation Lin Lei
- VX2TEXT: End-to-end Learning Of Video-based Text Generation From Multimodal Inputs Lin Xudong, Bertasius Gedas, Wang Jue, Chang Shih-fu, Parikh Devi, Torresani Lorenzo
- Codeqa: A Question Answering Dataset For Source Code Comprehension Liu Chenxiao, Wan Xiaojun
- Towards Automated Psychotherapy Via Language Modeling Liu Houjun
- A Three-stage Learning Framework For Low-resource Knowledge-grounded Dialogue Generation Liu Shilei, Zhao Xiaofeng, Li Bochao, Ren Feiliang, Zhang Longhui, Yin Shujuan
- Unified Multimodal Pre-training And Prompt-based Tuning For Vision-language Understanding And Generation Liu Tianyi, Wu Zuxuan, Xiong Wenhan, Chen Jingjing, Jiang Yu-gang
- Augmenting Sequential Recommendation With Pseudo-prior Items Via Reversely Pre-training Transformer Liu Zhiwei, Fan Ziwei, Wang Yu, Yu Philip S.
- Learning To Ask Conversational Questions By Optimizing Levenshtein Distance Liu Zhongkun, Ren Pengjie, Chen Zhumin, Ren Zhaochun, De Rijke Maarten, Zhou Ming
- X-METRA-ADA: Cross-lingual Meta-transfer Learning Adaptation To Natural Language Understanding And Question Answering M'hamdi Meryem, Kim Doo Soon, Dernoncourt Franck, Bui Trung, Ren Xiang, May Jonathan
- Deltalm: Encoder-decoder Pre-training For Language Generation And Translation By Augmenting Pretrained Multilingual Encoders Ma Shuming, Dong Li, Huang Shaohan, Zhang Dongdong, Muzio Alexandre, Singhal Saksham, Awadalla Hany Hassan, Song Xia, Wei Furu
- COCO-LM: Correcting And Contrasting Text Sequences For Language Model Pretraining Meng Yu, Xiong Chenyan, Bajaj Payal, Tiwary Saurabh, Bennett Paul, Han Jiawei, Song Xia
- CINS: Comprehensive Instruction For Few-shot Learning In Task-oriented Dialog Systems Mi Fei, Li Yitong, Wang Yasheng, Jiang Xin, Liu Qun
- Distantly Supervised Transformers For E-commerce Product QA Mittal Happy, Chakrabarti Aniket, Bayar Belhassen, Sharma Animesh Anant, Rasiwasia Nikhil
- Arat5: Text-to-text Transformers For Arabic Language Generation Nagoudi El Moatez Billah, Elmadany Abdelrahim, Abdul-mageed Muhammad
- Improving Transformers With Probabilistic Attention Keys Nguyen Tam, Nguyen Tan M., Le Dung D., Nguyen Duy Khuong, Tran Viet-anh, Baraniuk Richard G., Ho Nhat, Osher Stanley J.
- Progressive Transformer-based Generation Of Radiology Reports Nooralahzadeh Farhad, Gonzalez Nicolas Perez, Frauenfelder Thomas, Fujimoto Koji, Krauthammer Michael
- Pre-train, Prompt, And Predict: A Systematic Survey Of Prompting Methods In Natural Language Processing Pengfei Liu, Weizhe Yuan, Jinlan Fu, Zhengbao Jiang, Hiroaki Hayashi, Graham Neubig
- Context Transformer With Stacked Pointer Networks For Conversational Question Answering Over Knowledge Graphs Plepi Joan, Kacupaj Endri, Singh Kuldeep, Thakkar Harsh, Lehmann Jens
- ORCHARD: A Benchmark For Measuring Systematic Generalization Of Multi-hierarchical Reasoning Pung Bill Tuck Weng, Chan Alvin
- Assessing The Syntactic Capabilities Of Transformer-based Multilingual Language Models Pérez-mayos Laura, García Alba Táboas, Mille Simon, Wanner Leo
- LFPT5: A Unified Framework For Lifelong Few-shot Language Learning Based On Prompt Tuning Of T5 Qin Chengwei, Joty Shafiq
- Vinmt: Neural Machine Translation Toolkit Quan Nguyen Hoang, Dat Nguyen Thanh, Cong Nguyen Hoang Minh, Van Vinh Nguyen, Vinh Ngo Thi, Thai Nguyen Phuong, Viet Tran Hong
- Grid Search Hyperparameter Benchmarking Of BERT, ALBERT, And Longformer On Duorc Quijano Alex John, Nguyen Sam, Ordonez Juanita
- Multiversal Views On Language Models Reynolds Laria, Mcdonell Kyle
- Challenges In Procedural Multimodal Machine Comprehension: A Novel Way To Benchmark Sahu Pritish, Sikka Karan, Divakaran Ajay
- Metaicl: Learning To Learn In Context Sewon Min, Mike Lewis, Luke Zettlemoyer, Hannaneh Hajishirzi
- Societal Biases In Language Generation: Progress And Challenges Sheng Emily, Chang Kai-wei, Natarajan Premkumar, Peng Nanyun
- ERNIE 3.0 Titan: Exploring Larger-scale Knowledge Enhanced Pre-training For Language Understanding And Generation Shuohuan Wang, Yu Sun, Yang Xiang, Zhihua Wu, Siyu Ding, Weibao Gong, Shikun Feng, Junyuan Shang, Yanbin Zhao, Chao Pang, Jiaxiang Liu, Xuyi Chen, Yuxiang Lu, Weixin Liu, Xi Wang, Yangfan Bai, Qiuliang Chen, Li Zhao, Shiyong Li, Peng Sun, Dianhai Yu, Yanjun Ma, Hao Tian, Hua Wu, Tian Wu, Wei Zeng, Ge Li, Wen Gao, Haifeng Wang
- Pre-trained Language Models As Prior Knowledge For Playing Text-based Games Singh Ishika, Singh Gargi, Modi Ashutosh
- Primer: Searching For Efficient Transformers For Language Modeling So David R., Mańke Wojciech, Liu Hanxiao, Dai Zihang, Shazeer Noam, Le Quoc V.
- Corelm: Coreference-aware Language Model Fine-tuning Stylianou Nikolaos, Vlahavas Ioannis
- Ernie-tiny: A Progressive Distillation Framework For Pretrained Transformer Compression Su Weiyue, Chen Xuyi, Feng Shikun, Liu Jiaxiang, Liu Weixin, Sun Yu, Tian Hao, Wu Hua, Wang Haifeng
- A Non-hierarchical Attention Network With Modality Dropout For Textual Response Generation In Multimodal Dialogue Systems Sun Rongyi, Chen Borun, Zhou Qingyu, Li Yinghui, Cao Yunbo, Zheng Hai-tao
- ERNIE 3.0: Large-scale Knowledge Enhanced Pre-training For Language Understanding And Generation Sun Yu, Wang Shuohuan, Feng Shikun, Ding Siyu, Pang Chao, Shang Junyuan, Liu Jiaxiang, Chen Xuyi, Zhao Yanbin, Lu Yuxiang, Liu Weixin, Wu Zhihua, Gong Weibao, Liang Jianzhong, Shang Zhizhou, Sun Peng, Liu Wei, Ouyang Xuan, Yu Dianhai, Tian Hao, Wu Hua, Wang Haifeng
- Diff-explainer: Differentiable Convex Optimization For Explainable Multi-hop Inference Thayaparan Mokanarangan, Valentino Marco, Ferreira Deborah, Rozanova Julia, Freitas André
- Exploring Transformers In Natural Language Generation: GPT, BERT, And Xlnet Topal M. Onat, Bas Anil, Van Heerden Imke
- Multimodal Few-shot Learning With Frozen Language Models Tsimpoukelli Maria, Menick Jacob, Cabi Serkan, Eslami S. M. Ali, Vinyals Oriol, Hill Felix
- Learning Better Visual Dialog Agents With Pretrained Visual-linguistic Representation Tu Tao, Ping Qing, Thattai Govind, Tur Gokhan, Natarajan Prem
- Convfit: Conversational Fine-tuning Of Pretrained Language Models Vulić Ivan, Su Pei-hao, Coope Sam, Gerz Daniela, Budzianowski Paweł, Casanueva Iñigo, Mrkšić Nikola, Wen Tsung-hsien
- Topicrefine: Joint Topic Prediction And Dialogue Response Generation For Multi-turn End-to-end Dialogue System Wang Hongru, Cui Mingyu, Zhou Zimo, Fung Gabriel Pui Cheong, Wong Kam-fai
- Sentence Semantic Regression For Text Generation Wang Wei, Li Piji, Zheng Hai-tao
- Codet5: Identifier-aware Unified Pre-trained Encoder-decoder Models For Code Understanding And Generation Wang Yue, Wang Weishi, Joty Shafiq, Hoi Steven C. H.
- Distilled Dual-encoder Model For Vision-language Understanding Wang Zekun, Wang Wenhui, Zhu Haichao, Liu Ming, Qin Bing, Wei Furu
- Training Multilingual Pre-trained Language Model With Byte-level Subwords Wei Junqiu, Liu Qun, Guo Yinpeng, Jiang Xin
- A Multilingual Modeling Method For Span-extraction Reading Comprehension Wu Gaochen, Xu Bin, Chang Dejie, Liu Bangchang
- Improving Low-resource Reading Comprehension Via Cross-lingual Transposition Rethinking Wu Gaochen, Xu Bin, Qin Yuxin, Kong Fei, Liu Bangchang, Zhao Hongwen, Chang Dejie
- Training Large-scale News Recommenders With Pretrained Language Models In The Loop Xiao Shitao, Liu Zheng, Shao Yingxia, Di Tao, Xie Xing
- E2E-VLP: End-to-end Vision-language Pre-training Enhanced By Visual Learning Xu Haiyang, Yan Ming, Li Chenliang, Bi Bin, Huang Songfang, Xiao Wenming, Huang Fei
- From Dense To Sparse: Contrastive Pruning For Better Pre-trained Language Model Compression Xu Runxin, Luo Fuli, Wang Chengyu, Chang Baobao, Huang Jun, Huang Songfang, Huang Fei
- Fastseq: Make Sequence Generation Faster Yan Yu, Hu Fei, Chen Jiusheng, Bhendawade Nikhil, Ye Ting, Gong Yeyun, Duan Nan, Cui Desheng, Chi Bingyu, Zhang Ruofei
- NT5?! Training T5 To Perform Numerical Reasoning Yang Peng-jian, Chen Ying Ting, Chen Yuechan, Cer Daniel
- Learning To Generate Task-specific Adapters From Task Description Ye Qinyuan, Ren Xiang
- Improving Sequential Recommendation Consistency With Self-supervised Imitation Yuan Xu, Chen Hongshen, Song Yonghao, Zhao Xiaofang, Ding Zhuoye, He Zhen, Long Bo
- Bert-coqac: Bert-based Conversational Question Answering In Context Zaib Munazza, Tran Dai Hoang, Sagar Subhash, Mahmood Adnan, Zhang Wei E., Sheng Quan Z.
- Ernie-vilg: Unified Generative Pre-training For Bidirectional Vision-language Generation Zhang Han, Yin Weichong, Fang Yewei, Li Lanxin, Duan Boqiang, Wu Zhihua, Sun Yu, Tian Hao, Wu Hua, Wang Haifeng
- A Knowledge-grounded Dialog System Based On Pre-trained Language Models Zhang Weijie, Chen Jiaoxuan, Wu Haipang, Wan Sanhui, Li Gongfeng
- Retgen: A Joint Framework For Retrieval And Grounded Text Generation Modeling Zhang Yizhe, Sun Siqi, Gao Xiang, Fang Yuwei, Brockett Chris, Galley Michel, Gao Jianfeng, Dolan Bill
- Fewnlu: Benchmarking State-of-the-art Methods For Few-shot Natural Language Understanding Zheng Yanan, Zhou Jing, Qian Yujie, Ding Ming, Liao Chonghua, Li Jian, Salakhutdinov Ruslan, Tang Jie, Ruder Sebastian, Yang Zhilin
- Thinking Clearly, Talking Fast: Concept-guided Non-autoregressive Generation For Open-domain Dialogue Systems Zou Yicheng, Liu Zhihua, Hu Xingwu, Zhang Qi
- Vl-interpret: An Interactive Visualization Tool For Interpreting Vision-language Transformers Aflalo Estelle, Du Meng, Tseng Shao-yen, Liu Yongfei, Wu Chenfei, Duan Nan, Lal Vasudev
- Making Large Language Models Interactive: A Pioneer Study On Supporting Complex Information-seeking Tasks With Implicit Constraints Ahmadvand Ali, Arabzadeh Negar, Kiseleva Julia, Sanz Patricio Figueroa, Deng Xin, Jauhar Sujay, Gamon Michael, Agichtein Eugene, Friend Ned, Aniruddha
- Few-shot Training Llms For Project-specific Code-summarization Ahmed Toufique, Devanbu Premkumar
- Solving Quantitative Reasoning Problems With Language Models Aitor Lewkowycz, Anders Andreassen, David Dohan, Ethan Dyer, Henryk Michalewski, Vinay Ramasesh, Ambrose Slone, Cem Anil, Imanol Schlag, Theo Gutman-solo, Yuhuai Wu, Behnam Neyshabur, Guy Gur-ari, Vedant Misra
- End-to-end Multimodal Representation Learning For Video Dialog Alamri Huda, Bilic Anthony, Hu Michael, Beedu Apoorva, Essa Irfan
- Prompting As Probing: Using Language Models For Knowledge Base Construction Alivanistos Dimitrios, Santamaría Selene Báez, Cochez Michael, Kalo Jan-christoph, Van Krieken Emile, Thanapalasingam Thiviyan
- Socratic Models: Composing Zero-shot Multimodal Reasoning With Language Andy Zeng, Maria Attarian, Brian Ichter, Krzysztof Choromanski, Adrian Wong, Stefan Welker, Federico Tombari, Aveek Purohit, Michael Ryoo, Vikas Sindhwani, Johnny Lee, Vincent Vanhoucke, Pete Florence
- Selection-inference: Exploiting Large Language Models For Interpretable Logical Reasoning Antonia Creswell, Murray Shanahan, Irina Higgins
- Characterizing Verbatim Short-term Memory In Neural Language Models Armeni Kristijan, Honey Christopher, Linzen Tal
- Parameter-efficient Finetuning Of Transformers For Source Code Ayupov Shamil, Chirkova Nadezhda
- Enabling Classifiers To Make Judgements Explicitly Aligned With Human Values Bang Yejin, Yu Tiezheng, Madotto Andrea, Lin Zhaojiang, Diab Mona, Fung Pascale
- Logical Reasoning For Task Oriented Dialogue Systems Beygi Sajjad, Fazel-zarandi Maryam, Cervone Alessandra, Krishnan Prakash, Jonnalagadda Siddhartha Reddy
- I2D2: Inductive Knowledge Distillation With Neurologic And Self-imitation Bhagavatula Chandra, Hwang Jena D., Downey Doug, Bras Ronan Le, Lu Ximing, Qin Lianhui, Sakaguchi Keisuke, Swayamdipta Swabha, West Peter, Choi Yejin
- Vector-quantized Input-contextualized Soft Prompts For Natural Language Understanding Bhardwaj Rishabh, Saha Amrita, Hoi Steven C. H., Poria Soujanya
- Using Cognitive Psychology To Understand GPT-3 Binz Marcel, Schulz Eric
- Petals: Collaborative Inference And Fine-tuning Of Large Models Borzunov Alexander, Baranchuk Dmitry, Dettmers Tim, Ryabinin Max, Belkada Younes, Chumachenko Artem, Samygin Pavel, Raffel Colin
- Character-centric Story Visualization Via Visual Planning And Token Alignment Chen Hong, Han Rujun, Wu Te-lin, Nakayama Hideki, Peng Nanyun
- Xdoc: Unified Pre-training For Cross-format Document Understanding Chen Jingye, Lv Tengchao, Cui Lei, Zhang Cha, Wei Furu
- Imputing Out-of-vocabulary Embeddings With LOVE Makes Language Models Robust With Little Cost Chen Lihu, Varoquaux Gaël, Suchanek Fabian M.
- Leveraging Natural Supervision For Language Representation Learning And Generation Chen Mingda
- Pali: A Jointly-scaled Multilingual Language-image Model Chen Xi, Wang Xiao, Changpinyo Soravit, Piergiovanni Aj, Padlewski Piotr, Salz Daniel, Goodman Sebastian, Grycner Adam, Mustafa Basil, Beyer Lucas, Kolesnikov Alexander, Puigcerver Joan, Ding Nan, Rong Keran, Akbari Hassan, Mishra Gaurav, Xue Linting, Thapliyal Ashish, Bradbury James, Kuo Weicheng, Seyedhosseini Mojtaba, Jia Chao, Ayan Burcu Karagol, Riquelme Carlos, Steiner Andreas, Angelova Anelia, Zhai Xiaohua, Houlsby Neil, Soricut Radu
- DFM: Dialogue Foundation Model For Universal Large-scale Dialogue-oriented Task Learning Chen Zhi, Bao Jijia, Chen Lu, Liu Yuncong, Ma Da, Chen Bei, Wu Mengyue, Zhu Su, Dong Xin, Ge Fujiang, Miao Qingliang, Lou Jian-guang, Yu Kai
- Is Multiwoz A Solved Task? An Interactive TOD Evaluation Framework With User Simulator Cheng Qinyuan, Li Linyang, Quan Guofeng, Gao Feng, Mou Xiaofeng, Qiu Xipeng
- Neural Machine Translation With Contrastive Translation Memories Cheng Xin, Gao Shen, Liu Lemao, Zhao Dongyan, Yan Rui
- Prompting GPT-3 To Be Reliable Chenglei Si, Zhe Gan, Zhengyuan Yang, Shuohang Wang, Jianfeng Wang, Jordan Boyd-graber, Lijuan Wang
- LSG Attention: Extrapolation Of Pretrained Transformers To Long Sequences Condevaux Charles, Harispe Sébastien
- Decoder Tuning: Efficient Language Understanding As Decoding Cui Ganqu, Li Wentao, Ding Ning, Huang Longtao, Liu Zhiyuan, Sun Maosong
- A Unified Multi-task Learning Framework For Multi-goal Conversational Recommender Systems Deng Yang, Zhang Wenxuan, Xu Weiwen, Lei Wenqiang, Chua Tat-seng, Lam Wai
- Black-box Prompt Learning For Pre-trained Language Models Diao Shizhe, Huang Zhichao, Xu Ruijia, Li Xuechun, Lin Yong, Zhou Xiao, Zhang Tong
- Write And Paint: Generative Vision-language Models Are Unified Modal Learners Diao Shizhe, Zhou Wangchunshu, Zhang Xinsong, Wang Jiawei
- Corrpus: Code-based Structured Prompting For Neurosymbolic Story Understanding Dong Yijiang River, Martin Lara J., Callison-burch Chris
- Contrastive Learning With Bidirectional Transformers For Sequential Recommendation Du Hanwen, Shi Hui, Zhao Pengpeng, Wang Deqing, Sheng Victor S., Liu Yanchi, Liu Guanfeng, Zhao Lei
- Codegen: An Open Large Language Model For Code With Multi-turn Program Synthesis Erik Nijkamp, Bo Pang, Hiroaki Hayashi, Lifu Tu, Huan Wang, Yingbo Zhou, Silvio Savarese, Caiming Xiong
- Hungry Hungry Hippos: Towards Language Modeling With State Space Models Fu Daniel Y., Dao Tri, Saab Khaled K., Thomas Armin W., Rudra Atri, Ré Christopher
- MIGA: A Unified Multi-task Generation Framework For Conversational Text-to-sql Fu Yingwen, Ou Wenjie, Yu Zhou, Lin Yue
- Normsage: Multi-lingual Multi-cultural Norm Discovery From Conversations On-the-fly Fung Yi R., Chakraborty Tuhin, Guo Hao, Rambow Owen, Muresan Smaranda, Ji Heng
- WAVPROMPT: Towards Few-shot Spoken Language Understanding With Frozen Language Models Gao Heting, Ni Junrui, Qian Kaizhi, Zhang Yang, Chang Shiyu, Hasegawa-johnson Mark
- Generating Coherent Narratives By Learning Dynamic And Discrete Entity States With A Contrastive Framework Guan Jian, Yang Zhenyu, Zhang Rongsheng, Hu Zhipeng, Huang Minlie
- Unixcoder: Unified Cross-modal Pre-training For Code Representation Guo Daya, Lu Shuai, Duan Nan, Wang Yanlin, Zhou Ming, Yin Jian
- Show, Don't Tell: Demonstrations Outperform Descriptions For Schema-guided Task-oriented Dialogue Gupta Raghav, Lee Harrison, Zhao Jeffrey, Rastogi Abhinav, Cao Yuan, Wu Yonghui
- LUNA: Language Understanding With Number Augmentations On Transformers Via Number Plugins And Pre-training Han Hongwei, Xu Jialiang, Zhou Mengyu, Shao Yijia, Han Shi, Zhang Dongmei
- Understanding Transformer Memorization Recall Through Idioms Haviv Adi, Cohen Ido, Gidron Jacob, Schuster Roei, Goldberg Yoav, Geva Mor
- VLMAE: Vision-language Masked Autoencoder He Sunan, Guo Taian, Dai Tao, Qiao Ruizhi, Wu Chen, Shu Xiujun, Ren Bo
- Bundle MCR: Towards Conversational Bundle Recommendation He Zhankui, Zhao Handong, Yu Tong, Kim Sungchul, Du Fan, Mcauley Julian
- Prompt-to-prompt Image Editing With Cross Attention Control Hertz Amir, Mokady Ron, Tenenbaum Jay, Aberman Kfir, Pritch Yael, Cohen-or Daniel
- Compressing Pre-trained Transformers Via Low-bit Nxm Sparsity For Natural Language Understanding Holmes Connor, Zhang Minjia, He Yuxiong, Wu Bo
- XDBERT: Distilling Visual Information To BERT From Cross-modal Systems To Improve Language Understanding Hsu Chan-jan, Lee Hung-yi, Tsao Yu
- Dialogue Meaning Representation For Task-oriented Dialogue Systems Hu Xiangkun, Dai Junqi, Yan Hang, Zhang Yi, Guo Qipeng, Qiu Xipeng, Zhang Zheng
- PLANET: Dynamic Content Planning In Autoregressive Transformers For Long-form Text Generation Hu Zhe, Chan Hou Pong, Liu Jiachen, Xiao Xinyan, Wu Hua, Huang Lifu
- Personalized Dialogue Generation With Persona-adaptive Attention Huang Qiushi, Zhang Yu, Ko Tom, Liu Xubo, Wu Bo, Wang Wenwu, Tang Lilian
- Detecting Unintended Memorization In Language-model-fused ASR Huang W. Ronny, Chien Steve, Thakkar Om, Mathews Rajiv
- MCP: Self-supervised Pre-training For Personalized Chatbots With Multi-level Contrastive Sampling Huang Zhaoheng, Dou Zhicheng, Zhu Yutao, Ma Zhengyi
- Clues Before Answers: Generation-enhanced Multiple-choice QA Huang Zixian, Wu Ao, Zhou Jiaying, Gu Yu, Zhao Yue, Cheng Gong
- Welm: A Well-read Pre-trained Language Model For Chinese Hui Su, Xiao Zhou, Houjin Yu, Xiaoyu Shen, Yuwen Chen, Zilin Zhu, Yang Yu, Jie Zhou
- Do Berts Learn To Use Browser User Interface? Exploring Multi-step Tasks With Unified Vision-and-language Berts Iki Taichi, Aizawa Akiko
- Contrastive Learning For Prompt-based Few-shot Language Learners Jian Yiren, Gao Chongyang, Vosoughi Soroush
- Self-prompting Large Language Models For Zero-shot Open-domain QA Junlong Li, Jinyuan Wang, Zhuosheng Zhang, Hai Zhao
- Action-gpt: Leveraging Large-scale Language Models For Improved And Generalized Action Generation Kalakonda Sai Shashank, Maheshwari Shubh, Sarvadevabhatla Ravi Kiran
- Decomposed Prompting: A Modular Approach For Solving Complex Tasks Khot Tushar, Trivedi Harsh, Finlayson Matthew, Fu Yao, Richardson Kyle, Clark Peter, Sabharwal Ashish
- Who Says Elephants Can't Run: Bringing Large Scale Moe Models Into Cloud Scale Production Kim Young Jin, Henry Rawn, Fahim Raffy, Awadalla Hany Hassan
- Emergent Linguistic Structures In Neural Networks Are Fragile La Malfa Emanuele, Wicker Matthew, Kwiatkowska Marta
- Evaluation Of Synthetic Datasets For Conversational Recommender Systems Lara Harsh, Tiwari Manoj
- Internet-augmented Language Models Through Few-shot Prompting For Open-domain Question Answering Lazaridou Angeliki, Gribovskaya Elena, Stokowiec Wojciech, Grigorev Nikolai
- Evaluating Human-language Model Interaction Lee Mina, Srivastava Megha, Hardy Amelia, Thickstun John, Durmus Esin, Paranjape Ashwin, Gerard-ursin Ines, Li Xiang Lisa, Ladhak Faisal, Rong Frieda, Wang Rose E., Kwon Minae, Park Joon Sung, Cao Hancheng, Lee Tony, Bommasani Rishi, Bernstein Michael, Liang Percy
- Bloom Library: Multimodal Datasets In 300+ Languages For A Variety Of Downstream Tasks Leong Colin, Nemecek Joshua, Mansdorfer Jacob, Filighera Anna, Owodunni Abraham, Whitenack Daniel
- Fine-grained Semantically Aligned Vision-language Pre-training Li Juncheng, He Xin, Wei Longhui, Qian Long, Zhu Linchao, Xie Lingxi, Zhuang Yueting, Tian Qi, Tang Siliang
- LAVENDER: Unifying Video-language Understanding As Masked Language Modeling Li Linjie, Gan Zhe, Lin Kevin, Lin Chung-ching, Liu Zicheng, Liu Ce, Wang Lijuan
- Explanations From Large Language Models Make Small Reasoners Better Li Shiyang, Chen Jianshu, Shen Yelong, Chen Zhiyu, Zhang Xinlu, Li Zekun, Wang Hong, Qian Jing, Peng Baolin, Mao Yi, Chen Wenhu, Yan Xifeng
- Pre-trained Language Models For Interactive Decision-making Li Shuang, Puig Xavier, Paxton Chris, Du Yilun, Wang Clinton, Fan Linxi, Chen Tao, Huang De-an, Akyürek Ekin, Anandkumar Anima, Andreas Jacob, Mordatch Igor, Torralba Antonio, Zhu Yuke
- VALHALLA: Visual Hallucination For Machine Translation Li Yi, Panda Rameswar, Kim Yoon, Chen Chun-fu, Feris Rogerio, Cox David, Vasconcelos Nuno
- Contextual Transformer For Offline Meta Reinforcement Learning Lin Runji, Li Ye, Feng Xidong, Zhang Zhaowei, Fung Xian Hong Wu, Zhang Haifeng, Wang Jun, Du Yali, Yang Yaodong
- Vision-language Pre-training For Multimodal Aspect-based Sentiment Analysis Ling Yan, Yu Jianfei, Xia Rui
- Commitbart: A Large Pre-trained Model For Github Commits Liu Shangqing, Li Yanzhou, Xie Xiaofei, Liu Yang
- 3DALL-E: Integrating Text-to-image AI In 3D Design Workflows Liu Vivian, Vermeulen Jo, Fitzmaurice George, Matejka Justin
- Mulzdg: Multilingual Code-switching Framework For Zero-shot Dialogue Generation Liu Yongkang, Feng Shi, Wang Daling, Zhang Yifei
- Prompt Generation Networks For Input-space Adaptation Of Frozen Vision Transformers Loedeman Jochem, Stol Maarten C., Han Tengda, Asano Yuki M.
- Training Language Models To Follow Instructions With Human Feedback Long Ouyang, Jeff Wu, Xu Jiang, Diogo Almeida, Carroll L. Wainwright, Pamela Mishkin, Chong Zhang, Sandhini Agarwal, Katarina Slama, Alex Ray, John Schulman, Jacob Hilton, Fraser Kelton, Luke Miller, Maddie Simens, Amanda Askell, Peter Welinder, Paul Christiano, Jan Leike, Ryan Lowe
- Reacc: A Retrieval-augmented Code Completion Framework Lu Shuai, Duan Nan, Han Hojae, Guo Daya, Hwang Seung-won, Svyatkovskiy Alexey
- I-tuning: Tuning Frozen Language Models With Image For Lightweight Image Captioning Luo Ziyang, Hu Zhipeng, Xi Yadong, Zhang Rongsheng, Ma Jing
- A Frustratingly Simple Approach For End-to-end Image Captioning Luo Ziyang, Xi Yadong, Zhang Rongsheng, Ma Jing
- Torchscale: Transformers At Scale Ma Shuming, Wang Hongyu, Huang Shaohan, Wang Wenhui, Chi Zewen, Dong Li, Benhaim Alon, Patra Barun, Chaudhary Vishrav, Song Xia, Wei Furu
- Mini-model Adaptation: Efficiently Extending Pretrained Models To New Languages Via Aligned Shallow Training Marchisio Kelly, Lewis Patrick, Chen Yihong, Artetxe Mikel
- Chatbots In A Botnet World Mckee Forrest, Noever David
- Who Is GPT-3? An Exploration Of Personality, Values And Demographics Miotto Marilù, Rossberg Nicola, Kleinberg Bennett
- Minicons: Enabling Flexible Behavioral And Representational Analyses Of Transformer Language Models Misra Kanishka
- CUE Vectors: Modular Training Of Language Models Conditioned On Diverse Contextual Signals Novotney Scott, Mukherjee Sreeparna, Ahmed Zeeshan, Stolcke Andreas
- Tell Me The Evidence? Dual Visual-linguistic Interaction For Answer Grounding Pan Junwen, Chen Guanlin, Liu Yi, Wang Jiexiang, Bian Cheng, Zhu Pengfei, Zhang Zhicheng
- TALM: Tool Augmented Language Models Parisi Aaron, Zhao Yao, Fiedel Noah
- Xfboost: Improving Text Generation With Controllable Decoders Peng Xiangyu, Sollami Michael
- Ignore Previous Prompt: Attack Techniques For Language Models Perez Fábio, Ribeiro Ian
- Answer-me: Multi-task Open-vocabulary Visual Question Answering Piergiovanni Aj, Li Wei, Kuo Weicheng, Saffar Mohammad, Bertsch Fred, Angelova Anelia
- Synchromesh: Reliable Code Generation From Pre-trained Language Models Poesia Gabriel, Polozov Oleksandr, Le Vu, Tiwari Ashish, Soares Gustavo, Meek Christopher, Gulwani Sumit
- Grips: Gradient-free, Edit-based Instruction Search For Prompting Large Language Models Prasad Archiki, Hase Peter, Zhou Xiang, Bansal Mohit
- Psychologically-informed Chain-of-thought Prompts For Metaphor Understanding In Large Language Models Prystawski Ben, Thibodeau Paul, Potts Christopher, Goodman Noah D.
- Cometkiwi: Ist-unbabel 2022 Submission For The Quality Estimation Shared Task Rei Ricardo, Treviso Marcos, Guerreiro Nuno M., Zerva Chrysoula, Farinha Ana C., Maroti Christine, De Souza José G. C., Glushkova Taisiya, Alves Duarte M., Lavie Alon, Coheur Luisa, Martins André F. T.
- An Understanding-oriented Robust Machine Reading Comprehension Model Ren Feiliang, Liu Yongkang, Li Bochao, Liu Shilei, Wang Bingchao, Wang Jiaqi, Liu Chunchao, Ma Qi
- Breakpoint Transformers For Modeling And Tracking Intermediate Beliefs Richardson Kyle, Tamari Ronen, Sultan Oren, Tsarfaty Reut, Shahaf Dafna, Sabharwal Ashish
- Benchclamp: A Benchmark For Evaluating Language Models On Syntactic And Semantic Parsing Roy Subhro, Thomson Sam, Chen Tongfei, Shin Richard, Pauls Adam, Eisner Jason, Van Durme Benjamin
- NLX-GPT: A Model For Natural Language Explanations In Vision And Vision-language Tasks Sammani Fawaz, Mukherjee Tanmoy, Deligiannis Nikos
- Automatic Generation Of Programming Exercises And Code Explanations Using Large Language Models Sarsa Sami, Denny Paul, Hellas Arto, Leinonen Juho
- Confident Adaptive Language Modeling Schuster Tal, Fisch Adam, Gupta Jai, Dehghani Mostafa, Bahri Dara, Tran Vinh Q., Tay Yi, Metzler Donald
- Using Deepspeed And Megatron To Train Megatron-turing NLG 530B, A Large-scale Generative Language Model Shaden Smith, Mostofa Patwary, Brandon Norick, Patrick Legresley, Samyam Rajbhandari, Jared Casper, Zhun Liu, Shrimai Prabhumoye, George Zerveas, Vijay Korthikanti, Elton Zhang, Rewon Child, Reza Yazdani Aminabadi, Julie Bernauer, Xia Song, Mohammad Shoeybi, Yuxiong He, Michael Houston, Saurabh Tiwary, Bryan Catanzaro
- Incorporating Domain Knowledge Through Task Augmentation For Front-end Javascript Code Generation Shen Sijie, Zhu Xiang, Dong Yihong, Guo Qizhi, Zhen Yankun, Li Ge
- XRICL: Cross-lingual Retrieval-augmented In-context Learning For Cross-lingual Text-to-sql Semantic Parsing Shi Peng, Zhang Rui, Bai He, Lin Jimmy
- Mgpt: Few-shot Learners Go Multilingual Shliazhko Oleh, Fenogenova Alena, Tikhonova Maria, Mikhailov Vladislav, Kozlova Anastasia, Shavrina Tatiana
- Repository-level Prompt Generation For Large Language Models Of Code Shrivastava Disha, Larochelle Hugo, Tarlow Daniel
- React: Synergizing Reasoning And Acting In Language Models Shunyu Yao, Jeffrey Zhao, Dian Yu, Nan Du, Izhak Shafran, Karthik Narasimhan, Yuan Cao
- Augmenting Interpretable Models With Llms During Training Singh Chandan, Askari Armin, Caruana Rich, Gao Jianfeng
- Large Language Models Encode Clinical Knowledge Singhal Karan, Azizi Shekoofeh, Tu Tao, Mahdavi S. Sara, Wei Jason, Chung Hyung Won, Scales Nathan, Tanwani Ajay, Cole-lewis Heather, Pfohl Stephen, Payne Perry, Seneviratne Martin, Gamble Paul, Kelly Chris, Scharli Nathaneal, Chowdhery Aakanksha, Mansfield Philip, Arcas Blaise Aguera Y, Webster Dale, Corrado Greg S., Matias Yossi, Chou Katherine, Gottweis Juraj, Tomasev Nenad, Liu Yun, Rajkomar Alvin, Barral Joelle, Semturs Christopher, Karthikesalingam Alan, Natarajan Vivek
- Improving The Domain Adaptation Of Retrieval Augmented Generation (RAG) Models For Open Domain Question Answering Siriwardhana Shamane, Weerasekera Rivindu, Wen Elliott, Kaluarachchi Tharindu, Rana Rajib, Nanayakkara Suranga
- OPT-IML: Scaling Language Model Instruction Meta Learning Through The Lens Of Generalization Srinivasan Iyer, Xi Victoria Lin, Ramakanth Pasunuru, Todor Mihaylov, Daniel Simig, Ping Yu, Kurt Shuster, Tianlu Wang, Qing Liu, Punit Singh Koura, Xian Li, Brian O'horo, Gabriel Pereyra, Jeff Wang, Christopher Dewan, Asli Celikyilmaz, Luke Zettlemoyer, Ves Stoyanov
- Climb: A Continual Learning Benchmark For Vision-and-language Tasks Srinivasan Tejas, Chang Ting-yun, Alva Leticia Leonor Pinto, Chochlakis Georgios, Rostami Mohammad, Thomason Jesse
- QA4QG: Using Question Answering To Constrain Multi-hop Question Generation Su Dan, Xu Peng, Fung Pascale
- Black-box Tuning For Language-model-as-a-service Sun Tianxiang, Shao Yunfan, Qian Hong, Huang Xuanjing, Qiu Xipeng
- Metaphorical User Simulators For Evaluating Task-oriented Dialogue Systems Sun Weiwei, Guo Shuyu, Zhang Shuo, Ren Pengjie, Chen Zhumin, De Rijke Maarten, Ren Zhaochun
- Pseudo-ood Training For Robust Language Models Sundararaman Dhanasekar, Mehta Nikhil, Carin Lawrence
- OPT: Open Pre-trained Transformer Language Models Susan Zhang, Stephen Roller, Naman Goyal, Mikel Artetxe, Moya Chen, Shuohui Chen, Christopher Dewan, Mona Diab, Xian Li, Xi Victoria Lin, Todor Mihaylov, Myle Ott, Sam Shleifer, Kurt Shuster, Daniel Simig, Punit Singh Koura, Anjali Sridhar, Tianlu Wang, Luke Zettlemoyer
- Chatgpt: The End Of Online Exam Integrity? Susnjak Teo
- Commonsenseqa 2.0: Exposing The Limits Of AI Through Gamification Talmor Alon, Yoran Ori, Bras Ronan Le, Bhagavatula Chandra, Goldberg Yoav, Choi Yejin, Berant Jonathan
- ACORT: A Compact Object Relation Transformer For Parameter Efficient Image Captioning Tan Jia Huei, Tan Ying Hua, Chan Chee Seng, Chuah Joon Huang
- Benchmarking Large Language Models For Automated Verilog RTL Code Generation Thakur Shailja, Ahmad Baleegh, Fan Zhenxing, Pearce Hammond, Tan Benjamin, Karri Ramesh, Dolan-gavitt Brendan, Garg Siddharth
- Adavae: Exploring Adaptive Gpt-2s In Variational Auto-encoders For Language Modeling Tu Haoqin, Yang Zhongliang, Yang Jinshuai, Huang Yongfeng
- Efficient Few-shot Learning Without Prompts Tunstall Lewis, Reimers Nils, Jo Unso Eun Seo, Bates Luke, Korat Daniel, Wasserblat Moshe, Pereg Oren
- Towards Reasoning-aware Explainable VQA Vaideeswaran Rakesh, Gao Feng, Mathur Abhinav, Thattai Govind
- Unite: Unified Translation Evaluation Wan Yu, Liu Dayiheng, Yang Baosong, Zhang Haibo, Chen Boxing, Wong Derek F., Chao Lidia S.
- STRUDEL: Structured Dialogue Summarization For Dialogue Comprehension Wang Borui, Feng Chengcheng, Nair Arjun, Mao Madelyn, Desai Jai, Celikyilmaz Asli, Li Haoran, Mehdad Yashar, Radev Dragomir
- Knowledge Prompting In Pre-trained Language Model For Natural Language Understanding Wang Jianing, Huang Wenkang, Shi Qiuhui, Wang Hongbin, Qiu Minghui, Li Xiang, Gao Ming
- Rethinking Textual Adversarial Defense For Pre-trained Language Models Wang Jiayi, Bao Rongzhou, Zhang Zhuosheng, Zhao Hai
- Visually-augmented Language Modeling Wang Weizhi, Dong Li, Cheng Hao, Song Haoyu, Liu Xiaodong, Yan Xifeng, Gao Jianfeng, Wei Furu
- Elaboration-generating Commonsense Question Answering At Scale Wang Wenya, Srikumar Vivek, Hajishirzi Hanna, Smith Noah A.
- Two-stage LLM Fine-tuning With Less Specialization And More Generalization Wang Yihan, Si Si, Li Daliang, Lukasik Michal, Yu Felix, Hsieh Cho-jui, Dhillon Inderjit S, Kumar Sanjiv
- Pan More Gold From The Sand: Refining Open-domain Dialogue Training With Noisy Self-retrieval Generation Wang Yihe, Li Yitong, Wang Yasheng, Mi Fei, Zhou Pingyi, Wang Xin, Liu Jin, Jiang Xin, Liu Qun
- CREATER: Ctr-driven Advertising Text Generation With Controlled Pre-training And Contrastive Fine-tuning Wei Penghui, Yang Xuanhua, Liu Shaoguo, Wang Liang, Zheng Bo
- EGCR: Explanation Generation For Conversational Recommendation Wen Bingbing, Bu Xiaoning, Shah Chirag
- Precisely The Point: Adversarial Augmentations For Faithful And Informative Text Generation Wu Wenhao, Li Wei, Liu Jiachen, Xiao Xinyan, Li Sujian, Lyu Yajuan
- Evaluating And Improving Context Attention Distribution On Multi-turn Response Generation Using Self-contained Distractions Xing Yujie, Gulla Jon Atle
- COSPLAY: Concept Set Guided Personalized Dialogue Generation Across Both Party Personas Xu Chen, Li Piji, Wang Wei, Yang Haoran, Wang Siyun, Xiao Chuangbai
- Rationale-augmented Ensembles In Language Models Xuezhi Wang, Jason Wei, Dale Schuurmans, Quoc Le, Ed Chi, Denny Zhou
- Testaug: A Framework For Augmenting Capability-based NLP Tests Yang Guanqun, Haque Mirazul, Song Qiaochu, Yang Wei, Liu Xueqing
- Prompt Tuning For Discriminative Pre-trained Language Models Yao Yuan, Dong Bowen, Zhang Ao, Zhang Zhengyan, Xie Ruobing, Liu Zhiyuan, Lin Leyu, Sun Maosong, Wang Jianyong
- UL2: Unifying Language Learning Paradigms Yi Tay, Mostafa Dehghani, Vinh Q. Tran, Xavier Garcia, Jason Wei, Xuezhi Wang, Hyung Won Chung, Siamak Shakeri, Dara Bahri, Tal Schuster, Huaixiu Steven Zheng, Denny Zhou, Neil Houlsby, Donald Metzler
- Tri-attention: Explicit Context-aware Attention Mechanism For Natural Language Processing Yu Rui, Li Yifeng, Lu Wenpeng, Cao Longbing
- Bilaterally Slimmable Transformer For Elastic And Efficient Visual Question Answering Yu Zhou, Jin Zitian, Yu Jun, Xu Mingliang, Wang Hongbo, Fan Jianping
- Competition-level Code Generation With Alphacode Yujia Li, David Choi, Junyoung Chung, Nate Kushman, Julian Schrittwieser, Rémi Leblond, Tom Eccles, James Keeling, Felix Gimeno, Agustin Dal Lago, Thomas Hubert, Peter Choy, Cyprien De Masson D'autume, Igor Babuschkin, Xinyun Chen, Po-sen Huang, Johannes Welbl, Sven Gowal, Alexey Cherepanov, James Molloy, Daniel J. Mankowitz, Esme Sutherland Robson, Pushmeet Kohli, Nando De Freitas, Koray Kavukcuoglu, Oriol Vinyals
- BLISS: Robust Sequence-to-sequence Learning Via Self-supervised Input Representation Zhang Zheng, Ding Liang, Cheng Dazhao, Liu Xuebo, Zhang Min, Tao Dacheng
- MSDF: A General Open-domain Multi-skill Dialog Framework Zhao Yu, Hu Xinshuo, Li Yunxin, Hu Baotian, Li Dongfang, Chen Sichao, Wang Xiaolong
- Self-adaptive In-context Learning: An Information Compression Perspective For In-context Example Selection And Ordering Zhiyong Wu, Yaoxiang Wang, Jiacheng Ye, Lingpeng Kong
- Tacube: Pre-computing Data Cubes For Answering Numerical-reasoning Questions Over Tabular Data Zhou Fan, Hu Mengkang, Dong Haoyu, Cheng Zhoujun, Han Shi, Zhang Dongmei
- Teaching Algorithmic Reasoning Via In-context Learning Zhou Hattie, Nova Azade, Larochelle Hugo, Courville Aaron, Neyshabur Behnam, Sedghi Hanie
- Binding Language Models In Symbolic Languages Zhoujun Cheng, Tianbao Xie, Peng Shi, Chengzu Li, Rahul Nadkarni, Yushi Hu, Caiming Xiong, Dragomir Radev, Mari Ostendorf, Luke Zettlemoyer, Noah A. Smith, Tao Yu
- Efficient Long Sequence Modeling Via State Space Augmented Transformer Zuo Simiao, Liu Xiaodong, Jiao Jian, Charles Denis, Manavoglu Eren, Zhao Tuo, Gao Jianfeng
- Let The Llms Talk: Simulating Human-to-human Conversational QA Via Zero-shot Llm-to-llm Interactions Abbasiantaeb Zahra, Yuan Yifei, Kanoulas Evangelos, Aliannejadi Mohammad
- I Was Blind But Now I See: Implementing Vision-enabled Dialogue In Social Robots Abbo Giulio Antonio, Belpaeme Tony
- LMRL Gym: Benchmarks For Multi-turn Reinforcement Learning With Language Models Abdulhai Marwa, White Isadora, Snell Charlie, Sun Charles, Hong Joey, Zhai Yuexiang, Xu Kelvin, Levine Sergey
- Beyond Traditional Teaching: The Potential Of Large Language Models And Chatbots In Graduate Engineering Education Abedi Mahyar, Alshybani Ibrahem, Shahadat Muhammad Rubayat Bin, Murillo Michael S.
- Simul-llm: A Framework For Exploring High-quality Simultaneous Translation With Large Language Models Agostinelli Victor, Wild Max, Raffel Matthew, Fuad Kazi Ahmed Asif, Chen Lizhong
- Story Visualization By Online Text Augmentation With Context Memory Ahn Daechul, Kim Daneul, Song Gwangmo, Kim Seung Hwan, Lee Honglak, Kang Dongyeop, Choi Jonghyun
- MEGA: Multilingual Evaluation Of Generative AI Ahuja Kabir, Diddee Harshita, Hada Rishav, Ochieng Millicent, Ramesh Krithika, Jain Prachi, Nambi Akshay, Ganu Tanuja, Segal Sameer, Axmed Maxamed, Bali Kalika, Sitaram Sunayana
- Jointly Training Large Autoregressive Multimodal Models Aiello Emanuele, Yu Lili, Nie Yixin, Aghajanyan Armen, Oguz Barlas
- CESAR: Automatic Induction Of Compositional Instructions For Multi-turn Dialogs Aksu Taha, Hazarika Devamanyu, Mehri Shikib, Kim Seokhwan, Hakkani-tür Dilek, Liu Yang, Namazifar Mahdi
- RL4F: Generating Natural Language Feedback With Reinforcement Learning For Repairing Model Outputs Akyürek Afra Feyza, Akyürek Ekin, Madaan Aman, Kalyan Ashwin, Clark Peter, Wijaya Derry, Tandon Niket
- RET-LLM: Towards A General Read-write Memory For Large Language Models Ali Modarressi, Ayyoob Imani, Mohsen Fayyaz, Hinrich Schütze
- LLM In A Flash: Efficient Large Language Model Inference With Limited Memory Alizadeh Keivan, Mirzadeh Iman, Belenko Dmitry, Khatamifard Karen, Cho Minsik, Del Mundo Carlo C, Rastegari Mohammad, Farajtabar Mehrdad
- Generative AI Perceptions: A Survey To Measure The Perceptions Of Faculty, Staff, And Students On Generative AI Tools In Academia Amani Sara, White Lance, Balart Trini, Arora Laksha, Shryock Kristi J., Brumbelow Kelly, Watson Karan L.
- Knowledge Of Knowledge: Exploring Known-unknowns Uncertainty With Large Language Models Amayuelas Alfonso, Wong Kyle, Pan Liangming, Chen Wenhu, Wang William
- Chatgpt-3.5, Chatgpt-4, Google Bard, And Microsoft Bing To Improve Health Literacy And Communication In Pediatric Populations And Beyond Amin Kanhai S., Mayes Linda, Khosla Pavan, Doshi Rushabh
- Openleaf: Open-domain Interleaved Image-text Generation And Evaluation An Jie, Yang Zhengyuan, Li Linjie, Wang Jianfeng, Lin Kevin, Liu Zicheng, Wang Lijuan, Luo Jiebo
- Does Deep Learning Learn To Abstract? A Systematic Probing Framework An Shengnan, Lin Zeqi, Chen Bei, Fu Qiang, Zheng Nanning, Lou Jian-guang
- On Generative Agents In Recommendation An Zhang, Yuxin Chen, Leheng Sheng, Xiang Wang, Tat-seng Chua
- Query Understanding In The Age Of Large Language Models Anand Avishek, V Venktesh, Anand Abhijit, Setty Vinay
- Studying The Impacts Of Pre-training Using Chatgpt-generated Text On Downstream Tasks Anand Sarthak
- Protip: Progressive Tool Retrieval Improves Planning Anantha Raviteja, Bandyopadhyay Bortik, Kashi Anirudh, Mahinder Sayantan, Hill Andrew W, Chappidi Srinivas
- Context Tuning For Retrieval Augmented Generation Anantha Raviteja, Bethi Tharun, Vodianik Danil, Chappidi Srinivas
- Expel: LLM Agents Are Experiential Learners Andrew Zhao, Daniel Huang, Quentin Xu, Matthieu Lin, Yong-jin Liu, Gao Huang
- Resurrecting Recurrent Neural Networks For Long Sequences Antonio Orvieto, Samuel L Smith, Albert Gu, Anushan Fernando, Caglar Gulcehre, Razvan Pascanu, Soham De
- Chainforge: A Visual Toolkit For Prompt Engineering And LLM Hypothesis Testing Arawjo Ian, Swoopes Chelse, Vaithilingam Priyan, Wattenberg Martin, Glassman Elena
- Llm2loss: Leveraging Language Models For Explainable Model Diagnostics Ardeshir Shervin
- Leancontext: Cost-efficient Domain-specific Question Answering Using Llms Arefeen Md Adnan, Debnath Biplob, Chakradhar Srimat
- Unmasking The Giant: A Comprehensive Evaluation Of Chatgpt's Proficiency In Coding Algorithms And Data Structures Arefin Sayed Erfan, Heya Tasnia Ashrafi, Al-qudah Hasan, Ineza Ynes, Serwadda Abdul
- Towards A Unified Multimodal Reasoning Framework Arun Abhinav, Mal Dipendra Singh, Soni Mehul, Sawada Tomohiro
- BUFFET: Benchmarking Large Language Models For Few-shot Cross-lingual Transfer Asai Akari, Kudugunta Sneha, Yu Xinyan Velocity, Blevins Terra, Gonen Hila, Reid Machel, Tsvetkov Yulia, Ruder Sebastian, Hajishirzi Hannaneh
- Self-rag: Learning To Retrieve, Generate, And Critique Through Self-reflection Asai Akari, Wu Zeqiu, Wang Yizhong, Sil Avirup, Hajishirzi Hannaneh
- Mitigating Open-vocabulary Caption Hallucinations Assaf Ben-kish, Moran Yanuka, Morris Alper, Raja Giryes, Hadar Averbuch-elor
- Factuality Challenges In The Era Of Large Language Models Augenstein Isabelle, Baldwin Timothy, Cha Meeyoung, Chakraborty Tanmoy, Ciampaglia Giovanni Luca, Corney David, Diresta Renee, Ferrara Emilio, Hale Scott, Halevy Alon, Hovy Eduard, Ji Heng, Menczer Filippo, Miguez Ruben, Nakov Preslav, Scheufele Dietram, Sharma Shivam, Zagni Giovanni
- Openflamingo: An Open-source Framework For Training Large Autoregressive Vision-language Models Awadalla Anas, Gao Irena, Gardner Josh, Hessel Jack, Hanafy Yusuf, Zhu Wanrong, Marathe Kalyani, Bitton Yonatan, Gadre Samir, Sagawa Shiori, Jitsev Jenia, Kornblith Simon, Koh Pang Wei, Ilharco Gabriel, Wortsman Mitchell, Schmidt Ludwig
- Studenteval: A Benchmark Of Student-written Prompts For Large Language Models Of Code Babe Hannah Mclean, Nguyen Sydney, Zi Yangtian, Guha Arjun, Feldman Molly Q, Anderson Carolyn Jane
- RRAML: Reinforced Retrieval Augmented Machine Learning Bacciu Andrea, Cuconasu Florin, Siciliano Federico, Silvestri Fabrizio, Tonellotto Nicola, Trappolini Giovanni
- Llms-augmented Contextual Bandit Baheri Ali, Alm Cecilia O.
- Schema-driven Information Extraction From Heterogeneous Tables Bai Fan, Kang Junmo, Stanovsky Gabriel, Freitag Dayne, Dredze Mark, Ritter Alan
- Kgquiz: Evaluating The Generalization Of Encoded Knowledge In Large Language Models Bai Yuyang, Feng Shangbin, Balachandran Vidhisha, Tan Zhaoxuan, Lou Shiqi, He Tianxing, Tsvetkov Yulia
- Benchmarking LLM Powered Chatbots: Methods And Metrics Banerjee Debarag, Singh Pooja, Avadhanam Arjun, Srivastava Saksham
- Vtrain: A Simulation Framework For Evaluating Cost-effective And Compute-optimal Large Language Model Training Bang Jehyeon, Choi Yujeong, Kim Myeongwoo, Kim Yongdeok, Rhu Minsoo
- Exploring Qualitative Research Using Llms Bano Muneera, Zowghi Didar, Whittle Jon
- A Bi-step Grounding Paradigm For Large Language Models In Recommendation Systems Bao Keqin, Zhang Jizhi, Wang Wenjie, Zhang Yang, Yang Zhengyi, Luo Yancheng, Chen Chong, Feng Fuli, Tian Qi
- Tallrec: An Effective And Efficient Tuning Framework To Align Large Language Model With Recommendation Bao Keqin, Zhang Jizhi, Zhang Yang, Wang Wenjie, Feng Fuli, He Xiangnan
- Exploring Iterative Enhancement For Improving Learnersourced Multiple-choice Question Explanations With Large Language Models Bao Qiming, Leinonen Juho, Peng Alex Yuxuan, Zhong Wanjun, Gendron Gaël, Pistotti Timothy, Huang Alice, Denny Paul, Witbrock Michael, Liu Jiamou
- Understanding Telecom Language Through Large Language Models Bariah Lina, Zou Hang, Zhao Qiyang, Mouhouche Belkacem, Bader Faouzi, Debbah Merouane
- Exploring The Landscape Of Large Language Models In Medical Question Answering Bean Andrew M., Korgul Karolina, Krones Felix, Mccraith Robert, Mahdi Adam
- AQUALLM: Audio Question Answering Data Generation Using Large Language Models Behera Swarup Ranjan, Injeti Krishna Mohan, Patibandla Jaya Sai Kiran, Pokala Praveen Kumar, Pailla Balakrishna Reddy
- Prompt Sketching For Large Language Models Beurer-kellner Luca, Müller Mark Niklas, Fischer Marc, Vechev Martin
- When Automated Assessment Meets Automated Content Generation: Examining Text Quality In The Era Of Gpts Bevilacqua Marialena, Oketch Kezia, Qin Ruiyang, Stamey Will, Zhang Xinyuan, Gan Yi, Yang Kai, Abbasi Ahmed
- Approach Intelligent Writing Assistants Usability With Seven Stages Of Action Bhat Avinash, Shrivastava Disha, Guo Jin L. C.
- Automating Question Generation From Educational Text Bhowmick Ayan Kumar, Jagmohan Ashish, Vempaty Aditya, Dey Prasenjit, Hall Leigh, Hartman Jeremy, Kokku Ravi, Maheshwari Hema
- When Do Program-of-thoughts Work For Reasoning? Bi Zhen, Zhang Ningyu, Jiang Yinuo, Deng Shumin, Zheng Guozhou, Chen Huajun
- Helm: Highlighted Evidence Augmented Language Model For Enhanced Table-to-text Generation Bian Junyi, Qin Xiaolei, Zou Wuhe, Huang Mengzuo, Luo Congyi, Zhang Ke, Zhang Weidong
- In-context Learning Dynamics With Random Binary Sequences Bigelow Eric J., Lubana Ekdeep Singh, Dick Robert P., Tanaka Hidenori, Ullman Tomer D.
- Tell Me Where To Go: A Composable Framework For Context-aware Embodied Robot Navigation Biggie Harel, Mopidevi Ajay Narasimha, Woods Dusty, Heckman Christoffer
- VIGC: Visual Instruction Generation And Correction Bin Wang, Fan Wu, Xiao Han, Jiahui Peng, Huaping Zhong, Pan Zhang, Xiaoyi Dong, Weijia Li, Wei Li, Jiaqi Wang, Conghui He
- How Should The Advent Of Large Language Models Affect The Practice Of Science? Binz Marcel, Alaniz Stephan, Roskies Adina, Aczel Balazs, Bergstrom Carl T., Allen Colin, Schad Daniel, Wulff Dirk, West Jevin D., Zhang Qiong, Shiffrin Richard M., Gershman Samuel J., Popov Ven, Bender Emily M., Marelli Marco, Botvinick Matthew M., Akata Zeynep, Schulz Eric
- Model Leeching: An Extraction Attack Targeting Llms Birch Lewis, Hackett William, Trawicki Stefan, Suri Neeraj, Garraghan Peter
- Plasma: Making Small Language Models Better Procedural Knowledge Models For (counterfactual) Planning Brahman Faeze, Bhagavatula Chandra, Pyatkin Valentina, Hwang Jena D., Li Xiang Lorraine, Arai Hirona J., Sanyal Soumya, Sakaguchi Keisuke, Ren Xiang, Choi Yejin
- Chemcrow: Augmenting Large-language Models With Chemistry Tools Bran Andres M, Cox Sam, Schilter Oliver, Baldassari Carlo, White Andrew D, Schwaller Philippe
- Extractgpt: Exploring The Potential Of Large Language Models For Product Attribute Value Extraction Brinkmann Alexander, Shraga Roee, Bizer Christian
- Codetf: One-stop Transformer Library For State-of-the-art Code LLM Bui Nghi D. Q., Le Hung, Wang Yue, Li Junnan, Gotmare Akhilesh Deepak, Hoi Steven C. H.
- Assessing Large Language Models On Climate Information Bulian Jannis, Schäfer Mike S., Amini Afra, Lam Heidi, Ciaramita Massimiliano, Gaiarin Ben, Hübscher Michelle Chen, Buck Christian, Mede Niels G., Leippold Markus, Strauß Nadine
- Automatic Construction Of A Korean Toxic Instruction Dataset For Ethical Tuning Of Large Language Models Byun Sungjoo, Jang Dongjun, Jo Hyemi, Shin Hyopil
- Pokemonchat: Auditing Chatgpt For Pokémon Universe Knowledge Cabello Laura, Li Jiaang, Chalkidis Ilias
- Scaling In-context Demonstrations With Structured Attention Cai Tianle, Huang Kaixuan, Lee Jason D., Wang Mengdi
- Large Language Models As Tool Makers Cai Tianle, Wang Xuezhi, Ma Tengyu, Chen Xinyun, Zhou Denny
- H2O Open Ecosystem For State-of-the-art Large Language Models Candel Arno, Mckinney Jon, Singer Philipp, Pfeiffer Pascal, Jeblick Maximilian, Lee Chun Ming, Conde Marcos V.
- Instructmol: Multi-modal Integration For Building A Versatile And Reliable Molecular Assistant In Drug Discovery Cao He, Liu Zijing, Lu Xingyu, Yao Yuan, Li Yu
- A Step Closer To Comprehensive Answers: Constrained Multi-stage Question Decomposition With Large Language Models Cao Hejing, An Zhenwei, Feng Jiazhan, Xu Kun, Chen Liwei, Zhao Dongyan
- Systematic Rectification Of Language Models Via Dead-end Analysis Cao Meng, Fatemi Mehdi, Cheung Jackie Chi Kit, Shabanian Samira
- Pumer: Pruning And Merging Tokens For Efficient Vision Language Models Cao Qingqing, Paranjape Bhargavi, Hajishirzi Hannaneh
- How Does AI Chat Change Search Behaviors? Capra Robert, Arguello Jaime
- Dialogue Chain-of-thought Distillation For Commonsense-aware Conversational Agents Chae Hyungjoo, Song Yongho, Ong Kai Tzu-iunn, Kwon Taeyoon, Kim Minjin, Yu Youngjae, Lee Dongha, Kang Dongyeop, Yeo Jinyoung
- INT2.1: Towards Fine-tunable Quantized Large Language Models With Error Correction Through Low-rank Adaptation Chai Yuji, Gkountouras John, Ko Glenn G., Brooks David, Wei Gu-yeon
- Creativity Support In The Age Of Large Language Models: An Empirical Study Involving Emerging Writers Chakrabarty Tuhin, Padmakumar Vishakh, Brahman Faeze, Muresan Smaranda
- Clembench: Using Game Play To Evaluate Chat-optimized Language Models As Conversational Agents Chalamalasetti Kranti, Götze Jana, Hakimov Sherzod, Madureira Brielen, Sadler Philipp, Schlangen David
- Learning To Generate Better Than Your LLM Chang Jonathan D., Brantley Kiante, Ramamurthy Rajkumar, Misra Dipendra, Sun Wen
- Chipgpt: How Far Are We From Natural Language Hardware Design Chang Kaiyan, Wang Ying, Ren Haimeng, Wang Mengdi, Liang Shengwen, Han Yinhe, Li Huawei, Li Xiaowei
- Selective Demonstrations For Cross-domain Text-to-sql Chang Shuaichen, Fosler-lussier Eric
- Salesbot 2.0: A Human-like Intent-guided Chit-chat Dataset Chang Wen-yu, Chen Yun-nung
- Cabbage Sweeter Than Cake? Analysing The Potential Of Large Language Models For Learning Conceptual Spaces Chatterjee Usashi, Gajbhiye Amit, Schockaert Steven
- Fireact: Toward Language Agent Fine-tuning Chen Baian, Shu Chang, Shareghi Ehsan, Collier Nigel, Narasimhan Karthik, Yao Shunyu
- Gptutor: A Chatgpt-powered Programming Tool For Code Explanation Chen Eason, Huang Ray, Chen Han-shin, Tseng Yuen-hsien, Li Liang-yi
- Videollm: Modeling Video Sequence With Large Language Models Chen Guo, Zheng Yin-dong, Wang Jiahao, Xu Jilan, Huang Yifei, Pan Junting, Wang Yi, Wang Yali, Qiao Yu, Lu Tong, Wang Limin
- Chatgpt's One-year Anniversary: Are Open-source Large Language Models Catching Up? Chen Hailin, Jiao Fangkai, Li Xingxuan, Qin Chengwei, Ravaut Mathieu, Zhao Ruochen, Xiong Caiming, Joty Shafiq
- Interpreting And Controlling Vision Foundation Models Via Text Explanations Chen Haozhe, Yang Junfeng, Vondrick Carl, Mao Chengzhi
- Skills-in-context Prompting: Unlocking Compositionality In Large Language Models Chen Jiaao, Pan Xiaoman, Yu Dian, Song Kaiqiang, Wang Xiaoyang, Yu Dong, Chen Jianshu
- Unlearn What You Want To Forget: Efficient Unlearning For Llms Chen Jiaao, Yang Diyi
- Adaptation With Self-evaluation To Improve Selective Prediction In Llms Chen Jiefeng, Yoon Jinsung, Ebrahimi Sayna, Arik Sercan O, Pfister Tomas, Jha Somesh
- Quantifying Uncertainty In Answers From Any Language Model And Enhancing Their Trustworthiness Chen Jiuhai, Mueller Jonas
- Reconcile: Round-table Conference Improves Reasoning Via Consensus Among Diverse Llms Chen Justin Chih-yao, Saha Swarnadeep, Bansal Mohit
- Gaining Wisdom From Setbacks: Aligning Large Language Models Via Mistake Analysis Chen Kai, Wang Chunwei, Yang Kuo, Han Jianhua, Hong Lanqing, Mi Fei, Xu Hang, Liu Zhengying, Huang Wenyong, Li Zhenguo, Yeung Dit-yan, Shang Lifeng, Jiang Xin, Liu Qun
- Epi-curriculum: Episodic Curriculum Learning For Low-resource Domain Adaptation In Neural Machine Translation Chen Keyu, Zhuang Di, Li Mingchen, Chang J. Morris
- Introspective Tips: Large Language Model For In-context Decision Making Chen Liting, Wang Lu, Dong Hang, Du Yali, Yan Jie, Yang Fangkai, Li Shuang, Zhao Pu, Qin Si, Rajmohan Saravan, Lin Qingwei, Zhang Dongmei
- Few-shot Data Synthesis For Open Domain Multi-hop Question Answering Chen Mingda, Chen Xilun, Yih Wen-tau
- Natural Response Generation For Chinese Reading Comprehension Chen Nuo, Li Hongguang, Bao Yinan, Wang Baoyuan, Li Jia
- From Good To Great: Improving Math Reasoning With Tool-augmented Interleaf Prompting Chen Nuo, Li Hongguang, Wang Baoyuan, Li Jia
- Evaluation Of Chatgpt Family Of Models For Biomedical Reasoning And Classification Chen Shan, Li Yingya, Lu Sheng, Van Hoang, Aerts Hugo Jwl, Savova Guergana K., Bitterman Danielle S.
- Chatpipe: Orchestrating Data Preparation Program By Optimizing Human-chatgpt Interactions Chen Sibei, Liu Hanbing, Jin Weiting, Sun Xiangyu, Feng Xiaoyao, Fan Ju, Du Xiaoyong, Tang Nan
- Enhancing Robot Program Synthesis Through Environmental Context Chen Tianyi, Wang Qidi, Dong Zhen, Shen Liwei, Peng Xin
- Self-icl: Zero-shot In-context Learning With Self-generated Demonstrations Chen Wei-lin, Wu Cheng-kuang, Chen Yun-nung, Chen Hsin-hsi
- The Janus Interface: How Fine-tuning In Large Language Models Amplifies The Privacy Risks Chen Xiaoyi, Tang Siyuan, Zhu Rui, Yan Shijun, Jin Lei, Wang Zihao, Su Liya, Zhang Zhikun, Wang Xiaofeng, Tang Haixu
- Minprompt: Graph-based Minimal Prompt Data Augmentation For Few-shot Question Answering Chen Xiusi, Jiang Jyun-yu, Chang Wei-cheng, Hsieh Cho-jui, Yu Hsiang-fu, Wang Wei
- Medprompt: Cross-modal Prompting For Multi-task Medical Image Translation Chen Xuhang, Pun Chi-man, Wang Shuqiang
- Translation And Fusion Improves Zero-shot Cross-lingual Information Extraction Chen Yang, Shah Vedaant, Ritter Alan
- Measuring And Improving Chain-of-thought Reasoning In Vision-language Models Chen Yangyi, Sikka Karan, Cogswell Michael, Ji Heng, Divakaran Ajay
- EE-LLM: Large-scale Training And Inference Of Early-exit Large Language Models With 3D Parallelism Chen Yanxi, Pan Xuchen, Li Yaliang, Ding Bolin, Zhou Jingren
- Tigerbot: An Open Multilingual Multitask LLM Chen Ye, Cai Wei, Wu Liangmin, Li Xiaowei, Xin Zhanxuan, Fu Cong
- NL2TL: Transforming Natural Languages To Temporal Logics Using Large Language Models Chen Yongchao, Gandhi Rujul, Zhang Yang, Fan Chuchu
- Confidant: Customizing Transformer-based Llms Via Collaborative Edge Training Chen Yuhao, Yan Yuxuan, Yang Qianqian, Shu Yuanchao, He Shibo, Chen Jiming
- T-eval: Evaluating The Tool Utilization Capability Of Large Language Models Step By Step Chen Zehui, Du Weihua, Zhang Wenwei, Liu Kuikun, Liu Jiangning, Zheng Miao, Zhuo Jingming, Zhang Songyang, Lin Dahua, Chen Kai, Zhao Feng
- GENOME: Generative Neuro-symbolic Visual Reasoning By Growing And Reusing Modules Chen Zhenfang, Sun Rui, Liu Wenjun, Hong Yining, Gan Chuang
- See, Think, Confirm: Interactive Prompting Between Vision And Language Models For Knowledge-based Visual Reasoning Chen Zhenfang, Zhou Qinhong, Shen Yikang, Hong Yining, Zhang Hao, Gan Chuang
- Chatcot: Tool-augmented Chain-of-thought Reasoning On Chat-based Large Language Models Chen Zhipeng, Zhou Kun, Zhang Beichen, Gong Zheng, Zhao Wayne Xin, Wen Ji-rong
- Llf-bench: Benchmark For Interactive Learning From Language Feedback Cheng Ching-an, Kolobov Andrey, Misra Dipendra, Nie Allen, Swaminathan Adith
- Adversarial Preference Optimization: Enhancing Your Alignment Via RM-LLM Game Cheng Pengyu, Yang Yifan, Li Jian, Dai Yong, Hu Tianhao, Cao Peixin, Du Nan, Li Xiaolong
- CREATOR: Tool Creation For Disentangling Abstract And Concrete Reasoning Of Large Language Models Cheng Qian, Chi Han, Yi R. Fung, Yujia Qin, Zhiyuan Liu, Heng Ji
- Lift Yourself Up: Retrieval-augmented Text Generation With Self Memory Cheng Xin, Luo Di, Chen Xiuying, Liu Lemao, Zhao Dongyan, Yan Rui
- SCALE: Synergized Collaboration Of Asymmetric Language Translation Engines Cheng Xin, Wang Xun, Ge Tao, Chen Si-qing, Wei Furu, Zhao Dongyan, Yan Rui
- Batch Prompting: Efficient Inference With Large Language Model Apis Cheng Zhoujun, Kasai Jungo, Yu Tao
- Chatdb: Augmenting Llms With Databases As Their Symbolic Memory Chenxu Hu, Jie Fu, Chenzhuang Du, Simian Luo, Junbo Zhao, Hang Zhao
- M²chat: Empowering VLM For Multimodal LLM Interleaved Text-image Generation Chi Xiaowei, Zhang Rongyu, Jiang Zhengkai, Liu Yijiang, Wang Yatian, Qi Xingqun, Luo Wenhan, Gao Peng, Zhang Shanghang, Liu Qifeng, Guo Yike
- INSTRUCTEVAL: Towards Holistic Evaluation Of Instruction-tuned Large Language Models Chia Yew Ken, Hong Pengfei, Bing Lidong, Poria Soujanya
- An Integrative Survey On Mental Health Conversational Agents To Bridge Computer Science And Medical Perspectives Cho Young Min, Rai Sunny, Ungar Lyle, Sedoc João, Guntuku Sharath Chandra
- Evaluating The Efficacy Of Interactive Language Therapy Based On LLM For High-functioning Autistic Adolescent Psychological Counseling Cho Yujin, Kim Mingeon, Kim Seojin, Kwon Oyun, Kwon Ryan Donghan, Lee Yoonha, Lim Dohyun
- Whose Text Is It Anyway? Exploring Bigcode, Intellectual Property, And Ethics Choksi Madiha Zahrah, Goedicke David
- Large Language Models For User Interest Journeys Christakopoulou Konstantina, Lalama Alberto, Adams Cj, Qu Iris, Amir Yifat, Chucri Samer, Vollucci Pierce, Soldo Fabio, Bseiso Dina, Scodel Sarah, Dixon Lucas, Chi Ed H., Chen Minmin
- Long Story Short: A Summarize-then-search Method For Long Video Question Answering Chung Jiwan, Yu Youngjae
- Challenges Of Large Language Models For Mental Health Counseling Chung Neo Christopher, Dyer George, Brocki Lennart
- Instructtods: Large Language Models For End-to-end Task-oriented Dialogue Systems Chung Willy, Cahyawijaya Samuel, Wilie Bryan, Lovenia Holy, Fung Pascale
- Safurai 001: New Qualitative Approach For Code LLM Evaluation Cifarelli Davide, Boiardi Leonardo, Puppo Alessandro
- Galactic Chitchat: Using Large Language Models To Converse With Astronomy Literature Ciucă Ioana, Ting Yuan-sen
- Inducing Anxiety In Large Language Models Increases Exploration And Bias Coda-forno Julian, Witte Kristin, Jagadish Akshay K., Binz Marcel, Akata Zeynep, Schulz Eric
- Evaluating Language Models For Mathematics Through Interactions Collins Katherine M., Jiang Albert Q., Frieder Simon, Wong Lionel, Zilka Miri, Bhatt Umang, Lukasiewicz Thomas, Wu Yuhuai, Tenenbaum Joshua B., Hart William, Gowers Timothy, Li Wenda, Weller Adrian, Jamnik Mateja
- Seamless: Multilingual Expressive And Streaming Speech Translation Communication Seamless, Barrault Loïc, Chung Yu-an, Meglioli Mariano Coria, Dale David, Dong Ning, Duppenthaler Mark, Duquenne Paul-ambroise, Ellis Brian, Elsahar Hady, Haaheim Justin, Hoffman John, Hwang Min-jae, Inaguma Hirofumi, Klaiber Christopher, Kulikov Ilia, Li Pengwei, Licht Daniel, Maillard Jean, Mavlyutov Ruslan, Rakotoarison Alice, Sadagopan Kaushik Ram, Ramakrishnan Abinesh, Tran Tuan, Wenzek Guillaume, Yang Yilin, Ye Ethan, Evtimov Ivan, Fernandez Pierre, Gao Cynthia, Hansanti Prangthip, Kalbassi Elahe, Kallet Amanda, Kozhevnikov Artyom, Gonzalez Gabriel Mejia, Roman Robin San, Touret Christophe, Wong Corinne, Wood Carleigh, Yu Bokai, Andrews Pierre, Balioglu Can, Chen Peng-jen, Costa-jussà Marta R., Elbayad Maha, Gong Hongyu, Guzmán Francisco, Heffernan Kevin, Jain Somya, Kao Justine, Lee Ann, Ma Xutai, Mourachko Alex, Peloquin Benjamin, Pino Juan, Popuri Sravya, Ropers Christophe, Saleem Safiyyah, Schwenk Holger, Sun Anna, Tomasello Paden, Wang Changhan, Wang Jeff, Wang Skyler, Williamson Mary
- Receive, Reason, And React: Drive As You Say With Large Language Models In Autonomous Vehicles Cui Can, Ma Yunsheng, Cao Xu, Ye Wenqian, Wang Ziran
- Personalized Autonomous Driving With Large Language Models: Field Experiments Cui Can, Yang Zichong, Zhou Yupeng, Ma Yunsheng, Lu Juanwu, Li Lingxi, Chen Yaobin, Panchal Jitesh, Wang Ziran
- Llm-in-the-loop: Leveraging Large Language Model For Thematic Analysis Dai Shih-chieh, Xiong Aiping, Ku Lun-wei
- Llmebench: A Flexible Framework For Accelerating Llms Benchmarking Dalvi Fahim, Hasanain Maram, Boughorbel Sabri, Mousi Basel, Abdaljalil Samir, Nazar Nizi, Abdelali Ahmed, Chowdhury Shammur Absar, Mubarak Hamdy, Ali Ahmed, Hawasly Majd, Durrani Nadir, Alam Firoj
- Educhat: A Large-scale Language Model-based Chatbot System For Intelligent Education Dan Yuhao, Lei Zhikai, Gu Yiyang, Li Yong, Yin Jianghao, Lin Jiaju, Ye Linhao, Tie Zhiyan, Zhou Yougen, Wang Yilei, Zhou Aimin, Zhou Ze, Chen Qin, Zhou Jie, He Liang, Qiu Xipeng
- Performance Comparison Of Large Language Models On VNHSGE English Dataset: Openai Chatgpt, Microsoft Bing Chat, And Google Bard Dao Xuan-quy
- Can Chatgpt Pass The Vietnamese National High School Graduation Examination? Dao Xuan-quy, Le Ngoc-bich, Phan Xuan-dung, Ngo Bac-bien
- Who's Thinking? A Push For Human-centered Evaluation Of Llms Using The XAI Playbook Datta Teresa, Dickerson John P.
- Competence-based Analysis Of Language Models Davies Adam, Jiang Jize, Zhai Chengxiang
- LLMR: Real-time Prompting Of Interactive Worlds Using Large Language Models De La Torre Fernanda, Fang Cathy Mengying, Huang Han, Banburski-fahey Andrzej, Fernandez Judith Amores, Lanier Jaron
- Writing User Personas With Large Language Models: Testing Phase 6 Of A Thematic Analysis Of Semi-structured Interviews De Paoli Stefano
- An Evaluation On Large Language Model Outputs: Discourse And Memorization De Wynter Adrian, Wang Xun, Sokolov Alex, Gu Qilong, Chen Si-qing
- Im-promptu: In-context Composition From Image Prompts Dedhia Bhishma, Chang Michael, Snell Jake C., Griffiths Thomas L., Jha Niraj K.
- Facechat: An Emotion-aware Face-to-face Dialogue Framework Deema Alnuhait, Qingyang Wu, Zhou Yu
- Controlled Text Generation Via Language Model Arithmetic Dekoninck Jasper, Fischer Marc, Beurer-kellner Luca, Vechev Martin
- Masterkey: Automated Jailbreak Across Multiple Large Language Model Chatbots Deng Gelei, Liu Yi, Li Yuekang, Wang Kailong, Zhang Ying, Li Zefeng, Wang Haoyu, Zhang Tianwei, Liu Yang
- Plug-and-play Policy Planner For Large Language Model Powered Dialogue Agents Deng Yang, Zhang Wenxuan, Lam Wai, Ng See-kiong, Chua Tat-seng
- Multilingual Jailbreak Challenges In Large Language Models Deng Yue, Zhang Wenxuan, Pan Sinno Jialin, Bing Lidong
- Early Chatgpt User Portrait Through The Lens Of Data Deng Yuyang, Zhao Ni, Huang Xin
- Can We Trust Ai-generated Educational Content? Comparative Analysis Of Human And Ai-generated Learning Resources Denny Paul, Khosravi Hassan, Hellas Arto, Leinonen Juho, Sarsa Sami
- Chatgpt For GTFS: Benchmarking Llms On GTFS Understanding And Retrieval Devunuri Saipraneeth, Qiam Shirin, Lehe Lewis
- Lmflow: An Extensible Toolkit For Finetuning And Inference Of Large Foundation Models Diao Shizhe, Pan Rui, Dong Hanze, Shum Ka Shun, Zhang Jipeng, Xiong Wei, Zhang Tong
- Can Instruction Fine-tuned Language Models Identify Social Bias Through Prompting? Dige Omkar, Tian Jacob-junqi, Emerson David, Khattak Faiza Khan
- Enhancing Chat Language Models By Scaling High-quality Instructional Conversations Ding Ning, Chen Yulin, Xu Bokai, Qin Yujia, Zheng Zhi, Hu Shengding, Liu Zhiyuan, Sun Maosong, Zhou Bowen
- A Wolf In Sheep's Clothing: Generalized Nested Jailbreak Prompts Can Fool Large Language Models Easily Ding Peng, Kuang Jun, Ma Dan, Cao Xuezhi, Xian Yunsen, Chen Jiajun, Huang Shujian
- Everything Of Thoughts: Defying The Law Of Penrose Triangle For Thought Generation Ding Ruomeng, Zhang Chaoyun, Wang Lu, Xu Yong, Ma Minghua, Zhang Wei, Qin Si, Rajmohan Saravan, Lin Qingwei, Zhang Dongmei
- Self-agreement: A Framework For Fine-tuning Language Models To Find Agreement Among Diverse Opinions Ding Shiyao, Ito Takayuki
- Do Llms Work On Charts? Designing Few-shot Prompts For Chart Question Answering And Summarization Do Xuan Long, Hassanpour Mohammad, Masry Ahmed, Kavehzadeh Parsa, Hoque Enamul, Joty Shafiq
- Modeling What-to-ask And How-to-ask For Answer-unaware Conversational Question Generation Do Xuan Long, Zou Bowei, Joty Shafiq, Tran Anh Tai, Pan Liangming, Chen Nancy F., Aw Ai Ti
- Establishing Performance Baselines In Fine-tuning, Retrieval-augmented Generation And Soft-prompting For Non-specialist LLM Users Dodgson Jennifer, Nanzheng Lin, Peh Julian, Pattirane Akira Rafhael Janson, Alhajir Alfath Daryl, Dinarto Eko Ridho, Lim Joseph, Ahmad Syed Danyal
- How To Build An AI Tutor That Can Adapt To Any Course And Provide Accurate Answers Using Large Language Model And Retrieval-augmented Generation Dong Chenxi
- Blockwise Compression Of Transformer-based Models Without Retraining Dong Gaochen, Chen Wei
- Revisit Input Perturbation Problems For Llms: A Unified Robustness Evaluation Framework For Noisy Slot Filling Task Dong Guanting, Zhao Jinxu, Hui Tingfeng, Guo Daichi, Wan Wenlong, Feng Boqi, Qiu Yueyan, Gongque Zhuoma, He Keqing, Wang Zechen, Xu Weiran
- Large Language Model For Science: A Study On P Vs. NP Dong Qingxiu, Dong Li, Xu Ke, Zhou Guangyan, Hao Yaru, Sui Zhifang, Wei Furu
- The Philosopher's Stone: Trojaning Plugins Of Large Language Models Dong Tian, Xue Minhui, Chen Guoxing, Holland Rayne, Meng Yan, Li Shaofeng, Liu Zhen, Zhu Haojin
- Loramoe: Alleviate World Knowledge Forgetting In Large Language Models Via Moe-style Plugin Dou Shihan, Zhou Enyu, Liu Yan, Gao Songyang, Zhao Jun, Shen Wei, Zhou Yuhao, Xi Zhiheng, Wang Xiao, Fan Xiaoran, Pu Shiliang, Zhu Jiang, Zheng Rui, Gui Tao, Zhang Qi, Huang Xuanjing
- Scratch Copilot Evaluation: Assessing Ai-assisted Creative Coding For Families Druga Stefania, Otero Nancy
- From Static To Dynamic: A Continual Learning Framework For Large Language Models Du Mingzhe, Luu Anh Tuan, Ji Bin, Ng See-kiong
- Improving Factuality And Reasoning In Language Models Through Multiagent Debate Du Yilun, Li Shuang, Torralba Antonio, Tenenbaum Joshua B., Mordatch Igor
- Bridging The Language Gap: Knowledge Injected Multilingual Question Answering Duan Zhichao, Li Xiuxing, Zhang Zhengyan, Li Zhenyu, Liu Ning, Wang Jianyong
- Alpacafarm: A Simulation Framework For Methods That Learn From Human Feedback Dubois Yann, Li Xuechen, Taori Rohan, Zhang Tianyi, Gulrajani Ishaan, Ba Jimmy, Guestrin Carlos, Liang Percy, Hashimoto Tatsunori B.
- Frugal Lms Trained To Invoke Symbolic Solvers Achieve Parameter-efficient Arithmetic Reasoning Dutta Subhabrata, Singh Joykirat, Pandey Ishan, Manchanda Sunny, Chakrabarti Soumen, Chakraborty Tanmoy
- Faith And Fate: Limits Of Transformers On Compositionality Dziri Nouha, Lu Ximing, Sclar Melanie, Li Xiang Lorraine, Jiang Liwei, Lin Bill Yuchen, West Peter, Bhagavatula Chandra, Bras Ronan Le, Hwang Jena D., Sanyal Soumya, Welleck Sean, Ren Xiang, Ettinger Allyson, Harchaoui Zaid, Choi Yejin
- Halo: Estimation And Reduction Of Hallucinations In Open-source Weak Large Language Models Elaraby Mohamed, Lu Mengyin, Dunn Jacob, Zhang Xueying, Wang Yu, Liu Shizhu, Tian Pingchuan, Wang Yuping, Wang Yuxuan
- Tinystories: How Small Can Language Models Be And Still Speak Coherent English? Eldan Ronen, Li Yuanzhi
- RAGAS: Automated Evaluation Of Retrieval Augmented Generation Es Shahul, James Jithin, Espinosa-anke Luis, Schockaert Steven
- ROBBIE: Robust Bias Evaluation Of Large Generative Language Models Esiobu David, Tan Xiaoqing, Hosseini Saghar, Ung Megan, Zhang Yuchen, Fernandes Jude, Dwivedi-yu Jane, Presani Eleonora, Williams Adina, Smith Eric Michael
- Chatgpt Outperforms Crowd-workers For Text-annotation Tasks Fabrizio Gilardi, Meysam Alizadeh, Maël Kubli
- Lm-polygraph: Uncertainty Estimation For Language Models Fadeeva Ekaterina, Vashurin Roman, Tsvigun Akim, Vazhentsev Artem, Petrakov Sergey, Fedyanin Kirill, Vasilev Daniil, Goncharova Elizaveta, Panchenko Alexander, Panov Maxim, Baldwin Timothy, Shelmanov Artem
- Chain-of-thought Tuning: Masked Language Models Can Also Think Step By Step In Natural Language Understanding Fan Caoyun, Tian Jidong, Li Yitian, Chen Wenqing, He Hao, Jin Yaohui
- Proximal Policy Optimization Actual Combat: Manipulating Output Tokenizer Length Fan Miao, Hu Chen, Zhou Shuchang
- Large Language Models For Code Analysis: Do Llms Really Do Their Job? Fang Chongzhou, Miao Ning, Srivastav Shaurya, Liu Jialin, Zhang Ruoyu, Fang Ruijie, Asmita, Tsang Ryan, Nazari Najmeh, Wang Han, Homayoun Houman
- Using GPT-4 To Augment Unbalanced Data For Automatic Scoring Fang Luyang, Lee Gyeong-geon, Zhai Xiaoming
- Instructseq: Unifying Vision Tasks With Instruction-conditioned Multi-modal Sequence Generation Fang Rongyao, Yan Shilin, Huang Zhaoyang, Zhou Jingqiu, Tian Hao, Dai Jifeng, Li Hongsheng
- Tooltalk: Evaluating Tool-usage In A Conversational Setting Farn Nicholas, Shin Richard
- Extending Context Window Of Large Language Models Via Semantic Compression Fei Weizhi, Niu Xueyan, Zhou Pingyi, Hou Lu, Bai Bo, Deng Lei, Han Wei
- Trapping LLM Hallucinations Using Tagged Context Prompts Feldman Philip, Foulds James R., Pan Shimei
- Chessgpt: Bridging Policy Learning And Language Modeling Feng Xidong, Luo Yicheng, Wang Ziyan, Tang Hongrui, Yang Mengyue, Shao Kun, Mguni David, Du Yali, Wang Jun
- Alphazero-like Tree-search Can Guide Large Language Model Decoding And Training Feng Xidong, Wan Ziyu, Wen Muning, Mcaleer Stephen Marcus, Wen Ying, Zhang Weinan, Wang Jun
- Towards Llm-driven Dialogue State Tracking Feng Yujie, Lu Zexin, Liu Bo, Zhan Liming, Wu Xiao-ming
- The Devil Is In The Errors: Leveraging Large Language Models For Fine-grained Machine Translation Evaluation Fernandes Patrick, Deutsch Daniel, Finkelstein Mara, Riley Parker, Martins André F. T., Neubig Graham, Garg Ankush, Clark Jonathan H., Freitag Markus, Firat Orhan
- Automating Behavioral Testing In Machine Translation Ferrando Javier, Sperber Matthias, Setiawan Hendra, Telaar Dominic, Hasan Saša
- Matching Pairs: Attributing Fine-tuned Models To Their Pre-trained Large Language Models Foley Myles, Rawat Ambrish, Lee Taesung, Hou Yufang, Picco Gabriele, Zizzo Giulio
- Can Large Language Models Follow Concept Annotation Guidelines? A Case Study On Scientific And Financial Domains Fonseca Marcio, Cohen Shay B.
- Enhancing Pipeline-based Conversational Agents With Large Language Models Foosherian Mina, Purwins Hendrik, Rathnayake Purna, Alam Touhidul, Teimao Rui, Thoben Klaus-dieter
- Chatgpt & Mechanical Engineering: Examining Performance On The FE Mechanical Engineering And Undergraduate Exams Frenkel Matthew, Emara Hebah
- Gptscore: Evaluate As You Desire Fu Jinlan, Ng See-kiong, Jiang Zhengbao, Liu Pengfei
- Revisiting The Knowledge Injection Frameworks Fu Peng, Zhang Yiming, Wang Haobo, Qiu Weikang, Zhao Junbo
- Misusing Tools In Large Language Models With Visual Adversarial Examples Fu Xiaohan, Wang Zihan, Li Shuheng, Gupta Rajesh K., Mireshghallah Niloofar, Berg-kirkpatrick Taylor, Fernandes Earlence
- Chain-of-thought Hub: A Continuous Effort To Measure Large Language Models' Reasoning Performance Fu Yao, Ou Litu, Chen Mingyu, Wan Yuhao, Peng Hao, Khot Tushar
- Gpt4aigchip: Towards Next-generation AI Accelerator Design Automation Via Large Language Models Fu Yonggan, Zhang Yongan, Yu Zhongzhi, Li Sixu, Ye Zhifan, Li Chaojian, Wan Cheng, Lin Yingyan
- From Text To Self: Users' Perceptions Of Potential Of AI On Interpersonal Communication And Self Fu Yue, Foell Sami, Xu Xuhai, Hiniker Alexis
- The Impact And Applications Of Chatgpt: A Systematic Review Of Literature Reviews Gabashvili Irene S.
- Natural Language Commanding Via Program Synthesis Gandhi Apurva, Nguyen Thong Q., Jiao Huitian, Steen Robert, Bhatawdekar Ameya
- Strategyllm: Large Language Models As Strategy Generators, Executors, Optimizers, And Evaluators For Problem Solving Gao Chang, Jiang Haiyun, Cai Deng, Shi Shuming, Lam Wai
- ASSISTGUI: Task-oriented Desktop Graphical User Interface Automation Gao Difei, Ji Lei, Bai Zechen, Ouyang Mingyu, Li Peiran, Mao Dongxing, Wu Qinchen, Zhang Weichen, Wang Peiyi, Guo Xiangwu, Wang Hengxu, Zhou Luowei, Shou Mike Zheng
- Assistgpt: A General Multi-modal Assistant That Can Plan, Execute, Inspect, And Learn Gao Difei, Ji Lei, Zhou Luowei, Lin Kevin Qinghong, Chen Joya, Fan Zihan, Shou Mike Zheng
- Llama-adapter V2: Parameter-efficient Visual Instruction Model Gao Peng, Han Jiaming, Zhang Renrui, Lin Ziyi, Geng Shijie, Zhou Aojun, Zhang Wei, Lu Pan, He Conghui, Yue Xiangyu, Li Hongsheng, Qiao Yu
- CLOVA: A Closed-loop Visual Assistant With Tool Usage And Update Gao Zhi, Du Yuntao, Zhang Xintong, Ma Xiaojian, Han Wenjuan, Zhu Song-chun, Li Qing
- Building Trustworthy Neurosymbolic AI Systems: Consistency, Reliability, Explainability, And Safety Gaur Manas, Sheth Amit
- Mllm-bench: Evaluating Multimodal Llms With Per-sample Criteria Ge Wentao, Chen Shunian, Chen Guiming Hardy, Chen Zhihong, Chen Junying, Yan Shuo, Zhu Chenghao, Lin Ziyue, Xie Wenya, Zhang Xinyi, Chai Yichen, Liu Xiaoyu, Song Dingjie, Wang Xidong, Gao Anningzhe, Zhang Zhiyi, Li Jianquan, Wan Xiang, Wang Benyou
- UPAR: A Kantian-inspired Prompting Framework For Enhancing Large Language Model Capabilities Geng Hejia, Xu Boxun, Li Peng
- Do Generative Large Language Models Need Billions Of Parameters? Gholami Sia, Omar Marwan
- Clipsyntel: CLIP And LLM Synergy For Multimodal Question Summarization In Healthcare Ghosh Akash, Acharya Arkadeep, Jain Raghav, Saha Sriparna, Chadha Aman, Sinha Setu
- Chatgpt Perpetuates Gender Bias In Machine Translation And Ignores Non-gendered Pronouns: Findings Across Bengali And Five Other Low-resource Languages Ghosh Sourojit, Caliskan Aylin
- Gpt-based Models Meet Simulation: How To Efficiently Use Large-scale Pre-trained Language Models Across Simulation Tasks Giabbanelli Philippe J.
- Can Llms Capture Human Preferences? Goli Ali, Singh Amandeep
- Coascore: Chain-of-aspects Prompting For NLG Evaluation Gong Peiyuan, Mao Jiaxin
- Bloomvqa: Assessing Hierarchical Multi-modal Comprehension Gong Yunye, Shrestha Robik, Claypoole Jared, Cogswell Michael, Ray Arijit, Kanan Christopher, Divakaran Ajay
- Using Large Language Models For Interpreting Autonomous Robots Behaviors González-santamarta Miguel A., Fernández-becerra Laura, Sobrín-hidalgo David, Guerrero-higueras Ángel Manuel, González Irene, Lera Francisco J. Rodríguez
- Integration Of Large Language Models Within Cognitive Architectures For Autonomous Robots González-santamarta Miguel Á., Rodríguez-lera Francisco J., Guerrero-higueras Ángel Manuel, Matellán-olivera Vicente
- LILO: Learning Interpretable Libraries By Compressing And Documenting Code Grand Gabriel, Wong Lionel, Bowers Maddy, Olausson Theo X., Liu Muxin, Tenenbaum Joshua B., Andreas Jacob
- Not What You've Signed Up For: Compromising Real-world Llm-integrated Applications With Indirect Prompt Injection Greshake Kai, Abdelnabi Sahar, Mishra Shailesh, Endres Christoph, Holz Thorsten, Fritz Mario
- Dual Use Concerns Of Generative AI And Large Language Models Grinbaum Alexei, Adomaitis Laurynas
- Augmented Language Models: A Survey Grégoire Mialon, Roberto Dessì, Maria Lomeli, Christoforos Nalmpantis, Ram Pasunuru, Roberta Raileanu, Baptiste Rozière, Timo Schick, Jane Dwivedi-yu, Asli Celikyilmaz, Edouard Grave, Yann Lecun, Thomas Scialom
- Pokemqa: Programmable Knowledge Editing For Multi-hop Question Answering Gu Hengrui, Zhou Kaixiong, Han Xiaotian, Liu Ninghao, Wang Ruobing, Wang Xin
- Neuron Patching: Semantic-based Neuron-level Language Model Repair For Code Generation Gu Jian, Aleti Aldeida, Chen Chunyang, Zhang Hongyu
- Linguistically Informed Chatgpt Prompts To Enhance Japanese-chinese Machine Translation: A Case Study On Attributive Clauses Gu Wenshi
- Seer: Language Instructed Video Prediction With Latent Diffusion Models Gu Xianfan, Wen Chuan, Ye Weirui, Song Jiaming, Gao Yang
- Digital Socrates: Evaluating Llms Through Explanation Critiques Gu Yuling, Tafjord Oyvind, Clark Peter
- Leveraging Pre-trained Large Language Models To Construct And Utilize World Models For Model-based Task Planning Guan Lin, Valmeekam Karthik, Sreedharan Sarath, Kambhampati Subbarao
- Mitigating Large Language Model Hallucinations Via Autonomous Knowledge Graph-based Retrofitting Guan Xinyan, Liu Yanjiang, Lin Hongyu, Lu Yaojie, He Ben, Han Xianpei, Sun Le
- Voyager: An Open-ended Embodied Agent With Large Language Models Guanzhi Wang, Yuqi Xie, Yunfan Jiang, Ajay Mandlekar, Chaowei Xiao, Yuke Zhu, Linxi Fan, Anima Anandkumar
- Hifi: High-information Attention Heads Hold For Parameter-efficient Model Adaptation Gui Anchun, Xiao Han
- OWL: A Large Language Model For IT Operations Guo Hongcheng, Yang Jian, Liu Jiaheng, Yang Liqun, Chai Linzheng, Bai Jiaqi, Peng Junran, Hu Xiaorong, Chen Chao, Zhang Dongfeng, Shi Xu, Zheng Tieqiao, Zheng Liangfan, Zhang Bo, Xu Ke, Li Zhoujun
- Empowering Working Memory For Large Language Model Agents Guo Jing, Li Nan, Qi Jianchuan, Yang Hang, Li Ruiqiao, Feng Yuzhen, Zhang Si, Xu Ming
- Connecting Large Language Models With Evolutionary Algorithms Yields Powerful Prompt Optimizers Guo Qingyan, Wang Rui, Guo Junliang, Li Bei, Song Kaitao, Tan Xu, Liu Guoqing, Bian Jiang, Yang Yujiu
- Knowledgenavigator: Leveraging Large Language Models For Enhanced Reasoning Over Knowledge Graph Guo Tiezheng, Yang Qingwen, Wang Chen, Liu Yanyi, Li Pan, Tang Jiawei, Li Dapeng, Wen Yingyou
- Doremi: Grounding Language Model By Detecting And Recovering From Plan-execution Misalignment Guo Yanjiang, Wang Yen-jen, Zha Lihan, Jiang Zheyuan, Chen Jianyu
- PPTC Benchmark: Evaluating Large Language Models For Powerpoint Task Completion Guo Yiduo, Zhang Zekai, Liang Yaobo, Zhao Dongyan, Duan Nan
- Arthmodel: Enhance Arithmetic Skills To Large Language Model Guo Yingdi
- Gemrec: Towards Generative Model Recommendation Guo Yuanhe, Liu Haoming, Wen Hongyi
- Retrieval-augmented Code Generation For Universal Information Extraction Guo Yucan, Li Zixuan, Jin Xiaolong, Liu Yantao, Zeng Yutao, Liu Wenxuan, Li Xiang, Yang Pan, Bai Long, Guo Jiafeng, Cheng Xueqi
- Evaluating Large Language Models: A Comprehensive Survey Guo Zishan, Jin Renren, Liu Chuang, Huang Yufei, Shi Dan, Supryadi, Yu Linhao, Liu Yan, Li Jiaxuan, Xiong Bojian, Xiong Deyi
- Targen: Targeted Data Generation With Large Language Models Gupta Himanshu, Scaria Kevin, Anantheswaran Ujjwala, Verma Shreyas, Parmar Mihir, Sawant Saurabh Arjun, Baral Chitta, Mishra Swaroop
- Testing LLM Performance On The Physics GRE: Some Observations Gupta Pranav
- A Survey Of Vision-language Pre-training From The Lens Of Multimodal Machine Translation Gwinnup Jeremy, Duh Kevin
- Scaling Up And Distilling Down: Language-guided Robot Skill Acquisition Ha Huy, Florence Pete, Song Shuran
- Regulating Chatgpt And Other Large Generative AI Models Hacker Philipp, Engel Andreas, Mauer Marco
- Seeing Chatgpt Through Students' Eyes: An Analysis Of Tiktok Data Haensch Anna-carolina, Ball Sarah, Herklotz Markus, Kreuter Frauke
- Onellm: One Framework To Align All Modalities With Language Han Jiaming, Gong Kaixiong, Zhang Yiyuan, Wang Jiaqi, Zhang Kaipeng, Lin Dahua, Qiao Yu, Gao Peng, Yue Xiangyu
- Pive: Prompting With Iterative Verification Improving Graph-based Generative Capability Of Llms Han Jiuzhou, Collier Nigel, Buntine Wray, Shareghi Ehsan
- Llms As Visual Explainers: Advancing Image Classification With Evolving Visual Descriptions Han Songhao, Zhuo Le, Liao Yue, Liu Si
- "Mistakes Help Us Grow": Facilitating And Evaluating Growth Mindset Supportive Language In Classrooms Handa Kunal, Clapper Margaret, Boyle Jessica, Wang Rose E, Yang Diyi, Yeager David S, Demszky Dorottya
- Personalisation Within Bounds: A Risk Taxonomy And Policy Framework For The Alignment Of Large Language Models With Personalised Feedback Hannah Rose Kirk, Bertie Vidgen, Paul Röttger, Scott A. Hale
- Reasoning Implicit Sentiment With Chain-of-thought Prompting Hao Fei, Bobo Li, Qian Liu, Lidong Bing, Fei Li, Tat-seng Chua
- Toolkengpt: Augmenting Frozen Language Models With Massive Tools Via Tool Embeddings Hao Shibo, Liu Tianyang, Wang Zhen, Hu Zhiting
- Idealgpt: Iteratively Decomposing Vision And Language Reasoning Via Large Language Models Haoxuan You, Rui Sun, Zhecan Wang, Long Chen, Gengyu Wang, Hammad A. Ayyubi, Kai-wei Chang, Shih-fu Chang
- Analyzing The Performance Of Chatgpt In Cardiology And Vascular Pathologies Hariri Walid
- CHAI-DT: A Framework For Prompting Conversational Generative AI Agents To Actively Participate In Co-creation Harwood Brandon
- Zero- And Few-shot Prompting With Llms: A Comparative Study With Fine-tuned Models For Bangla Sentiment Analysis Hasan Md. Arid, Das Shudipta, Anjum Afiyat, Alam Firoj, Anjum Anika, Sarker Avijit, Noori Sheak Rashed Haider
- Chatgpt As Your Personal Data Scientist Hassan Md Mahadi, Knipper Alex, Santu Shubhra Kanti Karmaker
- ICL-D3IE: In-context Learning With Diverse Demonstrations Updating For Document Information Extraction He Jiabang, Wang Lei, Hu Yi, Liu Ning, Liu Hui, Xu Xing, Shen Heng Tao
- Wordart Designer: User-driven Artistic Typography Synthesis Using Large Language Models He Jun-yan, Cheng Zhi-qi, Li Chenyang, Sun Jingdong, Xiang Wangmeng, Lin Xianhui, Kang Xiaoyang, Jin Zengke, Hu Yusen, Luo Bin, Geng Yifeng, Xie Xuansong, Zhou Jingren
- Targeted Data Generation: Finding And Fixing Model Weaknesses He Zexue, Ribeiro Marco Tulio, Khani Fereshte
- Chatgpt For Zero-shot Dialogue State Tracking: A Solution Or An Opportunity? Heck Michael, Lubis Nurul, Ruppik Benjamin, Vukovic Renato, Feng Shutong, Geishauser Christian, Lin Hsien-chin, Van Niekerk Carel, Gašić Milica
- Analyzing The Efficacy Of An Llm-only Approach For Image-based Document Question Answering Hegde Nidhi, Paul Sujoy, Madan Gagan, Aggarwal Gaurav
- Stochastic Parrots Looking For Stochastic Parrots: Llms Are Easy To Fine-tune And Hard To Detect With Other Llms Henrique Da Silva Gameiro, Kucharavy Andrei, Guerraoui Rachid
- Newsgpt: Chatgpt Integration For Robot-reporter Hireche Abdelhadi, Belkacem Abdelkader Nasreddine, Jamil Sadia, Chen Chao
- L2MAC: Large Language Model Automatic Computer For Extensive Code Generation Holt Samuel, Luyten Max Ruiz, Van Der Schaar Mihaela
- Negative Object Presence Evaluation (NOPE) To Measure Object Hallucination In Vision-language Models Holy Lovenia, Wenliang Dai, Samuel Cahyawijaya, Ziwei Ji, Pascale Fung
- DP-OPT: Make Large Language Model Your Privacy-preserving Prompt Engineer Hong Junyuan, Wang Jiachen T., Zhang Chenhui, Li Zhangheng, Li Bo, Wang Zhangyang
- Direct2v: Large Language Models Are Frame-level Directors For Zero-shot Text-to-video Generation Hong Susung, Seo Junyoung, Shin Heeseong, Hong Sunghwan, Kim Seungryong
- SCITUNE: Aligning Large Language Models With Scientific Multimodal Instructions Horawalavithana Sameera, Munikoti Sai, Stewart Ian, Kvinge Henry
- Ralle: A Framework For Developing And Evaluating Retrieval-augmented Large Language Models Hoshi Yasuto, Miyashita Daisuke, Ng Youyang, Tatsuno Kento, Morioka Yasuhiro, Torii Osamu, Deguchi Jun
- More Robots Are Coming: Large Multimodal Models (Chatgpt) Can Solve Visually Diverse Images Of Parsons Problems Hou Irene, Man Owen, Mettille Sophie, Gutierrez Sebastian, Angelikas Kenneth, Macneil Stephen
- Distilling Step-by-step! Outperforming Larger Language Models With Less Training Data And Smaller Model Sizes Hsieh Cheng-yu, Li Chun-liang, Yeh Chih-kuan, Nakhost Hootan, Fujii Yasuhisa, Ratner Alexander, Krishna Ranjay, Lee Chen-yu, Pfister Tomas
- Prompting Is Not A Substitute For Probability Measurements In Large Language Models Hu Jennifer, Levy Roger
- Ladder-of-thought: Using Knowledge As Steps To Elevate Stance Detection Hu Kairui, Yan Ming, Zhou Joey Tianyi, Tsang Ivor W., Chong Wen Haw, Yap Yong Keong
- Evoke: Evoking Critical Thinking Abilities In Llms Via Reviewer-author Prompt Editing Hu Xinyu, Tang Pengfei, Zuo Simiao, Wang Zihan, Song Bowen, Lou Qiang, Jiao Jian, Charles Denis
- Decipherpref: Analyzing Influential Factors In Human Preference Judgments Via GPT-4 Hu Yebowen, Song Kaiqiang, Cho Sangwoo, Wang Xiaoyang, Foroosh Hassan, Liu Fei
- Visual Program Distillation: Distilling Tools And Programmatic Reasoning Into Vision-language Models Hu Yushi, Stretcu Otilia, Lu Chun-ta, Viswanathan Krishnamurthy, Hata Kenji, Luo Enming, Krishna Ranjay, Fuxman Ariel
- AMERICANO: Argument Generation With Discourse-driven Decomposition And Agent Interaction Hu Zhe, Chan Hou Pong, Yin Yu
- Token-level Adversarial Prompt Detection Based On Perplexity Measures And Contextual Information Hu Zhengmian, Wu Gang, Mitra Saayan, Zhang Ruiyi, Sun Tong, Huang Heng, Swaminathan Viswanathan
- Llm-adapters: An Adapter Family For Parameter-efficient Fine-tuning Of Large Language Models Hu Zhiqiang, Wang Lei, Lan Yihuai, Xu Wanyu, Lim Ee-peng, Bing Lidong, Xu Xing, Poria Soujanya, Lee Roy Ka-wei
- Enhancing Large Language Models In Coding Through Multi-perspective Self-consistency Huang Baizhou, Lu Shuai, Chen Weizhu, Wan Xiaojun, Duan Nan
- Lorahub: Efficient Cross-task Generalization Via Dynamic Lora Composition Huang Chengsong, Liu Qian, Lin Bill Yuchen, Pang Tianyu, Du Chao, Lin Min
- Agentcoder: Multi-agent-based Code Generation With Iterative Testing And Optimisation Huang Dong, Zhang Jie M., Luck Michael, Bu Qingwen, Qing Yuhao, Cui Heming
- Chatgpt For Shaping The Future Of Dentistry: The Potential Of Multi-modal Large Language Model Huang Hanyao, Zheng Ou, Wang Dongdong, Yin Jiayi, Wang Zijin, Ding Shengxuan, Yin Heng, Xu Chuan, Yang Renjie, Zheng Qian, Shi Bing
- Free-bloom: Zero-shot Text-to-video Generator With LLM Director And LDM Animator Huang Hanzhuo, Feng Yufan, Shi Cheng, Xu Lan, Yu Jingyi, Yang Sibei
- Who Is Chatgpt? Benchmarking Llms' Psychological Portrayal Using Psychobench Huang Jen-tse, Wang Wenxuan, Li Eric John, Lam Man Ho, Ren Shujie, Yuan Youliang, Jiao Wenxiang, Tu Zhaopeng, Lyu Michael R.
- Lawyer Llama Technical Report Huang Quzhe, Tao Mingxu, Zhang Chen, An Zhenwei, Jiang Cong, Chen Zhibin, Wu Zirui, Feng Yansong
- Chat Vector: A Simple Approach To Equip Llms With Instruction Following And Model Alignment In New Languages Huang Shih-cheng, Li Pin-zu, Hsu Yu-chi, Chen Kuang-ming, Lin Yu Tung, Hsiao Shih-kai, Tsai Richard Tzong-han, Lee Hung-yi
- Lateval: An Interactive Llms Evaluation Benchmark With Incomplete Information From Lateral Thinking Puzzles Huang Shulin, Ma Shirong, Li Yinghui, Huang Mengzuo, Zou Wuhe, Zhang Weidong, Zheng Hai-tao
- Catastrophic Jailbreak Of Open-source Llms Via Exploiting Generation Huang Yangsibo, Gupta Samyak, Xia Mengzhou, Li Kai, Chen Danqi
- Advancing Transformer Architecture In Long-context Large Language Models: A Comprehensive Survey Huang Yunpeng, Xu Jingwei, Lai Junyu, Jiang Zixu, Chen Taolue, Li Zenan, Yao Yuan, Ma Xiaoxing, Yang Lijuan, Chen Hao, Li Shupeng, Zhao Penghao
- C-eval: A Multi-level Multi-discipline Chinese Evaluation Suite For Foundation Models Huang Yuzhen, Bai Yuzhuo, Zhu Zhihao, Zhang Junlei, Zhang Jinghan, Su Tangjun, Liu Junteng, Lv Chuancheng, Zhang Yikai, Lei Jiayi, Fu Yao, Sun Maosong, He Junxian
- Soft Prompt Decoding For Multilingual Dense Retrieval Huang Zhiqi, Zeng Hansi, Zamani Hamed, Allan James
- Towards Leveraging Llms For Conditional QA Hussain Syed-amad, Dakle Parag Pravin, Rallabandi Saikrishna, Raghavan Preethi
- Conceptual Model Interpreter For Large Language Models Härer Felix
- Minimizing Factual Inconsistency And Hallucination In Large Language Models I Muneeswaran, Saxena Shreya, Prasad Siva, Prakash M V Sai, Shankar Advaith, V Varun, Vaddina Vishal, Gopalakrishnan Saisubramaniam
- Diversigate: A Comprehensive Framework For Reliable Large Language Models Imani Shima, Beyram Ali, Shrivastava Harsh
- Multitool-cot: GPT-3 Can Use Multiple External Tools With Chain Of Thought Prompting Inaba Tatsuro, Kiyomaru Hirokazu, Cheng Fei, Kurohashi Sadao
- Llama Guard: Llm-based Input-output Safeguard For Human-ai Conversations Inan Hakan, Upasani Kartikeya, Chi Jianfeng, Rungta Rashi, Iyer Krithika, Mao Yuning, Tontchev Michael, Hu Qing, Fuller Brian, Testuggine Davide, Khabsa Madian
- Running Cognitive Evaluations On Large Language Models: The Do's And The Don'ts Ivanova Anna A.
- 14 Examples Of How Llms Can Transform Materials Science And Chemistry: A Reflection On A Large Language Model Hackathon Jablonka Kevin Maik, Ai Qianxiang, Al-feghali Alexander, Badhwar Shruti, Bocarsly Joshua D., Bran Andres M, Bringuier Stefan, Brinson L. Catherine, Choudhary Kamal, Circi Defne, Cox Sam, De Jong Wibe A., Evans Matthew L., Gastellu Nicolas, Genzling Jerome, Gil María Victoria, Gupta Ankur K., Hong Zhi, Imran Alishba, Kruschwitz Sabine, Labarre Anne, Lála Jakub, Liu Tao, Ma Steven, Majumdar Sauradeep, Merz Garrett W., Moitessier Nicolas, Moubarak Elias, Mouriño Beatriz, Pelkie Brenden, Pieler Michael, Ramos Mayk Caldas, Ranković Bojana, Rodriques Samuel G., Sanders Jacob N., Schwaller Philippe, Schwarting Marcus, Shi Jiale, Smit Berend, Smith Ben E., Van Herck Joren, Völker Christoph, Ward Logan, Warren Sean, Weiser Benjamin, Zhang Sylvester, Zhang Xiaoqi, Zia Ghezal Ahmad, Scourtas Aristana, Schmidt Kj, Foster Ian, White Andrew D., Blaiszik Ben
- Emergent Ai-assisted Discourse: Case Study Of A Second Language Writer Authoring With Chatgpt Jacob Sharin, Tate Tamara, Warschauer Mark
- Bring Your Own Data! Self-supervised Evaluation For Large Language Models Jain Neel, Saifullah Khalid, Wen Yuxin, Kirchenbauer John, Shu Manli, Saha Aniruddha, Goldblum Micah, Geiping Jonas, Goldstein Tom
- Cotran: An Llm-based Code Translator Using Reinforcement Learning With Feedback From Compiler And Symbolic Execution Jana Prithwish, Jha Piyush, Ju Haoyang, Kishore Gautham, Mahajan Aryan, Ganesh Vijay
- Faithful Persona-based Conversational Dataset Generation With Large Language Models Jandaghi Pegah, Sheng Xianghai, Bai Xinyi, Pujara Jay, Sidahmed Hakim
- Consistency Analysis Of Chatgpt Jang Myeongjun Erik, Lukasiewicz Thomas
- Chit-chat Or Deep Talk: Prompt Engineering For Process Mining Jessen Urszula, Sroka Michal, Fahland Dirk
- LIMIT: Less Is More For Instruction Tuning Across Evaluation Paradigms Jha Aditi, Havens Sam, Dohmann Jeremy, Trott Alex, Portes Jacob
- Rethinking Large Language Models In Mental Health Applications Ji Shaoxiong, Zhang Tianlin, Yang Kailai, Ananiadou Sophia, Cambria Erik
- Benchmarking And Explaining Large Language Model-based Code Generation: A Causality-centric Approach Ji Zhenlan, Ma Pingchuan, Li Zongjie, Wang Shuai
- Bootstrapping Vision-language Learning With Decoupled Language Pre-training Jian Yiren, Gao Chongyang, Vosoughi Soroush
- Expedited Training Of Visual Conditioned Language Generation Via Redundancy Reduction Jian Yiren, Liu Tingkai, Tao Yunzhe, Zhang Chunhui, Vosoughi Soroush, Yang Hongxia
- Llm-blender: Ensembling Large Language Models With Pairwise Ranking And Generative Fusion Jiang Dongfu, Ren Xiang, Lin Bill Yuchen
- You Only Forward Once: Prediction And Rationalization In A Single Forward Pass Jiang Han, Duan Junwen, Qu Zhe, Wang Jianxin
- Selfevolve: A Code Evolution Framework Via Large Language Models Jiang Shuyang, Wang Yuhao, Wang Yu
- Hykge: A Hypothesis Knowledge Graph Enhanced Framework For Accurate And Reliable Medical Llms Responses Jiang Xinke, Zhang Ruizhe, Xu Yongxin, Qiu Rihong, Fang Yue, Wang Zhiyuan, Tang Jinyi, Ding Hongxin, Chu Xu, Zhao Junfeng, Wang Yasha
- Lion: Adversarial Distillation Of Proprietary Large Language Models Jiang Yuxin, Chan Chunkit, Chen Mingyang, Wang Wei
- How Can Recommender Systems Benefit From Large Language Models: A Survey Jianghao Lin, Xinyi Dai, Yunjia Xi, Weiwen Liu, Bo Chen, Hao Zhang, Yong Liu, Chuhan Wu, Xiangyang Li, Chenxu Zhu, Huifeng Guo, Yong Yu, Ruiming Tang, Weinan Zhang
- Parrot: Translating During Chat Using Large Language Models Tuned With Human Translation And Feedback Jiao Wenxiang, Huang Jen-tse, Wang Wenxuan, He Zhiwei, Liang Tian, Wang Xing, Shi Shuming, Tu Zhaopeng
- Llara: Large Language-recommendation Assistant Jiayi Liao, Sihang Li, Zhengyi Yang, Jiancan Wu, Yancheng Yuan, Xiang Wang, Xiangnan He
- Emergent Representations Of Program Semantics In Language Models Trained On Programs Jin Charles, Rinard Martin
- When Large Language Models Meet Personalization: Perspectives Of Challenges And Opportunities Jin Chen, Zheng Liu, Xu Huang, Chenwang Wu, Qi Liu, Gangwei Jiang, Yuanhao Pu, Yuxuan Lei, Xiaolong Chen, Xingmei Wang, Defu Lian, Enhong Chen
- Better To Ask In English: Cross-lingual Evaluation Of Large Language Models For Healthcare Queries Jin Yiqiao, Chandra Mohit, Verma Gaurav, Hu Yibo, De Choudhury Munmun, Kumar Srijan
- S³: Increasing GPU Utilization During Generative Inference For Higher Throughput Jin Yunho, Wu Chun-feng, Brooks David, Wei Gu-yeon
- Impossible Distillation: From Low-quality Model To High-quality Dataset & Model For Summarization And Paraphrasing Jung Jaehun, West Peter, Jiang Liwei, Brahman Faeze, Lu Ximing, Fisher Jillian, Sorensen Taylor, Choi Yejin
- A Case Study On Test Case Construction With Large Language Models: Unveiling Practical Insights And Challenges Junior Roberto Francisco De Lima, Presta Luiz Fernando Paes De Barros, Borborema Lucca Santos, Da Silva Vanderson Nogueira, Dahia Marcio Leal De Melo, Santos Anderson Carlos Sousa E
- Recommendation As Instruction Following: A Large Language Model Empowered Recommendation Approach Junjie Zhang, Ruobing Xie, Yupeng Hou, Wayne Xin Zhao, Leyu Lin, Ji-rong Wen
- Evaluation And Analysis Of Hallucination In Large Vision-language Models Junyang Wang, Yiyang Zhou, Guohai Xu, Pengcheng Shi, Chenlin Zhao, Haiyang Xu, Qinghao Ye, Ming Yan, Ji Zhang, Jihua Zhu, Jitao Sang, Haoyu Tang
- Robust Recommender System: A Survey And Future Directions Kaike Zhang, Qi Cao, Fei Sun, Yunfan Wu, Shuchang Tao, Huawei Shen, Xueqi Cheng
- Robust Training For Conversational Question Answering Models With Reinforced Reformulation Generation Kaiser Magdalena, Roy Rishiraj Saha, Weikum Gerhard
- Exploiting Programmatic Behavior Of Llms: Dual-use Through Standard Security Attacks Kang Daniel, Li Xuechen, Stoica Ion, Guestrin Carlos, Zaharia Matei, Hashimoto Tatsunori
- Chatmof: An Autonomous AI System For Predicting And Generating Metal-organic Frameworks Kang Yeonghun, Kim Jihan
- Lagr-seq: Language-guided Reinforcement Learning With Sample-efficient Querying Karimpanal Thommen George, Semage Laknath Buddhika, Rana Santu, Le Hung, Tran Truyen, Gupta Sunil, Venkatesh Svetha
- Evaluating GPT-4 And Chatgpt On Japanese Medical Licensing Examinations Kasai Jungo, Kasai Yuhei, Sakaguchi Keisuke, Yamada Yutaro, Radev Dragomir
- Application Of Frozen Large-scale Models To Multimodal Task-oriented Dialogue Kawamoto Tatsuki, Suzuki Takuma, Miyama Ko, Meguro Takumi, Takagi Tomohiro
- Can Chatgpt And Bard Generate Aligned Assessment Items? A Reliability Analysis Against Human Performance Khademi Abdolvahab
- Zzzgpt: An Interactive GPT Approach To Enhance Sleep Quality Khaokaew Yonchanok, Ji Kaixin, Nguyen Thuc Hanh, Kegalle Hiruni, Alaofi Marwah, Xue Hao, Salim Flora D.
- Dspy: Compiling Declarative Language Model Calls Into Self-improving Pipelines Khattab Omar, Singhvi Arnav, Maheshwari Paridhi, Zhang Zhiyuan, Santhanam Keshav, Vardhamanan Sri, Haq Saiful, Sharma Ashutosh, Joshi Thomas T., Moazam Hanna, Miller Heather, Zaharia Matei, Potts Christopher
- Auto-survey Challenge Khuong Thanh Gia Hieu, Rachmat Benedictus Kent
- Expediting Contrastive Language-image Pretraining Via Self-distilled Encoders Kim Bumsoo, Kim Jinhyung, Jo Yeonsik, Kim Seung Hwan
- Squeezellm: Dense-and-sparse Quantization Kim Sehoon, Hooper Coleman, Gholami Amir, Dong Zhen, Li Xiuyu, Shen Sheng, Mahoney Michael W., Keutzer Kurt
- Speculative Decoding With Big Little Decoder Kim Sehoon, Mangalam Karttikeya, Moon Suhong, Malik Jitendra, Mahoney Michael W., Gholami Amir, Keutzer Kurt
- Aligning Large Language Models Through Synthetic Feedback Kim Sungdong, Bae Sanghwan, Shin Jamin, Kang Soyoung, Kwak Donghyun, Yoo Kang Min, Seo Minjoon
- Lmcanvas: Object-oriented Interaction To Personalize Large Language Model-powered Writing Environments Kim Tae Soo, Sarkar Arghya, Lee Yoonjoo, Chang Minsuk, Kim Juho
- The Past, Present And Better Future Of Feedback Learning In Large Language Models For Subjective Human Preferences And Values Kirk Hannah Rose, Bean Andrew M., Vidgen Bertie, Röttger Paul, Hale Scott A.
- Large Language Models Are Temporal And Causal Reasoners For Video Question Answering Ko Dohwan, Lee Ji Soo, Kang Wooyoung, Roh Byungseok, Kim Hyunwoo J.
- Conversational Ai-powered Design: Chatgpt As Designer, User, And Product Kocaballi A. Baki
- L-TUNING: Synchronized Label Tuning For Prompt And Prefix In Llms Kowsher Md., Sobuj Md. Shohanur Islam, Mahmud Asif, Prottasha Nusrat Jahan, Bhat Prakash
- Developing Effective Educational Chatbots With Chatgpt Prompts: Insights From Preliminary Tests In A Case Study On Social Media Literacy (with Appendix) Koyuturk Cansu, Yavari Mona, Theophilou Emily, Bursic Sathya, Donabauer Gregor, Telari Alessia, Testa Alessia, Boiano Raffaele, Gabbiadini Alessandro, Hernandez-leo Davinia, Ruskov Martin, Ognibene Dimitri
- Prompt Generate Train (PGT): Few-shot Domain Adaption Of Retrieval Augmented Generation Models For Open Book Question-answering Krishna C. S.
- Post Hoc Explanations Of Language Models Can Improve Language Models Krishna Satyapriya, Ma Jiaqi, Slack Dylan, Ghandeharioun Asma, Singh Sameer, Lakkaraju Himabindu
- In-context Explainers: Harnessing Llms For Explaining Black Box Models Kroeger Nicholas, Ley Dan, Krishna Satyapriya, Agarwal Chirag, Lakkaraju Himabindu
- Certifying LLM Safety Against Adversarial Prompting Kumar Aounon, Agarwal Chirag, Srinivas Suraj, Li Aaron Jiaxun, Feizi Soheil, Lakkaraju Himabindu
- Impact Of Guidance And Interaction Strategies For LLM Use On Learner Performance And Perception Kumar Harsh, Musabirov Ilya, Reza Mohi, Shi Jiakai, Wang Xinyuan, Williams Joseph Jay, Kuzminykh Anastasia, Liut Michael
- Large Language Models Humanize Technology Kumar Pratyush
- How Reliable Are Ai-generated-text Detectors? An Assessment Framework Using Evasive Soft Prompts Kumarage Tharindu, Sheth Paras, Moraffah Raha, Garland Joshua, Liu Huan
- Lavip: Language-grounded Visual Prompts Kunananthaseelan Nilakshan, Zhang Jing, Harandi Mehrtash
- Active Instruction Tuning: Improving Cross-task Generalization By Training On Prompt Sensitive Tasks Kung Po-nien, Yin Fan, Wu Di, Chang Kai-wei, Peng Nanyun
- DACBERT: Leveraging Dependency Agreement For Cost-efficient Bert Pretraining Kuo Martin, Zhang Jianyi, Chen Yiran
- Jotr: A Joint Transformer And Reinforcement Learning Framework For Dialog Policy Learning Kwan Wai-chung, Wang Huimin, Wang Hongru, Wang Zezhong, Wu Xian, Zheng Yefeng, Wong Kam-fai
- Reward Design With Language Models Kwon Minae, Xie Sang Michael, Bullard Kalesha, Sadigh Dorsa
- Openassistant Conversations -- Democratizing Large Language Model Alignment Köpf Andreas, Kilcher Yannic, Von Rütte Dimitri, Anagnostidis Sotiris, Tam Zhi-rui, Stevens Keith, Barhoum Abdullah, Duc Nguyen Minh, Stanley Oliver, Nagyfi Richárd, Es Shahul, Suri Sameer, Glushkov David, Dantuluri Arnav, Maguire Andrew, Schuhmann Christoph, Nguyen Huu, Mattick Alexander
- Are You Sure? Challenging Llms Leads To Performance Drops In The Flipflop Experiment Laban Philippe, Murakhovs'ka Lidiya, Xiong Caiming, Wu Chien-sheng
- Psy-llm: Scaling Up Global Mental Health Psychological Services With Ai-based Large Language Models Lai Tin, Shi Yukun, Du Zicong, Wu Jiajie, Fu Ken, Dou Yichao, Wang Ziqi
- Mini-dalle3: Interactive Text To Image By Prompting Large Language Models Lai Zeqiang, Zhu Xizhou, Dai Jifeng, Qiao Yu, Wang Wenhai
- Exploring Large Language Models To Facilitate Variable Autonomy For Human-robot Teaming Lakhnati Younes, Pascher Max, Gerken Jens
- Large Language Models Are Partially Primed In Pronoun Interpretation Lam Suet-ying, Zeng Qingcheng, Zhang Kexun, You Chenyu, Voigt Rob
- Creating Large Language Model Resistant Exams: Guidelines And Strategies Larsen Simon Kaare
- VAL: Interactive Task Learning With GPT Dialog Parsing Lawley Lane, Maclellan Christopher J.
- Orchestrallm: Efficient Orchestration Of Language Models For Dialogue State Tracking Lee Chia-hsuan, Cheng Hao, Ostendorf Mari
- Towards Reliable And Fluent Large Language Models: Incorporating Feedback Learning Loops In QA Systems Lee Dongyub, Whang Taesun, Lee Chanhee, Lim Heuiseok
- Applying Large Language Models And Chain-of-thought For Automatic Scoring Lee Gyeong-geon, Latif Ehsan, Wu Xuansheng, Liu Ninghao, Zhai Xiaoming
- Partially Randomizing Transformer Weights For Dialogue Response Diversity Lee Jing Yang, Lee Kong Aik, Gan Woon-seng
- Teaching Arithmetic To Small Transformers Lee Nayoung, Sreenivasan Kartik, Lee Jason D., Lee Kangwook, Papailiopoulos Dimitris
- Large Language Models Can Share Images, Too! Lee Young-jun, Lee Dokyong, Sung Joo Won, Hyeon Jonghwan, Choi Ho-jin
- Violation Of Expectation Via Metacognitive Prompting Reduces Theory Of Mind Prediction Error In Large Language Models Leer Courtland, Trost Vincent, Voruganti Vineeth
- Boosting Logical Reasoning In Large Language Models Through A New Framework: The Graph Of Thought Lei Bin, Lin Pei-hung, Liao Chunhua, Ding Caiwen
- Chain Of Natural Language Inference For Reducing Large Language Model Ungrounded Hallucinations Lei Deren, Li Yaxi, Hu Mengya, Wang Mingyu, Yun Vincent, Ching Emily, Kamal Eslam
- S3eval: A Synthetic, Scalable, Systematic Evaluation Suite For Large Language Models Lei Fangyu, Liu Qian, Huang Yiming, He Shizhu, Zhao Jun, Liu Kang
- Mitigating Fine-grained Hallucination By Fine-tuning Large Vision-language Models With Caption Rewrites Wang Lei, He Jiabang, Li Shenshen, Liu Ning, Lim Ee-peng
- Chatgpt: A Meta-analysis After 2.5 Months Leiter Christoph, Zhang Ran, Chen Yanran, Belouadi Jonas, Larionov Daniil, Fresen Vivian, Eger Steffen
- A Sentence Is Worth A Thousand Pictures: Can Large Language Models Understand Hum4n L4ngu4ge And The W0rld Behind W0rds? Leivada Evelina, Marcus Gary, Günther Fritz, Murphy Elliot
- Guiding LLM To Fool Itself: Automatically Manipulating Machine Reading Comprehension Shortcut Triggers Levy Mosh, Ravfogel Shauli, Goldberg Yoav
- Sequential Monte Carlo Steering Of Large Language Models Using Probabilistic Programs Lew Alexander K., Zhi-xuan Tan, Grand Gabriel, Mansinghka Vikash K.
- Deliberate Then Generate: Enhanced Prompting Framework For Text Generation Li Bei, Wang Rui, Guo Junliang, Song Kaitao, Tan Xu, Hassan Hany, Menezes Arul, Xiao Tong, Bian Jiang, Zhu Jingbo
- Learning To Rewrite Prompts For Personalized Text Generation Li Cheng, Zhang Mingyang, Mei Qiaozhu, Kong Weize, Bendersky Michael
- Teach Llms To Personalize -- An Approach Inspired By Writing Education Li Cheng, Zhang Mingyang, Mei Qiaozhu, Wang Yaqing, Hombaiah Spurthi Amba, Liang Yi, Bendersky Michael
- Mixed Distillation Helps Smaller Language Model Better Reasoning Li Chenglin, Chen Qianglong, Li Liangyue, Wang Caiyu, Li Yicheng, Chen Zulong, Zhang Yin
- Dialogue For Prompting: A Policy-gradient-based Discrete Prompt Generation For Few-shot Learning Li Chengzhengxu, Liu Xiaoming, Wang Yichen, Li Duyi, Lan Yu, Shen Chao
- Modelscope-agent: Building Your Customizable Agent System With Open-source Large Language Models Li Chenliang, Chen Hehong, Yan Ming, Shen Weizhou, Xu Haiyang, Wu Zhikai, Zhang Zhicheng, Zhou Wenmeng, Chen Yingda, Cheng Chen, Shi Hongzhu, Zhang Ji, Huang Fei, Zhou Jingren
- Llava-med: Training A Large Language-and-vision Assistant For Biomedicine In One Day Li Chunyuan, Wong Cliff, Zhang Sheng, Usuyama Naoto, Liu Haotian, Yang Jianwei, Naumann Tristan, Poon Hoifung, Gao Jianfeng
- Textbind: Multi-turn Interleaved Multimodal Instruction-following In The Wild Li Huayang, Li Siheng, Cai Deng, Wang Longyue, Liu Lemao, Watanabe Taro, Yang Yujiu, Shi Shuming
- Acecoder: Utilizing Existing Code To Enhance Code Generation Li Jia, Zhao Yunfei, Li Yongmin, Li Ge, Jin Zhi
- Mastering Robot Manipulation With Multimodal Prompts Through Pretraining And Multi-task Fine-tuning Li Jiachen, Gao Qiaozi, Johnston Michael, Gao Xiaofeng, He Xuehai, Shakiah Suhaila, Shi Hangjie, Ghanadan Reza, Wang William Yang
- Text Is All You Need: Learning Language Representations For Sequential Recommendation Li Jiacheng, Wang Ming, Li Jin, Fu Jinmiao, Shen Xin, Shang Jingbo, Mcauley Julian
- Adaptive Gating In Mixture-of-experts Based Language Models Li Jiamin, Su Qiang, Yang Yitao, Jiang Yimin, Wang Cong, Xu Hong
- CFGPT: Chinese Financial Assistant With Large Language Model Li Jiangtong, Bian Yuxuan, Wang Guoxuan, Lei Yang, Cheng Dawei, Ding Zhijun, Jiang Changjun
- Distilling Chatgpt For Explainable Automated Student Answer Assessment Li Jiazheng, Gui Lin, Zhou Yuxiang, West David, Aloisi Cesare, He Yulan
- Overprompt: Enhancing Chatgpt Through Efficient In-context Learning Li Jiazheng, Zhao Runcong, Yang Yongxin, He Yulan, Gui Lin
- Motcoder: Elevating Large Language Models With Modular Of Thought For Challenging Programming Tasks Li Jingyao, Chen Pengguang, Xia Bin, Xu Hong, Jia Jiaya
- Generative Judge For Evaluating Alignment Li Junlong, Sun Shichao, Yuan Weizhe, Fan Run-ze, Zhao Hai, Liu Pengfei
- Openba: An Open-sourced 15B Bilingual Asymmetric Seq2seq Model Pre-trained From Scratch Li Juntao, Tang Zecheng, Ding Yuyang, Wang Pinzheng, Guo Pei, You Wangjie, Qiao Dan, Chen Wenliang, Fu Guohong, Zhu Qiaoming, Zhou Guodong, Zhang Min
- Covlm: Composing Visual Entities And Relationships In Large Language Models Via Communicative Decoding Li Junyan, Chen Delin, Hong Yining, Chen Zhenfang, Chen Peihao, Shen Yikang, Gan Chuang
- Mvbench: A Comprehensive Multi-modal Video Understanding Benchmark Li Kunchang, Wang Yali, He Yinan, Li Yizhuo, Wang Yi, Liu Yi, Wang Zun, Xu Jilan, Chen Guo, Luo Ping, Wang Limin, Qiao Yu
- Tool-augmented Reward Modeling Li Lei, Chai Yekun, Wang Shuohuan, Sun Yu, Tian Hao, Zhang Ningyu, Wu Hua
- Api-bank: A Comprehensive Benchmark For Tool-augmented Llms Li Minghao, Zhao Yingxiu, Yu Bowen, Song Feifan, Li Hangyu, Yu Haiyang, Li Zhoujun, Huang Fei, Li Yongbin
- Skillgpt: A Restful API Service For Skill Extraction And Standardization Using A Large Language Model Li Nan, Kang Bo, De Bie Tijl
- Adapting Large Language Models For Education: Foundational Capabilities, Potentials, And Challenges Li Qingyao, Fu Lingyue, Zhang Weiming, Chen Xianyu, Yu Jingwei, Xia Wei, Zhang Weinan, Tang Ruiming, Yu Yong
- Starcoder: May The Source Be With You! Li Raymond, Allal Loubna Ben, Zi Yangtian, Muennighoff Niklas, Kocetkov Denis, Mou Chenghao, Marone Marc, Akiki Christopher, Li Jia, Chim Jenny, Liu Qian, Zheltonozhskii Evgenii, Zhuo Terry Yue, Wang Thomas, Dehaene Olivier, Davaadorj Mishig, Lamy-poirier Joel, Monteiro João, Shliazhko Oleh, Gontier Nicolas, Meade Nicholas, Zebaze Armel, Yee Ming-ho, Umapathi Logesh Kumar, Zhu Jian, Lipkin Benjamin, Oblokulov Muhtasham, Wang Zhiruo, Murthy Rudra, Stillerman Jason, Patel Siva Sankalp, Abulkhanov Dmitry, Zocca Marco, Dey Manan, Zhang Zhihan, Fahmy Nour, Bhattacharyya Urvashi, Yu Wenhao, Singh Swayam, Luccioni Sasha, Villegas Paulo, Kunakov Maxim, Zhdanov Fedor, Romero Manuel, Lee Tony, Timor Nadav, Ding Jennifer, Schlesinger Claire, Schoelkopf Hailey, Ebert Jan, Dao Tri, Mishra Mayank, Gu Alex, Robinson Jennifer, Anderson Carolyn Jane, Dolan-gavitt Brendan, Contractor Danish, Reddy Siva, Fried Daniel, Bahdanau Dzmitry, Jernite Yacine, Ferrandis Carlos Muñoz, Hughes Sean, Wolf Thomas, Guha Arjun, Von Werra Leandro, De Vries Harm
- Are Human-generated Demonstrations Necessary For In-context Learning? Li Rui, Wang Guoyin, Li Jiwei
- Vlm-eval: A General Evaluation On Video Large Language Models Li Shuailin, Zhang Yuang, Zhao Yucheng, Wang Qiuyue, Jia Fan, Liu Yingfei, Wang Tiancai
- Tailoring Personality Traits In Large Language Models Via Unsupervisedly-built Personalized Lexicons Li Tianlong, Dou Shihan, Lv Changze, Liu Wenhao, Xu Jianhan, Wu Muling, Ling Zixuan, Zheng Xiaoqing, Huang Xuanjing
- Benchmarking And Improving Generator-validator Consistency Of Language Models Li Xiang Lisa, Shrivastava Vaishnavi, Li Siyan, Hashimoto Tatsunori, Liang Percy
- Context Matters: Data-efficient Augmentation Of Large Language Models For Scientific Applications Li Xiang, Tang Haoran, Chen Siyu, Wang Ziwei, Maravi Anurag, Abram Marcin
- Mot: Memory-of-thought Enables Chatgpt To Self-improve Li Xiaonan, Qiu Xipeng
- Unigen: A Unified Generative Framework For Retrieval And Question Answering With Large Language Models Li Xiaoxi, Zhou Yujia, Dou Zhicheng
- Think Outside The Code: Brainstorming Boosts Large Language Models In Code Generation Li Xin-ye, Xue Jiang-tian, Xie Zheng, Li Ming
- Unlocking Temporal Question Answering For Large Language Models Using Code Execution Li Xingxuan, Cheng Liying, Tan Qingyu, Ng Hwee Tou, Joty Shafiq, Bing Lidong
- Chain-of-knowledge: Grounding Large Language Models Via Dynamic Knowledge Adapting Over Heterogeneous Sources Li Xingxuan, Zhao Ruochen, Chia Yew Ken, Ding Bosheng, Joty Shafiq, Poria Soujanya, Bing Lidong
- Unlock The Power: Competitive Distillation For Multi-modal Large Language Models Li Xinwei, Lin Li, Wang Shuai, Qian Chen
- Exploring Fine-tuning Chatgpt For News Recommendation Li Xinyi, Zhang Yongfeng, Malthouse Edward C.
- PBNR: Prompt-based News Recommender System Li Xinyi, Zhang Yongfeng, Malthouse Edward C.
- Translation-enhanced Multilingual Text-to-image Generation Li Yaoyiran, Chang Ching-yun, Rawls Stephen, Vulić Ivan, Korhonen Anna
- Turning Dust Into Gold: Distilling Complex Reasoning Capabilities From Llms By Leveraging Negative Data Li Yiwei, Yuan Peiwen, Feng Shaoxiong, Pan Boyuan, Sun Bin, Wang Xinglin, Wang Heda, Li Kan
- Loftq: Lora-fine-tuning-aware Quantization For Large Language Models Li Yixiao, Yu Yifan, Liang Chen, He Pengcheng, Karampatziakis Nikos, Chen Weizhu, Zhao Tuo
- Prompting Large Language Models For Counterfactual Generation: An Empirical Study Li Yongqi, Xu Mayi, Miao Xin, Zhou Shen, Qian Tieyun
- Curriculum-driven Edubot: A Framework For Developing Language Learning Chatbots Through Synthesizing Conversational Data Li Yu, Qu Shang, Shen Jili, Min Shangchao, Yu Zhou
- Posgen: Personalized Opening Sentence Generation For Online Insurance Sales Li Yu, Zhang Yi, Wu Weijia, Zhou Zimu, Li Qiang
- SPELL: Semantic Prompt Evolution Based On A LLM Li Yujian Betterest, Wu Kai
- Advancing Precise Outline-conditioned Text Generation With Task Duality And Explicit Outline Control Li Yunzhe, Chen Qian, Yan Weixiang, Wang Wen, Zhang Qinglin, Sundaram Hari
- Guiding Large Language Models Via Directional Stimulus Prompting Li Zekun, Peng Baolin, He Pengcheng, Galley Michel, Gao Jianfeng, Yan Xifeng
- Flexkbqa: A Flexible Llm-powered Framework For Few-shot Knowledge Base Question Answering Li Zhenyu, Fan Sunqi, Gu Yu, Li Xiuxing, Duan Zhichao, Dong Bowen, Liu Ning, Wang Jianyong
- QFT: Quantized Full-parameter Tuning Of Llms With Affordable Resources Li Zhikai, Liu Xiaoxuan, Zhu Banghua, Dong Zhen, Gu Qingyi, Keutzer Kurt
- Beyond Factuality: A Comprehensive Evaluation Of Large Language Models As Knowledge Generators Chen Liang, Deng Yang, Bian Yatao, Qin Zeyu, Wu Bingzhe, Chua Tat-seng, Wong Kam-fai
- Towards End-to-end Embodied Decision Making Via Multi-modal Large Language Model: Explorations With Gpt4-vision And Beyond Liang Chen, Yichi Zhang, Shuhuai Ren, Haozhe Zhao, Zefan Cai, Yuchi Wang, Peiyi Wang, Tianyu Liu, Baobao Chang
- Exploring Format Consistency For Instruction Tuning Liang Shihao, Tian Runchu, Zhu Kunlun, Qin Yujia, Wang Huadong, Cong Xin, Liu Zhiyuan, Liu Xiaojiang, Sun Maosong
- Encouraging Divergent Thinking In Large Language Models Through Multi-agent Debate Liang Tian, He Zhiwei, Jiao Wenxiang, Wang Xing, Wang Rui, Yang Yujiu, Tu Zhaopeng, Shi Shuming
- GPT Detectors Are Biased Against Non-native English Writers Liang Weixin, Yuksekgonul Mert, Mao Yining, Wu Eric, Zou James
- Uhgeval: Benchmarking The Hallucination Of Chinese Large Language Models Via Unconstrained Generation Liang Xun, Song Shichao, Niu Simin, Li Zhiyu, Xiong Feiyu, Tang Bo, Wang Yezhaohui, He Dawei, Cheng Peng, Wang Zhonghao, Deng Haiying
- GPT-4 Enhanced Multimodal Grounding For Autonomous Driving: Leveraging Cross-modal Attention With Large Language Models Liao Haicheng, Shen Huanming, Li Zhenning, Wang Chengyue, Li Guofa, Bie Yiming, Xu Chengzhong
- Modeling Complex Mathematical Reasoning Via Large Language Model Based Mathagent Liao Haoran, Du Qinyi, Hu Shaohua, He Hao, Xu Yanyan, Tian Jidong, Jin Yaohui
- Text-to-image Generation For Abstract Concepts Liao Jiayi, Chen Xu, Fu Qiang, Du Lun, He Xiangnan, Wang Xiang, Han Shi, Zhang Dongmei
- STEVE-1: A Generative Model For Text-to-behavior In Minecraft Lifshitz Shalev, Paster Keiran, Chan Harris, Ba Jimmy, Mcilraith Sheila
- A Survey On Large Language Models For Recommendation Wu Likang, Zheng Zhi, Qiu Zhaopeng, Wang Hao, Gu Hongchao, Shen Tingjia, Qin Chuan, Zhu Chen, Zhu Hengshu, Liu Qi, Xiong Hui, Chen Enhong
- Paralinguistics-enhanced Large Language Modeling Of Spoken Dialogue Lin Guan-ting, Shivakumar Prashanth Gurunath, Gandhe Ankur, Yang Chao-han Huck, Gu Yile, Ghosh Shalini, Stolcke Andreas, Lee Hung-yi, Bulyko Ivan
- Videodirectorgpt: Consistent Multi-scene Video Generation Via Llm-guided Planning Lin Han, Zala Abhay, Cho Jaemin, Bansal Mohit
- Beneath The Surface: Unveiling Harmful Memes With Multimodal Reasoning Distilled From Large Language Models Lin Hongzhan, Luo Ziyang, Ma Jing, Chen Long
- AWQ: Activation-aware Weight Quantization For LLM Compression And Acceleration Lin Ji, Tang Jiaming, Tang Haotian, Yang Shang, Chen Wei-ming, Wang Wei-chen, Xiao Guangxuan, Dang Xingyu, Gan Chuang, Han Song
- VILA: On Pre-training For Visual Language Models Lin Ji, Yin Hongxu, Ping Wei, Lu Yao, Molchanov Pavlo, Tao Andrew, Mao Huizi, Kautz Jan, Shoeybi Mohammad, Han Song
- Rella: Retrieval-enhanced Large Language Models For Lifelong Sequential Behavior Comprehension In Recommendation Lin Jianghao, Shan Rong, Zhu Chenxu, Du Kounianhua, Chen Bo, Quan Shigang, Tang Ruiming, Yu Yong, Zhang Weinan
- Argue With Me Tersely: Towards Sentence-level Counter-argument Generation Lin Jiayu, Ye Rong, Han Meng, Zhang Qi, Lai Ruofei, Zhang Xinyu, Cao Zhao, Huang Xuanjing, Wei Zhongyu
- Advances In Embodied Navigation Using Large Language Models: A Survey Lin Jinzhou, Gao Han, Feng Xuxiang, Xu Rongtao, Wang Changwei, Zhang Man, Guo Li, Xu Shibiao
- Pushing Large Language Models To The 6G Edge: Vision, Challenges, And Opportunities Lin Zheng, Qu Guanqiao, Chen Qiyuan, Chen Xianhao, Chen Zhe, Huang Kaibin
- Recprompt: A Prompt Tuning Framework For News Recommendation Using Large Language Models Liu Dairui, Yang Boming, Du Honghui, Greene Derek, Lawlor Aonghus, Dong Ruihai, Li Irene
- MMC: Advancing Multimodal Chart Understanding With Large-scale Instruction Tuning Liu Fuxiao, Wang Xiaoyang, Yao Wenlin, Chen Jianshu, Song Kaiqiang, Cho Sangwoo, Yacoob Yaser, Yu Dong
- Alltogether: Investigating The Efficacy Of Spliced Prompt For Web Navigation Using Large Language Models Liu Jiarun, Hu Wentao, Zhang Chunhong
- Llm-powered Hierarchical Language Agent For Real-time Human-ai Coordination Liu Jijia, Yu Chao, Gao Jiaxuan, Xie Yuqing, Liao Qingmin, Wu Yi, Wang Yu
- Think-in-memory: Recalling And Post-thinking Enable Llms With Long-term Memory Liu Lei, Yang Xiaoyan, Shen Yue, Hu Binbin, Zhang Zhiqiang, Gu Jinjie, Zhang Guannan
- Selenite: Scaffolding Online Sensemaking With Comprehensive Overviews Elicited From Large Language Models Liu Michael Xieyang, Wu Tongshuang, Chen Tianying, Li Franklin Mingzhe, Kittur Aniket, Myers Brad A.
- X-eval: Generalizable Multi-aspect Text Evaluation Via Augmented Instruction Tuning With Auxiliary Evaluation Aspects Liu Minqian, Shen Ying, Xu Zhiyang, Cao Yixin, Cho Eunah, Kumar Vaibhav, Ghanadan Reza, Huang Lifu
- Chatgpt-powered Conversational Drug Editing Using Retrieval And Domain Feedback Liu Shengchao, Wang Jiongxiao, Yang Yijin, Wang Chengpeng, Liu Ling, Guo Hongyu, Xiao Chaowei
- Language Models As Black-box Optimizers For Vision-language Models Liu Shihong, Lin Zhiqiu, Yu Samuel, Lee Ryan, Ling Tiffany, Pathak Deepak, Ramanan Deva
- MMHQA-ICL: Multimodal In-context Learning For Hybrid Question Answering Over Text, Tables And Images Liu Weihao, Lei Fangyu, Luo Tongxu, Lei Jiahe, He Shizhu, Zhao Jun, Liu Kang
- Beyond Text: Unveiling Multimodal Proficiency Of Large Language Models With Multiapi Benchmark Liu Xiao, Lin Jianfeng, Zhang Jiawei
- Fingpt: Democratizing Internet-scale Data For Financial Large Language Models Liu Xiao-yang, Wang Guoxuan, Yang Hongyang, Zha Daochen
- Scaling Laws Of Rope-based Extrapolation Liu Xiaoran, Yan Hang, Zhang Shuo, An Chenxin, Qiu Xipeng, Lin Dahua
- Mm-safetybench: A Benchmark For Safety Evaluation Of Multimodal Large Language Models Liu Xin, Zhu Yichen, Gu Jindong, Lan Yunshi, Yang Chao, Qiao Yu
- Query-utterance Attention With Joint Modeling For Query-focused Meeting Summarization Liu Xingxian, Duan Bin, Xiao Bo, Xu Yajing
- Wavjourney: Compositional Audio Creation With Large Language Models Liu Xubo, Zhu Zhongkai, Liu Haohe, Yuan Yi, Cui Meng, Huang Qiushi, Liang Jinhua, Cao Yin, Kong Qiuqiang, Plumbley Mark D., Wang Wenwu
- What Large Language Models Bring To Text-rich VQA? Liu Xuejing, Tang Wei, Ni Xinzhe, Lu Jinghui, Zhao Rui, Li Zechao, Tan Fei
- G-eval: NLG Evaluation Using GPT-4 With Better Human Alignment Liu Yang, Iter Dan, Xu Yichong, Wang Shuohang, Xu Ruochen, Zhu Chenguang
- Reason For Future, Act For Now: A Principled Framework For Autonomous LLM Agents With Provable Sample Efficiency Liu Zhihan, Hu Hao, Zhang Shenao, Guo Hongyi, Ke Shuqi, Liu Boyi, Wang Zhaoran
- Fully Autonomous Programming With Large Language Models Liventsev Vadim, Grishina Anastasiia, Härmä Aki, Moonen Leon
- SAIE Framework: Support Alone Isn't Enough -- Advancing LLM Training With Adversarial Remarks Loem Mengsay, Kaneko Masahiro, Okazaki Naoaki
- Discuss Before Moving: Visual Language Navigation Via Multi-expert Discussions Long Yuxing, Li Xiaoqi, Cai Wenzhe, Dong Hao
- SELF: Self-evolution With Language Feedback Lu Jianqiao, Zhong Wanjun, Huang Wenyong, Wang Yufei, Zhu Qi, Mi Fei, Wang Baojun, Wang Weichao, Zeng Xingshan, Shang Lifeng, Jiang Xin, Liu Qun
- Memochat: Tuning Llms To Use Memos For Consistent Long-range Open-domain Conversation Lu Junru, An Siyu, Lin Mingbao, Pergola Gabriele, He Yulan, Yin Di, Sun Xing, Wu Yunsheng
- Llama-reviewer: Advancing Code Review Automation With Large Language Models Through Parameter-efficient Fine-tuning Lu Junyi, Yu Lei, Li Xiaojia, Yang Li, Zuo Chun
- Error Analysis Prompting Enables Human-like Translation Evaluation In Large Language Models Lu Qingyu, Qiu Baopu, Ding Liang, Zhang Kanjian, Kocmi Tom, Tao Dacheng
- Safer Conversational AI As A Source Of User Delight Lu Xiaoding, Korshuk Aleksey, Liu Zongyi, Beauchamp William, Research Chai
- Llmscore: Unveiling The Power Of Large Language Models In Text-to-image Synthesis Evaluation Lu Yujie, Yang Xianjun, Li Xiujun, Wang Xin Eric, Wang William Yang
- Effectively Fine-tune To Improve Large Multimodal Models For Radiology Report Generation Lu Yuzhe, Hong Sungmin, Shah Yash, Xu Panpan
- Enhancing Black-box Few-shot Text Classification With Prompt-based Data Augmentation Luo Danqing, Zhang Chen, Xu Jiahui, Wang Bin, Chen Yiming, Zhang Yan, Li Haizhou
- SAIL: Search-augmented Instruction Learning Luo Hongyin, Chuang Yung-sung, Gong Yuan, Zhang Tianhua, Kim Yoon, Wu Xixin, Fox Danny, Meng Helen, Glass James
- Critique Ability Of Large Language Models Luo Liangchen, Lin Zi, Liu Yinxiao, Shu Lei, Zhu Yun, Shang Jingbo, Meng Lei
- Reasoning On Graphs: Faithful And Interpretable Large Language Model Reasoning Luo Linhao, Li Yuan-fang, Haffari Gholamreza, Pan Shirui
- Systematic Assessment Of Factual Knowledge In Large Language Models Luo Linhao, Vu Thuy-trang, Phung Dinh, Haffari Gholamreza
- Unlocking The Potential Of Large Language Models For Explainable Recommendations Luo Yucong, Cheng Mingyue, Zhang Hao, Lu Junyu, Liu Qi, Chen Enhong
- Augmented Large Language Models With Parametric Knowledge Guiding Luo Ziyang, Xu Can, Zhao Pu, Geng Xiubo, Tao Chongyang, Ma Jing, Lin Qingwei, Jiang Daxin
- Fingpt: Large Generative Models For A Small Language Luukkonen Risto, Komulainen Ville, Luoma Jouni, Eskelinen Anni, Kanerva Jenna, Kupari Hanna-mari, Ginter Filip, Laippala Veronika, Muennighoff Niklas, Piktus Aleksandra, Wang Thomas, Tazi Nouamane, Scao Teven Le, Wolf Thomas, Suominen Osma, Sairanen Samuli, Merioksa Mikko, Heinonen Jyrki, Vahtola Aija, Antao Samuel, Pyysalo Sampo
- Collie: Collaborative Training Of Large Language Models In An Efficient Way Lv Kai, Zhang Shuo, Gu Tianle, Xing Shuhao, Hong Jiawei, Chen Keyu, Liu Xiaoran, Yang Yuqing, Guo Honglin, Liu Tengxiao, Sun Yu, Guo Qipeng, Yan Hang, Qiu Xipeng
- Vista-llama: Reliable Video Narrator Via Equal Distance To Visual Tokens Ma Fan, Jin Xiaojie, Wang Heng, Xian Yuchen, Feng Jiashi, Yang Yi
- Untying The Reversal Curse Via Bidirectional Language Model Editing Ma Jun-yu, Gu Jia-chen, Ling Zhen-hua, Liu Quan, Liu Cong
- "oops, Did I Just Say That?" Testing And Repairing Unethical Suggestions Of Large Language Models With Suggest-critique-reflect Process Ma Pingchuan, Li Zongjie, Sun Ao, Wang Shuai
- (why) Is My Prompt Getting Worse? Rethinking Regression Testing For Evolving LLM Apis Ma Wanqin, Yang Chenyang, Kästner Christian
- Let's Do A Thought Experiment: Using Counterfactuals To Improve Moral Reasoning Ma Xiao, Mishra Swaroop, Beirami Ahmad, Beutel Alex, Chen Jilin
- Query Rewriting For Retrieval-augmented Large Language Models Ma Xinbei, Gong Yeyun, He Pengcheng, Zhao Hai, Duan Nan
- VOLTA: Improving Generative Diversity By Variational Mutual Information Maximizing Autoencoder Ma Yueen, Chi Dafeng, Li Jingjing, Song Kai, Zhuang Yuzheng, King Irwin
- Sci-cot: Leveraging Large Language Models For Enhanced Knowledge Distillation In Small Models For Scientific QA Ma Yuhan, Jiang Haiqi, Fan Chenyou
- Lampilot: An Open Benchmark Dataset For Autonomous Driving With Language Model Programs Ma Yunsheng, Cui Can, Cao Xu, Ye Wenqian, Liu Peiran, Lu Juanwu, Abdelraouf Amr, Gupta Rohit, Han Kyungtae, Bera Aniket, Rehg James M., Wang Ziran
- World-to-words: Grounded Open Vocabulary Acquisition Through Fast Mapping In Vision-language Models Ma Ziqiao, Pan Jiayi, Chai Joyce
- Video-chatgpt: Towards Detailed Video Understanding Via Large Vision And Language Models Maaz Muhammad, Rasheed Hanoona, Khan Salman, Khan Fahad Shahbaz
- Enhancing CLIP With GPT-4: Harnessing Visual Descriptions As Prompts Maniparambil Mayug, Vorster Chris, Molloy Derek, Murphy Noel, Mcguinness Kevin, O'connor Noel E.
- Large Language Models Know Your Contextual Search Intent: A Prompting Framework For Conversational Search Mao Kelong, Dou Zhicheng, Mo Fengran, Hou Jiewen, Chen Haonan, Qian Hongjin
- Unitrec: A Unified Text-to-text Transformer And Joint Contrastive Learning Framework For Text-based Recommendation Mao Zhiming, Wang Huimin, Du Yiming, Wong Kam-fai
- ICL Markup: Structuring In-context Learning Using Soft-token Tags Brunet Marc-etienne, Anderson Ashton, Zemel Richard
- When Less Is More: Investigating Data Pruning For Pretraining Llms At Scale Marion Max, Üstün Ahmet, Pozzobon Luiza, Wang Alex, Fadaee Marzieh, Hooker Sara
- A Large Language Model-assisted Education Tool To Provide Feedback On Open-ended Responses Matelsky Jordan K., Parodi Felipe, Liu Tony, Lange Richard D., Kording Konrad P.
- Cook2ltl: Translating Cooking Recipes To LTL Formulae Using Large Language Models Mavrogiannis Angelos, Mavrogiannis Christoforos, Aloimonos Yiannis
- JAB: Joint Adversarial Prompting And Belief Augmentation Mehrabi Ninareh, Goyal Palash, Ramakrishna Anil, Dhamala Jwala, Ghosh Shalini, Zemel Richard, Chang Kai-wei, Galstyan Aram, Gupta Rahul
- Simple LLM Prompting Is State-of-the-art For Robust And Multilingual Dialogue Evaluation Mendonça John, Pereira Patrícia, Moniz Helena, Carvalho João Paulo, Lavie Alon, Trancoso Isabel
- An AI Chatbot For Explaining Deep Reinforcement Learning Decisions Of Service-oriented Systems Metzger Andreas, Bartel Jone, Laufer Jan
- Developing A Scalable Benchmark For Assessing Large Language Models In Knowledge Graph Engineering Meyer Lars-peter, Frey Johannes, Junghanns Kurt, Brei Felix, Bulert Kirill, Gründer-fahrer Sabine, Martin Michael
- Think Before You Act: Unified Policy For Interleaving Language Reasoning With Actions Mezghani Lina, Bojanowski Piotr, Alahari Karteek, Sukhbaatar Sainbayar
- Towards Efficient Generative Large Language Model Serving: A Survey From Algorithms To Systems Miao Xupeng, Oliaro Gabriele, Zhang Zhihao, Cheng Xinhao, Jin Hongyi, Chen Tianqi, Jia Zhihao
- Self-checker: Plug-and-play Modules For Fact-checking With Large Language Models Li Miaoran, Peng Baolin, Galley Michel, Gao Jianfeng, Zhang Zhu
- Dialogue-based Generation Of Self-driving Simulation Scenarios Using Large Language Models Miceli-barone Antonio Valerio, Lascarides Alex, Innes Craig
- LAIT: Efficient Multi-segment Encoding In Transformers With Layer-adjustable Interaction Milbauer Jeremiah, Louis Annie, Hosseini Mohammad Javad, Fabrikant Alex, Metzler Donald, Schuster Tal
- MEDIMP: 3D Medical Images With Clinical Prompts From Limited Tabular Data For Renal Transplantation Milecki Leo, Kalogeiton Vicky, Bodard Sylvain, Anglicheau Dany, Correas Jean-michel, Timsit Marc-olivier, Vakalopoulou Maria
- Beyond Accuracy: Evaluating Self-consistency Of Code Large Language Models With Identitychain Min Marcus J., Ding Yangruibo, Buratti Luca, Pujar Saurabh, Kaiser Gail, Jana Suman, Ray Baishakhi
- An Emulator For Fine-tuning Large Language Models Using Small Language Models Mitchell Eric, Rafailov Rafael, Sharma Archit, Finn Chelsea, Manning Christopher D.
- Convgqr: Generative Query Reformulation For Conversational Search Mo Fengran, Mao Kelong, Zhu Yutao, Wu Yihong, Huang Kaiyu, Nie Jian-yun
- Roll Up Your Sleeves: Working With A Collaborative And Engaging Task-oriented Dialogue System Mo Lingbo, Chen Shijie, Chen Ziru, Deng Xiang, Lewis Ashley, Singh Sunit, Stevens Samuel, Tai Chang-you, Wang Zhen, Yue Xiang, Zhang Tianshu, Su Yu, Sun Huan
- How Trustworthy Are Open-source Llms? An Assessment Under Malicious Demonstrations Shows Their Vulnerabilities Mo Lingbo, Wang Boshi, Chen Muhao, Sun Huan
- Tree Of Uncertain Thoughts Reasoning For Large Language Models Mo Shentong, Xin Miao
- Interpreting User Requests In The Context Of Natural Language Standing Instructions Moghe Nikita, Xia Patrick, Andreas Jacob, Eisner Jason, Van Durme Benjamin, Jhamtani Harsh
- Assigning AI: Seven Approaches For Students, With Prompts Mollick Ethan, Mollick Lilach
- Llms Killed The Script Kiddie: How Agents Supported By Large Language Models Change The Landscape Of Network Threat Testing Moskal Stephen, Laney Sam, Hemberg Erik, O'reilly Una-may
- The Impact Of Artificial Intelligence On The Evolution Of Digital Education: A Comparative Study Of Openai Text Generation Tools Including Chatgpt, Bing Chat, Bard, And Ernie Motlagh Negin Yazdani, Khajavi Matin, Sharifi Abbas, Ahmadi Mohsen
- Can Llms Facilitate Interpretation Of Pre-trained Language Models? Mousi Basel, Durrani Nadir, Dalvi Fahim
- Can Llms Follow Simple Rules? Mu Norman, Chen Sarah, Wang Zifan, Chen Sizhe, Karamardian David, Aljeraisy Lulwa, Alomair Basel, Hendrycks Dan, Wagner David
- A Brief History Of Prompt: Leveraging Language Models. (through Advanced Prompting) Muktadir Golam Md
- Salespeople Vs Salesbot: Exploring The Role Of Educational Value In Conversational Recommender Systems Murakhovs'ka Lidiya, Laban Philippe, Xie Tian, Xiong Caiming, Wu Chien-sheng
- Ai-assisted Code Authoring At Scale: Fine-tuning, Deploying, And Mixed Methods Evaluation Murali Vijayaraghavan, Maddila Chandra, Ahmad Imad, Bolin Michael, Cheng Daniel, Ghorbani Negar, Fernandez Renuka, Nagappan Nachiappan, Rigby Peter C.
- Unival: Unified Model For Image, Video, Audio And Language Tasks Shukor Mustafa, Dancette Corentin, Rame Alexandre, Cord Matthieu
- Consistency Analysis Of Chatgpt Jang Myeongjun Erik, Lukasiewicz Thomas
- Large Language Model Augmented Narrative Driven Recommendations Mysore Sheshera, Mccallum Andrew, Zamani Hamed
- Self-contradictory Hallucinations Of Large Language Models: Evaluation, Detection And Mitigation Mündler Niels, He Jingxuan, Jenko Slobodan, Vechev Martin
- Testing Language Model Agents Safely In The Wild Naihin Silen, Atkinson David, Green Marc, Hamadi Merwane, Swift Craig, Schonholtz Douglas, Kalai Adam Tauman, Bau David
- DERA: Enhancing Large Language Model Completions With Dialog-enabled Resolving Agents Nair Varun, Schumacher Elliot, Tso Geoffrey, Kannan Anitha
- Using An LLM To Help With Code Understanding Nam Daye, Macvean Andrew, Hellendoorn Vincent, Vasilescu Bogdan, Myers Brad
- On Evaluating The Integration Of Reasoning And Action In LLM Agents With Database Question Answering Nan Linyong, Zhang Ellen, Zou Weijin, Zhao Yilun, Zhou Wenfei, Cohan Arman
- Cheaply Evaluating Inference Efficiency Metrics For Autoregressive Transformer Apis Narayanan Deepak, Santhanam Keshav, Henderson Peter, Bommasani Rishi, Lee Tony, Liang Percy
- A Comprehensive Overview Of Large Language Models Naveed Humza, Khan Asad Ullah, Qiu Shi, Saqib Muhammad, Anwar Saeed, Usman Muhammad, Akhtar Naveed, Barnes Nick, Mian Ajmal
- Nanot5: A Pytorch Framework For Pre-training And Fine-tuning T5-style Models With Limited Resources Nawrot Piotr
- Simplyretrieve: A Private And Lightweight Retrieval-centric Generative AI Tool Ng Youyang, Miyashita Daisuke, Hoshi Yasuto, Morioka Yasuhiro, Torii Osamu, Kodama Tomoya, Deguchi Jun
- Multi-dimensional Data Refining Strategy For Effective Fine-tuning Llms Ngoc Thanh Nguyen, Tran Quang Nhat, Tang Arthur, Nguyen Bao, Nguyen Thuy, Pham Thanh
- Enhancing Logical Reasoning In Large Language Models To Facilitate Legal Applications Nguyen Ha-thanh, Fungwacharakorn Wachara, Satoh Ken
- Large Language Models For In-context Student Modeling: Synthesizing Student's Behavior In Visual Programming Nguyen Manh Hung, Tschiatschek Sebastian, Singla Adish
- In-context Example Selection With Influences Nguyen Tai, Wong Eric
- L2ceval: Evaluating Language-to-code Generation Capabilities Of Large Language Models Ni Ansong, Yin Pengcheng, Zhao Yilun, Riddell Martin, Feng Troy, Shen Rui, Yin Stephen, Liu Ye, Yavuz Semih, Xiong Caiming, Joty Shafiq, Zhou Yingbo, Radev Dragomir, Cohan Arman
- ORES: Open-vocabulary Responsible Visual Synthesis Ni Minheng, Wu Chenfei, Wang Xiaodong, Yin Shengming, Wang Lijuan, Liu Zicheng, Duan Nan
- Code-style In-context Learning For Knowledge-based Question Answering Nie Zhijie, Zhang Richong, Wang Zhongyuan, Liu Xudong
- An Experimental Study: Assessing The Combined Framework Of Wavlm And BEST-RQ For Text-to-speech Synthesis Nielson Via, Hillis Steven
- Codegen2: Lessons For Training Llms On Programming And Natural Languages Nijkamp Erik, Hayashi Hiroaki, Xiong Caiming, Savarese Silvio, Zhou Yingbo
- Ragtruth: A Hallucination Corpus For Developing Trustworthy Retrieval-augmented Language Models Niu Cheng, Wu Yuanhao, Zhu Juno, Xu Siliang, Shum Kashun, Zhong Randy, Song Juntong, Zhang Tong
- Reflexion: Language Agents With Verbal Reinforcement Learning Noah Shinn, Federico Cassano, Edward Berman, Ashwin Gopinath, Karthik Narasimhan, Shunyu Yao
- AI Text-to-behavior: A Study In Steerability Noever David, Hyams Sam
- Multimodal Analysis Of Google Bard And Gpt-vision: Experiments In Visual Reasoning Noever David, Noever Samantha Elizabeth Miller
- Steering Language Generation: Harnessing Contrastive Expert Guidance And Negative Prompting For Coherent And Diverse Synthetic Data Generation O'neill Charles, Ting Yuan-sen, Ciuca Ioana, Miller Jack, Bui Thang
- Prodigy: A Profile-based Dialogue Generation Dataset Occhipinti Daniela, Tekiroglu Serra Sinem, Guerini Marco
- Comparative Analysis Of CHATGPT And The Evolution Of Language Models Ogundare Oluwatosin, Araya Gustavo Quiros
- Blackvip: Black-box Visual Prompting For Robust Transfer Learning Oh Changdae, Hwang Hyeji, Lee Hee-young, Lim Yongtaek, Jung Geunyoung, Jung Jiyoung, Choi Hosik, Song Kyungwoo
- Data Augmentation For Neural Machine Translation Using Generative Language Model Oh Seokjin, Lee Su Ah, Jung Woohwan
- Large Language Model (LLM) Bias Index -- LLMBI Oketunji Abiodun Finbarrs, Anas Muhammad, Saina Deepthi
- Detecting Llm-generated Text In Computing Education: A Comparative Study For Chatgpt Cases Orenstrakh Michael Sheinman, Karnalim Oscar, Suarez Carlos Anibal, Liut Michael
- Thoughtsource: A Central Hub For Large Language Model Reasoning Data Ott Simon, Hebenstreit Konstantin, Liévin Valentin, Hother Christoffer Egeberg, Moradi Milad, Mayrhauser Maximilian, Praas Robert, Winther Ole, Samwald Matthias
- Several Categories Of Large Language Models (llms): A Short Survey Pahune Saurabh, Chandrasekharan Manoj
- Evaluating And Explaining Large Language Models For Code Using Syntactic Structures Palacio David N, Velasco Alejandro, Rodriguez-cardenas Daniel, Moran Kevin, Poshyvanyk Denys
- Towards Vision-language Mechanistic Interpretability: A Causal Tracing Tool For BLIP Palit Vedant, Pandey Rohan, Arora Aryaman, Liang Paul Pu
- Kwaiagents: Generalized Information-seeking Agent System With Large Language Models Pan Haojie, Zhai Zepeng, Yuan Hao, Lv Yaojia, Fu Ruiji, Liu Ming, Wang Zhongyuan, Qin Bing
- Smoothquant+: Accurate And Efficient 4-bit Post-training Weight Quantization For LLM Pan Jiayi, Wang Chengcan, Zheng Kaifu, Li Yangguang, Wang Zhenyu, Feng Bin
- Retrieving-to-answer: Zero-shot Video Question Answering With Frozen Large Language Models Pan Junting, Lin Ziyi, Ge Yuying, Zhu Xiatian, Zhang Renrui, Wang Yi, Qiao Yu, Li Hongsheng
- Self-supervised Meta-prompt Learning With Meta-gradient Regularization For Few-shot Generalization Pan Kaihang, Li Juncheng, Song Hongye, Lin Jun, Liu Xiaozhong, Tang Siliang
- Logic-lm: Empowering Large Language Models With Symbolic Solvers For Faithful Logical Reasoning Pan Liangming, Albalak Alon, Wang Xinyi, Wang William Yang
- Chameleon: Plug-and-play Compositional Reasoning With Large Language Models Pan Lu, Baolin Peng, Hao Cheng, Michel Galley, Kai-wei Chang, Ying Nian Wu, Song-chun Zhu, Jianfeng Gao
- X-instructblip: A Framework For Aligning X-modal Instruction-aware Representations To Llms And Emergent Cross-modal Reasoning Panagopoulou Artemis, Xue Le, Yu Ning, Li Junnan, Li Dongxu, Joty Shafiq, Xu Ran, Savarese Silvio, Xiong Caiming, Niebles Juan Carlos
- Automating Customer Service Using Langchain: Building Custom Open-source GPT Chatbot For Organizations Pandya Keivalya, Holia Mehfuza
- Lifelong Robot Learning With Human Assisted Language Planners Parakh Meenal, Fong Alisha, Simeonov Anthony, Chen Tao, Gupta Abhishek, Agrawal Pulkit
- ART: Automatic Multi-step Reasoning And Tool-use For Large Language Models Paranjape Bhargavi, Lundberg Scott, Singh Sameer, Hajishirzi Hannaneh, Zettlemoyer Luke, Ribeiro Marco Tulio
- 1.5 Million Materials Narratives Generated By Chatbots Park Yang Jeong, Jerng Sung Eun, Park Jin-sung, Kwon Choah, Hsu Chia-wei, Ren Zhichu, Yoon Sungroh, Li Ju
- Evaluating In-context Learning Of Libraries For Code Generation Patel Arkil, Reddy Siva, Bahdanau Dzmitry, Dasigi Pradeep
- Gorilla: Large Language Model Connected With Massive Apis Patil Shishir G., Zhang Tianjun, Wang Xin, Gonzalez Joseph E.
- REFINER: Reasoning Feedback On Intermediate Representations Paul Debjit, Ismayilzada Mete, Peyrard Maxime, Borges Beatriz, Bosselut Antoine, West Robert, Faltings Boi
- Instructexcel: A Benchmark For Natural Language Instruction In Excel Payan Justin, Mishra Swaroop, Singh Mukul, Negreanu Carina, Poelitz Christian, Baral Chitta, Roy Subhro, Chakravarthy Rasika, Van Durme Benjamin, Nouri Elnaz
- Exploiting Novel GPT-4 Apis Pelrine Kellin, Taufeeque Mohammad, Zając Michał, Mclean Euan, Gleave Adam
- FP8-LM: Training FP8 Large Language Models Peng Houwen, Wu Kan, Wei Yixuan, Zhao Guoshuai, Yang Yuxiang, Liu Ze, Xiong Yifan, Yang Ziyue, Ni Bolin, Hu Jingcheng, Li Ruihang, Zhang Miaosen, Li Chen, Ning Jia, Wang Ruizhe, Zhang Zheng, Liu Shuguang, Chau Joe, Hu Han, Cheng Peng
- Spoken Language Intelligence Of Large Language Models For Language Learning Peng Linkai, Nuchged Baorian, Gao Yingming
- Self-driven Grounding: Large Language Model Agents With Automatical Language-aligned Skill Learning Peng Shaohui, Hu Xing, Yi Qi, Zhang Rui, Guo Jiaming, Huang Di, Tian Zikang, Chen Ruizhi, Du Zidong, Guo Qi, Chen Yunji, Li Ling
- Large Language Models Can Infer Psychological Dispositions Of Social Media Users Peters Heinrich, Matz Sandra
- Promptinfuser: How Tightly Coupling AI And UI Design Impacts Designers' Workflows Petridis Savvas, Terry Michael, Cai Carrie J.
- Constitutionmaker: Interactively Critiquing Large Language Models By Converting Feedback Into Principles Petridis Savvas, Wedin Ben, Wexler James, Donsbach Aaron, Pushkarna Mahima, Goyal Nitesh, Cai Carrie J., Terry Michael
- The Impact Of Depth On Compositional Generalization In Transformer Language Models Petty Jackson, Van Steenkiste Sjoerd, Dasgupta Ishita, Sha Fei, Garrette Dan, Linzen Tal
- Generative AI For Programming Education: Benchmarking Chatgpt, GPT-4, And Human Tutors Phung Tung, Pădurean Victor-alexandru, Cambronero José, Gulwani Sumit, Kohn Tobias, Majumdar Rupak, Singla Adish, Soares Gustavo
- Automating Human Tutor-style Programming Feedback: Leveraging GPT-4 Tutor Model For Hint Generation And GPT-3.5 Student Model For Hint Validation Phung Tung, Pădurean Victor-alexandru, Singh Anjali, Brooks Christopher, Cambronero José, Gulwani Sumit, Singla Adish, Soares Gustavo
- Evaluating Gpt-4's Vision Capabilities On Brazilian University Admission Exams Pires Ramon, Almeida Thales Sales, Abonizio Hugo, Nogueira Rodrigo
- Ai-assisted Coding: Experiments With GPT-4 Poldrack Russell A, Lu Thomas, Beguš Gašper
- Egovlpv2: Egocentric Video-language Pre-training With Fusion In The Backbone Pramanick Shraman, Song Yale, Nag Sayan, Lin Kevin Qinghong, Shah Hardik, Shou Mike Zheng, Chellappa Rama, Zhang Pengchuan
- Rephrase, Augment, Reason: Visual Grounding Of Questions For Vision-language Models Prasad Archiki, Stengel-eskin Elias, Bansal Mohit
- The Robots Are Here: Navigating The Generative AI Revolution In Computing Education Prather James, Denny Paul, Leinonen Juho, Becker Brett A., Albluwi Ibrahim, Craig Michelle, Keuning Hieke, Kiesler Natalie, Kohn Tobias, Luxton-reilly Andrew, Macneil Stephen, Peterson Andrew, Pettit Raymond, Reeves Brent N., Savelka Jaromir
- CORE-GPT: Combining Open Access Research And Large Language Models For Credible, Trustworthy Question Answering Pride David, Cancellieri Matteo, Knoth Petr
- Automatic Prompt Optimization With "Gradient Descent" And Beam Search Pryzant Reid, Iter Dan, Li Jerry, Lee Yin Tat, Zhu Chenguang, Zeng Michael
- Vidcom: Fast Video Comprehension Through Large Language Models With Multimodal Tools Qi Ji, Ji Kaixuan, Yu Jifan, Wang Duokang, Xu Bin, Hou Lei, Li Juanzi
- Fine-tuning Aligned Language Models Compromises Safety, Even When Users Do Not Intend To! Qi Xiangyu, Zeng Yi, Xie Tinghao, Chen Pin-yu, Jia Ruoxi, Mittal Prateek, Henderson Peter
- Experiential Co-learning Of Software-developing Agents Qian Chen, Dang Yufan, Li Jiahao, Liu Wei, Xie Zihao, Wang Yifei, Chen Weize, Yang Cheng, Cong Xin, Che Xiaoyin, Liu Zhiyuan, Sun Maosong
- Toolink: Linking Toolkit Creation And Using Through Chain-of-solving On Open-source Model Qian Cheng, Xiong Chenyan, Liu Zhenghao, Liu Zhiyuan
- "Merge Conflicts!" Exploring The Impacts Of External Distractors To Parametric Knowledge Graphs Qian Cheng, Zhao Xinran, Wu Sherry Tongshuang
- Taskweaver: A Code-first Agent Framework Qiao Bo, Li Liqun, Zhang Xu, He Shilin, Kang Yu, Zhang Chaoyun, Yang Fangkai, Dong Hang, Zhang Jue, Wang Lu, Ma Minghua, Zhao Pu, Qin Si, Qin Xiaoting, Du Chao, Xu Yong, Lin Qingwei, Rajmohan Saravan, Zhang Dongmei
- Gameeval: Evaluating Llms On Conversational Games Qiao Dan, Wu Chenfei, Liang Yaobo, Li Juntao, Duan Nan
- Hallucidoctor: Mitigating Hallucinatory Toxicity In Visual Instruction Data Qifan Yu, Juncheng Li, Longhui Wei, Liang Pang, Wentao Ye, Bosheng Qin, Siliang Tang, Qi Tian, Yueting Zhuang
- ONCE: Boosting Content-based Recommendation With Both Open- And Closed-source Large Language Models Qijiong Liu, Nuo Chen, Tetsuya Sakai, Xiao-ming Wu
- Enabling On-device Large Language Model Personalization With Self-supervised Data Selection And Synthesis Qin Ruiyang, Xia Jun, Jia Zhenge, Jiang Meng, Abbasi Ahmed, Zhou Peipei, Hu Jingtong, Shi Yiyu
- Toolllm: Facilitating Large Language Models To Master 16000+ Real-world Apis Qin Yujia, Liang Shihao, Ye Yining, Zhu Kunlun, Yan Lan, Lu Yaxi, Lin Yankai, Cong Xin, Tang Xiangru, Qian Bill, Zhao Sihan, Hong Lauren, Tian Runchu, Xie Ruobing, Zhou Jie, Gerstein Mark, Li Dahai, Liu Zhiyuan, Sun Maosong
- Faithful Chain-of-thought Reasoning Qing Lyu, Shreya Havaldar, Adam Stein, Li Zhang, Delip Rao, Eric Wong, Marianna Apidianaki, Chris Callison-burch
- Codegeex: A Pre-trained Model For Code Generation With Multilingual Benchmarking On Humaneval-x Qinkai Zheng, Xiao Xia, Xu Zou, Yuxiao Dong, Shan Wang, Yufei Xue, Zihan Wang, Lei Shen, Andi Wang, Yang Li, Teng Su, Zhilin Yang, Jie Tang
- Latent Jailbreak: A Benchmark For Evaluating Text Safety And Output Robustness Of Large Language Models Qiu Huachuan, Zhang Shuai, Li Anqi, He Hongliang, Lan Zhenzhong
- Controlrec: Bridging The Semantic Gap Between Language Model And Personalized Recommendation Qiu Junyan, Wang Haitao, Hong Zhaolin, Yang Yiping, Liu Qiang, Wang Xingxing
- Cmed-gpt: Prompt Tuning For Entity-aware Chinese Medical Dialogue Generation Qu Zhijie, Li Juan, Ma Zerui, Li Jianqiang
- Predicting Question-answering Performance Of Large Language Models Through Semantic Consistency Rabinovich Ella, Ackerman Samuel, Raz Orna, Farchi Eitan, Anaby-tavor Ateret
- Logic-scaffolding: Personalized Aspect-instructed Recommendation Explanation Generation Using Llms Rahdari Behnam, Ding Hao, Fan Ziwei, Ma Yifei, Chen Zhuotong, Deoras Anoop, Kveton Branislav
- Exploring The Effectiveness Of GPT Models In Test-taking: A Case Study Of The Driver's License Knowledge Test Rahimi Saba, Balch Tucker, Veloso Manuela
- Improving Generalization In Task-oriented Dialogues With Workflows And Action Plans Raimondo Stefania, Pal Christopher, Liu Xiaotian, Vazquez David, Palacios Hector
- Multimodal Multi-hop Question Answering Through A Conversation Between Tools And Efficiently Finetuned Large Language Models Rajabzadeh Hossein, Wang Suyuchen, Kwon Hyock Ju, Liu Bang
- Cache & Distil: Optimising API Calls To Large Language Models Ramírez Guillem, Lindemann Matthias, Birch Alexandra, Titov Ivan
- Can Chatgpt Assess Human Personalities? A General Evaluation Framework Rao Haocong, Leung Cyril, Miao Chunyan
- Retrieval-based Knowledge Augmented Vision Language Pre-training Rao Jiahua, Shan Zifei, Liu Longpo, Zhou Yao, Yang Yuedong
- Evaluating Llms On Document-based QA: Exact Answer Selection And Numerical Extraction Using Cogtale Dataset Rasool Zafaryab, Kurniawan Stefanus, Balugo Sherwin, Barnett Scott, Vasa Rajesh, Chesser Courtney, Hampstead Benjamin M., Belleville Sylvie, Mouzakis Kon, Bahar-fuchs Alex
- Supporting Human-ai Collaboration In Auditing Llms With Llms Rastogi Charvi, Ribeiro Marco Tulio, King Nicholas, Nori Harsha, Amershi Saleema
- Data Contamination Through The Lens Of Time Roberts Manley, Thakur Himanshu, Herlihy Christine, White Colin, Dooley Samuel
- Axiomatic Preference Modeling For Longform Question Answering Rosset Corby, Zheng Guoqing, Dibia Victor, Awadallah Ahmed, Bennett Paul
- LLM As An Art Director (LaDi): Using Llms To Improve Text-to-media Generators Roush Allen, Zakirov Emil, Shirokov Artemiy, Lunina Polina, Gane Jack, Duffy Alexander, Basil Charlie, Whitcomb Aber, Benedetto Jim, Dewolfe Chris
- Knowledge-infused Self Attention Transformers Roy Kaushik, Zi Yuxin, Narayanan Vignesh, Gaur Manas, Sheth Amit
- From Chatbots To Phishbots? -- Preventing Phishing Scams Created Using Chatgpt, Google Bard And Claude Roy Sayak Saha, Thota Poojitha, Naragam Krishna Vamsi, Nilizadeh Shirin
- Hallucination-minimized Data-to-answer Framework For Financial Decision-makers Roychowdhury Sohini, Alvarez Andres, Moore Brian, Krema Marko, Gelpi Maria Paz, Rodriguez Federico Martin, Rodriguez Angel, Cabrejas Jose Ramon, Serrano Pablo Martinez, Agrawal Punit, Mukherjee Arijit
- TPTU: Large Language Model-based AI Agents For Task Planning And Tool Usage Ruan Jingqing, Chen Yihong, Zhang Bin, Xu Zhiwei, Bao Tianpeng, Du Guoqing, Shi Shiwei, Mao Hangyu, Li Ziyue, Zeng Xingyu, Zhao Rui
- Identifying The Risks Of LM Agents With An Lm-emulated Sandbox Ruan Yangjun, Dong Honghua, Wang Andrew, Pitis Silviu, Zhou Yongchao, Ba Jimmy, Dubois Yann, Maddison Chris J., Hashimoto Tatsunori
- Gpt4tools: Teaching Large Language Model To Use Tools Via Self-instruction Rui Yang, Lin Song, Yanwei Li, Sijie Zhao, Yixiao Ge, Xiu Li, Ying Shan
- Performance Of Chatgpt-3.5 And GPT-4 On The United States Medical Licensing Examination With And Without Distractions Safrai Myriam, Azaria Amos
- A Symmetric Dual Encoding Dense Retrieval Framework For Knowledge-intensive Visual Question Answering Salemi Alireza, Pizzorno Juan Altmayer, Zamani Hamed
- Zero-shot Translation Of Attention Patterns In VQA Models To Natural Language Salewski Leonard, Koepke A. Sophia, Lensch Hendrik P. A., Akata Zeynep
- Uni-nlx: Unifying Textual Explanations For Vision And Vision-language Tasks Sammani Fawaz, Deligiannis Nikos
- What Matters In The Structured Pruning Of Generative Language Models? Santacroce Michael, Wen Zixin, Shen Yelong, Li Yuanzhi
- Camoscio: An Italian Instruction-tuned Llama Santilli Andrea, Rodolà Emanuele
- Frugal Prompting For Dialog Models Santra Bishal, Basak Sakya, De Abhinandan, Gupta Manish, Goyal Pawan
- Inseq: An Interpretability Toolkit For Sequence Generation Models Sarti Gabriele, Feldhus Nils, Sickert Ludwig, Van Der Wal Oskar, Nissim Malvina, Bisazza Arianna
- Toolformer: Language Models Can Teach Themselves To Use Tools Schick Timo, Dwivedi-yu Jane, Dessì Roberto, Raileanu Roberta, Lomeli Maria, Zettlemoyer Luke, Cancedda Nicola, Scialom Thomas
- Cross-lingual Supervision Improves Large Language Models Pre-training Schioppa Andrea, Garcia Xavier, Firat Orhan
- PULSAR At Mediqa-sum 2023: Large Language Models Augmented By Synthetic Dialogue Convert Patient Dialogues To Medical Records Schlegel Viktor, Li Hao, Wu Yuping, Subramanian Anand, Nguyen Thanh-tung, Kashyap Abhinav Ramesh, Beck Daniel, Zeng Xiaojun, Batista-navarro Riza Theresa, Winkler Stefan, Nenadic Goran
- A Review Of Repository Level Prompting For Llms Schonholtz Douglas
- An Empirical Evaluation Of Using Large Language Models For Automated Unit Test Generation Schäfer Max, Nadi Sarah, Eghbali Aryaz, Tip Frank
- Quantifying Language Models' Sensitivity To Spurious Features In Prompt Design Or: How I Learned To Start Worrying About Prompt Formatting Sclar Melanie, Choi Yejin, Tsvetkov Yulia, Suhr Alane
- Personality Traits In Large Language Models Serapio-garcía Greg, Safdari Mustafa, Crepy Clément, Sun Luning, Fitz Stephen, Romero Peter, Abdulhai Marwa, Faust Aleksandra, Matarić Maja
- Cognitive Effects In Large Language Models Shaki Jonathan, Kraus Sarit, Wooldridge Michael
- Evaluating Large Language Model Creativity From A Literary Perspective Shanahan Murray, Clarke Catherine
- Omniquant: Omnidirectionally Calibrated Quantization For Large Language Models Shao Wenqi, Chen Mengzhao, Zhang Zhaoyang, Xu Peng, Zhao Lirui, Li Zhiqian, Zhang Kaipeng, Gao Peng, Qiao Yu, Luo Ping
- Tinylvlm-ehub: Towards Comprehensive And Efficient Evaluation For Large Vision-language Models Shao Wenqi, Lei Meng, Hu Yutao, Gao Peng, Zhang Kaipeng, Meng Fanqing, Xu Peng, Huang Siyuan, Li Hongsheng, Qiao Yu, Luo Ping
- Character-llm: A Trainable Agent For Role-playing Shao Yunfan, Li Linyang, Dai Junqi, Qiu Xipeng
- Human Choice Prediction In Language-based Persuasion Games: Simulation-based Off-policy Evaluation Shapira Eilam, Apel Reut, Tennenholtz Moshe, Reichart Roi
- Learning Non-linguistic Skills Without Sacrificing Linguistic Proficiency Sharma Mandar, Muralidhar Nikhil, Ramakrishnan Naren
- Performance Of Chatgpt On USMLE: Unlocking The Potential Of Large Language Models For Ai-assisted Medical Education Sharma Prabin, Thapa Kisan, Thapa Dikshya, Dhakal Prastab, Upadhaya Mala Deep, Adhikari Santosh, Khanal Salik Ram
- Pangu-coder2: Boosting Large Language Models For Code With Ranking Feedback Shen Bo, Zhang Jiaxin, Chen Taihong, Zan Daoguang, Geng Bing, Fu An, Zeng Muhan, Yu Ailun, Ji Jichuan, Zhao Jingyang, Guo Yuenan, Wang Qianxiang
- Mixture-of-experts Meets Instruction Tuning: A Winning Combination For Large Language Models Shen Sheng, Hou Le, Zhou Yanqi, Du Nan, Longpre Shayne, Wei Jason, Chung Hyung Won, Zoph Barret, Fedus William, Chen Xinyun, Vu Tu, Wu Yuexin, Chen Wuyang, Webson Albert, Li Yunxuan, Zhao Vincent, Yu Hongkun, Keutzer Kurt, Darrell Trevor, Zhou Denny
- "Do Anything Now": Characterizing And Evaluating In-the-wild Jailbreak Prompts On Large Language Models Shen Xinyue, Chen Zeyuan, Backes Michael, Shen Yun, Zhang Yang
- Taskbench: Benchmarking Large Language Models For Task Automation Shen Yongliang, Song Kaitao, Tan Xu, Zhang Wenqi, Ren Kan, Yuan Siyu, Lu Weiming, Li Dongsheng, Zhuang Yueting
- Towards More Unified In-context Visual Understanding Sheng Dianmo, Chen Dongdong, Tan Zhentao, Liu Qiankun, Chu Qi, Bao Jianmin, Gong Tao, Liu Bin, Xu Shengwei, Yu Nenghai
- Can Language Agents Be Alternatives To PPO? A Preliminary Empirical Study On Openai Gym Sheng Junjie, Huang Zixiao, Shen Chuyun, Li Wenhao, Hua Yun, Jin Bo, Zha Hongyuan, Wang Xiangfeng
- Crossget: Cross-guided Ensemble Of Tokens For Accelerating Vision-language Transformers Shi Dachuan, Tao Chaofan, Rao Anyi, Yang Zhendong, Yuan Chun, Wang Jiaqi
- Prompt Space Optimizing Few-shot Reasoning Success With Large Language Models Shi Fobo, Qing Peijun, Yang Dong, Wang Nan, Lei Youbo, Lu Haonan, Lin Xiaodong, Li Duantengchuan
- QADYNAMICS: Training Dynamics-driven Synthetic QA Diagnostic For Zero-shot Commonsense Question Answering Shi Haochen, Wang Weiqi, Fang Tianqing, Xu Baixuan, Ding Wenxuan, Liu Xin, Song Yangqiu
- Unleashing The Power Of Pre-trained Language Models For Offline Reinforcement Learning Shi Ruizhe, Liu Yuyao, Ze Yanjie, Du Simon S., Xu Huazhe
- Preliminary Study On Incremental Learning For Large Language Model-based Recommender Systems Shi Tianhao, Zhang Yang, Xu Zhijian, Chen Chong, Feng Fuli, He Xiangnan, Tian Qi
- RADE: Reference-assisted Dialogue Evaluation For Open-domain Dialogue Shi Zhengliang, Sun Weiwei, Zhang Shuo, Zhang Zhen, Ren Pengjie, Ren Zhaochun
- Reasoning With Language Model Is Planning With World Model Shibo Hao, Yi Gu, Haodi Ma, Joshua Jiahua Hong, Zhen Wang, Daisy Zhe Wang, Zhiting Hu
- Large Language Models Can Enhance Persuasion Through Linguistic Feature Alignment Shin Minkyu, Kim Jin
- L3 Ensembles: Lifelong Learning Approach For Ensemble Of Foundational Language Models Shiri Aidin, Roy Kaushik, Sheth Amit, Gaur Manas
- SCREWS: A Modular Framework For Reasoning With Revisions Shridhar Kumar, Jhamtani Harsh, Fang Hao, Van Durme Benjamin, Eisner Jason, Xia Patrick
- Woodpecker: Hallucination Correction For Multimodal Large Language Models Shukang Yin, Chaoyou Fu, Sirui Zhao, Tong Xu, Hao Wang, Dianbo Sui, Yunhang Shen, Ke Li, Xing Sun, Enhong Chen
- Tree Of Thoughts: Deliberate Problem Solving With Large Language Models Shunyu Yao, Dian Yu, Jeffrey Zhao, Izhak Shafran, Thomas L. Griffiths, Yuan Cao, Karthik Narasimhan
- Getting More Out Of Mixture Of Language Model Reasoning Experts Si Chenglei, Shi Weijia, Zhao Chen, Zettlemoyer Luke, Boyd-graber Jordan
- Combo Of Thinking And Observing For Outside-knowledge VQA Si Qingyi, Mo Yuchen, Lin Zheng, Ji Huishan, Wang Weiping
- Mondrian: Prompt Abstraction Attack Against Large Language Models For Cheaper API Pricing Si Wai Man, Backes Michael, Zhang Yang
- Mitigating Object Hallucinations In Large Vision-language Models Through Visual Contrastive Decoding Sicong Leng, Hang Zhang, Guanzheng Chen, Xin Li, Shijian Lu, Chunyan Miao, Lidong Bing
- Mind Meets Machine: Unravelling Gpt-4's Cognitive Psychology Sifatkaur Dhingra, Manmeet Singh, Vaisakh Sb, Neetiraj Malviya, Sukhpal Singh Gill
- Explaining Black Box Text Modules In Natural Language With Language Models Singh Chandan, Hsu Aliyah R., Antonello Richard, Jain Shailee, Huth Alexander G., Yu Bin, Gao Jianfeng
- Gopro: Generate And Optimize Prompts In CLIP Using Self-supervised Learning Singha Mainak, Jha Ankit, Banerjee Biplab
- Towards Expert-level Medical Question Answering With Large Language Models Singhal Karan, Tu Tao, Gottweis Juraj, Sayres Rory, Wulczyn Ellery, Hou Le, Clark Kevin, Pfohl Stephen, Cole-lewis Heather, Neal Darlene, Schaekermann Mike, Wang Amy, Amin Mohamed, Lachgar Sami, Mansfield Philip, Prakash Sushant, Green Bradley, Dominowska Ewa, Arcas Blaise Aguera Y, Tomasev Nenad, Liu Yun, Wong Renee, Semturs Christopher, Mahdavi S. Sara, Barral Joelle, Webster Dale, Corrado Greg S., Matias Yossi, Azizi Shekoofeh, Karthikesalingam Alan, Natarajan Vivek
- LLM Guided Inductive Inference For Solving Compositional Problems Sodani Abhigya, Moos Lauren, Mirman Matthew
- Step: Stacked LLM Policies For Web Actions Sodhi Paloma, Branavan S. R. K., Artzi Yoav, Mcdonald Ryan
- Biomedical Knowledge Graph-optimized Prompt Generation For Large Language Models Soman Karthik, Rose Peter W, Morris John H, Akbas Rabia E, Smith Brett, Peetoom Braian, Villouta-reyes Catalina, Cerono Gabriel, Shi Yongmei, Rizk-jackson Angela, Israni Sharat, Nelson Charlotte A, Huang Sui, Baranzini Sergio E
- How To Bridge The Gap Between Modalities: A Comprehensive Survey On Multimodal Large Language Model Song Shezheng, Li Xiaopeng, Li Shasha, Zhao Shan, Yu Jie, Ma Jun, Mao Xiaoguang, Zhang Weimin
- Reward Collapse In Aligning Large Language Models Song Ziang, Cai Tianle, Lee Jason D., Su Weijie J.
- Recursion Of Thought: A Divide-and-conquer Approach To Multi-context Reasoning With Language Models Soochan Lee, Gunhee Kim
- Commands As AI Conversations Spinellis Diomidis
- Probing Llms For Joint Encoding Of Linguistic Categories Starace Giulio, Papakostas Konstantinos, Choenni Rochelle, Panagiotopoulos Apostolos, Rosati Matteo, Leidinger Alina, Shutova Ekaterina
- Pythia: A Suite For Analyzing Large Language Models Across Training And Scaling Stella Biderman, Hailey Schoelkopf, Quentin Anthony, Herbie Bradley, Kyle O'brien, Eric Hallahan, Mohammad Aflah Khan, Shivanshu Purohit, Usvsn Sai Prashanth, Edward Raff, Aviya Skowron, Lintang Sutawika, Oskar Van Der Wal
- A Mechanistic Interpretation Of Arithmetic Reasoning In Language Models Using Causal Mediation Analysis Stolfo Alessandro, Belinkov Yonatan, Sachan Mrinmaya
- Finetuning An LLM On Contextual Knowledge Of Classics For Q&A Strachan Shane Storm
- Distilled GPT For Source Code Summarization Su Chia-yi, Mcmillan Collin
- RCMHA: Relative Convolutional Multi-head Attention For Natural Language Modelling Sugiharto Herman, Aradea, Mubarok Husni
- Luminate: Structured Generation And Exploration Of Design Space With Large Language Models For Human-ai Co-creation Suh Sangho, Chen Meng, Min Bryan, Li Toby Jia-jun, Xia Haijun
- Cognitive Architectures For Language Agents Sumers Theodore R., Yao Shunyu, Narasimhan Karthik, Griffiths Thomas L.
- Does Fine-tuning GPT-3 With The Openai API Leak Personally-identifiable Information? Sun Albert Yu, Zemour Eliott, Saxena Arushi, Vaidyanathan Udith, Lin Eric, Lau Christian, Mugunthan Vaikkunth
- Create Your World: Lifelong Text-to-image Diffusion Sun Gan, Liang Wenqi, Dong Jiahua, Li Jun, Ding Zhengming, Cong Yang
- Fine-grained Audio-visual Joint Representations For Multimodal Large Language Models Sun Guangzhi, Yu Wenyi, Tang Changli, Chen Xianzhao, Tan Tian, Li Wei, Lu Lu, Ma Zejun, Zhang Chao
- Towards Verifiable Text Generation With Evolving Memory And Self-reflection Sun Hao, Cai Hengyi, Wang Bo, Hou Yingyan, Wei Xiaochi, Wang Shuaiqiang, Zhang Yan, Yin Dawei
- Autohint: Automatic Prompt Optimization With Hint Generation Sun Hong, Li Xue, Xu Yinchuan, Homma Youkow, Cao Qi, Wu Min, Jiao Jian, Charles Denis
- Think-on-graph: Deep And Responsible Reasoning Of Large Language Model On Knowledge Graph Sun Jiashuo, Xu Chengjin, Tang Lumingyuan, Wang Saizhuo, Lin Chen, Gong Yeyun, Ni Lionel M., Shum Heung-yeung, Guo Jian
- PEARL: Prompting Large Language Models To Plan And Execute Actions Over Long Documents Sun Simeng, Liu Yang, Wang Shuohang, Zhu Chenguang, Iyyer Mohit
- Expnote: Black-box Large Language Models Are Better Task Solvers With Experience Notebook Sun Wangtao, Yu Xuanqing, He Shizhu, Zhao Jun, Liu Kang
- A Prompt Learning Framework For Source Code Summarization Sun Weisong, Fang Chunrong, You Yudu, Chen Yuchen, Liu Yi, Wang Chong, Zhang Jian, Zhang Quanjun, Qian Hanwei, Zhao Wei, Liu Yang, Chen Zhenyu
- Multimodal Question Answering For Unified Information Extraction Sun Yuxuan, Zhang Kai, Su Yu
- Approximating Online Human Evaluation Of Social Chatbots With Prompting Svikhnushina Ekaterina, Pu Pearl
- Can Language Models Teach Weaker Agents? Teacher Explanations Improve Students Via Personalization Swarnadeep Saha, Peter Hase, Mohit Bansal
- Assessing The Ability Of Chatgpt To Screen Articles For Systematic Reviews Syriani Eugene, David Istvan, Kumar Gauransh
- Redcoast: A Lightweight Tool To Automate Distributed Training Of Llms On Any Gpu/tpus Tan Bowen, Zhu Yun, Liu Lijuan, Wang Hongyi, Zhuang Yonghao, Chen Jindong, Xing Eric, Hu Zhiting
- Compound Text-guided Prompt Tuning Via Image-adaptive Cues Tan Hao, Li Jun, Zhou Yizhuang, Wan Jun, Lei Zhen, Zhang Xiangyu
- Can Chatgpt Replace Traditional KBQA Models? An In-depth Analysis Of The Question Answering Performance Of The GPT LLM Family Tan Yiming, Min Dehai, Li Yu, Li Wenbo, Hu Nan, Chen Yongrui, Qi Guilin
- Sparsity-guided Holistic Explanation For Llms With Interpretable Inference-time Intervention Tan Zhen, Chen Tianlong, Zhang Zhenyu, Liu Huan
- Toolalpaca: Generalized Tool Learning For Language Models With 3000 Simulated Cases Tang Qiaoyu, Deng Ziliang, Lin Hongyu, Han Xianpei, Liang Qiao, Cao Boxi, Sun Le
- Medagents: Large Language Models As Collaborators For Zero-shot Medical Reasoning Tang Xiangru, Zou Anni, Zhang Zhuosheng, Li Ziming, Zhao Yilun, Zhang Xingyao, Cohan Arman, Gerstein Mark
- RSVP: Customer Intent Detection Via Agent Response Contrastive And Generative Pre-training Tang Yu-chien, Wang Wei-yao, Yen An-zi, Peng Wen-chih
- POUF: Prompt-oriented Unsupervised Fine-tuning For Large Pre-trained Models Tanwisuth Korawat, Zhang Shujian, Zheng Huangjie, He Pengcheng, Zhou Mingyuan
- Rolecraft-glm: Advancing Personalized Role-playing In Large Language Models Tao Meiling, Liang Xuechen, Shi Tianyu, Yu Lei, Xie Yiting
- Adversarial Conversational Shaping For Intelligent Agents Tarasiewicz Piotr, Kenjeyev Sultan, Sebag Ilana, Alshehabi Shehab
- Caption Anything: Interactive Image Description With Diverse Multimodal Controls Teng Wang, Jinrui Zhang, Junjie Fei, Hao Zheng, Yunlong Tang, Zhe Li, Mingqi Gao, Shanshan Zhao
- DUMA: A Dual-mind Conversational Agent With Fast And Slow Thinking Tian Xiaoyu, Chen Liangyu, Liu Na, Liu Yaxuan, Zou Wei, Chen Kaijiang, Cui Ming
- Evil Geniuses: Delving Into The Safety Of Llm-based Agents Tian Yu, Yang Xiao, Zhang Jingyuan, Dong Yinpeng, Su Hang
- Macgyver: Are Large Language Models Creative Problem Solvers? Tian Yufei, Ravichander Abhilasha, Qin Lianhui, Bras Ronan Le, Marjieh Raja, Peng Nanyun, Choi Yejin, Griffiths Thomas L., Brahman Faeze
- BOOST: Harnessing Black-box Control To Boost Commonsense In Lms' Generation Tian Yufei, Zhang Felix, Peng Nanyun
- Post Turing: Mapping The Landscape Of LLM Evaluation Tikhonov Alexey, Yamshchikov Ivan P.
- Do Llms Exhibit Human-like Response Biases? A Case Study In Survey Design Tjuatja Lindia, Chen Valerie, Wu Sherry Tongshuang, Talwalkar Ameet, Neubig Graham
- Better Language Models Of Code Through Self-improvement To Hung Quoc, Bui Nghi D. Q., Guo Jin, Nguyen Tien N.
- Eliminating Reasoning Via Inferring With Planning: A New Framework To Guide Llms' Non-linear Thinking Tong Yongqi, Wang Yifan, Li Dawei, Wang Sizhe, Lin Zi, Han Simeng, Shang Jingbo
- Llmseceval: A Dataset Of Natural Language Prompts For Security Evaluations Tony Catherine, Mutas Markus, Ferreyra Nicolás E. Díaz, Scandariato Riccardo
- Automl-gpt: Large Language Model For Automl Tsai Yun-da, Tsai Yu-che, Huang Bo-wei, Yang Chun-pai, Lin Shou-de
- Why Can Large Language Models Generate Correct Chain-of-thoughts? Tutunov Rasul, Grosnit Antoine, Ziomek Juliusz, Wang Jun, Bou-ammar Haitham
- Can Large Language Models Really Improve By Self-critiquing Their Own Plans? Valmeekam Karthik, Marquez Matthew, Kambhampati Subbarao
- The LLM Surgeon Van Der Ouderaa Tycho F. A., Nagel Markus, Van Baalen Mart, Asano Yuki M., Blankevoort Tijmen
- Large Language Models: The Need For Nuance In Current Debates And A Pragmatic Perspective On Understanding Van Dijk Bram M. A., Kouwenhoven Tom, Spruit Marco R., Van Duijn Max J.
- Detecting And Correcting Hate Speech In Multimodal Memes With Large Visual Language Model Van Minh-hao, Wu Xintao
- Language Resources For Dutch Large Language Modelling Vanroy Bram
- Adventures Of Trustworthy Vision-language Models: A Survey Vatsa Mayank, Jain Anubhooti, Singh Richa
- Rethinking The Evaluating Framework For Natural Language Understanding In AI Systems: Language Acquisition As A Core For Future Metrics Vera Patricio, Moya Pedro, Barraza Lisa
- Emulating Human Cognitive Processes For Expert-level Medical Question-answering With Large Language Models Verma Khushboo, Moore Marina, Wottrich Stephanie, López Karla Robles, Aggarwal Nishant, Bhatt Zeel, Singh Aagamjit, Unroe Bradford, Basheer Salah, Sachdeva Nitish, Arora Prinka, Kaur Harmanjeet, Kaur Tanupreet, Hood Tevon, Marquez Anahi, Varshney Tushar, Deng Nanfu, Ramani Azaan, Ishwara Pawanraj, Saeed Maimoona, Peña Tatiana López Velarde, Barksdale Bryan, Guha Sushovan, Kumar Satwant
- Simplesafetytests: A Test Suite For Identifying Critical Safety Risks In Large Language Models Vidgen Bertie, Scherrer Nino, Kirk Hannah Rose, Qian Rebecca, Kannappan Anand, Hale Scott A., Röttger Paul
- Mtcue: Learning Zero-shot Control Of Extra-textual Attributes By Leveraging Unstructured Context In Neural Machine Translation Vincent Sebastian, Flynn Robert, Scarton Carolina
- Fairpy: A Toolkit For Evaluation Of Social Biases And Their Mitigation In Large Language Models Viswanath Hrishikesh, Zhang Tianyi
- Prompt2model: Generating Deployable Models From Natural Language Instructions Viswanathan Vijay, Zhao Chenyang, Bertsch Amanda, Wu Tongshuang, Neubig Graham
- Are Personalized Stochastic Parrots More Dangerous? Evaluating Persona Biases In Dialogue Systems Wan Yixin, Zhao Jieyu, Chadha Aman, Peng Nanyun, Chang Kai-wei
- Reformulating Domain Adaptation Of Large Language Models As Adapt-retrieve-revise: A Case Study On Chinese Legal Domain Wan Zhen, Zhang Yating, Wang Yexiang, Cheng Fei, Kurohashi Sadao
- Efficient Large Language Models: A Survey Wan Zhongwei, Wang Xin, Liu Che, Alam Samiul, Zheng Yu, Liu Jiachen, Qu Zhongnan, Yan Shen, Zhu Yi, Zhang Quanlu, Chowdhury Mosharaf, Zhang Mi
- Shall We Pretrain Autoregressive Language Models With Retrieval? A Comprehensive Study Wang Boxin, Ping Wei, Xu Peng, Mcafee Lawrence, Liu Zihan, Shoeybi Mohammad, Dong Yi, Kuchaiev Oleksii, Li Bo, Xiao Chaowei, Anandkumar Anima, Catanzaro Bryan
- Keqing: Knowledge-based Question Answering Is A Nature Chain-of-thought Mentor Of LLM Wang Chaojie, Xu Yishi, Peng Zhong, Zhang Chenxi, Chen Bo, Wang Xinrun, Feng Lei, An Bo
- Cost-effective Hyperparameter Optimization For Large Language Model Generation Inference Wang Chi, Liu Susan Xueqing, Awadallah Ahmed H.
- BERT4CTR: An Efficient Framework To Combine Pre-trained Language Model With Non-textual Features For CTR Prediction Wang Dong, Salamatian Kavé, Xia Yunqing, Deng Weiwei, Zhiang Qi
- Openchat: Advancing Open-source Language Models With Mixed-quality Data Wang Guan, Cheng Sijie, Zhan Xianyuan, Li Xiangang, Song Sen, Liu Yang
- Knowledgeable In-context Tuning: Exploring And Exploiting Factual Knowledge For In-context Learning Wang Jianing, Wang Chengyu, Tan Chuanqi, Huang Jun, Gao Ming
- Missrec: Pre-training And Transferring Multi-modal Interest-aware Sequence Representation For Recommendation Wang Jinpeng, Zeng Ziyun, Wang Yunxiao, Wang Yuting, Lu Xingyu, Li Tianxiang, Yuan Jun, Zhang Rui, Zheng Hai-tao, Xia Shu-tao
- Self-prompted Chain-of-thought On Large Language Models For Open-domain Multi-hop Reasoning Wang Jinyuan, Li Junlong, Zhao Hai
- Notechat: A Dataset Of Synthetic Doctor-patient Conversations Conditioned On Clinical Notes Wang Junda, Yao Zonghai, Yang Zhichao, Zhou Huixue, Li Rumeng, Wang Xun, Xu Yucheng, Yu Hong
- Chated: A Chatbot Leveraging Chatgpt For An Enhanced Learning Experience In Higher Education Wang Kevin, Ramos Jason, Lawrence Ramon
- Sibyl: Sensible Empathetic Dialogue Generation With Visionary Commonsense Knowledge Wang Lanrui, Li Jiangnan, Yang Chenxu, Lin Zheng, Tang Hongyin, Liu Huan, Huang Xiaolei, Cao Yanan, Wang Jingang, Wang Weiping
- Scimon: Scientific Inspiration Machines Optimized For Novelty Wang Qingyun, Downey Doug, Ji Heng, Hope Tom
- Bridging The Novice-expert Gap Via Models Of Decision-making: A Case Study On Remediating Math Mistakes Wang Rose E., Zhang Qingyang, Robinson Carly, Loeb Susanna, Demszky Dorottya
- Compositional Text-to-image Synthesis With Attention Map Control Of Diffusion Models Wang Ruichen, Chen Zekang, Chen Chen, Ma Jian, Lu Haonan, Lin Xiaodong
- A Principled Framework For Knowledge-enhanced Large Language Model Wang Saizhuo, Liu Zhihan, Wang Zhaoran, Guo Jian
- Vamos: Versatile Action Models For Video Understanding Wang Shijie, Zhao Qi, Do Minh Quan, Agarwal Nakul, Lee Kwonjoon, Sun Chen
- CAR: Conceptualization-augmented Reasoner For Zero-shot Commonsense Question Answering Wang Weiqi, Fang Tianqing, Ding Wenxuan, Xu Baixuan, Liu Xin, Song Yangqiu, Bosselut Antoine
- All Languages Matter: On The Multilingual Safety Of Large Language Models Wang Wenxuan, Tu Zhaopeng, Chen Chang, Yuan Youliang, Huang Jen-tse, Jiao Wenxiang, Lyu Michael R.
- Instructuie: Multi-task Instruction Tuning For Unified Information Extraction Wang Xiao, Zhou Weikang, Zu Can, Xia Han, Chen Tianze, Zhang Yuansen, Zheng Rui, Ye Junjie, Zhang Qi, Gui Tao, Kang Jihua, Yang Jingsheng, Li Siyuan, Du Chunsai
- MINT: Evaluating Llms In Multi-turn Interaction With Tools And Language Feedback Wang Xingyao, Wang Zihan, Liu Jiateng, Chen Yangyi, Yuan Lifan, Peng Hao, Ji Heng
- Promptagent: Strategic Planning With Language Models Enables Expert-level Prompt Optimization Wang Xinyuan, Li Chenxi, Wang Zhen, Bai Fan, Luo Haotian, Zhang Jiayou, Jojic Nebojsa, Xing Eric P., Hu Zhiting
- User-aware Prefix-tuning Is A Good Learner For Personalized Image Captioning Wang Xuan, Wang Guanhong, Chai Wenhao, Zhou Jiayu, Wang Gaoang
- Automated Evaluation Of Personalized Text Generation Using Large Language Models Wang Yaqing, Jiang Jiepu, Zhang Mingyang, Li Cheng, Liang Yi, Mei Qiaozhu, Bendersky Michael
- NEWTON: Are Large Language Models Capable Of Physical Reasoning? Wang Yi Ru, Duan Jiafei, Fox Dieter, Srinivasa Siddhartha
- Pandalm: An Automatic Evaluation Benchmark For LLM Instruction Tuning Optimization Wang Yidong, Yu Zhuohao, Zeng Zhengran, Yang Linyi, Wang Cunxiang, Chen Hao, Jiang Chaoya, Xie Rui, Wang Jindong, Xie Xing, Ye Wei, Zhang Shikun, Zhang Yue
- Lifelongmemory: Leveraging Llms For Answering Queries In Long-form Egocentric Videos Wang Ying, Yang Yanlai, Ren Mengye
- Fake Alignment: Are Llms Really Aligned Well? Wang Yixu, Teng Yan, Huang Kexin, Lyu Chengqi, Zhang Songyang, Zhang Wenwei, Ma Xingjun, Jiang Yu-gang, Qiao Yu, Wang Yingchun
- How Far Can Camels Go? Exploring The State Of Instruction Tuning On Open Resources Wang Yizhong, Ivison Hamish, Dasigi Pradeep, Hessel Jack, Khot Tushar, Chandu Khyathi Raghavi, Wadden David, Macmillan Kelsey, Smith Noah A., Beltagy Iz, Hajishirzi Hannaneh
- Vaquita: Enhancing Alignment In Llm-assisted Video Understanding Wang Yizhou, Zhang Ruiyi, Wang Haoliang, Bhattacharya Uttaran, Fu Yun, Wu Gang
- DRDT: Dynamic Reflection With Divergent Thinking For Llm-based Sequential Recommendation Wang Yu, Liu Zhiwei, Zhang Jianguo, Yao Weiran, Heinecke Shelby, Yu Philip S.
- Augmenting Black-box Llms With Medical Textbooks For Clinical Question Answering Wang Yubo, Ma Xueguang, Chen Wenhu
- Smarttrim: Adaptive Tokens And Attention Pruning For Efficient Vision-language Models Wang Zekun, Chen Jingchang, Zhou Wangchunshu, Zhu Haichao, Liang Jiafeng, Shan Liping, Liu Ming, Xu Dongliang, Yang Qing, Qin Bing
- Gpt4video: A Unified Multimodal Large Language Model For Instruction-followed Understanding And Safety-aware Generation Wang Zhanyu, Wang Longyue, Zhao Zhen, Wu Minghao, Lyu Chenyang, Li Huayang, Cai Deng, Zhou Luping, Shi Shuming, Tu Zhaopeng
- Can Llms Like GPT-4 Outperform Traditional AI Tools In Dementia Diagnosis? Maybe, But Not Today Wang Zhuo, Li Rongzhen, Dong Bowen, Wang Jie, Li Xiuxing, Liu Ning, Mao Chenhui, Zhang Wei, Dong Liling, Gao Jing, Wang Jianyong
- Enabling Language Models To Implicitly Learn Self-improvement Wang Ziqi, Hou Le, Lu Tianjian, Wu Yuexin, Li Yunxuan, Yu Hongkun, Ji Heng
- Filling The Image Information Gap For VQA: Prompting Large Language Models To Proactively Ask Questions Wang Ziyue, Chen Chi, Li Peng, Liu Yang
- Call For Papers -- The Babylm Challenge: Sample-efficient Pretraining On A Developmentally Plausible Corpus Warstadt Alex, Choshen Leshem, Mueller Aaron, Williams Adina, Wilcox Ethan, Zhuang Chengxu
- Llmrec: Large Language Models With Graph Augmentation For Recommendation Wei Wei, Xubin Ren, Jiabin Tang, Qinyong Wang, Lixin Su, Suqi Cheng, Junfeng Wang, Dawei Yin, Chao Huang
- Menatqa: A New Dataset For Testing The Temporal Comprehension And Reasoning Abilities Of Large Language Models Wei Yifan, Su Yisong, Ma Huanhuan, Yu Xiaoyan, Lei Fangyu, Zhang Yuanzhe, Zhao Jun, Liu Kang
- Copiloting The Copilots: Fusing Large Language Models With Completion Engines For Automated Program Repair Wei Yuxiang, Xia Chunqiu Steven, Zhang Lingming
- REPLUG: Retrieval-augmented Black-box Language Models Weijia Shi, Sewon Min, Michihiro Yasunaga, Minjoon Seo, Rich James, Mike Lewis, Luke Zettlemoyer, Wen-tau Yih
- Mindmap: Knowledge Graph Prompting Sparks Graph Of Thoughts In Large Language Models Wen Yilin, Wang Zifeng, Sun Jimeng
- GROVE: A Retrieval-augmented Complex Story Generation Framework With A Forest Of Evidence Wen Zhihua, Tian Zhiliang, Wu Wei, Yang Yuxin, Shi Yanqi, Huang Zhen, Li Dongsheng
- Lmtuner: An User-friendly And Highly-integrable Training Framework For Fine-tuning Large Language Models Weng Yixuan, Wang Zhiqi, Liao Huanxuan, He Shizhu, Liu Shengping, Liu Kang, Zhao Jun
- Mastering Symbolic Operations: Augmenting Language Models With Compiled Neural Networks Weng Yixuan, Zhu Minjun, Xia Fei, Li Bin, He Shizhu, Liu Kang, Zhao Jun
- AI And The FCI: Can Chatgpt Project An Understanding Of Introductory Physics? West Colin G.
- Think Twice: Perspective-taking Improves Large Language Models' Theory-of-mind Capabilities Wilf Alex, Lee Sihyun Shawn, Liang Paul Pu, Morency Louis-philippe
- From Word Models To World Models: Translating From Natural Language To The Probabilistic Language Of Thought Wong Lionel, Grand Gabriel, Lew Alexander K., Goodman Noah D., Mansinghka Vikash K., Andreas Jacob, Tenenbaum Joshua B.
- Exploring The Limits Of Chatgpt In Software Security Applications Wu Fangzhou, Zhang Qingzhao, Bajaj Ati Priya, Bao Tiffany, Zhang Ning, Wang Ruoyu "fish", Xiao Chaowei
- Speechgen: Unlocking The Generative Power Of Speech Language Models With Prompts Wu Haibin, Chang Kai-wei, Wu Yuan-kuei, Lee Hung-yi
- P-laplacian Adaptation For Generative Pre-trained Vision-language Models Wu Haoyuan, Zhang Xinyun, Xu Peng, Liao Peiyu, Yao Xufeng, Yu Bei
- Large Language Models Leverage External Knowledge To Extend Clinical Insight Beyond Language Boundaries Wu Jiageng, Wu Xian, Qiu Zhaopeng, Li Minghui, Zhang Yingying, Zheng Yefeng, Yuan Changzheng, Yang Jie
- A New Dialogue Response Generation Agent For Large Language Models By Asking Questions To Detect User's Intentions Wu Siwei, Shen Xiangqing, Xia Rui
- Self-correcting Llm-controlled Diffusion Models Wu Tsung-han, Lian Long, Gonzalez Joseph E., Li Boyi, Darrell Trevor
- Paragraph-to-image Generation With Information-enriched Diffusion Model Wu Weijia, Li Zhuang, He Yefei, Shou Mike Zheng, Shen Chunhua, Cheng Lele, Li Yan, Gao Tingting, Zhang Di, Wang Zhongyuan
- Unleashing Potential Of Evidence In Knowledge-intensive Dialogue Generation Wu Xianjie, Yang Jian, Li Tongliang, Liang Di, Zhang Shiwei, Du Yiyang, Li Zhoujun
- An Early Evaluation Of Gpt-4v(ision) Wu Yang, Wang Shilong, Yang Hao, Zheng Tian, Zhang Hongbo, Zhao Yanyan, Qin Bing
- Retrieve-rewrite-answer: A Kg-to-text Enhanced Llms Framework For Knowledge Graph Question Answering Wu Yike, Hu Nan, Bi Sheng, Qi Guilin, Ren Jie, Xie Anhuan, Song Wei
- Mathchat: Converse To Tackle Challenging Math Problems With LLM Agents Wu Yiran, Jia Feiran, Zhang Shaokun, Li Hangyu, Zhu Erkang, Wang Yue, Lee Yin Tat, Peng Richard, Wu Qingyun, Wang Chi
- Jailbreaking GPT-4V Via Self-adversarial Attacks With System Prompts Wu Yuanwei, Li Xiang, Liu Yixin, Zhou Pan, Sun Lichao
- Semantic Parsing By Large Language Models For Intricate Updating Strategies Of Zero-shot Dialogue State Tracking Wu Yuxiang, Dong Guanting, Xu Weiran
- Fine-grained Human Feedback Gives Better Rewards For Language Model Training Wu Zeqiu, Hu Yushi, Shi Weijia, Dziri Nouha, Suhr Alane, Ammanabrolu Prithviraj, Smith Noah A., Ostendorf Mari, Hajishirzi Hannaneh
- Openicl: An Open-source Framework For In-context Learning Wu Zhenyu, Wang Yaoxiang, Ye Jiacheng, Feng Jiangtao, Xu Jingjing, Qiao Yu, Wu Zhiyong
- A Simple Baseline For Knowledge-based Visual Question Answering Xenos Alexandros, Stafylakis Themos, Patras Ioannis, Tzimiropoulos Georgios
- Towards Open-world Recommendation With Knowledge Augmentation From Large Language Models Xi Yunjia, Liu Weiwen, Lin Jianghao, Cai Xiaoling, Zhu Hong, Zhu Jieming, Chen Bo, Tang Ruiming, Zhang Weinan, Zhang Rui, Yu Yong
- Hybrid Retrieval-augmented Generation For Real-time Composition Assistance Xia Menglin, Zhang Xuchao, Couturier Camille, Zheng Guoqing, Rajmohan Saravan, Ruhle Victor
- Improving Question Generation With Multi-level Content Planning Xia Zehua, Gou Qi, Yu Bowen, Yu Haiyang, Huang Fei, Li Yongbin, Nguyen Cam-tu
- Efficient Streaming Language Models With Attention Sinks Xiao Guangxuan, Tian Yuandong, Chen Beidi, Han Song, Lewis Mike
- O3D: Offline Data-driven Discovery And Distillation For Sequential Decision-making With Large Language Models Xiao Yuchen, Sun Yanchao, Xu Mengda, Madhushani Udari, Vann Jared, Garg Deepeka, Ganesh Sumitra
- Supporting Qualitative Analysis With Large Language Models: Combining Codebook With GPT-3 For Deductive Coding Xiao Ziang, Yuan Xingdi, Liao Q. Vera, Abdelghani Rania, Oudeyer Pierre-yves
- Instructed Language Models With Retrievers Are Powerful Entity Linkers Xiao Zilin, Gong Ming, Wu Jie, Zhang Xingyao, Shou Linjun, Pei Jian, Jiang Daxin
- Rethinking The Evaluation For Conversational Recommendation In The Era Of Large Language Models Xiaolei Wang, Xinyu Tang, Wayne Xin Zhao, Jingyuan Wang, Ji-rong Wen
- Pangu-σ: Towards Trillion Parameter Language Model With Sparse Heterogeneous Computing Xiaozhe Ren, Pingyi Zhou, Xinfan Meng, Xinjing Huang, Yadao Wang, Weichao Wang, Pengfei Li, Xiaoda Zhang, Alexander Podolskiy, Grigory Arshinov, Andrey Bout, Irina Piontkovskaya, Jiansheng Wei, Xin Jiang, Teng Su, Qun Liu, Jun Yao
- Adaptive Chameleon Or Stubborn Sloth: Revealing The Behavior Of Large Language Models In Knowledge Conflicts Xie Jian, Zhang Kai, Chen Jiangjie, Lou Renze, Su Yu
- Proto-lm: A Prototypical Network-based Framework For Built-in Interpretability In Large Language Models Xie Sean, Vosoughi Soroush, Hassanpour Saeed
- Tencentllmeval: A Hierarchical Evaluation Of Real-world Capabilities For Human-aligned Llms Xie Shuyi, Yao Wenlin, Dai Yong, Wang Shaobo, Zhou Donlin, Jin Lifeng, Feng Xinhua, Wei Pengzhi, Lin Yujie, Hu Zhichao, Yu Dong, Zhang Zhengyou, Nie Jing, Liu Yuhong
- Text2reward: Reward Shaping With Language Models For Reinforcement Learning Xie Tianbao, Zhao Siheng, Wu Chen Henry, Liu Yitao, Luo Qian, Zhong Victor, Yang Yanchao, Yu Tao
- Openagents: An Open Platform For Language Agents In The Wild Xie Tianbao, Zhou Fan, Cheng Zhoujun, Shi Peng, Weng Luoxuan, Liu Yitao, Hua Toh Jing, Zhao Junning, Liu Qian, Liu Che, Liu Leo Z., Xu Yiheng, Su Hongjin, Shin Dongchan, Xiong Caiming, Yu Tao
- Olagpt: Empowering Llms With Human-like Problem-solving Abilities Xie Yuanzhen, Xie Tao, Lin Mingxiong, Wei Wentao, Li Chenglin, Kong Beibei, Chen Lei, Zhuo Chengxiang, Hu Bo, Li Zang
- A Preliminary Study Of Chatgpt On News Recommendation: Personalization, Provider Fairness, Fake News Xinyi Li, Yongfeng Zhang, Edward C. Malthouse
- Bridging Items And Language: A Transition Paradigm For Large Language Model-based Recommendation Xinyu Lin, Wenjie Wang, Yongqi Li, Fuli Feng, See-kiong Ng, Tat-seng Chua
- Examining Inter-consistency Of Large Language Models Collaboration: An In-depth Analysis Via Debate Xiong Kai, Ding Xiao, Cao Yixin, Liu Ting, Qin Bing
- Can Llms Express Their Uncertainty? An Empirical Evaluation Of Confidence Elicitation In Llms Xiong Miao, Hu Zhiyuan, Lu Xinyang, Li Yifei, Fu Jie, He Junxian, Hooi Bryan
- Gentopia: A Collaborative Platform For Tool-augmented Llms Xu Binfeng, Liu Xukun, Shen Hua, Han Zeyu, Li Yuhan, Yue Murong, Peng Zhiyuan, Liu Yuchen, Yao Ziyu, Xu Dongkuan
- Rewoo: Decoupling Reasoning From Observations For Efficient Augmented Language Models Xu Binfeng, Peng Zhiyuan, Lei Bowen, Mukherjee Subhabrata, Liu Yuchen, Xu Dongkuan
- Baize: An Open-source Chat Model With Parameter-efficient Tuning On Self-chat Data Xu Canwen, Guo Daya, Duan Nan, Mcauley Julian
- Symbol-llm: Towards Foundational Symbol-centric Interface For Large Language Models Xu Fangzhi, Wu Zhiyong, Sun Qiushi, Ren Siyu, Yuan Fei, Yuan Shuai, Lin Qika, Qiao Yu, Liu Jun
- Recommender AI Agent: Integrating Large Language Models For Interactive Recommendations Xu Huang, Jianxun Lian, Yuxuan Lei, Jing Yao, Defu Lian, Xing Xie
- Retrieval-based Video Language Model For Efficient Long Video Question Answering Xu Jiaqi, Lan Cuiling, Xie Wenxuan, Chen Xuejin, Lu Yan
- U-llava: Unifying Multi-modal Tasks Via Large Language Model Xu Jinjin, Xu Liwu, Yang Yuzhe, Li Xiang, Wang Fanyi, Xie Yanchun, Huang Yi-jie, Li Yaqian
- Superclue: A Comprehensive Chinese Large Language Model Benchmark Xu Liang, Li Anqi, Zhu Lei, Xue Hang, Zhu Changtai, Zhao Kangkang, He Haonan, Zhang Xuanwei, Kang Qiyue, Lan Zhenzhong
- On The Tool Manipulation Capability Of Open-source Large Language Models Xu Qiantong, Hong Fenglu, Li Bo, Hu Changran, Chen Zhengyu, Zhang Jian
- Reasons To Reject? Aligning Language Models With Judgments Xu Weiwen, Cai Deng, Zhang Zhisong, Lam Wai, Shi Shuming
- Pink: Unveiling The Power Of Referential Comprehension For Multi-modal Llms Xuan Shiyu, Guo Qingpei, Yang Ming, Zhang Shiliang
- Representation Learning With Large Language Models For Recommendation Xubin Ren, Wei Wei, Lianghao Xia, Lixin Su, Suqi Cheng, Junfeng Wang, Dawei Yin, Chao Huang
- Bias And Fairness In Chatbots: An Overview Xue Jintang, Wang Yun-cheng, Wei Chengwei, Liu Xiaofeng, Woo Jonghye, Kuo C.-c. Jay
- Domain-specific Chatbots For Science Using Embeddings Yager Kevin G.
- Optimizing Machine Translation Through Prompt Engineering: An Investigation Into Chatgpt's Customizability Yamada Masaru
- LPML: Llm-prompting Markup Language For Mathematical Reasoning Yamauchi Ryutaro, Sonoda Sho, Sannai Akiyoshi, Kumagai Wataru
- Backdooring Instruction-tuned Large Language Models With Virtual Prompt Injection Yan Jun, Yadav Vikas, Li Shiyang, Chen Lichang, Tang Zheng, Wang Hai, Srinivasan Vijay, Ren Xiang, Jin Hongxia
- Refining The Responses Of Llms By Themselves Yan Tianqiang, Xu Tiansheng
- Ask More, Know Better: Reinforce-learned Prompt Questions For Decision Making With Large Language Models Yan Xue, Song Yan, Cui Xinyu, Christianos Filippos, Zhang Haifeng, Mguni David Henry, Wang Jun
- Multimodal Chatgpt For Medical Applications: An Experimental Study Of GPT-4V Yan Zhiling, Zhang Kai, Zhou Rong, He Lifang, Li Xiang, Sun Lichao
- Recmind: Large Language Model Powered Agent For Recommendation Yancheng Wang, Ziyan Jiang, Zheng Chen, Fan Yang, Yingxue Zhou, Eunah Cho, Xing Fan, Xiaojiang Huang, Yanbin Lu, Yingzhen Yang
- Can You Follow Me? Testing Situational Understanding In Chatgpt Yang Chenghao, Ettinger Allyson
- Zero-shot Query Reformulation For Conversational Search Yang Dayu, Zhang Yue, Fang Hui
- PALR: Personalization Aware Llms For Recommendation Yang Fan, Chen Zheng, Jiang Ziyan, Cho Eunah, Huang Xiaojiang, Lu Yanbin
- PRCA: Fitting Black-box Large Language Models For Retrieval Question Answering Via Pluggable Reward-driven Contextual Adapter Yang Haoyan, Li Zhitao, Zhang Yong, Wang Jianzong, Cheng Ning, Li Ming, Xiao Jing
- Iterative Forward Tuning Boosts In-context Learning In Language Models Yang Jiaxi, Hui Binyuan, Yang Min, Wang Bailin, Li Bowen, Li Binhua, Huang Fei, Li Yongbin
- Medxchat: A Unified Multimodal Large Language Model Framework Towards Cxrs Understanding And Generation Yang Ling, Wang Zhanyu, Chen Zhenghao, Liang Xinyu, Zhou Luping
- Supervised Knowledge Makes Large Language Models Better In-context Learners Yang Linyi, Zhang Shuibai, Yu Zhuohao, Bao Guangsheng, Wang Yidong, Wang Jindong, Xu Ruochen, Ye Wei, Xie Xing, Chen Weizhu, Zhang Yue
- Integrating UMLS Knowledge Into Large Language Models For Medical Question Answering Yang Rui, Marrese-taylor Edison, Ke Yuhe, Cheng Lechao, Chen Qingyu, Li Irene
- Progressive Evidence Refinement For Open-domain Multimodal Retrieval Question Answering Yang Shuwen, Wu Anran, Wu Xingjiao, Xiao Luwei, Ma Tianlong, Jin Cheng, He Liang
- Human-in-the-loop Machine Translation With Large Language Model Yang Xinyi, Zhan Runzhe, Wong Derek F., Wu Junchao, Chao Lidia S.
- Bias A-head? Analyzing Bias In Transformer-based Language Model Attention Heads Yang Yi, Duan Hanyu, Abbasi Ahmed, Lalor John P., Tam Kar Yan
- Mindllm: Pre-training Lightweight Large Language Model From Scratch, Evaluations And Domain Applications Yang Yizhe, Sun Huashan, Li Jiawei, Liu Runheng, Li Yinghao, Liu Yuhang, Huang Heyan, Gao Yang
- Harnessing The Power Of Large Language Models For Natural Language To First-order Logic Translation Yang Yuan, Xiong Siheng, Payani Ali, Shareghi Ehsan, Fekri Faramarz
- Improving The Reliability Of Large Language Models By Leveraging Uncertainty-aware In-context Learning Yang Yuchen, Li Houqiang, Wang Yanfeng, Wang Yu
- On The Planning, Search, And Memorization Capabilities Of Large Language Models Yang Yunhao, Tomar Anshul
- Failures Pave The Way: Enhancing Large Language Models Through Tuning-free Rule Accumulation Yang Zeyuan, Li Peng, Liu Yang
- GPT Can Solve Mathematical Problems Without A Calculator Yang Zhen, Ding Ming, Lv Qingsong, Jiang Zhihuan, He Zehai, Guo Yuyi, Bai Jinfeng, Tang Jie
- I-code V2: An Autoregressive Generation Framework Over Vision, Language, And Speech Data Yang Ziyi, Khademi Mahmoud, Xu Yichong, Pryzant Reid, Fang Yuwei, Zhu Chenguang, Chen Dongdong, Qian Yao, Gao Mei, Chen Yi-ling, Gmyr Robert, Kanda Naoyuki, Codella Noel, Xiao Bin, Shi Yu, Yuan Lu, Yoshioka Takuya, Zeng Michael, Huang Xuedong
- Promptcare: Prompt Copyright Protection By Watermark Injection And Verification Yao Hongwei, Lou Jian, Ren Kui, Qin Zhan
- Knowledge Plugins: Enhancing Large Language Models For Domain-specific Recommendations Yao Jing, Xu Wei, Lian Jianxun, Wang Xiting, Yi Xiaoyuan, Xie Xing
- Retroformer: Retrospective Large Language Agents With Policy Gradient Optimization Yao Weiran, Heinecke Shelby, Niebles Juan Carlos, Liu Zhiwei, Feng Yihao, Xue Le, Murthy Rithesh, Chen Zeyuan, Zhang Jianguo, Arpit Devansh, Xu Ran, Mui Phil, Wang Huan, Xiong Caiming, Savarese Silvio
- Beyond Chain-of-thought, Effective Graph-of-thought Reasoning In Language Models Yao Yao, Li Zuchao, Zhao Hai
- Deepspeed-visualchat: Multi-round Multi-image Interleave Chat Via Multi-modal Causal Attention Yao Zhewei, Wu Xiaoxia, Li Conglong, Zhang Minjia, Qin Heyang, Ruwase Olatunji, Awan Ammar Ahmad, Rajbhandari Samyam, He Yuxiong
- Do Physicians Know How To Prompt? The Need For Automatic Prompt Optimization Help In Clinical Note Generation Yao Zonghai, Jaafar Ahmed, Wang Beining, Yang Zhichao, Yu Hong
- Studying And Improving Reasoning In Humans And Machines Yax Nicolas, Anlló Hernan, Palminteri Stefano
- Effective Large Language Model Adaptation For Improved Grounding And Citation Generation Ye Xi, Sun Ruoxi, Arik Sercan Ö., Pfister Tomas
- Rational Decision-making Agent With Internalized Utility Judgment Ye Yining, Cong Xin, Tian Shizuo, Qin Yujia, Liu Chong, Lin Yankai, Liu Zhiyuan, Sun Maosong
- ASPEN: High-throughput Lora Fine-tuning Of Large Language Models With A Single GPU Ye Zhengmao, Li Dengchun, Tian Jingqi, Lan Tingfeng, Zuo Jie, Duan Lei, Lu Hui, Jiang Yexi, Sha Jian, Zhang Ke, Tang Mingjie
- Navigating Text-to-image Customization: From Lycoris Fine-tuning To Model Evaluation Yeh Shih-ying, Hsieh Yu-guan, Gao Zhidong, Yang Bernard B W, Oh Giyeong, Gong Yanmin
- A Multitask, Multilingual, Multimodal Evaluation Of Chatgpt On Reasoning, Hallucination, And Interactivity Yejin Bang, Samuel Cahyawijaya, Nayeon Lee, Wenliang Dai, Dan Su, Bryan Wilie, Holy Lovenia, Ziwei Ji, Tiezheng Yu, Willy Chung, Quyet V. Do, Yan Xu, Pascale Fung
- Ctrlstruct: Dialogue Structure Learning For Open-domain Response Generation Yin Congchi, Li Piji, Ren Zhaochun
- Agent Lumos: Unified And Modular Training For Open-source Language Agents Yin Da, Brahman Faeze, Ravichander Abhilasha, Chandu Khyathi, Chang Kai-wei, Choi Yejin, Lin Bill Yuchen
- Dynosaur: A Dynamic Growth Paradigm For Instruction-tuning Data Curation Yin Da, Liu Xiao, Yin Fan, Zhong Ming, Bansal Hritik, Han Jiawei, Chang Kai-wei
- Modulora: Finetuning 2-bit Llms On Consumer Gpus By Integrating With Modular Quantizers Yin Junjie, Dong Jiahao, Wang Yingheng, De Sa Christopher, Kuleshov Volodymyr
- Exchange-of-thought: Enhancing Large Language Model Capabilities Through Cross-model Communication Yin Zhangyue, Sun Qiushi, Chang Cheng, Guo Qipeng, Dai Junqi, Huang Xuanjing, Qiu Xipeng
- LAMM: Language-assisted Multi-modal Instruction-tuning Dataset, Framework, And Benchmark Yin Zhenfei, Wang Jiong, Cao Jianjian, Shi Zhelun, Liu Dingning, Li Mukai, Sheng Lu, Bai Lei, Huang Xiaoshui, Wang Zhiyong, Shao Jing, Ouyang Wanli
- Intuitive Or Dependent? Investigating Llms' Behavior Style To Conflicting Prompts Ying Jiahao, Cao Yixin, Xiong Kai, He Yidong, Cui Long, Liu Yongbin
- Low-resource Languages Jailbreak GPT-4 Yong Zheng-xin, Menghini Cristina, Bach Stephen H.
- Search-adaptor: Embedding Customization For Information Retrieval Yoon Jinsung, Arik Sercan O, Chen Yanfei, Pfister Tomas
- Asyncmld: Asynchronous Multi-llm Framework For Dialogue Recommendation System Yoshimaru Naoki, Okuma Motoharu, Iio Takamasa, Hatano Kenji
- Fundamental Limitations Of Alignment In Large Language Models Yotam Wolf, Noam Wies, Oshri Avnery, Yoav Levine, Amnon Shashua
- Decoding In-context Learning: Neuroscience-inspired Analysis Of Representations In Large Language Models Yousefi Safoora, Betthauser Leo, Hasanbeig Hosein, Millière Raphaël, Momennejad Ida
- Merlin: Empowering Multimodal Llms With Foresight Minds Yu En, Zhao Liang, Wei Yana, Yang Jinrong, Wu Dongming, Kong Lingyu, Wei Haoran, Wang Tiancai, Ge Zheng, Zhang Xiangyu, Tao Wenbing
- GPTFUZZER: Red Teaming Large Language Models With Auto-generated Jailbreak Prompts Yu Jiahao, Lin Xingwei, Yu Zheng, Xing Xinyu
- Assessing Prompt Injection Risks In 200+ Custom Gpts Yu Jiahao, Wu Yuhang, Shu Dong, Jin Mingyu, Yang Sabrina, Xing Xinyu
- Characterizing Mechanisms For Factual Recall In Language Models Yu Qinan, Merullo Jack, Pavlick Ellie
- Improving Language Models Via Plug-and-play Retrieval Feedback Yu Wenhao, Zhang Zhihan, Liang Zhenwen, Jiang Meng, Sabharwal Ashish
- Reeval: Automatic Hallucination Evaluation For Retrieval-augmented Large Language Models Via Transferable Adversarial Attacks Yu Xiaodong, Cheng Hao, Liu Xiaodong, Roth Dan, Gao Jianfeng
- Finmem: A Performance-enhanced LLM Trading Agent With Layered Memory And Character Design Yu Yangyang, Li Haohang, Chen Zhi, Jiang Yuechen, Li Yang, Zhang Denghui, Liu Rong, Suchow Jordan W., Khashanah Khaldoun
- Explanation-aware Soft Ensemble Empowers Large Language Model In-context Learning Yu Yue, Shen Jiaming, Liu Tianqi, Qin Zhen, Yan Jing Nathan, Liu Jialu, Zhang Chao, Bendersky Michael
- Prophet: Prompting Large Language Models With Complementary Answer Heuristics For Knowledge-based Visual Question Answering Yu Zhou, Ouyang Xuecheng, Shao Zhenwei, Wang Meng, Yu Jun
- CRAFT: Customizing Llms By Creating And Retrieving From Specialized Toolsets Yuan Lifan, Chen Yangyi, Wang Xingyao, Fung Yi R., Peng Hao, Ji Heng
- Large Language Models Illuminate A Progressive Pathway To Artificial Healthcare Assistant: A Review Yuan Mingze, Bao Peng, Yuan Jiajia, Shen Yunhao, Chen Zifan, Xie Yi, Zhao Jie, Chen Yang, Zhang Li, Shen Lin, Dong Bin
- Batcheval: Towards Human-like Text Evaluation Yuan Peiwen, Feng Shaoxiong, Li Yiwei, Wang Xinglin, Pan Boyuan, Wang Heda, Li Kan
- Evolving Large Language Model Assistant With Long-term Conditional Memory Yuan Ruifeng, Sun Shichao, Wang Zili, Cao Ziqiang, Li Wenjie
- Large Language Model Cascades With Mixture Of Thoughts Representations For Cost-efficient Reasoning Yue Murong, Zhao Jie, Zhang Min, Du Liang, Yao Ziyu
- Plan, Eliminate, And Track -- Language Models Are Good Teachers For Embodied Agents Yue Wu, So Yeon Min, Yonatan Bisk, Ruslan Salakhutdinov, Amos Azaria, Yuanzhi Li, Tom Mitchell, Shrimai Prabhumoye
- Contextual Object Detection With Multimodal Large Language Models Yuhang Zang, Wei Li, Jun Han, Kaiyang Zhou, Chen Change Loy
- Tool Learning With Foundation Models Yujia Qin, Shengding Hu, Yankai Lin, Weize Chen, Ning Ding, Ganqu Cui, Zheni Zeng, Yufei Huang, Chaojun Xiao, Chi Han, Yi Ren Fung, Yusheng Su, Huadong Wang, Cheng Qian, Runchu Tian, Kunlun Zhu, Shihao Liang, Xingyu Shen, Bokai Xu, Zhen Zhang, Yining Ye, Bowen Li, Ziwei Tang, Jing Yi, Yuzhang Zhu, Zhenning Dai, Lan Yan, Xin Cong, Yaxi Lu, Weilin Zhao, Yuxiang Huang, Junxi Yan, Xu Han, Xian Sun, Dahai Li, Jason Phang, Cheng Yang, Tongshuang Wu, Heng Ji, Zhiyuan Liu, Maosong Sun
- Attention Satisfies: A Constraint-satisfaction Lens On Factual Errors Of Language Models Yuksekgonul Mert, Chandrasekaran Varun, Jones Erik, Gunasekar Suriya, Naik Ranjita, Palangi Hamid, Kamar Ece, Nushi Besmira
- Empowering Private Tutoring By Chaining Large Language Models Yulin Chen, Ning Ding, Hai-tao Zheng, Zhiyuan Liu, Maosong Sun, Bowen Zhou
- Keeping Users Engaged During Repeated Administration Of The Same Questionnaire: Using Large Language Models To Reliably Diversify Questions Yun Hye Sun, Arjmand Mehdi, Sherlock Phillip, Paasche-orlow Michael K., Griffith James W., Bickmore Timothy
- Chat-rec: Towards Interactive And Explainable Llms-augmented Recommender System Yunfan Gao, Tao Sheng, Youlin Xiang, Yun Xiong, Haofen Wang, Jiawei Zhang
- Retrieval-augmented Generation For Large Language Models: A Survey Yunfan Gao, Yun Xiong, Xinyu Gao, Kangxiang Jia, Jinliu Pan, Yuxi Bi, Yi Dai, Jiawei Sun, Meng Wang, Haofen Wang
- Pre-training To Learn In Context Yuxian Gu, Li Dong, Furu Wei, Minlie Huang
- Planting A SEED Of Vision In Large Language Model Yuying Ge, Yixiao Ge, Ziyun Zeng, Xintao Wang, Ying Shan
- Improving Machine Translation With Large Language Models: A Preliminary Study With Cooperative Decoding Zeng Jiali, Meng Fandong, Yin Yongjing, Zhou Jie
- TIM: Teaching Large Language Models To Translate With Comparison Zeng Jiali, Meng Fandong, Yin Yongjing, Zhou Jie
- Building Open-ended Embodied Agent Via Language-policy Bidirectional Adaptation Zhai Shaopeng, Wang Jie, Zhang Tianyi, Huang Fuxian, Zhang Qi, Zhou Ming, Hou Jing, Qiao Yu, Liu Yu
- Deceptive AI Ecosystems: The Case Of Chatgpt Zhan Xiao, Xu Yifan, Sarkadi Stefan
- Vpgtrans: Transfer Visual Prompt Generator Across Llms Zhang Ao, Fei Hao, Yao Yuan, Ji Wei, Li Li, Liu Zhiyuan, Chua Tat-seng
- Evaluating And Improving Tool-augmented Computation-intensive Math Reasoning Zhang Beichen, Zhou Kun, Wei Xilin, Zhao Wayne Xin, Sha Jing, Wang Shijin, Wen Ji-rong
- A Simple LLM Framework For Long-range Video Question-answering Zhang Ce, Lu Taixi, Islam Md Mohaiminul, Wang Ziyang, Yu Shoubin, Bansal Mohit, Bertasius Gedas
- Automatic Generation Of Multiple-choice Questions Zhang Cheng
- Mobile-env: Building Qualified Evaluation Benchmarks For LLM-GUI Interaction Zhang Danyang, Shen Zhennan, Xie Rui, Zhang Situo, Xie Tianbao, Zhao Zihan, Chen Siyuan, Chen Lu, Xu Hongshen, Cao Ruisheng, Yu Kai
- Video-llama: An Instruction-tuned Audio-visual Language Model For Video Understanding Zhang Hang, Li Xin, Bing Lidong
- Controllable Text Generation With Residual Memory Transformer Zhang Hanqing, Si Sun, Wu Haiming, Song Dawei
- Uncovering Hidden Connections: Iterative Search And Reasoning For Video-grounded Dialog Zhang Haoyu, Liu Meng, Wang Yaowei, Cao Da, Guan Weili, Nie Liqiang
- Tractable Control For Autoregressive Language Generation Zhang Honghua, Dang Meihua, Peng Nanyun, Broeck Guy Van Den
- Empirical Evaluation Of Chatgpt On Requirements Information Retrieval Under Zero-shot Setting Zhang Jianzhang, Chen Yiyang, Niu Nan, Wang Yinglin, Liu Chuang
- Glm-dialog: Noise-tolerant Pre-training For Knowledge-grounded Dialogue Generation Zhang Jing, Zhang Xiaokang, Zhang-li Daniel, Yu Jifan, Yao Zijun, Ma Zeyao, Xu Yiqi, Wang Haohua, Zhang Xiaohan, Lin Nianyi, Lu Sunrui, Li Juanzi, Tang Jie
- Aligning Instruction Tasks Unlocks Large Language Models As Zero-shot Relation Extractors Zhang Kai, Gutiérrez Bernal Jiménez, Su Yu
- Moqagpt: Zero-shot Multi-modal Open-domain Question Answering With Large Language Model Zhang Le, Wu Yihong, Mo Fengran, Nie Jian-yun, Agrawal Aishwarya
- Marathon: A Race Through The Realm Of Long Context With Large Language Models Zhang Lei, Li Yunshui, Liu Ziqiang, Yang Jiaxi, Liu Junhao, Chen Longze, Luo Run, Yang Min
- Can LLM Find The Green Circle? Investigation And Human-guided Tool Manipulation For Compositional Generalization Zhang Min, He Jianfeng, Lei Shuo, Yue Murong, Wang Linhang, Lu Chang-tien
- Retrieve Anything To Augment Large Language Models Zhang Peitian, Xiao Shitao, Liu Zheng, Dou Zhicheng, Nie Jian-yun
- Ask An Expert: Leveraging Language Models To Improve Strategic Reasoning In Goal-oriented Dialogue Models Zhang Qiang, Naradowsky Jason, Miyao Yusuke
- Knowgpt: Knowledge Graph Based Prompting For Large Language Models Zhang Qinggang, Dong Junnan, Chen Hao, Zha Daochen, Yu Zailiang, Huang Xiao
- Arcmmlu: A Library And Information Science Benchmark For Large Language Models Zhang Shitou, Li Zuchao, Liu Xingshen, Yang Liming, Wang Ping
- Planning With Large Language Models For Code Generation Zhang Shun, Chen Zhenfang, Shen Yikang, Ding Mingyu, Tenenbaum Joshua B., Gan Chuang
- Adarefiner: Refining Decisions Of Language Models With Adaptive Feedback Zhang Wanpeng, Lu Zongqing
- EHR Interaction Between Patients And AI: Noteaid EHR Interaction Zhang Xiaocheng, Yao Zonghai, Yu Hong
- Language Models Are Universal Embedders Zhang Xin, Li Zehan, Zhang Yanzhao, Long Dingkun, Xie Pengjun, Zhang Meishan, Zhang Min
- CGCE: A Chinese Generative Chat Evaluation Benchmark For General And Financial Domains Zhang Xuanyu, Li Bingbing, Yang Qing
- Llavar: Enhanced Visual Instruction Tuning For Text-rich Image Understanding Zhang Yanzhe, Zhang Ruiyi, Gu Jiuxiang, Zhou Yufan, Lipka Nedim, Yang Diyi, Sun Tong
- Meta Prompting For AI Systems Zhang Yifan, Yuan Yang, Yao Andrew Chi-chih
- Probing The Multi-turn Planning Capabilities Of Llms Via 20 Question Games Zhang Yizhe, Lu Jiarui, Jaitly Navdeep
- IAG: Induction-augmented Generation Framework For Answering Reasoning Questions Zhang Zhebin, Zhang Xinyu, Ren Yuanhang, Shi Saijiang, Han Meng, Wu Yongkang, Lai Ruofei, Cao Zhao
- VISAR: A Human-ai Argumentative Writing Assistant With Visual Programming And Rapid Draft Prototyping Zhang Zheng, Gao Jie, Dhaliwal Ranjodh Singh, Li Toby Jia-jun
- Balancing Specialized And General Skills In Llms: The Impact Of Modern Tuning And Data Strategy Zhang Zheng, Zheng Chen, Tang Da, Sun Ke, Ma Yukun, Bu Yingtong, Zhou Xun, Zhao Liang
- Students' Perceptions And Preferences Of Generative Artificial Intelligence Feedback For Programming Zhang Zhengdong, Dong Zihan, Shi Yang, Matsuda Noboru, Price Thomas, Xu Dongkuan
- Reinforced UI Instruction Grounding: Towards A Generic UI Task Automation API Zhang Zhizheng, Xie Wenxuan, Zhang Xiaoyi, Lu Yan
- Responsible Task Automation: Empowering Large Language Models As Responsible Task Automators Zhang Zhizheng, Zhang Xiaoyi, Xie Wenxuan, Lu Yan
- You Only Look At Screens: Multimodal Chain-of-action Agents Zhang Zhuosheng, Zhang Aston
- Ehrtutor: Enhancing Patient Understanding Of Discharge Instructions Zhang Zihao, Yao Zonghai, Zhou Huixue, Ouyang Feiyun, Yu Hong
- LLM4DV: Using Large Language Models For Hardware Test Stimuli Generation Zhang Zixi, Chadwick Greg, Mcnally Hugo, Zhao Yiren, Mullins Robert
- Large Language Models As Zero-shot Conversational Recommenders Zhankui He, Zhouhang Xie, Rahul Jha, Harald Steck, Dawen Liang, Yesu Feng, Bodhisattwa Prasad Majumder, Nathan Kallus, Julian Mcauley
- In-context Exemplars As Clues To Retrieving From Large Associative Memory Zhao Jiachen
- Unimc: A Unified Framework For Long-term Memory Conversation Via Relevance Representation Learning Zhao Kang, Liu Wei, Luan Jian, Gao Minglei, Qian Li, Teng Hanlin, Wang Bin
- Verify-and-edit: A Knowledge-enhanced Chain-of-thought Framework Zhao Ruochen, Li Xingxuan, Joty Shafiq, Qin Chengwei, Bing Lidong
- Group Preference Optimization: Few-shot Alignment Of Large Language Models Zhao Siyan, Dang John, Grover Aditya
- Pareto Optimal Learning For Estimating Large Language Model Errors Zhao Theodore, Wei Mu, Preston J. Samuel, Poon Hoifung
- Ca-lora: Adapting Existing Lora For Compressed Llms To Enable Efficient Multi-tasking On Personal Devices Zhao Weilin, Huang Yuxiang, Han Xu, Liu Zhiyuan, Zhang Zhengyan, Li Kuai, Chen Chen, Yang Tao, Sun Maosong
- DIVKNOWQA: Assessing The Reasoning Ability Of Llms Via Open-domain Question Answering Over Knowledge Base And Text Zhao Wenting, Liu Ye, Niu Tong, Wan Yao, Yu Philip S., Joty Shafiq, Zhou Yingbo, Yavuz Semih
- Chat With The Environment: Interactive Multimodal Perception Using Large Language Models Zhao Xufeng, Li Mengdi, Weber Cornelius, Hafez Muhammad Burhan, Wermter Stefan
- Lookahead: An Inference Acceleration Framework For Large Language Model With Lossless Generation Accuracy Zhao Yao, Xie Zhitian, Liang Chen, Zhuang Chenyi, Gu Jinjie
- Librisqa: A Novel Dataset And Framework For Spoken Question Answering With Large Language Models Zhao Zihan, Jiang Yiyang, Liu Heyang, Wang Yanfeng, Wang Yu
- Lyra: Orchestrating Dual Correction In Automated Theorem Proving Zheng Chuanyang, Wang Haiming, Xie Enze, Liu Zhengying, Sun Jiankai, Xin Huajian, Shen Jianhao, Li Zhenguo, Li Yu
- Lmsys-chat-1m: A Large-scale Real-world LLM Conversation Dataset Zheng Lianmin, Chiang Wei-lin, Sheng Ying, Li Tianle, Zhuang Siyuan, Wu Zhanghao, Zhuang Yonghao, Li Zhuohan, Lin Zi, Xing Eric P., Gonzalez Joseph E., Stoica Ion, Zhang Hao
- Judging Llm-as-a-judge With Mt-bench And Chatbot Arena Zheng Lianmin, Chiang Wei-lin, Sheng Ying, Zhuang Siyuan, Wu Zhanghao, Zhuang Yonghao, Lin Zi, Li Zhuohan, Li Dacheng, Xing Eric P., Zhang Hao, Gonzalez Joseph E., Stoica Ion
- Memorycompanion: A Smart Healthcare Solution To Empower Efficient Alzheimer's Care Via Unleashing Generative AI Zheng Lifei, Heo Yeonie, Fang Yi
- Gpt-fathom: Benchmarking Large Language Models To Decipher The Evolutionary Path Towards GPT-4 And Beyond Zheng Shen, Zhang Yuyu, Zhu Yijie, Xi Chenguang, Gao Pengyang, Zhou Xun, Chang Kevin Chen-chuan
- Towards Unified Token Learning For Vision-language Tracking Zheng Yaozong, Zhong Bineng, Liang Qihua, Li Guorong, Ji Rongrong, Li Xianxian
- Marinegpt: Unlocking Secrets Of Ocean To The Public Zheng Ziqiang, Zhang Jipeng, Vu Tuan-anh, Diao Shizhe, Tim Yue Him Wong, Yeung Sai-kit
- Large Language Model Can Interpret Latent Space Of Sequential Recommender Zhengyi Yang, Jiancan Wu, Yanchen Luo, Jizhi Zhang, Yancheng Yuan, An Zhang, Xiang Wang, Xiangnan He
- Llamarec: Two-stage Recommendation Using Large Language Models For Ranking Zhenrui Yue, Sara Rabhi, Gabriel De Souza Pereira Moreira, Dong Wang, Even Oldridge
- CRITIC: Large Language Models Can Self-correct With Tool-interactive Critiquing Zhibin Gou, Zhihong Shao, Yeyun Gong, Yelong Shen, Yujiu Yang, Nan Duan, Weizhu Chen
- The Rise And Potential Of Large Language Model Based Agents: A Survey Zhiheng Xi, Wenxiang Chen, Xin Guo, Wei He, Yiwen Ding, Boyang Hong, Ming Zhang, Junzhe Wang, Senjie Jin, Enyu Zhou, Rui Zheng, Xiaoran Fan, Xiao Wang, Limao Xiong, Yuhao Zhou, Weiran Wang, Changhao Jiang, Yicheng Zou, Xiangyang Liu, Zhangyue Yin, Shihan Dou, Rongxiang Weng, Wensen Cheng, Qi Zhang, Wenjuan Qin, Yongyan Zheng, Xipeng Qiu, Xuanjing Huang, Tao Gui
- Humanoid Agents: Platform For Simulating Human-like Generative Agents Zhilin Wang, Yu Ying Chiu, Yu Cheung Chiu
- Exploring Human-like Translation Strategy With Large Language Models Zhiwei He, Tian Liang, Wenxiang Jiao, Zhuosheng Zhang, Yujiu Yang, Rui Wang, Zhaopeng Tu, Shuming Shi, Xing Wang
- Leveraging Large Language Models For Pre-trained Recommender Systems Zhixuan Chu, Hongyan Hao, Xin Ouyang, Simeng Wang, Yan Wang, Yue Shen, Jinjie Gu, Qing Cui, Longfei Li, Siqiao Xue, James Y Zhang, Sheng Li
- Bookgpt: A General Framework For Book Recommendation Empowered By Large Language Model Zhiyuli Aakas, Chen Yanfang, Zhang Xuan, Liang Xun
- Hierarchical Neural Program Synthesis Zhong Linghan, Lindeborg Ryan, Zhang Jesse, Lim Joseph J., Sun Shao-hua
- Chatabl: Abductive Learning Via Natural Language Interaction With Chatgpt Zhong Tianyang, Wei Yaonai, Yang Li, Wu Zihao, Liu Zhengliang, Wei Xiaozheng, Li Wenjun, Yao Junjie, Ma Chong, Li Xiang, Zhu Dajiang, Jiang Xi, Han Junwei, Shen Dinggang, Liu Tianming, Zhang Tuo
- Language Agent Tree Search Unifies Reasoning Acting And Planning In Language Models Zhou Andy, Yan Kai, Shlapentokh-rothman Michal, Wang Haohan, Wang Yu-xiong
- Ethical Chatgpt: Concerns, Challenges, And Commandments Zhou Jianlong, Müller Heimo, Holzinger Andreas, Chen Fang
- Think Before You Speak: Cultivating Communication Skills Of Large Language Models Via Inner Monologue Zhou Junkai, Pang Liang, Shen Huawei, Cheng Xueqi
- Vicor: Bridging Visual Understanding And Commonsense Reasoning With Large Language Models Zhou Kaiwen, Lee Kwonjoon, Misu Teruhisa, Wang Xin Eric
- CONA: A Novel Context-aware Instruction Paradigm For Communication Using Large Language Model Zhou Nan, Tao Xinghui, Chen Xi
- Infmllm: A Unified Framework For Visual-language Tasks Zhou Qiang, Wang Zhibin, Chu Wei, Xu Yinghui, Li Hao, Qi Yuan
- Making Harmful Behaviors Unlearnable For Large Language Models Zhou Xin, Lu Yi, Ma Ruotian, Gui Tao, Zhang Qi, Huang Xuanjing
- Flame: Few-shot Learning From Natural Language Explanations Zhou Yangqiaoyu, Zhang Yiming, Tan Chenhao
- ISR-LLM: Iterative Self-refined Large Language Model For Long-horizon Sequential Task Planning Zhou Zhehua, Song Jiayang, Yao Kunpeng, Shu Zhan, Ma Lei
- Large Language Model As A Policy Teacher For Training Reinforcement Learning Agents Zhou Zihao, Hu Bin, Zhao Chenyang, Zhang Pu, Liu Bin
- Avatargpt: All-in-one Framework For Motion Understanding, Planning, Generation And Beyond Zhou Zixiang, Wan Yu, Wang Baoyuan
- VL-GPT: A Generative Pre-trained Transformer For Vision And Language Understanding And Generation Zhu Jinguo, Ding Xiaohan, Ge Yixiao, Ge Yuying, Zhao Sijie, Zhao Hengshuang, Wang Xiaohua, Shan Ying
- Promptbench: A Unified Library For Evaluation Of Large Language Models Zhu Kaijie, Zhao Qinlin, Chen Hao, Wang Jindong, Xie Xing
- Promptcblue: A Chinese Prompt Tuning Benchmark For The Medical Domain Zhu Wei, Wang Xiaoling, Zheng Huanran, Chen Mosha, Tang Buzhou
- Question Answering As Programming For Solving Time-sensitive Questions Zhu Xinyu, Yang Cheng, Chen Bei, Li Siheng, Lou Jian-guang, Yang Yujiu
- Furthest Reasoning With Plan Assessment: Stable Reasoning Path With Retrieval-augmented Large Language Models Zhu Yin, Luo Zhiling, Cheng Gong
- Towards An On-device Agent For Text Rewriting Zhu Yun, Liu Yinxiao, Stahlberg Felix, Kumar Shankar, Chen Yu-hui, Luo Liangchen, Shu Lei, Liu Renjie, Chen Jindong, Meng Lei
- Large Language Models Can Learn Rules Zhu Zhaocheng, Xue Yuan, Chen Xinyun, Zhou Denny, Tang Jian, Schuurmans Dale, Dai Hanjun
- Toolqa: A Dataset For LLM Question Answering With External Tools Zhuang Yuchen, Yu Yue, Wang Kuan, Sun Haotian, Zhang Chao
- Multimodal Chain-of-thought Reasoning In Language Models Zhuosheng Zhang, Aston Zhang, Mu Li, Hai Zhao, George Karypis, Alex Smola
- Can Large Language Models Transform Computational Social Science? Ziems Caleb, Held William, Shaikh Omar, Chen Jiaao, Zhang Zhehao, Yang Diyi
- Recommender Systems In The Era Of Large Language Models (llms) Zihuai Zhao, Wenqi Fan, Jiatong Li, Yunqing Liu, Xiaowei Mei, Yiqi Wang, Zhen Wen, Fei Wang, Xiangyu Zhao, Jiliang Tang, Qing Li
- Incorporating Probing Signals Into Multimodal Machine Translation Via Visual Question-answering Pairs Zuo Yuxin, Li Bei, Lv Chuanhao, Zheng Tong, Xiao Tong, Zhu Jingbo
- Bcamirs At Semeval-2024 Task 4: Beyond Words: A Multimodal And Multilingual Exploration Of Persuasion In Memes Abaskohi Amirhossein, Dabiriaghdam Amirhossein, Wang Lele, Carenini Giuseppe
- Knowledge-infused Llm-powered Conversational Health Agent: A Case Study For Diabetes Patients Abbasian Mahyar, Yang Zhongqi, Khatibi Elahe, Zhang Pengfei, Nagesh Nitish, Azimi Iman, Jain Ramesh, Rahmani Amir M.
- Granite-function Calling Model: Introducing Function Calling Abilities Via Multi-task Learning Of Granular Tasks Abdelaziz Ibrahim, Basu Kinjal, Agarwal Mayank, Kumaravel Sadhana, Stallone Matthew, Panda Rameswar, Rizk Yara, Bhargav Gp, Crouse Maxwell, Gunasekara Chulaka, Ikbal Shajith, Joshi Sachin, Karanam Hima, Kumar Vineet, Munawar Asim, Neelam Sumit, Raghu Dinesh, Sharma Udit, Soria Adriana Meza, Sreedhar Dheeraj, Venkateswaran Praveen, Unuvar Merve, Cox David, Roukos Salim, Lastras Luis, Kapanipathi Pavan
- Are You Still On Track!? Catching LLM Task Drift With Activations Abdelnabi Sahar, Fay Aideen, Cherubin Giovanni, Salem Ahmed, Fritz Mario, Paverd Andrew
- The Challenges Of Evaluating LLM Applications: An Analysis Of Automated, Human, And Llm-based Approaches Abeysinghe Bhashithe, Circi Ruhan
- Infercept: Efficient Intercept Support For Augmented Large Language Model Inference Abhyankar Reyna, He Zijian, Srivatsa Vikranth, Zhang Hao, Zhang Yiying
- Stackrag Agent: Improving Developer Answers With Retrieval-augmented Generation Abrahamyan Davit, Fard Fatemeh H.
- Pathocl: Path-based Prompt Augmentation For OCL Generation With GPT-4 Abukhalaf Seif, Hamdaqa Mohammad, Khomh Foutse
- Attnlrp: Attention-aware Layer-wise Relevance Propagation For Transformers Achtibat Reduan, Hatefi Sayed Mohammad Vakilzadeh, Dreyer Maximilian, Jain Aakriti, Wiegand Thomas, Lapuschkin Sebastian, Samek Wojciech
- Bridging The Bosphorus: Advancing Turkish Large Language Models Through Strategies For Low-resource Language Adaptation And Benchmarking Acikgoz Emre Can, Erdogan Mete, Yuret Deniz
- Copilot Evaluation Harness: Evaluating Llm-guided Software Programming Agarwal Anisha, Chan Aaron, Chandel Shubham, Jang Jinu, Miller Shaun, Moghaddam Roshanak Zilouchian, Mohylevskyy Yevhen, Sundaresan Neel, Tufano Michele
- Faithfulness Vs. Plausibility: On The (un)reliability Of Explanations From Large Language Models Agarwal Chirag, Tanneru Sree Harsha, Lakkaraju Himabindu
- Promptwizard: Task-aware Agent-driven Prompt Optimization Framework Agarwal Eshaan, Dani Vivek, Ganu Tanuja, Nambi Akshay
- Mememqa: Multimodal Question Answering For Memes Via Rationale-based Inferencing Agarwal Siddhant, Sharma Shivam, Nakov Preslav, Chakraborty Tanmoy
- Ethical Reasoning And Moral Value Alignment Of Llms Depend On The Language We Prompt Them In Agarwal Utkarsh, Tanmay Kumar, Khandelwal Aditi, Choudhury Monojit
- Tricy: Trigger-guided Data-to-text Generation With Intent Aware Attention-copy Agarwal Vibhav, Ghosh Sourav, Bss Harichandana, Arora Himanshu, Raja Barath Raj Kandur
- Enabling High-sparsity Foundational Llama Models With Efficient Pretraining And Deployment Agarwalla Abhinav, Gupta Abhay, Marques Alexandre, Pandit Shubhra, Goin Michael, Kurtic Eldar, Leong Kevin, Nguyen Tuan, Salem Mahmoud, Alistarh Dan, Lie Sean, Kurtz Mark
- Look Further Ahead: Testing The Limits Of GPT-4 In Path Planning Aghzal Mohamed, Plaku Erion, Yao Ziyu
- Leapformer: Enabling Linear Transformers For Autoregressive And Simultaneous Tasks Via Learned Proportions Agostinelli Victor, Hong Sanghyun, Chen Lizhong
- Etalon: Holistic Performance Evaluation Framework For LLM Inference Systems Agrawal Amey, Agarwal Anmol, Kedia Nitin, Mohan Jayashree, Kundu Souvik, Kwatra Nipun, Ramjee Ramachandran, Tumanov Alexey
- Mindful-rag: A Study Of Points Of Failure In Retrieval Augmented Generation Agrawal Garima, Kumarage Tharindu, Alghamdi Zeyad, Liu Huan
- LM4OPT: Unveiling The Potential Of Large Language Models In Formulating Mathematical Optimization Problems Ahmed Tasnim, Choudhury Salimur
- Yi: Open Foundation Models By 01.AI 01.AI, Young Alex, Chen Bei, Li Chao, Huang Chengen, Zhang Ge, Zhang Guanwei, Li Heng, Zhu Jiangcheng, Chen Jianqun, Chang Jing, Yu Kaidong, Liu Peng, Liu Qiang, Yue Shawn, Yang Senbin, Yang Shiming, Yu Tao, Xie Wen, Huang Wenhao, Hu Xiaohui, Ren Xiaoyi, Niu Xinyao, Nie Pengcheng, Xu Yuchi, Liu Yudong, Wang Yue, Cai Yuxuan, Gu Zhenyu, Liu Zhiyuan, Dai Zonghong
- Shadowllm: Predictor-based Contextual Sparsity For Large Language Models Akhauri Yash, Abouelhamayed Ahmed F, Dotzel Jordan, Zhang Zhiru, Rush Alexander M, Huda Safeen, Abdelfattah Mohamed S
- FACTS About Building Retrieval Augmented Generation-based Chatbots Akkiraju Rama, Xu Anbang, Bora Deepak, Yu Tan, An Lu, Seth Vishal, Shukla Aaditya, Gundecha Pritam, Mehta Hridhay, Jha Ashwin, Raj Prithvi, Balasubramanian Abhinav, Maram Murali, Muthusamy Guru, Annepally Shivakesh Reddy, Knowles Sidney, Du Min, Burnett Nick, Javiya Sean, Marannan Ashok, Kumari Mamta, Jha Surbhi, Dereszenski Ethan, Chakraborty Anupam, Ranjan Subhash, Terfai Amina, Surya Anoop, Mercer Tracey, Thanigachalam Vinodh Kumar, Bar Tamar, Krishnan Sanjana, Kilaru Samy, Jaksic Jasmine, Algarici Nave, Liberman Jacob, Conway Joey, Nayyar Sonu, Boitano Justin
- Benchmarking Open-source Language Models For Efficient Question Answering In Industrial Applications Alassan Mahaman Sanoussi Yahaya, Espejel Jessica López, Bouhandi Merieme, Dahhane Walid, Ettifouri El Hassane
- User Preferences For Large Language Model Versus Template-based Explanations Of Movie Recommendations: A Pilot Study Albert Julien, Balfroid Martin, Doh Miriam, Bogaert Jeremie, La Fisca Luca, De Vos Liesbet, Renard Bryan, Stragier Vincent, Jean Emmanuel
- PROMPT-SAW: Leveraging Relation-aware Graphs For Textual Prompt Compression Ali Muhammad Asif, Li Zhengping, Yang Shu, Cheng Keyuan, Cao Yang, Huang Tianhao, Hu Lijie, Yu Lu, Wang Di
- A Survey Of Large Language Models For European Languages Ali Wazir, Pyysalo Sampo
- TREC Ikat 2023: A Test Collection For Evaluating Conversational And Interactive Knowledge Assistants Aliannejadi Mohammad, Abbasiantaeb Zahra, Chatterjee Shubham, Dalton Jeffery, Azzopardi Leif
- Evaluating The Retrieval Component In Llm-based Question Answering Systems Alinejad Ashkan, Kumar Krtin, Vahdat Ali
- Toward Conversational Agents With Context And Time Sensitive Long-term Memory Alonso Nick, Figliolia Tomás, Ndirango Anthony, Millidge Beren
- Tower: An Open Multilingual Large Language Model For Translation-related Tasks Alves Duarte M., Pombal José, Guerreiro Nuno M., Martins Pedro H., Alves João, Farajian Amin, Peters Ben, Rei Ricardo, Fernandes Patrick, Agrawal Sweta, Colombo Pierre, De Souza José G. C., Martins André F. T.
- Do Llms Exhibit Human-like Reasoning? Evaluating Theory Of Mind In Llms For Open-ended Responses Amirizaniani Maryam, Martin Elias, Sivachenko Maryna, Mashhadi Afra, Shah Chirag
- Llmauditor: A Framework For Auditing Large Language Models Using Human-in-the-loop Amirizaniani Maryam, Yao Jihan, Lavergne Adrian, Okada Elizabeth Snell, Chadha Aman, Roosta Tanya, Shah Chirag
- Evaluating The Efficacy Of Foundational Models: Advancing Benchmarking Practices To Enhance Fine-tuning Decision-making Amujo Oluyemi Enoch, Yang Shanchieh Jay
- Mm-phyrlhf: Reinforcement Learning Framework For Multimodal Physics Question-answering Anand Avinash, Kapuriya Janak, Kirtani Chhavi, Singh Apoorv, Saraf Jay, Lal Naman, Kumar Jatin, Shivam Adarsh Raj, Verma Astha, Shah Rajiv Ratn, Zimmermann Roger
- Investigating The Robustness Of Llms On Math Word Problems Anantheswaran Ujjwala, Gupta Himanshu, Scaria Kevin, Verma Shreyas, Baral Chitta, Mishra Swaroop
- Tensor Train Low-rank Approximation (tt-lora): Democratizing AI With Accelerated Llms Anjum Afia, Eren Maksim E., Boureima Ismael, Alexandrov Boian, Bhattarai Manish
- Critique-out-loud Reward Models Ankner Zachary, Paul Mansheej, Cui Brandon, Chang Jonathan D., Ammanabrolu Prithviraj
- Evaluating Chatgpt On Nuclear Domain-specific Data Anwar Muhammad, De Costa Mischa, Hammad Issam, Lau Daniel
- A Framework For Fine-tuning Llms Using Heterogeneous Feedback Aponte Ryan, Rossi Ryan A., Guo Shunan, Dernoncourt Franck, Yu Tong, Chen Xiang, Mitra Subrata, Lipka Nedim
- Forklift: An Extensible Neural Lifter Armengol-estapé Jordi, Rocha Rodrigo C. O., Woodruff Jackson, Minervini Pasquale, O'boyle Michael F. P.
- Calmqa: Exploring Culturally Specific Long-form Question Answering Across 23 Languages Arora Shane, Karpinska Marzena, Chen Hung-ting, Bhattacharjee Ipsita, Iyyer Mohit, Choi Eunsol
- Exploring The Impact Of Large Language Models On Recommender Systems: An Extensive Review Arpita Vats, Vinija Jain, Rahul Raja, Aman Chadha
- Enhancing Holonic Architecture With Natural Language Processing For System Of Systems Ashfaq Muhammad, Sadik Ahmed R., Mikkonen Tommi, Waseem Muhammad, Mäkitalo Niko
- Evaluating Large Language Models On The GMAT: Implications For The Future Of Business Education Ashrafimoghari Vahid, Gürkan Necdet, Suchow Jordan W.
- Robustsentembed: Robust Sentence Embeddings Using Adversarial Self-supervised Contrastive Learning Asl Javad Rafiei, Panzade Prajwal, Blanco Eduardo, Takabi Daniel, Cai Zhipeng
- Elicitron: An LLM Agent-based Simulation Framework For Design Requirements Elicitation Ataei Mohammadmehdi, Cheong Hyunmin, Grandi Daniele, Wang Ye, Morris Nigel, Tessier Alexander
- Evaluating The Efficacy Of Open-source Llms In Enterprise-specific RAG Systems: A Comparative Study Of Performance And Scalability B Gautam, Purwar Anupam
- Hybrid Preference Optimization: Augmenting Direct Preference Optimization With Auxiliary Objectives Badrinath Anirudhan, Agarwal Prabhat, Xu Jiajing
- Efficient Model-agnostic Alignment Via Bayesian Persuasion Bai Fengshuo, Wang Mingzhi, Zhang Zhaowei, Chen Boyuan, Xu Yinda, Wen Ying, Yang Yaodong
- Sparsellm: Towards Global Pruning For Pre-trained Language Models Bai Guangji, Li Yijiang, Ling Chen, Kim Kibaek, Zhao Liang
- Souplm: Model Integration In Large Language And Multi-modal Models Bai Yue, Zhang Zichen, Lu Jiasen, Fu Yun
- Finetuning Large Language Model For Personalized Ranking Bai Zhuoxi, Wu Ning, Cai Fengyu, Zhu Xinyi, Xiong Yun
- CYGENT: A Cybersecurity Conversational Agent With Log Summarization Powered By GPT-3 Balasubramanian Prasasthy, Seby Justin, Kostakos Panos
- LAMP: A Language Model On The Map Balsebre Pasquale, Huang Weiming, Cong Gao
- Demystifying Platform Requirements For Diverse LLM Inference Use Cases Bambhaniya Abhimanyu, Raj Ritik, Jeong Geonhwa, Kundu Souvik, Srinivasan Sudarshan, Elavazhagan Midhilesh, Kumar Madhu, Krishna Tushar
- Linguistic Calibration Of Long-form Generations Band Neil, Li Xuechen, Ma Tengyu, Hashimoto Tatsunori
- MAIRA-2: Grounded Radiology Report Generation Bannur Shruthi, Bouzid Kenza, Castro Daniel C., Schwaighofer Anton, Bond-taylor Sam, Ilse Maximilian, Pérez-garcía Fernando, Salvatelli Valentina, Sharma Harshita, Meissen Felix, Ranjit Mercy, Srivastav Shaury, Gong Julia, Falck Fabian, Oktay Ozan, Thieme Anja, Lungren Matthew P., Wetscherek Maria Teodora, Alvarez-valle Javier, Hyland Stephanie L.
- Aligning Large Language Models From Self-reference AI Feedback With One General Principle Bao Rong, Zheng Rui, Dou Shihan, Wang Xiao, Zhou Enyu, Wang Bo, Zhang Qi, Ding Liang, Tao Dacheng
- Transforming Agency. On The Mode Of Existence Of Large Language Models Barandiaran Xabier E., Almendros Lola S.
- Slicerchat: Building A Local Chatbot For 3D Slicer Barr Colton
- Exploring Autonomous Agents Through The Lens Of Large Language Models: A Review Barua Saikat
- STARLING: Self-supervised Training Of Text-based Reinforcement Learning Agent With Large Language Models Basavatia Shreyas, Murugesan Keerthiram, Ratnakar Shivam
- Medaide: Leveraging Large Language Models For On-premise Medical Assistance On Edge Devices Basit Abdul, Hussain Khizar, Hanif Muhammad Abdullah, Shafique Muhammad
- A Comparative Study Of DSL Code Generation: Fine-tuning Vs. Optimized Retrieval Augmentation Bassamzadeh Nastaran, Methani Chhaya
- NESTFUL: A Benchmark For Evaluating Llms On Nested Sequences Of API Calls Basu Kinjal, Abdelaziz Ibrahim, Bradford Kelsey, Crouse Maxwell, Kate Kiran, Kumaravel Sadhana, Goyal Saurabh, Munawar Asim, Rizk Yara, Wang Xin, Lastras Luis, Kapanipathi Pavan
- API-BLEND: A Comprehensive Corpora For Training And Benchmarking API Llms Basu Kinjal, Abdelaziz Ibrahim, Chaudhury Subhajit, Dan Soham, Crouse Maxwell, Munawar Asim, Kumaravel Sadhana, Muthusamy Vinod, Kapanipathi Pavan, Lastras Luis A.
- Understanding Information Storage And Transfer In Multi-modal Large Language Models Basu Samyadeep, Grayson Martin, Morrison Cecily, Nushi Besmira, Feizi Soheil, Massiceti Daniela
- Internalinspector \(I^2\): Robust Confidence Estimation In Llms Through Internal States Beigi Mohammad, Shen Ying, Yang Runing, Lin Zihao, Wang Qifan, Mohan Ankith, He Jianfeng, Jin Ming, Lu Chang-tien, Huang Lifu
- Divergent Creativity In Humans And Large Language Models Bellemare-pepin Antoine, Lespinasse François, Thölke Philipp, Harel Yann, Mathewson Kory, Olson Jay A., Bengio Yoshua, Jerbi Karim
- Luna: An Evaluation Foundation Model To Catch Language Model Hallucinations With High Accuracy And Low Cost Belyi Masha, Friel Robert, Shao Shuai, Sanyal Atindriyo
- Crafting Interpretable Embeddings By Asking Llms Questions Benara Vinamra, Singh Chandan, Morris John X., Antonello Richard, Stoica Ion, Huth Alexander G., Gao Jianfeng
- SUTRA: Scalable Multilingual Language Model Architecture Bendale Abhijit, Sapienza Michael, Ripplinger Steven, Gibbs Simon, Lee Jaewon, Mistry Pranav
- Redefining "hallucination" In Llms: Towards A Psychology-informed Framework For Mitigating Misinformation Berberette Elijah, Hutchins Jack, Sadovnik Amir
- Multi-head RAG: Solving Multi-aspect Problems With Llms Besta Maciej, Kubicek Ales, Niggli Roman, Gerstenberger Robert, Weitzendorf Lucas, Chi Mingyuan, Iff Patrick, Gajda Joanna, Nyczyk Piotr, Müller Jürgen, Niewiadomski Hubert, Chrapek Marcin, Podstawski Michał, Hoefler Torsten
- Taking The Next Step With Generative Artificial Intelligence: The Transformative Role Of Multimodal Large Language Models In Science Education Bewersdorff Arne, Hartmann Christian, Hornberger Marie, Seßler Kathrin, Bannert Maria, Kasneci Enkelejda, Kasneci Gjergji, Zhai Xiaoming, Nerdel Claudia
- Clembench-2024: A Challenging, Dynamic, Complementary, Multilingual Benchmark And Underlying Flexible Framework For Llms As Multi-action Agents Beyer Anne, Chalamalasetti Kranti, Hakimov Sherzod, Madureira Brielen, Sadler Philipp, Schlangen David
- Soft Prompting For Unlearning In Large Language Models Bhaila Karuna, Van Minh-hao, Wu Xintao
- Efficient Reinforcement Learning Via Large Language Model-based Search Bhambri Siddhant, Bhattacharjee Amrita, Liu Huan, Kambhampati Subbarao
- Fintral: A Family Of GPT-4 Level Multimodal Financial Large Language Models Bhatia Gagan, Nagoudi El Moatez Billah, Cavusoglu Hasan, Abdul-mageed Muhammad
- Is Factuality Decoding A Free Lunch For Llms? Evaluation On Knowledge Editing Benchmark Bi Baolong, Liu Shenghua, Wang Yiwei, Mei Lingrui, Cheng Xueqi
- Iterative Refinement Of Project-level Code Context For Precise Code Generation With Compiler Feedback Bi Zhangqian, Wan Yao, Wang Zheng, Zhang Hongyu, Guan Batu, Lu Fangxin, Zhang Zili, Sui Yulei, Jin Hai, Shi Xuanhua
- Beyond Human Norms: Unveiling Unique Values Of Large Language Models Through Interdisciplinary Approaches Biedma Pablo, Yi Xiaoyuan, Huang Linus, Sun Maosong, Xie Xing
- Workarena++: Towards Compositional Planning And Reasoning-based Common Knowledge Work Tasks Boisvert Léo, Thakkar Megh, Gasse Maxime, Caccia Massimo, De Chezelles Thibault Le Sellier, Cappart Quentin, Chapados Nicolas, Lacoste Alexandre, Drouin Alexandre
- Rambla: A Framework For Evaluating The Reliability Of Llms As Assistants In The Biomedical Domain Bolton William James, Poyiadzi Rafael, Morrell Edward R., Bueno Gabriela Van Bergen Gonzalez, Goetz Lea
- Windows Agent Arena: Evaluating Multi-modal OS Agents At Scale Bonatti Rogerio, Zhao Dan, Bonacci Francesco, Dupont Dillon, Abdali Sara, Li Yinheng, Wagle Justin, Koishida Kazuhito, Bucker Arthur, Jang Lawrence, Hui Zack
- Low-rank Quantization-aware Training For Llms Bondarenko Yelysei, Del Chiaro Riccardo, Nagel Markus
- Embardiment: An Embodied AI Agent For Productivity In XR Bovo Riccardo, Abreu Steven, Ahuja Karan, Gonzalez Eric J, Cheng Li-te, Gonzalez-franco Mar
- FIHA: Autonomous Hallucination Evaluation In Vision-language Models With Davidson Scene Graphs Bowen Yan, Zhengsong Zhang, Liqiang Jing, Eftekhar Hossain, Xinya Du
- Exploring Precision And Recall To Assess The Quality And Diversity Of Llms Bronnec Florian Le, Verine Alexandre, Negrevergne Benjamin, Chevaleyre Yann, Allauzen Alexandre
- Generation, Distillation And Evaluation Of Motivational Interviewing-style Reflections With A Foundational Language Model Brown Andrew, Zhu Jiading, Abdelwahab Mohamed, Dong Alec, Wang Cindy, Rose Jonathan
- Enhancing Trust In Llms: Algorithms For Comparing And Interpreting Llms Brown Nik Bear
- Fine-tuned 'small' Llms (still) Significantly Outperform Zero-shot Generative AI Models In Text Classification Bucher Martin Juan José, Martini Marco
- Hierarchical Prompting Taxonomy: A Universal Evaluation Framework For Large Language Models Budagam Devichand, Kj Sankalp, Kumar Ashutosh, Jain Vinija, Chadha Aman
- Pheme: Efficient And Conversational Speech Generation Budzianowski Paweł, Sereda Taras, Cichy Tomasz, Vulić Ivan
- Conversational Complexity For Assessing Risk In Large Language Models Burden John, Cebrian Manuel, Hernandez-orallo Jose
- Aligning Large Language Models With Counterfactual DPO Butcher Bradley
- TACT: Advancing Complex Aggregative Reasoning With Information Extraction Tools Caciularu Avi, Jacovi Alon, Ben-david Eyal, Goldshtein Sasha, Schuster Tal, Herzig Jonathan, Elidan Gal, Globerson Amir
- Low-cost Generation And Evaluation Of Dictionary Example Sentences Cai Bill, Ng Clarence Boon Liang, Tan Daniel, Hotama Shelvia
- On The Transformations Across Reward Model, Parameter Update, And In-context Prompt Cai Deng, Li Huayang, Fu Tingchen, Li Siheng, Xu Weiwen, Li Shuaiyi, Cao Bowen, Zhang Zhisong, Huang Xinting, Cui Leyang, Wang Yan, Liu Lemao, Watanabe Taro, Shi Shuming
- Matryoshka Multimodal Models Cai Mu, Yang Jianwei, Gao Jianfeng, Lee Yong Jae
- Flextron: Many-in-one Flexible Large Language Model Cai Ruisi, Muralidharan Saurav, Heinrich Greg, Yin Hongxu, Wang Zhangyang, Kautz Jan, Molchanov Pavlo
- Lococo: Dropping In Convolutions For Long Context Compression Cai Ruisi, Tian Yuandong, Wang Zhangyang, Chen Beidi
- Medusa: Simple LLM Inference Acceleration Framework With Multiple Decoding Heads Cai Tianle, Li Yuhong, Geng Zhengyang, Peng Hongwu, Lee Jason D., Chen Deming, Dao Tri
- Towards Logically Consistent Language Models Via Probabilistic Reasoning Calanzone Diego, Teso Stefano, Vergari Antonio
- XAI Meets Llms: A Survey Of The Relation Between Explainable AI And Large Language Models Cambria Erik, Malandri Lorenzo, Mercorio Fabio, Nobani Navid, Seveso Andrea
- Structeval: Deepen And Broaden Large Language Model Assessment Via Structured Evaluation Cao Boxi, Ren Mengjie, Lin Hongyu, Han Xianpei, Zhang Feng, Zhan Junfeng, Sun Le
- PRESTO: Progressive Pretraining Enhances Synthetic Chemistry Outcomes Cao He, Shao Yanjun, Liu Zhiyuan, Liu Zijing, Tang Xiangru, Yao Yuan, Li Yu
- MADTP: Multimodal Alignment-guided Dynamic Token Pruning For Accelerating Vision-language Transformer Cao Jianjian, Ye Peng, Li Shengze, Yu Chong, Tang Yansong, Lu Jiwen, Chen Tao
- Genception: Evaluate Multimodal Llms With Unlabeled Unimodal Data Cao Lele, Buchner Valentin, Senane Zineb, Yang Fangkai
- Beyond Sparse Rewards: Enhancing Reinforcement Learning With Language Model Critique In Text Generation Cao Meng, Shu Lei, Yu Lei, Zhu Yun, Wichers Nevan, Liu Yinxiao, Meng Lei
- Graphinsight: Unlocking Insights In Large Language Models For Graph Structure Understanding Cao Yukun, Han Shuo, Gao Zengyi, Ding Zezhong, Xie Xike, Zhou S. Kevin
- Can Formal Argumentative Reasoning Enhance Llms Performances? Castagna Federico, Sassoon Isabel, Parsons Simon
- HAMMR: Hierarchical Multimodal React Agents For Generic VQA Castrejon Lluis, Mensink Thomas, Zhou Howard, Ferrari Vittorio, Araujo Andre, Uijlings Jasper
- Smileyllama: Modifying Large Language Models For Directed Chemical Space Exploration Cavanagh Joseph M., Sun Kunyang, Gritsevskiy Andrew, Bagni Dorian, Bannister Thomas D., Head-gordon Teresa
- Towards Robust And Cost-efficient Knowledge Unlearning For Large Language Models Cha Sungmin, Cho Sungjun, Hwang Dasol, Lee Moontae
- Language Models As Compilers: Simulating Pseudocode Execution Improves Algorithmic Reasoning In Language Models Chae Hyungjoo, Kim Yeonghyeon, Kim Seungone, Ong Kai Tzu-iunn, Kwak Beong-woo, Kim Moohyeon, Kim Seonghwan, Kwon Taeyoon, Chung Jiwan, Yu Youngjae, Yeo Jinyoung
- Dual Modalities Of Text: Visual And Textual Generative Pre-training Chai Yekun, Liu Qingyi, Xiao Jingwu, Wang Shuohuan, Sun Yu, Wu Hua
- AMEX: Android Multi-annotation Expo Dataset For Mobile GUI Agents Chai Yuxiang, Huang Siyuan, Niu Yazhe, Xiao Han, Liu Liang, Zhang Dingyu, Gao Peng, Ren Shuai, Li Hongsheng
- An Expert Is Worth One Token: Synergizing Multiple Expert Llms As Generalist Via Expert Token Routing Chai Ziwei, Wang Guoyin, Su Jing, Zhang Tianjie, Huang Xuanwen, Wang Xuwu, Xu Jingjing, Yuan Jianbo, Yang Hongxia, Wu Fei, Yang Yang
- Dense Reward For Free In Reinforcement Learning From Human Feedback Chan Alex J., Sun Hao, Holt Samuel, Van Der Schaar Mihaela
- Prompto: An Open Source Library For Asynchronous Querying Of LLM Endpoints Chan Ryan Sze-yin, Nanni Federico, Brown Edwin, Chapman Ed, Williams Angus R., Bright Jonathan, Gabasova Evelina
- Scaling Synthetic Data Creation With 1,000,000,000 Personas Chan Xin, Wang Xiaoyang, Yu Dian, Mi Haitao, Yu Dong
- AMGPT: A Large Language Model For Contextual Querying In Additive Manufacturing Chandrasekhar Achuth, Chan Jonathan, Ogoke Francis, Ajenifujah Olabode, Farimani Amir Barati
- Benchmarking Cognitive Domains For Llms: Insights From Taiwanese Hakka Culture Chang Chen-chi, Chen Ching-yuan, Lee Hung-shin, Lee Chih-cheng
- Integrating Emotional And Linguistic Models For Ethical Compliance In Large Language Models Chang Edward Y.
- Speechprompt: Prompting Speech Language Models For Speech Processing Tasks Chang Kai-wei, Wu Haibin, Wang Yu-kai, Wu Yuan-kuei, Shen Hua, Tseng Wei-cheng, Kang Iu-thing, Li Shang-wen, Lee Hung-yi
- Xprompt: Explaining Large Language Model's Generation Via Joint Prompt Attribution Chang Yurui, Cao Bochuan, Wang Yujia, Chen Jinghui, Lin Lu
- When Large Language Models Meet Evolutionary Algorithms Chao Wang, Zhao Jiaxuan, Jiao Licheng, Li Lingling, Liu Fang, Yang Shuyuan
- Make Large Language Model A Better Ranker Chao Wenshuo, Zheng Zhi, Zhu Hengshu, Liu Hao
- Notellm: A Retrievable Large Language Model For Note Recommendation Chao Zhang, Shiwei Wu, Haoxin Zhang, Tong Xu, Yan Gao, Yao Hu, Di Wu, Enhong Chen
- Quantifying The Gain In Weak-to-strong Generalization Charikar Moses, Pabbaraju Chirag, Shiragur Kirankumar
- Phantom: General Trigger Attacks On Retrieval Augmented Language Generation Chaudhari Harsh, Severi Giorgio, Abascal John, Jagielski Matthew, Choquette-choo Christopher A., Nasr Milad, Nita-rotaru Cristina, Oprea Alina
- Quantitative Certification Of Bias In Large Language Models Chaudhary Isha, Hu Qian, Kumar Manoj, Ziyadi Morteza, Gupta Rahul, Singh Gagandeep
- Large Language Models As Instruments Of Power: New Regimes Of Autonomous Manipulation And Control Chaudhary Yaqub, Penn Jonnie
- DUAL-REFLECT: Enhancing Large Language Models For Reflective Translation Through Dual Learning Feedback Mechanisms Chen Andong, Lou Lianzhang, Chen Kehai, Bai Xuefeng, Xiang Yang, Yang Muyun, Zhao Tiejun, Zhang Min
- Model Composition For Multimodal Large Language Models Chen Chi, Du Yiyang, Fang Zheng, Wang Ziyue, Luo Fuwen, Li Peng, Yan Ming, Zhang Ji, Huang Fei, Sun Maosong, Liu Yang
- Livemind: Low-latency Large Language Models With Simultaneous Inference Chen Chuangtao, Zhang Grace Li, Yin Xunzhao, Zhuo Cheng, Schlichtmann Ulf, Li Bing
- Lower Layer Matters: Alleviating Hallucination Via Multi-layer Fusion Contrastive Decoding With Truthfulness Refocused Chen Dingwei, Fang Feiteng, Ni Shiwen, Liang Feng, Xu Ruifeng, Yang Min, Li Chengming
- Self-cognition In Large Language Models: An Exploratory Study Chen Dongping, Shi Jiawen, Wan Yao, Zhou Pan, Gong Neil Zhenqiang, Sun Lichao
- Generalizing Conversational Dense Retrieval Via Llm-cognition Data Augmentation Chen Haonan, Dou Zhicheng, Mao Kelong, Liu Jiongnan, Zhao Ziliang
- Lmstyle Benchmark: Evaluating Text Style Transfer For Chatbots Chen Jianlin
- Automated Data Curation For Robust Language Model Fine-tuning Chen Jiuhai, Mueller Jonas
- Putting People In Llms' Shoes: Generating Better Answers Via Question Rewriter Chen Junhao, Wang Bowen, Jiang Zhouqiang, Nakashima Yuta
- Nlperturbator: Studying The Robustness Of Code Llms To Natural Language Variations Chen Junkai, Li Zhenhao, Hu Xing, Xia Xin
- Reasoning Runtime Behavior Of A Program With LLM: How Far Are We? Chen Junkai, Pan Zhiyuan, Hu Xing, Li Zhenhao, Li Ge, Xia Xin
- Pixart-Σ: Weak-to-strong Training Of Diffusion Transformer For 4K Text-to-image Generation Chen Junsong, Ge Chongjian, Xie Enze, Wu Yue, Yao Lewei, Ren Xiaozhe, Wang Zhongdao, Luo Ping, Lu Huchuan, Li Zhenguo
- Cost-effective Instruction Learning For Pathology Vision And Language Analysis Chen Kaitao, Liu Mianxin, Yan Fang, Ma Lei, Shi Xiaoming, Wang Lilong, Wang Xiaosong, Zhu Lifeng, Wang Zhe, Zhou Mu, Zhang Shaoting
- Machine Unlearning In Large Language Models Chen Kongyang, Wang Zixin, Mi Bing, Liu Waixi, Wang Shaowei, Ren Xiaojun, Shen Jiaxing
- Instructioncp: A Fast Approach To Transfer Large Language Models Into Target Language Chen Kuang-ming, Lee Hung-yi
- Pca-bench: Evaluating Multimodal Large Language Models In Perception-cognition-action Chain Chen Liang, Zhang Yichi, Ren Shuhuai, Zhao Haozhe, Cai Zefan, Wang Yuchi, Wang Peiyi, Meng Xiangdi, Liu Tianyu, Chang Baobao
- Identifying Query-relevant Neurons In Large Language Models For Long-form Texts Chen Lihu, Dejl Adam, Toni Francesca
- Are We On The Right Way For Evaluating Large Vision-language Models? Chen Lin, Li Jinsong, Dong Xiaoyi, Zhang Pan, Zang Yuhang, Chen Zehui, Duan Haodong, Wang Jiaqi, Qiao Yu, Lin Dahua, Zhao Feng
- Quantifying And Mitigating Unimodal Biases In Multimodal Large Language Models: A Causal Perspective Chen Meiqi, Cao Yixin, Zhang Yan, Lu Chaochao
- Jumpcoder: Go Beyond Autoregressive Coder Via Online Modification Chen Mouxiang, Tian Hao, Liu Zhongxin, Ren Xiaoxue, Sun Jianling
- Graphwiz: An Instruction-following Language Model For Graph Problems Chen Nuo, Li Yuhan, Tang Jianheng, Li Jia
- WSI-VQA: Interpreting Whole Slide Images By Generative Visual Question Answering Chen Pingyi, Zhu Chenglu, Zheng Sunyi, Li Honglin, Yang Lin
- Llm-based Multi-hop Question Answering With Knowledge Graph Integration In Evolving Environments Chen Ruirui, Jiang Weifeng, Qin Chengwei, Rawal Ishaan Singh, Tan Cheston, Choi Dongkyu, Xiong Bo, Ai Bo
- Apollonion: Profile-centric Dialog Agent Chen Shangyu, Zhao Zibo, Zhao Yuanyuan, Li Xiang
- Advancing Tool-augmented Large Language Models: Integrating Insights From Errors In Inference Trees Chen Sijia, Wang Yibo, Wu Yi-feng, Chen Qing-guo, Xu Zhao, Luo Weihua, Zhang Kaifu, Zhang Lijun
- Octopus: On-device Language Model For Function Calling Of Software Apis Chen Wei, Li Zhiyuan, Ma Mingyuan
- Intelligent Tutor: Leveraging Chatgpt And Microsoft Copilot Studio To Deliver A Generative AI Student Support And Feedback System Within Teams Chen Wei-yu
- Reprompt: Planning By Automatic Prompt Engineering For Large Language Models Agents Chen Weizhe, Koenig Sven, Dilkina Bistra
- SATO: Stable Text-to-motion Framework Chen Wenshuo, Xiao Hongru, Zhang Erhang, Hu Lijie, Wang Lei, Liu Mengyuan, Chen Chen
- Icleval: Evaluating In-context Learning Ability Of Large Language Models Chen Wentong, Lin Yankai, Zhou Zhenhao, Huang Hongyun, Jia Yantao, Cao Zhao, Wen Ji-rong
- Llast: Improved End-to-end Speech Translation System Leveraged By Large Language Models Chen Xi, Zhang Songyang, Bai Qibing, Chen Kai, Nakamura Satoshi
- Eyegpt: Ophthalmic Assistant With Large Language Models Chen Xiaolan, Zhao Ziwei, Zhang Weiyi, Xu Pusheng, Gao Le, Xu Mingpu, Wu Yue, Li Yinwen, Shi Danli, He Mingguang
- Hiqa: A Hierarchical Contextual Augmentation RAG For Massive Documents QA Chen Xinyue, Gao Pengyu, Song Jiangjiang, Tan Xiaoyang
- Iteralign: Iterative Constitutional Alignment Of Large Language Models Chen Xiusi, Wen Hongzhi, Nag Sreyashi, Luo Chen, Yin Qingyu, Li Ruirui, Li Zheng, Wang Wei
- Take The Bull By The Horns: Hard Sample-reweighted Continual Training Improves LLM Generalization Chen Xuxi, Wang Zhendong, Sow Daouda, Yang Junjie, Chen Tianlong, Liang Yingbin, Zhou Mingyuan, Wang Zhangyang
- NACL: A General And Effective KV Cache Eviction Framework For Llms At Inference Time Chen Yilong, Wang Guoxia, Shang Junyuan, Cui Shiyao, Zhang Zhenyu, Liu Tingwen, Wang Shuohuan, Sun Yu, Yu Dianhai, Wu Hua
- See What Llms Cannot Answer: A Self-challenge Framework For Uncovering LLM Weaknesses Chen Yulong, Liu Yang, Yan Jianhao, Bai Xuefeng, Zhong Ming, Yang Yinghao, Yang Ziyi, Zhu Chenguang, Zhang Yue
- Agent-flan: Designing Data And Methods Of Effective Agent Tuning For Large Language Models Chen Zehui, Liu Kuikun, Wang Qiuchen, Zhang Wenwei, Liu Jiangning, Lin Dahua, Chen Kai, Zhao Feng
- Towards Boosting Llms-driven Relevance Modeling With Progressive Retrieved Behavior-augmented Prompting Chen Zeyuan, Wu Haiyan, Wu Kaixin, Chen Wei, Zhong Mingjie, Xu Jia, Liu Zhongyi, Zhang Wei
- MLLM Is A Strong Reranker: Advancing Multimodal Retrieval-augmented Generation Via Knowledge-enhanced Reranking And Noise-injected Training Chen Zhanpeng, Xu Chengjin, Qi Yiyan, Guo Jian
- Autoprm: Automating Procedural Supervision For Multi-step Reasoning Via Controllable Question Decomposition Chen Zhaorun, Zhao Zhuokai, Zhu Zhihong, Zhang Ruiqi, Li Xiang, Raj Bhiksha, Yao Huaxiu
- What Are The Essential Factors In Crafting Effective Long Context Multi-hop Instruction Datasets? Insights And Best Practices Chen Zhi, Chen Qiguang, Qin Libo, Guo Qipeng, Lv Haijun, Zou Yicheng, Che Wanxiang, Yan Hang, Chen Kai, Lin Dahua
- Dia-llama: Towards Large Language Model-driven CT Report Generation Chen Zhixuan, Luo Luyang, Bie Yequan, Chen Hao
- "In Dialogues We Learn": Towards Personalized Dialogue Without Pre-defined Profiles Through In-dialogue Learning Cheng Chuanqi, Tu Quan, Wu Wei, Shang Shuo, Mao Cunli, Yu Zhengtao, Yan Rui
- Empowering Large Language Models On Robotic Manipulation With Affordance Prompting Cheng Guangran, Zhang Chuheng, Cai Wenzhe, Zhao Li, Sun Changyin, Bian Jiang
- Structure Guided Prompt: Instructing Large Language Model In Multi-step Reasoning By Exploring Graph Structure Of The Text Cheng Kewei, Ahmed Nesreen K., Willke Theodore, Sun Yizhou
- Can AI Assistants Know What They Don't Know? Cheng Qinyuan, Sun Tianxiang, Liu Xiangyang, Zhang Wenwei, Yin Zhangyue, Li Shimin, Li Linyang, He Zhengfu, Chen Kai, Qiu Xipeng
- Call Me When Necessary: Llms Can Efficiently And Faithfully Reason Over Structured Environments Cheng Sitao, Zhuang Ziyuan, Xu Yong, Yang Fangkai, Zhang Chaoyun, Qin Xiaoting, Huang Xiang, Chen Ling, Lin Qingwei, Zhang Dongmei, Rajmohan Saravan, Zhang Qi
- Small Agent Can Also Rock! Empowering Small Language Models As Hallucination Detector Cheng Xiaoxue, Li Junyi, Zhao Wayne Xin, Zhang Hongzhi, Zhang Fuzheng, Zhang Di, Gai Kun, Wen Ji-rong
- Multi-prompting Decoder Helps Better Language Understanding Cheng Zifeng, Chen Zhaoling, Jiang Zhiwei, Yin Yafeng, Ge Shiping, Liu Yuliang, Gu Qing
- (A)I Am Not A Lawyer, But...: Engaging Legal Experts Towards Responsible LLM Policies For Legal Advice Cheong Inyoung, Xia King, Feng K. J. Kevin, Chen Quan Ze, Zhang Amy X.
- ANOLE: An Open, Autoregressive, Native Large Multimodal Models For Interleaved Image-text Generation Chern Ethan, Su Jiadi, Ma Yan, Liu Pengfei
- Can Large Language Models Be Trusted For Evaluation? Scalable Meta-evaluation Of Llms As Evaluators Via Agent Debate Chern Steffi, Chern Ethan, Neubig Graham, Liu Pengfei
- Walert: Putting Conversational Search Knowledge Into Action By Building And Evaluating A Large Language Model-powered Chatbot Cherumanal Sachin Pathiyan, Tian Lin, Abushaqra Futoon M., De Paula Angel Felipe Magnossao, Ji Kaixin, Hettiachchi Danula, Trippas Johanne R., Ali Halil, Scholer Falk, Spina Damiano
- Lamsum: Creating Extractive Summaries Of User Generated Content Using Llms Chhikara Garima, Sharma Anurag, Gurucharan V., Ghosh Kripabandhu, Chakraborty Abhijnan
- Chatbot Arena: An Open Platform For Evaluating Llms By Human Preference Chiang Wei-lin, Zheng Lianmin, Sheng Ying, Angelopoulos Anastasios Nikolas, Li Tianle, Li Dacheng, Zhang Hao, Zhu Banghua, Jordan Michael, Gonzalez Joseph E., Stoica Ion
- Human-centered Llm-agent User Interface: A Position Paper Chin Daniel, Wang Yuxuan, Xia Gus
- A Computational Framework For Behavioral Assessment Of LLM Therapists Chiu Yu Ying, Sharma Ashish, Lin Inna Wanyin, Althoff Tim
- SPAR: Personalized Content-based Recommendation Via Long Engagement Attention Chiyu Zhang, Yifei Sun, Jun Chen, Jie Lei, Muhammad Abdul-mageed, Sinong Wang, Rong Jin, Sem Park, Ning Yao, Bo Long
- Picle: Eliciting Diverse Behaviors From Large Language Models With Persona In-context Learning Choi Hyeong Kyu, Li Yixuan
- SNAP: Unlearning Selective Knowledge In Large Language Models With Negative Instructions Choi Minseok, Rim Daniel, Lee Dohyun, Choo Jaegul
- Cross-architecture Transfer Learning For Linear-cost Inference Transformers Choi Sehyun
- When Life Gives You Llms, Make LLM-ADE: Large Language Models With Adaptive Data Engineering Choi Stephen, Gazeley William
- Cohesive Conversations: Enhancing Authenticity In Multi-agent Simulated Dialogues Chu Kuanchao, Chen Yi-pei, Nakayama Hideki
- Think Together And Work Better: Combining Humans' And Llms' Think-aloud Outcomes For Effective Text Evaluation Chu Seongyeub, Kim Jongwoo, Yi Munyong
- Improve Temporal Awareness Of Llms For Sequential Recommendation Chu Zhendong, Wang Zichao, Zhang Ruiyi, Ji Yangfeng, Wang Hongning, Sun Tong
- Beamaggr: Beam Aggregation Reasoning Over Multi-source Knowledge For Multi-hop Question Answering Chu Zheng, Chen Jingchang, Chen Qianglong, Wang Haotian, Zhu Kun, Du Xiyuan, Yu Weijiang, Liu Ming, Qin Bing
- A Causal Explainable Guardrails For Large Language Models Chu Zhixuan, Wang Yan, Li Longfei, Wang Zhibo, Qin Zhan, Ren Kui
- Professional Agents -- Evolving Large Language Models Into Autonomous Experts With Human-level Competencies Chu Zhixuan, Wang Yan, Zhu Feng, Yu Lu, Li Longfei, Gu Jinjie
- Beyond Llms: Advancing The Landscape Of Complex Reasoning Chu-carroll Jennifer, Beck Andrew, Burnham Greg, Melville David Os, Nachman David, Özcan A. Erdem, Ferrucci David
- Learning To Compress Prompt In Natural Language Formats Chuang Yu-neng, Xing Tianwei, Chang Chia-yuan, Liu Zirui, Chen Xun, Hu Xia
- Developing Generative AI Chatbots Conceptual Framework For Higher Education Chukwuere Joshua Ebere
- Informed AI Regulation: Comparing The Ethical Frameworks Of Leading LLM Chatbots Using An Ethics-based Audit To Assess Moral Reasoning And Normative Values Chun Jon, Elkins Katherine
- Representations As Language: An Information-theoretic Framework For Interpretability Conklin Henry, Smith Kenny
- Iscore: Visual Analytics For Interpreting How Language Models Automatically Score Summaries Coscia Adam, Holmes Langdon, Morris Wesley, Choi Joon Suh, Crossley Scott, Endert Alex
- Gamebench: Evaluating Strategic Reasoning Abilities Of LLM Agents Costarelli Anthony, Allen Mat, Hauksson Roman, Sodunke Grace, Hariharan Suhas, Cheng Carlson, Li Wenjie, Clymer Joshua, Yadav Arjun
- Phaseevo: Towards Unified In-context Prompt Optimization For Large Language Models Cui Wendi, Zhang Jiaxin, Li Zhuohang, Sun Hao, Lopez Damien, Das Kamalika, Malin Bradley, Kumar Sricharan
- Can AI Replace Human Subjects? A Large-scale Replication Of Psychological Experiments With Llms Cui Ziyan, Li Ning, Zhou Huaikang
- Chatgpt Vs Llama: Impact, Reliability, And Challenges In Stack Overflow Discussions Da Silva Leuson, Samhi Jordan, Khomh Foutse
- Large Legal Fictions: Profiling Legal Hallucinations In Large Language Models Dahl Matthew, Magesh Varun, Suzgun Mirac, Ho Daniel E.
- Muap: Multi-step Adaptive Prompt Learning For Vision-language Model With Missing Modality Dai Ruiting, Tan Yuqiao, Mo Lisi, He Tao, Qin Ke, Liang Shuang
- Inference To The Best Explanation In Large Language Models Dalal Dhairya, Valentino Marco, Freitas André, Buitelaar Paul
- A Complete Survey On Llm-based AI Chatbots Dam Sumit Kumar, Hong Choong Seon, Qiao Yu, Zhang Chaoning
- Speechverse: A Large-scale Generalizable Audio Language Model Das Nilaksh, Dingliwal Saket, Ronanki Srikanth, Paturi Rohit, Huang Zhaocheng, Mathur Prashant, Yuan Jie, Bekal Dhanush, Niu Xing, Jayanthi Sai Muralidhar, Li Xilai, Mundnich Karel, Sunkara Monica, Srinivasan Sundararajan, Han Kyu J, Kirchhoff Katrin
- Active Preference Optimization For Sample Efficient RLHF Das Nirjhar, Chakraborty Souradip, Pacchiano Aldo, Chowdhury Sayak Ray
- Xmainframe: A Large Language Model For Mainframe Modernization Dau Anh T. V., Dao Hieu Trung, Nguyen Anh Tuan, Tran Hieu Trung, Nguyen Phong X., Bui Nghi D. Q.
- Investigating Symbolic Capabilities Of Large Language Models Dave Neisarg, Kifer Daniel, Giles C. Lee, Mali Ankur
- Aligning Model Evaluations With Human Preferences: Mitigating Token Count Bias In Language Model Assessments Daynauth Roland, Mars Jason
- Application Of GPT Language Models For Innovation In Activities In University Teaching De Buenaga Manuel, Bueno Francisco Javier
- Towards Transparency: Exploring LLM Trainings Datasets Through Visual Topic Modeling And Semantic Frame De Dampierre Charles, Mogoutov Andrei, Baumard Nicolas
- Retail-gpt: Leveraging Retrieval Augmented Generation (RAG) For Building E-commerce Chat Assistants De Freitas Bruno Amaral Teixeira, Lotufo Roberto De Alencar
- Understanding Biases In Chatgpt-based Recommender Systems: Provider Fairness, Temporal Stability, And Recency Deldjoo Yashar
- MIMIR: A Streamlined Platform For Personalized Agent Tuning In Domain Expertise Deng Chunyuan, Tang Xiangru, Zhao Yilun, Wang Hanming, Wang Haoran, Zhou Wangchunshu, Cohan Arman, Gerstein Mark
- Pandora: Jailbreak Gpts By Retrieval Augmented Generation Poisoning Deng Gelei, Liu Yi, Wang Kailong, Li Yuekang, Zhang Tianwei, Liu Yang
- Can LLM Be A Good Path Planner Based On Prompt Engineering? Mitigating The Hallucination For Path Planning Deng Hourui, Zhang Hongjie, Ou Jie, Feng Chaosheng
- On The Multi-turn Instruction Following For Conversational Web Agents Deng Yang, Zhang Xuan, Zhang Wenxuan, Yuan Yifei, Ng See-kiong, Chua Tat-seng
- Attention Is Naturally Sparse With Gaussian Distributed Input Deng Yichuan, Song Zhao, Yang Chiwun
- LMGT: Optimizing Exploration-exploitation Balance In Reinforcement Learning Through Language Model Guided Trade-offs Deng Yongxin, Qiu Xihe, Tan Xiaoyu, Chu Wei, Xu Yinghui
- Cognidual Framework: Self-training Large Language Models Within A Dual-system Theoretical Framework For Improving Cognitive Tasks Deng Yongxin, Qiu Xihe, Tan Xiaoyu, Qu Chao, Pan Jing, Cheng Yuan, Xu Yinghui, Chu Wei
- Garak: A Framework For Security Probing Large Language Models Derczynski Leon, Galinkin Erick, Martin Jeffrey, Majumdar Subho, Inie Nanna
- Glam: Fine-tuning Large Language Models For Domain Knowledge Graph Alignment Via Neighborhood Partitioning And Generative Subgraph Encoding Dernbach Stefan, Agarwal Khushbu, Zuniga Alejandro, Henry Michael, Choudhury Sutanay
- Class-level Code Generation From Natural Language Using Iterative, Tool-enhanced Reasoning Over Repository Deshpande Ajinkya, Agarwal Anmol, Shet Shashank, Iyer Arun, Kanade Aditya, Bairi Ramakrishna, Parthasarathy Suresh
- GNOME: Generating Negotiations Through Open-domain Mapping Of Exchanges Deshpande Darshan, Sinha Shambhavi, Kumar Anirudh Ravi, Pal Debaditya, May Jonathan
- Shaping Human-ai Collaboration: Varied Scaffolding Levels In Co-writing With Language Models Dhillon Paramveer S., Molaei Somayeh, Li Jiaqi, Golub Maximilian, Zheng Shaochun, Robert Lionel P.
- AI Does Not Alter Perceptions Of Text Messages Diamond N'yoma
- Learning How To Ask: Cycle-consistency Refines Prompts In Multimodal Foundation Models Diesendruck Maurice, Lin Jianzhe, Imani Shima, Mahalingam Gayathri, Xu Mingyang, Zhao Jie
- Data Augmentation Using Large Language Models: Data Perspectives, Learning Paradigms And Challenges Ding Bosheng, Qin Chengwei, Zhao Ruochen, Luo Tianze, Li Xinze, Chen Guizhen, Xia Wenhan, Hu Junjie, Luu Anh Tuan, Joty Shafiq
- Large Language Model Agent In Financial Trading: A Survey Ding Han, Li Yinheng, Wang Junhao, Chen Hang
- Mastering Text, Code And Math Simultaneously Via Fusing Highly Specialized Language Models Ding Ning, Chen Yulin, Cui Ganqu, Lv Xingtai, Zhao Weilin, Xie Ruobing, Zhou Bowen, Liu Zhiyuan, Sun Maosong
- Finding Blind Spots In Evaluator Llms With Interpretable Checklists Doddapaneni Sumanth, Khan Mohammed Safi Ur Rahman, Verma Sshubam, Khapra Mitesh M.
- A Tale Of Tails: Model Collapse As A Change Of Scaling Laws Dohmatob Elvis, Feng Yunzhen, Yang Pu, Charton Francois, Kempe Julia
- Understand What LLM Needs: Dual Preference Alignment For Retrieval-augmented Generation Dong Guanting, Zhu Yutao, Zhang Chenghao, Wang Zechen, Dou Zhicheng, Wen Ji-rong
- RLHF Workflow: From Reward Modeling To Online RLHF Dong Hanze, Xiong Wei, Pang Bo, Wang Haoxiang, Zhao Han, Zhou Yingbo, Jiang Nan, Sahoo Doyen, Xiong Caiming, Zhang Tong
- Pruner-zero: Evolving Symbolic Pruning Metric From Scratch For Large Language Models Dong Peijie, Li Lujun, Tang Zhenheng, Liu Xiang, Pan Xinglin, Wang Qiang, Chu Xiaowen
- Contrans: Weak-to-strong Alignment Engineering Via Concept Transplantation Dong Weilong, Wu Xinwei, Jin Renren, Xu Shaoyang, Xiong Deyi
- Disclosure And Mitigation Of Gender Bias In Llms Dong Xiangjue, Wang Yibo, Yu Philip S., Caverlee James
- A Framework For Real-time Safeguarding The Text Generation Of Large Language Model Dong Ximing, Lin Dayi, Wang Shaowei, Hassan Ahmed E.
- Unmemorization In Large Language Models Via Self-distillation And Deliberate Imagination Dong Yijiang River, Lin Hongzhou, Belkin Mikhail, Huerta Ramon, Vulić Ivan
- Jailbreaking Text-to-image Models With Llm-based Agents Dong Yingkai, Li Zheng, Meng Xiangtao, Yu Ning, Guo Shanqing
- Prompt Your Brain: Scaffold Prompt Tuning For Efficient Adaptation Of Fmri Pre-trained Model Dong Zijian, Wu Yilei, Chen Zijiao, Zhang Yichi, Jin Yueming, Zhou Juan Helen
- Biomedical Large Languages Models Seem Not To Be Superior To Generalist Models On Unseen Medical Data Dorfner Felix J., Dada Amin, Busch Felix, Makowski Marcus R., Han Tianyu, Truhn Daniel, Kleesiek Jens, Sushil Madhumita, Lammert Jacqueline, Adams Lisa C., Bressem Keno K.
- To What Extent Is Chatgpt Useful For Language Teacher Lesson Plan Creation? Dornburg Alex, Davin Kristin
- What's Wrong With Your Code Generated By Large Language Models? An Extensive Study Dou Shihan, Jia Haoxiang, Wu Shenxi, Zheng Huiyuan, Zhou Weikang, Wu Muling, Chai Mingxu, Fan Jessica, Huang Caishuang, Tao Yunbo, Liu Yan, Zhou Enyu, Zhang Ming, Zhou Yuhao, Wu Yueming, Zheng Rui, Wen Ming, Weng Rongxiang, Wang Jingang, Cai Xunliang, Gui Tao, Qiu Xipeng, Zhang Qi, Huang Xuanjing
- Stepcoder: Improve Code Generation With Reinforcement Learning From Compiler Feedback Dou Shihan, Liu Yan, Jia Haoxiang, Xiong Limao, Zhou Enyu, Shen Wei, Shan Junjie, Huang Caishuang, Wang Xiao, Fan Xiaoran, Xi Zhiheng, Zhou Yuhao, Ji Tao, Zheng Rui, Zhang Qi, Huang Xuanjing, Gui Tao
- Assessing The Reasoning Abilities Of Chatgpt In The Context Of Claim Verification Dougrez-lewis John, Akhter Mahmud Elahi, He Yulan, Liakata Maria
- Workarena: How Capable Are Web Agents At Solving Common Knowledge Work Tasks? Drouin Alexandre, Gasse Maxime, Caccia Massimo, Laradji Issam H., Del Verme Manuel, Marty Tom, Boisvert Léo, Thakkar Megh, Cappart Quentin, Vazquez David, Chapados Nicolas, Lacoste Alexandre
- Evaluating Language Model Context Windows: A "Working Memory" Test And Inference-time Correction Dsouza Amanda, Glaze Christopher, Shin Changho, Sala Frederic
- Towards A Copilot In BIM Authoring Tool Using A Large Language Model-based Agent For Intelligent Human-machine Interaction Du Changyu, Nousias Stavros, Borrmann André
- Bitdistiller: Unleashing The Potential Of Sub-4-bit Llms Via Self-distillation Du Dayou, Zhang Yijia, Cao Shijie, Guo Jiaqi, Cao Ting, Chu Xiaowen, Xu Ningyi
- Evidence-enhanced Triplet Generation Framework For Hallucination Alleviation In Generative Question Answering Du Haowei, Zhang Huishuai, Zhao Dongyan
- Internal And External Knowledge Interactive Refinement Framework For Knowledge-intensive Question Answering Du Haowei, Zhao Dongyan
- In-context Learning With Reinforcement Learning For Incomplete Utterance Rewriting Du Haowei, Zhao Dongyan
- Perltqa: A Personal Long-term Memory Dataset For Memory Classification, Retrieval, And Synthesis In Question Answering Du Yiming, Wang Hongru, Zhao Zhengyi, Liang Bin, Wang Baojun, Zhong Wanjun, Wang Zezhong, Wong Kam-fai
- CLEFT: Language-image Contrastive Learning With Efficient Large Language Model And Prompt Fine-tuning Du Yuexi, Chang Brian, Dvornek Nicha C.
- Do Llms Know About Hallucination? An Empirical Investigation Of Llm's Hidden States Duan Hanyu, Yang Yi, Tam Kar Yan
- On The Evaluation Consistency Of Attribution-based Explanations Duan Jiarui, Li Haoling, Zhang Haofei, Jiang Hao, Xue Mengqi, Sun Li, Song Mingli, Song Jie
- Prompting Multi-modal Tokens To Enhance End-to-end Autonomous Driving Imitation Learning With Llms Duan Yiqun, Zhang Qiang, Xu Renjing
- Length-controlled Alpacaeval: A Simple Way To Debias Automatic Evaluators Dubois Yann, Galambosi Balázs, Liang Percy, Hashimoto Tatsunori B.
- Applying RLAIF For Code Generation With Api-usage In Lightweight Llms Dutta Sujan, Mahinder Sayantan, Anantha Raviteja, Bandyopadhyay Bortik
- AXOLOTL: Fairness Through Assisted Self-debiasing Of Large Language Model Outputs Ebrahimi Sana, Chen Kaiwen, Asudeh Abolfazl, Das Gautam, Koudas Nick
- CROME: Cross-modal Adapters For Efficient Multimodal LLM Ebrahimi Sayna, Arik Sercan O., Nama Tejas, Pfister Tomas
- Lusifer: Llm-based User Simulated Feedback Environment For Online Recommender Systems Ebrat Danial, Rueda Luis
- Dialogue You Can Trust: Human And AI Perspectives On Generated Conversations Ebubechukwu Ike, Takeuchi Johane, Ceravola Antonello, Joublin Frank
- Cognitive Bias In High-stakes Decision-making With Llms Echterhoff Jessica, Liu Yao, Alessa Abeer, Mcauley Julian, He Zexue
- Relation Extraction With Fine-tuned Large Language Models In Retrieval Augmented Generation Frameworks Efeoglu Sefika, Paschke Adrian
- Considers-the-human Evaluation Framework: Rethinking Human Evaluation For Generative Large Language Models Elangovan Aparna, Liu Ling, Xu Lei, Bodapati Sravan, Roth Dan
- Kapqa: Knowledge-augmented Product Question-answering Eppalapally Swetha, Dangi Daksh, Bhat Chaithra, Gupta Ankita, Zhang Ruiyi, Agarwal Shubham, Bagga Karishma, Yoon Seunghyun, Lipka Nedim, Rossi Ryan A., Dernoncourt Franck
- Navigating Uncertainty: Optimizing API Dependency For Hallucination Reduction In Closed-book Question Answering Erbacher Pierre, Falissar Louis, Guigue Vincent, Soulier Laure
- Tinyagent: Function Calling At The Edge Erdogan Lutfi Eren, Lee Nicholas, Jha Siddharth, Kim Sehoon, Tabrizi Ryan, Moon Suhong, Hooper Coleman, Anumanchipalli Gopala, Keutzer Kurt, Gholami Amir
- Combinatorial Reasoning: Selecting Reasons In Generative AI Pipelines Via Combinatorial Optimization Esencan Mert, Kumar Tarun Advaith, Asanjan Ata Akbari, Lott P. Aaron, Mohseni Masoud, Unlu Can, Venturelli Davide, Ho Alan
- Coda: Constrained Generation Based Data Augmentation For Low-resource NLP Evuru Chandra Kiran Reddy, Ghosh Sreyan, Kumar Sonal, S Ramaneswaran, Tyagi Utkarsh, Manocha Dinesh
- HLAT: High-quality Large Language Model Pre-trained On AWS Trainium Fan Haozheng, Zhou Hao, Huang Guangtai, Raman Parameswaran, Fu Xinwei, Gupta Gaurav, Ram Dhananjay, Wang Yida, Huan Jun
- Fedmkt: Federated Mutual Knowledge Transfer For Large And Small Language Models Fan Tao, Ma Guoqiang, Kang Yan, Gu Hanlin, Song Yuanfeng, Fan Lixin, Chen Kai, Yang Qiang
- Prompt Optimizer Of Text-to-image Diffusion Models For Abstract Concept Understanding Fan Zezhong, Li Xiaohan, Fang Chenhao, Biswas Topojoy, Nag Kaushiki, Xu Jianpeng, Achan Kannan
- Biasalert: A Plug-and-play Tool For Social Bias Detection In Llms Fan Zhiting, Chen Ruizhe, Xu Ruiling, Liu Zuozhu
- DARA: Decomposition-alignment-reasoning Autonomous Language Agent For Question Answering Over Knowledge Graphs Fang Haishuo, Zhu Xiaodan, Gurevych Iryna
- On LLM Wizards: Identifying Large Language Models' Behaviors For Wizard Of Oz Experiments Fang Jingchao, Arechiga Nikos, Namaoshi Keiichi, Bravo Nayeli, Hogan Candice, Shamma David A.
- PATCH! Psychometrics-assisted Benchmarking Of Large Language Models: A Case Study Of Proficiency In 8th Grade Mathematics Fang Qixiang, Oberski Daniel L., Nguyen Dong
- VILA²: VILA Augmented VILA Fang Yunhao, Zhu Ligeng, Lu Yao, Wang Yan, Molchanov Pavlo, Cho Jang Hyun, Pavone Marco, Han Song, Yin Hongxu
- Integrating A.I. In Higher Education: Protocol For A Pilot Study With 'Samcares: An Adaptive Learning Hub' Faruqui Syed Hasib Akhter, Tasnim Nazia, Basith Iftekhar Ibne, Obeidat Suleiman, Yildiz Faruk
- Porting Large Language Models To Mobile Devices For Question Answering Fassold Hannes
- Test Of Time: A Benchmark For Evaluating Llms On Temporal Reasoning Fatemi Bahare, Kazemi Mehran, Tsitsulin Anton, Malkan Karishma, Yim Jinyeong, Palowitch John, Seo Sungyong, Halcrow Jonathan, Perozzi Bryan
- Gensco: Can Question Decomposition Based Passage Alignment Improve Question Answering? Fazili Barah, Goswami Koustava, Modani Natwar, Nair Inderjeet
- Prompt Exploration With Prompt Regression Feffer Michael, Xu Ronald, Sun Yuekai, Yurochkin Mikhail
- Legend: Leveraging Representation Engineering To Annotate Safety Margin For Preference Datasets Feng Duanyu, Qin Bowen, Huang Chen, Huang Youcheng, Zhang Zheng, Lei Wenqiang
- Citygpt: Empowering Urban Spatial Cognition Of Large Language Models Feng Jie, Du Yuwei, Liu Tianhui, Guo Siqi, Lin Yuming, Li Yong
- Keypoint-based Progressive Chain-of-thought Distillation For Llms Feng Kaituo, Li Changsheng, Zhang Xiaolu, Zhou Jun, Yuan Ye, Wang Guoren
- AGILE: A Novel Framework Of LLM Agents Feng Peiyuan, He Yichen, Huang Guanhua, Lin Yuan, Zhang Hanchong, Zhang Yuchen, Li Hang
- Modular Pluralism: Pluralistic Alignment Via Multi-llm Collaboration Feng Shangbin, Sorensen Taylor, Liu Yuhan, Fisher Jillian, Park Chan Young, Choi Yejin, Tsvetkov Yulia
- Courseassist: Pedagogically Appropriate AI Tutor For Computer Science Education Feng Ty, Liu Sa, Ghosal Dipak
- Llama-vits: Enhancing TTS Synthesis With Semantic Awareness Feng Xincan, Yoshimoto Akifumi
- Mt-ladder: A Model-agnostic Framework Boosting Llm-based Machine Translation To The Next Level Feng Zhaopeng, Zhang Yan, Chen Ruizhe, Meng Zijie, Liu Zuozhu
- Tear: Improving Llm-based Machine Translation With Systematic Self-refinement Feng Zhaopeng, Zhang Yan, Li Hao, Wu Bei, Liao Jiayu, Liu Wenqiang, Lang Jun, Feng Yang, Wu Jian, Liu Zuozhu
- Unveiling And Manipulating Prompt Influence In Large Language Models Feng Zijian, Zhou Hanzhang, Zhu Zixiao, Qian Junlang, Mao Kezhi
- MERA: A Comprehensive LLM Evaluation In Russian Fenogenova Alena, Chervyakov Artem, Martynov Nikita, Kozlova Anastasia, Tikhonova Maria, Akhmetgareeva Albina, Emelyanov Anton, Shevelev Denis, Lebedev Pavel, Sinev Leonid, Isaeva Ulyana, Kolomeytseva Katerina, Moskovskiy Daniil, Goncharova Elizaveta, Savushkin Nikita, Mikhailova Polina, Dimitrov Denis, Panchenko Alexander, Markov Sergei
- Towards Trustworthy AI: A Review Of Ethical And Robust Large Language Models Ferdaus Md Meftahul, Abdelguerfi Mahdi, Ioup Elias, Niles Kendall N., Pathak Ken, Sloan Steven
- Into The Unknown: Self-learning Large Language Models Ferdinan Teddy, Kocoń Jan, Kazienko Przemysław
- Generative AI And Large Language Models For Cyber Security: All Insights You Need Ferrag Mohamed Amine, Alwahedi Fatima, Battah Ammar, Cherif Bilel, Mechri Abdechakour, Tihanyi Norbert
- Grillbot In Practice: Lessons And Tradeoffs Deploying Large Language Models For Adaptable Conversational Task Assistants Fischer Sophie, Gemmell Carlos, Tecklenburg Niklas, Mackie Iain, Rossetto Federico, Dalton Jeffrey
- RAG Foundry: A Framework For Enhancing Llms For Retrieval Augmented Generation Fleischer Daniel, Berchansky Moshe, Wasserblat Moshe, Izsak Peter
- Geckopt: LLM System Efficiency Via Intent-based Tool Selection Fore Michael, Singh Simranjit, Stamoulis Dimitrios
- Preact: Predicting Future In React Enhances Agent's Planning Ability Fu Dayuan, Huang Jianzhao, Lu Siyuan, Dong Guanting, Wang Yejie, He Keqing, Xu Weiran
- Autorag-hp: Automatic Online Hyper-parameter Tuning For Retrieval-augmented Generation Fu Jia, Qin Xiaoting, Yang Fangkai, Wang Lu, Zhang Jue, Lin Qingwei, Chen Yubo, Zhang Dongmei, Rajmohan Saravan, Zhang Qi
- Disperse-then-merge: Pushing The Limits Of Instruction Tuning Via Alignment Tax Reduction Fu Tingchen, Cai Deng, Liu Lemao, Shi Shuming, Yan Rui
- Autoguide: Automated Generation And Selection Of State-aware Guidelines For Large Language Model Agents Fu Yao, Kim Dong-ki, Kim Jaekyeom, Sohn Sungryull, Logeswaran Lajanugen, Bae Kyunghoon, Lee Honglak
- UI-JEPA: Towards Active Perception Of User Intent Through Onscreen User Activity Fu Yicheng, Anantha Raviteja, Vashisht Prabal, Cheng Jianpeng, Littwin Etai
- Using Large Language Models For The Interpretation Of Building Regulations Fuchs Stefan, Witbrock Michael, Dimyadi Johannes, Amor Robert
- Thinking Fair And Slow: On The Efficacy Of Structured Prompts For Debiasing Language Models Furniturewala Shaz, Jandial Surgan, Java Abhinav, Banerjee Pragyan, Shahid Simra, Bhatia Sumit, Jaidka Kokil
- Can AI Relate: Testing Large Language Model Response For Mental Health Support Gabriel Saadia, Puri Isha, Xu Xuhai, Malgaroli Matteo, Ghassemi Marzyeh
- RAVE: Residual Vector Embedding For Clip-guided Backlit Image Enhancement Gaintseva Tatiana, Benning Martin, Slabaugh Gregory
- Airavata: Introducing Hindi Instruction-tuned LLM Gala Jay, Jayakumar Thanmay, Husain Jaavid Aktar, M Aswanth Kumar, Khan Mohammed Safi Ur Rahman, Kanojia Diptesh, Puduppully Ratish, Khapra Mitesh M., Dabre Raj, Murthy Rudra, Kunchukuttan Anoop
- MMM: Multilingual Mutual Reinforcement Effect Mix Datasets & Test With Open-domain Information Extraction Large Language Models Gan Chengguang, Yin Qingyu, He Xinyang, Wei Hanjun, Liang Yunhao, Lim Younghun, Wang Shijian, Huang Hexiang, Zhang Qinghao, Ni Shiwen, Mori Tatsunori
- Application Of LLM Agents In Recruitment: A Novel Framework For Resume Screening Gan Chengguang, Zhang Qinghao, Mori Tatsunori
- Context-augmented Retrieval: A Novel Framework For Fast Information Retrieval Based Response Generation Using Large Language Model Ganesh Sai, Purwar Anupam, B Gautam
- Multilingual Pretraining And Instruction Tuning Improve Cross-lingual Knowledge Alignment, But Only Shallowly Gao Changjiang, Hu Hongda, Hu Peng, Chen Jiajun, Li Jixing, Huang Shujian
- Practical Unlearning For Large Language Models Gao Chongyang, Wang Lixu, Weng Chenkai, Wang Xiao, Zhu Qi
- Aligning LLM Agents By Learning Latent Preference From User Edits Gao Ge, Taymanov Alexey, Salinas Eduardo, Mineiro Paul, Misra Dipendra
- Self-evolving GPT: A Lifelong Autonomous Experiential Learner Gao Jinglong, Ding Xiao, Cui Yiming, Zhao Jianbai, Wang Hepeng, Liu Ting, Qin Bing
- Unifying Demonstration Selection And Compression For In-context Learning Gao Jun, Cao Ziqiang, Li Wenjie
- AIM: Let Any Multi-modal Large Language Models Embrace Efficient In-context Learning Gao Jun, Qiao Qian, Cao Ziqiang, Wang Zili, Li Wenjie
- DORY: Deliberative Prompt Recovery For LLM Gao Lirong, Peng Ru, Zhang Yiming, Zhao Junbo
- Autovcoder: A Systematic Framework For Automated Verilog Code Generation Using Llms Gao Mingzhe, Zhao Jieru, Lin Zhe, Ding Wenchao, Hou Xiaofeng, Feng Yu, Li Chao, Guo Minyi
- Lumina-t2x: Transforming Text Into Any Modality, Resolution, And Duration Via Flow-based Large Diffusion Transformers Gao Peng, Zhuo Le, Liu Dongyang, Du Ruoyi, Luo Xu, Qiu Longtian, Zhang Yuhang, Lin Chen, Huang Rongjie, Geng Shijie, Zhang Renrui, Xi Junlin, Shao Wenqi, Jiang Zhengkai, Yang Tianshuo, Ye Weicai, Tong He, He Jingwen, Qiao Yu, Li Hongsheng
- Efficient Tool Use With Chain-of-abstraction Reasoning Gao Silin, Dwivedi-yu Jane, Yu Ping, Tan Xiaoqing Ellen, Pasunuru Ramakanth, Golovneva Olga, Sinha Koustuv, Celikyilmaz Asli, Bosselut Antoine, Wang Tianlu
- Dr3: Ask Large Language Models Not To Give Off-topic Answers In Open Domain Multi-hop Question Answering Gao Yuan, Zhu Yiheng, Cao Yuanbin, Zhou Yinzhi, Wu Zhen, Chen Yujie, Wu Shenglan, Hu Haoyuan, Dai Xinyu
- Are We Done With MMLU? Gema Aryo Pradipta, Leang Joshua Ong Jun, Hong Giwon, Devoto Alessio, Mancino Alberto Carlo Maria, Saxena Rohit, He Xuanli, Zhao Yu, Du Xiaotang, Madani Mohammad Reza Ghasemi, Barale Claire, Mchardy Robert, Harris Joshua, Kaddour Jean, Van Krieken Emile, Minervini Pasquale
- Why Not Transform Chat Large Language Models To Non-english? Geng Xiang, Zhu Ming, Li Jiahuan, Lai Zhejian, Zou Wei, She Shuaijie, Guo Jiaxin, Zhao Xiaofeng, Li Yinglu, Li Yuang, Su Chang, Zhao Yanqing, Lyu Xinglin, Zhang Min, Chen Jiajun, Yang Hao, Huang Shujian
- Jailbreaking Llms With Arabic Transliteration And Arabizi Ghanim Mansour Al, Almohaimeed Saleh, Zheng Mengxin, Solihin Yan, Lou Qian
- Generative AI In Education: A Study Of Educators' Awareness, Sentiments, And Influencing Factors Ghimire Aashish, Prather James, Edwards John
- Agentquest: A Modular Benchmark Framework To Measure Progress And Improve LLM Agents Gioacchini Luca, Siracusano Giuseppe, Sanvito Davide, Gashteovski Kiril, Friede David, Bifulco Roberto, Lawrence Carolin
- RE-GAINS & Enchant: Intelligent Tool Manipulation Systems For Enhanced Query Responses Girhepuje Sahil, Sajeev Siva Sankar, Jain Purvam, Sikder Arya, Varma Adithya Rama, George Ryan, Srinivasan Akshay Govind, Kurup Mahendra, Sinha Ashmit, Mondal Sudip
- Chatglm: A Family Of Large Language Models From GLM-130B To GLM-4 All Tools Glm Team, Zeng Aohan, Xu Bin, Wang Bowen, Zhang Chenhui, Yin Da, Zhang Dan, Rojas Diego, Feng Guanyu, Zhao Hanlin, Lai Hanyu, Yu Hao, Wang Hongning, Sun Jiadai, Zhang Jiajie, Cheng Jiale, Gui Jiayi, Tang Jie, Zhang Jing, Sun Jingyu, Li Juanzi, Zhao Lei, Wu Lindong, Zhong Lucen, Liu Mingdao, Huang Minlie, Zhang Peng, Zheng Qinkai, Lu Rui, Duan Shuaiqi, Zhang Shudan, Cao Shulin, Yang Shuxun, Tam Weng Lam, Zhao Wenyi, Liu Xiao, Xia Xiao, Zhang Xiaohan, Gu Xiaotao, Lv Xin, Liu Xinghan, Liu Xinyi, Yang Xinyue, Song Xixuan, Zhang Xunkai, An Yifan, Xu Yifan, Niu Yilin, Yang Yuantao, Li Yueyan, Bai Yushi, Dong Yuxiao, Qi Zehan, Wang Zhaoyu, Yang Zhen, Du Zhengxiao, Hou Zhenyu, Wang Zihan
- Zamba: A Compact 7B SSM Hybrid Model Glorioso Paolo, Anthony Quentin, Tokpanov Yury, Whittington James, Pilault Jonathan, Ibrahim Adam, Millidge Beren
- Arcee's Mergekit: A Toolkit For Merging Large Language Models Goddard Charles, Siriwardhana Shamane, Ehghaghi Malikeh, Meyers Luke, Karpukhin Vlad, Benedict Brian, Mcquade Mark, Solawetz Jacob
- Socratic Reasoning Improves Positive Text Rewriting Goel Anmol, Daheim Nico, Gurevych Iryna
- Direct Alignment Of Draft Model For Speculative Decoding With Chat-fine-tuned Llms Goel Raghavv, Gagrani Mukul, Jeon Wonseok, Park Junyoung, Lee Mingu, Lott Christopher
- Language Guided Exploration For RL Agents In Text Environments Golchha Hitesh, Yerawar Sahil, Patel Dhruvesh, Dan Soham, Murugesan Keerthiram
- Tokenshap: Interpreting Large Language Models With Monte Carlo Shapley Value Estimation Goldshmidt Roni, Horovicz Miriam
- I've Got The "answer"! Interpretation Of Llms Hidden States In Question Answering Goloviznina Valeriya, Kotelnikov Evgeny
- Evaluation Of Llms On Syntax-aware Code Fill-in-the-middle Tasks Gong Linyuan, Wang Sida, Elhoushi Mostafa, Cheung Alvin
- Exploring Augmentation And Cognitive Strategies For AI Based Synthetic Personae Gonzalez Rafael Arias, Dipaola Steve
- Specify What? Enhancing Neural Specification Synthesis By Symbolic Methods Granberry George, Ahrendt Wolfgang, Johansson Moa
- From Generalist To Specialist: Improving Large Language Models For Medical Physics Using Arcot Grandinetti Jace, Mcbeth Rafe
- Teams-rl: Teaching Llms To Generate Better Instruction Datasets Via Reinforcement Learning Gu Shangding, Knoll Alois, Jin Ming
- RWKV-CLIP: A Robust Vision-language Representation Learner Gu Tiancheng, Yang Kaicheng, An Xiang, Feng Ziyong, Liu Dongnan, Cai Weidong, Deng Jiankang
- Continual Skill And Task Learning Via Dialogue Gu Weiwei, Kondepudi Suresh, Huang Lixiao, Gopalan Nakul
- Middleware For Llms: Tools Are Instrumental For Language Agents In Complex Environments Gu Yu, Shu Yiheng, Yu Hao, Liu Xiao, Dong Yuxiao, Tang Jie, Srinivasa Jayanth, Latapie Hugo, Su Yu
- Anah-v2: Scaling Analytical Hallucination Annotation Of Large Language Models Gu Yuzhe, Ji Ziwei, Zhang Wenwei, Lyu Chengqi, Lin Dahua, Chen Kai
- Inquire, Interact, And Integrate: A Proactive Agent Collaborative Framework For Zero-shot Multimodal Medical Reasoning Gu Zishan, Liu Fenglin, Yin Changchang, Zhang Ping
- AMOR: A Recipe For Building Adaptable Modular Knowledge Agents Through Process Feedback Guan Jian, Wu Wei, Wen Zujie, Xu Peng, Wang Hongning, Huang Minlie
- Look Before You Leap: Towards Decision-aware And Generalizable Tool-usage For Large Language Models Gui Anchun, Li Jian, Dai Yong, Du Nan, Xiao Han
- Boosting Lossless Speculative Decoding Via Feature Sampling And Partial Alignment Distillation Gui Lujun, Xiao Bin, Su Lei, Chen Weipeng
- Conversational Topic Recommendation In Counseling And Psychotherapy With Decision Transformer And Large Language Models Gunal Aylin, Lin Baihan, Bouneffouf Djallel
- Learning On Graphs With Large Language Models (Llms): A Deep Dive Into Model Robustness Guo Kai, Liu Zewen, Chen Zhikai, Wen Hongzhi, Jin Wei, Tang Jiliang, Chang Yi
- Integrating Large Language Models With Graphical Session-based Recommendation Guo Naicheng, Cheng Hongwei, Liang Qianqiao, Chen Linxun, Han Bing
- SGSH: Stimulate Large Language Models With Skeleton Heuristics For Knowledge Base Question Generation Guo Shasha, Liao Lizi, Zhang Jing, Wang Yanling, Li Cuiping, Chen Hong
- Agent-simt: Agent-assisted Simultaneous Machine Translation With Large Language Models Guo Shoutao, Zhang Shaolei, Ma Zhengrui, Zhang Min, Feng Yang
- Ds-agent: Automated Data Science By Empowering Large Language Models With Case-based Reasoning Guo Siyuan, Deng Cheng, Wen Ying, Chen Hechang, Chang Yi, Wang Jun
- Efficient Continual Pre-training By Mitigating The Stability Gap Guo Yiduo, Fu Jie, Zhang Huishuai, Zhao Dongyan, Shen Yikang
- API Pack: A Massive Multi-programming Language Dataset For API Call Generation Guo Zhen, Soria Adriana Meza, Sun Wei, Shen Yikang, Panda Rameswar
- Stabletoolbench: Towards Stable Large-scale Benchmarking On Tool Learning Of Large Language Models Guo Zhicheng, Cheng Sijie, Wang Hao, Liang Shihao, Qin Yujia, Li Peng, Liu Zhiyuan, Sun Maosong, Liu Yang
- Dependency-aware Semi-structured Sparsity: Declining Roles Of Outliers In Pruning Glu-based Llms Guo Zhiyu, Kamigaito Hidetaka, Watanabe Taro
- Walledeval: A Comprehensive Safety Evaluation Toolkit For Large Language Models Gupta Prannaya, Yau Le Qi, Low Hao Han, Lee I-shiang, Lim Hugo Maximus, Teoh Yu Xin, Koh Jia Hng, Liew Dar Win, Bhardwaj Rishabh, Bhardwaj Rajat, Poria Soujanya
- Hipporag: Neurobiologically Inspired Long-term Memory For Large Language Models Gutiérrez Bernal Jiménez, Shu Yiheng, Gu Yu, Yasunaga Michihiro, Su Yu
- METAL: Towards Multilingual Meta-evaluation Hada Rishav, Gumma Varun, Ahmed Mohamed, Bali Kalika, Sitaram Sunayana
- Would I Lie To You? Inference Time Alignment Of Language Models Using Direct Preference Heads Hadji-kyriacou Avelina Asada, Arandjelovic Ognjen
- On The Impacts Of Contexts On Repository-level Code Generation Hai Nam Le, Nguyen Dung Manh, Bui Nghi D. Q.
- Single Parent Family: A Spectrum Of Family Members From A Single Pre-trained Foundation Model Hajimolahoseini Habib, Hassanpour Mohammad, Ataiefard Foozhan, Chen Boxing, Liu Yang
- PECC: Problem Extraction And Coding Challenges Haller Patrick, Golde Jonas, Akbik Alan
- Llm-personalize: Aligning LLM Planners With Human Preferences Via Reinforced Self-training For Housekeeping Robots Han Dongge, Mcinroe Trevor, Jelley Adam, Albrecht Stefano V., Bell Peter, Storkey Amos
- MERLIN: Multimodal Embedding Refinement Via Llm-based Iterative Navigation For Text-video Retrieval-rerank Pipeline Han Donghoon, Park Eunhwan, Lee Gisang, Lee Adam, Kwak Nojun
- Chain-of-interaction: Enhancing Large Language Models For Psychiatric Behavior Understanding By Dyadic Contexts Han Guangzeng, Liu Weisi, Huang Xiaolei, Borsari Brian
- RECIPE4U: Student-chatgpt Interaction Dataset In EFL Writing Education Han Jieun, Yoo Haneul, Myung Junho, Kim Minsun, Lee Tak Yeon, Ahn So-yeon, Oh Alice
- Enhancing CTR Prediction Through Sequential Recommendation Pre-training: Introducing The SRP4CTR Framework Han Ruidong, Li Qianzhong, Jiang He, Li Rui, Zhao Yurou, Li Xiang, Lin Wei
- RAG-QA Arena: Evaluating Domain Robustness For Long-form Retrieval Augmented Question Answering Han Rujun, Zhang Yuhao, Qi Peng, Xu Yumo, Wang Jenyuan, Liu Lan, Wang William Yang, Min Bonan, Castelli Vittorio
- Wildguard: Open One-stop Moderation Tools For Safety Risks, Jailbreaks, And Refusals Of Llms Han Seungju, Rao Kavel, Ettinger Allyson, Jiang Liwei, Lin Bill Yuchen, Lambert Nathan, Choi Yejin, Dziri Nouha
- Value Augmented Sampling For Language Model Alignment And Personalization Han Seungwook, Shenfeld Idan, Srivastava Akash, Kim Yoon, Agrawal Pulkit
- EMMA: Your Text-to-image Diffusion Model Can Secretly Accept Multi-modal Prompts Han Yucheng, Wang Rui, Zhang Chi, Hu Juntao, Cheng Pei, Fu Bin, Zhang Hanwang
- Visual Cot: Advancing Multi-modal Language Models With A Comprehensive Dataset And Benchmark For Chain-of-thought Reasoning Shao Hao, Qian Shengju, Xiao Han, Song Guanglu, Zong Zhuofan, Wang Letian, Liu Yu, Li Hongsheng
- LLM Reasoners: New Evaluation, Library, And Analysis Of Step-by-step Reasoning With Large Language Models Hao Shibo, Gu Yi, Luo Haotian, Liu Tianyang, Shao Xiyan, Wang Xinyuan, Xie Shuhua, Ma Haodi, Samavedhi Adithya, Gao Qiyue, Wang Zhen, Hu Zhiting
- Measuring Psychological Depth In Language Models Harel-canada Fabrice, Zhou Hanyu, Mupalla Sreya, Yildiz Zeynep, Sahai Amit, Peng Nanyun
- Safety Arithmetic: A Framework For Test-time Safety Alignment Of Language Models By Steering Parameters And Activations Hazra Rima, Layek Sayan, Banerjee Somnath, Poria Soujanya
- Sowing The Wind, Reaping The Whirlwind: The Impact Of Editing Language Models Hazra Rima, Layek Sayan, Banerjee Somnath, Poria Soujanya
- Webvoyager: Building An End-to-end Web Agent With Large Multimodal Models He Hongliang, Yao Wenlin, Ma Kaixin, Yu Wenhao, Dai Yong, Zhang Hongming, Lan Zhenzhong, Yu Dong
- From Words To Actions: Unveiling The Theoretical Underpinnings Of Llm-driven Autonomous Systems He Jianliang, Chen Siyu, Zhang Fengzhuo, Yang Zhuoran
- Multi-modal Instruction Tuned Llms With Fine-grained Visual Perception He Junwen, Wang Yifan, Wang Lijun, Lu Huchuan, He Jun-yan, Lan Jin-peng, Luo Bin, Xie Xuansong
- Think-program-rectify: 3D Situated Reasoning With Large Language Models He Qingrong, Lin Kejun, Chen Shizhe, Hu Anwen, Jin Qin
- Prompting Chatgpt For Translation: A Comparative Analysis Of Translation Brief And Persona Prompts He Sui
- MARS: Mixture Of Auto-regressive Models For Fine-grained Text-to-image Synthesis He Wanggui, Fu Siming, Liu Mushui, Wang Xierui, Xiao Wenyi, Shu Fangxun, Wang Yi, Zhang Lei, Yu Zhelun, Li Haoyuan, Huang Ziwei, Gan Leilei, Jiang Hao
- G-retriever: Retrieval-augmented Generation For Textual Graph Understanding And Question Answering He Xiaoxin, Tian Yijun, Sun Yifei, Chawla Nitesh V., Laurent Thomas, Lecun Yann, Bresson Xavier, Hooi Bryan
- SHED: Shapley-based Automated Dataset Refinement For Instruction Fine-tuning He Yexiao, Wang Ziyao, Shen Zheyu, Sun Guoheng, Dai Yucong, Wu Yongkai, Wang Hongyi, Li Ang
- BP4ER: Bootstrap Prompting For Explicit Reasoning In Medical Dialogue Generation He Yuhong, Zhang Yongqi, He Shizhu, Wan Jun
- Fortifying Ethical Boundaries In AI: Advanced Strategies For Enhancing Security In Large Language Models He Yunhong, Qiu Jianling, Zhang Wei, Yuan Zhengqing
- Reindex-then-adapt: Improving Large Language Models For Conversational Recommendation He Zhankui, Xie Zhouhang, Steck Harald, Liang Dawen, Jha Rahul, Kallus Nathan, Mcauley Julian
- Quality Of Answers Of Generative Large Language Models Vs Peer Patients For Interpreting Lab Test Results For Lay Patients: Evaluation Study He Zhe, Bhasuran Balu, Jin Qiao, Tian Shubo, Hanna Karim, Shavor Cindy, Arguello Lisbeth Garcia, Murray Patrick, Lu Zhiyong
- HMT: Hierarchical Memory Transformer For Long Context Language Processing He Zifan, Qin Zongyue, Prakriya Neha, Sun Yizhou, Cong Jason
- A Piece Of Theatre: Investigating How Teachers Design LLM Chatbots To Assist Adolescent Cyberbullying Education Hedderich Michael A., Bazarova Natalie N., Zou Wenting, Shim Ryun, Ma Xinda, Yang Qian
- Intent-conditioned And Non-toxic Counterspeech Generation Using Multi-task Instruction Tuning With RLAIF Hengle Amey, Kumar Aswini, Singh Sahajpreet, Bandhakavi Anil, Akhtar Md Shad, Chakroborty Tanmoy
- Can Large Language Models Make The Grade? An Empirical Study Evaluating Llms Ability To Mark Short Answer Questions In K-12 Education Henkel Owen, Boxer Adam, Hills Libby, Roberts Bill
- N-gram Prediction And Word Difference Representations For Language Modeling Heo Dongnyeong, Rim Daniela Noemi, Choi Heeyoul
- Super Tiny Language Models Hillier Dylan, Guertler Leon, Tan Cheston, Agrawal Palaash, Chen Ruirui, Cheng Bobby
- Llava-gemma: Accelerating Multimodal Foundation Models With A Compact Language Model Hinck Musashi, Olson Matthew L., Cobbley David, Tseng Shao-yen, Lal Vasudev
- Deepspeed-fastgen: High-throughput Text Generation For Llms Via MII And Deepspeed-inference Holmes Connor, Tanaka Masahiro, Wyatt Michael, Awan Ammar Ahmad, Rasley Jeff, Rajbhandari Samyam, Aminabadi Reza Yazdani, Qin Heyang, Bakhtiari Arash, Kurilenko Lev, He Yuxiong
- Accelerating Multilingual Language Model For Excessively Tokenized Languages Hong Jimin, Lee Gibbeum, Cho Jaewoong
- Free To Play: UN Trade And Development's Experience With Developing Its Own Open-source Retrieval Augmented Generation Large Language Model Application Hopp Daniel
- Large Language Models For Expansion Of Spoken Language Understanding Systems To New Languages Hoscilowicz Jakub, Pawlowski Pawel, Skorupa Marcin, Sowański Marcin, Janicki Artur
- Non-linear Inference Time Intervention: Improving LLM Truthfulness Hoscilowicz Jakub, Wiacek Adam, Chojnacki Jan, Cieslak Adam, Michon Leszek, Urbanevych Vitalii, Janicki Artur
- Efficient Solutions For An Intriguing Failure Of Llms: Long Context Window Does Not Mean Llms Can Analyze Long Sequences Flawlessly Hosseini Peyman, Castro Ignacio, Ghinassi Iacopo, Purver Matthew
- Raw Text Is All You Need: Knowledge-intensive Multi-turn Instruction Tuning For Large Language Model Hou Xia, Li Qifeng, Yang Jian, Li Tongliang, Chai Linzheng, Wu Xianjie, Ji Hangyuan, Li Zhoujun, Nie Jixuan, Dun Jingbo, Song Wenfeng
- Large Language Model-powered Chatbots For Internationalizing Student Support In Higher Education Hsain Achraf, Housni Hamza El
- RAGGED: Towards Informed Design Of Retrieval Augmented Generation Systems Hsia Jennifer, Shaikh Afreen, Wang Zhiruo, Neubig Graham
- Game Generation Via Large Language Models Hu Chengpeng, Zhao Yunlong, Liu Jialin
- Multimodal Chain-of-thought Reasoning Via Chatgpt To Protect Children From Age-inappropriate Apps Hu Chuanbo, Liu Bin, Yin Minglei, Zhou Yilu, Li Xin
- Openrlhf: An Easy-to-use, Scalable And High-performance RLHF Framework Hu Jian, Wu Xibin, Wang Weixun, Xianyu, Zhang Dehao, Cao Yu
- A Hopfieldian View-based Interpretation For Chain-of-thought Reasoning Hu Lijie, Liu Liang, Yang Shu, Chen Xin, Xiao Hongru, Li Mengdi, Zhou Pan, Ali Muhammad Asif, Wang Di
- Hiagent: Hierarchical Working Memory Management For Solving Long-horizon Agent Tasks With Large Language Model Hu Mengkang, Chen Tianxing, Chen Qiguang, Mu Yao, Shao Wenqi, Luo Ping
- Agentgen: Enhancing Planning Abilities For Large Language Model Based Agent Via Environment And Task Generation Hu Mengkang, Zhao Pu, Xu Can, Sun Qingfeng, Lou Jianguang, Lin Qingwei, Luo Ping, Rajmohan Saravan, Zhang Dongmei
- SLM Meets LLM: Balancing Latency, Interpretability And Consistency In Hallucination Detection Hu Mengya, Xu Rui, Lei Deren, Li Yaxi, Wang Mingyu, Ching Emily, Kamal Eslam, Deng Alex
- FOKE: A Personalized And Explainable Education Framework Integrating Foundation Models, Knowledge Graphs, And Prompt Engineering Hu Silan, Wang Xiaoning
- Refchecker: Reference-based Fine-grained Hallucination Checker And Benchmark For Large Language Models Hu Xiangkun, Ru Dongyu, Qiu Lin, Guo Qipeng, Zhang Tianhang, Xu Yang, Luo Yun, Liu Pengfei, Zhang Yue, Zhang Zheng
- ELLA: Equip Diffusion Models With LLM For Enhanced Semantic Alignment Hu Xiwei, Wang Rui, Fang Yixiao, Fu Bin, Cheng Pei, Yu Gang
- Prompt Perturbation In Retrieval-augmented Generation Based Large Language Models Hu Zhibo, Wang Chen, Shu Yanfeng, Paik Helen Hye-young, Zhu Liming
- Exact And Efficient Unlearning For Large Language Model-based Recommendation Hu Zhiyu, Zhang Yang, Xiao Minghao, Wang Wenjie, Feng Fuli, He Xiangnan
- V2xum-llm: Cross-modal Video Summarization With Temporal Prompt Instruction Tuning Hua Hang, Tang Yunlong, Xu Chenliang, Luo Jiebo
- Talk Less, Interact Better: Evaluating In-context Conversational Adaptation In Multimodal Llms Hua Yilun, Artzi Yoav
- Large Language Models In Mental Health Care: A Scoping Review Hua Yining, Liu Fenglin, Yang Kailai, Li Zehan, Na Hongbin, Sheu Yi-han, Zhou Peilin, Moran Lauren V., Ananiadou Sophia, Beam Andrew, Torous John
- Applying And Evaluating Large Language Models In Mental Health Care: A Scoping Review Of Human-assessed Generative Tasks Hua Yining, Na Hongbin, Li Zehan, Liu Fenglin, Fang Xiao, Clifton David, Torous John
- What Affects The Stability Of Tool Learning? An Empirical Study On The Robustness Of Tool Learning Frameworks Huang Chengrui, Shi Zhengliang, Wen Yuntao, Chen Xiuying, Han Peng, Gao Shen, Shang Shuo
- Training Language Models To Generate Text With Citations Via Fine-grained Rewards Huang Chengyu, Wu Zeqiu, Hu Yushi, Wang Wenya
- Towards Practical Tool Usage For Continually Learning Llms Huang Jerry, Parthasarathi Prasanna, Rezagholizadeh Mehdi, Chandar Sarath
- Mitigating Catastrophic Forgetting In Large Language Models With Self-synthesized Rehearsal Huang Jianheng, Cui Leyang, Wang Ante, Yang Chengyi, Liao Xinting, Song Linfeng, Yao Junfeng, Su Jinsong
- A Survey On Evaluation Of Multimodal Large Language Models Huang Jiaxing, Zhang Jingyi
- Demystifying Verbatim Memorization In Large Language Models Huang Jing, Yang Diyi, Potts Christopher
- Learning Fine-grained Grounded Citations For Attributed Large Language Models Huang Lei, Feng Xiaocheng, Ma Weitao, Gu Yuxuan, Zhong Weihong, Feng Xiachong, Yu Weijiang, Peng Weihua, Tang Duyu, Tu Dandan, Qin Bing
- Planning And Editing What You Retrieve For Enhanced Tool Learning Huang Tenghao, Jung Dongwon, Chen Muhao
- Improving Llms For Recommendation With Out-of-vocabulary Tokens Huang Ting-ji, Yang Jia-qi, Shen Chunxu, Liu Kai-qi, Zhan De-chuan, Ye Han-jia
- Brainchat: Decoding Semantic Information From Fmri Using Vision-language Pretrained Models Huang Wanaiu
- Queryagent: A Reliable And Efficient Reasoning Framework With Environmental Feedback-based Self-correction Huang Xiang, Cheng Sitao, Huang Shanshan, Shen Jiayu, Xu Yong, Zhang Chaoyun, Qu Yuzhong
- Aligning Translation-specific Understanding To General Understanding In Large Language Models Huang Yichong, Feng Xiaocheng, Li Baohang, Fu Chengpeng, Huo Wenshuai, Liu Ting, Qin Bing
- Key-point-driven Data Synthesis With Its Enhancement On Mathematical Reasoning Huang Yiming, Liu Xiao, Gong Yeyun, Gou Zhibin, Shen Yelong, Duan Nan, Chen Weizhu
- A Comprehensive Survey On Evaluating Large Language Model Applications In The Medical Industry Huang Yining, Tang Keke, Chen Meilian, Wang Boyuan
- Fastfid: Improve Inference Efficiency Of Open Domain Question Answering Via Sentence Selection Huang Yufei, Han Xu, Sun Maosong
- Active Testing Of Large Language Model Via Multi-stage Sampling Huang Yuheng, Song Jiayang, Hu Qiang, Juefei-xu Felix, Ma Lei
- Olympicarena: Benchmarking Multi-discipline Cognitive Reasoning For Superintelligent AI Huang Zhen, Wang Zengzhi, Xia Shijie, Li Xuefeng, Zou Haoyang, Xu Ruijie, Fan Run-ze, Ye Lyumanshan, Chern Ethan, Ye Yixin, Zhang Yikai, Yang Yuqing, Wu Ting, Wang Binjie, Sun Shichao, Xiao Yang, Li Yiyuan, Zhou Fan, Chern Steffi, Qin Yiwei, Ma Yan, Su Jiadi, Liu Yixiu, Zheng Yuxiang, Zhang Shaoting, Lin Dahua, Qiao Yu, Liu Pengfei
- Tool Calling: Enhancing Medication Consultation Via Retrieval-augmented Large Language Models Huang Zhongzhen, Xue Kui, Fan Yongqi, Mu Linjie, Liu Ruoyu, Ruan Tong, Zhang Shaoting, Zhang Xiaofan
- Enhancing The Capability And Robustness Of Large Language Models Through Reinforcement Learning-driven Query Refinement Huang Zisu, Wang Xiaohua, Zhang Feiran, Xu Zhibo, Zhang Cenyuan, Zheng Xiaoqing, Huang Xuanjing
- Instruction Fine-tuning: Does Prompt Loss Matter? Huerta-enochian Mathew, Ko Seung Yong
- HFT: Half Fine-tuning For Large Language Models Hui Tingfeng, Zhang Zhenyu, Wang Shuohuan, Xu Weiran, Sun Yu, Wu Hua
- Rot: Enhancing Large Language Models With Reflection On Search Trees Hui Wenyang, Tu Kewei
- Federated Recommendation Via Hybrid Retrieval Augmented Generation Zeng Huimin, Yue Zhenrui, Jiang Qian, Wang Dong
- Self-introspective Decoding: Alleviating Hallucinations For Large Vision-language Models Huo Fushuo, Xu Wenchao, Zhang Zhong, Wang Haozhao, Chen Zhicheng, Zhao Peilin
- DSLR: Document Refinement With Sentence-level Re-ranking And Reconstruction To Enhance Retrieval-augmented Generation Hwang Taeho, Jeong Soyeong, Cho Sukmin, Han Seungyoon, Park Jong C.
- MP2D: An Automated Topic Shift Dialogue Generation Framework Leveraging Knowledge Graphs Hwang Yerin, Kim Yongil, Jang Yunah, Bang Jeesoo, Bae Hyunkyung, Jung Kyomin
- News Without Borders: Domain Adaptation Of Multilingual Sentence Embeddings For Cross-lingual News Recommendation Iana Andreea, Schmidt Fabian David, Glavaš Goran, Paulheim Heiko
- On The Generalization Of Preference Learning With DPO Im Shawn, Li Yixuan
- Hallucinations Or Attention Misdirection? The Path To Strategic Value Extraction In Business Using Large Language Models Ioste Aline
- Autocompletion Of Chief Complaints In The Electronic Health Records Using Large Language Models Islam K M Sajjadul, Nipu Ayesha Siddika, Madiraju Praveen, Deshpande Priya
- Mapcoder: Multi-agent Code Generation For Competitive Problem Solving Islam Md. Ashraful, Ali Mohammed Eunus, Parvez Md Rizwan
- Datanarrative: Automated Data-driven Storytelling With Visualizations And Texts Islam Mohammed Saidul, Laskar Md Tahmid Rahman, Parvez Md Rizwan, Hoque Enamul, Joty Shafiq
- Elements Of World Knowledge (EWOK): A Cognition-inspired Framework For Evaluating Basic World Knowledge In Language Models Ivanova Anna A., Sathe Aalok, Lipkin Benjamin, Kumar Unnathi, Radkani Setayesh, Clark Thomas H., Kauf Carina, Hu Jennifer, Pramod R. T., Grand Gabriel, Paulun Vivian, Ryskina Maria, Akyürek Ekin, Wilcox Ethan, Rashid Nafisa, Choshen Leshem, Levy Roger, Fedorenko Evelina, Tenenbaum Joshua, Andreas Jacob
- Large Language Models For Behavioral Economics: Internal Validity And Elicitation Of Mental Models Jabarian Brian
- Meddoc-bot: A Chat Tool For Comparative Analysis Of Large Language Models In The Context Of The Pediatric Hypertension Guideline Jabarulla Mohamed Yaseen, Oeltze-jafra Steffen, Beerbaum Philipp, Uden Theodor
- Polyglotoxicityprompts: Multilingual Evaluation Of Neural Toxic Degeneration In Large Language Models Jain Devansh, Kumar Priyanshu, Gehman Samuel, Zhou Xuhui, Hartvigsen Thomas, Sap Maarten
- Evaluating Students' Open-ended Written Responses With Llms: Using The RAG Framework For GPT-3.5, GPT-4, Claude-3, And Mistral-large Jauhiainen Jussi S., Guerra Agustín Garagorry
- OLAPH: Improving Factuality In Biomedical Long-form Question Answering Jeong Minbyul, Hwang Hyeon, Yoon Chanwoong, Lee Taewhoo, Kang Jaewoo
- RLSF: Reinforcement Learning Via Symbolic Feedback Jha Piyush, Jana Prithwish, Arora Arnav, Ganesh Vijay
- Memeguard: An LLM And Vlm-based Framework For Advancing Content Moderation Via Meme Intervention Jha Prince, Jain Raghav, Mandal Konika, Chadha Aman, Saha Sriparna, Bhattacharyya Pushpak
- Towards Flexible Evaluation For Generative Visual Question Answering Ji Huishan, Si Qingyi, Lin Zheng, Wang Weiping
- Aligner: Efficient Alignment By Learning To Correct Ji Jiaming, Chen Boyuan, Lou Hantao, Hong Donghai, Zhang Borong, Pan Xuehai, Dai Juntao, Qiu Tianyi, Yang Yaodong
- Language Models Resist Alignment Ji Jiaming, Wang Kaile, Qiu Tianyi, Chen Boyuan, Zhou Jiayi, Li Changye, Lou Hantao, Yang Yaodong
- Can't Say Cant? Measuring And Reasoning Of Dark Jargons In Large Language Models Ji Xu, Zhang Jianyi, Zhou Ziyin, Zhao Zhangchi, Qiao Qianqian, Han Kaiying, Hossen Md Imran, Hei Xiali
- Knowledge Adaptation From Large Language Model To Recommendation For Practical Industrial Application Jia Jian, Wang Yipei, Li Yan, Chen Honggang, Bai Xuehan, Liu Zhaocheng, Liang Jian, Chen Quan, Li Han, Jiang Peng, Gai Kun
- SOUL: Unlocking The Power Of Second-order Optimization For LLM Unlearning Jia Jinghan, Zhang Yihua, Zhang Yimeng, Liu Jiancheng, Runwal Bharat, Diffenderfer James, Kailkhura Bhavya, Liu Sijia
- Simulbench: Evaluating Language Models With Creative Simulation Tasks Jia Qi, Yue Xiang, Zheng Tianyu, Huang Jie, Lin Bill Yuchen
- A Peek Into Token Bias: Large Language Models Are Not Yet Genuine Reasoners Jiang Bowen, Xie Yangxinyu, Hao Zhuoqun, Wang Xiaomeng, Mallick Tanwi, Su Weijie J., Taylor Camillo J., Roth Dan
- Hal-eval: A Universal And Fine-grained Hallucination Evaluation Framework For Large Vision Language Models Jiang Chaoya, Ye Wei, Dong Mengfan, Jia Hongrui, Xu Haiyang, Yan Ming, Zhang Ji, Zhang Shikun
- SELF-[IN]CORRECT: Llms Struggle With Discriminating Self-generated Responses Jiang Dongwei, Zhang Jingyu, Weller Orion, Weir Nathaniel, Van Durme Benjamin, Khashabi Daniel
- Interpretable Catastrophic Forgetting Of Large Language Model Fine-tuning Via Instruction Vector Jiang Gangwei, Jiang Caigao, Li Zhaoyi, Xue Siqiao, Zhou Jun, Song Linqi, Lian Defu, Wei Ying
- Llm-collaboration On Automatic Science Journalism For The General Audience Jiang Gongyao, Shi Xinran, Luo Qiong
- The Real, The Better: Aligning Large Language Models With Online Human Behaviors Jiang Guanying, Yan Lingyong, Shi Haibo, Yin Dawei
- Raising The Bar: Investigating The Values Of Large Language Models Via Generative Evolving Testing Jiang Han, Yi Xiaoyuan, Wei Zhihua, Wang Shu, Xie Xing
- Mix-cpt: A Domain Adaptation Framework Via Decoupling Knowledge Learning And Format Alignment Jiang Jinhao, Li Junyi, Zhao Wayne Xin, Song Yang, Zhang Tao, Wen Ji-rong
- Utilizing Large Language Models To Detect Privacy Leaks In Mini-app Code Jiang Liming
- Wildteaming At Scale: From In-the-wild Jailbreaks To (adversarially) Safer Language Models Jiang Liwei, Rao Kavel, Han Seungju, Ettinger Allyson, Brahman Faeze, Kumar Sachin, Mireshghallah Niloofar, Lu Ximing, Sap Maarten, Choi Yejin, Dziri Nouha
- Sketch: A Toolkit For Streamlining LLM Operations Jiang Xin, Li Xiang, Ma Wenjia, Fang Xuezhi, Yao Yiqun, Yu Naitong, Meng Xuying, Han Peng, Li Jing, Sun Aixin, Wang Yequan
- Unlocking The Power Of Large Language Models For Entity Alignment Jiang Xuhui, Shen Yinghan, Shi Zhichao, Xu Chengjin, Li Wei, Li Zixuan, Guo Jian, Shen Huawei, Wang Yuanzhuo
- Large Visual-language Models Are Also Good Classifiers: A Study Of In-context Multimodal Fake News Detection Jiang Ye, Wang Yimin
- Prior Knowledge Integration Via LLM Encoding And Pseudo Event Regulation For Video Moment Retrieval Jiang Yiyang, Zhang Wengyu, Zhang Xulu, Wei Xiaoyong, Chen Chang Wen, Li Qing
- Learning To Edit: Aligning Llms With Knowledge Editing Jiang Yuxin, Wang Yufei, Wu Chuhan, Zhong Wanjun, Zeng Xingshan, Gao Jiahui, Li Liangyou, Jiang Xin, Shang Lifeng, Tang Ruiming, Liu Qun, Wang Wei
- Megascale: Scaling Large Language Model Training To More Than 10,000 Gpus Jiang Ziheng, Lin Haibin, Zhong Yinmin, Huang Qi, Chen Yangrui, Zhang Zhi, Peng Yanghua, Li Xiang, Xie Cong, Nong Shibiao, Jia Yulu, He Sun, Chen Hongmin, Bai Zhihao, Hou Qi, Yan Shipeng, Zhou Ding, Sheng Yiyao, Jiang Zhuo, Xu Haohan, Wei Haoran, Zhang Zhang, Nie Pengfei, Zou Leqi, Zhao Sida, Xiang Liang, Liu Zherui, Li Zhe, Jia Xiaoying, Ye Jianxi, Jin Xin, Liu Xin
- Duetrag: Collaborative Retrieval-augmented Generation Jiao Dian, Cai Li, Huang Jingsheng, Zhang Wenqiao, Tang Siliang, Zhuang Yueting
- Learning Planning-based Reasoning By Trajectories Collection And Process Reward Synthesizing Jiao Fangkai, Qin Chengwei, Liu Zhengyuan, Chen Nancy F., Joty Shafiq
- Navigating LLM Ethics: Advancements, Challenges, And Future Directions Jiao Junfeng, Afroogh Saleh, Xu Yiming, Phillips Connor
- Graph Chain-of-thought: Augmenting Large Language Models By Reasoning On Graphs Jin Bowen, Xie Chulin, Zhang Jiawei, Roy Kashob Kumar, Zhang Yu, Li Zheng, Li Ruirui, Tang Xianfeng, Wang Suhang, Meng Yu, Han Jiawei
- Can Chatgpt Support Developers? An Empirical Evaluation Of Large Language Models For Code Generation Jin Kailun, Wang Chung-yu, Pham Hung Viet, Hemmati Hadi
- HGT: Leveraging Heterogeneous Graph-enhanced Large Language Models For Few-shot Complex Table Understanding Jin Rihui, Li Yu, Qi Guilin, Hu Nan, Li Yuan-fang, Chen Jiaoyan, Wang Jianan, Chen Yongrui, Min Dehai
- Reasoning Grasping Via Multimodal Large Language Model Jin Shiyu, Xu Jinxuan, Lei Yutian, Zhang Liangjun
- Self-selected Attention Span For Accelerating Large Language Model Inference Jin Tian, Yazar Wanzin, Xu Zifei, Sharify Sayeh, Wang Xin
- Video-lavit: Unified Video-language Pre-training With Decoupled Visual-motional Tokenization Jin Yang, Sun Zhicheng, Xu Kun, Xu Kun, Chen Liwei, Jiang Hao, Huang Quzhe, Song Chengru, Liu Yuliang, Zhang Di, Song Yang, Gai Kun, Mu Yadong
- SMART: Automatically Scaling Down Language Models With Accuracy Guarantees For Reduced Processing Fees Jo Saehan, Trummer Immanuel
- Llm-based Open-domain Integrated Task And Knowledge Assistants With Programmable Policies Joshi Harshit, Liu Shicheng, Chen James, Weigle Robert, Lam Monica S.
- Llm-based Federated Recommendation Jujia Zhao, Wenjie Wang, Chen Xu, Zhaochun Ren, See-kiong Ng, Tat-seng Chua
- Logical Closed Loop: Uncovering Object Hallucinations In Large Vision-language Models Junfei Wu, Qiang Liu, Ding Wang, Jinghao Zhang, Shu Wu, Liang Wang, Tieniu Tan
- Trust Or Escalate: LLM Judges With Provable Guarantees For Human Agreement Jung Jaehun, Brahman Faeze, Choi Yejin
- Information-theoretic Distillation For Reference-less Summarization Jung Jaehun, Lu Ximing, Jiang Liwei, Brahman Faeze, West Peter, Koh Pang Wei, Choi Yejin
- CODE: Contrasting Self-generated Description To Combat Hallucination In Large Multi-modal Models Junho Kim, Hyunjun Kim, Yeonju Kim, Yong Man Ro
- Introducing A New Hyper-parameter For RAG: Context Window Utilization Juvekar Kush, Purwar Anupam
- Exploring The Latest Llms For Leaderboard Extraction Kabongo Salomon, D'souza Jennifer, Auer Sören
- A Survey On Hardware Accelerators For Large Language Models Kachris Christoforos
- RAP: Retrieval-augmented Planning With Contextual Memory For Multimodal LLM Agents Kagaya Tomoyuki, Yuan Thong Jing, Lou Yuxuan, Karlekar Jayashree, Pranata Sugiri, Kinose Akira, Oguri Koki, Wick Felix, You Yang
- Language Repository For Long Video Understanding Kahatapitiya Kumara, Ranasinghe Kanchana, Park Jongwoo, Ryoo Michael S.
- Evaluating The Impact Of Advanced LLM Techniques On Ai-lecture Tutors For A Robotics Course Kahl Sebastian, Löffler Felix, Maciol Martin, Ridder Fabian, Schmitz Marius, Spanagel Jennifer, Wienkamp Jens, Burgahn Christopher, Schilling Malte
- Slo-aware GPU Frequency Scaling For Energy Efficient LLM Inference Serving Kakolyris Andreas Kosmas, Masouros Dimosthenis, Vavaroutsos Petros, Xydis Sotirios, Soudris Dimitrios
- Understanding The Effect Of Using Semantically Meaningful Tokens For Visual Representation Learning Kalibhat Neha, Kattakinda Priyatham, Zarei Arman, Seleznev Nikita, Sharpe Samuel, Kumar Senthil, Feizi Soheil
- LLM-ARC: Enhancing Llms With An Automated Reasoning Critic Kalyanpur Aditya, Saravanakumar Kailash Karthik, Barres Victor, Chu-carroll Jennifer, Melville David, Ferrucci David
- Llms Can't Plan, But Can Help Planning In Llm-modulo Frameworks Kambhampati Subbarao, Valmeekam Karthik, Guan Lin, Verma Mudit, Stechly Kaya, Bhambri Siddhant, Saldyt Lucas, Murthy Anil
- When Can Llms Actually Correct Their Own Mistakes? A Critical Survey Of Self-correction Of Llms Kamoi Ryo, Zhang Yusen, Zhang Nan, Han Jiawei, Zhang Rui
- Mindstar: Enhancing Math Reasoning In Pre-trained Llms At Inference Time Kang Jikun, Li Xin Zhe, Chen Xi, Kazemi Amirreza, Sun Qianyi, Chen Boxing, Li Dong, He Xu, He Quan, Wen Feng, Hao Jianye, Yao Jun
- Unfamiliar Finetuning Examples Control How Language Models Hallucinate Kang Katie, Wallace Eric, Tomlin Claire, Kumar Aviral, Levine Sergey
- C-RAG: Certified Generation Risks For Retrieval-augmented Language Models Kang Mintong, Gürel Nezihe Merve, Yu Ning, Song Dawn, Li Bo
- Prompting Large Language Models With Audio For General-purpose Speech Summarization Kang Wonjune, Roy Deb
- MEDIC: Towards A Comprehensive Framework For Evaluating Llms In Clinical Applications Kanithi Praveen K, Christophe Clément, Pimentel Marco Af, Raha Tathagata, Saadi Nada, Javed Hamza, Maslenkova Svetlana, Hayat Nasir, Rajan Ronnie, Khan Shadab
- Omniact: A Dataset And Benchmark For Enabling Multimodal Generalist Autonomous Agents For Desktop And Web Kapoor Raghav, Butala Yash Parag, Russak Melisa, Koh Jing Yu, Kamble Kiran, Alshikh Waseem, Salakhutdinov Ruslan
- Teaching Machines To Code: Smart Contract Translation With Llms Karanjai Rabimba, Xu Lei, Shi Weidong
- Can Reinforcement Learning Unlock The Hidden Dangers In Aligned Large Language Models? Karkevandi Mohammad Bahrami, Vishwamitra Nishant, Najafirad Peyman
- Beyond Traditional Benchmarks: Analyzing Behaviors Of Open Llms On Data-to-text Generation Kasner Zdeněk, Dušek Ondřej
- Faaf: Facts As A Function For The Evaluation Of Generated Text Katranidis Vasileios, Barany Gabor
- Defending Large Language Models Against Attacks With Residual Stream Activation Analysis Kawasaki Amelia, Davis Andrew, Abbas Houssam
- Codeaid: Evaluating A Classroom Deployment Of An Llm-based Programming Assistant That Balances Student And Educator Needs Kazemitabaar Majeed, Ye Runlong, Wang Xiaoning, Henley Austin Z., Denny Paul, Craig Michelle, Grossman Tovi
- Exploring The Frontiers Of Llms In Psychological Applications: A Comprehensive Review Ke Luoma, Tong Song, Cheng Peng, Peng Kaiping
- Development And Testing Of Retrieval Augmented Generation In Large Language Models -- A Case Study Report Ke Yuhe, Jin Liyuan, Elangovan Kabilan, Abdullah Hairil Rizal, Liu Nan, Sia Alex Tiong Heng, Soh Chai Rick, Tung Joshua Yi Min, Ong Jasmine Chiat Ling, Ting Daniel Shu Wei
- Bridging The Preference Gap Between Retrievers And Llms Ke Zixuan, Kong Weize, Li Cheng, Zhang Mingyang, Mei Qiaozhu, Bendersky Michael
- Visiongpt: Vision-language Understanding Agent Using Generalized Multimodal Framework Kelly Chris, Hu Luhui, Yang Bang, Tian Yu, Yang Deshun, Yang Cindy, Huang Zaoshan, Li Zihao, Hu Jiayin, Zou Yuexian
- Grounding And Evaluation For Large Language Models: Practical Challenges And Lessons Learned (survey) Kenthapadi Krishnaram, Sameki Mehrnoosh, Taly Ankur
- Autonomous Prompt Engineering In Large Language Models Kepel Daan, Valogianni Konstantina
- Source-aware Training Enables Knowledge Attribution In Language Models Khalifa Muhammad, Wadden David, Strubell Emma, Lee Honglak, Wang Lu, Beltagy Iz, Peng Hao
- Indicllmsuite: A Blueprint For Creating Pre-training And Fine-tuning Datasets For Indian Languages Khan Mohammed Safi Ur Rahman, Mehta Priyam, Sankar Ananth, Kumaravelan Umashankar, Doddapaneni Sumanth, G Suriyaprasaad, G Varun Balan, Jain Sparsh, Kunchukuttan Anoop, Kumar Pratyush, Dabre Raj, Khapra Mitesh M.
- ARGS: Alignment As Reward-guided Search Khanov Maxim, Burapacheep Jirayu, Li Yixuan
- How Well Do Llms Represent Values Across Cultures? Empirical Analysis Of LLM Responses Based On Hofstede Cultural Dimensions Kharchenko Julia, Roosta Tanya, Chadha Aman, Shah Chirag
- Beyond Code Generation: An Observational Study Of Chatgpt Usage In Software Engineering Practice Khojah Ranim, Mohamad Mazen, Leitner Philipp, Neto Francisco Gomes De Oliveira
- Llms In The Loop: Leveraging Large Language Model Annotations For Active Learning In Low-resource Languages Kholodna Nataliia, Julka Sahib, Khodadadi Mohammad, Gumus Muhammed Nurullah, Granitzer Michael
- Keyword-driven Retrieval-augmented Large Language Models For Cold-start User Recommendations Kieu Hai-dang, Nguyen Minh Duc, Nguyen Thanh-son, Le Dung D.
- Aligning Large Language Models With Self-generated Preference Data Kim Dongyoung, Lee Kimin, Shin Jinwoo, Kim Jaehyung
- Integrating Paralinguistics In Speech-empowered Large Language Models For Natural Conversation Kim Heeseung, Seo Soonshin, Jeong Kyeongseok, Kwon Ohsung, Kim Soyoon, Kim Jungwhan, Lee Jaehong, Song Eunwoo, Oh Myungwoo, Ha Jung-woo, Yoon Sungroh, Yoo Kang Min
- Sure: Summarizing Retrievals Using Answer Candidates For Open-domain QA Of Llms Kim Jaehyung, Nam Jaehyun, Mo Sangwoo, Park Jongjin, Lee Sang-woo, Seo Minjoon, Ha Jung-woo, Shin Jinwoo
- Review-driven Personalized Preference Reasoning With Large Language Models For Recommendation Kim Jieyong, Kim Hyunseo, Cho Hyunjin, Kang Seongku, Chang Buru, Yeo Jinyoung, Lee Dongha
- Evalverse: Unified And Accessible Library For Large Language Model Evaluation Kim Jihoo, Song Wonho, Kim Dahyun, Kim Yunsu, Kim Yungi, Park Chanjun
- Strategic Data Ordering: Enhancing Large Language Model Performance Through Curriculum Learning Kim Jisu, Lee Juhwan
- Enhancing Psychotherapy Counseling: A Data Augmentation Pipeline Leveraging Large Language Models For Counseling Conversations Kim Jun-woo, Han Ji-eun, Koh Jun-seok, Seo Hyeon-tae, Chang Du-seong
- Persona Is A Double-edged Sword: Enhancing The Zero-shot Reasoning By Ensembling The Role-playing And Neutral Prompts Kim Junseok, Yang Nakyeong, Jung Kyomin
- RE-RAG: Improving Open-domain QA Performance And Interpretability With Relevance Estimator In Retrieval-augmented Generation Kim Kiseung, Lee Jay-yoon
- Advisorqa: Towards Helpful And Harmless Advice-seeking Question Answering With Collective Intelligence Kim Minbeom, Lee Hwanhee, Park Joonsuk, Lee Hwaran, Jung Kyomin
- Automatic Jailbreaking Of The Text-to-image Generative AI Systems Kim Minseon, Lee Hyomin, Gong Boqing, Zhang Huishuai, Hwang Sung Ju
- Large Language Models Meet Collaborative Filtering: An Efficient All-round Llm-based Recommender System Kim Sein, Kang Hongseok, Choi Seungyoon, Kim Donghyun, Yang Minchul, Park Chanyoung
- THEANINE: Revisiting Memory Management In Long-term Conversations With Timeline-augmented Response Generation Kim Seo Hyun, Ong Kai Tzu-iunn, Kwon Taeyoon, Kim Namyoung, Ka Keummin, Bae Seonghyeon, Jo Yohan, Hwang Seung-won, Lee Dongha, Yeo Jinyoung
- Llmem: Estimating GPU Memory Usage For Fine-tuning Pre-trained Llms Kim Taeho, Wang Yanming, Chaturvedi Vatshank, Gupta Lokesh, Kim Seyeon, Kwon Yongin, Ha Sangtae
- From Bytes To Borsch: Fine-tuning Gemma And Mistral For The Ukrainian Language Representation Kiulian Artur, Polishko Anton, Khandoga Mykola, Chubych Oryna, Connor Jack, Ravishankar Raghav, Shirawalmath Adarsh
- Growover: How Can Llms Adapt To Growing Real-world Knowledge? Ko Dayoon, Kim Jinyoung, Choi Hahyeon, Kim Gunhee
- Recmind: Japanese Movie Recommendation Dialogue With Seeker's Internal State Kodama Takashi, Kiyomaru Hirokazu, Huang Yin Jou, Kurohashi Sadao
- Cleared For Takeoff? Compositional & Conditional Reasoning May Be The Achilles Heel To (flight-booking) Language Agents Kohli Harsh, Sun Huan
- Labeling Supervised Fine-tuning Data With The Scaling Law Kong Huanjun
- Customizing Language Models With Instance-wise Lora For Sequential Recommendation Kong Xiaoyu, Wu Jiancan, Zhang An, Sheng Leheng, Lin Hui, Wang Xiang, He Xiangnan
- Studying Large Language Model Behaviors Under Realistic Knowledge Conflicts Kortukov Evgenii, Rubinstein Alexander, Nguyen Elisa, Oh Seong Joon
- Leveraging Large Language Models For Enhanced Process Model Comprehension Kourani Humam, Berti Alessandro, Henrich Jasmin, Kratsch Wolfgang, Weidlich Robin, Li Chiao-yun, Arslan Ahmad, Schuster Daniel, Van Der Aalst Wil M. P.
- Token Trails: Navigating Contextual Depths In Conversational AI With Chatllm Kowsher Md., Panditi Ritesh, Prottasha Nusrat Jahan, Bhat Prakash, Bairagi Anupam Kumar, Arefin Mohammad Shamsul
- Large Language Models For Tuning Evolution Strategies Kramer Oliver
- Synthdst: Synthetic Data Is All You Need For Few-shot Dialog State Tracking Kulkarni Atharva, Tseng Bo-hsiang, Moniz Joel Ruben Antony, Piraviperumal Dhivya, Yu Hong, Bhargava Shruti
- Reinforcement Learning For Optimizing RAG For Domain Chatbots Kulkarni Mandar, Tangarajan Praveen, Kim Kyung, Trivedi Anusua
- BIPED: Pedagogically Informed Tutoring System For ESL Education Kwon Soonwoo, Kim Sojung, Park Minju, Lee Seunghyun, Kim Kyuseok
- A Framework For Synthetic Audio Conversations Generation Using Large Language Models Kyaw Kaung Myat, Chan Jonathan Hoyin
- Can Github Issues Be Solved With Tree Of Thoughts? La Rosa Ricardo, Hulse Corey, Liu Bangdi
- RES-Q: Evaluating Code-editing Large Language Model Systems At The Repository Scale Labash Beck, Rosedale August, Reents Alex, Negritto Lucas, Wiel Colin
- SEC-QA: A Systematic Evaluation Corpus For Financial QA Lai Viet Dac, Krumdick Michael, Lovering Charles, Reddy Varshini, Schmidt Craig, Tanner Chris
- RVISA: Reasoning And Verification For Implicit Sentiment Analysis Lai Wenna, Xie Haoran, Xu Guandong, Li Qing
- Alarm: Align Language Models Via Hierarchical Rewards Modeling Lai Yuhang, Wang Siyuan, Liu Shujun, Huang Xuanjing, Wei Zhongyu
- Residual-based Language Models Are Free Boosters For Biomedical Imaging Lai Zhixin, Wu Jing, Chen Suiyao, Zhou Yucheng, Hovakimyan Naira
- A Closer Look At Logical Reasoning With Llms: The Choice Of Tool Matters Lam Long Hei Matthew, Thatikonda Ramya Keerthy, Shareghi Ehsan
- Fine-tuning Language Models With Reward Learning On Policy Lang Hao, Huang Fei, Li Yongbin
- Enhancing Neural Machine Translation Of Low-resource Languages: Corpus Development, Human Evaluation And Explainable AI Architectures Lankford Séamus
- IBD: Alleviating Hallucinations In Large Vision-language Models Via Image-biased Decoding Lanyun Zhu, Deyi Ji, Tianrun Chen, Peng Xu, Jieping Ye, Jun Liu
- Query-opt: Optimizing Inference Of Large Language Models Via Multi-query Instructions In Meeting Summarization Laskar Md Tahmid Rahman, Khasanova Elena, Fu Xue-yong, Chen Cheng, Tn Shashi Bhushan
- Melting Point: Mobile Evaluation Of Language Transformers Laskaridis Stefanos, Katevas Kleomenis, Minto Lorenzo, Haddadi Hamed
- INDICT: Code Generation With Internal Dialogues Of Critiques For Both Security And Helpfulness Le Hung, Zhou Yingbo, Xiong Caiming, Savarese Silvio, Sahoo Doyen
- Sentiment Reasoning For Healthcare Le-duc Khai, Nguyen Khai-nguyen, Tat Bach Phan, Le Duy, Ngo Jerry, Vo-dang Long, Nguyen Anh Totti, Hy Truong-son
- Reallm: A General Framework For LLM Compression And Fine-tuning Leconte Louis, Bedin Lisa, Nguyen Van Minh, Moulines Eric
- Programming Refusal With Conditional Activation Steering Lee Bruce W., Padhi Inkit, Ramamurthy Karthikeyan Natesan, Miehling Erik, Dognin Pierre, Nagireddy Manish, Dhurandhar Amit
- Improving Conversational Abilities Of Quantized Large Language Models Via Direct Preference Alignment Lee Janghwan, Park Seongmin, Hong Sukjin, Kim Minsoo, Chang Du-seong, Choi Jungwook
- CATS: Contextually-aware Thresholding For Sparsity In Large Language Models Lee Je-yong, Lee Donghyun, Zhang Genghan, Tiwari Mo, Mirhoseini Azalia
- Can Long-context Language Models Subsume Retrieval, RAG, SQL, And More? Lee Jinhyuk, Chen Anthony, Dai Zhuyun, Dua Dheeru, Sachan Devendra Singh, Boratko Michael, Luan Yi, Arnold Sébastien M. R., Perot Vincent, Dalmia Siddharth, Hu Hexiang, Lin Xudong, Pasupat Panupong, Amini Aida, Cole Jeremy R., Riedel Sebastian, Naim Iftekhar, Chang Ming-wei, Guu Kelvin
- Plagbench: Exploring The Duality Of Large Language Models In Plagiarism Generation And Detection Lee Jooyoung, Agrawal Toshini, Uchendu Adaku, Le Thai, Chen Jinghui, Lee Dongwon
- Reinforcement Learning From Reflective Feedback (RLRF): Aligning And Improving Llms Via Fine-grained Self-reflection Lee Kyungjae, Hwang Dasol, Park Sunghyun, Jang Youngsoo, Lee Moontae
- Prompting Large Language Models For Zero-shot Essay Scoring Via Multi-trait Specialization Lee Sanwoo, Cai Yida, Meng Desong, Wang Ziyang, Wu Yunfang
- Aligning To Thousands Of Preferences Via System Message Generalization Lee Seongyun, Park Sue Hyun, Kim Seungone, Seo Minjoon
- Llava-docent: Instruction Tuning With Multimodal Large Language Model To Support Art Appreciation Education Lee Unggi, Jeon Minji, Lee Yunseo, Byun Gyuri, Son Yoorim, Shin Jaeyoon, Ko Hongkyu, Kim Hyeoncheol
- Infinigen: Efficient Generative Inference Of Large Language Models With Dynamic KV Cache Management Lee Wonbeom, Lee Jungi, Seo Junghwan, Sim Jaewoong
- Learning To Reduce: Optimal Representations Of Structured Data In Prompting Large Language Models Lee Younghun, Kim Sungchul, Yu Tong, Rossi Ryan A., Chen Xiang
- Chatgpt As Research Scientist: Probing Gpt's Capabilities As A Research Librarian, Research Ethicist, Data Generator And Data Predictor Lehr Steven A., Caliskan Aylin, Liyanage Suneragiri, Banaji Mahzarin R.
- Materials Science In The Era Of Large Language Models: A Perspective Lei Ge, Docherty Ronan, Cooper Samuel J.
- Same Task, More Tokens: The Impact Of Input Length On The Reasoning Performance Of Large Language Models Levy Mosh, Jacoby Alon, Goldberg Yoav
- Are Bigger Encoders Always Better In Vision Large Models? Li Bozhou, Liang Hao, Meng Zimo, Zhang Wentao
- Incorporating External Knowledge And Goal Guidance For Llm-based Conversational Recommender Systems Li Chuang, Deng Yang, Hu Hengchang, Kan Min-yen, Li Haizhou
- Searchlvlms: A Plug-and-play Framework For Augmenting Large Vision-language Models By Searching Up-to-date Internet Knowledge Li Chuanhao, Li Zhen, Jing Chenchen, Liu Shuo, Shao Wenqi, Wu Yuwei, Luo Ping, Qiao Yu, Zhang Kaipeng
- Contextualization Distillation From Large Language Model For Knowledge Graph Completion Li Dawei, Tan Zhen, Chen Tianlong, Liu Huan
- A Study On Training And Developing Large Language Models For Behavior Tree Generation Li Fu, Wang Xueying, Li Bin, Wu Yunlong, Wang Yanzhen, Yi Xiaodong
- "Is Chatgpt A Better Explainer Than My Professor?": Evaluating The Explanation Capabilities Of Llms In Conversation Compared To A Human Baseline Li Grace, Alshomary Milad, Muresan Smaranda
- Recall, Retrieve And Reason: Towards Better In-context Relation Extraction Li Guozheng, Wang Peng, Ke Wenjun, Guo Yikai, Ji Ke, Shang Ziyu, Liu Jiajun, Xu Zijie
- Hello Again! Llm-powered Personalized Agent For Long-term Dialogue Li Hao, Yang Chenghao, Zhang An, Deng Yang, Wang Xiang, Chua Tat-seng
- Backdoor Removal For Generative Large Language Models Li Haoran, Chen Yulin, Zheng Zihao, Hu Qi, Chan Chunkit, Liu Heshan, Song Yangqiu
- Datacomp-lm: In Search Of The Next Generation Of Training Sets For Language Models Li Jeffrey, Fang Alex, Smyrnis Georgios, Ivgi Maor, Jordan Matt, Gadre Samir, Bansal Hritik, Guha Etash, Keh Sedrick, Arora Kushal, Garg Saurabh, Xin Rui, Muennighoff Niklas, Heckel Reinhard, Mercat Jean, Chen Mayee, Gururangan Suchin, Wortsman Mitchell, Albalak Alon, Bitton Yonatan, Nezhurina Marianna, Abbas Amro, Hsieh Cheng-yu, Ghosh Dhruba, Gardner Josh, Kilian Maciej, Zhang Hanlin, Shao Rulin, Pratt Sarah, Sanyal Sunny, Ilharco Gabriel, Daras Giannis, Marathe Kalyani, Gokaslan Aaron, Zhang Jieyu, Chandu Khyathi, Nguyen Thao, Vasiljevic Igor, Kakade Sham, Song Shuran, Sanghavi Sujay, Faghri Fartash, Oh Sewoong, Zettlemoyer Luke, Lo Kyle, El-nouby Alaaeldin, Pouransari Hadi, Toshev Alexander, Wang Stephanie, Groeneveld Dirk, Soldaini Luca, Koh Pang Wei, Jitsev Jenia, Kollar Thomas, Dimakis Alexandros G., Carmon Yair, Dave Achal, Schmidt Ludwig, Shankar Vaishaal
- MT-PATCHER: Selective And Extendable Knowledge Distillation From Large Language Models For Machine Translation Li Jiahuan, Cheng Shanbo, Huang Shujian, Chen Jiajun
- Perteval: Unveiling Real Knowledge Capacity Of Llms With Knowledge-invariant Perturbations Li Jiatong, Hu Renjun, Huang Kunzhe, Zhuang Yan, Liu Qi, Zhu Mengxiao, Shi Xing, Lin Wei
- Steering Llms Towards Unbiased Responses: A Causality-guided Debiasing Framework Li Jingling, Tang Zeyu, Liu Xiaoyu, Spirtes Peter, Zhang Kun, Leqi Liu, Liu Yang
- Stylechat: Learning Recitation-augmented Memory In Llms For Stylized Dialogue Generation Li Jinpeng, Zhang Zekai, Tu Quan, Cheng Xin, Zhao Dongyan, Yan Rui
- Dialogue Action Tokens: Steering Language Models In Goal-directed Dialogue With A Multi-turn Planner Li Kenneth, Wang Yiming, Viégas Fernanda, Wattenberg Martin
- Confidence Matters: Revisiting Intrinsic Self-correction Capabilities Of Large Language Models Li Loka, Chen Zhenhao, Chen Guangyi, Zhang Yixuan, Su Yusheng, Xing Eric, Zhang Kun
- A Sentiment Consolidation Framework For Meta-review Generation Li Miao, Lau Jey Han, Hovy Eduard
- Needlebench: Can Llms Do Retrieval And Reasoning In 1 Million Context Window? Li Mo, Zhang Songyang, Liu Yunxin, Chen Kai
- Think Twice Before Trusting: Self-detection For Large Language Models Through Comprehensive Answer Reflection Li Moxin, Wang Wenjie, Feng Fuli, Zhu Fengbin, Wang Qifan, Chua Tat-seng
- Owlore: Outlier-weighed Layerwise Sampled Low-rank Projection For Memory-efficient LLM Fine-tuning Li Pengxiang, Yin Lu, Gao Xiaowei, Liu Shiwei
- Coevol: Constructing Better Responses For Instruction Finetuning Through Multi-agent Cooperation Li Renhao, Tan Minghuan, Wong Derek F., Yang Min
- IQA-EVAL: Automatic Evaluation Of Human-model Interactive Question Answering Li Ruosen, Wang Barry, Li Ruochen, Du Xinya
- Prompt4vis: Prompting Large Language Models With Example Mining And Schema Filtering For Tabular Data Visualization Li Shuaimin, Chen Xuanang, Song Yuanfeng, Song Yunze, Zhang Chen
- MEDIQ: Question-asking Llms For Adaptive And Reliable Clinical Reasoning Li Shuyue Stella, Balachandran Vidhisha, Feng Shangbin, Ilgen Jonathan, Pierson Emma, Koh Pang Wei, Tsvetkov Yulia
- Scilitllm: How To Adapt Llms For Scientific Literature Understanding Li Sihang, Huang Jin, Zhuang Jiaxi, Shi Yaorui, Cai Xiaochen, Xu Mingjun, Wang Xiang, Zhang Linfeng, Ke Guolin, Cai Hengxing
- Temporal Insight Enhancement: Mitigating Temporal Hallucination In Multimodal Large Language Models Li Sun, Liuan Wang, Jun Sun, Takayuki Okatani
- Model-enhanced Llm-driven VUI Testing Of VPA Apps Li Suwan, Bu Lei, Bai Guangdong, Xie Fuman, Chen Kai, Yue Chang
- Moe-ct: A Novel Approach For Large Language Models Training With Resistance To Catastrophic Forgetting Li Tianhao, Li Shangjie, Xie Binbin, Xiong Deyi, Yang Baosong
- TOPA: Extend Large Language Models For Video Understanding Via Text-only Pre-alignment Li Wei, Fan Hehe, Wong Yongkang, Kankanhalli Mohan, Yang Yi
- Citation-enhanced Generation For Llm-based Chatbots Li Weitao, Li Junkai, Ma Weizhi, Liu Yang
- Stbench: Assessing The Ability Of Large Language Models In Spatio-temporal Analysis Li Wenbin, Yao Di, Zhao Ruibo, Chen Wenjie, Xu Zijie, Luo Chengxue, Gong Chang, Jing Quanliang, Tan Haining, Bi Jingping
- Tutorly: Turning Programming Videos Into Apprenticeship Learning Environments With Llms Li Wengxi, Pea Roy, Haber Nick, Subramonyam Hari
- Uio-llms: Unbiased Incremental Optimization For Long-context Llms Li Wenhao, Lin Mingbao, Zhong Yunshan, Yan Shuicheng, Ji Rongrong
- MM-TTS: A Unified Framework For Multimodal, Prompt-induced Emotional Text-to-speech Synthesis Li Xiang, Cheng Zhi-qi, He Jun-yan, Peng Xiaojiang, Hauptmann Alexander G.
- Llara: Supercharging Robot Learning Data For Vision-language Policy Li Xiang, Mata Cristina, Park Jongwoo, Kahatapitiya Kumara, Jang Yoo Sung, Shang Jinghuan, Ranasinghe Kanchana, Burgert Ryan, Cai Mu, Lee Yong Jae, Ryoo Michael S.
- Preference Tuning For Toxicity Mitigation Generalizes Across Languages Li Xiaochen, Yong Zheng-xin, Bach Stephen H.
- Evaluating Mathematical Reasoning Of Large Language Models: A Focus On Error Identification And Correction Li Xiaoyuan, Wang Wenjie, Li Moxin, Guo Junrong, Zhang Yang, Feng Fuli
- Personalized Language Modeling From Personalized Human Feedback Li Xinyu, Lipton Zachary C., Leqi Liu
- Ganprompt: Enhancing Robustness In Llm-based Recommendations With Gan-enhanced Diversity Prompts Li Xinyu, Zhao Chuang, Zhao Hongke, Wu Likang, He Ming
- Say More With Less: Understanding Prompt Learning Behaviors Through Gist Compression Li Xinze, Liu Zhenghao, Xiong Chenyan, Yu Shi, Yan Yukun, Wang Shuo, Yu Ge
- Mini-gemini: Mining The Potential Of Multi-modality Vision Language Models Li Yanwei, Zhang Yuechen, Wang Chengyao, Zhong Zhisheng, Chen Yixin, Chu Ruihang, Liu Shaoteng, Jia Jiaya
- Leveraging Large Language Model As Simulated Patients For Clinical Education Li Yanzeng, Zeng Cheng, Zhong Jialun, Zhang Ruoyu, Zhang Minhao, Zou Lei
- Badedit: Backdooring Large Language Models By Model Editing Li Yanzhou, Li Tianlin, Chen Kangjie, Zhang Jian, Liu Shangqing, Wang Wenhan, Zhang Tianwei, Liu Yang
- Calrec: Contrastive Alignment Of Generative Llms For Sequential Recommendation Li Yaoyiran, Zhai Xiang, Alzantot Moustafa, Yu Keyi, Vulić Ivan, Korhonen Anna, Hammad Mohamed
- Style-talker: Finetuning Audio Language Model And Style-based Text-to-speech Model For Fast Spoken Dialogue Generation Li Yinghao Aaron, Jiang Xilin, Darefsky Jordan, Zhu Ge, Mesgarani Nima
- Guiding Enumerative Program Synthesis With Large Language Models Li Yixuan, Parsert Julian, Polgreen Elizabeth
- Mateval: A Multi-agent Discussion Framework For Advancing Open-ended Text Evaluation Li Yu, Zhang Shenyu, Wu Rui, Huang Xiutian, Chen Yongrui, Xu Wenhao, Qi Guilin, Min Dehai
- EAGLE: Speculative Sampling Requires Rethinking Feature Uncertainty Li Yuhui, Wei Fangyun, Zhang Chao, Zhang Hongyang
- Uni-moe: Scaling Unified Multimodal Llms With Mixture Of Experts Li Yunxin, Jiang Shenyuan, Hu Baotian, Wang Longyue, Zhong Wanqi, Luo Wenhan, Ma Lin, Zhang Min
- Mmsci: A Multimodal Multi-discipline Dataset For Phd-level Scientific Comprehension Li Zekun, Yang Xianjun, Choi Kyuri, Zhu Wanrong, Hsieh Ryan, Kim Hyeonjung, Lim Jin Hyuk, Ji Sungyoung, Lee Byungju, Yan Xifeng, Petzold Linda Ruth, Wilson Stephen D., Lim Woosang, Wang William Yang
- Formal-llm: Integrating Formal Language And Natural Language For Controllable Llm-based Agents Li Zelong, Hua Wenyue, Wang Hao, Zhu He, Zhang Yongfeng
- Understanding And Patching Compositional Reasoning In Llms Li Zhaoyi, Jiang Gangwei, Xie Hong, Song Linqi, Lian Defu, Wei Ying
- Focusllm: Scaling Llm's Context By Parallel Decoding Li Zhenyu, Zhang Yike, Pan Tengyu, Sun Yutao, Duan Zhichao, Fang Junjie, Han Rong, Wang Zixuan, Wang Jianyong
- EAGLE: Elevating Geometric Reasoning Through Llm-empowered Visual Instruction Tuning Li Zhihao, Du Yao, Liu Yang, Zhang Yan, Liu Yufang, Zhang Mengdi, Cai Xunliang
- Visualization Literacy Of Multimodal Large Language Models: A Comparative Study Li Zhimin, Miao Haichao, Pascucci Valerio, Liu Shusen
- Refiner: Restructure Retrieval Content Efficiently To Advance Question-answering Capabilities Li Zhonghao, Hu Xuming, Liu Aiwei, Zheng Kening, Huang Sirui, Xiong Hui
- Self-instructed Derived Prompt Generation Meets In-context Learning: Unlocking New Potential Of Black-box Llms Li Zhuo, Du Yuhao, Hu Jinpeng, Wan Xiang, Gao Anningzhe
- Synthesize Step-by-step: Tools, Templates And Llms As Data Generators For Reasoning-based Chart VQA Li Zhuowan, Jasani Bhavan, Tang Peng, Ghadar Shabnam
- Intermediate Distillation: Data-efficient Distillation From Black-box Llms For Information Retrieval Li Zizhong, Zhang Haopeng, Zhang Jiawei
- Recai: Leveraging Large Language Models For Next-generation Recommender Systems Lian Jianxun, Lei Yuxuan, Huang Xu, Yao Jing, Xu Wei, Xie Xing
- Online Training Of Large Language Models: Learn While Chatting Liang Juhao, Wang Ziwei, Ma Zhuoheng, Li Jianquan, Zhang Zhiyi, Wu Xiangbo, Wang Benyou
- Bytecomposer: A Human-like Melody Composition Method Based On Language Model Agent Liang Xia, Du Xingjian, Lin Jiaju, Zou Pei, Wan Yuan, Zhu Bilei
- Task Oriented In-domain Data Augmentation Liang Xiao, Hu Xinyu, Zuo Simiao, Gong Yeyun, Lou Qiang, Liu Yi, Huang Shao-lun, Jiao Jian
- Fennec: Fine-grained Language Model Evaluation And Correction Extended Through Branching And Bridging Liang Xiaobo, Zhang Haoke, Hu Helan, Li Juntao, Xu Jun, Zhang Min
- Languashrink: Reducing Token Overhead With Psycholinguistics Liang Xuechen, Tao Meiling, Xia Yinghui, Shi Tianyu, Wang Jun, Yang Jingsong
- Self-evolving Agents With Reflective And Memory-augmented Abilities Liang Xuechen, Tao Meiling, Xia Yinghui, Shi Tianyu, Wang Jun, Yang Jingsong
- Empowering Large Language Models To Set Up A Knowledge Retrieval Indexer Via Self-learning Liang Xun, Niu Simin, Li Zhiyu, Zhang Sensen, Song Shichao, Wang Hanyu, Yang Jiawei, Xiong Feiyu, Tang Bo, Xi Chenyang
- Internal Consistency And Self-feedback In Large Language Models: A Survey Liang Xun, Song Shichao, Zheng Zifan, Wang Hanyu, Yu Qingchen, Li Xunkai, Li Rong-hua, Cheng Peng, Wang Zhonghao, Xiong Feiyu, Li Zhiyu
- Learning To Trust Your Feelings: Leveraging Self-awareness In Llms For Hallucination Mitigation Liang Yuxin, Song Zhuoyang, Wang Hao, Zhang Jiaxing
- Apiq: Finetuning Of 2-bit Quantized Large Language Model Liao Baohao, Herold Christian, Khadivi Shahram, Monz Christof
- 3-in-1: 2D Rotary Adaptation For Efficient Finetuning, Efficient Batching And Composability Liao Baohao, Monz Christof
- Imagination Augmented Generation: Learning To Imagine Richer Context For Question Answering Over Large Language Models Liao Huanxuan, He Shizhu, Xu Yao, Zhang Yuanzhe, Liu Kang, Liu Shengping, Zhao Jun
- Enhancing Reinforcement Learning With Label-sensitive Reward For Natural Language Understanding Liao Kuo, Li Shuang, Zhao Meng, Liu Liqun, Xue Mengge, Hu Zhenyu, Han Honglin, Yin Chengguo
- Align And Aggregate: Compositional Reasoning With Video Alignment And Answer Aggregation For Video Question-answering Liao Zhaohe, Li Jiangtong, Niu Li, Zhang Liqing
- E2LLM: Encoder Elongated Large Language Models For Long-context Understanding And Reasoning Liao Zihan, Wang Jun, Yu Hang, Wei Lingxiao, Li Jianguo, Wang Jun, Zhang Wei
- Adapting Mental Health Prediction Tasks For Cross-lingual Learning Via Meta-training And In-context Learning With Large Language Model Lifelo Zita, Ning Huansheng, Dhelim Sahraoui
- Wildbench: Benchmarking Llms With Challenging Tasks From Real Users In The Wild Lin Bill Yuchen, Deng Yuntian, Chandu Khyathi, Brahman Faeze, Ravichander Abhilasha, Pyatkin Valentina, Dziri Nouha, Bras Ronan Le, Choi Yejin
- Instructlayout: Instruction-driven 2D And 3D Layout Synthesis With Semantic Graph Prior Lin Chenguo, Lin Yuchen, Pan Panwang, Zhang Xuanyang, Mu Yadong
- Tree-based Hard Attention With Self-motivation For Large Language Models Lin Chenxi, Ren Jiayu, He Guoxiu, Jiang Zhuoren, Yu Haiyan, Zhu Xiaomin
- Emojicrypt: Prompt Encryption For Secure Communication With Large Language Models Lin Guo, Hua Wenyue, Zhang Yongfeng
- Token-wise Influential Training Data Retrieval For Large Language Models Lin Huawei, Long Jikai, Xu Zhaozhuo, Zhao Weijie
- Large Language Models Make Sample-efficient Recommender Systems Lin Jianghao, Dai Xinyi, Shan Rong, Chen Bo, Tang Ruiming, Yu Yong, Zhang Weinan
- Designprobe: A Graphic Design Benchmark For Multimodal Large Language Models Lin Jieru, Huang Danqing, Zhao Tiejun, Zhan Dechen, Lin Chin-yew
- Dogerm: Equipping Reward Models With Domain Knowledge Through Model Merging Lin Tzu-han, Li Chen-an, Lee Hung-yi, Chen Yun-nung
- Efficient LLM Training And Serving With Heterogeneous Context Sharding Among Attention Heads Lin Xihui, Zhang Yunan, Ge Suyu, Patra Barun, Chaudhary Vishrav, Peng Hao, Song Xia
- Interpretable User Satisfaction Estimation For Conversational Systems With Large Language Models Lin Ying-chun, Neville Jennifer, Stokes Jack W., Yang Longqi, Safavi Tara, Wan Mengting, Counts Scott, Suri Siddharth, Andersen Reid, Xu Xiaofeng, Gupta Deepak, Jauhar Sujay Kumar, Song Xia, Buscher Georg, Tiwary Saurabh, Hecht Brent, Teevan Jaime
- Qserve: W4A8KV4 Quantization And System Co-design For Efficient LLM Serving Lin Yujun, Tang Haotian, Yang Shang, Zhang Zhekai, Xiao Guangxuan, Gan Chuang, Han Song
- Boosting Multimodal Large Language Models With Visual Tokens Withdrawal For Rapid Inference Lin Zhihang, Lin Mingbao, Lin Luxi, Ji Rongrong
- The Colorful Future Of Llms: Evaluating And Improving Llms As Emotional Supporters For Queer Youth Lissak Shir, Calderon Nitay, Shenkman Geva, Ophir Yaakov, Fruchter Eyal, Klomek Anat Brunstein, Reichart Roi
- Progressively Selective Label Enhancement For Language Model Alignment Liu Biao, Xu Ning, Geng Xin
- Few-shot Class Incremental Learning With Attention-aware Self-adaptive Prompt Liu Chenxi, Wang Zhenyi, Xiong Tianyi, Chen Ruibo, Wu Yihan, Guo Junfeng, Huang Heng
- Openeval: Benchmarking Chinese Llms Across Capability, Alignment And Safety Liu Chuang, Yu Linhao, Li Jiaxuan, Jin Renren, Huang Yufei, Shi Ling, Zhang Junhui, Ji Xinmeng, Cui Tingting, Liu Tao, Song Jinwang, Zan Hongying, Li Sun, Xiong Deyi
- SPHINX-X: Scaling Data And Parameters For A Family Of Multi-modal Large Language Models Liu Dongyang, Zhang Renrui, Qiu Longtian, Huang Siyuan, Lin Weifeng, Zhao Shitian, Geng Shijie, Lin Ziyi, Jin Peng, Zhang Kaipeng, Shao Wenqi, Xu Chao, He Conghui, He Junjun, Shao Hao, Lu Pan, Li Hongsheng, Qiao Yu, Gao Peng
- Lumina-mgpt: Illuminate Flexible Photorealistic Text-to-image Generation With Multimodal Generative Pretraining Liu Dongyang, Zhao Shitian, Zhuo Le, Lin Weifeng, Qiao Yu, Li Hongsheng, Gao Peng
- A Practice-friendly Two-stage Llm-enhanced Paradigm In Sequential Recommendation Liu Dugang, Xian Shenxian, Lin Xiaolin, Zhang Xiaolian, Zhu Hong, Fang Yuan, Chen Zhen, Ming Zhong
- Exploring Vulnerabilities And Protections In Large Language Models: A Survey Liu Frank Weizhen, Hu Chenhui
- Pefomed: Parameter Efficient Fine-tuning Of Multimodal Large Language Models For Medical Imaging Liu Gang, He Jinlong, Li Pengfei, He Genrong, Chen Zhaolin, Zhong Shenjun
- Logic Agent: Enhancing Validity With Logic Rule Invocation Liu Hanmeng, Teng Zhiyang, Zhang Chaoli, Zhang Yue
- Augmenting Math Word Problems Via Iterative Question Composing Liu Haoxiong, Zhang Yifan, Luo Yifan, Yao Andrew Chi-chih
- Laida: Linguistics-aware In-context Learning With Data Augmentation For Metaphor Components Identification Liu Hongde, He Chenyuan, Meng Feiyang, Niu Changyong, Jia Yuxiang
- Advancing Adversarial Suffix Transfer Learning On Aligned Large Language Models Liu Hongfu, Xie Yuxi, Wang Ye, Shieh Michael
- Kangaroo: A Powerful Video-language Model Supporting Long-context Video Input Liu Jiajun, Wang Yibing, Ma Hanghang, Wu Xiaoping, Ma Xiaoqi, Wei Xiaoming, Jiao Jianbin, Wu Enhua, Hu Jie
- Automatic Generation Of Model And Data Cards: A Step Towards Responsible AI Liu Jiarui, Li Wenkai, Jin Zhijing, Diab Mona
- Tiny Refinements Elicit Resilience: Toward Efficient Prefix-model Against LLM Red-teaming Liu Jiaxu, Yin Xiangyu, Wu Sihao, Wang Jianhong, Fang Meng, Yi Xinping, Huang Xiaowei
- Iterative Length-regularized Direct Preference Optimization: A Case Study On Improving 7B Language Models To GPT-4 Level Liu Jie, Zhou Zhanhui, Liu Jiaheng, Bu Xingyuan, Yang Chao, Zhong Han-sen, Ouyang Wanli
- Large Language Model-based Agents For Software Engineering: A Survey Liu Junwei, Wang Kaixin, Chen Yixuan, Peng Xin, Chen Zhenpeng, Zhang Lingming, Lou Yiling
- Selectit: Selective Instruction Tuning For Large Language Models Via Uncertainty-aware Self-reflection Liu Liangxin, Liu Xuebo, Wong Derek F., Li Dongfang, Wang Ziyi, Hu Baotian, Zhang Min
- LLM4GEN: Leveraging Semantic Representation Of Llms For Text-to-image Generation Liu Mushui, Ma Yuhang, Zhen Yang, Dan Jun, Yu Yunlong, Zhao Zeng, Hu Zhipeng, Liu Bai, Fan Changjie
- Synthetic Context Generation For Question Generation Liu Naiming, Wang Zichao, Baraniuk Richard
- STORE: Streamlining Semantic Tokenization And Generative Recommendation With A Single LLM Liu Qijiong, Zhu Jieming, Fan Lu, Zhao Zhou, Wu Xiao-ming
- Latent Guard: A Safety Framework For Text-to-image Generation Liu Runtao, Khakzar Ashkan, Gu Jindong, Chen Qifeng, Torr Philip, Pizzati Fabio
- RL-GPT: Integrating Reinforcement Learning And Code-as-policy Liu Shaoteng, Yuan Haoqi, Hu Minda, Li Yanwei, Chen Yukang, Liu Shu, Lu Zongqing, Jia Jiaya
- Bi-chainer: Automated Large Language Models Reasoning With Bidirectional Chaining Liu Shuqi, He Bowei, Song Linqi
- Multi-group Uncertainty Quantification For Long-form Text Generation Liu Terrance, Wu Zhiwei Steven
- Toolace: Winning The Points Of LLM Function Calling Liu Weiwen, Huang Xu, Zeng Xingshan, Hao Xinlong, Yu Shuai, Li Dexun, Wang Shuai, Gan Weinan, Liu Zhengying, Yu Yuanqing, Wang Zezhong, Wang Yuxian, Ning Wu, Hou Yutai, Wang Bin, Wu Chuhan, Wang Xinzhi, Liu Yong, Wang Yasheng, Tang Duyu, Tu Dandan, Shang Lifeng, Jiang Xin, Tang Ruiming, Lian Defu, Liu Qun, Chen Enhong
- Demorank: Selecting Effective Demonstrations For Large Language Models In Ranking Task Liu Wenhan, Zhu Yutao, Dou Zhicheng
- Imposter.ai: Adversarial Attacks With Hidden Intentions Towards Aligned Large Language Models Liu Xiao, Li Liangzhi, Xiang Tong, Ye Fuying, Wei Lu, Li Wangyue, Garcia Noa
- SHIELD: Evaluation And Defense Strategies For Copyright Compliance In LLM Text Generation Liu Xiaoze, Sun Ting, Xu Tianyang, Wu Feijie, Wang Cunxiang, Wang Xiaoqian, Gao Jing
- Prompt Framework For Role-playing: Generation And Evaluation Liu Xun, Ni Zhengwei
- RA-ISF: Learning To Answer And Understand From Retrieval Augmentation Via Iterative Self-feedback Liu Yanming, Peng Xinyue, Zhang Xuhong, Liu Weihao, Yin Jianwei, Cao Jiannan, Du Tianyu
- Arondight: Red Teaming Large Vision Language Models With Auto-generated Multi-modal Jailbreak Prompts Liu Yi, Cai Chengjun, Zhang Xiaoli, Yuan Xingliang, Wang Cong
- From Summary To Action: Enhancing Large Language Models For Complex Tasks With Open World Apis Liu Yulong, Yuan Yunlong, Wang Chunwei, Han Jianhua, Ma Yongqiang, Zhang Li, Zheng Nanning, Xu Hang
- Hd-eval: Aligning Large Language Model Evaluators Through Hierarchical Criteria Decomposition Liu Yuxuan, Yang Tianchi, Huang Shaohan, Zhang Zihan, Huang Haizhen, Wei Furu, Deng Weiwei, Sun Feng, Zhang Qi
- Scaffolding Language Learning Via Multi-modal Tutoring Systems With Pedagogical Instructions Liu Zhengyuan, Yin Stella Xin, Lee Carolyn, Chen Nancy F.
- Personality-aware Student Simulation For Conversational Intelligent Tutoring Systems Liu Zhengyuan, Yin Stella Xin, Lin Geyu, Chen Nancy F.
- Agentlite: A Lightweight Library For Building And Advancing Task-oriented LLM Agent System Liu Zhiwei, Yao Weiran, Zhang Jianguo, Yang Liangwei, Liu Zuxin, Tan Juntao, Choubey Prafulla K., Lan Tian, Wu Jason, Wang Huan, Heinecke Shelby, Xiong Caiming, Savarese Silvio
- A Closer Look Into Mixture-of-experts In Large Language Models Lo Ka Man, Huang Zeyu, Qiu Zihan, Wang Zili, Fu Jie
- Large Language Models Know What Makes Exemplary Contexts Long Quanyu, Chen Jianda, Wang Wenya, Pan Sinno Jialin
- Generative Multi-modal Knowledge Retrieval With Large Language Models Long Xinwei, Zeng Jiali, Meng Fandong, Ma Zhiyuan, Zhang Kaiyan, Zhou Bowen, Zhou Jie
- Revisiting Multi-modal LLM Evaluation Lu Jian, Srivastava Shikhar, Chen Junyu, Shrestha Robik, Acharya Manoj, Kafle Kushal, Kanan Christopher
- YODA: Teacher-student Progressive Learning For Language Models Lu Jianqiao, Zhong Wanjun, Wang Yufei, Guo Zhijiang, Zhu Qi, Huang Wenyong, Wang Yanlin, Mi Fei, Wang Baojun, Wang Yasheng, Shang Lifeng, Jiang Xin, Liu Qun
- Toolsandbox: A Stateful, Conversational, Interactive Evaluation Benchmark For LLM Tool Use Capabilities Lu Jiarui, Holleis Thomas, Zhang Yizhe, Aumayer Bernhard, Nan Feng, Bai Felix, Ma Shuang, Ma Shen, Li Mengyu, Yin Guoli, Wang Zirui, Pang Ruoming
- Padellm-ner: Parallel Decoding In Large Language Models For Named Entity Recognition Lu Jinghui, Yang Ziwei, Wang Yanjie, Liu Xuejing, Mac Namee Brian, Huang Can
- FIPO: Free-form Instruction-oriented Prompt Optimization With Preference Dataset And Modular Fine-tuning Schema Lu Junru, An Siyu, Zhang Min, He Yulan, Yin Di, Sun Xing
- Datasculpt: Crafting Data Landscapes For LLM Post-training Through Multi-objective Partitioning Lu Keer, Liang Zheng, Nie Xiaonan, Pan Da, Zhang Shusen, Zhao Keshi, Chen Weipeng, Zhou Zenan, Dong Guosheng, Zhang Wentao, Cui Bin
- LLM Discussion: Enhancing The Creativity Of Large Language Models Via Discussion Framework And Role-play Lu Li-chun, Chen Shou-jen, Pai Tsung-min, Yu Chan-hung, Lee Hung-yi, Sun Shao-hua
- Multimodal Large Language Model Driven Scenario Testing For Autonomous Vehicles Lu Qiujing, Wang Xuanhan, Jiang Yiwei, Zhao Guangming, Ma Mingyue, Feng Shuo
- Blending Is All You Need: Cheaper, Better Alternative To Trillion-parameters LLM Lu Xiaoding, Liu Zongyi, Liusie Adian, Raina Vyas, Mudupalli Vineet, Zhang Yuwen, Beauchamp William
- Generative Students: Using Llm-simulated Student Profiles To Support Question Item Evaluation Lu Xinyi, Wang Xu
- Longheads: Multi-head Attention Is Secretly A Long Context Processor Lu Yi, Zhou Xin, He Wei, Zhao Jun, Ji Tao, Gui Tao, Zhang Qi, Huang Xuanjing
- Wildvision: Evaluating Vision-language Models In The Wild With Human Preferences Lu Yujie, Jiang Dongfu, Chen Wenhu, Wang William Yang, Choi Yejin, Lin Bill Yuchen
- Mini-sequence Transformer: Optimizing Intermediate Memory For Long Sequences Training Luo Cheng, Zhao Jiawei, Chen Zhuoming, Chen Beidi, Anandkumar Anima
- Crosstune: Black-box Few-shot Classification With Label Enhancement Luo Danqing, Zhang Chen, Zhang Yan, Li Haizhou
- Autom3l: An Automated Multimodal Machine Learning Framework With Large Language Models Luo Daqin, Feng Chengjian, Nong Yuxuan, Shen Yiqing
- Arena Learning: Build Data Flywheel For Llms Post-training Via Simulated Chatbot Arena Luo Haipeng, Sun Qingfeng, Xu Can, Zhao Pu, Lin Qingwei, Lou Jianguang, Chen Shifeng, Tang Yansong, Chen Weizhu
- Kuaiji: The First Chinese Accounting Large Language Model Luo Jiayuan, Yang Songhua, Qiu Xiaoling, Chen Panyu, Nai Yufei, Zeng Wenxuan, Zhang Wentao, Jiang Xinke
- End-to-end Neuro-symbolic Reinforcement Learning With Textual Explanations Luo Lirui, Zhang Guoxi, Xu Hongming, Yang Yaodong, Fang Cong, Li Qing
- M³GPT: An Advanced Multimodal, Multitask Framework For Motion Comprehension And Generation Luo Mingshuang, Hou Ruibing, Chang Hong, Liu Zimo, Wang Yaowei, Shan Shiguang
- Taking A Deep Breath: Enhancing Language Modeling Of Large Language Models With Sentinel Tokens Luo Weiyao, Zheng Suncong, Xia Heming, Wang Weikang, Lei Yan, Liu Tianyu, Chen Shuang, Sui Zhifang
- Duetsim: Building User Simulator With Dual Large Language Models For Task-oriented Dialogues Luo Xiang, Tang Zhiwen, Wang Jin, Zhang Xuejie
- Turning Trash Into Treasure: Accelerating Inference Of Large Language Models With Token Recycling Luo Xianzhen, Wang Yixuan, Zhu Qingfu, Zhang Zhiming, Zhang Xuanyu, Yang Qing, Xu Dongliang, Che Wanxiang
- Ensuring Safe And High-quality Outputs: A Guideline Library Approach For Language Models Luo Yi, Lin Zhenghao, Zhang Yuhao, Sun Jiashuo, Lin Chen, Xu Chengjin, Su Xiangdong, Shen Yelong, Guo Jian, Gong Yeyun
- Interpreting Key Mechanisms Of Factual Recall In Transformer-based Language Models Lv Ang, Chen Yuhan, Zhang Kaiyi, Wang Yulong, Liu Lifeng, Wen Ji-rong, Xie Jian, Yan Rui
- Codeact: Code Adaptive Compute-efficient Tuning Framework For Code Llms Lv Weijie, Xia Xuan, Huang Sheng-jun
- Beyond Probabilities: Unveiling The Misalignment In Evaluating Large Language Models Lyu Chenyang, Wu Minghao, Aji Alham Fikri
- X-reflect: Cross-reflection Prompting For Multimodal Recommendation Lyu Hanjia, Rossi Ryan, Chen Xiang, Tanjim Md Mehrab, Petrangeli Stefano, Sarkhel Somdeb, Luo Jiebo
- Retrieve-plan-generation: An Iterative Planning And Answering Framework For Knowledge-intensive LLM Generation Lyu Yuanjie, Niu Zihan, Xie Zheyong, Zhang Chao, Xu Tong, Wang Yang, Chen Enhong
- Agentboard: An Analytical Evaluation Board Of Multi-turn LLM Agents Ma Chang, Zhang Junlei, Zhu Zhihao, Yang Cheng, Yang Yujiu, Jin Yaohui, Lan Zhenzhong, Kong Lingpeng, He Junxian
- When Emotional Stimuli Meet Prompt Designing: An Auto-prompt Graphical Paradigm Ma Chenggian, Zhao Xiangyu, Zhang Chunhui, Qin Yanzhao, Zhang Wentao
- Sparsity-accelerated Training For Large Language Models Ma Da, Chen Lu, Wang Pengyu, Xu Hongshen, Li Hanqi, Sun Liangtai, Zhu Su, Fan Shuai, Yu Kai
- Debate On Graph: A Flexible And Reliable Reasoning Framework For Large Language Models Ma Jie, Gao Zhitao, Chai Qi, Sun Wangchun, Wang Pinghui, Pei Hongbin, Tao Jing, Song Lingyun, Liu Jun, Zhang Chen, Cui Lizhen
- Neighboring Perturbations Of Knowledge Editing On Large Language Models Ma Jun-yu, Ling Zhen-hua, Zhang Ningyu, Gu Jia-chen
- Perturbation-restrained Sequential Model Editing Ma Jun-yu, Wang Hong, Xu Hao-xiang, Ling Zhen-hua, Gu Jia-chen
- FBI-LLM: Scaling Up Fully Binarized Llms From Scratch Via Autoregressive Distillation Ma Liqun, Sun Mingjie, Shen Zhiqiang
- Think-on-graph 2.0: Deep And Interpretable Large Language Model Reasoning With Knowledge Graph-guided Retrieval Ma Shengjie, Xu Chengjin, Jiang Xuhui, Li Muzhi, Qu Huaren, Guo Jian
- Making Pre-trained Language Models Better Continual Few-shot Relation Extractors Ma Shengkun, Han Jiale, Liang Yi, Cheng Bo
- Sciagent: Tool-augmented Language Models For Scientific Reasoning Ma Yubo, Gou Zhibin, Hao Junheng, Xu Ruochen, Wang Shuohang, Pan Liangming, Yang Yujiu, Cao Yixin, Sun Aixin, Awadalla Hany, Chen Weizhu
- Compositional API Recommendation For Library-oriented Code Generation Ma Zexiong, An Shengnan, Xie Bing, Lin Zeqi
- Llamoco: Instruction Tuning Of Large Language Models For Optimization Code Generation Ma Zeyuan, Guo Hongshu, Chen Jiacheng, Peng Guojun, Cao Zhiguang, Ma Yining, Gong Yue-jiao
- Gerea: Question-aware Prompt Captions For Knowledge-based Visual Question Answering Ma Ziyu, Li Shutao, Sun Bin, Cai Jianfei, Long Zuxiang, Ma Fuyan
- Exploring The Impact Of The Output Format On The Evaluation Of Large Language Models For Code Translation Macedo Marcos, Tian Yuan, Cogo Filipe R., Adams Bram
- Explaining Genetic Programming Trees Using Large Language Models Maddigan Paula, Lensen Andrew, Xue Bing
- Large Language Models As Minecraft Agents Madge Chris, Poesio Massimo
- Are Self-explanations From Large Language Models Faithful? Madsen Andreas, Chandar Sarath, Reddy Siva
- Beyond Generative Artificial Intelligence: Roadmap For Natural Language Generation Maestre María Miró, Martínez-murillo Iván, Martin Tania J., Navarro-colorado Borja, Ferrández Antonio, Cueto Armando Suárez, Lloret Elena
- Hallucination-free? Assessing The Reliability Of Leading AI Legal Research Tools Magesh Varun, Surani Faiz, Dahl Matthew, Suzgun Mirac, Manning Christopher D., Ho Daniel E.
- VURF: A General-purpose Reasoning And Self-refinement Framework For Video Understanding Mahmood Ahmad, Vayani Ashmal, Naseer Muzammal, Khan Salman, Khan Fahad Shahbaz
- Exploring The Capabilities Of Prompted Large Language Models In Educational And Assessment Applications Maity Subhankar, Deroy Aniket, Sarkar Sudeshna
- Fairylandai: Personalized Fairy Tales Utilizing Chatgpt And DALLE-3 Makridis Georgios, Oikonomou Athanasios, Koukos Vasileios
- From Tarzan To Tolkien: Controlling The Language Proficiency Level Of Llms For Content Generation Malik Ali, Mayhew Stephen, Piech Chris, Bicknell Klinton
- ULLME: A Unified Framework For Large Language Model Embeddings With Generation-augmented Learning Man Hieu, Ngo Nghia Trung, Dernoncourt Franck, Nguyen Thien Huu
- IITK At Semeval-2024 Task 2: Exploring The Capabilities Of Llms For Safe Biomedical Natural Language Inference For Clinical Trials Mandal Shreyasi, Modi Ashutosh
- A Novel Nuanced Conversation Evaluation Framework For Large Language Models In Mental Health Marrapese Alexander, Suleiman Basem, Ullah Imdad, Kim Juno
- Beware Of Words: Evaluating The Lexical Richness Of Conversational Large Language Models Martínez Gonzalo, Hernández José Alberto, Conde Javier, Reviriego Pedro, Merino Elena
- Datastates-llm: Lazy Asynchronous Checkpointing For Large Language Models Maurya Avinash, Underwood Robert, Rafique M. Mustafa, Cappello Franck, Nicolae Bogdan
- Can Opensource Beat Chatgpt? -- A Comparative Study Of Large Language Models For Text-to-code Generation Mayer Luis, Heumann Christian, Aßenmacher Matthias
- Learning To Ask Informative Questions: Enhancing Llms With Preference Optimization And Expected Information Gain Mazzaccara Davide, Testoni Alberto, Bernardi Raffaella
- Improving Text-to-image Consistency Via Automatic Prompt Optimization Mañas Oscar, Astolfi Pietro, Hall Melissa, Ross Candace, Urbanek Jack, Williams Adina, Agrawal Aishwarya, Romero-soriano Adriana, Drozdzal Michal
- Inadequacies Of Large Language Model Benchmarks In The Era Of Generative Artificial Intelligence Mcintosh Timothy R., Susnjak Teo, Liu Tong, Watters Paul, Halgamuge Malka N.
- Metacheckgpt -- A Multi-task Hallucination Detector Using LLM Uncertainty And Meta-models Mehta Rahul, Hoblitzell Andrew, O'keefe Jack, Jang Hyeju, Varma Vasudeva
- Openelm: An Efficient Language Model Family With Open Training And Inference Framework Mehta Sachin, Sekhavat Mohammad Hossein, Cao Qingqing, Horton Maxwell, Jin Yanzi, Sun Chenfan, Mirzadeh Iman, Najibi Mahyar, Belenko Dmitry, Zatloukal Peter, Rastegari Mohammad
- "Not Aligned" Is Not "Malicious": Being Careful About Hallucinations Of Large Language Models' Jailbreak Mei Lingrui, Liu Shenghua, Wang Yiwei, Bi Baolong, Mao Jiayi, Cheng Xueqi
- On The Benchmarking Of Llms For Open-domain Dialogue Evaluation Mendonça John, Lavie Alon, Trancoso Isabel
- Soda-eval: Open-domain Dialogue Evaluation In The Age Of Llms Mendonça John, Trancoso Isabel, Lavie Alon
- Phybench: A Physical Commonsense Benchmark For Evaluating Text-to-image Models Meng Fanqing, Shao Wenqi, Luo Lixin, Wang Yahong, Chen Yiran, Lu Quanfeng, Yang Yue, Yang Tianshuo, Zhang Kaipeng, Qiao Yu, Luo Ping
- Logic-enhanced Language Model Agents For Trustworthy Social Simulations Mensfelt Agnieszka, Stathis Kostas, Trencsenyi Vince
- Transforming Wearable Data Into Health Insights Using Large Language Model Agents Merrill Mike A., Paruchuri Akshay, Rezaei Naghmeh, Kovacs Geza, Perez Javier, Liu Yun, Schenck Erik, Hammerquist Nova, Sunshine Jake, Tailor Shyam, Ayush Kumar, Su Hao-wei, He Qian, Mclean Cory Y., Malhotra Mark, Patel Shwetak, Zhan Jiening, Althoff Tim, Mcduff Daniel, Liu Xin
- A Comparison Of LLM Finetuning Methods & Evaluation Metrics With Travel Chatbot Use Case Meyer Sonia, Singh Shreya, Tam Bertha, Ton Christopher, Ren Angel
- Permitqa: A Benchmark For Retrieval Augmented Generation In Wind Siting And Permitting Domain Meyur Rounak, Phan Hung, Wagle Sridevi, Strube Jan, Halappanavar Mahantesh, Horawalavithana Sameera, Acharya Anurag, Munikoti Sai
- User Centric Evaluation Of Code Generation Tools Miah Tanha, Zhu Hong
- An Efficient Inference Framework For Early-exit Large Language Models Miao Ruijie, Yan Yihan, Yao Xinshuo, Yang Tong
- Smart Language Agents In Real-world Planning Miin Annabelle, Wei Timothy
- Natural Language As Policies: Reasoning For Coordinate-level Embodied Control With Llms Mikami Yusuke, Melnik Andrew, Miura Jun, Hautamäki Ville
- Velora: Memory Efficient Training Using Rank-1 Sub-token Projections Miles Roy, Reddy Pradyumna, Elezi Ismail, Deng Jiankang
- Large Language Models: A Survey Minaee Shervin, Mikolov Tomas, Nikzad Narjes, Chenaghlu Meysam, Socher Richard, Amatriain Xavier, Gao Jianfeng
- Are Large Language Models Superhuman Chemists? Mirza Adrian, Alampara Nawaf, Kunchapu Sreekanth, Emoekabu Benedict, Krishnan Aswanth, Wilhelmi Mara, Okereke Macjonathan, Eberhardt Juliane, Elahi Amir Mohammad, Greiner Maximilian, Holick Caroline T., Gupta Tanya, Asgari Mehrdad, Glaubitz Christina, Klepsch Lea C., Köster Yannik, Meyer Jakob, Miret Santiago, Hoffmann Tim, Kreth Fabian Alexander, Ringleb Michael, Roesner Nicole, Schubert Ulrich S., Stafast Leanne M., Wonanke Dinga, Pieler Michael, Schwaller Philippe, Jablonka Kevin Maik
- Multimodal Contrastive In-context Learning Miyanishi Yosuke, Nguyen Minh Le
- CBF-LLM: Safe Control For LLM Alignment Miyaoka Yuya, Inoue Masaki
- Convsdg: Session Data Generation For Conversational Search Mo Fengran, Yi Bole, Mao Kelong, Qu Chen, Huang Kaiyu, Nie Jian-yun
- Evaluating Large Language Models With Human Feedback: Establishing A Swedish Benchmark Moell Birger
- Llava-chef: A Multi-modal Generative Model For Food Recipes Mohbat Fnu, Zaki Mohammed J.
- Can We Trust Large Language Models Generated Code? A Framework For In-context Learning, Security Patterns, And Code Evaluations Across Diverse Llms Mohsin Ahmad, Janicke Helge, Wood Adrian, Sarker Iqbal H., Maglaras Leandros, Janjua Naeem
- Llm-based Frameworks For API Argument Filling In Task-oriented Conversational Systems Mok Jisoo, Kachuee Mohammad, Dai Shuyang, Ray Shayan, Taghavi Tara, Yoon Sungroh
- Kam-cot: Knowledge Augmented Multimodal Chain-of-thoughts Reasoning Mondal Debjyoti, Modi Suraj, Panda Subhadarshi, Singh Rituraj, Rao Godawari Sudhakar
- Towards Generating Informative Textual Description For Neurons In Language Models Mondal Shrayani, Garodia Rishabh, Qureshi Arbaaz, Lee Taesung, Park Youngja
- Repliqa: A Question-answering Dataset For Benchmarking Llms On Unseen Reference Content Monteiro Joao, Noel Pierre-andre, Marcotte Etienne, Rajeswar Sai, Zantedeschi Valentina, Vazquez David, Chapados Nicolas, Pal Christopher, Taslakian Perouz
- Synthetic Programming Elicitation And Repair For Text-to-code In Very Low-resource Programming Languages Mora Federico, Wong Justin, Lepe Haley, Bhatia Sahil, Elmaaroufi Karim, Varghese George, Gonzalez Joseph E., Polgreen Elizabeth, Seshia Sanjit A.
- Exploring The Landscape Of Large Language Models: Foundations, Techniques, And Challenges Moradi Milad, Yan Ke, Colwell David, Samwald Matthias, Asgari Rhona
- Langbite: A Platform For Testing Bias In Large Language Models Morales Sergio, Clarisó Robert, Cabot Jordi
- Novicode: Generating Programs From Natural Language Utterances By Novices Mordechai Asaf Achi, Goldberg Yoav, Tsarfaty Reut
- A Logical Fallacy-informed Framework For Argument Generation Mouchel Luca, Paul Debjit, Cui Shaobo, West Robert, Bosselut Antoine, Faltings Boi
- Grouse: A Benchmark To Evaluate Evaluators In Grounded Question Answering Muller Sacha, Loison António, Omrani Bilel, Viaud Gautier
- Chatgpt In Classrooms: Transforming Challenges Into Opportunities In Education Munawar Harris Bin, Misirlis Nikolaos
- Open-llm-leaderboard: From Multi-choice To Open-style Questions For Llms Evaluation, Benchmark, And Arena Myrzakhan Aidar, Bsharat Sondos Mahmoud, Shen Zhiqiang
- Negation Blindness In Large Language Models: Unveiling The NO Syndrome In Image Generation Nadeem Mohammad, Sohail Shahab Saquib, Cambria Erik, Schuller Björn W., Hussain Amir
- Learning Vs Retrieval: The Role Of In-context Examples In Regression With Llms Nafar Aliakbar, Venable Kristen Brent, Kordjamshidi Parisa
- Fakes Of Varying Shades: How Warning Affects Human Perception And Engagement Regarding LLM Hallucinations Nahar Mahjabin, Seo Haeseung, Lee Eun-ju, Xiong Aiping, Lee Dongwon
- Gemquad: Generating Multilingual Question Answering Datasets From Large Language Models Using Few Shot Learning Namboori Amani, Mangale Shivam, Rosenbaum Andy, Soltan Saleh
- Apprentices To Research Assistants: Advancing Research With Large Language Models Namvarpour M., Razi A.
- What If Red Can Talk? Dynamic Dialogue Generation Using Large Language Models Nananukul Navapat, Wongkamjan Wichayaporn
- Review-feedback-reason (refer): A Novel Framework For NLG Evaluation And Reasoning Narsupalli Yaswanth, Chandra Abhranil, Muppirala Sreevatsa, Gupta Manish, Goyal Pawan
- From Questions To Insightful Answers: Building An Informed Chatbot For University Resources Neupane Subash, Hossain Elias, Keith Jason, Tripathi Himanshu, Ghiasi Farbod, Golilarz Noorbakhsh Amiri, Amirlatifi Amin, Mittal Sudip, Rahimi Shahram
- Medinsight: A Multi-source Context Augmentation Framework For Generating Patient-centric Medical Responses Using Large Language Models Neupane Subash, Mitra Shaswata, Mittal Sudip, Golilarz Noorbakhsh Amiri, Rahimi Shahram, Amirlatifi Amin
- How Well Can Llms Echo Us? Evaluating AI Chatbots' Role-play Ability With ECHO Ng Man Tik, Tse Hui Tung, Huang Jen-tse, Li Jingjing, Wang Wenxuan, Lyu Michael R.
- Dude: Dual Distribution-aware Context Prompt Learning For Large Vision-language Model Nguyen Duy M. H., Le An T., Nguyen Trung Q., Diep Nghiem T., Nguyen Tai, Duong-tran Duy, Peters Jan, Shen Li, Niepert Mathias, Sonntag Daniel
- Text-to-drive: Diverse Driving Behavior Synthesis Via Large Language Models Nguyen Phat, Wang Tsun-hsuan, Hong Zhang-wei, Karaman Sertac, Rus Daniela
- An Empirical Study On Capability Of Large Language Models In Understanding Code Semantics Nguyen Thu-trang, Vu Thanh Trong, Vo Hieu Dinh, Nguyen Son
- Ceval: A Benchmark For Evaluating Counterfactual Text Generation Nguyen Van Bach, Schlötterer Jörg, Seifert Christin
- Mixeval: Deriving Wisdom Of The Crowd From LLM Benchmark Mixtures Ni Jinjie, Xue Fuzhao, Yue Xiang, Deng Yuntian, Shah Mahir, Jain Kabir, Neubig Graham, You Yang
- Layer-wise Regularized Dropout For Neural Language Models Ni Shiwen, Yang Min, Xu Ruifeng, Li Chengming, Hu Xiping
- The GPT Surprise: Offering Large Language Model Chat In A Massive Coding Class Reduced Engagement But Increased Adopters Exam Performances Nie Allen, Chandak Yash, Suzara Miroslav, Ali Malika, Woodrow Juliette, Peng Matt, Sahami Mehran, Brunskill Emma, Piech Chris
- BMIKE-53: Investigating Cross-lingual Knowledge Editing With In-context Learning Nie Ercong, Shao Bo, Ding Zifeng, Wang Mingyang, Schmid Helmut, Schütze Hinrich
- Llm-based Conversational AI Therapist For Daily Functioning Screening And Psychotherapeutic Intervention Via Everyday Smart Devices Nie Jingping, Shao Hanya, Fan Yuang, Shao Qijia, You Haoxuan, Preindl Matthias, Jiang Xiaofan
- WTU-EVAL: A Whether-or-not Tool Usage Evaluation Benchmark For Large Language Models Ning Kangyun, Su Yisong, Lv Xueqiang, Zhang Yuanzhe, Liu Jian, Liu Kang, Xu Jinan
- User-llm: Efficient LLM Contextualization With User Embeddings Ning Lin, Liu Luyang, Wu Jiaxing, Wu Neo, Berlowitz Devora, Prakash Sushant, Green Bradley, O'banion Shawn, Xie Jun
- Mobileflow: A Multimodal LLM For Mobile GUI Agent Nong Songqin, Zhu Jiali, Wu Rui, Jin Jiongchao, Shan Shuo, Huang Xiutian, Xu Wenhao
- Generative AI In EU Law: Liability, Privacy, Intellectual Property, And Cybersecurity Novelli Claudio, Casolari Federico, Hacker Philipp, Spedicato Giorgio, Floridi Luciano
- Improving Black-box Robustness With In-context Rewriting O'brien Kyle, Ng Nathan, Puri Isha, Mendez Jorge, Palangi Hamid, Kim Yoon, Ghassemi Marzyeh, Hartvigsen Thomas
- Robust Adaptation Of Foundation Models With Black-box Visual Prompting Oh Changdae, Seo Gyeongdeok, Jung Geunyoung, Cheng Zhi-qi, Choi Hosik, Jung Jiyoung, Song Kyungwoo
- Routellm: Learning To Route Llms With Preference Data Ong Isaac, Almahairi Amjad, Wu Vincent, Chiang Wei-lin, Wu Tianhao, Gonzalez Joseph E., Kadous M Waleed, Stoica Ion
- Large Language Models Are Self-taught Reasoners: Enhancing LLM Applications Via Tailored Problem-solving Demonstrations Ong Kai Tzu-iunn, Kwon Taeyoon, Yeo Jinyoung
- Towards Modular Llms By Building And Reusing A Library Of Loras Ostapenko Oleksiy, Su Zhan, Ponti Edoardo Maria, Charlin Laurent, Roux Nicolas Le, Pereira Matheus, Caccia Lucas, Sordoni Alessandro
- A Systematic Review Of Data-to-text NLG Osuji Chinonso Cynthia, Ferreira Thiago Castro, Davis Brian
- Large Language Models As Planning Domain Generators Oswald James, Srinivas Kavitha, Kokel Harsha, Lee Junkyu, Katz Michael, Sohrabi Shirin
- Lossless Acceleration Of Large Language Model Via Adaptive N-gram Parallel Decoding Ou Jie, Chen Yueming, Tian Wenhong
- Easyinstruct: An Easy-to-use Instruction Processing Framework For Large Language Models Ou Yixin, Zhang Ningyu, Gui Honghao, Xu Ziwen, Qiao Shuofei, Xue Yida, Fang Runnan, Liu Kangwei, Li Lei, Bi Zhen, Zheng Guozhou, Chen Huajun
- Multilingual Prompts In Llm-based Recommenders: Performance Across Languages Ozsoy Makbule Gulcin
- Distilling Large Language Models For Text-attributed Graph Learning Pan Bo, Zhang Zheng, Zhang Yifei, Hu Yuntong, Zhao Liang
- Dense Training, Sparse Inference: Rethinking Training Of Mixture-of-experts Language Models Pan Bowen, Shen Yikang, Liu Haokun, Mishra Mayank, Zhang Gaoyuan, Oliva Aude, Raffel Colin, Panda Rameswar
- Hierarchical Continual Reinforcement Learning Via Large Language Model Pan Chaofan, Yang Xin, Wang Hao, Wei Wei, Li Tianrui
- Dynathink: Fast Or Slow? A Dynamic Decision-making Framework For Large Language Models Pan Jiabao, Zhang Yan, Zhang Chen, Liu Zuozhu, Wang Hongwei, Li Haizhou
- Not All Contexts Are Equal: Teaching Llms Credibility-aware Generation Pan Ruotong, Cao Boxi, Lin Hongyu, Han Xianpei, Zheng Jia, Wang Sirui, Cai Xunliang, Sun Le
- The Solution For The AIGC Inference Performance Optimization Competition Pan Sishun, Xu Haonan, Wan Zhonghua, Yang Yang
- Preparing Lessons For Progressive Training On Language Models Pan Yu, Yuan Ye, Yin Yichun, Shi Jiaxin, Xu Zenglin, Zhang Ming, Shang Lifeng, Jiang Xin, Liu Qun
- Enhancing Repository-level Code Generation With Integrated Contextual Information Pan Zhiyuan, Hu Xing, Xia Xin, Yang Xiaohu
- Empowering Language Models With Active Inquiry For Deeper Understanding Pang Jing-cheng, Fan Heng-bo, Wang Pengyuan, Xiao Jia-hao, Tang Nan, Yang Si-hang, Jia Chengxing, Huang Sheng-jun, Yu Yang
- Self-alignment Of Large Language Models Via Monopolylogue-based Social Scene Simulation Pang Xianghe, Tang Shuo, Ye Rui, Xiong Yuxin, Zhang Bolun, Wang Yanfeng, Chen Siheng
- Picturing Ambiguity: A Visual Twist On The Winograd Schema Challenge Park Brendan, Janecek Madeline, Ezzati-jivan Naser, Li Yifeng, Emami Ali
- Understanding LLM Development Through Longitudinal Study: Insights From The Open Ko-llm Leaderboard Park Chanjun, Kim Hyeonwoo
- Open Ko-llm Leaderboard: Evaluating Large Language Models In Korean With Ko-h5 Benchmark Park Chanjun, Kim Hyeonwoo, Kim Dahyun, Cho Seonghwan, Kim Sanghoon, Lee Sukyung, Kim Yungi, Lee Hwalsuk
- Llms For Enhanced Agricultural Meteorological Recommendations Park Ji-jun, Choi Soo-joon
- M4CXR: Exploring Multi-task Potentials Of Multi-modal Large Language Models For Chest X-ray Interpretation Park Jonggwon, Kim Soobum, Yoon Byungmu, Hyun Jihun, Choi Kyoyun
- Too Many Frames, Not All Useful: Efficient Strategies For Long-form Video QA Park Jongwoo, Ranasinghe Kanchana, Kahatapitiya Kumara, Ryoo Wonjeong, Kim Donghyun, Ryoo Michael S.
- Building Trust In Mental Health Chatbots: Safety Metrics And Llm-based Evaluation Tools Park Jung In, Abbasian Mahyar, Azimi Iman, Bounds Dawn, Jun Angela, Han Jaesu, Mccarron Robert, Borelli Jessica, Li Jia, Mahmoudi Mona, Wiedenhoeft Carmen, Rahmani Amir
- Pre-trained Vision And Language Transformers Are Few-shot Incremental Learners Park Keon-hee, Song Kyungwoo, Park Gyeong-moon
- Any-precision LLM: Low-cost Deployment Of Multiple, Different-sized Llms Park Yeonhong, Hyun Jake, Cho Sanglyul, Sim Bonggeun, Lee Jae W.
- The Ultimate Guide To Fine-tuning Llms From Basics To Breakthroughs: An Exhaustive Review Of Technologies, Research, Best Practices, Applied Research Challenges And Opportunities Parthasarathy Venkatesh Balavadhani, Zafar Ahtsham, Khan Aafaq, Shahid Arsalan
- Evidence To Generate (E2G): A Single-agent Two-step Prompting For Context Grounded And Retrieval Augmented Reasoning Parvez Md Rizwan
- Large Language Models Can Self-improve At Web Agent Tasks Patel Ajay, Hofmarcher Markus, Leoveanu-condrei Claudiu, Dinu Marius-constantin, Callison-burch Chris, Hochreiter Sepp
- Automating Code Adaptation For Mlops -- A Benchmarking Study On Llms Patel Harsh, Ramanan Buvaneswari A., Khan Manzoor A., Williams Thomas, Friedman Brian, Drabeck Lawrence
- Goex: Perspectives And Designs Towards A Runtime For Autonomous LLM Applications Patil Shishir G., Zhang Tianjun, Fang Vivian, C. Noppapon, Huang Roy, Hao Aaron, Casado Martin, Gonzalez Joseph E., Popa Raluca Ada, Stoica Ion
- Aptly: Making Mobile Apps From Natural Language Patton Evan W., Kim David Y. J., Granquist Ashley, Liu Robin, Scott Arianna, Zamanova Jennet, Abelson Harold
- Making Reasoning Matter: Measuring And Improving Faithfulness Of Chain-of-thought Reasoning Paul Debjit, West Robert, Bosselut Antoine, Faltings Boi
- Advprompter: Fast Adaptive Adversarial Prompting For Llms Paulus Anselm, Zharmagambetov Arman, Guo Chuan, Amos Brandon, Tian Yuandong
- Generative AI As A Metacognitive Agent: A Comparative Mixed-method Study With Human Participants On Icf-mimicking Exam Performance Pavlovic Jelena, Krstic Jugoslav, Mitrovic Luka, Babic Djordje, Milosavljevic Adrijana, Nikolic Milena, Karaklic Tijana, Mitrovic Tijana
- The Effectiveness Of Llms As Annotators: A Comparative Overview And Empirical Analysis Of Direct Representation Pavlovic Maja, Poesio Massimo
- Autonomous Workflow For Multimodal Fine-grained Training Assistants Towards Mixed Reality Pei Jiahuan, Viola Irene, Huang Haochen, Wang Junxiao, Ahsan Moonisa, Ye Fanghua, Yiming Jiang, Sai Yao, Wang Di, Chen Zhumin, Ren Pengjie, Cesar Pablo
- Humaneval-xl: A Multilingual Code Generation Benchmark For Cross-lingual Natural Language Generalization Peng Qiwei, Chai Yekun, Li Xuhong
- AI Revolution On Chat Bot: Evidence From A Randomized Controlled Experiment Peng Sida, Swiatek Wojciech, Gao Allen, Cullivan Paul, Chang Haoge
- Uncertainty-aware Explainable Recommendation With Large Language Models Peng Yicui, Chen Hao, Lin Chingsheng, Huang Guo, Hu Jinrong, Guo Hui, Kong Bin, Hu Shu, Wu Xi, Wang Xin
- Check-eval: A Checklist-based Approach For Evaluating Text Quality Pereira Jayr, Assumpcao Andre, Lotufo Roberto
- Optimizing Large Language Models For Openapi Code Completion Petryshyn Bohdan, Lukoševičius Mantas
- H2o-danube3 Technical Report Pfeiffer Pascal, Singer Philipp, Babakhin Yauhen, Fodor Gabor, Dhankhar Nischay, Ambati Sri Satish
- Towards Reliable Medical Question Answering: Techniques And Challenges In Mitigating Hallucinations In Language Models Pham Duy Khoa, Vo Bao Quoc
- Coherent Zero-shot Visual Instruction Generation Phung Quynh, Ge Songwei, Huang Jia-bin
- Beyond Metrics: A Critical Analysis Of The Variability In Large Language Model Evaluation Frameworks Pimentel Marco Af, Christophe Clément, Raha Tathagata, Munjal Prateek, Kanithi Praveen K, Khan Shadab
- Doccgen: Document-based Controlled Code Generation Pimparkhede Sameer, Kammakomati Mehant, Tamilselvam Srikanth, Kumar Prince, Kumar Ashok Pon, Bhattacharyya Pushpak
- Revisiting Verilogeval: Newer Llms, In-context Learning, And Specification-to-rtl Tasks Pinckney Nathaniel, Batten Christopher, Liu Mingjie, Ren Haoxing, Khailany Brucek
- Reasoning With Large Language Models, A Survey Plaat Aske, Wong Annie, Verberne Suzan, Broekens Joost, Van Stein Niki, Back Thomas
- LM-PUB-QUIZ: A Comprehensive Framework For Zero-shot Evaluation Of Relational Knowledge In Language Models Ploner Max, Wiland Jacek, Pohl Sebastian, Akbik Alan
- Dissecting Language Models: Machine Unlearning Via Selective Pruning Pochinkov Nicholas, Schoots Nandi
- V-RECS, A Low-cost LLM4VIS Recommender With Explanations, Captioning And Suggestions Podo Luca, Angelini Marco, Velardi Paola
- Vi(e)va LLM! A Conceptual Stack For Evaluating And Interpreting Generative Ai-based Visualizations Podo Luca, Ishmal Muhammad, Angelini Marco
- Tinybenchmarks: Evaluating Llms With Fewer Examples Polo Felipe Maia, Weber Lucas, Choshen Leshem, Sun Yuekai, Xu Gongjun, Yurochkin Mikhail
- Machine Translation With Large Language Models: Prompt Engineering For Persian, English, And Russian Directions Pourkamali Nooshin, Sharifi Shler Ebrahim
- Large Language Models Meet User Interfaces: The Case Of Provisioning Feedback Pozdniakov Stanislav, Brazil Jonathan, Abdi Solmaz, Bakharia Aneesha, Sadiq Shazia, Gasevic Dragan, Denny Paul, Khosravi Hassan
- Convkgyarn: Spinning Configurable And Scalable Conversational Knowledge Graph QA Datasets With Large Language Models Pradeep Ronak, Lee Daniel, Mousavi Ali, Pound Jeff, Sang Yisi, Lin Jimmy, Ilyas Ihab, Potdar Saloni, Arefiyan Mostafa, Li Yunyao
- Interactions With Prompt Problems: A New Way To Teach Programming With Large Language Models Prather James, Denny Paul, Leinonen Juho, Smith David H. Iv, Reeves Brent N., Macneil Stephen, Becker Brett A., Luxton-reilly Andrew, Amarouche Thezyrie, Kimmel Bailey
- Fractured-sorry-bench: Framework For Revealing Attacks In Conversational Turns Undermining Refusal Efficacy And Defenses Over Sorry-bench Priyanshu Aman, Vijay Supriti
- Helpful Or Harmful? Exploring The Efficacy Of Large Language Models For Online Grooming Prevention Prosser Ellie, Edwards Matthew
- Customized Retrieval Augmented Generation And Benchmarking For EDA Tool Documentation QA Pu Yuan, He Zhuolun, Qiu Tairu, Wu Haoyuan, Yu Bei
- A Moral Imperative: The Need For Continual Superalignment Of Large Language Models Puthumanaillam Gokul, Vora Manav, Thangeda Pranay, Ornik Melkior
- Interactive Continual Learning: Fast And Slow Thinking Qi Biqing, Chen Xingquan, Gao Junqi, Li Dong, Liu Jianxing, Wu Ligang, Zhou Bowen
- SNIFFER: Multimodal Large Language Model For Explainable Out-of-context Misinformation Detection Qi Peng, Yan Zehong, Hsu Wynne, Lee Mong Li
- Data Augmentation Integrating Dialogue Flow And Style To Adapt Spoken Dialogue Systems To Low-resource User Groups Qi Zhiyang, Inaba Michimasa
- Iterative Experience Refinement Of Software-developing Agents Qian Chen, Li Jiahao, Dang Yufan, Liu Wei, Wang Yifei, Xie Zihao, Chen Weize, Yang Cheng, Zhang Yingli, Liu Zhiyuan, Sun Maosong
- Memorag: Moving Towards Next-gen RAG Via Memory-inspired Knowledge Discovery Qian Hongjin, Zhang Peitian, Liu Zheng, Mao Kelong, Dou Zhicheng
- LLM4SBR: A Lightweight And Effective Framework For Integrating Large Language Models In Session-based Recommendation Qiao Shutong, Gao Chen, Wen Junhao, Zhou Wei, Luo Qun, Chen Peixuan, Li Yong
- Prism: A Framework For Decoupling And Assessing The Capabilities Of Vlms Qiao Yuxuan, Duan Haodong, Fang Xinyu, Yang Junming, Chen Lin, Zhang Songyang, Wang Jiaqi, Lin Dahua, Chen Kai
- Diffusiongpt: Llm-driven Text-to-image Generation System Qin Jie, Wu Jie, Chen Weifeng, Ren Yuxi, Li Huixia, Wu Hefeng, Xiao Xuefeng, Wang Rui, Wen Shilei
- Enhancing Sequential Recommendations Through Multi-perspective Reflections And Iteration Qin Weicong, Xu Yi, Yu Weijie, Shen Chenglei, Zhang Xiao, He Ming, Fan Jianping, Xu Jun
- Infobench: Evaluating Instruction Following Ability In Large Language Models Qin Yiwei, Song Kaiqiang, Hu Yebowen, Yao Wenlin, Cho Sangwoo, Wang Xiaoyang, Wu Xuansheng, Liu Fei, Liu Pengfei, Yu Dong
- The Synergy Between Data And Multi-modal Large Language Models: A Survey From Co-development Perspective Qin Zhen, Chen Daoyuan, Zhang Wenhao, Yao Liuyi, Huang Yilun, Ding Bolin, Li Yaliang, Deng Shuiguang
- LAMPO: Large Language Models As Preference Machines For Few-shot Ordinal Classification Qin Zhen, Wu Junru, Shen Jiaming, Liu Tianqi, Wang Xuanhui
- Tool Learning With Large Language Models: A Survey Qu Changle, Dai Sunhao, Wei Xiaochi, Cai Hengyi, Wang Shuaiqiang, Yin Dawei, Xu Jun, Wen Ji-rong
- Unsupervised Distractor Generation Via Large Language Model Distilling And Counterfactual Contrastive Decoding Qu Fanyi, Sun Hao, Wu Yunfang
- Before Generation, Align It! A Novel And Effective Strategy For Mitigating Hallucinations In Text-to-sql Generation Qu Ge, Li Jinyang, Li Bowen, Qin Bowen, Huo Nan, Ma Chenhao, Cheng Reynold
- Tokenrec: Learning To Tokenize ID For Llm-based Generative Recommendation Qu Haohao, Fan Wenqi, Zhao Zihuai, Li Qing
- DEEP-ICL: Definition-enriched Experts For Language Model In-context Learning Qu Xingwei, Liang Yiming, Wang Yucheng, Zheng Tianyu, Yue Tommy, Ma Lei, Huang Stephen W., Zhang Jiajun, Shi Yinan, Lin Chenghua, Fu Jie, Zhang Ge
- Coursegpt-zh: An Educational Large Language Model Based On Knowledge Distillation Incorporating Prompt Optimization Qu Zheyan, Yin Lu, Yu Zitong, Wang Wenbo, Zhang Xing
- Enhancing Ethical Explanations Of Large Language Models Through Iterative Symbolic Refinement Quan Xin, Valentino Marco, Dennis Louise A., Freitas André
- Verification And Refinement Of Natural Language Explanations Through Llm-symbolic Theorem Proving Quan Xin, Valentino Marco, Dennis Louise A., Freitas André
- An Empirical Study On Self-correcting Large Language Models For Data Science Code Generation Quoc Thai Tang, Minh Duc Ha, Thanh Tho Quan, Nguyen-duc Anh
- Llms Among Us: Generative AI Participating In Digital Discourse Radivojevic Kristina, Clark Nicholas, Brenner Paul
- V-zen: Efficient GUI Understanding And Precise Grounding With A Novel Multimodal LLM Rahman Abdur, Chawla Rajat, Kumar Muskaan, Datta Arkajit, Jha Adarsh, Ns Mukunda, Bhola Ishaan
- Question-based Retrieval Using Atomic Units For Enterprise RAG Raina Vatsal, Gales Mark
- Knowledge-based Consistency Testing Of Large Language Models Rajan Sai Sathiesh, Soremekun Ezekiel, Chattopadhyay Sudipta
- Constructing Domain-specific Evaluation Sets For Llm-as-a-judge Raju Ravi, Jain Swayambhoo, Li Bo, Li Jonathan, Thakker Urmish
- Scavenging Hyena: Distilling Transformers Into Long Convolution Models Ralambomihanta Tokiniaina Raharison, Mohammadzadeh Shahrad, Islam Mohammad Sami Nur, Jabbour Wassim, Liang Laurence
- Animate, Or Inanimate, That Is The Question For Large Language Models Ranaldi Leonardo, Pucci Giulia, Zanzotto Fabio Massimo
- Learning To Localize Objects Improves Spatial Reasoning In Visual-llms Ranasinghe Kanchana, Shukla Satya Narayan, Poursaeed Omid, Ryoo Michael S., Lin Tsung-yu
- A Fine-tuning Enhanced RAG System With Quantized Influence Measure As AI Judge Rangan Keshav, Yin Yiqiao
- RAVEN: Multitask Retrieval Augmented Vision-language Learning Rao Varun Nagaraj, Choudhary Siddharth, Deshpande Aditya, Satzoda Ravi Kumar, Appalaraju Srikar
- FACTOID: Factual Entailment For Hallucination Detection Rawte Vipula, Tonmoy S. M Towhidul Islam, Rajbangshi Krishnav, Nag Shravani, Chadha Aman, Sheth Amit P., Das Amitava
- FAIR Enough: How Can We Develop And Assess A Fair-compliant Dataset For Large Language Models' Training? Raza Shaina, Ghuge Shardul, Ding Chen, Dolatabadi Elham, Pandya Deval
- MBIAS: Mitigating Bias In Large Language Models While Retaining Context Raza Shaina, Raval Ananya, Chatrath Veronica
- Optimizing Visual Question Answering Models For Driving: Bridging The Gap Between Human And Machine Attention Patterns Rekanar Kaavya, Hayes Martin, Sistu Ganesh, Eising Ciaran
- Codeattack: Revealing Safety Generalization Challenges Of Large Language Models Via Code Completion Ren Qibing, Gao Chang, Shao Jing, Yan Junchi, Tan Xin, Lam Wai, Ma Lizhuang
- Analyzing And Reducing Catastrophic Forgetting In Parameter Efficient Tuning Ren Weijieying, Li Xinlong, Wang Lei, Zhao Tianxiang, Qin Wei
- Do Large Language Models Mirror Cognitive Language Processing? Ren Yuqi, Jin Renren, Zhang Tongxuan, Xiong Deyi
- Healthcare Copilot: Eliciting The Power Of General Llms For Medical Consultation Ren Zhiyao, Zhan Yibing, Yu Baosheng, Ding Liang, Tao Dacheng
- Can Language Models Recognize Convincing Arguments? Rescala Paula, Ribeiro Manoel Horta, Hu Tiancheng, West Robert
- Practical And Reproducible Symbolic Music Generation By Large Language Models With Structural Embeddings Rhyu Seungyeon, Yang Kichang, Cho Sungjun, Kim Jaehyeon, Lee Kyogu, Lee Moontae
- Code Generation With Alphacodium: From Prompt Engineering To Flow Engineering Ridnik Tal, Kredo Dedy, Friedman Itamar
- Pretrained Hybrids With MAD Skills Roberts Nicholas, Guo Samuel, Gao Zhiqi, Gnvv Satya Sai Srinath Namburi, Cromp Sonia, Wu Chengjun, Duan Chengyu, Sala Frederic
- ACCORD: Closing The Commonsense Measurability Gap Roewer-després François, Feng Jinyue, Zhu Zining, Rudzicz Frank
- Exploring Advanced Large Language Models With Llmsuite Roffo Giorgio
- Question-instructed Visual Descriptions For Zero-shot Video Question Answering Romero David, Solorio Thamar
- Do GPT Language Models Suffer From Split Personality Disorder? The Advent Of Substrate-free Psychometrics Romero Peter, Fitz Stephen, Nakatsuma Teruo
- An Early Categorization Of Prompt Injection Attacks On Large Language Models Rossi Sippo, Michel Alisia Marianne, Mukkamala Raghava Rao, Thatcher Jason Bennett
- Achieving Peak Performance For Large Language Models: A Systematic Review Rostam Zhyar Rzgar K, Szénási Sándor, Kertész Gábor
- CONFLARE: Conformal Large Language Model Retrieval Rouzrokh Pouria, Faghani Shahriar, Gamble Cooper U., Shariatnia Moein, Erickson Bradley J.
- Enhancing Low-resource NMT With A Multilingual Encoder And Knowledge Distillation: A Case Study Roy Aniruddha, Ray Pretam, Maheshwari Ayush, Sarkar Sudeshna, Goyal Pawan
- ERATTA: Extreme RAG For Table To Answers With Large Language Models Roychowdhury Sohini, Krema Marko, Mahammad Anvar, Moore Brian, Mukherjee Arijit, Prakashchandra Punit
- Fist-financial Style Transfer With Hallucination And Creativity Control Framework Roychowdhury Sohini, Krema Marko, Moore Brian, Lai Xingjian, Effedua Dike, Jethwani Bharat
- Evaluation Of RAG Metrics For Question Answering In The Telecom Domain Roychowdhury Sujoy, Soman Sumit, Ranjani H G, Gunda Neeraj, Chhabra Vansh, Bala Sai Krishna
- Explorations Of Self-repair In Language Models Rushing Cody, Nanda Neel
- Writing In The Margins: Better Inference Pattern For Long Context Retrieval Russak Melisa, Jamil Umar, Bryant Christopher, Kamble Kiran, Magnuson Axel, Russak Mateusz, Alshikh Waseem
- Code-aware Prompting: A Study Of Coverage Guided Test Generation In Regression Setting Using LLM Ryan Gabriel, Jain Siddhartha, Shang Mingyue, Wang Shiqi, Ma Xiaofei, Ramanathan Murali Krishna, Ray Baishakhi
- Safetyprompts: A Systematic Review Of Open Datasets For Evaluating And Improving Large Language Model Safety Röttger Paul, Pernisi Fabio, Vidgen Bertie, Hovy Dirk
- Tandem Transformers For Inference Efficient Llms S Aishwarya P, Nair Pranav Ajit, Samaga Yashas, Boyd Toby, Kumar Sanjiv, Jain Prateek, Netrapalli Praneeth
- Athena: Safe Autonomous Agents With Verbal Contrastive Learning Sadhu Tanmana, Pesaranghader Ali, Chen Yanan, Yi Dong Hoon
- Vision-language And Large Language Model Performance In Gastroenterology: GPT, Claude, Llama, Phi, Mistral, Gemma, And Quantized Models Safavi-naini Seyed Amir Ahmad, Ali Shuhaib, Shahab Omer, Shahhoseini Zahra, Savage Thomas, Rafiee Sara, Samaan Jamil S, Shabeeb Reem Al, Ladak Farah, Yang Jamie O, Echavarria Juan, Babar Sumbal, Shaukat Aasma, Margolis Samuel, Tatonetti Nicholas P, Nadkarni Girish, Kurdi Bara El, Soroush Ali
- A Systematic Survey Of Prompt Engineering In Large Language Models: Techniques And Applications Sahoo Pranab, Singh Ayush Kumar, Saha Sriparna, Jain Vinija, Mondal Samrat, Chadha Aman
- Mixsumm: Topic-based Data Augmentation Using Llms For Low-resource Extractive Text Summarization Sahu Gaurav, Laradji Issam H.
- Pelican: Correcting Hallucination In Vision-llms Via Claim Decomposition And Program Of Thought Verification Sahu Pritish, Sikka Karan, Divakaran Ajay
- LUNA: A Framework For Language Understanding And Naturalness Assessment Saidov Marat, Bakalova Aleksandra, Taktasheva Ekaterina, Mikhailov Vladislav, Artemova Ekaterina
- Risks, Causes, And Mitigations Of Widespread Deployments Of Large Language Models (llms): A Survey Sakib Md Nazmus, Islam Md Athikul, Pathak Royal, Arifin Md Mashrur
- Algorithmic Language Models With Neurally Compiled Libraries Saldyt Lucas, Kambhampati Subbarao
- Large Language Models Show Human-like Social Desirability Biases In Survey Responses Salecha Aadesh, Ireland Molly E., Subrahmanya Shashanka, Sedoc João, Ungar Lyle H., Eichstaedt Johannes C.
- Zero-shot RTL Code Generation With Attention Sink Augmented Large Language Models Sandal Selim, Akturk Ismail
- Retrieval-augmented Generation Meets Data-driven Tabula Rasa Approach For Temporal Knowledge Graph Forecasting Sannidhi Geethan, Sakhinana Sagar Srinivas, Runkana Venkataramana
- Grapheval: A Knowledge-graph Based LLM Hallucination Evaluation Framework Sansford Hannah, Richardson Nicholas, Maretic Hermina Petric, Saada Juba Nait
- ICAL: Continual Learning Of Multimodal Agents By Transforming Trajectories Into Actionable Insights Sarch Gabriel, Jang Lawrence, Tarr Michael J., Cohen William W., Marino Kenneth, Fragkiadaki Katerina
- HELPER-X: A Unified Instructable Embodied Agent To Tackle Four Interactive Vision-language Domains With Memory-augmented Language Models Sarch Gabriel, Somani Sahil, Kapoor Raghav, Tarr Michael J., Fragkiadaki Katerina
- Llms As On-demand Customizable Service Sarkar Souvika, Babar Mohammad Fakhruddin, Hasan Monowar, Karmaker Shubhra Kanti Santu
- Textmachina: Seamless Generation Of Machine-generated Text Datasets Sarvazyan Areg Mikael, González José Ángel, Franco-salvador Marc
- Improving The Validity Of Automatically Generated Feedback Via Reinforcement Learning Scarlatos Alexander, Smith Digory, Woodhead Simon, Lan Andrew
- Impacts Of Anthropomorphizing Large Language Models In Learning Environments Schaaff Kristina, Heidelmann Marc-andré
- The Human Factor In Detecting Errors Of Large Language Models: A Systematic Literature Review And Future Research Directions Schiller Christian A.
- Symbolic Prompt Program Search: A Structure-aware Approach To Efficient Compile-time Prompt Optimization Schnabel Tobias, Neville Jennifer
- M5 -- A Diverse Benchmark To Assess The Performance Of Large Multimodal Models Across Multilingual And Multicultural Vision-language Tasks Schneider Florian, Sitaram Sunayana
- Exploring Human-llm Conversations: Mental Models And The Originator Of Toxicity Schneider Johannes, Flores Arianna Casanova, Kranz Anne-catherine
- Metacognitive Myopia In Large Language Models Scholten Florian, Rebholz Tobias R., Hütter Mandy
- THREAD: Thinking Deeper With Recursive Spawning Schroeder Philip, Morgan Nathaniel, Luo Hongyin, Glass James
- Evaluating Large Language Models With Fmeval Schwöbel Pola, Franceschi Luca, Zafar Muhammad Bilal, Vasist Keerthan, Malhotra Aman, Shenhar Tomer, Tailor Pinal, Yilmaz Pinar, Diamond Michael, Donini Michele
- Quantitative Knowledge Retrieval From Large Language Models Selby David, Spriestersbach Kai, Iwashita Yuichiro, Bappert Dennis, Warrier Archana, Mukherjee Sumantrak, Asim Muhammad Nabeel, Kise Koichi, Vollmer Sebastian
- Train-attention: Meta-learning Where To Focus In Continual Knowledge Learning Seo Yeongbin, Lee Dongha, Yeo Jinyoung
- Ai-augmented Brainwriting: Investigating The Use Of Llms In Group Ideation Shaer Orit, Cooper Angelora, Mokryn Osnat, Kun Andrew L., Shoshan Hagit Ben
- Ai-assisted Generation Of Difficult Math Questions Shah Vedant, Yu Dingli, Lyu Kaifeng, Park Simon, Ke Nan Rosemary, Mozer Michael, Bengio Yoshua, Arora Sanjeev, Goyal Anirudh
- Putting Gpt-4o To The Sword: A Comprehensive Evaluation Of Language, Vision, Speech, And Multimodal Proficiency Shahriar Sakib, Lund Brady, Mannuru Nishith Reddy, Arshad Muhammad Arbab, Hayawi Kadhim, Bevara Ravi Varma Kumar, Mannuru Aashrith, Batool Laiba
- Synergy-of-thoughts: Eliciting Efficient Reasoning In Hybrid Language Models Shang Yu, Li Yu, Xu Fengli, Li Yong
- Patched MOA: Optimizing Inference For Diverse Software Development Tasks Sharma Asankhaya
- A Preliminary Study On Using Large Language Models In Software Pentesting Shashwat Kumar, Hahn Francis, Ou Xinming, Goldgof Dmitry, Hall Lawrence, Ligatti Jay, Rajgopalan S. Raj, Tabari Armin Ziaie
- MAPO: Advancing Multilingual Reasoning Through Multilingual Alignment-as-preference Optimization She Shuaijie, Zou Wei, Huang Shujian, Zhu Wenhao, Liu Xiang, Geng Xiang, Chen Jiajun
- Nemo-aligner: Scalable Toolkit For Efficient Model Alignment Shen Gerald, Wang Zhilin, Delalleau Olivier, Zeng Jiaqi, Dong Yi, Egert Daniel, Sun Shengyang, Zhang Jimmy, Jain Sahil, Taghibakhshi Ali, Ausin Markel Sanz, Aithal Ashwath, Kuchaiev Oleksii
- Rapid Optimization For Jailbreaking Llms Via Subconscious Exploitation And Echopraxia Shen Guangyu, Cheng Siyuan, Zhang Kaiyuan, Tao Guanhong, An Shengwei, Yan Lu, Zhang Zhuo, Ma Shiqing, Zhang Xiangyu
- Shortcutsbench: A Large-scale Real-world Benchmark For Api-based Agents Shen Haiyang, Li Yue, Meng Desong, Cai Dongqi, Qi Sheng, Zhang Li, Xu Mengwei, Ma Yun
- Tag-llm: Repurposing General-purpose Llms For Specialized Domains Shen Junhong, Tenenholtz Neil, Hall James Brian, Alvarez-melis David, Fusi Nicolo
- Pyramid Coder: Hierarchical Code Generator For Compositional Visual Question Answering Shen Ruoyue, Inoue Nakamasa, Shinoda Koichi
- Exploring User Retrieval Integration Towards Large Language Models For Cross-domain Sequential Recommendation Shen Tingjia, Wang Hao, Zhang Jiaqing, Zhao Sirui, Li Liangyue, Chen Zulong, Lian Defu, Chen Enhong
- Small Llms Are Weak Tool Learners: A Multi-llm Agent Shen Weizhou, Li Chenliang, Chen Hongzhan, Yan Ming, Quan Xiaojun, Chen Hehong, Zhang Ji, Huang Fei
- Think-then-act: A Dual-angle Evaluated Retrieval-augmented Generation Shen Yige, Jiang Hao, Qu Hua, Zhao Jihong
- Toursynbio: A Multi-modal Large Model And Agent Framework To Bridge Text And Protein Sequences For Protein Engineering Shen Yiqing, Chen Zan, Mamalakis Michail, Liu Yungeng, Li Tianbin, Su Yanzhou, He Junjun, Liò Pietro, Wang Yu Guang
- The Fire Thief Is Also The Keeper: Balancing Usability And Privacy In Prompts Shen Zhili, Xi Zihang, He Ying, Tong Wei, Hua Jingyu, Zhong Sheng
- Beyond Human Preferences: Exploring Reinforcement Learning Trajectory Evaluation And Improvement Through Llms Shen Zichao, Zhu Tianchen, Sun Qingyun, Gao Shiqi, Li Jianxin
- From Language Models To Practical Self-improving Computer Agents Sheng Alex
- Language Models Encode Collaborative Signals In Recommendation Sheng Leheng, Zhang An, Zhang Yi, Chen Yuxin, Wang Xiang, Chua Tat-seng
- Repeval: Effective Text Evaluation With LLM Representation Sheng Shuqian, Xu Yi, Zhang Tianhang, Shen Zanwei, Fu Luoyi, Ding Jiaxin, Zhou Lei, Wang Xinbing, Zhou Chenghu
- IRCAN: Mitigating Knowledge Conflicts In LLM Generation Via Identifying And Reweighting Context-aware Neurons Shi Dan, Jin Renren, Shen Tianhao, Dong Weilong, Wu Xinwei, Xiong Deyi
- Opex: A Component-wise Analysis Of Llm-centric Agents In Embodied Instruction Following Shi Haochen, Sun Zhiyuan, Yuan Xingdi, Côté Marc-alexandre, Liu Bang
- Judging The Judges: A Systematic Investigation Of Position Bias In Pairwise Comparative Assessments By Llms Shi Lin, Ma Chiyu, Ma Weicheng, Vosoughi Soroush
- Reslora: Identity Residual Mapping In Low-rank Adaption Shi Shuhua, Huang Shaohan, Song Minghui, Li Zhoujun, Zhang Zihan, Huang Haizhen, Wei Furu, Deng Weiwei, Sun Feng, Zhang Qi
- Wildfeedback: Aligning Llms With In-situ User Interactions And Feedback Shi Taiwei, Wang Zhuoer, Yang Longqi, Lin Ying-chun, He Zexue, Wan Mengting, Zhou Pei, Jauhar Sujay, Xu Xiaofeng, Song Xia, Neville Jennifer
- Large Language Models Are Learnable Planners For Long-term Recommendation Shi Wentao, He Xiangnan, Zhang Yang, Gao Chongming, Li Xinyue, Zhang Jizhi, Wang Qifan, Feng Fuli
- Agentre: An Agent-based Framework For Navigating Complex Information Landscapes In Relation Extraction Shi Yuchen, Jiang Guochao, Qiu Tian, Yang Deqing
- Retrieval-enhanced Knowledge Editing In Language Models For Multi-hop Question Answering Shi Yucheng, Tan Qiaoyu, Wu Xuansheng, Zhong Shaochen, Zhou Kaixiong, Liu Ninghao
- Eragent: Enhancing Retrieval-augmented Language Models With Improved Accuracy, Efficiency, And Personalization Shi Yunxiao, Zi Xing, Shi Zijing, Zhang Haimin, Wu Qiang, Xu Min
- Generate-then-ground In Retrieval-augmented Generation For Multi-hop Question Answering Shi Zhengliang, Zhang Shuo, Sun Weiwei, Gao Shen, Ren Pengjie, Chen Zhumin, Ren Zhaochun
- Abusegpt: Abuse Of Generative AI Chatbots To Create Smishing Campaigns Shibli Ashfak Md, Pritom Mir Mehedi A., Gupta Maanak
- Can Prompt Modifiers Control Bias? A Comparative Analysis Of Text-to-image Generative Models Shin Philip Wootaek, Ahn Jihyun Janice, Yin Wenpeng, Sampson Jack, Narayanan Vijaykrishnan
- Adapting Llms To Hebrew: Unveiling Dictalm 2.0 With Enhanced Vocabulary And Instruction Capabilities Shmidman Shaltiel, Shmidman Avi, Cohen Amir Dn, Koppel Moshe
- Knowledge Graph Large Language Model (KG-LLM) For Link Prediction Shu Dong, Chen Tianle, Jin Mingyu, Zhang Chong, Du Mengnan, Zhang Yongfeng
- An Active Inference Strategy For Prompting Reliable Responses From Large Language Models In Medical Practice Shusterman Roma, Waters Allison C., O'neill Shannon, Luu Phan, Tucker Don M.
- Design2code: How Far Are We From Automating Front-end Engineering? Si Chenglei, Zhang Yanzhe, Yang Zhengyuan, Liu Ruibo, Yang Diyi
- Iclguard: Controlling In-context Learning Behavior For Applicability Authorization Si Wai Man, Backes Michael, Zhang Yang
- Integrating Large Language Models Into Recommendation Via Mutual Augmentation And Adaptive Aggregation Sichun Luo, Yuxuan Yao, Bowei He, Yinya Huang, Aojun Zhou, Xinyi Zhang, Yuanzhang Xiao, Mingjie Zhan, Linqi Song
- Scaling Synthetic Logical Reasoning Datasets With Context-sensitive Declarative Grammars Sileo Damien
- Aya Dataset: An Open-access Collection For Multilingual Instruction Tuning Singh Shivalika, Vargus Freddie, Dsouza Daniel, Karlsson Börje F., Mahendiran Abinaya, Ko Wei-yin, Shandilya Herumb, Patel Jay, Mataciunas Deividas, Omahony Laura, Zhang Mike, Hettiarachchi Ramith, Wilson Joseph, Machado Marina, Moura Luisa Souza, Krzemiński Dominik, Fadaei Hakimeh, Ergün Irem, Okoh Ifeoma, Alaagib Aisha, Mudannayake Oshan, Alyafeai Zaid, Chien Vu Minh, Ruder Sebastian, Guthikonda Surya, Alghamdi Emad A., Gehrmann Sebastian, Muennighoff Niklas, Bartolo Max, Kreutzer Julia, Üstün Ahmet, Fadaee Marzieh, Hooker Sara
- Llm-dcache: Improving Tool-augmented Llms With Gpt-driven Localized Data Caching Singh Simranjit, Fore Michael, Karatzas Andreas, Lee Chaehong, Jian Yanan, Shangguan Longfei, Yu Fuxun, Anagnostopoulos Iraklis, Stamoulis Dimitrios
- Llava Finds Free Lunch: Teaching Human Behavior Improves Content Understanding Abilities Of Llms Singh Somesh, S Harini I, Singla Yaman K, Baths Veeky, Shah Rajiv Ratn, Chen Changyou, Krishnamurthy Balaji
- LGR2: Language Guided Reward Relabeling For Accelerating Hierarchical Reinforcement Learning Singh Utsav, Bhattacharyya Pramit, Namboodiri Vinay P.
- Are Small Language Models Ready To Compete With Large Language Models For Practical Applications? Sinha Neelabh, Jain Vinija, Chadha Aman
- Replan: Robotic Replanning With Perception And Language Models Skreta Marta, Zhou Zihan, Yuan Jia Lin, Darvish Kourosh, Aspuru-guzik Alán, Garg Animesh
- From Words To Worlds: Transforming One-line Prompt Into Immersive Multi-modal Digital Stories With Communicative LLM Agent Sohn Samuel S., Li Danrui, Zhang Sen, Chang Che-jui, Kapadia Mubbasir
- Gistembed: Guided In-sample Selection Of Training Negatives For Text Embedding Fine-tuning Solatorio Aivin V.
- A Statistical Framework For Weak-to-strong Generalization Somerstep Seamus, Polo Felipe Maia, Banerjee Moulinath, Ritov Ya'acov, Yurochkin Mikhail, Sun Yuekai
- A New Pipeline For Generating Instruction Dataset Via RAG And Self Fine-tuning Song Chih-wei, Lee Yu-kai, Tsai Yin-te
- The Typing Cure: Experiences With Large Language Model Chatbots For Mental Health Support Song Inhwa, Pendse Sachin R., Kumar Neha, De Choudhury Munmun
- Moma: Multimodal LLM Adapter For Fast Personalized Image Generation Song Kunpeng, Zhu Yizhe, Liu Bingchen, Yan Qing, Elgammal Ahmed, Yang Xiao
- Knowledge Editing On Black-box Large Language Models Song Xiaoshuai, Wang Zhengyang, He Keqing, Dong Guanting, Mou Yutao, Zhao Jinxu, Xu Weiran
- Position: Leverage Foundational Models For Black-box Optimization Song Xingyou, Tian Yingtao, Lange Robert Tjarko, Lee Chansoo, Tang Yujin, Chen Yutian
- Integrating AI In College Education: Positive Yet Mixed Experiences With Chatgpt Song Xinrui, Zhang Jiajin, Yan Pingkun, Hahn Juergen, Kruger Uwe, Mohamed Hisham, Wang Ge
- ELLA-V: Stable Neural Codec Language Modeling With Alignment-guided Sequence Reordering Song Yakun, Chen Zhuo, Wang Xiaofei, Ma Ziyang, Chen Xie
- Moviellm: Enhancing Long Video Understanding With Ai-generated Movies Song Zhende, Wang Chenchen, Sheng Jiamu, Zhang Chi, Yu Gang, Fan Jiayuan, Chen Tao
- Pedagogical Alignment Of Large Language Models Sonkar Shashank, Ni Kangqi, Chaudhary Sapana, Baraniuk Richard G.
- Know When To Stop: A Study Of Semantic Drift In Text Generation Spataru Ava, Hambro Eric, Voita Elena, Cancedda Nicola
- Understanding Memorisation In Llms: Dynamics, Influencing Factors, And Implications Speicher Till, Khan Mohammad Aflah, Wu Qinyuan, Nanda Vedant, Das Soumi, Ghosh Bishwamittra, Gummadi Krishna P., Terzi Evimaria
- Probing Multimodal Llms As World Models For Driving Sreeram Shiva, Wang Tsun-hsuan, Maalouf Alaa, Rosman Guy, Karaman Sertac, Rus Daniela
- Crossing New Frontiers: Knowledge-augmented Large Language Model Prompting For Zero-shot Text-based De Novo Molecule Design Srinivas Sakhinana Sagar, Runkana Venkataramana
- A Survey Of Using Large Language Models For Generating Infrastructure As Code Srivatsa Kalahasti Ganesh, Mukhopadhyay Sabyasachi, Katrapati Ganesh, Shrivastava Manish
- LUCID: Llm-generated Utterances For Complex And Interesting Dialogues Stacey Joe, Cheng Jianpeng, Torr John, Guigue Tristan, Driesen Joris, Coca Alexandru, Gaynor Mark, Johannsen Anders
- Towards Truly Zero-shot Compositional Visual Reasoning With Llms As Programmers Stanić Aleksandar, Caelles Sergi, Tschannen Michael
- Virtual Agents For Alcohol Use Counseling: Exploring Llm-powered Motivational Interviewing Steenstra Ian, Nouraei Farnaz, Arjmand Mehdi, Bickmore Timothy W.
- Satyrn: A Platform For Analytics Augmented Generation Sterbentz Marko, Barrie Cameron, Shahi Shubham, Dutta Abhratanu, Hooshmand Donna, Pack Harper, Hammond Kristian J.
- Large Language Models Are Inconsistent And Biased Evaluators Stureborg Rickard, Alikaniotis Dimitris, Suhara Yoshi
- Mission Impossible: A Statistical Perspective On Jailbreaking Llms Su Jingtong, Kempe Julia, Ullrich Karen
- Unsupervised Real-time Hallucination Detection Based On The Internal States Of Large Language Models Su Weihang, Wang Changyue, Ai Qingyao, Hu Yiran, Wu Zhijing, Zhou Yujia, Liu Yiqun
- Llm-barber: Block-aware Rebuilder For Sparsity Mask In One-shot For Large Language Models Su Yupeng, Guan Ziyi, Liu Xiaoqun, Jin Tianlai, Wu Dongkuan, Chesi Graziano, Wong Ngai, Yu Hao
- Timo: Towards Better Temporal Reasoning For Language Models Su Zhaochen, Zhang Jun, Zhu Tong, Qu Xiaoye, Li Juntao, Zhang Min, Cheng Yu
- LAB: Large-scale Alignment For Chatbots Sudalairaj Shivchander, Bhandwaldar Abhishek, Pareja Aldo, Xu Kai, Cox David D., Srivastava Akash
- Structured Chain-of-thought Prompting For Few-shot Generation Of Content-grounded QA Conversations Sultan Md Arafat, Ganhotra Jatin, Astudillo Ramón Fernandez
- Persona-db: Efficient Large Language Model Personalization For Response Prediction With Collaborative Data Refinement Sun Chenkai, Yang Ke, Reddy Revanth Gangi, Fung Yi R., Chan Hou Pong, Small Kevin, Zhai Chengxiang, Ji Heng
- Hierarchical In-context Reinforcement Learning With Hindsight Modular Reflections For Planning Sun Chuanneng, Huang Songjun, Pompili Dario
- Sq-llava: Self-questioning For Large Vision-language Assistant Sun Guohao, Qin Can, Wang Jiamian, Chen Zeyuan, Xu Ran, Tao Zhiqiang
- Delrec: Distilling Sequential Pattern To Enhance Llm-based Recommendation Sun Guohao, Zhang Haoyi
- Supervised Fine-tuning As Inverse Reinforcement Learning Sun Hao
- Retrieved In-context Principles From Previous Mistakes Sun Hao, Jiang Yong, Wang Bo, Hou Yingyan, Zhang Yan, Xie Pengjun, Huang Fei
- Bbox-adapter: Lightweight Adapting For Black-box Large Language Models Sun Haotian, Zhuang Yuchen, Wei Wei, Zhang Chao, Dai Bo
- Evograd: A Dynamic Take On The Winograd Schema Challenge With Human Adversaries Sun Jing Han, Emami Ali
- Lawluo: A Chinese Law Firm Co-run By LLM Agents Sun Jingyun, Dai Chengxiao, Luo Zhongze, Chang Yangbo, Li Yang
- The Critique Of Critique Sun Shichao, Li Junlong, Yuan Weizhe, Yuan Ruifeng, Li Wenjie, Liu Pengfei
- Towards Detecting Llms Hallucination Via Markov Chain-based Multi-agent Debate Framework Sun Xiaoxi, Li Jinpeng, Zhong Yan, Zhao Dongyan, Yan Rui
- Rapidly Developing High-quality Instruction Data And Evaluation Benchmark For Large Language Models With Minimal Human Effort: A Case Study On Japanese Sun Yikun, Wan Zhen, Ueda Nobuhiro, Yahata Sakiko, Cheng Fei, Chu Chenhui, Kurohashi Sadao
- DFA-RAG: Conversational Semantic Router For Large Language Model With Definite Finite Automaton Sun Yiyou, Hu Junjie, Cheng Wei, Chen Haifeng
- F-eval: Assessing Fundamental Abilities With Refined Evaluation Methods Sun Yu, Chen Keyu, Wang Shujie, Li Peiji, Guo Qipeng, Yan Hang, Qiu Xipeng, Huang Xuanjing, Lin Dahua
- LCS: A Language Converter Strategy For Zero-shot Neural Machine Translation Sun Zengkui, Liu Yijin, Meng Fandong, Xu Jinan, Chen Yufeng, Zhou Jie
- Enhancing Agent Learning Through World Dynamics Modeling Sun Zhiyuan, Shi Haochen, Côté Marc-alexandre, Berseth Glen, Yuan Xingdi, Liu Bang
- Automating Research Synthesis With Domain-specific Large Language Model Fine-tuning Susnjak Teo, Hwang Peter, Reyes Napoleon H., Barczak Andre L. C., Mcintosh Timothy R., Ranathunga Surangika
- Meta-prompting: Enhancing Language Models With Task-agnostic Scaffolding Suzgun Mirac, Kalai Adam Tauman
- From Explanations To Action: A Zero-shot, Theory-driven LLM Framework For Student Performance Feedback Swamy Vinitra, Romano Davide, Desikan Bhargav Srinivasa, Camburu Oana-maria, Käser Tanja
- Online Adaptation Of Language Models With A Memory Of Amortized Contexts Tack Jihoon, Kim Jaehyung, Mitchell Eric, Shin Jinwoo, Teh Yee Whye, Schwarz Jonathan Richard
- Mmlu-pro+: Evaluating Higher-order Reasoning And Shortcut Learning In Llms Taghanaki Saeid Asgari, Khani Aliasgahr, Khasahmadi Amir
- JORA: JAX Tensor-parallel Lora Library For Retrieval Augmented Fine-tuning Tahir Anique, Cheng Lu, Liu Huan
- Improving Large Language Model (LLM) Fidelity Through Context-aware Grounding: A Systematic Approach To Reliability And Veracity Talukdar Wrick, Biswas Anjanava
- Retrieval Meets Reasoning: Even High-school Textbook Knowledge Benefits Multimodal Reasoning Tan Cheng, Wei Jingxuan, Sun Linzhuang, Gao Zhangyang, Li Siyuan, Yu Bihui, Guo Ruifeng, Li Stan Z.
- PROXYQA: An Alternative Framework For Evaluating Long-form Text Generation With Large Language Models Tan Haochen, Guo Zhijiang, Shi Zhan, Xu Lu, Liu Zhili, Feng Yunlong, Li Xiaoguang, Wang Yasheng, Shang Lifeng, Liu Qun, Song Linqi
- Blinded By Generated Contexts: How Language Models Merge Generated And Retrieved Contexts When Knowledge Conflicts? Tan Hexiang, Sun Fei, Yang Wanli, Wang Yuanzhuo, Cao Qi, Cheng Xueqi
- Taskgen: A Task-based, Memory-infused Agentic Framework Using Strictjson Tan John Chong Min, Saroj Prince, Runwal Bharat, Maheshwari Hardik, Sheng Brian Lim Yi, Cottrill Richard, Chona Alankrit, Kumar Ambuj, Motani Mehul
- Idgenrec: Llm-recsys Alignment With Textual ID Learning Tan Juntao, Xu Shuyuan, Hua Wenyue, Ge Yingqiang, Li Zelong, Zhang Yongfeng
- Thought-like-pro: Enhancing Reasoning Of Large Language Models Through Self-driven Prolog-based Chain-of-thought Tan Xiaoyu, Deng Yongxin, Qiu Xihe, Xu Weidi, Qu Chao, Chu Wei, Xu Yinghui, Qi Yuan
- Struct-x: Enhancing Large Language Models Reasoning With Structured Data Tan Xiaoyu, Wang Haoyu, Qiu Xihe, Cheng Yuan, Xu Yinghui, Chu Wei, Qi Yuan
- DLO: Dynamic Layer Operation For Efficient Vertical Scaling Of Llms Tan Zhen, Dong Daize, Zhao Xinyu, Peng Jie, Cheng Yu, Chen Tianlong
- Tuning-free Accountable Intervention For LLM Deployment -- A Metacognitive Approach Tan Zhen, Peng Jie, Chen Tianlong, Liu Huan
- Can I Understand What I Create? Self-knowledge Evaluation Of Large Language Models Tan Zhiquan, Wei Lai, Wang Jindong, Xie Xing, Huang Weiran
- Leveraging Language Models For Emotion And Behavior Analysis In Education Tanaka Kaito, Tan Benjamin, Wong Brian
- Jill Watson: A Virtual Teaching Assistant Powered By Chatgpt Taneja Karan, Maiti Pratyusha, Kakar Sandeep, Guruprasad Pranav, Rao Sanjeev, Goel Ashok K.
- Large Language Models Might Not Care What You Are Saying: Prompt Format Beats Descriptions Tang Chenming, Wang Zhixiang, Wu Yunfang
- Vilco-bench: Video Language Continual Learning Benchmark Tang Tianqi, Deldari Shohreh, Xue Hao, De Melo Celso, Salim Flora D.
- Llmbox: A Comprehensive Library For Large Language Models Tang Tianyi, Hu Yiwen, Li Bingqian, Luo Wenyang, Qin Zijing, Sun Haoxiang, Wang Jiapeng, Xu Shiyi, Cheng Xiaoxue, Guo Geyang, Peng Han, Zheng Bowen, Tang Yiru, Min Yingqian, Chen Yushuo, Chen Jie, Zhao Yuanqian, Ding Luran, Wang Yuhao, Dong Zican, Xia Chunxuan, Li Junyi, Zhou Kun, Zhao Wayne Xin, Wen Ji-rong
- A + B: A General Generator-reader Framework For Optimizing Llms To Unleash Synergy Potential Tang Wei, Cao Yixin, Ying Jiahao, Wang Bo, Zhao Yuyue, Liao Yong, Zhou Pengyuan
- Prioritizing Safeguarding Over Autonomy: Risks Of LLM Agents For Science Tang Xiangru, Jin Qiao, Zhu Kunlun, Yuan Tongxin, Zhang Yichi, Zhou Wangchunshu, Qu Meng, Zhao Yilun, Tang Jian, Zhang Zhuosheng, Cohan Arman, Lu Zhiyong, Gerstein Mark
- Unleashing The Potential Of Large Language Models As Prompt Optimizers: An Analogical Analysis With Gradient-based Model Optimizers Tang Xinyu, Wang Xiaolei, Zhao Wayne Xin, Lu Siyuan, Li Yaliang, Wen Ji-rong
- Chatgpt Incorrectness Detection In Software Reviews Tanzil Minaoar Hossain, Khan Junaed Younus, Uddin Gias
- Enhancing Commentary Strategies For Imperfect Information Card Games: A Study Of Large Language Models In Guandan Commentary Tao Meiling, Liang Xuechen, Wang Ziyi, Tao Yiling, Shi Tianyu
- CUDRT: Benchmarking The Detection Of Human Vs. Large Language Models Generated Texts Tao Zhen, Li Zhiyu, Xi Dinghao, Xu Wei
- A Survey On Self-evolution Of Large Language Models Tao Zhengwei, Lin Ting-en, Chen Xiancai, Li Hangyu, Wu Yuchuan, Li Yongbin, Jin Zhi, Huang Fei, Tao Dacheng, Zhou Jingren
- Targeted Visual Prompting For Medical Visual Question Answering Tascon-morales Sergio, Márquez-neila Pablo, Sznitman Raphael
- Exploring Design Choices For Building Language-specific Llms Tejaswi Atula, Gupta Nilesh, Choi Eunsol
- Interactive Prompt Debugging With Sequence Salience Tenney Ian, Mullins Ryan, Du Bin, Pandya Shree, Kahng Minsuk, Dixon Lucas
- Judging The Judges: Evaluating Alignment And Vulnerabilities In Llms-as-judges Thakur Aman Singh, Choudhary Kartik, Ramayapally Venkat Srinik, Vaidyanathan Sankaran, Hupkes Dieuwke
- Equipping Language Models With Tool Use Capability For Tabular Data Analysis In Finance Theuma Adrian, Shareghi Ehsan
- Elsevier Arena: Human Evaluation Of Chemistry/biology/health Foundational Large Language Models Thorne Camilo, Druckenbrodt Christian, Szarkowska Kinga, Goyal Deepika, Marajan Pranita, Somanath Vijay, Harper Corey, Yan Mao, Scerri Tony
- I Am A Strange Dataset: Metalinguistic Tests For Language Models Thrush Tristan, Moore Jared, Monares Miguel, Potts Christopher, Kiela Douwe
- Battling Botpoop Using Genai For Higher Education: A Study Of A Retrieval Augmented Generation Chatbots Impact On Learning Thway Maung, Recatala-gomez Jose, Lim Fun Siong, Hippalgaonkar Kedar, Ng Leonard W. T.
- Hydralora: An Asymmetric Lora Architecture For Efficient Fine-tuning Tian Chunlin, Shi Zhan, Guo Zhijiang, Li Li, Xu Chengzhong
- Tokenize The World Into Object-level Knowledge To Address Long-tail Events In Autonomous Driving Tian Ran, Li Boyi, Weng Xinshuo, Chen Yuxiao, Schmerling Edward, Wang Yue, Ivanovic Boris, Pavone Marco
- Are Large Language Models Capable Of Generating Human-level Narratives? Tian Yufei, Huang Tenghao, Liu Miri, Jiang Derek, Spangher Alexander, Chen Muhao, May Jonathan, Peng Nanyun
- PLUGH: A Benchmark For Spatial Understanding And Reasoning In Large Language Models Tikhonov Alexey
- Cantor: Inspiring Multimodal Chain-of-thought Of MLLM Gao Timin, Chen Peixian, Zhang Mengdan, Fu Chaoyou, Shen Yunhang, Zhang Yan, Zhang Shengchuan, Zheng Xiawu, Sun Xing, Cao Liujuan, Ji Rongrong
- Zyda: A 1.3T Dataset For Open Language Modeling Tokpanov Yury, Millidge Beren, Glorioso Paolo, Pilault Jonathan, Ibrahim Adam, Whittington James, Anthony Quentin
- An Ecosage Assistant: Towards Building A Multimodal Plant Care Dialogue Assistant Tomar Mohit, Tiwari Abhisek, Saha Tulika, Jha Prince, Saha Sriparna
- Generative AI Systems: A Systems-based Perspective On Generative AI Tomczak Jakub M.
- Evaluating Large Language Models With Grid-based Game Competitions: An Extensible LLM Benchmark And Leaderboard Topsakal Oguzhan, Edell Colby Jacob, Harper Jackson Bailey
- Readctrl: Personalizing Text Generation With Readability-controlled Instruction Learning Tran Hieu, Yao Zonghai, Li Lingxi, Yu Hong
- UCCIX: Irish-excellence Large Language Model Tran Khanh-tung, O'sullivan Barry, Nguyen Hoang D.
- Transformers Are Ssms: Generalized Models And Efficient Algorithms Through Structured State Space Duality Dao Tri, Gu Albert
- Linguacodus: A Synergistic Framework For Transformative Code Generation In Machine Learning Pipelines Trofimova Ekaterina, Sataev Emil, Ustyuzhanin Andrey E.
- Crossing Linguistic Horizons: Finetuning And Comprehensive Evaluation Of Vietnamese Large Language Models Truong Sang T., Nguyen Duc Q., Nguyen Toan, Le Dong D., Truong Nhi N., Quan Tho, Koyejo Sanmi
- Leveraging LLM Reasoning Enhances Personalized Recommender Systems Tsai Alicia Y., Kraft Adam, Jin Long, Cai Chenwei, Hosseini Anahita, Xu Taibai, Zhang Zemin, Hong Lichan, Chi Ed H., Yi Xinyang
- Two Tales Of Persona In Llms: A Survey Of Role-playing And Personalization Tseng Yu-min, Huang Yu-chao, Hsiao Teng-yun, Chen Wei-lin, Huang Chao-wei, Meng Yu, Chen Yun-nung
- R-eval: A Unified Toolkit For Evaluating Domain Knowledge Of Retrieval Augmented Large Language Models Tu Shangqing, Wang Yuanchun, Yu Jifan, Xie Yuyang, Shi Yaran, Wang Xiaozhi, Zhang Jing, Hou Lei, Li Juanzi
- Unmasking The Imposters: In-domain Detection Of Human Vs. Machine-generated Tweets Tuck Bryan E., Verma Rakesh M.
- LM Transparency Tool: Interactive Tool For Analyzing Transformer Language Models Tufanov Igor, Hambardzumyan Karen, Ferrando Javier, Voita Elena
- Dynamo: Accelerating Language Model Inference With Dynamic Multi-token Sampling Tuli Shikhar, Lin Chi-heng, Hsu Yen-chang, Jha Niraj K., Shen Yilin, Jin Hongxia
- Lifelong Robot Library Learning: Bootstrapping Composable And Generalizable Skills For Embodied Control With Language Models Tziafas Georgios, Kasaei Hamidreza
- Syncode: LLM Generation With Grammar Augmentation Ugare Shubham, Suresh Tarun, Kang Hangoo, Misailovic Sasa, Singh Gagandeep
- Adacoder: Adaptive Prompt Compression For Programmatic Visual Question Answering Ukai Mahiro, Kurita Shuhei, Hashimoto Atsushi, Ushiku Yoshitaka, Inoue Nakamasa
- Hypotermqa: Hypothetical Terms Dataset For Benchmarking Hallucination Tendency Of Llms Uluoglakci Cem, Temizel Tugba Taskaya
- Evaluating The Capability Of Large Language Models To Personalize Science Texts For Diverse Middle-school-age Learners Vaccaro Michael Jr, Friday Mikayla, Zaghi Arash
- Natural Language Programming In Medicine: Administering Evidence Based Clinical Workflows With Autonomous Agents Powered By Generative Large Language Models Vaid Akhil, Lampert Joshua, Lee Juhee, Sawant Ashwin, Apakama Donald, Sakhuja Ankit, Soroush Ali, Bick Sarah, Abbott Ethan, Gomez Hernando, Hadley Michael, Lee Denise, Landi Isotta, Duong Son Q, Bussola Nicole, Nabeel Ismail, Muehlstedt Silke, Freeman Robert, Kovatch Patricia, Carr Brendan, Wang Fei, Glicksberg Benjamin, Argulian Edgar, Lerakis Stamatios, Khera Rohan, Reich David L., Kraft Monica, Charney Alexander, Nadkarni Girish
- Accelerating Large Language Model Inference With Self-supervised Early Exits Valade Florian
- Cost-effective Hallucination Detection For Llms Valentin Simon, Fu Jinmiao, Detommaso Gianluca, Xu Shaoyuan, Zappella Giovanni, Wang Bryan
- Enhancing Model Performance: Another Approach To Vision-language Instruction Tuning Vedanshu, Tripathi Mm, Jaint Bhavnesh
- Question Suggestion For Conversational Shopping Assistants Using Product Metadata Vedula Nikhita, Rokhlenko Oleg, Malmasi Shervin
- "Confidently Nonsensical?": A Critical Survey On The Perspectives And Challenges Of 'Hallucinations' In NLP Venkit Pranav Narayanan, Chakravorti Tatiana, Gupta Vipul, Biggs Heidi, Srinath Mukund, Goswami Koustava, Rajtmajer Sarah, Wilson Shomir
- RAG Based Question-answering For Contextual Response Prediction System Veturi Sriram, Vaichal Saurabh, Jagadheesh Reshma Lal, Tripto Nafis Irtiza, Yan Nian
- Exploring LGBTQ+ Bias In Generative AI Answers Across Different Country And Religious Contexts Vicsek Lilla, Vancsó Anna, Zajko Mike, Takacs Judit
- Input Conditioned Graph Generation For Language Agents Vierling Lukas, Fu Jie, Chen Kai
- Exploring Safety-utility Trade-offs In Personalized Language Models Vijjini Anvesh Rao, Chowdhury Somnath Basu Roy, Chaturvedi Snigdha
- Instruction Following With Goal-conditioned Reinforcement Learning In Virtual Environments Volovikova Zoya, Skrynnik Alexey, Kuderov Petr, Panov Aleksandr I.
- Generative Large Language Models In Automated Fact-checking: A Survey Vykopal Ivan, Pikuliak Matúš, Ostermann Simon, Šimko Marián
- Leveraging Zero-shot Prompting For Efficient Language Model Distillation Vöge Lukas, Gurgul Vincent, Lessmann Stefan
- Fusechat: Knowledge Fusion Of Chat Models Wan Fanqi, Zhong Longguang, Yang Ziyi, Chen Ruijun, Quan Xiaojun
- Dynamic Self-consistency: Leveraging Reasoning Paths For Efficient LLM Sampling Wan Guangya, Wu Yuqi, Chen Jie, Li Sheng
- COSMO: Contrastive Streamlined Multimodal Model With Interleaved Pre-training Wang Alex Jinpeng, Li Linjie, Lin Kevin Qinghong, Wang Jianfeng, Lin Kevin, Yang Zhengyuan, Wang Lijuan, Shou Mike Zheng
- Fine-grained Self-endorsement Improves Factuality And Reasoning Wang Ante, Song Linfeng, Peng Baolin, Tian Ye, Jin Lifeng, Mi Haitao, Su Jinsong, Yu Dong
- GOLF: Goal-oriented Long-term Life Tasks Supported By Human-ai Collaboration Wang Ben
- Task Supportive And Personalized Human-large Language Model Interaction: A User Study Wang Ben, Liu Jiqun, Karimnazarov Jamshed, Thompson Nicolas
- Resilience Of Large Language Models For Noisy Instructions Wang Bin, Wei Chengwei, Liu Zhengyuan, Lin Geyu, Chen Nancy F.
- ANGO: A Next-level Evaluation Benchmark For Generation-oriented Language Models In Chinese Domain Wang Bingchao
- Mllm-tool: A Multimodal Large Language Model For Tool Agent Learning Wang Chenyu, Luo Weixin, Chen Qianyu, Mai Haonan, Guo Jindi, Dong Sixun, Xuan Xiaohua (Michael), Li Zhengxin, Ma Lin, Gao Shenghua
- Ada-leval: Evaluating Long-context Llms With Length-adaptable Benchmarks Wang Chonghua, Duan Haodong, Zhang Songyang, Lin Dahua, Chen Kai
- Instruction Tuning-free Visual Token Complement For Multimodal Llms Wang Dongsheng, Cui Jiequan, Li Miaoge, Lin Wang, Chen Bo, Zhang Hanwang
- Towards Efficient And Effective Unlearning Of Large Language Models For Recommendation Wang Hangyu, Lin Jianghao, Chen Bo, Yang Yang, Tang Ruiming, Zhang Weinan, Yu Yong
- Blendfilter: Advancing Retrieval-augmented Large Language Models Via Query Generation Blending And Knowledge Filtering Wang Haoyu, Li Ruirui, Jiang Haoming, Tian Jinjin, Wang Zhengyang, Luo Chen, Tang Xianfeng, Cheng Monica, Zhao Tuo, Gao Jing
- M4U: Evaluating Multilingual Understanding And Reasoning For Large Multimodal Models Wang Hongyu, Xu Jiayu, Xie Senwei, Wang Ruiping, Li Jialin, Xie Zhaojie, Zhang Bin, Xiong Chuyan, Chen Xilin
- Instructgraph: Boosting Large Language Models Via Graph-centric Instruction Tuning And Preference Alignment Wang Jianing, Wu Junda, Hou Yupeng, Liu Yao, Gao Ming, Mcauley Julian
- A User-centric Benchmark For Evaluating Large Language Models Wang Jiayin, Mo Fengran, Ma Weizhi, Sun Peijie, Zhang Min, Nie Jian-yun
- Chinese Metaphor Recognition Using A Multi-stage Prompting Large Language Model Wang Jie, Wang Jin, Zhang Xuejie
- Selective Forgetting: Advancing Machine Unlearning Techniques And Evaluation In Language Models Wang Lingzhi, Zeng Xingshan, Guo Jinsong, Wong Kam-fai, Gottlob Georg
- Interactive Dualchecker For Mitigating Hallucinations In Distilling Large Language Models Wang Meiyun, Suzuki Masahiro, Sakaji Hiroki, Izumi Kiyoshi
- Conversational Simulmt: Efficient Simultaneous Translation With Large Language Models Wang Minghan, Vu Thuy-trang, Wang Yuxia, Shareghi Ehsan, Haffari Gholamreza
- Exploring The Potential Of Multimodal LLM With Knowledge-intensive Multimodal ASR Wang Minghan, Wang Yuxia, Vu Thuy-trang, Shareghi Ehsan, Haffari Gholamreza
- Inferaligner: Inference-time Alignment For Harmlessness Through Cross-model Guidance Wang Pengyu, Zhang Dong, Li Linyang, Tan Chenkun, Wang Xinghao, Ren Ke, Jiang Botian, Qiu Xipeng
- Reconstruct The Pruned Model Without Any Retraining Wang Pingjie, Fan Ziqing, Hu Shengchao, Chen Zhe, Wang Yanfeng, Wang Yu
- Unlearning With Control: Assessing Real-world Utility For Large Language Model Unlearning Wang Qizhou, Han Bo, Yang Puning, Zhu Jianing, Liu Tongliang, Sugiyama Masashi
- SUBLLM: A Novel Efficient Architecture With Token Sequence Subsampling For LLM Wang Quandong, Yuan Yuxuan, Yang Xiaoyu, Zhang Ruike, Zhao Kang, Liu Wei, Luan Jian, Povey Daniel, Wang Bin
- Prefclm: Enhancing Preference-based Reinforcement Learning With Crowdsourced Large Language Models Wang Ruiqi, Zhao Dezhong, Yuan Ziqin, Obi Ike, Min Byung-cheol
- Deconfounded Causality-aware Parameter-efficient Fine-tuning For Problem-solving Improvement Of Llms Wang Ruoyu, Li Xiaoxuan, Yao Lina
- Unique Security And Privacy Threats Of Large Language Model: A Comprehensive Survey Wang Shang, Zhu Tianqing, Liu Bo, Ding Ming, Guo Xu, Ye Dayong, Zhou Wanlei, Yu Philip S.
- Lora Meets Dropout Under A Unified Framework Wang Sheng, Chen Liheng, Jiang Jiyue, Xue Boyang, Kong Lingpeng, Wu Chuan
- Prolora: Partial Rotation Empowers More Parameter-efficient Lora Wang Sheng, Xue Boyang, Ye Jiacheng, Jiang Jiyue, Chen Liheng, Kong Lingpeng, Wu Chuan
- Offline RLHF Methods Need More Accurate Supervision Signals Wang Shiqi, Zhang Zhengze, Zhao Rui, Tan Fei, Nguyen Cam Tu
- From Llms To Mllms: Exploring The Landscape Of Multimodal Jailbreaking Wang Siyuan, Long Zhuohan, Fan Zhihao, Wei Zhongyu
- Benchmark Self-evolving: A Multi-agent Framework For Dynamic LLM Evaluation Wang Siyuan, Long Zhuohan, Fan Zhihao, Wei Zhongyu, Huang Xuanjing
- Symbolic Working Memory Enhances Language Models For Complex Rule Application Wang Siyuan, Wei Zhongyu, Choi Yejin, Ren Xiang
- Weaver: Foundation Models For Creative Writing Wang Tiannan, Chen Jiamin, Jia Qingrui, Wang Shuai, Fang Ruoyu, Wang Huilin, Gao Zhaowei, Xie Chunzhao, Xu Chuou, Dai Jihong, Liu Yibin, Wu Jialong, Ding Shengwei, Li Long, Huang Zhiwei, Deng Xinle, Yu Teng, Ma Gangan, Xiao Han, Chen Zixin, Xiang Danjun, Wang Yunxia, Zhu Yuanyuan, Xiao Yi, Wang Jing, Wang Yiru, Ding Siran, Huang Jiayang, Xu Jiayi, Tayier Yilihamu, Hu Zhenyu, Gao Yuan, Zheng Chengfeng, Ye Yueshu, Li Yihang, Wan Lei, Jiang Xinyue, Wang Yujie, Cheng Siyu, Song Zhule, Tang Xiangru, Xu Xiaohua, Zhang Ningyu, Chen Huajun, Jiang Yuchen Eleanor, Zhou Wangchunshu
- Using Advanced Llms To Enhance Smaller Llms: An Interpretable Knowledge Distillation Approach Wang Tong, Sudhir K., Hong Dat
- Secokd: Aligning Large Language Models For In-context Learning With Fewer Shots Wang Weixing, Yang Haojin, Meinel Christoph
- Testing And Evaluation Of Large Language Models: Correctness, Non-toxicity, And Fairness Wang Wenxuan
- Learning To Ask: When Llms Meet Unclear Instruction Wang Wenxuan, Shi Juluan, Wang Chaozheng, Lee Cheryl, Yuan Youliang, Huang Jen-tse, Lyu Michael R.
- Metatool: Facilitating Large Language Models To Master Tools With Meta-task Augmentation Wang Xiaohan, Li Dian, Zhao Yilin, Sinbadliu, Wang Hui
- FAC\(^2\)E: Better Understanding Large Language Model Capabilities By Dissociating Language And Cognition Wang Xiaoqiang, Liu Bang, Wu Lingfei
- Validating Llm-generated Programs With Metamorphic Prompt Testing Wang Xiaoyin, Zhu Dakai
- Executable Code Actions Elicit Better LLM Agents Wang Xingyao, Chen Yangyi, Yuan Lifan, Zhang Yizhe, Li Yunzhu, Peng Hao, Ji Heng
- Modaverse: Efficiently Transforming Modalities With Llms Wang Xinyu, Zhuang Bohan, Wu Qi
- UBENCH: Benchmarking Uncertainty In Large Language Models With Multiple Choice Questions Wang Xunzhi, Zhang Zhuowei, Li Qiongyu, Chen Gaonan, Hu Mengting, Li Zhiyu, Luo Bitong, Gao Hang, Han Zhixin, Wang Haotian
- EAGER: Two-stream Generative Recommender With Behavior-semantic Collaboration Wang Ye, Xun Jiahao, Hong Minjie, Zhu Jieming, Jin Tao, Lin Wang, Li Haoyuan, Li Linjun, Xia Yan, Zhao Zhou, Dong Zhenhua
- DHP Benchmark: Are Llms Good NLG Evaluators? Wang Yicheng, Yuan Jiayi, Chuang Yu-neng, Wang Zhuoer, Liu Yingchi, Cusick Mark, Kulkarni Param, Ji Zhengping, Ibrahim Yasser, Hu Xia
- Defending Llms Against Jailbreaking Attacks Via Backtranslation Wang Yihan, Shi Zhouxing, Bai Andrew, Hsieh Cho-jui
- Speak It Out: Solving Symbol-related Problems With Symbol-to-language Conversion For Language Models Wang Yile, Cheng Sijie, Sun Zixin, Li Peng, Liu Yang
- Deepedit: Knowledge Editing As Decoding With Constraints Wang Yiwei, Chen Muhao, Peng Nanyun, Chang Kai-wei
- Strategic Chain-of-thought: Guiding Accurate Reasoning In Llms Through Strategy Elicitation Wang Yu, Zhao Shiwan, Wang Zhihu, Huang Heyuan, Fan Ming, Zhang Yubo, Wang Zhixing, Wang Haijun, Liu Ting
- Ladic: Are Diffusion Models Really Inferior To Autoregressive Counterparts For Image-to-text Generation? Wang Yuchi, Ren Shuhuai, Gao Rundong, Yao Linli, Guo Qingyan, An Kaikai, Bai Jianhong, Sun Xu
- REAR: A Relevance-aware Retrieval-augmented Framework For Open-domain Question Answering Wang Yuhao, Ren Ruiyang, Li Junyi, Zhao Wayne Xin, Liu Jing, Wen Ji-rong
- Sibyl: Simple Yet Effective Agent Framework For Complex Real-world Reasoning Wang Yulong, Shen Tianhao, Liu Lifeng, Xie Jian
- Taste: Teaching Large Language Models To Translate Through Self-reflection Wang Yutong, Zeng Jiali, Liu Xuebo, Meng Fandong, Zhou Jie, Zhang Min
- LSTP: Language-guided Spatial-temporal Prompt Learning For Long-form Video-text Understanding Wang Yuxuan, Wang Yueqian, Wu Pengfei, Liang Jianxin, Zhao Dongyan, Zheng Zilong
- Videollamb: Long-context Video Understanding With Recurrent Memory Bridges Wang Yuxuan, Xie Cihang, Liu Yang, Zheng Zilong
- Absinstruct: Eliciting Abstraction Ability From Llms Through Explanation Tuning With Plausibility Estimation Wang Zhaowei, Fan Wei, Zong Qing, Zhang Hongming, Choi Sehyun, Fang Tianqing, Liu Xin, Song Yangqiu, Wong Ginny Y., See Simon
- M-RAG: Reinforcing Large Language Model Performance Through Retrieval-augmented Generation With Multiple Partitions Wang Zheng, Teo Shu Xian, Ouyang Jieer, Xu Yongjun, Shi Wei
- Genartist: Multimodal LLM As An Agent For Unified Image Generation And Editing Wang Zhenyu, Li Aoxue, Li Zhenguo, Liu Xihui
- Re-task: Revisiting LLM Tasks From Capability, Skill, And Knowledge Perspectives Wang Zhihu, Zhao Shiwan, Wang Yu, Huang Heyuan, Shi Jiaxin, Xie Sitao, Wang Zhixing, Zhang Yubo, Li Hongyan, Yan Junchi
- Chatlogic: Integrating Logic Programming With Large Language Models For Multi-step Reasoning Wang Zhongsheng, Liu Jiamou, Bao Qiming, Rong Hongfei, Zhang Jingfeng
- Wordflow: Social Prompt Engineering For Large Language Models Wang Zijie J., Chakravarthy Aishwarya, Munechika David, Chau Duen Horng
- Speculative RAG: Enhancing Retrieval Augmented Generation Through Drafting Wang Zilong, Wang Zifeng, Le Long, Zheng Huaixiu Steven, Mishra Swaroop, Perot Vincent, Zhang Yuwei, Mattapalli Anush, Taly Ankur, Shang Jingbo, Lee Chen-yu, Pfister Tomas
- C3LLM: Conditional Multimodal Content Generation Using Large Language Models Wang Zixuan, Duan Qinkai, Tai Yu-wing, Tang Chi-keung
- Flora: Federated Fine-tuning Large Language Models With Heterogeneous Low-rank Adaptations Wang Ziyao, Shen Zheyu, He Yexiao, Sun Guoheng, Wang Hongyi, Lyu Lingjuan, Li Ang
- Cogergllm: Exploring Large Language Model Systems Design Perspective Using Cognitive Ergonomics Wasi Azmine Toushik
- Large Language User Interfaces: Voice Interactive User Interfaces Powered By Llms Wasti Syed Mekael, Pu Ken Q., Neshati Ali
- Is There No Such Thing As A Bad Question? H4R: Hallucibot For Ratiocination, Rewriting, Ranking, And Routing Watson William, Cho Nicole, Srishankar Nishan
- Systematic Evaluation Of Llm-as-a-judge In LLM Alignment Tasks: Explainable Metrics And Diverse Prompt Templates Wei Hui, He Shenghua, Xia Tian, Wong Andy, Lin Jingyang, Han Mei
- Promptmm: Multi-modal Knowledge Distillation For Recommendation With Prompt-tuning Wei Wei, Tang Jiabin, Jiang Yangqin, Xia Lianghao, Huang Chao
- AC-EVAL: Evaluating Ancient Chinese Language Understanding In Large Language Models Wei Yuting, Xu Yuanxing, Wei Xinru, Yang Simin, Zhu Yangfu, Li Yuqing, Liu Di, Wu Bin
- Are Large Language Models More Empathetic Than Humans? Welivita Anuradha, Pu Pearl
- Leveraging Large Language Models For Patient Engagement: The Power Of Conversational AI In Digital Health Wen Bo, Norel Raquel, Liu Julia, Stappenbeck Thaddeus, Zulkernine Farhana, Chen Huamin
- On-policy Fine-grained Knowledge Feedback For Hallucination Mitigation Wen Xueru, Lu Xinyu, Guan Xinyan, Lu Yaojie, Lin Hongyu, He Ben, Han Xianpei, Sun Le
- Detecting And Mitigating Hallucination In Large Vision Language Models Via Fine-grained AI Feedback Xiao Wenyi, Huang Ziwei, Gan Leilei, He Wanggui, Li Haoyuan, Yu Zhelun, Jiang Hao, Wu Fei, Zhu Linchao
- Codeultrafeedback: An Llm-as-a-judge Dataset For Aligning Large Language Models To Coding Preferences Weyssow Martin, Kamanda Aton, Sahraoui Houari
- Distilrr: Transferring Code Repair For Low-resource Programming Languages Wong Kyle, Amayuelas Alfonso, Pan Liangming, Wang William Yang
- EDA Corpus: A Large Language Model Dataset For Enhanced Interaction With Openroad Wu Bing-yue, Sharma Utsav, Kankipati Sai Rahul Dhanvi, Yadav Ajay, George Bintu Kappil, Guntupalli Sai Ritish, Rovinski Austin, Chhabria Vidya A.
- Synthetic Multimodal Question Generation Wu Ian, Jayanthi Sravan, Viswanathan Vijay, Rosenberg Simon, Pakazad Sina, Wu Tongshuang, Neubig Graham
- Uicoder: Finetuning Large Language Models To Generate User Interface Code Through Automated Feedback Wu Jason, Schoop Eldon, Leung Alan, Barik Titus, Bigham Jeffrey P., Nichols Jeffrey
- Guiding Clinical Reasoning With Large Language Models Via Knowledge Seeds Wu Jiageng, Wu Xian, Yang Jie
- Medkp: Medical Dialogue With Knowledge Enhancement And Clinical Pathway Encoding Wu Jiageng, Wu Xian, Zheng Yefeng, Yang Jie
- Visionllm V2: An End-to-end Generalist Multimodal Large Language Model For Hundreds Of Vision-language Tasks Wu Jiannan, Zhong Muyan, Xing Sen, Lai Zeqiang, Liu Zhaoyang, Wang Wenhai, Chen Zhe, Zhu Xizhou, Lu Lewei, Lu Tong, Luo Ping, Qiao Yu, Dai Jifeng
- Pandora's Box Or Aladdin's Lamp: A Comprehensive Analysis Revealing The Role Of RAG Noise In Large Language Models Wu Jinyang, Che Feihu, Zhang Chuyuan, Tao Jianhua, Zhang Shuai, Shao Pengpeng
- \(\beta\)-dpo: Direct Preference Optimization With Dynamic \(\beta\) Wu Junkang, Xie Yuexiang, Yang Zhengyi, Wu Jiancan, Gao Jinyang, Ding Bolin, Wang Xiang, He Xiangnan
- Clasheval: Quantifying The Tug-of-war Between An Llm's Internal Prior And External Evidence Wu Kevin, Wu Eric, Zou James
- Mixture-of-skills: Learning To Optimize Data Usage For Fine-tuning Large Language Models Wu Minghao, Vu Thuy-trang, Qu Lizhen, Haffari Gholamreza
- ITERTL: An Iterative Framework For Fine-tuning Llms For RTL Code Generation Wu Peiyang, Guo Nan, Xiao Xiao, Li Wenming, Ye Xiaochun, Fan Dongrui
- Parallel Decoding Via Hidden Transfer For Lossless Large Language Model Acceleration Wu Pengfei, Liu Jiahao, Gong Zhuocheng, Wang Qifan, Li Jinpeng, Wang Jingang, Cai Xunliang, Zhao Dongyan
- Motionllm: Multimodal Motion-language Learning With Large Language Models Wu Qi, Zhao Yubo, Wang Yifan, Tai Yu-wing, Tang Chi-keung
- GUI Action Narrator: Where And When Did That Action Take Place? Wu Qinchen, Gao Difei, Lin Kevin Qinghong, Wu Zhuoyu, Guo Xiangwu, Li Peiran, Zhang Weichen, Wang Hengxu, Shou Mike Zheng
- Avatar: Optimizing LLM Agents For Tool-assisted Knowledge Retrieval Wu Shirley, Zhao Shiyu, Huang Qian, Huang Kexin, Yasunaga Michihiro, Cao Kaidi, Ioannidis Vassilis N., Subbian Karthik, Leskovec Jure, Zou James
- Enhance Reasoning For Large Language Models In The Game Werewolf Wu Shuang, Zhu Liwen, Yang Tao, Xu Shiwei, Fu Qiang, Wei Yang, Fu Haobo
- How Easily Do Irrelevant Inputs Skew The Responses Of Large Language Models? Wu Siye, Xie Jian, Chen Jiangjie, Zhu Tinghui, Zhang Kai, Xiao Yanghua
- Cognitive Llms: Towards Integrating Cognitive Architectures And Large Language Models For Manufacturing Decision-making Wu Siyu, Oltramari Alessandro, Francis Jonathan, Giles C. Lee, Ritter Frank E.
- Unigen: A Unified Framework For Textual Dataset Generation Using Large Language Models Wu Siyuan, Huang Yue, Gao Chujie, Chen Dongping, Zhang Qihui, Wan Yao, Zhou Tianyi, Zhang Xiangliang, Gao Jianfeng, Xiao Chaowei, Sun Lichao
- Meta-rewarding Language Models: Self-improving Alignment With Llm-as-a-meta-judge Wu Tianhao, Yuan Weizhe, Golovneva Olga, Xu Jing, Tian Yuandong, Jiao Jiantao, Weston Jason, Sukhbaatar Sainbayar
- Continual Learning For Large Language Models: A Survey Wu Tongtong, Luo Linhao, Li Yuan-fang, Pan Shirui, Vu Thuy-trang, Haffari Gholamreza
- Freeva: Offline MLLM As Training-free Video Assistant Wu Wenhao
- Enhancing Q-learning With Large Language Model Heuristics Wu Xiefeng
- VILA-U: A Unified Foundation Model Integrating Visual Understanding And Generation Wu Yecheng, Zhang Zhuoyang, Chen Junyu, Tang Haotian, Li Dacheng, Fang Yunhao, Zhu Ligeng, Xie Enze, Yin Hongxu, Yi Li, Han Song, Lu Yao
- Agentkit: Structured LLM Reasoning With Dynamic Graphs Wu Yue, Fan Yewen, Min So Yeon, Prabhumoye Shrimai, Mcaleer Stephen, Bisk Yonatan, Salakhutdinov Ruslan, Li Yuanzhi, Mitchell Tom
- Large Language Models Can Self-correct With Minimal Effort Wu Zhenyu, Zeng Qingkai, Zhang Zhihan, Tan Zhaoxuan, Shen Chao, Jiang Meng
- Memocrs: Memory-enhanced Sequential Conversational Recommender Systems With Large Language Models Xi Yunjia, Liu Weiwen, Lin Jianghao, Chen Bo, Tang Ruiming, Zhang Weinan, Yu Yong
- Play To Your Strengths: Collaborative Intelligence Of Conventional Recommender Models And Large Language Models Xi Yunjia, Liu Weiwen, Lin Jianghao, Wu Chuhan, Chen Bo, Tang Ruiming, Zhang Weinan, Yu Yong
- Inverse-q*: Token Level Reinforcement Learning For Aligning Large Language Models Without Preference Data Xia Han, Gao Songyang, Ge Qiming, Xi Zhiheng, Zhang Qi, Huang Xuanjing
- Hallucination Diversity-aware Active Learning For Text Summarization Xia Yu, Liu Xu, Yu Tong, Kim Sungchul, Rossi Ryan A., Rao Anup, Mai Tung, Li Shuai
- Improving Retrieval Augmented Language Model With Self-reasoning Xia Yuan, Zhou Jingbo, Shi Zhenhui, Chen Jun, Huang Haifeng
- What To Do If Language Models Disagree? Black-box Model Ensembling For Textual And Visual Question Answering Xia Yuxi, Zaporojets Kilm, Roth Benjamin
- Unified Hallucination Detection For Multimodal Large Language Models Xiang Chen, Chenxi Wang, Yida Xue, Ningyu Zhang, Xiaoyan Yang, Qiang Li, Yue Shen, Lei Liang, Jinjie Gu, Huajun Chen
- Badchain: Backdoor Chain-of-thought Prompting For Large Language Models Xiang Zhen, Jiang Fengqing, Xiong Zidi, Ramasubramanian Bhaskar, Poovendran Radha, Li Bo
- Guardagent: Safeguard LLM Agents By A Guard Agent Via Knowledge-enabled Reasoning Xiang Zhen, Zheng Linzhi, Li Yanjie, Hong Junyuan, Li Qinbin, Xie Han, Zhang Jiawei, Xiong Zidi, Xie Chulin, Yang Carl, Song Dawn, Li Bo
- Healme: Harnessing Cognitive Reframing In Large Language Models For Psychotherapy Xiao Mengxi, Xie Qianqian, Kuang Ziyan, Liu Zhicheng, Yang Kailai, Peng Min, Han Weiguang, Huang Jimin
- Verbalized Machine Learning: Revisiting Machine Learning With Language Models Xiao Tim Z., Bamler Robert, Schölkopf Bernhard, Liu Weiyang
- Alleviating Hallucination In Large Vision-language Models With Active Retrieval Augmentation Xiaoye Qu, Qiyuan Chen, Wei Wei, Jishuo Sun, Jianfeng Dong
- MEMLA: Enhancing Multilingual Knowledge Editing With Neuron-masked Low-rank Adaptation Xie Jiakuan, Cao Pengfei, Chen Yuheng, Chen Yubo, Liu Kang, Zhao Jun
- Non-instructional Fine-tuning: Enabling Instruction-following Capabilities In Pre-trained Language Models Without Instruction-following Data Xie Juncheng, Syu Shensian, Lee Hung-yi
- Large Multimodal Agents: A Survey Xie Junlin, Chen Zhihong, Zhang Ruifei, Wan Xiang, Li Guanbin
- Autoad-zero: A Training-free Framework For Zero-shot Audio Description Xie Junyu, Han Tengda, Bain Max, Nagrani Arsha, Varol Gül, Xie Weidi, Zisserman Andrew
- Wanglab At MEDIQA-M3G 2024: Multimodal Medical Answer Generation Using Large Language Models Xie Ronald, Palayew Steven, Toma Augustin, Bader Gary, Wang Bo
- Recall: Membership Inference Via Relative Conditional Log-likelihoods Xie Roy, Wang Junlin, Huang Ruomin, Zhang Minxing, Ge Rong, Pei Jian, Gong Neil Zhenqiang, Dhingra Bhuwan
- Gradsafe: Detecting Jailbreak Prompts For Llms Via Safety-critical Gradient Analysis Xie Yueqi, Fang Minghong, Pi Renjie, Gong Neil
- Understanding The Weakness Of Large Language Model Agents Within A Complex Android Environment Xing Mingzhe, Zhang Rongkai, Xue Hui, Chen Qi, Yang Fan, Xiao Zhen
- EFUF: Efficient Fine-grained Unlearning Framework For Mitigating Hallucinations In Multimodal Large Language Models Xing Shangyu, Zhao Fei, Wu Zhen, An Tuo, Chen Weihao, Li Chunhui, Zhang Jianbing, Dai Xinyu
- Llm-enhanced User-item Interactions: Leveraging Edge Information For Optimized Recommendations Xinyuan Wang, Liang Wu, Liangjie Hong, Hao Liu, Yanjie Fu
- Defensive Prompt Patch: A Robust And Interpretable Defense Of Llms Against Jailbreak Attacks Xiong Chen, Qi Xiangyu, Chen Pin-yu, Ho Tsung-yi
- Interactive-kbqa: Multi-turn Interactions For Knowledge Base Question Answering With Large Language Models Xiong Guanming, Bao Junwei, Zhao Wen
- Large Language Models Can Learn Temporal Reasoning Xiong Siheng, Payani Ali, Kompella Ramana, Fekri Faramarz
- Building Math Agents With Multi-turn Iterative Preference Learning Xiong Wei, Shi Chengshuai, Shen Jiaming, Rosenberg Aviv, Qin Zhen, Calandriello Daniele, Khalman Misha, Joshi Rishabh, Piot Bilal, Saleh Mohammad, Jin Chi, Zhang Tong, Liu Tianqi
- Watch Every Step! LLM Agent Learning Via Iterative Step-level Process Refinement Xiong Weimin, Song Yifan, Zhao Xiutian, Wu Wenhao, Wang Xun, Wang Ke, Li Cheng, Peng Wei, Li Sujian
- Stylemaster: Towards Flexible Stylized Image Generation With Diffusion Models Xu Chengming, Hu Kai, Luo Donghao, Zhang Jiangning, Li Wei, Ge Yanhao, Wang Chengjie
- Enhancing Retrieval-augmented Lms With A Two-stage Consistency Learning Compressor Xu Chuankai, Zhao Dongming, Wang Bo, Xing Hanwen
- Data Efficient Evaluation Of Large Language Models And Text-to-image Models Via Adaptive Sampling Xu Cong, Saranathan Gayathri, Alam Mahammad Parwez, Shah Arpit, Lim James, Wong Soon Yee, Foltin Martin, Bhattacharya Suparna
- Interactive Evolution: A Neural-symbolic Self-training Framework For Large Language Models Xu Fangzhi, Sun Qiushi, Cheng Kanzhi, Liu Jun, Qiao Yu, Wu Zhiyong
- Large Language Models For Education: A Survey Xu Hanyi, Gan Wensheng, Qi Zhenlian, Wu Jiayang, Yu Philip S.
- Rejection Improves Reliability: Training Llms To Refuse Unknown Questions Using RL From Knowledge Feedback Xu Hongshen, Zhu Zichen, Zhang Situo, Ma Da, Fan Shuai, Chen Lu, Yu Kai
- A General-purpose Device For Interaction With Llms Xu Jiajun, Wang Qun, Cao Yuhang, Zeng Baitao, Liu Sicheng
- Vtensor: Flexible Virtual Tensor Management For Efficient LLM Serving Xu Jiale, Zhang Rui, Guo Cong, Hu Weiming, Liu Zihan, Wu Feiyang, Feng Yu, Sun Shixuan, Shao Changxu, Guo Yuhong, Zhao Junping, Zhang Ke, Guo Minyi, Leng Jingwen
- Meteora: Multiple-tasks Embedded Lora For Large Language Models Xu Jingwei, Lai Junyu, Huang Yunpeng
- Faithful Logical Reasoning Via Symbolic Chain-of-thought Xu Jundong, Fei Hao, Pan Liangming, Liu Qian, Lee Mong-li, Hsu Wynne
- Tur[k]ingbench: A Challenge Benchmark For Web Agents Xu Kevin, Kordi Yeganeh, Nayak Tanay, Asija Ado, Wang Yizhong, Sanders Kate, Byerly Adam, Zhang Jingyu, Van Durme Benjamin, Khashabi Daniel
- Prompting Large Language Models For Recommender Systems: A Comprehensive Framework And Empirical Analysis Xu Lanling, Zhang Junjie, Li Bingqian, Wang Jinpeng, Cai Mingchen, Zhao Wayne Xin, Wen Ji-rong
- MINDECHO: Role-playing Language Agents For Key Opinion Leaders Xu Rui, Lu Dakuan, Tan Xiaoyu, Wang Xintao, Yuan Siyu, Chen Jiangjie, Chu Wei, Xu Yinghui
- Misconfidence-based Demonstration Selection For LLM In-context Learning Xu Shangqing, Zhang Chao
- AIOS Compiler: LLM As Interpreter For Natural Language Programming And Flow Programming Of AI Agents Xu Shuyuan, Li Zelong, Mei Kai, Zhang Yongfeng
- SA-MDKIF: A Scalable And Adaptable Medical Domain Knowledge Injection Framework For Large Language Models Xu Tianhan, Hu Zhe, Chen Ling, Li Bin
- Sayself: Teaching Llms To Express Confidence With Self-reflective Rationales Xu Tianyang, Wu Shujin, Diao Shizhe, Liu Xiaoze, Wang Xingyao, Chen Yangyi, Gao Jing
- Hr-multiwoz: A Task Oriented Dialogue (TOD) Dataset For HR LLM Agent Xu Weijie, Huang Zicheng, Hu Wenxiang, Fang Xi, Cherukuri Rajesh Kumar, Nayyar Naumaan, Malandri Lorenzo, Sengamedu Srinivasan H.
- Relating The Seemingly Unrelated: Principled Understanding Of Generalization For Generative Models In Arithmetic Reasoning Tasks Xu Xingcheng, Zhao Zibo, Zhang Haipeng, Yang Yanqing
- Generate-on-graph: Treat LLM As Both Agent And KG In Incomplete Knowledge Graph Question Answering Xu Yao, He Shizhu, Chen Jiabei, Wang Zihao, Song Yangqiu, Tong Hanghang, Liu Kang, Zhao Jun
- Conveyor: Efficient Tool-aware LLM Serving With Tool Partial Execution Xu Yechen, Kong Xinhao, Chen Tingjun, Zhuo Danyang
- Linkprompt: Natural And Universal Adversarial Attacks On Prompt-based Language Models Xu Yue, Wang Wenjie
- Magpie: Alignment Data Synthesis From Scratch By Prompting Aligned Llms With Nothing Xu Zhangchen, Jiang Fengqing, Niu Luyao, Deng Yuntian, Poovendran Radha, Choi Yejin, Lin Bill Yuchen
- Bag Of Tricks: Benchmarking Of Jailbreak Attacks On Llms Xu Zhao, Liu Fan, Liu Hao
- Detectiveqa: Evaluating Long-context Reasoning On Detective Novels Xu Zhe, Ye Jiasheng, Liu Xiangyang, Sun Tianxiang, Liu Xiaoran, Guo Qipeng, Li Linlin, Liu Qun, Huang Xuanjing, Qiu Xipeng
- Uncovering Safety Risks Of Large Language Models Through Concept Activation Vector Xu Zhihao, Huang Ruixuan, Chen Changyu, Wang Shuai, Wang Xiting
- Activerag: Revealing The Treasures Of Knowledge Via Active Learning Xu Zhipeng, Liu Zhenghao, Liu Yibin, Xiong Chenyan, Yan Yukun, Wang Shuo, Yu Shi, Liu Zhiyuan, Yu Ge
- Vision-flan: Scaling Human-labeled Tasks In Visual Instruction Tuning Xu Zhiyang, Feng Chao, Shao Rulin, Ashby Trevor, Shen Ying, Jin Di, Cheng Yu, Wang Qifan, Huang Lifu
- Do Large Language Models Have Compositional Ability? An Investigation Into Limitations And Scalability Xu Zhuoyan, Shi Zhenmei, Liang Yingyu
- A Comprehensive Study Of Jailbreak Attack Versus Defense For Large Language Models Xu Zihao, Liu Yi, Deng Gelei, Li Yuekang, Picek Stjepan
- LEMMA: Towards Lvlm-enhanced Multimodal Misinformation Detection With External Knowledge Augmentation Xuan Keyang, Yi Li, Yang Fan, Wu Ruochen, Fung Yi R., Ji Heng
- Multi-programming Language Ensemble For Code Generation In Large Language Model Xue Tengfei, Li Xuefeng, Azim Tahir, Smirnov Roman, Yu Jianhui, Sadrieh Arash, Pahlavan Babak
- Medlm: Exploring Language Models For Medical Question Answering Systems Yagnik Niraj, Jhaveri Jay, Sharma Vivek, Pila Gabriel
- LTNER: Large Language Model Tagging For Named Entity Recognition With Contextualized Entity Marking Yan Faren, Yu Peng, Chen Xin
- Predicting The Big Five Personality Traits In Chinese Counselling Dialogues Using Large Language Models Yan Yang, Ma Lizhi, Li Anqi, Ma Jingsong, Lan Zhenzhong
- Echosight: Advancing Visual-language Models With Wiki Knowledge Yan Yibin, Xie Weidi
- Social Life Simulation For Non-cognitive Skills Learning Yan Zihan, Xiang Yaohong, Huang Yun
- Report Cards: Qualitative Evaluation Of Language Models Using Natural Language Summaries Yang Blair, Cui Fuyang, Paster Keiran, Ba Jimmy, Vaezipoor Pashootan, Pitis Silviu, Zhang Michael R.
- Crafting Customisable Characters With Llms: Introducing Simschat, A Persona-driven Role-playing Agent Framework Yang Bohao, Liu Dong, Tang Chen, Xiao Chenghao, Zhao Kun, Li Chao, Yuan Lin, Yang Guang, Huang Lanxiao, Lin Chenghua
- Emphasising Structured Information: Integrating Abstract Meaning Representation Into Llms For Enhanced Open-domain Dialogue Evaluation Yang Bohao, Zhao Kun, Tang Chen, Liu Dong, Zhan Liang, Lin Chenghua
- MAPLE: Enhancing Review Generation With Multi-aspect Prompt Learning In Explainable Recommendation Yang Ching-wen, Chen Che Wei, Wu Kun-da, Xu Hao, Yao Jui-feng, Kao Hung-yu
- Improving Factuality In Large Language Models Via Decoding-time Hallucinatory And Truthful Comparators Yang Dingkang, Xiao Dongling, Wei Jinjie, Li Mingcheng, Chen Zhaoyu, Li Ke, Zhang Lihua
- Protrain: Efficient LLM Training Via Memory-aware Techniques Yang Hanmei, Zhou Jin, Fu Yao, Wang Xiaoqun, Roane Ramine, Guan Hui, Liu Tongping
- Aligning Llms Through Multi-perspective User Preference Ranking-based Feedback For Programming Question Answering Yang Hongyu, He Liyang, Hou Min, Shen Shuanghong, Li Rui, Hou Jiahui, Ma Jianhui, Zhao Junda
- Large Language Model Tokenizer Bias: A Case Study And Solution On GPT-4o Yang Jin, Wang Zhiqiang, Lin Yanbin, Zhao Zunduo
- Rethinking Tokenization: Crafting Better Tokenizers For Large Language Models Yang Jinbiao
- Asynchronous And Segmented Bidirectional Encoding For NMT Yang Jingpu, Han Zehua, Xiang Mengyu, Wang Helin, Huang Yuxiao, Fang Miao
- Mastering Text-to-image Diffusion: Recaptioning, Planning, And Generating With Multimodal Llms Yang Ling, Yu Zhaochen, Meng Chenlin, Xu Minkai, Ermon Stefano, Cui Bin
- Regularizing Hidden States Enables Learning Generalizable Reward Model For Llms Yang Rui, Ding Ruomeng, Lin Yong, Zhang Huan, Zhang Tong
- Selfgoal: Your Language Agents Already Know How To Achieve High-level Goals Yang Ruihan, Chen Jiangjie, Zhang Yikai, Yuan Siyu, Chen Aili, Richardson Kyle, Xiao Yanghua, Yang Deqing
- GPT-4 As Evaluator: Evaluating Large Language Models On Pest Management In Agriculture Yang Shanglong, Yuan Zhipeng, Li Shunbao, Peng Ruoling, Liu Kang, Yang Po
- CRAG -- Comprehensive RAG Benchmark Yang Xiao, Sun Kai, Xin Hao, Sun Yushi, Bhalla Nikita, Chen Xiangsen, Choudhary Sajal, Gui Rongze Daniel, Jiang Ziran Will, Jiang Ziyu, Kong Lingkun, Moran Brian, Wang Jiaqi, Xu Yifan Ethan, Yan An, Yang Chenyu, Yuan Eting, Zha Hanwen, Tang Nan, Chen Lei, Scheffer Nicolas, Liu Yue, Shah Nirav, Wanga Rakesh, Kumar Anuj, Yih Wen-tau, Dong Xin Luna
- Adazeta: Adaptive Zeroth-order Tensor-train Adaption For Memory-efficient Large Language Models Fine-tuning Yang Yifan, Zhen Kai, Banijamal Ershad, Mouchtaris Athanasios, Zhang Zheng
- Loretta: Low-rank Economic Tensor-train Adaptation For Ultra-low-parameter Fine-tuning Of Large Language Models Yang Yifan, Zhou Jiajun, Wong Ngai, Zhang Zheng
- Guardt2i: Defending Text-to-image Models From Adversarial Prompts Yang Yijun, Gao Ruiyuan, Yang Xiao, Zhong Jianyuan, Xu Qiang
- P3: A Policy-driven, Pace-adaptive, And Diversity-promoted Framework For Optimizing LLM Training Yang Yingxuan, Wang Huayi, Wen Muning, Zhang Weinan
- Can Llms Reason In The Wild With Programs? Yang Yuan, Xiong Siheng, Payani Ali, Shareghi Ehsan, Fekri Faramarz
- Weak-to-strong Reasoning Yang Yuqing, Ma Yan, Liu Pengfei
- Exploring And Unleashing The Power Of Large Language Models In Automated Code Translation Yang Zhen, Liu Fang, Yu Zhongxing, Keung Jacky Wai, Li Jia, Liu Shuo, Hong Yifan, Ma Xiaoxue, Jin Zhi, Li Ge
- React Meets Actre: When Language Agents Enjoy Training Data Autonomy Yang Zonghan, Li Peng, Yan Ming, Zhang Ji, Huang Fei, Liu Yang
- Curiousllm: Elevating Multi-document QA With Reasoning-infused Knowledge Graph Prompting Yang Zukang, Zhu Zixuan
- Machine Unlearning Of Pre-trained Large Language Models Yao Jin, Chien Eli, Du Minxin, Niu Xinyao, Wang Tianhao, Cheng Zezhou, Yue Xiang
- CLAVE: An Adaptive Framework For Evaluating Values Of LLM Generated Responses Yao Jing, Yi Xiaoyuan, Xie Xing
- Open-domain Implicit Format Control For Large Language Model Generation Yao Yiqun, Ma Wenjia, Fang Xuezhi, Jiang Xin, Li Xiang, Meng Xuying, Han Peng, Li Jing, Sun Aixin, Wang Yequan
- Learning From Correctness Without Prompting Makes LLM Efficient Reasoner Yao Yuxuan, Wu Han, Guo Zhijiang, Zhou Biyan, Gao Jiahui, Luo Sichun, Hou Hanxu, Fu Xiaojin, Song Linqi
- Benchmarking Llms Via Uncertainty Quantification Ye Fanghua, Yang Mingming, Pang Jianhui, Wang Longyue, Wong Derek F., Yilmaz Emine, Shi Shuming, Tu Zhaopeng
- R^2AG: Incorporating Retrieval Information Into Retrieval Augmented Generation Ye Fuda, Li Shuangyin, Zhang Yongqi, Chen Lei
- Preference-guided Reflective Sampling For Aligning Language Models Ye Hai, Ng Hwee Tou
- Self-judge: Selective Instruction Following With Alignment Self-evaluation Ye Hai, Ng Hwee Tou
- Tooleyes: Fine-grained Evaluation For Tool Learning Capabilities Of Large Language Models In Real-world Scenarios Ye Junjie, Li Guanyu, Gao Songyang, Huang Caishuang, Wu Yilong, Li Sixian, Fan Xiaoran, Dou Shihan, Zhang Qi, Gui Tao, Huang Xuanjing
- Toolsword: Unveiling Safety Issues Of Large Language Models In Tool Learning Across Three Stages Ye Junjie, Li Sixian, Li Guanyu, Huang Caishuang, Gao Songyang, Wu Yilong, Zhang Qi, Gui Tao, Huang Xuanjing
- Rotbench: A Multi-level Benchmark For Evaluating The Robustness Of Large Language Models In Tool Learning Ye Junjie, Wu Yilong, Gao Songyang, Huang Caishuang, Li Sixian, Li Guanyu, Fan Xiaoran, Zhang Qi, Gui Tao, Huang Xuanjing
- Dataframe QA: A Universal LLM Framework On Dataframe Question Answering Without Data Exposure Ye Junyi, Du Mengnan, Wang Guiling
- Interrogatellm: Zero-resource Hallucination Detection In Llm-generated Answers Yehuda Yakir, Malkiel Itzik, Barkan Oren, Weill Jonathan, Ronen Royi, Koenigstein Noam
- Long-context Language Modeling With Parallel Context Encoding Yen Howard, Gao Tianyu, Chen Danqi
- Self-training Large Language Models Through Knowledge Detection Yeo Wei Jie, Ferdinan Teddy, Kazienko Przemyslaw, Satapathy Ranjan, Cambria Erik
- Using Large Language Models To Create AI Personas For Replication And Prediction Of Media Effects: An Empirical Test Of 133 Published Experimental Research Findings Yeykelis Leo, Pichai Kaavya, Cummings James J., Reeves Byron
- Debiasing Multimodal Large Language Models Yi-fan Zhang, Weichen Yu, Qingsong Wen, Xue Wang, Zhang Zhang, Liang Wang, Rong Jin, Tieniu Tan
- The Task-oriented Queries Benchmark (toqb) Yim Keun Soo
- Lofit: Localized Fine-tuning On LLM Representations Yin Fangcong, Ye Xi, Durrett Greg
- MMAU: A Holistic Benchmark Of Agent Capabilities Across Diverse Domains Yin Guoli, Bai Haoping, Ma Shuang, Nan Feng, Sun Yanchao, Xu Zhaoyang, Ma Shen, Lu Jiarui, Kong Xiang, Zhang Aonan, Yap Dian Ang, Zhang Yizhe, Ahnert Karsten, Kamath Vik, Berglund Mathias, Walsh Dominic, Gindele Tobias, Wiest Juergen, Lai Zhengfeng, Wang Xiaoming, Shan Jiulong, Cao Meng, Pang Ruoming, Wang Zirui
- Aggregation Of Reasoning: A Hierarchical Framework For Enhancing Answer Selection In Large Language Models Yin Zhangyue, Sun Qiushi, Guo Qipeng, Zeng Zhiyuan, Li Xiaonan, Sun Tianxiang, Chang Cheng, Cheng Qinyuan, Wang Ding, Mou Xiaofeng, Qiu Xipeng, Huang Xuanjing
- Internlm-math: Open Math Large Language Models Toward Verifiable Reasoning Ying Huaiyuan, Zhang Shuo, Li Linyang, Zhou Zhejian, Shao Yunfan, Fei Zhaoye, Ma Yichuan, Hong Jiawei, Liu Kuikun, Wang Ziyi, Wang Yudong, Wu Zijian, Li Shuaibin, Zhou Fengzhe, Liu Hongwei, Zhang Songyang, Zhang Wenwei, Yan Hang, Qiu Xipeng, Wang Jiayu, Chen Kai, Lin Dahua
- Llms-as-instructors: Learning From Errors Toward Automating Model Improvement Ying Jiahao, Lin Mingbao, Cao Yixin, Tang Wei, Wang Bo, Sun Qianru, Huang Xuanjing, Yan Shuicheng
- Debunc: Mitigating Hallucinations In Large Language Model Agent Communication With Uncertainty Estimations Yoffe Luke, Amayuelas Alfonso, Wang William Yang
- Matryoshka-adaptor: Unsupervised And Supervised Tuning For Smaller Embedding Dimensions Yoon Jinsung, Sinha Raj, Arik Sercan O, Pfister Tomas
- Truth-aware Context Selection: Mitigating Hallucinations Of Large Language Models Being Misled By Untruthful Contexts Yu Tian, Zhang Shaolei, Feng Yang
- Ra-rec: An Efficient ID Representation Alignment Framework For Llm-based Recommendation Yu Xiaohan, Zhang Li, Zhao Xin, Wang Yue, Ma Zhongrui
- Neeko: Leveraging Dynamic Lora For Efficient Multi-character Role-playing Agent Yu Xiaoyan, Luo Tongxu, Wei Yifan, Lei Fangyu, Huang Yiming, Peng Hao, Zhu Liehuang
- BEYOND DIALOGUE: A Profile-dialogue Alignment Framework Towards General Role-playing Language Model Yu Yeyong, Yu Runsheng, Wei Haojie, Zhang Zhanqiu, Qian Quan
- Rankrag: Unifying Context Ranking With Retrieval-augmented Generation In Llms Yu Yue, Ping Wei, Liu Zihan, Wang Boxin, You Jiaxuan, Zhang Chao, Shoeybi Mohammad, Catanzaro Bryan
- Don't Listen To Me: Understanding And Exploring Jailbreak Prompts Of Large Language Models Yu Zhiyuan, Liu Xiaogeng, Liang Shunning, Cameron Zach, Xiao Chaowei, Zhang Ning
- Kieval: A Knowledge-grounded Interactive Evaluation Framework For Large Language Models Yu Zhuohao, Gao Chang, Yao Wenjin, Wang Yidong, Ye Wei, Wang Jindong, Xie Xing, Zhang Yue, Zhang Shikun
- Beyond Under-alignment: Atomic Preference Enhanced Factuality Tuning For Large Language Models Yuan Hongbang, Chen Yubo, Cao Pengfei, Jin Zhuoran, Liu Kang, Zhao Jun
- Towards A Holistic Evaluation Of Llms On Factual Knowledge Recall Yuan Jiaqing, Pan Lin, Hang Chung-wei, Guo Jiang, Jiang Jiarong, Min Bonan, Ng Patrick, Wang Zhiguo
- Focused Large Language Models Are Stable Many-shot Learners Yuan Peiwen, Feng Shaoxiong, Li Yiwei, Wang Xinglin, Zhang Yueqi, Tan Chuyi, Pan Boyuan, Wang Heda, Hu Yao, Li Kan
- EASYTOOL: Enhancing Llm-based Agents With Concise Tool Instruction Yuan Siyu, Song Kaitao, Chen Jiangjie, Tan Xu, Shen Yongliang, Kan Ren, Li Dongsheng, Yang Deqing
- LLMCRIT: Teaching Large Language Models To Use Criteria Yuan Weizhe, Liu Pengfei, Gallé Matthias
- A Hybrid RAG System With Comprehensive Enhancement On Complex Reasoning Yuan Ye, Liu Chengwu, Yuan Jingyang, Sun Gongbo, Li Siqi, Zhang Ming
- Rigorllm: Resilient Guardrails For Large Language Models Against Undesired Content Yuan Zhuowen, Xiong Zidi, Zeng Yi, Yu Ning, Jia Ruoxi, Song Dawn, Li Bo
- Distilling Instruction-following Abilities Of Large Language Models With Task-aware Curriculum Planning Yue Yuanhao, Wang Chengyu, Huang Jun, Wang Peng
- Wkvquant: Quantizing Weight And Key/value Cache For Large Language Models Gains More Yue Yuxuan, Yuan Zhihang, Duanmu Haojie, Zhou Sifan, Wu Jianlong, Nie Liqiang
- Can Small Language Models Be Good Reasoners For Sequential Recommendation? Yuling Wang, Changxin Tian, Binbin Hu, Yanhua Yu, Ziqi Liu, Zhiqiang Zhang, Jun Zhou, Liang Pang, Xiao Wang
- Web2code: A Large-scale Webpage-to-code Dataset And Evaluation Framework For Multimodal Llms Yun Sukmin, Lin Haokun, Thushara Rusiru, Bhat Mohammad Qazim, Wang Yongxin, Jiang Zutao, Deng Mingkai, Wang Jinhong, Tao Tianhua, Li Junbo, Li Haonan, Nakov Preslav, Baldwin Timothy, Liu Zhengzhong, Xing Eric P., Liang Xiaodan, Shen Zhiqiang
- Investigating Continual Pretraining In Large Language Models: Insights And Implications Yıldız Çağatay, Ravichandran Nishaanth Kanna, Punia Prishruit, Bethge Matthias, Ermis Beyza
- WSC+: Enhancing The Winograd Schema Challenge Using Tree-of-experts Zahraei Pardis Sadat, Emami Ali
- Envgen: Generating And Adapting Environments Via Llms For Training Embodied Agents Zala Abhay, Cho Jaemin, Lin Han, Yoon Jaehong, Bansal Mohit
- ABQ-LLM: Arbitrary-bit Quantized Inference Acceleration For Large Language Models Zeng Chao, Liu Songwei, Xie Yusheng, Liu Hong, Wang Xiaojian, Wei Miao, Yang Shu, Chen Fangmin, Mei Xing
- Improving Logits-based Detector Without Logits From Black-box Llms Zeng Cong, Tang Shengkun, Yang Xianjun, Chen Yuanzhou, Sun Yiyou, Xu Zhiqiang, Li Yao, Chen Haifeng, Cheng Wei, Xu Dongkuan
- Chimera: A Lossless Decoding Method For Accelerating Large Language Models Inference By Fusing All Tokens Zeng Ziqian, Yu Jiahong, Pang Qianshi, Wang Zihao, Zhuang Huiping, Shao Hongen, Zou Xiaofeng
- Personarag: Enhancing Retrieval-augmented Generation Systems With User-centric Agents Zerhoudi Saber, Granitzer Michael
- Actions Speak Louder Than Words: Trillion-parameter Sequential Transducers For Generative Recommendations Zhai Jiaqi, Liao Lucy, Liu Xing, Wang Yueming, Li Rui, Cao Xuan, Gao Leon, Gong Zhaojie, Gu Fangda, He Michael, Lu Yinghai, Shi Yu
- Unveiling The Lexical Sensitivity Of Llms: Combinatorial Optimization For Prompt Enhancement Zhan Pengwei, Xu Zhen, Tan Qian, Song Jie, Xie Ru
- Injecagent: Benchmarking Indirect Prompt Injections In Tool-integrated Large Language Model Agents Zhan Qiusi, Liang Zhixiang, Ying Zifan, Kang Daniel
- Distilling Implicit Multimodal Knowledge Into Llms For Zero-resource Dialogue Generation Zhang Bo, Ma Hui, Ding Jian, Wang Jian, Xu Bo, Lin Hongfei
- Simple Techniques For Enhancing Sentence Embeddings In Generative Language Models Zhang Bowen, Chang Kehua, Li Chunping
- Notellm-2: Multimodal Large Representation Models For Recommendation Zhang Chao, Zhang Haoxin, Wu Shiwei, Wu Di, Xu Tong, Gao Yan, Hu Yao, Chen Enhong
- Teaching Large Language Models An Unseen Language On The Fly Zhang Chen, Liu Xiao, Lin Jiuheng, Feng Yansong
- Ts-align: A Teacher-student Collaborative Framework For Scalable Iterative Finetuning Of Large Language Models Zhang Chen, Tang Chengguang, Chong Dading, Shi Ke, Tang Guohua, Jiang Feng, Li Haizhou
- Sciglm: Training Scientific Language Models With Self-reflective Instruction Annotation And Tuning Zhang Dan, Hu Ziniu, Zhoubian Sining, Du Zhengxiao, Yang Kaiyu, Wang Zihan, Yue Yisong, Dong Yuxiao, Tang Jie
- Chemllm: A Chemical Large Language Model Zhang Di, Liu Wei, Tan Qian, Chen Jingdan, Yan Hang, Yan Yuliang, Li Jiatong, Huang Weiran, Yue Xiangyu, Ouyang Wanli, Zhou Dongzhan, Zhang Shufei, Su Mao, Zhong Han-sen, Li Yuqiang
- Map-neo: Highly Capable And Transparent Bilingual Large Language Model Series Zhang Ge, Qu Scott, Liu Jiaheng, Zhang Chenchen, Lin Chenghua, Yu Chou Leuang, Pan Danny, Cheng Esther, Liu Jie, Lin Qunshu, Yuan Raven, Zheng Tuney, Pang Wei, Du Xinrun, Liang Yiming, Ma Yinghao, Li Yizhi, Ma Ziyang, Lin Bill, Benetos Emmanouil, Yang Huan, Zhou Junting, Ma Kaijing, Liu Minghao, Niu Morry, Wang Noah, Que Quehry, Liu Ruibo, Liu Sine, Guo Shawn, Gao Soren, Zhou Wangchunshu, Zhang Xinyue, Zhou Yizhi, Wang Yubo, Bai Yuelin, Zhang Yuhan, Zhang Yuxiang, Wang Zenith, Yang Zhenzhu, Zhao Zijian, Zhang Jiajun, Ouyang Wanli, Huang Wenhao, Chen Wenhu
- ALTER: Augmentation For Large-table-based Reasoning Zhang Han, Ma Yuheng, Yang Hanfang
- Language Model Prompt Selection Via Simulation Optimization Zhang Haoting, He Jinghai, Righter Rhonda, Zheng Zeyu
- Iterative Utility Judgment Framework Via Llms Inspired By Relevance In Philosophy Zhang Hengran, Bi Keping, Guo Jiafeng, Cheng Xueqi
- Balancing Speciality And Versatility: A Coarse To Fine Framework For Supervised Fine-tuning Large Language Model Zhang Hengyuan, Wu Yanru, Li Dawei, Yang Sak, Zhao Rui, Jiang Yong, Tan Fei
- Adaptable Logical Control For Large Language Models Zhang Honghua, Kung Po-nien, Yoshida Masahiro, Broeck Guy Van Den, Peng Nanyun
- Kb-plugin: A Plug-and-play Framework For Large Language Models To Induce Programs Over Low-resourced Knowledge Bases Zhang Jiajie, Cao Shulin, Hu Linmei, Feng Ling, Hou Lei, Li Juanzi
- Should We Fear Large Language Models? A Structural Analysis Of The Human Reasoning System For Elucidating LLM Capabilities And Risks Through The Lens Of Heidegger's Philosophy Zhang Jianqiiu
- Stealthy Attack On Large Language Model Based Recommendation Zhang Jinghao, Liu Yuting, Liu Qiang, Wu Shu, Guo Guibing, Wang Liang
- Onegen: Efficient One-pass Unified Generation And Retrieval For Llms Zhang Jintian, Peng Cheng, Sun Mengshu, Chen Xiang, Liang Lei, Zhang Zhiqiang, Zhou Jun, Chen Huajun, Zhang Ningyu
- Android In The Zoo: Chain-of-action-thought For GUI Agents Zhang Jiwen, Wu Jihao, Teng Yihua, Liao Minghui, Xu Nuo, Xiao Xiao, Wei Zhongyu, Tang Duyu
- Fast And Slow Generating: An Empirical Study On Large And Small Language Models Collaborative Decoding Zhang Kaiyan, Wang Jianyu, Ding Ning, Qi Biqing, Hua Ermo, Lv Xingtai, Zhou Bowen
- Exploring The Best Practices Of Query Expansion With Large Language Models Zhang Le, Wu Yihong, Yang Qian, Nie Jian-yun
- Predicting Learning Performance With Large Language Models: A Study In Adult Literacy Zhang Liang, Lin Jionghao, Borchers Conrad, Sabatini John, Hollander John, Cao Meng, Hu Xiangen
- Enhancing Multi-hop Reasoning Through Knowledge Erasure In Large Language Model Editing Zhang Mengqi, Fang Bowen, Liu Qiang, Ren Pengjie, Wu Shu, Chen Zhumin, Wang Liang
- Automated Multi-level Preference For Mllms Zhang Mengxi, Wu Wenhao, Lu Yu, Song Yuxin, Rong Kang, Yao Huanjin, Zhao Jianbo, Liu Fanglong, Sun Yifan, Feng Haocheng, Wang Jingdong
- Look Ahead Or Look Around? A Theoretical Comparison Between Autoregressive And Masked Pretraining Zhang Qi, Du Tianqi, Huang Haotian, Wang Yifei, Wang Yisen
- RECOST: External Knowledge Guided Data-efficient Instruction Tuning Zhang Qi, Zhang Yiming, Wang Haobo, Zhao Junbo
- VAR-CLIP: Text-to-image Generator With Visual Auto-regressive Modeling Zhang Qian, Dai Xiangzi, Yang Ninghua, An Xiang, Feng Ziyong, Ren Xingyu
- Collaborative Performance Prediction For Large Language Models Zhang Qiyuan, Lyu Fuyuan, Liu Xue, Ma Chen
- Plad: Preference-based Large Language Model Distillation With Pseudo-preference Pairs Zhang Rongzhi, Shen Jiaming, Liu Tianqi, Wang Haorui, Qin Zhen, Han Feng, Liu Jialu, Baumgartner Simon, Bendersky Michael, Zhang Chao
- Direct Preference Optimization Of Video Large Multimodal Models From Language Model Reward Zhang Ruohong, Gui Liangke, Sun Zhiqing, Feng Yihao, Xu Keyang, Zhang Yuanhan, Fu Di, Li Chunyuan, Hauptmann Alexander, Bisk Yonatan, Yang Yiming
- Offline Training Of Language Model Agents With Functions As Learnable Weights Zhang Shaokun, Zhang Jieyu, Liu Jiale, Song Linxin, Wang Chi, Krishna Ranjay, Wu Qingyun
- Cfbench: A Comprehensive Constraints-following Benchmark For Llms Zhang Tao, Shen Yanjun, Luo Wenjing, Zhang Yan, Liang Hao, Zhang Tao, Yang Fan, Lin Mingan, Qiao Yujing, Chen Weipeng, Cui Bin, Zhang Wentao, Zhou Zenan
- Leanquant: Accurate Large Language Model Quantization With Loss-error-aware Grid Zhang Tianyi, Shrivastava Anshumali
- Methodology Of Adapting Large English Language Models For Specific Cultural Contexts Zhang Wenjing, Xiao Siqi, Lei Xuejiao, Wang Ning, Zhang Huazheng, An Meijuan, Yang Bikun, Liu Zhaoxiang, Wang Kai, Lian Shiguo
- Tired Of Plugins? Large Language Models Can Be End-to-end Recommenders Zhang Wenlin, Wu Chuhan, Li Xiangyang, Wang Yuhao, Dong Kuicai, Wang Yichao, Dai Xinyi, Zhao Xiangyu, Guo Huifeng, Tang Ruiming
- Dólares Or Dollars? Unraveling The Bilingual Prowess Of Financial Llms Between Spanish And English Zhang Xiao, Xiang Ruoyu, Yuan Chenhan, Feng Duanyu, Han Weiguang, Lopez-lira Alejandro, Liu Xiao-yang, Ananiadou Sophia, Peng Min, Huang Jimin, Xie Qianqian
- Event Temporal Relation Extraction Based On Retrieval-augmented On Llms Zhang Xiaobin, Zang Liangjun, Liu Qianwen, Wei Shuchong, Hu Songlin
- Llms Instruct Llms: An Extraction And Editing Method Zhang Xin, Ju Tianjie, Liang Huijia, Fu Ying, Zhang Qin
- Laser: Parameter-efficient LLM Bi-tuning For Sequential Recommendation With Collaborative Information Zhang Xinyu, Hu Linmei, Zhang Luhao, Song Dandan, Huang Heyan, Nie Liqiang
- Ask-before-plan: Proactive Language Agents For Real-world Planning Zhang Xuan, Deng Yang, Ren Zifeng, Ng See-kiong, Chua Tat-seng
- Glape: Gold Label-agnostic Prompt Evaluation And Optimization For Large Language Model Zhang Xuanchang, Zhang Zhuosheng, Zhao Hai
- RAGLAB: A Modular And Research-oriented Unified Framework For Retrieval-augmented Generation Zhang Xuanwang, Song Yunze, Wang Yidong, Tang Shuyun, Li Xinfeng, Zeng Zhengran, Wu Zhen, Ye Wei, Xu Wenyuan, Zhang Yue, Dai Xinyu, Zhang Shikun, Wen Qingsong
- Pushing The Limit Of LLM Capacity For Text Classification Zhang Yazhou, Wang Mengyao, Ren Chenyu, Li Qiuchi, Tiwari Prayag, Wang Benyou, Qin Jing
- ELAD: Explanation-guided Large Language Models Active Distillation Zhang Yifei, Pan Bo, Ling Chen, Hu Yuntong, Zhao Liang
- Rankclip: Ranking-consistent Language-image Pretraining Zhang Yiming, Zhao Zhuokai, Chen Zhaorun, Feng Zhili, Ding Zenghui, Sun Yining
- MLKD-BERT: Multi-level Knowledge Distillation For Pre-trained Language Models Zhang Ying, Yang Ziheng, Ji Shufan
- Leveraging Biases In Large Language Models: "bias-knn" For Effective Few-shot Learning Zhang Yong, Li Hanzhang, Li Zhitao, Cheng Ning, Li Ming, Xiao Jing, Wang Jianzong
- Text-augmented Multimodal Llms For Chemical Reaction Condition Recommendation Zhang Yu, Yu Ruijie, Zeng Kaipeng, Li Ding, Zhu Feng, Yang Xiaokang, Jin Yaohui, Xu Yanyan
- Toolbehonest: A Multi-level Hallucination Diagnostic Benchmark For Tool-augmented Large Language Models Zhang Yuxiang, Chen Jing, Wang Junjie, Liu Yaxin, Yang Cheng, Shi Chufan, Zhu Xinyu, Lin Zihao, Wan Hanwen, Yang Yujiu, Sakai Tetsuya, Feng Tian, Yamana Hayato
- PPTC-R Benchmark: Towards Evaluating The Robustness Of Large Language Models For Powerpoint Task Completion Zhang Zekai, Guo Yiduo, Liang Yaobo, Zhao Dongyan, Duan Nan
- Get Confused Cautiously: Textual Sequence Memorization Erasure With Selective Entropy Maximization Zhang Zhaohan, Liu Ziquan, Patras Ioannis
- DARG: Dynamic Evaluation Of Large Language Models Via Adaptive Reasoning Graph Zhang Zhehao, Chen Jiaao, Yang Diyi
- Galla: Graph Aligned Large Language Models For Improved Source Code Understanding Zhang Ziyin, Yu Hang, Li Shijie, Di Peng, Li Jianguo, Wang Rui
- Cpsyexam: A Chinese Benchmark For Evaluating Psychology Using Examinations Zhao Jiahao, Zhu Jingwei, Tan Minghuan, Yang Min, Yang Di, Zhang Chenhao, Ye Guancheng, Li Chengming, Hu Xiping
- Prefix Guidance: A Steering Wheel For Large Language Models To Defend Against Jailbreak Attacks Zhao Jiawei, Chen Kejiang, Yuan Xiaojian, Zhang Weiming
- Language Model Council: Benchmarking Foundation Models On Highly Subjective Tasks By Consensus Zhao Justin, Plaza-del-arco Flor Miriam, Curry Amanda Cercas
- SLIDE: A Framework Integrating Small And Large Language Models For Open-domain Dialogues Evaluation Zhao Kun, Yang Bohao, Tang Chen, Lin Chenghua, Zhan Liang
- Mitigating Object Hallucination In Large Vision-language Models Via Classifier-free Guidance Zhao Linxi, Deng Yihe, Zhang Weitong, Gu Quanquan
- Diffagent: Fast And Accurate Text-to-image API Selection With Large Language Model Zhao Lirui, Yang Yue, Zhang Kaipeng, Shao Wenqi, Zhang Yuxin, Qiao Yu, Luo Ping, Ji Rongrong
- Efficiently Training 7B LLM With 1 Million Sequence Length On 8 Gpus Zhao Pinxue, Zhang Hailin, Fu Fangcheng, Nie Xiaonan, Liu Qibin, Yang Fang, Peng Yuanbo, Jiao Dian, Li Shuaipeng, Xue Jinbao, Tao Yangyu, Cui Bin
- SYNTHEVAL: Hybrid Behavioral Testing Of NLP Models With Synthetic Checklists Zhao Raoyuan, Köksal Abdullatif, Liu Yihong, Weissweiler Leonie, Korhonen Anna, Schütze Hinrich
- A Survey Of Backdoor Attacks And Defenses On Large Language Models: Implications For Security Measures Zhao Shuai, Jia Meihuizi, Guo Zhongliang, Gan Leilei, Xu Xiaoyu, Wu Xiaobao, Fu Jie, Feng Yichao, Pan Fengjun, Tuan Luu Anh
- Ouroboros: Generating Longer Drafts Phrase By Phrase For Faster Speculative Decoding Zhao Weilin, Huang Yuxiang, Han Xu, Xu Wang, Xiao Chaojun, Zhang Xinrong, Fang Yewei, Zhang Kaihuo, Liu Zhiyuan, Sun Maosong
- Towards Comprehensive And Efficient Post Safety Alignment Of Large Language Models Via Safety Patching Zhao Weixiang, Hu Yulin, Li Zhuojun, Deng Yang, Zhao Yanyan, Qin Bing, Chua Tat-seng
- SAPT: A Shared Attention Framework For Parameter-efficient Continual Learning Of Large Language Models Zhao Weixiang, Wang Shilong, Hu Yulin, Zhao Yanyan, Qin Bing, Zhang Xuanyu, Yang Qing, Xu Dongliang, Che Wanxiang
- Wildchat: 1M Chatgpt Interaction Logs In The Wild Zhao Wenting, Ren Xiang, Hessel Jack, Cardie Claire, Choi Yejin, Deng Yuntian
- An Empirical Study Of Retrieval Augmented Generation With Chain-of-thought Zhao Yuetong, Cao Hongyu, Zhao Xianyu, Ou Zhijian
- Improving The Robustness Of Large Language Models Via Consistency Alignment Zhao Yukun, Yan Lingyong, Sun Weiwei, Xing Guoliang, Wang Shuaiqiang, Meng Chong, Cheng Zhicong, Ren Zhaochun, Yin Dawei
- Let Me Do It For You: Towards LLM Empowered Recommendation Via Tool Learning Zhao Yuyue, Wu Jiancan, Wang Xiang, Tang Wei, Wang Dingxian, De Rijke Maarten
- SWIFT: A Scalable Lightweight Infrastructure For Fine-tuning Zhao Yuze, Huang Jintao, Hu Jinghan, Wang Xingjun, Mao Yunlin, Zhang Daoze, Jiang Zeyinzi, Wu Zhikai, Ai Baole, Wang Ang, Zhou Wenmeng, Chen Yingda
- Adversarial Contrastive Decoding: Boosting Safety Alignment Of Large Language Models Via Opposite Prompt Optimization Zhao Zhengyue, Zhang Xiaoyun, Xu Kaidi, Hu Xing, Zhang Rui, Du Zidong, Guo Qi, Chen Yunji
- Retrieval-augmented Mixture Of Lora Experts For Uploadable Machine Learning Zhao Ziyu, Gan Leilei, Wang Guoyin, Hu Yuwei, Shen Tao, Yang Hongxia, Kuang Kun, Wu Fei
- Loraretriever: Input-aware Lora Retrieval And Composition For Mixed Tasks In The Wild Zhao Ziyu, Gan Leilei, Wang Guoyin, Zhou Wangchunshu, Yang Hongxia, Kuang Kun, Wu Fei
- Leveraging Large Language Models With Chain-of-thought And Prompt Engineering For Traffic Crash Severity Analysis And Inference Zhen Hao, Shi Yucheng, Huang Yongcan, Yang Jidong J., Liu Ninghao
- Gpt-4v(ision) Is A Generalist Web Agent, If Grounded Zheng Boyuan, Gou Boyu, Kil Jihyung, Sun Huan, Su Yu
- Trustscore: Reference-free Evaluation Of LLM Response Trustworthiness Zheng Danna, Liu Danyang, Lapata Mirella, Pan Jeff Z.
- Ali-agent: Assessing Llms' Alignment With Human Values Via Agent-based Evaluation Zheng Jingnan, Wang Han, Zhang An, Nguyen Tai D., Sun Jun, Chua Tat-seng
- Towards Lifelong Learning Of Large Language Models: A Survey Zheng Junhao, Qiu Shengjie, Shi Chengming, Ma Qianli
- Dreamlip: Language-image Pre-training With Long Captions Zheng Kecheng, Zhang Yifei, Wu Wei, Lu Fan, Ma Shuailei, Jin Xin, Chen Wei, Shen Yujun
- Critic-cot: Boosting The Reasoning Abilities Of Large Language Model Via Chain-of-thoughts Critic Zheng Xin, Lou Jie, Cao Boxi, Wen Xueru, Ji Yuqiu, Lin Hongyu, Lu Yaojie, Han Xianpei, Zhang Debing, Sun Le
- Llamafactory: Unified Efficient Fine-tuning Of 100+ Language Models Zheng Yaowei, Zhang Richong, Zhang Junhao, Ye Yanhan, Luo Zheyan, Feng Zhangchi, Ma Yongqiang
- Harnessing Large Language Models For Text-rich Sequential Recommendation Zheng Zhi, Chao Wenshuo, Qiu Zhaopeng, Zhu Hengshu, Xiong Hui
- Attention Heads Of Large Language Models: A Survey Zheng Zifan, Wang Yezhaohui, Huang Yuxin, Song Shichao, Tang Bo, Xiong Feiyu, Li Zhiyu
- Agentgym: Evolving Large Language Model-based Agents Across Diverse Environments Zhiheng Xi, Yiwen Ding, Wenxiang Chen, Boyang Hong, Honglin Guo, Junzhe Wang, Dingwen Yang, Chenyang Liao, Xin Guo, Wei He, Songyang Gao, Lu Chen, Rui Zheng, Yicheng Zou, Tao Gui, Qi Zhang, Xipeng Qiu, Xuanjing Huang, Zuxuan Wu, Yu-gang Jiang
- Moextend: Tuning New Experts For Modality And Task Extension Zhong Shanshan, Gao Shanghua, Huang Zhongzhan, Wen Wushao, Zitnik Marinka, Zhou Pan
- Harnessing Large Language Models As Post-hoc Correctors Zhong Zhiqiang, Zhou Kuangyu, Mottin Davide
- Robust Prompt Optimization For Defending Language Models Against Jailbreaking Attacks Zhou Andy, Li Bo, Wang Haohan
- Navgpt-2: Unleashing Navigational Reasoning Capability For Large Vision-language Models Zhou Gengze, Hong Yicong, Wang Zun, Wang Xin Eric, Wu Qi
- Fairer Preferences Elicit Improved Human-aligned Large Language Model Judgments Zhou Han, Wan Xingchen, Liu Yinhong, Collier Nigel, Vulić Ivan, Korhonen Anna
- An LLM Feature-based Framework For Dialogue Constructiveness Assessment Zhou Lexin, Farag Youmna, Vlachos Andreas
- Mlps Compass: What Is Learned When Mlps Are Combined With Plms? Zhou Li, Chen Wenyu, Cao Yong, Zeng Dingyi, Liu Wanlong, Qu Hong
- Self-discover: Large Language Models Self-compose Reasoning Structures Zhou Pei, Pujara Jay, Ren Xiang, Chen Xinyun, Cheng Heng-tze, Le Quoc V., Chi Ed H., Zhou Denny, Mishra Swaroop, Zheng Huaixiu Steven
- Enhancing The General Agent Capabilities Of Low-parameter Llms Through Tuning And Multi-branch Reasoning Zhou Qinhao, Zhang Zihan, Xiang Xiang, Wang Ke, Wu Yuchuan, Li Yongbin
- Evaluating The Smooth Control Of Attribute Intensity In Text Generation With Llms Zhou Shang, Yao Feng, Dong Chengyu, Wang Zihan, Shang Jingbo
- Cogmg: Collaborative Augmentation Between Large Language Model And Knowledge Graph Zhou Tong, Chen Yubo, Liu Kang, Zhao Jun
- Archer: Training Language Model Agents Via Hierarchical Multi-turn RL Zhou Yifei, Zanette Andrea, Pan Jiayi, Levine Sergey, Kumar Aviral
- Few-shot Adversarial Prompt Learning On Vision-language Models Zhou Yiwei, Xia Xiaobo, Lin Zhiwei, Han Bo, Liu Tongliang
- Multi-stage Balanced Distillation: Addressing Long-tail Challenges In Sequence-level Knowledge Distillation Zhou Yuhang, Zhu Jing, Xu Paiheng, Liu Xiaoyu, Wang Xiyao, Koutra Danai, Ai Wei, Huang Furong
- Metagpt: Merging Large Language Models Using Model Exclusive Task Arithmetic Zhou Yuyan, Song Liang, Wang Bingning, Chen Weipeng
- Llmbind: A Unified Modality-task Integration Framework Zhu Bin, Ning Munan, Jin Peng, Lin Bin, Huang Jinfa, Song Qi, Zhang Junwu, Tang Zhenyu, Pan Mingjun, Zhou Xing, Yuan Li
- FANNO: Augmenting High-quality Instruction Data With Open-sourced Llms Only Zhu He, Su Junyou, Lun Tianle, Tao Yicheng, Zhang Wenjia, Fan Zipei, Chen Guanhua
- Collectivesft: Scaling Large Language Models For Chinese Medical Benchmark With Collective Instructions In Healthcare Zhu Jingwei, Tan Minghuan, Yang Min, Li Ruixue, Alinejad-rokny Hamid
- Rageval: Scenario Specific RAG Evaluation Dataset Generation Framework Zhu Kunlun, Luo Yifan, Xu Dingling, Wang Ruobing, Yu Shi, Wang Shuo, Yan Yukun, Liu Zhenghao, Han Xu, Liu Zhiyuan, Sun Maosong
- A Llm-based Controllable, Scalable, Human-involved User Simulator Framework For Conversational Recommender Systems Zhu Lixi, Huang Xiaowen, Sang Jitao
- LIRE: Listwise Reward Enhancement For Preference Alignment Zhu Mingye, Liu Yi, Zhang Lei, Guo Junbo, Mao Zhendong
- Personality Alignment Of Large Language Models Zhu Minjun, Yang Linyi, Zhang Yue
- Quite Good, But Not Enough: Nationality Bias In Large Language Models -- A Case Study Of Chatgpt Zhu Shucheng, Wang Weikang, Liu Ying
- Llama-moe: Building Mixture-of-experts From Llama With Continual Pre-training Zhu Tong, Qu Xiaoye, Dong Daize, Ruan Jiacheng, Tong Jingqi, He Conghui, Cheng Yu
- How Privacy-savvy Are Large Language Models? A Case Study On Compliance And Privacy Technical Review Zhu Xichou, Liu Yang, Shen Zhou, Liu Yi, Li Min, Chen Yujun, John Benzi, Ma Zhenzhen, Hu Tao, Yang Bolong, Wang Manman, Xie Zongxing, Liu Peng, Cai Dan, Wang Junhui
- Distilling Mathematical Reasoning Capabilities Into Small Language Models Zhu Xunyu, Li Jian, Liu Yong, Ma Can, Wang Weiping
- Apt-pipe: A Prompt-tuning Tool For Social Data Annotation Using Chatgpt Zhu Yiming, Yin Zhizhuo, Tyson Gareth, Haq Ehsan-ul, Lee Lik-hang, Hui Pan
- Prompting Large Language Models For Zero-shot Clinical Prediction With Structured Longitudinal Electronic Health Record Data Zhu Yinghao, Wang Zixiang, Gao Junyi, Tong Yuning, An Jingkun, Liao Weibin, Harrison Ewen M., Ma Liantao, Pan Chengwei
- Efficient Test-time Prompt Tuning For Vision-language Models Zhu Yuhan, Zhang Guozhen, Xu Chen, Shen Haocheng, Chen Xiaoxin, Wu Gangshan, Wang Limin
- Yulan: An Open-source Large Language Model Zhu Yutao, Zhou Kun, Mao Kelong, Chen Wentong, Sun Yiding, Chen Zhipeng, Cao Qian, Wu Yihan, Chen Yushuo, Wang Feng, Zhang Lei, Li Junyi, Wang Xiaolei, Wang Lei, Zhang Beichen, Dong Zican, Cheng Xiaoxue, Chen Yuhan, Tang Xinyu, Hou Yupeng, Ren Qiangqiang, Pang Xincheng, Xie Shufang, Zhao Wayne Xin, Dou Zhicheng, Mao Jiaxin, Lin Yankai, Song Ruihua, Xu Jun, Chen Xu, Yan Rui, Wei Zhewei, Hu Di, Huang Wenbing, Gao Ze-feng, Chen Yueguo, Lu Weizheng, Wen Ji-rong
- Murar: A Simple And Effective Multimodal Retrieval And Answer Refinement Framework For Multimodal Question Answering Zhu Zhengyuan, Lee Daniel, Zhang Hong, Harsha Sai Sree, Feujio Loic, Maharaj Akash, Li Yunyao
- Lumina-next: Making Lumina-t2x Stronger And Faster With Next-dit Zhuo Le, Du Ruoyi, Xiao Han, Li Yangguang, Liu Dongyang, Huang Rongjie, Liu Wenze, Zhao Lirui, Wang Fu-yun, Ma Zhanyu, Luo Xu, Wang Zehan, Zhang Kaipeng, Zhu Xiangyang, Liu Si, Yue Xiangyu, Liu Dingning, Ouyang Wanli, Liu Ziwei, Qiao Yu, Li Hongsheng, Gao Peng
- Bigcodebench: Benchmarking Code Generation With Diverse Function Calls And Complex Instructions Zhuo Terry Yue, Vu Minh Chien, Chim Jenny, Hu Han, Yu Wenhao, Widyasari Ratnadira, Yusuf Imam Nur Bani, Zhan Haolan, He Junda, Paul Indraneil, Brunner Simon, Gong Chen, Hoang Thong, Zebaze Armel Randy, Hong Xiaoheng, Li Wen-ding, Kaddour Jean, Xu Ming, Zhang Zhihan, Yadav Prateek, Jain Naman, Gu Alex, Cheng Zhoujun, Liu Jiawei, Liu Qian, Wang Zijian, Lo David, Hui Binyuan, Muennighoff Niklas, Fried Daniel, Du Xiaoning, De Vries Harm, Von Werra Leandro
- VL-ICL Bench: The Devil In The Details Of Benchmarking Multimodal In-context Learning Zong Yongshuo, Bohdal Ondrej, Hospedales Timothy
- Systematic Task Exploration With Llms: A Study In Citation Text Generation Şahinuç Furkan, Kuznetsov Ilia, Hou Yufang, Gurevych Iryna
- Concept-aware Data Construction Improves In-context Learning Of Language Models Štefánik Michal, Kadlčík Marek, Sojka Petr
🏷 Training Techniques
- Natural Language Processing (almost) From Scratch Ronan Collobert, Jason Weston, Leon Bottou, Michael Karlen, Koray Kavukcuoglu, Pavel Kuksa
- Joint Copying And Restricted Generation For Paraphrase Cao Ziqiang, Luo Chuwei, Li Wenjie, Li Sujian
- Learning Distributed Representations Of Sentences From Unlabelled Data Felix Hill, Kyunghyun Cho, Anna Korhonen
- Google's Multilingual Neural Machine Translation System: Enabling Zero-shot Translation Johnson Melvin, Schuster Mike, Le Quoc V., Krikun Maxim, Wu Yonghui, Chen Zhifeng, Thorat Nikhil, Viégas Fernanda, Wattenberg Martin, Corrado Greg, Hughes Macduff, Dean Jeffrey
- Fully Character-level Neural Machine Translation Without Explicit Segmentation Lee Jason, Cho Kyunghyun, Hofmann Thomas
- Rationalizing Neural Predictions Lei Tao, Barzilay Regina, Jaakkola Tommi
- Image Captioning With Deep Bidirectional Lstms Wang Cheng, Yang Haojin, Bartz Christian, Meinel Christoph
- Don't Just Assume; Look And Answer: Overcoming Priors For Visual Question Answering Agrawal Aishwarya, Batra Dhruv, Parikh Devi, Kembhavi Aniruddha
- Weighted Transformer Network For Machine Translation Ahmed Karim, Keskar Nitish Shirish, Socher Richard
- Attention Is All You Need Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, Illia Polosukhin
- Lig-cristal System For The WMT17 Automatic Post-editing Task Berard Alexandre, Pietquin Olivier, Besacier Laurent
- Enabling Multi-source Neural Machine Translation By Concatenating Source Sentences In Multiple Languages Dabre Raj, Cromieres Fabien, Kurohashi Sadao
- Frustratingly Short Attention Spans In Neural Language Modeling Daniluk Michał, Rocktäschel Tim, Welbl Johannes, Riedel Sebastian
- Sockeye: A Toolkit For Neural Machine Translation Hieber Felix, Domhan Tobias, Denkowski Michael, Vilar David, Sokolov Artem, Clifton Ann, Post Matt
- Simple Recurrent Units For Highly Parallelizable Recurrence Lei Tao, Zhang Yu, Wang Sida I., Dai Hui, Artzi Yoav
- Best Of Both Worlds: Transferring Knowledge From Discriminative Learning To A Generative Visual Dialog Model Lu Jiasen, Kannan Anitha, Yang Jianwei, Parikh Devi, Batra Dhruv
- Visual Question Answering With Memory-augmented Networks Ma Chao, Shen Chunhua, Dick Anthony, Wu Qi, Wang Peng, Hengel Anton Van Den, Reid Ian
- Parlai: A Dialog Research Software Platform Miller Alexander H., Feng Will, Fisch Adam, Lu Jiasen, Batra Dhruv, Bordes Antoine, Parikh Devi, Weston Jason
- Predicting Target Language CCG Supertags Improves Neural Machine Translation Nadejde Maria, Reddy Siva, Sennrich Rico, Dwojak Tomasz, Junczys-dowmunt Marcin, Koehn Philipp, Birch Alexandra
- Shortcut-stacked Sentence Encoders For Multi-domain Inference Nie Yixin, Bansal Mohit
- A Unified Query-based Generative Model For Question Generation And Question Answering Song Linfeng, Wang Zhiguo, Hamza Wael
- Tips And Tricks For Visual Question Answering: Learnings From The 2017 Challenge Teney Damien, Anderson Peter, He Xiaodong, Hengel Anton Van Den
- Are You Talking To Me? Reasoned Visual Dialog Generation Through Adversarial Learning Wu Qi, Wang Peng, Shen Chunhua, Reid Ian, Hengel Anton Van Den
- Mojitalk: Generating Emotional Responses At Scale Zhou Xianda, Wang William Yang
- Split And Rephrase: Better Evaluation And A Stronger Baseline Aharoni Roee, Goldberg Yoav
- Multilingual Extractive Reading Comprehension By Runtime Machine Translation Asai Akari, Eriguchi Akiko, Hashimoto Kazuma, Tsuruoka Yoshimasa
- Attention Based Natural Language Grounding By Navigating Virtual Environment B Akilesh, Sinha Abhishek, Sarkar Mausoom, Krishnamurthy Balaji
- Training Deeper Neural Machine Translation Models With Transparent Attention Bapna Ankur, Chen Mia Xu, Firat Orhan, Cao Yuan, Wu Yonghui
- Behavior Analysis Of NLI Models: Uncovering The Influence Of Three Factors On Robustness Carmona Vicente Ivan Sanchez, Mitchell Jeff, Riedel Sebastian
- The Best Of Both Worlds: Combining Recent Advances In Neural Machine Translation Chen Mia Xu, Firat Orhan, Bapna Ankur, Johnson Melvin, Macherey Wolfgang, Foster George, Jones Llion, Parmar Niki, Schuster Mike, Chen Zhifeng, Wu Yonghui, Hughes Macduff
- Achieving Fluency And Coherency In Task-oriented Dialog Gangadharaiah Rashmi, Narayanaswamy Balakrishnan, Elkan Charles
- Building Sequential Inference Models For End-to-end Response Selection Gu Jia-chen, Ling Zhen-hua, Ruan Yu-ping, Liu Quan
- BERT: Pre-training Of Deep Bidirectional Transformers For Language Understanding Jacob Devlin, Ming-wei Chang, Kenton Lee, Kristina Toutanova
- Closed-book Training To Improve Summarization Encoder Memory Jiang Yichen, Bansal Mohit
- Ms-uedin Submission To The WMT2018 APE Shared Task: Dual-source Transformer For Automatic Post-editing Junczys-dowmunt Marcin, Grundkiewicz Roman
- SNU_IDS At Semeval-2018 Task 12: Sentence Encoder With Contextualized Vectors For Argument Reasoning Comprehension Kim Taeuk, Choi Jihun, Lee Sang-goo
- Neural Machine Translation With Adequacy-oriented Learning Kong Xiang, Tu Zhaopeng, Shi Shuming, Hovy Eduard, Zhang Tong
- Phrase-based Attentions Nguyen Phi Xuan, Joty Shafiq
- Polite Dialogue Generation Without Parallel Data Niu Tong, Bansal Mohit
- Like A Baby: Visually Situated Neural Language Acquisition Ororbia Alexander G., Mali Ankur, Kelly Matthew A., Reitter David
- Multimodal Explanations: Justifying Decisions And Pointing To The Evidence Park Dong Huk, Hendricks Lisa Anne, Akata Zeynep, Rohrbach Anna, Schiele Bernt, Darrell Trevor, Rohrbach Marcus
- Extending Neural Generative Conversational Model Using External Knowledge Sources Parthasarathi Prasanna, Pineau Joelle
- Contextual Parameter Generation For Universal Neural Machine Translation Platanios Emmanouil Antonios, Sachan Mrinmaya, Neubig Graham, Mitchell Tom
- Training Tips For The Transformer Model Popel Martin, Bojar Ondřej
- Cross-lingual Transfer Learning For Multilingual Task Oriented Dialog Schuster Sebastian, Gupta Sonal, Shah Rushin, Lewis Mike
- Multilingual NMT With A Language-independent Attention Bridge Vázquez Raúl, Raganato Alessandro, Tiedemann Jörg, Creutz Mathias
- Can You Tell Me How To Get Past Sesame Street? Sentence-level Pretraining Beyond Language Modeling Wang Alex, Hula Jan, Xia Patrick, Pappagari Raghavendra, Mccoy R. Thomas, Patel Roma, Kim Najoung, Tenney Ian, Huang Yinghui, Yu Katherin, Jin Shuning, Chen Berlin, Van Durme Benjamin, Grave Edouard, Pavlick Ellie, Bowman Samuel R.
- The Fine Line Between Linguistic Generalization And Failure In Seq2seq-attention Models Weber Noah, Shekhar Leena, Balasubramanian Niranjan
- Latent Topic Conversational Models Wen Tsung-hsien, Luong Minh-thang
- Dual Ask-answer Network For Machine Reading Comprehension Xiao Han, Wang Feng, Yan Jianfeng, Zheng Jingyao
- Qanet: Combining Local Convolution With Global Self-attention For Reading Comprehension Yu Adams Wei, Dohan David, Luong Minh-thang, Zhao Rui, Chen Kai, Norouzi Mohammad, Le Quoc V.
- Improved Training Of End-to-end Attention Models For Speech Recognition Zeyer Albert, Irie Kazuki, Schlüter Ralf, Ney Hermann
- Retrieval-enhanced Adversarial Training For Neural Response Generation Zhu Qingfu, Cui Lei, Zhang Weinan, Wei Furu, Liu Ting
- Abductive Reasoning As Self-supervision For Common Sense Question Answering Aakur Sathyanarayanan N., Sarkar Sudeep
- Syntactically Supervised Transformers For Faster Neural Machine Translation Akoury Nader, Krishna Kalpesh, Iyyer Mohit
- Synthetic QA Corpora Generation With Roundtrip Consistency Alberti Chris, Andor Daniel, Pitler Emily, Devlin Jacob, Collins Michael
- The Missing Ingredient In Zero-shot Neural Machine Translation Arivazhagan Naveen, Bapna Ankur, Firat Orhan, Aharoni Roee, Johnson Melvin, Macherey Wolfgang
- PLATO: Pre-trained Dialogue Generation Model With Discrete Latent Variable Bao Siqi, He Huang, Wang Fan, Wu Hua, Wang Haifeng
- Meta Answering For Machine Reading Borschinger Benjamin, Boyd-graber Jordan, Buck Christian, Bulian Jannis, Ciaramita Massimiliano, Huebscher Michelle Chen, Gajewski Wojciech, Kilcher Yannic, Nogueira Rodrigo, Saralegui Lierni Sestorain
- Language Features Matter: Effective Language Representations For Vision-language Tasks Burns Andrea, Tan Reuben, Saenko Kate, Sclaroff Stan, Plummer Bryan A.
- Attending To Entities For Better Text Understanding Cheng Pengxiang, Erk Katrin
- Cross-lingual Natural Language Generation Via Pre-training Chi Zewen, Dong Li, Wei Furu, Wang Wenhui, Mao Xian-ling, Huang Heyan
- On The Use Of BERT For Neural Machine Translation Clinchant Stéphane, Jung Kweon Woo, Nikoulina Vassilina
- Exploring The Limits Of Transfer Learning With A Unified Text-to-text Transformer Colin Raffel, Noam Shazeer, Adam Roberts, Katherine Lee, Sharan Narang, Michael Matena, Yanqi Zhou, Wei Li, Peter J. Liu
- Cross-lingual Machine Reading Comprehension Cui Yiming, Che Wanxiang, Liu Ting, Qin Bing, Wang Shijin, Hu Guoping
- Unified Language Model Pre-training For Natural Language Understanding And Generation Dong Li, Yang Nan, Wang Wenhui, Wei Furu, Liu Xiaodong, Wang Yu, Gao Jianfeng, Zhou Ming, Hon Hsiao-wuen
- Retrosynthesis With Attention-based NMT Model And Chemical Analysis Of The "wrong" Predictions Duan Hongliang, Wang Ling, Zhang Chengyun, Li Jianjun
- Parallel Scheduled Sampling Duckworth Daniel, Neelakantan Arvind, Goodrich Ben, Kaiser Lukasz, Bengio Samy
- Optibox: Breaking The Limits Of Proposals For Visual Grounding Fan Zicong, Meng Si Yi, Sigal Leonid, Little James J.
- Question-agnostic Attention For Visual Question Answering Farazi Moshiur R, Khan Salman H, Barnes Nick
- MRQA 2019 Shared Task: Evaluating Generalization In Reading Comprehension Fisch Adam, Talmor Alon, Jia Robin, Seo Minjoon, Choi Eunsol, Chen Danqi
- TANDA: Transfer And Adapt Pre-trained Transformer Models For Answer Sentence Selection Garg Siddhant, Vu Thuy, Moschitti Alessandro
- Self-attentive Model For Headline Generation Gavrilov Daniil, Kalaidin Pavel, Malykh Valentin
- Span Selection Pre-training For Question Answering Glass Michael, Gliozzo Alfio, Chakravarti Rishav, Ferritto Anthony, Pan Lin, Bhargav G P Shrivatsa, Garg Dinesh, Sil Avirup
- Reweighted Proximal Pruning For Large-scale Language Representation Guo Fu-ming, Liu Sijia, Mungall Finlay S., Lin Xue, Wang Yanzhi
- Fine-tuning By Curriculum Learning For Non-autoregressive Neural Machine Translation Guo Junliang, Tan Xu, Xu Linli, Qin Tao, Chen Enhong, Liu Tie-yan
- Resurrecting Submodularity For Neural Text Generation Han Simeng, Lin Xiang, Joty Shafiq
- Visualizing And Understanding The Effectiveness Of BERT Hao Yaru, Dong Li, Wei Furu, Xu Ke
- Analyzing The Forgetting Problem In The Pretrain-finetuning Of Dialogue Response Models He Tianxing, Liu Jun, Cho Kyunghyun, Ott Myle, Liu Bing, Glass James, Peng Fuchun
- Exposure Bias Versus Self-recovery: Are Distortions Really Incremental For Autoregressive Text Generation? He Tianxing, Zhang Jingzhao, Zhou Zhiming, Glass James
- Convert: Efficient And Accurate Conversational Representations From Transformers Henderson Matthew, Casanueva Iñigo, Mrkšić Nikola, Su Pei-hao, Wen Tsung-hsien, Vulić Ivan
- Efficient Adaptation Of Pretrained Transformers For Abstractive Summarization Hoang Andrew, Bosselut Antoine, Celikyilmaz Asli, Choi Yejin
- Evaluating Rewards For Question Generation Models Hosking Tom, Riedel Sebastian
- A Corpus-free State2seq User Simulator For Task-oriented Dialogue Hou Yutai, Fang Meng, Che Wanxiang, Liu Ting
- Large-scale Representation Learning From Visually Grounded Untranscribed Speech Ilharco Gabriel, Zhang Yuan, Baldridge Jason
- Tinybert: Distilling BERT For Natural Language Understanding Jiao Xiaoqi, Yin Yichun, Shang Lifeng, Jiang Xin, Chen Xiao, Li Linlin, Wang Fang, Liu Qun
- All-in-one Image-grounded Conversational Agents Ju Da, Shuster Kurt, Boureau Y-lan, Weston Jason
- Microsoft Translator At WMT 2019: Towards Large-scale Document-level Neural Machine Translation Junczys-dowmunt Marcin
- Learning And Evaluating Contextual Embedding Of Source Code Kanade Aditya, Maniatis Petros, Balakrishnan Gogul, Shi Kensen
- Weak Supervision Helps Emergence Of Word-object Alignment And Improves Vision-language Tasks Kervadec Corentin, Antipov Grigory, Baccouche Moez, Wolf Christian
- CTRL: A Conditional Transformer Language Model For Controllable Generation Keskar Nitish Shirish, Mccann Bryan, Varshney Lav R., Xiong Caiming, Socher Richard
- Sample Efficient Text Summarization Using A Single Pre-trained Transformer Khandelwal Urvashi, Clark Kevin, Jurafsky Dan, Kaiser Lukasz
- A Case Study: Exploiting Neural Machine Translation To Translate CUDA To Opencl Kim Yonghae, Kim Hyesoon
- Learning To Answer By Learning To Ask: Getting The Best Of GPT-2 And BERT Worlds Klein Tassilo, Nabi Moin
- Investigating Multilingual NMT Representations At Scale Kudugunta Sneha Reddy, Bapna Ankur, Caswell Isaac, Arivazhagan Naveen, Firat Orhan
- ALBERT: A Lite BERT For Self-supervised Learning Of Language Representations Lan Zhenzhong, Chen Mingda, Goodman Sebastian, Gimpel Kevin, Sharma Piyush, Soricut Radu
- What Would Elsa Do? Freezing Layers During Transformer Fine-tuning Lee Jaejun, Tang Raphael, Lin Jimmy
- Patent Claim Generation By Fine-tuning Openai GPT-2 Lee Jieh-sheng, Hsiang Jieh
- BART: Denoising Sequence-to-sequence Pre-training For Natural Language Generation, Translation, And Comprehension Lewis Mike, Liu Yinhan, Goyal Naman, Ghazvininejad Marjan, Mohamed Abdelrahman, Levy Omer, Stoyanov Ves, Zettlemoyer Luke
- Bertsel: Answer Selection With Pre-trained Models Li Dongfang, Yu Yifei, Chen Qingcai, Li Xinyu
- Unicoder-vl: A Universal Encoder For Vision And Language By Cross-modal Pre-training Li Gen, Duan Nan, Fang Yuejian, Gong Ming, Jiang Daxin, Zhou Ming
- Pretrained Language Models For Document-level Neural Machine Translation Li Liangyou, Jiang Xin, Liu Qun
- Visualbert: A Simple And Performant Baseline For Vision And Language Li Liunian Harold, Yatskar Mark, Yin Da, Hsieh Cho-jui, Chang Kai-wei
- Relevance-promoting Language Model For Short-text Conversation Li Xin, Li Piji, Bi Wei, Liu Xiaojiang, Lam Wai
- Probing Representations Learned By Multimodal Recurrent And Transformer Models Libovický Jindřich, Madhyastha Pranava
- Learning To Generate Questions By Learning What Not To Generate Liu Bang, Zhao Mingjun, Niu Di, Lai Kunfeng, He Yancheng, Wei Haojie, Xu Yu
- Roberta: A Robustly Optimized BERT Pretraining Approach Liu Yinhan, Ott Myle, Goyal Naman, Du Jingfei, Joshi Mandar, Chen Danqi, Levy Omer, Lewis Mike, Zettlemoyer Luke, Stoyanov Veselin
- Attention-informed Mixed-language Training For Zero-shot Cross-lingual Task-oriented Dialogue Systems Liu Zihan, Winata Genta Indra, Lin Zhaojiang, Xu Peng, Fung Pascale
- Multi-hop Reading Comprehension Via Deep Reinforcement Learning Based Document Traversal Long Alex, Mason Joel, Blair Alan, Wang Wei
- Vilbert: Pretraining Task-agnostic Visiolinguistic Representations For Vision-and-language Tasks Lu Jiasen, Batra Dhruv, Parikh Devi, Lee Stefan
- Good, Better, Best: Textual Distractors Generation For Multiple-choice Visual Question Answering Via Reinforcement Learning Lu Jiaying, Ye Xin, Ren Yi, Yang Yezhou
- Emergent Properties Of Finetuned Language Representation Models Matton Alexandre, De Oliveira Luke
- Structured Pruning Of A Bert-based Question Answering Model Mccarley J. S., Chakravarti Rishav, Sil Avirup
- Self-attentional Models Application In Task-oriented Dialogue Generation Systems Mehrjardi Mansour Saffar, Trabelsi Amine, Zaiane Osmar R.
- Megatron-lm: Training Multi-billion Parameter Language Models Using Model Parallelism Mohammad Shoeybi, Mostofa Patwary, Raul Puri, Patrick Legresley, Jared Casper, Bryan Catanzaro
- Human Vs. Muppet: A Conservative Estimate Of Human Performance On The GLUE Benchmark Nangia Nikita, Bowman Samuel R.
- Let's Ask Again: Refine Network For Automatic Question Generation Nema Preksha, Mohankumar Akash Kumar, Khapra Mitesh M., Srinivasan Balaji Vasan, Ravindran Balaraman
- A Simple But Effective Method To Incorporate Multi-turn Context With BERT For Conversational Machine Comprehension Ohsugi Yasuhito, Saito Itsumi, Nishida Kyosuke, Asano Hisako, Tomita Junji
- Dlgnet: A Transformer-based Model For Dialogue Response Generation Olabiyi Oluwatobi, Mueller Erik T.
- Adversarial Bootstrapping For Dialogue Model Training Olabiyi Oluwatobi, Mueller Erik T., Larson Christopher, Lahlou Tarek
- Thisiscompetition At Semeval-2019 Task 9: BERT Is Unstable For Out-of-domain Samples Park Cheoneum, Kim Juae, Lee Hyeon-gu, Amplayo Reinald Kim, Kim Harksoo, Seo Jungyun, Lee Changki
- Text Generation With Exemplar-based Adaptive Decoding Peng Hao, Parikh Ankur P., Faruqui Manaal, Dhingra Bhuwan, Das Dipanjan
- Language Models As Knowledge Bases? Petroni Fabio, Rocktäschel Tim, Lewis Patrick, Bakhtin Anton, Wu Yuxiang, Miller Alexander H., Riedel Sebastian
- Learning To Deceive With Attention-based Explanations Pruthi Danish, Gupta Mansi, Dhingra Bhuwan, Neubig Graham, Lipton Zachary C.
- Blockwise Self-attention For Long Document Understanding Qiu Jiezhong, Ma Hao, Levy Omer, Yih Scott Wen-tau, Wang Sinong, Tang Jie
- On Compositionality In Neural Machine Translation Raunak Vikas, Kumar Vaibhav, Metze Florian
- Probing Natural Language Inference Models Through Semantic Fragments Richardson Kyle, Hu Hai, Moss Lawrence S., Sabharwal Ashish
- Leveraging Pre-trained Checkpoints For Sequence Generation Tasks Rothe Sascha, Narayan Shashi, Severyn Aliaksei
- Hierarchical Reinforcement Learning For Open-domain Dialog Saleh Abdelrhman, Jaques Natasha, Ghandeharioun Asma, Shen Judy Hanwen, Picard Rosalind
- Zero: Memory Optimizations Toward Training Trillion Parameter Models Samyam Rajbhandari, Jeff Rasley, Olatunji Ruwase, Yuxiong He
- Distilbert, A Distilled Version Of BERT: Smaller, Faster, Cheaper And Lighter Sanh Victor, Debut Lysandre, Chaumond Julien, Wolf Thomas
- Generalization In Generation: A Closer Look At Exposure Bias Schmidt Florian
- Inducing Brain-relevant Bias In Natural Language Processing Models Schwartz Dan, Toneva Mariya, Wehbe Leila
- Taking A HINT: Leveraging Explanations To Make Vision And Language Models More Grounded Selvaraju Ramprasaath R., Lee Stefan, Shen Yilin, Jin Hongxia, Ghosh Shalini, Heck Larry, Batra Dhruv, Parikh Devi
- What Does BERT Learn From Multiple-choice Reading Comprehension Datasets? Si Chenglei, Wang Shuohang, Kan Min-yen, Jiang Jing
- Multilingual Question Answering From Formatted Text Applied To Conversational Agents Siblini Wissam, Pasqual Charlotte, Lavielle Axel, Challal Mohamed, Cauchois Cyril
- MASS: Masked Sequence To Sequence Pre-training For Language Generation Song Kaitao, Tan Xu, Qin Tao, Lu Jianfeng, Liu Tie-yan
- Unpaired Cross-lingual Image Caption Generation With Self-supervised Rewards Song Yuqing, Chen Shizhe, Zhao Yida, Jin Qin
- Natural Language Generation Using Reinforcement Learning With External Rewards Srinivasan Vidhushini, Santhanam Sashank, Shaikh Samira
- On NMT Search Errors And Model Errors: Cat Got Your Tongue? Stahlberg Felix, Byrne Bill
- CUED@WMT19:EWC&LMs Stahlberg Felix, Saunders Danielle, De Gispert Adria, Byrne Bill
- Insertion Transformer: Flexible Sequence Generation Via Insertion Operations Stern Mitchell, Chan William, Kiros Jamie, Uszkoreit Jakob
- Sesamebert: Attention For Anywhere Su Ta-chun, Cheng Hsiang-chih
- VL-BERT: Pre-training Of Generic Visual-linguistic Representations Su Weijie, Zhu Xizhou, Cao Yue, Li Bin, Lu Lewei, Wei Furu, Dai Jifeng
- Syntax-infused Transformer And BERT Models For Machine Translation And Natural Language Understanding Sundararaman Dhanasekar, Subramanian Vivek, Wang Guoyin, Si Shijing, Shen Dinghan, Wang Dong, Carin Lawrence
- Olmpics -- On What Language Model Pre-training Captures Talmor Alon, Elazar Yanai, Goldberg Yoav, Berant Jonathan
- LXMERT: Learning Cross-modality Encoder Representations From Transformers Tan Hao, Bansal Mohit
- Generating Token-level Explanations For Natural Language Inference Thorne James, Vlachos Andreas, Christodoulopoulos Christos, Mittal Arpit
- Well-read Students Learn Better: On The Importance Of Pre-training Compact Models Turc Iulia, Chang Ming-wei, Lee Kenton, Toutanova Kristina
- How Does BERT Answer Questions? A Layer-wise Analysis Of Transformer Representations Van Aken Betty, Winter Benjamin, Löser Alexander, Gers Felix A.
- Quantity Doesn't Buy Quality Syntax With Neural Language Models Van Schijndel Marten, Mueller Aaron, Linzen Tal
- Training Optimus Prime, M.D.: Generating Medical Certification Items By Fine-tuning Openai's Gpt2 Transformer Model Von Davier Matthias
- Structbert: Incorporating Language Structures Into Pre-training For Deep Language Understanding Wang Wei, Bi Bin, Yan Ming, Wu Chen, Bao Zuyi, Xia Jiangnan, Peng Liwei, Si Luo
- Tree Transformer: Integrating Tree Structures Into Self-attention Wang Yau-shian, Lee Hung-yi, Chen Yun-nung
- Structured Pruning Of Large Language Models Wang Ziheng, Wohlwend Jeremy, Lei Tao
- Adapting And Evaluating A Deep Learning Language Model For Clinical Why-question Answering Wen Andrew, Elwazir Mohamed Y., Moon Sungrim, Fan Jungwei
- Attention Is Not Not Explanation Wiegreffe Sarah, Pinter Yuval
- Huggingface's Transformers: State-of-the-art Natural Language Processing Wolf Thomas, Debut Lysandre, Sanh Victor, Chaumond Julien, Delangue Clement, Moi Anthony, Cistac Pierric, Rault Tim, Louf Rémi, Funtowicz Morgan, Davison Joe, Shleifer Sam, Von Platen Patrick, Ma Clara, Jernite Yacine, Plu Julien, Xu Canwen, Scao Teven Le, Gugger Sylvain, Drame Mariama, Lhoest Quentin, Rush Alexander M.
- Transfertransfo: A Transfer Learning Approach For Neural Network Based Conversational Agents Wolf Thomas, Sanh Victor, Chaumond Julien, Delangue Clement
- Integrated Triaging For Fast Reading Comprehension Wu Felix, Li Boyi, Wang Lequn, Lao Ni, Blitzer John, Weinberger Kilian Q.
- Review Conversational Reading Comprehension Xu Hu, Liu Bing, Shu Lei, Yu Philip S.
- Towards Making The Most Of BERT In Neural Machine Translation Yang Jiacheng, Wang Mingxuan, Zhou Hao, Zhao Chengqi, Yu Yong, Zhang Weinan, Li Lei
- Deepening Hidden Representations From Pre-trained Language Models Yang Junjie, Zhao Hai
- Making History Matter: History-advantage Sequence Training For Visual Dialog Yang Tianhao, Zha Zheng-jun, Zhang Hanwang
- Data Augmentation For BERT Fine-tuning In Open-domain Question Answering Yang Wei, Xie Yuqing, Tan Luchen, Xiong Kun, Li Ming, Lin Jimmy
- Model Compression With Two-stage Multi-teacher Knowledge Distillation For Web Question Answering System Yang Ze, Shou Linjun, Gong Ming, Lin Wutao, Jiang Daxin
- Q8BERT: Quantized 8bit BERT Zafrir Ofir, Boudoukh Guy, Izsak Peter, Wasserblat Moshe
- Extended Answer And Uncertainty Aware Neural Question Generation Zeng Hongwei, Zhi Zhuo, Liu Jun, Wei Bifan
- PEGASUS: Pre-training With Extracted Gap-sentences For Abstractive Summarization Zhang Jingqing, Zhao Yao, Saleh Mohammad, Liu Peter J.
- Addressing Semantic Drift In Question Generation For Semi-supervised Question Answering Zhang Shiyue, Bansal Mohit
- Sequence-to-sequence Pre-training With Data Augmentation For Sentence Rewriting Zhang Yi, Ge Tao, Wei Furu, Zhou Ming, Sun Xu
- Consistent Dialogue Generation With Self-supervised Feature Learning Zhang Yizhe, Gao Xiang, Lee Sungjin, Brockett Chris, Galley Michel, Gao Jianfeng, Dolan Bill
- Dialogpt: Large-scale Generative Pre-training For Conversational Response Generation Zhang Yizhe, Sun Siqi, Galley Michel, Chen Yen-chun, Brockett Chris, Gao Xiang, Gao Jianfeng, Liu Jingjing, Dolan Bill
- Semantics-aware BERT For Language Understanding Zhang Zhuosheng, Wu Yuwei, Zhao Hai, Li Zuchao, Zhang Shuailiang, Zhou Xi, Zhou Xiang
- Extremely Small BERT Models From Mixed-vocabulary Training Zhao Sanqiang, Gupta Raghav, Song Yang, Zhou Denny
- A Pre-training Based Personalized Dialogue Generation Model With Persona-sparse Data Zheng Yinhe, Zhang Rongsheng, Mao Xiaoxi, Huang Minlie
- Unified Vision-language Pre-training For Image Captioning And VQA Zhou Luowei, Palangi Hamid, Zhang Lei, Hu Houdong, Corso Jason J., Gao Jianfeng
- Vision-language Navigation With Self-supervised Auxiliary Reasoning Tasks Zhu Fengda, Zhu Yi, Chang Xiaojun, Liang Xiaodan
- How Much Knowledge Can You Pack Into The Parameters Of A Language Model? Roberts Adam, Raffel Colin, Shazeer Noam
- Optimizing Transformer For Low-resource Neural Machine Translation Araabi Ali, Monz Christof
- Stress Test Evaluation Of Transformer-based Models In Natural Language Understanding Tasks Aspillaga Carlos, Carvallo Andrés, Araujo Vladimir
- Fluent Response Generation For Conversational Question Answering Baheti Ashutosh, Ritter Alan, Small Kevin
- Segatron: Segment-aware Transformer For Language Modeling And Understanding Bai He, Shi Peng, Lin Jimmy, Xie Yuqing, Tan Luchen, Xiong Kun, Gao Wen, Li Ming
- Residual Energy-based Models For Text Bakhtin Anton, Deng Yuntian, Gross Sam, Ott Myle, Ranzato Marc'aurelio, Szlam Arthur
- Character-level Transformer-based Neural Machine Translation Banar Nikolay, Daelemans Walter, Kestemont Mike
- Unilmv2: Pseudo-masked Language Models For Unified Language Model Pre-training Bao Hangbo, Dong Li, Wei Furu, Wang Wenhui, Yang Nan, Liu Xiaodong, Wang Yu, Piao Songhao, Gao Jianfeng, Zhou Ming, Hon Hsiao-wuen
- PALM: Pre-training An Autoencoding&autoregressive Language Model For Context-conditioned Generation Bi Bin, Li Chenliang, Wu Chen, Yan Ming, Wang Wei, Huang Songfang, Huang Fei, Si Luo
- Chatbot Interaction With Artificial Intelligence: Human Data Augmentation With T5 And Language Transformer Ensemble For Text Classification Bird Jordan J., Ekárt Anikó, Faria Diego R.
- Language Models Are Few-shot Learners Brown Tom B., Mann Benjamin, Ryder Nick, Subbiah Melanie, Kaplan Jared, Dhariwal Prafulla, Neelakantan Arvind, Shyam Pranav, Sastry Girish, Askell Amanda, Agarwal Sandhini, Herbert-voss Ariel, Krueger Gretchen, Henighan Tom, Child Rewon, Ramesh Aditya, Ziegler Daniel M., Wu Jeffrey, Winter Clemens, Hesse Christopher, Chen Mark, Sigler Eric, Litwin Mateusz, Gray Scott, Chess Benjamin, Clark Jack, Berner Christopher, Mccandlish Sam, Radford Alec, Sutskever Ilya, Amodei Dario
- DIET: Lightweight Language Understanding For Dialogue Systems Bunk Tanja, Varshneya Daksh, Vlasov Vladimir, Nichol Alan
- Behind The Scene: Revealing The Secrets Of Pre-trained Vision-and-language Models Cao Jize, Gan Zhe, Cheng Yu, Yu Licheng, Chen Yen-chun, Liu Jingjing
- Simultaneous Paraphrasing And Translation By Fine-tuning Transformer Models Chada Rakesh
- Reading Between The Lines: Exploring Infilling In Visual Narratives Chandu Khyathi Raghavi, Dong Ruo-ping, Black Alan
- Dipair: Fast And Accurate Distillation For Trillion-scale Text Matching And Pair Modeling Chen Jiecao, Yang Liu, Raman Karthik, Bendersky Michael, Yeh Jung-jung, Zhou Yun, Najork Marc, Cai Danyang, Emadzadeh Ehsan
- Logical Natural Language Generation From Open-domain Tables Chen Wenhu, Chen Jianshu, Su Yu, Chen Zhiyu, Wang William Yang
- Model Selection For Cross-lingual Transfer Chen Yang, Ritter Alan
- Improving BERT With Self-supervised Attention Chen Yiren, Kou Xiaoyu, Bai Jiangang, Tong Yunhai
- Comparative Study Of Language Models On Cross-domain Data With Model Agnostic Explainability Chhipa Mayank, Vazurkar Hrushikesh Mahesh, Kumar Abhijeet, Mishra Mridul
- X-LXMERT: Paint, Caption And Answer Questions With Multi-modal Transformers Cho Jaemin, Lu Jiasen, Schwenk Dustin, Hajishirzi Hannaneh, Kembhavi Aniruddha
- Rethinking Embedding Coupling In Pre-trained Language Models Chung Hyung Won, Févry Thibault, Tsai Henry, Johnson Melvin, Ruder Sebastian
- Transformers As Soft Reasoners Over Language Clark Peter, Tafjord Oyvind, Richardson Kyle
- Probing Emergent Semantics In Predictive Agents Via Question Answering Das Abhishek, Carnevale Federico, Merzic Hamza, Rimell Laura, Schneider Rosalia, Abramson Josh, Hung Alden, Ahuja Arun, Clark Stephen, Wayne Gregory, Hill Felix
- Residual Energy-based Models For Text Generation Deng Yuntian, Bakhtin Anton, Ott Myle, Szlam Arthur, Ranzato Marc'aurelio
- Cascaded Text Generation With Markov Transformers Deng Yuntian, Rush Alexander M.
- EFSG: Evolutionary Fooling Sentences Generator Di Giovanni Marco, Brambilla Marco
- Ernie-doc: A Retrospective Long-document Modeling Transformer Ding Siyu, Shang Junyuan, Wang Shuohuan, Sun Yu, Tian Hao, Wu Hua, Wang Haifeng
- Gshard: Scaling Giant Models With Conditional Computation And Automatic Sharding Lepikhin Dmitry, Lee Hyoukjoong, Xu Yuanzhong, Chen Dehao, Firat Orhan, Huang Yanping, Krikun Maxim, Shazeer Noam, Chen Zhifeng
- Fine-tuning Pretrained Language Models: Weight Initializations, Data Orders, And Early Stopping Dodge Jesse, Ilharco Gabriel, Schwartz Roy, Farhadi Ali, Hajishirzi Hannaneh, Smith Noah
- Training Multilingual Machine Translation By Alternately Freezing Language-specific Encoders-decoders Escolano Carlos, Costa-jussà Marta R., Fonollosa José A. R., Artetxe Mikel
- CERT: Contrastive Self-supervised Learning For Language Understanding Fang Hongchao, Wang Sicheng, Zhou Meng, Ding Jiayuan, Xie Pengtao
- App-aware Response Synthesis For User Reviews Farooq Umar, Siddique A. B., Jamour Fuad, Zhao Zhijia, Hristidis Vagelis
- The Pile: An 800GB Dataset Of Diverse Text For Language Modeling Gao Leo, Biderman Stella, Black Sid, Golding Laurence, Hoppe Travis, Foster Charles, Phang Jason, He Horace, Thite Anish, Nabeshima Noa, Presser Shawn, Leahy Connor
- ISAAQ -- Mastering Textbook Questions With Pre-trained Transformers And Bottom-up And Top-down Attention Gomez-perez Jose Manuel, Ortega Raul
- It's Not Greek To Mbert: Inducing Word-level Translations From Multilingual BERT Gonen Hila, Ravfogel Shauli, Elazar Yanai, Goldberg Yoav
- Are Neural Open-domain Dialog Systems Robust To Speech Recognition Errors In The Dialog History? An Empirical Study Gopalakrishnan Karthik, Hedayatnia Behnam, Wang Longshaokan, Liu Yang, Hakkani-tur Dilek
- A Tailored Pre-training Model For Task-oriented Dialog Generation Gu Jing, Wu Qingyang, Wu Chongruo, Shi Weiyan, Yu Zhou
- Token-level Adaptive Training For Neural Machine Translation Gu Shuhao, Zhang Jinchao, Meng Fandong, Feng Yang, Xie Wanying, Zhou Jie, Yu Dong
- Dialogbert: Discourse-aware Response Generation Via Learning To Recover And Rank Utterances Gu Xiaodong, Yoo Kang Min, Ha Jung-woo
- Sequence-level Mixed Sample Data Augmentation Guo Demi, Kim Yoon, Rush Alexander M.
- Incorporating BERT Into Parallel Sequence Decoding With Adapters Guo Junliang, Zhang Zhirui, Xu Linli, Wei Hao-ran, Chen Boxing, Chen Enhong
- Fact-aware Sentence Split And Rephrase With Permutation Invariant Training Guo Yinuo, Ge Tao, Wei Furu
- BERT Based Multilingual Machine Comprehension In English And Hindi Gupta Somil, Khade Nilesh
- Improvement Of A Dedicated Model For Open Domain Persona-aware Dialogue Generation Han Qiang
- ECONET: Effective Continual Pretraining Of Language Models For Event Temporal Reasoning Han Rujun, Ren Xiang, Peng Nanyun
- Towards Learning A Generic Agent For Vision-and-language Navigation Via Pre-training Hao Weituo, Li Chunyuan, Li Xiujun, Carin Lawrence, Gao Jianfeng
- Enriched Pre-trained Transformers For Joint Slot Filling And Intent Detection Hardalov Momchil, Koychev Ivan, Nakov Preslav
- Investigating Representations Of Verb Bias In Neural Language Models Hawkins Robert D., Yamakoshi Takateru, Griffiths Thomas L., Goldberg Adele E.
- Pretrained Transformers Improve Out-of-distribution Robustness Hendrycks Dan, Liu Xiaoyuan, Wallace Eric, Dziedzic Adam, Krishnan Rishabh, Song Dawn
- Sub-instruction Aware Vision-and-language Navigation Hong Yicong, Rodriguez-opazo Cristian, Wu Qi, Gould Stephen
- Less Is More: Data-efficient Complex Question Answering Over Knowledge Bases Hua Yuncheng, Li Yuan-fang, Qi Guilin, Wu Wei, Zhang Jingyao, Qi Daiqing
- Exploring BERT Parameter Efficiency On The Stanford Question Answering Dataset V2.0 Hulburd Eric
- Contrastive Code Representation Learning Jain Paras, Jain Ajay, Zhang Tianjun, Abbeel Pieter, Gonzalez Joseph E., Stoica Ion
- Scaling Laws For Neural Language Models Kaplan Jared, Mccandlish Sam, Henighan Tom, Brown Tom B., Chess Benjamin, Child Rewon, Gray Scott, Radford Alec, Wu Jeffrey, Amodei Dario
- Are Natural Language Inference Models Imppressive? Learning Implicature And Presupposition Jeretic Paloma, Warstadt Alex, Bhooshan Suvrat, Williams Adina
- In Defense Of Grid Features For Visual Question Answering Jiang Huaizu, Misra Ishan, Rohrbach Marcus, Learned-miller Erik, Chen Xinlei
- TLDR: Token Loss Dynamic Reweighting For Reducing Repetitive Utterance Generation Jiang Shaojie, Wolf Thomas, Monz Christof, De Rijke Maarten
- How Can We Know When Language Models Know? On The Calibration Of Language Models For Question Answering Jiang Zhengbao, Araki Jun, Ding Haibo, Neubig Graham
- Text-to-text Pre-training For Data-to-text Tasks Kale Mihir, Rastogi Abhinav
- Unsupervised Bitext Mining And Translation Via Self-trained Contextual Embeddings Keung Phillip, Salazar Julian, Lu Yichao, Smith Noah A.
- Nearest Neighbor Machine Translation Khandelwal Urvashi, Fan Angela, Jurafsky Dan, Zettlemoyer Luke, Lewis Mike
- SMRT Chatbots: Improving Non-task-oriented Dialog With Simulated Multiple Reference Training Khayrallah Huda, Sedoc João
- Current Limitations Of Language Models: What You Need Is Retrieval Komatsuzaki Aran
- Syntactic Structure Distillation Pretraining For Bidirectional Encoders Kuncoro Adhiguna, Kong Lingpeng, Fried Daniel, Yogatama Dani, Rimell Laura, Dyer Chris, Blunsom Phil
- Fid-ex: Improving Sequence-to-sequence Models For Extractive Rationale Generation Lakhotia Kushal, Paranjape Bhargavi, Ghoshal Asish, Yih Wen-tau, Mehdad Yashar, Iyer Srinivasan
- Utilizing Bidirectional Encoder Representations From Transformers For Answer Selection Laskar Md Tahmid Rahman, Hoque Enamul, Huang Jimmy Xiangji
- Common Sense Or World Knowledge? Investigating Adapter-based Knowledge Injection Into Pretrained Transformers Lauscher Anne, Majewska Olga, Ribeiro Leonardo F. R., Gurevych Iryna, Rozanov Nikolai, Glavaš Goran
- From Zero To Hero: On The Limitations Of Zero-shot Cross-lingual Transfer With Multilingual Transformers Lauscher Anne, Ravishankar Vinit, Vulić Ivan, Glavaš Goran
- SLM: Learning A Discourse Language Representation With Sentence Unshuffling Lee Haejun, Hudson Drew A., Lee Kangwook, Manning Christopher D.
- Linguistically-informed Transformations (LIT): A Method For Automatically Generating Contrast Sets Li Chuanrong, Lin Shengshuo, Liu Leo Z., Wu Xinyi, Zhou Xuhui, Steinert-threlkeld Shane
- Optimus: Organizing Sentences Via Pre-trained Modeling Of A Latent Space Li Chunyuan, Gao Xiang, Li Yuan, Peng Baolin, Li Xiujun, Zhang Yizhe, Gao Jianfeng
- Dialogue-adaptive Language Model Pre-training From Quality Estimation Li Junlong, Zhang Zhuosheng, Zhao Hai
- Seqgensql -- A Robust Sequence Generation Model For Structured Query Language Li Ning, Keller Bethany, Butler Mark, Cer Daniel
- An Empirical Investigation Of Pre-trained Transformer Language Models For Open-domain Dialogue Generation Li Piji
- Multilingual Speech Translation With Efficient Finetuning Of Pretrained Models Li Xian, Wang Changhan, Tang Yun, Tran Chau, Tang Yuqing, Pino Juan, Baevski Alexei, Conneau Alexis, Auli Michael
- Enhancing Dialogue Generation Via Multi-level Contrastive Learning Li Xin, Li Piji, Wang Yan, Liu Xiaojiang, Lam Wai
- Mapping Natural Language Instructions To Mobile UI Action Sequences Li Yang, He Jiacong, Zhou Xin, Zhang Yuan, Baldridge Jason
- Rewriter-evaluator Architecture For Neural Machine Translation Li Yangming, Yao Kaisheng
- Train Large, Then Compress: Rethinking Model Size For Efficient Training And Inference Of Transformers Li Zhuohan, Wallace Eric, Shen Sheng, Lin Kevin, Keutzer Kurt, Klein Dan, Gonzalez Joseph E.
- LRTA: A Transparent Neural-symbolic Reasoning Framework With Modular Supervision For Visual Question Answering Liang Weixin, Niu Feiyang, Reganti Aishwarya, Thattai Govind, Tur Gokhan
- XGLUE: A New Benchmark Dataset For Cross-lingual Pre-training, Understanding And Generation Liang Yaobo, Duan Nan, Gong Yeyun, Wu Ning, Guo Fenfei, Qi Weizhen, Gong Ming, Shou Linjun, Jiang Daxin, Cao Guihong, Fan Xiaodong, Zhang Ruofei, Agrawal Rahul, Cui Edward, Wei Sining, Bharti Taroon, Qiao Ying, Chen Jiun-hung, Wu Winnie, Liu Shuguang, Yang Fan, Campos Daniel, Majumder Rangan, Zhou Ming
- Multi-node BERT-pretraining: Cost-efficient Approach Lin Jiahuang, Li Xin, Pekhimenko Gennady
- Readonce Transformers: Reusable Representations Of Text For Transformers Lin Shih-ting, Sabharwal Ashish, Khot Tushar
- Mintl: Minimalist Transfer Learning For Task-oriented Dialogue Systems Lin Zhaojiang, Madotto Andrea, Winata Genta Indra, Fung Pascale
- Variational Transformers For Diverse Response Generation Lin Zhaojiang, Winata Genta Indra, Xu Peng, Liu Zihan, Fung Pascale
- Mt5: A Massively Multilingual Pre-trained Text-to-text Transformer Xue Linting, Constant Noah, Roberts Adam, Kale Mihir, Al-rfou Rami, Siddhant Aditya, Barua Aditya, Raffel Colin
- TIME: Text And Image Mutual-translation Adversarial Networks Liu Bingchen, Song Kunpeng, Zhu Yizhe, De Melo Gerard, Elgammal Ahmed
- Improving Zero-shot Translation By Disentangling Positional Information Liu Danni, Niehues Jan, Cross James, Guzmán Francisco, Li Xian
- Task-level Curriculum Learning For Non-autoregressive Neural Machine Translation Liu Jinglin, Ren Yi, Tan Xu, Zhang Chen, Qin Tao, Zhao Zhou, Liu Tie-yan
- Understanding The Difficulty Of Training Transformers Liu Liyuan, Liu Xiaodong, Gao Jianfeng, Chen Weizhu, Han Jiawei
- Adversarial Training For Large Neural Language Models Liu Xiaodong, Cheng Hao, He Pengcheng, Chen Weizhu, Wang Yu, Poon Hoifung, Gao Jianfeng
- Very Deep Transformers For Neural Machine Translation Liu Xiaodong, Duh Kevin, Liu Liyuan, Gao Jianfeng
- Lite Training Strategies For Portuguese-english And English-portuguese Translation Lopes Alexandre, Nogueira Rodrigo, Lotufo Roberto, Pedrini Helio
- VECO: Variable And Flexible Cross-lingual Pre-training For Language Understanding And Generation Luo Fuli, Wang Wei, Liu Jiahao, Liu Yijia, Bi Bin, Huang Songfang, Huang Fei, Si Luo
- Charbert: Character-aware Pre-trained Language Model Ma Wentao, Cui Yiming, Si Chenglei, Liu Ting, Wang Shijin, Hu Guoping
- Language Models As Few-shot Learner For Task-oriented Dialogue Systems Madotto Andrea, Liu Zihan, Lin Zhaojiang, Fung Pascale
- Incremental Processing In The Age Of Non-incremental Encoders: An Empirical Assessment Of Bidirectional Models For Incremental NLU Madureira Brielen, Schlangen David
- Gpt-too: A Language-model-first Approach For Amr-to-text Generation Mager Manuel, Astudillo Ramon Fernandez, Naseem Tahira, Sultan Md Arafat, Lee Young-suk, Florian Radu, Roukos Salim
- Verb Knowledge Injection For Multilingual Event Processing Majewska Olga, Vulić Ivan, Glavaš Goran, Ponti Edoardo M., Korhonen Anna
- Recobert: A Catalog Language Model For Text-based Recommendations Malkiel Itzik, Barkan Oren, Caciularu Avi, Razin Noam, Katz Ori, Koenigstein Noam
- Capturing Document Context Inside Sentence-level Neural Machine Translation Models With Self-training Mansimov Elman, Melis Gábor, Yu Lei
- Ladabert: Lightweight Adaptation Of BERT Through Hybrid Model Compression Mao Yihuan, Wang Yujing, Wu Chufan, Zhang Chen, Wang Yang, Yang Yaming, Zhang Quanlu, Tong Yunhai, Bai Jing
- Dialoglue: A Natural Language Understanding Benchmark For Task-oriented Dialogue Mehri Shikib, Eric Mihail, Hakkani-tur Dilek
- Generation-distillation For Efficient Natural Language Understanding In Low-data Settings Melas-kyriazi Luke, Han George, Liang Celine
- What Happens To BERT Embeddings During Fine-tuning? Merchant Amil, Rahimtoroghi Elahe, Pavlick Ellie, Tenney Ian
- ADER: Adaptively Distilled Exemplar Replay Towards Continual Learning For Session-based Recommendation Mi Fei, Lin Xiaoyu, Faltings Boi
- Lambert: Language And Action Learning Using Multimodal BERT Miyazawa Kazuki, Aoki Tatsuya, Horii Takato, Nagai Takayuki
- On The Stability Of Fine-tuning BERT: Misconceptions, Explanations, And Strong Baselines Mosbach Marius, Andriushchenko Maksym, Klakow Dietrich
- Axformer: Accuracy-driven Approximation Of Transformers For Faster, Smaller And More Accurate NLP Models Nagarajan Amrit, Sen Sanchari, Stevens Jacob R., Raghunathan Anand
- Avgout: A Simple Output-probability Measure To Eliminate Dull Responses Niu Tong, Bansal Mohit
- The Chess Transformer: Mastering Play Using Generative Language Models Noever David, Ciolino Matt, Kalin Josh
- Facts2story: Controlling Text Generation By Key Facts Orbach Eyal, Goldberg Yoav
- Multilingual BERT Post-pretraining Alignment Pan Lin, Hang Chung-wei, Qi Haode, Shah Abhishek, Potdar Saloni, Yu Mo
- On Task-level Dialogue Composition Of Generative Transformer Model Parthasarathi Prasanna, Neelakantan Arvind, Narang Sharan
- Retrieval-augmented Generation For Knowledge-intensive NLP Tasks Lewis Patrick, Perez Ethan, Piktus Aleksandra, Petroni Fabio, Karpukhin Vladimir, Goyal Naman, Küttler Heinrich, Lewis Mike, Yih Wen-tau, Rocktäschel Tim, Riedel Sebastian, Kiela Douwe
- SOLOIST: Building Task Bots At Scale With Transfer Learning And Machine Teaching Peng Baolin, Li Chunyuan, Li Jinchao, Shayandeh Shahin, Liden Lars, Gao Jianfeng
- What Does BERT Know About Books, Movies And Music? Probing BERT For Conversational Recommendation Penha Gustavo, Hauff Claudia
- Adapterhub: A Framework For Adapting Transformers Pfeiffer Jonas, Rücklé Andreas, Poth Clifton, Kamath Aishwarya, Vulić Ivan, Ruder Sebastian, Cho Kyunghyun, Gurevych Iryna
- MAD-X: An Adapter-based Framework For Multi-task Cross-lingual Transfer Pfeiffer Jonas, Vulić Ivan, Gurevych Iryna, Ruder Sebastian
- Sparsifying Transformer Models With Trainable Representation Pooling Pietruszka Michał, Borchmann Łukasz, Garncarek Łukasz
- Compressing Transformer-based Semantic Parsing Models Using Compositional Code Embeddings Prakash Prafull, Shashidhar Saurabh Kumar, Zhao Wenlong, Rongali Subendhu, Khan Haidar, Kayser Michael
- Intermediate-task Transfer Learning With Pretrained Models For Natural Language Understanding: When And Why Does It Work? Pruksachatkun Yada, Phang Jason, Liu Haokun, Htut Phu Mon, Zhang Xiaoyi, Pang Richard Yuanzhe, Vania Clara, Kann Katharina, Bowman Samuel R.
- Jiant: A Software Toolkit For Research On General-purpose Text Understanding Models Pruksachatkun Yada, Yeres Phil, Liu Haokun, Phang Jason, Htut Phu Mon, Wang Alex, Tenney Ian, Bowman Samuel R.
- Training Large Neural Networks With Constant Memory Using A New Execution Algorithm Pudipeddi Bharadwaj, Mesmakhosroshahi Maral, Xi Jinwen, Bharadwaj Sujeeth
- BANG: Bridging Autoregressive And Non-autoregressive Generation With Large Scale Pretraining Qi Weizhen, Gong Yeyun, Jiao Jian, Yan Yu, Chen Weizhu, Liu Dayiheng, Tang Kewen, Li Houqiang, Chen Jiusheng, Zhang Ruofei, Zhou Ming, Duan Nan
- Prophetnet: Predicting Future N-gram For Sequence-to-sequence Pre-training Qi Weizhen, Yan Yu, Gong Yeyun, Liu Dayiheng, Duan Nan, Chen Jiusheng, Zhang Ruofei, Zhou Ming
- How Fine Can Fine-tuning Be? Learning Efficient Language Models Radiya-dixit Evani, Wang Xin
- Investigating Pretrained Language Models For Graph-to-text Generation Ribeiro Leonardo F. R., Schmitt Martin, Schütze Hinrich, Gurevych Iryna
- Knowledge-aware Language Model Pretraining Rosset Corby, Xiong Chenyan, Phan Minh, Song Xia, Bennett Paul, Tiwary Saurabh
- Lareqa: Language-agnostic Answer Retrieval From A Multilingual Pool Roy Uma, Constant Noah, Al-rfou Rami, Barua Aditya, Phillips Aaron, Yang Yinfei
- Fine-tuning BERT For Schema-guided Zero-shot Dialogue State Tracking Ruan Yu-ping, Ling Zhen-hua, Gu Jia-chen, Liu Quan
- Prover: Proof Generation For Interpretable Reasoning Over Rules Saha Swarnadeep, Ghosh Sayan, Srivastava Shashank, Bansal Mohit
- It's Not Just Size That Matters: Small Language Models Are Also Few-shot Learners Schick Timo, Schütze Hinrich
- What BERT Sees: Cross-modal Transfer For Visual Question Generation Scialom Thomas, Bordes Patrick, Dray Paul-alexis, Staiano Jacopo, Gallinari Patrick
- BLEURT: Learning Robust Metrics For Text Generation Sellam Thibault, Das Dipanjan, Parikh Ankur P.
- End-to-end Synthetic Data Generation For Domain Adaptation Of Question Answering Systems Shakeri Siamak, Santos Cicero Nogueira Dos, Zhu Henry, Ng Patrick, Nan Feng, Wang Zhiguo, Nallapati Ramesh, Xiang Bing
- Data-efficient Methods For Dialogue Systems Shalyminov Igor
- Hybrid Generative-retrieval Transformers For Dialogue Domain Adaptation Shalyminov Igor, Sordoni Alessandro, Atkinson Adam, Schulz Hannes
- A Simple But Tough-to-beat Data Augmentation Approach For Natural Language Understanding And Generation Shen Dinghan, Zheng Mingzhi, Shen Yelong, Qu Yanru, Chen Weizhu
- Pre-trained Summarization Distillation Shleifer Sam, Rush Alexander M.
- Lightpaff: A Two-stage Distillation Framework For Pre-training And Fine-tuning Song Kaitao, Sun Hao, Tan Xu, Qin Tao, Lu Jianfeng, Liu Hongzhi, Liu Tie-yan
- Neural Machine Translation With Error Correction Song Kaitao, Tan Xu, Lu Jianfeng
- An Investigation Of Language Model Interpretability Via Sentence Editing Stevens Samuel, Su Yu
- Recipes For Adapting Pre-trained Monolingual And Multilingual Models To Machine Translation Stickland Asa Cooper, Li Xian, Ghazvininejad Marjan
- Diversifying Dialogue Generation With Non-conversational Text Su Hui, Shen Xiaoyu, Zhao Sanqiang, Zhou Xiao, Hu Pengwei, Zhong Randy, Niu Cheng, Zhou Jie
- Discovering Useful Sentence Representations From Large Pretrained Language Models Subramani Nishant, Suresh Nivedita
- Explain And Improve: Lrp-inference Fine-tuning For Image Captioning Models Sun Jiamei, Lapuschkin Sebastian, Samek Wojciech, Binder Alexander
- Proofwriter: Generating Implications, Proofs, And Abductive Statements Over Natural Language Tafjord Oyvind, Mishra Bhavana Dalvi, Clark Peter
- Let's Stop Incorrect Comparisons In End-to-end Relation Extraction! Taillé Bruno, Guigue Vincent, Scoutheeten Geoffrey, Gallinari Patrick
- Vokenization: Improving Language Understanding With Contextualized, Visual-grounded Supervision Tan Hao, Bansal Mohit
- Making Pre-trained Language Models Better Few-shot Learners Gao Tianyu, Fisch Adam, Chen Danqi
- On Optimal Transformer Depth For Low-resource Language Translation Van Biljon Elan, Pretorius Arnu, Kreutzer Julia
- Unsupervised Pretraining For Neural Machine Translation Using Elastic Weight Consolidation Variš Dušan, Bojar Ondřej
- Bertnesia: Investigating The Capture And Forgetting Of Knowledge In BERT Wallat Jonas, Singh Jaspreet, Anand Avishek
- Infobert: Improving Robustness Of Language Models From An Information Theoretic Perspective Wang Boxin, Wang Shuohang, Cheng Yu, Gan Zhe, Jia Ruoxi, Li Bo, Liu Jingjing
- SRQA: Synthetic Reader For Factoid Question Answering Wang Jiuniu, Xu Wenjia, Fu Xingyu, Wei Yang, Jin Li, Chen Ziyan, Xu Guangluan, Wu Yirong
- Encoding Syntactic Knowledge In Transformer Encoder For Intent Detection And Slot Filling Wang Jixuan, Wei Kai, Radfar Martin, Zhang Weiwei, Chung Clement
- Minilm: Deep Self-attention Distillation For Task-agnostic Compression Of Pre-trained Transformers Wang Wenhui, Wei Furu, Dong Li, Bao Hangbo, Yang Nan, Zhou Ming
- Rethinking The Value Of Transformer Components Wang Wenxuan, Tu Zhaopeng
- VD-BERT: A Unified Vision And Dialog Transformer With BERT Wang Yue, Joty Shafiq, Lyu Michael R., King Irwin, Xiong Caiming, Hoi Steven C. H.
- TOD-BERT: Pre-trained Natural Language Understanding For Task-oriented Dialogue Wu Chien-sheng, Hoi Steven, Socher Richard, Xiong Caiming
- ERNIE-GEN: An Enhanced Multi-flow Pre-training And Fine-tuning Framework For Natural Language Generation Xiao Dongling, Zhang Han, Li Yukun, Sun Yu, Tian Hao, Wu Hua, Wang Haifeng
- Exploring Question-specific Rewards For Generating Deep Questions Xie Yuxi, Pan Liangming, Wang Dongzhe, Kan Min-yen, Feng Yansong
- Improving BERT Fine-tuning Via Self-ensemble And Self-distillation Xu Yige, Qiu Xipeng, Zhou Ligao, Huang Xuanjing
- MC-BERT: Efficient Language Pre-training Via A Meta Controller Xu Zhenhui, Gong Linyuan, Ke Guolin, He Di, Zheng Shuxin, Wang Liwei, Bian Jiang, Liu Tie-yan
- Off-policy Self-critical Training For Transformer In Visual Paragraph Generation Yan Shiyang, Hua Yang, Robertson Neil M.
- Just Ask: Learning To Answer Questions From Millions Of Narrated Videos Yang Antoine, Miech Antoine, Sivic Josef, Laptev Ivan, Schmid Cordelia
- Seqdialn: Sequential Visual Dialog Networks In Joint Visual-linguistic Representation Space Yang Liu
- On The Sub-layer Functionalities Of Transformer Decoder Yang Yilin, Wang Longyue, Shi Shuming, Tadepalli Prasad, Lee Stefan, Tu Zhaopeng
- Contrastive Triple Extraction With Generative Transformer Ye Hongbin, Zhang Ningyu, Deng Shumin, Chen Mosha, Tan Chuanqi, Huang Fei, Chen Huajun
- Adding Recurrence To Pretrained Transformers For Improved Efficiency And Context Size Yoshida Davis, Ettinger Allyson, Gimpel Kevin
- Cocolm: Complex Commonsense Enhanced Language Model With Discourse Relations Yu Changlong, Zhang Hongming, Song Yangqiu, Ng Wilfred
- Few-shot Generative Conversational Query Rewriting Yu Shi, Liu Jiahua, Yang Jingqin, Xiong Chenyan, Bennett Paul, Gao Jianfeng, Liu Zhiyuan
- GOBO: Quantizing Attention-based NLP Models For Low Latency And Energy Efficient Inference Zadeh Ali Hadi, Edo Isak, Awad Omar Mohamed, Moshovos Andreas
- Turingadvice: A Generative And Dynamic Evaluation Of Language Use Zellers Rowan, Holtzman Ari, Clark Elizabeth, Qin Lianhui, Farhadi Ali, Choi Yejin
- Modeling Topical Relevance For Multi-turn Dialogue Generation Zhang Hainan, Lan Yanyan, Pang Liang, Chen Hongshen, Ding Zhuoye, Yin Dawei
- Token Drop Mechanism For Neural Machine Translation Zhang Huaao, Qiu Shigui, Duan Xiangyu, Zhang Min
- Accelerating Training Of Transformer-based Language Models With Progressive Layer Dropping Zhang Minjia, He Yuxiong
- Future-guided Incremental Transformer For Simultaneous Translation Zhang Shaolei, Feng Yang, Li Liangyou
- When Do You Need Billions Of Words Of Pretraining Data? Zhang Yian, Warstadt Alex, Li Haau-sing, Bowman Samuel R.
- POINTER: Constrained Progressive Text Generation Via Insertion-based Generative Pre-training Zhang Yizhe, Wang Guoyin, Li Chunyuan, Gan Zhe, Brockett Chris, Dolan Bill
- CPM: A Large-scale Generative Chinese Pre-trained Language Model Zhang Zhengyan, Han Xu, Zhou Hao, Ke Pei, Gu Yuxian, Ye Deming, Qin Yujia, Su Yusheng, Ji Haozhe, Guan Jian, Qi Fanchao, Wang Xiaozhi, Zheng Yanan, Zeng Guoyang, Cao Huanqi, Chen Shengqi, Li Daixuan, Sun Zhenbo, Liu Zhiyuan, Huang Minlie, Han Wentao, Tang Jie, Li Juanzi, Zhu Xiaoyan, Sun Maosong
- Know What You Don't Need: Single-shot Meta-pruning For Attention Heads Zhang Zhengyan, Qi Fanchao, Liu Zhiyuan, Liu Qun, Sun Maosong
- Low-resource Knowledge-grounded Dialogue Generation Zhao Xueliang, Wu Wei, Tao Chongyang, Xu Can, Zhao Dongyan, Yan Rui
- Stylized Dialogue Response Generation Using Stylized Unpaired Texts Zheng Yinhe, Chen Zikai, Zhang Rongsheng, Huang Shilei, Mao Xiaoxi, Huang Minlie
- Improving Non-autoregressive Neural Machine Translation With Monolingual Data Zhou Jiawei, Keung Phillip
- Pre-training Text-to-text Transformers For Concept-centric Common Sense Zhou Wangchunshu, Lee Dong-ho, Selvam Ravi Kiran, Lee Seyeon, Lin Bill Yuchen, Ren Xiang
- Learning To Compare For Better Training And Evaluation Of Open Domain Natural Language Generation Models Zhou Wangchunshu, Xu Ke
- Modifying Memories In Transformer Models Zhu Chen, Rawat Ankit Singh, Zaheer Manzil, Bhojanapalli Srinadh, Li Daliang, Yu Felix, Kumar Sanjiv
- Overcoming Language Priors With Self-supervised Learning For Visual Question Answering Zhu Xi, Mao Zhendong, Liu Chunxiao, Zhang Peng, Wang Bin, Zhang Yongdong
- A General Language Assistant As A Laboratory For Alignment Amanda Askell, Yuntao Bai, Anna Chen, Dawn Drain, Deep Ganguli, Tom Henighan, Andy Jones, Nicholas Joseph, Ben Mann, Nova Dassarma, Nelson Elhage, Zac Hatfield-dodds, Danny Hernandez, Jackson Kernion, Kamal Ndousse, Catherine Olsson, Dario Amodei, Tom Brown, Jack Clark, Sam Mccandlish, Chris Olah, Jared Kaplan
- Ext5: Towards Extreme Multi-task Scaling For Transfer Learning Aribandi Vamsi, Tay Yi, Schuster Tal, Rao Jinfeng, Zheng Huaixiu Steven, Mehta Sanket Vaibhav, Zhuang Honglei, Tran Vinh Q., Bahri Dara, Ni Jianmo, Gupta Jai, Hui Kai, Ruder Sebastian, Metzler Donald
- On The Multilingual Capabilities Of Very Large-scale English Language Models Armengol-estapé Jordi, Bonet Ona De Gibert, Melero Maite
- Efficient Large Scale Language Modeling With Mixtures Of Experts Artetxe Mikel, Bhosale Shruti, Goyal Naman, Mihaylov Todor, Ott Myle, Shleifer Sam, Lin Xi Victoria, Du Jingfei, Iyer Srinivasan, Pasunuru Ramakanth, Anantharaman Giri, Li Xian, Chen Shuohui, Akin Halil, Baines Mandeep, Martin Louis, Zhou Xing, Koura Punit Singh, O'horo Brian, Wang Jeff, Zettlemoyer Luke, Diab Mona, Kozareva Zornitsa, Stoyanov Ves
- One Question Answering Model For Many Languages With Cross-lingual Dense Passage Retrieval Asai Akari, Yu Xinyan, Kasai Jungo, Hajishirzi Hannaneh
- Self-supervised Test-time Learning For Reading Comprehension Banerjee Pratyay, Gokhale Tejas, Baral Chitta
- G-transformer For Document-level Machine Translation Bao Guangsheng, Zhang Yue, Teng Zhiyang, Chen Boxing, Luo Weihua
- S2s-ft: Fine-tuning Pretrained Transformer Encoders For Sequence-to-sequence Learning Bao Hangbo, Dong Li, Wang Wenhui, Yang Nan, Wei Furu
- PLATO-XL: Exploring The Large-scale Pre-training Of Dialogue Generation Bao Siqi, He Huang, Wang Fan, Wu Hua, Wang Haifeng, Wu Wenquan, Wu Zhihua, Guo Zhen, Lu Hua, Huang Xinxian, Tian Xin, Xu Xinchao, Lin Yingzhan, Niu Zheng-yu
- Gabert -- An Irish Language Model Barry James, Wagner Joachim, Cassidy Lauren, Cowap Alan, Lynn Teresa, Walsh Abigail, Meachair Mícheál J. Ó, Foster Jennifer
- Building A Question And Answer System For News Domain Basu Sandipan, Gaddala Aravind, Chetan Pooja, Tiwari Garima, Darapaneni Narayana, Parvathaneni Sadwik, Paduri Anwesh Reddy
- Exploring Unsupervised Pretraining Objectives For Machine Translation Baziotis Christos, Titov Ivan, Birch Alexandra, Haddow Barry
- Continual Learning In Multilingual NMT Via Language-specific Embeddings Berard Alexandre
- SHAQ: Single Headed Attention With Quasi-recurrence Bharwani Nashwin, Kushner Warren, Dandona Sangeet, Schreiber Ben
- On The Universality Of Deep Contextual Language Models Bhatt Shaily, Goyal Poonam, Dandapat Sandipan, Choudhury Monojit, Sitaram Sunayana
- Bridging The Gap Between Language Model And Reading Comprehension: Unsupervised MRC Via Self-supervision Bian Ning, Han Xianpei, Chen Bo, Lin Hongyu, He Ben, Sun Le
- Latr: Layout-aware Transformer For Scene-text VQA Biten Ali Furkan, Litman Ron, Xie Yusheng, Appalaraju Srikar, Manmatha R.
- Improving Language Models By Retrieving From Trillions Of Tokens Borgeaud Sebastian, Mensch Arthur, Hoffmann Jordan, Cai Trevor, Rutherford Eliza, Millican Katie, Driessche George Van Den, Lespiau Jean-baptiste, Damoc Bogdan, Clark Aidan, Casas Diego De Las, Guy Aurelia, Menick Jacob, Ring Roman, Hennigan Tom, Huang Saffron, Maggiore Loren, Jones Chris, Cassirer Albin, Brock Andy, Paganini Michela, Irving Geoffrey, Vinyals Oriol, Osindero Simon, Simonyan Karen, Rae Jack W., Elsen Erich, Sifre Laurent
- What Changes Can Large-scale Language Models Bring? Intensive Study On Hyperclova: Billions-scale Korean Generative Pretrained Transformers Boseop Kim, Hyoungseok Kim, Sang-woo Lee, Gichang Lee, Donghyun Kwak, Dong Hyeon Jeon, Sunghyun Park, Sungju Kim, Seonhoon Kim, Dongpil Seo, Heungsub Lee, Minyoung Jeong, Sungjae Lee, Minsub Kim, Suk Hyun Ko, Seokhun Kim, Taeyong Park, Jinuk Kim, Soyoung Kang, Na-hyeon Ryu, Kang Min Yoo, Minsuk Chang, Soobin Suh, Sookyo In, Jinseong Park, Kyungduk Kim, Hiun Kim, Jisu Jeong, Yong Goo Yeo, Donghoon Ham, Dongju Park, Min Young Lee, Jaewook Kang, Inho Kang, Jung-woo Ha, Woomyoung Park, Nako Sung
- On The Prunability Of Attention Heads In Multilingual BERT Budhraja Aakriti, Pande Madhura, Kumar Pratyush, Khapra Mitesh M.
- CDLM: Cross-document Language Modeling Caciularu Avi, Cohan Arman, Beltagy Iz, Peters Matthew E., Cattan Arie, Dagan Ido
- Curriculum Learning For Language Modeling Campos Daniel
- Implicit Premise Generation With Discourse-aware Commonsense Knowledge Models Chakrabarty Tuhin, Trivedi Aadit, Muresan Smaranda
- The Selectgen Challenge: Finding The Best Training Samples For Few-shot Neural Text Generation Chang Ernie, Shen Xiaoyu, Marin Alex, Demberg Vera
- On Training Instance Selection For Few-shot Neural Text Generation Chang Ernie, Shen Xiaoyu, Yeh Hui-syuan, Demberg Vera
- Neural Data-to-text Generation With Lm-based Text Augmentation Chang Ernie, Shen Xiaoyu, Zhu Dawei, Demberg Vera, Su Hui
- Topic Transferable Table Question Answering Chemmengath Saneem Ahmed, Kumar Vishwajeet, Bharadwaj Samarth, Sen Jaydeep, Canim Mustafa, Chakrabarti Soumen, Gliozzo Alfio, Sankaranarayanan Karthik
- Towards Making The Most Of Multilingual Pretraining For Zero-shot Neural Machine Translation Chen Guanhua, Ma Shuming, Chen Yun, Zhang Dongdong, Pan Jia, Wang Wenping, Wei Furu
- Visualgpt: Data-efficient Adaptation Of Pretrained Language Models For Image Captioning Chen Jun, Guo Han, Yi Kai, Li Boyang, Elhoseiny Mohamed
- MT6: Multilingual Pretrained Text-to-text Transformer With Translation Pairs Chi Zewen, Dong Li, Ma Shuming, Huang Shaohan, Mao Xian-ling, Huang Heyan, Wei Furu
- Breaking Down Multilingual Machine Translation Chiang Ting-rui, Chen Yi-pei, Yeh Yi-ting, Neubig Graham
- Language Modeling Using Lmus: 10x Better Data Efficiency Or Improved Scaling Compared To Transformers Chilkuri Narsimha, Hunsberger Eric, Voelker Aaron, Malik Gurshaant, Eliasmith Chris
- Ensemble Fine-tuned Mbert For Translation Quality Estimation Chowdhury Shaika, Baili Naouel, Vannah Brian
- Enjoy The Salience: Towards Better Transformer-based Faithful Explanations With Word Salience Chrysostomou George, Aletras Nikolaos
- All That's 'human' Is Not Gold: Evaluating Human Evaluation Of Generated Text Clark Elizabeth, August Tal, Serrano Sofia, Haduong Nikita, Gururangan Suchin, Smith Noah A.
- CANINE: Pre-training An Efficient Tokenization-free Encoder For Language Representation Clark Jonathan H., Garrette Dan, Turc Iulia, Wieting John
- Control Prefixes For Parameter-efficient Text Generation Clive Jordan, Cao Kris, Rei Marek
- Training Verifiers To Solve Math Word Problems Cobbe Karl, Kosaraju Vineet, Bavarian Mohammad, Chen Mark, Jun Heewoo, Kaiser Lukasz, Plappert Matthias, Tworek Jerry, Hilton Jacob, Nakano Reiichiro, Hesse Christopher, Schulman John
- YANMTT: Yet Another Neural Machine Translation Toolkit Dabre Raj, Sumita Eiichiro
- Knowledge Neurons In Pretrained Transformers Dai Damai, Dong Li, Hao Yaru, Sui Zhifang, Chang Baobao, Wei Furu
- Zero-shot Recommendation As Language Modeling Damien Sileo, Wout Vossen, Robbe Raymaekers
- Mention Memory: Incorporating Textual Knowledge Into Transformers Through Entity Mention Attention De Jong Michiel, Zemlyanskiy Yury, Fitzgerald Nicholas, Sha Fei, Cohen William
- Adapting Monolingual Models: Data Can Be Scarce When Language Similarity Is High De Vries Wietse, Bartelds Martijn, Nissim Malvina, Wieling Martijn
- Compression, Transduction, And Creation: A Unified Framework For Evaluating Natural Language Generation Deng Mingkai, Tan Bowen, Liu Zhengzhong, Xing Eric P., Hu Zhiting
- Multitask Finetuning For Improving Neural Machine Translation In Indian Languages Desai Shaily, Kshirsagar Atharva, Marathe Manisha
- Diagnosing Transformers In Task-oriented Semantic Parsing Desai Shrey, Aly Ahmed
- EKTVQA: Generalized Use Of External Knowledge To Empower Scene Text In Text-vqa Dey Arka Ujjal, Valveny Ernest, Harit Gaurav
- Cogview: Mastering Text-to-image Generation Via Transformers Ding Ming, Yang Zhuoyi, Hong Wenyi, Zheng Wendi, Zhou Chang, Yin Da, Lin Junyang, Zou Xu, Shao Zhou, Yang Hongxia, Tang Jie
- A Primer On Pretrained Multilingual Language Models Doddapaneni Sumanth, Ramesh Gowtham, Khapra Mitesh M., Kunchukuttan Anoop, Kumar Pratyush
- Diverse Pretrained Context Encodings Improve Document Translation Donato Domenic, Yu Lei, Dyer Chris
- Attention Forcing For Machine Translation Dou Qingyun, Lu Yiting, Manakul Potsawee, Wu Xixin, Gales Mark J. F.
- Is GPT-3 Text Indistinguishable From Human Text? Scarecrow: A Framework For Scrutinizing Machine Text Dou Yao, Forbes Maxwell, Koncel-kedziorski Rik, Smith Noah A., Choi Yejin
- An Empirical Study Of Training End-to-end Vision-and-language Transformers Dou Zi-yi, Xu Yichong, Gan Zhe, Wang Jianfeng, Wang Shuohang, Wang Lijuan, Zhu Chenguang, Zhang Pengchuan, Yuan Lu, Peng Nanyun, Liu Zicheng, Zeng Michael
- GLM: General Language Model Pretraining With Autoregressive Blank Infilling Du Zhengxiao, Qian Yujie, Liu Xiao, Ding Ming, Qiu Jiezhong, Yang Zhilin, Tang Jie
- Robustly Optimized And Distilled Training For Natural Language Understanding Elfadeel Haytham, Peshterliev Stan
- Does CLIP Benefit Visual Question Answering In The Medical Domain As Much As It Does In The General Domain? Eslami Sedigheh, De Melo Gerard, Meinel Christoph
- Compressing Visual-linguistic Model Via Knowledge Distillation Fang Zhiyuan, Wang Jianfeng, Hu Xiaowei, Wang Lijuan, Yang Yezhou, Liu Zicheng
- Vision-and-language Or Vision-for-language? On Cross-modal Influence In Multimodal Transformers Frank Stella, Bugliarello Emanuele, Elliott Desmond
- Grounded Graph Decoding Improves Compositional Generalization In Question Answering Gai Yu, Jain Paras, Zhang Wendi, Gonzalez Joseph E., Song Dawn, Stoica Ion
- Chop Chop BERT: Visual Question Answering By Chopping Visualbert's Heads Gao Chenyu, Zhu Qi, Wang Peng, Wu Qi
- Condenser: A Pre-training Architecture For Dense Retrieval Gao Luyu, Callan Jamie
- Scalable Transformers For Neural Machine Translation Gao Peng, Geng Shijie, Qiao Yu, Wang Xiaogang, Dai Jifeng, Li Hongsheng
- An Adversarially-learned Turing Test For Dialog Generation Models Gao Xiang, Zhang Yizhe, Galley Michel, Dolan Bill
- Improving And Diagnosing Knowledge-based Visual Question Answering Via Entity Enhanced Knowledge Injection Garcia-olano Diego, Onoe Yasumasa, Ghosh Joydeep
- Cross-attention Is All You Need: Adapting Pretrained Transformers For Machine Translation Gheini Mozhdeh, Ren Xiang, May Jonathan
- Towards Trustworthy Deception Detection: Benchmarking Model Robustness Across Domains, Modalities, And Languages Glenski Maria, Ayton Ellyn, Cosbey Robin, Arendt Dustin, Volkova Svitlana
- LAWDR: Language-agnostic Weighted Document Representations From Pre-trained Models Gong Hongyu, Chaudhary Vishrav, Tang Yuqing, Guzmán Francisco
- Larger-scale Transformers For Multilingual Masked Language Modeling Goyal Naman, Du Jingfei, Ott Myle, Anantharaman Giri, Conneau Alexis
- Xeroalign: Zero-shot Cross-lingual Transformer Alignment Gritta Milan, Iacobacci Ignacio
- Long Text Generation By Modeling Sentence-level And Discourse-level Coherence Guan Jian, Mao Xiaoxi, Fan Changjie, Liu Zitao, Ding Wenbiao, Huang Minlie
- Longt5: Efficient Text-to-text Transformer For Long Sequences Guo Mandy, Ainslie Joshua, Uthus David, Ontanon Santiago, Ni Jianmo, Sung Yun-hsuan, Yang Yinfei
- LICHEE: Improving Language Model Pre-training With Multi-grained Tokenization Guo Weidong, Zhao Mingjun, Zhang Lusheng, Niu Di, Luo Jinwen, Liu Zhenhua, Li Zhenyang, Tang Jianbo
- Memory-efficient Transformers Via Top-k Attention Gupta Ankit, Dar Guy, Goodman Shaya, Ciprut David, Berant Jonathan
- A Million Tweets Are Worth A Few Points: Tuning Transformers For Customer Service Tasks Hadifar Amir, Labat Sofie, Hoste Véronique, Develder Chris, Demeester Thomas
- WARP: Word-level Adversarial Reprogramming Hambardzumyan Karen, Khachatrian Hrant, May Jonathan
- Zero-shot Controlled Generation With Encoder-decoder Transformers Hazarika Devamanyu, Namazifar Mahdi, Hakkani-tür Dilek
- Generate, Annotate, And Learn: NLP With Synthetic Text He Xuanli, Nassar Islam, Kiros Jamie, Haffari Gholamreza, Norouzi Mohammad
- Open Domain Question Answering Over Tables Via Dense Retrieval Herzig Jonathan, Müller Thomas, Krichene Syrine, Eisenschlos Julian Martin
- ENCONTER: Entity Constrained Progressive Sequence Generation Via Insertion-based Transformer Hsieh Lee-hsun, Lee Yang-yin, Lim Ee-peng
- Lora: Low-rank Adaptation Of Large Language Models Hu Edward J., Shen Yelong, Wallis Phillip, Allen-zhu Zeyuan, Li Yuanzhi, Wang Shean, Wang Lu, Chen Weizhu
- Unifying Multimodal Transformer For Bi-directional Image And Text Generation Huang Yupan, Xue Hongwei, Liu Bei, Lu Yutong
- SILT: Efficient Transformer Training For Inter-lingual Inference Huertas-tato Javier, Martín Alejandro, Camacho David
- DS-TOD: Efficient Domain Specialization For Task Oriented Dialog Hung Chia-chien, Lauscher Anne, Ponzetto Simone Paolo, Glavaš Goran
- Scaling Language Models: Methods, Analysis & Insights From Training Gopher Jack W. Rae, Sebastian Borgeaud, Trevor Cai, Katie Millican, Jordan Hoffmann, Francis Song, John Aslanides, Sarah Henderson, Roman Ring, Susannah Young, Eliza Rutherford, Tom Hennigan, Jacob Menick, Albin Cassirer, Richard Powell, George Van Den Driessche, Lisa Anne Hendricks, Maribeth Rauh, Po-sen Huang, Amelia Glaese, Johannes Welbl, Sumanth Dathathri, Saffron Huang, Jonathan Uesato, John Mellor, Irina Higgins, Antonia Creswell, Nat Mcaleese, Amy Wu, Erich Elsen, Siddhant Jayakumar, Elena Buchatskaya, David Budden, Esme Sutherland, Karen Simonyan, Michela Paganini, Laurent Sifre, Lena Martens, Xiang Lorraine Li, Adhiguna Kuncoro, Aida Nematzadeh, Elena Gribovskaya, Domenic Donato, Angeliki Lazaridou, Arthur Mensch, Jean-baptiste Lespiau, Maria Tsimpoukelli, Nikolai Grigorev, Doug Fritz, Thibault Sottiaux, Mantas Pajarskas, Toby Pohlen, Zhitao Gong, Daniel Toyama, Cyprien De Masson D'autume, Yujia Li, Tayfun Terzi, Vladimir Mikulik, Igor Babuschkin, Aidan Clark, Diego De Las Casas, Aurelia Guy, Chris Jones, James Bradbury, Matthew Johnson, Blake Hechtman, Laura Weidinger, Iason Gabriel, William Isaac, Ed Lockhart, Simon Osindero, Laura Rimell, Chris Dyer, Oriol Vinyals, Kareem Ayoub, Jeff Stanway, Lorrayne Bennett, Demis Hassabis, Koray Kavukcuoglu, Geoffrey Irving
- Visqa: X-raying Vision And Language Reasoning In Transformers Jaunet Theo, Kervadec Corentin, Vuillemot Romain, Antipov Grigory, Baccouche Moez, Wolf Christian
- Retraining Distilbert For A Voice Shopping Assistant By Using Universal Dependencies Jayarao Pratik, Sharma Arpit
- Improving Non-autoregressive Generation With Mixup Training Jiang Ting, Huang Shaohan, Zhang Zihan, Wang Deqing, Zhuang Fuzhen, Wei Furu, Huang Haizhen, Zhang Liangjie, Zhang Qi
- Inducing Transformer's Compositional Generalization Ability Via Auxiliary Sequence Prediction Tasks Jiang Yichen, Bansal Mohit
- REPT: Bridging Language Models And Machine Reading Comprehension Via Retrieval-based Pre-training Jiao Fangkai, Guo Yangyang, Niu Yilin, Ji Feng, Li Feng-lin, Nie Liqiang
- A Good Prompt Is Worth Millions Of Parameters: Low-resource Prompt-based Learning For Vision-language Models Jin Woojeong, Cheng Yu, Shen Yelong, Chen Weizhu, Ren Xiang
- Numgpt: Improving Numeracy Ability Of Generative Pre-trained Models Jin Zhihua, Jiang Xin, Wang Xingbo, Liu Qun, Wang Yong, Ren Xiaozhe, Qu Huamin
- Rethinking Self-supervision Objectives For Generalizable Coherence Modeling Jwalapuram Prathyusha, Joty Shafiq, Lin Xiang
- Towards Incremental Transformers: An Empirical Analysis Of Transformer Models For Incremental NLU Kahardipraja Patrick, Madureira Brielen, Schlangen David
- Nmt5 -- Is Parallel Data Still Relevant For Pre-training Massively Multilingual Language Models? Kale Mihir, Siddhant Aditya, Constant Noah, Johnson Melvin, Al-rfou Rami, Xue Linting
- Distilling Large Language Models Into Tiny And Effective Students Using Pqrnn Kaliamoorthi Prabhu, Siddhant Aditya, Li Edward, Johnson Melvin
- AMMUS: A Survey Of Transformer-based Pretrained Models In Natural Language Processing Kalyan Katikapalli Subramanyam, Rajasekharan Ajit, Sangeetha Sivanesan
- Amazon Sagemaker Model Parallelism: A General And Flexible Framework For Large Model Training Karakus Can, Huilgol Rahul, Wu Fei, Subramanian Anirudh, Daniel Cade, Cavdar Derya, Xu Teng, Chen Haohan, Rahnama Arash, Quintela Luis
- Finetuning Pretrained Transformers Into Rnns Kasai Jungo, Peng Hao, Zhang Yizhe, Yogatama Dani, Ilharco Gabriel, Pappas Nikolaos, Mao Yi, Chen Weizhu, Smith Noah A.
- Uquad1.0: Development Of An Urdu Question Answering Training Data For Machine Reading Comprehension Kazi Samreen, Khoja Shakeel
- Generating Rich Product Descriptions For Conversational E-commerce Systems Kedia Shashank, Mantha Aditya, Gupta Sneha, Guo Stephen, Achan Kannan
- How Transferable Are Reasoning Patterns In VQA? Kervadec Corentin, Jaunet Theo, Antipov Grigory, Baccouche Moez, Vuillemot Romain, Wolf Christian
- Text-free Prosody-aware Generative Spoken Language Modeling Kharitonov Eugene, Lee Ann, Polyak Adam, Adi Yossi, Copet Jade, Lakhotia Kushal, Nguyen Tu-anh, Rivière Morgane, Mohamed Abdelrahman, Dupoux Emmanuel, Hsu Wei-ning
- Learned Token Pruning For Transformers Kim Sehoon, Shen Sheng, Thorsley David, Gholami Amir, Kwon Woosuk, Hassoun Joseph, Keutzer Kurt
- What's In A Measurement? Using GPT-3 On Semeval 2021 Task 8 -- Measeval Kohler Curt, Daniel Ron Jr
- Improving Scheduled Sampling With Elastic Weight Consolidation For Neural Machine Translation Korakakis Michalis, Vlachos Andreas
- Controlling Conditional Language Models Without Catastrophic Forgetting Korbak Tomasz, Elsahar Hady, Kruszewski German, Dymetman Marc
- BERT Busters: Outlier Dimensions That Disrupt Transformers Kovaleva Olga, Kulshreshtha Saurabh, Rogers Anna, Rumshisky Anna
- Hurdles To Progress In Long-form Question Answering Krishna Kalpesh, Roy Aurko, Iyyer Mohit
- What BERT Based Language Models Learn In Spoken Transcripts: An Empirical Study Kumar Ayush, Sundararaman Mukuntha Narayanan, Vepa Jithendra
- Layered Gradient Accumulation And Modular Pipeline Parallelism: Fast And Efficient Training Of Large Language Models Lamy-poirier Joel
- Pagnol: An Extra-large French Generative Model Launay Julien, Tommasone Elena, Pannier Baptiste, Boniface François, Chatelain Amélie, Cappelli Alessandro, Poli Iacopo, Seddah Djamé
- You Only Need One Model For Open-domain Question Answering Lee Haejun, Kedia Akhil, Lee Jongwon, Paranjape Ashwin, Manning Christopher D., Woo Kyoung-gu
- Towards Few-shot Fact-checking Via Perplexity Lee Nayeon, Bang Yejin, Madotto Andrea, Khabsa Madian, Fung Pascale
- When Attention Meets Fast Recurrence: Training Language Models With Reduced Compute Lei Tao
- Exploiting Sentence-level Representations For Passage Ranking Leonhardt Jurek, Beringer Fabian, Anand Avishek
- Semvlp: Vision-language Pre-training By Aligning Semantics At Multiple Levels Li Chenliang, Yan Ming, Xu Haiyang, Luo Fuli, Wang Wei, Bi Bin, Huang Songfang
- The Stability-efficiency Dilemma: Investigating Sequence Length Warmup For Training GPT Models Li Conglong, Zhang Minjia, He Yuxiong
- Align Before Fuse: Vision And Language Representation Learning With Momentum Distillation Li Junnan, Selvaraju Ramprasaath R., Gotmare Akhilesh Deepak, Joty Shafiq, Xiong Caiming, Hoi Steven
- Self-supervised Bot Play For Conversational Recommendation With Justifications Li Shuyang, Majumder Bodhisattwa Prasad, Mcauley Julian
- A Short Study On Compressing Decoder-based Language Models Li Tianda, Mesbahi Yassir El, Kobyzev Ivan, Rashid Ahmad, Mahmud Atif, Anchuri Nithin, Hajimolahoseini Habib, Liu Yang, Rezagholizadeh Mehdi
- Question-aware Memory Network For Multi-hop Question Answering In Human-robot Interaction Li Xinmeng, Alazab Mamoun, Li Qian, Yu Keping, Yin Quanjun
- Supervision Exists Everywhere: A Data Efficient Contrastive Language-image Pre-training Paradigm Li Yangguang, Liang Feng, Zhao Lichen, Cui Yufeng, Ouyang Wanli, Shao Jing, Yu Fengwei, Yan Junjie
- Scheduled Sampling In Vision-language Pretraining With Decoupled Encoder-decoder Network Li Yehao, Pan Yingwei, Yao Ting, Chen Jingwen, Mei Tao
- Terapipe: Token-level Pipeline Parallelism For Training Large-scale Language Models Li Zhuohan, Zhuang Siyuan, Guo Shiyuan, Zhuo Danyang, Zhang Hao, Song Dawn, Stoica Ion
- Dialoging Resonance: How Users Perceive, Reciprocate And React To Chatbot's Self-disclosure In Conversational Recommendations Liang Kai-hui, Shi Weiyan, Oh Yoojung, Wang Hao-chuan, Zhang Jingwen, Yu Zhou
- Learning Neural Templates For Recommender Dialogue System Liang Zujie, Hu Huang, Xu Can, Miao Jian, He Yingying, Chen Yining, Geng Xiubo, Liang Fan, Jiang Daxin
- Domain-independent User Simulation With Transformers For Task-oriented Dialogue Systems Lin Hsien-chin, Lubis Nurul, Hu Songbo, Van Niekerk Carel, Geishauser Christian, Heck Michael, Feng Shutong, Gašić Milica
- MDQE: A More Accurate Direct Pretraining For Machine Translation Quality Estimation Lin Lei
- Truthfulqa: Measuring How Models Mimic Human Falsehoods Lin Stephanie, Hilton Jacob, Evans Owain
- Few-shot Learning With Multilingual Language Models Lin Xi Victoria, Mihaylov Todor, Artetxe Mikel, Wang Tianlu, Chen Shuohui, Simig Daniel, Ott Myle, Goyal Naman, Bhosale Shruti, Du Jingfei, Pasunuru Ramakanth, Shleifer Sam, Koura Punit Singh, Chaudhary Vishrav, O'horo Brian, Wang Jeff, Zettlemoyer Luke, Kozareva Zornitsa, Diab Mona, Stoyanov Veselin, Li Xian
- VX2TEXT: End-to-end Learning Of Video-based Text Generation From Multimodal Inputs Lin Xudong, Bertasius Gedas, Wang Jue, Chang Shih-fu, Parikh Devi, Torresani Lorenzo
- Variational Latent-state GPT For Semi-supervised Task-oriented Dialog Systems Liu Hong, Cai Yucheng, Lin Zhenru, Ou Zhijian, Huang Yi, Feng Junlan
- Towards Automated Psychotherapy Via Language Modeling Liu Houjun
- A Three-stage Learning Framework For Low-resource Knowledge-grounded Dialogue Generation Liu Shilei, Zhao Xiaofeng, Li Bochao, Ren Feiliang, Zhang Longhui, Yin Shujuan
- Unified Multimodal Pre-training And Prompt-based Tuning For Vision-language Understanding And Generation Liu Tianyi, Wu Zuxuan, Xiong Wenhan, Chen Jingjing, Jiang Yu-gang
- GPT Understands, Too Liu Xiao, Zheng Yanan, Du Zhengxiao, Ding Ming, Qian Yujie, Yang Zhilin, Tang Jie
- Confidence-aware Scheduled Sampling For Neural Machine Translation Liu Yijin, Meng Fandong, Chen Yufeng, Xu Jinan, Zhou Jie
- Scheduled Sampling Based On Decoding Steps For Neural Machine Translation Liu Yijin, Meng Fandong, Chen Yufeng, Xu Jinan, Zhou Jie
- Augmenting Sequential Recommendation With Pseudo-prior Items Via Reversely Pre-training Transformer Liu Zhiwei, Fan Ziwei, Wang Yu, Yu Philip S.
- Learning To Ask Conversational Questions By Optimizing Levenshtein Distance Liu Zhongkun, Ren Pengjie, Chen Zhumin, Ren Zhaochun, De Rijke Maarten, Zhou Ming
- Pretrained Transformers As Universal Computation Engines Lu Kevin, Grover Aditya, Abbeel Pieter, Mordatch Igor
- Fantastically Ordered Prompts And Where To Find Them: Overcoming Few-shot Prompt Order Sensitivity Lu Yao, Bartolo Max, Moore Alastair, Riedel Sebastian, Stenetorp Pontus
- X-METRA-ADA: Cross-lingual Meta-transfer Learning Adaptation To Natural Language Understanding And Question Answering M'hamdi Meryem, Kim Doo Soon, Dernoncourt Franck, Bui Trung, Ren Xiang, May Jonathan
- Deltalm: Encoder-decoder Pre-training For Language Generation And Translation By Augmenting Pretrained Multilingual Encoders Ma Shuming, Dong Li, Huang Shaohan, Zhang Dongdong, Muzio Alexandre, Singhal Saksham, Awadalla Hany Hassan, Song Xia, Wei Furu
- Coherence Boosting: When Your Pretrained Language Model Is Not Paying Enough Attention Malkin Nikolay, Wang Zhen, Jojic Nebojsa
- Local And Global Context-based Pairwise Models For Sentence Ordering Manku Ruskin Raj, Paul Aditya Jyoti
- Lightweight Cross-lingual Sentence Representation Learning Mao Zhuoyuan, Gupta Prakhar, Wang Pei, Chu Chenhui, Jaggi Martin, Kurohashi Sadao
- Few-shot Self-rationalization With Natural Language Prompts Marasović Ana, Beltagy Iz, Downey Doug, Peters Matthew E.
- Transformer Based Bengali Chatbot Using General Knowledge Dataset Masum Abu Kaisar Mohammad, Abujar Sheikh, Akter Sharmin, Ria Nushrat Jahan, Hossain Syed Akhter
- How Much Do Language Models Copy From Their Training Data? Evaluating Linguistic Novelty In Text Generation Using RAVEN Mccoy R. Thomas, Smolensky Paul, Linzen Tal, Gao Jianfeng, Celikyilmaz Asli
- Schema-guided Paradigm For Zero-shot Dialog Mehri Shikib, Eskenazi Maxine
- Improving Compositional Generalization With Self-training For Data-to-text Generation Mehta Sanket Vaibhav, Rao Jinfeng, Tay Yi, Kale Mihir, Parikh Ankur P., Strubell Emma
- COCO-LM: Correcting And Contrasting Text Sequences For Language Model Pretraining Meng Yu, Xiong Chenyan, Bajaj Payal, Tiwary Saurabh, Bennett Paul, Han Jiawei, Song Xia
- Rewire-then-probe: A Contrastive Recipe For Probing Biomedical Knowledge Of Pre-trained Language Models Meng Zaiqiao, Liu Fangyu, Shareghi Ehsan, Su Yixuan, Collins Charlotte, Collier Nigel
- Self-training Improves Pre-training For Few-shot Learning In Task-oriented Dialog Systems Mi Fei, Zhou Wanhao, Cai Fengyu, Kong Lingjing, Huang Minlie, Faltings Boi
- Recent Advances In Natural Language Processing Via Large Pre-trained Language Models: A Survey Min Bonan, Ross Hayley, Sulem Elior, Veyseh Amir Pouran Ben, Nguyen Thien Huu, Sainz Oscar, Agirre Eneko, Heinz Ilana, Roth Dan
- Distantly Supervised Transformers For E-commerce Product QA Mittal Happy, Chakrabarti Aniket, Bayar Belhassen, Sharma Animesh Anant, Rasiwasia Nikhil
- Multi-modal Understanding And Generation For Medical Images And Text Via Vision-language Pre-training Moon Jong Hak, Lee Hyungyung, Shin Woncheol, Kim Young-hak, Choi Edward
- GPT-3 Models Are Poor Few-shot Learners In The Biomedical Domain Moradi Milad, Blagec Kathrin, Haberl Florian, Samwald Matthias
- First Align, Then Predict: Understanding The Cross-lingual Ability Of Multilingual BERT Muller Benjamin, Elazar Yanai, Sagot Benoît, Seddah Djamé
- Glam: Efficient Scaling Of Language Models With Mixture-of-experts Nan Du, Yanping Huang, Andrew M. Dai, Simon Tong, Dmitry Lepikhin, Yuanzhong Xu, Maxim Krikun, Yanqi Zhou, Adams Wei Yu, Orhan Firat, Barret Zoph, Liam Fedus, Maarten Bosma, Zongwei Zhou, Tao Wang, Yu Emma Wang, Kellie Webster, Marie Pellat, Kevin Robinson, Kathleen Meier-hellstern, Toju Duke, Lucas Dixon, Kun Zhang, Quoc V Le, Yonghui Wu, Zhifeng Chen, Claire Cui
- Planning With Learned Entity Prompts For Abstractive Summarization Narayan Shashi, Zhao Yao, Maynez Joshua, Simoes Gonçalo, Nikolaev Vitaly, Mcdonald Ryan
- Improving Transformers With Probabilistic Attention Keys Nguyen Tam, Nguyen Tan M., Le Dung D., Nguyen Duy Khuong, Tran Viet-anh, Baraniuk Richard G., Ho Nhat, Osher Stanley J.
- MLP Architectures For Vision-and-language Modeling: An Empirical Study Nie Yixin, Li Linjie, Gan Zhe, Wang Shuohang, Zhu Chenguang, Zeng Michael, Liu Zicheng, Bansal Mohit, Wang Lijuan
- Investigating The Limitations Of Transformers With Simple Arithmetic Tasks Nogueira Rodrigo, Jiang Zhiying, Lin Jimmy
- Code Generation From Natural Language With Less Prior And More Monolingual Data Norouzi Sajad, Tang Keyi, Cao Yanshuai
- Learning To Retrieve Prompts For In-context Learning Ohad Rubin, Jonathan Herzig, Jonathan Berant
- Investigating Numeracy Learning Ability Of A Text-to-text Transfer Model Pal Kuntal Kumar, Baral Chitta
- Probing For Bridging Inference In Transformer Language Models Pandit Onkar, Hou Yufang
- Episodic Transformer For Vision-and-language Navigation Pashevich Alexander, Schmid Cordelia, Sun Chen
- Does Pre-training Induce Systematic Inference? How Masked Language Models Acquire Commonsense Knowledge Porada Ian, Sordoni Alessandro, Cheung Jackie Chi Kit
- Attentive Fine-tuning Of Transformers For Translation Of Low-resourced Languages @loresmt 2021 Puranik Karthik, Hande Adeep, Priyadharshini Ruba, Durairaj Thenmozhi, Sampath Anbukkarasi, Thamburaj Kingston Pal, Chakravarthi Bharathi Raja
- Robertuito: A Pre-trained Language Model For Social Media Text In Spanish Pérez Juan Manuel, Furman Damián A., Alemany Laura Alonso, Luque Franco
- Structural Guidance For Transformer Language Models Qian Peng, Naseem Tahira, Levy Roger, Astudillo Ramón Fernandez
- Winner Team Mia At Textvqa Challenge 2021: Vision-and-language Representation Learning With Pre-trained Sequence-to-sequence Model Qiao Yixuan, Chen Hao, Wang Jun, Chen Yihao, Ye Xianbin, Li Ziliang, Qi Xianbiao, Gao Peng, Xie Guotong
- VT-CLIP: Enhancing Vision-language Models With Visual-guided Texts Qiu Longtian, Zhang Renrui, Guo Ziyu, Zeng Ziyao, Guo Zilu, Li Yafeng, Zhang Guangnan
- Grid Search Hyperparameter Benchmarking Of BERT, ALBERT, And Longformer On Duorc Quijano Alex John, Nguyen Sam, Ordonez Juanita
- Teaching Autoregressive Language Models Complex Tasks By Demonstration Recchia Gabriel
- Webgpt: Browser-assisted Question-answering With Human Feedback Reiichiro Nakano, Jacob Hilton, Suchir Balaji, Jeff Wu, Long Ouyang, Christina Kim, Christopher Hesse, Shantanu Jain, Vineet Kosaraju, William Saunders, Xu Jiang, Karl Cobbe, Tyna Eloundou, Gretchen Krueger, Kevin Button, Matthew Knight, Benjamin Chess, John Schulman
- Tailor: Generating And Perturbing Text With Semantic Controls Ross Alexis, Wu Tongshuang, Peng Hao, Peters Matthew E., Gardner Matt
- Don't Sweep Your Learning Rate Under The Rug: A Closer Look At Cross-modal Transfer Of Pretrained Transformers Rothermel Danielle, Li Margaret, Rocktäschel Tim, Foerster Jakob
- Leveraging The Inductive Bias Of Large Language Models For Abstract Textual Reasoning Rytting Christopher Michael, Wingate David
- End-to-end Training Of Neural Retrievers For Open-domain Question Answering Sachan Devendra Singh, Patwary Mostofa, Shoeybi Mohammad, Kant Neel, Ping Wei, Hamilton William L, Catanzaro Bryan
- Efficient Domain Adaptation Of Language Models Via Adaptive Tokenization Sachidananda Vin, Kessler Jason S., Lai Yi-an
- Representation Learning For Conversational Data Using Discourse Mutual Information Maximization Santra Bishal, Roychowdhury Sumegh, Mandal Aishik, Gurram Vasu, Naik Atharva, Gupta Manish, Goyal Pawan
- Large Pre-trained Language Models Contain Human-like Biases Of What Is Right And Wrong To Do Schramowski Patrick, Turan Cigdem, Andersen Nico, Rothkopf Constantin A., Kersting Kristian
- The Power Of Prompt Tuning For Low-resource Semantic Parsing Schucher Nathan, Reddy Siva, De Vries Harm
- LAION-400M: Open Dataset Of Clip-filtered 400 Million Image-text Pairs Schuhmann Christoph, Vencu Richard, Beaumont Romain, Kaczmarczyk Robert, Mullis Clayton, Katta Aarush, Coombes Theo, Jitsev Jenia, Komatsuzaki Aran
- Simple Entity-centric Questions Challenge Dense Retrievers Sciavolino Christopher, Zhong Zexuan, Lee Jinhyuk, Chen Danqi
- Metaicl: Learning To Learn In Context Sewon Min, Mike Lewis, Luke Zettlemoyer, Hannaneh Hajishirzi
- Laviter: Learning Aligned Visual And Textual Representations Assisted By Image And Caption Generation Shaikh Mohammad Abuzar, Ji Zhanghexuan, Moukheiber Dana, Shen Yan, Srihari Sargur, Gao Mingchen
- CPT: A Pre-trained Unbalanced Transformer For Both Chinese Language Understanding And Generation Shao Yunfan, Geng Zhichao, Liu Yitao, Dai Junqi, Yan Hang, Yang Fei, Zhe Li, Bao Hujun, Qiu Xipeng
- Better Neural Machine Translation By Extracting Linguistic Information From BERT Shavarani Hassan S., Sarkar Anoop
- FPM: A Collection Of Large-scale Foundation Pre-trained Language Models Shen Dezhou
- How Much Can CLIP Benefit Vision-and-language Tasks? Shen Sheng, Li Liunian Harold, Tan Hao, Bansal Mohit, Rohrbach Anna, Chang Kai-wei, Yao Zhewei, Keutzer Kurt
- Societal Biases In Language Generation: Progress And Challenges Sheng Emily, Chang Kai-wei, Natarajan Premkumar, Peng Nanyun
- Layer-wise Pruning Of Transformer Attention Heads For Efficient Language Modeling Shim Kyuhong, Choi Iksoo, Sung Wonyong, Choi Jungwook
- Scaling Law For Recommendation Models: Towards General-purpose User Representations Shin Kyuyong, Kwak Hanock, Kim Su Young, Ramstrom Max Nihlen, Jeong Jisu, Ha Jung-woo, Kim Kyung-min
- Few-shot Semantic Parsing With Language Models Trained On Code Shin Richard, Van Durme Benjamin
- Exploring Low-cost Transformer Model Compression For Large-scale Commercial Reply Suggestions Shrivastava Vaishnavi, Gaonkar Radhika, Gupta Shashank, Jha Abhishek
- ERNIE 3.0 Titan: Exploring Larger-scale Knowledge Enhanced Pre-training For Language Understanding And Generation Shuohuan Wang, Yu Sun, Yang Xiang, Zhihua Wu, Siyu Ding, Weibao Gong, Shikun Feng, Junyuan Shang, Yanbin Zhao, Chao Pang, Jiaxiang Liu, Xuyi Chen, Yuxiang Lu, Weixin Liu, Xi Wang, Yangfan Bai, Qiuliang Chen, Li Zhao, Shiyong Li, Peng Sun, Dianhai Yu, Yanjun Ma, Hao Tian, Hua Wu, Tian Wu, Wei Zeng, Ge Li, Wen Gao, Haifeng Wang
- Pre-trained Language Models As Prior Knowledge For Playing Text-based Games Singh Ishika, Singh Gargi, Modi Ashutosh
- Primer: Searching For Efficient Transformers For Language Modeling So David R., Mańke Wojciech, Liu Hanxiao, Dai Zihang, Shazeer Noam, Le Quoc V.
- Bob: BERT Over BERT For Training Persona-based Dialogue Models From Limited Personalized Data Song Haoyu, Wang Yan, Zhang Kaiyan, Zhang Wei-nan, Liu Ting
- Multimodal Integration Of Human-like Attention In Visual Question Answering Sood Ekta, Kögel Fabian, Müller Philipp, Thomas Dominike, Bace Mihai, Bulling Andreas
- Txt: Crossmodal End-to-end Learning With Transformers Steitz Jan-martin O., Pfeiffer Jonas, Gurevych Iryna, Roth Stefan
- Corelm: Coreference-aware Language Model Fine-tuning Stylianou Nikolaos, Vlahavas Ioannis
- Ernie-tiny: A Progressive Distillation Framework For Pretrained Transformer Compression Su Weiyue, Chen Xuyi, Feng Shikun, Liu Jiaxiang, Liu Weixin, Sun Yu, Tian Hao, Wu Hua, Wang Haifeng
- Tacl: Improving BERT Pre-training With Token-aware Contrastive Learning Su Yixuan, Liu Fangyu, Meng Zaiqiao, Lan Tian, Shu Lei, Shareghi Ehsan, Collier Nigel
- Embodied BERT: A Transformer Model For Embodied, Language-guided Visual Task Completion Suglia Alessandro, Gao Qiaozi, Thomason Jesse, Thattai Govind, Sukhatme Gaurav
- Lightningdot: Pre-training Visual-semantic Embeddings For Real-time Image-text Retrieval Sun Siqi, Chen Yen-chun, Li Linjie, Wang Shuohang, Fang Yuwei, Liu Jingjing
- NSP-BERT: A Prompt-based Few-shot Learner Through An Original Pre-training Task--next Sentence Prediction Sun Yi, Zheng Yu, Hao Chao, Qiu Hangping
- ERNIE 3.0: Large-scale Knowledge Enhanced Pre-training For Language Understanding And Generation Sun Yu, Wang Shuohuan, Feng Shikun, Ding Siyu, Pang Chao, Shang Junyuan, Liu Jiaxiang, Chen Xuyi, Zhao Yanbin, Lu Yuxiang, Liu Weixin, Wu Zhihua, Gong Weibao, Liang Jianzhong, Shang Zhizhou, Sun Peng, Liu Wei, Ouyang Xuan, Yu Dianhai, Tian Hao, Wu Hua, Wang Haifeng
- Multilingual Translation Via Grafting Pre-trained Language Models Sun Zewei, Wang Mingxuan, Li Lei
- Data Processing Matters: Srph-konvergen Ai's Machine Translation System For WMT'21 Sutawika Lintang, Cruz Jan Christian Blaise
- Cross-task Generalization Via Natural Language Crowdsourcing Instructions Swaroop Mishra, Daniel Khashabi, Chitta Baral, Hannaneh Hajishirzi
- A Three Step Training Approach With Data Augmentation For Morphological Inflection Szolnok Gabor, Barta Botond, Lakatos Dorina, Acs Judit
- General-purpose Question-answering With Macaw Tafjord Oyvind, Clark Peter
- Diff-explainer: Differentiable Convex Optimization For Explainable Multi-hop Inference Thayaparan Mokanarangan, Valentino Marco, Ferreira Deborah, Rozanova Julia, Freitas André
- Towards Reinforcement Learning For Pivot-based Neural Machine Translation With Non-autoregressive Transformer Tokarchuk Evgeniia, Rosendahl Jan, Wang Weiyue, Petrushkov Pavel, Lancewicki Tomer, Khadivi Shahram, Ney Hermann
- Cheap And Good? Simple And Effective Data Augmentation For Low Resource Machine Reading Van Hoang, Yadav Vikas, Surdeanu Mihai
- Multitask Prompted Training Enables Zero-shot Task Generalization Victor Sanh, Albert Webson, Colin Raffel, Stephen H. Bach, Lintang Sutawika, Zaid Alyafeai, Antoine Chaffin, Arnaud Stiegler, Teven Le Scao, Arun Raja, Manan Dey, M Saiful Bari, Canwen Xu, Urmish Thakker, Shanya Sharma Sharma, Eliza Szczechla, Taewoon Kim, Gunjan Chhablani, Nihal Nayak, Debajyoti Datta, Jonathan Chang, Mike Tian-jian Jiang, Han Wang, Matteo Manica, Sheng Shen, Zheng Xin Yong, Harshit Pandey, Rachel Bawden, Thomas Wang, Trishala Neeraj, Jos Rozen, Abheesht Sharma, Andrea Santilli, Thibault Fevry, Jason Alan Fries, Ryan Teehan, Tali Bers, Stella Biderman, Leo Gao, Thomas Wolf, Alexander M. Rush
- Convfit: Conversational Fine-tuning Of Pretrained Language Models Vulić Ivan, Su Pei-hao, Coope Sam, Gerz Daniela, Budzianowski Paweł, Casanueva Iñigo, Mrkšić Nikola, Wen Tsung-hsien
- Bertnesia: Investigating The Capture And Forgetting Of Knowledge In BERT Wallat Jonas, Singh Jaspreet, Anand Avishek
- Selective Knowledge Distillation For Neural Machine Translation Wang Fusheng, Yan Jianhao, Meng Fandong, Zhou Jie
- Lightseq2: Accelerated Training For Transformer-based Models On Gpus Wang Xiaohui, Wei Yang, Xiong Ying, Huang Guyue, Qian Xian, Ding Yufei, Wang Mingxuan, Li Lei
- List: Lite Prompted Self-training Makes Parameter-efficient Few-shot Learners Wang Yaqing, Mukherjee Subhabrata, Liu Xiaodong, Gao Jing, Awadallah Ahmed Hassan, Gao Jianfeng
- Diversifying Dialog Generation Via Adaptive Label Smoothing Wang Yida, Zheng Yinhe, Jiang Yong, Huang Minlie
- Codet5: Identifier-aware Unified Pre-trained Encoder-decoder Models For Code Understanding And Generation Wang Yue, Wang Weishi, Joty Shafiq, Hoi Steven C. H.
- Distilled Dual-encoder Model For Vision-language Understanding Wang Zekun, Wang Wenhui, Zhu Haichao, Liu Ming, Qin Bing, Wei Furu
- Training Multilingual Pre-trained Language Model With Byte-level Subwords Wei Junqiu, Liu Qun, Guo Yinpeng, Jiang Xin
- Pangu-α: Large-scale Autoregressive Pretrained Chinese Language Models With Auto-parallel Computation Wei Zeng, Xiaozhe Ren, Teng Su, Hui Wang, Yi Liao, Zhiwei Wang, Xin Jiang, Zhenzhang Yang, Kaisheng Wang, Xiaoda Zhang, Chen Li, Ziyan Gong, Yifan Yao, Xinjing Huang, Jun Wang, Jianfeng Yu, Qi Guo, Yue Yu, Yan Zhang, Jin Wang, Hengtao Tao, Dasen Yan, Zexuan Yi, Fang Peng, Fangqing Jiang, Han Zhang, Lingfeng Deng, Yehong Zhang, Zhe Lin, Chao Zhang, Shaojie Zhang, Mingyue Guo, Shanzhi Gu, Gaojun Fan, Yaowei Wang, Xuefeng Jin, Qun Liu, Yonghong Tian
- A Multilingual Modeling Method For Span-extraction Reading Comprehension Wu Gaochen, Xu Bin, Chang Dejie, Liu Bangchang
- Improving Low-resource Reading Comprehension Via Cross-lingual Transposition Rethinking Wu Gaochen, Xu Bin, Qin Yuxin, Kong Fei, Liu Bangchang, Zhao Hongwen, Chang Dejie
- Domain-adaptive Pretraining Methods For Dialogue Understanding Wu Han, Xu Kun, Song Linfeng, Jin Lifeng, Zhang Haisong, Song Linqi
- Not All Attention Is All You Need Wu Hongqiu, Zhao Hai, Zhang Min
- Yuan 1.0: Large-scale Pre-trained Language Model In Zero-shot And Few-shot Learning Wu Shaohua, Zhao Xudong, Yu Tong, Zhang Rongguo, Shen Chong, Liu Hongli, Li Feng, Zhu Hong, Luo Jiangang, Xu Liang, Zhang Xuanwei
- On Explaining Your Explanations Of BERT: An Empirical Study With Sequence Classification Wu Zhengxuan, Ong Desmond C.
- Prefix-tuning: Optimizing Continuous Prompts For Generation Xiang Lisa Li, Percy Liang
- P-tuning V2: Prompt Tuning Can Be Comparable To Fine-tuning Universally Across Scales And Tasks Xiao Liu, Kaixuan Ji, Yicheng Fu, Weng Lam Tam, Zhengxiao Du, Zhilin Yang, Jie Tang
- Training Large-scale News Recommenders With Pretrained Language Models In The Loop Xiao Shitao, Liu Zheng, Shao Yingxia, Di Tao, Xie Xing
- An Explanation Of In-context Learning As Implicit Bayesian Inference Xie Sang Michael, Raghunathan Aditi, Liang Percy, Ma Tengyu
- Simple Local Attentions Remain Competitive For Long-context Tasks Xiong Wenhan, Oğuz Barlas, Gupta Anchit, Chen Xilun, Liskovich Diana, Levy Omer, Yih Wen-tau, Mehdad Yashar
- Moca: Incorporating Multi-stage Domain Pretraining And Cross-guided Multimodal Attention For Textbook Question Answering Xu Fangzhi, Lin Qika, Liu Jun, Zhang Lingling, Zhao Tianzhe, Chai Qi, Pan Yudai
- E2E-VLP: End-to-end Vision-language Pre-training Enhanced By Visual Learning Xu Haiyang, Yan Ming, Li Chenliang, Bi Bin, Huang Songfang, Xiao Wenming, Huang Fei
- VLM: Task-agnostic Video-language Model Pre-training For Video Understanding Xu Hu, Ghosh Gargi, Huang Po-yao, Arora Prahal, Aminzadeh Masoumeh, Feichtenhofer Christoph, Metze Florian, Zettlemoyer Luke
- Dissecting Generation Modes For Abstractive Summarization Models Via Ablation And Attribution Xu Jiacheng, Durrett Greg
- Attention-guided Generative Models For Extractive Question Answering Xu Peng, Liang Davis, Huang Zhiheng, Xiang Bing
- From Dense To Sparse: Contrastive Pruning For Better Pre-trained Language Model Compression Xu Runxin, Luo Fuli, Wang Chengyu, Chang Baobao, Huang Jun, Huang Songfang, Huang Fei
- K-PLUG: Knowledge-injected Pre-trained Language Model For Natural Language Understanding And Generation In E-commerce Xu Song, Li Haoran, Yuan Peng, Wang Yujia, Wu Youzheng, He Xiaodong, Liu Ying, Zhou Bowen
- Byt5: Towards A Token-free Future With Pre-trained Byte-to-byte Models Xue Linting, Barua Aditya, Constant Noah, Al-rfou Rami, Narang Sharan, Kale Mihir, Roberts Adam, Raffel Colin
- NT5?! Training T5 To Perform Numerical Reasoning Yang Peng-jian, Chen Ying Ting, Chen Yuechan, Cer Daniel
- Causal Attention For Vision-language Tasks Yang Xu, Zhang Hanwang, Qi Guojun, Cai Jianfei
- Learning To Generate Task-specific Adapters From Task Description Ye Qinyuan, Ren Xiang
- Gpt3mix: Leveraging Large-scale Language Models For Text Augmentation Yoo Kang Min, Park Dongju, Kang Jaewook, Lee Sang-woo, Park Woomyeong
- Turning Tables: Generating Examples From Semi-structured Tables For Endowing Language Models With Reasoning Skills Yoran Ori, Talmor Alon, Berant Jonathan
- Reconsidering The Past: Optimizing Hidden States In Language Models Yoshida Davis, Gimpel Kevin
- On The Interplay Between Fine-tuning And Composition In Transformers Yu Lang, Ettinger Allyson
- LV-BERT: Exploiting Layer Variety For BERT Yu Weihao, Jiang Zihang, Chen Fei, Hou Qibin, Feng Jiashi
- Language Bias In Visual Question Answering: A Survey And Taxonomy Yuan Desen
- Improving Sequential Recommendation Consistency With Self-supervised Imitation Yuan Xu, Chen Hongshen, Song Yonghao, Zhao Xiaofang, Ding Zhuoye, He Zhen, Long Bo
- Prune Once For All: Sparse Pre-trained Language Models Zafrir Ofir, Larey Ariel, Boudoukh Guy, Shen Haihao, Wasserblat Moshe
- Bitfit: Simple Parameter-efficient Fine-tuning For Transformer-based Masked Language-models Zaken Elad Ben, Ravfogel Shauli, Goldberg Yoav
- Topical Language Generation Using Transformers Zandie Rohola, Mahoor Mohammad H.
- Improving Stack Overflow Question Title Generation With Copying Enhanced Codebert Model And Bi-modal Information Zhang Fengji, Yu Xiao, Keung Jacky, Li Fuyang, Xie Zhiwen, Yang Zhen, Ma Caoyuan, Zhang Zhimin
- Ernie-vilg: Unified Generative Pre-training For Bidirectional Vision-language Generation Zhang Han, Yin Weichong, Fang Yewei, Li Lanxin, Duan Boqiang, Wu Zhihua, Sun Yu, Tian Hao, Wu Hua, Wang Haifeng
- DSGPT: Domain-specific Generative Pre-training Of Transformers For Text Generation In E-commerce Title And Review Summarization Zhang Xueying, Jiang Yunjiang, Shang Yue, Cheng Zhaomeng, Zhang Chi, Fan Xiaochuan, Xiao Yun, Long Bo
- Retgen: A Joint Framework For Retrieval And Grounded Text Generation Modeling Zhang Yizhe, Sun Siqi, Gao Xiang, Fang Yuwei, Brockett Chris, Galley Michel, Gao Jianfeng, Dolan Bill
- An Exploratory Study On Long Dialogue Summarization: What Works And What's Next Zhang Yusen, Ni Ansong, Yu Tao, Zhang Rui, Zhu Chenguang, Deb Budhaditya, Celikyilmaz Asli, Awadallah Ahmed Hassan, Radev Dragomir
- Learning To Rehearse In Long Sequence Memorization Zhang Zhu, Zhou Chang, Ma Jianxin, Lin Zhijie, Zhou Jingren, Yang Hongxia, Zhao Zhou
- Structural Pre-training For Dialogue Comprehension Zhang Zhuosheng, Zhao Hai
- Calibrate Before Use: Improving Few-shot Performance Of Language Models Zhao Tony Z., Wallace Eric, Feng Shi, Klein Dan, Singh Sameer
- CPM-2: Large-scale Cost-effective Pre-trained Language Models Zhengyan Zhang, Yuxian Gu, Xu Han, Shengqi Chen, Chaojun Xiao, Zhenbo Sun, Yuan Yao, Fanchao Qi, Jian Guan, Pei Ke, Yanzheng Cai, Guoyang Zeng, Zhixing Tan, Zhiyuan Liu, Minlie Huang, Wentao Han, Yang Liu, Xiaoyan Zhu, Maosong Sun
- Adapting Language Models For Zero-shot Learning By Meta-tuning On Dataset And Prompt Collections Zhong Ruiqi, Lee Kristy, Zhang Zheng, Klein Dan
- Self-guided Curriculum Learning For Neural Machine Translation Zhou Lei, Ding Liang, Duh Kevin, Watanabe Shinji, Sasano Ryohei, Takeda Koichi
- Improving Sequence-to-sequence Pre-training Via Sequence Span Rewriting Zhou Wangchunshu, Ge Tao, Xu Canwen, Xu Ke, Wei Furu
- LAFITE: Towards Language-free Training For Text-to-image Generation Zhou Yufan, Zhang Ruiyi, Chen Changyou, Li Chunyuan, Tensmeyer Chris, Yu Tong, Gu Jiuxiang, Xu Jinhui, Sun Tong
- Taming Sparsely Activated Transformer With Stochastic Experts Zuo Simiao, Liu Xiaodong, Jiao Jian, Kim Young Jin, Hassan Hany, Zhang Ruofei, Zhao Tuo, Gao Jianfeng
- Palm: Scaling Language Modeling With Pathways Aakanksha Chowdhery, Sharan Narang, Jacob Devlin, Maarten Bosma, Gaurav Mishra, Adam Roberts, Paul Barham, Hyung Won Chung, Charles Sutton, Sebastian Gehrmann, Parker Schuh, Kensen Shi, Sasha Tsvyashchenko, Joshua Maynez, Abhishek Rao, Parker Barnes, Yi Tay, Noam Shazeer, Vinodkumar Prabhakaran, Emily Reif, Nan Du, Ben Hutchinson, Reiner Pope, James Bradbury, Jacob Austin, Michael Isard, Guy Gur-ari, Pengcheng Yin, Toju Duke, Anselm Levskaya, Sanjay Ghemawat, Sunipa Dev, Henryk Michalewski, Xavier Garcia, Vedant Misra, Kevin Robinson, Liam Fedus, Denny Zhou, Daphne Ippolito, David Luan, Hyeontaek Lim, Barret Zoph, Alexander Spiridonov, Ryan Sepassi, David Dohan, Shivani Agrawal, Mark Omernick, Andrew M. Dai, Thanumalayan Sankaranarayana Pillai, Marie Pellat, Aitor Lewkowycz, Erica Moreira, Rewon Child, Oleksandr Polozov, Katherine Lee, Zongwei Zhou, Xuezhi Wang, Brennan Saeta, Mark Diaz, Orhan Firat, Michele Catasta, Jason Wei, Kathy Meier-hellstern, Douglas Eck, Jeff Dean, Slav Petrov, Noah Fiedel
- Gpt-3-driven Pedagogical Agents For Training Children's Curious Question-asking Skills Abdelghani Rania, Wang Yen-hsiang, Yuan Xingdi, Wang Tong, Lucas Pauline, Sauzéon Hélène, Oudeyer Pierre-yves
- Global Memory Transformer For Processing Long Documents Adel Arij Al
- Vl-interpret: An Interactive Visualization Tool For Interpreting Vision-language Transformers Aflalo Estelle, Du Meng, Tseng Shao-yen, Liu Yongfei, Wu Chenfei, Duan Nan, Lal Vasudev
- Few-shot Training Llms For Project-specific Code-summarization Ahmed Toufique, Devanbu Premkumar
- Multimodal Sequential Generative Models For Semi-supervised Language Instruction Following Akuzawa Kei, Iwasawa Yusuke, Matsuo Yutaka
- How Does The Pre-training Objective Affect What Large Language Models Learn About Linguistic Properties? Alajrami Ahmed, Aletras Nikolaos
- Input-tuning: Adapting Unfamiliar Inputs To Frozen Pretrained Models An Shengnan, Li Yifei, Lin Zeqi, Liu Qian, Chen Bei, Fu Qiang, Chen Weizhu, Zheng Nanning, Lou Jian-guang
- Tempo: Accelerating Transformer-based Model Training Through Memory Footprint Reduction Andoorveedu Muralidhar, Zhu Zhanda, Zheng Bojian, Pekhimenko Gennady
- A Comparative Study On Language Models For Task-oriented Dialogue Systems Andreas Vinsen Marselino, Winata Genta Indra, Purwarianti Ayu
- Compositional Semantic Parsing With Large Language Models Andrew Drozdov, Nathanael Schärli, Ekin Akyürek, Nathan Scales, Xinying Song, Xinyun Chen, Olivier Bousquet, Denny Zhou
- Selection-inference: Exploiting Large Language Models For Interpretable Logical Reasoning Antonia Creswell, Murray Shanahan, Irina Higgins
- GLM-130B: An Open Bilingual Pre-trained Model Aohan Zeng, Xiao Liu, Zhengxiao Du, Zihan Wang, Hanyu Lai, Ming Ding, Zhuoyi Yang, Yifan Xu, Wendi Zheng, Xiao Xia, Weng Lam Tam, Zixuan Ma, Yufei Xue, Jidong Zhai, Wenguang Chen, Peng Zhang, Yuxiao Dong, Jie Tang
- ATTEMPT: Parameter-efficient Multi-task Tuning Via Attentional Mixtures Of Soft Prompts Asai Akari, Salehi Mohammadreza, Peters Matthew E., Hajishirzi Hannaneh
- Small Batch Sizes Improve Training Of Low-resource Neural MT Atrio Àlex R., Popescu-belis Andrei
- Bootstrapping Multilingual Semantic Parsers Using Large Language Models Awasthi Abhijeet, Gupta Nitish, Samanta Bidisha, Dave Shachi, Sarawagi Sunita, Talukdar Partha
- Parameter-efficient Finetuning Of Transformers For Source Code Ayupov Shamil, Chirkova Nadezhda
- Improving Short Text Classification With Augmented Data Using GPT-3 Balkus Salvador, Yan Donghui
- Enabling Classifiers To Make Judgements Explicitly Aligned With Human Values Bang Yejin, Yu Tiezheng, Madotto Andrea, Lin Zhaojiang, Diab Mona, Fung Pascale
- Vl-beit: Generative Vision-language Pretraining Bao Hangbo, Wang Wenhui, Dong Li, Wei Furu
- P³LM: Probabilistically Permuted Prophet Language Modeling For Generative Pre-training Bao Junwei, Wang Yifan, Ying Jiangyong, Gong Yeyun, Zhao Jing, Wu Youzheng, He Xiaodong
- Latent-glat: Glancing At Latent Variables For Parallel Text Generation Bao Yu, Zhou Hao, Huang Shujian, Wang Dongqi, Qian Lihua, Dai Xinyu, Chen Jiajun, Li Lei
- Evidence > Intuition: Transferability Estimation For Encoder Selection Bassignana Elisa, Müller-eberstein Max, Zhang Mike, Plank Barbara
- Leveraging Pre-trained Language Models For Conversational Information Seeking From Text Bellan Patrizio, Dragoni Mauro, Ghidini Chiara
- Bygpt5: End-to-end Style-conditioned Poetry Generation With Token-free Language Models Belouadi Jonas, Eger Steffen
- Logical Reasoning For Task Oriented Dialogue Systems Beygi Sajjad, Fazel-zarandi Maryam, Cervone Alessandra, Krishnan Prakash, Jonnalagadda Siddhartha Reddy
- Effective Cross-task Transfer Learning For Explainable Natural Language Inference With T5 Bigoulaeva Irina, Sachdeva Rachneet, Madabushi Harish Tayyar, Villavicencio Aline, Gurevych Iryna
- Prompting Language Models For Linguistic Structure Blevins Terra, Gonen Hila, Zettlemoyer Luke
- Inpars: Data Augmentation For Information Retrieval Using Large Language Models Bonifacio Luiz, Abonizio Hugo, Fadaee Marzieh, Nogueira Rodrigo
- Looking For A Handsome Carpenter! Debiasing GPT-3 Job Advertisements Borchers Conrad, Gala Dalia Sara, Gilburt Benjamin, Oravkin Eduard, Bounsi Wilfried, Asano Yuki M., Kirk Hannah Rose
- Petals: Collaborative Inference And Fine-tuning Of Large Models Borzunov Alexander, Baranchuk Dmitry, Dettmers Tim, Ryabinin Max, Belkada Younes, Chumachenko Artem, Samygin Pavel, Raffel Colin
- Understanding Performance Of Long-document Ranking Models Through Comprehensive Evaluation And Leaderboarding Boytsov Leonid, Akinpelu David, Lin Tianyi, Gao Fangwei, Zhao Yutian, Huang Jeffrey, Katyal Nipun, Nyberg Eric
- Evaluating The Susceptibility Of Pre-trained Language Models Via Handcrafted Adversarial Examples Branch Hezekiah J., Cefalu Jonathan Rodriguez, Mchugh Jeremy, Hujer Leyla, Bahl Aditya, Iglesias Daniel Del Castillo, Heichman Ron, Darwishi Ramesh
- Large Language Models Can Implement Policy Iteration Brooks Ethan, Walls Logan, Lewis Richard L., Singh Satinder
- Understanding Attention For Vision-and-language Tasks Cao Feiqi, Han Soyeon Caren, Long Siqu, Xu Changwei, Poon Josiah
- A Model-agnostic Data Manipulation Method For Persona-based Dialogue Generation Cao Yu, Bi Wei, Fang Meng, Shi Shuming, Tao Dacheng
- Pre-trained Neural Language Models For Automatic Mobile App User Feedback Answer Generation Cao Yue, Fard Fatemeh H.
- Robust Preference Learning For Storytelling Via Contrastive Reinforcement Learning Castricato Louis, Havrilla Alexander, Matiana Shahbuland, Pieler Michael, Ye Anbang, Yang Ian, Frazier Spencer, Riedl Mark
- Ernie-code: Beyond English-centric Cross-lingual Pretraining For Programming Languages Chai Yekun, Wang Shuohuan, Pang Chao, Sun Yu, Tian Hao, Wu Hua
- Speechprompt: An Exploration Of Prompt Tuning On Generative Spoken Language Model For Speech Processing Tasks Chang Kai-wei, Tseng Wei-cheng, Li Shang-wen, Lee Hung-yi
- Data Curation Alone Can Stabilize In-context Learning Chang Ting-yun, Jia Robin
- Analyzing Semantic Faithfulness Of Language Models Via Input Intervention On Question Answering Chaturvedi Akshay, Bhar Swarnadeep, Saha Soumadeep, Garain Utpal, Asher Nicholas
- Litevl: Efficient Video-language Learning With Enhanced Spatial-temporal Modeling Chen Dongsheng, Tao Chaofan, Hou Lu, Shang Lifeng, Jiang Xin, Liu Qun
- Xdoc: Unified Pre-training For Cross-format Document Understanding Chen Jingye, Lv Tengchao, Cui Lei, Zhang Cha, Wei Furu
- Leveraging Natural Supervision For Language Representation Learning And Generation Chen Mingda
- Improving In-context Few-shot Learning Via Self-supervised Training Chen Mingda, Du Jingfei, Pasunuru Ramakanth, Mihaylov Todor, Iyer Srini, Stoyanov Veselin, Kozareva Zornitsa
- Pali: A Jointly-scaled Multilingual Language-image Model Chen Xi, Wang Xiao, Changpinyo Soravit, Piergiovanni Aj, Padlewski Piotr, Salz Daniel, Goodman Sebastian, Grycner Adam, Mustafa Basil, Beyer Lucas, Kolesnikov Alexander, Puigcerver Joan, Ding Nan, Rong Keran, Akbari Hassan, Mishra Gaurav, Xue Linting, Thapliyal Ashish, Bradbury James, Kuo Weicheng, Seyedhosseini Mojtaba, Jia Chao, Ayan Burcu Karagol, Riquelme Carlos, Steiner Andreas, Angelova Anelia, Zhai Xiaohua, Houlsby Neil, Soricut Radu
- Transformers Go For The Lols: Generating (humourous) Titles From Scientific Abstracts End-to-end Chen Yanran, Eger Steffen
- Empowering Parameter-efficient Transfer Learning By Recognizing The Kernel Structure In Self-attention Chen Yifan, Hazarika Devamanyu, Namazifar Mahdi, Liu Yang, Jin Di, Hakkani-tur Dilek
- Testing Occupational Gender Bias In Language Models: Towards Robust Measurement And Zero-shot Debiasing Chen Yuen, Raghuram Vethavikashini Chithrra, Mattern Justus, Sachan Mrinmaya, Mihalcea Rada, Schölkopf Bernhard, Jin Zhijing
- Adaprompt: Adaptive Model Training For Prompt-based NLP Chen Yulong, Liu Yang, Dong Li, Wang Shuohang, Zhu Chenguang, Zeng Michael, Zhang Yue
- DFM: Dialogue Foundation Model For Universal Large-scale Dialogue-oriented Task Learning Chen Zhi, Bao Jijia, Chen Lu, Liu Yuncong, Ma Da, Chen Bei, Wu Mengyue, Zhu Su, Dong Xin, Ge Fujiang, Miao Qingliang, Lou Jian-guang, Yu Kai
- Altclip: Altering The Language Encoder In CLIP For Extended Language Capabilities Chen Zhongzhi, Liu Guang, Zhang Bo-wen, Ye Fulong, Yang Qinghong, Wu Ledell
- Is Multiwoz A Solved Task? An Interactive TOD Evaluation Framework With User Simulator Cheng Qinyuan, Li Linyang, Quan Guofeng, Gao Feng, Mou Xiaofeng, Qiu Xipeng
- Neural Machine Translation With Contrastive Translation Memories Cheng Xin, Gao Shen, Liu Lemao, Zhao Dongyan, Yan Rui
- Vault: Augmenting The Vision-and-language Transformer For Sentiment Classification On Social Media Chochlakis Georgios, Srinivasan Tejas, Thomason Jesse, Narayanan Shrikanth
- Language-family Adapters For Low-resource Multilingual Neural Machine Translation Chronopoulou Alexandra, Stojanovski Dario, Fraser Alexander
- Meta-learning Fast Weight Language Models Clark Kevin, Guu Kelvin, Chang Ming-wei, Pasupat Panupong, Hinton Geoffrey, Norouzi Mohammad
- LSG Attention: Extrapolation Of Pretrained Transformers To Long Sequences Condevaux Charles, Harispe Sébastien
- Continual Pre-training Mitigates Forgetting In Language And Vision Cossu Andrea, Tuytelaars Tinne, Carta Antonio, Passaro Lucia, Lomonaco Vincenzo, Bacciu Davide
- VQGAN-CLIP: Open Domain Image Generation And Editing With Natural Language Guidance Crowson Katherine, Biderman Stella, Kornis Daniel, Stander Dashiell, Hallahan Eric, Castricato Louis, Raff Edward
- M6-rec: Generative Pretrained Language Models Are Open-ended Recommender Systems Cui Zeyu, Ma Jianxin, Zhou Chang, Zhou Jingren, Yang Hongxia
- The Future Is Different: Large Pre-trained Language Models Fail In Prediction Tasks Cvejoski Kostadin, Sánchez Ramsés J., Ojeda César
- Neural Knowledge Bank For Pretrained Transformers Dai Damai, Jiang Wenbin, Dong Qingxiu, Lyu Yajuan, She Qiaoqiao, Sui Zhifang
- Why Can GPT Learn In-context? Language Models Implicitly Perform Gradient Descent As Meta-optimizers Dai Damai, Sun Yutao, Dong Li, Hao Yaru, Ma Shuming, Sui Zhifang, Wei Furu
- Enabling Multimodal Generation On CLIP Via Vision-language Knowledge Distillation Dai Wenliang, Hou Lu, Shang Lifeng, Jiang Xin, Liu Qun, Fung Pascale
- Promptagator: Few-shot Dense Retrieval From 8 Examples Dai Zhuyun, Zhao Vincent Y., Ma Ji, Luan Yi, Ni Jianmo, Lu Jing, Bakalov Anton, Guu Kelvin, Hall Keith B., Chang Ming-wei
- Understanding BLOOM: An Empirical Study On Diverse NLP Tasks Dakle Parag Pravin, Rallabandi Saikrishna, Raghavan Preethi
- Rlprompt: Optimizing Discrete Text Prompts With Reinforcement Learning Deng Mingkai, Wang Jianyu, Hsieh Cheng-ping, Wang Yihan, Guo Han, Shu Tianmin, Song Meng, Xing Eric P., Hu Zhiting
- Least-to-most Prompting Enables Complex Reasoning In Large Language Models Denny Zhou, Nathanael Schärli, Le Hou, Jason Wei, Nathan Scales, Xuezhi Wang, Dale Schuurmans, Claire Cui, Olivier Bousquet, Quoc Le, Ed Chi
- Less Is More: Linear Layers On CLIP Features As Powerful Vizwiz Model Deuser Fabian, Habel Konrad, Rösch Philipp J., Oswald Norbert
- Pre-training Transformer Models With Sentence-level Objectives For Answer Sentence Selection Di Liello Luca, Garg Siddhant, Soldaini Luca, Moschitti Alessandro
- Black-box Prompt Learning For Pre-trained Language Models Diao Shizhe, Huang Zhichao, Xu Ruijia, Li Xuechun, Lin Yong, Zhou Xiao, Zhang Tong
- Write And Paint: Generative Vision-language Models Are Unified Modal Learners Diao Shizhe, Zhou Wangchunshu, Zhang Xinsong, Wang Jiawei
- Cogview2: Faster And Better Text-to-image Generation Via Hierarchical Transformers Ding Ming, Zheng Wendi, Hong Wenyi, Tang Jie
- Parallel Attention Forcing For Machine Translation Dou Qingyun, Gales Mark
- Coarse-to-fine Vision-language Pre-training With Fusion In The Backbone Dou Zi-yi, Kamath Aishwarya, Gan Zhe, Zhang Pengchuan, Wang Jianfeng, Li Linjie, Liu Zicheng, Liu Ce, Lecun Yann, Peng Nanyun, Gao Jianfeng, Wang Lijuan
- A Survey Of Vision-language Pre-trained Models Du Yifan, Liu Zikang, Li Junyi, Zhao Wayne Xin
- Star: Bootstrapping Reasoning With Reasoning Eric Zelikman, Yuhuai Wu, Jesse Mu, Noah D. Goodman
- Codegen: An Open Large Language Model For Code With Multi-turn Program Synthesis Erik Nijkamp, Bo Pang, Hiroaki Hayashi, Lifu Tu, Huan Wang, Yingbo Zhou, Silvio Savarese, Caiming Xiong
- CERES: Pretraining Of Graph-conditioned Transformer For Semi-structured Session Data Feng Rui, Luo Chen, Yin Qingyu, Yin Bing, Zhao Tuo, Zhang Chao
- Learn To Remember: Transformer With Recurrent Memory For Document-level Machine Translation Feng Yukun, Li Feng, Song Ziang, Zheng Boyuan, Koehn Philipp
- GPTQ: Accurate Post-training Quantization For Generative Pre-trained Transformers Frantar Elias, Ashkboos Saleh, Hoefler Torsten, Alistarh Dan
- Adapterbias: Parameter-efficient Token-dependent Representation Shift For Adapters In NLP Tasks Fu Chin-lun, Chen Zih-ching, Lee Yun-ru, Lee Hung-yi
- Hungry Hungry Hippos: Towards Language Modeling With State Space Models Fu Daniel Y., Dao Tri, Saab Khaled K., Thomas Armin W., Rudra Atri, Ré Christopher
- An Empirical Study Of End-to-end Video-language Transformers With Masked Visual Modeling Fu Tsu-jui, Li Linjie, Gan Zhe, Lin Kevin, Wang William Yang, Wang Lijuan, Liu Zicheng
- MIGA: A Unified Multi-task Generation Framework For Conversational Text-to-sql Fu Yingwen, Ou Wenjie, Yu Zhou, Lin Yue
- WAVPROMPT: Towards Few-shot Spoken Language Understanding With Frozen Language Models Gao Heting, Ni Junrui, Qian Kaizhi, Zhang Yang, Chang Shiyu, Hasegawa-johnson Mark
- Visual Prompt Tuning For Test-time Domain Adaptation Gao Yunhe, Shi Xingjian, Zhu Yi, Wang Hao, Tang Zhiqiang, Zhou Xiong, Li Mu, Metaxas Dimitris N.
- Cramming: Training A Language Model On A Single GPU In One Day Geiping Jonas, Goldstein Tom
- Microbert: Effective Training Of Low-resource Monolingual Berts Through Parameter Reduction And Multitask Learning Gessler Luke, Zeldes Amir
- Generating Coherent Narratives By Learning Dynamic And Discrete Entity States With A Contrastive Framework Guan Jian, Yang Zhenyu, Zhang Rongsheng, Hu Zhipeng, Huang Minlie
- Transkimmer: Transformer Learns To Layer-wise Skim Guan Yue, Li Zhengyi, Leng Jingwen, Lin Zhouhan, Guo Minyi
- Looking For A Needle In A Haystack: A Comprehensive Study Of Hallucinations In Neural Machine Translation Guerreiro Nuno M., Voita Elena, Martins André F. T.
- Unixcoder: Unified Cross-modal Pre-training For Code Representation Guo Daya, Lu Shuai, Duan Nan, Wang Yanlin, Zhou Ming, Yin Jian
- Visually-augmented Pretrained Language Models For NLP Tasks Without Images Guo Hangyu, Zhou Kun, Zhao Wayne Xin, Zhang Qinyu, Wen Ji-rong
- MALM: Mixing Augmented Language Modeling For Zero-shot Machine Translation Gupta Kshitij
- Understanding HTML With Large Language Models Gur Izzeddin, Nachum Ofir, Miao Yingjie, Safdari Mustafa, Huang Austin, Chowdhery Aakanksha, Narang Sharan, Fiedel Noah, Faust Aleksandra
- Thinking About GPT-3 In-context Learning For Biomedical IE? Think Again Gutiérrez Bernal Jiménez, Mcneal Nikolas, Washington Clay, Chen You, Li Lang, Sun Huan, Su Yu
- LUNA: Language Understanding With Number Augmentations On Transformers Via Number Plugins And Pre-training Han Hongwei, Xu Jialiang, Zhou Mengyu, Shao Yijia, Han Shi, Zhang Dongmei
- FOLIO: Natural Language Reasoning With First-order Logic Han Simeng, Schoelkopf Hailey, Zhao Yilun, Qi Zhenting, Riddell Martin, Zhou Wenfei, Coady James, Peng David, Qiao Yujie, Benson Luke, Sun Lucy, Wardle-solano Alex, Szabo Hannah, Zubova Ekaterina, Burtell Matthew, Fan Jonathan, Liu Yixin, Wong Brian, Sailor Malcolm, Ni Ansong, Nan Linyong, Kasai Jungo, Yu Tao, Zhang Rui, Fabbri Alexander R., Kryscinski Wojciech, Yavuz Semih, Liu Ye, Lin Xi Victoria, Joty Shafiq, Zhou Yingbo, Xiong Caiming, Ying Rex, Cohan Arman, Radev Dragomir
- ORCA: Interpreting Prompted Language Models Via Locating Supporting Data Evidence In The Ocean Of Pretraining Data Han Xiaochuang, Tsvetkov Yulia
- Learnable Model Augmentation Self-supervised Learning For Sequential Recommendation Hao Yongjing, Zhao Pengpeng, Xian Xuefeng, Liu Guanfeng, Wang Deqing, Zhao Lei, Liu Yanchi, Sheng Victor S.
- Supervised Visual Attention For Simultaneous Multimodal Machine Translation Haralampieva Veneta, Caglayan Ozan, Specia Lucia
- Aging With GRACE: Lifelong Model Editing With Discrete Key-value Adaptors Hartvigsen Thomas, Sankaranarayanan Swami, Palangi Hamid, Kim Yoon, Ghassemi Marzyeh
- Rethinking With Retrieval: Faithful Large Language Model Inference He Hangfeng, Zhang Hongming, Roth Dan
- Z-code++: A Pre-trained Language Model Optimized For Abstractive Summarization He Pengcheng, Peng Baolin, Lu Liyang, Wang Song, Mei Jie, Liu Yang, Xu Ruochen, Awadalla Hany Hassan, Shi Yu, Zhu Chenguang, Xiong Wayne, Zeng Michael, Gao Jianfeng, Huang Xuedong
- VLMAE: Vision-language Masked Autoencoder He Sunan, Guo Taian, Dai Tao, Qiao Ruizhi, Wu Chen, Shu Xiujun, Ren Bo
- SPACE-3: Unified Dialog Model Pre-training For Task-oriented Dialog Understanding And Generation He Wanwei, Dai Yinpei, Yang Min, Sun Jian, Huang Fei, Si Luo, Li Yongbin
- Bundle MCR: Towards Conversational Bundle Recommendation He Zhankui, Zhao Handong, Yu Tong, Kim Sungchul, Du Fan, Mcauley Julian
- Bridging The Data Gap Between Training And Inference For Unsupervised Neural Machine Translation He Zhiwei, Wang Xing, Wang Rui, Shi Shuming, Tu Zhaopeng
- Robust Dialogue State Tracking With Weak Supervision And Sparse Data Heck Michael, Lubis Nurul, Van Niekerk Carel, Feng Shutong, Geishauser Christian, Lin Hsien-chin, Gašić Milica
- Compressing Pre-trained Transformers Via Low-bit Nxm Sparsity For Natural Language Understanding Holmes Connor, Zhang Minjia, He Yuxiong, Wu Bo
- Cogvideo: Large-scale Pretraining For Text-to-video Generation Via Transformers Hong Wenyi, Ding Ming, Zheng Wendi, Liu Xinghan, Tang Jie
- TRUE: Re-evaluating Factual Consistency Evaluation Honovich Or, Aharoni Roee, Herzig Jonathan, Taitelbaum Hagai, Kukliansy Doron, Cohen Vered, Scialom Thomas, Szpektor Idan, Hassidim Avinatan, Matias Yossi
- Syntax-guided Localized Self-attention By Constituency Syntactic Distance Hou Shengyuan, Kai Jushi, Xue Haotian, Zhu Bingyu, Yuan Bo, Huang Longtao, Wang Xinbing, Lin Zhouhan
- XDBERT: Distilling Visual Information To BERT From Cross-modal Systems To Improve Language Understanding Hsu Chan-jan, Lee Hung-yi, Tsao Yu
- Empirical Evaluation Of Post-training Quantization Methods For Language Tasks Hu Ting, Meinel Christoph, Yang Haojin
- REVEAL: Retrieval-augmented Visual-language Pre-training With Multi-source Multimodal Knowledge Memory Hu Ziniu, Iscen Ahmet, Sun Chen, Wang Zirui, Chang Kai-wei, Sun Yizhou, Schmid Cordelia, Ross David A., Fathi Alireza
- Transformer Quality In Linear Time Hua Weizhe, Dai Zihang, Liu Hanxiao, Le Quoc V.
- Autoregressive Entity Generation For End-to-end Task-oriented Dialog Huang Guanhuan, Quan Xiaojun, Wang Qifan
- Personalized Dialogue Generation With Persona-adaptive Attention Huang Qiushi, Zhang Yu, Ko Tom, Liu Xubo, Wu Bo, Wang Wenwu, Tang Lilian
- Detecting Unintended Memorization In Language-model-fused ASR Huang W. Ronny, Chien Steve, Thakkar Om, Mathews Rajiv
- Inner Monologue: Embodied Reasoning Through Planning With Language Models Huang Wenlong, Xia Fei, Xiao Ted, Chan Harris, Liang Jacky, Florence Pete, Zeng Andy, Tompson Jonathan, Mordatch Igor, Chebotar Yevgen, Sermanet Pierre, Brown Noah, Jackson Tomas, Luu Linda, Levine Sergey, Hausman Karol, Ichter Brian
- FPT: Improving Prompt Tuning Efficiency Via Progressive Training Huang Yufei, Qin Yujia, Wang Huadong, Yin Yichun, Sun Maosong, Liu Zhiyuan, Liu Qun
- Layoutlmv3: Pre-training For Document AI With Unified Text And Image Masking Huang Yupan, Lv Tengchao, Cui Lei, Lu Yutong, Wei Furu
- MCP: Self-supervised Pre-training For Personalized Chatbots With Multi-level Contrastive Sampling Huang Zhaoheng, Dou Zhicheng, Zhu Yutao, Ma Zhengyi
- Welm: A Well-read Pre-trained Language Model For Chinese Hui Su, Xiao Zhou, Houjin Yu, Xiaoyu Shen, Yuwen Chen, Zilin Zhu, Yang Yu, Jie Zhou
- Efficient Long-text Understanding With Short-text Models Ivgi Maor, Shaham Uri, Berant Jonathan
- HINT: Hypernetwork Instruction Tuning For Efficient Zero- & Few-shot Generalisation Ivison Hamish, Bhagia Akshita, Wang Yizhong, Hajishirzi Hannaneh, Peters Matthew
- Litetransformersearch: Training-free Neural Architecture Search For Efficient Language Models Javaheripi Mojan, De Rosa Gustavo H., Mukherjee Subhabrata, Shah Shital, Religa Tomasz L., Mendes Caio C. T., Bubeck Sebastien, Koushanfar Farinaz, Dey Debadeepta
- DP-KB: Data Programming With Knowledge Bases Improves Transformer Fine Tuning For Answer Sentence Selection Jedema Nic, Vu Thuy, Gupta Manish, Moschitti Alessandro
- Visual Prompt Tuning Jia Menglin, Tang Luming, Chen Bor-chun, Cardie Claire, Belongie Serge, Hariharan Bharath, Lim Ser-nam
- Contrastive Learning For Prompt-based Few-shot Language Learners Jian Yiren, Gao Chongyang, Vosoughi Soroush
- Pseudo-q: Generating Pseudo Language Queries For Visual Grounding Jiang Haojun, Lin Yuanze, Han Dongchen, Song Shiji, Huang Gao
- A Simple Contrastive Learning Objective For Alleviating Neural Text Degeneration Jiang Shaojie, Zhang Ruqing, Vakulenko Svitlana, De Rijke Maarten
- From Images To Textual Prompts: Zero-shot VQA With Frozen Large Language Models Jiaxian Guo, Junnan Li, Dongxu Li, Anthony Meng Huat Tiong, Boyang Li, Dacheng Tao, Steven C. H. Hoi
- Large Language Models Can Self-improve Jiaxin Huang, Shixiang Shane Gu, Le Hou, Yuexin Wu, Xuezhi Wang, Hongkun Yu, Jiawei Han
- Probing Script Knowledge From Pre-trained Models Jin Zijian, Zhang Xingyu, Yu Mo, Huang Lifu
- Training Compute-optimal Large Language Models Jordan Hoffmann, Sebastian Borgeaud, Arthur Mensch, Elena Buchatskaya, Trevor Cai, Eliza Rutherford, Diego De Las Casas, Lisa Anne Hendricks, Johannes Welbl, Aidan Clark, Tom Hennigan, Eric Noland, Katie Millican, George Van Den Driessche, Bogdan Damoc, Aurelia Guy, Simon Osindero, Karen Simonyan, Erich Elsen, Jack W. Rae, Oriol Vinyals, Laurent Sifre
- Self-prompting Large Language Models For Zero-shot Open-domain QA Junlong Li, Jinyuan Wang, Zhuosheng Zhang, Hai Zhao
- Latency Adjustable Transformer Encoder For Language Understanding Kachuee Sajjad, Sharifkhani Mohammad
- A New Path: Scaling Vision-and-language Navigation With Synthetic Instructions And Imitation Learning Kamath Aishwarya, Anderson Peter, Wang Su, Koh Jing Yu, Ku Alexander, Waters Austin, Yang Yinfei, Baldridge Jason, Parekh Zarana
- Gpt-neo For Commonsense Reasoning -- A Theoretical And Practical Lens Kashyap Rohan, Kashyap Vivek, P. Narendra C.
- What Makes Data-to-text Generation Hard For Pretrained Language Models? Keymanesh Moniba, Benton Adrian, Dredze Mark
- Self-generated In-context Learning: Leveraging Auto-regressive Language Models As A Demonstration Generator Kim Hyuhng Joon, Cho Hyunsoo, Kim Junyeob, Kim Taeuk, Yoo Kang Min, Lee Sang-goo
- Who Says Elephants Can't Run: Bringing Large Scale Moe Models Into Cloud Scale Production Kim Young Jin, Henry Rawn, Fahim Raffy, Awadalla Hany Hassan
- The Stack: 3 TB Of Permissively Licensed Source Code Kocetkov Denis, Li Raymond, Allal Loubna Ben, Li Jia, Mou Chenghao, Ferrandis Carlos Muñoz, Jernite Yacine, Mitchell Margaret, Hughes Sean, Wolf Thomas, Bahdanau Dzmitry, Von Werra Leandro, De Vries Harm
- Iterative Vision-and-language Navigation Krantz Jacob, Banerjee Shurjo, Zhu Wang, Corso Jason, Anderson Peter, Lee Stefan, Thomason Jesse
- Downstream Datasets Make Surprisingly Good Pretraining Corpora Krishna Kundan, Garg Saurabh, Bigham Jeffrey P., Lipton Zachary C.
- Mucot: Multilingual Contrastive Training For Question-answering In Low-resource Languages Kumar Gokul Karthik, Gehlot Abhishek Singh, Mullappilly Sahal Shaji, Nandakumar Karthik
- The Optimal BERT Surgeon: Scalable And Accurate Second-order Pruning For Large Language Models Kurtic Eldar, Campos Daniel, Nguyen Tuan, Frantar Elias, Kurtz Mark, Fineran Benjamin, Goin Michael, Alistarh Dan
- Co-training Improves Prompt-based Learning For Large Language Models Lang Hunter, Agrawal Monica, Kim Yoon, Sontag David
- Evaluation Of Synthetic Datasets For Conversational Recommender Systems Lara Harsh, Tiwari Manoj
- Does BERT Really Agree? Fine-grained Analysis Of Lexical Dependence On A Syntactic Task Lasri Karim, Lenci Alessandro, Poibeau Thierry
- Internet-augmented Language Models Through Few-shot Prompting For Open-domain Question Answering Lazaridou Angeliki, Gribovskaya Elena, Stokowiec Wojciech, Grigorev Nikolai
- Guiding Visual Question Answering With Attention Priors Le Thao Minh, Le Vuong, Gupta Sunil, Venkatesh Svetha, Tran Truyen
- Hard Gate Knowledge Distillation -- Leverage Calibration For Robust And Reliable Language Model Lee Dongkyu, Tian Zhiliang, Zhao Yingxiu, Cheung Ka Chun, Zhang Nevin L.
- Dialogcc: An Automated Pipeline For Creating High-quality Multi-modal Dialogue Dataset Lee Young-jun, Ko Byungsoo, Kim Han-gyu, Hyeon Jonghwan, Choi Ho-jin
- Evolution Through Large Models Lehman Joel, Gordon Jonathan, Jain Shawn, Ndousse Kamal, Yeh Cathy, Stanley Kenneth O.
- Simple Recurrence Improves Masked Language Models Lei Tao, Tian Ran, Bastings Jasmijn, Parikh Ankur P.
- Reducing Retraining By Recycling Parameter-efficient Prompts Lester Brian, Yurtsever Joshua, Shakeri Siamak, Constant Noah
- Fast Inference From Transformers Via Speculative Decoding Leviathan Yaniv, Kalman Matan, Matias Yossi
- GPT-D: Inducing Dementia-related Linguistic Anomalies By Deliberate Degradation Of Artificial Neural Language Models Li Changye, Knopman David, Xu Weizhe, Cohen Trevor, Pakhomov Serguei
- Large Language Models With Controllable Working Memory Li Daliang, Rawat Ankit Singh, Zaheer Manzil, Wang Xin, Lukasik Michal, Veit Andreas, Yu Felix, Kumar Sanjiv
- Summer: Wechat Neural Machine Translation Systems For The WMT22 Biomedical Translation Task Li Ernan, Meng Fandong, Zhou Jie
- Fine-grained Semantically Aligned Vision-language Pre-training Li Juncheng, He Xin, Wei Longhui, Qian Long, Zhu Linchao, Xie Lingxi, Zhuang Yueting, Tian Qi, Tang Siliang
- Learning To Transfer Prompts For Text Generation Li Junyi, Tang Tianyi, Nie Jian-yun, Wen Ji-rong, Zhao Wayne Xin
- ELMER: A Non-autoregressive Pre-trained Language Model For Efficient And Effective Text Generation Li Junyi, Tang Tianyi, Zhao Wayne Xin, Nie Jian-yun, Wen Ji-rong
- Personalized Prompt Learning For Explainable Recommendation Li Lei, Zhang Yongfeng, Chen Li
- LAVENDER: Unifying Video-language Understanding As Masked Language Modeling Li Linjie, Gan Zhe, Lin Kevin, Lin Chung-ching, Liu Zicheng, Liu Ce, Wang Lijuan
- Modeling Coreference Relations In Visual Dialog Li Mingxiao, Moens Marie-francine
- Explanations From Large Language Models Make Small Reasoners Better Li Shiyang, Chen Jianshu, Shen Yelong, Chen Zhiyu, Zhang Xinlu, Li Zekun, Wang Hong, Qian Jing, Peng Baolin, Mao Yi, Chen Wenhu, Yan Xifeng
- Pre-trained Language Models For Interactive Decision-making Li Shuang, Puig Xavier, Paxton Chris, Du Yilun, Wang Clinton, Fan Linxi, Chen Tao, Huang De-an, Akyürek Ekin, Anandkumar Anima, Andreas Jacob, Mordatch Igor, Torralba Antonio, Zhu Yuke
- Task-specific Pre-training And Prompt Decomposition For Knowledge Graph Population With Language Models Li Tianyi, Huang Wenyu, Papasarantopoulos Nikos, Vougiouklis Pavlos, Pan Jeff Z.
- Evade The Trap Of Mediocrity: Promoting Diversity And Novelty In Text Generation Via Concentrating Attention Li Wenhao, Yi Xiaoyuan, Hu Jinyi, Sun Maosong, Xie Xing
- Evaluating Psychological Safety Of Large Language Models Li Xingxuan, Li Yutong, Qiu Lin, Joty Shafiq, Bing Lidong
- P-transformer: Towards Better Document-to-document Neural Machine Translation Li Yachao, Li Junhui, Jiang Jing, Tao Shimin, Yang Hao, Zhang Min
- Uni-eden: Universal Encoder-decoder Network By Multi-granular Vision-language Pre-training Li Yehao, Fan Jiahao, Pan Yingwei, Yao Ting, Lin Weiyao, Mei Tao
- Exploring Visual Interpretability For Contrastive Language-image Pre-training Li Yi, Wang Hualiang, Duan Yiqun, Xu Hang, Li Xiaomeng
- Accelerating Attention Through Gradient-based Learned Runtime Pruning Li Zheng, Ghodrati Soroush, Yazdanbakhsh Amir, Esmaeilzadeh Hadi, Kang Mingu
- Life After BERT: What Do Other Muppets Understand About Language? Lialin Vladislav, Zhao Kevin, Shivagunde Namrata, Rumshisky Anna
- Bjtu-wechat's Systems For The WMT22 Chat Translation Task Liang Yunlong, Meng Fandong, Xu Jinan, Chen Yufeng, Zhou Jie
- Simple And Effective Gradient-based Tuning Of Sequence-to-sequence Models Lichtarge Jared, Alberti Chris, Kumar Shankar
- ADAPT: Vision-language Navigation With Modality-aligned Action Prompts Lin Bingqian, Zhu Yi, Chen Zicong, Liang Xiwen, Liu Jianzhuang, Liang Xiaodan
- Gentus: Simulating User Behaviour And Language In Task-oriented Dialogues With Generative Transformers Lin Hsien-chin, Geishauser Christian, Feng Shutong, Lubis Nurul, Van Niekerk Carel, Heck Michael, Gašić Milica
- Contextual Transformer For Offline Meta Reinforcement Learning Lin Runji, Li Ye, Feng Xidong, Zhang Zhaowei, Fung Xian Hong Wu, Zhang Haifeng, Wang Jun, Du Yali, Yang Yaodong
- On Task-adaptive Pretraining For Dialogue Response Selection Lin Tzu-hsiang, Chi Ta-chung, Rumshisky Anna
- Vision-language Pre-training For Multimodal Aspect-based Sentiment Analysis Ling Yan, Yu Jianfei, Xia Rui
- Delving Deeper Into Cross-lingual Visual Question Answering Liu Chen, Pfeiffer Jonas, Korhonen Anna, Vulić Ivan, Gurevych Iryna
- Deplot: One-shot Visual Language Reasoning By Plot-to-table Translation Liu Fangyu, Eisenschlos Julian Martin, Piccinno Francesco, Krichene Syrine, Pang Chenxi, Lee Kenton, Joshi Mandar, Chen Wenhu, Collier Nigel, Altun Yasemin
- A Simple Meta-learning Paradigm For Zero-shot Intent Classification With Mixture Attention Mechanism Liu Han, Zhao Siyang, Zhang Xiaotong, Zhang Feng, Sun Junjie, Yu Hong, Zhang Xianchao
- Building Markovian Generative Architectures Over Pretrained LM Backbones For Efficient Task-oriented Dialog Systems Liu Hong, Cai Yucheng, Ou Zhijian, Huang Yi, Feng Junlan
- Same Pre-training Loss, Better Downstream: Implicit Bias Matters For Language Models Liu Hong, Xie Sang Michael, Li Zhiyuan, Ma Tengyu
- Rainier: Reinforced Knowledge Introspector For Commonsense Question Answering Liu Jiacheng, Hallinan Skyler, Lu Ximing, He Pengfei, Welleck Sean, Hajishirzi Hannaneh, Choi Yejin
- Fl-tuning: Layer Tuning For Feed-forward Network In Transformer Liu Jingping, Song Yuqiu, Xue Kui, Sun Hongli, Wang Chao, Chen Lihan, Jiang Haiyun, Liang Jiaqing, Ruan Tong
- Transformer With Memory Replay Liu Rui, Mozafari Barzan
- Commitbart: A Large Pre-trained Model For Github Commits Liu Shangqing, Li Yanzhou, Xie Xiaofei, Liu Yang
- PSP: Pre-trained Soft Prompts For Few-shot Abstractive Summarization Liu Xiaochen, Gao Yang, Bai Yu, Li Jiawei, Hu Yinan, Huang Heyan, Chen Boxing
- Improving Personality Consistency In Conversation By Persona Extending Liu Yifan, Wei Wei, Liu Jiayi, Mao Xianling, Fang Rui, Chen Dangyang
- Mulzdg: Multilingual Code-switching Framework For Zero-shot Dialogue Generation Liu Yongkang, Feng Shi, Wang Daling, Zhang Yifei
- A Win-win Deal: Towards Sparse And Robust Pre-trained Language Models Liu Yuanxin, Meng Fandong, Lin Zheng, Li Jiangnan, Fu Peng, Cao Yanan, Wang Weiping, Zhou Jie
- Declaration-based Prompt Tuning For Visual Question Answering Liu Yuhang, Wei Wei, Peng Daowan, Zhu Feida
- Training Language Models To Follow Instructions With Human Feedback Long Ouyang, Jeff Wu, Xu Jiang, Diogo Almeida, Carroll L. Wainwright, Pamela Mishkin, Chong Zhang, Sandhini Agarwal, Katarina Slama, Alex Ray, John Schulman, Jacob Hilton, Fraser Kelton, Luke Miller, Maddie Simens, Amanda Askell, Peter Welinder, Paul Christiano, Jan Leike, Ryan Lowe
- Vision-and-language Pretrained Models: A Survey Long Siqu, Cao Feiqi, Han Soyeon Caren, Yang Haiqin
- Reacc: A Retrieval-augmented Code Completion Framework Lu Shuai, Duan Nan, Han Hojae, Guo Daya, Hwang Seung-won, Svyatkovskiy Alexey
- Neuro-symbolic Procedural Planning With Commonsense Prompting Lu Yujie, Feng Weixi, Zhu Wanrong, Xu Wenda, Wang Xin Eric, Eckstein Miguel, Wang William Yang
- I-tuning: Tuning Frozen Language Models With Image For Lightweight Image Captioning Luo Ziyang, Hu Zhipeng, Xi Yadong, Zhang Rongsheng, Ma Jing
- A Frustratingly Simple Approach For End-to-end Image Captioning Luo Ziyang, Xi Yadong, Zhang Rongsheng, Ma Jing
- Torchscale: Transformers At Scale Ma Shuming, Wang Hongyu, Huang Shaohan, Wang Wenhui, Chi Zewen, Dong Li, Benhaim Alon, Patra Barun, Chaudhary Vishrav, Song Xia, Wei Furu
- Memory-assisted Prompt Editing To Improve GPT-3 After Deployment Madaan Aman, Tandon Niket, Clark Peter, Yang Yiming
- Structured Like A Language Model: Analysing AI As An Automated Subject Magee Liam, Arora Vanicka, Munn Luke
- PERFECT: Prompt-free And Efficient Few-shot Learning With Language Models Mahabadi Rabeeh Karimi, Zettlemoyer Luke, Henderson James, Saeidi Marzieh, Mathias Lambert, Stoyanov Veselin, Yazdani Majid
- Fine-tuning Pre-trained Transformers Into Decaying Fast Weights Mao Huanru Henry
- Quick Starting Dialog Systems With Paraphrase Generation Marceau Louis, Belbahar Raouf, Queudot Marc, Naji Nada, Charton Eric, Meurs Marie-jean
- Mini-model Adaptation: Efficiently Extending Pretrained Models To New Languages Via Aligned Shallow Training Marchisio Kelly, Lewis Patrick, Chen Yihong, Artetxe Mikel
- Grounding Language With Visual Affordances Over Unstructured Data Mees Oier, Borja-diaz Jessica, Burgard Wolfram
- LAD: Language Models As Data For Zero-shot Dialog Mehri Shikib, Altun Yasemin, Eskenazi Maxine
- DSI++: Updating Transformer Memory With New Documents Mehta Sanket Vaibhav, Gupta Jai, Tay Yi, Dehghani Mostafa, Tran Vinh Q., Rao Jinfeng, Najork Marc, Strubell Emma, Metzler Donald
- Generating Training Data With Language Models: Towards Zero-shot Language Understanding Meng Yu, Huang Jiaxin, Zhang Yu, Han Jiawei
- Tuning Language Models As Training Data Generators For Augmentation-enhanced Few-shot Learning Meng Yu, Michalski Martin, Huang Jiaxin, Zhang Yu, Abdelzaher Tarek, Han Jiawei
- The Devil Is In The Details: On Models And Training Regimes For Few-shot Intent Classification Mesgar Mohsen, Tran Thy Thy, Glavas Goran, Gurevych Iryna
- SKILL: Structured Knowledge Infusion For Large Language Models Moiseev Fedor, Dong Zhe, Alfonseca Enrique, Jaggi Martin
- Domain-specific Text Generation For Machine Translation Moslem Yasmin, Haque Rejwanul, Kelleher John D., Way Andy
- SGPT: GPT Sentence Embeddings For Semantic Search Muennighoff Niklas
- JASMINE: Arabic GPT Models For Few-shot Learning Nagoudi El Moatez Billah, Abdul-mageed Muhammad, Elmadany Abdelrahim, Inciarte Alcides Alcoba, Khondaker Md Tawkat Islam
- Large Language Models Are Reasoning Teachers Namgyu Ho, Laura Schmid, Se-young Yun
- Crosslingual Generalization Through Multitask Finetuning Niklas Muennighoff, Thomas Wang, Lintang Sutawika, Adam Roberts, Stella Biderman, Teven Le Scao, M Saiful Bari, Sheng Shen, Zheng-xin Yong, Hailey Schoelkopf, Xiangru Tang, Dragomir Radev, Alham Fikri Aji, Khalid Almubarak, Samuel Albanie, Zaid Alyafeai, Albert Webson, Edward Raff, Colin Raffel
- Budgetlongformer: Can We Cheaply Pretrain A Sota Legal Language Model From Scratch? Niklaus Joel, Giofré Daniele
- Do Vision-and-language Transformers Learn Grounded Predicate-noun Dependencies? Nikolaus Mitja, Salin Emmanuelle, Ayache Stephane, Fourtassi Abdellah, Favre Benoit
- No Language Left Behind: Scaling Human-centered Machine Translation Nllb Team, Marta R. Costa-jussà, James Cross, Onur Çelebi, Maha Elbayad, Kenneth Heafield, Kevin Heffernan, Elahe Kalbassi, Janice Lam, Daniel Licht, Jean Maillard, Anna Sun, Skyler Wang, Guillaume Wenzek, Al Youngblood, Bapi Akula, Loic Barrault, Gabriel Mejia Gonzalez, Prangthip Hansanti, John Hoffman, Semarley Jarrett, Kaushik Ram Sadagopan, Dirk Rowe, Shannon Spruit, Chau Tran, Pierre Andrews, Necip Fazil Ayan, Shruti Bhosale, Sergey Edunov, Angela Fan, Cynthia Gao, Vedanuj Goswami, Francisco Guzmán, Philipp Koehn, Alexandre Mourachko, Christophe Ropers, Safiyyah Saleem, Holger Schwenk, Jeff Wang
- Improving The Cross-lingual Generalisation In Visual Question Answering Nooralahzadeh Farhad, Sennrich Rico
- Minialbert: Model Distillation Via Parameter-efficient Recursive Transformers Nouriborji Mohammadmahdi, Rohanian Omid, Kouchaki Samaneh, Clifton David A.
- CUE Vectors: Modular Training Of Language Models Conditioned On Diverse Contextual Signals Novotney Scott, Mukherjee Sreeparna, Ahmed Zeeshan, Stolcke Andreas
- Measuring And Narrowing The Compositionality Gap In Language Models Ofir Press, Muru Zhang, Sewon Min, Ludwig Schmidt, Noah A. Smith, Mike Lewis
- Thinksum: Probabilistic Reasoning Over Sets Using Large Language Models Ozturkler Batu, Malkin Nikolay, Wang Zhen, Jojic Nebojsa
- Parameter-efficient Abstractive Question Answering Over Tables Or Text Pal Vaishali, Kanoulas Evangelos, De Rijke Maarten
- Dynamic Prompt Learning Via Policy Gradient For Semi-structured Mathematical Reasoning Pan Lu, Liang Qiu, Kai-wei Chang, Ying Nian Wu, Song-chun Zhu, Tanmay Rajpurohit, Peter Clark, Ashwin Kalyan
- Extremebert: A Toolkit For Accelerating Pretraining Of Customized BERT Pan Rui, Diao Shizhe, Chen Jianlin, Zhang Tong
- TALM: Tool Augmented Language Models Parisi Aaron, Zhao Yao, Fiedel Noah
- LUT-GEMM: Quantized Matrix Multiplication Based On Luts For Efficient Inference In Large-scale Generative Language Models Park Gunho, Park Baeseong, Kim Minsub, Lee Sungjae, Kim Jeonghoon, Kwon Beomseok, Kwon Se Jung, Kim Byeongwook, Lee Youngjoo, Lee Dongsoo
- Bidirectional Language Models Are Also Few-shot Learners Patel Ajay, Li Bryan, Rasooli Mohammad Sadegh, Constant Noah, Raffel Colin, Callison-burch Chris
- Beyond English-centric Bitexts For Better Multilingual Language Representation Learning Patra Barun, Singhal Saksham, Huang Shaohan, Chi Zewen, Dong Li, Wei Furu, Chaudhary Vishrav, Song Xia
- Sparse And Dense Approaches For The Full-rank Retrieval Of Responses For Dialogues Penha Gustavo, Hauff Claudia
- Lifting The Curse Of Multilinguality By Pre-training Modular Transformers Pfeiffer Jonas, Goyal Naman, Lin Xi Victoria, Li Xian, Cross James, Riedel Sebastian, Artetxe Mikel
- Vit5: Pretrained Text-to-text Transformer For Vietnamese Language Generation Phan Long, Tran Hieu, Nguyen Hieu, Trinh Trieu H.
- Hypertuning: Toward Adapting Large Language Models Without Back-propagation Phang Jason, Mao Yi, He Pengcheng, Chen Weizhu
- Investigating Efficiently Extending Transformers For Long Input Summarization Phang Jason, Zhao Yao, Liu Peter J.
- Pre-training Image-language Transformers For Open-vocabulary Tasks Piergiovanni Aj, Kuo Weicheng, Angelova Anelia
- Answer-me: Multi-task Open-vocabulary Visual Question Answering Piergiovanni Aj, Li Wei, Kuo Weicheng, Saffar Mohammad, Bertsch Fred, Angelova Anelia
- Harnessing The Power Of Multi-task Pretraining For Ground-truth Level Natural Language Explanations Plüster Björn, Ambsdorf Jakob, Braach Lukas, Lee Jae Hee, Wermter Stefan
- Synchromesh: Reliable Code Generation From Pre-trained Language Models Poesia Gabriel, Polozov Oleksandr, Le Vu, Tiwari Ashish, Soares Gustavo, Meek Christopher, Gulwani Sumit
- Outliers Dimensions That Disrupt Transformers Are Driven By Frequency Puccetti Giovanni, Rogers Anna, Drozd Aleksandr, Dell'orletta Felice
- A Self-paced Mixed Distillation Method For Non-autoregressive Generation Qi Weizhen, Gong Yeyun, Shen Yelong, Jiao Jian, Yan Yu, Li Houqiang, Zhang Ruofei, Chen Weizhu, Duan Nan
- The NLP Task Effectiveness Of Long-range Transformers Qin Guanghui, Feng Yukun, Van Durme Benjamin
- ELLE: Efficient Lifelong Pre-training For Emerging Data Qin Yujia, Zhang Jiajie, Lin Yankai, Liu Zhiyuan, Li Peng, Sun Maosong, Zhou Jie
- A Survey On In-context Learning Qingxiu Dong, Lei Li, Damai Dai, Ce Zheng, Jingyuan Ma, Rui Li, Heming Xia, Jingjing Xu, Zhiyong Wu, Baobao Chang, Xu Sun, Lei Li, Zhifang Sui
- Transforming Sequence Tagging Into A Seq2seq Task Raman Karthik, Naim Iftekhar, Chen Jiecao, Hashimoto Kazuma, Yalasangi Kiran, Srinivasan Krishna
- Parallel Context Windows For Large Language Models Ratner Nir, Levine Yoav, Belinkov Yonatan, Ram Ori, Magar Inbal, Abend Omri, Karpas Ehud, Shashua Amnon, Leyton-brown Kevin, Shoham Yoav
- Cometkiwi: Ist-unbabel 2022 Submission For The Quality Estimation Shared Task Rei Ricardo, Treviso Marcos, Guerreiro Nuno M., Zerva Chrysoula, Farinha Ana C., Maroti Christine, De Souza José G. C., Glushkova Taisiya, Alves Duarte M., Lavie Alon, Coheur Luisa, Martins André F. T.
- Super-prompting: Utilizing Model-independent Contextual Data To Reduce Data Annotation Required In Visual Commonsense Tasks Rezaei Navid, Reformat Marek Z.
- Additive Interventions Yield Robust Multi-domain Machine Translation Models Rippeth Elijah, Post Matt
- Learning To Diversify For Product Question Generation Roitman Haggai, Singer Uriel, Eshel Yotam, Nus Alexander, Kiperwasser Eliyahu
- Lamda: Language Models For Dialog Applications Romal Thoppilan, Daniel De Freitas, Jamie Hall, Noam Shazeer, Apoorv Kulshreshtha, Heng-tze Cheng, Alicia Jin, Taylor Bos, Leslie Baker, Yu Du, Yaguang Li, Hongrae Lee, Huaixiu Steven Zheng, Amin Ghafouri, Marcelo Menegali, Yanping Huang, Maxim Krikun, Dmitry Lepikhin, James Qin, Dehao Chen, Yuanzhong Xu, Zhifeng Chen, Adam Roberts, Maarten Bosma, Vincent Zhao, Yanqi Zhou, Chung-ching Chang, Igor Krivokon, Will Rusch, Marc Pickett, Pranesh Srinivasan, Laichee Man, Kathleen Meier-hellstern, Meredith Ringel Morris, Tulsee Doshi, Renelito Delos Santos, Toju Duke, Johnny Soraker, Ben Zevenbergen, Vinodkumar Prabhakaran, Mark Diaz, Ben Hutchinson, Kristen Olson, Alejandra Molina, Erin Hoffman-john, Josh Lee, Lora Aroyo, Ravi Rajakumar, Alena Butryna, Matthew Lamm, Viktoriya Kuzmina, Joe Fenton, Aaron Cohen, Rachel Bernstein, Ray Kurzweil, Blaise Aguera-arcas, Claire Cui, Marian Croak, Ed Chi, Quoc Le
- Billions Of Parameters Are Worth More Than In-domain Training Data: A Case Study In The Legal Case Entailment Task Rosa Guilherme Moraes, Bonifacio Luiz, Jeronymo Vitor, Abonizio Hugo, Lotufo Roberto, Nogueira Rodrigo
- LINGUIST: Language Model Instruction Tuning To Generate Annotated Utterances For Intent Classification And Slot Tagging Rosenbaum Andy, Soltan Saleh, Hamza Wael, Versley Yannick, Boese Markus
- Few-shot Visual Question Generation: A Novel Task And Benchmark Datasets Roy Anurag, Ekka David Johnson, Ghosh Saptarshi, Das Abir
- Benchclamp: A Benchmark For Evaluating Language Models On Syntactic And Semantic Parsing Roy Subhro, Thomson Sam, Chen Tongfei, Shin Richard, Pauls Adam, Eisner Jason, Van Durme Benjamin
- The Goldilocks Of Pragmatic Understanding: Fine-tuning Strategy Matters For Implicature Resolution By Llms Ruis Laura, Khan Akbir, Biderman Stella, Hooker Sara, Rocktäschel Tim, Grefenstette Edward
- Data Augmentation For Intent Classification With Off-the-shelf Large Language Models Sahu Gaurav, Rodriguez Pau, Laradji Issam H., Atighehchian Parmida, Vazquez David, Bahdanau Dzmitry
- Alexatm 20B: Few-shot Learning Using A Large-scale Multilingual Seq2seq Model Saleh Soltan, Shankar Ananthakrishnan, Jack Fitzgerald, Rahul Gupta, Wael Hamza, Haidar Khan, Charith Peris, Stephen Rawls, Andy Rosenbaum, Anna Rumshisky, Chandana Satya Prakash, Mukund Sridhar, Fabian Triefenbach, Apurv Verma, Gokhan Tur, Prem Natarajan
- Self-repetition In Abstractive Neural Summarizers Salkar Nikita, Trikalinos Thomas, Wallace Byron C., Nenkova Ani
- NLX-GPT: A Model For Natural Language Explanations In Vision And Vision-language Tasks Sammani Fawaz, Mukherjee Tanmoy, Deligiannis Nikos
- Knowledge Prompts: Injecting World Knowledge Into Language Models Through Soft Prompts Santos Cicero Nogueira Dos, Dong Zhe, Cer Daniel, Nham John, Shakeri Siamak, Ni Jianmo, Sung Yun-hsuan
- APOLLO: A Simple Approach For Adaptive Pretraining Of Language Models For Logical Reasoning Sanyal Soumya, Xu Yichong, Wang Shuohang, Yang Ziyi, Pryzant Reid, Yu Wenhao, Zhu Chenguang, Ren Xiang
- IT5: Text-to-text Pretraining For Italian Language Understanding And Generation Sarti Gabriele, Nissim Malvina
- What Language Model To Train If You Have One Million GPU Hours? Scao Teven Le, Wang Thomas, Hesslow Daniel, Saulnier Lucile, Bekman Stas, Bari M Saiful, Biderman Stella, Elsahar Hady, Muennighoff Niklas, Phang Jason, Press Ofir, Raffel Colin, Sanh Victor, Shen Sheng, Sutawika Lintang, Tae Jaesung, Yong Zheng Xin, Launay Julien, Beltagy Iz
- Training Language Models With Language Feedback Scheurer Jérémy, Campos Jon Ander, Chan Jun Shern, Chen Angelica, Cho Kyunghyun, Perez Ethan
- Large Language Models And The Reverse Turing Test Sejnowski Terrence
- On The Effect Of Pretraining Corpora On In-context Learning By A Large-scale Language Model Seongjin Shin, Sang-woo Lee, Hwijeen Ahn, Sungdong Kim, Hyoungseok Kim, Boseop Kim, Kyunghyun Cho, Gichang Lee, Woomyoung Park, Jung-woo Ha, Nako Sung
- Recursive Decoding: A Situated Cognition Approach To Compositional Generation In Grounded Language Understanding Setzler Matthew, Howland Scott, Phillips Lauren
- Using Deepspeed And Megatron To Train Megatron-turing NLG 530B, A Large-scale Generative Language Model Shaden Smith, Mostofa Patwary, Brandon Norick, Patrick Legresley, Samyam Rajbhandari, Jared Casper, Zhun Liu, Shrimai Prabhumoye, George Zerveas, Vijay Korthikanti, Elton Zhang, Rewon Child, Reza Yazdani Aminabadi, Julie Bernauer, Xia Song, Mohammad Shoeybi, Yuxiong He, Michael Houston, Saurabh Tiwary, Bryan Catanzaro
- Overcoming Catastrophic Forgetting Beyond Continual Learning: Balanced Training For Neural Machine Translation Shao Chenze, Feng Yang
- Staged Training For Transformer Language Models Shen Sheng, Walsh Pete, Keutzer Kurt, Dodge Jesse, Peters Matthew, Beltagy Iz
- Incorporating Domain Knowledge Through Task Augmentation For Front-end Javascript Code Generation Shen Sijie, Zhu Xiang, Dong Yihong, Guo Qizhi, Zhen Yankun, Li Ge
- Unintended Bias In Language Model-driven Conversational Recommendation Shen Tianshu, Li Jiaru, Bouadjenek Mohamed Reda, Mai Zheda, Sanner Scott
- Ebms Vs. CL: Exploring Self-supervised Visual Pretraining For Visual Question Answering Shevchenko Violetta, Abbasnejad Ehsan, Dick Anthony, Hengel Anton Van Den, Teney Damien
- Mgpt: Few-shot Learners Go Multilingual Shliazhko Oleh, Fenogenova Alena, Tikhonova Maria, Mikhailov Vladislav, Kozlova Anastasia, Shavrina Tatiana
- Gpt-neox-20b: An Open-source Autoregressive Language Model Sid Black, Stella Biderman, Eric Hallahan, Quentin Anthony, Leo Gao, Laurence Golding, Horace He, Connor Leahy, Kyle Mcdonell, Jason Phang, Michael Pieler, Usvsn Sai Prashanth, Shivanshu Purohit, Laria Reynolds, Jonathan Tow, Ben Wang, Samuel Weinbach
- Towards The Next 1000 Languages In Multilingual Machine Translation: Exploring The Synergy Between Supervised And Self-supervised Learning Siddhant Aditya, Bapna Ankur, Firat Orhan, Cao Yuan, Chen Mia Xu, Caswell Isaac, Garcia Xavier
- Ask Me Anything: A Simple Strategy For Prompting Language Models Simran Arora, Avanika Narayan, Mayee F. Chen, Laurel Orr, Neel Guha, Kush Bhatia, Ines Chami, Frederic Sala, Christopher Ré
- Augmenting Interpretable Models With Llms During Training Singh Chandan, Askari Armin, Caruana Rich, Gao Jianfeng
- Improving The Domain Adaptation Of Retrieval Augmented Generation (RAG) Models For Open Domain Question Answering Siriwardhana Shamane, Weerasekera Rivindu, Wen Elliott, Kaluarachchi Tharindu, Rana Rajib, Nanayakkara Suranga
- Llm-planner: Few-shot Grounded Planning For Embodied Agents With Large Language Models Song Chan Hee, Wu Jiaman, Washington Clayton, Sadler Brian M., Chao Wei-lun, Su Yu
- CLIP Models Are Few-shot Learners: Empirical Studies On VQA And Visual Entailment Song Haoyu, Dong Li, Zhang Wei-nan, Liu Ting, Wei Furu
- Human Language Modeling Soni Nikita, Matero Matthew, Balasubramanian Niranjan, Schwartz H. Andrew
- OPT-IML: Scaling Language Model Instruction Meta Learning Through The Lens Of Generalization Srinivasan Iyer, Xi Victoria Lin, Ramakanth Pasunuru, Todor Mihaylov, Daniel Simig, Ping Yu, Kurt Shuster, Tianlu Wang, Qing Liu, Punit Singh Koura, Xian Li, Brian O'horo, Gabriel Pereyra, Jeff Wang, Christopher Dewan, Asli Celikyilmaz, Luke Zettlemoyer, Ves Stoyanov
- Towards Zero-shot And Few-shot Table Question Answering Using GPT-3 Srivastava Pragya, Ganu Tanuja, Guha Saikat
- Conciseness: An Overlooked Language Task Stahlberg Felix, Kumar Aashish, Alberti Chris, Kumar Shankar
- Towards Diverse, Relevant And Coherent Open-domain Dialogue Generation Via Hybrid Latent Variables Sun Bin, Li Yitong, Mi Fei, Wang Weichao, Li Yiwei, Li Kan
- Tokenization Consistency Matters For Generative Models On Extractive NLP Tasks Sun Kaiser, Qi Peng, Zhang Yuhao, Liu Lan, Wang William Yang, Huang Zhiheng
- Dynamar: Dynamic Prompt With Mask Token Representation Sun Xiaodi, Rajagopalan Sunny, Nigam Priyanka, Lu Weiyi, Xu Yi, Zeng Belinda, Chilimbi Trishul
- Long-form Video-language Pre-training With Multimodal Temporal Contrastive Learning Sun Yuchong, Xue Hongwei, Song Ruihua, Liu Bei, Yang Huan, Fu Jianlong
- A Length-extrapolatable Transformer Sun Yutao, Dong Li, Patra Barun, Ma Shuming, Huang Shaohan, Benhaim Alon, Chaudhary Vishrav, Song Xia, Wei Furu
- Pseudo-ood Training For Robust Language Models Sundararaman Dhanasekar, Mehta Nikhil, Carin Lawrence
- On The Effect Of Pre-training For Transformer In Different Modality On Offline Reinforcement Learning Takagi Shiro
- News Summarization And Evaluation In The Era Of GPT-3 Tanya Goyal, Junyi Jessy Li, Greg Durrett
- Towards Fine-tuning Pre-trained Language Models With Integer Forward And Backward Propagation Tayaranian Mohammadreza, Ghaffari Alireza, Tahaei Marzieh S., Rezagholizadeh Mehdi, Asgharian Masoud, Nia Vahid Partovi
- Benchmarking Large Language Models For Automated Verilog RTL Code Generation Thakur Shailja, Ahmad Baleegh, Fan Zhenxing, Pearce Hammond, Tan Benjamin, Karri Ramesh, Dolan-gavitt Brendan, Garg Siddharth
- Interleaving Retrieval With Chain-of-thought Reasoning For Knowledge-intensive Multi-step Questions Trivedi Harsh, Balasubramanian Niranjan, Khot Tushar, Sabharwal Ashish
- Adavae: Exploring Adaptive Gpt-2s In Variational Auto-encoders For Language Modeling Tu Haoqin, Yang Zhongliang, Yang Jinshuai, Huang Yongfeng
- Help Me Write A Poem: Instruction Tuning As A Vehicle For Collaborative Poetry Writing Tuhin Chakrabarty, Vishakh Padmakumar, He He
- Efficient Few-shot Learning Without Prompts Tunstall Lewis, Reimers Nils, Jo Unso Eun Seo, Bates Luke, Korat Daniel, Wasserblat Moshe, Pereg Oren
- Effect And Analysis Of Large-scale Language Model Rescoring On Competitive ASR Systems Udagawa Takuma, Suzuki Masayuki, Kurata Gakuto, Itoh Nobuyasu, Saon George
- Dylora: Parameter Efficient Tuning Of Pre-trained Models Using Dynamic Search-free Low-rank Adaptation Valipour Mojtaba, Rezagholizadeh Mehdi, Kobyzev Ivan, Ghodsi Ali
- Will It Blend? Mixing Training Paradigms & Prompting For Argument Quality Prediction Van Der Meer Michiel, Reuver Myrthe, Khurana Urja, Krause Lea, Santamaría Selene Báez
- Will We Run Out Of Data? Limits Of LLM Scaling Based On Human-generated Data Villalobos Pablo, Ho Anson, Sevilla Jaime, Besiroglu Tamay, Heim Lennart, Hobbhahn Marius
- What Do They Capture? -- A Structural Analysis Of Pre-trained Language Models For Source Code Wan Yao, Zhao Wei, Zhang Hongyu, Sui Yulei, Xu Guandong, Jin Hai
- Unite: Unified Translation Evaluation Wan Yu, Liu Dayiheng, Yang Baosong, Zhang Haibo, Chen Boxing, Wong Derek F., Chao Lidia S.
- Can Language Models Make Fun? A Case Study In Chinese Comical Crosstalk Wang Benyou, Wu Xiangbo, Liu Xiaokang, Li Jianquan, Tiwari Prayag, Xie Qianqian
- Exploring The Limits Of Domain-adaptive Training For Detoxifying Large-scale Language Models Wang Boxin, Ping Wei, Xiao Chaowei, Xu Peng, Patwary Mostofa, Shoeybi Mohammad, Li Bo, Anandkumar Anima, Catanzaro Bryan
- Enabling Conversational Interaction With Mobile UI Using Large Language Models Wang Bryan, Li Gang, Li Yang
- Knowledgeable Salient Span Mask For Enhancing Language Models As Knowledge Base Wang Cunxiang, Luo Fuli, Li Yanyang, Xu Runxin, Huang Fei, Zhang Yue
- Learning To Decompose Visual Features With Latent Textual Prompts Wang Feng, Li Manling, Lin Xudong, Lv Hairong, Schwing Alexander G., Ji Heng
- Knowledge Prompting In Pre-trained Language Model For Natural Language Understanding Wang Jianing, Huang Wenkang, Shi Qiuhui, Wang Hongbin, Qiu Minghui, Li Xiang, Gao Ming
- Lilt: A Simple Yet Effective Language-independent Layout Transformer For Structured Document Understanding Wang Jiapeng, Jin Lianwen, Ding Kai
- TAG: Boosting Text-vqa Via Text-aware Visual Question-answer Generation Wang Jun, Gao Mingfei, Hu Yuqian, Selvaraju Ramprasaath R., Ramaiah Chetan, Xu Ran, Jaja Joseph F., Davis Larry S.
- Pretraining Without Attention Wang Junxiong, Yan Jing Nathan, Gu Albert, Rush Alexander M.
- Zero-shot Image Captioning By Anchor-augmented Vision-language Space Alignment Wang Junyang, Zhang Yi, Yan Ming, Zhang Ji, Sang Jitao
- What Language Model Architecture And Pretraining Objective Work Best For Zero-shot Generalization? Wang Thomas, Roberts Adam, Hesslow Daniel, Scao Teven Le, Chung Hyung Won, Beltagy Iz, Launay Julien, Raffel Colin
- Visually-augmented Language Modeling Wang Weizhi, Dong Li, Cheng Hao, Song Haoyu, Liu Xiaodong, Yan Xifeng, Gao Jianfeng, Wei Furu
- Image As A Foreign Language: Beit Pretraining For All Vision And Vision-language Tasks Wang Wenhui, Bao Hangbo, Dong Li, Bjorck Johan, Peng Zhiliang, Liu Qiang, Aggarwal Kriti, Mohammed Owais Khan, Singhal Saksham, Som Subhojit, Wei Furu
- Understanding And Improving Sequence-to-sequence Pretraining For Neural Machine Translation Wang Wenxuan, Jiao Wenxiang, Hao Yongchang, Wang Xing, Shi Shuming, Tu Zhaopeng, Lyu Michael
- Finding Skill Neurons In Pre-trained Transformer-based Language Models Wang Xiaozhi, Wen Kaiyue, Zhang Zhengyan, Hou Lei, Liu Zhiyuan, Li Juanzi
- Compilable Neural Code Generation With Compiler Feedback Wang Xin, Wang Yasheng, Wan Yao, Mi Fei, Li Yitong, Zhou Pingyi, Liu Jin, Wu Hao, Jiang Xin, Liu Qun
- Code4struct: Code Generation For Few-shot Event Structure Prediction Wang Xingyao, Li Sha, Ji Heng
- Pay Attention To Your Tone: Introducing A New Dataset For Polite Language Rewrite Wang Xun, Ge Tao, Mao Allen, Li Yuki, Wei Furu, Chen Si-qing
- S-prompts Learning With Pre-trained Transformers: An Occam's Razor For Domain Incremental Learning Wang Yabin, Huang Zhiwu, Hong Xiaopeng
- Adamix: Mixture-of-adaptations For Parameter-efficient Model Tuning Wang Yaqing, Agarwal Sahaj, Mukherjee Subhabrata, Liu Xiaodong, Gao Jing, Awadallah Ahmed Hassan, Gao Jianfeng
- Controlling Styles In Neural Machine Translation With Activation Prompt Wang Yifan, Sun Zewei, Cheng Shanbo, Zheng Weiguo, Wang Mingxuan
- Two-stage LLM Fine-tuning With Less Specialization And More Generalization Wang Yihan, Si Si, Li Daliang, Lukasik Michal, Yu Felix, Hsieh Cho-jui, Dhillon Inderjit S, Kumar Sanjiv
- Pan More Gold From The Sand: Refining Open-domain Dialogue Training With Noisy Self-retrieval Generation Wang Yihe, Li Yitong, Wang Yasheng, Mi Fei, Zhou Pingyi, Wang Xin, Liu Jin, Jiang Xin, Liu Qun
- Multimodal Adaptive Distillation For Leveraging Unimodal Encoders For Vision-language Tasks Wang Zhecan, Codella Noel, Chen Yen-chun, Zhou Luowei, Dai Xiyang, Xiao Bin, Yang Jianwei, You Haoxuan, Chang Kai-wei, Chang Shih-fu, Yuan Lu
- Language Models With Image Descriptors Are Strong Few-shot Video-language Learners Wang Zhenhailong, Li Manling, Xu Ruochen, Zhou Luowei, Lei Jie, Lin Xudong, Wang Shuohang, Yang Ziyi, Zhu Chenguang, Hoiem Derek, Chang Shih-fu, Bansal Mohit, Ji Heng
- Unified Multimodal Model With Unlikelihood Training For Visual Dialog Wang Zihao, Wang Junli, Jiang Changjun
- Jiuzhang: A Chinese Pre-trained Language Model For Mathematical Problem Understanding Wayne Xin Zhao, Kun Zhou, Zheng Gong, Beichen Zhang, Yuanhang Zhou, Jing Sha, Zhigang Chen, Shijin Wang, Cong Liu, Ji-rong Wen
- Emergent Analogical Reasoning In Large Language Models Webb Taylor, Holyoak Keith J., Lu Hongjing
- CREATER: Ctr-driven Advertising Text Generation With Controlled Pre-training And Contrastive Fine-tuning Wei Penghui, Yang Xuanhua, Liu Shaoguo, Wang Liang, Zheng Bo
- Language Models As Zero-shot Planners: Extracting Actionable Knowledge For Embodied Agents Wenlong Huang, Pieter Abbeel, Deepak Pathak, Igor Mordatch
- Probing Of Quantitative Values In Abstractive Summarization Models White Nathan M.
- Generative Or Contrastive? Phrase Reconstruction For Better Sentence Representation Learning Wu Bohong, Zhao Hai
- Sentence Representation Learning With Generative Objective Rather Than Contrastive Objective Wu Bohong, Zhao Hai
- Adversarial Self-attention For Language Understanding Wu Hongqiu, Ding Ruixue, Zhao Hai, Xie Pengjun, Huang Fei, Zhang Min
- FRSUM: Towards Faithful Abstractive Summarization Via Enhancing Factual Robustness Wu Wenhao, Li Wei, Liu Jiachen, Xiao Xinyan, Cao Ziqiang, Li Sujian, Wu Hua
- Precisely The Point: Adversarial Augmentations For Faithful And Informative Text Generation Wu Wenhao, Li Wei, Liu Jiachen, Xiao Xinyan, Li Sujian, Lyu Yajuan
- Extreme Compression For Pre-trained Transformers Made Simple And Efficient Wu Xiaoxia, Yao Zhewei, Zhang Minjia, Li Conglong, He Yuxiong
- An Efficient Memory-augmented Transformer For Knowledge-intensive NLP Tasks Wu Yuxiang, Zhao Yu, Hu Baotian, Minervini Pasquale, Stenetorp Pontus, Riedel Sebastian
- Smoothquant: Accurate And Efficient Post-training Quantization For Large Language Models Xiao Guangxuan, Lin Ji, Seznec Mickael, Wu Hao, Demouth Julien, Han Song
- MGA-VQA: Multi-granularity Alignment For Visual Question Answering Xiong Peixi, Shen Yilin, Jin Hongxia
- Adapting Pretrained Text-to-text Models For Long Text Sequences Xiong Wenhan, Gupta Anchit, Toshniwal Shubham, Mehdad Yashar, Yih Wen-tau
- Go-tuning: Improving Zero-shot Learning Abilities Of Smaller Language Models Xu Jingjing, Dong Qingxiu, Liu Hongyi, Li Lei
- Integrating Translation Memories Into Non-autoregressive Machine Translation Xu Jitao, Crego Josep, Yvon François
- Evaluating Parameter Efficient Learning For Generation Xu Peng, Patwary Mostofa, Prabhumoye Shrimai, Adams Virginia, Prenger Ryan J., Ping Wei, Lee Nayeon, Shoeybi Mohammad, Catanzaro Bryan
- SESCORE2: Learning Text Generation Evaluation Via Synthesizing Realistic Mistakes Xu Wenda, Qian Xian, Wang Mingxuan, Li Lei, Wang William Yang
- Bridgetower: Building Bridges Between Encoders In Vision-language Representation Learning Xu Xiao, Wu Chenfei, Rosenman Shachar, Lal Vasudev, Che Wanxiang, Duan Nan
- CLIP Also Understands Text: Prompting CLIP For Phrase Understanding Yan An, Li Jiacheng, Zhu Wanrong, Lu Yujie, Wang William Yang, Mcauley Julian
- Generative Negative Text Replay For Continual Vision-language Pretraining Yan Shipeng, Hong Lanqing, Xu Hang, Han Jianhua, Tuytelaars Tinne, Li Zhenguo, He Xuming
- Learning To Answer Visual Questions From Web Videos Yang Antoine, Miech Antoine, Sivic Josef, Laptev Ivan, Schmid Cordelia
- Zero-shot Video Question Answering Via Frozen Bidirectional Language Models Yang Antoine, Miech Antoine, Sivic Josef, Laptev Ivan, Schmid Cordelia
- GLUE-X: Evaluating Natural Language Understanding Models From An Out-of-distribution Generalization Perspective Yang Linyi, Zhang Shuibai, Qin Libo, Li Yafu, Wang Yidong, Liu Hanmeng, Wang Jindong, Xie Xing, Zhang Yue
- AD-DROP: Attribution-driven Dropout For Robust Language Model Fine-tuning Yang Tao, Deng Jinghao, Quan Xiaojun, Wang Qifan, Nie Shaoliang
- Parameter-efficient Tuning Makes A Good Classification Head Yang Zhuoyi, Ding Ming, Guo Yanhui, Lv Qingsong, Tang Jie
- Prompt Tuning For Discriminative Pre-trained Language Models Yao Yuan, Dong Bowen, Zhang Ao, Zhang Zhengyan, Xie Ruobing, Liu Zhiyuan, Lin Leyu, Sun Maosong, Wang Jianyong
- Zeroquant: Efficient And Affordable Post-training Quantization For Large-scale Transformers Yao Zhewei, Aminabadi Reza Yazdani, Zhang Minjia, Wu Xiaoxia, Li Conglong, He Yuxiong
- Random-ltd: Random And Layerwise Token Dropping Brings Efficient Training For Large-scale Transformers Yao Zhewei, Wu Xiaoxia, Li Conglong, Holmes Connor, Zhang Minjia, Li Cheng, He Yuxiong
- Retrieval-augmented Multimodal Language Modeling Yasunaga Michihiro, Aghajanyan Armen, Shi Weijia, James Rich, Leskovec Jure, Liang Percy, Lewis Mike, Zettlemoyer Luke, Yih Wen-tau
- Guess The Instruction! Flipped Learning Makes Language Models Stronger Zero-shot Learners Ye Seonghyeon, Kim Doyoung, Jang Joel, Shin Joongbo, Seo Minjoon
- Transcending Scaling Laws With 0.1% Extra Compute Yi Tay, Jason Wei, Hyung Won Chung, Vinh Q. Tran, David R. So, Siamak Shakeri, Xavier Garcia, Huaixiu Steven Zheng, Jinfeng Rao, Aakanksha Chowdhery, Denny Zhou, Donald Metzler, Slav Petrov, Neil Houlsby, Quoc V. Le, Mostafa Dehghani
- UL2: Unifying Language Learning Paradigms Yi Tay, Mostafa Dehghani, Vinh Q. Tran, Xavier Garcia, Jason Wei, Xuezhi Wang, Hyung Won Chung, Siamak Shakeri, Dara Bahri, Tal Schuster, Huaixiu Steven Zheng, Denny Zhou, Neil Houlsby, Donald Metzler
- Active Example Selection For In-context Learning Yiming Zhang, Shi Feng, Chenhao Tan
- Categorizing Semantic Representations For Neural Machine Translation Yin Yongjing, Li Yafu, Meng Fandong, Zhou Jie, Zhang Yue
- Super-naturalinstructions: Generalization Via Declarative Instructions On 1600+ NLP Tasks Yizhong Wang, Swaroop Mishra, Pegah Alipoormolabashi, Yeganeh Kordi, Amirreza Mirzaei, Anjana Arunkumar, Arjun Ashok, Arut Selvan Dhanasekaran, Atharva Naik, David Stap, Eshaan Pathak, Giannis Karamanolakis, Haizhi Gary Lai, Ishan Purohit, Ishani Mondal, Jacob Anderson, Kirby Kuznia, Krima Doshi, Maitreya Patel, Kuntal Kumar Pal, Mehrad Moradshahi, Mihir Parmar, Mirali Purohit, Neeraj Varshney, Phani Rohitha Kaza, Pulkit Verma, Ravsehaj Singh Puri, Rushang Karia, Shailaja Keyur Sampat, Savan Doshi, Siddhartha Mishra, Sujan Reddy, Sumanta Patro, Tanay Dixit, Xudong Shen, Chitta Baral, Yejin Choi, Noah A. Smith, Hannaneh Hajishirzi, Daniel Khashabi
- BLOOM+1: Adding Language Support To BLOOM For Zero-shot Prompting Yong Zheng-xin, Schoelkopf Hailey, Muennighoff Niklas, Aji Alham Fikri, Adelani David Ifeoluwa, Almubarak Khalid, Bari M Saiful, Sutawika Lintang, Kasai Jungo, Baruwa Ahmed, Winata Genta Indra, Biderman Stella, Raff Edward, Radev Dragomir, Nikoulina Vassilina
- Efficient Language Modeling With Sparse All-mlp Yu Ping, Artetxe Mikel, Ott Myle, Shleifer Sam, Gong Hongyu, Stoyanov Ves, Li Xian
- KRLS: Improving End-to-end Response Generation In Task Oriented Dialog With Reinforced Keywords Learning Yu Xiao, Wu Qingyang, Qian Kun, Yu Zhou
- Hype: Better Pre-trained Language Model Fine-tuning With Hidden Representation Perturbation Yuan Hongyi, Yuan Zheng, Tan Chuanqi, Huang Fei, Huang Songfang
- Wudaomm: A Large-scale Multi-modal Dataset For Pre-training Models Yuan Sha, Zhao Shuai, Leng Jiahong, Xue Zhao, Zhao Hanyu, Liu Peiyu, Gong Zheng, Zhao Wayne Xin, Li Junyi, Tang Jie
- Can Pretrained Language Models (yet) Reason Deductively? Yuan Zhangdie, Hu Songbo, Vulić Ivan, Korhonen Anna, Meng Zaiqiao
- Competition-level Code Generation With Alphacode Yujia Li, David Choi, Junyoung Chung, Nate Kushman, Julian Schrittwieser, Rémi Leblond, Tom Eccles, James Keeling, Felix Gimeno, Agustin Dal Lago, Thomas Hubert, Peter Choy, Cyprien De Masson D'autume, Igor Babuschkin, Xinyun Chen, Po-sen Huang, Johannes Welbl, Sven Gowal, Alexey Cherepanov, James Molloy, Daniel J. Mankowitz, Esme Sutherland Robson, Pushmeet Kohli, Nando De Freitas, Koray Kavukcuoglu, Oriol Vinyals
- Bridging Cross-lingual Gaps During Leveraging The Multilingual Sequence-to-sequence Pretraining For Text Generation And Understanding Zan Changtong, Ding Liang, Shen Li, Cao Yu, Liu Weifeng, Tao Dacheng
- Better Pre-training By Reducing Representation Confusion Zhang Haojie, Liang Mingfei, Xie Ruobing, Sun Zhenlong, Zhang Bo, Lin Leyu
- Improved Data Augmentation For Translation Suggestion Zhang Hongxiao, Lai Siyu, Zhang Songming, Huang Hui, Chen Yufeng, Xu Jinan, Liu Jian
- Scala: Accelerating Adaptation Of Pre-trained Transformer-based Language Models Via Efficient Large-batch Adversarial Noise Zhang Minjia, Naresh Niranjan Uma, He Yuxiong
- SMDT: Selective Memory-augmented Neural Document Translation Zhang Xu, Yang Jian, Huang Haoyang, Ma Shuming, Zhang Dongdong, Li Jinlong, Wei Furu
- Unveiling Transformers With LEGO: A Synthetic Reasoning Task Zhang Yi, Backurs Arturs, Bubeck Sébastien, Eldan Ronen, Gunasekar Suriya, Wagner Tal
- BLISS: Robust Sequence-to-sequence Learning Via Self-supervised Input Representation Zhang Zheng, Ding Liang, Cheng Dazhao, Liu Xuebo, Zhang Min, Tao Dacheng
- Can Offline Reinforcement Learning Help Natural Language Understanding? Zhang Ziqi, Wang Yile, Zhang Yue, Wang Donglin
- M-adapter: Modality Adaptation For End-to-end Speech-to-text Translation Zhao Jinming, Yang Hao, Shareghi Ehsan, Haffari Gholamreza
- Attend, Memorize And Generate: Towards Faithful Table-to-text Generation In Few Shots Zhao Wenting, Liu Ye, Wan Yao, Yu Philip S.
- Learning Video Representations From Large Language Models Zhao Yue, Misra Ishan, Krähenbühl Philipp, Girdhar Rohit
- E2S2: Encoding-enhanced Sequence-to-sequence Pretraining For Language Understanding And Generation Zhong Qihuang, Ding Liang, Liu Juhua, Du Bo, Tao Dacheng
- Training Language Models With Memory Augmentation Zhong Zexuan, Lei Tao, Chen Danqi
- Learning To Decompose: Hypothetical Question Decomposition Based On Comparable Texts Zhou Ben, Richardson Kyle, Yu Xiaodong, Roth Dan
- Reflection Of Thought: Inversely Eliciting Numerical Reasoning In Language Models Via Solving Linear Systems Zhou Fan, Dong Haoyu, Liu Qian, Cheng Zhoujun, Han Shi, Zhang Dongmei
- Binding Language Models In Symbolic Languages Zhoujun Cheng, Tianbao Xie, Peng Shi, Chengzu Li, Rahul Nadkarni, Yushi Hu, Caiming Xiong, Dragomir Radev, Mari Ostendorf, Luke Zettlemoyer, Noah A. Smith, Tao Yu
- Syntax-informed Question Answering With Heterogeneous Graph Transformer Zhu Fangyi, Tan Lok You, Ng See-kiong, Bressan Stéphane
- KPT: Keyword-guided Pre-training For Grounded Dialog Generation Zhu Qi, Mi Fei, Zhang Zheng, Wang Yasheng, Li Yitong, Jiang Xin, Liu Qun, Zhu Xiaoyan, Huang Minlie
- Predicting Fine-tuning Performance With Probing Zhu Zining, Shahtalebi Soroosh, Rudzicz Frank
- Exploring And Evaluating Personalized Models For Code Generation Zlotchevski Andrei, Drain Dawn, Svyatkovskiy Alexey, Clement Colin, Sundaresan Neel, Tufano Michele
- A Survey On GPT-3 Zong Mingyu, Krishnamachari Bhaskar
- St-moe: Designing Stable And Transferable Sparse Expert Models Zoph Barret, Bello Irwan, Kumar Sameer, Du Nan, Huang Yanping, Dean Jeff, Shazeer Noam, Fedus William
- Efficient Long Sequence Modeling Via State Space Augmented Transformer Zuo Simiao, Liu Xiaodong, Jiao Jian, Charles Denis, Manavoglu Eren, Zhao Tuo, Gao Jianfeng
- Methods For Estimating And Improving Robustness Of Language Models Štefánik Michal
- LM-CPPF: Paraphrasing-guided Data Augmentation For Contrastive Prompt-based Few-shot Fine-tuning Abaskohi Amirhossein, Rothe Sascha, Yaghoobzadeh Yadollah
- Attention Over Pre-trained Sentence Embeddings For Long Document Classification Abdaoui Amine, Dutta Sourav
- Inpars Toolkit: A Unified And Reproducible Synthetic Data Generation Pipeline For Neural Information Retrieval Abonizio Hugo, Bonifacio Luiz, Jeronymo Vitor, Lotufo Roberto, Zavrel Jakub, Nogueira Rodrigo
- Leveraging Vision-language Models For Improving Domain Generalization In Image Classification Addepalli Sravanti, Asokan Ashish Ramayee, Sharma Lakshay, Babu R. Venkatesh
- Multi-party Goal Tracking With Llms: Comparing Pre-training, Fine-tuning, And Prompt Engineering Addlesee Angus, Sieińska Weronika, Gunson Nancie, Garcia Daniel Hernández, Dondrup Christian, Lemon Oliver
- Bring Your Own KG: Self-supervised Program Synthesis For Zero-shot KGQA Agarwal Dhruv, Das Rajarshi, Khosla Sopan, Gangadharaiah Rashmi
- Can Large Language Models Be Good Path Planners? A Benchmark And Investigation On Spatial-temporal Reasoning Aghzal Mohamed, Plaku Erion, Yao Ziyu
- Simul-llm: A Framework For Exploring High-quality Simultaneous Translation With Large Language Models Agostinelli Victor, Wild Max, Raffel Matthew, Fuad Kazi Ahmed Asif, Chen Lizhong
- Story Visualization By Online Text Augmentation With Context Memory Ahn Daechul, Kim Daneul, Song Gwangmo, Kim Seung Hwan, Lee Honglak, Kang Dongyeop, Choi Jonghyun
- Jointly Training Large Autoregressive Multimodal Models Aiello Emanuele, Yu Lili, Nie Yixin, Aghajanyan Armen, Oguz Barlas
- Multimodal Representations For Teacher-guided Compositional Visual Reasoning Aissa Wafa, Ferecatu Marin, Crucianu Michel
- The Emergence Of Essential Sparsity In Large Pre-trained Models: The Weights That Matter Ajay Jaiswal, Shiwei Liu, Tianlong Chen, Zhangyang Wang
- RL4F: Generating Natural Language Feedback With Reinforcement Learning For Repairing Model Outputs Akyürek Afra Feyza, Akyürek Ekin, Madaan Aman, Kalyan Ashwin, Clark Peter, Wijaya Derry, Tandon Niket
- Can Language Models Employ The Socratic Method? Experiments With Code Debugging Al-hossami Erfan, Bunescu Razvan, Smith Justin, Teehan Ryan
- Mamba: Linear-time Sequence Modeling With Selective State Spaces Albert Gu, Tri Dao
- Tokenizer Choice For LLM Training: Negligible Or Crucial? Ali Mehdi, Fromm Michael, Thellmann Klaudia, Rutmann Richard, Lübbering Max, Leveling Johannes, Klug Katrin, Ebert Jan, Doll Niclas, Buschhoff Jasper Schulze, Jain Charvi, Weber Alexander Arno, Jurkschat Lena, Abdelwahab Hammam, John Chelsea, Suarez Pedro Ortiz, Ostendorff Malte, Weinbach Samuel, Sifa Rafet, Kesselheim Stefan, Flores-herr Nicolas
- Physics Of Language Models: Part 3.2, Knowledge Manipulation Allen-zhu Zeyuan, Li Yuanzhi
- The Falcon Series Of Open Language Models Almazrouei Ebtesam, Alobeidli Hamza, Alshamsi Abdulaziz, Cappelli Alessandro, Cojocaru Ruxandra, Debbah Mérouane, Goffinet Étienne, Hesslow Daniel, Launay Julien, Malartic Quentin, Mazzotta Daniele, Noune Badreddine, Pannier Baptiste, Penedo Guilherme
- Pixt3: Pixel-based Table-to-text Generation Alonso Iñigo, Agirre Eneko, Lapata Mirella
- Is BERT Blind? Exploring The Effect Of Vision-and-language Pretraining On Visual Language Understanding Alper Morris, Fiman Michael, Averbuch-elor Hadar
- Becoming Self-instruct: Introducing Early Stopping Criteria For Minimal Instruct Tuning Alshikh Waseem, Daaboul Manhal, Goddard Kirk, Imel Brock, Kamble Kiran, Kulkarni Parikshith, Russak Melisa
- Steering Large Language Models For Machine Translation With Finetuning And In-context Learning Alves Duarte M., Guerreiro Nuno M., Alves João, Pombal José, Rei Ricardo, De Souza José G. C., Colombo Pierre, Martins André F. T.
- Taqyim: Evaluating Arabic NLP Tasks Using Chatgpt Models Alyafeai Zaid, Alshaibani Maged S., Alkhamissi Badr, Luqman Hamzah, Alareqi Ebrahim, Fadel Ali
- Knowledge Of Knowledge: Exploring Known-unknowns Uncertainty With Large Language Models Amayuelas Alfonso, Wong Kyle, Pan Liangming, Chen Wenhu, Wang William
- Prompt-engineering And Transformer-based Question Generation And Evaluation Amyeen Rubaba
- Does Deep Learning Learn To Abstract? A Systematic Probing Framework An Shengnan, Lin Zeqi, Chen Bei, Fu Qiang, Zheng Nanning, Lou Jian-guang
- Learning From Mistakes Makes LLM Better Reasoner An Shengnan, Ma Zexiong, Lin Zeqi, Zheng Nanning, Lou Jian-guang, Chen Weizhu
- Studying The Impacts Of Pre-training Using Chatgpt-generated Text On Downstream Tasks Anand Sarthak
- Expel: LLM Agents Are Experiential Learners Andrew Zhao, Daniel Huang, Quentin Xu, Matthieu Lin, Yong-jin Liu, Gao Huang
- Randomized Positional Encodings Boost Length Generalization Of Transformers Anian Ruoss, Grégoire Delétang, Tim Genewein, Jordi Grau-moya, Róbert Csordás, Mehdi Bennani, Shane Legg, Joel Veness
- Distilling Efficient Language-specific Models For Cross-lingual Transfer Ansell Alan, Ponti Edoardo Maria, Korhonen Anna, Vulić Ivan
- Scaling Laws For Language Encoding Models In Fmri Antonello Richard, Vaidya Aditya, Huth Alexander G.
- Mixture Of Tokens: Efficient Llms Through Cross-example Aggregation Antoniak Szymon, Jaszczur Sebastian, Krutul Michał, Pióro Maciej, Krajewski Jakub, Ludziejewski Jan, Odrzygóźdź Tomasz, Cygan Marek
- Resurrecting Recurrent Neural Networks For Long Sequences Antonio Orvieto, Samuel L Smith, Albert Gu, Anushan Fernando, Caglar Gulcehre, Razvan Pascanu, Soham De
- A Memory Model For Question Answering From Streaming Data Supported By Rehearsal And Anticipation Of Coreference Information Araujo Vladimir, Soto Alvaro, Moens Marie-francine
- Slimfit: Memory-efficient Fine-tuning Of Transformer-based Models Using Training Dynamics Ardakani Arash, Haan Altan, Tan Shangyin, Popovici Doru Thom, Cheung Alvin, Iancu Costin, Sen Koushik
- Llm2loss: Leveraging Language Models For Explainable Model Diagnostics Ardeshir Shervin
- BUFFET: Benchmarking Large Language Models For Few-shot Cross-lingual Transfer Asai Akari, Kudugunta Sneha, Yu Xinyan Velocity, Blevins Terra, Gonen Hila, Reid Machel, Tsvetkov Yulia, Ruder Sebastian, Hajishirzi Hannaneh
- Using Large Language Models For Qualitative Analysis Can Introduce Serious Bias Ashwin Julian, Chhabra Aditya, Rao Vijayendra
- Unihd At TSAR-2022 Shared Task: Is Compute All We Need For Lexical Simplification? Aumiller Dennis, Gertz Michael
- Openflamingo: An Open-source Framework For Training Large Autoregressive Vision-language Models Awadalla Anas, Gao Irena, Gardner Josh, Hessel Jack, Hanafy Yusuf, Zhu Wanrong, Marathe Kalyani, Bitton Yonatan, Gadre Samir, Sagawa Shiori, Jitsev Jenia, Kornblith Simon, Koh Pang Wei, Ilharco Gabriel, Wortsman Mitchell, Schmidt Ludwig
- Using Large Language Models For Zero-shot Natural Language Generation From Knowledge Graphs Axelsson Agnes, Skantze Gabriel
- RRAML: Reinforced Retrieval Augmented Machine Learning Bacciu Andrea, Cuconasu Florin, Siciliano Federico, Silvestri Fabrizio, Tonellotto Nicola, Trappolini Giovanni
- Leftover Lunch: Advantage-based Offline Reinforcement Learning For Language Models Baheti Ashutosh, Lu Ximing, Brahman Faeze, Bras Ronan Le, Sap Maarten, Riedl Mark
- Constituency Parsing Using Llms Bai Xuefeng, Wu Jialong, Chen Yulong, Wang Zhongqing, Zhang Yue
- Longbench: A Bilingual, Multitask Benchmark For Long Context Understanding Bai Yushi, Lv Xin, Zhang Jiajie, Lyu Hongchang, Tang Jiankai, Huang Zhidian, Du Zhengxiao, Liu Xiao, Zeng Aohan, Hou Lei, Dong Yuxiao, Tang Jie, Li Juanzi
- Tamil-llama: A New Tamil Language Model Based On Llama 2 Balachandran Abhinand
- No That's Not What I Meant: Handling Third Position Repair In Conversational Question Answering Balaraman Vevake, Eshghi Arash, Konstas Ioannis, Papaioannou Ioannis
- Fine-tuning Large Enterprise Language Models Via Ontological Reasoning Baldazzi Teodoro, Bellomarini Luigi, Ceri Stefano, Colombo Andrea, Gentili Andrea, Sallinger Emanuel
- Investigating Pre-trained Language Models On Cross-domain Datasets, A Step Closer To General AI Ballout Mohamad, Krumnack Ulf, Heidemann Gunther, Kühnberger Kai-uwe
- Vtrain: A Simulation Framework For Evaluating Cost-effective And Compute-optimal Large Language Model Training Bang Jehyeon, Choi Yujeong, Kim Myeongwoo, Kim Yongdeok, Rhu Minsoo
- A Bi-step Grounding Paradigm For Large Language Models In Recommendation Systems Bao Keqin, Zhang Jizhi, Wang Wenjie, Zhang Yang, Yang Zhengyi, Luo Yancheng, Chen Chong, Feng Fuli, Tian Qi
- Tallrec: An Effective And Efficient Tuning Framework To Align Large Language Model With Recommendation Bao Keqin, Zhang Jizhi, Zhang Yang, Wang Wenjie, Feng Fuli, He Xiangnan
- Assessing And Enhancing The Robustness Of Large Language Models With Task Structure Variations For Logical Reasoning Bao Qiming, Gendron Gael, Peng Alex Yuxuan, Zhong Wanjun, Tan Neset, Chen Yang, Witbrock Michael, Liu Jiamou
- Abstract Meaning Representation-based Logic-driven Data Augmentation For Logical Reasoning Bao Qiming, Peng Alex Yuxuan, Deng Zhenyun, Zhong Wanjun, Gendron Gael, Pistotti Timothy, Tan Neset, Young Nathan, Chen Yang, Zhu Yonghua, Denny Paul, Witbrock Michael, Liu Jiamou
- Disc-medllm: Bridging General Large Language Models And Real-world Medical Consultation Bao Zhijie, Chen Wei, Xiao Shengze, Ren Kuang, Wu Jiaao, Zhong Cheng, Peng Jiajie, Huang Xuanjing, Wei Zhongyu
- Do Language Models Learn About Legal Entity Types During Pretraining? Barale Claire, Rovatsos Michael, Bhuta Nehal
- Understanding Telecom Language Through Large Language Models Bariah Lina, Zou Hang, Zhao Qiyang, Mouhouche Belkacem, Bader Faouzi, Debbah Merouane
- Self-consistency Of Large Language Models Under Ambiguity Bartsch Henning, Jorgensen Ole, Rosati Domenic, Hoelscher-obermaier Jason, Pfau Jacob
- Efficient Pre-training For Localized Instruction Generation Of Videos Batra Anil, Moltisanti Davide, Sevilla-lara Laura, Rohrbach Marcus, Keller Frank
- Exploring The Landscape Of Large Language Models In Medical Question Answering Bean Andrew M., Korgul Karolina, Krones Felix, Mccraith Robert, Mahdi Adam
- Span-selective Linear Attention Transformers For Effective And Robust Schema-guided Dialogue State Tracking Bebensee Björn, Lee Haejun
- The Poison Of Alignment Bekbayev Aibek, Chun Sungbae, Dulat Yerzat, Yamazaki James
- Examining The Emergence Of Deductive Reasoning In Generative Language Models Belcak Peter, Lanzendörfer Luca A., Wattenhofer Roger
- Harnessing Gpt-3.5-turbo For Rhetorical Role Prediction In Legal Cases Belfathi Anas, Hernandez Nicolas, Monceaux Laura
- Labrador: Exploring The Limits Of Masked Language Modeling For Laboratory Data Bellamy David R., Kumar Bhawesh, Wang Cindy, Beam Andrew
- When Automated Assessment Meets Automated Content Generation: Examining Text Quality In The Era Of Gpts Bevilacqua Marialena, Oketch Kezia, Qin Ruiyang, Stamey Will, Zhang Xinyuan, Gan Yi, Yang Kai, Abbasi Ahmed
- A Survey On Prompting Techniques In Llms Bhandari Prabin
- Pre-training Llms Using Human-like Development Data Corpus Bhardwaj Khushi, Shah Raj Sanjay, Varma Sashank
- Language Model Unalignment: Parametric Red-teaming To Expose Hidden Harms And Biases Bhardwaj Rishabh, Poria Soujanya
- AUTODIAL: Efficient Asynchronous Task-oriented Dialogue Model Bhargava Prajjwal, Amini Pooyan, Shayandeh Shahin, Sankar Chinnadhurai
- TART: A Plug-and-play Transformer Module For Task-agnostic Reasoning Bhatia Kush, Narayan Avanika, De Sa Christopher, Ré Christopher
- Exploring Large Language Models For Code Explanation Bhattacharya Paheli, Chakraborty Manojit, Palepu Kartheek N S N, Pandey Vikas, Dindorkar Ishan, Rajpurohit Rakesh, Gupta Rishabh
- Helm: Highlighted Evidence Augmented Language Model For Enhanced Table-to-text Generation Bian Junyi, Qin Xiaolei, Zou Wuhe, Huang Mengzuo, Luo Congyi, Zhang Ke, Zhang Weidong
- Safety-tuned Llamas: Lessons From Improving The Safety Of Large Language Models That Follow Instructions Bianchi Federico, Suzgun Mirac, Attanasio Giuseppe, Röttger Paul, Jurafsky Dan, Hashimoto Tatsunori, Zou James
- Birth Of A Transformer: A Memory Viewpoint Bietti Alberto, Cabannes Vivien, Bouchacourt Diane, Jegou Herve, Bottou Leon
- BLT: Can Large Language Models Handle Basic Legal Text? Blair-stanek Andrew, Holzenberger Nils, Van Durme Benjamin
- RWKV: Reinventing Rnns For The Transformer Era Bo Peng, Eric Alcaide, Quentin Anthony, Alon Albalak, Samuel Arcadinho, Stella Biderman, Huanqi Cao, Xin Cheng, Michael Chung, Matteo Grella, Kranthi Kiran Gv, Xuzheng He, Haowen Hou, Jiaju Lin, Przemyslaw Kazienko, Jan Kocon, Jiaming Kong, Bartlomiej Koptyra, Hayden Lau, Krishna Sri Ipsit Mantri, Ferdinand Mom, Atsushi Saito, Guangyu Song, Xiangru Tang, Bolun Wang, Johan S. Wind, Stanislaw Wozniak, Ruichong Zhang, Zhenyuan Zhang, Qihang Zhao, Peng Zhou, Qinghua Zhou, Jian Zhu, Rui-jie Zhu
- Pixlore: A Dataset-driven Approach To Rich Image Captioning Bonilla Diego
- Inpars-light: Cost-effective Unsupervised Training Of Efficient Rankers Boytsov Leonid, Patel Preksha, Sourabh Vivek, Nisar Riddhi, Kundu Sayani, Ramanathan Ramya, Nyberg Eric
- Evaluating Large Language Models For Document-grounded Response Generation In Information-seeking Dialogues Braunschweiler Norbert, Doddipatla Rama, Keizer Simon, Stoyanchev Svetlana
- Extractgpt: Exploring The Potential Of Large Language Models For Product Attribute Value Extraction Brinkmann Alexander, Shraga Roee, Bizer Christian
- Utilizing Chatgpt Generated Data To Retrieve Depression Symptoms From Social Media Bucur Ana-maria
- Local Large Language Models For Complex Structured Medical Tasks Bumgardner V. K. Cody, Mullen Aaron, Armstrong Sam, Hickey Caylin, Talbert Jeff
- A Comparative Study Of Code Generation Using Chatgpt 3.5 Across 10 Programming Languages Buscemi Alessio
- Just Tell Me: Prompt Engineering In Business Process Management Busch Kiran, Rochlitzer Alexander, Sola Diana, Leopold Henrik
- Automatic Construction Of A Korean Toxic Instruction Dataset For Ethical Tuning Of Large Language Models Byun Sungjoo, Jang Dongjun, Jo Hyemi, Shin Hyopil
- Peek Across: Improving Multi-document Modeling Via Cross-document Question-answering Caciularu Avi, Peters Matthew E., Goldberger Jacob, Dagan Ido, Cohan Arman
- Scaling In-context Demonstrations With Structured Attention Cai Tianle, Huang Kaixuan, Lee Jason D., Wang Mengdi
- A Systematic Study Of Knowledge Distillation For Natural Language Generation With Pseudo-target Training Calderon Nitay, Mukherjee Subhabrata, Reichart Roi, Kantor Amir
- Oberta: Improving Sparse Transfer Learning Via Improved Initialization, Distillation, And Pruning Regimes Campos Daniel, Marques Alexandre, Kurtz Mark, Zhai Chengxiang
- Wizardlm: Empowering Large Language Models To Follow Complex Instructions Can Xu, Qingfeng Sun, Kai Zheng, Xiubo Geng, Pu Zhao, Jiazhan Feng, Chongyang Tao, Daxin Jiang
- H2O Open Ecosystem For State-of-the-art Large Language Models Candel Arno, Mckinney Jon, Singer Philipp, Pfeiffer Pascal, Jeblick Maximilian, Lee Chun Ming, Conde Marcos V.
- Defending Against Alignment-breaking Attacks Via Robustly Aligned LLM Cao Bochuan, Cao Yuanpu, Lin Lu, Chen Jinghui
- Instructmol: Multi-modal Integration For Building A Versatile And Reliable Molecular Assistant In Drug Discovery Cao He, Liu Zijing, Lu Xingyu, Yao Yuan, Li Yu
- Successor Features For Efficient Multisubject Controlled Text Generation Cao Meng, Fatemi Mehdi, Cheung Jackie Chi Kit, Shabanian Samira
- Systematic Rectification Of Language Models Via Dead-end Analysis Cao Meng, Fatemi Mehdi, Cheung Jackie Chi Kit, Shabanian Samira
- Pumer: Pruning And Merging Tokens For Efficient Vision Language Models Cao Qingqing, Paranjape Bhargavi, Hajishirzi Hannaneh
- Revolutionizing Mobile Interaction: Enabling A 3 Billion Parameter GPT LLM On Mobile Carreira Samuel, Marques Tomás, Ribeiro José, Grilo Carlos
- Promoting Generalized Cross-lingual Question Answering In Few-resource Scenarios Via Self-knowledge Distillation Carrino Casimiro Pio, Escolano Carlos, Fonollosa José A. R.
- Knowledge Transfer From High-resource To Low-resource Programming Languages For Code Llms Cassano Federico, Gouwar John, Lucchetti Francesca, Schlesinger Claire, Freeman Anders, Anderson Carolyn Jane, Feldman Molly Q, Greenberg Michael, Jangda Abhinav, Guha Arjun
- Can It Edit? Evaluating The Ability Of Large Language Models To Follow Code Editing Instructions Cassano Federico, Li Luisa, Sethi Akul, Shinn Noah, Brennan-jones Abby, Ginesin Jacob, Berman Edward, Chakhnashvili George, Lozhkov Anton, Anderson Carolyn Jane, Guha Arjun
- Do Androids Know They're Only Dreaming Of Electric Sheep? Ch-wang Sky, Van Durme Benjamin, Eisner Jason, Kedzie Chris
- INT2.1: Towards Fine-tunable Quantized Large Language Models With Error Correction Through Low-rank Adaptation Chai Yuji, Gkountouras John, Ko Glenn G., Brooks David, Wei Gu-yeon
- Which Spurious Correlations Impact Reasoning In NLI Models? A Visual Interactive Diagnosis Through Data-constrained Counterfactuals Chan Robin, Amini Afra, El-assady Mennatallah
- Learning To Generate Better Than Your LLM Chang Jonathan D., Brantley Kiante, Ramamurthy Rajkumar, Misra Dipendra, Sun Wen
- Prompting And Adapter Tuning For Self-supervised Encoder-decoder Speech Model Chang Kai-wei, Chen Ming-hsin, Lin Yun-ping, Hsu Jing Neng, Huang Paul Kuo-ming, Huang Chien-yu, Li Shang-wen, Lee Hung-yi
- Chipgpt: How Far Are We From Natural Language Hardware Design Chang Kaiyan, Wang Ying, Ren Haimeng, Wang Mengdi, Liang Shengwen, Han Yinhe, Li Huawei, Li Xiaowei
- Language Model Behavior: A Comprehensive Survey Chang Tyler A., Bergen Benjamin K.
- Characterizing Learning Curves During Language Model Pre-training: Learning, Forgetting, And Stability Chang Tyler A., Tu Zhuowen, Bergen Benjamin K.
- Not All Layers Are Equally As Important: Every Layer Counts BERT Charpentier Lucas Georges Gabriel, Samuel David
- She Had Cobalt Blue Eyes: Prompt Testing To Create Aligned And Sustainable Language Models Chatrath Veronica, Bamgbose Oluwanifemi, Raza Shaina
- It's All Relative! -- A Synthetic Query Generation Approach For Improving Zero-shot Relevance Prediction Chaudhary Aditi, Raman Karthik, Bendersky Michael
- Efficient GPT Model Pre-training Using Tensor Train Matrix Representation Chekalina Viktoriia, Novikov Georgii, Gusak Julia, Oseledets Ivan, Panchenko Alexander
- Sudden Drops In The Loss: Syntax Acquisition, Phase Transitions, And Simplicity Bias In Mlms Chen Angelica, Shwartz-ziv Ravid, Cho Kyunghyun, Leavitt Matthew L., Saphra Naomi
- Fireact: Toward Language Agent Fine-tuning Chen Baian, Shu Chang, Shareghi Ehsan, Collier Nigel, Narasimhan Karthik, Yao Shunyu
- Visual Instruction Tuning With Polite Flamingo Chen Delong, Liu Jianfeng, Dai Wenliang, Wang Baoyuan
- Generate Labeled Training Data Using Prompt Programming And GPT-3. An Example Of Big Five Personality Classification Chen Eason
- X-LLM: Bootstrapping Advanced Large Language Models By Treating Multi-modalities As Foreign Languages Chen Feilong, Han Minglun, Zhao Haozhi, Zhang Qingyang, Shi Jing, Xu Shuang, Xu Bo
- Tem-adapter: Adapting Image-text Pretraining For Video Question Answer Chen Guangyi, Liu Xiao, Wang Guangrun, Zhang Kun, Torr Philip H. S., Zhang Xiao-ping, Tang Yansong
- CLEX: Continuous Length Extrapolation For Large Language Models Chen Guanzheng, Li Xin, Meng Zaiqiao, Liang Shangsong, Bing Lidong
- Videollm: Modeling Video Sequence With Large Language Models Chen Guo, Zheng Yin-dong, Wang Jiahao, Xu Jilan, Huang Yifei, Pan Junting, Wang Yi, Wang Yali, Qiao Yu, Lu Tong, Wang Limin
- Chatgpt's One-year Anniversary: Are Open-source Large Language Models Catching Up? Chen Hailin, Jiao Fangkai, Li Xingxuan, Qin Chengwei, Ravaut Mathieu, Zhao Ruochen, Xiong Caiming, Joty Shafiq
- Maybe Only 0.5% Data Is Needed: A Preliminary Exploration Of Low Training Data Instruction Tuning Chen Hao, Zhang Yiming, Zhang Qi, Yang Hantao, Hu Xiaomeng, Ma Xuetao, Yanggong Yifan, Zhao Junbo
- Interpreting And Controlling Vision Foundation Models Via Text Explanations Chen Haozhe, Yang Junfeng, Vondrick Carl, Mao Chengzhi
- Skills-in-context Prompting: Unlocking Compositionality In Large Language Models Chen Jiaao, Pan Xiaoman, Yu Dian, Song Kaiqiang, Wang Xiaoyang, Yu Dong, Chen Jianshu
- Unlearn What You Want To Forget: Efficient Unlearning For Llms Chen Jiaao, Yang Diyi
- Schema-guided Semantic Accuracy: Faithfulness In Task-oriented Dialogue Response Generation Chen Jinghong, Lin Weizhe, Byrne Bill
- Quantifying Uncertainty In Answers From Any Language Model And Enhancing Their Trustworthiness Chen Jiuhai, Mueller Jonas
- Huatuogpt-ii, One-stage Training For Medical Adaption Of Llms Chen Junying, Wang Xidong, Gao Anningzhe, Jiang Feng, Chen Shunian, Zhang Hongbo, Song Dingjie, Xie Wenya, Kong Chuyi, Li Jianquan, Wan Xiang, Li Haizhou, Wang Benyou
- Epi-curriculum: Episodic Curriculum Learning For Low-resource Domain Adaptation In Neural Machine Translation Chen Keyu, Zhuang Di, Li Mingchen, Chang J. Morris
- Compcodevet: A Compiler-guided Validation And Enhancement Approach For Code Dataset Chen Le, Bhattacharjee Arijit, Ahmed Nesreen K., Hasabnis Niranjan, Oren Gal, Lei Bin, Jannesari Ali
- Alpagasus: Training A Better Alpaca With Fewer Data Chen Lichang, Li Shiyang, Yan Jun, Wang Hai, Gunaratna Kalpa, Yadav Vikas, Tang Zheng, Srinivasan Vijay, Zhou Tianyi, Huang Heng, Jin Hongxia
- Introspective Tips: Large Language Model For In-context Decision Making Chen Liting, Wang Lu, Dong Hang, Du Yali, Yan Jie, Yang Fangkai, Li Shuang, Zhao Pu, Qin Si, Rajmohan Saravan, Lin Qingwei, Zhang Dongmei
- Driving With Llms: Fusing Object-level Vector Modality For Explainable Autonomous Driving Chen Long, Sinavski Oleg, Hünermann Jan, Karnsund Alice, Willmott Andrew James, Birch Danny, Maund Daniel, Shotton Jamie
- Natural Response Generation For Chinese Reading Comprehension Chen Nuo, Li Hongguang, Bao Yinan, Wang Baoyuan, Li Jia
- Breaking Language Barriers In Multilingual Mathematical Reasoning: Insights And Observations Chen Nuo, Zheng Zinan, Wu Ning, Gong Ming, Song Yangqiu, Zhang Dongmei, Li Jia
- Monolingual Or Multilingual Instruction Tuning: Which Makes A Better Alpaca Chen Pinzhen, Ji Shaoxiong, Bogoychev Nikolay, Kutuzov Andrey, Haddow Barry, Heafield Kenneth
- Evaluation Of Chatgpt Family Of Models For Biomedical Reasoning And Classification Chen Shan, Li Yingya, Lu Sheng, Van Hoang, Aerts Hugo Jwl, Savova Guergana K., Bitterman Danielle S.
- Extending Context Window Of Large Language Models Via Positional Interpolation Chen Shouyuan, Wong Sherman, Chen Liangjian, Tian Yuandong
- MMICT: Boosting Multi-modal Fine-tuning With In-context Examples Chen Tao, Zhang Enwei, Gao Yuting, Li Ke, Sun Xing, Zhang Yan, Li Hui, Ji Rongrong
- Lorashear: Efficient Large Language Model Structured Pruning And Knowledge Recovery Chen Tianyi, Ding Tianyu, Yadav Badal, Zharkov Ilya, Liang Luming
- Self-icl: Zero-shot In-context Learning With Self-generated Demonstrations Chen Wei-lin, Wu Cheng-kuang, Chen Yun-nung, Chen Hsin-hsi
- Challenges And Contributing Factors In The Utilization Of Large Language Models (llms) Chen Xiaoliang, Li Liangbin, Chang Le, Huang Yunhe, Zhao Yuxuan, Zhang Yuxiao, Li Dinuo
- The Janus Interface: How Fine-tuning In Large Language Models Amplifies The Privacy Risks Chen Xiaoyi, Tang Siyuan, Zhu Rui, Yan Shijun, Jin Lei, Wang Zihao, Su Liya, Zhang Zhikun, Wang Xiaofeng, Tang Haixu
- Minprompt: Graph-based Minimal Prompt Data Augmentation For Few-shot Question Answering Chen Xiusi, Jiang Jyun-yu, Chang Wei-cheng, Hsieh Cho-jui, Yu Hsiang-fu, Wang Wei
- Translation And Fusion Improves Zero-shot Cross-lingual Information Extraction Chen Yang, Shah Vedaant, Ritter Alan
- Measuring And Improving Chain-of-thought Reasoning In Vision-language Models Chen Yangyi, Sikka Karan, Cogswell Michael, Ji Heng, Divakaran Ajay
- EE-LLM: Large-scale Training And Inference Of Early-exit Large Language Models With 3D Parallelism Chen Yanxi, Pan Xuchen, Li Yaliang, Ding Bolin, Zhou Jingren
- Tigerbot: An Open Multilingual Multitask LLM Chen Ye, Cai Wei, Wu Liangmin, Li Xiaowei, Xin Zhanxuan, Fu Cong
- NL2TL: Transforming Natural Languages To Temporal Logics Using Large Language Models Chen Yongchao, Gandhi Rujul, Zhang Yang, Fan Chuchu
- Dog-instruct: Towards Premium Instruction-tuning Data Via Text-grounded Instruction Wrapping Chen Yongrui, Jiang Haiyun, Huang Xinting, Shi Shuming, Qi Guilin
- Confidant: Customizing Transformer-based Llms Via Collaborative Edge Training Chen Yuhao, Yan Yuxuan, Yang Qianqian, Shu Yuanchao, He Shibo, Chen Jiming
- MEDITRON-70B: Scaling Medical Pretraining For Large Language Models Chen Zeming, Cano Alejandro Hernández, Romanou Angelika, Bonnet Antoine, Matoba Kyle, Salvi Francesco, Pagliardini Matteo, Fan Simin, Köpf Andreas, Mohtashami Amirkeivan, Sallinen Alexandre, Sakhaeirad Alireza, Swamy Vinitra, Krawczuk Igor, Bayazit Deniz, Marmet Axel, Montariol Syrielle, Hartley Mary-anne, Jaggi Martin, Bosselut Antoine
- SALM: Speech-augmented Language Model With In-context Learning For Speech Recognition And Translation Chen Zhehuai, Huang He, Andrusenko Andrei, Hrinchuk Oleksii, Puvvada Krishna C., Li Jason, Ghosh Subhankar, Balam Jagadeesh, Ginsburg Boris
- GENOME: Generative Neuro-symbolic Visual Reasoning By Growing And Reusing Modules Chen Zhenfang, Sun Rui, Liu Wenjun, Hong Yining, Gan Chuang
- See, Think, Confirm: Interactive Prompting Between Vision And Language Models For Knowledge-based Visual Reasoning Chen Zhenfang, Zhou Qinhong, Shen Yikang, Hong Yining, Zhang Hao, Gan Chuang
- UPRISE: Universal Prompt Retrieval For Improving Zero-shot Evaluation Cheng Daixuan, Huang Shaohan, Bi Junyu, Zhan Yuefeng, Liu Jianfeng, Wang Yujing, Sun Hao, Wei Furu, Deng Denvy, Zhang Qi
- Black-box Prompt Optimization: Aligning Large Language Models Without Model Training Cheng Jiale, Liu Xiao, Zheng Kehan, Ke Pei, Wang Hongning, Dong Yuxiao, Tang Jie, Huang Minlie
- Adversarial Preference Optimization: Enhancing Your Alignment Via RM-LLM Game Cheng Pengyu, Yang Yifan, Li Jian, Dai Yong, Hu Tianhao, Cao Peixin, Du Nan, Li Xiaolong
- SCALE: Synergized Collaboration Of Asymmetric Language Translation Engines Cheng Xin, Wang Xun, Ge Tao, Chen Si-qing, Wei Furu, Zhao Dongyan, Yan Rui
- Neural Codec Language Models Are Zero-shot Text To Speech Synthesizers Chengyi Wang, Sanyuan Chen, Yu Wu, Ziqiang Zhang, Long Zhou, Shujie Liu, Zhuo Chen, Yanqing Liu, Huaming Wang, Jinyu Li, Lei He, Sheng Zhao, Furu Wei
- Latent Positional Information Is In The Self-attention Variance Of Transformer Language Models Without Positional Embeddings Chi Ta-chung, Fan Ting-han, Chen Li-wei, Rudnicky Alexander I., Ramadge Peter J.
- Attention Alignment And Flexible Positional Embeddings Improve Transformer Length Extrapolation Chi Ta-chung, Fan Ting-han, Rudnicky Alexander I.
- M²chat: Empowering VLM For Multimodal LLM Interleaved Text-image Generation Chi Xiaowei, Zhang Rongyu, Jiang Zhengkai, Liu Yijiang, Wang Yatian, Qi Xingqun, Luo Wenhan, Gao Peng, Zhang Shanghang, Liu Qifeng, Guo Yike
- INSTRUCTEVAL: Towards Holistic Evaluation Of Instruction-tuned Large Language Models Chia Yew Ken, Hong Pengfei, Bing Lidong, Poria Soujanya
- Improving Zero-shot Reader By Reducing Distractions From Irrelevant Documents In Open-domain Question Answering Cho Sukmin, Seo Jeongyeon, Jeong Soyeong, Park Jong C.
- Do Llms Understand Social Knowledge? Evaluating The Sociability Of Large Language Models With Socket Benchmark Choi Minje, Pei Jiaxin, Kumar Sagar, Shu Chang, Jurgens David
- KCTS: Knowledge-constrained Tree Search Decoding With Token-level Hallucination Detection Choi Sehyun, Fang Tianqing, Wang Zhaowei, Song Yangqiu
- Language Grounded Qformer For Efficient Vision Language Understanding Choraria Moulik, Sekhar Nitesh, Wu Yue, Zhang Xu, Singhal Prateek, Varshney Lav R.
- Large Language Models For User Interest Journeys Christakopoulou Konstantina, Lalama Alberto, Adams Cj, Qu Iris, Amir Yifat, Chucri Samer, Vollucci Pierce, Soldo Fabio, Bseiso Dina, Scodel Sarah, Dixon Lucas, Chi Ed H., Chen Minmin
- Language And Task Arithmetic With Parameter-efficient Layers For Zero-shot Summarization Chronopoulou Alexandra, Pfeiffer Jonas, Maynez Joshua, Wang Xinyi, Ruder Sebastian, Agrawal Priyanka
- Data-centric Financial Large Language Models Chu Zhixuan, Guo Huaiyu, Zhou Xinyuan, Wang Yijia, Yu Fei, Chen Hong, Xu Wanqing, Lu Xin, Cui Qing, Li Longfei, Zhou Jun, Li Sheng
- Tutorials On Stance Detection Using Pre-trained Language Models: Fine-tuning BERT And Prompting Large Language Models Chuang Yun-shiuan
- Dola: Decoding By Contrasting Layers Improves Factuality In Large Language Models Chuang Yung-sung, Xie Yujia, Luo Hongyin, Kim Yoon, Glass James, He Pengcheng
- Long Story Short: A Summarize-then-search Method For Long Video Question Answering Chung Jiwan, Yu Youngjae
- Increasing Diversity While Maintaining Accuracy: Text Data Generation With Large Language Models And Human Interventions Chung John Joon Young, Kamar Ece, Amershi Saleema
- Instructtods: Large Language Models For End-to-end Task-oriented Dialogue Systems Chung Willy, Cahyawijaya Samuel, Wilie Bryan, Lovenia Holy, Fung Pascale
- LIMA: Less Is More For Alignment Chunting Zhou, Pengfei Liu, Puxin Xu, Srini Iyer, Jiao Sun, Yuning Mao, Xuezhe Ma, Avia Efrat, Ping Yu, Lili Yu, Susan Zhang, Gargi Ghosh, Mike Lewis, Luke Zettlemoyer, Omer Levy
- Remember What You Did So You Know What To Do Next Ciosici Manuel R., Hedges Alex, Kankanampati Yash, Martin Justin, Freedman Marjorie, Weischedel Ralph
- Generalization Analogies: A Testbed For Generalizing AI Oversight To Hard-to-measure Domains Clymer Joshua, Baker Garrett, Subramani Rohan, Wang Sam
- BERT Lost Patience Won't Be Robust To Adversarial Slowdown Coalson Zachary, Ritter Gabriel, Bobba Rakesh, Hong Sanghyun
- Dr. Jekyll And Mr. Hyde: Two Faces Of Llms Collu Matteo Gioele, Janssen-groesbeek Tom, Koffas Stefanos, Conti Mauro, Picek Stjepan
- Seamlessm4t: Massively Multilingual & Multimodal Machine Translation Communication Seamless, Barrault Loïc, Chung Yu-an, Meglioli Mariano Cora, Dale David, Dong Ning, Duquenne Paul-ambroise, Elsahar Hady, Gong Hongyu, Heffernan Kevin, Hoffman John, Klaiber Christopher, Li Pengwei, Licht Daniel, Maillard Jean, Rakotoarison Alice, Sadagopan Kaushik Ram, Wenzek Guillaume, Ye Ethan, Akula Bapi, Chen Peng-jen, Hachem Naji El, Ellis Brian, Gonzalez Gabriel Mejia, Haaheim Justin, Hansanti Prangthip, Howes Russ, Huang Bernie, Hwang Min-jae, Inaguma Hirofumi, Jain Somya, Kalbassi Elahe, Kallet Amanda, Kulikov Ilia, Lam Janice, Li Daniel, Ma Xutai, Mavlyutov Ruslan, Peloquin Benjamin, Ramadan Mohamed, Ramakrishnan Abinesh, Sun Anna, Tran Kevin, Tran Tuan, Tufanov Igor, Vogeti Vish, Wood Carleigh, Yang Yilin, Yu Bokai, Andrews Pierre, Balioglu Can, Costa-jussà Marta R., Celebi Onur, Elbayad Maha, Gao Cynthia, Guzmán Francisco, Kao Justine, Lee Ann, Mourachko Alexandre, Pino Juan, Popuri Sravya, Ropers Christophe, Saleem Safiyyah, Schwenk Holger, Tomasello Paden, Wang Changhan, Wang Jeff, Wang Skyler
- Robust Infidelity: When Faithfulness Measures On Masked Language Models Are Misleading Crothers Evan, Viktor Herna, Japkowicz Nathalie
- Reinforcement Learning Fine-tuning Of Language Models Is Biased Towards More Extractable Features Cruz Diogo, Pona Edoardo, Holness-tofts Alex, Schmied Elias, Alonso Víctor Abia, Griffin Charlie, Cirstea Bogdan-ionut
- Efficiently Adapting Pretrained Language Models To New Languages Csaki Zoltan, Pawakapan Pian, Thakker Urmish, Xu Qiantong
- Machine Mindset: An MBTI Exploration Of Large Language Models Cui Jiaxi, Lv Liuzhenghao, Wen Jing, Wang Rongsheng, Tang Jing, Tian Yonghong, Yuan Li
- Ada-instruct: Adapting Instruction Generators For Complex Reasoning Cui Wanyun, Wang Qianle
- Efficient And Effective Text Encoding For Chinese Llama And Alpaca Cui Yiming, Yang Ziqing, Yao Xin
- Large Language Models For Compiler Optimization Cummins Chris, Seeker Volker, Grubisic Dejan, Elhoushi Mostafa, Liang Youwei, Roziere Baptiste, Gehring Jonas, Gloeckle Fabian, Hazelwood Kim, Synnaeve Gabriel, Leather Hugh
- Evaluating Prompt-based Question Answering For Object Prediction In The Open Research Knowledge Graph D'souza Jennifer, Hrou Moussab, Auer Sören
- Auggpt: Leveraging Chatgpt For Text Data Augmentation Dai Haixing, Liu Zhengliang, Liao Wenxiong, Huang Xiaoke, Cao Yihan, Wu Zihao, Zhao Lin, Xu Shaochen, Liu Wei, Liu Ninghao, Li Sheng, Zhu Dajiang, Cai Hongmin, Sun Lichao, Li Quanzheng, Shen Dinggang, Liu Tianming, Li Xiang
- Educhat: A Large-scale Language Model-based Chatbot System For Intelligent Education Dan Yuhao, Lei Zhikai, Gu Yiyang, Li Yong, Yin Jianghao, Lin Jiaju, Ye Linhao, Tie Zhiyan, Zhou Yougen, Wang Yilei, Zhou Aimin, Zhou Ze, Chen Qin, Zhou Jie, He Liang, Qiu Xipeng
- Palm-e: An Embodied Multimodal Language Model Danny Driess, Fei Xia, Mehdi S. M. Sajjadi, Corey Lynch, Aakanksha Chowdhery, Brian Ichter, Ayzaan Wahid, Jonathan Tompson, Quan Vuong, Tianhe Yu, Wenlong Huang, Yevgen Chebotar, Pierre Sermanet, Daniel Duckworth, Sergey Levine, Vincent Vanhoucke, Karol Hausman, Marc Toussaint, Klaus Greff, Andy Zeng, Igor Mordatch, Pete Florence
- Flashattention-2: Faster Attention With Better Parallelism And Work Partitioning Dao Tri
- Mask The Bias: Improving Domain-adaptive Generalization Of Ctc-based ASR With Internal Language Model Estimation Das Nilaksh, Sunkara Monica, Bodapati Sravan, Cai Jinglun, Kulshreshtha Devang, Farris Jeff, Kirchhoff Katrin
- Optimizing Distributed Training On Frontier For Large Language Models Dash Sajal, Lyngaas Isaac, Yin Junqi, Wang Xiao, Egele Romain, Cong Guojing, Wang Feiyi, Balaprakash Prasanna
- Complex QA And Language Models Hybrid Architectures, Survey Daull Xavier, Bellot Patrice, Bruno Emmanuel, Martin Vincent, Murisasco Elisabeth
- Competence-based Analysis Of Language Models Davies Adam, Jiang Jize, Zhai Chengxiang
- GLIMMER: Generalized Late-interaction Memory Reranker De Jong Michiel, Zemlyanskiy Yury, Fitzgerald Nicholas, Sanghai Sumit, Cohen William W., Ainslie Joshua
- LLMR: Real-time Prompting Of Interactive Worlds Using Large Language Models De La Torre Fernanda, Fang Cathy Mengying, Huang Han, Banburski-fahey Andrzej, Fernandez Judith Amores, Lanier Jaron
- GPT-3 Models Are Few-shot Financial Reasoners De Padua Raul Salles, Qureshi Imran, Karakaplan Mustafa U.
- Scaling Vision Transformers To 22 Billion Parameters Dehghani Mostafa, Djolonga Josip, Mustafa Basil, Padlewski Piotr, Heek Jonathan, Gilmer Justin, Steiner Andreas, Caron Mathilde, Geirhos Robert, Alabdulmohsin Ibrahim, Jenatton Rodolphe, Beyer Lucas, Tschannen Michael, Arnab Anurag, Wang Xiao, Riquelme Carlos, Minderer Matthias, Puigcerver Joan, Evci Utku, Kumar Manoj, Van Steenkiste Sjoerd, Elsayed Gamaleldin F., Mahendran Aravindh, Yu Fisher, Oliver Avital, Huot Fantine, Bastings Jasmijn, Collier Mark Patrick, Gritsenko Alexey, Birodkar Vighnesh, Vasconcelos Cristina, Tay Yi, Mensink Thomas, Kolesnikov Alexander, Pavetić Filip, Tran Dustin, Kipf Thomas, Lučić Mario, Zhai Xiaohua, Keysers Daniel, Harmsen Jeremiah, Houlsby Neil
- Controlled Text Generation Via Language Model Arithmetic Dekoninck Jasper, Fischer Marc, Beurer-kellner Luca, Vechev Martin
- Investigating Data Contamination In Modern Benchmarks For Large Language Models Deng Chunyuan, Zhao Yilun, Tang Xiangru, Gerstein Mark, Cohan Arman
- Plug-and-play Policy Planner For Large Language Model Powered Dialogue Agents Deng Yang, Zhang Wenxuan, Lam Wai, Ng See-kiong, Chua Tat-seng
- Mutual Enhancement Of Large And Small Language Models With Cross-silo Knowledge Transfer Deng Yongheng, Qiao Ziqing, Ren Ju, Liu Yang, Zhang Yaoxue
- Multilingual Jailbreak Challenges In Large Language Models Deng Yue, Zhang Wenxuan, Pan Sinno Jialin, Bing Lidong
- Spqr: A Sparse-quantized Representation For Near-lossless LLM Weight Compression Dettmers Tim, Svirschevski Ruslan, Egiazarian Vage, Kuznedelev Denis, Frantar Elias, Ashkboos Saleh, Borzunov Alexander, Hoefler Torsten, Alistarh Dan
- Context-aware Transformer Pre-training For Answer Sentence Selection Di Liello Luca, Garg Siddhant, Moschitti Alessandro
- Codefuse-13b: A Pretrained Multi-lingual Code Large Language Model Di Peng, Li Jianguo, Yu Hang, Jiang Wei, Cai Wenting, Cao Yang, Chen Chaoyu, Chen Dajun, Chen Hongwei, Chen Liang, Fan Gang, Gong Jie, Gong Zi, Hu Wen, Guo Tingting, Lei Zhichao, Li Ting, Li Zheng, Liang Ming, Liao Cong, Liu Bingchang, Liu Jiachen, Liu Zhiwei, Lu Shaojun, Shen Min, Wang Guangpei, Wang Huan, Wang Zhi, Xu Zhaogui, Yang Jiawei, Ye Qing, Zhang Gehao, Zhang Yu, Zhao Zelin, Zheng Xunjin, Zhou Hailian, Zhu Lifu, Zhu Xianying
- A Comprehensive Evaluation Of Neural SPARQL Query Generation From Natural Language Questions Diallo Papa Abdou Karim Karou, Reyd Samuel, Zouaq Amal
- Lmflow: An Extensible Toolkit For Finetuning And Inference Of Large Foundation Models Diao Shizhe, Pan Rui, Dong Hanze, Shum Ka Shun, Zhang Jipeng, Xiong Wei, Zhang Tong
- Enhancing Chat Language Models By Scaling High-quality Instructional Conversations Ding Ning, Chen Yulin, Xu Bokai, Qin Yujia, Zheng Zhi, Hu Shengding, Liu Zhiyuan, Sun Maosong, Zhou Bowen
- Self-agreement: A Framework For Fine-tuning Language Models To Find Agreement Among Diverse Opinions Ding Shiyao, Ito Takayuki
- HPC-GPT: Integrating Large Language Model For High-performance Computing Ding Xianzhong, Chen Le, Emani Murali, Liao Chunhua, Lin Pei-hung, Vanderbruggen Tristan, Xie Zhen, Cerpa Alberto E., Du Wan
- Efficient Finetuning Large Language Models For Vietnamese Chatbot Doan Vu-thuan, Truong Quoc-truong, Nguyen Duc-vu, Nguyen Vinh-tiep, Luu Thuy-ngan Nguyen
- Establishing Performance Baselines In Fine-tuning, Retrieval-augmented Generation And Soft-prompting For Non-specialist LLM Users Dodgson Jennifer, Nanzheng Lin, Peh Julian, Pattirane Akira Rafhael Janson, Alhajir Alfath Daryl, Dinarto Eko Ridho, Lim Joseph, Ahmad Syed Danyal
- Block-wise Bit-compression Of Transformer-based Models Dong Gaochen, Chen Wei
- Blockwise Compression Of Transformer-based Models Without Retraining Dong Gaochen, Chen Wei
- The Philosopher's Stone: Trojaning Plugins Of Large Language Models Dong Tian, Xue Minhui, Chen Guoxing, Holland Rayne, Meng Yan, Li Shaofeng, Liu Zhen, Zhu Haojin
- Speechgpt: Empowering Large Language Models With Intrinsic Cross-modal Conversational Abilities Dong Zhang, Shimin Li, Xin Zhang, Jun Zhan, Pengyu Wang, Yaqian Zhou, Xipeng Qiu
- Loramoe: Alleviate World Knowledge Forgetting In Large Language Models Via Moe-style Plugin Dou Shihan, Zhou Enyu, Liu Yan, Gao Songyang, Zhao Jun, Shen Wei, Zhou Yuhao, Xi Zhiheng, Wang Xiao, Fan Xiaoran, Pu Shiliang, Zhu Jiang, Zheng Rui, Gui Tao, Zhang Qi, Huang Xuanjing
- Quantifying And Attributing The Hallucination Of Large Language Models Via Association Analysis Du Li, Wang Yequan, Xing Xingrun, Ya Yiqun, Li Xiang, Jiang Xin, Fang Xuezhi
- Blending Reward Functions Via Few Expert Demonstrations For Faithful And Accurate Knowledge-grounded Dialogue Generation Du Wanyu, Ji Yangfeng
- Guiding Pretraining In Reinforcement Learning With Large Language Models Du Yuqing, Watkins Olivia, Wang Zihan, Colas Cédric, Darrell Trevor, Abbeel Pieter, Gupta Abhishek, Andreas Jacob
- Exploring The Relationship Between In-context Learning And Instruction Tuning Duan Hanyu, Tang Yixuan, Yang Yi, Abbasi Ahmed, Tam Kar Yan
- Alpacafarm: A Simulation Framework For Methods That Learn From Human Feedback Dubois Yann, Li Xuechen, Taori Rohan, Zhang Tianyi, Gulrajani Ishaan, Ba Jimmy, Guestrin Carlos, Liang Percy, Hashimoto Tatsunori B.
- Sweeping Heterogeneity With Smart Mops: Mixture Of Prompts For LLM Task Adaptation Dun Chen, Garcia Mirian Hipolito, Zheng Guoqing, Awadallah Ahmed Hassan, Kyrillidis Anastasios, Sim Robert
- GPT-SW3: An Autoregressive Language Model For The Nordic Languages Ekgren Ariel, Gyllensten Amaru Cuba, Stollenwerk Felix, Öhman Joey, Isbister Tim, Gogoulou Evangelia, Carlsson Fredrik, Heiman Alice, Casademont Judit, Sahlgren Magnus
- Measuring And Improving Attentiveness To Partial Inputs With Counterfactuals Elazar Yanai, Paranjape Bhargavi, Peng Hao, Wiegreffe Sarah, Raghavi Khyathi, Srikumar Vivek, Singh Sameer, Smith Noah A.
- Tinystories: How Small Can Language Models Be And Still Speak Coherent English? Eldan Ronen, Li Yuanzhi
- Who's Harry Potter? Approximate Unlearning In Llms Eldan Ronen, Russinovich Mark
- Octopus: A Multitask Model And Toolkit For Arabic Natural Language Generation Elmadany Abdelrahim, Nagoudi El Moatez Billah, Abdul-mageed Muhammad
- ROBBIE: Robust Bias Evaluation Of Large Generative Language Models Esiobu David, Tan Xiaoqing, Hosseini Saghar, Ung Megan, Zhang Yuchen, Fernandes Jude, Dwivedi-yu Jane, Presani Eleonora, Williams Adina, Smith Eric Michael
- Jacotext: A Pretrained Model For Java Code-text Generation Espejel Jessica López, Alassan Mahaman Sanoussi Yahaya, Dahhane Walid, Ettifouri El Hassane
- NAP At Semeval-2023 Task 3: Is Less Really More? (back-)translation As Data Augmentation Strategies For Detecting Persuasion Techniques Falk Neele, Eichel Annerose, Piccirilli Prisca
- Proximal Policy Optimization Actual Combat: Manipulating Output Tokenizer Length Fan Miao, Hu Chen, Zhou Shuchang
- Instructseq: Unifying Vision Tasks With Instruction-conditioned Multi-modal Sequence Generation Fang Rongyao, Yan Shilin, Huang Zhaoyang, Zhou Jingqiu, Tian Hao, Dai Jifeng, Li Hongsheng
- Chatgpt As Data Augmentation For Compositional Generalization: A Case Study In Open Intent Detection Fang Yihao, Li Xianzhi, Thomas Stephen W., Zhu Xiaodan
- Prompting Large Language Models With Speech Recognition Abilities Fathullah Yassir, Wu Chunyang, Lakomkin Egor, Jia Junteng, Shangguan Yuan, Li Ke, Guo Jinxi, Xiong Wenhan, Mahadeokar Jay, Kalinli Ozlem, Fuegen Christian, Seltzer Mike
- Revisiting Instruction Fine-tuned Model Evaluation To Guide Industrial Applications Faysse Manuel, Viaud Gautier, Hudelot Céline, Colombo Pierre
- Transferable Decoding With Visual Entities For Zero-shot Image Captioning Fei Junjie, Wang Teng, Zhang Jinrui, He Zhenyu, Wang Chengjie, Zheng Feng
- Extending Context Window Of Large Language Models Via Semantic Compression Fei Weizhi, Niu Xueyan, Zhou Pingyi, Hou Lu, Bai Bo, Deng Lei, Han Wei
- Improving Factual Consistency Of Text Summarization By Adversarially Decoupling Comprehension And Embellishment Abilities Of Llms Feng Huawen, Fan Yan, Liu Xiong, Lin Ting-en, Yao Zekun, Wu Yuchuan, Huang Fei, Li Yongbin, Ma Qianli
- Sequence-to-sequence Pre-training With Unified Modality Masking For Visual Document Understanding Feng Shuwei, Zhan Tianyang, Jie Zhanming, Luong Trung Quoc, Jin Xiaoran
- Chessgpt: Bridging Policy Learning And Language Modeling Feng Xidong, Luo Yicheng, Wang Ziyan, Tang Hongrui, Yang Mengyue, Shao Kun, Mguni David, Du Yali, Wang Jun
- Alphazero-like Tree-search Can Guide Large Language Model Decoding And Training Feng Xidong, Wan Ziyu, Wen Muning, Mcaleer Stephen Marcus, Wen Ying, Zhang Weinan, Wang Jun
- Llama Rider: Spurring Large Language Models To Explore The Open World Feng Yicheng, Wang Yuxuan, Liu Jiazheng, Zheng Sipeng, Lu Zongqing
- Promptbreeder: Self-referential Self-improvement Via Prompt Evolution Fernando Chrisantha, Banarse Dylan, Michalewski Henryk, Osindero Simon, Rocktäschel Tim
- Should Chatgpt Be Biased? Challenges And Risks Of Bias In Large Language Models Ferrara Emilio
- Vision Language Transformers: A Survey Fields Clayton, Kennington Casey
- Context-aware Meta-learning Fifty Christopher, Duan Dennis, Junkins Ronald G., Amid Ehsan, Leskovec Jure, Re Christopher, Thrun Sebastian
- MBR And QE Finetuning: Training-time Distillation Of The Best And Most Expensive Decoding Methods Finkelstein Mara, Naskar Subhajit, Mirzazadeh Mehdi, Shah Apurva, Freitag Markus
- Can Large Language Models Follow Concept Annotation Guidelines? A Case Study On Scientific And Financial Domains Fonseca Marcio, Cohen Shay B.
- Enhancing Pipeline-based Conversational Agents With Large Language Models Foosherian Mina, Purwins Hendrik, Rathnayake Purna, Alam Touhidul, Teimao Rui, Thoben Klaus-dieter
- Sparsegpt: Massive Language Models Can Be Accurately Pruned In One-shot Frantar Elias, Alistarh Dan
- Leveraging Large Language Models In Conversational Recommender Systems Friedman Luke, Ahuja Sameer, Allen David, Tan Zhenning, Sidahmed Hakim, Long Changbo, Xie Jun, Schubiner Gabriel, Patel Ajay, Lara Harsh, Chu Brian, Chen Zexi, Tiwari Manoj
- Kwaiyiimath: Technical Report Fu Jiayi, Lin Lei, Gao Xiaoyang, Liu Pengli, Chen Zhengzong, Yang Zhirui, Zhang Shengnan, Zheng Xue, Li Yan, Liu Yuliang, Ye Xucheng, Liao Yiqiao, Liao Chao, Chen Bin, Song Chengru, Wan Junchen, Lin Zijia, Zhang Fuzheng, Wang Zhongyuan, Zhang Di, Gai Kun
- Misusing Tools In Large Language Models With Visual Adversarial Examples Fu Xiaohan, Wang Zihan, Li Shuheng, Gupta Rajesh K., Mireshghallah Niloofar, Berg-kirkpatrick Taylor, Fernandes Earlence
- Generate Then Select: Open-ended Visual Question Answering Guided By World Knowledge Fu Xingyu, Zhang Sheng, Kwon Gukyeong, Perera Pramuditha, Zhu Henghui, Zhang Yuhao, Li Alexander Hanbo, Wang William Yang, Wang Zhiguo, Castelli Vittorio, Ng Patrick, Roth Dan, Xiang Bing
- Mitigating Hallucination In Large Multi-modal Models Via Robust Instruction Tuning Fuxiao Liu, Kevin Lin, Linjie Li, Jianfeng Wang, Yaser Yacoob, Lijuan Wang
- Badllama: Cheaply Removing Safety Fine-tuning From Llama 2-chat 13B Gade Pranav, Lermen Simon, Rogers-smith Charlie, Ladish Jeffrey
- Ziya2: Data-centric Learning Is All Llms Need Gan Ruyi, Wu Ziwei, Sun Renliang, Lu Junyu, Wu Xiaojun, Zhang Dixiang, Pan Kunhao, He Junqing, Tian Yuanhe, Yang Ping, Yang Qi, Wang Hao, Zhang Jiaxing, Song Yan
- Roles Of Scaling And Instruction Tuning In Language Perception: Model Vs. Human Attention Gao Changjiang, Huang Shujian, Li Jixing, Chen Jiajun
- Text-to-sql Empowered By Large Language Models: A Benchmark Evaluation Gao Dawei, Wang Haibin, Li Yaliang, Sun Xiuyu, Qian Yichen, Ding Bolin, Zhou Jingren
- Policy-gradient Training Of Language Models For Ranking Gao Ge, Chang Jonathan D., Cardie Claire, Brantley Kianté, Joachims Thorsten
- Llama-adapter V2: Parameter-efficient Visual Instruction Model Gao Peng, Han Jiaming, Zhang Renrui, Lin Ziyi, Geng Shijie, Zhou Aojun, Zhang Wei, Lu Pan, He Conghui, Yue Xiangyu, Li Hongsheng, Qiao Yu
- Empower Your Model With Longer And Better Context Comprehension Gao Yifei, Wang Lei, Fang Jun, Hu Longhua, Cheng Jun
- CLOVA: A Closed-loop Visual Assistant With Tool Usage And Update Gao Zhi, Du Yuntao, Zhang Xintong, Ma Xiaojian, Han Wenjuan, Zhu Song-chun, Li Qing
- If The Sources Could Talk: Evaluating Large Language Models For Research Assistance In History Garcia Giselle Gonzalez, Weilbach Christian
- Real Customization Or Just Marketing: Are Customized Versions Of Chat GPT Useful? Garrido-merchán Eduardo C., Arroyo-barrigüete Jose L., Borrás-pala Francisco, Escobar-torres Leandro, De Ibarreta Carlos Martínez, Ortiz-lozano Jose María, Rua-vieites Antonio
- Faithful Explanations Of Black-box NLP Models Using Llm-generated Counterfactuals Gat Yair, Calderon Nitay, Feder Amir, Chapanin Alexander, Sharma Amit, Reichart Roi
- A Lightweight Method To Generate Unanswerable Questions In English Gautam Vagrant, Zhang Miaoran, Klakow Dietrich
- Ahead-of-time P-tuning Gavrilov Daniil, Balagansky Nikita
- Model Tells You What To Discard: Adaptive KV Cache Compression For Llms Ge Suyu, Zhang Yunan, Liu Liyuan, Zhang Minjia, Han Jiawei, Gao Jianfeng
- MART: Improving LLM Safety With Multi-round Automatic Red-teaming Ge Suyu, Zhou Chunting, Hou Rui, Khabsa Madian, Wang Yi-chia, Wang Qifan, Han Jiawei, Mao Yuning
- Making Llama SEE And Draw With SEED Tokenizer Ge Yuying, Zhao Sijie, Zeng Ziyun, Ge Yixiao, Li Chen, Wang Xintao, Shan Ying
- Ddcot: Duty-distinct Chain-of-thought Prompting For Multimodal Reasoning In Language Models Ge Zheng, Bin Yang, Jiajin Tang, Hong-yu Zhou, Sibei Yang
- Mblip: Efficient Bootstrapping Of Multilingual Vision-llms Geigle Gregor, Jain Abhay, Timofte Radu, Glavaš Goran
- Trueteacher: Learning Factual Consistency Evaluation With Large Language Models Gekhman Zorik, Herzig Jonathan, Aharoni Roee, Elkind Chen, Szpektor Idan
- Hiclip: Contrastive Language-image Pretraining With Hierarchy-aware Attention Geng Shijie, Yuan Jianbo, Tian Yu, Chen Yuxiao, Zhang Yongfeng
- Does Synthetic Data Make Large Language Models More Efficient? Gholami Sia, Omar Marwan
- Flacuna: Unleashing The Problem Solving Power Of Vicuna Using FLAN Fine-tuning Ghosal Deepanway, Chia Yew Ken, Majumder Navonil, Poria Soujanya
- Text-to-audio Generation Using Instruction-tuned LLM And Latent Diffusion Model Ghosal Deepanway, Majumder Navonil, Mehrish Ambuj, Poria Soujanya
- Clipsyntel: CLIP And LLM Synergy For Multimodal Question Summarization In Healthcare Ghosh Akash, Acharya Arkadeep, Jain Raghav, Saha Sriparna, Chadha Aman, Sinha Setu
- Resetox: Re-learning Attention Weights For Toxicity Mitigation In Machine Translation Gilabert Javier García, Escolano Carlos, Costa-jussà Marta R.
- Mtuncertainty: Assessing The Need For Post-editing Of Machine Translation Outputs By Fine-tuning Openai Llms Gladkoff Serge, Han Lifeng, Erofeev Gleb, Sorokina Irina, Nenadic Goran
- Headless Language Models: Learning Without Predicting With Contrastive Weight Tying Godey Nathan, De La Clergerie Éric, Sagot Benoît
- Model-generated Pretraining Signals Improves Zero-shot Generalization Of Text-to-text Transformers Gong Linyuan, Xiong Chenyan, Liu Xiaodong, Bajaj Payal, Xie Yiqing, Cheung Alvin, Gao Jianfeng, Song Xia
- Improving Input-label Mapping With Demonstration Replay For In-context Learning Gong Zhuocheng, Liu Jiahao, Wang Qifan, Wang Jingang, Cai Xunliang, Zhao Dongyan, Yan Rui
- Intention And Context Elicitation With Large Language Models In The Legal Aid Intake Process Goodson Nick, Lu Rongfei
- Lil-bevo: Explorations Of Strategies For Training Language Models In More Humanlike Ways Govindarajan Venkata S, Rodriguez Juan Diego, Bostrom Kaj, Mahowald Kyle
- Pokemqa: Programmable Knowledge Editing For Multi-hop Question Answering Gu Hengrui, Zhou Kaixiong, Han Xiaotian, Liu Ninghao, Wang Ruobing, Wang Xin
- Seer: Language Instructed Video Prediction With Latent Diffusion Models Gu Xianfan, Wen Chuan, Ye Weirui, Song Jiaming, Gao Yang
- Voyager: An Open-ended Embodied Agent With Large Language Models Guanzhi Wang, Yuqi Xie, Yunfan Jiang, Ajay Mandlekar, Chaowei Xiao, Yuke Zhu, Linxi Fan, Anima Anandkumar
- Hifi: High-information Attention Heads Hold For Parameter-efficient Model Adaptation Gui Anchun, Xiao Han
- SPT: Fine-tuning Transformer-based Language Models Efficiently With Sparsification Gui Yuntao, Yan Xiao, Yin Peiqi, Yang Han, Cheng James
- Reinforced Self-training (rest) For Language Modeling Gulcehre Caglar, Paine Tom Le, Srinivasan Srivatsan, Konyushkova Ksenia, Weerts Lotte, Sharma Abhishek, Siddhant Aditya, Ahern Alex, Wang Miaosen, Gu Chenjie, Macherey Wolfgang, Doucet Arnaud, Firat Orhan, De Freitas Nando
- Beyond Imitation: Leveraging Fine-grained Quality Signals For Alignment Guo Geyang, Zhao Ranchi, Tang Tianyi, Zhao Wayne Xin, Wen Ji-rong
- Meta-tuning Llms To Leverage Lexical Knowledge For Generalizable Language Style Understanding Guo Ruohao, Xu Wei, Ritter Alan
- Glancing Future For Simultaneous Machine Translation Guo Shoutao, Zhang Shaolei, Feng Yang
- Compresso: Structured Pruning With Collaborative Prompting Learns Compact Large Language Models Guo Song, Xu Jiahang, Zhang Li Lyna, Yang Mao
- Instruction Fusion: Advancing Prompt Evolution Through Hybridization Guo Weidong, Yang Jiuding, Yang Kaitong, Li Xiangyang, Rao Zhuwei, Xu Yu, Niu Di
- ELIP: Efficient Language-image Pre-training With Fewer Vision Tokens Guo Yangyang, Zhang Haoyu, Wong Yongkang, Nie Liqiang, Kankanhalli Mohan
- Learning To Plan With Natural Language Guo Yiduo, Liang Yaobo, Wu Chenfei, Wu Wenshan, Zhao Dongyan, Duan Nan
- Arthmodel: Enhance Arithmetic Skills To Large Language Model Guo Yingdi
- Improving Small Language Models On Pubmedqa Via Generative Data Augmentation Guo Zhen, Wang Peiqi, Wang Yanwei, Yu Shangdi
- Point-bind & Point-llm: Aligning Point Cloud With Multi-modality For 3D Understanding, Generation, And Instruction Following Guo Ziyu, Zhang Renrui, Zhu Xiangyang, Tang Yiwen, Ma Xianzheng, Han Jiaming, Chen Kexin, Gao Peng, Li Xianzhi, Li Hongsheng, Heng Pheng-ann
- Editing Common Sense In Transformers Gupta Anshita, Mondal Debanjan, Sheshadri Akshay Krishna, Zhao Wenlong, Li Xiang Lorraine, Wiegreffe Sarah, Tandon Niket
- Targen: Targeted Data Generation With Large Language Models Gupta Himanshu, Scaria Kevin, Anantheswaran Ujjwala, Verma Shreyas, Parmar Mihir, Sawant Saurabh Arjun, Baral Chitta, Mishra Swaroop
- Continual Pre-training Of Large Language Models: How To (re)warm Your Model? Gupta Kshitij, Thérien Benjamin, Ibrahim Adam, Richter Mats L., Anthony Quentin, Belilovsky Eugene, Rish Irina, Lesort Timothée
- Grace: Generation Using Associated Code Edits Gupta Priyanshu, Khare Avishree, Bajpai Yasharth, Chakraborty Saikat, Gulwani Sumit, Kanade Aditya, Radhakrishna Arjun, Soares Gustavo, Tiwari Ashish
- Gistscore: Learning Better Representations For In-context Example Selection With Gist Bottlenecks Gupta Shivanshu, Rosenbaum Clemens, Elenberg Ethan R.
- Evaluating Concurrent Robustness Of Language Models Across Diverse Challenge Sets Gupta Vatsal, Pandya Pranshu, Kataria Tushar, Gupta Vivek, Roth Dan
- A Survey Of Vision-language Pre-training From The Lens Of Multimodal Machine Translation Gwinnup Jeremy, Duh Kevin
- Seeing Chatgpt Through Students' Eyes: An Analysis Of Tiktok Data Haensch Anna-carolina, Ball Sarah, Herklotz Markus, Kreuter Frauke
- Efficient Parallelization Layouts For Large-scale Distributed Model Training Hagemann Johannes, Weinbach Samuel, Dobler Konstantin, Schall Maximilian, De Melo Gerard
- A Theory Of Emergent In-context Learning As Implicit Structure Induction Hahn Michael, Goyal Navin
- Large Language Models Meet Computer Vision: A Brief Survey Hamadi Raby
- Lm-infinite: Zero-shot Extreme Length Generalization For Large Language Models Han Chi, Wang Qifan, Peng Hao, Xiong Wenhan, Chen Yu, Ji Heng, Wang Sinong
- Potential Benefits Of Employing Large Language Models In Research In Moral Education And Development Han Hyemin
- Imagebind-llm: Multi-modality Instruction Tuning Han Jiaming, Zhang Renrui, Shao Wenqi, Gao Peng, Xu Peng, Xiao Han, Zhang Kaipeng, Liu Chris, Wen Song, Guo Ziyu, Lu Xudong, Ren Shuai, Wen Yafei, Chen Xiaoxin, Yue Xiangyu, Li Hongsheng, Qiao Yu
- Pive: Prompting With Iterative Verification Improving Graph-based Generative Capability Of Llms Han Jiuzhou, Collier Nigel, Buntine Wray, Shareghi Ehsan
- Llms As Visual Explainers: Advancing Image Classification With Evolving Visual Descriptions Han Songhao, Zhuo Le, Liao Yue, Liu Si
- Medalpaca -- An Open-source Collection Of Medical Conversational AI Models And Training Data Han Tianyu, Adams Lisa C., Papaioannou Jens-michalis, Grundmann Paul, Oberhauser Tom, Löser Alexander, Truhn Daniel, Bressem Keno K.
- In-context Alignment: Chat With Vanilla Language Models Before Fine-tuning Han Xiaochuang
- David Helps Goliath: Inference-time Collaboration Between Small Specialized And Large General Diffusion Lms Han Xiaochuang, Kumar Sachin, Tsvetkov Yulia, Ghazvininejad Marjan
- Personalisation Within Bounds: A Risk Taxonomy And Policy Framework For The Alignment Of Large Language Models With Personalised Feedback Hannah Rose Kirk, Bertie Vidgen, Paul Röttger, Scott A. Hale
- Blockwise Parallel Transformer For Large Context Models Hao Liu, Pieter Abbeel
- Vico: Plug-and-play Visual Condition For Personalized Text-to-image Generation Hao Shaozhe, Han Kai, Zhao Shihao, Wong Kwan-yee K.
- Towards Efficient Vision-language Tuning: More Information Density, More Generalizability Hao Tianxiang, Lyu Mengyao, Chen Hui, Zhao Sicheng, Ding Xiaohan, Han Jungong, Ding Guiguang
- Test-time Training On Nearest Neighbors For Large Language Models Hardt Moritz, Sun Yu
- Herd: Using Multiple, Smaller Llms To Match The Performances Of Proprietary, Large Llms Via An Intelligent Composer Hari Surya Narayanan, Thomson Matt
- Reranking For Natural Language Generation From Logical Forms: A Study Based On Large Language Models Haroutunian Levon, Li Zhuang, Galescu Lucian, Cohen Philip, Tumuluri Raj, Haffari Gholamreza
- Answering Unseen Questions With Smaller Language Models Using Rationale Generation And Dense Retrieval Hartill Tim, Benavides-prado Diana, Witbrock Michael, Riddle Patricia J.
- Sok: Memorization In General-purpose Large Language Models Hartmann Valentin, Suri Anshuman, Bindschaedler Vincent, Evans David, Tople Shruti, West Robert
- Wanjuan: A Comprehensive Multimodal Dataset For Advancing English And Chinese Large Models He Conghui, Jin Zhenjiang, Xu Chao, Qiu Jiantao, Wang Bin, Li Wei, Yan Hang, Wang Jiaqi, Lin Dahua
- ICL-D3IE: In-context Learning With Diverse Demonstrations Updating For Document Information Extraction He Jiabang, Wang Lei, Hu Yi, Liu Ning, Liu Hui, Xu Xing, Shen Heng Tao
- Never Lost In The Middle: Mastering Long-context Question Answering With Position-agnostic Decompositional Training He Junqing, Pan Kunhao, Dong Xiaoqun, Song Zhuoyang, Liu Yibo, Sun Qianguo, Liang Yuxin, Wang Hao, Zhang Enming, Zhang Jiaxing
- Localized Text-to-image Generation For Free Via Cross Attention Control He Yutong, Salakhutdinov Ruslan, Kolter J. Zico
- Chatgpt For Zero-shot Dialogue State Tracking: A Solution Or An Opportunity? Heck Michael, Lubis Nurul, Ruppik Benjamin, Vukovic Renato, Feng Shutong, Geishauser Christian, Lin Hsien-chin, Van Niekerk Carel, Gašić Milica
- Divide Et Impera: Multi-transformer Architectures For Complex Nlp-tasks Helland Solveig, Gavagnin Elena, De Spindler Alexandre
- Stochastic Parrots Looking For Stochastic Parrots: Llms Are Easy To Fine-tune And Hard To Detect With Other Llms Henrique Da Silva Gameiro, Kucharavy Andrei, Guerraoui Rachid
- Backpack Language Models Hewitt John, Thickstun John, Manning Christopher D., Liang Percy
- Assessing The Efficacy Of Large Language Models In Generating Accurate Teacher Responses Hicke Yann, Masand Abhishek, Guo Wentao, Gangavarapu Tushaar
- Llm-japanese-dataset V0: Construction Of Japanese Chat Dataset For Large Language Models And Its Methodology Hirano Masanori, Suzuki Masahiro, Sakaji Hiroki
- Do Compressed Llms Forget Knowledge? An Experimental Study With Practical Implications Hoang Duc N. M, Cho Minsik, Merth Thomas, Rastegari Mohammad, Wang Zhangyang
- Detecting Edit Failures In Large Language Models: An Improved Specificity Benchmark Hoelscher-obermaier Jason, Persson Julia, Kran Esben, Konstas Ioannis, Barez Fazl
- Zero-shot Goal-directed Dialogue Via RL On Imagined Conversations Hong Joey, Levine Sergey, Dragan Anca
- DP-OPT: Make Large Language Model Your Privacy-preserving Prompt Engineer Hong Junyuan, Wang Jiachen T., Zhang Chenhui, Li Zhangheng, Li Bo, Wang Zhangyang
- Conversational AI Threads For Visualizing Multidimensional Datasets Hong Matt-heun, Crisan Anamaria
- Direct2v: Large Language Models Are Frame-level Directors For Zero-shot Text-to-video Generation Hong Susung, Seo Junyoung, Shin Heeseong, Hong Sunghwan, Kim Seungryong
- SPEED: Speculative Pipelined Execution For Efficient Decoding Hooper Coleman, Kim Sehoon, Mohammadzadeh Hiva, Genc Hasan, Keutzer Kurt, Gholami Amir, Shao Sophia
- Towards A Mechanistic Interpretation Of Multi-step Reasoning Capabilities Of Language Models Hou Yifan, Li Jiaoda, Fei Yu, Stolfo Alessandro, Zhou Wangchunshu, Zeng Guangtao, Bosselut Antoine, Sachan Mrinmaya
- Distilling Step-by-step! Outperforming Larger Language Models With Less Training Data And Smaller Model Sizes Hsieh Cheng-yu, Li Chun-liang, Yeh Chih-kuan, Nakhost Hootan, Fujii Yasuhisa, Ratner Alexander, Krishna Ranjay, Lee Chen-yu, Pfister Tomas
- Chain-of-symbol Prompting Elicits Planning In Large Language Models Hu Hanxu, Lu Hongyuan, Zhang Huajian, Song Yun-ze, Lam Wai, Zhang Yue
- Context-aware Prompt Tuning For Vision-language Model With Dual-alignment Hu Hongyu, Lin Tiancheng, Wang Jie, Sun Zhenbang, Xu Yi
- COMMA: Co-articulated Multi-modal Learning Hu Lianyu, Gao Liqing, Liu Zekang, Pun Chi-man, Feng Wei
- Meta-learning Online Adaptation Of Language Models Hu Nathan, Mitchell Eric, Manning Christopher D., Finn Chelsea
- Scaled Prompt-tuning For Few-shot Natural Language Generation Hu Ting, Meinel Christoph, Yang Haojin
- Enhancing Answer Selection In Community Question Answering With Pre-trained And Large Language Models Hu Xinghang
- Do Large Language Models Know About Facts? Hu Xuming, Chen Junzhe, Li Xiaochuan, Guo Yufei, Wen Lijie, Yu Philip S., Guo Zhijiang
- Llm-adapters: An Adapter Family For Parameter-efficient Fine-tuning Of Large Language Models Hu Zhiqiang, Wang Lei, Lan Yihuai, Xu Wanyu, Lim Ee-peng, Bing Lidong, Xu Xing, Poria Soujanya, Lee Roy Ka-wei
- POP: Prompt Of Prompts For Continual Learning Hu Zhiyuan, Lyu Jiancheng, Gao Dashan, Vasconcelos Nuno
- Vtimellm: Empower LLM To Grasp Video Moments Huang Bin, Wang Xin, Chen Hong, Song Zihan, Zhu Wenwu
- Towards Equipping Transformer With The Ability Of Systematic Compositionality Huang Chen, Qin Peixin, Lei Wenqiang, Lv Jiancheng
- Directed Acyclic Transformer Pre-training For High-quality Non-autoregressive Text Generation Huang Fei, Ke Pei, Huang Minlie
- Free-bloom: Zero-shot Text-to-video Generator With LLM Director And LDM Animator Huang Hanzhuo, Feng Yufan, Shi Cheng, Xu Lan, Yu Jingyi, Yang Sibei
- Acegpt, Localizing Large Language Models In Arabic Huang Huang, Yu Fei, Zhu Jianqing, Sun Xuening, Cheng Hao, Song Dingjie, Chen Zhihong, Alharthi Abdulmohsen, An Bang, He Juncai, Liu Ziche, Zhang Zhiyi, Chen Junying, Li Jianquan, Wang Benyou, Zhang Lian, Sun Ruoyu, Wan Xiang, Li Haizhou, Xu Jinchao
- Modality Plug-and-play: Elastic Modality Adaptation In Multimodal Llms For Embodied AI Huang Kai, Yang Boyuan, Gao Wei
- T2i-compbench: A Comprehensive Benchmark For Open-world Compositional Text-to-image Generation Huang Kaiyi, Sun Kaiyue, Xie Enze, Li Zhenguo, Liu Xihui
- Lawyer Llama Technical Report Huang Quzhe, Tao Mingxu, Zhang Chen, An Zhenwei, Jiang Cong, Chen Zhibin, Wu Zirui, Feng Yansong
- Chat Vector: A Simple Approach To Equip Llms With Instruction Following And Model Alignment In New Languages Huang Shih-cheng, Li Pin-zu, Hsu Yu-chi, Chen Kuang-ming, Lin Yu Tung, Hsiao Shih-kai, Tsai Richard Tzong-han, Lee Hung-yi
- Grounded Decoding: Guiding Text Generation With Grounded Models For Embodied Agents Huang Wenlong, Xia Fei, Shah Dhruv, Driess Danny, Zeng Andy, Lu Yao, Florence Pete, Mordatch Igor, Levine Sergey, Hausman Karol, Ichter Brian
- Fewer Is More: Boosting LLM Reasoning With Reinforced Context Pruning Huang Xijie, Zhang Li Lyna, Cheng Kwang-ting, Yang Fan, Yang Mao
- Competition-level Problems Are Effective LLM Evaluators Huang Yiming, Lin Zhenghao, Liu Xiao, Gong Yeyun, Lu Shuai, Lei Fangyu, Liang Yaobo, Shen Yelong, Lin Chen, Duan Nan, Chen Weizhu
- Advancing Transformer Architecture In Long-context Large Language Models: A Comprehensive Survey Huang Yunpeng, Xu Jingwei, Lai Junyu, Jiang Zixu, Chen Taolue, Li Zenan, Yao Yuan, Ma Xiaoxing, Yang Lijuan, Chen Hao, Li Shupeng, Zhao Penghao
- Evaluating And Enhancing Large Language Models For Conversational Reasoning On Knowledge Graphs Huang Yuxuan, Shi Lida, Liu Anqi, Xu Hao
- Smartedit: Exploring Complex Instruction-based Image Editing With Multimodal Large Language Models Huang Yuzhou, Xie Liangbin, Wang Xintao, Yuan Ziyang, Cun Xiaodong, Ge Yixiao, Zhou Jiantao, Dong Chao, Huang Rui, Zhang Ruimao, Shan Ying
- How Does Pretraining Improve Discourse-aware Translation? Huang Zhihong, Wang Longyue, Liu Siyou, Wong Derek F.
- Soft Prompt Decoding For Multilingual Dense Retrieval Huang Zhiqi, Zeng Hansi, Zamani Hamed, Allan James
- Llama 2: Open Foundation And Fine-tuned Chat Models Hugo Touvron, Louis Martin, Kevin Stone, Peter Albert, Amjad Almahairi, Yasmine Babaei, Nikolay Bashlykov, Soumya Batra, Prajjwal Bhargava, Shruti Bhosale, Dan Bikel, Lukas Blecher, Cristian Canton Ferrer, Moya Chen, Guillem Cucurull, David Esiobu, Jude Fernandes, Jeremy Fu, Wenyin Fu, Brian Fuller, Cynthia Gao, Vedanuj Goswami, Naman Goyal, Anthony Hartshorn, Saghar Hosseini, Rui Hou, Hakan Inan, Marcin Kardas, Viktor Kerkez, Madian Khabsa, Isabel Kloumann, Artem Korenev, Punit Singh Koura, Marie-anne Lachaux, Thibaut Lavril, Jenya Lee, Diana Liskovich, Yinghai Lu, Yuning Mao, Xavier Martinet, Todor Mihaylov, Pushkar Mishra, Igor Molybog, Yixin Nie, Andrew Poulton, Jeremy Reizenstein, Rashi Rungta, Kalyan Saladi, Alan Schelten, Ruan Silva, Eric Michael Smith, Ranjan Subramanian, Xiaoqing Ellen Tan, Binh Tang, Ross Taylor, Adina Williams, Jian Xiang Kuan, Puxin Xu, Zheng Yan, Iliyan Zarov, Yuchen Zhang, Angela Fan, Melanie Kambadur, Sharan Narang, Aurelien Rodriguez, Robert Stojnic, Sergey Edunov, Thomas Scialom
- TADA: Efficient Task-agnostic Domain Adaptation For Transformers Hung Chia-chien, Lange Lukas, Strötgen Jannik
- Towards Leveraging Llms For Conditional QA Hussain Syed-amad, Dakle Parag Pravin, Rallabandi Saikrishna, Raghavan Preethi
- Vimq: A Vietnamese Medical Question Dataset For Healthcare Dialogue System Development Huy Ta Duc, Tu Nguyen Anh, Vu Tran Hoang, Minh Nguyen Phuc, Phan Nguyen, Bui Trung H., Truong Steven Q. H.
- Understanding The Effectiveness Of Very Large Language Models On Dialog Evaluation Huynh Jessica, Jiao Cathy, Gupta Prakhar, Mehri Shikib, Bajaj Payal, Chaudhary Vishrav, Eskenazi Maxine
- Minimizing Factual Inconsistency And Hallucination In Large Language Models I Muneeswaran, Saxena Shreya, Prasad Siva, Prakash M V Sai, Shankar Advaith, V Varun, Vaddina Vishal, Gopalakrishnan Saisubramaniam
- Large Language Model Displays Emergent Ability To Interpret Novel Literary Metaphors Ichien Nicholas, Stamenković Dušan, Holyoak Keith J.
- Glot500: Scaling Multilingual Corpora And Language Models To 500 Languages Imani Ayyoob, Lin Peiqin, Kargaran Amir Hossein, Severini Silvia, Sabet Masoud Jalili, Kassner Nora, Ma Chunlan, Schmid Helmut, Martins André F. T., Yvon François, Schütze Hinrich
- Llama Guard: Llm-based Input-output Safeguard For Human-ai Conversations Inan Hakan, Upasani Kartikeya, Chi Jianfeng, Rungta Rashi, Iyer Krithika, Mao Yuning, Tontchev Michael, Hu Qing, Fuller Brian, Testuggine Davide, Khabsa Madian
- Pushing Boundaries: Exploring Zero Shot Object Classification With Large Multimodal Models Islam Ashhadul, Biswas Md. Rafiul, Zaghouani Wajdi, Belhaouari Samir Brahim, Shah Zubair
- Camels In A Changing Climate: Enhancing LM Adaptation With Tulu 2 Ivison Hamish, Wang Yizhong, Pyatkin Valentina, Lambert Nathan, Peters Matthew, Dasigi Pradeep, Jang Joel, Wadden David, Smith Noah A., Beltagy Iz, Hajishirzi Hannaneh
- A Comprehensive Evaluation Of Large Language Models On Benchmark Biomedical Text Processing Tasks Jahan Israt, Laskar Md Tahmid Rahman, Peng Chun, Huang Jimmy
- Neftune: Noisy Embeddings Improve Instruction Finetuning Jain Neel, Chiang Ping-yeh, Wen Yuxin, Kirchenbauer John, Chu Hong-min, Somepalli Gowthami, Bartoldson Brian R., Kailkhura Bhavya, Schwarzschild Avi, Saha Aniruddha, Goldblum Micah, Geiping Jonas, Goldstein Tom
- Bring Your Own Data! Self-supervised Evaluation For Large Language Models Jain Neel, Saifullah Khalid, Wen Yuxin, Kirchenbauer John, Shu Manli, Saha Aniruddha, Goldblum Micah, Geiping Jonas, Goldstein Tom
- Cotran: An Llm-based Code Translator Using Reinforcement Learning With Feedback From Compiler And Symbolic Execution Jana Prithwish, Jha Piyush, Ju Haoyang, Kishore Gautham, Mahajan Aryan, Ganesh Vijay
- Faithful Persona-based Conversational Dataset Generation With Large Language Models Jandaghi Pegah, Sheng Xianghai, Bai Xinyi, Pujara Jay, Sidahmed Hakim
- Inpars-v2: Large Language Models As Efficient Dataset Generators For Information Retrieval Jeronymo Vitor, Bonifacio Luiz, Abonizio Hugo, Fadaee Marzieh, Lotufo Roberto, Zavrel Jakub, Nogueira Rodrigo
- Quip: 2-bit Quantization Of Large Language Models With Guarantees Jerry Chee, Yaohui Cai, Volodymyr Kuleshov, Christopher De Sa
- Just CHOP: Embarrassingly Simple LLM Compression Jha Ananya Harsh, Sherborne Tom, Walsh Evan Pete, Groeneveld Dirk, Strubell Emma, Beltagy Iz
- Neuro Symbolic Reasoning For Planning: Counterexample Guided Inductive Synthesis Using Large Language Models And Satisfiability Solving Jha Sumit Kumar, Jha Susmit, Lincoln Patrick, Bastian Nathaniel D., Velasquez Alvaro, Ewetz Rickard, Neema Sandeep
- Natural Language Decomposition And Interpretation Of Complex Utterances Jhamtani Harsh, Fang Hao, Xia Patrick, Levy Eran, Andreas Jacob, Van Durme Ben
- Tailoring Language Generation Models Under Total Variation Distance Ji Haozhe, Ke Pei, Hu Zhipeng, Zhang Rongsheng, Huang Minlie
- Exploring The Impact Of Instruction Data Scaling On Large Language Models: An Empirical Study On Real-world Use Cases Ji Yunjie, Deng Yong, Gong Yan, Peng Yiping, Niu Qiang, Zhang Lei, Ma Baochang, Li Xiangang
- Bootstrapping Vision-language Learning With Decoupled Language Pre-training Jian Yiren, Gao Chongyang, Vosoughi Soroush
- Expedited Training Of Visual Conditioned Language Generation Via Redundancy Reduction Jian Yiren, Liu Tingkai, Tao Yunzhe, Zhang Chunhui, Vosoughi Soroush, Yang Hongxia
- Bus: Efficient And Effective Vision-language Pre-training With Bottom-up Patch Summarization Jiang Chaoya, Xu Haiyang, Ye Wei, Ye Qinghao, Li Chenliang, Yan Ming, Bi Bin, Zhang Shikun, Huang Fei, Huang Songfang
- Legal Syllogism Prompting: Teaching Large Language Models For Legal Judgment Prediction Jiang Cong, Yang Xiaolei
- Forcing Generative Models To Degenerate Ones: The Power Of Data Poisoning Attacks Jiang Shuli, Kadhe Swanand Ravindra, Zhou Yi, Cai Ling, Baracaldo Nathalie
- Prompt Packer: Deceiving Llms Through Compositional Instruction With Hidden Attacks Jiang Shuyu, Chen Xingshu, Tang Rui
- Scaling Sentence Embeddings With Large Language Models Jiang Ting, Huang Shaohan, Luan Zhongzhi, Wang Deqing, Zhuang Fuzhen
- Iluvui: Instruction-tuned Language-vision Modeling Of Uis From Machine Conversations Jiang Yue, Schoop Eldon, Swearngin Amanda, Nichols Jeffrey
- Lion: Adversarial Distillation Of Proprietary Large Language Models Jiang Yuxin, Chan Chunkit, Chen Mingyang, Wang Wei
- Generative Calibration For In-context Learning Jiang Zhongtao, Zhang Yuanzhe, Liu Cao, Zhao Jun, Liu Kang
- How Can Recommender Systems Benefit From Large Language Models: A Survey Jianghao Lin, Xinyi Dai, Yunjia Xi, Weiwen Liu, Bo Chen, Hao Zhang, Yong Liu, Chuhan Wu, Xiangyang Li, Chenxu Zhu, Huifeng Guo, Yong Yu, Ruiming Tang, Weinan Zhang
- Panda LLM: Training Data And Evaluation For Open-sourced Chinese Instruction-following Large Language Models Jiao Fangkai, Ding Bosheng, Luo Tianze, Mo Zhanfeng
- Exploring Self-supervised Logic-enhanced Training For Large Language Models Jiao Fangkai, Teng Zhiyang, Ding Bosheng, Liu Zhengyuan, Chen Nancy F., Joty Shafiq
- Chain Of Thought Prompt Tuning In Vision Language Models Jiaxin Ge, Hongyin Luo, Siyuan Qian, Yulu Gan, Jie Fu, Shanghang Zhang
- Llara: Large Language-recommendation Assistant Jiayi Liao, Sihang Li, Zhengyi Yang, Jiancan Wu, Yancheng Yuan, Xiang Wang, Xiangnan He
- Emergent Representations Of Program Semantics In Language Models Trained On Programs Jin Charles, Rinard Martin
- When Large Language Models Meet Personalization: Perspectives Of Challenges And Opportunities Jin Chen, Zheng Liu, Xu Huang, Chenwang Wu, Qi Liu, Gangwei Jiang, Yuanhao Pu, Yuxuan Lei, Xiaolong Chen, Xingmei Wang, Defu Lian, Enhong Chen
- Growlength: Accelerating Llms Pretraining By Progressively Growing Training Length Jin Hongye, Han Xiaotian, Yang Jingfeng, Jiang Zhimeng, Chang Chia-yuan, Hu Xia
- Medcpt: Contrastive Pre-trained Transformers With Large-scale Pubmed Search Logs For Zero-shot Biomedical Information Retrieval Jin Qiao, Kim Won, Chen Qingyu, Comeau Donald C., Yeganova Lana, Wilbur W. John, Lu Zhiyong
- The Cost Of Down-scaling Language Models: Fact Recall Deteriorates Before In-context Learning Jin Tian, Clement Nolan, Dong Xin, Nagarajan Vaishnavh, Carbin Michael, Ragan-kelley Jonathan, Dziugaite Gintare Karolina
- Grounding Language Models To Images For Multimodal Inputs And Outputs Jing Yu Koh, Ruslan Salakhutdinov, Daniel Fried
- Knowledge-augmented Large Language Models For Personalized Contextual Query Suggestion Jinheon Baek, Nirupama Chandrasekaran, Silviu Cucerzan, Allen Herring, Sujay Kumar Jauhar
- Vcoder: Versatile Vision Encoders For Multimodal Large Language Models Jitesh Jain, Jianwei Yang, Humphrey Shi
- Teaching Language Models To Hallucinate Less With Synthetic Tasks Jones Erik, Palangi Hamid, Simões Clarisse, Chandrasekaran Varun, Mukherjee Subhabrata, Mitra Arindam, Awadallah Ahmed, Kamar Ece
- Improving Activation Steering In Language Models With Mean-centring Jorgensen Ole, Cope Dylan, Schoots Nandi, Shanahan Murray
- Small Language Models Fine-tuned To Coordinate Larger Language Models Improve Complex Reasoning Juneja Gurusha, Dutta Subhabrata, Chakrabarti Soumen, Manchanda Sunny, Chakraborty Tanmoy
- A Comprehensive Capability Analysis Of GPT-3 And GPT-3.5 Series Models Junjie Ye, Xuanting Chen, Nuo Xu, Can Zu, Zekai Shao, Shichun Liu, Yuhan Cui, Zeyang Zhou, Chao Gong, Yang Shen, Jie Zhou, Siming Chen, Tao Gui, Qi Zhang, Xuanjing Huang
- BLIP-2: Bootstrapping Language-image Pre-training With Frozen Image Encoders And Large Language Models Junnan Li, Dongxu Li, Silvio Savarese, Steven Hoi
- Evaluation And Analysis Of Hallucination In Large Vision-language Models Junyang Wang, Yiyang Zhou, Guohai Xu, Pengcheng Shi, Chenlin Zhao, Haiyang Xu, Qinghao Ye, Ming Yan, Ji Zhang, Jihua Zhu, Jitao Sang, Haoyu Tang
- Synthetic Data Generation In Low-resource Settings Via Fine-tuning Of Large Language Models Kaddour Jean, Liu Qi
- Domain-specific Code Language Models: Unraveling The Potential For HPC Codes And Tasks Kadosh Tal, Hasabnis Niranjan, Vo Vy A., Schneider Nadav, Krien Neva, Capota Mihai, Wasay Abdul, Ahmed Nesreen, Willke Ted, Tamir Guy, Pinter Yuval, Mattson Timothy, Oren Gal
- Robust Recommender System: A Survey And Future Directions Kaike Zhang, Qi Cao, Fei Sun, Yunfan Wu, Shuchang Tao, Huawei Shen, Xueqi Cheng
- Robust Training For Conversational Question Answering Models With Reinforced Reformulation Generation Kaiser Magdalena, Roy Rishiraj Saha, Weikum Gerhard
- Contextual Code Switching For Machine Translation Using Language Models Kaji Arshad, Shah Manan
- Calibrated Language Models Must Hallucinate Kalai Adam Tauman, Vempala Santosh S.
- A Rank Stabilization Scaling Factor For Fine-tuning With Lora Kalajdzievski Damjan
- Swectrl-mini: A Data-transparent Transformer-based Large Language Model For Controllable Text Generation In Swedish Kalpakchi Dmytro, Boye Johan
- A Survey Of GPT-3 Family Large Language Models Including Chatgpt And GPT-4 Kalyan Katikapalli Subramanyam
- Dialmat: Dialogue-enabled Transformer With Moment-based Adversarial Training Kaneda Kanta, Korekata Ryosuke, Wada Yuiga, Nagashima Shunya, Kambara Motonari, Iioka Yui, Matsuo Haruka, Imai Yuto, Nishimura Takayuki, Sugiura Komei
- Reducing Sequence Length By Predicting Edit Operations With Large Language Models Kaneko Masahiro, Okazaki Naoaki
- Impact Of Co-occurrence On Factual Knowledge Of Large Language Models Kang Cheongwoong, Choi Jaesik
- Llm-augmented Preference Learning From Natural Language Kang Inwon, Ruan Sikai, Ho Tyler, Lin Jui-chien, Mohsin Farhad, Seneviratne Oshani, Xia Lirong
- Self-specialization: Uncovering Latent Expertise Within Large Language Models Kang Junmo, Luo Hongyin, Zhu Yada, Hansen Jacob, Glass James, Cox David, Ritter Alan, Feris Rogerio, Karlinsky Leonid
- Knowledge-augmented Reasoning Distillation For Small Language Models In Knowledge-intensive Tasks Kang Minki, Lee Seanie, Baek Jinheon, Kawaguchi Kenji, Hwang Sung Ju
- Lagr-seq: Language-guided Reinforcement Learning With Sample-efficient Querying Karimpanal Thommen George, Semage Laknath Buddhika, Rana Santu, Le Hung, Tran Truyen, Gupta Sunil, Venkatesh Svetha
- Shs-nlp At Radsum23: Domain-adaptive Pre-training Of Instruction-tuned Llms For Radiology Report Impression Generation Karn Sanjeev Kumar, Ghosh Rikhiya, P Kusuma, Farri Oladimeji
- Sorted Llama: Unlocking The Potential Of Intermediate Layers Of Large Language Models For Dynamic Inference Kavehzadeh Parsa, Valipour Mojtaba, Tahaei Marzieh, Ghodsi Ali, Chen Boxing, Rezagholizadeh Mehdi
- Application Of Frozen Large-scale Models To Multimodal Task-oriented Dialogue Kawamoto Tatsuki, Suzuki Takuma, Miyama Ko, Meguro Takumi, Takagi Tomohiro
- The Impact Of Positional Encoding On Length Generalization In Transformers Kazemnejad Amirhossein, Padhi Inkit, Ramamurthy Karthikeyan Natesan, Das Payel, Reddy Siva
- Critiquellm: Towards An Informative Critique Generation Model For Evaluation Of Large Language Model Generation Ke Pei, Wen Bosi, Feng Zhuoer, Liu Xiao, Lei Xuanyu, Cheng Jiale, Wang Shengyuan, Zeng Aohan, Dong Yuxiao, Wang Hongning, Tang Jie, Huang Minlie
- BLESS: Benchmarking Large Language Models On Sentence Simplification Kew Tannon, Chi Alison, Vásquez-rodríguez Laura, Agrawal Sweta, Aumiller Dennis, Alva-manchego Fernando, Shardlow Matthew
- Turning English-centric Llms Into Polyglots: How Much Multilinguality Is Needed? Kew Tannon, Schottmann Florian, Sennrich Rico
- Xcodeeval: A Large Scale Multilingual Multitask Benchmark For Code Understanding, Generation, Translation And Retrieval Khan Mohammad Abdullah Matin, Bari M Saiful, Do Xuan Long, Wang Weishi, Parvez Md Rizwan, Joty Shafiq
- Q: How To Specialize Large Vision-language Models To Data-scarce VQA Tasks? A: Self-train On Unlabeled Images! Khan Zaid, Bg Vijay Kumar, Schulter Samuel, Yu Xiang, Fu Yun, Chandraker Manmohan
- Tst\(^\mathrm{r}\): Target Similarity Tuning Meets The Real World Khatry Anirudh, Gulwani Sumit, Gupta Priyanshu, Le Vu, Singha Ananya, Singh Mukul, Verbruggen Gust
- Reliability Check: An Analysis Of GPT-3's Response To Sensitive Topics And Prompt Wording Khatun Aisha, Brown Daniel G.
- Expediting Contrastive Language-image Pretraining Via Self-distilled Encoders Kim Bumsoo, Kim Jinhyung, Jo Yeonsik, Kim Seung Hwan
- Transferring Pre-trained Multimodal Representations With Cross-modal Similarity Matching Kim Byoungjip, Choi Sungik, Hwang Dasol, Lee Moontae, Lee Honglak
- SOLAR 10.7B: Scaling Large Language Models With Simple Yet Effective Depth Up-scaling Kim Dahyun, Park Chanjun, Kim Sanghoon, Lee Wonsung, Song Wonho, Kim Yunsu, Kim Hyeonwoo, Kim Yungi, Lee Hyeonju, Kim Jihoo, Ahn Changbae, Yang Seonghoon, Lee Sukyung, Park Hyunbyung, Gim Gyoungjin, Cha Mikyoung, Lee Hwalsuk, Kim Sunghun
- Fantom: A Benchmark For Stress-testing Machine Theory Of Mind In Interactions Kim Hyunwoo, Sclar Melanie, Zhou Xuhui, Bras Ronan Le, Kim Gunhee, Choi Yejin, Sap Maarten
- Compressed Context Memory For Online Language Model Interaction Kim Jang-hyun, Yeom Junyoung, Yun Sangdoo, Song Hyun Oh
- Memory-efficient Fine-tuning Of Compressed Large Language Models Via Sub-4-bit Integer Quantization Kim Jeonghoon, Lee Jung Hyun, Kim Sungdong, Park Joonsuk, Yoo Kang Min, Kwon Se Jung, Lee Dongsoo
- LLM4SGG: Large Language Models For Weakly Supervised Scene Graph Generation Kim Kibum, Yoon Kanghoon, Jeon Jaehyeong, In Yeonjun, Moon Jinyoung, Kim Donghyun, Park Chanyoung
- Squeezellm: Dense-and-sparse Quantization Kim Sehoon, Hooper Coleman, Gholami Amir, Dong Zhen, Li Xiuyu, Shen Sheng, Mahoney Michael W., Keutzer Kurt
- Speculative Decoding With Big Little Decoder Kim Sehoon, Mangalam Karttikeya, Moon Suhong, Malik Jitendra, Mahoney Michael W., Gholami Amir, Keutzer Kurt
- Cotever: Chain Of Thought Prompting Annotation Toolkit For Explanation Verification Kim Seungone, Joo Se June, Jang Yul, Chae Hyungjoo, Yeo Jinyoung
- Aligning Large Language Models Through Synthetic Feedback Kim Sungdong, Bae Sanghwan, Shin Jamin, Kang Soyoung, Kwak Donghyun, Yoo Kang Min, Seo Minjoon
- MAGVLT: Masked Generative Vision-and-language Transformer Kim Sungwoong, Jo Daejin, Lee Donghoon, Kim Jongmin
- Finequant: Unlocking Efficiency With Fine-grained Weight-only Quantization For Llms Kim Young Jin, Henry Rawn, Fahim Raffy, Awadalla Hany Hassan
- Boschai @ PLABA 2023: Leveraging Edit Operations In End-to-end Neural Sentence Simplification Knappich Valentin, Razniewski Simon, Friedrich Annemarie
- Revisiting Intermediate Layer Distillation For Compressing Language Models: An Overfitting Perspective Ko Jongwoo, Park Seungjoon, Jeong Minchan, Hong Sukjin, Ahn Euijai, Chang Du-seong, Yun Se-young
- Building A Llama2-finetuned LLM For Odia Language Utilizing Domain Knowledge Instruction Set Kohli Guneet Singh, Parida Shantipriya, Sekhar Sambit, Saha Samirit, Nair Nipun B, Agarwal Parul, Khosla Sonal, Patiyal Kusumlata, Dhal Debasish
- Contextual Refinement Of Translations: Large Language Models For Sentence And Document-level Post-editing Koneru Sai, Exel Miriam, Huck Matthias, Niehues Jan
- NOLA: Compressing Lora Using Linear Combination Of Random Basis Koohpayegani Soroush Abbasi, Navaneet Kl, Nooralinejad Parsa, Kolouri Soheil, Pirsiavash Hamed
- Humans In Humans Out: On GPT Converging Toward Common Sense In Both Success And Failure Koralus Philipp, Wang-maścianica Vincent
- Performance Of The Pre-trained Large Language Model GPT-4 On Automated Short Answer Grading Kortemeyer Gerd
- Gender Bias And Stereotypes In Large Language Models Kotek Hadas, Dockum Rikker, Sun David Q.
- Understanding Catastrophic Forgetting In Language Models Via Implicit Inference Kotha Suhas, Springer Jacob Mitchell, Raghunathan Aditi
- L-TUNING: Synchronized Label Tuning For Prompt And Prefix In Llms Kowsher Md., Sobuj Md. Shohanur Islam, Mahmud Asif, Prottasha Nusrat Jahan, Bhat Prakash
- Implicit Meta-learning May Lead Language Models To Trust More Reliable Sources Krasheninnikov Dmitrii, Krasheninnikov Egor, Mlodozeniec Bruno, Maharaj Tegan, Krueger David
- Prompt Generate Train (PGT): Few-shot Domain Adaption Of Retrieval Augmented Generation Models For Open Book Question-answering Krishna C. S.
- The Moral Authority Of Chatgpt Krügel Sebastian, Ostermaier Andreas, Uhl Matthias
- Fundamentals Of Generative Large Language Models And Perspectives In Cyber-defense Kucharavy Andrei, Schillaci Zachary, Maréchal Loïc, Würsch Maxime, Dolamic Ljiljana, Sabonnadiere Remi, David Dimitri Percia, Mermoud Alain, Lenders Vincent
- Sensi-bert: Towards Sensitivity Driven Fine-tuning For Parameter-efficient BERT Kundu Souvik, Sridhar Sharath Nittur, Szankin Maciej, Sundaresan Sairam
- Active Instruction Tuning: Improving Cross-task Generalization By Training On Prompt Sensitive Tasks Kung Po-nien, Yin Fan, Wu Di, Chang Kai-wei, Peng Nanyun
- DACBERT: Leveraging Dependency Agreement For Cost-efficient BERT Pretraining Kuo Martin, Zhang Jianyi, Chen Yiran
- Sparse Fine-tuning For Inference Acceleration Of Large Language Models Kurtic Eldar, Kuznedelev Denis, Frantar Elias, Goin Michael, Alistarh Dan
- In Generative AI We Trust: Can Chatbots Effectively Verify Political Information? Kuznetsova Elizaveta, Makhortykh Mykola, Vziatysheva Victoria, Stolze Martha, Baghumyan Ani, Urman Aleksandra
- M4LE: A Multi-ability Multi-range Multi-task Multi-domain Long-context Evaluation Benchmark For Large Language Models Kwan Wai-chung, Zeng Xingshan, Wang Yufei, Sun Yusen, Li Liangyou, Shang Lifeng, Liu Qun, Wong Kam-fai
- Reward Design With Language Models Kwon Minae, Xie Sang Michael, Bullard Kalesha, Sadigh Dorsa
- Chatgpt For Arabic Grammatical Error Correction Kwon Sang Yun, Bhatia Gagan, Nagoud El Moatez Billah, Abdul-mageed Muhammad
- Hallucination Augmented Recitations For Language Models Köksal Abdullatif, Aksitov Renat, Chang Chung-ching
- Openassistant Conversations -- Democratizing Large Language Model Alignment Köpf Andreas, Kilcher Yannic, Von Rütte Dimitri, Anagnostidis Sotiris, Tam Zhi-rui, Stevens Keith, Barhoum Abdullah, Duc Nguyen Minh, Stanley Oliver, Nagyfi Richárd, Es Shahul, Suri Sameer, Glushkov David, Dantuluri Arnav, Maguire Andrew, Schuhmann Christoph, Nguyen Huu, Mattick Alexander
- Instruction-following Speech Recognition Lai Cheng-i Jeff, Lu Zhiyun, Cao Liangliang, Pang Ruoming
- Chatgpt Beyond English: Towards A Comprehensive Evaluation Of Large Language Models In Multilingual Learning Lai Viet Dac, Ngo Nghia Trung, Veyseh Amir Pouran Ben, Man Hieu, Dernoncourt Franck, Bui Trung, Nguyen Thien Huu
- Enhancing Transformers Without Self-supervised Learning: A Loss Landscape Perspective In Sequential Recommendation Lai Vivian, Chen Huiyuan, Yeh Chin-chia Michael, Xu Minghua, Cai Yiwei, Yang Hao
- LISA: Reasoning Segmentation Via Large Language Model Lai Xin, Tian Zhuotao, Chen Yukang, Li Yanwei, Yuan Yuhui, Liu Shu, Jia Jiaya
- Mini-dalle3: Interactive Text To Image By Prompting Large Language Models Lai Zeqiang, Zhu Xizhou, Dai Jifeng, Qiao Yu, Wang Wenhai
- Veclip: Improving CLIP Training Via Visual-enriched Captions Lai Zhengfeng, Zhang Haotian, Zhang Bowen, Wu Wentao, Bai Haoping, Timofeev Aleksei, Du Xianzhi, Gan Zhe, Shan Jiulong, Chuah Chen-nee, Yang Yinfei, Cao Meng
- End-to-end Speech Recognition Contextualization With Large Language Models Lakomkin Egor, Wu Chunyang, Fathullah Yassir, Kalinli Ozlem, Seltzer Michael L., Fuegen Christian
- Improving Zero-shot Visual Question Answering Via Large Language Models With Reasoning Question Prompts Lan Yunshi, Li Xiang, Liu Xin, Li Yang, Qin Wei, Qian Weining
- A Data Source For Reasoning Embodied Agents Lanchantin Jack, Sukhbaatar Sainbayar, Synnaeve Gabriel, Sun Yuxuan, Srinet Kavya, Szlam Arthur
- Cabrita: Closing The Gap For Foreign Languages Larcher Celio, Piau Marcos, Finardi Paulo, Gengo Pedro, Esposito Piero, Caridá Vinicius
- An Experimental Study On Pretraining Transformers From Scratch For IR Lassance Carlos, Déjean Hervé, Clinchant Stéphane
- Fine-tuning Chatgpt For Automatic Scoring Latif Ehsan, Zhai Xiaoming
- Building Efficient Universal Classifiers With Natural Language Inference Laurer Moritz, Van Atteveldt Wouter, Casas Andreu, Welbers Kasper
- VAL: Interactive Task Learning With GPT Dialog Parsing Lawley Lane, Maclellan Christopher J.
- Platypus: Quick, Cheap, And Powerful Refinement Of Llms Lee Ariel N., Hunter Cole J., Ruiz Nataniel
- Prompted Llms As Chatbot Modules For Long Open-domain Conversation Lee Gibbeum, Hartmann Volker, Park Jongho, Papailiopoulos Dimitris, Lee Kangwook
- RLAIF Vs. RLHF: Scaling Reinforcement Learning From Human Feedback With AI Feedback Lee Harrison, Phatale Samrat, Mansoor Hassan, Mesnard Thomas, Ferret Johan, Lu Kellie, Bishop Colton, Hall Ethan, Carbune Victor, Rastogi Abhinav, Prakash Sushant
- How Well Do Large Language Models Truly Ground? Lee Hyunji, Joo Sejune, Kim Chaeeun, Jang Joel, Kim Doyoung, On Kyoung-woon, Seo Minjoon
- Is Attention Required For ICL? Exploring The Relationship Between Model Architecture And In-context Learning Ability Lee Ivan, Jiang Nan, Berg-kirkpatrick Taylor
- Partially Randomizing Transformer Weights For Dialogue Response Diversity Lee Jing Yang, Lee Kong Aik, Gan Woon-seng
- Teaching Arithmetic To Small Transformers Lee Nayoung, Sreenivasan Kartik, Lee Jason D., Lee Kangwook, Papailiopoulos Dimitris
- Can Large Language Models Capture Dissenting Human Voices? Lee Noah, An Na Min, Thorne James
- Do We Still Need Clinical Language Models? Lehman Eric, Hernandez Evan, Mahajan Diwakar, Wulff Jonas, Smith Micah J., Ziegler Zachary, Nadler Daniel, Szolovits Peter, Johnson Alistair, Alsentzer Emily
- Creating A Dataset For High-performance Computing Code Translation Using Llms: A Bridge Between Openmp Fortran And C++ Lei Bin, Ding Caiwen, Chen Le, Lin Pei-hung, Liao Chunhua
- Chain Of Natural Language Inference For Reducing Large Language Model Ungrounded Hallucinations Lei Deren, Li Yaxi, Hu Mengya, Wang Mingyu, Yun Vincent, Ching Emily, Kamal Eslam
- Zero-shot Next-item Recommendation Using Large Pretrained Language Models Lei Wang, Ee-peng Lim
- Mitigating Fine-grained Hallucination By Fine-tuning Large Vision-language Models With Caption Rewrites Lei Wang, Jiabang He, Shenshen Li, Ning Liu, Ee-peng Lim
- Llm4vis: Explainable Visualization Recommendation Using Chatgpt Lei Wang, Songheng Zhang, Yun Wang, Ee-peng Lim, Yong Wang
- T-sciq: Teaching Multimodal Chain-of-thought Reasoning Via Mixed Large Language Model Signals For Science Question Answering Lei Wang, Yi Hu, Jiabang He, Xing Xu, Ning Liu, Hui Liu, Heng Tao Shen
- Recexplainer: Aligning Large Language Models For Explaining Recommendation Models Lei Yuxuan, Lian Jianxun, Yao Jing, Huang Xu, Lian Defu, Xie Xing
- The Language Of Prompting: What Linguistic Properties Make A Prompt Successful? Leidinger Alina, Van Rooij Robert, Shutova Ekaterina
- The Eval4nlp 2023 Shared Task On Prompting Large Language Models As Explainable Metrics Leiter Christoph, Opitz Juri, Deutsch Daniel, Gao Yang, Dror Rotem, Eger Steffen
- Self-detoxifying Language Models Via Toxification Reversal Leong Chak Tou, Cheng Yi, Wang Jiashuo, Wang Jian, Li Wenjie
- Lora Fine-tuning Efficiently Undoes Safety Training In Llama 2-chat 70B Lermen Simon, Rogers-smith Charlie, Ladish Jeffrey
- Talk The Walk: Synthetic Data Generation For Conversational Music Recommendation Leszczynski Megan, Zhang Shu, Ganti Ravi, Balog Krisztian, Radlinski Filip, Pereira Fernando, Chaganty Arun Tejasvi
- Sequential Monte Carlo Steering Of Large Language Models Using Probabilistic Programs Lew Alexander K., Zhi-xuan Tan, Grand Gabriel, Mansinghka Vikash K.
- Task Contamination: Language Models May Not Be Few-shot Anymore Li Changmao, Flanigan Jeffrey
- Vision-language Instruction Tuning: A Review And Analysis Li Chen, Ge Yixiao, Li Dian, Shan Ying
- Learning To Rewrite Prompts For Personalized Text Generation Li Cheng, Zhang Mingyang, Mei Qiaozhu, Kong Weize, Bendersky Michael
- Mugglemath: Assessing The Impact Of Query And Response Augmentation On Math Reasoning Li Chengpeng, Yuan Zheng, Yuan Hongyi, Dong Guanting, Lu Keming, Wu Jiancan, Tan Chuanqi, Wang Xiang, Zhou Chang
- Dialogue For Prompting: A Policy-gradient-based Discrete Prompt Generation For Few-shot Learning Li Chengzhengxu, Liu Xiaoming, Wang Yichen, Li Duyi, Lan Yu, Shen Chao
- Modelscope-agent: Building Your Customizable Agent System With Open-source Large Language Models Li Chenliang, Chen Hehong, Yan Ming, Shen Weizhou, Xu Haiyang, Wu Zhikai, Zhang Zhicheng, Zhou Wenmeng, Chen Yingda, Cheng Chen, Shi Hongzhu, Zhang Ji, Huang Fei, Zhou Jingren
- Llava-med: Training A Large Language-and-vision Assistant For Biomedicine In One Day Li Chunyuan, Wong Cliff, Zhang Sheng, Usuyama Naoto, Liu Haotian, Yang Jianwei, Naumann Tristan, Poon Hoifung, Gao Jianfeng
- DISTFLASHATTN: Distributed Memory-efficient Attention For Long-context Llms Training Li Dacheng, Shao Rulin, Xie Anze, Xing Eric P., Ma Xuezhe, Stoica Ion, Gonzalez Joseph E., Zhang Hao
- Repetition In Repetition Out: Towards Understanding Neural Text Degeneration From The Data Perspective Li Huayang, Lan Tian, Fu Zihao, Cai Deng, Liu Lemao, Collier Nigel, Watanabe Taro, Su Yixuan
- Mastering Robot Manipulation With Multimodal Prompts Through Pretraining And Multi-task Fine-tuning Li Jiachen, Gao Qiaozi, Johnston Michael, Gao Xiaofeng, He Xuehai, Shakiah Suhaila, Shi Hangjie, Ghanadan Reza, Wang William Yang
- Text Is All You Need: Learning Language Representations For Sequential Recommendation Li Jiacheng, Wang Ming, Li Jin, Fu Jinmiao, Shen Xin, Shang Jingbo, Mcauley Julian
- Eliciting The Translation Ability Of Large Language Models Via Multilingual Finetuning With Translation Instructions Li Jiahuan, Zhou Hao, Huang Shujian, Cheng Shanbo, Chen Jiajun
- Adaptive Gating In Mixture-of-experts Based Language Models Li Jiamin, Su Qiang, Yang Yitao, Jiang Yimin, Wang Cong, Xu Hong
- CFGPT: Chinese Financial Assistant With Large Language Model Li Jiangtong, Bian Yuxuan, Wang Guoxuan, Lei Yang, Cheng Dawei, Ding Zhijun, Jiang Changjun
- Evcap: Retrieval-augmented Image Captioning With External Visual-name Memory For Open-world Comprehension Li Jiaxuan, Vo Duc Minh, Sugimoto Akihiro, Nakayama Hideki
- Prefix Propagation: Parameter-efficient Tuning For Long Sequences Li Jonathan, Aitken Will, Bhambhoria Rohan, Zhu Xiaodan
- MAQA: A Multimodal QA Benchmark For Negation Li Judith Yue, Jansen Aren, Huang Qingqing, Lee Joonseok, Ganti Ravi, Kuzmin Dima
- Fine-tuning Multimodal Llms To Follow Zero-shot Demonstrative Instructions Li Juncheng, Pan Kaihang, Ge Zhiqi, Gao Minghe, Ji Wei, Zhang Wenqiao, Chua Tat-seng, Tang Siliang, Zhang Hanwang, Zhuang Yueting
- Openba: An Open-sourced 15B Bilingual Asymmetric Seq2seq Model Pre-trained From Scratch Li Juntao, Tang Zecheng, Ding Yuyang, Wang Pinzheng, Guo Pei, You Wangjie, Qiao Dan, Chen Wenliang, Fu Guohong, Zhu Qiaoming, Zhou Guodong, Zhang Min
- The Web Can Be Your Oyster For Improving Large Language Models Li Junyi, Tang Tianyi, Zhao Wayne Xin, Wang Jingyuan, Nie Jian-yun, Wen Ji-rong
- Mvbench: A Comprehensive Multi-modal Video Understanding Benchmark Li Kunchang, Wang Yali, He Yinan, Li Yizhuo, Wang Yi, Liu Yi, Wang Zun, Xu Jilan, Chen Guo, Luo Ping, Wang Limin, Qiao Yu
- Watermarking Llms With Weight Quantization Li Linyang, Jiang Botian, Wang Pengyu, Ren Ke, Yan Hang, Qiu Xipeng
- Summarizing Multiple Documents With Conversational Structure For Meta-review Generation Li Miao, Hovy Eduard, Lau Jey Han
- Reflection-tuning: Data Recycling Improves LLM Instruction-tuning Li Ming, Chen Lichang, Chen Jiuhai, He Shwai, Huang Heng, Gu Jiuxiang, Zhou Tianyi
- Api-bank: A Comprehensive Benchmark For Tool-augmented Llms Li Minghao, Zhao Yingxiu, Yu Bowen, Song Feifan, Li Hangyu, Yu Haiyang, Li Zhoujun, Huang Fei, Li Yongbin
- Table-gpt: Table-tuned GPT For Diverse Table Tasks Li Peng, He Yeye, Yashar Dror, Cui Weiwei, Ge Song, Zhang Haidong, Fainman Danielle Rifinski, Zhang Dongmei, Chaudhuri Surajit
- Laffi: Leveraging Hybrid Natural Language Feedback For Fine-tuning Language Models Li Qianxi, Cao Yingyue, Kang Jikun, Yang Tianpei, Chen Xi, Jin Jun, Taylor Matthew E.
- Are Human-generated Demonstrations Necessary For In-context Learning? Li Rui, Wang Guoyin, Li Jiwei
- Vlm-eval: A General Evaluation On Video Large Language Models Li Shuailin, Zhang Yuang, Zhao Yucheng, Wang Qiuyue, Jia Fan, Liu Yingfei, Wang Tiancai
- Autoconv: Automatically Generating Information-seeking Conversations With Large Language Models Li Siheng, Yang Cheng, Yin Yichun, Zhu Xinyu, Cheng Zesen, Shang Lifeng, Jiang Xin, Liu Qun, Yang Yujiu
- Tailoring Personality Traits In Large Language Models Via Unsupervisedly-built Personalized Lexicons Li Tianlong, Dou Shihan, Lv Changze, Liu Wenhao, Xu Jianhan, Wu Muling, Ling Zixuan, Zheng Xiaoqing, Huang Xuanjing
- Benchmarking And Improving Generator-validator Consistency Of Language Models Li Xiang Lisa, Shrivastava Vaishnavi, Li Siyan, Hashimoto Tatsunori, Liang Percy
- FLM-101B: An Open LLM And How To Train It With $100K Budget Li Xiang, Yao Yiqun, Jiang Xin, Fang Xuezhi, Meng Xuying, Fan Siqi, Han Peng, Li Jing, Du Li, Qin Bowen, Zhang Zheng, Sun Aixin, Wang Yequan
- Mot: Memory-of-thought Enables Chatgpt To Self-improve Li Xiaonan, Qiu Xipeng
- Unlock The Power: Competitive Distillation For Multi-modal Large Language Models Li Xinwei, Lin Li, Wang Shuai, Qian Chen
- Exploring Fine-tuning Chatgpt For News Recommendation Li Xinyi, Zhang Yongfeng, Malthouse Edward C.
- PBNR: Prompt-based News Recommender System Li Xinyi, Zhang Yongfeng, Malthouse Edward C.
- Stablellava: Enhanced Visual Instruction Tuning With Synthesized Image-dialogue Data Li Yanda, Zhang Chi, Yu Gang, Wang Zhibin, Fu Bin, Lin Guosheng, Shen Chunhua, Chen Ling, Wei Yunchao
- On Bilingual Lexicon Induction With Large Language Models Li Yaoyiran, Korhonen Anna, Vulić Ivan
- Dissecting Chain-of-thought: Compositionality Through In-context Filtering And Learning Li Yingcong, Sreenivasan Kartik, Giannou Angeliki, Papailiopoulos Dimitris, Oymak Samet
- A Survey On Fairness In Large Language Models Li Yingji, Du Mengnan, Song Rui, Wang Xin, Wang Ying
- Turning Dust Into Gold: Distilling Complex Reasoning Capabilities From Llms By Leveraging Negative Data Li Yiwei, Yuan Peiwen, Feng Shaoxiong, Pan Boyuan, Sun Bin, Wang Xinglin, Wang Heda, Li Kan
- Loftq: Lora-fine-tuning-aware Quantization For Large Language Models Li Yixiao, Yu Yifan, Liang Chen, He Pengcheng, Karampatziakis Nikos, Chen Weizhu, Zhao Tuo
- RAIN: Your Language Models Can Align Themselves Without Finetuning Li Yuhui, Wei Fangyun, Zhao Jinjing, Zhang Chao, Zhang Hongyang
- E-sparse: Boosting The Large Language Model Inference Through Entropy-based N:M Sparsity Li Yun, Niu Lin, Zhang Xipeng, Liu Kai, Zhu Jianchen, Kang Zhanhui
- Advancing Precise Outline-conditioned Text Generation With Task Duality And Explicit Outline Control Li Yunzhe, Chen Qian, Yan Weixiang, Wang Wen, Zhang Qinglin, Sundaram Hari
- Guiding Large Language Models Via Directional Stimulus Prompting Li Zekun, Peng Baolin, He Pengcheng, Galley Michel, Gao Jianfeng, Yan Xifeng
- Flexkbqa: A Flexible Llm-powered Framework For Few-shot Knowledge Base Question Answering Li Zhenyu, Fan Sunqi, Gu Yu, Li Xiuxing, Duan Zhichao, Dong Bowen, Liu Ning, Wang Jianyong
- QFT: Quantized Full-parameter Tuning Of Llms With Affordable Resources Li Zhikai, Liu Xiaoxuan, Zhu Banghua, Dong Zhen, Gu Qingyi, Keutzer Kurt
- Unveiling The Pitfalls Of Knowledge Editing For Large Language Models Li Zhoubo, Zhang Ningyu, Yao Yunzhi, Wang Mengru, Chen Xi, Chen Huajun
- Synthetic Data Generation With Large Language Models For Text Classification: Potential And Limitations Li Zhuoyan, Zhu Hangxiao, Lu Zhuoran, Yin Ming
- Batgpt: A Bidirectional Autoregessive Talker From Generative Pre-trained Transformer Li Zuchao, Zhang Shitou, Zhao Hai, Yang Yifei, Yang Dongjie
- Homodistil: Homotopic Task-agnostic Distillation Of Pre-trained Transformers Liang Chen, Jiang Haoming, Li Zheng, Tang Xianfeng, Yin Bin, Zhao Tuo
- Exploring Format Consistency For Instruction Tuning Liang Shihao, Tian Runchu, Zhu Kunlun, Qin Yujia, Wang Huadong, Cong Xin, Liu Zhiyuan, Liu Xiaojiang, Sun Maosong
- Prompting Large Language Models With Chain-of-thought For Few-shot Knowledge Base Question Generation Liang Yuanyuan, Wang Jianing, Zhu Hanlun, Wang Lei, Qian Weining, Lan Yunshi
- Let GPT Be A Math Tutor: Teaching Math Word Problem Solvers With Customized Exercise Generation Liang Zhenwen, Yu Wenhao, Rajpurohit Tanmay, Clark Peter, Zhang Xiangliang, Kaylan Ashwin
- Make Pre-trained Model Reversible: From Parameter To Memory Efficient Fine-tuning Liao Baohao, Tan Shaomu, Monz Christof
- GPT-4 Enhanced Multimodal Grounding For Autonomous Driving: Leveraging Cross-modal Attention With Large Language Models Liao Haicheng, Shen Huanming, Li Zhenning, Wang Chengyue, Li Guofa, Bie Yiming, Xu Chengzhong
- Opening Up Chatgpt: Tracking Openness, Transparency, And Accountability In Instruction-tuned Text Generators Liesenfeld Andreas, Lopez Alianda, Dingemanse Mark
- STEVE-1: A Generative Model For Text-to-behavior In Minecraft Lifshitz Shalev, Paster Keiran, Chan Harris, Ba Jimmy, Mcilraith Sheila
- A Survey On Large Language Models For Recommendation Likang Wu, Zhi Zheng, Zhaopeng Qiu, Hao Wang, Hongchao Gu, Tingjia Shen, Chuan Qin, Chen Zhu, Hengshu Zhu, Qi Liu, Hui Xiong, Enhong Chen
- The Unlocking Spell On Base Llms: Rethinking Alignment Via In-context Learning Lin Bill Yuchen, Ravichander Abhilasha, Lu Ximing, Dziri Nouha, Sclar Melanie, Chandu Khyathi, Bhagavatula Chandra, Choi Yejin
- Beneath The Surface: Unveiling Harmful Memes With Multimodal Reasoning Distilled From Large Language Models Lin Hongzhan, Luo Ziyang, Ma Jing, Chen Long
- Learning To Model The World With Language Lin Jessy, Du Yuqing, Watkins Olivia, Hafner Danijar, Abbeel Pieter, Klein Dan, Dragan Anca
- VILA: On Pre-training For Visual Language Models Lin Ji, Yin Hongxu, Ping Wei, Lu Yao, Molchanov Pavlo, Tao Andrew, Mao Huizi, Kautz Jan, Shoeybi Mohammad, Han Song
- Rella: Retrieval-enhanced Large Language Models For Lifelong Sequential Behavior Comprehension In Recommendation Lin Jianghao, Shan Rong, Zhu Chenxu, Du Kounianhua, Chen Bo, Quan Shigang, Tang Ruiming, Yu Yong, Zhang Weinan
- Few-shot Adaptation For Parsing Contextual Utterances With Llms Lin Kevin, Xia Patrick, Fang Hao
- RA-DIT: Retrieval-augmented Dual Instruction Tuning Lin Xi Victoria, Chen Xilun, Chen Mingda, Shi Weijia, Lomeli Maria, James Rich, Rodriguez Pedro, Kahn Jacob, Szilvasy Gergely, Lewis Mike, Zettlemoyer Luke, Yih Scott
- Dynamic Scheduled Sampling With Imitation Loss For Neural Text Generation Lin Xiang, Jwalapuram Prathyusha, Joty Shafiq
- Taiwan LLM: Bridging The Linguistic Divide With A Culturally Aligned Language Model Lin Yen-ting, Chen Yun-nung
- Mitigating The Alignment Tax Of RLHF Lin Yong, Lin Hangyu, Xiong Wei, Diao Shizhe, Liu Jianmeng, Zhang Jipeng, Pan Rui, Wang Haoxiang, Hu Wenbin, Zhang Hanning, Dong Hanze, Pi Renjie, Zhao Han, Jiang Nan, Ji Heng, Yao Yuan, Zhang Tong
- Pushing Large Language Models To The 6G Edge: Vision, Challenges, And Opportunities Lin Zheng, Qu Guanqiao, Chen Qiyuan, Chen Xianhao, Chen Zhe, Huang Kaibin
- SPHINX: The Joint Mixing Of Weights, Tasks, And Visual Embeddings For Multi-modal Large Language Models Lin Ziyi, Liu Chris, Zhang Renrui, Gao Peng, Qiu Longtian, Xiao Han, Qiu Han, Lin Chen, Shao Wenqi, Chen Keqin, Han Jiaming, Huang Siyuan, Zhang Yichi, He Xuming, Li Hongsheng, Qiao Yu
- Exposing Attention Glitches With Flip-flop Language Modeling Liu Bingbin, Ash Jordan T., Goel Surbhi, Krishnamurthy Akshay, Zhang Cyril
- Code Execution With Pre-trained Language Models Liu Chenxiao, Lu Shuai, Chen Weizhu, Jiang Daxin, Svyatkovskiy Alexey, Fu Shengyu, Sundaresan Neel, Duan Nan
- Examining Llms' Uncertainty Expression Towards Questions Outside Parametric Knowledge Liu Genglin, Wang Xingyao, Yuan Lifan, Chen Yangyi, Peng Hao
- Glore: Evaluating Logical Reasoning Of Large Language Models Liu Hanmeng, Teng Zhiyang, Ning Ruoxi, Liu Jian, Zhou Qiji, Zhang Yue
- Exploration With Principles For Diverse AI Supervision Liu Hao, Zaharia Matei, Abbeel Pieter
- Ring Attention With Blockwise Transformers For Near-infinite Context Liu Hao, Zaharia Matei, Abbeel Pieter
- Grounding Complex Natural Language Commands For Temporal Tasks In Unseen Environments Liu Jason Xinyu, Yang Ziyi, Idrees Ifrah, Liang Sam, Schornstein Benjamin, Tellex Stefanie, Shah Ankit
- Chatcounselor: A Large Language Models For Mental Health Support Liu June M., Li Donghao, Cao He, Ren Tianhe, Liao Zeyi, Wu Jiamin
- Qilin-med-vl: Towards Chinese Large Vision-language Model For General Healthcare Liu Junling, Wang Ziming, Ye Qichen, Chong Dading, Zhou Peilin, Hua Yining
- Contrastive Vision-language Alignment Makes Efficient Instruction Learner Liu Lizhao, Sun Xinyu, Xiang Tianhang, Zhuang Zhuangwei, Yin Liuren, Tan Mingkui
- Chipnemo: Domain-adapted Llms For Chip Design Liu Mingjie, Ene Teodor-dumitru, Kirby Robert, Cheng Chris, Pinckney Nathaniel, Liang Rongjian, Alben Jonah, Anand Himyanshu, Banerjee Sanmitra, Bayraktaroglu Ismet, Bhaskaran Bonita, Catanzaro Bryan, Chaudhuri Arjun, Clay Sharon, Dally Bill, Dang Laura, Deshpande Parikshit, Dhodhi Siddhanth, Halepete Sameer, Hill Eric, Hu Jiashang, Jain Sumit, Jindal Ankit, Khailany Brucek, Kokai George, Kunal Kishor, Li Xiaowei, Lind Charley, Liu Hao, Oberman Stuart, Omar Sujeet, Pasandi Ghasem, Pratty Sreedhar, Raiman Jonathan, Sarkar Ambar, Shao Zhengjiang, Sun Hanfei, Suthar Pratik P, Tej Varun, Turner Walker, Xu Kaizhe, Ren Haoxing
- X-eval: Generalizable Multi-aspect Text Evaluation Via Augmented Instruction Tuning With Auxiliary Evaluation Aspects Liu Minqian, Shen Ying, Xu Zhiyang, Cao Yixin, Cho Eunah, Kumar Vaibhav, Ghanadan Reza, Huang Lifu
- Nlebench+norglm: A Comprehensive Empirical Analysis And Benchmark Dataset For Generative Language Models In Norwegian Liu Peng, Zhang Lemei, Farup Terje Nissen, Lauvrak Even W., Ingvaldsen Jon Espen, Eide Simen, Gulla Jon Atle, Yang Zhirong
- From Zero To Hero: Examining The Power Of Symbolic Tasks In Instruction Tuning Liu Qian, Zhou Fan, Jiang Zhengbao, Dou Longxu, Lin Min
- Bt-adapter: Video Conversation Is Feasible Without Video Instruction Tuning Liu Ruyang, Li Chen, Ge Yixiao, Shan Ying, Li Thomas H., Li Ge
- Music Understanding Llama: Advancing Text-to-music Generation With Question Answering And Captioning Liu Shansong, Hussain Atin Sakkeer, Sun Chenshuo, Shan Ying
- In-context Vectors: Making In Context Learning More Effective And Controllable Through Latent Space Steering Liu Sheng, Ye Haotian, Xing Lei, Zou James
- Language Models As Black-box Optimizers For Vision-language Models Liu Shihong, Lin Zhiqiu, Yu Samuel, Lee Ryan, Ling Tiffany, Pathak Deepak, Ramanan Deva
- Goat: Fine-tuned Llama Outperforms GPT-4 On Arithmetic Tasks Liu Tiedong, Low Bryan Kian Hsiang
- Aligning Large Language Models With Human Preferences Through Representation Engineering Liu Wenhao, Wang Xiaohua, Wu Muling, Li Tianlong, Lv Changze, Ling Zixuan, Zhu Jianhao, Zhang Cenyuan, Zheng Xiaoqing, Huang Xuanjing
- Fingpt: Democratizing Internet-scale Data For Financial Large Language Models Liu Xiao-yang, Wang Guoxuan, Yang Hongyang, Zha Daochen
- Scaling Laws Of Rope-based Extrapolation Liu Xiaoran, Yan Hang, Zhang Shuo, An Chenxin, Qiu Xipeng, Lin Dahua
- Online Speculative Decoding Liu Xiaoxuan, Hu Lanxiang, Bailis Peter, Cheung Alvin, Deng Zhijie, Stoica Ion, Zhang Hao
- Litcab: Lightweight Language Model Calibration Over Short- And Long-form Responses Liu Xin, Khalifa Muhammad, Wang Lu
- What Large Language Models Bring To Text-rich VQA? Liu Xuejing, Tang Wei, Ni Xinzhe, Lu Jinghui, Zhao Rui, Li Zechao, Tan Fei
- Meta Semantic Template For Evaluation Of Large Language Models Liu Yachuan, Chen Liang, Wang Jindong, Mei Qiaozhu, Xie Xing
- Parallel Sentence-level Explanation Generation For Real-world Low-resource Scenarios Liu Yan, Chen Xiaokang, Dai Qi
- Summary Of Chatgpt-related Research And Perspective Towards The Future Of Large Language Models Liu Yiheng, Han Tianle, Ma Siyuan, Zhang Jiayue, Yang Yuanyuan, Tian Jiaming, He Hao, Li Antong, He Mengshen, Liu Zhengliang, Wu Zihao, Zhao Lin, Zhu Dajiang, Li Xiang, Qiang Ning, Shen Dingang, Liu Tianming, Ge Bao
- Instruction Position Matters In Sequence Generation With Large Language Models Liu Yijin, Zeng Xianfeng, Meng Fandong, Zhou Jie
- On Learning To Summarize With Large Language Models As References Liu Yixin, Shi Kejian, He Katherine S, Ye Longtian, Fabbri Alexander R., Liu Pengfei, Radev Dragomir, Cohan Arman
- Molxpt: Wrapping Molecules With Text For Generative Pre-training Liu Zequn, Zhang Wei, Xia Yingce, Wu Lijun, Xie Shufang, Qin Tao, Zhang Ming, Liu Tie-yan
- Picking The Underused Heads: A Network Pruning Perspective Of Attention Head Selection For Fusing Dialogue Coreference Information Liu Zhengyuan, Chen Nancy F.
- LLM360: Towards Fully Transparent Open-source Llms Liu Zhengzhong, Qiao Aurick, Neiswanger Willie, Wang Hongyi, Tan Bowen, Tao Tianhua, Li Junbo, Wang Yuqi, Sun Suqi, Pangarkar Omkar, Fan Richard, Gu Yi, Miller Victor, Zhuang Yonghao, He Guowei, Li Haonan, Koto Fajri, Tang Liping, Ranjan Nikhil, Shen Zhiqiang, Ren Xuguang, Iriondo Roberto, Mu Cun, Hu Zhiting, Schulze Mark, Nakov Preslav, Baldwin Tim, Xing Eric P.
- Reason For Future, Act For Now: A Principled Framework For Autonomous LLM Agents With Provable Sample Efficiency Liu Zhihan, Hu Hao, Zhang Shenao, Guo Hongyi, Ke Shuqi, Liu Boyi, Wang Zhaoran
- Deja Vu: Contextual Sparsity For Efficient Llms At Inference Time Liu Zichang, Wang Jue, Dao Tri, Zhou Tianyi, Yuan Binhang, Song Zhao, Shrivastava Anshumali, Zhang Ce, Tian Yuandong, Ré Christopher, Chen Beidi
- On Exploring The Reasoning Capability Of Large Language Models With Knowledge Graphs Lo Pei-chi, Tsai Yi-hang, Lim Ee-peng, Hwang San-yih
- SAIE Framework: Support Alone Isn't Enough -- Advancing LLM Training With Adversarial Remarks Loem Mengsay, Kaneko Masahiro, Okazaki Naoaki
- Harnessing Large Language Models' Empathetic Response Generation Capabilities For Online Mental Health Counselling Support Loh Siyuan Brandon, Raamkumar Aravind Sesagiri
- Data-to-text Generation For Severely Under-resourced Languages With GPT-3.5: A Bit Of Help Needed From Google Translate Lorandi Michela, Belz Anya
- MUFFIN: Curating Multi-faceted Instructions For Improving Instruction-following Lou Renze, Zhang Kai, Xie Jian, Sun Yuxuan, Ahn Janice, Xu Hanzi, Su Yu, Yin Wenpeng
- Chain-of-dictionary Prompting Elicits Translation In Large Language Models Lu Hongyuan, Yang Haoran, Huang Haoyang, Zhang Dongdong, Lam Wai, Wei Furu
- Unified-io 2: Scaling Autoregressive Multimodal Models With Vision, Language, Audio, And Action Lu Jiasen, Clark Christopher, Lee Sangho, Zhang Zichen, Khosla Savya, Marten Ryan, Hoiem Derek, Kembhavi Aniruddha
- Llama-reviewer: Advancing Code Review Automation With Large Language Models Through Parameter-efficient Fine-tuning Lu Junyi, Yu Lei, Li Xiaojia, Yang Li, Zuo Chun
- Ziya-visual: Bilingual Large Vision-language Model Via Multi-task Instruction Tuning Lu Junyu, Zhang Dixiang, Wu Xiaojun, Gao Xinyu, Gan Ruyi, Zhang Jiaxing, Song Yan, Zhang Pingjian
- Lyrics: Boosting Fine-grained Language-vision Alignment And Comprehension Via Semantic-aware Visual Objects Lu Junyu, Zhang Dixiang, Zhang Songxin, Xie Zejian, Song Zhuoyang, Lin Cong, Zhang Jiaxing, Jing Bingyi, Zhang Pingjian
- Routing To The Expert: Efficient Reward-guided Ensemble Of Large Language Models Lu Keming, Yuan Hongyi, Lin Runji, Lin Junyang, Yuan Zheng, Zhou Chang, Zhou Jingren
- Inference-time Policy Adapters (IPA): Tailoring Extreme-scale Lms Without Fine-tuning Lu Ximing, Brahman Faeze, West Peter, Jang Jaehun, Chandu Khyathi, Ravichander Abhilasha, Qin Lianhui, Ammanabrolu Prithviraj, Jiang Liwei, Ramnath Sahana, Dziri Nouha, Fisher Jillian, Lin Bill Yuchen, Hallinan Skyler, Ren Xiang, Welleck Sean, Choi Yejin
- SCITAB: A Challenging Benchmark For Compositional Reasoning And Claim Verification On Scientific Tables Lu Xinyuan, Pan Liangming, Liu Qian, Nakov Preslav, Kan Min-yen
- Effectively Fine-tune To Improve Large Multimodal Models For Radiology Report Generation Lu Yuzhe, Hong Sungmin, Shah Yash, Xu Panpan
- Explanation-based Finetuning Makes Models More Robust To Spurious Cues Ludan Josh Magnus, Meng Yixuan, Nguyen Tai, Shah Saurabh, Lyu Qing, Apidianaki Marianna, Callison-burch Chris
- Enhancing Black-box Few-shot Text Classification With Prompt-based Data Augmentation Luo Danqing, Zhang Chen, Xu Jiahui, Wang Bin, Chen Yiming, Zhang Yan, Li Haizhou
- Cheap And Quick: Efficient Vision-language Instruction Tuning For Large Language Models Luo Gen, Zhou Yiyi, Ren Tianhe, Chen Shengxin, Sun Xiaoshuai, Ji Rongrong
- Exploring Small Language Models With Prompt-learning Paradigm For Efficient Domain-specific Text Classification Luo Hengyu, Liu Peng, Esping Stefan
- SAIL: Search-augmented Instruction Learning Luo Hongyin, Chuang Yung-sung, Gong Yuan, Zhang Tianhua, Kim Yoon, Wu Xixin, Fox Danny, Meng Helen, Glass James
- Taiyi: A Bilingual Fine-tuned Large Language Model For Diverse Biomedical Tasks Luo Ling, Ning Jinzhong, Zhao Yingwen, Wang Zhijun, Ding Zeyuan, Chen Peng, Fu Weiru, Han Qinyu, Xu Guangtao, Qiu Yunzhi, Pan Dinghao, Li Jiru, Li Hao, Feng Wenduo, Tu Senbo, Liu Yuqi, Yang Zhihao, Wang Jian, Sun Yuanyuan, Lin Hongfei
- Reasoning On Graphs: Faithful And Interpretable Large Language Model Reasoning Luo Linhao, Li Yuan-fang, Haffari Gholamreza, Pan Shirui
- Systematic Assessment Of Factual Knowledge In Large Language Models Luo Linhao, Vu Thuy-trang, Phung Dinh, Haffari Gholamreza
- Towards Logiglue: A Brief Survey And A Benchmark For Analyzing Logical Reasoning Capabilities Of Language Models Luo Man, Kumbhar Shrinidhi, Shen Ming, Parmar Mihir, Varshney Neeraj, Banerjee Pratyay, Aditya Somak, Baral Chitta
- Dr.icl: Demonstration-retrieved In-context Learning Luo Man, Xu Xin, Dai Zhuyun, Pasupat Panupong, Kazemi Mehran, Baral Chitta, Imbrasaite Vaiva, Zhao Vincent Y
- Recranker: Instruction Tuning Large Language Model As Ranker For Top-k Recommendation Luo Sichun, He Bowei, Zhao Haohan, Shao Wei, Qi Yanlin, Huang Yinya, Zhou Aojun, Yao Yuxuan, Li Zongpeng, Xiao Yuanzhang, Zhan Mingjie, Song Linqi
- YAYI 2: Multilingual Open-source Large Language Models Luo Yin, Kong Qingchao, Xu Nan, Cao Jia, Hao Bao, Qu Baoyu, Chen Bo, Zhu Chao, Zhao Chenyang, Zhang Donglei, Feng Fan, Zhao Feifei, Sun Hailong, Yang Hanxuan, Pan Haojun, Liu Hongyu, Guo Jianbin, Du Jiangtao, Wang Jingyi, Li Junfeng, Sun Lei, Liu Liduo, Dong Lifeng, Liu Lili, Wang Lin, Zhang Liwen, Wang Minzheng, Wang Pin, Yu Ping, Li Qingxiao, Yan Rui, Zou Rui, Li Ruiqun, Huang Taiwen, Wang Xiaodong, Wu Xiaofei, Peng Xin, Zhang Xina, Fang Xing, Xiao Xinglin, Hao Yanni, Dong Yao, Wang Yigang, Liu Ying, Jiang Yongyu, Wang Yungan, Wang Yuqi, Wang Zhangsheng, Yu Zhaoxin, Luo Zhen, Mao Wenji, Wang Lei, Zeng Dajun
- Biomedgpt: Open Multimodal Generative Pre-trained Transformer For Biomedicine Luo Yizhen, Zhang Jiahuan, Fan Siqi, Yang Kai, Wu Yushuai, Qiao Mu, Nie Zaiqing
- Unlocking The Potential Of Large Language Models For Explainable Recommendations Luo Yucong, Cheng Mingyue, Zhang Hao, Lu Junyu, Liu Qi, Chen Enhong
- An Empirical Study Of Catastrophic Forgetting In Large Language Models During Continual Fine-tuning Luo Yun, Yang Zhen, Meng Fandong, Li Yafu, Zhou Jie, Zhang Yue
- Augmented Large Language Models With Parametric Knowledge Guiding Luo Ziyang, Xu Can, Zhao Pu, Geng Xiubo, Tao Chongyang, Ma Jing, Lin Qingwei, Jiang Daxin
- Fingpt: Large Generative Models For A Small Language Luukkonen Risto, Komulainen Ville, Luoma Jouni, Eskelinen Anni, Kanerva Jenna, Kupari Hanna-mari, Ginter Filip, Laippala Veronika, Muennighoff Niklas, Piktus Aleksandra, Wang Thomas, Tazi Nouamane, Scao Teven Le, Wolf Thomas, Suominen Osma, Sairanen Samuli, Merioksa Mikko, Heinonen Jyrki, Vahtola Aija, Antao Samuel, Pyysalo Sampo
- Are We Falling In A Middle-intelligence Trap? An Analysis And Mitigation Of The Reversal Curse Lv Ang, Zhang Kaiyi, Xie Shufang, Tu Quan, Chen Yuhan, Wen Ji-rong, Yan Rui
- Full Parameter Fine-tuning For Large Language Models With Limited Resources Lv Kai, Yang Yuqing, Liu Tengxiao, Gao Qinghui, Guo Qipeng, Qiu Xipeng
- Collie: Collaborative Training Of Large Language Models In An Efficient Way Lv Kai, Zhang Shuo, Gu Tianle, Xing Shuhao, Hong Jiawei, Chen Keyu, Liu Xiaoran, Yang Yuqing, Guo Honglin, Liu Tengxiao, Sun Yu, Guo Qipeng, Yan Hang, Qiu Xipeng
- KOSMOS-2.5: A Multimodal Literate Model Lv Tengchao, Huang Yupan, Chen Jingye, Zhao Yuzhong, Jia Yilin, Cui Lei, Ma Shuming, Chang Yaoyao, Huang Shaohan, Wang Wenhui, Dong Li, Luo Weiyao, Wu Shaoxiang, Wang Guoxin, Zhang Cha, Wei Furu
- Pre-training With Large Language Model-based Document Expansion For Dense Passage Retrieval Ma Guangyuan, Wu Xing, Wang Peng, Lin Zijia, Hu Songlin
- Fairness-guided Few-shot Prompting For Large Language Models Ma Huan, Zhang Changqing, Bian Yatao, Liu Lemao, Zhang Zhirui, Zhao Peilin, Zhang Shu, Fu Huazhu, Hu Qinghua, Wu Bingzhe
- Untying The Reversal Curse Via Bidirectional Language Model Editing Ma Jun-yu, Gu Jia-chen, Ling Zhen-hua, Liu Quan, Liu Cong
- Chain-of-skills: A Configurable Model For Open-domain Question Answering Ma Kaixin, Cheng Hao, Zhang Yu, Liu Xiaodong, Nyberg Eric, Gao Jianfeng
- POS: A Prompts Optimization Suite For Augmenting Text-to-video Generation Ma Shijie, Xu Huayi, Li Mengjian, Geng Weidong, Wang Yaxiong, Wang Meng
- Fine-tuning Llama For Multi-stage Text Retrieval Ma Xueguang, Wang Liang, Yang Nan, Wei Furu, Lin Jimmy
- At Which Training Stage Does Code Data Help Llms Reasoning? Ma Yingwei, Liu Yue, Yu Yue, Zhang Yuanliang, Jiang Yu, Wang Changjian, Li Shanshan
- Sci-cot: Leveraging Large Language Models For Enhanced Knowledge Distillation In Small Models For Scientific QA Ma Yuhan, Jiang Haiqi, Fan Chenyou
- Non-autoregressive Streaming Transformer For Simultaneous Translation Ma Zhengrui, Zhang Shaolei, Guo Shoutao, Shao Chenze, Zhang Min, Feng Yang
- World-to-words: Grounded Open Vocabulary Acquisition Through Fast Mapping In Vision-language Models Ma Ziqiao, Pan Jiayi, Chai Joyce
- Self-refine: Iterative Refinement With Self-feedback Madaan Aman, Tandon Niket, Gupta Prakhar, Hallinan Skyler, Gao Luyu, Wiegreffe Sarah, Alon Uri, Dziri Nouha, Prabhumoye Shrimai, Yang Yiming, Gupta Shashank, Majumder Bodhisattwa Prasad, Hermann Katherine, Welleck Sean, Yazdanbakhsh Amir, Clark Peter
- Enhancing Conversational Quality In Language Learning Chatbots: An Evaluation Of GPT4 For ASR Error Correction Mai Long, Carson-berndsen Julie
- Harnessing The Power Of Prompt-based Techniques For Generating School-level Questions Using Large Language Models Maity Subhankar, Deroy Aniket, Sarkar Sudeshna
- Fine-tuning Language Models With Just Forward Passes Malladi Sadhika, Gao Tianyu, Nichani Eshaan, Damian Alex, Lee Jason D., Chen Danqi, Arora Sanjeev
- ICL Markup: Structuring In-context Learning Using Soft-token Tags Marc-etienne Brunet, Ashton Anderson, Richard Zemel
- When Less Is More: Investigating Data Pruning For Pretraining Llms At Scale Marion Max, Üstün Ahmet, Pozzobon Luiza, Wang Alex, Fadaee Marzieh, Hooker Sara
- Interpreting Learned Feedback Patterns In Large Language Models Marks Luke, Abdullah Amir, Neo Clement, Arike Rauno, Krueger David, Torr Philip, Barez Fazl
- The Geometry Of Truth: Emergent Linear Structure In Large Language Model Representations Of True/false Datasets Marks Samuel, Tegmark Max
- Investigating The Efficacy Of Large Language Models In Reflective Assessment Methods Through Chain Of Thoughts Prompting Masikisiki Baphumelele, Marivate Vukosi, Hlope Yvette
- Is Feedback All You Need? Leveraging Natural Language Feedback In Goal-conditioned Reinforcement Learning Mccallum Sabrina, Taylor-davies Max, Albrecht Stefano V., Suglia Alessandro
- Sources Of Hallucination By Large Language Models On Inference Tasks Mckenna Nick, Li Tianyi, Cheng Liang, Hosseini Mohammad Javad, Johnson Mark, Steedman Mark
- Enhancing LLM Intelligence With ARM-RAG: Auxiliary Rationale Memory For Retrieval Augmented Generation Melz Eric
- Representation Deficiency In Masked Language Modeling Meng Yu, Krishnan Jitin, Wang Sinong, Wang Qifan, Mao Yuning, Fang Han, Ghazvininejad Marjan, Han Jiawei, Zettlemoyer Luke
- Self-checker: Plug-and-play Modules For Fact-checking With Large Language Models Miaoran Li, Baolin Peng, Michel Galley, Jianfeng Gao, Zhu Zhang
- The Larger They Are, The Harder They Fail: Language Models Do Not Recognize Identifier Swaps In Python Miceli-barone Antonio Valerio, Barez Fazl, Konstas Ioannis, Cohen Shay B.
- Uzh_clyp At Semeval-2023 Task 9: Head-first Fine-tuning And Chatgpt Data Generation For Cross-lingual Learning In Tweet Intimacy Prediction Michail Andrianos, Konstantinou Stefanos, Clematide Simon
- Think Twice: Measuring The Efficiency Of Eliminating Prediction Shortcuts Of Question Answering Models Mikula Lukáš, Štefánik Michal, Petrovič Marek, Sojka Petr
- Attention Link: An Efficient Attention-based Low Resource Machine Translation Architecture Min Zeping
- PEFTT: Parameter-efficient Fine-tuning For Low-resource Tibetan Pre-trained Language Models Mingjun Zhou, Zhuoma Daiqing, Nuo Qun, Tashi Nyima
- PRE: Vision-language Prompt Learning With Reparameterization Encoder Minh Anh Pham Thi, Nguyen An Duc, Tzimiropoulos Georgios
- Beyond Scale: The Diversity Coefficient As A Data Quality Metric For Variability In Natural Language Data Miranda Brando, Lee Alycia, Sundar Sudharsan, Casasola Allison, Koyejo Sanmi
- TAP: Targeted Prompting For Task Adaptive Generation Of Textual Training Instances For Visual Classification Mirza M. Jehanzeb, Karlinsky Leonid, Lin Wei, Possegger Horst, Feris Rogerio, Bischof Horst
- LLM Aided Semi-supervision For Extractive Dialog Summarization Mishra Nishant, Sahu Gaurav, Calixto Iacer, Abu-hanna Ameen, Laradji Issam H.
- Synthetic Imitation Edit Feedback For Factual Alignment In Clinical Summarization Mishra Prakamya, Yao Zonghai, Chen Shuwei, Wang Beining, Mittal Rohan, Yu Hong
- An Emulator For Fine-tuning Large Language Models Using Small Language Models Mitchell Eric, Rafailov Rafael, Sharma Archit, Finn Chelsea, Manning Christopher D.
- Compositional Chain-of-thought Prompting For Large Multimodal Models Mitra Chancharik, Huang Brandon, Darrell Trevor, Herzig Roei
- Convgqr: Generative Query Reformulation For Conversational Search Mo Fengran, Mao Kelong, Zhu Yutao, Wu Yihong, Huang Kaiyu, Nie Jian-yun
- How Trustworthy Are Open-source Llms? An Assessment Under Malicious Demonstrations Shows Their Vulnerabilities Mo Lingbo, Wang Boshi, Chen Muhao, Sun Huan
- Landmark Attention: Random-access Infinite Context Length For Transformers Mohtashami Amirkeivan, Jaggi Martin
- Increasing The Performance Of Cognitively Inspired Data-efficient Language Models Via Implicit Structure Building Momen Omar, Arps David, Kallmeyer Laura
- Evaluating Cognitive Maps And Planning In Large Language Models With Cogeval Momennejad Ida, Hasanbeig Hosein, Vieira Felipe, Sharma Hiteshi, Ness Robert Osazuwa, Jojic Nebojsa, Palangi Hamid, Larson Jonathan
- Adaptive Machine Translation With Large Language Models Moslem Yasmin, Haque Rejwanul, Kelleher John D., Way Andy
- Fine-tuning Large Language Models For Adaptive Machine Translation Moslem Yasmin, Haque Rejwanul, Way Andy
- Gradient-based Automated Iterative Recovery For Parameter-efficient Tuning Mozes Maximilian, Bolukbasi Tolga, Yuan Ann, Liu Frederick, Thain Nithum, Dixon Lucas
- Learning To Compress Prompts With Gist Tokens Mu Jesse, Li Xiang Lisa, Goodman Noah
- Can Llms Follow Simple Rules? Mu Norman, Chen Sarah, Wang Zifan, Chen Sizhe, Karamardian David, Aljeraisy Lulwa, Alomair Basel, Hendrycks Dan, Wagner David
- Navigating Prompt Complexity For Zero-shot Classification: A Study Of Large Language Models In Computational Social Science Mu Yida, Wu Ben P., Thorne William, Robinson Ambrose, Aletras Nikolaos, Scarton Carolina, Bontcheva Kalina, Song Xingyi
- How To Plant Trees In Language Models: Data And Architectural Effects On The Emergence Of Syntactic Inductive Biases Mueller Aaron, Linzen Tal
- In-context Learning Generalizes, But Not Always Robustly: The Case Of Syntax Mueller Aaron, Webson Albert, Petty Jackson, Linzen Tal
- Assessing Translation Capabilities Of Large Language Models Involving English And Indian Languages Mujadia Vandan, Urlana Ashok, Bhaskar Yash, Pavani Penumalla Aditya, Shravya Kukkapalli, Krishnamurthy Parameswari, Sharma Dipti Misra
- "medium" Lms Of Code In The Era Of Llms: Lessons From Stackoverflow Mukherjee Manisha, Hellendoorn Vincent J.
- Orca: Progressive Learning From Complex Explanation Traces Of GPT-4 Mukherjee Subhabrata, Mitra Arindam, Jawahar Ganesh, Agarwal Sahaj, Palangi Hamid, Awadallah Ahmed
- Superhf: Supervised Iterative Learning From Human Feedback Mukobi Gabriel, Chatain Peter, Fong Su, Windesheim Robert, Kutyniok Gitta, Bhatia Kush, Alberti Silas
- A Brief History Of Prompt: Leveraging Language Models. (through Advanced Prompting) Muktadir Golam Md
- Evaluating The Effectiveness Of Retrieval-augmented Large Language Models In Scientific Document Reasoning Munikoti Sai, Acharya Anurag, Wagle Sridevi, Horawalavithana Sameera
- LLM4VV: Developing Llm-driven Testsuite For Compiler Validation Munley Christian, Jarmusch Aaron, Chandrasekaran Sunita
- Ai-assisted Code Authoring At Scale: Fine-tuning, Deploying, And Mixed Methods Evaluation Murali Vijayaraghavan, Maddila Chandra, Ahmad Imad, Bolin Michael, Cheng Daniel, Ghorbani Negar, Fernandez Renuka, Nagappan Nachiappan, Rigby Peter C.
- PEARL: Personalizing Large Language Model Writing Assistants With Generation-calibrated Retrievers Mysore Sheshera, Lu Zhuoran, Wan Mengting, Yang Longqi, Menezes Steve, Baghaee Tina, Gonzalez Emmanuel Barajas, Neville Jennifer, Safavi Tara
- Large Language Model Augmented Narrative Driven Recommendations Mysore Sheshera, Mccallum Andrew, Zamani Hamed
- Teaching Probabilistic Logical Reasoning To Transformers Nafar Aliakbar, Venable Kristen Brent, Kordjamshidi Parisa
- Scalable Extraction Of Training Data From (production) Language Models Nasr Milad, Carlini Nicholas, Hayase Jonathan, Jagielski Matthew, Cooper A. Feder, Ippolito Daphne, Choquette-choo Christopher A., Wallace Eric, Tramèr Florian, Lee Katherine
- Reinforcement Replaces Supervision: Query Focused Summarization Using Deep Reinforcement Learning Nath Swaroop, Khadilkar Harshad, Bhattacharyya Pushpak
- Towards Probing Contact Center Large Language Models Nathan Varun, Kumar Ayush, Ingle Digvijay, Vepa Jithendra
- A Comprehensive Overview Of Large Language Models Naveed Humza, Khan Asad Ullah, Qiu Shi, Saqib Muhammad, Anwar Saeed, Usman Muhammad, Akhtar Naveed, Barnes Nick, Mian Ajmal
- Nanot5: A Pytorch Framework For Pre-training And Fine-tuning T5-style Models With Limited Resources Nawrot Piotr
- Exploring The Maze Of Multilingual Modeling Nezhad Sina Bagheri, Agrawal Ameeta
- Simplyretrieve: A Private And Lightweight Retrieval-centric Generative AI Tool Ng Youyang, Miyashita Daisuke, Hoshi Yasuto, Morioka Yasuhiro, Torii Osamu, Kodama Tomoya, Deguchi Jun
- Multi-dimensional Data Refining Strategy For Effective Fine-tuning Llms Ngoc Thanh Nguyen, Tran Quang Nhat, Tang Arthur, Nguyen Bao, Nguyen Thuy, Pham Thanh
- Meet In The Middle: A New Pre-training Paradigm Nguyen Anh, Karampatziakis Nikos, Chen Weizhu
- Black-box Analysis: Gpts Across Time In Legal Textual Entailment Task Nguyen Ha-thanh, Goebel Randy, Toni Francesca, Stathis Kostas, Satoh Ken
- Culturax: A Cleaned, Enormous, And Multilingual Dataset For Large Language Models In 167 Languages Nguyen Thuat, Van Nguyen Chien, Lai Viet Dac, Man Hieu, Ngo Nghia Trung, Dernoncourt Franck, Rossi Ryan A., Nguyen Thien Huu
- Democratizing Llms For Low-resource Languages By Leveraging Their English Dominant Abilities With Linguistically-diverse Prompts Nguyen Xuan-phi, Aljunied Sharifah Mahani, Joty Shafiq, Bing Lidong
- Seallms -- Large Language Models For Southeast Asia Nguyen Xuan-phi, Zhang Wenxuan, Li Xin, Aljunied Mahani, Hu Zhiqiang, Shen Chenhui, Chia Yew Ken, Li Xingxuan, Wang Jianyu, Tan Qingyu, Cheng Liying, Chen Guanzheng, Deng Yue, Yang Sen, Liu Chaoqun, Zhang Hang, Bing Lidong
- L2ceval: Evaluating Language-to-code Generation Capabilities Of Large Language Models Ni Ansong, Yin Pengcheng, Zhao Yilun, Riddell Martin, Feng Troy, Shen Rui, Yin Stephen, Liu Ye, Yavuz Semih, Xiong Caiming, Joty Shafiq, Zhou Yingbo, Radev Dragomir, Cohan Arman
- Forgetting Before Learning: Utilizing Parametric Arithmetic For Knowledge Updating In Large Language Models Ni Shiwen, Chen Dingwei, Li Chengming, Hu Xiping, Xu Ruifeng, Yang Min
- Code-style In-context Learning For Knowledge-based Question Answering Nie Zhijie, Zhang Richong, Wang Zhongyuan, Liu Xudong
- An Experimental Study: Assessing The Combined Framework Of Wavlm And BEST-RQ For Text-to-speech Synthesis Nielson Via, Hillis Steven
- Codegen2: Lessons For Training Llms On Programming And Natural Languages Nijkamp Erik, Hayashi Hiroaki, Xiong Caiming, Savarese Silvio, Zhou Yingbo
- Gpt4image: Can Large Pre-trained Models Help Vision Models On Perception Tasks? Ning Ding, Yehui Tang, Zhongqian Fu, Chao Xu, Kai Han, Yunhe Wang
- Reflexion: Language Agents With Verbal Reinforcement Learning Noah Shinn, Federico Cassano, Edward Berman, Ashwin Gopinath, Karthik Narasimhan, Shunyu Yao
- Evaluating AI Vocational Skills Through Professional Testing Noever David, Ciolino Matt
- AI Text-to-behavior: A Study In Steerability Noever David, Hyams Sam
- On The Unexpected Abilities Of Large Language Models Nolfi Stefano
- Capabilities Of GPT-4 On Medical Challenge Problems Nori Harsha, King Nicholas, Mckinney Scott Mayer, Carignan Dean, Horvitz Eric
- Evaluating GPT-3.5 And GPT-4 Models On Brazilian University Admission Exams Nunes Desnes, Primi Ricardo, Pires Ramon, Lotufo Roberto, Nogueira Rodrigo
- Contrastive Decoding Improves Reasoning In Large Language Models O'brien Sean, Lewis Mike
- Adversarial Fine-tuning Of Language Models: An Iterative Optimisation Approach For The Generation And Detection Of Problematic Content O'neill Charles, Miller Jack, Ciuca Ioana, Ting Yuan-sen, Bui Thang
- Steering Language Generation: Harnessing Contrastive Expert Guidance And Negative Prompting For Coherent And Diverse Synthetic Data Generation O'neill Charles, Ting Yuan-sen, Ciuca Ioana, Miller Jack, Bui Thang
- In-contextual Gender Bias Suppression For Large Language Models Oba Daisuke, Kaneko Masahiro, Bollegala Danushka
- Prodigy: A Profile-based Dialogue Generation Dataset Occhipinti Daniela, Tekiroglu Serra Sinem, Guerini Marco
- Blackvip: Black-box Visual Prompting For Robust Transfer Learning Oh Changdae, Hwang Hyeji, Lee Hee-young, Lim Yongtaek, Jung Geunyoung, Jung Jiyoung, Choi Hosik, Song Kyungwoo
- Data Augmentation For Neural Machine Translation Using Generative Language Model Oh Seokjin, Lee Su Ah, Jung Woohwan
- Deepthought: An Architecture For Autonomous Self-motivated Systems Oliveira Arlindo L., Domingos Tiago, Figueiredo Mário, Lima Pedro U.
- Large Language Models Vote: Prompting For Rare Disease Identification Oniani David, Hilsman Jordan, Dong Hang, Gao Fengyi, Verma Shiven, Wang Yanshan
- GPT-4 Technical Report Openai, Josh Achiam, Steven Adler, Sandhini Agarwal, Lama Ahmad, Ilge Akkaya, Florencia Leoni Aleman, Diogo Almeida, Janko Altenschmidt, Sam Altman, Shyamal Anadkat, Red Avila, Igor Babuschkin, Suchir Balaji, Valerie Balcom, Paul Baltescu, Haiming Bao, Mohammad Bavarian, Jeff Belgum, Irwan Bello, Jake Berdine, Gabriel Bernadett-shapiro, Christopher Berner, Lenny Bogdonoff, Oleg Boiko, Madelaine Boyd, Anna-luisa Brakman, Greg Brockman, Tim Brooks, Miles Brundage, Kevin Button, Trevor Cai, Rosie Campbell, Andrew Cann, Brittany Carey, Chelsea Carlson, Rory Carmichael, Brooke Chan, Che Chang, Fotis Chantzis, Derek Chen, Sully Chen, Ruby Chen, Jason Chen, Mark Chen, Ben Chess, Chester Cho, Casey Chu, Hyung Won Chung, Dave Cummings, Jeremiah Currier, Yunxing Dai, Cory Decareaux, Thomas Degry, Noah Deutsch, Damien Deville, Arka Dhar, David Dohan, Steve Dowling, Sheila Dunning, Adrien Ecoffet, Atty Eleti, Tyna Eloundou, David Farhi, Liam Fedus, Niko Felix, Simón Posada Fishman, Juston Forte, Isabella Fulford, Leo Gao, Elie Georges, Christian Gibson, Vik Goel, Tarun Gogineni, Gabriel Goh, Rapha Gontijo-lopes, Jonathan Gordon, Morgan Grafstein, Scott Gray, Ryan Greene, Joshua Gross, Shixiang Shane Gu, Yufei Guo, Chris Hallacy, Jesse Han, Jeff Harris, Yuchen He, Mike Heaton, Johannes Heidecke, Chris Hesse, Alan Hickey, Wade Hickey, Peter Hoeschele, Brandon Houghton, Kenny Hsu, Shengli Hu, Xin Hu, Joost Huizinga, Shantanu Jain, Shawn Jain, Joanne Jang, Angela Jiang, Roger Jiang, Haozhun Jin, Denny Jin, Shino Jomoto, Billie Jonn, Heewoo Jun, Tomer Kaftan, Łukasz Kaiser, Ali Kamali, Ingmar Kanitscheider, Nitish Shirish Keskar, Tabarak Khan, Logan Kilpatrick, Jong Wook Kim, Christina Kim, Yongjik Kim, Jan Hendrik Kirchner, Jamie Kiros, Matt Knight, Daniel Kokotajlo, Łukasz Kondraciuk, Andrew Kondrich, Aris Konstantinidis, Kyle Kosic, Gretchen Krueger, Vishal Kuo, Michael Lampe, Ikai Lan, Teddy Lee, Jan Leike, Jade Leung, Daniel Levy, Chak Ming Li, Rachel Lim, Molly Lin, Stephanie Lin, Mateusz Litwin, Theresa Lopez, Ryan Lowe, Patricia Lue, Anna Makanju, Kim Malfacini, Sam Manning, Todor Markov, Yaniv Markovski, Bianca Martin, Katie Mayer, Andrew Mayne, Bob Mcgrew, Scott Mayer Mckinney, Christine Mcleavey, Paul Mcmillan, Jake Mcneil, David Medina, Aalok Mehta, Jacob Menick, Luke Metz, Andrey Mishchenko, Pamela Mishkin, Vinnie Monaco, Evan Morikawa, Daniel Mossing, Tong Mu, Mira Murati, Oleg Murk, David Mély, Ashvin Nair, Reiichiro Nakano, Rajeev Nayak, Arvind Neelakantan, Richard Ngo, Hyeonwoo Noh, Long Ouyang, Cullen O'keefe, Jakub Pachocki, Alex Paino, Joe Palermo, Ashley Pantuliano, Giambattista Parascandolo, Joel Parish, Emy Parparita, Alex Passos, Mikhail Pavlov, Andrew Peng, Adam Perelman, Filipe De Avila Belbute Peres, Michael Petrov, Henrique Ponde De Oliveira Pinto, Michael Pokorny, Michelle Pokrass, Vitchyr H. Pong, Tolly Powell, Alethea Power, Boris Power, Elizabeth Proehl, Raul Puri, Alec Radford, Jack Rae, Aditya Ramesh, Cameron Raymond, Francis Real, Kendra Rimbach, Carl Ross, Bob Rotsted, Henri Roussez, Nick Ryder, Mario Saltarelli, Ted Sanders, Shibani Santurkar, Girish Sastry, Heather Schmidt, David Schnurr, John Schulman, Daniel Selsam, Kyla Sheppard, Toki Sherbakov, Jessica Shieh, Sarah Shoker, Pranav Shyam, Szymon Sidor, Eric Sigler, Maddie Simens, Jordan Sitkin, Katarina Slama, Ian Sohl, Benjamin Sokolowsky, Yang Song, Natalie Staudacher, Felipe Petroski Such, Natalie Summers, Ilya Sutskever, Jie Tang, Nikolas Tezak, Madeleine B. Thompson, Phil Tillet, Amin Tootoonchian, Elizabeth Tseng, Preston Tuggle, Nick Turley, Jerry Tworek, Juan Felipe Cerón Uribe, Andrea Vallone, Arun Vijayvergiya, Chelsea Voss, Carroll Wainwright, Justin Jay Wang, Alvin Wang, Ben Wang, Jonathan Ward, Jason Wei, Cj Weinmann, Akila Welihinda, Peter Welinder, Jiayi Weng, Lilian Weng, Matt Wiethoff, Dave Willner, Clemens Winter, Samuel Wolrich, Hannah Wong, Lauren Workman, Sherwin Wu, Jeff Wu, Michael Wu, Kai Xiao, Tao Xu, Sarah Yoo, Kevin Yu, Qiming Yuan, Wojciech Zaremba, Rowan Zellers, Chong Zhang, Marvin Zhang, Shengjia Zhao, Tianhao Zheng, Juntang Zhuang, William Zhuk, Barret Zoph
- Thoughtsource: A Central Hub For Large Language Model Reasoning Data Ott Simon, Hebenstreit Konstantin, Liévin Valentin, Hother Christoffer Egeberg, Moradi Milad, Mayrhauser Maximilian, Praas Robert, Winther Ole, Samwald Matthias
- Fine-tuning Or Retrieval? Comparing Knowledge Injection In Llms Ovadia Oded, Brief Menachem, Mishaeli Moshik, Elisha Oren
- Tokenization Matters: Navigating Data-scarce Tokenization For Gender Inclusive Language Technologies Ovalle Anaelia, Mehrabi Ninareh, Goyal Palash, Dhamala Jwala, Chang Kai-wei, Zemel Richard, Galstyan Aram, Pinter Yuval, Gupta Rahul
- On The Role Of Attention In Prompt-tuning Oymak Samet, Rawat Ankit Singh, Soltanolkotabi Mahdi, Thrampoulidis Christos
- Understanding The Capabilities Of Large Language Models For Automated Planning Pallagani Vishal, Muppasani Bharath, Murugesan Keerthiram, Rossi Francesca, Srivastava Biplav, Horesh Lior, Fabiano Francesco, Loreggia Andrea
- What In-context Learning "learns" In-context: Disentangling Task Recognition And Task Learning Pan Jane, Gao Tianyu, Chen Howard, Chen Danqi
- Stelocoder: A Decoder-only LLM For Multi-language To Python Code Translation Pan Jialing, Sadé Adrien, Kim Jin, Soriano Eric, Sole Guillem, Flamant Sylvain
- Smoothquant+: Accurate And Efficient 4-bit Post-training Weightquantization For LLM Pan Jiayi, Wang Chengcan, Zheng Kaifu, Li Yangguang, Wang Zhenyu, Feng Bin
- Retrieving-to-answer: Zero-shot Video Question Answering With Frozen Large Language Models Pan Junting, Lin Ziyi, Ge Yuying, Zhu Xiatian, Zhang Renrui, Wang Yi, Qiao Yu, Li Hongsheng
- Self-supervised Meta-prompt Learning With Meta-gradient Regularization For Few-shot Generalization Pan Kaihang, Li Juncheng, Song Hongye, Lin Jun, Liu Xiaozhong, Tang Siliang
- Automatically Correcting Large Language Models: Surveying The Landscape Of Diverse Self-correction Strategies Pan Liangming, Saxon Michael, Xu Wenda, Nathani Deepak, Wang Xinyi, Wang William Yang
- A Preliminary Evaluation Of Chatgpt For Zero-shot Dialogue Understanding Pan Wenbo, Chen Qiguang, Xu Xiao, Che Wanxiang, Qin Libo
- Tppoet: Transformer-based Persian Poem Generation Using Minimal Data And Advanced Decoding Techniques Panahandeh Amir, Asemi Hanie, Nourani Esmaeil
- Semantic Composition In Visually Grounded Language Models Pandey Rohan
- Peft-medaware: Large Language Model For Medical Awareness Pandya Keivalya
- Automating Customer Service Using Langchain: Building Custom Open-source GPT Chatbot For Organizations Pandya Keivalya, Holia Mehfuza
- Language Model Self-improvement By Reinforcement Learning Contemplation Pang Jing-cheng, Wang Pengyuan, Li Kaiyuan, Chen Xiong-hui, Xu Jiacheng, Zhang Zongzhang, Yu Yang
- Frozen Transformers In Language Models Are Effective Visual Encoder Layers Pang Ziqi, Xie Ziyang, Man Yunze, Wang Yu-xiong
- Trainable Transformer In Transformer Panigrahi Abhishek, Malladi Sadhika, Xia Mengzhou, Arora Sanjeev
- Injecting Structural Hints: Using Language Models To Study Inductive Biases In Language Learning Papadimitriou Isabel, Jurafsky Dan
- Clip-guided Vision-language Pre-training For Question Answering In 3D Scenes Parelli Maria, Delitzas Alexandros, Hars Nikolas, Vlassis Georgios, Anagnostidis Sotirios, Bachmann Gregor, Hofmann Thomas
- Interpretable Visual Question Answering Via Reasoning Supervision Parelli Maria, Mallis Dimitrios, Diomataris Markos, Pitsikalis Vassilis
- Localized Symbolic Knowledge Distillation For Visual Commonsense Models Park Jae Sung, Hessel Jack, Chandu Khyathi Raghavi, Liang Paul Pu, Lu Ximing, West Peter, Yu Youngjae, Huang Qiuyuan, Gao Jianfeng, Farhadi Ali, Choi Yejin
- On The Analysis Of Cross-lingual Prompt Tuning For Decoder-based Multilingual Model Park Nohil, Park Joonsuk, Yoo Kang Min, Yoon Sungroh
- A Large Language Model Approach To Educational Survey Feedback Analysis Parker Michael J., Anderson Caitlin, Stone Claire, Oh Yearim
- Entity Matching Using Large Language Models Peeters Ralph, Bizer Christian
- GPT Self-supervision For A Better Data Annotator Pei Xiaohuan, Li Yanxi, Xu Chang
- Evaluation Of Medium-large Language Models At Zero-shot Closed Book Generative Question Answering Peinl René, Wirth Johannes
- Exploiting Novel GPT-4 Apis Pelrine Kellin, Taufeeque Mohammad, Zając Michał, Mclean Euan, Gleave Adam
- Instruction Tuning With GPT-4 Peng Baolin, Li Chunyuan, He Pengcheng, Galley Michel, Gao Jianfeng
- Stabilizing RLHF Through Advantage Model And Selective Rehearsal Peng Baolin, Song Linfeng, Tian Ye, Jin Lifeng, Mi Haitao, Yu Dong
- Towards Efficient And Effective Adaptation Of Large Language Models For Sequential Recommendation Peng Bo, Burns Ben, Chen Ziqi, Parthasarathy Srinivasan, Ning Xia
- Yarn: Efficient Context Window Extension Of Large Language Models Peng Bowen, Quesnelle Jeffrey, Fan Honglu, Shippole Enrico
- Generative Large Language Models Are All-purpose Text Analytics Engines: Text-to-text Learning Is All Your Need Peng Cheng, Yang Xi, Chen Aokun, Yu Zehao, Smith Kaleb E, Costa Anthony B, Flores Mona G, Bian Jiang, Wu Yonghui
- When Does In-context Learning Fall Short And Why? A Study On Specification-heavy Tasks Peng Hao, Wang Xiaozhi, Chen Jianhui, Li Weikai, Qi Yunjia, Wang Zimu, Wu Zhili, Zeng Kaisheng, Xu Bin, Hou Lei, Li Juanzi
- FP8-LM: Training FP8 Large Language Models Peng Houwen, Wu Kan, Wei Yixuan, Zhao Guoshuai, Yang Yuxiang, Liu Ze, Xiong Yifan, Yang Ziyue, Ni Bolin, Hu Jingcheng, Li Ruihang, Zhang Miaosen, Li Chen, Ning Jia, Wang Ruizhe, Zhang Zheng, Liu Shuguang, Chau Joe, Hu Han, Cheng Peng
- Pre-train, Prompt And Recommendation: A Comprehensive Survey Of Language Modelling Paradigm Adaptations In Recommender Systems Peng Liu, Lemei Zhang, Jon Atle Gulla
- Reproducing Whisper-style Training Using An Open-source Toolkit And Publicly Available Data Peng Yifan, Tian Jinchuan, Yan Brian, Berrebbi Dan, Chang Xuankai, Li Xinjian, Shi Jiatong, Arora Siddhant, Chen William, Sharma Roshan, Zhang Wangyou, Sudo Yui, Shakeel Muhammad, Jung Jee-weon, Maiti Soumi, Watanabe Shinji
- Soft Prompt Tuning For Augmenting Dense Retrieval With Large Language Models Peng Zhiyuan, Wu Xuyang, Wang Qifan, Fang Yi
- Large Language Models Can Infer Psychological Dispositions Of Social Media Users Peters Heinrich, Matz Sandra
- Constitutionmaker: Interactively Critiquing Large Language Models By Converting Feedback Into Principles Petridis Savvas, Wedin Ben, Wexler James, Donsbach Aaron, Pushkarna Mahima, Goyal Nitesh, Cai Carrie J., Terry Michael
- When Do Prompting And Prefix-tuning Work? A Theory Of Capabilities And Limitations Petrov Aleksandar, Torr Philip H. S., Bibi Adel
- Recjpq: Training Large-catalogue Sequential Recommenders Petrov Aleksandr V., Macdonald Craig
- The Impact Of Depth On Compositional Generalization In Transformer Language Models Petty Jackson, Van Steenkiste Sjoerd, Dasgupta Ishita, Sha Fei, Garrette Dan, Linzen Tal
- Attention Sorting Combats Recency Bias In Long Context Language Models Peysakhovich Alexander, Lerer Adam
- Training Chain-of-thought Via Latent-variable Inference Phan Du, Hoffman Matthew D., Dohan David, Douglas Sholto, Le Tuan Anh, Parisi Aaron, Sountsov Pavel, Sutton Charles, Vikram Sharad, Saurous Rif A.
- LLM Self Defense: By Self Examination, Llms Know They Are Being Tricked Phute Mansi, Helbling Alec, Hull Matthew, Peng Shengyun, Szyller Sebastian, Cornelius Cory, Chau Duen Horng
- Typhoon: Thai Large Language Models Pipatanakul Kunat, Jirabovonvisut Phatrasek, Manakul Potsawee, Sripaisarnmongkol Sittipong, Patomwong Ruangsak, Chokchainant Pathomporn, Tharnpipitchai Kasima
- Sabiá: Portuguese Large Language Models Pires Ramon, Abonizio Hugo, Almeida Thales Sales, Nogueira Rodrigo
- Hyena Hierarchy: Towards Larger Convolutional Language Models Poli Michael, Massaroli Stefano, Nguyen Eric, Fu Daniel Y., Dao Tri, Baccus Stephen, Bengio Yoshua, Ermon Stefano, Ré Christopher
- Mosaicbert: A Bidirectional Encoder Optimized For Fast Pretraining Portes Jacob, Trott Alex, Havens Sam, King Daniel, Venigalla Abhinav, Nadeem Moin, Sardana Nikhil, Khudia Daya, Frankle Jonathan
- Painter: Teaching Auto-regressive Language Models To Draw Sketches Pourreza Reza, Bhattacharyya Apratim, Panchal Sunny, Lee Mingu, Madan Pulkit, Memisevic Roland
- Egovlpv2: Egocentric Video-language Pre-training With Fusion In The Backbone Pramanick Shraman, Song Yale, Nag Sayan, Lin Kevin Qinghong, Shah Hardik, Shou Mike Zheng, Chellappa Rama, Zhang Pengchuan
- Rephrase, Augment, Reason: Visual Grounding Of Questions For Vision-language Models Prasad Archiki, Stengel-eskin Elias, Bansal Mohit
- Epik-eval: Evaluation For Language Models As Epistemic Models Prato Gabriele, Huang Jerry, Parthasarathi Prasanna, Sodhani Shagun, Chandar Sarath
- Automatic Prompt Optimization With "gradient Descent" And Beam Search Pryzant Reid, Iter Dan, Li Jerry, Lee Yin Tat, Zhu Chenguang, Zeng Michael
- Vidcom: Fast Video Comprehension Through Large Language Models With Multimodal Tools Qi Ji, Ji Kaixuan, Yu Jifan, Wang Duokang, Xu Bin, Hou Lei, Li Juanzi
- Fine-tuning Aligned Language Models Compromises Safety, Even When Users Do Not Intend To! Qi Xiangyu, Zeng Yi, Xie Tinghao, Chen Pin-yu, Jia Ruoxi, Mittal Prateek, Henderson Peter
- PILLOW: Enhancing Efficient Instruction Fine-tuning Via Prompt Matching Qi Zhenting, Tan Xiaoyu, Shi Shaojie, Qu Chao, Xu Yinghui, Qi Yuan
- "merge Conflicts!" Exploring The Impacts Of External Distractors To Parametric Knowledge Graphs Qian Cheng, Zhao Xinran, Wu Sherry Tongshuang
- OPERA: Alleviating Hallucination In Multi-modal Large Language Models Via Over-trust Penalty And Retrospection-allocation Qidong Huang, Xiaoyi Dong, Pan Zhang, Bin Wang, Conghui He, Jiaqi Wang, Dahua Lin, Weiming Zhang, Nenghai Yu
- Hallucidoctor: Mitigating Hallucinatory Toxicity In Visual Instruction Data Qifan Yu, Juncheng Li, Longhui Wei, Liang Pang, Wentao Ye, Bosheng Qin, Siliang Tang, Qi Tian, Yueting Zhuang
- ONCE: Boosting Content-based Recommendation With Both Open- And Closed-source Large Language Models Qijiong Liu, Nuo Chen, Tetsuya Sakai, Xiao-ming Wu
- Enabling On-device Large Language Model Personalization With Self-supervised Data Selection And Synthesis Qin Ruiyang, Xia Jun, Jia Zhenge, Jiang Meng, Abbasi Ahmed, Zhou Peipei, Hu Jingtong, Shi Yiyu
- Toolllm: Facilitating Large Language Models To Master 16000+ Real-world Apis Qin Yujia, Liang Shihao, Ye Yining, Zhu Kunlun, Yan Lan, Lu Yaxi, Lin Yankai, Cong Xin, Tang Xiangru, Qian Bill, Zhao Sihan, Hong Lauren, Tian Runchu, Xie Ruobing, Zhou Jie, Gerstein Mark, Li Dahai, Liu Zhiyuan, Sun Maosong
- Federated Full-parameter Tuning Of Billion-sized Language Models With Communication Cost Under 18 Kilobytes Qin Zhen, Chen Daoyuan, Qian Bingchen, Ding Bolin, Li Yaliang, Deng Shuiguang
- Transnormerllm: A Faster And Better Large Language Model With Improved Transnormer Qin Zhen, Li Dong, Sun Weigao, Sun Weixuan, Shen Xuyang, Han Xiaodong, Wei Yunshen, Lv Baohong, Luo Xiao, Qiao Yu, Zhong Yiran
- Mplug-owl: Modularization Empowers Large Language Models With Multimodality Qinghao Ye, Haiyang Xu, Guohai Xu, Jiabo Ye, Ming Yan, Yiyang Zhou, Junyang Wang, Anwen Hu, Pengcheng Shi, Yaya Shi, Chenliang Li, Yuanhong Xu, Hehong Chen, Junfeng Tian, Qi Qian, Ji Zhang, Fei Huang, Jingren Zhou
- Adalora: Adaptive Budget Allocation For Parameter-efficient Fine-tuning Qingru Zhang, Minshuo Chen, Alexander Bukharin, Nikos Karampatziakis, Pengcheng He, Yu Cheng, Weizhu Chen, Tuo Zhao
- Embodied Executable Policy Learning With Language-based Scene Summarization Qiu Jielin, Xu Mengdi, Han William, Moon Seungwhan, Zhao Ding
- Are Large Language Models Temporally Grounded? Qiu Yifu, Zhao Zheng, Ziser Yftah, Korhonen Anna, Ponti Edoardo M., Cohen Shay B.
- Unlocking Emergent Modularity In Large Language Models Qiu Zihan, Huang Zeyu, Fu Jie
- Cmed-gpt: Prompt Tuning For Entity-aware Chinese Medical Dialogue Generation Qu Zhijie, Li Juan, Ma Zerui, Li Jianqiang
- Direct Preference Optimization: Your Language Model Is Secretly A Reward Model Rafael Rafailov, Archit Sharma, Eric Mitchell, Stefano Ermon, Christopher D. Manning, Chelsea Finn
- Exploring The Effectiveness Of GPT Models In Test-taking: A Case Study Of The Driver's License Knowledge Test Rahimi Saba, Balch Tucker, Veloso Manuela
- Mixed-distil-bert: Code-mixed Language Modeling For Bangla, English, And Hindi Raihan Md Nishat, Goswami Dhiman, Mahmud Antara
- Improving Generalization In Task-oriented Dialogues With Workflows And Action Plans Raimondo Stefania, Pal Christopher, Liu Xiaotian, Vazquez David, Palacios Hector
- Bridging The Gap: Exploring The Capabilities Of Bridge-architectures For Complex Visual Reasoning Tasks Rajesh Kousik, Raman Mrigank, Karim Mohammed Asad, Chawla Pranit
- Multi-task End-to-end Training Improves Conversational Recommendation Ram Naveen, Kuzmin Dima, Chio Ellie Ka In, Alzantot Moustafa Farid, Ontanon Santiago, Jash Ambarish, Li Judith Yue
- Mabvit -- Modified Attention Block Enhances Vision Transformers Ramesh Mahesh, Ramkumar Aswinkumar
- Controllable Generation Of Dialogue Acts For Dialogue Systems Via Few-shot Response Generation And Ranking Ramirez Angela, Agarwal Karik, Juraska Juraj, Garg Utkarsh, Walker Marilyn A.
- Tailoring Self-rationalizers With Multi-reward Distillation Ramnath Sahana, Joshi Brihi, Hallinan Skyler, Lu Ximing, Li Liunian Harold, Chan Aaron, Hessel Jack, Choi Yejin, Ren Xiang
- Aligning Neural Machine Translation Models: Human Feedback In Training And Inference Ramos Miguel Moura, Fernandes Patrick, Farinhas António, Martins André F. T.
- Retrieval-augmented Image Captioning Ramos Rita, Elliott Desmond, Martins Bruno
- Lmcap: Few-shot Multilingual Image Captioning By Retrieval Augmented Language Model Prompting Ramos Rita, Martins Bruno, Elliott Desmond
- Empowering Multi-step Reasoning Across Languages Via Tree-of-thoughts Ranaldi Leonardo, Pucci Giulia, Ranaldi Federico, Ruzzetti Elena Sofia, Zanzotto Fabio Massimo
- Precog: Exploring The Relation Between Memorization And Performance In Pre-trained Language Models Ranaldi Leonardo, Ruzzetti Elena Sofia, Zanzotto Fabio Massimo
- Retrieval-based Knowledge Augmented Vision Language Pre-training Rao Jiahua, Shan Zifei, Liu Longpo, Zhou Yao, Yang Yuedong
- Weakly Supervised Detection Of Hallucinations In LLM Activations Rateike Miriam, Cintas Celia, Wamburu John, Akumu Tanya, Speakman Skyler
- COLA: A Benchmark For Compositional Text-to-image Retrieval Ray Arijit, Radenovic Filip, Dubey Abhimanyu, Plummer Bryan A., Krishna Ranjay, Saenko Kate
- Dial Beinfo For Faithfulness: Improving Factuality Of Information-seeking Dialogue Via Behavioural Fine-tuning Razumovskaia Evgeniia, Vulić Ivan, Marković Pavle, Cichy Tomasz, Zheng Qian, Wen Tsung-hsien, Budzianowski Paweł
- Nemo Guardrails: A Toolkit For Controllable And Safe LLM Applications With Programmable Rails Rebedea Traian, Dinu Razvan, Sreedhar Makesh, Parisien Christopher, Cohen Jonathan
- Fine-tuning Language Models For Context-specific SQL Query Generation Rebei Amine
- Prompt, Generate, Then Cache: Cascade Of Foundation Models Makes Strong Few-shot Learners Renrui Zhang, Xiangfei Hu, Bohao Li, Siyuan Huang, Hanqiu Deng, Hongsheng Li, Yu Qiao, Peng Gao
- I'm Afraid I Can't Do That: Predicting Prompt Refusal In Black-box Generative Language Models Reuter Max, Schulze William
- Sparq Attention: Bandwidth-efficient LLM Inference Ribar Luka, Chelombiev Ivan, Hudlass-galley Luke, Blake Charlie, Luschi Carlo, Orr Douglas
- Commonsense Reasoning For Conversational AI: A Survey Of The State Of The Art Richardson Christopher, Heck Larry
- SYNDICOM: Improving Conversational Commonsense With Error-injection And Natural Language Feedback Richardson Christopher, Sundar Anirudh, Heck Larry
- Data Contamination Through The Lens Of Time Roberts Manley, Thakur Himanshu, Herlihy Christine, White Colin, Dooley Samuel
- Bio-sieve: Exploring Instruction Tuning Large Language Models For Systematic Review Automation Robinson Ambrose, Thorne William, Wu Ben P., Pandor Abdullah, Essat Munira, Stevenson Mark, Song Xingyi
- Benchmarking Causal Study To Interpret Large Language Models For Source Code Rodriguez-cardenas Daniel, Palacio David N., Khati Dipin, Burke Henry, Poshyvanyk Denys
- Carexpert: Leveraging Large Language Models For In-car Conversational Question Answering Rony Md Rashad Al Hasan, Suess Christian, Bhat Sinchana Ramakanth, Sudhi Viju, Schneider Julia, Vogel Maximilian, Teucher Roman, Friedl Ken E., Sahoo Soumya
- Axiomatic Preference Modeling For Longform Question Answering Rosset Corby, Zheng Guoqing, Dibia Victor, Awadallah Ahmed, Bennett Paul
- LLM As An Art Director (ladi): Using Llms To Improve Text-to-media Generators Roush Allen, Zakirov Emil, Shirokov Artemiy, Lunina Polina, Gane Jack, Duffy Alexander, Basil Charlie, Whitcomb Aber, Benedetto Jim, Dewolfe Chris
- Journey Of Hallucination-minimized Generative AI Solutions For Financial Decision Makers Roychowdhury Sohini
- Retrieval-pretrained Transformer: Long-range Language Modeling With Self-retrieval Rubin Ohad, Berant Jonathan
- Outlier Dimensions Encode Task-specific Knowledge Rudman William, Chen Catherine, Eickhoff Carsten
- Gpt4tools: Teaching Large Language Model To Use Tools Via Self-instruction Rui Yang, Lin Song, Yanwei Li, Sijie Zhao, Yixiao Ge, Xiu Li, Ying Shan
- Catfood: Counterfactual Augmented Training For Improving Out-of-domain Performance And Calibration Sachdeva Rachneet, Tutek Martin, Gurevych Iryna
- Uncertainty Guided Global Memory Improves Multi-hop Question Answering Sagirova Alsu, Burtsev Mikhail
- Promptmix: A Class Boundary Augmentation Method For Large Language Model Distillation Sahu Gaurav, Vechtomova Olga, Bahdanau Dzmitry, Laradji Issam H.
- REINFOREST: Reinforcing Semantic Code Similarity For Cross-lingual Code Search Models Saieva Anthony, Chakraborty Saikat, Kaiser Gail
- UT5: Pretraining Non Autoregressive T5 With Unrolled Denoising Salem Mahmoud G., Ye Jiayu, Lin Chu-cheng, Liu Frederick
- Pre-training Multi-modal Dense Retrievers For Outside-knowledge Visual Question Answering Salemi Alireza, Rafiee Mahta, Zamani Hamed
- Zero-shot Translation Of Attention Patterns In VQA Models To Natural Language Salewski Leonard, Koepke A. Sophia, Lensch Hendrik P. A., Akata Zeynep
- Breaking The Silence: The Threats Of Using Llms In Software Engineering Sallou June, Durieux Thomas, Panichella Annibale
- Uni-nlx: Unifying Textual Explanations For Vision And Vision-language Tasks Sammani Fawaz, Deligiannis Nikos
- Large Language Models Are Competitive Near Cold-start Recommenders For Language- And Item-based Preferences Sanner Scott, Balog Krisztian, Radlinski Filip, Wedin Ben, Dixon Lucas
- Efficient RLHF: Reducing The Memory Usage Of PPO Santacroce Michael, Lu Yadong, Yu Han, Li Yuanzhi, Shen Yelong
- Early Weight Averaging Meets High Learning Rates For LLM Pre-training Sanyal Sunny, Neerkaje Atula, Kaddour Jean, Kumar Abhishek, Sanghavi Sujay
- Testing The General Deductive Reasoning Capacity Of Large Language Models Using OOD Examples Saparov Abulhair, Pang Richard Yuanzhe, Padmakumar Vishakh, Joshi Nitish, Kazemi Seyed Mehran, Kim Najoung, He He
- Beyond Chinchilla-optimal: Accounting For Inference In Language Model Scaling Laws Sardana Nikhil, Portes Jacob, Doubov Sasha, Frankle Jonathan
- Testing The Limits Of Unified Sequence To Sequence LLM Pretraining On Diverse Table Data Tasks Sarkar Soumajyoti, Lausen Leonard
- Interprompt: Interpretable Prompting For Interrelated Interpersonal Risk Factors In Reddit Posts Sathvik Msvpj, Sarkar Surjodeep, Saxena Chandni, Sohn Sunghwan, Garg Muskan
- GIFT: Generative Interpretable Fine-tuning Savadikar Chinmay, Song Xi, Wu Tianfu
- Pretraining On The Test Set Is All You Need Schaeffer Rylan
- Large Language Models Can Strategically Deceive Their Users When Put Under Pressure Scheurer Jérémy, Balesni Mikita, Hobbhahn Marius
- Toolformer: Language Models Can Teach Themselves To Use Tools Schick Timo, Dwivedi-yu Jane, Dessì Roberto, Raileanu Roberta, Lomeli Maria, Zettlemoyer Luke, Cancedda Nicola, Scialom Thomas
- Cross-lingual Supervision Improves Large Language Models Pre-training Schioppa Andrea, Garcia Xavier, Firat Orhan
- PULSAR At Mediqa-sum 2023: Large Language Models Augmented By Synthetic Dialogue Convert Patient Dialogues To Medical Records Schlegel Viktor, Li Hao, Wu Yuping, Subramanian Anand, Nguyen Thanh-tung, Kashyap Abhinav Ramesh, Beck Daniel, Zeng Xiaojun, Batista-navarro Riza Theresa, Winkler Stefan, Nenadic Goran
- An Empirical Evaluation Of Using Large Language Models For Automated Unit Test Generation Schäfer Max, Nadi Sarah, Eghbali Aryaz, Tip Frank
- Jais And Jais-chat: Arabic-centric Foundation And Instruction-tuned Open Generative Large Language Models Sengupta Neha, Sahu Sunil Kumar, Jia Bokang, Katipomu Satheesh, Li Haonan, Koto Fajri, Marshall William, Gosal Gurpreet, Liu Cynthia, Chen Zhiming, Afzal Osama Mohammed, Kamboj Samta, Pandit Onkar, Pal Rahul, Pradhan Lalit, Mujahid Zain Muhammad, Baali Massa, Han Xudong, Bsharat Sondos Mahmoud, Aji Alham Fikri, Shen Zhiqiang, Liu Zhengzhong, Vassilieva Natalia, Hestness Joel, Hock Andy, Feldman Andrew, Lee Jonathan, Jackson Andrew, Ren Hector Xuguang, Nakov Preslav, Baldwin Timothy, Xing Eric
- Personality Traits In Large Language Models Serapio-garcía Greg, Safdari Mustafa, Crepy Clément, Sun Luning, Fitz Stephen, Romero Peter, Abdulhai Marwa, Faust Aleksandra, Matarić Maja
- The Cot Collection: Improving Zero-shot And Few-shot Learning Of Language Models Via Chain-of-thought Fine-tuning Seungone Kim, Se June Joo, Doyoung Kim, Joel Jang, Seonghyeon Ye, Jamin Shin, Minjoon Seo
- Grounding Gaps In Language Model Generations Shaikh Omar, Gligorić Kristina, Khetan Ashna, Gerstgrasser Matthias, Yang Diyi, Jurafsky Dan
- PB-LLM: Partially Binarized Large Language Models Shang Yuzhang, Yuan Zhihang, Wu Qiang, Dong Zhen
- Towards Concept-aware Large Language Models Shani Chen, Vreeken Jilles, Shahaf Dafna
- One-shot Sensitivity-aware Mixed Sparsity Pruning For Large Language Models Shao Hang, Liu Bei, Xiao Bo, Zeng Ke, Wan Guanglu, Qian Yanmin
- Omniquant: Omnidirectionally Calibrated Quantization For Large Language Models Shao Wenqi, Chen Mengzhao, Zhang Zhaoyang, Xu Peng, Zhao Lirui, Li Zhiqian, Zhang Kaipeng, Gao Peng, Qiao Yu, Luo Ping
- Character-llm: A Trainable Agent For Role-playing Shao Yunfan, Li Linyang, Dai Junqi, Qiu Xipeng
- Human Choice Prediction In Language-based Persuasion Games: Simulation-based Off-policy Evaluation Shapira Eilam, Apel Reut, Tennenholtz Moshe, Reichart Roi
- Learning Non-linguistic Skills Without Sacrificing Linguistic Proficiency Sharma Mandar, Muralidhar Nikhil, Ramakrishnan Naren
- The Truth Is In There: Improving Reasoning In Language Models With Layer-selective Rank Reduction Sharma Pratyusha, Ash Jordan T., Misra Dipendra
- The Flan Collection: Designing Data And Methods For Effective Instruction Tuning Shayne Longpre, Le Hou, Tu Vu, Albert Webson, Hyung Won Chung, Yi Tay, Denny Zhou, Quoc V. Le, Barret Zoph, Jason Wei, Adam Roberts
- Exploring The Factual Consistency In Dialogue Comprehension Of Large Language Models She Shuaijie, Huang Shujian, Wang Xingyun, Zhou Yanke, Chen Jiajun
- Pangu-coder2: Boosting Large Language Models For Code With Ranking Feedback Shen Bo, Zhang Jiaxin, Chen Taihong, Zan Daoguang, Geng Bing, Fu An, Zeng Muhan, Yu Ailun, Ji Jichuan, Zhao Jingyang, Guo Yuenan, Wang Qianxiang
- Is Translation Helpful? An Empirical Analysis Of Cross-lingual Transfer In Low-resource Dialog Generation Shen Lei, Yu Shuai, Shen Xiaoyu
- The Trickle-down Impact Of Reward (in-)consistency On RLHF Shen Lingfeng, Chen Sihao, Song Linfeng, Jin Lifeng, Peng Baolin, Mi Haitao, Khashabi Daniel, Yu Dong
- Do Pretrained Transformers Learn In-context By Gradient Descent? Shen Lingfeng, Mishra Aayush, Khashabi Daniel
- Improving The Robustness Of Transformer-based Large Language Models With Dynamic Attention Shen Lujia, Pu Yuwen, Ji Shouling, Li Changjiang, Zhang Xuhong, Ge Chunpeng, Wang Ting
- Mixture-of-experts Meets Instruction Tuning: A Winning Combination For Large Language Models Shen Sheng, Hou Le, Zhou Yanqi, Du Nan, Longpre Shayne, Wei Jason, Chung Hyung Won, Zoph Barret, Fedus William, Chen Xinyun, Vu Tu, Wu Yuexin, Chen Wuyang, Webson Albert, Li Yunxuan, Zhao Vincent, Yu Hongkun, Keutzer Kurt, Darrell Trevor, Zhou Denny
- TOAST: Transfer Learning Via Attention Steering Shi Baifeng, Gai Siyu, Darrell Trevor, Wang Xin
- Specialist Or Generalist? Instruction Tuning For Specific NLP Tasks Shi Chufan, Su Yixuan, Yang Cheng, Yang Yujiu, Cai Deng
- Sotana: The Open-source Software Development Assistant Shi Ensheng, Zhang Fengji, Wang Yanlin, Chen Bei, Du Lun, Zhang Hongyu, Han Shi, Zhang Dongmei, Sun Hongbin
- QADYNAMICS: Training Dynamics-driven Synthetic QA Diagnostic For Zero-shot Commonsense Question Answering Shi Haochen, Wang Weiqi, Fang Tianqing, Xu Baixuan, Ding Wenxuan, Liu Xin, Song Yangqiu
- Badgpt: Exploring Security Vulnerabilities Of Chatgpt Via Backdoor Attacks To Instructgpt Shi Jiawen, Liu Yixin, Zhou Pan, Sun Lichao
- Unleashing The Power Of Pre-trained Language Models For Offline Reinforcement Learning Shi Ruizhe, Liu Yuyao, Ze Yanjie, Du Simon S., Xu Huazhe
- Preliminary Study On Incremental Learning For Large Language Model-based Recommender Systems Shi Tianhao, Zhang Yang, Xu Zhijian, Chen Chong, Feng Fuli, He Xiangnan, Tian Qi
- Trusting Your Evidence: Hallucinate Less With Context-aware Decoding Shi Weijia, Han Xiaochuang, Lewis Mike, Tsvetkov Yulia, Zettlemoyer Luke, Yih Scott Wen-tau
- MKRAG: Medical Knowledge Retrieval Augmented Generation For Medical Question Answering Shi Yucheng, Xu Shaochen, Yang Tianze, Liu Zhengliang, Liu Tianming, Li Quanzheng, Li Xiang, Liu Ninghao
- Dept: Decomposed Prompt Tuning For Parameter-efficient Fine-tuning Shi Zhengxiang, Lipani Aldo
- Exploring The Robustness Of Large Language Models For Solving Programming Problems Shirafuji Atsushi, Watanobe Yutaka, Ito Takumi, Morishita Makoto, Nakamura Yuki, Oda Yusuke, Suzuki Jun
- L3 Ensembles: Lifelong Learning Approach For Ensemble Of Foundational Language Models Shiri Aidin, Roy Kaushik, Sheth Amit, Gaur Manas
- Introducing Dictalm -- A Large Generative Language Model For Modern Hebrew Shmidman Shaltiel, Shmidman Avi, Cohen Amir David Nissan, Koppel Moshe
- The ART Of LLM Refinement: Ask, Refine, And Trust Shridhar Kumar, Sinha Koustuv, Cohen Andrew, Wang Tianlu, Yu Ping, Pasunuru Ram, Sachan Mrinmaya, Weston Jason, Celikyilmaz Asli
- Audio-visual LLM For Video Understanding Shu Fangxun, Zhang Lei, Jiang Hao, Xie Cihang
- Rewritelm: An Instruction-tuned Large Language Model For Text Rewriting Shu Lei, Luo Liangchen, Hoskere Jayakumar, Zhu Yun, Liu Yinxiao, Tong Simon, Chen Jindong, Meng Lei
- SC-ML: Self-supervised Counterfactual Metric Learning For Debiased Visual Question Answering Shu Xinyao, Yan Shiyang, Yang Xu, Wu Ziheng, Chen Zhongfeng, Lu Zhenyu
- A Survey On Multimodal Large Language Models Shukang Yin, Chaoyou Fu, Sirui Zhao, Ke Li, Xing Sun, Tong Xu, Enhong Chen
- Woodpecker: Hallucination Correction For Multimodal Large Language Models Shukang Yin, Chaoyou Fu, Sirui Zhao, Tong Xu, Hao Wang, Dianbo Sui, Yunhang Shen, Ke Li, Xing Sun, Enhong Chen
- Ep-alm: Efficient Perceptual Augmentation Of Language Models Shukor Mustafa, Dancette Corentin, Cord Matthieu
- Beyond Task Performance: Evaluating And Reducing The Flaws Of Large Multimodal Models With In-context Learning Shukor Mustafa, Rame Alexandre, Dancette Corentin, Cord Matthieu
- The Curse Of Recursion: Training On Generated Data Makes Models Forget Shumailov Ilia, Shumaylov Zakhar, Zhao Yiren, Gal Yarin, Papernot Nicolas, Anderson Ross
- Knowledge Unlearning For Llms: Tasks, Methods, And Challenges Si Nianwen, Zhang Hao, Chang Heyu, Zhang Wenlin, Qu Dan, Zhang Weiqiang
- Mitigating Object Hallucinations In Large Vision-language Models Through Visual Contrastive Decoding Sicong Leng, Hang Zhang, Guanzheng Chen, Xin Li, Shijian Lu, Chunyan Miao, Lidong Bing
- Generalized Planning In PDDL Domains With Pretrained Large Language Models Silver Tom, Dan Soham, Srinivas Kavitha, Tenenbaum Joshua B., Kaelbling Leslie Pack, Katz Michael
- Large Language Models For Aspect-based Sentiment Analysis Simmering Paul F., Huoviala Paavo
- Unlocking Model Insights: A Dataset For Automated Model Card Generation Singh Shruti, Lodwal Hitesh, Malwat Husain, Thakur Rakesh, Singh Mayank
- Gopro: Generate And Optimize Prompts In CLIP Using Self-supervised Learning Singha Mainak, Jha Ankit, Banerjee Biplab
- A Long Way To Go: Investigating Length Correlations In RLHF Singhal Prasann, Goyal Tanya, Xu Jiacheng, Durrett Greg
- TABLET: Learning From Instructions For Tabular Data Slack Dylan, Singh Sameer
- Don't Add, Don't Miss: Effective Content Preserving Generation From Pre-selected Text Spans Slobodkin Aviv, Caciularu Avi, Hirsch Eran, Dagan Ido
- LLM Guided Inductive Inference For Solving Compositional Problems Sodani Abhigya, Moos Lauren, Mirman Matthew
- SPARSEFIT: Few-shot Prompting With Sparse Fine-tuning For Jointly Generating Predictions And Natural Language Explanations Solano Jesus, Sanni Mardhiyah, Camburu Oana-maria, Minervini Pasquale
- Biomedical Knowledge Graph-optimized Prompt Generation For Large Language Models Soman Karthik, Rose Peter W, Morris John H, Akbas Rabia E, Smith Brett, Peetoom Braian, Villouta-reyes Catalina, Cerono Gabriel, Shi Yongmei, Rizk-jackson Angela, Israni Sharat, Nelson Charlotte A, Huang Sui, Baranzini Sergio E
- Conpet: Continual Parameter-efficient Tuning For Large Language Models Song Chenyang, Han Xu, Zeng Zheni, Li Kuai, Chen Chen, Liu Zhiyuan, Sun Maosong, Yang Tao
- Zebra: Extending Context Window With Layerwise Grouped Local-global Attention Song Kaiqiang, Wang Xiaoyang, Cho Sangwoo, Pan Xiaoman, Yu Dong
- Pneumollm: Harnessing The Power Of Large Language Model For Pneumoconiosis Diagnosis Song Meiyue, Yu Zhihua, Wang Jiaxin, Wang Jiarui, Lu Yuting, Li Baicun, Wang Xiaoxu, Huang Qinghua, Li Zhijun, Kanellakis Nikolaos I., Liu Jiangfeng, Wang Jing, Wang Binglu, Yang Juntao
- Large Language Models Meet Open-world Intent Discovery And Recognition: An Evaluation Of Chatgpt Song Xiaoshuai, He Keqing, Wang Pei, Dong Guanting, Mou Yutao, Wang Jingang, Xian Yunsen, Cai Xunliang, Xu Weiran
- Reward Collapse In Aligning Large Language Models Song Ziang, Cai Tianle, Lee Jason D., Su Weijie J.
- Learning Ui-to-code Reverse Generator Using Visual Critic Without Rendering Soselia Davit, Saifullah Khalid, Zhou Tianyi
- Dynamic Context Pruning For Efficient And Interpretable Autoregressive Transformers Sotiris Anagnostidis, Dario Pavllo, Luca Biggio, Lorenzo Noci, Aurelien Lucchi, Thomas Hofmann
- Curriculum-guided Abstractive Summarization Sotudeh Sajad, Deilamsalehy Hanieh, Dernoncourt Franck, Goharian Nazli
- Training Large Language Models Efficiently With Sparsity And Dataflow Srinivasan Venkat, Gandhi Darshan, Thakker Urmish, Prabhakar Raghu
- Beyond Memorization: Violating Privacy Via Inference With Large Language Models Staab Robin, Vero Mark, Balunović Mislav, Vechev Martin
- Structured Packing In LLM Training Improves Long Context Utilization Staniszewski Konrad, Tworkowski Szymon, Jaszczur Sebastian, Zhao Yu, Michalewski Henryk, Kuciński Łukasz, Miłoś Piotr
- The Languini Kitchen: Enabling Language Modelling Research At Different Scales Of Compute Stanić Aleksandar, Ashley Dylan, Serikov Oleg, Kirsch Louis, Faccio Francesco, Schmidhuber Jürgen, Hofmann Thomas, Schlag Imanol
- Seeing Seeds Beyond Weeds: Green Teaming Generative AI For Beneficial Uses Stapleton Logan, Taylor Jordan, Fox Sarah, Wu Tongshuang, Zhu Haiyi
- Probing Llms For Joint Encoding Of Linguistic Categories Starace Giulio, Papakostas Konstantinos, Choenni Rochelle, Panagiotopoulos Apostolos, Rosati Matteo, Leidinger Alina, Shutova Ekaterina
- Pythia: A Suite For Analyzing Large Language Models Across Training And Scaling Stella Biderman, Hailey Schoelkopf, Quentin Anthony, Herbie Bradley, Kyle O'brien, Eric Hallahan, Mohammad Aflah Khan, Shivanshu Purohit, Usvsn Sai Prashanth, Edward Raff, Aviya Skowron, Lintang Sutawika, Oskar Van Der Wal
- Emergent And Predictable Memorization In Large Language Models Stella Biderman, Usvsn Sai Prashanth, Lintang Sutawika, Hailey Schoelkopf, Quentin Anthony, Shivanshu Purohit, Edward Raff
- Finetuning An LLM On Contextual Knowledge Of Classics For Q&A Strachan Shane Storm
- Learning From Red Teaming: Gender Bias Provocation And Mitigation In Large Language Models Su Hsuan, Cheng Cheng-chu, Farn Hua, Kumar Shachi H, Sahay Saurav, Chen Shang-tse, Lee Hung-yi
- Choice Fusion As Knowledge For Zero-shot Dialogue State Tracking Su Ruolin, Yang Jingfeng, Wu Ting-wei, Juang Biing-hwang
- F3-pruning: A Training-free And Generalized Pruning Strategy Towards Faster And Finer Text-to-video Synthesis Su Sitong, Liu Jianzhi, Gao Lianli, Song Jingkuan
- Dial-mae: Contextual Masked Auto-encoder For Retrieval-based Dialogue Systems Su Zhenpeng, Wu Xing, Zhou Wei, Ma Guangyuan, Hu Songlin
- Language Model-in-the-loop: Data Optimal Approach To Learn-to-recommend Actions In Text Games Sudhakar Arjun Vaithilingam, Parthasarathi Prasanna, Rajendran Janarthanan, Chandar Sarath
- CONSCENDI: A Contrastive And Scenario-guided Distillation Approach To Guardrail Models For Virtual Assistants Sun Albert Yu, Nair Varun, Schumacher Elliot, Kannan Anitha
- Does Fine-tuning GPT-3 With The Openai API Leak Personally-identifiable Information? Sun Albert Yu, Zemour Eliott, Saxena Arushi, Vaidyanathan Udith, Lin Eric, Lau Christian, Mugunthan Vaikkunth
- Building High-accuracy Multilingual ASR With Gated Language Experts And Curriculum Training Sun Eric, Li Jinyu, Hu Yuxuan, Zhu Yimeng, Zhou Long, Xue Jian, Wang Peidong, Liu Linquan, Liu Shujie, Lin Edward, Gong Yifan
- Speech-based Slot Filling Using Large Language Models Sun Guangzhi, Feng Shutong, Jiang Dongcheng, Zhang Chao, Gašić Milica, Woodland Philip C.
- Fine-grained Audio-visual Joint Representations For Multimodal Large Language Models Sun Guangzhi, Yu Wenyi, Tang Changli, Chen Xianzhao, Tan Tian, Li Wei, Lu Lu, Ma Zejun, Zhang Chao
- Think-on-graph: Deep And Responsible Reasoning Of Large Language Model On Knowledge Graph Sun Jiashuo, Xu Chengjin, Tang Lumingyuan, Wang Saizhuo, Lin Chen, Gong Yeyun, Ni Lionel M., Shum Heung-yeung, Guo Jian
- Evaluating The Zero-shot Robustness Of Instruction-tuned Language Models Sun Jiuding, Shaib Chantal, Wallace Byron C.
- A Simple And Effective Pruning Approach For Large Language Models Sun Mingjie, Liu Zhuang, Bair Anna, Kolter J. Zico
- Corex: Pushing The Boundaries Of Complex Reasoning Through Multi-model Collaboration Sun Qiushi, Yin Zhangyue, Li Xiang, Wu Zhiyong, Qiu Xipeng, Kong Lingpeng
- EVA-CLIP: Improved Training Techniques For CLIP At Scale Sun Quan, Fang Yuxin, Wu Ledell, Wang Xinlong, Cao Yue
- Teaching The Pre-trained Model To Generate Simple Texts For Text Simplification Sun Renliang, Xu Wei, Wan Xiaojun
- Exploring The Impact Of Low-rank Adaptation On The Performance, Efficiency, And Regularization Of RLHF Sun Simeng, Gupta Dhawal, Iyyer Mohit
- Expnote: Black-box Large Language Models Are Better Task Solvers With Experience Notebook Sun Wangtao, Yu Xuanqing, He Shizhu, Zhao Jun, Liu Kang
- A Prompt Learning Framework For Source Code Summarization Sun Weisong, Fang Chunrong, You Yudu, Chen Yuchen, Liu Yi, Wang Chong, Zhang Jian, Zhang Quanjun, Qian Hanwei, Zhao Wei, Liu Yang, Chen Zhenyu
- A Comparative Study Between Full-parameter And Lora-based Fine-tuning On Chinese Instruction Data For Instruction Following Large Language Model Sun Xianghui, Ji Yunjie, Ma Baochang, Li Xiangang
- Parrot: Enhancing Multi-turn Instruction Following For Large Language Models Sun Yuchong, Liu Che, Zhou Kun, Huang Jinwen, Song Ruihua, Zhao Wayne Xin, Zhang Fuzheng, Zhang Di, Gai Kun
- Retentive Network: A Successor To Transformer For Large Language Models Sun Yutao, Dong Li, Huang Shaohan, Ma Shuming, Xia Yuqing, Xue Jilong, Wang Jianyong, Wei Furu
- Pre-training Intent-aware Encoders For Zero- And Few-shot Intent Classification Sung Mujeen, Gung James, Mansimov Elman, Pappas Nikolaos, Shu Raphael, Romeo Salvatore, Zhang Yi, Castelli Vittorio
- Improving Non-autoregressive Translation Quality With Pretrained Language Model, Embedding Distillation And Upsampling Strategy For CTC Syu Shen-sian, Xie Juncheng, Lee Hung-yi
- Large Language Models As Generalizable Policies For Embodied Tasks Szot Andrew, Schwarzer Max, Agrawal Harsh, Mazoure Bogdan, Talbott Walter, Metcalf Katherine, Mackraz Natalie, Hjelm Devon, Toshev Alexander
- Imaginations Of WALL-E: Reconstructing Experiences With An Imagination-inspired Module For Advanced AI Systems Taghavi Zeinab Sadat, Gooran Soroush, Dalili Seyed Arshan, Amirzadeh Hamidreza, Nematbakhsh Mohammad Jalal, Sameti Hossein
- Link-context Learning For Multimodal Llms Tai Yan, Fan Weichen, Zhang Zhao, Zhu Feng, Zhao Rui, Liu Ziwei
- Training Generative Question-answering On Synthetic Data Obtained From An Instruct-tuned Model Takahashi Kosuke, Omi Takahiro, Arima Kosuke, Ishigaki Tatsuya
- Tackling VQA With Pretrained Foundation Models Without Further Training Tan Alvin De Jun, Shen Bingquan
- Redcoast: A Lightweight Tool To Automate Distributed Training Of Llms On Any Gpu/tpus Tan Bowen, Zhu Yun, Liu Lijuan, Wang Hongyi, Zhuang Yonghao, Chen Jindong, Xing Eric, Hu Zhiting
- Cappy: Outperforming And Boosting Large Multi-task Lms With A Small Scorer Tan Bowen, Zhu Yun, Liu Lijuan, Xing Eric, Hu Zhiting, Chen Jindong
- Large Language Model-driven Classroom Flipping: Empowering Student-centric Peer Questioning With Flipped Interaction Tan Chee Wei
- Massive Editing For Large Language Models Via Meta Learning Tan Chenmien, Zhang Ge, Fu Jie
- Compound Text-guided Prompt Tuning Via Image-adaptive Cues Tan Hao, Li Jun, Zhou Yizhuang, Wan Jun, Lei Zhen, Zhang Xiangyu
- Medchatzh: A Better Medical Adviser Learns From Better Instructions Tan Yang, Li Mingchen, Huang Zijie, Yu Huiqun, Fan Guisheng
- Multiple-question Multiple-answer Text-vqa Tang Peng, Appalaraju Srikar, Manmatha R., Xie Yusheng, Mahadevan Vijay
- Toolalpaca: Generalized Tool Learning For Language Models With 3000 Simulated Cases Tang Qiaoyu, Deng Ziliang, Lin Hongyu, Han Xianpei, Liang Qiao, Cao Boxi, Sun Le
- Struc-bench: Are Large Language Models Really Good At Generating Complex Structured Data? Tang Xiangru, Zong Yiming, Phang Jason, Zhao Yilun, Zhou Wangchunshu, Cohan Arman, Gerstein Mark
- Medagents: Large Language Models As Collaborators For Zero-shot Medical Reasoning Tang Xiangru, Zou Anni, Zhang Zhuosheng, Li Ziming, Zhao Yilun, Zhang Xingyao, Cohan Arman, Gerstein Mark
- RSVP: Customer Intent Detection Via Agent Response Contrastive And Generative Pre-training Tang Yu-chien, Wang Wei-yao, Yen An-zi, Peng Wen-chih
- POUF: Prompt-oriented Unsupervised Fine-tuning For Large Pre-trained Models Tanwisuth Korawat, Zhang Shujian, Zheng Huangjie, He Pengcheng, Zhou Mingyuan
- A Benchmark For Learning To Translate A New Language From One Grammar Book Tanzer Garrett, Suzgun Mirac, Visser Eline, Jurafsky Dan, Melas-kyriazi Luke
- Adversarial Conversational Shaping For Intelligent Agents Tarasiewicz Piotr, Kenjeyev Sultan, Sebag Ilana, Alshehabi Shehab
- Expanding Frozen Vision-language Models Without Retraining: Towards Improved Robot Perception Tavassoli Riley, Amani Mani, Akhavian Reza
- CLAMP: Contrastive Language Model Prompt-tuning Teterwak Piotr, Sun Ximeng, Plummer Bryan A., Saenko Kate, Lim Ser-nam
- Leveraging Llms For Synthesizing Training Data Across Many Languages In Multilingual Dense Retrieval Thakur Nandan, Ni Jianmo, Ábrego Gustavo Hernández, Wieting John, Lin Jimmy, Cer Daniel
- SPDF: Sparse Pre-training And Dense Fine-tuning For Large Language Models Thangarasa Vithursan, Gupta Abhay, Marshall William, Li Tianda, Leong Kevin, Decoste Dennis, Lie Sean, Saxena Shreyas
- Is Chatgpt The Ultimate Programming Assistant -- How Far Is It? Tian Haoye, Lu Weiqi, Li Tsz On, Tang Xunzhu, Cheung Shing-chi, Klein Jacques, Bissyandé Tegawendé F.
- Just Ask For Calibration: Strategies For Eliciting Calibrated Confidence Scores From Language Models Fine-tuned With Human Feedback Tian Katherine, Mitchell Eric, Zhou Allan, Sharma Archit, Rafailov Rafael, Yao Huaxiu, Finn Chelsea, Manning Christopher D.
- Graph Neural Prompting With Large Language Models Tian Yijun, Song Huan, Wang Zichen, Wang Haozhu, Hu Ziqing, Wang Fang, Chawla Nitesh V., Xu Panpan
- Chimed-gpt: A Chinese Medical Large Language Model With Full Training Regime And Better Alignment To Human Preferences Tian Yuanhe, Gan Ruyi, Song Yan, Zhang Jiaxing, Zhang Yongdong
- BOOST: Harnessing Black-box Control To Boost Commonsense In Lms' Generation Tian Yufei, Zhang Felix, Peng Nanyun
- Impact Of Tokenization On Llama Russian Adaptation Tikhomirov Mikhail, Chernyshev Daniil
- Qlora: Efficient Finetuning Of Quantized Llms Tim Dettmers, Artidoro Pagnoni, Ari Holtzman, Luke Zettlemoyer
- Baby Llama: Knowledge Distillation From An Ensemble Of Teachers Trained On A Small Dataset With No Performance Penalty Timiryasov Inar, Tastet Jean-loup
- Fine-tune Language Models To Approximate Unbiased In-context Learning Timothy Chu, Zhao Song, Chiwun Yang
- D4: Improving LLM Pretraining Via Document De-duplication And Diversification Tirumala Kushal, Simig Daniel, Aghajanyan Armen, Morcos Ari S.
- Better Language Models Of Code Through Self-improvement To Hung Quoc, Bui Nghi D. Q., Guo Jin, Nguyen Tien N.
- Clinical Camel: An Open Expert-level Medical Language Model With Dialogue-based Knowledge Encoding Toma Augustin, Lawler Patrick R., Ba Jimmy, Krishnan Rahul G., Rubin Barry B., Wang Bo
- Parameter Efficient Tuning Allows Scalable Personalization Of Llms For Text Entry: A Case Study On Abbreviation Expansion Tomanek Katrin, Cai Shanqing, Venugopalan Subhashini
- Bioinstruct: Instruction Tuning Of Large Language Models For Biomedical Natural Language Processing Tran Hieu, Yang Zhichao, Yao Zonghai, Yu Hong
- Large Language Model Prompt Chaining For Long Legal Document Classification Trautmann Dietrich
- Automl-gpt: Large Language Model For Automl Tsai Yun-da, Tsai Yu-che, Huang Bo-wei, Yang Chun-pai, Lin Shou-de
- How Many Unicorns Are In This Image? A Safety Evaluation Benchmark For Vision Llms Tu Haoqin, Cui Chenhang, Wang Zijun, Zhou Yiyang, Zhao Bingchen, Han Junlin, Zhou Wangchunshu, Yao Huaxiu, Xie Cihang
- Sight Beyond Text: Multi-modal Training Enhances Llms In Truthfulness And Ethics Tu Haoqin, Zhao Bingchen, Wei Chen, Xie Cihang
- Efficiently Aligned Cross-lingual Transfer Learning For Conversational Tasks Using Prompt-tuning Tu Lifu, Qu Jin, Yavuz Semih, Joty Shafiq, Liu Wenhao, Xiong Caiming, Zhou Yingbo
- Automating Code-related Tasks Through Transformers: The Impact Of Pre-training Tufano Rosalia, Pascarella Luca, Bavota Gabriele
- Focused Transformer: Contrastive Training For Context Scaling Tworkowski Szymon, Staniszewski Konrad, Pacek Mikołaj, Wu Yuhuai, Michalewski Henryk, Miłoś Piotr
- Zeroshotdataaug: Generating And Augmenting Training Data With Chatgpt Ubani Solomon, Polat Suleyman Olcay, Nielsen Rodney
- Taco: Enhancing Cross-lingual Transfer For Low-resource Languages In Llms Through Translation-assisted Chain-of-thought Processes Upadhayay Bibek, Behzadan Vahid
- Mlongt5: A Multilingual And Efficient Text-to-text Transformer For Longer Sequences Uthus David, Ontañón Santiago, Ainslie Joshua, Guo Mandy
- In-context Ability Transfer For Question Decomposition In Complex QA V Venktesh, Bhattacharya Sourangshu, Anand Avishek
- Chatclimate: Grounding Conversational AI In Climate Science Vaghefi Saeid Ashraf, Wang Qian, Muccione Veruska, Ni Jingwei, Kraus Mathias, Bingler Julia, Schimanski Tobias, Colesanti-senni Chiara, Webersinke Nicolas, Huggel Christian, Leippold Markus
- Don't Believe Everything You Read: Enhancing Summarization Interpretability Through Automatic Identification Of Hallucinations In Large Language Models Vakharia Priyesh, Joshi Devavrat, Chavan Meenal, Sonawane Dhananjay, Garg Bhrigu, Mazaheri Parsa
- On The Automatic Generation And Simplification Of Children's Stories Valentini Maria, Weber Jennifer, Salcido Jesus, Wright Téa, Colunga Eliana, Kann Katharina
- Mini-gpts: Efficient Large Language Models Through Contextual Pruning Valicenti Tim, Vidal Justice, Patnaik Ritik
- The LLM Surgeon Van Der Ouderaa Tycho F. A., Nagel Markus, Van Baalen Mart, Asano Yuki M., Blankevoort Tijmen
- Large Language Models: The Need For Nuance In Current Debates And A Pragmatic Perspective On Understanding Van Dijk Bram M. A., Kouwenhoven Tom, Spruit Marco R., Van Duijn Max J.
- Can NLP Models Correctly Reason Over Contexts That Break The Common Assumptions? Varshney Neeraj, Parmar Mihir, Patel Nisarg, Handa Divij, Sarkar Sayantan, Luo Man, Baral Chitta
- Small Language Models Improve Giants By Rewriting Their Outputs Vernikos Giorgos, Bražinskas Arthur, Adamek Jakub, Mallinson Jonathan, Severyn Aliaksei, Malmi Eric
- Mtcue: Learning Zero-shot Control Of Extra-textual Attributes By Leveraging Unstructured Context In Neural Machine Translation Vincent Sebastian, Flynn Robert, Scarton Carolina
- Prompt2model: Generating Deployable Models From Natural Language Instructions Viswanathan Vijay, Zhao Chenyang, Bertsch Amanda, Wu Tongshuang, Neubig Graham
- Empirical Evaluation Of Uncertainty Quantification In Retrieval-augmented Language Models For Science Wagle Sridevi, Munikoti Sai, Acharya Anurag, Smith Sara, Horawalavithana Sameera
- PIP: Parse-instructed Prefix For Syntactically Controlled Paraphrase Generation Wan Yixin, Huang Kuan-hao, Chang Kai-wei
- Reformulating Domain Adaptation Of Large Language Models As Adapt-retrieve-revise: A Case Study On Chinese Legal Domain Wan Zhen, Zhang Yating, Wang Yexiang, Cheng Fei, Kurohashi Sadao
- Shall We Pretrain Autoregressive Language Models With Retrieval? A Comprehensive Study Wang Boxin, Ping Wei, Xu Peng, Mcafee Lawrence, Liu Zihan, Shoeybi Mohammad, Dong Yi, Kuchaiev Oleksii, Li Bo, Xiao Chaowei, Anandkumar Anima, Catanzaro Bryan
- BLSP: Bootstrapping Language-speech Pre-training Via Behavior Alignment Of Continuation Writing Wang Chen, Liao Minpeng, Huang Zhongqiang, Lu Jinliang, Wu Junhong, Liu Yuchen, Zong Chengqing, Zhang Jiajun
- BERT4CTR: An Efficient Framework To Combine Pre-trained Language Model With Non-textual Features For CTR Prediction Wang Dong, Salamatian Kavé, Xia Yunqing, Deng Weiwei, Zhiang Qi
- Docllm: A Layout-aware Generative Language Model For Multimodal Document Understanding Wang Dongsheng, Raman Natraj, Sibue Mathieu, Ma Zhiqiang, Babkin Petr, Kaur Simerjot, Pei Yulong, Nourbakhsh Armineh, Liu Xiaomo
- Openchat: Advancing Open-source Language Models With Mixed-quality Data Wang Guan, Cheng Sijie, Zhan Xianyuan, Li Xiangang, Song Sen, Liu Yang
- What Makes For Good Visual Tokenizers For Large Language Models? Wang Guangzhi, Ge Yixiao, Ding Xiaohan, Kankanhalli Mohan, Shan Ying
- Dialog Action-aware Transformer For Dialog Policy Learning Wang Huimin, Kwan Wai-chung, Wong Kam-fai
- Cross-lingual Knowledge Editing In Large Language Models Wang Jiaan, Liang Yunlong, Sun Zengkui, Cao Yuxuan, Xu Jiarong, Meng Fandong
- Knowledgeable In-context Tuning: Exploring And Exploiting Factual Knowledge For In-context Learning Wang Jianing, Wang Chengyu, Tan Chuanqi, Huang Jun, Gao Ming
- Missrec: Pre-training And Transferring Multi-modal Interest-aware Sequence Representation For Recommendation Wang Jinpeng, Zeng Ziyun, Wang Yunxiao, Wang Yuting, Lu Xingyu, Li Tianxiang, Yuan Jun, Zhang Rui, Zheng Hai-tao, Xia Shu-tao
- Improved Visual Fine-tuning With Natural Language Supervision Wang Junyang, Xu Yuanhong, Hu Juhua, Yan Ming, Sang Jitao, Qian Qi
- Mathcoder: Seamless Code Integration In Llms For Enhanced Mathematical Reasoning Wang Ke, Ren Houxing, Zhou Aojun, Lu Zimu, Luo Sichun, Shi Weikang, Zhang Renrui, Song Linqi, Zhan Mingjie, Li Hongsheng
- Adapting LLM Agents With Universal Feedback In Communication Wang Kuan, Lu Yadong, Santacroce Michael, Gong Yeyun, Zhang Chao, Shen Yelong
- Improving Text Embeddings With Large Language Models Wang Liang, Yang Nan, Huang Xiaolong, Yang Linjun, Majumder Rangan, Wei Furu
- Query2doc: Query Expansion With Large Language Models Wang Liang, Yang Nan, Wei Furu
- Document-level Machine Translation With Large Language Models Wang Longyue, Lyu Chenyang, Ji Tianbo, Zhang Zhirui, Yu Dian, Shi Shuming, Tu Zhaopeng
- Making Large Language Models Better Reasoners With Alignment Wang Peiyi, Li Lei, Chen Liang, Song Feifan, Lin Binghuai, Cao Yunbo, Liu Tianyu, Sui Zhifang
- Demystifying Instruction Mixing For Fine-tuning Large Language Models Wang Renxi, Li Haonan, Wu Minghao, Wang Yuxia, Han Xudong, Zhang Chiyu, Baldwin Timothy
- Aurora: Activating Chinese Chat Capability For Mixtral-8x7b Sparse Mixture-of-experts Through Instruction-tuning Wang Rongsheng, Chen Haoming, Zhou Ruizhe, Duan Yaofei, Cai Kunyan, Ma Han, Cui Jiaxi, Li Jian, Pang Patrick Cheong-iao, Wang Yapeng, Tan Tao
- Ivygpt: Interactive Chinese Pathway Language Model In Medical Domain Wang Rongsheng, Duan Yaofei, Lam Chantong, Chen Jiexi, Xu Jiangsheng, Chen Haoming, Liu Xiaohong, Pang Patrick Cheong-iao, Tan Tao
- Compositional Text-to-image Synthesis With Attention Map Control Of Diffusion Models Wang Ruichen, Chen Zekang, Chen Chen, Ma Jian, Lu Haonan, Lin Xiaodong
- READ: Recurrent Adaptation Of Large Transformers Wang Sid, Nguyen John, Li Ke, Wu Carole-jean
- CAR: Conceptualization-augmented Reasoner For Zero-shot Commonsense Question Answering Wang Weiqi, Fang Tianqing, Ding Wenxuan, Xu Baixuan, Liu Xin, Song Yangqiu, Bosselut Antoine
- All Languages Matter: On The Multilingual Safety Of Large Language Models Wang Wenxuan, Tu Zhaopeng, Chen Chang, Yuan Youliang, Huang Jen-tse, Jiao Wenxiang, Lyu Michael R.
- Lora Ensembles For Large Language Model Fine-tuning Wang Xi, Aitchison Laurence, Rudolph Maja
- TRACE: A Comprehensive Benchmark For Continual Learning In Large Language Models Wang Xiao, Zhang Yuansen, Chen Tianze, Gao Songyang, Jin Senjie, Yang Xianjun, Xi Zhiheng, Zheng Rui, Zou Yicheng, Gui Tao, Zhang Qi, Huang Xuanjing
- Guiding Language Model Reasoning With Planning Tokens Wang Xinyi, Caccia Lucas, Ostapenko Oleksiy, Yuan Xingdi, Wang William Yang, Sordoni Alessandro
- FIAT: Fusing Learning Paradigms With Instruction-accelerated Tuning Wang Xinyi, Wieting John, Clark Jonathan H.
- Emotional Intelligence Of Large Language Models Wang Xuena, Li Xueting, Yin Zi, Wu Yue, Jia Liu
- Non-intrusive Adaptation: Input-centric Parameter-efficient Fine-tuning For Versatile Multimodal Modeling Wang Yaqing, Wu Jialin, Dabral Tanmaya, Zhang Jiageng, Brown Geoff, Lu Chun-ta, Liu Frederick, Liang Yi, Pang Bo, Bendersky Michael, Soricut Radu
- Rescue: Ranking LLM Responses With Partial Ordering To Improve Response Generation Wang Yikun, Zheng Rui, Li Haoming, Zhang Qi, Gui Tao, Liu Fei
- Multilora: Democratizing Lora For Better Multi-task Learning Wang Yiming, Lin Yu, Zeng Xiaodong, Zhang Guannan
- Primacy Effect Of Chatgpt Wang Yiwei, Cai Yujun, Chen Muhao, Liang Yuxuan, Hooi Bryan
- Fake Alignment: Are Llms Really Aligned Well? Wang Yixu, Teng Yan, Huang Kexin, Lyu Chengqi, Zhang Songyang, Zhang Wenwei, Ma Xingjun, Jiang Yu-gang, Qiao Yu, Wang Yingchun
- Towards Improving Document Understanding: An Exploration On Text-grounding Via Mllms Wang Yonghui, Zhou Wengang, Feng Hao, Zhou Keyi, Li Houqiang
- Gpt4video: A Unified Multimodal Large Language Model For Instruction-followed Understanding And Safety-aware Generation Wang Zhanyu, Wang Longyue, Zhao Zhen, Wu Minghao, Lyu Chenyang, Li Huayang, Cai Deng, Zhou Luping, Shi Shuming, Tu Zhaopeng
- Abspyramid: Benchmarking The Abstraction Ability Of Language Models With A Unified Entailment Graph Wang Zhaowei, Shi Haochen, Wang Weiqi, Fang Tianqing, Zhang Hongming, Choi Sehyun, Liu Xin, Song Yangqiu
- An In-depth Investigation Of User Response Simulation For Conversational Search Wang Zhenduo, Xu Zhichao, Ai Qingyao, Srikumar Vivek
- Mediagpt: A Large Language Model For Chinese Media Wang Zhonghao, Lu Zijia, Jin Bo, Deng Haiying
- Enabling Language Models To Implicitly Learn Self-improvement Wang Ziqi, Hou Le, Lu Tianjian, Wu Yuexin, Li Yunxuan, Yu Hongkun, Ji Heng
- Do Llms Understand User Preferences? Evaluating Llms On User Rating Prediction Wang-cheng Kang, Jianmo Ni, Nikhil Mehta, Maheswaran Sathiamoorthy, Lichan Hong, Ed Chi, Derek Zhiyuan Cheng
- Call For Papers -- The Babylm Challenge: Sample-efficient Pretraining On A Developmentally Plausible Corpus Warstadt Alex, Choshen Leshem, Mueller Aaron, Williams Adina, Wilcox Ethan, Zhuang Chengxu
- Machine Translation For Ge'ez Language Wassie Aman Kassahun
- A Survey Of Large Language Models Wayne Xin Zhao, Kun Zhou, Junyi Li, Tianyi Tang, Xiaolei Wang, Yupeng Hou, Yingqian Min, Beichen Zhang, Junjie Zhang, Zican Dong, Yifan Du, Chen Yang, Yushuo Chen, Zhipeng Chen, Jinhao Jiang, Ruiyang Ren, Yifan Li, Xinyu Tang, Zikang Liu, Peiyu Liu, Jian-yun Nie, Ji-rong Wen
- Jailbroken: How Does LLM Safety Training Fail? Wei Alexander, Haghtalab Nika, Steinhardt Jacob
- Larger Language Models Do In-context Learning Differently Wei Jerry, Wei Jason, Tay Yi, Tran Dustin, Webson Albert, Lu Yifeng, Chen Xinyun, Liu Hanxiao, Huang Da, Zhou Denny, Ma Tengyu
- Skywork: A More Open Bilingual Foundation Model Wei Tianwen, Zhao Liang, Zhang Lichang, Zhu Bo, Wang Lijie, Yang Haihua, Li Biye, Cheng Cheng, Lü Weiwei, Hu Rui, Li Chenxia, Yang Liu, Luo Xilin, Wu Xuejie, Liu Lunan, Cheng Wenjun, Cheng Peng, Zhang Jianhao, Zhang Xiaoyu, Lin Lei, Wang Xiaokun, Ma Yutuan, Dong Chuanhai, Sun Yanqi, Chen Yifu, Peng Yongyi, Liang Xiaojuan, Yan Shuicheng, Fang Han, Zhou Yahui
- Polylm: An Open Source Polyglot Large Language Model Wei Xiangpeng, Wei Haoran, Lin Huan, Li Tianhao, Zhang Pei, Ren Xingzhang, Li Mei, Wan Yu, Cao Zhiwei, Xie Binbin, Hu Tianxiang, Li Shangjie, Hui Binyuan, Yu Bowen, Liu Dayiheng, Yang Baosong, Huang Fei, Xie Jun
- In-context Pretraining: Language Modeling Beyond Document Boundaries Weijia Shi, Sewon Min, Maria Lomeli, Chunting Zhou, Margaret Li, Gergely Szilvasy, Rich James, Xi Victoria Lin, Noah A. Smith, Luke Zettlemoyer, Scott Yih, Mike Lewis
- Is Chatgpt Good At Search? Investigating Large Language Models As Re-ranking Agents Weiwei Sun, Lingyong Yan, Xinyu Ma, Shuaiqiang Wang, Pengjie Ren, Zhumin Chen, Dawei Yin, Zhaochun Ren
- LLMSTEP: LLM Proofstep Suggestions In Lean Welleck Sean, Saha Rahul
- "according To ...": Prompting Language Models Improves Quoting From Pre-training Data Weller Orion, Marone Marc, Weir Nathaniel, Lawrie Dawn, Khashabi Daniel, Van Durme Benjamin
- Droidbot-gpt: Gpt-powered UI Automation For Android Wen Hao, Wang Hongming, Liu Jiaxuan, Li Yuanchun
- Unveiling The Implicit Toxicity In Large Language Models Wen Jiaxin, Ke Pei, Sun Hao, Zhang Zhexin, Li Chengfei, Bai Jinfeng, Huang Minlie
- Hyperpolyglot Llms: Cross-lingual Interpretability In Token Embeddings Wen-yi Andrea W., Mimno David
- Lmtuner: An User-friendly And Highly-integrable Training Framework For Fine-tuning Large Language Models Weng Yixuan, Wang Zhiqi, Liao Huanxuan, He Shizhu, Liu Shengping, Liu Kang, Zhao Jun
- Instructblip: Towards General-purpose Vision-language Models With Instruction Tuning Wenliang Dai, Junnan Li, Dongxu Li, Anthony Meng Huat Tiong, Junqi Zhao, Weisheng Wang, Boyang Li, Pascale Fung, Steven Hoi
- Exploring Parameter-efficient Fine-tuning Techniques For Code Generation With Large Language Models Weyssow Martin, Zhou Xin, Kim Kisub, Lo David, Sahraoui Houari
- Llm-powered Data Augmentation For Enhanced Cross-lingual Performance Whitehouse Chenxi, Choudhury Monojit, Aji Alham Fikri
- Think Twice: Perspective-taking Improves Large Language Models' Theory-of-mind Capabilities Wilf Alex, Lee Sihyun Shawn, Liang Paul Pu, Morency Louis-philippe
- Resolving References In Visually-grounded Dialogue Via Text Generation Willemsen Bram, Qian Livia, Skantze Gabriel
- Frustratingly Simple Memory Efficiency For Pre-trained Language Models Via Dynamic Embedding Pruning Williams Miles, Aletras Nikolaos
- From Big To Small Without Losing It All: Text Augmentation With Chatgpt For Efficient Sentiment Analysis Woźniak Stanisław, Kocoń Jan
- Pmc-llama: Towards Building Open-source Language Models For Medicine Wu Chaoyi, Lin Weixiong, Zhang Xiaoman, Zhang Ya, Wang Yanfeng, Xie Weidi
- Rethinking Model Selection And Decoding For Keyphrase Generation With Pre-trained Sequence-to-sequence Models Wu Di, Ahmad Wasi Uddin, Chang Kai-wei
- P-laplacian Adaptation For Generative Pre-trained Vision-language Models Wu Haoyuan, Zhang Xinyun, Xu Peng, Liao Peiyu, Yao Xufeng, Yu Bei
- Empower Nested Boolean Logic Via Self-supervised Curriculum Learning Wu Hongqiu, Liu Linfeng, Zhao Hai, Zhang Min
- Toward Adversarial Training On Contextualized Language Representation Wu Hongqiu, Liu Yongxiang, Shi Hanwen, Zhao Hai, Zhang Min
- Large Language Models Leverage External Knowledge To Extend Clinical Insight Beyond Language Boundaries Wu Jiageng, Wu Xian, Qiu Zhaopeng, Li Minghui, Zhang Yingying, Zheng Yefeng, Yuan Changzheng, Yang Jie
- Lamini-lm: A Diverse Herd Of Distilled Models From Large-scale Instructions Wu Minghao, Waheed Abdul, Zhang Chiyu, Abdul-mageed Muhammad, Aji Alham Fikri
- Approximated Prompt Tuning For Vision-language Pre-trained Models Wu Qiong, Huang Shubin, Zhou Yiyi, Dai Pingyang, Shu Annan, Jiang Guannan, Ji Rongrong
- A Comparative Study Of Open-source Large Language Models, GPT-4 And Claude 2: Multiple-choice Test Taking In Nephrology Wu Sean, Koo Michael, Blum Lesley, Black Andy, Kao Liyo, Scalzo Fabien, Kurtz Ira
- YUAN 2.0: A Large Language Model With Localized Filtering-based Attention Wu Shaohua, Zhao Xudong, Wang Shenling, Luo Jiangang, Li Lingjun, Chen Xi, Zhao Bing, Wang Wei, Yu Tong, Zhang Rongguo, Zhang Jiahua, Wang Chao
- Next-gpt: Any-to-any Multimodal LLM Wu Shengqiong, Fei Hao, Qu Leigang, Ji Wei, Chua Tat-seng
- Towards Robust Text Retrieval With Progressive Learning Wu Tong, Qin Yulei, Zhang Enwei, Xu Zihan, Gao Yuting, Li Ke, Sun Xing
- Self-correcting Llm-controlled Diffusion Models Wu Tsung-han, Lian Long, Gonzalez Joseph E., Li Boyi, Darrell Trevor
- Paragraph-to-image Generation With Information-enriched Diffusion Model Wu Weijia, Li Zhuang, He Yefei, Shou Mike Zheng, Shen Chunhua, Cheng Lele, Li Yan, Gao Tingting, Zhang Di, Wang Zhongyuan
- TLM: Token-level Masking For Transformers Wu Yangjun, Fang Kebin, Zhang Dongxiang, Wang Han, Zhang Hao, Chen Gang
- Fine-grained Human Feedback Gives Better Rewards For Language Model Training Wu Zeqiu, Hu Yushi, Shi Weijia, Dziri Nouha, Suhr Alane, Ammanabrolu Prithviraj, Smith Noah A., Ostendorf Mari, Hajishirzi Hannaneh
- Openicl: An Open-source Framework For In-context Learning Wu Zhenyu, Wang Yaoxiang, Ye Jiacheng, Feng Jiangtao, Xu Jingjing, Qiao Yu, Wu Zhiyong
- A Simple Baseline For Knowledge-based Visual Question Answering Xenos Alexandros, Stafylakis Themos, Patras Ioannis, Tzimiropoulos Georgios
- Towards Open-world Recommendation With Knowledge Augmentation From Large Language Models Xi Yunjia, Liu Weiwen, Lin Jianghao, Cai Xiaoling, Zhu Hong, Zhu Jieming, Chen Bo, Tang Ruiming, Zhang Weinan, Zhang Rui, Yu Yong
- LLMGA: Multimodal Large Language Model Based Generation Assistant Xia Bin, Wang Shiyin, Tao Yingfan, Wang Yitong, Jia Jiaya
- Sheared Llama: Accelerating Language Model Pre-training Via Structured Pruning Xia Mengzhou, Gao Tianyu, Zeng Zhiyuan, Chen Danqi
- Language Models Meet World Models: Embodied Experiences Enhance Language Models Xiang Jiannan, Tao Tianhua, Gu Yi, Shu Tianmin, Wang Zirui, Yang Zichao, Hu Zhiting
- Efficient Streaming Language Models With Attention Sinks Xiao Guangxuan, Tian Yuandong, Chen Beidi, Han Song, Lewis Mike
- Can I Trust Your Answer? Visually Grounded Video Question Answering Xiao Junbin, Yao Angela, Li Yicong, Chua Tat Seng
- Large Language Models Can Be Good Privacy Protection Learners Xiao Yijia, Jin Yiqiao, Bai Yushi, Wu Yue, Yang Xianjun, Luo Xiao, Yu Wenchao, Zhao Xujiang, Liu Yanchi, Chen Haifeng, Wang Wei, Cheng Wei
- O3D: Offline Data-driven Discovery And Distillation For Sequential Decision-making With Large Language Models Xiao Yuchen, Sun Yanchao, Xu Mengda, Madhushani Udari, Vann Jared, Garg Deepeka, Ganesh Sumitra
- Instruction-vit: Multi-modal Prompts For Instruction Learning In Vit Xiao Zhenxiang, Chen Yuzhong, Zhang Lu, Yao Junjie, Wu Zihao, Yu Xiaowei, Pan Yi, Zhao Lin, Ma Chong, Liu Xinyu, Liu Wei, Li Xiang, Yuan Yixuan, Shen Dinggang, Zhu Dajiang, Liu Tianming, Jiang Xi
- Supporting Qualitative Analysis With Large Language Models: Combining Codebook With GPT-3 For Deductive Coding Xiao Ziang, Yuan Xingdi, Liao Q. Vera, Abdelghani Rania, Oudeyer Pierre-yves
- Instructed Language Models With Retrievers Are Powerful Entity Linkers Xiao Zilin, Gong Ming, Wu Jie, Zhang Xingyao, Shou Linjun, Pei Jian, Jiang Daxin
- Pangu-σ: Towards Trillion Parameter Language Model With Sparse Heterogeneous Computing Xiaozhe Ren, Pingyi Zhou, Xinfan Meng, Xinjing Huang, Yadao Wang, Weichao Wang, Pengfei Li, Xiaoda Zhang, Alexander Podolskiy, Grigory Arshinov, Andrey Bout, Irina Piontkovskaya, Jiansheng Wei, Xin Jiang, Teng Su, Qun Liu, Jun Yao
- QUERT: Continual Pre-training Of Language Model For Query Understanding In Travel Domain Search Xie Jian, Liang Yidan, Liu Jingping, Xiao Yanghua, Wu Baohua, Ni Shenghua
- The Wall Street Neophyte: A Zero-shot Analysis Of Chatgpt Over Multimodal Stock Movement Prediction Challenges Xie Qianqian, Han Weiguang, Lai Yanzhao, Peng Min, Huang Jimin
- Proto-lm: A Prototypical Network-based Framework For Built-in Interpretability In Large Language Models Xie Sean, Vosoughi Soroush, Hassanpour Saeed
- Analysis Of The Reasoning With Redundant Information Provided Ability Of Large Language Models Xie Wenbei
- Making Small Language Models Better Multi-task Learners With Mixture-of-task-adapters Xie Yukang, Wang Chengyu, Yan Junbing, Zhou Jiyong, Deng Feiqi, Huang Jun
- A Prompt Log Analysis Of Text-to-image Generation Systems Xie Yutong, Pan Zhaoying, Ma Jinge, Jie Luo, Mei Qiaozhu
- Llm-pruner: On The Structural Pruning Of Large Language Models Xinyin Ma, Gongfan Fang, Xinchao Wang
- Doctorglm: Fine-tuning Your Chinese Doctor Is Not A Herculean Task Xiong Honglin, Wang Sheng, Zhu Yitao, Zhao Zihao, Liu Yuxiao, Huang Linlin, Wang Qian, Shen Dinggang
- Can Llms Express Their Uncertainty? An Empirical Evaluation Of Confidence Elicitation In Llms Xiong Miao, Hu Zhiyuan, Lu Xinyang, Li Yifei, Fu Jie, He Junxian, Hooi Bryan
- Effective Long-context Scaling Of Foundation Models Xiong Wenhan, Liu Jingyu, Molybog Igor, Zhang Hejia, Bhargava Prajjwal, Hou Rui, Martin Louis, Rungta Rashi, Sankararaman Karthik Abinav, Oguz Barlas, Khabsa Madian, Fang Han, Mehdad Yashar, Narang Sharan, Malik Kshitiz, Fan Angela, Bhosale Shruti, Edunov Sergey, Lewis Mike, Wang Sinong, Ma Hao
- Rewoo: Decoupling Reasoning From Observations For Efficient Augmented Language Models Xu Binfeng, Peng Zhiyuan, Lei Bowen, Mukherjee Subhabrata, Liu Yuchen, Xu Dongkuan
- Automatic Pair Construction For Contrastive Post-training Xu Canwen, Rosset Corby, Chau Ethan C., Del Corro Luciano, Mahajan Shweti, Mcauley Julian, Neville Jennifer, Awadallah Ahmed Hassan, Rao Nikhil
- Ctc-based Non-autoregressive Speech Translation Xu Chen, Liu Xiaoqian, Liu Xiaowen, Sun Qingxuan, Zhang Yuhao, Yang Murun, Dong Qianqian, Ko Tom, Wang Mingxuan, Xiao Tong, Ma Anxiang, Zhu Jingbo
- DPL: Decoupled Prompt Learning For Vision-language Models Xu Chen, Zhu Yuhan, Zhang Guozhen, Shen Haocheng, Liao Yixuan, Chen Xiaoxin, Wu Gangshan, Wang Limin
- Align On The Fly: Adapting Chatbot Behavior To Established Norms Xu Chunpu, Chern Steffi, Chern Ethan, Zhang Ge, Wang Zekun, Liu Ruibo, Li Jing, Fu Jie, Liu Pengfei
- Symbol-llm: Towards Foundational Symbol-centric Interface For Large Language Models Xu Fangzhi, Wu Zhiyong, Sun Qiushi, Ren Siyu, Yuan Fei, Yuan Shuai, Lin Qika, Qiao Yu, Liu Jun
- A Paradigm Shift In Machine Translation: Boosting Translation Performance Of Large Language Models Xu Haoran, Kim Young Jin, Sharaf Amr, Awadalla Hany Hassan
- No Train Still Gain. Unleash Mathematical Reasoning Of Large Language Models With Monte Carlo Tree Search Guided By Energy Function Xu Haotian
- U-llava: Unifying Multi-modal Tasks Via Large Language Model Xu Jinjin, Xu Liwu, Yang Yuzhe, Li Xiang, Wang Fanyi, Xie Yanchun, Huang Yi-jie, Li Yaqian
- On The Tool Manipulation Capability Of Open-source Large Language Models Xu Qiantong, Hong Fenglu, Li Bo, Hu Changran, Chen Zhengyu, Zhang Jian
- Knowledge-infused Prompting: Assessing And Advancing Clinical Text Data Generation With Large Language Models Xu Ran, Cui Hejie, Yu Yue, Kan Xuan, Shi Wenqi, Zhuang Yuchen, Jin Wei, Ho Joyce, Yang Carl
- Fine-tuned Llms Know More, Hallucinate Less With Few-shot Sequence-to-sequence Semantic Parsing Over Wikidata Xu Silei, Liu Shicheng, Culhane Theo, Pertseva Elizaveta, Wu Meng-hsi, Semnani Sina J., Lam Monica S.
- Shattering The Agent-environment Interface For Fine-tuning Inclusive Language Models Xu Wanqiao, Dong Shi, Arumugam Dilip, Van Roy Benjamin
- Reasons To Reject? Aligning Language Models With Judgments Xu Weiwen, Cai Deng, Zhang Zhisong, Lam Wai, Shi Shuming
- It Ain't That Bad: Understanding The Mysterious Performance Drop In OOD Generalization For Generative Transformer Models Xu Xingcheng, Pan Zihao, Zhang Haipeng, Yang Yanqing
- Mental-llm: Leveraging Large Language Models For Mental Health Prediction Via Online Text Data Xu Xuhai, Yao Bingsheng, Dong Yuanzhe, Gabriel Saadia, Yu Hong, Hendler James, Ghassemi Marzyeh, Dey Anind K., Wang Dakuo
- Drivegpt4: Interpretable End-to-end Autonomous Driving Via Large Language Model Xu Zhenhua, Zhang Yujia, Xie Enze, Zhao Zhen, Guo Yong, Wong Kwan-yee K., Li Zhenguo, Zhao Hengshuang
- IDOL: Indicator-oriented Logic Pre-training For Logical Reasoning Xu Zihang, Yang Ziqing, Cui Yiming, Wang Shijin
- Pink: Unveiling The Power Of Referential Comprehension For Multi-modal Llms Xuan Shiyu, Guo Qingpei, Yang Ming, Zhang Shiliang
- To Repeat Or Not To Repeat: Insights From Scaling LLM Under Token-crisis Xue Fuzhao, Fu Yao, Zhou Wangchunshu, Zheng Zangwei, You Yang
- Bias And Fairness In Chatbots: An Overview Xue Jintang, Wang Yun-cheng, Wei Chengwei, Liu Xiaofeng, Woo Jonghye, Kuo C.-c. Jay
- Parameter-efficient Tuning Helps Language Model Alignment Xue Tianci, Wang Ziqi, Ji Heng
- Scaling Evidence-based Instructional Design Expertise Through Large Language Models Yadav Gautam
- Pretraining Data Mixtures Enable Narrow Model Selection Capabilities In Transformer Models Yadlowsky Steve, Doshi Lyric, Tripuraneni Nilesh
- Evaluating Large Language Models' Ability Using A Psychiatric Screening Tool Based On Metaphor And Sarcasm Scenarios Yakura Hiromu
- Evaluating Spatial Understanding Of Large Language Models Yamada Yutaro, Bao Yihan, Lampinen Andrew K., Kasai Jungo, Yildirim Ilker
- Backdooring Instruction-tuned Large Language Models With Virtual Prompt Injection Yan Jun, Yadav Vikas, Li Shiyang, Chen Lichang, Tang Zheng, Wang Hai, Srinivasan Vijay, Ren Xiang, Jin Hongxia
- Ask More, Know Better: Reinforce-learned Prompt Questions For Decision Making With Large Language Models Yan Xue, Song Yan, Cui Xinyu, Christianos Filippos, Zhang Haifeng, Mguni David Henry, Wang Jun
- Baichuan 2: Open Large-scale Language Models Yang Aiyuan, Xiao Bin, Wang Bingning, Zhang Borong, Bian Ce, Yin Chao, Lv Chenxu, Pan Da, Wang Dian, Yan Dong, Yang Fan, Deng Fei, Wang Feng, Liu Feng, Ai Guangwei, Dong Guosheng, Zhao Haizhou, Xu Hang, Sun Haoze, Zhang Hongda, Liu Hui, Ji Jiaming, Xie Jian, Dai Juntao, Fang Kun, Su Lei, Song Liang, Liu Lifeng, Ru Liyun, Ma Luyao, Wang Mang, Liu Mickel, Lin Mingan, Nie Nuolan, Guo Peidong, Sun Ruiyang, Zhang Tao, Li Tianpeng, Li Tianyu, Cheng Wei, Chen Weipeng, Zeng Xiangrong, Wang Xiaochuan, Chen Xiaoxi, Men Xin, Yu Xin, Pan Xuehai, Shen Yanjun, Wang Yiding, Li Yiyu, Jiang Youxin, Gao Yuchen, Zhang Yupeng, Zhou Zenan, Wu Zhiying
- LACMA: Language-aligning Contrastive Learning With Meta-actions For Embodied Instruction Following Yang Cheng-fu, Chen Yen-chun, Yang Jianwei, Dai Xiyang, Yuan Lu, Wang Yu-chiang Frank, Chang Kai-wei
- Investigating Zero-shot Generalizability On Mandarin-english Code-switched ASR And Speech-to-text Translation Of Recent Foundation Models With Self-supervision And Weak Supervision Yang Chih-kai, Huang Kuan-po, Lu Ke-han, Kuan Chun-yi, Hsiao Chi-yuan, Lee Hung-yi
- Tackling Vision Language Tasks Through Learning Inner Monologues Yang Diji, Chen Kezhen, Rao Jinmeng, Guo Xiaoyuan, Zhang Yawen, Yang Jie, Zhang Yi
- Refgpt: Dialogue Generation Of GPT, By GPT, And For GPT Yang Dongjie, Yuan Ruifeng, Fan Yuantao, Yang Yifei, Wang Zili, Wang Shusen, Zhao Hai
- PALR: Personalization Aware Llms For Recommendation Yang Fan, Chen Zheng, Jiang Ziyan, Cho Eunah, Huang Xiaojiang, Lu Yanbin
- Parrot Mind: Towards Explaining The Complex Task Reasoning Of Pretrained Large Language Models With Template-content Structure Yang Haotong, Meng Fanxu, Lin Zhouchen, Zhang Muhan
- Instoptima: Evolutionary Multi-objective Instruction Optimization Via Large Language Model-based Instruction Operators Yang Heng, Li Ke
- Auto-gpt For Online Decision Making: Benchmarks And Additional Opinions Yang Hui, Yue Sifu, He Yunzhong
- Longqlora: Efficient And Effective Method To Extend Context Length Of Large Language Models Yang Jianxin
- Iterative Forward Tuning Boosts In-context Learning In Language Models Yang Jiaxi, Hui Binyuan, Yang Min, Wang Bailin, Li Bowen, Li Binhua, Huang Fei, Li Yongbin
- Unified Language-vision Pretraining In LLM With Dynamic Discrete Visual Tokenization Yang Jin, Kun Xu, Kun Xu, Liwei Chen, Chao Liao, Jianchao Tan, Quzhe Huang, Bin Chen, Chenyi Lei, An Liu, Chengru Song, Xiaoqiang Lei, Di Zhang, Wenwu Ou, Kun Gai, Yadong Mu
- Kobigbird-large: Transformation Of Transformer For Korean Language Understanding Yang Kisu, Jang Yoonna, Lee Taewoo, Seong Jinwoo, Lee Hyungjin, Jang Hwanseok, Lim Heuiseok
- Medxchat: A Unified Multimodal Large Language Model Framework Towards Cxrs Understanding And Generation Yang Ling, Wang Zhanyu, Chen Zhenghao, Liang Xinyu, Zhou Luping
- Skymath: Technical Report Yang Liu, Yang Haihua, Cheng Wenjun, Lin Lei, Li Chenxia, Chen Yifu, Liu Lunan, Pan Jianfei, Wei Tianwen, Li Biye, Zhao Liang, Wang Lijie, Zhu Bo, Li Guoliang, Wu Xuejie, Luo Xilin, Hu Rui
- Lidar-llm: Exploring The Potential Of Large Language Models For 3D Lidar Understanding Yang Senqiao, Liu Jiaming, Zhang Ray, Pan Mingjie, Guo Zoey, Li Xiaoqi, Chen Zehui, Gao Peng, Guo Yandong, Zhang Shanghang
- Progressive Evidence Refinement For Open-domain Multimodal Retrieval Question Answering Yang Shuwen, Wu Anran, Wu Xingjiao, Xiao Luwei, Ma Tianlong, Jin Cheng, He Liang
- Zhongjing: Enhancing The Chinese Medical Capabilities Of Large Language Model Through Expert Feedback And Real-world Multi-turn Dialogue Yang Songhua, Zhao Hanjie, Zhu Senbin, Zhou Guangyu, Xu Hongfei, Jia Yuxiang, Zan Hongying
- Dior-cvae: Pre-trained Language Models And Diffusion Priors For Variational Dialog Generation Yang Tianyu, Tran Thy Thy, Gurevych Iryna
- Bigtranslate: Augmenting Large Language Models With Multilingual Translation Capability Over 100 Languages Yang Wen, Li Chong, Zhang Jiajun, Zong Chengqing
- Enabling Large Language Models To Learn From Rules Yang Wenkai, Lin Yankai, Zhou Jie, Wen Jirong
- Exploring An LM To Generate Prolog Predicates From Mathematics Questions Yang Xiaocheng, Tam Yik-cheung
- Lever LM: Configuring In-context Sequence To Lever Large Vision Language Models Yang Xu, Peng Yingzhe, Ma Haoxuan, Xu Shuo, Zhang Chi, Han Yucheng, Zhang Hanwang
- On The Calibration Of Multilingual Question Answering Llms Yang Yahan, Dan Soham, Roth Dan, Lee Insup
- Mindllm: Pre-training Lightweight Large Language Model From Scratch, Evaluations And Domain Applications Yang Yizhe, Sun Huashan, Li Jiawei, Liu Runheng, Li Yinghao, Liu Yuhang, Huang Heyan, Gao Yang
- Harnessing The Power Of Large Language Models For Natural Language To First-order Logic Translation Yang Yuan, Xiong Siheng, Payani Ali, Shareghi Ehsan, Fekri Faramarz
- Improving The Reliability Of Large Language Models By Leveraging Uncertainty-aware In-context Learning Yang Yuchen, Li Houqiang, Wang Yanfeng, Wang Yu
- On The Planning, Search, And Memorization Capabilities Of Large Language Models Yang Yunhao, Tomar Anshul
- GPT Can Solve Mathematical Problems Without A Calculator Yang Zhen, Ding Ming, Lv Qingsong, Jiang Zhihuan, He Zehai, Guo Yuyi, Bai Jinfeng, Tang Jie
- Surpassing GPT-4 Medical Coding With A Two-stage Approach Yang Zhichao, Batra Sanjit Singh, Stremmel Joel, Halperin Eran
- Alignedcot: Prompting Large Language Models Via Native-speaking Demonstrations Yang Zhicheng, Huang Yinya, Xiong Jing, Feng Liang, Liang Xiaodan, Wang Yiwei, Tang Jing
- Coupling Large Language Models With Logic Programming For Robust And General Reasoning From Text Yang Zhun, Ishay Adam, Lee Joohyung
- Re-vilm: Retrieval-augmented Visual Language Model For Zero And Few-shot Image Captioning Yang Zhuolin, Ping Wei, Liu Zihan, Korthikanti Vijay, Nie Weili, Huang De-an, Fan Linxi, Yu Zhiding, Lan Shiyi, Li Bo, Liu Ming-yu, Zhu Yuke, Shoeybi Mohammad, Catanzaro Bryan, Xiao Chaowei, Anandkumar Anima
- Quantization-aware And Tensor-compressed Training Of Transformers For Natural Language Understanding Yang Zi, Choudhary Samridhi, Kunzmann Siegfried, Zhang Zheng
- I-code V2: An Autoregressive Generation Framework Over Vision, Language, And Speech Data Yang Ziyi, Khademi Mahmoud, Xu Yichong, Pryzant Reid, Fang Yuwei, Zhu Chenguang, Chen Dongdong, Qian Yao, Gao Mei, Chen Yi-ling, Gmyr Robert, Kanda Naoyuki, Codella Noel, Xiao Bin, Shi Yu, Yuan Lu, Yoshioka Takuya, Zeng Michael, Huang Xuedong
- Tcp: Textual-based Class-aware Prompt Tuning For Visual-language Model Yao Hantao, Zhang Rui, Xu Changsheng
- Knowledge Plugins: Enhancing Large Language Models For Domain-specific Recommendations Yao Jing, Xu Wei, Lian Jianxun, Wang Xiting, Yi Xiaoyuan, Xie Xing
- Embodiedgpt: Vision-language Pre-training Via Embodied Chain Of Thought Yao Mu, Qinglong Zhang, Mengkang Hu, Wenhai Wang, Mingyu Ding, Jun Jin, Bin Wang, Jifeng Dai, Yu Qiao, Ping Luo
- Retroformer: Retrospective Large Language Agents With Policy Gradient Optimization Yao Weiran, Heinecke Shelby, Niebles Juan Carlos, Liu Zhiwei, Feng Yihao, Xue Le, Murthy Rithesh, Chen Zeyuan, Zhang Jianguo, Arpit Devansh, Xu Ran, Mui Phil, Wang Huan, Xiong Caiming, Savarese Silvio
- Deltazip: Multi-tenant Language Model Serving Via Delta Compression Yao Xiaozhe, Klimovic Ana
- Nanolm: An Affordable LLM Pre-training Benchmark Via Accurate Loss Prediction Across Scales Yao Yiqun, Fan Siqi, Huang Xiusheng, Fang Xuezhi, Li Xiang, Ni Ziyi, Jiang Xin, Meng Xuying, Han Peng, Shang Shuo, Liu Kang, Sun Aixin, Wang Yequan
- Deepspeed-visualchat: Multi-round Multi-image Interleave Chat Via Multi-modal Causal Attention Yao Zhewei, Wu Xiaoxia, Li Conglong, Zhang Minjia, Qin Heyang, Ruwase Olatunji, Awan Ammar Ahmad, Rajbhandari Samyam, He Yuxiong
- Ureader: Universal Ocr-free Visually-situated Language Understanding With Multimodal Large Language Model Ye Jiabo, Hu Anwen, Xu Haiyang, Ye Qinghao, Yan Ming, Xu Guohai, Li Chenliang, Tian Junfeng, Qian Qi, Zhang Ji, Jin Qin, He Liang, Lin Xin Alex, Huang Fei
- Generating Data For Symbolic Language With Large Language Models Ye Jiacheng, Li Chengzu, Kong Lingpeng, Yu Tao
- Diffusion Language Models Can Perform Many Tasks With Scaling And Instruction-finetuning Ye Jiasheng, Zheng Zaixiang, Bao Yu, Qian Lihua, Gu Quanquan
- Qilin-med: Multi-stage Knowledge Injection Advanced Medical Large Language Model Ye Qichen, Liu Junling, Chong Dading, Zhou Peilin, Hua Yining, Liu Fenglin, Cao Meng, Wang Ziming, Cheng Xuxin, Lei Zhu, Guo Zhenhua
- Satlm: Satisfiability-aided Language Models Using Declarative Prompting Ye Xi, Chen Qiaochu, Dillig Isil, Durrett Greg
- ASPEN: High-throughput Lora Fine-tuning Of Large Language Models With A Single GPU Ye Zhengmao, Li Dengchun, Tian Jingqi, Lan Tingfeng, Zuo Jie, Duan Lei, Lu Hui, Jiang Yexi, Sha Jian, Zhang Ke, Tang Mingjie
- Navigating Text-to-image Customization: From Lycoris Fine-tuning To Model Evaluation Yeh Shih-ying, Hsieh Yu-guan, Gao Zhidong, Yang Bernard B W, Oh Giyeong, Gong Yanmin
- QAID: Question Answering Inspired Few-shot Intent Detection Yehudai Asaf, Vetzler Matan, Mass Yosi, Lazar Koren, Cohen Doron, Carmeli Boaz
- Cones: Concept Embedding Search For Parameter Efficient Tuning Large Vision Language Models Yi Huahui, Qin Ziyuan, Xu Wei, Guo Miaotian, Wang Kun, Zhang Shaoting, Li Kang, Lao Qicheng
- Benchmarking And Defending Against Indirect Prompt Injection Attacks On Large Language Models Yi Jingwei, Xie Yueqi, Zhu Bin, Kiciman Emre, Sun Guangzhong, Xie Xing, Wu Fangzhao
- Lemur: Harmonizing Natural Language And Code For Language Agents Yiheng Xu, Hongjin Su, Chen Xing, Boyu Mi, Qian Liu, Weijia Shi, Binyuan Hui, Fan Zhou, Yitao Liu, Tianbao Xie, Zhoujun Cheng, Siheng Zhao, Lingpeng Kong, Bailin Wang, Caiming Xiong, Tao Yu
- Agent Lumos: Unified And Modular Training For Open-source Language Agents Yin Da, Brahman Faeze, Ravichander Abhilasha, Chandu Khyathi, Chang Kai-wei, Choi Yejin, Lin Bill Yuchen
- LAMM: Language-assisted Multi-modal Instruction-tuning Dataset, Framework, And Benchmark Yin Zhenfei, Wang Jiong, Cao Jianjian, Shi Zhelun, Liu Dingning, Li Mukai, Sheng Lu, Bai Lei, Huang Xiaoshui, Wang Zhiyong, Shao Jing, Ouyang Wanli
- Alignment Is Not Sufficient To Prevent Large Language Models From Generating Harmful Information: A Psychoanalytic Perspective Yin Zi, Ding Wei, Liu Jia
- Low-resource Languages Jailbreak GPT-4 Yong Zheng-xin, Menghini Cristina, Bach Stephen H.
- From Text To Motion: Grounding GPT-4 In A Humanoid Robot "alter3" Yoshida Takahide, Masumori Atsushi, Ikegami Takashi
- \(P^{3}O\): Transferring Visual Representations For Reinforcement Learning Via Prompting You Guoliang, Chu Xiaomeng, Duan Yifan, Peng Jie, Ji Jianmin, Zhang Yu, Zhang Yanyong
- Merlin: Empowering Multimodal Llms With Foresight Minds Yu En, Zhao Liang, Wei Yana, Yang Jinrong, Wu Dongming, Kong Lingyu, Wei Haoran, Wang Tiancai, Ge Zheng, Zhang Xiangyu, Tao Wenbing
- OVM, Outcome-supervised Value Models For Planning In Mathematical Reasoning Yu Fei, Gao Anningzhe, Wang Benyou
- Language Models Are Super Mario: Absorbing Abilities From Homologous Models As A Free Lunch Yu Le, Yu Bowen, Yu Haiyang, Huang Fei, Li Yongbin
- Scaling Autoregressive Multi-modal Models: Pretraining And Instruction Tuning Yu Lili, Shi Bowen, Pasunuru Ramakanth, Muller Benjamin, Golovneva Olga, Wang Tianlu, Babu Arun, Tang Binh, Karrer Brian, Sheynin Shelly, Ross Candace, Polyak Adam, Howes Russell, Sharma Vasu, Xu Puxin, Tamoyan Hovhannes, Ashual Oron, Singer Uriel, Li Shang-wen, Zhang Susan, James Richard, Ghosh Gargi, Taigman Yaniv, Fazel-zarandi Maryam, Celikyilmaz Asli, Zettlemoyer Luke, Aghajanyan Armen
- MEGABYTE: Predicting Million-byte Sequences With Multiscale Transformers Yu Lili, Simig Dániel, Flaherty Colin, Aghajanyan Armen, Zettlemoyer Luke, Lewis Mike
- Metamath: Bootstrap Your Own Mathematical Questions For Large Language Models Yu Longhui, Jiang Weisen, Shi Han, Yu Jincheng, Liu Zhengying, Zhang Yu, Kwok James T., Li Zhenguo, Weller Adrian, Liu Weiyang
- Information Association For Language Model Updating By Mitigating Lm-logical Discrepancy Yu Pengfei, Ji Heng
- Characterizing Mechanisms For Factual Recall In Language Models Yu Qinan, Merullo Jack, Pavlick Ellie
- Constructive Large Language Models Alignment With Diverse Feedback Yu Tianshu, Lin Ting-en, Wu Yuchuan, Yang Min, Huang Fei, Li Yongbin
- Chain-of-note: Enhancing Robustness In Retrieval-augmented Language Models Yu Wenhao, Zhang Hongming, Pan Xiaoman, Ma Kaixin, Wang Hongwei, Yu Dong
- Improving Language Models Via Plug-and-play Retrieval Feedback Yu Wenhao, Zhang Zhihan, Liang Zhenwen, Jiang Meng, Sabharwal Ashish
- Prompt-based Monte-carlo Tree Search For Goal-oriented Dialogue Policy Planning Yu Xiao, Chen Maximillian, Yu Zhou
- Teaching Language Models To Self-improve Through Interactive Demonstrations Yu Xiao, Peng Baolin, Galley Michel, Gao Jianfeng, Yu Zhou
- Training With "paraphrasing The Original Text" Improves Long-context Performance Yu Yijiong, Huang Yongfeng, Qi Zhixiao, Zhou Zhe
- Large Language Model As Attributed Training Data Generator: A Tale Of Diversity And Bias Yu Yue, Zhuang Yuchen, Zhang Jieyu, Meng Yu, Ratner Alexander, Krishna Ranjay, Shen Jiaming, Zhang Chao
- Fine-tuning Language Models With Generative Adversarial Reward Modelling Yu Zhang Ze, Jaw Lau Jia, Hui Zhang, Low Bryan Kian Hsiang
- How Vocabulary Sharing Facilitates Multilingualism In Llama? Yuan Fei, Yuan Shuai, Wu Zhiyong, Li Lei
- CRAFT: Customizing Llms By Creating And Retrieving From Specialized Toolsets Yuan Lifan, Chen Yangyi, Wang Xingyao, Fung Yi R., Peng Hao, Ji Heng
- ANALOGYKB: Unlocking Analogical Reasoning Of Language Models With A Million-scale Knowledge Base Yuan Siyu, Chen Jiangjie, Sun Changzhi, Liang Jiaqing, Xiao Yanghua, Yang Deqing
- Where To Go Next For Recommender Systems? ID- Vs. Modality-based Recommender Models Revisited Yuan Zheng, Yuan Fajie, Song Yu, Li Youhua, Fu Junchen, Yang Fei, Pan Yunzhu, Ni Yongxin
- Scaling Relationship On Learning Mathematical Reasoning With Large Language Models Yuan Zheng, Yuan Hongyi, Li Chengpeng, Dong Guanting, Lu Keming, Tan Chuanqi, Zhou Chang, Zhou Jingren
- Multilingual Augmentation For Robust Visual Question Answering In Remote Sensing Images Yuan Zhenghang, Mou Lichao, Zhu Xiao Xiang
- Artgpt-4: Towards Artistic-understanding Large Vision-language Models With Enhanced Adapter Yuan Zhengqing, He Yunhong, Wang Kun, Ye Yanfang, Sun Lichao
- Evotext: Enhancing Natural Language Generation Models Via Self-escalation Learning For Up-to-date Knowledge And Improved Performance Yuan Zhengqing, Xue Huiwen, Zhang Chao, Liu Yongming
- Evaluating Instruction-tuned Large Language Models On Code Comprehension And Generation Yuan Zhiqiang, Liu Junwei, Zi Qiancheng, Liu Mingwei, Peng Xin, Lou Yiling
- Plan, Eliminate, And Track -- Language Models Are Good Teachers For Embodied Agents Yue Wu, So Yeon Min, Yonatan Bisk, Ruslan Salakhutdinov, Amos Azaria, Yuanzhi Li, Tom Mitchell, Shrimai Prabhumoye
- Prompt Highlighter: Interactive Control For Multi-modal Llms Yuechen Zhang, Shengju Qian, Bohao Peng, Shu Liu, Jiaya Jia
- Aligning Large Language Models With Human: A Survey Yufei Wang, Wanjun Zhong, Liangyou Li, Fei Mi, Xingshan Zeng, Wenyong Huang, Lifeng Shang, Xin Jiang, Qun Liu
- Pre-training To Learn In Context Yuxian Gu, Li Dong, Furu Wei, Minlie Huang
- Planting A SEED Of Vision In Large Language Model Yuying Ge, Yixiao Ge, Ziyun Zeng, Xintao Wang, Ying Shan
- Fairness-aware Structured Pruning In Transformers Zayed Abdelrahman, Mordido Goncalo, Shabanian Samira, Baldini Ioana, Chandar Sarath
- TIM: Teaching Large Language Models To Translate With Comparison Zeng Jiali, Meng Fandong, Yin Yongjing, Zhou Jie
- Exploring Memorization In Fine-tuned Language Models Zeng Shenglai, Li Yaxin, Ren Jie, Liu Yiding, Xu Han, He Pengfei, Xing Yue, Wang Shuaiqiang, Tang Jiliang, Yin Dawei
- What Matters In Training A Gpt4-style Language Model With Multimodal Inputs? Zeng Yan, Zhang Hanbo, Zheng Jiani, Xia Jiangnan, Wei Guoqiang, Wei Yang, Zhang Yuchen, Kong Tao
- MR-GSM8K: A Meta-reasoning Benchmark For Large Language Model Evaluation Zeng Zhongshen, Chen Pengguang, Liu Shu, Jiang Haiyun, Jia Jiaya
- Building Open-ended Embodied Agent Via Language-policy Bidirectional Adaptation Zhai Shaopeng, Wang Jie, Zhang Tianyi, Huang Fuxian, Zhang Qi, Zhou Ming, Hou Jing, Qiao Yu, Liu Yu
- Investigating The Catastrophic Forgetting In Multimodal Large Language Models Zhai Yuexiang, Tong Shengbang, Li Xiao, Cai Mu, Qu Qing, Lee Yong Jae, Ma Yi
- Removing RLHF Protections In GPT-4 Via Fine-tuning Zhan Qiusi, Fang Richard, Bindu Rohan, Gupta Akul, Hashimoto Tatsunori, Kang Daniel
- Vpgtrans: Transfer Visual Prompt Generator Across Llms Zhang Ao, Fei Hao, Yao Yuan, Ji Wei, Li Li, Liu Zhiyuan, Chua Tat-seng
- Mm-narrator: Narrating Long-form Videos With Multimodal In-context Learning Zhang Chaoyi, Lin Kevin, Yang Zhengyuan, Wang Jianfeng, Li Linjie, Lin Chung-ching, Liu Zicheng, Wang Lijuan
- User-controlled Knowledge Fusion In Large Language Models: Balancing Creativity And Hallucination Zhang Chen
- Xdial-eval: A Multilingual Open-domain Dialogue Evaluation Benchmark Zhang Chen, D'haro Luis Fernando, Tang Chengguang, Shi Ke, Tang Guohua, Li Haizhou
- Automatic Generation Of Multiple-choice Questions Zhang Cheng
- Revisiting Block-based Quantisation: What Is Important For Sub-8-bit LLM Inference? Zhang Cheng, Cheng Jianyi, Shumailov Ilia, Constantinides George A., Zhao Yiren
- Benchmarking And In-depth Performance Study Of Large Language Models On Habana Gaudi Processors Zhang Chengming, Sun Baixi, Yu Xiaodong, Xie Zhen, Zheng Weijian, Iskra Kamil, Beckman Pete, Tao Dingwen
- The Skipped Beat: A Study Of Sociopragmatic Understanding In Llms For 64 Languages Zhang Chiyu, Doan Khai Duy, Liao Qisheng, Abdul-mageed Muhammad
- Video-llama: An Instruction-tuned Audio-visual Language Model For Video Understanding Zhang Hang, Li Xin, Bing Lidong
- On The Safety Of Open-sourced Large Language Models: Does Alignment Really Prevent Them From Being Misused? Zhang Hangfan, Guo Zhimeng, Zhu Huaisheng, Cao Bochuan, Lin Lu, Jia Jinyuan, Chen Jinghui, Wu Dinghao
- Expressive TTS Driven By Natural Language Prompts Using Few Human Annotations Zhang Hanglei, Guo Yiwei, Liu Sen, Chen Xie, Yu Kai
- Tuning Large Language Model For End-to-end Speech Translation Zhang Hao, Si Nianwen, Chen Yaqi, Zhang Wenlin, Yang Xukui, Qu Dan, Jiao Xiaolin
- XATU: A Fine-grained Instruction-based Benchmark For Explainable Text Updates Zhang Haopeng, Iso Hayate, Gurajada Sairam, Bhutani Nikita
- Glm-dialog: Noise-tolerant Pre-training For Knowledge-grounded Dialogue Generation Zhang Jing, Zhang Xiaokang, Zhang-li Daniel, Yu Jifan, Yao Zijun, Ma Zeyao, Xu Yiqi, Wang Haohua, Zhang Xiaohan, Lin Nianyi, Lu Sunrui, Li Juanzi, Tang Jie
- Composing Parameter-efficient Modules With Arithmetic Operations Zhang Jinghan, Chen Shiqi, Liu Junteng, He Junxian
- Draft & Verify: Lossless Large Language Model Acceleration Via Self-speculative Decoding Zhang Jun, Wang Jue, Li Huan, Shou Lidan, Chen Ke, Chen Gang, Mehrotra Sharad
- Aligning Instruction Tasks Unlocks Large Language Models As Zero-shot Relation Extractors Zhang Kai, Gutiérrez Bernal Jiménez, Su Yu
- Llm-based Medical Assistant Personalization With Short- And Long-term Memory Coordination Zhang Kai, Kang Yangyang, Zhao Fubang, Liu Xiaozhong
- Crash: Clustering, Removing, And Sharing Enhance Fine-tuning Without Full Large Language Model Zhang Kaiyan, Ding Ning, Qi Biqing, Zhu Xuekai, Long Xinwei, Zhou Bowen
- Weighted Sampling For Masked Language Modeling Zhang Linhan, Chen Qian, Wang Wen, Deng Chong, Cao Xin, Hao Kongzhang, Jiang Yuxin, Wang Wei
- A Two-stage Adaptation Of Large Language Models For Text Ranking Zhang Longhui, Zhang Yanzhao, Long Dingkun, Xie Pengjun, Zhang Meishan, Zhang Min
- Lora-fa: Memory-efficient Low-rank Adaptation For Large Language Models Fine-tuning Zhang Longteng, Zhang Lin, Shi Shaohuai, Chu Xiaowen, Li Bo
- Retrieve Anything To Augment Large Language Models Zhang Peitian, Xiao Shitao, Liu Zheng, Dou Zhicheng, Nie Jian-yun
- Ask An Expert: Leveraging Language Models To Improve Strategic Reasoning In Goal-oriented Dialogue Models Zhang Qiang, Naradowsky Jason, Miyao Yusuke
- Llama-adapter: Efficient Fine-tuning Of Language Models With Zero-init Attention Zhang Renrui, Han Jiaming, Liu Chris, Gao Peng, Zhou Aojun, Hu Xiangfei, Yan Shilin, Lu Pan, Li Hongsheng, Qiao Yu
- A Self-enhancement Approach For Domain-specific Chatbot Training Via Knowledge Mining And Digest Zhang Ruohong, Gao Luyu, Zheng Chen, Fan Zhen, Lai Guokun, Zhang Zheng, Ai Fangzhou, Yang Yiming, Yang Hongxia
- Generation-driven Contrastive Self-training For Zero-shot Text Classification With Instruction-following LLM Zhang Ruohong, Wang Yau-shian, Yang Yiming
- Instruction Tuning For Large Language Models: A Survey Zhang Shengyu, Dong Linfeng, Li Xiaoya, Zhang Sen, Sun Xiaofei, Wang Shuhe, Li Jiwei, Hu Runyi, Zhang Tianwei, Wu Fei, Wang Guoyin
- Automl-gpt: Automatic Machine Learning With GPT Zhang Shujian, Gong Chengyue, Wu Lemeng, Liu Xingchao, Zhou Mingyuan
- Modeling Sequential Sentence Relation To Improve Cross-lingual Dense Retrieval Zhang Shunyu, Liang Yaobo, Gong Ming, Jiang Daxin, Duan Nan
- Learning To Predict Concept Ordering For Common Sense Generation Zhang Tianhui, Bollegala Danushka, Peng Bei
- The Wisdom Of Hindsight Makes Language Models Better Instruction Followers Zhang Tianjun, Liu Fangchen, Wong Justin, Abbeel Pieter, Gonzalez Joseph E.
- Adarefiner: Refining Decisions Of Language Models With Adaptive Feedback Zhang Wanpeng, Lu Zongqing
- Bridging The Information Gap Between Domain-specific Model And General LLM For Personalized Recommendation Zhang Wenxuan, Liu Hongzhi, Du Yingpeng, Zhu Chen, Song Yang, Zhu Hengshu, Wu Zhonghai
- Don't Trust Chatgpt When Your Question Is Not In English: A Study Of Multilingual Abilities And Types Of Llms Zhang Xiang, Li Senyu, Hauer Bradley, Shi Ning, Kondrak Grzegorz
- EHR Interaction Between Patients And AI: Noteaid EHR Interaction Zhang Xiaocheng, Yao Zonghai, Yu Hong
- Large Language Models In Medical Term Classification And Unexpected Misalignment Between Response And Reasoning Zhang Xiaodan, Vemulapalli Sandeep, Talukdar Nabasmita, Ahn Sumyeong, Wang Jiankun, Meng Han, Murtaza Sardar Mehtab Bin, Dave Aakash Ajay, Leshchiner Dmitry, Joseph Dimitri F., Witteveen-lane Martin, Chesla Dave, Zhou Jiayu, Chen Bin
- SGP-TOD: Building Task Bots Effortlessly Via Schema-guided LLM Prompting Zhang Xiaoying, Peng Baolin, Li Kun, Zhou Jingyan, Meng Helen
- Language Models Are Universal Embedders Zhang Xin, Li Zehan, Zhang Yanzhao, Long Dingkun, Xie Pengjun, Zhang Meishan, Zhang Min
- Dialoguellm: Context And Emotion Knowledge-tuned Large Language Models For Emotion Recognition In Conversations Zhang Yazhou, Wang Mengyao, Wu Youxi, Tiwari Prayag, Li Qiuchi, Wang Benyou, Qin Jing
- Making Large Language Models Perform Better In Knowledge Graph Completion Zhang Yichi, Chen Zhuo, Guo Lingbing, Xu Yajing, Zhang Wen, Chen Huajun
- Pre-trained Language Models Do Not Help Auto-regressive Text-to-image Generation Zhang Yuhui, Mckinzie Brandon, Gan Zhe, Shankar Vaishaal, Toshev Alexander
- Cached Transformers: Improving Transformers With Differentiable Memory Cache Zhang Zhaoyang, Shao Wenqi, Ge Yixiao, Wang Xiaogang, Gu Jinwei, Luo Ping
- Balancing Specialized And General Skills In Llms: The Impact Of Modern Tuning And Data Strategy Zhang Zheng, Zheng Chen, Tang Da, Sun Ke, Ma Yukun, Bu Yingtong, Zhou Xun, Zhao Liang
- Defending Large Language Models Against Jailbreaking Attacks Through Goal Prioritization Zhang Zhexin, Yang Junxiao, Ke Pei, Mi Fei, Wang Hongning, Huang Minlie
- PLUG: Leveraging Pivot Language In Cross-lingual Instruction Tuning Zhang Zhihan, Lee Dong-ho, Fang Yuwei, Yu Wenhao, Jia Mengzhao, Jiang Meng, Barbieri Francesco
- Auto-instruct: Automatic Instruction Generation And Ranking For Black-box Language Models Zhang Zhihan, Wang Shuohang, Yu Wenhao, Xu Yichong, Iter Dan, Zeng Qingkai, Liu Yang, Zhu Chenguang, Jiang Meng
- Accountable Textual-visual Chat Learns To Reject Human Instructions In Image Re-creation Zhang Zhiwei, Liu Yuliang
- How Do Large Language Models Capture The Ever-changing World Knowledge? A Review Of Recent Advances Zhang Zihan, Fang Meng, Chen Ling, Namazi-rad Mohammad-reza, Wang Jun
- Ehrtutor: Enhancing Patient Understanding Of Discharge Instructions Zhang Zihao, Yao Zonghai, Zhou Huixue, Ouyang Feiyun, Yu Hong
- MELA: Multilingual Evaluation Of Linguistic Acceptability Zhang Ziyin, Liu Yikang, Huang Weifang, Mao Junyu, Wang Rui, Hu Hai
- Large Language Models As Zero-shot Conversational Recommenders Zhankui He, Zhouhang Xie, Rahul Jha, Harald Steck, Dawen Liang, Yesu Feng, Bodhisattwa Prasad Majumder, Nathan Kallus, Julian Mcauley
- Explainability For Large Language Models: A Survey Zhao Haiyan, Chen Hanjie, Yang Fan, Liu Ninghao, Deng Huiqi, Cai Hengyi, Wang Shuaiqiang, Yin Dawei, Du Mengnan
- Explicit Planning Helps Language Models In Logical Reasoning Zhao Hongyu, Wang Kangrui, Yu Mo, Mei Hongyuan
- In-context Exemplars As Clues To Retrieving From Large Associative Memory Zhao Jiachen
- Unimc: A Unified Framework For Long-term Memory Conversation Via Relevance Representation Learning Zhao Kang, Liu Wei, Luan Jian, Gao Minglei, Qian Li, Teng Hanlin, Wang Bin
- Hallucination Detection For Grounded Instruction Generation Zhao Lingjun, Nguyen Khanh, Daumé Hal III
- Group Preference Optimization: Few-shot Alignment Of Large Language Models Zhao Siyan, Dang John, Grover Aditya
- A Survey Of Large Language Models Zhao Wayne Xin, Zhou Kun, Li Junyi, Tang Tianyi, Wang Xiaolei, Hou Yupeng, Min Yingqian, Zhang Beichen, Zhang Junjie, Dong Zican, Du Yifan, Yang Chen, Chen Yushuo, Chen Zhipeng, Jiang Jinhao, Ren Ruiyang, Li Yifan, Tang Xinyu, Liu Zikang, Liu Peiyu, Nie Jian-yun, Wen Ji-rong
- Easygen: Easing Multimodal Generation With Bidiffuser And Llms Zhao Xiangyu, Liu Bo, Liu Qijiong, Shi Guangyuan, Wu Xiao-ming
- Babystories: Can Reinforcement Learning Teach Baby Language Models To Write Better Stories? Zhao Xingmeng, Wang Tongnian, Osborn Sheri, Rios Anthony
- Slic-hf: Sequence Likelihood Calibration With Human Feedback Zhao Yao, Joshi Rishabh, Liu Tianqi, Khalman Misha, Saleh Mohammad, Liu Peter J.
- Robut: A Systematic Study Of Table QA Robustness Against Human-annotated Adversarial Perturbations Zhao Yilun, Zhao Chen, Nan Linyong, Qi Zhenting, Zhang Wenlin, Tang Xiangru, Mi Boyu, Radev Dragomir
- Multi-modal In-context Learning Makes An Ego-evolving Scene Text Recognizer Zhao Zhen, Tang Jingqun, Lin Chunhui, Wu Binghong, Huang Can, Liu Hao, Tan Xin, Zhang Zhizhong, Xie Yuan
- Mllm-dataengine: An Iterative Refinement Approach For MLLM Zhao Zhiyuan, Ouyang Linke, Wang Bin, Huang Siyuan, Zhang Pan, Dong Xiaoyi, Wang Jiaqi, He Conghui
- Chatbridge: Bridging Modalities With Large Language Model As A Language Catalyst Zhao Zijia, Guo Longteng, Yue Tongtian, Chen Sihan, Shao Shuai, Zhu Xinxin, Yuan Zehuan, Liu Jing
- Tabula: Harnessing Language Models For Tabular Data Synthesis Zhao Zilong, Birke Robert, Chen Lydia
- Adapting Large Language Models By Integrating Collaborative Semantics For Recommendation Zheng Bowen, Hou Yupeng, Lu Hongyu, Chen Yu, Zhao Wayne Xin, Chen Ming, Wen Ji-rong
- Can We Edit Factual Knowledge By In-context Learning? Zheng Ce, Li Lei, Dong Qingxiu, Fan Yuxuan, Wu Zhiyong, Xu Jingjing, Chang Baobao
- Learn From Model Beyond Fine-tuning: A Survey Zheng Hongling, Shen Li, Tang Anke, Luo Yong, Hu Han, Du Bo, Tao Dacheng
- Lmsys-chat-1m: A Large-scale Real-world LLM Conversation Dataset Zheng Lianmin, Chiang Wei-lin, Sheng Ying, Li Tianle, Zhuang Siyuan, Wu Zhanghao, Zhuang Yonghao, Li Zhuohan, Lin Zi, Xing Eric P., Gonzalez Joseph E., Stoica Ion, Zhang Hao
- Trafficsafetygpt: Tuning A Pre-trained Large Language Model To A Domain-specific Expert In Transportation Safety Zheng Ou, Abdel-aty Mohamed, Wang Dongdong, Wang Chenzhu, Ding Shengxuan
- Improving Generalization Of Alignment With Human Preferences Through Group Invariant Learning Zheng Rui, Shen Wei, Hua Yuan, Lai Wenbin, Dou Shihan, Zhou Yuhao, Xi Zhiheng, Wang Xiao, Huang Haoran, Gui Tao, Zhang Qi, Huang Xuanjing
- Large Language Models Are Good Prompt Learners For Low-shot Image Classification Zheng Zhaoheng, Wei Jingmin, Hu Xuefeng, Zhu Haidong, Nevatia Ram
- Building Emotional Support Chatbots In The Era Of Llms Zheng Zhonghua, Liao Lizi, Deng Yang, Nie Liqiang
- Marinegpt: Unlocking Secrets Of Ocean To The Public Zheng Ziqiang, Zhang Jipeng, Vu Tuan-anh, Diao Shizhe, Tim Yue Him Wong, Yeung Sai-kit
- Llamarec: Two-stage Recommendation Using Large Language Models For Ranking Zhenrui Yue, Sara Rabhi, Gabriel De Souza Pereira Moreira, Dong Wang, Even Oldridge
- The Rise And Potential Of Large Language Model Based Agents: A Survey Zhiheng Xi, Wenxiang Chen, Xin Guo, Wei He, Yiwen Ding, Boyang Hong, Ming Zhang, Junzhe Wang, Senjie Jin, Enyu Zhou, Rui Zheng, Xiaoran Fan, Xiao Wang, Limao Xiong, Yuhao Zhou, Weiran Wang, Changhao Jiang, Yicheng Zou, Xiangyang Liu, Zhangyue Yin, Shihan Dou, Rongxiang Weng, Wensen Cheng, Qi Zhang, Wenjuan Qin, Yongyan Zheng, Xipeng Qiu, Xuanjing Huang, Tao Gui
- Aligning Large Multimodal Models With Factually Augmented RLHF Zhiqing Sun, Sheng Shen, Shengcao Cao, Haotian Liu, Chunyuan Li, Yikang Shen, Chuang Gan, Liang-yan Gui, Yu-xiong Wang, Yiming Yang, Kurt Keutzer, Trevor Darrell
- Principle-driven Self-alignment Of Language Models From Scratch With Minimal Human Supervision Zhiqing Sun, Yikang Shen, Qinhong Zhou, Hongxin Zhang, Zhenfang Chen, David Cox, Yiming Yang, Chuang Gan
- Leveraging Large Language Models For Pre-trained Recommender Systems Zhixuan Chu, Hongyan Hao, Xin Ouyang, Simeng Wang, Yan Wang, Yue Shen, Jinjie Gu, Qing Cui, Longfei Li, Siqiao Xue, James Y Zhang, Sheng Li
- Mitigating Hallucination In Visual Language Models With Visual Supervision Zhiyang Chen, Yousong Zhu, Yufei Zhan, Zhaowen Li, Chaoyang Zhao, Jinqiao Wang, Ming Tang
- Seeking Neural Nuggets: Knowledge Transfer In Large Language Models From A Parametric Perspective Zhong Ming, An Chenxin, Chen Weizhu, Han Jiawei, He Pengcheng
- Revisiting Token Dropping Strategy In Efficient BERT Pretraining Zhong Qihuang, Ding Liang, Liu Juhua, Liu Xuebo, Zhang Min, Du Bo, Tao Dacheng
- Bag Of Tricks For Effective Language Model Pretraining And Downstream Adaptation: A Case Study On GLUE Zhong Qihuang, Ding Liang, Peng Keqin, Liu Juhua, Du Bo, Shen Li, Zhan Yibing, Tao Dacheng
- Sur-adapter: Enhancing Text-to-image Pre-trained Diffusion Models With Large Language Models Zhong Shanshan, Huang Zhongzhan, Wen Wushao, Qin Jinghui, Lin Liang
- Mquake: Assessing Knowledge Editing In Language Models Via Multi-hop Questions Zhong Zexuan, Wu Zhengxuan, Manning Christopher D., Potts Christopher, Chen Danqi
- Language Agent Tree Search Unifies Reasoning Acting And Planning In Language Models Zhou Andy, Yan Kai, Shlapentokh-rothman Michal, Wang Haohan, Wang Yu-xiong
- Navgpt: Explicit Reasoning In Vision-and-language Navigation With Large Language Models Zhou Gengze, Hong Yicong, Wu Qi
- Lobass: Gauging Learnability In Supervised Fine-tuning Data Zhou Haotian, Liu Tingkai, Ma Qianli, Yuan Jianbo, Liu Pengfei, You Yang, Yang Hongxia
- How Well Do Large Language Models Understand Syntax? An Evaluation By Asking Natural Language Questions Zhou Houquan, Hou Yang, Li Zhenghua, Wang Xuebin, Wang Zhefeng, Duan Xinyu, Zhang Min
- Characterglm: Customizing Chinese Conversational AI Characters With Large Language Models Zhou Jinfeng, Chen Zhuang, Wan Dazhen, Wen Bosi, Song Yi, Yu Jifan, Huang Yongkang, Peng Libiao, Yang Jiaming, Xiao Xiyao, Sabour Sahand, Zhang Xiaohan, Hou Wenjing, Zhang Yijia, Dong Yuxiao, Tang Jie, Huang Minlie
- Vicor: Bridging Visual Understanding And Commonsense Reasoning With Large Language Models Zhou Kaiwen, Lee Kwonjoon, Misu Teruhisa, Wang Xin Eric
- Don't Make Your LLM An Evaluation Benchmark Cheater Zhou Kun, Zhu Yutao, Chen Zhipeng, Chen Wentong, Zhao Wayne Xin, Chen Xu, Lin Yankai, Wen Ji-rong, Han Jiawei
- Infmllm: A Unified Framework For Visual-language Tasks Zhou Qiang, Wang Zhibin, Chu Wei, Xu Yinghui, Li Hao, Qi Yuan
- Multi-stage Pre-training Enhanced By Chatgpt For Multi-scenario Multi-domain Dialogue Summarization Zhou Weixiao, Li Gengyao, Cheng Xianfu, Liang Xinnian, Zhu Junnan, Zhai Feifei, Li Zhoujun
- Context-faithful Prompting For Large Language Models Zhou Wenxuan, Zhang Sheng, Poon Hoifung, Chen Muhao
- Making Harmful Behaviors Unlearnable For Large Language Models Zhou Xin, Lu Yi, Ma Ruotian, Gui Tao, Zhang Qi, Huang Xuanjing
- Revisiting Automated Prompting: Are We Actually Doing Better? Zhou Yulin, Zhao Yiren, Shumailov Ilia, Mullins Robert, Gal Yarin
- Quantifying And Analyzing Entity-level Memorization In Large Language Models Zhou Zhenhong, Xiang Jiuyang, Chen Chaomeng, Su Sen
- Large Language Model As A Policy Teacher For Training Reinforcement Learning Agents Zhou Zihao, Hu Bin, Zhao Chenyang, Zhang Pu, Liu Bin
- On The Calibration Of Large Language Models And Alignment Zhu Chiwei, Xu Benfeng, Wang Quan, Zhang Yongdong, Mao Zhendong
- Pose: Efficient Context Window Extension Of Llms Via Positional Skip-wise Training Zhu Dawei, Yang Nan, Wang Liang, Song Yifan, Wu Wenhao, Wei Furu, Li Sujian
- Dialogue-contextualized Re-ranking For Medical History-taking Zhu Jian, Valmianski Ilya, Kannan Anitha
- VL-GPT: A Generative Pre-trained Transformer For Vision And Language Understanding And Generation Zhu Jinguo, Ding Xiaohan, Ge Yixiao, Ge Yuying, Zhao Sijie, Zhao Hengshuang, Wang Xiaohua, Shan Ying
- Judgelm: Fine-tuned Large Language Models Are Scalable Judges Zhu Lianghui, Wang Xinggang, Wang Xinlong
- Are NLP Models Good At Tracing Thoughts: An Overview Of Narrative Understanding Zhu Lixing, Zhao Runcong, Gui Lin, He Yulan
- Coca: Fusing Position Embedding With Collinear Constrained Attention In Transformers For Long Context Window Extending Zhu Shiyi, Ye Jing, Jiang Wei, Xue Siqiao, Zhang Qi, Wu Yifan, Li Jianguo
- Autodan: Interpretable Gradient-based Adversarial Attacks On Large Language Models Zhu Sicheng, Zhang Ruiyi, An Bang, Wu Gang, Barrow Joe, Wang Zichao, Huang Furong, Nenkova Ani, Sun Tong
- Promptcblue: A Chinese Prompt Tuning Benchmark For The Medical Domain Zhu Wei, Wang Xiaoling, Zheng Huanran, Chen Mosha, Tang Buzhou
- Extrapolating Large Language Models To Non-english By Aligning Languages Zhu Wenhao, Lv Yunzhe, Dong Qingxiu, Yuan Fei, Xu Jingjing, Huang Shujian, Kong Lingpeng, Chen Jiajun, Li Lei
- Pad: Program-aided Distillation Can Teach Small Models Reasoning Better Than Chain-of-thought Fine-tuning Zhu Xuekai, Qi Biqing, Zhang Kaiyan, Long Xinwei, Lin Zhouhan, Zhou Bowen
- Towards An On-device Agent For Text Rewriting Zhu Yun, Liu Yinxiao, Stahlberg Felix, Kumar Shankar, Chen Yu-hui, Luo Liangchen, Shu Lei, Liu Renjie, Chen Jindong, Meng Lei
- Large Language Models Can Learn Rules Zhu Zhaocheng, Xue Yuan, Chen Xinyun, Zhou Denny, Tang Jian, Schuurmans Dale, Dai Hanjun
- 3d-vista: Pre-trained Transformer For 3D Vision And Text Alignment Zhu Ziyu, Ma Xiaojian, Chen Yixin, Deng Zhidong, Huang Siyuan, Li Qing
- Open-source Large Language Models Are Strong Zero-shot Query Likelihood Models For Document Ranking Zhuang Shengyao, Liu Bing, Koopman Bevan, Zuccon Guido
- Toolqa: A Dataset For LLM Question Answering With External Tools Zhuang Yuchen, Yu Yue, Wang Kuan, Sun Haotian, Zhang Chao
- Can Large Language Models Transform Computational Social Science? Ziems Caleb, Held William, Shaikh Omar, Chen Jiaao, Zhang Zhehao, Yang Diyi
- Aligner: One Global Token Is Worth Millions Of Parameters When Aligning Large Language Models Ziheng Zhou, Wu Yingnian, Zhu Song-chun, Terzopoulos Demetri
- Recommender Systems In The Era Of Large Language Models (llms) Zihuai Zhao, Wenqi Fan, Jiatong Li, Yunqing Liu, Xiaowei Mei, Yiqi Wang, Zhen Wen, Fei Wang, Xiangyu Zhao, Jiliang Tang, Qing Li
- A Family Of Pretrained Transformer Language Models For Russian Zmitrovich Dmitry, Abramov Alexander, Kalmykov Andrey, Tikhonova Maria, Taktasheva Ekaterina, Astafurov Danil, Baushenko Mark, Snegirev Artem, Kadulin Vitalii, Markov Sergey, Shavrina Tatiana, Mikhailov Vladislav, Fenogenova Alena
- Dr Chatgpt, Tell Me What I Want To Hear: How Prompt Knowledge Impacts Health Answer Correctness Zuccon Guido, Koopman Bevan
- Incorporating Probing Signals Into Multimodal Machine Translation Via Visual Question-answering Pairs Zuo Yuxin, Li Bei, Lv Chuanhao, Zheng Tong, Xiao Tong, Zhu Jingbo
- Concept-aware Training Improves In-context Learning Ability Of Language Models Štefánik Michal, Kadlčík Marek
- A Dataset And Benchmark For Hospital Course Summarization With Adapted Large Language Models Aali Asad, Van Veen Dave, Arefeen Yamin Ishraq, Hom Jason, Bluethgen Christian, Reis Eduardo Pontes, Gatidis Sergios, Clifford Namuun, Daws Joseph, Tehrani Arash S., Kim Jangwon, Chaudhari Akshay S.
- Can Llms Be Fooled? Investigating Vulnerabilities In Llms Abdali Sara, He Jia, Barberan Cj, Anarfi Richard
- Granite-function Calling Model: Introducing Function Calling Abilities Via Multi-task Learning Of Granular Tasks Abdelaziz Ibrahim, Basu Kinjal, Agarwal Mayank, Kumaravel Sadhana, Stallone Matthew, Panda Rameswar, Rizk Yara, Bhargav Gp, Crouse Maxwell, Gunasekara Chulaka, Ikbal Shajith, Joshi Sachin, Karanam Hima, Kumar Vineet, Munawar Asim, Neelam Sumit, Raghu Dinesh, Sharma Udit, Soria Adriana Meza, Sreedhar Dheeraj, Venkateswaran Praveen, Unuvar Merve, Cox David, Roukos Salim, Lastras Luis, Kapanipathi Pavan
- Are You Still On Track!? Catching LLM Task Drift With Activations Abdelnabi Sahar, Fay Aideen, Cherubin Giovanni, Salem Ahmed, Fritz Mario, Paverd Andrew
- Phi-3 Technical Report: A Highly Capable Language Model Locally On Your Phone Abdin Marah, Aneja Jyoti, Awadalla Hany, Awadallah Ahmed, Awan Ammar Ahmad, Bach Nguyen, Bahree Amit, Bakhtiari Arash, Bao Jianmin, Behl Harkirat, Benhaim Alon, Bilenko Misha, Bjorck Johan, Bubeck Sébastien, Cai Martin, Cai Qin, Chaudhary Vishrav, Chen Dong, Chen Dongdong, Chen Weizhu, Chen Yen-chun, Chen Yi-ling, Cheng Hao, Chopra Parul, Dai Xiyang, Dixon Matthew, Eldan Ronen, Fragoso Victor, Gao Jianfeng, Gao Mei, Gao Min, Garg Amit, Del Giorno Allie, Goswami Abhishek, Gunasekar Suriya, Haider Emman, Hao Junheng, Hewett Russell J., Hu Wenxiang, Huynh Jamie, Iter Dan, Jacobs Sam Ade, Javaheripi Mojan, Jin Xin, Karampatziakis Nikos, Kauffmann Piero, Khademi Mahoud, Kim Dongwoo, Kim Young Jin, Kurilenko Lev, Lee James R., Lee Yin Tat, Li Yuanzhi, Li Yunsheng, Liang Chen, Liden Lars, Lin Xihui, Lin Zeqi, Liu Ce, Liu Liyuan, Liu Mengchen, Liu Weishung, Liu Xiaodong, Luo Chong, Madan Piyush, Mahmoudzadeh Ali, Majercak David, Mazzola Matt, Mendes Caio César Teodoro, Mitra Arindam, Modi Hardik, Nguyen Anh, Norick Brandon, Patra Barun, Perez-becker Daniel, Portet Thomas, Pryzant Reid, Qin Heyang, Radmilac Marko, Ren Liliang, De Rosa Gustavo, Rosset Corby, Roy Sambudha, Ruwase Olatunji, Saarikivi Olli, Saied Amin, Salim Adil, Santacroce Michael, Shah Shital, Shang Ning, Sharma Hiteshi, Shen Yelong, Shukla Swadheen, Song Xia, Tanaka Masahiro, Tupini Andrea, Vaddamanu Praneetha, Wang Chunyu, Wang Guanhua, Wang Lijuan, Wang Shuohang, Wang Xin, Wang Yu, Ward Rachel, Wen Wen, Witte Philipp, Wu Haiping, Wu Xiaoxia, Wyatt Michael, Xiao Bin, Xu Can, Xu Jiahang, Xu Weijian, Xue Jilong, Yadav Sonali, Yang Fan, Yang Jianwei, Yang Yifan, Yang Ziyi, Yu Donghan, Yuan Lu, Zhang Chenruidong, Zhang Cyril, Zhang Jianwen, Zhang Li Lyna, Zhang Yi, Zhang Yue, Zhang Yunan, Zhou Xiren
- Bridging The Bosphorus: Advancing Turkish Large Language Models Through Strategies For Low-resource Language Adaptation And Benchmarking Acikgoz Emre Can, Erdogan Mete, Yuret Deniz
- Fine-grained Contract NER Using Instruction Based Model Adibhatla Hiranmai Sri, Baswani Pavan, Shrivastava Manish
- Step-by-step Unmasking For Parameter-efficient Fine-tuning Of Large Language Models Agarwal Aradhye, Ramesh Suhas K, Sengupta Ayan, Chakraborty Tanmoy
- Promptwizard: Task-aware Agent-driven Prompt Optimization Framework Agarwal Eshaan, Dani Vivek, Ganu Tanuja, Nambi Akshay
- Many-shot In-context Learning Agarwal Rishabh, Singh Avi, Zhang Lei M., Bohnet Bernd, Rosias Luis, Chan Stephanie, Zhang Biao, Anand Ankesh, Abbas Zaheer, Nova Azade, Co-reyes John D., Chu Eric, Behbahani Feryal, Faust Aleksandra, Larochelle Hugo
- Tricy: Trigger-guided Data-to-text Generation With Intent Aware Attention-copy Agarwal Vibhav, Ghosh Sourav, Bss Harichandana, Arora Himanshu, Raja Barath Raj Kandur
- Enabling High-sparsity Foundational Llama Models With Efficient Pretraining And Deployment Agarwalla Abhinav, Gupta Abhay, Marques Alexandre, Pandit Shubhra, Goin Michael, Kurtic Eldar, Leong Kevin, Nguyen Tuan, Salem Mahmoud, Alistarh Dan, Lie Sean, Kurtz Mark
- Scalable Ensembling For Mitigating Reward Overoptimisation Ahmed Ahmed M., Rafailov Rafael, Sharkov Stepan, Li Xuechen, Koyejo Sanmi
- LM4OPT: Unveiling The Potential Of Large Language Models In Formulating Mathematical Optimization Problems Ahmed Tasnim, Choudhury Salimur
- Linguistic Intelligence In Large Language Models For Telecommunications Ahmed Tasnim, Piovesan Nicola, De Domenico Antonio, Choudhury Salimur
- Tuning Large Multimodal Models For Videos Using Reinforcement Learning From AI Feedback Ahn Daechul, Choi Yura, Yu Youngjae, Kang Dongyeop, Choi Jonghyun
- Sphinx: Sample Efficient Multilingual Instruction Fine-tuning Through N-shot Guided Prompting Ahuja Sanchit, Tanmay Kumar, Chauhan Hardik Hansrajbhai, Patra Barun, Aggarwal Kriti, Del Corro Luciano, Mitra Arindam, Dhamecha Tejas Indulal, Awadallah Ahmed, Choudhary Monojit, Chaudhary Vishrav, Sitaram Sunayana
- Yi: Open Foundation Models By 01.AI 01.AI, Young Alex, Chen Bei, Li Chao, Huang Chengen, Zhang Ge, Zhang Guanwei, Li Heng, Zhu Jiangcheng, Chen Jianqun, Chang Jing, Yu Kaidong, Liu Peng, Liu Qiang, Yue Shawn, Yang Senbin, Yang Shiming, Yu Tao, Xie Wen, Huang Wenhao, Hu Xiaohui, Ren Xiaoyi, Niu Xinyao, Nie Pengcheng, Xu Yuchi, Liu Yudong, Wang Yue, Cai Yuxuan, Gu Zhenyu, Liu Zhiyuan, Dai Zonghong
- Enhancing Pre-trained Generative Language Models With Question Attended Span Extraction On Machine Reading Comprehension Ai Lin, Hui Zheng, Liu Zizhou, Hirschberg Julia
- QASE Enhanced Plms: Improved Control In Text Generation For MRC Ai Lin, Hui Zheng, Liu Zizhou, Hirschberg Julia
- Plug And Play With Prompts: A Prompt Tuning Approach For Controlling Text Generation Ajwani Rohan Deepak, Zhu Zining, Rose Jonathan, Rudzicz Frank
- FACTS About Building Retrieval Augmented Generation-based Chatbots Akkiraju Rama, Xu Anbang, Bora Deepak, Yu Tan, An Lu, Seth Vishal, Shukla Aaditya, Gundecha Pritam, Mehta Hridhay, Jha Ashwin, Raj Prithvi, Balasubramanian Abhinav, Maram Murali, Muthusamy Guru, Annepally Shivakesh Reddy, Knowles Sidney, Du Min, Burnett Nick, Javiya Sean, Marannan Ashok, Kumari Mamta, Jha Surbhi, Dereszenski Ethan, Chakraborty Anupam, Ranjan Subhash, Terfai Amina, Surya Anoop, Mercer Tracey, Thanigachalam Vinodh Kumar, Bar Tamar, Krishnan Sanjana, Kilaru Samy, Jaksic Jasmine, Algarici Nave, Liberman Jacob, Conway Joey, Nayyar Sonu, Boitano Justin
- Mixture Of Modular Experts: Distilling Knowledge From A Multilingual Teacher Into Specialized Modular Language Models Al-maamari Mohammed, Amor Mehdi Ben, Granitzer Michael
- Enhancing Textbook Question Answering Task With Large Language Models And Retrieval Augmented Generation Alawwad Hessa Abdulrahman, Alhothali Areej, Naseem Usman, Alkhathlan Ali, Jamal Amani
- Mitigating Catastrophic Forgetting In Language Transfer Via Model Merging Alexandrov Anton, Raychev Veselin, Müller Mark Niklas, Zhang Ce, Vechev Martin, Toutanova Kristina
- Should We Fine-tune Or RAG? Evaluating Different Techniques To Adapt Llms For Dialogue Alghisi Simone, Rizzoli Massimo, Roccabruna Gabriel, Mousavi Seyed Mahed, Riccardi Giuseppe
- A Survey Of Large Language Models For European Languages Ali Wazir, Pyysalo Sampo
- Chatgpt Alternative Solutions: Large Language Models Survey Alipour Hanieh, Pendar Nick, Roy Kohinoor
- Tower: An Open Multilingual Large Language Model For Translation-related Tasks Alves Duarte M., Pombal José, Guerreiro Nuno M., Martins Pedro H., Alves João, Farajian Amin, Peters Ben, Rei Ricardo, Fernandes Patrick, Agrawal Sweta, Colombo Pierre, De Souza José G. C., Martins André F. T.
- Evaluating The Efficacy Of Foundational Models: Advancing Benchmarking Practices To Enhance Fine-tuning Decision-making Amujo Oluyemi Enoch, Yang Shanchieh Jay
- Training-free Long-context Scaling Of Large Language Models An Chenxin, Huang Fei, Zhang Jun, Gong Shansan, Qiu Xipeng, Zhou Chang, Kong Lingpeng
- Funaudiollm: Voice Understanding And Generation Foundation Models For Natural Interaction Between Humans And Llms An Keyu, Chen Qian, Deng Chong, Du Zhihao, Gao Changfeng, Gao Zhifu, Gu Yue, He Ting, Hu Hangrui, Hu Kai, Ji Shengpeng, Li Yabin, Li Zerui, Lu Heng, Luo Haoneng, Lv Xiang, Ma Bin, Ma Ziyang, Ni Chongjia, Song Changhe, Shi Jiaqi, Shi Xian, Wang Hao, Wang Wen, Wang Yuxuan, Xiao Zhangyu, Yan Zhijie, Yang Yexin, Zhang Bin, Zhang Qinglin, Zhang Shiliang, Zhao Nan, Zheng Siqi
- Make Your LLM Fully Utilize The Context An Shengnan, Ma Zexiong, Lin Zeqi, Zheng Nanning, Lou Jian-guang
- Investigating The Robustness Of Llms On Math Word Problems Anantheswaran Ujjwala, Gupta Himanshu, Scaria Kevin, Verma Shreyas, Baral Chitta, Mishra Swaroop
- Does Refusal Training In Llms Generalize To The Past Tense? Andriushchenko Maksym, Flammarion Nicolas
- Tensor Train Low-rank Approximation (tt-lora): Democratizing AI With Accelerated Llms Anjum Afia, Eren Maksim E., Boureima Ismael, Alexandrov Boian, Bhattarai Manish
- Scaling Sparse Fine-tuning To Large Language Models Ansell Alan, Vulić Ivan, Sterz Hannah, Korhonen Anna, Ponti Edoardo M.
- Generalization V.s. Memorization: Tracing Language Models' Capabilities Back To Pretraining Data Antoniades Antonis, Wang Xinyi, Elazar Yanai, Amayuelas Alfonso, Albalak Alon, Zhang Kexun, Wang William Yang
- A Framework For Fine-tuning Llms Using Heterogeneous Feedback Aponte Ryan, Rossi Ryan A., Guo Shunan, Dernoncourt Franck, Yu Tong, Chen Xiang, Mitra Subrata, Lipka Nedim
- Hired: Attention-guided Token Dropping For Efficient Inference Of High-resolution Vision-language Models In Resource-constrained Environments Arif Kazi Hasan Ibn, Yoon Jinyi, Nikolopoulos Dimitrios S., Vandierendonck Hans, John Deepu, Ji Bo
- SPAFIT: Stratified Progressive Adaptation Fine-tuning For Pre-trained Large Language Models Arora Samir, Wang Liangliang
- Exploring The Robustness Of Task-oriented Dialogue Systems For Colloquial German Varieties Artemova Ekaterina, Blaschke Verena, Plank Barbara
- To Code, Or Not To Code? Exploring Impact Of Code In Pre-training Aryabumi Viraat, Su Yixuan, Ma Raymond, Morisot Adrien, Zhang Ivan, Locatelli Acyr, Fadaee Marzieh, Üstün Ahmet, Hooker Sara
- Slicegpt: Compress Large Language Models By Deleting Rows And Columns Ashkboos Saleh, Croci Maximilian L., Nascimento Marcelo Gennari Do, Hoefler Torsten, Hensman James
- Self-seeding And Multi-intent Self-instructing Llms For Generating Intent-aware Information-seeking Dialogs Askari Arian, Petcu Roxana, Meng Chuan, Aliannejadi Mohammad, Abolghasemi Amin, Kanoulas Evangelos, Verberne Suzan
- Robustsentembed: Robust Sentence Embeddings Using Adversarial Self-supervised Contrastive Learning Asl Javad Rafiei, Panzade Prajwal, Blanco Eduardo, Takabi Daniel, Cai Zhipeng
- Can Open-source Llms Compete With Commercial Models? Exploring The Few-shot Performance Of Current GPT Models In Biomedical Tasks Ateia Samy, Kruschwitz Udo
- Transformer Alignment In Large Language Models Aubry Murdock, Meng Haoming, Sugolov Anton, Papyan Vardan
- Walia-llm: Enhancing Amharic-llama By Integrating Task-specific And Generative Datasets Azime Israel Abebe, Tonja Atnafu Lambebo, Belay Tadesse Destaw, Fuge Mitiku Yohannes, Wassie Aman Kassahun, Jada Eyasu Shiferaw, Chanie Yonas, Sewunetie Walelign Tewabe, Yimam Seid Muhie
- Lamda: Large Model Fine-tuning Via Spectrally Decomposed Low-dimensional Adaptation Azizi Seyedarmin, Kundu Souvik, Pedram Massoud
- The Pitfalls Of Next-token Prediction Bachmann Gregor, Nagarajan Vaishnavh
- Masked Mixers For Language Generation And Retrieval Badger Benjamin L.
- Efficient Model-agnostic Alignment Via Bayesian Persuasion Bai Fengshuo, Wang Mingzhi, Zhang Zhaowei, Chen Boyuan, Xu Yinda, Wen Ying, Yang Yaodong
- Federated Fine-tuning Of Large Language Models Under Heterogeneous Tasks And Client Resources Bai Jiamu, Chen Daoyuan, Qian Bingchen, Yao Liuyi, Li Yaliang
- Citrus: Chunked Instruction-aware State Eviction For Long Sequence Modeling Bai Yu, Zou Xiyuan, Huang Heyan, Chen Sanxing, Rondeau Marc-antoine, Gao Yang, Cheung Jackie Chi Kit
- Souplm: Model Integration In Large Language And Multi-modal Models Bai Yue, Zhang Zichen, Lu Jiasen, Fu Yun
- COIG-CQIA: Quality Is All You Need For Chinese Instruction Fine-tuning Bai Yuelin, Du Xinrun, Liang Yiming, Jin Yonggang, Liu Ziqiang, Zhou Junting, Zheng Tianyu, Zhang Xincheng, Ma Nuo, Wang Zekun, Yuan Ruibin, Wu Haihong, Lin Hongquan, Huang Wenhao, Zhang Jiajun, Chen Wenhu, Lin Chenghua, Fu Jie, Yang Min, Ni Shiwen, Zhang Ge
- Longalign: A Recipe For Long Context Alignment Of Large Language Models Bai Yushi, Lv Xin, Zhang Jiajie, He Yuze, Qi Ji, Hou Lei, Tang Jie, Dong Yuxiao, Li Juanzi
- Longwriter: Unleashing 10,000+ Word Generation From Long Context Llms Bai Yushi, Zhang Jiajie, Lv Xin, Zheng Linzhi, Zhu Siqi, Hou Lei, Dong Yuxiao, Tang Jie, Li Juanzi
- Finetuning Large Language Model For Personalized Ranking Bai Zhuoxi, Wu Ning, Cai Fengyu, Zhu Xinyi, Xiong Yun
- Patentgpt: A Large Language Model For Intellectual Property Bai Zilong, Zhang Ruiji, Chen Linqing, Cai Qijun, Zhong Yuan, Wang Cong, Fang Yan, Fang Jie, Sun Jing, Wang Weikuan, Zhou Lizhi, Hua Haoran, Qiu Tian, Wang Chaochao, Sun Cheng, Lu Jianping, Wang Yixin, Xia Yubin, Hu Meng, Liu Haowen, Xu Peng, Xu Licong, Bian Fu, Gu Xiaolong, Zhang Lisha, Wang Weilei, Tu Changyang
- Downstream Bias Mitigation Is All You Need Baksi Arkadeep, Singh Rahul, Joshi Tarun
- RAG Vs Fine-tuning: Pipelines, Tradeoffs, And A Case Study On Agriculture Balaguer Angels, Benara Vinamra, Cunha Renato Luiz De Freitas, Filho Roberto De M. Estevão, Hendry Todd, Holstein Daniel, Marsman Jennifer, Mecklenburg Nick, Malvar Sara, Nunes Leonardo O., Padilha Rafael, Sharp Morris, Silva Bruno, Sharma Swati, Aski Vijay, Chandra Ranveer
- CYGENT: A Cybersecurity Conversational Agent With Log Summarization Powered By GPT-3 Balasubramanian Prasasthy, Seby Justin, Kostakos Panos
- LAMP: A Language Model On The Map Balsebre Pasquale, Huang Weiming, Cong Gao
- Linguistic Calibration Of Long-form Generations Band Neil, Li Xuechen, Ma Tengyu, Hashimoto Tatsunori
- Llms Will Always Hallucinate, And We Need To Live With This Banerjee Sourav, Agarwal Ayushi, Singla Saloni
- Crayon: Customized On-device LLM Via Instant Adapter Blending And Edge-server Hybrid Inference Bang Jihwan, Lee Juntae, Shim Kyuhong, Yang Seunghan, Chang Simyung
- Smaller, Weaker, Yet Better: Training LLM Reasoners Via Compute-optimal Sampling Bansal Hritik, Hosseini Arian, Agarwal Rishabh, Tran Vinh Q., Kazemi Mehran
- Llms With Chain-of-thought Are Non-causal Reasoners Bao Guangsheng, Zhang Hongbo, Yang Linyi, Wang Cunxiang, Zhang Yue
- Transforming Agency. On The Mode Of Existence Of Large Language Models Barandiaran Xabier E., Almendros Lola S.
- To Each (textual Sequence) Its Own: Improving Memorized-data Unlearning In Large Language Models Barbulescu George-octavian, Triantafillou Peter
- TTQA-RS- A Break-down Prompting Approach For Multi-hop Table-text Question Answering With Reasoning And Summarization Bardhan Jayetri, Xiao Bushi, Wang Daisy Zhe
- Find The Gap: Knowledge Base Reasoning For Visual Question Answering Barezi Elham J., Kordjamshidi Parisa
- Reinforcement Learning From LLM Feedback To Counteract Goal Misgeneralization Barj Houda Nait El, Sautory Theophile
- Fine-tuning Or Fine-failing? Debunking Performance Myths In Large Language Models Barnett Scott, Brannelly Zac, Kurniawan Stefanus, Wong Sheng
- Slicerchat: Building A Local Chatbot For 3D Slicer Barr Colton
- From 'showgirls' To 'performers': Fine-tuning With Gender-inclusive Language For Bias Reduction In Llms Bartl Marion, Leavy Susan
- STARLING: Self-supervised Training Of Text-based Reinforcement Learning Agent With Large Language Models Basavatia Shreyas, Murugesan Keerthiram, Ratnakar Shivam
- Medaide: Leveraging Large Language Models For On-premise Medical Assistance On Edge Devices Basit Abdul, Hussain Khizar, Hanif Muhammad Abdullah, Shafique Muhammad
- A Comparative Study Of DSL Code Generation: Fine-tuning Vs. Optimized Retrieval Augmentation Bassamzadeh Nastaran, Methani Chhaya
- API-BLEND: A Comprehensive Corpora For Training And Benchmarking API Llms Basu Kinjal, Abdelaziz Ibrahim, Chaudhury Subhajit, Dan Soham, Crouse Maxwell, Munawar Asim, Kumaravel Sadhana, Muthusamy Vinod, Kapanipathi Pavan, Lastras Luis A.
- Task Prompt Vectors: Effective Initialization Through Multi-task Soft-prompt Transfer Belanec Robert, Ostermann Simon, Srba Ivan, Bielikova Maria
- Crafting Interpretable Embeddings By Asking Llms Questions Benara Vinamra, Singh Chandan, Morris John X., Antonello Richard, Stoica Ion, Huth Alexander G., Gao Jianfeng
- Cross-lingual Editing In Multilingual Language Models Beniwal Himanshu, D Kowsik Nandagopan, Singh Mayank
- Think Big, Generate Quick: Llm-to-slm For Fast Autoregressive Decoding Bergner Benjamin, Skliar Andrii, Royer Amelie, Blankevoort Tijmen, Asano Yuki, Bejnordi Babak Ehteshami
- Lotr: Low Tensor Rank Weight Adaptation Bershatsky Daniel, Cherniuk Daria, Daulbaev Talgat, Mikhalev Aleksandr, Oseledets Ivan
- A Systematic Analysis Of Large Language Models As Soft Reasoners: The Case Of Syllogistic Inferences Bertolazzi Leonardo, Gatt Albert, Bernardi Raffaella
- Enhancing Event Reasoning In Large Language Models Through Instruction Fine-tuning With Semantic Causal Graphs Bethany Mazal, Bethany Emet, Wherry Brandon, Chiang Cho-yu, Vishwamitra Nishant, Rios Anthony, Najafirad Peyman
- Soft Prompting For Unlearning In Large Language Models Bhaila Karuna, Van Minh-hao, Wu Xintao
- On The Robustness Of Language Models For Tabular Question Answering Bhandari Kushal Raj, Xing Sixue, Dan Soham, Gao Jianxi
- Language Models Are Homer Simpson! Safety Re-alignment Of Fine-tuned Language Models Through Task Arithmetic Bhardwaj Rishabh, Anh Do Duc, Poria Soujanya
- An Approach To Build Zero-shot Slot-filling System For Industry-grade Conversational Assistants Bhargav G P Shrivatsa, Neelam Sumit, Sharma Udit, Ikbal Shajith, Sreedhar Dheeraj, Karanam Hima, Joshi Sachindra, Dhoolia Pankaj, Garg Dinesh, Croutwater Kyle, Qi Haode, Wayne Eric, Murdock J William
- Fintral: A Family Of GPT-4 Level Multimodal Financial Large Language Models Bhatia Gagan, Nagoudi El Moatez Billah, Cavusoglu Hasan, Abdul-mageed Muhammad
- Zero-shot Llm-guided Counterfactual Generation For Text Bhattacharjee Amrita, Moraffah Raha, Garland Joshua, Liu Huan
- Enhancing Code Translation In Language Models With Few-shot Learning Via Retrieval-augmented Generation Bhattarai Manish, Santos Javier E., Jones Shawn, Biswas Ayan, Alexandrov Boian, O'malley Daniel
- Speculative Streaming: Fast LLM Inference Without Auxiliary Models Bhendawade Nikhil, Belousova Irina, Fu Qichen, Mason Henry, Rastegari Mohammad, Najibi Mahyar
- Decoding By Contrasting Knowledge: Enhancing Llms' Confidence On Edited Facts Bi Baolong, Liu Shenghua, Mei Lingrui, Wang Yiwei, Ji Pengliang, Cheng Xueqi
- Beyond Human Norms: Unveiling Unique Values Of Large Language Models Through Interdisciplinary Approaches Biedma Pablo, Yi Xiaoyuan, Huang Linus, Sun Maosong, Xie Xing
- Supervisory Prompt Training Billa Jean Ghislain, Oh Min, Du Liang
- Lipost: Improved Content Understanding With Effective Use Of Multi-task Contrastive Learning Bindal Akanksha, Ramanujam Sudarshan, Golland Dave, Hazen Tj, Jiang Tina, Zhang Fengyu, Yan Peng
- Digital Forgetting In Large Language Models: A Survey Of Unlearning Methods Blanco-justicia Alberto, Jebreel Najeeb, Manzanares Benet, Sánchez David, Domingo-ferrer Josep, Collell Guillem, Tan Kuan Eeik
- GRAM: Global Reasoning For Multi-page VQA Blau Tsachi, Fogel Sharon, Ronen Roi, Golts Alona, Ganz Roy, Avraham Elad Ben, Aberdam Aviad, Tsiper Shahar, Litman Ron
- Statically Contextualizing Large Language Models With Typed Holes Blinn Andrew, Li Xiang, Kim June Hyung, Omar Cyrus
- Exploring And Benchmarking The Planning Capabilities Of Large Language Models Bohnet Bernd, Nova Azade, Parisi Aaron T, Swersky Kevin, Goshvadi Katayoon, Dai Hanjun, Schuurmans Dale, Fiedel Noah, Sedghi Hanie
- Workarena++: Towards Compositional Planning And Reasoning-based Common Knowledge Work Tasks Boisvert Léo, Thakkar Megh, Gasse Maxime, Caccia Massimo, De Chezelles Thibault Le Sellier, Cappart Quentin, Chapados Nicolas, Lacoste Alexandre, Drouin Alexandre
- Do Llms Dream Of Ontologies? Bombieri Marco, Fiorini Paolo, Ponzetto Simone Paolo, Rospocher Marco
- Low-rank Quantization-aware Training For Llms Bondarenko Yelysei, Del Chiaro Riccardo, Nagel Markus
- Elephants Never Forget: Memorization And Learning Of Tabular Data In Large Language Models Bordt Sebastian, Nori Harsha, Rodrigues Vanessa, Nushi Besmira, Caruana Rich
- The Unreasonable Ineffectiveness Of Nucleus Sampling On Mitigating Text Memorization Borec Luka, Sadler Philipp, Schlangen David
- Open Artificial Knowledge Borisov Vadim, Schreiber Richard H.
- Improving Language Models Trained On Translated Data With Continual Pre-training And Dictionary Learning Analysis Boughorbel Sabri, Parvez Md Rizwan, Hawasly Majd
- Transformers Meet Neural Algorithmic Reasoners Bounsi Wilfried, Ibarz Borja, Dudzik Andrew, Hamrick Jessica B., Markeeva Larisa, Vitvitskyi Alex, Pascanu Razvan, Veličković Petar
- Improving Sequential Recommendations With Llms Boz Artun, Zorgdrager Wouter, Kotti Zoe, Harte Jesse, Louridas Panos, Jannach Dietmar, Fragkoulis Marios
- Embedding Large Language Models Into Extended Reality: Opportunities And Challenges For Inclusion, Engagement, And Privacy Bozkir Efe, Özdel Süleyman, Lau Ka Hei Carrie, Wang Mengdi, Gao Hong, Kasneci Enkelejda
- Fast And Effective Weight Update For Pruned Large Language Models Boža Vladimír
- The Art Of Saying No: Contextual Noncompliance In Language Models Brahman Faeze, Kumar Sachin, Balachandran Vidhisha, Dasigi Pradeep, Pyatkin Valentina, Ravichander Abhilasha, Wiegreffe Sarah, Dziri Nouha, Chandu Khyathi, Hessel Jack, Tsvetkov Yulia, Smith Noah A., Choi Yejin, Hajishirzi Hannaneh
- Reducing Transformer Key-value Cache Size With Cross-layer Attention Brandon William, Mishra Mayank, Nrusimha Aniruddha, Panda Rameswar, Kelly Jonathan Ragan
- LOCOST: State-space Models For Long Document Abstractive Summarization Bronnec Florian Le, Duong Song, Ravaut Mathieu, Allauzen Alexandre, Chen Nancy F., Guigue Vincent, Lumbreras Alberto, Soulier Laure, Gallinari Patrick
- Fine-tuned 'small' Llms (still) Significantly Outperform Zero-shot Generative AI Models In Text Classification Bucher Martin Juan José, Martini Marco
- Pheme: Efficient And Conversational Speech Generation Budzianowski Paweł, Sereda Taras, Cichy Tomasz, Vulić Ivan
- Lissard: Long And Simple Sequential Reasoning Datasets Bueno Mirelle, Lotufo Roberto, Nogueira Rodrigo
- Roguegpt: Dis-ethical Tuning Transforms Chatgpt4 Into A Rogue AI In 158 Words Buscemi Alessio, Proverbio Daniele
- Aligning Large Language Models With Counterfactual DPO Butcher Bradley
- ARES: Alternating Reinforcement Learning And Supervised Fine-tuning For Enhanced Multi-modal Chain-of-thought Reasoning Through Diverse AI Feedback Byun Ju-seung, Chun Jiyun, Kil Jihyung, Perrault Andrew
- Olora: Orthonormal Low-rank Adaptation Of Large Language Models Büyükakyüz Kerim
- On Large Language Models In National Security Applications Caballero William N., Jenkins Phillip R.
- In-context Learning And Fine-tuning GPT For Argument Mining Cabessa Jérémie, Hernault Hugo, Mushtaq Umer
- High-dimension Human Value Representation In Large Language Models Cahyawijaya Samuel, Chen Delong, Bang Yejin, Khalatbari Leila, Wilie Bryan, Ji Ziwei, Ishii Etsuko, Fung Pascale
- Cendol: Open Instruction-tuned Generative Large Language Models For Indonesian Languages Cahyawijaya Samuel, Lovenia Holy, Koto Fajri, Putri Rifki Afina, Dave Emmanuel, Lee Jhonson, Shadieq Nuur, Cenggoro Wawan, Akbar Salsabil Maulana, Mahendra Muhammad Ihza, Putri Dea Annisayanti, Wilie Bryan, Winata Genta Indra, Aji Alham Fikri, Purwarianti Ayu, Fung Pascale
- CERET: Cost-effective Extrinsic Refinement For Text Generation Cai Jason, Su Hang, Sunkara Monica, Shalyminov Igor, Mansour Saab
- Flextron: Many-in-one Flexible Large Language Model Cai Ruisi, Muralidharan Saurav, Heinrich Greg, Yin Hongxu, Wang Zhangyang, Kautz Jan, Molchanov Pavlo
- Lococo: Dropping In Convolutions For Long Context Compression Cai Ruisi, Tian Yuandong, Wang Zhangyang, Chen Beidi
- Medusa: Simple LLM Inference Acceleration Framework With Multiple Decoding Heads Cai Tianle, Li Yuhong, Geng Zhengyang, Peng Hongwu, Lee Jason D., Chen Deming, Dao Tri
- A Survey On Mixture Of Experts Cai Weilin, Jiang Juyong, Wang Fan, Tang Jing, Kim Sunghun, Huang Jiayi
- Internlm2 Technical Report Cai Zheng, Cao Maosong, Chen Haojiong, Chen Kai, Chen Keyu, Chen Xin, Chen Xun, Chen Zehui, Chen Zhi, Chu Pei, Dong Xiaoyi, Duan Haodong, Fan Qi, Fei Zhaoye, Gao Yang, Ge Jiaye, Gu Chenya, Gu Yuzhe, Gui Tao, Guo Aijia, Guo Qipeng, He Conghui, Hu Yingfan, Huang Ting, Jiang Tao, Jiao Penglong, Jin Zhenjiang, Lei Zhikai, Li Jiaxing, Li Jingwen, Li Linyang, Li Shuaibin, Li Wei, Li Yining, Liu Hongwei, Liu Jiangning, Hong Jiawei, Liu Kaiwen, Liu Kuikun, Liu Xiaoran, Lv Chengqi, Lv Haijun, Lv Kai, Ma Li, Ma Runyuan, Ma Zerun, Ning Wenchang, Ouyang Linke, Qiu Jiantao, Qu Yuan, Shang Fukai, Shao Yunfan, Song Demin, Song Zifan, Sui Zhihao, Sun Peng, Sun Yu, Tang Huanze, Wang Bin, Wang Guoteng, Wang Jiaqi, Wang Jiayu, Wang Rui, Wang Yudong, Wang Ziyi, Wei Xingjian, Weng Qizhen, Wu Fan, Xiong Yingtong, Xu Chao, Xu Ruiliang, Yan Hang, Yan Yirong, Yang Xiaogui, Ye Haochen, Ying Huaiyuan, Yu Jia, Yu Jing, Zang Yuhang, Zhang Chuyu, Zhang Li, Zhang Pan, Zhang Peng, Zhang Ruijie, Zhang Shuo, Zhang Songyang, Zhang Wenjian, Zhang Wenwei, Zhang Xingcheng, Zhang Xinyue, Zhao Hui, Zhao Qian, Zhao Xiaomeng, Zhou Fengzhe, Zhou Zaida, Zhuo Jingming, Zou Yicheng, Qiu Xipeng, Qiao Yu, Lin Dahua
- Towards Logically Consistent Language Models Via Probabilistic Reasoning Calanzone Diego, Teso Stefano, Vergari Antonio
- Are Large Language Models Really Bias-free? Jailbreak Prompts For Assessing Adversarial Robustness To Bias Elicitation Cantini Riccardo, Cosenza Giada, Orsino Alessio, Talia Domenico
- PRESTO: Progressive Pretraining Enhances Synthetic Chemistry Outcomes Cao He, Shao Yanjun, Liu Zhiyuan, Liu Zijing, Tang Xiangru, Yao Yuan, Li Yu
- Recent Advances In Text Embedding: A Comprehensive Review Of Top-performing Methods On The MTEB Benchmark Cao Hongliu
- Genception: Evaluate Multimodal Llms With Unlabeled Unimodal Data Cao Lele, Buchner Valentin, Senane Zineb, Yang Fangkai
- Beyond Sparse Rewards: Enhancing Reinforcement Learning With Language Model Critique In Text Generation Cao Meng, Shu Lei, Yu Lei, Zhu Yun, Wichers Nevan, Liu Yinxiao, Meng Lei
- Figuring Out Figures: Using Textual References To Caption Scientific Figures Cao Stanley, Liu Kevin
- Personalized Steering Of Large Language Models: Versatile Steering Vectors Through Bi-directional Preference Optimization Cao Yuanpu, Zhang Tianrong, Cao Bochuan, Yin Ziyi, Lin Lu, Ma Fenglong, Chen Jinghui
- Aligning Large Language Models With Recommendation Knowledge Cao Yuwei, Mehta Nikhil, Yi Xinyang, Keshavan Raghunandan, Heldt Lukasz, Hong Lichan, Chi Ed H., Sathiamoorthy Maheswaran
- Head-wise Shareable Attention For Large Language Models Cao Zouying, Yang Yifei, Zhao Hai
- Chart-based Reasoning: Transferring Capabilities From Llms To Vlms Carbune Victor, Mansoor Hassan, Liu Fangyu, Aralikatte Rahul, Baechler Gilles, Chen Jindong, Sharma Abhanshu
- A Review Of Multi-modal Large Language And Vision Models Carolan Kilian, Fennelly Laura, Smeaton Alan F.
- Can Formal Argumentative Reasoning Enhance Llms Performances? Castagna Federico, Sassoon Isabel, Parsons Simon
- Suppressing Pink Elephants With Direct Principle Feedback Castricato Louis, Lile Nathan, Anand Suraj, Schoelkopf Hailey, Verma Siddharth, Biderman Stella
- Smileyllama: Modifying Large Language Models For Directed Chemical Space Exploration Cavanagh Joseph M., Sun Kunyang, Gritsevskiy Andrew, Bagni Dorian, Bannister Thomas D., Head-gordon Teresa
- Towards Robust And Cost-efficient Knowledge Unlearning For Large Language Models Cha Sungmin, Cho Sungjun, Hwang Dasol, Lee Moontae
- Dual Modalities Of Text: Visual And Textual Generative Pre-training Chai Yekun, Liu Qingyi, Xiao Jingwu, Wang Shuohuan, Sun Yu, Wu Hua
- Cross-modal Safety Alignment: Is Textual Unlearning All You Need? Chakraborty Trishna, Shayegani Erfan, Cai Zikui, Abu-ghazaleh Nael, Asif M. Salman, Dong Yue, Roy-chowdhury Amit K., Song Chengyu
- Dense Reward For Free In Reinforcement Learning From Human Feedback Chan Alex J., Sun Hao, Holt Samuel, Van Der Schaar Mihaela
- RQ-RAG: Learning To Refine Queries For Retrieval Augmented Generation Chan Chi-min, Xu Chunpu, Yuan Ruibin, Luo Hongyin, Xue Wei, Guo Yike, Fu Jie
- AMGPT: A Large Language Model For Contextual Querying In Additive Manufacturing Chandrasekhar Achuth, Chan Jonathan, Ogoke Francis, Ajenifujah Olabode, Farimani Amir Barati
- Integrating Emotional And Linguistic Models For Ethical Compliance In Large Language Models Chang Edward Y.
- How Do Large Language Models Acquire Factual Knowledge During Pretraining? Chang Hoyeon, Park Jinho, Ye Seonghyeon, Yang Sohee, Seo Youngkyung, Chang Du-seong, Seo Minjoon
- Speechprompt: Prompting Speech Language Models For Speech Processing Tasks Chang Kai-wei, Wu Haibin, Wang Yu-kai, Wu Yuan-kuei, Shen Hua, Tseng Wei-cheng, Kang Iu-thing, Li Shang-wen, Lee Hung-yi
- Goldfish: Monolingual Language Models For 350 Languages Chang Tyler A., Arnett Catherine, Tu Zhuowen, Bergen Benjamin K.
- Detecting Hallucination And Coverage Errors In Retrieval Augmented Generation For Controversial Topics Chang Tyler A., Tomanek Katrin, Hoffmann Jessica, Thain Nithum, Van Liemt Erin, Meier-hellstern Kathleen, Dixon Lucas
- Bias-aware Low-rank Adaptation: Mitigating Catastrophic Inheritance Of Large Language Models Chang Yupeng, Chang Yi, Wu Yuan
- When Large Language Models Meet Evolutionary Algorithms Chao Wang, Zhao Jiaxuan, Jiao Licheng, Li Lingling, Liu Fang, Yang Shuyuan
- Phantom: General Trigger Attacks On Retrieval Augmented Language Generation Chaudhari Harsh, Severi Giorgio, Abascal John, Jagielski Matthew, Choquette-choo Christopher A., Nasr Milad, Nita-rotaru Cristina, Oprea Alina
- Coupling Speech Encoders With Downstream Text Models Chelba Ciprian, Schalkwyk Johan
- No Such Thing As A General Learner: Language Models And Their Dual Optimization Chemla Emmanuel, Nefdt Ryan M.
- The Dark Side Of Human Feedback: Poisoning Large Language Models Via User Inputs Chen Bocheng, Guo Hanqing, Wang Guangjing, Wang Yuanda, Yan Qiben
- Model Composition For Multimodal Large Language Models Chen Chi, Du Yiyang, Fang Zheng, Wang Ziyue, Luo Fuwen, Li Peng, Yan Ming, Zhang Ji, Huang Fei, Sun Maosong, Liu Yang
- Self-cognition In Large Language Models: An Exploratory Study Chen Dongping, Shi Jiawen, Wan Yao, Zhou Pan, Gong Neil Zhenqiang, Sun Lichao
- Hardware-aware Parallel Prompt Decoding For Memory-efficient Acceleration Of LLM Inference Chen Hao Mark, Luk Wayne, Yiu Ka Fai Cedric, Li Rui, Mishchenko Konstantin, Venieris Stylianos I., Fan Hongxiang
- Efficiency In Focus: Layernorm As A Catalyst For Fine-tuning Medical Visual Language Pre-trained Models Chen Jiawei, Yang Dingkang, Jiang Yue, Li Mingcheng, Wei Jinjie, Hou Xiaolu, Zhang Lihua
- Unveiling The Flaws: Exploring Imperfections In Synthetic Data And Mitigation Strategies For Large Language Models Chen Jie, Zhang Yupeng, Wang Bingning, Zhao Wayne Xin, Wen Ji-rong, Chen Weipeng
- Automated Data Curation For Robust Language Model Fine-tuning Chen Jiuhai, Mueller Jonas
- Putting People In Llms' Shoes: Generating Better Answers Via Question Rewriter Chen Junhao, Wang Bowen, Jiang Zhouqiang, Nakashima Yuta
- Pixart-Σ: Weak-to-strong Training Of Diffusion Transformer For 4K Text-to-image Generation Chen Junsong, Ge Chongjian, Xie Enze, Wu Yue, Yao Lewei, Ren Xiaozhe, Wang Zhongdao, Luo Ping, Lu Huchuan, Li Zhenguo
- Magdi: Structured Distillation Of Multi-agent Interaction Graphs Improves Reasoning In Smaller Language Models Chen Justin Chih-yao, Saha Swarnadeep, Stengel-eskin Elias, Bansal Mohit
- Cost-effective Instruction Learning For Pathology Vision And Language Analysis Chen Kaitao, Liu Mianxin, Yan Fang, Ma Lei, Shi Xiaoming, Wang Lilong, Wang Xiaosong, Zhu Lifeng, Wang Zhe, Zhou Mu, Zhang Shaoting
- Instructioncp: A Fast Approach To Transfer Large Language Models Into Target Language Chen Kuang-ming, Lee Hung-yi
- How Truncating Weights Improves Reasoning In Language Models Chen Lei, Bruna Joan, Bietti Alberto
- Pca-bench: Evaluating Multimodal Large Language Models In Perception-cognition-action Chain Chen Liang, Zhang Yichi, Ren Shuhuai, Zhao Haozhe, Cai Zefan, Wang Yuchi, Wang Peiyi, Meng Xiangdi, Liu Tianyu, Chang Baobao
- Teaching Large Language Models To Express Knowledge Boundary From Their Own Signals Chen Lida, Liang Zujie, Wang Xintao, Liang Jiaqing, Xiao Yanghua, Wei Feng, Chen Jinglei, Hao Zhenghong, Han Bing, Wang Wei
- Are We On The Right Way For Evaluating Large Vision-language Models? Chen Lin, Li Jinsong, Dong Xiaoyi, Zhang Pan, Zang Yuhang, Chen Zehui, Duan Haodong, Wang Jiaqi, Qiao Yu, Lin Dahua, Zhao Feng
- Bailong: Bilingual Transfer Learning Based On Qlora And Zip-tie Embedding Chen Lung-chuan, Li Zong-ru
- Quantifying And Mitigating Unimodal Biases In Multimodal Large Language Models: A Causal Perspective Chen Meiqi, Cao Yixin, Zhang Yan, Lu Chaochao
- Efficientqat: Efficient Quantization-aware Training For Large Language Models Chen Mengzhao, Shao Wenqi, Xu Peng, Wang Jiahao, Gao Peng, Zhang Kaipeng, Qiao Yu, Luo Ping
- Graphwiz: An Instruction-following Language Model For Graph Problems Chen Nuo, Li Yuhan, Tang Jianheng, Li Jia
- Enhancing Visual Question Answering Through Ranking-based Hybrid Training And Multimodal Fusion Chen Peiyuan, Zhang Zecheng, Dong Yiping, Zhou Li, Wang Han
- Lifelong Knowledge Editing For Llms With Retrieval-augmented Continuous Prompt Learning Chen Qizhou, Zhang Taolin, He Xiaofeng, Li Dongyang, Wang Chengyu, Huang Longtao, Xue Hui
- Attribution Analysis Meets Model Editing: Advancing Knowledge Correction In Vision Language Models With Visedit Chen Qizhou, Zhang Taolin, Wang Chengyu, He Xiaofeng, Wang Dakan, Liu Tingting
- Llava-mole: Sparse Mixture Of Lora Experts For Mitigating Data Conflicts In Instruction Finetuning Mllms Chen Shaoxiang, Jie Zequn, Ma Lin
- Reinstruct: Building Instruction Data From Unlabeled Corpus Chen Shu, Guan Xinyan, Lu Yaojie, Lin Hongyu, Han Xianpei, Sun Le
- Advancing Tool-augmented Large Language Models: Integrating Insights From Errors In Inference Trees Chen Sijia, Wang Yibo, Wu Yi-feng, Chen Qing-guo, Xu Zhao, Luo Weihua, Zhang Kaifu, Zhang Lijun
- Struq: Defending Against Prompt Injection With Structured Queries Chen Sizhe, Piet Julien, Sitawarin Chawin, Wagner David
- From Yes-men To Truth-tellers: Addressing Sycophancy In Large Language Models With Pinpoint Tuning Chen Wei, Huang Zhen, Xie Liang, Lin Binbin, Li Houqiang, Lu Le, Tian Xinmei, Cai Deng, Zhang Yonggang, Wan Wenxiao, Shen Xu, Ye Jieping
- Octopus: On-device Language Model For Function Calling Of Software Apis Chen Wei, Li Zhiyuan, Ma Mingyuan
- GRATH: Gradual Self-truthifying For Large Language Models Chen Weixin, Song Dawn, Li Bo
- Beyond Natural Language: Llms Leveraging Alternative Formats For Enhanced Reasoning And Communication Chen Weize, Yuan Chenfei, Yuan Jiarui, Su Yusheng, Qian Chen, Yang Cheng, Xie Ruobing, Liu Zhiyuan, Sun Maosong
- SATO: Stable Text-to-motion Framework Chen Wenshuo, Xiao Hongru, Zhang Erhang, Hu Lijie, Wang Lei, Liu Mengyuan, Chen Chen
- Icleval: Evaluating In-context Learning Ability Of Large Language Models Chen Wentong, Lin Yankai, Zhou Zhenhao, Huang Hongyun, Jia Yantao, Cao Zhao, Wen Ji-rong
- Llast: Improved End-to-end Speech Translation System Leveraged By Large Language Models Chen Xi, Zhang Songyang, Bai Qibing, Chen Kai, Nakamura Satoshi
- Gazexplain: Learning To Predict Natural Language Explanations Of Visual Scanpaths Chen Xianyu, Jiang Ming, Zhao Qi
- Cause-aware Empathetic Response Generation Via Chain-of-thought Fine-tuning Chen Xinhao, Yang Chong, Lan Man, Cai Li, Chen Yang, Hu Tu, Zhuang Xinlin, Zhou Aimin
- RL-JACK: Reinforcement Learning-powered Black-box Jailbreaking Attack Against Llms Chen Xuan, Nie Yuzhou, Yan Lu, Mao Yunshu, Guo Wenbo, Zhang Xiangyu
- Take The Bull By The Horns: Hard Sample-reweighted Continual Training Improves LLM Generalization Chen Xuxi, Wang Zhendong, Sow Daouda, Yang Junjie, Chen Tianlong, Liang Yingbin, Zhou Mingyuan, Wang Zhangyang
- Can We Rely On LLM Agents To Draft Long-horizon Plans? Let's Take Travelplanner As An Example Chen Yanan, Pesaranghader Ali, Sadhu Tanmana, Yi Dong Hoon
- Refining Packing And Shuffling Strategies For Enhanced Performance In Generative Language Models Chen Yanbing, Wang Ruilin, Yang Zihao, Jiang Lavender Yao, Oermann Eric Karl
- Extroversion Or Introversion? Controlling The Personality Of Your Large Language Models Chen Yanquan, Wu Zhen, Guo Junjie, Huang Shujian, Dai Xinyu
- Recoverable Compression: A Multimodal Vision Token Recovery Mechanism Guided By Text Information Chen Yi, Xu Jian, Zhang Xu-yao, Liu Wen-zhuo, Liu Yang-yang, Liu Cheng-lin
- Robust And Scalable Model Editing For Large Language Models Chen Yingfa, Zhang Zhengyan, Han Xu, Xiao Chaojun, Liu Zhiyuan, Chen Chen, Li Kuai, Yang Tao, Sun Maosong
- See What Llms Cannot Answer: A Self-challenge Framework For Uncovering LLM Weaknesses Chen Yulong, Liu Yang, Yan Jianhao, Bai Xuefeng, Zhong Ming, Yang Yinghao, Yang Ziyi, Zhu Chenguang, Zhang Yue
- Mofo: Momentum-filtered Optimizer For Mitigating Forgetting In LLM Fine-tuning Chen Yupeng, Wang Senmiao, Lin Zhihang, Qin Zeyu, Zhang Yushun, Ding Tian, Sun Ruoyu
- Agent-flan: Designing Data And Methods Of Effective Agent Tuning For Large Language Models Chen Zehui, Liu Kuikun, Wang Qiuchen, Zhang Wenwei, Liu Jiangning, Lin Dahua, Chen Kai, Zhao Feng
- MLLM Is A Strong Reranker: Advancing Multimodal Retrieval-augmented Generation Via Knowledge-enhanced Reranking And Noise-injected Training Chen Zhanpeng, Xu Chengjin, Qi Yiyan, Guo Jian
- Autoprm: Automating Procedural Supervision For Multi-step Reasoning Via Controllable Question Decomposition Chen Zhaorun, Zhao Zhuokai, Zhu Zhihong, Zhang Ruiqi, Li Xiang, Raj Bhiksha, Yao Huaxiu
- Dia-llama: Towards Large Language Model-driven CT Report Generation Chen Zhixuan, Luo Luyang, Bie Yequan, Chen Hao
- Improving Retrieval Augmented Open-domain Question-answering With Vectorized Contexts Chen Zhuo, Wang Xinyu, Jiang Yong, Xie Pengjun, Huang Fei, Tu Kewei
- An Empirical Study Of Data Ability Boundary In Llms' Math Reasoning Chen Zui, Chen Yezeng, Han Jiaqi, Huang Zhijie, Qi Ji, Zhou Yi
- "In Dialogues We Learn": Towards Personalized Dialogue Without Pre-defined Profiles Through In-dialogue Learning Cheng Chuanqi, Tu Quan, Wu Wei, Shang Shuo, Mao Cunli, Yu Zhengtao, Yan Rui
- Empowering Large Language Models On Robotic Manipulation With Affordance Prompting Cheng Guangran, Zhang Chuheng, Cai Wenzhe, Zhao Li, Sun Changyin, Bian Jiang
- Unveiling Typographic Deceptions: Insights Of The Typographic Vulnerability In Large Vision-language Model Cheng Hao, Xiao Erjia, Gu Jindong, Yang Le, Duan Jinhao, Zhang Jize, Cao Jiahang, Xu Kaidi, Xu Renjing
- Social Debiasing For Fair Multi-modal Llms Cheng Harry, Guo Yangyang, Guo Qingpei, Yang Ming, Gan Tian, Nie Liqiang
- Self-playing Adversarial Language Game Enhances LLM Reasoning Cheng Pengyu, Hu Tianhao, Xu Han, Zhang Zhisong, Dai Yong, Han Lei, Du Nan
- Trojanrag: Retrieval-augmented Generation Can Be Backdoor Driver In Large Language Models Cheng Pengzhou, Ding Yidong, Ju Tianjie, Wu Zongru, Du Wei, Yi Ping, Zhang Zhuosheng, Liu Gongshen
- Small Agent Can Also Rock! Empowering Small Language Models As Hallucination Detector Cheng Xiaoxue, Li Junyi, Zhao Wayne Xin, Zhang Hongzhi, Zhang Fuzheng, Zhang Di, Gai Kun, Wen Ji-rong
- Interpreting Conversational Dense Retrieval By Rewriting-enhanced Inversion Of Session Embedding Cheng Yiruo, Mao Kelong, Dou Zhicheng
- ANOLE: An Open, Autoregressive, Native Large Multimodal Models For Interleaved Image-text Generation Chern Ethan, Su Jiadi, Ma Yan, Liu Pengfei
- A Reality Check Of The Benefits Of LLM In Business Cheung Ming
- Modeling: A Novel Dataset For Testing Linguistic Reasoning In Language Models Chi Nathan A., Malchev Teodor, Kong Riley, Chi Ryan A., Huang Lucas, Chi Ethan A., Mccoy R. Thomas, Radev Dragomir
- Zero-shot Cross-lingual Transfer In Instruction Tuning Of Large Language Models Chirkova Nadezhda, Nikoulina Vassilina
- Exploring Quantization For Efficient Pre-training Of Transformer Language Models Chitsaz Kamran, Fournier Quentin, Mordido Gonçalo, Chandar Sarath
- Unveiling Imitation Learning: Exploring The Impact Of Data Falsity To Large Language Model Cho Hyunsoo
- Language-image Models With 3D Understanding Cho Jang Hyun, Ivanovic Boris, Cao Yulong, Schmerling Edward, Wang Yue, Weng Xinshuo, Li Boyi, You Yurong, Krähenbühl Philipp, Wang Yan, Pavone Marco
- Optimizing Language Augmentation For Multilingual Large Language Models: A Case Study On Korean Choi Changsu, Jeong Yongbin, Park Seoyoon, Won Inho, Lim Hyeonseok, Kim Sangmin, Kang Yejee, Yoon Chanhyuk, Park Jaewan, Lee Yiseul, Lee Hyejin, Hahm Younggyun, Kim Hansaem, Lim Kyungtae
- SNAP: Unlearning Selective Knowledge In Large Language Models With Negative Instructions Choi Minseok, Rim Daniel, Lee Dohyun, Choo Jaegul
- Cross-architecture Transfer Learning For Linear-cost Inference Transformers Choi Sehyun
- When Life Gives You Llms, Make LLM-ADE: Large Language Models With Adaptive Data Engineering Choi Stephen, Gazeley William
- Gemmar: Enhancing Llms Through Arabic Instruction-tuning Chouikhi Hasna, Aloui Manel, Hammou Cyrine Ben, Chaabane Ghaith, Kchaou Haithem, Dhaouadi Chehir
- Breaking Down The Defenses: A Comparative Survey Of Attacks On Large Language Models Chowdhury Arijit Ghosh, Islam Md Mofijul, Kumar Vaibhav, Shezan Faysal Hossain, Kumar Vaibhav, Jain Vinija, Chadha Aman
- A Causal Explainable Guardrails For Large Language Models Chu Zhixuan, Wang Yan, Li Longfei, Wang Zhibo, Qin Zhan, Ren Kui
- Crosslingual Capabilities And Knowledge Barriers In Multilingual Large Language Models Chua Lynn, Ghazi Badih, Huang Yangsibo, Kamath Pritish, Kumar Ravi, Manurangsi Pasin, Sinha Amer, Xie Chulin, Zhang Chiyuan
- Lookback Lens: Detecting And Mitigating Contextual Hallucinations In Large Language Models Using Only Attention Maps Chuang Yung-sung, Qiu Linlu, Hsieh Cheng-yu, Krishna Ranjay, Kim Yoon, Glass James
- Stable Language Model Pre-training By Reducing Embedding Variability Chung Woojin, Hong Jiwoo, An Na Min, Thorne James, Yun Se-young
- PEFT-U: Parameter-efficient Fine-tuning For User Personalization Clarke Christopher, Heng Yuzhao, Tang Lingjia, Mars Jason
- Cogbench: A Large Language Model Walks Into A Psychology Lab Coda-forno Julian, Binz Marcel, Wang Jane X., Schulz Eric
- Improving Token-based World Models With Parallel Observation Prediction Cohen Lior, Wang Kaixin, Kang Bingyi, Mannor Shie
- Saullm-54b & Saullm-141b: Scaling Up Domain Adaptation For The Legal Domain Colombo Pierre, Pires Telmo, Boudiaf Malik, Melo Rui, Culver Dominic, Morgado Sofia, Malaboeuf Etienne, Hautreux Gabriel, Charpentier Johanne, Desa Michael
- Representations As Language: An Information-theoretic Framework For Interpretability Conklin Henry, Smith Kenny
- Ragsys: Item-cold-start Recommender As RAG System Contal Emile, Mcgoldrick Garrin
- LLASP: Fine-tuning Large Language Models For Answer Set Programming Coppolillo Erica, Calimeri Francesco, Manco Giuseppe, Perri Simona, Ricca Francesco
- Rocode: A Dataset For Measuring Code Intelligence From Problem Definitions In Romanian Cosma Adrian, Iordache Bogdan, Rosso Paolo
- Gamebench: Evaluating Strategic Reasoning Abilities Of LLM Agents Costarelli Anthony, Allen Mat, Hauksson Roman, Sodunke Grace, Hariharan Suhas, Cheng Carlson, Li Wenjie, Clymer Joshua, Yadav Arjun
- Out-of-context Prompting Boosts Fairness And Robustness In Large Language Model Predictions Cotta Leonardo, Maddison Chris J.
- Llambert: Large-scale Low-cost Data Annotation In NLP Csanády Bálint, Muzsai Lajos, Vedres Péter, Nádasdy Zoltán, Lukács András
- A Tale Of Trust And Accuracy: Base Vs. Instruct Llms In RAG Systems Cuconasu Florin, Trappolini Giovanni, Tonellotto Nicola, Silvestri Fabrizio
- Scaling Properties Of Speech Language Models Cuervo Santiago, Marxer Ricard
- Rethinking LLM Language Adaptation: A Case Study On Chinese Mixtral Cui Yiming, Yao Xin
- Meta Large Language Model Compiler: Foundation Models Of Compiler Optimization Cummins Chris, Seeker Volker, Grubisic Dejan, Roziere Baptiste, Gehring Jonas, Synnaeve Gabriel, Leather Hugh
- Synthetic Dataset Creation And Fine-tuning Of Transformer Models For Question Answering In Serbian Cvetanović Aleksa, Tadić Predrag
- Anchored Preference Optimization And Contrastive Revisions: Addressing Underspecification In Alignment D'oosterlinck Karel, Xu Winnie, Develder Chris, Demeester Thomas, Singh Amanpreet, Potts Christopher, Kiela Douwe, Mehri Shikib
- Getting The Most Out Of Your Tokenizer For Pre-training And Domain Adaptation Dagan Gautier, Synnaeve Gabriel, Rozière Baptiste
- Improve Student's Reasoning Generalizability Through Cascading Decomposed Cots Distillation Dai Chengwei, Li Kun, Zhou Wei, Hu Songlin
- CORM: Cache Optimization With Recent Message For Large Language Model Inference Dai Jincheng, Huang Zhuowei, Jiang Haiyun, Chen Chen, Cai Deng, Bi Wei, Shi Shuming
- ACT-MNMT Auto-constriction Turning For Multilingual Neural Machine Translation Dai Shaojie, Liu Xin, Luo Ping, Yu Yue
- Large Language Models Can Better Understand Knowledge Graphs Than We Thought Dai Xinbang, Hua Yuncheng, Wu Tongtong, Sheng Yang, Ji Qiu, Qi Guilin
- Impact Of Non-standard Unicode Characters On Security And Comprehension In Large Language Models Daniel Johan S, Pal Anand
- Under The Surface: Tracking The Artifactuality Of Llm-generated Data Das Debarati, De Langis Karin, Martin-boyle Anna, Kim Jaehyung, Lee Minhwa, Kim Zae Myung, Hayati Shirley Anugrah, Owan Risako, Hu Bin, Parkar Ritik, Koo Ryan, Park Jonginn, Tyagi Aahan, Ferland Libby, Roy Sanjali, Liu Vincent, Kang Dongyeop
- Speechverse: A Large-scale Generalizable Audio Language Model Das Nilaksh, Dingliwal Saket, Ronanki Srikanth, Paturi Rohit, Huang Zhaocheng, Mathur Prashant, Yuan Jie, Bekal Dhanush, Niu Xing, Jayanthi Sai Muralidhar, Li Xilai, Mundnich Karel, Sunkara Monica, Srinivasan Sundararajan, Han Kyu J, Kirchhoff Katrin
- Low-rank Finetuning For Llms: A Fairness Perspective Das Saswat, Romanelli Marco, Tran Cuong, Reza Zarreen, Kailkhura Bhavya, Fioretto Ferdinando
- Entropy Guided Extrapolative Decoding To Improve Factuality In Large Language Models Das Souvik, Jin Lifeng, Song Linfeng, Mi Haitao, Peng Baolin, Yu Dong
- Xmainframe: A Large Language Model For Mainframe Modernization Dau Anh T. V., Dao Hieu Trung, Nguyen Anh Tuan, Tran Hieu Trung, Nguyen Phong X., Bui Nghi D. Q.
- Investigating Symbolic Capabilities Of Large Language Models Dave Neisarg, Kifer Daniel, Giles C. Lee, Mali Ankur
- Show, Don't Tell: Evaluating Large Language Models Beyond Textual Understanding With Childplay De Carvalho Gonçalo Hora, Knap Oscar, Pollice Robert
- Towards Transparency: Exploring LLM Trainings Datasets Through Visual Topic Modeling And Semantic Frame De Dampierre Charles, Mogoutov Andrei, Baumard Nicolas
- Pelle: Encoder-based Language Models For Brazilian Portuguese Based On Open Data De Mello Guilherme Lamartine, Finger Marcelo, Serras Felipe, Carpi Miguel De Mello, Jose Marcos Menon, Domingues Pedro Henrique, Cavalim Paulo
- Will GPT-4 Run DOOM? De Wynter Adrian
- Evaluating Large Language Models For Automatic Analysis Of Teacher Simulations De-fitero-dominguez David, Albaladejo-gonzález Mariano, Garcia-cabot Antonio, Garcia-lopez Eva, Moreno-cediel Antonio, Barno Erin, Reich Justin
- Deepseek-v2: A Strong, Economical, And Efficient Mixture-of-experts Language Model Deepseek-ai, Aixin Liu, Bei Feng, Bin Wang, Bingxuan Wang, Bo Liu, Chenggang Zhao, Chengqi Dengr, Chong Ruan, Damai Dai, Daya Guo, Dejian Yang, Deli Chen, Dongjie Ji, Erhang Li, Fangyun Lin, Fuli Luo, Guangbo Hao, Guanting Chen, Guowei Li, H. Zhang, Hanwei Xu, Hao Yang, Haowei Zhang, Honghui Ding, Huajian Xin, Huazuo Gao, Hui Li, Hui Qu, J. L. Cai, Jian Liang, Jianzhong Guo, Jiaqi Ni, Jiashi Li, Jin Chen, Jingyang Yuan, Junjie Qiu, Junxiao Song, Kai Dong, Kaige Gao, Kang Guan, Lean Wang, Lecong Zhang, Lei Xu, Leyi Xia, Liang Zhao, Liyue Zhang, Meng Li, Miaojun Wang, Mingchuan Zhang, Minghua Zhang, Minghui Tang, Mingming Li, Ning Tian, Panpan Huang, Peiyi Wang, Peng Zhang, Qihao Zhu, Qinyu Chen, Qiushi Du, R. J. Chen, R. L. Jin, Ruiqi Ge, Ruizhe Pan, Runxin Xu, Ruyi Chen, S. S. Li, Shanghao Lu, Shangyan Zhou, Shanhuang Chen, Shaoqing Wu, Shengfeng Ye, Shirong Ma, Shiyu Wang, Shuang Zhou, Shuiping Yu, Shunfeng Zhou, Size Zheng, T. Wang, Tian Pei, Tian Yuan, Tianyu Sun, W. L. Xiao, Wangding Zeng, Wei An, Wen Liu, Wenfeng Liang, Wenjun Gao, Wentao Zhang, X. Q. Li, Xiangyue Jin, Xianzu Wang, Xiao Bi, Xiaodong Liu, Xiaohan Wang, Xiaojin Shen, Xiaokang Chen, Xiaosha Chen, Xiaotao Nie, Xiaowen Sun, Xiaoxiang Wang, Xin Liu, Xin Xie, Xingkai Yu, Xinnan Song, Xinyi Zhou, Xinyu Yang, Xuan Lu, Xuecheng Su, Y. Wu, Y. K. Li, Y. X. Wei, Y. X. Zhu, Yanhong Xu, Yanping Huang, Yao Li, Yao Zhao, Yaofeng Sun, Yaohui Li, Yaohui Wang, Yi Zheng, Yichao Zhang, Yiliang Xiong, Yilong Zhao, Ying He, Ying Tang, Yishi Piao, Yixin Dong, Yixuan Tan, Yiyuan Liu, Yongji Wang, Yongqiang Guo, Yuchen Zhu, Yuduan Wang, Yuheng Zou, Yukun Zha, Yunxian Ma, Yuting Yan, Yuxiang You, Yuxuan Liu, Z. Z. Ren, Zehui Ren, Zhangli Sha, Zhe Fu, Zhen Huang, Zhen Zhang, Zhenda Xie, Zhewen Hao, Zhihong Shao, Zhiniu Wen, Zhipeng Xu, Zhongyu Zhang, Zhuoshu Li, Zihan Wang, Zihui Gu, Zilin Li, Ziwei Xie
- Cram: Credibility-aware Attention Modification In Llms For Combating Misinformation In RAG Deng Boyi, Wang Wenjie, Zhu Fengbin, Wang Qifan, Feng Fuli
- MIMIR: A Streamlined Platform For Personalized Agent Tuning In Domain Expertise Deng Chunyuan, Tang Xiangru, Zhao Yilun, Wang Hanming, Wang Haoran, Zhou Wangchunshu, Cohan Arman, Gerstein Mark
- Gotcha! Don't Trick Me With Unanswerable Questions! Self-aligning Large Language Models For Responding To Unknown Questions Deng Yang, Zhao Yong, Li Moxin, Ng See-kiong, Chua Tat-seng
- LMGT: Optimizing Exploration-exploitation Balance In Reinforcement Learning Through Language Model Guided Trade-offs Deng Yongxin, Qiu Xihe, Tan Xiaoyu, Chu Wei, Xu Yinghui
- Cognidual Framework: Self-training Large Language Models Within A Dual-system Theoretical Framework For Improving Cognitive Tasks Deng Yongxin, Qiu Xihe, Tan Xiaoyu, Qu Chao, Pan Jing, Cheng Yuan, Xu Yinghui, Chu Wei
- Text-tuple-table: Towards Information Integration In Text-to-table Generation Via Global Tuple Extraction Deng Zheye, Chan Chunkit, Wang Weiqi, Sun Yuxi, Fan Wei, Zheng Tianshi, Yim Yauwai, Song Yangqiu
- Sycophancy To Subterfuge: Investigating Reward-tampering In Large Language Models Denison Carson, Macdiarmid Monte, Barez Fazl, Duvenaud David, Kravec Shauna, Marks Samuel, Schiefer Nicholas, Soklaski Ryan, Tamkin Alex, Kaplan Jared, Shlegeris Buck, Bowman Samuel R., Perez Ethan, Hubinger Evan
- Teaching A Multilingual Large Language Model To Understand Multilingual Speech Via Multi-instructional Training Denisov Pavel, Vu Ngoc Thang
- Glam: Fine-tuning Large Language Models For Domain Knowledge Graph Alignment Via Neighborhood Partitioning And Generative Subgraph Encoding Dernbach Stefan, Agarwal Khushbu, Zuniga Alejandro, Henry Michael, Choudhury Sutanay
- Promises, Outlooks And Challenges Of Diffusion Language Modeling Deschenaux Justin, Gulcehre Caglar
- GNOME: Generating Negotiations Through Open-domain Mapping Of Exchanges Deshpande Darshan, Sinha Shambhavi, Kumar Anirudh Ravi, Pal Debaditya, May Jonathan
- Are You Sure? Rank Them Again: Repeated Ranking For Better Preference Datasets Devine Peter
- Tagengo: A Multilingual Chat Dataset Devine Peter
- A Simple And Effective \(L_2\) Norm-based Strategy For KV Cache Compression Devoto Alessio, Zhao Yu, Scardapane Simone, Minervini Pasquale
- SOCIALITE-LLAMA: An Instruction-tuned Model For Social Scientific Tasks Dey Gourab, Ganesan Adithya V, Lal Yash Kumar, Shah Manal, Sinha Shreyashee, Matero Matthew, Giorgi Salvatore, Kulkarni Vivek, Schwartz H. Andrew
- Facts-and-feelings: Capturing Both Objectivity And Subjectivity In Table-to-text Generation Dey Tathagata, Bhattacharyya Pushpak
- Could We Have Had Better Multilingual Llms If English Was Not The Central Language? Diandaru Ryandito, Susanto Lucky, Tang Zilu, Purwarianti Ayu, Wijaya Derry
- Unveiling Encoder-free Vision-language Models Diao Haiwen, Cui Yufeng, Li Xiaotong, Wang Yueze, Lu Huchuan, Wang Xinlong
- Learning How To Ask: Cycle-consistency Refines Prompts In Multimodal Foundation Models Diesendruck Maurice, Lin Jianzhe, Imani Shima, Mahalingam Gayathri, Xu Mingyang, Zhao Jie
- Data Augmentation Using Large Language Models: Data Perspectives, Learning Paradigms And Challenges Ding Bosheng, Qin Chengwei, Zhao Ruochen, Luo Tianze, Li Xinze, Chen Guizhen, Xia Wenhan, Hu Junjie, Luu Anh Tuan, Joty Shafiq
- Mastering Text, Code And Math Simultaneously Via Fusing Highly Specialized Language Models Ding Ning, Chen Yulin, Cui Ganqu, Lv Xingtai, Zhao Weilin, Xie Ruobing, Zhou Bowen, Liu Zhiyuan, Sun Maosong
- Semcoder: Training Code Language Models With Comprehensive Semantics Ding Yangruibo, Peng Jinjun, Min Marcus J., Kaiser Gail, Yang Junfeng, Ray Baishakhi
- Entgpt: Linking Generative Large Language Models With Knowledge Bases Ding Yifan, Poudel Amrit, Zeng Qingkai, Weninger Tim, Veeramani Balaji, Bhattacharya Sanmitra
- Longrope: Extending LLM Context Window Beyond 2 Million Tokens Ding Yiran, Zhang Li Lyna, Zhang Chengruidong, Xu Yuanyuan, Shang Ning, Xu Jiahang, Yang Fan, Yang Mao
- Large Language Models Prompting With Episodic Memory Do Dai, Tran Quan, Venkatesh Svetha, Le Hung
- Automatic Prompt Selection For Large Language Models Do Viet-tung, Hoang Van-khanh, Nguyen Duy-hung, Sabahi Shahab, Yang Jeff, Hotta Hajime, Nguyen Minh-tien, Le Hung
- Language Adaptation On A Tight Academic Compute Budget: Tokenizer Swapping Works And Pure Bfloat16 Is Enough Dobler Konstantin, De Melo Gerard
- Evaluating Linguistic Capabilities Of Multimodal Llms In The Lens Of Few-shot Learning Dogan Mustafa, Kesen Ilker, Calixto Iacer, Erdem Aykut, Erdem Erkut
- A Tale Of Tails: Model Collapse As A Change Of Scaling Laws Dohmatob Elvis, Feng Yunzhen, Yang Pu, Charton Francois, Kempe Julia
- Self-play With Execution Feedback: Improving Instruction-following Capabilities Of Large Language Models Dong Guanting, Lu Keming, Li Chengpeng, Xia Tingyu, Yu Bowen, Zhou Chang, Zhou Jingren
- Understand What LLM Needs: Dual Preference Alignment For Retrieval-augmented Generation Dong Guanting, Zhu Yutao, Zhang Chenghao, Wang Zechen, Dou Zhicheng, Wen Ji-rong
- Evaluating And Mitigating Linguistic Discrimination In Large Language Models Dong Guoliang, Wang Haoyu, Sun Jun, Wang Xinyu
- Baichuanseed: Sharing The Potential Of Extensive Data Collection And Deduplication By Introducing A Competitive Large Language Model Baseline Dong Guosheng, Pan Da, Sun Yiding, Zhang Shusen, Liang Zheng, Wu Xin, Shen Yanjun, Yang Fan, Sun Haoze, Li Tianpeng, Lin Mingan, Xu Jianhua, Zhang Yufan, Nie Xiaonan, Su Lei, Wang Bingning, Zhang Wentao, Mao Jiaxin, Zhou Zenan, Chen Weipeng
- RLHF Workflow: From Reward Modeling To Online RLHF Dong Hanze, Xiong Wei, Pang Bo, Wang Haoxiang, Zhao Han, Zhou Yingbo, Jiang Nan, Sahoo Doyen, Xiong Caiming, Zhang Tong
- Modality-aware Integration With Large Language Models For Knowledge-based Visual Question Answering Dong Junnan, Zhang Qinggang, Zhou Huachi, Zha Daochen, Zheng Pai, Huang Xiao
- Pruner-zero: Evolving Symbolic Pruning Metric From Scratch For Large Language Models Dong Peijie, Li Lujun, Tang Zhenheng, Liu Xiang, Pan Xinglin, Wang Qiang, Chu Xiaowen
- Contrans: Weak-to-strong Alignment Engineering Via Concept Transplantation Dong Weilong, Wu Xinwei, Jin Renren, Xu Shaoyang, Xiong Deyi
- Internlm-xcomposer2-4khd: A Pioneering Large Vision-language Model Handling Resolutions From 336 Pixels To 4K HD Dong Xiaoyi, Zhang Pan, Zang Yuhang, Cao Yuhang, Wang Bin, Ouyang Linke, Zhang Songyang, Duan Haodong, Zhang Wenwei, Li Yining, Yan Hang, Gao Yang, Chen Zhe, Zhang Xinyue, Li Wei, Li Jingwen, Wang Wenhai, Chen Kai, He Conghui, Zhang Xingcheng, Dai Jifeng, Qiao Yu, Lin Dahua, Wang Jiaqi
- A Framework For Real-time Safeguarding The Text Generation Of Large Language Model Dong Ximing, Lin Dayi, Wang Shaowei, Hassan Ahmed E.
- Unmemorization In Large Language Models Via Self-distillation And Deliberate Imagination Dong Yijiang River, Lin Hongzhou, Belkin Mikhail, Huerta Ramon, Vulić Ivan
- Prompt Your Brain: Scaffold Prompt Tuning For Efficient Adaptation Of Fmri Pre-trained Model Dong Zijian, Wu Yilei, Chen Zijiao, Zhang Yichi, Jin Yueming, Zhou Juan Helen
- Biomedical Large Languages Models Seem Not To Be Superior To Generalist Models On Unseen Medical Data Dorfner Felix J., Dada Amin, Busch Felix, Makowski Marcus R., Han Tianyu, Truhn Daniel, Kleesiek Jens, Sushil Madhumita, Lammert Jacqueline, Adams Lisa C., Bressem Keno K.
- Training A Vision Language Model As Smartphone Assistant Dorka Nicolai, Marecki Janusz, Anwar Ammar
- To What Extent Is Chatgpt Useful For Language Teacher Lesson Plan Creation? Dornburg Alex, Davin Kristin
- What's Wrong With Your Code Generated By Large Language Models? An Extensive Study Dou Shihan, Jia Haoxiang, Wu Shenxi, Zheng Huiyuan, Zhou Weikang, Wu Muling, Chai Mingxu, Fan Jessica, Huang Caishuang, Tao Yunbo, Liu Yan, Zhou Enyu, Zhang Ming, Zhou Yuhao, Wu Yueming, Zheng Rui, Wen Ming, Weng Rongxiang, Wang Jingang, Cai Xunliang, Gui Tao, Qiu Xipeng, Zhang Qi, Huang Xuanjing
- Stepcoder: Improve Code Generation With Reinforcement Learning From Compiler Feedback Dou Shihan, Liu Yan, Jia Haoxiang, Xiong Limao, Zhou Enyu, Shen Wei, Shan Junjie, Huang Caishuang, Wang Xiao, Fan Xiaoran, Xi Zhiheng, Zhou Yuhao, Ji Tao, Zheng Rui, Zhang Qi, Huang Xuanjing, Gui Tao
- Towards Multimodal In-context Learning For Vision & Language Models Doveh Sivan, Perek Shaked, Mirza M. Jehanzeb, Lin Wei, Alfassy Amit, Arbelle Assaf, Ullman Shimon, Karlinsky Leonid
- Investigating Low-cost LLM Annotation For Spoken Dialogue Understanding Datasets Druart Lucas, Vielzeuf Valentin, Estève Yannick
- Evaluating Language Model Context Windows: A "Working Memory" Test And Inference-time Correction Dsouza Amanda, Glaze Christopher, Shin Changho, Sala Frederic
- Bitdistiller: Unleashing The Potential Of Sub-4-bit Llms Via Self-distillation Du Dayou, Zhang Yijia, Cao Shijie, Guo Jiaqi, Cao Ting, Chu Xiaowen, Xu Ningyi
- Efficient Personalized Text-to-image Generation By Leveraging Textual Subspace Du Shian, Cheng Xiaotian, Qian Qi, Wei Henglu, Xu Yi, Ji Xiangyang
- Unlocking Continual Learning Abilities In Language Models Du Wenyu, Cheng Shuang, Luo Tongxu, Qiu Zihan, Huang Zeyu, Cheung Ka Chun, Cheng Reynold, Fu Jie
- Stacking Your Transformers: A Closer Look At Model Growth For Efficient LLM Pre-training Du Wenyu, Luo Tongxu, Qiu Zihan, Huang Zeyu, Shen Yikang, Cheng Reynold, Guo Yike, Fu Jie
- Revisiting Moe And Dense Speed-accuracy Comparisons For LLM Training Du Xianzhi, Gunter Tom, Kong Xiang, Lee Mark, Wang Zirui, Zhang Aonan, Du Nan, Pang Ruoming
- Chinese Tiny LLM: Pretraining A Chinese-centric Large Language Model Du Xinrun, Yu Zhouliang, Gao Songyang, Pan Ding, Cheng Yuyang, Ma Ziyang, Yuan Ruibin, Qu Xingwei, Liu Jiaheng, Zheng Tianyu, Luo Xinchen, Zhou Guorui, Chen Wenhu, Zhang Ge
- Instruction-guided Scene Text Recognition Du Yongkun, Chen Zhineng, Su Yuchen, Jia Caiyan, Jiang Yu-gang
- CLEFT: Language-image Contrastive Learning With Efficient Large Language Model And Prompt Fine-tuning Du Yuexi, Chang Brian, Dvornek Nicha C.
- On The Evaluation Consistency Of Attribution-based Explanations Duan Jiarui, Li Haoling, Zhang Haofei, Jiang Hao, Xue Mengqi, Sun Li, Song Mingli, Song Jie
- How To Think Step-by-step: A Mechanistic Understanding Of Chain-of-thought Reasoning Dutta Subhabrata, Singh Joykirat, Chakrabarti Soumen, Chakraborty Tanmoy
- Your Context Is Not An Array: Unveiling Random Access Limitations In Transformers Ebrahimi Mohammadreza, Panchal Sunny, Memisevic Roland
- AXOLOTL: Fairness Through Assisted Self-debiasing Of Large Language Model Outputs Ebrahimi Sana, Chen Kaiwen, Asudeh Abolfazl, Das Gautam, Koudas Nick
- CROME: Cross-modal Adapters For Efficient Multimodal LLM Ebrahimi Sayna, Arik Sercan O., Nama Tejas, Pfister Tomas
- Lusifer: Llm-based User Simulated Feedback Environment For Online Recommender Systems Ebrat Danial, Rueda Luis
- Cognitive Bias In High-stakes Decision-making With Llms Echterhoff Jessica, Liu Yao, Alessa Abeer, Mcauley Julian, He Zexue
- Relation Extraction With Fine-tuned Large Language Models In Retrieval Augmented Generation Frameworks Efeoglu Sefika, Paschke Adrian
- Arabic Automatic Story Generation With Large Language Models El-shangiti Ahmed Oumar, Alwajih Fakhraddin, Abdul-mageed Muhammad
- Creating Arabic LLM Prompts At Scale El-sheikh Abdelrahman, Elmogtaba Ahmed, Darwish Kareem, Elmallah Muhammad, Elneima Ashraf, Sawaf Hassan
- Layerskip: Enabling Early Exit Inference And Self-speculative Decoding Elhoushi Mostafa, Shrivastava Akshat, Liskovich Diana, Hosmer Basil, Wasti Bram, Lai Liangzhen, Mahmoud Anas, Acun Bilge, Agarwal Saurabh, Roman Ahmed, Aly Ahmed A, Chen Beidi, Wu Carole-jean
- Enabling Natural Zero-shot Prompting On Encoder Models Via Statement-tuning Elshabrawy Ahmed, Huang Yongxin, Gurevych Iryna, Aji Alham Fikri
- Reasoning In Transformers -- Mitigating Spurious Correlations And Reasoning Shortcuts Enström Daniel, Kjellberg Viktor, Johansson Moa
- Navigating Uncertainty: Optimizing API Dependency For Hallucination Reduction In Closed-book Question Answering Erbacher Pierre, Falissar Louis, Guigue Vincent, Soulier Laure
- Tinyagent: Function Calling At The Edge Erdogan Lutfi Eren, Lee Nicholas, Jha Siddharth, Kim Sehoon, Tabrizi Ryan, Moon Suhong, Hooper Coleman, Anumanchipalli Gopala, Keutzer Kurt, Gholami Amir
- Empirical Studies Of Parameter Efficient Methods For Large Language Models Of Code And Knowledge Transfer To R Esmaeili Amirreza, Saberi Iman, Fard Fatemeh H.
- Bertaqa: How Much Do Language Models Know About Local Culture? Etxaniz Julen, Azkune Gorka, Soroa Aitor, De Lacalle Oier Lopez, Artetxe Mikel
- Latxa: An Open Language Model And Evaluation Suite For Basque Etxaniz Julen, Sainz Oscar, Perez Naiara, Aldabe Itziar, Rigau German, Agirre Eneko, Ormazabal Aitor, Artetxe Mikel, Soroa Aitor
- Coda: Constrained Generation Based Data Augmentation For Low-resource NLP Evuru Chandra Kiran Reddy, Ghosh Sreyan, Kumar Sonal, S Ramaneswaran, Tyagi Utkarsh, Manocha Dinesh
- HLAT: High-quality Large Language Model Pre-trained On AWS Trainium Fan Haozheng, Zhou Hao, Huang Guangtai, Raman Parameswaran, Fu Xinwei, Gupta Gaurav, Ram Dhananjay, Wang Yida, Huan Jun
- Reformatted Alignment Fan Run-ze, Li Xuefeng, Zou Haoyang, Li Junlong, He Shwai, Chern Ethan, Hu Jiewen, Liu Pengfei
- A Survey On RAG Meeting Llms: Towards Retrieval-augmented Large Language Models Fan Wenqi, Ding Yujuan, Ning Liangbo, Wang Shijie, Li Hengyun, Yin Dawei, Chua Tat-seng, Li Qing
- Enhancing Noise Robustness Of Retrieval-augmented Language Models With Adaptive Adversarial Training Fang Feiteng, Bai Yuelin, Ni Shiwen, Yang Min, Chen Xiaojun, Xu Ruifeng
- Llama-omni: Seamless Speech Interaction With Large Language Models Fang Qingkai, Guo Shoutao, Zhou Yan, Ma Zhengrui, Zhang Shaolei, Feng Yang
- \(VILA^2\): VILA Augmented VILA Fang Yunhao, Zhu Ligeng, Lu Yao, Wang Yan, Molchanov Pavlo, Cho Jang Hyun, Pavone Marco, Han Song, Yin Hongxu
- Test Of Time: A Benchmark For Evaluating Llms On Temporal Reasoning Fatemi Bahare, Kazemi Mehran, Tsitsulin Anton, Malkan Karishma, Yim Jinyeong, Palowitch John, Seo Sungyong, Halcrow Jonathan, Perozzi Bryan
- Multi-modal Hallucination Control By Visual Information Grounding Favero Alessandro, Zancato Luca, Trager Matthew, Choudhary Siddharth, Perera Pramuditha, Achille Alessandro, Swaminathan Ashwin, Soatto Stefano
- Curlora: Stable LLM Continual Fine-tuning And Catastrophic Forgetting Mitigation Fawi Muhammad
- Evaluating Human Alignment And Model Faithfulness Of LLM Rationale Fayyaz Mohsen, Yin Fan, Sun Jiao, Peng Nanyun
- Video-ccam: Enhancing Video-language Understanding With Causal Cross-attention Masks For Short And Long Videos Fei Jiajun, Li Dian, Deng Zhidong, Wang Zekun, Liu Gang, Wang Hui
- Retrieval Meets Reasoning: Dynamic In-context Editing For Long-text Understanding Fei Weizhi, Niu Xueyan, Xie Guoqing, Zhang Yanhua, Bai Bo, Deng Lei, Han Wei
- Internlm-law: An Open Source Chinese Legal Large Language Model Fei Zhiwei, Zhang Songyang, Shen Xiaoyu, Zhu Dawei, Wang Xiao, Cao Maosong, Zhou Fengzhe, Li Yining, Zhang Wenwei, Lin Dahua, Chen Kai, Ge Jidong
- Large Language Model Interaction Simulator For Cold-start Item Recommendation Feiran Huang, Zhenghang Yang, Junyi Jiang, Yuanchen Bei, Yijie Zhang, Hao Chen
- IGOT: Information Gain Optimized Tokenizer On Domain Adaptive Pretraining Feng Dawei, Zhang Yihai, Xu Zhixuan
- Legend: Leveraging Representation Engineering To Annotate Safety Margin For Preference Datasets Feng Duanyu, Qin Bowen, Huang Chen, Huang Youcheng, Zhang Zheng, Lei Wenqiang
- Citygpt: Empowering Urban Spatial Cognition Of Large Language Models Feng Jie, Du Yuwei, Liu Tianhui, Guo Siqi, Lin Yuming, Li Yong
- Keypoint-based Progressive Chain-of-thought Distillation For Llms Feng Kaituo, Li Changsheng, Zhang Xiaolu, Zhou Jun, Yuan Ye, Wang Guoren
- An Extremely Data-efficient And Generative Llm-based Reinforcement Learning Agent For Recommenders Feng Shuang, Feng Grace
- From Pre-training Corpora To Large Language Models: What Factors Influence LLM Performance In Causal Discovery Tasks? Feng Tao, Qu Lizhen, Tandon Niket, Li Zhuang, Kang Xiaoxi, Haffari Gholamreza
- Exploring Automated Distractor Generation For Math Multiple-choice Questions Via Large Language Models Feng Wanyong, Lee Jaewook, Mcnichols Hunter, Scarlatos Alexander, Smith Digory, Woodhead Simon, Ornelas Nancy Otero, Lan Andrew
- Optimus: Accelerating Large-scale Multi-modal LLM Training By Bubble Exploitation Feng Weiqi, Chen Yangrui, Wang Shaoyu, Peng Yanghua, Lin Haibin, Yu Minlan
- Mixture-of-loras: An Efficient Multitask Tuning For Large Language Models Feng Wenfeng, Hao Chuzhan, Zhang Yuewei, Han Yu, Wang Hao
- Mt-ladder: A Model-agnostic Framework Boosting Llm-based Machine Translation To The Next Level Feng Zhaopeng, Zhang Yan, Chen Ruizhe, Meng Zijie, Liu Zuozhu
- Into The Unknown: Self-learning Large Language Models Ferdinan Teddy, Kocoń Jan, Kazienko Przemysław
- Generative AI And Large Language Models For Cyber Security: All Insights You Need Ferrag Mohamed Amine, Alwahedi Fatima, Battah Ammar, Cherif Bilel, Mechri Abdechakour, Tihanyi Norbert
- Exaranker-open: Synthetic Explanation For IR Using Open-source Llms Ferraretto Fernando, Laitz Thiago, Lotufo Roberto, Nogueira Rodrigo
- Convosense: Overcoming Monotonous Commonsense Inferences For Conversational AI Finch Sarah E., Choi Jinho D.
- Introducing The Newspalm MBR And QE Dataset: Llm-generated High-quality Parallel Data Outperforms Traditional Web-crawled Data Finkelstein Mara, Vilar David, Freitag Markus
- RAG Foundry: A Framework For Enhancing Llms For Retrieval Augmented Generation Fleischer Daniel, Berchansky Moshe, Wasserblat Moshe, Izsak Peter
- Re-adaptir: Improving Information Retrieval Through Reverse Engineered Adaptation Fleshman William, Van Durme Benjamin
- CLIP With Generative Latent Replay: A Strong Baseline For Incremental Learning Frascaroli Emanuele, Panariello Aniello, Buzzega Pietro, Bonicelli Lorenzo, Porrello Angelo, Calderara Simone
- Hint-before-solving Prompting: Guiding Llms To Effectively Utilize Encoded Knowledge Fu Jinlan, Huangfu Shenzhen, Yan Hang, Ng See-kiong, Qiu Xipeng
- A Role-specific Guided Large Language Model For Ophthalmic Consultation Based On Stylistic Differentiation Fu Laiyi, Fan Binbin, Du Hongkai, Feng Yanxiang, Li Chunhua, Song Huping
- Lazyllm: Dynamic Token Pruning For Efficient Long Context LLM Inference Fu Qichen, Cho Minsik, Merth Thomas, Mehta Sachin, Rastegari Mohammad, Najibi Mahyar
- Disperse-then-merge: Pushing The Limits Of Instruction Tuning Via Alignment Tax Reduction Fu Tingchen, Cai Deng, Liu Lemao, Shi Shuming, Yan Rui
- Tiny Titans: Can Smaller Large Language Models Punch Above Their Weight In The Real World For Meeting Summarization? Fu Xue-yong, Laskar Md Tahmid Rahman, Khasanova Elena, Chen Cheng, Tn Shashi Bhushan
- Data Engineering For Scaling Language Models To 128K Context Fu Yao, Panda Rameswar, Niu Xinyao, Yue Xiang, Hajishirzi Hannaneh, Kim Yoon, Peng Hao
- UI-JEPA: Towards Active Perception Of User Intent Through Onscreen User Activity Fu Yicheng, Anantha Raviteja, Vashisht Prabal, Cheng Jianpeng, Littwin Etai
- Using Large Language Models For The Interpretation Of Building Regulations Fuchs Stefan, Witbrock Michael, Dimyadi Johannes, Amor Robert
- Continual Pre-training For Cross-lingual LLM Adaptation: Enhancing Japanese Language Capabilities Fujii Kazuki, Nakamura Taishi, Loem Mengsay, Iida Hiroki, Ohi Masanari, Hattori Kakeru, Shota Hirai, Mizuki Sakae, Yokota Rio, Okazaki Naoaki
- Layoutllm: Large Language Model Instruction Tuning For Visually Rich Document Understanding Fujitake Masato
- Thinking Fair And Slow: On The Efficacy Of Structured Prompts For Debiasing Language Models Furniturewala Shaz, Jandial Surgan, Java Abhinav, Banerjee Pragyan, Shahid Simra, Bhatia Sumit, Jaidka Kokil
- Zero-shot Persuasive Chatbots With Llm-generated Strategies And Information Retrieval Furumai Kazuaki, Legaspi Roberto, Vizcarra Julio, Yamazaki Yudai, Nishimura Yasutaka, Semnani Sina J., Ikeda Kazushi, Shi Weiyan, Lam Monica S.
- Student Answer Forecasting: Transformer-driven Answer Choice Prediction For Language Learning Gado Elena Grazia, Martorella Tommaso, Zunino Luca, Mejia-domenzain Paola, Swamy Vinitra, Frej Jibril, Käser Tanja
- RAVE: Residual Vector Embedding For Clip-guided Backlit Image Enhancement Gaintseva Tatiana, Benning Martin, Slabaugh Gregory
- Airavata: Introducing Hindi Instruction-tuned LLM Gala Jay, Jayakumar Thanmay, Husain Jaavid Aktar, M Aswanth Kumar, Khan Mohammed Safi Ur Rahman, Kanojia Diptesh, Puduppully Ratish, Khapra Mitesh M., Dabre Raj, Murthy Rudra, Kunchukuttan Anoop
- Open Generative Large Language Models For Galician Gamallo Pablo, Rodríguez Pablo, De-dios-flores Iria, Sotelo Susana, Paniagua Silvia, Bardanca Daniel, Pichel José Ramom, Garcia Marcos
- Application Of LLM Agents In Recruitment: A Novel Framework For Resume Screening Gan Chengguang, Zhang Qinghao, Mori Tatsunori
- Multilingual Pretraining And Instruction Tuning Improve Cross-lingual Knowledge Alignment, But Only Shallowly Gao Changjiang, Hu Hongda, Hu Peng, Chen Jiajun, Li Jixing, Huang Shujian
- Practical Unlearning For Large Language Models Gao Chongyang, Wang Lixu, Weng Chenkai, Wang Xiao, Zhu Qi
- The Best Of Both Worlds: Toward An Honest And Helpful Large Language Model Gao Chujie, Zhang Qihui, Chen Dongping, Huang Yue, Wu Siyuan, Fu Zhengyan, Wan Yao, Zhang Xiangliang, Sun Lichao
- Aligning LLM Agents By Learning Latent Preference From User Edits Gao Ge, Taymanov Alexey, Salinas Eduardo, Mineiro Paul, Misra Dipendra
- Selfcp: Compressing Over-limit Prompt Via The Frozen Large Language Model Itself Gao Jun, Cao Ziqiang, Li Wenjie
- Harnessing The Intrinsic Knowledge Of Pretrained Language Models For Challenging Text Classification Settings Gao Lingyu
- Tc-llava: Rethinking The Transfer From Image To Video Understanding With Temporal Considerations Gao Mingze, Liu Jingyu, Li Mingda, Xie Jiangtao, Liu Qingbin, Zhao Bo, Chen Xi, Xiong Hui
- Autovcoder: A Systematic Framework For Automated Verilog Code Generation Using Llms Gao Mingzhe, Zhao Jieru, Lin Zhe, Ding Wenchao, Hou Xiaofeng, Feng Yu, Li Chao, Guo Minyi
- Lumina-t2x: Transforming Text Into Any Modality, Resolution, And Duration Via Flow-based Large Diffusion Transformers Gao Peng, Zhuo Le, Liu Dongyang, Du Ruoyi, Luo Xu, Qiu Longtian, Zhang Yuhang, Lin Chen, Huang Rongjie, Geng Shijie, Zhang Renrui, Xi Junlin, Shao Wenqi, Jiang Zhengkai, Yang Tianshuo, Ye Weicai, Tong He, He Jingwen, Qiao Yu, Li Hongsheng
- Efficient Tool Use With Chain-of-abstraction Reasoning Gao Silin, Dwivedi-yu Jane, Yu Ping, Tan Xiaoqing Ellen, Pasunuru Ramakanth, Golovneva Olga, Sinha Koustuv, Celikyilmaz Asli, Bosselut Antoine, Wang Tianlu
- Improving Language Understanding From Screenshots Gao Tianyu, Wang Zirui, Bhaskar Adithya, Chen Danqi
- Compensate Quantization Errors: Make Weights Hierarchical To Compensate Each Other Gao Yifei, Ou Jie, Wang Lei, Xiao Yuting, Xiang Zhiyuan, Dai Ruiting, Cheng Jun
- Worldgpt: Empowering LLM As Multimodal World Model Ge Zhiqi, Huang Hongzhe, Zhou Mingze, Li Juncheng, Wang Guoming, Tang Siliang, Zhuang Yueting
- Does Object Grounding Really Reduce Hallucination Of Large Vision-language Models? Geigle Gregor, Timofte Radu, Glavaš Goran
- Coercing Llms To Do And Reveal (almost) Anything Geiping Jonas, Stein Alex, Shu Manli, Saifullah Khalid, Wen Yuxin, Goldstein Tom
- Attacking Large Language Models With Projected Gradient Descent Geisler Simon, Wollschläger Tom, Abdalla M. H. I., Gasteiger Johannes, Günnemann Stephan
- Does Fine-tuning Llms On New Knowledge Encourage Hallucinations? Gekhman Zorik, Yona Gal, Aharoni Roee, Eyal Matan, Feder Amir, Reichart Roi, Herzig Jonathan
- Can Large Language Models Learn Independent Causal Mechanisms? Gendron Gaël, Nguyen Bao Trung, Peng Alex Yuxuan, Witbrock Michael, Dobbie Gillian
- Breaking The Length Barrier: Llm-enhanced CTR Prediction In Long Textual User Behaviors Geng Binzong, Huan Zhaoxin, Zhang Xiaolu, He Yong, Zhang Liang, Yuan Fajie, Zhou Jun, Mo Linjian
- Sketch-guided Constrained Decoding For Boosting Blackbox Large Language Models Without Logit Access Geng Saibo, Döner Berkay, Wendler Chris, Josifoski Martin, West Robert
- Why Not Transform Chat Large Language Models To Non-english? Geng Xiang, Zhu Ming, Li Jiahuan, Lai Zhejian, Zou Wei, She Shuaijie, Guo Jiaxin, Zhao Xiaofeng, Li Yinglu, Li Yuang, Su Chang, Zhao Yanqing, Lyu Xinglin, Zhang Min, Chen Jiajun, Yang Hao, Huang Shujian
- Beyond Self-learned Attention: Mitigating Attention Bias In Transformer-based Models Using Attention Guidance Gesi Jiri, Ahmed Iftekhar
- Jailbreaking Llms With Arabic Transliteration And Arabizi Ghanim Mansour Al, Almohaimeed Saleh, Zheng Mengxin, Solihin Yan, Lou Qian
- AI And Memory Wall Gholami Amir, Yao Zhewei, Kim Sehoon, Hooper Coleman, Mahoney Michael W., Keutzer Kurt
- Exploring The Frontier Of Vision-language Models: A Survey Of Current Methodologies And Future Directions Ghosh Akash, Acharya Arkadeep, Saha Sriparna, Jain Vinija, Chadha Aman
- A Closer Look At The Limitations Of Instruction Tuning Ghosh Sreyan, Evuru Chandra Kiran Reddy, Kumar Sonal, S Ramaneswaran, Aneja Deepali, Jin Zeyu, Duraiswami Ramani, Manocha Dinesh
- Investigating The Translation Capabilities Of Large Language Models Trained On Parallel Data Only Gilabert Javier García, Escolano Carlos, Savall Aleix Sant, Fornaciari Francesca De Luca, Mash Audrey, Liao Xixian, Melero Maite
- Uncovering Hidden Intentions: Exploring Prompt Recovery For Deeper Insights Into Generated Texts Give Louis, Zaoral Timo, Bruno Maria Antonietta
- Almol: Aligned Language-molecule Translation Llms Through Offline Preference Contrastive Optimisation Gkoumas Dimitris
- Feedback-aligned Mixed Llms For Machine Language-molecule Translation Gkoumas Dimitris, Liakata Maria
- Chatglm: A Family Of Large Language Models From GLM-130B To GLM-4 All Tools Glm Team, Zeng Aohan, Xu Bin, Wang Bowen, Zhang Chenhui, Yin Da, Zhang Dan, Rojas Diego, Feng Guanyu, Zhao Hanlin, Lai Hanyu, Yu Hao, Wang Hongning, Sun Jiadai, Zhang Jiajie, Cheng Jiale, Gui Jiayi, Tang Jie, Zhang Jing, Sun Jingyu, Li Juanzi, Zhao Lei, Wu Lindong, Zhong Lucen, Liu Mingdao, Huang Minlie, Zhang Peng, Zheng Qinkai, Lu Rui, Duan Shuaiqi, Zhang Shudan, Cao Shulin, Yang Shuxun, Tam Weng Lam, Zhao Wenyi, Liu Xiao, Xia Xiao, Zhang Xiaohan, Gu Xiaotao, Lv Xin, Liu Xinghan, Liu Xinyi, Yang Xinyue, Song Xixuan, Zhang Xunkai, An Yifan, Xu Yifan, Niu Yilin, Yang Yuantao, Li Yueyan, Bai Yushi, Dong Yuxiao, Qi Zehan, Wang Zhaoyu, Yang Zhen, Du Zhengxiao, Hou Zhenyu, Wang Zihan
- Better & Faster Large Language Models Via Multi-token Prediction Gloeckle Fabian, Idrissi Badr Youbi, Rozière Baptiste, Lopez-paz David, Synnaeve Gabriel
- Plan-grounded Large Language Models For Dual Goal Conversational Settings Glória-silva Diogo, Ferreira Rafael, Tavares Diogo, Semedo David, Magalhães João
- Arcee's Mergekit: A Toolkit For Merging Large Language Models Goddard Charles, Siriwardhana Shamane, Ehghaghi Malikeh, Meyers Luke, Karpukhin Vlad, Benedict Brian, Mcquade Mark, Solawetz Jacob
- Direct Alignment Of Draft Model For Speculative Decoding With Chat-fine-tuned Llms Goel Raghavv, Gagrani Mukul, Jeon Wonseok, Park Junyoung, Lee Mingu, Lott Christopher
- Memorization In In-context Learning Golchin Shahriar, Surdeanu Mihai, Bethard Steven, Blanco Eduardo, Riloff Ellen
- Goldfinch: High Performance Rwkv/transformer Hybrid With Linear Pre-fill And Extreme Kv-cache Compression Goldstein Daniel, Obeid Fares, Alcaide Eric, Song Guangyu, Cheah Eugene
- Transforming Llms Into Cross-modal And Cross-lingual Retrieval Systems Gomez Frank Palma, Sanabria Ramon, Sung Yun-hsuan, Cer Daniel, Dalmia Siddharth, Abrego Gustavo Hernandez
- Omnifusion Technical Report Goncharova Elizaveta, Razzhigaev Anton, Mikhalchuk Matvey, Kurkin Maxim, Abdullaeva Irina, Skripkin Matvey, Oseledets Ivan, Dimitrov Denis, Kuznetsov Andrey
- AST-T5: Structure-aware Pretraining For Code Generation And Understanding Gong Linyuan, Elhoushi Mostafa, Cheung Alvin
- Evaluation Of Llms On Syntax-aware Code Fill-in-the-middle Tasks Gong Linyuan, Wang Sida, Elhoushi Mostafa, Cheung Alvin
- LLMC: Benchmarking Large Language Model Quantization With A Versatile Compression Toolkit Gong Ruihao, Yong Yang, Gu Shiqiao, Huang Yushi, Lv Chentao, Zhang Yunchen, Liu Xianglong, Tao Dacheng
- The Advantages Of Context Specific Language Models: The Case Of The Erasmian Language Model Gonçalves João, Jelicic Nick, Murgia Michele, Stamhuis Evert
- Estimating Knowledge In Large Language Models Without Generating A Single Token Gottesman Daniela, Geva Mor
- Eyes Closed, Safety On: Protecting Multimodal Llms Via Image-to-text Transformation Gou Yunhao, Chen Kai, Liu Zhili, Hong Lanqing, Xu Hang, Li Zhenguo, Yeung Dit-yan, Kwok James T., Zhang Yu
- COBIAS: Contextual Reliability In Bias Assessment Govil Priyanshul, Jain Hemang, Bonagiri Vamshi Krishna, Chadha Aman, Kumaraguru Ponnurangam, Gaur Manas, Dey Sanorita
- Few Shot Chain-of-thought Driven Reasoning To Prompt Llms For Open Ended Medical Question Answering Gramopadhye Ojas, Nachane Saeel Sandeep, Chanda Prateek, Ramakrishnan Ganesh, Jadhav Kshitij Sharad, Nandwani Yatin, Raghu Dinesh, Joshi Sachindra
- From Generalist To Specialist: Improving Large Language Models For Medical Physics Using Arcot Grandinetti Jace, Mcbeth Rafe
- Is Mamba Capable Of In-context Learning? Grazzi Riccardo, Siems Julien, Schrodi Simon, Brox Thomas, Hutter Frank
- Stress-testing Capability Elicitation With Password-locked Models Greenblatt Ryan, Roger Fabien, Krasheninnikov Dmitrii, Krueger David
- Fine-tuned Language Models Generate Stable Inorganic Materials As Text Gruver Nate, Sriram Anuroop, Madotto Andrea, Wilson Andrew Gordon, Zitnick C. Lawrence, Ulissi Zachary
- Model Editing Harms General Abilities Of Large Language Models: Regularization To The Rescue Gu Jia-chen, Xu Hao-xiang, Ma Jun-yu, Lu Pan, Ling Zhen-hua, Chang Kai-wei, Peng Nanyun
- CMR Scaling Law: Predicting Critical Mixture Ratios For Continual Pre-training Of Language Models Gu Jiawei, Yang Zacc, Ding Chuanghao, Zhao Rui, Tan Fei
- A Survey On Responsible Generative AI: What To Generate And What Not Gu Jindong
- Teams-rl: Teaching Llms To Generate Better Instruction Datasets Via Reinforcement Learning Gu Shangding, Knoll Alois, Jin Ming
- RWKV-CLIP: A Robust Vision-language Representation Learner Gu Tiancheng, Yang Kaicheng, An Xiang, Feng Ziyong, Liu Dongnan, Cai Weidong, Deng Jiankang
- Anah-v2: Scaling Analytical Hallucination Annotation Of Large Language Models Gu Yuzhe, Ji Ziwei, Zhang Wenwei, Lyu Chengqi, Lin Dahua, Chen Kai
- Structext-eval: An Autogenerated Benchmark For Evaluating Large Language Model's Ability In Structure-rich Text Understanding Gu Zhouhong, Ye Haoning, Zhou Zeyang, Feng Hongwei, Xiao Yanghua
- Codeip: A Grammar-guided Multi-bit Watermark For Large Language Models Of Code Guan Batu, Wan Yao, Bi Zhangqian, Wang Zheng, Zhang Hongyu, Zhou Pan, Sun Lichao
- AMOR: A Recipe For Building Adaptable Modular Knowledge Agents Through Process Feedback Guan Jian, Wu Wei, Wen Zujie, Xu Peng, Wang Hongning, Huang Minlie
- Probing The Robustness Of Vision-language Pretrained Models: A Multimodal Adversarial Attack Approach Guan Jiwei, Ding Tianyu, Cao Longbing, Pan Lei, Wang Chen, Zheng Xi
- Enhancing Collaborative Semantics Of Language Model-driven Recommendations Via Graph-aware Learning Guan Zhong, Wu Likang, Zhao Hongke, He Ming, Fan Jianpin
- APTQ: Attention-aware Post-training Mixed-precision Quantization For Large Language Models Guan Ziyi, Huang Hantao, Su Yupeng, Huang Hong, Wong Ngai, Yu Hao
- Boosting Lossless Speculative Decoding Via Feature Sampling And Partial Alignment Distillation Gui Lujun, Xiao Bin, Su Lei, Chen Weipeng
- Lisbon Computational Linguists At Semeval-2024 Task 2: Using A Mistral 7B Model And Data Augmentation Guimarães Artur, Martins Bruno, Magalhães João
- On The Interchangeability Of Positional Embeddings In Multilingual Neural Machine Translation Models Gumma Varun, Chitale Pranjal A., Bali Kalika
- Conversational Topic Recommendation In Counseling And Psychotherapy With Decision Transformer And Large Language Models Gunal Aylin, Lin Baihan, Bouneffouf Djallel
- Sample Design Engineering: An Empirical Study Of What Makes Good Downstream Fine-tuning Samples For Llms Guo Biyang, Wang He, Xiao Wenyilin, Chen Hong, Lee Zhuxin, Han Songqiao, Huang Hailiang
- A Novel Paradigm Boosting Translation Capabilities Of Large Language Models Guo Jiaxin, Yang Hao, Li Zongyao, Wei Daimeng, Shang Hengchao, Chen Xiaoyu
- Direct Language Model Alignment From Online AI Feedback Guo Shangmin, Zhang Biao, Liu Tianlin, Liu Tianqi, Khalman Misha, Llinares Felipe, Rame Alexandre, Mesnard Thomas, Zhao Yao, Piot Bilal, Ferret Johan, Blondel Mathieu
- SGSH: Stimulate Large Language Models With Skeleton Heuristics For Knowledge Base Question Generation Guo Shasha, Liao Lizi, Zhang Jing, Wang Yanling, Li Cuiping, Chen Hong
- Agent-simt: Agent-assisted Simultaneous Machine Translation With Large Language Models Guo Shoutao, Zhang Shaolei, Ma Zhengrui, Zhang Min, Feng Yang
- Sillm: Large Language Models For Simultaneous Machine Translation Guo Shoutao, Zhang Shaolei, Ma Zhengrui, Zhang Min, Feng Yang
- Ds-agent: Automated Data Science By Empowering Large Language Models With Case-based Reasoning Guo Siyuan, Deng Cheng, Wen Ying, Chen Hechang, Chang Yi, Wang Jun
- Fine-tuning Large Language Models For Stock Return Prediction Using Newsflow Guo Tian, Hauptmann Emmanuel
- Serial Position Effects Of Large Language Models Guo Xiaobo, Vosoughi Soroush
- Generative AI For Synthetic Data Generation: Methods, Challenges And The Future Guo Xu, Chen Yiqiang
- Efficient Continual Pre-training By Mitigating The Stability Gap Guo Yiduo, Fu Jie, Zhang Huishuai, Zhao Dongyan, Shen Yikang
- GPTQT: Quantize Large Language Models Twice To Push The Efficiency Guo Yipin, Lang Yilin, Ren Qinyuan
- API Pack: A Massive Multi-programming Language Dataset For API Call Generation Guo Zhen, Soria Adriana Meza, Sun Wei, Shen Yikang, Panda Rameswar
- METAREFLECTION: Learning Instructions For Language Agents Using Past Reflections Gupta Priyanshu, Kirtania Shashank, Singha Ananya, Gulwani Sumit, Radhakrishna Arjun, Shi Sherry, Soares Gustavo
- Selective Self-rehearsal: A Fine-tuning Approach To Improve Generalization In Large Language Models Gupta Sonam, Nandwani Yatin, Yehudai Asaf, Mishra Mayank, Pandey Gaurav, Raghu Dinesh, Joshi Sachindra
- Multilingual Large Language Models And Curse Of Multilinguality Gurgurov Daniil, Bäumel Tanja, Anikina Tatiana
- Hipporag: Neurobiologically Inspired Long-term Memory For Large Language Models Gutiérrez Bernal Jiménez, Shu Yiheng, Gu Yu, Yasunaga Michihiro, Su Yu
- Would I Lie To You? Inference Time Alignment Of Language Models Using Direct Preference Heads Hadji-kyriacou Avelina Asada, Arandjelovic Ognjen
- Single Parent Family: A Spectrum Of Family Members From A Single Pre-trained Foundation Model Hajimolahoseini Habib, Hassanpour Mohammad, Ataiefard Foozhan, Chen Boxing, Liu Yang
- Mamba State-space Models Can Be Strong Downstream Learners Halloran John T., Gulati Manbir, Roysdon Paul F.
- Synthclip: Are We Ready For A Fully Synthetic CLIP Training? Hammoud Hasan Abed Al Kader, Itani Hani, Pizzati Fabio, Torr Philip, Bibi Adel, Ghanem Bernard
- Sltrain: A Sparse Plus Low-rank Approach For Parameter And Memory Efficient Pretraining Han Andi, Li Jiaxiang, Huang Wei, Hong Mingyi, Takeda Akiko, Jawanpuria Pratik, Mishra Bamdev
- Llm-personalize: Aligning LLM Planners With Human Preferences Via Reinforced Self-training For Housekeeping Robots Han Dongge, Mcinroe Trevor, Jelley Adam, Albrecht Stefano V., Bell Peter, Storkey Amos
- MERLIN: Multimodal Embedding Refinement Via Llm-based Iterative Navigation For Text-video Retrieval-rerank Pipeline Han Donghoon, Park Eunhwan, Lee Gisang, Lee Adam, Kwak Nojun
- Small Language Model Can Self-correct Han Haixia, Liang Jiaqing, Shi Jie, He Qianyu, Xiao Yanghua
- Transfer Learning For Text Diffusion Models Han Kehang, Kenealy Kathleen, Barua Aditya, Fiedel Noah, Constant Noah
- Chatgpt Based Data Augmentation For Improved Parameter-efficient Debiasing Of Llms Han Pengrui, Kocielnik Rafal, Saravanan Adhithya, Jiang Roy, Sharir Or, Anandkumar Anima
- Enhancing CTR Prediction Through Sequential Recommendation Pre-training: Introducing The SRP4CTR Framework Han Ruidong, Li Qianzhong, Jiang He, Li Rui, Zhao Yurou, Li Xiang, Lin Wei
- Wildguard: Open One-stop Moderation Tools For Safety Risks, Jailbreaks, And Refusals Of Llms Han Seungju, Rao Kavel, Ettinger Allyson, Jiang Liwei, Lin Bill Yuchen, Lambert Nathan, Choi Yejin, Dziri Nouha
- Value Augmented Sampling For Language Model Alignment And Personalization Han Seungwook, Shenfeld Idan, Srivastava Akash, Kim Yoon, Agrawal Pulkit
- Towards Robust Instruction Tuning On Multimodal Large Language Models Han Wei, Chen Hui, Poria Soujanya
- COCO Is "ALL" You Need For Visual Instruction Fine-tuning Han Xiaotian, Wang Yiqi, Zhai Bohan, You Quanzeng, Yang Hongxia
- EMMA: Your Text-to-image Diffusion Model Can Secretly Accept Multi-modal Prompts Han Yucheng, Wang Rui, Zhang Chi, Hu Juntao, Cheng Pei, Fu Bin, Zhang Hanwang
- A Survey On Hallucination In Large Vision-language Models Hanchao Liu, Wenyuan Xue, Yifei Chen, Dapeng Chen, Xiutian Zhao, Ke Wang, Liping Hou, Rongjun Li, Wei Peng
- Vision-and-language Navigation Generative Pretrained Transformer Hanlin Wen
- Be Like A Goldfish, Don't Memorize! Mitigating Memorization In Generative Llms Hans Abhimanyu, Wen Yuxin, Jain Neel, Kirchenbauer John, Kazemi Hamid, Singhania Prajwal, Singh Siddharth, Somepalli Gowthami, Geiping Jonas, Bhatele Abhinav, Goldstein Tom
- Learning To Rewrite: Generalized Llm-generated Text Detection Hao Wei, Li Ran, Zhao Weiliang, Yang Junfeng, Mao Chengzhi
- Exploring Chatgpt And Its Impact On Society Haque Md. Asraful, Li Shuai
- Risk And Response In Large Language Models: Evaluating Key Threat Categories Harandizadeh Bahareh, Salinas Abel, Morstatter Fred
- Spectrum: Targeted Training On Signal To Noise Ratio Hartford Eric, Atkins Lucas, Neto Fernando Fernandes, Golchinfar David
- Pruning For Protection: Increasing Jailbreak Resistance In Aligned Llms Without Fine-tuning Hasan Adib, Rugina Ileana, Wang Alex
- The Unreasonable Effectiveness Of Easy Training Data For Hard Tasks Hase Peter, Bansal Mohit, Clark Peter, Wiegreffe Sarah
- A Comprehensive Overview Of Large Language Models (llms) For Cyber Defences: Opportunities And Directions Hassanin Mohammed, Moustafa Nour
- Teaching Large Language Models To Reason With Reinforcement Learning Havrilla Alex, Du Yuqing, Raparthy Sharath Chandra, Nalmpantis Christoforos, Dwivedi-yu Jane, Zhuravinskyi Maksym, Hambro Eric, Sukhbaatar Sainbayar, Raileanu Roberta
- Glore: When, Where, And How To Improve LLM Reasoning Via Global And Local Refinements Havrilla Alex, Raparthy Sharath, Nalmpantis Christoforus, Dwivedi-yu Jane, Zhuravinskyi Maksym, Hambro Eric, Raileanu Roberta
- Chain-of-instructions: Compositional Instruction Tuning On Large Language Models Hayati Shirley Anugrah, Jung Taehee, Bodding-long Tristan, Kar Sudipta, Sethy Abhinav, Kim Joo-kyung, Kang Dongyeop
- Safety Arithmetic: A Framework For Test-time Safety Alignment Of Language Models By Steering Parameters And Activations Hazra Rima, Layek Sayan, Banerjee Somnath, Poria Soujanya
- Can Large Language Models Reason? A Characterization Via 3-SAT Hazra Rishi, Venturato Gabriele, Martires Pedro Zuidberg Dos, De Raedt Luc
- Zero-shot Generalization During Instruction Tuning: Insights From Similarity And Granularity He Bingxiang, Ding Ning, Qian Cheng, Deng Jia, Cui Ganqu, Yuan Lifan, Gao Huan-ang, Chen Huimin, Liu Zhiyuan, Sun Maosong
- A Law Of Next-token Prediction In Large Language Models He Hangfeng, Su Weijie J.
- Cos: Enhancing Personalization And Mitigating Bias With Context Steering He Jerry Zhi-yang, Pandey Sashrika, Schrum Mariah L., Dragan Anca
- From Words To Actions: Unveiling The Theoretical Underpinnings Of Llm-driven Autonomous Systems He Jianliang, Chen Siyu, Zhang Fengzhuo, Yang Zhuoran
- Multi-modal Instruction Tuned Llms With Fine-grained Visual Perception He Junwen, Wang Yifan, Wang Lijun, Lu Huchuan, He Jun-yan, Lan Jin-peng, Luo Bin, Xie Xuansong
- Prompt-based Bias Calibration For Better Zero/few-shot Learning Of Language Models He Kang, Long Yinghan, Roy Kaushik
- Advancing Process Verification For Large Language Models Via Tree-based Preference Learning He Mingqian, Shen Yongliang, Zhang Wenqi, Tan Zeqi, Lu Weiming
- Efficient LLM Inference With Kcache He Qiaozhi, Wu Zhihua
- Can Language Models Act As Knowledge Bases At Scale? He Qiyuan, Wang Yizhong, Wang Wenya
- Achieving Tool Calling Functionality In Llms Using Only Prompt Engineering Without Fine-tuning He Shengtao
- Prompting Chatgpt For Translation: A Comparative Analysis Of Translation Brief And Persona Prompts He Sui
- MARS: Mixture Of Auto-regressive Models For Fine-grained Text-to-image Synthesis He Wanggui, Fu Siming, Liu Mushui, Wang Xierui, Xiao Wenyi, Shu Fangxun, Wang Yi, Zhang Lei, Yu Zhelun, Li Haoyuan, Huang Ziwei, Gan Leilei, Jiang Hao
- Transferring Troubles: Cross-lingual Transferability Of Backdoor Attacks In Llms With Instruction Tuning He Xuanli, Wang Jun, Xu Qiongkai, Minervini Pasquale, Stenetorp Pontus, Rubinstein Benjamin I. P., Cohn Trevor
- SHED: Shapley-based Automated Dataset Refinement For Instruction Fine-tuning He Yexiao, Wang Ziyao, Shen Zheyu, Sun Guoheng, Dai Yucong, Wu Yongkai, Wang Hongyi, Li Ang
- Don't Half-listen: Capturing Key-part Information In Continual Instruction Tuning He Yongquan, Huang Xuancheng, Tang Minghao, Meng Lingxun, Li Xiang, Lin Wei, Zhang Wenyuan, Gao Yifu
- Language Models As Hierarchy Encoders He Yuan, Yuan Zhangdie, Chen Jiaoyan, Horrocks Ian
- Camelot: Towards Large Language Models With Training-free Consolidated Associative Memory He Zexue, Karlinsky Leonid, Kim Donghyun, Mcauley Julian, Krotov Dmitry, Feris Rogerio
- Telechat Technical Report He Zhongjiang, Wang Zihan, Liu Xinzhang, Liu Shixuan, Yao Yitong, Huang Yuyao, Li Xuelong, Li Yongxiang, Che Zhonghao, Zhang Zhaoxi, Wang Yan, Wang Xin, Pu Luwen, Xu Huinan, Fang Ruiyu, Zhao Yu, Zhang Jie, Huang Xiaomeng, Lu Zhilong, Peng Jiaxin, Zheng Wenjun, Wang Shiquan, Yang Bingkai, He Xuewei, Jiang Zhuoru, Xie Qiyi, Zhang Yanhan, Li Zhongqiu, Shi Lingling, Fu Weiwei, Zhang Yin, Huang Zilu, Xiong Sishi, Zhang Yuxiang, Wang Chao, Song Shuangyong
- N-gram Prediction And Word Difference Representations For Language Modeling Heo Dongnyeong, Rim Daniela Noemi, Choi Heeyoul
- On Overcoming Miscalibrated Conversational Priors In Llm-based Chatbots Herlihy Christine, Neville Jennifer, Schnabel Tobias, Swaminathan Adith
- Lilium: Ebay's Large Language Models For E-commerce Herold Christian, Kozielski Michael, Ekimov Leonid, Petrushkov Pavel, Vandenbussche Pierre-yves, Khadivi Shahram
- [lions: 1] And [tigers: 2] And [bears: 3], Oh My! Literary Coreference Annotation With Llms Hicke Rebecca M. M., Mimno David
- Super Tiny Language Models Hillier Dylan, Guertler Leon, Tan Cheston, Agrawal Palaash, Ruirui Chen, Cheng Bobby
- Llava-gemma: Accelerating Multimodal Foundation Models With A Compact Language Model Hinck Musashi, Olson Matthew L., Cobbley David, Tseng Shao-yen, Lal Vasudev
- Optimising Hard Prompts With Few-shot Meta-prompting Hiraou Sayash Raaj
- Towards Building A Robust Knowledge Intensive Question Answering Model With Large Language Models Hong Xingyun, Shao Yan, Wang Zhilin, Duan Manni, Xiongnan Jin
- Game Development As Human-llm Interaction Hong Jiale, Wu Hongqiu, Zhao Hai
- Accelerating Multilingual Language Model For Excessively Tokenized Languages Hong Jimin, Lee Gibbeum, Cho Jaewoong
- Curiosity-driven Red-teaming For Large Language Models Hong Zhang-wei, Shenfeld Idan, Wang Tsun-hsuan, Chuang Yung-sung, Pareja Aldo, Glass James, Srivastava Akash, Agrawal Pulkit
- Free To Play: UN Trade And Development's Experience With Developing Its Own Open-source Retrieval Augmented Generation Large Language Model Application Hopp Daniel
- Large Language Models For Expansion Of Spoken Language Understanding Systems To New Languages Hoscilowicz Jakub, Pawlowski Pawel, Skorupa Marcin, Sowański Marcin, Janicki Artur
- Non-linear Inference Time Intervention: Improving LLM Truthfulness Hoscilowicz Jakub, Wiacek Adam, Chojnacki Jan, Cieslak Adam, Michon Leszek, Urbanevych Vitalii, Janicki Artur
- V-star: Training Verifiers For Self-taught Reasoners Hosseini Arian, Yuan Xingdi, Malkin Nikolay, Courville Aaron, Sordoni Alessandro, Agarwal Rishabh
- Wikicontradict: A Benchmark For Evaluating Llms On Real-world Knowledge Conflicts From Wikipedia Hou Yufang, Pascale Alessandra, Carnerero-cano Javier, Tchrakian Tigran, Marinescu Radu, Daly Elizabeth, Padhi Inkit, Sattigeri Prasanna
- Chatglm-rlhf: Practices Of Aligning Large Language Models With Human Feedback Hou Zhenyu, Niu Yilin, Du Zhengxiao, Zhang Xiaohan, Liu Xiao, Zeng Aohan, Zheng Qinkai, Huang Minlie, Wang Hongning, Tang Jie, Dong Yuxiao
- Exploring Scaling Trends In LLM Robustness Howe Nikolaus, Zajac Michał, Mckenzie Ian, Hollinsworth Oskar, Tseng Tom, Bacon Pierre-luc, Gleave Adam
- Wilke: Wise-layer Knowledge Editor For Lifelong Knowledge Editing Hu Chenhui, Cao Pengfei, Chen Yubo, Liu Kang, Zhao Jun
- Lightweight Modality Adaptation To Sequential Recommendation Via Correlation Supervision Hu Hengchang, Liu Qijiong, Li Chuang, Kan Min-yen
- Outlier-efficient Hopfield Layers For Large Transformer-based Models Hu Jerry Yao-chieh, Chang Pei-hsuan, Luo Robin, Chen Hong-yu, Li Weijian, Wang Wei-po, Liu Han
- Openrlhf: An Easy-to-use, Scalable And High-performance RLHF Framework Hu Jian, Wu Xibin, Wang Weixun, Xianyu, Zhang Dehao, Cao Yu
- Agentgen: Enhancing Planning Abilities For Large Language Model Based Agent Via Environment And Task Generation Hu Mengkang, Zhao Pu, Xu Can, Sun Qingfeng, Lou Jianguang, Lin Qingwei, Luo Ping, Rajmohan Saravan, Zhang Dongmei
- Mitigating Large Language Model Hallucination With Faithful Finetuning Hu Minda, He Bowei, Wang Yufei, Li Liangyou, Ma Chen, King Irwin
- Limited Out-of-context Knowledge Reasoning In Large Language Models Hu Peng, Gao Changjiang, Gao Ruiqi, Chen Jiajun, Huang Shujian
- Mixed Sparsity Training: Achieving 4\(\times\) FLOP Reduction For Transformer Pretraining Hu Pihe, Li Shaolong, Huang Longbo
- Characterization Of Large Language Model Development In The Datacenter Hu Qinghao, Ye Zhisheng, Wang Zerui, Wang Guoteng, Zhang Meng, Chen Qiaoling, Sun Peng, Lin Dahua, Wang Xiaolin, Luo Yingwei, Wen Yonggang, Zhang Tianwei
- Minicpm: Unveiling The Potential Of Small Language Models With Scalable Training Strategies Hu Shengding, Tu Yuge, Han Xu, He Chaoqun, Cui Ganqu, Long Xiang, Zheng Zhi, Fang Yewei, Huang Yuxiang, Zhao Weilin, Zhang Xinrong, Thai Zheng Leng, Zhang Kaihuo, Wang Chongyi, Yao Yuan, Zhao Chenyang, Zhou Jie, Cai Jie, Zhai Zhongwu, Ding Ning, Jia Chao, Zeng Guoyang, Li Dahai, Liu Zhiyuan, Sun Maosong
- Matryoshka Query Transformer For Large Vision-language Models Hu Wenbo, Dou Zi-yi, Li Liunian Harold, Kamath Amita, Peng Nanyun, Chang Kai-wei
- Generative Pretrained Structured Transformers: Unsupervised Syntactic Language Models At Scale Hu Xiang, Ji Pengyu, Zhu Qingyang, Wu Wei, Tu Kewei
- ELLA: Equip Diffusion Models With LLM For Enhanced Semantic Alignment Hu Xiwei, Wang Rui, Fang Yixiao, Fu Bin, Cheng Pei, Yu Gang
- Case-based Or Rule-based: How Do Transformers Do The Math? Hu Yi, Tang Xiaojuan, Yang Haotong, Zhang Muhan
- Rethinking Llm-based Preference Evaluation Hu Zhengyu, Song Linxin, Zhang Jieyu, Xiao Zheyuan, Wang Jingang, Chen Zhenyu, Xiong Hui
- Exact And Efficient Unlearning For Large Language Model-based Recommendation Hu Zhiyu, Zhang Yang, Xiao Minghao, Wang Wenjie, Feng Fuli, He Xiangnan
- Longrecipe: Recipe For Efficient Long Context Generalization In Large Language Models Hu Zhiyuan, Liu Yuliang, Zhao Jinman, Wang Suyuchen, Wang Yan, Shen Wei, Gu Qing, Luu Anh Tuan, Ng See-kiong, Jiang Zhiwei, Hooi Bryan
- An Intermediate Fusion Vit Enables Efficient Text-image Alignment In Diffusion Models Hu Zizhao, Jia Shaochong, Rostami Mohammad
- V2xum-llm: Cross-modal Video Summarization With Temporal Prompt Instruction Tuning Hua Hang, Tang Yunlong, Xu Chenliang, Luo Jiebo
- Disentangling Logic: The Role Of Context In Large Language Model Reasoning Capabilities Hua Wenyue, Zhu Kaijie, Li Lingyao, Fan Lizhou, Lin Shuhang, Jin Mingyu, Xue Haochen, Li Zelong, Wang Jindong, Zhang Yongfeng
- Talk Less, Interact Better: Evaluating In-context Conversational Adaptation In Multimodal Llms Hua Yilun, Artzi Yoav
- Multimodal Task Vectors Enable Many-shot Multimodal In-context Learning Huang Brandon, Mitra Chancharik, Arbelle Assaf, Karlinsky Leonid, Darrell Trevor, Herzig Roei
- Instupr: Instruction-based Unsupervised Passage Reranking With Large Language Models Huang Chao-wei, Chen Yun-nung
- Investigating Decoder-only Large Language Models For Speech-to-text Translation Huang Chao-wei, Lu Hui, Gong Hongyu, Inaguma Hirofumi, Kulikov Ilia, Mavlyutov Ruslan, Popuri Sravya
- Pokergpt: An End-to-end Lightweight Solver For Multi-player Texas Hold'em Via Large Language Model Huang Chenghao, Cao Yanbo, Wen Yinlong, Zhou Tao, Zhang Yanru
- Learn When (not) To Trust Language Models: A Privacy-centric Adaptive Model-aware Approach Huang Chengkai, Wang Rui, Xie Kaige, Yu Tong, Yao Lina
- What Affects The Stability Of Tool Learning? An Empirical Study On The Robustness Of Tool Learning Frameworks Huang Chengrui, Shi Zhengliang, Wen Yuntao, Chen Xiuying, Han Peng, Gao Shen, Shang Shuo
- Training Language Models To Generate Text With Citations Via Fine-grained Rewards Huang Chengyu, Wu Zeqiu, Hu Yushi, Wang Wenya
- Verbalized Probabilistic Graphical Modeling With Large Language Models Huang Hengguan, Shen Xing, Wang Songtao, Liu Dianbo, Wang Hao
- Context-aware Assistant Selection For Improved Inference Acceleration With Large Language Models Huang Jerry, Parthasarathi Prasanna, Rezagholizadeh Mehdi, Chandar Sarath
- Mitigating Catastrophic Forgetting In Large Language Models With Self-synthesized Rehearsal Huang Jianheng, Cui Leyang, Wang Ante, Yang Chengyi, Liao Xinting, Song Linfeng, Yao Junfeng, Su Jinsong
- Demystifying Verbatim Memorization In Large Language Models Huang Jing, Yang Diyi, Potts Christopher
- Learning Fine-grained Grounded Citations For Attributed Large Language Models Huang Lei, Feng Xiaocheng, Ma Weitao, Gu Yuxuan, Zhong Weihong, Feng Xiachong, Yu Weijiang, Peng Weihua, Tang Duyu, Tu Dandan, Qin Bing
- The Solution For The 5th GCAIAC Zero-shot Referring Expression Comprehension Challenge Huang Longfei, Yu Feng, Guan Zhihao, Wan Zhonghua, Yang Yang
- Selective Prompting Tuning For Personalized Conversations With Llms Huang Qiushi, Liu Xubo, Ko Tom, Wu Bo, Wang Wenwu, Zhang Yu, Tang Lilian
- Antidote: Post-fine-tuning Safety Alignment For Large Language Models Against Harmful Fine-tuning Huang Tiansheng, Bhattacharya Gautam, Joshi Pratik, Kimball Josh, Liu Ling
- Vaccine: Perturbation-aware Alignment For Large Language Models Against Harmful Fine-tuning Huang Tiansheng, Hu Sihao, Liu Ling
- Improving Llms For Recommendation With Out-of-vocabulary Tokens Huang Ting-ji, Yang Jia-qi, Shen Chunxu, Liu Kai-qi, Zhan De-chuan, Ye Han-jia
- Brainchat: Decoding Semantic Information From Fmri Using Vision-language Pretrained Models Huang Wanaiu
- Billm: Pushing The Limit Of Post-training Quantization For Llms Huang Wei, Liu Yangdong, Qin Haotong, Li Ying, Zhang Shiming, Liu Xianglong, Magno Michele, Qi Xiaojuan
- Slim-llm: Salience-driven Mixed-precision Quantization For Large Language Models Huang Wei, Qin Haotong, Liu Yangdong, Li Yawei, Liu Xianglong, Benini Luca, Magno Michele, Qi Xiaojuan
- An Empirical Study Of Llama3 Quantization: From Llms To Mllms Huang Wei, Zheng Xingyu, Ma Xudong, Qin Haotong, Lv Chengtao, Chen Hong, Luo Jie, Qi Xiaojuan, Liu Xianglong, Magno Michele
- Pruning Large Language Models With Semi-structural Adaptive Sparse Training Huang Weiyu, Hu Yuezhou, Jian Guohao, Zhu Jun, Chen Jianfei
- Visual Hallucinations Of Multi-modal Large Language Models Huang Wen, Liu Hongbin, Guo Minxin, Gong Neil Zhenqiang
- Rolora: Fine-tuning Rotated Outlier-free Llms For Effective Weight-activation Quantization Huang Xijie, Liu Zechun, Liu Shih-yang, Cheng Kwang-ting
- WESE: Weak Exploration To Strong Exploitation For LLM Agents Huang Xu, Liu Weiwen, Chen Xiaolong, Wang Xingmei, Lian Defu, Wang Yasheng, Tang Ruiming, Chen Enhong
- Lost In The Source Language: How Large Language Models Evaluate The Quality Of Machine Translation Huang Xu, Zhang Zhirui, Geng Xiang, Du Yichao, Chen Jiajun, Huang Shujian
- Key-point-driven Data Synthesis With Its Enhancement On Mathematical Reasoning Huang Yiming, Liu Xiao, Gong Yeyun, Gou Zhibin, Shen Yelong, Duan Nan, Chen Weizhu
- Leveraging Large Language Models For Enhanced NLP Task Performance Through Knowledge Distillation And Optimized Training Strategies Huang Yining, Tang Keke, Chen Meilian
- See The Unseen: Better Context-consistent Knowledge-editing By Noises Huang Youcheng, Lei Wenqiang, Zhang Zheng, Lv Jiancheng, Yan Shuicheng
- Active Testing Of Large Language Model Via Multi-stage Sampling Huang Yuheng, Song Jiayang, Hu Qiang, Juefei-xu Felix, Ma Lei
- Ecr-chain: Advancing Generative Language Models To Better Emotion-cause Reasoners Through Reasoning Chains Huang Zhaopei, Zhao Jinming, Jin Qin
- Mindmerger: Efficient Boosting LLM Reasoning In Non-english Languages Huang Zixian, Zhu Wenhao, Cheng Gong, Li Lei, Yuan Fei
- Accelerating Pre-training Of Multimodal Llms Via Chain-of-sight Huang Ziyuan, Ji Kaixiang, Gong Biao, Qing Zhiwu, Zhang Qinglong, Zheng Kecheng, Wang Jian, Chen Jingdong, Yang Ming
- Instruction Fine-tuning: Does Prompt Loss Matter? Huerta-enochian Mathew, Ko Seung Yong
- HFT: Half Fine-tuning For Large Language Models Hui Tingfeng, Zhang Zhenyu, Wang Shuohuan, Xu Weiran, Sun Yu, Wu Hua
- Romansetu: Efficiently Unlocking Multilingual Capabilities Of Large Language Models Via Romanization Husain Jaavid Aktar, Dabre Raj, Kumar Aswanth, Gala Jay, Jayakumar Thanmay, Puduppully Ratish, Kunchukuttan Anoop
- Promptdsi: Prompt-based Rehearsal-free Instance-wise Incremental Learning For Document Retrieval Huynh Tuan-luc, Vu Thuy-trang, Wang Weiqing, Wei Yinwei, Le Trung, Gasevic Dragan, Li Yuan-fang, Do Thanh-toan
- Self-explore To Avoid The Pit: Improving The Reasoning Capabilities Of Language Models With Fine-grained Rewards Hwang Hyeonbin, Kim Doyoung, Kim Seungone, Ye Seonghyeon, Seo Minjoon
- Pc-lora: Low-rank Adaptation For Progressive Model Compression With Knowledge Distillation Hwang Injoon, Park Haewon, Lee Youngwan, Yang Jooyoung, Maeng Sunjae
- DSLR: Document Refinement With Sentence-level Re-ranking And Reconstruction To Enhance Retrieval-augmented Generation Hwang Taeho, Jeong Soyeong, Cho Sukmin, Han Seungyoon, Park Jong C.
- MP2D: An Automated Topic Shift Dialogue Generation Framework Leveraging Knowledge Graphs Hwang Yerin, Kim Yongil, Jang Yunah, Bang Jeesoo, Bae Hyunkyung, Jung Kyomin
- News Without Borders: Domain Adaptation Of Multilingual Sentence Embeddings For Cross-lingual News Recommendation Iana Andreea, Schmidt Fabian David, Glavaš Goran, Paulheim Heiko
- Simple And Scalable Strategies To Continually Pre-train Large Language Models Ibrahim Adam, Thérien Benjamin, Gupta Kshitij, Richter Mats L., Anthony Quentin, Lesort Timothée, Belilovsky Eugene, Rish Irina
- On The Generalization Of Preference Learning With DPO Im Shawn, Li Yixuan
- Understanding The Learning Dynamics Of Alignment With Human Feedback Im Shawn, Li Yixuan
- Test-time Low Rank Adaptation Via Confidence Maximization For Zero-shot Generalization Of Vision-language Models Imam Raza, Gani Hanan, Huzaifa Muhammad, Nandakumar Karthik
- Improving Multilingual Instruction Finetuning Via Linguistically Natural And Diverse Datasets Indurthi Sathish Reddy, Zhou Wenxuan, Chollampatt Shamil, Agrawal Ravi, Song Kaiqiang, Zhao Lingxiao, Zhu Chenguang
- Breaking The Language Barrier: Can Direct Inference Outperform Pre-translation In Multilingual LLM Applications? Intrator Yotam, Halfon Matan, Goldenberg Roman, Tsarfaty Reut, Eyal Matan, Rivlin Ehud, Matias Yossi, Aizenberg Natalia
- Scaling Laws For Downstream Task Performance Of Large Language Models Isik Berivan, Ponomareva Natalia, Hazimeh Hussein, Paparas Dimitris, Vassilvitskii Sergei, Koyejo Sanmi
- From Loops To Oops: Fallback Behaviors Of Language Models Under Uncertainty Ivgi Maor, Yoran Ori, Berant Jonathan, Geva Mor
- Quality Or Quantity? On Data Scale And Diversity In Adapting Large Language Models For Low-resource Translation Iyer Vivek, Malik Bhavitvya, Stepachev Pavel, Chen Pinzhen, Haddow Barry, Birch Alexandra
- From RAG To RICHES: Retrieval Interlaced With Sequence Generation Jain Palak, Soares Livio Baldini, Kwiatkowski Tom
- KIT-19: A Comprehensive Korean Instruction Toolkit On 19 Tasks For Fine-tuning Korean Large Language Models Jang Dongjun, Byun Sungjoo, Jo Hyemi, Shin Hyopil
- Smart-infinity: Fast Large Language Model Training Using Near-storage Processing On A Real System Jang Hongsun, Song Jaeyong, Jung Jaewon, Park Jaeyoung, Kim Youngsok, Lee Jinho
- Performance Of Recent Large Language Models For A Low-resourced Language Jayakody Ravindu, Dias Gihan
- L4Q: Parameter Efficient Quantization-aware Fine-tuning On Large Language Models Jeon Hyesung, Kim Yulhwa, Kim Jae-joon
- OLAPH: Improving Factuality In Biomedical Long-form Question Answering Jeong Minbyul, Hwang Hyeon, Yoon Chanwoong, Lee Taewhoo, Kang Jaewoo
- RLSF: Reinforcement Learning Via Symbolic Feedback Jha Piyush, Jana Prithwish, Arora Arnav, Ganesh Vijay
- Precision Empowers, Excess Distracts: Visual Question Answering With Dynamically Infused Knowledge In Language Models Jhalani Manas, M Annervaz K, Bhattacharyya Pushpak
- Towards Flexible Evaluation For Generative Visual Question Answering Ji Huishan, Si Qingyi, Lin Zheng, Wang Weiping
- Aligner: Efficient Alignment By Learning To Correct Ji Jiaming, Chen Boyuan, Lou Hantao, Hong Donghai, Zhang Borong, Pan Xuehai, Dai Juntao, Qiu Tianyi, Yang Yaodong
- Language Models Resist Alignment Ji Jiaming, Wang Kaile, Qiu Tianyi, Chen Boyuan, Zhou Jiayi, Li Changye, Lou Hantao, Yang Yaodong
- Llms Could Autonomously Learn Without External Supervision Ji Ke, Chen Junying, Gao Anningzhe, Xie Wenya, Wan Xiang, Wang Benyou
- Can't Say Cant? Measuring And Reasoning Of Dark Jargons In Large Language Models Ji Xu, Zhang Jianyi, Zhou Ziyin, Zhao Zhangchi, Qiao Qianqian, Han Kaiying, Hossen Md Imran, Hei Xiali
- LLM Internal States Reveal Hallucination Risk Faced With A Query Ji Ziwei, Chen Delong, Ishii Etsuko, Cahyawijaya Samuel, Bang Yejin, Wilie Bryan, Fung Pascale
- Generalizing Reward Modeling For Out-of-distribution Preference Learning Jia Chen
- Bwarea Model: Learning World Model, Inverse Dynamics, And Policy For Controllable Language Generation Jia Chengxing, Wang Pengyuan, Li Ziniu, Li Yi-chen, Zhang Zhilong, Tang Nan, Yu Yang
- Pre-training Cross-lingual Open Domain Question Answering With Large-scale Synthetic Supervision Jiang Fan, Drummond Tom, Cohn Trevor
- Chatbug: A Common Vulnerability Of Aligned Llms Induced By Chat Templates Jiang Fengqing, Xu Zhangchen, Niu Luyao, Lin Bill Yuchen, Poovendran Radha
- Interpretable Catastrophic Forgetting Of Large Language Model Fine-tuning Via Instruction Vector Jiang Gangwei, Jiang Caigao, Li Zhaoyi, Xue Siqiao, Zhou Jun, Song Linqi, Lian Defu, Wei Ying
- The Real, The Better: Aligning Large Language Models With Online Human Behaviors Jiang Guanying, Yan Lingyong, Shi Haibo, Yin Dawei
- Minference 1.0: Accelerating Pre-filling For Long-context Llms Via Dynamic Sparse Attention Jiang Huiqiang, Li Yucheng, Zhang Chengruidong, Wu Qianhui, Luo Xufang, Ahn Surin, Han Zhenhua, Abdi Amir H., Li Dongsheng, Lin Chin-yew, Yang Yuqing, Qiu Lili
- MC\(^2\): Multi-concept Guidance For Customized Multi-concept Generation Jiang Jiaxiu, Zhang Yabo, Feng Kailai, Wu Xiaohe, Zuo Wangmeng
- Mix-cpt: A Domain Adaptation Framework Via Decoupling Knowledge Learning And Format Alignment Jiang Jinhao, Li Junyi, Zhao Wayne Xin, Song Yang, Zhang Tao, Wen Ji-rong
- Data Augmentation Of Multi-turn Psychological Dialogue Via Knowledge-driven Progressive Thought Prompting Jiang Jiyue, Chen Liheng, Wang Sheng, Kong Lingpeng, Li Yu, Wu Chuan
- Can Large Language Models Generate High-quality Patent Claims? Jiang Lekang, Zhang Caiqi, Scherz Pascal A, Goetz Stephan
- Detecting Scams Using Large Language Models Jiang Liming
- Wildteaming At Scale: From In-the-wild Jailbreaks To (adversarially) Safer Language Models Jiang Liwei, Rao Kavel, Han Seungju, Ettinger Allyson, Brahman Faeze, Kumar Sachin, Mireshghallah Niloofar, Lu Ximing, Sap Maarten, Choi Yejin, Dziri Nouha
- Investigating Data Contamination For Pre-training Language Models Jiang Minhao, Liu Ken Ziyu, Zhong Ming, Schaeffer Rylan, Ouyang Siru, Han Jiawei, Koyejo Sanmi
- Turning Generative Models Degenerate: The Power Of Data Poisoning Attacks Jiang Shuli, Kadhe Swanand Ravindra, Zhou Yi, Ahmed Farhan, Cai Ling, Baracaldo Nathalie
- Mora: High-rank Updating For Parameter-efficient Fine-tuning Jiang Ting, Huang Shaohan, Luo Shengyue, Zhang Zihan, Huang Haizhen, Wei Furu, Deng Weiwei, Sun Feng, Zhang Qi, Wang Deqing, Zhuang Fuzhen
- Sketch: A Toolkit For Streamlining LLM Operations Jiang Xin, Li Xiang, Ma Wenjia, Fang Xuezhi, Yao Yiqun, Yu Naitong, Meng Xuying, Han Peng, Li Jing, Sun Aixin, Wang Yequan
- SEED: Customize Large Language Models With Sample-efficient Adaptation For Code Generation Jiang Xue, Dong Yihong, Jin Zhi, Li Ge
- Large Visual-language Models Are Also Good Classifiers: A Study Of In-context Multimodal Fake News Detection Jiang Ye, Wang Yimin
- Instruction-tuned Language Models Are Better Knowledge Learners Jiang Zhengbao, Sun Zhiqing, Shi Weijia, Rodriguez Pedro, Zhou Chunting, Neubig Graham, Lin Xi Victoria, Yih Wen-tau, Iyer Srinivasan
- Megascale: Scaling Large Language Model Training To More Than 10,000 Gpus Jiang Ziheng, Lin Haibin, Zhong Yinmin, Huang Qi, Chen Yangrui, Zhang Zhi, Peng Yanghua, Li Xiang, Xie Cong, Nong Shibiao, Jia Yulu, He Sun, Chen Hongmin, Bai Zhihao, Hou Qi, Yan Shipeng, Zhou Ding, Sheng Yiyao, Jiang Zhuo, Xu Haohan, Wei Haoran, Zhang Zhang, Nie Pengfei, Zou Leqi, Zhao Sida, Xiang Liang, Liu Zherui, Li Zhe, Jia Xiaoying, Ye Jianxi, Jin Xin, Liu Xin
- Large Language Models As Data Augmenters For Cold-start Item Recommendation Jianling Wang, Haokai Lu, James Caverlee, Ed Chi, Minmin Chen
- Learning Planning-based Reasoning By Trajectories Collection And Process Reward Synthesizing Jiao Fangkai, Qin Chengwei, Liu Zhengyuan, Chen Nancy F., Joty Shafiq
- Gradable Chatgpt Translation Evaluation Jiao Hui, Peng Bei, Zong Lu, Zhang Xiaojun, Li Xinwei
- Zero-shot Chain-of-thought Reasoning Guided By Evolutionary Algorithms In Large Language Models Jin Feihu, Liu Yifan, Tan Ying
- LLM Maybe Longlm: Self-extend LLM Context Window Without Tuning Jin Hongye, Han Xiaotian, Yang Jingfeng, Jiang Zhimeng, Liu Zirui, Chang Chia-yuan, Chen Huiyuan, Hu Xia
- BIDER: Bridging Knowledge Inconsistency For Efficient Retrieval-augmented Llms Via Key Supporting Evidence Jin Jiajie, Zhu Yutao, Zhou Yujia, Dou Zhicheng
- Crimson: Empowering Strategic Reasoning In Cybersecurity Through Large Language Models Jin Jiandong, Tang Bowen, Ma Mingxuan, Liu Xiao, Wang Yunfei, Lai Qingnan, Yang Jia, Zhou Changling
- HGT: Leveraging Heterogeneous Graph-enhanced Large Language Models For Few-shot Complex Table Understanding Jin Rihui, Li Yu, Qi Guilin, Hu Nan, Li Yuan-fang, Chen Jiaoyan, Wang Jianan, Chen Yongrui, Min Dehai
- Self-selected Attention Span For Accelerating Large Language Model Inference Jin Tian, Yazar Wanzin, Xu Zifei, Sharify Sayeh, Wang Xin
- MULTIVERSE: Exposing Large Language Model Alignment Problems In Diverse Worlds Jin Xiaolong, Zhang Zhuo, Zhang Xiangyu
- Video-lavit: Unified Video-language Pre-training With Decoupled Visual-motional Tokenization Jin Yang, Sun Zhicheng, Xu Kun, Xu Kun, Chen Liwei, Jiang Hao, Huang Quzhe, Song Chengru, Liu Yuliang, Zhang Di, Song Yang, Gai Kun, Mu Yadong
- Birbal: An Efficient 7B Instruct-model Fine-tuned With Curated Datasets Jindal Ashvini Kumar, Rajpoot Pawan Kumar, Parikh Ankur
- Investigating Multi-hop Factual Shortcuts In Knowledge Editing Of Large Language Models Ju Tianjie, Chen Yijin, Yuan Xinwei, Zhang Zhuosheng, Du Wei, Zheng Yubin, Liu Gongshen
- Llm-based Federated Recommendation Jujia Zhao, Wenjie Wang, Chen Xu, Zhaochun Ren, See-kiong Ng, Tat-seng Chua
- CODE: Contrasting Self-generated Description To Combat Hallucination In Large Multi-modal Models Junho Kim, Hyunjun Kim, Yeonju Kim, Yong Man Ro
- What If...?: Thinking Counterfactual Keywords Helps To Mitigate Hallucination In Large Multi-modal Models Junho Kim, Yeon Ju Kim, Yong Man Ro
- LARA: Linguistic-adaptive Retrieval-augmented Llms For Multi-turn Intent Classification Junhua Liu, Keat Tan Yong, Bin Fu
- Dipt: Enhancing LLM Reasoning Through Diversified Perspective-taking Just Hoang Anh, Dabas Mahavir, Huang Lifu, Jin Ming, Jia Ruoxi
- Beyond Labels: Aligning Large Language Models With Human-like Reasoning Kabir Muhammad Rafsan, Sultan Rafeed Mohammad, Asif Ihsanul Haque, Ahad Jawad Ibn, Rahman Fuad, Amin Mohammad Ruhul, Mohammed Nabeel, Rahman Shafin
- Evaluating The Impact Of Advanced LLM Techniques On Ai-lecture Tutors For A Robotics Course Kahl Sebastian, Löffler Felix, Maciol Martin, Ridder Fabian, Schmitz Marius, Spanagel Jennifer, Wienkamp Jens, Burgahn Christopher, Schilling Malte
- Scaling Laws For Forgetting When Fine-tuning Large Language Models Kalajdzievski Damjan
- Understanding The Effect Of Using Semantically Meaningful Tokens For Visual Representation Learning Kalibhat Neha, Kattakinda Priyatham, Zarei Arman, Seleznev Nikita, Sharpe Samuel, Kumar Senthil, Feizi Soheil
- LLM-ARC: Enhancing Llms With An Automated Reasoning Critic Kalyanpur Aditya, Saravanakumar Kailash Karthik, Barres Victor, Chu-carroll Jennifer, Melville David, Ferrucci David
- Counterfactuals As A Means For Evaluating Faithfulness Of Attribution Methods In Autoregressive Language Models Kamahi Sepehr, Yaghoobzadeh Yadollah
- When Can Llms Actually Correct Their Own Mistakes? A Critical Survey Of Self-correction Of Llms Kamoi Ryo, Zhang Yusen, Zhang Nan, Han Jiawei, Zhang Rui
- Domain-specific Improvement On Psychotherapy Chatbot Using Assistant Kang Cheng, Novak Daniel, Urbanova Katerina, Cheng Yuqing, Hu Yong
- Mindstar: Enhancing Math Reasoning In Pre-trained Llms At Inference Time Kang Jikun, Li Xin Zhe, Chen Xi, Kazemi Amirreza, Sun Qianyi, Chen Boxing, Li Dong, He Xu, He Quan, Wen Feng, Hao Jianye, Yao Jun
- Large Language Models Must Be Taught To Know What They Don't Know Kapoor Sanyam, Gruver Nate, Roberts Manley, Collins Katherine, Pal Arka, Bhatt Umang, Weller Adrian, Dooley Samuel, Goldblum Micah, Wilson Andrew Gordon
- Can Reinforcement Learning Unlock The Hidden Dangers In Aligned Large Language Models? Karkevandi Mohammad Bahrami, Vishwamitra Nishant, Najafirad Peyman
- Beyond Traditional Benchmarks: Analyzing Behaviors Of Open Llms On Data-to-text Generation Kasner Zdeněk, Dušek Ondřej
- Alpaca Against Vicuna: Using Llms To Uncover Memorization Of Llms Kassem Aly M., Mahmoud Omar, Mireshghallah Niloofar, Kim Hyunwoo, Tsvetkov Yulia, Choi Yejin, Saad Sherif, Rana Santu
- Comparing Plausibility Estimates In Base And Instruction-tuned Large Language Models Kauf Carina, Chersoni Emmanuele, Lenci Alessandro, Fedorenko Evelina, Ivanova Anna A.
- Spectra: A Comprehensive Study Of Ternary, Quantized, And FP16 Language Models Kaushal Ayush, Pandey Tejas, Vaidhya Tejas, Bhagat Aaryan, Rish Irina
- Defending Large Language Models Against Attacks With Residual Stream Activation Analysis Kawasaki Amelia, Davis Andrew, Abbas Houssam
- Source-aware Training Enables Knowledge Attribution In Language Models Khalifa Muhammad, Wadden David, Strubell Emma, Lee Honglak, Wang Lu, Beltagy Iz, Peng Hao
- Indicllmsuite: A Blueprint For Creating Pre-training And Fine-tuning Datasets For Indian Languages Khan Mohammed Safi Ur Rahman, Mehta Priyam, Sankar Ananth, Kumaravelan Umashankar, Doddapaneni Sumanth, G Suriyaprasaad, G Varun Balan, Jain Sparsh, Kunchukuttan Anoop, Kumar Pratyush, Dabre Raj, Khapra Mitesh M.
- Self-training Large Language Models For Improved Visual Program Synthesis With Visual Reinforcement Khan Zaid, Bg Vijay Kumar, Schulter Samuel, Fu Yun, Chandraker Manmohan
- Consistency And Uncertainty: Identifying Unreliable Responses From Black-box Vision-language Models For Selective Visual Question Answering Khan Zaid, Fu Yun
- ARGS: Alignment As Reward-guided Search Khanov Maxim, Burapacheep Jirayu, Li Yixuan
- How Well Do Llms Represent Values Across Cultures? Empirical Analysis Of LLM Responses Based On Hofstede Cultural Dimensions Kharchenko Julia, Roosta Tanya, Chadha Aman, Shah Chirag
- How Good Is My Video LMM? Complex Video Reasoning And Robustness Evaluation Suite For Video-lmms Khattak Muhammad Uzair, Naeem Muhammad Ferjad, Hassan Jameel, Naseer Muzammal, Tombari Federico, Khan Fahad Shahbaz, Khan Salman
- Learning To Prompt With Text Only Supervision For Vision-language Models Khattak Muhammad Uzair, Naeem Muhammad Ferjad, Naseer Muzammal, Van Gool Luc, Tombari Federico
- Efficient Medical Question Answering With Knowledge-augmented Question Generation Khlaut Julien, Dancette Corentin, Ferreres Elodie, Bennani Alaedine, Hérent Paul, Manceron Pierre
- Llms In The Loop: Leveraging Large Language Model Annotations For Active Learning In Low-resource Languages Kholodna Nataliia, Julka Sahib, Khodadadi Mohammad, Gumus Muhammed Nurullah, Granitzer Michael
- Why And When Llm-based Assistants Can Go Wrong: Investigating The Effectiveness Of Prompt-based Interactions For Software Help-seeking Khurana Anjali, Subramonyam Hari, Chilana Parmit K
- Guiding Large Language Models To Post-edit Machine Translation With Error Annotations Ki Dayeon, Carpuat Marine
- Shortened Llama: Depth Pruning For Large Language Models With Comparison Of Retraining Methods Kim Bo-kyeong, Kim Geonmin, Kim Tae-ho, Castells Thibault, Choi Shinkook, Shin Junho, Song Hyoung-kyu
- Understanding Large-language Model (llm)-powered Human-robot Interaction Kim Callie Y., Lee Christine P., Mutlu Bilge
- A Mechanistic Interpretation Of Syllogistic Reasoning In Auto-regressive Language Models Kim Geonhee, Valentino Marco, Freitas André
- Promptkd: Distilling Student-friendly Knowledge For Generative Language Models Via Prompt Tuning Kim Gyeongman, Jang Doohyuk, Yang Eunho
- On Initializing Transformers With Pre-trained Embeddings Kim Ha Young, Balasubramanian Niranjan, Kang Byungkon
- Integrating Paralinguistics In Speech-empowered Large Language Models For Natural Conversation Kim Heeseung, Seo Soonshin, Jeong Kyeongseok, Kwon Ohsung, Kim Soyoon, Kim Jungwhan, Lee Jaehong, Song Eunwoo, Oh Myungwoo, Ha Jung-woo, Yoon Sungroh, Yoo Kang Min
- Fine-tuning CLIP Text Encoders With Two-step Paraphrasing Kim Hyunjae, Yoon Seunghyun, Bui Trung, Zhao Handong, Tran Quan, Dernoncourt Franck, Kang Jaewoo
- Sure: Summarizing Retrievals Using Answer Candidates For Open-domain QA Of Llms Kim Jaehyung, Nam Jaehyun, Mo Sangwoo, Park Jongjin, Lee Sang-woo, Seo Minjoon, Ha Jung-woo, Shin Jinwoo
- Strategic Data Ordering: Enhancing Large Language Model Performance Through Curriculum Learning Kim Jisu, Lee Juhwan
- Enhancing Psychotherapy Counseling: A Data Augmentation Pipeline Leveraging Large Language Models For Counseling Conversations Kim Jun-woo, Han Ji-eun, Koh Jun-seok, Seo Hyeon-tae, Chang Du-seong
- RE-RAG: Improving Open-domain QA Performance And Interpretability With Relevance Estimator In Retrieval-augmented Generation Kim Kiseung, Lee Jay-yoon
- Openvla: An Open-source Vision-language-action Model Kim Moo Jin, Pertsch Karl, Karamcheti Siddharth, Xiao Ted, Balakrishna Ashwin, Nair Suraj, Rafailov Rafael, Foster Ethan, Lam Grace, Sanketi Pannag, Vuong Quan, Kollar Thomas, Burchfiel Benjamin, Tedrake Russ, Sadigh Dorsa, Levine Sergey, Liang Percy, Finn Chelsea
- Large Language Models Meet Collaborative Filtering: An Efficient All-round Llm-based Recommender System Kim Sein, Kang Hongseok, Choi Seungyoon, Kim Donghyun, Yang Minchul, Park Chanyoung
- Efficient And Effective Vocabulary Expansion Towards Multilingual Large Language Models Kim Seungduk, Choi Seungtaek, Jeong Myeongho
- Is GPT-4 Alone Sufficient For Automated Essay Scoring?: A Comparative Judgment Approach Based On Rater Cognition Kim Seungju, Jo Meounggun
- Groundial: Human-norm Grounded Safe Dialog Response Generation Kim Siwon, Dai Shuyang, Kachuee Mohammad, Ray Shayan, Taghavi Tara, Yoon Sungroh
- Rethinking The Role Of Proxy Rewards In Language Model Alignment Kim Sungdong, Seo Minjoon
- Llmem: Estimating GPU Memory Usage For Fine-tuning Pre-trained Llms Kim Taeho, Wang Yanming, Chaturvedi Vatshank, Gupta Lokesh, Kim Seyeon, Kwon Yongin, Ha Sangtae
- Learning To Explore And Select For Coverage-conditioned Retrieval-augmented Generation Kim Takyoung, Lee Kyungjae, Jang Young Rok, Cho Ji Yong, Kim Gangwoo, Cho Minseok, Lee Moontae
- Health-llm: Large Language Models For Health Prediction Via Wearable Sensor Data Kim Yubin, Xu Xuhai, Mcduff Daniel, Breazeal Cynthia, Park Hae Won
- From Bytes To Borsch: Fine-tuning Gemma And Mistral For The Ukrainian Language Representation Kiulian Artur, Polishko Anton, Khandoga Mykola, Chubych Oryna, Connor Jack, Ravishankar Raghav, Shirawalmath Adarsh
- Contrastive Perplexity For Controlled Generation: An Application In Detoxifying Large Language Models Klein Tassilo, Nabi Moin
- Extended Mind Transformers Klett Phoebe, Ahle Thomas
- Growover: How Can Llms Adapt To Growing Real-world Knowledge? Ko Dayoon, Kim Jinyoung, Choi Hahyeon, Kim Gunhee
- Generation Constraint Scaling Can Mitigate Hallucination Kollias Georgios, Das Payel, Chaudhury Subhajit
- Style Vectors For Steering Generative Large Language Model Konen Kai, Jentzsch Sophie, Diallo Diaoulé, Schütt Peer, Bensch Oliver, Baff Roxanne El, Opitz Dominik, Hecking Tobias
- Blending Llms Into Cascaded Speech Translation: Kit's Offline Speech Translation System For IWSLT 2024 Koneru Sai, Nguyen Thai-binh, Pham Ngoc-quan, Liu Danni, Li Zhaolin, Waibel Alexander, Niehues Jan
- Self-prompt Tuning: Enable Autonomous Role-playing In Llms Kong Aobo, Zhao Shiwan, Chen Hao, Li Qicheng, Qin Yong, Sun Ruiqi, Zhou Xin, Zhou Jiaming, Sun Haoqin
- Labeling Supervised Fine-tuning Data With The Scaling Law Kong Huanjun
- Aligning Large Language Models With Representation Editing: A Control Perspective Kong Lingkai, Wang Haorui, Mu Wenhao, Du Yuanqi, Zhuang Yuchen, Zhou Yifei, Song Yue, Zhang Rongzhi, Wang Kai, Zhang Chao
- Controllable Navigation Instruction Generation With Chain Of Thought Prompting Kong Xianghao, Chen Jinyu, Wang Wenguan, Su Hang, Hu Xiaolin, Yang Yi, Liu Si
- Customizing Language Models With Instance-wise Lora For Sequential Recommendation Kong Xiaoyu, Wu Jiancan, Zhang An, Sheng Leheng, Lin Hui, Wang Xiang, He Xiangnan
- Audio Flamingo: A Novel Audio Language Model With Few-shot Learning And Dialogue Abilities Kong Zhifeng, Goel Arushi, Badlani Rohan, Ping Wei, Valle Rafael, Catanzaro Bryan
- Transllama: Llm-based Simultaneous Translation System Koshkin Roman, Sudoh Katsuhito, Nakamura Satoshi
- Llms Are Zero-shot Context-aware Simultaneous Translators Koshkin Roman, Sudoh Katsuhito, Nakamura Satoshi
- Hints-in-browser: Benchmarking Language Models For Programming Feedback Generation Kotalwar Nachiket, Gotovos Alkis, Singla Adish
- Arabiangpt: Native Arabic Gpt-based Large Language Model Koubaa Anis, Ammar Adel, Ghouti Lahouari, Najar Omar, Sibaee Serry
- Can Large Language Models Explore In-context? Krishnamurthy Akshay, Harris Keegan, Foster Dylan J., Zhang Cyril, Slivkins Aleksandrs
- Can Better Text Semantics In Prompt Tuning Improve VLM Generalization? Kuchibhotla Hari Chandana, Kancheti Sai Srinivas, Reddy Abbavaram Gowtham, Balasubramanian Vineeth N
- Synthdst: Synthetic Data Is All You Need For Few-shot Dialog State Tracking Kulkarni Atharva, Tseng Bo-hsiang, Moniz Joel Ruben Antony, Piraviperumal Dhivya, Yu Hong, Bhargava Shruti
- Reinforcement Learning For Optimizing RAG For Domain Chatbots Kulkarni Mandar, Tangarajan Praveen, Kim Kyung, Trivedi Anusua
- Fine-tuning, Quantization, And Llms: Navigating Unintended Outcomes Kumar Divyanshu, Kumar Anurakt, Agarwal Sahil, Harshangi Prashanth
- Bridging The Gap: Dynamic Learning Strategies For Improving Multilingual Performance In Llms Kumar Somnath, Balloli Vaibhav, Ranjit Mercy, Ahuja Kabir, Ganu Tanuja, Sitaram Sunayana, Bali Kalika, Nambi Akshay
- Properties And Challenges Of Llm-generated Explanations Kunz Jenny, Kuhlmann Marco
- Language-specific Calibration For Pruning Multilingual Language Models Kurz Simon, Chen Jian-jia, Flek Lucie, Zhao Zhixue
- Teaching Llama A New Language Through Cross-lingual Knowledge Transfer Kuulmets Hele-andra, Purason Taido, Luhtaru Agnes, Fishel Mark
- Continual Learning Optimizations For Auto-regressive Decoder Of Multilingual ASR Systems Kwok Chin Yuen, Yip Jia Qi, Chng Eng Siong
- Zero-shot Text-guided Infinite Image Synthesis With LLM Guidance Kwon Soyeong, Lee Taegyeong, Kim Taehwan
- A Framework For Synthetic Audio Conversations Generation Using Large Language Models Kyaw Kaung Myat, Chan Jonathan Hoyin
- Enhancing Multi-domain Automatic Short Answer Grading Through An Explainable Neuro-symbolic Pipeline Künnecke Felix, Filighera Anna, Leong Colin, Steuer Tim
- When To Retrieve: Teaching Llms To Utilize Information Retrieval Effectively Labruna Tiziano, Campos Jon Ander, Azkune Gorka
- Thalle: Text Hyperlocally Augmented Large Language Extension -- Technical Report Labs Kbtg, Khamnuansin Danupat, Petchsod Atthakorn, Lertpiya Anuruth, Balee Pornchanan, Lodkaew Thanawat, Chalothorn Tawunrat, Pongthawornkamol Thadpong, Lertsutthiwong Monchai
- The Remarkable Robustness Of Llms: Stages Of Inference? Lad Vedang, Gurnee Wes, Tegmark Max
- Autowebglm: Bootstrap And Reinforce A Large Language Model-based Web Navigating Agent Lai Hanyu, Liu Xiao, Iong Iat Long, Yao Shuntian, Chen Yuxuan, Shen Pengbo, Yu Hao, Zhang Hanchen, Zhang Xiaohan, Dong Yuxiao, Tang Jie
- SEC-QA: A Systematic Evaluation Corpus For Financial QA Lai Viet Dac, Krumdick Michael, Lovering Charles, Reddy Varshini, Schmidt Craig, Tanner Chris
- Llms Beyond English: Scaling The Multilingual Capability Of Llms With Cross-lingual Feedback Lai Wen, Mesgar Mohsen, Fraser Alexander
- Alarm: Align Language Models Via Hierarchical Rewards Modeling Lai Yuhang, Wang Siyuan, Liu Shujun, Huang Xuanjing, Wei Zhongyu
- From Distributional To Overton Pluralism: Investigating Large Language Model Alignment Lake Thom, Choi Eunsol, Durrett Greg
- Self-directed Synthetic Dialogues And Revisions Technical Report Lambert Nathan, Schoelkopf Hailey, Gokaslan Aaron, Soldaini Luca, Pyatkin Valentina, Castricato Louis
- Lapdoc: Layout-aware Prompting For Documents Lamott Marcel, Weweler Yves-noel, Ulges Adrian, Shafait Faisal, Krechel Dirk, Obradovic Darko
- FOCUS: Forging Originality Through Contrastive Use In Self-plagiarism For Language Models Lan Kaixin, Fang Tao, Wong Derek F., Xu Yabo, Chao Lidia S., Zhao Cecilia G.
- Fishing For Magikarp: Automatically Detecting Under-trained Tokens In Large Language Models Land Sander, Bartolo Max
- Fine-tuning Language Models With Reward Learning On Policy Lang Hao, Huang Fei, Li Yongbin
- Large Language Models As Evolution Strategies Lange Robert Tjarko, Tian Yingtao, Tang Yujin
- Enhancing Neural Machine Translation Of Low-resource Languages: Corpus Development, Human Evaluation And Explainable AI Architectures Lankford Séamus
- Adaptmllm: Fine-tuning Multilingual Language Models On Low-resource Languages With Integrated LLM Playgrounds Lankford Séamus, Afli Haithem, Way Andy
- Adaptnmt: An Open-source, Language-agnostic Development Environment For Neural Machine Translation Lankford Séamus, Afli Haithem, Way Andy
- Transformers For Low-resource Languages: Is Féidir Linn! Lankford Séamus, Afli Haithem, Way Andy
- IBD: Alleviating Hallucinations In Large Vision-language Models Via Image-biased Decoding Lanyun Zhu, Deyi Ji, Tianrun Chen, Peng Xu, Jieping Ye, Jun Liu
- Sparse Rewards Can Self-train Dialogue Agents Lattimer Barrett Martin, Gangal Varun, Mcdonald Ryan, Yang Yi
- What Matters When Building Vision-language Models? Laurençon Hugo, Tronchon Léo, Cord Matthieu, Sanh Victor
- Sentiment Reasoning For Healthcare Le-duc Khai, Nguyen Khai-nguyen, Tat Bach Phan, Le Duy, Ngo Jerry, Vo-dang Long, Nguyen Anh Totti, Hy Truong-son
- Reallm: A General Framework For LLM Compression And Fine-tuning Leconte Louis, Bedin Lisa, Nguyen Van Minh, Moulines Eric
- Benchmarks Underestimate The Readiness Of Multi-lingual Dialogue Agents Lee Andrew H., Semnani Sina J., Castillo-lópez Galo, De Chalendar Gäel, Choudhury Monojit, Dua Ashna, Kavitha Kapil Rajesh, Kim Sungkyun, Kodali Prashant, Kumaraguru Ponnurangam, Lombard Alexis, Moradshahi Mehrad, Park Gihyun, Semmar Nasredine, Seo Jiwon, Shen Tianhao, Shrivastava Manish, Xiong Deyi, Lam Monica S.
- Trol: Traversal Of Layers For Large Language And Vision Models Lee Byung-kwan, Chung Sangyun, Kim Chae Won, Park Beomchan, Ro Yong Man
- Nv-embed: Improved Techniques For Training Llms As Generalist Embedding Models Lee Chankyu, Roy Rajarshi, Xu Mengyao, Raiman Jonathan, Shoeybi Mohammad, Catanzaro Bryan, Ping Wei
- Hip Attention: Sparse Sub-quadratic Attention With Hierarchical Attention Pruning Lee Heejun, Park Geon, Lee Youngwan, Kim Jina, Jeong Wonyoung, Jeon Myeongjae, Hwang Sung Ju
- Improving Conversational Abilities Of Quantized Large Language Models Via Direct Preference Alignment Lee Janghwan, Park Seongmin, Hong Sukjin, Kim Minsoo, Chang Du-seong, Choi Jungwook
- CATS: Contextually-aware Thresholding For Sparsity In Large Language Models Lee Je-yong, Lee Donghyun, Zhang Genghan, Tiwari Mo, Mirhoseini Azalia
- A Survey Of Large Language Models In Finance (finllms) Lee Jean, Stevens Nicholas, Han Soyeon Caren, Song Minseok
- Plagbench: Exploring The Duality Of Large Language Models In Plagiarism Generation And Detection Lee Jooyoung, Agrawal Toshini, Uchendu Adaku, Le Thai, Chen Jinghui, Lee Dongwon
- LRQ: Optimizing Post-training Quantization For Large Language Models By Learning Low-rank Weight-scaling Matrices Lee Jung Hyun, Kim Jeonghoon, Yang June Yong, Kwon Se Jung, Yang Eunho, Yoo Kang Min, Lee Dongsoo
- Token-supervised Value Models For Enhancing Mathematical Reasoning Capabilities Of Large Language Models Lee Jung Hyun, Yang June Yong, Heo Byeongho, Han Dongyoon, Yoo Kang Min
- Multimodal Reasoning With Multimodal Knowledge Graph Lee Junlin, Wang Yequan, Li Jing, Zhang Min
- Reinforcement Learning From Reflective Feedback (RLRF): Aligning And Improving Llms Via Fine-grained Self-reflection Lee Kyungjae, Hwang Dasol, Park Sunghyun, Jang Youngsoo, Lee Moontae
- LLM2LLM: Boosting Llms With Novel Iterative Data Enhancement Lee Nicholas, Wattanawong Thanakul, Kim Sehoon, Mangalam Karttikeya, Shen Sheng, Anumanchipalli Gopala, Mahoney Michael W., Keutzer Kurt, Gholami Amir
- Interactive Text-to-image Retrieval With Large Language Models: A Plug-and-play Approach Lee Saehyung, Yu Sangwon, Park Junsung, Yi Jihun, Yoon Sungroh
- Effective And Efficient Conversation Retrieval For Dialogue State Tracking With Implicit Text Summaries Lee Seanie, Cheng Jianpeng, Driesen Joris, Coca Alexandru, Johannsen Anders
- Aligning To Thousands Of Preferences Via System Message Generalization Lee Seongyun, Park Sue Hyun, Kim Seungone, Seo Minjoon
- Do Llms Have Distinct And Consistent Personality? TRAIT: Personality Testset Designed For Llms With Psychometrics Lee Seungbeen, Lim Seungwon, Han Seungju, Oh Giyeong, Chae Hyungjoo, Chung Jiwan, Kim Minju, Kwak Beong-woo, Lee Yeonsoo, Lee Dongha, Yeo Jinyoung, Yu Youngjae
- Llava-docent: Instruction Tuning With Multimodal Large Language Model To Support Art Appreciation Education Lee Unggi, Jeon Minji, Lee Yunseo, Byun Gyuri, Son Yoorim, Shin Jaeyoon, Ko Hongkyu, Kim Hyeoncheol
- Chatgpt As Research Scientist: Probing Gpt's Capabilities As A Research Librarian, Research Ethicist, Data Generator And Data Predictor Lehr Steven A., Caliskan Aylin, Liyanage Suneragiri, Banaji Mahzarin R.
- Autocoder: Enhancing Code Large Language Model With AIEV-instruct Lei Bin, Li Yuchen, Chen Qiuwu
- Meta-task Prompting Elicits Embeddings From Large Language Models Lei Yibin, Wu Di, Zhou Tianyi, Shen Tao, Cao Yu, Tao Chongyang, Yates Andrew
- Small-e: Small Language Model With Linear Attention For Efficient Speech Synthesis Lemerle Théodor, Obin Nicolas, Roebel Axel
- Using Counterfactual Tasks To Evaluate The Generality Of Analogical Reasoning In Large Language Models Lewis Martha, Mitchell Melanie
- Cascade Reward Sampling For Efficient Decoding-time Alignment Li Bolian, Wang Yifan, Grama Ananth, Zhang Ruqi
- Are Bigger Encoders Always Better In Vision Large Models? Li Bozhou, Liang Hao, Meng Zimo, Zhang Wentao
- Eliciting Better Multilingual Structured Reasoning From Llms Through Code Li Bryan, Alkhouli Tamer, Bonadiman Daniele, Pappas Nikolaos, Mansour Saab
- Searchlvlms: A Plug-and-play Framework For Augmenting Large Vision-language Models By Searching Up-to-date Internet Knowledge Li Chuanhao, Li Zhen, Jing Chenchen, Liu Shuo, Shao Wenqi, Wu Yuwei, Luo Ping, Qiao Yu, Zhang Kaipeng
- On The Role Of Long-tail Knowledge In Retrieval Augmented Large Language Models Li Dongyang, Yan Junbing, Zhang Taolin, Wang Chengyu, He Xiaofeng, Huang Longtao, Xue Hui, Huang Jun
- A Study On Training And Developing Large Language Models For Behavior Tree Generation Li Fu, Wang Xueying, Li Bin, Wu Yunlong, Wang Yanzhen, Yi Xiaodong
- Recall, Retrieve And Reason: Towards Better In-context Relation Extraction Li Guozheng, Wang Peng, Ke Wenjun, Guo Yikai, Ji Ke, Shang Ziyu, Liu Jiajun, Xu Zijie
- Gradient-mask Tuning Elevates The Upper Limits Of LLM Performance Li Haoling, Zhang Xin, Liu Xiao, Gong Yeyun, Wang Yifan, Yang Yujiu, Chen Qi, Cheng Peng
- Backdoor Removal For Generative Large Language Models Li Haoran, Chen Yulin, Zheng Zihao, Hu Qi, Chan Chunkit, Liu Heshan, Song Yangqiu
- Revisiting Catastrophic Forgetting In Large Language Model Tuning Li Hongyu, Ding Liang, Fang Meng, Tao Dacheng
- Datacomp-lm: In Search Of The Next Generation Of Training Sets For Language Models Li Jeffrey, Fang Alex, Smyrnis Georgios, Ivgi Maor, Jordan Matt, Gadre Samir, Bansal Hritik, Guha Etash, Keh Sedrick, Arora Kushal, Garg Saurabh, Xin Rui, Muennighoff Niklas, Heckel Reinhard, Mercat Jean, Chen Mayee, Gururangan Suchin, Wortsman Mitchell, Albalak Alon, Bitton Yonatan, Nezhurina Marianna, Abbas Amro, Hsieh Cheng-yu, Ghosh Dhruba, Gardner Josh, Kilian Maciej, Zhang Hanlin, Shao Rulin, Pratt Sarah, Sanyal Sunny, Ilharco Gabriel, Daras Giannis, Marathe Kalyani, Gokaslan Aaron, Zhang Jieyu, Chandu Khyathi, Nguyen Thao, Vasiljevic Igor, Kakade Sham, Song Shuran, Sanghavi Sujay, Faghri Fartash, Oh Sewoong, Zettlemoyer Luke, Lo Kyle, El-nouby Alaaeldin, Pouransari Hadi, Toshev Alexander, Wang Stephanie, Groeneveld Dirk, Soldaini Luca, Koh Pang Wei, Jitsev Jenia, Kollar Thomas, Dimakis Alexandros G., Carmon Yair, Dave Achal, Schmidt Ludwig, Shankar Vaishaal
- Know The Unknown: An Uncertainty-sensitive Method For LLM Instruction Tuning Li Jiaqi, Tang Yixuan, Yang Yi
- A Cross-language Investigation Into Jailbreak Attacks In Large Language Models Li Jie, Liu Yi, Liu Chongyang, Shi Ling, Ren Xiaoning, Zheng Yaowen, Liu Yang, Xue Yinxing
- Steering Llms Towards Unbiased Responses: A Causality-guided Debiasing Framework Li Jingling, Tang Zeyu, Liu Xiaoyu, Spirtes Peter, Zhang Kun, Leqi Liu, Liu Yang
- Quickllama: Query-aware Inference Acceleration For Large Language Models Li Jingyao, Shi Han, Jiang Xin, Li Zhenguo, Xu Hong, Jia Jiaya
- Banishing LLM Hallucinations Requires Rethinking Generalization Li Johnny, Consul Saksham, Zhou Eda, Wong James, Farooqui Naila, Ye Yuxin, Manohar Nithyashree, Wei Zhuxiaona, Wu Tian, Echols Ben, Zhou Sharon, Diamos Gregory
- Dissecting Human And LLM Preferences Li Junlong, Zhou Fan, Sun Shichao, Zhang Yikai, Zhao Hai, Liu Pengfei
- The Dawn After The Dark: An Empirical Study On Factuality Hallucination In Large Language Models Li Junyi, Chen Jie, Ren Ruiyang, Cheng Xiaoxue, Zhao Wayne Xin, Nie Jian-yun, Wen Ji-rong
- Predicting Vs. Acting: A Trade-off Between World Modeling & Agent Modeling Li Margaret, Shi Weijia, Pagnoni Artidoro, West Peter, Holtzman Ari
- Selective Reflection-tuning: Student-selected Data Recycling For LLM Instruction-tuning Li Ming, Chen Lichang, Chen Jiuhai, He Shwai, Gu Jiuxiang, Zhou Tianyi
- Bridging The Language Gap: Enhancing Multilingual Prompt-based Code Generation In Llms Via Zero-shot Cross-lingual Transfer Li Mingda, Mishra Abhijit, Mujumdar Utkarsh
- Red Teaming Visual Language Models Li Mukai, Li Lei, Yin Yuwei, Ahmed Masood, Liu Zhenguang, Liu Qi
- Owlore: Outlier-weighed Layerwise Sampled Low-rank Projection For Memory-efficient LLM Fine-tuning Li Pengxiang, Yin Lu, Gao Xiaowei, Liu Shiwei
- Examining Post-training Quantization For Mixture-of-experts: A Benchmark Li Pingzhi, Jin Xiaolong, Cheng Yu, Chen Tianlong
- Coevol: Constructing Better Responses For Instruction Finetuning Through Multi-agent Cooperation Li Renhao, Tan Minghuan, Wong Derek F., Yang Min
- FSMR: A Feature Swapping Multi-modal Reasoning Approach With Joint Textual And Visual Clues Li Shuang, Wang Jiahua, Wen Lijie
- Scilitllm: How To Adapt Llms For Scientific Literature Understanding Li Sihang, Huang Jin, Zhuang Jiaxi, Shi Yaorui, Cai Xiaochen, Xu Mingjun, Wang Xiang, Zhang Linfeng, Ke Guolin, Cai Hengxing
- Improving Faithfulness Of Large Language Models In Summarization Via Sliding Generation And Self-consistency Li Taiji, Li Zhi, Zhang Yin
- Moe-ct: A Novel Approach For Large Language Models Training With Resistance To Catastrophic Forgetting Li Tianhao, Li Shangjie, Xie Binbin, Xiong Deyi, Yang Baosong
- Purifying Large Language Models By Ensembling A Small Language Model Li Tianlin, Liu Qian, Pang Tianyu, Du Chao, Guo Qing, Liu Yang, Lin Min
- TOPA: Extend Large Language Models For Video Understanding Via Text-only Pre-alignment Li Wei, Fan Hehe, Wong Yongkang, Kankanhalli Mohan, Yang Yi
- Citation-enhanced Generation For Llm-based Chatbots Li Weitao, Li Junkai, Ma Weizhi, Liu Yang
- Is Programming By Example Solved By Llms? Li Wen-ding, Ellis Kevin
- Stbench: Assessing The Ability Of Large Language Models In Spatio-temporal Analysis Li Wenbin, Yao Di, Zhao Ruibo, Chen Wenjie, Xu Zijie, Luo Chengxue, Gong Chang, Jing Quanliang, Tan Haining, Bi Jingping
- Uio-llms: Unbiased Incremental Optimization For Long-context Llms Li Wenhao, Lin Mingbao, Zhong Yunshan, Yan Shuicheng, Ji Rongrong
- Unlocking Large Language Model's Planning Capabilities With Maximum Diversity Fine-tuning Li Wenjun, Chen Changyu, Varakantham Pradeep
- 52B To 1T: Lessons Learned Via Tele-flm Series Li Xiang, Yao Yiqun, Jiang Xin, Fang Xuezhi, Wang Chao, Liu Xinzhang, Wang Zihan, Zhao Yu, Wang Xin, Huang Yuyao, Song Shuangyong, Li Yongxiang, Zhang Zheng, Zhao Bo, Sun Aixin, Wang Yequan, He Zhongjiang, Wang Zhongyuan, Li Xuelong, Huang Tiejun
- Tele-flm Technical Report Li Xiang, Yao Yiqun, Jiang Xin, Fang Xuezhi, Wang Chao, Liu Xinzhang, Wang Zihan, Zhao Yu, Wang Xin, Huang Yuyao, Song Shuangyong, Li Yongxiang, Zhang Zheng, Zhao Bo, Sun Aixin, Wang Yequan, He Zhongjiang, Wang Zhongyuan, Li Xuelong, Huang Tiejun
- What If We Recaption Billions Of Web Images With Llama-3? Li Xianhang, Tu Haoqin, Hui Mude, Wang Zeyu, Zhao Bingchen, Xiao Junfei, Ren Sucheng, Mei Jieru, Liu Qing, Zheng Huangjie, Zhou Yuyin, Xie Cihang
- Preference Tuning For Toxicity Mitigation Generalizes Across Languages Li Xiaochen, Yong Zheng-xin, Bach Stephen H.
- Rethinking Chatgpt's Success: Usability And Cognitive Behaviors Enabled By Auto-regressive Llms' Prompting Li Xinzhe, Liu Ming
- Making Long-context Language Models Better Multi-hop Reasoners Li Yanyang, Liang Shuo, Lyu Michael R., Wang Liwei
- Leveraging Large Language Model As Simulated Patients For Clinical Education Li Yanzeng, Zeng Cheng, Zhong Jialun, Zhang Ruoyu, Zhang Minhao, Zou Lei
- Badedit: Backdooring Large Language Models By Model Editing Li Yanzhou, Li Tianlin, Chen Kangjie, Zhang Jian, Liu Shangqing, Wang Wenhan, Zhang Tianwei, Liu Yang
- Q-adapter: Training Your LLM Adapter As A Residual Q-function Li Yi-chen, Zhang Fuxiang, Qiu Wenjie, Yuan Lei, Jia Chengxing, Zhang Zongzhang, Yu Yang
- Backdoorllm: A Comprehensive Benchmark For Backdoor Attacks On Large Language Models Li Yige, Huang Hanxun, Zhao Yunhan, Ma Xingjun, Sun Jun
- Womd-reasoning: A Large-scale Language Dataset For Interaction And Driving Intentions Reasoning Li Yiheng, Ge Chongjian, Li Chenran, Xu Chenfeng, Tomizuka Masayoshi, Tang Chen, Ding Mingyu, Zhan Wei
- The First Place Solution Of WSDM Cup 2024: Leveraging Large Language Models For Conversational Multi-doc QA Li Yiming, Zhang Zhao
- Data Generation Using Large Language Models For Text Classification: An Empirical Case Study Li Yinheng, Bonatti Rogerio, Abdali Sara, Wagle Justin, Koishida Kazuhito
- Dynamic Data Sampler For Cross-language Transfer Learning In Large Language Models Li Yudong, Feng Yuhao, Zhou Wen, Zhao Zhe, Shen Linlin, Hou Cheng, Hou Xianxu
- Spin Glass Model Of In-context Learning Li Yuhao, Bai Ruoran, Huang Haiping
- Large Language Models For Human-like Autonomous Driving: A Survey Li Yun, Katsumata Kai, Javanmardi Ehsan, Tsukada Manabu
- Cognitive Visual-language Mapper: Advancing Multimodal Comprehension With Enhanced Visual Knowledge Alignment Li Yunxin, Chen Xinyu, Hu Baotian, Shi Haoyuan, Zhang Min
- Uni-moe: Scaling Unified Multimodal Llms With Mixture Of Experts Li Yunxin, Jiang Shenyuan, Hu Baotian, Wang Longyue, Zhong Wanqi, Luo Wenhan, Ma Lin, Zhang Min
- Mmsci: A Multimodal Multi-discipline Dataset For Phd-level Scientific Comprehension Li Zekun, Yang Xianjun, Choi Kyuri, Zhu Wanrong, Hsieh Ryan, Kim Hyeonjung, Lim Jin Hyuk, Ji Sungyoung, Lee Byungju, Yan Xifeng, Petzold Linda Ruth, Wilson Stephen D., Lim Woosang, Wang William Yang
- Do Vision-language Transformers Exhibit Visual Commonsense? An Empirical Study Of VCR Li Zhenyang, Guo Yangyang, Wang Kejie, Chen Xiaolin, Nie Liqiang, Kankanhalli Mohan
- Focusllm: Scaling Llm's Context By Parallel Decoding Li Zhenyu, Zhang Yike, Pan Tengyu, Sun Yutao, Duan Zhichao, Fang Junjie, Han Rong, Wang Zixuan, Wang Jianyong
- EAGLE: Elevating Geometric Reasoning Through Llm-empowered Visual Instruction Tuning Li Zhihao, Du Yao, Liu Yang, Zhang Yan, Liu Yufang, Zhang Mengdi, Cai Xunliang
- Synthesize Step-by-step: Tools, Templates And Llms As Data Generators For Reasoning-based Chart VQA Li Zhuowan, Jasani Bhavan, Tang Peng, Ghadar Shabnam
- Two Stacks Are Better Than One: A Comparison Of Language Modeling And Translation As Multilingual Pretraining Objectives Li Zihao, Ji Shaoxiong, Mickus Timothee, Segonne Vincent, Tiedemann Jörg
- Quantifying Multilingual Performance Of Large Language Models Across Languages Li Zihao, Shi Yucheng, Liu Zirui, Yang Fan, Payani Ali, Liu Ninghao, Du Mengnan
- Intermediate Distillation: Data-efficient Distillation From Black-box Llms For Information Retrieval Li Zizhong, Zhang Haopeng, Zhang Jiawei
- 500xcompressor: Generalized Prompt Compression For Large Language Models Li Zongqian, Su Yixuan, Collier Nigel
- What Is The Best Model? Application-driven Evaluation For Large Language Models Lian Shiguo, Zhao Kaikai, Liu Xinhui, Lei Xuejiao, Yang Bikun, Zhang Wenjing, Wang Kai, Liu Zhaoxiang
- Online Training Of Large Language Models: Learn While Chatting Liang Juhao, Wang Ziwei, Ma Zhuoheng, Li Jianquan, Zhang Zhiyi, Wu Xiangbo, Wang Benyou
- Task Oriented In-domain Data Augmentation Liang Xiao, Hu Xinyu, Zuo Simiao, Gong Yeyun, Lou Qiang, Liu Yi, Huang Shao-lun, Jiao Jian
- Fennec: Fine-grained Language Model Evaluation And Correction Extended Through Branching And Bridging Liang Xiaobo, Zhang Haoke, Hu Helan, Li Juntao, Xu Jun, Zhang Min
- Languashrink: Reducing Token Overhead With Psycholinguistics Liang Xuechen, Tao Meiling, Xia Yinghui, Shi Tianyu, Wang Jun, Yang Jingsong
- Controllable Text Generation For Large Language Models: A Survey Liang Xun, Wang Hanyu, Wang Yezhaohui, Song Shichao, Yang Jiawei, Niu Simin, Hu Jie, Liu Dan, Yao Shunyu, Xiong Feiyu, Li Zhiyu
- Learning To Trust Your Feelings: Leveraging Self-awareness In Llms For Hallucination Mitigation Liang Yuxin, Song Zhuoyang, Wang Hao, Zhang Jiaxing
- Mathchat: Benchmarking Mathematical Reasoning And Instruction Following In Multi-turn Interactions Liang Zhenwen, Yu Dian, Yu Wenhao, Yao Wenlin, Zhang Zhihan, Zhang Xiangliang, Yu Dong
- Why Are My Prompts Leaked? Unraveling Prompt Extraction Threats In Customized Large Language Models Liang Zi, Hu Haibo, Ye Qingqing, Xiao Yaxin, Li Haoyang
- IKUN For WMT24 General MT Task: Llms Are Here For Multilingual Machine Translation Liao Baohao, Herold Christian, Khadivi Shahram, Monz Christof
- Beyond KV Caching: Shared Attention For Efficient Llms Liao Bingli, Vargas Danilo Vasconcellos
- Extending Token Computation For LLM Reasoning Liao Bingli, Vargas Danilo Vasconcellos
- Enhancing Reinforcement Learning With Label-sensitive Reward For Natural Language Understanding Liao Kuo, Li Shuang, Zhao Meng, Liu Liqun, Xue Mengge, Hu Zhenyu, Han Honglin, Yin Chengguo
- MARIO: Math Reasoning With Code Interpreter Output -- A Reproducible Pipeline Liao Minpeng, Luo Wei, Li Chengxi, Wu Jing, Fan Kai
- E2LLM: Encoder Elongated Large Language Models For Long-context Understanding And Reasoning Liao Zihan, Wang Jun, Yu Hang, Wei Lingxiao, Li Jianguo, Wang Jun, Zhang Wei
- Large Language Models As Recommender Systems: A Study Of Popularity Bias Lichtenberg Jan Malte, Buchholz Alexander, Schwöbel Pola
- Adapting Mental Health Prediction Tasks For Cross-lingual Learning Via Meta-training And In-context Learning With Large Language Model Lifelo Zita, Ning Huansheng, Dhelim Sahraoui
- The Power Of Words: Generating Powershell Attacks From Natural Language Liguori Pietro, Marescalco Christian, Natella Roberto, Orbinato Vittorio, Pianese Luciano
- Inducing Generalization Across Languages And Tasks Using Featurized Low-rank Mixtures Lin Chu-cheng, Wang Xinyi, Clark Jonathan H., Lu Han, Zhu Yun, Whitehouse Chenxi, Yu Hongkun
- Fine-tuned Network Relies On Generic Representation To Solve Unseen Cognitive Task Lin Dongyan
- Probing Large Language Models For Scalar Adjective Lexical Semantics And Scalar Diversity Pragmatics Lin Fangru, Altshuler Daniel, Pierrehumbert Janet B.
- Mixture Of In-context Experts Enhance Llms' Long Context Awareness Lin Hongzhan, Lv Ang, Chen Yuhan, Zhu Chen, Song Yang, Zhu Hengshu, Yan Rui
- Token-wise Influential Training Data Retrieval For Large Language Models Lin Huawei, Long Jikai, Xu Zhaozhuo, Zhao Weijie
- Large Language Models Make Sample-efficient Recommender Systems Lin Jianghao, Dai Xinyi, Shan Rong, Chen Bo, Tang Ruiming, Yu Yong, Zhang Weinan
- How Can I Get It Right? Using GPT To Rephrase Incorrect Trainee Responses Lin Jionghao, Han Zifei, Thomas Danielle R., Gurung Ashish, Gupta Shivang, Aleven Vincent, Koedinger Kenneth R.
- Syncpkl: Harnessing Llms To Generate Synthetic Data For Commonsense Persona Knowledge Linking Lin Kuan-yen
- Single Character Perturbations Break LLM Alignment Lin Leon, Brown Hannah, Kawaguchi Kenji, Shieh Michael
- Mala-500: Massive Language Adaptation Of Large Language Models Lin Peiqin, Ji Shaoxiong, Tiedemann Jörg, Martins André F. T., Schütze Hinrich
- FLAME: Factuality-aware Alignment For Large Language Models Lin Sheng-chieh, Gao Luyu, Oguz Barlas, Xiong Wenhan, Lin Jimmy, Yih Wen-tau, Chen Xilun
- Dogerm: Equipping Reward Models With Domain Knowledge Through Model Merging Lin Tzu-han, Li Chen-an, Lee Hung-yi, Chen Yun-nung
- Draw-and-understand: Leveraging Visual Prompts To Enable Mllms To Comprehend What You Want Lin Weifeng, Wei Xinyu, An Ruichuan, Gao Peng, Zou Bocheng, Luo Yulin, Huang Siyuan, Zhang Shanghang, Li Hongsheng
- Efficient LLM Training And Serving With Heterogeneous Context Sharding Among Attention Heads Lin Xihui, Zhang Yunan, Ge Suyu, Patra Barun, Chaudhary Vishrav, Peng Hao, Song Xia
- Rethinking Visual Prompting For Multimodal Large Language Models With External Knowledge Lin Yuanze, Li Yunsheng, Chen Dongdong, Xu Weijian, Clark Ronald, Torr Philip, Yuan Lu
- Unveiling Vulnerability Of Self-attention Liong Khai Jiet, Wu Hongqiu, Zhao Hai
- Exploring Llms As A Source Of Targeted Synthetic Textual Data To Minimize High Confidence Misclassifications Lippmann Philip, Spaan Matthijs T. J., Yang Jie
- PANDA: Preference Adaptation For Enhancing Domain-specific Abilities Of Llms Liu An, Yang Zonghan, Zhang Zhenhe, Hu Qingyuan, Li Peng, Yan Ming, Zhang Ji, Huang Fei, Liu Yang
- Is Translation All You Need? A Study On Solving Multilingual Tasks With Large Language Models Liu Chaoqun, Zhang Wenxuan, Zhao Yiran, Luu Anh Tuan, Bing Lidong
- More Than Catastrophic Forgetting: Integrating General Capabilities For Domain-specific Llms Liu Chengyuan, Wang Shihang, Kang Yangyang, Qing Lizhi, Zhao Fubang, Sun Changlong, Kuang Kun, Wu Fei
- Few-shot Class Incremental Learning With Attention-aware Self-adaptive Prompt Liu Chenxi, Wang Zhenyi, Xiong Tianyi, Chen Ruibo, Wu Yihan, Guo Junfeng, Huang Heng
- Llmembed: Rethinking Lightweight Llm's Genuine Function In Text Classification Liu Chun, Zhang Hongguang, Zhao Kainan, Ju Xinghai, Yang Lin
- Cool-fusion: Fuse Large Language Models Without Training Liu Cong, Quan Xiaojun, Pan Yan, Lin Liang, Wu Weigang, Chen Xu
- SPHINX-X: Scaling Data And Parameters For A Family Of Multi-modal Large Language Models Liu Dongyang, Zhang Renrui, Qiu Longtian, Huang Siyuan, Lin Weifeng, Zhao Shitian, Geng Shijie, Lin Ziyi, Jin Peng, Zhang Kaipeng, Shao Wenqi, Xu Chao, He Conghui, He Junjun, Shao Hao, Lu Pan, Li Hongsheng, Qiao Yu, Gao Peng
- Lumina-mgpt: Illuminate Flexible Photorealistic Text-to-image Generation With Multimodal Generative Pretraining Liu Dongyang, Zhao Shitian, Zhuo Le, Lin Weifeng, Qiao Yu, Li Hongsheng, Gao Peng
- A Practice-friendly Two-stage Llm-enhanced Paradigm In Sequential Recommendation Liu Dugang, Xian Shenxian, Lin Xiaolin, Zhang Xiaolian, Zhu Hong, Fang Yuan, Chen Zhen, Ming Zhong
- Pefomed: Parameter Efficient Fine-tuning Of Multimodal Large Language Models For Medical Imaging Liu Gang, He Jinlong, Li Pengfei, He Genrong, Chen Zhaolin, Zhong Shenjun
- On The Intrinsic Self-correction Capability Of Llms: Uncertainty And Latent Concept Liu Guangliang, Mao Haitao, Cao Bochuan, Xue Zhiyu, Johnson Kristen, Tang Jiliang, Wang Rongrong
- Se^2: Sequential Example Selection For In-context Learning Liu Haoyu, Liu Jianfeng, Huang Shaohan, Zhan Yuefeng, Sun Hao, Deng Weiwei, Wei Furu, Zhang Qi
- Laida: Linguistics-aware In-context Learning With Data Augmentation For Metaphor Components Identification Liu Hongde, He Chenyuan, Meng Feiyang, Niu Changyong, Jia Yuxiang
- Bucket Pre-training Is All You Need Liu Hongtao, Peng Qiyao, Yang Qing, Liu Kai, Xu Hongyan
- Unraveling The Mechanics Of Learning-based Demonstration Selection For In-context Learning Liu Hui, Wang Wenya, Sun Hao, Tian Chris Xing, Kong Chenqi, Dong Xin, Li Haoliang
- Training-free Activation Sparsity In Large Language Models Liu James, Ponnusamy Pragaash, Cai Tianle, Guo Han, Kim Yoon, Athiwaratkun Ben
- Bitdelta: Your Fine-tune May Only Be Worth One Bit Liu James, Xiao Guangxuan, Li Kai, Lee Jason D., Han Song, Dao Tri, Cai Tianle
- Speculative Decoding Via Early-exiting For Faster LLM Inference With Thompson Sampling Control Mechanism Liu Jiahao, Wang Qifan, Wang Jingang, Cai Xunliang
- E^2-LLM: Efficient And Extreme Length Extension Of Large Language Models Liu Jiaheng, Bai Zhiqi, Zhang Yuanxing, Zhang Chenchen, Zhang Yu, Zhang Ge, Wang Jiakai, Que Haoran, Chen Yukang, Su Wenbo, Ge Tiezheng, Fu Jie, Chen Wenhu, Zheng Bo
- Kangaroo: A Powerful Video-language Model Supporting Long-context Video Input Liu Jiajun, Wang Yibing, Ma Hanghang, Wu Xiaoping, Ma Xiaoqi, Wei Xiaoming, Jiao Jianbin, Wu Enhua, Hu Jie
- Tiny Refinements Elicit Resilience: Toward Efficient Prefix-model Against LLM Red-teaming Liu Jiaxu, Yin Xiangyu, Wu Sihao, Wang Jianhong, Fang Meng, Yi Xinping, Huang Xiaowei
- Iterative Length-regularized Direct Preference Optimization: A Case Study On Improving 7B Language Models To GPT-4 Level Liu Jie, Zhou Zhanhui, Liu Jiaheng, Bu Xingyuan, Yang Chao, Zhong Han-sen, Ouyang Wanli
- Mm-instruct: Generated Visual Instructions For Large Multimodal Model Alignment Liu Jihao, Huang Xin, Zheng Jinliang, Liu Boxiao, Wang Jia, Yoshie Osamu, Liu Yu, Li Hongsheng
- On The Universal Truthfulness Hyperplane Inside Llms Liu Junteng, Chen Shiqi, Cheng Yu, He Junxian
- Educating Llms Like Human Students: Structure-aware Injection Of Domain Knowledge Liu Kai, Chen Ze, Fu Zhihang, Jiang Rongxin, Zhou Fan, Chen Yaowu, Wu Yue, Ye Jieping
- Synthetic Context Generation For Question Generation Liu Naiming, Wang Zichao, Baraniuk Richard
- STORE: Streamlining Semantic Tokenization And Generative Recommendation With A Single LLM Liu Qijiong, Zhu Jieming, Fan Lu, Zhao Zhou, Wu Xiao-ming
- Monotonic Paraphrasing Improves Generalization Of Language Model Prompting Liu Qin, Wang Fei, Xu Nan, Yan Tianyi, Meng Tao, Chen Muhao
- Intactkv: Improving Large Language Model Quantization By Keeping Pivot Tokens Intact Liu Ruikang, Bai Haoli, Lin Haokun, Li Yuening, Gao Han, Xu Zhengzhuo, Hou Lu, Yao Jun, Yuan Chun
- Flashback: Efficient Retrieval-augmented Language Modeling For Long Context Inference Liu Runheng, Xiao Xingchen, Huang Heyan, Chi Zewen, Wu Zhijing
- Latent Guard: A Safety Framework For Text-to-image Generation Liu Runtao, Khakzar Ashkan, Gu Jindong, Chen Qifeng, Torr Philip, Pizzati Fabio
- ST-LLM: Large Language Models Are Effective Temporal Learners Liu Ruyang, Li Chen, Tang Haoran, Ge Yixiao, Shan Ying, Li Ge
- SPINACH: Sparql-based Information Navigation For Challenging Real-world Questions Liu Shicheng, Semnani Sina J., Triedman Harold, Xu Jialiang, Zhao Isaac Dan, Lam Monica S.
- Are Llms Effective Backbones For Fine-tuning? An Experimental Investigation Of Supervised Llms On Chinese Short Text Matching Liu Shulin, Xu Chengcheng, Liu Hao, Yu Tinghao, Yang Tao
- Making Them Ask And Answer: Jailbreaking Large Language Models In Few Queries Via Disguise And Reconstruction Liu Tong, Zhang Yingjie, Zhao Zhe, Dong Yinpeng, Meng Guozhu, Chen Kai
- Toolace: Winning The Points Of LLM Function Calling Liu Weiwen, Huang Xu, Zeng Xingshan, Hao Xinlong, Yu Shuai, Li Dexun, Wang Shuai, Gan Weinan, Liu Zhengying, Yu Yuanqing, Wang Zezhong, Wang Yuxian, Ning Wu, Hou Yutai, Wang Bin, Wu Chuhan, Wang Xinzhi, Liu Yong, Wang Yasheng, Tang Duyu, Tu Dandan, Shang Lifeng, Jiang Xin, Tang Ruiming, Lian Defu, Liu Qun, Chen Enhong
- Demorank: Selecting Effective Demonstrations For Large Language Models In Ranking Task Liu Wenhan, Zhu Yutao, Dou Zhicheng
- Cmm-math: A Chinese Multimodal Math Dataset To Evaluate And Enhance The Mathematics Reasoning Of Large Multimodal Models Liu Wentao, Pan Qianjun, Zhang Yi, Liu Zhuo, Wu Ji, Zhou Jie, Zhou Aimin, Chen Qin, Jiang Bo, He Liang
- Extensive Self-contrast Enables Feedback-free Language Model Alignment Liu Xiao, Song Xixuan, Dong Yuxiao, Tang Jie
- Farewell To Length Extrapolation, A Training-free Infinite Context With Finite Attention Scope Liu Xiaoran, Guo Qipeng, Song Yuerong, Liu Zhigeng, Lv Kai, Yan Hang, Li Linlin, Liu Qun, Qiu Xipeng
- Adaptive Draft-verification For Efficient Large Language Model Decoding Liu Xukun, Lei Bowen, Zhang Ruqi, Xu Dongkuan
- Datasets For Large Language Models: A Comprehensive Survey Liu Yang, Cao Jiahuan, Liu Chongyu, Ding Kai, Jin Lianwen
- Do Large Language Models Possess Sensitive To Sentiment? Liu Yang, Zhu Xichou, Shen Zhou, Liu Yi, Li Min, Chen Yujun, John Benzi, Ma Zhenzhen, Hu Tao, Xu Zhiyang, Luo Wei, Wang Junhui
- RA-ISF: Learning To Answer And Understand From Retrieval Augmentation Via Iterative Self-feedback Liu Yanming, Peng Xinyue, Zhang Xuhong, Liu Weihao, Yin Jianwei, Cao Jiannan, Du Tianyu
- Understanding Llms: A Comprehensive Overview From Training To Inference Liu Yiheng, He Hao, Han Tianle, Zhang Xu, Liu Mengyuan, Tian Jiaming, Zhang Yutong, Wang Jiaqi, Gao Xiaohui, Zhong Tianyang, Pan Yi, Xu Shaochen, Wu Zihao, Liu Zhengliang, Zhang Xin, Zhang Shu, Hu Xintao, Zhang Tuo, Qiang Ning, Liu Tianming, Ge Bao
- SAFETY-J: Evaluating Safety With Critique Liu Yixiu, Zheng Yuxiang, Xia Shijie, Li Jiajun, Tu Yi, Song Chaoling, Liu Pengfei
- From Summary To Action: Enhancing Large Language Models For Complex Tasks With Open World Apis Liu Yulong, Yuan Yunlong, Wang Chunwei, Han Jianhua, Ma Yongqiang, Zhang Li, Zheng Nanning, Xu Hang
- Hd-eval: Aligning Large Language Model Evaluators Through Hierarchical Criteria Decomposition Liu Yuxuan, Yang Tianchi, Huang Shaohan, Zhang Zihan, Huang Haizhen, Wei Furu, Deng Weiwei, Sun Feng, Zhang Qi
- Adamole: Fine-tuning Large Language Models With Adaptive Mixture Of Low-rank Adaptation Experts Liu Zefang, Luo Jiahua
- Synthvlm: High-efficiency And High-quality Synthetic Data For Vision Language Models Liu Zheng, Liang Hao, Huang Xijie, Xiong Wentao, Yu Qinhan, Sun Linzhuang, Chen Chong, He Conghui, Cui Bin, Zhang Wentao
- Personality-aware Student Simulation For Conversational Intelligent Tutoring Systems Liu Zhengyuan, Yin Stella Xin, Lin Geyu, Chen Nancy F.
- Probing Language Models For Pre-training Data Detection Liu Zhenhua, Zhu Tong, Tan Chuanyuan, Lu Haonan, Liu Bing, Chen Wenliang
- Controllable And Diverse Data Augmentation With Large Language Model For Low-resource Open-domain Dialogue Generation Liu Zhenhua, Zhu Tong, Xiang Jianxiang, Chen Wenliang
- Mixture Of Insightful Experts (mote): The Synergy Of Thought Chains And Expert Mixtures In Self-alignment Liu Zhili, Gou Yunhao, Chen Kai, Hong Lanqing, Gao Jiahui, Mi Fei, Zhang Yu, Li Zhenguo, Jiang Xin, Liu Qun, Kwok James T.
- Enabling Weak Llms To Judge Response Reliability Via Meta Ranking Liu Zijun, Kou Boqun, Li Peng, Yan Ming, Zhang Ji, Huang Fei, Liu Yang
- Less Is More: Data Value Estimation For Visual Instruction Tuning Liu Zikang, Zhou Kun, Zhao Wayne Xin, Gao Dawei, Li Yaliang, Wen Ji-rong
- A Closer Look Into Mixture-of-experts In Large Language Models Lo Ka Man, Huang Zeyu, Qiu Zihan, Wang Zili, Fu Jie
- Large Language Models Relearn Removed Concepts Lo Michelle, Cohen Shay B., Barez Fazl
- Loqt: Low-rank Adapters For Quantized Pre-training Loeschcke Sebastian, Toftrup Mads, Kastoryano Michael J., Belongie Serge, Snæbjarnarson Vésteinn
- Llms Are Biased Towards Output Formats! Systematically Evaluating And Mitigating Output Format Bias Of Llms Long Do Xuan, Ngoc Hai Nguyen, Sim Tiviatis, Dao Hieu, Joty Shafiq, Kawaguchi Kenji, Chen Nancy F., Kan Min-yen
- Large Language Models Know What Makes Exemplary Contexts Long Quanyu, Chen Jianda, Wang Wenya, Pan Sinno Jialin
- Generative Multi-modal Knowledge Retrieval With Large Language Models Long Xinwei, Zeng Jiali, Meng Fandong, Ma Zhiyuan, Zhang Kaiyan, Zhou Bowen, Zhou Jie
- Glória -- A Generative And Open Large Language Model For Portuguese Lopes Ricardo, Magalhães João, Semedo David
- Personalitychat: Conversation Distillation For Personalized Dialog Modeling With Facts And Traits Lotfi Ehsan, De Bruyn Maxime, Buhmann Jeska, Daelemans Walter
- Sparser Is Faster And Less Is More: Efficient Sparse Attention For Long-range Transformers Lou Chao, Jia Zixia, Zheng Zilong, Tu Kewei
- Efficient Encoder-decoder Transformer Decoding For Decomposable Tasks Lu Bo-ru, Haduong Nikita, Lin Chien-yu, Cheng Hao, Smith Noah A., Ostendorf Mari
- Deepseek-vl: Towards Real-world Vision-language Understanding Lu Haoyu, Liu Wen, Zhang Bo, Wang Bingxuan, Dong Kai, Liu Bo, Sun Jingxiang, Ren Tongzheng, Li Zhuoshu, Yang Hao, Sun Yaofeng, Deng Chengqi, Xu Hanwei, Xie Zhenda, Ruan Chong
- Towards Transfer Unlearning: Empirical Evidence Of Cross-domain Bias Mitigation Lu Huimin, Isonuma Masaru, Mori Junichiro, Sakata Ichiro
- YODA: Teacher-student Progressive Learning For Language Models Lu Jianqiao, Zhong Wanjun, Wang Yufei, Guo Zhijiang, Zhu Qi, Huang Wenyong, Wang Yanlin, Mi Fei, Wang Baojun, Wang Yasheng, Shang Lifeng, Jiang Xin, Liu Qun
- FIPO: Free-form Instruction-oriented Prompt Optimization With Preference Dataset And Modular Fine-tuning Schema Lu Junru, An Siyu, Zhang Min, He Yulan, Yin Di, Sun Xing
- Datasculpt: Crafting Data Landscapes For LLM Post-training Through Multi-objective Partitioning Lu Keer, Liang Zheng, Nie Xiaonan, Pan Da, Zhang Shusen, Zhao Keshi, Chen Weipeng, Zhou Zenan, Dong Guosheng, Zhang Wentao, Cui Bin
- Online Merging Optimizers For Boosting Rewards And Mitigating Tax In Alignment Lu Keming, Yu Bowen, Huang Fei, Fan Yang, Lin Runji, Zhou Chang
- Every Language Counts: Learn And Unlearn In Multilingual Llms Lu Taiming, Koehn Philipp
- Fine-tuning Large Language Models For Domain Adaptation: Exploration Of Training Strategies, Scaling, Model Merging And Synergistic Capabilities Lu Wei, Luu Rachel K., Buehler Markus J.
- Not All Experts Are Equal: Efficient Expert Pruning And Skipping For Mixture-of-experts Large Language Models Lu Xudong, Liu Qi, Xu Yuhui, Zhou Aojun, Huang Siyuan, Zhang Bo, Yan Junchi, Li Hongsheng
- SPP: Sparsity-preserved Parameter-efficient Fine-tuning For Large Language Models Lu Xudong, Zhou Aojun, Xu Yuhui, Zhang Renrui, Gao Peng, Li Hongsheng
- Longheads: Multi-head Attention Is Secretly A Long Context Processor Lu Yi, Zhou Xin, He Wei, Zhao Jun, Ji Tao, Gui Tao, Zhang Qi, Huang Xuanjing
- Llamax: Scaling Linguistic Horizons Of LLM By Enhancing Translation Capabilities Beyond 100 Languages Lu Yinquan, Zhu Wenhao, Li Lei, Qiao Yu, Yuan Fei
- Asymptotic Theory Of In-context Learning By Linear Attention Lu Yue M., Letey Mary I., Zavatone-veth Jacob A., Maiti Anindita, Pehlevan Cengiz
- To Err Is Human, But Llamas Can Learn It Too Luhtaru Agnes, Purason Taido, Vainikko Martin, Del Maksym, Fishel Mark
- Can Perplexity Predict Fine-tuning Performance? An Investigation Of Tokenization Effects On Sequential Language Models For Nepali Luitel Nishant, Bekoju Nirajan, Sah Anand Kumar, Shakya Subarna
- MINI-SEQUENCE TRANSFORMER: Optimizing Intermediate Memory For Long Sequences Training Luo Cheng, Zhao Jiawei, Chen Zhuoming, Chen Beidi, Anandkumar Anima
- Layoutllm: Layout Instruction Tuning With Large Language Models For Document Understanding Luo Chuwei, Shen Yufan, Zhu Zhaoqing, Zheng Qi, Yu Zhi, Yao Cong
- Crosstune: Black-box Few-shot Classification With Label Enhancement Luo Danqing, Zhang Chen, Zhang Yan, Li Haizhou
- Autom3l: An Automated Multimodal Machine Learning Framework With Large Language Models Luo Daqin, Feng Chengjian, Nong Yuxuan, Shen Yiqing
- Arena Learning: Build Data Flywheel For Llms Post-training Via Simulated Chatbot Arena Luo Haipeng, Sun Qingfeng, Xu Can, Zhao Pu, Lin Qingwei, Lou Jianguang, Chen Shifeng, Tang Yansong, Chen Weizhu
- Decoupled Alignment For Robust Plug-and-play Adaptation Luo Haozheng, Yu Jiahao, Zhang Wenxin, Li Jialong, Hu Jerry Yao-chieh, Xing Xinyu, Liu Han
- Kuaiji: The First Chinese Accounting Large Language Model Luo Jiayuan, Yang Songhua, Qiu Xiaoling, Chen Panyu, Nai Yufei, Zeng Wenxuan, Zhang Wentao, Jiang Xinke
- BGE Landmark Embedding: A Chunking-free Embedding Method For Retrieval Augmented Long-context Large Language Models Luo Kun, Liu Zheng, Xiao Shitao, Liu Kang
- Large Language Models As Foundations For Next-gen Dense Retrieval: A Comprehensive Empirical Assessment Luo Kun, Qin Minghao, Liu Zheng, Xiao Shitao, Zhao Jun, Liu Kang
- DEEM: Diffusion Models Serve As The Eyes Of Large Language Models For Image Perception Luo Run, Li Yunshui, Chen Longze, He Wanwei, Lin Ting-en, Liu Ziqiang, Zhang Lei, Song Zikai, Xia Xiaobo, Liu Tongliang, Yang Min, Hui Binyuan
- Duetsim: Building User Simulator With Dual Large Language Models For Task-oriented Dialogues Luo Xiang, Tang Zhiwen, Wang Jin, Zhang Xuejie
- Zero-shot Cross-domain Dialogue State Tracking Via Dual Low-rank Adaptation Luo Xiang, Tang Zhiwen, Wang Jin, Zhang Xuejie
- Turning Trash Into Treasure: Accelerating Inference Of Large Language Models With Token Recycling Luo Xianzhen, Wang Yixuan, Zhu Qingfu, Zhang Zhiming, Zhang Xuanyu, Yang Qing, Xu Dongliang, Che Wanxiang
- Matching Domain Experts By Training From Scratch On Domain Knowledge Luo Xiaoliang, Sun Guangzhi, Love Bradley C.
- Ensuring Safe And High-quality Outputs: A Guideline Library Approach For Language Models Luo Yi, Lin Zhenghao, Zhang Yuhao, Sun Jiashuo, Lin Chen, Xu Chengjin, Su Xiangdong, Shen Yelong, Guo Jian, Gong Yeyun
- TRAWL: Tensor Reduced And Approximated Weights For Large Language Models Luo Yiran, Patel Het, Fu Yu, Ahn Dawon, Chen Jia, Dong Yue, Papalexakis Evangelos E.
- Are Large Language Models True Healthcare Jacks-of-all-trades? Benchmarking Across Health Professions Beyond Physician Exams Luo Zheheng, Yuan Chenhan, Xie Qianqian, Ananiadou Sophia
- Reft: Reasoning With Reinforced Fine-tuning Luong Trung Quoc, Zhang Xinbo, Jie Zhanming, Sun Peng, Jin Xiaoran, Li Hang
- WILBUR: Adaptive In-context Learning For Robust And Accurate Web Agents Lutz Michael, Bohra Arth, Saroyan Manvel, Harutyunyan Artem, Campagna Giovanni
- Codeact: Code Adaptive Compute-efficient Tuning Framework For Code Llms Lv Weijie, Xia Xuan, Huang Sheng-jun
- Keeping Llms Aligned After Fine-tuning: The Crucial Role Of Prompt Templates Lyu Kaifeng, Zhao Haoyu, Gu Xinran, Yu Dingli, Goyal Anirudh, Arora Sanjeev
- Automatic Summarization Of Doctor-patient Encounter Dialogues Using Large Language Model Through Prompt Tuning Lyu Mengxian, Peng Cheng, Li Xiaohan, Balian Patrick, Bian Jiang, Wu Yonghui
- Knowtuning: Knowledge-aware Fine-tuning For Large Language Models Lyu Yougang, Yan Lingyong, Wang Shuaiqiang, Shi Haibo, Yin Dawei, Ren Pengjie, Chen Zhumin, De Rijke Maarten, Ren Zhaochun
- First Activations Matter: Training-free Methods For Dynamic Activation In Large Language Models Ma Chi, Huang Mincong, Zhang Ying, Wang Chao, Wang Yujie, Yu Lei, Liu Chuan, Lin Wei
- Sparsity-accelerated Training For Large Language Models Ma Da, Chen Lu, Wang Pengyu, Xu Hongshen, Li Hanqi, Sun Liangtai, Zhu Su, Fan Shuai, Yu Kai
- Neighboring Perturbations Of Knowledge Editing On Large Language Models Ma Jun-yu, Ling Zhen-hua, Zhang Ningyu, Gu Jia-chen
- Perturbation-restrained Sequential Model Editing Ma Jun-yu, Wang Hong, Xu Hao-xiang, Ling Zhen-hua, Gu Jia-chen
- FBI-LLM: Scaling Up Fully Binarized Llms From Scratch Via Autoregressive Distillation Ma Liqun, Sun Mingjie, Shen Zhiqiang
- Eventlens: Leveraging Event-aware Pretraining And Cross-modal Linking Enhances Visual Commonsense Reasoning Ma Mingjie, Yu Zhihuan, Ma Yichao, Li Guohui
- Making Pre-trained Language Models Better Continual Few-shot Relation Extractors Ma Shengkun, Han Jiale, Liang Yi, Cheng Bo
- The Era Of 1-bit Llms: All Large Language Models Are In 1.58 Bits Ma Shuming, Wang Hongyu, Ma Lingxiao, Wang Lei, Wang Wenhui, Huang Shaohan, Dong Li, Wang Ruiping, Xue Jilong, Wei Furu
- Aligned With LLM: A New Multi-modal Training Paradigm For Encoding Fmri Activity In Visual Cortex Ma Shuxiao, Wang Linyuan, Hou Senbao, Yan Bin
- Clawmachine: Fetching Visual Tokens As An Entity For Referring And Grounding Ma Tianren, Xie Lingxi, Tian Yunjie, Yang Boyu, Zhang Yuan, Doermann David, Ye Qixiang
- Megalodon: Efficient LLM Pretraining And Inference With Unlimited Context Length Ma Xuezhe, Yang Xiaomeng, Xiong Wenhan, Chen Beidi, Yu Lili, Zhang Hao, May Jonathan, Zettlemoyer Luke, Levy Omer, Zhou Chunting
- Sciagent: Tool-augmented Language Models For Scientific Reasoning Ma Yubo, Gou Zhibin, Hao Junheng, Xu Ruochen, Wang Shuohang, Pan Liangming, Yang Yujiu, Cao Yixin, Sun Aixin, Awadalla Hany, Chen Weizhu
- Compositional API Recommendation For Library-oriented Code Generation Ma Zexiong, An Shengnan, Xie Bing, Lin Zeqi
- Llmparser: An Exploratory Study On Using Large Language Models For Log Parsing Ma Zeyang, Chen An Ran, Kim Dong Jae, Chen Tse-hsun, Wang Shaowei
- Llamoco: Instruction Tuning Of Large Language Models For Optimization Code Generation Ma Zeyuan, Guo Hongshu, Chen Jiacheng, Peng Guojun, Cao Zhiguang, Ma Yining, Gong Yue-jiao
- LLM In-context Recall Is Prompt Dependent Machlab Daniel, Battle Rick
- M2lingual: Enhancing Multilingual, Multi-turn Instruction Alignment In Large Language Models Maheshwary Rishabh, Yadav Vikas, Nguyen Hoang, Mahajan Khyati, Madhusudhan Sathwik Tejaswi
- Too Late To Train, Too Early To Use? A Study On Necessity And Viability Of Low-resource Bengali Llms Mahfuz Tamzeed, Dey Satak Kumar, Naswan Ruwad, Adil Hasnaen, Sayeed Khondker Salman, Shahgir Haz Sameen
- Rephrasing The Web: A Recipe For Compute And Data-efficient Language Modeling Maini Pratyush, Seto Skyler, Bai He, Grangier David, Zhang Yizhe, Jaitly Navdeep
- FRACTAL: Fine-grained Scoring From Aggregate Text Labels Makhija Yukti, Agrawal Priyanka, Saket Rishi, Raghuveer Aravindan
- COPAL: Continual Pruning In Large Language Generative Models Malla Srikanth, Choi Joon Hee, Choi Chiho
- ULLME: A Unified Framework For Large Language Model Embeddings With Generation-augmented Learning Man Hieu, Ngo Nghia Trung, Dernoncourt Franck, Nguyen Thien Huu
- Mitigating Hallucinations In Large Vision-language Models (lvlms) Via Language-contrastive Decoding (LCD) Manevich Avshalom, Tsarfaty Reut
- Coalitions Of Large Language Models Increase The Robustness Of AI Agents Mangal Prattyush, Mak Carol, Kanakis Theo, Donovan Timothy, Braines Dave, Pyzer-knapp Edward
- Visually Guided Generative Text-layout Pre-training For Document Intelligence Mao Zhiming, Bai Haoli, Hou Lu, Wei Jiansheng, Jiang Xin, Liu Qun, Wong Kam-fai
- Tuning Llms With Contrastive Alignment Instructions For Machine Translation In Unseen, Low-resource Languages Mao Zhuoyuan, Yu Yen
- Openllm-ro -- Technical Report On Open-source Romanian Llms Masala Mihai, Ilie-ablachim Denis C., Corlatescu Dragos, Zavelca Miruna, Leordeanu Marius, Velicu Horia, Popescu Marius, Dascalu Mihai, Rebedea Traian
- "Vorbești Românește?" A Recipe To Train Powerful Romanian Llms With English Instructions Masala Mihai, Ilie-ablachim Denis C., Dima Alexandru, Corlatescu Dragos, Zavelca Miruna, Olaru Ovio, Terian Simina, Terian Andrei, Leordeanu Marius, Velicu Horia, Popescu Marius, Dascalu Mihai, Rebedea Traian
- German Also Hallucinates! Inconsistency Detection In News Summaries With The Absinth Dataset Mascarell Laura, Chalumattu Ribin, Rios Annette
- Datastates-llm: Lazy Asynchronous Checkpointing For Large Language Models Maurya Avinash, Underwood Robert, Rafique M. Mustafa, Cappello Franck, Nicolae Bogdan
- Improving Text-to-image Consistency Via Automatic Prompt Optimization Mañas Oscar, Astolfi Pietro, Hall Melissa, Ross Candace, Urbanek Jack, Williams Adina, Agrawal Aishwarya, Romero-soriano Adriana, Drozdzal Michal
- LLM Critics Help Catch LLM Bugs Mcaleese Nat, Pokorny Rai Michael, Uribe Juan Felipe Ceron, Nitishinskaya Evgenia, Trebacz Maja, Leike Jan
- MM1: Methods, Analysis & Insights From Multimodal LLM Pre-training Mckinzie Brandon, Gan Zhe, Fauconnier Jean-philippe, Dodge Sam, Zhang Bowen, Dufter Philipp, Shah Dhruti, Du Xianzhi, Peng Futang, Weers Floris, Belyi Anton, Zhang Haotian, Singh Karanjeet, Kang Doug, Jain Ankur, Hè Hongyu, Schwarzer Max, Gunter Tom, Kong Xiang, Zhang Aonan, Wang Jianyu, Wang Chong, Du Nan, Lei Tao, Wiseman Sam, Yin Guoli, Lee Mark, Wang Zirui, Pang Ruoming, Grasch Peter, Toshev Alexander, Yang Yinfei
- Injecting New Knowledge Into Large Language Models Via Supervised Fine-tuning Mecklenburg Nick, Lin Yiyou, Li Xiaoxiao, Holstein Daniel, Nunes Leonardo, Malvar Sara, Silva Bruno, Chandra Ranveer, Aski Vijay, Yannam Pavan Kumar Reddy, Aktas Tolga, Hendry Todd
- Inherent Challenges Of Post-hoc Membership Inference For Large Language Models Meeus Matthieu, Jain Shubham, Rei Marek, De Montjoye Yves-alexandre
- Openelm: An Efficient Language Model Family With Open Training And Inference Framework Mehta Sachin, Sekhavat Mohammad Hossein, Cao Qingqing, Horton Maxwell, Jin Yanzi, Sun Chenfan, Mirzadeh Iman, Najibi Mahyar, Belenko Dmitry, Zatloukal Peter, Rastegari Mohammad
- Realhf: Optimized RLHF Training For Large Language Models Through Parameter Reallocation Mei Zhiyu, Fu Wei, Li Kaiwei, Wang Guangju, Zhang Huanchen, Wu Yi
- Base Of Rope Bounds Context Length Men Xin, Xu Mingyu, Wang Bingning, Zhang Qingyu, Lin Hongyu, Han Xianpei, Chen Weipeng
- Soda-eval: Open-domain Dialogue Evaluation In The Age Of Llms Mendonça John, Trancoso Isabel, Lavie Alon
- A Survey Of Resource-efficient LLM And Multimodal Foundation Models Mengwei Xu, Wangsong Yin, Dongqi Cai, Rongjie Yi, Daliang Xu, Qipeng Wang, Bingyang Wu, Yihao Zhao, Chen Yang, Shihe Wang, Qiyang Zhang, Zhenyan Lu, Li Zhang, Shangguang Wang, Yuanchun Li, Yunxin Liu, Xin Jin, Xuanzhe Liu
- Linearizing Large Language Models Mercat Jean, Vasiljevic Igor, Keh Sedrick, Arora Kushal, Dave Achal, Gaidon Adrien, Kollar Thomas
- Superposition Prompting: Improving And Accelerating Retrieval-augmented Generation Merth Thomas, Fu Qichen, Rastegari Mohammad, Najibi Mahyar
- A Comparison Of LLM Finetuning Methods & Evaluation Metrics With Travel Chatbot Use Case Meyer Sonia, Singh Shreya, Tam Bertha, Ton Christopher, Ren Angel
- Subject-driven Text-to-image Generation Via Preference-based Reinforcement Learning Miao Yanting, Loh William, Kothawade Suraj, Poupart Pascal, Rashwan Abdullah, Li Yeqing
- I Have An Attention Bridge To Sell You: Generalization Capabilities Of Modular Translation Architectures Mickus Timothee, Vázquez Raúl, Attieh Joseph
- Velora: Memory Efficient Training Using Rank-1 Sub-token Projections Miles Roy, Reddy Pradyumna, Elezi Ismail, Deng Jiankang
- Large Language Models: A Survey Minaee Shervin, Mikolov Tomas, Nikzad Narjes, Chenaghlu Meysam, Socher Richard, Amatriain Xavier, Gao Jianfeng
- Aligning Query Representation With Rewritten Query And Relevance Judgments In Conversational Search Mo Fengran, Qu Chen, Mao Kelong, Wu Yihong, Su Zhan, Huang Kaiyu, Nie Jian-yun
- Convsdg: Session Data Generation For Conversational Search Mo Fengran, Yi Bole, Mao Kelong, Qu Chen, Huang Kaiyu, Nie Jian-yun
- LSPT: Long-term Spatial Prompt Tuning For Visual Representation Learning Mo Shentong, Wang Yansen, Luo Xufang, Li Dongsheng
- Fight Back Against Jailbreaking Via Prompt Adversarial Tuning Mo Yichuan, Wang Yuji, Wei Zeming, Wang Yisen
- Large Language Model (LLM) AI Text Generation Detection Based On Transformer Deep Learning Algorithm Mo Yuhong, Qin Hao, Dong Yushan, Zhu Ziyi, Li Zhenglin
- RDBE: Reasoning Distillation-based Evaluation Enhances Automatic Essay Scoring Mohammadkhani Ali Ghiasvand
- E2TP: Element To Tuple Prompting Improves Aspect Sentiment Tuple Prediction Mohammadkhani Mohammad Ghiasvand, Ranjbar Niloofar, Momtazi Saeedeh
- Llava-chef: A Multi-modal Generative Model For Food Recipes Mohbat Fnu, Zaki Mohammed J.
- Llm-based Frameworks For API Argument Filling In Task-oriented Conversational Systems Mok Jisoo, Kachuee Mohammad, Dai Shuyang, Ray Shayan, Taghavi Tara, Yoon Sungroh
- Comparison Of Large Language Models For Generating Contextually Relevant Questions Molina Ivo Lodovico, Švábenský Valdemar, Minematsu Tsubasa, Chen Li, Okubo Fumiya, Shimada Atsushi
- Kam-cot: Knowledge Augmented Multimodal Chain-of-thoughts Reasoning Mondal Debjyoti, Modi Suraj, Panda Subhadarshi, Singh Rituraj, Rao Godawari Sudhakar
- Towards Generating Informative Textual Description For Neurons In Language Models Mondal Shrayani, Garodia Rishabh, Qureshi Arbaaz, Lee Taesung, Park Youngja
- Beyond Accuracy: Evaluating The Reasoning Behavior Of Large Language Models -- A Survey Mondorf Philipp, Plank Barbara
- Repliqa: A Question-answering Dataset For Benchmarking Llms On Unseen Reference Content Monteiro Joao, Noel Pierre-andre, Marcotte Etienne, Rajeswar Sai, Zantedeschi Valentina, Vazquez David, Chapados Nicolas, Pal Christopher, Taslakian Perouz
- Synthetic Programming Elicitation And Repair For Text-to-code In Very Low-resource Programming Languages Mora Federico, Wong Justin, Lepe Haley, Bhatia Sahil, Elmaaroufi Karim, Varghese George, Gonzalez Joseph E., Polgreen Elizabeth, Seshia Sanjit A.
- Exploring The Landscape Of Large Language Models: Foundations, Techniques, And Challenges Moradi Milad, Yan Ke, Colwell David, Samwald Matthias, Asgari Rhona
- Language Modelling Approaches To Adaptive Machine Translation Moslem Yasmin
- Are Llms Robust For Spoken Dialogues? Mousavi Seyed Mahed, Roccabruna Gabriel, Alghisi Simone, Rizzoli Massimo, Ravanelli Mirco, Riccardi Giuseppe
- Generative Representational Instruction Tuning Muennighoff Niklas, Su Hongjin, Wang Liang, Yang Nan, Wei Furu, Yu Tao, Singh Amanpreet, Kiela Douwe
- Compact Language Models Via Pruning And Knowledge Distillation Muralidharan Saurav, Sreenivas Sharath Turuvekere, Joshi Raviraj, Chochowski Marcin, Patwary Mostofa, Shoeybi Mohammad, Catanzaro Bryan, Kautz Jan, Molchanov Pavlo
- Novel-wd: Exploring Acquisition Of Novel World Knowledge In Llms Using Prefix-tuning Méloux Maxime, Cerisara Christophe
- Cdquant: Accurate Post-training Weight Quantization Of Large Pre-trained Models Using Greedy Coordinate Descent Nair Pranav Ajit, Suggala Arun Sai
- Aurora-m: The First Open Source Multilingual Language Model Red-teamed According To The U.S. Executive Order Nakamura Taishi, Mishra Mayank, Tedeschi Simone, Chai Yekun, Stillerman Jason T, Friedrich Felix, Yadav Prateek, Laud Tanmay, Chien Vu Minh, Zhuo Terry Yue, Misra Diganta, Bogin Ben, Vu Xuan-son, Karpinska Marzena, Dantuluri Arnav Varma, Kusa Wojciech, Furlanello Tommaso, Yokota Rio, Muennighoff Niklas, Pai Suhas, Adewumi Tosin, Laippala Veronika, Yao Xiaozhe, Junior Adalberto, Ariyak Alpay, Drozd Aleksandr, Clive Jordan, Gupta Kshitij, Chen Liangyu, Sun Qi, Tsui Ken, Persaud Noah, Fahmy Nour, Chen Tianlong, Bansal Mohit, Monti Nicolo, Dang Tai, Luo Ziyang, Bui Tien-tung, Navigli Roberto, Mehta Virendra, Blumberg Matthew, May Victor, Nguyen Huu, Pyysalo Sampo
- Pii-compass: Guiding LLM Training Data Extraction Prompts Towards The Target PII Via Grounding Nakka Krishna Kanth, Frikha Ahmed, Mendes Ricardo, Jiang Xue, Zhou Xuebing
- Open Llama2 Model For The Lithuanian Language Nakvosas Artūras, Daniušis Povilas, Mulevičius Vytas
- Gemquad: Generating Multilingual Question Answering Datasets From Large Language Models Using Few Shot Learning Namboori Amani, Mangale Shivam, Rosenbaum Andy, Soltan Saleh
- Using Llms To Model The Beliefs And Preferences Of Targeted Populations Namikoshi Keiichi, Filipowicz Alex, Shamma David A., Iliev Rumen, Hogan Candice L., Arechiga Nikos
- Word2world: Generating Stories And Worlds Through Large Language Models Nasir Muhammad U., James Steven, Togelius Julian
- Learning Goal-conditioned Representations For Language Reward Models Nath Vaskar, Slack Dylan, Da Jeff, Ma Yuntao, Zhang Hugh, Whitehead Spencer, Hendryx Sean
- Dynamic Memory Compression: Retrofitting Llms For Accelerated Inference Nawrot Piotr, Łańcucki Adrian, Chochowski Marcin, Tarjan David, Ponti Edoardo M.
- Concise Thoughts: Impact Of Output Length On LLM Reasoning And Cost Nayab Sania, Rossolini Giulio, Buttazzo Giorgio, Manes Nicolamaria, Giacomelli Fabrizio
- Improving Robustness Of Llm-based Speech Synthesis By Learning Monotonic Alignment Neekhara Paarth, Hussain Shehzeen, Ghosh Subhankar, Li Jason, Valle Rafael, Badlani Rohan, Ginsburg Boris
- Needle In The Haystack For Memory Based Large Language Models Nelson Elliot, Kollias Georgios, Das Payel, Chaudhury Subhajit, Dan Soham
- MBBQ: A Dataset For Cross-lingual Comparison Of Stereotypes In Generative Llms Neplenbroek Vera, Bisazza Arianna, Fernández Raquel
- Towards Modeling Learner Performance With Large Language Models Neshaei Seyed Parsa, Davis Richard Lee, Hazimeh Adam, Lazarevski Bojan, Dillenbourg Pierre, Käser Tanja
- Using Grammar Masking To Ensure Syntactic Validity In Llm-based Modeling Tasks Netz Lukas, Reimer Jan, Rumpe Bernhard
- What Drives Performance In Multilingual Language Models? Nezhad Sina Bagheri, Agrawal Ameeta
- Dude: Dual Distribution-aware Context Prompt Learning For Large Vision-language Model Nguyen Duy M. H., Le An T., Nguyen Trung Q., Diep Nghiem T., Nguyen Tai, Duong-tran Duy, Peters Jan, Shen Li, Niepert Mathias, Sonntag Daniel
- Balancing Exploration And Exploitation In LLM Using Soft RLLF For Enhanced Negation Understanding Nguyen Ha-thanh, Satoh Ken
- Advancing Vietnamese Visual Question Answering With Transformer And Convolutional Integration Nguyen Ngoc Son, Nguyen Van Son, Le Tung
- Text-to-drive: Diverse Driving Behavior Synthesis Via Large Language Models Nguyen Phat, Wang Tsun-hsuan, Hong Zhang-wei, Karaman Sertac, Rus Daniela
- Better Alignment With Instruction Back-and-forth Translation Nguyen Thao, Li Jeffrey, Oh Sewoong, Schmidt Ludwig, Weston Jason, Zettlemoyer Luke, Li Xian
- An Empirical Study On Capability Of Large Language Models In Understanding Code Semantics Nguyen Thu-trang, Vu Thanh Trong, Vo Hieu Dinh, Nguyen Son
- Aligners: Decoupling Llms And Alignment Ngweta Lilian, Agarwal Mayank, Maity Subha, Gittens Alex, Sun Yuekai, Yurochkin Mikhail
- Next: Teaching Large Language Models To Reason About Code Execution Ni Ansong, Allamanis Miltiadis, Cohan Arman, Deng Yinlin, Shi Kensen, Sutton Charles, Yin Pengcheng
- Layer-wise Regularized Dropout For Neural Language Models Ni Shiwen, Yang Min, Xu Ruifeng, Li Chengming, Hu Xiping
- Xl²bench: A Benchmark For Extremely Long Context Understanding With Long-range Dependencies Ni Xuanfan, Cai Hengyi, Wei Xiaochi, Wang Shuaiqiang, Yin Dawei, Li Piji
- W-RAG: Weakly Supervised Dense Retrieval In RAG For Open-domain Question Answering Nian Jinming, Peng Zhiyuan, Wang Qifan, Fang Yi
- Panza: A Personalized Text Writing Assistant Via Data Playback And Local Fine-tuning Nicolicioiu Armand, Iofinova Eugenia, Kurtic Eldar, Nikdan Mahdi, Panferov Andrei, Markov Ilia, Shavit Nir, Alistarh Dan
- BMIKE-53: Investigating Cross-lingual Knowledge Editing With In-context Learning Nie Ercong, Shao Bo, Ding Zifeng, Wang Mingyang, Schmid Helmut, Schütze Hinrich
- Decomposed Prompting: Unveiling Multilingual Linguistic Structure Knowledge In English-centric Large Language Models Nie Ercong, Yuan Shuzhou, Ma Bolei, Schmid Helmut, Färber Michael, Kreuter Frauke, Schütze Hinrich
- Do Multilingual Large Language Models Mitigate Stereotype Bias? Nie Shangrui, Fromm Michael, Welch Charles, Görge Rebekka, Karimi Akbar, Plepi Joan, Mowmita Nazia Afsan, Flores-herr Nicolas, Ali Mehdi, Flek Lucie
- Flawn-t5: An Empirical Examination Of Effective Instruction-tuning Data Mixtures For Legal Reasoning Niklaus Joel, Zheng Lucia, Mccarthy Arya D., Hahn Christopher, Rosen Brian M., Henderson Peter, Ho Daniel E., Honke Garrett, Liang Percy, Manning Christopher
- Vikhr: The Family Of Open-source Instruction-tuned Large Language Models For Russian Nikolich Aleksandr, Korolev Konstantin, Shelmanov Artem, Kiselev Igor
- WTU-EVAL: A Whether-or-not Tool Usage Evaluation Benchmark For Large Language Models Ning Kangyun, Su Yisong, Lv Xueqiang, Zhang Yuanzhe, Liu Jian, Liu Kang, Xu Jinan
- User-llm: Efficient LLM Contextualization With User Embeddings Ning Lin, Liu Luyang, Wu Jiaxing, Wu Neo, Berlowitz Devora, Prakash Sushant, Green Bradley, O'banion Shawn, Xie Jun
- Can Llms Learn By Teaching? A Preliminary Study Ning Xuefei, Wang Zifu, Li Shiyao, Lin Zinan, Yao Peiran, Fu Tianyu, Blaschko Matthew B., Dai Guohao, Yang Huazhong, Wang Yu
- Quantified Task Misalignment To Inform PEFT: An Exploration Of Domain Generalization And Catastrophic Forgetting In CLIP Niss Laura, Vogt-lowell Kevin, Tsiligkaridis Theodoros
- Enhancing Dialogue State Tracking Models Through Llm-backed User-agents Simulation Niu Cheng, Wang Xingguang, Cheng Xuxin, Song Juntong, Zhang Tong
- From Text To Emotion: Unveiling The Emotion Annotation Capabilities Of Llms Niu Minxue, Jaiswal Mimansa, Provost Emily Mower
- PARAMANU-GANITA: Language Model With Mathematical Capabilities Niyogi Mitodru, Bhattacharya Arnab
- Language Portability Strategies For Open-domain Dialogue With Pre-trained Language Models From High To Low Resource Languages Njifenjou Ahmed, Sucal Virgile, Jabaian Bassam, Lefèvre Fabrice
- Role-play Zero-shot Prompting With Large Language Models For Open-domain Human-machine Conversation Njifenjou Ahmed, Sucal Virgile, Jabaian Bassam, Lefèvre Fabrice
- LBC: Language-based-classifier For Out-of-variable Generalization Noh Kangjun, Seong Baekryun, Byun Hoyoon, Choi Youngjun, Song Sungjin, Song Kyungwoo
- Mobileflow: A Multimodal LLM For Mobile GUI Agent Nong Songqin, Zhu Jiali, Wu Rui, Jin Jiongchao, Shan Shuo, Huang Xiutian, Xu Wenhao
- Nomic Embed: Training A Reproducible Long Context Text Embedder Nussbaum Zach, Morris John X., Duderstadt Brandon, Mulyar Andriy
- Improving Black-box Robustness With In-context Rewriting O'brien Kyle, Ng Nathan, Puri Isha, Mendez Jorge, Palangi Hamid, Kim Yoon, Ghassemi Marzyeh, Hartvigsen Thomas
- Fine-tuning With HED-IT: The Impact Of Human Post-editing For Dialogical Language Models Occhipinti Daniela, Marchi Michele, Mondella Irene, Lai Huiyuan, Dell'orletta Felice, Nissim Malvina, Guerini Marco
- Crafting Efficient Fine-tuning Strategies For Large Language Models Oliver Michael, Wang Guan
- Gpt-ology, Computational Models, Silicon Sampling: How Should We Think About Llms In Cognitive Science? Ong Desmond C.
- Routellm: Learning To Route Llms With Preference Data Ong Isaac, Almahairi Amjad, Wu Vincent, Chiang Wei-lin, Wu Tianhao, Gonzalez Joseph E., Kadous M Waleed, Stoica Ion
- Jamba: A Hybrid Transformer-mamba Language Model Opher Lieber, Barak Lenz, Hofit Bata, Gal Cohen, Jhonathan Osin, Itay Dalmedigos, Erez Safahi, Shaked Meirom, Yonatan Belinkov, Shai Shalev-shwartz, Omri Abend, Raz Alon, Tomer Asida, Amir Bergman, Roman Glozman, Michael Gokhman, Avashalom Manevich, Nir Ratner, Noam Rozen, Erez Shwartz, Mor Zusman, Yoav Shoham
- Transformers Are Multi-state Rnns Oren Matanel, Hassid Michael, Yarden Nir, Adi Yossi, Schwartz Roy
- Simplifying Translations For Children: Iterative Simplification Considering Age Of Acquisition With Llms Oshika Masashi, Morishita Makoto, Hirao Tsutomu, Sasano Ryohei, Takeda Koichi
- Towards Modular Llms By Building And Reusing A Library Of Loras Ostapenko Oleksiy, Su Zhan, Ponti Edoardo Maria, Charlin Laurent, Roux Nicolas Le, Pereira Matheus, Caccia Lucas, Sordoni Alessandro
- A Systematic Review Of Data-to-text NLG Osuji Chinonso Cynthia, Ferreira Thiago Castro, Davis Brian
- Lossless Acceleration Of Large Language Model Via Adaptive N-gram Parallel Decoding Ou Jie, Chen Yueming, Tian Wenhong
- Towards Cross-lingual Explanation Of Artwork In Large-scale Vision Language Models Ozaki Shintaro, Hayashi Kazuki, Sakai Yusuke, Kamigaito Hidetaka, Hayashi Katsuhiko, Watanabe Taro
- Multilingual Prompts In Llm-based Recommenders: Performance Across Languages Ozsoy Makbule Gulcin
- Outliers And Calibration Sets Have Diminishing Effect On Quantization Of Modern Llms Paglieri Davide, Dash Saurabh, Rocktäschel Tim, Parker-holder Jack
- Training Llms To Recognize Hedges In Spontaneous Narratives Paige Amie J., Soubki Adil, Murzaku John, Rambow Owen, Brennan Susan E.
- Investigating LLM Applications In E-commerce Palen-michel Chester, Wang Ruixiang, Zhang Yipeng, Yu David, Xu Canran, Wu Zhe
- Distilling Large Language Models For Text-attributed Graph Learning Pan Bo, Zhang Zheng, Zhang Yifei, Hu Yuntong, Zhao Liang
- Dense Training, Sparse Inference: Rethinking Training Of Mixture-of-experts Language Models Pan Bowen, Shen Yikang, Liu Haokun, Mishra Mayank, Zhang Gaoyuan, Oliva Aude, Raffel Colin, Panda Rameswar
- LISA: Layerwise Importance Sampling For Memory-efficient Large Language Model Fine-tuning Pan Rui, Liu Xiang, Diao Shizhe, Pi Renjie, Zhang Jipeng, Han Chi, Zhang Tong
- POMP: Probability-driven Meta-graph Prompter For Llms In Low-resource Unsupervised Neural Machine Translation Pan Shilong, Tian Zhiliang, Ding Liang, Huang Zhen, Wen Zhihua, Li Dongsheng
- G-DIG: Towards Gradient-based Diverse And High-quality Instruction Data Selection For Machine Translation Pan Xingyuan, Huang Luyang, Kang Liyan, Liu Zhicheng, Lu Yu, Cheng Shanbo
- Ee-tuning: An Economical Yet Scalable Solution For Tuning Early-exit Large Language Models Pan Xuchen, Chen Yanxi, Li Yaliang, Ding Bolin, Zhou Jingren
- Preparing Lessons For Progressive Training On Language Models Pan Yu, Yuan Ye, Yin Yichun, Shi Jiaxin, Xu Zenglin, Zhang Ming, Shang Lifeng, Jiang Xin, Liu Qun
- AILS-NTUA At Semeval-2024 Task 9: Cracking Brain Teasers: Transformer Models For Lateral Thinking Puzzles Panagiotopoulos Ioannis, Filandrianos Giorgos, Lymperaiou Maria, Stamou Giorgos
- Can Watermarking Large Language Models Prevent Copyrighted Text Generation And Hide Training Data? Panaitescu-liess Michael-andrei, Che Zora, An Bang, Xu Yuancheng, Pathmanathan Pankayaraj, Chakraborty Souradip, Zhu Sicheng, Goldstein Tom, Huang Furong
- Lottery Ticket Adaptation: Mitigating Destructive Interference In Llms Panda Ashwinee, Isik Berivan, Qi Xiangyu, Koyejo Sanmi, Weissman Tsachy, Mittal Prateek
- NTSEBENCH: Cognitive Reasoning Benchmark For Vision Language Models Pandya Pranshu, Talwarr Agney S, Gupta Vatsal, Kataria Tushar, Gupta Vivek, Roth Dan
- Salute The Classic: Revisiting Challenges Of Machine Translation In The Age Of Large Language Models Pang Jianhui, Ye Fanghua, Wang Longyue, Yu Dian, Wong Derek F., Shi Shuming, Tu Zhaopeng
- Attndreambooth: Towards Text-aligned Personalized Text-to-image Generation Pang Lianyu, Yin Jian, Zhao Baoquan, Wu Feize, Wang Fu Lee, Li Qing, Mao Xudong
- Phased Instruction Fine-tuning For Large Language Models Pang Wei, Zhou Chuan, Zhou Xiao-hua, Wang Xiaojie
- LLM Evaluators Recognize And Favor Their Own Generations Panickssery Arjun, Bowman Samuel R., Feng Shi
- Lost In Space: Probing Fine-grained Spatial Understanding In Vision And Language Resamplers Pantazopoulos Georgios, Suglia Alessandro, Lemon Oliver, Eshghi Arash
- Measuring Memorization In RLHF For Code Completion Pappu Aneesh, Porter Billy, Shumailov Ilia, Hayes Jamie
- Building Pre-train LLM Dataset For The INDIC Languages: A Case Study On Hindi Parida Shantipriya, Panwar Shakshi, Lata Kusum, Mishra Sanskruti, Sekhar Sambit
- Memorizing Documents With Guidance In Large Language Models Park Bumjin, Choi Jaesik
- Low-resource Cross-lingual Summarization Through Few-shot Learning With Large Language Models Park Gyutae, Hwang Seojin, Lee Hwanhee
- Offsetbias: Leveraging Debiased Data For Tuning Evaluators Park Junsoo, Jwa Seungyeon, Ren Meiying, Kim Daeyoung, Choi Sanghyuk
- Token-picker: Accelerating Attention In Text Generation With Minimized Memory Transfer Via Probability Estimation Park Junyoung, Kang Myeonggu, Han Yunki, Kim Yanggon, Shin Jaekang, Kim Lee-sup
- Textboost: Towards One-shot Personalization Of Text-to-image Models Via Fine-tuning Text Encoder Park Nahyeon, Kim Kunhee, Shim Hyunjung
- Any-precision LLM: Low-cost Deployment Of Multiple, Different-sized Llms Park Yeonhong, Hyun Jake, Cho Sanglyul, Sim Bonggeun, Lee Jae W.
- The Ultimate Guide To Fine-tuning Llms From Basics To Breakthroughs: An Exhaustive Review Of Technologies, Research, Best Practices, Applied Research Challenges And Opportunities Parthasarathy Venkatesh Balavadhani, Zafar Ahtsham, Khan Aafaq, Shahid Arsalan
- Decoding News Narratives: A Critical Analysis Of Large Language Models In Framing Detection Pastorino Valeria, Sivakumar Jasivan A., Moosavi Nafise Sadat
- Large Language Models Can Self-improve At Web Agent Tasks Patel Ajay, Hofmarcher Markus, Leoveanu-condrei Claudiu, Dinu Marius-constantin, Callison-burch Chris, Hochreiter Sepp
- Automating Code Adaptation For Mlops -- A Benchmarking Study On Llms Patel Harsh, Ramanan Buvaneswari A., Khan Manzoor A., Williams Thomas, Friedman Brian, Drabeck Lawrence
- Making Reasoning Matter: Measuring And Improving Faithfulness Of Chain-of-thought Reasoning Paul Debjit, West Robert, Bosselut Antoine, Faltings Boi
- Advprompter: Fast Adaptive Adversarial Prompting For Llms Paulus Anselm, Zharmagambetov Arman, Guo Chuan, Amos Brandon, Tian Yuandong
- Autonomous Workflow For Multimodal Fine-grained Training Assistants Towards Mixed Reality Pei Jiahuan, Viola Irene, Huang Haochen, Wang Junxiao, Ahsan Moonisa, Ye Fanghua, Yiming Jiang, Sai Yao, Wang Di, Chen Zhumin, Ren Pengjie, Cesar Pablo
- Automated Multi-language To English Machine Translation Using Generative Pre-trained Transformers Pelofske Elijah, Urias Vincent, Liebrock Lorie M.
- Securing Large Language Models: Addressing Bias, Misinformation, And Prompt Attacks Peng Benji, Chen Keyu, Li Ming, Feng Pohsun, Bi Ziqian, Liu Junyu, Niu Qian
- Answer Is All You Need: Instruction-following Text Embedding Via Answering The Question Peng Letian, Zhang Yuwei, Wang Zilong, Srinivasa Jayanth, Liu Gaowen, Wang Zihan, Shang Jingbo
- Review-llm: Harnessing Large Language Models For Personalized Review Generation Peng Qiyao, Liu Hongtao, Xu Hongyan, Yang Qing, Shao Minglai, Wang Wenjun
- Uncertainty-aware Explainable Recommendation With Large Language Models Peng Yicui, Chen Hao, Lin Chingsheng, Huang Guo, Hu Jinrong, Guo Hui, Kong Bin, Hu Shu, Wu Xi, Wang Xin
- Q-PEFT: Query-dependent Parameter Efficient Fine-tuning For Text Reranking With Large Language Models Peng Zhiyuan, Wu Xuyang, Wang Qifan, Rajanala Sravanthi, Fang Yi
- A Review Of The Challenges With Massive Web-mined Corpora Used In Large Language Models Pre-training Perełkiewicz Michał, Poświata Rafał
- Astrollama-chat: Scaling Astrollama With Conversational And Diverse Datasets Perkowski Ernest, Pan Rui, Nguyen Tuan Dung, Ting Yuan-sen, Kruk Sandor, Zhang Tong, O'neill Charlie, Jablonska Maja, Sun Zechang, Smith Michael J., Liu Huiling, Schawinski Kevin, Iyer Kartheik, Ciucă Ioana (for UniverseTBD)
- Did Translation Models Get More Robust Without Anyone Even Noticing? Peters Ben, Martins André F. T.
- Constitutionalexperts: Training A Mixture Of Principle-based Prompts Petridis Savvas, Wedin Ben, Yuan Ann, Wexler James, Thain Nithum
- Prompting A Pretrained Transformer Can Be A Universal Approximator Petrov Aleksandar, Torr Philip H. S., Bibi Adel
- Assessing The Emergent Symbolic Reasoning Abilities Of Llama Large Language Models Petruzzellis Flavio, Testolin Alberto, Sperduti Alessandro
- Benchmarking GPT-4 On Algorithmic Problems: A Systematic Evaluation Of Prompting Strategies Petruzzellis Flavio, Testolin Alberto, Sperduti Alessandro
- Optimizing Large Language Models For Openapi Code Completion Petryshyn Bohdan, Lukoševičius Mantas
- H2o-danube3 Technical Report Pfeiffer Pascal, Singer Philipp, Babakhin Yauhen, Fodor Gabor, Dhankhar Nischay, Ambati Sri Satish
- Suri: Multi-constraint Instruction Following For Long-form Text Generation Pham Chau Minh, Sun Simeng, Iyyer Mohit
- Towards Reliable Medical Question Answering: Techniques And Challenges In Mitigating Hallucinations In Language Models Pham Duy Khoa, Vo Bao Quoc
- Graph Guided Question Answer Generation For Procedural Question-answering Pham Hai X., Hadji Isma, Xu Xinnuo, Degutyte Ziedune, Rainey Jay, Kazakos Evangelos, Fazly Afsaneh, Tzimiropoulos Georgios, Martinez Brais
- Can Base Chatgpt Be Used For Forecasting Without Additional Optimization? Pham Van, Cunningham Scott
- Investigating The Effectiveness Of Hypertuning Via Gisting Phang Jason
- Peering Into The Mind Of Language Models: An Approach For Attribution In Contextual Question Answering Phukan Anirudh, Somasundaram Shwetha, Saxena Apoorv, Goswami Koustava, Srinivasan Balaji Vasan
- Coherent Zero-shot Visual Instruction Generation Phung Quynh, Ge Songwei, Huang Jia-bin
- Mllm-protector: Ensuring Mllm's Safety Without Hurting Performance Pi Renjie, Han Tianyang, Zhang Jianshu, Xie Yueqi, Pan Rui, Lian Qing, Dong Hanze, Zhang Jipeng, Zhang Tong
- Ptt5-v2: A Closer Look At Continued Pretraining Of T5 Models For The Portuguese Language Piau Marcos, Lotufo Roberto, Nogueira Rodrigo
- Mamba-ptq: Outlier Channels In Recurrent Large Language Models Pierro Alessandro, Abreu Steven
- Doccgen: Document-based Controlled Code Generation Pimparkhede Sameer, Kammakomati Mehant, Tamilselvam Srikanth, Kumar Prince, Kumar Ashok Pon, Bhattacharyya Pushpak
- Revisiting Verilogeval: Newer Llms, In-context Learning, And Specification-to-rtl Tasks Pinckney Nathaniel, Batten Christopher, Liu Mingjie, Ren Haoxing, Khailany Brucek
- How Effective Are State Space Models For Machine Translation? Pitorro Hugo, Vasylenko Pavlo, Treviso Marcos, Martins André F. T.
- LM-PUB-QUIZ: A Comprehensive Framework For Zero-shot Evaluation Of Relational Knowledge In Language Models Ploner Max, Wiland Jacek, Pohl Sebastian, Akbik Alan
- Sbora: Low-rank Adaptation With Regional Weight Updates Po Lai-man, Liu Yuyang, Wu Haoxuan, Zhang Tianqi, Yu Wing-yin, Jiang Zeyu, Li Kun
- Advanced Natural-based Interaction For The Italian Language: Llamantino-3-anita Polignano Marco, Basile Pierpaolo, Semeraro Giovanni
- Evaluation Of Language Models In The Medical Context Under Resource-constrained Settings Posada Andrea, Rueckert Daniel, Meissen Felix, Müller Philip
- Documint: Docstring Generation For Python Using Small Language Models Poudel Bibek, Cook Adam, Traore Sekou, Ameli Shelah
- Retrieval Augmented Thought Process For Private Data Handling In Healthcare Pouplin Thomas, Sun Hao, Holt Samuel, Van Der Schaar Mihaela
- Machine Translation With Large Language Models: Prompt Engineering For Persian, English, And Russian Directions Pourkamali Nooshin, Sharifi Shler Ebrahim
- Deciphering The Factors Influencing The Efficacy Of Chain-of-thought: Probability, Memorization, And Noisy Reasoning Prabhakar Akshara, Griffiths Thomas L., Mccoy R. Thomas
- Sambanova SN40L: Scaling The AI Memory Wall With Dataflow And Composition Of Experts Prabhakar Raghu, Sivaramakrishnan Ram, Gandhi Darshan, Du Yun, Wang Mingran, Song Xiangyu, Zhang Kejie, Gao Tianren, Wang Angela, Li Karen, Sheng Yongning, Brot Joshua, Sokolov Denis, Vivek Apurv, Leung Calvin, Sabnis Arjun, Bai Jiayu, Zhao Tuowen, Gottscho Mark, Jackson David, Luttrell Mark, Shah Manish K., Chen Edison, Liang Kaizhao, Jain Swayambhoo, Thakker Urmish, Huang Dawei, Jairath Sumti, Brown Kevin J., Olukotun Kunle
- Convkgyarn: Spinning Configurable And Scalable Conversational Knowledge Graph QA Datasets With Large Language Models Pradeep Ronak, Lee Daniel, Mousavi Ali, Pound Jeff, Sang Yisi, Lin Jimmy, Ilyas Ihab, Potdar Saloni, Arefiyan Mostafa, Li Yunyao
- Accelerating Large Language Model Pretraining Via LFR Pedagogy: Learn, Focus, And Review Prakriya Neha, Yen Jui-nan, Hsieh Cho-jui, Cong Jason
- The Silent Curriculum: How Does LLM Monoculture Shape Educational Content And Its Accessibility? Priyanshu Aman, Vijay Supriti
- When Quantization Affects Confidence Of Large Language Models? Proskurina Irina, Brun Luc, Metzler Guillaume, Velcin Julien
- Customized Retrieval Augmented Generation And Benchmarking For EDA Tool Documentation QA Pu Yuan, He Zhuolun, Qiu Tairu, Wu Haoyuan, Yu Bei
- Fine-tuning With Divergent Chains Of Thought Boosts Reasoning Through Self-correction In Language Models Puerto Haritz, Chubakov Tilek, Zhu Xiaodan, Madabushi Harish Tayyar, Gurevych Iryna
- A Moral Imperative: The Need For Continual Superalignment Of Large Language Models Puthumanaillam Gokul, Vora Manav, Thangeda Pranay, Ornik Melkior
- Can LLM Generate Culturally Relevant Commonsense QA Data? Case Study In Indonesian And Sundanese Putri Rifki Afina, Haznitrama Faiz Ghifari, Adhista Dea, Oh Alice
- Less Is More: Accurate Speech Recognition & Translation Without Web-scale Data Puvvada Krishna C., Żelasko Piotr, Huang He, Hrinchuk Oleksii, Koluguri Nithin Rao, Dhawan Kunal, Majumdar Somshubra, Rastorgueva Elena, Chen Zhehuai, Lavrukhin Vitaly, Balam Jagadeesh, Ginsburg Boris
- Practical Token Pruning For Foundation Models In Few-shot Conversational Virtual Assistant Systems Qi Haode, Qian Cheng, Ni Jian, Singh Pratyush, Fazeli Reza, Wang Gengyu, Shu Zhongzheng, Wayne Eric, Bross Juergen
- Is Next Token Prediction Sufficient For GPT? Exploration On Code Logic Comprehension Qi Mengnan, Huang Yufan, Yao Yongqiang, Wang Maoquan, Gu Bin, Sundaresan Neel
- Safety Alignment Should Be Made More Than Just A Few Tokens Deep Qi Xiangyu, Panda Ashwinee, Lyu Kaifeng, Ma Xiao, Roy Subhrajit, Beirami Ahmad, Mittal Prateek, Henderson Peter
- Mutual Reasoning Makes Smaller Llms Stronger Problem-solvers Qi Zhenting, Ma Mingyuan, Xu Jiahang, Zhang Li Lyna, Yang Fan, Yang Mao
- Towards Tracing Trustworthiness Dynamics: Revisiting Pre-training Period Of Large Language Models Qian Chen, Zhang Jie, Yao Wei, Liu Dongrui, Yin Zhenfei, Qiao Yu, Liu Yong, Shao Jing
- Momentor: Advancing Video Large Language Model With Fine-grained Temporal Reasoning Qian Long, Li Juncheng, Wu Yu, Ye Yaobo, Fei Hao, Chua Tat-seng, Zhuang Yueting, Tang Siliang
- Mia-bench: Towards Better Instruction Following Evaluation Of Multimodal Llms Qian Yusu, Ye Hanrong, Fauconnier Jean-philippe, Grasch Peter, Yang Yinfei, Gan Zhe
- Prompt Perturbation Consistency Learning For Robust Language Models Qiang Yao, Nandi Subhrangshu, Mehrabi Ninareh, Steeg Greg Ver, Kumar Anoop, Rumshisky Anna, Galstyan Aram
- Learning To Poison Large Language Models During Instruction Tuning Qiang Yao, Zhou Xiangyu, Zade Saleh Zare, Roshani Mohammad Amin, Zytko Douglas, Zhu Dongxiao
- Openba-v2: Reaching 77.3% High Compression Ratio With Fast Multi-stage Pruning Qiao Dan, Su Yi, Wang Pinzheng, Ye Jing, Xie Wenjing, Zhou Yuechi, Ding Yuyang, Tang Zecheng, Wang Jikai, Ji Yixin, Wang Yue, Guo Pei, Sun Zechen, Zhang Zikang, Li Juntao, Chao Pingfu, Chen Wenliang, Fu Guohong, Zhou Guodong, Zhu Qiaoming, Zhang Min
- Agent Planning With World Knowledge Model Qiao Shuofei, Fang Runnan, Zhang Ningyu, Zhu Yuqi, Chen Xiang, Deng Shumin, Jiang Yong, Xie Pengjun, Huang Fei, Chen Huajun
- Prism: A Framework For Decoupling And Assessing The Capabilities Of Vlms Qiao Yuxuan, Duan Haodong, Fang Xinyu, Yang Junming, Chen Lin, Zhang Songyang, Wang Jiaqi, Lin Dahua, Chen Kai
- Supportiveness-based Knowledge Rewriting For Retrieval-augmented Language Modeling Qiao Zile, Ye Wei, Jiang Yong, Mo Tong, Xie Pengjun, Li Weiping, Huang Fei, Zhang Shikun
- The Uniqueness Of Llama3-70b With Per-channel Quantization: An Empirical Study Qin Minghai
- Empirical Guidelines For Deploying Llms Onto Resource-constrained Edge Devices Qin Ruiyang, Liu Dancheng, Yan Zheyu, Tan Zhaoxuan, Pan Zixuan, Jia Zhenge, Jiang Meng, Abbasi Ahmed, Xiong Jinjun, Shi Yiyu
- Exploring The Nexus Of Large Language Models And Legal Systems: A Short Survey Qin Weicong, Sun Zhongxiang
- Enhancing Sequential Recommendations Through Multi-perspective Reflections And Iteration Qin Weicong, Xu Yi, Yu Weijie, Shen Chenglei, Zhang Xiao, He Ming, Fan Jianping, Xu Jun
- Lightning Attention-2: A Free Lunch For Handling Unlimited Sequence Lengths In Large Language Models Qin Zhen, Sun Weigao, Li Dong, Shen Xuyang, Sun Weixuan, Zhong Yiran
- Various Lengths, Constant Speed: Efficient Language Modeling With Lightning Attention Qin Zhen, Sun Weigao, Li Dong, Shen Xuyang, Sun Weixuan, Zhong Yiran
- LAMPO: Large Language Models As Preference Machines For Few-shot Ordinal Classification Qin Zhen, Wu Junru, Shen Jiaming, Liu Tianqi, Wang Xuanhui
- Can Large Language Models Understand Symbolic Graphics Programs? Qiu Zeju, Liu Weiyang, Feng Haiwen, Liu Zhen, Xiao Tim Z., Collins Katherine M., Tenenbaum Joshua B., Weller Adrian, Black Michael J., Schölkopf Bernhard
- Entropy-based Decoding For Retrieval-augmented Large Language Models Qiu Zexuan, Ou Zijing, Wu Bin, Li Jingjing, Liu Aiwei, King Irwin
- Layerwise Recurrent Router For Mixture-of-experts Qiu Zihan, Huang Zeyu, Cheng Shuang, Zhou Yizhi, Wang Zili, Titov Ivan, Fu Jie
- Unsupervised Distractor Generation Via Large Language Model Distilling And Counterfactual Contrastive Decoding Qu Fanyi, Sun Hao, Wu Yunfang
- Tokenrec: Learning To Tokenize ID For Llm-based Generative Recommendation Qu Haohao, Fan Wenqi, Zhao Zihuai, Li Qing
- Mupt: A Generative Symbolic Music Pretrained Transformer Qu Xingwei, Bai Yuelin, Ma Yinghao, Zhou Ziya, Lo Ka Man, Liu Jiaheng, Yuan Ruibin, Min Lejun, Liu Xueling, Zhang Tianyu, Du Xinrun, Guo Shuyue, Liang Yiming, Li Yizhi, Wu Shangda, Zhou Junting, Zheng Tianyu, Ma Ziyang, Han Fengze, Xue Wei, Xia Gus, Benetos Emmanouil, Yue Xiang, Lin Chenghua, Tan Xu, Huang Stephen W., Fu Jie, Zhang Ge
- DEEP-ICL: Definition-enriched Experts For Language Model In-context Learning Qu Xingwei, Liang Yiming, Wang Yucheng, Zheng Tianyu, Yue Tommy, Ma Lei, Huang Stephen W., Zhang Jiajun, Shi Yinan, Lin Chenghua, Fu Jie, Zhang Ge
- The Frontier Of Data Erasure: Machine Unlearning For Large Language Models Qu Youyang, Ding Ming, Sun Nan, Thilakarathna Kanchana, Zhu Tianqing, Niyato Dusit
- Coursegpt-zh: An Educational Large Language Model Based On Knowledge Distillation Incorporating Prompt Optimization Qu Zheyan, Yin Lu, Yu Zitong, Wang Wenbo, Zhang Xing
- Automatically Generating Numerous Context-driven SFT Data For Llms Across Diverse Granularity Quan Shanghaoran
- Simultaneous Masking, Not Prompting Optimization: A Paradigm Shift In Fine-tuning Llms For Simultaneous Translation Raffel Matthew, Agostinelli Victor, Chen Lizhong
- Medit: Multilingual Text Editing Via Instruction Tuning Raheja Vipul, Alikaniotis Dimitris, Kulkarni Vivek, Alhafni Bashar, Kumar Dhruv
- V-zen: Efficient GUI Understanding And Precise Grounding With A Novel Multimodal LLM Rahman Abdur, Chawla Rajat, Kumar Muskaan, Datta Arkajit, Jha Adarsh, Ns Mukunda, Bhola Ishaan
- GSR-BENCH: A Benchmark For Grounded Spatial Reasoning Evaluation Via Multimodal Llms Rajabi Navid, Kosecka Jana
- Qdylora: Quantized Dynamic Low-rank Adaptation For Efficient Large Language Model Tuning Rajabzadeh Hossein, Valipour Mojtaba, Zhu Tianshu, Tahaei Marzieh, Kwon Hyock Ju, Ghodsi Ali, Chen Boxing, Rezagholizadeh Mehdi
- Constructing Domain-specific Evaluation Sets For Llm-as-a-judge Raju Ravi, Jain Swayambhoo, Li Bo, Li Jonathan, Thakker Urmish
- Scavenging Hyena: Distilling Transformers Into Long Convolution Models Ralambomihanta Tokiniaina Raharison, Mohammadzadeh Shahrad, Islam Mohammad Sami Nur, Jabbour Wassim, Liang Laurence
- ANCHOR: Llm-driven News Subject Conditioning For Text-to-image Synthesis Ramakrishnan Aashish Anantha, Huang Sharon X., Lee Dongwon
- Self-refinement Of Language Models From External Proxy Metrics Feedback Ramji Keshav, Lee Young-suk, Astudillo Ramón Fernandez, Sultan Md Arafat, Naseem Tahira, Munawar Asim, Florian Radu, Roukos Salim
- WARP: On The Benefits Of Weight Averaged Rewarded Policies Ramé Alexandre, Ferret Johan, Vieillard Nino, Dadashi Robert, Hussenot Léonard, Cedoz Pierre-louis, Sessa Pier Giuseppe, Girgin Sertan, Douillard Arthur, Bachem Olivier
- Self-refine Instruction-tuning For Aligning Reasoning In Language Models Ranaldi Leonardo, Freitas Andrè
- Learning To Localize Objects Improves Spatial Reasoning In Visual-llms Ranasinghe Kanchana, Shukla Satya Narayan, Poursaeed Omid, Ryoo Michael S., Lin Tsung-yu
- A Fine-tuning Enhanced RAG System With Quantized Influence Measure As AI Judge Rangan Keshav, Yin Yiqiao
- SPARQL Generation: An Analysis On Fine-tuning Openllama For Question Answering Over A Life Science Knowledge Graph Rangel Julio C., De Farias Tarcisio Mendes, Sima Ana Claudia, Kobayashi Norio
- RAVEN: Multitask Retrieval Augmented Vision-language Learning Rao Varun Nagaraj, Choudhary Siddharth, Deshpande Aditya, Satzoda Ravi Kumar, Appalaraju Srikar
- A Critical Look At Tokenwise Reward-guided Text Generation Rashid Ahmad, Wu Ruotian, Grosse Julia, Kristiadi Agustinus, Poupart Pascal
- SSP: Self-supervised Prompting For Cross-lingual Transfer To Low-resource Languages Using Large Language Models Rathore Vipul, Deb Aniruddha, Chandresh Ankish, Singla Parag, Mausam
- Developing Safe And Responsible Large Language Model: Can We Balance Bias Reduction And Language Understanding In Large Language Models? Raza Shaina, Bamgbose Oluwanifemi, Ghuge Shardul, Tavakol Fatemeh, Reji Deepak John, Bashir Syed Raza
- FAIR Enough: How Can We Develop And Assess A Fair-compliant Dataset For Large Language Models' Training? Raza Shaina, Ghuge Shardul, Ding Chen, Dolatabadi Elham, Pandya Deval
- Beads: Bias Evaluation Across Domains Raza Shaina, Rahman Mizanur, Zhang Michael R.
- MBIAS: Mitigating Bias In Large Language Models While Retaining Context Raza Shaina, Raval Ananya, Chatrath Veronica
- Analyzing And Adapting Large Language Models For Few-shot Multilingual NLU: Are We There Yet? Razumovskaia Evgeniia, Vulić Ivan, Korhonen Anna
- FIRST: Faster Improved Listwise Reranking With Single Token Decoding Reddy Revanth Gangi, Doo Jaehyeok, Xu Yifei, Sultan Md Arafat, Swain Deevya, Sil Avirup, Ji Heng
- Retrieval-augmented Generation: Is Dense Passage Retrieval Retrieving? Reichman Benjamin, Heck Larry
- Unveiling And Mitigating Memorization In Text-to-image Diffusion Models Through Cross Attention Ren Jie, Li Yaxin, Zeng Shenglai, Xu Han, Lyu Lingjuan, Xing Yue, Tang Jiliang
- Samba: Simple Hybrid State Space Models For Efficient Unlimited Context Language Modeling Ren Liliang, Liu Yang, Lu Yadong, Shen Yelong, Liang Chen, Chen Weizhu
- Melora: Mini-ensemble Low-rank Adapters For Parameter-efficient Fine-tuning Ren Pengjie, Shi Chengshun, Wu Shiguang, Zhang Mengqi, Ren Zhaochun, De Rijke Maarten, Chen Zhumin, Pei Jiahuan
- Codeattack: Revealing Safety Generalization Challenges Of Large Language Models Via Code Completion Ren Qibing, Gao Chang, Shao Jing, Yan Junchi, Tan Xin, Lam Wai, Ma Lizhuang
- Analyzing And Reducing Catastrophic Forgetting In Parameter Efficient Tuning Ren Weijieying, Li Xinlong, Wang Lei, Zhao Tianxiang, Qin Wei
- I Learn Better If You Speak My Language: Understanding The Superior Performance Of Fine-tuning Large Language Models With Llm-generated Responses Ren Xuan, Wu Biao, Liu Lingqiao
- Do Large Language Models Mirror Cognitive Language Processing? Ren Yuqi, Jin Renren, Zhang Tongxuan, Xiong Deyi
- Healthcare Copilot: Eliciting The Power Of General Llms For Medical Consultation Ren Zhiyao, Zhan Yibing, Yu Baosheng, Ding Liang, Tao Dacheng
- Strengthening Multimodal Large Language Model With Bootstrapped Preference Optimization Pi Renjie, Han Tianyang, Xiong Wei, Zhang Jipeng, Liu Runtao, Pan Rui, Zhang Tong
- Are Electra's Sentence Embeddings Beyond Repair? The Case Of Semantic Textual Similarity Rep Ivan, Dukić David, Šnajder Jan
- Self-generated Replay Memories For Continual Neural Machine Translation Resta Michele, Bacciu Davide
- How Multilingual Are Large Language Models Fine-tuned For Translation? Richburg Aquia, Carpuat Marine
- Quantifying Contamination In Evaluating Code Generation Capabilities Of Language Models Riddell Martin, Ni Ansong, Cohan Arman
- Empirical Study Of Symmetrical Reasoning In Conversational Chatbots Rim Daniela N., Choi Heeyoul
- Instruction-tuned Large Language Models For Machine Translation In The Medical Domain Rios Miguel
- Simulating Weighted Automata Over Sequences And Trees With Transformers Rizvi Michael, Lizaire Maude, Lacroce Clara, Rabusseau Guillaume
- Large Language Model Recall Uncertainty Is Modulated By The Fan Effect Roberts Jesse, Moore Kyle, Pham Thao, Ewaleifoh Oseremhen, Fisher Doug
- Pretrained Hybrids With MAD Skills Roberts Nicholas, Guo Samuel, Gao Zhiqi, Gnvv Satya Sai Srinath Namburi, Cromp Sonia, Wu Chengjun, Duan Chengyu, Sala Frederic
- Exploring Advanced Large Language Models With Llmsuite Roffo Giorgio
- Zero-shot Spam Email Classification Using Pre-trained Large Language Models Rojas-galeano Sergio
- Achieving Peak Performance For Large Language Models: A Systematic Review Rostam Zhyar Rzgar K, Szénási Sándor, Kertész Gábor
- CONFLARE: Conformal Large Language Model Retrieval Rouzrokh Pouria, Faghani Shahriar, Gamble Cooper U., Shariatnia Moein, Erickson Bradley J.
- Enhancing Low-resource NMT With A Multilingual Encoder And Knowledge Distillation: A Case Study Roy Aniruddha, Ray Pretam, Maheshwari Ayush, Sarkar Sudeshna, Goyal Pawan
- Convolutional Prompting Meets Language Models For Continual Learning Roy Anurag, Moulick Riddhiman, Verma Vinay K., Ghosh Saptarshi, Das Abir
- Fist-financial Style Transfer With Hallucination And Creativity Control Framework Roychowdhury Sohini, Krema Marko, Moore Brian, Lai Xingjian, Effedua Dike, Jethwani Bharat
- Evaluation Of RAG Metrics For Question Answering In The Telecom Domain Roychowdhury Sujoy, Soman Sumit, Ranjani H G, Gunda Neeraj, Chhabra Vansh, Bala Sai Krishna
- Efficient Language Adaptive Pre-training: Extending State-of-the-art Large Language Models For Polish Ruciński Szymon
- From PEFT To DEFT: Parameter Efficient Finetuning For Reducing Activation Density In Transformers Runwal Bharat, Pedapati Tejaswini, Chen Pin-yu
- Explorations Of Self-repair In Language Models Rushing Cody, Nanda Neel
- Writing In The Margins: Better Inference Pattern For Long Context Retrieval Russak Melisa, Jamil Umar, Bryant Christopher, Kamble Kiran, Magnuson Axel, Russak Mateusz, Alshikh Waseem
- Code-aware Prompting: A Study Of Coverage Guided Test Generation In Regression Setting Using LLM Ryan Gabriel, Jain Siddhartha, Shang Mingyue, Wang Shiqi, Ma Xiaofei, Ramanathan Murali Krishna, Ray Baishakhi
- Tandem Transformers For Inference Efficient Llms S Aishwarya P, Nair Pranav Ajit, Samaga Yashas, Boyd Toby, Kumar Sanjiv, Jain Prateek, Netrapalli Praneeth
- In-context Learning Demonstration Selection Via Influence Analysis S. Vinay M., Van Minh-hao, Wu Xintao
- Benchmarking And Building Long-context Retrieval Models With Loco And M2-BERT Saad-falcon Jon, Fu Daniel Y., Arora Simran, Guha Neel, Ré Christopher
- Utebc-nlp At Semeval-2024 Task 9: Can Llms Be Lateral Thinkers? Sadeghi Pouya, Abaskohi Amirhossein, Yaghoobzadeh Yadollah
- Insights Into Alignment: Evaluating DPO And Its Variants Across Multiple Tasks Saeidi Amir, Verma Shivanshu, Baral Chitta
- On Zero-shot Counterspeech Generation By Llms Saha Punyajoy, Agrawal Aalok, Jana Abhik, Biemann Chris, Mukherjee Animesh
- Where Is The Answer? Investigating Positional Bias In Language Model Knowledge Extraction Saito Kuniaki, Sohn Kihyuk, Lee Chen-yu, Ushiku Yoshitaka
- Algorithmic Language Models With Neurally Compiled Libraries Saldyt Lucas, Kambhampati Subbarao
- Synergizing In-context Learning With Hints For End-to-end Task-oriented Dialog Systems Saley Vishal Vivek, Das Rocktim Jyoti, Raghu Dinesh, Mausam
- Berts Are Generative In-context Learners Samuel David
- Rainbow Teaming: Open-ended Generation Of Diverse Adversarial Prompts Samvelyan Mikayel, Raparthy Sharath Chandra, Lupu Andrei, Hambro Eric, Markosyan Aram H., Bhatt Manish, Mao Yuning, Jiang Minqi, Parker-holder Jack, Foerster Jakob, Rocktäschel Tim, Raileanu Roberta
- The Future Of Large Language Model Pre-training Is Federated Sani Lorenzo, Iacob Alex, Cao Zeyu, Marino Bill, Gao Yan, Paulik Tomas, Zhao Wanru, Shen William F., Aleksandrov Preslav, Qiu Xinchi, Lane Nicholas D.
- Advancing Generative AI For Portuguese With Open Decoder Gervásio PT* Santos Rodrigo, Silva João, Gomes Luís, Rodrigues João, Branco António
- Are Machines Better At Complex Reasoning? Unveiling Human-machine Inference Gaps In Entailment Verification Sanyal Soumya, Xiao Tianyi, Liu Jiacheng, Wang Wenya, Ren Xiang
- Pre-training Small Base Lms With Fewer Tokens Sanyal Sunny, Sanghavi Sujay, Dimakis Alexandros G.
- HELPER-X: A Unified Instructable Embodied Agent To Tackle Four Interactive Vision-language Domains With Memory-augmented Language Models Sarch Gabriel, Somani Sahil, Kapoor Raghav, Tarr Michael J., Fragkiadaki Katerina
- Llms As On-demand Customizable Service Sarkar Souvika, Babar Mohammad Fakhruddin, Hasan Monowar, Karmaker Shubhra Kanti Santu
- Improving Self Consistency In Llms Through Probabilistic Tokenization Sathe Ashutosh, Aggarwal Divyanshu, Sitaram Sunayana
- Improving Sentence Embeddings With Automatic Generation Of Training Data Using Few-shot Examples Sato Soma, Tsukagoshi Hayato, Sasano Ryohei, Takeda Koichi
- Robust Few-shot Transfer Learning For Knowledge Base Question Answering With Unanswerable Questions Sawhney Riya, Bhattacharya Indrajit, Mausam
- Can Small Language Models Learn, Unlearn, And Retain Noise Patterns? Scaria Nicy, Kennedy Silvester John Joseph, Subramani Deepak
- Improving The Validity Of Automatically Generated Feedback Via Reinforcement Learning Scarlatos Alexander, Smith Digory, Woodhead Simon, Lan Andrew
- The Human Factor In Detecting Errors Of Large Language Models: A Systematic Literature Review And Future Research Directions Schiller Christian A.
- Towards Faithful And Robust LLM Specialists For Evidence-based Question-answering Schimanski Tobias, Ni Jingwei, Kraus Mathias, Ash Elliott, Leippold Markus
- Self-distillation For Model Stacking Unlocks Cross-lingual NLU In 200+ Languages Schmidt Fabian David, Borchert Philipp, Vulić Ivan, Glavaš Goran
- Metacognitive Myopia In Large Language Models Scholten Florian, Rebholz Tobias R., Hütter Mandy
- Context Matters: An Empirical Study Of The Impact Of Contextual Information In Temporal Question Answering Systems Schumacher Dan, Haji Fatemeh, Grey Tara, Bandlamudi Niharika, Karnik Nupoor, Kumar Gagana Uday, Chiang Jason Cho-yu, Rad Paul, Vishwamitra Nishant, Rios Anthony
- Large Language Models To Generate System-level Test Programs Targeting Non-functional Properties Schwachhofer Denis, Domanski Peter, Becker Steffen, Wagner Stefan, Sauer Matthias, Pflüger Dirk, Polian Ilia
- Rethinking LLM Memorization Through The Lens Of Adversarial Compression Schwarzschild Avi, Feng Zhili, Maini Pratyush, Lipton Zachary C., Kolter J. Zico
- Soft Prompt Threats: Attacking Safety Alignment And Unlearning In Open-source Llms Through The Embedding Space Schwinn Leo, Dobre David, Xhonneux Sophie, Gidel Gauthier, Gunnemann Stephan
- Behavioral Testing: Can Large Language Models Implicitly Resolve Ambiguous Entities? Sedova Anastasiia, Litschko Robert, Frassinelli Diego, Roth Benjamin, Plank Barbara
- Generative Explore-exploit: Training-free Optimization Of Generative Recommender Systems Using LLM Optimizers Senel Lütfi Kerem, Fetahu Besnik, Yoshida Davis, Chen Zhiyu, Castellucci Giuseppe, Vedula Nikhita, Choi Jason, Malmasi Shervin
- Train-attention: Meta-learning Where To Focus In Continual Knowledge Learning Seo Yeongbin, Lee Dongha, Yeo Jinyoung
- Multi-modal Adapter For Vision-language Models Seputis Dominykas, Mihailov Serghei, Chatterjee Soham, Xiao Zehao
- Surprising Efficacy Of Fine-tuned Transformers For Fact-checking Over Larger Language Models Setty Vinay
- Medpromptx: Grounded Multimodal Prompting For Chest X-ray Diagnosis Shaaban Mai A., Khan Adnan, Yaqub Mohammad
- Ai-assisted Generation Of Difficult Math Questions Shah Vedant, Yu Dingli, Lyu Kaifeng, Park Simon, Ke Nan Rosemary, Mozer Michael, Bengio Yoshua, Arora Sanjeev, Goyal Anirudh
- Multilingual Instruction Tuning With Just A Pinch Of Multilinguality Shaham Uri, Herzig Jonathan, Aharoni Roee, Szpektor Idan, Tsarfaty Reut, Eyal Matan
- Detection And Measurement Of Syntactic Templates In Generated Text Shaib Chantal, Elazar Yanai, Li Junyi Jessy, Wallace Byron C.
- Show, Don't Tell: Aligning Language Models With Demonstrated Feedback Shaikh Omar, Lam Michelle, Hejna Joey, Shao Yijia, Bernstein Michael, Yang Diyi
- An End-to-end Speech Summarization Using Large Language Model Shang Hengchao, Li Zongyao, Guo Jiaxin, Li Shaojun, Rao Zhiqiang, Luo Yuanchang, Wei Daimeng, Yang Hao
- Synergy-of-thoughts: Eliciting Efficient Reasoning In Hybrid Language Models Shang Yu, Li Yu, Xu Fengli, Li Yong
- Patch-level Training For Large Language Models Shao Chenze, Meng Fandong, Zhou Jie
- Extensible Embedding: A Flexible Multipler For Llm's Context Length Shao Ninglu, Xiao Shitao, Liu Zheng, Zhang Peitian
- Flexibly Scaling Large Language Models Contexts Through Extensible Tokenization Shao Ninglu, Xiao Shitao, Liu Zheng, Zhang Peitian
- Can Llms Replace Economic Choice Prediction Labs? The Case Of Language-based Persuasion Games Shapira Eilam, Madmon Omer, Reichart Roi, Tennenholtz Moshe
- Guiding In-context Learning Of Llms Through Quality Estimation For Machine Translation Sharami Javad Pourmostafa Roshan, Shterionov Dimitar, Spronck Pieter
- Combining Multiple Post-training Techniques To Achieve Most Efficient Quantized Llms Sharify Sayeh, Xu Zifei, Yazar Wanzin, Wang Xin
- Synth\(^2\): Boosting Visual-language Models With Synthetic Captions And Image Embeddings Sharifzadeh Sahand, Kaplanis Christos, Pathak Shreya, Kumaran Dharshan, Ilic Anastasija, Mitrovic Jovana, Blundell Charles, Banino Andrea
- A Critical Evaluation Of AI Feedback For Aligning Large Language Models Sharma Archit, Keh Sedrick, Mitchell Eric, Finn Chelsea, Arora Kushal, Kollar Thomas
- Patched MOA: Optimizing Inference For Diverse Software Development Tasks Sharma Asankhaya
- A Preliminary Study On Using Large Language Models In Software Pentesting Shashwat Kumar, Hahn Francis, Ou Xinming, Goldgof Dmitry, Hall Lawrence, Ligatti Jay, Rajgopalan S. Raj, Tabari Armin Ziaie
- MAPO: Advancing Multilingual Reasoning Through Multilingual Alignment-as-preference Optimization She Shuaijie, Zou Wei, Huang Shujian, Zhu Wenhao, Liu Xiang, Geng Xiang, Chen Jiajun
- Accurate And Efficient Fine-tuning Of Quantized Large Language Models Through Optimal Balance Shen Ao, Wang Qiang, Lai Zhiquan, Li Xionglve, Li Dongsheng
- Nemo-aligner: Scalable Toolkit For Efficient Model Alignment Shen Gerald, Wang Zhilin, Delalleau Olivier, Zeng Jiaqi, Dong Yi, Egert Daniel, Sun Shengyang, Zhang Jimmy, Jain Sahil, Taghibakhshi Ali, Ausin Markel Sanz, Aithal Ashwath, Kuchaiev Oleksii
- Boosting Reward Model With Preference-conditional Multi-aspect Synthetic Data Generation Shen Jiaming, Xu Ran, Jun Yennie, Qin Zhen, Liu Tianqi, Yang Carl, Liang Yi, Baumgartner Simon, Bendersky Michael
- Tag-llm: Repurposing General-purpose Llms For Specialized Domains Shen Junhong, Tenenholtz Neil, Hall James Brian, Alvarez-melis David, Fusi Nicolo
- The Language Barrier: Dissecting Safety Challenges Of Llms In Multilingual Contexts Shen Lingfeng, Tan Weiting, Chen Sihao, Chen Yunmo, Zhang Jingyu, Xu Haoran, Zheng Boyuan, Koehn Philipp, Khashabi Daniel
- Pyramid Coder: Hierarchical Code Generator For Compositional Visual Question Answering Shen Ruoyue, Inoue Nakamasa, Shinoda Koichi
- Learning To Decode Collaboratively With Multiple Language Models Shen Shannon Zejiang, Lang Hunter, Wang Bailin, Kim Yoon, Sontag David
- Small Llms Are Weak Tool Learners: A Multi-llm Agent Shen Weizhou, Li Chenliang, Chen Hongzhan, Yan Ming, Quan Xiaojun, Chen Hehong, Zhang Ji, Huang Fei
- An Empirical Study On Context Length For Open-domain Dialog Generation Shen Xinyi, Lin Zuoquan
- Jetmoe: Reaching Llama2 Performance With 0.1M Dollars Shen Yikang, Guo Zhen, Cai Tianle, Qin Zengyi
- A Fine-tuning Dataset And Benchmark For Large Language Models For Protein Understanding Shen Yiqing, Chen Zan, Mamalakis Michail, He Luhan, Xia Haiyang, Li Tianbin, Su Yanzhou, He Junjun, Wang Yu Guang
- Toursynbio: A Multi-modal Large Model And Agent Framework To Bridge Text And Protein Sequences For Protein Engineering Shen Yiqing, Chen Zan, Mamalakis Michail, Liu Yungeng, Li Tianbin, Su Yanzhou, He Junjun, Liò Pietro, Wang Yu Guang
- Proctag: Process Tagging For Assessing The Efficacy Of Document Instruction Data Shen Yufan, Luo Chuwei, Zhu Zhaoqing, Chen Yang, Zheng Qi, Yu Zhi, Bu Jiajun, Yao Cong
- Repeval: Effective Text Evaluation With LLM Representation Sheng Shuqian, Xu Yi, Zhang Tianhang, Shen Zanwei, Fu Luoyi, Ding Jiaxin, Zhou Lei, Wang Xinbing, Zhou Chenghu
- Using Chatgpt To Score Essays And Short-form Constructed Responses Shermis Mark D.
- Latent Adversarial Training Improves Robustness To Persistent Harmful Behaviors In Llms Sheshadri Abhay, Ewart Aidan, Guo Phillip, Lynch Aengus, Wu Cindy, Hebbar Vivek, Sleight Henry, Stickland Asa Cooper, Perez Ethan, Hadfield-menell Dylan, Casper Stephen
- Neurosymbolic AI For Enhancing Instructability In Generative AI Sheth Amit, Pallagani Vishal, Roy Kaushik
- Continual Learning Of Large Language Models: A Comprehensive Survey Shi Haizhou, Xu Zihao, Wang Hengyi, Qin Weiyi, Wang Wenyuan, Wang Yibin, Wang Zifeng, Ebrahimi Sayna, Wang Hao
- Paying More Attention To Image: A Training-free Method For Alleviating Hallucination In Lvlms Shi Liu, Kecheng Zheng, Wei Chen
- Keep The Cost Down: A Review On Methods To Optimize LLM's Kv-cache Consumption Shi Luohe, Zhang Hongyi, Yao Yao, Li Zuchao, Zhao Hai
- Action Controlled Paraphrasing Shi Ning, Wu Zijun
- Reslora: Identity Residual Mapping In Low-rank Adaption Shi Shuhua, Huang Shaohan, Song Minghui, Li Zhoujun, Zhang Zihan, Huang Haizhen, Wei Furu, Deng Weiwei, Sun Feng, Zhang Qi
- Profuser: Progressive Fusion Of Large Language Models Shi Tianyuan, Wan Fanqi, Huang Canbin, Quan Xiaojun, Li Chenliang, Yan Ming, Zhang Ji
- Math-llava: Bootstrapping Mathematical Reasoning For Multimodal Large Language Models Shi Wenhao, Hu Zhiqiang, Bin Yi, Liu Junhua, Yang Yang, Ng See-kiong, Bing Lidong, Lee Roy Ka-wei
- Medadapter: Efficient Test-time Adaptation Of Large Language Models Towards Medical Reasoning Shi Wenqi, Xu Ran, Zhuang Yuchen, Yu Yue, Wu Hang, Yang Carl, Wang May D.
- Large Language Models Are Learnable Planners For Long-term Recommendation Shi Wentao, He Xiangnan, Zhang Yang, Gao Chongming, Li Xinyue, Zhang Jizhi, Wang Qifan, Feng Fuli
- Agentre: An Agent-based Framework For Navigating Complex Information Landscapes In Relation Extraction Shi Yuchen, Jiang Guochao, Qiu Tian, Yang Deqing
- Adapting Llms To Hebrew: Unveiling Dictalm 2.0 With Enhanced Vocabulary And Instruction Capabilities Shmidman Shaltiel, Shmidman Avi, Cohen Amir Dn, Koppel Moshe
- FIRST: Teach A Reliable Large Language Model Through Efficient Trustworthy Distillation Shum Kashun, Xu Minrui, Zhang Jianshu, Chen Zixin, Diao Shizhe, Dong Hanze, Zhang Jipeng, Raza Muhammad Omer
- An Active Inference Strategy For Prompting Reliable Responses From Large Language Models In Medical Practice Shusterman Roma, Waters Allison C., O'neill Shannon, Luu Phan, Tucker Don M.
- Iclguard: Controlling In-context Learning Behavior For Applicability Authorization Si Wai Man, Backes Michael, Zhang Yang
- Where Does In-context Translation Happen In Large Language Models Sia Suzanna, Mueller David, Duh Kevin
- Scaling Synthetic Logical Reasoning Datasets With Context-sensitive Declarative Grammars Sileo Damien
- H2o-danube-1.8b Technical Report Singer Philipp, Pfeiffer Pascal, Babakhin Yauhen, Jeblick Maximilian, Dhankhar Nischay, Fodor Gabor, Ambati Sri Satish
- Aya Dataset: An Open-access Collection For Multilingual Instruction Tuning Singh Shivalika, Vargus Freddie, Dsouza Daniel, Karlsson Börje F., Mahendiran Abinaya, Ko Wei-yin, Shandilya Herumb, Patel Jay, Mataciunas Deividas, Omahony Laura, Zhang Mike, Hettiarachchi Ramith, Wilson Joseph, Machado Marina, Moura Luisa Souza, Krzemiński Dominik, Fadaei Hakimeh, Ergün Irem, Okoh Ifeoma, Alaagib Aisha, Mudannayake Oshan, Alyafeai Zaid, Chien Vu Minh, Ruder Sebastian, Guthikonda Surya, Alghamdi Emad A., Gehrmann Sebastian, Muennighoff Niklas, Bartolo Max, Kreutzer Julia, Üstün Ahmet, Fadaee Marzieh, Hooker Sara
- Evaluating Zero-shot GPT-4V Performance On 3D Visual Question Answering Benchmarks Singh Simranjit, Pavlakos Georgios, Stamoulis Dimitrios
- Llava Finds Free Lunch: Teaching Human Behavior Improves Content Understanding Abilities Of Llms Singh Somesh, S Harini I, Singla Yaman K, Baths Veeky, Shah Rajiv Ratn, Chen Changyou, Krishnamurthy Balaji
- An Empirical Study Of Validating Synthetic Data For Formula Generation Singh Usneek, Cambronero José, Gulwani Sumit, Kanade Aditya, Khatry Anirudh, Le Vu, Singh Mukul, Verbruggen Gust
- A Three-pronged Approach To Cross-lingual Adaptation With Multilingual Llms Singh Vaibhav, Krishna Amrith, Nj Karthika, Ramakrishnan Ganesh
- Maml-en-llm: Model Agnostic Meta-training Of Llms For Improved In-context Learning Sinha Sanchit, Yue Yuguang, Soto Victor, Kulkarni Mayank, Lu Jianhua, Zhang Aidong
- Unipa-gpt: Large Language Models For University-oriented QA In Italian Siragusa Irene, Pirrone Roberto
- Was It Slander? Towards Exact Inversion Of Generative Language Models Skapars Adrians, Manino Edoardo, Sun Youcheng, Cordeiro Lucas C.
- The Mysterious Case Of Neuron 1512: Injectable Realignment Architectures Reveal Internal Characteristics Of Meta's Llama 2 Model Smith Brenden, Baker Dallin, Chase Clayton, Barney Myles, Parker Kaden, Allred Makenna, Hu Peter, Evans Alex, Fulda Nancy
- Scaling LLM Test-time Compute Optimally Can Be More Effective Than Scaling Model Parameters Snell Charlie, Lee Jaehoon, Xu Kelvin, Kumar Aviral
- Aligning Large Language Models For Enhancing Psychiatric Interviews Through Symptom Delineation And Summarization So Jae-hee, Chang Joonhwan, Kim Eunji, Na Junho, Choi Jiyeon, Sohn Jy-yong, Kim Byung-hoon, Chu Sang Hui
- Chatting Up Attachment: Using Llms To Predict Adult Bonds Soares Paulo, Mccurdy Sean, Gerber Andrew J., Fonagy Peter
- Gistembed: Guided In-sample Selection Of Training Negatives For Text Embedding Fine-tuning Solatorio Aivin V.
- A Statistical Framework For Weak-to-strong Generalization Somerstep Seamus, Polo Felipe Maia, Banerjee Moulinath, Ritov Ya'acov, Yurochkin Mikhail, Sun Yuekai
- A New Pipeline For Generating Instruction Dataset Via RAG And Self Fine-tuning Song Chih-wei, Lee Yu-kai, Tsai Yin-te
- Hyacinth6b: A Large Language Model For Traditional Chinese Song Chih-wei, Tsai Yin-te
- Code Needs Comments: Enhancing Code Llms With Comment Augmentation Song Demin, Guo Honglin, Zhou Yunhua, Xing Shuhao, Wang Yudong, Song Zifan, Zhang Wenwei, Guo Qipeng, Yan Hang, Qiu Xipeng, Lin Dahua
- Scaling Data Diversity For Fine-tuning Language Models In Human Alignment Song Feifan, Yu Bowen, Lang Hao, Yu Haiyang, Huang Fei, Wang Houfeng, Li Yongbin
- Increasing Model Capacity For Free: A Simple Strategy For Parameter Efficient Fine-tuning Song Haobo, Zhao Hao, Majumder Soumajit, Lin Tao
- Out-of-distribution Generalization Via Composition: A Lens Through Induction Heads In Transformers Song Jiajun, Xu Zhuoyan, Zhong Yiqiao
- Moma: Multimodal LLM Adapter For Fast Personalized Image Generation Song Kunpeng, Zhu Yizhe, Liu Bingchen, Yan Qing, Elgammal Ahmed, Yang Xiao
- Hierarchical Context Merging: Better Long Context Understanding For Pre-trained Llms Song Woomin, Oh Seunghyuk, Mo Sangwoo, Kim Jaehyung, Yun Sukmin, Ha Jung-woo, Shin Jinwoo
- Sharelora: Parameter Efficient And Robust Large Language Model Fine-tuning Via Shared Low-rank Adaptation Song Yurun, Zhao Junchen, Harris Ian G., Jyothi Sangeetha Abdu
- Alchemistcoder: Harmonizing And Eliciting Code Capability By Hindsight Tuning On Multi-source Data Song Zifan, Wang Yudong, Zhang Wenwei, Liu Kuikun, Lyu Chengqi, Song Demin, Guo Qipeng, Yan Hang, Lin Dahua, Chen Kai, Zhao Cairong
- Regressive Side Effects Of Training Language Models To Mimic Student Misconceptions Sonkar Shashank, Liu Naiming, Baraniuk Richard G.
- Fine Tuning Vs. Retrieval Augmented Generation For Less Popular Knowledge Soudani Heydar, Kanoulas Evangelos, Hasibi Faegheh
- Understanding Memorisation In Llms: Dynamics, Influencing Factors, And Implications Speicher Till, Khan Mohammad Aflah, Wu Qinyuan, Nanda Vedant, Das Soumi, Ghosh Bishwamittra, Gummadi Krishna P., Terzi Evimaria
- Generaitor: Tree-in-the-loop Text Generation For Language Model Explainability And Adaptation Spinner Thilo, Kehlbeck Rebecca, Sevastjanova Rita, Stähle Tobias, Keim Daniel A., Deussen Oliver, El-assady Mennatallah
- Repetition Improves Language Model Embeddings Springer Jacob Mitchell, Kotha Suhas, Fried Daniel, Neubig Graham, Raghunathan Aditi
- PUB: A Pragmatics Understanding Benchmark For Assessing Llms' Pragmatics Capabilities Sravanthi Settaluri Lakshmi, Doshi Meet, Kalyan Tankala Pavan, Murthy Rudra, Bhattacharyya Pushpak, Dabre Raj
- Canttalkaboutthis: Aligning Language Models To Stay On Topic In Dialogues Sreedhar Makesh Narsimhan, Rebedea Traian, Ghosh Shaona, Zeng Jiaqi, Parisien Christopher
- Probing Multimodal Llms As World Models For Driving Sreeram Shiva, Wang Tsun-hsuan, Maalouf Alaa, Rosman Guy, Karaman Sertac, Rus Daniela
- VDGD: Mitigating LVLM Hallucinations In Cognitive Prompts By Bridging The Visual Perception Gap Sreyan Ghosh, Chandra Kiran Reddy Evuru, Sonal Kumar, Utkarsh Tyagi, Oriol Nieto, Zeyu Jin, Dinesh Manocha
- Comparative Analysis Of Different Efficient Fine Tuning Methods Of Large Language Models (llms) In Low-resource Setting Srinivasan Krishna Prasad Varadarajan, Gumpena Prasanth, Yattapu Madhusudhana, Brahmbhatt Vishal H.
- Towards Truly Zero-shot Compositional Visual Reasoning With Llms As Programmers Stanić Aleksandar, Caelles Sergi, Tschannen Michael
- The Fine-tuning Paradox: Boosting Translation Quality Without Sacrificing LLM Abilities Stap David, Hasler Eva, Byrne Bill, Monz Christof, Tran Ke
- Fine-tuning Large Language Models For Entity Matching Steiner Aaron, Peeters Ralph, Bizer Christian
- LACIE: Listener-aware Finetuning For Confidence Calibration In Large Language Models Stengel-eskin Elias, Hase Peter, Bansal Mohit
- Groundedness In Retrieval-augmented Long-form Generation: An Empirical Study Stolfo Alessandro
- ARKS: Active Retrieval In Knowledge Soup For Code Generation Su Hongjin, Jiang Shuyang, Lai Yuhang, Wu Haoyuan, Shi Boao, Liu Che, Liu Qian, Yu Tao
- Numllm: Numeric-sensitive Large Language Model For Chinese Finance Su Huan-yi, Wu Ke, Huang Yu-hao, Li Wu-jun
- Investigating Video Reasoning Capability Of Large Language Models With Tropes In Movies Su Hung-ting, Chao Chun-tong, Hsu Ya-ching, Lin Xudong, Niu Yulei, Lee Hung-yi, Hsu Winston H.
- Mission Impossible: A Statistical Perspective On Jailbreaking Llms Su Jingtong, Kempe Julia, Ullrich Karen
- Unsupervised Real-time Hallucination Detection Based On The Internal States Of Large Language Models Su Weihang, Wang Changyue, Ai Qingyao, Hu Yiran, Wu Zhijing, Zhou Yujia, Liu Yiqun
- SK-VQA: Synthetic Knowledge Generation At Scale For Training Context-augmented Multimodal Llms Su Xin, Luo Man, Pan Kris W, Chou Tien Pei, Lal Vasudev, Howard Phillip
- Llm-barber: Block-aware Rebuilder For Sparsity Mask In One-shot For Large Language Models Su Yupeng, Guan Ziyi, Liu Xiaoqun, Jin Tianlai, Wu Dongkuan, Chesi Graziano, Wong Ngai, Yu Hao
- M-QALM: A Benchmark To Assess Clinical Reading Comprehension And Knowledge Recall In Large Language Models Via Question Answering Subramanian Anand, Schlegel Viktor, Kashyap Abhinav Ramesh, Nguyen Thanh-tung, Dwivedi Vijay Prakash, Winkler Stefan
- LAB: Large-scale Alignment For Chatbots Sudalairaj Shivchander, Bhandwaldar Abhishek, Pareja Aldo, Xu Kai, Cox David D., Srivastava Akash
- Fidelis: Faithful Reasoning In Large Language Model For Knowledge Graph Question Answering Sui Yuan, He Yufei, Liu Nian, He Xiaoxin, Wang Kun, Hooi Bryan
- 70b-parameter Large Language Models In Japanese Medical Question-answering Sukeda Issey, Kishikawa Risa, Kodera Satoshi
- Structured Chain-of-thought Prompting For Few-shot Generation Of Content-grounded QA Conversations Sultan Md Arafat, Ganhotra Jatin, Astudillo Ramón Fernandez
- Parallelparc: A Scalable Pipeline For Generating Natural-language Analogies Sultan Oren, Bitton Yonatan, Yosef Ron, Shahaf Dafna
- Persona-db: Efficient Large Language Model Personalization For Response Prediction With Collaborative Data Refinement Sun Chenkai, Yang Ke, Reddy Revanth Gangi, Fung Yi R., Chan Hou Pong, Small Kevin, Zhai Chengxiang, Ji Heng
- Sq-llava: Self-questioning For Large Vision-language Assistant Sun Guohao, Qin Can, Wang Jiamian, Chen Zeyuan, Xu Ran, Tao Zhiqiang
- Delrec: Distilling Sequential Pattern To Enhance Llm-based Recommendation Sun Guohao, Zhang Haoyi
- Supervised Fine-tuning As Inverse Reinforcement Learning Sun Hao
- Fuxitranyu: A Multilingual Large Language Model Trained With Balanced Data Sun Haoran, Jin Renren, Xu Shaoyang, Pan Leiyu, Supryadi, Cui Menglong, Du Jiangcun, Lei Yikun, Yang Lei, Shi Ling, Xiao Juesi, Zhu Shaolin, Xiong Deyi
- Bbox-adapter: Lightweight Adapting For Black-box Large Language Models Sun Haotian, Zhuang Yuchen, Wei Wei, Zhang Chao, Dai Bo
- Lawluo: A Chinese Law Firm Co-run By LLM Agents Sun Jingyun, Dai Chengxiao, Luo Zhongze, Chang Yangbo, Li Yang
- Fine-tuning Vs Prompting, Can Language Models Understand Human Values? Sun Pingwei
- Dr-llava: Visual Instruction Tuning With Symbolic Clinical Grounding Sun Shenghuan, Goldgof Gregory M., Schubert Alexander, Sun Zhiqing, Hartvigsen Thomas, Butte Atul J., Alaa Ahmed
- The Critique Of Critique Sun Shichao, Li Junlong, Yuan Weizhe, Yuan Ruifeng, Li Wenjie, Liu Pengfei
- Towards Detecting Llms Hallucination Via Markov Chain-based Multi-agent Debate Framework Sun Xiaoxi, Li Jinpeng, Zhong Yan, Zhao Dongyan, Yan Rui
- Expert-guided Extinction Of Toxic Tokens For Debiased Generation Sun Xueyao, Shi Kaize, Tang Haoran, Xu Guandong, Li Qing
- QDA-SQL: Questions Enhanced Dialogue Augmentation For Multi-turn Text-to-sql Sun Yinggang, Guo Ziming, Yu Haining, Liu Chuanyi, Li Xiang, Wang Bingxuan, Yu Xiangzhan, Zhao Tiancheng
- DFA-RAG: Conversational Semantic Router For Large Language Model With Definite Finite Automaton Sun Yiyou, Hu Junjie, Cheng Wei, Chen Haifeng
- F-eval: Assessing Fundamental Abilities With Refined Evaluation Methods Sun Yu, Chen Keyu, Wang Shujie, Li Peiji, Guo Qipeng, Yan Hang, Qiu Xipeng, Huang Xuanjing, Lin Dahua
- You Only Cache Once: Decoder-decoder Architectures For Language Models Sun Yutao, Dong Li, Zhu Yi, Huang Shaohan, Wang Wenhui, Ma Shuming, Zhang Quanlu, Wang Jianyong, Wei Furu
- Leveraging Chatgpt In Pharmacovigilance Event Extraction: An Empirical Study Sun Zhaoyue, Pergola Gabriele, Wallace Byron C., He Yulan
- Llavaolmobitnet1b: Ternary LLM Goes Multimodal! Sundaram Jainaveen, Iyer Ravi
- Towards Smaller, Faster Decoder-only Transformers: Architectural Variants And Their Implications Suresh Sathya Krishnan, P Shunmugapriya
- Automating Research Synthesis With Domain-specific Large Language Model Fine-tuning Susnjak Teo, Hwang Peter, Reyes Napoleon H., Barczak Andre L. C., Mcintosh Timothy R., Ranathunga Surangika
- JORA: JAX Tensor-parallel Lora Library For Retrieval Augmented Fine-tuning Tahir Anique, Cheng Lu, Liu Huan
- PIXAR: Auto-regressive Language Modeling In Pixel Space Tai Yintao, Liao Xiyang, Suglia Alessandro, Vergari Antonio
- Pretraining And Updating Language- And Domain-specific Large Language Model: A Case Study In Japanese Business Domain Takahashi Kosuke, Omi Takahiro, Arima Kosuke, Ishigaki Tatsuya
- Large Vocabulary Size Improves Large Language Models Takase Sho, Ri Ryokan, Kiyono Shun, Kato Takuya
- 1.5-pints Technical Report: Pretraining In Days, Not Months -- Your Language Model Thrives On Quality Data Tan Calvin, Wang Jerome
- Retrieval Meets Reasoning: Even High-school Textbook Knowledge Benefits Multimodal Reasoning Tan Cheng, Wei Jingxuan, Sun Linzhuang, Gao Zhangyang, Li Siyuan, Yu Bihui, Guo Ruifeng, Li Stan Z.
- Mobilequant: Mobile-friendly Quantization For On-device Language Models Tan Fuwen, Lee Royson, Dudziak Łukasz, Hu Shell Xu, Bhattacharya Sourav, Hospedales Timothy, Tzimiropoulos Georgios, Martinez Brais
- Idgenrec: Llm-recsys Alignment With Textual ID Learning Tan Juntao, Xu Shuyuan, Hua Wenyue, Ge Yingqiang, Li Zelong, Zhang Yongfeng
- Fine-tuning Large Language Model (LLM) Artificial Intelligence Chatbots In Ophthalmology And Llm-based Evaluation Using GPT-4 Tan Ting Fang, Elangovan Kabilan, Jin Liyuan, Jie Yao, Yong Li, Lim Joshua, Poh Stanley, Ng Wei Yan, Lim Daniel, Ke Yuhe, Liu Nan, Ting Daniel Shu Wei
- Struct-x: Enhancing Large Language Models Reasoning With Structured Data Tan Xiaoyu, Wang Haoyu, Qiu Xihe, Cheng Yuan, Xu Yinghui, Chu Wei, Qi Yuan
- Democratizing Large Language Models Via Personalized Parameter-efficient Fine-tuning Tan Zhaoxuan, Zeng Qingkai, Tian Yijun, Liu Zheyuan, Yin Bing, Jiang Meng
- DLO: Dynamic Layer Operation For Efficient Vertical Scaling Of Llms Tan Zhen, Dong Daize, Zhao Xinyu, Peng Jie, Cheng Yu, Chen Tianlong
- Large Language Models For Data Annotation: A Survey Tan Zhen, Li Dawei, Wang Song, Beigi Alimohammad, Jiang Bohan, Bhattacharjee Amrita, Karami Mansooreh, Li Jundong, Cheng Lu, Liu Huan
- Can I Understand What I Create? Self-knowledge Evaluation Of Large Language Models Tan Zhiquan, Wei Lai, Wang Jindong, Xie Xing, Huang Weiran
- An Empirical Study And Analysis Of Text-to-image Generation Using Large Language Model-powered Textual Representation Tan Zhiyu, Yang Mengping, Qin Luozheng, Yang Hao, Qian Ye, Zhou Qiang, Zhang Cheng, Li Hao
- EVALALIGN: Supervised Fine-tuning Multimodal Llms With Human-aligned Data For Evaluating Text-to-image Models Tan Zhiyu, Yang Xiaomeng, Qin Luozheng, Yang Mengping, Zhang Cheng, Li Hao
- Instructdoc: A Dataset For Zero-shot Generalization Of Visual Document Understanding With Instructions Tanaka Ryota, Iki Taichi, Nishida Kyosuke, Saito Kuniko, Suzuki Jun
- Jill Watson: A Virtual Teaching Assistant Powered By Chatgpt Taneja Karan, Maiti Pratyusha, Kakar Sandeep, Guruprasad Pranav, Rao Sanjeev, Goel Ashok K.
- Razorattention: Efficient KV Cache Compression Through Retrieval Heads Tang Hanlin, Lin Yang, Lin Jing, Han Qingsen, Hong Shikuan, Yao Yiwu, Wang Gongyi
- Vilco-bench: Video Language Continual Learning Benchmark Tang Tianqi, Deldari Shohreh, Xue Hao, De Melo Celso, Salim Flora D.
- Llmbox: A Comprehensive Library For Large Language Models Tang Tianyi, Hu Yiwen, Li Bingqian, Luo Wenyang, Qin Zijing, Sun Haoxiang, Wang Jiapeng, Xu Shiyi, Cheng Xiaoxue, Guo Geyang, Peng Han, Zheng Bowen, Tang Yiru, Min Yingqian, Chen Yushuo, Chen Jie, Zhao Yuanqian, Ding Luran, Wang Yuhao, Dong Zican, Xia Chunxuan, Li Junyi, Zhou Kun, Zhao Wayne Xin, Wen Ji-rong
- On The Hardness Of Faithful Chain-of-thought Reasoning In Large Language Models Tanneru Sree Harsha, Ley Dan, Agarwal Chirag, Lakkaraju Himabindu
- Scaling Laws With Vocabulary: Larger Models Deserve Larger Vocabularies Tao Chaofan, Liu Qian, Dou Longxu, Muennighoff Niklas, Wan Zhongwei, Luo Ping, Lin Min, Wong Ngai
- Unlocking The Potential Of Model Merging For Low-resource Languages Tao Mingxu, Zhang Chen, Huang Quzhe, Ma Tianyao, Huang Songfang, Zhao Dongyan, Feng Yansong
- MEEL: Multi-modal Event Evolution Learning Tao Zhengwei, Jin Zhi, Huang Junqiang, Chen Xiancai, Bai Xiaoying, Zhao Haiyan, Zhang Yifan, Tao Chongyang
- A Survey On Self-evolution Of Large Language Models Tao Zhengwei, Lin Ting-en, Chen Xiancai, Li Hangyu, Wu Yuchuan, Li Yongbin, Jin Zhi, Huang Fei, Tao Dacheng, Zhou Jingren
- Distilling Llms' Decomposition Abilities Into Compact Language Models Tarasov Denis, Shridhar Kumar
- Exploring Design Choices For Building Language-specific Llms Tejaswi Atula, Gupta Nilesh, Choi Eunsol
- Embedding-aligned Language Models Tennenholtz Guy, Chow Yinlam, Hsu Chih-wei, Shani Lior, Liang Ethan, Boutilier Craig
- Training-free Consistent Text-to-image Generation Tewel Yoad, Kaduri Omri, Gal Rinon, Kasten Yoni, Wolf Lior, Chechik Gal, Atzmon Yuval
- Equipping Language Models With Tool Use Capability For Tabular Data Analysis In Finance Theuma Adrian, Shareghi Ehsan
- Limits Of Transformer Language Models On Learning To Compose Algorithms Thomm Jonathan, Terzic Aleksandar, Camposampiero Giacomo, Hersche Michael, Schölkopf Bernhard, Rahimi Abbas
- Elsevier Arena: Human Evaluation Of Chemistry/biology/health Foundational Large Language Models Thorne Camilo, Druckenbrodt Christian, Szarkowska Kinga, Goyal Deepika, Marajan Pranita, Somanath Vijay, Harper Corey, Yan Mao, Scerri Tony
- Climategpt: Towards AI Synthesizing Interdisciplinary Research On Climate Change Thulke David, Gao Yingbo, Pelser Petrus, Brune Rein, Jalota Rricha, Fok Floris, Ramos Michael, Van Wyk Ian, Nasir Abdallah, Goldstein Hayden, Tragemann Taylor, Nguyen Katie, Fowler Ariana, Stanco Andrew, Gabriel Jon, Taylor Jordan, Moro Dean, Tsymbalov Evgenii, De Waal Juliette, Matusov Evgeny, Yaghi Mudar, Shihadah Mohammad, Ney Hermann, Dugast Christian, Dotan Jonathan, Erasmus Daniel
- Hydralora: An Asymmetric Lora Architecture For Efficient Fine-tuning Tian Chunlin, Shi Zhan, Guo Zhijiang, Li Li, Xu Chengzhong
- Untie The Knots: An Efficient Data Augmentation Strategy For Long-context Pre-training In Language Models Tian Junfeng, Zheng Da, Cheng Yang, Wang Rui, Zhang Colin, Zhang Debing
- Tokenize The World Into Object-level Knowledge To Address Long-tail Events In Autonomous Driving Tian Ran, Li Boyi, Weng Xinshuo, Chen Yuxiao, Schmerling Edward, Wang Yue, Ivanovic Boris, Pavone Marco
- Toward Self-improvement Of Llms Via Imagination, Searching, And Criticizing Tian Ye, Peng Baolin, Song Linfeng, Jin Lifeng, Yu Dian, Mi Haitao, Yu Dong
- LLM Circuit Analyses Are Consistent Across Training And Scale Tigges Curt, Hanna Michael, Yu Qinan, Biderman Stella
- Cantor: Inspiring Multimodal Chain-of-thought Of MLLM Timin Gao, Peixian Chen, Mengdan Zhang, Chaoyou Fu, Yunhang Shen, Yan Zhang, Shengchuan Zhang, Xiawu Zheng, Xing Sun, Liujuan Cao, Rongrong Ji
- Aalap: AI Assistant For Legal & Paralegal Functions In India Tiwari Aman, Kalamkar Prathamesh, Banerjee Atreyo, Karn Saurabh, Hemachandran Varun, Gupta Smita
- Zyda: A 1.3T Dataset For Open Language Modeling Tokpanov Yury, Millidge Beren, Glorioso Paolo, Pilault Jonathan, Ibrahim Adam, Whittington James, Anthony Quentin
- An Ecosage Assistant: Towards Building A Multimodal Plant Care Dialogue Assistant Tomar Mohit, Tiwari Abhisek, Saha Tulika, Jha Prince, Saha Sriparna
- Understanding Linear Probing Then Fine-tuning Language Models From NTK Perspective Tomihari Akiyoshi, Sato Issei
- Eyes Wide Shut? Exploring The Visual Shortcomings Of Multimodal Llms Tong Shengbang, Liu Zhuang, Zhai Yuexiang, Ma Yi, Lecun Yann, Xie Saining
- Can Llms Learn From Previous Mistakes? Investigating Llms' Errors To Boost For Reasoning Tong Yongqi, Li Dawei, Wang Sizhe, Wang Yujia, Teng Fei, Shang Jingbo
- Optimizing Language Model's Reasoning Abilities With Weak Supervision Tong Yongqi, Wang Sizhe, Li Dawei, Wang Yifan, Han Simeng, Lin Zi, Huang Chengsong, Huang Jiaxin, Shang Jingbo
- A Comprehensive Survey Of Hallucination Mitigation Techniques In Large Language Models Tonmoy S. M Towhidul Islam, Zaman S M Mehedi, Jain Vinija, Rani Anku, Rawte Vipula, Chadha Aman, Das Amitava
- Openmathinstruct-1: A 1.8 Million Math Instruction Tuning Dataset Toshniwal Shubham, Moshkov Ivan, Narenthiran Sean, Gitman Daria, Jia Fei, Gitman Igor
- Few-shot Cross-lingual Transfer For Prompting Large Language Models In Low-resource Languages Toukmaji Christopher
- UCCIX: Irish-excellence Large Language Model Tran Khanh-tung, O'sullivan Barry, Nguyen Hoang D.
- Connecting The Dots: Llms Can Infer And Verbalize Latent Structure From Disparate Training Data Treutlein Johannes, Choi Dami, Betley Jan, Anil Cem, Marks Samuel, Grosse Roger Baker, Evans Owain
- Discrete Multimodal Transformers With A Pretrained Large Language Model For Mixed-supervision Speech Processing Trinh Viet Anh, Southwell Rosy, Guan Yiwen, He Xinlu, Wang Zhiyong, Whitehill Jacob
- Linguacodus: A Synergistic Framework For Transformative Code Generation In Machine Learning Pipelines Trofimova Ekaterina, Sataev Emil, Ustyuzhanin Andrey E.
- Crossing Linguistic Horizons: Finetuning And Comprehensive Evaluation Of Vietnamese Large Language Models Truong Sang T., Nguyen Duc Q., Nguyen Toan, Le Dong D., Truong Nhi N., Quan Tho, Koyejo Sanmi
- Code Less, Align More: Efficient LLM Fine-tuning For Code Generation With Data Pruning Tsai Yun-da, Liu Mingjie, Ren Haoxing
- Enhance Modality Robustness In Text-centric Multimodal Alignment With Adversarial Prompting Tsai Yun-da, Yen Ting-yu, Liao Keng-te, Lin Shou-de
- Towards Safety And Helpfulness Balanced Responses Via Controllable Large Language Models Tuan Yi-lin, Chen Xilun, Smith Eric Michael, Martin Louis, Batra Soumya, Celikyilmaz Asli, Wang William Yang, Bikel Daniel M.
- Unmasking The Imposters: In-domain Detection Of Human Vs. Machine-generated Tweets Tuck Bryan E., Verma Rakesh M.
- Dynamo: Accelerating Language Model Inference With Dynamic Multi-token Sampling Tuli Shikhar, Lin Chi-heng, Hsu Yen-chang, Jha Niraj K., Shen Yilin, Jin Hongxia
- VBART: The Turkish LLM Turker Meliksah, Ari Mehmet Erdi, Han Aydin
- Asking And Answering Questions To Extract Event-argument Structures Uddin Md Nayem, George Enfa Rose, Blanco Eduardo, Corman Steven
- Unseentimeqa: Time-sensitive Question-answering Beyond Llms' Memorization Uddin Md Nayem, Saeidi Amir, Handa Divij, Seth Agastya, Son Tran Cao, Blanco Eduardo, Corman Steven R., Baral Chitta
- Advancing Large Multi-modal Models With Explicit Chain-of-reasoning And Visual Question Generation Uehara Kohei, Goswami Nabarun, Wang Hanqin, Baba Toshiaki, Tanaka Kohtaro, Hashimoto Tomohiro, Wang Kai, Ito Rei, Naoya Takagi, Umagami Ryo, Wen Yingyi, Anakewat Tanachai, Harada Tatsuya
- Adacoder: Adaptive Prompt Compression For Programmatic Visual Question Answering Ukai Mahiro, Kurita Shuhei, Hashimoto Atsushi, Ushiku Yoshitaka, Inoue Nakamasa
- Bootstrapping Llm-based Task-oriented Dialogue Agents Via Self-talk Ulmer Dennis, Mansimov Elman, Lin Kaixiang, Sun Justin, Gao Xibin, Zhang Yi
- Sandwich Attack: Multi-language Mixture Adaptive Attack On Llms Upadhayay Bibek, Behzadan Vahid
- From Words To Numbers: Your Large Language Model Is Secretly A Capable Regressor When Given In-context Examples Vacareanu Robert, Negru Vlad-andrei, Suciu Vasile, Surdeanu Mihai
- Evaluating The Capability Of Large Language Models To Personalize Science Texts For Diverse Middle-school-age Learners Vaccaro Michael Jr, Friday Mikayla, Zaghi Arash
- Autopuredata: Automated Filtering Of Web Data For LLM Fine-tuning Vadlapati Praneeth
- Accelerating Large Language Model Inference With Self-supervised Early Exits Valade Florian
- Improved Baselines For Data-efficient Perceptual Augmentation Of Llms Vallaeys Théophane, Shukor Mustafa, Cord Matthieu, Verbeek Jakob
- Investigating The Performance Of Language Models For Completing Code In Functional Programming Languages: A Haskell Case Study Van Dam Tim, Van Der Heijden Frank, De Bekker Philippe, Nieuwschepen Berend, Otten Marc, Izadi Maliheh
- Infusing Knowledge Into Large Language Models With Contextual Prompts Vasisht Kinshuk, Ganesan Balaji, Kumar Vikas, Bhatnagar Vasudha
- A Survey Of Prompt Engineering Methods In Large Language Models For Different NLP Tasks Vatsal Shubham, Dubey Harsh
- Enhancing Model Performance: Another Approach To Vision-language Instruction Tuning Vedanshu, Tripathi Mm, Jaint Bhavnesh
- Question Suggestion For Conversational Shopping Assistants Using Product Metadata Vedula Nikhita, Rokhlenko Oleg, Malmasi Shervin
- DFKI-NLP At Semeval-2024 Task 2: Towards Robust Llms Using Data Perturbations And Minmax Training Verma Bhuvanesh, Raithel Lisa
- Cross-modal Projection In Multimodal Llms Doesn't Really Project Visual Attributes To Textual Space Verma Gaurav, Choi Minje, Sharma Kartik, Watson-daniels Jamelle, Oh Sejoon, Kumar Srijan
- How Much Data Is Enough Data? Fine-tuning Large Language Models For In-house Translation: Performance Evaluation Across Multiple Dataset Sizes Vieira Inacio, Allred Will, Lankford Séamus, Castilho Sheila, Way Andy
- Input Conditioned Graph Generation For Language Agents Vierling Lukas, Fu Jie, Chen Kai
- Medreqal: Examining Medical Knowledge Recall Of Large Language Models Via Question Answering Vladika Juraj, Schneider Phillip, Matthes Florian
- Redwhale: An Adapted Korean LLM Through Efficient Continual Pretraining Vo Anh-dung, Jung Minseong, Lee Wonbeen, Choi Daewoo
- Vi-mistral-x: Building A Vietnamese Language Model With Advanced Continual Pre-training Vo James
- Foundational Autoraters: Taming Large Language Models For Better Automatic Evaluation Vu Tu, Krishna Kalpesh, Alzubi Salaheddin, Tar Chris, Faruqui Manaal, Sung Yun-hsuan
- Generative Large Language Models In Automated Fact-checking: A Survey Vykopal Ivan, Pikuliak Matúš, Ostermann Simon, Šimko Marián
- Leveraging Zero-shot Prompting For Efficient Language Model Distillation Vöge Lukas, Gurgul Vincent, Lessmann Stefan
- Investigating Mysteries Of Cot-augmented Distillation Wadhwa Somin, Amir Silvio, Wallace Byron C.
- Paraphrase Types Elicit Prompt Engineering Capabilities Wahle Jan Philip, Ruas Terry, Xu Yang, Gipp Bela
- An Empirical Study Of Mamba-based Language Models Waleffe Roger, Byeon Wonmin, Riach Duncan, Norick Brandon, Korthikanti Vijay, Dao Tri, Gu Albert, Hatamizadeh Ali, Singh Sudhakar, Narayanan Deepak, Kulshreshtha Garvit, Singh Vartika, Casper Jared, Kautz Jan, Shoeybi Mohammad, Catanzaro Bryan
- The Instruction Hierarchy: Training Llms To Prioritize Privileged Instructions Wallace Eric, Xiao Kai, Leike Reimar, Weng Lilian, Heidecke Johannes, Beutel Alex
- Temporal Blind Spots In Large Language Models Wallat Jonas, Jatowt Adam, Anand Avishek
- Fusechat: Knowledge Fusion Of Chat Models Wan Fanqi, Zhong Longguang, Yang Ziyi, Chen Ruijun, Quan Xiaojun
- LOOK-M: Look-once Optimization In KV Cache For Efficient Multimodal Long-context Inference Wan Zhongwei, Wu Ziang, Liu Che, Huang Jinfa, Zhu Zhihong, Jin Peng, Wang Longyue, Yuan Li
- Datavist5: A Pre-trained Language Model For Jointly Understanding Text And Data Visualization Wan Zhuoyue, Song Yuanfeng, Li Shuaimin, Zhang Chen Jason, Wong Raymond Chi-wing
- COSMO: Contrastive Streamlined Multimodal Model With Interleaved Pre-training Wang Alex Jinpeng, Li Linjie, Lin Kevin Qinghong, Wang Jianfeng, Lin Kevin, Yang Zhengyuan, Wang Lijuan, Shou Mike Zheng
- Pargo: Bridging Vision-language With Partial And Global Views Wang An-lan, Shan Bin, Shi Wei, Lin Kun-yu, Fei Xiang, Tang Guozhi, Liao Lei, Tang Jingqun, Huang Can, Zheng Wei-shi
- Image Re-identification: Where Self-supervision Meets Vision-language Learning Wang Bin, Liang Yuying, Cai Lei, Huang Huakun, Zeng Huanqiang
- IAA: Inner-adaptor Architecture Empowers Frozen Large Language Model With Multimodal Capabilities Wang Bin, Xie Chunyu, Leng Dawei, Yin Yuhui
- ANGO: A Next-level Evaluation Benchmark For Generation-oriented Language Models In Chinese Domain Wang Bingchao
- Grokked Transformers Are Implicit Reasoners: A Mechanistic Journey To The Edge Of Generalization Wang Boshi, Yue Xiang, Su Yu, Sun Huan
- Direct: Diagnostic Reasoning For Clinical Notes Via Large Language Models Wang Bowen, Chang Jiuyang, Qian Yiming, Chen Guoxin, Chen Junhao, Jiang Zhouqiang, Zhang Jiahao, Nakashima Yuta, Nagahara Hajime
- Information Theoretic Text-to-image Alignment Wang Chao, Franzese Giulio, Finamore Alessandro, Gallo Massimo, Michiardi Pietro
- BLSP-KD: Bootstrapping Language-speech Pre-training Via Knowledge Distillation Wang Chen, Liao Minpeng, Huang Zhongqiang, Zhang Jiajun
- Hybrid Alignment Training For Large Language Models Wang Chenglong, Zhou Hang, Chang Kaiyan, Li Bei, Mu Yongyu, Xiao Tong, Liu Tongran, Zhu Jingbo
- A Survey For Large Language Models In Biomedicine Wang Chong, Li Mengyao, He Junjun, Wang Zhongruo, Darzi Erfan, Chen Zan, Ye Jin, Li Tianbin, Su Yanzhou, Ke Jing, Qu Kaili, Li Shuxin, Yu Yi, Liò Pietro, Wang Tianyun, Wang Yu Guang, Shen Yiqing
- Instruction Tuning-free Visual Token Complement For Multimodal Llms Wang Dongsheng, Cui Jiequan, Li Miaoge, Lin Wang, Chen Bo, Zhang Hanwang
- Planning In Natural Language Improves LLM Search For Code Generation Wang Evan, Cassano Federico, Wu Catherine, Bai Yunfeng, Song Will, Nath Vaskar, Han Ziwen, Hendryx Sean, Yue Summer, Zhang Hugh
- Adapting Open-source Large Language Models For Cost-effective, Expert-level Clinical Note Generation With On-policy Reinforcement Learning Wang Hanyin, Gao Chufan, Liu Bolun, Xu Qiping, Hussein Guleid, Labban Mohamad El, Iheasirim Kingsley, Korsapati Hariprasad, Outcalt Chuck, Sun Jimeng
- Reinforcement Learning For Edit-based Non-autoregressive Neural Machine Translation Wang Hao, Morimura Tetsuro, Honda Ukyo, Kawahara Daisuke
- Interpretable Preferences Via Multi-objective Reward Modeling And Mixture-of-experts Wang Haoxiang, Xiong Wei, Xie Tengyang, Zhao Han, Zhang Tong
- Unims-rag: A Unified Multi-source Retrieval-augmented Generation For Personalized Dialogue Systems Wang Hongru, Huang Wenyu, Deng Yang, Wang Rui, Wang Zezhong, Wang Yufei, Mi Fei, Pan Jeff Z., Wong Kam-fai
- Techgpt-2.0: A Large Language Model Project To Solve The Task Of Knowledge Graph Construction Wang Jiaqi, Chang Yuying, Li Zhong, An Ning, Ma Qi, Hei Lei, Luo Haibo, Lu Yifei, Ren Feiliang
- Tarsier: Recipes For Training And Evaluating Large Video Description Models Wang Jiawei, Yuan Liping, Zhang Yuchen
- Helpful Or Harmful Data? Fine-tuning-free Shapley Attribution For Explaining Language Model Predictions Wang Jingtan, Lin Xiaoqiang, Qiao Rui, Foo Chuan-sheng, Low Bryan Kian Hsiang
- JMLR: Joint Medical LLM And Retrieval Training For Enhancing Reasoning And Professional Question Answering Capability Wang Junda, Yang Zhichao, Yao Zonghai, Yu Hong
- The Mamba In The Llama: Distilling And Accelerating Hybrid Models Wang Junxiong, Paliotta Daniele, May Avner, Rush Alexander M., Dao Tri
- Selective Forgetting: Advancing Machine Unlearning Techniques And Evaluation In Language Models Wang Lingzhi, Zeng Xingshan, Guo Jinsong, Wong Kam-fai, Gottlob Georg
- Beyond The Known: Investigating Llms Performance On Out-of-domain Intent Detection Wang Pei, He Keqing, Wang Yejie, Song Xiaoshuai, Mou Yutao, Wang Jingang, Xian Yunsen, Cai Xunliang, Xu Weiran
- Will The Real Linda Please Stand Up...to Large Language Models? Examining The Representativeness Heuristic In Llms Wang Pengda, Xiao Zilin, Chen Hanjie, Oswald Frederick L.
- Inferaligner: Inference-time Alignment For Harmlessness Through Cross-model Guidance Wang Pengyu, Zhang Dong, Li Linyang, Tan Chenkun, Wang Xinghao, Ren Ke, Jiang Botian, Qiu Xipeng
- Reconstruct The Pruned Model Without Any Retraining Wang Pingjie, Fan Ziqing, Hu Shengchao, Chen Zhe, Wang Yanfeng, Wang Yu
- Cross-refine: Improving Natural Language Explanation Generation By Learning In Tandem Wang Qianli, Anikina Tatiana, Feldhus Nils, Ostermann Simon, Möller Sebastian, Schmitt Vera
- Automated Generation Of Multiple-choice Cloze Questions For Assessing English Vocabulary Using Gpt-turbo 3.5 Wang Qiao, Rose Ralph, Orita Naho, Sugawara Ayaka
- Unlearning With Control: Assessing Real-world Utility For Large Language Model Unlearning Wang Qizhou, Han Bo, Yang Puning, Zhu Jianing, Liu Tongliang, Sugiyama Masashi
- SUBLLM: A Novel Efficient Architecture With Token Sequence Subsampling For LLM Wang Quandong, Yuan Yuxuan, Yang Xiaoyu, Zhang Ruike, Zhao Kang, Liu Wei, Luan Jian, Povey Daniel, Wang Bin
- Role Prompting Guided Domain Adaptation With General Capability Preserve For Large Language Models Wang Rui, Mi Fei, Chen Yi, Xue Boyang, Wang Hongru, Zhu Qi, Wong Kam-fai, Xu Ruifeng
- Deconfounded Causality-aware Parameter-efficient Fine-tuning For Problem-solving Improvement Of Llms Wang Ruoyu, Li Xiaoxuan, Yao Lina
- Unique Security And Privacy Threats Of Large Language Model: A Comprehensive Survey Wang Shang, Zhu Tianqing, Liu Bo, Ding Ming, Guo Xu, Ye Dayong, Zhou Wanlei, Yu Philip S.
- Zero-shot Generative Large Language Models For Systematic Review Screening Automation Wang Shuai, Scells Harrisen, Zhuang Shengyao, Potthast Martin, Koopman Bevan, Zuccon Guido
- Self-taught Evaluators Wang Tianlu, Kulikov Ilia, Golovneva Olga, Yu Ping, Yuan Weizhe, Dwivedi-yu Jane, Pang Richard Yuanzhe, Fazel-zarandi Maryam, Weston Jason, Li Xian
- Weaver: Foundation Models For Creative Writing Wang Tiannan, Chen Jiamin, Jia Qingrui, Wang Shuai, Fang Ruoyu, Wang Huilin, Gao Zhaowei, Xie Chunzhao, Xu Chuou, Dai Jihong, Liu Yibin, Wu Jialong, Ding Shengwei, Li Long, Huang Zhiwei, Deng Xinle, Yu Teng, Ma Gangan, Xiao Han, Chen Zixin, Xiang Danjun, Wang Yunxia, Zhu Yuanyuan, Xiao Yi, Wang Jing, Wang Yiru, Ding Siran, Huang Jiayang, Xu Jiayi, Tayier Yilihamu, Hu Zhenyu, Gao Yuan, Zheng Chengfeng, Ye Yueshu, Li Yihang, Wan Lei, Jiang Xinyue, Wang Yujie, Cheng Siyu, Song Zhule, Tang Xiangru, Xu Xiaohua, Zhang Ningyu, Chen Huajun, Jiang Yuchen Eleanor, Zhou Wangchunshu
- Less Is More For Improving Automatic Evaluation Of Factual Consistency Wang Tong, Kulkarni Ninad, Qi Yanjun
- Using Advanced Llms To Enhance Smaller Llms: An Interpretable Knowledge Distillation Approach Wang Tong, Sudhir K., Hong Dat
- QCRD: Quality-guided Contrastive Rationale Distillation For Large Language Models Wang Wei, Li Zhaowei, Xu Qi, Cai Yiqing, Song Hang, Qi Qi, Zhou Ran, Huang Zhida, Wang Tao, Xiao Li
- Secokd: Aligning Large Language Models For In-context Learning With Fewer Shots Wang Weixing, Yang Haojin, Meinel Christoph
- Learning To Ask: When Llms Meet Unclear Instruction Wang Wenxuan, Shi Juluan, Wang Chaozheng, Lee Cheryl, Yuan Youliang, Huang Jen-tse, Lyu Michael R.
- Metatool: Facilitating Large Language Models To Master Tools With Meta-task Augmentation Wang Xiaohan, Li Dian, Zhao Yilin, Sinbadliu, Wang Hui
- Longllava: Scaling Multi-modal Llms To 1000 Images Efficiently Via Hybrid Architecture Wang Xidong, Song Dingjie, Chen Shunian, Zhang Chen, Wang Benyou
- Executable Code Actions Elicit Better LLM Agents Wang Xingyao, Chen Yangyi, Yuan Lifan, Zhang Yizhe, Li Yunzhu, Peng Hao, Ji Heng
- Modaverse: Efficiently Transforming Modalities With Llms Wang Xinyu, Zhuang Bohan, Wu Qi
- Hidden Question Representations Tell Non-factuality Within And Across Large Language Models Wang Yanling, Li Haoyang, Zou Hao, Zhang Jing, He Xinlei, Li Qi, Xu Ke
- Inscl: A Data-efficient Continual Learning Paradigm For Fine-tuning Large Language Models With Instructions Wang Yifan, Liu Yafei, Shi Chufan, Li Haoling, Chen Chen, Lu Haonan, Yang Yujiu
- Uncertainty Aware Learning For Language Model Alignment Wang Yikun, Zheng Rui, Ding Liang, Zhang Qi, Lin Dahua, Tao Dacheng
- Ladic: Are Diffusion Models Really Inferior To Autoregressive Counterparts For Image-to-text Generation? Wang Yuchi, Ren Shuhuai, Gao Rundong, Yao Linli, Guo Qingyan, An Kaikai, Bai Jianhong, Sun Xu
- REAR: A Relevance-aware Retrieval-augmented Framework For Open-domain Question Answering Wang Yuhao, Ren Ruiyang, Li Junyi, Zhao Wayne Xin, Liu Jing, Wen Ji-rong
- Maferw: Query Rewriting With Multi-aspect Feedbacks For Retrieval-augmented Large Language Models Wang Yujing, Zhang Hainan, Pang Liang, Zheng Hongwei, Zheng Zhiming
- Loma: Lossless Compressed Memory Attention Wang Yumeng, Xiao Zhenyang
- LSTP: Language-guided Spatial-temporal Prompt Learning For Long-form Video-text Understanding Wang Yuxuan, Wang Yueqian, Wu Pengfei, Liang Jianxin, Zhao Dongyan, Zheng Zilong
- Videollamb: Long-context Video Understanding With Recurrent Memory Bridges Wang Yuxuan, Xie Cihang, Liu Yang, Zheng Zilong
- To Recommend Or Not: Recommendability Identification In Conversations With Pre-trained Language Models Wang Zhefan, Ma Weizhi, Zhang Min
- A Comprehensive Survey Of LLM Alignment Techniques: RLHF, RLAIF, PPO, DPO And More Wang Zhichao, Bi Bin, Pentyala Shiva Kumar, Ramnath Kiran, Chaudhuri Sougata, Mehrotra Shubham, Zhu Zixu (James), Mao Xiang-bo, Asur Sitaram, Cheng Na (Claire)
- Re-task: Revisiting LLM Tasks From Capability, Skill, And Knowledge Perspectives Wang Zhihu, Zhao Shiwan, Wang Yu, Huang Heyuan, Shi Jiaxin, Xie Sitao, Wang Zhixing, Zhang Yubo, Li Hongyan, Yan Junchi
- Helpsteer2: Open-source Dataset For Training Top-performing Reward Models Wang Zhilin, Dong Yi, Delalleau Olivier, Zeng Jiaqi, Shen Gerald, Egert Daniel, Zhang Jimmy J., Sreedhar Makesh Narsimhan, Kuchaiev Oleksii
- Trove: Inducing Verifiable And Efficient Toolboxes For Solving Programmatic Tasks Wang Zhiruo, Fried Daniel, Neubig Graham
- Omnijarvis: Unified Vision-language-action Tokenization Enables Open-world Instruction Following Agents Wang Zihao, Cai Shaofei, Mu Zhancun, Lin Haowei, Zhang Ceyao, Liu Xuejie, Li Qing, Liu Anji, Ma Xiaojian, Liang Yitao
- Eliminating Position Bias Of Language Models: A Mechanistic Approach Wang Ziqi, Zhang Hanlin, Li Xiner, Huang Kuan-hao, Han Chi, Ji Shuiwang, Kakade Sham M., Peng Hao, Ji Heng
- Re2llm: Reflective Reinforcement Large Language Model For Session-based Recommendation Wang Ziyan, Du Yingpeng, Sun Zhu, Chua Haoyan, Feng Kaidong, Wang Wenya, Zhang Jie
- Flora: Federated Fine-tuning Large Language Models With Heterogeneous Low-rank Adaptations Wang Ziyao, Shen Zheyu, He Yexiao, Sun Guoheng, Wang Hongyi, Lyu Lingjuan, Li Ang
- Browse And Concentrate: Comprehending Multimodal Content Via Prior-llm Context Fusion Wang Ziyue, Chen Chi, Zhu Yiqi, Luo Fuwen, Li Peng, Yan Ming, Zhang Ji, Huang Fei, Sun Maosong, Liu Yang
- Is There No Such Thing As A Bad Question? H4R: Hallucibot For Ratiocination, Rewriting, Ranking, And Routing Watson William, Cho Nicole, Srishankar Nishan
- PARIKSHA: A Large-scale Investigation Of Human-llm Evaluator Agreement On Multilingual And Multi-cultural Data Watts Ishaan, Gumma Varun, Yadavalli Aditya, Seshadri Vivek, Swaminathan Manohar, Sitaram Sunayana
- OPDAI At Semeval-2024 Task 6: Small Llms Can Accelerate Hallucination Detection With Weakly Supervised Data Wei Chengcheng, Chen Ze, Fang Songtan, He Jiarong, Gao Max
- Measuring And Reducing LLM Hallucination Without Gold-standard Answers Wei Jiaheng, Yao Yuanshun, Ton Jean-francois, Guo Hongyi, Estornell Andrew, Liu Yang
- Skywork-moe: A Deep Dive Into Training Techniques For Mixture-of-experts Language Models Wei Tianwen, Zhu Bo, Zhao Liang, Cheng Cheng, Li Biye, Lü Weiwei, Cheng Peng, Zhang Jianhao, Zhang Xiaoyu, Zeng Liang, Wang Xiaokun, Ma Yutuan, Hu Rui, Yan Shuicheng, Fang Han, Zhou Yahui
- Instructrag: Instructing Retrieval-augmented Generation Via Self-synthesized Rationales Wei Zhepei, Chen Wei-lin, Meng Yu
- Grounding Data Science Code Generation With Input-output Specifications Wen Yeming, Yin Pengcheng, Shi Kensen, Michalewski Henryk, Chaudhuri Swarat, Polozov Alex
- How Chinese Are Chinese Language Models? The Puzzling Lack Of Language Policy In China's Llms Wen-yi Andrea W, Jo Unso Eun Seo, Lin Lu Jia, Mimno David
- AGLA: Mitigating Object Hallucinations In Large Vision-language Models With Assembly Of Global And Local Attention An Wenbin, Tian Feng, Leng Sicong, Nie Jiahao, Lin Haonan, Wang Qianying, Dai Guang, Chen Ping, Lu Shijian
- Detecting And Mitigating Hallucination In Large Vision Language Models Via Fine-grained AI Feedback Xiao Wenyi, Huang Ziwei, Gan Leilei, He Wanggui, Li Haoyuan, Yu Zhelun, Jiang Hao, Wu Fei, Zhu Linchao
- Accelerating Production Llms With Combined Token/embedding Speculators Wertheimer Davis, Rosenkranz Joshua, Parnell Thomas, Suneja Sahil, Ranganathan Pavithra, Ganti Raghu, Srivatsa Mudhakar
- Codeultrafeedback: An Llm-as-a-judge Dataset For Aligning Large Language Models To Coding Preferences Weyssow Martin, Kamanda Aton, Sahraoui Houari
- Towards Knowledge-grounded Natural Language Understanding And Generation Whitehouse Chenxi
- Training-free Exponential Extension Of Sliding Window Context With Cascading KV Cache Willette Jeffrey, Lee Heejun, Lee Youngwan, Jeon Myeongjae, Hwang Sung Ju
- Easy Problems That Llms Get Wrong Williams Sean, Huckle James
- Effects Of A Prompt Engineering Intervention On Undergraduate Students' AI Self-efficacy, AI Knowledge And Prompt Engineering Ability: A Mixed Methods Study Woo David James, Wang Deliang, Yung Tim, Guo Kai
- Personalized Large Language Models Woźniak Stanisław, Koptyra Bartłomiej, Janz Arkadiusz, Kazienko Przemysław, Kocoń Jan
- EDA Corpus: A Large Language Model Dataset For Enhanced Interaction With Openroad Wu Bing-yue, Sharma Utsav, Kankipati Sai Rahul Dhanvi, Yadav Ajay, George Bintu Kappil, Guntupalli Sai Ritish, Rovinski Austin, Chhabria Vidya A.
- Llama Pro: Progressive Llama With Block Expansion Wu Chengyue, Gan Yukang, Ge Yixiao, Lu Zeyu, Wang Jiahao, Feng Ye, Shan Ying, Luo Ping
- Performance Law Of Large Language Models Wu Chuhan, Tang Ruiming
- Sparkra: A Retrieval-augmented Knowledge Service System Based On Spark Large Language Model Wu Dayong, Li Jiaqi, Wang Baoxin, Zhao Honghong, Xue Siyuan, Yang Yanjie, Chang Zhijun, Zhang Rui, Qian Li, Wang Bo, Wang Shijin, Zhang Zhixiong, Hu Guoping
- Generative Monoculture In Large Language Models Wu Fan, Black Emily, Chandrasekaran Varun
- Medkp: Medical Dialogue With Knowledge Enhancement And Clinical Pathway Encoding Wu Jiageng, Wu Xian, Zheng Yefeng, Yang Jie
- Visionllm V2: An End-to-end Generalist Multimodal Large Language Model For Hundreds Of Vision-language Tasks Wu Jiannan, Zhong Muyan, Xing Sen, Lai Zeqiang, Liu Zhaoyang, Wang Wenhai, Chen Zhe, Zhu Xizhou, Lu Lewei, Lu Tong, Luo Ping, Qiao Yu, Dai Jifeng
- What Are The Limits Of Cross-lingual Dense Passage Retrieval For Low-resource Languages? Wu Jie, Ren Zhaochun, Verberne Suzan
- GEB-1.3B: Open Lightweight Large Language Model Wu Jie, Zhu Yufeng, Shen Lei, Lu Xuqing
- β-DPO: Direct Preference Optimization With Dynamic β Wu Junkang, Xie Yuexiang, Yang Zhengyi, Wu Jiancan, Gao Jinyang, Ding Bolin, Wang Xiang, He Xiangnan
- Noiseboost: Alleviating Hallucination With Noise Perturbation For Multimodal Large Language Models Wu Kai, Jiang Boyuan, Jiang Zhengkai, He Qingdong, Luo Donghao, Wang Shengzhi, Liu Qingwen, Wang Chengjie
- Adapting Large Language Models For Document-level Machine Translation Wu Minghao, Vu Thuy-trang, Qu Lizhen, Foster George, Haffari Gholamreza
- Mixture-of-skills: Learning To Optimize Data Usage For Fine-tuning Large Language Models Wu Minghao, Vu Thuy-trang, Qu Lizhen, Haffari Gholamreza
- Controlmllm: Training-free Visual Prompt Learning For Multimodal Large Language Models Wu Mingrui, Cai Xinyue, Ji Jiayi, Li Jiale, Huang Oucheng, Luo Gen, Fei Hao, Sun Xiaoshuai, Ji Rongrong
- ITERTL: An Iterative Framework For Fine-tuning Llms For RTL Code Generation Wu Peiyang, Guo Nan, Xiao Xiao, Li Wenming, Ye Xiaochun, Fan Dongrui
- Motionllm: Multimodal Motion-language Learning With Large Language Models Wu Qi, Zhao Yubo, Wang Yifan, Tai Yu-wing, Tang Chi-keung
- GUI Action Narrator: Where And When Did That Action Take Place? Wu Qinchen, Gao Difei, Lin Kevin Qinghong, Wu Zhuoyu, Guo Xiangwu, Li Peiran, Zhang Weichen, Wang Hengxu, Shou Mike Zheng
- Avatar: Optimizing LLM Agents For Tool-assisted Knowledge Retrieval Wu Shirley, Zhao Shiyu, Huang Qian, Huang Kexin, Yasunaga Michihiro, Cao Kaidi, Ioannidis Vassilis N., Subbian Karthik, Leskovec Jure, Zou James
- Videollm-mod: Efficient Video-language Streaming With Mixture-of-depths Vision Computation Wu Shiwei, Chen Joya, Lin Kevin Qinghong, Wang Qimeng, Gao Yan, Xu Qianli, Xu Tong, Hu Yao, Chen Enhong, Shou Mike Zheng
- Semantic Alignment For Multimodal Large Language Models Wu Tao, Li Mengze, Chen Jingyuan, Ji Wei, Lin Wang, Gao Jinyang, Kuang Kun, Zhao Zhou, Wu Fei
- Meta-rewarding Language Models: Self-improving Alignment With Llm-as-a-meta-judge Wu Tianhao, Yuan Weizhe, Golovneva Olga, Xu Jing, Tian Yuandong, Jiao Jiantao, Weston Jason, Sukhbaatar Sainbayar
- Never Miss A Beat: An Efficient Recipe For Context Window Extension Of Large Language Models With Consistent "middle" Enhancement Wu Tong, Zhao Yanpeng, Zheng Zilong
- Continual Learning For Large Language Models: A Survey Wu Tongtong, Luo Linhao, Li Yuan-fang, Pan Shirui, Vu Thuy-trang, Haffari Gholamreza
- Freeva: Offline MLLM As Training-free Video Assistant Wu Wenhao
- Long Context Alignment With Short Instructions And Synthesized Positions Wu Wenhao, Wang Yizhong, Fu Yao, Yue Xiang, Zhu Dawei, Li Sujian
- Do Language Models Plan Ahead For Future Tokens? Wu Wilson, Morris John X., Levine Lionel
- Tablebench: A Comprehensive And Complex Benchmark For Table Question Answering Wu Xianjie, Yang Jian, Chai Linzheng, Zhang Ge, Liu Jiaheng, Du Xinrun, Liang Di, Shu Daixin, Cheng Xianfu, Sun Tianzhen, Niu Guanglin, Li Tongliang, Li Zhoujun
- Transformer-based Causal Language Models Perform Clustering Wu Xinbo, Varshney Lav R.
- Passage-specific Prompt Tuning For Passage Reranking In Question Answering With Large Language Models Wu Xuyang, Peng Zhiyuan, Sai Krishna Sravanthi Rajanala, Wu Hsin-tai, Fang Yi
- VILA-U: A Unified Foundation Model Integrating Visual Understanding And Generation Wu Yecheng, Zhang Zhuoyang, Chen Junyu, Tang Haotian, Li Dacheng, Fang Yunhao, Zhu Ligeng, Xie Enze, Yin Hongxu, Yi Li, Han Song, Lu Yao
- Elicitationgpt: Text Elicitation Mechanisms Via Language Models Wu Yifan, Hartline Jason
- MLDT: Multi-level Decomposition For Complex Long-horizon Robotic Task Planning With Open-source Large Language Model Wu Yike, Zhang Jiatao, Hu Nan, Tang Lanling, Qi Guilin, Shao Jun, Ren Jie, Song Wei
- Difflora: Generating Personalized Low-rank Adaptation Weights With Diffusion Wu Yujia, Shi Yiming, Wei Jiwei, Sun Chengwei, Zhou Yuyang, Yang Yang, Shen Heng Tao
- Inversecoder: Unleashing The Power Of Instruction-tuned Code Llms With Inverse-instruct Wu Yutong, Huang Di, Shi Wenxuan, Wang Wei, Gao Lingzhe, Liu Shihao, Nan Ziyuan, Yuan Kaizhao, Zhang Rui, Zhang Xishan, Du Zidong, Guo Qi, Pu Yewen, Yin Dawei, Hu Xing, Chen Yunji
- Acquiring Clean Language Models From Backdoor Poisoned Datasets By Downscaling Frequency Space Wu Zongru, Zhang Zhuosheng, Cheng Pengzhou, Liu Gongshen
- Imitating Language Via Scalable Inverse Reinforcement Learning Wulfmeier Markus, Bloesch Michael, Vieillard Nino, Ahuja Arun, Bornschein Jorg, Huang Sandy, Sokolov Artem, Barnes Matt, Desjardins Guillaume, Bewley Alex, Bechtle Sarah Maria Elisabeth, Springenberg Jost Tobias, Momchev Nikola, Bachem Olivier, Geist Matthieu, Riedmiller Martin
- In-context Learning Can Re-learn Forbidden Tasks Xhonneux Sophie, Dobre David, Tang Jian, Gidel Gauthier, Sridhar Dhanya
- A Practice Of Post-training On Llama-3 70B With Optimal Selection Of Additional Language Mixture Ratio Xi Ningyuan, Wu Yetao, Fan Kun, Chen Teng, Gu Qingqing, Yu Peng, Qu Jinxian, Liu Chenxi, Jiang Zhonglin, Chen Yong, Ji Luo
- Play To Your Strengths: Collaborative Intelligence Of Conventional Recommender Models And Large Language Models Xi Yunjia, Liu Weiwen, Lin Jianghao, Wu Chuhan, Chen Bo, Tang Ruiming, Zhang Weinan, Yu Yong
- Inverse-q*: Token Level Reinforcement Learning For Aligning Large Language Models Without Preference Data Xia Han, Gao Songyang, Ge Qiming, Xi Zhiheng, Zhang Qi, Huang Xuanjing
- LESS: Selecting Influential Data For Targeted Instruction Tuning Xia Mengzhou, Malladi Sadhika, Gururangan Suchin, Arora Sanjeev, Chen Danqi
- Improving Retrieval Augmented Language Model With Self-reasoning Xia Yuan, Zhou Jingbo, Shi Zhenhui, Chen Jun, Huang Haifeng
- What To Do If Language Models Disagree? Black-box Model Ensembling For Textual And Visual Question Answering Xia Yuxi, Zaporojets Klim, Roth Benjamin
- Pandora: Towards General World Model With Natural Language Actions And Video States Xiang Jiannan, Liu Guangyi, Gu Yi, Gao Qiyue, Ning Yuting, Zha Yuheng, Feng Zeyu, Tao Tianhua, Hao Shibo, Shi Yemin, Liu Zhengzhong, Xing Eric P., Hu Zhiting
- Badchain: Backdoor Chain-of-thought Prompting For Large Language Models Xiang Zhen, Jiang Fengqing, Xiong Zidi, Ramasubramanian Bhaskar, Poovendran Radha, Li Bo
- Guardagent: Safeguard LLM Agents By A Guard Agent Via Knowledge-enabled Reasoning Xiang Zhen, Zheng Linzhi, Li Yanjie, Hong Junyuan, Li Qinbin, Xie Han, Zhang Jiawei, Xiong Zidi, Xie Chulin, Yang Carl, Song Dawn, Li Bo
- Clover: Regressive Lightweight Speculative Decoding With Sequential Knowledge Xiao Bin, Shi Chunan, Nie Xiaonan, Yang Fan, Deng Xiangwei, Su Lei, Chen Weipeng, Cui Bin
- Infllm: Training-free Long-context Extrapolation For Llms With An Efficient Context Memory Xiao Chaojun, Zhang Pengle, Han Xu, Xiao Guangxuan, Lin Yankai, Zhang Zhengyan, Liu Zhiyuan, Sun Maosong
- Rar-b: Reasoning As Retrieval Benchmark Xiao Chenghao, Hudson G Thomas, Moubayed Noura Al
- EMP: Enhance Memory In Data Pruning Xiao Jinying, Li Ping, Nie Jie, Tang Zhe
- Palm2-vadapter: Progressively Aligned Language Model Makes A Strong Vision-language Adapter Xiao Junfei, Xu Zheng, Yuille Alan, Yan Shen, Wang Boyu
- Verbalized Machine Learning: Revisiting Machine Learning With Language Models Xiao Tim Z., Bamler Robert, Schölkopf Bernhard, Liu Weiyang
- Seeing The Image: Prioritizing Visual Correlation By Contrastive Alignment Xiao Xin, Wu Bohong, Wang Jiacong, Li Chunyuan, Zhou Xun, Guo Haoyuan
- Emovit: Revolutionizing Emotion Insights With Visual Instruction Tuning Xie Hongxia, Peng Chu-jun, Tseng Yu-wen, Chen Hung-jen, Hsu Chan-feng, Shuai Hong-han, Cheng Wen-huang
- Advancing Multimodal Large Language Models With Quantization-aware Scale Learning For Efficient Adaptation Xie Jingjing, Zhang Yuxin, Lin Mingbao, Cao Liujuan, Ji Rongrong
- Non-instructional Fine-tuning: Enabling Instruction-following Capabilities In Pre-trained Language Models Without Instruction-following Data Xie Juncheng, Syu Shensian, Lee Hung-yi
- Autoad-zero: A Training-free Framework For Zero-shot Audio Description Xie Junyu, Han Tengda, Bain Max, Nagrani Arsha, Varol Gül, Xie Weidi, Zisserman Andrew
- Me Llama: Foundation Large Language Models For Medical Applications Xie Qianqian, Chen Qingyu, Chen Aokun, Peng Cheng, Hu Yan, Lin Fongci, Peng Xueqing, Huang Jimin, Zhang Jeffrey, Keloth Vipina, Zhou Xinyu, He Huan, Ohno-machado Lucila, Wu Yonghui, Xu Hua, Bian Jiang
- Wanglab At MEDIQA-M3G 2024: Multimodal Medical Answer Generation Using Large Language Models Xie Ronald, Palayew Steven, Toma Augustin, Bader Gary, Wang Bo
- Recall: Membership Inference Via Relative Conditional Log-likelihoods Xie Roy, Wang Junlin, Huang Ruomin, Zhang Minxing, Ge Rong, Pei Jian, Gong Neil Zhenqiang, Dhingra Bhuwan
- Codeshell Technical Report Xie Rui, Zeng Zhengran, Yu Zhuohao, Gao Chang, Zhang Shikun, Ye Wei
- Exploratory Preference Optimization: Harnessing Implicit Q*-approximation For Sample-efficient RLHF Xie Tengyang, Foster Dylan J., Krishnamurthy Akshay, Rosset Corby, Awadallah Ahmed, Rakhlin Alexander
- Gradsafe: Detecting Jailbreak Prompts For Llms Via Safety-critical Gradient Analysis Xie Yueqi, Fang Minghong, Pi Renjie, Gong Neil
- Monte Carlo Tree Search Boosts Reasoning Via Iterative Preference Learning Xie Yuxi, Goyal Anirudh, Zheng Wenyue, Kan Min-yen, Lillicrap Timothy P., Kawaguchi Kenji, Shieh Michael
- LLMAEL: Large Language Models Are Good Context Augmenters For Entity Linking Xin Amy, Qi Yunjia, Yao Zijun, Zhu Fangwei, Zeng Kaisheng, Xu Bin, Hou Lei, Li Juanzi
- Improving Retrieval-augmented Generation In Medicine With Iterative Follow-up Questions Xiong Guangzhi, Jin Qiao, Wang Xiao, Zhang Minjia, Lu Zhiyong, Zhang Aidong
- Converging Paradigms: The Synergy Of Symbolic And Connectionist AI In Llm-empowered Autonomous Agents Xiong Haoyi, Wang Zhiyuan, Li Xuhong, Bian Jiang, Xie Zeke, Mumtaz Shahid, Barnes Laura E.
- Large Language Models Can Learn Temporal Reasoning Xiong Siheng, Payani Ali, Kompella Ramana, Fekri Faramarz
- Building Math Agents With Multi-turn Iterative Preference Learning Xiong Wei, Shi Chengshuai, Shen Jiaming, Rosenberg Aviv, Qin Zhen, Calandriello Daniele, Khalman Misha, Joshi Rishabh, Piot Bilal, Saleh Mohammad, Jin Chi, Zhang Tong, Liu Tianqi
- Watch Every Step! LLM Agent Learning Via Iterative Step-level Process Refinement Xiong Weimin, Song Yifan, Zhao Xiutian, Wu Wenhao, Wang Xun, Wang Ke, Li Cheng, Peng Wei, Li Sujian
- Temporal Scaling Law For Large Language Models Xiong Yizhe, Chen Xiansheng, Ye Xin, Chen Hui, Lin Zijia, Lian Haoran, Su Zhenpeng, Niu Jianwei, Ding Guiguang
- Understanding The Role Of Cross-entropy Loss In Fairly Evaluating Large Language Model-based Recommendation Xu Cong, Zhu Zhangchi, Wang Jun, Wang Jianyong, Zhang Wei
- Aligning Large Language Models Via Fine-grained Supervision Xu Dehong, Qiu Liang, Kim Minseok, Ladhak Faisal, Do Jaeyoung
- Interactive Evolution: A Neural-symbolic Self-training Framework For Large Language Models Xu Fangzhi, Sun Qiushi, Cheng Kanzhi, Liu Jun, Qiao Yu, Wu Zhiyong
- Large Language Models For Education: A Survey Xu Hanyi, Gan Wensheng, Qi Zhenlian, Wu Jiayang, Yu Philip S.
- Contrastive Preference Optimization: Pushing The Boundaries Of LLM Performance In Machine Translation Xu Haoran, Sharaf Amr, Chen Yunmo, Tan Weiting, Shen Lingfeng, Van Durme Benjamin, Murray Kenton, Kim Young Jin
- Rejection Improves Reliability: Training Llms To Refuse Unknown Questions Using RL From Knowledge Feedback Xu Hongshen, Zhu Zichen, Zhang Situo, Ma Da, Fan Shuai, Chen Lu, Yu Kai
- Seamless Language Expansion: Enhancing Multilingual Mastery In Self-supervised Models Xu Jing, Wu Minglin, Wu Xixin, Meng Helen
- Meteora: Multiple-tasks Embedded Lora For Large Language Models Xu Jingwei, Lai Junyu, Huang Yunpeng
- Accelerating Large Language Model Training With Hybrid Gpu-based Compression Xu Lang, Anthony Quentin, Zhou Qinghua, Alnaasan Nawras, Gulhane Radha R., Shafi Aamir, Subramoni Hari, Panda Dhabaleswar K.
- From Introspection To Best Practices: Principled Analysis Of Demonstrations In Multimodal In-context Learning Xu Nan, Wang Fei, Zhang Sheng, Poon Hoifung, Chen Muhao
- Chatqa 2: Bridging The Gap To Proprietary Llms In Long Context And RAG Capabilities Xu Peng, Ping Wei, Wu Xianchao, Xu Chejian, Liu Zihan, Shoeybi Mohammad, Catanzaro Bryan
- Course-correction: Safety Alignment Using Synthetic Preferences Xu Rongwu, Cai Yishuo, Zhou Zhenhong, Gu Renjie, Weng Haiqin, Liu Yan, Zhang Tianwei, Xu Wei, Qiu Han
- Walking In Others' Shoes: How Perspective-taking Guides Large Language Models In Reducing Toxicity And Bias Xu Rongwu, Zhou Zi'an, Zhang Tianwei, Qi Zehan, Yao Su, Xu Ke, Xu Wei, Qiu Han
- MINDECHO: Role-playing Language Agents For Key Opinion Leaders Xu Rui, Lu Dakuan, Tan Xiaoyu, Wang Xintao, Yuan Siyu, Chen Jiangjie, Chu Wei, Xu Yinghui
- Benchmarking Benchmark Leakage In Large Language Models Xu Ruijie, Wang Zengzhi, Fan Run-ze, Liu Pengfei
- Cruxeval-x: A Benchmark For Multilingual Code Reasoning, Understanding And Execution Xu Ruiyang, Cao Jialun, Lu Yaojie, Lin Hongyu, Han Xianpei, He Ben, Cheung Shing-chi, Sun Le
- Unsupervised Information Refinement Training Of Large Language Models For Retrieval-augmented Generation Xu Shicheng, Pang Liang, Yu Mo, Meng Fandong, Shen Huawei, Cheng Xueqi, Zhou Jie
- Is DPO Superior To PPO For LLM Alignment? A Comprehensive Study Xu Shusheng, Fu Wei, Gao Jiaxuan, Ye Wenjie, Liu Weilin, Mei Zhiyu, Wang Guangju, Yu Chao, Wu Yi
- SA-MDKIF: A Scalable And Adaptable Medical Domain Knowledge Injection Framework For Large Language Models Xu Tianhan, Hu Zhe, Chen Ling, Li Bin
- Sayself: Teaching Llms To Express Confidence With Self-reflective Rationales Xu Tianyang, Wu Shujin, Diao Shizhe, Liu Xiaoze, Wang Xingyao, Chen Yangyi, Gao Jing
- Hr-multiwoz: A Task Oriented Dialogue (TOD) Dataset For HR LLM Agent Xu Weijie, Huang Zicheng, Hu Wenxiang, Fang Xi, Cherukuri Rajesh Kumar, Nayyar Naumaan, Malandri Lorenzo, Sengamedu Srinivasan H.
- Can Llms Solve Longer Math Word Problems Better? Xu Xin, Xiao Tong, Chao Zitong, Huang Zhenya, Yang Can, Wang Yang
- Relating The Seemingly Unrelated: Principled Understanding Of Generalization For Generative Models In Arithmetic Reasoning Tasks Xu Xingcheng, Zhao Zibo, Zhang Haipeng, Yang Yanqing
- Generate-on-graph: Treat LLM As Both Agent And KG In Incomplete Knowledge Graph Question Answering Xu Yao, He Shizhu, Chen Jiabei, Wang Zihao, Song Yangqiu, Tong Hanghang, Liu Kang, Zhao Jun
- Chatglm-math: Improving Math Problem-solving In Large Language Models With A Self-critique Pipeline Xu Yifan, Liu Xiao, Liu Xinghan, Hou Zhenyu, Li Yueyan, Zhang Xiaohan, Wang Zihan, Zeng Aohan, Du Zhengxiao, Zhao Wenyi, Tang Jie, Dong Yuxiao
- Libra: Building Decoupled Vision System On Large Language Models Xu Yifan, Yang Xiaoshan, Song Yaguang, Xu Changsheng
- Linkprompt: Natural And Universal Adversarial Attacks On Prompt-based Language Models Xu Yue, Wang Wenjie
- Magpie: Alignment Data Synthesis From Scratch By Prompting Aligned Llms With Nothing Xu Zhangchen, Jiang Fengqing, Niu Luyao, Deng Yuntian, Poovendran Radha, Choi Yejin, Lin Bill Yuchen
- Beyond Perplexity: Multi-dimensional Safety Evaluation Of LLM Compression Xu Zhichao, Gupta Ashim, Li Tao, Bentham Oliver, Srikumar Vivek
- Uncovering Safety Risks Of Large Language Models Through Concept Activation Vector Xu Zhihao, Huang Ruixuan, Chen Changyu, Wang Shuai, Wang Xiting
- Vision-flan: Scaling Human-labeled Tasks In Visual Instruction Tuning Xu Zhiyang, Feng Chao, Shao Rulin, Ashby Trevor, Shen Ying, Jin Di, Cheng Yu, Wang Qifan, Huang Lifu
- Do Large Language Models Have Compositional Ability? An Investigation Into Limitations And Scalability Xu Zhuoyan, Shi Zhenmei, Liang Yingyu
- A Comprehensive Study Of Jailbreak Attack Versus Defense For Large Language Models Xu Zihao, Liu Yi, Deng Gelei, Li Yuekang, Picek Stjepan
- Longvila: Scaling Long-context Visual Language Models For Long Videos Xue Fuzhao, Chen Yukang, Li Dacheng, Hu Qinghao, Zhu Ligeng, Li Xiuyu, Fang Yunhao, Tang Haotian, Yang Shang, Liu Zhijian, He Ethan, Yin Hongxu, Molchanov Pavlo, Kautz Jan, Fan Linxi, Zhu Yuke, Lu Yao, Han Song
- Openmoe: An Early Effort On Open Mixture-of-experts Language Models Xue Fuzhao, Zheng Zian, Fu Yao, Ni Jinjie, Zheng Zangwei, Zhou Wangchunshu, You Yang
- Strengthened Symbol Binding Makes Large Language Models Reliable Multiple-choice Selectors Xue Mengge, Hu Zhenyu, Liu Liqun, Liao Kuo, Li Shuang, Han Honglin, Zhao Meng, Yin Chengguo
- Fine-tuning Large Language Models For Automated Diagnostic Screening Summaries Yadav Manjeet, Sahu Nilesh Kumar, Chaturvedi Mudita, Gupta Snehil, Lone Haroon R
- Medlm: Exploring Language Models For Medical Question Answering Systems Yagnik Niraj, Jhaveri Jay, Sharma Vivek, Pila Gabriel
- LEIA: Facilitating Cross-lingual Knowledge Transfer In Language Models With Entity-based Data Augmentation Yamada Ikuya, Ri Ryokan
- An Empirical Study On Cross-lingual Vocabulary Adaptation For Efficient Language Model Inference Yamaguchi Atsuki, Villavicencio Aline, Aletras Nikolaos
- Vocabulary Expansion For Low-resource Cross-lingual Transfer Yamaguchi Atsuki, Villavicencio Aline, Aletras Nikolaos
- List Items One By One: A New Data Source And Learning Paradigm For Multimodal Llms Yan An, Yang Zhengyuan, Wu Junda, Zhu Wanrong, Yang Jianwei, Li Linjie, Lin Kevin, Wang Jianfeng, Mcauley Julian, Gao Jianfeng, Wang Lijuan
- LTNER: Large Language Model Tagging For Named Entity Recognition With Contextualized Entity Marking Yan Faren, Yu Peng, Chen Xin
- Predicting The Big Five Personality Traits In Chinese Counselling Dialogues Using Large Language Models Yan Yang, Ma Lizhi, Li Anqi, Ma Jingsong, Lan Zhenzhong
- S^3c-math: Spontaneous Step-level Self-correction Makes Large Language Models Better Mathematical Reasoners Yan Yuchen, Jiang Jin, Liu Yang, Cao Yixin, Xu Xin, Zhang Mengdi, Cai Xunliang, Shao Jian
- Exploring The LLM Journey From Cognition To Expression With Linear Representations Yan Yuzi, Li Jialian, Zhang Yipin, Yan Dong
- 3d-properties: Identifying Challenges In DPO And Charting A Path Forward Yan Yuzi, Miao Yibo, Li Jialian, Zhang Yipin, Xie Jian, Deng Zhijie, Yan Dong
- Equipping Transformer With Random-access Reading For Long-context Understanding Yang Chenghao, Yang Zi, Hua Nan
- 3D Vision And Language Pretraining With Large-scale Synthetic Data Yang Dejie, Xu Zhu, Mo Wentao, Chen Qingchao, Huang Siyuan, Liu Yang
- IM-RAG: Multi-round Retrieval-augmented Generation Through Learning Inner Monologues Yang Diji, Rao Jinmeng, Chen Kezhen, Guo Xiaoyuan, Zhang Yawen, Yang Jie, Zhang Yi
- Improving Factuality In Large Language Models Via Decoding-time Hallucinatory And Truthful Comparators Yang Dingkang, Xiao Dongling, Wei Jinjie, Li Mingcheng, Chen Zhaoyu, Li Ke, Zhang Lihua
- Protrain: Efficient LLM Training Via Memory-aware Techniques Yang Hanmei, Zhou Jin, Fu Yao, Wang Xiaoqun, Roane Ramine, Guan Hui, Liu Tongping
- Unveiling The Generalization Power Of Fine-tuned Large Language Models Yang Haoran, Zhang Yumeng, Xu Jiaqi, Lu Hongyuan, Heng Pheng Ann, Lam Wai
- Memory^3: Language Modeling With Explicit Memory Yang Hongkang, Lin Zehao, Wang Wenjin, Wu Hao, Li Zhiyu, Tang Bo, Wei Wenqiang, Wang Jinbo, Tang Zeyun, Song Shichao, Xi Chenyang, Yu Yu, Chen Kai, Xiong Feiyu, Tang Linpeng, E Weinan
- Aligning Llms Through Multi-perspective User Preference Ranking-based Feedback For Programming Question Answering Yang Hongyu, He Liyang, Hou Min, Shen Shuanghong, Li Rui, Hou Jiahui, Ma Jianhui, Zhao Junda
- Large Language Model Tokenizer Bias: A Case Study And Solution On Gpt-4o Yang Jin, Wang Zhiqiang, Lin Yanbin, Zhao Zunduo
- Rethinking Tokenization: Crafting Better Tokenizers For Large Language Models Yang Jinbiao
- No Token Left Behind: Reliable KV Cache Compression Via Importance-aware Mixed Precision Quantization Yang June Yong, Kim Byeongwook, Bae Jeongin, Kwon Beomseok, Park Gunho, Yang Eunho, Kwon Se Jung, Lee Dongsoo
- Metaaligner: Towards Generalizable Multi-objective Alignment Of Language Models Yang Kailai, Liu Zhiwei, Xie Qianqian, Huang Jimin, Zhang Tianlin, Ananiadou Sophia
- If LLM Is The Wizard, Then Code Is The Wand: A Survey On How Code Empowers Large Language Models To Serve As Intelligent Agents Yang Ke, Liu Jiateng, Wu John, Yang Chaoqi, Fung Yi R., Li Sha, Huang Zixuan, Cao Xu, Wang Xingyao, Wang Yiquan, Ji Heng, Zhai Chengxiang
- Item-language Model For Conversational Recommendation Yang Li, Subbiah Anushya, Patel Hardik, Li Judith Yue, Song Yanwei, Mirghaderi Reza, Aggarwal Vikram
- Mastering Text-to-image Diffusion: Recaptioning, Planning, And Generating With Multimodal Llms Yang Ling, Yu Zhaochen, Meng Chenlin, Xu Minkai, Ermon Stefano, Cui Bin
- Regularizing Hidden States Enables Learning Generalizable Reward Model For Llms Yang Rui, Ding Ruomeng, Lin Yong, Zhang Huan, Zhang Tong
- Confidence Calibration And Rationalization For Llms Via Multi-agent Deliberation Yang Ruixin, Rajagopal Dheeraj, Hayati Shirley Anugrah, Hu Bin, Kang Dongyeop
- Mllmreid: Multimodal Large Language Model-based Person Re-identification Yang Shan, Zhang Yongfei
- Video As The New Language For Real-world Decision Making Yang Sherry, Walker Jacob, Parker-holder Jack, Du Yilun, Bruce Jake, Barreto Andre, Abbeel Pieter, Schuurmans Dale
- Moral: Moe Augmented Lora For Llms' Lifelong Learning Yang Shu, Ali Muhammad Asif, Wang Cheng-long, Hu Lijie, Wang Di
- Seed-story: Multimodal Long Story Generation With Large Language Model Yang Shuai, Ge Yuying, Li Yang, Chen Yukang, Ge Yixiao, Shan Ying, Chen Yingcong
- Is Crowdsourcing Breaking Your Bank? Cost-effective Fine-tuning Of Pre-trained Language Models With Proximal Policy Optimization Yang Shuo, Kasneci Gjergji
- Parallelizing Linear Transformers With The Delta Rule Over Sequence Length Yang Songlin, Wang Bailin, Zhang Yu, Shen Yikang, Kim Yoon
- Arithmetic Reasoning With LLM: Prolog Generation & Permutation Yang Xiaocheng, Chen Bingsen, Tam Yik-cheung
- Unleashing The Potential Of Large Language Models For Predictive Tabular Tasks In Data Science Yang Yazheng, Wang Yuqi, Sen Sankalok, Li Lei, Liu Qi
- Corda: Context-oriented Decomposition Adaptation Of Large Language Models Yang Yibo, Li Xiaojie, Zhou Zhongzhu, Song Shuaiwen Leon, Wu Jianlong, Nie Liqiang, Ghanem Bernard
- Adazeta: Adaptive Zeroth-order Tensor-train Adaption For Memory-efficient Large Language Models Fine-tuning Yang Yifan, Zhen Kai, Banijamal Ershad, Mouchtaris Athanasios, Zhang Zheng
- Loretta: Low-rank Economic Tensor-train Adaptation For Ultra-low-parameter Fine-tuning Of Large Language Models Yang Yifan, Zhou Jiajun, Wong Ngai, Zhang Zheng
- Guardt2i: Defending Text-to-image Models From Adversarial Prompts Yang Yijun, Gao Ruiyuan, Yang Xiao, Zhong Jianyuan, Xu Qiang
- P3: A Policy-driven, Pace-adaptive, And Diversity-promoted Framework For Optimizing LLM Training Yang Yingxuan, Wang Huayi, Wen Muning, Zhang Weinan
- Pre-trained Language Models Improve The Few-shot Prompt Ability Of Decision Transformer Yang Yu, Xu Pan
- Weak-to-strong Reasoning Yang Yuqing, Ma Yan, Liu Pengfei
- Fine-tuning Large Language Models With Human-inspired Learning Strategies In Medical Question Answering Yang Yushi, Bean Andrew M., Mccraith Robert, Mahdi Adam
- Recent Advances Of Foundation Language Models-based Continual Learning: A Survey Yang Yutao, Zhou Jie, Ding Xuanwen, Huai Tianyu, Liu Shunyu, Chen Qin, He Liang, Xie Yuan
- Assessing Adversarial Robustness Of Large Language Models: An Empirical Study Yang Zeyu, Meng Zhao, Zheng Xiaochen, Wattenhofer Roger
- Self-distillation Bridges Distribution Gap In Language Model Fine-tuning Yang Zhaorui, Pang Tianyu, Feng Haozhe, Wang Han, Chen Wei, Zhu Minfeng, Liu Qian
- Exploring And Unleashing The Power Of Large Language Models In Automated Code Translation Yang Zhen, Liu Fang, Yu Zhongxing, Keung Jacky Wai, Li Jia, Liu Shuo, Hong Yifan, Ma Xiaoxue, Jin Zhi, Li Ge
- Adversarial Attacks And Defense For Conversation Entailment Task Yang Zhenning, Krawec Ryan, Wu Liang-yuan
- SOS! Soft Prompt Attack Against Open-source Large Language Models Yang Ziqing, Backes Michael, Zhang Yang, Salem Ahmed
- React Meets Actre: When Language Agents Enjoy Training Data Autonomy Yang Zonghan, Li Peng, Yan Ming, Zhang Ji, Huang Fei, Liu Yang
- Curiousllm: Elevating Multi-document QA With Reasoning-infused Knowledge Graph Prompting Yang Zukang, Zhu Zixuan
- An Empirical Analysis Of Compute-optimal Inference For Problem-solving With Language Models Yangzhen Wu, Zhiqing Sun, Shanda Li, Sean Welleck, Yiming Yang
- Data Contamination Can Cross Language Barriers Yao Feng, Zhuang Yufan, Sun Zihao, Xu Sunan, Kumar Animesh, Shang Jingbo
- Dense Connector For Mllms Yao Huanjin, Wu Wenhao, Yang Taojiannan, Song Yuxin, Zhang Mengxi, Feng Haocheng, Sun Yifan, Li Zhiheng, Ouyang Wanli, Wang Jingdong
- Machine Unlearning Of Pre-trained Large Language Models Yao Jin, Chien Eli, Du Minxin, Niu Xinyao, Wang Tianhao, Cheng Zezhou, Yue Xiang
- Exploiting Inter-layer Expert Affinity For Accelerating Mixture-of-experts Model Inference Yao Jinghan, Anthony Quentin, Shafi Aamir, Subramoni Hari, Panda Dhabaleswar K.
- Training Ultra Long Context Language Model With Fully Pipelined Distributed Transformer Yao Jinghan, Jacobs Sam Ade, Tanaka Masahiro, Ruwase Olatunji, Shafi Aamir, Subramoni Hari, Panda Dhabaleswar K.
- Deco: Decoupling Token Compression From Semantic Abstraction In Multimodal Large Language Models Yao Linli, Li Lei, Ren Shuhuai, Wang Lean, Liu Yuanxin, Sun Xu, Hou Lu
- Enhancing In-context Learning Performance With Just Svd-based Weight Pruning: A Theoretical Perspective Yao Xinhao, Hu Xiaolin, Yang Shenzhi, Liu Yong
- Sirllm: Streaming Infinite Retentive LLM Yao Yao, Li Zuchao, Zhao Hai
- Open-domain Implicit Format Control For Large Language Model Generation Yao Yiqun, Ma Wenjia, Fang Xuezhi, Jiang Xin, Li Xiang, Meng Xuying, Han Peng, Li Jing, Sun Aixin, Wang Yequan
- The Impact Of Quantization On Retrieval-augmented Generation: An Analysis Of Small Llms Yazan Mert, Verberne Suzan, Situmeang Frederik
- R^2AG: Incorporating Retrieval Information Into Retrieval Augmented Generation Ye Fuda, Li Shuangyin, Zhang Yongqi, Chen Lei
- Preference-guided Reflective Sampling For Aligning Language Models Ye Hai, Ng Hwee Tou
- Self-judge: Selective Instruction Following With Alignment Self-evaluation Ye Hai, Ng Hwee Tou
- X-VILA: Cross-modality Alignment For Large Language Model Ye Hanrong, Huang De-an, Lu Yao, Yu Zhiding, Ping Wei, Tao Andrew, Kautz Jan, Han Song, Xu Dan, Molchanov Pavlo, Yin Hongxu
- Rotbench: A Multi-level Benchmark For Evaluating The Robustness Of Large Language Models In Tool Learning Ye Junjie, Wu Yilong, Gao Songyang, Huang Caishuang, Li Sixian, Li Guanyu, Fan Xiaoran, Zhang Qi, Gui Tao, Huang Xuanjing
- Dataframe QA: A Universal LLM Framework On Dataframe Question Answering Without Data Exposure Ye Junyi, Du Mengnan, Wang Guiling
- Spactor-t5: Pre-training T5 Models With Span Corruption And Replaced Token Detection Ye Ke, Jiang Heinrich, Rostamizadeh Afshin, Chakrabarti Ayan, Desalvo Giulia, Kagy Jean-françois, Karydas Lazaros, Citovsky Gui, Kumar Sanjiv
- Efficient Vision-and-language Pre-training With Text-relevant Image Patch Selection Ye Wei, Jiang Chaoya, Xu Haiyang, Ye Chenhao, Li Chenliang, Yan Ming, Zhang Shikun, Huang Songhang, Huang Fei
- Harnessing Multimodal Large Language Models For Multimodal Sequential Recommendation Ye Yuyang, Zheng Zhi, Shen Yishan, Wang Tianshu, Zhang Hengruo, Zhu Peijun, Yu Runlong, Zhang Kai, Xiong Hui
- BEAF: Observing Before-after Changes To Evaluate Hallucination In Vision-language Models Ye-bin Moon, Hyeon-woo Nam, Choi Wonseok, Oh Tae-hyun
- Self-training Large Language Models Through Knowledge Detection Yeo Wei Jie, Ferdinan Teddy, Kazienko Przemyslaw, Satapathy Ranjan, Cambria Erik
- Towards Fast Multilingual LLM Inference: Speculative Decoding And Specialized Drafters Yi Euiin, Kim Taehyeon, Jeung Hongseok, Chang Du-seong, Yun Se-young
- Generation Meets Verification: Accelerating Large Language Model Inference With Smart Parallel Auto-correct Decoding Yi Hanling, Lin Feng, Li Hongbin, Ning Peiyang, Yu Xiaotian, Xiao Rong
- Fine-grained Detoxification Via Instance-level Prefixes For Large Language Models Yi Xin, Wang Linlin, Wang Xiaoling, He Liang
- Debiasing Multimodal Large Language Models Yi-fan Zhang, Weichen Yu, Qingsong Wen, Xue Wang, Zhang Zhang, Liang Wang, Rong Jin, Tieniu Tan
- MEND: Meta Demonstration Distillation For Efficient And Effective In-context Learning Yichuan Li, Xiyao Ma, Sixing Lu, Kyumin Lee, Xiaohu Liu, Chenlei Guo
- Characterizing Truthfulness In Large Language Model Generations With Local Intrinsic Dimension Yin Fan, Srinivasa Jayanth, Chang Kai-wei
- Lofit: Localized Fine-tuning On LLM Representations Yin Fangcong, Ye Xi, Durrett Greg
- Mumath-code: Combining Tool-use Large Language Models With Multi-perspective Data Augmentation For Mathematical Reasoning Yin Shuo, You Weihao, Ji Zhilong, Zhong Guoqiang, Bai Jinfeng
- SEA: Supervised Embedding Alignment For Token-level Visual-textual Integration In Mllms Yin Yuanyang, Zhao Yaqi, Zhang Yajie, Lin Ke, Wang Jiahao, Tao Xin, Wan Pengfei, Zhang Di, Yin Baoqun, Zhang Wentao
- Relative Preference Optimization: Enhancing LLM Alignment Through Contrasting Responses Across Identical And Diverse Prompts Yin Yueqin, Wang Zhendong, Gu Yi, Huang Hai, Chen Weizhu, Zhou Mingyuan
- Aggregation Of Reasoning: A Hierarchical Framework For Enhancing Answer Selection In Large Language Models Yin Zhangyue, Sun Qiushi, Guo Qipeng, Zeng Zhiyuan, Li Xiaonan, Sun Tianxiang, Chang Cheng, Cheng Qinyuan, Wang Ding, Mou Xiaofeng, Qiu Xipeng, Huang Xuanjing
- Vqattack: Transferable Adversarial Attacks On Visual Question Answering Via Pre-trained Models Yin Ziyi, Ye Muchao, Zhang Tianrong, Wang Jiaqi, Liu Han, Chen Jinghui, Wang Ting, Ma Fenglong
- Internlm-math: Open Math Large Language Models Toward Verifiable Reasoning Ying Huaiyuan, Zhang Shuo, Li Linyang, Zhou Zhejian, Shao Yunfan, Fei Zhaoye, Ma Yichuan, Hong Jiawei, Liu Kuikun, Wang Ziyi, Wang Yudong, Wu Zijian, Li Shuaibin, Zhou Fengzhe, Liu Hongwei, Zhang Songyang, Zhang Wenwei, Yan Hang, Qiu Xipeng, Wang Jiayu, Chen Kai, Lin Dahua
- Llms-as-instructors: Learning From Errors Toward Automating Model Improvement Ying Jiahao, Lin Mingbao, Cao Yixin, Tang Wei, Wang Bo, Sun Qianru, Huang Xuanjing, Yan Shuicheng
- Safety Fine-tuning At (almost) No Cost: A Baseline For Vision Large Language Models Yongshuo Zong, Ondrej Bohdal, Tingyang Yu, Yongxin Yang, Timothy Hospedales
- Langbridge: Multilingual Reasoning Without Multilingual Supervision Yoon Dongkeun, Jang Joel, Kim Sungdong, Kim Seungone, Shafayat Sheikh, Seo Minjoon
- C-TPT: Calibrated Test-time Prompt Tuning For Vision-language Models Via Text Feature Dispersion Yoon Hee Suk, Yoon Eunseop, Tee Joshua Tian Jin, Hasegawa-johnson Mark, Li Yingzhen, Yoo Chang D.
- Tree-planted Transformers: Unidirectional Transformer Language Models With Implicit Syntactic Supervision Yoshida Ryo, Someya Taiga, Oseki Yohei
- When Linear Attention Meets Autoregressive Decoding: Towards More Effective And Efficient Linearized Large Language Models You Haoran, Fu Yichao, Wang Zheng, Yazdanbakhsh Amir, Lin Yingyan Celine
- Shiftaddllm: Accelerating Pretrained Llms Via Post-training Multiplication-less Reparameterization You Haoran, Guo Yipin, Fu Yichao, Zhou Wei, Shi Huihong, Zhang Xiaofan, Kundu Souvik, Yazdanbakhsh Amir, Lin Yingyan Celine
- Ferret-ui: Grounded Mobile UI Understanding With Multimodal Llms You Keen, Zhang Haotian, Schoop Eldon, Weers Floris, Swearngin Amanda, Nichols Jeffrey, Yang Yinfei, Gan Zhe
- Llasmol: Advancing Large Language Models For Chemistry With A Large-scale, Comprehensive, High-quality Instruction Tuning Dataset Yu Botao, Baker Frazier N., Chen Ziqi, Ning Xia, Sun Huan
- Siam: Self-improving Code-assisted Mathematical Reasoning Of Large Language Models Yu Dian, Peng Baolin, Tian Ye, Song Linfeng, Mi Haitao, Yu Dong
- Flow Of Reasoning: Efficient Training Of LLM Policy With Divergent Thinking Yu Fangxu, Jiang Lai, Kang Haoqiang, Hao Shibo, Qin Lianhui
- Iterative Graph Alignment Yu Fangyuan, Arora Hardeep Singh, Johnson Matt
- Enhancing Healthcare Through Large Language Models: A Study On Medical Question Answering Yu Haoran, Yu Chang, Wang Zihan, Zou Dongxian, Qin Hao
- Distilling System 2 Into System 1 Yu Ping, Xu Jing, Weston Jason, Kulikov Ilia
- Ra-rec: An Efficient ID Representation Alignment Framework For Llm-based Recommendation Yu Xiaohan, Zhang Li, Zhao Xin, Wang Yue, Ma Zhongrui
- Neeko: Leveraging Dynamic Lora For Efficient Multi-character Role-playing Agent Yu Xiaoyan, Luo Tongxu, Wei Yifan, Lei Fangyu, Huang Yiming, Peng Hao, Zhu Liehuang
- BEYOND DIALOGUE: A Profile-dialogue Alignment Framework Towards General Role-playing Language Model Yu Yeyong, Yu Runsheng, Wei Haojie, Zhang Zhanqiu, Qian Quan
- Rankrag: Unifying Context Ranking With Retrieval-augmented Generation In Llms Yu Yue, Ping Wei, Liu Zihan, Wang Boxin, You Jiaxuan, Zhang Chao, Shoeybi Mohammad, Catanzaro Bryan
- Unveiling And Harnessing Hidden Attention Sinks: Enhancing Large Language Models Without Training Through Attention Calibration Yu Zhongzhi, Wang Zheng, Fu Yonggan, Shi Huihong, Shaikh Khalid, Lin Yingyan Celine
- Kieval: A Knowledge-grounded Interactive Evaluation Framework For Large Language Models Yu Zhuohao, Gao Chang, Yao Wenjin, Wang Yidong, Ye Wei, Wang Jindong, Xie Xing, Zhang Yue, Zhang Shikun
- Whispers That Shake Foundations: Analyzing And Mitigating False Premise Hallucinations In Large Language Models Yuan Hongbang, Cao Pengfei, Jin Zhuoran, Chen Yubo, Zeng Daojian, Liu Kang, Zhao Jun
- Towards A Holistic Evaluation Of Llms On Factual Knowledge Recall Yuan Jiaqing, Pan Lin, Hang Chung-wei, Guo Jiang, Jiang Jiarong, Min Bonan, Ng Patrick, Wang Zhiguo
- Advancing LLM Reasoning Generalists With Preference Trees Yuan Lifan, Cui Ganqu, Wang Hanbin, Ding Ning, Wang Xingyao, Deng Jia, Shan Boji, Chen Huimin, Xie Ruobing, Lin Yankai, Liu Zhenghao, Zhou Bowen, Peng Hao, Liu Zhiyuan, Sun Maosong
- Focused Large Language Models Are Stable Many-shot Learners Yuan Peiwen, Feng Shaoxiong, Li Yiwei, Wang Xinglin, Zhang Yueqi, Tan Chuyi, Pan Boyuan, Wang Heda, Hu Yao, Li Kan
- Chatmusician: Understanding And Generating Music Intrinsically With LLM Yuan Ruibin, Lin Hanfeng, Wang Yi, Tian Zeyue, Wu Shangda, Shen Tianhao, Zhang Ge, Wu Yuhang, Liu Cong, Zhou Ziya, Ma Ziyang, Xue Liumeng, Wang Ziyu, Liu Qin, Zheng Tianyu, Li Yizhi, Ma Yinghao, Liang Yiming, Chi Xiaowei, Liu Ruibo, Wang Zili, Li Pengfei, Wu Jingcheng, Lin Chenghua, Liu Qifeng, Jiang Tao, Huang Wenhao, Chen Wenhu, Benetos Emmanouil, Fu Jie, Xia Gus, Dannenberg Roger, Xue Wei, Kang Shiyin, Guo Yike
- Why Lift So Heavy? Slimming Large Language Models By Cutting Off The Layers Yuan Shuzhou, Nie Ercong, Ma Bolei, Färber Michael
- Self-rewarding Language Models Yuan Weizhe, Pang Richard Yuanzhe, Cho Kyunghyun, Li Xian, Sukhbaatar Sainbayar, Xu Jing, Weston Jason
- CO3: Low-resource Contrastive Co-training For Generative Conversational Query Rewrite Yuan Yifei, Shi Chen, Wang Runze, Chen Liyi, Hu Renjun, Zhang Zengming, Jiang Feijun, Lam Wai
- Refuse Whenever You Feel Unsafe: Improving Safety In Llms Via Decoupled Refusal Training Yuan Youliang, Jiao Wenxiang, Wang Wenxuan, Huang Jen-tse, Xu Jiahao, Liang Tian, He Pinjia, Tu Zhaopeng
- Rigorllm: Resilient Guardrails For Large Language Models Against Undesired Content Yuan Zhuowen, Xiong Zidi, Zeng Yi, Yu Ning, Jia Ruoxi, Song Dawn, Li Bo
- Tcmbench: A Comprehensive Benchmark For Evaluating Large Language Models In Traditional Chinese Medicine Yue Wenjing, Wang Xiaoling, Zhu Wei, Guan Ming, Zheng Huanran, Wang Pengfei, Sun Changzhi, Ma Xin
- Mammoth2: Scaling Instructions From The Web Yue Xiang, Zheng Tuney, Zhang Ge, Chen Wenhu
- Distilling Instruction-following Abilities Of Large Language Models With Task-aware Curriculum Planning Yue Yuanhao, Wang Chengyu, Huang Jun, Wang Peng
- Toward Inference-optimal Mixture-of-expert Large Language Models Yun Longfei, Zhuang Yonghao, Fu Yao, Xing Eric P, Zhang Hao
- Prompting Implicit Discourse Relation Annotation Yung Frances, Ahmad Mansoor, Scholman Merel, Demberg Vera
- SPOT: Text Source Prediction From Originality Score Thresholding Yvinec Edouard, Kasser Gabriel
- Investigating Continual Pretraining In Large Language Models: Insights And Implications Yıldız Çağatay, Ravichandran Nishaanth Kanna, Punia Prishruit, Bethge Matthias, Ermis Beyza
- Envgen: Generating And Adapting Environments Via Llms For Training Embodied Agents Zala Abhay, Cho Jaemin, Lin Han, Yoon Jaehong, Bansal Mohit
- Generating Unseen Code Tests In Infinitum Zalmanovici Marcel, Raz Orna, Farchi Eitan, Freund Iftach
- Building Accurate Translation-tailored Llms With Language Aware Instruction Tuning Zan Changtong, Ding Liang, Shen Li, Zhen Yibing, Liu Weifeng, Tao Dacheng
- Understanding And Mitigating Compositional Issues In Text-to-image Generative Models Zarei Arman, Rezaei Keivan, Basu Samyadeep, Saberi Mehrdad, Moayeri Mazda, Kattakinda Priyatham, Feizi Soheil
- Tell Me Why: Explainable Public Health Fact-checking With Large Language Models Zarharan Majid, Wullschleger Pascal, Kia Babak Behkam, Pilehvar Mohammad Taher, Foster Jennifer
- ABQ-LLM: Arbitrary-bit Quantized Inference Acceleration For Large Language Models Zeng Chao, Liu Songwei, Xie Yusheng, Liu Hong, Wang Xiaojian, Wei Miao, Yang Shu, Chen Fangmin, Mei Xing
- Improving Logits-based Detector Without Logits From Black-box Llms Zeng Cong, Tang Shengkun, Yang Xianjun, Chen Yuanzhou, Sun Yiyou, Xu Zhiqiang, Li Yao, Chen Haifeng, Cheng Wei, Xu Dongkuan
- Multilingual Brain Surgeon: Large Language Models Can Be Compressed Leaving No Language Behind Zeng Hongchuan, Xu Hongshen, Chen Lu, Yu Kai
- Divtod: Unleashing The Power Of Llms For Diversifying Task-oriented Dialogue Representations Zeng Weihao, Fu Dayuan, He Keqing, Wang Yejie, Xu Yukai, Xu Weiran
- Advancing Multimodal Large Language Models In Chart Question Answering With Visualization-referenced Instruction Tuning Zeng Xingchen, Lin Haichuan, Ye Yilin, Zeng Wei
- Token-level Direct Preference Optimization Zeng Yongcheng, Liu Guoqing, Ma Weiyu, Yang Ning, Zhang Haifeng, Wang Jun
- DELIA: Diversity-enhanced Learning For Instruction Adaptation In Large Language Models Zeng Yuanhao, Ren Fei, Zhou Xinpeng, Wang Yihang, Shao Yingxia
- Can Mllms Perform Text-to-image In-context Learning? Zeng Yuchen, Kang Wonjun, Chen Yicong, Koo Hyung Il, Lee Kangwook
- Learning Reward For Robot Skills Using Large Language Models Via Self-alignment Zeng Yuwei, Mu Yao, Shao Lin
- Actions Speak Louder Than Words: Trillion-parameter Sequential Transducers For Generative Recommendations Zhai Jiaqi, Liao Lucy, Liu Xing, Wang Yueming, Li Rui, Cao Xuan, Gao Leon, Gong Zhaojie, Gu Fangda, He Michael, Lu Yinghai, Shi Yu
- Anygpt: Unified Multimodal LLM With Discrete Sequence Modeling Zhan Jun, Dai Junqi, Ye Jiasheng, Zhou Yunhua, Zhang Dong, Liu Zhigeng, Zhang Xin, Yuan Ruibin, Zhang Ge, Li Linyang, Yan Hang, Fu Jie, Gui Tao, Sun Tianxiang, Jiang Yugang, Qiu Xipeng
- Prefix Text As A Yarn: Eliciting Non-english Alignment In Foundation Language Model Zhan Runzhe, Yang Xinyi, Wong Derek F., Chao Lidia S., Zhang Yue
- MC-GPT: Empowering Vision-and-language Navigation With Memory Map And Reasoning Chains Zhan Zhaohuan, Yu Lisha, Yu Sijie, Tan Guang
- When Scaling Meets LLM Finetuning: The Effect Of Data, Model And Finetuning Method Zhang Biao, Liu Zhongtao, Cherry Colin, Firat Orhan
- Simple Techniques For Enhancing Sentence Embeddings In Generative Language Models Zhang Bowen, Chang Kehua, Li Chunping
- Notellm-2: Multimodal Large Representation Models For Recommendation Zhang Chao, Zhang Haoxin, Wu Shiwei, Wu Di, Xu Tong, Gao Yan, Hu Yao, Chen Enhong
- Teaching Large Language Models An Unseen Language On The Fly Zhang Chen, Liu Xiao, Lin Jiuheng, Feng Yansong
- Ts-align: A Teacher-student Collaborative Framework For Scalable Iterative Finetuning Of Large Language Models Zhang Chen, Tang Chengguang, Chong Dading, Shi Ke, Tang Guohua, Jiang Feng, Li Haizhou
- Causal Prompting: Debiasing Large Language Model Prompting Based On Front-door Adjustment Zhang Congzhi, Zhang Linhai, Wu Jialong, Zhou Deyu, He Yulan
- Sciglm: Training Scientific Language Models With Self-reflective Instruction Annotation And Tuning Zhang Dan, Hu Ziniu, Zhoubian Sining, Du Zhengxiao, Yang Kaiyu, Wang Zihan, Yue Yisong, Dong Yuxiao, Tang Jie
- Rest-mcts*: LLM Self-training Via Process Reward Guided Tree Search Zhang Dan, Zhoubian Sining, Hu Ziniu, Yue Yisong, Dong Yuxiao, Tang Jie
- Code Representation Learning At Scale Zhang Dejiao, Ahmad Wasi, Tan Ming, Ding Hantian, Nallapati Ramesh, Roth Dan, Ma Xiaofei, Xiang Bing
- Mm-llms: Recent Advances In Multimodal Large Language Models Zhang Duzhen, Yu Yahan, Dong Jiahua, Li Chenxing, Su Dan, Chu Chenhui, Yu Dong
- Unforgettable Generalization In Language Models Zhang Eric, Choshen Leshem, Andreas Jacob
- Map-neo: Highly Capable And Transparent Bilingual Large Language Model Series Zhang Ge, Qu Scott, Liu Jiaheng, Zhang Chenchen, Lin Chenghua, Yu Chou Leuang, Pan Danny, Cheng Esther, Liu Jie, Lin Qunshu, Yuan Raven, Zheng Tuney, Pang Wei, Du Xinrun, Liang Yiming, Ma Yinghao, Li Yizhi, Ma Ziyang, Lin Bill, Benetos Emmanouil, Yang Huan, Zhou Junting, Ma Kaijing, Liu Minghao, Niu Morry, Wang Noah, Que Quehry, Liu Ruibo, Liu Sine, Guo Shawn, Gao Soren, Zhou Wangchunshu, Zhang Xinyue, Zhou Yizhi, Wang Yubo, Bai Yuelin, Zhang Yuhan, Zhang Yuxiang, Wang Zenith, Yang Zhenzhu, Zhao Zijian, Zhang Jiajun, Ouyang Wanli, Huang Wenhao, Chen Wenhu
- Closing The Gap Between Open-source And Commercial Large Language Models For Medical Evidence Summarization Zhang Gongbo, Jin Qiao, Zhou Yiliang, Wang Song, Idnay Betina R., Luo Yiming, Park Elizabeth, Nestor Jordan G., Spotnitz Matthew E., Soroush Ali, Campion Thomas, Lu Zhiyong, Weng Chunhua, Peng Yifan
- Sinklora: Enhanced Efficiency And Chat Capabilities For Long-context Large Language Models Zhang Hengyu
- Balancing Speciality And Versatility: A Coarse To Fine Framework For Supervised Fine-tuning Large Language Model Zhang Hengyuan, Wu Yanru, Li Dawei, Yang Sak, Zhao Rui, Jiang Yong, Tan Fei
- A Careful Examination Of Large Language Model Performance On Grade School Arithmetic Zhang Hugh, Da Jeff, Lee Dean, Robinson Vaughn, Wu Catherine, Song Will, Zhao Tiffany, Raja Pranav, Slack Dylan, Lyu Qin, Hendryx Sean, Kaplan Russell, Lunati Michele, Yue Summer
- Kb-plugin: A Plug-and-play Framework For Large Language Models To Induce Programs Over Low-resourced Knowledge Bases Zhang Jiajie, Cao Shulin, Hu Linmei, Feng Ling, Hou Lei, Li Juanzi
- Agentohana: Design Unified Data And Training Pipeline For Effective Agent Learning Zhang Jianguo, Lan Tian, Murthy Rithesh, Liu Zhiwei, Yao Weiran, Tan Juntao, Hoang Thai, Yang Liangwei, Feng Yihao, Liu Zuxin, Awalgaonkar Tulika, Niebles Juan Carlos, Savarese Silvio, Heinecke Shelby, Wang Huan, Xiong Caiming
- Stealthy Attack On Large Language Model Based Recommendation Zhang Jinghao, Liu Yuting, Liu Qiang, Wu Shu, Guo Guibing, Wang Liang
- Regurgitative Training: The Value Of Real Data In Training Large Language Models Zhang Jinghui, Qiao Dandan, Yang Mochen, Wei Qiang
- Verifiable By Design: Aligning Language Models To Quote From Pre-training Data Zhang Jingyu, Marone Marc, Li Tianjian, Van Durme Benjamin, Khashabi Daniel
- Onegen: Efficient One-pass Unified Generation And Retrieval For Llms Zhang Jintian, Peng Cheng, Sun Mengshu, Chen Xiang, Liang Lei, Zhang Zhiqiang, Zhou Jun, Chen Huajun, Zhang Ningyu
- Android In The Zoo: Chain-of-action-thought For GUI Agents Zhang Jiwen, Wu Jihao, Teng Yihua, Liao Minghui, Xu Nuo, Xiao Xiao, Wei Zhongyu, Tang Duyu
- Unimot: Unified Molecule-text Language Model With Discrete Token Representation Zhang Juzheng, Bian Yatao, Chen Yongqiang, Yao Quanming
- Fast And Slow Generating: An Empirical Study On Large And Small Language Models Collaborative Decoding Zhang Kaiyan, Wang Jianyu, Ding Ning, Qi Biqing, Hua Ermo, Lv Xingtai, Zhou Bowen
- Hire A Linguist!: Learning Endangered Languages With In-context Linguistic Descriptions Zhang Kexun, Choi Yee Man, Song Zhenqiao, He Taiqi, Wang William Yang, Li Lei
- Exploring The Best Practices Of Query Expansion With Large Language Models Zhang Le, Wu Yihong, Yang Qian, Nie Jian-yun
- Harmonized Speculative Sampling Zhang Lefan, Wang Xiaodan, Huang Yanhua, Xu Ruiwen
- Predicting Learning Performance With Large Language Models: A Study In Adult Literacy Zhang Liang, Lin Jionghao, Borchers Conrad, Sabatini John, Hollander John, Cao Meng, Hu Xiangen
- ARL2: Aligning Retrievers For Black-box Large Language Models Via Self-guided Adaptive Relevance Labeling Zhang Lingxi, Yu Yue, Wang Kuan, Zhang Chao
- HARE: Human Priors, A Key To Small Language Model Efficiency Zhang Lingyun, Jin Bin, Ge Gaojian, Liu Lunhui, Shen Xuewen, Wu Mingyong, Zhang Houqian, Jiang Yongneng, Chen Shiqi, Pu Shi
- Generative Verifiers: Reward Modeling As Next-token Prediction Zhang Lunjun, Hosseini Arian, Bansal Hritik, Kazemi Mehran, Kumar Aviral, Agarwal Rishabh
- The Hedgehog & The Porcupine: Expressive Linear Attentions With Softmax Mimicry Zhang Michael, Bhatia Kush, Kumbong Hermann, Ré Christopher
- Transfertod: A Generalizable Chinese Multi-domain Task-oriented Dialogue System With Transfer Capabilities Zhang Ming, Huang Caishuang, Wu Yilong, Liu Shichun, Zheng Huiyuan, Dong Yurui, Shen Yujiong, Dou Shihan, Zhao Jun, Ye Junjie, Zhang Qi, Gui Tao, Huang Xuanjing
- Soaring From 4K To 400K: Extending Llm's Context With Activation Beacon Zhang Peitian, Liu Zheng, Xiao Shitao, Shao Ninglu, Ye Qiwei, Dou Zhicheng
- Extending Llama-3's Context Ten-fold Overnight Zhang Peitian, Shao Ninglu, Liu Zheng, Xiao Shitao, Qian Hongjin, Ye Qiwei, Dou Zhicheng
- Look Ahead Or Look Around? A Theoretical Comparison Between Autoregressive And Masked Pretraining Zhang Qi, Du Tianqi, Huang Haotian, Wang Yifei, Wang Yisen
- RECOST: External Knowledge Guided Data-efficient Instruction Tuning Zhang Qi, Zhang Yiming, Wang Haobo, Zhao Junbo
- VAR-CLIP: Text-to-image Generator With Visual Auto-regressive Modeling Zhang Qian, Dai Xiangzi, Yang Ninghua, An Xiang, Feng Ziyong, Ren Xingyu
- Offline Training Of Language Model Agents With Functions As Learnable Weights Zhang Shaokun, Zhang Jieyu, Liu Jiale, Song Linxin, Wang Chi, Krishna Ranjay, Wu Qingyun
- Improving Diversity Of Commonsense Generation By Large Language Models Via In-context Learning Zhang Tianhui, Peng Bei, Bollegala Danushka
- RAFT: Adapting Language Model To Domain Specific RAG Zhang Tianjun, Patil Shishir G., Jain Naman, Shen Sheng, Zaharia Matei, Stoica Ion, Gonzalez Joseph E.
- Promptfix: Few-shot Backdoor Removal Via Adversarial Prompt Tuning Zhang Tianrong, Xi Zhaohan, Wang Ting, Mitra Prasenjit, Chen Jinghui
- Revisiting OPRO: The Limitations Of Small-scale Llms As Optimizers Zhang Tuo, Yuan Jinyue, Avestimehr Salman
- Interpreting And Improving Large Language Models In Arithmetic Calculation Zhang Wei, Wan Chaoqun, Zhang Yonggang, Cheung Yiu-ming, Tian Xinmei, Shen Xu, Ye Jieping
- Defining Boundaries: A Spectrum Of Task Feasibility For Large Language Models Zhang Wenbo, Xu Zihang, Cai Hengrui
- Agent-pro: Learning To Evolve Via Policy-level Reflection And Optimization Zhang Wenqi, Tang Ke, Wu Hai, Wang Mengna, Shen Yongliang, Hou Guiyang, Tan Zeqi, Li Peng, Zhuang Yueting, Lu Weiming
- Seallms 3: Open Foundation And Chat Multilingual Large Language Models For Southeast Asian Languages Zhang Wenxuan, Chan Hou Pong, Zhao Yiran, Aljunied Mahani, Wang Jianyu, Liu Chaoqun, Deng Yue, Hu Zhiqiang, Xu Weiwen, Chia Yew Ken, Li Xin, Bing Lidong
- Llms Instruct Llms: An Extraction And Editing Method Zhang Xin, Ju Tianjie, Liang Huijia, Fu Ying, Zhang Qin
- Chain Of Preference Optimization: Improving Chain-of-thought Reasoning In Llms Zhang Xuan, Du Chao, Pang Tianyu, Liu Qian, Gao Wei, Lin Min
- Reinforcement Retrieval Leveraging Fine-grained Feedback For Fact Checking News Claims With Black-box LLM Zhang Xuan, Gao Wei
- Finercut: Finer-grained Interpretable Layer Pruning For Large Language Models Zhang Yang, Li Yawei, Wang Xinpeng, Shen Qianli, Plank Barbara, Bischl Bernd, Rezaei Mina, Kawaguchi Kenji
- Pybench: Evaluating LLM Agent On Various Real-world Coding Tasks Zhang Yaolun, Pan Yinxu, Wang Yudong, Cai Jie
- Pushing The Limit Of LLM Capacity For Text Classification Zhang Yazhou, Wang Mengyao, Ren Chenyu, Li Qiuchi, Tiwari Prayag, Wang Benyou, Qin Jing
- Rankclip: Ranking-consistent Language-image Pretraining Zhang Yiming, Zhao Zhuokai, Chen Zhaorun, Feng Zhili, Ding Zenghui, Sun Yining
- Can LLM Graph Reasoning Generalize Beyond Pattern Memorization? Zhang Yizhuo, Wang Heng, Feng Shangbin, Tan Zhaoxuan, Han Xiaochuang, He Tianxing, Tsvetkov Yulia
- Mg-verilog: Multi-grained Dataset Towards Enhanced Llm-assisted Verilog Generation Zhang Yongan, Yu Zhongzhi, Fu Yonggan, Wan Cheng, Lin Yingyan Celine
- Knowledge Overshadowing Causes Amalgamated Hallucination In Large Language Models Zhang Yuji, Li Sha, Liu Jiateng, Yu Pengfei, Fung Yi R., Li Jing, Li Manling, Ji Heng
- OT-VP: Optimal Transport-guided Visual Prompting For Test-time Adaptation Zhang Yunbei, Mehra Akshay, Hamm Jihun
- Small Language Models Need Strong Verifiers To Self-correct Reasoning Zhang Yunxiang, Khalifa Muhammad, Logeswaran Lajanugen, Kim Jaekyeom, Lee Moontae, Lee Honglak, Wang Lu
- Adam-mini: Use Fewer Learning Rates To Gain More Zhang Yushun, Chen Congliang, Li Ziniu, Ding Tian, Wu Chenwei, Ye Yinyu, Luo Zhi-quan, Sun Ruoyu
- Toolbehonest: A Multi-level Hallucination Diagnostic Benchmark For Tool-augmented Large Language Models Zhang Yuxiang, Chen Jing, Wang Junjie, Liu Yaxin, Yang Cheng, Shi Chufan, Zhu Xinyu, Lin Zihao, Wan Hanwen, Yang Yujiu, Sakai Tetsuya, Feng Tian, Yamana Hayato
- Understanding The Relationship Between Prompts And Response Uncertainty In Large Language Models Zhang Ze Yu, Verma Arun, Doshi-velez Finale, Low Bryan Kian Hsiang
- Diversifying The Expert Knowledge For Task-agnostic Pruning In Sparse Mixture-of-experts Zhang Zeliang, Liu Xiaodong, Cheng Hao, Xu Chenliang, Gao Jianfeng
- Get Confused Cautiously: Textual Sequence Memorization Erasure With Selective Entropy Maximization Zhang Zhaohan, Liu Ziquan, Patras Ioannis
- Position-aware Parameter Efficient Fine-tuning Approach For Reducing Positional Bias In Llms Zhang Zheng, Yang Fan, Jiang Ziyan, Chen Zheng, Zhao Zhengyang, Ma Chengyuan, Zhao Liang, Liu Yang
- Quantized Side Tuning: Fast And Memory-efficient Tuning Of Quantized Large Language Models Zhang Zhengxin, Zhao Dan, Miao Xupeng, Oliaro Gabriele, Li Qing, Jiang Yong, Jia Zhihao
- Found In The Middle: How Language Models Use Long Contexts Better Via Plug-and-play Positional Encoding Zhang Zhenyu, Chen Runjin, Liu Shiwei, Yao Zhewei, Ruwase Olatunji, Chen Beidi, Wu Xiaoxia, Wang Zhangyang
- Safe Unlearning: A Surprisingly Effective And Generalizable Solution To Defend Against Jailbreak Attacks Zhang Zhexin, Yang Junxiao, Ke Pei, Cui Shiyao, Zheng Chujie, Wang Hongning, Huang Minlie
- Preserving Knowledge In Large Language Model With Model-agnostic Self-decompression Zhang Zilun, Sun Yutao, Zhao Tiancheng, Sha Leigang, Xu Ruochen, Lee Kyusong, Yin Jianwei
- Galla: Graph Aligned Large Language Models For Improved Source Code Understanding Zhang Ziyin, Yu Hang, Li Shijie, Di Peng, Li Jianguo, Wang Rui
- Reverse That Number! Decoding Order Matters In Arithmetic Learning Zhang-li Daniel, Lin Nianyi, Yu Jifan, Zhang Zheyuan, Yao Zijun, Zhang Xiaokang, Hou Lei, Zhang Jing, Li Juanzi
- APT: Adaptive Pruning And Tuning Pretrained Language Models For Efficient Training And Inference Zhao Bowen, Hajishirzi Hannaneh, Cao Qingqing
- SELF-GUIDE: Better Task-specific Instruction Following Via Self-synthetic Finetuning Zhao Chenyang, Jia Xueying, Viswanathan Vijay, Wu Tongshuang, Neubig Graham
- Consistency Matters: Explore Llms Consistency From A Black-box Perspective Zhao Fufangchen, Jin Guoqiang, Huang Jiaheng, Zhao Rui, Tan Fei
- Large Language Model Can Continue Evolving From Mistakes Zhao Haokun, Han Haixia, Shi Jie, Du Chengyu, Liang Jiaqing, Xiao Yanghua
- LANE: Logic Alignment Of Non-tuning Large Language Models And Online Recommendation Systems For Explainable Reason Generation Zhao Hongke, Zheng Songming, Wu Likang, Yu Bowen, Wang Jing
- Galore: Memory-efficient LLM Training By Gradient Low-rank Projection Zhao Jiawei, Zhang Zhenyu, Chen Beidi, Wang Zhangyang, Anandkumar Anima, Tian Yuandong
- Exploring The Compositional Deficiency Of Large Language Models In Mathematical Reasoning Zhao Jun, Tong Jingqi, Mou Yurong, Zhang Ming, Zhang Qi, Huang Xuanjing
- Llama Beyond English: An Empirical Study On Language Capability Transfer Zhao Jun, Zhang Zhihao, Gao Luhui, Zhang Qi, Gui Tao, Huang Xuanjing
- Longagent: Scaling Language Models To 128k Context Through Multi-agent Collaboration Zhao Jun, Zu Can, Xu Hao, Lu Yi, He Wei, Ding Yiwen, Gui Tao, Zhang Qi, Huang Xuanjing
- Lora Land: 310 Fine-tuned Llms That Rival GPT-4, A Technical Report Zhao Justin, Wang Timothy, Abid Wael, Angus Geoffrey, Garg Arnav, Kinnison Jeffery, Sherstinsky Alex, Molino Piero, Addair Travis, Rishi Devvret
- Longskywork: A Training Recipe For Efficiently Extending Context Length In Large Language Models Zhao Liang, Wei Tianwen, Zeng Liang, Cheng Cheng, Yang Liu, Cheng Peng, Wang Lijie, Li Chenxia, Wu Xuejie, Zhu Bo, Gan Yimeng, Hu Rui, Yan Shuicheng, Fang Han, Zhou Yahui
- Mitigating Object Hallucination In Large Vision-language Models Via Classifier-free Guidance Zhao Linxi, Deng Yihe, Zhang Weitong, Gu Quanquan
- Diffagent: Fast And Accurate Text-to-image API Selection With Large Language Model Zhao Lirui, Yang Yue, Zhang Kaipeng, Shao Wenqi, Zhang Yuxin, Qiao Yu, Luo Ping, Ji Rongrong
- Aqulia-med LLM: Pioneering Full-process Open-source Medical Language Models Zhao Lulu, Zeng Weihao, Shi Xiaofeng, Zhou Hua, Hao Donglin, Lin Yonghua
- Efficiently Training 7B LLM With 1 Million Sequence Length On 8 Gpus Zhao Pinxue, Zhang Hailin, Fu Fangcheng, Nie Xiaonan, Liu Qibin, Yang Fang, Peng Yuanbo, Jiao Dian, Li Shuaipeng, Xue Jinbao, Tao Yangyu, Cui Bin
- A Survey Of Backdoor Attacks And Defenses On Large Language Models: Implications For Security Measures Zhao Shuai, Jia Meihuizi, Guo Zhongliang, Gan Leilei, Xu Xiaoyu, Wu Xiaobao, Fu Jie, Feng Yichao, Pan Fengjun, Tuan Luu Anh
- Defending Large Language Models Against Jailbreak Attacks Via Layer-specific Editing Zhao Wei, Li Zhe, Li Yige, Zhang Ye, Sun Jun
- Ouroboros: Generating Longer Drafts Phrase By Phrase For Faster Speculative Decoding Zhao Weilin, Huang Yuxiang, Han Xu, Xu Wang, Xiao Chaojun, Zhang Xinrong, Fang Yewei, Zhang Kaihuo, Liu Zhiyuan, Sun Maosong
- Both Matter: Enhancing The Emotional Intelligence Of Large Language Models Without Compromising The General Intelligence Zhao Weixiang, Li Zhuojun, Wang Shilong, Wang Yang, Hu Yulin, Zhao Yanyan, Wei Chen, Qin Bing
- Enhancing Cross-domain Pre-trained Decision Transformers With Adaptive Attention Zhao Wenhao, Xu Qiushui, Xu Linjie, Song Lei, Wang Jinyu, Zhou Chunlai, Bian Jiang
- Wildchat: 1M Chatgpt Interaction Logs In The Wild Zhao Wenting, Ren Xiang, Hessel Jack, Cardie Claire, Choi Yejin, Deng Yuntian
- How Do Large Language Models Handle Multilingualism? Zhao Yiran, Zhang Wenxuan, Chen Guizhen, Kawaguchi Kenji, Bing Lidong
- An Empirical Study Of Retrieval Augmented Generation With Chain-of-thought Zhao Yuetong, Cao Hongyu, Zhao Xianyu, Ou Zhijian
- Improving The Robustness Of Large Language Models Via Consistency Alignment Zhao Yukun, Yan Lingyong, Sun Weiwei, Xing Guoliang, Wang Shuaiqiang, Meng Chong, Cheng Zhicong, Ren Zhaochun, Yin Dawei
- SWIFT: A Scalable Lightweight Infrastructure For Fine-tuning Zhao Yuze, Huang Jintao, Hu Jinghan, Wang Xingjun, Mao Yunlin, Zhang Daoze, Jiang Zeyinzi, Wu Zhikai, Ai Baole, Wang Ang, Zhou Wenmeng, Chen Yingda
- Adversarial Contrastive Decoding: Boosting Safety Alignment Of Large Language Models Via Opposite Prompt Optimization Zhao Zhengyue, Zhang Xiaoyun, Xu Kaidi, Hu Xing, Zhang Rui, Du Zidong, Guo Qi, Chen Yunji
- Retrieval-augmented Mixture Of Lora Experts For Uploadable Machine Learning Zhao Ziyu, Gan Leilei, Wang Guoyin, Hu Yuwei, Shen Tao, Yang Hongxia, Kuang Kun, Wu Fei
- Loraretriever: Input-aware Lora Retrieval And Composition For Mixed Tasks In The Wild Zhao Ziyu, Gan Leilei, Wang Guoyin, Zhou Wangchunshu, Yang Hongxia, Kuang Kun, Wu Fei
- HALC: Object Hallucination Reduction Via Adaptive Focal-contrast Decoding Chen Zhaorun, Zhao Zhuokai, Luo Hongyin, Yao Huaxiu, Li Bo, Zhou Jiawei
- ICE-GRT: Instruction Context Enhancement By Generative Reinforcement Based Transformers Zheng Chen, Sun Ke, Tang Da, Ma Yukun, Zhang Yuyu, Xi Chenguang, Zhou Xun
- Balancing Enhancement, Harmlessness, And General Capabilities: Enhancing Conversational Llms With Direct RLHF Zheng Chen, Sun Ke, Wu Hang, Xi Chenguang, Zhou Xun
- Weak-to-strong Extrapolation Expedites Alignment Zheng Chujie, Wang Ziqi, Ji Heng, Huang Minlie, Peng Nanyun
- Large Language Models As Reliable Knowledge Bases? Zheng Danna, Lapata Mirella, Pan Jeff Z.
- Learn To Be Efficient: Build Structured Sparsity In Large Language Models Zheng Haizhong, Bai Xiaoyan, Liu Xueshen, Mao Z. Morley, Chen Beidi, Lai Fan, Prakash Atul
- Fine-tuning Large Language Models For Domain-specific Machine Translation Zheng Jiawei, Hong Hanghai, Wang Xiaoli, Su Jingsong, Liang Yonggui, Wu Shikai
- NEO-BENCH: Evaluating Robustness Of Large Language Models With Neologisms Zheng Jonathan, Ritter Alan, Xu Wei
- Can Llms Learn New Concepts Incrementally Without Forgetting? Zheng Junhao, Qiu Shengjie, Ma Qianli
- Towards Lifelong Learning Of Large Language Models: A Survey Zheng Junhao, Qiu Shengjie, Shi Chengming, Ma Qianli
- Dreamlip: Language-image Pre-training With Long Captions Zheng Kecheng, Zhang Yifei, Wu Wei, Lu Fan, Ma Shuailei, Jin Xin, Chen Wei, Shen Yujun
- Clr-fact: Evaluating The Complex Logical Reasoning Capability Of Large Language Models Over Factual Knowledge Zheng Tianshi, Bai Jiaxin, Wang Yicheng, Fang Tianqing, Guo Yue, Yim Yauwai, Song Yangqiu
- Kun: Answer Polishment For Chinese Self-alignment With Instruction Back-translation Zheng Tianyu, Guo Shuyue, Qu Xingwei, Guo Jiawei, Du Xinrun, Jia Qi, Lin Chenghua, Huang Wenhao, Fu Jie, Zhang Ge
- More-3s: Multimodal-based Offline Reinforcement Learning With Shared Semantic Spaces Zheng Tianyu, Zhang Ge, Qu Xingwei, Kuang Ming, Huang Stephen W., He Zhaofeng
- Breaking Language Barriers: Cross-lingual Continual Pre-training At Scale Zheng Wenzhen, Pan Wenbo, Xu Xu, Qin Libo, Yue Li, Zhou Ming
- Critic-cot: Boosting The Reasoning Abilities Of Large Language Model Via Chain-of-thoughts Critic Zheng Xin, Lou Jie, Cao Boxi, Wen Xueru, Ji Yuqiu, Lin Hongyu, Lu Yaojie, Han Xianpei, Zhang Debing, Sun Le
- Llamafactory: Unified Efficient Fine-tuning Of 100+ Language Models Zheng Yaowei, Zhang Richong, Zhang Junhao, Ye Yanhan, Luo Zheyan, Feng Zhangchi, Ma Yongqiang
- Harnessing Large Language Models For Text-rich Sequential Recommendation Zheng Zhi, Chao Wenshuo, Qiu Zhaopeng, Zhu Hengshu, Xiong Hui
- Training Large Language Models For Reasoning Through Reverse Curriculum Reinforcement Learning Xi Zhiheng, Chen Wenxiang, Hong Boyang, Jin Senjie, Zheng Rui, He Wei, Ding Yiwen, Liu Shichun, Guo Xin, Wang Junzhe, Guo Honglin, Shen Wei, Fan Xiaoran, Zhou Yuhao, Dou Shihan, Wang Xiao, Zhang Xinbo, Sun Peng, Gui Tao, Zhang Qi, Huang Xuanjing
- Understanding The Rope Extensions Of Long-context Llms: An Attention Perspective Zhong Meizhi, Zhang Chen, Lei Yikun, Liu Xikai, Gao Yan, Hu Yao, Chen Kehai, Zhang Min
- ROSE Doesn't Do That: Boosting The Safety Of Instruction-tuned Large Language Models With Reverse Prompt Contrastive Decoding Zhong Qihuang, Ding Liang, Liu Juhua, Du Bo, Tao Dacheng
- Revisiting Knowledge Distillation For Autoregressive Language Models Zhong Qihuang, Ding Liang, Shen Li, Liu Juhua, Du Bo, Tao Dacheng
- Moextend: Tuning New Experts For Modality And Task Extension Zhong Shanshan, Gao Shanghua, Huang Zhongzhan, Wen Wushao, Zitnik Marinka, Zhou Pan
- Harnessing Large Language Models As Post-hoc Correctors Zhong Zhiqiang, Zhou Kuangyu, Mottin Davide
- Synthet2c: Generating Synthetic Data For Fine-tuning Large Language Models On The Text2cypher Task Zhong Ziije, Zhong Linqing, Sun Zhaoze, Jin Qingyun, Qin Zengchang, Zhang Xiaofan
- Moe-lpr: Multilingual Extension Of Large Language Models Through Mixture-of-experts With Language Priors Routing Zhou Hao, Wang Zhijun, Huang Shujian, Huang Xin, Han Xue, Feng Junlan, Deng Chao, Luo Weihua, Chen Jiajun
- Sequence To Sequence Reward Modeling: Improving RLHF By Language Feedback Zhou Jiayi, Ji Jiaming, Dai Juntao, Yang Yaodong
- Orchestrating Llms With Different Personalizations Zhou Jin Peng, Luo Katie Z, Gu Jingwen, Yuan Jason, Weinberger Kilian Q., Sun Wen
- Qog: Question And Options Generation Based On Language Model Zhou Jincheng
- Leveraging Web-crawled Data For High-quality Fine-tuning Zhou Jing, Jiang Chenglin, Shen Wei, Zhou Xiao, He Xiaonan
- An LLM Feature-based Framework For Dialogue Constructiveness Assessment Zhou Lexin, Farag Youmna, Vlachos Andreas
- Towards Democratizing Multilingual Large Language Models For Medicine Through A Two-stage Instruction Fine-tuning Approach Zhou Meng, Parmar Surajsinh, Bhatti Anubhav
- Enhancing The General Agent Capabilities Of Low-parameter Llms Through Tuning And Multi-branch Reasoning Zhou Qinhao, Zhang Zihan, Xiang Xiang, Wang Ke, Wu Yuchuan, Li Yongbin
- Evaluating The Smooth Control Of Attribute Intensity In Text Generation With Llms Zhou Shang, Yao Feng, Dong Chengyu, Wang Zihan, Shang Jingbo
- Pre-trained Large Language Models Use Fourier Features To Compute Addition Zhou Tianyi, Fu Deqing, Sharan Vatsal, Jia Robin
- Explaining Pre-trained Language Models With Attribution Scores: An Analysis In Low-resource Settings Zhou Wei, Adel Heike, Schuff Hendrik, Vu Ngoc Thang
- Sirius: Contextual Sparsity With Correction For Efficient Llms Zhou Yang, Chen Zhuoming, Xu Zhaozhuo, Lin Victoria, Chen Beidi
- Archer: Training Language Model Agents Via Hierarchical Multi-turn RL Zhou Yifei, Zanette Andrea, Pan Jiayi, Levine Sergey, Kumar Aviral
- Shared Imagination: Llms Hallucinate Alike Zhou Yilun, Xiong Caiming, Savarese Silvio, Wu Chien-sheng
- Few-shot Adversarial Prompt Learning On Vision-language Models Zhou Yiwei, Xia Xiaobo, Lin Zhiwei, Han Bo, Liu Tongliang
- Aligning Modalities In Vision Large Language Models Via Preference Fine-tuning Zhou Yiyang, Cui Chenhang, Rafailov Rafael, Finn Chelsea, Yao Huaxiu
- Calibrated Self-rewarding Vision Language Models Zhou Yiyang, Fan Zhiyuan, Cheng Dongjie, Yang Sihan, Chen Zhaorun, Cui Chenhang, Wang Xiyao, Li Yun, Zhang Linjun, Yao Huaxiu
- Dual Instruction Tuning With Large Language Models For Mathematical Reasoning Zhou Yongwei, Zhao Tiejun
- Multi-stage Balanced Distillation: Addressing Long-tail Challenges In Sequence-level Knowledge Distillation Zhou Yuhang, Zhu Jing, Xu Paiheng, Liu Xiaoyu, Wang Xiyao, Koutra Danai, Ai Wei, Huang Furong
- Defending Jailbreak Prompts Via In-context Adversarial Game Zhou Yujun, Han Yufei, Zhuang Haomin, Guo Kehan, Liang Zhenwen, Bao Hongyan, Zhang Xiangliang
- Metagpt: Merging Large Language Models Using Model Exclusive Task Arithmetic Zhou Yuyan, Song Liang, Wang Bingning, Chen Weipeng
- Emulated Disalignment: Safety Alignment For Large Language Models May Backfire! Zhou Zhanhui, Liu Jie, Dong Zhichen, Liu Jiaheng, Yang Chao, Ouyang Wanli, Qiao Yu
- How Alignment And Jailbreak Work: Explain LLM Safety Through Intermediate Hidden States Zhou Zhenhong, Yu Haiyang, Zhang Xinghua, Xu Rongwu, Huang Fei, Li Yongbin
- Lawgpt: A Chinese Legal Knowledge-enhanced Large Language Model Zhou Zhi, Shi Jiang-xin, Song Peng-xiao, Yang Xiao-wen, Jin Yi-xuan, Guo Lan-zhe, Li Yu-feng
- Fine-tuning Large Language Models To Translate: Will A Touch Of Noisy Data In Misaligned Languages Suffice? Zhu Dawei, Chen Pinzhen, Zhang Miaoran, Haddow Barry, Shen Xiaoyu, Klakow Dietrich
- A Preference-driven Paradigm For Enhanced Translation With Large Language Models Zhu Dawei, Trenous Sony, Shen Xiaoyu, Klakow Dietrich, Byrne Bill, Hasler Eva
- TAT-LLM: A Specialized Language Model For Discrete Reasoning Over Tabular And Textual Data Zhu Fengbin, Liu Ziyang, Feng Fuli, Wang Chao, Li Moxin, Chua Tat-seng
- FANNO: Augmenting High-quality Instruction Data With Open-sourced Llms Only Zhu He, Su Junyou, Lun Tianle, Tao Yicheng, Zhang Wenjia, Fan Zipei, Chen Guanhua
- Lifelong Personalized Low-rank Adaptation Of Large Language Models For Recommendation Zhu Jiachen, Lin Jianghao, Dai Xinyi, Chen Bo, Shan Rong, Zhu Jieming, Tang Ruiming, Yu Yong, Zhang Weinan
- Cost: Contrastive Quantization Based Semantic Tokenization For Generative Recommendation Zhu Jieming, Jin Mengqun, Liu Qijiong, Qiu Zexuan, Dong Zhenhua, Li Xiu
- Collectivesft: Scaling Large Language Models For Chinese Medical Benchmark With Collective Instructions In Healthcare Zhu Jingwei, Tan Minghuan, Yang Min, Li Ruixue, Alinejad-rokny Hamid
- Fastmem: Fast Memorization Of Prompt Improves Context Awareness Of Large Language Models Zhu Junyi, Liu Shuochen, Yu Yu, Tang Bo, Yan Yibo, Li Zhiyu, Xiong Feiyu, Xu Tong, Blaschko Matthew B.
- Investigating Instruction Tuning Large Language Models On Graphs Zhu Kerui, Huang Bo-wei, Jin Bowen, Jiao Yizhu, Zhong Ming, Chang Kevin, Lin Shou-de, Han Jiawei
- Relayattention For Efficient Large Language Model Serving With Long System Prompts Zhu Lei, Wang Xinjiang, Zhang Wayne, Lau Rynson W. H.
- LIRE: Listwise Reward Enhancement For Preference Alignment Zhu Mingye, Liu Yi, Zhang Lei, Guo Junbo, Mao Zhendong
- Parameter-efficient Tuning Large Language Models For Graph Representation Learning Zhu Qi, Zheng Da, Song Xiang, Zhang Shichang, Jin Bowen, Sun Yizhou, Karypis George
- Inference-time Decontamination: Reusing Leaked Benchmarks For Large Language Model Evaluation Zhu Qin, Cheng Qingyuan, Peng Runyu, Li Xiaonan, Liu Tengxiao, Peng Ru, Qiu Xipeng, Huang Xuanjing
- Scalable Matmul-free Language Modeling Zhu Rui-jie, Zhang Yu, Sifferman Ethan, Sheaves Tyler, Wang Yiqiao, Richmond Dustin, Zhou Peng, Eshraghian Jason K.
- Llama-moe: Building Mixture-of-experts From Llama With Continual Pre-training Zhu Tong, Qu Xiaoye, Dong Daize, Ruan Jiacheng, Tong Jingqi, He Conghui, Cheng Yu
- IAPT: Instruction-aware Prompt Tuning For Large Language Models Zhu Wei, Tian Aaron Xuxiang, Yin Congrui, Ni Yuan, Wang Xiaoling, Xie Guotong
- The Power Of Question Translation Training In Multilingual Reasoning: Broadened Scope And Deepened Insights Zhu Wenhao, Huang Shujian, Yuan Fei, Chen Cheng, Chen Jiajun, Birch Alexandra
- Question Translation Training For Better Multilingual Reasoning Zhu Wenhao, Huang Shujian, Yuan Fei, She Shuaijie, Chen Jiajun, Birch Alexandra
- Distilling Mathematical Reasoning Capabilities Into Small Language Models Zhu Xunyu, Li Jian, Liu Yong, Ma Can, Wang Weiping
- Can Large Language Models Understand Context? Zhu Yilun, Moniz Joel Ruben Antony, Bhargava Shruti, Lu Jiarui, Piraviperumal Dhivya, Li Site, Zhang Yuan, Yu Hong, Tseng Bo-hsiang
- Generative Pre-trained Speech Language Model With Efficient Hierarchical Transformer Zhu Yongxin, Su Dan, He Liqiang, Xu Linli, Yu Dong
- Efficient Test-time Prompt Tuning For Vision-language Models Zhu Yuhan, Zhang Guozhen, Xu Chen, Shen Haocheng, Chen Xiaoxin, Wu Gangshan, Wang Limin
- One Token Can Help! Learning Scalable And Pluggable Virtual Tokens For Retrieval-augmented Large Language Models Zhu Yutao, Huang Zhaoheng, Dou Zhicheng, Wen Ji-rong
- Yulan: An Open-source Large Language Model Zhu Yutao, Zhou Kun, Mao Kelong, Chen Wentong, Sun Yiding, Chen Zhipeng, Cao Qian, Wu Yihan, Chen Yushuo, Wang Feng, Zhang Lei, Li Junyi, Wang Xiaolei, Wang Lei, Zhang Beichen, Dong Zican, Cheng Xiaoxue, Chen Yuhan, Tang Xinyu, Hou Yupeng, Ren Qiangqiang, Pang Xincheng, Xie Shufang, Zhao Wayne Xin, Dou Zhicheng, Mao Jiaxin, Lin Yankai, Song Ruihua, Xu Jun, Chen Xu, Yan Rui, Wei Zhewei, Hu Di, Huang Wenbing, Gao Ze-feng, Chen Yueguo, Lu Weizheng, Wen Ji-rong
- Protllm: An Interleaved Protein-language LLM With Protein-as-word Pre-training Zhuo Le, Chi Zewen, Xu Minghao, Huang Heyan, Zheng Heqi, He Conghui, Mao Xian-ling, Zhang Wentao
- Lumina-next: Making Lumina-t2x Stronger And Faster With Next-dit Zhuo Le, Du Ruoyi, Xiao Han, Li Yangguang, Liu Dongyang, Huang Rongjie, Liu Wenze, Zhao Lirui, Wang Fu-yun, Ma Zhanyu, Luo Xu, Wang Zehan, Zhang Kaipeng, Zhu Xiangyang, Liu Si, Yue Xiangyu, Liu Dingning, Ouyang Wanli, Liu Ziwei, Qiao Yu, Li Hongsheng, Gao Peng
- Astraios: Parameter-efficient Instruction Tuning Code Large Language Models Zhuo Terry Yue, Zebaze Armel, Suppattarachai Nitchakarn, Von Werra Leandro, De Vries Harm, Liu Qian, Muennighoff Niklas
- Less Is More: Mitigating Multimodal Hallucination From An EOS Decision Perspective Yue Zihao, Zhang Liang, Jin Qin
- Language Model Sentence Completion With A Parser-driven Rhetorical Control Method Zingale Joshua, Kalita Jugal
- Proswitch: Knowledge-guided Instruction Tuning To Generate Professional And Non-professional Styled Text Zong Chang, Chen Yuyan, Lu Weiming, Shao Jian, Zhuang Yueting
- Llama-excitor: General Instruction Tuning Via Indirect Feature Interaction Zou Bo, Yang Chao, Qiao Yu, Quan Chengbin, Zhao Youjian
- Implicitave: An Open-source Dataset And Multimodal Llms Benchmark For Implicit Attribute Value Extraction Zou Henry Peng, Samuel Vinay, Zhou Yue, Zhang Weizhi, Fang Liancheng, Song Zihe, Yu Philip S., Caragea Cornelia
- Promptintern: Saving Inference Costs By Internalizing Recurrent Prompt During Large Language Model Fine-tuning Zou Jiaru, Zhou Mengyu, Li Tao, Han Shi, Zhang Dongmei
- Llms For XAI: Future Directions For Explaining Explanations Zytek Alexandra, Pidò Sara, Veeramachaneni Kalyan
- Towards Reliable And Factual Response Generation: Detecting Unanswerable Questions In Information-seeking Conversations Łajewska Weronika, Balog Krisztian
- Concept-aware Data Construction Improves In-context Learning Of Language Models Štefánik Michal, Kadlčík Marek, Sojka Petr
🏷 Transformer
- Multimodal Attention For Neural Machine Translation Caglayan Ozan, Barrault Loïc, Bougares Fethi
- Tree-to-sequence Attentional Neural Machine Translation Eriguchi Akiko, Hashimoto Kazuma, Tsuruoka Yoshimasa
- Multi-way, Multilingual Neural Machine Translation With A Shared Attention Mechanism Firat Orhan, Cho Kyunghyun, Bengio Yoshua
- Coherent Dialogue With Attention-based Language Models Mei Hongyuan, Bansal Mohit, Walter Matthew R.
- Interactive Attention For Neural Machine Translation Meng Fandong, Lu Zhengdong, Li Hang, Liu Qun
- Topic Aware Neural Response Generation Xing Chen, Wu Wei, Wu Yu, Liu Jie, Huang Yalou, Zhou Ming, Ma Wei-ying
- End-to-end Answer Chunk Extraction And Ranking For Reading Comprehension Yu Yang, Zhang Wei, Hasan Kazi, Yu Mo, Xiang Bing, Zhou Bowen
- Weighted Transformer Network For Machine Translation Ahmed Karim, Keskar Nitish Shirish, Socher Richard
- Attention Is All You Need Vaswani Ashish, Shazeer Noam, Parmar Niki, Uszkoreit Jakob, Jones Llion, Gomez Aidan N., Kaiser Lukasz, Polosukhin Illia
- Lig-cristal System For The WMT17 Automatic Post-editing Task Berard Alexandre, Pietquin Olivier, Besacier Laurent
- Frustratingly Short Attention Spans In Neural Language Modeling Daniluk Michał, Rocktäschel Tim, Welbl Johannes, Riedel Sebastian
- An Empirical Study On The Effectiveness Of Images In Multimodal Neural Machine Translation Delbrouck Jean-benoit, Dupont Stéphane
- Ruminating Reader: Reasoning With Gated Multi-hop Attention Gong Yichen, Bowman Samuel R.
- Sockeye: A Toolkit For Neural Machine Translation Hieber Felix, Domhan Tobias, Denkowski Michael, Vilar David, Sokolov Artem, Clifton Ann, Post Matt
- Reinforced Mnemonic Reader For Machine Reading Comprehension Hu Minghao, Peng Yuxing, Huang Zhen, Qiu Xipeng, Wei Furu, Zhou Ming
- Simple Recurrent Units For Highly Parallelizable Recurrence Lei Tao, Zhang Yu, Wang Sida I., Dai Hui, Artzi Yoav
- Attention Strategies For Multi-source Sequence-to-sequence Learning Libovický Jindřich, Helcl Jindřich
- Table-to-text Generation By Structure-aware Seq2seq Learning Liu Tianyu, Wang Kexiang, Sha Lei, Chang Baobao, Sui Zhifang
- Best Of Both Worlds: Transferring Knowledge From Discriminative Learning To A Generative Visual Dialog Model Lu Jiasen, Kannan Anitha, Yang Jianwei, Parikh Devi, Batra Dhruv
- Visual Reference Resolution Using Attention Memory For Visual Dialog Seo Paul Hongsuck, Lehrmann Andreas, Han Bohyung, Sigal Leonid
- Generating High-quality And Informative Conversation Responses With Sequence-to-sequence Models Shao Louis, Gouws Stephan, Britz Denny, Goldie Anna, Strope Brian, Kurzweil Ray
- Self-attentive Residual Decoder For Neural Machine Translation Werlen Lesly Miculicich, Pappas Nikolaos, Ram Dhananjay, Popescu-belis Andrei
- Are You Talking To Me? Reasoned Visual Dialog Generation Through Adversarial Learning Wu Qi, Wang Peng, Shen Chunhua, Reid Ian, Hengel Anton Van Den
- Attention Based Natural Language Grounding By Navigating Virtual Environment B Akilesh, Sinha Abhishek, Sarkar Mausoom, Krishnamurthy Balaji
- Training Deeper Neural Machine Translation Models With Transparent Attention Bapna Ankur, Chen Mia Xu, Firat Orhan, Cao Yuan, Wu Yonghui
- Commonsense For Generative Multi-hop Question Answering Tasks Bauer Lisa, Wang Yicheng, Bansal Mohit
- The Best Of Both Worlds: Combining Recent Advances In Neural Machine Translation Chen Mia Xu, Firat Orhan, Bapna Ankur, Johnson Melvin, Macherey Wolfgang, Foster George, Jones Llion, Parmar Niki, Schuster Mike, Chen Zhifeng, Wu Yonghui, Hughes Macduff
- Augmenting Neural Response Generation With Context-aware Topical Attention Dziri Nouha, Kamalloo Ehsan, Mathewson Kory W., Zaiane Osmar
- Reciprocal Attention Fusion For Visual Question Answering Farazi Moshiur R, Khan Salman H
- SALSA-TEXT: Self Attentive Latent Space Based Adversarial Text Generation Gagnon-marchand Jules, Sadeghi Hamed, Haidar Md. Akmal, Rezagholizadeh Mehdi
- Generating Distractors For Reading Comprehension Questions From Real Examinations Gao Yifan, Bing Lidong, Li Piji, King Irwin, Lyu Michael R.
- The Memad Submission To The WMT18 Multimodal Translation Task Grönroos Stig-arne, Huet Benoit, Kurimo Mikko, Laaksonen Jorma, Merialdo Bernard, Pham Phu, Sjöberg Mats, Sulubacak Umut, Tiedemann Jörg, Troncy Raphael, Vázquez Raúl
- BERT: Pre-training Of Deep Bidirectional Transformers For Language Understanding Devlin Jacob, Chang Ming-wei, Lee Kenton, Toutanova Kristina
- Ms-uedin Submission To The WMT2018 APE Shared Task: Dual-source Transformer For Automatic Post-editing Junczys-dowmunt Marcin, Grundkiewicz Roman
- Fast And Simple Mixture Of Softmaxes With BPE And Hybrid-lightrnn For Language Generation Kong Xiang, Xie Qizhe, Dai Zihang, Hovy Eduard
- Seq2rdf: An End-to-end Application For Deriving Triples From Natural Language Text Liu Yue, Zhang Tongtao, Liang Zhicheng, Ji Heng, Mcguinness Deborah L.
- Mem2seq: Effectively Incorporating Knowledge Bases Into End-to-end Task-oriented Dialog Systems Madotto Andrea, Wu Chien-sheng, Fung Pascale
- Improved Fusion Of Visual And Language Representations By Dense Symmetric Co-attention For Visual Question Answering Nguyen Duy-kien, Okatani Takayuki
- Phrase-based Attentions Nguyen Phi Xuan, Joty Shafiq
- Recursive Visual Attention In Visual Dialog Niu Yulei, Zhang Hanwang, Zhang Manli, Zhang Jianhong, Lu Zhiwu, Wen Ji-rong
- Dual Recurrent Attention Units For Visual Question Answering Osman Ahmed, Samek Wojciech
- Training Tips For The Transformer Model Popel Martin, Bojar Ondřej
- The University Of Cambridge's Machine Translation Systems For WMT18 Stahlberg Felix, De Gispert Adria, Byrne Bill
- An Operation Sequence Model For Explainable Neural Machine Translation Stahlberg Felix, Saunders Danielle, Byrne Bill
- Inducing Grammars With And For Neural Machine Translation Tran Ke, Bisk Yonatan
- Few-shot Generalization Across Dialogue Tasks Vlasov Vladimir, Drissner-schmid Akela, Nichol Alan
- Multilingual NMT With A Language-independent Attention Bridge Vázquez Raúl, Raganato Alessandro, Tiedemann Jörg, Creutz Mathias
- A Task In A Suit And A Tie: Paraphrase Generation With Semantic Augmentation Wang Su, Gupta Rahul, Chang Nancy, Baldridge Jason
- Qanet: Combining Local Convolution With Global Self-attention For Reading Comprehension Yu Adams Wei, Dohan David, Luong Minh-thang, Zhao Rui, Chen Kai, Norouzi Mohammad, Le Quoc V.
- An Affect-rich Neural Conversational Model With Biased Attention And Weighted Cross-entropy Loss Zhong Peixiang, Wang Di, Miao Chunyan
- Abductive Reasoning As Self-supervision For Common Sense Question Answering Aakur Sathyanarayanan N., Sarkar Sudeep
- Syntactically Supervised Transformers For Faster Neural Machine Translation Akoury Nader, Krishna Kalpesh, Iyyer Mohit
- Fusion Of Detected Objects In Text For Visual Question Answering Alberti Chris, Ling Jeffrey, Collins Michael, Reitter David
- The Unreasonable Effectiveness Of Transformer Language Models In Grammatical Error Correction Alikaniotis Dimitrios, Raheja Vipul
- PLATO: Pre-trained Dialogue Generation Model With Discrete Latent Variable Bao Siqi, He Huang, Wang Fan, Wu Hua, Wang Haifeng
- BERTQA -- Attention On Steroids Chadha Ankit, Sood Rewa
- An Empirical Study Of Generation Order For Machine Translation Chan William, Stern Mitchell, Kiros Jamie, Uszkoreit Jakob
- Distilling Knowledge Learned In BERT For Text Generation Chen Yen-chun, Gan Zhe, Cheng Yu, Liu Jingzhou, Liu Jingjing
- Attending To Entities For Better Text Understanding Cheng Pengxiang, Erk Katrin
- Learning Multi-level Information For Dialogue Response Selection By Highway Recurrent Transformer Chiang Ting-rui, Huang Chao-wei, Su Shang-yu, Chen Yun-nung
- Exploring The Limits Of Transfer Learning With A Unified Text-to-text Transformer Raffel Colin, Shazeer Noam, Roberts Adam, Lee Katherine, Narang Sharan, Matena Michael, Zhou Yanqi, Li Wei, Liu Peter J.
- Deep Learning Based Chatbot Models Csaky Richard
- Transformer-xl: Attentive Language Models Beyond A Fixed-length Context Dai Zihang, Yang Zhilin, Yang Yiming, Carbonell Jaime, Le Quoc V., Salakhutdinov Ruslan
- The Second Conversational Intelligence Challenge (convai2) Dinan Emily, Logacheva Varvara, Malykh Valentin, Miller Alexander, Shuster Kurt, Urbanek Jack, Kiela Douwe, Szlam Arthur, Serban Iulian, Lowe Ryan, Prabhumoye Shrimai, Black Alan W, Rudnicky Alexander, Williams Jason, Pineau Joelle, Burtsev Mikhail, Weston Jason
- Memory-augmented Recurrent Networks For Dialogue Coherence Donahue David, Meng Yuanliang, Rumshisky Anna
- Unified Language Model Pre-training For Natural Language Understanding And Generation Dong Li, Yang Nan, Wang Wenhui, Wei Furu, Liu Xiaodong, Wang Yu, Gao Jianfeng, Zhou Ming, Hon Hsiao-wuen
- Question-agnostic Attention For Visual Question Answering Farazi Moshiur R, Khan Salman H, Barnes Nick
- Neural Data-to-text Generation: A Comparison Between Pipeline And End-to-end Architectures Ferreira Thiago Castro, Van Der Lee Chris, Van Miltenburg Emiel, Krahmer Emiel
- TANDA: Transfer And Adapt Pre-trained Transformer Models For Answer Sentence Selection Garg Siddhant, Vu Thuy, Moschitti Alessandro
- Self-attentive Model For Headline Generation Gavrilov Daniil, Kalaidin Pavel, Malykh Valentin
- Span Selection Pre-training For Question Answering Glass Michael, Gliozzo Alfio, Chakravarti Rishav, Ferritto Anthony, Pan Lin, Bhargav G P Shrivatsa, Garg Dinesh, Sil Avirup
- Interactive Matching Network For Multi-turn Response Selection In Retrieval-based Chatbots Gu Jia-chen, Ling Zhen-hua, Liu Quan
- Insertion-based Decoding With Automatically Inferred Generation Order Gu Jiatao, Liu Qi, Cho Kyunghyun
- Zero-shot Paraphrase Generation With Multilingual Language Models Guo Yinpeng, Liao Yi, Jiang Xin, Zhang Qing, Zhang Yibo, Liu Qun
- Exploring Neural Net Augmentation To BERT For Question Answering On SQUAD 2.0 Gupta Suhas
- Episodic Memory Reader: Learning What To Remember For Question Answering From Streaming Data Han Moonsu, Kang Minki, Jung Hyunwoo, Hwang Sung Ju
- Analyzing The Forgetting Problem In The Pretrain-finetuning Of Dialogue Response Models He Tianxing, Liu Jun, Cho Kyunghyun, Ott Myle, Liu Bing, Glass James, Peng Fuchun
- Convert: Efficient And Accurate Conversational Representations From Transformers Henderson Matthew, Casanueva Iñigo, Mrkšić Nikola, Su Pei-hao, Wen Tsung-hsien, Vulić Ivan
- Efficient Adaptation Of Pretrained Transformers For Abstractive Summarization Hoang Andrew, Bosselut Antoine, Celikyilmaz Asli, Choi Yejin
- Exbert: A Visual Analysis Tool To Explore Learned Representations In Transformers Models Hoover Benjamin, Strobelt Hendrik, Gehrmann Sebastian
- Iterative Answer Prediction With Pointer-augmented Multimodal Transformers For Textvqa Hu Ronghang, Singh Amanpreet, Darrell Trevor, Rohrbach Marcus
- INSET: Sentence Infilling With Inter-sentential Transformer Huang Yichen, Zhang Yizhe, Elachqar Oussama, Cheng Yu
- Attention Is Not Explanation Jain Sarthak, Wallace Byron C.
- Document-level Neural Machine Translation With Associated Memory Network Jiang Shu, Wang Rui, Li Zuchao, Utiyama Masao, Chen Kehai, Sumita Eiichiro, Zhao Hai, Lu Bao-liang
- Tinybert: Distilling BERT For Natural Language Understanding Jiao Xiaoqi, Yin Yichun, Shang Lifeng, Jiang Xin, Chen Xiao, Li Linlin, Wang Fang, Liu Qun
- All-in-one Image-grounded Conversational Agents Ju Da, Shuster Kurt, Boureau Y-lan, Weston Jason
- Microsoft Translator At WMT 2019: Towards Large-scale Document-level Neural Machine Translation Junczys-dowmunt Marcin
- Learning And Evaluating Contextual Embedding Of Source Code Kanade Aditya, Maniatis Petros, Balakrishnan Gogul, Shi Kensen
- Weak Supervision Helps Emergence Of Word-object Alignment And Improves Vision-language Tasks Kervadec Corentin, Antipov Grigory, Baccouche Moez, Wolf Christian
- CTRL: A Conditional Transformer Language Model For Controllable Generation Keskar Nitish Shirish, Mccann Bryan, Varshney Lav R., Xiong Caiming, Socher Richard
- Sample Efficient Text Summarization Using A Single Pre-trained Transformer Khandelwal Urvashi, Clark Kevin, Jurafsky Dan, Kaiser Lukasz
- Learning To Answer By Learning To Ask: Getting The Best Of GPT-2 And BERT Worlds Klein Tassilo, Nabi Moin
- Revealing The Dark Secrets Of BERT Kovaleva Olga, Romanov Alexey, Rogers Anna, Rumshisky Anna
- Question Generation From Paragraphs: A Tale Of Two Hierarchical Models Kumar Vishwajeet, Chaki Raktim, Talluri Sai Teja, Ramakrishnan Ganesh, Li Yuan-fang, Haffari Gholamreza
- What Would Elsa Do? Freezing Layers During Transformer Fine-tuning Lee Jaejun, Tang Raphael, Lin Jimmy
- Bertsel: Answer Selection With Pre-trained Models Li Dongfang, Yu Yifei, Chen Qingcai, Li Xinyu
- Unicoder-vl: A Universal Encoder For Vision And Language By Cross-modal Pre-training Li Gen, Duan Nan, Fang Yuejian, Gong Ming, Jiang Daxin, Zhou Ming
- Big Bidirectional Insertion Representations For Documents Li Lala, Chan William
- Visualbert: A Simple And Performant Baseline For Vision And Language Li Liunian Harold, Yatskar Mark, Yin Da, Hsieh Cho-jui, Chang Kai-wei
- Relevance-promoting Language Model For Short-text Conversation Li Xin, Li Piji, Bi Wei, Liu Xiaojiang, Lam Wai
- Incremental Transformer With Deliberation Decoder For Document Grounded Conversations Li Zekang, Niu Cheng, Meng Fandong, Feng Yang, Li Qian, Zhou Jie
- Explicit Sentence Compression For Neural Machine Translation Li Zuchao, Wang Rui, Chen Kehai, Utiyama Masao, Sumita Eiichiro, Zhang Zhuosheng, Zhao Hai
- Probing Representations Learned By Multimodal Recurrent And Transformer Models Libovický Jindřich, Madhyastha Pranava
- MKD: A Multi-task Knowledge Distillation Approach For Pretrained Language Models Liu Linqing, Wang Huan, Lin Jimmy, Socher Richard, Xiong Caiming
- Vilbert: Pretraining Task-agnostic Visiolinguistic Representations For Vision-and-language Tasks Lu Jiasen, Batra Dhruv, Parikh Devi, Lee Stefan
- Triplenet: Triple Attention Network For Multi-turn Response Selection In Retrieval-based Chatbots Ma Wentao, Cui Yiming, Shao Nan, He Su, Zhang Wei-nan, Liu Ting, Wang Shijin, Hu Guoping
- Selective Attention For Context-aware Neural Machine Translation Maruf Sameen, Martins André F. T., Haffari Gholamreza
- Emergent Properties Of Finetuned Language Representation Models Matton Alexandre, De Oliveira Luke
- Structured Pruning Of A Bert-based Question Answering Model Mccarley J. S., Chakravarti Rishav, Sil Avirup
- Self-attentional Models Application In Task-oriented Dialogue Generation Systems Mehrjardi Mansour Saffar, Trabelsi Amine, Zaiane Osmar R.
- Mogrifier LSTM Melis Gábor, Kočiský Tomáš, Blunsom Phil
- Megatron-lm: Training Multi-billion Parameter Language Models Using Model Parallelism Mohammad Shoeybi, Mostofa Patwary, Raul Puri, Patrick Legresley, Jared Casper, Bryan Catanzaro
- Interrogating The Explanatory Power Of Attention In Neural Machine Translation Moradi Pooya, Kambhatla Nishant, Sarkar Anoop
- Neural Assistant: Joint Action Prediction, Response Generation, And Latent Knowledge Reasoning Neelakantan Arvind, Yavuz Semih, Narang Sharan, Prasad Vishaal, Goodrich Ben, Duckworth Daniel, Sankar Chinnadhurai, Yan Xifeng
- Efficient Attention Mechanism For Visual Dialog That Can Handle All The Interactions Between Multiple Inputs Nguyen Van-quang, Suganuma Masanori, Okatani Takayuki
- Dlgnet: A Transformer-based Model For Dialogue Response Generation Olabiyi Oluwatobi, Mueller Erik T.
- Stabilizing Transformers For Reinforcement Learning Parisotto Emilio, Song H. Francis, Rae Jack W., Pascanu Razvan, Gulcehre Caglar, Jayakumar Siddhant M., Jaderberg Max, Kaufman Raphael Lopez, Clark Aidan, Noury Seb, Botvinick Matthew M., Heess Nicolas, Hadsell Raia
- Thisiscompetition At Semeval-2019 Task 9: BERT Is Unstable For Out-of-domain Samples Park Cheoneum, Kim Juae, Lee Hyeon-gu, Amplayo Reinald Kim, Kim Harksoo, Seo Jungyun, Lee Changki
- Learning To Deceive With Attention-based Explanations Pruthi Danish, Gupta Mansi, Dhingra Bhuwan, Neubig Graham, Lipton Zachary C.
- Blockwise Self-attention For Long Document Understanding Qiu Jiezhong, Ma Hao, Levy Omer, Yih Scott Wen-tau, Wang Sinong, Tang Jie
- Option Comparison Network For Multiple-choice Reading Comprehension Ran Qiu, Li Peng, Hu Weiwei, Zhou Jie
- Leveraging Pre-trained Checkpoints For Sequence Generation Tasks Rothe Sascha, Narayan Shashi, Severyn Aliaksei
- Hierarchical Reinforcement Learning For Open-domain Dialog Saleh Abdelrhman, Jaques Natasha, Ghandeharioun Asma, Shen Judy Hanwen, Picard Rosalind
- Do Neural Dialog Systems Use The Conversation History Effectively? An Empirical Study Sankar Chinnadhurai, Subramanian Sandeep, Pal Christopher, Chandar Sarath, Bengio Yoshua
- Factor Graph Attention Schwartz Idan, Yu Seunghak, Hazan Tamir, Schwing Alexander
- Scene-based Factored Attention For Image Captioning Shen Chen, Ji Rongrong, Chen Fuhai, Sun Xiaoshuai, Li Xiangming
- The Evolved Transformer So David R., Liang Chen, Le Quoc V.
- Natural Language Generation Using Reinforcement Learning With External Rewards Srinivasan Vidhushini, Santhanam Sashank, Shaikh Samira
- On NMT Search Errors And Model Errors: Cat Got Your Tongue? Stahlberg Felix, Byrne Bill
- Cued@wmt19:ewc&lms Stahlberg Felix, Saunders Danielle, De Gispert Adria, Byrne Bill
- Insertion Transformer: Flexible Sequence Generation Via Insertion Operations Stern Mitchell, Chan William, Kiros Jamie, Uszkoreit Jakob
- Sesamebert: Attention For Anywhere Su Ta-chun, Cheng Hsiang-chih
- VL-BERT: Pre-training Of Generic Visual-linguistic Representations Su Weijie, Zhu Xizhou, Cao Yue, Li Bin, Lu Lewei, Wei Furu, Dai Jifeng
- Adaptive Attention Span In Transformers Sukhbaatar Sainbayar, Grave Edouard, Bojanowski Piotr, Joulin Armand
- Augmenting Self-attention With Persistent Memory Sukhbaatar Sainbayar, Grave Edouard, Lample Guillaume, Jegou Herve, Joulin Armand
- Generating Diverse Translation By Manipulating Multi-head Attention Sun Zewei, Huang Shujian, Wei Hao-ran, Dai Xin-yu, Chen Jiajun
- Syntax-infused Transformer And BERT Models For Machine Translation And Natural Language Understanding Sundararaman Dhanasekar, Subramanian Vivek, Wang Guoyin, Si Shijing, Shen Dinghan, Wang Dong, Carin Lawrence
- LXMERT: Learning Cross-modality Encoder Representations From Transformers Tan Hao, Bansal Mohit
- Waldorf: Wasteless Language-model Distillation On Reading-comprehension Tian James Yi, Kreuzer Alexander P., Chen Pai-hung, Will Hans-martin
- Repurposing Entailment For Multi-hop Question Answering Tasks Trivedi Harsh, Kwon Heeyoung, Khot Tushar, Sabharwal Ashish, Balasubramanian Niranjan
- Tree-structured Semantic Encoder With Knowledge Sharing For Domain Adaptation In Natural Language Generation Tseng Bo-hsiang, Budzianowski Paweł, Wu Yen-chen, Gašić Milica
- How Does BERT Answer Questions? A Layer-wise Analysis Of Transformer Representations Van Aken Betty, Winter Benjamin, Löser Alexander, Gers Felix A.
- Quantity Doesn't Buy Quality Syntax With Neural Language Models Van Schijndel Marten, Mueller Aaron, Linzen Tal
- Visualizing Attention In Transformer-based Language Representation Models Vig Jesse
- Training Optimus Prime, M.D.: Generating Medical Certification Items By Fine-tuning Openai's Gpt2 Transformer Model Von Davier Matthias
- Language Models With Transformers Wang Chenguang, Li Mu, Smola Alexander J.
- Tree Transformer: Integrating Tree Structures Into Self-attention Wang Yau-shian, Lee Hung-yi, Chen Yun-nung
- Video Question Generation Via Cross-modal Self-attention Networks Learning Wang Yu-siang, Su Hung-ting, Chang Chen-hsi, Liu Zhe-yu, Hsu Winston H.
- Adapting And Evaluating A Deep Learning Language Model For Clinical Why-question Answering Wen Andrew, Elwazir Mohamed Y., Moon Sungrim, Fan Jungwei
- Improving Neural Machine Translation With Pre-trained Representation Weng Rongxiang, Yu Heng, Huang Shujian, Luo Weihua, Chen Jiajun
- Attention Is Not Not Explanation Wiegreffe Sarah, Pinter Yuval
- Huggingface's Transformers: State-of-the-art Natural Language Processing Wolf Thomas, Debut Lysandre, Sanh Victor, Chaumond Julien, Delangue Clement, Moi Anthony, Cistac Pierric, Rault Tim, Louf Rémi, Funtowicz Morgan, Davison Joe, Shleifer Sam, Von Platen Patrick, Ma Clara, Jernite Yacine, Plu Julien, Xu Canwen, Scao Teven Le, Gugger Sylvain, Drame Mariama, Lhoest Quentin, Rush Alexander M.
- Transfertransfo: A Transfer Learning Approach For Neural Network Based Conversational Agents Wolf Thomas, Sanh Victor, Chaumond Julien, Delangue Clement
- Transformer-based Cascaded Multimodal Speech Translation Wu Zixiu, Caglayan Ozan, Ive Julia, Wang Josiah, Specia Lucia
- Sharing Attention Weights For Fast Transformer Xiao Tong, Li Yinqiao, Zhu Jingbo, Yu Zhengtao, Liu Tongran
- Towards Making The Most Of BERT In Neural Machine Translation Yang Jiacheng, Wang Mingxuan, Zhou Hao, Zhao Chengqi, Yu Yong, Zhang Weinan, Li Lei
- Deepening Hidden Representations From Pre-trained Language Models Yang Junjie, Zhao Hai
- Knowledge-grounded Response Generation With Deep Attentional Latent-variable Model Ye Hao-tong, Lo Kai-ling, Su Shang-yu, Chen Yun-nung
- Bp-transformer: Modelling Long-range Context Via Binary Partitioning Ye Zihao, Guo Qipeng, Gan Quan, Qiu Xipeng, Zhang Zheng
- Multimodal Unified Attention Networks For Vision-and-language Interactions Yu Zhou, Cui Yuhao, Yu Jun, Tao Dacheng, Tian Qi
- Q8BERT: Quantized 8bit BERT Zafrir Ofir, Boudoukh Guy, Izsak Peter, Wasserblat Moshe
- Context-aware Visual Policy Network For Fine-grained Image Captioning Zha Zheng-jun, Liu Daqing, Zhang Hanwang, Zhang Yongdong, Wu Feng
- Recosa: Detecting The Relevant Contexts With Self-attention For Multi-turn Dialogue Generation Zhang Hainan, Lan Yanyan, Pang Liang, Guo Jiafeng, Cheng Xueqi
- PEGASUS: Pre-training With Extracted Gap-sentences For Abstractive Summarization Zhang Jingqing, Zhao Yao, Saleh Mohammad, Liu Peter J.
- Dialogpt: Large-scale Generative Pre-training For Conversational Response Generation Zhang Yizhe, Sun Siqi, Galley Michel, Chen Yen-chun, Brockett Chris, Gao Xiang, Gao Jianfeng, Liu Jingjing, Dolan Bill
- MUSE: Parallel Multi-scale Attention For Sequence To Sequence Learning Zhao Guangxiang, Sun Xu, Xu Jingjing, Zhang Zhiyuan, Luo Liangchen
- Synchronous Bidirectional Neural Machine Translation Zhou Long, Zhang Jiajun, Zong Chengqing
- Unified Vision-language Pre-training For Image Captioning And VQA Zhou Luowei, Palangi Hamid, Zhang Lei, Hu Houdong, Corso Jason J., Gao Jianfeng
- Encoder-agnostic Adaptation For Conditional Language Generation Ziegler Zachary M., Melas-kyriazi Luke, Gehrmann Sebastian, Rush Alexander M.
- Paranoid Transformer: Reading Narrative Of Madness As Computational Approach To Creativity Agafonova Yana, Tikhonov Alexey, Yamshchikov Ivan P.
- Accelerating Natural Language Understanding In Task-oriented Dialog Ahuja Ojas, Desai Shrey
- Optimizing Transformer For Low-resource Neural Machine Translation Araabi Ali, Monz Christof
- Stress Test Evaluation Of Transformer-based Models In Natural Language Understanding Tasks Aspillaga Carlos, Carvallo Andrés, Araujo Vladimir
- Segatron: Segment-aware Transformer For Language Modeling And Understanding Bai He, Shi Peng, Lin Jimmy, Xie Yuqing, Tan Luchen, Xiong Kun, Gao Wen, Li Ming
- Character-level Transformer-based Neural Machine Translation Banar Nikolay, Daelemans Walter, Kestemont Mike
- Unilmv2: Pseudo-masked Language Models For Unified Language Model Pre-training Bao Hangbo, Dong Li, Wei Furu, Wang Wenhui, Yang Nan, Liu Xiaodong, Wang Yu, Piao Songhao, Gao Jianfeng, Zhou Ming, Hon Hsiao-wuen
- Co-attentional Transformers For Story-based Video Understanding Bebensee Björn, Zhang Byoung-tak
- Longformer: The Long-document Transformer Beltagy Iz, Peters Matthew E., Cohan Arman
- Adaptive Transformers For Learning Multimodal Representations Bhargava Prajjwal
- PALM: Pre-training An Autoencoding&autoregressive Language Model For Context-conditioned Generation Bi Bin, Li Chenliang, Wu Chen, Yan Ming, Wang Wei, Huang Songfang, Huang Fei, Si Luo
- Chatbot Interaction With Artificial Intelligence: Human Data Augmentation With T5 And Language Transformer Ensemble For Text Classification Bird Jordan J., Ekárt Anikó, Faria Diego R.
- Advances Of Transformer-based Models For News Headline Generation Bukhtiyarov Alexey, Gusev Ilya
- DIET: Lightweight Language Understanding For Dialogue Systems Bunk Tanja, Varshneya Daksh, Vlasov Vladimir, Nichol Alan
- Memory Transformer Burtsev Mikhail S., Kuratov Yuri, Peganov Anton, Sapunov Grigory V.
- Behind The Scene: Revealing The Secrets Of Pre-trained Vision-and-language Models Cao Jize, Gan Zhe, Cheng Yu, Yu Licheng, Chen Yen-chun, Liu Jingjing
- Syntax-driven Iterative Expansion Language Models For Controllable Text Generation Casas Noe, Fonollosa José A. R., Costa-jussà Marta R.
- Simultaneous Paraphrasing And Translation By Fine-tuning Transformer Models Chada Rakesh
- Exploring Fluent Query Reformulations With Text-to-text Transformers And Reinforcement Learning Chen Jerry Zikun, Yu Shi, Wang Haoran
- Adaptive Bi-directional Attention: Exploring Multi-granularity Representations For Machine Reading Comprehension Chen Nuo, Liu Fenglin, You Chenyu, Zhou Peilin, Zou Yuexian
- Logical Natural Language Generation From Open-domain Tables Chen Wenhu, Chen Jianshu, Su Yu, Chen Zhiyu, Wang William Yang
- Reasoning In Dialog: Improving Response Generation By Context Reading Comprehension Chen Xiuying, Cui Zhi, Zhang Jiayi, Wei Chen, Cui Jianwei, Wang Bin, Zhao Dongyan, Yan Rui
- Model Selection For Cross-lingual Transfer Chen Yang, Ritter Alan
- Accurate Word Alignment Induction From Neural Machine Translation Chen Yun, Liu Yang, Chen Guanhua, Jiang Xin, Liu Qun
- Comparative Study Of Language Models On Cross-domain Data With Model Agnostic Explainability Chhipa Mayank, Vazurkar Hrushikesh Mahesh, Kumar Abhijeet, Mishra Mridul
- X-LXMERT: Paint, Caption And Answer Questions With Multi-modal Transformers Cho Jaemin, Lu Jiasen, Schwenk Dustin, Hajishirzi Hannaneh, Kembhavi Aniruddha
- MEANTIME: Mixture Of Attention Mechanisms With Multi-temporal Embeddings For Sequential Recommendation Cho Sung Min, Park Eunhyeok, Yoo Sungjoo
- Rethinking Embedding Coupling In Pre-trained Language Models Chung Hyung Won, Févry Thibault, Tsai Henry, Johnson Melvin, Ruder Sebastian
- Transformers As Soft Reasoners Over Language Clark Peter, Tafjord Oyvind, Richardson Kyle
- Pymt5: Multi-mode Translation Of Natural Language And Python Code With Transformers Clement Colin B., Drain Dawn, Timcheck Jonathan, Svyatkovskiy Alexey, Sundaresan Neel
- Unsupervised Explanation Generation For Machine Reading Comprehension Cui Yiming, Liu Ting, Wang Shijin, Hu Guoping
- Cascaded Text Generation With Markov Transformers Deng Yuntian, Rush Alexander M.
- Calibration Of Pre-trained Transformers Desai Shrey, Durrett Greg
- Ernie-doc: A Retrospective Long-document Modeling Transformer Ding Siyu, Shang Junyuan, Wang Shuohuan, Sun Yu, Tian Hao, Wu Hua, Wang Haifeng
- Gshard: Scaling Giant Models With Conditional Computation And Automatic Sharding Dmitry Lepikhin, Hyoukjoong Lee, Yuanzhong Xu, Dehao Chen, Orhan Firat, Yanping Huang, Maxim Krikun, Noam Shazeer, Zhifeng Chen
- Sound Natural: Content Rephrasing In Dialog Systems Einolghozati Arash, Gupta Anchit, Diedrick Keith, Gupta Sonal
- Turngpt: A Transformer-based Language Model For Predicting Turn-taking In Spoken Dialog Ekstedt Erik, Skantze Gabriel
- A Comparison Of LSTM And BERT For Small Corpus Ezen-can Aysu
- The Unreasonable Volatility Of Neural Machine Translation Models Fadaee Marzieh, Monz Christof
- Augmenting Transformers With Knn-based Composite Memory For Dialogue Fan Angela, Gardent Claire, Braud Chloe, Bordes Antoine
- Addressing Some Limitations Of Transformers With Feedback Memory Fan Angela, Lavril Thibaut, Grave Edouard, Joulin Armand, Sukhbaatar Sainbayar
- CERT: Contrastive Self-supervised Learning For Language Understanding Fang Hongchao, Wang Sicheng, Zhou Meng, Ding Jiayuan, Xie Pengtao
- Paragraph-level Commonsense Transformers With Recurrent Memory Gabriel Saadia, Bhagavatula Chandra, Shwartz Vered, Bras Ronan Le, Forbes Maxwell, Choi Yejin
- Compressing Large-scale Transformer-based Models: A Case Study On BERT Ganesh Prakhar, Chen Yao, Lou Xin, Khan Mohammad Ali, Yang Yin, Sajjad Hassan, Nakov Preslav, Chen Deming, Winslett Marianne
- Robust Conversational AI With Grounded Text Generation Gao Jianfeng, Peng Baolin, Li Chunyuan, Li Jinchao, Shayandeh Shahin, Liden Lars, Shum Heung-yeung
- Relevance Transformer: Generating Concise Code Snippets With Relevance Feedback Gemmell Carlos, Rossetto Federico, Dalton Jeffrey
- ISAAQ -- Mastering Textbook Questions With Pre-trained Transformers And Bottom-up And Top-down Attention Gomez-perez Jose Manuel, Ortega Raul
- Recurrent Chunking Mechanisms For Long-text Machine Reading Comprehension Gong Hongyu, Shen Yelong, Yu Dian, Chen Jianshu, Yu Dong
- Measuring Systematic Generalization In Neural Proof Generation With Transformers Gontier Nicolas, Sinha Koustuv, Reddy Siva, Pal Christopher
- Are Neural Open-domain Dialog Systems Robust To Speech Recognition Errors In The Dialog History? An Empirical Study Gopalakrishnan Karthik, Hedayatnia Behnam, Wang Longshaokan, Liu Yang, Hakkani-tur Dilek
- Contextualize Knowledge Bases With Transformer For End-to-end Task-oriented Dialogue Systems Gou Yanjie, Lei Yinjie, Liu Lingqiao, Dai Yong, Shen Chunxu
- Dialogbert: Discourse-aware Response Generation Via Learning To Recover And Rank Utterances Gu Xiaodong, Yoo Kang Min, Ha Jung-woo
- Sequence-level Mixed Sample Data Augmentation Guo Demi, Kim Yoon, Rush Alexander M.
- GMAT: Global Memory Augmentation For Transformers Gupta Ankit, Berant Jonathan
- Improvement Of A Dedicated Model For Open Domain Persona-aware Dialogue Generation Han Qiang
- Ranking Enhanced Dialogue Generation Hao Changying, Pang Liang, Lan Yanyan, Sun Fei, Guo Jiafeng, Cheng Xueqi
- Exploring Explainable Selection To Control Abstractive Summarization Wang Haonan, Gao Yang, Bai Yu, Lapata Mirella, Huang Heyan
- Enriched Pre-trained Transformers For Joint Slot Filling And Intent Detection Hardalov Momchil, Koychev Ivan, Nakov Preslav
- Guided Transformer: Leveraging Multiple External Sources For Representation Learning In Conversational Search Hashemi Helia, Zamani Hamed, Croft W. Bruce
- Investigating Representations Of Verb Bias In Neural Language Models Hawkins Robert D., Yamakoshi Takateru, Griffiths Thomas L., Goldberg Adele E.
- Pretrained Transformers Improve Out-of-distribution Robustness Hendrycks Dan, Liu Xiaoyuan, Wallace Eric, Dziedzic Adam, Krishnan Rishabh, Song Dawn
- PAIR: Planning And Iterative Refinement In Pre-trained Transformers For Long Text Generation Hua Xinyu, Wang Lu
- TRANS-BLSTM: Transformer With Bidirectional LSTM For Language Understanding Huang Zhiheng, Xu Peng, Liang Davis, Mishra Ajay, Xiang Bing
- Exploring BERT Parameter Efficiency On The Stanford Question Answering Dataset V2.0 Hulburd Eric
- Memory Attentive Fusion: External Language Model Integration For Transformer-based Sequence-to-sequence Model Ihori Mana, Masumura Ryo, Makishima Naoki, Tanaka Tomohiro, Takashima Akihiko, Orihashi Shota
- Probing Pretrained Language Models For Lexical Semantics Ivan Vulić, Edoardo Maria Ponti, Robert Litschko, Goran Glavaš, Anna Korhonen
- TLDR: Token Loss Dynamic Reweighting For Reducing Repetitive Utterance Generation Jiang Shaojie, Wolf Thomas, Monz Christof, De Rijke Maarten
- Text-to-text Pre-training For Data-to-text Tasks Kale Mihir, Rastogi Abhinav
- Meta-context Transformers For Domain-specific Response Generation Kar Debanjana, Samanta Suranjana, Azad Amar Prakash
- Do Fine-tuned Commonsense Language Models Really Generalize? Kejriwal Mayank, Shen Ke
- SMRT Chatbots: Improving Non-task-oriented Dialog With Simulated Multiple Reference Training Khayrallah Huda, Sedoc João
- Code Prediction By Feeding Trees To Transformers Kim Seohyun, Zhao Jinman, Tian Yuchi, Chandra Satish
- Which Kind Is Better In Open-domain Multi-turn Dialog, Hierarchical Or Non-hierarchical Models? An Empirical Study Lan Tian, Mao Xian-ling, Wei Wei, Huang Heyan
- Utilizing Bidirectional Encoder Representations From Transformers For Answer Selection Laskar Md Tahmid Rahman, Hoque Enamul, Huang Jimmy Xiangji
- Context-based Transformer Models For Answer Sentence Selection Lauriola Ivano, Moschitti Alessandro
- Common Sense Or World Knowledge? Investigating Adapter-based Knowledge Injection Into Pretrained Transformers Lauscher Anne, Majewska Olga, Ribeiro Leonardo F. R., Gurevych Iryna, Rozanov Nikolai, Glavaš Goran
- From Zero To Hero: On The Limitations Of Zero-shot Cross-lingual Transfer With Multilingual Transformers Lauscher Anne, Ravishankar Vinit, Vulić Ivan, Glavaš Goran
- SLM: Learning A Discourse Language Representation With Sentence Unshuffling Lee Haejun, Hudson Drew A., Lee Kangwook, Manning Christopher D.
- MART: Memory-augmented Recurrent Transformer For Coherent Video Paragraph Captioning Lei Jie, Wang Liwei, Shen Yelong, Yu Dong, Berg Tamara L., Bansal Mohit
- Efficient Transformer-based Large Scale Language Representations Using Hardware-friendly Block Structured Pruning Li Bingbing, Kong Zhenglun, Zhang Tianyun, Li Ji, Li Zhengang, Liu Hang, Ding Caiwen
- Transformers To Learn Hierarchical Contexts In Multiparty Dialogue For Span-based Question Answering Li Changmao, Choi Jinho D.
- A Closer Look At The Robustness Of Vision-and-language Pre-trained Models Li Linjie, Gan Zhe, Liu Jingjing
- An Empirical Investigation Of Pre-trained Transformer Language Models For Open-domain Dialogue Generation Li Piji
- TMT: A Transformer-based Modal Translator For Improving Multimodal Sequence Representations In Audio Visual Scene-aware Dialog Li Wubo, Jiang Dongwei, Zou Wei, Li Xiangang
- Mapping Natural Language Instructions To Mobile UI Action Sequences Li Yang, He Jiacong, Zhou Xin, Zhang Yuan, Baldridge Jason
- Rewriter-evaluator Architecture For Neural Machine Translation Li Yangming, Yao Kaisheng
- Neural Machine Translation With Joint Representation Li Yanyang, Wang Qiang, Xiao Tong, Liu Tongran, Zhu Jingbo
- Toward Interpretability Of Dual-encoder Models For Dialogue Response Suggestions Li Yitong, Li Dianqi, Prakash Sushant, Wang Peng
- Bridging Text And Video: A Universal Multimodal Transformer For Video-audio Scene-aware Dialog Li Zekang, Li Zongjia, Zhang Jinchao, Feng Yang, Niu Cheng, Zhou Jie
- Train Large, Then Compress: Rethinking Model Size For Efficient Training And Inference Of Transformers Li Zhuohan, Wallace Eric, Shen Sheng, Lin Kevin, Keutzer Kurt, Klein Dan, Gonzalez Joseph E.
- Multi-node Bert-pretraining: Cost-efficient Approach Lin Jiahuang, Li Xin, Pekhimenko Gennady
- Conversational Question Reformulation Via Sequence-to-sequence Architectures And Pretrained Language Models Lin Sheng-chieh, Yang Jheng-hong, Nogueira Rodrigo, Tsai Ming-feng, Wang Chuan-ju, Lin Jimmy
- Readonce Transformers: Reusable Representations Of Text For Transformers Lin Shih-ting, Sabharwal Ashish, Khot Tushar
- Variational Transformers For Diverse Response Generation Lin Zhaojiang, Winata Genta Indra, Xu Peng, Liu Zihan, Fung Pascale
- Mt5: A Massively Multilingual Pre-trained Text-to-text Transformer Linting Xue, Noah Constant, Adam Roberts, Mihir Kale, Rami Al-rfou, Aditya Siddhant, Aditya Barua, Colin Raffel
- TIME: Text And Image Mutual-translation Adversarial Networks Liu Bingchen, Song Kunpeng, Zhu Yizhe, De Melo Gerard, Elgammal Ahmed
- Rikinet: Reading Wikipedia Pages For Natural Question Answering Liu Dayiheng, Gong Yeyun, Fu Jie, Yan Yu, Chen Jiusheng, Jiang Daxin, Lv Jiancheng, Duan Nan
- Tell Me How To Ask Again: Question Data Augmentation With Controllable Rewriting In Continuous Space Liu Dayiheng, Gong Yeyun, Fu Jie, Yan Yu, Chen Jiusheng, Lv Jiancheng, Duan Nan, Zhou Ming
- Understanding The Difficulty Of Training Transformers Liu Liyuan, Liu Xiaodong, Gao Jianfeng, Chen Weizhu, Han Jiawei
- Very Deep Transformers For Neural Machine Translation Liu Xiaodong, Duh Kevin, Liu Liyuan, Gao Jianfeng
- Simplifying Paragraph-level Question Generation Via Transformer Language Models Lopez Luis Enrico, Cruz Diane Kathryn, Cruz Jan Christian Blaise, Cheng Charibeth
- VECO: Variable And Flexible Cross-lingual Pre-training For Language Understanding And Generation Luo Fuli, Wang Wei, Liu Jiahao, Liu Yijia, Bi Bin, Huang Songfang, Huang Fei, Si Luo
- Incremental Processing In The Age Of Non-incremental Encoders: An Empirical Assessment Of Bidirectional Models For Incremental NLU Madureira Brielen, Schlangen David
- Gpt-too: A Language-model-first Approach For Amr-to-text Generation Mager Manuel, Astudillo Ramon Fernandez, Naseem Tahira, Sultan Md Arafat, Lee Young-suk, Florian Radu, Roukos Salim
- Verb Knowledge Injection For Multilingual Event Processing Majewska Olga, Vulić Ivan, Glavaš Goran, Ponti Edoardo M., Korhonen Anna
- Natural Language Rationales With Full-stack Visual Reasoning: From Pixels To Semantic Frames To Commonsense Graphs Marasović Ana, Bhagavatula Chandra, Park Jae Sung, Bras Ronan Le, Smith Noah A., Choi Yejin
- Delight: Deep And Light-weight Transformer Mehta Sachin, Ghazvininejad Marjan, Iyer Srinivasan, Zettlemoyer Luke, Hajishirzi Hannaneh
- Wechat Neural Machine Translation Systems For WMT20 Meng Fandong, Yan Jianhao, Liu Yijin, Gao Yuan, Zeng Xianfeng, Zeng Qinsong, Li Peng, Chen Ming, Zhou Jie, Liu Sifan, Zhou Hao
- Lambert: Language And Action Learning Using Multimodal BERT Miyazawa Kazuki, Aoki Tatsuya, Horii Takato, Nagai Takayuki
- Modern Methods For Text Generation Montesinos Dimas Munoz
- On The Stability Of Fine-tuning BERT: Misconceptions, Explanations, And Strong Baselines Mosbach Marius, Andriushchenko Maksym, Klakow Dietrich
- NABU -- Multilingual Graph-based Neural RDF Verbalizer Moussallem Diego, Gnaneshwar Dwaraknath, Ferreira Thiago Castro, Ngomo Axel-cyrille Ngonga
- Axformer: Accuracy-driven Approximation Of Transformers For Faster, Smaller And More Accurate NLP Models Nagarajan Amrit, Sen Sanchari, Stevens Jacob R., Raghunathan Anand
- Stepwise Extractive Summarization And Planning With Structured Transformers Narayan Shashi, Maynez Joshua, Adamek Jakub, Pighin Daniele, Bratanič Blaž, Mcdonald Ryan
- I Like Fish, Especially Dolphins: Addressing Contradictions In Dialogue Modeling Nie Yixin, Williamson Mary, Bansal Mohit, Kiela Douwe, Weston Jason
- Real-time Execution Of Large-scale Language Models On Mobile Niu Wei, Kong Zhenglun, Yuan Geng, Jiang Weiwen, Guan Jiexiong, Ding Caiwen, Zhao Pu, Liu Sijia, Ren Bin, Wang Yanzhi
- The Chess Transformer: Mastering Play Using Generative Language Models Noever David, Ciolino Matt, Kalin Josh
- Better Distractions: Transformer-based Distractor Generation And Multiple Choice Question Filtering Offerijns Jeroen, Verberne Suzan, Verhoef Tessa
- Facts2story: Controlling Text Generation By Key Facts Orbach Eyal, Goldberg Yoav
- Challenges And Thrills Of Legal Arguments Pallaprolu Anurag, Vaidya Radha, Attawar Aditya Swaroop
- X-linear Attention Networks For Image Captioning Pan Yingwei, Yao Ting, Li Yehao, Mei Tao
- On Task-level Dialogue Composition Of Generative Transformer Model Parthasarathi Prasanna, Neelakantan Arvind, Narang Sharan
- How To Evaluate Your Dialogue System: Probe Tasks As An Alternative For Token-level Evaluation Metrics Parthasarathi Prasanna, Pineau Joelle, Chandar Sarath
- Telling Bert's Full Story: From Local Attention To Global Aggregation Pascual Damian, Brunner Gino, Wattenhofer Roger
- SOLOIST: Building Task Bots At Scale With Transfer Learning And Machine Teaching Peng Baolin, Li Chunyuan, Li Jinchao, Shayandeh Shahin, Liden Lars, Gao Jianfeng
- What Does BERT Know About Books, Movies And Music? Probing BERT For Conversational Recommendation Penha Gustavo, Hauff Claudia
- Adapterhub: A Framework For Adapting Transformers Pfeiffer Jonas, Rücklé Andreas, Poth Clifton, Kamath Aishwarya, Vulić Ivan, Ruder Sebastian, Cho Kyunghyun, Gurevych Iryna
- Sparsifying Transformer Models With Trainable Representation Pooling Pietruszka Michał, Borchmann Łukasz, Garncarek Łukasz
- Interactive Teaching For Conversational AI Ping Qing, Niu Feiyang, Thattai Govind, Chengottusseriyil Joel, Gao Qiaozi, Reganti Aishwarya, Rajagopal Prashanth, Tur Gokhan, Hakkani-tur Dilek, Nataraja Prem
- Compressing Transformer-based Semantic Parsing Models Using Compositional Code Embeddings Prakash Prafull, Shashidhar Saurabh Kumar, Zhao Wenlong, Rongali Subendhu, Khan Haidar, Kayser Michael
- Training Large Neural Networks With Constant Memory Using A New Execution Algorithm Pudipeddi Bharadwaj, Mesmakhosroshahi Maral, Xi Jinwen, Bharadwaj Sujeeth
- Prophetnet: Predicting Future N-gram For Sequence-to-sequence Pre-training Qi Weizhen, Yan Yu, Gong Yeyun, Liu Dayiheng, Duan Nan, Chen Jiusheng, Zhang Ruofei, Zhou Ming
- Automatic Distractor Generation For Multiple Choice Questions In Standard Tests Qiu Zhaopeng, Wu Xian, Fan Wei
- How Fine Can Fine-tuning Be? Learning Efficient Language Models Radiya-dixit Evani, Wang Xin
- Exploring Recurrent, Memory And Attention Based Architectures For Scoring Interactional Aspects Of Human-machine Text Dialog Ramanarayanan Vikram, Mulholland Matthew, Ghosh Debanjan
- Transquest At WMT2020: Sentence-level Direct Assessment Ranasinghe Tharindu, Orasan Constantin, Mitkov Ruslan
- Hierarchical GPT With Congruent Transformers For Multi-sentence Language Models Roh Jihyeon, Gim Huiseong, Lee Soo-young
- Knowledge-aware Language Model Pretraining Rosset Corby, Xiong Chenyan, Phan Minh, Song Xia, Bennett Paul, Tiwary Saurabh
- Stronger Transformers For Neural Multi-hop Question Generation Sachan Devendra Singh, Wu Lingfei, Sachan Mrinmaya, Hamilton William
- Prover: Proof Generation For Interpretable Reasoning Over Rules Saha Swarnadeep, Ghosh Sayan, Srivastava Shashank, Bansal Mohit
- On The Effect Of Dropping Layers Of Pre-trained Transformer Models Sajjad Hassan, Dalvi Fahim, Durrani Nadir, Nakov Preslav
- Hierarchical Transformer For Task Oriented Dialog Systems Santra Bishal, Anusha Potnuru, Goyal Pawan
- Look Before You Speak: Visually Contextualized Utterances Seo Paul Hongsuck, Nagrani Arsha, Schmid Cordelia
- End-to-end Synthetic Data Generation For Domain Adaptation Of Question Answering Systems Shakeri Siamak, Santos Cicero Nogueira Dos, Zhu Henry, Ng Patrick, Nan Feng, Wang Zhiguo, Nallapati Ramesh, Xiang Bing
- Data-efficient Methods For Dialogue Systems Shalyminov Igor
- Hybrid Generative-retrieval Transformers For Dialogue Domain Adaptation Shalyminov Igor, Sordoni Alessandro, Atkinson Adam, Schulz Hannes
- Russiansuperglue: A Russian Language Understanding Evaluation Benchmark Shavrina Tatiana, Fenogenova Alena, Emelyanov Anton, Shevelev Denis, Artemova Ekaterina, Malykh Valentin, Mikhailov Vladislav, Tikhonova Maria, Chertok Andrey, Evlampiev Andrey
- A Simple But Tough-to-beat Data Augmentation Approach For Natural Language Understanding And Generation Shen Dinghan, Zheng Mingzhi, Shen Yelong, Qu Yanru, Chen Weizhu
- Pre-trained Summarization Distillation Shleifer Sam, Rush Alexander M.
- Saying No Is An Art: Contextualized Fallback Responses For Unanswerable Dialogue Queries Shrivastava Ashish, Dhole Kaustubh, Bhatt Abhinav, Raghunath Sharvani
- Unnatural Language Inference Sinha Koustuv, Parthasarathi Prasanna, Pineau Joelle, Williams Adina
- Neural Machine Translation With Error Correction Song Kaitao, Tan Xu, Lu Jianfeng
- Bert-hlstms: BERT And Hierarchical Lstms For Visual Storytelling Su Jing, Dai Qingyun, Guerin Frank, Zhou Mian
- Discovering Useful Sentence Representations From Large Pretrained Language Models Subramani Nishant, Suresh Nivedita
- Multi-scale Transformer Language Models Subramanian Sandeep, Collobert Ronan, Ranzato Marc'aurelio, Boureau Y-lan
- Explain And Improve: Lrp-inference Fine-tuning For Image Captioning Models Sun Jiamei, Lapuschkin Sebastian, Samek Wojciech, Binder Alexander
- Intellicode Compose: Code Generation Using Transformer Svyatkovskiy Alexey, Deng Shao Kun, Fu Shengyu, Sundaresan Neel
- Proofwriter: Generating Implications, Proofs, And Abductive Statements Over Natural Language Tafjord Oyvind, Mishra Bhavana Dalvi, Clark Peter
- Edgebert: Sentence-level Energy Optimizations For Latency-aware Multi-task NLP Inference Tambe Thierry, Hooper Coleman, Pentecost Lillian, Jia Tianyu, Yang En-yu, Donato Marco, Sanh Victor, Whatmough Paul N., Rush Alexander M., Brooks David, Wei Gu-yeon
- Synthesizer: Rethinking Self-attention In Transformer Models Tay Yi, Bahri Dara, Metzler Donald, Juan Da-cheng, Zhao Zhe, Zheng Che
- On Optimal Transformer Depth For Low-resource Language Translation Van Biljon Elan, Pretorius Arnu, Kreutzer Julia
- Multi-task Learning With Multi-head Attention For Multi-choice Reading Comprehension Wan Hui
- HAT: Hardware-aware Transformers For Efficient Natural Language Processing Wang Hanrui, Wu Zhanghao, Liu Zhijian, Cai Han, Zhu Ligeng, Gan Chuang, Han Song
- SRQA: Synthetic Reader For Factoid Question Answering Wang Jiuniu, Xu Wenjia, Fu Xingyu, Wei Yang, Jin Li, Chen Ziyan, Xu Guangluan, Wu Yirong
- Encoding Syntactic Knowledge In Transformer Encoder For Intent Detection And Slot Filling Wang Jixuan, Wei Kai, Radfar Martin, Zhang Weiwei, Chung Clement
- Minilm: Deep Self-attention Distillation For Task-agnostic Compression Of Pre-trained Transformers Wang Wenhui, Wei Furu, Dong Li, Bao Hangbo, Yang Nan, Zhou Ming
- Rethinking The Value Of Transformer Components Wang Wenxuan, Tu Zhaopeng
- Lightseq: A High Performance Inference Library For Transformers Wang Xiaohui, Xiong Ying, Wei Yang, Wang Mingxuan, Li Lei
- VD-BERT: A Unified Vision And Dialog Transformer With BERT Wang Yue, Joty Shafiq, Lyu Michael R., King Irwin, Xiong Caiming, Hoi Steven C. H.
- Towards Faithful Neural Table-to-text Generation With Content-matching Constraints Wang Zhenyi, Wang Xiaoyang, An Bang, Yu Dong, Chen Changyou
- Contextual Neural Machine Translation Improves Translation Of Cataphoric Pronouns Wong Kayyen, Maruf Sameen, Haffari Gholamreza
- Cort: Complementary Rankings From Transformers Wrzalik Marco, Krechel Dirk
- A Controllable Model Of Grounded Response Generation Wu Zeqiu, Galley Michel, Brockett Chris, Zhang Yizhe, Gao Xiang, Quirk Chris, Koncel-kedziorski Rik, Gao Jianfeng, Hajishirzi Hannaneh, Ostendorf Mari, Dolan Bill
- Lite Transformer With Long-short Range Attention Wu Zhanghao, Liu Zhijian, Lin Ji, Lin Yujun, Han Song
- Context-guided BERT For Targeted Aspect-based Sentiment Analysis Wu Zhengxuan, Ong Desmond C.
- EDITOR: An Edit-based Transformer With Repositioning For Neural Machine Translation With Soft Lexical Constraints Xu Weijia, Carpuat Marine
- Multi-unit Transformers For Neural Machine Translation Yan Jianhao, Meng Fandong, Zhou Jie
- Off-policy Self-critical Training For Transformer In Visual Paragraph Generation Yan Shiyang, Hua Yang, Robertson Neil M.
- Just Ask: Learning To Answer Questions From Millions Of Narrated Videos Yang Antoine, Miech Antoine, Sivic Josef, Laptev Ivan, Schmid Cordelia
- Seqdialn: Sequential Visual Dialog Networks In Joint Visual-linguistic Representation Space Yang Liu
- IART: Intent-aware Response Ranking With Transformers In Information-seeking Conversation Systems Yang Liu, Qiu Minghui, Qu Chen, Chen Cen, Guo Jiafeng, Zhang Yongfeng, Croft W. Bruce, Chen Haiqing
- On The Generation Of Medical Dialogues For COVID-19 Yang Wenmian, Zeng Guangtao, Tan Bowen, Ju Zeqian, Chakravorty Subrato, He Xuehai, Chen Shu, Yang Xingyi, Wu Qingyang, Yu Zhou, Xing Eric, Xie Pengtao
- On The Sub-layer Functionalities Of Transformer Decoder Yang Yilin, Wang Longyue, Shi Shuming, Tadepalli Prasad, Lee Stefan, Tu Zhaopeng
- Contrastive Triple Extraction With Generative Transformer Ye Hongbin, Zhang Ningyu, Deng Shumin, Chen Mosha, Tan Chuanqi, Huang Fei, Chen Huajun
- Adding Recurrence To Pretrained Transformers For Improved Efficiency And Context Size Yoshida Davis, Ettinger Allyson, Gimpel Kevin
- Hard-coded Gaussian Attention For Neural Machine Translation You Weiqiu, Sun Simeng, Iyyer Mohit
- Assessing Phrasal Representation And Composition In Transformers Yu Lang, Ettinger Allyson
- Big Bird: Transformers For Longer Sequences Zaheer Manzil, Guruganesh Guru, Dubey Avinava, Ainslie Joshua, Alberti Chris, Ontanon Santiago, Pham Philip, Ravula Anirudh, Wang Qifan, Yang Li, Ahmed Amr
- Emptransfo: A Multi-head Transformer Architecture For Creating Empathetic Dialog Systems Zandie Rohola, Mahoor Mohammad H.
- Open-domain Dialogue Generation Based On Pre-trained Language Models Zeng Yan, Nie Jian-yun
- A Simple And Efficient Multi-task Learning Approach For Conditioned Dialogue Generation Zeng Yan, Nie Jian-yun
- Token Drop Mechanism For Neural Machine Translation Zhang Huaao, Qiu Shigui, Duan Xiangyu, Zhang Min
- Accelerating Training Of Transformer-based Language Models With Progressive Layer Dropping Zhang Minjia, He Yuxiong
- Future-guided Incremental Transformer For Simultaneous Translation Zhang Shaolei, Feng Yang, Li Liangyou
- When Do You Need Billions Of Words Of Pretraining Data? Zhang Yian, Warstadt Alex, Li Haau-sing, Bowman Samuel R.
- POINTER: Constrained Progressive Text Generation Via Insertion-based Generative Pre-training Zhang Yizhe, Wang Guoyin, Li Chunyuan, Gan Zhe, Brockett Chris, Dolan Bill
- BERT-JAM: Boosting Bert-enhanced Neural Machine Translation With Joint Attention Zhang Zhebin, Wu Sai, Jiang Dawei, Chen Gang
- Know What You Don't Need: Single-shot Meta-pruning For Attention Heads Zhang Zhengyan, Qi Fanchao, Liu Zhiyuan, Liu Qun, Sun Maosong
- Towards Making The Most Of Context In Neural Machine Translation Zheng Zaixiang, Yue Xiang, Huang Shujian, Chen Jiajun, Birch Alexandra
- Pre-training Text-to-text Transformers For Concept-centric Common Sense Zhou Wangchunshu, Lee Dong-ho, Selvam Ravi Kiran, Lee Seyeon, Lin Bill Yuchen, Ren Xiang
- Modifying Memories In Transformer Models Zhu Chen, Rawat Ankit Singh, Zaheer Manzil, Bhojanapalli Srinadh, Li Daliang, Yu Felix, Kumar Sanjiv
- Vision-dialog Navigation By Exploring Cross-modal Memory Zhu Yi, Zhu Fengda, Zhan Zhaohuan, Lin Bingqian, Jiao Jianbin, Chang Xiaojun, Liang Xiaodan
- Regularizing Transformers With Deep Probabilistic Layers Aguilera Aurora Cobo, Olmos Pablo Martínez, Artés-rodríguez Antonio, Pérez-cruz Fernando
- Goal-directed Story Generation: Augmenting Generative Language Models With Reinforcement Learning Alabdulkarim Amal, Li Winston, Martin Lara J., Riedl Mark O.
- On The Multilingual Capabilities Of Very Large-scale English Language Models Armengol-estapé Jordi, Bonet Ona De Gibert, Melero Maite
- See, Hear, Read: Leveraging Multimodality With Guided Attention For Abstractive Text Summarization Atri Yash Kumar, Pramanick Shraman, Goyal Vikram, Chakraborty Tanmoy
- Re-evaluating Germeval17 Using German Pre-trained Language Models Aßenmacher M., Corvonato A., Heumann C.
- G-transformer For Document-level Machine Translation Bao Guangsheng, Zhang Yue, Teng Zhiyang, Chen Boxing, Luo Weihua
- S2s-ft: Fine-tuning Pretrained Transformer Encoders For Sequence-to-sequence Learning Bao Hangbo, Dong Li, Wang Wenhui, Yang Nan, Wei Furu
- PLATO-XL: Exploring The Large-scale Pre-training Of Dialogue Generation Bao Siqi, He Huang, Wang Fan, Wu Hua, Wang Haifeng, Wu Wenquan, Wu Zhihua, Guo Zhen, Lu Hua, Huang Xinxian, Tian Xin, Xu Xinchao, Lin Yingzhan, Niu Zheng-yu
- Non-autoregressive Translation By Learning Target Categorical Codes Bao Yu, Huang Shujian, Xiao Tong, Wang Dongqi, Dai Xinyu, Chen Jiajun
- NLP-IIS@UT At Semeval-2021 Task 4: Machine Reading Comprehension Using The Long Document Transformer Basafa Hossein, Movahedi Sajad, Ebrahimi Ali, Shakery Azadeh, Faili Heshaam
- Building A Question And Answer System For News Domain Basu Sandipan, Gaddala Aravind, Chetan Pooja, Tiwari Garima, Darapaneni Narayana, Parvathaneni Sadwik, Paduri Anwesh Reddy
- Continual Learning In Multilingual NMT Via Language-specific Embeddings Berard Alexandre
- SHAQ: Single Headed Attention With Quasi-recurrence Bharwani Nashwin, Kushner Warren, Dandona Sangeet, Schreiber Ben
- Leveraging Redundancy In Attention With Reuse Transformers Bhojanapalli Srinadh, Chakrabarti Ayan, Veit Andreas, Lukasik Michal, Jain Himanshu, Liu Frederick, Chang Yin-wen, Kumar Sanjiv
- Latr: Layout-aware Transformer For Scene-text VQA Biten Ali Furkan, Litman Ron, Xie Yusheng, Appalaraju Srikar, Manmatha R.
- Improving Language Models By Retrieving From Trillions Of Tokens Borgeaud Sebastian, Mensch Arthur, Hoffmann Jordan, Cai Trevor, Rutherford Eliza, Millican Katie, Driessche George Van Den, Lespiau Jean-baptiste, Damoc Bogdan, Clark Aidan, Casas Diego De Las, Guy Aurelia, Menick Jacob, Ring Roman, Hennigan Tom, Huang Saffron, Maggiore Loren, Jones Chris, Cassirer Albin, Brock Andy, Paganini Michela, Irving Geoffrey, Vinyals Oriol, Osindero Simon, Simonyan Karen, Rae Jack W., Elsen Erich, Sifre Laurent
- What Changes Can Large-scale Language Models Bring? Intensive Study On Hyperclova: Billions-scale Korean Generative Pretrained Transformers Boseop Kim, Hyoungseok Kim, Sang-woo Lee, Gichang Lee, Donghyun Kwak, Dong Hyeon Jeon, Sunghyun Park, Sungju Kim, Seonhoon Kim, Dongpil Seo, Heungsub Lee, Minyoung Jeong, Sungjae Lee, Minsub Kim, Suk Hyun Ko, Seokhun Kim, Taeyong Park, Jinuk Kim, Soyoung Kang, Na-hyeon Ryu, Kang Min Yoo, Minsuk Chang, Soobin Suh, Sookyo In, Jinseong Park, Kyungduk Kim, Hiun Kim, Jisu Jeong, Yong Goo Yeo, Donghoon Ham, Dongju Park, Min Young Lee, Jaewook Kang, Inho Kang, Jung-woo Ha, Woomyoung Park, Nako Sung
- CDLM: Cross-document Language Modeling Caciularu Avi, Cohan Arman, Beltagy Iz, Peters Matthew E., Cattan Arie, Dagan Ido
- Long Context Question Answering Via Supervised Contrastive Learning Caciularu Avi, Dagan Ido, Goldberger Jacob, Cohan Arman
- Can Transformers Jump Around Right In Natural Language? Assessing Performance Transfer From SCAN Chaabouni Rahma, Dessì Roberto, Kharitonov Eugene
- Generic Attention-model Explainability For Interpreting Bi-modal And Encoder-decoder Transformers Chefer Hila, Gur Shir, Wolf Lior
- Topic Transferable Table Question Answering Chemmengath Saneem Ahmed, Kumar Vishwajeet, Bharadwaj Samarth, Sen Jaydeep, Canim Mustafa, Chakrabarti Soumen, Gliozzo Alfio, Sankaranarayanan Karthik
- Multimodal Incremental Transformer With Visual Grounding For Visual Dialogue Generation Chen Feilong, Meng Fandong, Chen Xiuyi, Li Peng, Zhou Jie
- Visualgpt: Data-efficient Adaptation Of Pretrained Language Models For Image Captioning Chen Jun, Guo Han, Yi Kai, Li Boyang, Elhoseiny Mohamed
- MT6: Multilingual Pretrained Text-to-text Transformer With Translation Pairs Chi Zewen, Dong Li, Ma Shuming, Huang Shaohan, Mao Xian-ling, Huang Heyan, Wei Furu
- Language Modeling Using Lmus: 10x Better Data Efficiency Or Improved Scaling Compared To Transformers Chilkuri Narsimha, Hunsberger Eric, Voelker Aaron, Malik Gurshaant, Eliasmith Chris
- Enjoy The Salience: Towards Better Transformer-based Faithful Explanations With Word Salience Chrysostomou George, Aletras Nikolaos
- Improving The Faithfulness Of Attention-based Explanations With Task-specific Information For Text Classification Chrysostomou George, Aletras Nikolaos
- CANINE: Pre-training An Efficient Tokenization-free Encoder For Language Representation Clark Jonathan H., Garrette Dan, Turc Iulia, Wieting John
- Long-range Modeling Of Source Code Files With Ewash: Extended Window Access By Syntax Hierarchy Clement Colin B., Lu Shuai, Liu Xiaoyu, Tufano Michele, Drain Dawn, Duan Nan, Sundaresan Neel, Svyatkovskiy Alexey
- Distilling Transformers For Neural Cross-domain Search Clement Colin B., Wu Chen, Drain Dawn, Sundaresan Neel
- Control Prefixes For Parameter-efficient Text Generation Clive Jordan, Cao Kris, Rei Marek
- Training Verifiers To Solve Math Word Problems Cobbe Karl, Kosaraju Vineet, Bavarian Mohammad, Chen Mark, Jun Heewoo, Kaiser Lukasz, Plappert Matthias, Tworek Jerry, Hilton Jacob, Nakano Reiichiro, Hesse Christopher, Schulman John
- Multilingual Multi-aspect Explainability Analyses On Machine Reading Comprehension Models Cui Yiming, Zhang Wei-nan, Che Wanxiang, Liu Ting, Chen Zhigang, Wang Shijin
- YANMTT: Yet Another Neural Machine Translation Toolkit Dabre Raj, Sumita Eiichiro
- Knowledge Neurons In Pretrained Transformers Dai Damai, Dong Li, Hao Yaru, Sui Zhifang, Chang Baobao, Wei Furu
- Mention Memory: Incorporating Textual Knowledge Into Transformers Through Entity Mention Attention De Jong Michiel, Zemlyanskiy Yury, Fitzgerald Nicholas, Sha Fei, Cohen William
- Adapting Monolingual Models: Data Can Be Scarce When Language Similarity Is High De Vries Wietse, Bartelds Martijn, Nissim Malvina, Wieling Martijn
- Multitask Finetuning For Improving Neural Machine Translation In Indian Languages Desai Shaily, Kshirsagar Atharva, Marathe Manisha
- Diagnosing Transformers In Task-oriented Semantic Parsing Desai Shrey, Aly Ahmed
- EKTVQA: Generalized Use Of External Knowledge To Empower Scene Text In Text-vqa Dey Arka Ujjal, Valveny Ernest, Harit Gaurav
- Cogview: Mastering Text-to-image Generation Via Transformers Ding Ming, Yang Zhuoyi, Hong Wenyi, Zheng Wendi, Zhou Chang, Yin Da, Lin Junyang, Zou Xu, Shao Zhou, Yang Hongxia, Tang Jie
- Diverse Pretrained Context Encodings Improve Document Translation Donato Domenic, Yu Lei, Dyer Chris
- On-the-fly Attention Modulation For Neural Generation Dong Yue, Bhagavatula Chandra, Lu Ximing, Hwang Jena D., Bosselut Antoine, Cheung Jackie Chi Kit, Choi Yejin
- Attention Forcing For Machine Translation Dou Qingyun, Lu Yiting, Manakul Potsawee, Wu Xixin, Gales Mark J. F.
- An Empirical Study Of Training End-to-end Vision-and-language Transformers Dou Zi-yi, Xu Yichong, Gan Zhe, Wang Jianfeng, Wang Shuohang, Wang Lijuan, Zhu Chenguang, Zhang Pengchuan, Yuan Lu, Peng Nanyun, Liu Zicheng, Zeng Michael
- Robustly Optimized And Distilled Training For Natural Language Understanding Elfadeel Haytham, Peshterliev Stan
- Transformer-based Conditional Variational Autoencoder For Controllable Story Generation Fang Le, Zeng Tao, Liu Chaochun, Bo Liefeng, Dong Wen, Chen Changyou
- Compressing Visual-linguistic Model Via Knowledge Distillation Fang Zhiyuan, Wang Jianfeng, Hu Xiaowei, Wang Lijuan, Yang Yezhou, Liu Zicheng
- Retrieve, Caption, Generate: Visual Grounding For Enhancing Commonsense In Text Generation Models Feng Steven Y., Lu Kevin, Tao Zhuofu, Alikhani Malihe, Mitamura Teruko, Hovy Eduard, Gangal Varun
- Attention Weights In Transformer NMT Fail Aligning Words Between Sequences But Largely Explain Model Predictions Ferrando Javier, Costa-jussà Marta R.
- Open-domain Conversational Search Assistant With Transformers Ferreira Rafael, Leite Mariana, Semedo David, Magalhaes Joao
- Vision-and-language Or Vision-for-language? On Cross-modal Influence In Multimodal Transformers Frank Stella, Bugliarello Emanuele, Elliott Desmond
- Interactively Providing Explanations For Transformer Language Models Friedrich Felix, Schramowski Patrick, Tauchmann Christopher, Kersting Kristian
- Grounded Graph Decoding Improves Compositional Generalization In Question Answering Gai Yu, Jain Paras, Zhang Wendi, Gonzalez Joseph E., Song Dawn, Stoica Ion
- Does Dialog Length Matter For Next Response Selection Task? An Empirical Study Ganhotra Jatin, Joshi Sachindra
- Chop Chop BERT: Visual Question Answering By Chopping Visualbert's Heads Gao Chenyu, Zhu Qi, Wang Peng, Wu Qi
- Condenser: A Pre-training Architecture For Dense Retrieval Gao Luyu, Callan Jamie
- Scalable Transformers For Neural Machine Translation Gao Peng, Geng Shijie, Qiao Yu, Wang Xiaogang, Dai Jifeng, Li Hongsheng
- Cross-attention Is All You Need: Adapting Pretrained Transformers For Machine Translation Gheini Mozhdeh, Ren Xiang, May Jonathan
- Larger-scale Transformers For Multilingual Masked Language Modeling Goyal Naman, Du Jingfei, Ott Myle, Anantharaman Giri, Conneau Alexis
- Xeroalign: Zero-shot Cross-lingual Transformer Alignment Gritta Milan, Iacobacci Ignacio
- Transaction: ICL-SJTU Submission To Epic-kitchens Action Anticipation Challenge 2021 Gu Xiao, Qiu Jianing, Guo Yao, Lo Benny, Yang Guang-zhong
- Block-skim: Efficient Question Answering For Transformer Guan Yue, Li Zhengyi, Leng Jingwen, Lin Zhouhan, Guo Minyi, Zhu Yuhao
- KAT: A Knowledge Augmented Transformer For Vision-and-language Gui Liangke, Wang Borui, Huang Qiuyuan, Hauptmann Alex, Bisk Yonatan, Gao Jianfeng
- Longt5: Efficient Text-to-text Transformer For Long Sequences Guo Mandy, Ainslie Joshua, Uthus David, Ontanon Santiago, Ni Jianmo, Sung Yun-hsuan, Yang Yinfei
- Memory-efficient Transformers Via Top-\(k\) Attention Gupta Ankit, Dar Guy, Goodman Shaya, Ciprut David, Berant Jonathan
- A Million Tweets Are Worth A Few Points: Tuning Transformers For Customer Service Tasks Hadifar Amir, Labat Sofie, Hoste Véronique, Develder Chris, Demeester Thomas
- Zero-shot Controlled Generation With Encoder-decoder Transformers Hazarika Devamanyu, Namazifar Mahdi, Hakkani-tür Dilek
- ELIT: Emory Language And Information Toolkit He Han, Xu Liyan, Choi Jinho D.
- Generate, Annotate, And Learn: NLP With Synthetic Text He Xuanli, Nassar Islam, Kiros Jamie, Haffari Gholamreza, Norouzi Mohammad
- Multilingual Language Models Predict Human Reading Behavior Hollenstein Nora, Pirovano Federico, Zhang Ce, Jäger Lena, Beinborn Lisa
- ENCONTER: Entity Constrained Progressive Sequence Generation Via Insertion-based Transformer Hsieh Lee-hsun, Lee Yang-yin, Lim Ee-peng
- The Niutrans System For WNGT 2020 Efficiency Task Hu Chi, Li Bei, Lin Ye, Li Yinqiao, Li Yanyang, Wang Chenglong, Xiao Tong, Zhu Jingbo
- Lora: Low-rank Adaptation Of Large Language Models Hu Edward J., Shen Yelong, Wallis Phillip, Allen-zhu Zeyuan, Li Yuanzhi, Wang Shean, Wang Lu, Chen Weizhu
- Investigating Transfer Learning In Multilingual Pre-trained Language Models Through Chinese Natural Language Inference Hu Hai, Zhou He, Tian Zuoyu, Zhang Yiwen, Ma Yina, Li Yanting, Nie Yixin, Richardson Kyle
- Non-autoregressive Translation With Layer-wise Prediction And Deep Supervision Huang Chenyang, Zhou Hao, Zaïane Osmar R., Mou Lili, Li Lei
- Transfer Learning For Sequence Generation: From Single-source To Multi-source Huang Xuancheng, Xu Jingfang, Sun Maosong, Liu Yang
- Unifying Multimodal Transformer For Bi-directional Image And Text Generation Huang Yupan, Xue Hongwei, Liu Bei, Lu Yutong
- SILT: Efficient Transformer Training For Inter-lingual Inference Huertas-tato Javier, Martín Alejandro, Camacho David
- Target-dependent UNITER: A Transformer-based Multimodal Language Comprehension Model For Domestic Service Robots Ishikawa Shintaro, Sugiura Komei
- Scaling Language Models: Methods, Analysis & Insights From Training Gopher Jack W. Rae, Sebastian Borgeaud, Trevor Cai, Katie Millican, Jordan Hoffmann, Francis Song, John Aslanides, Sarah Henderson, Roman Ring, Susannah Young, Eliza Rutherford, Tom Hennigan, Jacob Menick, Albin Cassirer, Richard Powell, George Van Den Driessche, Lisa Anne Hendricks, Maribeth Rauh, Po-sen Huang, Amelia Glaese, Johannes Welbl, Sumanth Dathathri, Saffron Huang, Jonathan Uesato, John Mellor, Irina Higgins, Antonia Creswell, Nat Mcaleese, Amy Wu, Erich Elsen, Siddhant Jayakumar, Elena Buchatskaya, David Budden, Esme Sutherland, Karen Simonyan, Michela Paganini, Laurent Sifre, Lena Martens, Xiang Lorraine Li, Adhiguna Kuncoro, Aida Nematzadeh, Elena Gribovskaya, Domenic Donato, Angeliki Lazaridou, Arthur Mensch, Jean-baptiste Lespiau, Maria Tsimpoukelli, Nikolai Grigorev, Doug Fritz, Thibault Sottiaux, Mantas Pajarskas, Toby Pohlen, Zhitao Gong, Daniel Toyama, Cyprien De Masson D'autume, Yujia Li, Tayfun Terzi, Vladimir Mikulik, Igor Babuschkin, Aidan Clark, Diego De Las Casas, Aurelia Guy, Chris Jones, James Bradbury, Matthew Johnson, Blake Hechtman, Laura Weidinger, Iason Gabriel, William Isaac, Ed Lockhart, Simon Osindero, Laura Rimell, Chris Dyer, Oriol Vinyals, Kareem Ayoub, Jeff Stanway, Lorrayne Bennett, Demis Hassabis, Koray Kavukcuoglu, Geoffrey Irving
- Visqa: X-raying Vision And Language Reasoning In Transformers Jaunet Theo, Kervadec Corentin, Vuillemot Romain, Antipov Grigory, Baccouche Moez, Wolf Christian
- Evaluating Pretrained Transformer Models For Entity Linking In Task-oriented Dialog Jayanthi Sai Muralidhar, Embar Varsha, Raghunathan Karthik
- Discodvt: Generating Long Text With Discourse-aware Discrete Variational Transformer Ji Haozhe, Huang Minlie
- Improving Non-autoregressive Generation With Mixup Training Jiang Ting, Huang Shaohan, Zhang Zihan, Wang Deqing, Zhuang Fuzhen, Wei Furu, Huang Haizhen, Zhang Liangjie, Zhang Qi
- Inducing Transformer's Compositional Generalization Ability Via Auxiliary Sequence Prediction Tasks Jiang Yichen, Bansal Mohit
- Enriching Transformers With Structured Tensor-product Representations For Abstractive Summarization Jiang Yichen, Celikyilmaz Asli, Smolensky Paul, Soulos Paul, Rao Sudha, Palangi Hamid, Fernandez Roland, Smith Caitlin, Bansal Mohit, Gao Jianfeng
- A Good Prompt Is Worth Millions Of Parameters: Low-resource Prompt-based Learning For Vision-language Models Jin Woojeong, Cheng Yu, Shen Yelong, Chen Weizhu, Ren Xiang
- Towards Incremental Transformers: An Empirical Analysis Of Transformer Models For Incremental NLU Kahardipraja Patrick, Madureira Brielen, Schlangen David
- Distilling Large Language Models Into Tiny And Effective Students Using Pqrnn Kaliamoorthi Prabhu, Siddhant Aditya, Li Edward, Johnson Melvin
- AMMUS: A Survey Of Transformer-based Pretrained Models In Natural Language Processing Kalyan Katikapalli Subramanyam, Rajasekharan Ajit, Sangeetha Sivanesan
- What Do Pre-trained Code Models Know About Code? Karmakar Anjan, Robbes Romain
- Finetuning Pretrained Transformers Into Rnns Kasai Jungo, Peng Hao, Zhang Yizhe, Yogatama Dani, Ilharco Gabriel, Pappas Nikolaos, Mao Yi, Chen Weizhu, Smith Noah A.
- Uquad1.0: Development Of An Urdu Question Answering Training Data For Machine Reading Comprehension Kazi Samreen, Khoja Shakeel
- How Transferable Are Reasoning Patterns In VQA? Kervadec Corentin, Jaunet Theo, Antipov Grigory, Baccouche Moez, Vuillemot Romain, Wolf Christian
- Text-free Prosody-aware Generative Spoken Language Modeling Kharitonov Eugene, Lee Ann, Polyak Adam, Adi Yossi, Copet Jade, Lakhotia Kushal, Nguyen Tu-anh, Rivière Morgane, Mohamed Abdelrahman, Dupoux Emmanuel, Hsu Wei-ning
- Learned Token Pruning For Transformers Kim Sehoon, Shen Sheng, Thorsley David, Gholami Amir, Kwon Woosuk, Hassoun Joseph, Keutzer Kurt
- L-verse: Bidirectional Generation Between Image And Text Kim Taehoon, Song Gwangmo, Lee Sihaeng, Kim Sangyun, Seo Yewon, Lee Soonyoung, Kim Seung Hwan, Lee Honglak, Bae Kyunghoon
- BERT Busters: Outlier Dimensions That Disrupt Transformers Kovaleva Olga, Kulshreshtha Saurabh, Rogers Anna, Rumshisky Anna
- On The Significance Of Question Encoder Sequence Model In The Out-of-distribution Performance In Visual Question Answering Kv Gouthaman, Mittal Anurag
- Can Transformer Models Measure Coherence In Text? Re-thinking The Shuffle Test Laban Philippe, Dai Luke, Bandarkar Lucas, Hearst Marti A.
- Layered Gradient Accumulation And Modular Pipeline Parallelism: Fast And Efficient Training Of Large Language Models Lamy-poirier Joel
- Can Transformer Language Models Predict Psychometric Properties? Laverghetta Antonio Jr., Nighojkar Animesh, Mirzakhalov Jamshidbek, Licato John
- You Only Need One Model For Open-domain Question Answering Lee Haejun, Kedia Akhil, Lee Jongwon, Paranjape Ashwin, Manning Christopher D., Woo Kyoung-gu
- When Attention Meets Fast Recurrence: Training Language Models With Reduced Compute Lei Tao
- Quiz-style Question Generation For News Stories Lelkes Adam D., Tran Vinh Q., Yu Cong
- Semvlp: Vision-language Pre-training By Aligning Semantics At Multiple Levels Li Chenliang, Yan Ming, Xu Haiyang, Luo Fuli, Wang Wei, Bi Bin, Huang Songfang
- Residual Tree Aggregation Of Layers For Neural Machine Translation Li Guoliang, Li Yiyang
- Align Before Fuse: Vision And Language Representation Learning With Momentum Distillation Li Junnan, Selvaraju Ramprasaath R., Gotmare Akhilesh Deepak, Joty Shafiq, Xiong Caiming, Hoi Steven
- Personalized Transformer For Explainable Recommendation Li Lei, Zhang Yongfeng, Chen Li
- Ensemble ALBERT On Squad 2.0 Li Shilun, Li Renee, Peng Veronica
- Terapipe: Token-level Pipeline Parallelism For Training Large-scale Language Models Li Zhuohan, Zhuang Siyuan, Guo Shiyuan, Zhuo Danyang, Zhang Hao, Song Dawn, Stoica Ion
- Lyra: A Benchmark For Turducken-style Code Generation Liang Qingyuan, Sun Zeyu, Zhu Qihao, Zhang Wenjie, Yu Lian, Xiong Yingfei, Zhang Lu
- Multimodal Transformer With Variable-length Memory For Vision-and-language Navigation Lin Chuang, Jiang Yi, Cai Jianfei, Qu Lizhen, Haffari Gholamreza, Yuan Zehuan
- Domain-independent User Simulation With Transformers For Task-oriented Dialogue Systems Lin Hsien-chin, Lubis Nurul, Hu Songbo, Van Niekerk Carel, Geishauser Christian, Heck Michael, Feng Shutong, Gašić Milica
- VX2TEXT: End-to-end Learning Of Video-based Text Generation From Multimodal Inputs Lin Xudong, Bertasius Gedas, Wang Jue, Chang Shih-fu, Parikh Devi, Torresani Lorenzo
- Variational Latent-state GPT For Semi-supervised Task-oriented Dialog Systems Liu Hong, Cai Yucheng, Lin Zhenru, Ou Zhijian, Huang Yi, Feng Junlan
- Towards Automated Psychotherapy Via Language Modeling Liu Houjun
- A Three-stage Learning Framework For Low-resource Knowledge-grounded Dialogue Generation Liu Shilei, Zhao Xiaofeng, Li Bochao, Ren Feiliang, Zhang Longhui, Yin Shujuan
- Enriching Non-autoregressive Transformer With Syntactic And Semantic Structures For Neural Machine Translation Liu Ye, Wan Yao, Zhang Jian-guo, Zhao Wenting, Yu Philip S.
- Confidence-aware Scheduled Sampling For Neural Machine Translation Liu Yijin, Meng Fandong, Chen Yufeng, Xu Jinan, Zhou Jie
- Scheduled Sampling Based On Decoding Steps For Neural Machine Translation Liu Yijin, Meng Fandong, Chen Yufeng, Xu Jinan, Zhou Jie
- Augmenting Sequential Recommendation With Pseudo-prior Items Via Reversely Pre-training Transformer Liu Zhiwei, Fan Ziwei, Wang Yu, Yu Philip S.
- Pretrained Transformers As Universal Computation Engines Lu Kevin, Grover Aditya, Abbeel Pieter, Mordatch Igor
- Iconqa: A New Benchmark For Abstract Diagram Understanding And Visual Language Reasoning Lu Pan, Qiu Liang, Chen Jiaqi, Xia Tony, Zhao Yizhou, Zhang Wei, Yu Zhou, Liang Xiaodan, Zhu Song-chun
- Local And Global Context-based Pairwise Models For Sentence Ordering Manku Ruskin Raj, Paul Aditya Jyoti
- Lightweight Cross-lingual Sentence Representation Learning Mao Zhuoyuan, Gupta Prakhar, Wang Pei, Chu Chenhui, Jaggi Martin, Kurohashi Sadao
- \(\infty\)-former: Infinite Memory Transformer Martins Pedro Henrique, Marinho Zita, Martins André F. T.
- Transformer Based Bengali Chatbot Using General Knowledge Dataset Masum Abu Kaisar Mohammad, Abujar Sheikh, Akter Sharmin, Ria Nushrat Jahan, Hossain Syed Akhter
- Unified Questioner Transformer For Descriptive Question Generation In Goal-oriented Visual Dialogue Matsumori Shoya, Shingyouchi Kosuke, Abe Yuki, Fukuchi Yosuke, Sugiura Komei, Imai Michita
- How Much Do Language Models Copy From Their Training Data? Evaluating Linguistic Novelty In Text Generation Using RAVEN Mccoy R. Thomas, Smolensky Paul, Linzen Tal, Gao Jianfeng, Celikyilmaz Asli
- Prevent The Language Model From Being Overconfident In Neural Machine Translation Miao Mengqi, Meng Fandong, Liu Yijin, Zhou Xiao-hua, Zhou Jie
- Structural Analysis Of An All-purpose Question Answering Model Micheli Vincent, Heinrich Quentin, Fleuret François, Belblidia Wacim
- Recent Advances In Natural Language Processing Via Large Pre-trained Language Models: A Survey Min Bonan, Ross Hayley, Sulem Elior, Veyseh Amir Pouran Ben, Nguyen Thien Huu, Sainz Oscar, Agirre Eneko, Heinz Ilana, Roth Dan
- Distantly Supervised Transformers For E-commerce Product QA Mittal Happy, Chakrabarti Aniket, Bayar Belhassen, Sharma Animesh Anant, Rasiwasia Nikhil
- GPT-3 Models Are Poor Few-shot Learners In The Biomedical Domain Moradi Milad, Blagec Kathrin, Haberl Florian, Samwald Matthias
- Arat5: Text-to-text Transformers For Arabic Language Generation Nagoudi El Moatez Billah, Elmadany Abdelrahim, Abdul-mageed Muhammad
- Planning With Learned Entity Prompts For Abstractive Summarization Narayan Shashi, Zhao Yao, Maynez Joshua, Simoes Gonçalo, Nikolaev Vitaly, Mcdonald Ryan
- Hierarchical Transformers Are More Efficient Language Models Nawrot Piotr, Tworkowski Szymon, Tyrolski Michał, Kaiser Łukasz, Wu Yuhuai, Szegedy Christian, Michalewski Henryk
- Improving Transformers With Probabilistic Attention Keys Nguyen Tam, Nguyen Tan M., Le Dung D., Nguyen Duy Khuong, Tran Viet-anh, Baraniuk Richard G., Ho Nhat, Osher Stanley J.
- Materialized Knowledge Bases From Commonsense Transformers Nguyen Tuan-phong, Razniewski Simon
- Sentence-t5: Scalable Sentence Encoders From Pre-trained Text-to-text Models Ni Jianmo, Ábrego Gustavo Hernández, Constant Noah, Ma Ji, Hall Keith B., Cer Daniel, Yang Yinfei
- MLP Architectures For Vision-and-language Modeling: An Empirical Study Nie Yixin, Li Linjie, Gan Zhe, Wang Shuohang, Zhu Chenguang, Zeng Michael, Liu Zicheng, Bansal Mohit, Wang Lijuan
- Investigating The Limitations Of Transformers With Simple Arithmetic Tasks Nogueira Rodrigo, Jiang Zhiying, Lin Jimmy
- Progressive Transformer-based Generation Of Radiology Reports Nooralahzadeh Farhad, Gonzalez Nicolas Perez, Frauenfelder Thomas, Fujimoto Koji, Krauthammer Michael
- Code Generation From Natural Language With Less Prior And More Monolingual Data Norouzi Sajad, Tang Keyi, Cao Yanshuai
- Show Your Work: Scratchpads For Intermediate Computation With Language Models Nye Maxwell, Andreassen Anders Johan, Gur-ari Guy, Michalewski Henryk, Austin Jacob, Bieber David, Dohan David, Lewkowycz Aitor, Bosma Maarten, Luan David, Sutton Charles, Odena Augustus
- What Context Features Can Transformer Language Models Use? O'connor Joe, Andreas Jacob
- Investigating Numeracy Learning Ability Of A Text-to-text Transfer Model Pal Kuntal Kumar, Baral Chitta
- Probing For Bridging Inference In Transformer Language Models Pandit Onkar, Hou Yufang
- Cascading Adaptors To Leverage English Data To Improve Performance Of Question Answering For Low-resource Languages Pandya Hariom A., Ardeshna Bhavik, Bhatt Brijesh S.
- PENELOPIE: Enabling Open Information Extraction For The Greek Language Through Machine Translation Papadopoulos Dimitris, Papadakis Nikolaos, Matsatsinis Nikolaos
- Do Encoder Representations Of Generative Dialogue Models Encode Sufficient Information About The Task? Parthasarathi Prasanna, Pineau Joelle, Chandar Sarath
- Episodic Transformer For Vision-and-language Navigation Pashevich Alexander, Schmid Cordelia, Sun Chen
- A Comparative Study Of Transformer-based Language Models On Extractive Question Answering Pearce Kate, Zhan Tiffany, Komanduri Aneesh, Zhan Justin
- ABC: Attention With Bounded-memory Control Peng Hao, Kasai Jungo, Pappas Nikolaos, Yogatama Dani, Wu Zhaofeng, Kong Lingpeng, Schwartz Roy, Smith Noah A.
- Cotext: Multi-task Learning With Code-text Transformer Phan Long, Tran Hieu, Le Daniel, Nguyen Hieu, Anibal James, Peltekian Alec, Ye Yanfang
- Context Transformer With Stacked Pointer Networks For Conversational Question Answering Over Knowledge Graphs Plepi Joan, Kacupaj Endri, Singh Kuldeep, Thakkar Harsh, Lehmann Jens
- Does Pre-training Induce Systematic Inference? How Masked Language Models Acquire Commonsense Knowledge Porada Ian, Sordoni Alessandro, Cheung Jackie Chi Kit
- Modeling Event Plausibility With Consistent Conceptual Abstraction Porada Ian, Suleman Kaheer, Trischler Adam, Cheung Jackie Chi Kit
- ORCHARD: A Benchmark For Measuring Systematic Generalization Of Multi-hierarchical Reasoning Pung Bill Tuck Weng, Chan Alvin
- Attentive Fine-tuning Of Transformers For Translation Of Low-resourced Languages @loresmt 2021 Puranik Karthik, Hande Adeep, Priyadharshini Ruba, Durairaj Thenmozhi, Sampath Anbukkarasi, Thamburaj Kingston Pal, Chakravarthi Bharathi Raja
- Robertuito: A Pre-trained Language Model For Social Media Text In Spanish Pérez Juan Manuel, Furman Damián A., Alemany Laura Alonso, Luque Franco
- Assessing The Syntactic Capabilities Of Transformer-based Multilingual Language Models Pérez-mayos Laura, García Alba Táboas, Mille Simon, Wanner Leo
- Structural Guidance For Transformer Language Models Qian Peng, Naseem Tahira, Levy Roger, Astudillo Ramón Fernandez
- VT-CLIP: Enhancing Vision-language Models With Visual-guided Texts Qiu Longtian, Zhang Renrui, Guo Ziyu, Zeng Ziyao, Guo Zilu, Li Yafeng, Zhang Guangnan
- Vinmt: Neural Machine Translation Toolkit Quan Nguyen Hoang, Dat Nguyen Thanh, Cong Nguyen Hoang Minh, Van Vinh Nguyen, Vinh Ngo Thi, Thai Nguyen Phuong, Viet Tran Hong
- Teaching Autoregressive Language Models Complex Tasks By Demonstration Recchia Gabriel
- Revisiting Context Choices For Context-aware Machine Translation Rikters Matīss, Nakazawa Toshiaki
- Don't Sweep Your Learning Rate Under The Rug: A Closer Look At Cross-modal Transfer Of Pretrained Transformers Rothermel Danielle, Li Margaret, Rocktäschel Tim, Foerster Jakob
- Grounding Natural Language Instructions: Can Large Language Models Capture Spatial Information? Rozanova Julia, Ferreira Deborah, Dubba Krishna, Cheng Weiwei, Zhang Dell, Freitas Andre
- Challenges In Procedural Multimodal Machine Comprehension: A Novel Way To Benchmark Sahu Pritish, Sikka Karan, Divakaran Ajay
- Rome Was Built In 1776: A Case Study On Factual Correctness In Knowledge-grounded Response Generation Santhanam Sashank, Hedayatnia Behnam, Gella Spandana, Padmakumar Aishwarya, Kim Seokhwan, Liu Yang, Hakkani-tur Dilek
- Towards Neural Functional Program Evaluation Scholak Torsten, Pilault Jonathan, Velez-ginorio Joey
- Large Pre-trained Language Models Contain Human-like Biases Of What Is Right And Wrong To Do Schramowski Patrick, Turan Cigdem, Andersen Nico, Rothkopf Constantin A., Kersting Kristian
- Audio-visual Scene-aware Dialog And Reasoning Using Audio-visual Transformers With Joint Student-teacher Learning Shah Ankit P., Geng Shijie, Gao Peng, Cherian Anoop, Hori Takaaki, Marks Tim K., Roux Jonathan Le, Hori Chiori
- Laviter: Learning Aligned Visual And Textual Representations Assisted By Image And Caption Generation Shaikh Mohammad Abuzar, Ji Zhanghexuan, Moukheiber Dana, Shen Yan, Srihari Sargur, Gao Mingchen
- CPT: A Pre-trained Unbalanced Transformer For Both Chinese Language Understanding And Generation Shao Yunfan, Geng Zhichao, Liu Yitao, Dai Junqi, Yan Hang, Yang Fei, Zhe Li, Bao Hujun, Qiu Xipeng
- Better Neural Machine Translation By Extracting Linguistic Information From BERT Shavarani Hassan S., Sarkar Anoop
- FPM: A Collection Of Large-scale Foundation Pre-trained Language Models Shen Dezhou
- Adapting Long Context NLM For ASR Rescoring In Conversational Agents Shenoy Ashish, Bodapati Sravan, Sunkara Monica, Ronanki Srikanth, Kirchhoff Katrin
- Layer-wise Pruning Of Transformer Attention Heads For Efficient Language Modeling Shim Kyuhong, Choi Iksoo, Sung Wonyong, Choi Jungwook
- Exploring Low-cost Transformer Model Compression For Large-scale Commercial Reply Suggestions Shrivastava Vaishnavi, Gaonkar Radhika, Gupta Shashank, Jha Abhishek
- Structural Persistence In Language Models: Priming As A Window Into Abstract Language Representations Sinclair Arabella, Jumelet Jaap, Zuidema Willem, Fernández Raquel
- Pre-trained Language Models As Prior Knowledge For Playing Text-based Games Singh Ishika, Singh Gargi, Modi Ashutosh
- Semantics-aware Attention Improves Neural Machine Translation Slobodkin Aviv, Choshen Leshem, Abend Omri
- Primer: Searching For Efficient Transformers For Language Modeling So David R., Mańke Wojciech, Liu Hanxiao, Dai Zihang, Shazeer Noam, Le Quoc V.
- Multimodal Integration Of Human-like Attention In Visual Question Answering Sood Ekta, Kögel Fabian, Müller Philipp, Thomas Dominike, Bace Mihai, Bulling Andreas
- VQA-MHUG: A Gaze Dataset To Study Multimodal Neural Attention In Visual Question Answering Sood Ekta, Kögel Fabian, Strohm Florian, Dhar Prajit, Bulling Andreas
- Txt: Crossmodal End-to-end Learning With Transformers Steitz Jan-martin O., Pfeiffer Jonas, Gurevych Iryna, Roth Stefan
- Corelm: Coreference-aware Language Model Fine-tuning Stylianou Nikolaos, Vlahavas Ioannis
- Ernie-tiny: A Progressive Distillation Framework For Pretrained Transformer Compression Su Weiyue, Chen Xuyi, Feng Shikun, Liu Jiaxiang, Liu Weixin, Sun Yu, Tian Hao, Wu Hua, Wang Haifeng
- NVIDIA Nemo Neural Machine Translation Systems For English-german And English-russian News And Biomedical Tasks At WMT21 Subramanian Sandeep, Hrinchuk Oleksii, Adams Virginia, Kuchaiev Oleksii
- Embodied BERT: A Transformer Model For Embodied, Language-guided Visual Task Completion Suglia Alessandro, Gao Qiaozi, Thomason Jesse, Thattai Govind, Sukhatme Gaurav
- Not All Memories Are Created Equal: Learning To Forget By Expiring Sukhbaatar Sainbayar, Ju Da, Poff Spencer, Roller Stephen, Szlam Arthur, Weston Jason, Fan Angela
- Do Long-range Language Models Actually Use Long-range Context? Sun Simeng, Krishna Kalpesh, Mattarella-micke Andrew, Iyyer Mohit
- Lightningdot: Pre-training Visual-semantic Embeddings For Real-time Image-text Retrieval Sun Siqi, Chen Yen-chun, Li Linjie, Wang Shuohang, Fang Yuwei, Liu Jingjing
- Multilingual Translation Via Grafting Pre-trained Language Models Sun Zewei, Wang Mingxuan, Li Lei
- Data Processing Matters: Srph-konvergen Ai's Machine Translation System For WMT'21 Sutawika Lintang, Cruz Jan Christian Blaise
- A Three Step Training Approach With Data Augmentation For Morphological Inflection Szolnok Gabor, Barta Botond, Lakatos Dorina, Acs Judit
- Diff-explainer: Differentiable Convex Optimization For Explainable Multi-hop Inference Thayaparan Mokanarangan, Valentino Marco, Ferreira Deborah, Rozanova Julia, Freitas André
- Towards Reinforcement Learning For Pivot-based Neural Machine Translation With Non-autoregressive Transformer Tokarchuk Evgeniia, Rosendahl Jan, Wang Weiyue, Petrushkov Pavel, Lancewicki Tomer, Khadivi Shahram, Ney Hermann
- Exploring Transformers In Natural Language Generation: GPT, BERT, And Xlnet Topal M. Onat, Bas Anil, Van Heerden Imke
- Transferring Semantic Knowledge Into Language Encoders Umair Mohammad, Ferraro Francis
- Comparing Test Sets With Item Response Theory Vania Clara, Htut Phu Mon, Huang William, Mungra Dhara, Pang Richard Yuanzhe, Phang Jason, Liu Haokun, Cho Kyunghyun, Bowman Samuel R.
- Transformers For Headline Selection For Russian News Clusters Voropaev Pavel, Sopilnyak Olga
- Convfit: Conversational Fine-tuning Of Pretrained Language Models Vulić Ivan, Su Pei-hao, Coope Sam, Gerz Daniela, Budzianowski Paweł, Casanueva Iñigo, Mrkšić Nikola, Wen Tsung-hsien
- Selective Knowledge Distillation For Neural Machine Translation Wang Fusheng, Yan Jianhao, Meng Fandong, Zhou Jie
- Contextualized Scene Imagination For Generative Commonsense Reasoning Wang Peifeng, Zamora Jonathan, Liu Junfeng, Ilievski Filip, Chen Muhao, Ren Xiang
- Lightseq2: Accelerated Training For Transformer-based Models On Gpus Wang Xiaohui, Wei Yang, Xiong Ying, Huang Guyue, Qian Xian, Ding Yufei, Wang Mingxuan, Li Lei
- Codet5: Identifier-aware Unified Pre-trained Encoder-decoder Models For Code Understanding And Generation Wang Yue, Wang Weishi, Joty Shafiq, Hoi Steven C. H.
- Dodrio: Exploring Transformer Models With Interactive Visualization Wang Zijie J., Turko Robert, Chau Duen Horng
- Visual Question Rewriting For Increasing Response Rate Wei Jiayi, Li Xilian, Zhang Yi, Wang Xin
- Memory And Knowledge Augmented Language Models For Inferring Salience In Long-form Stories Wilmot David, Keller Frank
- Graph-free Multi-hop Reading Comprehension: A Select-to-guide Strategy Wu Bohong, Zhang Zhuosheng, Zhao Hai
- Fastformer: Additive Attention Can Be All You Need Wu Chuhan, Wu Fangzhao, Qi Tao, Huang Yongfeng, Xie Xing
- A Multilingual Modeling Method For Span-extraction Reading Comprehension Wu Gaochen, Xu Bin, Chang Dejie, Liu Bangchang
- Not All Attention Is All You Need Wu Hongqiu, Zhao Hai, Zhang Min
- An Explanation Of In-context Learning As Implicit Bayesian Inference Xie Sang Michael, Raghunathan Aditi, Liang Percy, Ma Tengyu
- E2E-VLP: End-to-end Vision-language Pre-training Enhanced By Visual Learning Xu Haiyang, Yan Ming, Li Chenliang, Bi Bin, Huang Songfang, Xiao Wenming, Huang Fei
- Attention-guided Generative Models For Extractive Question Answering Xu Peng, Liang Davis, Huang Zhiheng, Xiang Bing
- K-PLUG: Knowledge-injected Pre-trained Language Model For Natural Language Understanding And Generation In E-commerce Xu Song, Li Haoran, Yuan Peng, Wang Yujia, Wu Youzheng, He Xiaodong, Liu Ying, Zhou Bowen
- Byt5: Towards A Token-free Future With Pre-trained Byte-to-byte Models Xue Linting, Barua Aditya, Constant Noah, Al-rfou Rami, Narang Sharan, Kale Mihir, Roberts Adam, Raffel Colin
- Fastseq: Make Sequence Generation Faster Yan Yu, Hu Fei, Chen Jiusheng, Bhendawade Nikhil, Ye Ting, Gong Yeyun, Duan Nan, Cui Desheng, Chi Bingyu, Zhang Ruofei
- Transformer-based Korean Pretrained Language Models: A Survey On Three Years Of Progress Yang Kichang
- Causal Attention For Vision-language Tasks Yang Xu, Zhang Hanwang, Qi Guojun, Cai Jianfei
- TR-BERT: Dynamic Token Reduction For Accelerating BERT Inference Ye Deming, Lin Yankai, Huang Yufei, Sun Maosong
- Learning To Generate Task-specific Adapters From Task Description Ye Qinyuan, Ren Xiang
- Reconsidering The Past: Optimizing Hidden States In Language Models Yoshida Davis, Gimpel Kevin
- On The Interplay Between Fine-tuning And Composition In Transformers Yu Lang, Ettinger Allyson
- LV-BERT: Exploiting Layer Variety For BERT Yu Weihao, Jiang Zihang, Chen Fei, Hou Qibin, Feng Jiashi
- Prune Once For All: Sparse Pre-trained Language Models Zafrir Ofir, Larey Ariel, Boudoukh Guy, Shen Haihao, Wasserblat Moshe
- Bitfit: Simple Parameter-efficient Fine-tuning For Transformer-based Masked Language-models Zaken Elad Ben, Ravfogel Shauli, Goldberg Yoav
- Topical Language Generation Using Transformers Zandie Rohola, Mahoor Mohammad H.
- Readtwice: Reading Very Large Documents With Memories Zemlyanskiy Yury, Ainslie Joshua, De Jong Michiel, Pham Philip, Eckstein Ilya, Sha Fei
- Wechat Neural Machine Translation Systems For WMT21 Zeng Xianfeng, Liu Yijin, Li Ernan, Ran Qiu, Meng Fandong, Li Peng, Xu Jinan, Zhou Jie
- Improving Stack Overflow Question Title Generation With Copying Enhanced Codebert Model And Bi-modal Information Zhang Fengji, Yu Xiao, Keung Jacky, Li Fuyang, Xie Zhiwen, Yang Zhen, Ma Caoyuan, Zhang Zhimin
- Ernie-vilg: Unified Generative Pre-training For Bidirectional Vision-language Generation Zhang Han, Yin Weichong, Fang Yewei, Li Lanxin, Duan Boqiang, Wu Zhihua, Sun Yu, Tian Hao, Wu Hua, Wang Haifeng
- A Knowledge-grounded Dialog System Based On Pre-trained Language Models Zhang Weijie, Chen Jiaoxuan, Wu Haipang, Wan Sanhui, Li Gongfeng
- DSGPT: Domain-specific Generative Pre-training Of Transformers For Text Generation In E-commerce Title And Review Summarization Zhang Xueying, Jiang Yunjiang, Shang Yue, Cheng Zhaomeng, Zhang Chi, Fan Xiaochuan, Xiao Yun, Long Bo
- An Exploratory Study On Long Dialogue Summarization: What Works And What's Next Zhang Yusen, Ni Ansong, Yu Tao, Zhang Rui, Zhu Chenguang, Deb Budhaditya, Celikyilmaz Asli, Awadallah Ahmed Hassan, Radev Dragomir
- Towards More Efficient Insertion Transformer With Fractional Positional Encoding Zhang Zhisong, Zhang Yizhe, Dolan Bill
- Ror: Read-over-read For Long Document Machine Reading Comprehension Zhao Jing, Bao Junwei, Wang Yifan, Zhou Yongwei, Wu Youzheng, He Xiaodong, Zhou Bowen
- Self-guided Curriculum Learning For Neural Machine Translation Zhou Lei, Ding Liang, Duh Kevin, Watanabe Shinji, Sasano Ryohei, Takeda Koichi
- Thinking Clearly, Talking Fast: Concept-guided Non-autoregressive Generation For Open-domain Dialogue Systems Zou Yicheng, Liu Zhihua, Hu Xingwu, Zhang Qi
- Taming Sparsely Activated Transformer With Stochastic Experts Zuo Simiao, Liu Xiaodong, Jiao Jian, Kim Young Jin, Hassan Hany, Zhang Ruofei, Zhao Tuo, Gao Jianfeng
- Palm: Scaling Language Modeling With Pathways Aakanksha Chowdhery, Sharan Narang, Jacob Devlin, Maarten Bosma, Gaurav Mishra, Adam Roberts, Paul Barham, Hyung Won Chung, Charles Sutton, Sebastian Gehrmann, Parker Schuh, Kensen Shi, Sasha Tsvyashchenko, Joshua Maynez, Abhishek Rao, Parker Barnes, Yi Tay, Noam Shazeer, Vinodkumar Prabhakaran, Emily Reif, Nan Du, Ben Hutchinson, Reiner Pope, James Bradbury, Jacob Austin, Michael Isard, Guy Gur-ari, Pengcheng Yin, Toju Duke, Anselm Levskaya, Sanjay Ghemawat, Sunipa Dev, Henryk Michalewski, Xavier Garcia, Vedant Misra, Kevin Robinson, Liam Fedus, Denny Zhou, Daphne Ippolito, David Luan, Hyeontaek Lim, Barret Zoph, Alexander Spiridonov, Ryan Sepassi, David Dohan, Shivani Agrawal, Mark Omernick, Andrew M. Dai, Thanumalayan Sankaranarayana Pillai, Marie Pellat, Aitor Lewkowycz, Erica Moreira, Rewon Child, Oleksandr Polozov, Katherine Lee, Zongwei Zhou, Xuezhi Wang, Brennan Saeta, Mark Diaz, Orhan Firat, Michele Catasta, Jason Wei, Kathy Meier-hellstern, Douglas Eck, Jeff Dean, Slav Petrov, Noah Fiedel
- Global Memory Transformer For Processing Long Documents Adel Arij Al
- Vector Representations Of Idioms In Conversational Systems Adewumi Tosin, Liwicki Foteini, Liwicki Marcus
- Vl-interpret: An Interactive Visualization Tool For Interpreting Vision-language Transformers Aflalo Estelle, Du Meng, Tseng Shao-yen, Liu Yongfei, Wu Chenfei, Duan Nan, Lal Vasudev
- Few-shot Training Llms For Project-specific Code-summarization Ahmed Toufique, Devanbu Premkumar
- Multimodal Sequential Generative Models For Semi-supervised Language Instruction Following Akuzawa Kei, Iwasawa Yusuke, Matsuo Yutaka
- Compound Tokens: Channel Fusion For Vision-language Representation Learning Aladago Maxwell Mbabilla, Piergiovanni Aj
- End-to-end Multimodal Representation Learning For Video Dialog Alamri Huda, Bilic Anthony, Hu Michael, Beedu Apoorva, Essa Irfan
- Tempo: Accelerating Transformer-based Model Training Through Memory Footprint Reduction Andoorveedu Muralidhar, Zhu Zhanda, Zheng Bojian, Pekhimenko Gennady
- Exploring Length Generalization In Large Language Models Anil Cem, Wu Yuhuai, Andreassen Anders, Lewkowycz Aitor, Misra Vedant, Ramasesh Vinay, Slone Ambrose, Gur-ari Guy, Dyer Ethan, Neyshabur Behnam
- Characterizing Verbatim Short-term Memory In Neural Language Models Armeni Kristijan, Honey Christopher, Linzen Tal
- Small Batch Sizes Improve Training Of Low-resource Neural MT Atrio Àlex R., Popescu-belis Andrei
- Parameter-efficient Finetuning Of Transformers For Source Code Ayupov Shamil, Chirkova Nadezhda
- Vl-beit: Generative Vision-language Pretraining Bao Hangbo, Wang Wenhui, Dong Li, Wei Furu
- P\(^3\)LM: Probabilistically Permuted Prophet Language Modeling For Generative Pre-training Bao Junwei, Wang Yifan, Ying Jiangyong, Gong Yeyun, Zhao Jing, Wu Youzheng, He Xiaodong
- Grad-sam: Explaining Transformers Via Gradient Self-attention Maps Barkan Oren, Hauon Edan, Caciularu Avi, Katz Ori, Malkiel Itzik, Armstrong Omri, Koenigstein Noam
- Leveraging Pre-trained Language Models For Conversational Information Seeking From Text Bellan Patrizio, Dragoni Mauro, Ghidini Chiara
- Logical Reasoning For Task Oriented Dialogue Systems Beygi Sajjad, Fazel-zarandi Maryam, Cervone Alessandra, Krishnan Prakash, Jonnalagadda Siddhartha Reddy
- Banglanlg And Banglat5: Benchmarks And Resources For Evaluating Low-resource Natural Language Generation In Bangla Bhattacharjee Abhik, Hasan Tahmid, Ahmad Wasi Uddin, Shahriyar Rifat
- BLOOM: A 176b-parameter Open-access Multilingual Language Model Bigscience Workshop, Teven Le Scao, Angela Fan, Christopher Akiki, Ellie Pavlick, Suzana Ilić, Daniel Hesslow, Roman Castagné, Alexandra Sasha Luccioni, François Yvon, Matthias Gallé, Jonathan Tow, Alexander M. Rush, Stella Biderman, Albert Webson, Pawan Sasanka Ammanamanchi, Thomas Wang, Benoît Sagot, Niklas Muennighoff, Albert Villanova Del Moral, Olatunji Ruwase, Rachel Bawden, Stas Bekman, Angelina Mcmillan-major, Iz Beltagy, Huu Nguyen, Lucile Saulnier, Samson Tan, Pedro Ortiz Suarez, Victor Sanh, Hugo Laurençon, Yacine Jernite, Julien Launay, Margaret Mitchell, Colin Raffel, Aaron Gokaslan, Adi Simhi, Aitor Soroa, Alham Fikri Aji, Amit Alfassy, Anna Rogers, Ariel Kreisberg Nitzav, Canwen Xu, Chenghao Mou, Chris Emezue, Christopher Klamm, Colin Leong, Daniel Van Strien, David Ifeoluwa Adelani, Dragomir Radev, Eduardo González Ponferrada, Efrat Levkovizh, Ethan Kim, Eyal Bar Natan, Francesco De Toni, Gérard Dupont, Germán Kruszewski, Giada Pistilli, Hady Elsahar, Hamza Benyamina, Hieu Tran, Ian Yu, Idris Abdulmumin, Isaac Johnson, Itziar Gonzalez-dios, Javier De La Rosa, Jenny Chim, Jesse Dodge, Jian Zhu, Jonathan Chang, Jörg Frohberg, Joseph Tobing, Joydeep Bhattacharjee, Khalid Almubarak, Kimbo Chen, Kyle Lo, Leandro Von Werra, Leon Weber, Long Phan, Loubna Ben Allal, Ludovic Tanguy, Manan Dey, Manuel Romero Muñoz, Maraim Masoud, María Grandury, Mario Šaško, Max Huang, Maximin Coavoux, Mayank Singh, Mike Tian-jian Jiang, Minh Chien Vu, Mohammad A. Jauhar, Mustafa Ghaleb, Nishant Subramani, Nora Kassner, Nurulaqilla Khamis, Olivier Nguyen, Omar Espejel, Ona De Gibert, Paulo Villegas, Peter Henderson, Pierre Colombo, Priscilla Amuok, Quentin Lhoest, Rheza Harliman, Rishi Bommasani, Roberto Luis López, Rui Ribeiro, Salomey Osei, Sampo Pyysalo, Sebastian Nagel, Shamik Bose, Shamsuddeen Hassan Muhammad, Shanya Sharma, Shayne Longpre, Somaieh Nikpoor, Stanislav Silberberg, Suhas Pai, Sydney Zink, Tiago Timponi Torrent, Timo Schick, Tristan Thrush, Valentin Danchev, Vassilina Nikoulina, Veronika Laippala, Violette Lepercq, Vrinda Prabhu, Zaid Alyafeai, Zeerak Talat, Arun Raja, Benjamin Heinzerling, Chenglei Si, Davut Emre Taşar, Elizabeth Salesky, Sabrina J. Mielke, Wilson Y. Lee, Abheesht Sharma, Andrea Santilli, Antoine Chaffin, Arnaud Stiegler, Debajyoti Datta, Eliza Szczechla, Gunjan Chhablani, Han Wang, Harshit Pandey, Hendrik Strobelt, Jason Alan Fries, Jos Rozen, Leo Gao, Lintang Sutawika, M Saiful Bari, Maged S. Al-shaibani, Matteo Manica, Nihal Nayak, Ryan Teehan, Samuel Albanie, Sheng Shen, Srulik Ben-david, Stephen H. Bach, Taewoon Kim, Tali Bers, Thibault Fevry, Trishala Neeraj, Urmish Thakker, Vikas Raunak, Xiangru Tang, Zheng-xin Yong, Zhiqing Sun, Shaked Brody, Yallow Uri, Hadar Tojarieh, Adam Roberts, Hyung Won Chung, Jaesung Tae, Jason Phang, Ofir Press, Conglong Li, Deepak Narayanan, Hatim Bourfoune, Jared Casper, Jeff Rasley, Max Ryabinin, Mayank Mishra, Minjia Zhang, Mohammad Shoeybi, Myriam Peyrounette, Nicolas Patry, Nouamane Tazi, Omar Sanseviero, Patrick Von Platen, Pierre Cornette, Pierre François Lavallée, Rémi Lacroix, Samyam Rajbhandari, Sanchit Gandhi, Shaden Smith, Stéphane Requena, Suraj Patil, Tim Dettmers, Ahmed Baruwa, Amanpreet Singh, Anastasia Cheveleva, Anne-laure Ligozat, Arjun Subramonian, Aurélie Névéol, Charles Lovering, Dan Garrette, Deepak Tunuguntla, Ehud Reiter, Ekaterina Taktasheva, Ekaterina Voloshina, Eli Bogdanov, Genta Indra Winata, Hailey Schoelkopf, Jan-christoph Kalo, Jekaterina Novikova, Jessica Zosa Forde, Jordan Clive, Jungo Kasai, Ken Kawamura, Liam Hazan, Marine Carpuat, Miruna Clinciu, Najoung Kim, Newton Cheng, Oleg Serikov, Omer Antverg, Oskar Van Der Wal, Rui Zhang, Ruochen Zhang, Sebastian Gehrmann, Shachar Mirkin, Shani Pais, Tatiana Shavrina, Thomas Scialom, Tian Yun, Tomasz Limisiewicz, Verena Rieser, Vitaly Protasov, Vladislav Mikhailov, Yada Pruksachatkun, Yonatan Belinkov, Zachary Bamberger, Zdeněk Kasner, Alice Rueda, Amanda Pestana, Amir Feizpour, Ammar Khan, Amy Faranak, Ana Santos, Anthony Hevia, Antigona Unldreaj, Arash Aghagol, Arezoo Abdollahi, Aycha Tammour, Azadeh Hajihosseini, Bahareh Behroozi, Benjamin Ajibade, Bharat Saxena, Carlos Muñoz Ferrandis, Daniel Mcduff, Danish Contractor, David Lansky, Davis David, Douwe Kiela, Duong A. Nguyen, Edward Tan, Emi Baylor, Ezinwanne Ozoani, Fatima Mirza, Frankline Ononiwu, Habib Rezanejad, Hessie Jones, Indrani Bhattacharya, Irene Solaiman, Irina Sedenko, Isar Nejadgholi, Jesse Passmore, Josh Seltzer, Julio Bonis Sanz, Livia Dutra, Mairon Samagaio, Maraim Elbadri, Margot Mieskes, Marissa Gerchick, Martha Akinlolu, Michael Mckenna, Mike Qiu, Muhammed Ghauri, Mykola Burynok, Nafis Abrar, Nazneen Rajani, Nour Elkott, Nour Fahmy, Olanrewaju Samuel, Ran An, Rasmus Kromann, Ryan Hao, Samira Alizadeh, Sarmad Shubber, Silas Wang, Sourav Roy, Sylvain Viguier, Thanh Le, Tobi Oyebade, Trieu Le, Yoyo Yang, Zach Nguyen, Abhinav Ramesh Kashyap, Alfredo Palasciano, Alison Callahan, Anima Shukla, Antonio Miranda-escalada, Ayush Singh, Benjamin Beilharz, Bo Wang, Caio Brito, Chenxi Zhou, Chirag Jain, Chuxin Xu, Clémentine Fourrier, Daniel León Periñán, Daniel Molano, Dian Yu, Enrique Manjavacas, Fabio Barth, Florian Fuhrimann, Gabriel Altay, Giyaseddin Bayrak, Gully Burns, Helena U. Vrabec, Imane Bello, Ishani Dash, Jihyun Kang, John Giorgi, Jonas Golde, Jose David Posada, Karthik Rangasai Sivaraman, Lokesh Bulchandani, Lu Liu, Luisa Shinzato, Madeleine Hahn De Bykhovetz, Maiko Takeuchi, Marc Pàmies, Maria A Castillo, Marianna Nezhurina, Mario Sänger, Matthias Samwald, Michael Cullan, Michael Weinberg, Michiel De Wolf, Mina Mihaljcic, Minna Liu, Moritz Freidank, Myungsun Kang, Natasha Seelam, Nathan Dahlberg, Nicholas Michio Broad, Nikolaus Muellner, Pascale Fung, Patrick Haller, Ramya Chandrasekhar, Renata Eisenberg, Robert Martin, Rodrigo Canalli, Rosaline Su, Ruisi Su, Samuel Cahyawijaya, Samuele Garda, Shlok S Deshmukh, Shubhanshu Mishra, Sid Kiblawi, Simon Ott, Sinee Sang-aroonsiri, Srishti Kumar, Stefan Schweter, Sushil Bharati, Tanmay Laud, Théo Gigant, Tomoya Kainuma, Wojciech Kusa, Yanis Labrak, Yash Shailesh Bajaj, Yash Venkatraman, Yifan Xu, Yingxin Xu, Yu Xu, Zhe Tan, Zhongli Xie, Zifan Ye, Mathilde Bras, Younes Belkada, Thomas Wolf
- Inpars: Data Augmentation For Information Retrieval Using Large Language Models Bonifacio Luiz, Abonizio Hugo, Fadaee Marzieh, Nogueira Rodrigo
- Understanding Performance Of Long-document Ranking Models Through Comprehensive Evaluation And Leaderboarding Boytsov Leonid, Akinpelu David, Lin Tianyi, Gao Fangwei, Zhao Yutian, Huang Jeffrey, Katyal Nipun, Nyberg Eric
- Evaluating The Susceptibility Of Pre-trained Language Models Via Handcrafted Adversarial Examples Branch Hezekiah J., Cefalu Jonathan Rodriguez, Mchugh Jeremy, Hujer Leyla, Bahl Aditya, Iglesias Daniel Del Castillo, Heichman Ron, Darwishi Ramesh
- Large Language Models Can Implement Policy Iteration Brooks Ethan, Walls Logan, Lewis Richard L., Singh Satinder
- Understanding Attention For Vision-and-language Tasks Cao Feiqi, Han Soyeon Caren, Long Siqu, Xu Changwei, Poon Josiah
- HIBRIDS: Attention With Hierarchical Biases For Structure-aware Long Document Summarization Cao Shuyang, Wang Lu
- A Model-agnostic Data Manipulation Method For Persona-based Dialogue Generation Cao Yu, Bi Wei, Fang Meng, Shi Shuming, Tao Dacheng
- Pre-trained Neural Language Models For Automatic Mobile App User Feedback Answer Generation Cao Yue, Fard Fatemeh H.
- Analyzing Semantic Faithfulness Of Language Models Via Input Intervention On Question Answering Chaturvedi Akshay, Bhar Swarnadeep, Saha Soumadeep, Garain Utpal, Asher Nicholas
- Character-centric Story Visualization Via Visual Planning And Token Alignment Chen Hong, Han Rujun, Wu Te-lin, Nakayama Hideki, Peng Nanyun
- Xdoc: Unified Pre-training For Cross-format Document Understanding Chen Jingye, Lv Tengchao, Cui Lei, Zhang Cha, Wei Furu
- Exploring And Exploiting Multi-granularity Representations For Machine Reading Comprehension Chen Nuo, You Chenyu
- Murag: Multimodal Retrieval-augmented Generator For Open Question Answering Over Images And Text Chen Wenhu, Hu Hexiang, Chen Xi, Verga Pat, Cohen William W.
- Pali: A Jointly-scaled Multilingual Language-image Model Chen Xi, Wang Xiao, Changpinyo Soravit, Piergiovanni Aj, Padlewski Piotr, Salz Daniel, Goodman Sebastian, Grycner Adam, Mustafa Basil, Beyer Lucas, Kolesnikov Alexander, Puigcerver Joan, Ding Nan, Rong Keran, Akbari Hassan, Mishra Gaurav, Xue Linting, Thapliyal Ashish, Bradbury James, Kuo Weicheng, Seyedhosseini Mojtaba, Jia Chao, Ayan Burcu Karagol, Riquelme Carlos, Steiner Andreas, Angelova Anelia, Zhai Xiaohua, Houlsby Neil, Soricut Radu
- Transformers Go For The Lols: Generating (humourous) Titles From Scientific Abstracts End-to-end Chen Yanran, Eger Steffen
- Empowering Parameter-efficient Transfer Learning By Recognizing The Kernel Structure In Self-attention Chen Yifan, Hazarika Devamanyu, Namazifar Mahdi, Liu Yang, Jin Di, Hakkani-tur Dilek
- Curriculum: A Broad-coverage Benchmark For Linguistic Phenomena In Natural Language Understanding Chen Zeming, Gao Qiyue
- Capture Salient Historical Information: A Fast And Accurate Non-autoregressive Model For Multi-turn Spoken Language Understanding Cheng Lizhi, Jia Weijia, Yang Wenmian
- Vault: Augmenting The Vision-and-language Transformer For Sentiment Classification On Social Media Chochlakis Georgios, Srinivasan Tejas, Thomason Jesse, Narayanan Shrikanth
- Meta-learning Fast Weight Language Models Clark Kevin, Guu Kelvin, Chang Ming-wei, Pasupat Panupong, Hinton Geoffrey, Norouzi Mohammad
- LSG Attention: Extrapolation Of Pretrained Transformers To Long Sequences Condevaux Charles, Harispe Sébastien
- Building The Intent Landscape Of Real-world Conversational Corpora With Extractive Question-answering Transformers Corbeil Jean-philippe, Li Mia Taige, Ghavidel Hadi Abdi
- The Future Is Different: Large Pre-trained Language Models Fail In Prediction Tasks Cvejoski Kostadin, Sánchez Ramsés J., Ojeda César
- Neural Knowledge Bank For Pretrained Transformers Dai Damai, Jiang Wenbin, Dong Qingxiu, Lyu Yajuan, She Qiaoqiao, Sui Zhifang
- Why Can GPT Learn In-context? Language Models Implicitly Perform Gradient Descent As Meta-optimizers Dai Damai, Sun Yutao, Dong Li, Hao Yaru, Ma Shuming, Sui Zhifang, Wei Furu
- Llm.int8(): 8-bit Matrix Multiplication For Transformers At Scale Dettmers Tim, Lewis Mike, Belkada Younes, Zettlemoyer Luke
- Pre-training Transformer Models With Sentence-level Objectives For Answer Sentence Selection Di Liello Luca, Garg Siddhant, Soldaini Luca, Moschitti Alessandro
- Cogview2: Faster And Better Text-to-image Generation Via Hierarchical Transformers Ding Ming, Zheng Wendi, Hong Wenyi, Tang Jie
- Parallel Attention Forcing For Machine Translation Dou Qingyun, Gales Mark
- Coarse-to-fine Vision-language Pre-training With Fusion In The Backbone Dou Zi-yi, Kamath Aishwarya, Gan Zhe, Zhang Pengchuan, Wang Jianfeng, Li Linjie, Liu Zicheng, Liu Ce, Lecun Yann, Peng Nanyun, Gao Jianfeng, Wang Lijuan
- Contrastive Learning With Bidirectional Transformers For Sequential Recommendation Du Hanwen, Shi Hui, Zhao Pengpeng, Wang Deqing, Sheng Victor S., Liu Yanchi, Liu Guanfeng, Zhao Lei
- A Survey Of Vision-language Pre-trained Models Du Yifan, Liu Zikang, Li Junyi, Zhao Wayne Xin
- Imagination Is All You Need! Curved Contrastive Learning For Abstract Sequence Modeling Utilized On Long Short-term Dialogue Planning Erker Justus-jonas, Schaffer Stefan, Spanakis Gerasimos
- Leveraging Pre-trained Models For Failure Analysis Triplets Generation Ezukwoke Kenneth, Hoayek Anis, Batton-hubert Mireille, Boucher Xavier, Gounet Pascal, Adrian Jerome
- CERES: Pretraining Of Graph-conditioned Transformer For Semi-structured Session Data Feng Rui, Luo Chen, Yin Qingyu, Yin Bing, Zhao Tuo, Zhang Chao
- Learn To Remember: Transformer With Recurrent Memory For Document-level Machine Translation Feng Yukun, Li Feng, Song Ziang, Zheng Boyuan, Koehn Philipp
- Towards Opening The Black Box Of Neural Machine Translation: Source And Target Interpretations Of The Transformer Ferrando Javier, Gállego Gerard I., Alastruey Belen, Escolano Carlos, Costa-jussà Marta R.
- Should Attention Be All We Need? The Epistemic And Ethical Implications Of Unification In Machine Learning Fishman Nic, Hancox-li Leif
- GPTQ: Accurate Post-training Quantization For Generative Pre-trained Transformers Frantar Elias, Ashkboos Saleh, Hoefler Torsten, Alistarh Dan
- Revision Transformers: Instructing Language Models To Change Their Values Friedrich Felix, Stammer Wolfgang, Schramowski Patrick, Kersting Kristian
- Adapterbias: Parameter-efficient Token-dependent Representation Shift For Adapters In NLP Tasks Fu Chin-lun, Chen Zih-ching, Lee Yun-ru, Lee Hung-yi
- Hungry Hungry Hippos: Towards Language Modeling With State Space Models Fu Daniel Y., Dao Tri, Saab Khaled K., Thomas Armin W., Rudra Atri, Ré Christopher
- An Empirical Study Of End-to-end Video-language Transformers With Masked Visual Modeling Fu Tsu-jui, Li Linjie, Gan Zhe, Lin Kevin, Wang William Yang, Wang Lijuan, Liu Zicheng
- Pnlp-mixer: An Efficient All-mlp Architecture For Language Fusco Francesco, Pascual Damian, Staar Peter, Antognini Diego
- Visual Prompt Tuning For Test-time Domain Adaptation Gao Yunhe, Shi Xingjian, Zhu Yi, Wang Hao, Tang Zhiqiang, Zhou Xiong, Li Mu, Metaxas Dimitris N.
- Cramming: Training A Language Model On A Single GPU In One Day Geiping Jonas, Goldstein Tom
- Microbert: Effective Training Of Low-resource Monolingual Berts Through Parameter Reduction And Multitask Learning Gessler Luke, Zeldes Amir
- Re2g: Retrieve, Rerank, Generate Glass Michael, Rossiello Gaetano, Chowdhury Md Faisal Mahbub, Naik Ankita Rajaram, Cai Pengshan, Gliozzo Alfio
- Does Entity Abstraction Help Generative Transformers Reason? Gontier Nicolas, Reddy Siva, Pal Christopher
- Massively Multilingual Lexical Specialization Of Multilingual Transformers Green Tommaso, Ponzetto Simone Paolo, Glavaš Goran
- Continuous Decomposition Of Granularity For Neural Paraphrase Generation Gu Xiaodong, Zhang Zhaowei, Lee Sang-woo, Yoo Kang Min, Ha Jung-woo
- Generating Coherent Narratives By Learning Dynamic And Discrete Entity States With A Contrastive Framework Guan Jian, Yang Zhenyu, Zhang Rongsheng, Hu Zhipeng, Huang Minlie
- Transkimmer: Transformer Learns To Layer-wise Skim Guan Yue, Li Zhengyi, Leng Jingwen, Lin Zhouhan, Guo Minyi
- STI: Turbocharge NLP Inference At The Edge Via Elastic Pipelining Guo Liwei, Choe Wonkyo, Lin Felix Xiaozhu
- Quala-minilm: A Quantized Length Adaptive Minilm Guskin Shira, Wasserblat Moshe, Wang Chang, Shen Haihao
- LUNA: Language Understanding With Number Augmentations On Transformers Via Number Plugins And Pre-training Han Hongwei, Xu Jialiang, Zhou Mengyu, Shao Yijia, Han Shi, Zhang Dongmei
- Supervised Visual Attention For Simultaneous Multimodal Machine Translation Haralampieva Veneta, Caglayan Ozan, Specia Lucia
- How Much Does Attention Actually Attend? Questioning The Importance Of Attention In Pretrained Transformers Hassid Michael, Peng Hao, Rotem Daniel, Kasai Jungo, Montero Ivan, Smith Noah A., Schwartz Roy
- Understanding Transformer Memorization Recall Through Idioms Haviv Adi, Cohen Ido, Gidron Jacob, Schuster Roei, Goldberg Yoav, Geva Mor
- Transformer Language Models Without Positional Encodings Still Learn Positional Information Haviv Adi, Ram Ori, Press Ofir, Izsak Peter, Levy Omer
- Z-code++: A Pre-trained Language Model Optimized For Abstractive Summarization He Pengcheng, Peng Baolin, Lu Liyang, Wang Song, Mei Jie, Liu Yang, Xu Ruochen, Awadalla Hany Hassan, Shi Yu, Zhu Chenguang, Xiong Wayne, Zeng Michael, Gao Jianfeng, Huang Xuedong
- SPACE-3: Unified Dialog Model Pre-training For Task-oriented Dialog Understanding And Generation He Wanwei, Dai Yinpei, Yang Min, Sun Jian, Huang Fei, Si Luo, Li Yongbin
- TRAC: A Textual Benchmark For Reasoning About Actions And Change He Weinan, Huang Canming, Xiao Zhanhao, Liu Yongmei
- Hyperprompt: Prompt-based Task-conditioning Of Transformers He Yun, Zheng Huaixiu Steven, Tay Yi, Gupta Jai, Du Yu, Aribandi Vamsi, Zhao Zhe, Li Yaguang, Chen Zhao, Metzler Donald, Cheng Heng-tze, Chi Ed H.
- Shapley Head Pruning: Identifying And Removing Interference In Multilingual Transformers Held William, Yang Diyi
- Compressing Pre-trained Transformers Via Low-bit Nxm Sparsity For Natural Language Understanding Holmes Connor, Zhang Minjia, He Yuxiong, Wu Bo
- Cogvideo: Large-scale Pretraining For Text-to-video Generation Via Transformers Hong Wenyi, Ding Ming, Zheng Wendi, Liu Xinghan, Tang Jie
- Syntax-guided Localized Self-attention By Constituency Syntactic Distance Hou Shengyuan, Kai Jushi, Xue Haotian, Zhu Bingyu, Yuan Bo, Huang Longtao, Wang Xinbing, Lin Zhouhan
- XDBERT: Distilling Visual Information To BERT From Cross-modal Systems To Improve Language Understanding Hsu Chan-jan, Lee Hung-yi, Tsao Yu
- Empirical Evaluation Of Post-training Quantization Methods For Language Tasks Hu Ting, Meinel Christoph, Yang Haojin
- PLANET: Dynamic Content Planning In Autoregressive Transformers For Long-form Text Generation Hu Zhe, Chan Hou Pong, Liu Jiachen, Xiao Xinyan, Wu Hua, Huang Lifu
- Empowering Language Models With Knowledge Graph Reasoning For Question Answering Hu Ziniu, Xu Yichong, Yu Wenhao, Wang Shuohang, Yang Ziyi, Zhu Chenguang, Chang Kai-wei, Sun Yizhou
- Transformer Quality In Linear Time Hua Weizhe, Dai Zihang, Liu Hanxiao, Le Quoc V.
- Detecting Unintended Memorization In Language-model-fused ASR Huang W. Ronny, Chien Steve, Thakkar Om, Mathews Rajiv
- Layoutlmv3: Pre-training For Document AI With Unified Text And Image Masking Huang Yupan, Lv Tengchao, Cui Lei, Lu Yutong, Wei Furu
- Do Berts Learn To Use Browser User Interface? Exploring Multi-step Tasks With Unified Vision-and-language Berts Iki Taichi, Aizawa Akiko
- Efficient Long-text Understanding With Short-text Models Ivgi Maor, Shaham Uri, Berant Jonathan
- Litetransformersearch: Training-free Neural Architecture Search For Efficient Language Models Javaheripi Mojan, De Rosa Gustavo H., Mukherjee Subhabrata, Shah Shital, Religa Tomasz L., Mendes Caio C. T., Bubeck Sebastien, Koushanfar Farinaz, Dey Debadeepta
- DP-KB: Data Programming With Knowledge Bases Improves Transformer Fine Tuning For Answer Sentence Selection Jedema Nic, Vu Thuy, Gupta Manish, Moschitti Alessandro
- Lamemo: Language Modeling With Look-ahead Memory Ji Haozhe, Zhang Rongsheng, Yang Zhenyu, Hu Zhipeng, Huang Minlie
- Survey Of Hallucination In Natural Language Generation Ji Ziwei, Lee Nayeon, Frieske Rita, Yu Tiezheng, Su Dan, Xu Yan, Ishii Etsuko, Bang Yejin, Chen Delong, Dai Wenliang, Chan Ho Shu, Madotto Andrea, Fung Pascale
- RHO (\(\rho\)): Reducing Hallucination In Open-domain Dialogues With Knowledge Grounding Ji Ziwei, Liu Zihan, Lee Nayeon, Yu Tiezheng, Wilie Bryan, Zeng Min, Fung Pascale
- Visual Prompt Tuning Jia Menglin, Tang Luming, Chen Bor-chun, Cardie Claire, Belongie Serge, Hariharan Bharath, Lim Ser-nam
- Pseudo-q: Generating Pseudo Language Queries For Visual Grounding Jiang Haojun, Lin Yuanze, Han Dongchen, Song Shiji, Huang Gao
- Training Compute-optimal Large Language Models Hoffmann Jordan, Borgeaud Sebastian, Mensch Arthur, Buchatskaya Elena, Cai Trevor, Rutherford Eliza, De Las Casas Diego, Hendricks Lisa Anne, Welbl Johannes, Clark Aidan, Hennigan Tom, Noland Eric, Millican Katie, Van Den Driessche George, Damoc Bogdan, Guy Aurelia, Osindero Simon, Simonyan Karen, Elsen Erich, Rae Jack W., Vinyals Oriol, Sifre Laurent
- Latency Adjustable Transformer Encoder For Language Understanding Kachuee Sajjad, Sharifkhani Mohammad
- A New Path: Scaling Vision-and-language Navigation With Synthetic Instructions And Imitation Learning Kamath Aishwarya, Anderson Peter, Wang Su, Koh Jing Yu, Ku Alexander, Waters Austin, Yang Yinfei, Baldridge Jason, Parekh Zarana
- Fie: Building A Global Probability Space By Leveraging Early Fusion In Encoder For Open-domain Question Answering Kedia Akhil, Zaidi Mohd Abbas, Lee Haejun
- High Recall Data-to-text Generation With Progressive Edit Kim Choonghan, Lee Gary Geunbae
- Who Says Elephants Can't Run: Bringing Large Scale Moe Models Into Cloud Scale Production Kim Young Jin, Henry Rawn, Fahim Raffy, Awadalla Hany Hassan
- Iterative Vision-and-language Navigation Krantz Jacob, Banerjee Shurjo, Zhu Wang, Corso Jason, Anderson Peter, Lee Stefan, Thomason Jesse
- Downstream Datasets Make Surprisingly Good Pretraining Corpora Krishna Kundan, Garg Saurabh, Bigham Jeffrey P., Lipton Zachary C.
- Few-shot Question Generation For Personalized Feedback In Intelligent Tutoring Systems Kulshreshtha Devang, Shayan Muhammad, Belfer Robert, Reddy Siva, Serban Iulian Vlad, Kochmar Ekaterina
- Mucot: Multilingual Contrastive Training For Question-answering In Low-resource Languages Kumar Gokul Karthik, Gehlot Abhishek Singh, Mullappilly Sahal Shaji, Nandakumar Karthik
- The Optimal BERT Surgeon: Scalable And Accurate Second-order Pruning For Large Language Models Kurtic Eldar, Campos Daniel, Nguyen Tuan, Frantar Elias, Kurtz Mark, Fineran Benjamin, Goin Michael, Alistarh Dan
- Natural Language To Code Using Transformers Kusupati Uday, Ailavarapu Venkata Ravi Teja
- Language Detoxification With Attribute-discriminative Latent Space Kwak Jin Myung, Kim Minseon, Hwang Sung Ju
- Does BERT Really Agree ? Fine-grained Analysis Of Lexical Dependence On A Syntactic Task Lasri Karim, Lenci Alessandro, Poibeau Thierry
- Guiding Visual Question Answering With Attention Priors Le Thao Minh, Le Vuong, Gupta Sunil, Venkatesh Svetha, Tran Truyen
- Littlebird: Efficient Faster & Longer Transformer For Question Answering Lee Minchul, Han Kijong, Shin Myeong Cheol
- Simple Recurrence Improves Masked Language Models Lei Tao, Tian Ran, Bastings Jasmijn, Parikh Ankur P.
- Fast Inference From Transformers Via Speculative Decoding Leviathan Yaniv, Kalman Matan, Matias Yossi
- GPT-D: Inducing Dementia-related Linguistic Anomalies By Deliberate Degradation Of Artificial Neural Language Models Li Changye, Knopman David, Xu Weizhe, Cohen Trevor, Pakhomov Serguei
- Summer: Wechat Neural Machine Translation Systems For The WMT22 Biomedical Translation Task Li Ernan, Meng Fandong, Zhou Jie
- Learning To Transfer Prompts For Text Generation Li Junyi, Tang Tianyi, Nie Jian-yun, Wen Ji-rong, Zhao Wayne Xin
- Personalized Prompt Learning For Explainable Recommendation Li Lei, Zhang Yongfeng, Chen Li
- Modeling Coreference Relations In Visual Dialog Li Mingxiao, Moens Marie-francine
- Evade The Trap Of Mediocrity: Promoting Diversity And Novelty In Text Generation Via Concentrating Attention Li Wenhao, Yi Xiaoyuan, Hu Jinyi, Sun Maosong, Xie Xing
- P-transformer: Towards Better Document-to-document Neural Machine Translation Li Yachao, Li Junhui, Jiang Jing, Tao Shimin, Yang Hao, Zhang Min
- Uni-eden: Universal Encoder-decoder Network By Multi-granular Vision-language Pre-training Li Yehao, Fan Jiahao, Pan Yingwei, Yao Ting, Lin Weiyao, Mei Tao
- VALHALLA: Visual Hallucination For Machine Translation Li Yi, Panda Rameswar, Kim Yoon, Chen Chun-fu, Feris Rogerio, Cox David, Vasconcelos Nuno
- Exploring Visual Interpretability For Contrastive Language-image Pre-training Li Yi, Wang Hualiang, Duan Yiqun, Xu Hang, Li Xiaomeng
- Clinical-longformer And Clinical-bigbird: Transformers For Long Clinical Sequences Li Yikuan, Wehbe Ramsey M., Ahmad Faraz S., Wang Hanyin, Luo Yuan
- Accelerating Attention Through Gradient-based Learned Runtime Pruning Li Zheng, Ghodrati Soroush, Yazdanbakhsh Amir, Esmaeilzadeh Hadi, Kang Mingu
- Life After BERT: What Do Other Muppets Understand About Language? Lialin Vladislav, Zhao Kevin, Shivagunde Namrata, Rumshisky Anna
- Bjtu-wechat's Systems For The WMT22 Chat Translation Task Liang Yunlong, Meng Fandong, Xu Jinan, Chen Yufeng, Zhou Jie
- ADAPT: Vision-language Navigation With Modality-aligned Action Prompts Lin Bingqian, Zhu Yi, Chen Zicong, Liang Xiwen, Liu Jianzhuang, Liang Xiaodan
- Gentus: Simulating User Behaviour And Language In Task-oriented Dialogues With Generative Transformers Lin Hsien-chin, Geishauser Christian, Feng Shutong, Lubis Nurul, Van Niekerk Carel, Heck Michael, Gašić Milica
- Contextual Transformer For Offline Meta Reinforcement Learning Lin Runji, Li Ye, Feng Xidong, Zhang Zhaowei, Fung Xian Hong Wu, Zhang Haifeng, Wang Jun, Du Yali, Yang Yaodong
- Delving Deeper Into Cross-lingual Visual Question Answering Liu Chen, Pfeiffer Jonas, Korhonen Anna, Vulić Ivan, Gurevych Iryna
- A Simple Meta-learning Paradigm For Zero-shot Intent Classification With Mixture Attention Mechanism Liu Han, Zhao Siyang, Zhang Xiaotong, Zhang Feng, Sun Junjie, Yu Hong, Zhang Xianchao
- Instruction-following Agents With Multimodal Transformer Liu Hao, Lee Lisa, Lee Kimin, Abbeel Pieter
- Building Markovian Generative Architectures Over Pretrained LM Backbones For Efficient Task-oriented Dialog Systems Liu Hong, Cai Yucheng, Ou Zhijian, Huang Yi, Feng Junlan
- Fl-tuning: Layer Tuning For Feed-forward Network In Transformer Liu Jingping, Song Yuqiu, Xue Kui, Sun Hongli, Wang Chao, Chen Lihan, Jiang Haiyun, Liang Jiaqing, Ruan Tong
- Transformer With Memory Replay Liu Rui, Mozafari Barzan
- Commitbart: A Large Pre-trained Model For Github Commits Liu Shangqing, Li Yanzhou, Xie Xiaofei, Liu Yang
- Improving Personality Consistency In Conversation By Persona Extending Liu Yifan, Wei Wei, Liu Jiayi, Mao Xianling, Fang Rui, Chen Dangyang
- Dialogconv: A Lightweight Fully Convolutional Network For Multi-view Response Selection Liu Yongkang, Feng Shi, Gao Wei, Wang Daling, Zhang Yifei
- Prompt Generation Networks For Input-space Adaptation Of Frozen Vision Transformers Loedeman Jochem, Stol Maarten C., Han Tengda, Asano Yuki M.
- Relaxed Attention For Transformer Models Lohrenz Timo, Möller Björn, Li Zhengyang, Fingscheidt Tim
- Vision-and-language Pretrained Models: A Survey Long Siqu, Cao Feiqi, Han Soyeon Caren, Yang Haiqin
- Reacc: A Retrieval-augmented Code Completion Framework Lu Shuai, Duan Nan, Han Hojae, Guo Daya, Hwang Seung-won, Svyatkovskiy Alexey
- Choose Your QA Model Wisely: A Systematic Study Of Generative And Extractive Readers For Question Answering Luo Man, Hashimoto Kazuma, Yavuz Semih, Liu Zhiwei, Baral Chitta, Zhou Yingbo
- Biogpt: Generative Pre-trained Transformer For Biomedical Text Generation And Mining Luo Renqian, Sun Liai, Xia Yingce, Qin Tao, Zhang Sheng, Poon Hoifung, Liu Tie-yan
- Torchscale: Transformers At Scale Ma Shuming, Wang Hongyu, Huang Shaohan, Wang Wenhui, Chi Zewen, Dong Li, Benhaim Alon, Patra Barun, Chaudhary Vishrav, Song Xia, Wei Furu
- DIALOG-22 Ruatd Generated Text Detection Maloyan Narek, Nutfullin Bulat, Ilyushin Eugene
- Tangobert: Reducing Inference Cost By Using Cascaded Architecture Mamou Jonathan, Pereg Oren, Wasserblat Moshe, Schwartz Roy
- Fine-tuning Pre-trained Transformers Into Decaying Fast Weights Mao Huanru Henry
- Quick Starting Dialog Systems With Paraphrase Generation Marceau Louis, Belbahar Raouf, Queudot Marc, Naji Nada, Charton Eric, Meurs Marie-jean
- Mini-model Adaptation: Efficiently Extending Pretrained Models To New Languages Via Aligned Shallow Training Marchisio Kelly, Lewis Patrick, Chen Yihong, Artetxe Mikel
- GAAMA 2.0: An Integrated System That Answers Boolean And Extractive Questions Mccarley Scott, Bornea Mihaela, Rosenthal Sara, Ferritto Anthony, Sultan Md Arafat, Sil Avirup, Florian Radu
- DSI++: Updating Transformer Memory With New Documents Mehta Sanket Vaibhav, Gupta Jai, Tay Yi, Dehghani Mostafa, Tran Vinh Q., Rao Jinfeng, Najork Marc, Strubell Emma, Metzler Donald
- Mass-editing Memory In A Transformer Meng Kevin, Sharma Arnab Sen, Andonian Alex, Belinkov Yonatan, Bau David
- Rarely A Problem? Language Models Exhibit Inverse Scaling In Their Predictions Following Few-type Quantifiers Michaelov James A., Bergen Benjamin K.
- Minicons: Enabling Flexible Behavioral And Representational Analyses Of Transformer Language Models Misra Kanishka
- Domain-specific Text Generation For Machine Translation Moslem Yasmin, Haque Rejwanul, Kelleher John D., Way Andy
- SGPT: GPT Sentence Embeddings For Semantic Search Muennighoff Niklas
- JASMINE: Arabic GPT Models For Few-shot Learning Nagoudi El Moatez Billah, Abdul-mageed Muhammad, Elmadany Abdelrahim, Inciarte Alcides Alcoba, Khondaker Md Tawkat Islam
- Task Formulation Matters When Learning Continually: A Case Study In Visual Question Answering Nikandrou Mavina, Yu Lu, Suglia Alessandro, Konstas Ioannis, Rieser Verena
- Budgetlongformer: Can We Cheaply Pretrain A Sota Legal Language Model From Scratch? Niklaus Joel, Giofré Daniele
- Do Vision-and-language Transformers Learn Grounded Predicate-noun Dependencies? Nikolaus Mitja, Salin Emmanuelle, Ayache Stephane, Fourtassi Abdellah, Favre Benoit
- Improving The Cross-lingual Generalisation In Visual Question Answering Nooralahzadeh Farhad, Sennrich Rico
- Minialbert: Model Distillation Via Parameter-efficient Recursive Transformers Nouriborji Mohammadmahdi, Rohanian Omid, Kouchaki Samaneh, Clifton David A.
- CUE Vectors: Modular Training Of Language Models Conditioned On Diverse Contextual Signals Novotney Scott, Mukherjee Sreeparna, Ahmed Zeeshan, Stolcke Andreas
- Neural Language Taskonomy: Which NLP Tasks Are The Most Predictive Of Fmri Brain Activity? Oota Subba Reddy, Arora Jashn, Agarwal Veeral, Marreddy Mounika, Gupta Manish, Surampudi Bapi Raju
- Parameter-efficient Abstractive Question Answering Over Tables Or Text Pal Vaishali, Kanoulas Evangelos, De Rijke Maarten
- Follow-up Attention: An Empirical Study Of Developer And Neural Model Code Exploration Paltenghi Matteo, Pandita Rahul, Henley Austin Z., Ziegler Albert
- Silo Nlp's Participation At WAT2022 Parida Shantipriya, Panda Subhadarshi, Grönroos Stig-arne, Granroth-wilding Mark, Koistinen Mika
- TALM: Tool Augmented Language Models Parisi Aaron, Zhao Yao, Fiedel Noah
- LUT-GEMM: Quantized Matrix Multiplication Based On Luts For Efficient Inference In Large-scale Generative Language Models Park Gunho, Park Baeseong, Kim Minsub, Lee Sungjae, Kim Jeonghoon, Kwon Beomseok, Kwon Se Jung, Kim Byeongwook, Lee Youngjoo, Lee Dongsoo
- Relation-aware Language-graph Transformer For Question Answering Park Jinyoung, Choi Hyeong Kyu, Ko Juyeon, Park Hyeonjin, Kim Ji-hoon, Jeong Jisu, Kim Kyungmin, Kim Hyunwoo J.
- Beyond English-centric Bitexts For Better Multilingual Language Representation Learning Patra Barun, Singhal Saksham, Huang Shaohan, Chi Zewen, Dong Li, Wei Furu, Chaudhary Vishrav, Song Xia
- Xfboost: Improving Text Generation With Controllable Decoders Peng Xiangyu, Sollami Michael
- Sparse And Dense Approaches For The Full-rank Retrieval Of Responses For Dialogues Penha Gustavo, Hauff Claudia
- Ignore Previous Prompt: Attack Techniques For Language Models Perez Fábio, Ribeiro Ian
- Lifting The Curse Of Multilinguality By Pre-training Modular Transformers Pfeiffer Jonas, Goyal Naman, Lin Xi Victoria, Li Xian, Cross James, Riedel Sebastian, Artetxe Mikel
- Vit5: Pretrained Text-to-text Transformer For Vietnamese Language Generation Phan Long, Tran Hieu, Nguyen Hieu, Trinh Trieu H.
- Investigating Efficiently Extending Transformers For Long Input Summarization Phang Jason, Zhao Yao, Liu Peter J.
- Pre-training Image-language Transformers For Open-vocabulary Tasks Piergiovanni Aj, Kuo Weicheng, Angelova Anelia
- Harnessing The Power Of Multi-task Pretraining For Ground-truth Level Natural Language Explanations Plüster Björn, Ambsdorf Jakob, Braach Lukas, Lee Jae Hee, Wermter Stefan
- Outliers Dimensions That Disrupt Transformers Are Driven By Frequency Puccetti Giovanni, Rogers Anna, Drozd Aleksandr, Dell'orletta Felice
- Tegformer: Topic-to-essay Generation With Good Topic Coverage And High Text Coherence Qi Wang, Liu Rui, Zuo Yuan, Chen Yong, Zhang Dell
- Diffusion Glancing Transformer For Parallel Sequence To Sequence Learning Qian Lihua, Wang Mingxuan, Liu Yang, Zhou Hao
- The NLP Task Effectiveness Of Long-range Transformers Qin Guanghui, Feng Yukun, Van Durme Benjamin
- Memory In Humans And Deep Language Models: Linking Hypotheses For Model Augmentation Raccah Omri, Chen Phoebe, Willke Ted L., Poeppel David, Vo Vy A.
- Parallel Context Windows For Large Language Models Ratner Nir, Levine Yoav, Belinkov Yonatan, Ram Ori, Magar Inbal, Abend Omri, Karpas Ehud, Shashua Amnon, Leyton-brown Kevin, Shoham Yoav
- VLC-BERT: Visual Question Answering With Contextualized Commonsense Knowledge Ravi Sahithya, Chinchure Aditya, Sigal Leonid, Liao Renjie, Shwartz Vered
- Exploring Extreme Parameter Compression For Pre-trained Language Models Ren Yuxin, Wang Benyou, Shang Lifeng, Jiang Xin, Liu Qun
- Super-prompting: Utilizing Model-independent Contextual Data To Reduce Data Annotation Required In Visual Commonsense Tasks Rezaei Navid, Reformat Marek Z.
- Breakpoint Transformers For Modeling And Tracking Intermediate Beliefs Richardson Kyle, Tamari Ronen, Sultan Oren, Tsarfaty Reut, Shahaf Dafna, Sabharwal Ashish
- Learning To Diversify For Product Question Generation Roitman Haggai, Singer Uriel, Eshel Yotam, Nus Alexander, Kiperwasser Eliyahu
- Lamda: Language Models For Dialog Applications Thoppilan Romal, De Freitas Daniel, Hall Jamie, Shazeer Noam, Kulshreshtha Apoorv, Cheng Heng-tze, Jin Alicia, Bos Taylor, Baker Leslie, Du Yu, Li Yaguang, Lee Hongrae, Zheng Huaixiu Steven, Ghafouri Amin, Menegali Marcelo, Huang Yanping, Krikun Maxim, Lepikhin Dmitry, Qin James, Chen Dehao, Xu Yuanzhong, Chen Zhifeng, Roberts Adam, Bosma Maarten, Zhao Vincent, Zhou Yanqi, Chang Chung-ching, Krivokon Igor, Rusch Will, Pickett Marc, Srinivasan Pranesh, Man Laichee, Meier-hellstern Kathleen, Morris Meredith Ringel, Doshi Tulsee, Delos Santos Renelito, Duke Toju, Soraker Johnny, Zevenbergen Ben, Prabhakaran Vinodkumar, Diaz Mark, Hutchinson Ben, Olson Kristen, Molina Alejandra, Hoffman-john Erin, Lee Josh, Aroyo Lora, Rajakumar Ravi, Butryna Alena, Lamm Matthew, Kuzmina Viktoriya, Fenton Joe, Cohen Aaron, Bernstein Rachel, Kurzweil Ray, Aguera-arcas Blaise, Cui Claire, Croak Marian, Chi Ed, Le Quoc
- IT5: Text-to-text Pretraining For Italian Language Understanding And Generation Sarti Gabriele, Nissim Malvina
- Transformer Grammars: Augmenting Transformer Language Models With Syntactic Inductive Biases At Scale Sartran Laurent, Barrett Samuel, Kuncoro Adhiguna, Stanojević Miloš, Blunsom Phil, Dyer Chris
- What Language Model To Train If You Have One Million GPU Hours? Scao Teven Le, Wang Thomas, Hesslow Daniel, Saulnier Lucile, Bekman Stas, Bari M Saiful, Biderman Stella, Elsahar Hady, Muennighoff Niklas, Phang Jason, Press Ofir, Raffel Colin, Sanh Victor, Shen Sheng, Sutawika Lintang, Tae Jaesung, Yong Zheng Xin, Launay Julien, Beltagy Iz
- Confident Adaptive Language Modeling Schuster Tal, Fisch Adam, Gupta Jai, Dehghani Mostafa, Bahri Dara, Tran Vinh Q., Tay Yi, Metzler Donald
- Bird-eye Transformers For Text Generation Models Sha Lei, Song Yuhang, Yordanov Yordan, Salvatori Tommaso, Lukasiewicz Thomas
- Using Deepspeed And Megatron To Train Megatron-turing NLG 530B, A Large-scale Generative Language Model Smith Shaden, Patwary Mostofa, Norick Brandon, Legresley Patrick, Rajbhandari Samyam, Casper Jared, Liu Zhun, Prabhumoye Shrimai, Zerveas George, Korthikanti Vijay, Zhang Elton, Child Rewon, Aminabadi Reza Yazdani, Bernauer Julie, Song Xia, Shoeybi Mohammad, He Yuxiong, Houston Michael, Tiwary Saurabh, Catanzaro Bryan
- Deanthropomorphising NLP: Can A Language Model Be Conscious? Shardlow Matthew, Przybyła Piotr
- Staged Training For Transformer Language Models Shen Sheng, Walsh Pete, Keutzer Kurt, Dodge Jesse, Peters Matthew, Beltagy Iz
- Incorporating Domain Knowledge Through Task Augmentation For Front-end Javascript Code Generation Shen Sijie, Zhu Xiang, Dong Yihong, Guo Qizhi, Zhen Yankun, Li Ge
- Mgpt: Few-shot Learners Go Multilingual Shliazhko Oleh, Fenogenova Alena, Tikhonova Maria, Mikhailov Vladislav, Kozlova Anastasia, Shavrina Tatiana
- On The Efficacy Of Co-attention Transformer Layers In Visual Question Answering Sikarwar Ankur, Kreiman Gabriel
- Improving The Domain Adaptation Of Retrieval Augmented Generation (RAG) Models For Open Domain Question Answering Siriwardhana Shamane, Weerasekera Rivindu, Wen Elliott, Kaluarachchi Tharindu, Rana Rajib, Nanayakkara Suranga
- Human Language Modeling Soni Nikita, Matero Matthew, Balasubramanian Niranjan, Schwartz H. Andrew
- Climb: A Continual Learning Benchmark For Vision-and-language Tasks Srinivasan Tejas, Chang Ting-yun, Alva Leticia Leonor Pinto, Chochlakis Georgios, Rostami Mohammad, Thomason Jesse
- Beyond The Imitation Game: Quantifying And Extrapolating The Capabilities Of Language Models Srivastava Aarohi, Rastogi Abhinav, Rao Abhishek, Shoeb Abu Awal Md, Abid Abubakar, Fisch Adam, Brown Adam R., Santoro Adam, Gupta Aditya, Garriga-alonso Adrià, Kluska Agnieszka, Lewkowycz Aitor, Agarwal Akshat, Power Alethea, Ray Alex, Warstadt Alex, Kocurek Alexander W., Safaya Ali, Tazarv Ali, Xiang Alice, Parrish Alicia, Nie Allen, Hussain Aman, Askell Amanda, Dsouza Amanda, Slone Ambrose, Rahane Ameet, Iyer Anantharaman S., Andreassen Anders, Madotto Andrea, Santilli Andrea, Stuhlmüller Andreas, Dai Andrew, La Andrew, Lampinen Andrew, Zou Andy, Jiang Angela, Chen Angelica, Vuong Anh, Gupta Animesh, Gottardi Anna, Norelli Antonio, Venkatesh Anu, Gholamidavoodi Arash, Tabassum Arfa, Menezes Arul, Kirubarajan Arun, Mullokandov Asher, Sabharwal Ashish, Herrick Austin, Efrat Avia, Erdem Aykut, Karakaş Ayla, Roberts B. Ryan, Loe Bao Sheng, Zoph Barret, Bojanowski Bartłomiej, Özyurt Batuhan, Hedayatnia Behnam, Neyshabur Behnam, Inden Benjamin, Stein Benno, Ekmekci Berk, Lin Bill Yuchen, Howald Blake, Orinion Bryan, Diao Cameron, Dour Cameron, Stinson Catherine, Argueta Cedrick, Ramírez César Ferri, Singh Chandan, Rathkopf Charles, Meng Chenlin, Baral Chitta, Wu Chiyu, Callison-burch Chris, Waites Chris, Voigt Christian, Manning Christopher D., Potts Christopher, Ramirez Cindy, Rivera Clara E., Siro Clemencia, Raffel Colin, Ashcraft Courtney, Garbacea Cristina, Sileo Damien, Garrette Dan, Hendrycks Dan, Kilman Dan, Roth Dan, Freeman Daniel, Khashabi Daniel, Levy Daniel, González Daniel Moseguí, Perszyk Danielle, Hernandez Danny, Chen Danqi, Ippolito Daphne, Gilboa Dar, Dohan David, Drakard David, Jurgens David, Datta Debajyoti, Ganguli Deep, Emelin Denis, Kleyko Denis, Yuret Deniz, Chen Derek, Tam Derek, Hupkes Dieuwke, Misra Diganta, Buzan Dilyar, Mollo Dimitri Coelho, Yang Diyi, Lee Dong-ho, Schrader Dylan, Shutova Ekaterina, Cubuk Ekin Dogus, Segal Elad, Hagerman Eleanor, Barnes Elizabeth, Donoway Elizabeth, Pavlick Ellie, Rodola Emanuele, Lam Emma, Chu Eric, Tang Eric, Erdem Erkut, Chang Ernie, Chi Ethan A., Dyer Ethan, Jerzak Ethan, Kim Ethan, Manyasi Eunice Engefu, Zheltonozhskii Evgenii, Xia Fanyue, Siar Fatemeh, Martínez-plumed Fernando, Happé Francesca, Chollet Francois, Rong Frieda, Mishra Gaurav, Winata Genta Indra, De Melo Gerard, Kruszewski Germán, Parascandolo Giambattista, Mariani Giorgio, Wang Gloria, Jaimovitch-lópez Gonzalo, Betz Gregor, Gur-ari Guy, Galijasevic Hana, Kim Hannah, Rashkin Hannah, Hajishirzi Hannaneh, Mehta Harsh, Bogar Hayden, Shevlin Henry, Schütze Hinrich, Yakura Hiromu, Zhang Hongming, Wong Hugh Mee, Ng Ian, Noble Isaac, Jumelet Jaap, Geissinger Jack, Kernion Jackson, Hilton Jacob, Lee Jaehoon, Fisac Jaime Fernández, Simon James B., Koppel James, Zheng James, Zou James, Kocoń Jan, Thompson Jana, Wingfield Janelle, Kaplan Jared, Radom Jarema, Sohl-dickstein Jascha, Phang Jason, Wei Jason, Yosinski Jason, Novikova Jekaterina, Bosscher Jelle, Marsh Jennifer, Kim Jeremy, Taal Jeroen, Engel Jesse, Alabi Jesujoba, Xu Jiacheng, Song Jiaming, Tang Jillian, Waweru Joan, Burden John, Miller John, Balis John U., Batchelder Jonathan, Berant Jonathan, Frohberg Jörg, Rozen Jos, Hernandez-orallo Jose, Boudeman Joseph, Guerr Joseph, Jones Joseph, Tenenbaum Joshua B., Rule Joshua S., Chua Joyce, Kanclerz Kamil, Livescu Karen, Krauth Karl, Gopalakrishnan Karthik, Ignatyeva Katerina, Markert Katja, Dhole Kaustubh D., Gimpel Kevin, Omondi Kevin, Mathewson Kory, Chiafullo Kristen, Shkaruta Ksenia, Shridhar Kumar, Mcdonell Kyle, Richardson Kyle, Reynolds Laria, Gao Leo, Zhang Li, Dugan Liam, Qin Lianhui, Contreras-ochando Lidia, Morency Louis-philippe, Moschella Luca, Lam Lucas, Noble Lucy, Schmidt Ludwig, He Luheng, Colón Luis Oliveros, Metz Luke, Şenel Lütfi Kerem, Bosma Maarten, Sap Maarten, Ter Hoeve Maartje, Farooqi Maheen, Faruqui Manaal, Mazeika Mantas, Baturan Marco, Marelli Marco, Maru Marco, Quintana Maria Jose Ramírez, Tolkiehn Marie, Giulianelli Mario, Lewis Martha, Potthast Martin, Leavitt Matthew L., Hagen Matthias, Schubert Mátyás, Baitemirova Medina Orduna, Arnaud Melody, Mcelrath Melvin, Yee Michael A., Cohen Michael, Gu Michael, Ivanitskiy Michael, Starritt Michael, Strube Michael, Swędrowski Michał, Bevilacqua Michele, Yasunaga Michihiro, Kale Mihir, Cain Mike, Xu Mimee, Suzgun Mirac, Walker Mitch, Tiwari Mo, Bansal Mohit, Aminnaseri Moin, Geva Mor, Gheini Mozhdeh, T Mukund Varma, Peng Nanyun, Chi Nathan A., Lee Nayeon, Krakover Neta Gur-ari, Cameron Nicholas, Roberts Nicholas, Doiron Nick, Martinez Nicole, Nangia Nikita, Deckers Niklas, Muennighoff Niklas, Keskar Nitish Shirish, Iyer Niveditha S., Constant Noah, Fiedel Noah, Wen Nuan, Zhang Oliver, Agha Omar, Elbaghdadi Omar, Levy Omer, Evans Owain, Casares Pablo Antonio Moreno, Doshi Parth, Fung Pascale, Liang Paul Pu, Vicol Paul, Alipoormolabashi Pegah, Liao Peiyuan, Liang Percy, Chang Peter, Eckersley Peter, Htut Phu Mon, Hwang Pinyu, Miłkowski Piotr, Patil Piyush, Pezeshkpour Pouya, Oli Priti, Mei Qiaozhu, Lyu Qing, Chen Qinlang, Banjade Rabin, Rudolph Rachel Etta, Gabriel Raefer, Habacker Rahel, Risco Ramon, Millière Raphaël, Garg Rhythm, Barnes Richard, Saurous Rif A., Arakawa Riku, Raymaekers Robbe, Frank Robert, Sikand Rohan, Novak Roman, Sitelew Roman, Lebras Ronan, Liu Rosanne, Jacobs Rowan, Zhang Rui, Salakhutdinov Ruslan, Chi Ryan, Lee Ryan, Stovall Ryan, Teehan Ryan, Yang Rylan, Singh Sahib, Mohammad Saif M., Anand Sajant, Dillavou Sam, Shleifer Sam, Wiseman Sam, Gruetter Samuel, Bowman Samuel R., Schoenholz Samuel S., Han Sanghyun, Kwatra Sanjeev, Rous Sarah A., Ghazarian Sarik, Ghosh Sayan, Casey Sean, Bischoff Sebastian, Gehrmann Sebastian, Schuster Sebastian, Sadeghi Sepideh, Hamdan Shadi, Zhou Sharon, Srivastava Shashank, Shi Sherry, Singh Shikhar, Asaadi Shima, Gu Shixiang Shane, Pachchigar Shubh, Toshniwal Shubham, Upadhyay Shyam, Debnath Shyamolima (Shammie), Shakeri Siamak, Thormeyer Simon, Melzi Simone, Reddy Siva, Makini Sneha Priscilla, Lee Soo-hwan, Torene Spencer, Hatwar Sriharsha, Dehaene Stanislas, Divic Stefan, Ermon Stefano, Biderman Stella, Lin Stephanie, Prasad Stephen, Piantadosi Steven T., Shieber Stuart M., Misherghi Summer, Kiritchenko Svetlana, Mishra Swaroop, Linzen Tal, Schuster Tal, Li Tao, Yu Tao, Ali Tariq, Hashimoto Tatsu, Wu Te-lin, Desbordes Théo, Rothschild Theodore, Phan Thomas, Wang Tianle, Nkinyili Tiberius, Schick Timo, Kornev Timofei, Tunduny Titus, Gerstenberg Tobias, Chang Trenton, Neeraj Trishala, Khot Tushar, Shultz Tyler, Shaham Uri, Misra Vedant, Demberg Vera, Nyamai Victoria, Raunak Vikas, Ramasesh Vinay, Prabhu Vinay Uday, Padmakumar Vishakh, Srikumar Vivek, Fedus William, Saunders William, Zhang William, Vossen Wout, Ren Xiang, Tong Xiaoyu, Zhao Xinran, Wu Xinyi, Shen Xudong, Yaghoobzadeh Yadollah, Lakretz Yair, Song Yangqiu, Bahri Yasaman, Choi Yejin, Yang Yichi, Hao Yiding, Chen Yifu, Belinkov Yonatan, Hou Yu, Hou Yufang, Bai Yuntao, Seid Zachary, Zhao Zhuoye, Wang Zijian, Wang Zijie J., Wang Zirui, Wu Ziyi
- Conciseness: An Overlooked Language Task Stahlberg Felix, Kumar Aashish, Alberti Chris, Kumar Shankar
- QA4QG: Using Question Answering To Constrain Multi-hop Question Generation Su Dan, Xu Peng, Fung Pascale
- Towards Diverse, Relevant And Coherent Open-domain Dialogue Generation Via Hybrid Latent Variables Sun Bin, Li Yitong, Mi Fei, Wang Weichao, Li Yiwei, Li Kan
- Long-form Video-language Pre-training With Multimodal Temporal Contrastive Learning Sun Yuchong, Xue Hongwei, Song Ruihua, Liu Bei, Yang Huan, Fu Jianlong
- A Length-extrapolatable Transformer Sun Yutao, Dong Li, Patra Barun, Ma Shuming, Huang Shaohan, Benhaim Alon, Chaudhary Vishrav, Song Xia, Wei Furu
- OPT: Open Pre-trained Transformer Language Models Susan Zhang, Stephen Roller, Naman Goyal, Mikel Artetxe, Moya Chen, Shuohui Chen, Christopher Dewan, Mona Diab, Xian Li, Xi Victoria Lin, Todor Mihaylov, Myle Ott, Sam Shleifer, Kurt Shuster, Daniel Simig, Punit Singh Koura, Anjali Sridhar, Tianlu Wang, Luke Zettlemoyer
- On The Effect Of Pre-training For Transformer In Different Modality On Offline Reinforcement Learning Takagi Shiro
- ACORT: A Compact Object Relation Transformer For Parameter Efficient Image Captioning Tan Jia Huei, Tan Ying Hua, Chan Chee Seng, Chuah Joon Huang
- Etrica: Event-triggered Context-aware Story Generation Augmented By Cross Attention Tang Chen, Lin Chenghua, Huang Henglin, Guerin Frank, Zhang Zhihao
- Predicting Issue Types With Sebert Trautsch Alexander, Herbold Steffen
- Adavae: Exploring Adaptive Gpt-2s In Variational Auto-encoders For Language Modeling Tu Haoqin, Yang Zhongliang, Yang Jinshuai, Huang Yongfeng
- Efficient Few-shot Learning Without Prompts Tunstall Lewis, Reimers Nils, Jo Unso Eun Seo, Bates Luke, Korat Daniel, Wasserblat Moshe, Pereg Oren
- Towards Reasoning-aware Explainable VQA Vaideeswaran Rakesh, Gao Feng, Mathur Abhinav, Thattai Govind
- VLSP 2021 - Vimrc Challenge: Vietnamese Machine Reading Comprehension Van Nguyen Kiet, Tran Son Quoc, Nguyen Luan Thanh, Van Huynh Tin, Luu Son T., Nguyen Ngan Luu-thuy
- How Large Language Models Are Transforming Machine-paraphrased Plagiarism Wahle Jan Philip, Ruas Terry, Kirstein Frederic, Gipp Bela
- What Do They Capture? -- A Structural Analysis Of Pre-trained Language Models For Source Code Wan Yao, Zhao Wei, Zhang Hongyu, Sui Yulei, Xu Guandong, Jin Hai
- STRUDEL: Structured Dialogue Summarization For Dialogue Comprehension Wang Borui, Feng Chengcheng, Nair Arjun, Mao Madelyn, Desai Jai, Celikyilmaz Asli, Li Haoran, Mehdad Yashar, Radev Dragomir
- Knowledgeable Salient Span Mask For Enhancing Language Models As Knowledge Base Wang Cunxiang, Luo Fuli, Li Yanyang, Xu Runxin, Huang Fei, Zhang Yue
- Lilt: A Simple Yet Effective Language-independent Layout Transformer For Structured Document Understanding Wang Jiapeng, Jin Lianwen, Ding Kai
- TAG: Boosting Text-vqa Via Text-aware Visual Question-answer Generation Wang Jun, Gao Mingfei, Hu Yuqian, Selvaraju Ramprasaath R., Ramaiah Chetan, Xu Ran, Jaja Joseph F., Davis Larry S.
- Pretraining Without Attention Wang Junxiong, Yan Jing Nathan, Gu Albert, Rush Alexander M.
- What Language Model Architecture And Pretraining Objective Work Best For Zero-shot Generalization? Wang Thomas, Roberts Adam, Hesslow Daniel, Scao Teven Le, Chung Hyung Won, Beltagy Iz, Launay Julien, Raffel Colin
- Image As A Foreign Language: Beit Pretraining For All Vision And Vision-language Tasks Wang Wenhui, Bao Hangbo, Dong Li, Bjorck Johan, Peng Zhiliang, Liu Qiang, Aggarwal Kriti, Mohammed Owais Khan, Singhal Saksham, Som Subhojit, Wei Furu
- Finding Skill Neurons In Pre-trained Transformer-based Language Models Wang Xiaozhi, Wen Kaiyue, Zhang Zhengyan, Hou Lei, Liu Zhiyuan, Li Juanzi
- S-prompts Learning With Pre-trained Transformers: An Occam's Razor For Domain Incremental Learning Wang Yabin, Huang Zhiwu, Hong Xiaopeng
- Adamix: Mixture-of-adaptations For Parameter-efficient Model Tuning Wang Yaqing, Agarwal Sahaj, Mukherjee Subhabrata, Liu Xiaodong, Gao Jing, Awadallah Ahmed Hassan, Gao Jianfeng
- Mixed-effects Transformers For Hierarchical Adaptation White Julia, Goodman Noah, Hawkins Robert
- Adversarial Self-attention For Language Understanding Wu Hongqiu, Ding Ruixue, Zhao Hai, Xie Pengjun, Huang Fei, Zhang Min
- Stateful Memory-augmented Transformers For Efficient Dialogue Modeling Wu Qingyang, Yu Zhou
- Extreme Compression For Pre-trained Transformers Made Simple And Efficient Wu Xiaoxia, Yao Zhewei, Zhang Minjia, Li Conglong, He Yuxiong
- Memorizing Transformers Wu Yuhuai, Rabe Markus N., Hutchins Delesley, Szegedy Christian
- An Efficient Memory-augmented Transformer For Knowledge-intensive NLP Tasks Wu Yuxiang, Zhao Yu, Hu Baotian, Minervini Pasquale, Stenetorp Pontus, Riedel Sebastian
- Modeling Context With Linear Attention For Scalable Document-level Translation Wu Zhaofeng, Peng Hao, Pappas Nikolaos, Smith Noah A.
- Enhancing Pre-trained Models With Text Structure Knowledge For Question Generation Wu Zichen, Jia Xin, Qu Fanyi, Wu Yunfang
- Adapting Pretrained Text-to-text Models For Long Text Sequences Xiong Wenhan, Gupta Anchit, Toshniwal Shubham, Mehdad Yashar, Yih Wen-tau
- Integrating Translation Memories Into Non-autoregressive Machine Translation Xu Jitao, Crego Josep, Yvon François
- Learning To Answer Visual Questions From Web Videos Yang Antoine, Miech Antoine, Sivic Josef, Laptev Ivan, Schmid Cordelia
- Tableformer: Robust Transformer Modeling For Table-text Encoding Yang Jingfeng, Gupta Aditya, Upadhyay Shyam, He Luheng, Goel Rahul, Paul Shachi
- AD-DROP: Attribution-driven Dropout For Robust Language Model Fine-tuning Yang Tao, Deng Jinghao, Quan Xiaojun, Wang Qifan, Nie Shaoliang
- Building Knowledge-grounded Dialogue Systems With Graph-based Semantic Modeling Yang Yizhe, Huang Heyan, Gao Yang, Li Jiawei
- Zeroquant: Efficient And Affordable Post-training Quantization For Large-scale Transformers Yao Zhewei, Aminabadi Reza Yazdani, Zhang Minjia, Wu Xiaoxia, Li Conglong, He Yuxiong
- Random-ltd: Random And Layerwise Token Dropping Brings Efficient Training For Large-scale Transformers Yao Zhewei, Wu Xiaoxia, Li Conglong, Holmes Connor, Zhang Minjia, Li Cheng, He Yuxiong
- Structured Prompting: Scaling In-context Learning To 1,000 Examples Yaru Hao, Yutao Sun, Li Dong, Zhixiong Han, Yuxian Gu, Furu Wei
- Retrieval-augmented Multimodal Language Modeling Yasunaga Michihiro, Aghajanyan Armen, Shi Weijia, James Rich, Leskovec Jure, Liang Percy, Lewis Mike, Zettlemoyer Luke, Yih Wen-tau
- Modeling Multi-hop Question Answering As Single Sequence Prediction Yavuz Semih, Hashimoto Kazuma, Zhou Yingbo, Keskar Nitish Shirish, Xiong Caiming
- Alzheimer's Diagnosis And Generation-based Chatbot Using Hierarchical Attention And Transformer Park Jun Yeong, Shin Su Jong, Choi Chang Hwan, Lee Jung Jae, Choi Sang-il
- Categorizing Semantic Representations For Neural Machine Translation Yin Yongjing, Li Yafu, Meng Fandong, Zhou Jie, Zhang Yue
- Super-naturalinstructions: Generalization Via Declarative Instructions On 1600+ NLP Tasks Yizhong Wang, Swaroop Mishra, Pegah Alipoormolabashi, Yeganeh Kordi, Amirreza Mirzaei, Anjana Arunkumar, Arjun Ashok, Arut Selvan Dhanasekaran, Atharva Naik, David Stap, Eshaan Pathak, Giannis Karamanolakis, Haizhi Gary Lai, Ishan Purohit, Ishani Mondal, Jacob Anderson, Kirby Kuznia, Krima Doshi, Maitreya Patel, Kuntal Kumar Pal, Mehrad Moradshahi, Mihir Parmar, Mirali Purohit, Neeraj Varshney, Phani Rohitha Kaza, Pulkit Verma, Ravsehaj Singh Puri, Rushang Karia, Shailaja Keyur Sampat, Savan Doshi, Siddhartha Mishra, Sujan Reddy, Sumanta Patro, Tanay Dixit, Xudong Shen, Chitta Baral, Yejin Choi, Noah A. Smith, Hannaneh Hajishirzi, Daniel Khashabi
- Scaling Autoregressive Models For Content-rich Text-to-image Generation Yu Jiahui, Xu Yuanzhong, Koh Jing Yu, Luong Thang, Baid Gunjan, Wang Zirui, Vasudevan Vijay, Ku Alexander, Yang Yinfei, Ayan Burcu Karagol, Hutchinson Ben, Han Wei, Parekh Zarana, Li Xin, Zhang Han, Baldridge Jason, Wu Yonghui
- Efficient Language Modeling With Sparse All-mlp Yu Ping, Artetxe Mikel, Ott Myle, Shleifer Sam, Gong Hongyu, Stoyanov Ves, Li Xian
- Tri-attention: Explicit Context-aware Attention Mechanism For Natural Language Processing Yu Rui, Li Yifeng, Lu Wenpeng, Cao Longbing
- Bilaterally Slimmable Transformer For Elastic And Efficient Visual Question Answering Yu Zhou, Jin Zitian, Yu Jun, Xu Mingliang, Wang Hongbo, Fan Jianping
- Hype: Better Pre-trained Language Model Fine-tuning With Hidden Representation Perturbation Yuan Hongyi, Yuan Zheng, Tan Chuanqi, Huang Fei, Huang Songfang
- Wudaomm: A Large-scale Multi-modal Dataset For Pre-training Models Yuan Sha, Zhao Shuai, Leng Jiahong, Xue Zhao, Zhao Hanyu, Liu Peiyu, Gong Zheng, Zhao Wayne Xin, Li Junyi, Tang Jie
- Competition-level Code Generation With Alphacode Yujia Li, David Choi, Junyoung Chung, Nate Kushman, Julian Schrittwieser, Rémi Leblond, Tom Eccles, James Keeling, Felix Gimeno, Agustin Dal Lago, Thomas Hubert, Peter Choy, Cyprien De Masson D'autume, Igor Babuschkin, Xinyun Chen, Po-sen Huang, Johannes Welbl, Sven Gowal, Alexey Cherepanov, James Molloy, Daniel J. Mankowitz, Esme Sutherland Robson, Pushmeet Kohli, Nando De Freitas, Koray Kavukcuoglu, Oriol Vinyals
- Dyrex: Dynamic Query Representation For Extractive Question Answering Zaratiana Urchade, Khbir Niama El, Núñez Dennis, Holat Pierre, Tomeh Nadi, Charnois Thierry
- Better Pre-training By Reducing Representation Confusion Zhang Haojie, Liang Mingfei, Xie Ruobing, Sun Zhenlong, Zhang Bo, Lin Leyu
- Improved Data Augmentation For Translation Suggestion Zhang Hongxiao, Lai Siyu, Zhang Songming, Huang Hui, Chen Yufeng, Xu Jinan, Liu Jian
- Scala: Accelerating Adaptation Of Pre-trained Transformer-based Language Models Via Efficient Large-batch Adversarial Noise Zhang Minjia, Naresh Niranjan Uma, He Yuxiong
- Analyzing And Simulating User Utterance Reformulation In Conversational Recommender Systems Zhang Shuo, Wang Mu-chun, Balog Krisztian
- Unveiling Transformers With LEGO: A Synthetic Reasoning Task Zhang Yi, Backurs Arturs, Bubeck Sébastien, Eldan Ronen, Gunasekar Suriya, Wagner Tal
- BLISS: Robust Sequence-to-sequence Learning Via Self-supervised Input Representation Zhang Zheng, Ding Liang, Cheng Dazhao, Liu Xuebo, Zhang Min, Tao Dacheng
- Can Offline Reinforcement Learning Help Natural Language Understanding? Zhang Ziqi, Wang Yile, Zhang Yue, Wang Donglin
- M-adapter: Modality Adaptation For End-to-end Speech-to-text Translation Zhao Jinming, Yang Hao, Shareghi Ehsan, Haffari Gholamreza
- Medical Dialogue Response Generation With Pivotal Information Recalling Zhao Yu, Li Yunxin, Wu Yuxiang, Hu Baotian, Chen Qingcai, Wang Xiaolong, Ding Yuxin, Zhang Min
- Learning To Decompose: Hypothetical Question Decomposition Based On Comparable Texts Zhou Ben, Richardson Kyle, Yu Xiaodong, Roth Dan
- Syntax-informed Question Answering With Heterogeneous Graph Transformer Zhu Fangyi, Tan Lok You, Ng See-kiong, Bressan Stéphane
- Future Sight: Dynamic Story Generation With Large Pretrained Language Models Zimmerman Brian D., Sahu Gaurav, Vechtomova Olga
- Exploring And Evaluating Personalized Models For Code Generation Zlotchevski Andrei, Drain Dawn, Svyatkovskiy Alexey, Clement Colin, Sundaresan Neel, Tufano Michele
- St-moe: Designing Stable And Transferable Sparse Expert Models Zoph Barret, Bello Irwan, Kumar Sameer, Du Nan, Huang Yanping, Dean Jeff, Shazeer Noam, Fedus William
- Efficient Long Sequence Modeling Via State Space Augmented Transformer Zuo Simiao, Liu Xiaodong, Jiao Jian, Charles Denis, Manavoglu Eren, Zhao Tuo, Gao Jianfeng
- Attention Over Pre-trained Sentence Embeddings For Long Document Classification Abdaoui Amine, Dutta Sourav
- Beyond Traditional Teaching: The Potential Of Large Language Models And Chatbots In Graduate Engineering Education Abedi Mahyar, Alshybani Ibrahem, Shahadat Muhammad Rubayat Bin, Murillo Michael S.
- On The Relationship Between Skill Neurons And Robustness In Prompt Tuning Ackermann Leon, Ohmer Xenia
- Story Visualization By Online Text Augmentation With Context Memory Ahn Daechul, Kim Daneul, Song Gwangmo, Kim Seung Hwan, Lee Honglak, Kang Dongyeop, Choi Jonghyun
- The Emergence Of Essential Sparsity In Large Pre-trained Models: The Weights That Matter Ajay Jaiswal, Shiwei Liu, Tianlong Chen, Zhangyang Wang
- Can Language Models Employ The Socratic Method? Experiments With Code Debugging Al-hossami Erfan, Bunescu Razvan, Smith Justin, Teehan Ryan
- Mamba: Linear-time Sequence Modeling With Selective State Spaces Albert Gu, Tri Dao
- OPT-R: Exploring The Role Of Explanations In Finetuning And Prompting For Reasoning Skills Of Large Language Models Alkhamissi Badr, Verma Siddharth, Yu Ping, Jin Zhijing, Celikyilmaz Asli, Diab Mona
- Prompt-engineering And Transformer-based Question Generation And Evaluation Amyeen Rubaba
- Randomized Positional Encodings Boost Length Generalization Of Transformers Anian Ruoss, Grégoire Delétang, Tim Genewein, Jordi Grau-moya, Róbert Csordás, Mehdi Bennani, Shane Legg, Joel Veness
- Distilling Efficient Language-specific Models For Cross-lingual Transfer Ansell Alan, Ponti Edoardo Maria, Korhonen Anna, Vulić Ivan
- Scaling Laws For Language Encoding Models In Fmri Antonello Richard, Vaidya Aditya, Huth Alexander G.
- Mixture Of Tokens: Efficient Llms Through Cross-example Aggregation Antoniak Szymon, Jaszczur Sebastian, Krutul Michał, Pióro Maciej, Krajewski Jakub, Ludziejewski Jan, Odrzygóźdź Tomasz, Cygan Marek
- Slimfit: Memory-efficient Fine-tuning Of Transformer-based Models Using Training Dynamics Ardakani Arash, Haan Altan, Tan Shangyin, Popovici Doru Thom, Cheung Alvin, Iancu Costin, Sen Koushik
- Scaling Transformer To 1M Tokens And Beyond With RMT Aydar Bulatov, Yuri Kuratov, Yermek Kapushev, Mikhail S. Burtsev
- Chatgpt: Applications, Opportunities, And Threats Bahrini Aram, Khamoshifar Mohammadsadra, Abbasimehr Hossein, Riggs Robert J., Esmaeili Maryam, Majdabadkohne Rastin Mastali, Pasehvar Morteza
- Gpt-4: A Review On Advancements And Opportunities In Natural Language Processing Baktash Jawid Ahmad, Dawodi Mursal
- Investigating Pre-trained Language Models On Cross-domain Datasets, A Step Closer To General AI Ballout Mohamad, Krumnack Ulf, Heidemann Gunther, Kühnberger Kai-uwe
- Llamantino: Llama 2 Models For Effective Text Generation In Italian Language Basile Pierpaolo, Musacchio Elio, Polignano Marco, Siciliani Lucia, Fiameni Giuseppe, Semeraro Giovanni
- Text Summarization Using Large Language Models: A Comparative Study Of Mpt-7b-instruct, Falcon-7b-instruct, And Openai Chat-gpt Models Basyal Lochan, Sanghvi Mihir
- Efficient Pre-training For Localized Instruction Generation Of Videos Batra Anil, Moltisanti Davide, Sevilla-lara Laura, Rohrbach Marcus, Keller Frank
- Alternating Updates For Efficient Transformers Baykal Cenk, Cutler Dylan, Dikkala Nishanth, Ghosh Nikhil, Panigrahy Rina, Wang Xin
- Span-selective Linear Attention Transformers For Effective And Robust Schema-guided Dialogue State Tracking Bebensee Björn, Lee Haejun
- Examining The Emergence Of Deductive Reasoning In Generative Language Models Belcak Peter, Lanzendörfer Luca A., Wattenhofer Roger
- Harnessing Gpt-3.5-turbo For Rhetorical Role Prediction In Legal Cases Belfathi Anas, Hernandez Nicolas, Monceaux Laura
- Labrador: Exploring The Limits Of Masked Language Modeling For Laboratory Data Bellamy David R., Kumar Bhawesh, Wang Cindy, Beam Andrew
- When Automated Assessment Meets Automated Content Generation: Examining Text Quality In The Era Of Gpts Bevilacqua Marialena, Oketch Kezia, Qin Ruiyang, Stamey Will, Zhang Xinyuan, Gan Yi, Yang Kai, Abbasi Ahmed
- TART: A Plug-and-play Transformer Module For Task-agnostic Reasoning Bhatia Kush, Narayan Avanika, De Sa Christopher, Ré Christopher
- Automating Question Generation From Educational Text Bhowmick Ayan Kumar, Jagmohan Ashish, Vempaty Aditya, Dey Prasenjit, Hall Leigh, Hartman Jeremy, Kokku Ravi, Maheshwari Hema
- Birth Of A Transformer: A Memory Viewpoint Bietti Alberto, Cabannes Vivien, Bouchacourt Diane, Jegou Herve, Bottou Leon
- RWKV: Reinventing Rnns For The Transformer Era Bo Peng, Eric Alcaide, Quentin Anthony, Alon Albalak, Samuel Arcadinho, Stella Biderman, Huanqi Cao, Xin Cheng, Michael Chung, Matteo Grella, Kranthi Kiran Gv, Xuzheng He, Haowen Hou, Jiaju Lin, Przemyslaw Kazienko, Jan Kocon, Jiaming Kong, Bartlomiej Koptyra, Hayden Lau, Krishna Sri Ipsit Mantri, Ferdinand Mom, Atsushi Saito, Guangyu Song, Xiangru Tang, Bolun Wang, Johan S. Wind, Stanislaw Wozniak, Ruichong Zhang, Zhenyuan Zhang, Qihang Zhao, Peng Zhou, Qinghua Zhou, Jian Zhu, Rui-jie Zhu
- Ask Language Model To Clean Your Noisy Translation Data Bolding Quinten, Liao Baohao, Denis Brandon James, Luo Jun, Monz Christof
- Pixlore: A Dataset-driven Approach To Rich Image Captioning Bonilla Diego
- Fine-grained Affective Processing Capabilities Emerging From Large Language Models Broekens Joost, Hilpert Bernhard, Verberne Suzan, Baraka Kim, Gebhard Patrick, Plaat Aske
- Efficient Transformer Knowledge Distillation: A Performance Review Brown Nathan, Williamson Ashton, Anderson Tahj, Lawrence Logan
- Accuracy Of A Vision-language Model On Challenging Medical Cases Buckley Thomas, Diao James A., Rodman Adam, Manrai Arjun K.
- Utilizing Chatgpt Generated Data To Retrieve Depression Symptoms From Social Media Bucur Ana-maria
- Codetf: One-stop Transformer Library For State-of-the-art Code LLM Bui Nghi D. Q., Le Hung, Wang Yue, Li Junnan, Gotmare Akhilesh Deepak, Hoi Steven C. H.
- Finding The Needle In A Haystack: Unsupervised Rationale Extraction From Long Text Classifiers Bujel Kamil, Caines Andrew, Yannakoudakis Helen, Rei Marek
- Development Of The Chatgpt, Generative Artificial Intelligence And Natural Large Language Models For Accountable Reporting And Use (CANGARU) Guidelines Cacciamani Giovanni E., Eppler Michael B., Ganjavi Conner, Pekan Asli, Biedermann Brett, Collins Gary S., Gill Inderbir S.
- Scaling In-context Demonstrations With Structured Attention Cai Tianle, Huang Kaixuan, Lee Jason D., Wang Mengdi
- Do Large Language Models Resemble Humans In Language Use? Cai Zhenguang G., Duan Xufeng, Haslett David A., Wang Shuqi, Pickering Martin J.
- Can Transformers Learn Sequential Function Classes In Context? Campbell Ryan, Guo Emma, Hu Evan, Vir Reya, Hsiao Ethan
- H2ogpt: Democratizing Large Language Models Candel Arno, Mckinney Jon, Singer Philipp, Pfeiffer Pascal, Jeblick Maximilian, Prabhu Prithvi, Gambera Jeff, Landry Mark, Bansal Shivam, Chesler Ryan, Lee Chun Ming, Conde Marcos V., Stetsenko Pasha, Grellier Olivier, Ambati Srisatish
- Pumer: Pruning And Merging Tokens For Efficient Vision Language Models Cao Qingqing, Paranjape Bhargavi, Hajishirzi Hannaneh
- Revolutionizing Mobile Interaction: Enabling A 3 Billion Parameter GPT LLM On Mobile Carreira Samuel, Marques Tomás, Ribeiro José, Grilo Carlos
- Do Androids Know They're Only Dreaming Of Electric Sheep? Ch-wang Sky, Van Durme Benjamin, Eisner Jason, Kedzie Chris
- Exploring Answer Information Methods For Question Generation With Transformers Chafekar Talha, Hussain Aafiya, Sharma Grishma, Sharma Deepak
- Muse: Text-to-image Generation Via Masked Generative Transformers Chang Huiwen, Zhang Han, Barber Jarred, Maschinot Aj, Lezama Jose, Jiang Lu, Yang Ming-hsuan, Murphy Kevin, Freeman William T., Rubinstein Michael, Li Yuanzhen, Krishnan Dilip
- Language Model Behavior: A Comprehensive Survey Chang Tyler A., Bergen Benjamin K.
- Not All Layers Are Equally As Important: Every Layer Counts BERT Charpentier Lucas Georges Gabriel, Samuel David
- Efficient GPT Model Pre-training Using Tensor Train Matrix Representation Chekalina Viktoriia, Novikov Georgii, Gusak Julia, Oseledets Ivan, Panchenko Alexander
- Sudden Drops In The Loss: Syntax Acquisition, Phase Transitions, And Simplicity Bias In Mlms Chen Angelica, Shwartz-ziv Ravid, Cho Kyunghyun, Leavitt Matthew L., Saphra Naomi
- Generate Labeled Training Data Using Prompt Programming And GPT-3. An Example Of Big Five Personality Classification Chen Eason
- Tem-adapter: Adapting Image-text Pretraining For Video Question Answer Chen Guangyi, Liu Xiao, Wang Guangrun, Zhang Kun, Torr Philip H. S., Zhang Xiao-ping, Tang Yansong
- CLEX: Continuous Length Extrapolation For Large Language Models Chen Guanzheng, Li Xin, Meng Zaiqiao, Liang Shangsong, Bing Lidong
- Interpreting And Controlling Vision Foundation Models Via Text Explanations Chen Haozhe, Yang Junfeng, Vondrick Carl, Mao Chengzhi
- Walking Down The Memory Maze: Beyond Context Limit Through Interactive Reading Chen Howard, Pasunuru Ramakanth, Weston Jason, Celikyilmaz Asli
- Unlearn What You Want To Forget: Efficient Unlearning For Llms Chen Jiaao, Yang Diyi
- Evaluating Factual Consistency Of Summaries With Large Language Models Chen Shiqi, Gao Siyang, He Junxian
- Extending Context Window Of Large Language Models Via Positional Interpolation Chen Shouyuan, Wong Sherman, Chen Liangjian, Tian Yuandong
- Reason Out Your Layout: Evoking The Layout Master From Large Language Models For Text-to-image Synthesis Chen Xiaohui, Liu Yongfei, Yang Yingxiang, Yuan Jianbo, You Quanzeng, Liu Li-ping, Yang Hongxia
- Medprompt: Cross-modal Prompting For Multi-task Medical Image Translation Chen Xuhang, Pun Chi-man, Wang Shuqiang
- Confidant: Customizing Transformer-based Llms Via Collaborative Edge Training Chen Yuhao, Yan Yuxuan, Yang Qianqian, Shu Yuanchao, He Shibo, Chen Jiming
- Token Prediction As Implicit Classification To Identify Llm-generated Text Chen Yutian, Kang Hao, Zhai Vivian, Li Liangze, Singh Rita, Raj Bhiksha
- Visual Chatgpt: Talking, Drawing And Editing With Visual Foundation Models Chenfei Wu, Shengming Yin, Weizhen Qi, Xiaodong Wang, Zecheng Tang, Nan Duan
- Latent Positional Information Is In The Self-attention Variance Of Transformer Language Models Without Positional Embeddings Chi Ta-chung, Fan Ting-han, Chen Li-wei, Rudnicky Alexander I., Ramadge Peter J.
- Attention Alignment And Flexible Positional Embeddings Improve Transformer Length Extrapolation Chi Ta-chung, Fan Ting-han, Rudnicky Alexander I.
- Transformer Working Memory Enables Regular Language Reasoning And Natural Language Length Extrapolation Chi Ta-chung, Fan Ting-han, Rudnicky Alexander I., Ramadge Peter J.
- Language Grounded Qformer For Efficient Vision Language Understanding Choraria Moulik, Sekhar Nitesh, Wu Yue, Zhang Xu, Singhal Prateek, Varshney Lav R.
- Tutorials On Stance Detection Using Pre-trained Language Models: Fine-tuning BERT And Prompting Large Language Models Chuang Yun-shiuan
- Dola: Decoding By Contrasting Layers Improves Factuality In Large Language Models Chuang Yung-sung, Xie Yujia, Luo Hongyin, Kim Yoon, Glass James, He Pengcheng
- Inducing Anxiety In Large Language Models Increases Exploration And Bias Coda-forno Julian, Witte Kristin, Jagadish Akshay K., Binz Marcel, Akata Zeynep, Schulz Eric
- Seamless: Multilingual Expressive And Streaming Speech Translation Communication Seamless, Barrault Loïc, Chung Yu-an, Meglioli Mariano Coria, Dale David, Dong Ning, Duppenthaler Mark, Duquenne Paul-ambroise, Ellis Brian, Elsahar Hady, Haaheim Justin, Hoffman John, Hwang Min-jae, Inaguma Hirofumi, Klaiber Christopher, Kulikov Ilia, Li Pengwei, Licht Daniel, Maillard Jean, Mavlyutov Ruslan, Rakotoarison Alice, Sadagopan Kaushik Ram, Ramakrishnan Abinesh, Tran Tuan, Wenzek Guillaume, Yang Yilin, Ye Ethan, Evtimov Ivan, Fernandez Pierre, Gao Cynthia, Hansanti Prangthip, Kalbassi Elahe, Kallet Amanda, Kozhevnikov Artyom, Gonzalez Gabriel Mejia, Roman Robin San, Touret Christophe, Wong Corinne, Wood Carleigh, Yu Bokai, Andrews Pierre, Balioglu Can, Chen Peng-jen, Costa-jussà Marta R., Elbayad Maha, Gong Hongyu, Guzmán Francisco, Heffernan Kevin, Jain Somya, Kao Justine, Lee Ann, Ma Xutai, Mourachko Alex, Peloquin Benjamin, Pino Juan, Popuri Sravya, Ropers Christophe, Saleem Safiyyah, Schwenk Holger, Sun Anna, Tomasello Paden, Wang Changhan, Wang Jeff, Wang Skyler, Williamson Mary
- Robust Infidelity: When Faithfulness Measures On Masked Language Models Are Misleading Crothers Evan, Viktor Herna, Japkowicz Nathalie
- Large Language Models For Compiler Optimization Cummins Chris, Seeker Volker, Grubisic Dejan, Elhoushi Mostafa, Liang Youwei, Roziere Baptiste, Gehring Jonas, Gloeckle Fabian, Hazelwood Kim, Synnaeve Gabriel, Leather Hugh
- Evaluating Prompt-based Question Answering For Object Prediction In The Open Research Knowledge Graph D'souza Jennifer, Hrou Moussab, Auer Sören
- Syntactic Knowledge Via Graph Attention With BERT In Machine Translation Dai Yuqian, Sharoff Serge, De Kamps Marc
- An Adversarial Example For Direct Logit Attribution: Memory Management In Gelu-4l Dao James, Lau Yeu-tong, Rager Can, Janiak Jett
- Flashattention-2: Faster Attention With Better Parallelism And Work Partitioning Dao Tri
- Im-promptu: In-context Composition From Image Prompts Dedhia Bhishma, Chang Michael, Snell Jake C., Griffiths Thomas L., Jha Niraj K.
- Scaling Vision Transformers To 22 Billion Parameters Dehghani Mostafa, Djolonga Josip, Mustafa Basil, Padlewski Piotr, Heek Jonathan, Gilmer Justin, Steiner Andreas, Caron Mathilde, Geirhos Robert, Alabdulmohsin Ibrahim, Jenatton Rodolphe, Beyer Lucas, Tschannen Michael, Arnab Anurag, Wang Xiao, Riquelme Carlos, Minderer Matthias, Puigcerver Joan, Evci Utku, Kumar Manoj, Van Steenkiste Sjoerd, Elsayed Gamaleldin F., Mahendran Aravindh, Yu Fisher, Oliver Avital, Huot Fantine, Bastings Jasmijn, Collier Mark Patrick, Gritsenko Alexey, Birodkar Vighnesh, Vasconcelos Cristina, Tay Yi, Mensink Thomas, Kolesnikov Alexander, Pavetić Filip, Tran Dustin, Kipf Thomas, Lučić Mario, Zhai Xiaohua, Keysers Daniel, Harmsen Jeremiah, Houlsby Neil
- Context-aware Transformer Pre-training For Answer Sentence Selection Di Liello Luca, Garg Siddhant, Moschitti Alessandro
- Jump To Conclusions: Short-cutting Transformers With Linear Transformations Din Alexander Yom, Karidi Taelin, Choshen Leshem, Geva Mor
- Longnet: Scaling Transformers To 1,000,000,000 Tokens Ding Jiayu, Ma Shuming, Dong Li, Zhang Xingxing, Huang Shaohan, Wang Wenhui, Zheng Nanning, Wei Furu
- Self-agreement: A Framework For Fine-tuning Language Models To Find Agreement Among Diverse Opinions Ding Shiyao, Ito Takayuki
- Fluid Transformers And Creative Analogies: Exploring Large Language Models' Capacity For Augmenting Cross-domain Analogical Creativity Ding Zijian, Srinivasan Arvind, Macneil Stephen, Chan Joel
- Block-wise Bit-compression Of Transformer-based Models Dong Gaochen, Chen Wei
- Blockwise Compression Of Transformer-based Models Without Retraining Dong Gaochen, Chen Wei
- Large Language Models Douglas Michael R.
- Lauragpt: Listen, Attend, Understand, And Regenerate Audio With GPT Du Zhihao, Wang Jiaming, Chen Qian, Chu Yunfei, Gao Zhifu, Li Zerui, Hu Kai, Zhou Xiaohuan, Xu Jin, Ma Ziyang, Wang Wen, Zheng Siqi, Zhou Chang, Yan Zhijie, Zhang Shiliang
- Stack Attention: Improving The Ability Of Transformers To Model Hierarchical Patterns Dusell Brian, Chiang David
- Faith And Fate: Limits Of Transformers On Compositionality Dziri Nouha, Lu Ximing, Sclar Melanie, Li Xiang Lorraine, Jiang Liwei, Lin Bill Yuchen, West Peter, Bhagavatula Chandra, Bras Ronan Le, Hwang Jena D., Sanyal Soumya, Welleck Sean, Ren Xiang, Ettinger Allyson, Harchaoui Zaid, Choi Yejin
- Tinystories: How Small Can Language Models Be And Still Speak Coherent English? Eldan Ronen, Li Yuanzhi
- Octopus: A Multitask Model And Toolkit For Arabic Natural Language Generation Elmadany Abdelrahim, Nagoudi El Moatez Billah, Abdul-mageed Muhammad
- A Comprehensive Performance Study Of Large Language Models On Novel AI Accelerators Emani Murali, Foreman Sam, Sastry Varuni, Xie Zhen, Raskar Siddhisanket, Arnold William, Thakur Rajeev, Vishwanath Venkatram, Papka Michael E.
- Jacotext: A Pretrained Model For Java Code-text Generation Espejel Jessica López, Alassan Mahaman Sanoussi Yahaya, Dahhane Walid, Ettifouri El Hassane
- NAP At Semeval-2023 Task 3: Is Less Really More? (back-)translation As Data Augmentation Strategies For Detecting Persuasion Techniques Falk Neele, Eichel Annerose, Piccirilli Prisca
- Instructseq: Unifying Vision Tasks With Instruction-conditioned Multi-modal Sequence Generation Fang Rongyao, Yan Shilin, Huang Zhaoyang, Zhou Jingqiu, Tian Hao, Dai Jifeng, Li Hongsheng
- Extending Context Window Of Large Language Models Via Semantic Compression Fei Weizhi, Niu Xueyan, Zhou Pingyi, Hou Lu, Bai Bo, Deng Lei, Han Wei
- Explaining How Transformers Use Context To Build Predictions Ferrando Javier, Gállego Gerard I., Tsiamas Ioannis, Costa-jussà Marta R.
- Grounded Complex Task Segmentation For Conversational Assistants Ferreira Rafael, Semedo David, Magalhães João
- Vision Language Transformers: A Survey Fields Clayton, Kennington Casey
- Breaking The Language Barrier: Improving Cross-lingual Reasoning With Structured Self-attention Foroutan Negar, Banaei Mohammadreza, Aberer Karl, Bosselut Antoine
- Sparsegpt: Massive Language Models Can Be Accurately Pruned In One-shot Frantar Elias, Alistarh Dan
- The Inner Sentiments Of A Thought Gagne Chris, Dayan Peter
- Cerbero-7b: A Leap Forward In Language-specific Llms Through Enhanced Chat Corpus Generation And Evaluation Galatolo Federico A., Cimino Mario G. C. A.
- Roles Of Scaling And Instruction Tuning In Language Perception: Model Vs. Human Attention Gao Changjiang, Huang Shujian, Li Jixing, Chen Jiajun
- Examining User-friendly And Open-sourced Large GPT Models: A Survey On Language, Multimodal, And Scientific GPT Models Gao Kaiyuan, He Sunan, He Zhenyu, Lin Jiacheng, Pei Qizhi, Shao Jie, Zhang Wei
- Ahead-of-time P-tuning Gavrilov Daniil, Balagansky Nikita
- Making Llama SEE And Draw With SEED Tokenizer Ge Yuying, Zhao Sijie, Zeng Ziyun, Ge Yixiao, Li Chen, Wang Xintao, Shan Ying
- Does Synthetic Data Make Large Language Models More Efficient? Gholami Sia, Omar Marwan
- Resetox: Re-learning Attention Weights For Toxicity Mitigation In Machine Translation Gilabert Javier García, Escolano Carlos, Costa-jussà Marta R.
- Generative AI Beyond Llms: System Implications Of Multi-modal Generation Golden Alicia, Hsia Samuel, Sun Fei, Acun Bilge, Hosmer Basil, Lee Yejin, Devito Zachary, Johnson Jeff, Wei Gu-yeon, Brooks David, Wu Carole-jean
- Model-generated Pretraining Signals Improves Zero-shot Generalization Of Text-to-text Transformers Gong Linyuan, Xiong Chenyan, Liu Xiaodong, Bajaj Payal, Xie Yiqing, Cheung Alvin, Gao Jianfeng, Song Xia
- Using Large Language Models For Interpreting Autonomous Robots Behaviors González-santamarta Miguel A., Fernández-becerra Laura, Sobrín-hidalgo David, Guerrero-higueras Ángel Manuel, González Irene, Lera Francisco J. Rodríguez
- AMAGO: Scalable In-context Reinforcement Learning For Adaptive Agents Grigsby Jake, Fan Linxi, Zhu Yuke
- Hiformer: Heterogeneous Feature Interactions Learning With Transformers For Recommender Systems Gui Huan, Wang Ruoxi, Yin Ke, Jin Long, Kula Maciej, Xu Taibai, Hong Lichan, Chi Ed H.
- SPT: Fine-tuning Transformer-based Language Models Efficiently With Sparsification Gui Yuntao, Yan Xiao, Yin Peiqi, Yang Han, Cheng James
- Longcoder: A Long-range Pre-trained Language Model For Code Completion Guo Daya, Xu Canwen, Duan Nan, Yin Jian, Mcauley Julian
- Editing Common Sense In Transformers Gupta Anshita, Mondal Debanjan, Sheshadri Akshay Krishna, Zhao Wenlong, Li Xiang Lorraine, Wiegreffe Sarah, Tandon Niket
- A Theory Of Emergent In-context Learning As Implicit Structure Induction Hahn Michael, Goyal Navin
- Large Language Models Meet Computer Vision: A Brief Survey Hamadi Raby
- Lm-infinite: Zero-shot Extreme Length Generalization For Large Language Models Han Chi, Wang Qifan, Peng Hao, Xiong Wenhan, Chen Yu, Ji Heng, Wang Sinong
- Blockwise Parallel Transformer For Large Context Models Liu Hao, Abbeel Pieter
- Test-time Training On Nearest Neighbors For Large Language Models Hardt Moritz, Sun Yu
- Unlocking The Potential Of Chatgpt: A Comprehensive Exploration Of Its Applications, Advantages, Limitations, And Future Directions In Natural Language Processing Hariri Walid
- CHAI-DT: A Framework For Prompting Conversational Generative AI Agents To Actively Participate In Co-creation Harwood Brandon
- Zero- And Few-shot Prompting With Llms: A Comparative Study With Fine-tuned Models For Bangla Sentiment Analysis Hasan Md. Arid, Das Shudipta, Anjum Afiyat, Alam Firoj, Anjum Anika, Sarker Avijit, Noori Sheak Rashed Haider
- Linear Latent World Models In Simple Transformers: A Case Study On Othello-gpt Hazineh Dean S., Zhang Zechen, Chiu Jeffery
- Multi-modal Latent Space Learning For Chain-of-thought Reasoning In Language Models He Liqi, Li Zuchao, Cai Xiantao, Wang Ping
- Multimodal Graph Transformer For Multimodal Question Answering He Xuehai, Wang Xin Eric
- Divide Et Impera: Multi-transformer Architectures For Complex Nlp-tasks Helland Solveig, Gavagnin Elena, De Spindler Alexandre
- Simplemtod: A Simple Language Model For Multimodal Task-oriented Dialogue With Symbolic Scene Representation Hemanthage Bhathiya, Dondrup Christian, Bartie Phil, Lemon Oliver
- How Good Are GPT Models At Machine Translation? A Comprehensive Evaluation Hendy Amr, Abdelrehim Mohamed, Sharaf Amr, Raunak Vikas, Gabr Mohamed, Matsushita Hitokazu, Kim Young Jin, Afify Mohamed, Awadalla Hany Hassan
- Stochastic Parrots Looking For Stochastic Parrots: Llms Are Easy To Fine-tune And Hard To Detect With Other Llms Da Silva Gameiro Henrique, Kucharavy Andrei, Guerraoui Rachid
- Improving Long Context Document-level Machine Translation Herold Christian, Ney Hermann
- Backpack Language Models Hewitt John, Thickstun John, Manning Christopher D., Liang Percy
- Newsgpt: Chatgpt Integration For Robot-reporter Hireche Abdelhadi, Belkacem Abdelkader Nasreddine, Jamil Sadia, Chen Chao
- L2MAC: Large Language Model Automatic Computer For Extensive Code Generation Holt Samuel, Luyten Max Ruiz, Van Der Schaar Mihaela
- SPEED: Speculative Pipelined Execution For Efficient Decoding Hooper Coleman, Kim Sehoon, Mohammadzadeh Hiva, Genc Hasan, Keutzer Kurt, Gholami Amir, Shao Sophia
- COMMA: Co-articulated Multi-modal Learning Hu Lianyu, Gao Liqing, Liu Zekang, Pun Chi-man, Feng Wei
- Enhancing Answer Selection In Community Question Answering With Pre-trained And Large Language Models Hu Xinghang
- Towards Equipping Transformer With The Ability Of Systematic Compositionality Huang Chen, Qin Peixin, Lei Wenqiang, Lv Jiancheng
- Agentcoder: Multi-agent-based Code Generation With Iterative Testing And Optimisation Huang Dong, Zhang Jie M., Luck Michael, Bu Qingwen, Qing Yuhao, Cui Heming
- Directed Acyclic Transformer Pre-training For High-quality Non-autoregressive Text Generation Huang Fei, Ke Pei, Huang Minlie
- Chatgpt For Shaping The Future Of Dentistry: The Potential Of Multi-modal Large Language Model Huang Hanyao, Zheng Ou, Wang Dongdong, Yin Jiayi, Wang Zijin, Ding Shengxuan, Yin Heng, Xu Chuan, Yang Renjie, Zheng Qian, Shi Bing
- Generic Attention-model Explainability By Weighted Relevance Accumulation Huang Yiming, Jia Aozhe, Zhang Xiaodan, Zhang Jiawei
- Advancing Transformer Architecture In Long-context Large Language Models: A Comprehensive Survey Huang Yunpeng, Xu Jingwei, Lai Junyu, Jiang Zixu, Chen Taolue, Li Zenan, Yao Yuan, Ma Xiaoxing, Yang Lijuan, Chen Hao, Li Shupeng, Zhao Penghao
- TADA: Efficient Task-agnostic Domain Adaptation For Transformers Hung Chia-chien, Lange Lukas, Strötgen Jannik
- Pre-gated Moe: An Algorithm-system Co-design For Fast And Scalable Mixture-of-expert Inference Hwang Ranggi, Wei Jianyu, Cao Shijie, Hwang Changho, Tang Xiaohu, Cao Ting, Yang Mao
- Breaking The Token Barrier: Chunking And Convolution For Efficient Long Text Classification With BERT Jaiswal Aman, Milios Evangelos
- Bootstrapping Vision-language Learning With Decoupled Language Pre-training Jian Yiren, Gao Chongyang, Vosoughi Soroush
- Bus: Efficient And Effective Vision-language Pre-training With Bottom-up Patch Summarization Jiang Chaoya, Xu Haiyang, Ye Wei, Ye Qinghao, Li Chenliang, Yan Ming, Bi Bin, Zhang Shikun, Huang Fei, Huang Songfang
- Emergent Representations Of Program Semantics In Language Models Trained On Programs Jin Charles, Rinard Martin
- Medcpt: Contrastive Pre-trained Transformers With Large-scale Pubmed Search Logs For Zero-shot Biomedical Information Retrieval Jin Qiao, Kim Won, Chen Qingyu, Comeau Donald C., Yeganova Lana, Wilbur W. John, Lu Zhiyong
- GPT Is Becoming A Turing Machine: Here Are Some Ways To Program It Jojic Ana, Wang Zhen, Jojic Nebojsa
- BLIP-2: Bootstrapping Language-image Pre-training With Frozen Image Encoders And Large Language Models Li Junnan, Li Dongxu, Savarese Silvio, Hoi Steven
- Calibrated Language Models Must Hallucinate Kalai Adam Tauman, Vempala Santosh S.
- Swectrl-mini: A Data-transparent Transformer-based Large Language Model For Controllable Text Generation In Swedish Kalpakchi Dmytro, Boye Johan
- A Survey Of GPT-3 Family Large Language Models Including Chatgpt And GPT-4 Kalyan Katikapalli Subramanyam
- Syntax-guided Transformers: Elevating Compositional Generalization And Grounding In Multimodal Environments Kamali Danial, Kordjamshidi Parisa
- Dialmat: Dialogue-enabled Transformer With Moment-based Adversarial Training Kaneda Kanta, Korekata Ryosuke, Wada Yuiga, Nagashima Shunya, Kambara Motonari, Iioka Yui, Matsuo Haruka, Imai Yuto, Nishimura Takayuki, Sugiura Komei
- Reducing Sequence Length By Predicting Edit Operations With Large Language Models Kaneko Masahiro, Okazaki Naoaki
- Llm-augmented Preference Learning From Natural Language Kang Inwon, Ruan Sikai, Ho Tyler, Lin Jui-chien, Mohsin Farhad, Seneviratne Oshani, Xia Lirong
- Sorted Llama: Unlocking The Potential Of Intermediate Layers Of Large Language Models For Dynamic Inference Kavehzadeh Parsa, Valipour Mojtaba, Tahaei Marzieh, Ghodsi Ali, Chen Boxing, Rezagholizadeh Mehdi
- Application Of Frozen Large-scale Models To Multimodal Task-oriented Dialogue Kawamoto Tatsuki, Suzuki Takuma, Miyama Ko, Meguro Takumi, Takagi Tomohiro
- The Impact Of Positional Encoding On Length Generalization In Transformers Kazemnejad Amirhossein, Padhi Inkit, Ramamurthy Karthikeyan Natesan, Das Payel, Reddy Siva
- Q: How To Specialize Large Vision-language Models To Data-scarce VQA Tasks? A: Self-train On Unlabeled Images! Khan Zaid, Bg Vijay Kumar, Schulter Samuel, Yu Xiang, Fu Yun, Chandraker Manmohan
- Tst\(^\mathrm{r}\): Target Similarity Tuning Meets The Real World Khatry Anirudh, Gulwani Sumit, Gupta Priyanshu, Le Vu, Singha Ananya, Singh Mukul, Verbruggen Gust
- Compressed Context Memory For Online Language Model Interaction Kim Jang-hyun, Yeom Junyoung, Yun Sangdoo, Song Hyun Oh
- Speculative Decoding With Big Little Decoder Kim Sehoon, Mangalam Karttikeya, Moon Suhong, Malik Jitendra, Mahoney Michael W., Gholami Amir, Keutzer Kurt
- MAGVLT: Masked Generative Vision-and-language Transformer Kim Sungwoong, Jo Daejin, Lee Donghoon, Kim Jongmin
- Revisiting Intermediate Layer Distillation For Compressing Language Models: An Overfitting Perspective Ko Jongwoo, Park Seungjoon, Jeong Minchan, Hong Sukjin, Ahn Euijai, Chang Du-seong, Yun Se-young
- Transformer Language Models Handle Word Frequency In Prediction Head Kobayashi Goro, Kuribayashi Tatsuki, Yokoi Sho, Inui Kentaro
- Chatgpt: Jack Of All Trades, Master Of None Kocoń Jan, Cichecki Igor, Kaszyca Oliwier, Kochanek Mateusz, Szydło Dominika, Baran Joanna, Bielaniewicz Julita, Gruza Marcin, Janz Arkadiusz, Kanclerz Kamil, Kocoń Anna, Koptyra Bartłomiej, Mieleszczenko-kowszewicz Wiktoria, Miłkowski Piotr, Oleksy Marcin, Piasecki Maciej, Radliński Łukasz, Wojtasik Konrad, Woźniak Stanisław, Kazienko Przemysław
- Large Language Models On The Chessboard: A Study On Chatgpt's Formal Language Comprehension And Complex Reasoning Skills Kuo Mu-tien, Hsueh Chih-chung, Tsai Richard Tzong-han
- Jotr: A Joint Transformer And Reinforcement Learning Framework For Dialog Policy Learning Kwan Wai-chung, Wang Huimin, Wang Hongru, Wang Zezhong, Wu Xian, Zheng Yefeng, Wong Kam-fai
- Enhancing Transformers Without Self-supervised Learning: A Loss Landscape Perspective In Sequential Recommendation Lai Vivian, Chen Huiyuan, Yeh Chin-chia Michael, Xu Minghua, Cai Yiwei, Yang Hao
- Exploring Large Language Models To Facilitate Variable Autonomy For Human-robot Teaming Lakhnati Younes, Pascher Max, Gerken Jens
- A Data Source For Reasoning Embodied Agents Lanchantin Jack, Sukhbaatar Sainbayar, Synnaeve Gabriel, Sun Yuxuan, Srinet Kavya, Szlam Arthur
- Cabrita: Closing The Gap For Foreign Languages Larcher Celio, Piau Marcos, Finardi Paulo, Gengo Pedro, Esposito Piero, Caridá Vinicius
- An Experimental Study On Pretraining Transformers From Scratch For IR Lassance Carlos, Déjean Hervé, Clinchant Stéphane
- Read-only Prompt Optimization For Vision-language Few-shot Learning Lee Dongjun, Song Seokwon, Suh Jihee, Choi Joonmyung, Lee Sanghyeok, Kim Hyunwoo J.
- Is Attention Required For ICL? Exploring The Relationship Between Model Architecture And In-context Learning Ability Lee Ivan, Jiang Nan, Berg-kirkpatrick Taylor
- Partially Randomizing Transformer Weights For Dialogue Response Diversity Lee Jing Yang, Lee Kong Aik, Gan Woon-seng
- Teaching Arithmetic To Small Transformers Lee Nayoung, Sreenivasan Kartik, Lee Jason D., Lee Kangwook, Papailiopoulos Dimitris
- Sequential Monte Carlo Steering Of Large Language Models Using Probabilistic Programs Lew Alexander K., Zhi-xuan Tan, Grand Gabriel, Mansinghka Vikash K.
- DISTFLASHATTN: Distributed Memory-efficient Attention For Long-context Llms Training Li Dacheng, Shao Rulin, Xie Anze, Xing Eric P., Ma Xuezhe, Stoica Ion, Gonzalez Joseph E., Zhang Hao
- Text Is All You Need: Learning Language Representations For Sequential Recommendation Li Jiacheng, Wang Ming, Li Jin, Fu Jinmiao, Shen Xin, Shang Jingbo, Mcauley Julian
- CFGPT: Chinese Financial Assistant With Large Language Model Li Jiangtong, Bian Yuxuan, Wang Guoxuan, Lei Yang, Cheng Dawei, Ding Zhijun, Jiang Changjun
- Graphix-t5: Mixing Pre-trained Transformers With Graph-aware Layers For Text-to-sql Parsing Li Jinyang, Hui Binyuan, Cheng Reynold, Qin Bowen, Ma Chenhao, Huo Nan, Huang Fei, Du Wenyu, Si Luo, Li Yongbin
- MAQA: A Multimodal QA Benchmark For Negation Li Judith Yue, Jansen Aren, Huang Qingqing, Lee Joonseok, Ganti Ravi, Kuzmin Dima
- Openba: An Open-sourced 15B Bilingual Asymmetric Seq2seq Model Pre-trained From Scratch Li Juntao, Tang Zecheng, Ding Yuyang, Wang Pinzheng, Guo Pei, You Wangjie, Qiao Dan, Chen Wenliang, Fu Guohong, Zhu Qiaoming, Zhou Guodong, Zhang Min
- PBNR: Prompt-based News Recommender System Li Xinyi, Zhang Yongfeng, Malthouse Edward C.
- Dissecting Chain-of-thought: Compositionality Through In-context Filtering And Learning Li Yingcong, Sreenivasan Kartik, Giannou Angeliki, Papailiopoulos Dimitris, Oymak Samet
- Losparse: Structured Compression Of Large Language Models Based On Low-rank And Sparse Approximation Li Yixiao, Yu Yifan, Zhang Qingru, Liang Chen, He Pengcheng, Chen Weizhu, Zhao Tuo
- Textbooks Are All You Need II: Phi-1.5 Technical Report Li Yuanzhi, Bubeck Sébastien, Eldan Ronen, Del Giorno Allie, Gunasekar Suriya, Lee Yin Tat
- White-box Multi-objective Adversarial Attack On Dialogue Generation Li Yufei, Li Zexin, Gao Yingfan, Liu Cong
- E-sparse: Boosting The Large Language Model Inference Through Entropy-based N:M Sparsity Li Yun, Niu Lin, Zhang Xipeng, Liu Kai, Zhu Jianchen, Kang Zhanhui
- Batgpt: A Bidirectional Autoregessive Talker From Generative Pre-trained Transformer Li Zuchao, Zhang Shitou, Zhao Hai, Yang Yifei, Yang Dongjie
- Homodistil: Homotopic Task-agnostic Distillation Of Pre-trained Transformers Liang Chen, Jiang Haoming, Li Zheng, Tang Xianfeng, Yin Bin, Zhao Tuo
- GPT-4 Enhanced Multimodal Grounding For Autonomous Driving: Leveraging Cross-modal Attention With Large Language Models Liao Haicheng, Shen Huanming, Li Zhenning, Wang Chengyue, Li Guofa, Bie Yiming, Xu Chengzhong
- Paralinguistics-enhanced Large Language Modeling Of Spoken Dialogue Lin Guan-ting, Shivakumar Prashanth Gurunath, Gandhe Ankur, Yang Chao-han Huck, Gu Yile, Ghosh Shalini, Stolcke Andreas, Lee Hung-yi, Bulyko Ivan
- Advances In Embodied Navigation Using Large Language Models: A Survey Lin Jinzhou, Gao Han, Feng Xuxiang, Xu Rongtao, Wang Changwei, Zhang Man, Guo Li, Xu Shibiao
- Mitigating The Alignment Tax Of RLHF Lin Yong, Lin Hangyu, Xiong Wei, Diao Shizhe, Liu Jianmeng, Zhang Jipeng, Pan Rui, Wang Haoxiang, Hu Wenbin, Zhang Hanning, Dong Hanze, Pi Renjie, Zhao Han, Jiang Nan, Ji Heng, Yao Yuan, Zhang Tong
- Comparing Generative Chatbots Based On Process Requirements Lins Luis Fernando, Nascimento Nathalia, Alencar Paulo, Oliveira Toacy, Cowan Donald
- Exposing Attention Glitches With Flip-flop Language Modeling Liu Bingbin, Ash Jordan T., Goel Surbhi, Krishnamurthy Akshay, Zhang Cyril
- Code Execution With Pre-trained Language Models Liu Chenxiao, Lu Shuai, Chen Weizhu, Jiang Daxin, Svyatkovskiy Alexey, Fu Shengyu, Sundaresan Neel, Duan Nan
- Logicot: Logical Chain-of-thought Instruction-tuning Liu Hanmeng, Teng Zhiyang, Cui Leyang, Zhang Chaoli, Zhou Qiji, Zhang Yue
- Exploration With Principles For Diverse AI Supervision Liu Hao, Zaharia Matei, Abbeel Pieter
- Ring Attention With Blockwise Transformers For Near-infinite Context Liu Hao, Zaharia Matei, Abbeel Pieter
- Qilin-med-vl: Towards Chinese Large Vision-language Model For General Healthcare Liu Junling, Wang Ziming, Ye Qichen, Chong Dading, Zhou Peilin, Hua Yining
- Contrastive Vision-language Alignment Makes Efficient Instruction Learner Liu Lizhao, Sun Xinyu, Xiang Tianhang, Zhuang Zhuangwei, Yin Liuren, Tan Mingkui
- Nlebench+norglm: A Comprehensive Empirical Analysis And Benchmark Dataset For Generative Language Models In Norwegian Liu Peng, Zhang Lemei, Farup Terje Nissen, Lauvrak Even W., Ingvaldsen Jon Espen, Eide Simen, Gulla Jon Atle, Yang Zhirong
- Class Incremental Learning With Pre-trained Vision-language Models Liu Xialei, Cao Xusheng, Lu Haori, Xiao Jia-wen, Bagdanov Andrew D., Cheng Ming-ming
- Fingpt: Democratizing Internet-scale Data For Financial Large Language Models Liu Xiao-yang, Wang Guoxuan, Yang Hongyang, Zha Daochen
- Query-utterance Attention With Joint Modeling For Query-focused Meeting Summarization Liu Xingxian, Duan Bin, Xiao Bo, Xu Yajing
- Instruction Position Matters In Sequence Generation With Large Language Models Liu Yijin, Zeng Xianfeng, Meng Fandong, Zhou Jie
- Binary And Ternary Natural Language Generation Liu Zechun, Oguz Barlas, Pappu Aasish, Shi Yangyang, Krishnamoorthi Raghuraman
- Molxpt: Wrapping Molecules With Text For Generative Pre-training Liu Zequn, Zhang Wei, Xia Yingce, Wu Lijun, Xie Shufang, Qin Tao, Zhang Ming, Liu Tie-yan
- Picking The Underused Heads: A Network Pruning Perspective Of Attention Head Selection For Fusing Dialogue Coreference Information Liu Zhengyuan, Chen Nancy F.
- Deja Vu: Contextual Sparsity For Efficient Llms At Inference Time Liu Zichang, Wang Jue, Dao Tri, Zhou Tianyi, Yuan Binhang, Song Zhao, Shrivastava Anshumali, Zhang Ce, Tian Yuandong, Ré Christopher, Chen Beidi
- Unified-io 2: Scaling Autoregressive Multimodal Models With Vision, Language, Audio, And Action Lu Jiasen, Clark Christopher, Lee Sangho, Zhang Zichen, Khosla Savya, Marten Ryan, Hoiem Derek, Kembhavi Aniruddha
- Ziya-visual: Bilingual Large Vision-language Model Via Multi-task Instruction Tuning Lu Junyu, Zhang Dixiang, Wu Xiaojun, Gao Xinyu, Gan Ruyi, Zhang Jiaxing, Song Yan, Zhang Pingjian
- Lyrics: Boosting Fine-grained Language-vision Alignment And Comprehension Via Semantic-aware Visual Objects Lu Junyu, Zhang Dixiang, Zhang Songxin, Xie Zejian, Song Zhuoyang, Lin Cong, Zhang Jiaxing, Jing Bingyi, Zhang Pingjian
- Effectively Fine-tune To Improve Large Multimodal Models For Radiology Report Generation Lu Yuzhe, Hong Sungmin, Shah Yash, Xu Panpan
- Chatgpt And A New Academic Reality: Artificial Intelligence-written Research Papers And The Ethics Of The Large Language Models In Scholarly Publishing Lund Brady, Wang Ting, Mannuru Nishith Reddy, Nie Bing, Shimray Somipam, Wang Ziang
- Prosg: Using Prompt Synthetic Gradients To Alleviate Prompt Forgetting Of Rnn-like Language Models Luo Haotian, Wu Kunming, Dai Cheng, Ding Sixian, Chen Xinhao
- Biomedgpt: Open Multimodal Generative Pre-trained Transformer For Biomedicine Luo Yizhen, Zhang Jiahuan, Fan Siqi, Yang Kai, Wu Yushuai, Qiao Mu, Nie Zaiqing
- Are We Falling In A Middle-intelligence Trap? An Analysis And Mitigation Of The Reversal Curse Lv Ang, Zhang Kaiyi, Xie Shufang, Tu Quan, Chen Yuhan, Wen Ji-rong, Yan Rui
- KOSMOS-2.5: A Multimodal Literate Model Lv Tengchao, Huang Yupan, Chen Jingye, Zhao Yuzhong, Jia Yilin, Cui Lei, Ma Shuming, Chang Yaoyao, Huang Shaohan, Wang Wenhui, Dong Li, Luo Weiyao, Wu Shaoxiang, Wang Guoxin, Zhang Cha, Wei Furu
- Comparative Analysis Of Drug-gpt And Chatgpt Llms For Healthcare Insights: Evaluating Accuracy And Relevance In Patient And HCP Contexts Lysandrou Giorgos, Owen Roma English, Mursec Kirsty, Brun Grant Le, Fairley Elizabeth A. L.
- Vista-llama: Reliable Video Narrator Via Equal Distance To Visual Tokens Ma Fan, Jin Xiaojie, Wang Heng, Xian Yuchen, Feng Jiashi, Yang Yi
- Chain-of-skills: A Configurable Model For Open-domain Question Answering Ma Kaixin, Cheng Hao, Zhang Yu, Liu Xiaodong, Nyberg Eric, Gao Jianfeng
- VOLTA: Improving Generative Diversity By Variational Mutual Information Maximizing Autoencoder Ma Yueen, Chi Dafeng, Li Jingjing, Song Kai, Zhuang Yuzheng, King Irwin
- Non-autoregressive Streaming Transformer For Simultaneous Translation Ma Zhengrui, Zhang Shaolei, Guo Shoutao, Shao Chenze, Zhang Min, Feng Yang
- Harnessing The Power Of Prompt-based Techniques For Generating School-level Questions Using Large Language Models Maity Subhankar, Deroy Aniket, Sarkar Sudeshna
- Unitrec: A Unified Text-to-text Transformer And Joint Contrastive Learning Framework For Text-based Recommendation Mao Zhiming, Wang Huimin, Du Yiming, Wong Kam-fai
- Chatgpt Vs State-of-the-art Models: A Benchmarking Study In Keyphrase Generation Task Martínez-cruz Roberto, López-lópez Alvaro J., Portela José
- Investigating The Efficacy Of Large Language Models In Reflective Assessment Methods Through Chain Of Thoughts Prompting Masikisiki Baphumelele, Marivate Vukosi, Hlope Yvette
- Is Feedback All You Need? Leveraging Natural Language Feedback In Goal-conditioned Reinforcement Learning Mccallum Sabrina, Taylor-davies Max, Albrecht Stefano V., Suglia Alessandro
- Lightlm: A Lightweight Deep And Narrow Language Model For Generative Recommendation Mei Kai, Zhang Yongfeng
- Think Before You Act: Unified Policy For Interleaving Language Reasoning With Actions Mezghani Lina, Bojanowski Piotr, Alahari Karteek, Sukhbaatar Sainbayar
- Uzh_clyp At Semeval-2023 Task 9: Head-first Fine-tuning And Chatgpt Data Generation For Cross-lingual Learning In Tweet Intimacy Prediction Michail Andrianos, Konstantinou Stefanos, Clematide Simon
- LAIT: Efficient Multi-segment Encoding In Transformers With Layer-adjustable Interaction Milbauer Jeremiah, Louis Annie, Hosseini Mohammad Javad, Fabrikant Alex, Metzler Donald, Schuster Tal
- Attention Link: An Efficient Attention-based Low Resource Machine Translation Architecture Min Zeping
- VQA With Cascade Of Self- And Co-attention Blocks Mishra Aakansha, Anand Ashish, Guha Prithwijit
- Landmark Attention: Random-access Infinite Context Length For Transformers Mohtashami Amirkeivan, Jaggi Martin
- Increasing The Performance Of Cognitively Inspired Data-efficient Language Models Via Implicit Structure Building Momen Omar, Arps David, Kallmeyer Laura
- Learning To Compress Prompts With Gist Tokens Mu Jesse, Li Xiang Lisa, Goodman Noah
- Navigating Prompt Complexity For Zero-shot Classification: A Study Of Large Language Models In Computational Social Science Mu Yida, Wu Ben P., Thorne William, Robinson Ambrose, Aletras Nikolaos, Scarton Carolina, Bontcheva Kalina, Song Xingyi
- How To Plant Trees In Language Models: Data And Architectural Effects On The Emergence Of Syntactic Inductive Biases Mueller Aaron, Linzen Tal
- A Brief History Of Prompt: Leveraging Language Models. (through Advanced Prompting) Muktadir Golam Md
- Teaching Probabilistic Logical Reasoning To Transformers Nafar Aliakbar, Venable Kristen Brent, Kordjamshidi Parisa
- Cheaply Evaluating Inference Efficiency Metrics For Autoregressive Transformer Apis Narayanan Deepak, Santhanam Keshav, Henderson Peter, Bommasani Rishi, Lee Tony, Liang Percy
- Reinforcement Replaces Supervision: Query Focused Summarization Using Deep Reinforcement Learning Nath Swaroop, Khadilkar Harshad, Bhattacharyya Pushpak
- Vision Encoder-decoder Models For AI Coaching Nayak Jyothi S, Khan Afifah Khan Mohammed Ajmal, Manjeshwar Chirag, Banday Imadh Ajaz
- On Robustness Of Finetuned Transformer-based NLP Models Neerudu Pavan Kalyan Reddy, Oota Subba Reddy, Marreddy Mounika, Kagita Venkateswara Rao, Gupta Manish
- Trusting Language Models In Education Neto Jogi Suda, Deng Li, Raya Thejaswi, Shahbazi Reza, Liu Nick, Venkatesh Adhitya, Shah Miral, Khosla Neeru, Guido Rodrigo Capobianco
- Black-box Analysis: Gpts Across Time In Legal Textual Entailment Task Nguyen Ha-thanh, Goebel Randy, Toni Francesca, Stathis Kostas, Satoh Ken
- EVJVQA Challenge: Multilingual Visual Question Answering Nguyen Ngan Luu-thuy, Nguyen Nghia Hieu, Vo Duong T. D., Tran Khanh Quoc, Van Nguyen Kiet
- LEXTREME: A Multi-lingual And Multi-task Benchmark For The Legal Domain Niklaus Joel, Matoshi Veton, Rani Pooja, Galassi Andrea, Stürmer Matthias, Chalkidis Ilias
- Numeracy From Literacy: Data Science As An Emergent Skill From Large Language Models Noever David, Mckee Forrest
- Automated Assessment Of Students' Code Comprehension Using Llms Oli Priti, Banjade Rabin, Chapagain Jeevan, Rus Vasile
- GPT-4 Technical Report Openai, Josh Achiam, Steven Adler, Sandhini Agarwal, Lama Ahmad, Ilge Akkaya, Florencia Leoni Aleman, Diogo Almeida, Janko Altenschmidt, Sam Altman, Shyamal Anadkat, Red Avila, Igor Babuschkin, Suchir Balaji, Valerie Balcom, Paul Baltescu, Haiming Bao, Mohammad Bavarian, Jeff Belgum, Irwan Bello, Jake Berdine, Gabriel Bernadett-shapiro, Christopher Berner, Lenny Bogdonoff, Oleg Boiko, Madelaine Boyd, Anna-luisa Brakman, Greg Brockman, Tim Brooks, Miles Brundage, Kevin Button, Trevor Cai, Rosie Campbell, Andrew Cann, Brittany Carey, Chelsea Carlson, Rory Carmichael, Brooke Chan, Che Chang, Fotis Chantzis, Derek Chen, Sully Chen, Ruby Chen, Jason Chen, Mark Chen, Ben Chess, Chester Cho, Casey Chu, Hyung Won Chung, Dave Cummings, Jeremiah Currier, Yunxing Dai, Cory Decareaux, Thomas Degry, Noah Deutsch, Damien Deville, Arka Dhar, David Dohan, Steve Dowling, Sheila Dunning, Adrien Ecoffet, Atty Eleti, Tyna Eloundou, David Farhi, Liam Fedus, Niko Felix, Simón Posada Fishman, Juston Forte, Isabella Fulford, Leo Gao, Elie Georges, Christian Gibson, Vik Goel, Tarun Gogineni, Gabriel Goh, Rapha Gontijo-lopes, Jonathan Gordon, Morgan Grafstein, Scott Gray, Ryan Greene, Joshua Gross, Shixiang Shane Gu, Yufei Guo, Chris Hallacy, Jesse Han, Jeff Harris, Yuchen He, Mike Heaton, Johannes Heidecke, Chris Hesse, Alan Hickey, Wade Hickey, Peter Hoeschele, Brandon Houghton, Kenny Hsu, Shengli Hu, Xin Hu, Joost Huizinga, Shantanu Jain, Shawn Jain, Joanne Jang, Angela Jiang, Roger Jiang, Haozhun Jin, Denny Jin, Shino Jomoto, Billie Jonn, Heewoo Jun, Tomer Kaftan, Łukasz Kaiser, Ali Kamali, Ingmar Kanitscheider, Nitish Shirish Keskar, Tabarak Khan, Logan Kilpatrick, Jong Wook Kim, Christina Kim, Yongjik Kim, Jan Hendrik Kirchner, Jamie Kiros, Matt Knight, Daniel Kokotajlo, Łukasz Kondraciuk, Andrew Kondrich, Aris Konstantinidis, Kyle Kosic, Gretchen Krueger, Vishal Kuo, Michael Lampe, Ikai Lan, Teddy Lee, Jan Leike, Jade Leung, Daniel Levy, Chak Ming Li, Rachel Lim, Molly Lin, Stephanie Lin, Mateusz Litwin, Theresa Lopez, Ryan Lowe, Patricia Lue, Anna Makanju, Kim Malfacini, Sam Manning, Todor Markov, Yaniv Markovski, Bianca Martin, Katie Mayer, Andrew Mayne, Bob Mcgrew, Scott Mayer Mckinney, Christine Mcleavey, Paul Mcmillan, Jake Mcneil, David Medina, Aalok Mehta, Jacob Menick, Luke Metz, Andrey Mishchenko, Pamela Mishkin, Vinnie Monaco, Evan Morikawa, Daniel Mossing, Tong Mu, Mira Murati, Oleg Murk, David Mély, Ashvin Nair, Reiichiro Nakano, Rajeev Nayak, Arvind Neelakantan, Richard Ngo, Hyeonwoo Noh, Long Ouyang, Cullen O'keefe, Jakub Pachocki, Alex Paino, Joe Palermo, Ashley Pantuliano, Giambattista Parascandolo, Joel Parish, Emy Parparita, Alex Passos, Mikhail Pavlov, Andrew Peng, Adam Perelman, Filipe De Avila Belbute Peres, Michael Petrov, Henrique Ponde De Oliveira Pinto, Michael Pokorny, Michelle Pokrass, Vitchyr H. Pong, Tolly Powell, Alethea Power, Boris Power, Elizabeth Proehl, Raul Puri, Alec Radford, Jack Rae, Aditya Ramesh, Cameron Raymond, Francis Real, Kendra Rimbach, Carl Ross, Bob Rotsted, Henri Roussez, Nick Ryder, Mario Saltarelli, Ted Sanders, Shibani Santurkar, Girish Sastry, Heather Schmidt, David Schnurr, John Schulman, Daniel Selsam, Kyla Sheppard, Toki Sherbakov, Jessica Shieh, Sarah Shoker, Pranav Shyam, Szymon Sidor, Eric Sigler, Maddie Simens, Jordan Sitkin, Katarina Slama, Ian Sohl, Benjamin Sokolowsky, Yang Song, Natalie Staudacher, Felipe Petroski Such, Natalie Summers, Ilya Sutskever, Jie Tang, Nikolas Tezak, Madeleine B. Thompson, Phil Tillet, Amin Tootoonchian, Elizabeth Tseng, Preston Tuggle, Nick Turley, Jerry Tworek, Juan Felipe Cerón Uribe, Andrea Vallone, Arun Vijayvergiya, Chelsea Voss, Carroll Wainwright, Justin Jay Wang, Alvin Wang, Ben Wang, Jonathan Ward, Jason Wei, Cj Weinmann, Akila Welihinda, Peter Welinder, Jiayi Weng, Lilian Weng, Matt Wiethoff, Dave Willner, Clemens Winter, Samuel Wolrich, Hannah Wong, Lauren Workman, Sherwin Wu, Jeff Wu, Michael Wu, Kai Xiao, Tao Xu, Sarah Yoo, Kevin Yu, Qiming Yuan, Wojciech Zaremba, Rowan Zellers, Chong Zhang, Marvin Zhang, Shengjia Zhao, Tianhao Zheng, Juntang Zhuang, William Zhuk, Barret Zoph
- On The Role Of Attention In Prompt-tuning Oymak Samet, Rawat Ankit Singh, Soltanolkotabi Mahdi, Thrampoulidis Christos
- Several Categories Of Large Language Models (llms): A Short Survey Pahune Saurabh, Chandrasekharan Manoj
- Giraffe: Adventures In Expanding Context Lengths In Llms Pal Arka, Karkhanis Deep, Roberts Manley, Dooley Samuel, Sundararajan Arvind, Naidu Siddartha
- Evaluating And Explaining Large Language Models For Code Using Syntactic Structures Palacio David N, Velasco Alejandro, Rodriguez-cardenas Daniel, Moran Kevin, Poshyvanyk Denys
- Finding And Editing Multi-modal Neurons In Pre-trained Transformers Pan Haowen, Cao Yixin, Wang Xiaozhi, Yang Xun, Wang Meng
- Tppoet: Transformer-based Persian Poem Generation Using Minimal Data And Advanced Decoding Techniques Panahandeh Amir, Asemi Hanie, Nourani Esmaeil
- Peft-medaware: Large Language Model For Medical Awareness Pandya Keivalya
- Frozen Transformers In Language Models Are Effective Visual Encoder Layers Pang Ziqi, Xie Ziyang, Man Yunze, Wang Yu-xiong
- Trainable Transformer In Transformer Panigrahi Abhishek, Malladi Sadhika, Xia Mengzhou, Arora Sanjeev
- Injecting Structural Hints: Using Language Models To Study Inductive Biases In Language Learning Papadimitriou Isabel, Jurafsky Dan
- Interpretable Visual Question Answering Via Reasoning Supervision Parelli Maria, Mallis Dimitrios, Diomataris Markos, Pitsikalis Vassilis
- Longbox: Evaluating Transformers On Long-sequence Clinical Tasks Parmar Mihir, Naik Aakanksha, Gupta Himanshu, Agrawal Disha, Baral Chitta
- GPT Self-supervision For A Better Data Annotator Pei Xiaohuan, Li Yanxi, Xu Chang
- Yarn: Efficient Context Window Extension Of Large Language Models Peng Bowen, Quesnelle Jeffrey, Fan Honglu, Shippole Enrico
- Generative Large Language Models Are All-purpose Text Analytics Engines: Text-to-text Learning Is All Your Need Peng Cheng, Yang Xi, Chen Aokun, Yu Zehao, Smith Kaleb E, Costa Anthony B, Flores Mona G, Bian Jiang, Wu Yonghui
- FP8-LM: Training FP8 Large Language Models Peng Houwen, Wu Kan, Wei Yixuan, Zhao Guoshuai, Yang Yuxiang, Liu Ze, Xiong Yifan, Yang Ziyue, Ni Bolin, Hu Jingcheng, Li Ruihang, Zhang Miaosen, Li Chen, Ning Jia, Wang Ruizhe, Zhang Zheng, Liu Shuguang, Chau Joe, Hu Han, Cheng Peng
- Recjpq: Training Large-catalogue Sequential Recommenders Petrov Aleksandr V., Macdonald Craig
- The Impact Of Depth On Compositional Generalization In Transformer Language Models Petty Jackson, Van Steenkiste Sjoerd, Dasgupta Ishita, Sha Fei, Garrette Dan, Linzen Tal
- Let Models Speak Ciphers: Multiagent Debate Through Embeddings Pham Chau, Liu Boyi, Yang Yingxiang, Chen Zhengyu, Liu Tianyi, Yuan Jianbo, Plummer Bryan A., Wang Zhaoran, Yang Hongxia
- Zero-shot Question Answering Over Financial Documents Using Large Language Models Phogat Karmvir Singh, Harsha Chetan, Dasaratha Sridhar, Ramakrishna Shashishekar, Puranam Sai Akhil
- Hyena Hierarchy: Towards Larger Convolutional Language Models Poli Michael, Massaroli Stefano, Nguyen Eric, Fu Daniel Y., Dao Tri, Baccus Stephen, Bengio Yoshua, Ermon Stefano, Ré Christopher
- Mosaicbert: A Bidirectional Encoder Optimized For Fast Pretraining Portes Jacob, Trott Alex, Havens Sam, King Daniel, Venigalla Abhinav, Nadeem Moin, Sardana Nikhil, Khudia Daya, Frankle Jonathan
- Transformers In The Service Of Description Logic-based Contexts Poulis Angelos, Tsalapati Eleni, Koubarakis Manolis
- OPERA: Alleviating Hallucination In Multi-modal Large Language Models Via Over-trust Penalty And Retrospection-allocation Huang Qidong, Dong Xiaoyi, Zhang Pan, Wang Bin, He Conghui, Wang Jiaqi, Lin Dahua, Zhang Weiming, Yu Nenghai
- Transnormerllm: A Faster And Better Large Language Model With Improved Transnormer Qin Zhen, Li Dong, Sun Weigao, Sun Weixuan, Shen Xuyang, Han Xiaodong, Wei Yunshen, Lv Baohong, Luo Xiao, Qiao Yu, Zhong Yiran
- Unlocking Emergent Modularity In Large Language Models Qiu Zihan, Huang Zeyu, Fu Jie
- Exploring The Effectiveness Of GPT Models In Test-taking: A Case Study Of The Driver's License Knowledge Test Rahimi Saba, Balch Tucker, Veloso Manuela
- Quantized Transformer Language Model Implementations On Edge Devices Rahman Mohammad Wali Ur, Abrar Murad Mehrab, Copening Hunter Gibbons, Hariri Salim, Shao Sicong, Satam Pratik, Salehi Soheil
- Improving Generalization In Task-oriented Dialogues With Workflows And Action Plans Raimondo Stefania, Pal Christopher, Liu Xiaotian, Vazquez David, Palacios Hector
- Bridging The Gap: Exploring The Capabilities Of Bridge-architectures For Complex Visual Reasoning Tasks Rajesh Kousik, Raman Mrigank, Karim Mohammed Asad, Chawla Pranit
- Multi-task End-to-end Training Improves Conversational Recommendation Ram Naveen, Kuzmin Dima, Chio Ellie Ka In, Alzantot Moustafa Farid, Ontanon Santiago, Jash Ambarish, Li Judith Yue
- Mabvit -- Modified Attention Block Enhances Vision Transformers Ramesh Mahesh, Ramkumar Aswinkumar
- Towards A Performance Analysis On Pre-trained Visual Question Answering Models For Autonomous Driving Rekanar Kaavya, Eising Ciarán, Sistu Ganesh, Hayes Martin
- Context Compression For Auto-regressive Transformers With Sentinel Tokens Ren Siyu, Jia Qi, Zhu Kenny Q.
- Commonsense Reasoning For Conversational AI: A Survey Of The State Of The Art Richardson Christopher, Heck Larry
- Knowledge-infused Self Attention Transformers Roy Kaushik, Zi Yuxin, Narayanan Vignesh, Gaur Manas, Sheth Amit
- Retrieval-pretrained Transformer: Long-range Language Modeling With Self-retrieval Rubin Ohad, Berant Jonathan
- Attention-likelihood Relationship In Transformers Ruscio Valeria, Maiorca Valentino, Silvestri Fabrizio
- Plausibility Processing In Transformer Language Models: Focusing On The Role Of Attention Heads In GPT Ryu Soo Hyun
- Naturalness Of Attention: Revisiting Attention In Code Language Models Saad Mootez, Sharma Tushar
- Uncertainty Guided Global Memory Improves Multi-hop Question Answering Sagirova Alsu, Burtsev Mikhail
- Memory Injections: Correcting Multi-hop Reasoning Failures During Inference In Transformer-based Language Models Sakarvadia Mansi, Ajith Aswathy, Khan Arham, Grzenda Daniel, Hudson Nathaniel, Bauer André, Chard Kyle, Foster Ian
- Attention Lens: A Tool For Mechanistically Interpreting The Attention Head Information Retrieval Mechanism Sakarvadia Mansi, Khan Arham, Ajith Aswathy, Grzenda Daniel, Hudson Nathaniel, Bauer André, Chard Kyle, Foster Ian
- Extending The Frontier Of Chatgpt: Code Generation And Debugging Sakib Fardin Ahsan, Khan Saadat Hasan, Karim A. H. M. Rezaul
- UT5: Pretraining Non Autoregressive T5 With Unrolled Denoising Salem Mahmoud G., Ye Jiayu, Lin Chu-cheng, Liu Frederick
- Zero-shot Translation Of Attention Patterns In VQA Models To Natural Language Salewski Leonard, Koepke A. Sophia, Lensch Hendrik P. A., Akata Zeynep
- Inseq: An Interpretability Toolkit For Sequence Generation Models Sarti Gabriele, Feldhus Nils, Sickert Ludwig, Van Der Wal Oskar, Nissim Malvina, Bisazza Arianna
- Interprompt: Interpretable Prompting For Interrelated Interpersonal Risk Factors In Reddit Posts Sathvik Msvpj, Sarkar Surjodeep, Saxena Chandni, Sohn Sunghwan, Garg Muskan
- GIFT: Generative Interpretable Fine-tuning Savadikar Chinmay, Song Xi, Wu Tianfu
- Unlocking Practical Applications In Legal Domain: Evaluation Of GPT For Zero-shot Semantic Annotation Of Legal Texts Savelka Jaromir
- Large Language Models (GPT) Struggle To Answer Multiple-choice Questions About Code Savelka Jaromir, Agarwal Arav, Bogart Christopher, Sakr Majd
- Can GPT-4 Support Analysis Of Textual Data In Tasks Requiring Highly Specialized Domain Expertise? Savelka Jaromir, Ashley Kevin D., Gray Morgan A, Westermann Hannes, Xu Huihui
- Pretraining On The Test Set Is All You Need Schaeffer Rylan
- Probing Conceptual Understanding Of Large Visual-language Models Schiappa Madeline, Abdullah Raiyaan, Azad Shehreen, Claypoole Jared, Cogswell Michael, Divakaran Ajay, Rawat Yogesh
- Let's Have A Chat! A Conversation With Chatgpt: Technology, Applications, And Limitations Shahriar Sakib, Hayawi Kadhim
- Hero: Roberta And Longformer Hebrew Language Models Shalumov Vitaly, Haskey Harel
- One-shot Sensitivity-aware Mixed Sparsity Pruning For Large Language Models Shao Hang, Liu Bei, Xiao Bo, Zeng Ke, Wan Guanglu, Qian Yanmin
- The Truth Is In There: Improving Reasoning In Language Models With Layer-selective Rank Reduction Sharma Pratyusha, Ash Jordan T., Misra Dipendra
- Efficient LLM Inference On Cpus Shen Haihao, Chang Hanwen, Dong Bo, Luo Yu, Meng Hengyu
- Do Pretrained Transformers Learn In-context By Gradient Descent? Shen Lingfeng, Mishra Aayush, Khashabi Daniel
- Improving The Robustness Of Transformer-based Large Language Models With Dynamic Attention Shen Lujia, Pu Yuwen, Ji Shouling, Li Changjiang, Zhang Xuhong, Ge Chunpeng, Wang Ting
- Towards More Unified In-context Visual Understanding Sheng Dianmo, Chen Dongdong, Tan Zhentao, Liu Qiankun, Chu Qi, Bao Jianmin, Gong Tao, Liu Bin, Xu Shengwei, Yu Nenghai
- Crossget: Cross-guided Ensemble Of Tokens For Accelerating Vision-language Transformers Shi Dachuan, Tao Chaofan, Rao Anyi, Yang Zhendong, Yuan Chun, Wang Jiaqi
- Unleashing The Power Of Pre-trained Language Models For Offline Reinforcement Learning Shi Ruizhe, Liu Yuyao, Ze Yanjie, Du Simon S., Xu Huazhe
- Dept: Decomposed Prompt Tuning For Parameter-efficient Fine-tuning Shi Zhengxiang, Lipani Aldo
- Exploring The Robustness Of Large Language Models For Solving Programming Problems Shirafuji Atsushi, Watanobe Yutaka, Ito Takumi, Morishita Makoto, Nakamura Yuki, Oda Yusuke, Suzuki Jun
- Ep-alm: Efficient Perceptual Augmentation Of Language Models Shukor Mustafa, Dancette Corentin, Cord Matthieu
- EEL: Efficiently Encoding Lattices For Reranking Singhal Prasann, Xu Jiacheng, Ye Xi, Durrett Greg
- A Mathematical Abstraction For Balancing The Trade-off Between Creativity And Reality In Large Language Models Sinha Ritwik, Song Zhao, Zhou Tianyi
- On Early Detection Of Hallucinations In Factual Question Answering Snyder Ben, Moisescu Marius, Zafar Muhammad Bilal
- Decoding Chatgpt: A Taxonomy Of Existing Research, Current Challenges, And Possible Future Directions Sohail Shahab Saquib, Farhat Faiza, Himeur Yassine, Nadeem Mohammad, Madsen Dag Øivind, Singh Yashbir, Atalla Shadi, Mansoor Wathiq
- Zebra: Extending Context Window With Layerwise Grouped Local-global Attention Song Kaiqiang, Wang Xiaoyang, Cho Sangwoo, Pan Xiaoman, Yu Dong
- Learning Ui-to-code Reverse Generator Using Visual Critic Without Rendering Soselia Davit, Saifullah Khalid, Zhou Tianyi
- Dynamic Context Pruning For Efficient And Interpretable Autoregressive Transformers Sotiris Anagnostidis, Dario Pavllo, Luca Biggio, Lorenzo Noci, Aurelien Lucchi, Thomas Hofmann
- Curriculum-guided Abstractive Summarization Sotudeh Sajad, Deilamsalehy Hanieh, Dernoncourt Franck, Goharian Nazli
- Harnessing Llms In Curricular Design: Using GPT-4 To Support Authoring Of Learning Objectives Sridhar Pragnya, Doyle Aidan, Agarwal Arav, Bogart Christopher, Savelka Jaromir, Sakr Majd
- Chatgpt: A Study On Its Utility For Ubiquitous Software Engineering Tasks Sridhara Giriprasad, G. Ranjani H., Mazumdar Sourav
- Response-act Guided Reinforced Dialogue Generation For Mental Health Counseling Srivastava Aseem, Pandey Ishan, Akhtar Md. Shad, Chakraborty Tanmoy
- A Mechanistic Interpretation Of Arithmetic Reasoning In Language Models Using Causal Mediation Analysis Stolfo Alessandro, Belinkov Yonatan, Sachan Mrinmaya
- F3-pruning: A Training-free And Generalized Pruning Strategy Towards Faster And Finer Text-to-video Synthesis Su Sitong, Liu Jianzhi, Gao Lianli, Song Jingkuan
- Can Large Language Models Change User Preference Adversarially? Subhash Varshini
- RCMHA: Relative Convolutional Multi-head Attention For Natural Language Modelling Sugiharto Herman, Aradea, Mubarok Husni
- Building High-accuracy Multilingual ASR With Gated Language Experts And Curriculum Training Sun Eric, Li Jinyu, Hu Yuxuan, Zhu Yimeng, Zhou Long, Xue Jian, Wang Peidong, Liu Linquan, Liu Shujie, Lin Edward, Gong Yifan
- Character-level Chinese Backpack Language Models Sun Hao, Hewitt John
- Retentive Network: A Successor To Transformer For Large Language Models Sun Yutao, Dong Li, Huang Shaohan, Ma Shuming, Xia Yuqing, Xue Jilong, Wang Jianyong, Wei Furu
- Ctbls: Augmenting Large Language Models With Conversational Tables Sundar Anirudh S, Heck Larry
- Improving Non-autoregressive Translation Quality With Pretrained Language Model, Embedding Distillation And Upsampling Strategy For CTC Syu Shen-sian, Xie Juncheng, Lee Hung-yi
- Large Language Model-driven Classroom Flipping: Empowering Student-centric Peer Questioning With Flipped Interaction Tan Chee Wei
- Interpreting Pretrained Language Models Via Concept Bottlenecks Tan Zhen, Cheng Lu, Wang Song, Bo Yuan, Li Jundong, Liu Huan
- Multiple-question Multiple-answer Text-vqa Tang Peng, Appalaraju Srikar, Manmatha R., Xie Yusheng, Mahadevan Vijay
- Learning To Imagine: Visually-augmented Natural Language Generation Tang Tianyi, Chen Yushuo, Du Yifan, Li Junyi, Zhao Wayne Xin, Wen Ji-rong
- Adversarial Conversational Shaping For Intelligent Agents Tarasiewicz Piotr, Kenjeyev Sultan, Sebag Ilana, Alshehabi Shehab
- Soft-prompt Tuning For Large Language Models To Evaluate Bias Tian Jacob-junqi, Emerson David, Miyandoab Sevil Zanjani, Pandya Deval, Seyyed-kalantari Laleh, Khattak Faiza Khan
- Function Vectors In Large Language Models Todd Eric, Li Millicent L., Sharma Arnab Sen, Mueller Aaron, Wallace Byron C., Bau David
- Videberta: A Powerful Pre-trained Language Model For Vietnamese Tran Cong Dao, Pham Nhut Huy, Nguyen Anh, Hy Truong Son, Vu Tu
- Viclevr: A Visual Reasoning Dataset And Hybrid Multimodal Fusion Model For Visual Question Answering In Vietnamese Tran Khiem Vinh, Phan Hao Phu, Van Nguyen Kiet, Nguyen Ngan Luu Thuy
- Bartphobeit: Pre-trained Sequence-to-sequence And Image Transformers Models For Vietnamese Visual Question Answering Tran Khiem Vinh, Van Nguyen Kiet, Nguyen Ngan Luu Thuy
- Automating Code-related Tasks Through Transformers: The Impact Of Pre-training Tufano Rosalia, Pascarella Luca, Bavota Gabriele
- Focused Transformer: Contrastive Training For Context Scaling Tworkowski Szymon, Staniszewski Konrad, Pacek Mikołaj, Wu Yuhuai, Michalewski Henryk, Miłoś Piotr
- Mlongt5: A Multilingual And Efficient Text-to-text Transformer For Longer Sequences Uthus David, Ontañón Santiago, Ainslie Joshua, Guo Mandy
- The LLM Surgeon Van Der Ouderaa Tycho F. A., Nagel Markus, Van Baalen Mart, Asano Yuki M., Blankevoort Tijmen
- Adventures Of Trustworthy Vision-language Models: A Survey Vatsa Mayank, Jain Anubhooti, Singh Richa
- Generative Pre-trained Transformer For Vietnamese Community-based COVID-19 Question Answering Vo Tam Minh, Tran Khiem Vinh
- BERT4CTR: An Efficient Framework To Combine Pre-trained Language Model With Non-textual Features For CTR Prediction Wang Dong, Salamatian Kavé, Xia Yunqing, Deng Weiwei, Zhiang Qi
- Docllm: A Layout-aware Generative Language Model For Multimodal Document Understanding Wang Dongsheng, Raman Natraj, Sibue Mathieu, Ma Zhiqiang, Babkin Petr, Kaur Simerjot, Pei Yulong, Nourbakhsh Armineh, Liu Xiaomo
- Dialog Action-aware Transformer For Dialog Policy Learning Wang Huimin, Kwan Wai-chung, Wong Kam-fai
- Missrec: Pre-training And Transferring Multi-modal Interest-aware Sequence Representation For Recommendation Wang Jinpeng, Zeng Ziyun, Wang Yunxiao, Wang Yuting, Lu Xingyu, Li Tianxiang, Yuan Jun, Zhang Rui, Zheng Hai-tao, Xia Shu-tao
- Compositional Text-to-image Synthesis With Attention Map Control Of Diffusion Models Wang Ruichen, Chen Zekang, Chen Chen, Ma Jian, Lu Haonan, Lin Xiaodong
- READ: Recurrent Adaptation Of Large Transformers Wang Sid, Nguyen John, Li Ke, Wu Carole-jean
- Probing Large Language Models From A Human Behavioral Perspective Wang Xintong, Li Xiaoyu, Li Xingshan, Biemann Chris
- User-aware Prefix-tuning Is A Good Learner For Personalized Image Captioning Wang Xuan, Wang Guanhong, Chai Wenhao, Zhou Jiayu, Wang Gaoang
- Universality And Limitations Of Prompt Tuning Wang Yihan, Chauhan Jatin, Wang Wei, Hsieh Cho-jui
- Vaquita: Enhancing Alignment In Llm-assisted Video Understanding Wang Yizhou, Zhang Ruiyi, Wang Haoliang, Bhattacharya Uttaran, Fu Yun, Wu Gang
- Smarttrim: Adaptive Tokens And Attention Pruning For Efficient Vision-language Models Wang Zekun, Chen Jingchang, Zhou Wangchunshu, Zhu Haichao, Liang Jiafeng, Shan Liping, Liu Ming, Xu Dongliang, Yang Qing, Qin Bing
- Interviewbot: Real-time End-to-end Dialogue System To Interview Students For College Admission Wang Zihao, Keyes Nathan, Crawford Terry, Choi Jinho D.
- A Survey Of Large Language Models Wayne Xin Zhao, Kun Zhou, Junyi Li, Tianyi Tang, Xiaolei Wang, Yupeng Hou, Yingqian Min, Beichen Zhang, Junjie Zhang, Zican Dong, Yifan Du, Chen Yang, Yushuo Chen, Zhipeng Chen, Jinhao Jiang, Ruiyang Ren, Yifan Li, Xinyu Tang, Zikang Liu, Peiyu Liu, Jian-yun Nie, Ji-rong Wen
- Lmsanitator: Defending Prompt-tuning Against Task-agnostic Backdoors Wei Chengkun, Meng Wenlong, Zhang Zhikun, Chen Min, Zhao Minghu, Fang Wenjing, Wang Lei, Zhang Zihui, Chen Wenzhi
- REPLUG: Retrieval-augmented Black-box Language Models Weijia Shi, Sewon Min, Michihiro Yasunaga, Minjoon Seo, Rich James, Mike Lewis, Luke Zettlemoyer, Wen-tau Yih
- Mastering Symbolic Operations: Augmenting Language Models With Compiled Neural Networks Weng Yixuan, Zhu Minjun, Xia Fei, Li Bin, He Shizhu, Liu Kang, Zhao Jun
- Instructblip: Towards General-purpose Vision-language Models With Instruction Tuning Wenliang Dai, Junnan Li, Dongxu Li, Anthony Meng Huat Tiong, Junqi Zhao, Weisheng Wang, Boyang Li, Pascale Fung, Steven Hoi
- An Overview Of Temporal Commonsense Reasoning And Acquisition Wenzel Georg, Jatowt Adam
- System 2 Attention (is Something You Might Need Too) Weston Jason, Sukhbaatar Sainbayar
- Frustratingly Simple Memory Efficiency For Pre-trained Language Models Via Dynamic Embedding Pruning Williams Miles, Aletras Nikolaos
- Natural Language Generation And Understanding Of Big Code For Ai-assisted Programming: A Review Wong Man Fai, Guo Shangxin, Hang Ching Nam, Ho Siu Wai, Tan Chee Wei
- Language Prompt For Autonomous Driving Wu Dongming, Han Wencheng, Wang Tiancai, Liu Yingfei, Zhang Xiangyu, Shen Jianbing
- Efficient LLM Inference Solution On Intel GPU Wu Hui, Gan Yi, Yuan Feng, Ma Jing, Zhu Wei, Xu Yutao, Zhu Hong, Zhu Yuhua, Liu Xiaoli, Gu Jinghui, Zhao Peng
- Document Flattening: Beyond Concatenating Context For Document-level Neural Machine Translation Wu Minghao, Foster George, Qu Lizhen, Haffari Gholamreza
- Approximated Prompt Tuning For Vision-language Pre-trained Models Wu Qiong, Huang Shubin, Zhou Yiyi, Dai Pingyang, Shu Annan, Jiang Guannan, Ji Rongrong
- Unleashing Potential Of Evidence In Knowledge-intensive Dialogue Generation Wu Xianjie, Yang Jian, Li Tongliang, Liang Di, Zhang Shiwei, Du Yiyang, Li Zhoujun
- From Language Modeling To Instruction Following: Understanding The Behavior Shift In Llms After Instruction Tuning Wu Xuansheng, Yao Wenlin, Chen Jianshu, Pan Xiaoman, Wang Xiaoyang, Liu Ninghao, Yu Dong
- TLM: Token-level Masking For Transformers Wu Yangjun, Fang Kebin, Zhang Dongxiang, Wang Han, Zhang Hao, Chen Gang
- Pali-3 Vision Language Models: Smaller, Faster, Stronger Xi Chen, Xiao Wang, Lucas Beyer, Alexander Kolesnikov, Jialin Wu, Paul Voigtlaender, Basil Mustafa, Sebastian Goodman, Ibrahim Alabdulmohsin, Piotr Padlewski, Daniel Salz, Xi Xiong, Daniel Vlasic, Filip Pavetic, Keran Rong, Tianli Yu, Daniel Keysers, Xiaohua Zhai, Radu Soricut
- Improving Question Generation With Multi-level Content Planning Xia Zehua, Gou Qi, Yu Bowen, Yu Haiyang, Huang Fei, Li Yongbin, Nguyen Cam-tu
- Instruction-vit: Multi-modal Prompts For Instruction Learning In Vit Xiao Zhenxiang, Chen Yuzhong, Zhang Lu, Yao Junjie, Wu Zihao, Yu Xiaowei, Pan Yi, Zhao Lin, Ma Chong, Liu Xinyu, Liu Wei, Li Xiang, Yuan Yixuan, Shen Dinggang, Zhu Dajiang, Liu Tianming, Jiang Xi
- A Survey On Multi-behavior Sequential Recommendation Xiaoqing Chen, Zhitao Li, Weike Pan, Zhong Ming
- Pangu-σ: Towards Trillion Parameter Language Model With Sparse Heterogeneous Computing Xiaozhe Ren, Pingyi Zhou, Xinfan Meng, Xinjing Huang, Yadao Wang, Weichao Wang, Pengfei Li, Xiaoda Zhang, Alexander Podolskiy, Grigory Arshinov, Andrey Bout, Irina Piontkovskaya, Jiansheng Wei, Xin Jiang, Teng Su, Qun Liu, Jun Yao
- Analysis Of The Reasoning With Redundant Information Provided Ability Of Large Language Models Xie Wenbei
- Making Small Language Models Better Multi-task Learners With Mixture-of-task-adapters Xie Yukang, Wang Chengyu, Yan Junbing, Zhou Jiyong, Deng Feiqi, Huang Jun
- DPL: Decoupled Prompt Learning For Vision-language Models Xu Chen, Zhu Yuhan, Zhang Guozhen, Shen Haocheng, Liao Yixuan, Chen Xiaoxin, Wu Gangshan, Wang Limin
- Re-reading Improves Reasoning In Large Language Models Xu Xiaohan, Tao Chongyang, Shen Tao, Xu Can, Xu Hongbo, Long Guodong, Lou Jian-guang
- It Ain't That Bad: Understanding The Mysterious Performance Drop In OOD Generalization For Generative Transformer Models Xu Xingcheng, Pan Zihao, Zhang Haipeng, Yang Yanqing
- Pit One Against Many: Leveraging Attention-head Embeddings For Parameter-efficient Multi-head Attention Xue Huiyin, Aletras Nikolaos
- Pretraining Data Mixtures Enable Narrow Model Selection Capabilities In Transformer Models Yadlowsky Steve, Doshi Lyric, Tripuraneni Nilesh
- LACMA: Language-aligning Contrastive Learning With Meta-actions For Embodied Instruction Following Yang Cheng-fu, Chen Yen-chun, Yang Jianwei, Dai Xiyang, Yuan Lu, Wang Yu-chiang Frank, Chang Kai-wei
- Iterative Forward Tuning Boosts In-context Learning In Language Models Yang Jiaxi, Hui Binyuan, Yang Min, Wang Bailin, Li Bowen, Li Binhua, Huang Fei, Li Yongbin
- Kobigbird-large: Transformation Of Transformer For Korean Language Understanding Yang Kisu, Jang Yoonna, Lee Taewoo, Seong Jinwoo, Lee Hyungjin, Jang Hwanseok, Lim Heuiseok
- Lidar-llm: Exploring The Potential Of Large Language Models For 3D Lidar Understanding Yang Senqiao, Liu Jiaming, Zhang Ray, Pan Mingjie, Guo Zoey, Li Xiaoqi, Chen Zehui, Gao Peng, Guo Yandong, Zhang Shanghang
- Progressive Evidence Refinement For Open-domain Multimodal Retrieval Question Answering Yang Shuwen, Wu Anran, Wu Xingjiao, Xiao Luwei, Ma Tianlong, Jin Cheng, He Liang
- Dior-cvae: Pre-trained Language Models And Diffusion Priors For Variational Dialog Generation Yang Tianyu, Tran Thy Thy, Gurevych Iryna
- Exploring An LM To Generate Prolog Predicates From Mathematics Questions Yang Xiaocheng, Tam Yik-cheung
- Lever LM: Configuring In-context Sequence To Lever Large Vision Language Models Yang Xu, Peng Yingzhe, Ma Haoxuan, Xu Shuo, Zhang Chi, Han Yucheng, Zhang Hanwang
- Bias A-head? Analyzing Bias In Transformer-based Language Model Attention Heads Yang Yi, Duan Hanyu, Abbasi Ahmed, Lalor John P., Tam Kar Yan
- On The Planning, Search, And Memorization Capabilities Of Large Language Models Yang Yunhao, Tomar Anshul
- Quantization-aware And Tensor-compressed Training Of Transformers For Natural Language Understanding Yang Zi, Choudhary Samridhi, Kunzmann Siegfried, Zhang Zheng
- LLM Lies: Hallucinations Are Not Bugs, But Features As Adversarial Examples Yao Jia-yu, Ning Kun-peng, Liu Zhen-hui, Ning Mu-nan, Liu Yu-yang, Yuan Li
- ASPEN: High-throughput Lora Fine-tuning Of Large Language Models With A Single GPU Ye Zhengmao, Li Dengchun, Tian Jingqi, Lan Tingfeng, Zuo Jie, Duan Lei, Lu Hui, Jiang Yexi, Sha Jian, Zhang Ke, Tang Mingjie
- Ctrlstruct: Dialogue Structure Learning For Open-domain Response Generation Yin Congchi, Li Piji, Ren Zhaochun
- \(P^{3}O\): Transferring Visual Representations For Reinforcement Learning Via Prompting You Guoliang, Chu Xiaomeng, Duan Yifan, Peng Jie, Ji Jianmin, Zhang Yu, Zhang Yanyong
- MEGABYTE: Predicting Million-byte Sequences With Multiscale Transformers Yu Lili, Simig Dániel, Flaherty Colin, Aghajanyan Armen, Zettlemoyer Luke, Lewis Mike
- Evotext: Enhancing Natural Language Generation Models Via Self-escalation Learning For Up-to-date Knowledge And Improved Performance Yuan Zhengqing, Xue Huiwen, Zhang Chao, Liu Yongming
- Plan, Eliminate, And Track -- Language Models Are Good Teachers For Embodied Agents Yue Wu, So Yeon Min, Yonatan Bisk, Ruslan Salakhutdinov, Amos Azaria, Yuanzhi Li, Tom Mitchell, Shrimai Prabhumoye
- Attention Satisfies: A Constraint-satisfaction Lens On Factual Errors Of Language Models Yuksekgonul Mert, Chandrasekaran Varun, Jones Erik, Gunasekar Suriya, Naik Ranjita, Palangi Hamid, Kamar Ece, Nushi Besmira
- Fairness-aware Structured Pruning In Transformers Zayed Abdelrahman, Mordido Goncalo, Shabanian Samira, Baldini Ioana, Chandar Sarath
- Automatic Generation Of Multiple-choice Questions Zhang Cheng
- Benchmarking And In-depth Performance Study Of Large Language Models On Habana Gaudi Processors Zhang Chengming, Sun Baixi, Yu Xiaodong, Xie Zhen, Zheng Weijian, Iskra Kamil, Beckman Pete, Tao Dingwen
- Taxonomy-based Checklist For Large Language Model Evaluation Zhang Damin
- Controllable Text Generation With Residual Memory Transformer Zhang Hanqing, Si Sun, Wu Haiming, Song Dawei
- Linear Attention Via Orthogonal Memory Zhang Jun, Jiang Shuyang, Feng Jiangtao, Zheng Lin, Kong Lingpeng
- Crash: Clustering, Removing, And Sharing Enhance Fine-tuning Without Full Large Language Model Zhang Kaiyan, Ding Ning, Qi Biqing, Zhu Xuekai, Long Xinwei, Zhou Bowen
- Llama-adapter: Efficient Fine-tuning Of Language Models With Zero-init Attention Zhang Renrui, Han Jiaming, Liu Chris, Gao Peng, Zhou Aojun, Hu Xiangfei, Yan Shilin, Lu Pan, Li Hongsheng, Qiao Yu
- Planning With Large Language Models For Code Generation Zhang Shun, Chen Zhenfang, Shen Yikang, Ding Mingyu, Tenenbaum Joshua B., Gan Chuang
- History-aware Hierarchical Transformer For Multi-session Open-domain Dialogue System Zhang Tong, Liu Yong, Li Boyang, Zeng Zhiwei, Wang Pengwei, You Yuan, Miao Chunyan, Cui Lizhen
- Language Models Are Universal Embedders Zhang Xin, Li Zehan, Zhang Yanzhao, Long Dingkun, Xie Pengjun, Zhang Meishan, Zhang Min
- Cached Transformers: Improving Transformers With Differentiable Memory Cache Zhang Zhaoyang, Shao Wenqi, Ge Yixiao, Wang Xiaogang, Gu Jinwei, Luo Ping
- Mani-gpt: A Generative Model For Interactive Robotic Manipulation Zhang Zhe, Chai Wei, Wang Jiankun
- Accountable Textual-visual Chat Learns To Reject Human Instructions In Image Re-creation Zhang Zhiwei, Liu Yuliang
- Unifying The Perspectives Of NLP And Software Engineering: A Survey On Language Models For Code Zhang Ziyin, Chen Chaoyu, Liu Bingchang, Liao Cong, Gong Zi, Yu Hang, Li Jianguo, Wang Rui
- Large Language Models Are Complex Table Parsers Zhao Bowen, Ji Changkai, Zhang Yuejie, He Wen, Wang Yingwen, Wang Qing, Feng Rui, Zhang Xiaobo
- Explainability For Large Language Models: A Survey Zhao Haiyan, Chen Hanjie, Yang Fan, Liu Ninghao, Deng Huiqi, Cai Hengyi, Wang Shuaiqiang, Yin Dawei, Du Mengnan
- Hallucination Detection For Grounded Instruction Generation Zhao Lingjun, Nguyen Khanh, Daumé Hal III
- Group Preference Optimization: Few-shot Alignment Of Large Language Models Zhao Siyan, Dang John, Grover Aditya
- A Survey Of Large Language Models Zhao Wayne Xin, Zhou Kun, Li Junyi, Tang Tianyi, Wang Xiaolei, Hou Yupeng, Min Yingqian, Zhang Beichen, Zhang Junjie, Dong Zican, Du Yifan, Yang Chen, Chen Yushuo, Chen Zhipeng, Jiang Jinhao, Ren Ruiyang, Li Yifan, Tang Xinyu, Liu Zikang, Liu Peiyu, Nie Jian-yun, Wen Ji-rong
- Bookgpt: A General Framework For Book Recommendation Empowered By Large Language Model Zhiyuli Aakas, Chen Yanfang, Zhang Xuan, Liang Xun
- Bag Of Tricks For Effective Language Model Pretraining And Downstream Adaptation: A Case Study On GLUE Zhong Qihuang, Ding Liang, Peng Keqin, Liu Juhua, Du Bo, Shen Li, Zhan Yibing, Tao Dacheng
- CONA: A Novel Context-aware Instruction Paradigm For Communication Using Large Language Model Zhou Nan, Tao Xinghui, Chen Xi
- Recurrentgpt: Interactive Generation Of (arbitrarily) Long Text Zhou Wangchunshu, Jiang Yuchen Eleanor, Cui Peng, Wang Tiannan, Xiao Zhenxin, Hou Yifan, Cotterell Ryan, Sachan Mrinmaya
- Advancing Transformer's Capabilities In Commonsense Reasoning Zhou Yu, Han Yunqiu, Zhou Hanyu, Wu Yulun
- Dialogue-contextualized Re-ranking For Medical History-taking Zhu Jian, Valmianski Ilya, Kannan Anitha
- VL-GPT: A Generative Pre-trained Transformer For Vision And Language Understanding And Generation Zhu Jinguo, Ding Xiaohan, Ge Yixiao, Ge Yuying, Zhao Sijie, Zhao Hengshuang, Wang Xiaohua, Shan Ying
- Coca: Fusing Position Embedding With Collinear Constrained Attention In Transformers For Long Context Window Extending Zhu Shiyi, Ye Jing, Jiang Wei, Xue Siqiao, Zhang Qi, Wu Yifan, Li Jianguo
- 3d-vista: Pre-trained Transformer For 3D Vision And Text Alignment Zhu Ziyu, Ma Xiaojian, Chen Yixin, Deng Zhidong, Huang Siyuan, Li Qing
- A Family Of Pretrained Transformer Language Models For Russian Zmitrovich Dmitry, Abramov Alexander, Kalmykov Andrey, Tikhonova Maria, Taktasheva Ekaterina, Astafurov Danil, Baushenko Mark, Snegirev Artem, Kadulin Vitalii, Markov Sergey, Shavrina Tatiana, Mikhailov Vladislav, Fenogenova Alena
- Concept-aware Training Improves In-context Learning Ability Of Language Models Štefánik Michal, Kadlčík Marek
- Olvit: Multi-modal State Tracking Via Attention-based Embeddings For Video-grounded Dialog Abdessaied Adnen, Von Hochmeister Manuel, Bulling Andreas
- The Challenges Of Evaluating LLM Applications: An Analysis Of Automated, Human, And Llm-based Approaches Abeysinghe Bhashithe, Circi Ruhan
- The Llama 3 Herd Of Models Abhimanyu Dubey, Abhinav Jauhri, Abhinav Pandey, Abhishek Kadian, Ahmad Al-dahle, Aiesha Letman, Akhil Mathur, Alan Schelten, Amy Yang, Angela Fan, Anirudh Goyal, Anthony Hartshorn, Aobo Yang, Archi Mitra, Archie Sravankumar, Artem Korenev, Arthur Hinsvark, Arun Rao, Aston Zhang, Aurelien Rodriguez, Austen Gregerson, Ava Spataru, Baptiste Roziere, Bethany Biron, Binh Tang, Bobbie Chern, Charlotte Caucheteux, Chaya Nayak, Chloe Bi, Chris Marra, Chris Mcconnell, Christian Keller, Christophe Touret, Chunyang Wu, Corinne Wong, Cristian Canton Ferrer, Cyrus Nikolaidis, Damien Allonsius, Daniel Song, Danielle Pintz, Danny Livshits, David Esiobu, Dhruv Choudhary, Dhruv Mahajan, Diego Garcia-olano, Diego Perino, Dieuwke Hupkes, Egor Lakomkin, Ehab Albadawy, Elina Lobanova, Emily Dinan, Eric Michael Smith, Filip Radenovic, Frank Zhang, Gabriel Synnaeve, Gabrielle Lee, Georgia Lewis Anderson, Graeme Nail, Gregoire Mialon, Guan Pang, Guillem Cucurell, Hailey Nguyen, Hannah Korevaar, Hu Xu, Hugo Touvron, Iliyan Zarov, Imanol Arrieta Ibarra, Isabel Kloumann, Ishan Misra, Ivan Evtimov, Jade Copet, Jaewon Lee, Jan Geffert, Jana Vranes, Jason Park, Jay Mahadeokar, Jeet Shah, Jelmer Van Der Linde, Jennifer Billock, Jenny Hong, Jenya Lee, Jeremy Fu, Jianfeng Chi, Jianyu Huang, Jiawen Liu, Jie Wang, Jiecao Yu, Joanna Bitton, Joe Spisak, Jongsoo Park, Joseph Rocca, Joshua Johnstun, Joshua Saxe, Junteng Jia, Kalyan Vasuden Alwala, Kartikeya Upasani, Kate Plawiak, Ke Li, Kenneth Heafield, Kevin Stone, Khalid El-arini, Krithika Iyer, Kshitiz Malik, Kuenley Chiu, Kunal Bhalla, Lauren Rantala-yeary, Laurens Van Der Maaten, Lawrence Chen, Liang Tan, Liz Jenkins, Louis Martin, Lovish Madaan, Lubo Malo, Lukas Blecher, Lukas Landzaat, Luke De Oliveira, Madeline Muzzi, Mahesh Pasupuleti, Mannat Singh, Manohar Paluri, Marcin Kardas, Mathew Oldham, Mathieu Rita, Maya Pavlova, Melanie Kambadur, Mike Lewis, Min Si, Mitesh Kumar Singh, Mona Hassan, Naman Goyal, Narjes Torabi, Nikolay Bashlykov, Nikolay Bogoychev, Niladri Chatterji, Olivier Duchenne, Onur Çelebi, Patrick Alrassy, Pengchuan Zhang, Pengwei Li, Petar Vasic, Peter Weng, Prajjwal Bhargava, Pratik Dubal, Praveen Krishnan, Punit Singh Koura, Puxin Xu, Qing He, Qingxiao Dong, Ragavan Srinivasan, Raj Ganapathy, Ramon Calderer, Ricardo Silveira Cabral, Robert Stojnic, Roberta Raileanu, Rohit Girdhar, Rohit Patel, Romain Sauvestre, Ronnie Polidoro, Roshan Sumbaly, Ross Taylor, Ruan Silva, Rui Hou, Rui Wang, Saghar Hosseini, Sahana Chennabasappa, Sanjay Singh, Sean Bell, Seohyun Sonia Kim, Sergey Edunov, Shaoliang Nie, Sharan Narang, Sharath Raparthy, Sheng Shen, Shengye Wan, Shruti Bhosale, Shun Zhang, Simon Vandenhende, Soumya Batra, Spencer Whitman, Sten Sootla, Stephane Collot, Suchin Gururangan, Sydney Borodinsky, Tamar Herman, Tara Fowler, Tarek Sheasha, Thomas Georgiou, Thomas Scialom, Tobias Speckbacher, Todor Mihaylov, Tong Xiao, Ujjwal Karn, Vedanuj Goswami, Vibhor Gupta, Vignesh Ramanathan, Viktor Kerkez, Vincent Gonguet, Virginie Do, Vish Vogeti, Vladan Petrovic, Weiwei Chu, Wenhan Xiong, Wenyin Fu, Whitney Meers, Xavier Martinet, Xiaodong Wang, Xiaoqing Ellen Tan, Xinfeng Xie, Xuchao Jia, Xuewei Wang, Yaelle Goldschlag, Yashesh Gaur, Yasmine Babaei, Yi Wen, Yiwen Song, Yuchen Zhang, Yue Li, Yuning Mao, Zacharie Delpierre Coudert, Zheng Yan, Zhengxing Chen, Zoe Papakipos, Aaditya Singh, Aaron Grattafiori, Abha Jain, Adam Kelsey, Adam Shajnfeld, Adithya Gangidi, Adolfo Victoria, Ahuva Goldstand, Ajay Menon, Ajay Sharma, Alex Boesenberg, Alex Vaughan, Alexei Baevski, Allie Feinstein, Amanda Kallet, Amit Sangani, Anam Yunus, Andrei Lupu, Andres Alvarado, Andrew Caples, Andrew Gu, Andrew Ho, Andrew Poulton, Andrew Ryan, Ankit Ramchandani, Annie Franco, Aparajita Saraf, Arkabandhu Chowdhury, Ashley Gabriel, Ashwin Bharambe, Assaf Eisenman, Azadeh Yazdan, Beau James, Ben Maurer, Benjamin Leonhardi, Bernie Huang, Beth Loyd, Beto De Paola, Bhargavi Paranjape, Bing Liu, Bo Wu, Boyu Ni, Braden Hancock, Bram Wasti, Brandon Spence, Brani Stojkovic, Brian Gamido, Britt Montalvo, Carl Parker, Carly Burton, Catalina Mejia, Changhan Wang, Changkyu Kim, Chao Zhou, Chester Hu, Ching-hsiang Chu, Chris Cai, Chris Tindal, Christoph Feichtenhofer, Damon Civin, Dana Beaty, Daniel Kreymer, Daniel Li, Danny Wyatt, David Adkins, David Xu, Davide Testuggine, Delia David, Devi Parikh, Diana Liskovich, Didem Foss, Dingkang Wang, Duc Le, Dustin Holland, Edward Dowling, Eissa Jamil, Elaine Montgomery, Eleonora Presani, Emily Hahn, Emily Wood, Erik Brinkman, Esteban Arcaute, Evan Dunbar, Evan Smothers, Fei Sun, Felix Kreuk, Feng Tian, Firat Ozgenel, Francesco Caggioni, Francisco Guzmán, Frank Kanayet, Frank Seide, Gabriela Medina Florez, Gabriella Schwarz, Gada Badeer, Georgia Swee, Gil Halpern, Govind Thattai, Grant Herman, Grigory Sizov, Guangyi Zhang, Guna Lakshminarayanan, Hamid Shojanazeri, Han Zou, Hannah Wang, Hanwen Zha, Haroun Habeeb, Harrison Rudolph, Helen Suk, Henry Aspegren, Hunter Goldman, Ibrahim Damlaj, Igor Molybog, Igor Tufanov, Irina-elena Veliche, Itai Gat, Jake Weissman, James Geboski, James Kohli, Japhet Asher, Jean-baptiste Gaya, Jeff Marcus, Jeff Tang, Jennifer Chan, Jenny Zhen, Jeremy Reizenstein, Jeremy Teboul, Jessica Zhong, Jian Jin, Jingyi Yang, Joe Cummings, Jon Carvill, Jon Shepard, Jonathan Mcphie, Jonathan Torres, Josh Ginsburg, Junjie Wang, Kai Wu, Kam Hou U, Karan Saxena, Karthik Prasad, Kartikay Khandelwal, Katayoun Zand, Kathy Matosich, Kaushik Veeraraghavan, Kelly Michelena, Keqian Li, Kun Huang, Kunal Chawla, Kushal Lakhotia, Kyle Huang, Lailin Chen, Lakshya Garg, Lavender A, Leandro Silva, Lee Bell, Lei Zhang, Liangpeng Guo, Licheng Yu, Liron Moshkovich, Luca Wehrstedt, Madian Khabsa, Manav Avalani, Manish Bhatt, Maria Tsimpoukelli, Martynas Mankus, Matan Hasson, Matthew Lennie, Matthias Reso, Maxim Groshev, Maxim Naumov, Maya Lathi, Meghan Keneally, Michael L. Seltzer, Michal Valko, Michelle Restrepo, Mihir Patel, Mik Vyatskov, Mikayel Samvelyan, Mike Clark, Mike Macey, Mike Wang, Miquel Jubert Hermoso, Mo Metanat, Mohammad Rastegari, Munish Bansal, Nandhini Santhanam, Natascha Parks, Natasha White, Navyata Bawa, Nayan Singhal, Nick Egebo, Nicolas Usunier, Nikolay Pavlovich Laptev, Ning Dong, Ning Zhang, Norman Cheng, Oleg Chernoguz, Olivia Hart, Omkar Salpekar, Ozlem Kalinli, Parkin Kent, Parth Parekh, Paul Saab, Pavan Balaji, Pedro Rittner, Philip Bontrager, Pierre Roux, Piotr Dollar, Polina Zvyagina, Prashant Ratanchandani, Pritish Yuvraj, Qian Liang, Rachad Alao, Rachel Rodriguez, Rafi Ayub, Raghotham Murthy, Raghu Nayani, Rahul Mitra, Raymond Li, Rebekkah Hogan, Robin Battey, Rocky Wang, Rohan Maheswari, Russ Howes, Ruty Rinott, Sai Jayesh Bondu, Samyak Datta, Sara Chugh, Sara Hunt, Sargun Dhillon, Sasha Sidorov, Satadru Pan, Saurabh Verma, Seiji Yamamoto, Sharadh Ramaswamy, Shaun Lindsay, Sheng Feng, Shenghao Lin, Shengxin Cindy Zha, Shiva Shankar, Shuqiang Zhang, Sinong Wang, Sneha Agarwal, Soji Sajuyigbe, Soumith Chintala, Stephanie Max, Stephen Chen, Steve Kehoe, Steve Satterfield, Sudarshan Govindaprasad, Sumit Gupta, Sungmin Cho, Sunny Virk, Suraj Subramanian, Sy Choudhury, Sydney Goldman, Tal Remez, Tamar Glaser, Tamara Best, Thilo Kohler, Thomas Robinson, Tianhe Li, Tianjun Zhang, Tim Matthews, Timothy Chou, Tzook Shaked, Varun Vontimitta, Victoria Ajayi, Victoria Montanez, Vijai Mohan, Vinay Satish Kumar, Vishal Mangla, Vítor Albiero, Vlad Ionescu, Vlad Poenaru, Vlad Tiberiu Mihailescu, Vladimir Ivanov, Wei Li, Wenchen Wang, Wenwen Jiang, Wes Bouaziz, Will Constable, Xiaocheng Tang, Xiaofang Wang, Xiaojian Wu, Xiaolan Wang, Xide Xia, Xilun Wu, Xinbo Gao, Yanjun Chen, Ye Hu, Ye Jia, Ye Qi, Yenda Li, Yilin Zhang, Ying Zhang, Yossi Adi, Youngjin Nam, Yu Wang, Yuchen Hao, Yundi Qian, Yuzi He, Zach Rait, Zachary Devito, Zef Rosnbrick, Zhaoduo Wen, Zhenyu Yang, Zhiwei Zhao
- A Survey On Symbolic Knowledge Distillation Of Large Language Models Acharya Kamal, Velasquez Alvaro, Song Houbing Herbert
- Attnlrp: Attention-aware Layer-wise Relevance Propagation For Transformers Achtibat Reduan, Hatefi Sayed Mohammad Vakilzadeh, Dreyer Maximilian, Jain Aakriti, Wiegand Thomas, Lapuschkin Sebastian, Samek Wojciech
- Keyformer: KV Cache Reduction Through Key Tokens Selection For Efficient Generative Inference Adnan Muhammad, Arunkumar Akhil, Jain Gaurav, Nair Prashant J., Soloveychik Ilya, Kamath Purushotham
- Leapformer: Enabling Linear Transformers For Autoregressive And Simultaneous Tasks Via Learned Proportions Agostinelli Victor, Hong Sanghyun, Chen Lizhong
- Illuminate: A Novel Approach For Depression Detection With Explainable Analysis And Proactive Therapy Using Prompt Engineering Agrawal Aryan
- Yi: Open Foundation Models By 01.AI 01.AI, Young Alex, Chen Bei, Li Chao, Huang Chengen, Zhang Ge, Zhang Guanwei, Li Heng, Zhu Jiangcheng, Chen Jianqun, Chang Jing, Yu Kaidong, Liu Peng, Liu Qiang, Yue Shawn, Yang Senbin, Yang Shiming, Yu Tao, Xie Wen, Huang Wenhao, Hu Xiaohui, Ren Xiaoyi, Niu Xinyao, Nie Pengcheng, Xu Yuchi, Liu Yudong, Wang Yue, Cai Yuxuan, Gu Zhenyu, Liu Zhiyuan, Dai Zonghong
- Plug And Play With Prompts: A Prompt Tuning Approach For Controlling Text Generation Ajwani Rohan Deepak, Zhu Zining, Rose Jonathan, Rudzicz Frank
- In-context Language Learning: Architectures And Algorithms Akyürek Ekin, Wang Bailin, Kim Yoon, Andreas Jacob
- History Of Generative Artificial Intelligence (AI) Chatbots: Past, Present, And Future Development Al-amin Md., Ali Mohammad Shazed, Salam Abdus, Khan Arif, Ali Ashraf, Ullah Ahsan, Alam Md Nur, Chowdhury Shamsul Kabir
- D-NLP At Semeval-2024 Task 2: Evaluating Clinical Inference Capabilities Of Large Language Models Altinok Duygu
- Sentencevae: Enable Next-sentence Prediction For Large Language Models With Faster Speed, Higher Accuracy And Longer Context An Hongjun, Chen Yifan, Sun Zhe, Li Xuelong
- An Assessment On Comprehending Mental Health Through Large Language Models Arcan Mihael, Niland David-paul, Delahunty Fionn
- Forklift: An Extensible Neural Lifter Armengol-estapé Jordi, Rocha Rodrigo C. O., Woodruff Jackson, Minervini Pasquale, O'boyle Michael F. P.
- SPAFIT: Stratified Progressive Adaptation Fine-tuning For Pre-trained Large Language Models Arora Samir, Wang Liangliang
- Exploring The Robustness Of Task-oriented Dialogue Systems For Colloquial German Varieties Artemova Ekaterina, Blaschke Verena, Plank Barbara
- What Makes Language Models Good-enough? Asami Daiki, Sugawara Saku
- Slicegpt: Compress Large Language Models By Deleting Rows And Columns Ashkboos Saleh, Croci Maximilian L., Nascimento Marcelo Gennari Do, Hoefler Torsten, Hensman James
- Bifurcated Attention: Accelerating Massively Parallel Decoding With Shared Prefixes In Llms Athiwaratkun Ben, Gonugondla Sujan Kumar, Gouda Sanjay Krishna, Qian Haifeng, Ding Hantian, Sun Qing, Wang Jun, Guo Jiacheng, Chen Liangfu, Bhatia Parminder, Nallapati Ramesh, Sengupta Sudipta, Xiang Bing
- Transformer Alignment In Large Language Models Aubry Murdock, Meng Haoming, Sugolov Anton, Papyan Vardan
- The Pitfalls Of Next-token Prediction Bachmann Gregor, Nagarajan Vaishnavh
- Masked Mixers For Language Generation And Retrieval Badger Benjamin L.
- Citrus: Chunked Instruction-aware State Eviction For Long Sequence Modeling Bai Yu, Zou Xiyuan, Huang Heyan, Chen Sanxing, Rondeau Marc-antoine, Gao Yang, Cheung Jackie Chi Kit
- Downstream Bias Mitigation Is All You Need Baksi Arkadeep, Singh Rahul, Joshi Tarun
- Understanding Information Storage And Transfer In Multi-modal Large Language Models Basu Samyadeep, Grayson Martin, Morrison Cecily, Nushi Besmira, Feizi Soheil, Massiceti Daniela
- Xlstm: Extended Long Short-term Memory Beck Maximilian, Pöppel Korbinian, Spanring Markus, Auer Andreas, Prudnikova Oleksandra, Kopp Michael, Klambauer Günter, Brandstetter Johannes, Hochreiter Sepp
- Lotr: Low Tensor Rank Weight Adaptation Bershatsky Daniel, Cherniuk Daria, Daulbaev Talgat, Mikhalev Aleksandr, Oseledets Ivan
- Multi-head RAG: Solving Multi-aspect Problems With Llms Besta Maciej, Kubicek Ales, Niggli Roman, Gerstenberger Robert, Weitzendorf Lucas, Chi Mingyuan, Iff Patrick, Gajda Joanna, Nyczyk Piotr, Müller Jürgen, Niewiadomski Hubert, Chrapek Marcin, Podstawski Michał, Hoefler Torsten
- On The Robustness Of Language Models For Tabular Question Answering Bhandari Kushal Raj, Xing Sixue, Dan Soham, Gao Jianxi
- Lipost: Improved Content Understanding With Effective Use Of Multi-task Contrastive Learning Bindal Akanksha, Ramanujam Sudarshan, Golland Dave, Hazen Tj, Jiang Tina, Zhang Fengyu, Yan Peng
- GRAM: Global Reasoning For Multi-page VQA Blau Tsachi, Fogel Sharon, Ronen Roi, Golts Alona, Ganz Roy, Avraham Elad Ben, Aberdam Aviad, Tsiper Shahar, Litman Ron
- Long-span Question-answering: Automatic Question Generation And Qa-system Ranking Via Side-by-side Evaluation Bohnet Bernd, Swersky Kevin, Liu Rosanne, Awasthi Pranjal, Nova Azade, Snaider Javier, Sedghi Hanie, Parisi Aaron T, Collins Michael, Lazaridou Angeliki, Firat Orhan, Fiedel Noah
- Recurrentgemma: Moving Past Transformers For Efficient Open Language Models Botev Aleksandar, De Soham, Smith Samuel L, Fernando Anushan, Muraru George-cristian, Haroun Ruba, Berrada Leonard, Pascanu Razvan, Sessa Pier Giuseppe, Dadashi Robert, Hussenot Léonard, Ferret Johan, Girgin Sertan, Bachem Olivier, Andreev Alek, Kenealy Kathleen, Mesnard Thomas, Hardin Cassidy, Bhupatiraju Surya, Pathak Shreya, Sifre Laurent, Rivière Morgane, Kale Mihir Sanjay, Love Juliette, Tafti Pouya, Joulin Armand, Fiedel Noah, Senter Evan, Chen Yutian, Srinivasan Srivatsan, Desjardins Guillaume, Budden David, Doucet Arnaud, Vikram Sharad, Paszke Adam, Gale Trevor, Borgeaud Sebastian, Chen Charlie, Brock Andy, Paterson Antonia, Brennan Jenny, Risdal Meg, Gundluru Raj, Devanathan Nesh, Mooney Paul, Chauhan Nilay, Culliton Phil, Martins Luiz Gustavo, Bandy Elisa, Huntsperger David, Cameron Glenn, Zucker Arthur, Warkentin Tris, Peran Ludovic, Giang Minh, Ghahramani Zoubin, Farabet Clément, Kavukcuoglu Koray, Hassabis Demis, Hadsell Raia, Teh Yee Whye, De Freitas Nando
- Transformers Meet Neural Algorithmic Reasoners Bounsi Wilfried, Ibarz Borja, Dudzik Andrew, Hamrick Jessica B., Markeeva Larisa, Vitvitskyi Alex, Pascanu Razvan, Veličković Petar
- Reducing Transformer Key-value Cache Size With Cross-layer Attention Brandon William, Mishra Mayank, Nrusimha Aniruddha, Panda Rameswar, Ragan-Kelley Jonathan
- LOCOST: State-space Models For Long Document Abstractive Summarization Bronnec Florian Le, Duong Song, Ravaut Mathieu, Allauzen Alexandre, Chen Nancy F., Guigue Vincent, Lumbreras Alberto, Soulier Laure, Gallinari Patrick
- MADTP: Multimodal Alignment-guided Dynamic Token Pruning For Accelerating Vision-language Transformer Cao Jianjian, Ye Peng, Li Shengze, Yu Chong, Tang Yansong, Lu Jiwen, Chen Tao
- Genrec: Generative Sequential Recommendation With Large Language Models Cao Panfeng, Lio Pietro
- Personalized Steering Of Large Language Models: Versatile Steering Vectors Through Bi-directional Preference Optimization Cao Yuanpu, Zhang Tianrong, Cao Bochuan, Yin Ziyi, Lin Lu, Ma Fenglong, Chen Jinghui
- Captioning Visualizations With Large Language Models (CVLLM): A Tutorial Carenini Giuseppe, Johnson Jordon, Salamatian Ali
- A Review Of Multi-modal Large Language And Vision Models Carolan Kilian, Fennelly Laura, Smeaton Alan F.
- Dense Reward For Free In Reinforcement Learning From Human Feedback Chan Alex J., Sun Hao, Holt Samuel, Van Der Schaar Mihaela
- Goldfish: Monolingual Language Models For 350 Languages Chang Tyler A., Arnett Catherine, Tu Zhuowen, Bergen Benjamin K.
- When Large Language Models Meet Evolutionary Algorithms Chao Wang, Zhao Jiaxuan, Jiao Licheng, Li Lingling, Liu Fang, Yang Shuyuan
- Language Models Can Exploit Cross-task In-context Learning For Data-scarce Novel Tasks Chatterjee Anwoy, Tanwar Eshaan, Dutta Subhabrata, Chakraborty Tanmoy
- Control-dag: Constrained Decoding For Non-autoregressive Directed Acyclic T5 Using Weighted Finite State Automata Chen Jinghong, Lin Weizhe, Mei Jingbiao, Byrne Bill
- Pixart-\(\Sigma\): Weak-to-strong Training Of Diffusion Transformer For 4K Text-to-image Generation Chen Junsong, Ge Chongjian, Xie Enze, Wu Yue, Yao Lewei, Ren Xiaozhe, Wang Zhongdao, Luo Ping, Lu Huchuan, Li Zhenguo
- OMPGPT: A Generative Pre-trained Transformer Model For Openmp Chen Le, Bhattacharjee Arijit, Ahmed Nesreen, Hasabnis Niranjan, Oren Gal, Vo Vy, Jannesari Ali
- How Truncating Weights Improves Reasoning In Language Models Chen Lei, Bruna Joan, Bietti Alberto
- Efficientqat: Efficient Quantization-aware Training For Large Language Models Chen Mengzhao, Shao Wenqi, Xu Peng, Wang Jiahao, Gao Peng, Zhang Kaipeng, Qiao Yu, Luo Ping
- Enhancing Visual Question Answering Through Ranking-based Hybrid Training And Multimodal Fusion Chen Peiyuan, Zhang Zecheng, Dong Yiping, Zhou Li, Wang Han
- WSI-VQA: Interpreting Whole Slide Images By Generative Visual Question Answering Chen Pingyi, Zhu Chenglu, Zheng Sunyi, Li Honglin, Yang Lin
- Llava-mole: Sparse Mixture Of Lora Experts For Mitigating Data Conflicts In Instruction Finetuning Mllms Chen Shaoxiang, Jie Zequn, Ma Lin
- Recoverable Compression: A Multimodal Vision Token Recovery Mechanism Guided By Text Information Chen Yi, Xu Jian, Zhang Xu-yao, Liu Wen-zhuo, Liu Yang-yang, Liu Cheng-lin
- Emergence Of A High-dimensional Abstraction Phase In Language Transformers Cheng Emily, Doimo Diego, Kervadec Corentin, Macocco Iuri, Yu Jade, Laio Alessandro, Baroni Marco
- Exploring Quantization For Efficient Pre-training Of Transformer Language Models Chitsaz Kamran, Fournier Quentin, Mordido Gonçalo, Chandar Sarath
- Transformer Explainer: Interactive Learning Of Text-generative Models Cho Aeree, Kim Grace C., Karpekov Alexander, Helbling Alec, Wang Zijie J., Lee Seongmin, Hoover Benjamin, Chau Duen Horng
- Cross-architecture Transfer Learning For Linear-cost Inference Transformers Choi Sehyun
- Improving Token-based World Models With Parallel Observation Prediction Cohen Lior, Wang Kaixin, Kang Bingyi, Mannor Shie
- Attentionlego: An Open-source Building Block For Spatially-scalable Large Language Model Accelerator With Processing-in-memory Technology Cong Rongqing, He Wenyang, Li Mingxuan, Luo Bangning, Yang Zebin, Yang Yuchao, Huang Ru, Yan Bonan
- Representations As Language: An Information-theoretic Framework For Interpretability Conklin Henry, Smith Kenny
- Llambert: Large-scale Low-cost Data Annotation In NLP Csanády Bálint, Muzsai Lajos, Vedres Péter, Nádasdy Zoltán, Lukács András
- Moeut: Mixture-of-experts Universal Transformers Csordás Róbert, Irie Kazuki, Schmidhuber Jürgen, Potts Christopher, Manning Christopher D.
- Synthetic Dataset Creation And Fine-tuning Of Transformer Models For Question Answering In Serbian Cvetanović Aleksa, Tadić Predrag
- CORM: Cache Optimization With Recent Message For Large Language Model Inference Dai Jincheng, Huang Zhuowei, Jiang Haiyun, Chen Chen, Cai Deng, Bi Wei, Shi Shuming
- Muap: Multi-step Adaptive Prompt Learning For Vision-language Model With Missing Modality Dai Ruiting, Tan Yuqiao, Mo Lisi, He Tao, Qin Ke, Liang Shuang
- Application Of GPT Language Models For Innovation In Activities In University Teaching De Buenaga Manuel, Bueno Francisco Javier
- Pelle: Encoder-based Language Models For Brazilian Portuguese Based On Open Data De Mello Guilherme Lamartine, Finger Marcelo, Serras Felipe, Carpi Miguel De Mello, Jose Marcos Menon, Domingues Pedro Henrique, Cavalim Paulo
- ELASTIC: Efficient Linear Attention For Sequential Interest Compression Deng Jiaxin, Wang Shiyao, Lu Song, Li Yinfeng, Luo Xinchen, Liu Yuanjun, Xu Peixing, Zhou Guorui
- Attention Is Naturally Sparse With Gaussian Distributed Input Deng Yichuan, Song Zhao, Yang Chiwun
- A Simple And Effective \(L_2\) Norm-based Strategy For KV Cache Compression Devoto Alessio, Zhao Yu, Scardapane Simone, Minervini Pasquale
- A Tale Of Tails: Model Collapse As A Change Of Scaling Laws Dohmatob Elvis, Feng Yunzhen, Yang Pu, Charton Francois, Kempe Julia
- Latent Attention For Linear Time Transformers Dolga Rares, Cobzarenco Marius, Barber David
- Internlm-xcomposer2-4khd: A Pioneering Large Vision-language Model Handling Resolutions From 336 Pixels To 4K HD Dong Xiaoyi, Zhang Pan, Zang Yuhang, Cao Yuhang, Wang Bin, Ouyang Linke, Zhang Songyang, Duan Haodong, Zhang Wenwei, Li Yining, Yan Hang, Gao Yang, Chen Zhe, Zhang Xinyue, Li Wei, Li Jingwen, Wang Wenhai, Chen Kai, He Conghui, Zhang Xingcheng, Dai Jifeng, Qiao Yu, Lin Dahua, Wang Jiaqi
- Prompt Your Brain: Scaffold Prompt Tuning For Efficient Adaptation Of Fmri Pre-trained Model Dong Zijian, Wu Yilei, Chen Zijiao, Zhang Yichi, Jin Yueming, Zhou Juan Helen
- Enhancing Robustness Of Llm-synthetic Text Detectors For Academic Writing: A Comprehensive Analysis Dou Zhicheng, Guo Yuchen, Chang Ching-chun, Nguyen Huy H., Echizen Isao
- Stacking Your Transformers: A Closer Look At Model Growth For Efficient LLM Pre-training Du Wenyu, Luo Tongxu, Qiu Zihan, Huang Zeyu, Shen Yikang, Cheng Reynold, Guo Yike, Fu Jie
- Your Context Is Not An Array: Unveiling Random Access Limitations In Transformers Ebrahimi Mohammadreza, Panchal Sunny, Memisevic Roland
- Dynamic Q&A Of Clinical Documents With Large Language Models Elgedawy Ran, Danciu Ioana, Mahbub Maria, Srinivasan Sudarshan
- Layerskip: Enabling Early Exit Inference And Self-speculative Decoding Elhoushi Mostafa, Shrivastava Akshat, Liskovich Diana, Hosmer Basil, Wasti Bram, Lai Liangzhen, Mahmoud Anas, Acun Bilge, Agarwal Saurabh, Roman Ahmed, Aly Ahmed A, Chen Beidi, Wu Carole-jean
- Reasoning In Transformers - Mitigating Spurious Correlations And Reasoning Shortcuts Enström Daniel, Kjellberg Viktor, Johansson Moa
- Unraveling The Dominance Of Large Language Models Over Transformer Models For Bangla Natural Language Inference: A Comprehensive Study Faria Fatema Tuj Johora, Moin Mukaffi Bin, Fahim Asif Iftekher, Debnath Pronay, Shah Faisal Muhammad
- What Are Large Language Models Mapping To In The Brain? A Case Against Over-reliance On Brain Scores Feghhi Ebrahim, Hadidi Nima, Song Bryan, Blank Idan A., Kao Jonathan C.
- Video-ccam: Enhancing Video-language Understanding With Causal Cross-attention Masks For Short And Long Videos Fei Jiajun, Li Dian, Deng Zhidong, Wang Zekun, Liu Gang, Wang Hui
- CLIP With Generative Latent Replay: A Strong Baseline For Incremental Learning Frascaroli Emanuele, Panariello Aniello, Buzzega Pietro, Bonicelli Lorenzo, Porrello Angelo, Calderara Simone
- Representing Rule-based Chatbots With Transformers Friedman Dan, Panigrahi Abhishek, Chen Danqi
- Lazyllm: Dynamic Token Pruning For Efficient Long Context LLM Inference Fu Qichen, Cho Minsik, Merth Thomas, Mehta Sachin, Rastegari Mohammad, Najibi Mahyar
- Using Large Language Models For The Interpretation Of Building Regulations Fuchs Stefan, Witbrock Michael, Dimyadi Johannes, Amor Robert
- Student Answer Forecasting: Transformer-driven Answer Choice Prediction For Language Learning Gado Elena Grazia, Martorella Tommaso, Zunino Luca, Mejia-domenzain Paola, Swamy Vinitra, Frej Jibril, Käser Tanja
- Question Aware Vision Transformer For Multimodal Reasoning Ganz Roy, Kittenplon Yair, Aberdam Aviad, Avraham Elad Ben, Nuriel Oren, Mazor Shai, Litman Ron
- Cost-efficient Large Language Model Serving For Multi-turn Conversations With Cachedattention Gao Bin, He Zhuomin, Sharma Puru, Kang Qingxuan, Jevdjic Djordje, Deng Junbo, Yang Xingkun, Yu Zhou, Zuo Pengfei
- Selfcp: Compressing Over-limit Prompt Via The Frozen Large Language Model Itself Gao Jun, Cao Ziqiang, Li Wenjie
- Harnessing The Intrinsic Knowledge Of Pretrained Language Models For Challenging Text Classification Settings Gao Lingyu
- Lumina-t2x: Transforming Text Into Any Modality, Resolution, And Duration Via Flow-based Large Diffusion Transformers Gao Peng, Zhuo Le, Liu Dongyang, Du Ruoyi, Luo Xu, Qiu Longtian, Zhang Yuhang, Lin Chen, Huang Rongjie, Geng Shijie, Zhang Renrui, Xi Junlin, Shao Wenqi, Jiang Zhengkai, Yang Tianshuo, Ye Weicai, Tong He, He Jingwen, Qiao Yu, Li Hongsheng
- A Review On The Use Of Large Language Models As Virtual Tutors García-méndez Silvia, De Arriba-pérez Francisco, Somoza-lópez María Del Carmen
- Gemma 2: Improving Open Language Models At A Practical Size Gemma Team, Riviere Morgane, Pathak Shreya, Sessa Pier Giuseppe, Hardin Cassidy, Bhupatiraju Surya, Hussenot Léonard, Mesnard Thomas, Shahriari Bobak, Ramé Alexandre, Ferret Johan, Liu Peter, Tafti Pouya, Friesen Abe, Casbon Michelle, Ramos Sabela, Kumar Ravin, Lan Charline Le, Jerome Sammy, Tsitsulin Anton, Vieillard Nino, Stanczyk Piotr, Girgin Sertan, Momchev Nikola, Hoffman Matt, Thakoor Shantanu, Grill Jean-bastien, Neyshabur Behnam, Bachem Olivier, Walton Alanna, Severyn Aliaksei, Parrish Alicia, Ahmad Aliya, Hutchison Allen, Abdagic Alvin, Carl Amanda, Shen Amy, Brock Andy, Coenen Andy, Laforge Anthony, Paterson Antonia, Bastian Ben, Piot Bilal, Wu Bo, Royal Brandon, Chen Charlie, Kumar Chintu, Perry Chris, Welty Chris, Choquette-choo Christopher A., Sinopalnikov Danila, Weinberger David, Vijaykumar Dimple, Rogozińska Dominika, Herbison Dustin, Bandy Elisa, Wang Emma, Noland Eric, Moreira Erica, Senter Evan, Eltyshev Evgenii, Visin Francesco, Rasskin Gabriel, Wei Gary, Cameron Glenn, Martins Gus, Hashemi Hadi, Klimczak-plucińska Hanna, Batra Harleen, Dhand Harsh, Nardini Ivan, Mein Jacinda, Zhou Jack, Svensson James, Stanway Jeff, Chan Jetha, Zhou Jin Peng, Carrasqueira Joana, Iljazi Joana, Becker Jocelyn, Fernandez Joe, Van Amersfoort Joost, Gordon Josh, Lipschultz Josh, Newlan Josh, Ji Ju-yeong, Mohamed Kareem, Badola Kartikeya, Black Kat, Millican Katie, Mcdonell Keelin, Nguyen Kelvin, Sodhia Kiranbir, Greene Kish, Sjoesund Lars Lowe, Usui Lauren, Sifre Laurent, Heuermann Lena, Lago Leticia, Mcnealus Lilly, Soares Livio Baldini, Kilpatrick Logan, Dixon Lucas, Martins Luciano, Reid Machel, Singh Manvinder, Iverson Mark, Görner Martin, Velloso Mat, Wirth Mateo, Davidow Matt, Miller Matt, Rahtz Matthew, Watson Matthew, Risdal Meg, Kazemi Mehran, Moynihan Michael, Zhang Ming, Kahng Minsuk, Park Minwoo, Rahman Mofi, Khatwani Mohit, Dao Natalie, Bardoliwalla Nenshad, Devanathan Nesh, Dumai Neta, 
Chauhan Nilay, Wahltinez Oscar, Botarda Pankil, Barnes Parker, Barham Paul, Michel Paul, Jin Pengchong, Georgiev Petko, Culliton Phil, Kuppala Pradeep, Comanescu Ramona, Merhej Ramona, Jana Reena, Rokni Reza Ardeshir, Agarwal Rishabh, Mullins Ryan, Saadat Samaneh, Carthy Sara Mc, Perrin Sarah, Arnold Sébastien M. R., Krause Sebastian, Dai Shengyang, Garg Shruti, Sheth Shruti, Ronstrom Sue, Chan Susan, Jordan Timothy, Yu Ting, Eccles Tom, Hennigan Tom, Kocisky Tomas, Doshi Tulsee, Jain Vihan, Yadav Vikas, Meshram Vilobh, Dharmadhikari Vishal, Barkley Warren, Wei Wei, Ye Wenming, Han Woohyun, Kwon Woosuk, Xu Xiang, Shen Zhe, Gong Zhitao, Wei Zichuan, Cotruta Victor, Kirk Phoebe, Rao Anand, Giang Minh, Peran Ludovic, Warkentin Tris, Collins Eli, Barral Joelle, Ghahramani Zoubin, Hadsell Raia, Sculley D., Banks Jeanine, Dragan Anca, Petrov Slav, Vinyals Oriol, Dean Jeff, Hassabis Demis, Kavukcuoglu Koray, Farabet Clement, Buchatskaya Elena, Borgeaud Sebastian, Fiedel Noah, Joulin Armand, Kenealy Kathleen, Dadashi Robert, Andreev Alek
- Beyond Self-learned Attention: Mitigating Attention Bias In Transformer-based Models Using Attention Guidance Gesi Jiri, Ahmed Iftekhar
- AI And Memory Wall Gholami Amir, Yao Zhewei, Kim Sehoon, Hooper Coleman, Mahoney Michael W., Keutzer Kurt
- Blendsql: A Scalable Dialect For Unifying Hybrid Question Answering In Relational Algebra Glenn Parker, Dakle Parag Pravin, Wang Liang, Raghavan Preethi
- Zamba: A Compact 7B SSM Hybrid Model Glorioso Paolo, Anthony Quentin, Tokpanov Yury, Whittington James, Pilault Jonathan, Ibrahim Adam, Millidge Beren
- Language Guided Exploration For RL Agents In Text Environments Golchha Hitesh, Yerawar Sahil, Patel Dhruvesh, Dan Soham, Murugesan Keerthiram
- Goldfinch: High Performance Rwkv/transformer Hybrid With Linear Pre-fill And Extreme Kv-cache Compression Goldstein Daniel, Obeid Fares, Alcaide Eric, Song Guangyu, Cheah Eugene
- Mechanistic Interpretability Of Large Language Models With Applications To The Financial Services Industry Golgoon Ashkan, Filom Khashayar, Kannan Arjun Ravi
- Omnifusion Technical Report Goncharova Elizaveta, Razzhigaev Anton, Mikhalchuk Matvey, Kurkin Maxim, Abdullaeva Irina, Skripkin Matvey, Oseledets Ivan, Dimitrov Denis, Kuznetsov Andrey
- AST-T5: Structure-aware Pretraining For Code Generation And Understanding Gong Linyuan, Elhoushi Mostafa, Cheung Alvin
- Assessing Generative Language Models In Classification Tasks: Performance And Self-evaluation Capabilities In The Environmental And Climate Change Domain Grasso Francesca, Locci Stefano
- Is Mamba Capable Of In-context Learning? Grazzi Riccardo, Siems Julien, Schrodi Simon, Brox Thomas, Hutter Frank
- Transformers, Contextualism, And Polysemy Grindrod Jumbly
- RWKV-CLIP: A Robust Vision-language Representation Learner Gu Tiancheng, Yang Kaicheng, An Xiang, Feng Ziyong, Liu Dongnan, Cai Weidong, Deng Jiankang
- Probing The Robustness Of Vision-language Pretrained Models: A Multimodal Adversarial Attack Approach Guan Jiwei, Ding Tianyu, Cao Longbing, Pan Lei, Wang Chen, Zheng Xi
- Conversational Topic Recommendation In Counseling And Psychotherapy With Decision Transformer And Large Language Models Gunal Aylin, Lin Baihan, Bouneffouf Djallel
- Agent-simt: Agent-assisted Simultaneous Machine Translation With Large Language Models Guo Shoutao, Zhang Shaolei, Ma Zhengrui, Zhang Min, Feng Yang
- Sillm: Large Language Models For Simultaneous Machine Translation Guo Shoutao, Zhang Shaolei, Ma Zhengrui, Zhang Min, Feng Yang
- Looking Into Black Box Code Language Models Haider Muhammad Umair, Farooq Umar, Siddique A. B., Marron Mark
- Mamba State-space Models Can Be Strong Downstream Learners Halloran John T., Gulati Manbir, Roysdon Paul F.
- Enhancing CTR Prediction Through Sequential Recommendation Pre-training: Introducing The SRP4CTR Framework Han Ruidong, Li Qianzhong, Jiang He, Li Rui, Zhao Yurou, Li Xiang, Lin Wei
- EMMA: Your Text-to-image Diffusion Model Can Secretly Accept Multi-modal Prompts Han Yucheng, Wang Rui, Zhang Chi, Hu Juntao, Cheng Pei, Fu Bin, Zhang Hanwang
- Vision-and-language Navigation Generative Pretrained Transformer Hanlin Wen
- Exploring Chatgpt And Its Impact On Society Haque Md. Asraful, Li Shuai
- A Comprehensive Overview Of Large Language Models (llms) For Cyber Defences: Opportunities And Directions Hassanin Mohammed, Moustafa Nour
- A Law Of Next-token Prediction In Large Language Models He Hangfeng, Su Weijie J.
- What Matters In Transformers? Not All Attention Is Needed He Shwai, Sun Guoheng, Shen Zheyu, Li Ang
- Language Models As Hierarchy Encoders He Yuan, Yuan Zhangdie, Chen Jiaoyan, Horrocks Ian
- Fortifying Ethical Boundaries In AI: Advanced Strategies For Enhancing Security In Large Language Models He Yunhong, Qiu Jianling, Zhang Wei, Yuan Zhengqing
- Camelot: Towards Large Language Models With Training-free Consolidated Associative Memory He Zexue, Karlinsky Leonid, Kim Donghyun, Mcauley Julian, Krotov Dmitry, Feris Rogerio
- HMT: Hierarchical Memory Transformer For Long Context Language Processing He Zifan, Qin Zongyue, Prakriya Neha, Sun Yizhou, Cong Jason
- Super Tiny Language Models Hillier Dylan, Guertler Leon, Tan Cheston, Agrawal Palaash, Ruirui Chen, Cheng Bobby
- Outcome-constrained Large Language Models For Countering Hate Speech Hong Lingzi, Luo Pengcheng, Blanco Eduardo, Song Xiaoying
- Free To Play: UN Trade And Development's Experience With Developing Its Own Open-source Retrieval Augmented Generation Large Language Model Application Hopp Daniel
- Visual Prompting For Generalized Few-shot Segmentation: A Multi-scale Approach Hossain Mir Rayat Imtiaz, Siam Mennatullah, Sigal Leonid, Little James J.
- Retrieval-augmented Code Completion For Local Projects Using Large Language Models Hostnik Marko, Robnik-šikonja Marko
- Outlier-efficient Hopfield Layers For Large Transformer-based Models Hu Jerry Yao-chieh, Chang Pei-hsuan, Luo Robin, Chen Hong-yu, Li Weijian, Wang Wei-po, Liu Han
- Mixed Sparsity Training: Achieving 4\(\times\) FLOP Reduction For Transformer Pretraining Hu Pihe, Li Shaolong, Huang Longbo
- Matryoshka Query Transformer For Large Vision-language Models Hu Wenbo, Dou Zi-yi, Li Liunian Harold, Kamath Amita, Peng Nanyun, Chang Kai-wei
- Generative Pretrained Structured Transformers: Unsupervised Syntactic Language Models At Scale Hu Xiang, Ji Pengyu, Zhu Qingyang, Wu Wei, Tu Kewei
- Case-based Or Rule-based: How Do Transformers Do The Math? Hu Yi, Tang Xiaojuan, Yang Haotong, Zhang Muhan
- Recurrent Context Compression: Efficiently Expanding The Context Window Of LLM Huang Chensen, Zhu Guibo, Wang Xuepeng, Luo Yifei, Ge Guojing, Chen Haoran, Yi Dong, Wang Jinqiao
- Ml-mamba: Efficient Multi-modal Large Language Model Utilizing Mamba-2 Huang Wenjun, Pan Jiakai, Tang Jiahao, Ding Yanyu, Xing Yifei, Wang Yuhe, Wang Zhengzhuo, Hu Jianguo
- A Comprehensive Survey On Evaluating Large Language Model Applications In The Medical Industry Huang Yining, Tang Keke, Chen Meilian, Wang Boyuan
- Transformerfam: Feedback Attention Is Working Memory Hwang Dongseong, Wang Weiran, Huo Zhuoyuan, Sim Khe Chai, Mengibar Pedro Moreno
- Test-time Low Rank Adaptation Via Confidence Maximization For Zero-shot Generalization Of Vision-language Models Imam Raza, Gani Hanan, Huzaifa Muhammad, Nandakumar Karthik
- Hallucinations Or Attention Misdirection? The Path To Strategic Value Extraction In Business Using Large Language Models Ioste Aline
- Autocompletion Of Chief Complaints In The Electronic Health Records Using Large Language Models Islam K M Sajjadul, Nipu Ayesha Siddika, Madiraju Praveen, Deshpande Priya
- Response Generation For Cognitive Behavioral Therapy With Large Language Models: Comparative Study With Socratic Questioning Izumi Kenta, Tanaka Hiroki, Shidara Kazuhiro, Adachi Hiroyoshi, Kanayama Daisuke, Kudo Takashi, Nakamura Satoshi
- Jamba-1.5: Hybrid Transformer-mamba Models At Scale Jamba Team, Lenz Barak, Arazi Alan, Bergman Amir, Manevich Avshalom, Peleg Barak, Aviram Ben, Almagor Chen, Fridman Clara, Padnos Dan, Gissin Daniel, Jannai Daniel, Muhlgay Dor, Zimberg Dor, Gerber Edden M, Dolev Elad, Krakovsky Eran, Safahi Erez, Schwartz Erez, Cohen Gal, Shachaf Gal, Rozenblum Haim, Bata Hofit, Blass Ido, Magar Inbal, Dalmedigos Itay, Osin Jhonathan, Fadlon Julie, Rozman Maria, Danos Matan, Gokhman Michael, Zusman Mor, Gidron Naama, Ratner Nir, Gat Noam, Rozen Noam, Fried Oded, Leshno Ohad, Antverg Omer, Abend Omri, Lieber Opher, Dagan Or, Cohavi Orit, Alon Raz, Belson Ro'i, Cohen Roi, Gilad Rom, Glozman Roman, Lev Shahar, Meirom Shaked, Delbari Tal, Ness Tal, Asida Tomer, Gal Tom Ben, Braude Tom, Pumerantz Uriya, Cohen Yehoshua, Belinkov Yonatan, Globerson Yuval, Levy Yuval Peleg, Shoham Yoav
- Are Human Conversations Special? A Large Language Model Perspective Jawale Toshish, Animesh Chaitanya, Vallath Sekhar, Talamadupula Kartik, Heck Larry
- Precision Empowers, Excess Distracts: Visual Question Answering With Dynamically Infused Knowledge In Language Models Jhalani Manas, M Annervaz K, Bhattacharyya Pushpak
- Linking In-context Learning In Transformers To Human Episodic Memory Ji-an Li, Zhou Corey Y., Benna Marcus K., Mattar Marcelo G.
- Heuristic-enhanced Candidates Selection Strategy For Gpts Tackle Few-shot Aspect-based Sentiment Analysis Jiang Baoxing, Wan Yujie, Ju Shenggen
- Time Matters: Enhancing Pre-trained News Recommendation Models With Robust User Dwell Time Injection Jiang Hao, Li Chuanzhen, An Mingxiao
- Do Llms Dream Of Elephants (when Told Not To)? Latent Concept Association And Associative Memory In Transformers Jiang Yibo, Rajendran Goutham, Ravikumar Pradeep, Aragam Bryon
- LLM Maybe Longlm: Self-extend LLM Context Window Without Tuning Jin Hongye, Han Xiaotian, Yang Jingfeng, Jiang Zhimeng, Liu Zirui, Chang Chia-yuan, Chen Huiyuan, Hu Xia
- Hidden Flaws Behind Expert-level Accuracy Of Multimodal GPT-4 Vision In Medicine Jin Qiao, Chen Fangyuan, Zhou Yiliang, Xu Ziyang, Cheung Justin M., Chen Robert, Summers Ronald M., Rousseau Justin F., Ni Peiyun, Landsman Marc J, Baxter Sally L., Al'aref Subhi J., Li Yijia, Chen Alex, Brejt Josef A., Chiang Michael F., Peng Yifan, Lu Zhiyong
- Read To Play (r2-play): Decision Transformer With Multimodal Game Instruction Jin Yonggang, Zhang Ge, Zhao Hao, Zheng Tianyu, Guo Jarvi, Xiang Liuyu, Yue Shawn, Huang Stephen W., He Zhaofeng, Fu Jie
- A2SF: Accumulative Attention Scoring With Forgetting Factor For Token Pruning In Transformer Decoder Jo Hyun-rae, Shin Dongkun
- Pmoe: Progressive Mixture Of Experts With Asymmetric Transformer For Continual Learning Jung Min Jae, Kim Joohee
- Understanding The Effect Of Using Semantically Meaningful Tokens For Visual Representation Learning Kalibhat Neha, Kattakinda Priyatham, Zarei Arman, Seleznev Nikita, Sharpe Samuel, Kumar Senthil, Feizi Soheil
- C-RAG: Certified Generation Risks For Retrieval-augmented Language Models Kang Mintong, Gürel Nezihe Merve, Yu Ning, Song Dawn, Li Bo
- From Text To Transformation: A Comprehensive Review Of Large Language Models' Versatility Kaur Pravneet, Kashyap Gautam Siddharth, Kumar Ankit, Nafis Md Tabrez, Kumar Sandeep, Shokeen Vikrant
- Defending Large Language Models Against Attacks With Residual Stream Activation Analysis Kawasaki Amelia, Davis Andrew, Abbas Houssam
- Mhgpt: A Lightweight Generative Pre-trained Transformer For Mental Health Text Analysis Kim Dae-young, Hwa Rebecca, Rahman Muhammad Mahbubur
- On Initializing Transformers With Pre-trained Embeddings Kim Ha Young, Balasubramanian Niranjan, Kang Byungkon
- Llmem: Estimating GPU Memory Usage For Fine-tuning Pre-trained Llms Kim Taeho, Wang Yanming, Chaturvedi Vatshank, Gupta Lokesh, Kim Seyeon, Kwon Yongin, Ha Sangtae
- Extended Mind Transformers Klett Phoebe, Ahle Thomas
- Transllama: Llm-based Simultaneous Translation System Koshkin Roman, Sudoh Katsuhito, Nakamura Satoshi
- Llms Are Zero-shot Context-aware Simultaneous Translators Koshkin Roman, Sudoh Katsuhito, Nakamura Satoshi
- Arabiangpt: Native Arabic Gpt-based Large Language Model Koubaa Anis, Ammar Adel, Ghouti Lahouari, Najar Omar, Sibaee Serry
- Babilong: Testing The Limits Of Llms With Long Context Reasoning-in-a-haystack Kuratov Yuri, Bulatov Aydar, Anokhin Petr, Rodkin Ivan, Sorokin Dmitry, Sorokin Artyom, Burtsev Mikhail
- Enhancing Multi-domain Automatic Short Answer Grading Through An Explainable Neuro-symbolic Pipeline Künnecke Felix, Filighera Anna, Leong Colin, Steuer Tim
- Residual-based Language Models Are Free Boosters For Biomedical Imaging Lai Zhixin, Wu Jing, Chen Suiyao, Zhou Yucheng, Hovakimyan Naira
- Lapdoc: Layout-aware Prompting For Documents Lamott Marcel, Weweler Yves-noel, Ulges Adrian, Shafait Faisal, Krechel Dirk, Obradovic Darko
- Large Language Models As Evolution Strategies Lange Robert Tjarko, Tian Yingtao, Tang Yujin
- Enhancing Neural Machine Translation Of Low-resource Languages: Corpus Development, Human Evaluation And Explainable AI Architectures Lankford Séamus
- Human Evaluation Of English-Irish Transformer-based NMT Lankford Séamus, Afli Haithem, Way Andy
- Adaptnmt: An Open-source, Language-agnostic Development Environment For Neural Machine Translation Lankford Séamus, Afli Haithem, Way Andy
- Transformers For Low-resource Languages: Is Féidir Linn! Lankford Séamus, Afli Haithem, Way Andy
- Melting Point: Mobile Evaluation Of Language Transformers Laskaridis Stefanos, Katevas Kleomenis, Minto Lorenzo, Haddadi Hamed
- What Matters When Building Vision-language Models? Laurençon Hugo, Tronchon Léo, Cord Matthieu, Sanh Victor
- MATTER: Memory-augmented Transformer Using Heterogeneous Knowledge Sources Lee Dongkyu, Prakash Chandana Satya, Fitzgerald Jack, Lehmann Jens
- Hip Attention: Sparse Sub-quadratic Attention With Hierarchical Attention Pruning Lee Heejun, Park Geon, Lee Youngwan, Kim Jina, Jeong Wonyoung, Jeon Myeongjae, Hwang Sung Ju
- LRQ: Optimizing Post-training Quantization For Large Language Models By Learning Low-rank Weight-scaling Matrices Lee Jung Hyun, Kim Jeonghoon, Yang June Yong, Kwon Se Jung, Yang Eunho, Yoo Kang Min, Lee Dongsoo
- Infinigen: Efficient Generative Inference Of Large Language Models With Dynamic KV Cache Management Lee Wonbeom, Lee Jungi, Seo Junghwan, Sim Jaewoong
- Towards Understanding Counseling Conversations: Domain Knowledge And Large Language Models Lee Younghun, Goldwasser Dan, Reese Laura Schwab
- Small-e: Small Language Model With Linear Attention For Efficient Speech Synthesis Lemerle Théodor, Obin Nicolas, Roebel Axel
- Look Within, Why Llms Hallucinate: A Causal Perspective Li He, Chi Haoang, Liu Mingyu, Yang Wenjing
- A Transformer With Stack Attention Li Jiaoda, White Jennifer C., Sachan Mrinmaya, Cotterell Ryan
- Measuring And Controlling Instruction (in)stability In Language Model Dialogs Li Kenneth, Liu Tianle, Bashkansky Naomi, Bau David, Viégas Fernanda, Pfister Hanspeter, Wattenberg Martin
- On The Power Of Convolution Augmented Transformer Li Mingchen, Zhang Xuechen, Huang Yixiao, Oymak Samet
- FSMR: A Feature Swapping Multi-modal Reasoning Approach With Joint Textual And Visual Clues Li Shuang, Wang Jiahua, Wen Lijie
- Uio-llms: Unbiased Incremental Optimization For Long-context Llms Li Wenhao, Lin Mingbao, Zhong Yunshan, Yan Shuicheng, Ji Rongrong
- What If We Recaption Billions Of Web Images With Llama-3? Li Xianhang, Tu Haoqin, Hui Mude, Wang Zeyu, Zhao Bingchen, Xiao Junfei, Ren Sucheng, Mei Jieru, Liu Qing, Zheng Huangjie, Zhou Yuyin, Xie Cihang
- Calrec: Contrastive Alignment Of Generative Llms For Sequential Recommendation Li Yaoyiran, Zhai Xiang, Alzantot Moustafa, Yu Keyi, Vulić Ivan, Korhonen Anna, Hammad Mohamed
- Spin Glass Model Of In-context Learning Li Yuhao, Bai Ruoran, Huang Haiping
- Llms Meet Long Video: Advancing Long Video Question Answering With An Interactive Visual Adapter In Llms Li Yunxin, Chen Xinyu, Hu Baotian, Zhang Min
- Text-to-model: Text-conditioned Neural Network Diffusion For Train-once-for-all Personalization Li Zexi, Gao Lingzhi, Wu Chao
- Understanding And Patching Compositional Reasoning In Llms Li Zhaoyi, Jiang Gangwei, Xie Hong, Song Linqi, Lian Defu, Wei Ying
- Do Vision-language Transformers Exhibit Visual Commonsense? An Empirical Study Of VCR Li Zhenyang, Guo Yangyang, Wang Kejie, Chen Xiaolin, Nie Liqiang, Kankanhalli Mohan
- Focusllm: Scaling Llm's Context By Parallel Decoding Li Zhenyu, Zhang Yike, Pan Tengyu, Sun Yutao, Duan Zhichao, Fang Junjie, Han Rong, Wang Zixuan, Wang Jianyong
- Hunyuan-dit: A Powerful Multi-resolution Diffusion Transformer With Fine-grained Chinese Understanding Li Zhimin, Zhang Jianwei, Lin Qin, Xiong Jiangfeng, Long Yanxin, Deng Xinchi, Zhang Yingfang, Liu Xingchao, Huang Minbin, Xiao Zedong, Chen Dayou, He Jiajun, Li Jiahao, Li Wenyue, Zhang Chen, Quan Rongwei, Lu Jianxiang, Huang Jiabin, Yuan Xiaoyan, Zheng Xiaoxiao, Li Yixuan, Zhang Jihong, Zhang Chao, Chen Meng, Liu Jie, Fang Zheng, Wang Weiyan, Xue Jinbao, Tao Yangyu, Zhu Jianchen, Liu Kai, Lin Sihuan, Sun Yifu, Li Yun, Wang Dongdong, Chen Mingtao, Hu Zhichao, Xiao Xiao, Chen Yan, Liu Yuhong, Liu Wei, Wang Di, Yang Yong, Jiang Jie, Lu Qinglin
- Mitigating Hallucination In Visual-language Models Via Re-balancing Contrastive Decoding Liang Xiaoyu, Yu Jiayuan, Mu Lianrui, Zhuang Jiedong, Hu Jiaqi, Yang Yuchen, Ye Jiangnan, Lu Lu, Chen Jian, Hu Haoji
- Beyond KV Caching: Shared Attention For Efficient Llms Liao Bingli, Vargas Danilo Vasconcellos
- Tree-based Hard Attention With Self-motivation For Large Language Models Lin Chenxi, Ren Jiayu, He Guoxiu, Jiang Zhuoren, Yu Haiyan, Zhu Xiaomin
- Local And Global Contexts For Conversation Lin Zuoquan, Shen Xinyi
- Unveiling Vulnerability Of Self-attention Liong Khai Jiet, Wu Hongqiu, Zhao Hai
- Lumina-mgpt: Illuminate Flexible Photorealistic Text-to-image Generation With Multimodal Generative Pretraining Liu Dongyang, Zhao Shitian, Zhuo Le, Lin Weifeng, Qiao Yu, Li Hongsheng, Gao Peng
- Mmgrec: Multimodal Generative Recommendation With Transformer Model Liu Han, Wei Yinwei, Song Xuemeng, Guan Weili, Li Yuan-fang, Nie Liqiang
- Memlong: Memory-augmented Retrieval For Long Text Modeling Liu Weijie, Tang Zecheng, Li Juntao, Chen Kehai, Zhang Min
- Sparser Is Faster And Less Is More: Efficient Sparse Attention For Long-range Transformers Lou Chao, Jia Zixia, Zheng Zilong, Tu Kewei
- Efficient Encoder-decoder Transformer Decoding For Decomposable Tasks Lu Bo-ru, Haduong Nikita, Lin Chien-yu, Cheng Hao, Smith Noah A., Ostendorf Mari
- In-context Time Series Predictor Lu Jiecheng, Sun Yan, Yang Shihao
- Ovis: Structural Embedding Alignment For Multimodal Large Language Model Lu Shiyin, Li Yang, Chen Qing-guo, Xu Zhao, Luo Weihua, Zhang Kaifu, Ye Han-jia
- Insights Into LLM Long-context Failures: When Transformers Know But Don't Tell Lu Taiming, Gao Muhan, Yu Kuai, Byerly Adam, Khashabi Daniel
- Asymptotic Theory Of In-context Learning By Linear Attention Lu Yue M., Letey Mary I., Zavatone-veth Jacob A., Maiti Anindita, Pehlevan Cengiz
- Can Perplexity Predict Fine-tuning Performance? An Investigation Of Tokenization Effects On Sequential Language Models For Nepali Luitel Nishant, Bekoju Nirajan, Sah Anand Kumar, Shakya Subarna
- Mini-sequence Transformer: Optimizing Intermediate Memory For Long Sequences Training Luo Cheng, Zhao Jiawei, Chen Zhuoming, Chen Beidi, Anandkumar Anima
- Learning Positional Attention For Sequential Recommendation Luo Fan, Zhang Juan, Xu Shenghui
- From Understanding To Utilization: A Survey On Explainability For Large Language Models Luo Haoyan, Specia Lucia
- Taking A Deep Breath: Enhancing Language Modeling Of Large Language Models With Sentinel Tokens Luo Weiyao, Zheng Suncong, Xia Heming, Wang Weikang, Lei Yan, Liu Tianyu, Chen Shuang, Sui Zhifang
- Zero-shot Cross-domain Dialogue State Tracking Via Dual Low-rank Adaptation Luo Xiang, Tang Zhiwen, Wang Jin, Zhang Xuejie
- TRAWL: Tensor Reduced And Approximated Weights For Large Language Models Luo Yiran, Patel Het, Fu Yu, Ahn Dawon, Chen Jia, Dong Yue, Papalexakis Evangelos E.
- Interpreting Key Mechanisms Of Factual Recall In Transformer-based Language Models Lv Ang, Chen Yuhan, Zhang Kaiyi, Wang Yulong, Liu Lifeng, Wen Ji-rong, Xie Jian, Yan Rui
- EE-MLLM: A Data-efficient And Compute-efficient Multimodal Large Language Model Ma Feipeng, Zhou Yizhou, Li Hebei, He Zilong, Wu Siying, Rao Fengyun, Zhang Yueyi, Sun Xiaoyan
- FBI-LLM: Scaling Up Fully Binarized Llms From Scratch Via Autoregressive Distillation Ma Liqun, Sun Mingjie, Shen Zhiqiang
- The Era Of 1-bit Llms: All Large Language Models Are In 1.58 Bits Ma Shuming, Wang Hongyu, Ma Lingxiao, Wang Lei, Wang Wenhui, Huang Shaohan, Dong Li, Wang Ruiping, Xue Jilong, Wei Furu
- Megalodon: Efficient LLM Pretraining And Inference With Unlimited Context Length Ma Xuezhe, Yang Xiaomeng, Xiong Wenhan, Chen Beidi, Yu Lili, Zhang Hao, May Jonathan, Zettlemoyer Luke, Levy Omer, Zhou Chunting
- ULLME: A Unified Framework For Large Language Model Embeddings With Generation-augmented Learning Man Hieu, Ngo Nghia Trung, Dernoncourt Franck, Nguyen Thien Huu
- Can Large Language Models Automatically Score Proficiency Of Written Essays? Mansour Watheq, Albatarni Salam, Eltanbouly Sohaila, Elsayed Tamer
- Visually Guided Generative Text-layout Pre-training For Document Intelligence Mao Zhiming, Bai Haoli, Hou Lu, Wei Jiansheng, Jiang Xin, Liu Qun, Wong Kam-fai
- Set-based Prompting: Provably Solving The Language Model Order Dependency Problem Mcilroy-young Reid, Brown Katrina, Olson Conlan, Zhang Linjun, Dwork Cynthia
- Metacheckgpt – A Multi-task Hallucination Detector Using LLM Uncertainty And Meta-models Mehta Rahul, Hoblitzell Andrew, O'keefe Jack, Jang Hyeju, Varma Vasudeva
- Openelm: An Efficient Language Model Family With Open Training And Inference Framework Mehta Sachin, Sekhavat Mohammad Hossein, Cao Qingqing, Horton Maxwell, Jin Yanzi, Sun Chenfan, Mirzadeh Iman, Najibi Mahyar, Belenko Dmitry, Zatloukal Peter, Rastegari Mohammad
- Deepstack: Deeply Stacking Visual Tokens Is Surprisingly Simple And Effective For Lmms Meng Lingchen, Yang Jianwei, Tian Rui, Dai Xiyang, Wu Zuxuan, Gao Jianfeng, Jiang Yu-gang
- A Survey Of Resource-efficient LLM And Multimodal Foundation Models Mengwei Xu, Wangsong Yin, Dongqi Cai, Rongjie Yi, Daliang Xu, Qipeng Wang, Bingyang Wu, Yihao Zhao, Chen Yang, Shihe Wang, Qiyang Zhang, Zhenyan Lu, Li Zhang, Shangguang Wang, Yuanchun Li, Yunxin Liu, Xin Jin, Xuanzhe Liu
- Linearizing Large Language Models Mercat Jean, Vasiljevic Igor, Keh Sedrick, Arora Kushal, Dave Achal, Gaidon Adrien, Kollar Thomas
- Superposition Prompting: Improving And Accelerating Retrieval-augmented Generation Merth Thomas, Fu Qichen, Rastegari Mohammad, Najibi Mahyar
- Revenge Of The Fallen? Recurrent Models Match Transformers At Predicting Human Language Comprehension Metrics Michaelov James A., Arnett Catherine, Bergen Benjamin K.
- Semantics Of Multiword Expressions In Transformer-based Models: A Survey Miletić Filip, Walde Sabine Schulte Im
- LSPT: Long-term Spatial Prompt Tuning For Visual Representation Learning Mo Shentong, Wang Yansen, Luo Xufang, Li Dongsheng
- Large Language Model (LLM) AI Text Generation Detection Based On Transformer Deep Learning Algorithm Mo Yuhong, Qin Hao, Dong Yushan, Zhu Ziyi, Li Zhenglin
- Towards Generating Informative Textual Description For Neurons In Language Models Mondal Shrayani, Garodia Rishabh, Qureshi Arbaaz, Lee Taesung, Park Youngja
- Xc-cache: Cross-attending To Cached Context For Efficient LLM Inference Monteiro João, Marcotte Étienne, Noël Pierre-andré, Zantedeschi Valentina, Vázquez David, Chapados Nicolas, Pal Christopher, Taslakian Perouz
- Cross-layer Attention Sharing For Large Language Models Mu Yongyu, Wu Yuzhang, Fan Yuchun, Wang Chenglong, Li Hengyu, He Qiaozhi, Yang Murun, Xiao Tong, Zhu Jingbo
- Leave No Context Behind: Efficient Infinite Context Transformers With Infini-attention Munkhdalai Tsendsuren, Faruqui Manaal, Gopal Siddharth
- Segment-based Interactive Machine Translation For Pre-trained Models Navarro Angel, Casacuberta Francisco
- Dynamic Memory Compression: Retrofitting Llms For Accelerated Inference Nawrot Piotr, Łańcucki Adrian, Chochowski Marcin, Tarjan David, Ponti Edoardo M.
- Improving Robustness Of Llm-based Speech Synthesis By Learning Monotonic Alignment Neekhara Paarth, Hussain Shehzeen, Ghosh Subhankar, Li Jason, Valle Rafael, Badlani Rohan, Ginsburg Boris
- Needle In The Haystack For Memory Based Large Language Models Nelson Elliot, Kollias Georgios, Das Payel, Chaudhury Subhajit, Dan Soham
- Interpreting Context Look-ups In Transformers: Investigating Attention-mlp Interactions Neo Clement, Cohen Shay B., Barez Fazl
- Gpts And Language Barrier: A Cross-lingual Legal QA Examination Nguyen Ha-thanh, Yamada Hiroaki, Satoh Ken
- Advancing Vietnamese Visual Question Answering With Transformer And Convolutional Integration Nguyen Ngoc Son, Nguyen Van Son, Le Tung
- Layer-wise Regularized Dropout For Neural Language Models Ni Shiwen, Yang Min, Xu Ruifeng, Li Chengming, Hu Xiping
- How Reliable AI Chatbots Are For Disease Prediction From Patient Complaints? Nipu Ayesha Siddika, Islam K M Sajjadul, Madiraju Praveen
- Jamba: A Hybrid Transformer-mamba Language Model Opher Lieber, Barak Lenz, Hofit Bata, Gal Cohen, Jhonathan Osin, Itay Dalmedigos, Erez Safahi, Shaked Meirom, Yonatan Belinkov, Shai Shalev-shwartz, Omri Abend, Raz Alon, Tomer Asida, Amir Bergman, Roman Glozman, Michael Gokhman, Avashalom Manevich, Nir Ratner, Noam Rozen, Erez Shwartz, Mor Zusman, Yoav Shoham
- Transformers Are Multi-state Rnns Oren Matanel, Hassid Michael, Yarden Nir, Adi Yossi, Schwartz Roy
- A Systematic Review Of Data-to-text NLG Osuji Chinonso Cynthia, Ferreira Thiago Castro, Davis Brian
- The Accuracy Of Domain Specific And Descriptive Analysis Generated By Large Language Models Otieno Denish Omondi, Abri Faranak, Siami-namini Sima, Namin Akbar Siami
- The Solution For The AIGC Inference Performance Optimization Competition Pan Sishun, Xu Haonan, Wan Zhonghua, Yang Yang
- Preparing Lessons For Progressive Training On Language Models Pan Yu, Yuan Ye, Yin Yichun, Shi Jiaxin, Xu Zenglin, Zhang Ming, Shang Lifeng, Jiang Xin, Liu Qun
- Llmlingua-2: Data Distillation For Efficient And Faithful Task-agnostic Prompt Compression Pan Zhuoshi, Wu Qianhui, Jiang Huiqiang, Xia Menglin, Luo Xufang, Zhang Jue, Lin Qingwei, Rühle Victor, Yang Yuqing, Lin Chin-yew, Zhao H. Vicky, Qiu Lili, Zhang Dongmei
- AILS-NTUA At Semeval-2024 Task 9: Cracking Brain Teasers: Transformer Models For Lateral Thinking Puzzles Panagiotopoulos Ioannis, Filandrianos Giorgos, Lymperaiou Maria, Stamou Giorgos
- Anchor-based Large Language Models Pang Jianhui, Ye Fanghua, Wong Derek Fai, He Xin, Chen Wanshun, Wang Longyue
- Shaking Up Vlms: Comparing Transformers And Structured State Space Models For Vision & Language Modeling Pantazopoulos Georgios, Nikandrou Malvina, Suglia Alessandro, Lemon Oliver, Eshghi Arash
- Token-picker: Accelerating Attention In Text Generation With Minimized Memory Transfer Via Probability Estimation Park Junyoung, Kang Myeonggu, Han Yunki, Kim Yanggon, Shin Jaekang, Kim Lee-sup
- Pre-trained Vision And Language Transformers Are Few-shot Incremental Learners Park Keon-hee, Song Kyungwoo, Park Gyeong-moon
- Automated Multi-language To English Machine Translation Using Generative Pre-trained Transformers Pelofske Elijah, Urias Vincent, Liebrock Lorie M.
- Prompting A Pretrained Transformer Can Be A Universal Approximator Petrov Aleksandar, Torr Philip H. S., Bibi Adel
- Benchmarking GPT-4 On Algorithmic Problems: A Systematic Evaluation Of Prompting Strategies Petruzzellis Flavio, Testolin Alberto, Sperduti Alessandro
- Investigating The Effectiveness Of Hypertuning Via Gisting Phang Jason
- Mamba-ptq: Outlier Channels In Recurrent Large Language Models Pierro Alessandro, Abreu Steven
- How Effective Are State Space Models For Machine Translation? Pitorro Hugo, Vasylenko Pavlo, Treviso Marcos, Martins André F. T.
- LM-PUB-QUIZ: A Comprehensive Framework For Zero-shot Evaluation Of Relational Knowledge In Language Models Ploner Max, Wiland Jacek, Pohl Sebastian, Akbik Alan
- Evaluation Of Language Models In The Medical Context Under Resource-constrained Settings Posada Andrea, Rueckert Daniel, Meissen Felix, Müller Philip
- Practical Token Pruning For Foundation Models In Few-shot Conversational Virtual Assistant Systems Qi Haode, Qian Cheng, Ni Jian, Singh Pratyush, Fazeli Reza, Wang Gengyu, Shu Zhongzheng, Wayne Eric, Bross Juergen
- Is Next Token Prediction Sufficient For GPT? Exploration On Code Logic Comprehension Qi Mengnan, Huang Yufan, Yao Yongqiang, Wang Maoquan, Gu Bin, Sundaresan Neel
- Vl-mamba: Exploring State Space Models For Multimodal Learning Qiao Yanyuan, Yu Zheng, Guo Longteng, Chen Sihan, Zhao Zijia, Sun Mingzhen, Wu Qi, Liu Jing
- The Uniqueness Of Llama3-70b With Per-channel Quantization: An Empirical Study Qin Minghai
- Lightning Attention-2: A Free Lunch For Handling Unlimited Sequence Lengths In Large Language Models Qin Zhen, Sun Weigao, Li Dong, Shen Xuyang, Sun Weixuan, Zhong Yiran
- Various Lengths, Constant Speed: Efficient Language Modeling With Lightning Attention Qin Zhen, Sun Weigao, Li Dong, Shen Xuyang, Sun Weixuan, Zhong Yiran
- Dissecting Multiplication In Transformers: Insights Into Llms Qiu Luyu, Li Jianing, Su Chi, Zhang Chen Jason, Chen Lei
- Mupt: A Generative Symbolic Music Pretrained Transformer Qu Xingwei, Bai Yuelin, Ma Yinghao, Zhou Ziya, Lo Ka Man, Liu Jiaheng, Yuan Ruibin, Min Lejun, Liu Xueling, Zhang Tianyu, Du Xinrun, Guo Shuyue, Liang Yiming, Li Yizhi, Wu Shangda, Zhou Junting, Zheng Tianyu, Ma Ziyang, Han Fengze, Xue Wei, Xia Gus, Benetos Emmanouil, Yue Xiang, Lin Chenghua, Tan Xu, Huang Stephen W., Fu Jie, Zhang Ge
- Memory-augmented Generative Adversarial Transformers Raaijmakers Stephan, Bakker Roos, Cremers Anita, De Kleijn Roy, Kouwenhoven Tom, Verhoef Tessa
- Scavenging Hyena: Distilling Transformers Into Long Convolution Models Ralambomihanta Tokiniaina Raharison, Mohammadzadeh Shahrad, Islam Mohammad Sami Nur, Jabbour Wassim, Liang Laurence
- Narrow Transformer: Starcoder-based Java-lm For Desktop Rathinasamy Kamalkumar, J Balaji A, Kumar Ankush, Gayari Gagan, K Harshini, Mondal Rajab Ali, S Sreenivasa Raghavan K, Singh Swayam, Tarafdar Mohammed Rafee
- Optimizing Visual Question Answering Models For Driving: Bridging The Gap Between Human And Machine Attention Patterns Rekanar Kaavya, Hayes Martin, Sistu Ganesh, Eising Ciaran
- Identifying Semantic Induction Heads To Understand In-context Learning Ren Jie, Guo Qipeng, Yan Hang, Liu Dongrui, Zhang Quanshi, Qiu Xipeng, Lin Dahua
- Unveiling And Mitigating Memorization In Text-to-image Diffusion Models Through Cross Attention Ren Jie, Li Yaxin, Zeng Shenglai, Xu Han, Lyu Lingjuan, Xing Yue, Tang Jiliang
- Samba: Simple Hybrid State Space Models For Efficient Unlimited Context Language Modeling Ren Liliang, Liu Yang, Lu Yadong, Shen Yelong, Liang Chen, Chen Weizhu
- Self-generated Replay Memories For Continual Neural Machine Translation Resta Michele, Bacciu Davide
- Practical And Reproducible Symbolic Music Generation By Large Language Models With Structural Embeddings Rhyu Seungyeon, Yang Kichang, Cho Sungjun, Kim Jaehyeon, Lee Kyogu, Lee Moontae
- Simulating Weighted Automata Over Sequences And Trees With Transformers Rizvi Michael, Lizaire Maude, Lacroce Clara, Rabusseau Guillaume
- Pretrained Hybrids With MAD Skills Roberts Nicholas, Guo Samuel, Gao Zhiqi, Gnvv Satya Sai Srinath Namburi, Cromp Sonia, Wu Chengjun, Duan Chengyu, Sala Frederic
- Exploring Advanced Large Language Models With Llmsuite Roffo Giorgio
- Unibucllm: Harnessing Llms For Automated Prediction Of Item Difficulty And Response Time For Multiple-choice Questions Rogoz Ana-cristina, Ionescu Radu Tudor
- Convolutional Prompting Meets Language Models For Continual Learning Roy Anurag, Moulick Riddhiman, Verma Vinay K., Ghosh Saptarshi, Das Abir
- From PEFT To DEFT: Parameter Efficient Finetuning For Reducing Activation Density In Transformers Runwal Bharat, Pedapati Tejaswini, Chen Pin-yu
- Writing In The Margins: Better Inference Pattern For Long Context Retrieval Russak Melisa, Jamil Umar, Bryant Christopher, Kamble Kiran, Magnuson Axel, Russak Mateusz, Alshikh Waseem
- Tandem Transformers For Inference Efficient Llms S Aishwarya P, Nair Pranav Ajit, Samaga Yashas, Boyd Toby, Kumar Sanjiv, Jain Prateek, Netrapalli Praneeth
- Benchmarking And Building Long-context Retrieval Models With Loco And M2-BERT Saad-falcon Jon, Fu Daniel Y., Arora Simran, Guha Neel, Ré Christopher
- Algorithmic Language Models With Neurally Compiled Libraries Saldyt Lucas, Kambhampati Subbarao
- Zero-shot RTL Code Generation With Attention Sink Augmented Large Language Models Sandal Selim, Akturk Ismail
- Lean Attention: Hardware-aware Scalable Attention Mechanism For The Decode-phase Of Transformers Sanovar Rya, Bharadwaj Srikant, Amant Renee St., Rühle Victor, Rajmohan Saravan
- Advancing Generative AI For Portuguese With Open Decoder Gervásio PT* Santos Rodrigo, Silva João, Gomes Luís, Rodrigues João, Branco António
- Pre-training Small Base Lms With Fewer Tokens Sanyal Sunny, Sanghavi Sujay, Dimakis Alexandros G.
- Detecting Generated Native Ads In Conversational Search Schmidt Sebastian, Zelch Ines, Bevendorff Janek, Stein Benno, Hagen Matthias, Potthast Martin
- Surprising Efficacy Of Fine-tuned Transformers For Fact-checking Over Larger Language Models Setty Vinay
- Tencdm: Understanding The Properties Of Diffusion Model In The Space Of Language Model Encodings Shabalin Alexander, Meshchaninov Viacheslav, Chimbulatov Egor, Lapikov Vladislav, Kim Roman, Bartosh Grigory, Molchanov Dmitry, Markov Sergey, Vetrov Dmitry
- Flashattention-3: Fast And Accurate Attention With Asynchrony And Low-precision Shah Jay, Bikshandi Ganesh, Zhang Ying, Thakkar Vijay, Ramani Pradeep, Dao Tri
- Evaluating Text Summaries Generated By Large Language Models Using Openai's GPT Shakil Hassan, Mahi Atqiya Munawara, Nguyen Phuoc, Ortiz Zeydy, Mardini Mamoun T.
- Llava-prumerge: Adaptive Token Reduction For Efficient Large Multimodal Models Shang Yuzhang, Cai Mu, Xu Bingxin, Lee Yong Jae, Yan Yan
- An Empirical Study On Context Length For Open-domain Dialog Generation Shen Xinyi, Lin Zuoquan
- Scaling Laws For Linear Complexity Language Models Shen Xuyang, Li Dong, Leng Ruitao, Qin Zhen, Sun Weigao, Zhong Yiran
- Keep The Cost Down: A Review On Methods To Optimize LLM's KV-cache Consumption Shi Luohe, Zhang Hongyi, Yao Yao, Li Zuchao, Zhao Hai
- Action Controlled Paraphrasing Shi Ning, Wu Zijun
- Why Larger Language Models Do In-context Learning Differently? Shi Zhenmei, Wei Junyi, Xu Zhuoyan, Liang Yingyu
- The Mysterious Case Of Neuron 1512: Injectable Realignment Architectures Reveal Internal Characteristics Of Meta's Llama 2 Model Smith Brenden, Baker Dallin, Chase Clayton, Barney Myles, Parker Kaden, Allred Makenna, Hu Peter, Evans Alex, Fulda Nancy
- Moviechat+: Question-aware Sparse Memory For Long Video Question Answering Song Enxin, Chai Wenhao, Ye Tian, Hwang Jenq-neng, Li Xi, Wang Gaoang
- Out-of-distribution Generalization Via Composition: A Lens Through Induction Heads In Transformers Song Jiajun, Xu Zhuoyan, Zhong Yiqiao
- Moma: Multimodal LLM Adapter For Fast Personalized Image Generation Song Kunpeng, Zhu Yizhe, Liu Bingchen, Yan Qing, Elgammal Ahmed, Yang Xiao
- Hierarchical Context Merging: Better Long Context Understanding For Pre-trained Llms Song Woomin, Oh Seunghyuk, Mo Sangwoo, Kim Jaehyung, Yun Sukmin, Ha Jung-woo, Shin Jinwoo
- Position: Leverage Foundational Models For Black-box Optimization Song Xingyou, Tian Yingtao, Lange Robert Tjarko, Lee Chansoo, Tang Yujin, Chen Yutian
- Sharelora: Parameter Efficient And Robust Large Language Model Fine-tuning Via Shared Low-rank Adaptation Song Yurun, Zhao Junchen, Harris Ian G., Jyothi Sangeetha Abdu
- Llm-barber: Block-aware Rebuilder For Sparsity Mask In One-shot For Large Language Models Su Yupeng, Guan Ziyi, Liu Xiaoqun, Jin Tianlai, Wu Dongkuan, Chesi Graziano, Wong Ngai, Yu Hao
- Diversifying Query: Region-guided Transformer For Temporal Sentence Grounding Sun Xiaolong, Shi Liushuai, Wang Le, Zhou Sanping, Xia Kun, Wang Yabing, Hua Gang
- You Only Cache Once: Decoder-decoder Architectures For Language Models Sun Yutao, Dong Li, Zhu Yi, Huang Shaohan, Wang Wenhui, Ma Shuming, Zhang Quanlu, Wang Jianyong, Wei Furu
- Towards Smaller, Faster Decoder-only Transformers: Architectural Variants And Their Implications Suresh Sathya Krishnan, P Shunmugapriya
- Freely Long-thinking Transformer (frailt) Tabak Akbay
- Lloco: Learning Long Contexts Offline Tan Sijun, Li Xiuyu, Patil Shishir, Wu Ziyang, Zhang Tianjun, Keutzer Kurt, Gonzalez Joseph E., Popa Raluca Ada
- DLO: Dynamic Layer Operation For Efficient Vertical Scaling Of Llms Tan Zhen, Dong Daize, Zhao Xinyu, Peng Jie, Cheng Yu, Chen Tianlong
- Can I Understand What I Create? Self-knowledge Evaluation Of Large Language Models Tan Zhiquan, Wei Lai, Wang Jindong, Xie Xing, Huang Weiran
- Language-specific Neurons: The Key To Multilingual Capabilities In Large Language Models Tang Tianyi, Luo Wenyang, Huang Haoyang, Zhang Dongdong, Wang Xiaolei, Zhao Xin, Wei Furu, Wen Ji-rong
- Limits Of Transformer Language Models On Learning To Compose Algorithms Thomm Jonathan, Terzic Aleksandar, Camposampiero Giacomo, Hersche Michael, Schölkopf Bernhard, Rahimi Abbas
- Elsevier Arena: Human Evaluation Of Chemistry/biology/health Foundational Large Language Models Thorne Camilo, Druckenbrodt Christian, Szarkowska Kinga, Goyal Deepika, Marajan Pranita, Somanath Vijay, Harper Corey, Yan Mao, Scerri Tony
- Untie The Knots: An Efficient Data Augmentation Strategy For Long-context Pre-training In Language Models Tian Junfeng, Zheng Da, Cheng Yang, Wang Rui, Zhang Colin, Zhang Debing
- Selective Prompt Anchoring For Code Generation Tian Yuan, Zhang Tianyi
- Understanding Linear Probing Then Fine-tuning Language Models From NTK Perspective Tomihari Akiyoshi, Sato Issei
- Transformers Are Ssms: Generalized Models And Efficient Algorithms Through Structured State Space Duality Tri Dao, Albert Gu
- Discrete Multimodal Transformers With A Pretrained Large Language Model For Mixed-supervision Speech Processing Trinh Viet Anh, Southwell Rosy, Guan Yiwen, He Xinlu, Wang Zhiyong, Whitehill Jacob
- LM Transparency Tool: Interactive Tool For Analyzing Transformer Language Models Tufanov Igor, Hambardzumyan Karen, Ferrando Javier, Voita Elena
- Enhancing Inference Efficiency Of Large Language Models: Investigating Optimization Strategies And Architectural Innovations Tyukin Georgy
- Asking And Answering Questions To Extract Event-argument Structures Uddin Md Nayem, George Enfa Rose, Blanco Eduardo, Corman Steven
- Accelerating Large Language Model Inference With Self-supervised Early Exits Valade Florian
- Gender Bias In Machine Translation And The Era Of Large Language Models Vanmassenhove Eva
- An Empirical Study Of Mamba-based Language Models Waleffe Roger, Byeon Wonmin, Riach Duncan, Norick Brandon, Korthikanti Vijay, Dao Tri, Gu Albert, Hatamizadeh Ali, Singh Sudhakar, Narayanan Deepak, Kulshreshtha Garvit, Singh Vartika, Casper Jared, Kautz Jan, Shoeybi Mohammad, Catanzaro Bryan
- IAA: Inner-adaptor Architecture Empowers Frozen Large Language Model With Multimodal Capabilities Wang Bin, Xie Chunyu, Leng Dawei, Yin Yuhui
- Grokked Transformers Are Implicit Reasoners: A Mechanistic Journey To The Edge Of Generalization Wang Boshi, Yue Xiang, Su Yu, Sun Huan
- Reinforcement Learning For Edit-based Non-autoregressive Neural Machine Translation Wang Hao, Morimura Tetsuro, Honda Ukyo, Kawahara Daisuke
- Target-constrained Bidirectional Planning For Generation Of Target-oriented Proactive Dialogue Wang Jian, Lin Dongding, Li Wenjie
- Length Generalization Of Causal Transformers Without Position Encoding Wang Jie, Ji Tao, Wu Yuanbin, Yan Hang, Gui Tao, Zhang Qi, Huang Xuanjing, Wang Xiaoling
- Mambabyte: Token-free Selective State Space Model Wang Junxiong, Gangavarapu Tushaar, Yan Jing Nathan, Rush Alexander M.
- The Mamba In The Llama: Distilling And Accelerating Hybrid Models Wang Junxiong, Paliotta Daniele, May Avner, Rush Alexander M., Dao Tri
- Lora Meets Dropout Under A Unified Framework Wang Sheng, Chen Liheng, Jiang Jiyue, Xue Boyang, Kong Lingpeng, Wu Chuan
- Resonance Rope: Improving Context Length Generalization Of Large Language Models Wang Suyuchen, Kobyzev Ivan, Lu Peng, Rezagholizadeh Mehdi, Liu Bang
- Validating Llm-generated Programs With Metamorphic Prompt Testing Wang Xiaoyin, Zhu Dakai
- Longllava: Scaling Multi-modal Llms To 1000 Images Efficiently Via Hybrid Architecture Wang Xidong, Song Dingjie, Chen Shunian, Zhang Chen, Wang Benyou
- A Theoretical Understanding Of Self-correction Through In-context Alignment Wang Yifei, Wu Yuyang, Wei Zeming, Jegelka Stefanie, Wang Yisen
- Loma: Lossless Compressed Memory Attention Wang Yumeng, Xiao Zhenyang
- A Comprehensive Survey Of LLM Alignment Techniques: RLHF, RLAIF, PPO, DPO And More Wang Zhichao, Bi Bin, Pentyala Shiva Kumar, Ramnath Kiran, Chaudhuri Sougata, Mehrotra Shubham, Zhu Zixu (James), Mao Xiang-bo, Asur Sitaram, Cheng Na (Claire)
- Omnijarvis: Unified Vision-language-action Tokenization Enables Open-world Instruction Following Agents Wang Zihao, Cai Shaofei, Mu Zhancun, Lin Haowei, Zhang Ceyao, Liu Xuejie, Li Qing, Liu Anji, Ma Xiaojian, Liang Yitao
- C3LLM: Conditional Multimodal Content Generation Using Large Language Models Wang Zixuan, Duan Qinkai, Tai Yu-wing, Tang Chi-keung
- Do Llamas Work In English? On The Latent Language Of Multilingual Transformers Wendler Chris, Veselovsky Veniamin, Monea Giovanni, West Robert
- Towards Knowledge-grounded Natural Language Understanding And Generation Whitehouse Chenxi
- Answer, Assemble, Ace: Understanding How Transformers Answer Multiple Choice Questions Wiegreffe Sarah, Tafjord Oyvind, Belinkov Yonatan, Hajishirzi Hannaneh, Sabharwal Ashish
- Training-free Exponential Extension Of Sliding Window Context With Cascading KV Cache Willette Jeffrey, Lee Heejun, Lee Youngwan, Jeon Myeongjae, Hwang Sung Ju
- Memory Is All You Need: An Overview Of Compute-in-memory Architectures For Accelerating Large Language Model Inference Wolters Christopher, Yang Xiaoxuan, Schlichtmann Ulf, Suzumura Toyotaro
- Llama Pro: Progressive Llama With Block Expansion Wu Chengyue, Gan Yukang, Ge Yixiao, Lu Zeyu, Wang Jiahao, Feng Ye, Shan Ying, Luo Ping
- Parallel Decoding Via Hidden Transfer For Lossless Large Language Model Acceleration Wu Pengfei, Liu Jiahao, Gong Zhuocheng, Wang Qifan, Li Jinpeng, Wang Jingang, Cai Xunliang, Zhao Dongyan
- Motionllm: Multimodal Motion-language Learning With Large Language Models Wu Qi, Zhao Yubo, Wang Yifan, Tai Yu-wing, Tang Chi-keung
- Videollm-mod: Efficient Video-language Streaming With Mixture-of-depths Vision Computation Wu Shiwei, Chen Joya, Lin Kevin Qinghong, Wang Qimeng, Gao Yan, Xu Qianli, Xu Tong, Hu Yao, Chen Enhong, Shou Mike Zheng
- Do Language Models Plan Ahead For Future Tokens? Wu Wilson, Morris John X., Levine Lionel
- Transformer-based Causal Language Models Perform Clustering Wu Xinbo, Varshney Lav R.
- Clover-2: Accurate Inference For Regressive Lightweight Speculative Decoding Xiao Bin, Gui Lujun, Su Lei, Chen Weipeng
- Show-o: One Single Transformer To Unify Multimodal Understanding And Generation Xie Jinheng, Mao Weijia, Bai Zechen, Zhang David Junhao, Wang Weihao, Lin Kevin Qinghong, Gu Yuchao, Chen Zhijie, Yang Zhenheng, Shou Mike Zheng
- RALL-E: Robust Codec Language Modeling With Chain-of-thought Prompting For Text-to-speech Synthesis Xin Detai, Tan Xu, Shen Kai, Ju Zeqian, Yang Dongchao, Wang Yuancheng, Takamichi Shinnosuke, Saruwatari Hiroshi, Liu Shujie, Li Jinyu, Zhao Sheng
- Theoretical Understanding Of In-context Learning In Shallow Transformers With Unstructured Data Xing Yue, Lin Xiaofeng, Xu Chenheng, Suh Namjoon, Song Qifan, Cheng Guang
- Empowering 1000 Tokens/second On-device LLM Prefilling With Mllm-npu Xu Daliang, Zhang Hao, Yang Liming, Liu Ruiqi, Huang Gang, Xu Mengwei, Liu Xuanzhe
- Vtensor: Flexible Virtual Tensor Management For Efficient LLM Serving Xu Jiale, Zhang Rui, Guo Cong, Hu Weiming, Liu Zihan, Wu Feiyang, Feng Yu, Sun Shixuan, Shao Changxu, Guo Yuhong, Zhao Junping, Zhang Ke, Guo Minyi, Leng Jingwen
- BESA: Pruning Large Language Models With Blockwise Parameter-efficient Sparsity Allocation Xu Peng, Shao Wenqi, Chen Mengzhao, Tang Shitao, Zhang Kaipeng, Gao Peng, An Fengwei, Qiao Yu, Luo Ping
- Think: Thinner Key Cache By Query-driven Pruning Xu Yuhui, Jie Zhanming, Dong Hanze, Wang Lei, Lu Xudong, Zhou Aojun, Saha Amrita, Xiong Caiming, Sahoo Doyen
- Longvila: Scaling Long-context Visual Language Models For Long Videos Xue Fuzhao, Chen Yukang, Li Dacheng, Hu Qinghao, Zhu Ligeng, Li Xiuyu, Fang Yunhao, Tang Haotian, Yang Shang, Liu Zhijian, He Ethan, Yin Hongxu, Molchanov Pavlo, Kautz Jan, Fan Linxi, Zhu Yuke, Lu Yao, Han Song
- Equipping Transformer With Random-access Reading For Long-context Understanding Yang Chenghao, Yang Zi, Hua Nan
- 3D Vision And Language Pretraining With Large-scale Synthetic Data Yang Dejie, Xu Zhu, Mo Wentao, Chen Qingchao, Huang Siyuan, Liu Yang
- Asynchronous And Segmented Bidirectional Encoding For NMT Yang Jingpu, Han Zehua, Xiang Mengyu, Wang Helin, Huang Yuxiao, Fang Miao
- GPT-4 As Evaluator: Evaluating Large Language Models On Pest Management In Agriculture Yang Shanglong, Yuan Zhipeng, Li Shunbao, Peng Ruoling, Liu Kang, Yang Po
- Parallelizing Linear Transformers With The Delta Rule Over Sequence Length Yang Songlin, Wang Bailin, Zhang Yu, Shen Yikang, Kim Yoon
- Pre-trained Language Models Improve The Few-shot Prompt Ability Of Decision Transformer Yang Yu, Xu Pan
- Adversarial Attacks And Defense For Conversation Entailment Task Yang Zhenning, Krawec Ryan, Wu Liang-yuan
- Exploiting Inter-layer Expert Affinity For Accelerating Mixture-of-experts Model Inference Yao Jinghan, Anthony Quentin, Shafi Aamir, Subramoni Hari, Panda Dhabaleswar K.
- Training Ultra Long Context Language Model With Fully Pipelined Distributed Transformer Yao Jinghan, Jacobs Sam Ade, Tanaka Masahiro, Ruwase Olatunji, Shafi Aamir, Subramoni Hari, Panda Dhabaleswar K.
- Enhancing In-context Learning Performance With Just Svd-based Weight Pruning: A Theoretical Perspective Yao Xinhao, Hu Xiaolin, Yang Shenzhi, Liu Yong
- Efficient Vision-and-language Pre-training With Text-relevant Image Patch Selection Ye Wei, Jiang Chaoya, Xu Haiyang, Ye Chenhao, Li Chenliang, Yan Ming, Zhang Shikun, Huang Songhang, Huang Fei
- Long-context Language Modeling With Parallel Context Encoding Yen Howard, Gao Tianyu, Chen Danqi
- MEND: Meta Demonstration Distillation For Efficient And Effective In-context Learning Li Yichuan, Ma Xiyao, Lu Sixing, Lee Kyumin, Liu Xiaohu, Guo Chenlei
- Stablemask: Refining Causal Masking In Decoder-only Transformer Yin Qingyu, He Xuzheng, Zhuang Xiang, Zhao Yu, Yao Jianhua, Shen Xiaoyu, Zhang Qiang
- Debunc: Mitigating Hallucinations In Large Language Model Agent Communication With Uncertainty Estimations Yoffe Luke, Amayuelas Alfonso, Wang William Yang
- Making The Most Of Your Model: Methods For Finetuning And Applying Pretrained Transformers Yoshida Davis
- Tree-planted Transformers: Unidirectional Transformer Language Models With Implicit Syntactic Supervision Yoshida Ryo, Someya Taiga, Oseki Yohei
- Unveiling And Harnessing Hidden Attention Sinks: Enhancing Large Language Models Without Training Through Attention Calibration Yu Zhongzhi, Wang Zheng, Fu Yonggan, Shi Huihong, Shaikh Khalid, Lin Yingyan Celine
- Predicting Rewards Alongside Tokens: Non-disruptive Parameter Insertion For Efficient Inference Intervention In Large Language Model Yuan Chenhan, Huang Fei, Peng Ru, Lu Keming, Yu Bowen, Zhou Chang, Zhou Jingren
- Remamba: Equip Mamba With Effective Long-sequence Modeling Yuan Danlong, Liu Jiahao, Li Bei, Zhang Huishuai, Wang Jingang, Cai Xunliang, Zhao Dongyan
- KV Cache Compression, But What Must We Give In Return? A Comprehensive Benchmark Of Long Context Capable Approaches Yuan Jiayi, Liu Hongyi, Zhong Shaochen Henry, Chuang Yu-neng, Li Songchen, Wang Guanchu, Le Duy, Jin Hongye, Chaudhary Vipin, Xu Zhaozhuo, Liu Zirui, Hu Xia
- Focus On The Core: Efficient Attention Via Pruned Token Compression For Document Classification Yun Jungmin, Kim Mihyeon, Kim Youngbin
- Toward Inference-optimal Mixture-of-expert Large Language Models Yun Longfei, Zhuang Yonghao, Fu Yao, Xing Eric P, Zhang Hao
- ABQ-LLM: Arbitrary-bit Quantized Inference Acceleration For Large Language Models Zeng Chao, Liu Songwei, Xie Yusheng, Liu Hong, Wang Xiaojian, Wei Miao, Yang Shu, Chen Fangmin, Mei Xing
- Actions Speak Louder Than Words: Trillion-parameter Sequential Transducers For Generative Recommendations Zhai Jiaqi, Liao Lucy, Liu Xing, Wang Yueming, Li Rui, Cao Xuan, Gao Leon, Gong Zhaojie, Gu Fangda, He Michael, Lu Yinghai, Shi Yu
- Distilling Implicit Multimodal Knowledge Into Llms For Zero-resource Dialogue Generation Zhang Bo, Ma Hui, Ding Jian, Wang Jian, Xu Bo, Lin Hongfei
- Unforgettable Generalization In Language Models Zhang Eric, Choshen Leshem, Andreas Jacob
- Sinklora: Enhanced Efficiency And Chat Capabilities For Long-context Large Language Models Zhang Hengyu
- Regurgitative Training: The Value Of Real Data In Training Large Language Models Zhang Jinghui, Qiao Dandan, Yang Mochen, Wei Qiang
- The Hedgehog & The Porcupine: Expressive Linear Attentions With Softmax Mimicry Zhang Michael, Bhatia Kush, Kumbong Hermann, Ré Christopher
- VAR-CLIP: Text-to-image Generator With Visual Auto-regressive Modeling Zhang Qian, Dai Xiangzi, Yang Ninghua, An Xiang, Feng Ziyong, Ren Xingyu
- Laser: Parameter-efficient LLM Bi-tuning For Sequential Recommendation With Collaborative Information Zhang Xinyu, Hu Linmei, Zhang Luhao, Song Dandan, Huang Heyan, Nie Liqiang
- Finercut: Finer-grained Interpretable Layer Pruning For Large Language Models Zhang Yang, Li Yawei, Wang Xinpeng, Shen Qianli, Plank Barbara, Bischl Bernd, Rezaei Mina, Kawaguchi Kenji
- OT-VP: Optimal Transport-guided Visual Prompting For Test-time Adaptation Zhang Yunbei, Mehra Akshay, Hamm Jihun
- Galla: Graph Aligned Large Language Models For Improved Source Code Understanding Zhang Ziyin, Yu Hang, Li Shijie, Di Peng, Li Jianguo, Wang Rui
- Revolutionizing Finance With Llms: An Overview Of Applications And Insights Zhao Huaqin, Liu Zhengliang, Wu Zihao, Li Yiwei, Yang Tianze, Shu Peng, Xu Shaochen, Dai Haixing, Zhao Lin, Mai Gengchen, Liu Ninghao, Liu Tianming
- Efficiently Training 7B LLM With 1 Million Sequence Length On 8 Gpus Zhao Pinxue, Zhang Hailin, Fu Fangcheng, Nie Xiaonan, Liu Qibin, Yang Fang, Peng Yuanbo, Jiao Dian, Li Shuaipeng, Xue Jinbao, Tao Yangyu, Cui Bin
- Prepacking: A Simple Method For Fast Prefilling And Increased Throughput In Large Language Models Zhao Siyan, Israel Daniel, Van Den Broeck Guy, Grover Aditya
- Enhancing Cross-domain Pre-trained Decision Transformers With Adaptive Attention Zhao Wenhao, Xu Qiushui, Xu Linjie, Song Lei, Wang Jinyu, Zhou Chunlai, Bian Jiang
- How Do Large Language Models Handle Multilingualism? Zhao Yiran, Zhang Wenxuan, Chen Guizhen, Kawaguchi Kenji, Bing Lidong
- ALISA: Accelerating Large Language Model Inference Via Sparsity-aware KV Caching Zhao Youpeng, Wu Di, Wang Jun
- SWIFT: A Scalable Lightweight Infrastructure For Fine-tuning Zhao Yuze, Huang Jintao, Hu Jinghan, Wang Xingjun, Mao Yunlin, Zhang Daoze, Jiang Zeyinzi, Wu Zhikai, Ai Baole, Wang Ang, Zhou Wenmeng, Chen Yingda
- ICE-GRT: Instruction Context Enhancement By Generative Reinforcement Based Transformers Zheng Chen, Sun Ke, Tang Da, Ma Yukun, Zhang Yuyu, Xi Chenguang, Zhou Xun
- Exploring Boundary Of GPT-4V On Marine Analysis: A Preliminary Case Study Zheng Ziqiang, Chen Yiwei, Zhang Jipeng, Vu Tuan-anh, Zeng Huimin, Wong Tim Yue Him, Yeung Sai-kit
- Enhancing Multimodal Large Language Models With Multi-instance Visual Prompt Generator For Visual Representation Enrichment Zhong Wenliang, Wu Wenyi, Li Qi, Barton Rob, Du Boxin, Sam Shioulin, Bouyarmane Karim, Tutar Ismail, Huang Junzhou
- Misinforming Llms: Vulnerabilities, Challenges And Opportunities Zhou Bo, Geißler Daniel, Lukowicz Paul
- Mlps Compass: What Is Learned When Mlps Are Combined With Plms? Zhou Li, Chen Wenyu, Cao Yong, Zeng Dingyi, Liu Wanlong, Qu Hong
- Pre-trained Large Language Models Use Fourier Features To Compute Addition Zhou Tianyi, Fu Deqing, Sharan Vatsal, Jia Robin
- DETAIL: Task Demonstration Attribution For Interpretable In-context Learning Zhou Zijian, Lin Xiaoqiang, Xu Xinyi, Prakash Alok, Rus Daniela, Low Bryan Kian Hsiang
- Scalable Matmul-free Language Modeling Zhu Rui-jie, Zhang Yu, Sifferman Ethan, Sheaves Tyler, Wang Yiqiao, Richmond Dustin, Zhou Peng, Eshraghian Jason K.
- IAPT: Instruction-aware Prompt Tuning For Large Language Models Zhu Wei, Tian Aaron Xuxiang, Yin Congrui, Ni Yuan, Wang Xiaoling, Xie Guotong
- Generative Pre-trained Speech Language Model With Efficient Hierarchical Transformer Zhu Yongxin, Su Dan, He Liqiang, Xu Linli, Yu Dong
- Lumina-next: Making Lumina-t2x Stronger And Faster With Next-dit Zhuo Le, Du Ruoyi, Xiao Han, Li Yangguang, Liu Dongyang, Huang Rongjie, Liu Wenze, Zhao Lirui, Wang Fu-yun, Ma Zhanyu, Luo Xu, Wang Zehan, Zhang Kaipeng, Zhu Xiangyang, Liu Si, Yue Xiangyu, Liu Dingning, Ouyang Wanli, Liu Ziwei, Qiao Yu, Li Hongsheng, Gao Peng
- Llama-excitor: General Instruction Tuning Via Indirect Feature Interaction Zou Bo, Yang Chao, Qiao Yu, Quan Chengbin, Zhao Youjian
- MLKV: Multi-layer Key-value Heads For Memory Efficient Transformer Decoding Zuhri Zayd Muhammad Kawakibi, Adilazuarda Muhammad Farid, Purwarianti Ayu, Aji Alham Fikri
- BASE TTS: Lessons From Building A Billion-parameter Text-to-speech Model On 100K Hours Of Data Łajszczak Mateusz, Cámbara Guillermo, Li Yang, Beyhan Fatih, Van Korlaar Arent, Yang Fan, Joly Arnaud, Martín-cortinas Álvaro, Abbas Ammar, Michalski Adam, Moinet Alexis, Karlapati Sri, Muszyńska Ewa, Guo Haohan, Putrycz Bartosz, Gambino Soledad López, Yoo Kayeon, Sokolova Elena, Drugman Thomas
- Concept-aware Data Construction Improves In-context Learning Of Language Models Štefánik Michal, Kadlčík Marek, Sojka Petr