arXiv:2407.10490 Abstract | arXiv Analytics

arXiv:2407.10490 [cs.LG]Abstract References Reviews Resources

Learning Dynamics of LLM Finetuning

Published 2024-07-15Version 1

Learning dynamics, which describes how the learning of specific training examples influences the model's prediction of other examples, give us a powerful tool for understanding the behavior of deep learning systems. We study the learning dynamics of large language models during finetuning, by analyzing the step-wise decomposition and accumulated influence among different responses. Our framework allows a uniform interpretation of many interesting observations about the training of popular algorithms for both instruction tuning and preference tuning. The analysis not only explains where the benefits of these methods come from but also inspires a simple, effective method to further improve the alignment performance. Code for experiments is available at https://github.com/Joshua-Ren/Learning_dynamics_LLM.

Comments: 32 pages

Categories: cs.LG, cs.AI, cs.CL

Keywords: learning dynamics, llm finetuning, specific training examples influences, large language models, deep learning systems

Related articles: Most relevant | Search more

arXiv:2308.08614 [cs.LG] (Published 2023-08-16)

Boosting Logical Reasoning in Large Language Models through a New Framework: The Graph of Thought

Bin Lei, pei-Hung Lin, Chunhua Liao, Caiwen Ding

arXiv:2306.05052 [cs.LG] (Published 2023-06-08)

Interpretable Medical Diagnostics with Structured Data Extraction by Large Language Models

Aleksa Bisercic, Mladen Nikolic, Mihaela van der Schaar, Boris Delibasic, Pietro Lio, Andrija Petrovic

arXiv:2306.03438 [cs.LG] (Published 2023-06-06)

Large Language Models of Code Fail at Completing Code with Potential Bugs