arXiv:2305.16704 Abstract | arXiv Analytics

arXiv:2305.16704 [cs.LG]Abstract References Reviews Resources

A Closer Look at In-Context Learning under Distribution Shifts

Published 2023-05-26Version 1

In-context learning, a capability that enables a model to learn from input examples on the fly without necessitating weight updates, is a defining characteristic of large language models. In this work, we follow the setting proposed in (Garg et al., 2022) to better understand the generality and limitations of in-context learning from the lens of the simple yet fundamental task of linear regression. The key question we aim to address is: Are transformers more adept than some natural and simpler architectures at performing in-context learning under varying distribution shifts? To compare transformers, we propose to use a simple architecture based on set-based Multi-Layer Perceptrons (MLPs). We find that both transformers and set-based MLPs exhibit in-context learning under in-distribution evaluations, but transformers more closely emulate the performance of ordinary least squares (OLS). Transformers also display better resilience to mild distribution shifts, where set-based MLPs falter. However, under severe distribution shifts, both models' in-context learning abilities diminish.

Categories: cs.LG, stat.ML

Keywords: in-context learning, closer look, transformers, large language models, set-based mlps

Related articles: Most relevant | Search more

arXiv:2410.21698 [cs.LG] (Published 2024-10-29)

On the Role of Depth and Looping for In-Context Learning with Task Diversity

Khashayar Gatmiry, Nikunj Saunshi, Sashank J. Reddi, Stefanie Jegelka, Sanjiv Kumar

arXiv:1910.00292 [cs.LG] (Published 2019-10-01)

Generalization in Generation: A closer look at Exposure Bias

Florian Schmidt

arXiv:2405.19156 [cs.LG] (Published 2024-05-29)

Beyond Discrepancy: A Closer Look at the Theory of Distribution Shift

Robi Bhattacharjee, Nick Rittler, Kamalika Chaudhuri

arXiv Analytics

arXiv:2305.16704 [cs.LG]Abstract References Reviews Resources

A Closer Look at In-Context Learning under Distribution Shifts

Links

Toolbox

arXiv:2305.16704 [cs.LG]AbstractReferencesReviewsResources

A Closer Look at In-Context Learning under Distribution Shifts

Links

Toolbox

arXiv:2305.16704 [cs.LG]Abstract References Reviews Resources