arXiv Analytics

Sign in

arXiv:2107.10314 [cs.LG]AbstractReferencesReviewsResources

Small-text: Active Learning for Text Classification in Python

Christopher Schröder, Lydia Müller, Andreas Niekler, Martin Potthast

Published 2021-07-21Version 1

We present small-text, a simple modular active learning library, which offers pool-based active learning for text classification in Python. It comes with various pre-implemented state-of-the-art query strategies, including some which can leverage the GPU. Clearly defined interfaces allow to combine a multitude of such query strategies with different classifiers, thereby facilitating a quick mix and match, and enabling a rapid development of both active learning experiments and applications. To make various classifiers accessible in a consistent way, it integrates several well-known machine learning libraries, namely, scikit-learn, PyTorch, and huggingface transformers -- for which the latter integrations are available as optionally installable extensions. The library is available under the MIT License at https://github.com/webis-de/small-text.

Related articles: Most relevant | Search more
arXiv:2409.15848 [cs.LG] (Published 2024-09-24)
iGAiVA: Integrated Generative AI and Visual Analytics in a Machine Learning Workflow for Text Classification
arXiv:1911.11756 [cs.LG] (Published 2019-11-26)
Semi-Supervised Learning for Text Classification by Layer Partitioning
arXiv:2211.00369 [cs.LG] (Published 2022-11-01)
Anytime Generation of Counterfactual Explanations for Text Classification