arXiv:1711.08058 [cs.LG]
Multiple-Instance, Cascaded Classification for Keyword Spotting in Narrow-Band Audio
Ahmad AbdulKader, Kareem Nassar, Mohamed Mahmoud, Daniel Galvez, Chetan Patil
Published 2017-11-21, Version 1
We propose using cascaded classifiers for a keyword spotting (KWS) task on narrow-band (NB), 8 kHz audio acquired in non-IID environments, a more challenging task than most state-of-the-art KWS systems face. We present a model that incorporates Deep Neural Networks (DNNs), cascading, multiple-feature representations, and multiple-instance learning. The cascaded classifiers handle the task's class imbalance and reduce power consumption on computationally constrained devices via early termination. The KWS system achieves a false negative rate of 6% at an hourly false positive rate of 0.75.
Comments: To be published in the proceedings of NIPS 2017
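The abstract's two central ideas, early termination in a cascade and multiple-instance decisions over an utterance, can be illustrated with a minimal sketch. This is not the authors' implementation: the scoring functions `mean_abs` and `peak`, the thresholds, and the "any window fires" bag rule are hypothetical stand-ins chosen only to make the control flow concrete. In the paper the stages are DNN-based classifiers.

```python
from typing import Callable, Sequence, Tuple

# A stage is a (scoring function, rejection threshold) pair.
# Both are illustrative assumptions, not taken from the paper.
Stage = Tuple[Callable[[Sequence[float]], float], float]


def mean_abs(window: Sequence[float]) -> float:
    """Cheap first-stage score: mean absolute amplitude (hypothetical)."""
    return sum(abs(x) for x in window) / len(window)


def peak(window: Sequence[float]) -> float:
    """Second-stage score: peak absolute amplitude (hypothetical)."""
    return max(abs(x) for x in window)


def cascade_predict(window: Sequence[float],
                    stages: Sequence[Stage]) -> Tuple[bool, int]:
    """Run stages in order; stop at the first stage that rejects.

    Returns (is_keyword, number_of_stages_evaluated). Most windows are
    negatives, so rejecting them early avoids running later stages.
    """
    for i, (score_fn, threshold) in enumerate(stages, start=1):
        if score_fn(window) < threshold:
            return False, i  # early termination
    return True, len(stages)


def bag_predict(windows: Sequence[Sequence[float]],
                stages: Sequence[Stage]) -> bool:
    """Multiple-instance rule (assumed): a bag of windows is positive
    if at least one window passes every stage of the cascade."""
    return any(cascade_predict(w, stages)[0] for w in windows)


stages = [(mean_abs, 0.1), (peak, 0.8)]
silence = [0.0, 0.01, -0.02, 0.0]
medium = [0.3, -0.3, 0.3, -0.3]
loud = [0.5, -0.9, 0.7, -0.6]

print(cascade_predict(silence, stages))    # rejected by the first stage
print(cascade_predict(medium, stages))     # rejected by the second stage
print(cascade_predict(loud, stages))       # accepted by all stages
print(bag_predict([silence, loud], stages))
```

In this sketch the near-silent window exits after one stage, which is the mechanism the abstract credits for reduced power consumption: the cheap stage filters the abundant negative class so the later stages run only on the remaining candidates.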
Related articles:
arXiv:2109.10252 [cs.LG] (Published 2021-09-21)
Audiomer: A Convolutional Transformer for Keyword Spotting
arXiv:1807.00560 [cs.LG] (Published 2018-07-02)
Weight-importance sparse training in keyword spotting
arXiv:2305.05110 [cs.LG] (Published 2023-05-09)
Semi-Supervised Federated Learning for Keyword Spotting