arXiv:1402.2447 Abstract | arXiv Analytics

arXiv:1402.2447 [stat.ML]Abstract References Reviews Resources

A comparison of linear and non-linear calibrations for speaker recognition

Niko Brümmer, Albert Swart, David van Leeuwen

Published 2014-02-11, updated 2014-04-09Version 2

In recent work on both generative and discriminative score to log-likelihood-ratio calibration, it was shown that linear transforms give good accuracy only for a limited range of operating points. Moreover, these methods required tailoring of the calibration training objective functions in order to target the desired region of best accuracy. Here, we generalize the linear recipes to non-linear ones. We experiment with a non-linear, non-parametric, discriminative PAV solution, as well as parametric, generative, maximum-likelihood solutions that use Gaussian, Student's T and normal-inverse-Gaussian score distributions. Experiments on NIST SRE'12 scores suggest that the non-linear methods provide wider ranges of optimal accuracy and can be trained without having to resort to objective function tailoring.

Comments: accepted for Odyssey 2014: The Speaker and Language Recognition Workshop

Categories: stat.ML, cs.LG

Keywords: non-linear calibrations, speaker recognition, comparison, nist sre12 scores, normal-inverse-gaussian score distributions

Related articles: Most relevant | Search more

arXiv:1709.09868 [stat.ML] (Published 2017-09-28)

A Generative Model for Score Normalization in Speaker Recognition

Albert Swart, Niko Brummer

arXiv:2202.00095 [stat.ML] (Published 2022-01-31)

Deconfounded Representation Similarity for Comparison of Neural Networks

Tianyu Cui, Yogesh Kumar, Pekka Marttinen, Samuel Kaski

arXiv:1709.10222 [stat.ML] (Published 2017-09-29)

Comparison of PCA with ICA from data distribution perspective

Miron Ivanov

arXiv Analytics

arXiv:1402.2447 [stat.ML]Abstract References Reviews Resources

A comparison of linear and non-linear calibrations for speaker recognition

Links

Toolbox

arXiv:1402.2447 [stat.ML]AbstractReferencesReviewsResources

A comparison of linear and non-linear calibrations for speaker recognition

Links

Toolbox

arXiv:1402.2447 [stat.ML]Abstract References Reviews Resources