arXiv Analytics

Sign in

arXiv:1406.2139 [cs.CV]AbstractReferencesReviewsResources

Log-Euclidean Bag of Words for Human Action Recognition

Masoud Faraki, Maziar Palhang, Conrad Sanderson

Published 2014-06-09, updated 2014-07-08Version 2

Representing videos by densely extracted local space-time features has recently become a popular approach for analysing actions. In this paper, we tackle the problem of categorising human actions by devising Bag of Words (BoW) models based on covariance matrices of spatio-temporal features, with the features formed from histograms of optical flow. Since covariance matrices form a special type of Riemannian manifold, the space of Symmetric Positive Definite (SPD) matrices, non-Euclidean geometry should be taken into account while discriminating between covariance matrices. To this end, we propose to embed SPD manifolds to Euclidean spaces via a diffeomorphism and extend the BoW approach to its Riemannian version. The proposed BoW approach takes into account the manifold geometry of SPD matrices during the generation of the codebook and histograms. Experiments on challenging human action datasets show that the proposed method obtains notable improvements in discrimination accuracy, in comparison to several state-of-the-art methods.

Related articles: Most relevant | Search more
arXiv:1905.00745 [cs.CV] (Published 2019-05-02)
Human Action Recognition with Deep Temporal Pyramids
arXiv:1202.2528 [cs.CV] (Published 2012-02-12)
Using Covariance Matrices as Feature Descriptors for Vehicle Detection from a Fixed Camera
arXiv:2208.00306 [cs.CV] (Published 2022-07-30)
Doubly Deformable Aggregation of Covariance Matrices for Few-shot Segmentation