arXiv:2107.00451 [cs.CV]
Keywords: transformer, full spatiotemporal feature structure, videolightformer, 2d convolutional temporal segment network, efficient video action recognition remains