arXiv Analytics

Sign in

arXiv:2206.14597 [cs.LG]AbstractReferencesReviewsResources

Generative Anomaly Detection for Time Series Datasets

Zhuangwei Kang, Ayan Mukhopadhyay, Aniruddha Gokhale, Shijie Wen, Abhishek Dubey

Published 2022-06-28Version 1

Traffic congestion anomaly detection is of paramount importance in intelligent traffic systems. The goals of transportation agencies are two-fold: to monitor the general traffic conditions in the area of interest and to locate road segments under abnormal congestion states. Modeling congestion patterns can achieve these goals for citywide roadways, which amounts to learning the distribution of multivariate time series (MTS). However, existing works are either not scalable or unable to capture the spatial-temporal information in MTS simultaneously. To this end, we propose a principled and comprehensive framework consisting of a data-driven generative approach that can perform tractable density estimation for detecting traffic anomalies. Our approach first clusters segments in the feature space and then uses conditional normalizing flow to identify anomalous temporal snapshots at the cluster level in an unsupervised setting. Then, we identify anomalies at the segment level by using a kernel density estimator on the anomalous cluster. Extensive experiments on synthetic datasets show that our approach significantly outperforms several state-of-the-art congestion anomaly detection and diagnosis methods in terms of Recall and F1-Score. We also use the generative model to sample labeled data, which can train classifiers in a supervised setting, alleviating the lack of labeled data for anomaly detection in sparse settings.

Comments: A shorter version of the paper was accepted at the ITSC 2022
Categories: cs.LG, cs.AI, eess.SP
Related articles: Most relevant | Search more
arXiv:2302.00061 [cs.LG] (Published 2023-01-31)
Dynamic Flows on Curved Space Generated by Labeled Data
arXiv:1703.00854 [cs.LG] (Published 2017-03-02)
Learning the Structure of Generative Models without Labeled Data
arXiv:2207.02964 [cs.LG] (Published 2022-07-06)
Mitigating shortage of labeled data using clustering-based active learning with diversity exploration