Talks and Poster Presentations (with Proceedings-Entry):
F. Iglesias Vazquez, T. Zseby, A. Zimek:
"Outlier Detection Based on Low Density Models";
Talk: ICDM Workshop on Data Science and Big Data Analytics (DSBDA-2018), IEEE Internation Conference on Data Mining (ICDM-2018),
- 11-20-2018; in: "IEEE ICDM 2018 Workshops Proceedings",
IEEE Computer Society Press,
Most outlier detection algorithms are based on lazy learning or imply quadratic complexity. Both characteristics make them unsuitable for big data and stream data applications and preclude their applicability in systems that must operate autonomously. In this paper we propose a new algorithm---called SDO (Sparse Data Observers)---to estimate outlierness based on low density models of data. SDO is an eager learner; therefore, computational costs in application phases are severely reduced. We perform tests with a wide variation of synthetic datasets as well as the main datasets published in the literature for anomaly detection testing. Results show that SDO satisfactorily competes with the best ranked outlier detection alternatives. The good detection performance coupled with a low complexity makes SDO highly flexible and adaptable to stand-alone frameworks that must detect outliers fast with accuracy rates equivalent to lazy learning algorithms.
outlier analysis; eager learning; machine learning model
"Official" electronic version of the publication (accessed through its Digital Object Identifier - DOI)
Created from the Publication Database of the Vienna University of Technology.