[Zurück]


Zeitschriftenartikel:

F. Meghdouri, F. Iglesias Vazquez, T. Zseby:
"Modeling Data with Observers";
Intelligent Data Analysis, 26 (2021), 3; S. 785 - 803.



Kurzfassung englisch:
Compact data models have become relevant due to the massive, ever-increasing generation of data. We propose Observers-based Data Modeling (ODM), a lightweight algorithm to extract low density data models (aka coresets) that are suitable for both static and stream data analysis. ODM coresets keep data internal structures while alleviating computational costs of machine learning during evaluation phases accounting for a O(n log n) worst-case complexity. We compare ODM with previous proposals in classification, clustering, and outlier detection. Results show the preponderance of ODM for obtaining the best trade-off in accuracy, versatility, and speed.

Schlagworte:
big data, low density models, coresets


"Offizielle" elektronische Version der Publikation (entsprechend ihrem Digital Object Identifier - DOI)
http://dx.doi.org/10.3233/IDA-215741

Elektronische Version der Publikation:
https://publik.tuwien.ac.at/files/publik_300951.pdf


Erstellt aus der Publikationsdatenbank der Technischen Universität Wien.