Talks and Poster Presentations (with Proceedings-Entry):
F. Iglesias Vazquez, D. Ojdanic, A. Hartl, T. Zseby:
"MDCStream: Stream Data Generator for Testing Analysis Algorithms";
Talk: EAI Valuetools 2020,
Tsukuba, Japan;
2020-05-18
- 2020-05-20; in: "Proceedings of the 13th EAI International Conference on Performance Evaluation Methodologies and Tools",
Association for Computing Machinery,
Tsukuba, Japan
(2020),
ISBN: 9781450376464;
56
- 63.
English abstract:
The establishment of modern technological paradigms like ubiquitous computing, big data, cyber-physical systems, or communication networks has strongly increased the need for efficient, effective data stream analysis. MDCStream is a MATLAB tool for generating temporal-dependent numerical datasets in order to stress-test stream data classification, clustering, and outlier detection algorithms. MDCStream is built on MDCGen, therefore showing a high flexibility for creating a wide diversity of data scenarios. To show an example of the potential of MDCStream, we tested a stream data clustering algorithm recently proposed in the literature with datasets generated with MDCStream. Datasets were designed to draw challenges related to space geometries and concept drift.
German abstract:
The establishment of modern technological paradigms like ubiquitous computing, big data, cyber-physical systems, or communication networks has strongly increased the need for efficient, effective data stream analysis. MDCStream is a MATLAB tool for generating temporal-dependent numerical datasets in order to stress-test stream data classification, clustering, and outlier detection algorithms. MDCStream is built on MDCGen, therefore showing a high flexibility for creating a wide diversity of data scenarios. To show an example of the potential of MDCStream, we tested a stream data clustering algorithm recently proposed in the literature with datasets generated with MDCStream. Datasets were designed to draw challenges related to space geometries and concept drift.
Keywords:
data generation, stream data, synthetic data, multi-dimensional data, concept drift, nonstationarity, MATLAB
"Official" electronic version of the publication (accessed through its Digital Object Identifier - DOI)
http://dx.doi.org/10.1145/3388831.3388832
Electronic version of the publication:
https://dl.acm.org/doi/10.1145/3388831.3388832
Created from the Publication Database of the Vienna University of Technology.