Talks and Poster Presentations (with Proceedings-Entry):
A. Schindler, T. Lidy, A. Rauber:
"Comparing shallow versus deep neural network architectures for automatic music genre classification";
Talk: 9th Forum Media Technology (FMT2016),
St. Pölten, Austria;
- 2016-11-24; in: "Proceedings of the 9th Forum Media Technology (FMT2016)",
St. Pölten University of Applied Sciences, Institute of Creative\Media/Technologies,
In this paper we investigate performance differences of different neural network architectures on the task of automatic music genre classification. Comparative evaluations on four well known datasets of different sizes were performed including the application of two audio data augmentation methods. The results show that shallow network architectures are better suited for small datasets than deeper models, which could be relevant for experiments and applications which rely on small datasets. A noticeable advantage was observed through the application of data augmentation using deep models. A final comparison with previous evaluations on the same datasets shows that the presented neural network based approaches already outperform state-of-the-art handcrafted music features.
Neural networks, Deep learning, Classification, Music, Audio, Convolutional Neural Networks, Neural Network Architectures
Electronic version of the publication:
Created from the Publication Database of the Vienna University of Technology.