Talks and Poster Presentations (with Proceedings-Entry):
R. Mayer, R. Neumayer, A. Rauber:
"Combination of Audio and Lyrics Features for Genre Classification in Digital Audio Collections";
Talk: ACM Multimedia,
- 2008-10-31; in: "Proceedings of the ACM Multimedia 2008",
ACM New York, NY, USA,
In many areas, multimedia technology has made its way into the mainstream. In the case of digital audio, this is manifested in numerous online music stores having turned into profitable businesses. The widespread user adoption of digital audio on both home computers and mobile players shows the size of this market. Thus, ways to automatically process and handle the growing size of private and commercial collections become increasingly important; with this comes a need to make music interpretable by computers. The most obvious representation of audio files is their sound; there are, however, more ways of describing a song, for instance its lyrics, which describe songs in terms of content words. Lyrics of music may be orthogonal to its sound, and differ greatly from other texts regarding their (rhyme) structure. Consequently, the exploitation of these properties has potential for typical music information retrieval tasks such as musical genre classification; so far, however, there is a lack of means to efficiently combine these modalities. In this paper, we present findings from investigating advanced lyrics features such as the frequency of certain rhyme patterns, several part-of-speech features, and statistical features such as words per minute (WPM). We further analyse to what extent a combination of these features with existing acoustic feature sets can be exploited for genre classification and provide experiments on two test collections.
Music Information Retrieval, Text Retrieval, Lyrics
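As an illustration of the statistical lyrics features named in the abstract, the following is a minimal sketch of how a words-per-minute value might be computed and appended to an acoustic feature vector. It is not the authors' implementation: the function names, the whitespace tokenisation, the example values, and the use of plain concatenation as the combination step are all assumptions made for this example.

```python
def words_per_minute(lyrics: str, duration_seconds: float) -> float:
    """Number of whitespace-separated words divided by track length in minutes."""
    if duration_seconds <= 0:
        raise ValueError("duration_seconds must be positive")
    return len(lyrics.split()) / (duration_seconds / 60.0)


def combine_features(audio_features, lyrics_features):
    """Concatenate two feature vectors into one (assumed combination strategy)."""
    return list(audio_features) + list(lyrics_features)


# Hypothetical example values, not taken from the paper.
lyrics = "la la la la la la"                             # 6 words
wpm = words_per_minute(lyrics, duration_seconds=120.0)   # 6 words / 2 minutes
audio = [0.12, 0.80]                                     # placeholder acoustic features
combined = combine_features(audio, [wpm])
```

The combined vector could then be passed to any standard classifier; which classifier and which acoustic feature sets the paper uses is not stated in this record.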
Created from the Publication Database of the Vienna University of Technology.