TY - GEN
T1 - Multimodal music retrieval for large databases
AU - Schuller, Björn
AU - Rigoll, Gerhard
AU - Lang, Manfred
PY - 2004
Y1 - 2004
N2 - In this contribution we present a novel multi-modal access to large MP3 music data-bases. Retrieval can be either fulfilled in a content-based manner or by keywords. As input modalities speech by natural language utterances or singing, and manual interaction by handwriting, typing or hardkeys are used. In order to achieve especially robust retrieval results and automatically suggest music to the user contextual knowledge of the time, date, season, user emotion, and listening habits is integrated in the retrieval process. The system communicates with the user by speech or visual reactions. The concepts shown are especially designed for home and mobile access on Tablet-PCs, PDAs, and similar PC solutions. The paper discusses the concept and a working prototype called Shangrila. An evaluation by a user study leads to an impression of the capabilities of the suggested approach to multimodal music retrieval.
AB - In this contribution we present a novel multi-modal access to large MP3 music data-bases. Retrieval can be either fulfilled in a content-based manner or by keywords. As input modalities speech by natural language utterances or singing, and manual interaction by handwriting, typing or hardkeys are used. In order to achieve especially robust retrieval results and automatically suggest music to the user contextual knowledge of the time, date, season, user emotion, and listening habits is integrated in the retrieval process. The system communicates with the user by speech or visual reactions. The concepts shown are especially designed for home and mobile access on Tablet-PCs, PDAs, and similar PC solutions. The paper discusses the concept and a working prototype called Shangrila. An evaluation by a user study leads to an impression of the capabilities of the suggested approach to multimodal music retrieval.
UR - http://www.scopus.com/inward/record.url?scp=11244298756&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:11244298756
SN - 0780386035
SN - 9780780386037
T3 - 2004 IEEE International Conference on Multimedia and Expo (ICME)
SP - 755
EP - 758
BT - 2004 IEEE International Conference on Multimedia and Expo (ICME)
T2 - 2004 IEEE International Conference on Multimedia and Expo (ICME)
Y2 - 27 June 2004 through 30 June 2004
ER -