TY - GEN
T1 - "The Godfather" vs. "Chaos"
T2 - ICDAR2009 - 10th International Conference on Document Analysis and Recognition
AU - Schuller, Björn
AU - Schenk, Joachim
AU - Rigoll, Gerhard
AU - Knaup, Tobias
PY - 2009
Y1 - 2009
N2 - In the fields of sentiment and emotion recognition, bag of words modeling has lately become popular for the estimation of valence in text. A typical application is the evaluation of reviews of e. g. movies, music, or games. In this respect we suggest the use of back-off N-Grams as basis for a vector space construction in order to combine advantages of word-order modeling and easy integration into potential acoustic feature vectors intended for spoken-document retrieval. For a fine granular estimate we consider data-driven regression next to classification based on Support Vector Machines. Alternatively the on-line knowledge sources ConceptNet, General Inquirer, and WordNet not only serve to reduce out-of-vocabulary events, but also as basis for a purely linguistic analysis. As special benefit, this approach does not demand labeled training data. A large set of 100 k movie reviews of 20 years stemming from Metacritic is utilized throughout extensive parameter discussion and comparative evaluation effectively demonstrating efficiency of the proposed methods.
AB - In the fields of sentiment and emotion recognition, bag of words modeling has lately become popular for the estimation of valence in text. A typical application is the evaluation of reviews of e. g. movies, music, or games. In this respect we suggest the use of back-off N-Grams as basis for a vector space construction in order to combine advantages of word-order modeling and easy integration into potential acoustic feature vectors intended for spoken-document retrieval. For a fine granular estimate we consider data-driven regression next to classification based on Support Vector Machines. Alternatively the on-line knowledge sources ConceptNet, General Inquirer, and WordNet not only serve to reduce out-of-vocabulary events, but also as basis for a purely linguistic analysis. As special benefit, this approach does not demand labeled training data. A large set of 100 k movie reviews of 20 years stemming from Metacritic is utilized throughout extensive parameter discussion and comparative evaluation effectively demonstrating efficiency of the proposed methods.
UR - http://www.scopus.com/inward/record.url?scp=71249091768&partnerID=8YFLogxK
U2 - 10.1109/ICDAR.2009.194
DO - 10.1109/ICDAR.2009.194
M3 - Conference contribution
AN - SCOPUS:71249091768
SN - 9780769537252
T3 - Proceedings of the International Conference on Document Analysis and Recognition, ICDAR
SP - 858
EP - 862
BT - ICDAR2009 - 10th International Conference on Document Analysis and Recognition
Y2 - 26 July 2009 through 29 July 2009
ER -