Incremental aspect models for mining document streams

Arun C. Surendran, Suvrit Sra

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

5 Scopus citations

Abstract

In this paper we introduce a novel approach for incrementally building aspect models, and use it to dynamically discover underlying themes from document streams. Using the new approach we present an application which we call "query-line tracking" i.e., we automatically discover and summarize different themes or stories that appear over time, and that relate to a particular query. We present evaluation on news corpora to demonstrate the strength of our method for both query-line tracking, online indexing and clustering.

Original languageEnglish
Title of host publicationKnowledge Discovery in Databases
Subtitle of host publicationPKDD 2006 - 10th European Conference on Principles and Practice of Knowledge Discovery in Databases, Proceedings
PublisherSpringer Verlag
Pages633-640
Number of pages8
ISBN (Print)3540453741, 9783540453741
DOIs
StatePublished - 2006
Externally publishedYes
Event10th European Conference on Principles and Practice of Knowledge Discovery in Databases, PKDD 2006 - Berlin, Germany
Duration: 18 Sep 200622 Sep 2006

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume4213 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference10th European Conference on Principles and Practice of Knowledge Discovery in Databases, PKDD 2006
Country/TerritoryGermany
CityBerlin
Period18/09/0622/09/06

Fingerprint

Dive into the research topics of 'Incremental aspect models for mining document streams'. Together they form a unique fingerprint.

Cite this