Abstract
Sequence prediction is a key task in machine learning and data mining. It involves predicting the next symbol in a sequence given its previous symbols. Our motivating application is predicting the execution path of a process on an operating system in real-time. In this case, each symbol in the sequence represents a system call accompanied with arguments and a return value. We propose a novel online algorithm for predicting the next system call by leveraging both context and side information. The online update of our algorithm is efficient in terms of time cost and memory consumption. Experiments on real-world data sets showed that our method outperforms state-of-the-art online sequence prediction methods in both accuracy and efficiency, and incorporation of side information does significantly improve the predictive accuracy.
Original language | English |
---|---|
Article number | 6729627 |
Pages (from-to) | 1235-1240 |
Number of pages | 6 |
Journal | Proceedings - IEEE International Conference on Data Mining, ICDM |
DOIs | |
State | Published - 2013 |
Event | 13th IEEE International Conference on Data Mining, ICDM 2013 - Dallas, TX, United States Duration: 7 Dec 2013 → 10 Dec 2013 |
Keywords
- online learning
- scalability
- sequence predictio
- system trace