TY - GEN
T1 - Multi-task feature selection on multiple networks via maximum flows
AU - Sugiyama, Mahito
AU - Azencott, Chloé Agathe
AU - Grimm, Dominik
AU - Kawahara, Yoshinobu
AU - Borgwardt, Karsten M.
PY - 2014
Y1 - 2014
N2 - We propose a new formulation of multi-task feature selection coupled with multiple network regularizers, and show that the problem can be exactly and efficiently solved by maximum flow algorithms. This method contributes to one of the central topics in data mining: How to exploit structural information in multivariate data analysis, which has numerous applications, such as gene regulatory and social network analysis. On simulated data, we show that the proposed method leads to higher accuracy in discovering causal features by solving multiple tasks simultaneously using networks over features. Moreover, we apply the method to multi-locus association mapping with Arabidopsis thaliana genotypes and flowering time phenotypes, and demonstrate its ability to recover more known phenotype-related genes than other state-of-the-art methods.
AB - We propose a new formulation of multi-task feature selection coupled with multiple network regularizers, and show that the problem can be exactly and efficiently solved by maximum flow algorithms. This method contributes to one of the central topics in data mining: How to exploit structural information in multivariate data analysis, which has numerous applications, such as gene regulatory and social network analysis. On simulated data, we show that the proposed method leads to higher accuracy in discovering causal features by solving multiple tasks simultaneously using networks over features. Moreover, we apply the method to multi-locus association mapping with Arabidopsis thaliana genotypes and flowering time phenotypes, and demonstrate its ability to recover more known phenotype-related genes than other state-of-the-art methods.
KW - Feature selection
KW - Multi-locus association mapping
KW - Multi-task learning
KW - Network regularizer
UR - http://www.scopus.com/inward/record.url?scp=84945543585&partnerID=8YFLogxK
U2 - 10.1137/1.9781611973440.23
DO - 10.1137/1.9781611973440.23
M3 - Conference contribution
AN - SCOPUS:84945543585
T3 - SIAM International Conference on Data Mining 2014, SDM 2014
SP - 199
EP - 207
BT - SIAM International Conference on Data Mining 2014, SDM 2014
A2 - Zaki, Mohammed J.
A2 - Banerjee, Arindam
A2 - Parthasarathy, Srinivasan
A2 - Ning-Tan, Pang
A2 - Obradovic, Zoran
A2 - Kamath, Chandrika
PB - Society for Industrial and Applied Mathematics Publications
T2 - 14th SIAM International Conference on Data Mining, SDM 2014
Y2 - 24 April 2014 through 26 April 2014
ER -