Helleputte, Thibault
[UCL]
Dupont, Pierre
[UCL]
This paper addresses feature selection techniques for classification of high dimensional data, such as those produced by microarray experiments. Some prior knowledge may be available in this context to bias the selection towards some dimensions (genes) a priori assumed to be more relevant. We propose a feature selection method making use of this partial supervision. It extends previous works on embedded feature selection with linear models including regularization to enforce sparsity.
A practical approximation of this technique reduces to standard SVM learning with
iterative rescaling of the inputs. The scaling factors depend here on the prior knowledge but the final selection may depart from it. Practical results on several microarray data sets show the benefits of the proposed approach in terms of the stability of the selected gene lists with improved classification performances.
Bibliographic reference |
Helleputte, Thibault ; Dupont, Pierre. Partially supervised feature selection with regularized linear models.ICML '09 : 26th Annual International Conference on Machine Learning (Montréal, Canada, du 14/06/2009 au 18/06/2009). In: Andrea Danyluk, Léon Bottou, Michael Littman, ICML '09 Proceedings of the 26th Annual International Conference on Machine Learning, ACM : Montreal,Canada2009, p. 409-416 |
Permanent URL |
http://hdl.handle.net/2078.1/87501 |