Callut, Jérome
[UCL]
Dupont, Pierre
[UCL]
Saerens, Marco
[UCL]
Françoisse, Kevin
[UCL]
This paper describes a novel technique, called D-walks, to tackle semi-supervised classification problems in large graphs. We introduce here a betweenness measure based on passage times during random walks of bounded lengths in the input graph. The class of unlabeled nodes is predicted by maximizing the betweenness with labeled
nodes. This approach can deal with directed or undirected graphs with a linear
time complexity with respect to the number of edges and the maximum walk length
considered. Preliminary experiments on the CORA database show that D-walks outper-
forms NetKit (Macskassy & Provost, 2007) as well as Zhou et al. algorithm (Zhou et al.,2005), both in classification rate and computing time.
Bibliographic reference |
Callut, Jérome ; Dupont, Pierre ; Saerens, Marco ; Françoisse, Kevin. Semi-supervised Classification in Graphs using Bounded Random Walks.17th Annual Machine Learning Conference of Belgium and the Netherlands (Benelearn) (Liège, Belgium, du 18/05/2008 au 20/05/2008). |
Permanent URL |
http://hdl.handle.net/2078/17749 |