Hostname: page-component-848d4c4894-75dct Total loading time: 0 Render date: 2024-05-11T18:58:44.553Z Has data issue: false hasContentIssue false

Graph fibrations, graph isomorphism, and PageRank

Published online by Cambridge University Press:  20 July 2006

Paolo Boldi
Affiliation:
Dipartimento di Scienze dell'Informazione, Università degli Studi di Milano, Italy; santini@dsi.unimi.it
Violetta Lonati
Affiliation:
Dipartimento di Scienze dell'Informazione, Università degli Studi di Milano, Italy; santini@dsi.unimi.it
Massimo Santini
Affiliation:
Dipartimento di Scienze dell'Informazione, Università degli Studi di Milano, Italy; santini@dsi.unimi.it
Sebastiano Vigna
Affiliation:
Dipartimento di Scienze dell'Informazione, Università degli Studi di Milano, Italy; santini@dsi.unimi.it
Get access

Abstract

PageRank is a ranking method that assigns scores to web pages using the limit distribution of a random walk on the web graph. A fibration of graphs is a morphism that is a local isomorphism of in-neighbourhoods, much in the same way a covering projection is a local isomorphism of neighbourhoods. We show that a deep connection relates fibrations and Markov chains with restart, a particular kind of Markov chains that include the PageRank one as a special case. This fact provides constraints on the values that PageRank can assume. Using our results, we show that a recently defined class of graphs that admit a polynomial-time isomorphism algorithm based on the computation of PageRank is really a subclass of fibration-prime graphs, which possess simple, entirely discrete polynomial-time isomorphism algorithms based on classical techniques for graph isomorphism. We discuss efficiency issues in the implementation of such algorithms for the particular case of web graphs, in which O(n) space occupancy (where n is the number of nodes) may be acceptable, but O(m) is not (where m is the number of arcs).

Type
Research Article
Copyright
© EDP Sciences, 2006

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

References

P. Boldi, M. Santini and S. Vigna, PageRank as a function of the damping factor, in Proc. of the Fourteenth International World Wide Web Conference. ACM Press. Chiba, Japan (2005) 557–566.
Boldi, P. and Vigna, S., Fibrations of graphs. Discrete Math. 243 (2002) 2166. CrossRef
P. Boldi and S. Vigna, The WebGraph framework I: Compression techniques, in Proc. of the Thirteenth International World Wide Web Conference. ACM Press, Manhattan, USA (2004) 595–601.
Cardon, A. and Crochemore, M., Partitioning a graph in O(|A|log2|V|). Theoret. Comput. Sci. 19 (1982) 8598. CrossRef
Corneil, D.G. and Gotlieb, C.C., An efficient algorithm for graph isomorphism. J. Assoc. Comput. Mach. 17 (1970) 5164. CrossRef
L. Eldén, The eigenvalues of the google matrix. Technical Report LiTH-MAT-R-04-01, Department of Mathematics, Linköping University, 2004. Available at arXiv:math/0401177.
G.H. Golub and C. Greif, Arnoldi-type algorithms for computing stationary distribution vectors, with application to PageRank, Technical Report SCCM-04-15, Stanford University Technical Report (2004).
Gori, M., Maggini, M. and Sarti, L., Exact and approximate graph matching using random walks. IEEETPAMI: IEEE Trans. Pattern Anal. Machine Intelligence 27 (2005) 11001111.
J.L. Gross and T.W. Tucker, Topological Graph Theory. Series in Discrete Mathematics and Optimization, Wiley-Interscience (1987).
A. Grothendieck, Technique de descente et théorèmes d'existence en géométrie algébrique, I. Généralités. Descente par morphismes fidèlement plats. Seminaire Bourbaki 190 (1959–1960).
T.H. Haveliwala, Efficient computation of PageRank. Technical Report 31, Stanford University Technical Report, October 1999. Available at http://dbpubs.stanford.edu/pub/1999-31.
T.H. Haveliwala, Topic-sensitive pagerank, in The eleventh International Conference on World Wide Web Conference. ACM Press (2002) 517–526.
T.H. Haveliwala and S.D. Kamvar, The second eigenvalue of the Google matrix. Technical Report 20, Stanford University Technical Report, March 2003. Available at http://dbpubs.stanford.edu/pub/2003-20.
Híc, P., Nedela, R. and Pavlíková, S., Front-divisors of trees. Acta Math. Univ. Comenian. (N.S.) 61 (1992) 6984.
J.E. Hopcroft, An nlogn algorithm for minimizing states in a finite automaton, in Theory of Machines and Computations, edited by Z. Kohavi and A. Paz. Academic Press (1971).
M. Iosifescu, Finite Markov Processes and Their Applications. John Wiley & Sons (1980).
S.D. Kamvar, T.H. Haveliwala, C.D. Manning and G.H. Golub, Exploiting the block structure of the web for computing PageRank. Technical Report 17, Stanford University Technical Report, March 2003. Available at http://dbpubs.stanford.edu/pub/2003-17.
S.D. Kamvar, T.H. Haveliwala, C.D. Manning and G.H. Golub, Extrapolation methods for accelerating PageRank computations, in Proceedings of the twelfth international conference on World Wide Web. ACM Press (2003) 261–270.
T. Kato, Perturbation Theory for Linear Operators. Springer-Verlag, second edition (1976).
L. László, Random walks on graphs: A survey, in Combinatorics, Paul Erdős is Eighty, Vol. 2, Bolyai Society Mathematical Studies, 1993, in Proceedings of the Meeting in honour of P. Erdős, Keszthely, Hungary 7 (1993) 1–46.
C. Pan-Chi Lee, G.H. Golub and S.A. Zenios, A fast two-stage algorithm for computing PageRank and its extensions. Technical Report SCCM-03-15, Stanford University Technical Report (2003). Available at http://www-sccm.stanford.edu/pub/sccm/sccm03-15_2.pdf.
D. Lind and B. Marcus, An Introduction to Symbolic Dynamics and Coding. Cambridge University Press, Cambridge UK (1995).
Practical, B.D. McKay graph isomorphism. Congressus Numerantium 30 (1981) 4587.
C.D. Meyer, Matrix analysis and applied linear algebra. Society for Industrial and Applied Mathematics, Philadelphia, PA, USA (2000).
Nasu, M., Constant-to-one and onto global maps of homomorphisms between strongly connect graphs. Ergod. Th. & Dynam. Sys. 3 (1983) 387413. CrossRef
Norris, N., Universal covers of graphs: Isomorphism to depth n - 1 implies isomorphism to all depths. Discrete Appl. Math. 56 (1995) 6174. CrossRef
L. Page, S. Brin, R. Motwani and T. Winograd, The PageRank citation ranking: Bringing order to the web. Technical Report 66, Stanford University, 1999. Available at http://dbpubs.stanford.edu/pub/1999-66.
D.M. Cvetković, M. Doob and H. Sachs, Spectra of Graphs. Academic Press (1978).
J.P. Schweitzer. Perturbation theory and finite markov chains. J. Appl. Probab. 5 (1968) 401–413.
E. Seneta, Non-negative matrices and Markov chains. Springer–Verlag, New York (1981).
Unger, S.H., GIT – A heuristic program for testing pairs of directed line graphs for isomorphism. Comm. ACM 7 (1964) 2634. CrossRef
S. Vigna, A guided tour in the topos of graphs. Technical Report 199-97, Università di Milano, Dipartimento di Scienze dell'Informazione, 1997. Available at http://vigna.dsi.unimi.it/ftp/papers/ToposGraphs.pdf.
K. Yosida, Functional Analysis. Springer-Verlag, (1980), Sixth Edition.