I’m looking for algorithms/techniques that are able to present the importance of a a single webpage. Leaving PageRank aside, are there any other methods of doing such a rating based on content, structure and hyperlinks with each other?
I’m not only talking about the connection from http://www.foo.com to http://www.bar.com as PageRank does but also from http://www.foo.com/bar to http://www.foo.com/baz and so on (beside the fact of adapting PageRank for these needs)
How do I “define” importance: I think of importance in this context as “how relevant is this side to the user, as well as how important it is to the rest of the site”.
E.g. A christmas raffle is announced on the startpage with only a single link leading to this site is more important to the user as well as to the site. An imprint, which has a link from every site (since it’s mostly somewhere in the footer) is not important although it has many links to it. Imprint is also not important to the site as a “unit” since it doesn’t give any real value for the page’s puprpose (= giving information, selling products, a general service, etc)
There is also SALSA which is more stable then HITS [so it suffers less from spam].
Since you are also interested in context of pages, you might want to have a look on Haveliwala’s work on topic sensitive page rank