David George
First edition: February 2005, Sample Section
Lulu Press
ISBN 1-4116-2251-0
Where
∂ damping factor, given as 0.85
n total number of inbound links
PRn page rank of inbound page n
Cn number of outbound links from page n
1
So the PageRank (PR) for any page is equal to some constant value (0.15) plus
the sum of all the page ranks from inbound links divided by the number of links
(C) on each corresponding page multiplied by the damping factor (0.85).
Don’t worry if that is too hard to visualize, we will look at some real examples
below. One more thing before we do. As pages can, and usually do, feedback to
other pages by way of hyperlinks the calculation has to be made iteratively.
Every time Google visits a page it recalculates the PageRank based on the
weights of the inbound links. Assuming no other changes it takes around 40
iterations for the figure to converge to a stable value although Google itself uses
linear algebra to reduce the number of calculations. The damping factor (∂) is
set at 0.85 for optimum performance.
Theoretically a page with no inbound links has a page rank of 0.15. However
such a page could only be found by the Google robots if it was added to the
index by the Google submission page. Creating such orphan pages is not a good
idea as they cannot be found by users but what it does imply is that pages within
a site can gain PageRank by creating pages and by linking those pages internally.
Ideally such a site should have at least one inbound link from another site in the
Google index. Google therefore favors large, well linked sites.
Whereas if there was only a single outbound link to our page, the increase
would be 5.25.
2
It is very illustrative and the following simple examples will give you a feel for
what happens with different linking scenarios.
Before we go further another interesting feature of the PageRank calculation is
that the average of all the PageRanks in a closed system is 1.0. The entire
Google index is just such a system. If you took all the pages in the index, added
the page ranks together then divided this by the total number of pages the result
would be 1.0. This means that there is a finite amount of PageRank to share
out, as people add pages and these get indexed, the PageRank of existing pages
decreases.
3
To illustrate this point look at the site in figure 10. It is a closed set of pages,
essentially no different from the whole Google universe. Remember from the
example above that an orphan page has an intrinsic page rank of 0.15. But what
have we here? The page rank of the home page is 1.92 and the other sub-pages
0.69. How did that happen? Well the PageRank of the home page is equal to:
(1 - 0.15) + 0.85 x ((0.15/1) + (0.15/1) + (0.15/1)) = 0.5325
We then feed this figure back to the homepage. The PageRank calculator given
above can show you all the intermediate calculations. The figures pretty much
reach their final values after 18 iterations. If you add all the page ranks together
and divide by the total number of pages, four, the result is 1.0. We have a fixed
size pot which is distributed amongst the pages by the linking structure.
Hierarchical Structures
Figure 10 is an example of a hierarchy. These are fairly common in the real
world and are therefore often used on websites, for example to represent a
company and its divisions etc. If each page links back to the top of the
hierarchy the effect is to concentrate PageRank in this entry page. All other
factors being equal a higher PageRank will mean that this page appears first in
the search results. As this is the main entry point to the hierarchy this is a good
thing. Adding more levels to the hierarchy will further boost the PageRank of
the entry page.
4
Interlinked Structures
Sometimes we have more interlinking between pages. An online presentation
would have back and forward buttons to enable navigation between individual
slides. Some authoring tools (Microsoft FrontPage springs to mind) can
automatically create links between all pages within the same level of a hierarchy
as shown in figure 12.
5
Adding Pages
Each page that is added to a site increases the total amount of PageRank
available for distribution in the site by 1.0 but because there are more pages the
average PageRank remains at 1.0, assuming no inbound links. If we add
thousands of pages all linking back to the home page with little interlinking, we
can significantly boost the PageRank of this page. An effect that has been
noticed by Black Hat SEO experts.
To increase the average PageRank of our site we need to get inbound links. As
the Google universe is in equilibrium with an average PageRank of 1.0, for
every site with an above average PageRank there must be other sites with below
average PageRank. This brings us nicely onto PageRank leakage.
6
If we take our original closed hierarchy and add a single outbound link we
immediately see a big difference. The sub-page now divides the vote it used to
send exclusively back to the homepage with another external page. This means
less boost for the home page and therefore less PageRank transferred to the
other sub-pages. The overall PageRank for the site has dropped from 4.0 to
2.61. That’s more than a page of content. The site is leaking PageRank to the
outside world. This is why a few sites, such as Microsoft.com are PageRank
titans while thousands of other sites are PageRank minnows. Your outbound
link increases the chances that someone will follow that link to another site,
PageRank reflects that possibility.
Figure 15 shows the effect of a site with one outbound and one inbound-link.
Again the effect is to transfer PageRank we acquired from our inbound-link to
another site.
7
Outbound-Links and Link Hoarding
Don’t worry too much about PageRank leakage (some SEO’ers prefer the term
dilution as it isn’t really a leak). The examples given above are very simple. We’ll
discuss some techniques for mitigating PageRank dilution under the site map
and cross-linking sections. Link hoarding is also badly viewed by other websites
that may provide inbound-links. You will inevitably link to worthwhile, external
sites just as other sites will link to you. Think of PageRank as money, use it and
spend it wisely but remember there is more to good SEO than PageRank alone.