This blog will illustrate how a Page Rank is calculated in Gephi and how it is used in social network analysis. The blog includes a video which explains the concept in detail:
The Page Rank concept is a way in which a web page or social network node can be given an “importance score”. This importance score will always be a non-negative real number and all the scores (in the network) will add to 1, sometimes it might be expressed as a percentage. Before beginning the calculation, you must remove the self-loops, otherwise, the Page Rank will not sum up to 1. You can do this by filtering out the self-loops (see figure 1 and the video for details).
The Page Rank score for a given page/node is based on the links made to that said page/node from other pages/nodes. The links to a given page/node are called the backlinks/in-degrees for that page/node. The web/social network thus becomes a democracy where pages/nodes vote for the importance of other pages by linking to them. It is a variant of the Eigenvector value, but because it uses backlinks/in-degrees it is used in directed networks. Directed networks are networks that allow handles (the node or webpage) to follow another without that page or node following back.
The formula for the Page Rank score is illustrated in figure 2:
The objective is to develop a set of simultaneous equations to solve for each Page Rank. The formula also needs a damping factor (or probability as stated in Gephi). For each node, identify the other nodes linking to it. These “other nodes” will give the “targeted node” a proportion of their out-going links. A simple example is illustrated in Figure 3a, 3b and 3c:
The reality of calculating the Page Rank of any webpage or social network analysis is, the links will be very large. So, doing the calculations by hand become almost impossible. Platforms will Gephi have the algorithms built-in so the processing can be quick. It is what I use when doing such a calculation. Hopefully, though, you now understand the principles of the calculation.
Latest posts by Alan Shaw (see all)
- Developing Semi-Structured Interview Questions: An Inductive Approach. - April 9, 2020
- Developing Semi-Structured Interview Questions: A Deductive Approach - April 9, 2020
- Calculating the time it will take to do semi-structured the interviews - April 8, 2020
- How to develop a set of questions for a semi-structured interview: academic and commercial differences. - April 8, 2020
- How a Page Rank is calculated in Gephi - December 31, 2019