MATLAB Answers

KenL
0

How to check the google page rank via MaTLAB?

Asked by KenL
on 16 Oct 2016
Latest activity Commented on by Walter Roberson
on 18 Oct 2016
I have a matrix containing list of URLs. May I know how to perform the google page rank check and update such check result to my matrix via MATLAB?

  3 Comments

his example shows how to use a PageRank algorithm to rank a collection of websites. Although the PageRank algorithm was originally designed to rank search engine results, it also can be more broadly applied to the nodes in many different types of graphs. The PageRank score gives an idea of the relative importance of each graph node based on how it is connected to the other nodes.
Theoretically, the PageRank score is the limiting probability that someone randomly clicking links on each website will arrive at any particular page. So pages with a high score are highly connected and discoverable within the network, and it is more likely a random web surfer will visit that page.
Algorithm Description At each step in the PageRank algorithm, the score of each page is updated according to,
r = (1-P)/n + P*(A'*(r./d) + s/n);
r is a vector of PageRank scores. P is a scalar damping factor (usually 0.85), which is the probability that a random surfer clicks on a link on the current page, instead of continuing on another random page. A' is the transpose of the adjacency matrix of the graph. d is a vector containing the out-degree of each node in the graph. d is set to 1 for nodes with no outgoing edges. n is the scalar number of nodes in the graph. s is the scalar sum of the PageRank scores for pages with no links. In other words, the rank of each page is largely based on the ranks of the pages that link to it. The term A'*(r./d) picks out the scores of the source nodes that link to each node in the graph, and the scores are normalized by the total number of outbound links of those source nodes. This ensures that the sum of the PageRank scores is always 1. For example, if node 2 links to nodes 1, 3, and 4, then it transfers 1/3 of its PageRank score to each of those nodes during each iteration of the algorithm. also see https://in.mathworks.com/help/matlab/examples/use-page-rank-algorithm-to-rank-websites.html
Thanks. Is there a specific command in MATLAB to extract such page rank information?
You can calculate a page ranking by crawling web sites and finding the links and then running the algorithms, but if you want to know Google's PageRank™ for a site, then that is no longer publicly available.

Sign in to comment.

2 Answers

Answer by Aditya Jain on 16 Oct 2016
Edited by Walter Roberson
on 16 Oct 2016

  1 Comment

I read through and try to understand the provided link but still didn't catch how it works. My objective is to get the pagerank for all URLs automatically via MATLAB rather than checking the pagerank for all the URLs one by one manually from the pagerank checker portal. Please advise.

Sign in to comment.


Answer by Walter Roberson
on 16 Oct 2016

If the question is how to look up Google's own pagerank of a particular URL, then the answer is that Google is turning that service off. See http://searchengineland.com/rip-google-pagerank-retrospective-244286
There are some services that claim to allow pagerank access; see https://www.google.ca/search?q=look+up+pagerank and pick one that allows fillable forms.

  0 Comments

Sign in to comment.