CitationRank (like PageRank accept for papers/citations) #18

cegme · 2012-09-15T17:46:29Z

Here is another problem for you @virup @clintpgeorge @supriyan

We want to calculate a global importance factor for all the papers in the data set.
This is similar to page rank. The value of a paper CR(p) should produce a value that is the probability that if I am randomly looking for an important paper I land on p.

A paper with citations should have a higher value than a paper with no citations.

A paper with P citations should have a smaller value compared to a paper with G citations of citations where |P| - |G| < sigma.

The references of a paper do no affect the paper's score. (Although we should have a self-citation penalty)

Also, can we compute these values using SQL?

The text was updated successfully, but these errors were encountered:

supriyan · 2012-09-16T20:15:44Z

If this ranking has to be done purely on the basis of citations, then i
have done it. I can explain when we meet.
Just brain storming - is citations the only factor for a paper to be
important or it also depends on who cited it - for eg -

2 papers, both have 50 citations in dblp, but one gets cited by more
important papers.

-Sup

On Sat, Sep 15, 2012 at 1:46 PM, Christan Grant [email protected]:

Here is another problem for you @virup https://github.com/virup
@clintpgeorge https://github.com/clintpgeorge @supriyanhttps://github.com/supriyan

We want to calculate a global importance factor for all the papers in the
data set.
This is similar to page rank. The value of a paper CR(p) should produce a
value that is the probability that if I am randomly looking for an
important paper I land on p.

A paper with citations should have a higher value than a paper with no
citations.

A paper with P citations should have a smaller value compared to a paper
with G citations of citations where |P| - |G| < sigma.

The references of a paper do no affect the paper's score. (Although we
should have a self-citation penalty)

Also, can we compute these values using SQL?

—
Reply to this email directly or view it on GitHubhttps://github.com//issues/18.

Supriya Nirkhiwale
4337 NW 35th Terrace
Gainesville, FL 32605
USA

…l numbers are incremented by one from the python program. This is issue #18

cegme · 2012-09-19T04:22:23Z

OK, I have a solution that it looks like it solves the problem. It is in UDF/citation_count.sql. You can see the program UDF/citation_count.py to see what the solution is supposed to be.

cegme added a commit that referenced this issue Sep 19, 2012

It is working... somewhat surprisingly so! The only thing is the leve…

35530c2

…l numbers are incremented by one from the python program. This is issue #18

cegme closed this as completed Sep 19, 2012

cegme reopened this Sep 19, 2012

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CitationRank (like PageRank accept for papers/citations) #18

CitationRank (like PageRank accept for papers/citations) #18

cegme commented Sep 15, 2012

supriyan commented Sep 16, 2012

cegme commented Sep 19, 2012

CitationRank (like PageRank accept for papers/citations) #18

CitationRank (like PageRank accept for papers/citations) #18

Comments

cegme commented Sep 15, 2012

supriyan commented Sep 16, 2012

cegme commented Sep 19, 2012