-
Notifications
You must be signed in to change notification settings - Fork 3
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
CitationRank (like PageRank accept for papers/citations) #18
Comments
If this ranking has to be done purely on the basis of citations, then i 2 papers, both have 50 citations in dblp, but one gets cited by more -Sup On Sat, Sep 15, 2012 at 1:46 PM, Christan Grant [email protected]:
Supriya Nirkhiwale |
…l numbers are incremented by one from the python program. This is issue #18
OK, I have a solution that it looks like it solves the problem. It is in UDF/citation_count.sql. You can see the program UDF/citation_count.py to see what the solution is supposed to be. |
Here is another problem for you @virup @clintpgeorge @supriyan
We want to calculate a global importance factor for all the papers in the data set.
This is similar to page rank. The value of a paper
CR(p)
should produce a value that is the probability that if I am randomly looking for an important paper I land onp
.A paper with citations should have a higher value than a paper with no citations.
A paper with
P
citations should have a smaller value compared to a paper withG
citations of citations where|P| - |G| < sigma
.The references of a paper do no affect the paper's score. (Although we should have a self-citation penalty)
Also, can we compute these values using SQL?
The text was updated successfully, but these errors were encountered: