Dealing with url changes #40
I'm setting up a tracker for http://visir.is (an Icelandic news site).
I've noticed that when a headline is changed, their system generates a new URL for the article.
The URLs have this structure:
http://visir.is/g/<ARTICLE_ID>/< HEADLINE >
The < HEADLINE > part is redundant: the article can be viewed without it.
To get around this, I made some changes that allow a regex to be applied to each URL from the RSS feed. See here:
pallih@0519f31
With this change the URL that gets checked is http://visir.is/g/<ARTICLE_ID>, so subsequent changes to the headline are picked up as changes to the same article and not stored as a new article.
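For anyone who wants the general idea without reading the commit, here is a minimal sketch of this kind of URL normalization. It is not the code from the commit above: the names `URL_CANONICAL_REGEX` and `canonical_url` are made up for illustration, and the pattern assumes the article ID is the single path segment right after `/g/`.

```python
import re

# Hypothetical config value (not part of the project): a regex whose first
# capture group is the stable part of the article URL. Assumes the article
# ID is the path segment directly after /g/ and contains no slash.
URL_CANONICAL_REGEX = r'^(https?://(?:www\.)?visir\.is/g/[^/?#]+)'


def canonical_url(url, pattern=URL_CANONICAL_REGEX):
    """Return the stable prefix of url, or url unchanged if there is no match."""
    match = re.match(pattern, url)
    return match.group(1) if match else url


# Two feed entries for the same article after a headline change
# (made-up example URLs) reduce to the same key.
old = canonical_url("http://visir.is/g/12345/old-headline")
new = canonical_url("http://visir.is/g/12345/new-headline")
print(old == new)
```

URLs that do not match the pattern are returned untouched, so feeds from other sites would not be affected by the setting.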
I'm not sure introducing a config variable is appropriate for the project, but at least my solution is there if anyone needs it.