Know your meme dataset #458

Open
gkaramanis opened this issue Sep 3, 2022 · 3 comments

Labels
data: check img alt text: We need to write or confirm alt text for the images that is friendly to e-readers.
data: provenance: We need to make sure the data followed robots.txt and otherwise isn't going to cause headache.
dataset requested PR

Comments

@gkaramanis
Contributor

Dataset kym21_03_2022.zip (JSON):

https://owncloud.ut.ee/owncloud/s/2LosgCo4bTjGM8n

Article:
https://knowyourmeme.com/editorials/insights/where-do-memes-come-from-the-top-platforms-from-2010-2022

Seen at:
https://s2.washingtonpost.com/camp-rw/?trackId=61b504ca9bbc0f79fd77b746&s=63135e17ab732227d00897ce

@jonthegeek
Collaborator

What is the source of the dataset? We need to make sure we can track the usage rights of any datasets we use. The Washington Post link is dead, and the Article doesn't share a dataset.

@jonthegeek added the "data: provenance" label on Dec 9, 2022
@gkaramanis
Contributor Author

It’s most probably scraped. The other dataset I had found was https://www.kaggle.com/datasets/podsyp/a-lot-of-memes-info-stats

The link was for the "How to read this chart" newsletter. It had images from the article, but no dataset.

@jonthegeek added the "data: needs article link" and "data: check img alt text" labels on Feb 27, 2023
@lgibson7
Member

Hi @gkaramanis. Thanks for submitting this issue. Would you be willing to submit the data set through a PR? You can find the instructions on how to do so here.
