Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[EN] Worldwide Armenian Churches Lists Extraction #10

Open
vvbabayan opened this issue Jun 6, 2023 · 0 comments
Open

[EN] Worldwide Armenian Churches Lists Extraction #10

vvbabayan opened this issue Jun 6, 2023 · 0 comments
Labels
extraction Task that require data extraction (scraping) skills topic-culture Tasks dedicatated Armenian culture, language and history

Comments

@vvbabayan
Copy link
Collaborator

vvbabayan commented Jun 6, 2023

Goal
The goal is to collect the most complete list of Armenian churches worldwide.

Tasks
The entry points for this task are the links listed below in the Resources section. We suggest that you should collect all the available churches names and locations as well as any additional information in brackets and store them in a machine readable format, such as CSV. There are four different sources in two different languages, however, the data structures there are very similar. It is not necessary to clean up the repeating values, you are most welcome to collect the raw data in csv format, and our team will tidy it up. The outputting file(s) should contain the same columns (with rows filled where available for an automatic extraction): _region (in Armeniapedia data), country, church name, location, contacts, the Wikipedia link) where available. Other columns can be added if you see it fit.

Context
There is a Worldwide Armenian Church Directory at Armeniapedia website which was last updated in early 2010, grouped by world regions and countries. Additional information from Wikipedia is necessary to retrieve information about the churches opened more recently. The most complete version of that Wikipedia page is given in Armenian, while Armeniapedia page is in English, and there is an another version of a worldwide list and a complete list of churches in Russia in Russian. We want to obtain the fullest list possible, which is why we ask you to collect the churches data in one file which will be cleaned up to avoid repetitions and to make it homogenous.

Requirements
A public GitHub repository should be created to store and publish the code and the data under one of the free and open licenses, such as Creative Commons or MIT.

Wishes
It would be best if your code is reusable, that is can be launch again by anyone who might want to update the dataset at a later point. For the same reason, we encourage you to comment your code, supplement it with at least a very brief README description, and specify the requirements and dependencies necessary to use the code.

Resources

  1. https://www.armeniapedia.org/wiki/Worldwide_Armenian_Church_Directory
  2. https://hy.wikipedia.org/wiki/Հայկական_տաճարների_և_եկեղեցիների_ցանկ
  3. https://ru.wikipedia.org/wiki/Список_армянских_храмов_по_странам
  4. https://ru.wikipedia.org/wiki/Список_армянских_храмов_России

Prepared by
The Open Data Armenia team prepared this task.

@vvbabayan vvbabayan changed the title Worldwide Armenian Churches Lists Extraction [EN] Worldwide Armenian Churches Lists Extraction Jun 6, 2023
@ivbeg ivbeg added extraction Task that require data extraction (scraping) skills topic-culture Tasks dedicatated Armenian culture, language and history labels Jun 6, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
extraction Task that require data extraction (scraping) skills topic-culture Tasks dedicatated Armenian culture, language and history
Projects
None yet
Development

No branches or pull requests

2 participants