Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Update README.md #15

Open
wants to merge 1 commit into
base: master
Choose a base branch
from
Open
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
61 changes: 34 additions & 27 deletions README.md
Original file line number Diff line number Diff line change
@@ -1,18 +1,18 @@
# Developing an African Hub and Search Portal for Scholarly Publishing
- Owned by the African Science community
- Distrbuted hosting on African territory
- Distributed hosting on African territory
- Open source
- Open access
- Free for users
- Free for individual users
- Paid services for institutional users

## Vision and Approach
African research output should be owned and hosted on African territory. We therefore propose here an African smart platform and portal for preprint uploads as well as the aggregation of African research output including:
- Scholarly books
- Journal articles
African research output should be owned and hosted on African territory. We therefore propose an African smart platform and portal for the aggregation of African research output including:
- African scholarly books
- Journal articles, preprints, datasets by African scholars and on topics related to Africa
- List of digital African scholarly journals
- …

The portal will be hosted by African research institutions in each of the 5 regions on the continent. To achieve this we will build a decentralized infrastructure with the following features:
The portal will be hosted centrally with an African research-related institution with miorror backups in each of the 5 regions on the continent. To achieve this we will build a decentralized infrastructure with the following features:
- Core distributed data engine with intelligent aggregation and search models
- The data platfrom components are hosted by institutions in African regions (EastA, WestA, NorthA, CentralA)
- A master data intance can be replicated to a cloud service as a matter of backup.
Expand All @@ -21,39 +21,46 @@ The portal will be hosted by African research institutions in each of the 5 regi


## End User Requirements:
- be able to search content published online with drop down selection and/or free chouce of keywords
- specific to African scholarly content
- ...<please add more from end user prespective i.e. how the end user would like to see the applications interface running as well as other features like speed of search, retirved content dsiplay...etc.>
- search content published online with drop down selection (preselections) and/or free choice of content by keywords
- specific to scholarly content by both African scholars and on topics related to the continent and its people
- lowest possible bandwith requirements, quick page load
- eye appealing frontend with African color scheme
- accessibility from every country on the continent (incl. sanctioned nations) to allow for academic freedom
- …


Preprint Repository | Hub and Search Portal
--- | ---
[AfricArxiv/preprint-repository](https://github.com/AfricArxiv/preprint-repository) | *as described here*
upload of preprint manuscripts, student reports, research proposals, registered reports/preregistrations, short communications, etc. | aggregating scholarly output from and about Africa via one interface inlc preprints form toher platforms, scholarly books, datasets for streamlined discoverability
based on PKP OPS (?) | Integrates via API of partner platforms but applying different search sechme atop of thier functions
DOI and CC-BY 4.0 attribution | Uses crowling and feature extraction for more accurate semantic context search
integration with Crossref, ORCID, … | Enbales much more search oprtions relvent to content pulish in relation to Africa as per the sources section below
upload of preprint manuscripts, student reports, research proposals, registered reports/preregistrations, short communications, etc. | aggregating scholarly output from and about Africa via one interface incl preprints form other platforms, scholarly books, datasets for streamlined discoverability | Integrates via API of partner platforms but applying different search scheme atop of their functions
DOI and CC-BY 4.0 attribution | Uses crawling and feature extraction for more accurate semantic context search
integration with Crossref, ORCID, … | Enbales much more search options relevant to content published in relation to Africa as per the sources section below


## Sources of Published Content

### Directories

- Direct publishing to the the hub by uploading the conent directly into the core data platfrom
Scraping content per the term 'Africa' and all by official 54 countries and authors'Ä affiliations at Africna institutions.
Connect to Authors' ORCID and identify African authors and their output

- contnent from all (currently 6) AfricArXiv partner platforms (OSF, Qeios, ScienceOpen, Figshare, Zenodo, PubPub)

- content from other preprint repositories,

- [DOAJ](https://doaj.org/search?ref=homepage-box&source=%7B%22query%22%3A%7B%22query_string%22%3A%7B%22query%22%3A%22africa*%22%2C%22default_operator%22%3A%22AND%22%7D%7D%7D)
- [DOAJ](https://doaj.org/search/journals?source=%7B%22query%22%3A%7B%22query_string%22%3A%7B%22query%22%3A%22africa%22%2C%22default_operator%22%3A%22AND%22%2C%22default_field%22%3A%22bibjson.keywords%22%7D%7D%2C%22size%22%3A50%2C%22sort%22%3A%5B%7B%22created_date%22%3A%7B%22order%22%3A%22desc%22%7D%7D%5D%7D)

- [BASEsearch](https://www.base-search.net/Search/Results?lookfor=africa*&name=&oaboost=1&newsearch=1&refid=dcbasen)

- Wikidata / Scholia - https://tools.wmflabs.org/scholia/faq

- [Open Knowledge Maps (uses BASE or PubMed)](https://openknowledgemaps.org/map/57bafb92fc16fbcae701e7ef81c77b0a) / possible to integrate the map?
- [Open Knowledge Maps (uses BASE and/or PubMed)](https://openknowledgemaps.org/map/57bafb92fc16fbcae701e7ef81c77b0a) / possible to integrate the map?

- PubMed / https://www.ncbi.nlm.nih.gov/pubmed/?term=Africa*

paywalled // therefore ignore??
paywalled content - if possible with integrations like Knowledge Unlatched and Unpaywall
- https://www.scopus.com/home.uri
- http://wokinfo.com/
- https://clarivate.com/webofsciencegroup/solutions/web-of-science/

#### Repositories
- https://www.connecting-africa.net/index.htm
Expand All @@ -77,13 +84,13 @@ paywalled // therefore ignore??
#### Identify by Author / Article / Publisher / Journal
In the advanced smart search based , crawling and extrating key features related to Africa from sources enables semantic context search with more accurat results. This should take place in follwoing main categories of search criteria or combination of them:

1) Content by African authors / co-authors who are based out in Africa
1) Content by African authors / co-authors who are based out in Africa // via [ORCID](https://orcid.org/)

2) Content by African authors / co-authors who are based outside Africa

3) Content by non-African authors / co-authors related to Africa

4) Content authored / co-authored by /f or African Institutions
4) Content authored / co-authored by /for African Institutions

5) Content authored / co-authored by / for one or more African region, city or country

Expand Down Expand Up @@ -121,21 +128,21 @@ The system consists of three main tiers: The Core Smart Data Platform, Source Da

## Infrastructure Requirements
- The solution should be devloped in incremental appraoch and capatize on contriantization technologies of rease of vertical and horniztal sclability.
- For the first development increment (Proof of Value phase), 5 servers - one at each location - is needed witht he following specifictions profile:
- For the first development increment (Proof of Value phase), 5 servers - one at each location - is needed with the following specifictions profile:
- Operating System: Ubuntu Linux
- CPU: Intel Xeon 8 Cores x64 3.x GHz
- Memory: 64 GB
- Disk: 2 TB SSD or All Flash
- Networking: Static IP / 10 Gb NIC and Server-grad Internt Connection for external access
- All other Data Center services
- We might adapte to lower specifictions for first phase if the above are not avilable.
- Identify at least one host institution in each African region that can host the AfricArXiv hub preferably with a data center that is decntly oprtionalixed and uses virtulaization or private cloud.
- Capciaty provject for coming 5 years should be provided down the road after completing the first phase so that sounding extimates can be made.
- The design and technical archiecture will be made for potetial scale out to other African hosting facilities with no need to rebuild or change the solution.
- We might adapt to lower specifictions for first phase if the above are not avilable.
- Identify at least one host institution in each African region that can host the AfricArXiv hub preferably with a data center that is decently opertionalized and uses virtulaization or private cloud.
- Capciaty project for coming 5 years should be provided down the road after completing the first phase so that sounding extimates can be made.
- The design and technical archicture will be made available open source for potential scaling to other African hosting facilities with no need to rebuild or change the solution.


### Using Public Cloud
We can use public cloud hosted and runing within Africa for the baocve main nodes and at the same time we can use a Public Cloud hosted anywhare on the world as a master copy backup for high avilbaility. Some potential services:
We can use public cloud hosted and running within Africa for the above main nodes and at the same time we can use a Public Cloud hosted anywhere on the world as a master copy backup for high accessibility. Potential services:
- Microsoft Azure’s services // already in SA
- Amazon Web Services (AWS) // announced to open servers in Africa
- Google Cloud // announced to open servers in Africa
Expand Down