Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

New term - recordedByID #102

Closed
timrobertson100 opened this issue Jul 24, 2015 · 37 comments
Closed

New term - recordedByID #102

timrobertson100 opened this issue Jul 24, 2015 · 37 comments

Comments

@timrobertson100
Copy link
Member

timrobertson100 commented Jul 24, 2015

New Term

Submitter: Tim Robertson
Justification: There is no way to identify individuals by e.g. ORCIDs
Proponents: GBIF (already in production), CETAF, DiSCCO
Definition: A list (concatenated and separated) of the globally unique identifiers for the person, people, groups, or organizations responsible for recording the original Occurrence.
Comment: Recommended best practice is to provide a single identifier that disambiguates the details of the identifying agent. If a list is used, it is recommended to separate the values in the list with space vertical bar space ( | ). The order of the identifiers on any list for this term can not be guaranteed to convey any semantics.
Examples: https://orcid.org/0000-0002-1825-0097 (for an individual); https://orcid.org/0000-0002-1825-0097 | https://orcid.org/0000-0002-1825-0098 (for a list of people).
Refines: None
Replaces: None
ABCD 2.06: not in ABCD

@baskaufs
Copy link

dwciri:recordedBy could be used for a single HTTP URI. But it wouldn't work for a list like this, since there is supposed to be a separate dwciri:recordedBy property for each person. Same with dwciri:identifiedBy as listed in Issue #101. See http://rs.tdwg.org/dwc/terms/guides/rdf/index.htm#2.5_Terms_in_the_dwciri:_namespace . The RDF guide was specifically designed to facilitiate RDF, but I don't think it was really discussed whether a dwciri: term could or should be used in something like a DwC archive that's full of strings to be parsed out.

@timrobertson100
Copy link
Member Author

To avoid confusion - and since this has been open for nearly 5 years - I will close this issue.

identifiedByID and recordedByID are both added in the GBIF namespace and indexed by GBIF.org. See http://rs.gbif.org/core/dwc_occurrence_2020-04-15.xml

Work is underway to create an AgentAction extension for Darwin Core archives to accommodate more expressive roles.

@tucotuco
Copy link
Member

Re-opening based on discussions about the valid uses of ID terms without using dwciri: versions, for example, here.

@baskaufs
Copy link

"ID" terms do not have dwciri: analogs for reasons given in section 2.6 of the RDF guide, so no effect in that namespace caused by the addition of this term.

@wouteraddink
Copy link

On behalf of CETAF ISTC (a group of informatics experts representing the CETAF and DiSSCo community in Europe): the group supports and recommends the implementation of this proposal, with the remark that they would like this implemented such, that the definition of identifiedByID and recordedByID support multiple references with a defined order.

@tucotuco
Copy link
Member

Could someone please confirm if there is a mapping to ABCD?

@tucotuco tucotuco added the Controversial The solution for the issue has not reached a consensus. label Apr 30, 2021
@nielsklazenga
Copy link
Member

ABCD 2 has no IDs for agents.

@tucotuco
Copy link
Member

tucotuco commented May 3, 2021

ABCD 2 has no IDs for agents.

Thanks @nielsklazenga Updated.

@dshorthouse
Copy link

Suggest change to definition as is align with what is presently shown to users of the IPT in demo mode:

An unordered list (concatenated and separated) of IDs representing names of people, groups, or organizations responsible for recording the original Occurrence. No semantics should be assumed, including for example an ordering of identifiers to indicate a primary collector or any institutional affiliation.

@tucotuco
Copy link
Member

tucotuco commented May 3, 2021

This comment also applies here.

@tdikow
Copy link

tdikow commented May 6, 2021

I strongly support the implementation of recordedByID and identifiedByID using ORCIDs.

@EstebanMH-SiB
Copy link

EstebanMH-SiB commented May 28, 2021

We endorse this proposal on behalf of @SiBColombia, adopting the last proposed changes of @dshorthouse

@tucotuco
Copy link
Member

Done.

@abubelinha
Copy link

abubelinha commented Mar 26, 2024

Could someone provide links to GBIF occurrence examples where recordedByID or identifiedByID contains an institutional identifier (or a group identifier)? (like a laboratory, research group, research project, expedition, museum, herbarium, university or whatever).

I was guessing those would mostly be Wikidata identifiers (example) but I just discovered there are also ORCID institutional IDs.
So I'd like to check how this is being used in GBIF occurrence datasets.
(@dshorthouse perhaps Bionomia is excluding this kind of institutional identifiers, so you might provide examples of some records containing them).

Related, but not the same:

Which are the proper DwC concepts to reflect source/destiny institutions which are exchanging specimens:

  • Any DwC concept to refer to the institution we received a specimen from (i.e. their staff collected its samples and later sent one of them to my institution)? I can know this either from labels, or by my own institution exchanges documentation.
  • Any DwC concept to refer the institutions we send samples of this specimen to? (my institution collected several samples, kept one and sent the others away -according to my own institution exchanges documentation-). I'd expect a DwC field were I can concatenate several institution ids.

I never used it but I am guessing the Darwin Core Resource Relationship extension might be the answer.
But as I said, never heard of this info being provided to GBIF so I'd like to find examples of how other providers are handling these particular subject. Is there any standardized or proposed way of doing it?

Thanks a lot for any hints.
@abubelinha

@dshorthouse
Copy link

@abubelinha I'm not aware of any examples of institutional IDs used in either recordedByID or identifiedByID. There are ~17k unique values for these in the downloads from GBIF that Bionomia processes (most recent example: https://doi.org/10.15468/dl.tb97qj). There are heaps of non-identifier values here (eg integers, dates), many malformed URIs, but the majority are ORCIDs and wikidata entity URIs. I suppose some of the wikidata variety could be organizations, but Bionomia does in fact resolve these and limits to "instance of human".

I believe ORCID recommends use of identifiers like RoR to declare one's affiliation(s), but does not itself generate ORCID-like IDs for organizations.

While I recognize that an institution ID can be used in recordedByID, I'm a little less clear on reasons for similar in identifiedByID despite its definition.

@abubelinha
Copy link

abubelinha commented Mar 26, 2024

I suppose some of the wikidata variety could be organizations, but Bionomia does in fact resolve these and limits to "instance of human"

Thanks a lot @dshorthouse
That's what I guess too, so I was wondering if those excluded identifiers did in fact contain some kind of instance of "organization", "museum", "herbarium" or similar.

While I recognize that an institution ID can be used in recordedByID, I'm a little less clear on reasons for similar in identifiedByID despite its definition

I have occasionally found specimens received in exchange, where identification labels are not signed by a named person but a group or lab. Some examples with images show up in this GBIF search.
In their labels, identifiedBy "SPIP" means something like "Permanent Plant Identification Seminar".

Only in one of those occurrences the "related records" tab reveals those SPIP-identified specimens were collected by another institution which later distributed samples worlwide.
And this relates to my second question: do the receptor herbaria have a recommended DwC field suitable to indicate the institutional provenance of the specimens?

The problem with ROR is that not that many GBIF providers got one (I mean identifiers for small museums or herbaria themselves, not for the universities they belong to). When searching herbarium only 3 are returned by ROR (all of them in Australia).
Actually, ROR clearly states it is primarily focused on identifying and listing global “high-level” organizations.

But I am willing to provide IDs for groups like the aforementioned "SPIP", or -more commonly- collaborative multi-institutional research projects which sometimes appear in provider data as a identifiedBy value (see examples of "Flora Zambesiaca", "Flora of North America" and others like that).

@tucotuco
Copy link
Member

Which are the proper DwC concepts to reflect source/destiny institutions which are exchanging specimens:

  • Any DwC concept to refer to the institution we received a specimen from (i.e. their staff collected its samples and later sent one of them to my institution)? I can know this either from labels, or by my own institution exchanges documentation.

There are two related fields that do not exactly cover what you area asking and a third that is a solution to what you are asking.

The related fields are ownerInstitutionCode and otherCatalogNumbers. The ownerInstitutionCode term only works to designate a different source if that source is also the owner, so not a complete solution. The otherCatalogNumbers can contain information about all the other institutions that have a catalog (accession in Botany) number for the same organism, but it is not just the institution, and doesn't help if you don't have the source's catalogNumber.

The third term that will definitely work is dynamicProperties, in which you can put a key:value pair to capture the source institution with something like

{"received from":"UCJEPS"}

This doesn't lend itself to the same ease of searchability as a Darwin Core term, because the key itself isn't a standard (the community would have to be very careful about using the same key to mean "received from"), plus the data may be in a JSON string with lots of key:value pairs.

  • Any DwC concept to refer the institutions we send samples of this specimen to? (my institution collected several samples, kept one and sent the others away -according to my own institution exchanges documentation-). I'd expect a DwC field were I can concatenate several institution ids.

The otherCatalogNumbers term could work for this as well, with the same limitations and caveats about the destination as about the source described above.

The dynamicProperties term could work for this too, and thereby distinguish a source institution from destination institutions with something like

{"sample sent to":"NYBG | K"}

where the '|' is used to separate values in a list.

@matdillen
Copy link

@abubelinha We did a breakdown of the usage of dwc:recordedByID and dwc:identifiedByID on GBIF last year. You can find a preprint on some of this work here. We didn't spot any institutional IDs used in these two fields on GBIF at the time, but it's possible some ORCIDs or Wikidata IDs (the most commonly used) are not for people. There is a lot of strange data in the other category, but nothing that suggests it represents an institution rather than one or more persons.

I know @dshorthouse that you subset ORCID (based on keywords) for Bionomia, so maybe the ORCIDs submitted to GBIF but not found in your subset (if any) may shed some more light?

@MattBlissett
Copy link
Member

I've run a couple of queries and other than possible Wikidata IDs (as Mat mentions) nothing stands out:

10 random recordedById values by host, excluding invalid URLs, first column is gbifid:

4073291655      http://isni.org/isni/0000000058766018
1839818202      http://isni.org/isni/0000000059579150
4072430396      http://isni.org/isni/0000000059579150
1839508092      http://isni.org/isni/0000000066861023
1840356777      http://isni.org/isni/0000000066861023
2243256077      http://isni.org/isni/0000000066861023
4073142767      http://isni.org/isni/000000036014173X
4073204912      http://isni.org/isni/000000036014173X
4073286907      http://isni.org/isni/000000036014173X
4072928885      http://isni.org/isni/000000036014173X
2595671862      http://purl.oclc.org/net/edu.harvard.huh/guid/uuid/643da772-fa77-4f10-9b83-60c5f28902ec
1839712972      http://purl.oclc.org/net/edu.harvard.huh/guid/uuid/861a57cc-29df-40ad-a2ed-86ad442a825f
4072468250      http://purl.oclc.org/net/edu.harvard.huh/guid/uuid/8faac441-8d05-4bab-9c22-c97cd7df1145
4073027757      http://purl.oclc.org/net/edu.harvard.huh/guid/uuid/8faac441-8d05-4bab-9c22-c97cd7df1145
4072495800      http://purl.oclc.org/net/edu.harvard.huh/guid/uuid/8faac441-8d05-4bab-9c22-c97cd7df1145
1840779045      http://purl.oclc.org/net/edu.harvard.huh/guid/uuid/8faac441-8d05-4bab-9c22-c97cd7df1145
4072857499      http://purl.oclc.org/net/edu.harvard.huh/guid/uuid/8faac441-8d05-4bab-9c22-c97cd7df1145
1840436947      http://purl.oclc.org/net/edu.harvard.huh/guid/uuid/dbff9ce1-dc7d-4405-91d6-99e920fb4cbe
1839458239      http://purl.oclc.org/net/edu.harvard.huh/guid/uuid/f5ea5399-24c6-4ad9-a04e-c24c3f215bae
2243199571      http://purl.oclc.org/net/edu.harvard.huh/guid/uuid/f5ea5399-24c6-4ad9-a04e-c24c3f215bae
1840432908      http://viaf.org/viaf/177192477
1840432888      http://viaf.org/viaf/197991916
1840432928      http://viaf.org/viaf/284547667
1840432805      http://viaf.org/viaf/289994763
1065317620      http://viaf.org/viaf/305805111
1840432611      http://viaf.org/viaf/309670836
1840432664      http://viaf.org/viaf/33914417
1839742164      http://viaf.org/viaf/36967258
1840432656      http://viaf.org/viaf/55405855
1840433257      http://viaf.org/viaf/78028129
1839479194      http://www.ipni.org/ipni/idAuthorSearch.do?id=10654-1
4073275422      http://www.ipni.org/ipni/idAuthorSearch.do?id=12789-1
1840740817      http://www.ipni.org/ipni/idAuthorSearch.do?id=16638-1
4073086152      http://www.ipni.org/ipni/idAuthorSearch.do?id=2063-1
1840848681      http://www.ipni.org/ipni/idAuthorSearch.do?id=24349-1
1839923154      http://www.ipni.org/ipni/idAuthorSearch.do?id=4364-1
4072525521      http://www.ipni.org/ipni/idAuthorSearch.do?id=6701-1
1839970624      http://www.ipni.org/ipni/idAuthorSearch.do?id=7005-1
4073061488      http://www.ipni.org/ipni/idAuthorSearch.do?id=8012-1
4073109060      http://www.ipni.org/ipni/idAuthorSearch.do?id=8012-1
2598789389      http://www.wikidata.com/entity/Q21516841
2598863568      http://www.wikidata.com/entity/Q21516841
2438019024      http://www.wikidata.org/entity/Q1175070
2437941541      http://www.wikidata.org/entity/Q16065577
2437637261      http://www.wikidata.org/entity/Q226071
2436121864      http://www.wikidata.org/entity/Q23765917
2436098009      http://www.wikidata.org/entity/Q2854893
2438019014      http://www.wikidata.org/entity/Q36621784
1840148447      http://www.wikidata.org/entity/Q62990323
1840148406      http://www.wikidata.org/entity/Q62990431
2436121857      http://www.wikidata.org/entity/Q708002
2436121860      http://www.wikidata.org/entity/Q72899
3003829321      https://cl.linkedin.com/in/lina-mar%C3%ADa-prieto-mart%C3%ADnez-52028a37
3003829302      https://cl.linkedin.com/in/lina-mar%C3%ADa-prieto-mart%C3%ADnez-52028a37
3003829323      https://cl.linkedin.com/in/lina-mar%C3%ADa-prieto-mart%C3%ADnez-52028a37
3003829322      https://cl.linkedin.com/in/lina-mar%C3%ADa-prieto-mart%C3%ADnez-52028a37
3003829309      https://cl.linkedin.com/in/lina-mar%C3%ADa-prieto-mart%C3%ADnez-52028a37
3003829314      https://cl.linkedin.com/in/lina-mar%C3%ADa-prieto-mart%C3%ADnez-52028a37
3003829304      https://cl.linkedin.com/in/lina-mar%C3%ADa-prieto-mart%C3%ADnez-52028a37
3003829305      https://cl.linkedin.com/in/lina-mar%C3%ADa-prieto-mart%C3%ADnez-52028a37
3003829301      https://cl.linkedin.com/in/lina-mar%C3%ADa-prieto-mart%C3%ADnez-52028a37
3003829303      https://cl.linkedin.com/in/lina-mar%C3%ADa-prieto-mart%C3%ADnez-52028a37
4440443315      https://co.linkedin.com/in/javier-francisco-caicedo-moncada-795348182
4440443417      https://co.linkedin.com/in/javier-francisco-caicedo-moncada-795348182
4440443418      https://co.linkedin.com/in/javier-francisco-caicedo-moncada-795348182
4440443420      https://co.linkedin.com/in/javier-francisco-caicedo-moncada-795348182
4440443422      https://co.linkedin.com/in/javier-francisco-caicedo-moncada-795348182
4440443434      https://co.linkedin.com/in/javier-francisco-caicedo-moncada-795348182
4500022405      https://co.linkedin.com/in/josé-luis-pastrana-sánchez-17498b149
4421653145      https://co.linkedin.com/in/m%C3%B3nica-andrea-novoa-salamanca-b74658101
4421652480      https://co.linkedin.com/in/m%C3%B3nica-andrea-novoa-salamanca-b74658101
4421652973      https://co.linkedin.com/in/m%C3%B3nica-andrea-novoa-salamanca-b74658101
2243155810      https://kiki.huh.harvard.edu/databases/botanist_search.php?mode=details&id=11009
1839933905      https://kiki.huh.harvard.edu/databases/botanist_search.php?mode=details&id=18092
1839732063      https://kiki.huh.harvard.edu/databases/botanist_search.php?mode=details&id=19073
4073035210      https://kiki.huh.harvard.edu/databases/botanist_search.php?mode=details&id=2103
1839869202      https://kiki.huh.harvard.edu/databases/botanist_search.php?mode=details&id=32223
1839394773      https://kiki.huh.harvard.edu/databases/botanist_search.php?mode=details&id=39911
1839731092      https://kiki.huh.harvard.edu/databases/botanist_search.php?mode=details&id=42793
2243228829      https://kiki.huh.harvard.edu/databases/botanist_search.php?mode=details&id=44949
1839869190      https://kiki.huh.harvard.edu/databases/botanist_search.php?mode=details&id=45074
1839869491      https://kiki.huh.harvard.edu/databases/botanist_search.php?mode=details&id=47671
919154393       https://orcid.org/0000-0001-5081-465X
919154353       https://orcid.org/0000-0001-9330-6233
919154364       https://orcid.org/0000-0001-9925-661X
919154421       https://orcid.org/0000-0001-9925-661X
919154365       https://orcid.org/0000-0003-4524-0617
919154353       https://orcid.org/0000-0003-4524-0617
919154428       https://orcid.org/0000-0003-4524-0617
919154503       https://orcid.org/0000-0003-4524-0617
919154370       https://orcid.org/0000-0003-4524-0617
919154363       https://orcid.org/0000-0003-4524-0617
4022468304      https://scholar.google.com.co/citations?user=pHdPix4AAAAJ&hl=en
3855696337      https://scholar.google.com.co/citations?user=rJL9yY4AAAAJ&hl=en
3855696338      https://scholar.google.com.co/citations?user=rJL9yY4AAAAJ&hl=en
3855696322      https://scholar.google.com.co/citations?user=rJL9yY4AAAAJ&hl=en
3855696314      https://scholar.google.com.co/citations?user=rJL9yY4AAAAJ&hl=en
3855696319      https://scholar.google.com.co/citations?user=rJL9yY4AAAAJ&hl=en
3855696335      https://scholar.google.com.co/citations?user=rJL9yY4AAAAJ&hl=en
3855696344      https://scholar.google.com.co/citations?user=rJL9yY4AAAAJ&hl=en
3855696308      https://scholar.google.com.co/citations?user=rJL9yY4AAAAJ&hl=en
3855696370      https://scholar.google.com.co/citations?user=rJL9yY4AAAAJ&hl=en
4178125371      https://scholar.google.com/citations?hl=uk&user=oqfvHtMAAAAJ
4178125465      https://scholar.google.com/citations?hl=uk&user=oqfvHtMAAAAJ
4178125422      https://scholar.google.com/citations?hl=uk&user=oqfvHtMAAAAJ
4178125352      https://scholar.google.com/citations?hl=uk&user=oqfvHtMAAAAJ
4178125507      https://scholar.google.com/citations?hl=uk&user=oqfvHtMAAAAJ
4178125510      https://scholar.google.com/citations?hl=uk&user=oqfvHtMAAAAJ
4178125360      https://scholar.google.com/citations?hl=uk&user=oqfvHtMAAAAJ
4178125449      https://scholar.google.com/citations?hl=uk&user=oqfvHtMAAAAJ
4135675348      https://scholar.google.com/citations?user=C_Q4DpEAAAAJ&hl=en
4135675341      https://scholar.google.com/citations?user=C_Q4DpEAAAAJ&hl=en
3890565301      https://scholar.google.es/citations?hl=es&user=A6YJUHYAAAAJ
3890565303      https://scholar.google.es/citations?hl=es&user=A6YJUHYAAAAJ
3890565342      https://scholar.google.es/citations?hl=es&user=A6YJUHYAAAAJ
3890565304      https://scholar.google.es/citations?hl=es&user=A6YJUHYAAAAJ
3890565343      https://scholar.google.es/citations?hl=es&user=A6YJUHYAAAAJ
3913924347      https://scholar.google.es/citations?user=BhnC-KUAAAAJ&hl=es
3913924372      https://scholar.google.es/citations?user=BhnC-KUAAAAJ&hl=es
3913924371      https://scholar.google.es/citations?user=BhnC-KUAAAAJ&hl=es
3913924349      https://scholar.google.es/citations?user=BhnC-KUAAAAJ&hl=es
3913924348      https://scholar.google.es/citations?user=BhnC-KUAAAAJ&hl=es
3805035256      https://scienti.minciencias.gov.co/cvlac/visualizador/generarCurriculoCv.do?cod_rh=0000849928
3805035567      https://scienti.minciencias.gov.co/cvlac/visualizador/generarCurriculoCv.do?cod_rh=0000849928
3805035261      https://scienti.minciencias.gov.co/cvlac/visualizador/generarCurriculoCv.do?cod_rh=0000849928
3805034346      https://scienti.minciencias.gov.co/cvlac/visualizador/generarCurriculoCv.do?cod_rh=0000849928
3805035514      https://scienti.minciencias.gov.co/cvlac/visualizador/generarCurriculoCv.do?cod_rh=0000849928
3805035273      https://scienti.minciencias.gov.co/cvlac/visualizador/generarCurriculoCv.do?cod_rh=0000849928
3805034297      https://scienti.minciencias.gov.co/cvlac/visualizador/generarCurriculoCv.do?cod_rh=0000849928
3805035266      https://scienti.minciencias.gov.co/cvlac/visualizador/generarCurriculoCv.do?cod_rh=0000849928
3805035226      https://scienti.minciencias.gov.co/cvlac/visualizador/generarCurriculoCv.do?cod_rh=0000849928
3805035589      https://scienti.minciencias.gov.co/cvlac/visualizador/generarCurriculoCv.do?cod_rh=0000849928
1701609520      https://wjww.wikidata.org/wiki/Q55026408
1094986719      https://wjww.wikidata.org/wiki/Q55026408
1094990544      https://wjww.wikidata.org/wiki/Q55026408
1702070861      https://wjww.wikidata.org/wiki/Q55026408
1839959191      https://www.biodiversitylibrary.org/creator/195875
4072714510      https://www.biodiversitylibrary.org/creator/231301
4072916871      https://www.biodiversitylibrary.org/creator/231301
4072978317      https://www.biodiversitylibrary.org/creator/231301
4073213348      https://www.biodiversitylibrary.org/creator/231301
4072382394      https://www.biodiversitylibrary.org/creator/231301
4072943555      https://www.biodiversitylibrary.org/creator/231301
4073123937      https://www.biodiversitylibrary.org/creator/70522
4072458027      https://www.biodiversitylibrary.org/creator/70522
4072667001      https://www.biodiversitylibrary.org/creator/70522
4513341902      https://www.linkedin.com/in/cristian-camilo-gonzalez-aguas/
4513342886      https://www.linkedin.com/in/cristian-camilo-gonzalez-aguas/
4026839516      https://www.linkedin.com/in/darwin-moreno-echeverry-91566147/?originalSubdomain=co
4026839517      https://www.linkedin.com/in/darwin-moreno-echeverry-91566147/?originalSubdomain=co
4026839514      https://www.linkedin.com/in/darwin-moreno-echeverry-91566147/?originalSubdomain=co
4026839515      https://www.linkedin.com/in/darwin-moreno-echeverry-91566147/?originalSubdomain=co
4026839513      https://www.linkedin.com/in/darwin-moreno-echeverry-91566147/?originalSubdomain=co
4431351172      https://www.linkedin.com/in/jhonny-riay-b0aa698b/
3802163416      https://www.linkedin.com/in/m%C3%B3nica-andrea-novoa-salamanca-b74658101/?originalSubdomain=co
4513342885      https://www.linkedin.com/mwlite/in/jorge-humberto-garcia-concha-2a98351b2
4077357417      https://www.researchgate.net/profile/David-Luna-Sarmiento
4077357410      https://www.researchgate.net/profile/David-Luna-Sarmiento
4077357411      https://www.researchgate.net/profile/David-Luna-Sarmiento
4077357412      https://www.researchgate.net/profile/David-Luna-Sarmiento
4077357413      https://www.researchgate.net/profile/David-Luna-Sarmiento
4077357414      https://www.researchgate.net/profile/David-Luna-Sarmiento
4077357415      https://www.researchgate.net/profile/David-Luna-Sarmiento
4077357416      https://www.researchgate.net/profile/David-Luna-Sarmiento
4077357409      https://www.researchgate.net/profile/David-Luna-Sarmiento
4077357408      https://www.researchgate.net/profile/David-Luna-Sarmiento
4068377746      https://www.xing.com/profile/EfrainAlfonso_RubioRincon
4432883602      https://www.xing.com/profile/EfrainAlfonso_RubioRincon
4432883647      https://www.xing.com/profile/EfrainAlfonso_RubioRincon
4068377577      https://www.xing.com/profile/EfrainAlfonso_RubioRincon
4068377438      https://www.xing.com/profile/EfrainAlfonso_RubioRincon
4432884854      https://www.xing.com/profile/EfrainAlfonso_RubioRincon
4068377741      https://www.xing.com/profile/EfrainAlfonso_RubioRincon
4432885593      https://www.xing.com/profile/EfrainAlfonso_RubioRincon
4068377560      https://www.xing.com/profile/EfrainAlfonso_RubioRincon
4068377641      https://www.xing.com/profile/EfrainAlfonso_RubioRincon
4072833432      https://zoobank.org/Authors/25675B38-32BF-400B-B096-3CC875AAC6C7
4072731872      https://zoobank.org/Authors/25675B38-32BF-400B-B096-3CC875AAC6C7
4072991598      https://zoobank.org/Authors/25675B38-32BF-400B-B096-3CC875AAC6C7
4072702453      https://zoobank.org/Authors/25675B38-32BF-400B-B096-3CC875AAC6C7
4073314809      https://zoobank.org/Authors/25675B38-32BF-400B-B096-3CC875AAC6C7
1839411369      https://zoobank.org/Authors/25675B38-32BF-400B-B096-3CC875AAC6C7
4072698047      https://zoobank.org/Authors/25675B38-32BF-400B-B096-3CC875AAC6C7
4072903013      https://zoobank.org/Authors/25675B38-32BF-400B-B096-3CC875AAC6C7
1839512756      https://zoobank.org/Authors/25675B38-32BF-400B-B096-3CC875AAC6C7
4072716592      https://zoobank.org/Authors/25675B38-32BF-400B-B096-3CC875AAC6C7

And the same for identifiedById:

899065877       http://viaf.org/viaf/311203376
863261651       http://viaf.org/viaf/311203376
476792088       http://viaf.org/viaf/34530735
476575801       http://viaf.org/viaf/39550856
476584062       http://viaf.org/viaf/39550856
476762812       http://viaf.org/viaf/74174669
476761991       http://viaf.org/viaf/74174669
476790689       http://viaf.org/viaf/74174669
476791424       http://viaf.org/viaf/74174669
1677373101      http://viaf.org/viaf/8577002
2598793558      http://www.wikidata.com/entity/Q5931521
2598831082      http://www.wikidata.com/entity/Q5931521
2598817301      http://www.wikidata.com/entity/Q5931521
2598799915      http://www.wikidata.com/entity/Q5931521
2598846976      http://www.wikidata.com/entity/Q5931521
2598811381      http://www.wikidata.com/entity/Q5931521
2598819360      http://www.wikidata.com/entity/Q5931521
2598838702      http://www.wikidata.com/entity/Q5931521
2598796422      http://www.wikidata.com/entity/Q5931521
2598801478      http://www.wikidata.com/entity/Q5931521
1701847057      http://www.wikidata.org/entity/Q101069270
1702212587      http://www.wikidata.org/entity/Q101069287
1702212641      http://www.wikidata.org/entity/Q101069300
1701416798      http://www.wikidata.org/entity/Q107969725
1702212611      http://www.wikidata.org/entity/Q11959947
1701847099      http://www.wikidata.org/entity/Q15997368
1702212619      http://www.wikidata.org/entity/Q21609912
1701635169      http://www.wikidata.org/entity/Q5615062
1702212561      http://www.wikidata.org/entity/Q7360815
1702212606      http://www.wikidata.org/entity/Q94819744
4143036302      https://0000-0002-0847-4724
4143036301      https://0000-0002-0847-4725
4143036304      https://0000-0002-0847-4726
4143036303      https://0000-0002-0847-4727
4143036305      https://0000-0002-0847-4728
4142879312      https://0000-0002-1624-3205
4142879436      https://0000-0002-1624-3205
4142879313      https://0000-0002-1624-3205
4142879314      https://0000-0002-1624-3205
4142879326      https://0000-0002-1624-3205
4142879407      https://0000-0002-1624-3205
4142879329      https://0000-0002-1624-3205
4142879331      https://0000-0002-1624-3205
4142879310      https://0000-0002-1624-3205
4142879311      https://0000-0002-1624-3205
3381592436      https://commons.wikimedia.org/wiki/User:AfroBrazilian
3802158338      https://orcid.org/0000-0001-9985-9250
3802110321      https://orcid.org/0000-0001-9985-9250
3801590362      https://orcid.org/0000-0001-9985-9250
3802109385      https://orcid.org/0000-0001-9985-9250
3802162352      https://orcid.org/0000-0001-9985-9250
3801590364      https://orcid.org/0000-0001-9985-9250
3801585373      https://orcid.org/0000-0001-9985-9250
3801630116      https://orcid.org/0000-0001-9985-9250
3802162353      https://orcid.org/0000-0001-9985-9250
3801589356      https://orcid.org/0000-0001-9985-9250
4417959434      https://scholar.google.com.co/citations?user=7cbyqOYAAAAJ&hl=en
4417960116      https://scholar.google.com.co/citations?user=7cbyqOYAAAAJ&hl=en
4417959908      https://scholar.google.com.co/citations?user=7cbyqOYAAAAJ&hl=en
4417961913      https://scholar.google.com.co/citations?user=7cbyqOYAAAAJ&hl=en
4417961815      https://scholar.google.com.co/citations?user=7cbyqOYAAAAJ&hl=en
4417960437      https://scholar.google.com.co/citations?user=7cbyqOYAAAAJ&hl=en
4417959429      https://scholar.google.com.co/citations?user=7cbyqOYAAAAJ&hl=en
4417959807      https://scholar.google.com.co/citations?user=7cbyqOYAAAAJ&hl=en
4417961795      https://scholar.google.com.co/citations?user=7cbyqOYAAAAJ&hl=en
4022466301      https://scholar.google.com.co/citations?user=pHdPix4AAAAJ&hl=en
3830581392      https://scholar.google.com/citations?user=68BMF74AAAAJ&hl=es
3830581393      https://scholar.google.com/citations?user=68BMF74AAAAJ&hl=es
3830581389      https://scholar.google.com/citations?user=68BMF74AAAAJ&hl=es
3830581390      https://scholar.google.com/citations?user=68BMF74AAAAJ&hl=es
3830581391      https://scholar.google.com/citations?user=68BMF74AAAAJ&hl=es
3830581387      https://scholar.google.com/citations?user=68BMF74AAAAJ&hl=es
3830581388      https://scholar.google.com/citations?user=68BMF74AAAAJ&hl=es
3830581386      https://scholar.google.com/citations?user=68BMF74AAAAJ&hl=es
3830581636      https://scholar.google.com/citations?user=68BMF74AAAAJ&hl=es
3830581394      https://scholar.google.com/citations?user=68BMF74AAAAJ&hl=es
3705306319      https://scholar.google.es/citations?user=LSf4LOYAAAAJ&hl=es
3460380372      https://scholar.google.es/citations?user=ZpbTcu8AAAAJ&hl=es
3460380349      https://scholar.google.es/citations?user=ZpbTcu8AAAAJ&hl=es
3460380432      https://scholar.google.es/citations?user=ZpbTcu8AAAAJ&hl=es
3460380331      https://scholar.google.es/citations?user=ZpbTcu8AAAAJ&hl=es
3460380371      https://scholar.google.es/citations?user=ZpbTcu8AAAAJ&hl=es
3460380428      https://scholar.google.es/citations?user=ZpbTcu8AAAAJ&hl=es
3460380373      https://scholar.google.es/citations?user=ZpbTcu8AAAAJ&hl=es
3460380336      https://scholar.google.es/citations?user=ZpbTcu8AAAAJ&hl=es
3460380345      https://scholar.google.es/citations?user=ZpbTcu8AAAAJ&hl=es
3805035300      https://scienti.minciencias.gov.co/cvlac/visualizador/generarCurriculoCv.do?cod_rh=0000849928
3805034284      https://scienti.minciencias.gov.co/cvlac/visualizador/generarCurriculoCv.do?cod_rh=0000849928
3805035560      https://scienti.minciencias.gov.co/cvlac/visualizador/generarCurriculoCv.do?cod_rh=0000849928
3805035211      https://scienti.minciencias.gov.co/cvlac/visualizador/generarCurriculoCv.do?cod_rh=0000849928
3805035260      https://scienti.minciencias.gov.co/cvlac/visualizador/generarCurriculoCv.do?cod_rh=0000849928
3805034361      https://scienti.minciencias.gov.co/cvlac/visualizador/generarCurriculoCv.do?cod_rh=0000849928
3805035209      https://scienti.minciencias.gov.co/cvlac/visualizador/generarCurriculoCv.do?cod_rh=0000849928
3805035249      https://scienti.minciencias.gov.co/cvlac/visualizador/generarCurriculoCv.do?cod_rh=0000849928
3805035213      https://scienti.minciencias.gov.co/cvlac/visualizador/generarCurriculoCv.do?cod_rh=0000849928
3805034357      https://scienti.minciencias.gov.co/cvlac/visualizador/generarCurriculoCv.do?cod_rh=0000849928
1701609520      https://wjww.wikidata.org/wiki/Q55026408
1702070861      https://wjww.wikidata.org/wiki/Q55026408
1094986719      https://wjww.wikidata.org/wiki/Q55026408
1094990544      https://wjww.wikidata.org/wiki/Q55026408
4400827329      https://www.Wikidata.org/wiki/Q19079059
4400826162      https://www.Wikidata.org/wiki/Q2824740
4400826190      https://www.Wikidata.org/wiki/Q2824740
4400826275      https://www.Wikidata.org/wiki/Q2824740
4400827311      https://www.Wikidata.org/wiki/Q2824740
4400825859      https://www.Wikidata.org/wiki/Q44691
4400827578      https://www.Wikidata.org/wiki/Q5909629
4400825358      https://www.Wikidata.org/wiki/Q5973883
4400827604      https://www.Wikidata.org/wiki/Q5973883
4400826187      https://www.Wikidata.org/wiki/Q59940671
4429437196      https://www.linkedin.com/in/jhonny-riay-b0aa698b/
4429434740      https://www.linkedin.com/in/jhonny-riay-b0aa698b/
4429437194      https://www.linkedin.com/in/jhonny-riay-b0aa698b/
4429434739      https://www.linkedin.com/in/jhonny-riay-b0aa698b/
4429436403      https://www.linkedin.com/in/jhonny-riay-b0aa698b/
4429434738      https://www.linkedin.com/in/jhonny-riay-b0aa698b/
4429434737      https://www.linkedin.com/in/jhonny-riay-b0aa698b/
4429434736      https://www.linkedin.com/in/jhonny-riay-b0aa698b/
4429434735      https://www.linkedin.com/in/jhonny-riay-b0aa698b/
4429437192      https://www.linkedin.com/in/jhonny-riay-b0aa698b/
4077357341      https://www.researchgate.net/profile/David-Luna-Sarmiento
4077357340      https://www.researchgate.net/profile/David-Luna-Sarmiento
4077357343      https://www.researchgate.net/profile/David-Luna-Sarmiento
4077357345      https://www.researchgate.net/profile/David-Luna-Sarmiento
4077357344      https://www.researchgate.net/profile/David-Luna-Sarmiento
4077357342      https://www.researchgate.net/profile/David-Luna-Sarmiento
3705117686      https://www.researchgate.net/profile/Norida-Lucia-Marin-Canchala
3705112351      https://www.researchgate.net/profile/Norida-Lucia-Marin-Canchala
3705112355      https://www.researchgate.net/profile/Norida-Lucia-Marin-Canchala
1932890716      https://www.researchgate.net/profile/Norida-Lucia-Marin-Canchala
4432884634      https://www.xing.com/profile/EfrainAlfonso_RubioRincon
4432884629      https://www.xing.com/profile/EfrainAlfonso_RubioRincon
4432884603      https://www.xing.com/profile/EfrainAlfonso_RubioRincon
4432884601      https://www.xing.com/profile/EfrainAlfonso_RubioRincon
4432884599      https://www.xing.com/profile/EfrainAlfonso_RubioRincon
4068377372      https://www.xing.com/profile/EfrainAlfonso_RubioRincon
4432884014      https://www.xing.com/profile/EfrainAlfonso_RubioRincon
4432884011      https://www.xing.com/profile/EfrainAlfonso_RubioRincon
4432883303      https://www.xing.com/profile/EfrainAlfonso_RubioRincon
4432884631      https://www.xing.com/profile/EfrainAlfonso_RubioRincon

The SQL for my own reference:

CREATE TABLE matt.rids AS 
  SELECT gbifid, rid 
  FROM occurrence 
  LATERAL VIEW explode(recordedbyid) ridTable AS rid 
  WHERE size(recordedbyid) > 0
  AND datasetkey != '50c9509d-22c7-4a22-a47d-8c48425ef4a7';

WITH g_rids AS (
  SELECT 
    ROW_NUMBER() OVER(PARTITION BY PARSE_URL(rid, 'HOST')) AS row_num,
    rid,
    gbifid
  FROM matt.rids)
SELECT gbifid, rid FROM g_rids WHERE g_rids.row_num <= 10 ORDER BY rid;

@dshorthouse
Copy link

Thanks @MattBlissett. Interesting that there's apparent appetite for inclusion of commercial, for-profit entities like LinkedIn, Google Scholar, Xing, and ResearchGate as through they were identity providers.

@abubelinha
Copy link

abubelinha commented Mar 27, 2024

Thanks to all of you for looking into this and providing so interesting answers.
I was curious about providers' current usage of IDs for groups of people, which seems to be pretty uncommon for now.


I am surprised by the answer to my other question (thanks @tucotuco).
I expected DwC to already provide some standard way of referring to institutions exchanging specimens (when we just know the institution/s, but not the catalogNumber/s they assigned to the specimen/s).

I tried a GBIF facet search for ownerInstitutionCode, but it must be a non-searchable concept since nothing comes out. So I can't see examples of how it is currently being used:

  • When citing loaned specimens? (i.e. a dataset/datapaper of a given taxonomic group revision).
    But in that case what would the problem if using institutionCode instead?
  • When a full collection is on long-duration loan to another institution? (building renovation, inundation, war ...), so the receptor assigns its own code for proper management of those specimens

Anyway it is not suitable for reflecting exchange of duplicated specimens (where all concerned institutions own one of them).
dynamicProperties might be. How can search GBIF for datasets which are actually using it and take a look?
As @tucotuco said, this wouldn't be an standardized/searchable way of referring to other institutions.
I think the scenario is not that uncommon (knowing that a specimen has duplicates in other institutions, but not knowing their otherCatalogNumbers).

Having the option of providing / searching this institutional info would improve a lot our possibilities of linking those specimens (if I can search GBIF and download a table of foreign institutions' specimens which cite my own institution, it will be much easier for me to join them against our own datasets using i.e. scientificName, fuzzy collector & date ... and then when republishing our datasets I could provide lots of otherCatalogNumbers constructed from that previous GBIF download).

Should I create an issue to ask if there is room for this in DwC?

Thanks a lot to you all again

@tucotuco
Copy link
Member

I would say that, before creating issues for new terms, have a look at GBIF clustering (example) to see if that already satisfies the use cases you are thinking of. GBIF is able to suggest specimens that are likely to from the same Organism in the same collecting Event by using the matching tricks you mentioned. What it wouldn't cover are references to specimens elsewhere whose data have not been shared via GBIF.

@abubelinha
Copy link

abubelinha commented Mar 29, 2024

Thanks @tucotuco , I am already a big fan of GBIF clustering (I had linked an example of "related records" tab usage above too).

But I am also very interested in people helping to establish those cluster relationships too.
Apart from possible ortographical variations among duplicate specimens (toponyms and collectors spelling: "Linné, C." vs "Carolus Linnaeus" or "Manhattan, NY, US" vs "Estados Unidos: Nueva York, Manhattan") a big difficult for automatic clustering is to catch duplicates where taxonomic identifications differ.

And that's the most important use case for me: I want to be aware of those changes in our collection's duplicates, and I want to give other curators the chance to be aware of our taxonomic revisions.
IMHO having pointers from specimens to institutions would be a huge help in this task.

@tucotuco
Copy link
Member

@abubelinha OK, great. In case you are not aware, the means to recommend changes and additions to Darwin Core is explained in the Darwin Core Guidelines for contributing.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests