Hierarchical Matching #2483
Replies: 3 comments 3 replies
-
this could be extended to include ADM3..5 alternatively,
|
Beta Was this translation helpful? Give feedback.
-
Just to check is this referring to what is done for instance in GADM, where each level 4 (for instance) entry has the name of the areas that contain it at levels 3, 2, and 1? |
Beta Was this translation helpful? Give feedback.
-
Hey @tfardet - are you referring to the GeoNames geocodes? We've been working on ways to try and crosswalk between geoboundaries and a wide range of alternative ID schemes - including ISO codes for sub-levels, the UN P-Codes, and a few others. Ultimately, the GeoNames geocodes would be another example of an ID scheme we would eventually want to tie into, but we won't be there immediately. I agree on the file size issue, going to lower units would be too much. What I want to avoid is turning into GADM, with so many ancillary columns that a lay user can't figure out what's going on (and, equally importantly, accuracy degradation as we add more elements we're maintaining). For example, for an ADM2, GADM would provide the following columns for each unit: Contrast this to geoBoundaries, which has the exact same schema across all data products (i.e., no matter what level you grab, you get these columns): So the question here is how we could add hierarchical information while also retaining what I would consider the interpretability / usability of gB. While also operating within the constraints of a flat attribute table to accomodate shapefiles / geojsons. And also wanting to give users easy filter capabilities. Which seems an impossible set of capabilities to offer :). One alternative solution would be to provide hierarchical information as an optional addon join - i.e., at checkout, the user ticks a box (or something), and when they get their copy of geoBoundaries it also provides a csv to join (or, perhaps, just provides an alternative version with the join already done). In that case, we would have more flexibility to add arbitrary columns. I.e., then I could see adding something like (with multiple columns according to the number of parents): shapeADM1Name - Name of ADM1 Then, if the user checks, say, a "geonames" box (that won't exist for a very long time :) ), they would also get: Obviously still at the phase of brainstorming here, so any thoughts welcome. |
Beta Was this translation helpful? Give feedback.
-
I'm moving some of our conversations about hierarchical matching out of issues and into this discussion area.
Essentially, I want to get a pulse on different strategies we could use for hierarchical standardization (ensuring ADM2 nests in ADM1, ADM3 in ADM2, and so on), and what the community might ultimately prefer.
Just to get started:
#2326 (comment)
Beta Was this translation helpful? Give feedback.
All reactions