-
Notifications
You must be signed in to change notification settings - Fork 28
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
How should the temporal dimension (ie. organizational reforms) be handled? #68
Comments
An use case related to this is on the first draft of the Core Public Organization Vocabulary (cpov):
I highly suggest that people interested on this publicbodies.org project do participate in the open Working Group that is developing the cpov. |
@augusto-herrmann my instinct is the simplest way to handle this is to add three columns to the tables:
There's also move about what replaces it but i think this gets really complicated quickly and could go in the notes fields for now IMO. I think this is the lightest way to get moving for now but maybe others have good ideas too! E.g. @jpmckinney may have good ideas here! |
@rufuspollock I think I'm not fond of For organizational transformation, that is indeed complicated, and may be better suited for an unstructured notes field. If structured data is necessary, some options to handle are: renaming, merging, splitting, moving within a hierarchy. |
@jpmckinney sounds good. @augusto-herrmann so I think we should go for We can put some unstructured or semistructured info in the notes field about how the change related to other orgs (i don't think we are in a position to standardize this yet and suggest you make something up for brazil and once we have trialled that a bit we can revisit whether we can standardize that). |
This issue is related to #13, should be handled in tandem when tackled. |
@jpmckinney @rufuspollock the problem is that data sources often do not provide a Right now, with the data sources that have already been automated, we just delete the missing entries when bringing in new data, i.e. the whole list gets replaced by the new version. Since we're tracking the csv in git, this is not much of a problem unless we want to display extinct public bodies in the UI. If people want to get to older data, just check older versions of the csv in git. This seems to me like the least costly, maintainable solution at the moment. |
I'm happy with whatever compromise the project chooses. In terms of options: If deleting rows, this means that if a dataset uses publicbodies IDs, then if a body it refers to is deleted in publicbodies, then that dataset can no longer reconcile. Furthermore, users of that dataset don't know whether (1) the dataset's ID is incorrect or (2) the ID refers to a body that has since been deleted. Instead of deleting rows, this issue had suggested additional columns ( Another solution that has not been discussed is to instead move/merge the old table for country XYZ into a "history" table for country XYZ. The semantics of the history table are that it contains all bodies that had been collected in the past and that could not be reconciled with the most recent collection. This way, users can still easily look up any ID that had existed in the past, without attaching any specific meaning to the "history" table – entries might be there because: a body no longer exists, reconciliation failed (like in the Environment example above), the source accidentally omitted the body, etc. |
Brazil has been through a quite comprehensive organizational reform recently, with many Ministries being merged, renamed, etc. I suppose this is quite common in other countries as well. So I'm opening this issue to discuss how this history of organizations should be handled (if at all) in this project. Ideas are welcome.
Some possibilities:
reorg_from
field containing the id of the public body this was merged from. Reorgs can then be tracked by analysing this field along with the existingfounding_date
anddissolution_date
fields.The text was updated successfully, but these errors were encountered: