-
Notifications
You must be signed in to change notification settings - Fork 51
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Duplicate entries #10
Comments
I think it would be great to update the source with the official one: http://data.gov.ro/dataset/siruta |
@necenzurat Your link to official data is broken, although the text displayed is the correct link. |
it would be nice to have some sort of automation from here i found the latest list from 2018 |
@zhgabor yea, MDB and DBF. |
I spotted 912 duplicated entries, for example:
12318 | 27.86 | 46.50 | albesti | VASLUI | VS | 1171.0 | Nord-Est
27.55 | 46.70 | albesti | VASLUI | VS | 239.0 | Nord-Est
22.51 | 47.32 | almasu mic | BIHOR | BH | 552.0 | Nord-Vest
22.14 | 47.17 | almasu mic | BIHOR | BH | 209.0 | Nord-Vest
Most probably they were inserted from different data sources at different points in time.
I think it would be good to drop all duplicates, and keeping the ones with the maximum population value.
I'm open to discussion if there is anything I misunderstood.
The text was updated successfully, but these errors were encountered: