Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

New Taipei: Some rows don't adhere to the CSV header #5

Open
jaller94 opened this issue Oct 7, 2024 · 1 comment
Open

New Taipei: Some rows don't adhere to the CSV header #5

jaller94 opened this issue Oct 7, 2024 · 1 comment

Comments

@jaller94
Copy link

jaller94 commented Oct 7, 2024

Starting in line 112, there is an additional column in the file 新北市門牌位置數值資料-20240709-street-special.csv.
The line does not match the fields from the CSV header. This makes CSV parsing impossible.

168887,新北市政府,TW,新北市,金山區,美田里,021,中山路,A區,395號,11,6,新北市金山區美田里021鄰中山路395號A區十一樓之6,313797.382454,2790844.1992000

When parsing by commas, we get the following data for row 111 (correct amount of commas) and 112 (one comma too many):

Map(14) {
  "addr:TW:dataset": "168887",
  "source": "新北市政府",
  "addr:couontry": "TW",
  "addr:city": "新北市",
  "addr:district": "新莊區",
  "addr:hamlet": "光華里",
  "addr:neighbourhood": "027",
  "addr:street": "民安西路",
  "addr:housenumber": "384號",
  "addr:floor": "11",
  "addr:unit": "1A",
  "addr:full": "新北市新莊區光華里027鄰民安西路384號十一樓之1A室",
  "x_3826": "293211.945100",
  "y_3826": "2766972.0470000",
}
Map(14) {
  "addr:TW:dataset": "168887",
  "source": "新北市政府",
  "addr:couontry": "TW",
  "addr:city": "新北市",
  "addr:district": "金山區",
  "addr:hamlet": "美田里",
  "addr:neighbourhood": "021",
  "addr:street": "中山路",
  "addr:housenumber": "A區",
  "addr:floor": "395號",
  "addr:unit": "11",
  "addr:full": "6",
  "x_3826": "新北市金山區美田里021鄰中山路395號A區十一樓之6",
  "y_3826": "313797.382454",
}
@jaller94
Copy link
Author

jaller94 commented Oct 7, 2024

  • What field does A區 fit into? We don't have an area column, but maybe there should be one.
  • Is the street value supposed to be 中山路,A區 or the housenumber supposed to be A區,395號?

@jaller94 jaller94 changed the title Some rows don't adhere to the CSV header New Taipei: Some rows don't adhere to the CSV header Nov 9, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant