ENH semi-automatic parser #141

Merged
merged 3 commits into from
Nov 4, 2023


Gigi-G
Member

@Gigi-G Gigi-G commented Nov 4, 2023

semi-automatic parser (#140)

I enhanced the semi-automatic JSON parser. The main goal is to make it easier to convert PDF files to CSV format using the dedicated script converter.py.

The process involves the following steps:

  1. PDF to CSV Conversion: The PDF files are converted to CSV. This is a critical step in the extraction process, because it makes the data easier to access and handle.

  2. Information Extraction: After the PDF-to-CSV conversion, the parse_election_results.py script extracts the information from the CSV files.

  3. CSV Refinement: The CSV files are refined so that the data is more regularly structured, which makes the information easier to extract.
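
To give an idea of what the CSV refinement step can look like, here is a minimal sketch. It is not the code in this PR: `refine_rows` is a hypothetical helper, the sample data is made up, and the only cleanup it assumes is stripping whitespace from cells and dropping fully empty rows.

```python
import csv
import io


def refine_rows(rows):
    """Strip whitespace from every cell and drop rows with no content.

    Hypothetical helper for illustration; not part of converter.py.
    """
    refined = []
    for row in rows:
        cleaned = [cell.strip() for cell in row]
        # Keep the row only if at least one cell is non-empty.
        if any(cleaned):
            refined.append(cleaned)
    return refined


# Made-up sample resembling a messy CSV produced from a PDF.
raw = "name , votes\n\n Rossi ,  120 \n , \n"
rows = list(csv.reader(io.StringIO(raw)))
refined = refine_rows(rows)
print(refined)
```

The real refinement may need more than this (for example merging cells split across lines), depending on how the PDFs are laid out.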

These enhancements streamline the extraction process, but it is not fully automated. The parser may still detect errors in specific rows of the CSV files, and those rows need to be corrected manually.
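
As an illustration of how rows needing manual correction could be reported, here is a small sketch. It does not reproduce the checks in parse_election_results.py: `find_bad_rows`, the two-column layout, and the sample data are all assumptions made for the example.

```python
import csv
import io


def find_bad_rows(csv_text, expected_columns, numeric_column):
    """Return (line_number, reason) pairs for rows that need manual review.

    Hypothetical helper: it assumes a header line, a fixed column count,
    and one column that must contain a non-negative integer.
    """
    problems = []
    reader = csv.reader(io.StringIO(csv_text))
    next(reader, None)  # skip the header line
    for row in reader:
        line_number = reader.line_num
        if len(row) != expected_columns:
            reason = f"expected {expected_columns} columns, got {len(row)}"
            problems.append((line_number, reason))
        elif not row[numeric_column].strip().isdigit():
            reason = f"non-numeric value {row[numeric_column]!r}"
            problems.append((line_number, reason))
    return problems


# Made-up sample: line 3 has a letter O instead of a zero, line 4 lacks a column.
sample = "candidate,votes\nRossi,120\nBianchi,12O\nVerdi\n"
for line_number, reason in find_bad_rows(sample, expected_columns=2, numeric_column=1):
    print(f"line {line_number}: {reason}")
```

Reporting the line number together with the reason lets whoever fixes the CSV go straight to the affected row.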

@Gigi-G Gigi-G added the enhancement New feature or request label Nov 4, 2023
@Gigi-G Gigi-G requested review from Helias and drendog November 4, 2023 17:40
@Gigi-G Gigi-G self-assigned this Nov 4, 2023
@Helias Helias merged commit 2e14586 into UNICT-DMI:master Nov 4, 2023
1 check failed