Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add Positive labels [PSCDD] #56

Open
4 of 6 tasks
dieko95 opened this issue Apr 11, 2021 · 0 comments
Open
4 of 6 tasks

Add Positive labels [PSCDD] #56

dieko95 opened this issue Apr 11, 2021 · 0 comments
Assignees
Labels
enhancement New feature or request
Milestone

Comments

@dieko95
Copy link
Member

dieko95 commented Apr 11, 2021

Problem

The first step of creating the development dataset (#55) is to add all positive labels + flattened articles in the same csv.

Proposed Solution

Gather the scraped information from the NGO's tagged data (positive labels).

Tasks

  • Follow up with team sambil to gather all the scraped data that have been previously done.
  • Concatenate all team sambil scraped date in the same CSV
  • Merge the dataframe with the tagged values
  • Use the data annotated last year in C4V for Negative Labels (and positive if quick) (It ain't quick, this will be tackled in Add Negative Labels [PSCDD] #57)
  • Dump locally the csv with the webscraped output + annotated positive labels
  • Upload the csv dump somewhere to enable the team to access it.
    • I'm waiting for @Edilmo suggestion
@dieko95 dieko95 self-assigned this Apr 11, 2021
@dieko95 dieko95 added the enhancement New feature or request label Apr 11, 2021
@dieko95 dieko95 changed the title Add positive labels dataset Add Positive labels [PSCDD] Apr 11, 2021
@dieko95 dieko95 linked a pull request Apr 20, 2021 that will close this issue
4 tasks
@dieko95 dieko95 added this to the PSCDD milestone Apr 20, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

Successfully merging a pull request may close this issue.

1 participant