Argumentative Proposition Classifier

Implementation of a traditional classifier of argumentative components (claims and premises), trained with features/metadata previously extracted from manually annotated argumentative sentences from the citizen proposals available in the Decide Madrid platform.

The complete solution consists of a pipeline of 6 modules, which are in charge of: data extraction from the source database (Decide Madrid), manual annotation of the data using the ARGAEL tool (also supports annotations from Prodigy), the subsequent feature extraction and the final construction and validation of the feature-based classification models.

Papers

This work (v1.4) has been accepted as a paper at the 10th Workshop on Argument Mining co-located with the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023). A draft of the paper can be found here.

Pipeline diagram

Below are links to all datasets (both intermediate and final) created and used by the solution:

Decide Madrid platform
Proposals JSONL files
Annotations CSV files
Annotated propositions CSV file
Features JSON file
Labeled dataset CSV file
Models results CSV file

Dependencies

The implemented solutions depend on or make use of the following libraries:

Data processor (Python module):
- python v3.9.13
- spaCy v3.3.1
Feature extractor (Java module):
- JDK 17
- Stanford CoreNLP v4.5.3
- MongoDB Java Driver v3.12.10
- Snake YAML v1.9
- JSON Java v20210307
Argument classifier (Python module):
- python v3.9.13
- sklearn v0.24.2
- lightgbm v3.3.5
- nltk v3.6.3
- optuna v3.1.1

Authors

Created on Aug 18, 2021
Created by:

License

This project is licensed under the terms of the Apache License 2.0.

Acknowledgements

This work was supported by the Spanish Ministry of Science and Innovation (PID2019-108965GB-I00).

Name		Name	Last commit message	Last commit date
Latest commit History 300 Commits
.github/ISSUE_TEMPLATE		.github/ISSUE_TEMPLATE
code		code
data		data
docs		docs
image		image
papers/emnlp23		papers/emnlp23
results		results
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
_config.yml		_config.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Argumentative Proposition Classifier

Papers

Pipeline diagram

Dependencies

Authors

License

Acknowledgements

About

Releases 3

Packages

Languages

License

argrecsys/arg-classifier

Folders and files

Latest commit

History

Repository files navigation

Argumentative Proposition Classifier

Papers

Pipeline diagram

Dependencies

Authors

License

Acknowledgements

About

Topics

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases 3

Packages 0

Languages

Packages