
awsm-research/docwarn-replication


A replication package of "Towards Reliable Agile Iterative Planning via Predicting Documentation Changes of Work Items", published in the Technical Track of the 19th International Conference on Mining Software Repositories (MSR 2022).


Replication code

  • DocWarn-T: /code/model/DocWarn_T.py
  • DocWarn-H: /code/model/DocWarn_H.py
  • DocWarn-C: /code/Rscript/DocWarn_C.R

Analysis code for RQ1 and RQ3

  • can be found at /code/Rscript/
  • RQ1_performance_measure.R must be run first to measure the performance of the models.
  • Then, run RQ1_performance_stattest.R to perform statistical tests on the measured performance.
  • RQ3_rank_features.R finds a statistically distinct rank for each feature in DocWarn-C (a run-order sketch follows below).
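
A minimal sketch of this run order (not part of the package), assuming Rscript is on PATH and the repository root is the working directory:

    # Run the RQ1/RQ3 analysis scripts in the order described above.
    # Assumes Rscript is on PATH and the current directory is the repository root.
    import subprocess

    for script in [
        "code/Rscript/RQ1_performance_measure.R",   # 1) measure model performance
        "code/Rscript/RQ1_performance_stattest.R",  # 2) statistical tests on the measured performance
        "code/Rscript/RQ3_rank_features.R",         # 3) statistically distinct feature ranks for DocWarn-C
    ]:
        subprocess.run(["Rscript", script], check=True)  # stop if any step fails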

Results and dataset

  • can be found at /data (only available in the Figshare version: https://figshare.com/s/88547b3c197b21b60f7c)
  • /data/data_reverted_cleaned stores the dataset in which each work item was reverted to its state at sprint assignment time.
  • /data/trainingData stores the dataset for each cross-validation round.
  • /data/features stores the metrics extracted from each work item in the dataset.
  • /data/modelResult stores the DocWarn-C R models (/models/...), the performance of each DocWarn variation, and the feature ranking results.

Manual classification

  • /rq2_manual_validation.csv is the result of RQ2's manual classification to validate DocWarn-C.
  • /rq2_manual_validation_external.csv is the manual classification done by the external coder (to measure the inter-rater agreement; an agreement-check sketch follows below).
  • classification codes: 0 = others, 1 = changing scope, 2 = defining scope, 3 = adding additional detail, 4 = adding implementation detail
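
As an illustration only, the inter-rater agreement could be computed along the following lines; the column name "code" and the file locations are assumptions, so adjust them to the actual CSV layout:

    # Compute Cohen's kappa between the primary and external coders.
    # Assumes both CSV files sit at the repository root, rows are aligned per work item,
    # and the classification is stored in a column named "code" (hypothetical name).
    import pandas as pd
    from sklearn.metrics import cohen_kappa_score

    labels = {0: "others", 1: "changing scope", 2: "defining scope",
              3: "adding additional detail", 4: "adding implementation detail"}

    primary = pd.read_csv("rq2_manual_validation.csv")
    external = pd.read_csv("rq2_manual_validation_external.csv")

    print("Cohen's kappa:", cohen_kappa_score(primary["code"], external["code"]))
    print(primary["code"].map(labels).value_counts())  # label distribution of the primary coder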

DistilRoBERTa for DocWarn-T and DocWarn-H
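
As an illustrative sketch only, a pre-trained DistilRoBERTa encoder can be loaded from Hugging Face as shown below; the checkpoint name distilroberta-base and the binary classification head are assumptions rather than the exact setup of DocWarn-T/DocWarn-H (see /code/model/DocWarn_T.py and /code/model/DocWarn_H.py for the actual implementation):

    # Load a pre-trained DistilRoBERTa model with a classification head.
    # The checkpoint name and num_labels=2 are illustrative assumptions;
    # the DocWarn scripts define the actual model configuration.
    from transformers import AutoTokenizer, AutoModelForSequenceClassification

    tokenizer = AutoTokenizer.from_pretrained("distilroberta-base")
    model = AutoModelForSequenceClassification.from_pretrained("distilroberta-base", num_labels=2)

    inputs = tokenizer("As a user, I want to export reports as PDF.", return_tensors="pt")
    logits = model(**inputs).logits  # scores for the two classes (e.g., documentation change vs. no change)
    print(logits)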
