News Summary (.csv) We use file cl_news_summary_more.csv
only
AMI corpus (text and summary) (meeting.zip has been uploaded)
WikiHow (.csv)
-
Download News Summary, WikiHow.
-
run process.py
python file needs package tensorflow
and stanza
News Summary [Summary to a few words headline. Extended, cleaned version ] -> This is what we picked
BBC News Summary [Summary to about 1/3, very long long long]
NewsRoom [large scale]
WikiHow -> This is what we picked
AMI corpus [The AMI Meeting Corpus is a multi-modal data set consisting of 100 hours of meeting recordings.] -> This is what we picked
Legal Case [ This one has a text length about 3000 words and short summary, which is ideal ]
Opinosis [ 51 data points ]
Sentence-compressed [ Large corpus of uncompressed and compressed sentences from news articles ]