Skip to content

Latest commit

 

History

History
14 lines (10 loc) · 1.05 KB

README.md

File metadata and controls

14 lines (10 loc) · 1.05 KB

YSC-2021

The dataset developed for categorizing Bangla Sports News in named BNeC. It containes a total of 43306 text documents of four sports categories : Cricket, Football, Tennis and, Athletics.

A summary of the collected data is shown in the following Table:

Category No. of docs No. of sentences No. of words Avg. sentences per doc No. of unique words
Cricket 30032 680315 7169781 22.65 138220
Football 11429 246299 2663483 21.55 94050
Tennis 1101 21041 222152 19.11 22153
Athletics 744 17932 184616 24.10 22199

The link of the dataset containg csv file is given here : https://drive.google.com/file/d/1Ub486EMIov18FIo5lj9RwmNZWvi7XZlp/view?usp=sharing