Skip to content

AdritaBarua/YSC-2021

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 

Repository files navigation

YSC-2021

The dataset developed for categorizing Bangla Sports News in named BNeC. It containes a total of 43306 text documents of four sports categories : Cricket, Football, Tennis and, Athletics.

A summary of the collected data is shown in the following Table:

Category No. of docs No. of sentences No. of words Avg. sentences per doc No. of unique words
Cricket 30032 680315 7169781 22.65 138220
Football 11429 246299 2663483 21.55 94050
Tennis 1101 21041 222152 19.11 22153
Athletics 744 17932 184616 24.10 22199

The link of the dataset containg csv file is given here : https://drive.google.com/file/d/1Ub486EMIov18FIo5lj9RwmNZWvi7XZlp/view?usp=sharing

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published