Implementing text categorization, CATS_SCORE always equals 100 and SCORE always equals 1 #13602
Unanswered
fionaychen
asked this question in
Help: Coding & Implementations
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hello!
I'm using spaCy to implement a text categorization task. When I run the command to train the model, I get a somewhat strange output. Specifically, it looks like the CATS_SCORE is always 100 and SCORE is always 1 (but the LOSS does change). I am quite new to using spaCy so I've been having trouble figuring out why this is the case, but it does not seem correct. Does anyone have a sense of why this might be the case?
E # LOSS TEXTCAT CATS_SCORE SCORE
0 0 0.25 100.00 1.00
0 200 43.38 100.00 1.00
0 400 44.46 100.00 1.00
0 600 40.68 100.00 1.00
0 800 31.62 100.00 1.00
0 1000 37.17 100.00 1.00
0 1200 34.96 100.00 1.00
0 1400 34.13 100.00 1.00
0 1600 32.05 100.00 1.00
To provide a bit more detail on the text categorization task, I have a dataset of the texts of many tasks, which I am trying to classify as either "promotable" or not. I am using the multilabel categorization with just 1 label.
I have tried to check a few things to fix this so far: it does seem like my dataset is set up alright (with ~20% of tasks being promotable) and there is not any overlap between the datasets I am using to train/develop/test the model.
Thank you for your help! And I am happy to provide more details about the code, etc as needed.
Beta Was this translation helpful? Give feedback.
All reactions