kell18/CAS

CAS. Content analysis system

System for automatic content moderation.


##### Implemented features:

  • Researched content characteristics: likeability, inverse relevance, correctness.
  • Collected and labeled a data set of 1250 items
  • Trained a logistic regression model in MATLAB
  • Implemented a continuous filtering service in Scala (VK API, Elasticsearch)
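
As a rough illustration of how a trained logistic regression model could be applied inside a filtering service, here is a minimal Python sketch. It is not the project's code (the model was trained in MATLAB and the service is written in Scala). The function names, the `threshold` parameter, and the weights in the usage note are all hypothetical.

```python
import math


def sigmoid(z: float) -> float:
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def predict_acceptable(features, weights, bias, threshold=0.5):
    """Return (probability, decision) for one content item.

    `features` and `weights` are equal-length sequences of floats.
    The weights and bias are assumed to come from a previously trained model.
    """
    z = bias + sum(w * x for w, x in zip(weights, features))
    p = sigmoid(z)
    return p, p >= threshold
```

For example, `predict_acceptable([0.8, 0.1, 0.3], [1.5, -2.0, 0.7], -0.2)` uses made-up weights for the three characteristics and returns a probability together with a keep/filter decision.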

##### Further work

  • Discover new characteristics
  • Implement online learning (via a Chrome app and human moderators)
  • Build a website to sell the system


##### Current characteristics:

  • Likeability - how much users like the content
  • Inverse relevance - score from Elasticsearch
  • Correctness - ratio of punctuation characters to message length, number of uppercase characters, message size
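
The correctness characteristic above can be sketched as a small feature extractor. This is a minimal Python illustration, not the project's Scala implementation. The function name `correctness_features` and the dictionary keys are hypothetical. "Punctuation" is assumed here to mean ASCII punctuation from `string.punctuation`, so other punctuation marks are not counted.

```python
import string


def correctness_features(message: str) -> dict:
    """Compute the three correctness signals for one message."""
    length = len(message)
    # Assumption: only ASCII punctuation counts as punctuation.
    punct = sum(1 for ch in message if ch in string.punctuation)
    upper = sum(1 for ch in message if ch.isupper())
    return {
        # Guard against division by zero for empty messages.
        "punctuation_ratio": punct / length if length else 0.0,
        "uppercase_count": upper,
        "message_size": length,
    }
```

For the message `"Hello, World!"` this yields a message size of 13, an uppercase count of 2, and a punctuation ratio of 2/13.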

Albert Bikeev.
