Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Need to make a fast actor coding system #182

Open
YanLiang1102 opened this issue Jul 4, 2017 · 4 comments
Open

Need to make a fast actor coding system #182

YanLiang1102 opened this issue Jul 4, 2017 · 4 comments

Comments

@YanLiang1102
Copy link
Contributor

Since the wikipedia library is in python, need to use flask to make a python api that the web can make a call to, and for flask python api this is a good tutorial.
https://blog.miguelgrinberg.com/post/designing-a-restful-api-with-python-and-flask

@YanLiang1102
Copy link
Contributor Author

using a tf-idf to find the most important actors in the db that have not be coded yet.

@YanLiang1102
Copy link
Contributor Author

YanLiang1102 commented Aug 3, 2017

  1. gather all the useful entities, and count their frequency, the data is stored in our mongo db

YanLiang1102 pushed a commit that referenced this issue Aug 3, 2017
YanLiang1102 pushed a commit that referenced this issue Aug 4, 2017
…system, getting the potential useful entities stuff
@YanLiang1102
Copy link
Contributor Author

And the ner result with our arabic nouns is imported into a tabled called spacyNer on hanover mongodb

YanLiang1102 pushed a commit that referenced this issue Aug 12, 2017
…d and store the data seperately for per and prg entities returned from Spacy
@YanLiang1102
Copy link
Contributor Author

YanLiang1102 commented Aug 12, 2017

And the output of the data is store on portland. This will be the data we use to build the fast coding interface, key is the arabic entity ,value is the first 5 snetence id that contain this word
And the data is stored in mongodb on portland, table is called fast_per_entities and fast_org_entities

YanLiang1102 pushed a commit that referenced this issue Aug 12, 2017
YanLiang1102 added a commit that referenced this issue Sep 10, 2017
…and clear logic all worked and also count left, so need to update teh db to show 5 sentences
YanLiang1102 added a commit that referenced this issue Sep 10, 2017
YanLiang1102 added a commit that referenced this issue Sep 10, 2017
… still need to dilter out the existing nouns that tagged in the normal interface
YanLiang1102 added a commit that referenced this issue Sep 17, 2017
…ffrently though, otehrwise the second time we input the same country it will not save it
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant