Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Pygrunn 14 article #34

Open
wants to merge 6 commits into
base: pelican
Choose a base branch
from
Open

Pygrunn 14 article #34

wants to merge 6 commits into from

Conversation

dimazest
Copy link
Contributor

This article has two parts:

  • description of pygrunn
    • some linguistic theory

while the first is more or less covered the second need more attention

decided to show how these libraries are used to solve simple CL tasks.

First, I `covered`__ `Zipf's law <http://en.wikipedia.org/wiki/Zipf%27s_law>`_
and showed that it holds for an English text. As a homework, I asked whether the
Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

First, I covered Zipf's law, which states that the frequency of any word in a corpus of texts is inversely proportional to its rank in the frequency table. With help of pandas??? I showed that it holds for an English text.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

good point

it -1.601567 20.000540
========= ========= ===========

Conclusion
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Maybe you could add a few closing words on Pygrunn, maybe something relating to linguistics as well. Or if you don't want to add anything you could just change Conclusion to References & keep the link below.

@dimazest
Copy link
Contributor Author

i still want to write a conclusion, but i was too tired to write it :)

@dimazest
Copy link
Contributor Author

@Filip-Ter did you like the draft?

@terfilip
Copy link
Member

yeah it looks good

`My presentation`__ started as a demonstration of the modern pythonic scientific
tools (my subjective classification):

__ http://nbviewer.ipython.org/urls/bitbucket.org/dimazest/phd-buildout/raw/tip/notebooks/pygrunn14.ipynb

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It looks like the notebook is not loading.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

it works now.

Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The article is improving!
Still the notebook is not loading. It is not found on the server.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

strange, it works for me, maybe there are some problems on the server. I'll give a link to the original file and to the rendered version.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

we can use word frequencies available here http://wacky.sslmit.unibo.it/doku.php?id=frequency_lists


.. image:: {filename}/static/images/016-en_zipf.png
:align: center
:alt: English word frequency counts on the log-log scale.
Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The image is not found

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

github doesn't know how to find it, but our blog engine does :)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants