Skip to content

Latest commit

 

History

History
16 lines (10 loc) · 1018 Bytes

README.md

File metadata and controls

16 lines (10 loc) · 1018 Bytes

Wikipedia analysis

Introduction

The purpose of this repository is to gather python scripts that allow making the analysis on Wikipedia's available data. It is organized in directories each of those contains procedures related to a specific analysis task. Here follows a little description for each of the aforementioned directories.

Directories

  1. Mention: Once you choose a word or a regular expression you are interested in, the provided scripts allowed you to carry out some analysis relate to the mentions of the chosen words across languages. A more detailed explanation is furnished in the README.md of the directory.

Directories organization

Each directory provides the following files:

  1. README.md file: description of the work;
  2. .py scripts which rally the classes and methods implemented to accomplish the task;
  3. An IPython Notebook that shows the way to use the classes and methods provided.