report.md

---
title: "Frequency of letters at the start position in each word in the words dataset"
author: "Seevasant Indran"
date: "27 November, 2018"
output:
  html_document:
    keep_md: true
---

I computed how often each letter occurs at the beginning of a word, i.e. the frequency of each letter in the first position across the word list.

The most frequent letter at the beginning is S.
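The first-letter tabulation described above can be sketched in a few lines. This is a minimal Python illustration, not the code used to produce this report; the function name `first_letter_counts` and the tiny sample list are hypothetical, and folding upper and lower case together is an assumption the original analysis may not share.

```python
from collections import Counter


def first_letter_counts(words):
    """Count how often each letter starts a word.

    Assumption: case is ignored, so "sun" and "Sun" both count toward "S".
    Empty strings and words starting with a non-letter are skipped.
    """
    return Counter(w[0].upper() for w in words if w and w[0].isalpha())


# Hypothetical sample standing in for the real word list.
sample = ["sand", "Sun", "tree", "apple", "stone", "Table"]

counts = first_letter_counts(sample)

# most_common(1) returns a list with the single (letter, count) pair
# that has the highest count.
top_letter, top_count = counts.most_common(1)[0]
print(top_letter, top_count)
```

On the real word list, `counts` would hold one entry per starting letter, which is the data behind the figure below.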

Here is a histogram of how many words in the list begin with each letter.

Fig. 1 A histogram of letter usage at the beginning of each word

Jenny Bryan

\newline

On most *nix systems, the file /usr/share/dict/words contains a bunch of words. On my machine, it contains 235886 words in total.

I computed the length of each word, i.e. the number of characters, and tabulated how many words consist of 1 character, 2 characters, etc.

The most frequent word length is 9.
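The word-length tabulation can be sketched the same way. This is a hedged Python illustration rather than the original analysis code; `read_words` and `length_table` are hypothetical helper names, and the sample list stands in for the real file, which may not exist on every system.

```python
from collections import Counter


def read_words(path="/usr/share/dict/words"):
    """Read one word per line from a word list file, skipping blank lines."""
    with open(path, encoding="utf-8") as handle:
        return [line.strip() for line in handle if line.strip()]


def length_table(words):
    """Map each word length (in characters) to the number of words with that length."""
    return Counter(len(w) for w in words)


# Hypothetical sample; on a *nix machine one could call read_words() instead.
sample = ["a", "to", "be", "cat", "dog", "sun", "tree"]

table = length_table(sample)

# The most frequent length and how many words have it.
most_common_length, n_words = table.most_common(1)[0]
print(most_common_length, n_words)
```

Plotting `table` as bars, with length on the x-axis and count on the y-axis, gives a histogram like the one below.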

Here is a histogram of word lengths.

Fig. 1 A histogram of English word lengths