GitHub - appeler/sound_names: Sound Names: Predict Race and Ethnicity Based on the Sequence of Sounds

Sound Names: Classify Names Using the Sequence of Sounds

Building on prior work that classifies names based on the sequence of characters, we create a model that capitalizes on sequence of sounds to classify names.

To capture the phonetic similarity of different names, we first produce sound encodings of names using https://pypi.org/project/Metaphone/#contents and then use LSTM on top to test classification accuracy. We find that the accuracy is substantially lower than what we can achieve when we just apply LSTM to the name strings. This suggests that there is some information in the spellings (aside from the sound) and very plausibly that the sound encoding algorithms do not capture the way a name is read completely.

In the future, we plan to ensemble the two models.

Scripts

Authors

Suriyan Laohaprapanon and Gaurav Sood

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
models		models
scripts		scripts
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Sound Names: Classify Names Using the Sequence of Sounds

Scripts

Authors

About

Releases

Contributors 2

Languages

appeler/sound_names

Folders and files

Latest commit

History

Repository files navigation

Sound Names: Classify Names Using the Sequence of Sounds

Scripts

Authors

About

Resources

Stars

Watchers

Forks

Releases

Contributors 2

Languages