An Integrated Corpus Tool With Multilingual Support for the Study of Language, Literature, and Translation
Updated Nov 25, 2024 - Python
ANNIS is an open-source, versatile, browser-based search and visualization architecture for complex multilevel linguistic corpora with diverse types of annotation.
A library of functions enabling complex corpus search in context (KWIC), search aggregation, bag-of-words building & keyphrase extraction.
Parse Corpus Query Language (CQL) into a list of JSON queries
The great textmining tool that obviates all others
A concordancing program for English with a GUI that can read .docx, .srt, and plaintext files and export concordance lines to .txt, .docx, .tsv, .xlsx, and .html.
A lightweight, unobtrusive R companion for some common NLP & web scraping tasks.
Massive Speech Corpus Tool - Recursive (MaSCoT-R) is a Praat script for working with very large speech corpora
MaSCoT is a Praat tool developed to facilitate searching, extracting and analyzing information contained in large, richly-annotated speech corpora developed in Praat. This version is for single TextGrid/WAV pairs.
We designed an information retrieval system based on the vector space model in Python. We also implemented bigram indices for phrasal query search and champion-list retrieval, and compared total retrieval times in our project report.
A fast, small, and portable Windows application for searching large text corpora, with regex and right-to-left support.
Takes input text and extracts the sentences written in a given language, using that language's lexicon.
For a corpus linguistics project, I created an information retrieval program called "You Are Not Alone". My phrase_finder() function searches for a self-identifying phrase in 4 large classic texts (The Souls of Black Folk, Jane Eyre, The Strange Case of Dr. Jekyll & Mr. Hyde, and Frankenstein). Standpoint: "So Matilda’s strong young mind continu…
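Several of the tools listed here center on keyword-in-context (KWIC) concordancing. As a rough illustration of the idea only, and not the implementation of any repository above, the following sketch tokenizes a text and returns each match with a few words of context on either side. The function name `kwic`, the `width` parameter, and the sample sentence are all hypothetical choices made for this example.

```python
import re


def kwic(text, keyword, width=3):
    """Return (left, match, right) tuples for each occurrence of keyword.

    Hypothetical helper: matching is case-insensitive on whole word tokens,
    and `width` is the number of context tokens kept on each side.
    """
    tokens = re.findall(r"\w+", text)
    target = keyword.lower()
    lines = []
    for i, tok in enumerate(tokens):
        if tok.lower() == target:
            left = " ".join(tokens[max(0, i - width):i])
            right = " ".join(tokens[i + 1:i + 1 + width])
            lines.append((left, tok, right))
    return lines


sample = "The cat sat on the mat. The dog chased the cat around the yard."
for left, match, right in kwic(sample, "cat"):
    # Right-align the left context so the keyword lines up in a column.
    print(f"{left:>20} [{match}] {right}")
```

Real concordancers typically add sentence-aware tokenization, regex or CQL-style queries, and sorting by left or right context; this sketch leaves all of that out.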