Interface for easier topic modelling.
-
Updated
Jul 29, 2024 - Python
Interface for easier topic modelling.
Python implementation of bag-of-concepts
Library in C++ and a python wrapper for dealing with Page XML files
An extremely weakly-supervised text classification method using mutually-enhancing text granularities (word, sentence, and document-level context).
Hyperbolic Contrastive Learning for Document Representations - A Multi-View Approach with Paragraph-level Similarities
Dataset and code for "Label-Wise Document Pre-Training for Multi-Label Text Classification" (NLPCC 2020)
Unsupervised Discovery Of Trends In Biomedical Research Based On The PubMed Baseline Repository
Define models to represent a textual document, e.g. a PDF, preserving the hierarchy of the content.
Add a description, image, and links to the document-representation topic page so that developers can more easily learn about it.
To associate your repository with the document-representation topic, visit your repo's landing page and select "manage topics."