Cosmos

Interface for applying Cosmos to document segmentation

Current milestone (with demo links): https://github.com/UW-COSMOS/project-docs/tree/master/presentations_reports/milestone_3

Running the standalone images

We provide a separate repo (https://github.com/UW-COSMOS/cosmos-demo) describing how to use our canonical docker images, which include everything necessary to run the model.

Building + running the model from scratch

It is also possible to build the model image yourself. To do so:

Switch to the cosmos directory
Run, specifying the PDF input and desired output directories with the INPUT_DIR and OUTPUT_DIR environment variables, respectively

OUTPUT_DIR=./output/ INPUT_DIR=/path/to/input/docs DEVICE=cpu docker-compose up

Layout of the model

Documentation can be viewed at https://uw-cosmos.github.io/Cosmos/

The entry points for the program is cosmos/run.py

The procedure of the program is laid out generally as follows (docs correspond to paths)

Preprocessing -- cosmos/preprocessing
- Turn PDFs into PNGs so that they can be fed to a computer vision pipeline.
Create proposals -- cosmos/connected_components
- Generate region proposals within document pages, this segments each page.
Ingesting data -- cosmos/ingestion
- Prepare region proposals to be classified by a Neural Network as Body Text, Equation, Figure, etc.
Model inference -- Inference runner: cosmos/infer || Model definition: cosmos/model
- Run the Neural Network on each region proposal.
Convert to HTML/XML -- cosmos/converters
- Results are converted to HTML/XML and class specific information extraction modules are run.
Postprocessing -- cosmos/postprocessing
- Update class labels in light of extracted information.
Equation specific OCR -- cosmos/latex_ocr
- Custom extraction pipeline for equations.
Create knowledge base of figures and tables -- cosmos/construct_caption_tables
Create knowledge base of equations -- cosmos/UnicodeParser

License and Acknowledgements

All development work supported by DAPRA ASKE HR00111990013 and UW-Madison.

Licensed under the Apache License, Version 2.0 (the "License"); you may not use this repo except in compliance with the License. You may obtain a copy of the License at:

http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.

Name		Name	Last commit message	Last commit date
Latest commit History 599 Commits
cosmos		cosmos
docs		docs
services		services
.gitattributes		.gitattributes
.gitignore		.gitignore
.travis.yml		.travis.yml
README.md		README.md
docker-compose-standalone-CPU.yml		docker-compose-standalone-CPU.yml
docker-compose-standalone-GPU.yml		docker-compose-standalone-GPU.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Cosmos

Running the standalone images

Building + running the model from scratch

Layout of the model

License and Acknowledgements

About

Releases

Packages

Languages

sverma25/Cosmos

Folders and files

Latest commit

History

Repository files navigation

Cosmos

Running the standalone images

Building + running the model from scratch

Layout of the model

License and Acknowledgements

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages