dkpro-bigdata

DKPro BigData enables the easy execution of UIMA-based natural language processing pipelines on a hadoop cluster.

###Features Large scale NLP processing using UIMA and hadoop Store your corpora on a Hadoop filesystem and access them from local or distributed pipelines Find patterns in your textual data using adaptable collocation extraction ###Details

Execute DKPro pipelines on a hadoop cluster with minimal adaption
Read data stored on a HDFS Filesystem using DKPro Collection Readers
Read/Write serialized CASes from HDFS ###Contributors:
Hans-Peter Zorn
Johannes Simon
Martin Riedl
Richard Eckart de Castilho
Steffen Remus

##License DKPro BigData is licensed under the Apache Software Licence (ASL) Version 2.0.

This project is a joint effort of UKP Lab and the Language Technology Group, Technical University of Darmstadt.

Name		Name	Last commit message	Last commit date
Latest commit History 176 Commits
dkpro-bigdata-collocations		dkpro-bigdata-collocations
dkpro-bigdata-doc		dkpro-bigdata-doc
dkpro-bigdata-examples		dkpro-bigdata-examples
dkpro-bigdata-hadoop		dkpro-bigdata-hadoop
dkpro-bigdata-io-hadoop		dkpro-bigdata-io-hadoop
.gitattributes		.gitattributes
.gitignore		.gitignore
LICENSE.txt		LICENSE.txt
README.md		README.md
pom.xml		pom.xml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

dkpro-bigdata

About

Releases

Packages

Contributors 12

Languages

License

dkpro/dkpro-bigdata

Folders and files

Latest commit

History

Repository files navigation

dkpro-bigdata

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 12

Languages

Packages