diff --git a/README.md b/README.md
new file mode 100644
index 0000000..39e44b7
--- /dev/null
+++ b/README.md
@@ -0,0 +1,11 @@
+# paperpdf2xml
+
+A set of Python CLI tools to convert scientific papers in PDF format to XML documents with sections and tables.
+
+## Prerequisites
+
+* Make sure the `pdftotext` utility is installed; it is used for the initial PDF-to-text conversion.
+For Ubuntu/Debian:
+```
+sudo apt-get install poppler-utils
+```