Asciidoc to HTML renderer

What?

asciidocj is a faster lightweight lexer-based Asciidoc processor outputting Jsoup DOM or plain HTML for Java

Why?

In our project we faced the need to transform Asciidoc files to HTML on-the-fly. Obviously our go-to solution was to use awesome AsciidoctorJ java library, but, unfortunately we found it's performance a bit dissatisfying: the very basic conversion was measured to run for more than 500 ms (yes, half a second), and the most of this was caused by the inner workings of JRuby.

As a solution for our use-case I started this project. It's doing only one thing: converts your Asciidoc formatted text to HTML (or JSoup Document, if you need). Output of asciidocj is verified against AsciidoctorJ, so it is producing exactly the same DOM tree with exactly same classes, meaning that you can use it right away without changing of your existing styles.

How?

1: Add maven dependency

<dependency>
   <groupId>com.github.fluorumlabs</groupId>
   <artifactId>asciidocj</artifactId>
   <version>1.0.1</version>
</dependency>

2: Convert your asciidoc text to AsciiDocument:

AsciiDocument parsedAsciidoc = AsciiDocument.from(asciidoc);

3: Get the results:

Get plain HTML in string: parsedAsciidoc.getHtml()
Get JSoup Document: parsedAsciidoc.getDocument()
Get <body> element of JSoup Document: parsedAsciidoc.getDocumentBody()
Get asciidoc attributes: parsedAsciidoc.getAttributesAsJSON() and parsedAsciidoc.getAttributesAsProperties()

Supported features

See asciidocj test suite for the list of verified supported features. The verification is performed by comparing output of asciidocj with the output of AsciidoctorJ. The following AsciidoctorJ settings are used:

backend = html5
headerFooter = false

showtitle = true
experimental = true
skip-front-matter = true

Limitations

Boundaries of delimited blocks can be unbalanced (see https://asciidoctor.org/docs/user-manual/#delimiter-lines)
Support for block nesting is limited
Pass-through blocks (++++) are considered as blocks: all non closed html tags are closed automatically. This means that they can't be used to create complex HTML layouts.
Table cells are always treated as asciidoc fragments, but only paragraphs are outputted if no a is specified
Nested tables are not supported
List continuation (attaching to parent) actually attaches to parent instead of some arbitrary level as in Asciidoctor

Internals

asciidocj is a two-stage converter. First stage is based on JFlex lexical analyzer generator, but instead of producing fully-featured AST, it creates a "semi-AST" right inside Jsoup DOM tree. The resulting DOM is then processed in a second stage to a normal HTML.

Jsoup guarantees that the resulting HTML will always be 100% syntactically correct and safe.

Name		Name	Last commit message	Last commit date
Latest commit History 51 Commits
src		src
.gitattributes		.gitattributes
.gitignore		.gitignore
.travis.yml		.travis.yml
LICENSE		LICENSE
README.md		README.md
pom.xml		pom.xml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Asciidoc to HTML renderer

What?

Why?

How?

Supported features

Limitations

Internals

About

Releases 4

Packages

Contributors 2

Languages

License

fluorumlabs/asciidocj

Folders and files

Latest commit

History

Repository files navigation

Asciidoc to HTML renderer

What?

Why?

How?

Supported features

Limitations

Internals

About

Resources

License

Code of conduct

Security policy

Stars

Watchers

Forks

Releases 4

Packages 0

Contributors 2

Languages

Packages