What's the best way to add extensions? #43

yetanotherchris · 2015-09-15T18:42:02Z

I want to add a few basic extensions using CommonMark.NET: image dimensions/alignment and tables.

Is there a way of plugging into the parser yet to do this? Some suggestions on the CommonMark forum involve putting the image dimensions in the alt part of the image tag, which I can handle with a CustomHtmlFormatter, but the tables look a lot more involved.

Any suggestions would be welcomed!
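
The alt-text idea can be sketched independently of any parser. The snippet below is a minimal illustration in Python, not CommonMark.NET code: the `|WIDTHxHEIGHT` suffix convention and the names `split_alt` and `render_img` are assumptions made up for this example, since the thread does not say which syntax the forum suggestions used. A custom formatter would apply the same split when it writes the image tag.

```python
import re
from html import escape

# Hypothetical convention (an assumption, not from the thread):
# the alt text ends with "|WIDTHxHEIGHT", e.g. "Logo|200x100".
_DIMS = re.compile(r"^(?P<alt>.*?)\s*\|\s*(?P<w>\d+)x(?P<h>\d+)\s*$")


def split_alt(alt_text):
    """Return (alt, width, height); width and height are None when absent."""
    m = _DIMS.match(alt_text)
    if m is None:
        return alt_text, None, None
    return m.group("alt"), int(m.group("w")), int(m.group("h"))


def render_img(src, alt_text):
    """Build an <img> tag, adding width/height only when the alt text carries them."""
    alt, w, h = split_alt(alt_text)
    attrs = f'src="{escape(src, quote=True)}" alt="{escape(alt, quote=True)}"'
    if w is not None:
        attrs += f' width="{w}" height="{h}"'
    return f"<img {attrs} />"
```

Alt text without the suffix passes through unchanged, so existing images keep rendering as before.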


Knagis · 2015-09-16T08:36:42Z

Adding inline rules is simple: they should be added to InlineMethods.InitializeParsers(). There are some advanced topics regarding the stack used to prioritize overlapping inlines (such as emphasis and links). The logic is described in the spec, though there are now slight differences between the reference implementation and this project (which started with an identical approach).

Adding block rules is much more complex: they have to be added in BlockMethods.IncorporateLine(). The good news is that this method is kept almost identical (structure-wise) to the reference C implementation (although it is currently still missing an update for the new HTML block parsing rules).

yetanotherchris · 2015-09-16T12:05:39Z

I'm a bit wary of changing your implementation of the spec, as you obviously have a lot more knowledge of the spec and C/C# parser.

Would it be possible to add hook points easily for the two methods? And also extend BlockTag somehow, maybe by adding BlockTag.Extension or similar?

yetanotherchris · 2015-09-16T12:19:46Z

For the table markdown, it may be best to have it parsed and turned into HTML before it reaches CommonMark, so any columns with markdown inside them get styled correctly (if this is the right way to do it?). So in effect, I would just be adding extra inline parsing.

Knagis · 2015-09-18T18:09:24Z

I have a good understanding of the inline parser but only a limited one of the block parser: the code there is ported from the reference implementation and I have not really researched the logic.

I don't think that parsing tables before everything else is a good idea. If they are to work well with other structures such as lists, such separation might not be possible. You would also have to account for fenced code blocks etc.

dmitry-shechtman · 2015-11-17T22:36:59Z

@yetanotherchris Did you make any progress on the tables? I'm looking to add them as well.

Knagis · 2015-11-19T20:37:25Z

@dmitry-shechtman - see https://github.com/kevin-montrose/CommonMark.NET/tree/ast-transforms-and-tables-squashed - it seems that table parsing is already implemented there.

I haven't had the time to properly look into it - perhaps @kevin-montrose can give some background on that implementation and whether it is something that I should look at merging back into the main library.

dmitry-shechtman · 2015-11-19T21:03:33Z

@Knagis Thanks for the reply. That looks quite promising. Some interesting AST Transforms stuff as well. Any plans on pulling it?

In the meantime, I implemented sub/sup, although I didn't test those extensively. Which brings me to another issue...

kevin-montrose · 2015-11-20T02:23:14Z

@dmitry-shechtman @Knagis

The table implementation is pretty ad hoc: I tried to match GitHub-style tables but wasn't able to find a proper "spec" for them. The basic approach is to do a pass over paragraphs and see if they can be turned into tables, basically waiting until the last possible moment to make the decision. The code works for our (currently, very limited - Q&A remains on pre-CommonMark Markdown) purposes at Stack Overflow, but I doubt it's performant enough to be merged upstream.
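
To illustrate the "pass over paragraphs" idea, here is a minimal Python sketch of the decision for a single paragraph: given its lines, decide whether they form a pipe table. It is not the code from the branch; the function names and the exact rules (a header row containing a pipe, then a delimiter row of dashes with optional colons, with matching column counts) are assumptions based on common GitHub-style table syntax.

```python
import re

# One delimiter cell: dashes with optional alignment colons, e.g. "---", ":--", ":-:".
_DELIM_CELL = re.compile(r"^:?-+:?$")


def split_row(line):
    """Split a pipe-table row into trimmed cells, ignoring one outer pipe per side.

    Simplification: escaped pipes and pipes inside code spans are not handled.
    """
    line = line.strip()
    if line.startswith("|"):
        line = line[1:]
    if line.endswith("|"):
        line = line[:-1]
    return [cell.strip() for cell in line.split("|")]


def paragraph_as_table(lines):
    """Return (header, rows) if the paragraph's lines form a pipe table, else None."""
    if len(lines) < 2 or "|" not in lines[0]:
        return None
    header = split_row(lines[0])
    delims = split_row(lines[1])
    # The second line must be a delimiter row with one cell per header column.
    if len(delims) != len(header) or not all(_DELIM_CELL.match(c) for c in delims):
        return None
    rows = [split_row(line) for line in lines[2:]]
    return header, rows
```

Because the check runs on paragraphs that the block parser has already produced, text inside fenced code blocks or other containers never reaches it, which is the benefit of deciding late.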

The AST Transforms are a little rough (things like "insert after" and "insert before" require lots of work; there are only helpers for "replace" and "remove" because that's all we needed), but might serve as an OK starting point (the biggest annoyance was figuring out adjustments). The biggest issue with merging is that it requires a full copy of the markdown to be kept, which again hurts performance.

dmitry-shechtman · 2015-11-20T15:17:10Z

@kevin-montrose Thanks for chiming in. I'm a SO user since beta, which makes this even more exciting!

You've done a very impressive job. I'm not worried about the performance, but I do have concerns, such as:

1. My own changes. Although the outcomes so far are modest (sub/sup and case-sensitive reference labels), they required extensive modifications, such as propagating the settings through all inline methods down to NormalizeReference().
2. The AST Transforms are not an actual requirement at this point. Are tables-squashed safe for merge?
3. How do I go about the merge? My git(hub) knowledge is lacking. Can I create a pull request to myself?

Thanks again, and sorry about the noob question.

kevin-montrose · 2015-11-20T15:27:39Z

@dmitry-shechtman tables-squashed is lacking some fixes for bugs that were found after it and the AST branches were merged - I wouldn't merge it.

The easiest thing is probably to pull the full branch, delete the AST stuff, then merge into your changes. If I recall correctly, the AST changes are all contained in the Transforms/ folder, so just deleting it should do the trick.

Be aware that the table extensions add .EquivalentMarkdown to Block and Inline and .OriginalMarkdown to Block, and require that you parse with CommonMarkSettings.TrackSourcePosition = true. There's also a new CommonMarkAdditionalFeatures value: GithubStyleTables. I think I also deleted some deprecated properties; hopefully you don't need them.

Quickest path to merge in git would be something like:

1. git checkout -b tables
2. git pull git@github.com:kevin-montrose/CommonMark.NET.git ast-transforms-and-tables-squashed
3. Delete what you don't need and commit
4. git checkout master
5. git merge tables

dmitry-shechtman · 2015-11-20T16:29:53Z

@kevin-montrose Thank you so much, that worked (though I wish you hadn't put the settings in the middle of the block method args ;)

A couple more questions if I may:

1. Aren't AST transforms an opt-in feature?
2. Isn't TrackSourcePosition turned on automatically with GithubStyleTables?
3. Should TableInBlockquote fail?

dmitry-shechtman · 2015-11-20T16:59:40Z

In the meantime, an easy fix for .NET 2.0 compatibility (thanks to SO :)

AMDL@8f02e57

dmitry-shechtman · 2015-12-23T05:09:57Z

> (although it is currently still missing an update for the new HTML block parsing rules).

@Knagis is this still relevant?

Knagis · 2015-12-24T11:33:11Z

No, those spec updates were already implemented.

dmitry-shechtman · 2015-12-24T15:17:17Z

> No, those spec updates were already implemented.

This makes this issue dangerously close to being closed 👍

yetanotherchris · 2016-01-02T14:14:16Z

@dmitry-shechtman Is the table support part of your PR? Or is it still pending? Looking at the SO implementation, it goes beyond anything I would do, so I'm not likely to re-invent the wheel until that's stable and ready to use.

The only problem I have with that is I want to ditch Creole support for Roadkill, and move to Markdown/CommonMark but obviously can't until table support is there, unless I put some quick and dirty regex code in to support tables/img dimensions.

dmitry-shechtman · 2016-01-02T14:30:49Z

@yetanotherchris The last time I checked the "SO implementation" wasn't ready (not stable enough for me, that is).

I'm not sure which PR you're referring to. I'm working on a major overhaul of CommonMark.NET that would allow extensions to be added seamlessly, but pipe tables aren't part of it yet (although they started it all - note the name of the branch).

ruffin-- · 2018-03-16T20:27:08Z

(Realizing I'm zombie threading a little...) I've had success with pre-processing when I added table support to MarkUpDown, a Windows 10 Markdown editor that I moved over to CommonMark.NET last year. The process is relatively painless, though tedious...

1. Pass through your Markdown, removing & storing all of the things, like code blocks, that shouldn't be parsed.
   - So, for instance, if you had GitHub flavored table markup in a fenced code block, you don't want to turn that into a table. Take the code block out.
2. Byte sniff any table markup.
3. Remove and store the GitHub flavored table markup (or whatever flavor you're using).
4. Place markers for the now-removed tables in the Markdown to reinsert processed tables later.
5. Reinsert the "to ignore" text from step 1 into your table-free Markdown source.
6. Process the Markdown sans tables (but with table markers!) through CommonMark.NET normally.
7. Translate your table markup (from step 3) to html with your own logic.
8. Process the contents of each cell of the table through CommonMark.NET individually (painful, but straightforward).
   - I think that's legitimate. I can't think of a reason why what happens in another cell or elsewhere in the doc would affect how you process an individual cell.
   - Bullheaded and inefficient, but the result is fine.
9. Replace the markers for tables (from step 4) in your now-processed Markdown (from step 6) with the html for your processed table(s) (from step 8).
10. Profit?
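
The steps above can be sketched roughly as follows. This is a minimal Python illustration, not MarkUpDown's code: the marker format, the regular expressions, and every function name are assumptions, and `toy_render` is only a stand-in for the real renderer (in the thread's setting, that role would be played by a call such as `CommonMarkConverter.Convert`). The sketch assumes tables have leading and trailing pipes and that fences are closed; column alignment is ignored.

```python
import re

# Simplified patterns (assumptions): backtick fences only, tables with outer pipes.
FENCE = re.compile(r"^```.*?^```[ \t]*$", re.M | re.S)
TABLE = re.compile(r"^\|.*\|[ \t]*\n\|[ :|-]+\|[ \t]*(?:\n\|.*\|[ \t]*)*", re.M)


def toy_render(md):
    """Stand-in for a real Markdown renderer: wraps blank-line-separated blocks in <p>."""
    return "".join(f"<p>{b.strip()}</p>" for b in md.split("\n\n") if b.strip())


def stash(pattern, text, store, tag):
    """Replace each match with a unique marker and remember the original text."""
    def repl(m):
        store.append(m.group(0))
        return f"@@{tag}{len(store) - 1}@@"
    return pattern.sub(repl, text)


def render_cell(cell, render):
    """Render one cell on its own and drop the wrapping <p>...</p>, if any."""
    html = render(cell).strip()
    if html.startswith("<p>") and html.endswith("</p>"):
        html = html[3:-4]
    return html


def table_to_html(src, render):
    """Translate stored table markup to HTML, rendering each cell individually."""
    lines = src.strip().splitlines()

    def cells(line):
        return [c.strip() for c in line.strip().strip("|").split("|")]

    head = "".join(f"<th>{render_cell(c, render)}</th>" for c in cells(lines[0]))
    body = "".join(
        "<tr>" + "".join(f"<td>{render_cell(c, render)}</td>" for c in cells(l)) + "</tr>"
        for l in lines[2:]  # lines[1] is the delimiter row
    )
    return f"<table><thead><tr>{head}</tr></thead><tbody>{body}</tbody></table>"


def convert(markdown, render):
    codes, tables = [], []
    text = stash(FENCE, markdown, codes, "CODE")   # step 1: hide code blocks
    text = stash(TABLE, text, tables, "TABLE")     # steps 2-4: find, store, mark tables
    for i, code in enumerate(codes):               # step 5: put code blocks back
        text = text.replace(f"@@CODE{i}@@", code)
    html = render(text)                            # step 6: normal rendering
    for i, src in enumerate(tables):               # steps 7-9: build and reinsert tables
        table = table_to_html(src, render)
        marker = f"@@TABLE{i}@@"
        html = html.replace(f"<p>{marker}</p>", table).replace(marker, table)
    return html
```

A marker on a line of its own comes back from the renderer wrapped in a paragraph, which is why the last step replaces the wrapped form first and falls back to the bare marker.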

Obviously it should be more efficient to integrate with a good parser, but this route "does no evil" and means any mistakes are your own, not CommonMark.NET's.

Knagis added the question label Oct 21, 2015
