spans (from rich text) inside cells cause newlines #7

jywarren · 2016-07-29T16:20:26Z

Pasting content from an external page into a table in a Woofmark instance (like this one: https://publiclab.github.io/PublicLab.Editor/examples/) often results in spans inside tables.

That becomes:

| col0 | col1 | col2 |
|------|------|------|
| cell | through the steps  
 | cell |
| cell | cell | cell |
| cell | cell | cell |
| cell | cell | cell |

However, tds may often contain spans, and it seems we could figure out how to parse them gracefully at least. Here it seems like that is almost happening properly.

The span tags are removed. I was going to suggest just that the extra newline could be stripped too. I'm happy to write a test and attempt this change?

The text was updated successfully, but these errors were encountered:

bevacqua · 2016-07-29T16:23:07Z

Please do!

jywarren · 2016-07-29T20:03:27Z

That's odd -- putting a span into the 'tables with complex content still get proper padding' test doesn't trigger the issue. Digging a bit.

jywarren · 2016-07-29T20:22:22Z

OK -- pasting in even a single word (on a recent Chrome, in ChromeOS) appends a   to the table cell content:

<td><span style="color: rgb(170, 170, 170);">steps</span><br></td>

Trying to think through if we should strip that out, strip trailing  s out, or what. It definitely doesn't survive the markdown conversion because a newline mid-row breaks the table.

I think it's appropriate to strip   tags in this case. Thoughts or counter-cases?

bevacqua · 2016-07-29T20:24:26Z

Are you copying a part of a page and getting this rich format? Where are you pasting? It may be more of a contenteditable issue than something specific to domador I think.

I'm fine with stripping <\s*br\s*\/?\s*> from start/end of cell, in any case.

jywarren · 2016-07-29T20:28:07Z

Yes, copying a single word (mid-sentence). I'd agree that it's contenteditable but it's also an extremely common use case for Woofmark (actually reported by a user of one of my projects). Perhaps it could be a configurable option?

Though if a   results in an unparseable markdown table, it seems like an issue regardless of where the
came from, but I guess that's a question for the GFM table spec.

bevacqua · 2016-07-29T20:34:34Z

I think the issue specific to domador is that   in table cells should be kept as-is, and not as \n. There should, furthermore, be a fix to woofmark where we strip   from start/end of pasted content. Thoughts?

jywarren · 2016-07-29T20:41:02Z

That sounds reasonable, yeah. I didn't find tables represented in Commonmark, but is the GFM tables behavior supposed to be that HTML may be nested in tables, but Markdown may not be?

For example, an unordered list can be placed inside a table in woofmark but it doesn't survive conversion to valid markdown via domador. I tend to think it's unreasonable (impossible?) to expect multiline Markdown to be parseable inside a GFM table cell (see below), but just curious if there's a well-established spec for that.

| col0 | col1 |
|------|------|
| cell | 

- one
- two |
| cell | cell |

jywarren · 2016-07-29T20:41:58Z

And for good measure, this works:

col0	col1
cell	Hi!
cell	cell

| col0 | col1 |
|------|------|
| cell | <br> Hi! <br> |
| cell | cell |

bevacqua · 2016-07-29T20:51:06Z

Yeah, I'd avoid newlines being produced by domador in general when creating Markdown. How about a flag where we tell domador to preserve certain HTML tags if in a cell?

bevacqua · 2016-07-29T20:51:37Z

As in:

| col0 | <ul><li>one</li><li>two</li></ul> |
|------|------|
| cell | <ul><li>one</li><li>two</li></ul> |
| cell | cell |

Which would render properly:

col0	one two
cell	one two
cell	cell

bevacqua · 2016-07-29T20:52:58Z

I'm pretty sure we need to preserve all block elements and  , any others you can think of?

bevacqua · 2016-07-29T20:58:50Z

By the way if you want to join ponyfoo.com/slack that'll make chatting about this stuff easier :)

jywarren · 2016-07-29T21:13:11Z

Joined, thanks. Created a test for this in #8, will take a look at coding in a bit.

jywarren added the bug label Jul 29, 2016

jywarren mentioned this issue Jul 29, 2016

trim leading/trailing from pasted content bevacqua/woofmark#34

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

spans (from rich text) inside cells cause newlines #7

spans (from rich text) inside cells cause newlines #7

jywarren commented Jul 29, 2016

bevacqua commented Jul 29, 2016

jywarren commented Jul 29, 2016

jywarren commented Jul 29, 2016

bevacqua commented Jul 29, 2016 •

edited

Loading

jywarren commented Jul 29, 2016

bevacqua commented Jul 29, 2016 •

edited

Loading

jywarren commented Jul 29, 2016

jywarren commented Jul 29, 2016 •

edited

Loading

bevacqua commented Jul 29, 2016 •

edited

Loading

bevacqua commented Jul 29, 2016 •

edited

Loading

bevacqua commented Jul 29, 2016

bevacqua commented Jul 29, 2016 •

edited

Loading

jywarren commented Jul 29, 2016 •

edited

Loading

spans (from rich text) inside cells cause newlines #7

spans (from rich text) inside cells cause newlines #7

Comments

jywarren commented Jul 29, 2016

bevacqua commented Jul 29, 2016

jywarren commented Jul 29, 2016

jywarren commented Jul 29, 2016

bevacqua commented Jul 29, 2016 • edited Loading

jywarren commented Jul 29, 2016

bevacqua commented Jul 29, 2016 • edited Loading

jywarren commented Jul 29, 2016

jywarren commented Jul 29, 2016 • edited Loading

bevacqua commented Jul 29, 2016 • edited Loading

bevacqua commented Jul 29, 2016 • edited Loading

bevacqua commented Jul 29, 2016

bevacqua commented Jul 29, 2016 • edited Loading

jywarren commented Jul 29, 2016 • edited Loading

bevacqua commented Jul 29, 2016 •

edited

Loading

bevacqua commented Jul 29, 2016 •

edited

Loading

jywarren commented Jul 29, 2016 •

edited

Loading

bevacqua commented Jul 29, 2016 •

edited

Loading

bevacqua commented Jul 29, 2016 •

edited

Loading

bevacqua commented Jul 29, 2016 •

edited

Loading

jywarren commented Jul 29, 2016 •

edited

Loading