lamson.encoding._parse_charset_header breaks encoded multiline headers #15

wRAR · 2012-12-13T15:56:45Z

In the case of headers, wrapped per RFC 2822 §2.2.3 and MIME-encoded per RFC 2047, lamson.encoding._parse_charset_header combines encoded-text parts of all encoded-words together and passes the result to lamson.encoding.apply_charset_to_header. This is clearly wrong and if some base64-encoded encoded-word is padded (i.e. ends with =), the result will be trimmed after this point on decoding. This means that in a lot of cases the parsed header value will be shorter than it should be. I'm also worried about support of headers that simultaneously include B-encoded and Q-encoded encoded-words and unencoded text which is possible per RFC 2047 §5(1).

The text was updated successfully, but these errors were encountered:

wRAR · 2012-12-15T11:19:57Z

A simple test shows that if a header consists of a B-encoded word and a Q-encoded word separated by a space, both words are decoded properly but the space is copied into the result too, while per RFC it must be ignored. Looks like this code deserves full rewrite, this time using RFC 2047 §6.

…zedshaw#15).

wRAR added a commit to wRAR/lamson that referenced this issue Dec 17, 2012

Add a hack for decoding multiline mime-encoded headers (upstream issue …

12aa268

…zedshaw#15).

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

lamson.encoding._parse_charset_header breaks encoded multiline headers #15

lamson.encoding._parse_charset_header breaks encoded multiline headers #15

wRAR commented Dec 13, 2012

wRAR commented Dec 15, 2012

lamson.encoding._parse_charset_header breaks encoded multiline headers #15

lamson.encoding._parse_charset_header breaks encoded multiline headers #15

Comments

wRAR commented Dec 13, 2012

wRAR commented Dec 15, 2012