Implemented OneByteIdentityH/V encoding cmap (with test pdf) #179

hynekcer · 2017-05-13T22:54:25Z

Some type of CID fonts with "Identity" cmap is problematic and nothing is extracted. An example is http://stackoverflow.com/questions/43832499/pdfminer-empty-output

I fixed it. The simple test file is taken from question http://stackoverflow.com/questions/38119176/itextsharp-font-widths-definition-not-correctly-loaded

Maybe a better or more general solution would be to use the existing /ToUnicode map, but my solution is similar to /Identity-H that also ignores the unicode map and believe that it is an identity.

Original: euske/pdfminer#179

PR: euske/pdfminer#179

Implemented OneByteIdentityH/V encoding cmap (with test pdf)

c949b9a

ydy1225520 approved these changes Apr 6, 2019

View reviewed changes

eladkehat added a commit to eladkehat/yapdfminer that referenced this pull request May 1, 2019

Implement OneByteIdentityH/V encoding cmap

a24b226

Original: euske/pdfminer#179

pantuts added a commit to atlas-health/pdfminer that referenced this pull request Dec 12, 2019

fix PSTypeError and encoding cmap

b27ecaa

PR: euske/pdfminer#179

pantuts added a commit to atlas-health/pdfminer that referenced this pull request Dec 12, 2019

fix PSTypeError and encoding cmap

4a08159

PR: euske/pdfminer#179

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implemented OneByteIdentityH/V encoding cmap (with test pdf) #179

Implemented OneByteIdentityH/V encoding cmap (with test pdf) #179

hynekcer commented May 13, 2017

Implemented OneByteIdentityH/V encoding cmap (with test pdf) #179

Are you sure you want to change the base?

Implemented OneByteIdentityH/V encoding cmap (with test pdf) #179

Conversation

hynekcer commented May 13, 2017