About surrogate pair characters. #123
Unanswered
kado-junichiro
asked this question in
Q&A
Replies: 1 comment 11 replies
-
That is indeed an error in phase4 (and will similarly impact the SMP btw.). I will look into this and keep you posted. Thanks for bringing that also here to my attention |
Beta Was this translation helpful? Give feedback.
11 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
May I ask a question about the handling of surrogate pair characters?
In Japan, surrogate pair characters may be included on invoices in the name of a company or individual.
For example, if we include the characters "髙橋 𩸽𠮷" in the send invoice xml, when we check the received invoice, "𩸽𠮷" (surrogate pair characters) were automatically removed and changed "髙橋".
The log shows the following, indicating that the surrogate pair characters were recognized as an invalid character and replaced.
**
level
WARN
loggerName
com.helger.xml.serialize.write.EXMLIncorrectCharacterHandling
message
XML content contains invalid character data (will replace): '株式会社 髙橋 𩸽𠮷商事'. Invalid chars are: 0xd867, 0xde3d, 0xd842, 0xdfb7
**
Is this intentionally removed because surrogate pair characters cannot be handled on the Peppol network?
If you can provide any information, would you let us know?
Regards,
Junichiro Kado
Beta Was this translation helpful? Give feedback.
All reactions