-
Notifications
You must be signed in to change notification settings - Fork 23
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Adding information about the source format and generated metadata #66
Comments
I think that this could be covered in a
Once we know the media type and filename, I'm not convinced that we would need to also provide a value indicating that metadata were generated. |
Another useful thing we can do with the
That's right, the only useful thing I could think of is to know that the title was generated. But if we can check that the title is the filename, then it's actionable. |
I don't see how a filename can give useful info to the application. Here you're proposing the filename because some source formats will not contain a title, so the title will be generated from the filename. Is it because the only mandatory meta in RWPM is the title? Also, knowing the source media-type seems sufficient to know which metadata have been generated (e.g. the title in case the media type is cbz). |
In the case of a CBZ or a PDF, the filename might be the only info available. If you know the filename, you can compare it to the title generated by the streamer and do something based on that comparaison. |
I see ... but the title is still mandatory, so it means the RWPM will contain:
The client part "viewer" just wants to display a title. Who cares about the original filename at this point? |
In both CBZ and PDF, sometimes we have a real title (for CBZ, it is the single root folder inside the archive, if there's one), so we can't be sure the filename was used. Also file extensions are not mandatory, so checking for |
What about overloading https://www.w3.org/publishing/epub3/epub-packages.html https://www.dublincore.org/specifications/dublin-core/dcmi-terms/elements11/source/
|
According to the parsing specification, |
Ah, here is an older related issue about |
Note that the R2 TypeScript implementation currently preserves https://github.com/IDPF/epub3-samples/blob/master/30/childrens-literature/EPUB/package.opf#L29 https://idpf.github.io/epub3-samples/30/samples.html => |
Just to clear any ambiguity, on mobile <metadata xmlns:dc="http://purl.org/dc/elements/1.1/"
xmlns:dcterms="http://purl.org/dc/terms/"
xmlns:a11y="http://www.idpf.org/epub/vocab/package/a11y/#">
<dc:title>Alice's Adventures in Wonderland</dc:title>
<dc:rights>Public Domain</dc:rights>
<meta property="a11y:certifiedBy">EDRLab</meta>
</metadata> produces the RWPM (after resolving the full URI from the XML namespaces of other metadata): {
"metadata": {
"title": "Alice's Adventures in Wonderland",
"http://purl.org/dc/terms/rights": "Public Domain",
"http://www.idpf.org/epub/vocab/package/a11y/#certifiedBy": "EDRLab"
}
} And with the in-memory model: publication.metadata.title
publication.metadata["http://purl.org/dc/terms/rights"] // (internally uses `otherMetadata`) Note that we have a special case with the
I'm not sure any of this is documented in the EPUB parsing guide, as metadata extensions were not really supported at the time. |
How about {
"metadata": {
"sourceFile": {
"type": "application/epub+zip",
"name": "title.epub"
}
}
} |
...just thinking aloud regarding the term
|
Good point, then I would still use For example, fetching a CBZ from {
"metadata": {
"originalAsset": {
"type": "application/vnd.comicbook+zip",
"name": "watchmen"
}
}
} |
Sometimes, we are generating a RWPM from third-party publication formats (e.g. EPUB, CBZ). It could be a valuable information for reading apps to know from which source format a RWPM originated from.
For example:
Generating a RWPM from third-party formats also means that sometimes we need to generate some metadata, which might not be accurate. For example, a
title
is mandatory with RWPM, but doesn't exist for CBZ and some PDFs. It could be useful for reading apps to know if the metadata are not completely accurate. Two possibilities:generated: true
property when any property is generatedgenerated: ["metadata.title"]
Personally, I think that without the information about which properties are generated, this is not very actionable for reading apps.
The text was updated successfully, but these errors were encountered: