-
Notifications
You must be signed in to change notification settings - Fork 664
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add links / tags for each statement #1397
Comments
@edolnx was exploring something similar at one point |
This is possible with asciidoc. Most content in asciidoc can be assigned a
resource ID and then linked to. The challenge that we have faced is that
specs have been authored iteratively over time by many authors with no
knowledge of what IDs anyone else may have defined in their specs.
Therefore, we have to work backwards through all of the asciidoc specs,
assign IDs and link appropriately. Some of this can be automated, but any
automated additions would have to be verified.
…On Mon, Dec 2, 2024 at 2:46 PM Derek Hower ***@***.***> wrote:
@edolnx <https://github.com/edolnx> was exploring something similar at
one point
—
Reply to this email directly, view it on GitHub
<#1397 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AAAN6ZCWTRUNEAY7RESVR5D2DS2IXAVCNFSM6AAAAABS4FVMUGVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDKMJSGYYTGMJXGI>
.
You are receiving this because you are subscribed to this thread.Message
ID: ***@***.***>
|
Yeah it's going to be quite tedious but I don't think there's any reason it has to happen all at once. We should probably start with the infrastructure, so it can be rendered as in the examples above. |
We need links for our activity being lead in the CSC to create a golden list of all implementation-defined behaviors (calling them "parameters"). Umer is leading this effort and we are capturing our parameters in a spreadsheet at https://docs.google.com/spreadsheets/d/1tFGLbocTp1YNn11aN8JTmvxL_23e7x8SlBNJfOPRmAE Notice there is a column for a "Reference to Standard" that is empty. |
Asciidoc's built in ID feature supports tagging a span. For example this:
Generates this HTML:
It is possible to use pure CSS to show the tag with highlighting when you hover it (I've done this before). I'll have a go at doing it again. |
Ok roughly something like this: asciidoc_link_test.zip. CSS isn't quite right but you get the idea. Looks like this: anchors.mp4 |
This is an interesting solution. I'm not sure I'm crazy about the tags in
the right hand margin but it's a start. Note that we also have to consider
the Antora-ization of the content as well, we have to be careful not to
solve for PDF alone.
…On Tue, Dec 3, 2024 at 7:12 AM Tim Hutt ***@***.***> wrote:
Ok roughly something like this: asciidoc_link_test.zip
<https://github.com/user-attachments/files/17992562/asciidoc_link_test.zip>.
CSS isn't quite right but you get the idea.
Looks like this:
https://github.com/user-attachments/assets/e74ffaa4-0845-4f1d-a786-d5a0e78416d7
—
Reply to this email directly, view it on GitHub
<#1397 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AAAN6ZF6BNIO45GZ76WNCMD2DWN3TAVCNFSM6AAAAABS4FVMUGVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDKMJUGM4TKMBYHE>
.
You are receiving this because you commented.Message ID:
***@***.***>
|
That is the HTML output, not PDF. I don't think there's any point trying to modify the PDF output. |
Tim, the highlighting of the text is pretty cool. Also not sure about the tags being displayed in the right column. Can we put them inline in the text with special formatting somehow without affecting the readability of the existing text or requiring them to be heavily edited? I had thought of using an admonition like NOTE but that would be very disruptive to existing text. Since I've proposed moving the email thread (https://lists.riscv.org/g/csc-tests-models-recommend/topic/109815116) to this GitHub case, let me add in some content from that thread here. Here's my adoc code trying out the 3 different kinds of IDs.
and here's the output: |
Is there some way in adoc to get the text associated with an ID? For a inline assignment that would be the text between the # marks. For an anchor, that would be the whole block associated with that anchor. Here's links to the adoc documentation I'm using: |
Yes you can convert it to XML using the Docbook output, and then you get elements like this:
Yeah there's a whole range of aesthetic options. We could style the text that has been highlighted in some subtle way, and then when you hover it it could show you the ID as a tooltip. Or maybe just make them all into links that link to themselves (so you can easily copy the URL), and don't show the ID at all. |
Thank Tim. Is there a way to get the text inside adoc so it could fill in my table column I showed that currently says "TBD"? |
fwiw, I don't believe Antora supports anchors such as [[param-foo-3]], so
always go with [#param-foo-3].
…On Tue, Dec 3, 2024 at 9:48 AM james-ball-qualcomm ***@***.***> wrote:
Tim, the highlighting of the text is pretty cool. Also not sure about the
tags being displayed in the right column. Can we put them inline in the
text with special formatting somehow without affecting the readability of
the existing text or requiring them to be heavily edited? I had thought of
using an admonition like NOTE but that would be very disruptive to existing
text.
Since I've proposed moving the email thread (
https://lists.riscv.org/g/csc-tests-models-recommend/topic/109815116) to
this GitHub case, let me add in some content from that thread here.
Here's my adoc code trying out the 3 different kinds of IDs.
This is before the parameter definition. [#param-foo-1]#Here's a parameter defined using inline anchors.# This is after the parameter definition.
[#param-foo-2]
Here's a parameter defined in its own block using the # ID syntax.
[[param-foo-3]]
Here's a parameter defined using an anchor.
|===
| Parameter Name | Parameter Reference | Parameter Text
| foo-1 | <<param-foo-1,foo-1>> | TBD
| foo-2 | <<param-foo-2,foo-2>> | TBD
| foo-3 | <<param-foo-3,foo-3>> | TBD
|===
and here's the output:
image.png (view on web)
<https://github.com/user-attachments/assets/c3832676-bca4-4677-8d50-1c87e7e4eeb1>
—
Reply to this email directly, view it on GitHub
<#1397 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AAAN6ZDVJPZSGEDFQYGCS3D2DXAEXAVCNFSM6AAAAABS4FVMUGVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDKMJUG44DCNBWG4>
.
You are receiving this because you commented.Message ID:
***@***.***>
|
Can we agree to allow the CSC to add parameter tags (i.e., implementation-defined options) to the ISA manuals using inline and block anchors (using [# ] for the latter)? If yes, then we need to agree on a naming convention for these tags. If no, what is required to get an agreed upon solution? |
That's fine with me assuming we adhere to this one prerequisite for Antora:
"Assembler does not require that you use unique IDs across your site. IDs
only have to be unique within a page, which is already a requirement when
using Antora. Assembler automatically scopes IDs by rewriting them when it
aggregates pages. At the moment, only IDs in the shorthand form are
supported (e.g., [#install]). The long-hand (e.g., id=install) and block
anchor (e.g., [[install]]) forms are not supported." Note that this will
require the fixing of the block anchors currently in use.
As for naming convention, we could use sail-like format for instructions,
ex. [#insns-b] perhaps, extensions can be named, ex. [#zbb] or with
alternative text [#zbkb, Bit-manipulation for Cryptography], but I defer to
what others feel is most intuitive.
…On Tue, Dec 3, 2024 at 4:52 PM james-ball-qualcomm ***@***.***> wrote:
Can we agree to allow the CSC to add parameter tags (i.e.,
implementation-defined options) to the ISA manuals using inline and block
anchors (using [# ] for the latter)? If yes, then we need to agree on a
naming convention for these tags. If no, what is required to get an agreed
upon solution?
—
Reply to this email directly, view it on GitHub
<#1397 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AAAN6ZHKJG56IZS2BQHUO7D2DYRZNAVCNFSM6AAAAABS4FVMUGVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDKMJVGYZDINRTGA>
.
You are receiving this because you commented.Message ID:
***@***.***>
|
The IDs also need to start with something unique to this tagging effort so they are distinguishable from other IDs (e.g. the auto IDs used for section headers). I used |
@dhower-qc, you are familiar with Antora and our needs for the CSC parameter linking to standards. Will the rewriting of IDs by Antora's assembler impact our use? As for a naming convention, how about a prefix of "param-" followed by the parameter name with dashes to separate words? |
I suggest our ID standards be more strict than Antora -- we want globally-unique IDs. As for the specifics, I think we need to get a wish list of what we want out of tagging first. @kasanovic might have ideas on this, too, but as a start: Must have:
Nice to have:
|
These IDs aren't going to be just for parameters. The idea is every statement (at least the normative ones) has a tag, so you can link test plans and coverage to them. |
Understood Tim. I'm focusing on the parameters at the moment given our effort to create a golden list of implementation-defined behaviors. I'd like to start making these tags and adding them to the parameter spreadsheet and see how it goes. Do I have permission to create a new issue for this specific case and then start putting parameter tags into the adoc manual source? Then I'll be doing a PR to get them merged in. |
How do you deal with tag unions? (as opposed to worker unions :-)
E.g. text that is both a parameter and coverage and DV, or even text that
partially overlaps one category vs. another?
…On Wed, Dec 4, 2024 at 12:47 PM james-ball-qualcomm < ***@***.***> wrote:
Understood Tim. I'm focusing on the parameters at the moment given our
effort to create a golden list of implementation-defined behaviors. I'd
like to start making these tags and adding them to the parameter
spreadsheet and see how it goes. Do I have permission to create a new issue
for this specific case and then start putting parameter tags into the adoc
manual source? Then I'll be doing a PR to get them merged in.
—
Reply to this email directly, view it on GitHub
<#1397 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AHPXVJQ3YIQTCXDNB3GGIP32D5S6LAVCNFSM6AAAAABS4FVMUGVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDKMJYGUZDOOBSG4>
.
You are receiving this because you are subscribed to this thread.Message
ID: ***@***.***>
|
The text only has one tag, as shown in my example. Then the parameter list, coverpoints, test plans, test suites, etc. can all refer to that same tag. |
Tim, your suggestion of having a tag for every normative statement sounds like a pretty large change to the way our specs are currently written. Am I correct? If so, I'm wondering how we take that one without delaying some solution for parameter tags. There are probably something like 100 parameter tags we need to add so a much smaller problem then having a unique tag for each normative statement and much less disruptive to the current adoc specs. Allen, couldn't we have multiple tags to the same adoc content? If adoc doesn't support this, we could add our own extension to adoc to better support multiple tags or a more sophisticated tagging mechanism. We could UDB to convert such adoc with our own tagging extension into normal adoc to be rendered into PDF/HTML. We might want to go down this pay anyways for other related reasons (discussed at the UDB meeting today). |
I don't know if the schema for tags allows multiple tags for the same
segment of text.
Overlapping tags are then possible if and only if the multiple (adjacent?)
segments of text can have the same tag.
…On Wed, Dec 4, 2024 at 2:35 PM james-ball-qualcomm ***@***.***> wrote:
Tim, your suggestion of having a tag for every normative statement sounds
like a pretty large change to the way our specs are currently written. Am I
correct? If so, I'm wondering how we take that one without delaying some
solution for parameter tags. There are probably something like 100
parameter tags we need to add so a much smaller problem then having a
unique tag for each normative statement and much less disruptive to the
current adoc specs.
Allen, couldn't we have multiple tags to the same adoc content? If adoc
doesn't support this, we could add our own extension to adoc to better
support multiple tags or a more sophisticated tagging mechanism. We could
UDB to convert such adoc with our own tagging extension into normal adoc to
be rendered into PDF/HTML. We might want to go down this pay anyways for
other related reasons (discussed at the UDB meeting today).
—
Reply to this email directly, view it on GitHub
<#1397 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AHPXVJX2ZJ7NELLDAWBZ4HT2D57SHAVCNFSM6AAAAABS4FVMUGVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDKMJYGY4TONRXG4>
.
You are receiving this because you commented.Message ID:
***@***.***>
|
Yeah adding tags everywhere is a very big task. I think adding a subset that are needed for the parameter list as the initial set is totally reasonable.
That's not necessary unless I've missed something. The tags provide a programmatic way to identify statements in the spec. Other documents (the parameter list, test plans, etc.) can then use those tags to programmatically connect to those statements. The spec itself (and the tags) will have no idea that the parameter list or the test plans exist. |
Instead of making links to Tim's website, we could use this: https://riscv-software-src.github.io/riscv-unified-db/manual/html/isa/20240411/index.html This is the 2024-04-11 version of the ISA manuals so should be static. It is generated by UDB and it gets the ISA manual contents from the RVI GitHub. |
A slight issue with that is that it is multipage, which is not how the spec is built by default so the links will be different. The fact that it is on my domain is not an issue - it's trivial to change the base of the URLs. The important thing would be that the file & fragment are the same. Consider:
Note Anyway this is kind of off-topic for this issue. |
In the meeting today, there was a comment on whether this is stepping on a
documentation SIG responsibility.
I said that their responsibility is more of the documentation
infrastructure
- but the format of links and how it gets included in the asciidoc spec
does sound like infrastructure.
What gets tagged and how those tags get used is not, I'd argue, but the
tooling for the links is.
…On Thu, Dec 5, 2024 at 8:33 AM Tim Hutt ***@***.***> wrote:
A slight issue with that is that it is multipage, which is not how the
spec is built by default so the links will be different.
The fact that it is on my domain is not an issue - it's trivial to change
the base of the URLs. The important thing would be that the file & fragment
are the same.
Consider:
-
https://riscv-software-src.github.io/riscv-unified-db/manual/html/isa/20240411/chapters/zifencei.html#:~:text=The%20FENCE.I%20instruction%20is%20used%20to%20synchronize%20the%20instruction%20and%20data%20streams
.
-
https://riscv-specs.timhutt.co.uk/spec/20240411/unpriv-isa-asciidoc.html#zifencei:~:text=The%20FENCE.I%20instruction%20is%20used%20to%20synchronize%20the%20instruction%20and%20data%20streams
.
Note chapters/zifencei.html#:~:text=... vs
unpriv-isa-asciidoc.html#zifencei:~:text=... - that's the important
thing. @wmat <https://github.com/wmat> is planning to host an official
copy at some point and then we can just rewrite all the URLs to point to
that (see #1427 <#1427>).
Anyway this is kind of off-topic for this issue.
—
Reply to this email directly, view it on GitHub
<#1397 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AHPXVJT2XJCNTJ67KFCCE2T2EB555AVCNFSM6AAAAABS4FVMUGVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDKMRQHA2TANJXHA>
.
You are receiving this because you commented.Message ID:
***@***.***>
|
With that said, any inline IDs added to the asciidoc should use the
shorthand formatting, i.e. [#someid], not the long form [[someid]] as
Antora doesn't recognize the longform.
Also, please refer to
https://docs.asciidoctor.org/asciidoc/latest/attributes/id/#inline-assignment
for rules about ID naming.
…On Thu, Dec 5, 2024 at 6:00 PM Allen Baum ***@***.***> wrote:
In the meeting today, there was a comment on whether this is stepping on a
documentation SIG responsibility.
I said that their responsibility is more of the documentation
infrastructure
- but the format of links and how it gets included in the asciidoc spec
does sound like infrastructure.
What gets tagged and how those tags get used is not, I'd argue, but the
tooling for the links is.
On Thu, Dec 5, 2024 at 8:33 AM Tim Hutt ***@***.***> wrote:
> A slight issue with that is that it is multipage, which is not how the
> spec is built by default so the links will be different.
>
> The fact that it is on my domain is not an issue - it's trivial to
change
> the base of the URLs. The important thing would be that the file &
fragment
> are the same.
>
> Consider:
>
> -
>
https://riscv-software-src.github.io/riscv-unified-db/manual/html/isa/20240411/chapters/zifencei.html#:~:text=The%20FENCE.I%20instruction%20is%20used%20to%20synchronize%20the%20instruction%20and%20data%20streams
> .
> -
>
https://riscv-specs.timhutt.co.uk/spec/20240411/unpriv-isa-asciidoc.html#zifencei:~:text=The%20FENCE.I%20instruction%20is%20used%20to%20synchronize%20the%20instruction%20and%20data%20streams
> .
>
> Note chapters/zifencei.html#:~:text=... vs
> unpriv-isa-asciidoc.html#zifencei:~:text=... - that's the important
> thing. @wmat <https://github.com/wmat> is planning to host an official
> copy at some point and then we can just rewrite all the URLs to point to
> that (see #1427 <#1427>).
>
> Anyway this is kind of off-topic for this issue.
>
> —
> Reply to this email directly, view it on GitHub
> <
#1397 (comment)>,
> or unsubscribe
> <
https://github.com/notifications/unsubscribe-auth/AHPXVJT2XJCNTJ67KFCCE2T2EB555AVCNFSM6AAAAABS4FVMUGVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDKMRQHA2TANJXHA>
> .
> You are receiving this because you commented.Message ID:
> ***@***.***>
>
—
Reply to this email directly, view it on GitHub
<#1397 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AAAN6ZAZLDLQIFXBXYNXBGD2EDLHPAVCNFSM6AAAAABS4FVMUGVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDKMRRGY2TOMRTHA>
.
You are receiving this because you were mentioned.Message ID:
***@***.***>
|
I discussed this with @jrgrimm11 today at our 1:1 and came to pretty the same conclusion as you two (Tim & Bill). As long as we can extract the tags and their content from the adoc in an automated fashion and we have our certification test plans point to a particular release of the manuals then we can create a tool to detect changes to the tagged content. So, maybe that should be our plan of record. We just need a agreed upon naming scheme to identify tags and some way to extract them from the adoc. I'd prefer something smarter than just grepping for them. Tim/Bill/Derek, any recommendations here? I believe we should initially assume the tags aren't allowed to change the ISA manual content since it is ratified text. We'll see how that goes and we can focus on the base ISA and extensions for MC100 (see https://riscv-software-src.github.io/riscv-unified-db/pdfs/MC100.pdf) as a pipe cleaner. The tags wouldn't be visible in the standard PDF and HTML generated from the adoc. I would expect to add the ISA manuals as appendices to certification test plans (which we can generate from riscv-unified-db with some work) and then the tag references in the certification test plans will become hyperlinks into the ISA manual appendices. It would be cool if we could show the tag names and identify what they reference like Tim was showing with CSS and HTML. |
Just saw this in Tim's example in #1769: "Detecting when parts of the spec change. You can extract these snippets via Asciidoc's docbook output and parsing the XML." |
Let's discuss a standard naming scheme for tags. So far, we need tags for parameters (AKA implementation-defined behaviors) and we need tags for normative rules which we need for CSC certification test plans (although I expect many community members will use the tags to create their verification test plans for their IP cores they are developing). Naming of parameters (AKA implementation-defined behaviors):
I'm using - to separate fields and _ within fields. The combination of category and name must be unique (ignoring case) across all RVI standards (ISA and non-ISA). The category is whatever is most related to the parameter. For example, if a parameter is clearly related to just one extension it can be the extension name. Could also be related to a CSR name, CSR field, ISA feature, etc. Category is just used for filtering/searching. Examples:
Naming of normative rules:
Same rules for category and name as for parameters. |
Since all param-creating statements come from normative text, do they get two tags? |
Hummm, good point. Having 2 tags wouldn't be great or even work properly using the adoc facility. The parameters are meant to capture implemented-defined behaviors (AKA parameters, AKA options) whereas the normative rules are meant to capture certification test coverage points. I think of parameters as a particular type of normative rule so if the normative text can be characterized as a parameter the rv_param is used and otherwise rv_norm_rule is used. How's this idea? |
There's no need to encode information into the IDs like "is it a parameter". The only requirement is that it uniquely identifies a bit of text, so We should use a more unique prefix than |
I'd like to do so this before Christmas, but it might spill over into next year. |
Actually Andrew, I now realize that we need to be adding tags to ISA manual contents in the main branch so there isn't a dependency on you identifying a suitable Priv ISA manual release anymore. However, it would still be good to get that done so that we can point the UDB manual it uses to the release you identify. |
Tim, you raised a couple of issues with my proposal:
Let me focus on #2 first since it is the juicier topic. At the very minimum, the name has to be unique and immutable. However, we could also encode some meta data into the name. There is a continuum of possibilities of encoding meta data into the name from having no meta data to trying to encode every conceivable type of meta data such as:
Here's some screenshots of how other references have done this. Note that these are all visible tags and not adoc tags we are talking about here but I think they are still interesting to see. You'll see that the RISC-V Server SoC spec has a category and 3-digit number for the name. Arm uses "R" for normative rules and "I" for non-normative rules (AKA informative) followed by a 5 letter randomly generated unique name. SPARC had a tag to highlight implementation dependencies followed by a unique integer. For UDB, we just need a unique prefix and a unique name after that we can then put all the meta data associated with a tag in UDB. However, UDB isn't the only tool that will probably use these tags. In particular, I would imagine that customers creating verification test plans for their implementations could find them useful. This is why I'm proposing a category and a different unique prefix for parameters vs. generic normative rules. |
I see 3 properties here:
Visibility, Naming ( along with categories), Multiple tags.
The one that bothers me most is the multiple tag part.
If a bit of text is - or can be- associated with multiple tags, how do we
deal with it?
(or do we just outlaw it?)
Can/should we tag a tag in that case?
That’s the easy case. Next harder is if the text is a subset of some
existing tagged text.
( same questions, roughly: prohibit? Recursively tag embedded text?
Finally, the overlapped case, e.g tag X refers to text a,b, and tag Y
refers to text b,c.
I’m not sure I see good answers to this beyond prohibit ( as well as not
seeing an actual example of where this mightt happen
…On Saturday, December 14, 2024, james-ball-qualcomm < ***@***.***> wrote:
Tim, you raised a couple of issues with my proposal:
1. Unique prefix
2. Naming after prefix
Let me focus on #2 <#2>
first since it is the juicier topic. At the very minimum, the name has to
be unique and immutable. However, we could also encode some meta data into
the name. There is a continuum of possibilities of encoding meta data into
the name from having no meta data to trying to encode every conceivable
type of meta data such as:
- The kind of normative rule (e.g., parameter or generic normative
rule)
- Some kind of category (e.g., extension name, ISA feature, etc)
- Version number of category when normative rule becomes in-scope
(e.g., version 1.13 of Priv ISA)
- Dependencies on other extensions (e.g., normative rule only applies
if F or D extension is present)
Here's some screenshots of how other references have done this. Note that
these are all visible tags and not adoc tags we are talking about here but
I think they are still interesting to see. You'll see that the RISC-V
Server SoC spec has a category and 3-digit number for the name. Arm uses
"R" for normative rules and "I" for non-normative rules (AKA informative)
followed by a 5 letter randomly generated unique name. SPARC had a tag to
highlight implementation dependencies followed by a unique integer.
For UDB, we just need a unique prefix and a unique name after that we can
then put all the meta data associated with a tag in UDB. However, UDB isn't
the only tool that will probably use these tags. In particular, I would
imagine that customers creating verification test plans for their
implementations could find them useful. This is why I'm proposing a
category and a different unique prefix for parameters vs. generic normative
rules.
image.png (view on web)
<https://github.com/user-attachments/assets/b024a58f-e83a-4475-95fc-a14de71911b1>
image.png (view on web)
<https://github.com/user-attachments/assets/265478a2-10c3-48ff-872d-cf3205c9eab9>
image.png (view on web)
<https://github.com/user-attachments/assets/f39ad1ff-208a-4ca8-9374-af51ebd338bb>
—
Reply to this email directly, view it on GitHub
<#1397 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AHPXVJUZVUU5ZYLLOZBGOGT2FRQLXAVCNFSM6AAAAABS4FVMUGVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDKNBTGE4DMMRRGE>
.
You are receiving this because you commented.Message ID:
***@***.***>
|
Allen, I fully anticipate that when we start this tagging activity we'll encounter all of the problems you imagine. This is because the normative prose in the manuals weren't written with the requirement that it could be tagged some day into crisp normative rules and implementation-defined behaviors. This tagging activity will highlight examples of where tagging doesn't work well with the existing prose. Hopefully this will help RVI understand better the nature of the issue/challenge and we'll be able to find an acceptable solution. For example, we may decide to edit the content of the manuals without changing any normative behaviors (keeping the same intent) so that we can have crisp tags. This will better meet the needs of certification and verification. To be clear, by verification, I mean the activity a processor design team engages in to fully debug their implementation (i.e., not certification). |
Ack. But I agree it would still be good to get that done, since it has been a while since the last release, and much has been ratified since then. |
Andrew, if there's anything I can do to help with moving forward toward a
published release, please let me know.
James, we want to consider Asciidoc's ID functionality and how it can be
leveraged for tagging. Note that section titles automatically become the
primary ID for a section, however this can be customized as described here:
https://docs.asciidoctor.org/asciidoc/latest/sections/custom-ids/. The
most applicable usage will likely be the ID attributes, please see:
https://docs.asciidoctor.org/asciidoc/latest/attributes/id/
Note that you can also use secondary IDs and auxiliary IDs as described in
the custom IDs link above.
…On Sun, Dec 15, 2024 at 8:13 PM Andrew Waterman ***@***.***> wrote:
Actually Andrew, I now realize that we need to be adding tags to ISA
manual contents in the main branch so there isn't a dependency on you
identifying a suitable Priv ISA manual release anymore. However, it would
still be good to get that done so that we can point the UDB manual it uses
to the release you identify.
Ack. But I agree it would still be good to get that done, since it has
been a while since the last release, and much has been ratified since then.
—
Reply to this email directly, view it on GitHub
<#1397 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AAAN6ZCCWK656TKTIR4YCQD2FYSNPAVCNFSM6AAAAABS4FVMUGVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDKNBUGI3TGNRSGE>
.
You are receiving this because you were mentioned.Message ID:
***@***.***>
|
Thanks Bill. I took I look at your links and they're related to sections. Those are too coarse-grained (we need to be able to tag things like sentences or even portions of a sentence). So I'm not so sure that it helps in this situation. By the way this is James Ball from my personal account instead of my work account. |
Yes, the first link was about sections. The second talks about inline assignment, I believe. |
IMO encoding information into the tag adds extra work, verbosity and opens up awkward questions (e.g. Allen's points) without really giving significant advantages. The metadata associated with the tags would be much better stored in external documents. Perhaps the best course of action would be to pick a section of the spec and fully add tags? Then we can compare a realistic example of how it looks with either scheme. |
Yes, the second link was related to inline tags and we've already seen that page (and plan to use it). |
I think Allen's issues are orthogonal to whether our tag names contain some meta data or not. That's because I see Allen's issues are related to the current writing style of the ISA manuals that were written without a requirement to make crisp normative rules. Because I plan to use UDB with the tags, I can add all the meta data I need in UDB and have none in the tags. However, as I mentioned earlier, I don't want to assume that all consumers of the tags will be using UDB so having the minimal meta data I suggested (i.e., having a category and differentiating between generic normative rules and implementation-defined behaviors) seems like a good compromise. I don't really see how it adds significantly to the work but does make the tag names larger but is that really a significant constraint? IMO what I'm proposing is the sweet-spot between having no meta data vs. having all the meta data possible. However, that's just my opinion and I'm looking for a consensus from those following this GitHub issue so I'd like to hear other opinions. |
I'm finally moving ahead with this today. I've forked the ISA manual into https://github.com/RISC-V-Certification-Steering-Committee/riscv-isa-manual (hope this was the right thing to do) and am now starting to put anchors into the ISA manual adoc source. Note that I've been poking around in the ISA manual adoc source and see extensive use of both the anchor syntax (e.g., [[foo]]) and ID syntax (e.g., [#foo]) so I'm assuming Antora can work with both. Bill hypothesized earlier in this issue that Antora can only work with the ID syntax but I think I have an existence proof that it can handle either syntax. According to https://docs.asciidoctor.org/asciidoc/latest/attributes/id/, the anchor syntax ([[foo]]) has the advantage over the ID syntax ([#foo]) that the anchor name can contain a "." without being incorrectly interpreted as a "role". The anchor syntax also works in a list item or table cell whereas the ID syntax doesn't. The only place is seems that the ID syntax has an advantage is for an inline anchor such as:
Both the anchor syntax and ID syntax can be associated with a block (on the line before it starts) such as:
or
Another advantage of the anchor syntax is that you can put multiple of them on the same section or block such as:
and
as long as you put them before the section name like in the above example. |
I see the ISA manuals have this format for the newer instructions that have an instruction per page (like B extension, scalar crypto, Zc extension). Must be the work of @kdockser and @tariqkurd-repo.
I haven't seen using a "#" in the link before in AsciiDoc. I tried an experiment and the VSCode adoc previewer seems to ignore one leading "#" in the link (a "#" isn't required though). |
BTW, I noticed this when I built the ISA manual and got lots of messages about missing link anchors (for older instructions). Maybe we should provide these in the UDB appendices so that the links actually have an anchor to go to.
|
Because the rollout of Antora deployments is incomplete, this is not a safe assumption. @wmat, can you take a definitive position on this, please? |
Seems like the better approach would be to fix the base ISA doc. @kersten1, can you work to drive this list of invalid anchors to 0? |
This is logged here: #1458 |
On Thu, Jan 16, 2025 at 7:42 AM Jeff Scheel ***@***.***> wrote:
Note that I've been poking around in the ISA manual adoc source and see
extensive use of both the anchor syntax (e.g., [[foo]]) and ID syntax
(e.g., [#foo]) so I'm assuming Antora can work with both. Bill hypothesized
earlier in this issue that Antora can only work with the ID syntax but I
think I have an existence proof that it can handle either syntax.
Because the rollout of Antora deployments is incomplete, this is not a
safe assumption. @wmat <https://github.com/wmat>, can you take a
definitive position on this, please?
There is a bug in the Antora Assembler that prohibits it from long for
ids. See https://gitlab.com/antora/antora-assembler/-/issues/17
Therfore, just to be safe, I would only use the short form syntax, i.e.
[#foo]
—
… Reply to this email directly, view it on GitHub
<#1397 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AAAN6ZBLFRNWBQMZGFSSEE32K6SKZAVCNFSM6AAAAABS4FVMUGVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDKOJVGUYDQMBWGI>
.
You are receiving this because you were mentioned.Message ID:
***@***.***>
|
There are tons of anchor syntax (e.g., [[foo]]) in the ISA manuals. Do you plan to remove them all? |
On Thu, Jan 16, 2025 at 8:19 PM james-ball-qualcomm < ***@***.***> wrote:
There are tons of anchor syntax (e.g., [[foo]]) in the ISA manuals. Do you
plan to remove them all?
If necessary, I'll replace them all on my antora-refactor branch.
… —
Reply to this email directly, view it on GitHub
<#1397 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AAAN6ZFCOEBACAJJZC6SDXT2LBLBTAVCNFSM6AAAAABS4FVMUGVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDKOJXGIZTIMZVGY>
.
You are receiving this because you were mentioned.Message ID:
***@***.***>
|
Mind doing that @wmat? |
Sounds good Bill. I'll use "[#" anchors in UDB. |
It is very useful for specs to be linkable, for example to link coverpoints, test plans, tests, etc. to specific statements in the spec. This allows you to ensure that you
There are some examples of this in the Ferrocene Rust Specification: (note each item has a unique ID, just not rendered):
In ARM's ASL reference manual (here they render the unique IDs):
Note that in both examples the writing style is a lot more like #1396 but you don't necessarily have to do it like that. In a previous company I implemented this system but you could tag parts of paragraphs too and the tag would highlight the tagged text when you hover it.
The text was updated successfully, but these errors were encountered: