-
Can a USFM milestone occur inside the text of a paragraph without interrupting the flow? (I think not.) For example:
becomes
-
One thing I've noticed with heavily tagged whole-book USFM is that it's large to download and, especially, slow to parse, so we usually need one of the following:
It depends on whether the software is display-only (so we can assume no errors, or just ignore them) or an editor (where we can't guarantee correctness at any point in time and have to rewrite all the edited data). All this to say that I've been leaning more towards stand-off markup lately, so that the actual text is short and quickly parsed. However, the markup must not be so fragile that it breaks on small text edits, and the cost is that a system is needed to keep text and markup in step.
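To make the "stay in step" cost concrete, here is a minimal Python sketch of stand-off spans being re-anchored after a text edit. Everything in it (`Span`, `apply_edit`, the `label` field) is hypothetical and not taken from any existing USFM tool. The boundary policy is just one arbitrary choice: text typed exactly at the start of a span joins it, and text typed exactly at the end does not.

```python
from dataclasses import dataclass


@dataclass
class Span:
    """Stand-off annotation covering text[start:end] (hypothetical structure)."""
    start: int
    end: int
    label: str  # e.g. the speaker; illustrative only


def apply_edit(text, spans, pos, removed, new):
    """Replace text[pos:pos+removed] with `new` and re-anchor every span."""
    edit_end = pos + removed
    delta = len(new) - removed

    def move(offset):
        if offset <= pos:
            return offset          # at or before the edit: unchanged
        if offset >= edit_end:
            return offset + delta  # after the edit: shifted by the size change
        return pos                 # inside deleted text: collapse to the edit point

    new_text = text[:pos] + new + text[edit_end:]
    moved = [Span(move(s.start), move(s.end), s.label) for s in spans]
    # Spans whose text was deleted entirely collapse to zero length; drop them.
    return new_text, [s for s in moved if s.start < s.end]


text = "Jesus said, Follow me. They left."
spans = [Span(12, 22, "Jesus")]  # covers "Follow me."

# Editing text before the span only shifts its offsets.
new_text, new_spans = apply_edit(text, spans, 6, 4, "answered")
print(new_text[new_spans[0].start:new_spans[0].end])
```

Even in this toy version, the fragility question shows up as a policy decision in `move`: an editor has to decide what happens to a span when an edit touches its boundary or deletes part of it.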
-
Obviously, the big downside to milestones (and even special character markers like \wj) is that they get in the way. If the editor "hides" them, then the chances are higher that they will get mangled or end up in the wrong place. If the editor shows them, they interrupt the flow of the text and prevent the user from reading the text in a straightforward way. Probably the best thing is an editor that is smart enough to display the effects of the milestones without actually showing the markup (e.g., through background shading), but even then, when editing at a character position right on the boundary of a milestone, it's not easy to ensure a perfect result. In some ways, this is no different from a normal word processor with styles or formatting applied to a bit of text. It will have rules about when that formatting applies to text typed at the beginning or the end of the run of text, but the result won't always be what the user intended. (And depending on how astute the user is and how subtle the formatting differences are, they might or might not notice that they got the wrong result.)
One concern I have about standoff markup is that since the USFM standard has already been enhanced to specify a means for marking up speech, if we come up with another competing approach (even if it is superior), tools may then have to be developed to support both approaches. (In the famous words of Nathan Miles, "The great thing about standards is that there are so many of them to choose from.")
-
There are several types of text that can potentially be regarded as speech or quoted text. These types of text can be identified in several different ways:
In many cases, more than one of these methods may be employed for the same speech. It is also very much the case that within a translation project these methods might be used inconsistently.
Depending on the intended use(s) of a text, not all "speech" needs to be marked up, and it may be acceptable to have it marked up in different ways. However, using lots of different ways to mark up speech does tend to make it more difficult to do various kinds of automated processing or parsing of the text. The more speech is identified, the richer the associated information, and the simpler the way it is recorded, the better current and future tools will be able to use that information to help users with tasks. Ideally, we want to make it as easy as possible for translators and downstream editors to supply the needed information in a consistent way.
Currently, my operating belief is that milestones offer the best hope of useful markup because they do not interfere with the text itself (i.e., they can be used whether or not the script or language requires/allows quotation punctuation, formatting or particular wording to identify speech) and because they allow for extra information to be supplied that cannot always be easily inferred from the text. However, I'm open to being shown why this is not necessarily the best way...
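As a rough illustration of the "extra information" point, here is a minimal Python sketch that pulls `\qt-s` / `\qt-e` milestones out of a USFM fragment and reports who is speaking, with offsets into the milestone-free text. It assumes USFM 3 style milestone syntax with a `who` attribute. The function name, the regexes and the sample verse are my own illustration, and it pairs start and end milestones by simple nesting rather than by `sid`/`eid`, which a real tool would probably want to check.

```python
import re

# Assumed milestone shape: \qt-s |attr="value" ...\*  and  \qt-e |attr="value"\*
MILESTONE = re.compile(r'\\(qt\d?)-([se])\s*(?:\|([^\\]*))?\\\*')
ATTR = re.compile(r'(\w+)="([^"]*)"')


def extract_quotes(usfm):
    """Return (text_without_milestones, quotes) for a USFM fragment."""
    pieces, length, last = [], 0, 0
    open_starts, quotes = [], []
    for m in MILESTONE.finditer(usfm):
        chunk = usfm[last:m.start()]
        pieces.append(chunk)
        length += len(chunk)
        last = m.end()
        attrs = dict(ATTR.findall(m.group(3) or ""))
        if m.group(2) == "s":
            open_starts.append((length, attrs))
        elif open_starts:
            # Simplification: pair with the most recent unmatched start.
            start, start_attrs = open_starts.pop()
            quotes.append({"start": start, "end": length,
                           "who": start_attrs.get("who")})
    pieces.append(usfm[last:])
    return "".join(pieces), quotes


sample = (r'\v 33 Pilate asked him, \qt-s |sid="qt_123" who="Pilate"\*'
          r'“Are you the king of the Jews?”\qt-e |eid="qt_123"\*')
plain, quotes = extract_quotes(sample)
for q in quotes:
    print(q["who"], plain[q["start"]:q["end"]])
```

Note that the speaker comes entirely from the milestone attribute; nothing in the quoted words themselves would let a tool infer it reliably.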