Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Documentation is all Microsoft Word and thus hard to read online #431

Open
pmetzger opened this issue Oct 9, 2024 · 10 comments
Open

Documentation is all Microsoft Word and thus hard to read online #431

pmetzger opened this issue Oct 9, 2024 · 10 comments

Comments

@pmetzger
Copy link

pmetzger commented Oct 9, 2024

Howdy! The documentation is all in .doc files, which makes it hard to track changes in source control, hard to request pulls of fixes to documentation, hard to view the docs online as web pages, etc.

Would the project be willing to accept volunteer labor to convert the docs to markdown or something similar?

@pkoning2
Copy link
Member

pkoning2 commented Oct 9, 2024

Probably not. That was actually done a while ago (using RST I think) but ended up being rejected. I forget what the discussion was.

@pmetzger
Copy link
Author

That's pretty sad. The documentation is de facto binary blobs and thus pretty hard to maintain as it is; there's no way to do pull requests to patch problems.

@atroly
Copy link

atroly commented Nov 1, 2024

I would be most interested in knowing why pmetzger's suggestion was rejected, because on the face of it it seems an eminently sensible and useful one.
Even if markdown is deemed to be insufficiently flexible, there are open document formats which offer most or all of the desirable features mentioned, rather than the proprietary and obsolete .doc

@unixwizard107
Copy link
Member

unixwizard107 commented Nov 2, 2024

The request to switch from Microsoft *.DOC or the newer *.DOCX formats to something else has come up in the past. No solution is nearly universally agreed upon anywhere. They all have issues. As Paul K points out, this topic gets tossed about occasionally. One thing the project has tried to be is as neutral as it can be when it makes sense. So, requiring a specific compiler or development environment has tended to be shied away. That said, when Bob started the project, he took (and continues to take) the make it easy on the writer's view. Since Word was what he had, that's what he used. But as Perry notes, Microsoft formats have never been source code control friendly. The core issue is that source control systems, like Git or the like, have to treat them as blobs (which is contentious I'll not open that issue any further).

Document formats such as Markup, RST, and JavaDoc can solve source control problems. Still, the troff and tex document compilers families can work with source control just as well and have the advantage of being portable and running on many emulated systems.

From a tool standpoint, besides the format issue, another negative of using either *.DOC and *.DOCX is that they are only supported by Microsoft and Apple as development vehicles. The other formats I mentioned have the advantage of being well supported on most, if not all, of the chosen development platforms, but the tools to use them vary widely (at least troff and tex allow you to use vi/emac or whatever text editor the writer wants to use). More importantly, many of the content creators, such as Bob and Rich, are using tools like Word, which, while many users are happy with tool X or Y, you will have a hard time finding ones that are universal to all of our target development environments, meets the source control criteria and is something that the content creators are willing to use.

Plus ... as Perry mentioned, one of the biggest stumbling blocks if we change is getting from one format to another - which is a big job.

That said ... Bob has observed that since the origin of many of the documents in the 1990s, the good news is that all of them should be able to be supported by LibreOffice. Rich Cornwell, another of the primary simulator authors and content creators, believes that all the *.DOC files will open with it, and we can create PDFs with the same. Besides being supported by all of MacOS, Windows, and Linux, since LibreOffice's native format of "flat odt" (.fodt) files is based on textual XML, it is known to be friendly to all of the currently available source control systems, they need not be stored as blobs. A quick poll of many of the core developers has said that, while they use Word/Pages/XXXX, they could probably live with LibreOffice - but only if it could be shown to work.

So, I offer the following multi-step approach. I think the Steering Group would consider a change in the build processes to create PDFs with LibreOffice on all three development platforms as part of the build if it notes that you have >>optionally<< installed LibreOffice. That means the build scripts must be more intelligent (i.e., do no harm) than they are now. However, if the build script running on the local system can find a proper version of LibreOffice, it will also create the PDFs as an output.

Perry, if you would like to offer scripts for each (Mac, Linux, Windows) and help us check to ensure that PDFs are reasonable, while I am one voice as it were, I do expect that such an offering would be well received by the SG, as it seems like it would bring us least amount of breakage with the large gain/removal of all the current major issues with the current format.

Once that work is complete, if the results of that work as well as I think they might, then a longer-term next step might be to consider switching from *.doc to *.fodt [](per https://wiki.documentfoundation.org/Libreoffice_and_subversion) in the repository itself.

@pmetzger
Copy link
Author

pmetzger commented Nov 2, 2024

So my main interest here is in getting the docs dramatically improved, which requires that users be able to edit the docs in their favorite text editor and do a github pull requests, which means a text format for the documents is kind of required. Otherwise, the workflow everyone is now used to just isn't going to happen.

A .fodt file still doesn't make it possible for a user to realistically submit patches, to run grep over the docs to find instances of obsolete urls or what have you, etc. It's not technically a binary file but it isn't that different in practice. You aren't going to merge simultaneous pull requests to different parts of a document that way; the changes can't be done using standard tools, the LibreOffice suite won't keep diffs minimal, and multiple sequential patches won't apply.

I've looked extensively at the docs, and they contain no particularly special formatting that can't be captured with markup languages. There's no good reason that any one of a number of popular markup languages couldn't work. I'm willing to be pushed into using a markup language I don't particularly like. However, I don't think the labor of turning things into a LibreOffice document would be a significant improvement to the project.

As a side note: I started my hacking career on a PDP-8e in the deep mists of time. I'm an old fogey like the rest of everyone interested in this stuff, and I'd like to see it made much easier to use the simulators the project has to offer and to show them off to younger people.

@pmetzger
Copy link
Author

pmetzger commented Nov 2, 2024

By the way, if needed, LaTeX could be used here. I don't think it's ideal because you can't turn it into web pages quite so easily, but I have enough experience using tools like HeVeA for the purpose (see https://hevea.inria.fr/ ) that I could probably be badgered into using that instead. Again, my taste would now run elsewhere; there's no complicated formatting in the documents, but any editable text format is probably an improvement over the current situation.

@unixwizard107
Copy link
Member

unixwizard107 commented Nov 2, 2024 via email

@unixwizard107
Copy link
Member

unixwizard107 commented Nov 2, 2024 via email

@ainsinga
Copy link

ainsinga commented Nov 2, 2024 via email

@ainsinga
Copy link

ainsinga commented Nov 2, 2024 via email

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

5 participants