-
Notifications
You must be signed in to change notification settings - Fork 8
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Maxstack calculation #3
Comments
I just read those two sections, and I agree it's a good idea to do those simultaneously. The documentation for each instruction already contains a description of stack behaviour in textual form. If there's a separate data structure describing stack behaviour, the documentation could potentially even be generated from that. I guess (off the top of my head) we need something like:
|
The machine-readable version (as a C header file) is in Partition VI, Annex C, Section C.2. It describes both the stack behavior and whether control goes straight through or potentially/unconditionally branches, etc. This analysis probably does get more complicated in the face of exception handling code, that is a whole other ball of wax. |
The file is opcodes.def, look in %ProgramFiles%\Microsoft SDKs\Windows\v7.0A\include Your mileage may vary depending on which OS you are on, on 64-bit it goes in the x86 program files directory, and that version number can change too. |
I translated the opcodes.def file into Haskell. I'm working on the abstract analysis part for stack height only (not stack shape) because it is simpler to start with. The one quirky thing is that the pop behavior of the I'm making the analysis run in the error monad because the spec disallows a fair number of strange control flow patterns (Partition III, Section 1.7.2, 1.7.5, 1.7.6) specifically for the purpose of simplifying this analysis. I'm going to return an error if the maxstack calculation happens to discover a violation of those rules. |
Partition III, Section 1.7.4 requires correct CIL instruction sequences include a limit on the height the stack may reach during execution of that sequence.
Relatedly, Section 1.7.5 requires that the shape of the stack is the same at each location in a valid instruction sequence no matter what series of branches resulted in entry to that location.
Doing an analysis of compliance with 1.7.5 gets you the answer to 1.7.4 as a side effect, and abstract interpretation to do the 1.7.5 analysis is only a little bit more complicated (track stack shapes including types, instead of just stack height), so it might be worth doing both. Doing the analysis for the .maxstack directive is actually required for correctness; unfortunately ilasm doesn't perform this analysis on its own, it relies on the input IL. (ilasm does do some other nice things though, including rewriting "long" branches to "short" branches where possible if you use the /optimize switch)
This is a medium-sized project, between writing the abstract interpretation and combing through the documents to record the behavior of each instruction.
The text was updated successfully, but these errors were encountered: