Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Blog Post] Add Blog post for Bril array & arguments extensions #429

Open
wants to merge 2 commits into
base: 2023fa
Choose a base branch
from
Open
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
96 changes: 96 additions & 0 deletions content/blog/2023-12-11-arrays-and-variable-arguments-in-Bril.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,96 @@
+++
title = "Variable Arguments in Bril and Arrays in the TypeScript Compiler"
[extra]
bio = """
Alice is an M.Eng student studying Computer Science. She is broadly interested in compilers, systems and algorithm design.
"""
[[extra.authors]]
name = "Alice Sze"
+++

# What was the goal?
Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I don't think you need to include section headings that directly mirror my bullet points from the syllabus. Remember that your imaginary audience here is people who are generally interested in compilers and your work but do not particularly care that this was done as a 6120 project. Do you think there's a different way to title these sections that would more accurately reflect the contents and help such a reader more quickly navigate your writing?


The overall goal of my project is to improve the capabilities and the user experience of Bril. In particular, I set out to allow the `main` function in Bril to take a variable number of arguments (much like `argc` and `argv` in C/C++), and add support for arrays in the TypeScript compiler to Bril. I thought of adding variable arguments because we currently have to pack a fixed number of values into an array when writing Bril programs, which is not only extra effort but also limits us to testing a very specific set of inputs, e.g. an array of six elements. While it is not a huge modification, it does limit our ability to test our programs and to see the effect of our optimizations on larger inputs. I also included arrays in the TypeScript compiler because quite a few benchmarks are written using it, and I recall someone asking if arrays are supported for the first assignment. I hope that these extensions will make it ever so slightly easier for CS6120ers to write interesting programs in the future.
Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I recommend using a "," after occurrences of "e.g." and "i.e.". https://capra.cs.cornell.edu/styleguide/#egie


# What did you do?
### Variable Arguments
For the first extension, I modified the TypeScript compiler in `brili.ts`. I changed the function `parseMainArguments` to check if `main` takes an integer and a pointer as arguments, in that order. The pointer can be an `int`, `float`, `bool` or `char` pointer, and there is no restriction on the names of the arguments. If it does, the interpreter allocates an array for the command-line arguments, parses them according to the annotated type and stores them into the array. The name of the int argument will be bound to the size of the array and name of the pointer argument will be bound to the pointer to the array in the environment. The user can then use the two arguments like any other variables. When the program terminates, the interpreter frees the argument array since it is the one that allocated it.
Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

"TypeScript compiler": I think you mean the reference interpreter?


```
# ARGS: 97 108 98 101 114 116
@main(size: int, array: ptr<int>) {
k: int = const 4;
zero: int = const 0;
five: int = const 5;
output: int = call @quickselect array zero five k;
print output;
}
```

### Arrays in the TypeScript compiler

For this part, I modified the compiler in `ts2bril.ts`. First, I had to change `tsTypeToBril` so that array types are recognized in Bril. I used the TS typechecker to check if the given type is an array type. If so, we map it to the corresponding pointer type in Bril based on what the element type is in Bril.

Then, I changed `emitExpr` to be able to handle array operations. I started by adding a case in the switch statement for array literals, e.g. `{1, 2, 3}`. We evaluate the Bril type of the array, allocate an array in Bril and iterate over the array to store the elements. Next, I added a case for array indexing. To do this, we obtain Bril expressions for the array and the index by calling `emitExpr`, then create a pointer to the indexed element using `ptradd`. Indices are `number`s in TypeScript, but integers in Bril, which is a problem. The implementation currently accepts any `bigint`s as indices, but only `number` literals can be indices. So a programmer can either use `numbers[0n]` or `numbers[0]`, but `numbers[i]` only works if `i` has type `bigint`. The reason for this is that we can try to parse numeric literals as integers, but there is no narrowing operation for variables in Bril currently, and I am not sure if it would make sense for there to be one. This could be extended in the future.
Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

"We evaluate the Bril type of the array, allocate an array in Bril": Seems like this is maybe missing some words… do you mean something like "to evaluate […], we generate code to […]"?

Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This honestly makes sense to me! It is a weird mismatch between TS and Bril in general that TS has only one type (number) that in Bril would intuitively cover two different types (int and float). So I have mostly (not entirely) made the decision to force TS programs to use two types (bigint and number) to distinguish.


I also changed the existing assignment case to allow assignment to an element in the array. This is similar to evaluating indexed array as an rvalue, mentioned above.

It is the programmer's responsibility to free the arrays, unlike in real TypeScript, because Bril garbage management is manual by default. I overloaded the existing `free` function in `mem.d.ts` so that the programmer can free the array in TypeScript.
```
export function free<T>(array: T): void;
```

The code below
```
const numbers: bigint[] = [18n,26n];
console.log(numbers[0n])
free<bigint[]>(numbers);
```
gets compiled to
```
@main {
v0: int = const 2;
v1: ptr<int> = alloc v0;
cur: ptr<int> = id v1;
v2: int = const 1;
v3: int = const 18;
store cur v3;
cur: ptr<int> = ptradd cur v2;
v4: int = const 26;
store cur v4;
numbers: ptr<int> = id v1;
v5: ptr<int> = id numbers;
v6: int = const 0;
v7: ptr<int> = ptradd v5 v6;
v8: int = load v7;
print v8;
v9: int = const 0;
v10: ptr<int> = id numbers;
free v10;
v11: int = const 0;
}
```

As you can see, there is quite a bit of redundant and dead code due to the way expressions are evaluated in the compiler, but it should be easily eliminated with optimizations.
Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

And in general, the TS-to-Bril compiler already gleefully emits a lot of dead code!



# What were the hardest parts to get right?
The compiler extension was much trickier than the variable argument one. For the longest time, I did not realize that the link to the declaration file `typescript.d.ts` was broken, because the link to the rest of the code was not. This meant that the compiler was mostly working without it except in a few places, and when I added arrays I thought I must have done something wrong. I wasn't really familiar with the setup of TypeScript and there is very little documentation and few StackOverflow posts on the public TS compiler API, so I was stuck on that part for awhile. It also took some time to find the correct `functions` or `SyntaxKind` for my use cases, as I had to comb through the TypeScript codebase or rely on print statements. For anyone who is going to work with the TypeScript AST, I highly recommend the [AST viewer](https://ts-ast-viewer.com/#).
Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

If you still have them in your history, do you think you could include a couple of links to StackOverflow posts that helped you?

Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I had not seen that AST viewer! That's pretty cool. It would have saved me a lot of time in my days of hacking on TS compiler internals…


Although it was not intuitive at the beginning to use the TypeScript compiler, I feel that it is my biggest takeaway from this project as it allowed me to get some hands-on experience with a "real" compiler that is much more complex than what I built in CS 4120.


# Were you successful?

For the variable arguments assignment, I rewrote the benchmarks from the `mem` folder where adaptable to test the extension. The benchmarks that I did not rewrite either generate large randomized arrays or used arrays for memoization. I verified that the output from the six rewritten benchmarks matched the original programs with the same arguments to `main`.

For the compiler extension, I wrote programs in TypeScript using arrays, literals and `number`/`bigint` as indices, assignment to arrays etc, iterating over the arrays using for-loops etc (see below). I then ran the compiled Bril programs with the Bril interpreter to verify the outputs.

```
for (let i = 0n; i < 2n;) {
console.log(numbers[i])
i = i + 1n;
}
```

Overall, I have enjoyed implementing these features and tinkering with the tools I have been using all semester :) I have also developed a greater appreciation for those who worked on them.