Merge changes in to support parsing bash scripts #737

BolunThompson · 2024-12-15T21:44:49Z

Code written by @sethsabar. The tests pass with the changes from binpash/shasta#5 and binpash/libbash#1 (CI will fail until those are merged in).


angelhof · 2024-12-17T00:26:10Z

I merged the dependent PRs, but I suspect we also need to create a PyPI release of libbash, correct?

BolunThompson · 2024-12-17T02:46:53Z

You’re correct — thanks for catching that. PR for it is binpash/libbash#3.


angelhof · 2024-12-17T15:40:37Z

Great, we also need to push this release to PyPI, for which I will wait for Seth to give me access to the libbash PyPI repo.

angelhof · 2024-12-21T00:42:42Z

After merging libbash and pushing the repo to PyPI we should be able to continue working on this :)

github-actions · 2024-12-21T02:11:12Z

OS =
CPU =
Ram =
Hash = 75891c3
Kernel=

github-actions · 2024-12-21T02:11:34Z

OS:ubuntu-20.04
Sat Dec 21 02:11:33 UTC 2024
intro: 2/2 tests passed.
interface: 42/42 tests passed.
compiler: 52/54 tests passed.
bigrams.sh are not identical
bigrams.sh are not identical

github-actions · 2024-12-22T09:51:29Z

OS =
CPU =
Ram =
Hash = 441020a
Kernel=

github-actions · 2024-12-22T09:51:40Z

OS:ubuntu-20.04
Sun Dec 22 09:51:39 UTC 2024
intro: 2/2 tests passed.
interface: 42/42 tests passed.
compiler: 54/54 tests passed.

angelhof

Great job @BolunThompson and @sethsabar!! The only change we need to make is to ensure that the bash tests run in CI (by modifying the test scripts rather than the yml file). In my opinion, it would be better to modify all test files so that every test runs both with and without --bash (avoiding the control flow that exists currently); however, I don't have a strong opinion on that.

angelhof · 2024-12-23T01:33:31Z

evaluation/tests/test_evaluation_scripts.sh

@@ -179,6 +192,9 @@ execute_tests() {
                export pash_output="${intermediary_dir}/${microbenchmark}_${n_in}_pash_output"
                export script_conf=${microbenchmark}_${n_in}
                echo '' > "${pash_time}"
+                if [ "$test_mode" == "bash" ]; then


Do we need this here?

angelhof · 2024-12-23T01:34:45Z

evaluation/tests/test_evaluation_scripts.sh

@@ -96,12 +100,21 @@ pipeline_microbenchmarks=(
 execute_pash_and_check_diff() {
    TIMEFORMAT="%3R" # %3U %3S"
    if [ "$DEBUG" -eq 1 ]; then
-        { time "$PASH_TOP/pa.sh" $@ ; } 1> "$pash_output" 2> >(tee -a "${pash_time}" >&2) &&
-        diff -s "$seq_output" "$pash_output" | head | tee -a "${pash_time}" >&2
+        if [ "$test_mode" == "bash" ]; then


I feel that we can just modify the file to run tests both with --bash and without (modifying the configurations array above).
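
The suggestion above amounts to taking a cross product of shell modes and widths instead of branching on a test mode. The actual test script is bash; the following is only a Python sketch of the idea. The flag strings are copied from the CI log in this thread, while `BASE_FLAGS`, `WIDTHS`, `SHELL_MODES`, and `build_configurations` are hypothetical names that do not come from the PaSh sources.

```python
from itertools import product

# Flags as they appear in the CI log; all names below are hypothetical.
BASE_FLAGS = ["-d", "1", "--assert_all_regions_parallelizable"]
WIDTHS = [2, 8]
SHELL_MODES = [[], ["--bash"]]  # without and with bash parsing


def build_configurations():
    """Return one flag list per (shell mode, width) pair."""
    return [
        BASE_FLAGS + mode + ["--width", str(width)]
        for mode, width in product(SHELL_MODES, WIDTHS)
    ]


configurations = build_configurations()
for flags in configurations:
    print(" ".join(flags))
```

With the modes listed as data, every test sees the same set of configurations and no per-test `if` on the mode is needed.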

angelhof · 2024-12-23T01:35:04Z

scripts/run_tests_bash.sh

@@ -0,0 +1,24 @@
+#!/usr/bin/env bash
+
+set -x e


We should run this script in CI!

angelhof · 2024-12-23T01:35:42Z

evaluation/tests/interface_tests/run.sh

@@ -328,7 +339,7 @@ test_IFS()
 }

 ## We run all tests composed with && to exit on the first that fails
-if [ "$#" -eq 0 ]; then
+if [ "$#" -eq 0 ] || [ "$test_mode" = "bash" ]; then


I would really remove all these control flow checks and make sure that all tests always run both with and without --bash.


github-actions · 2024-12-26T03:09:53Z

OS =
CPU =
Ram =
Hash = 3c2606b
Kernel=

github-actions · 2024-12-26T03:17:41Z

OS =
CPU =
Ram =
Hash = 89bfc84
Kernel=

github-actions · 2024-12-26T03:18:07Z

OS:ubuntu-20.04
Thu Dec 26 03:18:06 UTC 2024
intro: 2/2 tests passed.
interface: 212/214 tests passed.
compiler: 98/108 tests passed.
test_histexp7.sub are not identical
test_unicode3.sub are not identical
shortest_scripts.sh are not identical with flags -d 1 --assert_all_regions_parallelizable --bash --width 2 --output_time /home/runner/work/pash/pash/evaluation/tests/shortest_scripts.sh
shortest_scripts.sh are not identical with flags -d 1 --assert_all_regions_parallelizable --bash --width 8 --output_time /home/runner/work/pash/pash/evaluation/tests/shortest_scripts.sh
deadlock_test.sh are not identical with flags -d 1 --assert_all_regions_parallelizable --bash --width 2 --output_time /home/runner/work/pash/pash/evaluation/tests/deadlock_test.sh
deadlock_test.sh are not identical with flags -d 1 --assert_all_regions_parallelizable --bash --width 8 --output_time /home/runner/work/pash/pash/evaluation/tests/deadlock_test.sh
micro_10.sh are not identical with flags -d 1 --assert_all_regions_parallelizable --bash --width 2 --output_time /home/runner/work/pash/pash/evaluation/tests/micro_10.sh
micro_10.sh are not identical with flags -d 1 --assert_all_regions_parallelizable --bash --width 8 --output_time /home/runner/work/pash/pash/evaluation/tests/micro_10.sh
sed-test.sh are not identical with flags -d 1 --assert_all_regions_parallelizable --bash --width 2 --output_time /home/runner/work/pash/pash/evaluation/tests/sed-test.sh
sed-test.sh are not identical with flags -d 1 --assert_all_regions_parallelizable --bash --width 8 --output_time /home/runner/work/pash/pash/evaluation/tests/sed-test.sh
tr-test.sh are not identical with flags -d 1 --assert_all_regions_parallelizable --bash --width 2 --output_time /home/runner/work/pash/pash/evaluation/tests/tr-test.sh
tr-test.sh are not identical with flags -d 1 --assert_all_regions_parallelizable --bash --width 8 --output_time /home/runner/work/pash/pash/evaluation/tests/tr-test.sh

github-actions · 2024-12-26T03:25:43Z

OS:ubuntu-20.04
Thu Dec 26 03:25:42 UTC 2024
intro: 2/2 tests passed.
interface: 212/214 tests passed.
compiler: 98/108 tests passed.
test_histexp7.sub are not identical
test_unicode3.sub are not identical
shortest_scripts.sh are not identical with flags -d 1 --assert_all_regions_parallelizable --bash --width 2 --output_time /home/runner/work/pash/pash/evaluation/tests/shortest_scripts.sh
shortest_scripts.sh are not identical with flags -d 1 --assert_all_regions_parallelizable --bash --width 8 --output_time /home/runner/work/pash/pash/evaluation/tests/shortest_scripts.sh
deadlock_test.sh are not identical with flags -d 1 --assert_all_regions_parallelizable --bash --width 2 --output_time /home/runner/work/pash/pash/evaluation/tests/deadlock_test.sh
deadlock_test.sh are not identical with flags -d 1 --assert_all_regions_parallelizable --bash --width 8 --output_time /home/runner/work/pash/pash/evaluation/tests/deadlock_test.sh
micro_10.sh are not identical with flags -d 1 --assert_all_regions_parallelizable --bash --width 2 --output_time /home/runner/work/pash/pash/evaluation/tests/micro_10.sh
micro_10.sh are not identical with flags -d 1 --assert_all_regions_parallelizable --bash --width 8 --output_time /home/runner/work/pash/pash/evaluation/tests/micro_10.sh
sed-test.sh are not identical with flags -d 1 --assert_all_regions_parallelizable --bash --width 2 --output_time /home/runner/work/pash/pash/evaluation/tests/sed-test.sh
sed-test.sh are not identical with flags -d 1 --assert_all_regions_parallelizable --bash --width 8 --output_time /home/runner/work/pash/pash/evaluation/tests/sed-test.sh
tr-test.sh are not identical with flags -d 1 --assert_all_regions_parallelizable --bash --width 2 --output_time /home/runner/work/pash/pash/evaluation/tests/tr-test.sh
tr-test.sh are not identical with flags -d 1 --assert_all_regions_parallelizable --bash --width 8 --output_time /home/runner/work/pash/pash/evaluation/tests/tr-test.sh

github-actions · 2024-12-29T03:50:29Z

OS =
CPU =
Ram =
Hash = 190f03e
Kernel=

github-actions · 2024-12-29T03:58:57Z

OS:ubuntu-20.04
Sun Dec 29 03:58:56 UTC 2024
intro: 2/2 tests passed.
interface: 212/214 tests passed.
compiler: 98/108 tests passed.
test_histexp7.sub are not identical
test_unicode3.sub are not identical
shortest_scripts.sh are not identical with flags -d 1 --assert_all_regions_parallelizable --bash --width 2 --output_time /home/runner/work/pash/pash/evaluation/tests/shortest_scripts.sh
shortest_scripts.sh are not identical with flags -d 1 --assert_all_regions_parallelizable --bash --width 8 --output_time /home/runner/work/pash/pash/evaluation/tests/shortest_scripts.sh
deadlock_test.sh are not identical with flags -d 1 --assert_all_regions_parallelizable --bash --width 2 --output_time /home/runner/work/pash/pash/evaluation/tests/deadlock_test.sh
deadlock_test.sh are not identical with flags -d 1 --assert_all_regions_parallelizable --bash --width 8 --output_time /home/runner/work/pash/pash/evaluation/tests/deadlock_test.sh
micro_10.sh are not identical with flags -d 1 --assert_all_regions_parallelizable --bash --width 2 --output_time /home/runner/work/pash/pash/evaluation/tests/micro_10.sh
micro_10.sh are not identical with flags -d 1 --assert_all_regions_parallelizable --bash --width 8 --output_time /home/runner/work/pash/pash/evaluation/tests/micro_10.sh
sed-test.sh are not identical with flags -d 1 --assert_all_regions_parallelizable --bash --width 2 --output_time /home/runner/work/pash/pash/evaluation/tests/sed-test.sh
sed-test.sh are not identical with flags -d 1 --assert_all_regions_parallelizable --bash --width 8 --output_time /home/runner/work/pash/pash/evaluation/tests/sed-test.sh
tr-test.sh are not identical with flags -d 1 --assert_all_regions_parallelizable --bash --width 2 --output_time /home/runner/work/pash/pash/evaluation/tests/tr-test.sh
tr-test.sh are not identical with flags -d 1 --assert_all_regions_parallelizable --bash --width 8 --output_time /home/runner/work/pash/pash/evaluation/tests/tr-test.sh

BolunThompson · 2025-01-04T07:03:29Z

The current problem is that pash expands words in bash mode by calling echo {arg} for every arg and replacing the argument with the output. While this usually works, it doesn’t split words. Without a bash argument expander, words are currently just split naively on IFS, so a command like tr " " " " is parsed as four quotation marks instead of two string arguments (since every character is a CArgChar).

I’m finishing bash_expand.py, which already sketches out using a bash server to expand ASTs before compilation, like the dash code. I’ll send (another) PR to sh-expand with it (hopefully soon!).
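
The difference between naive splitting and quote-aware splitting can be shown with a small Python sketch. It uses the standard library's `shlex.split` purely as a stand-in for a quote-aware splitter; this is an assumption for illustration, not how pash or bash_expand.py works, and `shlex` does not perform bash expansions.

```python
import shlex

command = 'tr " " " "'

# Naive splitting on whitespace ignores quoting, so each quote becomes its own word.
naive_words = command.split()

# shlex.split applies POSIX-style quoting rules (not full bash expansion).
quoted_words = shlex.split(command)

print(naive_words)   # ['tr', '"', '"', '"', '"']
print(quoted_words)  # ['tr', ' ', ' ']
```

The naive result contains four separate quotation marks, which matches the failure described for tr-test.sh, while the quote-aware result yields the two single-space arguments that `tr` expects.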

angelhof · 2025-01-05T01:13:42Z

> The current problem is that pash expands words in bash mode by calling echo {arg} for every arg and replacing the argument with the output. While this usually works, it doesn’t split words. Without a bash argument expander, words are currently just split naively on IFS, so a command like tr " " " " is parsed as four quotation marks instead of two string arguments (since every character is a CArgChar).
>
> I’m finishing bash_expand.py, which already sketches out using a bash server to expand ASTs before compilation, like the dash code. I’ll send (another) PR to sh-expand with it (hopefully soon!).

Ohh, that is a bummer... Happy to discuss the solution if needed!

sethsabar added 21 commits March 19, 2024 17:48

starting to integrate bash into pash

5931c60

[untested] add bash expansion for bash parser

132d180

outline for ast_to_ast updates

48501ef

ast_to_ast updates complete but untested

29f63ff

some testing on ast_to_ast changes

9b2a9a2

outline for ast_to_ir

963c6a9

some work on ast_to_ir

1ea0b9c

fixed bash mode not propagating to JIT engine

ed8f1a0

update requirements for actions tests

e861a45

minor bug fixes and new bash testing script

9218ec0

make changes to test scripts to support bash

ca85a45

some work on the bash testing

958a355

Add bash tests

164ed4b

start work on benchmarking

d73a0cb

updates to the bash benchmark routine

3da7bd2

bug fixed with echo ast, crazy bash speed ups now

dbeaaaa

getting ready to run evaluations

058fd15

bug fix to attain more speedups

46b8559

add another benchmark script

57187d8

nlp test suite

2610376

bash evaluation script

e48ddbf

BolunThompson force-pushed the bash-merge branch from 1ae61d4 to d451f5a Compare December 15, 2024 21:48

BolunThompson added 8 commits December 16, 2024 00:14

Merge branch 'main' of https://github.com/sethsabar/pash into bash-merge

a44723c

Signed-off-by: Bolun Thompson <[email protected]>

Fix: python 3.8 compat

e817bb3

Signed-off-by: Bolun Thompson <[email protected]>

Split up big function

3050c4e

Signed-off-by: Bolun Thompson <[email protected]>

Fix tmpdir in tests

b430454

Signed-off-by: Bolun Thompson <[email protected]>

Add 'encoding="utf-8"' to open calls for writing

c40fa53

The bash tests contain scripts that use non-ASCII (UTF-8) characters. When no encoding is given, Python's open() falls back to the locale's encoding, so writing non-ASCII characters to a file can raise an exception under a non-UTF-8 locale.

Signed-off-by: Bolun Thompson <[email protected]>
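
A minimal sketch of the failure mode and the fix described in this commit message. It passes `encoding="ascii"` explicitly to stand in for a non-UTF-8 locale default, which is an assumption made so the example behaves the same on any machine; the file name and sample text are hypothetical.

```python
import os
import tempfile

text = "echo 'héllo wörld ✓'\n"

with tempfile.TemporaryDirectory() as tmpdir:
    path = os.path.join(tmpdir, "script.sh")

    # An ASCII codec stands in for a non-UTF-8 locale default.
    try:
        with open(path, "w", encoding="ascii") as f:
            f.write(text)
        ascii_failed = False
    except UnicodeEncodeError:
        ascii_failed = True

    # Passing the encoding explicitly makes the write independent of the locale.
    with open(path, "w", encoding="utf-8") as f:
        f.write(text)
    with open(path, encoding="utf-8") as f:
        round_trip = f.read()

print(ascii_failed, round_trip == text)
```

Naming the encoding at every `open` call for writing, as the commit does, keeps the tests from depending on how the CI runner's locale is configured.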

Lint: black bash changes

3b238c7

Signed-off-by: Bolun Thompson <[email protected]>

Add comments

21576ce

Signed-off-by: Bolun Thompson <[email protected]>

Update shasta version

b550e15

Signed-off-by: Bolun Thompson <[email protected]>

BolunThompson force-pushed the bash-merge branch from d451f5a to b550e15 Compare December 16, 2024 00:54

Update libbash version

424bb86

Signed-off-by: Bolun Thompson <[email protected]>

Fix version specifier in requirements

75891c3

BolunThompson marked this pull request as ready for review December 21, 2024 02:08

Rerun CI

441020a

angelhof requested changes Dec 23, 2024


Run bash and dash tests together

3c2606b

Also changes it so that the bash tests run only in bash mode, which I feel is fair since they test bash-only features.

Fix typo

89bfc84

Fix accidental merge removal

190f03e
