[major] Add `fd` operand to `printf` statement #213

SpriteOvO · 2024-05-30T12:31:49Z

This PR proposes to add a new operand fd to printf statement, which will allow us to fopen a file and write formatted strings to it in the future. (fopen and fclose will be implemented as intrinsics latter)

sequencer · 2024-05-30T15:23:36Z

spec.md

@@ -2236,8 +2236,9 @@ circuit Foo:
    wire cond: UInt<1>
    wire a: UInt
    wire b: UInt
+    node fd = SInt<32>(-2)


we basically may need a new type in Firrtl and CIRCT, but anyway it LGTM. in CIRCT we need to check the fd shouldn't be used for circuit construction.

seldridge · 2024-05-30T16:31:21Z

This is getting pretty Verilog-specific which I don't really like. Specifically, in order for this to work, it likely also needs an fopen/fclose which returns an ID and that is then used. Otherwise, this is only usable for writing to stdout/stderr.

Is the motivation for this to write to specific files or to select between stdout/stderr?

If the motivation is to write to different files, then I'd prefer a Chisel/post-processing solution:

Chisel provides common infrastructure for logging. E.g., log-levels, common formats, inclusion of %m to indicate what module something was in.
There is a separate tool that can filter the raw log to extract specific information.

The end result of this is the same as writing to different files. However, it avoids having to introduce a real notion of a file descriptor in FIRRTL.

SpriteOvO · 2024-05-30T17:42:44Z

Is the motivation for this to write to specific files or to select between stdout/stderr?

The fundamental motivation for this is to enable the SystemVerilog generated by Chisel&CIRCT can be fwrite to different files by the simulator. The further goal is to allow Chisel to automatically split files by layer.

Specifically, in order for this to work, it likely also needs an fopen/fclose which returns an ID and that is then used. Otherwise, this is only usable for writing to stdout/stderr.

Sure, I'm working on adding fopen and fclose as FIRRTL intrinsics in CIRCT, and I should have a PR open in CIRCT repo in the next few days.

If the motivation is to write to different files, then I'd prefer a Chisel/post-processing solution:

Chisel provides common infrastructure for logging. E.g., log-levels, common formats, inclusion of %m to indicate what module something was in.

There is a separate tool that can filter the raw log to extract specific information.

The end result of this is the same as writing to different files. However, it avoids having to introduce a real notion of a file descriptor in FIRRTL.

If I understand correctly, printf/fwrite will be handled by the simulator, while Chisel post-processing can only output on elaboration-time anyway.

sequencer · 2024-05-30T23:18:36Z

The reason of post processing is hard is we also want write a giant log into file while keeping stdout clean for other usages e.g. uart.

seldridge · 2024-05-31T14:11:50Z

The reason of post processing is hard is we also want write a giant log into file while keeping stdout clean for other usages e.g. uart.

Thinking about this... Would it be acceptable to pipe all of the output (stdout, stderr, or both) into a script which then does the splitting for you, while leaving some things on stdout/stderr? Without writing the script:

$ ./run-verilator | slice-outputs
UART
UART
UART
$ ls
info.log warn.log debug.log layer-1.log layer2.log

sequencer · 2024-05-31T14:15:22Z

Thinking about this... Would it be acceptable to pipe all of the output (stdout, stderr, or both) into a script which then does the splitting for you, while leaving some things on stdout/stderr? Without writing the script:

This is doable, but for a user experience, this is a bad design that each user need to writing their own logging script...

seldridge · 2024-05-31T15:16:56Z

If we did do this, then it's reasonable to both build the logging library in Chisel and provide the script for working with this in Chisel as well. 🤓

sequencer · 2024-06-01T03:09:05Z

If we did do this, then it's reasonable to both build the logging library in Chisel and provide the script for working with this in Chisel as well. 🤓

I do agree, after getting fd into Chisel, I think Chisel can provide a slim package chisel3.logging to provide a logging functionality:)
It should be able to log to machine readable Json like where we did in T1

Since Chisel doesn't support UVM-based Scoreboard functionality, the logging package can be used as a utility to do the offline checking. e.g. Trace event of each core, doing check after simulation is done(not at simulation runtime or UVM check_phase).

seldridge · 2024-06-01T03:48:16Z

Err... I'm saying that I'm not convinced of the need for the file descriptor and think the logging can be packaged up with a library in Chisel and a script to post-process it.

Alternatively, a file descriptor is something that is circuit-global and is said to be open for the duration of the simulation/evaluation of a circuit.

WDYT?

@darthscsi: WDYT?

sequencer · 2024-06-01T03:59:29Z

I think the situation is the stdout is not only be used by chisel, but also be used by other programs on the supply chain, e.g. UVM, C libraries. We cannot force them to align with a same pattern of logging.
While delivering RTL as an IP, if customers want integrate the RTL to their own verification environment, delivery of a program to process the stdout is also an anti-pattern methodology.

seldridge · 2024-06-01T20:20:28Z

Going off of what I was suggesting above... Something like the following may make more sense:

circuit Foo:
  file Bar, "Bar.log", read
  public module Foo:
    input clock: Clock
    input a: UInt<1>

    printf(clock, UInt<1>(1), Bar, "hello world")

Alternatively, just putting the filename in the printf may be fine and it's up to the lowering to figure out what file descriptors to create:

printf(clock, UInt<1>(1), file="Bar.log", "hello world")

Basically, what is the right way to get a printf to a specific file? Can we avoid the need for low-level file descriptors?

sequencer · 2024-06-02T03:06:13Z

circuit Foo:
  file Bar, "Bar.log", read
  public module Foo:
    input clock: Clock
    input a: UInt<1>

    printf(clock, UInt<1>(1), Bar, "hello world")

I thought about it, maybe the file handler should for each public module or instance?

Basically, what is the right way to get a printf to a specific file? Can we avoid the need for low-level file descriptors?

I think this could be a good solution, making it a firtool emit option. This can be minimal overhead and being filesystem agnostic, however the problem is emission to a same file may be a problem for the simulator threading. Maybe we can have three options:

Stdout
SingleFile
FilePerInstance

WDYT?

uenoku · 2024-06-02T13:15:55Z

There is a limitation for the number of the files that a single process can open (ulimit -n) so file per instance approach may not work practically. So it may be possible that users want to close files during the simulation. That said I agree that such resource management is super low-level and it's reasonable to use circuit global file descriptors and not to allow manual fclose.

darthscsi · 2024-06-02T22:30:29Z

I'm actually quite amenable to this provided:

FD can be generated in the spec. You can't have to go to intrinsics to open/close an FD.
all ops which produce messages can use FDs. Particularly, the weirdness with messages on asserts means they should be able to be directed to FDs. I'd rather that capability be removed from those ops (write the printfs you want for the asserts yourself), but if we aren't doing that, we should be consistent.
A sane Chisel-level interface for this exists.
FDs should be a type not usable in most ops (no addition, not on ports, etc).
They are explicit about their interaction with Verilog FDs, C FDs, and platform FDs (e.g. they should not be a new thing from those, just a representation in FIRRTL of the underlying FDs)

Reasoning:
This is a very common piece of functionality for a reason and requested often for those reasons. Post-processing STDOUT is a hack which introduces tool-dependent filtering, imprecision, and unnecessary indirection and overhead. FIRRTL is a fairly low-level IR anyway, so I don't buy the "fds are too low level". The better argument is "fds are too simulationy". This capability matches every other language's mechanism. Interoperability with C is highly desirous, especially as DPI is rolled out. FD as an abstraction is platform agnostic (due mostly to posix and C), even on windows this is fine.

Concerns:
Execution order, having a way to have global file handles (not reopening in each module, potential buffering issues there), Mixing FD with other data-types, how to get file handles to all the places which need them without passing them in ports.

SpriteOvO force-pushed the printf-fd branch 2 times, most recently from 90153ea to d6c252d Compare May 30, 2024 15:11

sequencer reviewed May 30, 2024

View reviewed changes

[major] Add fd operand to printf statement

cb28546

SpriteOvO force-pushed the printf-fd branch from d6c252d to cb28546 Compare May 30, 2024 22:31

SpriteOvO mentioned this pull request Jun 1, 2024

[FIRRTL][Sim] Add fopen and fclose intrinsics, add fd operand to printf llvm/circt#7111

Draft

SpriteOvO marked this pull request as ready for review June 1, 2024 01:08

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[major] Add `fd` operand to `printf` statement #213

[major] Add `fd` operand to `printf` statement #213

SpriteOvO commented May 30, 2024

sequencer May 30, 2024

seldridge commented May 30, 2024

SpriteOvO commented May 30, 2024 •

edited

Loading

sequencer commented May 30, 2024

seldridge commented May 31, 2024

sequencer commented May 31, 2024

seldridge commented May 31, 2024

sequencer commented Jun 1, 2024

seldridge commented Jun 1, 2024

sequencer commented Jun 1, 2024

seldridge commented Jun 1, 2024

sequencer commented Jun 2, 2024

uenoku commented Jun 2, 2024

darthscsi commented Jun 2, 2024

[major] Add fd operand to printf statement #213

Are you sure you want to change the base?

[major] Add fd operand to printf statement #213

Conversation

SpriteOvO commented May 30, 2024

sequencer May 30, 2024

Choose a reason for hiding this comment

seldridge commented May 30, 2024

SpriteOvO commented May 30, 2024 • edited Loading

sequencer commented May 30, 2024

seldridge commented May 31, 2024

sequencer commented May 31, 2024

seldridge commented May 31, 2024

sequencer commented Jun 1, 2024

seldridge commented Jun 1, 2024

sequencer commented Jun 1, 2024

seldridge commented Jun 1, 2024

sequencer commented Jun 2, 2024

uenoku commented Jun 2, 2024

darthscsi commented Jun 2, 2024

[major] Add `fd` operand to `printf` statement #213

[major] Add `fd` operand to `printf` statement #213

SpriteOvO commented May 30, 2024 •

edited

Loading