Memoize or precompute subgraphs that depend only on input shapes #270

robertknight · 2024-07-05T08:48:14Z

Many models have subgraphs that depend only on the shape of inputs, and thus don't change when the model is called repeatedly with inputs of the same shape. These subgraphs are usually cheap since the tensors flowing through them are small, but there is nevertheless overhead for each operation that is run. These subgraphs could be memoized to avoid re-running them unnecessarily.

As a starting point, it would be useful to do some experiments to see how many operations can be saved on some popular models, especially decoder models which are run repeatedly.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Memoize or precompute subgraphs that depend only on input shapes #270

Memoize or precompute subgraphs that depend only on input shapes #270

robertknight commented Jul 5, 2024 •

edited

Loading

Memoize or precompute subgraphs that depend only on input shapes #270

Memoize or precompute subgraphs that depend only on input shapes #270

Comments

robertknight commented Jul 5, 2024 • edited Loading

robertknight commented Jul 5, 2024 •

edited

Loading