Skip to content

Issues: robertknight/rten

8-bit quantization MVP
#347 opened Sep 6, 2024 by robertknight
Open
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

Investigate skipping packing of the A / LHS matrix in GEMM operations performance Issues that affect model inference or loading performance
#415 opened Nov 24, 2024 by robertknight
Support "real" boolean tensors
#414 opened Nov 24, 2024 by robertknight
Support fusing Transpose + MatMul where both inputs are transposed performance Issues that affect model inference or loading performance
#398 opened Oct 29, 2024 by robertknight
WASM relaxed SIMD support
#389 opened Oct 18, 2024 by robertknight
VALID auto_pad value
#382 opened Oct 14, 2024 by igor-yusupov
Fuse pointwise operations into matmul / convolution operations performance Issues that affect model inference or loading performance
#371 opened Sep 21, 2024 by robertknight
Implement better depthwise convolution kernels performance Issues that affect model inference or loading performance
#370 opened Sep 21, 2024 by robertknight
8-bit quantization MVP
#347 opened Sep 6, 2024 by robertknight
6 of 10 tasks
Adjust default thread count on Apple Silicon systems performance Issues that affect model inference or loading performance
#342 opened Sep 2, 2024 by robertknight
Align ReduceMin / ReduceMax etc. handling of empty tensors with spec Spec compliance Issues with RTen behavior not matching the ONNX specifications
#341 opened Sep 1, 2024 by robertknight
Prepack weights when model is loaded performance Issues that affect model inference or loading performance
#214 opened May 27, 2024 by robertknight
Make unary ops more efficient with non-contiguous inputs performance Issues that affect model inference or loading performance
#192 opened May 20, 2024 by robertknight
1 of 2 tasks
Run tests under AddressSanitizer (and possibly other sanitizers) qa Quality / correctness checks
#151 opened May 5, 2024 by robertknight
Validate operator input counts tooling Tools for debugging / profiling etc.
#133 opened Apr 29, 2024 by robertknight
Enable re-using pool across graph executions performance Issues that affect model inference or loading performance
#122 opened Apr 26, 2024 by robertknight
Run tests under WebAssembly in CI qa Quality / correctness checks WebAssembly
#93 opened Apr 14, 2024 by robertknight
Document rten CLI tool documentation Improvements or additions to documentation
#52 opened Feb 8, 2024 by robertknight
Convert quantized models
#42 opened Jan 20, 2024 by igor-yusupov
ProTip! Type g p on any issue or pull request to go back to the pull request listing page.