wsxhjnb1 / KVQuant Public

Notifications You must be signed in to change notification settings
Fork 0
Star 0

The CUDA kernel for KV Cache quantization and dequantization for LLM

0 stars 0 forks Branches Tags Activity

Notifications

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
csrc		csrc
LICENSE		LICENSE
README.md		README.md

Repository files navigation

KVQuant

The CUDA kernel for KV Cache quantization and dequantization for LLM

About

The CUDA kernel for KV Cache quantization and dequantization for LLM

Report repository

Releases

No releases published

Packages

No packages published

Languages