Change the repository type filter
All
Repositories list
8 repositories
SqueezedAttention
PublicTool2Vec
PublicEfficient and Scalable Estimation of Tool Representations in Vector SpaceTinyAgent
PublicKVQuant
Public[NeurIPS 2024] KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache QuantizationSqueezeLLM
Public[ICML 2024] SqueezeLLM: Dense-and-Sparse QuantizationLLMCompiler
Public[ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function CallingLLM2LLM
Public[ACL 2024] LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancementopen_source_projects
Public