๐A curated list of Awesome LLM Inference Papers with codes, such as FlashAttention, PagedAttention, Parallelism etc. ๐๐
sora
llm
llms
vllm
llm-inference
awesome-llm
flash-attention
flash-attention-2
tensorrt-llm
paged-attention
deepseek
open-sora
flash-attention-3
-
Updated
Nov 25, 2024