add tensorrt-llm to inference #89
Conversation
Please squash the commits at the same time.
README.md
@@ -69,6 +69,8 @@
| **[Triton Inference Server](https://github.com/triton-inference-server/server)** | ![Stars](https://img.shields.io/github/stars/triton-inference-server/server.svg) | ![Release](https://img.shields.io/github/release/triton-inference-server/server) | ![Contributors](https://img.shields.io/github/contributors/triton-inference-server/server) | The Triton Inference Server provides an optimized cloud and edge inferencing solution. |
| **[Text Generation Inference](https://github.com/huggingface/text-generation-inference)** | ![Stars](https://img.shields.io/github/stars/huggingface/text-generation-inference.svg) | ![Release](https://img.shields.io/github/release/huggingface/text-generation-inference) | ![Contributors](https://img.shields.io/github/contributors/huggingface/text-generation-inference) | Large Language Model Text Generation Inference |
| **[vLLM](https://github.com/vllm-project/vllm)** | ![Stars](https://img.shields.io/github/stars/vllm-project/vllm.svg) | ![Release](https://img.shields.io/github/release/vllm-project/vllm) | ![Contributors](https://img.shields.io/github/contributors/vllm-project/vllm) | A high-throughput and memory-efficient inference and serving engine for LLMs |
| **[TensorRT-LLM](https://github.com/NVIDIA/TensorRT-LLM)** | ![Stars](https://img.shields.io/github/stars/NVIDIA/TensorRT-LLM) | ![Release](https://img.shields.io/github/v/release/NVIDIA/TensorRT-LLM) | ![Contributors](https://img.shields.io/github/contributors/NVIDIA/TensorRT-LLM) | TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. |
We keep the names in alphabetical order, so please reorder this. Thanks!
/kind documentation
Kindly ping @Jerry-Kon
complete the merge requirement
Could you squash the commits? Thanks.
squash the history commit
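Since the reviewers ask for the commits to be squashed, here is a minimal sketch of one way to do it, using `git reset --soft` as a non-interactive alternative to `git rebase -i HEAD~N`. The repository, file names, and commit messages below are hypothetical placeholders, demonstrated in a throwaway repository rather than the actual PR branch:

```shell
#!/bin/sh
# Sketch: squash the last two commits into one (hypothetical repo and messages).
set -e
repo=$(mktemp -d)
cd "$repo"
git init -q
git config user.email "demo@example.com"   # placeholder identity for the demo
git config user.name "Demo"

# Three commits: an initial one plus two that should be squashed.
echo base > README.md && git add README.md && git commit -qm "initial"
echo row >> README.md && git commit -aqm "add tensorrt-llm row"
echo fix >> README.md && git commit -aqm "fix ordering"

# Move HEAD back two commits; the combined changes stay staged.
git reset --soft HEAD~2
git commit -qm "add tensorrt-llm to inference"

git rev-list --count HEAD   # prints 2: initial + squashed commit
```

On the real PR branch the squashed history would then be pushed with `git push --force-with-lease`, which refuses to overwrite remote commits you have not seen locally.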
/lgtm Thanks!
/kind documentation