Add docker container support #1271
base: main
Conversation
Interested too, so I can integrate it into https://github.com/stellar-amenities/assistants -- e.g. a one-liner deployment of an open source Assistants API!
@junrushao Just updated for the new SLM JIT flow. Please review, test, and merge soon.
@louis030195 that's one cool project you have going 😍 At long last, these containers are ready -- mlc_llm now supports 88+ models, with more being added rapidly (expect hundreds by the end of the year). Batching (several concurrent inferences) is also working, as is support for function calling on some models. Please give it a whirl!
@Sing-Li maybe a stupid question, but how do containers affect performance? Do containers fully use the NVIDIA GPU and other AI accelerators? I know that using the Apple accelerator through Docker is impossible?
@louis030195 great questions!
From my experience, there is almost no tangible impact. I think this is due to the essentially "pass-through" engineering done for the GPU (there is no virtualization layer for ROCm or CUDA). In fact, if you have a tunable container host, you can get better deterministic performance out of the CPU part of your application (and possibly improve overall performance). Outside of ROCm and CUDA -- which have both had over a decade of engineering evolution -- I don't know.
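For reference, here is a rough sketch of what that pass-through looks like when launching a serving container via the docker-py SDK (the equivalent of `docker run --gpus all`). The image name and port below are placeholders for illustration, not necessarily what this PR ships:

```python
# Sketch only: start an mlc-llm serving container with all GPUs passed through.
# "mlcai/mlc-llm-serve:latest" and port 8000 are assumed names for illustration.
import docker

client = docker.from_env()

container = client.containers.run(
    "mlcai/mlc-llm-serve:latest",      # assumed image name
    detach=True,
    ports={"8000/tcp": 8000},          # assumed REST API port
    device_requests=[
        # Request every GPU; the NVIDIA container toolkit hands the devices
        # straight to the container, so there is no virtualization layer
        # sitting in the data path.
        docker.types.DeviceRequest(count=-1, capabilities=[["gpu"]])
    ],
)
print(container.logs(tail=20).decode())
```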
GPU only, and only because of current industry pressure and earlier works-better-for-"gaming" engineering that has been repurposed for AI/ML by open source efforts like MLC-AI. Unfortunately, I think commercial economics will always prevent open source container tech from supporting proprietary "competitive differentiator" AI accelerators.
As far as my research goes -- impossible. It is also unlikely that anyone from Apple will do anything to help. So we use a simple forwarding proxy to the actual host-based, Metal-accelerated server to maintain deployment compatibility. The cool thing about Apple (and other upcoming unified memory implementations -- such as the flood of Qualcomm Elite machines) is that built-in multitasking at the operating system level is good enough to run multiple different models concurrently on a single system, as long as you have enough RAM (no need for tightly coupled, expensive GPU memory).
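To make the forwarding-proxy idea concrete, here is a minimal sketch (not the actual code in this PR): the container cannot touch the Metal GPU, so it simply relays incoming requests to an mlc-llm server running natively on the macOS host. The host address and ports are assumptions.

```python
# Minimal forwarding proxy sketch: runs inside the container and forwards
# requests to a Metal-accelerated server on the host. Ports are assumed.
import http.server
import urllib.request

HOST_SERVER = "http://host.docker.internal:8081"  # assumed host-side server


class ForwardingProxy(http.server.BaseHTTPRequestHandler):
    def do_POST(self):
        # Read the incoming request body and replay it against the host server.
        body = self.rfile.read(int(self.headers.get("Content-Length", 0)))
        req = urllib.request.Request(
            HOST_SERVER + self.path,
            data=body,
            headers={"Content-Type": self.headers.get("Content-Type", "application/json")},
        )
        with urllib.request.urlopen(req) as resp:
            payload = resp.read()
            self.send_response(resp.status)
            self.send_header("Content-Type", resp.headers.get("Content-Type", "application/json"))
            self.send_header("Content-Length", str(len(payload)))
            self.end_headers()
            self.wfile.write(payload)


if __name__ == "__main__":
    # Clients talk to the container as if it were serving the model itself.
    http.server.HTTPServer(("0.0.0.0", 8000), ForwardingProxy).serve_forever()
```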
Add frequently requested Docker container support for serving REST APIs (for AI app developers who want to use supported mlc-llm models on their development machines / workstations / clusters).
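As a usage sketch, an app developer could talk to the containerized REST server roughly like this. The port, endpoint path, model id, and response shape below are assumptions for illustration; check the container documentation for the actual values.

```python
# Sketch: call the containerized REST API. Endpoint, port, model id, and
# response schema are assumed (OpenAI-style chat completions), not confirmed.
import json
import urllib.request

payload = {
    "model": "Llama-2-7b-chat-hf-q4f16_1",  # assumed model id
    "messages": [{"role": "user", "content": "Hello from inside Docker!"}],
}
req = urllib.request.Request(
    "http://localhost:8000/v1/chat/completions",  # assumed endpoint
    data=json.dumps(payload).encode(),
    headers={"Content-Type": "application/json"},
)
with urllib.request.urlopen(req) as resp:
    reply = json.loads(resp.read())
    print(reply["choices"][0]["message"]["content"])
```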