Support for LLaVA #88

Merged
merged 19 commits into exo-explore:main on Jul 30, 2024
Conversation

varshith15 (Contributor) commented Jul 26, 2024

  • Single Machine
  • Sharding
  • Generate function integration
  • Clean up code (deduplication)

AlexCheema (Contributor):

Looks good so far!

Assigned you on the bounty https://docs.google.com/spreadsheets/d/1cTCpTIp48UnnIvHeLEUNg1iMy_Q6lRybgECSFCoVJpE/edit

varshith15 (Contributor, Author) commented Jul 27, 2024

hey @AlexCheema

  • the vision model is fully on the first shard; the language model is sharded across machines
  • integrated LLaVA into the generate function as well
  • only the main.py integration is left, which I feel should be a separate PR (it could introduce breaking changes)
  • there are no breaking changes to the current flow
    Please review and merge
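The split described above can be sketched in plain Python. This is an illustrative sketch only, not exo's actual API: the `Shard` dataclass and the module names (`vision_tower`, `multi_modal_projector`, `language_model.layers.*`) are assumptions for demonstration; the point is that the vision encoder loads only where the image enters the pipeline, while the language-model layers are partitioned by index.

```python
# Hypothetical sketch of the sharding scheme: vision encoder entirely on the
# first shard, language-model transformer layers split across shards.
from dataclasses import dataclass

@dataclass
class Shard:
    start_layer: int  # first language-model layer owned by this shard
    end_layer: int    # last language-model layer owned by this shard
    n_layers: int     # total number of language-model layers

    def is_first(self) -> bool:
        return self.start_layer == 0

def build_shard_modules(shard: Shard) -> list:
    """Return the (illustrative) module names this shard must load."""
    modules = []
    if shard.is_first():
        # The vision tower and projector are only needed where images enter.
        modules += ["vision_tower", "multi_modal_projector"]
    modules += [f"language_model.layers.{i}"
                for i in range(shard.start_layer, shard.end_layer + 1)]
    return modules

# Example: a 32-layer language model split across two machines.
first = build_shard_modules(Shard(0, 15, 32))
second = build_shard_modules(Shard(16, 31, 32))
```

Only the first shard carries the vision modules; every other shard holds just its slice of transformer layers.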

@varshith15 varshith15 marked this pull request as ready for review July 27, 2024 19:20
AlexCheema (Contributor):

We should merge #87 before this, as there are some relevant changes.
@varshith15 you can merge that branch already if you want, and then I will merge yours as soon as the other is merged.

AlexCheema (Contributor):

With the changes from #87, we should try to re-use as much as possible from mlx_lm; there is a lot of copied code here, when it should only be necessary to copy one or two classes from there and modify them.

AlexCheema (Contributor):

Is the test_sharded_llava.py test passing for you?
I thought we'd need to load an mlx-converted model, but it looks like this is the original ordinary model from HF llava-hf/llava-1.5-7b-hf

AlexCheema (Contributor):

> Is the test_sharded_llava.py test passing for you? I thought we'd need to load an mlx-converted model, but it looks like this is the original ordinary model from HF llava-hf/llava-1.5-7b-hf

NVM, this works fine :)

if self.shard.start_layer <= i <= self.shard.end_layer:
    self.layers.append(TransformerBlock(config=config))
else:
    self.layers.append(IdentityBlock())
varshith15 (Contributor, Author):

this isn't needed, right @AlexCheema?

AlexCheema (Contributor):

It is with the new convention, I think
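The convention in the snippet under discussion can be illustrated with a runnable toy version. Plain Python stand-ins replace the real mlx modules here (the `TransformerBlock` below just adds 1 so the effect is visible); the idea is that layers outside this shard's range become pass-through `IdentityBlock`s, which keeps layer indices aligned across all shards.

```python
# Toy illustration of the shard layer-selection convention: out-of-range
# layers are inert IdentityBlocks, so indexing stays consistent everywhere.
class IdentityBlock:
    def __call__(self, x):
        return x  # pass input through unchanged

class TransformerBlock:
    def __init__(self, config=None):
        self.config = config
    def __call__(self, x):
        return x + 1  # stand-in for real attention + MLP work

def build_layers(n_layers, start_layer, end_layer, config=None):
    layers = []
    for i in range(n_layers):
        if start_layer <= i <= end_layer:
            layers.append(TransformerBlock(config=config))
        else:
            layers.append(IdentityBlock())
    return layers

# A shard owning layers 2..5 of an 8-layer model only does work in that range.
layers = build_layers(8, 2, 5)
x = 0
for layer in layers:
    x = layer(x)
# x == 4: only the four in-range blocks modified the input
```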

varshith15 (Contributor, Author):

also, thanks for pushing the other changes @AlexCheema, they make a lot of sense

AlexCheema (Contributor) commented Jul 28, 2024

It would be great to hook this up to chatgpt_api.py. Do you want to give that a go, @varshith15? I.e., allow the user to give an image as input, similar to how the ChatGPT vision API works.

varshith15 (Contributor, Author):

@AlexCheema done PRM
the logic to use the pixel_values only in the first pass fits in perfectly with the send_prompt / send_tensor idea (really cool idea)
I've verified it's backward compatible as well.
The only change left is the UI one; I am a little occupied so I can't pick that up right now.
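The first-pass-only pixel_values logic mentioned above can be sketched as follows. This is a hedged illustration, not exo's actual code: `first_shard_forward`, `embed_with_image`, `embed`, and `run_layers` are hypothetical names standing in for the real interfaces; the point is that image data rides along only with the initial prompt, and later passes (and all inter-shard hops) carry text/hidden-state tensors only.

```python
# Hypothetical sketch: pixel_values accompany only the initial prompt
# (send_prompt); subsequent passes are tensor-only (send_tensor), so
# text-only models and later decode steps follow the unchanged path.
def first_shard_forward(model, input_ids, pixel_values=None):
    if pixel_values is not None:
        # First pass only: fuse image features into the prompt embedding.
        embeds = model.embed_with_image(input_ids, pixel_values)
    else:
        # Every later pass (and plain text prompts): text embedding only.
        embeds = model.embed(input_ids)
    return model.run_layers(embeds)

class DummyModel:
    """Tiny stand-in model that records which embedding path was taken."""
    def embed_with_image(self, ids, pix):
        return ("img", ids)
    def embed(self, ids):
        return ("txt", ids)
    def run_layers(self, embeds):
        return embeds

with_image = first_shard_forward(DummyModel(), [1, 2], pixel_values=[0.5])
text_only = first_shard_forward(DummyModel(), [1, 2])
```

Because the `pixel_values=None` branch is identical to the pre-LLaVA path, text-only requests are unaffected, which is what makes the change backward compatible.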

AlexCheema (Contributor):

> @AlexCheema done PRM the logic to use the pixel_values only in the first pass fits in perfectly with the send_prompt, send_tensor idea (really cool idea) i've verified, its backward compatible as well only change left is the UI one, I am little occupied so can't pick that up rn

Woah! Incredible work. You even figured out how to generate the grpc service. I’m catching a flight now but will be able to look properly once I arrive.

I can fix up a small UI.

email me at [email protected] for the bounty.

really awesome work, you rock @varshith15

@AlexCheema AlexCheema merged commit 0ec77e1 into exo-explore:main Jul 30, 2024
3 checks passed
AlexCheema (Contributor):

Changed a few things and added support for image upload to tiny chat.
Awesome work @varshith15! Please email me at [email protected] for the bounty reward!

dan-online pushed a commit to bytebolt-media/exo that referenced this pull request Aug 28, 2024