
Using Gemma2 2B model for JS LLM inference #477

Open

iamrajhans wants to merge 1 commit into main

Conversation

@iamrajhans commented Nov 26, 2024

Description

Updates the sample to use the Gemma2 2B model for JS LLM inference.

Fixes # (issue)

Checklist

Please ensure the following items are complete before submitting a pull request:

  • My code follows the code style of the project.
  • I have updated the documentation (if applicable).
  • I have added tests to cover my changes.

Type of Change

Please check the relevant option below:

  • Bug fix (non-breaking change which fixes an issue)
  • Documentation update (non-breaking change which updates documentation)
  • New feature (non-breaking change which adds functionality)
  • Breaking change (fix or feature that would cause existing functionality to not work as expected)

Screenshots

If applicable, please add screenshots to help explain your changes.

Additional Notes

Add any additional information or context about the pull request here.

@iamrajhans (Author) commented:

Hi @schmidt-sebastian @PaulTR,

Could you please review the PR and suggest any changes if required?

Thanks!

```diff
@@ -13,7 +13,7 @@ This web sample demonstrates how to use the LLM Inference API to run common text

 Follow the following instructions to run the sample on your device:
 1. Make a folder for the task, named as `llm_task`, and copy the [index.html](https://github.com/googlesamples/mediapipe/blob/main/examples/llm_inference/js/index.html) and [index.js](https://github.com/googlesamples/mediapipe/blob/main/examples/llm_inference/js/index.js) files into your `llm_task` folder.
-2. Download [Gemma 2B](https://www.kaggle.com/models/google/gemma/frameworks/tfLite/variations/gemma-2b-it-gpu-int4) (TensorFlow Lite 2b-it-gpu-int4 or 2b-it-gpu-int8) or convert an external LLM (Phi-2, Falcon, or StableLM) following the [guide](https://developers.google.com/mediapipe/solutions/genai/llm_inference/web_js#convert-model) (only gpu backend is currently supported), into the `llm_task` folder.
+2. Download [Gemma2 2B](https://www.kaggle.com/models/google/gemma-2/tfLite/gemma2-2b-it-gpu-int8) (TensorFlow Lite 2b-it-gpu-int8 or 2b-it-cpu-int8) or convert an external LLM (Phi-2, Falcon, or StableLM) following the [guide](https://developers.google.com/mediapipe/solutions/genai/llm_inference/web_js#convert-model) (only gpu backend is currently supported), into the `llm_task` folder.
```
Collaborator:

We don't support CPU models on Web.

Collaborator:

(We might also leave both models up until we have an int4 model of Gemma 2)
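
For context on what the updated step exercises, here is a minimal sketch of how a page like the sample's `index.js` might load the downloaded model with the MediaPipe LLM Inference Web API. The local model filename and the generation parameters below are assumptions for illustration, not part of this PR:

```js
import {FilesetResolver, LlmInference} from '@mediapipe/tasks-genai';

// Locate the WASM assets that back the GenAI tasks.
const genai = await FilesetResolver.forGenAiTasks(
    'https://cdn.jsdelivr.net/npm/@mediapipe/tasks-genai/wasm');

// Create the task from the model file placed in the `llm_task` folder.
// Filename is an assumption; use whatever name the Kaggle int8 download has.
const llmInference = await LlmInference.createFromOptions(genai, {
  baseOptions: {modelAssetPath: './gemma2-2b-it-gpu-int8.bin'},
  maxTokens: 1000,
  topK: 40,
  temperature: 0.8,
});

// One-shot generation; generateResponse also accepts a progress callback
// for streaming partial results.
const response = await llmInference.generateResponse('Tell me about MediaPipe.');
console.log(response);
```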
