Bundle all standalone models in a single project. #8

mukel · 2024-07-30T09:45:18Z

So far I've ported the following models to Java:
Llama 3 & 3.1, Mistral/Codestral/Mathstral/Nemostral (+ Tekken tokenizer), Qwen2, Phi3 and Gemma 1 & 2 ...
All models are bundled as a single ~2K lines Java file with no dependencies, at this point, maintaining all these is a burden, some components can be shared e.g. GGUF parser, tensors, tokenizers, samplers, chat formats, even the inference (e.g. same for Mistral and Llama), would be great to have all of them as a single project.

All behind a common, low-level inference API e.g. forward implementation. Note that this is not meant high-level abstraction like langchain4j, but a low-level inference engine that can be used as a backend by langchain4j.

geoand · 2024-10-22T11:53:45Z

Note that this is not meant high-level abstraction like langchain4j, but a low-level inference engine that can be used as a backend by langchain4j

I am interested in building the latter based on this :).
Is there anything I should be aware of?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Bundle all standalone models in a single project. #8

Bundle all standalone models in a single project. #8

mukel commented Jul 30, 2024

geoand commented Oct 22, 2024

Bundle all standalone models in a single project. #8

Bundle all standalone models in a single project. #8

Comments

mukel commented Jul 30, 2024

geoand commented Oct 22, 2024