Bundle all standalone models in a single project. #8

Open
mukel opened this issue Jul 30, 2024 · 1 comment

Comments

@mukel
Owner

mukel commented Jul 30, 2024

So far I've ported the following models to Java:
Llama 3 & 3.1, Mistral/Codestral/Mathstral/Nemostral (+ Tekken tokenizer), Qwen2, Phi3 and Gemma 1 & 2 ...
Each model is bundled as a single ~2K-line Java file with no dependencies. At this point, maintaining all of these is a burden. Many components can be shared, e.g. the GGUF parser, tensors, tokenizers, samplers, chat formats, and even the inference code (e.g. it is the same for Mistral and Llama). It would be great to have all of them in a single project.

All behind a common, low-level inference API, e.g. a forward implementation. Note that this is not meant to be a high-level abstraction like langchain4j, but a low-level inference engine that langchain4j can use as a backend.
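To make the idea concrete, here is a minimal sketch of what such a shared API could look like. Every name in it (`InferenceModel`, `Sampler`, `Greedy`, `Generator`, `CountingModel`) is hypothetical and not taken from the existing ports. It only assumes that each model can expose a `forward` step returning logits, so that sampling and the generation loop can be written once and reused.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical API sketch: none of these types exist in the current single-file ports.
interface InferenceModel {
    int vocabularySize();

    // Runs one step for `token` at `position` and returns logits over the vocabulary.
    float[] forward(int token, int position);
}

// Shared across models: turns logits into the next token id.
interface Sampler {
    int sample(float[] logits);
}

// Greedy (argmax) sampling, the simplest possible sampler.
final class Greedy implements Sampler {
    @Override
    public int sample(float[] logits) {
        int best = 0;
        for (int i = 1; i < logits.length; i++) {
            if (logits[i] > logits[best]) {
                best = i;
            }
        }
        return best;
    }
}

// Model-agnostic generation loop built only on forward() and a Sampler.
final class Generator {
    static List<Integer> generate(InferenceModel model, Sampler sampler,
                                  int[] prompt, int maxNewTokens, int stopToken) {
        if (prompt.length == 0) {
            throw new IllegalArgumentException("prompt must not be empty");
        }
        float[] logits = null;
        int position = 0;
        // Feed the prompt; only the logits of the last prompt token are needed.
        for (int token : prompt) {
            logits = model.forward(token, position++);
        }
        List<Integer> generated = new ArrayList<>();
        for (int i = 0; i < maxNewTokens; i++) {
            int next = sampler.sample(logits);
            if (next == stopToken) {
                break;
            }
            generated.add(next);
            logits = model.forward(next, position++);
        }
        return generated;
    }
}

// Toy stand-in for a real model: always predicts (token + 1) % vocabularySize.
final class CountingModel implements InferenceModel {
    private final int vocabularySize;

    CountingModel(int vocabularySize) {
        this.vocabularySize = vocabularySize;
    }

    @Override
    public int vocabularySize() {
        return vocabularySize;
    }

    @Override
    public float[] forward(int token, int position) {
        float[] logits = new float[vocabularySize];
        logits[(token + 1) % vocabularySize] = 1.0f;
        return logits;
    }
}
```

With the toy model, `Generator.generate(new CountingModel(5), new Greedy(), new int[]{0, 1}, 10, 4)` yields `[2, 3]`: generation stops when the stop token `4` is sampled. A real port would replace `CountingModel` with a Llama or Mistral implementation and keep the loop and sampler unchanged, and a higher-level library could sit on top of `generate`.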

@geoand

geoand commented Oct 22, 2024

> Note that this is not meant high-level abstraction like langchain4j, but a low-level inference engine that can be used as a backend by langchain4j

I am interested in building the latter based on this :).
Is there anything I should be aware of?
