Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

TODO features #1

Open
5 of 10 tasks
namtranase opened this issue Feb 23, 2024 · 8 comments
Open
5 of 10 tasks

TODO features #1

namtranase opened this issue Feb 23, 2024 · 8 comments
Assignees

Comments

@namtranase
Copy link
Owner

namtranase commented Feb 23, 2024

  • More clear instructions to run the model
  • Add completion feature (python)
  • Add chat feature (python)
  • Add samples folders with several examples
  • Check the llama-cpp-python repo
  • Add batching infer feature (python)
  • Add quantization methods
  • Add server API support
  • Add more arguments to control the model (based on the libgemma interface)
  • Add the stream return for the completion function
@namtranase namtranase self-assigned this Feb 23, 2024
@namtranase namtranase pinned this issue Feb 23, 2024
@namtranase
Copy link
Owner Author

@bachvudinh add features that you think we should do here

@namtranase
Copy link
Owner Author

google/gemma.cpp#52
I found the dicussion about the server implementation, would be awesome if we can make it in this repo also.

@namtranase
Copy link
Owner Author

The server implementation from llama.cpp could be a rich resource for us to improve the examples:
https://github.com/ggerganov/llama.cpp/blob/master/examples/server/server.cpp

@bachvudinh
Copy link
Collaborator

Thank you for the suggestion, @namtranase. I'll delve into their code to see if we can adapt their ideas for our project.

@namtranase
Copy link
Owner Author

@bachvudinh @phamson02 any suggestions for new features/applications are welcome!

@namtranase
Copy link
Owner Author

https://www.kaggle.com/competitions/data-assistants-with-gemma
Could be useful for gemma-cpp-python/examples!

@omkar806
Copy link

add a feature like open ai api endpoint ...
I would like to have feature like this most probably using fast api

@namtranase
Copy link
Owner Author

Thank you @omkar806 for comment this feature, we will check on it and maybe make a new release with it!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants