[WIP] Codebase Refactor #22

Draft: wants to merge 120 commits into main

@maddox-j maddox-j commented Nov 4, 2024

Adding a WIP PR to manage the codebase refactor merge to main

shawseanyang and others added 30 commits August 16, 2024 21:11
* add init to panza to turn it into a package

* add pyproject.toml but no dependencies yet

* add the rest of the panzamail dependencies :)

* install dependencies based on pyproject.toml instead of raw pip and conda commands
* Add Ollama inference

* expose Panza as a web server

* add api keys to env variables and check in server

* check api key

* switch to fastapi to prevent model reloading

* Add ollama-backed streaming HTTP server

---------

Co-authored-by: Armand Nicolicioiu <[email protected]>
Eugenia Iofinova and others added 7 commits October 28, 2024 12:48
Revert "remove some unused .sh files"

This reverts commit 233083e.
- Fixed an error raised by json dumps for Message and mboxMessage objects
- Clarified the email and username requirements
- Changed the wandb_disabled default to true to match the README

Need to rename, and to link back to the original README


If running with Ollama, then Ollama needs to be installed from the [web page](https://ollama.com/).

Then, you will need to convert your model into a GGUF file.

Is it beneficial to add more support for this?

README_panza3.md Outdated

- To run Panza after a full training run, try something like `CUDA_VISIBLE_DEVICES=0 python3 runner.py user=USERNAME interfaces=cli writer/llm=transformers`.
- To run Panza after a RoSA or LoRA training run, replace `writer/llm=transformers` with `writer/llm=peft` TODO Armand: can we fix this?


@maddox-j maddox-j Nov 6, 2024


Integrate with the inference markdown and resolve the TODO

@@ -0,0 +1,9 @@
email_address: "[email protected]" # Change this to your email address!
username: "abc" # TODO(armand): Use custom resolver to extract username from email address.


Address TODO
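
One possible way to address this TODO, as a minimal sketch rather than the project's actual implementation: derive `username` from `email_address` by taking the local part of the address. The function name `username_from_email` is hypothetical. The wiring shown in the trailing comment assumes the config is loaded through OmegaConf (the `key=value` overrides passed to `runner.py` suggest Hydra, but that is an assumption).

```python
def username_from_email(email_address: str) -> str:
    """Return the local part of an email address (the text before the last '@').

    Hypothetical helper for illustration; not part of the existing codebase.
    """
    local_part, sep, domain = email_address.strip().rpartition("@")
    if not sep or not local_part or not domain:
        raise ValueError(f"not a valid email address: {email_address!r}")
    return local_part


# Assumed wiring, only if the config really is loaded with OmegaConf/Hydra:
#   from omegaconf import OmegaConf
#   OmegaConf.register_new_resolver("username_from_email", username_from_email)
# The YAML entry could then read:
#   username: ${username_from_email:${email_address}}
```

With a resolver like this, users would only need to edit `email_address`, and the hard-coded `username: "abc"` default could be dropped.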

README_panza3.md Outdated
</div>


## TODO: Prerequisites

Clean TODO

4 participants