[WIP] Codebase Refactor #22

maddox-j · 2024-11-04T15:59:23Z

Adding a WIP PR to manage the codebase refactor merge to main

* add init to panza to turn it into a package * add pyproject.toml but no dependencies yet * add the rest of the panzamail dependencies :) * install dependencies based on pyproject.toml instead of raw pip and conda commands

* Add Ollama inference * expose Panza as a web server * add api keys to env variables and check in server * check api key * switch to fastapi to prevent model reloading * Add ollama-backed streaming HTTP server --------- Co-authored-by: Armand Nicolicioiu <[email protected]>

…exing

[Ad[DxxRevert "remove some unused .sh files" This reverts commit 233083e.

…of manual modifications

- Added bug fix for error encountered in json dumps for Message and mboxMessage objects - Added clarification for email and username reqs - Changed wanbd_disabled default to true to track with README

maddox-j · 2024-11-06T13:26:44Z

TEMP_HOW_TO_RUN_INFERENCE.md

Need to rename, and to link back to the original README

maddox-j · 2024-11-06T13:27:20Z

TEMP_HOW_TO_RUN_INFERENCE.md

+
+If running with Ollama, then Ollama needs to be installed from the [web page](https://ollama.com/).
+
+Then, you will need to convert your model into a GGUF file.


is it beneficial to add more support for this?

maddox-j · 2024-11-06T13:27:59Z

README_panza3.md

+
+- To run Panza after a full training run, try something like `CUDA_VISIBLE_DEVICES=0 python3 runner.py user=USERNAME interfaces=cli writer/llm=transformers`.
+- To run Panza after a RoSA or LoRA training run, replace `writer/llm=transformers` with `writer/llm=peft` TODO Armand: can we fix this? 
+


Integrate with the inference markdown+ resolve TODO

maddox-j · 2024-11-06T13:29:08Z

configs/user/default.yaml

@@ -0,0 +1,9 @@
+email_address: "[email protected]"  # Change this to your email address!
+username: "abc"  # TODO(armand): Use custom resolver to extract username from email address.
+


Address TODO

maddox-j · 2024-11-06T13:29:52Z

README_panza3.md

+</div>
+
+
+## TODO: Prerequisites


This commit features a series of updates. 1. Introduction of formatting with Black added through a precommit that contributers should install. Instructions to do so have been added so that if PRs are created, all code is in same formatting. 2. Formatting code with Black. 3. Removal of debug print statements. 4. Addressing bug with n_proc > in datasets.map with HF

…nto jen/eval-refactor

shawseanyang and others added 30 commits August 16, 2024 21:11

Restructure as python package (#19)

52099b6

* add init to panza to turn it into a package * add pyproject.toml but no dependencies yet * add the rest of the panzamail dependencies :) * install dependencies based on pyproject.toml instead of raw pip and conda commands

Add Ollama inference

053dc09

Add black line length configuration

498fdc1

Add new Panza src path

aec8d50

Create class interfaces

39d37e2

Set up unit testing

769277c

web hosting

42acc78

implement ollama llm

081bf94

Add FAISS retriever and update Document interface to preepare for ind…

de2fe12

…exing

Fix missing method in retriever interface

32a8a39

Make thread and past_messages optional in EmailInstruction

094fa24

Add email prompt builder

c9ea456

Add local transformers inference

6a9897e

Add Peft models and conditional imports

30b3dc1

Add Panza Writer

1d73aff

Add support to return full prompt from writer

c648f98

Set corresponding retriever document type in prompt builder

09f08e4

Remove debugging print from prompting utils

a3c6ac6

Add Hydra config-based runner for Panza writer

6b67156

rename ollama_llm.py to ollama.py

cd9b041

add type annotations to OllamaLLM

7e3ee43

add some more type annotations to OllamaLLM

8b1ef60

check installation for OllamaLLM

73b37be

rename test_llm.py to test_local_llm.py

90d59d1

add pytest to dev dependencies

f9ddb8f

add sampling_params to super() init call

af88a28

add unit tests for ollama_llm.py

28dce17

black formatting

b8fba39

fix types

5f05228

Eugenia Iofinova and others added 7 commits October 28, 2024 12:48

slight refactor of runner.py

ca4b690

remove some unused .sh files

233083e

qq

6f94379

[Ad[DxxRevert "remove some unused .sh files" This reverts commit 233083e.

make the first part of the data preparation script optional (in case …

8ce3c4a

…of manual modifications

Edits and Bug Fixes

c1f36a8

- Added bug fix for error encountered in json dumps for Message and mboxMessage objects - Added clarification for email and username reqs - Changed wanbd_disabled default to true to track with README

Add additional clarification on username importance

bacbdf7

update data preparation

6be5813

maddox-j commented Nov 6, 2024

View reviewed changes

TEMP_HOW_TO_RUN_INFERENCE.md

Copy link

Contributor Author

maddox-j Nov 6, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Need to rename, and to link back to the original README

maddox-j commented Nov 6, 2024

View reviewed changes

README_panza3.md Outdated

</div>

## TODO: Prerequisites

Copy link

Contributor Author

maddox-j Nov 6, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Clean TODO

Andrej Jovanovic and others added 18 commits November 8, 2024 17:57

Fix function address

c13aa07

Clean up code TODOs and revert to defaults

fee7cbd

move top-level README to default location

99f5c7c

update the scripts/ readme

1238d39

remove useless assert

c0a94a3

Merge branch 'jen/eval-refactor' of github.com:IST-DASLab/PanzaMail i…

99a0b68

…nto jen/eval-refactor

Merge changes.

3e2203c

Once again, try to centralize the main README.

1e62597

Update the README

71a85c9

Update README.md remove resolved TODO

73308a0

update hyperparameter tuning guide

e5a9e44

Merge branch 'jen/eval-refactor' of github.com:IST-DASLab/PanzaMail i…

49258b9

…nto jen/eval-refactor

Refactor panza3 -> panza

b3bc00f

Clear ollama and web use-case

be76c39

Update README.md remove confusing period.

677dd8a

Update README.md Add instructions for quantized training

e1e8e6d

correct README for quantized training

525d0e3

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[WIP] Codebase Refactor #22

[WIP] Codebase Refactor #22

maddox-j commented Nov 4, 2024

maddox-j Nov 6, 2024

maddox-j Nov 6, 2024

maddox-j Nov 6, 2024 •

edited

Loading

maddox-j Nov 6, 2024

maddox-j Nov 6, 2024


		If running with Ollama, then Ollama needs to be installed from the [web page](https://ollama.com/).

		Then, you will need to convert your model into a GGUF file.


		- To run Panza after a full training run, try something like `CUDA_VISIBLE_DEVICES=0 python3 runner.py user=USERNAME interfaces=cli writer/llm=transformers`.
		- To run Panza after a RoSA or LoRA training run, replace `writer/llm=transformers` with `writer/llm=peft` TODO Armand: can we fix this?

		@@ -0,0 +1,9 @@
		email_address: "[email protected]" # Change this to your email address!
		username: "abc" # TODO(armand): Use custom resolver to extract username from email address.

[WIP] Codebase Refactor #22

Are you sure you want to change the base?

[WIP] Codebase Refactor #22

Conversation

maddox-j commented Nov 4, 2024

maddox-j Nov 6, 2024

Choose a reason for hiding this comment

maddox-j Nov 6, 2024

Choose a reason for hiding this comment

maddox-j Nov 6, 2024 • edited Loading

Choose a reason for hiding this comment

maddox-j Nov 6, 2024

Choose a reason for hiding this comment

maddox-j Nov 6, 2024

Choose a reason for hiding this comment

maddox-j Nov 6, 2024 •

edited

Loading