Add Push to Hub functionnality to Model and Pipeline #1699

kamilakesbi · 2024-04-29T14:16:49Z

I've started working on adding a push_to_hub method to both Model and Pipeline classes. It will hopefully help users push their custom pyannote speaker-segmentation and speaker-embedding models to the Hugging Face Hub, and use them within custom speaker-diarization pipelines.

In this PR, I've added:

1. A `push_to_hub` method to the base Model class:

The method is compatible with both the pyannote PyanNet segmentation model and WeSpeakerResNet34speaker embedding model. It will:

Save the state dict in a pytorch_model.bin file.
Write a config.yaml file similar to the one of pyannote/segmentation-3.0 or pyannote/wespeaker-voxceleb-resnet34-LM.
Write a minimal Readme file (which we can work on together), and add appropriate tags and licence.

I've tested the method using the following scripts:

Segmentation Model:

from pyannote.audio import Model

segmentation_model = Model.from_pretrained("pyannote/segmentation-3.0")
segmentation_model.push_to_hub('kamilakesbi/speaker-segmentation-test')

Here is the result :)

Note: I've used the diarizers library here to first load a fine-tuned speaker segmentation model from the Hugging Face Hub, convert it to a pyannote format, and push it to the Hub.

Speaker Embedding Model:

from pyannote.audio import Model

speaker_embedding_model = Model.from_pretrained('pyannote/wespeaker-voxceleb-resnet34-LM')
speaker_embedding_model.push_to_hub('kamilakesbi/speaker-embedding-test')

Here is the result :)

2. A `push_to_hub` method to the base Pipeline class:

Here, it will generate the config file associated to the pipeline, modify the embedding model and segmentation model by using specified ones from the Hub, and push the updated config file to the hub.
I added the possibility to push the model checkpoints associated with the pipeline or just the pipeline config file with pointers to the model's hub repositories.
It will also push a minimal readme with tags and licence (again, we can work on it to customize the output).

The method can be used like this:

from pyannote.audio import Pipeline

pipeline = Pipeline.from_pretrained('pyannote/speaker-diarization-3.1')
pipeline.embedding = 'kamilakesbi/speaker-segmentation-test'
pipeline.segmentation = 'kamilakesbi/speaker-segmentation-test'
pipeline.push_to_hub('kamilakesbi/spd_pipeline_test')

We can also push the model's checkpoints using:

pipeline.push_to_hub('kamilakesbi/spd_pipeline_test', save_checkpoints=True)

Note that this is still a work in progress :) I can make changes to the code and adapt it to pyannote's needs!

Hope that this PR will be useful to pyannote.

sanchit-gandhi

General functionality looks good! Left some design thoughts on how we can improve ease-of-use (e.g. saving the embedding/segmentation models when we push the pipeline)

pyannote/audio/core/model.py

pyannote/audio/core/pipeline.py

sanchit-gandhi · 2024-04-30T11:06:15Z

pyannote/audio/core/pipeline.py

+                segmentation_model = self.segmentation_model
+
+            # Modify the config with new segmentation and embedding models:
+            config["pipeline"]["params"]["embedding"] = embedding_model


As discussed offline: an elegant solution would be to save both the embedding and segmentation models to subfloders in the repo (embedding and segmentation respectively), and then load the weights from these subfolders when we call .from_pretrained

Your repo structure for kamilakesbi/spd_pipeline_test could look something like the following:

├── config.yaml <- Top-level pipeline config ├── embedding <- Subfolder for the embedding model | ├── config.yaml | ├── pytorch_model.bin ├── segmentation <- Subfolder for the segmentation model | ├── config.yaml | ├── pytorch_model.bin

And your top-level yaml file could have an extra entry:

embedding: kamilakesbi/spd_pipeline_test embedding_subfolder: embedding ... segmentation: kamilakesbi/spd_pipeline_test segmentation_subfolder: segmentation

Note that this would require updating .from_pretrained to handle this extra subfolder logic

I handled this differently:

If we want to save the checkpoints, we add a save_checkpoints=True parameter to pipeline.push_to_hub. We would then get a repo structure like the one you proposed @sanchit-gandhi, but the top yaml file would look like this:

checkpoints: True params: - embedding: 'kamilakesbi/speaker-embedding-test' - embedding: 'kamilakesbi/speaker-segmentation-test'

If we don't want to store checkpoints on the hub, then we need pointers to the segmentation and embedding models on the hub. In this case the config file would look like this:

checkpoints: False params: + embedding: 'kamilakesbi/speaker-embedding-test' + embedding: 'kamilakesbi/speaker-segmentation-test'

As discussed offline, saving all sub-models means the model repo on the Hub is fully portable -> users can clone the repository and have all sub-models available to them locally

This is the design that was adopted for diffusers pipelines and has worked very well

Thus, we'll assume that any new checkpoints being pushed will follow this new repo structure, with an exception for current pipelines on the Hub that leverage components from multiple repositories

pyannote/audio/core/model.py

pyannote/audio/core/pipeline.py

sanchit-gandhi

Looks great! Just a few suggestions regarding the save structure. Can we add relevant tests as well? (both for the model and the pipeline)

pyannote/audio/core/model.py

sanchit-gandhi · 2024-05-22T14:41:55Z

pyannote/audio/core/model.py

+            repo_type="model",
+        )
+
+        model_type = str(type(self)).split("'")[1].split(".")[-1]


Is there not a model attribute or config param we can use to get this in a more robust way?

Not that I'm aware of... but it would be great!

pyannote/audio/core/model.py

sanchit-gandhi · 2024-05-22T16:12:41Z

pyannote/audio/core/pipeline.py

+                segmentation_model = self.segmentation_model
+
+            # Modify the config with new segmentation and embedding models:
+            config["pipeline"]["params"]["embedding"] = embedding_model


As discussed offline, saving all sub-models means the model repo on the Hub is fully portable -> users can clone the repository and have all sub-models available to them locally

This is the design that was adopted for diffusers pipelines and has worked very well

Thus, we'll assume that any new checkpoints being pushed will follow this new repo structure, with an exception for current pipelines on the Hub that leverage components from multiple repositories

kamilakesbi · 2024-05-28T09:31:43Z

Hi @hbredin,

It would be nice if you had time to do a review on this PR so that we can iterate on it :)

Thank you!

hbredin · 2024-05-28T12:09:57Z

Apologies, I will eventually have a look it but I really don't have the bandwidth right now.

sanchit-gandhi · 2024-06-12T11:07:55Z

Hey @hbredin! No rush on reviewing this PR, whenever you get the chance we'd love to hear your feedback on the proposed changes! Otherwise, is there another maintainer who could give a quick review in the meantime?

hbredin · 2024-06-14T08:23:46Z

Hey @sanchit-gandhi. I understand the frustration but I am actually the sole maintainer and also have many other hats. I am doing my best but have other priorities right now (like the upcoming 3.3.0 release with speech separation support).

hbredin · 2024-11-25T21:25:52Z

pyannote/audio/core/pipeline.py

+            # If hub repo contains subfolders, load models and pipeline:
+            embedding = Model.from_pretrained(model_id, subfolder="embedding")
+            segmentation = Model.from_pretrained(model_id, subfolder="segmentation")
+            pipeline = Klass(**params)
+            pipeline.embedding = embedding
+            pipeline.segmentation_model = segmentation


This seem way too specific to pyannote/speaker-diarization-3.1.
I'd like to find a better (= more generic) way

We could do something like preprend subfolders by @model (or anything that makes sense) to indicate to Pipeline.from_pretrained that a model should be loaded from corresponding subfolders.

pipeline: name: pyannote.audio.pipelines.SpeakerDiarization params: clustering: AgglomerativeClustering embedding: @model/embedding segmentation: @model/segmentation

Similarly, we could use @pipeline/ to load sub-pipelines, and later @whatever if we ever want to add new pyannote stuff (I already have one in mind that I cannot really talk about right now).

kamilakesbi added 12 commits April 26, 2024 13:53

add push_to_hub Model

dfa3ead

fix pytorch saving

266539b

add target in config yaml

126fdfb

copy push_to_hub transformers logic

3bf0a2e

add tags

431a4e3

push_to_hub speaker embedding

be2a934

up

98b91fa

up

79a7d78

add push_to_hub to pipeline

7dfb2ca

small changes

934f409

small changes

df246a5

small doc fix

4d28ccb

sanchit-gandhi reviewed Apr 30, 2024

View reviewed changes

kamilakesbi added 5 commits April 30, 2024 15:01

apply review suggestions

b7ecdb3

add save_pretrained

1c0fb04

generate pipeline config

51ff5b6

add save_checkpoints

4bb0a2a

load model checkpoints in pipeline.from_pretrained

a8b9074

sanchit-gandhi reviewed May 22, 2024

View reviewed changes

apply review suggestions

756a266

kamilakesbi changed the title ~~[Work In Progress] - Add Push to Hub functionnality to Model and Pipeline~~ Add Push to Hub functionnality to Model and Pipeline May 23, 2024

Merge branch 'develop' into push_to_hub

02ee03d

Merge branch 'develop' into push_to_hub

c8c701a

hbredin reviewed Nov 25, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Push to Hub functionnality to Model and Pipeline #1699

Add Push to Hub functionnality to Model and Pipeline #1699

kamilakesbi commented Apr 29, 2024 •

edited

Loading

sanchit-gandhi left a comment

sanchit-gandhi Apr 30, 2024 •

edited

Loading

kamilakesbi May 21, 2024

sanchit-gandhi May 22, 2024

sanchit-gandhi left a comment

sanchit-gandhi May 22, 2024

kamilakesbi May 23, 2024

sanchit-gandhi May 22, 2024

kamilakesbi commented May 28, 2024

hbredin commented May 28, 2024

sanchit-gandhi commented Jun 12, 2024

hbredin commented Jun 14, 2024

hbredin Nov 25, 2024

Add Push to Hub functionnality to Model and Pipeline #1699

Are you sure you want to change the base?

Add Push to Hub functionnality to Model and Pipeline #1699

Conversation

kamilakesbi commented Apr 29, 2024 • edited Loading

1. A push_to_hub method to the base Model class:

2. A push_to_hub method to the base Pipeline class:

sanchit-gandhi left a comment

Choose a reason for hiding this comment

sanchit-gandhi Apr 30, 2024 • edited Loading

Choose a reason for hiding this comment

kamilakesbi May 21, 2024

Choose a reason for hiding this comment

sanchit-gandhi May 22, 2024

Choose a reason for hiding this comment

sanchit-gandhi left a comment

Choose a reason for hiding this comment

sanchit-gandhi May 22, 2024

Choose a reason for hiding this comment

kamilakesbi May 23, 2024

Choose a reason for hiding this comment

sanchit-gandhi May 22, 2024

Choose a reason for hiding this comment

kamilakesbi commented May 28, 2024

hbredin commented May 28, 2024

sanchit-gandhi commented Jun 12, 2024

hbredin commented Jun 14, 2024

hbredin Nov 25, 2024

Choose a reason for hiding this comment

kamilakesbi commented Apr 29, 2024 •

edited

Loading

1. A `push_to_hub` method to the base Model class:

2. A `push_to_hub` method to the base Pipeline class:

sanchit-gandhi Apr 30, 2024 •

edited

Loading