You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
def tensor_to_vae_latent(t, vae):
video_length = t.shape[1]
t = rearrange(t, "b f c h w -> (b f) c h w")
latents = vae.encode(t).latent_dist.sample()
latents = rearrange(latents, "(b f) c h w -> b f c h w", f=video_length)
latents = latents * vae.config.scaling_factor
return latents
NOTE here SVD used re-scale back, following InstructPix2Pix, channel_concat image do not apply vae's scaling_factor
But unlike InstructPix2Pix, SVD used vae.encode(x).latent_dist.sample() for channel_concat image
However, InstructPix2Pix used vae.encode(x).latent_dist.mode() for channel_concat image
The text was updated successfully, but these errors were encountered:
AnimateLCM/animatelcm_svd/train_svd_lcm.py
Line 1010 in 9a5a314
AnimateLCM/animatelcm_svd/train_svd_lcm.py
Line 1011 in 9a5a314
NOTE here SVD used re-scale back, following InstructPix2Pix, channel_concat image do not apply vae's
scaling_factor
But unlike InstructPix2Pix, SVD used
vae.encode(x).latent_dist.sample()
for channel_concat imageHowever, InstructPix2Pix used
vae.encode(x).latent_dist.mode()
for channel_concat imageThe text was updated successfully, but these errors were encountered: