NotImplementedError: No operator found for memory_efficient_attention_forward
with inputs: query
#4976
Labels
memory_efficient_attention_forward
with inputs: query
#4976
NotImplementedError: No operator found for
memory_efficient_attention_forward
with inputs: query : shape=(2, 12500, 8, 40) (torch.float16) key : shape=(2, 12500, 8, 40) (torch.float16) value : shape=(2, 12500, 8, 40) (torch.float16) attn_bias : <class 'NoneType'> p : 0.0cutlassF
is not supported because: xFormers wasn't build with CUDA supportflshattF
is not supported because: xFormers wasn't build with CUDA supporttritonflashattF
is not supported because: xFormers wasn't build with CUDA support requires A100 GPUsmallkF
is not supported because: xFormers wasn't build with CUDA support dtype=torch.float16 (supported: {torch.float32}) max(query.shape[-1] != value.shape[-1]) > 32 unsupported embed per head: 40easy-stable-diffusion.ipynb - Colab - Google
I'm working on it here, but it doesn't work.
I'm a beginner.
Please help me
The text was updated successfully, but these errors were encountered: