-
Notifications
You must be signed in to change notification settings - Fork 27.6k
Issues: huggingface/transformers
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
How can we use CPU offloading when using AutoModelForCausalLM and THUDM/cogvlm2-llama3-chat-19B
bug
#35751
opened Jan 17, 2025 by
FurkanGozukara
Qwen2VL exhibits significant performance differences under different attention implementations.
bug
#35749
opened Jan 17, 2025 by
masn1310
2 of 4 tasks
pipeline
AttributeError with torch.nn.DataParallel
bug
#35747
opened Jan 17, 2025 by
kerem-coemert
2 of 4 tasks
Audio-Classification pipeline function_to_apply ignores initialized values (possibly generalizes to other classification pipelines)
bug
#35739
opened Jan 16, 2025 by
wilke0818
2 of 4 tasks
Significant Performance Gap Between MaskFormer and Mask2Former Despite Identical Training Code
bug
#35738
opened Jan 16, 2025 by
olmobaldoni
2 of 4 tasks
Audio-Classification Pipeline top_k Documentation mismatch and bug (possibly generalizes to any classification pipelines)
bug
#35736
opened Jan 16, 2025 by
wilke0818
2 of 4 tasks
Regression - Phi3 has graph breaks in 4.48 but not in 4.47.1
bug
#35716
opened Jan 15, 2025 by
kshitij12345
4 tasks
AttributeError in automatic_speech_recognition.py when return_segments and return_timestamps are both True
bug
#35713
opened Jan 15, 2025 by
jdalegonzalez
2 of 4 tasks
Mismatch between General questions about the library
_convert_token_to_id_with_added_voc
and encode
for Llama-3.2 tokenizer
bug
Usage
#35712
opened Jan 15, 2025 by
g-benton
2 of 4 tasks
Add support for MiniMax-Text-01 and MiniMax-VL-01 from MiniMaxAI
New model
#35710
opened Jan 15, 2025 by
geetu040
2 tasks done
Issue with Progressive Generation Using inputs_embeds and past_key_values
bug
Generation
#35707
opened Jan 15, 2025 by
Superbooming
2 of 4 tasks
autocast() got an unexpected keyword argument 'cache_enabled when use trainer.torch_jit_model_eval
bug
#35706
opened Jan 15, 2025 by
Wanguy
2 of 4 tasks
use_liger_kernel requires much more GPU memory during evaluation than training
bug
#35689
opened Jan 14, 2025 by
Smu-Tan
2 of 4 tasks
Some weights of the model checkpoint at /models/DeepSeek-V3_bf16 were not used when initializing DeepseekV3ForCausalLM
bug
#35688
opened Jan 14, 2025 by
Godlovecui
4 tasks
past_key_values cat out of model generate, output appear disorder
bug
Generation
#35684
opened Jan 14, 2025 by
lzlwakeup
2 of 4 tasks
Support LLMs With No Image Placeholder Embedding in LLava-based Models
Feature request
Request for a new feature
Multimodal
VLM
#35683
opened Jan 14, 2025 by
alex-jw-brooks
Improve Guidance for Using DDP in Request for a new feature
examples/pytorch
Feature request
#35667
opened Jan 13, 2025 by
caojiaolong
AttributeError: 'MERTConfig' object has no attribute 'conv_pos_batch_norm'
bug
#35656
opened Jan 13, 2025 by
JacopoMadaluni
2 of 4 tasks
Will Qwen2VL support sequence classification head in the future?
Feature request
Request for a new feature
#35645
opened Jan 13, 2025 by
cv-nlp
tokenizer.decode() and tokenizer.convert_ids_to_tokens() return different results
bug
#35641
opened Jan 12, 2025 by
thangld201
4 tasks
Expected
tensors
and new_tensors
to have the same type but found <class ‘tuple’> and <class ‘torch.Tensor’>
bug
#35640
opened Jan 12, 2025 by
Bruce-Azar-Wayne
4 tasks
Previous Next
ProTip!
Add no:assignee to see everything that’s not assigned.