huggingface / transformers Public

Notifications You must be signed in to change notification settings
Fork 27.6k
Star 138k

Code
Issues 989
Pull requests 538
Actions
Projects 1
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Actions
Projects
Security
Insights

Issues: huggingface/transformers

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

989 Open 15,501 Closed

Author

Filter by author

Label

Filter by label

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Milestones

Filter by milestone

Assignee

Filter by who’s assigned

Assigned to nobody

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Issues list

multi-gpu: test_model_parallel_beam_search tests fail with "RuntimeError: Expected all tensors to be on the same device"

#35762 opened Jan 18, 2025 by dvrogozh

How can we use CPU offloading when using AutoModelForCausalLM and THUDM/cogvlm2-llama3-chat-19B bug

#35751 opened Jan 17, 2025 by FurkanGozukara

Qwen2VL exhibits significant performance differences under different attention implementations. bug

#35749 opened Jan 17, 2025 by masn1310

2 of 4 tasks

pipeline AttributeError with torch.nn.DataParallel bug

#35747 opened Jan 17, 2025 by kerem-coemert

2 of 4 tasks

Audio-Classification pipeline function_to_apply ignores initialized values (possibly generalizes to other classification pipelines) bug

#35739 opened Jan 16, 2025 by wilke0818

2 of 4 tasks

Significant Performance Gap Between MaskFormer and Mask2Former Despite Identical Training Code bug

#35738 opened Jan 16, 2025 by olmobaldoni

2 of 4 tasks

Audio-Classification Pipeline top_k Documentation mismatch and bug (possibly generalizes to any classification pipelines) bug

#35736 opened Jan 16, 2025 by wilke0818

2 of 4 tasks

TypeError: 'NoneType' object is not iterable

#35719 opened Jan 15, 2025 by 0xD4rky

Regression - Phi3 has graph breaks in 4.48 but not in 4.47.1 bug

#35716 opened Jan 15, 2025 by kshitij12345

4 tasks

AttributeError in automatic_speech_recognition.py when return_segments and return_timestamps are both True bug

#35713 opened Jan 15, 2025 by jdalegonzalez

2 of 4 tasks

Mismatch between _convert_token_to_id_with_added_voc and encode for Llama-3.2 tokenizer bug Usage

General questions about the library

#35712 opened Jan 15, 2025 by g-benton

2 of 4 tasks

Add support for MiniMax-Text-01 and MiniMax-VL-01 from MiniMaxAI New model

#35710 opened Jan 15, 2025 by geetu040

2 tasks done

Issue with Progressive Generation Using inputs_embeds and past_key_values bug Generation

#35707 opened Jan 15, 2025 by Superbooming

2 of 4 tasks

autocast() got an unexpected keyword argument 'cache_enabled when use trainer.torch_jit_model_eval bug

#35706 opened Jan 15, 2025 by Wanguy

2 of 4 tasks

use_liger_kernel requires much more GPU memory during evaluation than training bug

#35689 opened Jan 14, 2025 by Smu-Tan

2 of 4 tasks

Some weights of the model checkpoint at /models/DeepSeek-V3_bf16 were not used when initializing DeepseekV3ForCausalLM bug

#35688 opened Jan 14, 2025 by Godlovecui

4 tasks

past_key_values cat out of model generate, output appear disorder bug Generation

#35684 opened Jan 14, 2025 by lzlwakeup

2 of 4 tasks

Support LLMs With No Image Placeholder Embedding in LLava-based Models Feature request

Request for a new feature

Multimodal VLM

#35683 opened Jan 14, 2025 by alex-jw-brooks

FA2 support for Aria Flash Attention Multimodal Vision

#35670 opened Jan 13, 2025 by molbap

Improve Guidance for Using DDP in examples/pytorch Feature request

Request for a new feature

#35667 opened Jan 13, 2025 by caojiaolong

RLE of SAM can't handle masks with no change bug

#35664 opened Jan 13, 2025 by MSt-10

2 of 4 tasks

AttributeError: 'MERTConfig' object has no attribute 'conv_pos_batch_norm' bug

#35656 opened Jan 13, 2025 by JacopoMadaluni

2 of 4 tasks

Will Qwen2VL support sequence classification head in the future? Feature request

Request for a new feature

#35645 opened Jan 13, 2025 by cv-nlp

tokenizer.decode() and tokenizer.convert_ids_to_tokens() return different results bug

#35641 opened Jan 12, 2025 by thangld201

4 tasks

Expected tensors and new_tensors to have the same type but found <class ‘tuple’> and <class ‘torch.Tensor’> bug

#35640 opened Jan 12, 2025 by Bruce-Azar-Wayne

4 tasks

Previous 1 2 3 4 5 … 39 40 Next

Previous Next

ProTip! Add no:assignee to see everything that’s not assigned.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly