-
-
Notifications
You must be signed in to change notification settings - Fork 873
Issues: axolotl-ai-cloud/axolotl
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Qwen 2.5 Base unable to generate <|im_end|> even after finetuning
bug
Something isn't working
waiting for reporter
#2073
opened Nov 18, 2024 by
MattYoon
6 of 8 tasks
RuntimeError: CUDA error: unknown error
When attempting to fine-tune llama-3 models
bug
#2071
opened Nov 17, 2024 by
cjfreeze
7 of 8 tasks
Deepspeed zero3 + LoRA: RuntimeError: Only Tensors of floating point and complex dtype can require gradients
bug
Something isn't working
waiting on upstream
wip
#2068
opened Nov 16, 2024 by
bursteratom
6 of 8 tasks
Mlflow duplicate logging
bug
Something isn't working
#2063
opened Nov 15, 2024 by
jsh2581
6 of 8 tasks
Axolotl hanging on bench evals with fsdp
bug
Something isn't working
under review
#2058
opened Nov 14, 2024 by
bsc001
6 of 8 tasks
Logging behavior since GA fix
bug
Something isn't working
under review
waiting for reporter
#2004
opened Oct 30, 2024 by
ccdv-ai
6 of 8 tasks
Support for Sequence / Context Parallelism
enhancement
New feature or request
#1972
opened Oct 15, 2024 by
dwzhu-pku
5 tasks done
Flash attention and multipack failing for qwen and mistral
bug
Something isn't working
waiting for reporter
#1966
opened Oct 12, 2024 by
tiger241
6 of 8 tasks
Add resize_token_embeddings feature
enhancement
New feature or request
waiting for reporter
#1965
opened Oct 11, 2024 by
ccdv-ai
5 tasks done
Should New feature or request
under review
tokenizer_legacy
be default as false
?
enhancement
#1955
opened Oct 10, 2024 by
tongyx361
5 tasks done
Llama will not save properly
bug
Something isn't working
waiting for reporter
#1947
opened Oct 6, 2024 by
mfirth-truffle
6 of 8 tasks
Feature Request: Adding dataset deduplication process
enhancement
New feature or request
#1946
opened Oct 5, 2024 by
Weyaxi
5 tasks done
fix_untrained_tokens doesn't work with zero-3
bug
Something isn't working
#1944
opened Oct 4, 2024 by
winglian
6 of 8 tasks
Cannot install on Google Colab
bug
Something isn't working
wip
#1933
opened Sep 27, 2024 by
benjamin-marie
5 of 8 tasks
Using two 8xH100 nodes to train. encounter error bf16 requested, but AMP is not supported on this GPU. Requires Ampere series or above.
bug
Something isn't working
waiting for reporter
#1924
opened Sep 23, 2024 by
michaellin99999
6 of 8 tasks
mistrall small support
enhancement
New feature or request
#1922
opened Sep 21, 2024 by
win4r
5 tasks done
Gemma 2 chat template inserts eos_token after every chat turn
bug
Something isn't working
waiting for reporter
#1921
opened Sep 20, 2024 by
Nero10578
6 of 8 tasks
Different training losses when flash_attention is on/off
bug
Something isn't working
#1918
opened Sep 18, 2024 by
zhangchen-xu
6 of 8 tasks
pretrain doesn't work on json\jsonl
bug
Something isn't working
#1895
opened Sep 5, 2024 by
SicariusSicariiStuff
6 of 8 tasks
Training with a large json dataset (>650K) throw error:pyarrow.lib.ArrowInvalid: offset overflow while concatenating arrays
bug
Something isn't working
#1888
opened Sep 3, 2024 by
bofei5675
6 of 8 tasks
Load existing LORA and continue training it
enhancement
New feature or request
waiting for reporter
#1887
opened Sep 1, 2024 by
Nero10578
5 tasks done
MixLoRA finetuning
enhancement
New feature or request
#1880
opened Aug 28, 2024 by
winglian
5 tasks done
Unable to load ORPO dataset in a *.json file
bug
Something isn't working
#1868
opened Aug 26, 2024 by
SicariusSicariiStuff
6 of 8 tasks
ORPO results in Something isn't working
Cannot flatten integer dtype tensors
bug
#1838
opened Aug 20, 2024 by
maziyarpanahi
6 of 8 tasks
inst chat jinja template does not match prompt format used while training with Something isn't working
conversation: mistral
bug
#1832
opened Aug 18, 2024 by
nyxkrage
6 of 8 tasks
Previous Next
ProTip!
Updated in the last three days: updated:>2024-11-16.