Skip to content

Pull requests: karpathy/minGPT

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Minor spelling in demo.ipynb
#144 opened Nov 21, 2024 by MicahMaphet Loading…
Reallly small correction
#143 opened Oct 9, 2024 by manuelsh Loading…
DEMO: minGPT on tinygrad
#138 opened Apr 28, 2024 by ziliangpeng Loading…
Cs674
#124 opened Nov 16, 2023 by Westen-M Loading…
Added generator repl for using adder model.
#117 opened Jun 30, 2023 by prasad83 Loading…
Rename transformer layers
#116 opened Jun 28, 2023 by rjarun8 Loading…
Adding a requirements.txt file
#109 opened Feb 17, 2023 by kukuquack Loading…
Zero-grad more aggressively to save memory
#106 opened Jan 18, 2023 by cchan Loading…
More consistent variable naming
#103 opened Jan 10, 2023 by vwxyzjn Loading…
Slightly Better README.md
#102 opened Jan 9, 2023 by MohamedAliRashad Loading…
CI for unittests
#101 opened Jan 7, 2023 by ubgk Loading…
fix: add missing dependency in setup.py
#100 opened Jan 7, 2023 by ben-schulz Loading…
Fix typo in bpe.py
#97 opened Jan 6, 2023 by eltociear Loading…
named_parameters does not have to be recursive
#91 opened Nov 2, 2022 by Equim-chan Loading…
Update readme.md
#89 opened Oct 20, 2022 by macleginn Loading…
Update README.md
#88 opened Oct 20, 2022 by macleginn Loading…
Add DataParallel and make Block support DataParallel
#87 opened Sep 10, 2022 by gngdb Loading…
Add dtype support
#86 opened Aug 5, 2022 by younesbelkada Loading…
Refactor for modern 2022 python style and usage
#80 opened Jul 24, 2022 by mattsta Loading…
#71 use config n_head instead of hardcoded 4 heads
#72 opened Apr 25, 2022 by SpeedCoder5 Loading…
Added the condition for test_dataset's presence.
#69 opened Apr 21, 2022 by RohanAwhad Loading…
Add distributed data parallel trainer
#66 opened Mar 17, 2022 by aravindsrinivas Loading…
ProTip! Follow long discussions with comments:>50.