model self-attention hardcoded to 4 heads #71

SpeedCoder5 · 2022-04-25T19:53:34Z

The self attention block is hard-coded to 4 heads. Suggest using n_heads from config instead.

SpeedCoder5 · 2022-04-25T21:07:31Z

submitted PR #72

SpeedCoder5 closed this as completed Apr 25, 2022

SpeedCoder5 reopened this Apr 25, 2022

SpeedCoder5 added a commit to SpeedCoder5/minGPT that referenced this issue Apr 25, 2022

karpathy#71 use config n_head instead of hardcoded 4 heads

0ac226d

SpeedCoder5 closed this as completed Apr 25, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

model self-attention hardcoded to 4 heads #71

model self-attention hardcoded to 4 heads #71

SpeedCoder5 commented Apr 25, 2022

SpeedCoder5 commented Apr 25, 2022

model self-attention hardcoded to 4 heads #71

model self-attention hardcoded to 4 heads #71

Comments

SpeedCoder5 commented Apr 25, 2022

SpeedCoder5 commented Apr 25, 2022