Skip to content

Commit

Permalink
fix
Browse files Browse the repository at this point in the history
  • Loading branch information
goliaro committed Oct 10, 2024
1 parent 6da4f4a commit dbd4cf1
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion src/ops/inc_multihead_self_attention.cu
Original file line number Diff line number Diff line change
Expand Up @@ -126,7 +126,7 @@ void compute_attention_kernel_prompt(IncMultiHeadSelfAttentionMeta *m,
int num_new_tokens = bc->requestsInfo[i].num_tokens_in_batch;
int total_tokens = bc->requestsInfo[i].first_token_depth_in_request +
bc->requestsInfo[i].num_tokens_in_batch;
int max_peft_tokens = bc->requestsInfo[i].max_sequence_length;
int max_peft_tokens = bc->requestsInfo[i].max_length;
// Copy query to m->query_activation_buffer if we need to compute
// PEFT backward
if (bc->requestsInfo[i].peft_bwd) {
Expand Down

0 comments on commit dbd4cf1

Please sign in to comment.