Skip to content

Commit

Permalink
fix halfattention
Browse files Browse the repository at this point in the history
  • Loading branch information
黄宇扬 committed Jul 12, 2024
1 parent f98f3b9 commit 1952607
Showing 1 changed file with 2 additions and 1 deletion.
3 changes: 2 additions & 1 deletion src/devices/cuda/fastllm-cuda.cu
Original file line number Diff line number Diff line change
Expand Up @@ -830,7 +830,8 @@ __device__ void FastllmSoftmaxKernelInner1Func(half *input, half *output, int ch
}
__syncthreads();
float maxV = sdata[0];

__syncthreads();

// 4. 求和
float sum = 0;
for (int i = tid; i < channels; i += THREAD_PER_BLOCK) {
Expand Down

0 comments on commit 1952607

Please sign in to comment.