This repository has been archived by the owner on Aug 30, 2024. It is now read-only.

revert rope parallel
luoyu-intel committed Apr 30, 2024
1 parent a861d4e commit 7dc4dd8
Showing 1 changed file with 3 additions and 4 deletions.
7 changes: 3 additions & 4 deletions neural_speed/core/ne_layers.c
@@ -11493,11 +11493,10 @@ void ne_graph_compute(struct ne_context* ctx, struct ne_cgraph* cgraph) {
       case NE_OP_DIAG_MASK_INF:
       case NE_OP_PADDING_MASK_INF:
       case NE_OP_ROPE:
-        // only first token use parallel
-        if (node->type == NE_TYPE_BTLA)
-          node->n_tasks = 1;
-        else
+        if (node->type != NE_TYPE_BTLA && node->src0->ne[2] > 1)
           node->n_tasks = n_threads;
+        else
+          node->n_tasks = 1;
         break;
       case NE_OP_SOFT_MAX: {
         size_t rows = ne_nrows(node->src0);
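
For readers who want to try the task-count rule in isolation, here is a minimal sketch of the branch in this hunk that tests `node->src0->ne[2] > 1`. The `sketch_type` enum, the `sketch_tensor` struct, and the `rope_n_tasks` helper are hypothetical stand-ins, not the real neural_speed definitions; only the fields the condition reads are modelled.

```c
#include <stdint.h>

/* Hypothetical stand-ins for the real neural_speed types (assumption):
 * only the fields the condition reads are modelled. */
enum sketch_type { SKETCH_TYPE_F32, SKETCH_TYPE_BTLA };

struct sketch_tensor {
  enum sketch_type type;
  int64_t ne[4];               /* dimension sizes */
  struct sketch_tensor* src0;  /* first source operand */
};

/* Hypothetical helper: returns n_threads only when the node is not
 * BTLA-typed and its source has more than one entry along dimension 2;
 * otherwise the node runs as a single task. */
static int rope_n_tasks(const struct sketch_tensor* node, int n_threads) {
  if (node->type != SKETCH_TYPE_BTLA && node->src0->ne[2] > 1) {
    return n_threads;
  }
  return 1;
}
```

With 8 threads, a non-BTLA node whose source has `ne[2] == 16` gets 8 tasks, while the same node with `ne[2] == 1`, or any BTLA-typed node, gets 1.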
