Skip to content

Commit

Permalink
WOQ: Change small batch threshold from 32 to 16 (#3017)
Browse files Browse the repository at this point in the history
  • Loading branch information
Xia-Weiwen authored Jun 28, 2024
1 parent d6599b5 commit 1d02cee
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion csrc/cpu/aten/kernels/WoqTppKrnl.cpp
Original file line number Diff line number Diff line change
Expand Up @@ -54,7 +54,7 @@ constexpr bool is_sym_quant(const int qw_type) {
#if defined(CPU_CAPABILITY_AVX512_FP16) && defined(COMPILER_PREREQ_MET)

#define QUANT_A_THRESHOLD 30720
#define SMALL_BATCH_THRESHOLD 32
#define SMALL_BATCH_THRESHOLD 16
#define DEQUANT_UPFRONT_THRESHOLD 1024
#define PARALLEL_M_THRESHOLD 128
constexpr long PREFETCH_K_DIST = 64; // TODO(jgong5): do not hard-code
Expand Down

0 comments on commit 1d02cee

Please sign in to comment.