
Commit

typo
Bob-Chen222 committed Nov 7, 2024
1 parent f7656be commit 8c203ec
Showing 1 changed file with 2 additions and 2 deletions.
4 changes: 2 additions & 2 deletions inference/incr_decoding/incr_decoding.cc
@@ -51,7 +51,7 @@ void parse_input_args(char **argv,
  int &max_tokens_per_prefilling_batch,
  int &max_sequence_length,
  int &max_output_length,
- int &max_kv_cache_size,
+ size_t &max_kv_cache_size,
  int &sampling_seed,
  bool &streaming_cache,
  bool &slo_attainment_early_termination,
@@ -209,7 +209,7 @@ void FlexFlow::top_level_task(Task const *task,
  int max_tokens_per_prefilling_batch = -1;
  int max_sequence_length = 256;
  int max_output_length = 512;
- int max_kv_cache_size = -1; // if -1, then use the default value
+ size_t max_kv_cache_size = 0; // if -1, then use the default value
  RequestManager::DecodingMode decoding_mode =
      RequestManager::INCREMENTAL_DECODING;
  int sampling_seed = 0;
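
The retained comment still mentions -1, but a `size_t` cannot hold -1: the value converts to the largest representable `size_t`. The sketch below illustrates that conversion and one possible way to treat 0 as the "use the default" sentinel instead. The helper `resolve_kv_cache_size` and its `fallback` parameter are hypothetical and are not taken from FlexFlow; how the project actually interprets 0 is not shown in this commit.

```cpp
#include <cstddef>
#include <limits>

// Hypothetical helper (not part of FlexFlow): treat 0 as "no explicit size
// was requested" and substitute a caller-supplied fallback value.
constexpr std::size_t resolve_kv_cache_size(std::size_t requested,
                                            std::size_t fallback) {
  return requested == 0 ? fallback : requested;
}

// Converting -1 to an unsigned type wraps to the maximum value, so a check
// such as `max_kv_cache_size == -1` no longer expresses "unset" clearly.
static_assert(static_cast<std::size_t>(-1) ==
                  std::numeric_limits<std::size_t>::max(),
              "-1 converts to the largest size_t");

// With 0 as the sentinel, the fallback applies only when nothing was set.
static_assert(resolve_kv_cache_size(0, 1024) == 1024,
              "0 selects the fallback");
static_assert(resolve_kv_cache_size(4096, 1024) == 4096,
              "an explicit size is kept");
```

Under this assumption, the inline comment on `max_kv_cache_size` would read "if 0, then use the default value".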
