Skip to content

Commit

Permalink
fix(llama.cpp): bump upstream fix for starcoder model on cuda
Browse files Browse the repository at this point in the history
  • Loading branch information
wsxiaoys committed Oct 28, 2023
1 parent 3151d91 commit 4442226
Show file tree
Hide file tree
Showing 2 changed files with 2 additions and 2 deletions.
2 changes: 1 addition & 1 deletion crates/llama-cpp-bindings/src/engine.cc
Original file line number Diff line number Diff line change
Expand Up @@ -106,7 +106,7 @@ std::unique_ptr<TextInferenceEngine> create_engine(bool use_gpu, rust::Str model
static BackendInitializer initializer;

llama_model_params model_params = llama_model_default_params();
model_params.n_gpu_layers = use_gpu ? 1 : 0;
model_params.n_gpu_layers = use_gpu ? 9999 : 0;
llama_model* model = llama_load_model_from_file(std::string(model_path).c_str(), model_params);

if (!model) {
Expand Down

0 comments on commit 4442226

Please sign in to comment.