@davidADSP that container was just for a special pre-release version of transformers before the model came out, hence it copied the wheel from my drive. That support has since been upstreamed into Transformers. IIRC that container doesn't have llama.cpp or ollama, so you would need to check whether those projects support the model. If they do, whether it fits on AGX Orin 64GB depends on how memory-efficient they are when loading the weights, etc. (anecdotally, I was able to run NVLM-72B on AGX Orin with load_in_4bit=True, but 90B is substantially larger).
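To put rough numbers on the 72B vs. 90B comparison, here is a back-of-the-envelope sketch of the weight footprint at 4 bits per parameter. The helper name `weight_footprint_gib` is hypothetical, and the estimate covers weights only: it assumes every parameter is stored at exactly 4 bits and ignores quantization scales, any layers kept in higher precision, activations, and the KV cache. On Jetson the 64GB is also shared with the OS, so the real headroom is smaller than this suggests.

```python
def weight_footprint_gib(num_params_billion: float, bits_per_weight: float) -> float:
    """Approximate size of the model weights alone, in GiB.

    Assumption: every parameter is stored at `bits_per_weight` bits, with no
    quantization metadata and no layers left in higher precision.
    """
    total_bytes = num_params_billion * 1e9 * bits_per_weight / 8
    return total_bytes / 2**30


# Compare the two model sizes mentioned above at 4 bits per weight.
for label, params_billion in [("72B", 72), ("90B", 90)]:
    size = weight_footprint_gib(params_billion, bits_per_weight=4)
    print(f"{label} @ 4-bit: ~{size:.1f} GiB of weights")
```

By this estimate the 90B weights need about 25% more memory than the 72B weights, before counting any runtime overhead, so a tight fit at 72B does not guarantee that 90B will load.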
When building like this:
I get the following error at the last step of the build:
I have pulled and installed the latest version of the jetson-containers repo.