Model File Management
GPUStack allows admins to download and manage model files.
Add Model File
GPUStack currently supports models from Hugging Face, ModelScope, and local paths. To add model files, navigate to the Model Files page.
Add a Hugging Face Model
- Click the
Add Model Filebutton and selectHugging Facefrom the dropdown. -
Use the search bar in the top left to find a model by name, e.g.,
Qwen/Qwen2.5-0.5B-Instruct. To search only for GGUF models, check theGGUFcheckbox. -
(Optional) For GGUF models, select the desired quantization format from
Available Files. - Select the target worker to download the model file.
- (Optional) Specify a
Local Directoryto download the model to a custom path instead of the GPUStack cache directory. - Click the
Savebutton.
Add a ModelScope Model
- Click the
Add Model Filebutton and selectModelScopefrom the dropdown. - Use the search bar in the top left to find a model by name, e.g.,
Qwen/Qwen2.5-0.5B-Instruct. To search only for GGUF models, check theGGUFcheckbox. - (Optional) For GGUF models, select the desired quantization format from
Available Files. - Select the target worker to download the model file.
- (Optional) Specify a
Local Directoryto download the model to a custom path instead of the GPUStack cache directory. - Click the
Savebutton.
Add a Local Path Model
You can add models from a local path. The path can be a directory (e.g., a Hugging Face model folder) or a file (e.g., a GGUF model) located on the worker.
- Click the
Add Model Filebutton and selectLocal Pathfrom the dropdown. - Enter the
Model Path. - Select the target worker.
- Click the
Savebutton.
Retry Download
If a model file download fails, you can retry it:
- Navigate to the
Model Filespage. - Locate the model file with an error status.
- Click the ellipsis button in the
Operationscolumn and selectRetry Download. - GPUStack will attempt to download the model file again from the specified source.
Deploy Model
Models can be deployed from model files. Since the model is stored on a specific worker, GPUStack will add a worker selector using the worker-name key to ensure proper scheduling.
- Navigate to the
Model Filespage. - Find the model file you want to deploy.
- Click the
Deploybutton in theOperationscolumn. - Review or adjust the
Name,Replicas, and other deployment parameters. - Click the
Savebutton.
Delete Model File
- Navigate to the
Model Filespage. - Find the model file you want to delete.
- Click the ellipsis button in the
Operationscolumn and selectDelete. - (Optional) Check the
Also delete the file from diskoption. - Click the
Deletebutton to confirm.