Model File Management
GPUStack allows admins to download and manage model files.
Add Model File
GPUStack currently supports models from Hugging Face, ModelScope, and local paths. To add model files, navigate to the Model Files page.
Add a Hugging Face Model
- Click the Add Model File button and select Hugging Face from the dropdown.
- Use the search bar in the top left to find a model by name, e.g., Qwen/Qwen2.5-0.5B-Instruct. To search only for GGUF models, check the GGUF checkbox.
- (Optional) For GGUF models, select the desired quantization format from Available Files.
- Select the target worker to download the model file.
- (Optional) Specify a Local Directory to download the model to a custom path instead of the GPUStack cache directory.
- Click the Save button.
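For readers who want to reproduce a similar download outside the UI, the steps above correspond roughly to a filtered repository download. The following is a minimal sketch, not GPUStack's own implementation: it assumes the huggingface_hub package is installed on the worker, and the helper names, the GGUF glob pattern, and the target directory are hypothetical.

```python
from typing import Any, Optional


def build_download_kwargs(
    repo_id: str,
    gguf_pattern: Optional[str] = None,
    local_dir: Optional[str] = None,
) -> dict:
    """Collect keyword arguments for huggingface_hub.snapshot_download."""
    kwargs: dict = {"repo_id": repo_id}
    if gguf_pattern is not None:
        # Restrict the download to files matching one quantization (hypothetical pattern).
        kwargs["allow_patterns"] = [gguf_pattern]
    if local_dir is not None:
        # Download to a custom path instead of the default Hugging Face cache.
        kwargs["local_dir"] = local_dir
    return kwargs


def download_model(
    repo_id: str,
    gguf_pattern: Optional[str] = None,
    local_dir: Optional[str] = None,
) -> Any:
    """Download the repository (or the matching files) and return the local path."""
    # Imported lazily so build_download_kwargs works without the package installed.
    from huggingface_hub import snapshot_download

    return snapshot_download(**build_download_kwargs(repo_id, gguf_pattern, local_dir))
```

Calling download_model("Qwen/Qwen2.5-0.5B-Instruct") would fetch the whole repository into the default cache. Passing a glob such as "*q4_k_m*.gguf" only makes sense for a repository that actually contains GGUF files.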
Add a ModelScope Model
- Click the Add Model File button and select ModelScope from the dropdown.
- Use the search bar in the top left to find a model by name, e.g., Qwen/Qwen2.5-0.5B-Instruct. To search only for GGUF models, check the GGUF checkbox.
- (Optional) For GGUF models, select the desired quantization format from Available Files.
- Select the target worker to download the model file.
- (Optional) Specify a Local Directory to download the model to a custom path instead of the GPUStack cache directory.
- Click the Save button.
Add a Local Path Model
You can add models from a local path. The path can be a directory (e.g., a Hugging Face model folder) or a file (e.g., a GGUF model) located on the worker.
- Click the Add Model File button and select Local Path from the dropdown.
- Enter the Model Path.
- Select the target worker.
- Click the Save button.
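Before entering a Model Path, it can help to confirm on the worker what the path actually points to. The sketch below is an illustrative check, not part of GPUStack: it distinguishes a directory from a single file and uses the four-byte "GGUF" magic at the start of GGUF files to recognize them. The function name and return labels are hypothetical.

```python
from pathlib import Path

GGUF_MAGIC = b"GGUF"  # GGUF files begin with these four bytes


def classify_model_path(path: str) -> str:
    """Return 'directory', 'gguf-file', 'other-file', or 'missing'."""
    p = Path(path)
    if p.is_dir():
        # e.g., a Hugging Face model folder
        return "directory"
    if p.is_file():
        with p.open("rb") as f:
            header = f.read(4)
        return "gguf-file" if header == GGUF_MAGIC else "other-file"
    return "missing"
```

Run it on the target worker itself, since the path must exist on that machine rather than on the machine where the browser is open.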
Retry Download
If a model file download fails, you can retry it:
- Navigate to the Model Files page.
- Locate the model file with an error status.
- Click the ellipsis button in the Operations column and select Retry Download.
- GPUStack will attempt to download the model file again from the specified source.
Deploy Model
Models can be deployed from model files. Because the model file is stored on a specific worker, GPUStack adds a worker selector with the worker-name key so that the deployment is scheduled onto that worker.
- Navigate to the Model Files page.
- Find the model file you want to deploy.
- Click the Deploy button in the Operations column.
- Review or adjust the Name, Replicas, and other deployment parameters.
- Click the Save button.
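To illustrate what a worker selector on the worker-name key implies for scheduling, here is a minimal sketch of label matching. It is an assumption about selector semantics in general (every selector key must match a worker label), not GPUStack's scheduler code. Only the worker-name key comes from the text above; the function names, the other labels, and the worker names are hypothetical.

```python
def selector_for_model_file(worker_name: str) -> dict:
    """Build a selector that pins a deployment to the worker holding the file."""
    return {"worker-name": worker_name}


def matches_selector(worker_labels: dict, selector: dict) -> bool:
    """A worker matches when every selector key has the same value in its labels."""
    return all(worker_labels.get(key) == value for key, value in selector.items())


def eligible_workers(workers: dict, selector: dict) -> list:
    """Return the names of workers whose labels satisfy the selector."""
    return [name for name, labels in workers.items() if matches_selector(labels, selector)]


# Hypothetical cluster: worker name -> labels
workers = {
    "gpu-node-1": {"worker-name": "gpu-node-1", "os": "linux"},
    "gpu-node-2": {"worker-name": "gpu-node-2", "os": "linux"},
}
selected = eligible_workers(workers, selector_for_model_file("gpu-node-2"))
```

With this selector, only the worker that stores the model file remains eligible, which is why the deployment lands where the file already exists.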
Delete Model File
- Navigate to the Model Files page.
- Find the model file you want to delete.
- Click the ellipsis button in the Operations column and select Delete.
- (Optional) Check the Also delete the file from disk option.
- Click the Delete button to confirm.
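If the Also delete the file from disk option was left unchecked, the file presumably remains on the worker and can be removed manually later. The sketch below shows one way to do that, assuming you know the path on the worker. It is not GPUStack's deletion logic, and the function name is hypothetical. It handles both a model directory and a single file.

```python
import shutil
from pathlib import Path


def delete_model_path(path: str) -> bool:
    """Remove a model directory or file; return False if nothing was there."""
    p = Path(path)
    if p.is_dir() and not p.is_symlink():
        # A model folder: remove it together with its contents.
        shutil.rmtree(p)
        return True
    if p.exists() or p.is_symlink():
        # A single file (e.g., a GGUF model) or a symlink: remove just that entry.
        p.unlink()
        return True
    return False
```

Deletion is irreversible, so double-check the path before running this against a shared cache directory.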