GPUStack
Overview
Quickstart
Installation
Installation Requirements
NVIDIA CUDA
Online Installation
Air-Gapped Installation
AMD ROCm
Online Installation
Air-Gapped Installation
Apple Metal
Ascend CANN
Online Installation
Air-Gapped Installation
Hygon DTK
Online Installation
Air-Gapped Installation
Moore Threads MUSA
Online Installation
Air-Gapped Installation
CPU
Online Installation
Air-Gapped Installation
Installation Script
Uninstallation
Upgrade
User Guide
Playground
Chat
Image
Audio
Embedding
Rerank
Model Management
Model Catalog
Model File Management
API Key Management
User Management
Inference Backends
Pinned Backend Versions
Compatibility Check
OpenAI Compatible APIs
Image Generation APIs
Rerank API
Using Models
Using Large Language Models
Using Vision Language Models
Using Embedding Models
Using Reranker Models
Using Image Generation Models
Recommended Parameters for Image Generation Models
Editing Images
Using Audio Models
Tutorials
Running DeepSeek R1 671B with Distributed vLLM
Performing Distributed Inference Across Workers (llama-box)
Inference On CPUs
Inference with Tool Calling
Running on Copilot+ PCs with Snapdragon X
Integrations
OpenAI Compatible APIs
Integrate with Dify
Integrate with RAGFlow
Architecture
Scheduler
Troubleshooting
FAQ
API Reference
CLI Reference
Start
Chat
Draw
Download Tools
Home