GPU and Node Affinity
Use specified GPUs
If you have different type of GPUs on the node, use additional environment variables in the worker to control which GPUs to use:
apiVersion: arcadia.kubeagi.k8s.com.cn/v1alpha1
kind: Worker
...
spec:
additionalEnvs:
# the GPU IDs will be ordered by pci bus IDs.
- name: CUDA_DEVICE_ORDER
value: "PCI_BUS_ID"
# specify which GPU(s) to be used
- name: CUDA_VISIBLE_DEVICES
value: "0,3"
creator: admin
description: qwen-7b
displayName: qwen-7b
Schedule on specified Nodes
You can also use nodeAffinity to schedule fastchat worker pod to the specified node(s), for example:
apiVersion: arcadia.kubeagi.k8s.com.cn/v1alpha1
kind: Worker
...
spec:
matchExpressions:
- key: kubernetes.io/hostname
operator: In
values:
- gpu-node-2
creator: admin
description: qwen-7b
displayName: qwen-7b