GitHub - spdk/spdk-csi: CSI driver to bring SPDK to Kubernetes storage through NVMe-oF or iSCSI. Supports dynamic volume provisioning and enables Pods to use SPDK storage transparently. (original) (raw)
SPDK CSI
About
This repo contains SPDK CSI (Container Storage Interface) plugin for Kubernetes.
SPDK CSI plugin brings SPDK to Kubernetes. It provisions SPDK logical volumes on storage node dynamically and enables Pods to access SPDK storage backend through NVMe-oF or iSCSI.
Please see SPDK CSI Design Documentfor detailed introduction.
Supported platforms
This plugin conforms to CSI Spec v1.7.0. It is currently developed and tested only on Kubernetes.
This plugin supports x86_64
and Arm64
architectures.
Project status
Status: Beta
Prerequisites
SPDK-CSI is currently developed and tested with Go 1.19
, Docker 20.10
and Kubernetes 1.25.0
on Ubuntu 22.04
.
Minimal requirement: Go 1.19+, Docker 18.03+ and Kubernetes 1.20+.
Setup
Build
$ make all
Build targets spdkcsi, lint, test.
$ make spdkcsi
Build SPDK-CSI binary _out/spdkcsi
.
$ make lint
Lint code and scripts.
$ make golangci
Install golangci-lint and perform various go code static checks.
$ make yamllint
Lint yaml files if yamllint is installed. Requires yamllint 1.10+.
$ make test
Verify go modules and run unit tests. Requires SPDK target and JsonRPC HTTP proxy running on localhost. See deploy/spdk/README for details.
$ make e2e-test
Verify core features through Kubernetes end-to-end (e2e) test.
$ make image
Build SPDK-CSI docker image.
Parameters
spdkcsi
executable accepts several command line parameters.
Parameter | Type | Description | Default |
---|---|---|---|
--controller | - | enable controller service | - |
--node | - | enable node service | - |
--endpoint | string | communicate with sidecars | /tmp/spdkcsi.sock |
--drivername | string | driver name | csi.spdk.io |
--nodeid | string | node id | - |
Usage
Example deployment files can be found in deploy/kubernetes directory.
File Name | Usage |
---|---|
storageclass.yaml | StorageClass of provisioner "csi.spdk.io" |
controller.yaml | StatefulSet running CSI Controller service |
node.yaml | DaemonSet running CSI Node service |
controller-rbac.yaml | Access control for CSI Controller service |
node-rbac.yaml | Access control for CSI Node service |
config-map.yaml | SPDK storage cluster configurations |
secret.yaml | SPDK storage cluster access tokens |
snapshotclass.yaml | SnapshotClass of provisioner "csi.spdk.io" |
driver.yaml | CSIDriver object |
NOTE:
Below example is a simplest test system running in a single host or VM. No NVMe device is required, memory based bdev is used instead. docs/multi-node.md introduces how to deploy SPDKCSI on multiple nodes with NVMe devices.
Prepare SPDK storage node
Follow deploy/spdk/README to deploy SPDK storage service on localhost.
Deploy SPDKCSI services
- Launch Minikube test cluster.
$ cd scripts $ sudo ./minikube.sh up
Create kubectl shortcut (assume kubectl version 1.25.0)
$ sudo ln -s /var/lib/minikube/binaries/v1.25.0/kubectl /usr/local/bin/kubectl
Wait for Kubernetes ready
$ kubectl get pods --all-namespaces NAMESPACE NAME READY STATUS RESTARTS AGE kube-system coredns-6955765f44-dlb88 1/1 Running 0 81s ...... ...... kube-system kube-apiserver-spdkcsi-dev 1/1 Running 0 67s ...... ......
- Install snapshot controller and CRD
The snapshot controller functions with all CSI drivers in a cluster.
Hence, you can skip it if your kubernetes cluster already has a snapshot controller.
SNAPSHOT_VERSION="v6.2.2" ./scripts/install-snapshot.sh install
Check status
$ kubectl get pod snapshot-controller-0 NAME READY STATUS RESTARTS AGE snapshot-controller-0 1/1 Running 0 6m14s
- Deploy SPDK-CSI services
$ cd deploy/kubernetes $ ./deploy.sh
Check status
$ kubectl get pods NAME READY STATUS RESTARTS AGE spdkcsi-controller-0 3/3 Running 0 3m16s spdkcsi-node-lzvg5 2/2 Running 0 3m16s
- Deploy test pod
$ cd deploy/kubernetes $ kubectl apply -f testpod.yaml
Check status
$ kubectl get pv NAME CAPACITY ... STORAGECLASS REASON AGE persistentvolume/pvc-... 256Mi ... spdkcsi-sc 43s
$ kubectl get pvc NAME ... CAPACITY ACCESS MODES STORAGECLASS AGE persistentvolumeclaim/spdkcsi-pvc ... 256Mi RWO spdkcsi-sc 44s
$ kubectl get pods NAME READY STATUS RESTARTS AGE spdkcsi-test 1/1 Running 0 1m31s
Check attached spdk volume in test pod
$ kubectl exec spdkcsi-test mount | grep spdkcsi /dev/disk/by-id/nvme-..._spdkcsi-sn on /spdkvol type ext4 (rw,relatime)
- Deploy PVC snapshot
Create snapshot of the bound PVC
$ cd deploy/kubernetes $ kubectl apply -f snapshot.yaml
Get details about the snapshot
$ kubectl get volumesnapshot spdk-snapshot NAME READYTOUSE SOURCEPVC ... SNAPSHOTCLASS AGE spdk-snapshot false spdkcsi-pvc ... csi-spdk-snapclass 29s
Get details about the volumesnapshotcontent
kubectl get volumesnapshotcontent $ kubectl get volumesnapshotcontent NAME ... READYTOUSE RESTORESIZE DELETIONPOLICY DRIVER VOLUMESNAPSHOTCLASS VOLUMESNAPSHOT AGE snapcontent-... true 268435456 Delete csi.spdk.io csi-spdk-snapclass spdk-snapshot 29s
Teardown
- Delete PVC snapshot
cd deploy/kubernetes kubectl delete -f snapshot.yaml
- Delete test pod
$ cd deploy/kubernetes $ kubectl delete -f testpod.yaml
- Delete SPDK-CSI services
$ cd deploy/kubernetes $ ./deploy.sh teardown
- Delete snapshot controller and CRD
SNAPSHOT_VERSION="v6.2.2" ./scripts/install-snapshot.sh cleanup
- Teardown Kubernetes test cluster
$ cd scripts $ sudo ./minikube.sh clean
xPU
An Infrastructure Processing Unit (IPU), commonly referred to xPU in subsequent terminologies, is an accelerator that is attached to the PCIe bus. It communicates with a storage server through its own network interface in reality, which will help offload storage-related network traffic from local CPU cores and network interfaces.
Currently, two backends are supported for xPU: Storage Management Agent (SMA) and Open Programmable Infrastructure (OPI).
From the host's point of view, the xPU will look like a hot-pluggable local storage device. For both SMA and OPI, the xPU needs to receive remote storage information to enable it to connect to the remote target. This information is shared with the CSI node driver via CSI volume parameters that were set at the volume creation time by the CSI controller driver. The CSI node driver passes the remote storage information to xPU. For SMA, there will be a SMA gRPC server on the xPU node. For OPI, there will be a OPI-SPDK-Bridge gRPC server on the xPU node. They will be able to receive these configurations in a standard way (protobufs + gRPC).
The diagram below provides a high-level view of the architecture:
[Kubernetes nodes] | [SPDK storage nodes]
|
+---[K8S-Pod]----+ | +---[xPU-Node]---+
|--CSI-Node-Pod--| | |---Controller---|
| | | | |
| spdk-csi | | | |
| node driver---->--------->-SMA/OPI->-spdk->---+
+----------------+ | | | |
| +----------------+ |
+---[K8S-Pod]----+ | |
|-CSI-Controller-| | +-[Storage-Node]-+ |
| | | |-----Target-----| |
| spdk-csi | | | | |
| driver | | | | |
| controller----->--------->---->-spdk-<----<---+
| | | | |
+----------------+ | +----------------+
Usage for xPU
The CSI-Node-Pod's configuration file for xPU is dynamically attached by Kubernetes using a config map as below.
File Name | Usage |
---|---|
deploy/kubernetes/nodeserver-config-map.yaml | SPDK xPU cluster configurations |
You can configure it using the parameters mentioned below. Multiple xPU nodes are supported, and each node's configuration includes the name, targetType, and targetAddr fields. The value of "targetType" can be one of "xpu-sma-nvmftcp", "xpu-sma-virtioblk", or "xpu-sma-nvme", and "targetAddr" is the URL used to connect to the SMA server on each cluster node.
Here is an example of the deploy/kubernetes/nodeserver-config-map.yaml file:
nodeserver-config.json: |-
{
"xpuList": [
{
"name": "xPU0",
"targetType": "xpu-sma-nvme",
"targetAddr":"127.0.0.1:5114"
}
]
}
Prerequisites for xPU
On a Fedora-based system, you will need grpcio-tools and protobuf installed.
Workflow for xPU
- The instance of SPDK Json RPC is running on both the xPU node and the storage pool server.
- For SMA, the SMA gRPC server is running on xPU node. For OPI, the OPI-SPDK-Bridge gRPC server is running on xPU node.
- The CSI controller driver is configured with the storage pool details. On the other hand, the CSI node driver is configured with the details of the SMA/OPI gRPC server.
- When the CSI controller driver receives a new volume request from the Kubernetes volume plugin, it initiates a JSON RPC request to the SPDK storage node. This request triggers the creation and export of an NVMe volume on the SPDK storage node. The volume connection details are then stored in the CSI volume parameters for further reference.
- Upon receiving a stage volume request along with the volume parameters, the CSI node driver comes into action. The CSI node plugin initiates the appropriate SMA/OPI gRPC calls to the server running on the xPU node.
- Subsequently, the CSI node driver establishes a connection to the remote NVMe device using the volume parameters obtained earlier. With the connection established, the CSI node driver proceeds to create a local volume with the appropriate volume type. Furthermore, it attaches the volume to the remote NVMe device, effectively exposing it as a local device on the host for further usage.
Prepare SPDK xPU node
Follow deploy/spdk/README to deploy SPDK SMA service on localhost.
For the rest steps, you can follow the same steps in "Prepare SPDK storage node", "Deploy SPDKCSI services", and "Teardown" above to prepare the SPDK storage node, deploy SPDKCSI driver, and tear down everything.
Communication and Contribution
Please join SPDK community for communication and contribution.
Project progress is tracked in Trello board.