/smc

Shared Memory Communication (SMC) on Linux

Primary LanguageHTMLMIT LicenseMIT

smc

Documentation, notes, and scripts for Shared Memory Communication (SMC) in Linux.

Kernel Support

Make sure your kernel is configured with CONFIG_SMC. If your kernel provides the /proc/config.gz file, you can use it to check if your kernel has been compiled with SMC support with:

$ zgrep "CONFIG_SMC" /proc/config.gz

or you can use the smc.sh script in this folder with:

$ ./smc.sh check

If SMC support is configured as a module, you need to load the smc kernel module as root with:

# modprobe smc

or with the smc.sh script:

# ./smc.sh load

Devices

SMC requires, depending on the SMC variant, a different set of devices:

SMC-R uses three devices: a handshake device, a RoCE IB device, and a RoCE net device. The handshake device is a network interface (e.g., eth0) used for the initial TCP connection setup and CLC handshake (see below) between two SMC peers. For example, a SMC server socket application listens on this device. The RoCE IB device is a RDMA over Converged Ethernet (RoCE) Infiniband (IB) device used to transfer the SMC-R traffic after the SMC connection has been established. The RoCE IB device transfers the traffic over an Ethernet device. This Ethernet device is the RoCE net device. The handshake device and RoCE net device can be the same device.

SMC-D uses two devices: a handshake device and an ISM device. The handshake device is the same as in the case of SMC-R. The ISM device is an Internal Shared Memory (ISM) device that is used to transfer the SMC-D traffic after the SMC connection has been established. ISM devices are only available on the s390 architecture.

You can use the tool pnetctl to show the net, IB, and ISM devices that are available on your host. Also, you can use it to identify the RoCE net device that belongs to a RoCE IB device. For example:

# pnetctl
====================================================================
Pnetid:          Type:           Name:  Port:  Bus:          Bus-ID:
====================================================================
n/a
--------------------------------------------------------------------
                   net            eth0          pci     0000:01:00.0
                   net            eth1          pci     0000:03:00.0
                    ib          mlx5_0      1   pci     0000:03:00.0
                   net              lo          n/a              n/a

This example shows four devices without a pnetid (see below). The IB device mlx5_0 is a RoCE IB device and has the bus type pci and the bus-id 0000:03:00.0. The net device eth1 has the same bus and bus-id. Thus, you can assume that eth1 is the RoCE net device that belongs to the RoCE IB device mlx5_0.

Device Configuration

In order to use SMC, the devices that are used for a SMC connection need to be configured.

Configure the handshake device (SMC-R and SMC-D):

  • make sure the network interface is up
  • if you use VLAN, configure the right VLAN on the network interface
  • Configure your IPv4 and/or IPv6 address(es) on the network interface

Configure the RoCE net device (SMC-R only):

  • make sure the network interface is up
  • if you use VLAN, configure the right VLAN on the network interface
  • configure an IPv6 address on the network interface

After the previous steps, the RoCE IB device should also be ready to use (i.e. active). There are multiple ways to check if the RoCE IB device is active. For example, you can use the tool ibstat from rdma-core, or you can read the respective sysfs attribute with cat /sys/class/infiniband/<ib_device>/ports/<ib_port>/state, or you can simply use the smc.sh script in this folder:

$ ./smc.sh ib state <ib_device>

ISM devices do not need extra configuration.

Examples

Example of the device configuration with handshake device eth0 and RoCE net device eth1, without VLAN, with IP address 192.168.1.23 on the handshake device, and with IPv6 stateless address autoconfiguration:

# # configure handshake device eth0:
# ip link set eth0 up
# ip address add 192.168.1.23/24 dev eth0
# # configure RoCE net device eth1:
# ip link set eth1 up

Example of checking the state of the RoCE IB device mlx5_0 with the smc.sh script:

$ ./smc.sh ib state mlx5_0
State of infiniband device mlx5_0:
Port 1 State: 4: ACTIVE

Pnetid Configuration

SMC uses a so-called Physical Network ID (pnetid) to map a handshake device to a RoCE IB device or to an ISM device. Thus, you need to make sure that your handshake device and your RoCE IB device or your ISM device have the same pnetid. You can use the tool pnetctl to read the currently configured pnetids and to configure the pnetids of your devices.

You can view the available devices and their pnetids by running pnetctl without command line parameters:

# pnetctl
====================================================================
Pnetid:          Type:           Name:  Port:  Bus:          Bus-ID:
====================================================================
n/a
--------------------------------------------------------------------
                   net            eth0          pci     0000:01:00.0
                   net            eth1          pci     0000:03:00.0
                    ib          mlx5_0      1   pci     0000:03:00.0
                   net              lo          n/a              n/a

In this example, no pnetids are currently configured as indicated by all devices listed under the pnetid n/a.

In this example, you can add a pnetid to devices eth1 and mlx5_0 by using the -a, -i and -n command line arguments:

# pnetctl -a test1 -i mlx5_0 -n eth1

After this command, the pnetid configuration should look like this:

# pnetctl
====================================================================
Pnetid:          Type:           Name:  Port:  Bus:          Bus-ID:
====================================================================
TEST1
--------------------------------------------------------------------
                    ib          mlx5_0      1   pci     0000:03:00.0
                   net            eth1          pci     0000:03:00.0
--------------------------------------------------------------------
n/a
--------------------------------------------------------------------
                   net            eth0          pci     0000:01:00.0
                   net              lo          n/a              n/a

The two devices eth1 and mlx5_0 are now listed under pnetid TEST1. Note: pnetids are capitalized when they are stored in the kernel. Also, pnetids that are configured with pnetctl are only temporary and do not persist across reboots.

SMC Socket Programming

See the folder socket for information on SMC socket programming.

CLC Handshake

See the folder handshake for information on SMC's CLC handshake.