This is a tool for managing GPU partitions for NVIDIA Fabric Manager's Shared NVSwitch feature.
Management operations include listing, activating, deactivating partitions, etc.
$ ./fmpm -h
-l : List partitions
-a <Partition ID> : Activate partition
-d <Partition ID> : Deactivate partition
--hostname : hostname or IP address (TCP socket) of Fabric Manager.
If this option is not specified, the default is 127.0.0.1
Requires FM_CMD_BIND_INTERFACE=<IP of Fabric Manager> in Fabric Manager configuration
See details: https://docs.nvidia.com/datacenter/tesla/fabric-manager-user-guide/index.html#the-fabric-manager-api-interface
--unix-domain-socket : UNIX domain socket path for Fabric Manager connection
If this option is specified, UNIX domain socket will be used instead of hostname or IP (TCP socket).
Requires FM_CMD_UNIX_SOCKET_PATH=<UNIX domain socket> in Fabric manager configuration.
See details: https://docs.nvidia.com/datacenter/tesla/fabric-manager-user-guide/index.html#the-fabric-manager-domain-socket-interface
--get-nvlink-failed-devices : Query all NVLink failed devices
--list-unsupported-partitions : Query all unsupported fabric partitions
--set-activated-list <Partition IDs> : Set a list of currently activated fabric partitions.
Comma separated, no space! Used for Fabric Manager Resiliency Mode.
Requires FABRIC_MODE_RESTART=1 for Fabric Manager resiliency restart mode.
More details:
https://docs.nvidia.com/datacenter/tesla/fabric-manager-user-guide/index.html#the-fabric-manager-restart-mode
https://docs.nvidia.com/datacenter/tesla/fabric-manager-user-guide/index.html#resiliency
-v : Show version
To get started, install the NVIDIA Fabric Manager Development package on your build environment.
On RHEL, this package is named "nvidia-fabricmanager-devel-<version>"
On Ubuntu, this package is named "nvidia-fabricmanager-dev-<version>"
Install the JSON CPP development package on your build environment.
On Ubuntu, this package is named "libjsoncpp-dev"
On RHEL, this package is named "jsoncpp-devel".
The EPEL repository must be set up on your system to access this package.