Execution Workflow
Provenance: sn-stack documentation
Normal Operation of the Runtime Environment
Configuring the firmware runtime environment
Optionally, transfer the firmware artifact produced by the build (whose contents are described in this README) to the runtime system that hosts an FPGA card. If that runtime system is the same one you built the firmware artifacts on (the one you ran build.sh on), you can skip this step entirely.
unzip artifacts.esnet-smartnic-fw.package.0.zip
cd sn-stack
# edit the .env file to provide sane values for
# FPGA_PCIE_DEV=0000:d8:00
# COMPOSE_PROFILES=smartnic-mgr-vfio-unlock
# and IFF you have more than one JTAG you also need a line like this
# HW_TARGET_SERIAL=21760204S029A
Verify that the stack configuration is valid
sn-stack$ docker compose config --quiet && echo "All good!"
If this prints anything other than “All good!” then your .env
configuration file has errors. Do not proceed until this step passes.
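Before running the compose check, it can help to confirm that the two settings named above are actually uncommented in the .env file. The following is a minimal sketch; `check_env` is a hypothetical helper name and is not part of sn-stack. It only checks that the settings are present and does not replace the `docker compose config` validation.

```shell
#!/bin/sh
# Sketch: report which required settings are missing from an sn-stack .env file.
# check_env is a hypothetical helper, not something shipped with sn-stack.
check_env() {
    env_file="$1"
    missing=0
    for var in FPGA_PCIE_DEV COMPOSE_PROFILES; do
        # Accept only uncommented "VAR=value" lines with a non-empty value.
        if ! grep -Eq "^[[:space:]]*${var}=.+" "$env_file"; then
            echo "missing or commented out: ${var}"
            missing=1
        fi
    done
    return "$missing"
}

# Usage (from the sn-stack directory):
#   check_env .env && docker compose config --quiet && echo "All good!"
```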
Running the firmware
Start up the full firmware docker stack like this
sn-stack$ docker compose up -d
Verifying the bitfile download
docker compose logs smartnic-hw
TODO: What indicates a good/bad situation when inspecting the log?
Inspecting registers and interacting with the firmware
The firmware runtime environment exists inside of the smartnic-fw
container. Here, we exec a shell inside of that container and have a look around.
docker compose exec smartnic-fw bash
sn-cli dev version
regio syscfg
If this prints wrong values or all ones, make sure you don’t have COMPOSE_PROFILES set to smartnic-mgr-dpdk-manual in your sn-stack/.env file. Otherwise, you will not be able to access any sn-cli tools unless pktgen is running.
TODO: Unclear what “this” refers to above.
Using the sn-cli tool
The sn-cli tool provides subcommands to help you accomplish many common tasks for inspecting and configuring the SmartNIC platform components.
All commands described below are expected to be executed within the smartnic-fw container environment. Use this command to enter the appropriate environment.
docker compose exec smartnic-fw bash
The sn-cli tool will automatically look for an environment variable called SN_CLI_SLOTADDR which can be set to the PCIe BDF address of the device that you would like to interact with. In the smartnic-fw container, this value will already be set for you.
Displaying device information with the “dev” subcommand
This will show information about the device such as the build version, build date/time and temperature.
root@smartnic-fw:/# sn-cli dev version
Device Version Info
DNA: 0x40020000012306a21c10c285
USR_ACCESS: 0x000086d3 (34515)
BUILD_STATUS: 0x04130920
root@smartnic-fw:/# sn-cli dev temp
Temperature Monitors
FPGA SLR0: 45.551 (deg C)
The USR_ACCESS value is typically the unique build pipeline number that produced the embedded FPGA bitfile.
The BUILD_STATUS value holds an encoded date/time (for example, Aug 30 at 05:32am) which is when the embedded FPGA bitfile build was started.
The DNA value holds the factory programmed unique ID of the FPGA.
TODO: How is the timestamp in BUILD_STATUS interpreted?
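One possible reading of BUILD_STATUS is sketched below. It ASSUMES the eight hex digits are read as decimal digits in MMDDhhmm order. That layout is a guess: it would make the "Aug 30 at 05:32am" example correspond to 0x08300532, but this document does not confirm it. `decode_build_status` is a hypothetical helper name.

```shell
# Sketch only: ASSUMES BUILD_STATUS packs the build start time as MMDDhhmm,
# one decimal digit per hex digit. This layout is a guess, not documented here.
decode_build_status() {
    digits="${1#0x}"                                   # drop the 0x prefix
    month=$(printf '%s' "$digits" | cut -c1-2)
    day=$(printf '%s' "$digits" | cut -c3-4)
    hour=$(printf '%s' "$digits" | cut -c5-6)
    minute=$(printf '%s' "$digits" | cut -c7-8)
    echo "month=${month} day=${day} time=${hour}:${minute}"
}

decode_build_status 0x04130920
```

Under this assumption, the value from the sample output above prints `month=04 day=13 time=09:20`.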
Inspecting and Configuring the CMAC (100G) Interfaces with the “cmac” subcommand
Display the current MAC and PHY status of one or more (or all by default) 100G MAC interfaces using these commands:
root@smartnic-fw:/# sn-cli cmac status
CMAC0
Tx (MAC ENABLED/PHY UP)
Rx (MAC ENABLED/PHY UP)
CMAC1
Tx (MAC ENABLED/PHY UP)
Rx (MAC ENABLED/PHY DOWN)
In the example output above, the CMAC1 PHY layer is DOWN in the Rx (receive) direction. Possible causes for this are:
* It has not yet been enabled. This needs to be done manually; see below for how to do this.
* No QSFP28 module plugged into 100G port 1 of the U280 card
* Wrong type of QSFP28 module plugged into 100G port 1
  * 100G QSFP28 SR4 or LR4 modules are supported
  * Some 100G AOC or DACs are known to work
  * QSFP+ 40G modules are not supported
  * QSFP 5G modules are not supported
* QSFP28 module improperly seated in the U280 card
  * Check if the QSFP28 module is inserted upside down and physically blocked from being fully inserted
  * Unplug/replug the module, ensuring that it is properly oriented and firmly seated
* Fiber not properly inserted
  * Unplug/replug the fiber connection at each end
* Far end is operating in 4x25G or 2x50G split mode
  * The SmartNIC platform does not support 4x25G or 2x50G mode
  * Only 100G mode is supported on each of the U280 100G interfaces
  * Configure far end in 100G mode
* Far end has RS-FEC (Reed-Solomon Forward Error Correction) enabled
  * The SmartNIC platform does not support RS-FEC
  * Disable RS-FEC on the far end equipment
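To spot a port in this state from a script rather than by eye, the status text can be filtered. The sketch below assumes the output layout shown in this document (a CMAC line followed by Tx and Rx lines); `cmac_phy_down` is a hypothetical helper name and may need adjusting for other sn-cli versions.

```shell
# Sketch: list CMAC ports whose Tx or Rx PHY is reported DOWN.
# Reads "sn-cli cmac status" text on stdin; assumes the layout shown above.
cmac_phy_down() {
    awk '
        /^[[:space:]]*CMAC[0-9]+/ { port = $1 }    # remember the current port
        /PHY DOWN/ { print port, $1 }              # $1 is the direction (Tx or Rx)
    '
}

# Usage inside the smartnic-fw container:
#   sn-cli cmac status | cmac_phy_down
```

An empty result means no direction on any port was reported as PHY DOWN.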
A more detailed status can also be displayed using the --verbose option. Note that the --verbose option is a global option and thus must be positioned before the cmac subcommand.
root@smartnic-fw:/# sn-cli --verbose cmac -p 1 status
CMAC1
Tx (MAC ENABLED/PHY UP)
tx_local_fault 0
Rx (MAC ENABLED/PHY DOWN)
rx_got_signal_os 0
rx_bad_sfd 0
rx_bad_preamble 0
rx_test_pattern_mismatch 0
rx_received_local_fault 0
rx_internal_local_fault 1
rx_local_fault 1
rx_remote_fault 0
rx_hi_ber 0
rx_aligned_err 0
rx_misaligned 0
rx_aligned 0
rx_status 0
TODO: How to interpret the above output?
Display summary statistics for packets Rx’d and Tx’d from CMAC ports
root@smartnic-fw:/# sn-cli cmac stats
CMAC0: TX 0 RX 0 RX-DISC 0 RX-ERR 0
CMAC1: TX 0 RX 0 RX-DISC 0 RX-ERR 0
TODO: How to interpret the above output?
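When polling these counters from a script, one simple check is to flag any port whose RX-DISC or RX-ERR counter is nonzero. The sketch below assumes the one-line-per-port layout shown above and treats the counter names only as labels; the exact meaning of each counter is not defined in this document. `cmac_rx_problems` is a hypothetical helper name.

```shell
# Sketch: print ports with a nonzero RX-DISC or RX-ERR counter.
# Reads "sn-cli cmac stats" text on stdin; assumes "LABEL value" pairs per line.
cmac_rx_problems() {
    awk '
        /^[[:space:]]*CMAC[0-9]+:/ {
            port = $1
            sub(/:$/, "", port)                    # strip the trailing colon
            for (i = 2; i < NF; i += 2) {          # walk the label/value pairs
                if (($i == "RX-DISC" || $i == "RX-ERR") && $(i + 1) + 0 > 0)
                    print port, $i, $(i + 1)
            }
        }
    '
}

# Usage inside the smartnic-fw container:
#   sn-cli cmac stats | cmac_rx_problems
```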
Enable/Disable one or more (or all by default) 100G MAC interfaces using these commands:
root@smartnic-fw:/# sn-cli cmac enable
Enabled CMAC0
Enabled CMAC1
Other forms of the cmac subcommand include:
sn-cli cmac disable
sn-cli cmac -p 0 enable
sn-cli cmac -p 1 disable
Inspecting and Configuring the PCIe Queue DMA (QDMA) block with the “qdma” subcommand
The QDMA block is responsible for managing all DMA queues used for transferring packets and/or events bidirectionally between the U280 card and the Host CPU over the PCIe bus. In order for any DMA transfers to be allowed on either of the PCIe Physical Functions (PF), an appropriate number of DMA Queue IDs must be provisioned. This can be done using the qdma subcommand.
Configure the number of queues allocated to each of the PCIe Physical Functions
sn-cli qdma setqs 1 1
The setqs subcommand also takes care of configuring the RSS entropy -> QID map with an equal weighted distribution of all allocated queues. If you’re unsure of how many QIDs to allocate, using 1 1 here is your best choice.
Inspect the configuration of the QDMA block
sn-cli qdma status
Packet, byte and error counters are tracked for packets heading between the QDMA engine and the user application. You can display them with this command:
sn-cli qdma stats
Refer to the open-nic-shell documentation for an explanation of exactly where in the FPGA design these statistics are measured.
Inspecting packet counters in the SmartNIC platform with the “probe” subcommand
The SmartNIC platform implements monitoring points in the datapath at various locations. You can inspect these counters using this command:
sn-cli probe stats
Refer to the esnet-smartnic-hw documentation for an explanation of exactly where in the FPGA design these statistics are measured.
Configuring the SmartNIC platform ingress/egress/bypass switch port remapping functions with the “sw” subcommand
The SmartNIC platform implements reconfigurable ingress and egress port remapping, connection and redirection. You can inspect and modify these configuration points using the “sw” subcommand.
Most of the sw subcommands take one or more port bindings as parameters. The port bindings are of the form:
<port>:<port-connector>
* <port> is one of
  * cmac0 – 100G port 0
  * cmac1 – 100G port 1
  * host0 – DMA over PCIe Physical Function 0 (PF0)
  * host1 – DMA over PCIe Physical Function 1 (PF1)
* <port-connector> is context dependent and is one of
  * cmac0
  * cmac1
  * host0
  * host1
  * bypass – a high bandwidth channel through the SmartNIC which does NOT pass through the user’s application
  * app0 – user application port 0 (typically a P4 program ingress)
  * app1 – user application port 1 (only available when user implements it in verilog)
  * drop – infinite blackhole that discards all packets sent to it
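A script that assembles bindings can check them against the names listed above before calling sn-cli. The following is a minimal sketch; `valid_binding` is a hypothetical helper, not part of sn-cli. Because the connector is context dependent, this accepts the full set of connector names and does not know which ones a particular sw subcommand allows.

```shell
# Sketch: check that a binding has the form <port>:<port-connector> using the
# names listed above. valid_binding is a hypothetical helper, not part of sn-cli.
valid_binding() {
    case "$1" in *:*) ;; *) return 1 ;; esac       # must contain a colon
    port="${1%%:*}"                                # text before the first colon
    connector="${1#*:}"                            # text after the first colon
    case "$port" in cmac0|cmac1|host0|host1) ;; *) return 1 ;; esac
    case "$connector" in
        cmac0|cmac1|host0|host1|bypass|app0|app1|drop) return 0 ;;
        *) return 1 ;;
    esac
}

# Usage:
#   valid_binding host0:bypass && sn-cli sw in-port-connect host0:bypass
```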
Display the current configuration status
sn-cli sw status
Remap/rename physical input ports to logical input ports
The in-port-rename subcommand allows you to remap the identity of a SmartNIC platform physical ingress port to any logical port as seen by the user logic. Once remapped (e.g. from a->b), all following logic in the SmartNIC will perceive that the packet arrived on ingress port b even though it physically arrived on port a. This can be useful for test injection scenarios but would typically be set to a straight-through mapping in production.
sn-cli sw in-port-rename a:b
To reset this mapping so each port maps to its usual identity:
sn-cli sw in-port-rename cmac0:cmac0 cmac1:cmac1 host0:host0 host1:host1
Attach logical input ports to pipelines
The in-port-connect subcommand allows you to connect a logical input port to different processing pipelines within the SmartNIC. This can be used to connect to a P4 program or to custom logic within the user application. It can also be used to shunt all packets to a blackhole or to bypass packets around the user application entirely.
sn-cli sw in-port-connect cmac0:app0 cmac1:app0 host0:bypass host1:bypass
Connect input ports to output ports in the bypass path
The bypass-connect subcommand allows you to connect input ports directly to output ports as they pass through the bypass path (i.e. not through the user application). This is useful for providing direct connectivity from host PCIe PFs to 100G CMAC interfaces for network testing.
sn-cli sw bypass-connect host0:cmac0 host1:cmac1 cmac0:host0 cmac1:host1
NOTE: any packets that follow the bypass path will not be processed by the user’s P4 program.
Override user application output port decisions and redirect to an alternate port
The app0-port-redirect and app1-port-redirect subcommands allow the user to override the forwarding decisions made by the user application and/or P4 program and redirect any given output port to a different output port. This can be useful during development/debugging and in test fixtures.
NOTE: there are separate overrides for the app0 outputs and the app1 outputs.
sn-cli sw app0-port-redirect cmac0:host0 cmac1:host1
sn-cli sw app1-port-redirect cmac0:host0 cmac1:host1
To reset this mapping so each output port maps to its usual destination:
sn-cli sw app0-port-redirect cmac0:cmac0 cmac1:cmac1 host0:host0 host1:host1
sn-cli sw app1-port-redirect cmac0:cmac0 cmac1:cmac1 host0:host0 host1:host1
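The straight-through binding list appears in several of the reset commands above, so a script may prefer to generate it once instead of retyping it. This is a minimal sketch; `identity_bindings` is a hypothetical helper name.

```shell
# Sketch: build the straight-through (port maps to itself) binding list.
identity_bindings() {
    bindings=""
    for port in cmac0 cmac1 host0 host1; do
        # Add a separating space only after the first entry.
        bindings="${bindings}${bindings:+ }${port}:${port}"
    done
    echo "$bindings"
}

# Usage inside the smartnic-fw container (word splitting is intentional):
#   sn-cli sw in-port-rename $(identity_bindings)
#   sn-cli sw app0-port-redirect $(identity_bindings)
#   sn-cli sw app1-port-redirect $(identity_bindings)
```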
Using the sn-p4-cli tool
The user’s P4 application embedded within the SmartNIC design may have configurable lookup tables which are used during the wire-speed execution of the packet processing pipeline. The sn-p4-cli tool provides subcommands to help you to manage the rules in all of the lookup tables defined in your P4 program.
All commands described below are expected to be executed within the smartnic-fw container environment. Use this command to enter the appropriate environment.
docker compose exec smartnic-fw bash
The sn-p4-cli tool will automatically look for an environment variable called SN_P4_CLI_SERVER which can be set to the hostname of the sn-p4-agent that will perform all of the requested actions on the real hardware. In the smartnic-fw container, this value will already be set for you.
Inspecting the pipeline structure with the “info” subcommand
The info subcommand is used to display the pipeline structure, including table names, match fields (and their types), action names and the list of parameters for each action. This information can be used to formulate new rule definitions for the other subcommands.
sn-p4-cli info
Inserting a new rule into a table
The table-insert subcommand allows you to insert a new rule into a specified table.
sn-p4-cli table-insert <table-name> <action-name> --match <match-expr> [--param <param-expr>] [--priority <prio-val>]
* <table-name> is the name of the table to be operated on
* <action-name> is the action that you would like to activate when this rule matches
* <match-expr> is one or more match expressions which collectively define when this rule should match a given packet
  * The number and type of the match fields depend on the P4 definition of the table
  * The --match option may be specified multiple times and all match-exprs will be concatenated
* <param-expr> is one or more parameter values which will be returned as a result when this rule matches a given packet
  * The number and type of the action parameters depend on the P4 definition of the action within the table
  * Some actions require zero parameters. In this case, omit the optional --param option entirely.
* <prio-val> is the priority to be used to resolve scenarios where multiple matches could occur
  * The --priority option is required for tables with CAM/TCAM type matches (prefix/range/ternary)
  * The --priority option is prohibited for tables without CAM/TCAM type matches
NOTE: You can find details about your pipeline structure and valid names by running the info subcommand.
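Since the --match option can be repeated, a script that generates rules may want to assemble the command line from a list of match expressions. The sketch below only prints the command so it can be reviewed first. `p4_insert_cmd` is a hypothetical helper, and the table, action and expression values in the usage line are placeholders rather than real names; take the real ones from the info subcommand.

```shell
# Sketch: print an sn-p4-cli table-insert command with one --match option per
# match expression. p4_insert_cmd is a hypothetical helper name.
# Usage: p4_insert_cmd <table-name> <action-name> <match-expr>...
p4_insert_cmd() {
    table="$1"
    action="$2"
    shift 2
    cmd="sn-p4-cli table-insert ${table} ${action}"
    for expr in "$@"; do
        cmd="${cmd} --match ${expr}"               # repeat --match for each expression
    done
    echo "$cmd"
}

# Placeholder values only; these are not names from any real pipeline.
p4_insert_cmd my_table my_action EXPR_A EXPR_B
```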
Updating an existing rule within a table
The table-update subcommand allows you to update the action and parameters for an existing rule within a table.
sn-p4-cli table-update <table-name> <new-action-name> --match <match-expr> [--param <new-param-expr>]
* <table-name> is the table containing the rule to be updated
* <new-action-name> is the new action that should be applied when this rule matches
* <match-expr> is the exact original <match-expr> used when the original rule was inserted
* <new-param-expr> is the set of new parameters to be returned when this rule matches
  * NOTE: the new parameters must be consistent with the new action
Removing previously inserted rules
The clear-all, table-clear and table-delete subcommands allow you to remove rules from tables with varying precision.
Clear all rules from all tables in the pipeline.
sn-p4-cli clear-all
Clear all rules from a single specified table.
sn-p4-cli table-clear <table-name>
Remove a specific rule from a specific table.
sn-p4-cli table-delete <table-name> --match <match-expr>
Bulk changes of rules using a p4bm simulator rules file
Using the p4bm-apply subcommand, a list of pipeline modifications can be applied from a file. A subset of the full p4bm simulator file format is supported by the sn-p4-cli command.
sn-p4-cli p4bm-apply <filename>
Supported actions within the p4bm file are:
* table_insert <table-name> <action-name> <match-expr> => <param-expr> [priority]
  * Insert a rule
* clear_all
  * Clear all rules from all tables
* table_clear <table-name>
  * Clear all rules from a specified table
All comment characters # and text following them up to the end of the line are ignored.
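Because only a subset of the p4bm format is supported, it can be useful to scan a rules file for other actions before applying it. The sketch below checks only the first word of each line against the three actions listed above; it does not validate table names, match expressions or parameters. `p4bm_unsupported` is a hypothetical helper name.

```shell
# Sketch: report lines in a p4bm rules file whose action is not one of the
# three listed above. Comments (# to end of line) and blank lines are skipped.
p4bm_unsupported() {
    awk '
        { sub(/#.*/, "") }                         # strip comments
        NF == 0 { next }                           # skip blank lines
        $1 != "table_insert" && $1 != "clear_all" && $1 != "table_clear" {
            print FILENAME ":" FNR ": unsupported action " $1
            bad = 1
        }
        END { exit bad + 0 }                       # nonzero if anything was flagged
    ' "$1"
}

# Usage:
#   p4bm_unsupported rules.p4bm && sn-p4-cli p4bm-apply rules.p4bm
```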
Stopping the runtime environment
When we’re finished using the SmartNIC runtime environment, we can stop and remove our Docker containers.
docker compose down -v
TODO: Clarify whether different commands are invoked inside or outside the container.
Using the smartnic-dpdk container
The sn-stack environment can be started in a mode where the FPGA can be controlled by a DPDK application. Running in this mode requires a few carefully ordered steps.
Broadly speaking, the steps required to bring up a DPDK application are as follows:
* Bind the vfio-pci kernel driver to each FPGA PCIe physical function (PF)
  * This is handled automatically by the sn-stack.
* Run a DPDK application with appropriate DPDK Environment Abstraction Layer (EAL) settings
  * Use -a $SN_PCIE_DEV.0 to allow control of one or more specific FPGA PCIe PFs
  * Use -d librte_net_qdma.so to dynamically link the correct Userspace Polled-Mode Driver (PMD) for the SmartNIC QDMA engine
  * The EAL will
    * Open the PCIe PFs using the kernel’s vfio-pci driver
    * Take the FPGA device out of reset
    * Open and map large memory regions for DMA using the kernel’s hugepages driver
  * The application is responsible for assigning buffers to one or more of the FPGA’s DMA queues
* Use the sn-cli tool to configure some of the low-level hardware components in the FPGA
  * Configure the set of valid DMA queues in the FPGA (must match what is set in the DPDK application)
  * Bring up the physical ethernet ports
In the examples below, we will be running the pktgen-dpdk application to control packet tx/rx via the FPGA’s PCIe physical functions. This can be very useful for injecting packets into a design for testing behaviour on real hardware.
For more information about DPDK in general, see:
* http://core.dpdk.org/doc/
For more information about the pktgen-dpdk application, see:
* https://pktgen-dpdk.readthedocs.io/en/latest/index.html
Before you bring up the sn-stack, please ensure that you have uncommented this line in your .env file:
COMPOSE_PROFILES=smartnic-dpdk
If you changed this while the stack was already running, you’ll need to restart the stack with down/up.
First, you’ll need to start up the pktgen application to open the vfio-pci device for PF0 and PF1 and take the FPGA out of reset.
$ docker compose exec smartnic-dpdk bash
root@smartnic-dpdk:/# pktgen -a $SN_PCIE_DEV.0 -a $SN_PCIE_DEV.1 -l 4-8 -n 4 -d librte_net_qdma.so --file-prefix $SN_PCIE_DEV- -- -v -m [5:6].0 -m [7:8].1
Pktgen:/> help
Open a separate shell window which you will use for doing the low-level SmartNIC platform configuration.
Configure the Queue mappings for host PF0 and PF1 interfaces and bring up the physical ethernet ports using the smartnic-fw container.
$ docker compose exec smartnic-fw bash
root@smartnic-fw:/# sn-cli qdma setqs 1 1
root@smartnic-fw:/# sn-cli qdma status
root@smartnic-fw:/# sn-cli cmac enable
root@smartnic-fw:/# sn-cli cmac status
In the cmac status output, check that each port you intend to use reports MAC ENABLED/PHY UP.
Advanced usage of the pktgen-dpdk application
Example of streaming packets out of an interface from a pcap file rather than generating the packets within the UI.
Note the -s <P>:file.pcap option where P refers to the port number to bind the pcap file to.
root@smartnic-dpdk:/# pktgen -a $SN_PCIE_DEV.0 -a $SN_PCIE_DEV.1 -l 4-8 -n 4 -d librte_net_qdma.so --file-prefix $SN_PCIE_DEV- -- -v -m [5:6].0 -m [7:8].1 -s 1:your_custom.pcap
Pktgen:/> port 1
Pktgen:/> page pcap
Pktgen:/> page main
Pktgen:/> start 1
Pktgen:/> stop 1
Pktgen:/> clr
Example of running a particular test case via a script rather than typing at the UI
cat <<_EOF > /tmp/test.pkt
clr
set 1 size 1400
set 1 count 1000000
enable 0 capture
start 1
disable 0 capture
_EOF
root@smartnic-dpdk:/# pktgen -a $SN_PCIE_DEV.0 -a $SN_PCIE_DEV.1 -l 4-8 -n 4 -d librte_net_qdma.so --file-prefix $SN_PCIE_DEV- -- -v -m [5:6].0 -m [7:8].1 -f /tmp/test.pkt
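The pktgen invocations in this section share the same EAL and port-mapping arguments and differ only in their trailing options, so a small wrapper can reduce the chance of mistyping one of them. The following is a minimal sketch that only prints the command line. `pktgen_cmd` is a hypothetical helper name; the core list, memory channel count and port mappings are copied from the examples above and may need changing for your host.

```shell
# Sketch: print the pktgen command line used in this section for PF0 and PF1.
# Usage: pktgen_cmd <pcie-dev> [extra pktgen options...]
pktgen_cmd() {
    dev="$1"
    shift
    printf 'pktgen -a %s.0 -a %s.1 -l 4-8 -n 4 -d librte_net_qdma.so --file-prefix %s- -- -v -m [5:6].0 -m [7:8].1' "$dev" "$dev" "$dev"
    for extra in "$@"; do
        printf ' %s' "$extra"                      # append caller-supplied options
    done
    printf '\n'
}

# Usage inside the smartnic-dpdk container:
#   pktgen_cmd "$SN_PCIE_DEV" -f /tmp/test.pkt
#   pktgen_cmd "$SN_PCIE_DEV" -s 1:your_custom.pcap
```

Review the printed line, then paste it into the container shell to run it.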
Troubleshooting the pktgen-dpdk Application
If pktgen isn’t starting, then you could try these troubleshooting steps:
- Ensure you are using the correct profile in your sn-stack/.env file and that you are starting pktgen with the right command. For a more detailed understanding of the command, please refer to the pktgen documentation provided earlier.
- If pktgen is starting, but packets aren’t flowing as expected, you can check the packet path using the following command inside the smartnic-fw container:
sn-cli probe stats
- If packets sent to/from the host aren’t achieving line rate (100Gbps per port), it could be due to QDMA queue allocation. You can attempt to allocate more QDMA queues per port by setting sn-cli qdma setqs to values higher than 1 1.
- If packets are egressing to the wrong port (whether CMAC or PF), it might be due to the sn-cli configuration. For example, here’s a script that routes all egress packets to CMAC1:
#!/bin/bash
sn-cli dev version
sn-cli sw in-port-rename cmac0:cmac0 cmac1:cmac1 host0:host0 host1:host1
sn-cli sw app0-port-redirect cmac0:cmac0 cmac1:cmac1 host0:host0 host1:host1
sn-cli sw app1-port-redirect cmac0:cmac0 cmac1:cmac1 host0:host0 host1:host1
sn-cli sw bypass-connect cmac0:cmac0 cmac1:cmac1 host0:host0 host1:host1
sn-cli sw in-port-connect cmac0:app0 cmac1:app0 host0:app0 host1:app0
sn-cli sw status
sn-cli qdma setqs 1 1
sn-cli cmac enable
sn-cli cmac status